TOP ACTIVATIONS
MAX = 0.086

wet ! She closed her drawer again , more ignorant than
She carefully applied the yellow makeup to her nose and smiled
She said that the black board was a place where teachers
because he thought his black board was ruined .
Mom my explained that the box was like a truck .
saw . He thought the journal was silly . He ripped
and down . Inside the drawer was a secret book .
of paper out of his drawer . He asked his little
his work . The black board was perfect . Now he
. L ise thought the laundry was too heavy . She
school . He opened his drawer and took out his blue
mom looked in Benny 's drawer and found the toy car
amazed at how flexible the file was . She asked her
His friend had written the notebook . He
Spot knew what the red box was and they walked by
parts . They think the box is a lie . They
wanted to know what the box was for . She asked
He opened the tool box and found all the
no , because the ash tr ay was not a toy
He found out his special drawer was even more special than

MIN ACTIVATIONS
MIN = -0.126

then her friend started to mis be have . She was
Lucy noticed the weather was uncom fort ably hot . She
it fits !" She helped Sil via put on the coat
a little bird ie named Twe ety . Twe ety was
who was trying to take Twe ety 's nest . Twe
a little bird ie named Twe ety . Twe ety lived
a little bird ie named Twe ety . Twe ety loved
a film camera and asked Twe ety if she could take
to never try to catch Twe ety again . From that
a little bird ie named Twe ety . Twe ety loved
and created a shelter for Twe ety . Twe ety felt
dare to climb up to Twe ety 's nest .
The cat wanted to eat Twe ety ! Twe
friend , a bird named Twe ety , got stuck in
seeds and gave them to Twe ety . Twe
a little bird ie named Twe ety . Twe ety lived
bird and they named it Twe ety . They fed Twe
there was a bird named Twe ety . Twe ety lived
a little bird ie like Twe ety . So , Twe
, a little bird named Twe ety , saw him struggling

INTERVAL 0.033 - 0.086
CONTAINS 6.736%

was so tiny and it looked like it was going to
know the big bird . She did not want to leave
and queens in their palace . And they were very happy
Once upon a time , there was a little boy named
!" Tim my replied , " No , it

INTERVAL -0.020 - 0.033
CONTAINS 72.672%

, but it felt nice . He felt happy and proud
was a dark color . Nobody knew where it came from
pot of gold . <|endoftext|> Once upon a time , there
. Now , go and play with your ship . But
He smiled and went inside to have dinner . <|endoftext|> Once

INTERVAL -0.073 - -0.020
CONTAINS 20.574%

she skipped around . But soon , the sock
's press it !" Suddenly , the ground
the biggest squash she had ever seen ! She picked it
time , there was a little rabbit named Bun . Bun
- old daughter , Mill ie . Mill

INTERVAL -0.126 - -0.073
CONTAINS 0.018%

Mom my said " Let ’ s go !"
that ?" " Mos quit oes !" Tom said
, but he just couldn ’ t work it
! <|endoftext|> One day , Sil via was getting ready to
. When Twe ety arrived at the o
Token wet
Token ID9583
Feature activation-1.05e-02

Pos logit contributions
ck+0.020
 breath+0.019
ited+0.016
 tug+0.015
away+0.015
Neg logit contributions
 She-0.015
 He-0.014
 Did-0.014
tr-0.014
 There-0.014
Token!
Token ID0
Feature activation+6.96e-03

Pos logit contributions
 breath+0.017
ck+0.015
away+0.013
ited+0.013
 tug+0.012
Neg logit contributions
 Did-0.019
tr-0.019
 There-0.018
 Now-0.018
 Where-0.018
Token She
Token ID1375
Feature activation+3.05e-02

Pos logit contributions
tr+0.006
 has+0.006
robe+0.005
bridge+0.005
 shines+0.005
Neg logit contributions
 breath-0.016
ck-0.016
 tug-0.014
ited-0.014
 agree-0.014
Token closed
Token ID4838
Feature activation+4.01e-03

Pos logit contributions
 She+0.042
 He+0.040
tr+0.039
 has+0.039
 There+0.038
Neg logit contributions
ck-0.072
 breath-0.066
ited-0.058
away-0.054
 tug-0.054
Token her
Token ID607
Feature activation+5.02e-02

Pos logit contributions
 She+0.006
 He+0.006
 There+0.006
 has+0.006
tr+0.006
Neg logit contributions
ck-0.009
 breath-0.008
ited-0.007
 tug-0.006
away-0.006
Token drawer
Token ID33451
Feature activation+8.41e-02

Pos logit contributions
 She+0.068
 He+0.065
tr+0.063
 Why+0.063
 Did+0.063
Neg logit contributions
ck-0.117
 breath-0.112
ited-0.094
 tug-0.093
away-0.089
Token again
Token ID757
Feature activation-1.11e-02

Pos logit contributions
 Why+0.114
 How+0.114
 She+0.113
tr+0.112
 Did+0.112
Neg logit contributions
ck-0.186
 breath-0.181
ited-0.153
 tug-0.146
away-0.144
Token,
Token ID11
Feature activation+8.95e-03

Pos logit contributions
ck+0.028
 breath+0.026
ited+0.023
 tug+0.022
away+0.022
Neg logit contributions
 She-0.013
tr-0.013
 He-0.013
 Why-0.012
 There-0.012
Token more
Token ID517
Feature activation-5.30e-03

Pos logit contributions
tr+0.013
robe+0.013
 There+0.013
 He+0.013
 Why+0.012
Neg logit contributions
ck-0.020
 breath-0.019
 tug-0.016
ited-0.015
 swat-0.015
Token ignorant
Token ID20109
Feature activation-1.18e-02

Pos logit contributions
ck+0.012
 breath+0.011
ited+0.009
 tug+0.009
away+0.008
Neg logit contributions
 She-0.008
tr-0.008
 He-0.008
 There-0.008
 Why-0.008
Token than
Token ID621
Feature activation-3.42e-02

Pos logit contributions
ck+0.024
 breath+0.024
 tug+0.019
 fear+0.019
ited+0.019
Neg logit contributions
tr-0.018
 Did-0.017
 She-0.017
 Why-0.016
 How-0.016
Token She
Token ID1375
Feature activation+2.85e-02

Pos logit contributions
tr+0.016
 has+0.014
robe+0.014
 living+0.014
 sky+0.013
Neg logit contributions
ck-0.036
 breath-0.034
ited-0.031
 tug-0.031
 agree-0.030
Token carefully
Token ID7773
Feature activation+1.12e-02

Pos logit contributions
 She+0.042
tr+0.040
 He+0.040
 There+0.039
 Why+0.039
Neg logit contributions
ck-0.064
 breath-0.058
ited-0.052
 tug-0.049
away-0.047
Token applied
Token ID5625
Feature activation+2.85e-02

Pos logit contributions
 She+0.018
 He+0.017
tr+0.017
 There+0.017
 has+0.017
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.019
 tug-0.017
away-0.017
Token the
Token ID262
Feature activation+3.07e-02

Pos logit contributions
tr+0.049
 There+0.047
 Did+0.047
 She+0.046
 Why+0.046
Neg logit contributions
ck-0.055
 breath-0.054
ited-0.042
 tug-0.041
away-0.039
Token yellow
Token ID7872
Feature activation+4.24e-02

Pos logit contributions
 She+0.048
 He+0.045
tr+0.045
 There+0.044
 has+0.044
Neg logit contributions
ck-0.066
 breath-0.060
ited-0.053
 tug-0.049
away-0.048
Token makeup
Token ID16029
Feature activation+8.37e-02

Pos logit contributions
 She+0.066
 There+0.064
 Why+0.064
 He+0.063
 Did+0.062
Neg logit contributions
ck-0.087
 breath-0.085
ited-0.069
 tug-0.068
away-0.065
Token to
Token ID284
Feature activation+8.17e-03

Pos logit contributions
 Did+0.130
 Why+0.130
 There+0.128
 Where+0.127
 He+0.124
Neg logit contributions
 breath-0.160
ck-0.155
ited-0.130
away-0.129
 tug-0.128
Token her
Token ID607
Feature activation+2.00e-02

Pos logit contributions
 She+0.014
tr+0.013
 He+0.013
 There+0.013
 has+0.013
Neg logit contributions
ck-0.016
 breath-0.015
ited-0.013
 tug-0.012
away-0.012
Token nose
Token ID9686
Feature activation+2.22e-02

Pos logit contributions
 She+0.032
 He+0.031
 There+0.030
tr+0.030
 Why+0.030
Neg logit contributions
ck-0.042
 breath-0.039
ited-0.033
 tug-0.031
away-0.030
Token and
Token ID290
Feature activation-2.43e-02

Pos logit contributions
 There+0.032
 Why+0.032
 Did+0.032
 How+0.031
 Where+0.031
Neg logit contributions
ck-0.047
 breath-0.044
ited-0.037
 tug-0.035
away-0.035
Token smiled
Token ID13541
Feature activation+1.69e-03

Pos logit contributions
ck+0.050
 breath+0.046
 tug+0.038
ited+0.036
 swat+0.035
Neg logit contributions
 She-0.040
 There-0.040
 Why-0.040
tr-0.039
 He-0.039
Token She
Token ID1375
Feature activation+3.59e-02

Pos logit contributions
tr+0.056
robe+0.050
fl+0.049
 shines+0.047
bridge+0.047
Neg logit contributions
 breath-0.050
ck-0.048
 tug-0.047
ited-0.045
 response-0.044
Token said
Token ID531
Feature activation+6.97e-02

Pos logit contributions
 She+0.061
 There+0.060
robe+0.060
tr+0.059
 He+0.059
Neg logit contributions
ck-0.071
 breath-0.061
 tug-0.054
ited-0.053
 swat-0.047
Token that
Token ID326
Feature activation+5.18e-02

Pos logit contributions
tr+0.102
robe+0.095
 sky+0.090
 He+0.089
 She+0.089
Neg logit contributions
ck-0.152
 breath-0.138
 tug-0.123
ited-0.116
away-0.115
Token the
Token ID262
Feature activation+4.05e-02

Pos logit contributions
tr+0.087
robe+0.080
 He+0.080
 She+0.080
 There+0.080
Neg logit contributions
ck-0.095
 breath-0.084
 tug-0.074
ited-0.069
 common-0.069
Token black
Token ID2042
Feature activation+5.90e-02

Pos logit contributions
 She+0.064
tr+0.061
 He+0.061
 There+0.059
 Did+0.059
Neg logit contributions
ck-0.082
 breath-0.078
ited-0.067
 tug-0.064
 response-0.061
Tokenboard
Token ID3526
Feature activation+8.32e-02

Pos logit contributions
 She+0.061
 There+0.055
 Why+0.054
 He+0.054
 How+0.053
Neg logit contributions
 breath-0.139
ck-0.139
 tug-0.122
ited-0.120
away-0.114
Token was
Token ID373
Feature activation+1.72e-02

Pos logit contributions
 She+0.092
 Why+0.088
 Did+0.088
 Where+0.087
 How+0.087
Neg logit contributions
ck-0.193
 breath-0.178
 tug-0.160
ited-0.158
 promise-0.150
Token a
Token ID257
Feature activation+1.91e-02

Pos logit contributions
 She+0.026
tr+0.025
 He+0.025
 There+0.025
 Did+0.025
Neg logit contributions
ck-0.038
 breath-0.034
ited-0.029
 tug-0.028
away-0.027
Token place
Token ID1295
Feature activation+1.45e-02

Pos logit contributions
 She+0.034
 He+0.032
 There+0.032
 Did+0.032
tr+0.032
Neg logit contributions
ck-0.035
 breath-0.033
ited-0.028
 tug-0.027
away-0.026
Token where
Token ID810
Feature activation+1.98e-02

Pos logit contributions
 She+0.019
 He+0.019
 There+0.019
tr+0.018
 Did+0.018
Neg logit contributions
ck-0.033
 breath-0.031
ited-0.026
away-0.026
 tug-0.025
Token teachers
Token ID7799
Feature activation+3.05e-03

Pos logit contributions
tr+0.031
 She+0.030
 He+0.030
 Why+0.029
 Did+0.029
Neg logit contributions
ck-0.041
 breath-0.038
ited-0.031
 tug-0.030
away-0.030
Token because
Token ID780
Feature activation+5.22e-03

Pos logit contributions
 breath+0.025
ck+0.025
 tug+0.020
 fear+0.020
away+0.019
Neg logit contributions
tr-0.031
robe-0.029
bridge-0.029
 Did-0.029
ase-0.028
Token he
Token ID339
Feature activation+2.98e-02

Pos logit contributions
tr+0.005
 She+0.005
 There+0.005
 has+0.005
 He+0.005
Neg logit contributions
ck-0.014
 breath-0.013
ited-0.012
 tug-0.011
away-0.011
Token thought
Token ID1807
Feature activation+4.71e-02

Pos logit contributions
 She+0.040
 He+0.038
tr+0.038
 has+0.037
 Why+0.036
Neg logit contributions
ck-0.071
 breath-0.066
ited-0.058
away-0.054
 tug-0.053
Token his
Token ID465
Feature activation+4.18e-02

Pos logit contributions
tr+0.067
 She+0.065
 Why+0.061
 How+0.061
 Did+0.060
Neg logit contributions
ck-0.105
 breath-0.098
 tug-0.082
ited-0.078
away-0.077
Token black
Token ID2042
Feature activation+5.23e-02

Pos logit contributions
tr+0.058
 She+0.057
 has+0.055
 He+0.055
robe+0.054
Neg logit contributions
ck-0.101
 breath-0.086
ited-0.084
away-0.076
 contain-0.073
Tokenboard
Token ID3526
Feature activation+8.29e-02

Pos logit contributions
 She+0.075
 There+0.072
 Why+0.070
 He+0.069
 has+0.069
Neg logit contributions
ck-0.112
 breath-0.108
ited-0.092
 tug-0.089
away-0.085
Token was
Token ID373
Feature activation+2.28e-02

Pos logit contributions
 There+0.089
 She+0.089
 Did+0.087
 Why+0.086
 How+0.083
Neg logit contributions
ck-0.200
 breath-0.193
ited-0.173
 tug-0.168
 fear-0.157
Token ruined
Token ID20484
Feature activation+1.53e-02

Pos logit contributions
 She+0.039
tr+0.038
 He+0.037
 There+0.037
 Did+0.037
Neg logit contributions
ck-0.046
 breath-0.042
ited-0.035
 tug-0.033
away-0.032
Token.
Token ID13
Feature activation+4.61e-03

Pos logit contributions
tr+0.026
 There+0.026
 Did+0.025
 She+0.025
 has+0.024
Neg logit contributions
ck-0.029
 breath-0.027
away-0.022
 tug-0.022
ited-0.021
Token
Token ID220
Feature activation+1.53e-02

Pos logit contributions
robe+0.006
tr+0.006
bows+0.005
 has+0.005
 shines+0.005
Neg logit contributions
ck-0.010
 breath-0.009
 tug-0.008
 panic-0.008
ited-0.008
Token
Token ID198
Feature activation-2.60e-02

Pos logit contributions
tr+0.016
 has+0.016
 sky+0.015
robe+0.015
 She+0.014
Neg logit contributions
ck-0.037
 breath-0.033
 tug-0.029
rig-0.029
 agree-0.028
TokenMom
Token ID29252
Feature activation-1.59e-03

Pos logit contributions
ck+0.047
 breath+0.044
ited+0.038
away+0.034
 swat+0.034
Neg logit contributions
 She-0.037
 He-0.035
tr-0.034
 Why-0.033
 There-0.033
Tokenmy
Token ID1820
Feature activation+4.74e-03

Pos logit contributions
ck+0.004
 breath+0.003
ited+0.003
 tug+0.003
 response+0.003
Neg logit contributions
 She-0.002
 There-0.002
 How-0.002
robe-0.002
tr-0.002
Token explained
Token ID4893
Feature activation+3.53e-02

Pos logit contributions
 She+0.008
tr+0.007
 He+0.007
 There+0.007
 Why+0.007
Neg logit contributions
ck-0.010
 breath-0.009
ited-0.008
 tug-0.007
 swat-0.007
Token that
Token ID326
Feature activation+5.35e-02

Pos logit contributions
tr+0.043
 She+0.042
 has+0.041
 There+0.040
robe+0.040
Neg logit contributions
ck-0.086
 breath-0.080
 tug-0.069
ited-0.068
away-0.067
Token the
Token ID262
Feature activation+4.21e-02

Pos logit contributions
tr+0.084
 She+0.083
 He+0.081
 There+0.081
robe+0.079
Neg logit contributions
ck-0.105
 breath-0.098
 tug-0.080
ited-0.078
 fear-0.076
Token box
Token ID3091
Feature activation+8.28e-02

Pos logit contributions
 She+0.064
 Did+0.057
 Why+0.056
robe+0.052
 Well+0.052
Neg logit contributions
 breath-0.080
ck-0.073
 tug-0.071
 reason-0.069
 promise-0.069
Token was
Token ID373
Feature activation+1.77e-02

Pos logit contributions
 How+0.096
 Why+0.095
 Did+0.092
 She+0.092
 Where+0.091
Neg logit contributions
ck-0.196
 breath-0.177
 tug-0.154
 promise-0.152
ited-0.149
Token like
Token ID588
Feature activation+1.12e-02

Pos logit contributions
 She+0.026
 He+0.025
tr+0.025
 There+0.024
 Did+0.024
Neg logit contributions
ck-0.039
 breath-0.036
ited-0.031
 tug-0.030
away-0.029
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
tr+0.019
 She+0.018
 He+0.018
 Did+0.018
 Why+0.017
Neg logit contributions
ck-0.021
 breath-0.020
 tug-0.016
 fear-0.016
ited-0.015
Token truck
Token ID7779
Feature activation+2.24e-02

Pos logit contributions
 She+0.019
 Why+0.017
 There+0.017
 Did+0.017
 He+0.017
Neg logit contributions
ck-0.021
 breath-0.021
 tug-0.018
away-0.016
 response-0.016
Token.
Token ID13
Feature activation+3.07e-02

Pos logit contributions
 Did+0.033
 She+0.032
 There+0.031
 How+0.031
 Now+0.031
Neg logit contributions
ck-0.044
 breath-0.039
 tug-0.034
 believe-0.033
ited-0.033
Token saw
Token ID2497
Feature activation+2.44e-02

Pos logit contributions
 She+0.042
tr+0.040
 There+0.038
 He+0.038
robe+0.038
Neg logit contributions
ck-0.074
 breath-0.064
ited-0.061
 tug-0.056
away-0.053
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
 She+0.038
tr+0.038
 He+0.037
 There+0.036
 Did+0.036
Neg logit contributions
ck-0.051
 breath-0.045
ited-0.040
away-0.038
 tug-0.036
Token He
Token ID679
Feature activation+3.38e-02

Pos logit contributions
tr+0.036
fl+0.033
bows+0.032
 living+0.031
 shines+0.030
Neg logit contributions
ck-0.074
 breath-0.067
 tug-0.064
 common-0.062
away-0.062
Token thought
Token ID1807
Feature activation+4.68e-02

Pos logit contributions
 She+0.043
tr+0.040
 He+0.040
 There+0.040
 Why+0.039
Neg logit contributions
ck-0.084
 breath-0.076
ited-0.068
 tug-0.066
away-0.062
Token the
Token ID262
Feature activation+2.86e-02

Pos logit contributions
tr+0.069
 She+0.065
robe+0.062
 Did+0.062
 Why+0.062
Neg logit contributions
ck-0.104
 breath-0.095
 tug-0.081
away-0.075
 fear-0.075
Token journal
Token ID3989
Feature activation+8.26e-02

Pos logit contributions
 She+0.049
tr+0.047
 Did+0.046
 Why+0.046
 There+0.046
Neg logit contributions
 breath-0.052
ck-0.051
ited-0.042
away-0.041
 response-0.040
Token was
Token ID373
Feature activation+3.74e-02

Pos logit contributions
 There+0.106
 She+0.105
 Why+0.104
 Now+0.104
 Did+0.102
Neg logit contributions
ck-0.189
 breath-0.166
 tug-0.149
ited-0.147
 promise-0.141
Token silly
Token ID14397
Feature activation-1.45e-02

Pos logit contributions
 She+0.065
 Did+0.064
tr+0.062
 There+0.061
 Now+0.061
Neg logit contributions
ck-0.074
 breath-0.065
 tug-0.052
 fear-0.051
 common-0.050
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
ck+0.024
 breath+0.023
away+0.018
 fear+0.018
 tug+0.017
Neg logit contributions
tr-0.028
 Did-0.025
 There-0.024
 She-0.024
cel-0.024
Token He
Token ID679
Feature activation+3.79e-02

Pos logit contributions
tr+0.044
bows+0.041
robe+0.039
cel+0.038
fl+0.038
Neg logit contributions
ck-0.067
 breath-0.059
 tug-0.058
 common-0.056
 behold-0.056
Token ripped
Token ID19551
Feature activation+1.49e-02

Pos logit contributions
 She+0.053
tr+0.050
 He+0.050
 There+0.049
 Why+0.048
Neg logit contributions
ck-0.089
 breath-0.080
ited-0.071
 tug-0.068
away-0.064
Token and
Token ID290
Feature activation-2.42e-02

Pos logit contributions
ck+0.007
 breath+0.006
ited+0.005
 tug+0.005
away+0.005
Neg logit contributions
 She-0.003
 He-0.003
 has-0.002
tr-0.002
 There-0.002
Token down
Token ID866
Feature activation+6.13e-03

Pos logit contributions
ck+0.049
 breath+0.045
ited+0.038
 tug+0.037
 swat+0.035
Neg logit contributions
tr-0.040
 She-0.039
 There-0.038
robe-0.038
 Why-0.038
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 She+0.009
tr+0.009
 He+0.008
 There+0.008
 Did+0.008
Neg logit contributions
ck-0.014
 breath-0.013
ited-0.011
 tug-0.010
away-0.010
Token Inside
Token ID14384
Feature activation+3.22e-02

Pos logit contributions
tr+0.013
 has+0.012
robe+0.012
 living+0.011
 shines+0.011
Neg logit contributions
ck-0.043
 breath-0.042
 tug-0.039
 agree-0.038
rig-0.037
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
tr+0.050
 Why+0.046
 How+0.045
 Did+0.044
 He+0.044
Neg logit contributions
ck-0.068
 breath-0.066
ited-0.057
 tug-0.052
away-0.050
Token drawer
Token ID33451
Feature activation+8.26e-02

Pos logit contributions
 She+0.019
 Did+0.018
 He+0.018
 Why+0.018
 There+0.018
Neg logit contributions
ck-0.032
 breath-0.031
ited-0.027
 tug-0.026
 response-0.025
Token was
Token ID373
Feature activation+1.19e-02

Pos logit contributions
tr+0.111
 How+0.110
 Why+0.109
 Where+0.103
 He+0.100
Neg logit contributions
ck-0.179
 breath-0.166
ited-0.145
 tug-0.140
 contain-0.137
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
 She+0.019
 There+0.018
 Did+0.018
 He+0.018
 Why+0.018
Neg logit contributions
ck-0.024
 breath-0.022
ited-0.018
 tug-0.018
 response-0.017
Token secret
Token ID3200
Feature activation+2.33e-02

Pos logit contributions
 She+0.020
 He+0.019
 There+0.019
 Why+0.018
tr+0.018
Neg logit contributions
ck-0.022
 breath-0.021
ited-0.018
 tug-0.017
away-0.016
Token book
Token ID1492
Feature activation+3.94e-02

Pos logit contributions
 She+0.038
 Did+0.038
 Why+0.037
 He+0.036
 There+0.036
Neg logit contributions
ck-0.046
 breath-0.045
ited-0.036
away-0.036
 tug-0.036
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 Did+0.064
 How+0.061
 Why+0.061
 Now+0.059
 He+0.059
Neg logit contributions
ck-0.073
 breath-0.065
 contain-0.058
 believe-0.055
away-0.055
Token of
Token ID286
Feature activation+8.45e-03

Pos logit contributions
 She+0.051
 Did+0.049
 There+0.049
tr+0.048
 has+0.048
Neg logit contributions
ck-0.067
 breath-0.062
ited-0.052
 tug-0.050
away-0.049
Token paper
Token ID3348
Feature activation+4.32e-02

Pos logit contributions
 She+0.012
 He+0.011
 has+0.011
tr+0.011
 sky+0.010
Neg logit contributions
ck-0.020
 breath-0.018
ited-0.016
away-0.015
 tug-0.015
Token out
Token ID503
Feature activation-1.20e-04

Pos logit contributions
 Did+0.080
 There+0.076
 She+0.076
 Now+0.076
 Why+0.075
Neg logit contributions
ck-0.081
 breath-0.077
 tug-0.060
away-0.056
ited-0.056
Token of
Token ID286
Feature activation-1.25e-04

Pos logit contributions
ck+0.000
 breath+0.000
ited+0.000
 tug+0.000
away+0.000
Neg logit contributions
tr-0.000
 She-0.000
 Did-0.000
 Why-0.000
robe-0.000
Token his
Token ID465
Feature activation+2.44e-02

Pos logit contributions
ck+0.000
 breath+0.000
ited+0.000
 tug+0.000
away+0.000
Neg logit contributions
 She-0.000
tr-0.000
 Why-0.000
 There-0.000
 He-0.000
Token drawer
Token ID33451
Feature activation+8.24e-02

Pos logit contributions
 She+0.038
 He+0.036
tr+0.036
 There+0.035
 has+0.035
Neg logit contributions
ck-0.053
 breath-0.048
ited-0.042
 tug-0.039
away-0.039
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
 Why+0.152
 How+0.151
 Did+0.151
 Where+0.150
 There+0.148
Neg logit contributions
ck-0.140
 breath-0.133
ited-0.107
 tug-0.100
 contain-0.100
Token He
Token ID679
Feature activation+2.88e-02

Pos logit contributions
tr+0.019
robe+0.016
bows+0.016
fl+0.015
cel+0.015
Neg logit contributions
 breath-0.049
ck-0.048
 tug-0.045
 agree-0.043
war-0.042
Token asked
Token ID1965
Feature activation+2.27e-02

Pos logit contributions
 She+0.041
 There+0.038
tr+0.038
 He+0.038
 Why+0.038
Neg logit contributions
ck-0.067
 breath-0.061
ited-0.054
 tug-0.053
 swat-0.048
Token his
Token ID465
Feature activation+1.86e-02

Pos logit contributions
tr+0.038
 She+0.038
 There+0.036
 has+0.036
 He+0.035
Neg logit contributions
ck-0.045
 breath-0.042
ited-0.034
 tug-0.034
away-0.033
Token little
Token ID1310
Feature activation+5.57e-04

Pos logit contributions
 She+0.032
 He+0.030
tr+0.030
 There+0.029
 has+0.029
Neg logit contributions
ck-0.038
 breath-0.034
ited-0.030
 tug-0.028
away-0.027
Token his
Token ID465
Feature activation+2.27e-02

Pos logit contributions
tr+0.042
 She+0.041
 Did+0.040
 There+0.040
 has+0.039
Neg logit contributions
ck-0.044
 breath-0.040
ited-0.032
away-0.031
 tug-0.031
Token work
Token ID670
Feature activation+4.36e-02

Pos logit contributions
 She+0.034
tr+0.034
 has+0.033
 He+0.033
 sky+0.033
Neg logit contributions
ck-0.049
 breath-0.041
ited-0.040
away-0.035
war-0.034
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
 She+0.068
 Did+0.066
 There+0.065
tr+0.065
 Why+0.065
Neg logit contributions
ck-0.093
 breath-0.087
ited-0.074
 tug-0.071
away-0.070
Token The
Token ID383
Feature activation+2.08e-02

Pos logit contributions
robe+0.022
tr+0.022
cel+0.020
 living+0.019
bows+0.018
Neg logit contributions
ck-0.052
 breath-0.050
 agree-0.050
 behold-0.047
 response-0.047
Token black
Token ID2042
Feature activation+4.16e-02

Pos logit contributions
 She+0.030
 He+0.029
tr+0.029
 There+0.028
 has+0.028
Neg logit contributions
ck-0.047
 breath-0.043
ited-0.038
 tug-0.035
away-0.035
Tokenboard
Token ID3526
Feature activation+8.19e-02

Pos logit contributions
 She+0.057
 Why+0.054
 There+0.054
 Did+0.053
 How+0.053
Neg logit contributions
ck-0.092
 breath-0.088
ited-0.076
 tug-0.075
 response-0.070
Token was
Token ID373
Feature activation+1.70e-02

Pos logit contributions
 How+0.104
 Why+0.103
 She+0.103
 Did+0.099
 Where+0.098
Neg logit contributions
ck-0.193
 breath-0.179
ited-0.160
 tug-0.157
 contain-0.147
Token perfect
Token ID2818
Feature activation+1.96e-03

Pos logit contributions
 She+0.027
 Did+0.026
tr+0.025
 There+0.025
 Why+0.025
Neg logit contributions
ck-0.036
 breath-0.034
ited-0.028
 tug-0.028
 response-0.026
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
 Did+0.004
tr+0.004
 She+0.003
 There+0.003
 Why+0.003
Neg logit contributions
 breath-0.004
ck-0.003
 tug-0.003
ited-0.003
away-0.003
Token Now
Token ID2735
Feature activation+1.13e-02

Pos logit contributions
robe+0.028
tr+0.027
 living+0.024
cel+0.023
bows+0.022
Neg logit contributions
ck-0.057
 breath-0.053
 agree-0.050
 behold-0.049
 response-0.048
Token he
Token ID339
Feature activation+1.98e-02

Pos logit contributions
tr+0.020
robe+0.019
 Did+0.017
 Why+0.017
 She+0.017
Neg logit contributions
ck-0.023
 breath-0.021
 tug-0.017
away-0.017
 believe-0.017
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
ck+0.041
 breath+0.037
ited+0.034
 tug+0.030
away+0.030
Neg logit contributions
 Did-0.029
tr-0.029
robe-0.029
 He-0.028
 She-0.028
Token L
Token ID406
Feature activation-2.57e-02

Pos logit contributions
tr+0.013
robe+0.012
 has+0.012
bridge+0.011
rob+0.010
Neg logit contributions
ck-0.039
 breath-0.039
 agree-0.035
 tug-0.035
rig-0.034
Tokenise
Token ID786
Feature activation-9.58e-04

Pos logit contributions
ck+0.052
 breath+0.050
ited+0.041
 tug+0.040
away+0.040
Neg logit contributions
 She-0.043
 has-0.040
 There-0.039
 Why-0.039
tr-0.039
Token thought
Token ID1807
Feature activation+4.32e-02

Pos logit contributions
ck+0.002
 breath+0.002
ited+0.002
 tug+0.002
away+0.001
Neg logit contributions
 She-0.002
 He-0.001
tr-0.001
 has-0.001
 There-0.001
Token the
Token ID262
Feature activation+3.78e-02

Pos logit contributions
 She+0.061
tr+0.058
 He+0.058
 has+0.056
 There+0.056
Neg logit contributions
ck-0.100
 breath-0.091
ited-0.080
 tug-0.075
away-0.074
Token laundry
Token ID25724
Feature activation+8.11e-02

Pos logit contributions
 He+0.063
 She+0.062
 Did+0.062
tr+0.061
 Why+0.061
Neg logit contributions
 breath-0.071
ck-0.067
 tug-0.057
ited-0.057
 promise-0.053
Token was
Token ID373
Feature activation+2.77e-02

Pos logit contributions
 Why+0.087
 There+0.087
 Did+0.084
 How+0.083
 She+0.082
Neg logit contributions
 breath-0.196
ck-0.194
ited-0.169
 tug-0.166
 contain-0.162
Token too
Token ID1165
Feature activation-7.56e-03

Pos logit contributions
 She+0.044
tr+0.044
 He+0.043
 Did+0.043
 There+0.042
Neg logit contributions
ck-0.057
 breath-0.052
ited-0.044
 tug-0.043
away-0.040
Token heavy
Token ID4334
Feature activation-1.35e-02

Pos logit contributions
ck+0.015
 breath+0.013
ited+0.012
 swat+0.011
 tug+0.010
Neg logit contributions
 She-0.013
 has-0.013
 He-0.013
 There-0.012
tr-0.012
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
ck+0.024
 breath+0.024
away+0.019
 tug+0.018
ge+0.018
Neg logit contributions
tr-0.024
 Did-0.023
 Where-0.022
 There-0.022
robe-0.021
Token She
Token ID1375
Feature activation+3.18e-02

Pos logit contributions
tr+0.018
robe+0.018
bridge+0.017
 living+0.017
 has+0.017
Neg logit contributions
ck-0.040
 breath-0.038
 tug-0.036
rig-0.035
war-0.034
Token school
Token ID1524
Feature activation+1.10e-02

Pos logit contributions
ck+0.022
 breath+0.020
ited+0.018
 tug+0.016
away+0.016
Neg logit contributions
 She-0.017
 He-0.017
 There-0.016
 has-0.016
 living-0.015
Token.
Token ID13
Feature activation+3.22e-03

Pos logit contributions
 She+0.016
tr+0.016
 He+0.016
 has+0.016
 There+0.015
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.019
 tug-0.018
away-0.018
Token He
Token ID679
Feature activation+2.25e-02

Pos logit contributions
tr+0.002
 living+0.002
 has+0.002
robe+0.001
bows+0.001
Neg logit contributions
ck-0.009
 breath-0.009
 tug-0.008
 agree-0.008
ited-0.008
Token opened
Token ID4721
Feature activation+2.46e-02

Pos logit contributions
tr+0.030
 She+0.028
 Why+0.027
robe+0.026
 How+0.026
Neg logit contributions
ck-0.052
 breath-0.049
ited-0.043
 tug-0.043
away-0.039
Token his
Token ID465
Feature activation+3.21e-02

Pos logit contributions
 She+0.043
tr+0.042
 He+0.041
 Why+0.041
 There+0.041
Neg logit contributions
ck-0.048
 breath-0.044
ited-0.037
 tug-0.034
away-0.034
Token drawer
Token ID33451
Feature activation+8.09e-02

Pos logit contributions
 She+0.048
 Why+0.047
tr+0.047
 Did+0.047
 He+0.046
Neg logit contributions
 breath-0.065
ck-0.063
 tug-0.051
ited-0.051
 fear-0.049
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 How+0.123
 Why+0.123
 Where+0.120
tr+0.119
 Did+0.117
Neg logit contributions
 breath-0.163
ck-0.158
ited-0.130
away-0.127
 contain-0.125
Token took
Token ID1718
Feature activation+4.46e-03

Pos logit contributions
ck+0.076
 breath+0.070
 tug+0.060
ited+0.059
away+0.055
Neg logit contributions
tr-0.057
 She-0.054
 Why-0.053
 How-0.051
 He-0.051
Token out
Token ID503
Feature activation-1.16e-02

Pos logit contributions
 She+0.006
tr+0.006
 He+0.006
 has+0.006
 There+0.006
Neg logit contributions
ck-0.010
 breath-0.009
ited-0.008
 tug-0.008
away-0.008
Token his
Token ID465
Feature activation+1.73e-02

Pos logit contributions
ck+0.027
 breath+0.024
ited+0.022
 tug+0.020
 swat+0.020
Neg logit contributions
 She-0.016
 He-0.015
 There-0.015
 has-0.015
tr-0.015
Token blue
Token ID4171
Feature activation+2.18e-02

Pos logit contributions
 She+0.027
tr+0.026
 He+0.026
 Why+0.025
 There+0.025
Neg logit contributions
ck-0.037
 breath-0.034
ited-0.030
 tug-0.028
away-0.027
Token mom
Token ID1995
Feature activation+1.19e-03

Pos logit contributions
 She+0.053
 He+0.051
tr+0.050
 There+0.050
 Why+0.049
Neg logit contributions
ck-0.065
 breath-0.059
ited-0.050
 tug-0.048
away-0.046
Token looked
Token ID3114
Feature activation+3.78e-02

Pos logit contributions
tr+0.002
robe+0.002
 There+0.002
 Where+0.002
 How+0.002
Neg logit contributions
ck-0.003
 breath-0.002
ited-0.002
 tug-0.002
 swat-0.002
Token in
Token ID287
Feature activation-1.28e-03

Pos logit contributions
 She+0.052
tr+0.050
 He+0.049
 There+0.048
 has+0.047
Neg logit contributions
ck-0.088
 breath-0.082
ited-0.072
 tug-0.068
away-0.067
Token Benny
Token ID44275
Feature activation-3.02e-02

Pos logit contributions
ck+0.003
 breath+0.002
ited+0.002
 tug+0.002
away+0.002
Neg logit contributions
 She-0.002
 He-0.002
tr-0.002
 has-0.002
 There-0.002
Token's
Token ID338
Feature activation+2.13e-02

Pos logit contributions
ck+0.075
 breath+0.070
ited+0.062
 tug+0.059
away+0.058
Neg logit contributions
 She-0.036
tr-0.034
 He-0.034
 There-0.033
 has-0.033
Token drawer
Token ID33451
Feature activation+8.08e-02

Pos logit contributions
 Did+0.036
tr+0.036
 Why+0.036
 She+0.035
 has+0.034
Neg logit contributions
 breath-0.040
ck-0.037
ited-0.030
 tug-0.030
 response-0.029
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
 Where+0.136
 How+0.134
 Why+0.133
tr+0.131
 There+0.131
Neg logit contributions
ck-0.142
 breath-0.137
ited-0.109
away-0.104
illed-0.103
Token found
Token ID1043
Feature activation+3.47e-02

Pos logit contributions
ck+0.054
 breath+0.049
 tug+0.043
ited+0.042
 swat+0.038
Neg logit contributions
tr-0.035
 She-0.035
 Why-0.033
 There-0.033
 He-0.032
Token the
Token ID262
Feature activation+2.36e-02

Pos logit contributions
tr+0.065
 has+0.060
 She+0.059
 sky+0.059
 There+0.059
Neg logit contributions
ck-0.064
 breath-0.058
 tug-0.048
ited-0.043
 swat-0.042
Token toy
Token ID13373
Feature activation+4.54e-02

Pos logit contributions
 She+0.037
 There+0.035
 Did+0.035
 Why+0.034
 He+0.033
Neg logit contributions
 breath-0.048
ck-0.045
 tug-0.040
ited-0.038
 fear-0.036
Token car
Token ID1097
Feature activation+2.93e-02

Pos logit contributions
 She+0.060
 There+0.060
tr+0.060
 Did+0.058
bows+0.058
Neg logit contributions
 breath-0.090
ck-0.090
 tug-0.080
away-0.073
ge-0.069
Token amazed
Token ID24562
Feature activation-1.91e-02

Pos logit contributions
 She+0.026
 He+0.025
tr+0.025
 has+0.024
 There+0.024
Neg logit contributions
ck-0.043
 breath-0.039
ited-0.035
 tug-0.033
away-0.032
Token at
Token ID379
Feature activation+1.78e-02

Pos logit contributions
ck+0.044
 breath+0.041
ited+0.036
away+0.034
 tug+0.034
Neg logit contributions
 She-0.026
tr-0.026
 He-0.025
 There-0.025
 has-0.025
Token how
Token ID703
Feature activation+2.16e-02

Pos logit contributions
 She+0.029
tr+0.029
 He+0.028
 Did+0.027
 There+0.027
Neg logit contributions
ck-0.038
 breath-0.035
ited-0.029
 tug-0.028
 swat-0.027
Token flexible
Token ID12846
Feature activation-1.24e-03

Pos logit contributions
 She+0.036
 He+0.032
 has+0.031
 "+0.030
 living+0.030
Neg logit contributions
ck-0.049
 breath-0.043
ited-0.040
 contain-0.035
 response-0.035
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
ck+0.003
 breath+0.002
ited+0.002
 tug+0.002
away+0.002
Neg logit contributions
tr-0.002
 Did-0.002
 There-0.002
 He-0.002
 Why-0.002
Token file
Token ID2393
Feature activation+8.08e-02

Pos logit contributions
 She+0.020
tr+0.019
 He+0.019
 has+0.019
 There+0.018
Neg logit contributions
ck-0.029
 breath-0.026
ited-0.023
 tug-0.021
away-0.021
Token was
Token ID373
Feature activation+2.08e-02

Pos logit contributions
 Did+0.098
 Why+0.097
 How+0.096
 Where+0.096
 There+0.092
Neg logit contributions
ck-0.180
 breath-0.166
ited-0.147
 tug-0.146
 believe-0.138
Token.
Token ID13
Feature activation+8.72e-03

Pos logit contributions
 Did+0.033
tr+0.033
 She+0.032
 There+0.032
 Why+0.032
Neg logit contributions
ck-0.042
 breath-0.038
ited-0.032
 tug-0.031
away-0.030
Token She
Token ID1375
Feature activation+2.88e-02

Pos logit contributions
tr+0.008
bridge+0.007
robe+0.007
 has+0.007
 living+0.006
Neg logit contributions
ck-0.020
 breath-0.018
 agree-0.017
rig-0.017
 tug-0.017
Token asked
Token ID1965
Feature activation+2.02e-02

Pos logit contributions
 She+0.043
tr+0.041
 There+0.041
 He+0.041
 Why+0.040
Neg logit contributions
ck-0.065
 breath-0.058
ited-0.051
 tug-0.049
 swat-0.046
Token her
Token ID607
Feature activation+1.30e-02

Pos logit contributions
 She+0.032
 He+0.030
tr+0.029
 Why+0.028
 There+0.028
Neg logit contributions
ck-0.045
 breath-0.041
ited-0.037
 tug-0.033
away-0.033
Token His
Token ID2399
Feature activation+3.14e-02

Pos logit contributions
tr+0.020
robe+0.017
 has+0.017
 living+0.016
bows+0.016
Neg logit contributions
ck-0.037
 breath-0.034
ited-0.030
 tug-0.030
 behold-0.029
Token friend
Token ID1545
Feature activation+6.74e-03

Pos logit contributions
 She+0.048
 He+0.046
tr+0.045
 has+0.045
 There+0.044
Neg logit contributions
ck-0.069
 breath-0.062
ited-0.056
away-0.051
 tug-0.051
Token had
Token ID550
Feature activation+2.78e-02

Pos logit contributions
tr+0.009
 She+0.009
 There+0.009
robe+0.009
 He+0.009
Neg logit contributions
ck-0.015
 breath-0.014
ited-0.012
 tug-0.012
 swat-0.011
Token written
Token ID3194
Feature activation+3.57e-02

Pos logit contributions
 She+0.042
tr+0.041
 He+0.040
 There+0.039
 Why+0.039
Neg logit contributions
ck-0.061
 breath-0.056
ited-0.048
 tug-0.046
away-0.044
Token the
Token ID262
Feature activation+3.19e-02

Pos logit contributions
 She+0.052
tr+0.051
 There+0.050
 He+0.048
 Did+0.048
Neg logit contributions
ck-0.078
 breath-0.071
ited-0.060
 tug-0.058
away-0.058
Token notebook
Token ID20922
Feature activation+8.06e-02

Pos logit contributions
 She+0.056
 He+0.054
 There+0.054
 Why+0.053
 Did+0.053
Neg logit contributions
ck-0.058
 breath-0.055
ited-0.047
 response-0.043
 tug-0.043
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
 Did+0.125
 Why+0.121
 There+0.121
 How+0.120
 Where+0.119
Neg logit contributions
ck-0.153
 breath-0.140
 contain-0.121
ited-0.119
away-0.116
Token (space)
Token ID220
Feature activation+1.60e-02

Pos logit contributions
tr+0.023
robe+0.020
 has+0.019
bows+0.019
 living+0.018
Neg logit contributions
ck-0.038
 breath-0.035
 tug-0.032
 behold-0.030
 agree-0.030
Token\n
Token ID198
Feature activation-2.14e-02

Pos logit contributions
tr+0.019
 has+0.017
 sky+0.017
umm+0.017
robe+0.017
Neg logit contributions
ck-0.037
 breath-0.032
rig-0.029
 agree-0.029
 tug-0.028
Token\n
Token ID198
Feature activation-2.31e-02

Pos logit contributions
ck+0.063
 breath+0.058
ited+0.053
 tug+0.051
away+0.050
Neg logit contributions
 She-0.017
tr-0.015
 He-0.015
 has-0.015
 There-0.015
TokenHe
Token ID1544
Feature activation+2.24e-02

Pos logit contributions
ck+0.048
 breath+0.046
ited+0.038
away+0.035
 fear+0.034
Neg logit contributions
 She-0.039
 He-0.038
 Why-0.037
 How-0.035
tr-0.035
Token Spot
Token ID15899
Feature activation-2.31e-02

Pos logit contributions
ck+0.057
 breath+0.052
ited+0.047
 tug+0.043
away+0.043
Neg logit contributions
 She-0.036
 He-0.034
tr-0.033
 has-0.033
 There-0.033
Token knew
Token ID2993
Feature activation+2.89e-02

Pos logit contributions
ck+0.054
 breath+0.048
ited+0.042
 tug+0.041
 believe+0.041
Neg logit contributions
 She-0.030
 There-0.029
tr-0.029
 He-0.029
 How-0.029
Token what
Token ID644
Feature activation+3.61e-02

Pos logit contributions
 She+0.039
tr+0.037
 Did+0.037
robe+0.037
 There+0.036
Neg logit contributions
ck-0.069
 breath-0.062
 tug-0.051
away-0.050
 fear-0.050
Token the
Token ID262
Feature activation+3.09e-02

Pos logit contributions
 She+0.043
 He+0.041
tr+0.040
 has+0.040
 There+0.039
Neg logit contributions
ck-0.091
 breath-0.085
ited-0.076
 tug-0.071
away-0.070
Token red
Token ID2266
Feature activation+5.34e-02

Pos logit contributions
 She+0.056
 Did+0.052
tr+0.052
robe+0.051
 He+0.051
Neg logit contributions
ck-0.058
 breath-0.056
ited-0.044
 tug-0.044
 response-0.043
Token box
Token ID3091
Feature activation+8.05e-02

Pos logit contributions
 She+0.072
 Why+0.070
tr+0.068
 How+0.068
 Did+0.067
Neg logit contributions
ck-0.122
 breath-0.118
ited-0.097
 tug-0.095
away-0.094
Token was
Token ID373
Feature activation+2.36e-02

Pos logit contributions
 How+0.103
 Where+0.102
 She+0.102
 Why+0.102
 Did+0.100
Neg logit contributions
ck-0.190
 breath-0.163
 tug-0.142
 promise-0.141
 believe-0.140
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
tr+0.036
 She+0.036
 Did+0.034
 He+0.034
robe+0.033
Neg logit contributions
ck-0.051
 breath-0.045
away-0.038
 believe-0.038
ited-0.037
Token they
Token ID484
Feature activation+7.86e-03

Pos logit contributions
ck+0.064
 breath+0.058
 tug+0.048
 fear+0.047
 believe+0.045
Neg logit contributions
tr-0.051
robe-0.049
 She-0.049
 Why-0.046
 There-0.045
Token walked
Token ID6807
Feature activation+2.49e-03

Pos logit contributions
tr+0.010
 She+0.010
 How+0.009
robe+0.009
 Why+0.009
Neg logit contributions
ck-0.019
 breath-0.017
ited-0.015
 believe-0.015
 tug-0.015
Token by
Token ID416
Feature activation-1.08e-02

Pos logit contributions
 She+0.004
 He+0.003
tr+0.003
 There+0.003
 has+0.003
Neg logit contributions
ck-0.006
 breath-0.005
ited-0.005
 tug-0.004
away-0.004
Token parts
Token ID3354
Feature activation+1.32e-02

Pos logit contributions
 has+0.029
tr+0.028
 living+0.028
 She+0.028
 He+0.027
Neg logit contributions
ck-0.066
ited-0.059
 breath-0.057
 swat-0.051
away-0.051
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
tr+0.021
 She+0.021
 Did+0.020
 There+0.020
 Why+0.019
Neg logit contributions
ck-0.028
 breath-0.025
ited-0.021
 tug-0.021
away-0.020
Token They
Token ID1119
Feature activation+2.18e-02

Pos logit contributions
tr+0.028
robe+0.026
 living+0.025
 has+0.025
 sky+0.022
Neg logit contributions
ck-0.090
 breath-0.083
 tug-0.077
 common-0.076
 agree-0.074
Token think
Token ID892
Feature activation+2.84e-02

Pos logit contributions
 She+0.033
tr+0.032
 He+0.031
 There+0.031
 Did+0.031
Neg logit contributions
ck-0.048
 breath-0.044
ited-0.038
 tug-0.037
away-0.035
Token the
Token ID262
Feature activation+3.54e-02

Pos logit contributions
 She+0.043
tr+0.042
 He+0.040
 Why+0.040
 Did+0.040
Neg logit contributions
ck-0.062
 breath-0.057
ited-0.048
 tug-0.047
away-0.045
Token box
Token ID3091
Feature activation+8.05e-02

Pos logit contributions
 She+0.057
 Did+0.054
 Why+0.052
 He+0.051
 There+0.051
Neg logit contributions
ck-0.071
 breath-0.070
 tug-0.058
 response-0.054
 fear-0.054
Token is
Token ID318
Feature activation+2.64e-02

Pos logit contributions
 Did+0.065
 Why+0.062
 She+0.061
 Where+0.060
 How+0.059
Neg logit contributions
ck-0.231
 breath-0.210
 promise-0.184
 tug-0.182
ited-0.181
Token a
Token ID257
Feature activation+2.11e-02

Pos logit contributions
 She+0.041
 He+0.039
tr+0.039
 has+0.039
 There+0.038
Neg logit contributions
ck-0.057
 breath-0.052
ited-0.046
 tug-0.042
away-0.042
Token lie
Token ID6486
Feature activation+8.92e-03

Pos logit contributions
 She+0.036
 He+0.034
 There+0.034
 Did+0.033
 Why+0.033
Neg logit contributions
ck-0.042
 breath-0.040
ited-0.033
 tug-0.032
away-0.031
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
 She+0.016
 Did+0.016
 There+0.015
 Where+0.015
 He+0.015
Neg logit contributions
ck-0.016
 breath-0.014
away-0.012
ited-0.011
 tug-0.011
Token They
Token ID1119
Feature activation+1.91e-02

Pos logit contributions
tr+0.028
robe+0.026
 has+0.025
 living+0.025
 sky+0.022
Neg logit contributions
ck-0.086
 breath-0.078
 tug-0.073
 common-0.071
 agree-0.070
Token wanted
Token ID2227
Feature activation+1.79e-02

Pos logit contributions
ck+0.049
 breath+0.045
ited+0.040
 tug+0.038
away+0.037
Neg logit contributions
 She-0.023
tr-0.022
 He-0.022
 There-0.022
 Why-0.021
Token to
Token ID284
Feature activation-2.01e-02

Pos logit contributions
tr+0.025
 She+0.024
 He+0.024
robe+0.024
 has+0.023
Neg logit contributions
ck-0.041
 breath-0.038
ited-0.032
 tug-0.032
away-0.030
Token know
Token ID760
Feature activation+2.35e-02

Pos logit contributions
ck+0.041
 breath+0.037
ited+0.034
away+0.029
 tug+0.029
Neg logit contributions
 She-0.034
 He-0.032
 There-0.031
tr-0.031
 has-0.031
Token what
Token ID644
Feature activation+3.19e-02

Pos logit contributions
 She+0.037
tr+0.036
 He+0.036
 There+0.035
 has+0.035
Neg logit contributions
ck-0.050
 breath-0.045
ited-0.038
 tug-0.037
away-0.036
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
 She+0.042
 He+0.040
tr+0.040
 There+0.039
 has+0.039
Neg logit contributions
ck-0.076
 breath-0.070
ited-0.062
 tug-0.058
away-0.058
Token box
Token ID3091
Feature activation+8.05e-02

Pos logit contributions
 She+0.040
tr+0.039
 Did+0.039
 He+0.038
 Why+0.038
Neg logit contributions
ck-0.045
 breath-0.045
 tug-0.036
ited-0.036
 response-0.034
Token was
Token ID373
Feature activation+2.19e-02

Pos logit contributions
 Why+0.097
 How+0.096
tr+0.096
 Did+0.094
 Where+0.093
Neg logit contributions
ck-0.189
 breath-0.172
ited-0.152
 tug-0.148
 promise-0.145
Token for
Token ID329
Feature activation-3.07e-03

Pos logit contributions
tr+0.033
 She+0.033
 Did+0.032
 He+0.031
 Why+0.031
Neg logit contributions
ck-0.048
 breath-0.043
ited-0.036
away-0.036
 believe-0.036
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
ck+0.007
 breath+0.007
ited+0.006
 tug+0.006
away+0.006
Neg logit contributions
 She-0.004
 He-0.004
tr-0.004
 has-0.004
 There-0.004
Token She
Token ID1375
Feature activation+3.06e-02

Pos logit contributions
robe+0.010
tr+0.009
 has+0.008
 living+0.008
rob+0.008
Neg logit contributions
ck-0.027
 breath-0.027
 agree-0.024
 tug-0.024
ited-0.024
Token asked
Token ID1965
Feature activation+1.97e-02

Pos logit contributions
 She+0.048
 He+0.045
tr+0.045
 There+0.044
 has+0.044
Neg logit contributions
ck-0.066
 breath-0.060
ited-0.053
 tug-0.049
away-0.048
Token\n
Token ID198
Feature activation-1.56e-02

Pos logit contributions
tr+0.021
robe+0.018
 living+0.017
 has+0.017
cel+0.016
Neg logit contributions
ck-0.033
 breath-0.032
ited-0.028
 tug-0.028
 agree-0.027
Token\n
Token ID198
Feature activation-1.95e-02

Pos logit contributions
ck+0.048
 breath+0.045
ited+0.042
 tug+0.039
away+0.039
Neg logit contributions
 She-0.011
 He-0.010
 There-0.008
 has-0.008
 Why-0.008
TokenHe
Token ID1544
Feature activation+2.17e-02

Pos logit contributions
ck+0.039
 breath+0.037
ited+0.030
 tug+0.027
 swat+0.027
Neg logit contributions
 She-0.034
 He-0.034
 Why-0.031
 has-0.031
 There-0.031
Token opened
Token ID4721
Feature activation+2.16e-02

Pos logit contributions
 She+0.033
 He+0.031
tr+0.031
 There+0.030
 has+0.030
Neg logit contributions
ck-0.048
 breath-0.044
ited-0.039
 tug-0.036
away-0.035
Token the
Token ID262
Feature activation+2.80e-02

Pos logit contributions
 She+0.036
 He+0.035
tr+0.034
 There+0.033
 has+0.033
Neg logit contributions
ck-0.044
 breath-0.040
ited-0.035
 tug-0.032
away-0.032
Token tool
Token ID2891
Feature activation+8.04e-02

Pos logit contributions
 She+0.038
 He+0.035
 Did+0.035
 Why+0.035
tr+0.035
Neg logit contributions
ck-0.064
 breath-0.061
ited-0.053
 tug-0.051
away-0.049
Tokenbox
Token ID3524
Feature activation+5.65e-02

Pos logit contributions
 How+0.079
 Why+0.079
 She+0.076
 Where+0.076
 Did+0.076
Neg logit contributions
 breath-0.193
ck-0.192
 tug-0.164
ited-0.160
away-0.158
Token and
Token ID290
Feature activation-3.15e-02

Pos logit contributions
tr+0.075
 She+0.072
 How+0.070
 Why+0.070
 Did+0.068
Neg logit contributions
ck-0.134
 breath-0.126
ited-0.108
 tug-0.106
away-0.100
Token found
Token ID1043
Feature activation+3.58e-02

Pos logit contributions
ck+0.067
 breath+0.061
ited+0.052
 tug+0.050
away+0.048
Neg logit contributions
 She-0.051
tr-0.049
 He-0.048
 There-0.047
 Why-0.047
Token all
Token ID477
Feature activation-2.05e-02

Pos logit contributions
tr+0.064
 She+0.060
 Did+0.059
robe+0.059
 has+0.058
Neg logit contributions
ck-0.072
 breath-0.065
 tug-0.054
ited-0.050
away-0.049
Token the
Token ID262
Feature activation+9.09e-03

Pos logit contributions
ck+0.036
 breath+0.033
ited+0.028
 tug+0.025
away+0.024
Neg logit contributions
tr-0.039
 She-0.039
 There-0.038
 Did-0.037
 Why-0.037
Token no
Token ID645
Feature activation-1.06e-02

Pos logit contributions
tr+0.106
 She+0.101
 He+0.100
 sky+0.099
robe+0.099
Neg logit contributions
ck-0.097
 breath-0.086
 tug-0.071
ited-0.068
 swat-0.063
Token,
Token ID11
Feature activation+2.53e-02

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.021
 tug+0.020
away+0.019
Neg logit contributions
 She-0.014
tr-0.014
 He-0.013
 There-0.013
 has-0.013
Token because
Token ID780
Feature activation+1.61e-02

Pos logit contributions
tr+0.041
robe+0.039
 He+0.036
 She+0.036
 Did+0.035
Neg logit contributions
ck-0.052
 breath-0.047
 tug-0.040
 swat-0.038
 fear-0.037
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
tr+0.025
 She+0.025
 He+0.024
 Why+0.024
 There+0.024
Neg logit contributions
ck-0.033
 breath-0.031
 tug-0.025
ited-0.024
 fear-0.024
Token ash
Token ID12530
Feature activation+7.29e-02

Pos logit contributions
 She+0.071
 Did+0.071
 Why+0.069
 There+0.068
 He+0.068
Neg logit contributions
ck-0.056
 breath-0.052
 fear-0.041
 tug-0.039
 response-0.038
Tokentr
Token ID2213
Feature activation+8.02e-02

Pos logit contributions
 Did+0.068
 These+0.067
 There+0.066
 How+0.066
 Why+0.065
Neg logit contributions
ck-0.173
 breath-0.163
 fear-0.136
 promise-0.134
ot-0.134
Tokenay
Token ID323
Feature activation+2.53e-02

Pos logit contributions
 living+0.159
 How+0.146
 Why+0.143
 market+0.143
 Where+0.142
Neg logit contributions
ck-0.117
 breath-0.105
ingle-0.091
ir-0.087
ot-0.080
Token was
Token ID373
Feature activation+1.07e-02

Pos logit contributions
 Why+0.025
 There+0.025
 How+0.025
 Where+0.024
 He+0.024
Neg logit contributions
ck-0.066
 breath-0.058
ited-0.051
 tug-0.050
away-0.047
Token not
Token ID407
Feature activation-3.17e-02

Pos logit contributions
 She+0.017
 Did+0.017
 He+0.016
 There+0.016
 Why+0.016
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.017
 tug-0.017
 swat-0.016
Token a
Token ID257
Feature activation+2.35e-02

Pos logit contributions
ck+0.066
 breath+0.060
ited+0.050
 tug+0.048
away+0.047
Neg logit contributions
 She-0.053
 He-0.050
tr-0.050
 There-0.050
 Did-0.050
Token toy
Token ID13373
Feature activation+4.55e-02

Pos logit contributions
 She+0.041
 There+0.040
 Did+0.039
 Why+0.039
 He+0.039
Neg logit contributions
ck-0.043
 breath-0.042
 tug-0.036
away-0.034
 reason-0.034
Token He
Token ID679
Feature activation+2.32e-02

Pos logit contributions
tr+0.017
bows+0.015
robe+0.015
 has+0.014
 shines+0.014
Neg logit contributions
ck-0.042
 breath-0.040
 tug-0.038
rig-0.036
 behold-0.036
Token found
Token ID1043
Feature activation+3.25e-02

Pos logit contributions
 She+0.032
tr+0.031
 There+0.030
 Why+0.030
 He+0.030
Neg logit contributions
ck-0.054
 breath-0.049
ited-0.044
 tug-0.043
 swat-0.039
Token out
Token ID503
Feature activation-1.43e-02

Pos logit contributions
tr+0.059
 She+0.056
 has+0.054
 Did+0.054
 There+0.053
Neg logit contributions
ck-0.062
 breath-0.059
 tug-0.047
 fear-0.044
ited-0.042
Token his
Token ID465
Feature activation+3.08e-02

Pos logit contributions
ck+0.036
 breath+0.034
ited+0.029
 tug+0.029
away+0.028
Neg logit contributions
tr-0.017
 She-0.017
 Did-0.016
 He-0.016
 There-0.016
Token special
Token ID2041
Feature activation+3.03e-02

Pos logit contributions
 She+0.046
 He+0.044
tr+0.043
 There+0.042
 has+0.042
Neg logit contributions
ck-0.068
 breath-0.063
ited-0.055
 tug-0.052
away-0.050
Token drawer
Token ID33451
Feature activation+8.02e-02

Pos logit contributions
 She+0.044
 He+0.042
tr+0.041
 Why+0.041
 There+0.041
Neg logit contributions
ck-0.067
 breath-0.066
ited-0.055
 tug-0.054
away-0.052
Token was
Token ID373
Feature activation+1.89e-02

Pos logit contributions
 Why+0.132
 How+0.131
 Where+0.129
 Did+0.128
 She+0.127
Neg logit contributions
ck-0.150
 breath-0.148
ited-0.120
 tug-0.117
 contain-0.113
Token even
Token ID772
Feature activation-3.44e-02

Pos logit contributions
 She+0.031
 Did+0.030
 There+0.030
 Why+0.030
tr+0.029
Neg logit contributions
ck-0.038
 breath-0.035
 tug-0.029
ited-0.028
away-0.026
Token more
Token ID517
Feature activation-1.67e-02

Pos logit contributions
ck+0.066
 breath+0.059
ited+0.051
 tug+0.047
away+0.045
Neg logit contributions
 She-0.063
 There-0.060
tr-0.060
 He-0.059
 Why-0.058
Token special
Token ID2041
Feature activation-1.12e-02

Pos logit contributions
ck+0.026
 breath+0.024
 tug+0.019
ited+0.019
away+0.018
Neg logit contributions
tr-0.035
 She-0.034
 Why-0.034
 Did-0.034
 There-0.034
Token than
Token ID621
Feature activation-2.82e-02

Pos logit contributions
 breath+0.020
ck+0.020
away+0.016
 tug+0.016
 believe+0.015
Neg logit contributions
tr-0.021
 She-0.020
 There-0.020
 Did-0.019
 He-0.019
Token then
Token ID788
Feature activation-1.26e-02

Pos logit contributions
ck+0.041
 breath+0.038
ited+0.033
 tug+0.032
away+0.031
Neg logit contributions
 She-0.023
tr-0.023
 He-0.022
 There-0.022
 has-0.021
Token her
Token ID607
Feature activation+1.52e-02

Pos logit contributions
ck+0.032
 breath+0.029
ited+0.027
away+0.025
 tug+0.025
Neg logit contributions
 She-0.015
 He-0.014
 has-0.014
tr-0.014
 There-0.014
Token friend
Token ID1545
Feature activation-1.27e-02

Pos logit contributions
 She+0.024
tr+0.024
 He+0.023
 has+0.023
 There+0.023
Neg logit contributions
ck-0.033
 breath-0.029
ited-0.026
away-0.024
 tug-0.024
Token started
Token ID2067
Feature activation+8.60e-03

Pos logit contributions
ck+0.033
 breath+0.031
ited+0.027
away+0.026
 tug+0.025
Neg logit contributions
 She-0.015
 He-0.014
tr-0.013
 has-0.013
 There-0.013
Token to
Token ID284
Feature activation-3.29e-02

Pos logit contributions
 She+0.008
 "+0.007
 living+0.007
 He+0.007
 Why+0.007
Neg logit contributions
ck-0.025
ited-0.022
 breath-0.022
away-0.021
 behold-0.020
Token mis
Token ID2984
Feature activation-1.26e-01

Pos logit contributions
ck+0.072
away+0.059
 breath+0.058
ited+0.057
shadow+0.052
Neg logit contributions
 She-0.054
 He-0.046
 "-0.045
 How-0.045
 living-0.045
Tokenbe
Token ID1350
Feature activation-4.57e-02

Pos logit contributions
 breath+0.252
ck+0.251
 accomplishment+0.241
 common+0.225
 footing+0.217
Neg logit contributions
 She-0.168
 Why-0.154
tr-0.145
 How-0.140
I-0.139
Tokenhave
Token ID14150
Feature activation-7.84e-03

Pos logit contributions
ck+0.084
 breath+0.082
 contain+0.073
ited+0.071
rig+0.070
Neg logit contributions
 has-0.068
 is-0.066
 She-0.066
 living-0.066
 How-0.064
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
ck+0.019
 breath+0.018
ited+0.016
 tug+0.015
away+0.014
Neg logit contributions
 She-0.010
 He-0.010
tr-0.009
 has-0.009
 There-0.009
Token She
Token ID1375
Feature activation+1.63e-02

Pos logit contributions
tr+0.015
lor+0.013
fl+0.013
bows+0.013
 shines+0.013
Neg logit contributions
 breath-0.017
ck-0.017
 tug-0.016
 err-0.015
ited-0.014
Token was
Token ID373
Feature activation+8.04e-03

Pos logit contributions
 She+0.022
 He+0.021
 has+0.021
tr+0.020
 There+0.020
Neg logit contributions
ck-0.039
 breath-0.036
ited-0.032
away-0.030
 tug-0.029
Token Lucy
Token ID22162
Feature activation-2.83e-02

Pos logit contributions
ck+0.021
 breath+0.019
ited+0.017
 tug+0.016
away+0.016
Neg logit contributions
 She-0.014
tr-0.014
 He-0.013
 has-0.013
 There-0.013
Token noticed
Token ID6810
Feature activation+1.25e-02

Pos logit contributions
ck+0.068
 breath+0.063
ited+0.056
 tug+0.052
away+0.052
Neg logit contributions
 She-0.037
 He-0.035
tr-0.034
 has-0.034
 There-0.034
Token the
Token ID262
Feature activation+1.57e-03

Pos logit contributions
 She+0.017
 He+0.016
tr+0.016
 has+0.016
 There+0.016
Neg logit contributions
ck-0.029
 breath-0.027
ited-0.024
 tug-0.022
away-0.022
Token weather
Token ID6193
Feature activation+8.71e-03

Pos logit contributions
 She+0.002
 There+0.002
 He+0.002
 Did+0.002
 Why+0.002
Neg logit contributions
ck-0.003
 breath-0.003
ited-0.003
away-0.003
 tug-0.003
Token was
Token ID373
Feature activation+4.47e-03

Pos logit contributions
 Did+0.012
 He+0.012
 She+0.011
 How+0.011
 There+0.011
Neg logit contributions
ck-0.019
 breath-0.018
ited-0.016
 tug-0.015
away-0.015
Token uncom
Token ID8820
Feature activation-1.18e-01

Pos logit contributions
 She+0.007
 He+0.007
tr+0.007
 There+0.007
 has+0.007
Neg logit contributions
ck-0.009
 breath-0.009
ited-0.007
 tug-0.007
away-0.007
Tokenfort
Token ID3319
Feature activation-5.66e-02

Pos logit contributions
 breath+0.208
ck+0.190
 fear+0.179
 response+0.160
 common+0.158
Neg logit contributions
tr-0.217
 She-0.175
cel-0.170
 Why-0.170
bows-0.164
Tokenably
Token ID1346
Feature activation-3.08e-02

Pos logit contributions
ck+0.093
 breath+0.086
ited+0.073
 tug+0.071
 response+0.067
Neg logit contributions
tr-0.111
 She-0.109
robe-0.105
 He-0.103
 living-0.100
Token hot
Token ID3024
Feature activation-2.48e-02

Pos logit contributions
ck+0.063
 breath+0.058
ited+0.050
 tug+0.046
away+0.046
Neg logit contributions
 She-0.051
 He-0.049
tr-0.048
 There-0.047
 has-0.047
Token.
Token ID13
Feature activation+7.38e-03

Pos logit contributions
 breath+0.040
ck+0.039
away+0.032
ited+0.032
 response+0.028
Neg logit contributions
 Did-0.046
tr-0.045
 has-0.042
 There-0.042
 Where-0.042
Token She
Token ID1375
Feature activation+2.20e-02

Pos logit contributions
tr+0.003
 has+0.003
robe+0.003
pit+0.002
 living+0.002
Neg logit contributions
 breath-0.021
ck-0.020
 tug-0.019
 panic-0.018
 fear-0.018
Token it
Token ID340
Feature activation+2.86e-02

Pos logit contributions
ck+0.020
 breath+0.018
ited+0.016
 tug+0.015
away+0.015
Neg logit contributions
 She-0.014
 He-0.013
tr-0.013
 has-0.013
 There-0.013
Token fits
Token ID11414
Feature activation-1.72e-02

Pos logit contributions
 She+0.032
 Why+0.032
 Did+0.031
 He+0.031
 There+0.030
Neg logit contributions
ck-0.072
 breath-0.064
ited-0.057
 tug-0.055
 promise-0.053
Token!"
Token ID2474
Feature activation-2.19e-03

Pos logit contributions
ck+0.038
 breath+0.035
ited+0.031
 tug+0.029
away+0.028
Neg logit contributions
 She-0.025
 He-0.024
tr-0.024
 has-0.024
 There-0.023
Token She
Token ID1375
Feature activation+2.20e-02

Pos logit contributions
ck+0.006
 breath+0.005
 tug+0.004
 believe+0.004
 swat+0.004
Neg logit contributions
robe-0.002
 He-0.002
tr-0.002
 has-0.002
 living-0.002
Token helped
Token ID4193
Feature activation-1.47e-02

Pos logit contributions
 She+0.031
tr+0.029
 He+0.029
 has+0.029
 There+0.028
Neg logit contributions
ck-0.051
 breath-0.047
ited-0.041
away-0.039
 tug-0.038
Token Sil
Token ID4243
Feature activation-1.17e-01

Pos logit contributions
ck+0.033
ited+0.031
 breath+0.030
rig+0.026
 contain+0.025
Neg logit contributions
 She-0.022
 He-0.019
tr-0.018
 "-0.018
 There-0.018
Tokenvia
Token ID8869
Feature activation-4.18e-02

Pos logit contributions
 breath+0.246
ck+0.196
 tug+0.186
 avail+0.186
 fear+0.185
Neg logit contributions
tr-0.185
ase-0.170
 has-0.166
 She-0.165
 Why-0.165
Token put
Token ID1234
Feature activation+1.20e-02

Pos logit contributions
ck+0.078
 breath+0.070
ited+0.057
 tug+0.056
 swat+0.052
Neg logit contributions
 She-0.077
 He-0.075
tr-0.075
 Did-0.075
 Why-0.075
Token on
Token ID319
Feature activation-2.81e-02

Pos logit contributions
tr+0.020
 Did+0.020
 Why+0.020
 She+0.020
 There+0.019
Neg logit contributions
ck-0.025
 breath-0.023
 tug-0.018
ited-0.017
away-0.017
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
ck+0.060
 breath+0.053
ited+0.050
away+0.045
 tug+0.044
Neg logit contributions
 She-0.045
 He-0.042
tr-0.041
 has-0.041
 There-0.040
Token coat
Token ID13209
Feature activation+4.11e-02

Pos logit contributions
 She+0.025
 Why+0.024
tr+0.024
 Did+0.024
 He+0.024
Neg logit contributions
ck-0.032
 breath-0.030
ited-0.025
 tug-0.024
away-0.023
Token a
Token ID257
Feature activation-3.91e-02

Pos logit contributions
ck+0.094
 breath+0.088
ited+0.083
away+0.078
ge+0.075
Neg logit contributions
 She-0.037
 He-0.031
 There-0.031
 living-0.030
 sky-0.029
Token little
Token ID1310
Feature activation-6.39e-02

Pos logit contributions
ck+0.092
 breath+0.082
ited+0.076
 contain+0.068
war+0.067
Neg logit contributions
 She-0.056
 He-0.054
 has-0.054
 living-0.052
tr-0.052
Token bird
Token ID6512
Feature activation-4.80e-02

Pos logit contributions
ck+0.168
 breath+0.151
ited+0.139
 tug+0.129
away+0.127
Neg logit contributions
 She-0.073
 He-0.068
 has-0.066
tr-0.066
 There-0.064
Tokenie
Token ID494
Feature activation-4.31e-02

Pos logit contributions
ck+0.107
 breath+0.102
ited+0.092
 tug+0.087
war+0.082
Neg logit contributions
 She-0.069
tr-0.065
 He-0.064
 living-0.064
 has-0.062
Token named
Token ID3706
Feature activation-2.81e-02

Pos logit contributions
ck+0.105
 breath+0.100
ited+0.095
 tug+0.086
war+0.085
Neg logit contributions
 She-0.051
 living-0.049
 He-0.048
tr-0.048
 has-0.047
Token Twe
Token ID24205
Feature activation-1.16e-01

Pos logit contributions
ck+0.071
 breath+0.064
ited+0.064
away+0.057
 tug+0.056
Neg logit contributions
 She-0.038
 He-0.035
 There-0.033
 "-0.032
tr-0.032
Tokenety
Token ID2963
Feature activation-4.74e-02

Pos logit contributions
 breath+0.268
ck+0.257
 contain+0.222
 tug+0.222
ited+0.217
Neg logit contributions
 She-0.165
tr-0.154
 He-0.141
 living-0.135
 has-0.133
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
ck+0.120
 breath+0.115
ited+0.106
 tug+0.099
 response+0.095
Neg logit contributions
 She-0.055
 He-0.052
 living-0.050
tr-0.050
 has-0.049
Token Twe
Token ID24205
Feature activation-9.61e-02

Pos logit contributions
tr+0.026
robe+0.025
 living+0.025
box+0.024
 Why+0.024
Neg logit contributions
ck-0.011
 breath-0.010
away-0.006
ir-0.006
 response-0.006
Tokenety
Token ID2963
Feature activation-1.63e-02

Pos logit contributions
ck+0.195
 breath+0.189
ited+0.156
 tug+0.154
 contain+0.148
Neg logit contributions
 She-0.161
tr-0.149
 He-0.149
 has-0.142
 Why-0.139
Token was
Token ID373
Feature activation+1.12e-02

Pos logit contributions
ck+0.042
 breath+0.037
ited+0.034
 tug+0.032
away+0.031
Neg logit contributions
 She-0.018
tr-0.018
 There-0.018
 He-0.017
 Why-0.017
Token who
Token ID508
Feature activation+4.86e-03

Pos logit contributions
ck+0.079
 breath+0.074
ited+0.067
 tug+0.063
away+0.062
Neg logit contributions
 She-0.040
tr-0.038
 He-0.037
 has-0.037
 living-0.036
Token was
Token ID373
Feature activation+5.09e-03

Pos logit contributions
 She+0.004
 He+0.004
tr+0.004
 has+0.004
 There+0.004
Neg logit contributions
ck-0.014
 breath-0.013
ited-0.012
 tug-0.011
away-0.011
Token trying
Token ID2111
Feature activation+4.18e-03

Pos logit contributions
 She+0.008
 He+0.008
tr+0.008
 There+0.007
 has+0.007
Neg logit contributions
ck-0.011
 breath-0.010
ited-0.009
 tug-0.008
away-0.008
Token to
Token ID284
Feature activation-2.67e-02

Pos logit contributions
 She+0.003
 "+0.003
 living+0.003
 He+0.002
 has+0.002
Neg logit contributions
ck-0.013
ited-0.012
 breath-0.012
 contain-0.010
 behold-0.010
Token take
Token ID1011
Feature activation-1.86e-02

Pos logit contributions
ited+0.055
ck+0.053
 breath+0.051
rig+0.048
illed+0.047
Neg logit contributions
 She-0.035
 He-0.031
 sky-0.030
 is-0.030
 living-0.029
Token Twe
Token ID24205
Feature activation-1.16e-01

Pos logit contributions
ck+0.043
 breath+0.039
ited+0.037
 tug+0.033
 contain+0.032
Neg logit contributions
 She-0.027
 He-0.025
 There-0.024
 has-0.024
tr-0.023
Tokenety
Token ID2963
Feature activation-2.94e-02

Pos logit contributions
 breath+0.259
ck+0.258
 tug+0.215
 contain+0.210
ited+0.204
Neg logit contributions
 She-0.165
tr-0.164
 He-0.142
 has-0.138
 Did-0.138
Token's
Token ID338
Feature activation+7.76e-03

Pos logit contributions
ck+0.083
 breath+0.076
ited+0.074
 tug+0.070
 behold+0.067
Neg logit contributions
tr-0.027
 She-0.026
 living-0.023
 has-0.022
 He-0.021
Token nest
Token ID16343
Feature activation+2.98e-03

Pos logit contributions
tr+0.012
 She+0.012
 sky+0.011
 has+0.011
 There+0.011
Neg logit contributions
ck-0.017
 breath-0.014
ited-0.014
 tug-0.013
away-0.012
Token.
Token ID13
Feature activation-4.96e-03

Pos logit contributions
 She+0.003
tr+0.003
 He+0.003
robe+0.003
 has+0.003
Neg logit contributions
ck-0.008
 breath-0.007
ited-0.007
 tug-0.006
 swat-0.006
Token Twe
Token ID24205
Feature activation-9.45e-02

Pos logit contributions
ck+0.009
 breath+0.008
 tug+0.007
away+0.007
 swat+0.007
Neg logit contributions
robe-0.008
tr-0.008
 living-0.008
oll-0.007
pit-0.007
Token a
Token ID257
Feature activation-3.72e-02

Pos logit contributions
ck+0.079
 breath+0.073
ited+0.068
away+0.063
 tug+0.062
Neg logit contributions
 She-0.035
 He-0.032
 There-0.030
 How-0.029
 living-0.029
Token little
Token ID1310
Feature activation-6.28e-02

Pos logit contributions
ck+0.088
 breath+0.078
ited+0.073
 contain+0.065
war+0.065
Neg logit contributions
 She-0.053
 He-0.051
 has-0.051
 living-0.050
 "-0.049
Token bird
Token ID6512
Feature activation-4.66e-02

Pos logit contributions
ck+0.168
 breath+0.151
ited+0.140
 tug+0.129
 contain+0.128
Neg logit contributions
 She-0.070
 He-0.065
 has-0.063
tr-0.062
 sky-0.060
Tokenie
Token ID494
Feature activation-4.31e-02

Pos logit contributions
ck+0.100
 breath+0.094
ited+0.084
 tug+0.079
away+0.074
Neg logit contributions
 She-0.071
tr-0.068
 He-0.067
 living-0.065
 has-0.065
Token named
Token ID3706
Feature activation-2.80e-02

Pos logit contributions
ck+0.104
 breath+0.098
ited+0.092
 tug+0.084
war+0.082
Neg logit contributions
 She-0.053
 He-0.050
 living-0.050
tr-0.050
 has-0.049
Token Twe
Token ID24205
Feature activation-1.15e-01

Pos logit contributions
ck+0.069
 breath+0.063
ited+0.062
away+0.055
 tug+0.055
Neg logit contributions
 She-0.038
 He-0.036
 There-0.034
tr-0.033
 "-0.032
Tokenety
Token ID2963
Feature activation-4.73e-02

Pos logit contributions
 breath+0.265
ck+0.262
 contain+0.221
 tug+0.220
 fear+0.214
Neg logit contributions
 She-0.162
tr-0.152
 He-0.143
 has-0.133
robe-0.132
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
ck+0.118
 breath+0.111
ited+0.102
 tug+0.095
away+0.091
Neg logit contributions
 She-0.057
 He-0.054
tr-0.053
 living-0.052
 has-0.051
Token Twe
Token ID24205
Feature activation-9.43e-02

Pos logit contributions
tr+0.026
robe+0.025
boards+0.025
 living+0.025
box+0.024
Neg logit contributions
ck-0.012
 breath-0.010
 swat-0.006
away-0.006
 response-0.006
Tokenety
Token ID2963
Feature activation-1.49e-02

Pos logit contributions
ck+0.194
 breath+0.187
ited+0.154
 tug+0.152
 contain+0.147
Neg logit contributions
 She-0.157
 He-0.146
tr-0.146
 has-0.139
 Why-0.136
Token lived
Token ID5615
Feature activation-6.35e-03

Pos logit contributions
ck+0.038
 breath+0.034
ited+0.031
 tug+0.029
away+0.029
Neg logit contributions
 She-0.017
 There-0.016
tr-0.016
 He-0.016
 Why-0.016
Token a
Token ID257
Feature activation-3.47e-02

Pos logit contributions
ck+0.079
 breath+0.073
ited+0.068
away+0.064
 tug+0.062
Neg logit contributions
 She-0.032
 He-0.028
 There-0.027
 sky-0.026
 living-0.026
Token little
Token ID1310
Feature activation-5.93e-02

Pos logit contributions
ck+0.081
 breath+0.072
ited+0.066
 contain+0.058
away+0.058
Neg logit contributions
 She-0.051
 He-0.049
 has-0.049
 living-0.047
tr-0.047
Token bird
Token ID6512
Feature activation-4.56e-02

Pos logit contributions
ck+0.158
 breath+0.142
ited+0.131
 tug+0.121
away+0.120
Neg logit contributions
 She-0.066
 He-0.061
 has-0.060
tr-0.059
 sky-0.057
Tokenie
Token ID494
Feature activation-3.96e-02

Pos logit contributions
ck+0.099
 breath+0.094
ited+0.084
 tug+0.079
 response+0.074
Neg logit contributions
 She-0.069
tr-0.065
 He-0.065
 living-0.063
 has-0.063
Token named
Token ID3706
Feature activation-2.88e-02

Pos logit contributions
ck+0.094
 breath+0.089
ited+0.081
 tug+0.075
 response+0.072
Neg logit contributions
 She-0.052
 He-0.049
tr-0.048
 has-0.048
 living-0.048
Token Twe
Token ID24205
Feature activation-1.15e-01

Pos logit contributions
ck+0.071
 breath+0.065
ited+0.063
away+0.057
 tug+0.056
Neg logit contributions
 She-0.040
 He-0.037
 There-0.035
tr-0.034
 "-0.034
Tokenety
Token ID2963
Feature activation-4.62e-02

Pos logit contributions
 breath+0.272
ck+0.263
 contain+0.230
 tug+0.226
 fear+0.223
Neg logit contributions
 She-0.154
tr-0.145
 He-0.130
 has-0.124
robe-0.124
Token.
Token ID13
Feature activation+9.94e-03

Pos logit contributions
ck+0.115
 breath+0.108
ited+0.099
 tug+0.092
away+0.089
Neg logit contributions
 She-0.057
 He-0.054
tr-0.052
 has-0.051
 living-0.051
Token Twe
Token ID24205
Feature activation-9.30e-02

Pos logit contributions
tr+0.024
 living+0.023
robe+0.023
box+0.023
boards+0.023
Neg logit contributions
ck-0.010
 breath-0.009
away-0.006
 swat-0.005
ited-0.005
Tokenety
Token ID2963
Feature activation-1.27e-02

Pos logit contributions
ck+0.195
 breath+0.190
 tug+0.157
ited+0.156
 contain+0.152
Neg logit contributions
 She-0.151
tr-0.139
 He-0.137
 has-0.132
robe-0.128
Token loved
Token ID6151
Feature activation-1.13e-03

Pos logit contributions
ck+0.032
 breath+0.028
ited+0.025
 tug+0.024
away+0.023
Neg logit contributions
 She-0.015
 There-0.015
tr-0.015
 How-0.014
 Why-0.014
Token a
Token ID257
Feature activation+2.35e-03

Pos logit contributions
robe+0.035
 He+0.035
 Did+0.034
tr+0.034
 She+0.034
Neg logit contributions
ck-0.042
 breath-0.038
 tug-0.031
ited-0.031
 swat-0.030
Token film
Token ID2646
Feature activation+2.40e-02

Pos logit contributions
 She+0.004
 He+0.004
 Why+0.004
 There+0.004
 Did+0.003
Neg logit contributions
ck-0.005
 breath-0.005
ited-0.004
 tug-0.004
away-0.004
Token camera
Token ID4676
Feature activation+2.06e-02

Pos logit contributions
 She+0.030
 Did+0.030
 There+0.030
 He+0.029
 has+0.029
Neg logit contributions
ck-0.053
 breath-0.050
away-0.041
ited-0.041
 contain-0.041
Token and
Token ID290
Feature activation-2.64e-02

Pos logit contributions
 Did+0.027
 There+0.026
 Why+0.026
 How+0.026
 Where+0.026
Neg logit contributions
ck-0.046
 breath-0.043
ited-0.036
away-0.035
 tug-0.035
Token asked
Token ID1965
Feature activation+1.48e-02

Pos logit contributions
ck+0.055
 breath+0.048
 tug+0.041
 swat+0.040
ited+0.039
Neg logit contributions
tr-0.041
robe-0.038
 Why-0.038
umm-0.037
 There-0.037
Token Twe
Token ID24205
Feature activation-1.14e-01

Pos logit contributions
tr+0.022
 She+0.021
 He+0.021
 has+0.021
 There+0.020
Neg logit contributions
ck-0.032
 breath-0.029
ited-0.025
away-0.024
 tug-0.023
Tokenety
Token ID2963
Feature activation-4.26e-02

Pos logit contributions
ck+0.249
 breath+0.246
 tug+0.202
ited+0.197
 contain+0.195
Neg logit contributions
 She-0.174
tr-0.163
 He-0.155
 Why-0.151
 has-0.149
Token if
Token ID611
Feature activation+3.33e-03

Pos logit contributions
ck+0.096
 breath+0.088
ited+0.075
away+0.073
 tug+0.071
Neg logit contributions
 She-0.058
tr-0.058
 He-0.056
 There-0.056
 has-0.055
Token she
Token ID673
Feature activation+1.72e-02

Pos logit contributions
tr+0.003
 She+0.003
 He+0.003
 has+0.003
 There+0.003
Neg logit contributions
ck-0.009
 breath-0.008
ited-0.007
 tug-0.007
away-0.007
Token could
Token ID714
Feature activation-8.93e-03

Pos logit contributions
 She+0.015
tr+0.014
 He+0.014
 There+0.013
 Why+0.013
Neg logit contributions
ck-0.048
 breath-0.044
ited-0.041
 tug-0.039
away-0.038
Token take
Token ID1011
Feature activation-9.65e-03

Pos logit contributions
ck+0.018
 breath+0.017
ited+0.015
 tug+0.013
away+0.013
Neg logit contributions
 She-0.015
 He-0.014
tr-0.014
 has-0.014
 There-0.014
Token to
Token ID284
Feature activation-2.35e-02

Pos logit contributions
ck+0.034
 breath+0.031
away+0.025
 tug+0.025
 swat+0.025
Neg logit contributions
tr-0.023
robe-0.022
 has-0.022
 Did-0.022
 There-0.022
Token never
Token ID1239
Feature activation-4.62e-02

Pos logit contributions
ck+0.050
 breath+0.046
ited+0.038
 tug+0.037
away+0.036
Neg logit contributions
 She-0.037
tr-0.036
 He-0.036
 There-0.036
 Did-0.035
Token try
Token ID1949
Feature activation-1.12e-03

Pos logit contributions
ck+0.096
 breath+0.087
ited+0.077
 tug+0.070
away+0.069
Neg logit contributions
 She-0.075
 He-0.072
tr-0.071
 has-0.071
 There-0.070
Token to
Token ID284
Feature activation-2.83e-02

Pos logit contributions
ck+0.003
 breath+0.002
ited+0.002
 tug+0.002
away+0.002
Neg logit contributions
 She-0.002
tr-0.002
 He-0.001
 There-0.001
 has-0.001
Token catch
Token ID4929
Feature activation-2.49e-02

Pos logit contributions
ck+0.058
ited+0.055
 breath+0.054
rig+0.048
illed+0.047
Neg logit contributions
 She-0.041
 sky-0.037
 He-0.037
 has-0.037
tr-0.036
Token Twe
Token ID24205
Feature activation-1.14e-01

Pos logit contributions
ck+0.051
 breath+0.045
ited+0.038
 tug+0.037
away+0.036
Neg logit contributions
 She-0.042
tr-0.041
 He-0.040
 has-0.040
 There-0.040
Tokenety
Token ID2963
Feature activation-2.18e-02

Pos logit contributions
ck+0.238
 breath+0.233
ited+0.192
 tug+0.191
 contain+0.182
Neg logit contributions
 She-0.179
tr-0.178
 He-0.165
 Why-0.161
 has-0.161
Token again
Token ID757
Feature activation-2.24e-02

Pos logit contributions
ck+0.054
 breath+0.050
ited+0.045
 tug+0.043
away+0.041
Neg logit contributions
 She-0.028
 He-0.026
tr-0.025
 has-0.025
 Why-0.024
Token.
Token ID13
Feature activation+1.93e-03

Pos logit contributions
ck+0.053
 breath+0.049
ited+0.044
 tug+0.041
away+0.040
Neg logit contributions
 She-0.030
 He-0.029
tr-0.028
 has-0.028
 There-0.027
Token From
Token ID3574
Feature activation-1.21e-03

Pos logit contributions
bridge+0.003
robe+0.003
tr+0.003
boards+0.003
pit+0.003
Neg logit contributions
ck-0.003
 breath-0.003
shadow-0.003
 swat-0.003
 agree-0.003
Token that
Token ID326
Feature activation+3.00e-02

Pos logit contributions
ck+0.003
 breath+0.003
ited+0.003
 tug+0.003
away+0.003
Neg logit contributions
 She-0.001
 He-0.001
 has-0.001
tr-0.001
 Why-0.001
Token a
Token ID257
Feature activation-3.62e-02

Pos logit contributions
ck+0.082
 breath+0.076
ited+0.070
away+0.066
 tug+0.064
Neg logit contributions
 She-0.035
 He-0.031
 There-0.029
tr-0.029
 How-0.028
Token little
Token ID1310
Feature activation-6.04e-02

Pos logit contributions
ck+0.085
 breath+0.076
ited+0.070
 contain+0.062
war+0.062
Neg logit contributions
 She-0.052
 He-0.050
 has-0.050
 living-0.049
tr-0.048
Token bird
Token ID6512
Feature activation-4.65e-02

Pos logit contributions
ck+0.161
 breath+0.145
ited+0.133
 tug+0.124
away+0.122
Neg logit contributions
 She-0.068
 He-0.063
 has-0.061
tr-0.061
 There-0.058
Tokenie
Token ID494
Feature activation-4.07e-02

Pos logit contributions
ck+0.102
 breath+0.097
ited+0.087
 tug+0.082
war+0.077
Neg logit contributions
 She-0.069
tr-0.065
 He-0.064
 living-0.063
 has-0.062
Token named
Token ID3706
Feature activation-2.69e-02

Pos logit contributions
ck+0.098
 breath+0.093
ited+0.087
 tug+0.079
war+0.078
Neg logit contributions
 She-0.050
 He-0.047
 living-0.047
tr-0.047
 has-0.046
Token Twe
Token ID24205
Feature activation-1.14e-01

Pos logit contributions
ck+0.066
 breath+0.060
ited+0.059
away+0.052
 tug+0.052
Neg logit contributions
 She-0.037
 He-0.035
 There-0.033
tr-0.032
 "-0.031
Tokenety
Token ID2963
Feature activation-4.62e-02

Pos logit contributions
 breath+0.270
ck+0.259
 contain+0.225
 tug+0.223
 fear+0.220
Neg logit contributions
 She-0.153
tr-0.147
 He-0.134
 has-0.125
 living-0.124
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
ck+0.116
 breath+0.110
ited+0.101
 tug+0.094
away+0.090
Neg logit contributions
 She-0.055
 He-0.052
tr-0.050
 living-0.050
 has-0.049
Token Twe
Token ID24205
Feature activation-9.25e-02

Pos logit contributions
tr+0.029
robe+0.028
 living+0.028
box+0.027
boards+0.027
Neg logit contributions
ck-0.012
 breath-0.011
 swat-0.007
away-0.007
 response-0.007
Tokenety
Token ID2963
Feature activation-1.34e-02

Pos logit contributions
ck+0.193
 breath+0.190
 tug+0.155
ited+0.155
 contain+0.150
Neg logit contributions
 She-0.149
tr-0.139
 He-0.138
 has-0.131
 Why-0.128
Token loved
Token ID6151
Feature activation+3.61e-04

Pos logit contributions
ck+0.034
 breath+0.030
ited+0.027
 tug+0.025
away+0.025
Neg logit contributions
 She-0.016
 There-0.015
tr-0.015
 Why-0.015
 How-0.015
Token and
Token ID290
Feature activation-4.50e-02

Pos logit contributions
ck+0.019
 breath+0.019
away+0.015
ited+0.014
 tug+0.014
Neg logit contributions
tr-0.016
 Did-0.015
 There-0.014
 has-0.014
 Why-0.014
Token created
Token ID2727
Feature activation+1.21e-02

Pos logit contributions
ck+0.084
 breath+0.078
 tug+0.059
ited+0.059
 swat+0.057
Neg logit contributions
tr-0.080
 She-0.077
robe-0.077
 Why-0.075
 There-0.075
Token a
Token ID257
Feature activation-1.20e-02

Pos logit contributions
 Did+0.022
 has+0.022
tr+0.022
 Why+0.022
robe+0.022
Neg logit contributions
ck-0.021
 breath-0.019
 tug-0.015
 fear-0.015
 swat-0.013
Token shelter
Token ID11772
Feature activation+1.15e-02

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.020
 tug+0.018
away+0.018
Neg logit contributions
 She-0.020
 He-0.019
tr-0.019
 has-0.018
 There-0.018
Token for
Token ID329
Feature activation-2.78e-02

Pos logit contributions
 Did+0.019
 She+0.018
 There+0.018
 He+0.018
 Why+0.018
Neg logit contributions
ck-0.023
 breath-0.022
ited-0.018
away-0.018
 tug-0.017
Token Twe
Token ID24205
Feature activation-1.13e-01

Pos logit contributions
ck+0.060
 breath+0.054
ited+0.050
away+0.045
 tug+0.045
Neg logit contributions
 She-0.043
 He-0.041
 There-0.040
 has-0.039
tr-0.039
Tokenety
Token ID2963
Feature activation-3.89e-02

Pos logit contributions
ck+0.253
 breath+0.252
 tug+0.214
 contain+0.208
ited+0.207
Neg logit contributions
 She-0.161
tr-0.156
 He-0.146
 has-0.134
 Did-0.133
Token.
Token ID13
Feature activation+2.48e-03

Pos logit contributions
ck+0.098
 breath+0.091
ited+0.082
 tug+0.077
away+0.076
Neg logit contributions
 She-0.047
tr-0.044
 He-0.043
 has-0.043
 living-0.041
Token Twe
Token ID24205
Feature activation-9.17e-02

Pos logit contributions
pit+0.005
tr+0.005
robe+0.005
bridge+0.005
 living+0.005
Neg logit contributions
ck-0.003
 breath-0.002
 tug-0.002
shadow-0.002
 err-0.002
Tokenety
Token ID2963
Feature activation-1.17e-02

Pos logit contributions
ck+0.188
 breath+0.181
ited+0.149
 tug+0.148
 contain+0.143
Neg logit contributions
 She-0.150
tr-0.142
 He-0.140
 has-0.135
 Why-0.133
Token felt
Token ID2936
Feature activation+6.07e-03

Pos logit contributions
ck+0.032
 breath+0.030
ited+0.027
away+0.026
 tug+0.026
Neg logit contributions
 She-0.011
 He-0.011
 has-0.010
tr-0.010
 There-0.010
Token dare
Token ID16498
Feature activation-3.83e-02

Pos logit contributions
ck+0.096
 breath+0.088
ited+0.076
 tug+0.068
away+0.064
Neg logit contributions
 She-0.103
 He-0.100
 has-0.096
tr-0.095
 There-0.094
Token to
Token ID284
Feature activation-2.32e-02

Pos logit contributions
ck+0.093
 breath+0.086
ited+0.081
 tug+0.072
away+0.071
Neg logit contributions
 She-0.050
 He-0.048
 Why-0.044
tr-0.044
 There-0.043
Token climb
Token ID12080
Feature activation-1.44e-02

Pos logit contributions
ited+0.050
ck+0.049
 breath+0.046
illed+0.042
rig+0.042
Neg logit contributions
 She-0.031
 He-0.028
 only-0.027
 can-0.027
 is-0.027
Token up
Token ID510
Feature activation-1.79e-02

Pos logit contributions
ck+0.033
 breath+0.030
ited+0.027
 tug+0.025
away+0.025
Neg logit contributions
 She-0.020
tr-0.019
 He-0.019
 There-0.019
 has-0.019
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
ck+0.042
 breath+0.037
ited+0.035
 tug+0.031
away+0.031
Neg logit contributions
 She-0.026
 He-0.024
 There-0.023
 has-0.023
tr-0.023
Token Twe
Token ID24205
Feature activation-1.13e-01

Pos logit contributions
ck+0.022
 breath+0.020
ited+0.018
 tug+0.016
away+0.016
Neg logit contributions
 She-0.018
 He-0.017
tr-0.016
 There-0.016
 has-0.016
Tokenety
Token ID2963
Feature activation-3.21e-02

Pos logit contributions
ck+0.237
 breath+0.228
ited+0.187
 tug+0.186
 contain+0.178
Neg logit contributions
 She-0.184
tr-0.180
 He-0.168
 has-0.162
 Why-0.162
Token's
Token ID338
Feature activation+6.11e-03

Pos logit contributions
ck+0.080
 breath+0.074
ited+0.067
 tug+0.063
away+0.061
Neg logit contributions
 She-0.040
 He-0.037
tr-0.037
 has-0.035
 There-0.035
Token nest
Token ID16343
Feature activation+9.81e-03

Pos logit contributions
 She+0.009
tr+0.009
 He+0.009
 has+0.008
 There+0.008
Neg logit contributions
ck-0.014
 breath-0.012
ited-0.011
 tug-0.010
away-0.010
Token.
Token ID13
Feature activation+6.88e-04

Pos logit contributions
 She+0.014
 He+0.013
tr+0.013
 has+0.012
 There+0.012
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.019
 tug-0.017
away-0.017
Token\n
Token ID198
Feature activation-2.31e-02

Pos logit contributions
tr+0.001
robe+0.001
 living+0.001
 has+0.001
bridge+0.001
Neg logit contributions
ck-0.002
 breath-0.001
ited-0.001
 tug-0.001
 agree-0.001
Token The
Token ID383
Feature activation+1.49e-02

Pos logit contributions
ck+0.002
 breath+0.002
 tug+0.002
 swat+0.001
 panic+0.001
Neg logit contributions
tr-0.001
robe-0.001
pit-0.001
 living-0.001
 has-0.001
Token cat
Token ID3797
Feature activation+1.25e-03

Pos logit contributions
 She+0.015
tr+0.014
 He+0.013
 has+0.013
 There+0.013
Neg logit contributions
ck-0.042
 breath-0.039
ited-0.036
away-0.034
 tug-0.033
Token wanted
Token ID2227
Feature activation+1.03e-02

Pos logit contributions
 She+0.001
 He+0.001
tr+0.001
 There+0.001
 Why+0.001
Neg logit contributions
ck-0.003
 breath-0.003
ited-0.003
 tug-0.003
 swat-0.003
Token to
Token ID284
Feature activation-2.76e-02

Pos logit contributions
 She+0.012
tr+0.012
 He+0.012
 has+0.011
 There+0.011
Neg logit contributions
ck-0.026
 breath-0.024
ited-0.021
 tug-0.020
away-0.020
Token eat
Token ID4483
Feature activation-5.01e-03

Pos logit contributions
ck+0.056
ited+0.054
 breath+0.050
illed+0.047
rig+0.045
Neg logit contributions
 She-0.046
 He-0.042
tr-0.040
 There-0.037
 How-0.037
Token Twe
Token ID24205
Feature activation-1.13e-01

Pos logit contributions
ck+0.010
 breath+0.009
ited+0.008
 tug+0.008
away+0.007
Neg logit contributions
 She-0.008
tr-0.008
 has-0.008
 There-0.008
 He-0.008
Tokenety
Token ID2963
Feature activation-2.91e-02

Pos logit contributions
ck+0.239
 breath+0.234
 tug+0.192
ited+0.187
 contain+0.185
Neg logit contributions
 She-0.178
tr-0.173
 He-0.159
 has-0.156
 Did-0.154
Token!
Token ID0
Feature activation-2.00e-03

Pos logit contributions
ck+0.078
 breath+0.072
ited+0.066
 tug+0.063
 response+0.061
Neg logit contributions
 She-0.031
 He-0.028
tr-0.027
 has-0.027
 living-0.026
Token\n
Token ID198
Feature activation-2.68e-02

Pos logit contributions
ck+0.004
 breath+0.004
 tug+0.003
away+0.003
 swat+0.003
Neg logit contributions
tr-0.003
robe-0.003
 living-0.003
bridge-0.003
 has-0.003
Token\n
Token ID198
Feature activation-3.51e-02

Pos logit contributions
ck+0.085
 breath+0.081
ited+0.075
 tug+0.070
away+0.069
Neg logit contributions
 She-0.016
 He-0.014
 There-0.012
 has-0.011
tr-0.011
TokenTwe
Token ID32665
Feature activation-9.44e-02

Pos logit contributions
ck+0.062
 breath+0.056
ited+0.047
 tug+0.042
away+0.041
Neg logit contributions
 She-0.069
 He-0.067
tr-0.066
 has-0.065
 There-0.065
Token friend
Token ID1545
Feature activation-8.76e-03

Pos logit contributions
 She+0.029
 Did+0.028
 Why+0.028
 has+0.028
 He+0.028
Neg logit contributions
ck-0.025
 breath-0.024
ited-0.018
 tug-0.018
 swat-0.017
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
ck+0.021
 breath+0.019
ited+0.017
 tug+0.016
away+0.015
Neg logit contributions
 She-0.012
tr-0.011
 He-0.011
 has-0.011
 There-0.011
Token a
Token ID257
Feature activation-1.08e-02

Pos logit contributions
 She+0.021
 He+0.020
tr+0.019
 There+0.019
 Why+0.019
Neg logit contributions
ck-0.036
 breath-0.033
ited-0.030
away-0.027
 tug-0.027
Token bird
Token ID6512
Feature activation-2.83e-02

Pos logit contributions
ck+0.024
 breath+0.022
ited+0.019
 tug+0.018
away+0.018
Neg logit contributions
 She-0.016
 He-0.016
tr-0.015
 has-0.015
 There-0.015
Token named
Token ID3706
Feature activation-1.18e-02

Pos logit contributions
ck+0.057
 breath+0.052
ited+0.045
 tug+0.041
away+0.041
Neg logit contributions
 She-0.048
 He-0.046
tr-0.046
 There-0.045
 has-0.045
Token Twe
Token ID24205
Feature activation-1.13e-01

Pos logit contributions
ck+0.025
 breath+0.022
ited+0.019
 tug+0.018
away+0.018
Neg logit contributions
 She-0.019
tr-0.018
 He-0.018
 has-0.018
 There-0.018
Tokenety
Token ID2963
Feature activation-3.98e-02

Pos logit contributions
ck+0.227
 breath+0.212
ited+0.177
 tug+0.169
 contain+0.160
Neg logit contributions
 She-0.193
 He-0.184
tr-0.183
 has-0.174
 Why-0.173
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
ck+0.092
 breath+0.082
ited+0.072
 tug+0.068
away+0.067
Neg logit contributions
tr-0.055
 She-0.054
 He-0.052
robe-0.051
 There-0.051
Token got
Token ID1392
Feature activation-5.66e-03

Pos logit contributions
tr+0.021
robe+0.021
umm+0.020
 She+0.019
 sky+0.019
Neg logit contributions
ck-0.032
 breath-0.029
ited-0.025
 tug-0.025
 swat-0.024
Token stuck
Token ID7819
Feature activation-2.10e-02

Pos logit contributions
ck+0.013
 breath+0.012
ited+0.012
away+0.011
rig+0.010
Neg logit contributions
 She-0.008
 He-0.007
 Why-0.007
 How-0.007
 has-0.007
Token in
Token ID287
Feature activation-3.05e-02

Pos logit contributions
ck+0.059
 breath+0.053
ited+0.052
 tug+0.047
 response+0.047
Neg logit contributions
 She-0.019
 He-0.018
tr-0.017
 living-0.017
 has-0.016
Token seeds
Token ID11904
Feature activation+1.49e-02

Pos logit contributions
 She+0.012
 He+0.012
 Why+0.011
 Did+0.011
tr+0.011
Neg logit contributions
ck-0.014
 breath-0.013
ited-0.011
 swat-0.010
 tug-0.010
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
 Did+0.021
 Why+0.021
tr+0.021
 How+0.020
 She+0.020
Neg logit contributions
ck-0.034
 breath-0.032
ited-0.027
 tug-0.026
away-0.025
Token gave
Token ID2921
Feature activation-6.28e-03

Pos logit contributions
ck+0.067
 breath+0.062
ited+0.055
 tug+0.051
away+0.050
Neg logit contributions
 She-0.041
 He-0.039
tr-0.039
 There-0.038
 has-0.038
Token them
Token ID606
Feature activation-5.00e-03

Pos logit contributions
ck+0.012
 breath+0.011
ited+0.009
away+0.008
 tug+0.008
Neg logit contributions
tr-0.011
 She-0.011
 There-0.011
 has-0.011
 Why-0.011
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
ck+0.014
 breath+0.012
ited+0.012
 tug+0.011
 swat+0.011
Neg logit contributions
 She-0.005
 He-0.005
 living-0.004
 has-0.004
tr-0.004
Token Twe
Token ID24205
Feature activation-1.13e-01

Pos logit contributions
ck+0.029
 breath+0.026
ited+0.023
 tug+0.021
away+0.021
Neg logit contributions
 She-0.024
 He-0.023
tr-0.023
 has-0.022
 There-0.022
Tokenety
Token ID2963
Feature activation-3.72e-02

Pos logit contributions
 breath+0.247
ck+0.244
 tug+0.205
 contain+0.204
ited+0.197
Neg logit contributions
 She-0.165
tr-0.159
 He-0.151
 has-0.141
 Why-0.141
Token.
Token ID13
Feature activation+6.82e-03

Pos logit contributions
ck+0.094
 breath+0.086
ited+0.079
 tug+0.074
away+0.071
Neg logit contributions
 She-0.046
 He-0.043
tr-0.042
 has-0.041
 living-0.040
Token\n
Token ID198
Feature activation-2.19e-02

Pos logit contributions
tr+0.011
pit+0.011
robe+0.010
bridge+0.010
 has+0.010
Neg logit contributions
ck-0.012
 breath-0.010
shadow-0.009
 tug-0.009
 agree-0.008
Token\n
Token ID198
Feature activation-3.33e-02

Pos logit contributions
ck+0.067
 breath+0.063
ited+0.058
 tug+0.054
away+0.054
Neg logit contributions
 She-0.015
 He-0.014
tr-0.013
 There-0.012
 has-0.012
TokenTwe
Token ID32665
Feature activation-9.06e-02

Pos logit contributions
ck+0.059
 breath+0.054
ited+0.045
 tug+0.041
away+0.040
Neg logit contributions
 She-0.065
 He-0.063
tr-0.061
 has-0.060
 There-0.060
Token a
Token ID257
Feature activation-3.16e-02

Pos logit contributions
ck+0.074
 breath+0.069
ited+0.064
away+0.060
 tug+0.058
Neg logit contributions
 She-0.032
 He-0.029
 There-0.027
 living-0.026
 sky-0.026
Token little
Token ID1310
Feature activation-5.35e-02

Pos logit contributions
ck+0.073
 breath+0.065
ited+0.060
 contain+0.053
away+0.053
Neg logit contributions
 She-0.046
 He-0.045
 has-0.044
 living-0.043
tr-0.043
Token bird
Token ID6512
Feature activation-3.99e-02

Pos logit contributions
ck+0.141
 breath+0.127
ited+0.116
 tug+0.108
away+0.107
Neg logit contributions
 She-0.061
 He-0.057
 has-0.055
tr-0.055
 There-0.053
Tokenie
Token ID494
Feature activation-3.40e-02

Pos logit contributions
ck+0.087
 breath+0.082
ited+0.073
 tug+0.069
 response+0.065
Neg logit contributions
 She-0.060
tr-0.057
 He-0.056
 living-0.055
 has-0.055
Token named
Token ID3706
Feature activation-2.67e-02

Pos logit contributions
ck+0.082
 breath+0.078
ited+0.072
 tug+0.066
 response+0.064
Neg logit contributions
 She-0.042
 living-0.040
 He-0.040
tr-0.040
 has-0.040
Token Twe
Token ID24205
Feature activation-1.12e-01

Pos logit contributions
ck+0.066
 breath+0.061
ited+0.060
away+0.053
 tug+0.053
Neg logit contributions
 She-0.036
 He-0.033
 There-0.032
tr-0.031
 "-0.031
Tokenety
Token ID2963
Feature activation-4.03e-02

Pos logit contributions
 breath+0.268
ck+0.255
 contain+0.224
 tug+0.222
 fear+0.218
Neg logit contributions
 She-0.146
tr-0.143
 He-0.127
 living-0.120
 has-0.120
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
ck+0.100
 breath+0.094
ited+0.085
 tug+0.080
away+0.077
Neg logit contributions
 She-0.050
 He-0.047
tr-0.046
 has-0.045
 living-0.045
Token Twe
Token ID24205
Feature activation-9.23e-02

Pos logit contributions
tr+0.036
 living+0.035
robe+0.035
boards+0.035
box+0.035
Neg logit contributions
ck-0.015
 breath-0.013
ir-0.008
 swat-0.008
 response-0.008
Tokenety
Token ID2963
Feature activation-1.03e-02

Pos logit contributions
ck+0.191
 breath+0.187
ited+0.153
 tug+0.152
 contain+0.147
Neg logit contributions
 She-0.150
tr-0.140
 He-0.139
 has-0.132
 Why-0.129
Token lived
Token ID5615
Feature activation+3.16e-04

Pos logit contributions
ck+0.026
 breath+0.023
ited+0.021
 tug+0.020
away+0.020
Neg logit contributions
 She-0.012
 There-0.012
tr-0.011
 He-0.011
 Why-0.011
Token bird
Token ID6512
Feature activation-6.58e-03

Pos logit contributions
ck+0.016
 breath+0.015
ited+0.013
 tug+0.012
away+0.012
Neg logit contributions
 She-0.008
tr-0.007
 has-0.007
 He-0.007
 sky-0.007
Token and
Token ID290
Feature activation-4.03e-02

Pos logit contributions
ck+0.017
 breath+0.016
ited+0.014
 tug+0.014
away+0.013
Neg logit contributions
 She-0.008
tr-0.007
 He-0.007
 has-0.007
 living-0.006
Token they
Token ID484
Feature activation-1.04e-02

Pos logit contributions
ck+0.085
 breath+0.077
 tug+0.063
ited+0.061
 swat+0.060
Neg logit contributions
 She-0.066
 He-0.065
 There-0.064
tr-0.064
 Why-0.063
Token named
Token ID3706
Feature activation-2.14e-02

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.020
 tug+0.019
away+0.019
Neg logit contributions
 She-0.014
 He-0.013
tr-0.013
 has-0.012
 There-0.012
Token it
Token ID340
Feature activation+1.77e-02

Pos logit contributions
ck+0.046
 breath+0.041
ited+0.039
 tug+0.035
away+0.034
Neg logit contributions
 She-0.034
 He-0.032
 has-0.031
tr-0.031
 There-0.031
Token Twe
Token ID24205
Feature activation-1.12e-01

Pos logit contributions
 She+0.025
 He+0.025
 Why+0.025
 Did+0.025
 There+0.025
Neg logit contributions
ck-0.039
 breath-0.034
ited-0.028
 swat-0.028
 tug-0.027
Tokenety
Token ID2963
Feature activation-3.24e-02

Pos logit contributions
ck+0.257
 breath+0.254
 contain+0.213
 tug+0.210
ited+0.205
Neg logit contributions
tr-0.160
 She-0.157
 He-0.138
 Did-0.136
 has-0.136
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
ck+0.070
 breath+0.063
ited+0.053
 tug+0.051
away+0.050
Neg logit contributions
 She-0.048
 He-0.046
 There-0.045
tr-0.045
 Why-0.045
Token They
Token ID1119
Feature activation+1.22e-03

Pos logit contributions
tr+0.019
pit+0.019
robe+0.019
 has+0.019
box+0.018
Neg logit contributions
ck-0.022
 err-0.019
 breath-0.018
shadow-0.018
 tug-0.017
Token fed
Token ID11672
Feature activation-3.16e-03

Pos logit contributions
 She+0.002
 He+0.001
tr+0.001
 has+0.001
 There+0.001
Neg logit contributions
ck-0.003
 breath-0.003
ited-0.002
away-0.002
 tug-0.002
Token Twe
Token ID24205
Feature activation-9.66e-02

Pos logit contributions
ck+0.006
 breath+0.006
ited+0.005
away+0.004
 tug+0.004
Neg logit contributions
 She-0.005
tr-0.005
 There-0.005
 He-0.005
 Why-0.005
Token there
Token ID612
Feature activation+3.64e-02

Pos logit contributions
ck+0.013
 breath+0.012
ited+0.011
 tug+0.010
away+0.010
Neg logit contributions
 She-0.008
tr-0.008
 He-0.007
 Did-0.007
robe-0.007
Token was
Token ID373
Feature activation-2.93e-02

Pos logit contributions
tr+0.038
 She+0.035
 Why+0.034
robe+0.033
 Did+0.033
Neg logit contributions
ck-0.093
 breath-0.086
ited-0.076
 tug-0.075
 behold-0.071
Token a
Token ID257
Feature activation-3.60e-02

Pos logit contributions
ck+0.081
 breath+0.075
ited+0.069
away+0.065
 tug+0.063
Neg logit contributions
 She-0.034
 He-0.030
 There-0.029
 How-0.028
 living-0.027
Token bird
Token ID6512
Feature activation-4.71e-02

Pos logit contributions
ck+0.084
 breath+0.075
ited+0.069
 contain+0.062
war+0.061
Neg logit contributions
 She-0.052
 He-0.050
 has-0.050
 living-0.048
tr-0.048
Token named
Token ID3706
Feature activation-2.81e-02

Pos logit contributions
ck+0.104
 breath+0.098
ited+0.087
 tug+0.082
away+0.078
Neg logit contributions
 She-0.070
tr-0.066
 He-0.065
 has-0.063
 living-0.063
Token Twe
Token ID24205
Feature activation-1.12e-01

Pos logit contributions
ck+0.072
 breath+0.066
ited+0.066
away+0.059
 tug+0.058
Neg logit contributions
 She-0.036
 He-0.033
 There-0.033
 "-0.031
tr-0.030
Tokenety
Token ID2963
Feature activation-4.66e-02

Pos logit contributions
 breath+0.256
ck+0.245
 tug+0.207
 contain+0.206
ited+0.205
Neg logit contributions
 She-0.165
tr-0.156
 He-0.145
 has-0.138
 living-0.133
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
ck+0.116
 breath+0.110
ited+0.101
 tug+0.094
 response+0.090
Neg logit contributions
 She-0.056
 He-0.053
tr-0.052
 has-0.051
 living-0.050
Token Twe
Token ID24205
Feature activation-9.09e-02

Pos logit contributions
tr+0.026
 living+0.025
box+0.025
robe+0.025
boards+0.024
Neg logit contributions
ck-0.011
 breath-0.010
away-0.006
 response-0.006
 swat-0.006
Tokenety
Token ID2963
Feature activation-1.30e-02

Pos logit contributions
ck+0.182
 breath+0.174
ited+0.145
 tug+0.140
 contain+0.133
Neg logit contributions
 She-0.156
 He-0.146
tr-0.145
 has-0.140
 There-0.137
Token lived
Token ID5615
Feature activation-5.52e-03

Pos logit contributions
ck+0.033
 breath+0.029
ited+0.026
 tug+0.024
away+0.024
Neg logit contributions
 She-0.015
 There-0.015
tr-0.015
 Why-0.014
 How-0.014
Token a
Token ID257
Feature activation+1.23e-03

Pos logit contributions
ck+0.041
 breath+0.037
ited+0.036
rig+0.032
away+0.031
Neg logit contributions
 She-0.027
 He-0.025
 There-0.024
 sky-0.023
 has-0.023
Token little
Token ID1310
Feature activation-2.61e-02

Pos logit contributions
tr+0.002
 He+0.002
 She+0.002
 has+0.002
 sky+0.002
Neg logit contributions
ck-0.003
 breath-0.002
ited-0.002
illed-0.002
ir-0.002
Token bird
Token ID6512
Feature activation-3.45e-02

Pos logit contributions
ck+0.056
 breath+0.049
ited+0.045
away+0.039
 tug+0.039
Neg logit contributions
 She-0.041
 He-0.040
tr-0.039
 has-0.038
 There-0.038
Tokenie
Token ID494
Feature activation-2.58e-02

Pos logit contributions
ck+0.084
 breath+0.078
ited+0.070
 tug+0.066
away+0.063
Neg logit contributions
 She-0.044
tr-0.042
 He-0.042
 has-0.040
 Why-0.040
Token like
Token ID588
Feature activation+3.59e-03

Pos logit contributions
ck+0.060
 breath+0.055
ited+0.049
 tug+0.046
away+0.045
Neg logit contributions
 She-0.035
 He-0.033
tr-0.033
 There-0.032
 has-0.032
Token Twe
Token ID24205
Feature activation-1.12e-01

Pos logit contributions
 She+0.006
tr+0.006
 He+0.005
 has+0.005
 There+0.005
Neg logit contributions
ck-0.008
 breath-0.007
ited-0.006
 tug-0.006
away-0.005
Tokenety
Token ID2963
Feature activation-3.53e-02

Pos logit contributions
 breath+0.247
ck+0.246
 contain+0.206
 tug+0.204
ited+0.197
Neg logit contributions
 She-0.163
tr-0.158
 He-0.146
 has-0.139
 Why-0.139
Token.
Token ID13
Feature activation-4.57e-03

Pos logit contributions
ck+0.086
 breath+0.079
ited+0.071
 tug+0.067
away+0.065
Neg logit contributions
 She-0.046
 He-0.043
tr-0.042
 has-0.042
 There-0.041
Token So
Token ID1406
Feature activation-1.66e-03

Pos logit contributions
ck+0.010
 breath+0.009
 agree+0.008
ited+0.007
 tug+0.007
Neg logit contributions
robe-0.006
tr-0.006
 living-0.005
bridge-0.005
 has-0.005
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
ck+0.003
 breath+0.003
 tug+0.003
away+0.002
ited+0.002
Neg logit contributions
robe-0.003
tr-0.003
 Why-0.003
boards-0.003
 living-0.003
Token Twe
Token ID24205
Feature activation-9.39e-02

Pos logit contributions
tr+0.017
 She+0.016
robe+0.016
 has+0.016
 Why+0.016
Neg logit contributions
ck-0.021
 breath-0.019
ited-0.016
 tug-0.016
away-0.015
Token,
Token ID11
Feature activation+1.67e-02

Pos logit contributions
ck+0.010
 breath+0.009
ited+0.008
 tug+0.007
away+0.007
Neg logit contributions
 She-0.005
 He-0.005
tr-0.005
 has-0.005
 There-0.005
Token a
Token ID257
Feature activation-2.59e-03

Pos logit contributions
 She+0.022
 He+0.021
 There+0.020
tr+0.019
 Why+0.019
Neg logit contributions
ck-0.042
 breath-0.038
ited-0.036
away-0.033
 tug-0.031
Token little
Token ID1310
Feature activation-2.75e-02

Pos logit contributions
ck+0.005
 breath+0.005
ited+0.004
 tug+0.004
away+0.004
Neg logit contributions
 She-0.004
 He-0.004
tr-0.004
 There-0.004
 has-0.004
Token bird
Token ID6512
Feature activation-2.63e-02

Pos logit contributions
ck+0.068
 breath+0.063
ited+0.056
 tug+0.053
away+0.052
Neg logit contributions
 She-0.034
 He-0.032
tr-0.032
 has-0.031
 There-0.031
Token named
Token ID3706
Feature activation-6.97e-03

Pos logit contributions
ck+0.048
 breath+0.043
ited+0.036
 tug+0.033
away+0.032
Neg logit contributions
 She-0.050
 He-0.048
tr-0.048
 There-0.047
 has-0.047
Token Twe
Token ID24205
Feature activation-1.12e-01

Pos logit contributions
ck+0.014
 breath+0.013
 tug+0.011
ited+0.011
 swat+0.010
Neg logit contributions
tr-0.011
 She-0.011
 has-0.010
 He-0.010
robe-0.010
Tokenety
Token ID2963
Feature activation-3.74e-02

Pos logit contributions
ck+0.236
 breath+0.227
ited+0.188
 tug+0.183
 contain+0.179
Neg logit contributions
 She-0.177
tr-0.174
 He-0.166
 Did-0.158
 has-0.158
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
ck+0.085
 breath+0.076
ited+0.066
 tug+0.063
away+0.061
Neg logit contributions
tr-0.054
 She-0.053
robe-0.051
 He-0.050
 There-0.050
Token saw
Token ID2497
Feature activation+1.99e-02

Pos logit contributions
tr+0.030
robe+0.030
umm+0.029
 sky+0.027
 She+0.027
Neg logit contributions
ck-0.038
 breath-0.033
ited-0.030
 tug-0.029
 swat-0.028
Token him
Token ID683
Feature activation-1.13e-02

Pos logit contributions
tr+0.040
robe+0.038
 has+0.036
umm+0.036
 sky+0.035
Neg logit contributions
ck-0.035
 breath-0.029
 tug-0.023
away-0.023
 swat-0.022
Token struggling
Token ID9648
Feature activation-1.03e-02

Pos logit contributions
ck+0.026
 breath+0.024
ited+0.022
away+0.020
 tug+0.019
Neg logit contributions
 She-0.016
 He-0.015
tr-0.014
 has-0.014
 living-0.014
Token was
Token ID373
Feature activation+9.56e-03

Pos logit contributions
 She+0.029
 There+0.027
tr+0.027
 He+0.027
 Why+0.026
Neg logit contributions
ck-0.088
 breath-0.081
ited-0.073
 tug-0.071
 swat-0.068
Token so
Token ID523
Feature activation-3.64e-02

Pos logit contributions
 She+0.014
tr+0.014
 He+0.014
 There+0.013
 has+0.013
Neg logit contributions
ck-0.021
 breath-0.020
ited-0.017
 tug-0.016
 swat-0.016
Token tiny
Token ID7009
Feature activation-3.91e-02

Pos logit contributions
ck+0.079
 breath+0.072
ited+0.064
 tug+0.058
away+0.058
Neg logit contributions
 She-0.061
 He-0.057
 has-0.054
tr-0.054
 There-0.054
Token and
Token ID290
Feature activation-3.36e-02

Pos logit contributions
 breath+0.074
ck+0.074
 tug+0.058
away+0.058
 swat+0.055
Neg logit contributions
tr-0.066
 Did-0.063
robe-0.062
 He-0.059
 There-0.058
Token it
Token ID340
Feature activation+3.57e-02

Pos logit contributions
ck+0.074
 breath+0.068
ited+0.060
 tug+0.056
 swat+0.054
Neg logit contributions
 She-0.050
tr-0.048
 He-0.047
 There-0.047
 Why-0.046
Token looked
Token ID3114
Feature activation+4.42e-02

Pos logit contributions
 She+0.047
 He+0.044
tr+0.044
 There+0.043
 Why+0.042
Neg logit contributions
ck-0.086
 breath-0.079
ited-0.070
 tug-0.067
 swat-0.064
Token like
Token ID588
Feature activation+1.49e-02

Pos logit contributions
 She+0.062
 He+0.060
 has+0.058
 Why+0.056
 There+0.056
Neg logit contributions
ck-0.104
 breath-0.095
ited-0.084
away-0.078
 tug-0.077
Token it
Token ID340
Feature activation+2.88e-02

Pos logit contributions
tr+0.024
 She+0.023
 He+0.022
 Did+0.022
 has+0.022
Neg logit contributions
ck-0.030
 breath-0.029
 tug-0.023
ited-0.023
 response-0.022
Token was
Token ID373
Feature activation+8.59e-03

Pos logit contributions
 She+0.029
 He+0.027
tr+0.027
 There+0.027
 Did+0.027
Neg logit contributions
ck-0.077
 breath-0.071
ited-0.064
 tug-0.061
 swat-0.059
Token going
Token ID1016
Feature activation+1.68e-03

Pos logit contributions
 She+0.013
 He+0.012
tr+0.012
 There+0.012
 has+0.012
Neg logit contributions
ck-0.019
 breath-0.018
ited-0.015
 tug-0.015
away-0.014
Token to
Token ID284
Feature activation-7.01e-03

Pos logit contributions
 She+0.002
tr+0.002
 He+0.002
 There+0.002
 has+0.002
Neg logit contributions
ck-0.004
 breath-0.004
ited-0.003
away-0.003
 tug-0.003
Token know
Token ID760
Feature activation+2.24e-02

Pos logit contributions
ck+0.090
 breath+0.084
ited+0.075
 tug+0.065
away+0.063
Neg logit contributions
 She-0.089
 He-0.087
 has-0.084
tr-0.082
 There-0.081
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
 She+0.032
tr+0.031
 He+0.031
 has+0.030
 There+0.030
Neg logit contributions
ck-0.051
 breath-0.046
ited-0.039
 tug-0.038
away-0.037
Token big
Token ID1263
Feature activation+8.95e-03

Pos logit contributions
 She+0.026
 He+0.025
tr+0.025
 has+0.024
 There+0.024
Neg logit contributions
ck-0.045
 breath-0.041
ited-0.037
 tug-0.034
away-0.034
Token bird
Token ID6512
Feature activation+1.15e-02

Pos logit contributions
 She+0.010
 has+0.010
 He+0.010
tr+0.010
 sky+0.010
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.019
 tug-0.018
away-0.018
Token.
Token ID13
Feature activation+3.50e-02

Pos logit contributions
 She+0.011
 He+0.010
tr+0.010
 There+0.010
 Did+0.010
Neg logit contributions
ck-0.032
 breath-0.029
ited-0.026
 tug-0.025
away-0.024
Token She
Token ID1375
Feature activation+4.02e-02

Pos logit contributions
 living+0.024
tr+0.023
robe+0.022
rob+0.021
 has+0.020
Neg logit contributions
ck-0.088
 breath-0.087
 fear-0.078
 tug-0.077
away-0.076
Token did
Token ID750
Feature activation+1.03e-02

Pos logit contributions
 She+0.053
 He+0.050
tr+0.050
 There+0.049
 Why+0.048
Neg logit contributions
ck-0.097
 breath-0.088
ited-0.079
 tug-0.075
away-0.072
Token not
Token ID407
Feature activation-4.95e-02

Pos logit contributions
 She+0.011
 has+0.010
 He+0.010
 Why+0.010
 How+0.010
Neg logit contributions
ck-0.026
 breath-0.025
ited-0.024
away-0.021
 tug-0.021
Token want
Token ID765
Feature activation+1.56e-03

Pos logit contributions
ck+0.094
 breath+0.089
ited+0.082
 err+0.071
charged+0.070
Neg logit contributions
 She-0.083
 He-0.081
 has-0.079
tr-0.075
 Why-0.075
Token to
Token ID284
Feature activation-3.29e-02

Pos logit contributions
 She+0.002
 He+0.002
tr+0.002
 There+0.002
 has+0.002
Neg logit contributions
ck-0.004
 breath-0.004
ited-0.003
 tug-0.003
away-0.003
Token leave
Token ID2666
Feature activation-5.71e-03

Pos logit contributions
ited+0.069
ck+0.068
 breath+0.064
rig+0.062
illed+0.061
Neg logit contributions
 She-0.045
 He-0.040
 has-0.039
 is-0.038
 only-0.038
Token and
Token ID290
Feature activation-5.92e-02

Pos logit contributions
 Did+0.002
 Where+0.002
tr+0.002
 How+0.002
 Why+0.002
Neg logit contributions
ck-0.002
 breath-0.002
away-0.002
 agree-0.002
ited-0.002
Token queens
Token ID38771
Feature activation-2.13e-02

Pos logit contributions
ck+0.136
 breath+0.125
ited+0.110
 tug+0.103
away+0.102
Neg logit contributions
 She-0.084
tr-0.079
 He-0.079
 There-0.077
 Why-0.076
Token in
Token ID287
Feature activation-3.72e-02

Pos logit contributions
ck+0.043
 breath+0.039
 believe+0.031
away+0.031
ited+0.031
Neg logit contributions
 Why-0.036
 Did-0.036
 How-0.036
tr-0.035
 Where-0.035
Token their
Token ID511
Feature activation+7.73e-03

Pos logit contributions
ck+0.084
 breath+0.073
ited+0.070
 swat+0.063
 tug+0.062
Neg logit contributions
 She-0.056
 has-0.054
 He-0.053
 living-0.053
 sky-0.053
Token palace
Token ID20562
Feature activation+1.72e-02

Pos logit contributions
 She+0.008
 He+0.008
tr+0.008
 Why+0.008
 Did+0.008
Neg logit contributions
ck-0.020
 breath-0.019
ited-0.016
away-0.016
 tug-0.016
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
 She+0.027
 Why+0.027
 Did+0.027
 There+0.027
 He+0.026
Neg logit contributions
ck-0.035
 breath-0.033
ited-0.027
away-0.026
 agree-0.025
Token And
Token ID843
Feature activation-1.52e-02

Pos logit contributions
tr+0.034
robe+0.031
boards+0.030
pit+0.029
 Did+0.028
Neg logit contributions
ck-0.085
 breath-0.077
 common-0.073
 tug-0.070
 agree-0.069
Token they
Token ID484
Feature activation+7.95e-03

Pos logit contributions
ck+0.032
 breath+0.029
 tug+0.024
 believe+0.024
 contain+0.023
Neg logit contributions
tr-0.024
 She-0.023
 He-0.022
 There-0.022
robe-0.022
Token were
Token ID547
Feature activation+1.74e-02

Pos logit contributions
 There+0.010
 She+0.010
tr+0.010
 Why+0.010
 He+0.010
Neg logit contributions
ck-0.019
 breath-0.017
ited-0.016
 believe-0.015
 tug-0.015
Token very
Token ID845
Feature activation-3.17e-02

Pos logit contributions
 She+0.029
tr+0.027
 He+0.027
 There+0.027
 has+0.026
Neg logit contributions
ck-0.036
 breath-0.033
ited-0.028
 tug-0.026
away-0.026
Token happy
Token ID3772
Feature activation-3.72e-02

Pos logit contributions
ck+0.060
 breath+0.055
ited+0.049
 tug+0.044
 swat+0.043
Neg logit contributions
 She-0.058
 has-0.056
 He-0.055
 living-0.053
 There-0.052
TokenOnce
Token ID7454
Feature activation-4.12e-03

Pos logit contributions
ck+0.014
 breath+0.013
ited+0.012
 tug+0.011
away+0.011
Neg logit contributions
tr-0.005
 She-0.005
 Did-0.004
 has-0.004
 Why-0.004
Token upon
Token ID2402
Feature activation-9.30e-03

Pos logit contributions
ck+0.009
 breath+0.008
ited+0.007
 tug+0.006
 swat+0.006
Neg logit contributions
 She-0.007
 He-0.006
 has-0.006
 There-0.006
tr-0.006
Token a
Token ID257
Feature activation-4.16e-02

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.021
away+0.020
 swat+0.019
Neg logit contributions
 She-0.011
 He-0.010
 has-0.009
 There-0.009
 living-0.009
Token time
Token ID640
Feature activation-9.16e-03

Pos logit contributions
ck+0.099
 breath+0.088
 swat+0.076
away+0.075
shadow+0.074
Neg logit contributions
 She-0.058
 has-0.056
 He-0.054
tr-0.053
 There-0.053
Token,
Token ID11
Feature activation-4.91e-03

Pos logit contributions
ck+0.024
 breath+0.022
ited+0.020
 tug+0.018
away+0.018
Neg logit contributions
 She-0.011
 He-0.010
tr-0.010
 There-0.010
 has-0.010
Token there
Token ID612
Feature activation+3.84e-02

Pos logit contributions
ck+0.012
 breath+0.011
ited+0.010
 tug+0.009
away+0.009
Neg logit contributions
 She-0.006
 He-0.006
tr-0.006
 has-0.006
 There-0.006
Token was
Token ID373
Feature activation-2.53e-02

Pos logit contributions
tr+0.033
 She+0.032
 He+0.030
 Why+0.029
 There+0.029
Neg logit contributions
ck-0.106
 breath-0.099
ited-0.089
 tug-0.087
away-0.083
Token a
Token ID257
Feature activation-3.31e-02

Pos logit contributions
ck+0.070
 breath+0.064
ited+0.061
away+0.057
 tug+0.055
Neg logit contributions
 She-0.029
 He-0.025
 There-0.025
 living-0.024
 sky-0.024
Token little
Token ID1310
Feature activation-5.80e-02

Pos logit contributions
ck+0.077
 breath+0.069
ited+0.063
 contain+0.056
war+0.056
Neg logit contributions
 She-0.048
 He-0.046
 has-0.046
 living-0.045
 "-0.044
Token boy
Token ID2933
Feature activation-4.96e-02

Pos logit contributions
ck+0.157
 breath+0.141
ited+0.130
 tug+0.120
 contain+0.119
Neg logit contributions
 She-0.064
 He-0.059
 has-0.057
tr-0.055
 sky-0.055
Token named
Token ID3706
Feature activation-2.23e-02

Pos logit contributions
ck+0.121
 breath+0.116
ited+0.108
 contain+0.098
war+0.098
Neg logit contributions
 She-0.063
 living-0.061
 He-0.058
 has-0.056
tr-0.056
Token!"
Token ID2474
Feature activation+2.84e-03

Pos logit contributions
ck+0.018
 breath+0.017
away+0.015
ited+0.014
 tug+0.012
Neg logit contributions
tr-0.025
robe-0.022
 Did-0.022
 Where-0.022
 How-0.021
Token\n
Token ID198
Feature activation-2.36e-02

Pos logit contributions
tr+0.005
 has+0.005
robe+0.005
 living+0.004
 Why+0.004
Neg logit contributions
ck-0.005
 breath-0.005
 tug-0.004
ited-0.004
 swat-0.004
Token\n
Token ID198
Feature activation-3.50e-02

Pos logit contributions
ck+0.074
 breath+0.071
ited+0.065
 tug+0.062
away+0.061
Neg logit contributions
 She-0.015
 He-0.013
 There-0.011
tr-0.011
 Why-0.011
TokenTim
Token ID14967
Feature activation-2.13e-02

Pos logit contributions
ck+0.068
 breath+0.067
ited+0.054
 response+0.048
 tug+0.047
Neg logit contributions
 She-0.067
 He-0.064
 "-0.060
tr-0.059
 Why-0.058
Tokenmy
Token ID1820
Feature activation-6.56e-03

Pos logit contributions
ck+0.059
 breath+0.055
ited+0.049
 tug+0.047
away+0.047
Neg logit contributions
 She-0.021
 He-0.019
tr-0.019
 has-0.018
 Why-0.017
Token replied
Token ID8712
Feature activation+4.10e-02

Pos logit contributions
ck+0.016
 breath+0.015
ited+0.013
away+0.013
 response+0.012
Neg logit contributions
 She-0.008
 He-0.008
 has-0.008
tr-0.008
 There-0.007
Token,
Token ID11
Feature activation+2.76e-02

Pos logit contributions
 She+0.048
 He+0.045
tr+0.043
 There+0.043
 has+0.042
Neg logit contributions
ck-0.106
 breath-0.097
ited-0.089
 tug-0.082
away-0.082
Token "
Token ID366
Feature activation+2.35e-02

Pos logit contributions
tr+0.022
 has+0.018
robe+0.018
umm+0.016
 There+0.016
Neg logit contributions
ck-0.077
 breath-0.074
 tug-0.068
 swat-0.064
 response-0.062
TokenNo
Token ID2949
Feature activation-1.79e-02

Pos logit contributions
 There+0.033
 has+0.033
tr+0.032
 She+0.032
 living+0.031
Neg logit contributions
ck-0.053
 breath-0.045
ited-0.043
 tug-0.040
away-0.040
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
ck+0.048
 breath+0.044
ited+0.041
away+0.038
 tug+0.038
Neg logit contributions
 She-0.020
 He-0.019
 has-0.018
tr-0.017
 There-0.017
Token it
Token ID340
Feature activation+2.88e-02

Pos logit contributions
tr+0.032
 She+0.031
robe+0.031
 Did+0.030
 He+0.029
Neg logit contributions
ck-0.049
 breath-0.045
 tug-0.038
ited-0.038
 swat-0.037
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 breath+0.025
ck+0.024
 tug+0.021
away+0.019
 response+0.019
Neg logit contributions
tr-0.023
robe-0.022
 She-0.022
 Did-0.021
omm-0.020
Token but
Token ID475
Feature activation-4.45e-03

Pos logit contributions
ck+0.023
 breath+0.021
 tug+0.018
ited+0.017
 response+0.017
Neg logit contributions
 She-0.015
tr-0.015
 Did-0.014
robe-0.014
 There-0.014
Token it
Token ID340
Feature activation+3.82e-02

Pos logit contributions
ck+0.010
 breath+0.009
 tug+0.008
ited+0.007
 fear+0.007
Neg logit contributions
tr-0.007
 She-0.007
 There-0.006
 Did-0.006
 Why-0.006
Token felt
Token ID2936
Feature activation+6.65e-03

Pos logit contributions
 She+0.040
 There+0.037
 He+0.036
 Why+0.036
 Did+0.036
Neg logit contributions
ck-0.104
 breath-0.095
ited-0.086
 tug-0.083
 swat-0.078
Token nice
Token ID3621
Feature activation-1.90e-02

Pos logit contributions
 She+0.011
 He+0.010
tr+0.010
 has+0.010
 There+0.010
Neg logit contributions
ck-0.014
 breath-0.013
ited-0.011
 tug-0.010
away-0.010
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
ck+0.032
 breath+0.031
away+0.024
ited+0.024
 believe+0.023
Neg logit contributions
tr-0.033
 Did-0.033
 She-0.033
 There-0.032
 Where-0.032
Token He
Token ID679
Feature activation+3.31e-02

Pos logit contributions
tr+0.036
bows+0.032
fl+0.030
robe+0.027
 living+0.026
Neg logit contributions
ck-0.068
 breath-0.063
 tug-0.059
 agree-0.055
 quick-0.055
Token felt
Token ID2936
Feature activation+1.00e-02

Pos logit contributions
 She+0.052
tr+0.050
 There+0.049
 Why+0.049
 He+0.049
Neg logit contributions
ck-0.072
 breath-0.064
ited-0.056
 tug-0.055
 swat-0.050
Token happy
Token ID3772
Feature activation-3.23e-02

Pos logit contributions
 She+0.017
 He+0.016
tr+0.016
 has+0.016
 There+0.016
Neg logit contributions
ck-0.020
 breath-0.019
ited-0.016
 tug-0.015
away-0.015
Token and
Token ID290
Feature activation-4.46e-02

Pos logit contributions
ck+0.061
 breath+0.060
 tug+0.045
 fear+0.044
away+0.043
Neg logit contributions
tr-0.061
 Did-0.055
 There-0.054
 These-0.054
robe-0.053
Token proud
Token ID6613
Feature activation-1.51e-02

Pos logit contributions
ck+0.082
 breath+0.073
ited+0.062
 tug+0.057
away+0.056
Neg logit contributions
 She-0.084
 He-0.081
tr-0.080
 has-0.079
 There-0.079
Token was
Token ID373
Feature activation-2.86e-02

Pos logit contributions
tr+0.043
 She+0.038
 Why+0.037
 There+0.037
robe+0.036
Neg logit contributions
ck-0.103
 breath-0.094
ited-0.084
 tug-0.084
 behold-0.079
Token a
Token ID257
Feature activation-3.41e-02

Pos logit contributions
ck+0.080
 breath+0.074
ited+0.069
away+0.065
 tug+0.063
Neg logit contributions
 She-0.033
 He-0.028
 There-0.027
 living-0.026
 sky-0.026
Token dark
Token ID3223
Feature activation-3.37e-02

Pos logit contributions
ck+0.082
 breath+0.073
ited+0.068
 contain+0.062
war+0.061
Neg logit contributions
 She-0.048
 has-0.046
 He-0.046
 living-0.045
 "-0.045
Token color
Token ID3124
Feature activation+1.52e-02

Pos logit contributions
ck+0.066
 breath+0.065
away+0.053
ited+0.052
 tug+0.050
Neg logit contributions
 Did-0.052
 She-0.052
 How-0.051
 Why-0.050
tr-0.050
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 Did+0.028
 How+0.027
 Why+0.027
 She+0.027
 There+0.026
Neg logit contributions
ck-0.023
 breath-0.023
away-0.018
 tug-0.017
ot-0.017
Token Nobody
Token ID15658
Feature activation-7.33e-03

Pos logit contributions
 She+0.009
tr+0.009
 has+0.008
 He+0.008
 Why+0.008
Neg logit contributions
ck-0.034
 breath-0.032
ited-0.029
 tug-0.028
away-0.028
Token knew
Token ID2993
Feature activation+1.90e-02

Pos logit contributions
ck+0.020
 breath+0.018
ited+0.016
 tug+0.016
away+0.015
Neg logit contributions
 She-0.008
 He-0.007
tr-0.007
 There-0.007
 has-0.007
Token where
Token ID810
Feature activation+2.51e-02

Pos logit contributions
tr+0.032
 She+0.031
 He+0.031
 Did+0.030
robe+0.030
Neg logit contributions
ck-0.039
 breath-0.034
 fear-0.027
away-0.027
 swat-0.026
Token it
Token ID340
Feature activation+2.49e-02

Pos logit contributions
 She+0.034
 He+0.032
tr+0.032
 has+0.031
 There+0.031
Neg logit contributions
ck-0.059
 breath-0.054
ited-0.048
 tug-0.045
away-0.044
Token came
Token ID1625
Feature activation-1.25e-02

Pos logit contributions
 She+0.020
tr+0.018
 He+0.018
 There+0.017
 Why+0.017
Neg logit contributions
ck-0.072
 breath-0.067
ited-0.061
 tug-0.058
away-0.057
Token from
Token ID422
Feature activation-6.06e-03

Pos logit contributions
ck+0.028
 breath+0.025
ited+0.022
away+0.020
 tug+0.020
Neg logit contributions
 She-0.019
tr-0.018
 He-0.018
 There-0.018
 has-0.018
Token pot
Token ID1787
Feature activation+4.76e-02

Pos logit contributions
 She+0.037
 He+0.036
 Did+0.036
 Why+0.036
 There+0.035
Neg logit contributions
ck-0.045
 breath-0.044
ited-0.035
away-0.034
 response-0.034
Token of
Token ID286
Feature activation-4.97e-03

Pos logit contributions
 Did+0.074
tr+0.072
 How+0.071
 There+0.071
 Where+0.071
Neg logit contributions
ck-0.096
 breath-0.088
ited-0.077
 contain-0.072
away-0.072
Token gold
Token ID3869
Feature activation+2.39e-02

Pos logit contributions
ck+0.010
 breath+0.009
ited+0.008
 tug+0.007
away+0.007
Neg logit contributions
 She-0.008
tr-0.008
 He-0.008
 There-0.008
 Why-0.008
Token.
Token ID13
Feature activation+8.64e-03

Pos logit contributions
 Did+0.043
 Why+0.042
 She+0.042
 There+0.041
 Where+0.041
Neg logit contributions
ck-0.046
 breath-0.043
away-0.035
ited-0.033
 tug-0.030
Token<|endoftext|>
Token ID50256
Feature activation+1.19e-02

Pos logit contributions
tr+0.009
robe+0.008
 has+0.008
 living+0.008
 sky+0.007
Neg logit contributions
ck-0.021
 breath-0.019
ited-0.017
 tug-0.017
 agree-0.017
TokenOnce
Token ID7454
Feature activation-6.32e-03

Pos logit contributions
tr+0.014
der+0.011
umm+0.011
robe+0.011
 Did+0.010
Neg logit contributions
ck-0.027
 breath-0.026
ited-0.023
war-0.022
away-0.022
Token upon
Token ID2402
Feature activation-9.40e-03

Pos logit contributions
ck+0.013
 breath+0.011
ited+0.010
 tug+0.009
away+0.009
Neg logit contributions
 She-0.011
 He-0.011
 has-0.010
 There-0.010
tr-0.010
Token a
Token ID257
Feature activation-4.15e-02

Pos logit contributions
ck+0.024
 breath+0.022
ited+0.020
away+0.018
 tug+0.018
Neg logit contributions
 She-0.012
 He-0.011
 There-0.010
tr-0.010
 has-0.010
Token time
Token ID640
Feature activation-6.65e-03

Pos logit contributions
ck+0.097
 breath+0.088
away+0.075
 contain+0.074
shadow+0.074
Neg logit contributions
 She-0.057
 has-0.056
 He-0.055
 living-0.052
tr-0.052
Token,
Token ID11
Feature activation-8.43e-03

Pos logit contributions
ck+0.015
 breath+0.014
ited+0.012
 tug+0.012
away+0.012
Neg logit contributions
tr-0.009
 She-0.009
 He-0.008
 has-0.008
robe-0.008
Token there
Token ID612
Feature activation+3.79e-02

Pos logit contributions
ck+0.019
 breath+0.017
ited+0.015
 tug+0.014
away+0.014
Neg logit contributions
 She-0.012
tr-0.011
 He-0.011
 Did-0.011
 There-0.011
Token.
Token ID13
Feature activation+4.13e-02

Pos logit contributions
ck+0.059
 breath+0.053
ited+0.046
away+0.043
 tug+0.043
Neg logit contributions
tr-0.046
 There-0.045
 She-0.045
 Did-0.045
robe-0.043
Token Now
Token ID2735
Feature activation+1.43e-02

Pos logit contributions
boards+0.048
tr+0.046
robe+0.043
pit+0.043
 sky+0.040
Neg logit contributions
ck-0.081
 breath-0.077
 believe-0.071
iona-0.070
 common-0.069
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
tr+0.025
robe+0.024
 Did+0.023
 She+0.023
 Why+0.022
Neg logit contributions
ck-0.029
 breath-0.027
 tug-0.022
 believe-0.021
 promise-0.021
Token go
Token ID467
Feature activation-1.80e-02

Pos logit contributions
robe+0.040
 She+0.040
tr+0.040
 He+0.038
 Did+0.038
Neg logit contributions
ck-0.039
 breath-0.035
 tug-0.028
 believe-0.027
 swat-0.026
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
 breath+0.032
ck+0.032
away+0.027
ited+0.024
 promise+0.023
Neg logit contributions
tr-0.034
 Did-0.031
robe-0.031
bows-0.031
der-0.030
Token play
Token ID711
Feature activation+9.32e-03

Pos logit contributions
ck+0.074
 breath+0.068
 tug+0.055
 swat+0.051
ited+0.049
Neg logit contributions
robe-0.086
 She-0.084
tr-0.084
boards-0.082
 He-0.082
Token with
Token ID351
Feature activation-1.35e-02

Pos logit contributions
 She+0.012
 He+0.011
tr+0.011
 has+0.011
 Why+0.011
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.019
 tug-0.017
away-0.017
Token your
Token ID534
Feature activation+1.94e-02

Pos logit contributions
ck+0.029
 breath+0.027
ited+0.025
 contain+0.022
rig+0.022
Neg logit contributions
 She-0.022
 He-0.020
 has-0.020
 living-0.020
 There-0.019
Token ship
Token ID4074
Feature activation+3.34e-02

Pos logit contributions
 She+0.023
 He+0.022
 has+0.022
tr+0.021
 living+0.021
Neg logit contributions
ck-0.050
 breath-0.044
ited-0.041
away-0.038
 tug-0.037
Token.
Token ID13
Feature activation+3.91e-02

Pos logit contributions
 She+0.050
 There+0.048
 He+0.047
 Did+0.047
 Why+0.047
Neg logit contributions
ck-0.074
 breath-0.068
ited-0.058
away-0.055
 tug-0.054
Token But
Token ID887
Feature activation+3.50e-03

Pos logit contributions
boards+0.041
robe+0.040
 living+0.039
pit+0.038
tr+0.037
Neg logit contributions
ck-0.089
 breath-0.080
 response-0.074
 believe-0.072
 tug-0.072
Token He
Token ID679
Feature activation+2.07e-02

Pos logit contributions
ck+0.041
 breath+0.038
ited+0.034
 tug+0.033
away+0.032
Neg logit contributions
tr-0.013
 She-0.013
 has-0.012
robe-0.012
 He-0.012
Token smiled
Token ID13541
Feature activation+2.86e-03

Pos logit contributions
 She+0.034
tr+0.033
 There+0.032
 He+0.032
robe+0.032
Neg logit contributions
ck-0.043
 breath-0.039
ited-0.034
 tug-0.033
 swat-0.030
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
tr+0.004
 She+0.004
 Did+0.004
 There+0.004
 has+0.004
Neg logit contributions
ck-0.007
 breath-0.006
ited-0.005
 tug-0.005
away-0.005
Token went
Token ID1816
Feature activation+1.13e-02

Pos logit contributions
ck+0.035
 breath+0.030
 tug+0.026
ited+0.025
 swat+0.023
Neg logit contributions
tr-0.034
 She-0.032
 Why-0.032
robe-0.032
 Did-0.032
Token inside
Token ID2641
Feature activation+1.56e-02

Pos logit contributions
 She+0.015
tr+0.014
 He+0.014
 has+0.014
 There+0.014
Neg logit contributions
ck-0.027
 breath-0.025
ited-0.022
 tug-0.021
away-0.020
Token to
Token ID284
Feature activation-7.16e-03

Pos logit contributions
 She+0.018
 He+0.017
 There+0.016
 living+0.016
 has+0.016
Neg logit contributions
ck-0.041
 breath-0.036
ited-0.034
away-0.031
 tug-0.031
Token have
Token ID423
Feature activation+1.02e-02

Pos logit contributions
ck+0.014
 breath+0.012
ited+0.011
 tug+0.010
away+0.010
Neg logit contributions
 She-0.013
 He-0.012
tr-0.012
 has-0.012
 There-0.012
Token dinner
Token ID8073
Feature activation+2.32e-02

Pos logit contributions
 She+0.014
 He+0.014
tr+0.013
 has+0.013
 There+0.013
Neg logit contributions
ck-0.024
 breath-0.022
ited-0.019
 tug-0.018
away-0.018
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
 There+0.040
 Did+0.040
tr+0.040
 She+0.039
 Why+0.038
Neg logit contributions
ck-0.042
 breath-0.040
ited-0.033
 tug-0.032
away-0.032
Token<|endoftext|>
Token ID50256
Feature activation+1.22e-02

Pos logit contributions
tr+0.017
robe+0.015
 has+0.015
bows+0.014
 sky+0.014
Neg logit contributions
ck-0.041
 breath-0.039
 tug-0.035
ited-0.034
rig-0.034
TokenOnce
Token ID7454
Feature activation-4.13e-03

Pos logit contributions
tr+0.014
 Did+0.011
robe+0.011
umm+0.011
der+0.011
Neg logit contributions
ck-0.028
 breath-0.027
ited-0.023
away-0.023
 tug-0.023
Token she
Token ID673
Feature activation+2.43e-02

Pos logit contributions
ck+0.035
 breath+0.032
ited+0.029
 tug+0.027
away+0.027
Neg logit contributions
 She-0.016
 He-0.015
tr-0.015
 There-0.015
 has-0.015
Token skipped
Token ID26684
Feature activation-1.19e-03

Pos logit contributions
 She+0.039
 He+0.037
tr+0.036
 There+0.036
 Why+0.035
Neg logit contributions
ck-0.053
 breath-0.048
ited-0.042
 tug-0.040
away-0.038
Token around
Token ID1088
Feature activation-1.92e-02

Pos logit contributions
ck+0.003
 breath+0.003
ited+0.002
away+0.002
 tug+0.002
Neg logit contributions
 She-0.002
tr-0.002
 There-0.002
 He-0.002
 has-0.002
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
ck+0.038
 breath+0.036
ited+0.029
 tug+0.029
away+0.028
Neg logit contributions
tr-0.033
 She-0.032
 He-0.031
 There-0.031
 Did-0.030
Token\n
Token ID198
Feature activation-2.27e-02

Pos logit contributions
tr+0.013
robe+0.012
 has+0.011
 Did+0.010
 He+0.010
Neg logit contributions
ck-0.032
 breath-0.030
ited-0.026
 tug-0.026
 swat-0.025
Token\n
Token ID198
Feature activation-3.08e-02

Pos logit contributions
ck+0.070
 breath+0.066
ited+0.061
 tug+0.057
away+0.057
Neg logit contributions
 She-0.015
 He-0.013
 There-0.012
 has-0.012
tr-0.012
TokenBut
Token ID1537
Feature activation-5.32e-03

Pos logit contributions
ck+0.065
 breath+0.061
ited+0.052
 tug+0.048
away+0.048
Neg logit contributions
 She-0.051
 He-0.047
 Why-0.046
 has-0.045
 There-0.045
Token soon
Token ID2582
Feature activation-2.05e-02

Pos logit contributions
ck+0.012
 breath+0.011
ited+0.010
away+0.009
 tug+0.009
Neg logit contributions
 She-0.008
 has-0.007
 He-0.007
 There-0.007
 Why-0.007
Token,
Token ID11
Feature activation+5.91e-03

Pos logit contributions
ck+0.048
 breath+0.044
ited+0.039
 tug+0.037
away+0.036
Neg logit contributions
 She-0.028
tr-0.027
 He-0.026
 There-0.026
 has-0.026
Token the
Token ID262
Feature activation+2.02e-02

Pos logit contributions
 She+0.009
 He+0.009
 has+0.009
 There+0.009
 Why+0.008
Neg logit contributions
ck-0.013
 breath-0.012
ited-0.010
away-0.009
 tug-0.009
Token sock
Token ID32263
Feature activation+3.77e-02

Pos logit contributions
 She+0.031
tr+0.030
 He+0.030
 There+0.029
 Did+0.029
Neg logit contributions
ck-0.043
 breath-0.041
ited-0.035
 tug-0.033
 swat-0.032
Token's
Token ID338
Feature activation-2.16e-02

Pos logit contributions
ck+0.003
 breath+0.003
ited+0.003
 tug+0.002
away+0.002
Neg logit contributions
 She-0.001
 He-0.001
tr-0.001
 There-0.001
 has-0.001
Token press
Token ID1803
Feature activation-3.48e-03

Pos logit contributions
ck+0.041
 breath+0.037
ited+0.030
 tug+0.029
away+0.028
Neg logit contributions
 She-0.039
 He-0.038
tr-0.038
 has-0.037
 Why-0.037
Token it
Token ID340
Feature activation+3.20e-02

Pos logit contributions
ck+0.008
 breath+0.007
ited+0.006
 tug+0.006
away+0.006
Neg logit contributions
 She-0.005
 He-0.005
tr-0.005
 has-0.005
 There-0.005
Token!"
Token ID2474
Feature activation-4.59e-03

Pos logit contributions
 She+0.048
 Did+0.047
 Why+0.047
 There+0.046
 He+0.046
Neg logit contributions
ck-0.067
 breath-0.060
away-0.050
ited-0.048
 tug-0.047
Token
Token ID220
Feature activation-7.07e-04

Pos logit contributions
ck+0.010
 breath+0.009
 tug+0.008
 swat+0.008
shadow+0.007
Neg logit contributions
 has-0.006
umm-0.006
 living-0.006
tr-0.006
robe-0.006
Token\n
Token ID198
Feature activation-2.75e-02

Pos logit contributions
ck+0.002
 breath+0.002
ited+0.002
 tug+0.002
 swat+0.002
Neg logit contributions
tr-0.001
 She-0.001
 has-0.001
 Why-0.001
 Did-0.001
Token\n
Token ID198
Feature activation-3.53e-02

Pos logit contributions
ck+0.078
 breath+0.071
ited+0.063
 tug+0.062
away+0.061
Neg logit contributions
 She-0.023
tr-0.022
 has-0.021
 There-0.021
 Why-0.020
TokenSuddenly
Token ID38582
Feature activation-6.70e-03

Pos logit contributions
ck+0.060
 breath+0.054
ited+0.046
 tug+0.041
away+0.040
Neg logit contributions
 She-0.071
 He-0.069
tr-0.068
 has-0.067
 There-0.067
Token,
Token ID11
Feature activation+3.60e-04

Pos logit contributions
ck+0.017
 breath+0.016
ited+0.014
 tug+0.013
away+0.013
Neg logit contributions
 She-0.008
 He-0.008
tr-0.007
 There-0.007
 has-0.007
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
 She+0.001
tr+0.001
 He+0.001
 has+0.001
 There+0.001
Neg logit contributions
ck-0.001
 breath-0.001
ited-0.001
 tug-0.001
away-0.001
Token ground
Token ID2323
Feature activation+2.58e-02

Pos logit contributions
 She+0.033
 He+0.032
 Why+0.031
 There+0.030
 Did+0.030
Neg logit contributions
ck-0.048
 breath-0.045
ited-0.038
 tug-0.037
 swat-0.035
Token the
Token ID262
Feature activation+5.59e-03

Pos logit contributions
 She+0.017
tr+0.017
 Did+0.017
 There+0.017
 He+0.017
Neg logit contributions
ck-0.026
 breath-0.024
ited-0.020
 tug-0.020
 swat-0.019
Token biggest
Token ID4094
Feature activation-4.28e-04

Pos logit contributions
 She+0.007
 There+0.007
 Did+0.007
 He+0.007
 Why+0.006
Neg logit contributions
ck-0.013
 breath-0.013
ited-0.011
 tug-0.011
 swat-0.011
Token squash
Token ID34613
Feature activation+3.79e-02

Pos logit contributions
ck+0.001
 breath+0.001
ited+0.001
 tug+0.000
 swat+0.000
Neg logit contributions
tr-0.001
 Did-0.001
 She-0.001
robe-0.001
 Why-0.001
Token she
Token ID673
Feature activation+2.76e-02

Pos logit contributions
tr+0.046
 Did+0.046
 There+0.045
 He+0.045
 She+0.043
Neg logit contributions
ck-0.088
 breath-0.082
ited-0.072
 tug-0.070
 swat-0.069
Token had
Token ID550
Feature activation+2.48e-02

Pos logit contributions
tr+0.035
 She+0.034
 There+0.031
 He+0.031
 sky+0.030
Neg logit contributions
ck-0.067
 breath-0.060
ited-0.056
 tug-0.053
away-0.052
Token ever
Token ID1683
Feature activation-5.12e-02

Pos logit contributions
 She+0.046
 He+0.044
tr+0.044
 There+0.043
 has+0.043
Neg logit contributions
ck-0.047
 breath-0.042
ited-0.036
 tug-0.032
away-0.032
Token seen
Token ID1775
Feature activation+1.16e-02

Pos logit contributions
ck+0.083
 breath+0.074
ited+0.059
 tug+0.055
away+0.054
Neg logit contributions
 She-0.106
 has-0.103
 He-0.103
tr-0.101
 There-0.100
Token!
Token ID0
Feature activation-3.00e-03

Pos logit contributions
 She+0.020
 He+0.019
 There+0.019
tr+0.019
 Did+0.018
Neg logit contributions
ck-0.022
 breath-0.020
ited-0.018
away-0.016
 tug-0.016
Token She
Token ID1375
Feature activation+1.78e-02

Pos logit contributions
ck+0.008
 breath+0.007
 tug+0.007
ited+0.007
war+0.007
Neg logit contributions
tr-0.002
 has-0.002
robe-0.002
 living-0.002
bridge-0.001
Token picked
Token ID6497
Feature activation+2.05e-02

Pos logit contributions
 She+0.021
 He+0.020
tr+0.020
 has+0.020
 There+0.019
Neg logit contributions
ck-0.045
 breath-0.041
ited-0.037
away-0.035
 tug-0.034
Token it
Token ID340
Feature activation+2.76e-02

Pos logit contributions
tr+0.044
robe+0.041
 Did+0.039
 There+0.038
 He+0.038
Neg logit contributions
ck-0.034
 breath-0.031
ited-0.024
 swat-0.024
 tug-0.024
Token time
Token ID640
Feature activation-7.96e-03

Pos logit contributions
ck+0.115
 breath+0.104
shadow+0.090
 swat+0.090
 contain+0.089
Neg logit contributions
 has-0.061
 She-0.061
tr-0.059
 He-0.057
 is-0.057
Token,
Token ID11
Feature activation-8.69e-03

Pos logit contributions
ck+0.021
 breath+0.019
ited+0.018
away+0.017
 tug+0.017
Neg logit contributions
 She-0.009
 He-0.008
 There-0.008
 has-0.008
tr-0.008
Token there
Token ID612
Feature activation+3.57e-02

Pos logit contributions
ck+0.022
 breath+0.021
ited+0.019
away+0.017
 tug+0.017
Neg logit contributions
 She-0.010
 He-0.010
 has-0.009
 There-0.009
tr-0.009
Token was
Token ID373
Feature activation-3.15e-02

Pos logit contributions
tr+0.032
 She+0.031
 There+0.028
 He+0.028
 Why+0.028
Neg logit contributions
ck-0.097
 breath-0.090
ited-0.081
 tug-0.079
away-0.075
Token a
Token ID257
Feature activation-3.95e-02

Pos logit contributions
ck+0.090
 breath+0.083
ited+0.078
away+0.074
ge+0.072
Neg logit contributions
 She-0.034
 He-0.030
 There-0.028
 living-0.027
 sky-0.027
Token little
Token ID1310
Feature activation-6.02e-02

Pos logit contributions
ck+0.094
 breath+0.083
ited+0.077
 contain+0.070
war+0.068
Neg logit contributions
 She-0.056
 He-0.055
 has-0.054
 living-0.053
 "-0.052
Token rabbit
Token ID22746
Feature activation-4.27e-02

Pos logit contributions
ck+0.156
 breath+0.142
ited+0.129
 tug+0.120
away+0.119
Neg logit contributions
 She-0.070
 He-0.066
tr-0.064
 has-0.064
 There-0.062
Token named
Token ID3706
Feature activation-2.81e-02

Pos logit contributions
ck+0.098
 breath+0.091
ited+0.083
 tug+0.076
away+0.073
Neg logit contributions
 She-0.063
 He-0.060
tr-0.057
 living-0.057
 has-0.056
Token Bun
Token ID28515
Feature activation-6.76e-02

Pos logit contributions
ck+0.062
 breath+0.057
ited+0.051
away+0.046
 tug+0.046
Neg logit contributions
 She-0.045
 He-0.043
 There-0.041
tr-0.040
 Why-0.040
Token.
Token ID13
Feature activation+5.21e-03

Pos logit contributions
ck+0.158
 breath+0.151
ited+0.138
 contain+0.130
 tug+0.124
Neg logit contributions
 He-0.091
 She-0.091
 living-0.084
 has-0.081
tr-0.081
Token Bun
Token ID28515
Feature activation-4.47e-02

Pos logit contributions
tr+0.008
 Why+0.008
 has+0.008
 How+0.008
 She+0.008
Neg logit contributions
ck-0.010
 breath-0.009
 tug-0.008
ited-0.007
away-0.007
Token-
Token ID12
Feature activation-5.66e-02

Pos logit contributions
 She+0.016
 He+0.016
 has+0.015
 living+0.015
 sky+0.014
Neg logit contributions
ck-0.019
 breath-0.017
ited-0.016
 tug-0.015
 swat-0.014
Tokenold
Token ID727
Feature activation-1.69e-02

Pos logit contributions
 breath+0.091
ck+0.090
ited+0.070
 tug+0.067
 swat+0.063
Neg logit contributions
 She-0.119
 He-0.113
 living-0.110
 Did-0.109
tr-0.108
Token daughter
Token ID4957
Feature activation-2.31e-02

Pos logit contributions
ck+0.036
 breath+0.033
ited+0.029
 tug+0.027
away+0.026
Neg logit contributions
 She-0.026
tr-0.025
 He-0.024
 There-0.024
 Why-0.024
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
ck+0.045
 breath+0.042
 tug+0.035
ited+0.035
 agree+0.033
Neg logit contributions
tr-0.036
robe-0.034
 She-0.034
 He-0.034
 Why-0.032
Token Mill
Token ID9212
Feature activation-4.68e-02

Pos logit contributions
tr+0.018
 She+0.018
robe+0.018
 He+0.017
 There+0.017
Neg logit contributions
ck-0.023
 breath-0.021
ited-0.018
 tug-0.017
 swat-0.016
Tokenie
Token ID494
Feature activation-2.86e-02

Pos logit contributions
ck+0.061
 breath+0.052
ited+0.041
 tug+0.036
away+0.034
Neg logit contributions
 She-0.113
 He-0.108
 has-0.108
tr-0.107
 living-0.106
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
ck+0.062
 breath+0.057
ited+0.048
 tug+0.047
away+0.045
Neg logit contributions
tr-0.041
 She-0.041
robe-0.040
 There-0.040
 Did-0.040
Token
Token ID220
Feature activation+1.39e-02

Pos logit contributions
robe+0.016
tr+0.015
oll+0.014
boards+0.013
 has+0.013
Neg logit contributions
ck-0.024
 breath-0.024
 response-0.021
 behold-0.020
 agreement-0.020
Token
Token ID198
Feature activation-2.14e-02

Pos logit contributions
tr+0.016
 has+0.015
robe+0.014
 Did+0.014
umm+0.014
Neg logit contributions
ck-0.033
 breath-0.029
 agree-0.025
 tug-0.025
rig-0.025
Token
Token ID198
Feature activation-2.98e-02

Pos logit contributions
ck+0.057
 breath+0.052
ited+0.046
 tug+0.045
away+0.045
Neg logit contributions
tr-0.021
robe-0.019
 has-0.018
 She-0.018
 Why-0.018
TokenMill
Token ID22603
Feature activation-3.92e-02

Pos logit contributions
ck+0.057
 breath+0.052
ited+0.044
 tug+0.041
away+0.040
Neg logit contributions
 She-0.053
 He-0.051
tr-0.051
 has-0.050
 There-0.050
Token Mom
Token ID11254
Feature activation-1.77e-02

Pos logit contributions
ck+0.099
 breath+0.090
ited+0.080
away+0.075
 tug+0.074
Neg logit contributions
 She-0.058
 He-0.055
 has-0.053
tr-0.053
 There-0.052
Tokenmy
Token ID1820
Feature activation-3.75e-03

Pos logit contributions
ck+0.039
 breath+0.036
ited+0.032
 tug+0.030
 response+0.028
Neg logit contributions
 There-0.024
 Why-0.024
 How-0.024
 She-0.024
tr-0.024
Token said
Token ID531
Feature activation+5.21e-02

Pos logit contributions
ck+0.008
 breath+0.008
ited+0.007
 tug+0.006
away+0.006
Neg logit contributions
 She-0.006
tr-0.005
 Why-0.005
 He-0.005
 There-0.005
Token "
Token ID366
Feature activation+2.45e-02

Pos logit contributions
tr+0.085
 She+0.080
 He+0.079
 Why+0.077
 sky+0.077
Neg logit contributions
ck-0.105
 breath-0.098
 tug-0.081
ited-0.080
away-0.076
TokenLet
Token ID5756
Feature activation-5.50e-03

Pos logit contributions
 She+0.042
 He+0.041
tr+0.041
 There+0.040
 has+0.040
Neg logit contributions
ck-0.048
 breath-0.043
ited-0.038
 tug-0.035
away-0.034
Tokenâ
Token ID22940
Feature activation-7.63e-02

Pos logit contributions
ck+0.015
 breath+0.014
ited+0.013
 tug+0.012
away+0.012
Neg logit contributions
 She-0.006
 has-0.005
 He-0.005
tr-0.005
 There-0.005
Token
Token ID26391
Feature activation-4.03e-02

Pos logit contributions
ck+0.140
 breath+0.132
ited+0.101
 tug+0.098
away+0.094
Neg logit contributions
 She-0.142
 He-0.133
 Why-0.133
 How-0.132
 There-0.131
Token
Token ID8151
Feature activation-3.21e-02

Pos logit contributions
ck+0.089
 breath+0.083
ited+0.072
 tug+0.068
away+0.066
Neg logit contributions
 She-0.055
 He-0.054
tr-0.054
robe-0.051
 has-0.051
Tokens
Token ID82
Feature activation-1.95e-02

Pos logit contributions
ck+0.073
 breath+0.067
ited+0.059
 tug+0.055
away+0.055
Neg logit contributions
 She-0.046
 He-0.044
tr-0.043
 has-0.042
 There-0.042
Token go
Token ID467
Feature activation-2.29e-02

Pos logit contributions
ck+0.034
 breath+0.032
 tug+0.025
 believe+0.024
ited+0.023
Neg logit contributions
 Did-0.038
 She-0.038
tr-0.038
 Why-0.037
robe-0.036
Token!"
Token ID2474
Feature activation+1.24e-03

Pos logit contributions
ck+0.044
 breath+0.043
away+0.034
 tug+0.032
ited+0.032
Neg logit contributions
tr-0.041
 She-0.040
 Did-0.039
 Why-0.038
 There-0.037
Token that
Token ID326
Feature activation+3.04e-02

Pos logit contributions
 She+0.010
 has+0.009
 He+0.009
tr+0.009
 There+0.009
Neg logit contributions
ck-0.020
 breath-0.019
ited-0.018
away-0.016
 tug-0.016
Token?"
Token ID1701
Feature activation+7.02e-03

Pos logit contributions
 She+0.049
tr+0.049
 Did+0.048
 There+0.048
 He+0.047
Neg logit contributions
ck-0.062
 breath-0.056
ited-0.046
 swat-0.045
away-0.044
Token
Token ID198
Feature activation-2.67e-02

Pos logit contributions
tr+0.006
 She+0.005
robe+0.005
 has+0.005
 He+0.005
Neg logit contributions
ck-0.020
 breath-0.019
ited-0.016
 tug-0.016
 swat-0.016
Token
Token ID198
Feature activation-3.85e-02

Pos logit contributions
ck+0.085
 breath+0.081
ited+0.077
away+0.070
 tug+0.070
Neg logit contributions
 She-0.016
 He-0.015
 There-0.012
 has-0.012
tr-0.011
Token"
Token ID1
Feature activation+3.23e-02

Pos logit contributions
 breath+0.085
ck+0.085
ited+0.082
 contain+0.070
 response+0.069
Neg logit contributions
 She-0.061
 He-0.057
 has-0.053
 living-0.052
 "-0.052
TokenMos
Token ID32668
Feature activation-7.42e-02

Pos logit contributions
 She+0.037
tr+0.034
 He+0.034
 Why+0.033
 has+0.033
Neg logit contributions
ck-0.083
 breath-0.079
ited-0.071
away-0.067
 contain-0.064
Tokenquit
Token ID47391
Feature activation-2.75e-03

Pos logit contributions
ck+0.156
 breath+0.151
ited+0.123
 tug+0.120
away+0.118
Neg logit contributions
 She-0.118
 He-0.108
 There-0.107
 Why-0.105
 sky-0.105
Tokenoes
Token ID3028
Feature activation-1.85e-03

Pos logit contributions
ck+0.005
 breath+0.005
ited+0.004
away+0.004
 swat+0.004
Neg logit contributions
 She-0.004
 He-0.004
tr-0.004
 There-0.004
 Did-0.004
Token!"
Token ID2474
Feature activation+4.65e-03

Pos logit contributions
ck+0.005
 breath+0.004
ited+0.004
 tug+0.004
 swat+0.004
Neg logit contributions
 She-0.002
tr-0.002
 He-0.002
 There-0.002
 Did-0.002
Token Tom
Token ID4186
Feature activation-1.96e-02

Pos logit contributions
 She+0.007
 He+0.006
 There+0.006
 has+0.006
tr+0.006
Neg logit contributions
ck-0.011
 breath-0.010
ited-0.009
away-0.009
 tug-0.008
Token said
Token ID531
Feature activation+3.20e-02

Pos logit contributions
ck+0.045
 breath+0.041
ited+0.036
away+0.034
 tug+0.034
Neg logit contributions
 She-0.029
 He-0.027
tr-0.026
 has-0.026
 There-0.026
Token,
Token ID11
Feature activation+1.53e-03

Pos logit contributions
ck+0.020
 breath+0.018
ited+0.017
 tug+0.015
 swat+0.014
Neg logit contributions
 She-0.013
 He-0.013
 has-0.013
 There-0.012
 Why-0.012
Token but
Token ID475
Feature activation-6.55e-03

Pos logit contributions
tr+0.003
robe+0.002
 She+0.002
 Did+0.002
ase+0.002
Neg logit contributions
ck-0.003
 breath-0.003
 tug-0.002
 swat-0.002
 fear-0.002
Token he
Token ID339
Feature activation+1.08e-02

Pos logit contributions
ck+0.016
 breath+0.015
 tug+0.013
ited+0.013
away+0.012
Neg logit contributions
tr-0.008
 She-0.008
 There-0.007
 Did-0.007
 Why-0.007
Token just
Token ID655
Feature activation-1.10e-02

Pos logit contributions
 She+0.014
tr+0.013
 There+0.013
 He+0.013
 Why+0.012
Neg logit contributions
ck-0.027
 breath-0.024
ited-0.022
 tug-0.021
 swat-0.020
Token couldn
Token ID3521
Feature activation-6.76e-03

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.021
 tug+0.019
away+0.019
Neg logit contributions
 She-0.015
 He-0.015
tr-0.015
 has-0.014
 There-0.014
Tokenâ
Token ID22940
Feature activation-7.55e-02

Pos logit contributions
ck+0.015
 breath+0.014
 tug+0.012
ited+0.012
 swat+0.011
Neg logit contributions
tr-0.009
robe-0.008
 living-0.008
 Did-0.008
 Why-0.008
Token
Token ID26391
Feature activation-2.06e-02

Pos logit contributions
ck+0.133
 breath+0.126
ited+0.099
 tug+0.093
away+0.091
Neg logit contributions
 She-0.146
 He-0.141
tr-0.138
 has-0.137
 There-0.136
Token
Token ID8151
Feature activation-1.34e-02

Pos logit contributions
ck+0.050
 breath+0.045
ited+0.041
 tug+0.039
away+0.037
Neg logit contributions
 She-0.024
tr-0.023
 He-0.023
 has-0.022
robe-0.022
Tokent
Token ID83
Feature activation-4.63e-02

Pos logit contributions
ck+0.032
 breath+0.029
ited+0.026
 tug+0.024
away+0.024
Neg logit contributions
 She-0.018
 He-0.017
tr-0.017
 There-0.016
 has-0.016
Token work
Token ID670
Feature activation+1.36e-02

Pos logit contributions
ck+0.091
 breath+0.081
ited+0.067
 tug+0.065
away+0.064
Neg logit contributions
tr-0.084
 She-0.082
 Did-0.081
 has-0.079
 There-0.079
Token it
Token ID340
Feature activation+2.89e-02

Pos logit contributions
 She+0.020
tr+0.019
 He+0.019
 There+0.018
 has+0.018
Neg logit contributions
ck-0.031
 breath-0.028
ited-0.025
 tug-0.023
away-0.023
Token!
Token ID0
Feature activation+1.35e-02

Pos logit contributions
ck+0.021
 breath+0.020
ited+0.016
 tug+0.015
away+0.015
Neg logit contributions
tr-0.024
robe-0.022
 Did-0.022
 There-0.021
 has-0.021
Token<|endoftext|>
Token ID50256
Feature activation+1.07e-02

Pos logit contributions
tr+0.015
robe+0.014
boards+0.013
bows+0.013
 has+0.012
Neg logit contributions
ck-0.032
 breath-0.030
 tug-0.025
 swat-0.025
ited-0.025
TokenOne
Token ID3198
Feature activation-7.72e-03

Pos logit contributions
tr+0.013
der+0.010
 Did+0.010
umm+0.010
robe+0.009
Neg logit contributions
ck-0.024
 breath-0.023
ited-0.020
war-0.019
away-0.019
Token day
Token ID1110
Feature activation-9.38e-03

Pos logit contributions
ck+0.022
 breath+0.019
 swat+0.017
charged+0.017
away+0.017
Neg logit contributions
 She-0.008
 has-0.008
 He-0.008
 living-0.007
 sky-0.007
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
ck+0.025
 breath+0.023
ited+0.021
away+0.020
 tug+0.020
Neg logit contributions
 She-0.010
 He-0.009
 There-0.009
 has-0.009
 Why-0.009
Token Sil
Token ID4243
Feature activation-9.51e-02

Pos logit contributions
ck+0.032
 breath+0.029
ited+0.027
away+0.025
 swat+0.024
Neg logit contributions
 She-0.016
 He-0.015
 There-0.014
 has-0.014
 Why-0.014
Tokenvia
Token ID8869
Feature activation-3.22e-02

Pos logit contributions
 breath+0.195
ck+0.159
 tug+0.147
 avail+0.146
 fear+0.146
Neg logit contributions
tr-0.150
 has-0.150
 living-0.145
 She-0.140
 Why-0.136
Token was
Token ID373
Feature activation-1.83e-03

Pos logit contributions
ck+0.086
 breath+0.080
ited+0.072
 tug+0.068
away+0.067
Neg logit contributions
 She-0.033
tr-0.031
 He-0.031
 There-0.029
 has-0.029
Token getting
Token ID1972
Feature activation-1.30e-02

Pos logit contributions
ck+0.004
 breath+0.004
ited+0.003
 tug+0.003
 swat+0.003
Neg logit contributions
 She-0.003
tr-0.003
 He-0.003
 has-0.003
 Did-0.003
Token ready
Token ID3492
Feature activation+5.71e-03

Pos logit contributions
ck+0.032
 breath+0.029
ited+0.027
away+0.025
 swat+0.024
Neg logit contributions
 She-0.017
 He-0.015
 living-0.015
 Why-0.015
 sky-0.015
Token to
Token ID284
Feature activation-1.44e-02

Pos logit contributions
tr+0.008
 She+0.008
 He+0.007
 Why+0.007
 There+0.007
Neg logit contributions
ck-0.013
 breath-0.012
ited-0.010
 tug-0.010
away-0.010
Token.
Token ID13
Feature activation+5.04e-03

Pos logit contributions
ck+0.020
 breath+0.018
ited+0.016
away+0.015
 response+0.015
Neg logit contributions
tr-0.016
 Did-0.015
robe-0.015
 There-0.015
 has-0.015
Token
Token ID220
Feature activation+1.05e-02

Pos logit contributions
robe+0.005
pit+0.005
tr+0.005
 has+0.005
bridge+0.005
Neg logit contributions
ck-0.011
 breath-0.010
 swat-0.009
 tug-0.009
Run-0.009
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
tr+0.013
 has+0.011
umm+0.011
 Did+0.011
robe+0.011
Neg logit contributions
ck-0.025
 breath-0.021
rig-0.018
 agree-0.018
 tug-0.018
Token
Token ID198
Feature activation-3.13e-02

Pos logit contributions
ck+0.070
 breath+0.063
ited+0.057
 tug+0.056
away+0.055
Neg logit contributions
tr-0.026
 has-0.024
robe-0.023
 There-0.023
 She-0.022
TokenWhen
Token ID2215
Feature activation+1.42e-04

Pos logit contributions
ck+0.066
 breath+0.061
ited+0.052
 tug+0.048
away+0.047
Neg logit contributions
 She-0.053
 He-0.049
tr-0.048
 There-0.047
 Why-0.047
Token Twe
Token ID24205
Feature activation-9.31e-02

Pos logit contributions
tr+0.000
robe+0.000
 Why+0.000
 He+0.000
 has+0.000
Neg logit contributions
ck-0.000
 breath-0.000
ited-0.000
 tug-0.000
away-0.000
Tokenety
Token ID2963
Feature activation-2.33e-02

Pos logit contributions
 breath+0.198
ck+0.198
 tug+0.162
 contain+0.158
ited+0.157
Neg logit contributions
 She-0.143
tr-0.134
 He-0.129
 has-0.124
 Did-0.120
Token arrived
Token ID5284
Feature activation+2.58e-02

Pos logit contributions
ck+0.058
 breath+0.053
ited+0.047
 tug+0.044
away+0.044
Neg logit contributions
 She-0.029
tr-0.028
 He-0.027
 There-0.027
 has-0.026
Token at
Token ID379
Feature activation-2.86e-03

Pos logit contributions
tr+0.040
 She+0.039
 He+0.038
 has+0.038
 Did+0.038
Neg logit contributions
ck-0.055
 breath-0.051
ited-0.043
away-0.041
 tug-0.041
Token the
Token ID262
Feature activation-1.29e-03

Pos logit contributions
ck+0.005
 breath+0.005
ited+0.004
away+0.003
 tug+0.003
Neg logit contributions
tr-0.005
 She-0.005
 Did-0.005
 Why-0.005
 There-0.005
Token o
Token ID267
Feature activation-1.86e-02

Pos logit contributions
ck+0.002
 breath+0.002
ited+0.002
away+0.002
 response+0.002
Neg logit contributions
 Did-0.002
tr-0.002
 She-0.002
 There-0.002
 Why-0.002