TOP ACTIVATIONS
MAX = 0.121

When Daisy put the magn ifying glass down , the cater
, handing him the magn ifying glass . She looked around
, sixteen , seventeen , eighteen , nineteen , twenty .
, sixteen , seventeen , eighteen , nineteen , twenty ,
see . She sees their toes . They are red and
, Lily noticed that her toes were glowing ! She looked
bitter . It made their tongues hurt and their stomach s
and handed him the magn ifying glass . He
, fifteen , sixteen , seventeen , eighteen , nineteen ,
He brought his magn ifying glass and looked closely .
He pointed his magn ifying glass at different classmates and
, fifteen , sixteen , seventeen , eighteen , nineteen ,
happy and it made her toes start to tap . She
the necklace and the magn ifying glass and told him that
borrowed her mom 's magn ifying glass to take a closer
, twelve , thirteen , fourteen , fifteen , sixteen ,
said , holding the magn ifying glass over a black ant
she declared , " My toes are better now than before
examine it with the magn ifying glass ?" "
us . Where is your menu ?" L ily

MIN ACTIVATIONS
MIN = -0.152

loyal little girl named L ott ie . Every day ,
After a while , D ott ie was no longer scared
. When D ott ie arrived at the lab
a good dog named D ott ie . D ott ie
very special toy for L ott ie . It was a
One day , L ott ie 's mom went to
?" D ott ie per ked up and
a kind deer named D ott ie appeared . " Leave
D ott ie . D ott ie wanted to visit a
Suddenly , they sp ott led a bicycle . It
. They laugh and high - five . They
They smile and high - five each other . They
. They smile and high - five . Ben says ,
went on a scary roller co aster ride . Tim my
colourful dresses with bows and rib bons . Every day she
They played catch and tug - of - war . The
wear hats and clips and rib bons . You can be
colourful and so very high - quality . The girl was
bo o and put a band - aid on it .
. He fet ched a band age he found in the

INTERVAL 0.053 - 0.121
CONTAINS 0.779%

But remember , when the time is up , you have
the store lady where the vanilla was . She pointed to
her t eddy bear 's ear was not fine . It
who needed help , the little bird would be there and
Tom and Mia are sad and angry . They want

INTERVAL -0.015 - 0.053
CONTAINS 79.022%

see them eat and smile . One day ,
the same joy that he felt . <|endoftext|> Once upon a
the sky ! <|endoftext|> Once upon a time , there was
and avoid wasting things . <|endoftext|> Once upon a time ,
ble , Mom my ! A big , shiny , smooth

INTERVAL -0.084 - -0.015
CONTAINS 20.093%

of the whip . Daddy just laughed and said it was
After they finished cooking , Lily 's mom asked her to
end . <|endoftext|> Once upon a time , there was a
and suddenly he noticed something amazing - he could see a
loved sun fl owers . She liked to look at them

INTERVAL -0.152 - -0.084
CONTAINS 0.106%

he was having too much fun playing with them .
they had a lot of fun . The pumpkin was stubborn
were happy they tried something new . Sally also felt good
excited to explore and learn new things .
He loved to cook all sorts of things . One day
Token When
Token ID1649
Feature activation+2.11e-02

Pos logit contributions
 pulp+0.024
 fingers+0.024
ses+0.022
 stem+0.022
sed+0.022
Neg logit contributions
mother-0.025
ock-0.025
reating-0.024
ie-0.023
ty-0.023
Token Daisy
Token ID40355
Feature activation+7.24e-03

Pos logit contributions
 pulp+0.036
 fingers+0.035
 stem+0.032
ses+0.031
 toes+0.031
Neg logit contributions
ock-0.040
mother-0.039
reating-0.038
isky-0.038
ie-0.036
Token put
Token ID1234
Feature activation-1.57e-02

Pos logit contributions
 pulp+0.011
 fingers+0.011
 stem+0.010
ses+0.010
sed+0.009
Neg logit contributions
reating-0.015
ock-0.015
mother-0.014
 Beth-0.014
 operation-0.014
Token the
Token ID262
Feature activation+4.44e-03

Pos logit contributions
reating+0.029
mother+0.029
ock+0.029
isky+0.028
 Beth+0.028
Neg logit contributions
 fingers-0.027
 pulp-0.026
 stem-0.024
ses-0.024
 finger-0.023
Token magn
Token ID7842
Feature activation+2.89e-02

Pos logit contributions
 pulp+0.007
 fingers+0.007
ses+0.006
 stem+0.006
sed+0.006
Neg logit contributions
mother-0.009
 Beth-0.009
ock-0.009
reating-0.009
ie-0.008
Tokenifying
Token ID4035
Feature activation+1.21e-01

Pos logit contributions
 pulp+0.016
 fingers+0.015
ses+0.011
 stem+0.011
sed+0.010
Neg logit contributions
mother-0.087
reating-0.086
 Beth-0.086
ock-0.086
 operation-0.084
Token glass
Token ID5405
Feature activation-3.94e-04

Pos logit contributions
 pulp+0.181
 fingers+0.180
ses+0.171
sed+0.170
 toes+0.158
Neg logit contributions
mother-0.233
ad-0.232
 Beth-0.231
ock-0.225
reating-0.224
Token down
Token ID866
Feature activation+5.65e-02

Pos logit contributions
mother+0.001
reating+0.001
ock+0.001
 Beth+0.001
 operation+0.001
Neg logit contributions
 pulp-0.001
 fingers-0.001
 stem-0.001
ses-0.001
 toes-0.001
Token,
Token ID11
Feature activation+1.02e-02

Pos logit contributions
 pulp+0.101
 fingers+0.100
 stem+0.095
 toes+0.090
ses+0.088
Neg logit contributions
reating-0.102
ock-0.100
mother-0.099
 operation-0.095
isky-0.092
Token the
Token ID262
Feature activation+6.89e-03

Pos logit contributions
 pulp+0.019
 fingers+0.018
 stem+0.017
ses+0.016
 toes+0.016
Neg logit contributions
mother-0.018
ock-0.018
reating-0.017
 operation-0.016
ad-0.016
Token cater
Token ID20825
Feature activation+2.08e-02

Pos logit contributions
 pulp+0.011
 fingers+0.011
ses+0.010
 stem+0.010
sed+0.010
Neg logit contributions
mother-0.013
ock-0.013
reating-0.013
 Beth-0.013
ie-0.013
Token,
Token ID11
Feature activation-6.03e-03

Pos logit contributions
mother+0.039
reating+0.039
ock+0.038
 Beth+0.037
isky+0.036
Neg logit contributions
 fingers-0.043
 pulp-0.042
ses-0.040
sed-0.039
 stem-0.039
Token handing
Token ID22786
Feature activation+7.44e-03

Pos logit contributions
mother+0.010
 Beth+0.010
ock+0.010
reating+0.010
ie+0.009
Neg logit contributions
 pulp-0.011
 fingers-0.011
ses-0.010
 stem-0.010
sed-0.010
Token him
Token ID683
Feature activation+3.00e-02

Pos logit contributions
 fingers+0.015
 pulp+0.015
ses+0.014
 hands+0.014
 finger+0.014
Neg logit contributions
isky-0.011
ock-0.011
mother-0.010
reating-0.010
 operation-0.009
Token the
Token ID262
Feature activation+2.06e-02

Pos logit contributions
 fingers+0.050
 pulp+0.048
 finger+0.044
ses+0.043
 toes+0.043
Neg logit contributions
ock-0.057
mother-0.057
reating-0.057
 Beth-0.055
 operation-0.055
Token magn
Token ID7842
Feature activation+3.75e-02

Pos logit contributions
 pulp+0.033
 fingers+0.032
ses+0.030
 stem+0.029
sed+0.028
Neg logit contributions
mother-0.040
ock-0.039
reating-0.039
 Beth-0.039
 operation-0.038
Tokenifying
Token ID4035
Feature activation+1.21e-01

Pos logit contributions
 pulp+0.011
 fingers+0.010
ses+0.006
 stem+0.005
sed+0.004
Neg logit contributions
 Beth-0.122
mother-0.122
reating-0.121
ock-0.120
 operation-0.119
Token glass
Token ID5405
Feature activation+1.22e-02

Pos logit contributions
 pulp+0.159
 fingers+0.154
ses+0.140
sed+0.137
 toes+0.134
Neg logit contributions
mother-0.268
 Beth-0.260
reating-0.260
ock-0.258
ad-0.254
Token.
Token ID13
Feature activation+4.93e-03

Pos logit contributions
 pulp+0.023
 fingers+0.022
 stem+0.021
ses+0.021
 finger+0.020
Neg logit contributions
mother-0.021
 Beth-0.021
reating-0.021
ock-0.020
 operation-0.020
Token She
Token ID1375
Feature activation-1.82e-02

Pos logit contributions
 pulp+0.009
 fingers+0.009
ses+0.009
 stem+0.008
sed+0.008
Neg logit contributions
mother-0.008
ock-0.008
reating-0.008
 Beth-0.008
ie-0.008
Token looked
Token ID3114
Feature activation+5.11e-03

Pos logit contributions
mother+0.033
 Beth+0.032
ock+0.031
reating+0.031
ie+0.031
Neg logit contributions
 pulp-0.033
 fingers-0.031
ses-0.030
 stem-0.029
sed-0.028
Token around
Token ID1088
Feature activation+3.17e-03

Pos logit contributions
 pulp+0.011
ses+0.010
 fingers+0.010
 stem+0.010
sed+0.009
Neg logit contributions
mother-0.008
ie-0.008
 Beth-0.008
ock-0.007
ty-0.007
Token,
Token ID11
Feature activation+2.18e-02

Pos logit contributions
ses+0.155
sed+0.150
 pulp+0.150
 fingers+0.136
 stem+0.135
Neg logit contributions
ie-0.083
ock-0.083
mother-0.081
ad-0.080
reating-0.079
Token sixteen
Token ID27913
Feature activation+1.03e-01

Pos logit contributions
ses+0.029
 pulp+0.029
sed+0.028
 fingers+0.027
 finger+0.025
Neg logit contributions
ad-0.044
ock-0.043
mother-0.041
reating-0.041
 him-0.040
Token,
Token ID11
Feature activation+2.64e-02

Pos logit contributions
ses+0.250
sed+0.243
 pulp+0.242
 fingers+0.223
 stem+0.221
Neg logit contributions
ock-0.115
ie-0.113
mother-0.113
ad-0.111
reating-0.111
Token seventeen
Token ID38741
Feature activation+1.15e-01

Pos logit contributions
ses+0.032
 pulp+0.031
sed+0.030
 fingers+0.029
 finger+0.026
Neg logit contributions
ad-0.056
ock-0.056
mother-0.054
reating-0.053
 him-0.052
Token,
Token ID11
Feature activation+2.46e-02

Pos logit contributions
ses+0.282
 pulp+0.281
sed+0.274
 stem+0.260
 fingers+0.259
Neg logit contributions
ock-0.123
ie-0.123
ad-0.120
mother-0.118
reating-0.118
Token eighteen
Token ID29095
Feature activation+1.19e-01

Pos logit contributions
ses+0.032
 pulp+0.031
sed+0.030
 fingers+0.029
 finger+0.026
Neg logit contributions
ad-0.051
ock-0.050
mother-0.048
reating-0.048
 him-0.048
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
ses+0.290
 pulp+0.286
sed+0.279
 stem+0.265
 fingers+0.263
Neg logit contributions
ie-0.128
mother-0.124
ock-0.123
reating-0.119
ad-0.118
Token nineteen
Token ID43063
Feature activation+8.13e-02

Pos logit contributions
 pulp+0.023
ses+0.023
sed+0.022
 fingers+0.022
 stem+0.020
Neg logit contributions
ad-0.035
ock-0.035
mother-0.034
reating-0.034
 him-0.033
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 pulp+0.197
ses+0.196
sed+0.190
 fingers+0.181
plets+0.178
Neg logit contributions
ie-0.088
mother-0.087
ock-0.084
reating-0.083
ad-0.082
Token twenty
Token ID8208
Feature activation+9.12e-03

Pos logit contributions
ses+0.050
 pulp+0.050
sed+0.048
 fingers+0.047
 stem+0.044
Neg logit contributions
mother-0.038
ock-0.037
ad-0.037
reating-0.037
 him-0.035
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
 pulp+0.019
ses+0.019
 fingers+0.018
sed+0.018
 stem+0.017
Neg logit contributions
mother-0.013
ock-0.013
ie-0.013
reating-0.013
ad-0.012
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
ses+0.163
sed+0.157
 pulp+0.157
 fingers+0.145
sing+0.142
Neg logit contributions
ie-0.079
ock-0.075
ad-0.072
mother-0.072
reating-0.069
Token sixteen
Token ID27913
Feature activation+1.03e-01

Pos logit contributions
ses+0.028
 pulp+0.028
sed+0.027
 fingers+0.026
 finger+0.024
Neg logit contributions
ad-0.043
ock-0.042
mother-0.041
 him-0.040
reating-0.040
Token,
Token ID11
Feature activation+2.58e-02

Pos logit contributions
ses+0.253
 pulp+0.248
sed+0.245
 fingers+0.231
 finger+0.227
Neg logit contributions
ie-0.111
ock-0.108
mother-0.105
ad-0.104
reating-0.102
Token seventeen
Token ID38741
Feature activation+1.14e-01

Pos logit contributions
ses+0.030
 pulp+0.030
 fingers+0.028
sed+0.028
 toes+0.025
Neg logit contributions
ad-0.055
ock-0.054
mother-0.052
reating-0.051
 him-0.050
Token,
Token ID11
Feature activation+2.31e-02

Pos logit contributions
ses+0.283
 pulp+0.282
sed+0.275
 fingers+0.263
 stem+0.260
Neg logit contributions
ie-0.117
ock-0.114
ad-0.113
mother-0.108
reating-0.107
Token eighteen
Token ID29095
Feature activation+1.19e-01

Pos logit contributions
ses+0.028
 pulp+0.028
sed+0.026
 fingers+0.026
 finger+0.023
Neg logit contributions
ad-0.048
ock-0.047
mother-0.046
 him-0.046
reating-0.045
Token,
Token ID11
Feature activation+1.63e-02

Pos logit contributions
ses+0.290
 pulp+0.288
sed+0.279
 fingers+0.268
 stem+0.265
Neg logit contributions
ie-0.126
mother-0.119
ock-0.118
reating-0.113
ad-0.113
Token nineteen
Token ID43063
Feature activation+8.20e-02

Pos logit contributions
ses+0.021
 pulp+0.021
sed+0.019
 fingers+0.019
 stem+0.017
Neg logit contributions
ad-0.033
ock-0.033
mother-0.033
reating-0.032
 him-0.032
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 pulp+0.195
ses+0.194
sed+0.187
 fingers+0.181
plets+0.176
Neg logit contributions
ie-0.092
mother-0.090
ock-0.089
reating-0.086
ad-0.085
Token twenty
Token ID8208
Feature activation+9.50e-03

Pos logit contributions
ses+0.047
 pulp+0.046
sed+0.045
 fingers+0.044
 stem+0.041
Neg logit contributions
mother-0.040
ad-0.039
ock-0.039
reating-0.038
 Beth-0.037
Token,
Token ID11
Feature activation+1.80e-02

Pos logit contributions
 pulp+0.019
ses+0.019
sed+0.018
 fingers+0.018
 stem+0.017
Neg logit contributions
ie-0.014
mother-0.014
ock-0.014
reating-0.013
ad-0.013
Token see
Token ID766
Feature activation+1.40e-03

Pos logit contributions
 pulp+0.010
 fingers+0.009
ses+0.009
 stem+0.009
sed+0.008
Neg logit contributions
mother-0.010
ock-0.010
reating-0.010
 Beth-0.010
ie-0.009
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 pulp+0.003
 fingers+0.003
ses+0.003
 stem+0.003
sed+0.003
Neg logit contributions
mother-0.002
ock-0.002
 Beth-0.002
reating-0.002
ie-0.002
Token She
Token ID1375
Feature activation-7.32e-03

Pos logit contributions
 pulp+0.036
 fingers+0.034
ses+0.033
 stem+0.033
sed+0.032
Neg logit contributions
mother-0.016
 Beth-0.016
ock-0.015
reating-0.015
ie-0.014
Token sees
Token ID7224
Feature activation+8.94e-03

Pos logit contributions
ock+0.019
reating+0.019
mother+0.019
 Beth+0.019
 operation+0.018
Neg logit contributions
 fingers-0.007
 pulp-0.006
ses-0.006
sed-0.006
 stem-0.005
Token their
Token ID511
Feature activation+2.87e-02

Pos logit contributions
 pulp+0.019
 fingers+0.018
ses+0.017
 stem+0.016
sed+0.016
Neg logit contributions
mother-0.013
ock-0.013
 Beth-0.013
reating-0.012
 operation-0.012
Token toes
Token ID23932
Feature activation+1.18e-01

Pos logit contributions
 pulp+0.044
plets+0.038
 stem+0.037
 fingers+0.037
ses+0.037
Neg logit contributions
mother-0.057
reating-0.056
 Beth-0.055
ock-0.054
ie-0.054
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
 pulp+0.221
 fingers+0.213
ses+0.202
 stem+0.199
sed+0.196
Neg logit contributions
mother-0.203
 Beth-0.197
ock-0.197
reating-0.197
ie-0.188
Token They
Token ID1119
Feature activation+2.11e-02

Pos logit contributions
 pulp+0.048
 fingers+0.047
ses+0.045
 stem+0.044
sed+0.044
Neg logit contributions
mother-0.025
ock-0.024
reating-0.024
 Beth-0.024
ie-0.023
Token are
Token ID389
Feature activation-4.22e-03

Pos logit contributions
 pulp+0.033
 fingers+0.032
ses+0.030
 stem+0.029
sed+0.028
Neg logit contributions
mother-0.043
ock-0.042
reating-0.042
 Beth-0.042
ie-0.040
Token red
Token ID2266
Feature activation+3.36e-02

Pos logit contributions
mother+0.008
 Beth+0.008
ock+0.008
reating+0.008
ie+0.008
Neg logit contributions
 pulp-0.007
 fingers-0.007
ses-0.006
 stem-0.006
sed-0.006
Token and
Token ID290
Feature activation+3.15e-02

Pos logit contributions
 pulp+0.060
 fingers+0.059
ses+0.056
 stem+0.054
sed+0.054
Neg logit contributions
mother-0.061
 Beth-0.059
reating-0.058
ock-0.058
ie-0.055
Token,
Token ID11
Feature activation-7.07e-03

Pos logit contributions
 pulp+0.062
 fingers+0.060
 stem+0.055
ses+0.054
 toes+0.054
Neg logit contributions
reating-0.060
ock-0.060
mother-0.059
isky-0.055
 operation-0.054
Token Lily
Token ID20037
Feature activation-2.50e-02

Pos logit contributions
ock+0.012
mother+0.012
reating+0.011
ie+0.011
isky+0.011
Neg logit contributions
 pulp-0.014
 fingers-0.013
 stem-0.012
ses-0.012
sed-0.012
Token noticed
Token ID6810
Feature activation+2.65e-02

Pos logit contributions
mother+0.048
 Beth+0.047
ie+0.046
ock+0.046
reating+0.045
Neg logit contributions
 pulp-0.041
 fingers-0.039
ses-0.038
 stem-0.037
sed-0.036
Token that
Token ID326
Feature activation-1.23e-02

Pos logit contributions
 pulp+0.040
 fingers+0.039
 stem+0.035
 finger+0.035
ses+0.035
Neg logit contributions
mother-0.052
ock-0.051
reating-0.051
 operation-0.049
isky-0.049
Token her
Token ID607
Feature activation+2.44e-02

Pos logit contributions
mother+0.019
ock+0.019
isky+0.018
reating+0.017
 operation+0.017
Neg logit contributions
 pulp-0.025
 fingers-0.025
 stem-0.023
 finger-0.022
 toes-0.022
Token toes
Token ID23932
Feature activation+1.17e-01

Pos logit contributions
 pulp+0.040
 fingers+0.035
ses+0.033
 stem+0.033
plets+0.032
Neg logit contributions
 Beth-0.047
mother-0.046
 operation-0.046
ie-0.045
ock-0.045
Token were
Token ID547
Feature activation+1.58e-02

Pos logit contributions
 pulp+0.216
 fingers+0.207
ses+0.199
 stem+0.193
sed+0.191
Neg logit contributions
mother-0.203
ock-0.198
 Beth-0.198
reating-0.196
ie-0.192
Token glowing
Token ID21377
Feature activation+1.72e-02

Pos logit contributions
 pulp+0.026
 fingers+0.025
 stem+0.023
ses+0.023
sed+0.022
Neg logit contributions
mother-0.031
reating-0.031
ock-0.030
 Beth-0.030
ie-0.029
Token!
Token ID0
Feature activation+8.36e-03

Pos logit contributions
 fingers+0.028
 pulp+0.027
 stem+0.025
 toes+0.025
 finger+0.023
Neg logit contributions
mother-0.034
reating-0.033
 Beth-0.033
ock-0.032
isky-0.031
Token She
Token ID1375
Feature activation-1.70e-02

Pos logit contributions
 fingers+0.016
 pulp+0.016
 stem+0.016
ses+0.015
sed+0.015
Neg logit contributions
ock-0.014
mother-0.014
reating-0.014
ie-0.013
ty-0.013
Token looked
Token ID3114
Feature activation+3.40e-03

Pos logit contributions
mother+0.032
ock+0.031
reating+0.031
 Beth+0.031
ie+0.030
Neg logit contributions
 pulp-0.029
 fingers-0.028
ses-0.026
 stem-0.026
sed-0.026
Token bitter
Token ID12922
Feature activation+3.24e-02

Pos logit contributions
 pulp+0.042
 fingers+0.040
ses+0.040
sed+0.038
 stem+0.037
Neg logit contributions
 Beth-0.052
ie-0.051
mother-0.051
ock-0.049
ty-0.049
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
 pulp+0.067
 fingers+0.064
ses+0.062
 stem+0.061
sed+0.060
Neg logit contributions
mother-0.049
ock-0.048
reating-0.047
 Beth-0.047
ie-0.046
Token It
Token ID632
Feature activation-1.63e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
ses+0.020
 stem+0.020
sed+0.020
Neg logit contributions
 Beth-0.018
mother-0.017
ock-0.016
reating-0.016
ad-0.016
Token made
Token ID925
Feature activation+1.90e-02

Pos logit contributions
ie+0.026
ty+0.026
mother+0.025
 Beth+0.024
ad+0.024
Neg logit contributions
 pulp-0.030
 fingers-0.029
ses-0.028
 toes-0.027
plets-0.027
Token their
Token ID511
Feature activation+3.33e-02

Pos logit contributions
 pulp+0.040
 fingers+0.040
ses+0.036
 stem+0.035
sed+0.035
Neg logit contributions
mother-0.029
reating-0.027
ock-0.027
isky-0.026
 Beth-0.026
Token tongues
Token ID39413
Feature activation+1.17e-01

Pos logit contributions
 pulp+0.035
plets+0.029
ses+0.027
 stem+0.027
 fingers+0.027
Neg logit contributions
mother-0.082
ie-0.081
 Beth-0.080
reating-0.079
 operation-0.078
Token hurt
Token ID5938
Feature activation+4.22e-02

Pos logit contributions
ses+0.191
 pulp+0.188
sed+0.182
 fingers+0.171
 stem+0.160
Neg logit contributions
ie-0.215
ock-0.204
ty-0.203
 him-0.202
reating-0.200
Token and
Token ID290
Feature activation+2.67e-02

Pos logit contributions
 fingers+0.069
 pulp+0.065
 toes+0.061
ses+0.059
 finger+0.058
Neg logit contributions
mother-0.084
reating-0.081
 operation-0.079
 Beth-0.079
isky-0.079
Token their
Token ID511
Feature activation+2.30e-02

Pos logit contributions
 pulp+0.041
plets+0.039
ses+0.038
 fingers+0.038
 stem+0.037
Neg logit contributions
ie-0.047
 Beth-0.047
 him-0.046
mother-0.045
ty-0.044
Token stomach
Token ID11384
Feature activation+8.76e-02

Pos logit contributions
 pulp+0.022
 fingers+0.019
ses+0.018
 stem+0.017
sed+0.016
Neg logit contributions
mother-0.059
 Beth-0.058
reating-0.058
ie-0.057
ock-0.057
Tokens
Token ID82
Feature activation+5.74e-03

Pos logit contributions
 pulp+0.150
 fingers+0.141
ses+0.140
sed+0.134
 stem+0.132
Neg logit contributions
mother-0.156
ock-0.156
ie-0.155
 Beth-0.153
ty-0.151
Token and
Token ID290
Feature activation+3.36e-03

Pos logit contributions
 Beth+0.001
reating+0.001
mother+0.001
 Barry+0.001
Ell+0.001
Neg logit contributions
 fingers-0.000
 tongue-0.000
 finger-0.000
 mouth-0.000
 nose-0.000
Token handed
Token ID10158
Feature activation-1.22e-02

Pos logit contributions
 pulp+0.005
 fingers+0.005
ses+0.005
 stem+0.005
sed+0.005
Neg logit contributions
ock-0.007
reating-0.007
mother-0.007
 Beth-0.006
 operation-0.006
Token him
Token ID683
Feature activation+3.04e-02

Pos logit contributions
 operation+0.021
 Beth+0.021
isky+0.020
ock+0.020
mother+0.020
Neg logit contributions
 fingers-0.022
 pulp-0.021
 finger-0.019
 toes-0.019
 hands-0.019
Token the
Token ID262
Feature activation+2.73e-02

Pos logit contributions
 fingers+0.049
 pulp+0.048
 finger+0.043
 stem+0.042
 toes+0.041
Neg logit contributions
ock-0.060
 Beth-0.059
mother-0.059
reating-0.058
 operation-0.056
Token magn
Token ID7842
Feature activation+3.71e-02

Pos logit contributions
 pulp+0.047
 fingers+0.044
ses+0.044
sed+0.042
 stem+0.041
Neg logit contributions
mother-0.050
ock-0.048
reating-0.047
 Beth-0.047
ie-0.046
Tokenifying
Token ID4035
Feature activation+1.15e-01

Pos logit contributions
 pulp+0.024
 fingers+0.022
 stem+0.018
ses+0.017
sed+0.015
Neg logit contributions
 Beth-0.108
mother-0.108
reating-0.108
ock-0.107
 operation-0.106
Token glass
Token ID5405
Feature activation+1.76e-02

Pos logit contributions
 pulp+0.144
 fingers+0.138
ses+0.129
sed+0.125
 toes+0.120
Neg logit contributions
mother-0.266
 Beth-0.257
reating-0.257
ock-0.254
ad-0.250
Token.
Token ID13
Feature activation+1.91e-03

Pos logit contributions
 pulp+0.034
 fingers+0.033
 stem+0.031
ses+0.030
 finger+0.030
Neg logit contributions
mother-0.030
 Beth-0.030
reating-0.029
ock-0.029
 operation-0.029
Token
Token ID198
Feature activation-5.31e-03

Pos logit contributions
 pulp+0.004
 fingers+0.003
ses+0.003
 stem+0.003
sed+0.003
Neg logit contributions
mother-0.003
 Beth-0.003
ock-0.003
reating-0.003
ie-0.003
Token
Token ID198
Feature activation-7.12e-04

Pos logit contributions
mother+0.011
reating+0.011
ock+0.011
 operation+0.010
 Beth+0.010
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.007
 stem-0.007
sed-0.007
TokenHe
Token ID1544
Feature activation+1.64e-02

Pos logit contributions
mother+0.002
ock+0.001
reating+0.001
 Beth+0.001
ie+0.001
Neg logit contributions
 pulp-0.001
 fingers-0.001
ses-0.001
sed-0.001
 stem-0.001
Token,
Token ID11
Feature activation+1.79e-02

Pos logit contributions
ses+0.295
sed+0.285
 pulp+0.270
ifying+0.261
sing+0.257
Neg logit contributions
ie-0.102
 him-0.101
ad-0.098
mother-0.094
ock-0.091
Token fifteen
Token ID17280
Feature activation+6.80e-02

Pos logit contributions
ses+0.029
 pulp+0.029
sed+0.028
 fingers+0.028
 stem+0.026
Neg logit contributions
ock-0.032
ad-0.031
reating-0.031
mother-0.031
 him-0.030
Token,
Token ID11
Feature activation+2.18e-02

Pos logit contributions
ses+0.155
sed+0.150
 pulp+0.150
 fingers+0.136
 stem+0.135
Neg logit contributions
ie-0.083
ock-0.083
mother-0.081
ad-0.080
reating-0.079
Token sixteen
Token ID27913
Feature activation+1.03e-01

Pos logit contributions
ses+0.029
 pulp+0.029
sed+0.028
 fingers+0.027
 finger+0.025
Neg logit contributions
ad-0.044
ock-0.043
mother-0.041
reating-0.041
 him-0.040
Token,
Token ID11
Feature activation+2.64e-02

Pos logit contributions
ses+0.250
sed+0.243
 pulp+0.242
 fingers+0.223
 stem+0.221
Neg logit contributions
ock-0.115
ie-0.113
mother-0.113
ad-0.111
reating-0.111
Token seventeen
Token ID38741
Feature activation+1.15e-01

Pos logit contributions
ses+0.032
 pulp+0.031
sed+0.030
 fingers+0.029
 finger+0.026
Neg logit contributions
ad-0.056
ock-0.056
mother-0.054
reating-0.053
 him-0.052
Token,
Token ID11
Feature activation+2.46e-02

Pos logit contributions
ses+0.282
 pulp+0.281
sed+0.274
 stem+0.260
 fingers+0.259
Neg logit contributions
ock-0.123
ie-0.123
ad-0.120
mother-0.118
reating-0.118
Token eighteen
Token ID29095
Feature activation+1.19e-01

Pos logit contributions
ses+0.032
 pulp+0.031
sed+0.030
 fingers+0.029
 finger+0.026
Neg logit contributions
ad-0.051
ock-0.050
mother-0.048
reating-0.048
 him-0.048
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
ses+0.290
 pulp+0.286
sed+0.279
 stem+0.265
 fingers+0.263
Neg logit contributions
ie-0.128
mother-0.124
ock-0.123
reating-0.119
ad-0.118
Token nineteen
Token ID43063
Feature activation+8.13e-02

Pos logit contributions
 pulp+0.023
ses+0.023
sed+0.022
 fingers+0.022
 stem+0.020
Neg logit contributions
ad-0.035
ock-0.035
mother-0.034
reating-0.034
 him-0.033
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 pulp+0.197
ses+0.196
sed+0.190
 fingers+0.181
plets+0.178
Neg logit contributions
ie-0.088
mother-0.087
ock-0.084
reating-0.083
ad-0.082
Token
Token ID198
Feature activation-1.37e-03

Pos logit contributions
mother+0.010
reating+0.010
ock+0.010
 Beth+0.010
 operation+0.010
Neg logit contributions
 fingers-0.007
 pulp-0.007
ses-0.006
 finger-0.006
 stem-0.006
TokenHe
Token ID1544
Feature activation+1.10e-02

Pos logit contributions
mother+0.003
reating+0.003
 Beth+0.003
ock+0.003
ie+0.003
Neg logit contributions
 pulp-0.002
 fingers-0.002
 stem-0.002
ses-0.002
 finger-0.002
Token brought
Token ID3181
Feature activation+2.61e-03

Pos logit contributions
 fingers+0.016
 pulp+0.016
sed+0.014
 stem+0.014
 finger+0.013
Neg logit contributions
reating-0.025
ock-0.024
mother-0.023
 Beth-0.023
 operation-0.022
Token his
Token ID465
Feature activation+2.37e-02

Pos logit contributions
 pulp+0.005
 fingers+0.005
ses+0.004
 stem+0.004
sed+0.004
Neg logit contributions
mother-0.005
reating-0.005
ock-0.005
 Beth-0.004
isky-0.004
Token magn
Token ID7842
Feature activation+3.33e-02

Pos logit contributions
 pulp+0.040
plets+0.037
ses+0.034
 stem+0.034
 fingers+0.033
Neg logit contributions
mother-0.044
 Beth-0.043
 operation-0.042
ie-0.041
ock-0.039
Tokenifying
Token ID4035
Feature activation+1.15e-01

Pos logit contributions
 pulp+0.021
 fingers+0.019
ses+0.016
 stem+0.015
sed+0.014
Neg logit contributions
mother-0.098
ock-0.097
 Beth-0.097
reating-0.097
ie-0.094
Token glass
Token ID5405
Feature activation+5.34e-03

Pos logit contributions
 pulp+0.151
 fingers+0.142
ses+0.132
 stem+0.126
sed+0.126
Neg logit contributions
mother-0.260
 Beth-0.255
reating-0.253
ock-0.252
 operation-0.243
Token and
Token ID290
Feature activation+8.37e-03

Pos logit contributions
 pulp+0.010
 fingers+0.010
 stem+0.009
ses+0.009
 finger+0.009
Neg logit contributions
mother-0.010
reating-0.009
 Beth-0.009
ock-0.009
 operation-0.009
Token looked
Token ID3114
Feature activation+3.12e-03

Pos logit contributions
 pulp+0.015
ses+0.014
 fingers+0.014
 stem+0.013
sed+0.013
Neg logit contributions
 Beth-0.015
mother-0.015
ie-0.014
ock-0.014
ty-0.014
Token closely
Token ID7173
Feature activation-8.32e-03

Pos logit contributions
 pulp+0.007
 fingers+0.006
ses+0.006
 stem+0.006
sed+0.006
Neg logit contributions
mother-0.004
 Beth-0.004
ie-0.004
ock-0.004
reating-0.004
Token.
Token ID13
Feature activation-7.65e-03

Pos logit contributions
mother+0.012
ock+0.012
 Beth+0.012
reating+0.012
ie+0.012
Neg logit contributions
 pulp-0.017
 fingers-0.017
ses-0.016
 stem-0.016
sed-0.016
Token
Token ID198
Feature activation+4.55e-04

Pos logit contributions
reating+0.014
ock+0.013
mother+0.013
ie+0.013
 operation+0.013
Neg logit contributions
 pulp-0.008
 fingers-0.008
 finger-0.007
 stem-0.007
ses-0.007
TokenHe
Token ID1544
Feature activation+1.22e-02

Pos logit contributions
 pulp+0.001
 fingers+0.001
 stem+0.001
 finger+0.001
ses+0.001
Neg logit contributions
reating-0.001
ock-0.001
mother-0.001
ie-0.001
 Beth-0.001
Token pointed
Token ID6235
Feature activation+3.22e-02

Pos logit contributions
 pulp+0.019
 fingers+0.019
ses+0.017
 stem+0.017
sed+0.017
Neg logit contributions
reating-0.025
ock-0.025
mother-0.025
 Beth-0.024
ie-0.023
Token his
Token ID465
Feature activation+1.65e-02

Pos logit contributions
 pulp+0.065
 fingers+0.064
ses+0.059
 stem+0.059
sed+0.058
Neg logit contributions
mother-0.051
ock-0.049
 Beth-0.049
reating-0.049
ie-0.046
Token magn
Token ID7842
Feature activation+3.15e-02

Pos logit contributions
 pulp+0.027
ses+0.023
 fingers+0.023
 stem+0.022
plets+0.022
Neg logit contributions
mother-0.033
 Beth-0.031
 operation-0.030
ie-0.030
ock-0.029
Tokenifying
Token ID4035
Feature activation+1.15e-01

Pos logit contributions
 pulp+0.015
 fingers+0.013
ses+0.010
 stem+0.009
sed+0.008
Neg logit contributions
mother-0.098
ock-0.097
reating-0.097
 Beth-0.096
ie-0.094
Token glass
Token ID5405
Feature activation+1.11e-02

Pos logit contributions
 pulp+0.144
 fingers+0.136
ses+0.126
 stem+0.121
sed+0.120
Neg logit contributions
mother-0.267
 Beth-0.261
reating-0.260
ock-0.259
ie-0.251
Token at
Token ID379
Feature activation+4.06e-02

Pos logit contributions
 pulp+0.019
 fingers+0.019
 stem+0.017
ses+0.017
 finger+0.016
Neg logit contributions
mother-0.021
 Beth-0.021
reating-0.021
ock-0.021
 operation-0.020
Token different
Token ID1180
Feature activation-2.29e-02

Pos logit contributions
 fingers+0.074
 pulp+0.071
ses+0.065
 stem+0.065
 toes+0.064
Neg logit contributions
reating-0.071
ock-0.071
isky-0.069
 Beth-0.069
mother-0.069
Token classmates
Token ID28999
Feature activation+8.12e-03

Pos logit contributions
mother+0.046
ock+0.045
 Beth+0.045
reating+0.044
 operation+0.043
Neg logit contributions
 pulp-0.036
 fingers-0.036
 stem-0.032
ses-0.032
sed-0.031
Token and
Token ID290
Feature activation+8.86e-03

Pos logit contributions
 fingers+0.014
 pulp+0.013
 toes+0.012
 finger+0.012
 stem+0.012
Neg logit contributions
 Beth-0.015
ock-0.015
reating-0.015
mother-0.015
 operation-0.014
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
ses+0.299
sed+0.288
 pulp+0.278
sing+0.268
ifying+0.264
Neg logit contributions
ie-0.103
 him-0.096
ad-0.094
ock-0.090
mother-0.089
Token fifteen
Token ID17280
Feature activation+6.87e-02

Pos logit contributions
ses+0.027
 pulp+0.026
sed+0.026
 fingers+0.025
 stem+0.024
Neg logit contributions
ock-0.032
ad-0.032
mother-0.032
reating-0.031
 him-0.031
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
ses+0.163
sed+0.157
 pulp+0.157
 fingers+0.145
sing+0.142
Neg logit contributions
ie-0.079
ock-0.075
ad-0.072
mother-0.072
reating-0.069
Token sixteen
Token ID27913
Feature activation+1.03e-01

Pos logit contributions
ses+0.028
 pulp+0.028
sed+0.027
 fingers+0.026
 finger+0.024
Neg logit contributions
ad-0.043
ock-0.042
mother-0.041
 him-0.040
reating-0.040
Token,
Token ID11
Feature activation+2.58e-02

Pos logit contributions
ses+0.253
 pulp+0.248
sed+0.245
 fingers+0.231
 finger+0.227
Neg logit contributions
ie-0.111
ock-0.108
mother-0.105
ad-0.104
reating-0.102
Token seventeen
Token ID38741
Feature activation+1.14e-01

Pos logit contributions
ses+0.030
 pulp+0.030
 fingers+0.028
sed+0.028
 toes+0.025
Neg logit contributions
ad-0.055
ock-0.054
mother-0.052
reating-0.051
 him-0.050
Token,
Token ID11
Feature activation+2.31e-02

Pos logit contributions
ses+0.283
 pulp+0.282
sed+0.275
 fingers+0.263
 stem+0.260
Neg logit contributions
ie-0.117
ock-0.114
ad-0.113
mother-0.108
reating-0.107
Token eighteen
Token ID29095
Feature activation+1.19e-01

Pos logit contributions
ses+0.028
 pulp+0.028
sed+0.026
 fingers+0.026
 finger+0.023
Neg logit contributions
ad-0.048
ock-0.047
mother-0.046
 him-0.046
reating-0.045
Token,
Token ID11
Feature activation+1.63e-02

Pos logit contributions
ses+0.290
 pulp+0.288
sed+0.279
 fingers+0.268
 stem+0.265
Neg logit contributions
ie-0.126
mother-0.119
ock-0.118
reating-0.113
ad-0.113
Token nineteen
Token ID43063
Feature activation+8.20e-02

Pos logit contributions
ses+0.021
 pulp+0.021
sed+0.019
 fingers+0.019
 stem+0.017
Neg logit contributions
ad-0.033
ock-0.033
mother-0.033
reating-0.032
 him-0.032
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 pulp+0.195
ses+0.194
sed+0.187
 fingers+0.181
plets+0.176
Neg logit contributions
ie-0.092
mother-0.090
ock-0.089
reating-0.086
ad-0.085
Token happy
Token ID3772
Feature activation+9.46e-04

Pos logit contributions
 pulp+0.030
 fingers+0.029
 stem+0.027
ses+0.027
sed+0.026
Neg logit contributions
mother-0.034
reating-0.033
ock-0.032
 Beth-0.031
ie-0.030
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
 pulp+0.002
 fingers+0.002
ses+0.002
 stem+0.002
sed+0.002
Neg logit contributions
mother-0.002
 Beth-0.001
ock-0.001
reating-0.001
 operation-0.001
Token it
Token ID340
Feature activation-2.38e-02

Pos logit contributions
mother+0.032
ock+0.032
 Beth+0.031
reating+0.031
ie+0.030
Neg logit contributions
 pulp-0.032
 fingers-0.030
ses-0.029
 stem-0.028
sed-0.028
Token made
Token ID925
Feature activation+2.45e-03

Pos logit contributions
reating+0.047
mother+0.046
 Beth+0.045
ock+0.045
 operation+0.043
Neg logit contributions
 pulp-0.038
 fingers-0.037
 stem-0.034
ses-0.034
sed-0.033
Token her
Token ID607
Feature activation+3.14e-02

Pos logit contributions
 fingers+0.004
 pulp+0.004
 hands+0.004
 toes+0.004
 finger+0.004
Neg logit contributions
mother-0.004
isky-0.004
ock-0.004
reating-0.004
oot-0.004
Token toes
Token ID23932
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.054
 fingers+0.052
ses+0.049
 stem+0.048
sed+0.047
Neg logit contributions
mother-0.059
ock-0.057
 Beth-0.057
reating-0.057
ie-0.055
Token start
Token ID923
Feature activation+1.24e-02

Pos logit contributions
 pulp+0.206
 fingers+0.202
 stem+0.187
ses+0.185
 toes+0.179
Neg logit contributions
mother-0.202
 Beth-0.195
reating-0.192
ock-0.192
 operation-0.184
Token to
Token ID284
Feature activation+9.24e-03

Pos logit contributions
 fingers+0.024
 pulp+0.024
 stem+0.022
 toes+0.022
 tongue+0.021
Neg logit contributions
mother-0.020
ock-0.020
 Beth-0.019
reating-0.019
 operation-0.019
Token tap
Token ID9814
Feature activation-1.91e-02

Pos logit contributions
 pulp+0.016
 fingers+0.015
ses+0.014
 stem+0.014
sed+0.014
Neg logit contributions
mother-0.017
 Beth-0.017
reating-0.017
ock-0.017
ie-0.016
Token.
Token ID13
Feature activation+2.53e-03

Pos logit contributions
mother+0.035
reating+0.033
ock+0.032
 Beth+0.032
isky+0.032
Neg logit contributions
 fingers-0.034
 pulp-0.034
 stem-0.031
 toes-0.031
 finger-0.030
Token She
Token ID1375
Feature activation-1.63e-02

Pos logit contributions
 pulp+0.004
 fingers+0.004
 stem+0.004
ses+0.004
sed+0.004
Neg logit contributions
mother-0.005
ock-0.004
reating-0.004
ie-0.004
ty-0.004
Token the
Token ID262
Feature activation+3.25e-02

Pos logit contributions
 fingers+0.076
 pulp+0.076
 finger+0.069
 toes+0.067
ses+0.066
Neg logit contributions
ock-0.079
mother-0.076
 Beth-0.075
reating-0.075
 operation-0.075
Token necklace
Token ID38987
Feature activation+1.86e-02

Pos logit contributions
 pulp+0.054
 fingers+0.051
ses+0.049
sed+0.047
 stem+0.047
Neg logit contributions
mother-0.062
reating-0.060
 Beth-0.060
ock-0.060
ie-0.058
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
 fingers+0.027
 pulp+0.026
ses+0.025
 finger+0.025
 stem+0.025
Neg logit contributions
 Beth-0.038
ock-0.038
reating-0.038
mother-0.037
 operation-0.037
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 pulp+0.036
 fingers+0.035
ses+0.033
 stem+0.032
sed+0.032
Neg logit contributions
mother-0.036
ock-0.035
reating-0.035
 Beth-0.035
ie-0.033
Token magn
Token ID7842
Feature activation+3.41e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
ses+0.020
 stem+0.020
sed+0.020
Neg logit contributions
mother-0.026
reating-0.025
 Beth-0.025
ock-0.025
ie-0.024
Tokenifying
Token ID4035
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.023
 fingers+0.021
ses+0.017
 stem+0.017
sed+0.016
Neg logit contributions
 Beth-0.098
mother-0.098
reating-0.098
ock-0.097
 operation-0.097
Token glass
Token ID5405
Feature activation+1.50e-02

Pos logit contributions
 fingers+0.156
 pulp+0.154
 toes+0.143
ses+0.143
sed+0.141
Neg logit contributions
mother-0.241
ad-0.232
 Beth-0.230
reating-0.229
ock-0.223
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 pulp+0.028
 fingers+0.027
 stem+0.026
ses+0.025
 finger+0.024
Neg logit contributions
 Beth-0.027
mother-0.026
reating-0.026
ock-0.025
 operation-0.025
Token told
Token ID1297
Feature activation-1.14e-02

Pos logit contributions
 pulp+0.021
 fingers+0.020
ses+0.019
 stem+0.019
sed+0.018
Neg logit contributions
mother-0.020
 Beth-0.019
reating-0.019
ock-0.019
ie-0.018
Token him
Token ID683
Feature activation+3.19e-02

Pos logit contributions
mother+0.017
Ell+0.016
 Beth+0.016
ock+0.016
isky+0.015
Neg logit contributions
 fingers-0.025
 pulp-0.023
 toes-0.022
 finger-0.022
 tongue-0.021
Token that
Token ID326
Feature activation-1.32e-02

Pos logit contributions
 fingers+0.052
 pulp+0.051
 finger+0.047
 toes+0.046
 mouth+0.045
Neg logit contributions
ock-0.063
mother-0.060
 operation-0.058
reating-0.057
 Beth-0.056
Token borrowed
Token ID22546
Feature activation-5.34e-03

Pos logit contributions
mother+0.038
 Beth+0.037
ock+0.037
reating+0.037
ie+0.036
Neg logit contributions
 pulp-0.040
 fingers-0.038
ses-0.037
 stem-0.036
sed-0.035
Token her
Token ID607
Feature activation+5.57e-03

Pos logit contributions
mother+0.010
ock+0.010
reating+0.010
 Beth+0.009
 operation+0.009
Neg logit contributions
 pulp-0.010
 fingers-0.009
 stem-0.009
ses-0.009
 finger-0.008
Token mom
Token ID1995
Feature activation-1.14e-03

Pos logit contributions
 pulp+0.010
plets+0.009
ses+0.009
 fingers+0.009
sed+0.008
Neg logit contributions
 Beth-0.010
mother-0.010
ie-0.009
 operation-0.009
ty-0.009
Token's
Token ID338
Feature activation+2.46e-02

Pos logit contributions
 operation+0.002
reating+0.002
 Beth+0.002
ock+0.002
isky+0.001
Neg logit contributions
 fingers-0.002
 pulp-0.002
 stem-0.002
 finger-0.002
 tongue-0.002
Token magn
Token ID7842
Feature activation+3.12e-02

Pos logit contributions
 pulp+0.038
 fingers+0.036
ses+0.034
 stem+0.033
sed+0.033
Neg logit contributions
mother-0.050
 Beth-0.049
ock-0.049
reating-0.049
ie-0.047
Tokenifying
Token ID4035
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.014
 fingers+0.014
ses+0.010
 stem+0.009
sed+0.009
Neg logit contributions
reating-0.097
mother-0.097
ock-0.097
 Beth-0.096
 operation-0.095
Token glass
Token ID5405
Feature activation+9.92e-03

Pos logit contributions
 pulp+0.139
 fingers+0.131
ses+0.120
sed+0.114
 stem+0.114
Neg logit contributions
mother-0.266
 Beth-0.261
reating-0.258
ock-0.257
 operation-0.250
Token to
Token ID284
Feature activation+6.65e-04

Pos logit contributions
 fingers+0.018
 pulp+0.018
 stem+0.017
ses+0.016
 finger+0.016
Neg logit contributions
mother-0.018
reating-0.018
 Beth-0.018
ock-0.018
 operation-0.017
Token take
Token ID1011
Feature activation+3.50e-03

Pos logit contributions
 pulp+0.001
 fingers+0.001
 stem+0.001
ses+0.001
sed+0.001
Neg logit contributions
mother-0.001
reating-0.001
ock-0.001
 Beth-0.001
 operation-0.001
Token a
Token ID257
Feature activation-1.79e-03

Pos logit contributions
 fingers+0.006
 pulp+0.005
 hands+0.005
 stem+0.005
 finger+0.005
Neg logit contributions
reating-0.007
isky-0.007
ock-0.007
mother-0.006
annie-0.006
Token closer
Token ID5699
Feature activation-3.22e-03

Pos logit contributions
mother+0.003
 Beth+0.003
ock+0.003
reating+0.003
ie+0.003
Neg logit contributions
 pulp-0.003
 fingers-0.003
ses-0.003
sed-0.003
 stem-0.003
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
 pulp+0.099
ses+0.093
 fingers+0.092
sed+0.090
 stem+0.087
Neg logit contributions
mother-0.062
ock-0.060
ie-0.060
reating-0.060
 Beth-0.059
Token twelve
Token ID14104
Feature activation+6.41e-02

Pos logit contributions
 pulp+0.017
ses+0.016
sed+0.016
 fingers+0.015
 stem+0.014
Neg logit contributions
mother-0.020
ock-0.020
ad-0.020
 Beth-0.020
reating-0.020
Token,
Token ID11
Feature activation+1.56e-02

Pos logit contributions
 pulp+0.140
ses+0.132
 fingers+0.131
sed+0.128
 stem+0.123
Neg logit contributions
mother-0.087
ock-0.085
ie-0.084
reating-0.084
 Beth-0.082
Token thirteen
Token ID28306
Feature activation+1.10e-01

Pos logit contributions
 pulp+0.023
ses+0.022
 fingers+0.022
sed+0.021
 stem+0.019
Neg logit contributions
mother-0.030
ock-0.029
ad-0.029
reating-0.029
 Beth-0.029
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
ses+0.266
 pulp+0.265
sed+0.258
 fingers+0.247
 finger+0.241
Neg logit contributions
ie-0.118
ock-0.111
ad-0.104
ty-0.102
mother-0.101
Token fourteen
Token ID29167
Feature activation+1.13e-01

Pos logit contributions
ses+0.016
 pulp+0.015
sed+0.015
 fingers+0.015
 finger+0.013
Neg logit contributions
ock-0.028
ad-0.027
mother-0.027
 him-0.027
reating-0.027
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
ses+0.299
sed+0.288
 pulp+0.278
sing+0.268
ifying+0.264
Neg logit contributions
ie-0.103
 him-0.096
ad-0.094
ock-0.090
mother-0.089
Token fifteen
Token ID17280
Feature activation+6.87e-02

Pos logit contributions
ses+0.027
 pulp+0.026
sed+0.026
 fingers+0.025
 stem+0.024
Neg logit contributions
ock-0.032
ad-0.032
mother-0.032
reating-0.031
 him-0.031
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
ses+0.163
sed+0.157
 pulp+0.157
 fingers+0.145
sing+0.142
Neg logit contributions
ie-0.079
ock-0.075
ad-0.072
mother-0.072
reating-0.069
Token sixteen
Token ID27913
Feature activation+1.03e-01

Pos logit contributions
ses+0.028
 pulp+0.028
sed+0.027
 fingers+0.026
 finger+0.024
Neg logit contributions
ad-0.043
ock-0.042
mother-0.041
 him-0.040
reating-0.040
Token,
Token ID11
Feature activation+2.58e-02

Pos logit contributions
ses+0.253
 pulp+0.248
sed+0.245
 fingers+0.231
 finger+0.227
Neg logit contributions
ie-0.111
ock-0.108
mother-0.105
ad-0.104
reating-0.102
Token said
Token ID531
Feature activation-2.68e-02

Pos logit contributions
mother+0.098
ock+0.096
reating+0.096
 Beth+0.095
ie+0.091
Neg logit contributions
 pulp-0.091
 fingers-0.088
ses-0.083
 stem-0.082
sed-0.081
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
reating+0.047
ock+0.047
mother+0.047
 Beth+0.043
isky+0.043
Neg logit contributions
 fingers-0.050
 pulp-0.048
 finger-0.045
ses-0.045
 stem-0.045
Token holding
Token ID4769
Feature activation+1.18e-02

Pos logit contributions
 Beth+0.018
mother+0.018
ty+0.017
ad+0.017
ie+0.017
Neg logit contributions
 pulp-0.020
ses-0.019
 fingers-0.018
sed-0.018
irted-0.017
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
 pulp+0.024
 fingers+0.023
ses+0.022
 stem+0.022
sed+0.021
Neg logit contributions
mother-0.019
reating-0.019
ock-0.019
 Beth-0.018
ie-0.017
Token magn
Token ID7842
Feature activation+3.50e-02

Pos logit contributions
 pulp+0.016
 fingers+0.015
plets+0.015
irted+0.014
ses+0.014
Neg logit contributions
mother-0.021
 operation-0.020
ock-0.020
ie-0.019
 Beth-0.019
Tokenifying
Token ID4035
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.017
 fingers+0.015
ses+0.011
 stem+0.011
sed+0.010
Neg logit contributions
mother-0.108
 Beth-0.107
reating-0.107
ock-0.107
ie-0.104
Token glass
Token ID5405
Feature activation+1.14e-02

Pos logit contributions
 pulp+0.147
 fingers+0.142
ses+0.130
sed+0.128
 toes+0.122
Neg logit contributions
mother-0.254
 Beth-0.247
reating-0.246
ock-0.244
ad-0.241
Token over
Token ID625
Feature activation+3.26e-02

Pos logit contributions
 pulp+0.022
 fingers+0.022
 stem+0.020
ses+0.020
sed+0.020
Neg logit contributions
mother-0.019
 Beth-0.019
reating-0.018
ock-0.018
 operation-0.017
Token a
Token ID257
Feature activation+5.74e-03

Pos logit contributions
 fingers+0.064
 pulp+0.064
 stem+0.057
 toes+0.057
 finger+0.056
Neg logit contributions
ock-0.055
mother-0.054
 operation-0.054
reating-0.053
 Beth-0.050
Token black
Token ID2042
Feature activation+1.59e-04

Pos logit contributions
 pulp+0.009
ses+0.009
 fingers+0.009
sed+0.009
irted+0.008
Neg logit contributions
ock-0.011
mother-0.010
 Beth-0.010
reating-0.010
ie-0.010
Token ant
Token ID1885
Feature activation+3.54e-04

Pos logit contributions
 pulp+0.000
 fingers+0.000
ses+0.000
sed+0.000
 stem+0.000
Neg logit contributions
mother-0.000
ock-0.000
reating-0.000
 Beth-0.000
ie-0.000
Token she
Token ID673
Feature activation-2.33e-02

Pos logit contributions
mother+0.002
reating+0.002
ock+0.002
 Beth+0.002
 operation+0.002
Neg logit contributions
 pulp-0.003
 fingers-0.002
 stem-0.002
ses-0.002
sed-0.002
Token declared
Token ID6875
Feature activation+1.47e-02

Pos logit contributions
reating+0.044
ock+0.043
mother+0.043
 Beth+0.042
 operation+0.042
Neg logit contributions
 pulp-0.039
 fingers-0.038
 stem-0.035
ses-0.035
sed-0.035
Token,
Token ID11
Feature activation-3.03e-03

Pos logit contributions
 fingers+0.022
 finger+0.020
 toes+0.020
 pulp+0.019
 hands+0.019
Neg logit contributions
ock-0.032
reating-0.030
 operation-0.029
mother-0.029
isky-0.029
Token "
Token ID366
Feature activation-2.14e-02

Pos logit contributions
mother+0.005
 Beth+0.005
ock+0.005
reating+0.005
ie+0.005
Neg logit contributions
 pulp-0.006
 fingers-0.005
ses-0.005
 stem-0.005
sed-0.005
TokenMy
Token ID3666
Feature activation+3.02e-03

Pos logit contributions
mother+0.043
reating+0.042
 Beth+0.042
ock+0.042
 operation+0.042
Neg logit contributions
 pulp-0.033
 fingers-0.031
 stem-0.029
ses-0.028
sed-0.027
Token toes
Token ID23932
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.005
 fingers+0.005
ses+0.005
 stem+0.005
sed+0.005
Neg logit contributions
ock-0.006
reating-0.006
mother-0.006
 Beth-0.006
ie-0.005
Token are
Token ID389
Feature activation-5.28e-04

Pos logit contributions
 pulp+0.178
 fingers+0.175
ses+0.163
 stem+0.160
sed+0.157
Neg logit contributions
mother-0.224
reating-0.218
 Beth-0.217
ock-0.215
 operation-0.209
Token better
Token ID1365
Feature activation-1.58e-02

Pos logit contributions
mother+0.001
reating+0.001
ock+0.001
 operation+0.001
 Beth+0.001
Neg logit contributions
 fingers-0.001
 pulp-0.001
 stem-0.001
 toes-0.001
 tongue-0.001
Token now
Token ID783
Feature activation+3.66e-03

Pos logit contributions
mother+0.025
 Beth+0.024
ock+0.024
reating+0.024
ie+0.023
Neg logit contributions
 pulp-0.032
 fingers-0.030
ses-0.030
 stem-0.029
sed-0.029
Token than
Token ID621
Feature activation+3.59e-02

Pos logit contributions
 pulp+0.007
 fingers+0.006
 stem+0.006
ses+0.006
sed+0.006
Neg logit contributions
mother-0.006
reating-0.006
ock-0.006
 Beth-0.006
ie-0.006
Token before
Token ID878
Feature activation+4.64e-02

Pos logit contributions
 pulp+0.069
 fingers+0.068
 stem+0.063
ses+0.061
 toes+0.061
Neg logit contributions
mother-0.059
ock-0.057
reating-0.056
 Beth-0.055
isky-0.054
Token examine
Token ID10716
Feature activation-4.11e-03

Pos logit contributions
 fingers+0.014
 pulp+0.014
 stem+0.013
ses+0.012
 finger+0.012
Neg logit contributions
reating-0.021
mother-0.020
ock-0.020
 Beth-0.020
ie-0.019
Token it
Token ID340
Feature activation+4.55e-04

Pos logit contributions
mother+0.007
reating+0.007
 Beth+0.007
ock+0.007
isky+0.007
Neg logit contributions
 pulp-0.007
 fingers-0.007
 stem-0.007
 finger-0.006
ses-0.006
Token with
Token ID351
Feature activation+2.67e-02

Pos logit contributions
 pulp+0.001
 fingers+0.001
 stem+0.001
 finger+0.001
ses+0.001
Neg logit contributions
mother-0.001
 Beth-0.001
reating-0.001
ock-0.001
isky-0.001
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
 pulp+0.062
 fingers+0.060
ses+0.058
 stem+0.057
sed+0.056
Neg logit contributions
mother-0.034
ock-0.033
 Beth-0.033
reating-0.033
ie-0.031
Token magn
Token ID7842
Feature activation+3.75e-02

Pos logit contributions
 pulp+0.019
 fingers+0.018
ses+0.017
 stem+0.017
sed+0.017
Neg logit contributions
mother-0.025
ock-0.024
 Beth-0.024
reating-0.024
ie-0.023
Tokenifying
Token ID4035
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.021
 fingers+0.019
ses+0.015
 stem+0.014
sed+0.013
Neg logit contributions
mother-0.113
 Beth-0.112
reating-0.112
ock-0.111
ie-0.109
Token glass
Token ID5405
Feature activation+1.65e-02

Pos logit contributions
 pulp+0.139
 fingers+0.133
ses+0.122
sed+0.118
 stem+0.113
Neg logit contributions
mother-0.261
 Beth-0.255
reating-0.254
ock-0.254
 operation-0.248
Token?"
Token ID1701
Feature activation+1.67e-02

Pos logit contributions
 pulp+0.029
 fingers+0.028
 stem+0.026
ses+0.026
sed+0.025
Neg logit contributions
mother-0.030
 Beth-0.030
reating-0.029
ock-0.029
isky-0.028
Token
Token ID198
Feature activation-8.97e-03

Pos logit contributions
 pulp+0.030
 fingers+0.029
ses+0.027
 stem+0.027
sed+0.026
Neg logit contributions
mother-0.030
reating-0.030
ock-0.029
 Beth-0.028
ie-0.028
Token
Token ID198
Feature activation-2.26e-03

Pos logit contributions
mother+0.019
reating+0.019
ock+0.018
 operation+0.018
 Beth+0.017
Neg logit contributions
 fingers-0.012
 pulp-0.012
 stem-0.011
ses-0.011
sed-0.011
Token"
Token ID1
Feature activation+2.24e-02

Pos logit contributions
mother+0.004
ock+0.004
 Beth+0.004
reating+0.004
ie+0.004
Neg logit contributions
 pulp-0.004
 fingers-0.004
ses-0.004
 stem-0.004
sed-0.004
Token us
Token ID514
Feature activation+1.29e-02

Pos logit contributions
mother+0.017
ock+0.016
 Beth+0.016
reating+0.016
ie+0.015
Neg logit contributions
 pulp-0.020
 fingers-0.019
ses-0.018
 stem-0.018
sed-0.018
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
 fingers+0.023
 pulp+0.022
ses+0.021
sed+0.020
 toes+0.020
Neg logit contributions
mother-0.023
 Beth-0.022
reating-0.022
ock-0.022
ie-0.021
Token Where
Token ID6350
Feature activation+7.67e-03

Pos logit contributions
 pulp+0.015
 fingers+0.014
ses+0.013
 stem+0.013
sed+0.013
Neg logit contributions
mother-0.016
 Beth-0.016
reating-0.016
ock-0.016
 operation-0.015
Token is
Token ID318
Feature activation-1.42e-03

Pos logit contributions
 pulp+0.012
 fingers+0.011
ses+0.011
sed+0.010
 stem+0.010
Neg logit contributions
mother-0.016
reating-0.015
ock-0.015
 Beth-0.015
ie-0.015
Token your
Token ID534
Feature activation+2.90e-02

Pos logit contributions
mother+0.003
ock+0.003
reating+0.002
 Beth+0.002
ie+0.002
Neg logit contributions
 pulp-0.003
 fingers-0.002
ses-0.002
 stem-0.002
sed-0.002
Token menu
Token ID6859
Feature activation+1.13e-01

Pos logit contributions
 pulp+0.048
 fingers+0.045
ses+0.043
 stem+0.042
sed+0.041
Neg logit contributions
mother-0.055
 Beth-0.054
reating-0.054
ock-0.053
 operation-0.052
Token?"
Token ID1701
Feature activation+2.05e-02

Pos logit contributions
 pulp+0.216
 fingers+0.211
ses+0.202
 stem+0.197
sed+0.196
Neg logit contributions
mother-0.181
 Beth-0.178
ock-0.174
reating-0.174
ie-0.165
Token
Token ID198
Feature activation-4.34e-03

Pos logit contributions
 pulp+0.037
 fingers+0.036
ses+0.034
 stem+0.033
sed+0.033
Neg logit contributions
mother-0.036
ock-0.035
reating-0.035
 Beth-0.035
ie-0.034
Token
Token ID198
Feature activation+2.14e-03

Pos logit contributions
mother+0.008
ock+0.008
reating+0.008
 Beth+0.008
ie+0.008
Neg logit contributions
 pulp-0.007
 fingers-0.007
ses-0.006
 stem-0.006
sed-0.006
TokenL
Token ID43
Feature activation-4.01e-02

Pos logit contributions
 pulp+0.004
 fingers+0.004
 stem+0.004
ses+0.004
sed+0.004
Neg logit contributions
mother-0.004
ock-0.003
reating-0.003
 Beth-0.003
ie-0.003
Tokenily
Token ID813
Feature activation-9.48e-03

Pos logit contributions
mother+0.059
 Beth+0.059
reating+0.057
ock+0.056
ie+0.055
Neg logit contributions
 pulp-0.084
 fingers-0.081
ses-0.079
 stem-0.076
sed-0.076
Token loyal
Token ID9112
Feature activation-1.59e-02

Pos logit contributions
mother+0.010
 Beth+0.010
ock+0.010
ie+0.010
reating+0.010
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.008
sed-0.007
 stem-0.007
Token little
Token ID1310
Feature activation+1.07e-02

Pos logit contributions
reating+0.041
mother+0.041
ock+0.039
isky+0.039
ad+0.038
Neg logit contributions
ses-0.015
 fingers-0.015
sed-0.015
 pulp-0.014
 stem-0.013
Token girl
Token ID2576
Feature activation-4.33e-02

Pos logit contributions
 fingers+0.014
 pulp+0.013
ses+0.013
 stem+0.012
sed+0.012
Neg logit contributions
mother-0.025
reating-0.025
ock-0.024
ad-0.024
isky-0.023
Token named
Token ID3706
Feature activation-3.39e-02

Pos logit contributions
mother+0.068
ock+0.066
reating+0.066
 Beth+0.066
ie+0.063
Neg logit contributions
 pulp-0.087
 fingers-0.084
ses-0.080
 stem-0.079
sed-0.078
Token L
Token ID406
Feature activation-6.07e-02

Pos logit contributions
mother+0.058
ock+0.057
reating+0.056
ad+0.053
 operation+0.053
Neg logit contributions
 fingers-0.062
 pulp-0.061
 stem-0.058
ses-0.058
sed-0.057
Tokenott
Token ID1252
Feature activation-1.52e-01

Pos logit contributions
 Beth+0.090
 operation+0.088
reating+0.088
 Barry+0.087
mother+0.086
Neg logit contributions
 pulp-0.110
 fingers-0.109
 finger-0.103
 stem-0.101
ses-0.098
Tokenie
Token ID494
Feature activation-3.89e-02

Pos logit contributions
 Beth+0.064
reating+0.052
mother+0.046
ock+0.041
 operation+0.037
Neg logit contributions
 pulp-0.472
 fingers-0.459
 stem-0.446
 finger-0.442
ses-0.440
Token.
Token ID13
Feature activation-2.99e-02

Pos logit contributions
mother+0.060
ock+0.058
 Beth+0.058
reating+0.058
ie+0.055
Neg logit contributions
 pulp-0.080
 fingers-0.077
ses-0.074
 stem-0.073
sed-0.072
Token Every
Token ID3887
Feature activation-9.06e-03

Pos logit contributions
mother+0.056
ty+0.052
ock+0.051
reating+0.049
 operation+0.049
Neg logit contributions
 fingers-0.054
 stem-0.050
 toes-0.048
ses-0.047
 pulp-0.046
Token day
Token ID1110
Feature activation+5.28e-03

Pos logit contributions
mother+0.016
reating+0.016
 Beth+0.016
isky+0.015
ock+0.015
Neg logit contributions
 fingers-0.015
 pulp-0.015
ses-0.015
 stem-0.014
sed-0.014
Token,
Token ID11
Feature activation-1.68e-02

Pos logit contributions
 pulp+0.011
 fingers+0.011
ses+0.010
 stem+0.010
sed+0.010
Neg logit contributions
mother-0.008
ock-0.008
reating-0.008
 Beth-0.008
 operation-0.007
Token After
Token ID2293
Feature activation+1.01e-02

Pos logit contributions
reating+0.006
mother+0.006
ock+0.006
ie+0.006
 operation+0.006
Neg logit contributions
 fingers-0.005
 pulp-0.005
 stem-0.004
ses-0.004
sed-0.004
Token a
Token ID257
Feature activation-8.76e-05

Pos logit contributions
 fingers+0.012
 pulp+0.011
 toes+0.010
ses+0.010
sed+0.010
Neg logit contributions
ock-0.024
ie-0.023
reating-0.023
mother-0.022
heart-0.022
Token while
Token ID981
Feature activation+4.54e-02

Pos logit contributions
mother+0.000
 Beth+0.000
ock+0.000
reating+0.000
ie+0.000
Neg logit contributions
 pulp-0.000
 fingers-0.000
ses-0.000
 stem-0.000
sed-0.000
Token,
Token ID11
Feature activation-1.32e-02

Pos logit contributions
 fingers+0.075
 pulp+0.072
 stem+0.069
 toes+0.066
sed+0.065
Neg logit contributions
reating-0.088
ock-0.087
mother-0.084
ie-0.078
 operation-0.078
Token D
Token ID360
Feature activation+1.81e-03

Pos logit contributions
mother+0.021
ock+0.020
reating+0.020
 Beth+0.019
ie+0.019
Neg logit contributions
 pulp-0.027
 fingers-0.026
 stem-0.024
ses-0.024
sed-0.024
Tokenott
Token ID1252
Feature activation-1.46e-01

Pos logit contributions
 pulp+0.005
 fingers+0.005
 stem+0.004
ses+0.004
 finger+0.004
Neg logit contributions
reating-0.002
ock-0.002
mother-0.002
 Beth-0.002
 operation-0.001
Tokenie
Token ID494
Feature activation-2.72e-02

Pos logit contributions
 Beth+0.100
reating+0.095
mother+0.084
ock+0.083
 operation+0.080
Neg logit contributions
 pulp-0.416
 fingers-0.401
 stem-0.390
ses-0.387
 finger-0.385
Token was
Token ID373
Feature activation-6.32e-03

Pos logit contributions
reating+0.061
ock+0.058
mother+0.055
 Beth+0.054
 operation+0.054
Neg logit contributions
 fingers-0.039
 pulp-0.038
 stem-0.035
 finger-0.035
sed-0.034
Token no
Token ID645
Feature activation+6.61e-02

Pos logit contributions
mother+0.011
reating+0.011
ock+0.011
 Beth+0.011
ie+0.011
Neg logit contributions
 pulp-0.011
 fingers-0.011
ses-0.010
 stem-0.010
sed-0.010
Token longer
Token ID2392
Feature activation+2.58e-02

Pos logit contributions
 pulp+0.141
 fingers+0.140
ses+0.132
 stem+0.130
sed+0.128
Neg logit contributions
mother-0.095
reating-0.091
 Beth-0.091
ock-0.091
ie-0.086
Token scared
Token ID12008
Feature activation+1.92e-02

Pos logit contributions
 pulp+0.042
 fingers+0.040
ses+0.038
 stem+0.037
sed+0.037
Neg logit contributions
mother-0.050
 Beth-0.049
ock-0.049
reating-0.048
ie-0.047
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
ock+0.000
reating+0.000
mother+0.000
isky+0.000
ussy+0.000
Neg logit contributions
 fingers-0.000
 toes-0.000
 necks-0.000
 neck-0.000
 hands-0.000
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
mother+0.029
reating+0.028
ock+0.028
 operation+0.026
 Beth+0.026
Neg logit contributions
 fingers-0.023
 pulp-0.023
 stem-0.021
ses-0.021
sed-0.021
Token
Token ID198
Feature activation-7.22e-03

Pos logit contributions
mother+0.025
reating+0.025
 operation+0.024
ock+0.024
 Beth+0.023
Neg logit contributions
 fingers-0.016
 pulp-0.015
 stem-0.014
ses-0.014
 finger-0.013
TokenWhen
Token ID2215
Feature activation+1.54e-02

Pos logit contributions
mother+0.013
ock+0.012
 Beth+0.012
reating+0.012
ie+0.012
Neg logit contributions
 pulp-0.013
 fingers-0.013
ses-0.012
 stem-0.012
sed-0.012
Token D
Token ID360
Feature activation-8.32e-04

Pos logit contributions
 pulp+0.034
ses+0.032
 fingers+0.032
sed+0.031
 stem+0.030
Neg logit contributions
 Beth-0.022
ie-0.021
mother-0.021
ock-0.020
reating-0.020
Tokenott
Token ID1252
Feature activation-1.43e-01

Pos logit contributions
reating+0.001
mother+0.001
 Beth+0.001
ock+0.001
 operation+0.001
Neg logit contributions
 pulp-0.002
 fingers-0.002
 stem-0.002
ses-0.002
 finger-0.002
Tokenie
Token ID494
Feature activation-2.64e-02

Pos logit contributions
 Beth+0.101
reating+0.084
mother+0.072
 operation+0.071
 Kenny+0.069
Neg logit contributions
 pulp-0.401
 fingers-0.386
 stem-0.374
ses-0.372
 finger-0.372
Token arrived
Token ID5284
Feature activation+4.48e-02

Pos logit contributions
reating+0.057
ock+0.055
mother+0.052
 Beth+0.052
isky+0.051
Neg logit contributions
 fingers-0.037
 pulp-0.036
 stem-0.033
sed-0.033
ses-0.032
Token at
Token ID379
Feature activation+4.44e-02

Pos logit contributions
 pulp+0.082
 fingers+0.079
ses+0.075
 stem+0.074
sed+0.073
Neg logit contributions
mother-0.079
ock-0.078
reating-0.077
 Beth-0.077
ie-0.073
Token the
Token ID262
Feature activation-4.33e-03

Pos logit contributions
 fingers+0.067
 pulp+0.065
ses+0.065
 stem+0.061
 finger+0.060
Neg logit contributions
ock-0.086
mother-0.084
isky-0.082
reating-0.081
ie-0.080
Token lab
Token ID2248
Feature activation-6.07e-03

Pos logit contributions
mother+0.006
ock+0.006
 Beth+0.006
reating+0.006
ie+0.006
Neg logit contributions
 pulp-0.009
 fingers-0.009
ses-0.008
 stem-0.008
sed-0.008
Token a
Token ID257
Feature activation-3.24e-03

Pos logit contributions
 Beth+0.044
mother+0.043
ock+0.042
reating+0.042
ie+0.040
Neg logit contributions
 pulp-0.059
 fingers-0.056
ses-0.054
 stem-0.053
sed-0.053
Token good
Token ID922
Feature activation-2.71e-02

Pos logit contributions
mother+0.006
 Beth+0.006
ock+0.006
reating+0.006
ie+0.006
Neg logit contributions
 pulp-0.005
 fingers-0.005
ses-0.005
 stem-0.004
sed-0.004
Token dog
Token ID3290
Feature activation-2.86e-02

Pos logit contributions
mother+0.062
reating+0.062
ock+0.060
 Beth+0.059
isky+0.057
Neg logit contributions
 pulp-0.035
 fingers-0.034
ses-0.032
 stem-0.031
sed-0.031
Token named
Token ID3706
Feature activation-2.81e-02

Pos logit contributions
reating+0.054
mother+0.053
ock+0.052
 Beth+0.051
isky+0.049
Neg logit contributions
 fingers-0.045
 pulp-0.045
ses-0.043
sed-0.042
 stem-0.042
Token D
Token ID360
Feature activation-1.42e-02

Pos logit contributions
mother+0.048
reating+0.044
ock+0.043
ie+0.042
ad+0.041
Neg logit contributions
 fingers-0.054
ses-0.052
 pulp-0.052
sed-0.050
 stem-0.048
Tokenott
Token ID1252
Feature activation-1.43e-01

Pos logit contributions
mother+0.026
reating+0.026
 Beth+0.025
 operation+0.024
 Ellen+0.023
Neg logit contributions
 fingers-0.020
ses-0.020
 pulp-0.020
 stem-0.019
sed-0.019
Tokenie
Token ID494
Feature activation-2.96e-02

Pos logit contributions
 Beth+0.081
reating+0.067
mother+0.062
 Kenny+0.053
ock+0.053
Neg logit contributions
 pulp-0.418
 fingers-0.411
 finger-0.397
 stem-0.397
ses-0.397
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
mother+0.045
ock+0.043
 Beth+0.043
reating+0.043
ie+0.041
Neg logit contributions
 pulp-0.062
 fingers-0.060
ses-0.057
 stem-0.056
sed-0.056
Token D
Token ID360
Feature activation-3.17e-03

Pos logit contributions
mother+0.058
reating+0.054
ock+0.053
 operation+0.051
ty+0.050
Neg logit contributions
 fingers-0.041
 pulp-0.040
 stem-0.039
ses-0.038
 toes-0.036
Tokenott
Token ID1252
Feature activation-1.34e-01

Pos logit contributions
reating+0.003
mother+0.003
 Beth+0.003
ock+0.003
 operation+0.003
Neg logit contributions
 pulp-0.008
 fingers-0.008
 stem-0.008
ses-0.008
 finger-0.008
Tokenie
Token ID494
Feature activation-2.26e-02

Pos logit contributions
 Beth+0.087
reating+0.082
mother+0.071
 operation+0.070
ock+0.070
Neg logit contributions
 pulp-0.379
 fingers-0.368
 stem-0.356
 finger-0.356
ses-0.352
Token very
Token ID845
Feature activation+1.06e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
 stem+0.019
ses+0.019
sed+0.019
Neg logit contributions
mother-0.022
ock-0.022
reating-0.021
 Beth-0.021
ie-0.021
Token special
Token ID2041
Feature activation-3.16e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
ses+0.021
sed+0.020
 stem+0.020
Neg logit contributions
mother-0.015
 Beth-0.015
reating-0.015
ock-0.015
ie-0.014
Token toy
Token ID13373
Feature activation-7.68e-03

Pos logit contributions
mother+0.061
ock+0.060
reating+0.060
 Beth+0.059
isky+0.059
Neg logit contributions
 pulp-0.051
 fingers-0.050
 stem-0.047
ses-0.047
sed-0.045
Token for
Token ID329
Feature activation-9.56e-03

Pos logit contributions
ock+0.015
mother+0.014
isky+0.014
reating+0.014
ie+0.013
Neg logit contributions
 fingers-0.013
 pulp-0.012
 finger-0.012
ses-0.011
sed-0.011
Token L
Token ID406
Feature activation-2.10e-02

Pos logit contributions
ock+0.016
mother+0.015
isky+0.014
reating+0.014
lim+0.014
Neg logit contributions
 fingers-0.018
 pulp-0.017
 toes-0.016
 stem-0.016
 finger-0.015
Tokenott
Token ID1252
Feature activation-1.41e-01

Pos logit contributions
reating+0.020
 Beth+0.020
ock+0.020
mother+0.019
 operation+0.019
Neg logit contributions
 pulp-0.053
 fingers-0.051
 stem-0.049
 finger-0.048
 flesh-0.047
Tokenie
Token ID494
Feature activation-1.32e-02

Pos logit contributions
 Beth+0.090
 Kenny+0.067
reating+0.063
mother+0.063
 Barry+0.055
Neg logit contributions
 pulp-0.409
 fingers-0.397
 stem-0.385
 finger-0.384
ses-0.379
Token.
Token ID13
Feature activation-7.38e-03

Pos logit contributions
ock+0.025
reating+0.025
mother+0.024
 Beth+0.024
isky+0.023
Neg logit contributions
 fingers-0.022
 pulp-0.021
 finger-0.020
 stem-0.019
 toes-0.019
Token It
Token ID632
Feature activation-2.78e-02

Pos logit contributions
mother+0.015
ock+0.014
reating+0.014
ie+0.013
ty+0.013
Neg logit contributions
 fingers-0.013
 pulp-0.012
ses-0.012
 stem-0.012
sed-0.011
Token was
Token ID373
Feature activation-3.53e-03

Pos logit contributions
reating+0.060
ock+0.057
mother+0.055
 operation+0.052
isky+0.051
Neg logit contributions
 fingers-0.041
 pulp-0.041
 stem-0.039
sed-0.039
 finger-0.037
Token a
Token ID257
Feature activation+1.45e-02

Pos logit contributions
ock+0.007
mother+0.007
reating+0.007
ie+0.007
isky+0.006
Neg logit contributions
 fingers-0.006
 pulp-0.005
 stem-0.005
ses-0.005
sed-0.005
Token
Token ID198
Feature activation-5.94e-03

Pos logit contributions
mother+0.022
reating+0.021
ock+0.021
 Beth+0.021
 operation+0.020
Neg logit contributions
 fingers-0.016
 pulp-0.016
ses-0.015
 stem-0.015
sed-0.014
TokenOne
Token ID3198
Feature activation+2.95e-02

Pos logit contributions
mother+0.010
 Beth+0.009
reating+0.009
ock+0.009
ie+0.009
Neg logit contributions
 pulp-0.011
 fingers-0.011
 stem-0.010
ses-0.010
 toes-0.009
Token day
Token ID1110
Feature activation+2.40e-02

Pos logit contributions
 pulp+0.068
 fingers+0.065
ses+0.063
sed+0.062
 stem+0.061
Neg logit contributions
mother-0.039
ock-0.038
 Beth-0.038
reating-0.037
ie-0.037
Token,
Token ID11
Feature activation+1.40e-03

Pos logit contributions
 pulp+0.061
ses+0.057
 fingers+0.056
 stem+0.055
sed+0.055
Neg logit contributions
 Beth-0.029
mother-0.026
ie-0.026
reating-0.025
ock-0.025
Token L
Token ID406
Feature activation-3.02e-02

Pos logit contributions
 pulp+0.003
ses+0.003
 fingers+0.003
 stem+0.003
sed+0.003
Neg logit contributions
 Beth-0.002
mother-0.002
reating-0.002
ie-0.002
ock-0.002
Tokenott
Token ID1252
Feature activation-1.38e-01

Pos logit contributions
ock+0.022
mother+0.022
 Beth+0.022
reating+0.022
 operation+0.020
Neg logit contributions
 pulp-0.084
 fingers-0.082
 stem-0.078
ses-0.077
 finger-0.077
Tokenie
Token ID494
Feature activation-1.79e-02

Pos logit contributions
 Beth+0.057
reating+0.046
mother+0.044
ock+0.039
 operation+0.036
Neg logit contributions
 pulp-0.433
 fingers-0.419
 stem-0.406
ses-0.402
 finger-0.400
Token's
Token ID338
Feature activation+1.72e-02

Pos logit contributions
mother+0.034
reating+0.034
ock+0.034
 Beth+0.034
 operation+0.032
Neg logit contributions
 pulp-0.029
 fingers-0.028
ses-0.026
 stem-0.026
sed-0.026
Token mom
Token ID1995
Feature activation+9.67e-03

Pos logit contributions
 fingers+0.024
 pulp+0.023
ses+0.021
sed+0.021
 stem+0.020
Neg logit contributions
ock-0.038
mother-0.038
reating-0.038
 Beth-0.037
ie-0.036
Token went
Token ID1816
Feature activation+8.66e-03

Pos logit contributions
 pulp+0.014
 fingers+0.014
 finger+0.012
 stem+0.012
sed+0.012
Neg logit contributions
reating-0.020
ock-0.019
mother-0.019
 Beth-0.018
 operation-0.018
Token to
Token ID284
Feature activation-7.22e-03

Pos logit contributions
 fingers+0.012
 finger+0.011
 pulp+0.011
 mouth+0.011
 stem+0.011
Neg logit contributions
ock-0.016
isky-0.016
heart-0.016
mother-0.016
ad-0.015
Token?"
Token ID1701
Feature activation+3.16e-03

Pos logit contributions
mother+0.042
 Beth+0.042
reating+0.041
ock+0.040
ad+0.039
Neg logit contributions
 pulp-0.033
 fingers-0.032
 stem-0.030
ses-0.029
 finger-0.028
Token
Token ID220
Feature activation-2.78e-02

Pos logit contributions
 pulp+0.004
 fingers+0.004
 stem+0.004
 finger+0.004
ses+0.004
Neg logit contributions
reating-0.007
mother-0.007
ock-0.007
isky-0.006
 operation-0.006
Token
Token ID198
Feature activation-1.48e-02

Pos logit contributions
reating+0.060
mother+0.059
ock+0.058
ie+0.056
 operation+0.055
Neg logit contributions
 pulp-0.038
 fingers-0.038
ses-0.035
 stem-0.034
sed-0.033
Token
Token ID198
Feature activation-8.11e-03

Pos logit contributions
mother+0.031
reating+0.031
ock+0.030
 operation+0.029
ie+0.028
Neg logit contributions
 pulp-0.021
 fingers-0.021
 stem-0.020
 finger-0.018
ses-0.018
TokenD
Token ID35
Feature activation-1.85e-02

Pos logit contributions
mother+0.016
reating+0.015
ock+0.015
 operation+0.015
ie+0.014
Neg logit contributions
 pulp-0.013
 fingers-0.012
 stem-0.012
 finger-0.011
plets-0.011
Tokenott
Token ID1252
Feature activation-1.38e-01

Pos logit contributions
reating+0.020
ock+0.019
 Beth+0.018
mother+0.018
 operation+0.017
Neg logit contributions
 pulp-0.046
 fingers-0.045
 stem-0.044
 finger-0.043
ses-0.043
Tokenie
Token ID494
Feature activation-2.12e-02

Pos logit contributions
 Beth+0.073
reating+0.068
ock+0.059
mother+0.058
 operation+0.056
Neg logit contributions
 pulp-0.415
 fingers-0.401
 stem-0.390
 finger-0.385
ses-0.385
Token per
Token ID583
Feature activation+2.89e-03

Pos logit contributions
reating+0.049
ock+0.045
mother+0.041
 operation+0.041
 Beth+0.041
Neg logit contributions
 fingers-0.032
 pulp-0.030
 finger-0.029
 stem-0.029
sed-0.028
Tokenked
Token ID9091
Feature activation+2.14e-02

Pos logit contributions
 pulp+0.004
 fingers+0.004
ses+0.004
 stem+0.004
sed+0.004
Neg logit contributions
mother-0.006
reating-0.006
ock-0.006
 Beth-0.006
ie-0.006
Token up
Token ID510
Feature activation-1.27e-02

Pos logit contributions
 fingers+0.039
 pulp+0.037
 stem+0.035
 finger+0.035
 toes+0.034
Neg logit contributions
ock-0.040
mother-0.040
reating-0.040
isky-0.037
 Beth-0.036
Token and
Token ID290
Feature activation+1.12e-06

Pos logit contributions
reating+0.025
ock+0.025
mother+0.024
 Beth+0.022
 operation+0.022
Neg logit contributions
 fingers-0.021
 pulp-0.021
 stem-0.020
ses-0.019
sed-0.019
Token a
Token ID257
Feature activation+7.15e-03

Pos logit contributions
 pulp+0.020
 fingers+0.019
ses+0.018
 stem+0.018
sed+0.018
Neg logit contributions
mother-0.016
ock-0.015
reating-0.015
 Beth-0.015
ie-0.014
Token kind
Token ID1611
Feature activation-3.31e-02

Pos logit contributions
 pulp+0.010
 fingers+0.010
 stem+0.010
ses+0.009
sed+0.009
Neg logit contributions
mother-0.015
reating-0.015
ock-0.015
ie-0.015
 Beth-0.015
Token deer
Token ID20096
Feature activation-5.54e-03

Pos logit contributions
beh+0.069
mother+0.069
reating+0.068
isky+0.066
 Beth+0.065
Neg logit contributions
 finger-0.044
 stem-0.043
 fingers-0.043
 palm-0.041
 tail-0.040
Token named
Token ID3706
Feature activation-2.08e-02

Pos logit contributions
reating+0.010
ock+0.010
mother+0.010
 Beth+0.009
 operation+0.009
Neg logit contributions
 fingers-0.010
 pulp-0.010
sed-0.009
ses-0.009
 stem-0.009
Token D
Token ID360
Feature activation-2.61e-03

Pos logit contributions
mother+0.033
ock+0.032
reating+0.032
ie+0.030
 operation+0.030
Neg logit contributions
sed-0.038
ses-0.038
 fingers-0.038
 pulp-0.036
 stem-0.035
Tokenott
Token ID1252
Feature activation-1.35e-01

Pos logit contributions
reating+0.005
mother+0.004
 Beth+0.004
 Paul+0.004
 Ellen+0.004
Neg logit contributions
ses-0.004
sed-0.004
 fingers-0.004
irted-0.004
irt-0.003
Tokenie
Token ID494
Feature activation-2.15e-02

Pos logit contributions
 Beth+0.107
reating+0.094
mother+0.086
 Kenny+0.079
 operation+0.078
Neg logit contributions
 pulp-0.359
 fingers-0.351
ses-0.345
 finger-0.341
 stem-0.340
Token appeared
Token ID4120
Feature activation+1.51e-03

Pos logit contributions
reating+0.047
ock+0.045
mother+0.041
 operation+0.040
 Beth+0.040
Neg logit contributions
 fingers-0.032
 pulp-0.031
sed-0.031
 finger-0.030
 stem-0.030
Token.
Token ID13
Feature activation-8.26e-03

Pos logit contributions
 fingers+0.002
 finger+0.002
sed+0.002
 pulp+0.002
ses+0.002
Neg logit contributions
reating-0.003
mother-0.003
ock-0.003
 Beth-0.003
 operation-0.003
Token "
Token ID366
Feature activation-1.75e-02

Pos logit contributions
mother+0.015
reating+0.015
ock+0.015
ie+0.014
 operation+0.013
Neg logit contributions
 fingers-0.015
 pulp-0.015
sed-0.014
ses-0.014
 stem-0.014
TokenLeave
Token ID35087
Feature activation+4.16e-02

Pos logit contributions
mother+0.028
ock+0.028
 Beth+0.028
reating+0.027
ie+0.026
Neg logit contributions
 pulp-0.034
 fingers-0.034
ses-0.033
sed-0.032
 stem-0.031
Token D
Token ID360
Feature activation-1.42e-02

Pos logit contributions
mother+0.048
reating+0.044
ock+0.043
ie+0.042
ad+0.041
Neg logit contributions
 fingers-0.054
ses-0.052
 pulp-0.052
sed-0.050
 stem-0.048
Tokenott
Token ID1252
Feature activation-1.43e-01

Pos logit contributions
mother+0.026
reating+0.026
 Beth+0.025
 operation+0.024
 Ellen+0.023
Neg logit contributions
 fingers-0.020
ses-0.020
 pulp-0.020
 stem-0.019
sed-0.019
Tokenie
Token ID494
Feature activation-2.96e-02

Pos logit contributions
 Beth+0.081
reating+0.067
mother+0.062
 Kenny+0.053
ock+0.053
Neg logit contributions
 pulp-0.418
 fingers-0.411
 finger-0.397
 stem-0.397
ses-0.397
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
mother+0.045
ock+0.043
 Beth+0.043
reating+0.043
ie+0.041
Neg logit contributions
 pulp-0.062
 fingers-0.060
ses-0.057
 stem-0.056
sed-0.056
Token D
Token ID360
Feature activation-3.17e-03

Pos logit contributions
mother+0.058
reating+0.054
ock+0.053
 operation+0.051
ty+0.050
Neg logit contributions
 fingers-0.041
 pulp-0.040
 stem-0.039
ses-0.038
 toes-0.036
Tokenott
Token ID1252
Feature activation-1.34e-01

Pos logit contributions
reating+0.003
mother+0.003
 Beth+0.003
ock+0.003
 operation+0.003
Neg logit contributions
 pulp-0.008
 fingers-0.008
 stem-0.008
ses-0.008
 finger-0.008
Tokenie
Token ID494
Feature activation-2.26e-02

Pos logit contributions
 Beth+0.087
reating+0.082
mother+0.071
 operation+0.070
ock+0.070
Neg logit contributions
 pulp-0.379
 fingers-0.368
 stem-0.356
 finger-0.356
ses-0.352
Token wanted
Token ID2227
Feature activation+4.59e-02

Pos logit contributions
reating+0.046
mother+0.044
ock+0.043
 Beth+0.042
 operation+0.041
Neg logit contributions
 pulp-0.036
 fingers-0.035
ses-0.033
sed-0.033
 stem-0.032
Token to
Token ID284
Feature activation-8.18e-03

Pos logit contributions
 fingers+0.085
 pulp+0.081
 finger+0.078
ses+0.076
sed+0.074
Neg logit contributions
mother-0.081
ie-0.076
isky-0.076
ock-0.075
 Beth-0.073
Token visit
Token ID3187
Feature activation-1.55e-02

Pos logit contributions
mother+0.016
reating+0.015
ock+0.015
 Beth+0.014
ie+0.014
Neg logit contributions
 fingers-0.014
 pulp-0.014
 stem-0.013
 finger-0.013
ses-0.012
Token a
Token ID257
Feature activation-2.95e-03

Pos logit contributions
mother+0.029
isky+0.027
reating+0.027
ock+0.027
 Beth+0.027
Neg logit contributions
 fingers-0.026
 pulp-0.023
 stem-0.023
 toes-0.023
 finger-0.023
Token
Token ID198
Feature activation-3.79e-03

Pos logit contributions
mother+0.017
reating+0.016
 operation+0.016
ock+0.016
ie+0.016
Neg logit contributions
 pulp-0.010
 fingers-0.010
 stem-0.009
ses-0.009
 finger-0.009
TokenSuddenly
Token ID38582
Feature activation+3.13e-02

Pos logit contributions
mother+0.008
reating+0.008
 Beth+0.008
ock+0.008
ie+0.007
Neg logit contributions
 pulp-0.006
 fingers-0.005
 stem-0.005
ses-0.005
sed-0.005
Token,
Token ID11
Feature activation-9.15e-03

Pos logit contributions
 pulp+0.065
 fingers+0.063
ses+0.060
 stem+0.059
sed+0.058
Neg logit contributions
mother-0.048
ock-0.046
reating-0.046
 Beth-0.046
ie-0.044
Token they
Token ID484
Feature activation-6.28e-03

Pos logit contributions
 Beth+0.015
mother+0.015
reating+0.014
ock+0.014
ie+0.014
Neg logit contributions
 pulp-0.018
 fingers-0.017
ses-0.016
 stem-0.016
sed-0.016
Token sp
Token ID599
Feature activation-3.82e-03

Pos logit contributions
mother+0.012
ie+0.012
 Beth+0.011
ock+0.011
reating+0.011
Neg logit contributions
 pulp-0.010
ses-0.010
 fingers-0.010
 stem-0.010
sed-0.009
Tokenott
Token ID1252
Feature activation-1.29e-01

Pos logit contributions
mother+0.005
 Beth+0.005
reating+0.004
ock+0.004
 operation+0.004
Neg logit contributions
 pulp-0.009
 fingers-0.009
ses-0.008
 stem-0.008
sed-0.008
Tokenled
Token ID992
Feature activation+2.84e-02

Pos logit contributions
 Beth+0.159
reating+0.154
mother+0.153
ock+0.148
 operation+0.142
Neg logit contributions
 pulp-0.300
 fingers-0.289
ses-0.276
 stem-0.274
sed-0.271
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
 pulp+0.052
 fingers+0.051
ses+0.048
 stem+0.047
sed+0.046
Neg logit contributions
mother-0.050
reating-0.049
ock-0.049
 Beth-0.048
ie-0.046
Token bicycle
Token ID17026
Feature activation+5.38e-03

Pos logit contributions
mother+0.022
reating+0.021
ock+0.021
 Beth+0.021
ie+0.020
Neg logit contributions
 pulp-0.015
 fingers-0.014
 stem-0.013
ses-0.013
sed-0.013
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 pulp+0.009
 fingers+0.009
ses+0.008
sed+0.008
 stem+0.008
Neg logit contributions
mother-0.010
reating-0.010
 Beth-0.010
ock-0.010
ie-0.009
Token It
Token ID632
Feature activation-3.36e-02

Pos logit contributions
mother+0.025
ock+0.025
reating+0.025
ie+0.024
ty+0.023
Neg logit contributions
 fingers-0.015
 pulp-0.015
ses-0.014
sed-0.013
 stem-0.013
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
 pulp+0.088
 fingers+0.087
ses+0.081
 stem+0.080
sed+0.079
Neg logit contributions
mother-0.074
 Beth-0.072
ock-0.072
reating-0.072
ie-0.068
Token They
Token ID1119
Feature activation+6.21e-03

Pos logit contributions
 pulp+0.008
 fingers+0.007
 stem+0.007
ses+0.007
 finger+0.007
Neg logit contributions
 Beth-0.005
 Kenny-0.004
 him-0.004
mother-0.004
 operation-0.004
Token laugh
Token ID6487
Feature activation+2.01e-02

Pos logit contributions
 pulp+0.010
 fingers+0.010
ses+0.009
 stem+0.009
sed+0.009
Neg logit contributions
mother-0.012
ock-0.012
 Beth-0.012
reating-0.011
ie-0.011
Token and
Token ID290
Feature activation-8.32e-03

Pos logit contributions
 pulp+0.035
 fingers+0.034
ses+0.032
 stem+0.031
sed+0.031
Neg logit contributions
mother-0.037
reating-0.036
 Beth-0.036
ock-0.036
ie-0.035
Token high
Token ID1029
Feature activation-6.72e-03

Pos logit contributions
 Beth+0.014
mother+0.013
ock+0.013
reating+0.013
ad+0.013
Neg logit contributions
 pulp-0.016
 fingers-0.015
ses-0.014
sed-0.014
 stem-0.014
Token-
Token ID12
Feature activation-1.29e-01

Pos logit contributions
mother+0.010
ock+0.010
 Beth+0.010
reating+0.009
ie+0.009
Neg logit contributions
 pulp-0.014
 fingers-0.013
ses-0.013
 stem-0.013
sed-0.013
Tokenfive
Token ID13261
Feature activation-7.05e-03

Pos logit contributions
mother+0.260
 Beth+0.253
reating+0.250
ock+0.242
 operation+0.241
Neg logit contributions
 pulp-0.200
 fingers-0.193
ses-0.185
 stem-0.179
sed-0.176
Token.
Token ID13
Feature activation+2.64e-03

Pos logit contributions
mother+0.010
ock+0.010
reating+0.010
 Beth+0.009
ie+0.009
Neg logit contributions
 pulp-0.015
ses-0.014
 fingers-0.014
 stem-0.014
sed-0.014
Token
Token ID198
Feature activation-4.01e-03

Pos logit contributions
 pulp+0.007
 fingers+0.006
 stem+0.006
ses+0.006
 finger+0.006
Neg logit contributions
 Beth-0.003
 him-0.003
 Kenny-0.002
reating-0.002
 Bun-0.002
Token
Token ID198
Feature activation-4.82e-05

Pos logit contributions
mother+0.006
 Beth+0.006
ock+0.006
reating+0.006
ie+0.006
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.008
 stem-0.007
sed-0.007
TokenThey
Token ID2990
Feature activation+1.69e-02

Pos logit contributions
ock+0.000
isky+0.000
ty+0.000
reating+0.000
mother+0.000
Neg logit contributions
ses-0.000
sed-0.000
 fingers-0.000
 pulp-0.000
 tongue-0.000
Token
Token ID198
Feature activation-2.27e-03

Pos logit contributions
mother+0.016
 Beth+0.016
ock+0.016
reating+0.016
ie+0.015
Neg logit contributions
 pulp-0.024
 fingers-0.023
ses-0.022
 stem-0.022
sed-0.022
TokenThey
Token ID2990
Feature activation+1.56e-02

Pos logit contributions
mother+0.003
ock+0.003
isky+0.003
reating+0.003
ty+0.003
Neg logit contributions
ses-0.006
sed-0.005
 pulp-0.005
 fingers-0.005
 tongue-0.005
Token smile
Token ID8212
Feature activation+1.51e-02

Pos logit contributions
 pulp+0.025
 fingers+0.024
 stem+0.023
ses+0.023
sed+0.022
Neg logit contributions
mother-0.031
reating-0.031
ock-0.030
 Beth-0.030
ie-0.029
Token and
Token ID290
Feature activation-8.93e-03

Pos logit contributions
 pulp+0.037
 stem+0.034
 fingers+0.033
ses+0.032
sed+0.032
Neg logit contributions
mother-0.017
ock-0.017
reating-0.017
 Beth-0.017
 him-0.016
Token high
Token ID1029
Feature activation-1.28e-02

Pos logit contributions
 Beth+0.014
mother+0.014
ock+0.014
reating+0.013
ad+0.013
Neg logit contributions
 pulp-0.018
 fingers-0.017
ses-0.016
sed-0.016
 stem-0.016
Token-
Token ID12
Feature activation-1.28e-01

Pos logit contributions
mother+0.018
ock+0.018
reating+0.018
 Beth+0.018
ie+0.018
Neg logit contributions
 pulp-0.028
 fingers-0.026
ses-0.025
 stem-0.025
sed-0.024
Tokenfive
Token ID13261
Feature activation-7.95e-03

Pos logit contributions
 Beth+0.269
mother+0.266
reating+0.259
 operation+0.252
ock+0.252
Neg logit contributions
 pulp-0.187
 fingers-0.175
ses-0.171
sed-0.162
 stem-0.160
Token each
Token ID1123
Feature activation-2.93e-02

Pos logit contributions
mother+0.011
ock+0.011
reating+0.011
ie+0.011
 Beth+0.011
Neg logit contributions
 pulp-0.017
ses-0.016
 stem-0.016
 fingers-0.016
sed-0.016
Token other
Token ID584
Feature activation-3.66e-02

Pos logit contributions
mother+0.052
ock+0.051
reating+0.050
 Beth+0.050
ie+0.049
Neg logit contributions
 pulp-0.054
 fingers-0.051
ses-0.049
 stem-0.048
sed-0.047
Token.
Token ID13
Feature activation+9.70e-03

Pos logit contributions
 Beth+0.068
ock+0.067
mother+0.067
reating+0.067
isky+0.063
Neg logit contributions
 fingers-0.065
 pulp-0.061
 toes-0.057
 stem-0.056
ses-0.056
Token They
Token ID1119
Feature activation+1.28e-02

Pos logit contributions
 pulp+0.023
 fingers+0.020
 finger+0.020
 stem+0.020
lest+0.019
Neg logit contributions
 Beth-0.013
ad-0.010
 him-0.010
 Kenny-0.010
 Ellie-0.010
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 fingers+0.056
 pulp+0.054
 stem+0.050
ses+0.049
 toes+0.049
Neg logit contributions
 Beth-0.061
mother-0.061
reating-0.058
isky-0.058
ock-0.057
Token They
Token ID1119
Feature activation+1.44e-02

Pos logit contributions
 pulp+0.023
 fingers+0.022
ses+0.021
 stem+0.020
sed+0.020
Neg logit contributions
mother-0.023
ock-0.022
 Beth-0.022
reating-0.022
ie-0.022
Token smile
Token ID8212
Feature activation+1.79e-02

Pos logit contributions
 pulp+0.025
 fingers+0.024
ses+0.023
sed+0.022
 stem+0.022
Neg logit contributions
mother-0.026
ock-0.026
 Beth-0.026
reating-0.025
ie-0.025
Token and
Token ID290
Feature activation-8.94e-03

Pos logit contributions
 pulp+0.041
 stem+0.038
 fingers+0.037
ses+0.037
sed+0.035
Neg logit contributions
mother-0.023
 Beth-0.023
ock-0.023
reating-0.023
ie-0.022
Token high
Token ID1029
Feature activation-1.20e-02

Pos logit contributions
 Beth+0.014
mother+0.014
ock+0.014
reating+0.013
ad+0.013
Neg logit contributions
 pulp-0.018
 fingers-0.016
ses-0.016
sed-0.016
 stem-0.016
Token-
Token ID12
Feature activation-1.28e-01

Pos logit contributions
mother+0.016
ock+0.016
ie+0.016
 Beth+0.016
reating+0.016
Neg logit contributions
 pulp-0.027
 fingers-0.025
ses-0.024
 stem-0.024
sed-0.023
Tokenfive
Token ID13261
Feature activation-6.28e-03

Pos logit contributions
mother+0.280
 Beth+0.278
reating+0.269
 operation+0.266
ock+0.263
Neg logit contributions
 pulp-0.174
 fingers-0.164
ses-0.155
 stem-0.149
sed-0.146
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
mother+0.010
ock+0.010
reating+0.010
 Beth+0.010
ie+0.009
Neg logit contributions
 pulp-0.013
 fingers-0.012
ses-0.012
 stem-0.011
sed-0.011
Token Ben
Token ID3932
Feature activation+1.65e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
 stem+0.020
 finger+0.020
ses+0.020
Neg logit contributions
 Beth-0.016
reating-0.014
mother-0.014
ock-0.014
 him-0.014
Token says
Token ID1139
Feature activation-2.21e-02

Pos logit contributions
 pulp+0.025
 fingers+0.024
ses+0.023
 stem+0.022
sed+0.022
Neg logit contributions
mother-0.034
 Beth-0.033
reating-0.033
ock-0.033
ie-0.032
Token,
Token ID11
Feature activation+1.95e-03

Pos logit contributions
mother+0.043
ock+0.042
reating+0.042
 Beth+0.041
ty+0.040
Neg logit contributions
 pulp-0.037
 fingers-0.037
sed-0.034
ses-0.033
 toes-0.032
Token went
Token ID1816
Feature activation+8.25e-03

Pos logit contributions
mother+0.001
ock+0.001
 Beth+0.001
reating+0.001
ie+0.001
Neg logit contributions
 pulp-0.001
 fingers-0.001
 stem-0.001
ses-0.001
 finger-0.001
Token on
Token ID319
Feature activation+4.61e-03

Pos logit contributions
 fingers+0.012
 toes+0.011
 pulp+0.011
ses+0.010
sed+0.010
Neg logit contributions
mother-0.017
ock-0.016
heart-0.016
ie-0.016
isky-0.015
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
 fingers+0.007
 pulp+0.006
 toes+0.006
ses+0.006
 finger+0.006
Neg logit contributions
ock-0.010
ie-0.010
mother-0.010
reating-0.009
isky-0.009
Token scary
Token ID14343
Feature activation-2.99e-02

Pos logit contributions
mother+0.022
reating+0.020
ock+0.020
 Beth+0.020
ad+0.020
Neg logit contributions
 pulp-0.018
 fingers-0.018
 stem-0.017
 finger-0.016
 toes-0.016
Token roller
Token ID24471
Feature activation-4.65e-02

Pos logit contributions
mother+0.040
reating+0.038
 Beth+0.036
ock+0.036
isky+0.033
Neg logit contributions
 fingers-0.067
 pulp-0.067
 stem-0.064
ses-0.064
sed-0.063
Tokenco
Token ID1073
Feature activation-1.26e-01

Pos logit contributions
mother+0.078
 Beth+0.074
reating+0.071
isky+0.068
ie+0.068
Neg logit contributions
 fingers-0.085
 pulp-0.081
ses-0.079
sed-0.079
 stem-0.078
Tokenaster
Token ID1603
Feature activation+2.08e-02

Pos logit contributions
 Beth+0.206
mother+0.206
reating+0.196
ock+0.192
 operation+0.192
Neg logit contributions
 pulp-0.231
 fingers-0.230
 stem-0.210
ses-0.209
 finger-0.204
Token ride
Token ID6594
Feature activation-2.73e-02

Pos logit contributions
 fingers+0.034
 pulp+0.033
 toes+0.031
ses+0.031
sed+0.031
Neg logit contributions
 Beth-0.038
mother-0.037
reating-0.036
ock-0.035
isky-0.035
Token.
Token ID13
Feature activation-6.62e-03

Pos logit contributions
mother+0.057
reating+0.055
ock+0.054
isky+0.053
 Beth+0.052
Neg logit contributions
 fingers-0.037
 toes-0.035
 pulp-0.031
 finger-0.030
sed-0.030
Token Tim
Token ID5045
Feature activation-1.43e-02

Pos logit contributions
mother+0.011
ock+0.011
reating+0.011
 Beth+0.011
ie+0.011
Neg logit contributions
 pulp-0.012
 fingers-0.012
ses-0.011
 stem-0.011
sed-0.011
Tokenmy
Token ID1820
Feature activation-1.25e-02

Pos logit contributions
reating+0.020
 operation+0.016
 Beth+0.016
ock+0.015
mother+0.015
Neg logit contributions
 fingers-0.033
 finger-0.031
 tongue-0.030
 necks-0.030
 hands-0.029
Token colourful
Token ID44799
Feature activation-3.31e-02

Pos logit contributions
 pulp+0.003
ses+0.003
 fingers+0.003
sed+0.003
 stem+0.003
Neg logit contributions
 Beth-0.003
ock-0.003
mother-0.003
ie-0.003
reating-0.003
Token dresses
Token ID27309
Feature activation-3.48e-02

Pos logit contributions
mother+0.065
ock+0.064
 Beth+0.064
reating+0.064
ie+0.061
Neg logit contributions
 pulp-0.053
 fingers-0.051
ses-0.048
 stem-0.047
sed-0.046
Token with
Token ID351
Feature activation+6.22e-03

Pos logit contributions
mother+0.066
 Beth+0.066
reating+0.065
ock+0.064
 operation+0.064
Neg logit contributions
 fingers-0.055
 pulp-0.051
 stem-0.050
 toes-0.049
ses-0.048
Token bows
Token ID37469
Feature activation-2.78e-02

Pos logit contributions
 fingers+0.011
 pulp+0.010
 toes+0.009
 stem+0.009
 finger+0.009
Neg logit contributions
mother-0.012
 operation-0.012
ock-0.011
isky-0.011
ty-0.011
Token and
Token ID290
Feature activation+5.00e-03

Pos logit contributions
mother+0.050
 Beth+0.049
ock+0.049
reating+0.049
 operation+0.048
Neg logit contributions
 fingers-0.048
 pulp-0.047
ses-0.044
 stem-0.044
sed-0.043
Token rib
Token ID12183
Feature activation-1.26e-01

Pos logit contributions
 pulp+0.009
 fingers+0.008
ses+0.008
 stem+0.007
sed+0.007
Neg logit contributions
 Beth-0.009
mother-0.009
reating-0.009
ock-0.009
ie-0.009
Tokenbons
Token ID23461
Feature activation-2.84e-02

Pos logit contributions
mother+0.201
reating+0.194
 Beth+0.192
ock+0.189
 operation+0.188
Neg logit contributions
 fingers-0.247
 pulp-0.245
 stem-0.227
ses-0.225
 necks-0.218
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
mother+0.055
 operation+0.053
 Beth+0.053
reating+0.052
isky+0.051
Neg logit contributions
 fingers-0.046
 pulp-0.044
 stem-0.041
 toes-0.040
ses-0.039
Token Every
Token ID3887
Feature activation+5.29e-03

Pos logit contributions
mother+0.028
reating+0.027
ock+0.026
isky+0.026
ty+0.026
Neg logit contributions
 fingers-0.027
 pulp-0.026
 stem-0.025
ses-0.025
 toes-0.024
Token day
Token ID1110
Feature activation+1.00e-02

Pos logit contributions
 fingers+0.008
 stem+0.008
 pulp+0.008
ses+0.008
 finger+0.008
Neg logit contributions
reating-0.010
mother-0.010
isky-0.010
 Beth-0.010
ock-0.009
Token she
Token ID673
Feature activation-3.52e-02

Pos logit contributions
 pulp+0.020
 fingers+0.020
 stem+0.018
ses+0.018
sed+0.018
Neg logit contributions
mother-0.016
ock-0.015
reating-0.015
 Beth-0.015
ie-0.014
Token They
Token ID1119
Feature activation+1.24e-02

Pos logit contributions
 pulp+0.007
 fingers+0.007
ses+0.006
 stem+0.006
sed+0.006
Neg logit contributions
 Beth-0.007
mother-0.007
ock-0.007
reating-0.006
 operation-0.006
Token played
Token ID2826
Feature activation-2.86e-03

Pos logit contributions
 pulp+0.022
 fingers+0.021
ses+0.020
 stem+0.020
sed+0.019
Neg logit contributions
mother-0.022
 Beth-0.022
ock-0.022
reating-0.021
ie-0.021
Token catch
Token ID4929
Feature activation+5.77e-03

Pos logit contributions
mother+0.005
reating+0.005
 Beth+0.005
ock+0.005
ie+0.005
Neg logit contributions
 pulp-0.005
 fingers-0.005
ses-0.005
 stem-0.004
sed-0.004
Token and
Token ID290
Feature activation+5.43e-03

Pos logit contributions
 fingers+0.009
 pulp+0.009
 toes+0.008
 finger+0.008
 necks+0.008
Neg logit contributions
mother-0.012
 Beth-0.011
reating-0.011
 operation-0.011
ilda-0.011
Token tug
Token ID27762
Feature activation-2.02e-02

Pos logit contributions
 pulp+0.009
 fingers+0.009
ses+0.008
 stem+0.008
sed+0.008
Neg logit contributions
mother-0.010
 Beth-0.010
ock-0.010
reating-0.010
ie-0.009
Token-
Token ID12
Feature activation-1.24e-01

Pos logit contributions
ock+0.035
ie+0.034
mother+0.033
 him+0.033
ty+0.032
Neg logit contributions
 pulp-0.038
ses-0.037
 stem-0.036
sed-0.036
 fingers-0.034
Tokenof
Token ID1659
Feature activation-4.18e-02

Pos logit contributions
mother+0.237
 Beth+0.229
 operation+0.225
reating+0.221
ie+0.214
Neg logit contributions
 pulp-0.208
 fingers-0.194
plets-0.182
ses-0.180
 stem-0.177
Token-
Token ID12
Feature activation-1.18e-01

Pos logit contributions
mother+0.079
 Beth+0.078
reating+0.078
 operation+0.075
ock+0.073
Neg logit contributions
 fingers-0.072
 pulp-0.070
 toes-0.061
 stem-0.061
 finger-0.060
Tokenwar
Token ID5767
Feature activation-2.93e-02

Pos logit contributions
mother+0.216
 Beth+0.210
 operation+0.205
reating+0.199
ie+0.191
Neg logit contributions
 pulp-0.206
 fingers-0.193
plets-0.180
ses-0.177
 stem-0.176
Token.
Token ID13
Feature activation+4.29e-03

Pos logit contributions
mother+0.058
reating+0.056
 Beth+0.056
ock+0.054
 operation+0.052
Neg logit contributions
 fingers-0.049
 pulp-0.046
 toes-0.043
 stem-0.043
ses-0.042
Token The
Token ID383
Feature activation-8.40e-03

Pos logit contributions
 pulp+0.008
 fingers+0.008
 stem+0.007
ses+0.007
sed+0.007
Neg logit contributions
 Beth-0.007
mother-0.007
ock-0.007
reating-0.007
 operation-0.006
Token wear
Token ID5806
Feature activation-3.03e-03

Pos logit contributions
mother+0.106
reating+0.103
 Beth+0.102
ock+0.099
ie+0.096
Neg logit contributions
 fingers-0.083
 pulp-0.080
 stem-0.079
ses-0.075
 finger-0.074
Token hats
Token ID23910
Feature activation+7.07e-04

Pos logit contributions
mother+0.005
reating+0.005
ock+0.005
 Beth+0.005
ie+0.005
Neg logit contributions
 pulp-0.006
 fingers-0.006
ses-0.005
 stem-0.005
sed-0.005
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 fingers+0.001
 pulp+0.001
 stem+0.001
ses+0.001
 finger+0.001
Neg logit contributions
mother-0.001
 Beth-0.001
reating-0.001
ock-0.001
isky-0.001
Token clips
Token ID19166
Feature activation-3.21e-02

Pos logit contributions
 pulp+0.019
ses+0.017
 fingers+0.017
 stem+0.017
sed+0.017
Neg logit contributions
reating-0.021
mother-0.021
ock-0.020
 Beth-0.020
 operation-0.019
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
mother+0.059
 Beth+0.058
reating+0.056
 operation+0.056
ock+0.056
Neg logit contributions
 fingers-0.055
 pulp-0.054
ses-0.049
 stem-0.049
sed-0.048
Token rib
Token ID12183
Feature activation-1.24e-01

Pos logit contributions
 pulp+0.020
ses+0.018
 fingers+0.018
sed+0.018
 stem+0.018
Neg logit contributions
ock-0.021
reating-0.021
 Beth-0.020
mother-0.020
 operation-0.020
Tokenbons
Token ID23461
Feature activation-2.67e-02

Pos logit contributions
mother+0.193
 Beth+0.188
reating+0.187
ock+0.182
ie+0.180
Neg logit contributions
 pulp-0.245
 fingers-0.245
ses-0.229
 stem-0.226
sed-0.220
Token.
Token ID13
Feature activation-8.16e-05

Pos logit contributions
mother+0.051
 Beth+0.050
isky+0.048
 operation+0.048
reating+0.048
Neg logit contributions
 fingers-0.043
 pulp-0.042
 stem-0.039
ses-0.038
 toes-0.037
Token You
Token ID921
Feature activation-1.06e-03

Pos logit contributions
mother+0.000
ock+0.000
reating+0.000
 Beth+0.000
ie+0.000
Neg logit contributions
 pulp-0.000
 fingers-0.000
ses-0.000
 stem-0.000
sed-0.000
Token can
Token ID460
Feature activation-5.53e-02

Pos logit contributions
mother+0.002
reating+0.002
 Beth+0.002
ock+0.002
ie+0.002
Neg logit contributions
 fingers-0.002
 pulp-0.002
ses-0.002
 stem-0.002
 finger-0.002
Token be
Token ID307
Feature activation+6.29e-03

Pos logit contributions
mother+0.112
 Beth+0.108
reating+0.108
ock+0.104
ie+0.101
Neg logit contributions
 fingers-0.084
 stem-0.079
 pulp-0.078
 finger-0.074
ses-0.073
Token colourful
Token ID44799
Feature activation-1.06e-02

Pos logit contributions
 pulp+0.018
 fingers+0.018
ses+0.017
 stem+0.017
sed+0.016
Neg logit contributions
mother-0.020
ock-0.019
reating-0.019
 Beth-0.019
ie-0.019
Token and
Token ID290
Feature activation+3.69e-03

Pos logit contributions
ock+0.021
mother+0.021
reating+0.020
 Beth+0.020
 operation+0.020
Neg logit contributions
 fingers-0.018
 pulp-0.017
 toes-0.015
 stem-0.015
 finger-0.014
Token so
Token ID523
Feature activation+6.68e-03

Pos logit contributions
 fingers+0.006
 pulp+0.005
 stem+0.005
ses+0.005
sed+0.005
Neg logit contributions
mother-0.008
ock-0.008
reating-0.008
ie-0.007
 operation-0.007
Token very
Token ID845
Feature activation-3.78e-02

Pos logit contributions
 fingers+0.011
 pulp+0.010
 toes+0.009
 stem+0.009
ses+0.009
Neg logit contributions
mother-0.014
ock-0.014
reating-0.013
 operation-0.013
ad-0.012
Token high
Token ID1029
Feature activation-1.07e-02

Pos logit contributions
mother+0.069
ock+0.068
reating+0.066
 Beth+0.064
 operation+0.062
Neg logit contributions
 pulp-0.068
 fingers-0.067
 stem-0.062
ses-0.061
sed-0.060
Token-
Token ID12
Feature activation-1.24e-01

Pos logit contributions
mother+0.019
reating+0.019
ock+0.019
 Beth+0.018
 operation+0.017
Neg logit contributions
 pulp-0.019
 fingers-0.019
 stem-0.017
ses-0.017
sed-0.017
Tokenquality
Token ID13237
Feature activation-6.39e-02

Pos logit contributions
mother+0.219
 operation+0.217
 Beth+0.212
reating+0.209
ock+0.204
Neg logit contributions
 pulp-0.199
plets-0.187
 fingers-0.184
cious-0.179
 stem-0.178
Token.
Token ID13
Feature activation-4.93e-03

Pos logit contributions
reating+0.128
mother+0.127
ock+0.124
isky+0.119
ad+0.119
Neg logit contributions
 fingers-0.098
 toes-0.087
 pulp-0.085
 stem-0.081
 tails-0.076
Token The
Token ID383
Feature activation-8.28e-03

Pos logit contributions
ock+0.009
mother+0.009
reating+0.009
ie+0.009
 operation+0.009
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.008
 stem-0.008
sed-0.007
Token girl
Token ID2576
Feature activation+7.76e-03

Pos logit contributions
ock+0.017
mother+0.017
reating+0.017
 Beth+0.017
ad+0.016
Neg logit contributions
 pulp-0.012
 fingers-0.012
 stem-0.011
ses-0.011
 finger-0.010
Token was
Token ID373
Feature activation-9.56e-03

Pos logit contributions
 fingers+0.013
 pulp+0.013
 stem+0.012
sed+0.012
ses+0.012
Neg logit contributions
reating-0.015
ock-0.015
mother-0.014
 Beth-0.014
 operation-0.014
Tokenbo
Token ID2127
Feature activation+2.18e-02

Pos logit contributions
mother+0.142
 Beth+0.142
reating+0.140
 operation+0.134
isky+0.130
Neg logit contributions
 pulp-0.172
 fingers-0.158
 stem-0.153
ses-0.151
 finger-0.149
Tokeno
Token ID78
Feature activation+5.39e-04

Pos logit contributions
 pulp+0.057
 fingers+0.054
ses+0.052
 stem+0.052
sed+0.051
Neg logit contributions
mother-0.022
ock-0.022
ie-0.021
 Beth-0.020
reating-0.020
Token and
Token ID290
Feature activation+2.06e-03

Pos logit contributions
 fingers+0.001
 pulp+0.001
ses+0.001
 finger+0.001
 stem+0.001
Neg logit contributions
reating-0.001
mother-0.001
 Beth-0.001
isky-0.001
ock-0.001
Token put
Token ID1234
Feature activation-2.50e-02

Pos logit contributions
 pulp+0.004
 fingers+0.003
ses+0.003
 stem+0.003
sed+0.003
Neg logit contributions
mother-0.004
ock-0.004
reating-0.004
 Beth-0.004
ie-0.004
Token a
Token ID257
Feature activation+1.32e-03

Pos logit contributions
reating+0.039
isky+0.038
mother+0.038
ock+0.037
 Beth+0.036
Neg logit contributions
 pulp-0.051
 fingers-0.051
ses-0.047
 stem-0.046
 finger-0.046
Token band
Token ID4097
Feature activation-1.23e-01

Pos logit contributions
 pulp+0.003
 fingers+0.003
ses+0.003
sed+0.003
 stem+0.003
Neg logit contributions
ock-0.001
mother-0.001
 Beth-0.001
 operation-0.001
reating-0.001
Token-
Token ID12
Feature activation-9.38e-02

Pos logit contributions
mother+0.134
ock+0.132
 Beth+0.127
reating+0.126
ie+0.122
Neg logit contributions
 pulp-0.313
 fingers-0.300
ses-0.292
 stem-0.287
sed-0.285
Tokenaid
Token ID1698
Feature activation-6.81e-02

Pos logit contributions
mother+0.128
 Beth+0.124
reating+0.119
ock+0.115
 operation+0.113
Neg logit contributions
 pulp-0.208
 fingers-0.201
ses-0.188
 stem-0.187
 finger-0.184
Token on
Token ID319
Feature activation+1.11e-02

Pos logit contributions
mother+0.126
 Beth+0.124
reating+0.123
ock+0.122
ie+0.118
Neg logit contributions
 pulp-0.117
 fingers-0.113
ses-0.107
 stem-0.105
sed-0.103
Token it
Token ID340
Feature activation-2.46e-02

Pos logit contributions
 pulp+0.024
 fingers+0.023
ses+0.022
 stem+0.021
sed+0.021
Neg logit contributions
mother-0.016
 Beth-0.016
ock-0.016
reating-0.016
ie-0.015
Token.
Token ID13
Feature activation+5.96e-03

Pos logit contributions
mother+0.048
reating+0.048
ock+0.046
 Beth+0.046
isky+0.045
Neg logit contributions
 fingers-0.041
 pulp-0.037
 stem-0.037
 finger-0.036
ses-0.035
Token.
Token ID13
Feature activation-1.95e-03

Pos logit contributions
mother+0.079
reating+0.071
ock+0.070
isky+0.070
Ell+0.067
Neg logit contributions
 fingers-0.096
 finger-0.086
 toes-0.085
 mouth-0.082
 tongue-0.082
Token He
Token ID679
Feature activation+1.39e-02

Pos logit contributions
ie+0.004
mother+0.004
reating+0.004
ock+0.004
ty+0.004
Neg logit contributions
 fingers-0.003
 pulp-0.003
ses-0.003
sed-0.003
 stem-0.003
Token fet
Token ID11351
Feature activation-5.17e-02

Pos logit contributions
 pulp+0.019
 fingers+0.019
 finger+0.017
sed+0.017
 stem+0.017
Neg logit contributions
reating-0.033
ock-0.031
mother-0.030
 Beth-0.029
 operation-0.029
Tokenched
Token ID1740
Feature activation-5.39e-03

Pos logit contributions
mother+0.110
reating+0.108
 Beth+0.102
isky+0.100
 operation+0.099
Neg logit contributions
 fingers-0.072
 pulp-0.071
ses-0.065
 stem-0.061
plets-0.061
Token a
Token ID257
Feature activation-4.19e-03

Pos logit contributions
mother+0.010
reating+0.009
ock+0.009
isky+0.009
ie+0.009
Neg logit contributions
 fingers-0.010
 pulp-0.009
ses-0.009
 finger-0.009
 toes-0.009
Token band
Token ID4097
Feature activation-1.23e-01

Pos logit contributions
mother+0.009
ock+0.008
 Beth+0.008
reating+0.008
ie+0.008
Neg logit contributions
 pulp-0.006
 fingers-0.006
ses-0.006
 stem-0.006
sed-0.006
Tokenage
Token ID496
Feature activation-7.24e-02

Pos logit contributions
mother+0.164
reating+0.160
 Beth+0.158
ock+0.156
ie+0.147
Neg logit contributions
 pulp-0.272
 fingers-0.268
ses-0.256
 stem-0.253
sed-0.249
Token he
Token ID339
Feature activation-3.62e-03

Pos logit contributions
reating+0.157
mother+0.155
 Beth+0.150
isky+0.150
ock+0.144
Neg logit contributions
 fingers-0.099
 pulp-0.089
 stem-0.088
 finger-0.087
ses-0.086
Token found
Token ID1043
Feature activation-1.21e-02

Pos logit contributions
mother+0.007
reating+0.007
ock+0.007
 Beth+0.007
ie+0.006
Neg logit contributions
 pulp-0.006
 fingers-0.006
ses-0.006
sed-0.005
 stem-0.005
Token in
Token ID287
Feature activation+4.19e-02

Pos logit contributions
reating+0.023
mother+0.022
ock+0.022
 Beth+0.022
isky+0.022
Neg logit contributions
 fingers-0.020
 pulp-0.019
 finger-0.018
ses-0.018
sed-0.018
Token the
Token ID262
Feature activation-1.25e-03

Pos logit contributions
 fingers+0.071
 pulp+0.068
ses+0.063
 stem+0.062
 finger+0.062
Neg logit contributions
ock-0.080
reating-0.079
mother-0.078
ie-0.078
isky-0.077
Token But
Token ID887
Feature activation-2.32e-02

Pos logit contributions
 pulp+0.011
 fingers+0.010
ses+0.010
 stem+0.010
sed+0.010
Neg logit contributions
mother-0.012
ock-0.012
reating-0.012
 Beth-0.012
ie-0.012
Token remember
Token ID3505
Feature activation+1.06e-02

Pos logit contributions
mother+0.041
ock+0.040
reating+0.040
 Beth+0.039
ie+0.038
Neg logit contributions
 fingers-0.042
 pulp-0.041
ses-0.039
 stem-0.038
sed-0.038
Token,
Token ID11
Feature activation+3.06e-03

Pos logit contributions
 fingers+0.018
 pulp+0.017
ses+0.016
 stem+0.016
sed+0.016
Neg logit contributions
mother-0.020
ock-0.019
 Beth-0.019
reating-0.019
isky-0.018
Token when
Token ID618
Feature activation+4.67e-02

Pos logit contributions
 fingers+0.006
 pulp+0.006
ses+0.005
 stem+0.005
sed+0.005
Neg logit contributions
mother-0.005
ock-0.005
reating-0.005
 Beth-0.005
ie-0.005
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
 pulp+0.098
 fingers+0.096
ses+0.091
 stem+0.090
sed+0.088
Neg logit contributions
mother-0.070
ock-0.068
reating-0.067
 Beth-0.066
ie-0.063
Token time
Token ID640
Feature activation+6.18e-02

Pos logit contributions
 pulp+0.021
 fingers+0.020
ses+0.019
 stem+0.019
sed+0.019
Neg logit contributions
mother-0.022
 Beth-0.021
reating-0.021
ock-0.021
ie-0.020
Token is
Token ID318
Feature activation-2.44e-02

Pos logit contributions
 fingers+0.095
 pulp+0.093
ses+0.084
 stem+0.080
 toes+0.080
Neg logit contributions
mother-0.126
ock-0.123
 Beth-0.122
reating-0.120
isky-0.119
Token up
Token ID510
Feature activation+8.25e-03

Pos logit contributions
mother+0.043
 Beth+0.041
ock+0.041
reating+0.041
ie+0.039
Neg logit contributions
 pulp-0.045
 fingers-0.044
ses-0.042
 stem-0.041
sed-0.040
Token,
Token ID11
Feature activation+7.35e-03

Pos logit contributions
 pulp+0.017
 fingers+0.016
ses+0.016
 stem+0.015
sed+0.015
Neg logit contributions
mother-0.013
ock-0.012
 Beth-0.012
reating-0.012
ie-0.012
Token you
Token ID345
Feature activation+5.94e-03

Pos logit contributions
 pulp+0.014
 fingers+0.014
ses+0.013
 stem+0.013
sed+0.012
Neg logit contributions
mother-0.012
ock-0.012
reating-0.012
 Beth-0.012
ie-0.011
Token have
Token ID423
Feature activation+2.40e-03

Pos logit contributions
 fingers+0.009
 pulp+0.009
ses+0.009
 stem+0.008
sed+0.008
Neg logit contributions
reating-0.012
mother-0.012
 Beth-0.011
ock-0.011
ie-0.011
Token the
Token ID262
Feature activation+2.45e-02

Pos logit contributions
 pulp+0.028
 fingers+0.028
 toes+0.024
 finger+0.023
ses+0.023
Neg logit contributions
ock-0.046
mother-0.046
reating-0.044
isky-0.044
 Beth-0.043
Token store
Token ID3650
Feature activation+1.03e-02

Pos logit contributions
 pulp+0.051
 fingers+0.049
 stem+0.047
ses+0.046
sed+0.046
Neg logit contributions
ock-0.039
mother-0.038
 Beth-0.038
ie-0.037
reating-0.036
Token lady
Token ID10846
Feature activation+2.33e-02

Pos logit contributions
 fingers+0.015
 pulp+0.015
 finger+0.014
sed+0.014
ses+0.014
Neg logit contributions
ock-0.021
 Beth-0.019
mother-0.018
 operation-0.018
ie-0.018
Token where
Token ID810
Feature activation+3.68e-03

Pos logit contributions
 fingers+0.038
 pulp+0.036
 finger+0.034
 stem+0.033
 toes+0.032
Neg logit contributions
ock-0.043
 Beth-0.041
mother-0.039
ad-0.039
 operation-0.039
Token the
Token ID262
Feature activation+2.90e-02

Pos logit contributions
 pulp+0.005
 fingers+0.005
 stem+0.004
ses+0.004
 finger+0.004
Neg logit contributions
ock-0.008
mother-0.008
reating-0.008
isky-0.008
ty-0.007
Token vanilla
Token ID16858
Feature activation+5.71e-02

Pos logit contributions
 pulp+0.050
 fingers+0.048
ses+0.046
 stem+0.044
sed+0.044
Neg logit contributions
mother-0.054
 Beth-0.053
ock-0.053
reating-0.053
ie-0.051
Token was
Token ID373
Feature activation+1.32e-02

Pos logit contributions
 pulp+0.109
 fingers+0.105
ses+0.100
 stem+0.098
sed+0.097
Neg logit contributions
mother-0.095
 Beth-0.092
ock-0.092
reating-0.092
ie-0.089
Token.
Token ID13
Feature activation+9.77e-03

Pos logit contributions
 pulp+0.026
 fingers+0.025
ses+0.024
 stem+0.024
sed+0.023
Neg logit contributions
mother-0.021
ock-0.021
 Beth-0.020
reating-0.020
ie-0.019
Token She
Token ID1375
Feature activation-1.01e-02

Pos logit contributions
 pulp+0.018
 fingers+0.017
ses+0.016
 stem+0.016
sed+0.016
Neg logit contributions
mother-0.017
ock-0.017
reating-0.017
 Beth-0.017
ie-0.016
Token pointed
Token ID6235
Feature activation+3.79e-02

Pos logit contributions
reating+0.021
ock+0.021
mother+0.020
 Beth+0.020
 operation+0.020
Neg logit contributions
 pulp-0.015
 fingers-0.015
sed-0.014
ses-0.013
 stem-0.013
Token to
Token ID284
Feature activation+1.53e-02

Pos logit contributions
 pulp+0.082
 fingers+0.081
ses+0.075
 stem+0.075
sed+0.074
Neg logit contributions
mother-0.054
ock-0.053
reating-0.052
 Beth-0.052
 operation-0.049
Token her
Token ID607
Feature activation+2.55e-02

Pos logit contributions
mother+0.017
ock+0.017
isky+0.016
reating+0.016
 Beth+0.015
Neg logit contributions
 pulp-0.026
 fingers-0.025
 stem-0.024
ses-0.023
 finger-0.023
Token t
Token ID256
Feature activation+7.97e-03

Pos logit contributions
 pulp+0.043
 fingers+0.041
ses+0.039
 stem+0.038
sed+0.037
Neg logit contributions
mother-0.049
 Beth-0.048
ock-0.047
reating-0.047
ie-0.046
Tokeneddy
Token ID21874
Feature activation+7.47e-03

Pos logit contributions
 fingers+0.018
 pulp+0.018
ses+0.017
sed+0.017
 stem+0.016
Neg logit contributions
mother-0.009
 Beth-0.009
ock-0.009
reating-0.009
 operation-0.009
Token bear
Token ID6842
Feature activation+6.87e-03

Pos logit contributions
 fingers+0.015
 pulp+0.015
ses+0.014
sed+0.014
 stem+0.014
Neg logit contributions
reating-0.012
ock-0.011
isky-0.011
mother-0.011
 Beth-0.010
Token's
Token ID338
Feature activation+3.40e-02

Pos logit contributions
 fingers+0.012
 pulp+0.011
ses+0.011
sed+0.011
 finger+0.010
Neg logit contributions
reating-0.013
ock-0.013
mother-0.013
 Beth-0.012
isky-0.012
Token ear
Token ID1027
Feature activation+6.49e-02

Pos logit contributions
 pulp+0.052
 fingers+0.051
ses+0.047
 stem+0.046
sed+0.046
Neg logit contributions
mother-0.070
ock-0.069
reating-0.069
 Beth-0.068
ie-0.065
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
 pulp+0.133
 fingers+0.128
ses+0.123
 stem+0.120
sed+0.119
Neg logit contributions
mother-0.100
 Beth-0.097
ock-0.096
reating-0.096
ie-0.093
Token not
Token ID407
Feature activation+4.43e-02

Pos logit contributions
 pulp+0.020
 fingers+0.019
 stem+0.018
ses+0.018
sed+0.017
Neg logit contributions
mother-0.026
ock-0.025
reating-0.025
 Beth-0.025
ie-0.024
Token fine
Token ID3734
Feature activation+3.94e-03

Pos logit contributions
 pulp+0.078
 fingers+0.076
 stem+0.070
ses+0.069
 finger+0.068
Neg logit contributions
mother-0.082
reating-0.078
 Beth-0.078
ock-0.078
isky-0.074
Token.
Token ID13
Feature activation+3.26e-03

Pos logit contributions
 fingers+0.007
 pulp+0.007
 toes+0.006
 tongue+0.006
ses+0.006
Neg logit contributions
mother-0.007
 Beth-0.007
ock-0.007
isky-0.007
ad-0.006
Token It
Token ID632
Feature activation-1.78e-02

Pos logit contributions
 pulp+0.006
 fingers+0.006
 stem+0.006
ses+0.006
sed+0.006
Neg logit contributions
ock-0.005
mother-0.005
reating-0.005
ie-0.005
isky-0.005
Token who
Token ID508
Feature activation-1.69e-03

Pos logit contributions
 fingers+0.037
 toes+0.032
 pulp+0.032
 finger+0.031
 hands+0.031
Neg logit contributions
mother-0.040
ock-0.038
ty-0.037
reating-0.037
isky-0.037
Token needed
Token ID2622
Feature activation+7.68e-03

Pos logit contributions
reating+0.003
mother+0.003
ock+0.003
 Beth+0.003
ie+0.003
Neg logit contributions
 pulp-0.003
 fingers-0.003
ses-0.003
 stem-0.003
sed-0.003
Token help
Token ID1037
Feature activation-4.49e-02

Pos logit contributions
 fingers+0.013
 pulp+0.013
 stem+0.012
ses+0.012
 finger+0.012
Neg logit contributions
mother-0.014
reating-0.014
ock-0.014
isky-0.013
ie-0.013
Token,
Token ID11
Feature activation+5.50e-03

Pos logit contributions
mother+0.092
reating+0.088
ock+0.087
ad+0.085
isky+0.084
Neg logit contributions
 fingers-0.069
 pulp-0.062
 toes-0.061
 finger-0.059
 stem-0.059
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 fingers+0.010
 pulp+0.010
 stem+0.008
 toes+0.008
ses+0.008
Neg logit contributions
mother-0.010
ock-0.010
reating-0.010
ie-0.009
isky-0.009
Token little
Token ID1310
Feature activation+6.60e-02

Pos logit contributions
 pulp+0.021
 fingers+0.020
ses+0.019
sed+0.018
 stem+0.018
Neg logit contributions
mother-0.018
ock-0.017
 operation-0.017
reating-0.017
 Beth-0.017
Token bird
Token ID6512
Feature activation+5.96e-03

Pos logit contributions
 pulp+0.118
 fingers+0.114
ses+0.109
 stem+0.106
sed+0.106
Neg logit contributions
mother-0.118
ock-0.115
 Beth-0.114
reating-0.113
ie-0.111
Token would
Token ID561
Feature activation-2.51e-02

Pos logit contributions
 pulp+0.010
 fingers+0.010
 stem+0.010
 finger+0.009
ses+0.009
Neg logit contributions
reating-0.011
ock-0.010
 Beth-0.010
mother-0.010
isky-0.010
Token be
Token ID307
Feature activation+8.59e-03

Pos logit contributions
reating+0.044
mother+0.043
ock+0.042
 Beth+0.042
ie+0.040
Neg logit contributions
 pulp-0.044
 stem-0.044
 fingers-0.042
ses-0.041
 finger-0.040
Token there
Token ID612
Feature activation+1.95e-03

Pos logit contributions
 pulp+0.016
 fingers+0.015
 stem+0.014
 finger+0.014
ses+0.014
Neg logit contributions
reating-0.015
mother-0.015
ock-0.015
 Beth-0.014
isky-0.014
Token and
Token ID290
Feature activation+5.14e-03

Pos logit contributions
 fingers+0.004
 pulp+0.003
 stem+0.003
ses+0.003
 toes+0.003
Neg logit contributions
ock-0.003
mother-0.003
reating-0.003
 Beth-0.003
isky-0.003
Token
Token ID198
Feature activation-9.00e-03

Pos logit contributions
mother+0.036
reating+0.035
ock+0.034
 Beth+0.034
ie+0.033
Neg logit contributions
 pulp-0.029
 fingers-0.028
ses-0.027
 stem-0.026
sed-0.026
TokenTom
Token ID13787
Feature activation-1.25e-03

Pos logit contributions
mother+0.016
ock+0.016
 Beth+0.016
reating+0.016
ie+0.015
Neg logit contributions
 pulp-0.016
 fingers-0.016
ses-0.015
 stem-0.014
sed-0.014
Token and
Token ID290
Feature activation+1.02e-03

Pos logit contributions
mother+0.002
reating+0.002
 Beth+0.002
ock+0.002
 operation+0.002
Neg logit contributions
 pulp-0.002
 fingers-0.002
ses-0.002
sed-0.002
 stem-0.002
Token Mia
Token ID32189
Feature activation-2.45e-02

Pos logit contributions
 fingers+0.001
 pulp+0.001
ses+0.001
 stem+0.001
 toes+0.001
Neg logit contributions
mother-0.002
reating-0.002
ock-0.002
 operation-0.002
 Beth-0.002
Token are
Token ID389
Feature activation-1.12e-02

Pos logit contributions
mother+0.046
ock+0.045
 Beth+0.045
reating+0.045
ie+0.043
Neg logit contributions
 pulp-0.042
 fingers-0.040
ses-0.038
 stem-0.037
sed-0.036
Token sad
Token ID6507
Feature activation+5.38e-02

Pos logit contributions
mother+0.022
 Beth+0.021
ock+0.021
reating+0.021
ie+0.020
Neg logit contributions
 pulp-0.018
 fingers-0.017
ses-0.016
 stem-0.016
sed-0.016
Token and
Token ID290
Feature activation+4.05e-03

Pos logit contributions
 pulp+0.107
 fingers+0.104
ses+0.099
 stem+0.097
sed+0.096
Neg logit contributions
mother-0.086
 Beth-0.083
ock-0.083
reating-0.083
ie-0.079
Token angry
Token ID7954
Feature activation+2.06e-02

Pos logit contributions
 pulp+0.006
 fingers+0.006
ses+0.006
 stem+0.006
sed+0.005
Neg logit contributions
mother-0.008
 Beth-0.008
ock-0.008
reating-0.008
ie-0.008
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
 fingers+0.038
 pulp+0.037
ses+0.035
sed+0.034
 stem+0.033
Neg logit contributions
mother-0.036
 Beth-0.035
reating-0.035
ock-0.035
ie-0.033
Token They
Token ID1119
Feature activation+9.27e-03

Pos logit contributions
 pulp+0.004
 fingers+0.004
 stem+0.004
ses+0.004
sed+0.004
Neg logit contributions
 Beth-0.004
mother-0.004
ock-0.004
reating-0.004
ie-0.004
Token want
Token ID765
Feature activation+3.75e-02

Pos logit contributions
 pulp+0.015
 fingers+0.014
ses+0.014
 stem+0.013
sed+0.013
Neg logit contributions
mother-0.018
ock-0.018
 Beth-0.018
reating-0.018
ie-0.017
Token see
Token ID766
Feature activation-4.74e-03

Pos logit contributions
mother+0.056
 Beth+0.055
ock+0.054
ad+0.054
reating+0.054
Neg logit contributions
 pulp-0.053
 fingers-0.050
ses-0.049
sed-0.047
 stem-0.045
Token them
Token ID606
Feature activation+1.95e-02

Pos logit contributions
mother+0.007
 operation+0.007
isky+0.007
ock+0.007
 Beth+0.007
Neg logit contributions
 pulp-0.009
 fingers-0.009
 finger-0.008
 stem-0.008
 mouths-0.008
Token eat
Token ID4483
Feature activation+3.03e-02

Pos logit contributions
 pulp+0.038
 fingers+0.036
ses+0.035
 stem+0.034
sed+0.034
Neg logit contributions
mother-0.032
ock-0.031
reating-0.031
 Beth-0.031
ie-0.030
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 pulp+0.053
 fingers+0.050
 stem+0.046
 toes+0.044
 tongue+0.044
Neg logit contributions
mother-0.059
 Beth-0.057
ock-0.055
reating-0.055
isky-0.055
Token smile
Token ID8212
Feature activation+1.21e-02

Pos logit contributions
 pulp+0.017
 fingers+0.016
ses+0.016
sed+0.015
 stem+0.015
Neg logit contributions
 Beth-0.019
mother-0.018
reating-0.018
ie-0.018
ock-0.018
Token.
Token ID13
Feature activation+4.29e-03

Pos logit contributions
 pulp+0.026
 fingers+0.025
ses+0.024
 stem+0.024
sed+0.024
Neg logit contributions
mother-0.017
 Beth-0.016
reating-0.016
ock-0.016
ie-0.016
Token
Token ID198
Feature activation+1.99e-03

Pos logit contributions
 pulp+0.009
 fingers+0.008
ses+0.008
 stem+0.008
sed+0.008
Neg logit contributions
 Beth-0.007
mother-0.007
ock-0.006
reating-0.006
ie-0.006
Token
Token ID198
Feature activation+5.79e-03

Pos logit contributions
 pulp+0.003
 fingers+0.003
ses+0.003
 stem+0.003
sed+0.003
Neg logit contributions
mother-0.004
reating-0.004
 Beth-0.004
ock-0.004
 operation-0.003
TokenOne
Token ID3198
Feature activation+3.74e-02

Pos logit contributions
 pulp+0.011
 fingers+0.010
 stem+0.010
ses+0.010
sed+0.009
Neg logit contributions
mother-0.010
 Beth-0.010
reating-0.010
ock-0.010
ie-0.009
Token day
Token ID1110
Feature activation+2.90e-02

Pos logit contributions
 pulp+0.099
ses+0.094
 fingers+0.093
sed+0.093
 stem+0.089
Neg logit contributions
ie-0.040
ock-0.039
mother-0.037
 operation-0.035
reating-0.035
Token,
Token ID11
Feature activation+6.73e-03

Pos logit contributions
 pulp+0.072
ses+0.069
 fingers+0.069
 stem+0.067
sed+0.066
Neg logit contributions
 Beth-0.034
mother-0.031
ie-0.029
ock-0.029
reating-0.028
Token the
Token ID262
Feature activation+4.56e-03

Pos logit contributions
 fingers+0.037
 pulp+0.035
 toes+0.031
 finger+0.031
 tongue+0.031
Neg logit contributions
reating-0.043
ock-0.043
mother-0.042
isky-0.042
ie-0.042
Token same
Token ID976
Feature activation+2.65e-02

Pos logit contributions
 pulp+0.008
 fingers+0.008
ses+0.007
 stem+0.007
sed+0.007
Neg logit contributions
mother-0.008
reating-0.008
ock-0.008
 Beth-0.008
ie-0.008
Token joy
Token ID8716
Feature activation+9.90e-03

Pos logit contributions
 fingers+0.046
 pulp+0.046
 stem+0.042
ses+0.042
 finger+0.041
Neg logit contributions
reating-0.048
mother-0.048
 Beth-0.047
ock-0.047
isky-0.045
Token that
Token ID326
Feature activation-1.65e-02

Pos logit contributions
 fingers+0.017
 pulp+0.016
 toes+0.016
 stem+0.015
 tongue+0.015
Neg logit contributions
mother-0.017
isky-0.017
 operation-0.016
 Beth-0.016
reating-0.016
Token he
Token ID339
Feature activation+1.48e-02

Pos logit contributions
reating+0.031
mother+0.030
ock+0.029
isky+0.028
 operation+0.028
Neg logit contributions
 fingers-0.028
 pulp-0.028
ses-0.026
 toes-0.025
 stem-0.025
Token felt
Token ID2936
Feature activation+1.95e-02

Pos logit contributions
 pulp+0.020
 fingers+0.020
ses+0.018
sed+0.018
 finger+0.018
Neg logit contributions
reating-0.033
ock-0.030
 Beth-0.030
mother-0.030
 operation-0.029
Token.
Token ID13
Feature activation-7.01e-03

Pos logit contributions
 pulp+0.038
 fingers+0.037
 stem+0.035
ses+0.034
sed+0.034
Neg logit contributions
mother-0.031
reating-0.031
ock-0.030
 Beth-0.030
ie-0.029
Token<|endoftext|>
Token ID50256
Feature activation+1.02e-02

Pos logit contributions
mother+0.012
reating+0.012
ock+0.012
 Beth+0.012
ie+0.012
Neg logit contributions
 pulp-0.012
 fingers-0.012
ses-0.012
 stem-0.011
sed-0.011
TokenOnce
Token ID7454
Feature activation+2.78e-02

Pos logit contributions
 fingers+0.010
 pulp+0.010
ses+0.009
 stem+0.009
sed+0.008
Neg logit contributions
reating-0.025
mother-0.024
ock-0.024
isky-0.024
 operation-0.024
Token upon
Token ID2402
Feature activation-2.17e-03

Pos logit contributions
 fingers+0.041
 pulp+0.041
ses+0.038
 stem+0.037
sed+0.036
Neg logit contributions
mother-0.056
reating-0.055
ock-0.054
 Beth-0.053
 operation-0.053
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
mother+0.004
ock+0.004
 Beth+0.004
reating+0.004
ie+0.003
Neg logit contributions
 pulp-0.004
 fingers-0.004
ses-0.004
 stem-0.004
sed-0.004
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
 pulp+0.053
 fingers+0.052
ses+0.048
 stem+0.048
sed+0.046
Neg logit contributions
mother-0.050
ock-0.049
reating-0.048
 Beth-0.048
ie-0.048
Token sky
Token ID6766
Feature activation+8.93e-04

Pos logit contributions
mother+0.032
reating+0.032
 Beth+0.032
ock+0.031
ie+0.031
Neg logit contributions
 pulp-0.022
 fingers-0.021
 stem-0.020
ses-0.019
sed-0.018
Token!
Token ID0
Feature activation+5.90e-03

Pos logit contributions
 pulp+0.002
 fingers+0.002
 stem+0.002
ses+0.001
sed+0.001
Neg logit contributions
mother-0.002
reating-0.001
ock-0.001
 Beth-0.001
 operation-0.001
Token<|endoftext|>
Token ID50256
Feature activation+6.22e-03

Pos logit contributions
 pulp+0.011
 fingers+0.011
ses+0.010
 stem+0.010
sed+0.010
Neg logit contributions
 Beth-0.010
mother-0.010
reating-0.009
ock-0.009
ie-0.009
TokenOnce
Token ID7454
Feature activation+2.81e-02

Pos logit contributions
 pulp+0.007
 fingers+0.007
ses+0.007
 stem+0.006
sed+0.006
Neg logit contributions
reating-0.014
mother-0.014
ock-0.014
 operation-0.014
isky-0.014
Token upon
Token ID2402
Feature activation-2.78e-03

Pos logit contributions
 pulp+0.046
 fingers+0.045
ses+0.042
 stem+0.041
sed+0.040
Neg logit contributions
mother-0.054
reating-0.053
ock-0.052
 Beth-0.052
 operation-0.050
Token a
Token ID257
Feature activation-2.22e-02

Pos logit contributions
mother+0.005
ock+0.004
 Beth+0.004
reating+0.004
ie+0.004
Neg logit contributions
 pulp-0.005
 fingers-0.005
ses-0.005
 stem-0.005
sed-0.005
Token time
Token ID640
Feature activation+9.20e-03

Pos logit contributions
ock+0.026
mother+0.026
ie+0.026
 Beth+0.025
ty+0.025
Neg logit contributions
 pulp-0.056
 fingers-0.051
 stem-0.050
ses-0.050
sed-0.048
Token,
Token ID11
Feature activation-3.52e-02

Pos logit contributions
 pulp+0.019
 fingers+0.019
ses+0.018
 stem+0.018
sed+0.018
Neg logit contributions
mother-0.013
ock-0.013
 Beth-0.013
reating-0.013
ie-0.012
Token there
Token ID612
Feature activation-2.85e-02

Pos logit contributions
 Beth+0.048
ie+0.046
 Ellie+0.040
mother+0.039
 Bun+0.039
Neg logit contributions
 pulp-0.085
 fingers-0.078
ses-0.077
 stem-0.076
sed-0.074
Token was
Token ID373
Feature activation-2.64e-02

Pos logit contributions
mother+0.042
reating+0.041
ock+0.041
 Beth+0.040
 operation+0.038
Neg logit contributions
 pulp-0.059
 fingers-0.058
ses-0.055
 stem-0.054
sed-0.054
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
mother+0.090
 Beth+0.087
reating+0.087
isky+0.085
ock+0.084
Neg logit contributions
 fingers-0.079
 pulp-0.078
 stem-0.070
 toes-0.069
 finger-0.068
Token avoid
Token ID3368
Feature activation+4.47e-03

Pos logit contributions
 pulp+0.032
 fingers+0.031
ses+0.029
 stem+0.029
sed+0.028
Neg logit contributions
mother-0.025
reating-0.025
 Beth-0.024
ock-0.024
ie-0.023
Token wasting
Token ID24430
Feature activation-2.04e-04

Pos logit contributions
 pulp+0.008
 fingers+0.008
 stem+0.007
ses+0.007
 finger+0.007
Neg logit contributions
mother-0.008
 Beth-0.008
ock-0.008
reating-0.008
ie-0.007
Token things
Token ID1243
Feature activation-1.68e-02

Pos logit contributions
mother+0.000
 Beth+0.000
reating+0.000
ock+0.000
ie+0.000
Neg logit contributions
 pulp-0.000
 fingers-0.000
 stem-0.000
ses-0.000
sed-0.000
Token.
Token ID13
Feature activation-2.01e-03

Pos logit contributions
mother+0.028
 Beth+0.027
reating+0.027
ock+0.027
ie+0.026
Neg logit contributions
 pulp-0.032
 fingers-0.032
 stem-0.029
ses-0.029
sed-0.028
Token<|endoftext|>
Token ID50256
Feature activation+6.58e-03

Pos logit contributions
mother+0.004
ie+0.004
ock+0.004
reating+0.004
ty+0.004
Neg logit contributions
 pulp-0.003
 fingers-0.003
ses-0.003
 stem-0.003
sed-0.003
TokenOnce
Token ID7454
Feature activation+2.76e-02

Pos logit contributions
 pulp+0.008
 fingers+0.008
ses+0.007
 stem+0.007
sed+0.007
Neg logit contributions
reating-0.015
mother-0.014
isky-0.014
ock-0.014
 Beth-0.014
Token upon
Token ID2402
Feature activation-2.13e-04

Pos logit contributions
 pulp+0.046
 fingers+0.045
ses+0.042
 stem+0.041
sed+0.041
Neg logit contributions
mother-0.052
reating-0.051
ock-0.051
 Beth-0.051
 operation-0.048
Token a
Token ID257
Feature activation-2.24e-02

Pos logit contributions
mother+0.000
ock+0.000
reating+0.000
 Beth+0.000
ie+0.000
Neg logit contributions
 pulp-0.000
 fingers-0.000
ses-0.000
 stem-0.000
sed-0.000
Token time
Token ID640
Feature activation+8.08e-03

Pos logit contributions
ock+0.029
mother+0.029
ie+0.029
 Beth+0.027
 operation+0.027
Neg logit contributions
 pulp-0.053
 fingers-0.049
ses-0.048
 stem-0.047
sed-0.046
Token,
Token ID11
Feature activation-3.69e-02

Pos logit contributions
 pulp+0.017
 fingers+0.016
ses+0.016
 stem+0.015
sed+0.015
Neg logit contributions
mother-0.012
ock-0.012
reating-0.012
 Beth-0.012
ie-0.011
Tokenble
Token ID903
Feature activation+4.32e-02

Pos logit contributions
mother+0.015
ock+0.015
 Beth+0.015
reating+0.015
ie+0.015
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.007
 stem-0.007
sed-0.007
Token,
Token ID11
Feature activation+1.98e-02

Pos logit contributions
 pulp+0.081
 fingers+0.078
ses+0.075
 stem+0.074
sed+0.073
Neg logit contributions
mother-0.073
reating-0.072
 Beth-0.071
ock-0.070
 operation-0.066
Token Mom
Token ID11254
Feature activation+2.38e-02

Pos logit contributions
 pulp+0.036
 fingers+0.034
ses+0.033
 stem+0.032
sed+0.032
Neg logit contributions
mother-0.035
 Beth-0.035
ock-0.034
reating-0.034
ie-0.033
Tokenmy
Token ID1820
Feature activation+1.05e-02

Pos logit contributions
 fingers+0.060
 pulp+0.059
 stem+0.057
ses+0.055
 finger+0.055
Neg logit contributions
 Beth-0.022
mother-0.021
reating-0.021
 operation-0.020
isky-0.020
Token!
Token ID0
Feature activation+2.17e-02

Pos logit contributions
 pulp+0.018
 fingers+0.017
ses+0.016
 stem+0.016
sed+0.016
Neg logit contributions
reating-0.019
ock-0.019
mother-0.019
 Beth-0.019
isky-0.018
Token A
Token ID317
Feature activation+1.58e-02

Pos logit contributions
 pulp+0.039
 fingers+0.037
ses+0.035
 stem+0.034
sed+0.034
Neg logit contributions
 Beth-0.039
mother-0.038
reating-0.038
ock-0.037
 operation-0.036
Token big
Token ID1263
Feature activation+1.16e-02

Pos logit contributions
 pulp+0.026
 fingers+0.025
ses+0.024
sed+0.023
 stem+0.022
Neg logit contributions
mother-0.030
ock-0.030
 Beth-0.030
reating-0.029
ie-0.029
Token,
Token ID11
Feature activation+2.72e-02

Pos logit contributions
 pulp+0.021
 fingers+0.019
ses+0.019
sed+0.018
 stem+0.018
Neg logit contributions
mother-0.021
ock-0.020
 Beth-0.020
ie-0.020
reating-0.020
Token shiny
Token ID22441
Feature activation-1.15e-02

Pos logit contributions
 pulp+0.041
 fingers+0.039
ses+0.037
sed+0.036
 stem+0.036
Neg logit contributions
mother-0.054
 Beth-0.054
ock-0.053
reating-0.053
 operation-0.052
Token,
Token ID11
Feature activation+2.71e-02

Pos logit contributions
mother+0.021
ock+0.021
ie+0.021
reating+0.021
 Beth+0.020
Neg logit contributions
 pulp-0.019
 fingers-0.018
ses-0.018
sed-0.017
 stem-0.017
Token smooth
Token ID7209
Feature activation-1.23e-02

Pos logit contributions
 pulp+0.043
 fingers+0.039
ses+0.038
sed+0.037
 stem+0.036
Neg logit contributions
 Beth-0.052
mother-0.052
ock-0.051
reating-0.050
 operation-0.050
Token of
Token ID286
Feature activation-5.68e-02

Pos logit contributions
ock+0.050
reating+0.050
mother+0.048
 Beth+0.047
ie+0.046
Neg logit contributions
 pulp-0.063
ses-0.060
sed-0.059
 fingers-0.059
 stem-0.057
Token the
Token ID262
Feature activation-9.53e-03

Pos logit contributions
ock+0.101
reating+0.099
mother+0.099
isky+0.098
oot+0.095
Neg logit contributions
 fingers-0.103
 pulp-0.100
 stem-0.092
 toes-0.091
 finger-0.089
Token whip
Token ID22972
Feature activation+4.96e-04

Pos logit contributions
mother+0.018
ock+0.018
 Beth+0.018
reating+0.018
ie+0.017
Neg logit contributions
 pulp-0.016
 fingers-0.015
ses-0.015
 stem-0.014
sed-0.014
Token.
Token ID13
Feature activation-8.77e-03

Pos logit contributions
 fingers+0.001
 pulp+0.001
 stem+0.001
 finger+0.001
 toes+0.001
Neg logit contributions
mother-0.001
 Beth-0.001
reating-0.001
ock-0.001
isky-0.001
Token Daddy
Token ID26926
Feature activation+2.35e-02

Pos logit contributions
mother+0.015
ock+0.015
reating+0.015
 Beth+0.014
ie+0.014
Neg logit contributions
 pulp-0.016
 fingers-0.016
ses-0.015
 stem-0.015
sed-0.015
Token just
Token ID655
Feature activation-1.84e-02

Pos logit contributions
 fingers+0.030
 pulp+0.029
 finger+0.029
sed+0.025
 stem+0.025
Neg logit contributions
reating-0.056
ock-0.051
 Beth-0.050
 operation-0.050
isky-0.049
Token laughed
Token ID13818
Feature activation+2.58e-02

Pos logit contributions
mother+0.033
 Beth+0.032
ock+0.032
ie+0.031
reating+0.031
Neg logit contributions
 pulp-0.033
 fingers-0.031
ses-0.030
 stem-0.030
sed-0.029
Token and
Token ID290
Feature activation-5.50e-03

Pos logit contributions
 fingers+0.037
 pulp+0.034
 toes+0.032
 finger+0.031
 stem+0.030
Neg logit contributions
mother-0.056
reating-0.055
isky-0.053
ock-0.053
 Beth-0.052
Token said
Token ID531
Feature activation-1.42e-02

Pos logit contributions
reating+0.012
ock+0.011
mother+0.011
 operation+0.011
isky+0.011
Neg logit contributions
 pulp-0.008
 fingers-0.008
ses-0.007
sed-0.007
 stem-0.007
Token it
Token ID340
Feature activation-4.82e-03

Pos logit contributions
mother+0.028
ock+0.027
reating+0.027
isky+0.026
 operation+0.025
Neg logit contributions
 pulp-0.024
 fingers-0.024
 finger-0.021
 stem-0.021
 toes-0.021
Token was
Token ID373
Feature activation-1.08e-02

Pos logit contributions
reating+0.011
ock+0.010
mother+0.009
isky+0.009
 Beth+0.009
Neg logit contributions
 fingers-0.007
 pulp-0.007
 finger-0.007
sed-0.007
ses-0.007
TokenAfter
Token ID3260
Feature activation+2.64e-02

Pos logit contributions
mother+0.004
ock+0.004
 Beth+0.003
reating+0.003
ie+0.003
Neg logit contributions
 pulp-0.007
 fingers-0.007
ses-0.006
sed-0.006
 stem-0.006
Token they
Token ID484
Feature activation-1.37e-02

Pos logit contributions
 pulp+0.050
ses+0.050
 fingers+0.048
sed+0.048
 stem+0.046
Neg logit contributions
 Beth-0.043
mother-0.043
reating-0.041
 operation-0.040
ie-0.040
Token finished
Token ID5201
Feature activation+2.27e-02

Pos logit contributions
mother+0.022
ie+0.021
 Beth+0.021
ock+0.020
ad+0.020
Neg logit contributions
ses-0.027
 pulp-0.026
sed-0.026
 fingers-0.025
 stem-0.024
Token cooking
Token ID10801
Feature activation-7.36e-03

Pos logit contributions
 pulp+0.042
ses+0.041
 fingers+0.039
sed+0.039
 stem+0.037
Neg logit contributions
mother-0.039
 Beth-0.038
ie-0.038
reating-0.037
 operation-0.037
Token,
Token ID11
Feature activation+2.12e-03

Pos logit contributions
mother+0.012
reating+0.012
ock+0.012
 Beth+0.012
isky+0.011
Neg logit contributions
 pulp-0.014
 fingers-0.014
 stem-0.013
ses-0.013
sed-0.012
Token Lily
Token ID20037
Feature activation-2.41e-02

Pos logit contributions
 pulp+0.005
ses+0.005
 fingers+0.005
sed+0.005
 stem+0.005
Neg logit contributions
 Beth-0.003
mother-0.002
ie-0.002
reating-0.002
 operation-0.002
Token's
Token ID338
Feature activation+1.29e-02

Pos logit contributions
mother+0.035
ie+0.035
ty+0.033
 Beth+0.032
ock+0.031
Neg logit contributions
ses-0.052
 pulp-0.051
 fingers-0.049
sed-0.048
 stem-0.047
Token mom
Token ID1995
Feature activation+1.57e-03

Pos logit contributions
 pulp+0.025
plets+0.023
ses+0.022
 stem+0.022
 fingers+0.021
Neg logit contributions
mother-0.021
 operation-0.021
ie-0.020
 Beth-0.020
reating-0.019
Token asked
Token ID1965
Feature activation+2.01e-02

Pos logit contributions
 pulp+0.003
 fingers+0.003
 stem+0.002
ses+0.002
 finger+0.002
Neg logit contributions
reating-0.003
ock-0.003
mother-0.003
 Beth-0.003
 operation-0.003
Token her
Token ID607
Feature activation+1.69e-02

Pos logit contributions
 fingers+0.041
 pulp+0.039
 finger+0.036
 toes+0.035
 stem+0.035
Neg logit contributions
ock-0.030
mother-0.029
isky-0.029
ad-0.028
 Beth-0.027
Token to
Token ID284
Feature activation+2.15e-02

Pos logit contributions
 pulp+0.036
 fingers+0.036
 finger+0.034
 stem+0.033
ses+0.033
Neg logit contributions
ock-0.024
mother-0.023
 Beth-0.022
reating-0.022
ad-0.021
Token end
Token ID886
Feature activation+3.10e-02

Pos logit contributions
 pulp+0.004
 fingers+0.004
plets+0.004
ses+0.003
 toes+0.003
Neg logit contributions
mother-0.002
 operation-0.002
reating-0.002
ock-0.002
ad-0.002
Token.
Token ID13
Feature activation+4.07e-03

Pos logit contributions
 fingers+0.048
ses+0.044
 pulp+0.044
 finger+0.043
 stem+0.041
Neg logit contributions
mother-0.057
reating-0.057
isky-0.056
ock-0.056
 Beth-0.055
Token<|endoftext|>
Token ID50256
Feature activation+6.06e-03

Pos logit contributions
 pulp+0.009
 fingers+0.008
ses+0.008
sed+0.008
 stem+0.008
Neg logit contributions
mother-0.006
 Beth-0.006
reating-0.006
ock-0.006
ie-0.006
TokenOnce
Token ID7454
Feature activation+2.66e-02

Pos logit contributions
 pulp+0.010
 fingers+0.010
ses+0.010
 stem+0.009
sed+0.009
Neg logit contributions
mother-0.011
reating-0.011
ock-0.011
 Beth-0.011
ie-0.010
Token upon
Token ID2402
Feature activation-3.31e-03

Pos logit contributions
 pulp+0.050
 fingers+0.047
ses+0.046
 stem+0.045
sed+0.044
Neg logit contributions
mother-0.046
 Beth-0.045
ock-0.044
reating-0.044
ie-0.044
Token a
Token ID257
Feature activation-2.10e-02

Pos logit contributions
ock+0.004
reating+0.004
 him+0.004
mother+0.004
 Beth+0.004
Neg logit contributions
 pulp-0.008
 fingers-0.007
ses-0.007
sed-0.007
 stem-0.007
Token time
Token ID640
Feature activation+9.59e-03

Pos logit contributions
ock+0.025
ie+0.024
mother+0.024
ty+0.023
 operation+0.023
Neg logit contributions
 pulp-0.053
 fingers-0.048
 stem-0.047
ses-0.047
sed-0.045
Token,
Token ID11
Feature activation-3.73e-02

Pos logit contributions
 pulp+0.023
 fingers+0.022
 stem+0.021
ses+0.021
sed+0.020
Neg logit contributions
mother-0.012
 Beth-0.012
ie-0.011
ock-0.011
reating-0.011
Token there
Token ID612
Feature activation-2.76e-02

Pos logit contributions
ie+0.044
 Beth+0.040
 him+0.036
 Ellie+0.036
 Bun+0.034
Neg logit contributions
 pulp-0.098
ses-0.093
 fingers-0.093
sed-0.091
 stem-0.090
Token was
Token ID373
Feature activation-2.75e-02

Pos logit contributions
mother+0.038
 Beth+0.037
ock+0.036
reating+0.036
ie+0.035
Neg logit contributions
 pulp-0.061
 fingers-0.059
ses-0.057
 stem-0.056
sed-0.055
Token a
Token ID257
Feature activation-2.05e-03

Pos logit contributions
 Beth+0.036
reating+0.035
ock+0.035
mother+0.035
 him+0.033
Neg logit contributions
 pulp-0.063
 fingers-0.060
ses-0.059
sed-0.059
 stem-0.058
Token and
Token ID290
Feature activation+6.79e-06

Pos logit contributions
mother+0.052
 Beth+0.051
reating+0.051
ock+0.050
ie+0.048
Neg logit contributions
 pulp-0.063
 fingers-0.061
ses-0.058
 stem-0.057
sed-0.056
Token suddenly
Token ID6451
Feature activation+3.34e-02

Feature not active on prev token;
no predictions were affected.
Token he
Token ID339
Feature activation+1.23e-02

Pos logit contributions
 pulp+0.062
 fingers+0.060
ses+0.056
 stem+0.055
sed+0.055
Neg logit contributions
mother-0.058
ock-0.057
reating-0.056
 Beth-0.056
ie-0.054
Token noticed
Token ID6810
Feature activation+2.89e-02

Pos logit contributions
 pulp+0.020
 fingers+0.019
ses+0.018
 stem+0.017
sed+0.017
Neg logit contributions
mother-0.024
ock-0.024
reating-0.024
 Beth-0.024
ie-0.023
Token something
Token ID1223
Feature activation+1.82e-02

Pos logit contributions
 fingers+0.042
 pulp+0.042
 finger+0.037
 stem+0.036
 toes+0.036
Neg logit contributions
mother-0.059
reating-0.057
ock-0.056
 Beth-0.055
isky-0.055
Token amazing
Token ID4998
Feature activation-3.19e-02

Pos logit contributions
 pulp+0.032
 fingers+0.031
 stem+0.029
 finger+0.028
ses+0.028
Neg logit contributions
mother-0.034
reating-0.032
ock-0.032
 Beth-0.031
 operation-0.030
Token -
Token ID532
Feature activation+4.78e-03

Pos logit contributions
mother+0.051
 Beth+0.050
reating+0.049
ock+0.049
 operation+0.047
Neg logit contributions
 pulp-0.062
 fingers-0.061
ses-0.057
 stem-0.057
sed-0.056
Token he
Token ID339
Feature activation+2.09e-02

Pos logit contributions
 pulp+0.008
 fingers+0.008
ses+0.007
 stem+0.007
 finger+0.007
Neg logit contributions
mother-0.009
ock-0.009
reating-0.009
ie-0.008
 Beth-0.008
Token could
Token ID714
Feature activation-8.71e-03

Pos logit contributions
 pulp+0.035
 fingers+0.034
ses+0.032
 stem+0.031
sed+0.031
Neg logit contributions
mother-0.040
reating-0.039
ock-0.039
 Beth-0.039
 operation-0.037
Token see
Token ID766
Feature activation-9.99e-04

Pos logit contributions
mother+0.016
ock+0.015
 Beth+0.015
reating+0.015
ie+0.015
Neg logit contributions
 pulp-0.015
 fingers-0.015
ses-0.014
sed-0.014
 stem-0.013
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
mother+0.002
 Beth+0.002
reating+0.002
ock+0.002
 operation+0.002
Neg logit contributions
 fingers-0.002
 pulp-0.002
 finger-0.001
 stem-0.001
 toes-0.001
Token loved
Token ID6151
Feature activation-2.24e-02

Pos logit contributions
mother+0.098
ock+0.094
 Beth+0.094
ie+0.092
reating+0.092
Neg logit contributions
 pulp-0.120
 fingers-0.114
ses-0.110
 stem-0.108
sed-0.107
Token sun
Token ID4252
Feature activation-3.85e-02

Pos logit contributions
mother+0.037
ock+0.035
isky+0.034
 Beth+0.033
ty+0.033
Neg logit contributions
 pulp-0.045
 fingers-0.044
 stem-0.039
 toes-0.039
ses-0.038
Tokenfl
Token ID2704
Feature activation-4.86e-02

Pos logit contributions
mother+0.078
reating+0.077
ock+0.077
 Beth+0.076
isky+0.073
Neg logit contributions
 pulp-0.059
 fingers-0.058
 stem-0.053
ses-0.052
sed-0.050
Tokenowers
Token ID3618
Feature activation+7.64e-04

Pos logit contributions
mother+0.109
reating+0.108
 Beth+0.108
ock+0.107
 operation+0.103
Neg logit contributions
 pulp-0.064
 fingers-0.061
ses-0.056
 stem-0.055
sed-0.054
Token.
Token ID13
Feature activation-2.61e-02

Pos logit contributions
 fingers+0.001
 pulp+0.001
 stem+0.001
 finger+0.001
 toes+0.001
Neg logit contributions
reating-0.001
mother-0.001
 Beth-0.001
ock-0.001
isky-0.001
Token She
Token ID1375
Feature activation-3.03e-02

Pos logit contributions
mother+0.048
ock+0.045
ty+0.045
 operation+0.044
isky+0.044
Neg logit contributions
 pulp-0.045
 fingers-0.044
 stem-0.042
ses-0.040
 toes-0.039
Token liked
Token ID8288
Feature activation+1.20e-02

Pos logit contributions
mother+0.048
ie+0.048
 Beth+0.048
ock+0.047
ty+0.044
Neg logit contributions
 pulp-0.059
 fingers-0.057
ses-0.056
sed-0.054
 stem-0.053
Token to
Token ID284
Feature activation-1.77e-02

Pos logit contributions
 pulp+0.028
 fingers+0.027
 stem+0.025
ses+0.024
sed+0.024
Neg logit contributions
mother-0.016
ock-0.015
 Beth-0.015
reating-0.015
 operation-0.014
Token look
Token ID804
Feature activation-3.02e-02

Pos logit contributions
mother+0.032
ock+0.032
 Beth+0.032
reating+0.031
ie+0.031
Neg logit contributions
 pulp-0.030
 fingers-0.029
ses-0.028
sed-0.028
 stem-0.026
Token at
Token ID379
Feature activation+2.17e-02

Pos logit contributions
mother+0.053
 Beth+0.051
ock+0.051
reating+0.050
 operation+0.048
Neg logit contributions
 pulp-0.055
 fingers-0.054
 stem-0.050
ses-0.049
sed-0.048
Token them
Token ID606
Feature activation+6.62e-03

Pos logit contributions
 pulp+0.044
 fingers+0.043
 stem+0.040
ses+0.039
sed+0.038
Neg logit contributions
mother-0.034
ock-0.033
 Beth-0.033
reating-0.033
 operation-0.031
Token he
Token ID339
Feature activation+1.99e-02

Pos logit contributions
 fingers+0.028
 pulp+0.027
 toes+0.024
 finger+0.023
 stem+0.023
Neg logit contributions
mother-0.037
ock-0.033
ty-0.033
isky-0.033
 operation-0.032
Token was
Token ID373
Feature activation-1.14e-04

Pos logit contributions
 fingers+0.032
 pulp+0.031
ses+0.029
sed+0.029
 finger+0.029
Neg logit contributions
reating-0.041
mother-0.039
ock-0.038
 Beth-0.037
 operation-0.035
Token having
Token ID1719
Feature activation+3.68e-02

Pos logit contributions
mother+0.000
ock+0.000
reating+0.000
 Beth+0.000
ie+0.000
Neg logit contributions
 pulp-0.000
 fingers-0.000
ses-0.000
 stem-0.000
 finger-0.000
Token too
Token ID1165
Feature activation+3.52e-03

Pos logit contributions
 fingers+0.066
 pulp+0.057
 toes+0.057
 tongue+0.055
 hands+0.055
Neg logit contributions
mother-0.073
ock-0.067
ie-0.067
isky-0.066
reating-0.065
Token much
Token ID881
Feature activation-1.84e-02

Pos logit contributions
 pulp+0.009
ses+0.009
sed+0.009
 stem+0.008
 fingers+0.008
Neg logit contributions
ock-0.004
 Beth-0.004
reating-0.003
mother-0.003
ie-0.003
Token fun
Token ID1257
Feature activation-8.43e-02

Pos logit contributions
mother+0.027
 Beth+0.026
ock+0.025
reating+0.025
ie+0.025
Neg logit contributions
 pulp-0.039
 fingers-0.038
ses-0.036
 stem-0.036
sed-0.035
Token playing
Token ID2712
Feature activation-1.24e-02

Pos logit contributions
mother+0.150
ock+0.142
 Beth+0.141
reating+0.140
isky+0.135
Neg logit contributions
 fingers-0.152
 pulp-0.146
 toes-0.136
ses-0.135
 stem-0.133
Token with
Token ID351
Feature activation+1.09e-02

Pos logit contributions
mother+0.028
reating+0.026
ock+0.026
isky+0.026
 Beth+0.025
Neg logit contributions
 fingers-0.017
 pulp-0.016
 toes-0.015
 finger-0.015
 stem-0.014
Token them
Token ID606
Feature activation+1.09e-02

Pos logit contributions
 fingers+0.024
 pulp+0.023
 toes+0.021
 stem+0.021
 finger+0.021
Neg logit contributions
mother-0.016
reating-0.015
ock-0.015
isky-0.014
 Beth-0.014
Token.
Token ID13
Feature activation-8.44e-03

Pos logit contributions
 fingers+0.018
 toes+0.016
 pulp+0.015
 finger+0.015
 tongue+0.014
Neg logit contributions
mother-0.023
reating-0.022
ad-0.021
ock-0.021
 Beth-0.021
Token
Token ID220
Feature activation-9.75e-03

Pos logit contributions
 Beth+0.013
mother+0.013
ock+0.012
reating+0.012
ie+0.012
Neg logit contributions
 pulp-0.018
 fingers-0.017
ses-0.016
 stem-0.016
sed-0.016
Token they
Token ID484
Feature activation+1.79e-02

Pos logit contributions
 pulp+0.027
 fingers+0.026
ses+0.025
 stem+0.024
sed+0.024
Neg logit contributions
mother-0.026
ock-0.026
reating-0.026
 Beth-0.025
ie-0.024
Token had
Token ID550
Feature activation+4.57e-02

Pos logit contributions
 pulp+0.031
 fingers+0.030
ses+0.028
 stem+0.028
sed+0.028
Neg logit contributions
mother-0.033
reating-0.033
ock-0.033
 Beth-0.032
 operation-0.031
Token a
Token ID257
Feature activation+5.00e-03

Pos logit contributions
 pulp+0.095
 fingers+0.093
ses+0.088
 stem+0.086
sed+0.086
Neg logit contributions
mother-0.072
ock-0.069
reating-0.069
 Beth-0.068
ie-0.066
Token lot
Token ID1256
Feature activation-3.63e-02

Pos logit contributions
 pulp+0.010
 fingers+0.009
ses+0.009
 stem+0.009
sed+0.008
Neg logit contributions
mother-0.008
 Beth-0.008
ock-0.008
reating-0.008
 operation-0.008
Token of
Token ID286
Feature activation-4.52e-02

Pos logit contributions
mother+0.046
 Beth+0.044
ie+0.040
reating+0.040
 operation+0.039
Neg logit contributions
 pulp-0.084
 fingers-0.083
 stem-0.079
ses-0.078
 finger-0.076
Token fun
Token ID1257
Feature activation-8.39e-02

Pos logit contributions
mother+0.060
ock+0.058
 Beth+0.057
reating+0.057
ie+0.054
Neg logit contributions
 pulp-0.103
 fingers-0.100
ses-0.096
 stem-0.095
sed-0.094
Token.
Token ID13
Feature activation-4.59e-03

Pos logit contributions
mother+0.106
ock+0.105
ie+0.103
reating+0.103
 Beth+0.103
Neg logit contributions
 pulp-0.195
ses-0.182
 fingers-0.181
sed-0.177
 stem-0.176
Token The
Token ID383
Feature activation-4.31e-03

Pos logit contributions
 Beth+0.007
mother+0.007
ock+0.007
reating+0.007
ie+0.007
Neg logit contributions
 pulp-0.009
 fingers-0.009
 stem-0.008
ses-0.008
sed-0.008
Token pumpkin
Token ID30089
Feature activation+5.35e-02

Pos logit contributions
mother+0.007
ie+0.007
 operation+0.007
ock+0.007
ad+0.006
Neg logit contributions
ses-0.008
plets-0.008
sed-0.007
 fingers-0.007
ifying-0.007
Token was
Token ID373
Feature activation-2.80e-04

Pos logit contributions
 pulp+0.100
 fingers+0.097
 stem+0.091
ses+0.090
sed+0.089
Neg logit contributions
reating-0.093
 Beth-0.091
mother-0.090
ock-0.090
 operation-0.083
Token stubborn
Token ID24444
Feature activation+3.83e-02

Pos logit contributions
mother+0.000
 Beth+0.000
ock+0.000
reating+0.000
ie+0.000
Neg logit contributions
 pulp-0.001
 fingers-0.001
ses-0.001
 stem-0.000
sed-0.000
Token were
Token ID547
Feature activation+5.20e-03

Pos logit contributions
 pulp+0.013
 fingers+0.013
ses+0.012
 stem+0.012
sed+0.011
Neg logit contributions
mother-0.014
reating-0.014
ock-0.014
 Beth-0.014
ie-0.013
Token happy
Token ID3772
Feature activation+2.05e-02

Pos logit contributions
 pulp+0.011
 fingers+0.011
ses+0.011
 stem+0.010
sed+0.010
Neg logit contributions
mother-0.007
ock-0.007
reating-0.007
 Beth-0.007
ie-0.007
Token they
Token ID484
Feature activation-7.84e-03

Pos logit contributions
 fingers+0.038
 pulp+0.038
ses+0.034
 stem+0.034
 toes+0.034
Neg logit contributions
ock-0.035
mother-0.035
reating-0.034
 operation-0.033
 Beth-0.033
Token tried
Token ID3088
Feature activation+4.97e-03

Pos logit contributions
mother+0.015
reating+0.015
ock+0.015
 Beth+0.015
 operation+0.014
Neg logit contributions
 pulp-0.013
 fingers-0.012
ses-0.012
 stem-0.011
sed-0.011
Token something
Token ID1223
Feature activation-3.65e-03

Pos logit contributions
 fingers+0.009
 pulp+0.009
 toes+0.008
 tongue+0.008
 finger+0.008
Neg logit contributions
mother-0.009
ock-0.009
reating-0.008
isky-0.008
 Beth-0.008
Token new
Token ID649
Feature activation-8.97e-02

Pos logit contributions
mother+0.007
reating+0.007
ock+0.006
ad+0.006
 Beth+0.006
Neg logit contributions
 fingers-0.006
 pulp-0.006
 stem-0.006
 finger-0.006
 toes-0.006
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
reating+0.174
mother+0.170
ock+0.170
 Beth+0.169
 operation+0.161
Neg logit contributions
 fingers-0.145
 pulp-0.130
 toes-0.128
 finger-0.126
ses-0.125
Token Sally
Token ID25737
Feature activation+1.34e-02

Pos logit contributions
 Beth+0.002
mother+0.001
ock+0.001
ie+0.001
ad+0.001
Neg logit contributions
 pulp-0.002
 fingers-0.002
ses-0.002
sed-0.002
 stem-0.002
Token also
Token ID635
Feature activation+1.03e-02

Pos logit contributions
 pulp+0.022
 fingers+0.021
 stem+0.020
ses+0.020
sed+0.019
Neg logit contributions
reating-0.027
ock-0.026
mother-0.025
 Beth-0.025
 operation-0.024
Token felt
Token ID2936
Feature activation+1.77e-02

Pos logit contributions
 pulp+0.020
 fingers+0.019
ses+0.018
 stem+0.018
sed+0.017
Neg logit contributions
mother-0.017
 Beth-0.017
ock-0.016
ie-0.016
ad-0.015
Token good
Token ID922
Feature activation-2.92e-02

Pos logit contributions
 pulp+0.036
 fingers+0.035
ses+0.034
sed+0.033
 stem+0.033
Neg logit contributions
mother-0.027
 Beth-0.027
ock-0.026
reating-0.026
ie-0.025
Token excited
Token ID6568
Feature activation+2.61e-02

Pos logit contributions
 pulp+0.029
 fingers+0.028
ses+0.027
 stem+0.026
sed+0.026
Neg logit contributions
mother-0.024
 Beth-0.024
ock-0.024
reating-0.024
ie-0.023
Token to
Token ID284
Feature activation-2.77e-02

Pos logit contributions
 fingers+0.047
 pulp+0.044
 toes+0.042
 stem+0.041
ses+0.041
Neg logit contributions
mother-0.046
 Beth-0.045
reating-0.043
ock-0.043
 operation-0.042
Token explore
Token ID7301
Feature activation-7.57e-03

Pos logit contributions
mother+0.052
 Beth+0.052
reating+0.052
ock+0.052
 operation+0.050
Neg logit contributions
 pulp-0.048
 fingers-0.043
ses-0.043
sed-0.042
 stem-0.040
Token and
Token ID290
Feature activation-7.84e-03

Pos logit contributions
mother+0.017
 operation+0.015
reating+0.015
ussy+0.015
isky+0.015
Neg logit contributions
 fingers-0.010
 palm-0.008
 toes-0.008
 pulp-0.008
 stem-0.008
Token learn
Token ID2193
Feature activation-2.78e-02

Pos logit contributions
 Beth+0.013
mother+0.013
reating+0.013
ock+0.012
ie+0.012
Neg logit contributions
 pulp-0.015
 fingers-0.014
ses-0.014
sed-0.013
 stem-0.013
Token new
Token ID649
Feature activation-9.43e-02

Pos logit contributions
mother+0.053
ie+0.048
ad+0.048
father+0.048
ock+0.046
Neg logit contributions
 fingers-0.044
 toes-0.038
 palm-0.037
 finger-0.037
 palms-0.037
Token things
Token ID1243
Feature activation-3.45e-02

Pos logit contributions
mother+0.205
ock+0.198
 Beth+0.193
reating+0.191
isky+0.187
Neg logit contributions
 fingers-0.141
 toes-0.122
 pulp-0.117
 necks-0.112
ses-0.110
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
mother+0.083
 Beth+0.077
 operation+0.076
ock+0.075
father+0.075
Neg logit contributions
 fingers-0.036
 toes-0.030
 finger-0.027
 stem-0.025
 pulp-0.025
Token
Token ID220
Feature activation-1.38e-02

Pos logit contributions
mother+0.023
ock+0.022
reating+0.022
 Beth+0.021
ie+0.021
Neg logit contributions
 pulp-0.027
 fingers-0.026
ses-0.025
 stem-0.025
sed-0.024
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
mother+0.027
reating+0.027
ock+0.027
ie+0.026
 operation+0.026
Neg logit contributions
 pulp-0.021
 fingers-0.021
ses-0.019
sed-0.019
 stem-0.019
Token
Token ID198
Feature activation-6.67e-03

Pos logit contributions
mother+0.022
reating+0.021
ock+0.021
 Beth+0.020
 operation+0.020
Neg logit contributions
 pulp-0.017
 fingers-0.017
 stem-0.016
ses-0.016
sed-0.015
Token He
Token ID679
Feature activation+4.21e-03

Pos logit contributions
mother+0.025
 Beth+0.024
ock+0.024
reating+0.024
ie+0.023
Neg logit contributions
 pulp-0.023
 fingers-0.023
ses-0.021
 stem-0.021
sed-0.021
Token loved
Token ID6151
Feature activation-9.43e-03

Pos logit contributions
 pulp+0.008
 fingers+0.008
ses+0.008
 stem+0.007
sed+0.007
Neg logit contributions
mother-0.007
 Beth-0.007
ock-0.007
ie-0.007
reating-0.006
Token to
Token ID284
Feature activation-1.80e-02

Pos logit contributions
mother+0.013
ock+0.013
 Beth+0.012
reating+0.012
isky+0.012
Neg logit contributions
 pulp-0.021
 fingers-0.021
 stem-0.019
ses-0.019
sed-0.019
Token cook
Token ID4255
Feature activation-2.25e-02

Pos logit contributions
mother+0.035
 Beth+0.035
ock+0.035
reating+0.034
ie+0.034
Neg logit contributions
 pulp-0.028
 fingers-0.027
ses-0.027
sed-0.026
 stem-0.024
Token all
Token ID477
Feature activation-3.26e-02

Pos logit contributions
mother+0.041
 Beth+0.040
reating+0.039
ock+0.039
isky+0.037
Neg logit contributions
 pulp-0.040
 fingers-0.040
 stem-0.036
ses-0.035
 toes-0.035
Token sorts
Token ID10524
Feature activation-8.82e-02

Pos logit contributions
mother+0.046
 Beth+0.045
reating+0.044
ock+0.044
 operation+0.041
Neg logit contributions
 pulp-0.070
 fingers-0.068
 stem-0.064
ses-0.064
sed-0.062
Token of
Token ID286
Feature activation-5.67e-02

Pos logit contributions
mother+0.127
 Beth+0.123
 operation+0.122
reating+0.116
isky+0.110
Neg logit contributions
 fingers-0.186
 pulp-0.176
 toes-0.173
 hands-0.158
 finger-0.156
Token things
Token ID1243
Feature activation-3.05e-02

Pos logit contributions
mother+0.113
 Beth+0.108
reating+0.107
ock+0.107
isky+0.102
Neg logit contributions
 pulp-0.094
 fingers-0.092
 stem-0.082
 toes-0.080
 finger-0.078
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
mother+0.062
 Beth+0.059
ock+0.057
reating+0.057
isky+0.057
Neg logit contributions
 fingers-0.047
 pulp-0.046
 toes-0.042
 stem-0.041
 finger-0.040
Token One
Token ID1881
Feature activation+2.03e-02

Pos logit contributions
mother+0.018
reating+0.017
ock+0.017
isky+0.017
ie+0.017
Neg logit contributions
 fingers-0.019
 pulp-0.019
 stem-0.018
ses-0.018
 toes-0.017
Token day
Token ID1110
Feature activation+2.30e-02

Pos logit contributions
 pulp+0.054
ses+0.052
sed+0.051
 fingers+0.051
 stem+0.048
Neg logit contributions
ie-0.022
mother-0.020
ock-0.020
 operation-0.019
 Beth-0.019