TOP ACTIVATIONS
MAX = 0.122

twenty - eight , twenty - nine , thirty , thirty
nine , thirty , thirty - one , thirty - two
, Daddy , and three - year - old little Sus
thirty - one , thirty - two , thirty - three
mom my and her 3 - year - old daughter were
a mom and her 3 - year - old daughter decided
thirty - four , thirty - five , thirty - six
Dad and their little three - year - old daughter ,
John , a three - year - old , was
forty - four , forty - five , forty - six
thirty - seven , thirty - eight , thirty - nine
Mom my and her three - year - old daughter ,
thirty - two , thirty - three , thirty - four
forty - four , forty - five , forty - six
shouted Joe , a 3 - year - old boy .
a mummy and her three - year - old baby were
asked Billy , a three - year - old boy .
thirty - four , thirty - five , thirty - six
T eddy , a three - year - old , was
John , a 3 - year - old boy ,

MIN ACTIVATIONS
MIN = -0.068

the zoo to see the ostr ich . She observed it
were friends and lived near each other . The sun was
are careful . They look both ways . They cross the
<|endoftext|> The two players covered each other in the game .
around the house , entering each room to explore .
Why did you make me lick it ?" L
They bounce the ball to each other . They laugh and
asked Max , " Which leash do you want ?" Max
he saw a point far away . He wanted to shine
the big tank at the pet shop . They were lively
. The eagle sees the ostr ich and wants to catch
big pond . In the pond , there was a sw
was right . Eating different foods was important for her health
sees a big building far away . It is very tall
held hands . They looked both ways . They crossed the
dove , and they hung both pictures on the fridge .
. They liked to spray each other and make funny noises
y . Lily loved to feed Snow y carrots and lettuce
came to the stable to feed him . She brought a
bug . Jen wanted to feed the different insect , but

INTERVAL 0.074 - 0.122
CONTAINS 0.012%

. The three - year - old looked around
Soon , the door bell rang . Lily ran to
<|endoftext|> Ben was a 3 - year - old boy who
little girl who was three - years - old . She
A 3 - year - old toddler walked up to

INTERVAL 0.027 - 0.074
CONTAINS 3.397%

bird tried very hard and finally said , " eight three
landed in a messy pile of leaves .
It was so shiny and spark ly . She picked it
mistake . It jumped too high and bumped its head on
played with the doll all day and all night .

INTERVAL -0.021 - 0.027
CONTAINS 90.434%

wanted to catch the tomato . " I 'm going to
get it , she decided to take her mom 's flashlight
was going to the road . The road was
the mystery together . <|endoftext|> Once upon a time , there
Thanks , mom my !" " You 're welcome

INTERVAL -0.068 - -0.021
CONTAINS 6.157%

and Kim said they were sorry and hugged Dad . They
out of the ground ! Bob was so curious that he
saw something that made her happy : cel ery ! She
. <|endoftext|> Sam was a little boy who liked to imagine
hot soup . He was excited to see what it would
Token twenty
Token ID8208
Feature activation+7.94e-02

Pos logit contributions
br+0.030
sc+0.029
ch+0.028
bench+0.026
run+0.025
Neg logit contributions
ich-0.035
field-0.033
sters-0.032
anut-0.031
opsy-0.031
Token-
Token ID12
Feature activation+1.17e-01

Pos logit contributions
br+0.106
ch+0.106
 Ellie+0.104
sc+0.101
run+0.099
Neg logit contributions
ich-0.147
field-0.133
sters-0.129
ster-0.127
ese-0.124
Tokeneight
Token ID26022
Feature activation+4.60e-02

Pos logit contributions
br+0.162
 Ellie+0.156
sc+0.154
ch+0.154
 Bella+0.150
Neg logit contributions
ich-0.193
illa-0.188
anut-0.174
sters-0.160
ese-0.160
Token,
Token ID11
Feature activation+3.42e-02

Pos logit contributions
sc+0.067
ch+0.067
br+0.067
bench+0.059
 loud+0.059
Neg logit contributions
ich-0.081
sters-0.074
field-0.073
illa-0.070
our-0.070
Token twenty
Token ID8208
Feature activation+8.82e-02

Pos logit contributions
sc+0.051
br+0.048
ch+0.046
bench+0.044
run+0.043
Neg logit contributions
ich-0.059
field-0.055
sters-0.054
illa-0.052
ster-0.051
Token-
Token ID12
Feature activation+1.22e-01

Pos logit contributions
 Ellie+0.116
ch+0.113
sc+0.110
br+0.107
 Beth+0.104
Neg logit contributions
ich-0.161
sters-0.147
field-0.146
illa-0.143
ster-0.143
Tokennine
Token ID30888
Feature activation+6.75e-02

Pos logit contributions
br+0.179
sc+0.173
 Ellie+0.172
 tricky+0.169
 war+0.169
Neg logit contributions
illa-0.183
ich-0.182
anut-0.164
field-0.152
illas-0.148
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
ch+0.097
sc+0.095
br+0.095
 loud+0.093
 Ellie+0.091
Neg logit contributions
ich-0.109
our-0.104
sters-0.103
field-0.103
ster-0.100
Token thirty
Token ID12277
Feature activation+8.51e-02

Pos logit contributions
sc+0.055
br+0.051
ch+0.050
run+0.048
wn+0.046
Neg logit contributions
ich-0.065
field-0.059
sters-0.058
�-0.056
ster-0.056
Token,
Token ID11
Feature activation+4.05e-02

Pos logit contributions
 Ellie+0.105
sc+0.102
ch+0.098
 Beth+0.094
 Bella+0.092
Neg logit contributions
ich-0.163
sters-0.153
illa-0.150
ster-0.149
ese-0.149
Token thirty
Token ID12277
Feature activation+8.60e-02

Pos logit contributions
sc+0.061
br+0.056
ch+0.055
run+0.053
bench+0.051
Neg logit contributions
ich-0.072
field-0.065
sters-0.064
ster-0.062
�-0.062
Tokennine
Token ID30888
Feature activation+6.75e-02

Pos logit contributions
br+0.179
sc+0.173
 Ellie+0.172
 tricky+0.169
 war+0.169
Neg logit contributions
illa-0.183
ich-0.182
anut-0.164
field-0.152
illas-0.148
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
ch+0.097
sc+0.095
br+0.095
 loud+0.093
 Ellie+0.091
Neg logit contributions
ich-0.109
our-0.104
sters-0.103
field-0.103
ster-0.100
Token thirty
Token ID12277
Feature activation+8.51e-02

Pos logit contributions
sc+0.055
br+0.051
ch+0.050
run+0.048
wn+0.046
Neg logit contributions
ich-0.065
field-0.059
sters-0.058
�-0.056
ster-0.056
Token,
Token ID11
Feature activation+4.05e-02

Pos logit contributions
 Ellie+0.105
sc+0.102
ch+0.098
 Beth+0.094
 Bella+0.092
Neg logit contributions
ich-0.163
sters-0.153
illa-0.150
ster-0.149
ese-0.149
Token thirty
Token ID12277
Feature activation+8.60e-02

Pos logit contributions
sc+0.061
br+0.056
ch+0.055
run+0.053
bench+0.051
Neg logit contributions
ich-0.072
field-0.065
sters-0.064
ster-0.062
�-0.062
Token-
Token ID12
Feature activation+1.21e-01

Pos logit contributions
 Ellie+0.106
sc+0.102
ch+0.098
 Beth+0.094
 Bella+0.092
Neg logit contributions
ich-0.166
sters-0.156
illa-0.153
ster-0.152
ese-0.151
Tokenone
Token ID505
Feature activation+5.71e-02

Pos logit contributions
br+0.179
sc+0.179
 tricky+0.178
 Ellie+0.178
 scary+0.178
Neg logit contributions
illa-0.169
ich-0.169
anut-0.150
field-0.140
illas-0.139
Token,
Token ID11
Feature activation+4.31e-02

Pos logit contributions
sc+0.091
ch+0.086
 Ellie+0.086
br+0.083
B+0.080
Neg logit contributions
ich-0.085
field-0.078
our-0.078
illa-0.078
ster-0.076
Token thirty
Token ID12277
Feature activation+8.65e-02

Pos logit contributions
sc+0.066
br+0.059
ch+0.059
run+0.056
bench+0.056
Neg logit contributions
ich-0.076
sters-0.069
field-0.068
ster-0.066
illa-0.066
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
 Ellie+0.105
sc+0.101
ch+0.097
 Beth+0.091
br+0.091
Neg logit contributions
ich-0.170
sters-0.160
illa-0.156
ster-0.156
field-0.155
Tokentwo
Token ID11545
Feature activation+6.27e-02

Pos logit contributions
sc+0.177
 Ellie+0.177
br+0.177
 tricky+0.176
 scary+0.175
Neg logit contributions
ich-0.168
illa-0.168
anut-0.148
field-0.140
illas-0.138
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
ich+0.018
field+0.017
sters+0.016
ster+0.016
illas+0.016
Neg logit contributions
 Ellie-0.007
ch-0.007
br-0.007
 hard-0.007
 loud-0.007
Token Daddy
Token ID26926
Feature activation+1.82e-02

Pos logit contributions
br+0.017
ch+0.016
 Ellie+0.015
 tricky+0.015
run+0.015
Neg logit contributions
ich-0.041
field-0.038
sters-0.036
ster-0.036
ese-0.035
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
br+0.017
 Ellie+0.017
ch+0.017
 hard+0.016
 tricky+0.015
Neg logit contributions
ich-0.042
field-0.038
sters-0.037
ster-0.036
illas-0.035
Token and
Token ID290
Feature activation+4.00e-02

Pos logit contributions
br+0.019
ch+0.018
 Ellie+0.017
 tricky+0.017
run+0.017
Neg logit contributions
ich-0.034
field-0.031
sters-0.030
ster-0.030
ese-0.029
Token three
Token ID1115
Feature activation+5.91e-02

Pos logit contributions
br+0.058
sc+0.054
ch+0.054
run+0.053
 tricky+0.052
Neg logit contributions
ich-0.074
field-0.068
ster-0.065
sters-0.064
opsy-0.063
Token-
Token ID12
Feature activation+1.20e-01

Pos logit contributions
br+0.059
 loud+0.058
ch+0.056
 hard+0.055
 much+0.054
Neg logit contributions
ich-0.134
field-0.127
ster-0.117
sters-0.114
illas-0.113
Tokenyear
Token ID1941
Feature activation+1.05e-01

Pos logit contributions
br+0.149
 tricky+0.145
 complicated+0.136
 loud+0.131
sc+0.131
Neg logit contributions
ich-0.229
illa-0.219
anut-0.212
ese-0.205
sters-0.199
Token-
Token ID12
Feature activation+1.05e-01

Pos logit contributions
sc+0.107
br+0.105
run+0.103
 Ellie+0.103
ch+0.103
Neg logit contributions
ich-0.232
sters-0.209
ster-0.207
ese-0.205
field-0.200
Tokenold
Token ID727
Feature activation+3.23e-02

Pos logit contributions
br+0.157
 tricky+0.154
sc+0.147
 complicated+0.145
 loud+0.145
Neg logit contributions
ich-0.173
anut-0.162
illa-0.160
ese-0.159
sters-0.152
Token little
Token ID1310
Feature activation+1.53e-02

Pos logit contributions
br+0.058
ch+0.056
 tricky+0.054
 Ellie+0.054
run+0.054
Neg logit contributions
ich-0.048
field-0.042
sters-0.040
ster-0.040
ese-0.038
Token Sus
Token ID8932
Feature activation+3.22e-02

Pos logit contributions
br+0.028
ch+0.026
sc+0.026
run+0.026
 tricky+0.025
Neg logit contributions
ich-0.022
sters-0.019
ster-0.018
field-0.018
ese-0.018
Token thirty
Token ID12277
Feature activation+8.60e-02

Pos logit contributions
sc+0.061
br+0.056
ch+0.055
run+0.053
bench+0.051
Neg logit contributions
ich-0.072
field-0.065
sters-0.064
ster-0.062
�-0.062
Token-
Token ID12
Feature activation+1.21e-01

Pos logit contributions
 Ellie+0.106
sc+0.102
ch+0.098
 Beth+0.094
 Bella+0.092
Neg logit contributions
ich-0.166
sters-0.156
illa-0.153
ster-0.152
ese-0.151
Tokenone
Token ID505
Feature activation+5.71e-02

Pos logit contributions
br+0.179
sc+0.179
 tricky+0.178
 Ellie+0.178
 scary+0.178
Neg logit contributions
illa-0.169
ich-0.169
anut-0.150
field-0.140
illas-0.139
Token,
Token ID11
Feature activation+4.31e-02

Pos logit contributions
sc+0.091
ch+0.086
 Ellie+0.086
br+0.083
B+0.080
Neg logit contributions
ich-0.085
field-0.078
our-0.078
illa-0.078
ster-0.076
Token thirty
Token ID12277
Feature activation+8.65e-02

Pos logit contributions
sc+0.066
br+0.059
ch+0.059
run+0.056
bench+0.056
Neg logit contributions
ich-0.076
sters-0.069
field-0.068
ster-0.066
illa-0.066
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
 Ellie+0.105
sc+0.101
ch+0.097
 Beth+0.091
br+0.091
Neg logit contributions
ich-0.170
sters-0.160
illa-0.156
ster-0.156
field-0.155
Tokentwo
Token ID11545
Feature activation+6.27e-02

Pos logit contributions
sc+0.177
 Ellie+0.177
br+0.177
 tricky+0.176
 scary+0.175
Neg logit contributions
ich-0.168
illa-0.168
anut-0.148
field-0.140
illas-0.138
Token,
Token ID11
Feature activation+3.92e-02

Pos logit contributions
 Ellie+0.091
sc+0.091
ch+0.090
br+0.086
bench+0.082
Neg logit contributions
ich-0.107
sters-0.099
field-0.098
our-0.096
ster-0.095
Token thirty
Token ID12277
Feature activation+8.14e-02

Pos logit contributions
sc+0.060
ch+0.055
br+0.054
run+0.051
bench+0.051
Neg logit contributions
ich-0.070
sters-0.063
field-0.063
ster-0.061
illa-0.061
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
 Ellie+0.093
sc+0.091
ch+0.088
br+0.083
 Beth+0.081
Neg logit contributions
ich-0.167
sters-0.157
ster-0.153
field-0.152
illa-0.152
Tokenthree
Token ID15542
Feature activation+7.17e-02

Pos logit contributions
 Ellie+0.167
sc+0.164
br+0.164
 tricky+0.162
 Bella+0.162
Neg logit contributions
ich-0.164
illa-0.157
anut-0.144
field-0.137
sters-0.136
Token mom
Token ID1995
Feature activation+6.12e-03

Pos logit contributions
br+0.011
ch+0.010
 Ellie+0.010
 tricky+0.010
run+0.010
Neg logit contributions
ich-0.028
field-0.026
sters-0.024
ster-0.024
ese-0.024
Tokenmy
Token ID1820
Feature activation+3.41e-02

Pos logit contributions
ch+0.007
br+0.007
 Ellie+0.007
 hard+0.006
 loud+0.006
Neg logit contributions
ich-0.013
field-0.012
sters-0.012
ster-0.011
illas-0.011
Token and
Token ID290
Feature activation+3.35e-02

Pos logit contributions
ch+0.029
br+0.029
 Ellie+0.027
 loud+0.025
 hard+0.025
Neg logit contributions
ich-0.081
field-0.077
sters-0.074
ster-0.072
illas-0.071
Token her
Token ID607
Feature activation+2.75e-02

Pos logit contributions
br+0.030
ch+0.028
sc+0.027
run+0.027
 tricky+0.026
Neg logit contributions
ich-0.079
field-0.072
ster-0.070
sters-0.070
ese-0.068
Token 3
Token ID513
Feature activation+5.30e-02

Pos logit contributions
br+0.031
ch+0.030
sc+0.029
run+0.029
 Ellie+0.029
Neg logit contributions
ich-0.059
field-0.052
ster-0.052
sters-0.051
ese-0.050
Token-
Token ID12
Feature activation+1.14e-01

Pos logit contributions
br+0.063
ch+0.059
 Ellie+0.056
 tricky+0.056
sc+0.056
Neg logit contributions
ich-0.110
field-0.098
ese-0.096
sters-0.096
ster-0.096
Tokenyear
Token ID1941
Feature activation+1.02e-01

Pos logit contributions
br+0.126
 tricky+0.115
 complicated+0.107
ch+0.105
 loud+0.104
Neg logit contributions
ich-0.243
illa-0.223
anut-0.214
ster-0.212
ese-0.210
Token-
Token ID12
Feature activation+1.01e-01

Pos logit contributions
br+0.103
sc+0.102
ch+0.101
 Ellie+0.099
run+0.098
Neg logit contributions
ich-0.228
ster-0.205
sters-0.203
field-0.199
ese-0.198
Tokenold
Token ID727
Feature activation+2.81e-02

Pos logit contributions
br+0.144
 tricky+0.135
sc+0.130
ch+0.130
 complicated+0.127
Neg logit contributions
ich-0.180
illa-0.164
anut-0.160
ese-0.157
sters-0.155
Token daughter
Token ID4957
Feature activation+1.18e-02

Pos logit contributions
br+0.048
ch+0.046
run+0.044
 tricky+0.044
sc+0.044
Neg logit contributions
ich-0.044
field-0.038
ster-0.038
sters-0.037
ese-0.036
Token were
Token ID547
Feature activation+2.58e-02

Pos logit contributions
br+0.014
ch+0.013
 tricky+0.013
run+0.013
sc+0.012
Neg logit contributions
ich-0.025
field-0.023
ster-0.022
sters-0.021
ese-0.021
Token a
Token ID257
Feature activation+8.87e-03

Pos logit contributions
ich+0.005
field+0.004
ster+0.004
sters+0.004
illa+0.004
Neg logit contributions
br-0.001
ch-0.001
sc-0.001
run-0.001
 tricky-0.001
Token mom
Token ID1995
Feature activation+6.97e-03

Pos logit contributions
br+0.008
ch+0.008
 Ellie+0.007
 tricky+0.007
run+0.007
Neg logit contributions
ich-0.021
field-0.019
sters-0.019
ster-0.018
ese-0.018
Token and
Token ID290
Feature activation+3.42e-02

Pos logit contributions
br+0.009
ch+0.009
 Ellie+0.009
 tricky+0.008
 hard+0.008
Neg logit contributions
ich-0.014
field-0.013
sters-0.012
ster-0.012
ese-0.011
Token her
Token ID607
Feature activation+2.33e-02

Pos logit contributions
br+0.033
ch+0.031
sc+0.030
run+0.030
 tricky+0.029
Neg logit contributions
ich-0.079
field-0.072
ster-0.070
sters-0.070
ese-0.068
Token 3
Token ID513
Feature activation+5.37e-02

Pos logit contributions
br+0.026
ch+0.025
 Ellie+0.024
 tricky+0.023
run+0.023
Neg logit contributions
ich-0.051
field-0.046
ster-0.044
sters-0.044
ese-0.043
Token-
Token ID12
Feature activation+1.14e-01

Pos logit contributions
br+0.066
ch+0.062
 Ellie+0.059
sc+0.059
 tricky+0.058
Neg logit contributions
ich-0.110
field-0.097
ese-0.097
sters-0.096
ster-0.096
Tokenyear
Token ID1941
Feature activation+1.02e-01

Pos logit contributions
br+0.136
 tricky+0.125
 complicated+0.118
sc+0.117
ch+0.117
Neg logit contributions
ich-0.236
illa-0.218
anut-0.210
ster-0.204
ese-0.204
Token-
Token ID12
Feature activation+1.00e-01

Pos logit contributions
sc+0.105
br+0.103
ch+0.103
run+0.102
 Ellie+0.099
Neg logit contributions
ich-0.228
ster-0.203
sters-0.201
field-0.197
ese-0.196
Tokenold
Token ID727
Feature activation+2.99e-02

Pos logit contributions
br+0.151
 tricky+0.142
sc+0.141
ch+0.141
run+0.136
Neg logit contributions
ich-0.175
illa-0.158
anut-0.154
ese-0.151
sters-0.150
Token daughter
Token ID4957
Feature activation+1.30e-02

Pos logit contributions
br+0.055
ch+0.052
 tricky+0.051
run+0.050
sc+0.050
Neg logit contributions
ich-0.044
field-0.037
ster-0.037
sters-0.037
ese-0.035
Token decided
Token ID3066
Feature activation+2.89e-03

Pos logit contributions
br+0.017
ch+0.016
 tricky+0.016
run+0.016
sc+0.015
Neg logit contributions
ich-0.025
field-0.023
ster-0.022
sters-0.022
ese-0.021
Token thirty
Token ID12277
Feature activation+4.68e-02

Pos logit contributions
ch+0.008
br+0.008
sc+0.008
 tricky+0.008
run+0.008
Neg logit contributions
ich-0.008
sters-0.007
field-0.007
ster-0.007
ese-0.007
Token-
Token ID12
Feature activation+9.26e-02

Pos logit contributions
ch+0.045
br+0.045
 Ellie+0.044
sc+0.043
run+0.041
Neg logit contributions
ich-0.105
field-0.096
sters-0.096
ster-0.095
ese-0.092
Tokenfour
Token ID14337
Feature activation+3.93e-02

Pos logit contributions
 Ellie+0.170
 loud+0.169
 tricky+0.169
run+0.166
 scary+0.165
Neg logit contributions
ich-0.116
illa-0.099
ster-0.097
sters-0.096
anut-0.093
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
br+0.045
ch+0.045
 Ellie+0.044
run+0.042
sc+0.042
Neg logit contributions
ich-0.080
field-0.074
sters-0.071
ster-0.071
ese-0.069
Token thirty
Token ID12277
Feature activation+7.60e-02

Pos logit contributions
ch+0.039
br+0.039
sc+0.039
 Ellie+0.038
run+0.038
Neg logit contributions
ich-0.041
field-0.037
sters-0.036
ster-0.036
ese-0.034
Token-
Token ID12
Feature activation+1.12e-01

Pos logit contributions
br+0.062
ch+0.061
 Ellie+0.059
sc+0.056
run+0.056
Neg logit contributions
ich-0.182
field-0.168
sters-0.165
ster-0.164
ese-0.160
Tokenfive
Token ID13261
Feature activation+2.01e-02

Pos logit contributions
 tricky+0.212
 Ellie+0.210
 loud+0.206
run+0.205
 confusing+0.203
Neg logit contributions
ich-0.135
illa-0.115
ster-0.113
sters-0.109
anut-0.109
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
br+0.025
ch+0.025
 Ellie+0.024
sc+0.024
 tricky+0.023
Neg logit contributions
ich-0.040
field-0.036
sters-0.035
ster-0.035
ese-0.034
Token thirty
Token ID12277
Feature activation+6.09e-02

Pos logit contributions
ch+0.023
br+0.023
run+0.023
sc+0.023
 Ellie+0.023
Neg logit contributions
ich-0.027
field-0.024
sters-0.023
ster-0.023
ese-0.022
Token-
Token ID12
Feature activation+9.88e-02

Pos logit contributions
br+0.050
ch+0.050
 Ellie+0.048
sc+0.046
run+0.046
Neg logit contributions
ich-0.146
field-0.135
sters-0.132
ster-0.131
ese-0.128
Tokensix
Token ID19412
Feature activation+3.78e-03

Pos logit contributions
 tricky+0.187
 Ellie+0.186
 loud+0.184
run+0.183
br+0.181
Neg logit contributions
ich-0.121
ster-0.100
illa-0.100
sters-0.098
field-0.096
Token Dad
Token ID17415
Feature activation+9.43e-03

Pos logit contributions
br+0.025
ch+0.024
run+0.023
sc+0.023
 tricky+0.023
Neg logit contributions
ich-0.035
field-0.031
ster-0.030
sters-0.030
ese-0.028
Token and
Token ID290
Feature activation+4.19e-02

Pos logit contributions
br+0.010
ch+0.009
 Ellie+0.009
 tricky+0.009
 hard+0.009
Neg logit contributions
ich-0.021
field-0.019
sters-0.019
ster-0.018
ese-0.018
Token their
Token ID511
Feature activation+3.45e-02

Pos logit contributions
br+0.060
 Ellie+0.059
ch+0.059
 loud+0.058
 hard+0.058
Neg logit contributions
ich-0.077
field-0.069
sters-0.065
ster-0.064
illas-0.063
Token little
Token ID1310
Feature activation+1.30e-04

Pos logit contributions
br+0.052
ch+0.050
run+0.048
 Ellie+0.048
sc+0.047
Neg logit contributions
ich-0.062
field-0.056
ster-0.053
sters-0.053
ese-0.051
Token three
Token ID1115
Feature activation+5.44e-02

Pos logit contributions
br+0.000
ch+0.000
 Ellie+0.000
run+0.000
 tricky+0.000
Neg logit contributions
ich-0.000
field-0.000
sters-0.000
ster-0.000
ese-0.000
Token-
Token ID12
Feature activation+1.12e-01

Pos logit contributions
br+0.052
 loud+0.051
 hard+0.050
ch+0.049
 tricky+0.049
Neg logit contributions
ich-0.126
field-0.118
ster-0.109
sters-0.107
illas-0.105
Tokenyear
Token ID1941
Feature activation+1.00e-01

Pos logit contributions
br+0.128
 tricky+0.120
ch+0.110
 complicated+0.109
 loud+0.108
Neg logit contributions
ich-0.231
illa-0.215
anut-0.208
sters-0.203
ster-0.202
Token-
Token ID12
Feature activation+1.00e-01

Pos logit contributions
br+0.094
ch+0.091
 Ellie+0.088
sc+0.087
run+0.085
Neg logit contributions
ich-0.234
sters-0.211
ster-0.210
field-0.210
ese-0.204
Tokenold
Token ID727
Feature activation+2.61e-02

Pos logit contributions
br+0.138
 tricky+0.130
ch+0.127
run+0.123
sc+0.123
Neg logit contributions
ich-0.182
illa-0.165
anut-0.164
ese-0.161
sters-0.160
Token daughter
Token ID4957
Feature activation+1.25e-02

Pos logit contributions
br+0.044
ch+0.043
 Ellie+0.041
 tricky+0.041
run+0.041
Neg logit contributions
ich-0.041
field-0.037
sters-0.034
ster-0.034
ese-0.033
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
br+0.014
ch+0.013
 Ellie+0.013
 tricky+0.013
run+0.013
Neg logit contributions
ich-0.027
field-0.024
sters-0.023
ster-0.023
ese-0.023
Token
Token ID198
Feature activation-1.33e-03

Pos logit contributions
br+0.008
ch+0.008
 Ellie+0.007
 tricky+0.007
run+0.007
Neg logit contributions
ich-0.014
field-0.013
sters-0.012
ster-0.012
ese-0.012
TokenJohn
Token ID7554
Feature activation+8.39e-03

Pos logit contributions
ich+0.002
field+0.002
sters+0.002
ster+0.002
ese+0.002
Neg logit contributions
br-0.002
ch-0.002
 Ellie-0.002
 tricky-0.002
 loud-0.002
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
br+0.011
ch+0.011
 Ellie+0.011
run+0.010
 tricky+0.010
Neg logit contributions
ich-0.016
field-0.015
ster-0.014
sters-0.014
ese-0.013
Token a
Token ID257
Feature activation+2.70e-02

Pos logit contributions
br+0.017
ch+0.016
sc+0.015
 tricky+0.015
run+0.015
Neg logit contributions
ich-0.020
field-0.018
ster-0.017
sters-0.017
illa-0.017
Token three
Token ID1115
Feature activation+5.14e-02

Pos logit contributions
sc+0.040
br+0.039
run+0.038
 Ellie+0.037
ch+0.036
Neg logit contributions
ich-0.051
ster-0.044
field-0.043
opsy-0.040
�-0.039
Token-
Token ID12
Feature activation+1.11e-01

Pos logit contributions
 Ellie+0.072
br+0.071
ch+0.068
sc+0.066
run+0.064
Neg logit contributions
ich-0.089
sters-0.082
ster-0.080
ese-0.080
field-0.076
Tokenyear
Token ID1941
Feature activation+1.00e-01

Pos logit contributions
br+0.148
 tricky+0.146
 baths+0.138
 impossible+0.138
 Bella+0.136
Neg logit contributions
ich-0.187
illa-0.180
anut-0.165
ese-0.163
ster-0.158
Token-
Token ID12
Feature activation+9.59e-02

Pos logit contributions
sc+0.122
 Ellie+0.121
ch+0.118
br+0.118
run+0.116
Neg logit contributions
ich-0.199
ster-0.183
sters-0.175
ese-0.175
field-0.168
Tokenold
Token ID727
Feature activation+2.66e-02

Pos logit contributions
br+0.151
 tricky+0.145
ch+0.141
 Ellie+0.140
 Bella+0.140
Neg logit contributions
ich-0.150
illa-0.136
ese-0.129
anut-0.129
ster-0.126
Token,
Token ID11
Feature activation+5.08e-03

Pos logit contributions
br+0.052
ch+0.048
 tricky+0.048
sc+0.047
 complicated+0.047
Neg logit contributions
ich-0.036
ster-0.033
sters-0.031
ese-0.030
illas-0.029
Token was
Token ID373
Feature activation+1.42e-02

Pos logit contributions
br+0.010
igsaw+0.009
ch+0.009
sc+0.009
run+0.008
Neg logit contributions
ich-0.007
field-0.006
our-0.006
sters-0.006
 porch-0.006
Token forty
Token ID16571
Feature activation+2.94e-02

Pos logit contributions
ch+0.007
br+0.007
 tricky+0.007
sc+0.007
run+0.007
Neg logit contributions
ich-0.007
field-0.006
sters-0.006
ster-0.006
illa-0.006
Token-
Token ID12
Feature activation+9.02e-02

Pos logit contributions
br+0.023
ch+0.022
 Ellie+0.021
 tricky+0.020
run+0.020
Neg logit contributions
ich-0.073
field-0.067
sters-0.065
ster-0.065
ese-0.063
Tokenfour
Token ID14337
Feature activation+3.85e-02

Pos logit contributions
 loud+0.169
 tricky+0.168
 Ellie+0.167
 scary+0.164
run+0.164
Neg logit contributions
ich-0.108
illa-0.092
ster-0.090
sters-0.088
anut-0.086
Token,
Token ID11
Feature activation+2.43e-02

Pos logit contributions
br+0.045
ch+0.045
 Ellie+0.044
sc+0.042
run+0.042
Neg logit contributions
ich-0.077
field-0.071
sters-0.069
ster-0.069
ese-0.067
Token forty
Token ID16571
Feature activation+5.70e-02

Pos logit contributions
ch+0.037
br+0.036
 Ellie+0.036
sc+0.036
run+0.036
Neg logit contributions
ich-0.039
field-0.035
sters-0.035
ster-0.034
ese-0.032
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
br+0.040
ch+0.036
 tricky+0.033
 Ellie+0.033
run+0.033
Neg logit contributions
ich-0.147
field-0.137
sters-0.130
ster-0.130
ese-0.127
Tokenfive
Token ID13261
Feature activation+1.88e-02

Pos logit contributions
 tricky+0.212
 Ellie+0.210
 loud+0.207
run+0.204
 confusing+0.203
Neg logit contributions
ich-0.128
illa-0.109
ster-0.107
sters-0.103
anut-0.102
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
br+0.025
ch+0.024
 Ellie+0.024
sc+0.024
 tricky+0.023
Neg logit contributions
ich-0.036
field-0.032
sters-0.032
ster-0.032
ese-0.031
Token forty
Token ID16571
Feature activation+4.15e-02

Pos logit contributions
ch+0.021
br+0.021
run+0.021
 Ellie+0.021
sc+0.021
Neg logit contributions
ich-0.024
field-0.022
sters-0.021
ster-0.021
ese-0.020
Token-
Token ID12
Feature activation+9.78e-02

Pos logit contributions
br+0.029
ch+0.026
 tricky+0.024
 Ellie+0.024
run+0.024
Neg logit contributions
ich-0.107
field-0.100
sters-0.095
ster-0.095
illas-0.093
Tokensix
Token ID19412
Feature activation+2.51e-03

Pos logit contributions
 tricky+0.190
 Ellie+0.187
 loud+0.186
run+0.184
br+0.182
Neg logit contributions
ich-0.114
illa-0.095
ster-0.094
sters-0.092
field-0.090
Token thirty
Token ID12277
Feature activation+5.74e-02

Pos logit contributions
sc+0.010
ch+0.010
br+0.010
run+0.009
bench+0.009
Neg logit contributions
ich-0.012
field-0.011
sters-0.011
ster-0.010
illa-0.010
Token-
Token ID12
Feature activation+9.20e-02

Pos logit contributions
 Ellie+0.052
ch+0.052
br+0.051
sc+0.051
run+0.046
Neg logit contributions
ich-0.132
field-0.122
sters-0.120
ster-0.120
illa-0.117
Tokenseven
Token ID26548
Feature activation+3.27e-02

Pos logit contributions
 Ellie+0.158
br+0.156
 tricky+0.155
sc+0.153
 loud+0.153
Neg logit contributions
ich-0.132
illa-0.114
ster-0.108
sters-0.107
anut-0.105
Token,
Token ID11
Feature activation+2.82e-02

Pos logit contributions
ch+0.045
 Ellie+0.045
sc+0.044
br+0.043
run+0.041
Neg logit contributions
ich-0.058
field-0.053
sters-0.052
ster-0.051
ese-0.051
Token thirty
Token ID12277
Feature activation+7.91e-02

Pos logit contributions
sc+0.047
ch+0.044
run+0.042
br+0.042
bench+0.042
Neg logit contributions
ich-0.045
field-0.040
sters-0.040
ster-0.039
illa-0.038
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
 Ellie+0.081
ch+0.079
sc+0.079
br+0.074
 Beth+0.070
Neg logit contributions
ich-0.172
sters-0.159
ster-0.157
field-0.156
illa-0.155
Tokeneight
Token ID26022
Feature activation+4.60e-02

Pos logit contributions
 tricky+0.167
sc+0.165
 Ellie+0.164
 loud+0.162
br+0.162
Neg logit contributions
ich-0.164
illa-0.157
anut-0.143
sters-0.135
ster-0.134
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
ch+0.071
sc+0.070
br+0.067
 Ellie+0.062
bench+0.061
Neg logit contributions
ich-0.079
sters-0.073
field-0.071
ster-0.069
illa-0.069
Token thirty
Token ID12277
Feature activation+6.90e-02

Pos logit contributions
sc+0.038
ch+0.036
run+0.035
bench+0.034
br+0.034
Neg logit contributions
ich-0.038
sters-0.033
field-0.033
ster-0.032
illa-0.031
Token-
Token ID12
Feature activation+1.02e-01

Pos logit contributions
 Ellie+0.070
ch+0.068
sc+0.068
br+0.065
 tricky+0.060
Neg logit contributions
ich-0.152
sters-0.140
ster-0.139
field-0.138
illa-0.136
Tokennine
Token ID30888
Feature activation+5.40e-02

Pos logit contributions
 Ellie+0.160
 tricky+0.158
sc+0.157
 Bella+0.156
 loud+0.155
Neg logit contributions
ich-0.151
illa-0.139
anut-0.128
ster-0.126
sters-0.126
Token Mom
Token ID11254
Feature activation+3.27e-03

Pos logit contributions
ich+0.003
field+0.003
ster+0.003
opsy+0.003
illa+0.003
Neg logit contributions
br-0.001
run-0.001
ch-0.001
sc-0.001
bench-0.001
Tokenmy
Token ID1820
Feature activation+2.23e-02

Pos logit contributions
 Ellie+0.004
ch+0.004
 time+0.003
br+0.003
 hard+0.003
Neg logit contributions
ich-0.007
field-0.006
sters-0.006
ster-0.006
illas-0.006
Token and
Token ID290
Feature activation+3.54e-02

Pos logit contributions
ch+0.020
br+0.018
 Ellie+0.018
 time+0.017
 hard+0.017
Neg logit contributions
ich-0.053
field-0.050
sters-0.048
ster-0.047
illas-0.046
Token her
Token ID607
Feature activation+2.73e-02

Pos logit contributions
 Ellie+0.043
ch+0.041
 loud+0.041
 hard+0.041
br+0.040
Neg logit contributions
ich-0.074
field-0.068
sters-0.064
ster-0.064
illas-0.063
Token three
Token ID1115
Feature activation+4.94e-02

Pos logit contributions
br+0.026
ch+0.024
 Ellie+0.023
 tricky+0.023
run+0.023
Neg logit contributions
ich-0.064
field-0.059
sters-0.056
ster-0.056
ese-0.055
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
br+0.067
ch+0.064
 Ellie+0.063
sc+0.061
 tricky+0.061
Neg logit contributions
ich-0.094
field-0.084
sters-0.081
ster-0.081
ese-0.079
Tokenyear
Token ID1941
Feature activation+9.80e-02

Pos logit contributions
br+0.125
 tricky+0.114
 complicated+0.105
 loud+0.103
sc+0.103
Neg logit contributions
ich-0.234
illa-0.216
anut-0.210
field-0.204
ster-0.203
Token-
Token ID12
Feature activation+9.90e-02

Pos logit contributions
sc+0.100
br+0.100
ch+0.099
 Ellie+0.097
run+0.096
Neg logit contributions
ich-0.219
ster-0.195
sters-0.194
field-0.192
ese-0.189
Tokenold
Token ID727
Feature activation+2.72e-02

Pos logit contributions
br+0.146
 tricky+0.139
sc+0.135
ch+0.132
 loud+0.132
Neg logit contributions
ich-0.175
illa-0.157
anut-0.154
ese-0.150
sters-0.147
Token daughter
Token ID4957
Feature activation+1.09e-02

Pos logit contributions
br+0.049
ch+0.047
 tricky+0.046
run+0.046
 Ellie+0.046
Neg logit contributions
ich-0.040
field-0.034
ster-0.033
sters-0.033
ese-0.032
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
br+0.013
ch+0.012
 tricky+0.012
run+0.012
 Ellie+0.012
Neg logit contributions
ich-0.023
field-0.021
ster-0.020
sters-0.020
ese-0.019
Token thirty
Token ID12277
Feature activation+8.65e-02

Pos logit contributions
sc+0.066
br+0.059
ch+0.059
run+0.056
bench+0.056
Neg logit contributions
ich-0.076
sters-0.069
field-0.068
ster-0.066
illa-0.066
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
 Ellie+0.105
sc+0.101
ch+0.097
 Beth+0.091
br+0.091
Neg logit contributions
ich-0.170
sters-0.160
illa-0.156
ster-0.156
field-0.155
Tokentwo
Token ID11545
Feature activation+6.27e-02

Pos logit contributions
sc+0.177
 Ellie+0.177
br+0.177
 tricky+0.176
 scary+0.175
Neg logit contributions
ich-0.168
illa-0.168
anut-0.148
field-0.140
illas-0.138
Token,
Token ID11
Feature activation+3.92e-02

Pos logit contributions
 Ellie+0.091
sc+0.091
ch+0.090
br+0.086
bench+0.082
Neg logit contributions
ich-0.107
sters-0.099
field-0.098
our-0.096
ster-0.095
Token thirty
Token ID12277
Feature activation+8.14e-02

Pos logit contributions
sc+0.060
ch+0.055
br+0.054
run+0.051
bench+0.051
Neg logit contributions
ich-0.070
sters-0.063
field-0.063
ster-0.061
illa-0.061
Token-
Token ID12
Feature activation+1.10e-01

Pos logit contributions
 Ellie+0.093
sc+0.091
ch+0.088
br+0.083
 Beth+0.081
Neg logit contributions
ich-0.167
sters-0.157
ster-0.153
field-0.152
illa-0.152
Tokenthree
Token ID15542
Feature activation+7.17e-02

Pos logit contributions
 Ellie+0.167
sc+0.164
br+0.164
 tricky+0.162
 Bella+0.162
Neg logit contributions
ich-0.164
illa-0.157
anut-0.144
field-0.137
sters-0.136
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
 Ellie+0.108
ch+0.101
sc+0.098
 Beth+0.097
bul+0.097
Neg logit contributions
ich-0.112
field-0.109
our-0.107
sters-0.107
ese-0.104
Token thirty
Token ID12277
Feature activation+7.55e-02

Pos logit contributions
sc+0.048
ch+0.044
br+0.044
run+0.041
bench+0.041
Neg logit contributions
ich-0.059
sters-0.053
field-0.052
ster-0.052
illa-0.051
Token-
Token ID12
Feature activation+1.04e-01

Pos logit contributions
 Ellie+0.083
sc+0.080
ch+0.079
br+0.075
 Beth+0.071
Neg logit contributions
ich-0.160
sters-0.149
field-0.146
ster-0.146
illa-0.144
Tokenfour
Token ID14337
Feature activation+4.80e-02

Pos logit contributions
 Ellie+0.164
br+0.160
sc+0.159
 Bella+0.158
 tricky+0.158
Neg logit contributions
ich-0.159
illa-0.144
anut-0.134
sters-0.132
field-0.132
Token forty
Token ID16571
Feature activation+3.44e-02

Pos logit contributions
ch+0.006
sc+0.006
 Ellie+0.006
br+0.006
 tricky+0.006
Neg logit contributions
ich-0.006
field-0.005
sters-0.005
ster-0.005
illa-0.005
Token-
Token ID12
Feature activation+9.05e-02

Pos logit contributions
br+0.026
ch+0.025
 Ellie+0.023
 tricky+0.022
run+0.022
Neg logit contributions
ich-0.086
field-0.080
sters-0.077
ster-0.077
ese-0.075
Tokenfour
Token ID14337
Feature activation+4.32e-02

Pos logit contributions
 Ellie+0.161
 tricky+0.159
 loud+0.157
 Bella+0.156
 Beth+0.155
Neg logit contributions
ich-0.121
illa-0.103
ster-0.101
sters-0.100
anut-0.097
Token,
Token ID11
Feature activation+3.04e-02

Pos logit contributions
 Ellie+0.054
ch+0.054
br+0.053
sc+0.051
run+0.050
Neg logit contributions
ich-0.082
field-0.076
sters-0.074
ster-0.073
ese-0.072
Token forty
Token ID16571
Feature activation+5.94e-02

Pos logit contributions
sc+0.047
ch+0.047
 Ellie+0.047
run+0.046
 tricky+0.045
Neg logit contributions
ich-0.046
sters-0.041
field-0.041
ster-0.041
ese-0.038
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
br+0.043
ch+0.040
 Ellie+0.038
 tricky+0.037
run+0.037
Neg logit contributions
ich-0.151
field-0.140
sters-0.135
ster-0.134
ese-0.131
Tokenfive
Token ID13261
Feature activation+2.02e-02

Pos logit contributions
 tricky+0.213
 Ellie+0.209
 loud+0.207
 confusing+0.206
 scary+0.205
Neg logit contributions
ich-0.126
illa-0.109
ster-0.103
anut-0.102
sters-0.100
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
ch+0.028
br+0.028
 Ellie+0.027
sc+0.027
 tricky+0.026
Neg logit contributions
ich-0.037
sters-0.034
ster-0.033
field-0.033
illa-0.032
Token forty
Token ID16571
Feature activation+4.25e-02

Pos logit contributions
ch+0.019
sc+0.019
 Ellie+0.019
run+0.019
 tricky+0.019
Neg logit contributions
ich-0.020
field-0.017
sters-0.017
ster-0.017
ese-0.016
Token-
Token ID12
Feature activation+9.10e-02

Pos logit contributions
br+0.031
ch+0.028
 Ellie+0.027
 tricky+0.026
run+0.026
Neg logit contributions
ich-0.108
field-0.101
sters-0.096
ster-0.096
ese-0.094
Tokensix
Token ID19412
Feature activation+6.33e-03

Pos logit contributions
 tricky+0.170
 Ellie+0.168
 loud+0.166
br+0.165
ch+0.163
Neg logit contributions
ich-0.117
illa-0.097
ster-0.096
sters-0.094
anut-0.092
Token shouted
Token ID17293
Feature activation+1.56e-02

Pos logit contributions
ich+0.018
field+0.017
sters+0.015
ster+0.015
opsy+0.015
Neg logit contributions
br-0.016
 tricky-0.014
ch-0.014
sc-0.014
 complicated-0.013
Token Joe
Token ID5689
Feature activation+1.38e-02

Pos logit contributions
br+0.024
 tricky+0.022
bench+0.022
 complicated+0.022
sc+0.021
Neg logit contributions
ich-0.026
ster-0.024
field-0.023
opsy-0.023
illa-0.022
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
br+0.023
 tricky+0.021
 complicated+0.020
ch+0.020
bench+0.020
Neg logit contributions
ich-0.022
ster-0.021
field-0.020
illa-0.020
anut-0.019
Token a
Token ID257
Feature activation+2.51e-02

Pos logit contributions
br+0.017
sc+0.015
 tricky+0.015
ch+0.015
bench+0.015
Neg logit contributions
ich-0.021
field-0.019
ster-0.018
sters-0.017
�-0.017
Token 3
Token ID513
Feature activation+5.46e-02

Pos logit contributions
 Ellie+0.032
br+0.031
sc+0.031
run+0.031
bench+0.029
Neg logit contributions
ich-0.047
field-0.044
ster-0.040
 porch-0.040
opsy-0.037
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
br+0.081
 Ellie+0.079
ch+0.075
 tricky+0.074
 Ros+0.074
Neg logit contributions
ich-0.091
ese-0.082
field-0.081
ster-0.081
illa-0.080
Tokenyear
Token ID1941
Feature activation+1.06e-01

Pos logit contributions
 tricky+0.142
 complicated+0.138
br+0.136
 confusing+0.136
 baths+0.134
Neg logit contributions
ich-0.192
illa-0.181
ese-0.169
anut-0.166
ster-0.163
Token-
Token ID12
Feature activation+9.69e-02

Pos logit contributions
 Ellie+0.145
sc+0.142
 Beth+0.136
 Bella+0.135
ch+0.135
Neg logit contributions
ich-0.193
ster-0.179
ese-0.168
sters-0.166
field-0.165
Tokenold
Token ID727
Feature activation+2.92e-02

Pos logit contributions
 tricky+0.154
br+0.153
 scary+0.149
 confusing+0.148
 Ellie+0.147
Neg logit contributions
ich-0.149
illa-0.137
ese-0.125
ster-0.123
anut-0.122
Token boy
Token ID2933
Feature activation+1.65e-02

Pos logit contributions
br+0.057
 tricky+0.054
ch+0.053
 complicated+0.053
 confusing+0.051
Neg logit contributions
ich-0.039
ster-0.036
sters-0.033
illa-0.033
field-0.032
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
br+0.028
 tricky+0.027
 complicated+0.026
ch+0.025
sc+0.025
Neg logit contributions
ich-0.026
field-0.024
ster-0.024
illa-0.023
illas-0.023
Token a
Token ID257
Feature activation+7.93e-03

Pos logit contributions
br+0.003
bench+0.002
bul+0.002
smart+0.002
run+0.002
Neg logit contributions
ich-0.004
field-0.004
opsy-0.004
�-0.004
ster-0.003
Token mummy
Token ID38567
Feature activation+7.50e-03

Pos logit contributions
run+0.008
br+0.008
sc+0.008
 Ellie+0.008
ch+0.008
Neg logit contributions
ich-0.017
field-0.015
ster-0.015
sters-0.014
opsy-0.014
Token and
Token ID290
Feature activation+3.46e-02

Pos logit contributions
br+0.007
ch+0.007
 Ellie+0.007
 tricky+0.007
run+0.007
Neg logit contributions
ich-0.017
field-0.016
sters-0.015
ster-0.015
ese-0.015
Token her
Token ID607
Feature activation+2.45e-02

Pos logit contributions
br+0.057
sc+0.054
ch+0.053
run+0.052
 tricky+0.050
Neg logit contributions
ich-0.056
ster-0.051
field-0.051
sters-0.049
opsy-0.048
Token three
Token ID1115
Feature activation+4.89e-02

Pos logit contributions
br+0.034
sc+0.032
ch+0.032
run+0.032
 tricky+0.031
Neg logit contributions
ich-0.047
ster-0.043
field-0.041
sters-0.040
opsy-0.040
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
br+0.062
ch+0.059
sc+0.058
 Ellie+0.058
 tricky+0.057
Neg logit contributions
ich-0.095
sters-0.086
field-0.085
ster-0.085
ese-0.084
Tokenyear
Token ID1941
Feature activation+9.73e-02

Pos logit contributions
br+0.130
 tricky+0.118
 complicated+0.108
 loud+0.106
ch+0.106
Neg logit contributions
ich-0.221
illa-0.210
anut-0.201
field-0.198
ster-0.196
Token-
Token ID12
Feature activation+9.71e-02

Pos logit contributions
sc+0.118
 Ellie+0.113
ch+0.112
br+0.112
run+0.111
Neg logit contributions
ich-0.196
ster-0.182
sters-0.179
actus-0.174
ese-0.171
Tokenold
Token ID727
Feature activation+2.52e-02

Pos logit contributions
br+0.124
 tricky+0.113
ch+0.106
sc+0.106
run+0.106
Neg logit contributions
ich-0.185
illa-0.178
anut-0.172
ster-0.166
field-0.166
Token baby
Token ID5156
Feature activation+3.04e-03

Pos logit contributions
br+0.047
ch+0.045
run+0.044
 tricky+0.044
sc+0.044
Neg logit contributions
ich-0.035
ster-0.030
field-0.029
sters-0.029
illa-0.028
Token were
Token ID547
Feature activation+2.21e-02

Pos logit contributions
br+0.005
ch+0.005
 tricky+0.005
 Ellie+0.005
run+0.005
Neg logit contributions
ich-0.005
field-0.004
ster-0.004
sters-0.004
ese-0.004
Token asked
Token ID1965
Feature activation+1.77e-02

Pos logit contributions
br+0.001
sc+0.001
run+0.001
ch+0.001
 tricky+0.001
Neg logit contributions
ich-0.002
field-0.002
sters-0.001
opsy-0.001
ster-0.001
Token Billy
Token ID15890
Feature activation+1.41e-02

Pos logit contributions
br+0.025
sc+0.024
 tricky+0.023
ch+0.022
 complicated+0.022
Neg logit contributions
ich-0.032
field-0.030
illa-0.029
opsy-0.029
ster-0.028
Token,
Token ID11
Feature activation+1.08e-02

Pos logit contributions
br+0.020
ch+0.019
sc+0.018
 tricky+0.017
 Ellie+0.016
Neg logit contributions
ich-0.027
field-0.025
ster-0.024
sters-0.024
illa-0.023
Token a
Token ID257
Feature activation+2.61e-02

Pos logit contributions
br+0.017
sc+0.016
ch+0.016
 tricky+0.015
run+0.015
Neg logit contributions
ich-0.019
field-0.018
ster-0.016
illa-0.016
opsy-0.015
Token three
Token ID1115
Feature activation+4.75e-02

Pos logit contributions
sc+0.037
run+0.036
br+0.036
 Ellie+0.034
ch+0.034
Neg logit contributions
ich-0.048
field-0.044
ster-0.043
 porch-0.040
�-0.040
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
ch+0.069
br+0.069
sc+0.068
 Ellie+0.068
 impossible+0.062
Neg logit contributions
ich-0.079
ster-0.073
sters-0.071
field-0.070
ese-0.068
Tokenyear
Token ID1941
Feature activation+1.04e-01

Pos logit contributions
 tricky+0.147
br+0.145
 baths+0.141
 Bella+0.139
 loud+0.139
Neg logit contributions
ich-0.175
illa-0.174
anut-0.161
ster-0.153
ese-0.146
Token-
Token ID12
Feature activation+9.56e-02

Pos logit contributions
sc+0.133
ch+0.131
 Ellie+0.125
br+0.124
 Beth+0.119
Neg logit contributions
ich-0.202
ster-0.193
sters-0.179
actus-0.178
field-0.175
Tokenold
Token ID727
Feature activation+2.71e-02

Pos logit contributions
br+0.147
 tricky+0.145
 loud+0.141
ch+0.140
sc+0.139
Neg logit contributions
ich-0.148
illa-0.141
anut-0.133
ster-0.128
ese-0.122
Token boy
Token ID2933
Feature activation+1.45e-02

Pos logit contributions
br+0.058
ch+0.054
 tricky+0.052
 complicated+0.051
sc+0.050
Neg logit contributions
ich-0.035
ster-0.032
sters-0.029
illa-0.027
field-0.027
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
br+0.024
 tricky+0.022
sc+0.022
ch+0.022
 complicated+0.021
Neg logit contributions
ich-0.025
field-0.022
ster-0.021
our-0.021
sters-0.021
Token thirty
Token ID12277
Feature activation+7.55e-02

Pos logit contributions
sc+0.048
ch+0.044
br+0.044
run+0.041
bench+0.041
Neg logit contributions
ich-0.059
sters-0.053
field-0.052
ster-0.052
illa-0.051
Token-
Token ID12
Feature activation+1.04e-01

Pos logit contributions
 Ellie+0.083
sc+0.080
ch+0.079
br+0.075
 Beth+0.071
Neg logit contributions
ich-0.160
sters-0.149
field-0.146
ster-0.146
illa-0.144
Tokenfour
Token ID14337
Feature activation+4.80e-02

Pos logit contributions
 Ellie+0.164
br+0.160
sc+0.159
 Bella+0.158
 tricky+0.158
Neg logit contributions
ich-0.159
illa-0.144
anut-0.134
sters-0.132
field-0.132
Token,
Token ID11
Feature activation+3.35e-02

Pos logit contributions
 Ellie+0.066
ch+0.064
sc+0.061
br+0.060
Ell+0.058
Neg logit contributions
ich-0.082
sters-0.078
field-0.078
ster-0.075
ese-0.075
Token thirty
Token ID12277
Feature activation+7.94e-02

Pos logit contributions
sc+0.050
ch+0.049
br+0.048
run+0.047
 Ellie+0.047
Neg logit contributions
ich-0.056
field-0.051
sters-0.050
ster-0.049
illa-0.045
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
 Ellie+0.073
ch+0.072
br+0.071
sc+0.070
run+0.065
Neg logit contributions
ich-0.182
field-0.167
sters-0.166
ster-0.165
illa-0.161
Tokenfive
Token ID13261
Feature activation+2.40e-02

Pos logit contributions
 Ellie+0.188
 tricky+0.186
br+0.183
 Bella+0.183
 loud+0.182
Neg logit contributions
ich-0.146
illa-0.132
anut-0.122
ster-0.119
sters-0.118
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
ch+0.035
br+0.034
sc+0.033
 Ellie+0.033
 tricky+0.031
Neg logit contributions
ich-0.043
sters-0.039
field-0.039
ster-0.038
illa-0.038
Token thirty
Token ID12277
Feature activation+6.37e-02

Pos logit contributions
sc+0.022
ch+0.022
br+0.021
run+0.021
 Ellie+0.021
Neg logit contributions
ich-0.025
field-0.022
sters-0.022
ster-0.021
illa-0.020
Token-
Token ID12
Feature activation+9.46e-02

Pos logit contributions
 Ellie+0.058
ch+0.057
br+0.057
sc+0.055
run+0.051
Neg logit contributions
ich-0.147
field-0.135
sters-0.134
ster-0.133
ese-0.130
Tokensix
Token ID19412
Feature activation+1.39e-02

Pos logit contributions
 Ellie+0.165
br+0.161
 tricky+0.161
sc+0.159
 loud+0.158
Neg logit contributions
ich-0.134
illa-0.115
ster-0.109
sters-0.109
anut-0.107
TokenT
Token ID51
Feature activation+1.78e-02

Pos logit contributions
ich+0.004
field+0.003
sters+0.003
ster+0.003
ese+0.003
Neg logit contributions
br-0.003
ch-0.003
 Ellie-0.003
 tricky-0.003
run-0.003
Tokeneddy
Token ID21874
Feature activation+1.75e-02

Pos logit contributions
br+0.027
ch+0.026
 Ellie+0.026
run+0.026
 tricky+0.025
Neg logit contributions
ich-0.030
field-0.028
sters-0.026
ster-0.025
illas-0.025
Token,
Token ID11
Feature activation+5.71e-03

Pos logit contributions
br+0.025
ch+0.024
sc+0.023
run+0.023
 tricky+0.023
Neg logit contributions
ich-0.032
field-0.030
ster-0.028
sters-0.028
ese-0.027
Token a
Token ID257
Feature activation+2.52e-02

Pos logit contributions
br+0.009
sc+0.009
ch+0.009
 tricky+0.008
run+0.008
Neg logit contributions
ich-0.010
field-0.009
ster-0.009
illa-0.009
sters-0.008
Token three
Token ID1115
Feature activation+5.14e-02

Pos logit contributions
sc+0.033
br+0.033
run+0.032
ch+0.032
 Ellie+0.031
Neg logit contributions
ich-0.047
field-0.043
ster-0.042
�-0.039
 porch-0.039
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
 Ellie+0.073
ch+0.070
 impossible+0.070
sc+0.070
 Ros+0.069
Neg logit contributions
ich-0.082
ster-0.077
sters-0.076
ese-0.075
field-0.071
Tokenyear
Token ID1941
Feature activation+9.85e-02

Pos logit contributions
 tricky+0.148
 baths+0.145
 impossible+0.142
 confusing+0.142
 scary+0.142
Neg logit contributions
illa-0.179
ich-0.174
ster-0.152
anut-0.150
ese-0.142
Token-
Token ID12
Feature activation+9.35e-02

Pos logit contributions
sc+0.145
ch+0.135
 Ellie+0.132
 Beth+0.128
run+0.127
Neg logit contributions
ich-0.174
ster-0.165
sters-0.151
ese-0.145
field-0.144
Tokenold
Token ID727
Feature activation+2.66e-02

Pos logit contributions
 tricky+0.157
 impossible+0.154
 scary+0.153
br+0.153
 confusing+0.152
Neg logit contributions
ich-0.127
illa-0.125
ster-0.107
anut-0.104
ese-0.101
Token,
Token ID11
Feature activation+1.49e-03

Pos logit contributions
br+0.057
ch+0.053
 tricky+0.052
sc+0.052
 complicated+0.052
Neg logit contributions
ster-0.030
ich-0.029
illa-0.026
sters-0.025
ese-0.023
Token was
Token ID373
Feature activation+6.24e-03

Pos logit contributions
br+0.003
ch+0.002
sc+0.002
igsaw+0.002
bench+0.002
Neg logit contributions
ich-0.002
field-0.002
ster-0.002
sters-0.002
 porch-0.002
Token
Token ID198
Feature activation+1.23e-03

Pos logit contributions
br+0.017
ch+0.016
 Ellie+0.016
 tricky+0.016
run+0.015
Neg logit contributions
ich-0.021
field-0.019
sters-0.018
ster-0.018
ese-0.017
TokenJohn
Token ID7554
Feature activation+5.07e-03

Pos logit contributions
br+0.002
ch+0.002
 Ellie+0.002
run+0.002
 tricky+0.002
Neg logit contributions
ich-0.002
field-0.002
sters-0.002
ster-0.002
ese-0.002
Token,
Token ID11
Feature activation+6.68e-03

Pos logit contributions
br+0.008
ch+0.007
 Ellie+0.007
sc+0.007
 tricky+0.007
Neg logit contributions
ich-0.009
field-0.008
ster-0.008
sters-0.008
illa-0.008
Token a
Token ID257
Feature activation+2.83e-02

Pos logit contributions
br+0.009
sc+0.008
ch+0.008
 tricky+0.008
run+0.008
Neg logit contributions
ich-0.013
field-0.012
ster-0.012
illa-0.012
sters-0.011
Token 3
Token ID513
Feature activation+4.75e-02

Pos logit contributions
br+0.038
run+0.037
sc+0.037
 Ellie+0.036
ch+0.035
Neg logit contributions
ich-0.055
field-0.049
ster-0.049
opsy-0.044
 porch-0.044
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
br+0.072
ch+0.067
 Ellie+0.066
 impossible+0.065
 tricky+0.065
Neg logit contributions
ich-0.079
ster-0.071
ese-0.070
illa-0.070
field-0.069
Tokenyear
Token ID1941
Feature activation+9.82e-02

Pos logit contributions
 tricky+0.128
br+0.128
 complicated+0.121
 loud+0.120
 impossible+0.118
Neg logit contributions
ich-0.206
illa-0.197
ster-0.185
anut-0.184
ese-0.178
Token-
Token ID12
Feature activation+9.46e-02

Pos logit contributions
sc+0.110
 Ellie+0.109
br+0.109
ch+0.108
run+0.105
Neg logit contributions
ich-0.203
ster-0.191
sters-0.181
field-0.180
actus-0.176
Tokenold
Token ID727
Feature activation+2.43e-02

Pos logit contributions
br+0.137
 tricky+0.134
 loud+0.130
ch+0.128
 Ellie+0.127
Neg logit contributions
ich-0.161
illa-0.149
ster-0.143
anut-0.141
ese-0.137
Token boy
Token ID2933
Feature activation+1.06e-02

Pos logit contributions
br+0.050
ch+0.046
 tricky+0.046
 complicated+0.045
sc+0.044
Neg logit contributions
ich-0.031
ster-0.029
sters-0.026
illa-0.025
field-0.024
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
br+0.018
 tricky+0.017
 complicated+0.017
 loud+0.016
ch+0.016
Neg logit contributions
ich-0.016
field-0.015
illas-0.015
ster-0.015
illa-0.015
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
ich+0.008
field+0.007
sters+0.007
ster+0.007
ese+0.007
Neg logit contributions
br-0.003
 Ellie-0.003
ch-0.003
 tricky-0.002
 loud-0.002
Token zoo
Token ID26626
Feature activation-1.82e-02

Pos logit contributions
ich+0.018
illas+0.017
sters+0.017
field+0.017
our+0.016
Neg logit contributions
 loud-0.016
ch-0.016
 tricky-0.015
 hard-0.015
 fun-0.015
Token to
Token ID284
Feature activation-1.86e-02

Pos logit contributions
ich+0.048
field+0.044
sters+0.044
opsy+0.044
ster+0.043
Neg logit contributions
 loud-0.009
 hard-0.009
 of-0.009
 much-0.009
br-0.009
Token see
Token ID766
Feature activation-6.85e-03

Pos logit contributions
ich+0.033
field+0.030
sters+0.028
ster+0.028
ese+0.027
Neg logit contributions
br-0.028
ch-0.027
 Ellie-0.026
 tricky-0.026
run-0.026
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
ich+0.017
field+0.016
ster+0.015
sters+0.015
ese+0.015
Neg logit contributions
br-0.005
 Ellie-0.005
ch-0.005
 loud-0.005
 tricky-0.005
Token ostr
Token ID42387
Feature activation-6.81e-02

Pos logit contributions
ich+0.034
field+0.032
sters+0.031
ster+0.030
illas+0.030
Neg logit contributions
br-0.025
ch-0.025
 loud-0.025
 hard-0.024
 Ellie-0.024
Tokenich
Token ID488
Feature activation-2.94e-02

Pos logit contributions
 decide+0.040
 porch+0.038
riage+0.038
 dining+0.037
 seeker+0.036
Neg logit contributions
ch-0.144
br-0.143
 cho-0.132
 Tweet-0.131
run-0.129
Token.
Token ID13
Feature activation-1.70e-03

Pos logit contributions
ich+0.066
field+0.061
sters+0.058
ster+0.058
ese+0.056
Neg logit contributions
br-0.028
ch-0.028
 Ellie-0.027
run-0.026
 loud-0.026
Token She
Token ID1375
Feature activation-7.17e-03

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
ster+0.003
 porch+0.003
Neg logit contributions
br-0.002
ch-0.002
 tricky-0.002
sc-0.002
run-0.002
Token observed
Token ID6515
Feature activation-6.79e-03

Pos logit contributions
ich+0.013
field+0.011
sters+0.011
ster+0.011
ese+0.010
Neg logit contributions
br-0.011
ch-0.011
 Ellie-0.010
 tricky-0.010
run-0.010
Token it
Token ID340
Feature activation+4.98e-03

Pos logit contributions
ich+0.012
sters+0.010
field+0.010
ster+0.010
ese+0.010
Neg logit contributions
bench-0.011
br-0.011
sc-0.011
heet-0.010
igsaw-0.010
Token were
Token ID547
Feature activation-4.00e-03

Pos logit contributions
ich+0.053
field+0.046
ster+0.045
sters+0.044
ese+0.043
Neg logit contributions
br-0.032
 Ellie-0.032
 hard-0.032
 loud-0.031
ch-0.031
Token friends
Token ID2460
Feature activation-2.78e-02

Pos logit contributions
ich+0.009
field+0.008
sters+0.008
ster+0.008
ese+0.008
Neg logit contributions
br-0.004
ch-0.004
run-0.004
sc-0.004
 tricky-0.003
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
ich+0.068
field+0.066
sters+0.063
ster+0.061
actus+0.061
Neg logit contributions
 much-0.024
 Ellie-0.024
 hard-0.024
 loud-0.023
ch-0.022
Token lived
Token ID5615
Feature activation+4.87e-03

Pos logit contributions
br+0.017
ch+0.017
 Ellie+0.016
 tricky+0.016
 loud+0.016
Neg logit contributions
ich-0.025
field-0.023
sters-0.022
ster-0.022
ese-0.021
Token near
Token ID1474
Feature activation-2.53e-02

Pos logit contributions
 hard+0.004
br+0.004
 loud+0.004
 tricky+0.004
ch+0.004
Neg logit contributions
ich-0.012
field-0.011
sters-0.011
ster-0.011
ese-0.010
Token each
Token ID1123
Feature activation-6.75e-02

Pos logit contributions
ich+0.064
field+0.059
sters+0.057
ster+0.057
ese+0.055
Neg logit contributions
br-0.019
ch-0.018
 Ellie-0.017
 tricky-0.017
run-0.016
Token other
Token ID584
Feature activation-1.83e-02

Pos logit contributions
ich+0.158
sters+0.146
field+0.144
our+0.139
illas+0.136
Neg logit contributions
 of-0.078
 Ellie-0.075
 hard-0.075
 time-0.075
 loud-0.070
Token.
Token ID13
Feature activation+2.12e-03

Pos logit contributions
ich+0.037
field+0.034
sters+0.033
ster+0.032
ese+0.031
Neg logit contributions
br-0.023
ch-0.022
run-0.021
 Ellie-0.021
 tricky-0.021
Token The
Token ID383
Feature activation+2.83e-03

Pos logit contributions
 Ellie+0.002
br+0.002
ch+0.002
 loud+0.002
sc+0.002
Neg logit contributions
ich-0.005
field-0.005
ster-0.004
sters-0.004
anut-0.004
Token sun
Token ID4252
Feature activation-1.43e-02

Pos logit contributions
 hard+0.003
 Ellie+0.003
 loud+0.003
br+0.003
 tricky+0.003
Neg logit contributions
ich-0.006
field-0.006
illas-0.005
illa-0.005
sters-0.005
Token was
Token ID373
Feature activation+1.35e-03

Pos logit contributions
ich+0.031
field+0.028
ster+0.027
sters+0.027
ese+0.026
Neg logit contributions
br-0.016
ch-0.015
run-0.015
 tricky-0.015
 Ellie-0.015
Token are
Token ID389
Feature activation-6.15e-03

Pos logit contributions
ich+0.063
field+0.054
ster+0.052
sters+0.052
ese+0.050
Neg logit contributions
br-0.046
 hard-0.044
ch-0.044
 loud-0.044
 tricky-0.043
Token careful
Token ID8161
Feature activation-3.86e-02

Pos logit contributions
ich+0.011
field+0.010
anut+0.010
ster+0.010
sters+0.009
Neg logit contributions
br-0.009
sc-0.009
ch-0.009
run-0.008
 Ellie-0.008
Token.
Token ID13
Feature activation-4.59e-03

Pos logit contributions
ich+0.077
field+0.068
sters+0.065
ster+0.065
ese+0.064
Neg logit contributions
br-0.052
ch-0.049
 Ellie-0.047
 tricky-0.047
sc-0.046
Token They
Token ID1119
Feature activation-3.48e-02

Pos logit contributions
ich+0.009
field+0.008
sters+0.008
opsy+0.008
ster+0.008
Neg logit contributions
br-0.006
ch-0.005
 tricky-0.005
sc-0.005
run-0.005
Token look
Token ID804
Feature activation+4.78e-03

Pos logit contributions
ich+0.062
field+0.055
ster+0.052
sters+0.052
ese+0.050
Neg logit contributions
br-0.052
ch-0.050
 Ellie-0.049
 tricky-0.049
 loud-0.048
Token both
Token ID1111
Feature activation-6.64e-02

Pos logit contributions
br+0.006
ch+0.006
 Ellie+0.006
run+0.006
 tricky+0.005
Neg logit contributions
ich-0.009
field-0.008
ster-0.008
sters-0.008
ese-0.008
Token ways
Token ID2842
Feature activation-6.97e-03

Pos logit contributions
ich+0.146
field+0.131
illa+0.124
ster+0.124
sters+0.123
Neg logit contributions
 hard-0.075
 loud-0.074
 tricky-0.071
br-0.071
 fun-0.069
Token.
Token ID13
Feature activation-3.67e-03

Pos logit contributions
ich+0.013
field+0.012
ster+0.011
illas+0.011
anut+0.010
Neg logit contributions
br-0.011
lic-0.010
ch-0.010
 tricky-0.009
sc-0.009
Token They
Token ID1119
Feature activation-3.47e-02

Pos logit contributions
ich+0.007
field+0.006
opsy+0.006
sters+0.006
mm+0.006
Neg logit contributions
br-0.005
 tricky-0.005
ch-0.005
sc-0.004
run-0.004
Token cross
Token ID3272
Feature activation-7.16e-03

Pos logit contributions
ich+0.068
field+0.057
ster+0.056
sters+0.055
illa+0.054
Neg logit contributions
br-0.047
 hard-0.046
 loud-0.046
 tricky-0.045
 Ellie-0.045
Token the
Token ID262
Feature activation-2.23e-03

Pos logit contributions
ich+0.012
illas+0.011
field+0.011
ese+0.010
anut+0.010
Neg logit contributions
br-0.011
 Ellie-0.010
lic-0.010
Ell-0.010
 Beth-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.32e-02

Pos logit contributions
ich+0.002
field+0.002
ster+0.002
sters+0.002
ese+0.002
Neg logit contributions
br-0.001
ch-0.001
 tricky-0.001
 Ellie-0.001
run-0.001
TokenThe
Token ID464
Feature activation+9.67e-03

Pos logit contributions
ich+0.020
field+0.018
ster+0.017
sters+0.017
illas+0.016
Neg logit contributions
br-0.022
ch-0.022
 tricky-0.021
 Ellie-0.021
run-0.021
Token two
Token ID734
Feature activation+3.50e-03

Pos logit contributions
br+0.014
sc+0.013
run+0.013
bench+0.013
drivers+0.012
Neg logit contributions
ich-0.018
field-0.016
ster-0.016
opsy-0.015
actus-0.015
Token players
Token ID1938
Feature activation-2.24e-02

Pos logit contributions
br+0.005
sc+0.005
bench+0.005
run+0.005
ch+0.005
Neg logit contributions
ich-0.006
field-0.005
ese-0.005
sters-0.005
ster-0.005
Token covered
Token ID5017
Feature activation+1.72e-04

Pos logit contributions
ich+0.039
field+0.037
ster+0.034
sters+0.034
illas+0.034
Neg logit contributions
br-0.033
ch-0.031
 tricky-0.031
run-0.030
 Ellie-0.030
Token each
Token ID1123
Feature activation-6.63e-02

Pos logit contributions
br+0.000
ch+0.000
run+0.000
 tricky+0.000
bench+0.000
Neg logit contributions
ich-0.000
field-0.000
sters-0.000
ster-0.000
illa-0.000
Token other
Token ID584
Feature activation-1.60e-02

Pos logit contributions
ich+0.113
field+0.101
sters+0.095
ster+0.095
ese+0.091
Neg logit contributions
br-0.104
ch-0.100
 Ellie-0.098
 tricky-0.097
run-0.097
Token in
Token ID287
Feature activation-1.78e-02

Pos logit contributions
ich+0.030
field+0.028
ster+0.026
sters+0.026
ese+0.025
Neg logit contributions
br-0.022
ch-0.021
 Ellie-0.020
run-0.020
 tricky-0.019
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 porch+0.029
field+0.029
ich+0.029
ese+0.027
 wrist+0.027
Neg logit contributions
br-0.026
ch-0.023
lic-0.023
sc-0.021
 Ellie-0.021
Token game
Token ID983
Feature activation-1.14e-02

Pos logit contributions
ich+0.018
field+0.016
ster+0.014
 porch+0.014
ese+0.014
Neg logit contributions
br-0.025
run-0.024
 Ellie-0.023
ch-0.023
smart-0.023
Token.
Token ID13
Feature activation-3.83e-03

Pos logit contributions
field+0.019
ich+0.019
ster+0.017
illas+0.017
sters+0.017
Neg logit contributions
br-0.017
ch-0.017
run-0.015
 Ellie-0.015
 tricky-0.015
Token around
Token ID1088
Feature activation-2.42e-03

Pos logit contributions
br+0.001
ch+0.001
 Ellie+0.001
 tricky+0.001
run+0.001
Neg logit contributions
ich-0.003
field-0.003
sters-0.002
ster-0.002
ese-0.002
Token the
Token ID262
Feature activation-2.44e-03

Pos logit contributions
ich+0.005
field+0.005
ster+0.005
sters+0.004
illa+0.004
Neg logit contributions
br-0.003
ch-0.003
run-0.003
 tricky-0.002
sc-0.002
Token house
Token ID2156
Feature activation+3.36e-03

Pos logit contributions
ich+0.004
sters+0.004
illas+0.004
mm+0.004
our+0.004
Neg logit contributions
 loud-0.004
ch-0.004
 hard-0.004
 fun-0.004
 old-0.004
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
br+0.005
ch+0.004
run+0.004
sc+0.004
 tricky+0.004
Neg logit contributions
ich-0.007
field-0.006
ster-0.006
sters-0.006
our-0.005
Token entering
Token ID8218
Feature activation-1.23e-02

Pos logit contributions
ich+0.029
field+0.026
sters+0.026
ster+0.025
ese+0.025
Neg logit contributions
br-0.022
ch-0.021
 Ellie-0.021
 loud-0.021
 tricky-0.021
Token each
Token ID1123
Feature activation-6.62e-02

Pos logit contributions
ich+0.027
field+0.025
ster+0.024
sters+0.024
illa+0.023
Neg logit contributions
br-0.013
ch-0.013
run-0.012
sc-0.012
 Ellie-0.011
Token room
Token ID2119
Feature activation+2.64e-02

Pos logit contributions
ich+0.134
sters+0.121
field+0.121
ster+0.115
our+0.111
Neg logit contributions
ch-0.084
br-0.083
 loud-0.079
 Ellie-0.078
 time-0.077
Token to
Token ID284
Feature activation-8.39e-03

Pos logit contributions
ch+0.041
br+0.040
sc+0.037
run+0.035
 Ellie+0.035
Neg logit contributions
ich-0.047
field-0.045
sters-0.041
our-0.040
ster-0.040
Token explore
Token ID7301
Feature activation-1.52e-02

Pos logit contributions
ich+0.015
field+0.014
sters+0.013
ster+0.013
ese+0.013
Neg logit contributions
br-0.012
ch-0.012
 Ellie-0.011
 tricky-0.011
run-0.011
Token.
Token ID13
Feature activation+2.09e-03

Pos logit contributions
ich+0.031
field+0.029
sters+0.027
ster+0.027
ese+0.026
Neg logit contributions
br-0.018
ch-0.017
run-0.017
sc-0.016
 tricky-0.016
Token
Token ID198
Feature activation+2.23e-03

Pos logit contributions
ch+0.002
 Ellie+0.002
br+0.002
run+0.002
 much+0.002
Neg logit contributions
ich-0.005
sters-0.004
ese-0.004
field-0.004
ster-0.004
Token Why
Token ID4162
Feature activation-2.89e-02

Pos logit contributions
br+0.006
 tricky+0.006
ch+0.006
 Ellie+0.006
 hard+0.006
Neg logit contributions
ich-0.009
field-0.008
sters-0.008
ster-0.007
ese-0.007
Token did
Token ID750
Feature activation-3.11e-02

Pos logit contributions
ich+0.057
field+0.052
ster+0.048
sters+0.048
ese+0.047
Neg logit contributions
br-0.039
ch-0.037
 loud-0.036
 hard-0.036
 tricky-0.036
Token you
Token ID345
Feature activation-1.83e-03

Pos logit contributions
ich+0.040
sters+0.036
ster+0.033
field+0.032
mm+0.032
Neg logit contributions
sc-0.061
run-0.058
bench-0.058
br-0.057
ch-0.057
Token make
Token ID787
Feature activation-1.34e-02

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
ster+0.003
illas+0.003
Neg logit contributions
br-0.003
ch-0.003
 Ellie-0.003
 tricky-0.003
 loud-0.003
Token me
Token ID502
Feature activation-1.44e-03

Pos logit contributions
ich+0.029
sters+0.026
field+0.026
ster+0.026
ese+0.025
Neg logit contributions
br-0.015
sc-0.015
run-0.014
ch-0.014
 Ellie-0.013
Token lick
Token ID36728
Feature activation-6.56e-02

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
ster+0.003
illas+0.003
Neg logit contributions
br-0.002
ch-0.002
 tricky-0.002
 loud-0.002
 Ellie-0.002
Token it
Token ID340
Feature activation+1.87e-02

Pos logit contributions
ich+0.137
field+0.125
sters+0.119
ster+0.119
ese+0.116
Neg logit contributions
br-0.077
ch-0.074
 Ellie-0.071
 tricky-0.070
run-0.070
Token?"
Token ID1701
Feature activation-9.14e-04

Pos logit contributions
br+0.038
ch+0.037
 Ellie+0.036
 tricky+0.036
run+0.036
Neg logit contributions
ich-0.023
field-0.020
sters-0.018
ster-0.018
ese-0.017
Token
Token ID198
Feature activation+3.18e-03

Pos logit contributions
ich+0.002
field+0.001
ster+0.001
sters+0.001
our+0.001
Neg logit contributions
br-0.001
ch-0.001
sc-0.001
 tricky-0.001
run-0.001
Token
Token ID198
Feature activation-3.79e-03

Pos logit contributions
br+0.004
ch+0.004
sc+0.004
 tricky+0.004
 Ellie+0.004
Neg logit contributions
ich-0.006
field-0.005
ster-0.005
sters-0.005
actus-0.005
TokenL
Token ID43
Feature activation+3.34e-02

Pos logit contributions
ich+0.008
field+0.007
ster+0.007
sters+0.007
ese+0.007
Neg logit contributions
br-0.004
ch-0.004
 Ellie-0.004
run-0.004
sc-0.004
Token They
Token ID1119
Feature activation-2.64e-02

Pos logit contributions
br+0.003
ch+0.003
 Ellie+0.003
run+0.003
 tricky+0.003
Neg logit contributions
ich-0.010
field-0.009
ster-0.009
sters-0.009
ese-0.009
Token bounce
Token ID20110
Feature activation-1.04e-02

Pos logit contributions
ich+0.048
field+0.044
sters+0.041
ster+0.041
ese+0.040
Neg logit contributions
br-0.038
ch-0.036
run-0.035
 Ellie-0.035
 tricky-0.035
Token the
Token ID262
Feature activation-1.47e-03

Pos logit contributions
ich+0.016
field+0.015
sters+0.014
ster+0.014
actus+0.014
Neg logit contributions
run-0.016
br-0.014
lic-0.014
 Beth-0.014
smart-0.014
Token ball
Token ID2613
Feature activation-2.44e-02

Pos logit contributions
ich+0.003
field+0.002
sters+0.002
ster+0.002
ese+0.002
Neg logit contributions
br-0.002
ch-0.002
 Ellie-0.002
run-0.002
 tricky-0.002
Token to
Token ID284
Feature activation-1.57e-02

Pos logit contributions
ich+0.041
field+0.040
ster+0.037
sters+0.035
our+0.035
Neg logit contributions
run-0.033
br-0.033
 Beth-0.033
lic-0.033
 complicated-0.032
Token each
Token ID1123
Feature activation-6.53e-02

Pos logit contributions
ich+0.036
field+0.033
sters+0.032
ster+0.032
actus+0.031
Neg logit contributions
 Ellie-0.016
 loud-0.016
ch-0.016
 hard-0.015
 tricky-0.015
Token other
Token ID584
Feature activation-1.95e-02

Pos logit contributions
ich+0.136
field+0.129
sters+0.129
ster+0.120
asis+0.119
Neg logit contributions
ch-0.084
 hard-0.082
 time-0.080
 loud-0.080
sc-0.080
Token.
Token ID13
Feature activation-1.15e-03

Pos logit contributions
ich+0.037
field+0.034
ster+0.032
sters+0.032
ese+0.031
Neg logit contributions
br-0.027
ch-0.025
 Ellie-0.024
run-0.024
 tricky-0.024
Token They
Token ID1119
Feature activation-2.66e-02

Pos logit contributions
ich+0.003
field+0.002
ster+0.002
sters+0.002
ese+0.002
Neg logit contributions
br-0.001
ch-0.001
 Ellie-0.001
run-0.001
 tricky-0.001
Token laugh
Token ID6487
Feature activation-1.72e-02

Pos logit contributions
ich+0.049
field+0.045
sters+0.042
ster+0.042
ese+0.041
Neg logit contributions
br-0.037
ch-0.036
 Ellie-0.035
run-0.034
 tricky-0.034
Token and
Token ID290
Feature activation-7.50e-03

Pos logit contributions
ich+0.032
field+0.028
ster+0.028
sters+0.028
ese+0.027
Neg logit contributions
br-0.023
run-0.023
ch-0.022
 tricky-0.022
 Ellie-0.022
Token asked
Token ID1965
Feature activation+1.82e-04

Pos logit contributions
br+0.001
ch+0.001
run+0.001
sc+0.001
 Ellie+0.001
Neg logit contributions
ich-0.001
field-0.001
sters-0.001
ster-0.001
illa-0.001
Token Max
Token ID5436
Feature activation+7.62e-03

Pos logit contributions
br+0.000
ch+0.000
sc+0.000
lic+0.000
 spe+0.000
Neg logit contributions
ich-0.000
illa-0.000
ster-0.000
sters-0.000
illas-0.000
Token,
Token ID11
Feature activation-3.68e-03

Pos logit contributions
br+0.012
ch+0.011
sc+0.011
bench+0.011
 complicated+0.010
Neg logit contributions
ich-0.013
ster-0.012
sters-0.012
illa-0.012
field-0.011
Token "
Token ID366
Feature activation-9.45e-04

Pos logit contributions
ich+0.007
field+0.006
ster+0.006
sters+0.006
illa+0.006
Neg logit contributions
br-0.005
ch-0.005
sc-0.005
 tricky-0.005
bench-0.005
TokenWhich
Token ID13828
Feature activation-7.75e-03

Pos logit contributions
ich+0.001
field+0.001
ster+0.001
sters+0.001
ese+0.001
Neg logit contributions
br-0.002
ch-0.002
 tricky-0.002
run-0.002
 Ellie-0.002
Token leash
Token ID40529
Feature activation-6.39e-02

Pos logit contributions
ich+0.014
field+0.013
ster+0.012
ese+0.012
sters+0.012
Neg logit contributions
lic-0.011
br-0.011
ch-0.010
run-0.010
sc-0.010
Token do
Token ID466
Feature activation+2.80e-03

Pos logit contributions
ich+0.112
field+0.103
sters+0.099
ster+0.097
illas+0.094
Neg logit contributions
br-0.092
ch-0.089
run-0.085
 Ellie-0.085
 tricky-0.084
Token you
Token ID345
Feature activation+1.71e-02

Pos logit contributions
lic+0.005
sc+0.005
br+0.005
bench+0.005
 tant+0.005
Neg logit contributions
ich-0.003
sters-0.003
field-0.003
illas-0.003
ster-0.003
Token want
Token ID765
Feature activation-4.65e-03

Pos logit contributions
br+0.019
 tricky+0.019
ch+0.019
 Ellie+0.018
 hard+0.018
Neg logit contributions
ich-0.038
field-0.033
ster-0.032
sters-0.032
ese-0.031
Token?"
Token ID1701
Feature activation+6.44e-03

Pos logit contributions
ich+0.007
field+0.006
ster+0.006
sters+0.006
ese+0.006
Neg logit contributions
br-0.008
ch-0.007
run-0.007
 tricky-0.007
sc-0.007
Token Max
Token ID5436
Feature activation+1.19e-02

Pos logit contributions
br+0.007
 tricky+0.006
ch+0.006
sc+0.006
run+0.006
Neg logit contributions
ich-0.014
field-0.013
our-0.012
ster-0.012
sters-0.012
Token he
Token ID339
Feature activation-1.58e-02

Pos logit contributions
ich+0.009
field+0.009
ster+0.008
sters+0.008
illas+0.008
Neg logit contributions
br-0.004
run-0.004
ch-0.004
sc-0.004
 tricky-0.004
Token saw
Token ID2497
Feature activation-2.13e-03

Pos logit contributions
ich+0.030
field+0.027
sters+0.026
ster+0.026
ese+0.025
Neg logit contributions
br-0.022
ch-0.021
 tricky-0.020
 Ellie-0.020
run-0.020
Token a
Token ID257
Feature activation-1.56e-03

Pos logit contributions
ich+0.005
field+0.005
ster+0.005
sters+0.005
ese+0.005
Neg logit contributions
br-0.002
sc-0.002
run-0.002
ch-0.002
 Ellie-0.001
Token point
Token ID966
Feature activation+1.90e-03

Pos logit contributions
ich+0.004
field+0.003
ster+0.003
sters+0.003
ese+0.003
Neg logit contributions
br-0.001
ch-0.001
run-0.001
 Ellie-0.001
sc-0.001
Token far
Token ID1290
Feature activation-1.21e-02

Pos logit contributions
 Ellie+0.002
br+0.002
 tricky+0.002
sc+0.002
ch+0.002
Neg logit contributions
ich-0.004
field-0.004
ster-0.004
sters-0.003
our-0.003
Token away
Token ID1497
Feature activation-6.38e-02

Pos logit contributions
ich+0.019
field+0.018
ster+0.017
actus+0.017
asis+0.016
Neg logit contributions
drivers-0.019
run-0.018
br-0.017
 subt-0.017
 Ellie-0.017
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
ich+0.156
field+0.141
sters+0.140
ese+0.138
ster+0.137
Neg logit contributions
br-0.053
ch-0.049
 loud-0.048
 tricky-0.047
 hard-0.047
Token He
Token ID679
Feature activation-1.23e-02

Pos logit contributions
br+0.004
ch+0.004
 tricky+0.004
sc+0.004
run+0.004
Neg logit contributions
ich-0.009
field-0.008
ster-0.008
sters-0.008
our-0.008
Token wanted
Token ID2227
Feature activation-2.01e-02

Pos logit contributions
ich+0.023
field+0.021
ster+0.020
sters+0.020
ese+0.019
Neg logit contributions
br-0.017
ch-0.016
 Ellie-0.016
run-0.016
sc-0.016
Token to
Token ID284
Feature activation-2.04e-02

Pos logit contributions
ich+0.046
field+0.043
ster+0.042
sters+0.041
illa+0.039
Neg logit contributions
br-0.019
sc-0.018
run-0.017
ch-0.017
bench-0.016
Token shine
Token ID18340
Feature activation-1.05e-02

Pos logit contributions
ich+0.046
sters+0.040
field+0.040
illas+0.039
ster+0.039
Neg logit contributions
br-0.023
 loud-0.022
ch-0.022
 Ellie-0.022
 hard-0.021
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
ich+0.036
field+0.036
 porch+0.032
ese+0.030
illas+0.030
Neg logit contributions
br-0.034
lic-0.032
sc-0.031
ch-0.031
run-0.029
Token big
Token ID1263
Feature activation-1.70e-02

Pos logit contributions
ich+0.016
field+0.015
ster+0.013
ese+0.013
sters+0.013
Neg logit contributions
br-0.023
run-0.023
 Ellie-0.023
sc-0.022
ch-0.022
Token tank
Token ID6873
Feature activation-3.80e-02

Pos logit contributions
ich+0.027
field+0.025
ster+0.023
sters+0.023
ese+0.023
Neg logit contributions
br-0.028
run-0.027
 Ellie-0.027
ch-0.027
sc-0.026
Token at
Token ID379
Feature activation-1.88e-02

Pos logit contributions
ich+0.082
field+0.075
sters+0.071
ster+0.071
ese+0.069
Neg logit contributions
br-0.042
ch-0.040
 Ellie-0.039
run-0.038
 tricky-0.038
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
ich+0.046
sters+0.042
field+0.041
ster+0.041
ese+0.040
Neg logit contributions
 Ellie-0.015
br-0.014
ch-0.014
 tricky-0.014
 hard-0.014
Token pet
Token ID4273
Feature activation-6.38e-02

Pos logit contributions
ich+0.022
field+0.019
sters+0.018
ster+0.018
ese+0.017
Neg logit contributions
br-0.026
ch-0.025
 tricky-0.025
 Ellie-0.024
 loud-0.024
Token shop
Token ID6128
Feature activation-1.49e-02

Pos logit contributions
ich+0.137
field+0.122
sters+0.117
anut+0.116
ster+0.116
Neg logit contributions
ch-0.071
br-0.070
sc-0.067
 Ellie-0.065
run-0.062
Token.
Token ID13
Feature activation-3.05e-03

Pos logit contributions
ich+0.032
field+0.030
sters+0.028
ster+0.028
ese+0.028
Neg logit contributions
br-0.016
ch-0.015
 Ellie-0.015
 tricky-0.015
run-0.015
Token They
Token ID1119
Feature activation-2.45e-02

Pos logit contributions
ich+0.007
field+0.006
our+0.006
sters+0.006
ster+0.006
Neg logit contributions
br-0.003
ch-0.003
sc-0.003
 tricky-0.002
run-0.002
Token were
Token ID547
Feature activation+2.62e-03

Pos logit contributions
ich+0.042
field+0.039
sters+0.036
ster+0.036
illas+0.035
Neg logit contributions
br-0.038
ch-0.036
 Ellie-0.035
run-0.035
sc-0.035
Token lively
Token ID29696
Feature activation+8.19e-03

Pos logit contributions
br+0.004
sc+0.004
run+0.004
ch+0.004
igsaw+0.003
Neg logit contributions
ich-0.005
field-0.004
ese-0.004
sters-0.004
anut-0.004
Token.
Token ID13
Feature activation+3.06e-04

Pos logit contributions
ich+0.013
field+0.012
ster+0.011
sters+0.011
ese+0.011
Neg logit contributions
br-0.009
 tricky-0.008
 Ellie-0.008
run-0.008
 complicated-0.008
Token The
Token ID383
Feature activation-1.63e-03

Pos logit contributions
br+0.000
sc+0.000
 tricky+0.000
ch+0.000
bench+0.000
Neg logit contributions
ich-0.001
field-0.001
ster-0.001
sters-0.000
our-0.000
Token eagle
Token ID31176
Feature activation-1.37e-02

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
illa+0.003
our+0.003
Neg logit contributions
 loud-0.002
 hard-0.002
br-0.002
ch-0.002
 scary-0.002
Token sees
Token ID7224
Feature activation+2.46e-04

Pos logit contributions
ich+0.024
field+0.022
sters+0.021
ster+0.021
ese+0.020
Neg logit contributions
br-0.020
 Ellie-0.019
ch-0.019
run-0.019
 tricky-0.019
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
bench+0.000
br+0.000
sc+0.000
heet+0.000
 tricky+0.000
Neg logit contributions
ich-0.000
sters-0.000
ese-0.000
ster-0.000
anut-0.000
Token ostr
Token ID42387
Feature activation-6.36e-02

Pos logit contributions
ich+0.035
field+0.032
sters+0.030
ster+0.030
illas+0.029
Neg logit contributions
br-0.031
ch-0.030
 tricky-0.029
 loud-0.029
 hard-0.029
Tokenich
Token ID488
Feature activation-3.31e-02

Pos logit contributions
 dining+0.040
field+0.040
 decide+0.039
 porch+0.037
atmeal+0.037
Neg logit contributions
ch-0.134
br-0.129
 cho-0.126
run-0.122
--0.120
Token and
Token ID290
Feature activation-6.32e-03

Pos logit contributions
ich+0.061
field+0.055
ster+0.053
sters+0.053
ese+0.051
Neg logit contributions
br-0.047
 Ellie-0.045
 tricky-0.045
ch-0.044
run-0.043
Token wants
Token ID3382
Feature activation-1.11e-02

Pos logit contributions
ich+0.012
field+0.011
sters+0.011
ster+0.010
illas+0.010
Neg logit contributions
br-0.009
bench-0.008
sc-0.008
ch-0.008
 tricky-0.008
Token to
Token ID284
Feature activation-1.71e-02

Pos logit contributions
ich+0.021
ese+0.019
ster+0.019
sters+0.019
anut+0.018
Neg logit contributions
bench-0.015
br-0.014
igsaw-0.014
heet-0.013
sc-0.013
Token catch
Token ID4929
Feature activation-2.51e-02

Pos logit contributions
ich+0.030
field+0.027
ster+0.025
sters+0.025
ese+0.025
Neg logit contributions
br-0.026
ch-0.024
sc-0.024
run-0.024
 tricky-0.024
Token big
Token ID1263
Feature activation-6.39e-03

Pos logit contributions
ich+0.001
field+0.001
illas+0.001
sters+0.001
ese+0.001
Neg logit contributions
 loud-0.000
br-0.000
ch-0.000
 hard-0.000
 tricky-0.000
Token pond
Token ID16723
Feature activation-3.39e-02

Pos logit contributions
ich+0.012
field+0.011
sters+0.010
ster+0.010
ese+0.010
Neg logit contributions
br-0.009
ch-0.008
 Ellie-0.008
 tricky-0.008
run-0.008
Token.
Token ID13
Feature activation-5.16e-04

Pos logit contributions
ich+0.082
field+0.076
sters+0.074
ster+0.073
ese+0.072
Neg logit contributions
br-0.027
ch-0.027
 Ellie-0.025
 tricky-0.024
sc-0.024
Token In
Token ID554
Feature activation-8.07e-03

Pos logit contributions
ich+0.001
field+0.001
sters+0.001
ster+0.001
ese+0.001
Neg logit contributions
br-0.001
ch-0.001
sc-0.001
 tricky-0.001
run-0.001
Token the
Token ID262
Feature activation-2.69e-02

Pos logit contributions
ich+0.020
field+0.019
sters+0.018
ster+0.018
ese+0.018
Neg logit contributions
br-0.006
ch-0.006
 Ellie-0.006
 hard-0.005
 tricky-0.005
Token pond
Token ID16723
Feature activation-6.34e-02

Pos logit contributions
ich+0.046
illa+0.041
sters+0.039
field+0.039
illas+0.039
Neg logit contributions
ch-0.043
 loud-0.043
 hard-0.042
 fun-0.041
 time-0.041
Token,
Token ID11
Feature activation-2.08e-02

Pos logit contributions
ich+0.155
field+0.140
sters+0.139
illa+0.137
ese+0.136
Neg logit contributions
 hard-0.053
br-0.051
 much-0.050
ch-0.049
 loud-0.049
Token there
Token ID612
Feature activation+6.13e-03

Pos logit contributions
ich+0.050
field+0.047
ese+0.044
ster+0.044
sters+0.044
Neg logit contributions
br-0.017
 Ellie-0.017
 hard-0.017
 loud-0.017
ch-0.017
Token was
Token ID373
Feature activation-6.31e-03

Pos logit contributions
br+0.006
ch+0.006
 Ellie+0.005
 hard+0.005
 loud+0.005
Neg logit contributions
ich-0.014
field-0.013
sters-0.013
ster-0.013
anut-0.012
Token a
Token ID257
Feature activation-2.40e-03

Pos logit contributions
ich+0.016
field+0.015
ster+0.015
sters+0.014
ese+0.014
Neg logit contributions
br-0.004
ch-0.004
run-0.004
 Ellie-0.004
sc-0.003
Token sw
Token ID1509
Feature activation-1.96e-02

Pos logit contributions
ich+0.006
field+0.006
sters+0.005
illas+0.005
illa+0.005
Neg logit contributions
br-0.002
 loud-0.002
ch-0.002
 tricky-0.002
 hard-0.002
Token was
Token ID373
Feature activation+6.25e-03

Pos logit contributions
ich+0.013
field+0.011
illa+0.011
ster+0.011
sters+0.011
Neg logit contributions
br-0.012
ch-0.011
sc-0.010
run-0.010
 tricky-0.010
Token right
Token ID826
Feature activation+7.55e-03

Pos logit contributions
br+0.008
ch+0.008
run+0.007
sc+0.007
 Ellie+0.007
Neg logit contributions
ich-0.013
field-0.011
ster-0.011
sters-0.011
ese-0.011
Token.
Token ID13
Feature activation+7.87e-03

Pos logit contributions
br+0.009
ch+0.009
 Ellie+0.008
run+0.008
 tricky+0.008
Neg logit contributions
ich-0.016
field-0.015
ster-0.014
sters-0.014
ese-0.013
Token Eating
Token ID37537
Feature activation-2.31e-02

Pos logit contributions
br+0.009
ch+0.009
 tricky+0.008
sc+0.008
run+0.008
Neg logit contributions
ich-0.016
field-0.015
sters-0.014
ster-0.014
our-0.014
Token different
Token ID1180
Feature activation-9.51e-04

Pos logit contributions
ich+0.044
field+0.042
sters+0.039
ster+0.038
ese+0.037
Neg logit contributions
br-0.031
run-0.030
ch-0.030
sc-0.028
 Ellie-0.028
Token foods
Token ID9013
Feature activation-6.32e-02

Pos logit contributions
ich+0.001
field+0.001
sters+0.001
ese+0.001
ster+0.001
Neg logit contributions
br-0.002
ch-0.002
run-0.002
 Ellie-0.001
sc-0.001
Token was
Token ID373
Feature activation-8.18e-04

Pos logit contributions
field+0.086
ich+0.086
sters+0.084
our+0.081
ster+0.078
Neg logit contributions
ch-0.105
br-0.103
 Ellie-0.099
run-0.098
Ell-0.097
Token important
Token ID1593
Feature activation+2.36e-02

Pos logit contributions
anut+0.001
ich+0.001
field+0.001
opsy+0.001
sters+0.001
Neg logit contributions
br-0.001
sc-0.001
ch-0.001
run-0.001
igsaw-0.001
Token for
Token ID329
Feature activation+1.63e-03

Pos logit contributions
ch+0.040
br+0.039
lic+0.036
Ell+0.035
 Ellie+0.035
Neg logit contributions
ich-0.036
sters-0.034
field-0.033
ster-0.032
our-0.031
Token her
Token ID607
Feature activation+6.92e-03

Pos logit contributions
br+0.002
sc+0.002
ch+0.002
lic+0.002
run+0.002
Neg logit contributions
ich-0.003
field-0.003
sters-0.003
anut-0.003
illa-0.003
Token health
Token ID1535
Feature activation-1.02e-02

Pos logit contributions
br+0.011
ch+0.011
sc+0.010
 Ellie+0.010
run+0.010
Neg logit contributions
ich-0.012
field-0.011
ster-0.011
sters-0.010
illa-0.010
Token sees
Token ID7224
Feature activation+3.88e-03

Pos logit contributions
ich+0.027
field+0.024
sters+0.022
ster+0.022
ese+0.021
Neg logit contributions
br-0.033
ch-0.032
 Ellie-0.031
run-0.031
 tricky-0.031
Token a
Token ID257
Feature activation-3.03e-03

Pos logit contributions
sc+0.006
br+0.005
run+0.005
ch+0.005
bench+0.005
Neg logit contributions
ich-0.007
ster-0.007
field-0.006
sters-0.006
illa-0.006
Token big
Token ID1263
Feature activation-1.94e-02

Pos logit contributions
ich+0.007
field+0.006
ster+0.006
sters+0.006
actus+0.006
Neg logit contributions
run-0.003
sc-0.003
br-0.003
ch-0.003
 Ellie-0.003
Token building
Token ID2615
Feature activation+2.14e-02

Pos logit contributions
ich+0.034
field+0.030
ster+0.029
sters+0.028
ese+0.027
Neg logit contributions
br-0.030
run-0.029
sc-0.029
ch-0.029
 Ellie-0.028
Token far
Token ID1290
Feature activation-2.78e-02

Pos logit contributions
br+0.028
 Ellie+0.028
ch+0.027
run+0.027
 Beth+0.026
Neg logit contributions
ich-0.041
field-0.037
ster-0.035
sters-0.034
 porch-0.033
Token away
Token ID1497
Feature activation-6.32e-02

Pos logit contributions
ich+0.044
field+0.042
asis+0.037
actus+0.037
ster+0.037
Neg logit contributions
run-0.044
br-0.043
drivers-0.040
 Ellie-0.038
lic-0.038
Token.
Token ID13
Feature activation-4.25e-03

Pos logit contributions
ich+0.147
field+0.133
sters+0.132
ster+0.129
ese+0.129
Neg logit contributions
br-0.059
ch-0.055
 tricky-0.054
 loud-0.054
 Ellie-0.053
Token It
Token ID632
Feature activation+1.27e-02

Pos logit contributions
ich+0.009
field+0.008
ster+0.008
sters+0.008
illa+0.007
Neg logit contributions
br-0.005
sc-0.004
ch-0.004
 tricky-0.004
run-0.004
Token is
Token ID318
Feature activation+5.30e-03

Pos logit contributions
run+0.025
br+0.023
sc+0.023
 Ellie+0.023
ch+0.022
Neg logit contributions
field-0.014
ich-0.014
sters-0.012
asis-0.011
ster-0.011
Token very
Token ID845
Feature activation-1.91e-02

Pos logit contributions
sc+0.007
run+0.007
br+0.007
ch+0.006
igsaw+0.006
Neg logit contributions
ich-0.010
field-0.009
 porch-0.009
ster-0.008
 backyard-0.008
Token tall
Token ID7331
Feature activation+2.41e-02

Pos logit contributions
ich+0.044
field+0.040
ster+0.038
sters+0.037
actus+0.037
Neg logit contributions
br-0.018
sc-0.018
run-0.018
ch-0.017
 Ellie-0.016
Token held
Token ID2714
Feature activation-2.03e-02

Pos logit contributions
ich+0.066
field+0.058
sters+0.057
ster+0.056
asis+0.055
Neg logit contributions
br-0.032
 loud-0.031
 hard-0.031
 Ellie-0.031
 fun-0.030
Token hands
Token ID2832
Feature activation-3.62e-02

Pos logit contributions
ich+0.035
field+0.031
ster+0.031
sters+0.031
illas+0.030
Neg logit contributions
br-0.032
ch-0.029
lic-0.029
 tricky-0.028
sc-0.028
Token.
Token ID13
Feature activation+1.30e-04

Pos logit contributions
ich+0.065
field+0.059
ster+0.056
sters+0.055
ese+0.054
Neg logit contributions
br-0.055
ch-0.051
 tricky-0.050
run-0.049
sc-0.049
Token They
Token ID1119
Feature activation-3.01e-02

Pos logit contributions
 tricky+0.000
br+0.000
sc+0.000
ch+0.000
bench+0.000
Neg logit contributions
ich-0.000
field-0.000
sters-0.000
ster-0.000
�-0.000
Token looked
Token ID3114
Feature activation-2.26e-02

Pos logit contributions
ich+0.063
field+0.055
sters+0.054
ster+0.053
ese+0.052
Neg logit contributions
br-0.035
 loud-0.033
 hard-0.033
 Ellie-0.033
ch-0.033
Token both
Token ID1111
Feature activation-6.29e-02

Pos logit contributions
ich+0.044
field+0.040
ster+0.038
sters+0.038
ese+0.037
Neg logit contributions
br-0.030
ch-0.028
 Ellie-0.027
run-0.027
 tricky-0.027
Token ways
Token ID2842
Feature activation-1.20e-03

Pos logit contributions
ich+0.124
field+0.113
ster+0.107
sters+0.107
ese+0.104
Neg logit contributions
br-0.082
ch-0.080
 Ellie-0.077
run-0.075
sc-0.075
Token.
Token ID13
Feature activation-1.04e-05

Pos logit contributions
ich+0.002
field+0.002
ster+0.002
illas+0.002
sters+0.002
Neg logit contributions
br-0.002
ch-0.001
lic-0.001
sc-0.001
 tricky-0.001
Token They
Token ID1119
Feature activation-3.00e-02

Pos logit contributions
ich+0.000
field+0.000
sters+0.000
ster+0.000
opsy+0.000
Neg logit contributions
br-0.000
 tricky-0.000
bench-0.000
ch-0.000
sc-0.000
Token crossed
Token ID12606
Feature activation-5.56e-03

Pos logit contributions
ich+0.064
field+0.055
sters+0.054
ster+0.053
ese+0.053
Neg logit contributions
br-0.033
 hard-0.033
 loud-0.033
 Ellie-0.032
 tricky-0.032
Token the
Token ID262
Feature activation+6.13e-03

Pos logit contributions
ich+0.009
field+0.009
illas+0.008
ster+0.008
ese+0.007
Neg logit contributions
br-0.009
run-0.009
lic-0.008
ch-0.008
sc-0.008
Token dove
Token ID38862
Feature activation+3.17e-03

Pos logit contributions
br+0.001
ch+0.001
 Ellie+0.001
sc+0.001
run+0.001
Neg logit contributions
ich-0.001
field-0.001
ster-0.001
sters-0.001
our-0.001
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
br+0.005
run+0.005
ch+0.004
 complicated+0.004
sc+0.004
Neg logit contributions
ich-0.005
field-0.005
ster-0.005
sters-0.005
ese-0.005
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
ich+0.023
field+0.021
ster+0.019
sters+0.019
 porch+0.018
Neg logit contributions
br-0.026
ch-0.024
sc-0.024
lic-0.024
igsaw-0.023
Token they
Token ID484
Feature activation-1.71e-02

Pos logit contributions
br+0.026
ch+0.025
run+0.025
sc+0.024
lic+0.024
Neg logit contributions
ich-0.022
ster-0.022
sters-0.021
field-0.021
 forget-0.019
Token hung
Token ID9174
Feature activation+1.52e-02

Pos logit contributions
ich+0.036
field+0.032
sters+0.031
ster+0.031
ese+0.030
Neg logit contributions
br-0.021
ch-0.020
 Ellie-0.020
 hard-0.019
 loud-0.019
Token both
Token ID1111
Feature activation-6.27e-02

Pos logit contributions
br+0.024
Ell+0.023
 Ellie+0.022
lic+0.022
sc+0.022
Neg logit contributions
ich-0.026
ster-0.023
sters-0.023
ble-0.023
field-0.021
Token pictures
Token ID5986
Feature activation+4.79e-03

Pos logit contributions
ich+0.114
field+0.104
ster+0.099
sters+0.096
ese+0.096
Neg logit contributions
br-0.089
 Ellie-0.087
run-0.085
ch-0.083
lic-0.082
Token on
Token ID319
Feature activation-3.52e-03

Pos logit contributions
 tricky+0.008
 complicated+0.007
 unfair+0.007
run+0.007
 frustrating+0.007
Neg logit contributions
ich-0.007
field-0.006
ster-0.006
sters-0.006
illas-0.006
Token the
Token ID262
Feature activation-2.49e-03

Pos logit contributions
ich+0.006
field+0.006
ster+0.005
illas+0.005
sters+0.005
Neg logit contributions
br-0.005
run-0.005
lic-0.005
ch-0.005
sc-0.005
Token fridge
Token ID25772
Feature activation+7.05e-03

Pos logit contributions
ich+0.005
field+0.004
ster+0.004
sters+0.004
ese+0.004
Neg logit contributions
br-0.004
run-0.004
ch-0.004
 Ellie-0.003
 tricky-0.003
Token.
Token ID13
Feature activation+5.23e-04

Pos logit contributions
ch+0.011
br+0.011
run+0.011
 Ellie+0.011
 tricky+0.010
Neg logit contributions
ich-0.011
field-0.010
sters-0.010
ster-0.010
our-0.010
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
ich+0.065
field+0.061
ster+0.056
sters+0.055
ese+0.053
Neg logit contributions
br-0.051
ch-0.049
run-0.048
 tricky-0.048
 Ellie-0.046
Token They
Token ID1119
Feature activation-2.78e-02

Pos logit contributions
ich+0.015
field+0.014
our+0.013
sters+0.013
 porch+0.013
Neg logit contributions
br-0.009
ch-0.008
 tricky-0.008
sc-0.008
run-0.007
Token liked
Token ID8288
Feature activation+9.12e-04

Pos logit contributions
ich+0.051
field+0.046
sters+0.044
ster+0.043
ese+0.042
Neg logit contributions
br-0.040
ch-0.038
 Ellie-0.037
run-0.037
 tricky-0.037
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
br+0.001
sc+0.001
bench+0.001
igsaw+0.001
lic+0.001
Neg logit contributions
ich-0.002
 porch-0.001
field-0.001
ese-0.001
sters-0.001
Token spray
Token ID11662
Feature activation-1.42e-02

Pos logit contributions
ich+0.023
field+0.021
ese+0.019
illas+0.019
sters+0.019
Neg logit contributions
br-0.023
run-0.021
ch-0.021
 Ellie-0.021
sc-0.021
Token each
Token ID1123
Feature activation-6.27e-02

Pos logit contributions
ese+0.016
sters+0.016
 ourselves+0.015
ich+0.015
flower+0.014
Neg logit contributions
bench-0.026
Ell-0.026
run-0.025
br-0.024
lic-0.024
Token other
Token ID584
Feature activation-1.08e-02

Pos logit contributions
ich+0.096
field+0.085
ster+0.079
sters+0.078
ese+0.077
Neg logit contributions
br-0.107
 Ellie-0.102
ch-0.102
run-0.101
 tricky-0.101
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
ich+0.017
field+0.016
ster+0.015
sters+0.015
ese+0.015
Neg logit contributions
br-0.017
run-0.017
lic-0.016
ch-0.016
 Ellie-0.016
Token make
Token ID787
Feature activation-3.03e-03

Pos logit contributions
ich+0.024
field+0.023
sters+0.022
 porch+0.022
illas+0.021
Neg logit contributions
br-0.030
ch-0.027
run-0.026
sc-0.026
lic-0.026
Token funny
Token ID8258
Feature activation-2.36e-02

Pos logit contributions
ich+0.005
ese+0.005
sters+0.005
 sandbox+0.004
 ourselves+0.004
Neg logit contributions
br-0.005
bench-0.005
run-0.004
igsaw-0.004
Ell-0.004
Token noises
Token ID26782
Feature activation+1.70e-03

Pos logit contributions
ich+0.042
sters+0.039
ese+0.037
ster+0.036
field+0.035
Neg logit contributions
 Ellie-0.034
br-0.034
 tricky-0.032
run-0.032
ch-0.032
Tokeny
Token ID88
Feature activation-1.67e-02

Pos logit contributions
ich+0.026
field+0.025
sters+0.024
ster+0.023
ese+0.023
Neg logit contributions
ch-0.011
br-0.011
 Ellie-0.011
run-0.010
sc-0.010
Token.
Token ID13
Feature activation+1.98e-03

Pos logit contributions
ich+0.038
field+0.035
sters+0.034
ster+0.033
ese+0.033
Neg logit contributions
br-0.016
ch-0.016
 Ellie-0.015
 tricky-0.014
 loud-0.014
Token Lily
Token ID20037
Feature activation-8.00e-03

Pos logit contributions
br+0.002
ch+0.002
 Ellie+0.002
run+0.002
 tricky+0.002
Neg logit contributions
ich-0.005
field-0.004
sters-0.004
ster-0.004
ese-0.004
Token loved
Token ID6151
Feature activation-3.07e-03

Pos logit contributions
ich+0.017
field+0.015
sters+0.015
ster+0.015
ese+0.014
Neg logit contributions
br-0.009
 Ellie-0.009
ch-0.009
 hard-0.009
 tricky-0.008
Token to
Token ID284
Feature activation-2.01e-02

Pos logit contributions
ich+0.007
field+0.006
sters+0.006
ese+0.006
opsy+0.006
Neg logit contributions
br-0.003
sc-0.003
run-0.003
 tricky-0.003
bench-0.003
Token feed
Token ID3745
Feature activation-6.26e-02

Pos logit contributions
ich+0.032
field+0.030
sters+0.027
ster+0.027
opsy+0.027
Neg logit contributions
br-0.032
sc-0.030
run-0.030
ch-0.030
 tricky-0.030
Token Snow
Token ID7967
Feature activation-2.26e-02

Pos logit contributions
ich+0.144
field+0.133
sters+0.126
ster+0.126
ese+0.123
Neg logit contributions
br-0.061
ch-0.058
 Ellie-0.055
 tricky-0.054
run-0.054
Tokeny
Token ID88
Feature activation-2.16e-02

Pos logit contributions
ich+0.049
field+0.046
sters+0.043
ster+0.043
ese+0.042
Neg logit contributions
br-0.024
ch-0.024
 Ellie-0.022
run-0.022
sc-0.022
Token carrots
Token ID34397
Feature activation-4.39e-02

Pos logit contributions
ich+0.038
field+0.035
sters+0.034
ster+0.033
ese+0.033
Neg logit contributions
br-0.032
ch-0.031
run-0.030
 Ellie-0.030
 tricky-0.029
Token and
Token ID290
Feature activation-5.74e-03

Pos logit contributions
ich+0.100
field+0.091
sters+0.087
ster+0.086
ese+0.085
Neg logit contributions
br-0.044
ch-0.042
 Ellie-0.041
 tricky-0.040
 hard-0.039
Token lettuce
Token ID39406
Feature activation-3.73e-02

Pos logit contributions
ich+0.009
field+0.009
sters+0.008
ster+0.008
ese+0.008
Neg logit contributions
br-0.009
run-0.009
ch-0.009
sc-0.009
 tricky-0.009
Token came
Token ID1625
Feature activation+1.14e-02

Pos logit contributions
br+0.013
ch+0.012
 tricky+0.012
run+0.012
sc+0.012
Neg logit contributions
ich-0.022
field-0.020
ster-0.019
sters-0.019
illa-0.019
Token to
Token ID284
Feature activation-2.45e-03

Pos logit contributions
br+0.009
ch+0.007
 Ellie+0.007
 tricky+0.007
sc+0.007
Neg logit contributions
ich-0.029
field-0.027
ster-0.026
sters-0.026
ese-0.025
Token the
Token ID262
Feature activation-1.16e-02

Pos logit contributions
ich+0.004
field+0.004
illa+0.003
illas+0.003
opsy+0.003
Neg logit contributions
br-0.004
sc-0.004
ch-0.004
run-0.004
 tricky-0.004
Token stable
Token ID8245
Feature activation-2.19e-02

Pos logit contributions
ich+0.020
sters+0.018
field+0.018
illas+0.017
ster+0.017
Neg logit contributions
ch-0.017
br-0.017
 loud-0.017
 hard-0.016
 tricky-0.016
Token to
Token ID284
Feature activation-1.35e-02

Pos logit contributions
ich+0.051
field+0.046
sters+0.046
ster+0.045
ese+0.044
Neg logit contributions
br-0.019
ch-0.018
 Ellie-0.018
 tricky-0.018
run-0.018
Token feed
Token ID3745
Feature activation-6.24e-02

Pos logit contributions
ich+0.022
field+0.021
ster+0.019
sters+0.019
ese+0.019
Neg logit contributions
br-0.021
ch-0.020
run-0.020
sc-0.020
 tricky-0.020
Token him
Token ID683
Feature activation-8.81e-03

Pos logit contributions
ich+0.137
field+0.128
sters+0.123
ese+0.121
ster+0.121
Neg logit contributions
br-0.067
ch-0.060
 tricky-0.059
run-0.058
sc-0.056
Token.
Token ID13
Feature activation+6.78e-04

Pos logit contributions
ich+0.020
field+0.018
sters+0.017
ster+0.017
ese+0.017
Neg logit contributions
br-0.009
ch-0.009
 Ellie-0.008
 tricky-0.008
run-0.008
Token She
Token ID1375
Feature activation-6.20e-03

Pos logit contributions
br+0.001
sc+0.001
ch+0.001
bul+0.001
bench+0.001
Neg logit contributions
ich-0.001
field-0.001
 porch-0.001
opsy-0.001
sters-0.001
Token brought
Token ID3181
Feature activation-8.33e-03

Pos logit contributions
ich+0.012
field+0.011
sters+0.011
ster+0.011
ese+0.010
Neg logit contributions
br-0.008
ch-0.007
 Ellie-0.007
 tricky-0.007
 loud-0.007
Token a
Token ID257
Feature activation+1.38e-02

Pos logit contributions
ese+0.013
ich+0.012
anut+0.012
illa+0.012
sters+0.012
Neg logit contributions
br-0.013
bench-0.013
 complicated-0.012
 tricky-0.012
run-0.012
Token bug
Token ID5434
Feature activation+1.89e-03

Pos logit contributions
ich+0.094
field+0.085
sters+0.080
ster+0.080
ese+0.077
Neg logit contributions
br-0.078
ch-0.075
 Ellie-0.073
 tricky-0.073
run-0.073
Token.
Token ID13
Feature activation-4.24e-03

Pos logit contributions
br+0.003
ch+0.003
run+0.003
 tricky+0.003
sc+0.003
Neg logit contributions
ich-0.003
field-0.003
sters-0.003
ster-0.003
ese-0.003
Token Jen
Token ID13364
Feature activation-1.14e-02

Pos logit contributions
ich+0.009
field+0.009
sters+0.008
ster+0.008
ese+0.008
Neg logit contributions
br-0.004
ch-0.004
 Ellie-0.004
 tricky-0.004
run-0.004
Token wanted
Token ID2227
Feature activation-2.29e-02

Pos logit contributions
ich+0.024
field+0.022
ster+0.021
sters+0.021
ese+0.020
Neg logit contributions
br-0.013
 Ellie-0.012
ch-0.012
 tricky-0.012
 hard-0.012
Token to
Token ID284
Feature activation-2.04e-02

Pos logit contributions
ich+0.049
field+0.045
sters+0.044
ster+0.043
ese+0.043
Neg logit contributions
br-0.025
sc-0.024
run-0.023
ch-0.022
bench-0.022
Token feed
Token ID3745
Feature activation-6.22e-02

Pos logit contributions
ich+0.034
field+0.031
sters+0.028
ster+0.028
ese+0.028
Neg logit contributions
br-0.032
ch-0.031
sc-0.030
run-0.030
 tricky-0.030
Token the
Token ID262
Feature activation-2.07e-02

Pos logit contributions
ich+0.143
field+0.132
sters+0.127
ster+0.126
ese+0.124
Neg logit contributions
br-0.060
ch-0.056
run-0.053
 tricky-0.053
 Ellie-0.053
Token different
Token ID1180
Feature activation-8.07e-03

Pos logit contributions
field+0.037
ich+0.036
illas+0.035
asis+0.033
sters+0.033
Neg logit contributions
 loud-0.029
 hard-0.029
 scary-0.028
 tricky-0.028
ch-0.028
Token insect
Token ID12833
Feature activation-3.78e-03

Pos logit contributions
ich+0.013
field+0.012
ster+0.011
illas+0.011
sters+0.011
Neg logit contributions
br-0.013
ch-0.013
 tricky-0.013
 Ellie-0.013
 loud-0.013
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
ich+0.007
field+0.007
sters+0.006
ster+0.006
ese+0.006
Neg logit contributions
br-0.005
ch-0.005
run-0.005
bench-0.005
sc-0.004
Token but
Token ID475
Feature activation-1.64e-02

Pos logit contributions
ich+0.023
field+0.021
sters+0.021
 porch+0.020
ese+0.020
Neg logit contributions
br-0.023
bench-0.021
ch-0.021
sc-0.021
igsaw-0.021
Token.
Token ID13
Feature activation+8.00e-03

Pos logit contributions
ch+0.050
br+0.049
Ell+0.049
run+0.048
lic+0.047
Neg logit contributions
ich-0.034
field-0.033
ster-0.033
sters-0.032
our-0.031
Token
Token ID198
Feature activation+5.66e-03

Pos logit contributions
br+0.009
ch+0.009
 tricky+0.008
 loud+0.008
sc+0.008
Neg logit contributions
ich-0.017
field-0.016
ster-0.015
sters-0.014
illa-0.014
Token
Token ID198
Feature activation-4.76e-03

Pos logit contributions
br+0.007
ch+0.006
 Ellie+0.006
 tricky+0.006
run+0.006
Neg logit contributions
ich-0.012
field-0.011
sters-0.010
ster-0.010
ese-0.010
TokenThe
Token ID464
Feature activation+8.90e-03

Pos logit contributions
ich+0.008
field+0.007
ster+0.006
sters+0.006
ese+0.006
Neg logit contributions
br-0.008
 Ellie-0.007
ch-0.007
run-0.007
 tricky-0.007
Token three
Token ID1115
Feature activation+3.37e-02

Pos logit contributions
br+0.010
ch+0.010
 tricky+0.009
 loud+0.009
 hard+0.009
Neg logit contributions
ich-0.019
field-0.018
sters-0.017
ster-0.016
ese-0.016
Token-
Token ID12
Feature activation+9.42e-02

Pos logit contributions
 Ellie+0.036
br+0.036
ch+0.036
sc+0.034
 tricky+0.034
Neg logit contributions
ich-0.071
sters-0.065
field-0.065
ster-0.064
ese-0.063
Tokenyear
Token ID1941
Feature activation+8.64e-02

Pos logit contributions
 tricky+0.105
br+0.101
 loud+0.100
 Ellie+0.097
 confusing+0.097
Neg logit contributions
ich-0.192
illa-0.181
ster-0.172
anut-0.170
sters-0.163
Token-
Token ID12
Feature activation+8.35e-02

Pos logit contributions
sc+0.091
ch+0.089
br+0.088
 Ellie+0.087
run+0.085
Neg logit contributions
ich-0.192
ster-0.175
sters-0.171
field-0.170
actus-0.165
Tokenold
Token ID727
Feature activation+9.50e-03

Pos logit contributions
 tricky+0.116
br+0.116
sc+0.115
ch+0.114
 loud+0.113
Neg logit contributions
ich-0.147
illa-0.133
ster-0.129
anut-0.127
sters-0.124
Token looked
Token ID3114
Feature activation-7.25e-03

Pos logit contributions
br+0.015
ch+0.014
run+0.014
 tricky+0.014
sc+0.014
Neg logit contributions
ich-0.016
ster-0.015
sters-0.014
field-0.014
illa-0.013
Token around
Token ID1088
Feature activation+1.90e-03

Pos logit contributions
ich+0.012
field+0.011
ster+0.011
sters+0.011
anut+0.010
Neg logit contributions
br-0.011
ch-0.010
 Ellie-0.010
sc-0.010
run-0.009
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
ich+0.006
field+0.005
ster+0.005
sters+0.005
ese+0.005
Neg logit contributions
br-0.003
ch-0.003
 Ellie-0.003
 tricky-0.003
run-0.003
TokenSoon
Token ID28093
Feature activation+3.19e-02

Pos logit contributions
ich+0.018
field+0.017
sters+0.016
ster+0.015
ese+0.015
Neg logit contributions
br-0.016
ch-0.015
 tricky-0.015
 Ellie-0.015
sc-0.015
Token,
Token ID11
Feature activation-6.48e-03

Pos logit contributions
run+0.050
ch+0.047
sc+0.046
 tricky+0.045
br+0.045
Neg logit contributions
ich-0.050
sters-0.049
ster-0.049
field-0.046
illas-0.042
Token the
Token ID262
Feature activation-5.13e-03

Pos logit contributions
ich+0.011
field+0.011
sters+0.010
ster+0.010
illas+0.010
Neg logit contributions
run-0.010
br-0.009
ch-0.009
sc-0.009
bench-0.009
Token door
Token ID3420
Feature activation+4.39e-02

Pos logit contributions
ich+0.008
ster+0.007
field+0.006
 porch+0.006
actus+0.006
Neg logit contributions
run-0.010
smart-0.009
lic-0.009
 Ellie-0.009
br-0.009
Tokenbell
Token ID7923
Feature activation+8.84e-02

Pos logit contributions
igsaw+0.064
ch+0.064
 Ellie+0.063
 baths+0.063
run+0.063
Neg logit contributions
ich-0.063
field-0.060
our-0.060
 doorstep-0.057
ster-0.057
Token rang
Token ID28077
Feature activation+1.62e-02

Pos logit contributions
 tricky+0.136
ch+0.136
run+0.134
br+0.133
sc+0.128
Neg logit contributions
ich-0.139
our-0.126
field-0.124
sters-0.120
ster-0.119
Token.
Token ID13
Feature activation-3.30e-03

Pos logit contributions
br+0.028
run+0.028
sc+0.027
ch+0.027
 tricky+0.027
Neg logit contributions
ich-0.021
ster-0.019
field-0.018
our-0.018
illa-0.018
Token Lily
Token ID20037
Feature activation-8.06e-03

Pos logit contributions
ich+0.005
field+0.005
ster+0.005
sters+0.005
our+0.005
Neg logit contributions
br-0.005
ch-0.005
 tricky-0.004
sc-0.004
 hard-0.004
Token ran
Token ID4966
Feature activation-2.48e-03

Pos logit contributions
ich+0.016
field+0.015
sters+0.014
ster+0.014
illas+0.013
Neg logit contributions
br-0.010
ch-0.010
 Ellie-0.010
 tricky-0.010
run-0.010
Token to
Token ID284
Feature activation+8.35e-03

Pos logit contributions
ich+0.004
ster+0.004
field+0.004
 doorstep+0.004
 porch+0.004
Neg logit contributions
br-0.003
run-0.003
ch-0.003
 tricky-0.003
 complicated-0.003
Token<|endoftext|>
Token ID50256
Feature activation-5.64e-03

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
ster+0.003
ese+0.002
Neg logit contributions
br-0.001
ch-0.001
 Ellie-0.001
run-0.001
 tricky-0.001
TokenBen
Token ID11696
Feature activation+4.09e-06

Pos logit contributions
ich+0.010
field+0.009
ster+0.008
sters+0.008
illas+0.008
Neg logit contributions
br-0.008
 tricky-0.008
ch-0.008
run-0.008
 Ellie-0.008
Token was
Token ID373
Feature activation+9.84e-03

Feature not active on prev token;
no predictions were affected.
Token a
Token ID257
Feature activation+4.33e-03

Pos logit contributions
br+0.016
sc+0.014
ch+0.014
drivers+0.013
 Ellie+0.013
Neg logit contributions
ich-0.017
field-0.015
illa-0.014
opsy-0.014
ster-0.013
Token 3
Token ID513
Feature activation+3.97e-02

Pos logit contributions
br+0.004
ch+0.004
 Ellie+0.004
run+0.004
sc+0.004
Neg logit contributions
ich-0.010
field-0.009
ster-0.009
sters-0.008
ese-0.008
Token-
Token ID12
Feature activation+8.93e-02

Pos logit contributions
br+0.047
ch+0.045
 Ellie+0.042
 tricky+0.042
sc+0.042
Neg logit contributions
ich-0.083
field-0.075
ster-0.073
sters-0.072
ese-0.072
Tokenyear
Token ID1941
Feature activation+9.17e-02

Pos logit contributions
br+0.105
 tricky+0.094
 Ellie+0.093
ch+0.090
 complicated+0.089
Neg logit contributions
ich-0.188
illa-0.170
ster-0.168
anut-0.166
field-0.160
Token-
Token ID12
Feature activation+8.21e-02

Pos logit contributions
br+0.097
ch+0.096
 Ellie+0.095
sc+0.094
run+0.090
Neg logit contributions
ich-0.202
ster-0.184
field-0.180
sters-0.178
ese-0.173
Tokenold
Token ID727
Feature activation+8.58e-03

Pos logit contributions
br+0.123
 tricky+0.116
ch+0.114
 Ellie+0.112
sc+0.111
Neg logit contributions
ich-0.144
illa-0.126
ster-0.126
anut-0.124
field-0.122
Token boy
Token ID2933
Feature activation-1.71e-04

Pos logit contributions
br+0.015
ch+0.014
 tricky+0.013
sc+0.013
 Ellie+0.013
Neg logit contributions
ich-0.014
ster-0.012
field-0.012
sters-0.012
illa-0.011
Token who
Token ID508
Feature activation-1.44e-02

Pos logit contributions
ich+0.000
field+0.000
ster+0.000
illa+0.000
illas+0.000
Neg logit contributions
br-0.000
 tricky-0.000
ch-0.000
 loud-0.000
 complicated-0.000
Token little
Token ID1310
Feature activation-1.16e-02

Pos logit contributions
ich+0.008
field+0.007
ster+0.007
sters+0.006
actus+0.006
Neg logit contributions
br-0.005
run-0.005
sc-0.005
 tricky-0.005
ch-0.005
Token girl
Token ID2576
Feature activation+3.22e-03

Pos logit contributions
ich+0.019
field+0.016
ster+0.016
actus+0.015
sters+0.015
Neg logit contributions
run-0.020
br-0.019
sc-0.019
 tricky-0.018
 Ellie-0.018
Token who
Token ID508
Feature activation-2.01e-02

Pos logit contributions
br+0.004
ch+0.004
 Ellie+0.004
 tricky+0.004
run+0.004
Neg logit contributions
ich-0.006
field-0.006
ster-0.005
sters-0.005
ese-0.005
Token was
Token ID373
Feature activation+6.42e-03

Pos logit contributions
ich+0.033
field+0.030
ster+0.028
sters+0.027
ese+0.027
Neg logit contributions
br-0.031
run-0.030
ch-0.030
sc-0.029
 Ellie-0.029
Token three
Token ID1115
Feature activation+4.65e-02

Pos logit contributions
br+0.010
sc+0.009
run+0.009
ch+0.009
bench+0.008
Neg logit contributions
ich-0.012
field-0.010
opsy-0.010
illa-0.010
anut-0.010
Token-
Token ID12
Feature activation+8.36e-02

Pos logit contributions
br+0.052
 loud+0.052
 hard+0.050
 tricky+0.050
run+0.050
Neg logit contributions
ich-0.098
field-0.094
ster-0.085
sters-0.084
illas-0.083
Tokenyears
Token ID19002
Feature activation+7.48e-02

Pos logit contributions
br+0.117
 tricky+0.109
ch+0.106
 loud+0.105
 complicated+0.104
Neg logit contributions
ich-0.160
anut-0.138
illa-0.138
ster-0.135
sters-0.134
Token-
Token ID12
Feature activation+6.80e-02

Pos logit contributions
br+0.058
ch+0.055
 Ellie+0.052
 tricky+0.051
run+0.050
Neg logit contributions
ich-0.186
field-0.171
sters-0.165
ster-0.165
ese-0.162
Tokenold
Token ID727
Feature activation+6.89e-03

Pos logit contributions
br+0.080
ch+0.077
 tricky+0.075
sc+0.074
run+0.073
Neg logit contributions
ich-0.142
field-0.126
ster-0.123
sters-0.123
illa-0.121
Token.
Token ID13
Feature activation+2.81e-03

Pos logit contributions
br+0.007
ch+0.007
 Ellie+0.007
run+0.006
 tricky+0.006
Neg logit contributions
ich-0.015
field-0.015
sters-0.014
ster-0.013
ese-0.013
Token She
Token ID1375
Feature activation-8.30e-03

Pos logit contributions
br+0.003
ch+0.003
 tricky+0.003
run+0.003
sc+0.003
Neg logit contributions
ich-0.006
field-0.006
ster-0.005
sters-0.005
ese-0.005
Token
Token ID198
Feature activation-6.68e-03

Pos logit contributions
br+0.005
ch+0.005
 tricky+0.005
sc+0.005
 Ellie+0.005
Neg logit contributions
ich-0.006
field-0.006
illa-0.005
ster-0.005
opsy-0.005
TokenA
Token ID32
Feature activation+3.44e-02

Pos logit contributions
ich+0.011
field+0.009
ster+0.009
illa+0.008
ese+0.008
Neg logit contributions
br-0.011
run-0.010
 tricky-0.010
sc-0.010
 Ellie-0.010
Token 3
Token ID513
Feature activation+3.78e-02

Pos logit contributions
run+0.050
sc+0.048
bench+0.047
 aback+0.045
wn+0.045
Neg logit contributions
ich-0.064
ster-0.056
field-0.054
opsy-0.051
actus-0.051
Token-
Token ID12
Feature activation+9.97e-02

Pos logit contributions
br+0.050
ch+0.047
sc+0.046
 tricky+0.046
run+0.045
Neg logit contributions
ich-0.072
ster-0.064
ese-0.063
illa-0.062
field-0.062
Tokenyear
Token ID1941
Feature activation+9.35e-02

Pos logit contributions
br+0.112
 tricky+0.110
sc+0.106
 complicated+0.104
ch+0.103
Neg logit contributions
ich-0.200
illa-0.185
ster-0.177
anut-0.175
ese-0.169
Token-
Token ID12
Feature activation+8.74e-02

Pos logit contributions
sc+0.099
run+0.093
ch+0.093
br+0.092
 Ellie+0.089
Neg logit contributions
ich-0.207
ster-0.188
sters-0.182
field-0.180
ese-0.178
Tokenold
Token ID727
Feature activation+1.80e-02

Pos logit contributions
br+0.124
sc+0.123
ch+0.121
 tricky+0.120
run+0.117
Neg logit contributions
ich-0.152
illa-0.135
ster-0.131
anut-0.129
sters-0.127
Token toddler
Token ID30773
Feature activation-3.00e-03

Pos logit contributions
br+0.034
ch+0.032
sc+0.032
 tricky+0.032
 complicated+0.032
Neg logit contributions
ich-0.025
ster-0.023
sters-0.021
illa-0.020
ese-0.020
Token walked
Token ID6807
Feature activation+4.58e-03

Pos logit contributions
ich+0.005
field+0.004
ster+0.004
illas+0.004
sters+0.004
Neg logit contributions
br-0.005
 complicated-0.005
 tricky-0.005
sc-0.005
ch-0.005
Token up
Token ID510
Feature activation-5.57e-03

Pos logit contributions
br+0.007
run+0.006
ch+0.006
 complicated+0.006
 tricky+0.006
Neg logit contributions
ich-0.008
field-0.008
ster-0.007
illa-0.007
ese-0.006
Token to
Token ID284
Feature activation-3.50e-03

Pos logit contributions
field+0.009
ich+0.008
 porch+0.008
fields+0.007
illas+0.007
Neg logit contributions
br-0.009
ch-0.008
run-0.008
 unfair-0.008
lic-0.008
Token bird
Token ID6512
Feature activation+5.46e-04

Pos logit contributions
ich+0.032
field+0.030
sters+0.028
illa+0.028
anut+0.028
Neg logit contributions
br-0.020
ch-0.019
 loud-0.019
 hard-0.018
 Ellie-0.018
Token tried
Token ID3088
Feature activation-1.49e-02

Pos logit contributions
br+0.001
ch+0.001
sc+0.001
run+0.001
 tricky+0.001
Neg logit contributions
ich-0.001
field-0.001
ster-0.001
sters-0.001
ese-0.001
Token very
Token ID845
Feature activation-2.17e-02

Pos logit contributions
ich+0.029
field+0.027
sters+0.025
ster+0.025
illa+0.025
Neg logit contributions
br-0.019
sc-0.018
ch-0.018
run-0.016
lic-0.016
Token hard
Token ID1327
Feature activation+2.43e-02

Pos logit contributions
ich+0.056
field+0.053
ster+0.051
sters+0.051
opsy+0.050
Neg logit contributions
ch-0.013
br-0.012
sc-0.011
run-0.011
 Ellie-0.010
Token and
Token ID290
Feature activation+1.52e-02

Pos logit contributions
ch+0.045
br+0.044
lic+0.042
sc+0.042
Ell+0.041
Neg logit contributions
our-0.029
ich-0.028
field-0.026
sters-0.026
ster-0.023
Token finally
Token ID3443
Feature activation+2.83e-02

Pos logit contributions
igsaw+0.027
br+0.025
run+0.025
bench+0.024
sc+0.023
Neg logit contributions
field-0.020
ich-0.017
our-0.016
sters-0.016
 forget-0.016
Token said
Token ID531
Feature activation+2.02e-03

Pos logit contributions
ch+0.036
br+0.036
run+0.035
sc+0.035
 Ellie+0.034
Neg logit contributions
ich-0.053
field-0.050
ster-0.048
sters-0.048
illa-0.045
Token,
Token ID11
Feature activation+9.65e-03

Pos logit contributions
ch+0.003
sc+0.003
bench+0.003
 tricky+0.003
run+0.003
Neg logit contributions
ich-0.003
ster-0.003
sters-0.003
field-0.003
our-0.003
Token "
Token ID366
Feature activation+3.87e-03

Pos logit contributions
br+0.011
ch+0.010
 tricky+0.010
run+0.010
 Ellie+0.010
Neg logit contributions
ich-0.021
field-0.019
sters-0.018
ster-0.018
ese-0.017
Tokeneight
Token ID26022
Feature activation+1.21e-02

Pos logit contributions
br+0.006
run+0.006
 Ellie+0.006
 tricky+0.006
 loud+0.006
Neg logit contributions
ich-0.006
field-0.006
sters-0.005
ese-0.005
ster-0.005
Token three
Token ID1115
Feature activation+4.42e-02

Pos logit contributions
br+0.015
ch+0.015
 Ellie+0.014
sc+0.014
run+0.014
Neg logit contributions
ich-0.024
field-0.022
sters-0.022
ster-0.021
ese-0.021
Token landed
Token ID11406
Feature activation-1.22e-02

Pos logit contributions
ich+0.003
field+0.002
ster+0.002
sters+0.002
illas+0.002
Neg logit contributions
br-0.002
run-0.002
ch-0.002
 tricky-0.002
sc-0.002
Token in
Token ID287
Feature activation-1.38e-03

Pos logit contributions
ich+0.026
field+0.023
ster+0.022
sters+0.022
ese+0.021
Neg logit contributions
br-0.014
 Ellie-0.013
ch-0.013
run-0.013
 tricky-0.013
Token a
Token ID257
Feature activation-1.73e-04

Pos logit contributions
ich+0.002
field+0.002
 porch+0.002
illa+0.002
 wrist+0.002
Neg logit contributions
br-0.002
sc-0.002
run-0.002
ch-0.002
lic-0.002
Token messy
Token ID24097
Feature activation-5.29e-03

Pos logit contributions
ich+0.000
field+0.000
ster+0.000
sters+0.000
ese+0.000
Neg logit contributions
br-0.000
run-0.000
ch-0.000
 Ellie-0.000
sc-0.000
Token pile
Token ID14540
Feature activation-6.35e-04

Pos logit contributions
ich+0.008
field+0.007
ster+0.006
actus+0.006
sters+0.006
Neg logit contributions
br-0.010
run-0.009
sc-0.009
ch-0.009
 Ellie-0.009
Token of
Token ID286
Feature activation+2.79e-02

Pos logit contributions
ich+0.001
field+0.001
ster+0.001
our+0.001
sters+0.001
Neg logit contributions
br-0.001
 complicated-0.001
ch-0.001
 tricky-0.001
 Ellie-0.001
Token leaves
Token ID5667
Feature activation-7.11e-03

Pos logit contributions
br+0.041
sc+0.038
ch+0.038
run+0.038
 Ellie+0.037
Neg logit contributions
ich-0.050
field-0.044
sters-0.043
ster-0.042
ese-0.042
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
ich+0.011
field+0.011
sters+0.009
ster+0.009
our+0.009
Neg logit contributions
br-0.011
 Ellie-0.011
 complicated-0.011
ch-0.010
 tricky-0.010
Token
Token ID220
Feature activation-1.67e-02

Pos logit contributions
br+0.004
ch+0.004
 tricky+0.004
run+0.004
sc+0.004
Neg logit contributions
ich-0.007
field-0.006
ster-0.006
sters-0.006
actus-0.006
Token
Token ID198
Feature activation+4.41e-03

Pos logit contributions
ich+0.036
field+0.033
sters+0.031
ster+0.031
ese+0.030
Neg logit contributions
br-0.019
ch-0.018
 Ellie-0.018
run-0.017
 tricky-0.017
Token
Token ID198
Feature activation-5.32e-03

Pos logit contributions
br+0.005
ch+0.005
 Ellie+0.005
 tricky+0.005
run+0.005
Neg logit contributions
ich-0.009
field-0.008
ster-0.008
sters-0.008
ese-0.007
Token It
Token ID632
Feature activation+2.16e-02

Pos logit contributions
br+0.008
ch+0.007
 tricky+0.007
run+0.007
sc+0.007
Neg logit contributions
ich-0.015
field-0.014
sters-0.013
ster-0.013
ese-0.013
Token was
Token ID373
Feature activation+6.71e-03

Pos logit contributions
run+0.032
br+0.032
sc+0.030
ch+0.030
bench+0.030
Neg logit contributions
ich-0.035
field-0.034
ster-0.030
sters-0.030
opsy-0.030
Token so
Token ID523
Feature activation-1.24e-02

Pos logit contributions
sc+0.010
br+0.009
run+0.009
ch+0.009
smart+0.008
Neg logit contributions
ich-0.012
field-0.011
 porch-0.011
ble-0.010
fields-0.010
Token shiny
Token ID22441
Feature activation+2.70e-02

Pos logit contributions
ich+0.026
field+0.025
actus+0.023
opsy+0.023
sters+0.023
Neg logit contributions
sc-0.013
ch-0.013
br-0.013
run-0.013
bench-0.011
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
run+0.043
ch+0.041
br+0.040
sc+0.040
bench+0.039
Neg logit contributions
ich-0.043
sters-0.038
ster-0.038
field-0.037
actus-0.036
Token spark
Token ID9009
Feature activation+5.53e-02

Pos logit contributions
run+0.018
br+0.018
ch+0.017
sc+0.017
bench+0.016
Neg logit contributions
ich-0.023
field-0.023
sters-0.021
 porch-0.020
ster-0.020
Tokenly
Token ID306
Feature activation+1.33e-02

Pos logit contributions
run+0.073
bench+0.073
 tricky+0.071
 Ellie+0.070
 subt+0.067
Neg logit contributions
ich-0.093
sters-0.092
our-0.092
ster-0.090
actus-0.086
Token.
Token ID13
Feature activation+7.65e-03

Pos logit contributions
ch+0.022
run+0.021
bench+0.021
 Ellie+0.021
Ell+0.020
Neg logit contributions
ich-0.019
sters-0.018
ster-0.018
field-0.017
 porch-0.015
Token She
Token ID1375
Feature activation-2.37e-03

Pos logit contributions
br+0.010
ch+0.010
 tricky+0.009
sc+0.009
run+0.009
Neg logit contributions
ich-0.015
field-0.014
sters-0.013
ster-0.013
opsy-0.012
Token picked
Token ID6497
Feature activation-2.79e-02

Pos logit contributions
ich+0.004
field+0.004
sters+0.004
ster+0.004
ese+0.004
Neg logit contributions
br-0.003
ch-0.003
 Ellie-0.003
run-0.003
 tricky-0.003
Token it
Token ID340
Feature activation+1.08e-02

Pos logit contributions
ich+0.062
field+0.057
ster+0.055
sters+0.055
ese+0.053
Neg logit contributions
br-0.029
ch-0.027
run-0.026
 tricky-0.026
 Ellie-0.026
Token mistake
Token ID7457
Feature activation-1.81e-03

Pos logit contributions
ich+0.040
ster+0.036
field+0.035
sters+0.034
opsy+0.033
Neg logit contributions
br-0.032
sc-0.031
 Ellie-0.030
ch-0.030
run-0.030
Token.
Token ID13
Feature activation+7.11e-03

Pos logit contributions
ich+0.003
ster+0.002
sters+0.002
field+0.002
illa+0.002
Neg logit contributions
br-0.003
 tricky-0.003
ch-0.003
 complicated-0.003
Ell-0.003
Token It
Token ID632
Feature activation+1.77e-02

Pos logit contributions
br+0.010
ch+0.009
sc+0.009
 tricky+0.009
 hard+0.008
Neg logit contributions
ich-0.012
field-0.012
ster-0.011
sters-0.011
our-0.011
Token jumped
Token ID11687
Feature activation-1.34e-02

Pos logit contributions
igsaw+0.029
bench+0.029
br+0.029
 Ellie+0.028
run+0.027
Neg logit contributions
field-0.025
ich-0.024
ster-0.023
sters-0.021
opsy-0.021
Token too
Token ID1165
Feature activation-1.53e-02

Pos logit contributions
ich+0.019
ster+0.018
ble+0.017
sters+0.016
field+0.016
Neg logit contributions
bench-0.022
run-0.021
br-0.020
Ell-0.020
 Ellie-0.020
Token high
Token ID1029
Feature activation+2.68e-02

Pos logit contributions
ich+0.031
field+0.030
ster+0.027
opsy+0.027
sters+0.026
Neg logit contributions
br-0.017
ch-0.016
run-0.015
 Ellie-0.015
sc-0.015
Token and
Token ID290
Feature activation+4.13e-03

Pos logit contributions
br+0.043
run+0.042
ch+0.040
 tricky+0.038
lic+0.038
Neg logit contributions
ich-0.045
field-0.041
ster-0.039
sters-0.038
our-0.037
Token bumped
Token ID38034
Feature activation+1.65e-02

Pos logit contributions
br+0.006
bench+0.006
igsaw+0.006
heet+0.006
run+0.006
Neg logit contributions
field-0.006
ich-0.006
 forget-0.005
ster-0.005
sters-0.005
Token its
Token ID663
Feature activation-4.41e-03

Pos logit contributions
br+0.022
ch+0.020
sc+0.020
 Ellie+0.019
 tricky+0.019
Neg logit contributions
ich-0.034
field-0.029
ster-0.029
sters-0.029
opsy-0.028
Token head
Token ID1182
Feature activation-2.44e-02

Pos logit contributions
ich+0.006
field+0.006
ster+0.005
sters+0.005
ese+0.005
Neg logit contributions
br-0.008
sc-0.008
ch-0.008
 Ellie-0.008
run-0.008
Token on
Token ID319
Feature activation+1.32e-02

Pos logit contributions
ich+0.042
ster+0.037
field+0.037
our+0.035
sters+0.035
Neg logit contributions
 Ellie-0.035
igsaw-0.035
br-0.035
bench-0.034
 subt-0.033
Token played
Token ID2826
Feature activation-7.16e-03

Pos logit contributions
ich+0.052
field+0.048
sters+0.045
ster+0.045
ese+0.044
Neg logit contributions
br-0.030
ch-0.029
 Ellie-0.028
 tricky-0.028
run-0.027
Token with
Token ID351
Feature activation-5.39e-03

Pos logit contributions
ich+0.014
ster+0.012
field+0.012
sters+0.012
ese+0.011
Neg logit contributions
br-0.009
ch-0.009
run-0.009
sc-0.008
 tricky-0.008
Token the
Token ID262
Feature activation+5.19e-03

Pos logit contributions
ich+0.010
sters+0.008
field+0.008
ster+0.008
illas+0.008
Neg logit contributions
br-0.008
sc-0.007
ch-0.007
run-0.007
lic-0.007
Token doll
Token ID3654
Feature activation-1.09e-02

Pos logit contributions
run+0.008
ch+0.008
br+0.008
sc+0.007
 tricky+0.007
Neg logit contributions
ich-0.009
ster-0.008
field-0.008
sters-0.007
actus-0.007
Token all
Token ID477
Feature activation-2.81e-02

Pos logit contributions
ich+0.022
field+0.020
ster+0.019
sters+0.019
ese+0.018
Neg logit contributions
br-0.014
ch-0.013
run-0.012
 Ellie-0.012
 tricky-0.012
Token day
Token ID1110
Feature activation+3.44e-02

Pos logit contributions
ich+0.064
field+0.059
ster+0.056
sters+0.055
ese+0.054
Neg logit contributions
br-0.028
ch-0.026
run-0.025
 Ellie-0.025
 tricky-0.025
Token and
Token ID290
Feature activation-6.16e-03

Pos logit contributions
br+0.024
 hard+0.022
ch+0.022
 Ellie+0.022
 tricky+0.022
Neg logit contributions
ich-0.088
field-0.081
sters-0.079
ster-0.078
ese-0.078
Token all
Token ID477
Feature activation-1.28e-02

Pos logit contributions
ich+0.012
field+0.011
sters+0.011
ster+0.011
ese+0.010
Neg logit contributions
br-0.008
ch-0.008
run-0.007
 tricky-0.007
sc-0.007
Token night
Token ID1755
Feature activation+3.81e-02

Pos logit contributions
ich+0.026
field+0.025
ster+0.023
illa+0.023
illas+0.023
Neg logit contributions
br-0.014
ch-0.013
sc-0.013
run-0.013
 tricky-0.013
Token.
Token ID13
Feature activation-2.11e-03

Pos logit contributions
br+0.038
ch+0.037
run+0.035
 Ellie+0.035
 tricky+0.035
Neg logit contributions
ich-0.086
field-0.079
sters-0.075
ster-0.075
ese-0.073
Token
Token ID198
Feature activation+1.19e-03

Pos logit contributions
ich+0.005
field+0.005
ster+0.005
sters+0.005
illas+0.005
Neg logit contributions
 Ellie-0.002
br-0.002
ch-0.001
 much-0.001
 hard-0.001
Token wanted
Token ID2227
Feature activation-1.22e-02

Pos logit contributions
br+0.007
ch+0.007
sc+0.007
run+0.006
 Ellie+0.006
Neg logit contributions
ich-0.009
field-0.008
ster-0.008
sters-0.008
ese-0.007
Token to
Token ID284
Feature activation-8.04e-03

Pos logit contributions
ich+0.022
field+0.021
ese+0.020
sters+0.020
ster+0.020
Neg logit contributions
br-0.017
sc-0.016
run-0.015
lic-0.015
bench-0.014
Token catch
Token ID4929
Feature activation-2.05e-02

Pos logit contributions
ich+0.015
field+0.014
sters+0.013
ster+0.013
ese+0.012
Neg logit contributions
br-0.011
ch-0.011
 Ellie-0.011
 tricky-0.010
run-0.010
Token the
Token ID262
Feature activation-2.95e-03

Pos logit contributions
ich+0.040
sters+0.036
field+0.036
ster+0.036
ese+0.035
Neg logit contributions
br-0.027
run-0.024
sc-0.024
ch-0.023
bench-0.023
Token tomato
Token ID24240
Feature activation-2.17e-02

Pos logit contributions
ich+0.005
illas+0.005
field+0.005
illa+0.005
sters+0.004
Neg logit contributions
 hard-0.005
 loud-0.005
 tricky-0.004
 time-0.004
 fun-0.004
Token.
Token ID13
Feature activation+3.85e-03

Pos logit contributions
ich+0.039
field+0.038
sters+0.037
ster+0.036
ese+0.034
Neg logit contributions
br-0.031
ch-0.030
run-0.028
lic-0.028
sc-0.027
Token "
Token ID366
Feature activation-5.53e-03

Pos logit contributions
br+0.005
ch+0.004
 tricky+0.004
run+0.004
 Ellie+0.004
Neg logit contributions
ich-0.008
field-0.007
sters-0.007
ster-0.007
ese-0.007
TokenI
Token ID40
Feature activation-2.12e-02

Pos logit contributions
ich+0.009
field+0.008
ster+0.007
sters+0.007
ese+0.007
Neg logit contributions
br-0.009
 Ellie-0.009
ch-0.009
 tricky-0.009
run-0.008
Token'm
Token ID1101
Feature activation+2.21e-03

Pos logit contributions
ich+0.031
field+0.029
ster+0.027
our+0.026
opsy+0.026
Neg logit contributions
br-0.036
 Ellie-0.035
ch-0.033
sc-0.032
run-0.032
Token going
Token ID1016
Feature activation+1.31e-02

Pos logit contributions
br+0.004
sc+0.004
bench+0.003
ch+0.003
run+0.003
Neg logit contributions
ich-0.004
field-0.003
anut-0.003
sters-0.003
opsy-0.003
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
br+0.010
ch+0.009
run+0.008
 Ellie+0.008
 tricky+0.008
Neg logit contributions
ich-0.034
field-0.031
sters-0.030
ster-0.030
ese-0.029
Token get
Token ID651
Feature activation-1.70e-02

Pos logit contributions
ich+0.016
sters+0.014
field+0.014
illas+0.014
ese+0.014
Neg logit contributions
 loud-0.010
 Ellie-0.009
 hard-0.009
 tricky-0.009
 time-0.009
Token it
Token ID340
Feature activation+1.49e-02

Pos logit contributions
ich+0.034
field+0.031
sters+0.029
ster+0.029
anut+0.029
Neg logit contributions
br-0.022
sc-0.020
ch-0.020
 Ellie-0.019
lic-0.019
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
br+0.025
ch+0.024
lic+0.023
sc+0.022
run+0.022
Neg logit contributions
ich-0.022
field-0.021
ster-0.019
our-0.018
sters-0.018
Token she
Token ID673
Feature activation-1.07e-02

Pos logit contributions
ich+0.023
field+0.021
sters+0.020
ster+0.020
 porch+0.019
Neg logit contributions
br-0.022
ch-0.020
lic-0.019
sc-0.019
bench-0.018
Token decided
Token ID3066
Feature activation-6.55e-03

Pos logit contributions
ich+0.019
field+0.017
ster+0.016
sters+0.016
opsy+0.015
Neg logit contributions
br-0.017
ch-0.016
 Ellie-0.016
run-0.015
sc-0.015
Token to
Token ID284
Feature activation+7.01e-03

Pos logit contributions
ich+0.016
field+0.015
sters+0.015
ster+0.015
ese+0.014
Neg logit contributions
br-0.005
ch-0.005
sc-0.005
run-0.004
 tricky-0.004
Token take
Token ID1011
Feature activation-1.08e-02

Pos logit contributions
br+0.011
ch+0.011
sc+0.010
run+0.010
 tricky+0.010
Neg logit contributions
ich-0.012
field-0.011
ster-0.010
sters-0.010
ese-0.010
Token her
Token ID607
Feature activation+6.61e-03

Pos logit contributions
ich+0.018
field+0.016
ster+0.016
illa+0.016
sters+0.015
Neg logit contributions
sc-0.015
br-0.014
run-0.014
lic-0.014
 tricky-0.014
Token mom
Token ID1995
Feature activation+7.46e-03

Pos logit contributions
br+0.011
ch+0.010
lic+0.010
sc+0.010
 Ellie+0.010
Neg logit contributions
ich-0.011
field-0.010
ster-0.010
sters-0.010
illas-0.009
Token's
Token ID338
Feature activation+3.25e-02

Pos logit contributions
br+0.013
ch+0.012
 loud+0.012
 tricky+0.011
 Ellie+0.011
Neg logit contributions
ich-0.012
field-0.011
illa-0.010
ster-0.010
ese-0.010
Token flashlight
Token ID38371
Feature activation+2.09e-02

Pos logit contributions
br+0.067
lic+0.064
ch+0.063
 Ellie+0.060
Ell+0.057
Neg logit contributions
ich-0.040
ster-0.038
field-0.036
 porch-0.035
 wrist-0.034
Token was
Token ID373
Feature activation+2.94e-03

Pos logit contributions
ich+0.031
field+0.028
ster+0.027
illas+0.026
sters+0.025
Neg logit contributions
br-0.026
igsaw-0.025
 tricky-0.025
 Ellie-0.025
 complicated-0.024
Token going
Token ID1016
Feature activation+1.03e-02

Pos logit contributions
sc+0.005
br+0.005
igsaw+0.005
 Ellie+0.004
pill+0.004
Neg logit contributions
ich-0.005
field-0.004
ble-0.004
anut-0.004
ster-0.004
Token to
Token ID284
Feature activation-1.92e-03

Pos logit contributions
br+0.015
run+0.014
igsaw+0.013
lic+0.012
bul+0.012
Neg logit contributions
ich-0.019
field-0.016
our-0.015
ese-0.015
sters-0.015
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
ich+0.003
field+0.003
 porch+0.003
ese+0.003
asis+0.003
Neg logit contributions
sc-0.003
run-0.003
br-0.003
lic-0.003
ch-0.003
Token road
Token ID2975
Feature activation-9.18e-03

Pos logit contributions
ich+0.017
field+0.015
ster+0.014
sters+0.014
ese+0.013
Neg logit contributions
br-0.020
ch-0.019
run-0.019
 Ellie-0.019
 tricky-0.018
Token.
Token ID13
Feature activation+2.43e-03

Pos logit contributions
ich+0.014
field+0.013
illas+0.011
our+0.011
ster+0.011
Neg logit contributions
br-0.016
ch-0.015
 Ellie-0.015
 Bella-0.014
 Beth-0.014
Token
Token ID198
Feature activation+2.92e-03

Pos logit contributions
br+0.003
sc+0.003
ch+0.003
 tricky+0.003
 Ellie+0.003
Neg logit contributions
ich-0.004
ster-0.004
field-0.004
sters-0.004
mm-0.004
Token
Token ID198
Feature activation-3.09e-03

Pos logit contributions
br+0.004
ch+0.004
sc+0.004
 tricky+0.004
 Ellie+0.004
Neg logit contributions
ich-0.005
field-0.005
ster-0.004
sters-0.004
illa-0.004
TokenThe
Token ID464
Feature activation+4.90e-03

Pos logit contributions
ich+0.006
field+0.005
ster+0.005
sters+0.005
ese+0.005
Neg logit contributions
br-0.005
ch-0.004
 Ellie-0.004
run-0.004
 tricky-0.004
Token road
Token ID2975
Feature activation-4.06e-03

Pos logit contributions
br+0.008
run+0.008
 Ellie+0.008
ch+0.008
sc+0.008
Neg logit contributions
ich-0.008
field-0.007
ster-0.007
ese-0.006
sters-0.006
Token was
Token ID373
Feature activation+3.68e-03

Pos logit contributions
ich+0.007
field+0.006
ster+0.006
illas+0.006
sters+0.006
Neg logit contributions
br-0.006
 Ellie-0.006
run-0.006
ch-0.006
igsaw-0.005
Token the
Token ID262
Feature activation+7.61e-03

Pos logit contributions
ch+0.058
br+0.057
sc+0.055
lic+0.052
bench+0.050
Neg logit contributions
ich-0.070
field-0.067
illas-0.061
illa-0.060
ese-0.055
Token mystery
Token ID10715
Feature activation+1.42e-02

Pos logit contributions
run+0.012
ch+0.010
lic+0.010
sc+0.010
br+0.010
Neg logit contributions
ich-0.013
field-0.013
 porch-0.012
asis-0.011
 doorstep-0.011
Token together
Token ID1978
Feature activation+1.43e-02

Pos logit contributions
ch+0.021
 Ellie+0.020
run+0.019
sc+0.019
br+0.019
Neg logit contributions
ich-0.024
field-0.024
sters-0.021
ster-0.021
illas-0.020
Token.
Token ID13
Feature activation+3.37e-03

Pos logit contributions
ch+0.022
br+0.022
sc+0.021
 Ellie+0.020
 tricky+0.020
Neg logit contributions
ich-0.024
field-0.023
sters-0.021
illas-0.021
ster-0.020
Token<|endoftext|>
Token ID50256
Feature activation-1.06e-02

Pos logit contributions
br+0.004
ch+0.004
 tricky+0.004
sc+0.004
 Ellie+0.004
Neg logit contributions
ich-0.007
field-0.006
ster-0.006
sters-0.006
illas-0.006
TokenOnce
Token ID7454
Feature activation-4.46e-03

Pos logit contributions
ich+0.015
field+0.014
ster+0.012
illas+0.012
sters+0.012
Neg logit contributions
br-0.019
 tricky-0.019
ch-0.018
run-0.018
 Ellie-0.018
Token upon
Token ID2402
Feature activation-4.77e-03

Pos logit contributions
ich+0.005
field+0.004
ster+0.004
sters+0.004
ese+0.004
Neg logit contributions
br-0.009
ch-0.009
 Ellie-0.009
 tricky-0.009
sc-0.009
Token a
Token ID257
Feature activation-5.92e-03

Pos logit contributions
ich+0.012
field+0.011
ster+0.010
sters+0.010
ese+0.010
Neg logit contributions
br-0.004
ch-0.003
run-0.003
sc-0.003
 tricky-0.003
Token time
Token ID640
Feature activation+2.39e-02

Pos logit contributions
ich+0.015
field+0.015
opsy+0.014
ese+0.014
anut+0.014
Neg logit contributions
br-0.003
 Ellie-0.003
run-0.003
ch-0.003
sc-0.003
Token,
Token ID11
Feature activation-4.47e-03

Pos logit contributions
ch+0.039
sc+0.039
bench+0.039
br+0.038
 tricky+0.037
Neg logit contributions
ich-0.036
field-0.030
ster-0.030
sters-0.029
illas-0.029
Token there
Token ID612
Feature activation+1.74e-02

Pos logit contributions
ich+0.008
field+0.007
ster+0.007
opsy+0.007
illas+0.006
Neg logit contributions
br-0.007
ch-0.006
sc-0.006
bench-0.006
 tricky-0.006
TokenThanks
Token ID9690
Feature activation-7.05e-03

Pos logit contributions
br+0.002
ch+0.002
 Ellie+0.002
 tricky+0.002
run+0.002
Neg logit contributions
ich-0.002
field-0.002
ster-0.001
sters-0.001
ese-0.001
Token,
Token ID11
Feature activation+4.89e-03

Pos logit contributions
ich+0.014
ster+0.012
field+0.012
sters+0.012
illa+0.011
Neg logit contributions
br-0.009
sc-0.009
ch-0.009
 tricky-0.009
run-0.008
Token mom
Token ID1995
Feature activation-2.99e-03

Pos logit contributions
br+0.008
sc+0.008
ch+0.008
 tricky+0.008
run+0.008
Neg logit contributions
ich-0.008
field-0.007
ster-0.007
sters-0.006
illa-0.006
Tokenmy
Token ID1820
Feature activation+2.01e-02

Pos logit contributions
ich+0.006
field+0.005
sters+0.005
ster+0.005
illas+0.005
Neg logit contributions
br-0.004
ch-0.004
 Ellie-0.004
run-0.004
 tricky-0.004
Token!"
Token ID2474
Feature activation-1.20e-03

Pos logit contributions
br+0.030
ch+0.030
 Ellie+0.029
 tricky+0.029
run+0.028
Neg logit contributions
ich-0.035
field-0.032
sters-0.030
ster-0.029
ese-0.028
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
ich+0.002
field+0.002
sters+0.002
ster+0.002
our+0.002
Neg logit contributions
br-0.002
ch-0.002
 tricky-0.002
sc-0.002
 complicated-0.001
Token
Token ID198
Feature activation+6.08e-03

Pos logit contributions
ch+0.015
 Ellie+0.015
br+0.015
run+0.014
 loud+0.014
Neg logit contributions
ich-0.038
field-0.036
sters-0.035
ster-0.034
illas-0.033
Token"
Token ID1
Feature activation-1.27e-02

Pos logit contributions
ch+0.008
br+0.008
 Ellie+0.007
run+0.007
 loud+0.007
Neg logit contributions
ich-0.012
field-0.011
sters-0.011
ster-0.011
illas-0.011
TokenYou
Token ID1639
Feature activation-1.39e-03

Pos logit contributions
ich+0.021
field+0.018
ster+0.017
sters+0.017
ese+0.016
Neg logit contributions
br-0.021
ch-0.020
run-0.020
 tricky-0.020
 Ellie-0.020
Token're
Token ID821
Feature activation-1.26e-02

Pos logit contributions
ich+0.002
field+0.002
illas+0.002
sters+0.002
ster+0.002
Neg logit contributions
br-0.003
ch-0.002
 hard-0.002
 Ellie-0.002
 loud-0.002
Token welcome
Token ID7062
Feature activation-4.57e-03

Pos logit contributions
ich+0.029
field+0.028
sters+0.026
actus+0.026
illas+0.026
Neg logit contributions
 Ellie-0.013
 hard-0.012
 much-0.012
br-0.012
 tricky-0.012
Token and
Token ID290
Feature activation+2.34e-02

Pos logit contributions
ich+0.042
field+0.038
sters+0.036
ster+0.036
illas+0.035
Neg logit contributions
br-0.024
ch-0.023
 Ellie-0.022
 tricky-0.022
run-0.022
Token Kim
Token ID6502
Feature activation+1.43e-02

Pos logit contributions
br+0.031
sc+0.031
 tricky+0.027
ch+0.026
 complicated+0.026
Neg logit contributions
ich-0.048
field-0.039
illas-0.038
 porch-0.038
ese-0.038
Token said
Token ID531
Feature activation-4.91e-03

Pos logit contributions
 hard+0.016
 loud+0.015
br+0.015
ch+0.015
 Ellie+0.015
Neg logit contributions
ich-0.032
field-0.028
ster-0.027
sters-0.027
actus-0.025
Token they
Token ID484
Feature activation-4.45e-03

Pos logit contributions
ich+0.007
sters+0.007
ster+0.007
mm+0.007
field+0.006
Neg logit contributions
sc-0.008
ch-0.008
 tricky-0.008
run-0.007
br-0.007
Token were
Token ID547
Feature activation+3.76e-03

Pos logit contributions
ich+0.010
field+0.009
ster+0.008
sters+0.008
ese+0.008
Neg logit contributions
br-0.005
 hard-0.005
ch-0.005
 loud-0.005
 Ellie-0.005
Token sorry
Token ID7926
Feature activation-2.65e-02

Pos logit contributions
br+0.004
ch+0.004
 Ellie+0.004
 tricky+0.004
run+0.004
Neg logit contributions
ich-0.008
field-0.008
sters-0.007
ster-0.007
ese-0.007
Token and
Token ID290
Feature activation+2.85e-03

Pos logit contributions
ich+0.044
sters+0.039
ster+0.038
our+0.037
field+0.036
Neg logit contributions
br-0.041
sc-0.040
bench-0.039
 tricky-0.038
ch-0.038
Token hugged
Token ID41130
Feature activation-1.15e-02

Pos logit contributions
br+0.004
ch+0.004
sc+0.004
run+0.004
 tricky+0.004
Neg logit contributions
ich-0.005
field-0.005
ster-0.004
sters-0.004
ese-0.004
Token Dad
Token ID17415
Feature activation-5.89e-03

Pos logit contributions
ich+0.020
field+0.018
ese+0.017
illa+0.017
ster+0.017
Neg logit contributions
br-0.018
 tricky-0.016
sc-0.016
ch-0.016
run-0.016
Token.
Token ID13
Feature activation-3.51e-04

Pos logit contributions
ich+0.012
field+0.011
sters+0.011
ster+0.011
ese+0.011
Neg logit contributions
br-0.007
ch-0.006
 Ellie-0.006
 tricky-0.006
run-0.006
Token They
Token ID1119
Feature activation-2.64e-02

Pos logit contributions
ich+0.001
field+0.001
sters+0.001
ster+0.001
ese+0.001
Neg logit contributions
br-0.000
ch-0.000
 Ellie-0.000
 tricky-0.000
run-0.000
Token out
Token ID503
Feature activation-4.18e-03

Pos logit contributions
ich+0.003
ster+0.003
field+0.003
our+0.003
ble+0.003
Neg logit contributions
igsaw-0.003
bench-0.003
run-0.003
br-0.003
 subt-0.003
Token of
Token ID286
Feature activation+2.78e-02

Pos logit contributions
ich+0.007
ster+0.006
field+0.006
ble+0.006
flower+0.006
Neg logit contributions
bench-0.006
igsaw-0.006
br-0.006
 subt-0.005
 unfair-0.005
Token the
Token ID262
Feature activation-1.28e-02

Pos logit contributions
br+0.041
sc+0.038
lic+0.038
run+0.037
ch+0.035
Neg logit contributions
field-0.048
ich-0.046
 porch-0.042
fields-0.040
 flesh-0.039
Token ground
Token ID2323
Feature activation-1.22e-02

Pos logit contributions
ich+0.020
field+0.018
ster+0.016
sters+0.016
ese+0.015
Neg logit contributions
br-0.023
run-0.022
ch-0.022
 Ellie-0.021
 tricky-0.021
Token!
Token ID0
Feature activation+4.98e-03

Pos logit contributions
ich+0.023
field+0.021
ster+0.019
sters+0.019
our+0.018
Neg logit contributions
br-0.017
ch-0.016
 Ellie-0.016
run-0.016
 tricky-0.015
Token Bob
Token ID5811
Feature activation-2.65e-02

Pos logit contributions
br+0.006
ch+0.006
 tricky+0.006
sc+0.006
run+0.006
Neg logit contributions
ich-0.010
field-0.009
sters-0.009
ster-0.009
ese-0.008
Token was
Token ID373
Feature activation+4.46e-03

Pos logit contributions
ich+0.052
field+0.048
ster+0.046
sters+0.045
ese+0.044
Neg logit contributions
br-0.035
ch-0.033
 Ellie-0.032
sc-0.032
 tricky-0.032
Token so
Token ID523
Feature activation-2.20e-02

Pos logit contributions
br+0.007
sc+0.006
ch+0.006
igsaw+0.006
 Ellie+0.006
Neg logit contributions
ich-0.009
field-0.008
ster-0.007
sters-0.007
ese-0.007
Token curious
Token ID11040
Feature activation+4.88e-03

Pos logit contributions
ich+0.044
field+0.040
ster+0.039
sters+0.039
opsy+0.037
Neg logit contributions
br-0.029
ch-0.028
sc-0.027
 Ellie-0.026
run-0.025
Token that
Token ID326
Feature activation-8.95e-03

Pos logit contributions
sc+0.009
br+0.009
bench+0.009
wn+0.009
Ell+0.009
Neg logit contributions
ich-0.006
sters-0.005
ster-0.005
field-0.005
actus-0.004
Token he
Token ID339
Feature activation-9.02e-03

Pos logit contributions
ich+0.019
field+0.018
ster+0.017
sters+0.017
illa+0.016
Neg logit contributions
br-0.011
ch-0.010
sc-0.010
run-0.009
bench-0.009
Token saw
Token ID2497
Feature activation+6.82e-03

Pos logit contributions
ich+0.003
field+0.003
sters+0.003
ster+0.003
ese+0.003
Neg logit contributions
br-0.003
ch-0.003
 Ellie-0.003
 tricky-0.003
run-0.003
Token something
Token ID1223
Feature activation+2.05e-02

Pos logit contributions
sc+0.011
br+0.011
ch+0.011
run+0.010
bench+0.009
Neg logit contributions
ich-0.012
sters-0.011
field-0.011
 porch-0.011
illa-0.011
Token that
Token ID326
Feature activation-1.12e-02

Pos logit contributions
run+0.026
bench+0.025
ch+0.025
sc+0.024
br+0.024
Neg logit contributions
ich-0.037
sters-0.035
field-0.034
ster-0.034
actus-0.033
Token made
Token ID925
Feature activation-3.24e-03

Pos logit contributions
field+0.016
ich+0.015
ster+0.013
 porch+0.013
sters+0.013
Neg logit contributions
run-0.020
br-0.018
igsaw-0.018
ch-0.018
smart-0.017
Token her
Token ID607
Feature activation-2.14e-02

Pos logit contributions
illa+0.004
ich+0.004
 porch+0.004
field+0.004
ese+0.004
Neg logit contributions
br-0.005
sc-0.005
run-0.005
Ell-0.005
B-0.005
Token happy
Token ID3772
Feature activation-2.29e-02

Pos logit contributions
ich+0.036
field+0.033
ster+0.030
sters+0.030
ese+0.029
Neg logit contributions
br-0.035
ch-0.033
sc-0.032
 Ellie-0.032
 tricky-0.031
Token:
Token ID25
Feature activation-2.23e-03

Pos logit contributions
ich+0.038
field+0.035
sters+0.034
ster+0.034
 porch+0.032
Neg logit contributions
br-0.037
ch-0.037
sc-0.036
run-0.034
 tricky-0.033
Token cel
Token ID18725
Feature activation-8.93e-03

Pos logit contributions
ich+0.004
 porch+0.004
sters+0.004
field+0.004
ster+0.003
Neg logit contributions
sc-0.003
br-0.003
bench-0.003
ch-0.003
run-0.003
Tokenery
Token ID1924
Feature activation-1.24e-02

Pos logit contributions
ich+0.014
field+0.012
ster+0.012
sters+0.012
ese+0.011
Neg logit contributions
 Ellie-0.015
run-0.015
 tricky-0.015
br-0.015
ch-0.015
Token!
Token ID0
Feature activation+5.39e-03

Pos logit contributions
ich+0.022
field+0.020
ster+0.019
sters+0.019
ese+0.018
Neg logit contributions
br-0.018
ch-0.018
 Ellie-0.018
run-0.018
sc-0.017
Token She
Token ID1375
Feature activation-4.31e-03

Pos logit contributions
br+0.005
ch+0.005
 Ellie+0.005
run+0.004
 tricky+0.004
Neg logit contributions
ich-0.013
field-0.012
sters-0.011
ster-0.011
ese-0.011
Token.
Token ID13
Feature activation+1.29e-03

Pos logit contributions
run+0.020
 Beth+0.020
ch+0.020
 tricky+0.020
 Ellie+0.020
Neg logit contributions
ich-0.019
field-0.018
ster-0.017
sters-0.017
illas-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.23e-02

Pos logit contributions
br+0.001
ch+0.001
 Ellie+0.001
 tricky+0.001
sc+0.001
Neg logit contributions
ich-0.003
field-0.003
sters-0.003
ster-0.003
ese-0.002
TokenSam
Token ID16305
Feature activation-9.56e-03

Pos logit contributions
ich+0.016
field+0.014
ster+0.013
sters+0.013
ese+0.012
Neg logit contributions
br-0.024
ch-0.023
 Ellie-0.023
 tricky-0.023
run-0.023
Token was
Token ID373
Feature activation+5.79e-04

Pos logit contributions
ich+0.021
field+0.018
sters+0.018
ster+0.017
ese+0.017
Neg logit contributions
br-0.011
ch-0.011
 hard-0.010
 Ellie-0.010
 tricky-0.010
Token a
Token ID257
Feature activation-6.21e-03

Pos logit contributions
br+0.001
ch+0.001
sc+0.001
run+0.001
 Ellie+0.001
Neg logit contributions
ich-0.001
field-0.001
ster-0.001
sters-0.001
opsy-0.001
Token little
Token ID1310
Feature activation-2.36e-02

Pos logit contributions
ich+0.014
field+0.014
sters+0.014
ese+0.013
illas+0.013
Neg logit contributions
 loud-0.005
br-0.005
ch-0.005
 hard-0.005
 tricky-0.005
Token boy
Token ID2933
Feature activation-1.37e-02

Pos logit contributions
ich+0.043
field+0.038
ster+0.037
sters+0.036
our+0.034
Neg logit contributions
br-0.036
run-0.034
sc-0.034
ch-0.033
 Ellie-0.033
Token who
Token ID508
Feature activation-2.82e-02

Pos logit contributions
ich+0.027
field+0.025
ster+0.024
sters+0.024
ese+0.023
Neg logit contributions
br-0.017
ch-0.016
 Ellie-0.016
 tricky-0.016
run-0.016
Token liked
Token ID8288
Feature activation+3.88e-03

Pos logit contributions
ich+0.036
field+0.034
ster+0.031
actus+0.030
asis+0.030
Neg logit contributions
run-0.053
br-0.052
ch-0.051
sc-0.051
 Ellie-0.050
Token to
Token ID284
Feature activation-1.56e-02

Pos logit contributions
br+0.005
sc+0.005
lic+0.005
ch+0.005
run+0.005
Neg logit contributions
ich-0.008
field-0.007
 porch-0.007
sters-0.006
ster-0.006
Token imagine
Token ID5967
Feature activation+1.01e-02

Pos logit contributions
field+0.020
ich+0.020
our+0.018
 wrist+0.018
 porch+0.018
Neg logit contributions
sc-0.029
run-0.028
ch-0.028
br-0.028
lic-0.026
Token hot
Token ID3024
Feature activation-2.42e-02

Pos logit contributions
ich+0.007
field+0.007
ster+0.006
sters+0.006
ese+0.006
Neg logit contributions
br-0.010
 Ellie-0.010
run-0.010
ch-0.010
 tricky-0.010
Token soup
Token ID17141
Feature activation-4.44e-03

Pos logit contributions
ich+0.040
field+0.036
ster+0.034
sters+0.034
ese+0.032
Neg logit contributions
br-0.039
ch-0.038
 Ellie-0.037
run-0.037
 tricky-0.036
Token.
Token ID13
Feature activation+5.12e-03

Pos logit contributions
ich+0.008
field+0.008
ster+0.008
sters+0.008
ese+0.007
Neg logit contributions
br-0.006
ch-0.006
 Ellie-0.006
run-0.006
 tricky-0.005
Token He
Token ID679
Feature activation-8.37e-03

Pos logit contributions
br+0.006
ch+0.006
 tricky+0.006
sc+0.006
 hard+0.005
Neg logit contributions
ich-0.009
field-0.009
sters-0.008
mm-0.008
ster-0.008
Token was
Token ID373
Feature activation+1.52e-03

Pos logit contributions
ich+0.014
field+0.013
ster+0.012
sters+0.012
anut+0.012
Neg logit contributions
br-0.013
 Ellie-0.013
ch-0.012
run-0.012
sc-0.012
Token excited
Token ID6568
Feature activation-2.07e-02

Pos logit contributions
br+0.002
sc+0.002
ch+0.002
 Ellie+0.002
run+0.002
Neg logit contributions
ich-0.003
field-0.003
anut-0.003
sters-0.002
ster-0.002
Token to
Token ID284
Feature activation-6.92e-03

Pos logit contributions
ich+0.038
field+0.035
sters+0.034
ster+0.034
ese+0.033
Neg logit contributions
br-0.030
ch-0.028
sc-0.027
run-0.027
 tricky-0.026
Token see
Token ID766
Feature activation+7.37e-05

Pos logit contributions
ich+0.014
field+0.012
sters+0.012
ster+0.012
illas+0.011
Neg logit contributions
br-0.009
ch-0.009
 loud-0.009
 Ellie-0.009
 hard-0.009
Token what
Token ID644
Feature activation-2.58e-02

Pos logit contributions
br+0.000
sc+0.000
ch+0.000
bench+0.000
 tricky+0.000
Neg logit contributions
ich-0.000
sters-0.000
illa-0.000
field-0.000
ster-0.000
Token it
Token ID340
Feature activation+1.17e-02

Pos logit contributions
ich+0.038
field+0.037
sters+0.035
ster+0.034
illas+0.032
Neg logit contributions
run-0.040
sc-0.040
ch-0.039
br-0.038
wn-0.037
Token would
Token ID561
Feature activation-1.68e-02

Pos logit contributions
br+0.015
ch+0.014
run+0.014
 Ellie+0.014
sc+0.014
Neg logit contributions
ich-0.023
field-0.021
sters-0.020
ster-0.020
ese-0.019