TOP ACTIVATIONS
MAX = 0.137

new words every day . Conf used is just a word
me when I said it wasn 't safe ? You know
go and drink something that didn 't look right ? You
you . I know you didn 't mean to break it
we 're sorry . We didn 't mean to make you
cat . I know you didn 't mean to break my
It 's OK . You didn 't scare me . You
." " He didn 't cause us any trouble
It will be easy and exc ite ." Ben
. It is easy and exc ite . And we are
to me . Friends who don 't lie to me .
wanted some apples . We didn 't know they were yours
" No , you didn 't , Tom ! I
sorry , Lily . Tom didn 't know how to play
sorry , Sara . I didn 't see your nose .
you . I know you didn 't mean to rip the
" No , I didn 't take them . Maybe
an accident . The seed didn 't mean to bite you
sorry Nem o . I didn 't know the waves would
Sorry , sorry , we didn 't see the wave ,"

MIN ACTIVATIONS
MIN = -0.088

she finally stepped into the chamber , she saw that it
they walked further into the chamber , they heard a voice
As she walked through the cabin , she kept being clumsy
was outside walking around the garden when he saw something exciting
girl was walking down the hallway . She was feeling a
As he walked into the classroom , he heard the teacher
girl was walking around the garden with her big brother .
. Inside the lab was very cold so Sandy
Jimmy was walking down the hall when he heard a strange
. She ran around the room and out the door .
Lu cy walked around the attic and noticed something strange .
little girl ran around the room , looking in all the
ina was walking through the garden with her mom . She
soon as she entered the room and started to strike the
Charlotte was walking around the garden , she noticed a big
As she went upstairs , Emmy noticed something strange
big classroom . Inside the classroom were many children and a
She marched around the room , showing her parents her
When he walked into the classroom , he went straight to
When Lisa looked around the room , she saw that the

INTERVAL 0.080 - 0.137
CONTAINS 0.621%

cool !" " Me ow !" the cat says
Ben said , " I don 't know .
her mom my said it wasn 't worth anything . Lily
stop !" Lily said . " That is my book .
smiled and said , " Don 't worry , Ros ie

INTERVAL 0.024 - 0.080
CONTAINS 18.482%

just want to see your pretty wings ."
What is this place mom my ?" Tommy said .
learned that sometimes it 's okay to be scared , but
the bus . Let 's go and see ."
, but no one seemed to value him . One day

INTERVAL -0.032 - 0.024
CONTAINS 71.214%

time , there lived a mighty mouse . He had a
They went when the sun was shining and the sky was
and always played hockey together no matter what . They were
flew home , happy to have soared in the dark sky
� I am the King of the Jungle . I order

INTERVAL -0.088 - -0.032
CONTAINS 9.683%

Jim was so happy that his wife had made him
One day , they saw some bright green things poking out
animals . Suddenly , Lily saw a candy wrapper on the
the end of the day , their teacher was very proud
another page in the book and the girl looked at it
Token new
Token ID649
Feature activation+4.69e-02

Pos logit contributions
uch+0.115
ases+0.114
 Deal+0.111
 myself+0.110
rr+0.110
Neg logit contributions
 he-0.055
 surrounded-0.053
 full-0.048
 which-0.047
 filled-0.042
Token words
Token ID2456
Feature activation+4.04e-02

Pos logit contributions
uch+0.107
 Deal+0.103
 myself+0.102
ases+0.101
 Go+0.100
Neg logit contributions
 he-0.055
 surrounded-0.053
 fields-0.049
 which-0.048
 woods-0.048
Token every
Token ID790
Feature activation+4.24e-02

Pos logit contributions
ases+0.099
 Deal+0.098
 myself+0.098
uch+0.097
rr+0.096
Neg logit contributions
 he-0.043
 surrounded-0.042
 full-0.037
 which-0.034
 filled-0.031
Token day
Token ID1110
Feature activation+2.12e-02

Pos logit contributions
 myself+0.109
ases+0.105
 Deal+0.103
 yourselves+0.103
 Go+0.103
Neg logit contributions
 surrounded-0.043
 he-0.038
 full-0.035
 distance-0.033
 spotted-0.032
Token.
Token ID13
Feature activation+6.42e-02

Pos logit contributions
 myself+0.051
ases+0.050
 Deal+0.050
uch+0.049
rr+0.049
Neg logit contributions
 surrounded-0.023
 he-0.022
 full-0.020
 which-0.018
 filled-0.017
Token Conf
Token ID7326
Feature activation+1.37e-01

Pos logit contributions
uch+0.103
ases+0.101
 myself+0.100
rr+0.096
 ourselves+0.093
Neg logit contributions
 surrounded-0.118
 he-0.113
 full-0.108
 woods-0.102
 clearing-0.101
Tokenused
Token ID1484
Feature activation+3.52e-02

Pos logit contributions
 myself+0.222
vern+0.220
 Deal+0.212
ases+0.211
 losers+0.210
Neg logit contributions
 surrounded-0.222
 he-0.193
 full-0.178
 which-0.177
 drove-0.174
Token is
Token ID318
Feature activation+2.85e-02

Pos logit contributions
 myself+0.078
ases+0.078
 Deal+0.076
uch+0.075
 Go+0.075
Neg logit contributions
 surrounded-0.044
 he-0.042
 full-0.039
 woods-0.036
 which-0.035
Token just
Token ID655
Feature activation+1.59e-02

Pos logit contributions
 myself+0.062
ases+0.060
 Go+0.059
rr+0.058
 yourselves+0.058
Neg logit contributions
 surrounded-0.039
 he-0.036
 full-0.034
 spotted-0.031
 clearing-0.031
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
 myself+0.036
 yourself+0.033
 yourselves+0.032
 ourselves+0.032
 Go+0.032
Neg logit contributions
 surrounded-0.021
 he-0.019
 distance-0.019
 spotted-0.019
 clearing-0.018
Token word
Token ID1573
Feature activation+3.07e-02

Pos logit contributions
 myself+0.022
ases+0.021
 Go+0.021
rr+0.021
 yourself+0.020
Neg logit contributions
 surrounded-0.019
 he-0.018
 full-0.017
 spotted-0.016
 distance-0.016
Token me
Token ID502
Feature activation+4.33e-02

Pos logit contributions
ases+0.065
uch+0.065
 Deal+0.064
vern+0.060
rr+0.060
Neg logit contributions
 he-0.075
 surrounded-0.069
 which-0.067
 fields-0.065
 woods-0.064
Token when
Token ID618
Feature activation+4.26e-02

Pos logit contributions
 myself+0.056
ases+0.055
 Go+0.053
 Deal+0.052
rr+0.051
Neg logit contributions
 surrounded-0.096
 he-0.094
 full-0.089
 which-0.085
 spotted-0.085
Token I
Token ID314
Feature activation+3.95e-02

Pos logit contributions
 myself+0.053
ases+0.052
rr+0.048
 Go+0.047
 Deal+0.047
Neg logit contributions
 surrounded-0.099
 he-0.096
 full-0.091
 woods-0.087
 spotted-0.087
Token said
Token ID531
Feature activation+4.05e-02

Pos logit contributions
 myself+0.097
ases+0.092
 yourselves+0.091
 Go+0.090
 ourselves+0.090
Neg logit contributions
 surrounded-0.047
 he-0.042
 full-0.039
 spotted-0.037
 curled-0.036
Token it
Token ID340
Feature activation+3.55e-02

Pos logit contributions
 myself+0.077
ases+0.076
rr+0.073
 Go+0.072
 Deal+0.071
Neg logit contributions
 surrounded-0.066
 he-0.064
 full-0.059
 spotted-0.055
 which-0.054
Token wasn
Token ID2492
Feature activation+1.30e-01

Pos logit contributions
 myself+0.074
ases+0.072
 yourselves+0.069
 Go+0.069
rr+0.069
Neg logit contributions
 surrounded-0.053
 he-0.049
 full-0.046
 spotted-0.043
 curled-0.042
Token't
Token ID470
Feature activation+7.04e-02

Pos logit contributions
ases+0.200
 yourselves+0.179
 ourselves+0.174
ymes+0.172
rr+0.172
Neg logit contributions
 he-0.294
 surrounded-0.266
 full-0.256
 she-0.240
 filled-0.237
Token safe
Token ID3338
Feature activation+7.56e-02

Pos logit contributions
 myself+0.143
ases+0.132
 Go+0.132
 yourself+0.131
rr+0.131
Neg logit contributions
 surrounded-0.109
 he-0.099
 full-0.093
 distance-0.091
 clearing-0.091
Token?
Token ID30
Feature activation+6.33e-02

Pos logit contributions
 myself+0.158
ases+0.153
 Go+0.150
rr+0.147
 Deal+0.147
Neg logit contributions
 surrounded-0.112
 he-0.102
 full-0.098
 distance-0.091
 spotted-0.090
Token You
Token ID921
Feature activation+4.70e-02

Pos logit contributions
 myself+0.119
ases+0.119
rr+0.113
uch+0.113
 ourselves+0.112
Neg logit contributions
 surrounded-0.103
 he-0.100
 full-0.091
 clearing-0.085
 which-0.085
Token know
Token ID760
Feature activation+2.26e-02

Pos logit contributions
 myself+0.099
ases+0.096
 Go+0.093
 yourselves+0.093
rr+0.091
Neg logit contributions
 surrounded-0.071
 he-0.066
 full-0.061
 spotted-0.058
 filled-0.057
Token go
Token ID467
Feature activation+4.32e-02

Pos logit contributions
ases+0.145
 myself+0.144
 Go+0.139
rr+0.138
 yourselves+0.136
Neg logit contributions
 surrounded-0.117
 full-0.107
 he-0.106
 filled-0.099
 curled-0.097
Token and
Token ID290
Feature activation+3.22e-02

Pos logit contributions
 myself+0.095
ases+0.091
 Go+0.090
 yourself+0.088
 yourselves+0.088
Neg logit contributions
 surrounded-0.059
 he-0.053
 full-0.050
 wooden-0.048
 distance-0.048
Token drink
Token ID4144
Feature activation+4.81e-02

Pos logit contributions
 myself+0.058
ases+0.054
 Go+0.054
 yourselves+0.052
 yourself+0.052
Neg logit contributions
 surrounded-0.057
 he-0.053
 full-0.051
 distance-0.049
 spotted-0.049
Token something
Token ID1223
Feature activation+3.22e-02

Pos logit contributions
ases+0.115
 myself+0.114
rr+0.113
 Go+0.111
 Deal+0.109
Neg logit contributions
 surrounded-0.054
 he-0.052
 full-0.049
 which-0.043
 filled-0.043
Token that
Token ID326
Feature activation+2.58e-02

Pos logit contributions
ases+0.074
 myself+0.072
rr+0.071
 Go+0.070
 Deal+0.069
Neg logit contributions
 he-0.042
 surrounded-0.042
 full-0.040
 which-0.035
 filled-0.034
Token didn
Token ID1422
Feature activation+1.27e-01

Pos logit contributions
 myself+0.050
ases+0.049
 Go+0.047
rr+0.047
 yourselves+0.046
Neg logit contributions
 surrounded-0.042
 he-0.040
 full-0.038
 distance-0.035
 spotted-0.035
Token't
Token ID470
Feature activation+7.83e-02

Pos logit contributions
ases+0.205
rav+0.193
rr+0.190
 yourselves+0.190
ymes+0.188
Neg logit contributions
 he-0.265
 surrounded-0.229
 full-0.224
 filled-0.217
 she-0.214
Token look
Token ID804
Feature activation+7.11e-02

Pos logit contributions
 myself+0.146
ases+0.137
 Go+0.136
rr+0.133
 yourselves+0.132
Neg logit contributions
 surrounded-0.134
 he-0.124
 full-0.118
 distance-0.114
 spotted-0.114
Token right
Token ID826
Feature activation+7.17e-02

Pos logit contributions
 myself+0.147
 Go+0.136
 yourself+0.136
ases+0.135
 yourselves+0.135
Neg logit contributions
 surrounded-0.106
 he-0.093
 clearing-0.090
 distance-0.090
 woods-0.090
Token?
Token ID30
Feature activation+6.23e-02

Pos logit contributions
 myself+0.093
ases+0.088
 Go+0.086
rr+0.082
 yourselves+0.081
Neg logit contributions
 surrounded-0.163
 he-0.156
 full-0.150
 spotted-0.145
 distance-0.143
Token You
Token ID921
Feature activation+4.55e-02

Pos logit contributions
 myself+0.111
ases+0.110
rr+0.105
uch+0.103
 ourselves+0.103
Neg logit contributions
 surrounded-0.107
 he-0.102
 full-0.096
 clearing-0.089
 spotted-0.089
Token you
Token ID345
Feature activation+4.87e-02

Pos logit contributions
vern+0.078
ases+0.069
uch+0.068
 Deal+0.066
rr+0.065
Neg logit contributions
 he-0.147
 bushes-0.128
 surrounded-0.124
 which-0.123
 they-0.123
Token.
Token ID13
Feature activation+6.97e-02

Pos logit contributions
ases+0.126
 Deal+0.125
 myself+0.124
uch+0.120
 yourselves+0.120
Neg logit contributions
 he-0.043
 surrounded-0.042
 full-0.038
 which-0.034
 while-0.033
Token I
Token ID314
Feature activation+5.72e-02

Pos logit contributions
 myself+0.113
ases+0.111
rr+0.106
uch+0.106
 ourselves+0.104
Neg logit contributions
 surrounded-0.130
 he-0.126
 full-0.118
 curled-0.109
 rabbit-0.108
Token know
Token ID760
Feature activation+3.47e-02

Pos logit contributions
 myself+0.102
 yourselves+0.091
ases+0.091
 Go+0.089
rr+0.089
Neg logit contributions
 surrounded-0.109
 he-0.102
 full-0.100
 spotted-0.097
 filled-0.094
Token you
Token ID345
Feature activation+5.39e-02

Pos logit contributions
ases+0.039
 myself+0.038
rr+0.037
 Deal+0.036
uch+0.035
Neg logit contributions
 he-0.084
 surrounded-0.082
 full-0.077
 which-0.075
 filled-0.072
Token didn
Token ID1422
Feature activation+1.26e-01

Pos logit contributions
 myself+0.100
ases+0.096
 Deal+0.095
 Go+0.093
 yourselves+0.093
Neg logit contributions
 surrounded-0.095
 he-0.089
 full-0.086
 spotted-0.082
 filled-0.080
Token't
Token ID470
Feature activation+7.90e-02

Pos logit contributions
 Deal+0.192
 yourselves+0.186
ases+0.183
ternally+0.181
oco+0.180
Neg logit contributions
 he-0.259
 surrounded-0.219
 full-0.214
 which-0.204
 she-0.203
Token mean
Token ID1612
Feature activation+6.64e-02

Pos logit contributions
 myself+0.158
 Deal+0.157
ases+0.155
 Go+0.153
uch+0.148
Neg logit contributions
 surrounded-0.117
 he-0.111
 full-0.108
 distance-0.100
 spotted-0.097
Token to
Token ID284
Feature activation+7.12e-02

Pos logit contributions
 Deal+0.182
 myself+0.173
uch+0.169
prise+0.166
ases+0.166
Neg logit contributions
 he-0.069
 which-0.050
 while-0.047
 surrounded-0.047
 though-0.046
Token break
Token ID2270
Feature activation+6.33e-02

Pos logit contributions
 Deal+0.147
ases+0.140
 Go+0.139
 myself+0.138
vern+0.137
Neg logit contributions
 he-0.108
 surrounded-0.096
 which-0.092
 rabbit-0.090
 saw-0.089
Token it
Token ID340
Feature activation+4.82e-02

Pos logit contributions
 Deal+0.135
rr+0.132
uch+0.131
 Go+0.129
ases+0.128
Neg logit contributions
 he-0.087
 surrounded-0.078
 which-0.075
 bushes-0.073
 though-0.070
Token we
Token ID356
Feature activation+5.28e-02

Pos logit contributions
 myself+0.071
ases+0.067
 Go+0.065
rr+0.063
 yourselves+0.063
Neg logit contributions
 surrounded-0.096
 he-0.091
 full-0.087
 distance-0.084
 spotted-0.084
Token're
Token ID821
Feature activation+6.80e-02

Pos logit contributions
 myself+0.102
ases+0.097
 Go+0.095
 yourselves+0.094
rr+0.093
Neg logit contributions
 surrounded-0.088
 he-0.082
 full-0.078
 spotted-0.074
 distance-0.073
Token sorry
Token ID7926
Feature activation+7.81e-02

Pos logit contributions
 myself+0.123
 Go+0.112
ases+0.112
 yourselves+0.111
 yourself+0.111
Neg logit contributions
 surrounded-0.121
 he-0.114
 distance-0.108
 spotted-0.108
 full-0.106
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
ases+0.192
 myself+0.185
uch+0.184
 Deal+0.179
 yourselves+0.177
Neg logit contributions
 he-0.092
 surrounded-0.083
 full-0.079
 which-0.069
 filled-0.066
Token We
Token ID775
Feature activation+4.80e-02

Pos logit contributions
 myself+0.112
ases+0.105
 Go+0.103
rr+0.100
 yourselves+0.099
Neg logit contributions
 surrounded-0.159
 he-0.149
 full-0.144
 distance-0.139
 spotted-0.139
Token didn
Token ID1422
Feature activation+1.24e-01

Pos logit contributions
 myself+0.101
ases+0.096
 Go+0.094
 yourselves+0.093
rr+0.092
Neg logit contributions
 surrounded-0.073
 he-0.067
 full-0.064
 spotted-0.061
 distance-0.059
Token't
Token ID470
Feature activation+7.62e-02

Pos logit contributions
ases+0.182
rr+0.174
 yourselves+0.173
oco+0.170
 yourself+0.169
Neg logit contributions
 he-0.262
 surrounded-0.225
 they-0.214
 full-0.214
 she-0.208
Token mean
Token ID1612
Feature activation+5.81e-02

Pos logit contributions
 myself+0.149
ases+0.147
 Deal+0.145
 ourselves+0.143
 Go+0.142
Neg logit contributions
 surrounded-0.113
 he-0.111
 full-0.108
 saw-0.097
 distance-0.096
Token to
Token ID284
Feature activation+6.59e-02

Pos logit contributions
 Deal+0.156
 myself+0.151
ases+0.148
uch+0.147
rr+0.146
Neg logit contributions
 he-0.059
 surrounded-0.044
 which-0.042
 full-0.039
 saw-0.038
Token make
Token ID787
Feature activation+6.23e-02

Pos logit contributions
 Deal+0.138
ases+0.135
 myself+0.132
 Go+0.131
rr+0.128
Neg logit contributions
 he-0.101
 surrounded-0.089
 full-0.082
 which-0.082
 distance-0.081
Token you
Token ID345
Feature activation+5.01e-02

Pos logit contributions
rr+0.104
 Deal+0.103
rav+0.101
uch+0.101
vern+0.101
Neg logit contributions
 he-0.106
 fields-0.095
 bushes-0.092
 wolf-0.092
 stream-0.091
Token cat
Token ID3797
Feature activation+3.33e-02

Pos logit contributions
 myself+0.057
ases+0.055
 Go+0.053
rr+0.052
 yourselves+0.052
Neg logit contributions
 surrounded-0.052
 he-0.048
 full-0.046
 distance-0.044
 spotted-0.043
Token.
Token ID13
Feature activation+6.65e-02

Pos logit contributions
 myself+0.074
ases+0.070
 Go+0.070
rr+0.068
 yourselves+0.068
Neg logit contributions
 surrounded-0.045
 he-0.040
 full-0.038
 distance-0.036
 spotted-0.036
Token I
Token ID314
Feature activation+4.97e-02

Pos logit contributions
ases+0.108
 myself+0.104
uch+0.103
rr+0.101
 ourselves+0.097
Neg logit contributions
 surrounded-0.124
 he-0.121
 full-0.115
 filled-0.104
 which-0.104
Token know
Token ID760
Feature activation+2.88e-02

Pos logit contributions
 myself+0.091
ases+0.085
 yourselves+0.083
 Go+0.083
 ourselves+0.081
Neg logit contributions
 surrounded-0.089
 he-0.083
 full-0.081
 spotted-0.077
 filled-0.075
Token you
Token ID345
Feature activation+4.88e-02

Pos logit contributions
ases+0.029
 myself+0.026
rr+0.026
uch+0.025
 Deal+0.024
Neg logit contributions
 he-0.074
 surrounded-0.072
 full-0.069
 which-0.068
 filled-0.065
Token didn
Token ID1422
Feature activation+1.24e-01

Pos logit contributions
 myself+0.097
ases+0.094
 Go+0.091
 Deal+0.090
 yourselves+0.090
Neg logit contributions
 surrounded-0.079
 he-0.074
 full-0.071
 spotted-0.067
 filled-0.066
Token't
Token ID470
Feature activation+7.19e-02

Pos logit contributions
rav+0.171
ases+0.170
 yourselves+0.165
 Deal+0.164
rr+0.160
Neg logit contributions
 he-0.254
 surrounded-0.226
 full-0.224
 filled-0.217
 which-0.216
Token mean
Token ID1612
Feature activation+5.85e-02

Pos logit contributions
 myself+0.142
 Deal+0.141
ases+0.141
 Go+0.139
 ourselves+0.136
Neg logit contributions
 surrounded-0.106
 he-0.100
 full-0.100
 which-0.089
 filled-0.088
Token to
Token ID284
Feature activation+7.33e-02

Pos logit contributions
 Deal+0.150
 myself+0.148
ases+0.147
uch+0.147
rr+0.145
Neg logit contributions
 he-0.060
 surrounded-0.048
 which-0.047
 while-0.043
 full-0.043
Token break
Token ID2270
Feature activation+5.81e-02

Pos logit contributions
 Deal+0.158
ases+0.147
 Go+0.146
vern+0.145
uch+0.142
Neg logit contributions
 he-0.107
 which-0.093
 surrounded-0.093
 full-0.091
 saw-0.090
Token my
Token ID616
Feature activation+1.89e-02

Pos logit contributions
 Deal+0.122
rr+0.122
uch+0.121
ases+0.120
 Go+0.120
Neg logit contributions
 he-0.077
 surrounded-0.074
 which-0.067
 full-0.064
 bushes-0.064
Token It
Token ID632
Feature activation+5.32e-02

Pos logit contributions
 myself+0.090
 Go+0.083
ases+0.080
 yourselves+0.078
 yourself+0.077
Neg logit contributions
 surrounded-0.169
 he-0.154
 distance-0.153
 full-0.153
 spotted-0.152
Token's
Token ID338
Feature activation+5.91e-02

Pos logit contributions
 myself+0.121
ases+0.119
rr+0.115
 yourselves+0.115
 Deal+0.113
Neg logit contributions
 surrounded-0.070
 he-0.066
 full-0.060
 curled-0.055
 filled-0.055
Token OK
Token ID7477
Feature activation+8.57e-02

Pos logit contributions
ases+0.101
 myself+0.101
 Go+0.097
rr+0.096
 yourselves+0.091
Neg logit contributions
 surrounded-0.110
 he-0.101
 full-0.101
 distance-0.092
 which-0.092
Token.
Token ID13
Feature activation+8.02e-02

Pos logit contributions
ases+0.262
 Deal+0.257
oco+0.254
 myself+0.252
 yourselves+0.249
Neg logit contributions
 he-0.055
 full-0.038
 surrounded-0.034
 she-0.029
 had-0.026
Token You
Token ID921
Feature activation+5.30e-02

Pos logit contributions
 myself+0.127
 Go+0.118
ases+0.118
 yourselves+0.113
 yourself+0.112
Neg logit contributions
 surrounded-0.162
 he-0.149
 full-0.145
 distance-0.142
 spotted-0.142
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.089
ases+0.084
 yourselves+0.082
 Go+0.081
 Deal+0.080
Neg logit contributions
 surrounded-0.103
 he-0.098
 full-0.095
 filled-0.090
 saw-0.088
Token't
Token ID470
Feature activation+7.58e-02

Pos logit contributions
ases+0.193
 yourselves+0.189
rav+0.184
ipp+0.180
oco+0.178
Neg logit contributions
 he-0.240
 full-0.202
 surrounded-0.200
 she-0.193
 they-0.191
Token scare
Token ID19437
Feature activation+5.42e-02

Pos logit contributions
 myself+0.154
 Deal+0.152
ases+0.150
 yourselves+0.145
 Go+0.145
Neg logit contributions
 surrounded-0.109
 he-0.104
 full-0.103
 rabbit-0.093
 distance-0.093
Token me
Token ID502
Feature activation+4.71e-02

Pos logit contributions
vern+0.077
uch+0.076
 Deal+0.076
rr+0.072
 myself+0.072
Neg logit contributions
 he-0.110
 bushes-0.105
 surrounded-0.103
 rabbit-0.101
 distance-0.096
Token.
Token ID13
Feature activation+8.07e-02

Pos logit contributions
ases+0.117
 Deal+0.117
 myself+0.116
 Go+0.115
rr+0.114
Neg logit contributions
 surrounded-0.046
 he-0.044
 full-0.040
 which-0.035
 distance-0.035
Token You
Token ID921
Feature activation+5.30e-02

Pos logit contributions
 myself+0.124
ases+0.116
 Go+0.114
rr+0.111
 yourselves+0.110
Neg logit contributions
 surrounded-0.165
 he-0.154
 full-0.149
 distance-0.144
 spotted-0.143
Token."
Token ID526
Feature activation-2.22e-03

Pos logit contributions
 myself+0.030
ases+0.029
 Go+0.029
rr+0.028
 Deal+0.028
Neg logit contributions
 surrounded-0.016
 he-0.015
 full-0.013
 distance-0.013
 spotted-0.012
Token
Token ID198
Feature activation+2.82e-02

Pos logit contributions
 surrounded+0.004
 he+0.003
 full+0.003
 spotted+0.003
 distance+0.003
Neg logit contributions
 myself-0.004
 Go-0.004
ases-0.004
 yourselves-0.004
rr-0.004
Token
Token ID198
Feature activation+3.30e-02

Pos logit contributions
 myself+0.070
uch+0.066
ases+0.066
 Go+0.065
 yourselves+0.064
Neg logit contributions
 he-0.032
 surrounded-0.030
 full-0.027
 woods-0.024
 clearing-0.024
Token"
Token ID1
Feature activation+6.61e-02

Pos logit contributions
 myself+0.059
ases+0.056
 Go+0.055
rr+0.054
 yourselves+0.053
Neg logit contributions
 surrounded-0.059
 he-0.056
 full-0.053
 distance-0.051
 clearing-0.050
TokenHe
Token ID1544
Feature activation+6.39e-02

Pos logit contributions
 myself+0.117
ases+0.113
 Go+0.109
rr+0.108
 yourselves+0.106
Neg logit contributions
 surrounded-0.120
 he-0.115
 full-0.107
 distance-0.103
 clearing-0.101
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.087
ases+0.081
rr+0.077
 Go+0.077
 yourselves+0.077
Neg logit contributions
 surrounded-0.142
 he-0.135
 full-0.130
 spotted-0.125
 curled-0.124
Token't
Token ID470
Feature activation+8.26e-02

Pos logit contributions
ases+0.176
 yourselves+0.171
rr+0.167
 yourself+0.162
ags+0.155
Neg logit contributions
 he-0.281
 surrounded-0.249
 full-0.236
 stream-0.224
 filled-0.224
Token cause
Token ID2728
Feature activation+5.76e-02

Pos logit contributions
 myself+0.146
ases+0.145
 Go+0.138
 yourselves+0.136
 Deal+0.136
Neg logit contributions
 surrounded-0.146
 he-0.139
 full-0.134
 distance-0.125
 spotted-0.124
Token us
Token ID514
Feature activation+5.20e-02

Pos logit contributions
 Deal+0.114
uch+0.113
 myself+0.112
ases+0.111
rr+0.106
Neg logit contributions
 he-0.088
 surrounded-0.083
 bushes-0.077
 woods-0.073
 fields-0.072
Token any
Token ID597
Feature activation+2.72e-02

Pos logit contributions
 myself+0.103
ases+0.101
 Go+0.099
 Deal+0.098
 yourselves+0.096
Neg logit contributions
 surrounded-0.081
 he-0.076
 full-0.073
 distance-0.068
 spotted-0.067
Token trouble
Token ID5876
Feature activation+6.83e-02

Pos logit contributions
 myself+0.051
ases+0.048
 Go+0.048
rr+0.046
 yourselves+0.046
Neg logit contributions
 surrounded-0.047
 he-0.043
 full-0.041
 distance-0.039
 spotted-0.039
Token It
Token ID632
Feature activation+4.15e-02

Pos logit contributions
 myself+0.110
ases+0.107
uch+0.103
rr+0.103
 yourselves+0.102
Neg logit contributions
 surrounded-0.084
 he-0.077
 full-0.074
 clearing-0.070
 wooden-0.068
Token will
Token ID481
Feature activation+6.90e-02

Pos logit contributions
 myself+0.059
ases+0.056
 Go+0.054
 yourselves+0.053
rr+0.053
Neg logit contributions
 surrounded-0.089
 he-0.084
 full-0.081
 wooden-0.077
 spotted-0.077
Token be
Token ID307
Feature activation+2.58e-02

Pos logit contributions
 myself+0.130
ases+0.129
 Go+0.127
rr+0.123
 Deal+0.120
Neg logit contributions
 surrounded-0.115
 he-0.107
 full-0.104
 distance-0.097
 wooden-0.096
Token easy
Token ID2562
Feature activation+7.01e-02

Pos logit contributions
 myself+0.048
 yourself+0.045
 ourselves+0.045
 yourselves+0.044
 you+0.042
Neg logit contributions
 surrounded-0.042
 he-0.041
 distance-0.038
 spotted-0.037
 woman-0.037
Token and
Token ID290
Feature activation+2.89e-02

Pos logit contributions
 myself+0.150
ases+0.145
 Go+0.142
rr+0.139
 yourselves+0.139
Neg logit contributions
 surrounded-0.100
 he-0.091
 full-0.086
 distance-0.081
 spotted-0.080
Token exc
Token ID2859
Feature activation+1.23e-01

Pos logit contributions
 myself+0.060
 Go+0.055
 ourselves+0.055
ases+0.055
 yourself+0.055
Neg logit contributions
 surrounded-0.044
 he-0.039
 distance-0.037
 spotted-0.037
 full-0.037
Tokenite
Token ID578
Feature activation+5.80e-02

Pos logit contributions
 myself+0.236
 Deal+0.234
 yourselves+0.229
 yourself+0.224
vern+0.224
Neg logit contributions
 surrounded-0.170
 he-0.163
 full-0.161
 filled-0.146
 stream-0.144
Token."
Token ID526
Feature activation+9.96e-04

Pos logit contributions
 Deal+0.108
 myself+0.094
uch+0.093
ases+0.093
rr+0.091
Neg logit contributions
 he-0.104
 surrounded-0.099
 fields-0.092
 stream-0.092
 which-0.091
Token
Token ID198
Feature activation+3.82e-02

Pos logit contributions
 myself+0.003
uch+0.003
ases+0.002
rr+0.002
 yourselves+0.002
Neg logit contributions
 he-0.001
 surrounded-0.001
 full-0.001
 woods-0.001
 bushes-0.001
Token
Token ID198
Feature activation+4.88e-02

Pos logit contributions
 myself+0.112
uch+0.109
ases+0.108
 yourselves+0.105
 yourself+0.104
Neg logit contributions
 he-0.029
 surrounded-0.018
 full-0.017
 they-0.015
 woods-0.014
TokenBen
Token ID11696
Feature activation-8.53e-03

Pos logit contributions
ases+0.092
 myself+0.090
uch+0.087
 Go+0.086
 Deal+0.084
Neg logit contributions
 he-0.076
 surrounded-0.075
 full-0.068
 clearing-0.067
 field-0.067
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
ases+0.126
 myself+0.123
rr+0.122
 Deal+0.118
 Go+0.117
Neg logit contributions
 surrounded-0.116
 full-0.115
 he-0.114
 filled-0.104
 which-0.103
Token It
Token ID632
Feature activation+3.80e-02

Pos logit contributions
 myself+0.137
ases+0.135
uch+0.133
rr+0.130
 yourself+0.126
Neg logit contributions
 surrounded-0.115
 he-0.108
 full-0.103
 filled-0.095
 which-0.095
Token is
Token ID318
Feature activation+4.70e-02

Pos logit contributions
 myself+0.086
ases+0.083
rr+0.080
 yourselves+0.080
 Go+0.080
Neg logit contributions
 surrounded-0.051
 he-0.046
 full-0.043
 spotted-0.040
 curled-0.039
Token easy
Token ID2562
Feature activation+7.13e-02

Pos logit contributions
 myself+0.086
 yourself+0.079
 ourselves+0.078
 Go+0.078
 yourselves+0.078
Neg logit contributions
 surrounded-0.081
 he-0.076
 spotted-0.072
 distance-0.071
 clearing-0.069
Token and
Token ID290
Feature activation+2.12e-02

Pos logit contributions
 myself+0.142
ases+0.136
 Go+0.133
rr+0.131
 yourselves+0.130
Neg logit contributions
 surrounded-0.112
 he-0.104
 full-0.099
 distance-0.094
 spotted-0.093
Token exc
Token ID2859
Feature activation+1.23e-01

Pos logit contributions
 myself+0.041
 ourselves+0.038
 Go+0.038
 you+0.038
 yourself+0.037
Neg logit contributions
 surrounded-0.035
 clearing-0.031
 he-0.030
 spotted-0.030
 distance-0.030
Tokenite
Token ID578
Feature activation+6.17e-02

Pos logit contributions
 myself+0.239
 Deal+0.236
 yourself+0.231
 yourselves+0.230
vern+0.225
Neg logit contributions
 surrounded-0.167
 full-0.159
 he-0.157
 filled-0.142
 distance-0.139
Token.
Token ID13
Feature activation+7.12e-02

Pos logit contributions
 Deal+0.119
 myself+0.104
uch+0.101
 yourselves+0.101
ases+0.100
Neg logit contributions
 he-0.106
 surrounded-0.103
 full-0.094
 which-0.093
 stream-0.093
Token And
Token ID843
Feature activation+3.78e-02

Pos logit contributions
 myself+0.138
ases+0.136
uch+0.132
rr+0.130
 yourselves+0.126
Neg logit contributions
 surrounded-0.111
 he-0.103
 full-0.100
 which-0.092
 filled-0.091
Token we
Token ID356
Feature activation+4.68e-02

Pos logit contributions
 myself+0.070
 Go+0.065
ases+0.064
 ourselves+0.063
 yourselves+0.063
Neg logit contributions
 surrounded-0.066
 he-0.060
 full-0.058
 distance-0.057
 spotted-0.057
Token are
Token ID389
Feature activation+4.88e-02

Pos logit contributions
 myself+0.079
ases+0.074
 Go+0.073
 yourselves+0.072
rr+0.071
Neg logit contributions
 surrounded-0.089
 he-0.083
 full-0.080
 spotted-0.077
 distance-0.076
Token to
Token ID284
Feature activation+6.00e-02

Pos logit contributions
 myself+0.148
uch+0.144
 Deal+0.143
ases+0.140
 Go+0.138
Neg logit contributions
 surrounded-0.083
 he-0.081
 full-0.074
 which-0.066
 filled-0.065
Token me
Token ID502
Feature activation+3.90e-02

Pos logit contributions
 Deal+0.088
uch+0.087
ases+0.086
vern+0.084
 myself+0.083
Neg logit contributions
 he-0.125
 surrounded-0.113
 which-0.105
 full-0.105
 saw-0.104
Token.
Token ID13
Feature activation+7.40e-02

Pos logit contributions
 myself+0.090
ases+0.089
 Deal+0.088
 Go+0.088
 yourselves+0.086
Neg logit contributions
 surrounded-0.046
 he-0.043
 full-0.040
 which-0.036
 filled-0.035
Token Friends
Token ID14213
Feature activation+5.91e-02

Pos logit contributions
uch+0.154
 myself+0.153
ases+0.152
rr+0.146
 yourselves+0.143
Neg logit contributions
 he-0.103
 surrounded-0.102
 full-0.091
 which-0.083
 rabbit-0.082
Token who
Token ID508
Feature activation+5.83e-02

Pos logit contributions
 myself+0.111
ases+0.108
 Deal+0.102
 Go+0.102
xc+0.101
Neg logit contributions
 he-0.097
 surrounded-0.095
 full-0.090
 saw-0.089
 had-0.086
Token don
Token ID836
Feature activation+1.23e-01

Pos logit contributions
 myself+0.089
ases+0.081
 Deal+0.080
 Go+0.080
rr+0.080
Neg logit contributions
 surrounded-0.119
 he-0.117
 full-0.109
 distance-0.104
 spotted-0.104
Token't
Token ID470
Feature activation+8.29e-02

Pos logit contributions
rav+0.166
 Deal+0.165
prise+0.164
ases+0.163
xc+0.162
Neg logit contributions
 he-0.252
 surrounded-0.242
 full-0.222
 they-0.220
 filled-0.214
Token lie
Token ID6486
Feature activation+6.81e-02

Pos logit contributions
 myself+0.152
ases+0.148
 Deal+0.143
 Go+0.143
 yourselves+0.140
Neg logit contributions
 surrounded-0.143
 he-0.138
 full-0.131
 distance-0.123
 spotted-0.121
Token to
Token ID284
Feature activation+4.98e-02

Pos logit contributions
 myself+0.159
ases+0.153
 Go+0.152
 Deal+0.151
rr+0.150
Neg logit contributions
 surrounded-0.083
 he-0.079
 full-0.072
 spotted-0.065
 which-0.065
Token me
Token ID502
Feature activation+4.81e-02

Pos logit contributions
ases+0.068
uch+0.066
 Deal+0.065
 myself+0.064
 Go+0.063
Neg logit contributions
 he-0.108
 surrounded-0.102
 full-0.095
 which-0.094
 distance-0.094
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
ases+0.106
 Deal+0.106
 myself+0.105
 Go+0.103
rr+0.102
Neg logit contributions
 surrounded-0.060
 he-0.058
 full-0.052
 which-0.050
 fields-0.046
Token wanted
Token ID2227
Feature activation+4.95e-02

Pos logit contributions
 myself+0.078
 Deal+0.073
ases+0.072
 ourselves+0.072
 Go+0.071
Neg logit contributions
 surrounded-0.061
 he-0.056
 full-0.054
 spotted-0.052
 filled-0.050
Token some
Token ID617
Feature activation+3.06e-02

Pos logit contributions
 Deal+0.134
ases+0.130
rr+0.128
uch+0.127
its+0.127
Neg logit contributions
 he-0.049
 surrounded-0.037
 which-0.036
 bushes-0.035
 full-0.034
Token apples
Token ID22514
Feature activation+3.50e-02

Pos logit contributions
 myself+0.073
ases+0.071
 Go+0.070
rr+0.069
 Deal+0.069
Neg logit contributions
 he-0.035
 surrounded-0.034
 full-0.030
 wooden-0.028
 bushes-0.028
Token.
Token ID13
Feature activation+7.82e-02

Pos logit contributions
 Deal+0.082
ases+0.081
 myself+0.080
 Go+0.079
rr+0.078
Neg logit contributions
 he-0.044
 surrounded-0.042
 full-0.039
 which-0.036
 filled-0.034
Token We
Token ID775
Feature activation+5.06e-02

Pos logit contributions
 myself+0.139
ases+0.133
rr+0.128
uch+0.127
 yourself+0.125
Neg logit contributions
 surrounded-0.138
 he-0.130
 full-0.124
 which-0.116
 wooden-0.115
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.097
 yourselves+0.088
 Deal+0.087
ases+0.087
rr+0.086
Neg logit contributions
 surrounded-0.089
 he-0.084
 full-0.081
 spotted-0.078
 saw-0.076
Token't
Token ID470
Feature activation+7.93e-02

Pos logit contributions
 Deal+0.189
rr+0.186
ternally+0.183
rav+0.183
 yourselves+0.183
Neg logit contributions
 he-0.254
 they-0.202
 surrounded-0.202
 she-0.200
 full-0.199
Token know
Token ID760
Feature activation+3.80e-02

Pos logit contributions
 myself+0.151
 Deal+0.151
 Go+0.149
ases+0.146
 ourselves+0.144
Neg logit contributions
 surrounded-0.121
 he-0.115
 full-0.114
 saw-0.104
 rabbit-0.104
Token they
Token ID484
Feature activation+4.51e-02

Pos logit contributions
 myself+0.081
ases+0.081
rr+0.080
 Deal+0.080
uch+0.078
Neg logit contributions
 he-0.052
 surrounded-0.051
 which-0.045
 full-0.044
 bushes-0.040
Token were
Token ID547
Feature activation+7.08e-02

Pos logit contributions
 myself+0.104
 Deal+0.103
ases+0.100
rr+0.099
 yourselves+0.096
Neg logit contributions
 surrounded-0.059
 he-0.056
 full-0.052
 saw-0.050
 had-0.050
Token yours
Token ID12431
Feature activation+6.84e-02

Pos logit contributions
 myself+0.128
ases+0.117
 Go+0.116
 yourself+0.116
 yourselves+0.115
Neg logit contributions
 surrounded-0.127
 he-0.116
 distance-0.113
 full-0.111
 clearing-0.110
Token
Token ID198
Feature activation+2.58e-02

Pos logit contributions
 myself+0.064
ases+0.063
uch+0.062
 yourselves+0.060
 ourselves+0.060
Neg logit contributions
 he-0.025
 surrounded-0.022
 full-0.020
 saw-0.017
 spotted-0.017
Token"
Token ID1
Feature activation+6.31e-02

Pos logit contributions
 myself+0.039
 Go+0.036
ases+0.035
 yourself+0.034
 yourselves+0.034
Neg logit contributions
 surrounded-0.054
 he-0.049
 full-0.048
 distance-0.047
 clearing-0.047
TokenNo
Token ID2949
Feature activation+3.86e-02

Pos logit contributions
 myself+0.105
rr+0.104
ases+0.104
 yourselves+0.101
 ourselves+0.101
Neg logit contributions
 he-0.116
 surrounded-0.113
 full-0.100
 rabbit-0.099
 distance-0.098
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
 myself+0.101
ases+0.101
 Go+0.098
 yourselves+0.096
 Deal+0.096
Neg logit contributions
 he-0.034
 surrounded-0.034
 full-0.029
 wooden-0.026
 which-0.025
Token you
Token ID345
Feature activation+5.42e-02

Pos logit contributions
 myself+0.037
 Go+0.035
ases+0.035
 yourselves+0.033
 yourself+0.033
Neg logit contributions
 surrounded-0.038
 he-0.034
 full-0.034
 distance-0.033
 clearing-0.033
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.108
 yourselves+0.101
ases+0.100
 yourself+0.098
 ourselves+0.098
Neg logit contributions
 surrounded-0.090
 he-0.086
 full-0.082
 filled-0.076
 spotted-0.076
Token't
Token ID470
Feature activation+8.58e-02

Pos logit contributions
ags+0.200
oco+0.196
ases+0.194
 Deal+0.193
 yourselves+0.192
Neg logit contributions
 he-0.256
 surrounded-0.202
 full-0.198
 they-0.196
 she-0.192
Token,
Token ID11
Feature activation+3.44e-02

Pos logit contributions
 Deal+0.146
 myself+0.135
 yourselves+0.133
 Go+0.130
ases+0.130
Neg logit contributions
 he-0.162
 surrounded-0.155
 full-0.148
 saw-0.145
 spotted-0.139
Token Tom
Token ID4186
Feature activation+5.88e-02

Pos logit contributions
 myself+0.063
ases+0.062
rr+0.059
 Go+0.059
 ourselves+0.058
Neg logit contributions
 surrounded-0.059
 he-0.056
 full-0.052
 spotted-0.049
 which-0.049
Token!
Token ID0
Feature activation+8.55e-02

Pos logit contributions
 myself+0.133
ases+0.131
 Deal+0.129
 Go+0.126
 yourselves+0.125
Neg logit contributions
 surrounded-0.074
 he-0.073
 full-0.065
 spotted-0.059
 filled-0.057
Token I
Token ID314
Feature activation+5.25e-02

Pos logit contributions
vern+0.142
uch+0.133
 ourselves+0.131
ases+0.128
prise+0.126
Neg logit contributions
 he-0.157
 surrounded-0.146
 rabbit-0.135
 full-0.133
 Suddenly-0.131
Token sorry
Token ID7926
Feature activation+7.46e-02

Pos logit contributions
 myself+0.116
ases+0.113
rr+0.110
 Go+0.108
uch+0.105
Neg logit contributions
 surrounded-0.096
 he-0.090
 full-0.087
 filled-0.079
 clearing-0.077
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
ases+0.220
uch+0.216
 myself+0.211
 Deal+0.205
oco+0.202
Neg logit contributions
 he-0.056
 surrounded-0.037
 full-0.037
 which-0.030
 they-0.029
Token Lily
Token ID20037
Feature activation+4.42e-02

Pos logit contributions
 myself+0.048
ases+0.047
 Go+0.045
rr+0.045
uch+0.044
Neg logit contributions
 surrounded-0.039
 he-0.037
 full-0.035
 wooden-0.032
 distance-0.032
Token.
Token ID13
Feature activation+6.64e-02

Pos logit contributions
 myself+0.078
 Go+0.073
 you+0.072
 yourself+0.071
 ourselves+0.071
Neg logit contributions
 surrounded-0.080
 clearing-0.072
 distance-0.072
 spotted-0.068
 woods-0.068
Token Tom
Token ID4186
Feature activation+3.92e-02

Pos logit contributions
 myself+0.102
ases+0.096
rr+0.092
 Go+0.091
 yourself+0.090
Neg logit contributions
 surrounded-0.135
 he-0.128
 full-0.122
 wooden-0.115
 distance-0.115
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.074
ases+0.069
 Go+0.069
 yourself+0.067
 yourselves+0.067
Neg logit contributions
 surrounded-0.066
 he-0.059
 full-0.057
 distance-0.057
 clearing-0.056
Token't
Token ID470
Feature activation+7.31e-02

Pos logit contributions
ases+0.195
rr+0.181
 yourselves+0.179
ags+0.174
 yourself+0.172
Neg logit contributions
 he-0.252
 she-0.211
 they-0.208
 surrounded-0.204
 full-0.203
Token know
Token ID760
Feature activation+4.37e-02

Pos logit contributions
 myself+0.155
ases+0.155
 Deal+0.151
 Go+0.149
 yourselves+0.147
Neg logit contributions
 surrounded-0.096
 he-0.096
 full-0.094
 saw-0.083
 rabbit-0.082
Token how
Token ID703
Feature activation+4.94e-02

Pos logit contributions
 myself+0.094
ases+0.094
rr+0.093
 Deal+0.092
 Go+0.089
Neg logit contributions
 he-0.061
 surrounded-0.058
 full-0.050
 which-0.050
 bushes-0.046
Token to
Token ID284
Feature activation+7.38e-02

Pos logit contributions
ases+0.126
 Deal+0.123
 myself+0.122
rr+0.119
oco+0.118
Neg logit contributions
 he-0.058
 surrounded-0.047
 full-0.047
 she-0.039
 filled-0.038
Token play
Token ID711
Feature activation+4.75e-02

Pos logit contributions
 Deal+0.173
ases+0.168
 Go+0.165
rac+0.159
rr+0.158
Neg logit contributions
 he-0.088
 saw-0.076
 surrounded-0.076
 full-0.075
 rabbit-0.073
Token sorry
Token ID7926
Feature activation+7.86e-02

Pos logit contributions
 myself+0.107
ases+0.105
rr+0.102
 Go+0.101
 ourselves+0.098
Neg logit contributions
 surrounded-0.091
 he-0.086
 full-0.081
 filled-0.074
 spotted-0.073
Token,
Token ID11
Feature activation+2.93e-02

Pos logit contributions
ases+0.234
uch+0.226
 myself+0.224
rr+0.218
 yourselves+0.218
Neg logit contributions
 he-0.061
 full-0.038
 surrounded-0.037
 rabbit-0.032
 she-0.028
Token Sara
Token ID24799
Feature activation+5.06e-02

Pos logit contributions
 myself+0.058
ases+0.057
rr+0.056
 Go+0.055
uch+0.055
Neg logit contributions
 surrounded-0.045
 he-0.045
 full-0.041
 which-0.037
 curled-0.037
Token.
Token ID13
Feature activation+7.46e-02

Pos logit contributions
 myself+0.113
ases+0.107
 Go+0.107
rr+0.105
 yourself+0.104
Neg logit contributions
 surrounded-0.068
 he-0.059
 distance-0.057
 clearing-0.056
 full-0.056
Token I
Token ID314
Feature activation+4.61e-02

Pos logit contributions
 myself+0.103
ases+0.097
rr+0.093
 Go+0.092
 yourselves+0.090
Neg logit contributions
 surrounded-0.163
 he-0.156
 full-0.149
 distance-0.142
 wooden-0.141
Token didn
Token ID1422
Feature activation+1.23e-01

Pos logit contributions
 myself+0.110
ases+0.104
 Go+0.102
 yourselves+0.102
rr+0.101
Neg logit contributions
 surrounded-0.057
 he-0.052
 full-0.049
 spotted-0.046
 filled-0.044
Token't
Token ID470
Feature activation+7.18e-02

Pos logit contributions
ases+0.181
rr+0.181
 yourselves+0.175
 Deal+0.171
oco+0.167
Neg logit contributions
 he-0.264
 she-0.213
 they-0.206
 that-0.206
 surrounded-0.205
Token see
Token ID766
Feature activation+3.72e-02

Pos logit contributions
 myself+0.150
ases+0.147
 Deal+0.145
 Go+0.144
 yourselves+0.142
Neg logit contributions
 surrounded-0.096
 he-0.094
 full-0.091
 rabbit-0.084
 saw-0.083
Token your
Token ID534
Feature activation+1.03e-02

Pos logit contributions
rr+0.056
vern+0.053
 Deal+0.051
ases+0.050
uch+0.050
Neg logit contributions
 he-0.083
 which-0.071
 she-0.070
 surrounded-0.070
 wolf-0.067
Token nose
Token ID9686
Feature activation+5.37e-02

Pos logit contributions
 myself+0.024
ases+0.023
 Go+0.023
rr+0.022
 yourselves+0.022
Neg logit contributions
 surrounded-0.013
 he-0.011
 full-0.011
 distance-0.010
 wooden-0.010
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
 myself+0.126
ases+0.124
 Deal+0.123
 Go+0.122
rr+0.122
Neg logit contributions
 surrounded-0.063
 he-0.061
 full-0.054
 distance-0.048
 which-0.048
Token you
Token ID345
Feature activation+3.54e-02

Pos logit contributions
uch+0.044
ases+0.042
 Deal+0.042
rr+0.041
 Go+0.041
Neg logit contributions
 he-0.035
 surrounded-0.031
 which-0.029
 full-0.029
 bushes-0.027
Token.
Token ID13
Feature activation+5.31e-02

Pos logit contributions
 myself+0.076
ases+0.075
 Deal+0.074
rr+0.073
 Go+0.072
Neg logit contributions
 surrounded-0.047
 he-0.047
 full-0.043
 which-0.039
 spotted-0.038
Token I
Token ID314
Feature activation+5.53e-02

Pos logit contributions
uch+0.084
 myself+0.083
ases+0.083
rr+0.079
vern+0.078
Neg logit contributions
 he-0.098
 surrounded-0.098
 full-0.089
 woods-0.082
 clearing-0.082
Token know
Token ID760
Feature activation+3.09e-02

Pos logit contributions
 myself+0.094
ases+0.085
 yourselves+0.084
 Go+0.083
rr+0.083
Neg logit contributions
 surrounded-0.107
 he-0.102
 full-0.098
 spotted-0.095
 distance-0.092
Token you
Token ID345
Feature activation+4.88e-02

Pos logit contributions
ases+0.037
 myself+0.037
rr+0.035
 Deal+0.035
uch+0.034
Neg logit contributions
 he-0.072
 surrounded-0.071
 full-0.066
 which-0.064
 filled-0.062
Token didn
Token ID1422
Feature activation+1.22e-01

Pos logit contributions
 myself+0.087
ases+0.083
 Go+0.081
 Deal+0.080
 yourselves+0.080
Neg logit contributions
 surrounded-0.089
 he-0.084
 full-0.080
 spotted-0.077
 filled-0.074
Token't
Token ID470
Feature activation+7.41e-02

Pos logit contributions
 Deal+0.171
ases+0.167
rav+0.164
 yourselves+0.162
ternally+0.162
Neg logit contributions
 he-0.268
 surrounded-0.230
 full-0.224
 which-0.215
 inside-0.213
Token mean
Token ID1612
Feature activation+6.21e-02

Pos logit contributions
 myself+0.136
ases+0.133
 Deal+0.131
 Go+0.130
uch+0.126
Neg logit contributions
 surrounded-0.124
 he-0.119
 full-0.113
 distance-0.106
 spotted-0.104
Token to
Token ID284
Feature activation+6.73e-02

Pos logit contributions
 Deal+0.165
 myself+0.157
uch+0.154
ases+0.152
rr+0.151
Neg logit contributions
 he-0.068
 which-0.051
 surrounded-0.050
 while-0.045
 though-0.045
Token rip
Token ID18344
Feature activation+5.81e-02

Pos logit contributions
 Deal+0.134
ases+0.129
 myself+0.128
 Go+0.127
uch+0.125
Neg logit contributions
 he-0.107
 surrounded-0.098
 which-0.090
 full-0.090
 distance-0.089
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
 Deal+0.106
rr+0.106
ases+0.105
 myself+0.105
uch+0.104
Neg logit contributions
 he-0.092
 surrounded-0.089
 which-0.080
 full-0.078
 bushes-0.078
Token
Token ID198
Feature activation+2.98e-02

Pos logit contributions
 myself+0.053
uch+0.053
ases+0.052
 yourselves+0.050
 Go+0.050
Neg logit contributions
 he-0.016
 surrounded-0.012
 full-0.011
 woods-0.010
 which-0.009
Token"
Token ID1
Feature activation+6.86e-02

Pos logit contributions
 myself+0.049
ases+0.048
 Go+0.046
rr+0.044
uch+0.044
Neg logit contributions
 surrounded-0.055
 he-0.054
 full-0.051
 distance-0.048
 clearing-0.048
TokenNo
Token ID2949
Feature activation+4.16e-02

Pos logit contributions
 myself+0.141
ases+0.133
 Go+0.131
rr+0.130
 yourself+0.126
Neg logit contributions
 surrounded-0.106
 he-0.098
 full-0.094
 distance-0.091
 wooden-0.087
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
 myself+0.114
ases+0.112
 Go+0.110
 yourselves+0.107
rr+0.107
Neg logit contributions
 he-0.034
 surrounded-0.032
 full-0.028
 wooden-0.025
 rabbit-0.024
Token I
Token ID314
Feature activation+5.54e-02

Pos logit contributions
 myself+0.063
ases+0.059
 Go+0.059
 yourselves+0.057
rr+0.057
Neg logit contributions
 surrounded-0.056
 he-0.052
 full-0.049
 distance-0.048
 spotted-0.048
Token didn
Token ID1422
Feature activation+1.22e-01

Pos logit contributions
 myself+0.122
ases+0.113
 Go+0.112
 yourselves+0.112
 ourselves+0.112
Neg logit contributions
 surrounded-0.083
 he-0.076
 full-0.074
 spotted-0.071
 filled-0.067
Token't
Token ID470
Feature activation+8.27e-02

Pos logit contributions
ases+0.192
 yourselves+0.188
oco+0.188
rr+0.187
ags+0.185
Neg logit contributions
 he-0.258
 she-0.217
 surrounded-0.215
 full-0.214
 they-0.207
Token take
Token ID1011
Feature activation+4.36e-02

Pos logit contributions
ases+0.154
 myself+0.154
 Deal+0.153
 Go+0.153
 yourselves+0.146
Neg logit contributions
 surrounded-0.134
 full-0.132
 he-0.129
 filled-0.114
 which-0.113
Token them
Token ID606
Feature activation+4.90e-02

Pos logit contributions
rr+0.096
 myself+0.095
ases+0.095
 Deal+0.092
uch+0.092
Neg logit contributions
 he-0.053
 surrounded-0.052
 full-0.047
 which-0.044
 bushes-0.043
Token.
Token ID13
Feature activation+8.30e-02

Pos logit contributions
ases+0.104
 Deal+0.101
 Go+0.101
 myself+0.100
rr+0.099
Neg logit contributions
 surrounded-0.067
 he-0.067
 full-0.063
 which-0.057
 distance-0.055
Token Maybe
Token ID6674
Feature activation+1.23e-02

Pos logit contributions
 myself+0.154
ases+0.149
rr+0.145
uch+0.142
 ourselves+0.140
Neg logit contributions
 surrounded-0.135
 he-0.130
 full-0.123
 wooden-0.113
 which-0.113
Token an
Token ID281
Feature activation+4.50e-02

Pos logit contributions
 myself+0.117
 Go+0.108
ases+0.107
 yourself+0.107
 yourselves+0.107
Neg logit contributions
 surrounded-0.100
 he-0.091
 distance-0.086
 full-0.086
 spotted-0.086
Token accident
Token ID5778
Feature activation+5.71e-02

Pos logit contributions
 myself+0.099
ases+0.094
 Go+0.093
rr+0.091
 yourselves+0.091
Neg logit contributions
 surrounded-0.062
 he-0.056
 full-0.053
 distance-0.051
 spotted-0.050
Token.
Token ID13
Feature activation+8.69e-02

Pos logit contributions
 myself+0.134
ases+0.128
 Go+0.127
rr+0.125
 yourselves+0.124
Neg logit contributions
 surrounded-0.070
 he-0.061
 full-0.058
 distance-0.056
 spotted-0.055
Token The
Token ID383
Feature activation+8.83e-03

Pos logit contributions
 myself+0.170
ases+0.166
rr+0.164
uch+0.160
 ourselves+0.155
Neg logit contributions
 surrounded-0.133
 he-0.121
 full-0.117
 curled-0.107
 filled-0.107
Token seed
Token ID9403
Feature activation+2.95e-02

Pos logit contributions
 myself+0.021
 yourself+0.020
 yourselves+0.020
 ourselves+0.019
ases+0.019
Neg logit contributions
 surrounded-0.011
 he-0.009
 curled-0.009
 spotted-0.009
 full-0.008
Token didn
Token ID1422
Feature activation+1.22e-01

Pos logit contributions
 myself+0.050
ases+0.048
 Go+0.047
rr+0.046
 Deal+0.046
Neg logit contributions
 surrounded-0.055
 he-0.052
 full-0.050
 distance-0.047
 spotted-0.047
Token't
Token ID470
Feature activation+7.56e-02

Pos logit contributions
ases+0.190
rr+0.182
 yourselves+0.182
oco+0.174
rav+0.171
Neg logit contributions
 he-0.249
 surrounded-0.220
 full-0.214
 she-0.211
 filled-0.206
Token mean
Token ID1612
Feature activation+5.96e-02

Pos logit contributions
 myself+0.144
ases+0.136
 Go+0.134
rr+0.131
 yourselves+0.131
Neg logit contributions
 surrounded-0.128
 he-0.117
 full-0.112
 distance-0.108
 spotted-0.107
Token to
Token ID284
Feature activation+7.83e-02

Pos logit contributions
 Deal+0.151
rr+0.146
ases+0.145
 myself+0.144
uch+0.144
Neg logit contributions
 he-0.069
 surrounded-0.053
 which-0.049
 she-0.048
 full-0.048
Token bite
Token ID13197
Feature activation+6.23e-02

Pos logit contributions
ases+0.168
 Go+0.166
 Deal+0.166
 myself+0.161
vern+0.158
Neg logit contributions
 he-0.106
 surrounded-0.105
 full-0.096
 distance-0.095
 rabbit-0.093
Token you
Token ID345
Feature activation+4.38e-02

Pos logit contributions
 Deal+0.088
rr+0.078
vern+0.077
ases+0.077
 myself+0.072
Neg logit contributions
 he-0.141
 surrounded-0.130
 bushes-0.128
 she-0.123
 which-0.122
Token sorry
Token ID7926
Feature activation+6.41e-02

Pos logit contributions
 myself+0.097
ases+0.091
 Go+0.090
rr+0.088
 yourselves+0.088
Neg logit contributions
 surrounded-0.095
 he-0.088
 full-0.084
 distance-0.082
 spotted-0.082
Token Nem
Token ID22547
Feature activation+7.46e-02

Pos logit contributions
 myself+0.159
ases+0.157
rr+0.151
 Go+0.151
 yourselves+0.150
Neg logit contributions
 surrounded-0.067
 he-0.066
 full-0.058
 which-0.050
 wooden-0.049
Tokeno
Token ID78
Feature activation+5.11e-02

Pos logit contributions
 myself+0.134
 yourselves+0.127
ases+0.125
 Deal+0.124
 yourself+0.122
Neg logit contributions
 surrounded-0.135
 he-0.129
 full-0.123
 spotted-0.115
 rabbit-0.115
Token.
Token ID13
Feature activation+6.88e-02

Pos logit contributions
 myself+0.118
ases+0.113
 Go+0.112
 yourselves+0.110
rr+0.110
Neg logit contributions
 surrounded-0.064
 he-0.058
 full-0.055
 distance-0.051
 spotted-0.050
Token I
Token ID314
Feature activation+5.36e-02

Pos logit contributions
 myself+0.099
ases+0.095
rr+0.090
 yourselves+0.089
 Go+0.088
Neg logit contributions
 surrounded-0.146
 he-0.141
 full-0.134
 spotted-0.126
 distance-0.125
Token didn
Token ID1422
Feature activation+1.22e-01

Pos logit contributions
 myself+0.104
ases+0.098
 yourselves+0.096
 Go+0.096
rr+0.094
Neg logit contributions
 surrounded-0.090
 he-0.084
 full-0.080
 spotted-0.076
 curled-0.075
Token't
Token ID470
Feature activation+8.02e-02

Pos logit contributions
ases+0.201
 yourselves+0.197
rr+0.193
ymes+0.180
ags+0.180
Neg logit contributions
 he-0.260
 surrounded-0.211
 full-0.203
 she-0.200
 that-0.192
Token know
Token ID760
Feature activation+4.08e-02

Pos logit contributions
 myself+0.154
ases+0.152
 yourselves+0.147
 Go+0.146
 Deal+0.144
Neg logit contributions
 surrounded-0.130
 he-0.125
 full-0.119
 rabbit-0.111
 distance-0.110
Token the
Token ID262
Feature activation+3.17e-02

Pos logit contributions
 myself+0.083
ases+0.083
rr+0.081
 Deal+0.080
 Go+0.078
Neg logit contributions
 he-0.061
 surrounded-0.060
 full-0.052
 which-0.051
 fields-0.048
Token waves
Token ID9813
Feature activation+4.51e-02

Pos logit contributions
 myself+0.067
ases+0.064
 Go+0.063
rr+0.062
 yourselves+0.062
Neg logit contributions
 surrounded-0.047
 he-0.043
 full-0.040
 distance-0.038
 spotted-0.038
Token would
Token ID561
Feature activation+8.38e-02

Pos logit contributions
 myself+0.090
ases+0.086
 Deal+0.085
rr+0.084
 yourselves+0.084
Neg logit contributions
 surrounded-0.071
 he-0.068
 full-0.063
 distance-0.059
 spotted-0.059
TokenSorry
Token ID14385
Feature activation+6.84e-02

Pos logit contributions
 myself+0.097
ases+0.092
 Go+0.090
 yourselves+0.088
rr+0.088
Neg logit contributions
 surrounded-0.097
 he-0.089
 full-0.086
 distance-0.083
 spotted-0.083
Token,
Token ID11
Feature activation+2.62e-02

Pos logit contributions
ases+0.193
 myself+0.189
uch+0.185
 Go+0.181
 Deal+0.180
Neg logit contributions
 he-0.052
 surrounded-0.047
 full-0.041
 which-0.035
 filled-0.032
Token sorry
Token ID7926
Feature activation+7.48e-02

Pos logit contributions
 myself+0.046
 Go+0.043
 yourselves+0.042
 Deal+0.042
 ourselves+0.042
Neg logit contributions
 surrounded-0.049
 spotted-0.043
 he-0.043
 distance-0.043
 clearing-0.042
Token,
Token ID11
Feature activation+3.56e-02

Pos logit contributions
ases+0.190
 myself+0.188
uch+0.186
 Deal+0.180
 yourselves+0.175
Neg logit contributions
 he-0.078
 surrounded-0.064
 full-0.063
 rabbit-0.056
 while-0.053
Token we
Token ID356
Feature activation+5.14e-02

Pos logit contributions
 myself+0.063
 Go+0.059
ases+0.058
 yourselves+0.057
 ourselves+0.057
Neg logit contributions
 surrounded-0.066
 he-0.060
 full-0.057
 distance-0.057
 spotted-0.057
Token didn
Token ID1422
Feature activation+1.22e-01

Pos logit contributions
 myself+0.108
ases+0.100
 Go+0.097
 yourselves+0.097
 ourselves+0.094
Neg logit contributions
 surrounded-0.081
 he-0.073
 full-0.073
 spotted-0.070
 filled-0.067
Token't
Token ID470
Feature activation+7.24e-02

Pos logit contributions
ases+0.178
oco+0.178
 Deal+0.175
rav+0.174
rr+0.173
Neg logit contributions
 he-0.254
 they-0.214
 surrounded-0.211
 full-0.210
 she-0.205
Token see
Token ID766
Feature activation+3.53e-02

Pos logit contributions
 myself+0.145
ases+0.138
 Deal+0.138
 Go+0.135
 yourselves+0.134
Neg logit contributions
 surrounded-0.108
 full-0.103
 he-0.102
 saw-0.093
 rabbit-0.092
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
rr+0.068
 Deal+0.065
ases+0.064
vern+0.063
 myself+0.063
Neg logit contributions
 he-0.058
 surrounded-0.054
 which-0.051
 full-0.048
 bushes-0.048
Token wave
Token ID6769
Feature activation+4.53e-02

Pos logit contributions
 myself+0.035
ases+0.033
 yourselves+0.033
rr+0.033
 Go+0.033
Neg logit contributions
 surrounded-0.019
 he-0.017
 full-0.015
 spotted-0.015
 clearing-0.015
Token,"
Token ID553
Feature activation+7.39e-03

Pos logit contributions
 myself+0.105
ases+0.101
 Go+0.098
 Deal+0.098
 yourselves+0.098
Neg logit contributions
 surrounded-0.057
 he-0.053
 full-0.048
 distance-0.045
 spotted-0.045
Token she
Token ID673
Feature activation-2.58e-02

Pos logit contributions
 surrounded+0.060
aver+0.058
 wooden+0.055
 barrel+0.053
 abandoned+0.052
Neg logit contributions
 you-0.106
 ourselves-0.092
 myself-0.092
't-0.087
 Go-0.086
Token finally
Token ID3443
Feature activation-1.36e-02

Pos logit contributions
 surrounded+0.029
 wooden+0.027
 fields+0.027
 wolf+0.026
 woods+0.025
Neg logit contributions
 myself-0.052
oh-0.050
ases-0.050
rr-0.050
uch-0.050
Token stepped
Token ID10764
Feature activation-1.87e-02

Pos logit contributions
 wolf+0.018
 fields+0.017
 wooden+0.017
 barrel+0.017
 woods+0.016
Neg logit contributions
ases-0.024
rr-0.024
oh-0.023
 myself-0.023
uch-0.023
Token into
Token ID656
Feature activation-4.21e-02

Pos logit contributions
 surrounded+0.026
 pretended+0.023
 gir+0.023
 drove+0.023
 woman+0.023
Neg logit contributions
 myself-0.034
 you-0.032
 yourself-0.032
 Go-0.032
 Deal-0.030
Token the
Token ID262
Feature activation-2.99e-02

Pos logit contributions
 surrounded+0.056
 he+0.050
 drove+0.048
 spotted+0.048
 distance+0.047
Neg logit contributions
 myself-0.087
 yourself-0.079
 you-0.079
 ourselves-0.078
ases-0.078
Token chamber
Token ID11847
Feature activation-8.81e-02

Pos logit contributions
 surrounded+0.049
 which+0.045
 spotted+0.044
 curled+0.043
 he+0.042
Neg logit contributions
 myself-0.055
 you-0.050
 yourself-0.050
 yourselves-0.048
 Deal-0.047
Token,
Token ID11
Feature activation-5.73e-02

Pos logit contributions
 surrounded+0.126
 wolf+0.116
 roasted+0.114
 curled+0.113
aver+0.112
Neg logit contributions
 you-0.159
 myself-0.159
 ourselves-0.145
 yourself-0.141
 Go-0.140
Token she
Token ID673
Feature activation-4.01e-02

Pos logit contributions
aver+0.065
 barrel+0.065
 wooden+0.064
 miner+0.064
 polar+0.063
Neg logit contributions
 you-0.120
 myself-0.113
 yourself-0.110
 ourselves-0.105
 yourselves-0.104
Token saw
Token ID2497
Feature activation-4.49e-02

Pos logit contributions
 surrounded+0.037
 he+0.033
 distance+0.032
 wooden+0.032
 wolf+0.030
Neg logit contributions
 myself-0.094
 Go-0.089
 yourself-0.089
ases-0.088
rr-0.088
Token that
Token ID326
Feature activation-4.58e-02

Pos logit contributions
 surrounded+0.065
 distance+0.060
 clearing+0.056
 drove+0.056
 full+0.056
Neg logit contributions
 myself-0.087
 yourself-0.083
 ourselves-0.080
 yourselves-0.080
 you-0.079
Token it
Token ID340
Feature activation-2.61e-02

Pos logit contributions
 surrounded+0.057
 drove+0.055
 wooden+0.055
 miner+0.054
 distance+0.053
Neg logit contributions
 myself-0.092
 you-0.090
 yourself-0.089
 ourselves-0.086
 yourselves-0.084
Token they
Token ID484
Feature activation-1.56e-02

Pos logit contributions
 surrounded+0.058
 hollow+0.057
 woman+0.053
 shield+0.052
 curled+0.052
Neg logit contributions
 you-0.106
 ourselves-0.103
 us-0.096
 myself-0.096
 yourself-0.087
Token walked
Token ID6807
Feature activation-4.54e-02

Pos logit contributions
 surrounded+0.017
 he+0.016
 wooden+0.016
 knowing+0.016
 woman+0.015
Neg logit contributions
ases-0.033
rr-0.032
 myself-0.030
 Go-0.030
uch-0.030
Token further
Token ID2252
Feature activation-4.81e-02

Pos logit contributions
 surrounded+0.075
 roasted+0.066
 curled+0.065
 warmed+0.064
 shone+0.063
Neg logit contributions
 myself-0.076
 ourselves-0.071
 you-0.070
ases-0.067
 us-0.065
Token into
Token ID656
Feature activation-4.74e-02

Pos logit contributions
 surrounded+0.077
 curled+0.070
 roasted+0.069
 miner+0.067
 full+0.066
Neg logit contributions
 ourselves-0.081
 myself-0.080
 you-0.076
 yourself-0.076
ases-0.075
Token the
Token ID262
Feature activation-3.06e-02

Pos logit contributions
 surrounded+0.068
 full+0.059
 knowing+0.059
 curled+0.058
 which+0.057
Neg logit contributions
ases-0.088
 myself-0.088
 ourselves-0.087
 you-0.085
 us-0.082
Token chamber
Token ID11847
Feature activation-8.59e-02

Pos logit contributions
 surrounded+0.049
 which+0.043
 spotted+0.042
 curled+0.042
 knowing+0.041
Neg logit contributions
 myself-0.058
 you-0.054
 yourself-0.053
 yourselves-0.053
 ourselves-0.051
Token,
Token ID11
Feature activation-5.72e-02

Pos logit contributions
 surrounded+0.134
 wolf+0.126
 curled+0.125
 roasted+0.121
 drove+0.120
Neg logit contributions
 you-0.145
 myself-0.137
 ourselves-0.133
rr-0.125
 yourself-0.123
Token they
Token ID484
Feature activation-4.15e-02

Pos logit contributions
 distance+0.081
 hollow+0.081
 woman+0.079
 doorway+0.075
 hunter+0.074
Neg logit contributions
 you-0.110
 ourselves-0.109
 us-0.095
 myself-0.095
 yourselves-0.089
Token heard
Token ID2982
Feature activation-4.83e-02

Pos logit contributions
 wooden+0.046
 clearing+0.039
 surrounded+0.039
 wolf+0.038
 barrel+0.038
Neg logit contributions
 us-0.081
rr-0.079
 you-0.079
uch-0.076
oh-0.076
Token a
Token ID257
Feature activation-2.98e-02

Pos logit contributions
 surrounded+0.081
 distance+0.075
 clearing+0.073
 hollow+0.071
 wooden+0.071
Neg logit contributions
 Go-0.077
 us-0.076
 ourselves-0.076
 you-0.076
 myself-0.075
Token voice
Token ID3809
Feature activation-4.33e-02

Pos logit contributions
 surrounded+0.048
 he+0.040
 full+0.039
 curled+0.039
 spotted+0.038
Neg logit contributions
 myself-0.055
rr-0.054
 Go-0.053
 you-0.052
 Deal-0.051
Token As
Token ID1081
Feature activation-5.00e-02

Pos logit contributions
 surrounded+0.016
 he+0.015
 full+0.014
 distance+0.013
 spotted+0.013
Neg logit contributions
 myself-0.033
ases-0.031
 Go-0.031
rr-0.031
 yourselves-0.030
Token she
Token ID673
Feature activation-1.74e-02

Pos logit contributions
 surrounded+0.059
 barrel+0.056
 miner+0.052
 wooden+0.051
 spotted+0.048
Neg logit contributions
 you-0.114
 myself-0.108
 ourselves-0.102
 yourself-0.099
 Go-0.096
Token walked
Token ID6807
Feature activation-4.11e-02

Pos logit contributions
 surrounded+0.019
 full+0.016
 distance+0.016
 he+0.016
 wooden+0.015
Neg logit contributions
 myself-0.038
uch-0.037
ases-0.037
 Go-0.037
rr-0.036
Token through
Token ID832
Feature activation-5.51e-02

Pos logit contributions
 surrounded+0.068
 roasted+0.063
 warmed+0.058
 wooden+0.057
 drove+0.057
Neg logit contributions
 myself-0.071
 you-0.064
 yourself-0.061
 ourselves-0.061
 Go-0.059
Token the
Token ID262
Feature activation-2.73e-02

Pos logit contributions
 surrounded+0.075
 miner+0.069
aver+0.067
 barrel+0.066
 drove+0.065
Neg logit contributions
 myself-0.109
 you-0.103
 yourself-0.100
 ourselves-0.097
ases-0.096
Token cabin
Token ID9351
Feature activation-8.25e-02

Pos logit contributions
 surrounded+0.041
 he+0.035
 which+0.032
 curled+0.032
 knowing+0.031
Neg logit contributions
 myself-0.056
 yourself-0.054
 yourselves-0.053
 you-0.051
 Go-0.051
Token,
Token ID11
Feature activation-5.73e-02

Pos logit contributions
 surrounded+0.118
 curled+0.108
 roasted+0.108
 hollow+0.105
 wooden+0.104
Neg logit contributions
 myself-0.144
 you-0.142
 ourselves-0.132
 yourself-0.129
 yourselves-0.125
Token she
Token ID673
Feature activation-4.58e-02

Pos logit contributions
 barrel+0.066
 surrounded+0.065
 miner+0.064
 wooden+0.063
 distance+0.063
Neg logit contributions
 you-0.125
 myself-0.117
 yourself-0.112
 yourselves-0.108
 Go-0.107
Token kept
Token ID4030
Feature activation-1.26e-02

Pos logit contributions
 surrounded+0.042
 wooden+0.040
 barrel+0.039
 full+0.038
 which+0.037
Neg logit contributions
uch-0.096
 myself-0.095
oh-0.093
 yourself-0.093
 you-0.093
Token being
Token ID852
Feature activation-1.54e-02

Pos logit contributions
 surrounded+0.017
ger+0.017
 rabbit+0.017
 miner+0.017
 distance+0.017
Neg logit contributions
 myself-0.023
 yourself-0.022
 you-0.021
 yourselves-0.020
 Go-0.020
Token clumsy
Token ID39120
Feature activation-1.35e-02

Pos logit contributions
 he+0.022
 barrel+0.022
 miner+0.022
 woods+0.021
 wooden+0.021
Neg logit contributions
 myself-0.029
 yourself-0.029
 yourselves-0.026
 you-0.025
 ourselves-0.024
Token was
Token ID373
Feature activation-1.89e-02

Pos logit contributions
 surrounded+0.009
 distance+0.008
 he+0.008
 clearing+0.008
 full+0.008
Neg logit contributions
 myself-0.019
 Go-0.018
ases-0.018
rr-0.018
 yourself-0.018
Token outside
Token ID2354
Feature activation-4.35e-02

Pos logit contributions
 surrounded+0.024
 he+0.024
 distance+0.022
 bushes+0.021
 stream+0.020
Neg logit contributions
 myself-0.040
ases-0.038
 Go-0.038
 yourself-0.037
 Deal-0.037
Token walking
Token ID6155
Feature activation-5.73e-02

Pos logit contributions
 surrounded+0.064
 clearing+0.061
 distance+0.061
 woman+0.060
 curled+0.059
Neg logit contributions
 myself-0.079
 you-0.073
ases-0.071
 Go-0.070
 yourself-0.069
Token around
Token ID1088
Feature activation-5.23e-02

Pos logit contributions
 surrounded+0.088
 curled+0.081
 warmed+0.079
 widened+0.078
 roasted+0.078
Neg logit contributions
 myself-0.105
 ourselves-0.094
 you-0.094
ases-0.090
 yourself-0.089
Token the
Token ID262
Feature activation-3.88e-02

Pos logit contributions
 surrounded+0.071
 curled+0.066
 hollow+0.064
 full+0.063
 miner+0.062
Neg logit contributions
 myself-0.103
 you-0.100
 us-0.095
 yourself-0.095
 ourselves-0.093
Token garden
Token ID11376
Feature activation-8.20e-02

Pos logit contributions
 surrounded+0.049
 he+0.044
 full+0.040
 curled+0.039
 spotted+0.039
Neg logit contributions
 myself-0.089
 yourself-0.083
 yourselves-0.082
 ourselves-0.082
 Go-0.080
Token when
Token ID618
Feature activation-5.76e-02

Pos logit contributions
 wolf+0.136
 hunter+0.125
 surrounded+0.121
 eerie+0.121
 curled+0.120
Neg logit contributions
 myself-0.131
 you-0.121
 ourselves-0.120
 yourself-0.116
 Go-0.108
Token he
Token ID339
Feature activation-3.27e-02

Pos logit contributions
 surrounded+0.034
 curled+0.028
 rabbit+0.026
 clearing+0.026
 distance+0.024
Neg logit contributions
 myself-0.152
 you-0.150
 ourselves-0.145
 yourselves-0.143
 yourself-0.142
Token saw
Token ID2497
Feature activation-4.89e-02

Pos logit contributions
 surrounded+0.023
 wooden+0.019
 he+0.017
 which+0.017
 full+0.017
Neg logit contributions
 myself-0.085
uch-0.084
 yourself-0.083
 you-0.082
 Go-0.082
Token something
Token ID1223
Feature activation-2.98e-02

Pos logit contributions
 surrounded+0.080
 distance+0.075
 winding+0.071
 clearing+0.070
 spotted+0.069
Neg logit contributions
 myself-0.087
 yourself-0.082
 yourselves-0.079
 ourselves-0.078
 you-0.077
Token exciting
Token ID7895
Feature activation-1.45e-02

Pos logit contributions
 surrounded+0.041
 distance+0.038
 fields+0.037
 clearing+0.036
 valleys+0.036
Neg logit contributions
 Go-0.054
 you-0.054
 myself-0.054
 ourselves-0.050
 yourself-0.049
Token girl
Token ID2576
Feature activation-3.95e-02

Pos logit contributions
 surrounded+0.028
 he+0.024
 curled+0.023
 full+0.023
 distance+0.021
Neg logit contributions
 myself-0.066
 you-0.065
 Go-0.064
rr-0.063
ases-0.062
Token was
Token ID373
Feature activation-3.09e-02

Pos logit contributions
 surrounded+0.036
 he+0.032
 distance+0.029
 full+0.028
 clearing+0.027
Neg logit contributions
 Go-0.099
 myself-0.098
ases-0.096
rr-0.094
 you-0.094
Token walking
Token ID6155
Feature activation-3.84e-02

Pos logit contributions
 surrounded+0.040
 he+0.038
 distance+0.034
 full+0.033
 spotted+0.033
Neg logit contributions
 myself-0.068
 Go-0.064
ases-0.064
 yourselves-0.063
 yourself-0.063
Token down
Token ID866
Feature activation-4.68e-02

Pos logit contributions
 surrounded+0.057
 curled+0.052
 he+0.051
 miner+0.050
 widened+0.050
Neg logit contributions
 myself-0.075
 you-0.069
 yourself-0.066
 Go-0.065
 ourselves-0.064
Token the
Token ID262
Feature activation-3.04e-02

Pos logit contributions
 surrounded+0.064
 curled+0.060
 miner+0.060
 spotted+0.057
 barrel+0.057
Neg logit contributions
 myself-0.095
 you-0.093
 yourselves-0.091
 yourself-0.090
ases-0.087
Token hallway
Token ID23959
Feature activation-7.99e-02

Pos logit contributions
 surrounded+0.039
 he+0.033
 spotted+0.032
 full+0.031
 curled+0.031
Neg logit contributions
 myself-0.070
 yourselves-0.065
 ourselves-0.064
 yourself-0.063
 you-0.063
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
 wolf+0.119
 rabbit+0.116
 hunter+0.113
 deer+0.112
 hollow+0.111
Neg logit contributions
 myself-0.145
 you-0.136
 Go-0.131
 yourselves-0.129
 yourself-0.126
Token She
Token ID1375
Feature activation-4.89e-02

Pos logit contributions
 surrounded+0.044
 he+0.041
 distance+0.038
 full+0.037
 curled+0.037
Neg logit contributions
 myself-0.055
 Go-0.054
 yourself-0.050
 yourselves-0.050
 Deal-0.049
Token was
Token ID373
Feature activation-2.31e-02

Pos logit contributions
 surrounded+0.037
 full+0.029
 clearing+0.029
 distance+0.029
 wooden+0.028
Neg logit contributions
 myself-0.125
 Go-0.123
 you-0.121
 yourself-0.121
 yourselves-0.121
Token feeling
Token ID4203
Feature activation-3.64e-02

Pos logit contributions
 he+0.032
 barrel+0.030
 distance+0.029
 rabbit+0.029
 surrounded+0.029
Neg logit contributions
 myself-0.045
 yourself-0.042
 yourselves-0.042
 Go-0.041
 you-0.040
Token a
Token ID257
Feature activation-3.40e-02

Pos logit contributions
 surrounded+0.050
 barrel+0.049
 he+0.047
 clearing+0.047
 rabbit+0.047
Neg logit contributions
 myself-0.071
 yourself-0.067
 yourselves-0.065
 you-0.065
 Go-0.064
Token As
Token ID1081
Feature activation-4.63e-02

Pos logit contributions
 surrounded+0.025
 full+0.021
 curled+0.021
 distance+0.020
 spotted+0.020
Neg logit contributions
 myself-0.036
 Go-0.035
ases-0.033
 Deal-0.033
rr-0.033
Token he
Token ID339
Feature activation-2.53e-02

Pos logit contributions
 surrounded+0.041
 curled+0.035
 spotted+0.031
 hollow+0.030
 wooden+0.029
Neg logit contributions
 you-0.119
 myself-0.116
 ourselves-0.110
 us-0.106
 yourself-0.103
Token walked
Token ID6807
Feature activation-4.48e-02

Pos logit contributions
 surrounded+0.027
 he+0.023
 full+0.022
 distance+0.022
 clearing+0.021
Neg logit contributions
 myself-0.060
ases-0.060
 Go-0.059
rr-0.057
uch-0.057
Token into
Token ID656
Feature activation-4.54e-02

Pos logit contributions
 surrounded+0.071
 curled+0.058
 roasted+0.058
 hollow+0.056
 warmed+0.056
Neg logit contributions
 myself-0.083
 you-0.075
 yourself-0.072
 Go-0.071
ases-0.071
Token the
Token ID262
Feature activation-4.45e-02

Pos logit contributions
 surrounded+0.049
 he+0.041
 full+0.040
 distance+0.037
 spotted+0.037
Neg logit contributions
 myself-0.112
ases-0.108
 Go-0.106
rr-0.104
 yourselves-0.104
Token classroom
Token ID15806
Feature activation-7.96e-02

Pos logit contributions
 surrounded+0.077
 he+0.070
 full+0.066
 curled+0.065
 clearing+0.064
Neg logit contributions
 myself-0.082
ases-0.076
 Go-0.076
 yourselves-0.074
 yourself-0.074
Token,
Token ID11
Feature activation-5.73e-02

Pos logit contributions
 surrounded+0.112
 clearing+0.100
 wolf+0.099
 distance+0.099
 hollow+0.098
Neg logit contributions
 myself-0.157
 you-0.152
 Go-0.150
rr-0.145
ases-0.143
Token he
Token ID339
Feature activation-4.68e-02

Pos logit contributions
 surrounded+0.060
 distance+0.058
 curled+0.058
yt+0.055
 doorway+0.053
Neg logit contributions
 you-0.126
 myself-0.120
 ourselves-0.115
't-0.113
 yourself-0.107
Token heard
Token ID2982
Feature activation-4.07e-02

Pos logit contributions
 surrounded+0.038
 he+0.031
 distance+0.030
 clearing+0.030
 full+0.029
Neg logit contributions
 myself-0.118
 Go-0.115
ases-0.114
uch-0.113
 yourself-0.112
Token the
Token ID262
Feature activation-3.01e-02

Pos logit contributions
 surrounded+0.064
 distance+0.056
 clearing+0.055
 barrel+0.053
 full+0.053
Neg logit contributions
 myself-0.077
 Go-0.073
 yourself-0.072
 you-0.070
 yourselves-0.069
Token teacher
Token ID4701
Feature activation-4.60e-02

Pos logit contributions
 surrounded+0.046
 he+0.041
 full+0.039
 clearing+0.038
 spotted+0.037
Neg logit contributions
 myself-0.062
ases-0.058
 Go-0.058
rr-0.057
 yourselves-0.057
Token girl
Token ID2576
Feature activation-4.00e-02

Pos logit contributions
 surrounded+0.013
 he+0.011
 full+0.010
 distance+0.009
 clearing+0.009
Neg logit contributions
 myself-0.035
ases-0.034
 Go-0.034
rr-0.033
 yourselves-0.033
Token was
Token ID373
Feature activation-3.06e-02

Pos logit contributions
 surrounded+0.041
 he+0.035
 distance+0.034
 full+0.033
 clearing+0.033
Neg logit contributions
 myself-0.097
 Go-0.095
ases-0.094
 yourself-0.091
 yourselves-0.091
Token walking
Token ID6155
Feature activation-4.49e-02

Pos logit contributions
 he+0.040
 surrounded+0.040
 distance+0.037
 barrel+0.035
 bushes+0.035
Neg logit contributions
 myself-0.064
 Go-0.060
 yourself-0.059
ases-0.059
 yourselves-0.058
Token around
Token ID1088
Feature activation-5.79e-02

Pos logit contributions
 surrounded+0.065
 curled+0.060
 miner+0.058
 widened+0.057
 hollow+0.057
Neg logit contributions
 myself-0.088
 you-0.081
 ourselves-0.077
 yourself-0.076
ases-0.074
Token the
Token ID262
Feature activation-3.81e-02

Pos logit contributions
 surrounded+0.078
 curled+0.074
 miner+0.073
 hollow+0.073
 barrel+0.072
Neg logit contributions
 myself-0.112
 you-0.110
 yourself-0.104
 us-0.102
 ourselves-0.102
Token garden
Token ID11376
Feature activation-7.94e-02

Pos logit contributions
 surrounded+0.054
 he+0.049
 full+0.045
 curled+0.044
 spotted+0.043
Neg logit contributions
 myself-0.082
 yourselves-0.076
 yourself-0.076
 ourselves-0.075
 Go-0.074
Token with
Token ID351
Feature activation-4.72e-02

Pos logit contributions
 wolf+0.121
 hunter+0.114
 woman+0.105
 deer+0.105
 miner+0.105
Neg logit contributions
 myself-0.142
 you-0.132
 yourself-0.131
 ourselves-0.130
 yourselves-0.120
Token her
Token ID607
Feature activation-4.40e-02

Pos logit contributions
 surrounded+0.078
 he+0.072
 distance+0.068
 clearing+0.068
 spotted+0.067
Neg logit contributions
 myself-0.089
 yourself-0.081
ases-0.080
 Go-0.079
 ourselves-0.079
Token big
Token ID1263
Feature activation-4.72e-02

Pos logit contributions
 surrounded+0.080
 he+0.073
 curled+0.071
 clearing+0.070
 distance+0.069
Neg logit contributions
 myself-0.071
 ourselves-0.067
ases-0.067
 Go-0.065
 you-0.065
Token brother
Token ID3956
Feature activation-6.07e-02

Pos logit contributions
 surrounded+0.064
 he+0.058
 full+0.052
 clearing+0.052
 distance+0.051
Neg logit contributions
 myself-0.101
ases-0.097
 Go-0.097
rr-0.095
 Deal-0.094
Token.
Token ID13
Feature activation-2.31e-02

Pos logit contributions
 surrounded+0.095
 distance+0.094
 clearing+0.089
 doorway+0.083
 wooden+0.083
Neg logit contributions
 Go-0.102
 myself-0.098
 you-0.097
rr-0.092
ases-0.089
Token.
Token ID13
Feature activation-4.86e-03

Pos logit contributions
 surrounded+0.089
 hollow+0.077
 miner+0.077
 barrel+0.076
 rabbit+0.076
Neg logit contributions
 myself-0.118
 Go-0.113
 you-0.113
ases-0.113
 ourselves-0.112
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
 surrounded+0.006
 he+0.006
 full+0.005
 distance+0.005
 spotted+0.005
Neg logit contributions
 myself-0.011
ases-0.010
 Go-0.010
rr-0.010
 yourselves-0.010
Token
Token ID198
Feature activation+3.01e-02

Pos logit contributions
 myself+0.055
ases+0.054
uch+0.052
 yourselves+0.052
 yourself+0.052
Neg logit contributions
 he-0.022
 surrounded-0.020
 full-0.018
 spotted-0.016
 distance-0.015
TokenInside
Token ID24441
Feature activation-3.94e-02

Pos logit contributions
 myself+0.066
ases+0.064
 Go+0.061
rr+0.061
 yourselves+0.061
Neg logit contributions
 surrounded-0.042
 he-0.040
 full-0.038
 spotted-0.035
 distance-0.034
Token the
Token ID262
Feature activation-4.06e-02

Pos logit contributions
 surrounded+0.044
 he+0.036
 full+0.035
 distance+0.035
 wooden+0.035
Neg logit contributions
 myself-0.092
ases-0.088
 ourselves-0.087
 Go-0.087
 you-0.086
Token lab
Token ID2248
Feature activation-7.94e-02

Pos logit contributions
 surrounded+0.070
 curled+0.064
 shone+0.063
 perched+0.062
 he+0.059
Neg logit contributions
 myself-0.064
 you-0.063
 Deal-0.062
 Go-0.060
rr-0.060
Token was
Token ID373
Feature activation-3.82e-02

Pos logit contributions
 surrounded+0.096
 distance+0.087
 wolf+0.083
 wooden+0.082
 hollow+0.082
Neg logit contributions
 you-0.163
 myself-0.162
 Go-0.160
 ourselves-0.154
 yourself-0.152
Token very
Token ID845
Feature activation-1.50e-02

Pos logit contributions
 surrounded+0.054
 distance+0.051
 he+0.048
 woman+0.048
 rabbit+0.048
Neg logit contributions
 myself-0.069
 yourself-0.066
 yourselves-0.064
 Go-0.064
 ourselves-0.062
Token cold
Token ID4692
Feature activation-1.64e-02

Pos logit contributions
 surrounded+0.021
 he+0.018
 distance+0.018
 which+0.017
 clearing+0.017
Neg logit contributions
 myself-0.031
 Go-0.029
 yourselves-0.029
 ourselves-0.029
 yourself-0.029
Token so
Token ID523
Feature activation-2.97e-02

Pos logit contributions
 surrounded+0.024
 spotted+0.022
 clearing+0.020
 curled+0.020
 hollow+0.020
Neg logit contributions
 myself-0.032
 you-0.031
 ourselves-0.031
 yourselves-0.030
 yourself-0.030
Token Sandy
Token ID17581
Feature activation-2.77e-02

Pos logit contributions
 surrounded+0.046
 clearing+0.043
 distance+0.042
 bushes+0.040
 fields+0.039
Neg logit contributions
 myself-0.056
 you-0.054
 Go-0.053
 ourselves-0.052
 yourselves-0.052
Token Jimmy
Token ID12963
Feature activation-8.42e-03

Pos logit contributions
 myself+0.021
 Go+0.020
 you+0.019
ases+0.019
 yourself+0.019
Neg logit contributions
 surrounded-0.013
 he-0.011
 full-0.011
 distance-0.011
 wooden-0.011
Token was
Token ID373
Feature activation-1.70e-02

Pos logit contributions
 surrounded+0.009
 he+0.008
 distance+0.007
 full+0.007
 clearing+0.007
Neg logit contributions
 myself-0.020
 Go-0.020
ases-0.020
rr-0.019
 yourself-0.019
Token walking
Token ID6155
Feature activation-4.20e-02

Pos logit contributions
 surrounded+0.024
 he+0.023
 distance+0.021
 full+0.020
 spotted+0.020
Neg logit contributions
 myself-0.036
 Go-0.034
ases-0.034
 yourselves-0.033
 yourself-0.033
Token down
Token ID866
Feature activation-4.54e-02

Pos logit contributions
 surrounded+0.063
 curled+0.057
 hollow+0.056
 widened+0.056
 eerie+0.055
Neg logit contributions
 myself-0.082
 you-0.073
ases-0.073
 ourselves-0.072
 Go-0.071
Token the
Token ID262
Feature activation-3.10e-02

Pos logit contributions
 surrounded+0.062
 curled+0.058
 miner+0.055
 spotted+0.054
 hollow+0.054
Neg logit contributions
 myself-0.093
 you-0.089
 yourselves-0.087
ases-0.087
 yourself-0.087
Token hall
Token ID6899
Feature activation-7.91e-02

Pos logit contributions
 surrounded+0.041
 spotted+0.033
 he+0.033
 full+0.032
 curled+0.032
Neg logit contributions
 myself-0.070
 yourselves-0.064
 ourselves-0.064
 you-0.063
 yourself-0.063
Token when
Token ID618
Feature activation-5.82e-02

Pos logit contributions
 wolf+0.122
 rabbit+0.115
 surrounded+0.115
 hollow+0.113
 deer+0.113
Neg logit contributions
 myself-0.139
 you-0.127
 Go-0.127
ases-0.122
rr-0.120
Token he
Token ID339
Feature activation-3.22e-02

Pos logit contributions
 surrounded+0.035
 clearing+0.024
 curled+0.024
 hollow+0.023
 wooden+0.023
Neg logit contributions
 you-0.158
 myself-0.157
 ourselves-0.151
 yourselves-0.150
 yourself-0.149
Token heard
Token ID2982
Feature activation-3.72e-02

Pos logit contributions
 surrounded+0.023
 wooden+0.020
 which+0.018
 full+0.017
 he+0.017
Neg logit contributions
uch-0.081
 myself-0.080
 you-0.080
 Go-0.079
 yourself-0.079
Token a
Token ID257
Feature activation-2.60e-02

Pos logit contributions
 surrounded+0.060
 distance+0.052
 clearing+0.050
 spotted+0.050
 full+0.050
Neg logit contributions
 myself-0.069
 Go-0.066
 yourself-0.065
 yourselves-0.063
 you-0.063
Token strange
Token ID6283
Feature activation-1.36e-02

Pos logit contributions
 surrounded+0.041
 he+0.035
 full+0.034
 spotted+0.033
 fields+0.033
Neg logit contributions
 myself-0.049
 Go-0.048
rr-0.048
 you-0.046
 Deal-0.045
Token.
Token ID13
Feature activation-4.73e-03

Pos logit contributions
 surrounded+0.038
 he+0.032
 full+0.032
 clearing+0.031
 spotted+0.031
Neg logit contributions
 myself-0.061
 Go-0.056
ases-0.055
 ourselves-0.055
 you-0.054
Token She
Token ID1375
Feature activation-3.62e-02

Pos logit contributions
 surrounded+0.006
 he+0.006
 full+0.005
 distance+0.005
 spotted+0.005
Neg logit contributions
 myself-0.011
 Go-0.010
ases-0.010
 yourself-0.010
 yourselves-0.010
Token ran
Token ID4966
Feature activation-2.63e-02

Pos logit contributions
 surrounded+0.046
 distance+0.039
 full+0.039
 he+0.038
 clearing+0.037
Neg logit contributions
 myself-0.076
 Go-0.076
ases-0.075
rr-0.070
 you-0.070
Token around
Token ID1088
Feature activation-4.75e-02

Pos logit contributions
 surrounded+0.035
 shone+0.029
 he+0.028
 full+0.028
 wooden+0.027
Neg logit contributions
 myself-0.060
 Go-0.055
ases-0.054
 you-0.053
 ourselves-0.052
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
 surrounded+0.060
 full+0.052
 he+0.052
 distance+0.052
 hollow+0.050
Neg logit contributions
 myself-0.105
ases-0.097
 you-0.097
 Go-0.095
 ourselves-0.094
Token room
Token ID2119
Feature activation-7.89e-02

Pos logit contributions
 surrounded+0.016
 he+0.015
 full+0.013
 distance+0.012
 curled+0.012
Neg logit contributions
 myself-0.037
 yourself-0.035
 yourselves-0.035
ases-0.035
 Go-0.034
Token and
Token ID290
Feature activation-4.19e-02

Pos logit contributions
 wolf+0.106
 surrounded+0.103
 hollow+0.102
 clearing+0.100
 lamb+0.098
Neg logit contributions
 myself-0.149
ases-0.136
 you-0.131
 Go-0.128
rr-0.126
Token out
Token ID503
Feature activation-4.59e-02

Pos logit contributions
 miner+0.051
 hut+0.051
 wooden+0.051
 barrel+0.049
 wolf+0.049
Neg logit contributions
 myself-0.079
 you-0.077
oh-0.072
 Go-0.069
 us-0.068
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
 surrounded+0.062
 clearing+0.060
 barrel+0.059
 woods+0.059
 distance+0.058
Neg logit contributions
ases-0.093
 myself-0.089
 Go-0.088
rr-0.083
 you-0.082
Token door
Token ID3420
Feature activation-6.51e-02

Pos logit contributions
 surrounded+0.021
 he+0.019
 full+0.017
 spotted+0.016
 curled+0.016
Neg logit contributions
 myself-0.035
ases-0.033
 Go-0.033
rr-0.033
 yourself-0.033
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
 wolf+0.089
 barrel+0.086
 miner+0.085
 hunter+0.082
 clearing+0.080
Neg logit contributions
 myself-0.120
ases-0.115
 Go-0.113
 you-0.108
oh-0.103
TokenLu
Token ID25596
Feature activation+1.39e-02

Pos logit contributions
 myself+0.046
 Go+0.044
 yourself+0.042
 yourselves+0.042
ases+0.042
Neg logit contributions
 surrounded-0.041
 he-0.038
 full-0.036
 wooden-0.035
 distance-0.034
Tokency
Token ID948
Feature activation-4.72e-02

Pos logit contributions
 myself+0.023
ases+0.021
 Go+0.021
rr+0.020
 yourselves+0.020
Neg logit contributions
 surrounded-0.027
 he-0.025
 full-0.024
 distance-0.023
 wooden-0.023
Token walked
Token ID6807
Feature activation-3.82e-02

Pos logit contributions
 surrounded+0.052
 distance+0.048
 he+0.045
 full+0.044
 clearing+0.042
Neg logit contributions
 myself-0.106
 Go-0.103
rr-0.101
ases-0.101
 you-0.097
Token around
Token ID1088
Feature activation-5.41e-02

Pos logit contributions
 surrounded+0.063
 wolf+0.060
 chick+0.059
 miner+0.057
 rabbit+0.056
Neg logit contributions
 myself-0.064
 you-0.052
 yourself-0.051
 ourselves-0.050
 Go-0.050
Token the
Token ID262
Feature activation-2.69e-02

Pos logit contributions
 surrounded+0.072
 wolf+0.069
 barrel+0.069
 hollow+0.068
aver+0.068
Neg logit contributions
 myself-0.107
 you-0.100
 yourself-0.096
 ourselves-0.095
 yourselves-0.094
Token attic
Token ID46113
Feature activation-7.87e-02

Pos logit contributions
 surrounded+0.040
 he+0.037
 curled+0.033
 spotted+0.033
 full+0.033
Neg logit contributions
 myself-0.056
 yourselves-0.052
 yourself-0.052
 Go-0.052
 ourselves-0.050
Token and
Token ID290
Feature activation-5.13e-02

Pos logit contributions
 polar+0.122
 wolf+0.122
 roasted+0.115
 Mighty+0.113
 lamb+0.112
Neg logit contributions
 myself-0.136
 you-0.120
 ourselves-0.115
 yourself-0.111
 yourselves-0.107
Token noticed
Token ID6810
Feature activation-5.15e-02

Pos logit contributions
 wolf+0.068
 chick+0.063
 hunter+0.063
 polar+0.061
 barrel+0.061
Neg logit contributions
 myself-0.099
 you-0.096
oh-0.094
 yourself-0.090
 yourselves-0.084
Token something
Token ID1223
Feature activation-2.66e-02

Pos logit contributions
 surrounded+0.077
 distance+0.073
 full+0.067
 clearing+0.065
 spotted+0.065
Neg logit contributions
 myself-0.101
 yourself-0.096
 yourselves-0.094
 you-0.094
 ourselves-0.094
Token strange
Token ID6283
Feature activation-1.39e-02

Pos logit contributions
 surrounded+0.039
 distance+0.037
 fields+0.036
 clearing+0.036
 valleys+0.035
Neg logit contributions
 you-0.045
 Go-0.044
 myself-0.044
 yourself-0.041
 ourselves-0.041
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
 surrounded+0.021
 distance+0.020
 clearing+0.020
 wooden+0.019
 woman+0.018
Neg logit contributions
 myself-0.023
 Go-0.023
ases-0.023
 you-0.022
 yourself-0.022
Token little
Token ID1310
Feature activation-1.14e-02

Pos logit contributions
 surrounded+0.009
 he+0.008
 full+0.007
 spotted+0.007
 which+0.007
Neg logit contributions
 myself-0.016
 Go-0.015
 yourselves-0.015
rr-0.015
 yourself-0.015
Token girl
Token ID2576
Feature activation-3.53e-02

Pos logit contributions
 surrounded+0.014
 he+0.013
 which+0.012
 full+0.012
 spotted+0.011
Neg logit contributions
 myself-0.025
 you-0.024
 yourself-0.023
 Go-0.023
 yourselves-0.023
Token ran
Token ID4966
Feature activation-2.67e-02

Pos logit contributions
 surrounded+0.045
 distance+0.041
 woods+0.039
 he+0.039
 full+0.038
Neg logit contributions
 myself-0.076
 Go-0.073
ases-0.069
rr-0.069
 you-0.067
Token around
Token ID1088
Feature activation-4.91e-02

Pos logit contributions
 surrounded+0.040
 shone+0.034
 he+0.033
 full+0.033
 woods+0.032
Neg logit contributions
 myself-0.054
 Go-0.050
ases-0.048
 you-0.048
 ourselves-0.047
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
 surrounded+0.065
 distance+0.056
 full+0.056
 hollow+0.056
 barrel+0.055
Neg logit contributions
 myself-0.102
 you-0.095
 Go-0.093
ases-0.093
 ourselves-0.092
Token room
Token ID2119
Feature activation-7.81e-02

Pos logit contributions
 surrounded+0.034
 he+0.031
 full+0.029
 spotted+0.027
 distance+0.027
Neg logit contributions
 myself-0.045
 yourselves-0.042
 Go-0.042
 yourself-0.042
ases-0.041
Token,
Token ID11
Feature activation-5.64e-02

Pos logit contributions
 wolf+0.109
 surrounded+0.104
 clearing+0.104
 rabbit+0.102
 barrel+0.102
Neg logit contributions
 myself-0.143
 you-0.127
ases-0.125
 ourselves-0.124
 Go-0.122
Token looking
Token ID2045
Feature activation-4.63e-02

Pos logit contributions
 wolf+0.074
 woods+0.073
bread+0.071
ger+0.070
 crops+0.070
Neg logit contributions
 myself-0.104
 you-0.090
 yourself-0.087
 Go-0.082
 yourselves-0.081
Token in
Token ID287
Feature activation-4.76e-02

Pos logit contributions
 surrounded+0.069
 clearing+0.065
 woman+0.063
 miner+0.063
 stream+0.062
Neg logit contributions
 myself-0.071
 you-0.069
 us-0.069
ases-0.067
 Go-0.066
Token all
Token ID477
Feature activation-4.10e-02

Pos logit contributions
 surrounded+0.062
 miner+0.055
 he+0.055
 roasted+0.052
 drove+0.052
Neg logit contributions
 myself-0.098
 yourself-0.093
 yourselves-0.092
ases-0.092
 ourselves-0.089
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
 surrounded+0.054
 miner+0.051
 spotted+0.050
 wolf+0.049
 rabbit+0.048
Neg logit contributions
 myself-0.088
ases-0.080
 yourself-0.078
 you-0.078
 ourselves-0.076
Tokenina
Token ID1437
Feature activation-2.41e-03

Pos logit contributions
 myself+0.110
 Deal+0.109
ases+0.108
 Go+0.107
 yourselves+0.103
Neg logit contributions
 surrounded-0.078
 he-0.075
 full-0.069
 woods-0.069
 clearing-0.067
Token was
Token ID373
Feature activation-2.03e-02

Pos logit contributions
 surrounded+0.002
 he+0.002
 full+0.002
 distance+0.002
 clearing+0.002
Neg logit contributions
 myself-0.006
 Go-0.006
ases-0.006
 yourself-0.006
 yourselves-0.006
Token walking
Token ID6155
Feature activation-4.25e-02

Pos logit contributions
 surrounded+0.026
 he+0.025
 distance+0.023
 bushes+0.022
 spotted+0.022
Neg logit contributions
 myself-0.044
 Go-0.042
ases-0.042
 yourself-0.041
 yourselves-0.041
Token through
Token ID832
Feature activation-5.91e-02

Pos logit contributions
 surrounded+0.061
 curled+0.055
 hollow+0.055
 widened+0.052
 miner+0.052
Neg logit contributions
 myself-0.085
 you-0.077
 ourselves-0.075
 Go-0.073
 yourself-0.073
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
 surrounded+0.076
 he+0.069
 miner+0.067
 full+0.066
 curled+0.065
Neg logit contributions
 myself-0.129
 yourself-0.121
 you-0.120
 yourselves-0.118
 ourselves-0.117
Token garden
Token ID11376
Feature activation-7.80e-02

Pos logit contributions
 surrounded+0.049
 he+0.043
 curled+0.042
 full+0.040
 spotted+0.039
Neg logit contributions
 myself-0.076
 yourself-0.072
 yourselves-0.071
 ourselves-0.070
 you-0.069
Token with
Token ID351
Feature activation-4.95e-02

Pos logit contributions
 wolf+0.107
 hunter+0.097
 curled+0.097
 barrel+0.096
 rabbit+0.096
Neg logit contributions
 myself-0.152
 you-0.143
 yourself-0.142
 ourselves-0.139
 yourselves-0.134
Token her
Token ID607
Feature activation-3.92e-02

Pos logit contributions
 surrounded+0.081
 he+0.074
 distance+0.071
 clearing+0.071
 spotted+0.069
Neg logit contributions
 myself-0.093
 yourself-0.086
ases-0.085
 Go-0.084
 ourselves-0.084
Token mom
Token ID1995
Feature activation-4.89e-02

Pos logit contributions
 surrounded+0.069
 he+0.063
 full+0.060
 distance+0.059
 clearing+0.059
Neg logit contributions
 myself-0.069
ases-0.065
 Go-0.065
 ourselves-0.063
 yourselves-0.062
Token.
Token ID13
Feature activation-2.38e-02

Pos logit contributions
 surrounded+0.073
 he+0.065
 clearing+0.064
 curled+0.064
 distance+0.064
Neg logit contributions
 myself-0.093
 Go-0.088
rr-0.086
 you-0.086
 ourselves-0.085
Token She
Token ID1375
Feature activation-4.58e-02

Pos logit contributions
 surrounded+0.032
 he+0.029
 distance+0.027
 full+0.026
 curled+0.026
Neg logit contributions
 Go-0.049
 myself-0.049
 yourselves-0.046
 Deal-0.046
 ourselves-0.046
Token soon
Token ID2582
Feature activation-3.56e-02

Pos logit contributions
 surrounded+0.041
 he+0.030
 full+0.029
 clearing+0.029
 spotted+0.029
Neg logit contributions
 myself-0.129
 Go-0.123
 you-0.121
ases-0.120
 yourself-0.120
Token as
Token ID355
Feature activation-3.93e-02

Pos logit contributions
 barrel+0.055
 hollow+0.052
 hut+0.048
 clearing+0.048
 surrounded+0.047
Neg logit contributions
 you-0.064
ases-0.063
 Go-0.061
 myself-0.060
 us-0.057
Token she
Token ID673
Feature activation-1.38e-02

Pos logit contributions
 surrounded+0.025
 he+0.021
 full+0.018
 distance+0.015
 spotted+0.014
Neg logit contributions
 myself-0.115
ases-0.112
 Go-0.111
rr-0.110
 yourselves-0.109
Token entered
Token ID5982
Feature activation-5.87e-02

Pos logit contributions
 surrounded+0.014
 he+0.013
 full+0.012
 distance+0.011
 spotted+0.011
Neg logit contributions
 myself-0.035
ases-0.034
 Go-0.033
rr-0.033
 yourselves-0.032
Token the
Token ID262
Feature activation-9.38e-03

Pos logit contributions
 surrounded+0.068
 he+0.058
 full+0.057
 distance+0.054
 spotted+0.053
Neg logit contributions
 myself-0.142
ases-0.134
 Go-0.132
 yourself-0.132
 yourselves-0.131
Token room
Token ID2119
Feature activation-7.77e-02

Pos logit contributions
 surrounded+0.016
 he+0.015
 full+0.014
 distance+0.014
 woods+0.014
Neg logit contributions
 myself-0.018
ases-0.017
 Go-0.017
 Deal-0.017
rr-0.017
Token and
Token ID290
Feature activation-3.12e-02

Pos logit contributions
 surrounded+0.104
 clearing+0.092
 distance+0.091
 hollow+0.090
 curled+0.089
Neg logit contributions
 myself-0.158
 you-0.152
 Go-0.150
ases-0.148
 yourself-0.145
Token started
Token ID2067
Feature activation-2.57e-02

Pos logit contributions
 surrounded+0.035
 full+0.031
 clearing+0.029
 distance+0.029
 he+0.029
Neg logit contributions
 myself-0.073
 you-0.069
ases-0.069
 Go-0.069
 yourself-0.068
Token to
Token ID284
Feature activation+9.83e-03

Pos logit contributions
 surrounded+0.042
 distance+0.036
 he+0.036
 full+0.036
 clearing+0.036
Neg logit contributions
 myself-0.048
ases-0.046
 Go-0.045
 yourself-0.044
rr-0.043
Token strike
Token ID5587
Feature activation-3.17e-02

Pos logit contributions
 myself+0.018
ases+0.017
 Go+0.017
 yourselves+0.017
rr+0.017
Neg logit contributions
 surrounded-0.017
 he-0.016
 full-0.015
 distance-0.014
 spotted-0.014
Token the
Token ID262
Feature activation-2.82e-03

Pos logit contributions
 surrounded+0.044
 he+0.039
 full+0.037
 distance+0.035
 spotted+0.035
Neg logit contributions
 myself-0.070
ases-0.067
 Go-0.066
 yourself-0.064
 yourselves-0.064
Token Charlotte
Token ID14685
Feature activation-6.23e-03

Pos logit contributions
 surrounded+0.038
 distance+0.031
 spotted+0.029
 clearing+0.029
 wooden+0.029
Neg logit contributions
 myself-0.089
 you-0.085
 ourselves-0.082
 Go-0.082
 yourself-0.082
Token was
Token ID373
Feature activation+5.63e-04

Pos logit contributions
 surrounded+0.006
 he+0.006
 full+0.005
 distance+0.005
 wooden+0.005
Neg logit contributions
 myself-0.015
 Go-0.015
ases-0.015
rr-0.014
uch-0.014
Token walking
Token ID6155
Feature activation-4.81e-02

Pos logit contributions
 myself+0.001
 Go+0.001
ases+0.001
 yourself+0.001
 yourselves+0.001
Neg logit contributions
 surrounded-0.001
 he-0.001
 distance-0.001
 wooden-0.001
 rabbit-0.001
Token around
Token ID1088
Feature activation-5.46e-02

Pos logit contributions
 surrounded+0.073
 roasted+0.067
 widened+0.067
 drove+0.066
 curled+0.064
Neg logit contributions
 myself-0.090
 you-0.080
 yourself-0.079
 ourselves-0.077
ases-0.074
Token the
Token ID262
Feature activation-2.61e-02

Pos logit contributions
 surrounded+0.074
 miner+0.069
 drove+0.069
 barrel+0.068
 hollow+0.067
Neg logit contributions
 you-0.104
 myself-0.101
 yourself-0.094
 us-0.094
 ourselves-0.092
Token garden
Token ID11376
Feature activation-7.76e-02

Pos logit contributions
 surrounded+0.029
 he+0.027
 drove+0.022
 curled+0.022
 full+0.021
Neg logit contributions
 myself-0.064
 yourself-0.062
 yourselves-0.061
 ourselves-0.060
 you-0.058
Token,
Token ID11
Feature activation-6.34e-02

Pos logit contributions
 wolf+0.110
 wooden+0.110
 hunter+0.109
 drove+0.108
 barrel+0.108
Neg logit contributions
 you-0.135
 myself-0.129
 yourself-0.122
 ourselves-0.117
 us-0.115
Token she
Token ID673
Feature activation-4.30e-02

Pos logit contributions
 barrel+0.082
 polar+0.078
 miner+0.077
 wooden+0.075
 crocod+0.070
Neg logit contributions
 you-0.133
 yourself-0.124
 myself-0.123
 yourselves-0.114
 Go-0.113
Token noticed
Token ID6810
Feature activation-5.31e-02

Pos logit contributions
 surrounded+0.033
 wooden+0.030
 he+0.029
 barrel+0.027
 full+0.027
Neg logit contributions
 myself-0.105
 yourself-0.102
 you-0.101
 Go-0.100
uch-0.100
Token a
Token ID257
Feature activation-3.38e-02

Pos logit contributions
 surrounded+0.079
 distance+0.073
 wooden+0.071
 full+0.070
 clearing+0.069
Neg logit contributions
 myself-0.100
 yourself-0.097
 you-0.094
 yourselves-0.092
 ourselves-0.090
Token big
Token ID1263
Feature activation-2.90e-02

Pos logit contributions
 surrounded+0.045
 he+0.040
 curled+0.035
 drove+0.034
 fields+0.034
Neg logit contributions
 myself-0.068
rr-0.065
 yourself-0.064
 Go-0.064
 you-0.064
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
 surrounded+0.016
 he+0.014
 full+0.013
 distance+0.013
 clearing+0.013
Neg logit contributions
 myself-0.023
 Go-0.022
ases-0.021
 yourselves-0.021
 ourselves-0.021
Token
Token ID198
Feature activation+2.43e-02

Pos logit contributions
 myself+0.039
ases+0.039
 yourselves+0.037
uch+0.037
 Go+0.036
Neg logit contributions
 he-0.016
 surrounded-0.016
 full-0.014
 spotted-0.013
 distance-0.013
TokenAs
Token ID1722
Feature activation-5.00e-02

Pos logit contributions
 myself+0.051
ases+0.048
 Go+0.048
rr+0.047
 yourselves+0.047
Neg logit contributions
 surrounded-0.036
 he-0.033
 full-0.031
 distance-0.030
 spotted-0.030
Token she
Token ID673
Feature activation-3.14e-02

Pos logit contributions
 surrounded+0.061
 barrel+0.055
 clearing+0.054
 hollow+0.053
 curled+0.051
Neg logit contributions
 you-0.113
 myself-0.105
 ourselves-0.102
 Go-0.099
 yourself-0.099
Token went
Token ID1816
Feature activation-3.57e-02

Pos logit contributions
 surrounded+0.037
 distance+0.032
 he+0.032
 full+0.031
 knowing+0.030
Neg logit contributions
 myself-0.064
ases-0.064
 Go-0.063
rr-0.063
uch-0.063
Token upstairs
Token ID26148
Feature activation-7.76e-02

Pos logit contributions
 surrounded+0.059
 spotted+0.051
 widened+0.050
 rabbit+0.050
 chick+0.049
Neg logit contributions
 myself-0.064
 you-0.060
 yourself-0.059
 Go-0.058
ases-0.057
Token,
Token ID11
Feature activation-6.11e-02

Pos logit contributions
 surrounded+0.120
 hollow+0.116
 distance+0.113
 hunter+0.113
 rabbit+0.112
Neg logit contributions
 you-0.123
 myself-0.115
ases-0.114
 us-0.110
rr-0.109
Token Emmy
Token ID33953
Feature activation-4.35e-02

Pos logit contributions
 clearing+0.101
 camel+0.101
 surrounded+0.101
 hollow+0.100
 barrel+0.100
Neg logit contributions
 you-0.105
 myself-0.094
't-0.089
 yourselves-0.089
 yourself-0.088
Token noticed
Token ID6810
Feature activation-5.84e-02

Pos logit contributions
 surrounded+0.045
 he+0.042
 distance+0.041
 full+0.040
 barrel+0.040
Neg logit contributions
oh-0.090
 myself-0.089
uch-0.089
 Go-0.086
rr-0.086
Token something
Token ID1223
Feature activation-2.49e-02

Pos logit contributions
 surrounded+0.091
 distance+0.086
 clearing+0.081
 full+0.081
 miner+0.080
Neg logit contributions
 myself-0.107
 yourself-0.104
 yourselves-0.102
 you-0.101
 ourselves-0.098
Token strange
Token ID6283
Feature activation-1.42e-02

Pos logit contributions
 surrounded+0.035
 distance+0.033
 fields+0.033
 clearing+0.032
 valleys+0.031
Neg logit contributions
 you-0.044
 myself-0.044
 Go-0.043
 yourself-0.041
 ourselves-0.040
Token big
Token ID1263
Feature activation-3.76e-02

Pos logit contributions
 surrounded+0.069
 he+0.061
 beneath+0.056
 knowing+0.054
 though+0.054
Neg logit contributions
 you-0.095
 myself-0.094
 yourself-0.092
 Go-0.091
 us-0.091
Token classroom
Token ID15806
Feature activation-6.17e-02

Pos logit contributions
 surrounded+0.053
 he+0.048
 beneath+0.043
 knowing+0.043
 full+0.042
Neg logit contributions
 myself-0.073
 you-0.070
rr-0.070
 yourself-0.069
 Go-0.068
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
 surrounded+0.081
 wolf+0.081
 rabbit+0.080
 distance+0.077
 lamb+0.077
Neg logit contributions
 myself-0.120
 Go-0.115
 you-0.113
 yourself-0.107
 yourselves-0.106
Token Inside
Token ID14384
Feature activation-4.67e-02

Pos logit contributions
 surrounded+0.027
 stream+0.023
 distance+0.023
 rabbit+0.022
 curled+0.022
Neg logit contributions
 myself-0.044
 Go-0.044
 yourselves-0.041
 Deal-0.040
 yourself-0.040
Token the
Token ID262
Feature activation-3.19e-02

Pos logit contributions
 surrounded+0.048
 full+0.039
 wooden+0.038
 distance+0.038
 curled+0.038
Neg logit contributions
 myself-0.112
 Go-0.106
 you-0.106
ases-0.106
 ourselves-0.104
Token classroom
Token ID15806
Feature activation-7.73e-02

Pos logit contributions
 surrounded+0.051
 he+0.047
 full+0.043
 curled+0.043
 clearing+0.042
Neg logit contributions
 myself-0.063
ases-0.059
 Go-0.058
 yourself-0.058
rr-0.058
Token were
Token ID547
Feature activation-3.32e-02

Pos logit contributions
 surrounded+0.103
 distance+0.096
 wolf+0.090
 clearing+0.090
 rabbit+0.090
Neg logit contributions
 myself-0.158
 Go-0.155
 you-0.153
ases-0.147
 yourselves-0.146
Token many
Token ID867
Feature activation-3.53e-02

Pos logit contributions
 surrounded+0.047
 he+0.041
 distance+0.040
 clearing+0.040
 full+0.039
Neg logit contributions
 myself-0.070
 Go-0.066
ases-0.066
 yourselves-0.065
 yourself-0.065
Token children
Token ID1751
Feature activation-4.36e-02

Pos logit contributions
 surrounded+0.055
 he+0.049
 full+0.047
 distance+0.046
 clearing+0.046
Neg logit contributions
 myself-0.070
ases-0.067
 Go-0.065
 yourselves-0.064
rr-0.064
Token and
Token ID290
Feature activation-4.50e-02

Pos logit contributions
 surrounded+0.069
 distance+0.065
 clearing+0.064
 wooden+0.062
 he+0.060
Neg logit contributions
 Go-0.079
 myself-0.077
ases-0.077
 you-0.074
rr-0.073
Token a
Token ID257
Feature activation-4.98e-02

Pos logit contributions
 surrounded+0.065
 clearing+0.061
 distance+0.059
 hollow+0.057
 curled+0.056
Neg logit contributions
 myself-0.089
 you-0.086
 Go-0.086
 ourselves-0.085
 yourselves-0.084
Token
Token ID198
Feature activation+2.78e-02

Pos logit contributions
 myself+0.069
ases+0.069
 yourselves+0.067
uch+0.067
 ourselves+0.066
Neg logit contributions
 he-0.026
 surrounded-0.024
 clearing-0.022
 spotted-0.020
 full-0.019
TokenShe
Token ID3347
Feature activation-3.03e-02

Pos logit contributions
 myself+0.056
ases+0.055
 Go+0.053
 yourselves+0.052
 ourselves+0.052
Neg logit contributions
 surrounded-0.043
 he-0.040
 full-0.037
 clearing-0.036
 spotted-0.036
Token marched
Token ID23558
Feature activation-2.72e-02

Pos logit contributions
 surrounded+0.031
 distance+0.027
 he+0.027
 full+0.025
 clearing+0.025
Neg logit contributions
 myself-0.072
 Go-0.070
ases-0.070
rr-0.070
uch-0.067
Token around
Token ID1088
Feature activation-4.71e-02

Pos logit contributions
 surrounded+0.041
 woods+0.036
 hollow+0.036
 wolf+0.035
 clearing+0.035
Neg logit contributions
 myself-0.054
 Go-0.048
 ourselves-0.047
 yourself-0.047
 you-0.047
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
 surrounded+0.060
 he+0.052
 full+0.051
 distance+0.050
 hollow+0.049
Neg logit contributions
 myself-0.105
ases-0.098
 you-0.097
 Go-0.097
 ourselves-0.095
Token room
Token ID2119
Feature activation-7.73e-02

Pos logit contributions
 surrounded+0.038
 he+0.035
 full+0.032
 spotted+0.031
 distance+0.030
Neg logit contributions
 myself-0.055
 Go-0.052
ases-0.052
 yourselves-0.051
 yourself-0.051
Token,
Token ID11
Feature activation-5.25e-02

Pos logit contributions
 wolf+0.106
 surrounded+0.102
 hollow+0.101
 barrel+0.099
 rabbit+0.098
Neg logit contributions
 myself-0.149
ases-0.137
 you-0.135
 Go-0.133
rr-0.131
Token showing
Token ID4478
Feature activation-8.55e-03

Pos logit contributions
ger+0.078
 miner+0.077
 barrel+0.077
 wolf+0.076
 hunter+0.075
Neg logit contributions
 myself-0.089
 you-0.084
 yourself-0.079
 yourselves-0.077
 Go-0.074
Token her
Token ID607
Feature activation-1.73e-02

Pos logit contributions
 surrounded+0.012
 he+0.011
 full+0.010
 distance+0.010
 spotted+0.010
Neg logit contributions
 myself-0.018
ases-0.018
 Go-0.017
rr-0.017
 yourselves-0.017
Token parents
Token ID3397
Feature activation-3.54e-02

Pos logit contributions
 surrounded+0.025
 he+0.022
 full+0.021
 distance+0.021
 clearing+0.020
Neg logit contributions
 myself-0.037
ases-0.035
 Go-0.035
 yourself-0.034
 ourselves-0.034
Token her
Token ID607
Feature activation-1.29e-02

Pos logit contributions
 surrounded+0.048
 woods+0.041
 clearing+0.041
 he+0.041
 spotted+0.040
Neg logit contributions
 myself-0.077
 ourselves-0.072
 Go-0.071
 yourself-0.070
ases-0.070
TokenWhen
Token ID2215
Feature activation-3.61e-02

Pos logit contributions
 myself+0.058
ases+0.055
 Go+0.055
rr+0.054
 yourselves+0.053
Neg logit contributions
 surrounded-0.038
 he-0.034
 full-0.032
 distance-0.031
 spotted-0.030
Token he
Token ID339
Feature activation-2.42e-02

Pos logit contributions
 surrounded+0.032
 curled+0.030
 hollow+0.029
 distance+0.026
 spotted+0.025
Neg logit contributions
 you-0.094
 myself-0.088
 ourselves-0.086
 yourselves-0.082
 Go-0.082
Token walked
Token ID6807
Feature activation-4.49e-02

Pos logit contributions
 surrounded+0.022
 he+0.019
 full+0.018
 distance+0.017
 clearing+0.016
Neg logit contributions
 myself-0.063
ases-0.061
 Go-0.061
rr-0.059
 yourselves-0.059
Token into
Token ID656
Feature activation-4.22e-02

Pos logit contributions
 surrounded+0.069
 curled+0.059
 hollow+0.057
 full+0.056
 distance+0.055
Neg logit contributions
 myself-0.088
 you-0.081
 Go-0.080
ases-0.079
rr-0.079
Token the
Token ID262
Feature activation-3.71e-02

Pos logit contributions
 surrounded+0.042
 he+0.036
 full+0.033
 distance+0.031
 spotted+0.031
Neg logit contributions
 myself-0.109
ases-0.105
 Go-0.103
rr-0.102
 yourselves-0.102
Token classroom
Token ID15806
Feature activation-7.73e-02

Pos logit contributions
 surrounded+0.065
 he+0.059
 full+0.056
 spotted+0.055
 curled+0.055
Neg logit contributions
 myself-0.067
ases-0.063
 Go-0.063
 yourselves-0.061
 yourself-0.061
Token,
Token ID11
Feature activation-5.39e-02

Pos logit contributions
 surrounded+0.105
 distance+0.092
 clearing+0.092
 hollow+0.090
 wolf+0.090
Neg logit contributions
 myself-0.160
 Go-0.155
 you-0.154
ases-0.149
rr-0.149
Token he
Token ID339
Feature activation-4.23e-02

Pos logit contributions
 surrounded+0.039
 distance+0.036
 hollow+0.036
 clearing+0.036
 curled+0.033
Neg logit contributions
 you-0.140
 myself-0.137
 ourselves-0.128
 yourselves-0.128
 yourself-0.128
Token went
Token ID1816
Feature activation-2.87e-02

Pos logit contributions
 surrounded+0.032
 he+0.025
 distance+0.025
 clearing+0.025
 full+0.024
Neg logit contributions
 myself-0.111
 Go-0.108
ases-0.107
 yourself-0.105
 yourselves-0.104
Token straight
Token ID3892
Feature activation-2.00e-02

Pos logit contributions
 surrounded+0.047
 he+0.040
 full+0.038
 wooden+0.038
 fields+0.038
Neg logit contributions
 myself-0.055
 Go-0.051
ases-0.050
 yourself-0.049
 you-0.048
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
 surrounded+0.032
 he+0.028
 full+0.028
 distance+0.027
 curled+0.026
Neg logit contributions
 myself-0.039
ases-0.037
 Go-0.036
 yourselves-0.036
 yourself-0.036
TokenWhen
Token ID2215
Feature activation-4.44e-02

Pos logit contributions
 myself+0.060
ases+0.060
 ourselves+0.057
 Go+0.057
 yourselves+0.057
Neg logit contributions
 surrounded-0.035
 he-0.033
 full-0.031
 distance-0.029
 spotted-0.029
Token Lisa
Token ID15378
Feature activation-2.38e-02

Pos logit contributions
 surrounded+0.058
 hollow+0.052
 clearing+0.050
 miner+0.050
 barrel+0.049
Neg logit contributions
 you-0.097
 myself-0.091
 Go-0.090
 ourselves-0.088
ases-0.085
Token looked
Token ID3114
Feature activation-4.54e-02

Pos logit contributions
 surrounded+0.027
 wooden+0.024
 full+0.024
 he+0.023
 clearing+0.022
Neg logit contributions
 Go-0.051
uch-0.051
ases-0.050
 myself-0.050
 you-0.049
Token around
Token ID1088
Feature activation-5.62e-02

Pos logit contributions
 surrounded+0.079
 shone+0.071
 rabbit+0.070
 woods+0.070
 clearing+0.069
Neg logit contributions
 you-0.064
 myself-0.064
 us-0.061
ases-0.059
ags-0.059
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
 surrounded+0.085
 drove+0.075
aver+0.075
 miner+0.075
 hollow+0.074
Neg logit contributions
 you-0.102
 myself-0.096
 us-0.092
ases-0.090
 Go-0.090
Token room
Token ID2119
Feature activation-7.72e-02

Pos logit contributions
 surrounded+0.030
 he+0.029
 curled+0.026
 echoed+0.026
 spotted+0.026
Neg logit contributions
 myself-0.032
 yourselves-0.029
 yourself-0.029
 you-0.029
 Go-0.028
Token,
Token ID11
Feature activation-6.92e-02

Pos logit contributions
 surrounded+0.119
 wolf+0.118
 roasted+0.117
 clearing+0.115
 deer+0.115
Neg logit contributions
 you-0.124
 myself-0.122
 Go-0.109
 ourselves-0.109
ases-0.109
Token she
Token ID673
Feature activation-3.85e-02

Pos logit contributions
 polar+0.084
 clearing+0.079
 stream+0.078
 barrel+0.078
aver+0.077
Neg logit contributions
 you-0.150
 myself-0.140
 yourself-0.136
 yourselves-0.135
 Go-0.131
Token saw
Token ID2497
Feature activation-4.20e-02

Pos logit contributions
 surrounded+0.029
 wooden+0.025
 he+0.023
 distance+0.023
 clearing+0.022
Neg logit contributions
 myself-0.096
 you-0.093
uch-0.093
 Go-0.092
 yourself-0.092
Token that
Token ID326
Feature activation-5.17e-02

Pos logit contributions
 surrounded+0.063
 distance+0.055
 full+0.055
 clearing+0.054
 spotted+0.053
Neg logit contributions
 myself-0.083
 yourself-0.078
 yourselves-0.078
ases-0.078
 ourselves-0.076
Token the
Token ID262
Feature activation-8.76e-03

Pos logit contributions
 clearing+0.072
 surrounded+0.071
 stream+0.069
aver+0.067
 distance+0.067
Neg logit contributions
 you-0.096
 myself-0.094
 yourself-0.090
 yourselves-0.088
 Go-0.087
Token cool
Token ID3608
Feature activation+5.87e-02

Pos logit contributions
 myself+0.074
ases+0.070
 Go+0.070
 yourselves+0.068
rr+0.068
Neg logit contributions
 surrounded-0.042
 he-0.038
 full-0.036
 distance-0.034
 spotted-0.034
Token!"
Token ID2474
Feature activation-5.85e-03

Pos logit contributions
 myself+0.093
ases+0.088
 Go+0.086
rr+0.084
 yourselves+0.083
Neg logit contributions
 surrounded-0.117
 he-0.109
 full-0.105
 distance-0.102
 spotted-0.101
Token
Token ID198
Feature activation+2.42e-02

Pos logit contributions
 surrounded+0.010
 spotted+0.009
 distance+0.009
 clearing+0.009
 curled+0.009
Neg logit contributions
 myself-0.011
 Go-0.010
 yourselves-0.010
ases-0.010
 yourself-0.010
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
 myself+0.063
uch+0.062
ases+0.061
 Deal+0.059
 Go+0.059
Neg logit contributions
 he-0.024
 surrounded-0.020
 full-0.019
 clearing-0.016
 which-0.016
Token"
Token ID1
Feature activation+6.18e-02

Pos logit contributions
 myself+0.052
ases+0.049
 Go+0.048
rr+0.047
 yourselves+0.046
Neg logit contributions
 surrounded-0.062
 he-0.059
 full-0.056
 distance-0.054
 clearing-0.053
TokenMe
Token ID5308
Feature activation+8.15e-02

Pos logit contributions
 myself+0.118
ases+0.115
rr+0.112
 Go+0.111
uch+0.111
Neg logit contributions
 surrounded-0.103
 he-0.096
 full-0.091
 distance-0.087
 wooden-0.085
Tokenow
Token ID322
Feature activation+5.94e-02

Pos logit contributions
ases+0.169
 Go+0.166
 myself+0.166
vern+0.165
 Deal+0.164
Neg logit contributions
 surrounded-0.119
 full-0.116
 he-0.101
 filled-0.093
 which-0.093
Token!"
Token ID2474
Feature activation+4.11e-03

Pos logit contributions
 myself+0.082
ases+0.077
 Go+0.075
 Deal+0.075
 yourselves+0.073
Neg logit contributions
 surrounded-0.128
 he-0.122
 full-0.118
 spotted-0.112
 filled-0.112
Token the
Token ID262
Feature activation-5.95e-03

Pos logit contributions
 myself+0.008
 Go+0.007
 you+0.007
 yourselves+0.007
ases+0.007
Neg logit contributions
 surrounded-0.007
 woods-0.006
 spotted-0.006
 clearing-0.006
 shone-0.006
Token cat
Token ID3797
Feature activation+9.73e-04

Pos logit contributions
 surrounded+0.008
 he+0.007
 full+0.006
 curled+0.006
 spotted+0.006
Neg logit contributions
 myself-0.014
 Go-0.013
ases-0.013
rr-0.013
 yourselves-0.013
Token says
Token ID1139
Feature activation+1.40e-02

Pos logit contributions
 myself+0.002
ases+0.002
rr+0.001
 Go+0.001
 yourselves+0.001
Neg logit contributions
 surrounded-0.002
 he-0.002
 full-0.002
 spotted-0.002
 wooden-0.002
Token
Token ID198
Feature activation+2.25e-02

Pos logit contributions
 surrounded+0.003
 he+0.003
 full+0.002
 spotted+0.002
 clearing+0.002
Neg logit contributions
 myself-0.005
ases-0.004
 Go-0.004
rr-0.004
uch-0.004
Token
Token ID198
Feature activation+3.52e-02

Pos logit contributions
 myself+0.061
ases+0.061
uch+0.060
 Deal+0.058
 Go+0.058
Neg logit contributions
 he-0.020
 surrounded-0.016
 full-0.014
 which-0.013
 saw-0.013
TokenBen
Token ID11696
Feature activation-7.29e-03

Pos logit contributions
 myself+0.060
ases+0.058
 Go+0.056
 Deal+0.054
rr+0.054
Neg logit contributions
 surrounded-0.064
 he-0.062
 full-0.058
 distance-0.056
 spotted-0.056
Token said
Token ID531
Feature activation-2.11e-03

Pos logit contributions
 surrounded+0.008
 he+0.007
 distance+0.006
 full+0.006
 clearing+0.006
Neg logit contributions
 myself-0.018
 Go-0.017
ases-0.017
rr-0.017
 yourself-0.017
Token,
Token ID11
Feature activation-3.94e-04

Pos logit contributions
 surrounded+0.002
 he+0.002
 full+0.002
 distance+0.002
 clearing+0.002
Neg logit contributions
 myself-0.005
ases-0.005
 Go-0.005
rr-0.005
 yourselves-0.005
Token "
Token ID366
Feature activation+8.25e-02

Pos logit contributions
 surrounded+0.000
 distance+0.000
 woods+0.000
 clearing+0.000
 fields+0.000
Neg logit contributions
 myself-0.001
 Go-0.001
 you-0.001
 Deal-0.001
rr-0.001
TokenI
Token ID40
Feature activation+4.28e-02

Pos logit contributions
 myself+0.163
ases+0.162
 Deal+0.161
rr+0.156
 Go+0.155
Neg logit contributions
 surrounded-0.128
 he-0.124
 full-0.111
 distance-0.107
 which-0.105
Token don
Token ID836
Feature activation+8.13e-02

Pos logit contributions
 myself+0.075
ases+0.071
 Go+0.070
rr+0.068
 yourselves+0.068
Neg logit contributions
 surrounded-0.080
 he-0.074
 full-0.071
 spotted-0.068
 distance-0.068
Token't
Token ID470
Feature activation+5.76e-02

Pos logit contributions
ases+0.083
oco+0.079
 Deal+0.079
 myself+0.079
 yourselves+0.078
Neg logit contributions
 he-0.208
 surrounded-0.201
 full-0.190
 which-0.179
 filled-0.179
Token know
Token ID760
Feature activation+2.55e-02

Pos logit contributions
 myself+0.080
ases+0.074
 Go+0.072
rr+0.071
 yourself+0.070
Neg logit contributions
 surrounded-0.127
 he-0.119
 full-0.115
 distance-0.112
 clearing-0.112
Token.
Token ID13
Feature activation+7.42e-02

Pos logit contributions
 myself+0.059
 Go+0.056
 yourself+0.055
ases+0.055
 ourselves+0.055
Neg logit contributions
 surrounded-0.033
 distance-0.029
 full-0.028
 he-0.028
 clearing-0.027
Token her
Token ID607
Feature activation-2.24e-02

Pos logit contributions
 distance+0.071
 surrounded+0.067
 valleys+0.067
 clearing+0.066
 woods+0.066
Neg logit contributions
 you-0.074
 myself-0.067
 Go-0.066
 ourselves-0.064
 yourself-0.064
Token mom
Token ID1995
Feature activation-1.66e-02

Pos logit contributions
 surrounded+0.040
 distance+0.040
 clearing+0.038
 fields+0.036
 he+0.035
Neg logit contributions
 myself-0.034
rr-0.034
 Go-0.033
ases-0.033
uch-0.033
Tokenmy
Token ID1820
Feature activation-1.89e-02

Pos logit contributions
 surrounded+0.037
 distance+0.035
 woods+0.035
 clearing+0.034
 wooden+0.034
Neg logit contributions
 myself-0.020
 yourself-0.018
 you-0.018
rr-0.018
 Go-0.018
Token said
Token ID531
Feature activation+5.96e-03

Pos logit contributions
 distance+0.028
 surrounded+0.026
 woods+0.026
 clearing+0.026
 fields+0.025
Neg logit contributions
 you-0.035
 Go-0.034
uch-0.034
 myself-0.033
 us-0.033
Token it
Token ID340
Feature activation+3.33e-03

Pos logit contributions
 Go+0.012
 myself+0.012
 you+0.012
 ourselves+0.011
 yourself+0.011
Neg logit contributions
 distance-0.008
 surrounded-0.008
 clearing-0.008
 woods-0.008
 fields-0.008
Token wasn
Token ID2492
Feature activation+9.66e-02

Pos logit contributions
 myself+0.008
ases+0.008
 Go+0.008
rr+0.007
 yourselves+0.007
Neg logit contributions
 surrounded-0.004
 he-0.004
 full-0.003
 distance-0.003
 spotted-0.003
Token't
Token ID470
Feature activation+4.91e-02

Pos logit contributions
ases+0.137
rr+0.127
 yourselves+0.124
ymes+0.117
 ourselves+0.114
Neg logit contributions
 he-0.224
 surrounded-0.211
 full-0.208
 she-0.191
 filled-0.187
Token worth
Token ID2861
Feature activation+2.34e-02

Pos logit contributions
 myself+0.099
 yourself+0.092
 you+0.089
 yours+0.089
 ourselves+0.088
Neg logit contributions
 surrounded-0.076
 fields-0.070
 distance-0.069
 spotted-0.068
 woods-0.067
Token anything
Token ID1997
Feature activation+6.96e-03

Pos logit contributions
 myself+0.049
 yourself+0.046
 Go+0.045
 yourselves+0.045
ases+0.045
Neg logit contributions
 surrounded-0.034
 distance-0.031
 clearing-0.030
 he-0.030
 barrel-0.030
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
 myself+0.016
 Go+0.015
ases+0.015
 yourself+0.015
 yourselves+0.014
Neg logit contributions
 surrounded-0.009
 he-0.008
 clearing-0.008
 distance-0.008
 full-0.008
Token Lily
Token ID20037
Feature activation-1.85e-02

Pos logit contributions
 surrounded+0.015
 he+0.013
 distance+0.013
 full+0.012
 clearing+0.012
Neg logit contributions
 myself-0.026
 Go-0.025
ases-0.025
 yourselves-0.024
 Deal-0.024
Token stop
Token ID2245
Feature activation+4.47e-02

Pos logit contributions
 myself+0.038
ases+0.037
 Go+0.035
rr+0.035
 ourselves+0.034
Neg logit contributions
 surrounded-0.054
 he-0.052
 full-0.049
 spotted-0.047
 distance-0.046
Token!"
Token ID2474
Feature activation+4.44e-03

Pos logit contributions
 myself+0.090
ases+0.083
 Go+0.083
 yourselves+0.082
 yourself+0.082
Neg logit contributions
 surrounded-0.072
 he-0.065
 full-0.063
 distance-0.061
 clearing-0.060
Token Lily
Token ID20037
Feature activation-6.49e-03

Pos logit contributions
 myself+0.009
 you+0.008
 Go+0.008
 yourselves+0.008
 yourself+0.008
Neg logit contributions
 surrounded-0.007
 fields-0.007
 clearing-0.007
 woods-0.006
 shone-0.006
Token said
Token ID531
Feature activation+4.15e-03

Pos logit contributions
 surrounded+0.009
 clearing+0.008
 distance+0.008
 woods+0.008
 he+0.008
Neg logit contributions
 myself-0.014
 Go-0.013
ases-0.013
 you-0.013
rr-0.012
Token.
Token ID13
Feature activation+4.31e-03

Pos logit contributions
 myself+0.009
 Go+0.009
ases+0.009
rr+0.008
 ourselves+0.008
Neg logit contributions
 surrounded-0.006
 clearing-0.005
 woods-0.005
 distance-0.005
 fields-0.005
Token "
Token ID366
Feature activation+8.14e-02

Pos logit contributions
 myself+0.011
ases+0.011
 Go+0.011
rr+0.011
 yourselves+0.011
Neg logit contributions
 surrounded-0.004
 he-0.003
 full-0.003
 distance-0.003
 spotted-0.003
TokenThat
Token ID2504
Feature activation+3.90e-02

Pos logit contributions
ases+0.160
 myself+0.157
rr+0.153
 ourselves+0.153
 Deal+0.152
Neg logit contributions
 surrounded-0.122
 he-0.117
 distance-0.105
 full-0.103
 spotted-0.098
Token is
Token ID318
Feature activation+4.98e-02

Pos logit contributions
 myself+0.071
ases+0.067
 Go+0.066
rr+0.065
 yourselves+0.064
Neg logit contributions
 surrounded-0.069
 he-0.064
 full-0.061
 distance-0.059
 spotted-0.058
Token my
Token ID616
Feature activation+1.28e-02

Pos logit contributions
 myself+0.076
ases+0.072
 Go+0.070
rr+0.069
 yourselves+0.068
Neg logit contributions
 surrounded-0.102
 he-0.096
 full-0.093
 distance-0.089
 spotted-0.089
Token book
Token ID1492
Feature activation+3.11e-02

Pos logit contributions
 myself+0.026
ases+0.026
rr+0.025
 Go+0.025
 ourselves+0.025
Neg logit contributions
 surrounded-0.019
 he-0.018
 full-0.017
 wooden-0.016
 woods-0.016
Token.
Token ID13
Feature activation+8.34e-02

Pos logit contributions
ases+0.065
 myself+0.064
 Deal+0.063
rr+0.062
 yourselves+0.062
Neg logit contributions
 surrounded-0.046
 he-0.044
 full-0.042
 filled-0.038
 which-0.037
Token smiled
Token ID13541
Feature activation-8.91e-03

Pos logit contributions
 surrounded+0.041
 distance+0.039
 woods+0.037
 he+0.036
 wooden+0.035
Neg logit contributions
rr-0.054
 myself-0.053
ases-0.051
 Go-0.051
 us-0.050
Token and
Token ID290
Feature activation-3.18e-02

Pos logit contributions
 surrounded+0.013
 clearing+0.012
 distance+0.012
 woods+0.012
 barrel+0.012
Neg logit contributions
 myself-0.017
rr-0.016
 Deal-0.016
 Go-0.016
 yourself-0.016
Token said
Token ID531
Feature activation-6.95e-03

Pos logit contributions
 distance+0.056
 woods+0.055
 surrounded+0.051
 clearing+0.051
 barrel+0.050
Neg logit contributions
 you-0.048
 myself-0.047
 Go-0.045
rac-0.044
ipp-0.043
Token,
Token ID11
Feature activation-8.34e-03

Pos logit contributions
 surrounded+0.009
 distance+0.009
 clearing+0.008
 woods+0.008
 he+0.008
Neg logit contributions
 myself-0.015
 Go-0.015
ases-0.014
 you-0.014
 yourself-0.014
Token "
Token ID366
Feature activation+6.26e-02

Pos logit contributions
 surrounded+0.010
 distance+0.009
 woods+0.009
 bushes+0.008
 clearing+0.008
Neg logit contributions
 myself-0.019
 Go-0.019
 Deal-0.018
 you-0.018
rr-0.017
TokenDon
Token ID3987
Feature activation+9.25e-02

Pos logit contributions
 myself+0.108
ases+0.101
 Go+0.100
rr+0.097
 yourselves+0.097
Neg logit contributions
 surrounded-0.116
 he-0.108
 full-0.104
 distance-0.100
 spotted-0.099
Token't
Token ID470
Feature activation+5.86e-02

Pos logit contributions
ases+0.127
 yourselves+0.106
rr+0.106
 myself+0.106
oco+0.102
Neg logit contributions
 he-0.207
 surrounded-0.196
 full-0.189
 she-0.185
 had-0.181
Token worry
Token ID5490
Feature activation+4.10e-02

Pos logit contributions
ases+0.098
 myself+0.094
 Go+0.091
 yourselves+0.088
 Deal+0.088
Neg logit contributions
 surrounded-0.110
 he-0.106
 full-0.101
 distance-0.096
 spotted-0.095
Token,
Token ID11
Feature activation+2.77e-02

Pos logit contributions
 myself+0.097
 Go+0.092
ases+0.091
 yourself+0.090
 ourselves+0.089
Neg logit contributions
 surrounded-0.051
 he-0.044
 distance-0.042
 full-0.042
 spotted-0.040
Token Ros
Token ID10018
Feature activation+5.57e-02

Pos logit contributions
 myself+0.058
 yourself+0.054
 Go+0.054
 you+0.053
 ourselves+0.053
Neg logit contributions
 surrounded-0.044
 distance-0.038
 he-0.037
 spotted-0.037
 full-0.037
Tokenie
Token ID494
Feature activation+4.81e-02

Pos logit contributions
 myself+0.106
ases+0.102
 Go+0.102
 yourselves+0.098
 Deal+0.097
Neg logit contributions
 surrounded-0.091
 he-0.085
 full-0.081
 spotted-0.077
 wooden-0.076
Token just
Token ID655
Feature activation+2.65e-02

Pos logit contributions
 myself+0.087
 yourselves+0.080
ases+0.078
rr+0.076
 ourselves+0.075
Neg logit contributions
 surrounded-0.118
 he-0.112
 full-0.109
 spotted-0.106
 curled-0.103
Token want
Token ID765
Feature activation+3.94e-02

Pos logit contributions
 myself+0.046
ases+0.044
 Go+0.043
 yourselves+0.043
rr+0.042
Neg logit contributions
 surrounded-0.048
 he-0.045
 full-0.043
 spotted-0.042
 distance-0.041
Token to
Token ID284
Feature activation+6.20e-02

Pos logit contributions
 Deal+0.098
ases+0.097
rr+0.095
uch+0.093
 myself+0.093
Neg logit contributions
 he-0.043
 surrounded-0.041
 full-0.036
 which-0.035
 bushes-0.035
Token see
Token ID766
Feature activation+2.47e-02

Pos logit contributions
 Deal+0.131
ases+0.130
 myself+0.129
rr+0.128
 Go+0.127
Neg logit contributions
 surrounded-0.087
 he-0.082
 full-0.078
 distance-0.072
 rabbit-0.071
Token your
Token ID534
Feature activation+9.73e-03

Pos logit contributions
 myself+0.055
ases+0.054
rr+0.053
 Go+0.053
 Deal+0.053
Neg logit contributions
 surrounded-0.032
 he-0.030
 full-0.027
 which-0.026
 fields-0.025
Token pretty
Token ID2495
Feature activation+2.89e-02

Pos logit contributions
 myself+0.021
ases+0.021
 Go+0.021
rr+0.020
 Deal+0.020
Neg logit contributions
 surrounded-0.013
 full-0.012
 he-0.012
 distance-0.011
 wooden-0.011
Token wings
Token ID12098
Feature activation+3.56e-02

Pos logit contributions
 myself+0.066
ases+0.063
 Go+0.063
rr+0.061
 yourselves+0.061
Neg logit contributions
 surrounded-0.037
 he-0.033
 full-0.031
 distance-0.030
 spotted-0.030
Token."
Token ID526
Feature activation+4.87e-03

Pos logit contributions
 myself+0.057
ases+0.057
 Go+0.055
 Deal+0.054
rr+0.054
Neg logit contributions
 surrounded-0.069
 he-0.064
 full-0.062
 distance-0.059
 which-0.058
Token
Token ID220
Feature activation+3.74e-02

Pos logit contributions
 myself+0.011
ases+0.010
 Go+0.010
rr+0.010
 yourselves+0.010
Neg logit contributions
 surrounded-0.007
 he-0.006
 full-0.006
 distance-0.005
 spotted-0.005
Token
Token ID198
Feature activation+3.28e-02

Pos logit contributions
 myself+0.103
ases+0.101
uch+0.100
rr+0.099
 yourselves+0.098
Neg logit contributions
 he-0.026
 surrounded-0.023
 full-0.020
 Suddenly-0.016
 which-0.016
Token
Token ID198
Feature activation+3.60e-02

Pos logit contributions
ases+0.090
 myself+0.086
uch+0.086
 Deal+0.084
 Go+0.084
Neg logit contributions
 he-0.029
 surrounded-0.024
 distance-0.020
 clearing-0.020
 full-0.020
Token What
Token ID1867
Feature activation+2.47e-02

Pos logit contributions
 myself+0.099
 Go+0.093
ases+0.092
rr+0.090
 yourselves+0.089
Neg logit contributions
 surrounded-0.093
 he-0.086
 full-0.082
 distance-0.080
 spotted-0.079
Token is
Token ID318
Feature activation+4.82e-02

Pos logit contributions
 myself+0.042
ases+0.041
 Deal+0.040
 yourselves+0.038
 ourselves+0.037
Neg logit contributions
 surrounded-0.047
 he-0.045
 full-0.043
 filled-0.041
 spotted-0.040
Token this
Token ID428
Feature activation+5.61e-02

Pos logit contributions
 myself+0.104
 you+0.099
 Go+0.099
 ourselves+0.095
 yourselves+0.095
Neg logit contributions
 surrounded-0.068
 distance-0.061
 spotted-0.059
 clearing-0.057
 full-0.057
Token place
Token ID1295
Feature activation+5.97e-02

Pos logit contributions
 myself+0.054
ases+0.050
 Go+0.048
rr+0.045
 yourselves+0.045
Neg logit contributions
 surrounded-0.146
 he-0.139
 full-0.135
 distance-0.131
 spotted-0.131
Token mom
Token ID1995
Feature activation+5.52e-02

Pos logit contributions
 myself+0.083
ases+0.082
 Go+0.077
 Deal+0.076
rr+0.076
Neg logit contributions
 surrounded-0.131
 he-0.124
 full-0.121
 filled-0.114
 distance-0.113
Tokenmy
Token ID1820
Feature activation+6.87e-02

Pos logit contributions
 myself+0.101
ases+0.101
 Go+0.098
 Deal+0.096
 yourselves+0.093
Neg logit contributions
 surrounded-0.096
 he-0.089
 full-0.085
 spotted-0.080
 distance-0.079
Token?"
Token ID1701
Feature activation-2.49e-03

Pos logit contributions
 myself+0.072
ases+0.068
 Go+0.065
 Deal+0.062
rr+0.061
Neg logit contributions
 surrounded-0.174
 he-0.165
 full-0.160
 spotted-0.155
 distance-0.153
Token Tommy
Token ID19919
Feature activation-3.99e-03

Pos logit contributions
 surrounded+0.005
 clearing+0.005
 distance+0.005
 fields+0.004
 barrel+0.004
Neg logit contributions
 Go-0.004
 you-0.004
 myself-0.004
 yourselves-0.003
 yourself-0.003
Token said
Token ID531
Feature activation-1.22e-02

Pos logit contributions
 surrounded+0.004
 he+0.003
 full+0.003
 distance+0.003
 clearing+0.003
Neg logit contributions
 myself-0.010
ases-0.010
 Go-0.010
rr-0.010
 yourselves-0.010
Token.
Token ID13
Feature activation+4.21e-04

Pos logit contributions
 surrounded+0.018
 woods+0.018
 clearing+0.018
 fields+0.017
 bushes+0.017
Neg logit contributions
 myself-0.023
 Go-0.023
rr-0.022
ases-0.021
 you-0.021
Token
Token ID198
Feature activation+2.91e-02

Pos logit contributions
 myself+0.001
 Go+0.001
ases+0.001
rr+0.001
 yourselves+0.001
Neg logit contributions
 surrounded-0.001
 he-0.001
 full-0.001
 distance-0.001
 clearing-0.001
Token learned
Token ID4499
Feature activation-9.78e-03

Pos logit contributions
 surrounded+0.017
 he+0.015
 full+0.015
 wooden+0.015
 clearing+0.015
Neg logit contributions
 myself-0.026
ases-0.025
 Go-0.024
 yourself-0.024
uch-0.024
Token that
Token ID326
Feature activation-1.53e-02

Pos logit contributions
 surrounded+0.014
 spotted+0.012
 distance+0.012
 clearing+0.012
 full+0.012
Neg logit contributions
 myself-0.021
 yourself-0.020
 Go-0.020
 ourselves-0.020
ases-0.020
Token sometimes
Token ID3360
Feature activation+1.06e-02

Pos logit contributions
 surrounded+0.022
 he+0.019
 full+0.019
 distance+0.018
 spotted+0.018
Neg logit contributions
 myself-0.033
 Go-0.031
ases-0.031
 yourself-0.030
 ourselves-0.030
Token it
Token ID340
Feature activation+3.08e-02

Pos logit contributions
 myself+0.023
ases+0.022
 Go+0.021
 yourselves+0.021
 yourself+0.021
Neg logit contributions
 surrounded-0.015
 he-0.014
 full-0.013
 distance-0.013
 spotted-0.013
Token's
Token ID338
Feature activation+3.63e-02

Pos logit contributions
 myself+0.071
ases+0.070
 Deal+0.070
 yourselves+0.069
rr+0.068
Neg logit contributions
 surrounded-0.038
 he-0.037
 full-0.031
 woods-0.030
 rabbit-0.030
Token okay
Token ID8788
Feature activation+4.91e-02

Pos logit contributions
 myself+0.068
ases+0.065
 Go+0.064
rr+0.062
 yourselves+0.062
Neg logit contributions
 surrounded-0.062
 he-0.057
 full-0.054
 distance-0.052
 spotted-0.052
Token to
Token ID284
Feature activation+7.31e-02

Pos logit contributions
 myself+0.118
ases+0.115
 Deal+0.114
rr+0.111
 yourselves+0.110
Neg logit contributions
 surrounded-0.057
 he-0.056
 full-0.049
 which-0.046
 spotted-0.044
Token be
Token ID307
Feature activation+2.19e-02

Pos logit contributions
 myself+0.126
ases+0.124
 Go+0.121
 yourselves+0.116
 Deal+0.115
Neg logit contributions
 surrounded-0.132
 he-0.125
 full-0.120
 distance-0.114
 spotted-0.112
Token scared
Token ID12008
Feature activation+5.25e-02

Pos logit contributions
 myself+0.047
 yourself+0.045
ases+0.045
 yourselves+0.044
 Go+0.044
Neg logit contributions
 surrounded-0.030
 he-0.028
 clearing-0.026
 distance-0.026
 wooden-0.026
Token,
Token ID11
Feature activation+6.04e-03

Pos logit contributions
 myself+0.125
ases+0.120
 Go+0.119
rr+0.117
 yourselves+0.116
Neg logit contributions
 surrounded-0.062
 he-0.056
 full-0.052
 distance-0.048
 spotted-0.048
Token but
Token ID475
Feature activation+7.26e-03

Pos logit contributions
 myself+0.016
 Go+0.015
 yourself+0.015
 you+0.015
 ourselves+0.015
Neg logit contributions
 surrounded-0.006
 clearing-0.005
 distance-0.005
 full-0.004
 spotted-0.004
Token the
Token ID262
Feature activation-5.48e-03

Pos logit contributions
 Deal+0.040
rr+0.040
uch+0.040
ases+0.040
 Go+0.039
Neg logit contributions
 surrounded-0.014
 he-0.013
 full-0.012
 fields-0.011
 bushes-0.011
Token bus
Token ID1323
Feature activation+1.42e-02

Pos logit contributions
 surrounded+0.007
 he+0.006
 full+0.006
 distance+0.006
 wooden+0.005
Neg logit contributions
 myself-0.013
ases-0.013
 Go-0.012
rr-0.012
 ourselves-0.012
Token.
Token ID13
Feature activation+6.71e-02

Pos logit contributions
 myself+0.032
ases+0.031
 Deal+0.031
 Go+0.031
rr+0.030
Neg logit contributions
 surrounded-0.018
 he-0.017
 full-0.016
 distance-0.014
 which-0.014
Token Let
Token ID3914
Feature activation+3.44e-02

Pos logit contributions
 myself+0.124
ases+0.121
uch+0.117
rr+0.117
 yourselves+0.113
Neg logit contributions
 surrounded-0.111
 he-0.104
 full-0.099
 clearing-0.093
 woods-0.092
Token's
Token ID338
Feature activation+6.19e-02

Pos logit contributions
ases+0.083
rr+0.080
 myself+0.080
 Deal+0.079
vern+0.075
Neg logit contributions
 he-0.039
 surrounded-0.039
 saw-0.035
 started-0.034
 rabbit-0.034
Token go
Token ID467
Feature activation+3.12e-02

Pos logit contributions
 myself+0.119
ases+0.117
 Go+0.112
rr+0.112
 yourselves+0.110
Neg logit contributions
 surrounded-0.100
 he-0.096
 full-0.092
 filled-0.086
 clearing-0.084
Token and
Token ID290
Feature activation+3.89e-02

Pos logit contributions
 myself+0.070
ases+0.067
 Go+0.067
rr+0.065
 yourselves+0.065
Neg logit contributions
 surrounded-0.041
 he-0.039
 full-0.036
 distance-0.034
 spotted-0.033
Token see
Token ID766
Feature activation+1.98e-02

Pos logit contributions
 myself+0.070
 Go+0.065
ases+0.064
 yourself+0.063
 yourselves+0.063
Neg logit contributions
 surrounded-0.069
 he-0.062
 full-0.061
 wooden-0.059
 distance-0.059
Token."
Token ID526
Feature activation-1.18e-02

Pos logit contributions
 myself+0.041
ases+0.041
 Go+0.039
rr+0.039
uch+0.039
Neg logit contributions
 surrounded-0.028
 he-0.027
 full-0.025
 which-0.023
 spotted-0.023
Token
Token ID198
Feature activation+2.35e-02

Pos logit contributions
 surrounded+0.016
 he+0.014
 full+0.013
 distance+0.013
 spotted+0.013
Neg logit contributions
 myself-0.027
ases-0.025
 Go-0.025
 yourselves-0.025
rr-0.025
Token
Token ID198
Feature activation+2.96e-02

Pos logit contributions
 myself+0.061
uch+0.060
ases+0.059
 Go+0.058
 Deal+0.057
Neg logit contributions
 he-0.023
 surrounded-0.020
 full-0.019
 distance-0.017
 saw-0.017
Token,
Token ID11
Feature activation-4.74e-02

Pos logit contributions
 surrounded+0.042
 clearing+0.040
 woods+0.039
 fields+0.038
 woman+0.038
Neg logit contributions
 myself-0.059
 you-0.057
rr-0.055
 ourselves-0.052
 yourself-0.051
Token but
Token ID475
Feature activation-3.22e-02

Pos logit contributions
 clearing+0.051
 shone+0.051
 rabbit+0.049
 woman+0.049
 woods+0.049
Neg logit contributions
 myself-0.096
 you-0.091
 yourself-0.082
't-0.082
 yourselves-0.081
Token no
Token ID645
Feature activation+1.99e-03

Pos logit contributions
 surrounded+0.019
 clearing+0.015
 wooden+0.015
 woman+0.014
 distance+0.014
Neg logit contributions
 myself-0.090
 you-0.089
 Go-0.085
 ourselves-0.084
uch-0.084
Token one
Token ID530
Feature activation-3.33e-02

Pos logit contributions
 myself+0.005
 you+0.004
 ourselves+0.004
 yourself+0.004
 Go+0.004
Neg logit contributions
 surrounded-0.002
 full-0.002
 spotted-0.002
 clearing-0.002
 filled-0.002
Token seemed
Token ID3947
Feature activation-2.08e-03

Pos logit contributions
 surrounded+0.042
 wooden+0.037
 clearing+0.037
 fields+0.036
 full+0.035
Neg logit contributions
 Go-0.068
ases-0.067
 myself-0.067
uch-0.065
 you-0.065
Token to
Token ID284
Feature activation+4.13e-02

Pos logit contributions
 surrounded+0.003
 clearing+0.003
 fields+0.003
 woods+0.003
 he+0.003
Neg logit contributions
 myself-0.004
 yourself-0.003
 you-0.003
 Go-0.003
 yourselves-0.003
Token value
Token ID1988
Feature activation+1.07e-02

Pos logit contributions
 myself+0.075
ases+0.074
 Go+0.073
rr+0.070
 Deal+0.070
Neg logit contributions
 surrounded-0.072
 he-0.068
 full-0.065
 distance-0.062
 spotted-0.060
Token him
Token ID683
Feature activation-1.26e-03

Pos logit contributions
 myself+0.025
ases+0.024
 Go+0.023
 yourselves+0.023
 yourself+0.023
Neg logit contributions
 surrounded-0.013
 he-0.011
 full-0.011
 spotted-0.011
 clearing-0.011
Token.
Token ID13
Feature activation-7.46e-03

Pos logit contributions
 surrounded+0.002
 he+0.001
 full+0.001
 distance+0.001
 spotted+0.001
Neg logit contributions
 myself-0.003
ases-0.003
 Go-0.003
rr-0.003
 yourselves-0.003
Token One
Token ID1881
Feature activation-2.85e-02

Pos logit contributions
 surrounded+0.008
 he+0.007
 full+0.006
 distance+0.006
 spotted+0.006
Neg logit contributions
 myself-0.019
 Go-0.018
ases-0.018
rr-0.018
 yourselves-0.018
Token day
Token ID1110
Feature activation-3.24e-02

Pos logit contributions
 widened+0.058
 revealing+0.055
 valleys+0.055
 surrounded+0.052
 continuing+0.050
Neg logit contributions
 you-0.034
uch-0.034
rr-0.031
 myself-0.030
oof-0.030
Token time
Token ID640
Feature activation-1.69e-02

Pos logit contributions
 surrounded+0.027
 full+0.026
 he+0.024
 spotted+0.023
 distance+0.022
Neg logit contributions
 myself-0.053
ases-0.051
 yourselves-0.051
 ourselves-0.050
 yourself-0.049
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
 he+0.013
 surrounded+0.013
 full+0.010
 spotted+0.007
 which+0.007
Neg logit contributions
 myself-0.049
ases-0.048
 yourselves-0.047
 yourself-0.046
 Deal-0.046
Token there
Token ID612
Feature activation-1.37e-02

Pos logit contributions
 surrounded+0.014
 distance+0.013
 fields+0.012
 woman+0.011
 he+0.011
Neg logit contributions
 myself-0.038
 Go-0.036
 you-0.035
rr-0.035
 ourselves-0.035
Token lived
Token ID5615
Feature activation-1.23e-02

Pos logit contributions
 surrounded+0.015
 he+0.013
 full+0.012
 spotted+0.011
 distance+0.011
Neg logit contributions
 myself-0.035
ases-0.034
 Go-0.033
rr-0.033
 yourselves-0.032
Token a
Token ID257
Feature activation-3.50e-02

Pos logit contributions
 surrounded+0.011
 full+0.009
 he+0.009
 filled+0.008
 clearing+0.008
Neg logit contributions
 myself-0.033
ases-0.033
 Go-0.032
uch-0.032
 Deal-0.031
Token mighty
Token ID18680
Feature activation-2.24e-02

Pos logit contributions
 surrounded+0.047
 distance+0.039
 he+0.039
 spotted+0.038
 curled+0.038
Neg logit contributions
 you-0.070
 myself-0.068
rr-0.068
 Go-0.067
 yourselves-0.064
Token mouse
Token ID10211
Feature activation-4.57e-02

Pos logit contributions
 surrounded+0.028
 he+0.024
 distance+0.023
 spotted+0.023
 full+0.021
Neg logit contributions
 myself-0.048
 Go-0.047
 you-0.046
rr-0.046
ases-0.045
Token.
Token ID13
Feature activation-6.32e-03

Pos logit contributions
 surrounded+0.057
 distance+0.055
 wooden+0.054
 widened+0.053
 winding+0.053
Neg logit contributions
 Go-0.092
 myself-0.092
 you-0.091
ases-0.090
rr-0.086
Token He
Token ID679
Feature activation+1.43e-03

Pos logit contributions
 surrounded+0.008
 spotted+0.007
 he+0.007
 distance+0.006
 full+0.006
Neg logit contributions
 myself-0.015
 Go-0.014
ases-0.014
rr-0.013
 yourself-0.013
Token had
Token ID550
Feature activation-9.88e-03

Pos logit contributions
 myself+0.003
 Go+0.003
ases+0.003
rr+0.003
 yourself+0.003
Neg logit contributions
 surrounded-0.002
 he-0.001
 distance-0.001
 full-0.001
 clearing-0.001
Token a
Token ID257
Feature activation-2.56e-02

Pos logit contributions
 surrounded+0.014
 distance+0.012
 clearing+0.012
 he+0.012
 spotted+0.012
Neg logit contributions
 myself-0.021
 yourself-0.020
 Go-0.020
 you-0.019
 yourselves-0.019
Token They
Token ID1119
Feature activation-2.70e-02

Pos logit contributions
 surrounded+0.032
 miner+0.028
 he+0.028
 barrel+0.028
 clearing+0.027
Neg logit contributions
 Go-0.039
 Deal-0.038
 myself-0.037
 yourselves-0.035
ases-0.035
Token went
Token ID1816
Feature activation-2.30e-02

Pos logit contributions
 surrounded+0.027
 he+0.025
 clearing+0.025
 distance+0.024
 woods+0.023
Neg logit contributions
 myself-0.061
ases-0.060
 Go-0.060
 you-0.058
rr-0.058
Token when
Token ID618
Feature activation-4.29e-02

Pos logit contributions
 surrounded+0.036
 he+0.030
 spotted+0.030
 curled+0.029
 distance+0.029
Neg logit contributions
 myself-0.045
 Go-0.042
ases-0.042
 you-0.041
 ourselves-0.040
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
 surrounded+0.036
 clearing+0.026
 he+0.026
 full+0.025
 spotted+0.025
Neg logit contributions
 myself-0.115
 Go-0.112
 yourselves-0.111
 ourselves-0.110
 you-0.110
Token sun
Token ID4252
Feature activation-2.72e-02

Pos logit contributions
 surrounded+0.016
 he+0.013
 curled+0.012
 spotted+0.012
 clearing+0.011
Neg logit contributions
 myself-0.036
 Go-0.036
ases-0.034
rr-0.034
 yourselves-0.034
Token was
Token ID373
Feature activation-1.03e-02

Pos logit contributions
 surrounded+0.029
 clearing+0.028
 hollow+0.028
 winding+0.027
 he+0.026
Neg logit contributions
uch-0.057
 Go-0.055
ases-0.054
oh-0.053
 myself-0.053
Token shining
Token ID22751
Feature activation-2.44e-02

Pos logit contributions
 surrounded+0.011
 he+0.010
 spotted+0.009
 curled+0.009
 rabbit+0.009
Neg logit contributions
 myself-0.024
 yourself-0.023
 yourselves-0.023
 ourselves-0.023
uch-0.022
Token and
Token ID290
Feature activation-2.94e-02

Pos logit contributions
 surrounded+0.036
 clearing+0.034
 spotted+0.033
 miner+0.032
 curled+0.032
Neg logit contributions
 myself-0.043
 ourselves-0.042
 you-0.042
 us-0.040
 yourself-0.040
Token the
Token ID262
Feature activation-2.36e-02

Pos logit contributions
 miner+0.037
 surrounded+0.037
ger+0.035
 clearing+0.035
 hollow+0.034
Neg logit contributions
 myself-0.059
 you-0.059
 ourselves-0.057
 yourselves-0.056
 yourself-0.055
Token sky
Token ID6766
Feature activation-3.20e-02

Pos logit contributions
 surrounded+0.028
 he+0.026
 curled+0.023
 clearing+0.023
 full+0.022
Neg logit contributions
 myself-0.052
 yourselves-0.050
 you-0.050
 yourself-0.050
 Go-0.049
Token was
Token ID373
Feature activation-1.86e-02

Pos logit contributions
 barrel+0.047
 clearing+0.047
 surrounded+0.046
 miner+0.045
 he+0.045
Neg logit contributions
uch-0.054
 us-0.052
 you-0.051
 ourselves-0.051
 myself-0.050
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
 surrounded+0.009
 clearing+0.008
 distance+0.008
 wooden+0.008
 hollow+0.008
Neg logit contributions
 myself-0.014
 you-0.014
 ourselves-0.014
 Go-0.014
ases-0.014
Token always
Token ID1464
Feature activation-1.47e-02

Pos logit contributions
 surrounded+0.021
 clearing+0.019
 he+0.018
 wooden+0.018
 woods+0.018
Neg logit contributions
 myself-0.045
 Go-0.043
 ourselves-0.043
ases-0.042
 you-0.042
Token played
Token ID2826
Feature activation+7.87e-03

Pos logit contributions
 surrounded+0.016
 clearing+0.016
 he+0.015
 woods+0.015
 distance+0.015
Neg logit contributions
 myself-0.033
rr-0.031
 ourselves-0.031
 Go-0.031
ases-0.031
Token hockey
Token ID12217
Feature activation+1.00e-02

Pos logit contributions
 myself+0.016
ases+0.016
 Go+0.015
rr+0.015
 yourselves+0.015
Neg logit contributions
 surrounded-0.012
 he-0.011
 full-0.010
 clearing-0.009
 spotted-0.009
Token together
Token ID1978
Feature activation-1.74e-02

Pos logit contributions
 myself+0.022
 Deal+0.021
 Go+0.021
ases+0.021
 yourselves+0.021
Neg logit contributions
 surrounded-0.013
 he-0.013
 full-0.012
 wooden-0.011
 fields-0.011
Token no
Token ID645
Feature activation+1.32e-02

Pos logit contributions
 surrounded+0.020
 he+0.018
 full+0.017
 distance+0.016
 spotted+0.016
Neg logit contributions
 myself-0.042
ases-0.040
 Go-0.040
rr-0.039
 yourselves-0.039
Token matter
Token ID2300
Feature activation+2.14e-02

Pos logit contributions
 myself+0.029
 Go+0.027
ases+0.027
 yourselves+0.027
 ourselves+0.026
Neg logit contributions
 surrounded-0.018
 he-0.017
 distance-0.016
 full-0.016
 clearing-0.016
Token what
Token ID644
Feature activation-6.56e-03

Pos logit contributions
 myself+0.046
ases+0.044
 Go+0.044
rr+0.043
 yourselves+0.043
Neg logit contributions
 surrounded-0.030
 he-0.027
 full-0.026
 distance-0.024
 spotted-0.024
Token.
Token ID13
Feature activation-3.69e-03

Pos logit contributions
 surrounded+0.007
 he+0.007
 full+0.006
 spotted+0.005
 filled+0.005
Neg logit contributions
 myself-0.017
ases-0.016
 Deal-0.016
rr-0.016
 Go-0.016
Token They
Token ID1119
Feature activation-2.04e-02

Pos logit contributions
 surrounded+0.005
 he+0.004
 full+0.004
 distance+0.004
 clearing+0.004
Neg logit contributions
 myself-0.009
ases-0.008
 yourselves-0.008
rr-0.008
 yourself-0.008
Token were
Token ID547
Feature activation-4.92e-03

Pos logit contributions
 surrounded+0.024
 he+0.021
 full+0.019
 clearing+0.019
 wooden+0.019
Neg logit contributions
 myself-0.048
ases-0.046
 Go-0.046
rr-0.045
 ourselves-0.044
Token flew
Token ID13112
Feature activation-1.76e-02

Pos logit contributions
 surrounded+0.024
 wooden+0.022
 clearing+0.022
 distance+0.022
dream+0.021
Neg logit contributions
 myself-0.041
 you-0.039
 Go-0.038
 ourselves-0.038
ases-0.036
Token home
Token ID1363
Feature activation-3.04e-02

Pos logit contributions
 surrounded+0.033
 wooden+0.029
 spotted+0.029
 woods+0.028
 woman+0.028
Neg logit contributions
 myself-0.031
 ourselves-0.027
 yourself-0.027
 you-0.026
ases-0.026
Token,
Token ID11
Feature activation-3.33e-02

Pos logit contributions
 barrel+0.049
 woods+0.048
 surrounded+0.047
 clearing+0.047
 fields+0.047
Neg logit contributions
 myself-0.056
 ourselves-0.049
 you-0.047
 yourself-0.046
 Go-0.044
Token happy
Token ID3772
Feature activation-7.28e-03

Pos logit contributions
ger+0.044
 barrel+0.042
 woods+0.042
 shack+0.040
 bushes+0.039
Neg logit contributions
 myself-0.068
 you-0.061
 yourself-0.059
 ourselves-0.057
 yourselves-0.055
Token to
Token ID284
Feature activation+2.84e-02

Pos logit contributions
 surrounded+0.011
 clearing+0.010
 distance+0.010
 spotted+0.010
 wooden+0.010
Neg logit contributions
 myself-0.015
 ourselves-0.014
 you-0.014
 Go-0.013
 yourself-0.013
Token have
Token ID423
Feature activation-1.46e-03

Pos logit contributions
 myself+0.051
ases+0.048
 Go+0.048
rr+0.046
 yourselves+0.046
Neg logit contributions
 surrounded-0.051
 he-0.048
 full-0.046
 distance-0.044
 spotted-0.043
Token soared
Token ID35292
Feature activation+5.90e-03

Pos logit contributions
 surrounded+0.002
 wooden+0.002
 he+0.002
 distance+0.002
 clearing+0.002
Neg logit contributions
 myself-0.003
 yourself-0.003
ases-0.003
 Go-0.003
 ourselves-0.003
Token in
Token ID287
Feature activation-2.58e-02

Pos logit contributions
 myself+0.013
 yourself+0.012
ases+0.012
 ourselves+0.012
 yourselves+0.011
Neg logit contributions
 surrounded-0.008
 he-0.007
 wooden-0.007
 full-0.007
 clearing-0.007
Token the
Token ID262
Feature activation-1.96e-02

Pos logit contributions
 surrounded+0.029
 he+0.025
 spotted+0.024
 full+0.023
 curled+0.022
Neg logit contributions
 myself-0.063
ases-0.058
 yourself-0.058
 ourselves-0.057
 yourselves-0.057
Token dark
Token ID3223
Feature activation-8.19e-03

Pos logit contributions
 surrounded+0.023
 he+0.020
 curled+0.020
 which+0.018
 though+0.018
Neg logit contributions
 myself-0.043
 you-0.041
 yourself-0.041
uch-0.041
 yourselves-0.040
Token sky
Token ID6766
Feature activation-2.41e-02

Pos logit contributions
 surrounded+0.011
 he+0.010
 curled+0.009
 full+0.009
 spotted+0.009
Neg logit contributions
 myself-0.017
 yourself-0.016
rr-0.016
 yourselves-0.015
 you-0.015
Token�
Token ID241
Feature activation+6.08e-02

Pos logit contributions
ases+0.089
 myself+0.087
rr+0.085
 Deal+0.084
uch+0.084
Neg logit contributions
 surrounded-0.089
 he-0.080
 full-0.080
 clearing-0.078
 filled-0.076
TokenI
Token ID40
Feature activation+4.77e-02

Pos logit contributions
 myself+0.115
ases+0.112
rr+0.107
 Deal+0.107
 Go+0.106
Neg logit contributions
 surrounded-0.101
 he-0.098
 full-0.091
 spotted-0.086
 distance-0.084
Token am
Token ID716
Feature activation+2.40e-02

Pos logit contributions
 myself+0.074
ases+0.068
 yourselves+0.067
 ourselves+0.066
 Go+0.066
Neg logit contributions
 surrounded-0.101
 he-0.095
 full-0.092
 spotted-0.090
 filled-0.087
Token the
Token ID262
Feature activation+1.25e-02

Pos logit contributions
 myself+0.052
ases+0.050
 Go+0.049
rr+0.048
 yourselves+0.048
Neg logit contributions
 surrounded-0.033
 he-0.030
 full-0.029
 distance-0.027
 spotted-0.027
Token King
Token ID2677
Feature activation+1.98e-02

Pos logit contributions
 myself+0.027
ases+0.026
 Go+0.025
rr+0.025
 yourselves+0.025
Neg logit contributions
 surrounded-0.018
 he-0.017
 full-0.016
 distance-0.015
 clearing-0.015
Token of
Token ID286
Feature activation+2.15e-02

Pos logit contributions
 myself+0.041
ases+0.039
 Go+0.039
rr+0.038
 yourselves+0.038
Neg logit contributions
 surrounded-0.029
 he-0.026
 full-0.025
 distance-0.024
 spotted-0.024
Token the
Token ID262
Feature activation-2.30e-03

Pos logit contributions
 myself+0.051
uch+0.050
ases+0.050
rr+0.050
 Go+0.049
Neg logit contributions
 surrounded-0.024
 he-0.023
 full-0.021
 distance-0.018
 clearing-0.018
Token Jungle
Token ID26411
Feature activation+1.39e-02

Pos logit contributions
 surrounded+0.003
 he+0.003
 full+0.003
 spotted+0.003
 distance+0.003
Neg logit contributions
 myself-0.005
ases-0.005
 Go-0.005
rr-0.005
 yourselves-0.005
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
 myself+0.029
ases+0.028
 Go+0.027
rr+0.027
 yourselves+0.027
Neg logit contributions
 surrounded-0.021
 he-0.019
 full-0.018
 spotted-0.017
 distance-0.017
Token I
Token ID314
Feature activation+5.06e-02

Pos logit contributions
 myself+0.076
ases+0.075
uch+0.071
rr+0.071
 ourselves+0.071
Neg logit contributions
 surrounded-0.057
 he-0.053
 full-0.050
 clearing-0.047
 distance-0.046
Token order
Token ID1502
Feature activation+3.12e-02

Pos logit contributions
 myself+0.083
ases+0.076
 yourselves+0.075
 Go+0.075
 ourselves+0.075
Neg logit contributions
 surrounded-0.101
 he-0.095
 full-0.092
 spotted-0.090
 distance-0.087
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 surrounded+0.018
 he+0.016
 full+0.016
 distance+0.015
 clearing+0.015
Neg logit contributions
 myself-0.027
ases-0.025
 Go-0.025
rr-0.024
 yourselves-0.024
TokenJim
Token ID18050
Feature activation-1.57e-02

Pos logit contributions
 myself+0.054
ases+0.053
 yourselves+0.051
uch+0.051
 Go+0.051
Neg logit contributions
 he-0.021
 surrounded-0.019
 full-0.017
 spotted-0.015
 which-0.015
Token was
Token ID373
Feature activation-5.12e-03

Pos logit contributions
 surrounded+0.019
 distance+0.017
 he+0.017
 woods+0.016
 clearing+0.016
Neg logit contributions
 myself-0.035
 Go-0.033
rr-0.033
ases-0.033
uch-0.032
Token so
Token ID523
Feature activation+9.63e-04

Pos logit contributions
 surrounded+0.007
 he+0.006
 distance+0.006
 woods+0.006
 bushes+0.006
Neg logit contributions
 myself-0.011
 Go-0.010
ases-0.010
 yourself-0.010
 yourselves-0.010
Token happy
Token ID3772
Feature activation-1.93e-02

Pos logit contributions
 myself+0.002
ases+0.002
 Go+0.002
rr+0.002
 yourselves+0.002
Neg logit contributions
 surrounded-0.001
 he-0.001
 full-0.001
 distance-0.001
 clearing-0.001
Token that
Token ID326
Feature activation-5.06e-02

Pos logit contributions
 surrounded+0.030
 distance+0.029
 clearing+0.027
 hollow+0.027
 wooden+0.027
Neg logit contributions
 myself-0.038
 you-0.037
 Go-0.034
 yourself-0.034
 ourselves-0.033
Token his
Token ID465
Feature activation-5.72e-03

Pos logit contributions
 clearing+0.045
 stream+0.040
 surrounded+0.039
 widened+0.038
 bushes+0.038
Neg logit contributions
 myself-0.131
 you-0.131
 yourself-0.118
 Go-0.117
 ourselves-0.112
Token wife
Token ID3656
Feature activation-3.89e-02

Pos logit contributions
 surrounded+0.008
 he+0.007
 clearing+0.007
 distance+0.007
 full+0.007
Neg logit contributions
 myself-0.012
ases-0.011
 Go-0.011
 yourself-0.011
rr-0.011
Token had
Token ID550
Feature activation+9.30e-03

Pos logit contributions
 surrounded+0.043
 distance+0.040
 clearing+0.039
 he+0.037
 bushes+0.037
Neg logit contributions
 myself-0.088
 Go-0.085
ases-0.083
 you-0.081
uch-0.081
Token made
Token ID925
Feature activation-4.70e-03

Pos logit contributions
 myself+0.021
 yourself+0.020
ases+0.020
 Go+0.020
 yourselves+0.019
Neg logit contributions
 surrounded-0.011
 he-0.010
 clearing-0.010
 distance-0.010
 woods-0.009
Token him
Token ID683
Feature activation-1.32e-03

Pos logit contributions
 surrounded+0.006
 distance+0.005
 he+0.005
 full+0.005
 clearing+0.005
Neg logit contributions
 myself-0.011
ases-0.010
 Go-0.010
 yourself-0.010
 ourselves-0.010
TokenOne
Token ID3198
Feature activation-2.18e-02

Pos logit contributions
 myself+0.046
ases+0.044
 Go+0.042
 yourselves+0.042
 ourselves+0.041
Neg logit contributions
 surrounded-0.051
 he-0.049
 full-0.047
 spotted-0.045
 distance-0.044
Token day
Token ID1110
Feature activation-3.77e-02

Pos logit contributions
 surrounded+0.036
 widened+0.034
 hollow+0.034
 bushes+0.033
 clearing+0.032
Neg logit contributions
rr-0.036
 you-0.035
 myself-0.034
uch-0.033
ases-0.032
Token,
Token ID11
Feature activation-4.91e-02

Pos logit contributions
 surrounded+0.052
 distance+0.047
 clearing+0.045
 hollow+0.043
 woods+0.043
Neg logit contributions
 Go-0.078
 myself-0.078
ases-0.075
 you-0.075
rr-0.075
Token they
Token ID484
Feature activation-1.48e-02

Pos logit contributions
 surrounded+0.063
 hollow+0.060
 clearing+0.059
 distance+0.058
 woods+0.056
Neg logit contributions
 you-0.103
 myself-0.101
 Go-0.101
 yourselves-0.100
 ourselves-0.098
Token saw
Token ID2497
Feature activation-4.74e-02

Pos logit contributions
 surrounded+0.012
 he+0.011
 clearing+0.009
 full+0.009
 wooden+0.009
Neg logit contributions
 myself-0.038
ases-0.038
rr-0.038
 Go-0.037
 yourselves-0.036
Token some
Token ID617
Feature activation-3.83e-02

Pos logit contributions
 surrounded+0.082
 distance+0.079
 clearing+0.076
 hollow+0.073
 winding+0.072
Neg logit contributions
 ourselves-0.077
 yourselves-0.076
ases-0.074
 myself-0.073
 you-0.073
Token bright
Token ID6016
Feature activation-2.05e-02

Pos logit contributions
 surrounded+0.058
 he+0.050
 clearing+0.049
 distance+0.049
 curled+0.048
Neg logit contributions
ases-0.071
 Go-0.071
rr-0.070
 you-0.070
 myself-0.070
Token green
Token ID4077
Feature activation-1.67e-02

Pos logit contributions
 surrounded+0.033
 he+0.030
 curled+0.029
 full+0.029
 distance+0.029
Neg logit contributions
ases-0.037
 myself-0.036
rr-0.035
 yourselves-0.034
 Go-0.034
Token things
Token ID1243
Feature activation-3.59e-02

Pos logit contributions
 surrounded+0.022
 full+0.019
 distance+0.019
 he+0.018
 curled+0.018
Neg logit contributions
ases-0.035
 myself-0.035
rr-0.034
 Go-0.034
 yourselves-0.034
Token poking
Token ID39048
Feature activation-6.54e-03

Pos logit contributions
 surrounded+0.046
 clearing+0.043
 distance+0.043
 spotted+0.041
 woods+0.040
Neg logit contributions
 myself-0.074
ases-0.072
 Go-0.072
 ourselves-0.071
 you-0.070
Token out
Token ID503
Feature activation-2.36e-02

Pos logit contributions
 surrounded+0.009
 he+0.007
 full+0.007
 clearing+0.007
 distance+0.007
Neg logit contributions
 myself-0.015
ases-0.014
 Go-0.014
rr-0.014
 ourselves-0.014
Token animals
Token ID4695
Feature activation-3.58e-02

Pos logit contributions
 surrounded+0.057
 clearing+0.054
 doorway+0.054
 miner+0.052
 distance+0.051
Neg logit contributions
 you-0.074
 myself-0.073
 yourself-0.067
 Go-0.066
 ourselves-0.065
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
 surrounded+0.054
 distance+0.052
 wooden+0.051
 clearing+0.050
 eerie+0.049
Neg logit contributions
ases-0.062
 Go-0.061
 myself-0.061
 you-0.059
 ourselves-0.059
Token Suddenly
Token ID24975
Feature activation-4.29e-02

Pos logit contributions
 surrounded+0.025
 he+0.022
 distance+0.022
 clearing+0.021
 curled+0.021
Neg logit contributions
 Go-0.031
 myself-0.030
 Deal-0.029
 you-0.029
 yourselves-0.029
Token,
Token ID11
Feature activation-4.87e-02

Pos logit contributions
 surrounded+0.066
 distance+0.061
 clearing+0.058
 full+0.056
 he+0.054
Neg logit contributions
 you-0.082
 Go-0.082
 myself-0.082
 ourselves-0.078
 yourself-0.077
Token Lily
Token ID20037
Feature activation-3.70e-02

Pos logit contributions
 clearing+0.071
 surrounded+0.071
 distance+0.070
 hollow+0.070
 winding+0.066
Neg logit contributions
 you-0.098
 ourselves-0.087
 yourselves-0.086
 myself-0.085
 Go-0.082
Token saw
Token ID2497
Feature activation-3.98e-02

Pos logit contributions
 surrounded+0.034
 he+0.028
 clearing+0.028
 distance+0.027
 full+0.027
Neg logit contributions
uch-0.091
 myself-0.090
rr-0.088
 Go-0.087
 yourself-0.086
Token a
Token ID257
Feature activation-3.20e-02

Pos logit contributions
 surrounded+0.064
 distance+0.059
 full+0.056
 clearing+0.056
 beneath+0.054
Neg logit contributions
 myself-0.075
 yourselves-0.070
 yourself-0.070
 ourselves-0.070
 you-0.069
Token candy
Token ID18550
Feature activation-2.11e-02

Pos logit contributions
 surrounded+0.044
 he+0.038
 spotted+0.034
 beneath+0.034
 curled+0.034
Neg logit contributions
 myself-0.063
 you-0.061
rr-0.060
 yourself-0.060
uch-0.060
Token wrapper
Token ID29908
Feature activation-1.35e-02

Pos logit contributions
 surrounded+0.028
 spotted+0.025
 distance+0.025
 valleys+0.023
 clearing+0.023
Neg logit contributions
 myself-0.044
 you-0.042
uch-0.042
 yourself-0.041
 Go-0.041
Token on
Token ID319
Feature activation-1.08e-02

Pos logit contributions
 surrounded+0.017
 distance+0.017
 clearing+0.016
 fields+0.015
 spotted+0.015
Neg logit contributions
 myself-0.029
 Go-0.027
 yourself-0.027
 yourselves-0.027
uch-0.027
Token the
Token ID262
Feature activation-9.83e-03

Pos logit contributions
 surrounded+0.014
 he+0.012
 spotted+0.012
 full+0.012
 distance+0.011
Neg logit contributions
 myself-0.025
 yourself-0.023
 yourselves-0.023
 ourselves-0.023
 Go-0.023
Token the
Token ID262
Feature activation-8.84e-03

Pos logit contributions
 myself+0.004
 Go+0.004
ases+0.004
rr+0.004
 yourselves+0.004
Neg logit contributions
 surrounded-0.001
 he-0.001
 full-0.001
 distance-0.001
 clearing-0.001
Token end
Token ID886
Feature activation-2.59e-02

Pos logit contributions
 surrounded+0.016
 he+0.015
 full+0.014
 distance+0.013
 spotted+0.013
Neg logit contributions
 myself-0.016
ases-0.015
 Go-0.015
rr-0.014
 yourselves-0.014
Token of
Token ID286
Feature activation-1.63e-02

Pos logit contributions
 surrounded+0.033
 he+0.029
 full+0.027
 distance+0.026
 spotted+0.026
Neg logit contributions
 myself-0.060
ases-0.057
 Go-0.057
rr-0.056
 yourselves-0.055
Token the
Token ID262
Feature activation-7.83e-03

Pos logit contributions
 surrounded+0.016
 he+0.013
 full+0.012
 distance+0.011
 spotted+0.011
Neg logit contributions
 myself-0.043
ases-0.041
 Go-0.041
rr-0.040
 yourselves-0.040
Token day
Token ID1110
Feature activation-4.30e-02

Pos logit contributions
 surrounded+0.010
 he+0.009
 full+0.008
 distance+0.008
 wooden+0.008
Neg logit contributions
 myself-0.019
ases-0.018
 Go-0.018
 yourselves-0.017
rr-0.017
Token,
Token ID11
Feature activation-4.20e-02

Pos logit contributions
 surrounded+0.052
 he+0.044
 full+0.043
 clearing+0.042
 distance+0.042
Neg logit contributions
 myself-0.099
 Go-0.096
ases-0.095
rr-0.093
 yourselves-0.093
Token their
Token ID511
Feature activation-1.37e-02

Pos logit contributions
 surrounded+0.054
 clearing+0.048
 wooden+0.048
 distance+0.047
 hollow+0.047
Neg logit contributions
 myself-0.090
 you-0.088
 ourselves-0.087
 Go-0.086
 yourselves-0.085
Token teacher
Token ID4701
Feature activation-3.04e-02

Pos logit contributions
 surrounded+0.020
 he+0.019
 full+0.018
 wooden+0.017
 woods+0.017
Neg logit contributions
 myself-0.029
ases-0.028
 Go-0.027
 yourselves-0.027
rr-0.027
Token was
Token ID373
Feature activation+8.39e-03

Pos logit contributions
 surrounded+0.033
 he+0.029
 full+0.027
 distance+0.026
 clearing+0.025
Neg logit contributions
 myself-0.074
ases-0.072
 Go-0.071
rr-0.069
 yourselves-0.069
Token very
Token ID845
Feature activation+2.57e-03

Pos logit contributions
 myself+0.018
ases+0.017
 Go+0.017
 yourselves+0.017
 yourself+0.017
Neg logit contributions
 surrounded-0.012
 he-0.011
 full-0.010
 distance-0.010
 wooden-0.010
Token proud
Token ID6613
Feature activation-8.96e-04

Pos logit contributions
 myself+0.006
ases+0.006
 Go+0.006
rr+0.006
 yourselves+0.006
Neg logit contributions
 surrounded-0.003
 he-0.003
 full-0.002
 distance-0.002
 spotted-0.002
Token another
Token ID1194
Feature activation-9.18e-03

Pos logit contributions
 surrounded+0.031
 he+0.027
 woods+0.026
 distance+0.026
 full+0.026
Neg logit contributions
 myself-0.054
ases-0.052
 yourself-0.050
 Go-0.049
 you-0.049
Token page
Token ID2443
Feature activation-2.05e-02

Pos logit contributions
 surrounded+0.016
 he+0.014
 fields+0.014
 woods+0.013
 full+0.013
Neg logit contributions
 myself-0.017
ases-0.016
 Go-0.015
rr-0.015
 Deal-0.015
Token in
Token ID287
Feature activation-2.22e-02

Pos logit contributions
 woods+0.028
 surrounded+0.027
 clearing+0.027
 wolf+0.026
 fields+0.026
Neg logit contributions
 myself-0.041
ases-0.037
 Go-0.036
 you-0.036
 ourselves-0.036
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
 surrounded+0.029
 he+0.027
 full+0.024
 spotted+0.024
 woods+0.024
Neg logit contributions
 myself-0.049
ases-0.047
 Go-0.046
 yourself-0.045
 ourselves-0.045
Token book
Token ID1492
Feature activation-3.57e-02

Pos logit contributions
 surrounded+0.032
 he+0.029
 full+0.025
 spotted+0.025
 curled+0.025
Neg logit contributions
 myself-0.043
 Go-0.040
 yourself-0.040
 Deal-0.039
 yourselves-0.039
Token and
Token ID290
Feature activation-3.30e-02

Pos logit contributions
 surrounded+0.050
 he+0.043
 clearing+0.042
 woods+0.042
 spotted+0.041
Neg logit contributions
 myself-0.074
 Go-0.069
ases-0.068
rr-0.067
 ourselves-0.066
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
 surrounded+0.037
 clearing+0.035
 distance+0.035
 woods+0.035
 wooden+0.033
Neg logit contributions
 myself-0.072
 you-0.070
 Go-0.067
 yourself-0.066
 ourselves-0.066
Token girl
Token ID2576
Feature activation-3.75e-02

Pos logit contributions
 surrounded+0.023
 he+0.021
 full+0.019
 curled+0.018
 spotted+0.018
Neg logit contributions
 myself-0.040
 Go-0.037
rr-0.037
 yourself-0.037
ases-0.037
Token looked
Token ID3114
Feature activation-1.38e-02

Pos logit contributions
 surrounded+0.043
 woods+0.040
 clearing+0.039
 distance+0.039
 he+0.038
Neg logit contributions
 myself-0.081
 Go-0.078
ases-0.076
 you-0.075
uch-0.074
Token at
Token ID379
Feature activation-1.65e-02

Pos logit contributions
 surrounded+0.020
 woods+0.020
 he+0.019
 distance+0.019
 clearing+0.018
Neg logit contributions
 myself-0.025
 you-0.024
 Go-0.024
 yourself-0.023
ases-0.023
Token it
Token ID340
Feature activation-1.72e-02

Pos logit contributions
 surrounded+0.018
 he+0.016
 full+0.015
 distance+0.014
 spotted+0.014
Neg logit contributions
 myself-0.041
ases-0.039
 Go-0.039
 yourselves-0.038
 ourselves-0.038