TOP ACTIVATIONS
MAX = 0.190

there were two friends , Bre anna and Mason . They
edge of a lake . Bre anna said , " We
Yes !" So Bre anna and Mason packed some
. Both Mir a and Ha iley learned
packed some snacks , grabbed Bre anna 's cabin , and
One day , Mir a and Ben see a
playing . Tim cl apped his hands and the big boxes
appeared . He cl apped his hands and shouted " Good
went home . <|endoftext|> Me gan was playing in the park
very happy . He waved his tail even faster . Lucy
move . He cl apped his hands and his mom came
were two best friends , Mir a and Max . They
He cl apped his hands and started running around
One day , Suzanne decided to go to the
The little boy cl apped his hands and said , "
. Tim my cl apped his hands and shouted with joy
The frog waved his foot around . " No
was jumping around , waving his hand in the air .
themselves . Henry cl apped his hands as the tower grew
. Tim my cl apped his hands too and smiled .

MIN ACTIVATIONS
MIN = -0.226

She tasted the soup and spat it out .
He cou g hed and spat out some water . He
It makes me want to spit ." " Me
She made a face and spat it out .
like the noise . It spat out the float y and
the coffee down . He spat it out , then ran
big surprises if it was wished on with a pure heart
a bad cook ." He spat out the cake .
He made a face and spat it out .
some things that cannot be wished for and he had learned
that sharing is better than competing . They learn that purple
. But she did not spit it out . She said
hard and plastic . It spat it out and made a
. They make bubbles and spit . They smile at each
was so excited and he gob bled it up quickly !
. He tasted it and spat it out . He was
wait too long so it wished to grow quickly .
found some delicious berries and gob bled them up .
big field of strawberries and gob bled them up ! They
tasted bad , and he spit it out . He was

INTERVAL 0.086 - 0.190
CONTAINS 14.473%

and frightened . They saw the frog and ran away .
, there was a little girl named Lily . She loved
little girl named Sarah . She loved to explore outside with
. M ia 's mom saw that Mia
" Yes , please !" Jane was good at painting and

INTERVAL -0.018 - 0.086
CONTAINS 38.250%

it was wet too - it had been raining all morning
should enjoy them . They are special for your birthday ."
. The mom my and son smiled when they saw their
. In the distance the deer could see something that made
Lily 's forehead and feels that it is not hot .

INTERVAL -0.122 - -0.018
CONTAINS 44.967%

her . One day , she saw a man
yogurt and made a big picnic lunch . They ate and
and Tom nodded . They felt sorry and ashamed . They
water and said , " hello ." The daughter was so
above . Then you 'll feel strong and stay hanging until

INTERVAL -0.226 - -0.122
CONTAINS 2.310%

Who do you want to invite ?" Anna says
down the street . He wanted to play with it ,
with many feathers . Tom wants to catch the bird and
a funny sound . He tried again and again , but
scarf . He tries to pull them off with his mouth
Token there
Token ID612
Feature activation-2.39e-02

Pos logit contributions
 going+0.052
 returning+0.051
all+0.050
 pointing+0.050
 buying+0.050
Neg logit contributions
ig-0.054
uffy-0.048
aj-0.048
 fort-0.047
iper-0.046
Token were
Token ID547
Feature activation+4.46e-03

Pos logit contributions
 going+0.043
 pointing+0.042
all+0.042
 returning+0.041
't+0.041
Neg logit contributions
ig-0.035
ches-0.032
iper-0.032
aj-0.032
 fort-0.032
Token two
Token ID734
Feature activation+7.37e-02

Pos logit contributions
ig+0.009
iper+0.008
aj+0.008
 fort+0.008
eh+0.008
Neg logit contributions
 pointing-0.005
 returning-0.005
 going-0.005
 buying-0.005
't-0.005
Token friends
Token ID2460
Feature activation-8.47e-03

Pos logit contributions
ches+0.163
ig+0.160
ital+0.160
 web+0.158
ox+0.157
Neg logit contributions
 going-0.080
 him-0.072
 pointing-0.066
 Joey-0.061
 Mike-0.061
Token,
Token ID11
Feature activation-9.49e-03

Pos logit contributions
 going+0.012
 him+0.012
 pointing+0.011
 returning+0.011
all+0.011
Neg logit contributions
ig-0.016
aj-0.016
ches-0.016
iper-0.016
 fort-0.015
Token Bre
Token ID3719
Feature activation+1.90e-01

Pos logit contributions
 going+0.011
 pointing+0.011
all+0.010
 returning+0.010
't+0.010
Neg logit contributions
ig-0.021
aj-0.020
ches-0.020
iper-0.020
 fort-0.019
Tokenanna
Token ID7697
Feature activation+7.40e-02

Pos logit contributions
ig+0.347
 fort+0.328
 web+0.318
aj+0.317
ches+0.312
Neg logit contributions
all-0.279
 going-0.278
 pointing-0.262
 him-0.259
 returning-0.250
Token and
Token ID290
Feature activation+2.89e-02

Pos logit contributions
ig+0.137
iper+0.129
ches+0.129
aj+0.128
 fort+0.126
Neg logit contributions
 going-0.104
 pointing-0.102
all-0.101
 returning-0.100
't-0.099
Token Mason
Token ID14737
Feature activation+1.17e-01

Pos logit contributions
iper+0.059
ig+0.058
ches+0.055
ox+0.055
uffy+0.054
Neg logit contributions
all-0.032
 returning-0.032
 going-0.031
 pointing-0.030
't-0.030
Token.
Token ID13
Feature activation-4.43e-02

Pos logit contributions
ig+0.292
aj+0.280
ches+0.279
iper+0.279
 fort+0.277
Neg logit contributions
 going-0.096
 pointing-0.091
all-0.085
 him-0.085
 returning-0.085
Token They
Token ID1119
Feature activation+5.57e-02

Pos logit contributions
 returning+0.057
 going+0.056
all+0.055
 pointing+0.054
't+0.054
Neg logit contributions
ig-0.086
iper-0.083
ches-0.082
 fort-0.082
aj-0.081
Token edge
Token ID5743
Feature activation-2.02e-02

Pos logit contributions
ig+0.245
iper+0.226
ches+0.225
aj+0.223
ox+0.218
Neg logit contributions
 going-0.271
 pointing-0.256
 returning-0.255
 him-0.254
't-0.252
Token of
Token ID286
Feature activation-9.80e-03

Pos logit contributions
 going+0.015
 pointing+0.014
all+0.014
 returning+0.014
't+0.013
Neg logit contributions
ig-0.051
iper-0.049
ches-0.049
aj-0.049
 fort-0.048
Token a
Token ID257
Feature activation+9.70e-02

Pos logit contributions
all+0.012
't+0.012
 going+0.012
 returning+0.012
 pointing+0.012
Neg logit contributions
ig-0.019
iper-0.018
 fort-0.018
 web-0.017
aj-0.017
Token lake
Token ID13546
Feature activation-2.23e-02

Pos logit contributions
ig+0.173
iper+0.169
aj+0.164
ches+0.163
ox+0.162
Neg logit contributions
 going-0.148
 pointing-0.143
 returning-0.141
't-0.139
 rep-0.135
Token.
Token ID13
Feature activation-4.08e-02

Pos logit contributions
 going+0.024
all+0.023
't+0.023
 returning+0.023
 pointing+0.023
Neg logit contributions
ig-0.048
 fort-0.046
ches-0.045
aj-0.045
iper-0.044
Token Bre
Token ID3719
Feature activation+1.87e-01

Pos logit contributions
 going+0.052
 returning+0.050
all+0.050
 pointing+0.049
't+0.048
Neg logit contributions
ig-0.081
iper-0.078
aj-0.077
ches-0.077
 fort-0.075
Tokenanna
Token ID7697
Feature activation+6.79e-02

Pos logit contributions
 fort+0.277
ig+0.267
ringe+0.254
 web+0.249
aj+0.240
Neg logit contributions
all-0.354
 going-0.336
 pointing-0.325
 Mike-0.314
 him-0.314
Token said
Token ID531
Feature activation-4.53e-02

Pos logit contributions
ig+0.113
iper+0.107
ches+0.106
 fort+0.106
aj+0.105
Neg logit contributions
 going-0.111
 pointing-0.110
all-0.106
 returning-0.105
't-0.105
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
all+0.076
 going+0.075
 returning+0.075
 buying+0.074
Welcome+0.072
Neg logit contributions
ig-0.070
iper-0.061
ches-0.060
 fort-0.060
ox-0.059
Token "
Token ID366
Feature activation-2.67e-02

Pos logit contributions
no+0.031
reating+0.030
 possible+0.029
Welcome+0.029
 happen+0.029
Neg logit contributions
ches-0.018
aj-0.017
ig-0.017
ows-0.017
ox-0.016
TokenWe
Token ID1135
Feature activation+2.98e-03

Pos logit contributions
 going+0.032
 returning+0.031
 buying+0.030
 pointing+0.030
't+0.030
Neg logit contributions
ig-0.052
ox-0.050
iper-0.050
ches-0.050
c-0.048
TokenYes
Token ID5297
Feature activation-6.16e-02

Pos logit contributions
 going+0.032
 returning+0.031
 buying+0.030
 pointing+0.029
 rep+0.028
Neg logit contributions
ig-0.058
ches-0.056
ox-0.056
iper-0.056
ets-0.054
Token!"
Token ID2474
Feature activation-7.69e-02

Pos logit contributions
 going+0.082
 pointing+0.080
 returning+0.080
 buying+0.078
all+0.078
Neg logit contributions
ig-0.122
iper-0.113
ches-0.113
aj-0.112
 fort-0.111
Token
Token ID198
Feature activation-6.68e-03

Pos logit contributions
 going+0.099
 pointing+0.096
all+0.095
 returning+0.095
't+0.093
Neg logit contributions
ig-0.154
iper-0.145
ches-0.145
aj-0.144
 fort-0.142
Token
Token ID198
Feature activation-4.55e-03

Pos logit contributions
 going+0.007
 pointing+0.006
 returning+0.006
 buying+0.006
all+0.006
Neg logit contributions
ig-0.015
ches-0.015
aj-0.014
iper-0.014
 fort-0.014
TokenSo
Token ID2396
Feature activation-3.70e-02

Pos logit contributions
 going+0.007
 pointing+0.007
 returning+0.006
all+0.006
't+0.006
Neg logit contributions
ig-0.008
iper-0.008
ches-0.008
aj-0.008
 fort-0.008
Token Bre
Token ID3719
Feature activation+1.85e-01

Pos logit contributions
 going+0.047
 returning+0.047
 pointing+0.046
all+0.045
't+0.044
Neg logit contributions
ig-0.075
iper-0.071
ches-0.068
aj-0.067
 fort-0.067
Tokenanna
Token ID7697
Feature activation+6.17e-02

Pos logit contributions
ig+0.262
 fort+0.259
 web+0.248
aj+0.240
uddle+0.231
Neg logit contributions
all-0.346
 going-0.331
 him-0.315
 pointing-0.315
 Mike-0.306
Token and
Token ID290
Feature activation+2.31e-02

Pos logit contributions
ig+0.113
aj+0.108
ches+0.108
iper+0.107
 fort+0.107
Neg logit contributions
 going-0.089
 pointing-0.086
all-0.085
 returning-0.083
't-0.083
Token Mason
Token ID14737
Feature activation+1.14e-01

Pos logit contributions
ig+0.041
iper+0.041
ches+0.039
ox+0.037
aj+0.037
Neg logit contributions
 returning-0.034
 going-0.034
 buying-0.033
all-0.032
 pointing-0.032
Token packed
Token ID11856
Feature activation-1.10e-01

Pos logit contributions
ig+0.208
aj+0.206
 web+0.202
 fort+0.199
ches+0.196
Neg logit contributions
 going-0.168
 pointing-0.159
 him-0.157
all-0.154
 returning-0.151
Token some
Token ID617
Feature activation+2.18e-02

Pos logit contributions
 going+0.117
 pointing+0.113
all+0.110
 returning+0.109
't+0.107
Neg logit contributions
ig-0.246
iper-0.235
ches-0.234
aj-0.233
 fort-0.230
Token.
Token ID13
Feature activation-4.28e-02

Pos logit contributions
 going+0.067
 pointing+0.067
all+0.067
 returning+0.066
 Crab+0.066
Neg logit contributions
ig-0.123
ches-0.117
iper-0.116
aj-0.114
 web-0.112
Token
Token ID220
Feature activation-2.65e-02

Pos logit contributions
 going+0.049
 returning+0.046
 pointing+0.046
all+0.044
 buying+0.044
Neg logit contributions
ig-0.091
aj-0.087
iper-0.087
ches-0.087
ox-0.085
Token
Token ID198
Feature activation-9.94e-04

Pos logit contributions
 going+0.028
 returning+0.027
 pointing+0.027
 buying+0.026
all+0.026
Neg logit contributions
ig-0.058
aj-0.057
ches-0.056
iper-0.055
uffy-0.054
Token
Token ID198
Feature activation+1.51e-03

Pos logit contributions
 going+0.001
 pointing+0.001
 returning+0.001
 buying+0.001
all+0.001
Neg logit contributions
ig-0.002
ches-0.002
iper-0.002
aj-0.002
ox-0.002
TokenBoth
Token ID10265
Feature activation+2.28e-02

Pos logit contributions
ig+0.003
ches+0.002
iper+0.002
aj+0.002
 fort+0.002
Neg logit contributions
 going-0.002
 pointing-0.002
 returning-0.002
 buying-0.002
all-0.002
Token Mir
Token ID7381
Feature activation+1.84e-01

Pos logit contributions
ig+0.045
iper+0.043
ches+0.043
aj+0.043
 fort+0.042
Neg logit contributions
 going-0.030
 pointing-0.029
all-0.029
 returning-0.028
't-0.028
Tokena
Token ID64
Feature activation+1.51e-01

Pos logit contributions
 fort+0.289
uddle+0.259
 ramp+0.253
 web+0.253
aj+0.252
Neg logit contributions
all-0.357
 going-0.310
 pointing-0.306
 Lib-0.305
 ever-0.302
Token and
Token ID290
Feature activation+4.61e-02

Pos logit contributions
aj+0.267
 fort+0.256
ig+0.254
ox+0.248
iper+0.240
Neg logit contributions
 going-0.232
all-0.224
 pointing-0.220
 returning-0.219
 him-0.218
Token Ha
Token ID9398
Feature activation-2.34e-02

Pos logit contributions
ig+0.086
iper+0.082
ches+0.081
aj+0.079
ox+0.078
Neg logit contributions
 going-0.065
 pointing-0.063
 returning-0.063
all-0.062
't-0.061
Tokeniley
Token ID9618
Feature activation+1.03e-01

Pos logit contributions
all+0.039
 him+0.037
 going+0.037
 Mike+0.036
 pointing+0.036
Neg logit contributions
 fort-0.039
aj-0.038
ches-0.037
 web-0.037
ables-0.036
Token learned
Token ID4499
Feature activation-1.02e-01

Pos logit contributions
ig+0.181
aj+0.174
iper+0.171
 fort+0.170
ches+0.170
Neg logit contributions
 going-0.156
 pointing-0.151
all-0.148
't-0.147
 returning-0.147
Token packed
Token ID11856
Feature activation-1.10e-01

Pos logit contributions
ig+0.208
aj+0.206
 web+0.202
 fort+0.199
ches+0.196
Neg logit contributions
 going-0.168
 pointing-0.159
 him-0.157
all-0.154
 returning-0.151
Token some
Token ID617
Feature activation+2.18e-02

Pos logit contributions
 going+0.117
 pointing+0.113
all+0.110
 returning+0.109
't+0.107
Neg logit contributions
ig-0.246
iper-0.235
ches-0.234
aj-0.233
 fort-0.230
Token snacks
Token ID28487
Feature activation-8.25e-02

Pos logit contributions
ig+0.037
iper+0.035
ches+0.034
aj+0.034
ox+0.034
Neg logit contributions
 going-0.035
 him-0.033
 pointing-0.033
 returning-0.033
all-0.032
Token,
Token ID11
Feature activation-1.70e-02

Pos logit contributions
 going+0.124
 pointing+0.123
 returning+0.123
all+0.121
't+0.121
Neg logit contributions
ig-0.142
ches-0.133
 fort-0.133
iper-0.133
aj-0.132
Token grabbed
Token ID13176
Feature activation-9.11e-02

Pos logit contributions
 going+0.027
 pointing+0.025
 him+0.024
 buying+0.024
 returning+0.023
Neg logit contributions
ig-0.034
 web-0.031
ches-0.030
uffy-0.030
aj-0.029
Token Bre
Token ID3719
Feature activation+1.82e-01

Pos logit contributions
 going+0.094
 pointing+0.090
all+0.087
 returning+0.087
 him+0.085
Neg logit contributions
ig-0.206
aj-0.196
iper-0.195
ches-0.195
 fort-0.193
Tokenanna
Token ID7697
Feature activation+6.87e-02

Pos logit contributions
 fort+0.240
 web+0.237
ig+0.234
 ramp+0.221
 slammed+0.221
Neg logit contributions
all-0.339
 going-0.323
 him-0.311
 pointing-0.302
Welcome-0.296
Token's
Token ID338
Feature activation+9.20e-02

Pos logit contributions
ig+0.122
aj+0.116
iper+0.115
 fort+0.114
ches+0.114
Neg logit contributions
all-0.098
 going-0.097
't-0.097
 pointing-0.097
 returning-0.096
Token cabin
Token ID9351
Feature activation-3.99e-02

Pos logit contributions
ig+0.120
ches+0.111
iper+0.111
aj+0.108
ox+0.104
Neg logit contributions
 going-0.183
 pointing-0.179
 returning-0.176
all-0.175
 him-0.174
Token,
Token ID11
Feature activation-2.26e-02

Pos logit contributions
 going+0.041
 pointing+0.040
all+0.039
 returning+0.039
't+0.038
Neg logit contributions
ig-0.090
iper-0.085
ches-0.085
aj-0.085
 fort-0.083
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
 going+0.033
 pointing+0.032
 returning+0.031
all+0.031
 buying+0.030
Neg logit contributions
ig-0.043
ches-0.040
iper-0.040
aj-0.040
 fort-0.039
Token
Token ID198
Feature activation+8.16e-03

Pos logit contributions
 going+0.046
 returning+0.044
all+0.042
 buying+0.041
 pointing+0.041
Neg logit contributions
ig-0.086
aj-0.081
ches-0.081
ox-0.080
iper-0.080
Token
Token ID198
Feature activation+9.04e-03

Pos logit contributions
ig+0.019
ches+0.018
aj+0.018
iper+0.018
 fort+0.017
Neg logit contributions
 going-0.008
 returning-0.008
 pointing-0.008
all-0.008
 buying-0.007
TokenOne
Token ID3198
Feature activation+1.81e-02

Pos logit contributions
ig+0.016
ches+0.015
iper+0.015
aj+0.015
ox+0.014
Neg logit contributions
 going-0.014
 returning-0.014
 pointing-0.013
all-0.013
 buying-0.013
Token day
Token ID1110
Feature activation-3.98e-02

Pos logit contributions
ches+0.045
iper+0.045
aj+0.045
ig+0.044
auldron+0.044
Neg logit contributions
 going-0.015
all-0.013
 pointing-0.013
 it-0.011
't-0.011
Token,
Token ID11
Feature activation-6.94e-03

Pos logit contributions
 going+0.059
 returning+0.058
all+0.058
 pointing+0.056
 buying+0.056
Neg logit contributions
ig-0.072
aj-0.065
iper-0.065
ches-0.065
 fort-0.064
Token Mir
Token ID7381
Feature activation+1.81e-01

Pos logit contributions
 going+0.007
 returning+0.007
all+0.007
 pointing+0.007
't+0.007
Neg logit contributions
ig-0.015
iper-0.015
ches-0.014
aj-0.014
 fort-0.014
Tokena
Token ID64
Feature activation+1.41e-01

Pos logit contributions
 fort+0.258
 web+0.244
uddle+0.244
aj+0.239
 proper+0.233
Neg logit contributions
all-0.336
 going-0.316
 pointing-0.306
 ever-0.297
 him-0.288
Token and
Token ID290
Feature activation+4.69e-02

Pos logit contributions
ig+0.246
iper+0.230
aj+0.224
 web+0.219
 fort+0.218
Neg logit contributions
 going-0.231
 pointing-0.213
 him-0.205
't-0.204
 rep-0.200
Token Ben
Token ID3932
Feature activation+1.19e-01

Pos logit contributions
ig+0.076
iper+0.074
ches+0.072
aj+0.071
ox+0.070
Neg logit contributions
 going-0.075
 returning-0.075
all-0.074
 pointing-0.074
't-0.073
Token see
Token ID766
Feature activation-5.84e-02

Pos logit contributions
ig+0.261
aj+0.249
iper+0.245
ches+0.243
 fort+0.238
Neg logit contributions
 going-0.141
 pointing-0.131
 him-0.129
't-0.128
 returning-0.126
Token a
Token ID257
Feature activation+9.13e-02

Pos logit contributions
 going+0.063
all+0.062
 pointing+0.060
 returning+0.060
't+0.059
Neg logit contributions
ig-0.128
iper-0.122
ches-0.121
aj-0.121
 fort-0.120
Token playing
Token ID2712
Feature activation-1.08e-01

Pos logit contributions
 Crab+0.095
Welcome+0.093
all+0.092
 Lib+0.092
't+0.090
Neg logit contributions
ig-0.188
 web-0.176
ches-0.175
 fort-0.173
iper-0.173
Token.
Token ID13
Feature activation-3.74e-02

Pos logit contributions
 going+0.097
 pointing+0.094
 him+0.091
 it+0.089
 along+0.075
Neg logit contributions
iper-0.274
aj-0.272
ig-0.263
ox-0.257
ches-0.256
Token Tim
Token ID5045
Feature activation+1.36e-01

Pos logit contributions
 going+0.052
 returning+0.050
all+0.050
 pointing+0.049
't+0.048
Neg logit contributions
ig-0.066
 fort-0.065
ches-0.064
aj-0.063
iper-0.063
Token cl
Token ID537
Feature activation-8.90e-02

Pos logit contributions
aj+0.259
 spine+0.251
 fort+0.250
ox+0.245
auldron+0.245
Neg logit contributions
 going-0.193
 pointing-0.183
all-0.175
 it-0.168
 him-0.166
Tokenapped
Token ID6320
Feature activation-9.75e-02

Pos logit contributions
 going+0.115
 pointing+0.110
 returning+0.108
all+0.108
't+0.107
Neg logit contributions
ig-0.182
iper-0.172
aj-0.171
ches-0.170
 fort-0.167
Token his
Token ID465
Feature activation+1.81e-01

Pos logit contributions
 going+0.093
 him+0.091
 pointing+0.091
all+0.086
 returning+0.084
Neg logit contributions
ig-0.232
aj-0.222
iper-0.221
ches-0.220
ox-0.216
Token hands
Token ID2832
Feature activation-2.56e-02

Pos logit contributions
auldron+0.373
eh+0.371
issy+0.361
ches+0.360
isk+0.355
Neg logit contributions
 pointing-0.206
 going-0.172
 him-0.157
 giving-0.152
't-0.152
Token and
Token ID290
Feature activation+3.63e-02

Pos logit contributions
 returning+0.039
Welcome+0.039
 going+0.039
 buying+0.039
't+0.038
Neg logit contributions
ig-0.042
ches-0.040
 fort-0.038
iper-0.038
ets-0.037
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
ig+0.067
iper+0.063
aj+0.063
ches+0.063
ox+0.061
Neg logit contributions
 going-0.054
 pointing-0.054
 returning-0.051
all-0.051
 him-0.051
Token big
Token ID1263
Feature activation+7.43e-02

Pos logit contributions
ig+0.289
ches+0.264
ox+0.262
aj+0.260
iper+0.255
Neg logit contributions
 going-0.299
 pointing-0.280
 buying-0.276
't-0.271
 returning-0.270
Token boxes
Token ID10559
Feature activation-2.13e-02

Pos logit contributions
ig+0.120
ches+0.114
ox+0.112
aj+0.112
iper+0.111
Neg logit contributions
 going-0.130
 pointing-0.125
 him-0.121
 buying-0.121
 returning-0.119
Token appeared
Token ID4120
Feature activation-6.19e-02

Pos logit contributions
 going+0.074
 pointing+0.073
 returning+0.071
all+0.071
't+0.070
Neg logit contributions
ig-0.081
iper-0.077
ches-0.076
aj-0.076
 fort-0.074
Token.
Token ID13
Feature activation-3.89e-02

Pos logit contributions
't+0.078
 buying+0.076
all+0.075
 pointing+0.075
 Crab+0.074
Neg logit contributions
ig-0.120
ches-0.115
 fort-0.112
 web-0.111
aj-0.109
Token He
Token ID679
Feature activation+9.46e-02

Pos logit contributions
 going+0.043
 pointing+0.043
 returning+0.041
all+0.041
't+0.041
Neg logit contributions
ig-0.086
ches-0.080
iper-0.080
aj-0.080
 fort-0.079
Token cl
Token ID537
Feature activation-9.96e-02

Pos logit contributions
ig+0.176
aj+0.166
iper+0.164
 fort+0.162
ets+0.160
Neg logit contributions
 pointing-0.149
 going-0.148
 returning-0.141
 him-0.137
't-0.135
Tokenapped
Token ID6320
Feature activation-1.01e-01

Pos logit contributions
 going+0.147
 returning+0.135
 pointing+0.135
't+0.135
 buying+0.133
Neg logit contributions
ig-0.195
iper-0.179
aj-0.179
ox-0.174
ches-0.174
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.101
 pointing+0.098
all+0.096
 returning+0.096
't+0.095
Neg logit contributions
ig-0.229
iper-0.219
ches-0.219
aj-0.217
 fort-0.215
Token hands
Token ID2832
Feature activation-2.83e-02

Pos logit contributions
issy+0.315
aj+0.313
ig+0.309
ital+0.305
leton+0.305
Neg logit contributions
 pointing-0.264
 going-0.252
 returning-0.231
 along-0.222
 him-0.221
Token and
Token ID290
Feature activation+3.64e-02

Pos logit contributions
Welcome+0.046
 buying+0.046
't+0.046
 returning+0.045
 going+0.045
Neg logit contributions
ig-0.042
ches-0.042
 fort-0.039
 web-0.038
ets-0.038
Token shouted
Token ID17293
Feature activation-7.17e-02

Pos logit contributions
ig+0.070
iper+0.066
aj+0.065
ches+0.065
 fort+0.064
Neg logit contributions
 going-0.052
 pointing-0.052
 returning-0.050
 him-0.049
all-0.048
Token "
Token ID366
Feature activation-1.62e-02

Pos logit contributions
Welcome+0.114
 buying+0.110
no+0.110
 returning+0.110
all+0.109
Neg logit contributions
ches-0.110
ig-0.109
ows-0.106
 web-0.103
 fort-0.103
TokenGood
Token ID10248
Feature activation+2.60e-02

Pos logit contributions
 going+0.021
 pointing+0.020
 buying+0.020
 returning+0.020
't+0.018
Neg logit contributions
ig-0.031
ches-0.031
iper-0.030
eh-0.029
ox-0.029
Token went
Token ID1816
Feature activation-9.02e-02

Pos logit contributions
ig+0.123
aj+0.122
 web+0.116
ox+0.116
 fort+0.115
Neg logit contributions
 going-0.099
 him-0.096
 pointing-0.091
 returning-0.090
all-0.089
Token home
Token ID1363
Feature activation-4.92e-02

Pos logit contributions
 pointing+0.132
all+0.131
 going+0.130
't+0.129
 returning+0.127
Neg logit contributions
ig-0.166
 fort-0.153
ches-0.151
aj-0.150
iper-0.149
Token.
Token ID13
Feature activation-3.62e-02

Pos logit contributions
 going+0.052
all+0.052
 pointing+0.051
't+0.051
Welcome+0.050
Neg logit contributions
ig-0.105
aj-0.101
 fort-0.100
ches-0.098
iper-0.097
Token<|endoftext|>
Token ID50256
Feature activation-1.74e-02

Pos logit contributions
 going+0.046
all+0.045
 pointing+0.043
no+0.043
 returning+0.043
Neg logit contributions
iper-0.067
aj-0.067
ig-0.066
ches-0.064
ables-0.063
TokenMe
Token ID5308
Feature activation+6.15e-02

Pos logit contributions
 going+0.027
 returning+0.027
all+0.026
 pointing+0.026
 buying+0.026
Neg logit contributions
ig-0.029
ches-0.027
 fort-0.026
uffy-0.026
aj-0.026
Tokengan
Token ID1030
Feature activation+1.80e-01

Pos logit contributions
ig+0.146
ches+0.140
 fort+0.139
 web+0.139
aj+0.137
Neg logit contributions
 going-0.059
all-0.056
 pointing-0.055
 him-0.055
 Mike-0.052
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
ig+0.301
iper+0.287
ches+0.284
aj+0.282
 fort+0.280
Neg logit contributions
 going-0.293
 pointing-0.283
all-0.277
 returning-0.276
 him-0.275
Token playing
Token ID2712
Feature activation-1.16e-01

Pos logit contributions
ig+0.030
iper+0.028
aj+0.028
ches+0.028
 fort+0.028
Neg logit contributions
 going-0.014
 pointing-0.014
all-0.014
 returning-0.013
't-0.013
Token in
Token ID287
Feature activation+2.35e-02

Pos logit contributions
 going+0.110
 pointing+0.104
 him+0.101
all+0.099
 returning+0.098
Neg logit contributions
ig-0.271
iper-0.265
aj-0.265
ches-0.262
ox-0.257
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
ig+0.049
 fort+0.046
iper+0.046
 web+0.045
aj+0.045
Neg logit contributions
all-0.026
 going-0.025
 pointing-0.025
 returning-0.025
't-0.025
Token park
Token ID3952
Feature activation-3.97e-02

Pos logit contributions
ig+0.204
ches+0.195
iper+0.192
aj+0.187
ox+0.181
Neg logit contributions
 going-0.313
 pointing-0.303
 returning-0.298
 him-0.293
all-0.291
Token very
Token ID845
Feature activation-6.64e-03

Pos logit contributions
aj+0.020
ig+0.020
ches+0.019
 web+0.019
 fort+0.019
Neg logit contributions
Welcome-0.020
all-0.019
 returning-0.019
't-0.019
no-0.019
Token happy
Token ID3772
Feature activation-4.78e-02

Pos logit contributions
 buying+0.012
 returning+0.012
 going+0.012
 him+0.011
all+0.011
Neg logit contributions
aj-0.009
ig-0.008
 forgot-0.008
uffy-0.008
eh-0.008
Token.
Token ID13
Feature activation-3.84e-02

Pos logit contributions
Welcome+0.074
't+0.070
 returning+0.070
 Abby+0.069
 buying+0.068
Neg logit contributions
ig-0.076
 fort-0.075
 web-0.071
ets-0.069
aj-0.068
Token He
Token ID679
Feature activation+9.27e-02

Pos logit contributions
 going+0.045
 pointing+0.043
all+0.043
 returning+0.043
't+0.041
Neg logit contributions
ig-0.078
aj-0.076
 fort-0.076
ches-0.076
iper-0.076
Token waved
Token ID26834
Feature activation-8.23e-02

Pos logit contributions
ig+0.184
 fort+0.175
iper+0.172
aj+0.171
ox+0.169
Neg logit contributions
 going-0.133
 pointing-0.126
all-0.120
 him-0.120
't-0.117
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.097
 pointing+0.094
all+0.093
 returning+0.093
't+0.092
Neg logit contributions
ig-0.172
iper-0.164
ches-0.163
aj-0.162
 fort-0.161
Token tail
Token ID7894
Feature activation+9.47e-03

Pos logit contributions
ig+0.251
aj+0.238
iper+0.233
 fort+0.230
eh+0.225
Neg logit contributions
 pointing-0.342
 going-0.334
't-0.321
 returning-0.311
 him-0.311
Token even
Token ID772
Feature activation+2.85e-02

Pos logit contributions
ig+0.018
ches+0.017
iper+0.017
 fort+0.017
 web+0.017
Neg logit contributions
 going-0.012
all-0.012
 buying-0.012
 returning-0.012
't-0.012
Token faster
Token ID5443
Feature activation-4.23e-02

Pos logit contributions
uffy+0.060
iper+0.059
ig+0.059
ox+0.058
ables+0.056
Neg logit contributions
 going-0.037
 pointing-0.035
 him-0.033
 along-0.032
 returning-0.031
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
 going+0.050
 pointing+0.049
all+0.048
 returning+0.047
 him+0.046
Neg logit contributions
ig-0.089
iper-0.085
aj-0.085
ches-0.085
 fort-0.083
Token Lucy
Token ID22162
Feature activation+1.60e-01

Pos logit contributions
 going+0.043
 pointing+0.041
all+0.040
 returning+0.040
 him+0.039
Neg logit contributions
ig-0.071
aj-0.070
ches-0.069
 fort-0.069
iper-0.069
Token move
Token ID1445
Feature activation-9.08e-02

Pos logit contributions
ig+0.118
 web+0.115
 fort+0.115
ches+0.113
house+0.106
Neg logit contributions
all-0.097
 Crab-0.096
 buying-0.095
't-0.095
 Mike-0.094
Token.
Token ID13
Feature activation-3.75e-02

Pos logit contributions
't+0.119
 going+0.118
 returning+0.117
 buying+0.117
all+0.115
Neg logit contributions
ig-0.173
ches-0.171
 fort-0.161
iper-0.157
ets-0.157
Token He
Token ID679
Feature activation+9.57e-02

Pos logit contributions
 going+0.053
 returning+0.051
all+0.050
 pointing+0.050
't+0.049
Neg logit contributions
ig-0.066
ches-0.065
iper-0.065
aj-0.065
 fort-0.063
Token cl
Token ID537
Feature activation-9.95e-02

Pos logit contributions
aj+0.180
ig+0.178
 fort+0.177
uffy+0.176
iper+0.174
Neg logit contributions
 pointing-0.141
 going-0.134
 him-0.130
 it-0.128
all-0.128
Tokenapped
Token ID6320
Feature activation-1.03e-01

Pos logit contributions
 going+0.152
't+0.140
 returning+0.139
 pointing+0.138
 buying+0.136
Neg logit contributions
ig-0.195
iper-0.175
aj-0.175
ox-0.170
ches-0.167
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.108
't+0.103
 returning+0.103
 pointing+0.103
all+0.103
Neg logit contributions
ig-0.225
ches-0.215
iper-0.214
 fort-0.210
aj-0.210
Token hands
Token ID2832
Feature activation-2.75e-02

Pos logit contributions
 fort+0.346
aj+0.342
ital+0.340
auldron+0.330
isk+0.329
Neg logit contributions
 pointing-0.249
 it-0.216
 going-0.209
 him-0.202
 reaching-0.196
Token and
Token ID290
Feature activation+3.59e-02

Pos logit contributions
 going+0.044
 buying+0.044
Welcome+0.044
 returning+0.044
't+0.043
Neg logit contributions
ig-0.044
ches-0.041
iper-0.039
 fort-0.038
ets-0.038
Token his
Token ID465
Feature activation+1.68e-01

Pos logit contributions
ig+0.063
aj+0.060
iper+0.060
ches+0.059
 fort+0.058
Neg logit contributions
 going-0.056
 pointing-0.056
 returning-0.054
all-0.053
 him-0.053
Token mom
Token ID1995
Feature activation+6.74e-02

Pos logit contributions
aj+0.268
ches+0.263
ig+0.258
 fort+0.245
iper+0.244
Neg logit contributions
 pointing-0.281
 going-0.270
 him-0.257
 it-0.250
no-0.247
Token came
Token ID1625
Feature activation-6.73e-02

Pos logit contributions
ig+0.105
aj+0.099
iper+0.099
ches+0.097
 fort+0.097
Neg logit contributions
 going-0.116
 pointing-0.115
all-0.112
 returning-0.112
 him-0.110
Token were
Token ID547
Feature activation-2.67e-03

Pos logit contributions
 going+0.041
 pointing+0.040
't+0.039
 him+0.039
all+0.039
Neg logit contributions
ig-0.044
iper-0.044
ches-0.043
aj-0.043
 fort-0.041
Token two
Token ID734
Feature activation+7.31e-02

Pos logit contributions
 pointing+0.003
 returning+0.003
 going+0.003
all+0.003
 buying+0.003
Neg logit contributions
ig-0.005
iper-0.005
aj-0.005
 fort-0.005
ox-0.005
Token best
Token ID1266
Feature activation+3.49e-02

Pos logit contributions
ital+0.173
ches+0.172
ables+0.166
 web+0.164
ets+0.162
Neg logit contributions
 going-0.066
 him-0.063
 pointing-0.054
 them-0.049
 Mike-0.048
Token friends
Token ID2460
Feature activation-1.28e-02

Pos logit contributions
ches+0.074
ig+0.070
iper+0.069
aj+0.067
 web+0.066
Neg logit contributions
 going-0.038
 him-0.035
 pointing-0.034
't-0.033
all-0.033
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
 going+0.018
 pointing+0.018
 him+0.018
 returning+0.017
all+0.017
Neg logit contributions
ig-0.024
aj-0.024
ches-0.023
iper-0.023
 fort-0.023
Token Mir
Token ID7381
Feature activation+1.80e-01

Pos logit contributions
 going+0.015
 pointing+0.014
 him+0.013
 returning+0.013
all+0.013
Neg logit contributions
ig-0.027
ches-0.026
aj-0.026
iper-0.025
ets-0.025
Tokena
Token ID64
Feature activation+1.54e-01

Pos logit contributions
uddle+0.239
 fort+0.235
aj+0.228
ig+0.227
ches+0.227
Neg logit contributions
all-0.346
 going-0.327
 pointing-0.322
 him-0.317
 ever-0.313
Token and
Token ID290
Feature activation+4.62e-02

Pos logit contributions
ig+0.364
aj+0.344
 fort+0.342
ches+0.342
iper+0.342
Neg logit contributions
 going-0.144
 pointing-0.142
 him-0.137
all-0.134
 returning-0.129
Token Max
Token ID5436
Feature activation+1.23e-01

Pos logit contributions
iper+0.072
ig+0.070
ox+0.067
ppy+0.066
 fort+0.066
Neg logit contributions
all-0.076
 returning-0.070
't-0.070
no-0.068
 pointing-0.068
Token.
Token ID13
Feature activation-3.99e-02

Pos logit contributions
ig+0.295
iper+0.283
ches+0.282
aj+0.281
 fort+0.278
Neg logit contributions
 going-0.108
 pointing-0.103
all-0.101
 returning-0.100
't-0.097
Token They
Token ID1119
Feature activation+5.62e-02

Pos logit contributions
 going+0.049
all+0.047
 returning+0.047
 pointing+0.046
't+0.045
Neg logit contributions
ig-0.081
iper-0.078
ches-0.078
 fort-0.078
aj-0.077
Token
Token ID198
Feature activation+1.08e-03

Pos logit contributions
 going+0.025
 returning+0.024
 pointing+0.024
 buying+0.023
all+0.023
Neg logit contributions
ig-0.044
aj-0.044
ches-0.044
uffy-0.042
iper-0.042
Token
Token ID198
Feature activation+2.50e-03

Pos logit contributions
ches+0.002
ig+0.002
aj+0.002
iper+0.002
ables+0.002
Neg logit contributions
 going-0.001
 buying-0.001
 returning-0.001
 pointing-0.001
 rep-0.001
TokenHe
Token ID1544
Feature activation+8.72e-02

Pos logit contributions
ig+0.004
ches+0.004
iper+0.003
aj+0.003
 web+0.003
Neg logit contributions
 going-0.004
 pointing-0.004
 returning-0.004
 buying-0.004
 him-0.004
Token cl
Token ID537
Feature activation-9.98e-02

Pos logit contributions
aj+0.170
uffy+0.169
ig+0.167
iper+0.166
 fort+0.165
Neg logit contributions
 going-0.119
all-0.113
 pointing-0.112
 it-0.111
 him-0.110
Tokenapped
Token ID6320
Feature activation-1.02e-01

Pos logit contributions
 going+0.144
 pointing+0.134
 returning+0.133
't+0.132
all+0.130
Neg logit contributions
ig-0.198
aj-0.180
iper-0.180
ches-0.176
ox-0.174
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.101
 pointing+0.097
all+0.095
 returning+0.094
 him+0.091
Neg logit contributions
ig-0.237
iper-0.226
ches-0.226
aj-0.225
 fort-0.222
Token hands
Token ID2832
Feature activation-2.98e-02

Pos logit contributions
auldron+0.339
aj+0.334
eh+0.326
isk+0.325
issy+0.324
Neg logit contributions
 pointing-0.238
 going-0.221
't-0.210
 it-0.206
 along-0.205
Token and
Token ID290
Feature activation+3.30e-02

Pos logit contributions
 buying+0.043
Welcome+0.043
 returning+0.043
 going+0.043
't+0.042
Neg logit contributions
ig-0.051
ches-0.049
 fort-0.047
iper-0.046
ets-0.046
Token started
Token ID2067
Feature activation-8.80e-02

Pos logit contributions
ig+0.061
iper+0.058
aj+0.057
ches+0.057
 fort+0.056
Neg logit contributions
 going-0.049
 pointing-0.048
 returning-0.047
all-0.046
 him-0.046
Token running
Token ID2491
Feature activation-1.04e-01

Pos logit contributions
Welcome+0.120
otion+0.120
 Lib+0.117
 Abby+0.116
all+0.113
Neg logit contributions
ig-0.157
 web-0.148
 fort-0.144
ches-0.142
aj-0.139
Token around
Token ID1088
Feature activation-2.05e-02

Pos logit contributions
 pointing+0.140
 going+0.140
all+0.140
 returning+0.139
 buying+0.138
Neg logit contributions
ig-0.194
ches-0.188
 fort-0.186
 web-0.180
ets-0.178
Token
Token ID198
Feature activation+4.40e-03

Pos logit contributions
 going+0.023
 returning+0.022
 pointing+0.022
 buying+0.021
 rep+0.021
Neg logit contributions
ig-0.039
aj-0.038
ches-0.038
uffy-0.037
iper-0.037
Token
Token ID198
Feature activation+5.58e-03

Pos logit contributions
ig+0.009
ches+0.009
iper+0.009
aj+0.009
ox+0.009
Neg logit contributions
 going-0.005
 returning-0.005
 pointing-0.005
 buying-0.005
all-0.005
TokenOne
Token ID3198
Feature activation+2.13e-02

Pos logit contributions
ig+0.010
iper+0.009
ches+0.009
aj+0.009
 fort+0.009
Neg logit contributions
 going-0.008
 pointing-0.008
 returning-0.008
all-0.008
 him-0.008
Token day
Token ID1110
Feature activation-4.03e-02

Pos logit contributions
ig+0.044
aj+0.042
ches+0.042
iper+0.042
 fort+0.041
Neg logit contributions
 going-0.027
all-0.026
 pointing-0.026
 returning-0.025
't-0.024
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 going+0.061
all+0.060
 returning+0.060
 pointing+0.058
't+0.058
Neg logit contributions
ig-0.071
iper-0.064
aj-0.063
 fort-0.063
ches-0.063
Token Suzanne
Token ID45587
Feature activation+1.80e-01

Pos logit contributions
 going+0.011
 returning+0.011
 pointing+0.010
all+0.010
't+0.010
Neg logit contributions
ig-0.023
iper-0.022
ches-0.021
aj-0.021
 fort-0.021
Token decided
Token ID3066
Feature activation-7.87e-02

Pos logit contributions
ig+0.365
aj+0.354
iper+0.350
 web+0.347
 fort+0.340
Neg logit contributions
 going-0.250
 him-0.222
 pointing-0.217
't-0.215
all-0.213
Token to
Token ID284
Feature activation-2.30e-02

Pos logit contributions
all+0.080
 pointing+0.077
 returning+0.076
Welcome+0.075
 Hipp+0.074
Neg logit contributions
ig-0.159
ches-0.156
iper-0.156
 fort-0.155
aj-0.155
Token go
Token ID467
Feature activation-8.87e-02

Pos logit contributions
 pointing+0.035
all+0.035
 returning+0.035
Welcome+0.034
 going+0.034
Neg logit contributions
ig-0.037
iper-0.036
 fort-0.035
ches-0.035
house-0.035
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 pointing+0.096
all+0.092
 returning+0.091
't+0.089
 going+0.088
Neg logit contributions
ig-0.194
ches-0.185
 fort-0.182
aj-0.180
iper-0.180
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
all+0.017
 pointing+0.017
 returning+0.016
't+0.016
Welcome+0.016
Neg logit contributions
ig-0.024
 fort-0.023
iper-0.023
ches-0.023
 web-0.023
TokenThe
Token ID464
Feature activation+1.53e-01

Pos logit contributions
 going+0.001
 pointing+0.001
 returning+0.001
all+0.001
 buying+0.001
Neg logit contributions
ig-0.001
iper-0.001
ches-0.001
 fort-0.001
aj-0.001
Token little
Token ID1310
Feature activation+7.43e-02

Pos logit contributions
ig+0.228
iper+0.212
ches+0.211
aj+0.210
 fort+0.207
Neg logit contributions
 going-0.277
 pointing-0.271
all-0.268
 returning-0.267
't-0.264
Token boy
Token ID2933
Feature activation+9.05e-02

Pos logit contributions
ches+0.108
ig+0.108
iper+0.105
aj+0.104
ox+0.098
Neg logit contributions
 going-0.137
 pointing-0.136
 him-0.131
all-0.130
 returning-0.129
Token cl
Token ID537
Feature activation-9.36e-02

Pos logit contributions
aj+0.185
iper+0.177
uffy+0.175
ig+0.174
 fort+0.171
Neg logit contributions
all-0.114
 him-0.113
 going-0.113
 pointing-0.111
 it-0.108
Tokenapped
Token ID6320
Feature activation-9.68e-02

Pos logit contributions
 going+0.146
 pointing+0.134
 returning+0.133
't+0.133
 buying+0.130
Neg logit contributions
ig-0.179
aj-0.162
iper-0.161
ches-0.155
ox-0.154
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.102
 pointing+0.098
all+0.096
 returning+0.096
't+0.095
Neg logit contributions
ig-0.216
iper-0.206
ches-0.206
aj-0.204
 fort-0.203
Token hands
Token ID2832
Feature activation-2.68e-02

Pos logit contributions
aj+0.315
ches+0.307
ig+0.297
iper+0.295
ital+0.295
Neg logit contributions
 pointing-0.253
 going-0.249
 him-0.232
 Joey-0.227
't-0.226
Token and
Token ID290
Feature activation+3.34e-02

Pos logit contributions
 going+0.041
 returning+0.040
 pointing+0.040
 buying+0.040
Welcome+0.040
Neg logit contributions
ig-0.045
ches-0.043
iper-0.042
 fort-0.042
ets-0.040
Token said
Token ID531
Feature activation-4.85e-02

Pos logit contributions
ig+0.065
aj+0.062
ches+0.062
iper+0.061
ox+0.060
Neg logit contributions
 going-0.047
 pointing-0.046
 returning-0.044
 him-0.044
all-0.044
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
Welcome+0.092
all+0.090
 going+0.089
 returning+0.089
 buying+0.088
Neg logit contributions
ig-0.066
 fort-0.053
ows-0.052
iper-0.052
ches-0.052
Token "
Token ID366
Feature activation-2.29e-02

Pos logit contributions
Welcome+0.024
reating+0.024
no+0.023
 possible+0.023
all+0.023
Neg logit contributions
ig-0.019
ches-0.018
aj-0.018
 fort-0.017
iper-0.017
Token.
Token ID13
Feature activation-4.08e-02

Pos logit contributions
Welcome+0.085
 Crab+0.085
 returning+0.085
't+0.084
all+0.084
Neg logit contributions
ig-0.139
 fort-0.137
ches-0.131
ows-0.131
ets-0.130
Token Tim
Token ID5045
Feature activation+1.31e-01

Pos logit contributions
 going+0.043
 returning+0.041
all+0.041
 pointing+0.041
't+0.038
Neg logit contributions
ig-0.087
aj-0.085
 fort-0.085
ches-0.083
iper-0.083
Tokenmy
Token ID1820
Feature activation+9.87e-02

Pos logit contributions
ns+0.279
 clinic+0.272
ows+0.270
 web+0.264
eh+0.264
Neg logit contributions
all-0.154
 him-0.143
 going-0.138
 pointing-0.113
't-0.108
Token cl
Token ID537
Feature activation-8.96e-02

Pos logit contributions
iper+0.185
ig+0.182
 fort+0.181
aj+0.180
uffy+0.177
Neg logit contributions
 pointing-0.140
 going-0.139
all-0.137
't-0.127
 it-0.126
Tokenapped
Token ID6320
Feature activation-1.01e-01

Pos logit contributions
 going+0.117
 pointing+0.112
 returning+0.110
all+0.109
't+0.108
Neg logit contributions
ig-0.182
iper-0.171
aj-0.171
ches-0.169
 fort-0.167
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.101
 pointing+0.097
all+0.095
 returning+0.094
't+0.092
Neg logit contributions
ig-0.232
iper-0.221
ches-0.220
aj-0.220
 fort-0.217
Token hands
Token ID2832
Feature activation-2.55e-02

Pos logit contributions
isk+0.351
ches+0.346
eh+0.342
issy+0.340
iper+0.338
Neg logit contributions
 pointing-0.226
 going-0.202
 it-0.183
 giving-0.175
 him-0.175
Token and
Token ID290
Feature activation+3.69e-02

Pos logit contributions
Welcome+0.038
 going+0.037
 returning+0.037
 buying+0.037
't+0.037
Neg logit contributions
ig-0.043
ches-0.040
 fort-0.039
iper-0.038
 web-0.038
Token shouted
Token ID17293
Feature activation-7.04e-02

Pos logit contributions
ig+0.065
iper+0.062
ches+0.061
aj+0.061
 fort+0.059
Neg logit contributions
 going-0.058
 pointing-0.058
 returning-0.055
all-0.055
 him-0.054
Token with
Token ID351
Feature activation-3.83e-02

Pos logit contributions
Welcome+0.116
no+0.114
 returning+0.111
 buying+0.111
 rep+0.110
Neg logit contributions
ig-0.100
ches-0.099
 fort-0.098
ows-0.096
ets-0.094
Token joy
Token ID8716
Feature activation-7.99e-02

Pos logit contributions
all+0.052
 pointing+0.052
't+0.051
 going+0.050
 buying+0.050
Neg logit contributions
 web-0.066
 fort-0.066
ig-0.066
iper-0.066
ets-0.063
Token
Token ID198
Feature activation-5.89e-03

Pos logit contributions
 going+0.031
 returning+0.030
 pointing+0.030
 buying+0.029
all+0.029
Neg logit contributions
ches-0.055
ig-0.055
aj-0.053
iper-0.053
uffy-0.053
Token
Token ID198
Feature activation-2.39e-03

Pos logit contributions
 going+0.005
 pointing+0.005
all+0.005
 returning+0.005
't+0.004
Neg logit contributions
ig-0.014
iper-0.014
ches-0.014
aj-0.014
 fort-0.014
TokenThe
Token ID464
Feature activation+1.48e-01

Pos logit contributions
't+0.003
all+0.003
 pointing+0.003
 returning+0.003
 going+0.003
Neg logit contributions
ig-0.005
aj-0.005
iper-0.004
 fort-0.004
uffy-0.004
Token frog
Token ID21264
Feature activation+6.23e-02

Pos logit contributions
ig+0.223
 fort+0.211
 web+0.205
ches+0.201
aj+0.193
Neg logit contributions
 going-0.261
all-0.259
 returning-0.254
 pointing-0.251
 him-0.249
Token waved
Token ID26834
Feature activation-8.20e-02

Pos logit contributions
aj+0.121
ig+0.120
 fort+0.118
iper+0.115
ets+0.114
Neg logit contributions
all-0.084
 pointing-0.083
 Crab-0.075
 going-0.075
 Joey-0.075
Token his
Token ID465
Feature activation+1.80e-01

Pos logit contributions
 going+0.106
 pointing+0.104
all+0.101
 returning+0.100
 him+0.100
Neg logit contributions
ig-0.165
aj-0.156
iper-0.155
ches-0.155
 fort-0.153
Token foot
Token ID2366
Feature activation-5.78e-03

Pos logit contributions
ig+0.282
house+0.274
ches+0.272
aj+0.269
iper+0.264
Neg logit contributions
 going-0.298
 pointing-0.293
't-0.275
 him-0.274
 returning-0.271
Token around
Token ID1088
Feature activation-1.78e-02

Pos logit contributions
 going+0.008
 pointing+0.007
all+0.007
 returning+0.007
't+0.007
Neg logit contributions
ig-0.011
ches-0.011
iper-0.011
aj-0.011
 fort-0.011
Token.
Token ID13
Feature activation-3.68e-02

Pos logit contributions
 going+0.025
all+0.025
't+0.025
 returning+0.024
 pointing+0.024
Neg logit contributions
ig-0.032
ches-0.030
iper-0.029
 fort-0.029
 web-0.029
Token "
Token ID366
Feature activation-1.75e-02

Pos logit contributions
 going+0.043
 pointing+0.042
all+0.041
 returning+0.041
't+0.040
Neg logit contributions
ig-0.077
iper-0.074
ches-0.073
aj-0.073
 fort-0.072
TokenNo
Token ID2949
Feature activation-8.15e-03

Pos logit contributions
 going+0.021
 returning+0.020
 pointing+0.020
 buying+0.019
 him+0.019
Neg logit contributions
ig-0.035
ches-0.034
iper-0.033
ox-0.033
eh-0.032
Token was
Token ID373
Feature activation+1.78e-02

Pos logit contributions
ig+0.199
iper+0.197
aj+0.194
ches+0.184
 fort+0.181
Neg logit contributions
 going-0.155
 him-0.141
 pointing-0.140
't-0.135
 it-0.131
Token jumping
Token ID14284
Feature activation-9.66e-02

Pos logit contributions
ig+0.034
iper+0.032
ches+0.032
aj+0.032
 fort+0.031
Neg logit contributions
 going-0.024
all-0.023
 pointing-0.023
 returning-0.023
't-0.023
Token around
Token ID1088
Feature activation-2.02e-02

Pos logit contributions
 going+0.096
 pointing+0.089
't+0.088
all+0.087
 him+0.087
Neg logit contributions
ig-0.228
iper-0.223
aj-0.222
ox-0.217
 fort-0.212
Token,
Token ID11
Feature activation-3.28e-03

Pos logit contributions
 returning+0.027
all+0.026
Welcome+0.026
't+0.026
 pointing+0.026
Neg logit contributions
ig-0.037
 fort-0.035
 web-0.034
ches-0.034
iper-0.033
Token waving
Token ID25849
Feature activation-8.37e-02

Pos logit contributions
 going+0.003
 pointing+0.003
all+0.003
 returning+0.003
't+0.003
Neg logit contributions
ig-0.008
iper-0.007
ches-0.007
aj-0.007
 fort-0.007
Token his
Token ID465
Feature activation+1.79e-01

Pos logit contributions
 going+0.090
 returning+0.090
't+0.090
all+0.089
 buying+0.088
Neg logit contributions
ig-0.178
aj-0.168
iper-0.167
 fort-0.166
ches-0.166
Token hand
Token ID1021
Feature activation-3.93e-02

Pos logit contributions
ches+0.295
iper+0.285
ig+0.272
ets+0.261
ns+0.259
Neg logit contributions
 pointing-0.298
 going-0.292
 him-0.270
't-0.260
 returning-0.260
Token in
Token ID287
Feature activation+2.96e-02

Pos logit contributions
 going+0.046
 returning+0.045
all+0.045
't+0.044
 buying+0.043
Neg logit contributions
ig-0.082
ches-0.078
aj-0.077
iper-0.076
 fort-0.075
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
ig+0.060
iper+0.057
ches+0.057
aj+0.057
 fort+0.057
Neg logit contributions
 going-0.036
all-0.035
 pointing-0.035
't-0.034
 returning-0.034
Token air
Token ID1633
Feature activation-2.51e-02

Pos logit contributions
iper+0.325
ig+0.324
ches+0.322
aj+0.308
ox+0.308
Neg logit contributions
 going-0.206
 him-0.192
Welcome-0.191
 pointing-0.190
 returning-0.179
Token.
Token ID13
Feature activation-3.35e-02

Pos logit contributions
 going+0.026
 returning+0.025
all+0.025
 pointing+0.025
't+0.025
Neg logit contributions
ig-0.055
ches-0.052
aj-0.052
 fort-0.052
iper-0.051
Token themselves
Token ID2405
Feature activation+7.75e-02

Pos logit contributions
ig+0.067
 web+0.065
 fort+0.064
ches+0.062
ox+0.061
Neg logit contributions
't-0.040
Welcome-0.040
all-0.040
 going-0.040
 Crab-0.039
Token.
Token ID13
Feature activation-3.57e-02

Pos logit contributions
ig+0.160
 fort+0.153
ches+0.153
 web+0.150
aj+0.145
Neg logit contributions
 Crab-0.088
all-0.088
Welcome-0.086
 returning-0.086
 going-0.085
Token Henry
Token ID8616
Feature activation+1.39e-01

Pos logit contributions
 going+0.048
 returning+0.046
all+0.046
 pointing+0.044
 buying+0.044
Neg logit contributions
iper-0.065
ches-0.063
ig-0.063
 web-0.061
 fort-0.061
Token cl
Token ID537
Feature activation-9.15e-02

Pos logit contributions
ig+0.253
aj+0.251
 fort+0.249
iper+0.241
uffy+0.233
Neg logit contributions
 pointing-0.211
 going-0.203
't-0.199
all-0.197
 him-0.196
Tokenapped
Token ID6320
Feature activation-9.86e-02

Pos logit contributions
 going+0.146
't+0.132
 returning+0.132
 buying+0.132
 pointing+0.131
Neg logit contributions
ig-0.174
iper-0.156
aj-0.155
ox-0.151
ches-0.150
Token his
Token ID465
Feature activation+1.79e-01

Pos logit contributions
 going+0.096
 pointing+0.093
all+0.091
 returning+0.090
't+0.088
Neg logit contributions
ig-0.228
iper-0.217
ches-0.217
aj-0.217
 fort-0.213
Token hands
Token ID2832
Feature activation-2.54e-02

Pos logit contributions
aj+0.313
auldron+0.304
eh+0.303
udd+0.301
ital+0.299
Neg logit contributions
 pointing-0.263
 going-0.242
 him-0.231
't-0.226
 returning-0.225
Token as
Token ID355
Feature activation-5.12e-02

Pos logit contributions
 going+0.036
 buying+0.035
 returning+0.035
all+0.035
Welcome+0.035
Neg logit contributions
ig-0.045
ches-0.043
iper-0.041
 fort-0.041
 web-0.040
Token the
Token ID262
Feature activation+1.58e-01

Pos logit contributions
 going+0.047
all+0.046
 returning+0.045
 pointing+0.045
 buying+0.044
Neg logit contributions
ig-0.114
ches-0.111
iper-0.110
 fort-0.109
aj-0.107
Token tower
Token ID10580
Feature activation-9.00e-03

Pos logit contributions
aj+0.288
ig+0.287
ches+0.284
iper+0.277
ox+0.268
Neg logit contributions
 going-0.241
 pointing-0.227
't-0.221
 rep-0.218
 returning-0.217
Token grew
Token ID6348
Feature activation-1.17e-01

Pos logit contributions
 going+0.014
 pointing+0.014
 returning+0.014
all+0.014
't+0.014
Neg logit contributions
ig-0.015
iper-0.015
aj-0.015
ches-0.014
ox-0.014
Token.
Token ID13
Feature activation-4.16e-02

Pos logit contributions
ig+0.317
iper+0.301
ches+0.300
aj+0.299
 fort+0.295
Neg logit contributions
 going-0.189
 pointing-0.183
all-0.180
 returning-0.179
't-0.177
Token Tim
Token ID5045
Feature activation+1.32e-01

Pos logit contributions
 returning+0.058
 going+0.058
all+0.056
 possible+0.055
no+0.055
Neg logit contributions
aj-0.067
house-0.067
uffy-0.066
 fort-0.065
ig-0.065
Tokenmy
Token ID1820
Feature activation+1.03e-01

Pos logit contributions
ns+0.263
 web+0.261
 fort+0.260
eh+0.254
ars+0.251
Neg logit contributions
all-0.173
 him-0.169
 going-0.161
 Mike-0.139
 pointing-0.138
Token cl
Token ID537
Feature activation-8.66e-02

Pos logit contributions
 fort+0.200
iper+0.194
ig+0.193
eh+0.191
aj+0.187
Neg logit contributions
 going-0.148
 pointing-0.146
all-0.141
't-0.139
 it-0.136
Tokenapped
Token ID6320
Feature activation-9.85e-02

Pos logit contributions
 going+0.119
 pointing+0.112
 returning+0.111
all+0.109
't+0.109
Neg logit contributions
ig-0.173
aj-0.161
iper-0.159
ches-0.157
ox-0.155
Token his
Token ID465
Feature activation+1.79e-01

Pos logit contributions
 going+0.108
 pointing+0.104
all+0.102
 returning+0.101
 him+0.100
Neg logit contributions
ig-0.219
iper-0.207
aj-0.207
ches-0.207
 fort-0.203
Token hands
Token ID2832
Feature activation-2.44e-02

Pos logit contributions
isk+0.333
 fort+0.331
eh+0.330
auldron+0.330
ches+0.328
Neg logit contributions
 pointing-0.247
 going-0.228
't-0.203
 it-0.200
 giving-0.200
Token too
Token ID1165
Feature activation-8.63e-03

Pos logit contributions
 buying+0.037
 returning+0.037
 going+0.037
 Crab+0.037
 pointing+0.037
Neg logit contributions
ig-0.040
ches-0.037
 fort-0.036
aj-0.035
ets-0.035
Token and
Token ID290
Feature activation+2.74e-02

Pos logit contributions
 pointing+0.012
 going+0.012
 returning+0.012
 buying+0.012
all+0.012
Neg logit contributions
ig-0.015
ches-0.015
aj-0.014
ets-0.014
iper-0.014
Token smiled
Token ID13541
Feature activation-4.03e-02

Pos logit contributions
ig+0.051
iper+0.048
ches+0.048
aj+0.048
 fort+0.047
Neg logit contributions
 going-0.038
 pointing-0.037
all-0.037
 returning-0.037
't-0.036
Token.
Token ID13
Feature activation-3.95e-02

Pos logit contributions
 buying+0.061
otion+0.060
't+0.058
 returning+0.058
Welcome+0.057
Neg logit contributions
 fort-0.064
ig-0.063
aj-0.061
ches-0.058
udd-0.058
Token She
Token ID1375
Feature activation+9.39e-02

Pos logit contributions
ig+0.107
aj+0.097
iper+0.097
 fort+0.097
ches+0.096
Neg logit contributions
 pointing-0.077
 going-0.076
 returning-0.075
't-0.074
all-0.073
Token tasted
Token ID29187
Feature activation-1.06e-01

Pos logit contributions
ig+0.172
aj+0.159
iper+0.158
 fort+0.158
ox+0.154
Neg logit contributions
 going-0.143
 him-0.137
 pointing-0.133
 it-0.127
't-0.127
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
Welcome+0.126
 returning+0.123
 pointing+0.122
 rep+0.117
't+0.117
Neg logit contributions
ig-0.198
ches-0.198
 web-0.186
ets-0.184
iper-0.180
Token soup
Token ID17141
Feature activation-1.20e-02

Pos logit contributions
ig+0.263
iper+0.251
ches+0.248
aj+0.247
 fort+0.240
Neg logit contributions
 going-0.227
 pointing-0.219
 returning-0.213
all-0.213
't-0.211
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
 pointing+0.017
 going+0.017
 returning+0.017
all+0.017
't+0.016
Neg logit contributions
ig-0.022
ches-0.021
 fort-0.020
iper-0.020
 web-0.020
Token spat
Token ID15246
Feature activation-2.11e-01

Pos logit contributions
ig+0.033
iper+0.031
aj+0.030
ches+0.030
 fort+0.030
Neg logit contributions
 going-0.025
 pointing-0.024
 returning-0.023
 him-0.023
all-0.023
Token it
Token ID340
Feature activation+5.77e-02

Pos logit contributions
 returning+0.159
Welcome+0.158
 buying+0.153
 going+0.151
 pointing+0.150
Neg logit contributions
ches-0.544
ig-0.542
iper-0.509
ox-0.493
 web-0.492
Token out
Token ID503
Feature activation-1.31e-02

Pos logit contributions
ches+0.113
ig+0.109
 fort+0.104
ets+0.100
 web+0.100
Neg logit contributions
 buying-0.077
 returning-0.076
 pointing-0.076
 Lib-0.075
 Crab-0.074
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
Welcome+0.017
 going+0.017
all+0.017
't+0.017
 Crab+0.017
Neg logit contributions
ig-0.024
 web-0.024
ches-0.023
 fort-0.023
aj-0.022
Token
Token ID198
Feature activation+5.98e-03

Pos logit contributions
 going+0.055
 returning+0.053
all+0.051
 pointing+0.050
 buying+0.049
Neg logit contributions
aj-0.070
ig-0.068
iper-0.067
ches-0.066
 web-0.065
Token
Token ID198
Feature activation+5.54e-03

Pos logit contributions
ig+0.012
aj+0.012
iper+0.012
ches+0.012
ox+0.011
Neg logit contributions
 going-0.007
 pointing-0.007
 returning-0.007
 buying-0.007
all-0.006
Token He
Token ID679
Feature activation+8.35e-02

Pos logit contributions
 going+0.055
 returning+0.052
all+0.051
 pointing+0.051
 buying+0.049
Neg logit contributions
ig-0.086
aj-0.085
iper-0.083
ches-0.082
 fort-0.081
Token cou
Token ID2284
Feature activation-6.95e-02

Pos logit contributions
ig+0.159
aj+0.147
iper+0.146
 fort+0.144
uffy+0.141
Neg logit contributions
 going-0.127
 him-0.123
 it-0.112
 pointing-0.112
't-0.110
Tokeng
Token ID70
Feature activation-9.69e-02

Pos logit contributions
 going+0.124
't+0.121
 pointing+0.120
all+0.119
 returning+0.118
Neg logit contributions
ig-0.106
iper-0.099
aj-0.098
 fort-0.096
ox-0.096
Tokenhed
Token ID704
Feature activation-1.28e-01

Pos logit contributions
 him+0.115
 going+0.114
all+0.112
't+0.110
 returning+0.109
Neg logit contributions
ig-0.206
aj-0.194
 fort-0.193
iper-0.191
ox-0.188
Token and
Token ID290
Feature activation+2.80e-02

Pos logit contributions
 going+0.166
 pointing+0.161
 him+0.158
all+0.157
 returning+0.157
Neg logit contributions
ig-0.258
iper-0.244
aj-0.244
ches-0.240
 fort-0.237
Token spat
Token ID15246
Feature activation-2.07e-01

Pos logit contributions
ig+0.050
iper+0.048
aj+0.045
ches+0.045
 fort+0.045
Neg logit contributions
 going-0.046
 pointing-0.044
 him-0.043
 returning-0.042
't-0.041
Token out
Token ID503
Feature activation-1.16e-02

Pos logit contributions
 returning+0.218
 buying+0.217
 pointing+0.215
 going+0.211
Welcome+0.209
Neg logit contributions
ig-0.473
ches-0.472
iper-0.437
aj-0.428
 fort-0.423
Token some
Token ID617
Feature activation+2.07e-02

Pos logit contributions
 going+0.013
 pointing+0.012
all+0.012
 returning+0.012
't+0.012
Neg logit contributions
ig-0.025
iper-0.024
ches-0.024
aj-0.024
 fort-0.024
Token water
Token ID1660
Feature activation-4.54e-02

Pos logit contributions
ig+0.036
iper+0.034
ches+0.034
aj+0.033
 fort+0.033
Neg logit contributions
 going-0.032
 pointing-0.031
all-0.031
 returning-0.031
't-0.030
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
all+0.058
Welcome+0.056
 pointing+0.055
 Lib+0.055
't+0.055
Neg logit contributions
ig-0.088
ches-0.088
 fort-0.084
aj-0.082
ets-0.081
Token He
Token ID679
Feature activation+8.57e-02

Pos logit contributions
 going+0.049
 returning+0.046
all+0.045
 pointing+0.045
 buying+0.043
Neg logit contributions
ig-0.078
aj-0.078
ches-0.075
iper-0.075
 fort-0.073
TokenIt
Token ID1026
Feature activation+2.49e-02

Pos logit contributions
 going+0.031
 returning+0.030
 pointing+0.028
 buying+0.028
 Lib+0.027
Neg logit contributions
ox-0.038
ches-0.037
ig-0.037
ney-0.036
ns-0.036
Token makes
Token ID1838
Feature activation-9.71e-02

Pos logit contributions
ig+0.031
iper+0.030
aj+0.029
 fort+0.027
ox+0.027
Neg logit contributions
 going-0.051
't-0.050
 pointing-0.049
all-0.049
 him-0.049
Token me
Token ID502
Feature activation+6.01e-02

Pos logit contributions
 pointing+0.139
 returning+0.135
 going+0.134
all+0.132
 Lib+0.132
Neg logit contributions
ches-0.171
ig-0.168
iper-0.166
 fort-0.163
 web-0.162
Token want
Token ID765
Feature activation-1.43e-01

Pos logit contributions
ig+0.103
ches+0.097
aj+0.095
iper+0.095
 fort+0.094
Neg logit contributions
 going-0.093
 pointing-0.092
all-0.090
 returning-0.090
't-0.088
Token to
Token ID284
Feature activation-2.75e-02

Pos logit contributions
 going+0.135
 pointing+0.129
all+0.126
 returning+0.125
't+0.123
Neg logit contributions
ig-0.335
iper-0.320
ches-0.319
aj-0.318
 fort-0.314
Token spit
Token ID27591
Feature activation-2.03e-01

Pos logit contributions
 pointing+0.041
 going+0.040
 returning+0.040
all+0.039
Welcome+0.039
Neg logit contributions
ches-0.044
ig-0.044
 web-0.043
 fort-0.043
iper-0.042
Token."
Token ID526
Feature activation-5.03e-02

Pos logit contributions
 pointing+0.207
 going+0.205
 returning+0.203
all+0.200
 buying+0.195
Neg logit contributions
ches-0.455
ig-0.451
iper-0.434
 fort-0.423
aj-0.421
Token
Token ID198
Feature activation+4.84e-03

Pos logit contributions
 going+0.053
 returning+0.049
 pointing+0.048
all+0.048
 buying+0.046
Neg logit contributions
ig-0.112
aj-0.109
iper-0.107
ches-0.107
 fort-0.105
Token
Token ID198
Feature activation+5.81e-03

Pos logit contributions
ig+0.011
aj+0.011
ches+0.011
iper+0.011
 fort+0.010
Neg logit contributions
 going-0.005
 pointing-0.005
 returning-0.004
all-0.004
 buying-0.004
Token"
Token ID1
Feature activation-2.22e-02

Pos logit contributions
ig+0.012
iper+0.012
ches+0.012
 fort+0.012
ets+0.012
Neg logit contributions
't-0.006
 pointing-0.006
all-0.006
 him-0.006
 returning-0.006
TokenMe
Token ID5308
Feature activation+4.17e-02

Pos logit contributions
 going+0.029
 returning+0.028
 buying+0.027
all+0.026
 pointing+0.026
Neg logit contributions
ig-0.042
ches-0.041
 web-0.041
ox-0.041
aj-0.041
Token She
Token ID1375
Feature activation+9.67e-02

Pos logit contributions
 going+0.055
 returning+0.052
all+0.051
 pointing+0.049
no+0.048
Neg logit contributions
aj-0.067
ig-0.065
 web-0.064
ox-0.064
iper-0.064
Token made
Token ID925
Feature activation-9.43e-02

Pos logit contributions
ig+0.193
 fort+0.181
iper+0.179
aj+0.175
ox+0.167
Neg logit contributions
 going-0.137
 pointing-0.134
 him-0.132
't-0.129
 it-0.126
Token a
Token ID257
Feature activation+7.97e-02

Pos logit contributions
 returning+0.101
 going+0.099
 pointing+0.098
all+0.096
Welcome+0.095
Neg logit contributions
ig-0.199
ches-0.195
iper-0.192
aj-0.191
 web-0.190
Token face
Token ID1986
Feature activation-5.69e-02

Pos logit contributions
ig+0.137
iper+0.134
ches+0.129
aj+0.127
ets+0.121
Neg logit contributions
 pointing-0.126
 going-0.125
 returning-0.119
't-0.119
 him-0.117
Token and
Token ID290
Feature activation+3.11e-02

Pos logit contributions
 Joey+0.094
Welcome+0.093
 Crab+0.091
 Mike+0.091
 Lib+0.090
Neg logit contributions
ig-0.088
ches-0.085
 web-0.084
house-0.083
c-0.081
Token spat
Token ID15246
Feature activation-2.03e-01

Pos logit contributions
ig+0.059
iper+0.055
ches+0.054
aj+0.054
 fort+0.053
Neg logit contributions
 going-0.046
 pointing-0.045
 him-0.043
 returning-0.043
 buying-0.042
Token it
Token ID340
Feature activation+6.30e-02

Pos logit contributions
 returning+0.174
Welcome+0.170
 buying+0.169
 going+0.161
 rep+0.161
Neg logit contributions
ig-0.508
ches-0.505
iper-0.466
 web-0.461
ox-0.460
Token out
Token ID503
Feature activation-8.83e-03

Pos logit contributions
ches+0.115
ig+0.109
 fort+0.105
 web+0.101
ets+0.100
Neg logit contributions
 rep-0.091
 Mike-0.091
 Lib-0.091
 buying-0.091
 Crab-0.090
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
 Crab+0.013
all+0.013
Welcome+0.012
 Lib+0.012
't+0.012
Neg logit contributions
 web-0.015
ig-0.015
 fort-0.015
ches-0.014
aj-0.013
Token
Token ID198
Feature activation+6.67e-03

Pos logit contributions
 going+0.061
 returning+0.059
all+0.058
 buying+0.054
 happen+0.054
Neg logit contributions
aj-0.061
uffy-0.058
ox-0.057
 web-0.057
ig-0.057
Token
Token ID198
Feature activation+6.89e-03

Pos logit contributions
ig+0.013
aj+0.013
ches+0.012
ox+0.012
iper+0.012
Neg logit contributions
 going-0.009
 pointing-0.008
 returning-0.008
 buying-0.008
all-0.008
Token like
Token ID588
Feature activation-6.95e-02

Pos logit contributions
 going+0.033
 pointing+0.033
 returning+0.032
all+0.032
 buying+0.031
Neg logit contributions
ig-0.058
ches-0.057
iper-0.056
 fort-0.055
aj-0.055
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
 going+0.069
 pointing+0.069
all+0.068
 returning+0.067
't+0.065
Neg logit contributions
ig-0.153
ches-0.149
iper-0.147
 fort-0.146
aj-0.144
Token noise
Token ID7838
Feature activation-6.78e-02

Pos logit contributions
ig+0.254
 fort+0.239
ches+0.239
aj+0.237
iper+0.237
Neg logit contributions
 going-0.224
all-0.224
 pointing-0.220
 returning-0.219
't-0.217
Token.
Token ID13
Feature activation-4.44e-02

Pos logit contributions
all+0.081
 pointing+0.080
 returning+0.079
 going+0.078
 rep+0.077
Neg logit contributions
ig-0.133
ches-0.132
 fort-0.125
ets-0.124
 web-0.121
Token It
Token ID632
Feature activation+5.26e-02

Pos logit contributions
 going+0.051
 pointing+0.048
 returning+0.048
all+0.047
 buying+0.045
Neg logit contributions
ig-0.094
aj-0.090
ches-0.090
iper-0.089
 fort-0.088
Token spat
Token ID15246
Feature activation-2.02e-01

Pos logit contributions
ig+0.102
iper+0.100
aj+0.096
ox+0.091
 fort+0.089
Neg logit contributions
 going-0.079
 pointing-0.076
 him-0.074
't-0.074
all-0.070
Token out
Token ID503
Feature activation-9.10e-03

Pos logit contributions
 buying+0.218
 returning+0.218
 pointing+0.216
 going+0.216
all+0.207
Neg logit contributions
ches-0.450
ig-0.444
iper-0.421
 fort-0.412
 web-0.412
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
 going+0.010
 pointing+0.010
all+0.010
 returning+0.010
 him+0.009
Neg logit contributions
ig-0.020
iper-0.019
ches-0.019
aj-0.019
 fort-0.018
Token float
Token ID12178
Feature activation-5.04e-02

Pos logit contributions
ig+0.271
ches+0.256
 fort+0.253
aj+0.253
iper+0.252
Neg logit contributions
 going-0.249
all-0.247
 pointing-0.243
 returning-0.242
't-0.239
Tokeny
Token ID88
Feature activation+1.19e-01

Pos logit contributions
 going+0.063
 pointing+0.061
all+0.060
 returning+0.060
't+0.059
Neg logit contributions
ig-0.102
iper-0.097
ches-0.097
aj-0.096
 fort-0.095
Token and
Token ID290
Feature activation+1.99e-02

Pos logit contributions
ig+0.208
iper+0.197
ches+0.195
aj+0.195
 fort+0.191
Neg logit contributions
 going-0.182
 pointing-0.178
all-0.175
 returning-0.174
't-0.172
Token the
Token ID262
Feature activation+1.38e-01

Pos logit contributions
ig+0.001
iper+0.001
ches+0.001
aj+0.001
 fort+0.001
Neg logit contributions
 going-0.000
 pointing-0.000
all-0.000
 returning-0.000
't-0.000
Token coffee
Token ID6891
Feature activation-2.50e-02

Pos logit contributions
ig+0.225
ches+0.215
iper+0.212
aj+0.209
ox+0.204
Neg logit contributions
 going-0.232
 pointing-0.222
 returning-0.217
 buying-0.213
 him-0.213
Token down
Token ID866
Feature activation-2.95e-02

Pos logit contributions
 going+0.037
 pointing+0.036
all+0.036
 returning+0.036
't+0.035
Neg logit contributions
ig-0.044
ches-0.042
aj-0.042
iper-0.041
 fort-0.041
Token.
Token ID13
Feature activation-4.91e-02

Pos logit contributions
 going+0.035
 pointing+0.035
all+0.034
 returning+0.034
't+0.033
Neg logit contributions
ig-0.060
ches-0.057
iper-0.057
aj-0.057
 fort-0.057
Token He
Token ID679
Feature activation+8.18e-02

Pos logit contributions
 going+0.065
 returning+0.061
 pointing+0.059
all+0.059
 buying+0.057
Neg logit contributions
ig-0.092
aj-0.092
iper-0.090
ches-0.088
 fort-0.086
Token spat
Token ID15246
Feature activation-2.02e-01

Pos logit contributions
ig+0.149
iper+0.136
 fort+0.135
aj+0.135
ox+0.133
Neg logit contributions
 going-0.130
 pointing-0.124
 him-0.124
't-0.118
 Mike-0.117
Token it
Token ID340
Feature activation+6.02e-02

Pos logit contributions
 returning+0.170
 buying+0.167
Welcome+0.166
 going+0.166
 pointing+0.163
Neg logit contributions
ches-0.502
ig-0.499
iper-0.467
 web-0.451
ets-0.451
Token out
Token ID503
Feature activation-1.06e-02

Pos logit contributions
ches+0.120
ig+0.118
 fort+0.110
ets+0.108
iper+0.108
Neg logit contributions
 pointing-0.074
 buying-0.074
 returning-0.074
all-0.073
Welcome-0.072
Token,
Token ID11
Feature activation-2.29e-02

Pos logit contributions
Welcome+0.015
't+0.015
all+0.015
 going+0.015
 Lib+0.014
Neg logit contributions
ig-0.018
ches-0.018
 fort-0.018
 web-0.017
aj-0.017
Token then
Token ID788
Feature activation-2.99e-02

Pos logit contributions
 going+0.033
all+0.032
 pointing+0.032
 returning+0.032
't+0.032
Neg logit contributions
ig-0.041
ches-0.039
iper-0.039
aj-0.039
 fort-0.038
Token ran
Token ID4966
Feature activation-9.83e-02

Pos logit contributions
 going+0.036
all+0.035
 pointing+0.035
 returning+0.035
't+0.034
Neg logit contributions
ig-0.061
ches-0.058
iper-0.058
aj-0.057
 fort-0.056
Token big
Token ID1263
Feature activation+2.88e-02

Pos logit contributions
 going+0.126
 pointing+0.120
all+0.119
 returning+0.118
 him+0.117
Neg logit contributions
ig-0.266
aj-0.254
iper-0.252
ches-0.252
 fort-0.249
Token surprises
Token ID24072
Feature activation-9.73e-02

Pos logit contributions
ig+0.051
iper+0.048
ches+0.048
aj+0.048
 fort+0.047
Neg logit contributions
 going-0.043
 pointing-0.042
all-0.041
 returning-0.041
't-0.041
Token if
Token ID611
Feature activation-7.31e-02

Pos logit contributions
 pointing+0.109
 going+0.105
't+0.102
 returning+0.102
all+0.101
Neg logit contributions
ig-0.207
ches-0.200
 fort-0.196
iper-0.196
aj-0.192
Token it
Token ID340
Feature activation+4.34e-02

Pos logit contributions
all+0.065
 pointing+0.064
 going+0.064
 returning+0.061
't+0.060
Neg logit contributions
ig-0.168
iper-0.162
ches-0.160
 fort-0.158
aj-0.157
Token was
Token ID373
Feature activation-1.85e-02

Pos logit contributions
ig+0.070
aj+0.066
iper+0.065
 fort+0.063
ox+0.062
Neg logit contributions
 going-0.076
 pointing-0.071
all-0.071
't-0.071
 him-0.071
Token wished
Token ID16555
Feature activation-2.01e-01

Pos logit contributions
all+0.026
 pointing+0.026
't+0.026
 going+0.025
 returning+0.025
Neg logit contributions
ig-0.032
ches-0.031
iper-0.031
aj-0.030
 fort-0.030
Token on
Token ID319
Feature activation-3.41e-02

Pos logit contributions
 going+0.247
 pointing+0.247
all+0.239
 returning+0.238
't+0.234
Neg logit contributions
ig-0.403
ches-0.384
iper-0.383
aj-0.378
 fort-0.374
Token with
Token ID351
Feature activation-5.75e-02

Pos logit contributions
 going+0.034
 pointing+0.034
all+0.033
 returning+0.032
't+0.032
Neg logit contributions
ig-0.075
iper-0.072
ches-0.072
aj-0.071
 fort-0.071
Token a
Token ID257
Feature activation+8.01e-02

Pos logit contributions
 going+0.055
all+0.055
 pointing+0.055
't+0.052
 returning+0.052
Neg logit contributions
ig-0.128
iper-0.125
ches-0.123
 web-0.121
ox-0.120
Token pure
Token ID5899
Feature activation+6.03e-03

Pos logit contributions
ig+0.129
ches+0.119
iper+0.119
 fort+0.119
aj+0.118
Neg logit contributions
 going-0.132
all-0.131
 pointing-0.130
't-0.127
 returning-0.127
Token heart
Token ID2612
Feature activation-4.99e-02

Pos logit contributions
ig+0.011
iper+0.011
ches+0.010
aj+0.010
 fort+0.010
Neg logit contributions
 going-0.009
 pointing-0.008
all-0.008
 returning-0.008
't-0.008
Token a
Token ID257
Feature activation+7.79e-02

Pos logit contributions
 going+0.004
all+0.004
 pointing+0.004
 returning+0.004
't+0.004
Neg logit contributions
ig-0.006
aj-0.006
ches-0.006
iper-0.006
 fort-0.006
Token bad
Token ID2089
Feature activation-1.70e-02

Pos logit contributions
ig+0.133
iper+0.126
ches+0.126
aj+0.125
ox+0.120
Neg logit contributions
 going-0.124
 pointing-0.121
 returning-0.118
't-0.115
all-0.115
Token cook
Token ID4255
Feature activation-1.13e-01

Pos logit contributions
all+0.030
 going+0.030
 pointing+0.029
't+0.029
 returning+0.029
Neg logit contributions
ig-0.025
 fort-0.024
iper-0.023
 web-0.023
ches-0.022
Token."
Token ID526
Feature activation-4.60e-02

Pos logit contributions
 pointing+0.129
 going+0.128
all+0.126
 returning+0.125
't+0.121
Neg logit contributions
ig-0.235
ches-0.224
 fort-0.224
iper-0.221
aj-0.219
Token He
Token ID679
Feature activation+8.33e-02

Pos logit contributions
 going+0.054
 returning+0.050
all+0.050
 pointing+0.050
 buying+0.047
Neg logit contributions
ig-0.095
aj-0.092
iper-0.092
ches-0.091
 fort-0.089
Token spat
Token ID15246
Feature activation-2.01e-01

Pos logit contributions
ig+0.163
iper+0.145
ox+0.144
aj+0.144
 fort+0.143
Neg logit contributions
 going-0.125
 him-0.120
 pointing-0.119
 rep-0.113
't-0.112
Token out
Token ID503
Feature activation-1.21e-02

Pos logit contributions
 returning+0.197
 going+0.195
 buying+0.193
 pointing+0.192
Welcome+0.191
Neg logit contributions
ig-0.464
ches-0.461
iper-0.437
 fort-0.430
 web-0.425
Token the
Token ID262
Feature activation+1.54e-01

Pos logit contributions
 going+0.011
 pointing+0.011
 him+0.011
all+0.010
 returning+0.010
Neg logit contributions
ig-0.028
aj-0.027
iper-0.027
ches-0.027
 fort-0.026
Token cake
Token ID12187
Feature activation-1.34e-02

Pos logit contributions
ig+0.246
ches+0.233
 fort+0.233
aj+0.230
iper+0.230
Neg logit contributions
all-0.251
 going-0.249
 returning-0.244
 pointing-0.244
't-0.241
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
Welcome+0.021
 returning+0.019
 Hipp+0.019
all+0.019
 going+0.019
Neg logit contributions
ig-0.023
ches-0.023
 fort-0.022
ows-0.021
 web-0.021
Token
Token ID198
Feature activation+5.81e-03

Pos logit contributions
 going+0.052
 returning+0.048
all+0.047
 buying+0.046
 pointing+0.045
Neg logit contributions
aj-0.065
ig-0.064
ches-0.062
 fort-0.062
iper-0.062
Token He
Token ID679
Feature activation+8.10e-02

Pos logit contributions
 going+0.058
 returning+0.054
 pointing+0.054
all+0.052
 buying+0.051
Neg logit contributions
aj-0.082
ig-0.081
 fort-0.077
iper-0.077
uffy-0.077
Token made
Token ID925
Feature activation-8.87e-02

Pos logit contributions
ig+0.184
 fort+0.172
aj+0.170
iper+0.169
ables+0.168
Neg logit contributions
 him-0.092
 going-0.090
 it-0.085
 pointing-0.081
't-0.078
Token a
Token ID257
Feature activation+7.89e-02

Pos logit contributions
 going+0.083
 pointing+0.082
 returning+0.082
all+0.079
't+0.078
Neg logit contributions
ig-0.202
ches-0.197
iper-0.194
aj-0.194
 fort-0.192
Token face
Token ID1986
Feature activation-5.36e-02

Pos logit contributions
ig+0.139
iper+0.138
ches+0.131
ets+0.129
aj+0.127
Neg logit contributions
 going-0.120
 pointing-0.117
 returning-0.112
't-0.112
 him-0.111
Token and
Token ID290
Feature activation+2.86e-02

Pos logit contributions
 going+0.075
all+0.073
 pointing+0.073
 returning+0.072
't+0.072
Neg logit contributions
ig-0.099
ches-0.094
aj-0.092
iper-0.091
 fort-0.091
Token spat
Token ID15246
Feature activation-2.00e-01

Pos logit contributions
ig+0.058
iper+0.053
ches+0.053
aj+0.052
uffy+0.052
Neg logit contributions
 going-0.040
 pointing-0.040
 him-0.038
 returning-0.037
 rep-0.036
Token it
Token ID340
Feature activation+6.43e-02

Pos logit contributions
Welcome+0.211
 buying+0.206
 returning+0.204
 rep+0.195
 pointing+0.192
Neg logit contributions
ches-0.469
ig-0.458
iper-0.420
 fort-0.410
ows-0.407
Token out
Token ID503
Feature activation-5.88e-03

Pos logit contributions
ches+0.130
ig+0.126
 fort+0.123
ets+0.117
iper+0.116
Neg logit contributions
 pointing-0.080
 returning-0.078
 buying-0.078
all-0.077
 Crab-0.077
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
't+0.007
 pointing+0.007
all+0.007
 going+0.007
 Crab+0.007
Neg logit contributions
 fort-0.011
ig-0.011
ches-0.011
aj-0.011
 web-0.011
Token
Token ID198
Feature activation+5.43e-03

Pos logit contributions
 going+0.065
 returning+0.062
all+0.059
 pointing+0.059
 buying+0.059
Neg logit contributions
aj-0.066
ig-0.061
 fort-0.060
uffy-0.060
uel-0.059
Token
Token ID198
Feature activation+6.59e-03

Pos logit contributions
ig+0.012
ches+0.011
aj+0.011
iper+0.011
ox+0.011
Neg logit contributions
 going-0.006
 pointing-0.006
 returning-0.006
 buying-0.005
all-0.005
Token some
Token ID617
Feature activation+2.46e-02

Pos logit contributions
ig+0.021
iper+0.020
aj+0.020
ches+0.019
ox+0.019
Neg logit contributions
 going-0.010
 him-0.010
 pointing-0.009
 returning-0.008
all-0.008
Token things
Token ID1243
Feature activation-3.15e-02

Pos logit contributions
ig+0.048
aj+0.046
iper+0.046
ches+0.044
ox+0.044
Neg logit contributions
 going-0.037
 him-0.034
 returning-0.033
 pointing-0.033
 buying-0.032
Token that
Token ID326
Feature activation+3.04e-03

Pos logit contributions
 going+0.034
 pointing+0.033
all+0.033
 returning+0.032
't+0.032
Neg logit contributions
ig-0.068
ches-0.065
iper-0.065
aj-0.064
 fort-0.064
Token cannot
Token ID2314
Feature activation-8.18e-02

Pos logit contributions
ig+0.006
iper+0.005
aj+0.005
ches+0.005
 fort+0.005
Neg logit contributions
 going-0.004
 pointing-0.004
all-0.004
 returning-0.004
't-0.004
Token be
Token ID307
Feature activation+1.85e-03

Pos logit contributions
 pointing+0.134
all+0.133
 buying+0.130
 going+0.130
 returning+0.129
Neg logit contributions
ig-0.123
ches-0.122
 fort-0.121
 web-0.117
iper-0.113
Token wished
Token ID16555
Feature activation-2.00e-01

Pos logit contributions
ig+0.004
iper+0.004
aj+0.003
ches+0.003
 fort+0.003
Neg logit contributions
 going-0.003
 pointing-0.002
 returning-0.002
 him-0.002
all-0.002
Token for
Token ID329
Feature activation-3.82e-02

Pos logit contributions
 going+0.211
 pointing+0.197
all+0.196
 him+0.193
 returning+0.193
Neg logit contributions
ig-0.452
iper-0.431
aj-0.430
ches-0.427
 fort-0.421
Token and
Token ID290
Feature activation+2.62e-02

Pos logit contributions
all+0.022
 pointing+0.021
 going+0.021
't+0.020
 returning+0.020
Neg logit contributions
ig-0.100
iper-0.097
ches-0.096
 fort-0.096
ox-0.094
Token he
Token ID339
Feature activation+7.55e-02

Pos logit contributions
ig+0.053
 fort+0.051
iper+0.051
ches+0.051
 web+0.050
Neg logit contributions
all-0.029
 pointing-0.028
't-0.028
 going-0.027
 returning-0.027
Token had
Token ID550
Feature activation-2.48e-02

Pos logit contributions
aj+0.158
ig+0.148
 fort+0.146
iper+0.144
eh+0.144
Neg logit contributions
 going-0.103
't-0.098
 him-0.097
 it-0.091
all-0.090
Token learned
Token ID4499
Feature activation-1.06e-01

Pos logit contributions
 going+0.026
 pointing+0.023
 returning+0.023
 him+0.023
all+0.023
Neg logit contributions
ig-0.059
aj-0.056
iper-0.056
ches-0.056
 fort-0.055
Token that
Token ID326
Feature activation-2.77e-04

Pos logit contributions
all+0.109
 pointing+0.108
 going+0.108
't+0.105
 returning+0.103
Neg logit contributions
ig-0.218
ches-0.211
aj-0.210
 fort-0.210
 web-0.207
Token sharing
Token ID7373
Feature activation-1.26e-01

Pos logit contributions
all+0.000
't+0.000
 Lib+0.000
 returning+0.000
 Joey+0.000
Neg logit contributions
ig-0.001
 fort-0.001
 web-0.001
ox-0.001
ches-0.001
Token is
Token ID318
Feature activation-1.64e-04

Pos logit contributions
 going+0.224
 pointing+0.218
 him+0.216
 returning+0.214
't+0.213
Neg logit contributions
ig-0.195
iper-0.187
aj-0.183
ches-0.176
eh-0.173
Token better
Token ID1365
Feature activation-6.38e-02

Pos logit contributions
all+0.000
 pointing+0.000
't+0.000
 going+0.000
 Mike+0.000
Neg logit contributions
ig-0.000
ches-0.000
iper-0.000
 fort-0.000
aj-0.000
Token than
Token ID621
Feature activation-5.14e-02

Pos logit contributions
 going+0.088
 pointing+0.086
all+0.085
 returning+0.084
't+0.083
Neg logit contributions
ig-0.122
ches-0.114
iper-0.114
aj-0.113
 fort-0.112
Token competing
Token ID11780
Feature activation-2.00e-01

Pos logit contributions
all+0.050
 Lib+0.049
Welcome+0.048
't+0.047
 pointing+0.046
Neg logit contributions
ig-0.105
ets-0.103
 web-0.103
ches-0.102
aj-0.101
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
 Abby+0.244
 Joey+0.241
all+0.239
 Lib+0.239
 returning+0.238
Neg logit contributions
ches-0.383
ig-0.372
ets-0.368
ables-0.355
ows-0.354
Token They
Token ID1119
Feature activation+5.65e-02

Pos logit contributions
 going+0.052
all+0.048
 returning+0.048
 pointing+0.048
 buying+0.045
Neg logit contributions
aj-0.078
ig-0.078
iper-0.076
ches-0.075
ox-0.075
Token learn
Token ID2193
Feature activation-1.02e-01

Pos logit contributions
ig+0.112
aj+0.108
iper+0.106
ches+0.105
 fort+0.103
Neg logit contributions
 going-0.076
't-0.071
 pointing-0.071
all-0.070
 him-0.070
Token that
Token ID326
Feature activation+1.64e-03

Pos logit contributions
all+0.117
 pointing+0.116
 going+0.115
't+0.114
 Joey+0.111
Neg logit contributions
ig-0.202
 fort-0.195
ches-0.195
aj-0.195
 web-0.193
Token purple
Token ID14032
Feature activation+6.35e-04

Pos logit contributions
ig+0.003
 fort+0.003
 web+0.003
ches+0.003
iper+0.003
Neg logit contributions
all-0.002
't-0.002
 Lib-0.002
 Abby-0.002
 Joey-0.002
Token.
Token ID13
Feature activation-3.62e-02

Pos logit contributions
't+0.025
 pointing+0.025
 returning+0.024
all+0.024
Welcome+0.024
Neg logit contributions
ig-0.026
 fort-0.026
ets-0.025
ches-0.025
 web-0.024
Token But
Token ID887
Feature activation-1.44e-02

Pos logit contributions
 going+0.046
 returning+0.043
 pointing+0.042
all+0.042
 buying+0.041
Neg logit contributions
ig-0.071
aj-0.070
iper-0.069
ox-0.068
 web-0.068
Token she
Token ID673
Feature activation+8.38e-02

Pos logit contributions
 going+0.015
 pointing+0.014
all+0.014
 returning+0.014
't+0.013
Neg logit contributions
ig-0.033
iper-0.031
ches-0.031
aj-0.031
 fort-0.031
Token did
Token ID750
Feature activation-1.04e-01

Pos logit contributions
ig+0.161
iper+0.158
 fort+0.157
aj+0.154
ables+0.154
Neg logit contributions
 going-0.112
 pointing-0.112
 it-0.112
 him-0.110
all-0.108
Token not
Token ID407
Feature activation-2.79e-02

Pos logit contributions
Welcome+0.149
all+0.143
 returning+0.143
 pointing+0.143
 going+0.141
Neg logit contributions
ig-0.184
ches-0.178
aj-0.172
 web-0.171
 fort-0.169
Token spit
Token ID27591
Feature activation-2.00e-01

Pos logit contributions
 returning+0.039
 pointing+0.038
 going+0.038
all+0.038
 buying+0.037
Neg logit contributions
ig-0.050
ches-0.049
 web-0.048
iper-0.047
ox-0.047
Token it
Token ID340
Feature activation+5.88e-02

Pos logit contributions
 pointing+0.180
 returning+0.179
 going+0.178
 buying+0.175
all+0.173
Neg logit contributions
ig-0.475
ches-0.469
iper-0.448
 fort-0.439
 web-0.436
Token out
Token ID503
Feature activation-8.39e-03

Pos logit contributions
ches+0.112
ig+0.105
 fort+0.101
ets+0.098
ows+0.096
Neg logit contributions
 Crab-0.083
 buying-0.083
 rep-0.082
 pointing-0.081
 Hipp-0.081
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
't+0.010
all+0.010
 Crab+0.010
 pointing+0.010
 going+0.010
Neg logit contributions
ig-0.016
 fort-0.016
ches-0.016
 web-0.016
aj-0.015
Token She
Token ID1375
Feature activation+9.85e-02

Pos logit contributions
 going+0.054
 returning+0.050
all+0.049
 pointing+0.049
 buying+0.047
Neg logit contributions
aj-0.073
ig-0.072
iper-0.070
ox-0.070
 web-0.070
Token said
Token ID531
Feature activation-4.90e-02

Pos logit contributions
ig+0.198
iper+0.191
 fort+0.188
aj+0.187
ables+0.186
Neg logit contributions
 going-0.129
 him-0.128
 it-0.128
 pointing-0.127
't-0.119
Token hard
Token ID1327
Feature activation-3.67e-02

Pos logit contributions
 going+0.011
 pointing+0.011
all+0.011
 returning+0.011
't+0.011
Neg logit contributions
ig-0.021
ches-0.020
iper-0.020
aj-0.020
 fort-0.020
Token and
Token ID290
Feature activation+2.45e-02

Pos logit contributions
Welcome+0.054
 Crab+0.052
 Hipp+0.051
 Abby+0.051
 Cr+0.051
Neg logit contributions
ig-0.066
ches-0.065
ets-0.065
 fort-0.064
aj-0.062
Token plastic
Token ID7309
Feature activation-2.20e-02

Pos logit contributions
ig+0.047
iper+0.044
aj+0.044
ches+0.043
ox+0.043
Neg logit contributions
 going-0.035
 pointing-0.034
 returning-0.033
all-0.033
 him-0.033
Token.
Token ID13
Feature activation-4.39e-02

Pos logit contributions
 going+0.022
 pointing+0.021
all+0.021
 returning+0.021
't+0.021
Neg logit contributions
ig-0.050
ches-0.047
iper-0.047
aj-0.047
 fort-0.047
Token It
Token ID632
Feature activation+5.09e-02

Pos logit contributions
 going+0.052
 pointing+0.050
all+0.049
 returning+0.049
't+0.047
Neg logit contributions
ig-0.092
iper-0.088
ches-0.088
aj-0.088
 fort-0.086
Token spat
Token ID15246
Feature activation-1.99e-01

Pos logit contributions
ig+0.098
iper+0.098
aj+0.096
ox+0.091
ets+0.089
Neg logit contributions
 going-0.072
 pointing-0.071
 him-0.070
't-0.070
 it-0.067
Token it
Token ID340
Feature activation+6.62e-02

Pos logit contributions
 returning+0.199
 pointing+0.199
 going+0.199
 buying+0.198
all+0.188
Neg logit contributions
ches-0.457
ig-0.450
iper-0.429
 fort-0.415
 web-0.414
Token out
Token ID503
Feature activation-8.22e-03

Pos logit contributions
ches+0.126
ig+0.120
 fort+0.117
ets+0.111
iper+0.110
Neg logit contributions
 buying-0.089
 pointing-0.089
 Lib-0.089
 rep-0.088
all-0.087
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
't+0.012
 going+0.012
all+0.012
Welcome+0.011
 pointing+0.011
Neg logit contributions
ig-0.014
ches-0.014
 fort-0.014
 web-0.014
iper-0.013
Token made
Token ID925
Feature activation-9.99e-02

Pos logit contributions
ig+0.033
ches+0.031
iper+0.031
aj+0.031
 fort+0.030
Neg logit contributions
 going-0.027
 pointing-0.027
all-0.026
 returning-0.026
't-0.026
Token a
Token ID257
Feature activation+7.59e-02

Pos logit contributions
 going+0.093
 pointing+0.093
 returning+0.092
all+0.092
 buying+0.089
Neg logit contributions
ig-0.225
ches-0.220
iper-0.219
 fort-0.218
aj-0.214
Token.
Token ID13
Feature activation-4.51e-02

Pos logit contributions
 pointing+0.074
 going+0.073
 returning+0.073
all+0.073
Welcome+0.071
Neg logit contributions
ig-0.123
ches-0.120
aj-0.116
 fort-0.116
iper-0.115
Token They
Token ID1119
Feature activation+5.20e-02

Pos logit contributions
 going+0.065
 returning+0.062
all+0.062
 pointing+0.058
 buying+0.057
Neg logit contributions
aj-0.080
 fort-0.079
ig-0.078
ches-0.077
iper-0.077
Token make
Token ID787
Feature activation-8.60e-02

Pos logit contributions
ig+0.107
aj+0.102
iper+0.101
eh+0.097
ches+0.097
Neg logit contributions
 going-0.068
 him-0.065
't-0.063
 pointing-0.062
 rep-0.059
Token bubbles
Token ID25037
Feature activation-6.32e-02

Pos logit contributions
 going+0.098
 pointing+0.095
all+0.093
 returning+0.092
't+0.091
Neg logit contributions
ig-0.185
iper-0.176
ches-0.175
aj-0.174
 fort-0.172
Token and
Token ID290
Feature activation+2.87e-02

Pos logit contributions
 going+0.083
 pointing+0.082
all+0.081
 returning+0.081
't+0.079
Neg logit contributions
ig-0.123
ches-0.117
 fort-0.115
iper-0.114
aj-0.113
Token spit
Token ID27591
Feature activation-1.99e-01

Pos logit contributions
ig+0.046
iper+0.043
ches+0.043
aj+0.043
 fort+0.042
Neg logit contributions
 going-0.048
 pointing-0.047
 returning-0.046
all-0.046
 him-0.045
Token.
Token ID13
Feature activation-4.19e-02

Pos logit contributions
 returning+0.184
 going+0.183
 pointing+0.180
all+0.179
 buying+0.177
Neg logit contributions
ches-0.468
ig-0.464
iper-0.443
 fort-0.440
ets-0.434
Token They
Token ID1119
Feature activation+5.79e-02

Pos logit contributions
 going+0.063
 returning+0.061
all+0.060
 pointing+0.056
 buying+0.055
Neg logit contributions
aj-0.071
 fort-0.069
ig-0.068
uffy-0.068
ches-0.068
Token smile
Token ID8212
Feature activation-7.25e-02

Pos logit contributions
ig+0.116
aj+0.110
iper+0.110
ches+0.107
 fort+0.106
Neg logit contributions
 going-0.078
 pointing-0.074
 him-0.072
't-0.072
all-0.070
Token at
Token ID379
Feature activation-5.13e-02

Pos logit contributions
 Joey+0.131
 Crab+0.130
 Lib+0.130
 Abby+0.130
all+0.127
Neg logit contributions
 fort-0.097
ches-0.094
aj-0.094
ig-0.093
ets-0.093
Token each
Token ID1123
Feature activation+7.81e-02

Pos logit contributions
 going+0.051
 pointing+0.049
all+0.048
 returning+0.047
 him+0.047
Neg logit contributions
ig-0.119
iper-0.113
aj-0.113
ches-0.113
 fort-0.111
Token was
Token ID373
Feature activation+5.49e-03

Pos logit contributions
ig+0.156
iper+0.147
aj+0.146
 fort+0.145
ox+0.143
Neg logit contributions
 going-0.131
 pointing-0.125
 him-0.124
 Mike-0.118
all-0.118
Token so
Token ID523
Feature activation-2.48e-02

Pos logit contributions
ig+0.011
ches+0.010
aj+0.010
iper+0.010
 fort+0.010
Neg logit contributions
all-0.007
 going-0.007
't-0.007
 pointing-0.007
 returning-0.007
Token excited
Token ID6568
Feature activation-9.86e-02

Pos logit contributions
all+0.036
 returning+0.036
 Crab+0.036
Welcome+0.035
't+0.035
Neg logit contributions
aj-0.040
ig-0.039
eh-0.037
 web-0.037
ches-0.036
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
Welcome+0.117
 returning+0.115
 pointing+0.115
't+0.113
 going+0.110
Neg logit contributions
ig-0.188
ches-0.186
iper-0.184
 fort-0.181
ets-0.180
Token he
Token ID339
Feature activation+7.77e-02

Pos logit contributions
ig+0.038
iper+0.036
ches+0.036
aj+0.036
 fort+0.035
Neg logit contributions
all-0.025
 going-0.025
't-0.024
 pointing-0.024
 returning-0.024
Token gob
Token ID48484
Feature activation-1.99e-01

Pos logit contributions
ig+0.145
iper+0.136
aj+0.136
 fort+0.136
ox+0.133
Neg logit contributions
 going-0.114
 pointing-0.111
all-0.107
 him-0.106
't-0.105
Tokenbled
Token ID9342
Feature activation-4.87e-02

Pos logit contributions
Welcome+0.285
 buying+0.285
 going+0.283
 returning+0.277
 rep+0.274
Neg logit contributions
ig-0.385
ows-0.347
aj-0.347
ches-0.346
uddle-0.338
Token it
Token ID340
Feature activation+5.34e-02

Pos logit contributions
 returning+0.045
 pointing+0.044
 rep+0.044
 buying+0.043
't+0.042
Neg logit contributions
ig-0.113
ches-0.112
 fort-0.104
iper-0.103
ets-0.103
Token up
Token ID510
Feature activation-2.65e-02

Pos logit contributions
ig+0.106
ches+0.105
 fort+0.098
aj+0.097
iper+0.096
Neg logit contributions
 pointing-0.068
 returning-0.068
 buying-0.067
 going-0.067
 rep-0.066
Token quickly
Token ID2952
Feature activation-3.75e-02

Pos logit contributions
 pointing+0.030
't+0.030
 going+0.030
 returning+0.029
Welcome+0.029
Neg logit contributions
ig-0.054
aj-0.051
ches-0.051
 fort-0.051
iper-0.050
Token!
Token ID0
Feature activation-3.56e-02

Pos logit contributions
 going+0.044
 pointing+0.044
all+0.044
Welcome+0.044
't+0.044
Neg logit contributions
ig-0.075
ches-0.073
ets-0.071
aj-0.071
 fort-0.070
Token.
Token ID13
Feature activation-3.96e-02

Pos logit contributions
all+0.077
 pointing+0.077
't+0.074
 returning+0.073
Welcome+0.073
Neg logit contributions
ig-0.119
ches-0.117
 fort-0.115
ets-0.110
ows-0.109
Token He
Token ID679
Feature activation+8.90e-02

Pos logit contributions
 going+0.050
 pointing+0.048
 returning+0.047
all+0.047
't+0.045
Neg logit contributions
ig-0.080
aj-0.077
iper-0.075
ches-0.075
 fort-0.074
Token tasted
Token ID29187
Feature activation-8.33e-02

Pos logit contributions
ig+0.187
iper+0.182
 fort+0.171
aj+0.170
ables+0.170
Neg logit contributions
 it-0.116
 going-0.114
 him-0.114
 pointing-0.109
 Mike-0.102
Token it
Token ID340
Feature activation+7.49e-02

Pos logit contributions
all+0.076
't+0.075
Welcome+0.075
 rep+0.075
 pointing+0.073
Neg logit contributions
ig-0.189
ches-0.179
 fort-0.174
 web-0.173
aj-0.171
Token and
Token ID290
Feature activation+3.06e-02

Pos logit contributions
ig+0.135
ches+0.133
 fort+0.126
iper+0.122
ets+0.120
Neg logit contributions
 pointing-0.106
 returning-0.105
all-0.105
 buying-0.104
 going-0.103
Token spat
Token ID15246
Feature activation-1.99e-01

Pos logit contributions
ig+0.054
iper+0.051
aj+0.049
ches+0.049
 fort+0.049
Neg logit contributions
 going-0.050
 pointing-0.048
 him-0.047
 returning-0.047
 buying-0.045
Token it
Token ID340
Feature activation+6.68e-02

Pos logit contributions
 returning+0.182
Welcome+0.179
 buying+0.178
 rep+0.175
 pointing+0.174
Neg logit contributions
ig-0.482
ches-0.478
 fort-0.431
iper-0.431
ows-0.430
Token out
Token ID503
Feature activation-3.78e-03

Pos logit contributions
ches+0.125
ig+0.120
 fort+0.116
ets+0.110
ows+0.108
Neg logit contributions
 Lib-0.096
 rep-0.095
 pointing-0.095
 returning-0.095
 buying-0.094
Token.
Token ID13
Feature activation-3.79e-02

Pos logit contributions
't+0.006
 Lib+0.006
all+0.006
 Crab+0.006
Welcome+0.006
Neg logit contributions
 fort-0.006
ig-0.006
ches-0.006
 web-0.006
ows-0.005
Token He
Token ID679
Feature activation+8.97e-02

Pos logit contributions
 going+0.052
 returning+0.049
all+0.048
 pointing+0.048
 buying+0.045
Neg logit contributions
aj-0.071
ig-0.071
 fort-0.067
ches-0.066
ox-0.065
Token was
Token ID373
Feature activation+1.07e-02

Pos logit contributions
ig+0.186
iper+0.181
 fort+0.173
aj+0.170
ables+0.166
Neg logit contributions
 it-0.120
 him-0.118
 going-0.117
 pointing-0.115
't-0.109
Token wait
Token ID4043
Feature activation-1.33e-01

Pos logit contributions
 pointing+0.044
 returning+0.043
all+0.043
Welcome+0.042
 going+0.042
Neg logit contributions
ig-0.048
 web-0.046
ches-0.046
iper-0.045
 fort-0.045
Token too
Token ID1165
Feature activation-3.18e-02

Pos logit contributions
 going+0.143
all+0.143
 pointing+0.143
 returning+0.142
't+0.140
Neg logit contributions
ig-0.284
ches-0.271
 fort-0.269
aj-0.269
iper-0.269
Token long
Token ID890
Feature activation-1.45e-02

Pos logit contributions
all+0.048
 pointing+0.048
 buying+0.048
 returning+0.048
 going+0.047
Neg logit contributions
ig-0.054
ox-0.050
iper-0.050
ches-0.050
 web-0.049
Token so
Token ID523
Feature activation-1.87e-02

Pos logit contributions
Welcome+0.020
 pointing+0.019
 Crab+0.019
 Abby+0.019
 returning+0.019
Neg logit contributions
ches-0.026
ets-0.025
ig-0.025
iper-0.024
ows-0.023
Token it
Token ID340
Feature activation+6.08e-02

Pos logit contributions
all+0.017
 going+0.016
 buying+0.016
 pointing+0.015
 returning+0.015
Neg logit contributions
ig-0.042
 web-0.040
iper-0.040
 fort-0.040
ches-0.040
Token wished
Token ID16555
Feature activation-1.98e-01

Pos logit contributions
ig+0.108
aj+0.107
iper+0.105
ox+0.098
uffy+0.098
Neg logit contributions
 going-0.096
 pointing-0.089
 him-0.088
all-0.085
't-0.085
Token to
Token ID284
Feature activation-3.70e-02

Pos logit contributions
 going+0.245
 pointing+0.234
all+0.231
 returning+0.229
't+0.225
Neg logit contributions
ig-0.410
iper-0.391
aj-0.388
ches-0.388
 fort-0.380
Token grow
Token ID1663
Feature activation-1.08e-01

Pos logit contributions
 pointing+0.058
 going+0.058
all+0.058
 returning+0.057
 buying+0.056
Neg logit contributions
ig-0.060
iper-0.057
 fort-0.057
 web-0.057
ches-0.056
Token quickly
Token ID2952
Feature activation-4.11e-02

Pos logit contributions
 going+0.133
 pointing+0.129
all+0.127
 returning+0.126
't+0.124
Neg logit contributions
ig-0.222
iper-0.211
ches-0.210
aj-0.210
 fort-0.206
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
all+0.049
 pointing+0.048
 Crab+0.048
Welcome+0.047
't+0.047
Neg logit contributions
ig-0.082
ches-0.080
 fort-0.079
 web-0.078
ets-0.078
Token
Token ID198
Feature activation+5.53e-03

Pos logit contributions
 pointing+0.048
 going+0.048
't+0.048
all+0.047
 him+0.046
Neg logit contributions
ig-0.083
aj-0.077
ches-0.076
iper-0.075
 fort-0.074
Token found
Token ID1043
Feature activation-6.65e-02

Pos logit contributions
ig+0.167
aj+0.167
iper+0.160
uffy+0.158
 fort+0.155
Neg logit contributions
 going-0.134
 pointing-0.130
 him-0.128
 it-0.127
 returning-0.121
Token some
Token ID617
Feature activation+2.90e-02

Pos logit contributions
 going+0.064
 pointing+0.061
all+0.060
 returning+0.059
 him+0.058
Neg logit contributions
ig-0.156
aj-0.148
iper-0.148
ches-0.148
 fort-0.145
Token delicious
Token ID12625
Feature activation-1.27e-02

Pos logit contributions
ig+0.057
iper+0.053
aj+0.053
ches+0.053
ox+0.052
Neg logit contributions
 going-0.040
 pointing-0.038
 returning-0.038
 him-0.038
 buying-0.037
Token berries
Token ID36322
Feature activation-2.96e-02

Pos logit contributions
 going+0.019
all+0.019
 pointing+0.019
't+0.019
 returning+0.018
Neg logit contributions
ig-0.022
 fort-0.021
ches-0.021
iper-0.021
aj-0.021
Token and
Token ID290
Feature activation+1.80e-02

Pos logit contributions
 pointing+0.047
Welcome+0.046
 rep+0.045
all+0.045
't+0.044
Neg logit contributions
ig-0.048
ches-0.048
 fort-0.044
ets-0.044
ows-0.043
Token gob
Token ID48484
Feature activation-1.98e-01

Pos logit contributions
ig+0.033
aj+0.031
iper+0.031
ches+0.031
 fort+0.030
Neg logit contributions
 going-0.026
 pointing-0.025
 returning-0.025
all-0.025
 him-0.024
Tokenbled
Token ID9342
Feature activation-4.83e-02

Pos logit contributions
 going+0.276
 rep+0.272
 buying+0.270
 returning+0.267
Welcome+0.267
Neg logit contributions
ig-0.380
ches-0.364
ows-0.354
aj-0.345
iper-0.344
Token them
Token ID606
Feature activation+7.63e-02

Pos logit contributions
 pointing+0.038
 returning+0.038
 rep+0.037
't+0.037
 going+0.036
Neg logit contributions
ches-0.118
ig-0.118
iper-0.110
 fort-0.110
ets-0.109
Token up
Token ID510
Feature activation-2.23e-02

Pos logit contributions
ig+0.145
ches+0.143
 fort+0.136
aj+0.132
ows+0.132
Neg logit contributions
 pointing-0.102
 returning-0.101
 rep-0.099
 going-0.099
Welcome-0.099
Token.
Token ID13
Feature activation-3.96e-02

Pos logit contributions
't+0.029
 pointing+0.029
Welcome+0.028
otion+0.028
 returning+0.028
Neg logit contributions
ig-0.042
ches-0.040
 fort-0.039
aj-0.039
iper-0.038
Token
Token ID198
Feature activation+1.12e-03

Pos logit contributions
 going+0.050
 pointing+0.048
 returning+0.047
all+0.047
 buying+0.045
Neg logit contributions
ig-0.080
iper-0.076
ches-0.076
aj-0.076
 fort-0.075
Token big
Token ID1263
Feature activation+5.64e-02

Pos logit contributions
ig+0.154
iper+0.148
ches+0.147
aj+0.144
ox+0.141
Neg logit contributions
 going-0.134
 pointing-0.132
 returning-0.128
't-0.125
 buying-0.124
Token field
Token ID2214
Feature activation-4.08e-02

Pos logit contributions
ig+0.094
ches+0.093
iper+0.093
aj+0.092
ox+0.090
Neg logit contributions
 going-0.091
 pointing-0.088
 returning-0.087
 buying-0.084
 him-0.082
Token of
Token ID286
Feature activation-3.48e-03

Pos logit contributions
 going+0.047
 pointing+0.045
all+0.044
 returning+0.044
't+0.044
Neg logit contributions
ig-0.086
ches-0.082
iper-0.082
aj-0.081
 fort-0.081
Token strawberries
Token ID36973
Feature activation-2.18e-02

Pos logit contributions
all+0.006
't+0.006
 pointing+0.006
 returning+0.006
 going+0.006
Neg logit contributions
ig-0.005
 fort-0.005
iper-0.005
 web-0.005
ches-0.005
Token and
Token ID290
Feature activation+2.31e-02

Pos logit contributions
 pointing+0.033
Welcome+0.033
 rep+0.032
 returning+0.032
't+0.032
Neg logit contributions
ig-0.038
ches-0.035
 fort-0.034
ets-0.033
eh-0.032
Token gob
Token ID48484
Feature activation-1.98e-01

Pos logit contributions
ig+0.040
iper+0.038
ches+0.038
 fort+0.037
aj+0.037
Neg logit contributions
 going-0.034
all-0.034
 pointing-0.033
't-0.033
 returning-0.033
Tokenbled
Token ID9342
Feature activation-5.02e-02

Pos logit contributions
 buying+0.301
 going+0.300
 rep+0.299
 returning+0.298
Welcome+0.296
Neg logit contributions
ig-0.351
ows-0.334
ches-0.323
uddle-0.320
aj-0.320
Token them
Token ID606
Feature activation+7.55e-02

Pos logit contributions
 returning+0.044
 pointing+0.043
 rep+0.043
't+0.043
 going+0.042
Neg logit contributions
ches-0.119
ig-0.118
 fort-0.111
iper-0.110
ets-0.110
Token up
Token ID510
Feature activation-2.23e-02

Pos logit contributions
ig+0.135
ches+0.133
 fort+0.127
ows+0.124
aj+0.122
Neg logit contributions
 returning-0.108
 pointing-0.106
 rep-0.106
Welcome-0.104
 buying-0.104
Token!
Token ID0
Feature activation-3.40e-02

Pos logit contributions
't+0.031
otion+0.030
Welcome+0.030
 pointing+0.030
 returning+0.029
Neg logit contributions
ig-0.039
aj-0.037
ches-0.037
 fort-0.037
iper-0.035
Token They
Token ID1119
Feature activation+5.07e-02

Pos logit contributions
 going+0.043
 returning+0.040
 pointing+0.040
all+0.040
 buying+0.039
Neg logit contributions
ig-0.066
iper-0.065
aj-0.065
ches-0.064
 fort-0.063
Token tasted
Token ID29187
Feature activation-1.02e-01

Pos logit contributions
iper+0.104
ig+0.103
aj+0.101
ox+0.096
ets+0.092
Neg logit contributions
 going-0.072
 pointing-0.068
't-0.067
 him-0.067
all-0.065
Token bad
Token ID2089
Feature activation-2.36e-02

Pos logit contributions
 pointing+0.144
 going+0.138
 returning+0.138
all+0.137
't+0.135
Neg logit contributions
ig-0.186
ches-0.180
 fort-0.175
iper-0.173
aj-0.172
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 pointing+0.034
't+0.033
 going+0.033
 returning+0.033
all+0.032
Neg logit contributions
ig-0.043
 fort-0.041
ches-0.040
ets-0.040
iper-0.039
Token and
Token ID290
Feature activation+2.63e-02

Pos logit contributions
 going+0.027
 pointing+0.026
all+0.026
 returning+0.025
't+0.025
Neg logit contributions
ig-0.033
iper-0.031
ches-0.031
aj-0.031
 fort-0.030
Token he
Token ID339
Feature activation+7.99e-02

Pos logit contributions
ig+0.059
iper+0.057
ches+0.056
aj+0.056
 fort+0.055
Neg logit contributions
 going-0.028
 pointing-0.026
 returning-0.026
all-0.026
 him-0.025
Token spit
Token ID27591
Feature activation-1.97e-01

Pos logit contributions
ig+0.156
iper+0.147
aj+0.146
 fort+0.144
ox+0.140
Neg logit contributions
 going-0.115
 him-0.111
 it-0.107
 pointing-0.104
't-0.102
Token it
Token ID340
Feature activation+6.53e-02

Pos logit contributions
 going+0.129
 pointing+0.126
 returning+0.122
all+0.120
 buying+0.116
Neg logit contributions
ig-0.519
ches-0.508
iper-0.495
aj-0.486
 fort-0.485
Token out
Token ID503
Feature activation-8.63e-03

Pos logit contributions
ches+0.117
ig+0.112
 fort+0.106
ets+0.102
 web+0.099
Neg logit contributions
 buying-0.101
 rep-0.099
 pointing-0.098
 Mike-0.098
 returning-0.098
Token.
Token ID13
Feature activation-3.94e-02

Pos logit contributions
't+0.011
Welcome+0.011
 pointing+0.011
 going+0.011
 Lib+0.011
Neg logit contributions
ig-0.016
ches-0.016
 web-0.015
 fort-0.015
aj-0.015
Token He
Token ID679
Feature activation+8.65e-02

Pos logit contributions
 going+0.056
 returning+0.053
all+0.051
 pointing+0.051
 buying+0.049
Neg logit contributions
aj-0.072
ig-0.070
iper-0.067
ches-0.067
 fort-0.066
Token was
Token ID373
Feature activation+7.91e-03

Pos logit contributions
ig+0.177
iper+0.167
aj+0.164
 fort+0.162
uffy+0.157
Neg logit contributions
 him-0.119
 going-0.117
 it-0.115
 pointing-0.112
't-0.106
Token and
Token ID290
Feature activation+4.18e-02

Pos logit contributions
't+0.138
 returning+0.136
 pointing+0.136
 going+0.135
 buying+0.134
Neg logit contributions
ig-0.129
 fort-0.127
ets-0.125
aj-0.123
ches-0.121
Token frightened
Token ID24776
Feature activation-8.93e-02

Pos logit contributions
ig+0.078
iper+0.074
ches+0.074
aj+0.073
 fort+0.072
Neg logit contributions
 going-0.059
 pointing-0.057
all-0.057
 returning-0.056
't-0.056
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
 returning+0.123
 buying+0.121
all+0.119
't+0.119
 going+0.118
Neg logit contributions
ches-0.155
ig-0.151
 fort-0.150
ets-0.149
 web-0.144
Token They
Token ID1119
Feature activation+5.82e-02

Pos logit contributions
 going+0.044
all+0.043
 returning+0.043
 pointing+0.041
 buying+0.040
Neg logit contributions
ig-0.061
aj-0.060
iper-0.060
ches-0.060
 fort-0.060
Token saw
Token ID2497
Feature activation-6.43e-02

Pos logit contributions
ig+0.113
aj+0.106
 fort+0.102
iper+0.102
ches+0.097
Neg logit contributions
 going-0.090
 pointing-0.087
 him-0.085
't-0.081
 Mike-0.080
Token the
Token ID262
Feature activation+1.62e-01

Pos logit contributions
all+0.075
 going+0.075
 returning+0.073
 pointing+0.072
't+0.071
Neg logit contributions
ig-0.130
ches-0.126
iper-0.126
 fort-0.125
aj-0.124
Token frog
Token ID21264
Feature activation+5.82e-02

Pos logit contributions
ig+0.250
 fort+0.238
aj+0.232
 web+0.231
ches+0.230
Neg logit contributions
all-0.271
 going-0.269
 pointing-0.264
 returning-0.263
 him-0.257
Token and
Token ID290
Feature activation+3.08e-02

Pos logit contributions
ig+0.116
ches+0.110
 fort+0.108
iper+0.107
aj+0.106
Neg logit contributions
 going-0.073
all-0.073
 pointing-0.071
't-0.070
 returning-0.070
Token ran
Token ID4966
Feature activation-8.94e-02

Pos logit contributions
ig+0.068
aj+0.064
iper+0.063
ches+0.062
 fort+0.061
Neg logit contributions
 pointing-0.040
 going-0.039
 him-0.038
 returning-0.036
 it-0.035
Token away
Token ID1497
Feature activation-2.28e-02

Pos logit contributions
 returning+0.111
 rep+0.111
 pointing+0.110
 buying+0.109
all+0.109
Neg logit contributions
ches-0.165
 fort-0.161
ets-0.156
ig-0.154
iper-0.151
Token.
Token ID13
Feature activation-3.69e-02

Pos logit contributions
Welcome+0.030
all+0.029
anes+0.029
 going+0.029
 Crab+0.029
Neg logit contributions
 web-0.041
 fort-0.039
ig-0.038
ows-0.038
ches-0.038
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
 going+0.042
 returning+0.042
all+0.041
 pointing+0.041
 buying+0.041
Neg logit contributions
ig-0.045
uffy-0.040
aj-0.040
 fort-0.039
ches-0.038
Token there
Token ID612
Feature activation-1.71e-02

Pos logit contributions
 going+0.010
all+0.010
 pointing+0.010
 returning+0.010
't+0.009
Neg logit contributions
ig-0.031
ches-0.030
iper-0.030
aj-0.030
 fort-0.030
Token was
Token ID373
Feature activation+2.03e-02

Pos logit contributions
 going+0.029
 pointing+0.028
all+0.028
 returning+0.027
't+0.027
Neg logit contributions
ig-0.028
iper-0.026
aj-0.026
ches-0.026
 fort-0.025
Token a
Token ID257
Feature activation+8.73e-02

Pos logit contributions
ig+0.038
 fort+0.038
iper+0.037
eh+0.037
 web+0.036
Neg logit contributions
 returning-0.025
 pointing-0.025
all-0.024
't-0.024
 going-0.024
Token little
Token ID1310
Feature activation+6.91e-02

Pos logit contributions
ig+0.158
ches+0.156
iper+0.155
aj+0.151
ets+0.150
Neg logit contributions
 going-0.133
 pointing-0.131
 returning-0.123
't-0.120
 buying-0.119
Token girl
Token ID2576
Feature activation+9.22e-02

Pos logit contributions
ches+0.107
aj+0.106
ig+0.104
iper+0.104
ets+0.099
Neg logit contributions
 going-0.118
 pointing-0.115
 returning-0.112
 Crab-0.111
 him-0.110
Token named
Token ID3706
Feature activation-6.16e-02

Pos logit contributions
ig+0.135
iper+0.128
aj+0.128
ches+0.127
 fort+0.123
Neg logit contributions
 going-0.169
 pointing-0.164
 returning-0.161
all-0.160
't-0.159
Token Lily
Token ID20037
Feature activation+1.43e-01

Pos logit contributions
 going+0.096
 pointing+0.094
all+0.093
 returning+0.093
't+0.090
Neg logit contributions
ig-0.106
iper-0.100
ches-0.099
aj-0.099
 fort-0.097
Token.
Token ID13
Feature activation-3.68e-02

Pos logit contributions
ig+0.316
ches+0.302
ox+0.300
iper+0.299
 fort+0.298
Neg logit contributions
 going-0.142
all-0.142
 pointing-0.141
 returning-0.139
't-0.138
Token She
Token ID1375
Feature activation+1.08e-01

Pos logit contributions
 going+0.031
 pointing+0.029
't+0.029
all+0.028
 him+0.028
Neg logit contributions
ig-0.092
iper-0.088
ches-0.087
aj-0.087
ox-0.085
Token loved
Token ID6151
Feature activation-7.30e-02

Pos logit contributions
ig+0.203
iper+0.195
aj+0.194
ches+0.184
 fort+0.183
Neg logit contributions
 going-0.183
't-0.166
 pointing-0.165
 him-0.159
 returning-0.156
Token little
Token ID1310
Feature activation+6.90e-02

Pos logit contributions
ig+0.155
ches+0.154
iper+0.154
aj+0.150
ets+0.150
Neg logit contributions
 going-0.126
 pointing-0.123
 returning-0.115
 rep-0.111
't-0.111
Token girl
Token ID2576
Feature activation+9.23e-02

Pos logit contributions
aj+0.113
ches+0.112
iper+0.108
ig+0.108
ets+0.106
Neg logit contributions
 going-0.112
 pointing-0.109
 Joey-0.107
 Crab-0.107
 returning-0.105
Token named
Token ID3706
Feature activation-6.07e-02

Pos logit contributions
ig+0.143
aj+0.139
iper+0.139
ches+0.138
 fort+0.132
Neg logit contributions
 going-0.162
 pointing-0.156
 returning-0.153
 buying-0.150
all-0.149
Token Sarah
Token ID10490
Feature activation+1.47e-01

Pos logit contributions
 going+0.094
 pointing+0.092
 returning+0.091
all+0.091
't+0.089
Neg logit contributions
ig-0.105
iper-0.099
ches-0.099
aj-0.098
 fort-0.097
Token.
Token ID13
Feature activation-3.73e-02

Pos logit contributions
ig+0.357
iper+0.342
aj+0.341
ches+0.340
 fort+0.335
Neg logit contributions
 going-0.127
 pointing-0.120
all-0.116
 returning-0.116
't-0.112
Token She
Token ID1375
Feature activation+1.07e-01

Pos logit contributions
 going+0.036
 pointing+0.034
all+0.034
 returning+0.033
't+0.033
Neg logit contributions
ig-0.088
ches-0.084
iper-0.083
aj-0.083
 fort-0.081
Token loved
Token ID6151
Feature activation-7.60e-02

Pos logit contributions
ig+0.197
iper+0.191
aj+0.189
ches+0.182
 fort+0.176
Neg logit contributions
 going-0.183
't-0.166
 pointing-0.165
 him-0.160
 returning-0.156
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
otion+0.091
't+0.087
all+0.086
 rep+0.084
 pointing+0.084
Neg logit contributions
 fort-0.142
 scar-0.136
ig-0.134
iper-0.133
 web-0.132
Token explore
Token ID7301
Feature activation-6.68e-02

Pos logit contributions
all+0.021
 returning+0.020
 pointing+0.020
no+0.020
 Lib+0.019
Neg logit contributions
ig-0.018
 fort-0.017
iper-0.017
 web-0.016
ox-0.016
Token outside
Token ID2354
Feature activation-5.03e-02

Pos logit contributions
 going+0.087
 pointing+0.084
all+0.083
 returning+0.082
't+0.081
Neg logit contributions
ig-0.133
iper-0.126
ches-0.126
aj-0.125
 fort-0.123
Token with
Token ID351
Feature activation-4.48e-02

Pos logit contributions
 Crab+0.073
 Sebastian+0.072
 rep+0.068
all+0.068
 Lib+0.068
Neg logit contributions
ig-0.087
 fort-0.082
 web-0.082
house-0.079
 room-0.077
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
 going+0.100
all+0.099
't+0.098
 pointing+0.097
 returning+0.096
Neg logit contributions
ig-0.207
ches-0.196
 fort-0.194
 web-0.192
aj-0.192
Token
Token ID220
Feature activation-1.84e-02

Pos logit contributions
 going+0.046
 returning+0.043
 pointing+0.043
all+0.043
 buying+0.042
Neg logit contributions
ig-0.070
iper-0.067
ches-0.067
aj-0.067
 fort-0.065
Token
Token ID198
Feature activation+3.99e-03

Pos logit contributions
 going+0.018
 pointing+0.017
 returning+0.017
all+0.016
 buying+0.016
Neg logit contributions
ig-0.042
aj-0.041
ches-0.040
iper-0.040
ox-0.039
Token
Token ID198
Feature activation+5.34e-03

Pos logit contributions
ig+0.009
aj+0.008
ches+0.008
iper+0.008
uffy+0.008
Neg logit contributions
 going-0.005
 returning-0.004
 pointing-0.004
 buying-0.004
all-0.004
TokenM
Token ID44
Feature activation+8.54e-02

Pos logit contributions
ig+0.010
iper+0.010
ches+0.010
aj+0.010
 fort+0.009
Neg logit contributions
 going-0.007
 pointing-0.007
all-0.007
 returning-0.007
't-0.007
Tokenia
Token ID544
Feature activation+1.51e-01

Pos logit contributions
ches+0.074
ig+0.071
iper+0.070
ox+0.065
 fort+0.065
Neg logit contributions
all-0.202
't-0.197
 going-0.194
 pointing-0.191
 returning-0.188
Token's
Token ID338
Feature activation+9.19e-02

Pos logit contributions
aj+0.284
 fort+0.280
ig+0.272
iper+0.267
ets+0.263
Neg logit contributions
 pointing-0.212
 going-0.211
 him-0.209
't-0.208
 it-0.200
Token mom
Token ID1995
Feature activation+6.46e-02

Pos logit contributions
ches+0.208
iper+0.201
ig+0.201
ets+0.197
eh+0.196
Neg logit contributions
 going-0.100
 pointing-0.095
't-0.092
 returning-0.090
 him-0.087
Token saw
Token ID2497
Feature activation-5.54e-02

Pos logit contributions
ig+0.120
aj+0.117
 fort+0.114
iper+0.112
ox+0.110
Neg logit contributions
 going-0.096
 pointing-0.095
't-0.092
 him-0.092
 returning-0.090
Token that
Token ID326
Feature activation+1.75e-02

Pos logit contributions
 going+0.057
 returning+0.056
 pointing+0.055
all+0.055
't+0.053
Neg logit contributions
ig-0.121
ches-0.117
iper-0.115
aj-0.114
 fort-0.113
Token Mia
Token ID32189
Feature activation+1.42e-01

Pos logit contributions
ig+0.038
iper+0.036
ches+0.036
aj+0.036
 fort+0.036
Neg logit contributions
 going-0.019
 pointing-0.018
 returning-0.018
all-0.018
't-0.018
Token "
Token ID366
Feature activation-2.62e-02

Pos logit contributions
Welcome+0.022
 returning+0.022
 going+0.022
all+0.022
 buying+0.022
Neg logit contributions
ches-0.034
ig-0.034
iper-0.033
aj-0.032
 fort-0.031
TokenYes
Token ID5297
Feature activation-6.38e-02

Pos logit contributions
 going+0.033
 pointing+0.032
 returning+0.032
 buying+0.031
all+0.030
Neg logit contributions
ig-0.053
ches-0.050
iper-0.050
ox-0.048
aj-0.048
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
 pointing+0.097
 going+0.096
 returning+0.095
 buying+0.093
all+0.092
Neg logit contributions
ig-0.115
ches-0.106
iper-0.106
 fort-0.103
aj-0.102
Token please
Token ID3387
Feature activation-6.95e-02

Pos logit contributions
 pointing+0.027
 returning+0.027
 going+0.027
 buying+0.026
all+0.024
Neg logit contributions
ig-0.050
iper-0.048
ches-0.048
 fort-0.047
aj-0.046
Token!"
Token ID2474
Feature activation-7.50e-02

Pos logit contributions
 pointing+0.079
 Crab+0.079
 returning+0.078
 going+0.077
 him+0.076
Neg logit contributions
ig-0.147
ches-0.137
aj-0.136
iper-0.135
 fort-0.135
Token Jane
Token ID12091
Feature activation+1.45e-01

Pos logit contributions
 going+0.089
 pointing+0.084
 returning+0.084
all+0.081
 buying+0.080
Neg logit contributions
ig-0.156
iper-0.148
aj-0.148
ches-0.147
 fort-0.147
Token was
Token ID373
Feature activation+7.08e-03

Pos logit contributions
aj+0.269
 fort+0.266
iper+0.257
auldron+0.250
ig+0.248
Neg logit contributions
all-0.205
 pointing-0.200
 going-0.193
't-0.192
 it-0.180
Token good
Token ID922
Feature activation-3.58e-02

Pos logit contributions
ig+0.011
aj+0.011
ches+0.011
 web+0.011
eh+0.011
Neg logit contributions
Welcome-0.010
all-0.010
 Crab-0.010
 him-0.010
 returning-0.010
Token at
Token ID379
Feature activation-5.04e-02

Pos logit contributions
Welcome+0.060
 Abby+0.058
 Joey+0.055
 Hipp+0.055
 Gig+0.054
Neg logit contributions
ig-0.060
aj-0.053
ets-0.052
 fort-0.052
udd-0.051
Token painting
Token ID12036
Feature activation-1.16e-01

Pos logit contributions
 Crab+0.071
Welcome+0.070
 Lib+0.069
 Abby+0.068
 Hipp+0.068
Neg logit contributions
ig-0.098
 fort-0.086
 web-0.085
house-0.080
 room-0.078
Token and
Token ID290
Feature activation+2.74e-02

Pos logit contributions
 going+0.120
 pointing+0.117
all+0.114
 returning+0.114
't+0.112
Neg logit contributions
ig-0.258
ches-0.245
iper-0.244
aj-0.243
 fort-0.242
Token it
Token ID340
Feature activation+5.85e-02

Pos logit contributions
 going+0.011
 pointing+0.011
 returning+0.011
all+0.011
't+0.011
Neg logit contributions
ig-0.017
iper-0.016
aj-0.016
ches-0.015
 fort-0.015
Token was
Token ID373
Feature activation+3.60e-03

Pos logit contributions
iper+0.110
ig+0.110
aj+0.108
 fort+0.104
ox+0.103
Neg logit contributions
 going-0.086
 pointing-0.085
all-0.082
't-0.080
 it-0.078
Token wet
Token ID9583
Feature activation-4.51e-02

Pos logit contributions
ig+0.007
iper+0.007
ches+0.007
aj+0.007
 fort+0.006
Neg logit contributions
 going-0.005
 pointing-0.005
all-0.005
 returning-0.005
't-0.005
Token too
Token ID1165
Feature activation-2.71e-02

Pos logit contributions
 pointing+0.068
 returning+0.066
 Crab+0.066
all+0.066
 buying+0.066
Neg logit contributions
ig-0.077
 fort-0.073
ches-0.072
 web-0.068
ets-0.068
Token -
Token ID532
Feature activation-2.86e-02

Pos logit contributions
 pointing+0.036
 returning+0.035
 buying+0.035
 going+0.035
 Crab+0.035
Neg logit contributions
ig-0.051
ches-0.049
aj-0.048
 fort-0.048
iper-0.048
Token it
Token ID340
Feature activation+5.91e-02

Pos logit contributions
all+0.022
't+0.022
 returning+0.022
 pointing+0.021
 Crab+0.021
Neg logit contributions
ig-0.066
 fort-0.063
 web-0.063
iper-0.062
ches-0.061
Token had
Token ID550
Feature activation-2.93e-02

Pos logit contributions
iper+0.115
ig+0.114
aj+0.113
ox+0.106
 fort+0.105
Neg logit contributions
 going-0.088
 pointing-0.082
 it-0.080
't-0.078
all-0.075
Token been
Token ID587
Feature activation-4.62e-03

Pos logit contributions
 going+0.037
 pointing+0.035
 returning+0.034
all+0.034
't+0.034
Neg logit contributions
ig-0.062
iper-0.059
ches-0.058
aj-0.057
 fort-0.056
Token raining
Token ID43079
Feature activation-3.85e-02

Pos logit contributions
 going+0.007
 pointing+0.006
all+0.006
 returning+0.006
't+0.006
Neg logit contributions
ig-0.009
iper-0.009
ches-0.008
 fort-0.008
ox-0.008
Token all
Token ID477
Feature activation+7.09e-03

Pos logit contributions
 going+0.039
 pointing+0.037
all+0.037
 returning+0.036
't+0.036
Neg logit contributions
ig-0.088
iper-0.085
aj-0.084
ches-0.083
 fort-0.082
Token morning
Token ID3329
Feature activation-5.06e-02

Pos logit contributions
ig+0.016
ches+0.015
aj+0.015
 fort+0.015
iper+0.014
Neg logit contributions
 pointing-0.007
 returning-0.007
 going-0.007
 Crab-0.007
't-0.007
Token should
Token ID815
Feature activation-8.86e-02

Pos logit contributions
ig+0.035
aj+0.033
iper+0.033
 fort+0.032
ches+0.032
Neg logit contributions
 going-0.025
't-0.024
 him-0.023
 pointing-0.023
all-0.023
Token enjoy
Token ID2883
Feature activation-1.08e-01

Pos logit contributions
 pointing+0.134
all+0.132
 returning+0.131
 going+0.131
Welcome+0.128
Neg logit contributions
ig-0.150
iper-0.143
ches-0.141
aj-0.140
 fort-0.138
Token them
Token ID606
Feature activation+7.27e-02

Pos logit contributions
Welcome+0.152
all+0.146
't+0.145
 Lib+0.142
 pointing+0.141
Neg logit contributions
ig-0.189
aj-0.183
 web-0.180
 fort-0.175
ches-0.169
Token.
Token ID13
Feature activation-3.62e-02

Pos logit contributions
ig+0.160
ches+0.153
aj+0.152
iper+0.151
 fort+0.150
Neg logit contributions
 going-0.076
 pointing-0.076
 returning-0.074
all-0.072
't-0.071
Token They
Token ID1119
Feature activation+5.29e-02

Pos logit contributions
 going+0.051
 returning+0.048
 pointing+0.048
no+0.046
all+0.045
Neg logit contributions
ig-0.065
aj-0.064
iper-0.064
ox-0.063
house-0.061
Token are
Token ID389
Feature activation-3.18e-03

Pos logit contributions
ig+0.109
aj+0.105
iper+0.103
 fort+0.101
ches+0.099
Neg logit contributions
 going-0.070
't-0.067
 him-0.065
 pointing-0.062
all-0.061
Token special
Token ID2041
Feature activation-2.16e-02

Pos logit contributions
 going+0.004
 pointing+0.004
all+0.004
 returning+0.004
't+0.004
Neg logit contributions
ig-0.006
iper-0.006
ches-0.006
aj-0.006
 fort-0.005
Token for
Token ID329
Feature activation-3.42e-02

Pos logit contributions
 pointing+0.027
 going+0.026
't+0.026
 returning+0.026
all+0.025
Neg logit contributions
ig-0.043
aj-0.041
 fort-0.041
 web-0.040
ches-0.040
Token your
Token ID534
Feature activation+1.27e-01

Pos logit contributions
all+0.044
 pointing+0.042
Welcome+0.042
 Lib+0.042
't+0.042
Neg logit contributions
ig-0.060
iper-0.059
 web-0.057
aj-0.057
ches-0.057
Token birthday
Token ID10955
Feature activation-7.17e-02

Pos logit contributions
ig+0.226
ches+0.215
iper+0.214
aj+0.209
 fort+0.204
Neg logit contributions
 going-0.191
 pointing-0.185
 returning-0.181
all-0.180
't-0.178
Token."
Token ID526
Feature activation-4.13e-02

Pos logit contributions
 going+0.066
 pointing+0.062
all+0.061
 returning+0.060
't+0.059
Neg logit contributions
ig-0.170
iper-0.162
ches-0.162
aj-0.160
 fort-0.158
Token.
Token ID13
Feature activation-4.28e-02

Pos logit contributions
 going+0.064
 pointing+0.062
all+0.062
 returning+0.061
't+0.061
Neg logit contributions
ig-0.136
ches-0.129
iper-0.128
aj-0.127
 fort-0.127
Token The
Token ID383
Feature activation+1.44e-01

Pos logit contributions
 going+0.056
 pointing+0.054
 returning+0.053
all+0.053
't+0.052
Neg logit contributions
ig-0.082
iper-0.080
aj-0.080
ches-0.079
 fort-0.078
Token mom
Token ID1995
Feature activation+6.25e-02

Pos logit contributions
ig+0.254
ches+0.245
iper+0.240
ox+0.236
ets+0.229
Neg logit contributions
 going-0.232
 pointing-0.221
 returning-0.214
't-0.209
 him-0.209
Tokenmy
Token ID1820
Feature activation+8.70e-02

Pos logit contributions
ig+0.092
iper+0.091
 web+0.090
ox+0.089
aj+0.089
Neg logit contributions
all-0.110
 going-0.105
 pointing-0.103
 him-0.102
 returning-0.099
Token and
Token ID290
Feature activation+3.19e-02

Pos logit contributions
ig+0.157
aj+0.154
iper+0.153
 fort+0.151
ox+0.149
Neg logit contributions
all-0.122
 pointing-0.121
 going-0.120
 returning-0.113
 him-0.112
Token son
Token ID3367
Feature activation+2.85e-02

Pos logit contributions
ig+0.069
ches+0.066
aj+0.065
iper+0.065
 fort+0.064
Neg logit contributions
 going-0.037
 pointing-0.036
all-0.035
 returning-0.035
 him-0.034
Token smiled
Token ID13541
Feature activation-4.24e-02

Pos logit contributions
ig+0.049
aj+0.049
iper+0.048
 fort+0.047
ox+0.046
Neg logit contributions
 going-0.044
 pointing-0.043
 him-0.042
all-0.041
 returning-0.041
Token when
Token ID618
Feature activation-4.98e-02

Pos logit contributions
't+0.066
 buying+0.064
 going+0.064
 returning+0.063
all+0.063
Neg logit contributions
ig-0.069
 fort-0.067
aj-0.065
iper-0.065
ches-0.064
Token they
Token ID484
Feature activation+6.13e-02

Pos logit contributions
all+0.052
 going+0.047
 returning+0.046
't+0.046
 pointing+0.045
Neg logit contributions
ig-0.104
 fort-0.103
iper-0.100
ches-0.097
aj-0.097
Token saw
Token ID2497
Feature activation-6.87e-02

Pos logit contributions
ig+0.100
aj+0.098
iper+0.095
 fort+0.093
uffy+0.093
Neg logit contributions
 going-0.104
 him-0.100
 pointing-0.098
 rep-0.095
 returning-0.095
Token their
Token ID511
Feature activation+1.54e-01

Pos logit contributions
 going+0.072
 pointing+0.070
all+0.070
 returning+0.068
't+0.068
Neg logit contributions
ig-0.152
iper-0.146
ches-0.145
aj-0.144
 fort-0.143
Token.
Token ID13
Feature activation-4.22e-02

Pos logit contributions
all+0.098
 Crab+0.092
 pointing+0.091
 Lib+0.091
 buying+0.091
Neg logit contributions
ig-0.143
ches-0.140
 fort-0.139
 web-0.139
elope-0.130
Token In
Token ID554
Feature activation+2.14e-02

Pos logit contributions
 going+0.048
't+0.047
 pointing+0.047
 him+0.046
all+0.045
Neg logit contributions
ig-0.097
aj-0.090
ches-0.089
iper-0.088
ox-0.086
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
ig+0.051
aj+0.051
ches+0.050
iper+0.049
ox+0.048
Neg logit contributions
 going-0.020
 pointing-0.019
 returning-0.019
't-0.019
 him-0.018
Token distance
Token ID5253
Feature activation-3.77e-02

Pos logit contributions
aj+0.337
uffy+0.297
ows+0.296
ig+0.296
ches+0.282
Neg logit contributions
 going-0.215
 him-0.211
Welcome-0.202
 returning-0.190
't-0.190
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
 going+0.044
 pointing+0.043
 him+0.043
 returning+0.042
all+0.041
Neg logit contributions
ig-0.079
aj-0.078
iper-0.077
ches-0.077
 fort-0.075
Token deer
Token ID20096
Feature activation+4.59e-02

Pos logit contributions
aj+0.223
ig+0.221
iper+0.207
ches+0.204
ox+0.197
Neg logit contributions
 going-0.282
 him-0.269
 returning-0.269
 pointing-0.268
Welcome-0.268
Token could
Token ID714
Feature activation-8.74e-02

Pos logit contributions
aj+0.076
iper+0.071
ig+0.071
 fort+0.067
ox+0.066
Neg logit contributions
 going-0.082
 him-0.080
 pointing-0.080
't-0.078
 returning-0.077
Token see
Token ID766
Feature activation-7.04e-02

Pos logit contributions
 going+0.115
 pointing+0.114
all+0.114
 returning+0.112
 buying+0.111
Neg logit contributions
ig-0.163
ches-0.156
iper-0.155
 fort-0.154
 web-0.151
Token something
Token ID1223
Feature activation+8.71e-03

Pos logit contributions
 going+0.080
 pointing+0.078
all+0.078
 returning+0.076
't+0.076
Neg logit contributions
ig-0.149
ches-0.142
iper-0.142
aj-0.140
 fort-0.140
Token that
Token ID326
Feature activation+2.58e-03

Pos logit contributions
ig+0.013
ches+0.013
 fort+0.012
 web+0.012
iper+0.012
Neg logit contributions
all-0.015
't-0.014
 pointing-0.014
 returning-0.014
 going-0.014
Token made
Token ID925
Feature activation-9.33e-02

Pos logit contributions
ig+0.004
aj+0.004
iper+0.004
ches+0.004
ox+0.004
Neg logit contributions
 going-0.004
 pointing-0.004
 him-0.004
 returning-0.004
't-0.004
Token Lily
Token ID20037
Feature activation+1.41e-01

Pos logit contributions
 going+0.103
 returning+0.098
 pointing+0.098
all+0.098
 buying+0.093
Neg logit contributions
ig-0.234
ches-0.228
iper-0.223
aj-0.220
 fort-0.218
Token's
Token ID338
Feature activation+9.38e-02

Pos logit contributions
ig+0.166
iper+0.160
aj+0.154
 fort+0.150
ches+0.147
Neg logit contributions
 going-0.300
 pointing-0.296
all-0.286
 returning-0.286
 him-0.286
Token forehead
Token ID22645
Feature activation-5.91e-02

Pos logit contributions
ig+0.132
iper+0.125
ches+0.123
aj+0.118
ets+0.114
Neg logit contributions
 going-0.179
 pointing-0.178
 returning-0.171
 him-0.171
't-0.170
Token and
Token ID290
Feature activation+3.75e-02

Pos logit contributions
 going+0.073
all+0.073
 returning+0.071
't+0.071
 pointing+0.071
Neg logit contributions
ig-0.117
ches-0.112
aj-0.110
 fort-0.109
ox-0.108
Token feels
Token ID5300
Feature activation-3.36e-02

Pos logit contributions
ig+0.062
iper+0.059
ches+0.058
aj+0.057
 fort+0.057
Neg logit contributions
 going-0.063
 pointing-0.062
 him-0.059
 returning-0.059
't-0.058
Token that
Token ID326
Feature activation+7.57e-03

Pos logit contributions
all+0.053
't+0.049
 going+0.048
 Lib+0.048
Welcome+0.048
Neg logit contributions
ig-0.056
 web-0.054
ches-0.054
aj-0.052
iper-0.051
Token it
Token ID340
Feature activation+6.68e-02

Pos logit contributions
ig+0.015
 web+0.015
ches+0.015
aj+0.014
 fort+0.014
Neg logit contributions
all-0.009
 returning-0.008
 going-0.008
 buying-0.008
 Crab-0.008
Token is
Token ID318
Feature activation+9.35e-04

Pos logit contributions
ig+0.108
iper+0.107
aj+0.100
ox+0.097
 fort+0.097
Neg logit contributions
 going-0.120
't-0.118
 him-0.111
 pointing-0.110
all-0.105
Token not
Token ID407
Feature activation-3.10e-02

Pos logit contributions
ig+0.002
ches+0.002
aj+0.002
iper+0.002
 fort+0.001
Neg logit contributions
 going-0.001
all-0.001
 pointing-0.001
 returning-0.001
't-0.001
Token hot
Token ID3024
Feature activation-2.16e-02

Pos logit contributions
 going+0.041
 pointing+0.039
 returning+0.038
all+0.038
 him+0.037
Neg logit contributions
ig-0.063
iper-0.059
ches-0.059
aj-0.059
 fort-0.058
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
't+0.031
 pointing+0.030
 returning+0.030
 Cr+0.030
all+0.030
Neg logit contributions
 fort-0.038
ets-0.037
ig-0.036
 web-0.034
aj-0.034
Token her
Token ID607
Feature activation+1.31e-01

Pos logit contributions
Welcome+0.191
all+0.185
 returning+0.174
 Crab+0.172
 Hello+0.172
Neg logit contributions
ig-0.236
ches-0.232
iper-0.225
 fort-0.223
 web-0.223
Token.
Token ID13
Feature activation-4.12e-02

Pos logit contributions
ig+0.248
 fort+0.238
ches+0.236
house+0.232
aj+0.232
Neg logit contributions
Welcome-0.173
all-0.168
 returning-0.164
't-0.157
 going-0.156
Token
Token ID198
Feature activation+2.33e-03

Pos logit contributions
 going+0.049
 returning+0.047
 pointing+0.045
all+0.045
 buying+0.044
Neg logit contributions
ig-0.085
iper-0.082
ches-0.081
aj-0.081
 fort-0.080
Token
Token ID198
Feature activation+3.21e-03

Pos logit contributions
ig+0.004
ches+0.004
iper+0.004
uffy+0.004
aj+0.004
Neg logit contributions
 going-0.003
 returning-0.003
 buying-0.003
 pointing-0.003
all-0.003
TokenOne
Token ID3198
Feature activation+1.74e-02

Pos logit contributions
ig+0.005
ches+0.005
iper+0.005
aj+0.005
 fort+0.005
Neg logit contributions
 going-0.005
 pointing-0.005
 returning-0.005
all-0.005
 buying-0.005
Token day
Token ID1110
Feature activation-4.38e-02

Pos logit contributions
ig+0.035
aj+0.034
ches+0.034
iper+0.034
ox+0.033
Neg logit contributions
 going-0.022
 pointing-0.022
all-0.021
 returning-0.021
't-0.021
Token,
Token ID11
Feature activation-1.38e-02

Pos logit contributions
 going+0.062
all+0.061
 returning+0.060
 pointing+0.060
't+0.059
Neg logit contributions
ig-0.080
iper-0.075
ches-0.074
aj-0.073
 fort-0.073
Token she
Token ID673
Feature activation+9.81e-02

Pos logit contributions
 going+0.013
 pointing+0.013
all+0.012
 returning+0.012
't+0.012
Neg logit contributions
ig-0.032
iper-0.031
ches-0.031
aj-0.031
 fort-0.030
Token saw
Token ID2497
Feature activation-6.07e-02

Pos logit contributions
ig+0.177
iper+0.165
ches+0.164
aj+0.163
 fort+0.158
Neg logit contributions
 going-0.154
 pointing-0.149
 returning-0.144
 him-0.142
all-0.142
Token a
Token ID257
Feature activation+9.33e-02

Pos logit contributions
 going+0.067
all+0.066
 returning+0.064
 pointing+0.064
't+0.063
Neg logit contributions
ig-0.129
iper-0.124
ches-0.123
 fort-0.122
aj-0.122
Token man
Token ID582
Feature activation+4.74e-02

Pos logit contributions
ig+0.164
iper+0.159
ches+0.158
aj+0.156
ets+0.152
Neg logit contributions
 going-0.145
 pointing-0.144
 returning-0.138
 buying-0.134
't-0.133
Token yogurt
Token ID32132
Feature activation-1.29e-02

Pos logit contributions
ig+0.157
ches+0.146
iper+0.146
aj+0.146
 fort+0.145
Neg logit contributions
 going-0.108
 pointing-0.106
all-0.104
 returning-0.104
't-0.104
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 going+0.016
 pointing+0.016
 returning+0.016
't+0.016
all+0.016
Neg logit contributions
ig-0.025
ches-0.024
 fort-0.024
aj-0.023
iper-0.023
Token made
Token ID925
Feature activation-9.73e-02

Pos logit contributions
ig+0.036
iper+0.034
ches+0.034
aj+0.033
 fort+0.033
Neg logit contributions
 going-0.031
 pointing-0.030
 returning-0.030
all-0.029
 him-0.029
Token a
Token ID257
Feature activation+7.09e-02

Pos logit contributions
 going+0.099
 pointing+0.096
 returning+0.096
all+0.095
't+0.095
Neg logit contributions
ig-0.215
iper-0.207
ches-0.206
 fort-0.203
aj-0.202
Token big
Token ID1263
Feature activation+4.50e-02

Pos logit contributions
ig+0.145
iper+0.140
ches+0.139
aj+0.136
ox+0.135
Neg logit contributions
 going-0.092
 pointing-0.090
 returning-0.086
 him-0.083
 buying-0.082
Token picnic
Token ID35715
Feature activation-7.05e-02

Pos logit contributions
ig+0.077
iper+0.074
ches+0.074
aj+0.073
ox+0.072
Neg logit contributions
 going-0.071
 pointing-0.069
 returning-0.068
all-0.067
 him-0.066
Token lunch
Token ID9965
Feature activation-8.52e-02

Pos logit contributions
 going+0.096
 pointing+0.096
all+0.095
't+0.095
 returning+0.093
Neg logit contributions
ig-0.132
 fort-0.127
ches-0.123
iper-0.123
aj-0.122
Token.
Token ID13
Feature activation-4.40e-02

Pos logit contributions
 pointing+0.095
 going+0.093
all+0.092
 returning+0.092
't+0.091
Neg logit contributions
ig-0.178
 fort-0.175
ches-0.170
iper-0.168
ets-0.167
Token They
Token ID1119
Feature activation+5.16e-02

Pos logit contributions
 going+0.063
 pointing+0.059
 returning+0.059
all+0.058
't+0.056
Neg logit contributions
ig-0.081
iper-0.079
aj-0.077
ches-0.077
 fort-0.075
Token ate
Token ID15063
Feature activation-1.42e-01

Pos logit contributions
ig+0.098
aj+0.094
iper+0.091
ox+0.091
ches+0.090
Neg logit contributions
 going-0.076
 pointing-0.071
 him-0.071
 returning-0.070
all-0.070
Token and
Token ID290
Feature activation+2.86e-02

Pos logit contributions
 going+0.169
 pointing+0.166
all+0.164
 returning+0.163
't+0.163
Neg logit contributions
ig-0.295
ches-0.279
iper-0.277
 fort-0.276
aj-0.276
Token and
Token ID290
Feature activation+4.67e-02

Pos logit contributions
ig+0.283
 fort+0.273
iper+0.270
ches+0.261
eh+0.261
Neg logit contributions
 pointing-0.142
all-0.138
't-0.131
 going-0.130
 him-0.129
Token Tom
Token ID4186
Feature activation+1.24e-01

Pos logit contributions
ig+0.093
aj+0.088
ches+0.088
iper+0.087
 fort+0.086
Neg logit contributions
 going-0.062
 pointing-0.061
 him-0.059
all-0.059
 returning-0.059
Token nodded
Token ID14464
Feature activation-6.66e-02

Pos logit contributions
ig+0.190
aj+0.176
iper+0.176
ches+0.176
 fort+0.175
Neg logit contributions
 going-0.220
 pointing-0.219
 him-0.215
all-0.211
't-0.210
Token.
Token ID13
Feature activation-3.47e-02

Pos logit contributions
't+0.097
all+0.094
 Crab+0.093
 buying+0.093
 Lib+0.092
Neg logit contributions
ig-0.112
aj-0.111
 fort-0.108
ox-0.107
ches-0.107
Token They
Token ID1119
Feature activation+5.76e-02

Pos logit contributions
 going+0.050
all+0.048
 returning+0.047
 pointing+0.046
 him+0.045
Neg logit contributions
aj-0.063
ig-0.063
 fort-0.062
iper-0.060
ox-0.059
Token felt
Token ID2936
Feature activation-2.59e-02

Pos logit contributions
ig+0.119
aj+0.110
 fort+0.108
ches+0.106
 web+0.106
Neg logit contributions
 going-0.080
 him-0.077
 pointing-0.076
 rep-0.074
't-0.071
Token sorry
Token ID7926
Feature activation-8.75e-02

Pos logit contributions
all+0.039
't+0.037
 pointing+0.037
 going+0.036
 returning+0.036
Neg logit contributions
ig-0.043
ches-0.043
iper-0.042
aj-0.042
 fort-0.042
Token and
Token ID290
Feature activation+3.76e-02

Pos logit contributions
 pointing+0.120
 going+0.120
 returning+0.118
all+0.117
 buying+0.115
Neg logit contributions
ig-0.165
ches-0.156
 fort-0.155
aj-0.155
iper-0.153
Token ashamed
Token ID22461
Feature activation-9.37e-02

Pos logit contributions
ig+0.065
iper+0.061
ches+0.061
aj+0.061
 fort+0.060
Neg logit contributions
 going-0.057
all-0.056
 pointing-0.056
 returning-0.055
't-0.055
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
 Lib+0.153
 Crab+0.153
 Abby+0.153
 returning+0.149
 Joey+0.149
Neg logit contributions
ets-0.145
ches-0.138
ows-0.137
ig-0.135
 fort-0.135
Token They
Token ID1119
Feature activation+6.03e-02

Pos logit contributions
 going+0.051
all+0.049
 returning+0.049
 him+0.047
't+0.047
Neg logit contributions
aj-0.059
ig-0.057
 fort-0.056
ox-0.055
iper-0.055
Token water
Token ID1660
Feature activation-2.08e-02

Pos logit contributions
ig+0.307
iper+0.300
ox+0.298
aj+0.292
ches+0.290
Neg logit contributions
 going-0.234
 pointing-0.221
 returning-0.215
 him-0.211
 rep-0.206
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
 going+0.027
all+0.026
 pointing+0.026
't+0.026
 returning+0.026
Neg logit contributions
ig-0.041
ches-0.039
 fort-0.037
iper-0.037
aj-0.037
Token said
Token ID531
Feature activation-6.10e-02

Pos logit contributions
ig+0.029
iper+0.027
ches+0.027
aj+0.027
 fort+0.026
Neg logit contributions
 going-0.026
 pointing-0.026
all-0.025
 returning-0.025
't-0.025
Token,
Token ID11
Feature activation-2.28e-02

Pos logit contributions
 going+0.081
all+0.080
 returning+0.078
 buying+0.077
't+0.076
Neg logit contributions
ig-0.113
ches-0.106
iper-0.105
aj-0.103
ox-0.102
Token "
Token ID366
Feature activation-3.25e-02

Pos logit contributions
Welcome+0.032
no+0.031
all+0.031
 Crab+0.030
reating+0.030
Neg logit contributions
ches-0.036
aj-0.035
ig-0.035
 web-0.033
iper-0.033
Tokenhello
Token ID31373
Feature activation-7.75e-02

Pos logit contributions
 going+0.043
 pointing+0.039
 returning+0.039
 buying+0.039
't+0.036
Neg logit contributions
ig-0.063
ches-0.061
iper-0.060
ox-0.059
eh-0.058
Token."
Token ID526
Feature activation-6.83e-02

Pos logit contributions
 going+0.095
 returning+0.094
 pointing+0.092
 buying+0.091
't+0.089
Neg logit contributions
ig-0.158
ches-0.149
iper-0.148
aj-0.145
ox-0.141
Token The
Token ID383
Feature activation+1.39e-01

Pos logit contributions
 going+0.075
 pointing+0.074
all+0.073
 returning+0.071
't+0.071
Neg logit contributions
ig-0.149
iper-0.141
ches-0.141
aj-0.140
 fort-0.139
Token daughter
Token ID4957
Feature activation+5.49e-02

Pos logit contributions
ig+0.292
aj+0.277
iper+0.276
ches+0.275
 fort+0.272
Neg logit contributions
 going-0.168
 pointing-0.163
all-0.157
 returning-0.155
't-0.154
Token was
Token ID373
Feature activation+5.29e-03

Pos logit contributions
ig+0.093
 fort+0.092
aj+0.091
iper+0.090
ches+0.087
Neg logit contributions
 pointing-0.089
 going-0.084
 Mike-0.083
all-0.083
 him-0.081
Token so
Token ID523
Feature activation-2.89e-02

Pos logit contributions
ig+0.010
ches+0.010
aj+0.010
iper+0.010
 web+0.010
Neg logit contributions
all-0.006
 going-0.006
't-0.006
 returning-0.006
Welcome-0.006
Token above
Token ID2029
Feature activation-3.36e-02

Pos logit contributions
 going+0.106
 pointing+0.104
 returning+0.102
all+0.101
 buying+0.098
Neg logit contributions
ig-0.206
ches-0.198
aj-0.194
iper-0.194
 fort-0.192
Token.
Token ID13
Feature activation-4.23e-02

Pos logit contributions
 going+0.036
 him+0.035
 pointing+0.034
all+0.033
't+0.033
Neg logit contributions
ig-0.074
aj-0.072
iper-0.071
ches-0.070
 fort-0.069
Token Then
Token ID3244
Feature activation-3.75e-02

Pos logit contributions
 going+0.051
 returning+0.048
 pointing+0.048
 him+0.046
no+0.046
Neg logit contributions
ig-0.085
aj-0.083
iper-0.082
ches-0.081
ox-0.080
Token you
Token ID345
Feature activation+6.68e-02

Pos logit contributions
 pointing+0.041
all+0.041
 going+0.041
 returning+0.040
't+0.039
Neg logit contributions
ig-0.079
iper-0.074
aj-0.073
 fort-0.072
ches-0.072
Token'll
Token ID1183
Feature activation-7.85e-02

Pos logit contributions
ig+0.112
aj+0.112
iper+0.109
 fort+0.108
ches+0.104
Neg logit contributions
 going-0.105
't-0.103
all-0.099
 pointing-0.096
 him-0.096
Token feel
Token ID1254
Feature activation-2.55e-02

Pos logit contributions
 going+0.112
 pointing+0.112
all+0.110
 returning+0.109
 buying+0.106
Neg logit contributions
ig-0.144
ches-0.134
iper-0.133
aj-0.131
 fort-0.131
Token strong
Token ID1913
Feature activation-3.18e-02

Pos logit contributions
 pointing+0.038
all+0.038
 going+0.037
't+0.037
 returning+0.036
Neg logit contributions
ig-0.044
ches-0.042
iper-0.041
aj-0.041
 fort-0.040
Token and
Token ID290
Feature activation+3.58e-02

Pos logit contributions
 pointing+0.048
Welcome+0.046
 Crab+0.046
't+0.045
 Cr+0.045
Neg logit contributions
ig-0.052
ches-0.051
ets-0.050
aj-0.050
 fort-0.049
Token stay
Token ID2652
Feature activation-5.19e-02

Pos logit contributions
ig+0.061
iper+0.057
ches+0.057
aj+0.056
 fort+0.055
Neg logit contributions
 going-0.057
 pointing-0.055
all-0.055
 returning-0.054
't-0.054
Token hanging
Token ID10938
Feature activation-1.33e-01

Pos logit contributions
 going+0.069
 pointing+0.066
 returning+0.065
all+0.064
't+0.063
Neg logit contributions
ig-0.103
iper-0.098
ches-0.098
aj-0.096
 fort-0.095
Token until
Token ID1566
Feature activation-4.09e-02

Pos logit contributions
 pointing+0.141
 returning+0.132
Welcome+0.129
 buying+0.128
 Crab+0.127
Neg logit contributions
ig-0.292
ches-0.282
 fort-0.278
aj-0.276
 web-0.269
Token Who
Token ID5338
Feature activation-5.38e-02

Pos logit contributions
't+0.038
 going+0.038
 pointing+0.037
all+0.037
 him+0.036
Neg logit contributions
ig-0.111
ches-0.109
aj-0.106
iper-0.106
 fort-0.104
Token do
Token ID466
Feature activation-9.94e-02

Pos logit contributions
 going+0.088
 him+0.081
't+0.081
 pointing+0.080
all+0.076
Neg logit contributions
ig-0.093
aj-0.091
 web-0.088
 fort-0.088
ches-0.086
Token you
Token ID345
Feature activation+4.77e-02

Pos logit contributions
 going+0.086
 pointing+0.080
't+0.078
all+0.077
 returning+0.076
Neg logit contributions
ig-0.244
iper-0.234
aj-0.233
ches-0.233
 fort-0.231
Token want
Token ID765
Feature activation-1.44e-01

Pos logit contributions
ig+0.102
iper+0.097
ches+0.096
aj+0.096
 fort+0.095
Neg logit contributions
 going-0.055
 pointing-0.053
all-0.052
 returning-0.052
't-0.051
Token to
Token ID284
Feature activation-2.84e-02

Pos logit contributions
 pointing+0.132
 going+0.130
all+0.127
 returning+0.127
 buying+0.119
Neg logit contributions
ig-0.330
ches-0.323
iper-0.320
aj-0.316
 fort-0.315
Token invite
Token ID14037
Feature activation-1.35e-01

Pos logit contributions
 pointing+0.046
Welcome+0.046
 returning+0.045
all+0.045
no+0.045
Neg logit contributions
 fort-0.039
ig-0.039
iper-0.038
ches-0.037
house-0.037
Token?"
Token ID1701
Feature activation-5.42e-02

Pos logit contributions
 going+0.175
 pointing+0.172
 returning+0.168
all+0.167
't+0.163
Neg logit contributions
ig-0.264
ches-0.251
iper-0.251
aj-0.250
 fort-0.245
Token
Token ID198
Feature activation-2.99e-04

Pos logit contributions
 going+0.042
 pointing+0.041
't+0.040
all+0.039
 returning+0.038
Neg logit contributions
ig-0.137
iper-0.131
ches-0.130
aj-0.130
 fort-0.129
Token
Token ID198
Feature activation+2.67e-03

Pos logit contributions
 going+0.000
't+0.000
 pointing+0.000
 him+0.000
all+0.000
Neg logit contributions
ig-0.001
iper-0.001
ches-0.001
aj-0.001
 fort-0.001
TokenAnna
Token ID31160
Feature activation+1.38e-01

Pos logit contributions
ig+0.004
iper+0.004
 fort+0.003
ches+0.003
auldron+0.003
Neg logit contributions
't-0.005
Welcome-0.005
all-0.005
 him-0.005
no-0.005
Token says
Token ID1139
Feature activation-4.63e-02

Pos logit contributions
ig+0.248
eh+0.238
iper+0.234
ox+0.228
 fort+0.226
Neg logit contributions
 going-0.211
 pointing-0.204
 him-0.199
't-0.194
all-0.193
Token down
Token ID866
Feature activation-1.19e-02

Pos logit contributions
 returning+0.161
all+0.160
 pointing+0.160
 going+0.159
 rep+0.157
Neg logit contributions
ig-0.276
ches-0.272
 fort-0.262
iper-0.257
ox-0.255
Token the
Token ID262
Feature activation+1.65e-01

Pos logit contributions
 pointing+0.017
all+0.017
 rep+0.016
 Crab+0.016
 Abby+0.016
Neg logit contributions
 fort-0.019
ches-0.018
house-0.018
ig-0.018
ney-0.018
Token street
Token ID4675
Feature activation-2.69e-02

Pos logit contributions
ig+0.270
iper+0.263
aj+0.252
ches+0.239
ox+0.238
Neg logit contributions
 going-0.302
't-0.294
 pointing-0.277
 buying-0.274
 returning-0.273
Token.
Token ID13
Feature activation-4.13e-02

Pos logit contributions
all+0.034
 Abby+0.033
 Crab+0.033
 Joey+0.033
Welcome+0.032
Neg logit contributions
 fort-0.050
ney-0.049
ig-0.049
house-0.048
ets-0.047
Token He
Token ID679
Feature activation+9.32e-02

Pos logit contributions
 going+0.049
 returning+0.047
all+0.047
 pointing+0.046
 buying+0.044
Neg logit contributions
ig-0.083
 fort-0.081
aj-0.080
ches-0.080
iper-0.079
Token wanted
Token ID2227
Feature activation-1.32e-01

Pos logit contributions
ig+0.215
iper+0.206
aj+0.204
 fort+0.203
ets+0.203
Neg logit contributions
 pointing-0.104
 going-0.104
 him-0.092
't-0.091
all-0.091
Token to
Token ID284
Feature activation-2.38e-02

Pos logit contributions
all+0.129
 pointing+0.125
 rep+0.124
 returning+0.124
 going+0.123
Neg logit contributions
ig-0.285
ches-0.279
 fort-0.270
iper-0.268
ox-0.266
Token play
Token ID711
Feature activation-1.12e-01

Pos logit contributions
otion+0.041
all+0.040
 pointing+0.040
 Crab+0.039
 rep+0.039
Neg logit contributions
 belt-0.030
 fort-0.030
house-0.030
ig-0.030
 ramp-0.029
Token with
Token ID351
Feature activation-5.24e-02

Pos logit contributions
all+0.127
 Abby+0.124
 going+0.123
 pointing+0.122
 returning+0.122
Neg logit contributions
ig-0.228
 fort-0.219
house-0.219
ches-0.219
 web-0.207
Token it
Token ID340
Feature activation+6.81e-02

Pos logit contributions
all+0.041
 going+0.040
 returning+0.039
 pointing+0.039
 rep+0.039
Neg logit contributions
ig-0.127
ches-0.124
 fort-0.119
iper-0.118
ox-0.117
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
ig+0.135
iper+0.127
ches+0.127
aj+0.126
 fort+0.125
Neg logit contributions
 going-0.089
 pointing-0.087
all-0.085
 returning-0.085
't-0.084
Token with
Token ID351
Feature activation-3.64e-02

Pos logit contributions
all+0.021
 Abby+0.020
 Joey+0.020
 Lib+0.020
Welcome+0.020
Neg logit contributions
ig-0.025
 web-0.024
ches-0.023
 fort-0.023
ox-0.022
Token many
Token ID867
Feature activation+1.75e-02

Pos logit contributions
all+0.061
 rep+0.057
 going+0.057
 buying+0.057
 returning+0.056
Neg logit contributions
 web-0.055
ig-0.053
ox-0.052
ches-0.052
iper-0.050
Token feathers
Token ID27400
Feature activation-3.02e-02

Pos logit contributions
ig+0.025
iper+0.023
ches+0.023
aj+0.023
 fort+0.022
Neg logit contributions
 going-0.033
 pointing-0.032
all-0.032
 returning-0.032
't-0.031
Token.
Token ID13
Feature activation-3.77e-02

Pos logit contributions
Welcome+0.050
 Abby+0.048
 Crab+0.047
 Cr+0.046
 Joey+0.046
Neg logit contributions
ig-0.051
ches-0.051
ows-0.050
ets-0.049
 web-0.046
Token Tom
Token ID4186
Feature activation+1.25e-01

Pos logit contributions
 going+0.055
 returning+0.054
all+0.053
 pointing+0.052
 giving+0.050
Neg logit contributions
aj-0.062
ig-0.059
ables-0.057
ches-0.056
 fort-0.056
Token wants
Token ID3382
Feature activation-1.27e-01

Pos logit contributions
ig+0.219
iper+0.218
aj+0.213
 fort+0.209
eh+0.198
Neg logit contributions
 going-0.196
 pointing-0.193
 him-0.187
't-0.183
all-0.180
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
 going+0.122
all+0.121
 returning+0.120
 pointing+0.119
 rep+0.114
Neg logit contributions
ig-0.278
ches-0.276
iper-0.268
ox-0.267
 fort-0.265
Token catch
Token ID4929
Feature activation-1.07e-01

Pos logit contributions
 going+0.031
 pointing+0.031
 returning+0.031
all+0.031
 buying+0.031
Neg logit contributions
ig-0.036
ches-0.036
 web-0.035
ox-0.034
 fort-0.034
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
 returning+0.119
 rep+0.118
 buying+0.118
 going+0.116
all+0.112
Neg logit contributions
ches-0.218
ig-0.211
c-0.209
 web-0.206
ns-0.200
Token bird
Token ID6512
Feature activation+5.57e-02

Pos logit contributions
ig+0.250
iper+0.234
ches+0.234
aj+0.231
 fort+0.229
Neg logit contributions
 going-0.240
 pointing-0.234
all-0.233
 returning-0.231
't-0.227
Token and
Token ID290
Feature activation+2.94e-02

Pos logit contributions
ig+0.115
ches+0.109
iper+0.107
 fort+0.106
aj+0.106
Neg logit contributions
 going-0.068
 pointing-0.067
 returning-0.066
all-0.065
't-0.064
Token a
Token ID257
Feature activation+7.94e-02

Pos logit contributions
 returning+0.104
 going+0.104
 pointing+0.103
all+0.101
 buying+0.101
Neg logit contributions
ig-0.191
ches-0.186
iper-0.183
 fort-0.182
aj-0.181
Token funny
Token ID8258
Feature activation-7.36e-03

Pos logit contributions
iper+0.157
ig+0.157
aj+0.152
ches+0.149
ets+0.143
Neg logit contributions
 going-0.104
 pointing-0.101
't-0.098
 returning-0.096
 him-0.093
Token sound
Token ID2128
Feature activation-6.83e-02

Pos logit contributions
 going+0.012
all+0.011
 pointing+0.011
 returning+0.011
't+0.011
Neg logit contributions
ig-0.012
ches-0.012
iper-0.011
 fort-0.011
aj-0.011
Token.
Token ID13
Feature activation-3.85e-02

Pos logit contributions
 Abby+0.107
 Lib+0.104
 Joey+0.103
 Crab+0.101
Welcome+0.099
Neg logit contributions
ches-0.117
ig-0.115
ets-0.108
 web-0.108
 fort-0.106
Token He
Token ID679
Feature activation+8.59e-02

Pos logit contributions
 going+0.048
 pointing+0.045
 returning+0.045
all+0.044
 buying+0.043
Neg logit contributions
ig-0.078
aj-0.076
 fort-0.074
ches-0.074
iper-0.074
Token tried
Token ID3088
Feature activation-1.48e-01

Pos logit contributions
ig+0.173
iper+0.172
aj+0.166
 fort+0.166
uffy+0.157
Neg logit contributions
 going-0.115
 pointing-0.115
 him-0.112
 it-0.110
't-0.106
Token again
Token ID757
Feature activation-5.03e-02

Pos logit contributions
all+0.152
 going+0.152
 returning+0.151
Welcome+0.151
 Joey+0.147
Neg logit contributions
ig-0.311
ches-0.306
aj-0.296
 web-0.294
 fort-0.291
Token and
Token ID290
Feature activation+2.26e-02

Pos logit contributions
 going+0.069
 pointing+0.067
 returning+0.067
 buying+0.066
all+0.066
Neg logit contributions
ig-0.095
ches-0.090
 web-0.087
 fort-0.087
ox-0.087
Token again
Token ID757
Feature activation-4.53e-02

Pos logit contributions
ig+0.047
iper+0.046
aj+0.046
ches+0.044
 fort+0.044
Neg logit contributions
 going-0.029
 pointing-0.028
 him-0.027
 returning-0.026
all-0.026
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
 going+0.053
all+0.051
 returning+0.051
 pointing+0.050
 buying+0.050
Neg logit contributions
ig-0.095
ches-0.090
 fort-0.087
 web-0.087
aj-0.087
Token but
Token ID475
Feature activation-8.51e-03

Pos logit contributions
 going+0.036
 pointing+0.035
 him+0.034
 returning+0.034
't+0.033
Neg logit contributions
ig-0.044
iper-0.043
aj-0.043
ches-0.042
 fort-0.041
Token scarf
Token ID36953
Feature activation-2.63e-02

Pos logit contributions
ig+0.264
ches+0.251
iper+0.251
aj+0.248
ox+0.235
Neg logit contributions
 going-0.252
 pointing-0.243
 returning-0.236
 him-0.235
 buying-0.228
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
 pointing+0.026
 going+0.026
all+0.026
 returning+0.026
 buying+0.025
Neg logit contributions
ig-0.059
ches-0.057
 fort-0.056
iper-0.055
aj-0.055
Token He
Token ID679
Feature activation+8.57e-02

Pos logit contributions
 going+0.059
 returning+0.056
all+0.054
 pointing+0.054
 buying+0.053
Neg logit contributions
aj-0.067
ig-0.066
 fort-0.065
ox-0.063
ches-0.062
Token tries
Token ID8404
Feature activation-1.51e-01

Pos logit contributions
ig+0.148
iper+0.144
ney+0.135
ox+0.133
uffy+0.132
Neg logit contributions
 going-0.141
 him-0.133
 pointing-0.130
't-0.124
 it-0.122
Token to
Token ID284
Feature activation-2.66e-02

Pos logit contributions
 going+0.149
 returning+0.143
all+0.142
 pointing+0.141
Welcome+0.138
Neg logit contributions
ig-0.336
ches-0.327
aj-0.320
 fort-0.317
ox-0.317
Token pull
Token ID2834
Feature activation-1.45e-01

Pos logit contributions
 going+0.039
 pointing+0.038
all+0.038
 returning+0.038
 buying+0.037
Neg logit contributions
ig-0.047
ches-0.046
 web-0.044
iper-0.044
 fort-0.044
Token them
Token ID606
Feature activation+8.35e-02

Pos logit contributions
 going+0.122
 pointing+0.116
 returning+0.115
all+0.114
 buying+0.111
Neg logit contributions
ig-0.355
ches-0.343
iper-0.333
 fort-0.331
ox-0.327
Token off
Token ID572
Feature activation-5.89e-02

Pos logit contributions
ig+0.137
ches+0.133
 fort+0.127
ows+0.120
ox+0.118
Neg logit contributions
 returning-0.137
 buying-0.137
 rep-0.136
 pointing-0.135
Welcome-0.133
Token with
Token ID351
Feature activation-5.24e-02

Pos logit contributions
 going+0.063
 pointing+0.060
all+0.059
 returning+0.059
't+0.057
Neg logit contributions
ig-0.130
ches-0.125
iper-0.123
 fort-0.122
aj-0.122
Token his
Token ID465
Feature activation+1.54e-01

Pos logit contributions
all+0.051
 going+0.051
 pointing+0.049
 returning+0.048
't+0.047
Neg logit contributions
ig-0.118
iper-0.114
ches-0.113
 fort-0.113
 web-0.111
Token mouth
Token ID5422
Feature activation-3.15e-02

Pos logit contributions
ig+0.201
iper+0.199
ches+0.197
ets+0.189
aj+0.185
Neg logit contributions
 going-0.292
 pointing-0.285
 him-0.284
't-0.274
 returning-0.270