TOP ACTIVATIONS
MAX = 0.201

and play with your or nam ents . Maybe you can
time playing with her or nam ents and making her special
thought they looked like ac rob ats . She enjoyed the
the animals , the ac rob ats and the clown s
saw clown s and ac rob ats and lions and elephants
clown s juggling and ac rob ats doing flips . But
clown s and the ac rob ats and the lions and
to help sort the or nam ents . She picked up
of fun watching the ac rob ats and clown s .
the rest of the or nam ents onto the tree .
a box of shiny or nam ents that spark led in
he especially liked the or nam ents he kept around .
yellow , or white . Which one do you want ?"
and a red car . Which one do you want ?"
tree with lights and or nam ents . They also saw
never played with the or nam ents again . They still
and chocolate and nuts . Which one do you want ,
to look at the or nam ents on the tree .
who liked to collect or nam ents . Everywhere she went
friend , Sam , " Which writing place should I go

MIN ACTIVATIONS
MIN = -0.119

skipping and talking together until dusk . Todd said goodbye
and car away and c umin and cor i ander and
proud of herself for staying ahead for so long , so
Tim my waited patiently until noon , and then he went
a cos y sleep until spring time . <|endoftext|> One day
burst of speed and raced ahead of Luke .
she would have to wait ages until he got inside the
to many more happy days ahead . <|endoftext|> Once upon a
was very excited and ran ahead . When he arrived ,
and the monkeys hol le red from the trees . One
see what other adventures lay ahead . <|endoftext|> Once upon a
was so excited he ran ahead and his parents could hardly
snacks for the long journey ahead . And all the way
Maybe someone will come and free you , but I have
* The End * <|endoftext|> Once upon a time
tail pleaded to be set free , but the wolves would
the young girl to go ahead . The old
Jack asked Sandy hes it antly , " How do
still see the boys running ahead . Suddenly , one of
happiest when he was running free with the wind . What

INTERVAL 0.121 - 0.201
CONTAINS 0.005%

They saw the wood cut ter coming to the woods
and hopped away . The je ep watched it go .
box full of beautiful or nam ents .
time playing with her or nam ents and making her special
of fun watching the ac rob ats and clown s .

INTERVAL 0.041 - 0.121
CONTAINS 4.070%

and her patch on her knee . Her mom was proud
to splash ! He felt so content and happy playing in
likes to play with his drum . He has a big
orig ami figure . But no matter how hard she tried
know . " No , what do they eat ?" he

INTERVAL -0.039 - 0.041
CONTAINS 92.830%

find a place to cover herself from the rain . She
the animals for being so patient and left the stable .
, but she was a little scared . She asked her
p aj amas and went to bed . In
. We are going to see the elephants next . They

INTERVAL -0.119 - -0.039
CONTAINS 3.094%

came in and it was time for dinner . <|endoftext|> Once
bye , small girl !" ch ir ped the mouse ,
little bird to go back home . The wind
the log and made it safely to the other side .
made the sky feel so cheerful . From that day on
Token and
Token ID290
Feature activation+3.27e-02

Pos logit contributions
 each+0.017
 me+0.015
 to+0.015
 be+0.011
 ho+0.010
Neg logit contributions
bridge-0.056
pron-0.049
ined-0.047
iche-0.047
aster-0.047
Token play
Token ID711
Feature activation+3.34e-02

Pos logit contributions
bridge+0.083
pron+0.069
aster+0.068
iche+0.067
ined+0.066
Neg logit contributions
 to-0.045
 each-0.044
 me-0.039
 for-0.034
 be-0.033
Token with
Token ID351
Feature activation+2.08e-02

Pos logit contributions
bridge+0.102
pron+0.089
iche+0.088
ined+0.087
aster+0.085
Neg logit contributions
 to-0.036
 each-0.033
 me-0.029
 for-0.025
 be-0.023
Token your
Token ID534
Feature activation+3.96e-02

Pos logit contributions
bridge+0.056
pron+0.047
ined+0.047
aster+0.047
iche+0.046
Neg logit contributions
 to-0.027
 each-0.025
 me-0.022
 be-0.021
 for-0.020
Token or
Token ID393
Feature activation+7.23e-02

Pos logit contributions
bridge+0.105
pron+0.090
ined+0.086
iche+0.086
aster+0.083
Neg logit contributions
 to-0.054
 each-0.051
 me-0.045
 for-0.040
 be-0.040
Tokennam
Token ID7402
Feature activation+1.67e-01

Pos logit contributions
bridge+0.184
pron+0.160
aster+0.159
iche+0.156
atmeal+0.150
Neg logit contributions
 to-0.088
 each-0.085
 me-0.075
,-0.066
 be-0.066
Tokenents
Token ID658
Feature activation+2.60e-02

Pos logit contributions
bridge+0.437
pron+0.367
iche+0.363
ined+0.362
aster+0.346
Neg logit contributions
 to-0.235
 each-0.220
 me-0.208
 for-0.188
 be-0.185
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
bridge+0.081
pron+0.069
ined+0.069
iche+0.068
aster+0.068
Neg logit contributions
 to-0.022
 each-0.021
 me-0.020
 be-0.016
 for-0.013
Token Maybe
Token ID6674
Feature activation+1.56e-03

Pos logit contributions
bridge+0.053
ined+0.048
pron+0.047
atmeal+0.045
iche+0.044
Neg logit contributions
 to-0.026
 each-0.025
 me-0.023
 be-0.020
,-0.020
Token you
Token ID345
Feature activation-1.76e-03

Pos logit contributions
bridge+0.004
pron+0.004
iche+0.003
ined+0.003
aster+0.003
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.002
 for-0.002
Token can
Token ID460
Feature activation-1.31e-02

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 for+0.001
 be+0.001
Neg logit contributions
bridge-0.005
pron-0.004
iche-0.004
ined-0.004
aster-0.004
Token time
Token ID640
Feature activation+2.11e-03

Pos logit contributions
bridge+0.032
pron+0.027
ined+0.026
iche+0.026
aster+0.025
Neg logit contributions
 to-0.022
 each-0.021
 me-0.019
 be-0.017
 for-0.017
Token playing
Token ID2712
Feature activation+2.86e-02

Pos logit contributions
bridge+0.006
pron+0.005
ined+0.005
iche+0.005
aster+0.005
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.002
 for-0.002
Token with
Token ID351
Feature activation+1.79e-02

Pos logit contributions
bridge+0.084
pron+0.071
ined+0.070
iche+0.069
aster+0.069
Neg logit contributions
 to-0.031
 each-0.029
 me-0.026
 be-0.023
 for-0.021
Token her
Token ID607
Feature activation+4.49e-03

Pos logit contributions
bridge+0.055
pron+0.047
ined+0.046
iche+0.046
aster+0.045
Neg logit contributions
 to-0.017
 each-0.016
 me-0.014
 be-0.012
 for-0.012
Token or
Token ID393
Feature activation+5.29e-02

Pos logit contributions
bridge+0.012
pron+0.010
ined+0.010
iche+0.010
aster+0.010
Neg logit contributions
 to-0.006
 each-0.006
 me-0.005
 be-0.005
 for-0.005
Tokennam
Token ID7402
Feature activation+1.62e-01

Pos logit contributions
bridge+0.139
pron+0.119
ined+0.116
iche+0.115
aster+0.113
Neg logit contributions
 to-0.072
 each-0.069
 me-0.062
 be-0.055
 for-0.054
Tokenents
Token ID658
Feature activation+2.17e-02

Pos logit contributions
bridge+0.425
pron+0.359
ined+0.353
iche+0.351
aster+0.335
Neg logit contributions
 to-0.224
 each-0.210
 me-0.198
 for-0.180
 be-0.176
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
bridge+0.069
pron+0.059
ined+0.059
iche+0.058
aster+0.057
Neg logit contributions
 to-0.019
 each-0.017
 me-0.016
 be-0.012
 for-0.011
Token making
Token ID1642
Feature activation+1.93e-02

Pos logit contributions
bridge+0.049
pron+0.042
ined+0.040
hes+0.038
aster+0.037
Neg logit contributions
 to-0.029
 each-0.028
 me-0.027
 be-0.023
 for-0.023
Token her
Token ID607
Feature activation-1.43e-03

Pos logit contributions
bridge+0.052
pron+0.042
ined+0.042
iche+0.042
hes+0.041
Neg logit contributions
 to-0.024
 each-0.022
 me-0.020
 be-0.020
 ho-0.019
Token special
Token ID2041
Feature activation+9.14e-03

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 for+0.002
 be+0.002
Neg logit contributions
bridge-0.004
pron-0.003
iche-0.003
ined-0.003
aster-0.003
Token thought
Token ID1807
Feature activation+2.93e-02

Pos logit contributions
bridge+0.019
pron+0.017
ined+0.016
hes+0.016
atmeal+0.015
Neg logit contributions
 to-0.011
 each-0.010
 for-0.010
 me-0.009
 home-0.009
Token they
Token ID484
Feature activation+1.01e-02

Pos logit contributions
bridge+0.078
pron+0.067
aster+0.065
ined+0.064
hes+0.063
Neg logit contributions
 to-0.035
 each-0.035
 me-0.033
 be-0.029
 home-0.026
Token looked
Token ID3114
Feature activation+3.87e-02

Pos logit contributions
bridge+0.027
ined+0.022
pron+0.022
hes+0.021
iche+0.021
Neg logit contributions
 to-0.014
 me-0.012
 each-0.012
 for-0.011
 home-0.011
Token like
Token ID588
Feature activation-1.50e-02

Pos logit contributions
bridge+0.103
pron+0.087
ined+0.086
iche+0.085
hes+0.081
Neg logit contributions
 to-0.050
 each-0.049
 me-0.043
 be-0.040
 for-0.036
Token ac
Token ID936
Feature activation+4.14e-02

Pos logit contributions
 to+0.017
 each+0.017
 me+0.017
 be+0.015
's+0.014
Neg logit contributions
bridge-0.040
pron-0.033
hes-0.031
aster-0.030
ined-0.030
Tokenrob
Token ID22609
Feature activation+1.58e-01

Pos logit contributions
bridge+0.043
pron+0.024
iche+0.023
ined+0.022
aster+0.021
Neg logit contributions
 to-0.127
 each-0.125
 me-0.119
 for-0.113
,-0.112
Tokenats
Token ID1381
Feature activation-2.82e-03

Pos logit contributions
bridge+0.314
pron+0.245
iche+0.233
ined+0.229
aster+0.227
Neg logit contributions
 to-0.335
 each-0.328
 me-0.304
 for-0.285
,-0.280
Token.
Token ID13
Feature activation+3.38e-03

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 be+0.001
 home+0.001
Neg logit contributions
bridge-0.009
pron-0.008
ined-0.007
iche-0.007
hes-0.007
Token She
Token ID1375
Feature activation+5.46e-03

Pos logit contributions
bridge+0.006
iche+0.005
pron+0.005
hes+0.005
Here+0.005
Neg logit contributions
 each-0.006
 to-0.006
 ho-0.006
 me-0.005
 be-0.005
Token enjoyed
Token ID8359
Feature activation+2.36e-02

Pos logit contributions
bridge+0.013
pron+0.011
ined+0.010
hes+0.010
aster+0.010
Neg logit contributions
 to-0.008
 each-0.007
 for-0.007
 home-0.007
 me-0.007
Token the
Token ID262
Feature activation+2.35e-02

Pos logit contributions
bridge+0.055
pron+0.047
ined+0.045
aster+0.045
iche+0.044
Neg logit contributions
 to-0.037
 each-0.035
 me-0.033
's-0.030
 be-0.030
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
bridge+0.090
pron+0.073
hes+0.073
ined+0.073
iche+0.072
Neg logit contributions
 to-0.038
 each-0.034
 me-0.032
 be-0.030
 for-0.027
Token animals
Token ID4695
Feature activation+2.75e-02

Pos logit contributions
bridge+0.060
pron+0.051
ined+0.050
iche+0.050
aster+0.049
Neg logit contributions
 to-0.033
 each-0.032
 me-0.029
 for-0.026
 be-0.025
Token,
Token ID11
Feature activation+1.69e-02

Pos logit contributions
bridge+0.095
pron+0.083
ined+0.082
iche+0.082
aster+0.081
Neg logit contributions
 to-0.018
 each-0.015
 me-0.012
 be-0.008
 for-0.008
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
bridge+0.051
ined+0.044
pron+0.044
iche+0.043
atmeal+0.041
Neg logit contributions
 to-0.021
 each-0.020
 me-0.016
 be-0.015
 for-0.015
Token ac
Token ID936
Feature activation+5.43e-02

Pos logit contributions
bridge+0.057
pron+0.047
iche+0.046
ined+0.046
aster+0.045
Neg logit contributions
 to-0.032
 each-0.030
 me-0.027
 be-0.024
 for-0.024
Tokenrob
Token ID22609
Feature activation+1.56e-01

Pos logit contributions
bridge+0.068
pron+0.042
iche+0.041
aster+0.038
 narrative+0.036
Neg logit contributions
 to-0.155
 each-0.150
 me-0.147
,-0.140
 for-0.137
Tokenats
Token ID1381
Feature activation+5.49e-03

Pos logit contributions
bridge+0.346
pron+0.278
iche+0.259
ined+0.252
aster+0.250
Neg logit contributions
 to-0.298
 each-0.295
 me-0.266
 for-0.251
,-0.251
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
bridge+0.018
ined+0.016
pron+0.016
iche+0.016
aster+0.015
Neg logit contributions
 to-0.003
 each-0.003
 me-0.003
 be-0.002
 for-0.001
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
bridge+0.038
aster+0.032
pron+0.031
iche+0.030
hes+0.030
Neg logit contributions
 to-0.015
 me-0.014
 each-0.014
's-0.011
 for-0.011
Token clown
Token ID25573
Feature activation+3.14e-02

Pos logit contributions
bridge+0.040
pron+0.032
iche+0.032
ined+0.031
aster+0.030
Neg logit contributions
 to-0.026
 each-0.025
 me-0.022
 for-0.020
 be-0.020
Tokens
Token ID82
Feature activation-2.34e-02

Pos logit contributions
bridge+0.099
pron+0.085
ined+0.085
iche+0.085
aster+0.083
Neg logit contributions
 to-0.026
 each-0.023
 me-0.022
 be-0.016
 for-0.015
Token saw
Token ID2497
Feature activation+5.02e-02

Pos logit contributions
bridge+0.088
pron+0.074
ined+0.074
iche+0.071
hes+0.071
Neg logit contributions
 to-0.041
 each-0.038
 me-0.034
 for-0.031
 home-0.030
Token clown
Token ID25573
Feature activation+2.78e-02

Pos logit contributions
bridge+0.144
pron+0.119
ined+0.119
iche+0.118
aster+0.117
Neg logit contributions
 to-0.060
 each-0.056
 me-0.051
 be-0.045
 for-0.043
Tokens
Token ID82
Feature activation-3.24e-02

Pos logit contributions
bridge+0.081
iche+0.069
ined+0.068
pron+0.067
atmeal+0.066
Neg logit contributions
 to-0.025
 each-0.024
 me-0.022
 be-0.019
 home-0.017
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 to+0.023
 each+0.021
 me+0.018
 be+0.013
 for+0.011
Neg logit contributions
bridge-0.106
ined-0.092
pron-0.092
iche-0.090
aster-0.089
Token ac
Token ID936
Feature activation+4.11e-02

Pos logit contributions
bridge+0.052
pron+0.043
iche+0.042
ined+0.042
aster+0.041
Neg logit contributions
 to-0.030
 each-0.028
 me-0.026
 for-0.023
 be-0.023
Tokenrob
Token ID22609
Feature activation+1.48e-01

Pos logit contributions
bridge+0.038
ined+0.020
pron+0.020
iche+0.018
aster+0.016
Neg logit contributions
 to-0.132
 each-0.127
 me-0.123
 be-0.118
 for-0.117
Tokenats
Token ID1381
Feature activation-4.47e-03

Pos logit contributions
bridge+0.285
pron+0.223
ined+0.217
iche+0.216
aster+0.208
Neg logit contributions
 to-0.322
 each-0.310
 me-0.292
 for-0.271
 be-0.270
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
 to+0.003
 each+0.003
 me+0.003
 be+0.002
 for+0.002
Neg logit contributions
bridge-0.015
ined-0.012
pron-0.012
iche-0.012
hes-0.012
Token lions
Token ID31420
Feature activation+3.78e-02

Pos logit contributions
bridge+0.050
pron+0.041
aster+0.040
iche+0.040
ined+0.039
Neg logit contributions
 to-0.030
 each-0.029
 me-0.027
 for-0.024
 be-0.023
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
bridge+0.114
ined+0.094
iche+0.093
hes+0.092
aster+0.092
Neg logit contributions
 to-0.034
 each-0.031
 me-0.030
 be-0.023
 home-0.021
Token elephants
Token ID25368
Feature activation+2.91e-02

Pos logit contributions
bridge+0.025
pron+0.020
iche+0.020
ined+0.020
aster+0.020
Neg logit contributions
 to-0.017
 each-0.016
 me-0.015
 for-0.014
 be-0.013
Token clown
Token ID25573
Feature activation+8.05e-03

Pos logit contributions
bridge+0.071
pron+0.058
ined+0.058
hes+0.057
aster+0.057
Neg logit contributions
 to-0.027
 each-0.024
 me-0.023
 be-0.021
 for-0.020
Tokens
Token ID82
Feature activation-4.59e-02

Pos logit contributions
bridge+0.023
ined+0.020
pron+0.020
iche+0.020
atmeal+0.019
Neg logit contributions
 to-0.007
 each-0.007
 me-0.006
 be-0.005
 for-0.005
Token juggling
Token ID48097
Feature activation+3.91e-02

Pos logit contributions
 to+0.037
 each+0.034
 me+0.030
 be+0.022
 for+0.022
Neg logit contributions
bridge-0.141
ined-0.123
pron-0.122
iche-0.120
aster-0.116
Token and
Token ID290
Feature activation-4.26e-03

Pos logit contributions
bridge+0.103
aster+0.078
ined+0.077
hes+0.077
pron+0.077
Neg logit contributions
 to-0.043
 each-0.041
 me-0.039
 be-0.036
 home-0.035
Token ac
Token ID936
Feature activation+2.19e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.006
 for+0.006
 be+0.006
Neg logit contributions
bridge-0.010
pron-0.008
iche-0.008
ined-0.008
aster-0.008
Tokenrob
Token ID22609
Feature activation+1.48e-01

Pos logit contributions
bridge+0.018
pron+0.009
ined+0.009
iche+0.008
aster+0.007
Neg logit contributions
 to-0.072
 each-0.070
 me-0.067
 be-0.065
 for-0.064
Tokenats
Token ID1381
Feature activation-2.84e-02

Pos logit contributions
bridge+0.293
pron+0.232
iche+0.223
ined+0.219
aster+0.217
Neg logit contributions
 to-0.310
 each-0.301
 me-0.280
 for-0.261
 be-0.258
Token doing
Token ID1804
Feature activation-1.35e-03

Pos logit contributions
 to+0.024
 each+0.023
 me+0.021
 be+0.017
 for+0.015
Neg logit contributions
bridge-0.088
ined-0.075
pron-0.075
iche-0.074
hes-0.072
Token flips
Token ID45971
Feature activation-4.77e-02

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 be+0.002
 for+0.002
Neg logit contributions
bridge-0.003
pron-0.003
ined-0.003
iche-0.003
aster-0.003
Token.
Token ID13
Feature activation-6.30e-03

Pos logit contributions
 each+0.045
 to+0.044
 me+0.042
 be+0.040
's+0.032
Neg logit contributions
bridge-0.136
ined-0.113
pron-0.108
iche-0.107
hes-0.107
Token But
Token ID887
Feature activation-4.02e-03

Pos logit contributions
 to+0.009
 each+0.009
 me+0.008
 be+0.007
 for+0.007
Neg logit contributions
bridge-0.017
pron-0.014
ined-0.014
iche-0.014
aster-0.014
Token clown
Token ID25573
Feature activation+1.79e-02

Pos logit contributions
bridge+0.016
ined+0.014
iche+0.014
pron+0.014
aster+0.013
Neg logit contributions
 to-0.011
 each-0.010
 me-0.010
 for-0.009
 be-0.008
Tokens
Token ID82
Feature activation-4.41e-02

Pos logit contributions
bridge+0.055
pron+0.047
ined+0.047
iche+0.047
aster+0.046
Neg logit contributions
 to-0.017
 each-0.016
 me-0.014
 be-0.011
 for-0.010
Token and
Token ID290
Feature activation+3.51e-03

Pos logit contributions
 to+0.040
 each+0.036
 me+0.032
 be+0.025
 for+0.024
Neg logit contributions
bridge-0.140
pron-0.121
ined-0.120
iche-0.119
aster-0.117
Token the
Token ID262
Feature activation+4.81e-03

Pos logit contributions
bridge+0.010
pron+0.008
aster+0.008
ined+0.008
hes+0.008
Neg logit contributions
 to-0.004
 each-0.004
 me-0.004
 for-0.003
 home-0.003
Token ac
Token ID936
Feature activation+3.69e-02

Pos logit contributions
bridge+0.012
pron+0.010
ined+0.009
iche+0.009
aster+0.009
Neg logit contributions
 to-0.008
 each-0.008
 me-0.007
 be-0.006
 for-0.006
Tokenrob
Token ID22609
Feature activation+1.47e-01

Pos logit contributions
bridge+0.043
iche+0.029
pron+0.028
aster+0.025
obb+0.024
Neg logit contributions
 to-0.104
 each-0.102
 me-0.099
,-0.094
 for-0.092
Tokenats
Token ID1381
Feature activation-1.34e-02

Pos logit contributions
bridge+0.317
pron+0.259
iche+0.242
aster+0.235
ined+0.233
Neg logit contributions
 to-0.284
 each-0.280
 me-0.255
 for-0.240
,-0.237
Token and
Token ID290
Feature activation+3.28e-03

Pos logit contributions
 to+0.010
 each+0.010
 me+0.008
 be+0.006
 for+0.006
Neg logit contributions
bridge-0.044
pron-0.038
ined-0.038
iche-0.037
aster-0.037
Token the
Token ID262
Feature activation-3.03e-04

Pos logit contributions
bridge+0.009
aster+0.008
pron+0.008
hes+0.008
iche+0.008
Neg logit contributions
 to-0.003
 each-0.003
 me-0.003
 for-0.002
's-0.002
Token lions
Token ID31420
Feature activation+3.10e-02

Pos logit contributions
 to+0.001
 each+0.000
 me+0.000
 for+0.000
 be+0.000
Neg logit contributions
bridge-0.001
pron-0.001
iche-0.001
ined-0.001
aster-0.001
Token and
Token ID290
Feature activation-6.51e-03

Pos logit contributions
bridge+0.098
ined+0.082
pron+0.082
iche+0.082
aster+0.081
Neg logit contributions
 to-0.028
 each-0.025
 me-0.023
 be-0.017
 for-0.016
Token to
Token ID284
Feature activation+2.80e-03

Pos logit contributions
bridge+0.001
pron+0.001
aster+0.001
ined+0.001
iche+0.001
Neg logit contributions
 each-0.000
 me-0.000
 to-0.000
 ho-0.000
 heads-0.000
Token help
Token ID1037
Feature activation+2.01e-03

Pos logit contributions
bridge+0.008
pron+0.007
ined+0.007
iche+0.007
hes+0.007
Neg logit contributions
 to-0.004
 each-0.003
 be-0.003
 me-0.003
,-0.003
Token sort
Token ID3297
Feature activation+1.03e-02

Pos logit contributions
bridge+0.006
aster+0.005
pron+0.005
iche+0.005
ined+0.005
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.001
 ho-0.001
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
bridge+0.027
ined+0.023
aster+0.022
hes+0.022
pron+0.022
Neg logit contributions
 to-0.010
 me-0.010
 be-0.009
 each-0.009
 ho-0.009
Token or
Token ID393
Feature activation+4.30e-02

Pos logit contributions
bridge+0.051
pron+0.044
iche+0.041
ined+0.041
aster+0.039
Neg logit contributions
 each-0.033
 to-0.033
 me-0.029
 for-0.026
 be-0.026
Tokennam
Token ID7402
Feature activation+1.45e-01

Pos logit contributions
bridge+0.095
pron+0.080
iche+0.076
aster+0.076
ined+0.076
Neg logit contributions
 to-0.072
 each-0.069
 me-0.066
 for-0.059
 be-0.059
Tokenents
Token ID658
Feature activation+9.06e-03

Pos logit contributions
bridge+0.373
pron+0.312
iche+0.309
ined+0.307
hes+0.293
Neg logit contributions
 to-0.202
 each-0.188
 me-0.183
 for-0.167
 be-0.161
Token.
Token ID13
Feature activation+1.89e-03

Pos logit contributions
bridge+0.029
ined+0.024
pron+0.024
aster+0.024
iche+0.024
Neg logit contributions
 to-0.007
 each-0.007
 me-0.007
 be-0.005
 ho-0.004
Token She
Token ID1375
Feature activation-4.00e-03

Pos logit contributions
bridge+0.004
pron+0.003
iche+0.003
aster+0.003
ined+0.003
Neg logit contributions
 to-0.003
 each-0.003
 me-0.003
 be-0.003
 for-0.003
Token picked
Token ID6497
Feature activation+2.85e-02

Pos logit contributions
 to+0.006
 each+0.005
 me+0.005
 for+0.005
 home+0.005
Neg logit contributions
bridge-0.009
pron-0.008
ined-0.008
hes-0.007
obb-0.007
Token up
Token ID510
Feature activation+3.88e-02

Pos logit contributions
bridge+0.079
pron+0.065
ined+0.065
iche+0.065
hes+0.063
Neg logit contributions
 to-0.035
 each-0.032
 me-0.030
 be-0.028
 for-0.025
Token of
Token ID286
Feature activation+3.26e-02

Pos logit contributions
 to+0.043
 each+0.041
 me+0.038
 be+0.034
 for+0.033
Neg logit contributions
bridge-0.071
pron-0.059
ined-0.059
iche-0.058
aster-0.057
Token fun
Token ID1257
Feature activation-1.53e-02

Pos logit contributions
bridge+0.093
pron+0.079
ined+0.078
iche+0.078
aster+0.076
Neg logit contributions
 to-0.040
 each-0.038
 me-0.034
 be-0.029
 for-0.029
Token watching
Token ID4964
Feature activation-2.97e-03

Pos logit contributions
 to+0.013
 each+0.011
 me+0.009
 for+0.007
 be+0.007
Neg logit contributions
bridge-0.051
pron-0.045
iche-0.044
ined-0.043
aster-0.043
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
 to+0.004
 each+0.004
 me+0.004
 be+0.004
 for+0.003
Neg logit contributions
bridge-0.007
iche-0.006
aster-0.006
pron-0.006
ined-0.006
Token ac
Token ID936
Feature activation+2.24e-02

Pos logit contributions
 to+0.031
 each+0.029
 me+0.027
 be+0.025
 for+0.025
Neg logit contributions
bridge-0.028
pron-0.022
iche-0.021
ined-0.021
aster-0.020
Tokenrob
Token ID22609
Feature activation+1.45e-01

Pos logit contributions
bridge+0.026
pron+0.017
iche+0.016
 narrative+0.015
ined+0.014
Neg logit contributions
 to-0.064
 each-0.062
 me-0.061
 for-0.058
,-0.057
Tokenats
Token ID1381
Feature activation-2.89e-02

Pos logit contributions
bridge+0.298
pron+0.237
ined+0.224
iche+0.224
aster+0.215
Neg logit contributions
 to-0.295
 each-0.287
 me-0.266
 for-0.251
,-0.244
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
 to+0.028
 each+0.026
 me+0.023
 be+0.019
 for+0.018
Neg logit contributions
bridge-0.088
pron-0.075
ined-0.074
iche-0.074
aster-0.073
Token clown
Token ID25573
Feature activation-8.41e-03

Pos logit contributions
 to+0.029
 each+0.027
 me+0.027
's+0.023
 for+0.023
Neg logit contributions
bridge-0.050
aster-0.040
pron-0.040
iche-0.039
ined-0.038
Tokens
Token ID82
Feature activation-5.72e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.007
 be+0.005
 for+0.005
Neg logit contributions
bridge-0.025
iche-0.021
pron-0.021
ined-0.021
atmeal-0.021
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
 to+0.051
 each+0.047
 me+0.042
 be+0.032
 for+0.031
Neg logit contributions
bridge-0.181
pron-0.155
ined-0.154
iche-0.152
aster-0.150
Token the
Token ID262
Feature activation+9.30e-03

Pos logit contributions
 to+0.031
 me+0.029
 each+0.028
 be+0.025
's+0.022
Neg logit contributions
bridge-0.085
aster-0.071
iche-0.070
ined-0.069
pron-0.069
Token rest
Token ID1334
Feature activation+1.31e-02

Pos logit contributions
bridge+0.022
pron+0.018
iche+0.018
ined+0.018
aster+0.017
Neg logit contributions
 to-0.016
 each-0.015
 me-0.014
 for-0.012
 be-0.012
Token of
Token ID286
Feature activation+2.36e-02

Pos logit contributions
bridge+0.034
pron+0.028
ined+0.028
iche+0.027
aster+0.027
Neg logit contributions
 to-0.018
 each-0.017
 me-0.017
 be-0.015
 for-0.013
Token the
Token ID262
Feature activation+7.99e-03

Pos logit contributions
bridge+0.070
aster+0.062
pron+0.062
iche+0.060
ined+0.060
Neg logit contributions
 to-0.022
 each-0.019
 me-0.018
's-0.015
 be-0.015
Token or
Token ID393
Feature activation+3.43e-02

Pos logit contributions
bridge+0.018
pron+0.015
iche+0.015
ined+0.015
aster+0.014
Neg logit contributions
 to-0.014
 each-0.013
 me-0.012
 for-0.011
 be-0.011
Tokennam
Token ID7402
Feature activation+1.44e-01

Pos logit contributions
bridge+0.092
pron+0.078
iche+0.076
ined+0.076
aster+0.075
Neg logit contributions
 to-0.046
 each-0.043
 me-0.040
 be-0.034
 for-0.034
Tokenents
Token ID658
Feature activation+8.16e-03

Pos logit contributions
bridge+0.359
iche+0.302
pron+0.302
ined+0.293
aster+0.286
Neg logit contributions
 to-0.208
 each-0.196
 me-0.194
 for-0.175
 be-0.171
Token onto
Token ID4291
Feature activation-3.59e-03

Pos logit contributions
bridge+0.026
pron+0.022
ined+0.022
iche+0.021
aster+0.021
Neg logit contributions
 to-0.007
 each-0.007
 me-0.006
 be-0.005
 for-0.004
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
 to+0.003
 each+0.003
 me+0.003
 be+0.002
 for+0.002
Neg logit contributions
bridge-0.011
pron-0.009
aster-0.009
ined-0.009
iche-0.009
Token tree
Token ID5509
Feature activation-1.44e-02

Pos logit contributions
bridge+0.026
pron+0.022
iche+0.020
aster+0.020
ined+0.020
Neg logit contributions
 to-0.022
 each-0.021
 me-0.019
 for-0.018
 be-0.018
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
 each+0.013
 me+0.012
 to+0.012
 be+0.009
 ho+0.008
Neg logit contributions
bridge-0.043
pron-0.037
ined-0.036
iche-0.036
aster-0.035
Token a
Token ID257
Feature activation-9.73e-03

Pos logit contributions
bridge+0.055
pron+0.046
hes+0.045
aster+0.044
ined+0.042
Neg logit contributions
 each-0.024
 to-0.023
 me-0.021
 for-0.020
,-0.019
Token box
Token ID3091
Feature activation+4.31e-03

Pos logit contributions
 to+0.014
 each+0.014
 me+0.012
 for+0.011
 be+0.011
Neg logit contributions
bridge-0.025
pron-0.022
iche-0.021
ined-0.021
aster-0.020
Token of
Token ID286
Feature activation+5.39e-02

Pos logit contributions
bridge+0.011
pron+0.009
ined+0.009
iche+0.009
aster+0.009
Neg logit contributions
 to-0.006
 each-0.006
 me-0.006
 be-0.005
 for-0.005
Token shiny
Token ID22441
Feature activation+2.96e-02

Pos logit contributions
bridge+0.131
pron+0.109
ined+0.106
iche+0.103
aster+0.102
Neg logit contributions
 to-0.086
 each-0.083
 me-0.074
 be-0.068
 for-0.067
Token or
Token ID393
Feature activation+6.54e-02

Pos logit contributions
bridge+0.073
pron+0.061
ined+0.060
iche+0.058
aster+0.057
Neg logit contributions
 to-0.047
 each-0.045
 me-0.041
 be-0.037
 for-0.036
Tokennam
Token ID7402
Feature activation+1.41e-01

Pos logit contributions
bridge+0.136
pron+0.112
ined+0.105
hes+0.102
iche+0.102
Neg logit contributions
 to-0.124
 each-0.119
 me-0.112
 for-0.102
 be-0.102
Tokenents
Token ID658
Feature activation+8.80e-03

Pos logit contributions
bridge+0.362
pron+0.300
iche+0.297
ined+0.295
hes+0.284
Neg logit contributions
 to-0.201
 each-0.191
 me-0.184
 for-0.165
 be-0.162
Token that
Token ID326
Feature activation-9.83e-03

Pos logit contributions
bridge+0.027
pron+0.023
ined+0.023
iche+0.022
aster+0.022
Neg logit contributions
 to-0.008
 each-0.008
 me-0.007
 be-0.006
 for-0.005
Token spark
Token ID9009
Feature activation-5.92e-04

Pos logit contributions
 to+0.015
 each+0.014
 me+0.013
 be+0.012
 for+0.011
Neg logit contributions
bridge-0.024
pron-0.020
ined-0.020
iche-0.019
hes-0.019
Tokenled
Token ID992
Feature activation-2.15e-02

Pos logit contributions
 to+0.001
 each+0.001
 me+0.001
 be+0.001
 for+0.001
Neg logit contributions
bridge-0.001
ined-0.001
pron-0.001
iche-0.001
hes-0.001
Token in
Token ID287
Feature activation+1.40e-02

Pos logit contributions
 to+0.021
 each+0.020
 me+0.019
 be+0.017
's+0.014
Neg logit contributions
bridge-0.063
pron-0.054
ined-0.053
iche-0.051
aster-0.050
Token he
Token ID339
Feature activation+2.15e-02

Pos logit contributions
bridge+0.030
iche+0.025
pron+0.025
ined+0.024
aster+0.024
Neg logit contributions
 to-0.018
 each-0.017
 me-0.015
 be-0.014
 for-0.014
Token especially
Token ID2592
Feature activation+3.26e-02

Pos logit contributions
bridge+0.058
pron+0.049
ined+0.049
iche+0.047
aster+0.046
Neg logit contributions
 to-0.029
 each-0.027
 me-0.025
 for-0.022
 be-0.021
Token liked
Token ID8288
Feature activation+1.37e-02

Pos logit contributions
bridge+0.086
pron+0.071
iche+0.071
ined+0.070
aster+0.069
Neg logit contributions
 to-0.049
 each-0.046
 me-0.042
 be-0.037
 for-0.037
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
bridge+0.049
pron+0.043
ined+0.042
aster+0.042
iche+0.042
Neg logit contributions
 each-0.003
 to-0.002
 me-0.002
 be-0.001
's+0.000
Token or
Token ID393
Feature activation+5.16e-02

Pos logit contributions
bridge+0.057
ined+0.047
iche+0.047
pron+0.047
aster+0.046
Neg logit contributions
 to-0.037
 each-0.034
 me-0.032
 for-0.029
 be-0.028
Tokennam
Token ID7402
Feature activation+1.41e-01

Pos logit contributions
bridge+0.147
ined+0.124
pron+0.124
iche+0.124
atmeal+0.118
Neg logit contributions
 to-0.071
 each-0.065
 me-0.059
 be-0.053
 for-0.052
Tokenents
Token ID658
Feature activation+1.32e-02

Pos logit contributions
bridge+0.380
pron+0.320
iche+0.317
ined+0.316
aster+0.304
Neg logit contributions
 to-0.191
 each-0.178
 me-0.167
 for-0.148
 be-0.146
Token he
Token ID339
Feature activation+2.19e-02

Pos logit contributions
bridge+0.043
pron+0.038
iche+0.037
ined+0.037
aster+0.036
Neg logit contributions
 to-0.011
 each-0.010
 me-0.008
 for-0.006
 be-0.006
Token kept
Token ID4030
Feature activation-1.91e-02

Pos logit contributions
bridge+0.057
pron+0.048
ined+0.048
iche+0.046
atmeal+0.045
Neg logit contributions
 to-0.031
 each-0.029
 me-0.026
 for-0.024
 be-0.023
Token around
Token ID1088
Feature activation-1.32e-02

Pos logit contributions
 to+0.022
 each+0.021
 me+0.018
 for+0.016
 be+0.015
Neg logit contributions
bridge-0.056
pron-0.048
iche-0.047
ined-0.047
aster-0.046
Token.
Token ID13
Feature activation-6.29e-03

Pos logit contributions
 to+0.011
 each+0.010
 me+0.009
 be+0.007
 for+0.006
Neg logit contributions
bridge-0.042
pron-0.036
ined-0.036
iche-0.036
aster-0.035
Token yellow
Token ID7872
Feature activation+1.66e-02

Pos logit contributions
bridge+0.058
pron+0.050
iche+0.048
ined+0.048
aster+0.048
Neg logit contributions
 to-0.024
 each-0.023
 me-0.020
 for-0.017
 be-0.017
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
bridge+0.053
pron+0.045
aster+0.044
iche+0.043
ined+0.043
Neg logit contributions
 each-0.012
 to-0.012
 me-0.009
 be-0.006
's-0.006
Token or
Token ID393
Feature activation+6.52e-02

Pos logit contributions
bridge+0.065
pron+0.056
iche+0.054
ined+0.054
aster+0.053
Neg logit contributions
 to-0.025
 each-0.023
 me-0.020
 for-0.017
 be-0.017
Token white
Token ID2330
Feature activation-1.08e-02

Pos logit contributions
bridge+0.169
pron+0.142
iche+0.138
ined+0.137
aster+0.136
Neg logit contributions
 to-0.094
 each-0.088
 me-0.080
 for-0.070
 be-0.070
Token.
Token ID13
Feature activation+1.59e-03

Pos logit contributions
 each+0.010
 to+0.010
 me+0.008
 be+0.007
's+0.006
Neg logit contributions
bridge-0.033
pron-0.027
aster-0.027
iche-0.027
ined-0.026
Token Which
Token ID9022
Feature activation+1.41e-01

Pos logit contributions
bridge+0.004
pron+0.003
ined+0.003
iche+0.003
aster+0.003
Neg logit contributions
 to-0.003
 each-0.003
 me-0.002
 be-0.002
 for-0.002
Token one
Token ID530
Feature activation+4.22e-02

Pos logit contributions
bridge+0.335
pron+0.289
aster+0.287
iche+0.278
ined+0.276
Neg logit contributions
 to-0.206
 each-0.193
 me-0.171
 for-0.164
 home-0.147
Token do
Token ID466
Feature activation+4.34e-02

Pos logit contributions
bridge+0.107
pron+0.092
iche+0.092
ined+0.090
aster+0.090
Neg logit contributions
 to-0.059
 each-0.057
 me-0.051
 for-0.045
 be-0.044
Token you
Token ID345
Feature activation+4.80e-02

Pos logit contributions
bridge+0.135
pron+0.116
ined+0.115
iche+0.114
aster+0.112
Neg logit contributions
 to-0.043
 each-0.039
 me-0.034
 be-0.028
 for-0.028
Token want
Token ID765
Feature activation+2.78e-02

Pos logit contributions
bridge+0.128
pron+0.107
ined+0.106
iche+0.105
aster+0.103
Neg logit contributions
 to-0.069
 each-0.065
 me-0.059
 be-0.052
 for-0.052
Token?"
Token ID1701
Feature activation+3.24e-02

Pos logit contributions
bridge+0.051
pron+0.042
aster+0.040
ined+0.040
iche+0.040
Neg logit contributions
 each-0.055
 to-0.054
 me-0.050
 be-0.047
 ho-0.047
Token and
Token ID290
Feature activation+2.29e-02

Pos logit contributions
bridge+0.055
pron+0.048
ined+0.048
iche+0.048
aster+0.046
Neg logit contributions
 to-0.013
 each-0.012
 me-0.010
 be-0.008
 for-0.008
Token a
Token ID257
Feature activation-6.22e-03

Pos logit contributions
bridge+0.072
pron+0.063
ined+0.062
iche+0.062
aster+0.061
Neg logit contributions
 to-0.021
 each-0.019
 me-0.017
 be-0.013
 for-0.013
Token red
Token ID2266
Feature activation+3.53e-02

Pos logit contributions
 to+0.010
 each+0.009
 me+0.008
 be+0.008
 for+0.007
Neg logit contributions
bridge-0.016
pron-0.013
ined-0.013
iche-0.013
aster-0.013
Token car
Token ID1097
Feature activation+4.52e-02

Pos logit contributions
bridge+0.081
pron+0.066
aster+0.062
ined+0.062
iche+0.061
Neg logit contributions
 to-0.060
 each-0.059
 me-0.054
 for-0.048
's-0.047
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
bridge+0.136
pron+0.110
ined+0.110
aster+0.108
hes+0.107
Neg logit contributions
 each-0.040
 to-0.040
 me-0.036
 ho-0.025
 home-0.025
Token Which
Token ID9022
Feature activation+1.40e-01

Pos logit contributions
bridge+0.026
pron+0.024
ined+0.024
atmeal+0.023
iche+0.022
Neg logit contributions
 to-0.016
 each-0.015
 me-0.014
 be-0.013
,-0.013
Token one
Token ID530
Feature activation+4.17e-02

Pos logit contributions
bridge+0.345
aster+0.288
pron+0.286
ined+0.279
iche+0.274
Neg logit contributions
 to-0.197
 each-0.184
 me-0.168
 for-0.157
 ho-0.141
Token do
Token ID466
Feature activation+4.27e-02

Pos logit contributions
bridge+0.109
pron+0.093
iche+0.092
ined+0.091
aster+0.090
Neg logit contributions
 to-0.057
 each-0.055
 me-0.049
 for-0.043
 be-0.042
Token you
Token ID345
Feature activation+4.66e-02

Pos logit contributions
bridge+0.132
pron+0.114
ined+0.113
iche+0.112
aster+0.110
Neg logit contributions
 to-0.043
 each-0.040
 me-0.035
 be-0.029
 for-0.029
Token want
Token ID765
Feature activation+2.52e-02

Pos logit contributions
bridge+0.135
pron+0.113
ined+0.112
iche+0.110
aster+0.108
Neg logit contributions
 to-0.061
 each-0.057
 me-0.051
 be-0.047
 for-0.047
Token?"
Token ID1701
Feature activation+3.64e-02

Pos logit contributions
bridge+0.047
pron+0.039
aster+0.037
ined+0.037
iche+0.037
Neg logit contributions
 each-0.049
 to-0.048
 me-0.045
 be-0.042
 ho-0.042
Token tree
Token ID5509
Feature activation-2.63e-03

Pos logit contributions
bridge+0.014
pron+0.012
iche+0.012
aster+0.012
ined+0.011
Neg logit contributions
 to-0.010
 each-0.009
 me-0.008
 be-0.007
 for-0.007
Token with
Token ID351
Feature activation+1.22e-02

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 for+0.001
 be+0.001
Neg logit contributions
bridge-0.008
pron-0.007
iche-0.007
ined-0.007
aster-0.007
Token lights
Token ID7588
Feature activation-8.93e-03

Pos logit contributions
bridge+0.034
ined+0.028
pron+0.028
hes+0.027
iche+0.027
Neg logit contributions
 to-0.015
 each-0.014
 me-0.013
 be-0.012
 for-0.012
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.006
 be+0.004
 for+0.004
Neg logit contributions
bridge-0.029
pron-0.025
ined-0.025
iche-0.025
aster-0.024
Token or
Token ID393
Feature activation+4.97e-02

Pos logit contributions
bridge+0.047
pron+0.039
ined+0.038
iche+0.038
aster+0.038
Neg logit contributions
 to-0.022
 each-0.021
 me-0.019
 for-0.017
 be-0.016
Tokennam
Token ID7402
Feature activation+1.40e-01

Pos logit contributions
bridge+0.134
pron+0.110
iche+0.109
aster+0.108
ined+0.108
Neg logit contributions
 to-0.066
 each-0.062
 me-0.059
 for-0.050
 be-0.050
Tokenents
Token ID658
Feature activation+8.17e-03

Pos logit contributions
bridge+0.351
ined+0.285
iche+0.284
pron+0.280
hes+0.274
Neg logit contributions
 to-0.205
 each-0.190
 me-0.189
 for-0.172
 be-0.165
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
bridge+0.026
pron+0.022
ined+0.022
iche+0.022
aster+0.022
Neg logit contributions
 to-0.007
 each-0.007
 me-0.006
 be-0.005
 for-0.004
Token They
Token ID1119
Feature activation+6.60e-03

Pos logit contributions
bridge+0.043
pron+0.037
ined+0.036
iche+0.036
aster+0.035
Neg logit contributions
 to-0.023
 each-0.021
 me-0.019
 for-0.017
 be-0.017
Token also
Token ID635
Feature activation+2.30e-02

Pos logit contributions
bridge+0.017
pron+0.014
ined+0.014
hes+0.013
aster+0.013
Neg logit contributions
 to-0.009
 me-0.008
 each-0.008
 for-0.008
 home-0.007
Token saw
Token ID2497
Feature activation+2.82e-02

Pos logit contributions
bridge+0.061
ined+0.051
pron+0.050
iche+0.049
aster+0.049
Neg logit contributions
 to-0.032
 each-0.029
 me-0.028
 for-0.024
 be-0.023
Token never
Token ID1239
Feature activation+5.16e-03

Pos logit contributions
bridge+0.033
ined+0.028
pron+0.028
aster+0.027
hes+0.027
Neg logit contributions
 to-0.023
 me-0.021
 each-0.021
 for-0.018
 ho-0.017
Token played
Token ID2826
Feature activation+4.86e-03

Pos logit contributions
bridge+0.011
iche+0.010
aster+0.010
pron+0.009
atmeal+0.009
Neg logit contributions
 each-0.008
 to-0.007
 me-0.007
 for-0.006
 home-0.006
Token with
Token ID351
Feature activation-8.34e-03

Pos logit contributions
bridge+0.015
pron+0.013
iche+0.013
ined+0.013
aster+0.013
Neg logit contributions
 to-0.006
 each-0.005
 me-0.004
 for-0.004
,-0.004
Token the
Token ID262
Feature activation+4.10e-03

Pos logit contributions
 to+0.012
 each+0.011
 me+0.010
 be+0.009
 for+0.009
Neg logit contributions
bridge-0.021
pron-0.018
iche-0.018
ined-0.018
aster-0.018
Token or
Token ID393
Feature activation+3.95e-02

Pos logit contributions
bridge+0.010
pron+0.008
ined+0.008
iche+0.008
aster+0.008
Neg logit contributions
 to-0.007
 each-0.007
 me-0.006
 for-0.006
 be-0.005
Tokennam
Token ID7402
Feature activation+1.40e-01

Pos logit contributions
bridge+0.108
pron+0.091
iche+0.090
ined+0.090
aster+0.088
Neg logit contributions
 to-0.053
 each-0.050
 me-0.045
 be-0.040
 for-0.039
Tokenents
Token ID658
Feature activation+4.84e-03

Pos logit contributions
bridge+0.369
pron+0.311
iche+0.310
ined+0.304
aster+0.294
Neg logit contributions
 to-0.193
 each-0.182
 me-0.172
 for-0.154
 be-0.152
Token again
Token ID757
Feature activation-3.28e-03

Pos logit contributions
bridge+0.014
pron+0.012
iche+0.012
ined+0.012
aster+0.012
Neg logit contributions
 to-0.006
 each-0.005
 me-0.004
 for-0.004
 be-0.004
Token.
Token ID13
Feature activation-1.33e-03

Pos logit contributions
 to+0.003
 each+0.002
 me+0.002
 be+0.001
 for+0.001
Neg logit contributions
bridge-0.011
pron-0.009
ined-0.009
iche-0.009
aster-0.009
Token They
Token ID1119
Feature activation-9.62e-03

Pos logit contributions
 to+0.001
 each+0.001
 me+0.001
 be+0.001
 for+0.001
Neg logit contributions
bridge-0.004
pron-0.004
ined-0.004
iche-0.003
atmeal-0.003
Token still
Token ID991
Feature activation-3.12e-02

Pos logit contributions
 to+0.015
 each+0.014
 me+0.013
 for+0.012
 be+0.012
Neg logit contributions
bridge-0.024
pron-0.019
ined-0.019
iche-0.019
aster-0.019
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
bridge+0.029
pron+0.025
hes+0.024
atmeal+0.024
aster+0.024
Neg logit contributions
 to-0.011
 me-0.010
 each-0.010
 Ch-0.008
 be-0.008
Token chocolate
Token ID11311
Feature activation+2.86e-02

Pos logit contributions
bridge+0.042
pron+0.035
iche+0.034
ined+0.033
aster+0.033
Neg logit contributions
 to-0.030
 each-0.028
 me-0.026
 for-0.024
 be-0.023
Token and
Token ID290
Feature activation+1.80e-02

Pos logit contributions
bridge+0.074
pron+0.062
atmeal+0.061
hes+0.061
iche+0.060
Neg logit contributions
 to-0.032
 me-0.031
 each-0.030
 be-0.027
's-0.025
Token nuts
Token ID14380
Feature activation+2.81e-02

Pos logit contributions
bridge+0.042
pron+0.034
iche+0.033
ined+0.033
aster+0.033
Neg logit contributions
 to-0.031
 each-0.030
 me-0.028
 for-0.025
 be-0.025
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
bridge+0.077
pron+0.064
hes+0.063
iche+0.063
kin+0.062
Neg logit contributions
 me-0.027
 each-0.027
 to-0.027
 be-0.022
's-0.021
Token Which
Token ID9022
Feature activation+1.40e-01

Pos logit contributions
bridge+0.043
ined+0.037
pron+0.036
iche+0.035
aster+0.034
Neg logit contributions
 to-0.030
 each-0.028
 me-0.026
 be-0.023
 for-0.023
Token one
Token ID530
Feature activation+4.52e-02

Pos logit contributions
bridge+0.354
pron+0.304
aster+0.296
iche+0.295
ined+0.293
Neg logit contributions
 to-0.202
 each-0.190
 me-0.172
 for-0.157
 be-0.151
Token do
Token ID466
Feature activation+4.33e-02

Pos logit contributions
bridge+0.112
pron+0.096
iche+0.095
ined+0.093
aster+0.092
Neg logit contributions
 to-0.067
 each-0.065
 me-0.058
 for-0.052
 be-0.051
Token you
Token ID345
Feature activation+4.66e-02

Pos logit contributions
bridge+0.127
pron+0.109
iche+0.107
ined+0.107
aster+0.106
Neg logit contributions
 to-0.048
 each-0.045
 me-0.039
 for-0.033
 be-0.033
Token want
Token ID765
Feature activation+2.37e-02

Pos logit contributions
bridge+0.129
pron+0.108
ined+0.107
iche+0.105
aster+0.103
Neg logit contributions
 to-0.065
 each-0.060
 me-0.055
 for-0.049
 be-0.049
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
bridge+0.045
pron+0.037
iche+0.035
ined+0.035
aster+0.035
Neg logit contributions
 each-0.047
 to-0.047
 me-0.044
 be-0.040
 ho-0.040
Token to
Token ID284
Feature activation-1.49e-03

Pos logit contributions
bridge+0.025
pron+0.023
aster+0.023
atmeal+0.022
iche+0.022
Neg logit contributions
 each-0.008
 to-0.008
 me-0.007
 be-0.007
's-0.007
Token look
Token ID804
Feature activation+5.83e-03

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 be+0.002
 for+0.001
Neg logit contributions
bridge-0.004
ined-0.003
pron-0.003
iche-0.003
hes-0.003
Token at
Token ID379
Feature activation+1.22e-02

Pos logit contributions
bridge+0.016
pron+0.014
iche+0.014
ined+0.014
aster+0.014
Neg logit contributions
 to-0.006
 each-0.006
 be-0.005
 me-0.005
's-0.004
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
bridge+0.035
pron+0.030
aster+0.030
ined+0.029
iche+0.029
Neg logit contributions
 to-0.013
 each-0.011
 me-0.010
 be-0.009
 for-0.009
Token or
Token ID393
Feature activation+4.58e-02

Pos logit contributions
bridge+0.030
pron+0.025
ined+0.024
iche+0.024
aster+0.023
Neg logit contributions
 to-0.020
 each-0.019
 me-0.018
 for-0.016
 be-0.016
Tokennam
Token ID7402
Feature activation+1.40e-01

Pos logit contributions
bridge+0.099
ined+0.076
pron+0.074
iche+0.072
atmeal+0.072
Neg logit contributions
 to-0.101
 each-0.096
 me-0.090
 be-0.084
 for-0.083
Tokenents
Token ID658
Feature activation+9.36e-03

Pos logit contributions
bridge+0.375
pron+0.318
iche+0.315
ined+0.313
aster+0.303
Neg logit contributions
 to-0.186
 each-0.174
 me-0.161
 for-0.146
 be-0.143
Token on
Token ID319
Feature activation+3.58e-02

Pos logit contributions
bridge+0.030
pron+0.027
iche+0.026
ined+0.026
aster+0.026
Neg logit contributions
 to-0.008
 each-0.007
 me-0.006
 for-0.005
 be-0.005
Token the
Token ID262
Feature activation+5.57e-03

Pos logit contributions
bridge+0.111
pron+0.096
ined+0.094
iche+0.094
aster+0.093
Neg logit contributions
 to-0.035
 each-0.031
 me-0.027
 for-0.023
 be-0.022
Token tree
Token ID5509
Feature activation-9.95e-03

Pos logit contributions
bridge+0.011
pron+0.009
iche+0.008
ined+0.008
aster+0.008
Neg logit contributions
 to-0.011
 each-0.011
 me-0.010
 be-0.009
 for-0.009
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
 to+0.008
 each+0.008
 me+0.007
 be+0.005
 for+0.005
Neg logit contributions
bridge-0.032
pron-0.027
ined-0.027
iche-0.027
aster-0.027
Token who
Token ID508
Feature activation+2.05e-02

Pos logit contributions
 to+0.044
 each+0.043
 me+0.041
 be+0.038
 for+0.038
Neg logit contributions
bridge-0.034
pron-0.027
ined-0.027
aster-0.027
iche-0.026
Token liked
Token ID8288
Feature activation+1.17e-02

Pos logit contributions
bridge+0.053
pron+0.044
ined+0.044
iche+0.043
aster+0.042
Neg logit contributions
 to-0.031
 each-0.029
 me-0.027
 for-0.024
 be-0.024
Token to
Token ID284
Feature activation+2.17e-03

Pos logit contributions
bridge+0.040
pron+0.038
aster+0.037
iche+0.036
atmeal+0.036
Neg logit contributions
 each-0.002
 me-0.002
 be-0.000
 to-0.000
's+0.000
Token collect
Token ID2824
Feature activation+5.35e-02

Pos logit contributions
bridge+0.006
ined+0.005
pron+0.005
iche+0.005
 narrative+0.005
Neg logit contributions
 to-0.003
 each-0.003
 be-0.002
 me-0.002
,-0.002
Token or
Token ID393
Feature activation+4.61e-02

Pos logit contributions
bridge+0.132
ined+0.110
iche+0.105
pron+0.104
hes+0.103
Neg logit contributions
 to-0.077
 each-0.075
 me-0.067
 be-0.065
 ho-0.058
Tokennam
Token ID7402
Feature activation+1.38e-01

Pos logit contributions
bridge+0.088
pron+0.068
ined+0.067
iche+0.067
aster+0.064
Neg logit contributions
 to-0.102
 each-0.098
 me-0.092
 be-0.086
 for-0.085
Tokenents
Token ID658
Feature activation+5.45e-03

Pos logit contributions
bridge+0.360
iche+0.301
pron+0.301
ined+0.301
hes+0.284
Neg logit contributions
 to-0.191
 each-0.178
 me-0.168
 for-0.154
 be-0.150
Token.
Token ID13
Feature activation-1.60e-03

Pos logit contributions
bridge+0.017
ined+0.015
pron+0.015
iche+0.014
aster+0.014
Neg logit contributions
 to-0.004
 each-0.004
 me-0.004
 be-0.003
 for-0.003
Token Everywhere
Token ID48201
Feature activation+4.62e-02

Pos logit contributions
 to+0.003
 each+0.002
 me+0.002
 for+0.002
 be+0.002
Neg logit contributions
bridge-0.004
pron-0.003
iche-0.003
ined-0.003
aster-0.003
Token she
Token ID673
Feature activation+2.81e-02

Pos logit contributions
bridge+0.097
pron+0.080
ined+0.078
aster+0.076
iche+0.074
Neg logit contributions
 to-0.082
 each-0.079
 me-0.074
 for-0.068
 be-0.068
Token went
Token ID1816
Feature activation+2.59e-02

Pos logit contributions
bridge+0.085
pron+0.073
ined+0.072
iche+0.071
aster+0.070
Neg logit contributions
 to-0.030
 each-0.027
 me-0.024
 for-0.020
 be-0.020
Token friend
Token ID1545
Feature activation+5.06e-03

Pos logit contributions
bridge+0.168
pron+0.149
aster+0.140
iche+0.131
atmeal+0.131
Neg logit contributions
 each-0.082
 me-0.073
 to-0.070
 be-0.058
 Fl-0.056
Token,
Token ID11
Feature activation-2.28e-02

Pos logit contributions
bridge+0.014
aster+0.013
pron+0.012
iche+0.012
atmeal+0.012
Neg logit contributions
 each-0.004
 me-0.004
 to-0.003
 be-0.003
 heads-0.003
Token Sam
Token ID3409
Feature activation+1.12e-03

Pos logit contributions
 to+0.026
 each+0.025
 me+0.024
 be+0.020
 heads+0.020
Neg logit contributions
bridge-0.059
pron-0.052
aster-0.051
iche-0.050
hes-0.049
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
bridge+0.003
pron+0.003
aster+0.003
ined+0.003
iche+0.003
Neg logit contributions
 each-0.001
 to-0.001
 me-0.001
 be-0.001
 ho-0.001
Token "
Token ID366
Feature activation-1.23e-02

Pos logit contributions
 each+0.010
 to+0.009
 me+0.009
 be+0.008
 heads+0.008
Neg logit contributions
bridge-0.028
pron-0.025
aster-0.024
ined-0.023
iche-0.023
TokenWhich
Token ID13828
Feature activation+1.37e-01

Pos logit contributions
 to+0.027
 each+0.026
 me+0.025
 be+0.023
 for+0.022
Neg logit contributions
bridge-0.024
pron-0.019
ined-0.019
iche-0.018
aster-0.017
Token writing
Token ID3597
Feature activation+3.42e-02

Pos logit contributions
bridge+0.321
pron+0.270
aster+0.263
ined+0.253
iche+0.252
Neg logit contributions
 each-0.212
 to-0.208
 me-0.186
 for-0.168
 be-0.159
Token place
Token ID1295
Feature activation+2.17e-02

Pos logit contributions
bridge+0.098
pron+0.085
ined+0.082
iche+0.081
hes+0.079
Neg logit contributions
 to-0.039
 each-0.037
 me-0.032
 be-0.026
 for-0.026
Token should
Token ID815
Feature activation+5.85e-02

Pos logit contributions
bridge+0.058
pron+0.049
ined+0.049
iche+0.047
hes+0.047
Neg logit contributions
 each-0.024
 to-0.024
 me-0.020
 ho-0.020
 h-0.017
Token I
Token ID314
Feature activation+2.24e-02

Pos logit contributions
bridge+0.159
pron+0.135
ined+0.132
iche+0.132
aster+0.130
Neg logit contributions
 to-0.079
 each-0.074
 me-0.067
 for-0.059
 be-0.058
Token go
Token ID467
Feature activation+9.16e-03

Pos logit contributions
bridge+0.059
pron+0.049
ined+0.048
iche+0.048
aster+0.047
Neg logit contributions
 to-0.034
 each-0.032
 me-0.029
 be-0.026
 for-0.026
Token skipping
Token ID31017
Feature activation-3.35e-02

Pos logit contributions
 me+0.019
 each+0.017
 be+0.013
 to+0.013
's+0.011
Neg logit contributions
bridge-0.096
ined-0.087
pron-0.087
aster-0.084
hes-0.084
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
 me+0.028
 to+0.026
 be+0.026
 each+0.024
's+0.022
Neg logit contributions
bridge-0.090
ined-0.077
pron-0.072
rob-0.071
aster-0.070
Token talking
Token ID3375
Feature activation-3.37e-02

Pos logit contributions
 to+0.031
 each+0.029
 me+0.027
 be+0.024
 for+0.024
Neg logit contributions
bridge-0.042
pron-0.035
iche-0.035
ined-0.034
aster-0.034
Token together
Token ID1978
Feature activation-4.80e-02

Pos logit contributions
 me+0.033
 each+0.028
 be+0.027
 to+0.025
 ho+0.024
Neg logit contributions
bridge-0.092
ined-0.078
pron-0.074
hes-0.074
iest-0.070
Token until
Token ID1566
Feature activation-5.46e-02

Pos logit contributions
 me+0.043
 to+0.041
 each+0.038
 be+0.034
 home+0.030
Neg logit contributions
bridge-0.131
ined-0.112
pron-0.112
aster-0.108
hes-0.108
Token dusk
Token ID46166
Feature activation-1.19e-01

Pos logit contributions
 to+0.080
 each+0.075
 me+0.066
,+0.060
 for+0.057
Neg logit contributions
bridge-0.157
ined-0.131
pron-0.130
iche-0.128
atmeal-0.124
Token.
Token ID13
Feature activation-2.69e-02

Pos logit contributions
 each+0.118
 to+0.111
 me+0.103
 be+0.087
 for+0.081
Neg logit contributions
bridge-0.350
ined-0.293
iche-0.289
pron-0.289
hes-0.285
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 to+0.039
 each+0.037
 me+0.032
 for+0.029
 be+0.028
Neg logit contributions
bridge-0.076
ined-0.063
pron-0.063
atmeal-0.059
iche-0.059
TokenTodd
Token ID42817
Feature activation-2.34e-02

Pos logit contributions
 to+0.020
 each+0.018
 me+0.016
 be+0.013
 for+0.012
Neg logit contributions
bridge-0.058
pron-0.049
ined-0.049
iche-0.048
atmeal-0.048
Token said
Token ID531
Feature activation-1.61e-02

Pos logit contributions
 to+0.026
 each+0.025
 me+0.020
 for+0.019
 be+0.017
Neg logit contributions
bridge-0.065
ined-0.057
pron-0.057
iche-0.055
aster-0.054
Token goodbye
Token ID24829
Feature activation+1.02e-02

Pos logit contributions
 each+0.020
 to+0.018
 me+0.018
 be+0.015
 for+0.014
Neg logit contributions
bridge-0.043
pron-0.036
ined-0.035
aster-0.034
iche-0.034
Token and
Token ID290
Feature activation-3.29e-02

Pos logit contributions
 each+0.059
 to+0.053
 me+0.040
 be+0.039
 for+0.035
Neg logit contributions
bridge-0.256
ined-0.223
pron-0.221
aster-0.217
iche-0.217
Token car
Token ID1097
Feature activation-2.10e-02

Pos logit contributions
 to+0.043
 each+0.042
 me+0.037
 for+0.033
 home+0.033
Neg logit contributions
bridge-0.083
pron-0.070
aster-0.070
hes-0.068
iche-0.067
Tokenaway
Token ID8272
Feature activation-5.80e-02

Pos logit contributions
 each+0.015
 me+0.013
 to+0.013
 be+0.009
 for+0.008
Neg logit contributions
bridge-0.064
pron-0.053
ined-0.053
hes-0.052
aster-0.052
Token and
Token ID290
Feature activation-3.28e-02

Pos logit contributions
 to+0.075
 each+0.073
 me+0.064
 for+0.062
 home+0.059
Neg logit contributions
bridge-0.140
ined-0.117
pron-0.113
hes-0.110
aster-0.110
Token c
Token ID269
Feature activation-3.96e-02

Pos logit contributions
 to+0.043
 each+0.041
 me+0.036
 for+0.033
,+0.032
Neg logit contributions
bridge-0.083
aster-0.070
pron-0.069
hes-0.068
iche-0.066
Tokenumin
Token ID7230
Feature activation-1.15e-01

Pos logit contributions
 to+0.084
 each+0.081
 be+0.074
 me+0.073
 for+0.072
Neg logit contributions
bridge-0.074
ined-0.061
pron-0.059
iche-0.058
hes-0.056
Token and
Token ID290
Feature activation-3.23e-02

Pos logit contributions
 each+0.053
 me+0.039
 to+0.036
 be+0.024
 for+0.018
Neg logit contributions
bridge-0.377
pron-0.326
ined-0.326
hes-0.322
aster-0.319
Token cor
Token ID1162
Feature activation-2.70e-02

Pos logit contributions
 to+0.042
 each+0.040
 me+0.035
 for+0.032
,+0.031
Neg logit contributions
bridge-0.083
pron-0.070
aster-0.069
hes-0.068
iche-0.066
Tokeni
Token ID72
Feature activation-5.35e-02

Pos logit contributions
 to+0.043
 each+0.041
 be+0.039
 me+0.039
,+0.036
Neg logit contributions
bridge-0.060
iche-0.049
aster-0.048
pron-0.046
atmeal-0.046
Tokenander
Token ID4066
Feature activation-5.39e-02

Pos logit contributions
 each+0.048
 to+0.044
 me+0.039
 be+0.033
 for+0.030
Neg logit contributions
bridge-0.155
pron-0.133
hes-0.131
ined-0.130
aster-0.129
Token and
Token ID290
Feature activation-3.18e-02

Pos logit contributions
 each+0.044
 to+0.038
 me+0.038
 be+0.030
 for+0.028
Neg logit contributions
bridge-0.162
pron-0.136
ined-0.135
hes-0.134
aster-0.131
Token proud
Token ID6613
Feature activation-2.05e-02

Pos logit contributions
 to+0.031
 each+0.030
 me+0.026
 be+0.021
 for+0.017
Neg logit contributions
bridge-0.138
pron-0.118
ined-0.118
iche-0.112
atmeal-0.110
Token of
Token ID286
Feature activation+1.71e-02

Pos logit contributions
 each+0.031
 to+0.029
 me+0.028
 be+0.026
's+0.024
Neg logit contributions
bridge-0.047
pron-0.037
ined-0.036
aster-0.035
iche-0.035
Token herself
Token ID5223
Feature activation-3.54e-02

Pos logit contributions
bridge+0.046
ined+0.039
pron+0.038
aster+0.038
iche+0.037
Neg logit contributions
 to-0.022
 each-0.021
 me-0.018
's-0.016
 be-0.016
Token for
Token ID329
Feature activation-3.60e-02

Pos logit contributions
 each+0.031
 me+0.030
 to+0.027
 be+0.023
 heads+0.020
Neg logit contributions
bridge-0.103
pron-0.086
ined-0.083
kin-0.082
aster-0.081
Token staying
Token ID10589
Feature activation-2.70e-02

Pos logit contributions
 to+0.055
 each+0.052
 me+0.047
's+0.043
 be+0.042
Neg logit contributions
bridge-0.086
pron-0.073
aster-0.071
ined-0.071
iche-0.068
Token ahead
Token ID4058
Feature activation-1.14e-01

Pos logit contributions
 to+0.033
 each+0.031
 me+0.028
 for+0.024
 be+0.024
Neg logit contributions
bridge-0.078
pron-0.066
iche-0.066
ined-0.065
aster-0.064
Token for
Token ID329
Feature activation-5.33e-02

Pos logit contributions
 each+0.114
 to+0.112
 me+0.097
 be+0.084
 for+0.073
Neg logit contributions
bridge-0.330
pron-0.284
ined-0.275
aster-0.272
iche-0.270
Token so
Token ID523
Feature activation+3.13e-03

Pos logit contributions
 to+0.063
 each+0.059
's+0.048
 me+0.048
 be+0.044
Neg logit contributions
bridge-0.143
aster-0.125
pron-0.121
ined-0.117
hes-0.115
Token long
Token ID890
Feature activation-5.52e-02

Pos logit contributions
bridge+0.008
pron+0.006
ined+0.006
iche+0.006
aster+0.006
Neg logit contributions
 to-0.005
 each-0.005
 me-0.004
 be-0.004
 for-0.004
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 to+0.045
 each+0.045
 me+0.037
 be+0.032
's+0.028
Neg logit contributions
bridge-0.172
pron-0.145
ined-0.143
aster-0.143
iche-0.139
Token so
Token ID523
Feature activation+1.53e-02

Pos logit contributions
 to+0.018
 each+0.017
 me+0.015
 be+0.013
 for+0.013
Neg logit contributions
bridge-0.039
pron-0.033
ined-0.032
iche-0.032
aster-0.032
TokenTim
Token ID14967
Feature activation-4.71e-03

Pos logit contributions
 to+0.010
 each+0.010
 me+0.009
 be+0.008
's+0.008
Neg logit contributions
bridge-0.015
iche-0.012
pron-0.012
ined-0.012
atmeal-0.011
Tokenmy
Token ID1820
Feature activation+1.78e-02

Pos logit contributions
 to+0.007
 each+0.007
 for+0.006
 me+0.006
 be+0.006
Neg logit contributions
bridge-0.010
pron-0.009
ined-0.009
iche-0.009
atmeal-0.009
Token waited
Token ID13488
Feature activation-3.83e-02

Pos logit contributions
bridge+0.043
pron+0.040
ined+0.038
aster+0.037
iche+0.036
Neg logit contributions
 to-0.021
 each-0.020
 for-0.018
 me-0.016
 home-0.016
Token patiently
Token ID37220
Feature activation-6.13e-02

Pos logit contributions
 each+0.037
 to+0.037
 me+0.033
 be+0.029
's+0.027
Neg logit contributions
bridge-0.110
pron-0.098
ined-0.094
aster-0.092
iche-0.092
Token until
Token ID1566
Feature activation-5.55e-02

Pos logit contributions
 each+0.045
 to+0.043
 me+0.041
 be+0.029
's+0.028
Neg logit contributions
bridge-0.183
pron-0.165
aster-0.161
ined-0.159
iche-0.158
Token noon
Token ID19613
Feature activation-1.11e-01

Pos logit contributions
 to+0.077
 each+0.067
,+0.059
 me+0.057
 for+0.054
Neg logit contributions
bridge-0.172
iche-0.147
ined-0.146
pron-0.141
opl-0.140
Token,
Token ID11
Feature activation-2.39e-02

Pos logit contributions
 each+0.063
 to+0.062
 me+0.051
 be+0.033
 for+0.028
Neg logit contributions
bridge-0.375
pron-0.327
ined-0.322
iche-0.322
aster-0.320
Token and
Token ID290
Feature activation-1.51e-02

Pos logit contributions
 to+0.030
 each+0.027
 me+0.024
 for+0.021
,+0.020
Neg logit contributions
bridge-0.072
ined-0.062
iche-0.061
pron-0.061
atmeal-0.058
Token then
Token ID788
Feature activation-3.24e-02

Pos logit contributions
 to+0.021
 each+0.020
 me+0.018
 for+0.016
 be+0.016
Neg logit contributions
bridge-0.042
iche-0.035
pron-0.035
ined-0.035
atmeal-0.033
Token he
Token ID339
Feature activation-7.64e-03

Pos logit contributions
 to+0.058
 each+0.056
 me+0.052
 be+0.047
 for+0.047
Neg logit contributions
bridge-0.073
pron-0.059
ined-0.057
iche-0.057
aster-0.057
Token went
Token ID1816
Feature activation-2.16e-02

Pos logit contributions
 to+0.011
 each+0.011
 for+0.009
 me+0.009
 be+0.008
Neg logit contributions
bridge-0.019
pron-0.016
aster-0.016
iche-0.016
ined-0.015
Token a
Token ID257
Feature activation-5.39e-02

Pos logit contributions
 to+0.053
 me+0.045
 each+0.044
's+0.038
 be+0.036
Neg logit contributions
bridge-0.121
aster-0.107
pron-0.104
ined-0.103
hes-0.103
Token cos
Token ID8615
Feature activation-3.36e-02

Pos logit contributions
 each+0.092
 to+0.083
 me+0.081
 heads+0.072
's+0.069
Neg logit contributions
pron-0.114
bridge-0.114
aster-0.098
iche-0.098
atmeal-0.090
Tokeny
Token ID88
Feature activation-4.64e-02

Pos logit contributions
 to+0.030
 each+0.027
 me+0.021
 for+0.018
,+0.017
Neg logit contributions
bridge-0.110
pron-0.095
iche-0.094
ined-0.093
aster-0.092
Token sleep
Token ID3993
Feature activation-6.13e-02

Pos logit contributions
 each+0.085
 to+0.082
 me+0.073
's+0.068
 how+0.065
Neg logit contributions
bridge-0.089
pron-0.084
aster-0.075
iche-0.070
kin-0.070
Token until
Token ID1566
Feature activation-6.31e-02

Pos logit contributions
 to+0.055
 each+0.051
 me+0.047
 be+0.037
 for+0.032
Neg logit contributions
bridge-0.192
pron-0.165
ined-0.162
iche-0.158
aster-0.158
Token spring
Token ID6076
Feature activation-1.10e-01

Pos logit contributions
 to+0.080
 each+0.074
 me+0.066
 for+0.058
 be+0.057
Neg logit contributions
bridge-0.182
pron-0.154
ined-0.153
iche-0.153
aster-0.147
Tokentime
Token ID2435
Feature activation-7.36e-02

Pos logit contributions
 each+0.161
 to+0.160
 me+0.147
 be+0.137
 for+0.125
Neg logit contributions
bridge-0.276
ined-0.231
pron-0.227
iche-0.221
hes-0.217
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
 each+0.068
 me+0.066
 to+0.061
 ho+0.050
 scar+0.049
Neg logit contributions
bridge-0.202
pron-0.171
ined-0.165
iche-0.164
aster-0.162
Token<|endoftext|>
Token ID50256
Feature activation-1.92e-02

Pos logit contributions
 to+0.042
 each+0.040
 me+0.037
 be+0.033
 for+0.033
Neg logit contributions
bridge-0.062
pron-0.052
iche-0.052
hes-0.050
ined-0.050
TokenOne
Token ID3198
Feature activation-2.02e-02

Pos logit contributions
 to+0.028
 each+0.027
 me+0.025
 be+0.022
 for+0.022
Neg logit contributions
bridge-0.050
pron-0.042
ined-0.041
iche-0.041
aster-0.040
Token day
Token ID1110
Feature activation-3.33e-02

Pos logit contributions
 to+0.038
 each+0.037
 me+0.034
 for+0.031
 be+0.031
Neg logit contributions
bridge-0.044
pron-0.036
ined-0.035
iche-0.035
aster-0.035
Token burst
Token ID11173
Feature activation-8.09e-02

Pos logit contributions
 to+0.049
 each+0.046
 me+0.042
 for+0.038
 be+0.038
Neg logit contributions
bridge-0.074
pron-0.062
iche-0.060
ined-0.060
aster-0.059
Token of
Token ID286
Feature activation+1.38e-02

Pos logit contributions
 each+0.123
 to+0.119
 me+0.107
 for+0.096
 be+0.096
Neg logit contributions
bridge-0.201
ined-0.151
aster-0.151
pron-0.148
obb-0.146
Token speed
Token ID2866
Feature activation-2.40e-02

Pos logit contributions
bridge+0.034
pron+0.029
ined+0.028
aster+0.028
iche+0.028
Neg logit contributions
 to-0.021
 each-0.020
 me-0.018
 be-0.017
 for-0.017
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
 each+0.018
 to+0.018
 me+0.018
 be+0.013
 ho+0.012
Neg logit contributions
bridge-0.070
ined-0.059
pron-0.058
aster-0.057
opl-0.055
Token raced
Token ID32897
Feature activation-2.62e-02

Pos logit contributions
 to+0.027
 each+0.026
 me+0.023
 for+0.021
 be+0.021
Neg logit contributions
bridge-0.047
pron-0.039
ined-0.038
aster-0.037
iche-0.037
Token ahead
Token ID4058
Feature activation-1.10e-01

Pos logit contributions
 each+0.025
 to+0.025
 me+0.024
 be+0.022
's+0.021
Neg logit contributions
bridge-0.070
ined-0.054
aster-0.053
pron-0.053
roads-0.052
Token of
Token ID286
Feature activation-9.45e-03

Pos logit contributions
 each+0.096
 to+0.095
 me+0.087
 be+0.069
 for+0.062
Neg logit contributions
bridge-0.330
pron-0.278
ined-0.275
aster-0.273
iche-0.268
Token Luke
Token ID11336
Feature activation-5.74e-03

Pos logit contributions
 to+0.011
 each+0.010
 me+0.009
 be+0.008
 for+0.008
Neg logit contributions
bridge-0.027
pron-0.023
ined-0.023
iche-0.023
aster-0.023
Token.
Token ID13
Feature activation-2.53e-02

Pos logit contributions
 each+0.005
 me+0.005
 to+0.005
 scar+0.004
 be+0.004
Neg logit contributions
bridge-0.015
pron-0.012
aster-0.012
iche-0.012
kin-0.012
Token
Token ID198
Feature activation-1.72e-02

Pos logit contributions
 to+0.038
 each+0.035
 me+0.032
 be+0.028
 for+0.028
Neg logit contributions
bridge-0.067
pron-0.056
ined-0.056
iche-0.055
aster-0.054
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
 to+0.017
 each+0.015
 me+0.013
 be+0.010
 for+0.010
Neg logit contributions
bridge-0.055
pron-0.047
ined-0.047
iche-0.047
atmeal-0.046
Token she
Token ID673
Feature activation-8.10e-03

Pos logit contributions
 to+0.015
 each+0.014
 me+0.013
 for+0.011
 be+0.011
Neg logit contributions
bridge-0.023
pron-0.019
ined-0.019
iche-0.019
aster-0.019
Token would
Token ID561
Feature activation+9.61e-03

Pos logit contributions
 each+0.011
 to+0.011
 me+0.009
 for+0.009
 home+0.009
Neg logit contributions
bridge-0.019
pron-0.017
ined-0.016
hes-0.015
iche-0.015
Token have
Token ID423
Feature activation-2.59e-02

Pos logit contributions
bridge+0.026
pron+0.022
aster+0.021
iche+0.021
ined+0.021
Neg logit contributions
 to-0.013
 each-0.012
 me-0.011
 for-0.010
's-0.009
Token to
Token ID284
Feature activation-1.41e-02

Pos logit contributions
 each+0.024
 me+0.022
 to+0.022
 heads+0.021
 ho+0.021
Neg logit contributions
bridge-0.071
pron-0.058
hes-0.058
aster-0.057
iche-0.054
Token wait
Token ID4043
Feature activation-2.45e-02

Pos logit contributions
 to+0.019
 each+0.017
 me+0.015
 for+0.014
's+0.013
Neg logit contributions
bridge-0.037
aster-0.032
pron-0.032
iche-0.031
ined-0.030
Token ages
Token ID9337
Feature activation-1.10e-01

Pos logit contributions
 to+0.019
 each+0.018
 me+0.016
 be+0.012
's+0.012
Neg logit contributions
bridge-0.077
pron-0.068
ined-0.066
aster-0.065
iche-0.065
Token until
Token ID1566
Feature activation-5.51e-02

Pos logit contributions
 each+0.086
 to+0.081
 me+0.078
 be+0.056
's+0.047
Neg logit contributions
bridge-0.343
pron-0.298
ined-0.284
aster-0.284
iche-0.281
Token he
Token ID339
Feature activation-1.46e-02

Pos logit contributions
 to+0.065
 each+0.057
 me+0.050
 for+0.044
 be+0.043
Neg logit contributions
bridge-0.171
ined-0.149
iche-0.146
pron-0.144
hes-0.139
Token got
Token ID1392
Feature activation-2.02e-02

Pos logit contributions
 to+0.018
 each+0.018
 me+0.015
 for+0.015
 home+0.013
Neg logit contributions
bridge-0.039
aster-0.032
pron-0.032
ined-0.032
iche-0.031
Token inside
Token ID2641
Feature activation-2.84e-02

Pos logit contributions
 to+0.013
 each+0.012
 me+0.010
 be+0.007
 for+0.007
Neg logit contributions
bridge-0.068
pron-0.060
ined-0.059
iche-0.059
aster-0.058
Token the
Token ID262
Feature activation+1.53e-03

Pos logit contributions
 to+0.022
 each+0.020
 me+0.017
 be+0.013
 for+0.012
Neg logit contributions
bridge-0.091
pron-0.080
ined-0.078
aster-0.078
iche-0.078
Token to
Token ID284
Feature activation-8.15e-03

Pos logit contributions
bridge+0.047
pron+0.041
ined+0.041
iche+0.039
atmeal+0.039
Neg logit contributions
 me-0.004
 each-0.004
 be-0.002
 heads-0.002
 scar-0.001
Token many
Token ID867
Feature activation-2.48e-03

Pos logit contributions
 to+0.010
 each+0.009
 me+0.009
 for+0.007
's+0.007
Neg logit contributions
bridge-0.022
aster-0.018
pron-0.018
iche-0.018
ined-0.018
Token more
Token ID517
Feature activation+3.73e-02

Pos logit contributions
 to+0.004
 each+0.003
 me+0.003
 be+0.003
 for+0.003
Neg logit contributions
bridge-0.006
ined-0.005
pron-0.005
aster-0.005
iche-0.005
Token happy
Token ID3772
Feature activation-5.63e-03

Pos logit contributions
bridge+0.105
pron+0.088
ined+0.087
iche+0.087
aster+0.085
Neg logit contributions
 to-0.047
 each-0.044
 me-0.040
 be-0.035
 for-0.034
Token days
Token ID1528
Feature activation-6.38e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.007
 be+0.006
's+0.006
Neg logit contributions
bridge-0.014
ined-0.011
iche-0.011
aster-0.011
pron-0.011
Token ahead
Token ID4058
Feature activation-1.09e-01

Pos logit contributions
 me+0.056
 to+0.056
 each+0.055
 be+0.044
 h+0.039
Neg logit contributions
bridge-0.191
pron-0.160
ined-0.158
aster-0.156
iche-0.154
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
 to+0.104
 each+0.099
 me+0.089
 be+0.074
 for+0.067
Neg logit contributions
bridge-0.329
pron-0.281
ined-0.277
iche-0.274
aster-0.272
Token<|endoftext|>
Token ID50256
Feature activation-1.47e-02

Pos logit contributions
 to+0.034
 each+0.032
 me+0.028
 be+0.025
 for+0.025
Neg logit contributions
bridge-0.076
pron-0.065
ined-0.064
iche-0.063
atmeal-0.062
TokenOnce
Token ID7454
Feature activation-1.20e-02

Pos logit contributions
 to+0.022
 each+0.021
 me+0.019
 be+0.017
 for+0.017
Neg logit contributions
bridge-0.038
pron-0.032
iche-0.032
ined-0.032
aster-0.031
Token upon
Token ID2402
Feature activation-3.83e-02

Pos logit contributions
 to+0.016
 each+0.016
 me+0.014
 be+0.013
 for+0.012
Neg logit contributions
bridge-0.031
pron-0.027
ined-0.026
iche-0.026
aster-0.025
Token a
Token ID257
Feature activation-4.61e-02

Pos logit contributions
 to+0.028
 each+0.025
 me+0.019
 be+0.015
 for+0.014
Neg logit contributions
bridge-0.129
pron-0.114
ined-0.113
iche-0.113
aster-0.110
Token was
Token ID373
Feature activation-2.18e-02

Pos logit contributions
bridge+0.006
pron+0.005
ined+0.005
hes+0.005
aster+0.005
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 for-0.002
 home-0.002
Token very
Token ID845
Feature activation+1.94e-02

Pos logit contributions
 to+0.022
 each+0.020
 me+0.018
 be+0.015
 for+0.015
Neg logit contributions
bridge-0.067
pron-0.058
ined-0.057
iche-0.057
aster-0.056
Token excited
Token ID6568
Feature activation-1.17e-02

Pos logit contributions
bridge+0.049
pron+0.041
ined+0.040
iche+0.040
aster+0.039
Neg logit contributions
 to-0.031
 each-0.029
 me-0.027
 be-0.025
 for-0.024
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 each+0.010
 me+0.008
 ho+0.008
 to+0.008
 be+0.007
Neg logit contributions
bridge-0.032
pron-0.027
aster-0.027
ined-0.026
atmeal-0.026
Token ran
Token ID4966
Feature activation-2.16e-02

Pos logit contributions
 to+0.022
 each+0.022
 me+0.020
 for+0.018
 be+0.018
Neg logit contributions
bridge-0.033
pron-0.028
ined-0.027
aster-0.026
iche-0.026
Token ahead
Token ID4058
Feature activation-1.09e-01

Pos logit contributions
 each+0.013
 me+0.011
 to+0.011
 be+0.009
 ho+0.008
Neg logit contributions
bridge-0.068
aster-0.058
pron-0.058
hes-0.057
ined-0.057
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
 each+0.127
 to+0.125
 me+0.116
 be+0.098
 for+0.090
Neg logit contributions
bridge-0.296
pron-0.250
ined-0.247
aster-0.244
iche-0.238
Token When
Token ID1649
Feature activation-3.16e-02

Pos logit contributions
 to+0.054
 each+0.051
 me+0.048
 be+0.044
 for+0.044
Neg logit contributions
bridge-0.058
pron-0.046
ined-0.045
iche-0.045
aster-0.044
Token he
Token ID339
Feature activation-1.71e-03

Pos logit contributions
 to+0.062
 each+0.059
 me+0.055
 be+0.050
 for+0.050
Neg logit contributions
bridge-0.070
pron-0.056
iche-0.056
ined-0.056
aster-0.052
Token arrived
Token ID5284
Feature activation+1.34e-02

Pos logit contributions
 each+0.002
 to+0.002
 me+0.002
 for+0.002
 home+0.002
Neg logit contributions
bridge-0.004
pron-0.004
aster-0.004
ined-0.004
iche-0.004
Token,
Token ID11
Feature activation-7.23e-03

Pos logit contributions
bridge+0.045
pron+0.040
ined+0.039
iche+0.038
aster+0.038
Neg logit contributions
 to-0.008
 each-0.007
 me-0.006
 be-0.004
 for-0.004
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
 each+0.030
 me+0.028
 be+0.026
 to+0.025
 home+0.022
Neg logit contributions
bridge-0.108
ined-0.092
hes-0.088
iche-0.085
pron-0.085
Token the
Token ID262
Feature activation-3.42e-03

Pos logit contributions
 to+0.016
 each+0.015
 me+0.014
 for+0.012
 be+0.012
Neg logit contributions
bridge-0.032
pron-0.027
ined-0.026
iche-0.026
aster-0.026
Token monkeys
Token ID26697
Feature activation+2.50e-02

Pos logit contributions
 to+0.006
 each+0.005
 me+0.005
 for+0.004
 be+0.004
Neg logit contributions
bridge-0.008
iche-0.007
pron-0.007
ined-0.006
aster-0.006
Token hol
Token ID6039
Feature activation-1.57e-02

Pos logit contributions
bridge+0.068
ined+0.057
pron+0.056
iche+0.056
aster+0.055
Neg logit contributions
 to-0.033
 each-0.030
 me-0.029
 be-0.025
 for-0.024
Tokenle
Token ID293
Feature activation-9.47e-02

Pos logit contributions
 to+0.032
 each+0.030
 me+0.028
 for+0.027
,+0.026
Neg logit contributions
bridge-0.033
pron-0.027
iche-0.026
ined-0.025
aster-0.024
Tokenred
Token ID445
Feature activation-1.07e-01

Pos logit contributions
 me+0.129
 to+0.123
 each+0.110
 be+0.109
 home+0.105
Neg logit contributions
bridge-0.237
ined-0.201
hes-0.182
iche-0.178
aster-0.173
Token from
Token ID422
Feature activation-3.35e-02

Pos logit contributions
 me+0.120
 to+0.093
 be+0.090
 each+0.087
 us+0.078
Neg logit contributions
bridge-0.282
ined-0.253
hes-0.232
iche-0.225
pron-0.219
Token the
Token ID262
Feature activation-2.49e-02

Pos logit contributions
 to+0.042
 each+0.037
 me+0.033
's+0.033
 be+0.032
Neg logit contributions
bridge-0.090
ined-0.076
aster-0.076
iche-0.075
pron-0.075
Token trees
Token ID7150
Feature activation-3.18e-02

Pos logit contributions
 to+0.052
 each+0.051
 me+0.044
's+0.043
 for+0.043
Neg logit contributions
bridge-0.048
iche-0.036
pron-0.036
aster-0.035
ined-0.034
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 to+0.025
 me+0.025
 each+0.024
 be+0.018
 for+0.015
Neg logit contributions
bridge-0.099
ined-0.084
iche-0.083
pron-0.082
hes-0.081
Token One
Token ID1881
Feature activation+1.91e-02

Pos logit contributions
 to+0.018
 each+0.016
 me+0.015
,+0.013
 be+0.013
Neg logit contributions
bridge-0.033
ined-0.028
pron-0.028
atmeal-0.027
iche-0.027
Token see
Token ID766
Feature activation-1.50e-02

Pos logit contributions
 to+0.056
 each+0.053
 me+0.048
 for+0.043
 be+0.041
Neg logit contributions
bridge-0.111
pron-0.093
iche-0.092
ined-0.091
aster-0.091
Token what
Token ID644
Feature activation+3.33e-02

Pos logit contributions
 to+0.018
 each+0.017
 me+0.015
 be+0.013
 for+0.013
Neg logit contributions
bridge-0.043
pron-0.037
ined-0.036
iche-0.036
aster-0.036
Token other
Token ID584
Feature activation+2.53e-02

Pos logit contributions
bridge+0.080
pron+0.066
ined+0.064
iche+0.063
aster+0.063
Neg logit contributions
 to-0.053
 each-0.051
 me-0.047
 for-0.043
 be-0.043
Token adventures
Token ID17545
Feature activation-2.93e-02

Pos logit contributions
bridge+0.067
pron+0.054
ined+0.054
iche+0.054
aster+0.053
Neg logit contributions
 to-0.035
 each-0.033
 me-0.030
 be-0.026
 for-0.026
Token lay
Token ID3830
Feature activation-3.20e-02

Pos logit contributions
 to+0.053
 each+0.052
 me+0.050
 be+0.045
 for+0.043
Neg logit contributions
bridge-0.063
ined-0.049
pron-0.049
aster-0.047
iche-0.047
Token ahead
Token ID4058
Feature activation-1.07e-01

Pos logit contributions
 to+0.035
 each+0.032
 me+0.028
 for+0.024
,+0.022
Neg logit contributions
bridge-0.098
pron-0.085
iche-0.084
ined-0.083
aster-0.082
Token.
Token ID13
Feature activation-2.74e-02

Pos logit contributions
 to+0.097
 each+0.091
 me+0.079
 be+0.065
 for+0.060
Neg logit contributions
bridge-0.333
pron-0.286
ined-0.283
iche-0.280
aster-0.278
Token<|endoftext|>
Token ID50256
Feature activation-1.48e-02

Pos logit contributions
 to+0.039
 each+0.037
 me+0.034
 be+0.030
 for+0.030
Neg logit contributions
bridge-0.071
pron-0.060
iche-0.059
ined-0.059
aster-0.058
TokenOnce
Token ID7454
Feature activation-9.47e-03

Pos logit contributions
 to+0.017
 each+0.016
 me+0.014
 be+0.012
 for+0.012
Neg logit contributions
bridge-0.043
pron-0.037
ined-0.036
iche-0.036
aster-0.035
Token upon
Token ID2402
Feature activation-3.79e-02

Pos logit contributions
 to+0.013
 each+0.013
 me+0.011
 be+0.010
 for+0.010
Neg logit contributions
bridge-0.024
pron-0.021
ined-0.021
iche-0.020
aster-0.020
Token a
Token ID257
Feature activation-4.52e-02

Pos logit contributions
 to+0.026
 each+0.022
 me+0.017
 be+0.013
 for+0.012
Neg logit contributions
bridge-0.131
pron-0.116
ined-0.115
iche-0.115
atmeal-0.111
Token was
Token ID373
Feature activation-2.06e-02

Pos logit contributions
 to+0.008
 each+0.008
 me+0.006
 for+0.006
 home+0.006
Neg logit contributions
bridge-0.020
pron-0.017
ined-0.017
aster-0.016
iche-0.016
Token so
Token ID523
Feature activation+1.60e-02

Pos logit contributions
 to+0.024
 each+0.023
 me+0.020
 for+0.017
 be+0.017
Neg logit contributions
bridge-0.060
pron-0.050
ined-0.050
iche-0.050
aster-0.049
Token excited
Token ID6568
Feature activation-1.39e-02

Pos logit contributions
bridge+0.038
pron+0.031
ined+0.031
iche+0.031
aster+0.030
Neg logit contributions
 to-0.028
 each-0.026
 me-0.024
 be-0.022
 for-0.022
Token he
Token ID339
Feature activation+9.62e-04

Pos logit contributions
 each+0.010
 me+0.009
 ho+0.008
 h+0.008
 to+0.008
Neg logit contributions
bridge-0.039
ined-0.034
pron-0.034
aster-0.033
iche-0.033
Token ran
Token ID4966
Feature activation-1.33e-02

Pos logit contributions
bridge+0.002
pron+0.002
ined+0.002
hes+0.002
atmeal+0.002
Neg logit contributions
 each-0.001
 to-0.001
 me-0.001
 for-0.001
 home-0.001
Token ahead
Token ID4058
Feature activation-1.07e-01

Pos logit contributions
 each+0.013
 to+0.011
 me+0.011
 be+0.009
's+0.009
Neg logit contributions
bridge-0.038
ined-0.031
pron-0.031
aster-0.031
iche-0.030
Token and
Token ID290
Feature activation-2.90e-02

Pos logit contributions
 each+0.100
 to+0.093
 me+0.091
 be+0.075
 heads+0.071
Neg logit contributions
bridge-0.304
pron-0.259
aster-0.257
ined-0.253
iche-0.246
Token his
Token ID465
Feature activation-2.24e-02

Pos logit contributions
 each+0.039
 to+0.038
 me+0.034
 for+0.030
 be+0.030
Neg logit contributions
bridge-0.070
pron-0.059
aster-0.057
ined-0.057
iche-0.057
Token parents
Token ID3397
Feature activation-2.54e-02

Pos logit contributions
 to+0.030
 each+0.030
 for+0.025
's+0.024
 me+0.024
Neg logit contributions
bridge-0.052
pron-0.047
kin-0.043
ined-0.041
 sincere-0.041
Token could
Token ID714
Feature activation-4.45e-02

Pos logit contributions
 each+0.027
 me+0.026
 to+0.026
 be+0.022
 heads+0.020
Neg logit contributions
bridge-0.067
pron-0.057
ined-0.054
atmeal-0.052
aster-0.052
Token hardly
Token ID8941
Feature activation-3.12e-02

Pos logit contributions
 to+0.072
 each+0.069
 me+0.062
 be+0.057
 for+0.056
Neg logit contributions
bridge-0.112
pron-0.093
ined-0.092
iche-0.091
aster-0.088
Token snacks
Token ID28487
Feature activation-2.35e-02

Pos logit contributions
bridge+0.027
pron+0.022
ined+0.022
iche+0.022
aster+0.021
Neg logit contributions
 to-0.018
 each-0.017
 me-0.015
 for-0.014
 be-0.014
Token for
Token ID329
Feature activation-3.27e-02

Pos logit contributions
 me+0.018
 each+0.015
 be+0.014
 to+0.012
 heads+0.011
Neg logit contributions
bridge-0.071
pron-0.061
aster-0.059
kin-0.059
hes-0.058
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
 to+0.038
 each+0.034
 me+0.031
 be+0.027
 for+0.026
Neg logit contributions
bridge-0.095
pron-0.081
iche-0.080
ined-0.080
aster-0.080
Token long
Token ID890
Feature activation-3.09e-02

Pos logit contributions
 to+0.030
 each+0.028
 me+0.026
 be+0.023
 for+0.023
Neg logit contributions
bridge-0.045
pron-0.037
ined-0.037
iche-0.037
aster-0.035
Token journey
Token ID7002
Feature activation-3.91e-02

Pos logit contributions
 to+0.059
 each+0.058
 me+0.053
 for+0.048
 be+0.048
Neg logit contributions
bridge-0.065
pron-0.053
aster-0.051
iche-0.051
ined-0.050
Token ahead
Token ID4058
Feature activation-1.06e-01

Pos logit contributions
 me+0.034
 each+0.032
 to+0.031
 be+0.025
 ho+0.023
Neg logit contributions
bridge-0.117
ined-0.097
pron-0.097
aster-0.095
iche-0.093
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 to+0.088
 each+0.086
 me+0.080
 be+0.062
 for+0.052
Neg logit contributions
bridge-0.326
pron-0.281
ined-0.277
aster-0.273
iche-0.273
Token And
Token ID843
Feature activation-3.08e-02

Pos logit contributions
 to+0.036
 each+0.034
 me+0.031
 for+0.028
 be+0.028
Neg logit contributions
bridge-0.055
pron-0.046
ined-0.045
iche-0.043
atmeal-0.042
Token all
Token ID477
Feature activation-5.90e-02

Pos logit contributions
 to+0.040
 me+0.038
 each+0.038
 be+0.032
 h+0.031
Neg logit contributions
bridge-0.075
aster-0.062
pron-0.062
ined-0.061
iche-0.061
Token the
Token ID262
Feature activation-3.54e-03

Pos logit contributions
 to+0.065
 each+0.059
 me+0.053
 be+0.045
 for+0.043
Neg logit contributions
bridge-0.175
pron-0.149
ined-0.148
iche-0.147
aster-0.146
Token way
Token ID835
Feature activation-1.32e-02

Pos logit contributions
 to+0.005
 each+0.005
 me+0.005
 for+0.004
 be+0.004
Neg logit contributions
bridge-0.009
pron-0.008
ined-0.008
iche-0.008
aster-0.007
Token Maybe
Token ID6674
Feature activation-2.19e-02

Pos logit contributions
 to+0.032
 each+0.030
 me+0.029
 be+0.027
's+0.025
Neg logit contributions
bridge-0.053
pron-0.046
ined-0.045
atmeal-0.042
iche-0.042
Token someone
Token ID2130
Feature activation-1.85e-02

Pos logit contributions
 to+0.025
 each+0.023
 me+0.020
 for+0.017
 be+0.017
Neg logit contributions
bridge-0.062
pron-0.053
aster-0.053
iche-0.052
ined-0.051
Token will
Token ID481
Feature activation-3.58e-02

Pos logit contributions
 each+0.022
 to+0.022
 me+0.020
 be+0.017
 home+0.017
Neg logit contributions
bridge-0.047
pron-0.040
ined-0.040
aster-0.039
hes-0.039
Token come
Token ID1282
Feature activation-2.68e-02

Pos logit contributions
 to+0.044
 each+0.043
 me+0.037
 for+0.032
's+0.031
Neg logit contributions
bridge-0.095
pron-0.081
aster-0.080
iche-0.080
ined-0.078
Token and
Token ID290
Feature activation-3.35e-02

Pos logit contributions
 each+0.021
 to+0.020
 me+0.018
 be+0.013
's+0.012
Neg logit contributions
bridge-0.081
pron-0.069
iche-0.068
ined-0.068
aster-0.067
Token free
Token ID1479
Feature activation-1.06e-01

Pos logit contributions
 each+0.044
 to+0.043
 me+0.038
 for+0.034
,+0.034
Neg logit contributions
bridge-0.083
aster-0.072
pron-0.070
iche-0.069
atmeal-0.066
Token you
Token ID345
Feature activation-2.03e-02

Pos logit contributions
 to+0.130
 each+0.128
 me+0.112
 be+0.104
 for+0.096
Neg logit contributions
bridge-0.284
pron-0.233
aster-0.232
iche-0.231
ined-0.230
Token,
Token ID11
Feature activation-2.20e-02

Pos logit contributions
 each+0.025
 to+0.023
 me+0.022
 be+0.018
 h+0.017
Neg logit contributions
bridge-0.051
iche-0.042
ined-0.042
pron-0.041
hes-0.041
Token but
Token ID475
Feature activation-2.82e-02

Pos logit contributions
 to+0.028
 each+0.027
 me+0.023
 be+0.020
 for+0.020
Neg logit contributions
bridge-0.058
pron-0.048
aster-0.047
iche-0.046
 narrative-0.044
Token I
Token ID314
Feature activation-2.45e-02

Pos logit contributions
 to+0.033
 each+0.032
 me+0.027
's+0.025
,+0.024
Neg logit contributions
bridge-0.072
pron-0.063
aster-0.061
ined-0.060
iche-0.060
Token have
Token ID423
Feature activation-2.51e-02

Pos logit contributions
 each+0.032
 to+0.031
 me+0.030
 for+0.024
 be+0.023
Neg logit contributions
bridge-0.061
iche-0.052
pron-0.052
aster-0.051
ined-0.051
Token
Token ID198
Feature activation-2.02e-02

Pos logit contributions
bridge+0.080
pron+0.068
iche+0.068
ined+0.067
aster+0.066
Neg logit contributions
 to-0.034
 each-0.032
 me-0.029
 for-0.025
 be-0.024
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 to+0.024
 each+0.022
 me+0.020
 for+0.017
 be+0.017
Neg logit contributions
bridge-0.059
pron-0.050
ined-0.050
iche-0.050
aster-0.048
Token*
Token ID9
Feature activation-8.06e-02

Pos logit contributions
 to+0.033
 each+0.032
 me+0.029
 be+0.027
 for+0.026
Neg logit contributions
bridge-0.037
pron-0.031
iche-0.031
ined-0.031
obb-0.029
TokenThe
Token ID464
Feature activation-2.30e-02

Pos logit contributions
 to+0.123
 each+0.115
 me+0.107
 be+0.098
 for+0.097
Neg logit contributions
bridge-0.207
pron-0.174
iche-0.171
ined-0.168
atmeal-0.166
Token End
Token ID5268
Feature activation-8.01e-03

Pos logit contributions
 to+0.041
 each+0.039
 me+0.037
 be+0.034
 for+0.033
Neg logit contributions
bridge-0.054
pron-0.044
iche-0.044
ined-0.043
aster-0.043
Token*
Token ID9
Feature activation-1.05e-01

Pos logit contributions
 to+0.006
 each+0.005
 me+0.004
 for+0.003
 be+0.003
Neg logit contributions
bridge-0.027
pron-0.023
ined-0.023
iche-0.023
aster-0.023
Token<|endoftext|>
Token ID50256
Feature activation-1.64e-02

Pos logit contributions
 to+0.161
 each+0.152
 me+0.139
 be+0.124
 for+0.124
Neg logit contributions
bridge-0.272
pron-0.227
ined-0.223
iche-0.223
aster-0.217
TokenOnce
Token ID7454
Feature activation-9.01e-03

Pos logit contributions
 to+0.023
 each+0.021
 me+0.019
,+0.017
 be+0.016
Neg logit contributions
bridge-0.045
pron-0.039
iche-0.037
ined-0.037
aster-0.037
Token upon
Token ID2402
Feature activation-3.44e-02

Pos logit contributions
 to+0.012
 each+0.012
 me+0.011
 be+0.010
 for+0.009
Neg logit contributions
bridge-0.023
pron-0.020
ined-0.019
iche-0.019
aster-0.019
Token a
Token ID257
Feature activation-4.02e-02

Pos logit contributions
 to+0.024
 each+0.020
 me+0.015
 be+0.011
 for+0.010
Neg logit contributions
bridge-0.119
pron-0.105
ined-0.105
iche-0.104
atmeal-0.101
Token time
Token ID640
Feature activation-2.10e-02

Pos logit contributions
 to+0.081
 each+0.078
 me+0.073
 for+0.067
 be+0.067
Neg logit contributions
bridge-0.080
pron-0.066
iche-0.064
ined-0.063
aster-0.062
Token tail
Token ID7894
Feature activation-3.60e-02

Pos logit contributions
 to+0.011
 each+0.011
 me+0.009
 for+0.008
's+0.008
Neg logit contributions
bridge-0.019
pron-0.016
iche-0.015
ined-0.015
aster-0.015
Token pleaded
Token ID16407
Feature activation-2.26e-02

Pos logit contributions
 each+0.041
 to+0.040
 me+0.039
 for+0.032
 home+0.032
Neg logit contributions
bridge-0.097
ined-0.080
pron-0.078
hes-0.075
iche-0.074
Token to
Token ID284
Feature activation-4.51e-02

Pos logit contributions
 each+0.016
 to+0.016
 me+0.016
 be+0.011
 scar+0.011
Neg logit contributions
bridge-0.066
pron-0.058
aster-0.057
ined-0.057
iche-0.056
Token be
Token ID307
Feature activation-3.91e-02

Pos logit contributions
 to+0.057
 each+0.051
 me+0.045
's+0.043
 for+0.043
Neg logit contributions
bridge-0.116
aster-0.101
pron-0.097
iche-0.095
ined-0.095
Token set
Token ID900
Feature activation-7.09e-02

Pos logit contributions
 to+0.056
 each+0.053
 me+0.047
 be+0.042
 for+0.042
Neg logit contributions
bridge-0.100
ined-0.083
pron-0.082
iche-0.081
aster-0.081
Token free
Token ID1479
Feature activation-1.05e-01

Pos logit contributions
 to+0.066
 each+0.065
 me+0.059
 be+0.048
's+0.041
Neg logit contributions
bridge-0.207
pron-0.173
ined-0.173
aster-0.173
iche-0.171
Token,
Token ID11
Feature activation-1.77e-02

Pos logit contributions
 each+0.080
 me+0.078
 to+0.070
 be+0.066
 scar+0.056
Neg logit contributions
bridge-0.303
ined-0.246
iest-0.237
iche-0.236
pron-0.235
Token but
Token ID475
Feature activation-2.04e-02

Pos logit contributions
 to+0.020
 each+0.020
 me+0.018
 be+0.015
's+0.014
Neg logit contributions
bridge-0.048
pron-0.041
iche-0.039
aster-0.039
ined-0.038
Token the
Token ID262
Feature activation-9.90e-03

Pos logit contributions
 to+0.023
 each+0.022
 me+0.020
 be+0.016
 for+0.016
Neg logit contributions
bridge-0.056
pron-0.048
aster-0.047
ined-0.047
iche-0.046
Token wolves
Token ID23214
Feature activation-2.83e-02

Pos logit contributions
 to+0.015
 each+0.014
 me+0.012
 for+0.011
 be+0.011
Neg logit contributions
bridge-0.025
pron-0.021
iche-0.020
ined-0.020
aster-0.020
Token would
Token ID561
Feature activation+2.09e-04

Pos logit contributions
 to+0.037
 each+0.035
 me+0.031
 for+0.027
 be+0.026
Neg logit contributions
bridge-0.076
ined-0.064
pron-0.064
iche-0.062
aster-0.061
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
 to+0.016
 each+0.015
 me+0.014
 be+0.014
's+0.013
Neg logit contributions
bridge-0.041
aster-0.039
pron-0.035
iche-0.035
ined-0.034
Token young
Token ID1862
Feature activation-1.91e-02

Pos logit contributions
 to+0.016
 each+0.016
 me+0.014
 for+0.012
 be+0.012
Neg logit contributions
bridge-0.027
pron-0.023
iche-0.021
aster-0.021
ined-0.021
Token girl
Token ID2576
Feature activation-2.69e-02

Pos logit contributions
 to+0.032
 each+0.032
 me+0.031
 be+0.027
 for+0.027
Neg logit contributions
bridge-0.040
pron-0.033
aster-0.031
iche-0.029
ined-0.029
Token to
Token ID284
Feature activation-2.45e-02

Pos logit contributions
 me+0.012
 each+0.009
 to+0.005
 be+0.004
 us+0.004
Neg logit contributions
bridge-0.087
aster-0.074
pron-0.074
hes-0.074
ined-0.073
Token go
Token ID467
Feature activation-2.78e-02

Pos logit contributions
 to+0.029
 each+0.027
 me+0.024
 for+0.022
's+0.020
Neg logit contributions
bridge-0.066
pron-0.056
aster-0.056
ined-0.055
iche-0.055
Token ahead
Token ID4058
Feature activation-1.05e-01

Pos logit contributions
 each+0.026
 to+0.026
 me+0.023
 be+0.018
's+0.017
Neg logit contributions
bridge-0.082
ined-0.070
pron-0.069
iche-0.068
aster-0.068
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
 each+0.097
 me+0.090
 to+0.086
 be+0.076
 heads+0.073
Neg logit contributions
bridge-0.287
pron-0.245
aster-0.241
ined-0.239
kin-0.231
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
 to+0.031
 each+0.030
 me+0.028
 for+0.026
 be+0.025
Neg logit contributions
bridge-0.048
pron-0.040
iche-0.040
aster-0.039
ined-0.038
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
 to+0.016
 each+0.015
 me+0.013
 be+0.011
 for+0.011
Neg logit contributions
bridge-0.043
pron-0.037
ined-0.037
iche-0.036
aster-0.036
TokenThe
Token ID464
Feature activation-2.31e-02

Pos logit contributions
 to+0.025
 each+0.024
 me+0.023
 be+0.021
 for+0.020
Neg logit contributions
bridge-0.026
pron-0.021
ined-0.021
iche-0.020
atmeal-0.020
Token old
Token ID1468
Feature activation+1.31e-03

Pos logit contributions
 to+0.030
 each+0.029
 me+0.025
 for+0.022
 be+0.022
Neg logit contributions
bridge-0.063
pron-0.054
ined-0.052
iche-0.052
aster-0.052
Token
Token ID198
Feature activation+5.68e-03

Pos logit contributions
bridge+0.016
pron+0.014
ined+0.014
iche+0.014
aster+0.013
Neg logit contributions
 to-0.005
 each-0.005
 me-0.004
 for-0.004
 be-0.004
TokenJack
Token ID14295
Feature activation-5.68e-03

Pos logit contributions
bridge+0.013
pron+0.011
ined+0.010
iche+0.010
atmeal+0.010
Neg logit contributions
 to-0.011
 each-0.011
 me-0.010
 be-0.009
,-0.009
Token asked
Token ID1965
Feature activation+1.63e-02

Pos logit contributions
 to+0.007
 each+0.007
 for+0.006
 home+0.005
 me+0.005
Neg logit contributions
bridge-0.014
ined-0.012
pron-0.012
iche-0.012
aster-0.011
Token Sandy
Token ID17581
Feature activation-4.46e-02

Pos logit contributions
bridge+0.038
aster+0.031
ined+0.031
iche+0.030
kin+0.030
Neg logit contributions
 to-0.022
 each-0.022
 me-0.020
 be-0.019
's-0.018
Token hes
Token ID10818
Feature activation-1.18e-02

Pos logit contributions
 each+0.034
 me+0.032
 to+0.030
 be+0.024
 heads+0.021
Neg logit contributions
bridge-0.127
iche-0.112
aster-0.111
pron-0.111
ined-0.108
Tokenit
Token ID270
Feature activation-1.05e-01

Pos logit contributions
 to+0.020
 each+0.018
's+0.016
 me+0.015
 and+0.015
Neg logit contributions
bridge-0.031
aster-0.025
iche-0.024
atmeal-0.023
opl-0.023
Tokenantly
Token ID3875
Feature activation-4.24e-02

Pos logit contributions
 to+0.134
 each+0.131
 me+0.112
 be+0.098
 for+0.097
Neg logit contributions
bridge-0.278
ined-0.238
pron-0.236
iche-0.229
hes-0.224
Token,
Token ID11
Feature activation-2.21e-02

Pos logit contributions
 each+0.026
 me+0.022
 to+0.021
 home+0.012
 be+0.012
Neg logit contributions
bridge-0.135
pron-0.120
ined-0.118
iche-0.116
aster-0.115
Token "
Token ID366
Feature activation-3.56e-02

Pos logit contributions
 to+0.021
 each+0.020
 me+0.017
 be+0.015
 home+0.014
Neg logit contributions
bridge-0.065
pron-0.057
ined-0.056
iche-0.055
aster-0.054
TokenHow
Token ID2437
Feature activation+2.17e-02

Pos logit contributions
 to+0.074
 each+0.071
 me+0.070
 be+0.061
's+0.060
Neg logit contributions
bridge-0.073
ined-0.059
iche-0.057
pron-0.057
obb-0.057
Token do
Token ID466
Feature activation+9.17e-03

Pos logit contributions
bridge+0.061
pron+0.051
ined+0.050
iche+0.050
aster+0.049
Neg logit contributions
 to-0.028
 each-0.026
 me-0.024
 be-0.021
 for-0.021
Token still
Token ID991
Feature activation-3.18e-02

Pos logit contributions
 to+0.020
 each+0.019
 me+0.018
 for+0.016
 be+0.015
Neg logit contributions
bridge-0.030
pron-0.025
aster-0.024
ined-0.024
iche-0.024
Token see
Token ID766
Feature activation+7.30e-03

Pos logit contributions
 to+0.044
 each+0.042
 me+0.039
 for+0.034
 be+0.032
Neg logit contributions
bridge-0.083
pron-0.069
ined-0.069
aster-0.068
iche-0.068
Token the
Token ID262
Feature activation+1.35e-03

Pos logit contributions
bridge+0.021
ined+0.018
pron+0.017
aster+0.017
iche+0.017
Neg logit contributions
 to-0.008
 each-0.007
 me-0.007
 be-0.006
 for-0.006
Token boys
Token ID6510
Feature activation+5.48e-03

Pos logit contributions
bridge+0.003
pron+0.003
iche+0.003
ined+0.003
aster+0.003
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.002
 for-0.002
Token running
Token ID2491
Feature activation-1.06e-02

Pos logit contributions
bridge+0.015
pron+0.012
ined+0.012
iche+0.012
aster+0.012
Neg logit contributions
 to-0.007
 each-0.007
 me-0.006
 be-0.005
 for-0.005
Token ahead
Token ID4058
Feature activation-1.05e-01

Pos logit contributions
 each+0.013
 to+0.012
 me+0.012
 be+0.011
 ho+0.010
Neg logit contributions
bridge-0.027
ined-0.021
obb-0.020
roads-0.020
aster-0.020
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
 to+0.081
 each+0.080
 me+0.072
 be+0.058
 for+0.047
Neg logit contributions
bridge-0.324
pron-0.277
ined-0.273
aster-0.272
iche-0.268
Token Suddenly
Token ID24975
Feature activation+2.34e-03

Pos logit contributions
 to+0.040
 each+0.038
 me+0.035
 for+0.032
 be+0.032
Neg logit contributions
bridge-0.057
pron-0.047
iche-0.047
ined-0.046
aster-0.045
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
bridge+0.007
aster+0.006
ined+0.006
pron+0.006
iche+0.006
Neg logit contributions
 to-0.002
 each-0.002
 me-0.001
 be-0.001
 for-0.001
Token one
Token ID530
Feature activation+8.88e-04

Pos logit contributions
 to+0.017
 each+0.016
 me+0.013
 for+0.011
,+0.011
Neg logit contributions
bridge-0.051
pron-0.044
iche-0.044
ined-0.044
atmeal-0.042
Token of
Token ID286
Feature activation+3.10e-02

Pos logit contributions
bridge+0.002
pron+0.002
ined+0.002
aster+0.002
iche+0.002
Neg logit contributions
 to-0.001
 each-0.001
 me-0.001
 be-0.001
 for-0.001
Token happiest
Token ID49414
Feature activation+1.98e-02

Pos logit contributions
 to+0.036
 each+0.036
 me+0.032
 be+0.028
 for+0.026
Neg logit contributions
bridge-0.070
pron-0.060
ined-0.060
hes-0.058
iche-0.058
Token when
Token ID618
Feature activation-4.08e-02

Pos logit contributions
bridge+0.056
pron+0.047
ined+0.047
iche+0.046
aster+0.045
Neg logit contributions
 to-0.022
 each-0.022
 me-0.019
 be-0.017
 for-0.016
Token he
Token ID339
Feature activation-6.47e-03

Pos logit contributions
 to+0.069
 each+0.068
 me+0.063
 be+0.057
 for+0.056
Neg logit contributions
bridge-0.092
ined-0.073
pron-0.073
iche-0.072
aster-0.071
Token was
Token ID373
Feature activation-3.14e-02

Pos logit contributions
 to+0.009
 each+0.008
 me+0.007
 for+0.007
 be+0.006
Neg logit contributions
bridge-0.017
ined-0.014
pron-0.014
atmeal-0.014
iche-0.014
Token running
Token ID2491
Feature activation-1.58e-02

Pos logit contributions
 to+0.045
 each+0.043
 me+0.039
 be+0.035
 for+0.034
Neg logit contributions
bridge-0.083
pron-0.070
ined-0.069
iche-0.068
aster-0.067
Token free
Token ID1479
Feature activation-1.04e-01

Pos logit contributions
 each+0.015
 to+0.015
 me+0.014
 be+0.012
 ho+0.011
Neg logit contributions
bridge-0.045
ined-0.038
pron-0.037
iche-0.036
aster-0.036
Token with
Token ID351
Feature activation-2.83e-02

Pos logit contributions
 each+0.089
 to+0.081
 me+0.078
 be+0.071
's+0.058
Neg logit contributions
bridge-0.309
ined-0.255
pron-0.249
aster-0.244
iche-0.242
Token the
Token ID262
Feature activation-1.16e-02

Pos logit contributions
 to+0.035
 each+0.032
 me+0.029
 be+0.026
 for+0.026
Neg logit contributions
bridge-0.077
ined-0.063
pron-0.063
aster-0.063
iche-0.062
Token wind
Token ID2344
Feature activation-2.48e-02

Pos logit contributions
 to+0.019
 each+0.019
 me+0.016
,+0.015
 for+0.014
Neg logit contributions
bridge-0.027
pron-0.022
ined-0.020
aster-0.020
iche-0.020
Token.
Token ID13
Feature activation-2.60e-02

Pos logit contributions
 to+0.020
 each+0.019
 me+0.018
 be+0.013
 for+0.011
Neg logit contributions
bridge-0.077
pron-0.066
ined-0.066
iche-0.064
aster-0.063
Token What
Token ID1867
Feature activation+2.97e-02

Pos logit contributions
 to+0.034
 each+0.032
 me+0.029
 for+0.025
 be+0.025
Neg logit contributions
bridge-0.072
pron-0.060
iche-0.060
ined-0.060
aster-0.059
Token
Token ID198
Feature activation+2.23e-02

Pos logit contributions
bridge+0.070
pron+0.060
ined+0.059
iche+0.059
aster+0.058
Neg logit contributions
 to-0.032
 each-0.030
 me-0.027
 for-0.023
 be-0.023
TokenThey
Token ID2990
Feature activation+1.70e-02

Pos logit contributions
bridge+0.058
ined+0.048
pron+0.047
iche+0.047
atmeal+0.047
Neg logit contributions
 to-0.035
 each-0.033
 me-0.032
 be-0.028
,-0.026
Token saw
Token ID2497
Feature activation+3.69e-02

Pos logit contributions
bridge+0.045
pron+0.037
aster+0.037
ined+0.037
iche+0.036
Neg logit contributions
 to-0.024
 each-0.021
 me-0.019
 for-0.018
 be-0.017
Token the
Token ID262
Feature activation+2.73e-02

Pos logit contributions
bridge+0.109
ined+0.091
pron+0.091
aster+0.091
iche+0.090
Neg logit contributions
 to-0.038
 each-0.034
 me-0.033
 be-0.028
 for-0.026
Token wood
Token ID4898
Feature activation+2.04e-02

Pos logit contributions
bridge+0.067
ined+0.055
pron+0.055
iche+0.055
aster+0.055
Neg logit contributions
 to-0.046
 each-0.043
 me-0.041
 for-0.037
 be-0.036
Tokencut
Token ID8968
Feature activation+1.22e-01

Pos logit contributions
bridge+0.060
pron+0.051
ined+0.051
iche+0.051
aster+0.050
Neg logit contributions
 to-0.023
 each-0.021
 me-0.019
 be-0.016
 for-0.016
Tokenter
Token ID353
Feature activation-8.27e-03

Pos logit contributions
bridge+0.291
pron+0.241
iche+0.236
ined+0.233
aster+0.228
Neg logit contributions
 to-0.210
 each-0.201
 me-0.185
 for-0.169
 be-0.168
Token coming
Token ID2406
Feature activation+1.92e-02

Pos logit contributions
 to+0.011
 each+0.011
 me+0.010
 be+0.008
 for+0.008
Neg logit contributions
bridge-0.022
ined-0.019
pron-0.018
iche-0.018
aster-0.018
Token to
Token ID284
Feature activation-1.03e-02

Pos logit contributions
bridge+0.048
ined+0.040
aster+0.039
iche+0.038
hes+0.037
Neg logit contributions
 me-0.019
 each-0.019
 scar-0.016
 to-0.016
 h-0.014
Token the
Token ID262
Feature activation+7.23e-03

Pos logit contributions
 to+0.012
 each+0.010
 me+0.009
's+0.009
 for+0.008
Neg logit contributions
bridge-0.028
aster-0.026
iche-0.024
pron-0.023
ined-0.023
Token woods
Token ID16479
Feature activation-1.13e-02

Pos logit contributions
bridge+0.016
pron+0.013
iche+0.013
ined+0.013
aster+0.013
Neg logit contributions
 to-0.013
 each-0.013
 me-0.011
 be-0.011
 for-0.011
Token and
Token ID290
Feature activation+8.40e-02

Pos logit contributions
bridge+0.013
pron+0.011
ined+0.011
iche+0.011
aster+0.011
Neg logit contributions
 to-0.005
 each-0.005
 me-0.004
 be-0.004
 for-0.004
Token hopped
Token ID45230
Feature activation+1.73e-02

Pos logit contributions
bridge+0.227
pron+0.191
ined+0.187
iche+0.186
aster+0.185
Neg logit contributions
 to-0.109
 each-0.103
 me-0.092
 be-0.080
 for-0.080
Token away
Token ID1497
Feature activation+2.29e-02

Pos logit contributions
bridge+0.050
pron+0.043
ined+0.042
iche+0.042
aster+0.041
Neg logit contributions
 to-0.020
 each-0.019
 me-0.016
 be-0.014
 for-0.014
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
bridge+0.063
ined+0.055
pron+0.053
iche+0.053
kin+0.052
Neg logit contributions
 each-0.020
 me-0.018
 to-0.018
's-0.014
 be-0.014
Token The
Token ID383
Feature activation+2.52e-02

Pos logit contributions
bridge+0.087
pron+0.073
iche+0.072
ined+0.072
aster+0.070
Neg logit contributions
 to-0.049
 each-0.046
 me-0.042
 for-0.038
 be-0.037
Token je
Token ID11223
Feature activation+1.25e-01

Pos logit contributions
bridge+0.066
pron+0.055
ined+0.055
iche+0.054
aster+0.053
Neg logit contributions
 to-0.037
 each-0.035
 me-0.032
 be-0.028
 for-0.028
Tokenep
Token ID538
Feature activation+3.54e-02

Pos logit contributions
bridge+0.236
pron+0.188
iche+0.186
aster+0.179
ined+0.177
Neg logit contributions
 to-0.270
 each-0.257
 me-0.248
 be-0.229
 for-0.227
Token watched
Token ID7342
Feature activation+2.11e-02

Pos logit contributions
bridge+0.099
ined+0.083
pron+0.080
iche+0.078
aster+0.077
Neg logit contributions
 to-0.044
 each-0.042
 me-0.038
 for-0.032
 be-0.031
Token it
Token ID340
Feature activation-4.04e-03

Pos logit contributions
bridge+0.062
pron+0.052
aster+0.052
ined+0.052
iche+0.051
Neg logit contributions
 to-0.020
 each-0.019
 me-0.016
 be-0.014
's-0.014
Token go
Token ID467
Feature activation-4.22e-03

Pos logit contributions
 each+0.005
 to+0.005
 me+0.005
 heads+0.003
 ho+0.003
Neg logit contributions
bridge-0.010
ined-0.008
aster-0.008
iche-0.008
pron-0.008
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
 each+0.004
 to+0.004
 me+0.004
 be+0.003
's+0.003
Neg logit contributions
bridge-0.012
ined-0.010
pron-0.010
iche-0.010
aster-0.010
Token box
Token ID3091
Feature activation-3.47e-03

Pos logit contributions
 to+0.010
 each+0.010
 me+0.009
 be+0.008
 for+0.008
Neg logit contributions
bridge-0.017
pron-0.015
ined-0.014
iche-0.014
aster-0.014
Token full
Token ID1336
Feature activation-1.01e-02

Pos logit contributions
 to+0.004
 each+0.004
 me+0.003
 for+0.003
,+0.003
Neg logit contributions
bridge-0.010
iche-0.009
pron-0.009
ined-0.009
aster-0.009
Token of
Token ID286
Feature activation+4.95e-02

Pos logit contributions
 each+0.018
 to+0.018
 me+0.017
 be+0.016
 ho+0.015
Neg logit contributions
bridge-0.021
pron-0.017
ined-0.016
aster-0.015
iche-0.015
Token beautiful
Token ID4950
Feature activation-2.64e-02

Pos logit contributions
bridge+0.126
pron+0.104
ined+0.102
iche+0.100
aster+0.098
Neg logit contributions
 to-0.075
 each-0.070
 me-0.064
 be-0.059
 for-0.057
Token or
Token ID393
Feature activation+5.20e-02

Pos logit contributions
 to+0.040
 each+0.039
 me+0.035
 be+0.033
 for+0.032
Neg logit contributions
bridge-0.066
pron-0.053
ined-0.053
iche-0.051
aster-0.049
Tokennam
Token ID7402
Feature activation+1.30e-01

Pos logit contributions
bridge+0.095
pron+0.073
ined+0.070
iche+0.069
aster+0.067
Neg logit contributions
 to-0.116
 each-0.112
 me-0.106
 be-0.099
 for-0.098
Tokenents
Token ID658
Feature activation+5.87e-03

Pos logit contributions
bridge+0.335
pron+0.278
iche+0.277
ined+0.276
hes+0.264
Neg logit contributions
 to-0.184
 each-0.175
 me-0.167
 for-0.151
 be-0.150
Token.
Token ID13
Feature activation-5.94e-04

Pos logit contributions
bridge+0.018
pron+0.015
ined+0.015
iche+0.015
aster+0.015
Neg logit contributions
 to-0.006
 each-0.006
 me-0.005
 be-0.004
 for-0.004
Token
Token ID220
Feature activation+4.63e-02

Pos logit contributions
 each+0.001
 to+0.001
 me+0.001
 for+0.001
 be+0.001
Neg logit contributions
bridge-0.001
pron-0.001
iche-0.001
hes-0.001
aster-0.001
Token
Token ID198
Feature activation+2.76e-03

Pos logit contributions
bridge+0.127
pron+0.107
ined+0.106
iche+0.106
hes+0.105
Neg logit contributions
 to-0.057
 each-0.055
 me-0.050
 be-0.044
 for-0.042
Token
Token ID198
Feature activation+3.53e-03

Pos logit contributions
bridge+0.007
pron+0.006
iche+0.006
ined+0.006
hes+0.006
Neg logit contributions
 to-0.003
 each-0.003
 me-0.003
 be-0.003
 for-0.003
Token time
Token ID640
Feature activation+2.11e-03

Pos logit contributions
bridge+0.032
pron+0.027
ined+0.026
iche+0.026
aster+0.025
Neg logit contributions
 to-0.022
 each-0.021
 me-0.019
 be-0.017
 for-0.017
Token playing
Token ID2712
Feature activation+2.86e-02

Pos logit contributions
bridge+0.006
pron+0.005
ined+0.005
iche+0.005
aster+0.005
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.002
 for-0.002
Token with
Token ID351
Feature activation+1.79e-02

Pos logit contributions
bridge+0.084
pron+0.071
ined+0.070
iche+0.069
aster+0.069
Neg logit contributions
 to-0.031
 each-0.029
 me-0.026
 be-0.023
 for-0.021
Token her
Token ID607
Feature activation+4.49e-03

Pos logit contributions
bridge+0.055
pron+0.047
ined+0.046
iche+0.046
aster+0.045
Neg logit contributions
 to-0.017
 each-0.016
 me-0.014
 be-0.012
 for-0.012
Token or
Token ID393
Feature activation+5.29e-02

Pos logit contributions
bridge+0.012
pron+0.010
ined+0.010
iche+0.010
aster+0.010
Neg logit contributions
 to-0.006
 each-0.006
 me-0.005
 be-0.005
 for-0.005
Tokennam
Token ID7402
Feature activation+1.62e-01

Pos logit contributions
bridge+0.139
pron+0.119
ined+0.116
iche+0.115
aster+0.113
Neg logit contributions
 to-0.072
 each-0.069
 me-0.062
 be-0.055
 for-0.054
Tokenents
Token ID658
Feature activation+2.17e-02

Pos logit contributions
bridge+0.425
pron+0.359
ined+0.353
iche+0.351
aster+0.335
Neg logit contributions
 to-0.224
 each-0.210
 me-0.198
 for-0.180
 be-0.176
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
bridge+0.069
pron+0.059
ined+0.059
iche+0.058
aster+0.057
Neg logit contributions
 to-0.019
 each-0.017
 me-0.016
 be-0.012
 for-0.011
Token making
Token ID1642
Feature activation+1.93e-02

Pos logit contributions
bridge+0.049
pron+0.042
ined+0.040
hes+0.038
aster+0.037
Neg logit contributions
 to-0.029
 each-0.028
 me-0.027
 be-0.023
 for-0.023
Token her
Token ID607
Feature activation-1.43e-03

Pos logit contributions
bridge+0.052
pron+0.042
ined+0.042
iche+0.042
hes+0.041
Neg logit contributions
 to-0.024
 each-0.022
 me-0.020
 be-0.020
 ho-0.019
Token special
Token ID2041
Feature activation+9.14e-03

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 for+0.002
 be+0.002
Neg logit contributions
bridge-0.004
pron-0.003
iche-0.003
ined-0.003
aster-0.003
Token of
Token ID286
Feature activation+3.26e-02

Pos logit contributions
 to+0.043
 each+0.041
 me+0.038
 be+0.034
 for+0.033
Neg logit contributions
bridge-0.071
pron-0.059
ined-0.059
iche-0.058
aster-0.057
Token fun
Token ID1257
Feature activation-1.53e-02

Pos logit contributions
bridge+0.093
pron+0.079
ined+0.078
iche+0.078
aster+0.076
Neg logit contributions
 to-0.040
 each-0.038
 me-0.034
 be-0.029
 for-0.029
Token watching
Token ID4964
Feature activation-2.97e-03

Pos logit contributions
 to+0.013
 each+0.011
 me+0.009
 for+0.007
 be+0.007
Neg logit contributions
bridge-0.051
pron-0.045
iche-0.044
ined-0.043
aster-0.043
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
 to+0.004
 each+0.004
 me+0.004
 be+0.004
 for+0.003
Neg logit contributions
bridge-0.007
iche-0.006
aster-0.006
pron-0.006
ined-0.006
Token ac
Token ID936
Feature activation+2.24e-02

Pos logit contributions
 to+0.031
 each+0.029
 me+0.027
 be+0.025
 for+0.025
Neg logit contributions
bridge-0.028
pron-0.022
iche-0.021
ined-0.021
aster-0.020
Tokenrob
Token ID22609
Feature activation+1.45e-01

Pos logit contributions
bridge+0.026
pron+0.017
iche+0.016
 narrative+0.015
ined+0.014
Neg logit contributions
 to-0.064
 each-0.062
 me-0.061
 for-0.058
,-0.057
Tokenats
Token ID1381
Feature activation-2.89e-02

Pos logit contributions
bridge+0.298
pron+0.237
ined+0.224
iche+0.224
aster+0.215
Neg logit contributions
 to-0.295
 each-0.287
 me-0.266
 for-0.251
,-0.244
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
 to+0.028
 each+0.026
 me+0.023
 be+0.019
 for+0.018
Neg logit contributions
bridge-0.088
pron-0.075
ined-0.074
iche-0.074
aster-0.073
Token clown
Token ID25573
Feature activation-8.41e-03

Pos logit contributions
 to+0.029
 each+0.027
 me+0.027
's+0.023
 for+0.023
Neg logit contributions
bridge-0.050
aster-0.040
pron-0.040
iche-0.039
ined-0.038
Tokens
Token ID82
Feature activation-5.72e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.007
 be+0.005
 for+0.005
Neg logit contributions
bridge-0.025
iche-0.021
pron-0.021
ined-0.021
atmeal-0.021
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
 to+0.051
 each+0.047
 me+0.042
 be+0.032
 for+0.031
Neg logit contributions
bridge-0.181
pron-0.155
ined-0.154
iche-0.152
aster-0.150
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
 me+0.012
 each+0.012
 ho+0.009
 heads+0.008
 us+0.008
Neg logit contributions
bridge-0.062
aster-0.053
pron-0.053
iche-0.053
hes-0.052
Token her
Token ID607
Feature activation-1.93e-02

Pos logit contributions
 to+0.016
 me+0.016
 each+0.015
 ho+0.014
 for+0.013
Neg logit contributions
bridge-0.024
pron-0.020
hes-0.019
aster-0.019
ined-0.018
Token patch
Token ID8529
Feature activation-3.40e-02

Pos logit contributions
 to+0.024
 each+0.023
 me+0.021
 be+0.018
 for+0.018
Neg logit contributions
bridge-0.054
pron-0.046
ined-0.045
iche-0.044
aster-0.043
Token on
Token ID319
Feature activation+4.05e-02

Pos logit contributions
 each+0.039
 to+0.038
 me+0.036
 be+0.029
 ho+0.028
Neg logit contributions
bridge-0.096
ined-0.081
pron-0.079
hes-0.077
kin-0.076
Token her
Token ID607
Feature activation-1.19e-02

Pos logit contributions
bridge+0.124
pron+0.108
aster+0.106
iche+0.106
ined+0.105
Neg logit contributions
 to-0.037
 each-0.034
 me-0.029
 for-0.025
 be-0.024
Token knee
Token ID10329
Feature activation+4.73e-02

Pos logit contributions
 each+0.016
 to+0.016
 me+0.014
ur+0.013
 for+0.013
Neg logit contributions
bridge-0.026
pron-0.024
atmeal-0.022
aster-0.022
kin-0.021
Token.
Token ID13
Feature activation-8.55e-03

Pos logit contributions
bridge+0.138
pron+0.119
ined+0.118
hes+0.114
aster+0.112
Neg logit contributions
 each-0.041
 me-0.039
 to-0.038
 ho-0.030
 be-0.029
Token Her
Token ID2332
Feature activation+1.91e-02

Pos logit contributions
 to+0.013
 each+0.013
 me+0.012
 be+0.010
 for+0.010
Neg logit contributions
bridge-0.021
pron-0.018
iche-0.018
ined-0.018
aster-0.017
Token mom
Token ID1995
Feature activation+2.11e-02

Pos logit contributions
bridge+0.053
pron+0.047
ined+0.044
iche+0.044
kin+0.043
Neg logit contributions
 to-0.023
 each-0.022
 me-0.019
 for-0.017
's-0.015
Token was
Token ID373
Feature activation-1.10e-02

Pos logit contributions
bridge+0.053
ined+0.048
pron+0.044
hes+0.044
atmeal+0.041
Neg logit contributions
 to-0.028
 each-0.024
 me-0.023
 home-0.022
 for-0.022
Token proud
Token ID6613
Feature activation-1.17e-02

Pos logit contributions
 to+0.015
 each+0.014
 me+0.013
 be+0.012
 for+0.011
Neg logit contributions
bridge-0.030
iche-0.026
pron-0.026
aster-0.025
ined-0.025
Token to
Token ID284
Feature activation-2.36e-03

Pos logit contributions
 me+0.009
 each+0.009
 to+0.006
 be+0.006
's+0.005
Neg logit contributions
bridge-0.031
ined-0.028
aster-0.028
hes-0.027
iche-0.027
Token splash
Token ID22870
Feature activation-1.09e-02

Pos logit contributions
 to+0.004
 each+0.003
 me+0.003
 for+0.003
 be+0.003
Neg logit contributions
bridge-0.006
pron-0.005
iche-0.005
ined-0.005
aster-0.005
Token!
Token ID0
Feature activation-1.32e-02

Pos logit contributions
 to+0.012
 each+0.011
 me+0.011
 be+0.010
 ho+0.009
Neg logit contributions
bridge-0.030
ined-0.025
obb-0.024
iche-0.024
aster-0.024
Token He
Token ID679
Feature activation+2.93e-02

Pos logit contributions
 to+0.026
 each+0.024
 me+0.022
 be+0.021
 for+0.021
Neg logit contributions
bridge-0.030
pron-0.024
ined-0.024
iche-0.023
aster-0.022
Token felt
Token ID2936
Feature activation+3.57e-02

Pos logit contributions
bridge+0.074
ined+0.063
pron+0.062
hes+0.060
iche+0.060
Neg logit contributions
 to-0.040
 each-0.036
 me-0.034
 for-0.031
 home-0.029
Token so
Token ID523
Feature activation+4.77e-02

Pos logit contributions
bridge+0.110
pron+0.094
ined+0.093
iche+0.093
aster+0.091
Neg logit contributions
 to-0.036
 each-0.033
 me-0.029
 be-0.024
 for-0.024
Token content
Token ID2695
Feature activation-2.34e-02

Pos logit contributions
bridge+0.120
pron+0.100
iche+0.097
ined+0.097
aster+0.095
Neg logit contributions
 to-0.077
 each-0.073
 me-0.067
 be-0.061
 for-0.060
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
 me+0.022
 each+0.021
 to+0.019
 be+0.016
's+0.016
Neg logit contributions
bridge-0.067
ined-0.057
pron-0.055
aster-0.055
hes-0.054
Token happy
Token ID3772
Feature activation+1.11e-02

Pos logit contributions
bridge+0.041
pron+0.034
ined+0.034
iche+0.033
aster+0.032
Neg logit contributions
 to-0.029
 each-0.028
 me-0.026
 for-0.023
 be-0.023
Token playing
Token ID2712
Feature activation+3.22e-02

Pos logit contributions
bridge+0.032
ined+0.027
pron+0.026
aster+0.025
iche+0.025
Neg logit contributions
 me-0.010
 each-0.010
 to-0.009
 be-0.008
 ho-0.008
Token in
Token ID287
Feature activation+1.23e-02

Pos logit contributions
bridge+0.098
pron+0.083
ined+0.083
iche+0.082
aster+0.080
Neg logit contributions
 to-0.034
 each-0.031
 me-0.028
 be-0.023
 for-0.022
Token likes
Token ID7832
Feature activation+1.72e-02

Pos logit contributions
 to+0.022
 each+0.020
 me+0.018
 for+0.017
 be+0.017
Neg logit contributions
bridge-0.035
ined-0.031
pron-0.030
aster-0.029
iche-0.029
Token to
Token ID284
Feature activation+2.75e-02

Pos logit contributions
bridge+0.060
pron+0.055
aster+0.054
iche+0.053
atmeal+0.052
Neg logit contributions
 each-0.003
 me-0.002
 to-0.001
 be-0.001
's+0.000
Token play
Token ID711
Feature activation+4.31e-02

Pos logit contributions
bridge+0.076
ined+0.064
pron+0.064
iche+0.063
hes+0.060
Neg logit contributions
 to-0.038
 each-0.036
 me-0.032
 be-0.029
 for-0.027
Token with
Token ID351
Feature activation+1.54e-02

Pos logit contributions
bridge+0.136
pron+0.117
iche+0.117
ined+0.115
aster+0.114
Neg logit contributions
 to-0.043
 each-0.039
 me-0.034
 for-0.028
,-0.027
Token his
Token ID465
Feature activation+2.39e-02

Pos logit contributions
bridge+0.038
pron+0.031
ined+0.031
iche+0.031
aster+0.030
Neg logit contributions
 to-0.026
 each-0.024
 me-0.022
 be-0.020
 for-0.020
Token drum
Token ID13026
Feature activation+5.12e-02

Pos logit contributions
bridge+0.057
pron+0.049
ined+0.047
iche+0.046
aster+0.045
Neg logit contributions
 to-0.038
 each-0.036
 me-0.033
 for-0.030
 be-0.030
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
bridge+0.161
pron+0.139
ined+0.138
iche+0.134
aster+0.133
Neg logit contributions
 to-0.043
 each-0.040
 me-0.036
 be-0.029
 for-0.024
Token He
Token ID679
Feature activation+3.89e-02

Pos logit contributions
bridge+0.046
pron+0.040
ined+0.039
iche+0.038
hes+0.037
Neg logit contributions
 to-0.034
 each-0.032
 me-0.028
 be-0.027
,-0.026
Token has
Token ID468
Feature activation+1.37e-02

Pos logit contributions
bridge+0.100
pron+0.083
iche+0.082
ined+0.082
aster+0.079
Neg logit contributions
 to-0.061
 each-0.057
 me-0.052
 be-0.047
 for-0.047
Token a
Token ID257
Feature activation+1.75e-02

Pos logit contributions
bridge+0.046
pron+0.040
iche+0.040
ined+0.040
aster+0.039
Neg logit contributions
 to-0.012
 each-0.010
 me-0.008
 be-0.006
 for-0.006
Token big
Token ID1263
Feature activation+2.76e-02

Pos logit contributions
bridge+0.054
ined+0.046
aster+0.045
iche+0.045
atmeal+0.045
Neg logit contributions
 to-0.021
 each-0.017
 me-0.017
 be-0.015
,-0.014
Token orig
Token ID1796
Feature activation+1.72e-02

Pos logit contributions
bridge+0.039
ined+0.031
pron+0.031
aster+0.031
iche+0.030
Neg logit contributions
 to-0.032
 each-0.031
 me-0.028
 be-0.026
 for-0.026
Tokenami
Token ID6277
Feature activation+1.22e-02

Pos logit contributions
bridge+0.032
pron+0.025
iche+0.024
hes+0.023
ined+0.023
Neg logit contributions
 to-0.037
 each-0.036
 me-0.035
 for-0.031
 be-0.031
Token figure
Token ID3785
Feature activation+2.89e-02

Pos logit contributions
bridge+0.035
pron+0.029
aster+0.028
ined+0.028
iche+0.028
Neg logit contributions
 each-0.014
 to-0.014
 me-0.012
 be-0.010
 for-0.009
Token.
Token ID13
Feature activation+9.59e-03

Pos logit contributions
bridge+0.091
pron+0.077
ined+0.076
aster+0.074
hes+0.074
Neg logit contributions
 each-0.024
 to-0.024
 me-0.022
 be-0.017
 ho-0.014
Token But
Token ID887
Feature activation+1.99e-02

Pos logit contributions
bridge+0.021
pron+0.016
iche+0.016
aster+0.016
ined+0.016
Neg logit contributions
 each-0.016
 to-0.016
 me-0.015
 ho-0.014
 for-0.014
Token no
Token ID645
Feature activation+7.54e-02

Pos logit contributions
bridge+0.056
pron+0.048
ined+0.048
aster+0.047
iche+0.046
Neg logit contributions
 to-0.022
 each-0.022
 me-0.019
 be-0.016
 for-0.015
Token matter
Token ID2300
Feature activation+3.68e-02

Pos logit contributions
bridge+0.196
pron+0.162
iche+0.161
ined+0.160
hes+0.154
Neg logit contributions
 to-0.119
 each-0.112
 me-0.103
 for-0.094
 be-0.094
Token how
Token ID703
Feature activation+2.70e-02

Pos logit contributions
bridge+0.092
ined+0.081
otion+0.078
aster+0.078
pron+0.075
Neg logit contributions
 to-0.040
 each-0.036
 me-0.034
 heads-0.030
by-0.030
Token hard
Token ID1327
Feature activation+2.38e-02

Pos logit contributions
bridge+0.071
pron+0.061
ined+0.059
iche+0.059
aster+0.058
Neg logit contributions
 to-0.038
 each-0.036
 me-0.033
 be-0.029
 for-0.029
Token she
Token ID673
Feature activation+3.35e-02

Pos logit contributions
bridge+0.058
pron+0.048
aster+0.047
ined+0.046
iche+0.045
Neg logit contributions
 to-0.035
 each-0.034
 me-0.032
 be-0.028
 for-0.027
Token tried
Token ID3088
Feature activation+1.32e-02

Pos logit contributions
bridge+0.087
pron+0.074
ined+0.074
iche+0.072
aster+0.071
Neg logit contributions
 to-0.048
 each-0.044
 me-0.040
 for-0.037
 be-0.035
Token know
Token ID760
Feature activation-1.75e-02

Pos logit contributions
bridge+0.114
pron+0.098
ined+0.098
iche+0.095
atmeal+0.092
Neg logit contributions
 to-0.051
 each-0.047
 me-0.042
 be-0.038
 for-0.037
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 to+0.019
 each+0.018
 me+0.016
 be+0.013
 for+0.013
Neg logit contributions
bridge-0.052
pron-0.044
ined-0.044
iche-0.043
aster-0.043
Token "
Token ID366
Feature activation+7.72e-04

Pos logit contributions
bridge+0.025
pron+0.020
ined+0.020
iche+0.020
aster+0.019
Neg logit contributions
 to-0.020
 each-0.019
 me-0.018
 for-0.016
 be-0.016
TokenNo
Token ID2949
Feature activation+3.95e-02

Pos logit contributions
bridge+0.002
pron+0.002
ined+0.002
atmeal+0.002
iche+0.002
Neg logit contributions
 to-0.001
 each-0.001
 me-0.001
 be-0.001
's-0.001
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
bridge+0.120
pron+0.107
aster+0.107
iche+0.104
atmeal+0.101
Neg logit contributions
 each-0.029
 to-0.027
 me-0.019
 home-0.014
 back-0.013
Token what
Token ID644
Feature activation+7.17e-02

Pos logit contributions
bridge+0.043
ined+0.037
iche+0.036
pron+0.036
aster+0.035
Neg logit contributions
 to-0.019
 each-0.018
 me-0.016
 be-0.015
 for-0.014
Token do
Token ID466
Feature activation+3.28e-02

Pos logit contributions
bridge+0.207
pron+0.176
ined+0.174
iche+0.173
aster+0.169
Neg logit contributions
 to-0.089
 each-0.082
 me-0.073
 be-0.063
 for-0.063
Token they
Token ID484
Feature activation+1.98e-02

Pos logit contributions
bridge+0.095
pron+0.081
aster+0.079
iche+0.079
ined+0.079
Neg logit contributions
 to-0.036
 each-0.034
 me-0.030
 for-0.026
 be-0.025
Token eat
Token ID4483
Feature activation+1.34e-03

Pos logit contributions
bridge+0.047
iche+0.042
pron+0.041
ined+0.040
aster+0.040
Neg logit contributions
 to-0.030
 each-0.027
 me-0.024
 home-0.021
 for-0.021
Token?"
Token ID1701
Feature activation+3.24e-02

Pos logit contributions
bridge+0.002
pron+0.002
iche+0.002
ined+0.002
aster+0.002
Neg logit contributions
 to-0.003
 each-0.003
 me-0.002
 be-0.002
 for-0.002
Token he
Token ID339
Feature activation+3.44e-02

Pos logit contributions
bridge+0.084
ined+0.072
pron+0.071
iche+0.070
atmeal+0.067
Neg logit contributions
 to-0.056
 each-0.050
 me-0.047
,-0.043
 be-0.042
Token find
Token ID1064
Feature activation+2.64e-02

Pos logit contributions
bridge+0.039
aster+0.034
pron+0.033
iche+0.033
ined+0.032
Neg logit contributions
 to-0.019
 each-0.018
 me-0.015
 for-0.014
's-0.013
Token a
Token ID257
Feature activation-4.89e-03

Pos logit contributions
bridge+0.070
ined+0.059
aster+0.059
hes+0.059
pron+0.057
Neg logit contributions
 to-0.029
 each-0.028
 me-0.024
 be-0.024
 ho-0.023
Token place
Token ID1295
Feature activation+3.36e-03

Pos logit contributions
 each+0.007
 to+0.006
 me+0.006
 for+0.006
's+0.005
Neg logit contributions
bridge-0.011
pron-0.011
iche-0.010
aster-0.009
ined-0.009
Token to
Token ID284
Feature activation+1.32e-02

Pos logit contributions
bridge+0.010
pron+0.009
ined+0.009
aster+0.009
kin+0.009
Neg logit contributions
 me-0.002
 each-0.001
 ho-0.001
 heads-0.001
 be-0.001
Token cover
Token ID3002
Feature activation-1.53e-02

Pos logit contributions
bridge+0.034
aster+0.029
pron+0.028
ined+0.028
iche+0.028
Neg logit contributions
 to-0.018
 each-0.017
 me-0.015
 for-0.014
's-0.013
Token herself
Token ID5223
Feature activation-6.73e-03

Pos logit contributions
 to+0.015
 each+0.014
 me+0.012
 be+0.011
 for+0.010
Neg logit contributions
bridge-0.045
ined-0.039
pron-0.038
iche-0.037
aster-0.037
Token from
Token ID422
Feature activation+1.91e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.006
 be+0.005
 ho+0.005
Neg logit contributions
bridge-0.019
pron-0.016
ined-0.016
iche-0.016
hes-0.016
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
bridge+0.057
pron+0.049
ined+0.049
iche+0.048
aster+0.048
Neg logit contributions
 to-0.020
 each-0.019
 me-0.016
 be-0.014
 for-0.013
Token rain
Token ID6290
Feature activation-8.34e-03

Pos logit contributions
bridge+0.025
pron+0.020
ined+0.019
iche+0.019
obb+0.019
Neg logit contributions
 to-0.021
 each-0.020
 me-0.018
 for-0.017
's-0.017
Token.
Token ID13
Feature activation+5.03e-03

Pos logit contributions
 me+0.008
 each+0.008
 to+0.007
 ho+0.007
 be+0.006
Neg logit contributions
bridge-0.022
pron-0.018
atmeal-0.018
ined-0.018
iche-0.018
Token She
Token ID1375
Feature activation+6.43e-03

Pos logit contributions
bridge+0.007
pron+0.006
iche+0.006
Here+0.006
obb+0.006
Neg logit contributions
 each-0.010
 ho-0.009
 to-0.009
 h-0.008
 for-0.008
Token the
Token ID262
Feature activation+2.40e-02

Pos logit contributions
bridge+0.084
aster+0.074
pron+0.071
iche+0.069
ined+0.069
Neg logit contributions
 to-0.034
 each-0.030
 me-0.027
's-0.025
 be-0.025
Token animals
Token ID4695
Feature activation+8.93e-03

Pos logit contributions
bridge+0.059
pron+0.050
iche+0.048
ined+0.047
aster+0.047
Neg logit contributions
 to-0.037
 each-0.036
 me-0.032
 be-0.029
 for-0.029
Token for
Token ID329
Feature activation-5.83e-03

Pos logit contributions
bridge+0.029
pron+0.025
aster+0.025
ined+0.024
iche+0.024
Neg logit contributions
 to-0.005
 each-0.005
 me-0.005
 be-0.004
 h-0.002
Token being
Token ID852
Feature activation-1.69e-03

Pos logit contributions
 to+0.008
 each+0.007
 me+0.006
's+0.006
 be+0.006
Neg logit contributions
bridge-0.016
ined-0.013
pron-0.013
aster-0.013
hes-0.013
Token so
Token ID523
Feature activation+3.13e-02

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 be+0.002
 for+0.002
Neg logit contributions
bridge-0.005
ined-0.004
pron-0.004
iche-0.004
hes-0.004
Token patient
Token ID5827
Feature activation-2.81e-02

Pos logit contributions
bridge+0.077
pron+0.063
ined+0.063
iche+0.062
hes+0.061
Neg logit contributions
 to-0.048
 each-0.046
 me-0.043
 for-0.038
 be-0.038
Token and
Token ID290
Feature activation-6.62e-03

Pos logit contributions
 to+0.020
 each+0.019
 me+0.017
 be+0.013
 for+0.010
Neg logit contributions
bridge-0.088
pron-0.077
ined-0.076
iche-0.074
aster-0.074
Token left
Token ID1364
Feature activation-2.34e-02

Pos logit contributions
 to+0.010
 each+0.010
 me+0.009
 for+0.008
 be+0.008
Neg logit contributions
bridge-0.016
pron-0.013
ined-0.013
hes-0.013
aster-0.013
Token the
Token ID262
Feature activation-1.51e-03

Pos logit contributions
 to+0.021
 each+0.020
 me+0.018
 be+0.015
's+0.014
Neg logit contributions
bridge-0.071
pron-0.061
ined-0.061
aster-0.060
iche-0.060
Token stable
Token ID8245
Feature activation-5.97e-03

Pos logit contributions
 to+0.003
 each+0.003
 me+0.002
 be+0.002
 for+0.002
Neg logit contributions
bridge-0.004
pron-0.003
iche-0.003
ined-0.003
aster-0.003
Token.
Token ID13
Feature activation+8.25e-04

Pos logit contributions
 to+0.005
 each+0.005
 me+0.004
 be+0.003
 for+0.003
Neg logit contributions
bridge-0.019
pron-0.016
ined-0.016
iche-0.016
aster-0.016
Token,
Token ID11
Feature activation-9.16e-04

Pos logit contributions
 each+0.002
 me+0.002
 heads+0.002
 to+0.002
 hands+0.002
Neg logit contributions
bridge-0.007
he-0.006
aster-0.006
opl-0.005
pron-0.005
Token but
Token ID475
Feature activation-1.64e-03

Pos logit contributions
 each+0.001
 to+0.001
 me+0.001
 be+0.001
 home+0.001
Neg logit contributions
bridge-0.002
pron-0.002
aster-0.002
iche-0.002
hes-0.002
Token she
Token ID673
Feature activation+7.72e-03

Pos logit contributions
 to+0.002
 each+0.002
 me+0.002
 for+0.002
 be+0.002
Neg logit contributions
bridge-0.004
pron-0.003
aster-0.003
kin-0.003
ined-0.003
Token was
Token ID373
Feature activation-5.09e-03

Pos logit contributions
bridge+0.020
pron+0.018
ined+0.017
hes+0.017
aster+0.016
Neg logit contributions
 to-0.010
 each-0.009
 me-0.008
 for-0.008
 home-0.007
Token a
Token ID257
Feature activation-2.12e-02

Pos logit contributions
 to+0.006
 each+0.005
 me+0.005
 for+0.004
 be+0.004
Neg logit contributions
bridge-0.015
pron-0.013
iche-0.013
ined-0.013
aster-0.013
Token little
Token ID1310
Feature activation-9.73e-03

Pos logit contributions
 to+0.022
 each+0.020
 me+0.018
 be+0.015
 for+0.014
Neg logit contributions
bridge-0.066
ined-0.057
iche-0.055
pron-0.055
aster-0.054
Token scared
Token ID12008
Feature activation+1.25e-02

Pos logit contributions
 to+0.017
 each+0.017
 me+0.016
 be+0.014
 for+0.014
Neg logit contributions
bridge-0.020
pron-0.017
iche-0.016
aster-0.016
ined-0.016
Token.
Token ID13
Feature activation-4.18e-03

Pos logit contributions
bridge+0.041
pron+0.035
iche+0.035
ined+0.035
aster+0.034
Neg logit contributions
 to-0.010
 each-0.010
 me-0.008
 be-0.006
 for-0.006
Token She
Token ID1375
Feature activation-7.11e-03

Pos logit contributions
 each+0.007
 to+0.007
 ho+0.006
 me+0.006
 for+0.006
Neg logit contributions
bridge-0.008
iche-0.007
pron-0.007
hes-0.006
ined-0.006
Token asked
Token ID1965
Feature activation+1.77e-02

Pos logit contributions
 to+0.011
 each+0.010
 for+0.009
 home+0.008
,+0.008
Neg logit contributions
bridge-0.016
pron-0.014
ined-0.014
hes-0.014
aster-0.013
Token her
Token ID607
Feature activation-3.59e-02

Pos logit contributions
bridge+0.040
aster+0.037
ined+0.034
pron+0.034
hes+0.034
Neg logit contributions
 to-0.020
 me-0.019
 be-0.019
 each-0.019
's-0.018
Token p
Token ID279
Feature activation+7.23e-04

Pos logit contributions
 to+0.021
 each+0.020
 me+0.018
 for+0.016
 be+0.016
Neg logit contributions
bridge-0.035
pron-0.030
ined-0.029
iche-0.028
aster-0.028
Tokenaj
Token ID1228
Feature activation+1.63e-02

Pos logit contributions
bridge+0.001
pron+0.001
ined+0.001
atmeal+0.001
iche+0.001
Neg logit contributions
 each-0.002
 to-0.002
 me-0.002
 from-0.002
 be-0.002
Tokenamas
Token ID17485
Feature activation-4.33e-02

Pos logit contributions
bridge+0.021
pron+0.017
ined+0.015
iche+0.014
atmeal+0.014
Neg logit contributions
 to-0.044
 each-0.043
 for-0.040
 me-0.040
 be-0.038
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 me+0.030
 each+0.029
 to+0.026
 be+0.021
 ho+0.016
Neg logit contributions
bridge-0.134
pron-0.120
ined-0.118
hes-0.116
aster-0.112
Token went
Token ID1816
Feature activation-1.91e-02

Pos logit contributions
 to+0.017
 each+0.016
 me+0.014
 for+0.013
 be+0.012
Neg logit contributions
bridge-0.034
pron-0.029
aster-0.028
iche-0.028
ined-0.028
Token to
Token ID284
Feature activation-3.46e-02

Pos logit contributions
 each+0.008
 to+0.005
 me+0.004
 be+0.002
's+0.001
Neg logit contributions
bridge-0.067
pron-0.060
iche-0.059
ined-0.058
hes-0.058
Token bed
Token ID3996
Feature activation-4.47e-02

Pos logit contributions
 each+0.053
 to+0.051
 me+0.045
 ho+0.042
 for+0.042
Neg logit contributions
bridge-0.071
aster-0.067
pron-0.064
iche-0.062
hes-0.059
Token.
Token ID13
Feature activation-6.83e-04

Pos logit contributions
 to+0.035
 each+0.031
 me+0.026
 be+0.020
 for+0.019
Neg logit contributions
bridge-0.148
pron-0.129
ined-0.128
iche-0.127
aster-0.125
Token
Token ID198
Feature activation+6.78e-03

Pos logit contributions
 to+0.001
 each+0.001
 me+0.001
 for+0.001
 be+0.001
Neg logit contributions
bridge-0.002
pron-0.002
iche-0.002
ined-0.002
aster-0.002
Token
Token ID198
Feature activation+7.59e-03

Pos logit contributions
bridge+0.019
pron+0.017
ined+0.016
iche+0.016
aster+0.016
Neg logit contributions
 to-0.008
 each-0.007
 me-0.007
 be-0.006
 for-0.006
TokenIn
Token ID818
Feature activation-6.79e-03

Pos logit contributions
bridge+0.016
ined+0.013
pron+0.013
iche+0.013
atmeal+0.013
Neg logit contributions
 to-0.016
 each-0.015
 me-0.014
 be-0.013
,-0.013
Token.
Token ID13
Feature activation-2.91e-03

Pos logit contributions
bridge+0.027
kin+0.022
pron+0.022
hes+0.022
iche+0.022
Neg logit contributions
 each-0.012
 to-0.012
 me-0.011
 be-0.010
 scar-0.009
Token We
Token ID775
Feature activation+1.62e-03

Pos logit contributions
 to+0.004
 each+0.004
 me+0.004
 be+0.004
's+0.003
Neg logit contributions
bridge-0.008
pron-0.007
ined-0.007
atmeal-0.006
iche-0.006
Token are
Token ID389
Feature activation-2.59e-02

Pos logit contributions
bridge+0.004
iche+0.003
pron+0.003
ined+0.003
aster+0.003
Neg logit contributions
 to-0.003
 each-0.003
 me-0.002
 for-0.002
 be-0.002
Token going
Token ID1016
Feature activation+8.56e-03

Pos logit contributions
 to+0.039
 each+0.037
 me+0.034
 for+0.031
 be+0.030
Neg logit contributions
bridge-0.069
pron-0.058
ined-0.057
iche-0.056
aster-0.055
Token to
Token ID284
Feature activation-1.12e-02

Pos logit contributions
bridge+0.031
pron+0.027
ined+0.027
iche+0.027
aster+0.027
Neg logit contributions
 each-0.002
 to-0.001
 me-0.001
 be+0.000
's+0.001
Token see
Token ID766
Feature activation+2.24e-02

Pos logit contributions
 to+0.013
 each+0.013
 me+0.011
 for+0.010
s+0.010
Neg logit contributions
bridge-0.029
aster-0.026
iche-0.024
pron-0.023
hes-0.023
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
bridge+0.056
aster+0.049
hes+0.047
iche+0.046
ined+0.045
Neg logit contributions
 to-0.026
 each-0.023
 be-0.022
 me-0.022
 ho-0.020
Token elephants
Token ID25368
Feature activation+3.68e-02

Pos logit contributions
bridge+0.029
pron+0.023
iche+0.023
ined+0.022
aster+0.022
Neg logit contributions
 to-0.026
 each-0.025
 me-0.022
 for-0.021
 be-0.020
Token next
Token ID1306
Feature activation-8.81e-03

Pos logit contributions
bridge+0.107
aster+0.090
iche+0.088
pron+0.088
atmeal+0.088
Neg logit contributions
 to-0.035
 each-0.034
 me-0.030
 be-0.024
 home-0.022
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 each+0.014
 to+0.012
 me+0.012
 be+0.012
's+0.012
Neg logit contributions
bridge-0.018
pron-0.015
iche-0.015
aster-0.014
ined-0.014
Token They
Token ID1119
Feature activation-2.21e-03

Pos logit contributions
 to+0.018
 each+0.017
 me+0.016
 be+0.014
 for+0.014
Neg logit contributions
bridge-0.033
pron-0.029
ined-0.028
iche-0.027
atmeal-0.027
Token came
Token ID1625
Feature activation-5.35e-02

Pos logit contributions
 to+0.026
 each+0.025
 me+0.022
 for+0.020
 be+0.020
Neg logit contributions
bridge-0.048
pron-0.040
ined-0.039
aster-0.038
iche-0.038
Token in
Token ID287
Feature activation-4.15e-02

Pos logit contributions
 each+0.034
 me+0.029
 to+0.025
 be+0.024
 scar+0.023
Neg logit contributions
bridge-0.156
ined-0.137
iche-0.133
pron-0.132
aster-0.131
Token and
Token ID290
Feature activation-3.07e-02

Pos logit contributions
 each+0.031
 to+0.030
 me+0.028
's+0.021
 be+0.020
Neg logit contributions
bridge-0.126
pron-0.111
aster-0.110
ined-0.107
iche-0.107
Token it
Token ID340
Feature activation-2.55e-02

Pos logit contributions
 to+0.043
 each+0.041
 me+0.038
 be+0.033
 for+0.033
Neg logit contributions
bridge-0.080
pron-0.068
ined-0.066
iche-0.066
aster-0.065
Token was
Token ID373
Feature activation-3.59e-02

Pos logit contributions
 to+0.027
 each+0.027
 me+0.022
 for+0.020
 ho+0.020
Neg logit contributions
bridge-0.075
ined-0.064
pron-0.063
hes-0.061
iche-0.061
Token time
Token ID640
Feature activation-5.03e-02

Pos logit contributions
 to+0.059
 each+0.055
 me+0.052
 for+0.047
 be+0.046
Neg logit contributions
bridge-0.091
iche-0.074
pron-0.074
ined-0.073
atmeal-0.072
Token for
Token ID329
Feature activation-4.34e-02

Pos logit contributions
 each+0.018
 me+0.016
 to+0.008
 be+0.005
's+0.002
Neg logit contributions
bridge-0.170
pron-0.157
ined-0.150
iche-0.150
hes-0.149
Token dinner
Token ID8073
Feature activation-6.37e-02

Pos logit contributions
 to+0.066
 each+0.061
 me+0.056
 be+0.051
's+0.051
Neg logit contributions
bridge-0.108
aster-0.090
pron-0.090
iche-0.088
ined-0.087
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 me+0.057
 each+0.055
 to+0.055
 be+0.047
 ho+0.044
Neg logit contributions
bridge-0.180
pron-0.158
aster-0.151
iche-0.149
atmeal-0.147
Token<|endoftext|>
Token ID50256
Feature activation-1.25e-02

Pos logit contributions
 to+0.024
 each+0.023
 me+0.021
 be+0.019
 for+0.018
Neg logit contributions
bridge-0.039
pron-0.032
ined-0.032
iche-0.031
aster-0.031
TokenOnce
Token ID7454
Feature activation-9.55e-03

Pos logit contributions
 to+0.014
 each+0.013
 me+0.011
 for+0.009
 be+0.009
Neg logit contributions
bridge-0.038
pron-0.033
iche-0.032
ined-0.032
aster-0.031
Token bye
Token ID33847
Feature activation-2.29e-02

Pos logit contributions
 each+0.048
 to+0.046
 me+0.035
 for+0.033
 be+0.032
Neg logit contributions
bridge-0.101
pron-0.087
aster-0.084
iche-0.083
ined-0.082
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
 each+0.023
 to+0.019
 be+0.016
 me+0.015
 for+0.014
Neg logit contributions
bridge-0.066
pron-0.057
iche-0.055
aster-0.054
ined-0.053
Token small
Token ID1402
Feature activation-3.43e-02

Pos logit contributions
 to+0.019
 each+0.019
 me+0.015
 be+0.013
 for+0.013
Neg logit contributions
bridge-0.033
pron-0.028
iche-0.026
aster-0.026
ined-0.026
Token girl
Token ID2576
Feature activation-2.10e-02

Pos logit contributions
 to+0.053
 each+0.052
 me+0.045
 be+0.041
's+0.038
Neg logit contributions
bridge-0.080
pron-0.068
iche-0.065
ined-0.064
aster-0.062
Token!"
Token ID2474
Feature activation-1.63e-02

Pos logit contributions
 each+0.026
 to+0.024
 me+0.021
 heads+0.019
 be+0.018
Neg logit contributions
bridge-0.056
pron-0.044
ined-0.044
aster-0.044
iche-0.041
Token ch
Token ID442
Feature activation-4.59e-02

Pos logit contributions
 to+0.021
 each+0.021
 me+0.019
 ho+0.018
 for+0.016
Neg logit contributions
bridge-0.040
aster-0.033
ined-0.033
hes-0.033
pron-0.031
Tokenir
Token ID343
Feature activation-7.10e-02

Pos logit contributions
 to+0.095
 each+0.092
 me+0.087
 for+0.080
,+0.080
Neg logit contributions
bridge-0.094
pron-0.076
iche-0.073
atmeal-0.070
ined-0.069
Tokenped
Token ID9124
Feature activation-7.08e-02

Pos logit contributions
 to+0.147
 each+0.140
 me+0.126
 be+0.119
 for+0.117
Neg logit contributions
bridge-0.141
ined-0.114
aster-0.102
iche-0.101
hes-0.100
Token the
Token ID262
Feature activation-2.24e-02

Pos logit contributions
 to+0.079
 each+0.079
 me+0.077
 be+0.067
 ho+0.063
Neg logit contributions
bridge-0.177
aster-0.145
pron-0.143
ined-0.141
hes-0.138
Token mouse
Token ID10211
Feature activation-3.29e-02

Pos logit contributions
 to+0.032
 each+0.030
 me+0.026
 for+0.024
 be+0.023
Neg logit contributions
bridge-0.059
pron-0.051
ined-0.049
iche-0.049
aster-0.047
Token,
Token ID11
Feature activation-2.42e-02

Pos logit contributions
 each+0.035
 to+0.032
 me+0.030
 us+0.022
 for+0.021
Neg logit contributions
bridge-0.090
ined-0.072
pron-0.072
aster-0.071
iche-0.069
Token little
Token ID1310
Feature activation-1.43e-02

Pos logit contributions
 to+0.000
 each+0.000
 me+0.000
 for+0.000
 be+0.000
Neg logit contributions
bridge-0.000
pron-0.000
iche-0.000
ined-0.000
aster-0.000
Token bird
Token ID6512
Feature activation-1.67e-02

Pos logit contributions
 to+0.020
 each+0.020
 me+0.018
 for+0.016
's+0.015
Neg logit contributions
bridge-0.034
pron-0.029
iche-0.028
ined-0.028
aster-0.027
Token to
Token ID284
Feature activation-7.97e-03

Pos logit contributions
 each+0.007
 me+0.006
 to+0.004
 heads+0.003
 be+0.001
Neg logit contributions
bridge-0.054
pron-0.048
aster-0.048
iche-0.048
ined-0.047
Token go
Token ID467
Feature activation-1.23e-02

Pos logit contributions
 to+0.009
 each+0.008
 me+0.007
 for+0.006
 be+0.006
Neg logit contributions
bridge-0.024
pron-0.020
iche-0.020
aster-0.020
ined-0.020
Token back
Token ID736
Feature activation-8.83e-03

Pos logit contributions
 to+0.007
 each+0.007
 me+0.005
 be+0.003
 for+0.003
Neg logit contributions
bridge-0.042
pron-0.037
ined-0.036
iche-0.036
aster-0.036
Token home
Token ID1363
Feature activation-3.98e-02

Pos logit contributions
 each+0.005
 me+0.004
 to+0.003
 be+0.003
 ho+0.002
Neg logit contributions
bridge-0.027
ined-0.024
pron-0.023
iche-0.023
aster-0.023
Token.
Token ID13
Feature activation-9.75e-03

Pos logit contributions
 each+0.037
 me+0.036
 to+0.033
 ho+0.026
 be+0.025
Neg logit contributions
bridge-0.108
pron-0.095
ined-0.092
aster-0.090
iche-0.090
Token
Token ID198
Feature activation-7.64e-04

Pos logit contributions
 to+0.015
 each+0.015
 me+0.014
 for+0.013
 be+0.012
Neg logit contributions
bridge-0.023
iche-0.019
pron-0.019
aster-0.019
ined-0.019
Token
Token ID198
Feature activation+1.31e-03

Pos logit contributions
 to+0.001
 each+0.001
 me+0.001
 be+0.001
 for+0.001
Neg logit contributions
bridge-0.002
pron-0.002
iche-0.002
ined-0.002
aster-0.002
TokenThe
Token ID464
Feature activation-9.17e-03

Pos logit contributions
bridge+0.003
iche+0.003
ined+0.002
pron+0.002
atmeal+0.002
Neg logit contributions
 to-0.002
 each-0.002
 me-0.002
 be-0.002
's-0.002
Token wind
Token ID2344
Feature activation-7.03e-03

Pos logit contributions
 to+0.009
 each+0.008
 me+0.007
 be+0.006
 for+0.006
Neg logit contributions
bridge-0.029
pron-0.025
iche-0.025
ined-0.025
aster-0.024
Token the
Token ID262
Feature activation-2.61e-03

Pos logit contributions
 me+0.018
 each+0.018
 to+0.018
 ho+0.015
 be+0.014
Neg logit contributions
bridge-0.053
aster-0.046
ined-0.044
iche-0.042
hes-0.042
Token log
Token ID2604
Feature activation-1.43e-03

Pos logit contributions
 to+0.005
 each+0.005
 me+0.005
 for+0.004
 be+0.004
Neg logit contributions
bridge-0.005
pron-0.004
ined-0.004
iche-0.004
aster-0.004
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 me+0.001
 each+0.001
 to+0.001
 be+0.001
 ho+0.001
Neg logit contributions
bridge-0.004
ined-0.004
pron-0.003
iche-0.003
aster-0.003
Token made
Token ID925
Feature activation-1.78e-02

Pos logit contributions
 to+0.019
 me+0.019
 each+0.018
 be+0.015
 for+0.015
Neg logit contributions
bridge-0.034
aster-0.028
pron-0.027
hes-0.027
ined-0.027
Token it
Token ID340
Feature activation-1.04e-03

Pos logit contributions
 to+0.021
 me+0.019
 each+0.019
 be+0.018
 ho+0.017
Neg logit contributions
bridge-0.048
hes-0.040
aster-0.039
ined-0.037
iche-0.037
Token safely
Token ID11512
Feature activation-6.47e-02

Pos logit contributions
 me+0.001
 each+0.001
 to+0.001
 ho+0.001
 us+0.001
Neg logit contributions
bridge-0.003
ined-0.002
hes-0.002
iche-0.002
atmeal-0.002
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
 me+0.058
 each+0.057
 to+0.046
 heads+0.037
 be+0.035
Neg logit contributions
bridge-0.187
ined-0.157
iche-0.154
hes-0.149
pron-0.148
Token the
Token ID262
Feature activation-3.79e-03

Pos logit contributions
 to+0.012
 each+0.011
 me+0.010
 for+0.008
 be+0.008
Neg logit contributions
bridge-0.032
aster-0.027
pron-0.027
iche-0.027
ined-0.027
Token other
Token ID584
Feature activation+3.72e-02

Pos logit contributions
 to+0.008
 each+0.007
 me+0.006
 for+0.006
,+0.006
Neg logit contributions
bridge-0.008
iche-0.006
pron-0.006
ined-0.006
aster-0.006
Token side
Token ID1735
Feature activation-1.74e-02

Pos logit contributions
bridge+0.072
aster+0.057
iche+0.055
opl+0.054
ined+0.053
Neg logit contributions
 to-0.067
 each-0.056
 me-0.055
upp-0.053
ur-0.053
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 to+0.017
 each+0.016
 me+0.015
 be+0.012
 for+0.011
Neg logit contributions
bridge-0.052
pron-0.044
ined-0.044
aster-0.044
iche-0.043
Token made
Token ID925
Feature activation-1.99e-02

Pos logit contributions
 to+0.008
 each+0.008
 me+0.007
 home+0.006
 ho+0.006
Neg logit contributions
bridge-0.017
ined-0.014
pron-0.014
hes-0.013
iche-0.013
Token the
Token ID262
Feature activation-5.30e-03

Pos logit contributions
 to+0.022
 each+0.020
 me+0.018
 ho+0.017
 be+0.017
Neg logit contributions
bridge-0.054
ined-0.046
pron-0.046
aster-0.045
hes-0.044
Token sky
Token ID6766
Feature activation-3.02e-02

Pos logit contributions
 to+0.010
 each+0.010
 me+0.009
 for+0.008
 be+0.008
Neg logit contributions
bridge-0.011
pron-0.009
ined-0.009
iche-0.009
aster-0.008
Token feel
Token ID1254
Feature activation-1.23e-02

Pos logit contributions
 to+0.046
 each+0.045
 me+0.045
 ho+0.036
 for+0.036
Neg logit contributions
bridge-0.071
ined-0.058
pron-0.057
iche-0.056
aster-0.053
Token so
Token ID523
Feature activation+1.57e-02

Pos logit contributions
 to+0.016
 each+0.015
 me+0.014
 for+0.012
 be+0.012
Neg logit contributions
bridge-0.034
pron-0.029
iche-0.028
ined-0.028
aster-0.028
Token cheerful
Token ID37999
Feature activation-4.75e-02

Pos logit contributions
bridge+0.038
pron+0.031
ined+0.031
iche+0.031
aster+0.030
Neg logit contributions
 to-0.026
 each-0.025
 me-0.023
 be-0.021
 for-0.021
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 each+0.038
 to+0.036
 me+0.036
 be+0.030
 ho+0.027
Neg logit contributions
bridge-0.144
ined-0.121
pron-0.121
iche-0.114
hes-0.114
Token From
Token ID3574
Feature activation+1.33e-02

Pos logit contributions
 to+0.031
 each+0.030
 me+0.028
 for+0.025
 be+0.025
Neg logit contributions
bridge-0.045
pron-0.037
iche-0.037
ined-0.037
aster-0.036
Token that
Token ID326
Feature activation-2.23e-02

Pos logit contributions
bridge+0.031
ined+0.027
pron+0.027
aster+0.026
iche+0.025
Neg logit contributions
 to-0.020
 each-0.018
 me-0.016
 be-0.016
 for-0.015
Token day
Token ID1110
Feature activation-3.76e-03

Pos logit contributions
 to+0.043
 each+0.042
 me+0.038
 be+0.035
 for+0.034
Neg logit contributions
bridge-0.046
pron-0.039
ined-0.037
iche-0.036
aster-0.036
Token on
Token ID319
Feature activation+2.84e-02

Pos logit contributions
 to+0.007
 each+0.007
 me+0.007
 be+0.006
 for+0.006
Neg logit contributions
bridge-0.008
pron-0.006
ined-0.006
iche-0.006
aster-0.006