TOP ACTIVATIONS
MAX = 0.294

So the younger sister began to colour . She
! You are the best sister ever !" They
Once there was a little sister named Sarah . She loved
sister would show her little sister how to measure the ingredients
little brother ! Jack 's brother wanted a piece of the
little brother ! Jack 's brother wanted a piece of the
being kind to his little sister . From that day on
He said to his little brother , " I have to
of fun playing with her brother in the backyard . She
L izz ie and her sister had to share the bedroom
she played with her big sister . It made her very
. It was his little brother ! Jack 's brother wanted
. It was his little brother ! Jack 's brother wanted
was hungry ! The eldest brother opened the box of meat
had to listen to her sister . So she went to
his bed when the little sister came in . She was
! The regular sister would show her little sister
. But then her big sister came home from school and
the other was the younger sister . They were walking down
very well . Her big sister , Emma , had a

MIN ACTIVATIONS
MIN = -0.329

and pointed at the grass ho pper . He said ,
in circles with the grass ho pper . They jumped and
so happy that the grass ho pper had said yes .
" Is it a grass ho pper ?" she guesses .
Wow ! These are so ooo spicy ". They both laughed
an ant and a grass ho pper . The ant was
lady bug and a grass ho pper . One
The nos y grass ho pper hopped over to Tom
Gr ass ho pper , let
fan , like " w ooo " and " br rr
introduced to a smooth grass ho pper called Jim . Ted
." The grass ho pper sighed and decided to
and picked up a grass ho pper ! Both friends were
and saw a huge grass ho pper , who had been
It is not a grass ho pper or a cricket .
and squ ished the grass ho ppers with her fingers to
. It was a grass ho pper ! Tom was so
" Wh ooo , Wh ooo ," said the noise .
?" The grass ho pper said , " I
alarm . It goes w ooo w ooo w ooo .

INTERVAL 0.138 - 0.294
CONTAINS 4.330%

done ." The boy smiled and nodded his head
It was a little card with a picture of a star
her hand , and the two of them played a noisy
<|endoftext|> Once there was a boy named Jack . He liked
, his friend asked him if he took the cookie .

INTERVAL -0.017 - 0.138
CONTAINS 61.119%

house for it , like a nest ." Mom my said
win , so they were competitive . Tom started
the roof and sighed . He was tired but proud .
. " We used them all ." M ia
her a cookie . Lily was so happy and wanted to

INTERVAL -0.173 - -0.017
CONTAINS 32.408%

it . He held on tight and tried to pull .
the lake , they noticed that something was wrong .
it . She waited and waited , but
. He said , " Look , the dark is not
a gun again . <|endoftext|> Anna and Ben are friends .

INTERVAL -0.329 - -0.173
CONTAINS 2.143%

his truck and they had fun together . <|endoftext|> Once upon
They had a lot of fun making noises and pretending to
with Ben . Lily joined them with her doll . They
laughed and had so much fun . When it was time
needles and threads and helps them sew the hole . The
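
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The sketch below checks that reading against the printed numbers. It is an illustration under that assumption, not the dashboard's own code; `interval_edges` and the variable names are hypothetical.

```python
# Illustrative sketch: assumes the dashboard splits [MIN, MAX] into four
# equal-width bins. The function and variable names are hypothetical.

def interval_edges(lo, hi, bins=4):
    """Return bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

# MIN and MAX as printed in the dump.
edges = interval_edges(-0.329, 0.294)

# CONTAINS percentages as printed, ordered from the lowest bin to the highest.
shares = [2.143, 32.408, 61.119, 4.330]

# Print from the highest bin down, matching the order used in the dump.
bins_low_to_high = list(zip(edges[:-1], edges[1:], shares))
for low, high, share in reversed(bins_low_to_high):
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {share:.3f}%")

print(f"total share: {sum(shares):.3f}%")
```

Under this assumption the inner edges fall at about -0.173, -0.017 and 0.138, which agrees with the printed ranges to three decimals, and the four shares add up to 100%.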
Token
Token ID198
Feature activation+2.57e-02

Pos logit contributions
arter+0.195
ulp+0.189
iri+0.187
 mir+0.185
igi+0.184
Neg logit contributions
 All-0.179
 ties-0.155
 Being-0.154
 who-0.153
Let-0.151
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
 All+0.026
Let+0.024
 who+0.024
 ties+0.023
 Stone+0.023
Neg logit contributions
 awful-0.056
iri-0.055
gh-0.055
ulp-0.055
 mir-0.054
TokenSo
Token ID2396
Feature activation+4.99e-03

Pos logit contributions
 All+0.033
Let+0.031
 who+0.030
 ties+0.029
 Stone+0.029
Neg logit contributions
 awful-0.044
iri-0.044
gh-0.043
 mir-0.043
ulp-0.043
Token the
Token ID262
Feature activation+7.98e-02

Pos logit contributions
 All+0.007
Let+0.007
 who+0.007
 ties+0.006
 Stone+0.006
Neg logit contributions
 awful-0.009
iri-0.009
gh-0.009
ulp-0.009
 mir-0.009
Token younger
Token ID7099
Feature activation+1.52e-01

Pos logit contributions
 All+0.103
Let+0.097
 who+0.096
 Stone+0.089
 ties+0.089
Neg logit contributions
 awful-0.157
iri-0.153
ulp-0.153
arter-0.152
gh-0.152
Token sister
Token ID6621
Feature activation+2.90e-01

Pos logit contributions
 All+0.186
Let+0.170
 Trying+0.169
 Using+0.167
Find+0.167
Neg logit contributions
iri-0.278
gh-0.273
 mir-0.271
 awful-0.270
ww-0.269
Token began
Token ID2540
Feature activation+7.56e-02

Pos logit contributions
 All+0.330
Let+0.320
 Stone+0.310
 Being+0.303
Find+0.300
Neg logit contributions
 awful-0.568
iri-0.542
gh-0.539
 mir-0.539
oto-0.537
Token to
Token ID284
Feature activation-5.40e-02

Pos logit contributions
 All+0.120
Let+0.115
 who+0.112
 Stone+0.110
 ties+0.107
Neg logit contributions
 awful-0.123
gh-0.121
ulp-0.119
 mir-0.118
iri-0.117
Token colour
Token ID9568
Feature activation+2.52e-02

Pos logit contributions
 awful+0.093
gh+0.092
 mir+0.089
ulp+0.088
iri+0.088
Neg logit contributions
 All-0.079
Let-0.076
 who-0.074
 Being-0.073
 Did-0.073
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 All+0.031
Let+0.029
 who+0.029
 ties+0.027
 Stone+0.027
Neg logit contributions
 awful-0.051
iri-0.050
ulp-0.050
arter-0.050
 mir-0.050
Token She
Token ID1375
Feature activation+1.60e-02

Pos logit contributions
es+0.014
 ties+0.014
 who+0.014
mun+0.014
 led+0.014
Neg logit contributions
 awful-0.022
ummy-0.022
ife-0.021
oor-0.021
ulp-0.021
Token!
Token ID0
Feature activation-1.03e-01

Pos logit contributions
 awful+0.262
iri+0.255
ulp+0.253
gh+0.253
arter+0.252
Neg logit contributions
 All-0.160
Let-0.151
 who-0.149
 ties-0.139
 Stone-0.138
Token You
Token ID921
Feature activation+4.88e-02

Pos logit contributions
 awful+0.197
iri+0.196
 mir+0.195
ife+0.191
arter+0.191
Neg logit contributions
 ties-0.107
 led-0.104
 Stone-0.104
es-0.101
Let-0.099
Token are
Token ID389
Feature activation+4.60e-02

Pos logit contributions
Let+0.056
 Being+0.055
 All+0.054
 Using+0.054
Being+0.052
Neg logit contributions
 awful-0.102
 mir-0.089
 storm-0.088
alt-0.087
 awesome-0.087
Token the
Token ID262
Feature activation+6.44e-02

Pos logit contributions
 All+0.074
Let+0.071
 who+0.068
 Stone+0.066
 ties+0.066
Neg logit contributions
 awful-0.078
iri-0.073
arter-0.073
 mir-0.073
gh-0.073
Token best
Token ID1266
Feature activation+5.65e-02

Pos logit contributions
 All+0.089
 who+0.088
Let+0.085
 Stone+0.081
 ties+0.080
Neg logit contributions
ulp-0.120
 awful-0.117
arter-0.116
urg-0.116
iri-0.116
Token sister
Token ID6621
Feature activation+2.90e-01

Pos logit contributions
 All+0.083
 who+0.081
Let+0.078
 ties+0.073
 Stone+0.071
Neg logit contributions
 awful-0.103
iri-0.101
gh-0.099
ulp-0.098
arter-0.098
Token ever
Token ID1683
Feature activation-2.29e-01

Pos logit contributions
 All+0.516
Let+0.492
 who+0.471
 Stone+0.469
 ties+0.468
Neg logit contributions
 awful-0.422
gh-0.403
 mir-0.401
iri-0.399
ulp-0.396
Token!"
Token ID2474
Feature activation-5.89e-02

Pos logit contributions
 awful+0.328
iri+0.327
ulp+0.318
arter+0.317
 mir+0.315
Neg logit contributions
 who-0.425
 All-0.425
Let-0.408
 Being-0.386
 ties-0.385
Token
Token ID198
Feature activation+2.13e-02

Pos logit contributions
 awful+0.136
iri+0.133
ulp+0.132
 mir+0.130
oto+0.130
Neg logit contributions
 All-0.048
Let-0.046
 Stone-0.045
 ties-0.044
 who-0.042
Token
Token ID198
Feature activation+2.19e-02

Pos logit contributions
 All+0.016
Let+0.014
 who+0.013
 Stone+0.013
 ties+0.013
Neg logit contributions
 awful-0.053
iri-0.052
ulp-0.052
gh-0.052
 mir-0.051
TokenThey
Token ID2990
Feature activation-8.01e-02

Pos logit contributions
 All+0.031
Let+0.028
 who+0.028
 ties+0.028
 Stone+0.028
Neg logit contributions
 awful-0.039
iri-0.039
gh-0.038
 mir-0.038
oto-0.038
TokenOnce
Token ID7454
Feature activation-6.94e-02

Pos logit contributions
gh+0.099
alt+0.097
oor+0.094
igi+0.094
ulp+0.093
Neg logit contributions
 ties-0.078
nam-0.073
Find-0.071
 puzzles-0.071
Being-0.071
Token there
Token ID612
Feature activation+1.53e-01

Pos logit contributions
 awful+0.136
iri+0.132
gh+0.132
 mir+0.131
ulp+0.130
Neg logit contributions
 All-0.091
 who-0.086
Let-0.084
 Stone-0.079
 ties-0.078
Token was
Token ID373
Feature activation+4.88e-02

Pos logit contributions
 All+0.121
Let+0.114
 ties+0.114
 who+0.107
 Stone+0.104
Neg logit contributions
gh-0.358
 awful-0.358
iri-0.353
ulp-0.348
 mir-0.348
Token a
Token ID257
Feature activation+4.53e-02

Pos logit contributions
 All+0.027
 who+0.026
Let+0.025
 ties+0.016
My+0.015
Neg logit contributions
arter-0.134
 mir-0.134
 awful-0.133
urg-0.132
iri-0.132
Token little
Token ID1310
Feature activation+6.54e-02

Pos logit contributions
 All+0.056
 who+0.055
Let+0.054
 Stone+0.049
 ties+0.049
Neg logit contributions
 awful-0.092
ulp-0.089
gh-0.089
arter-0.088
iri-0.088
Token sister
Token ID6621
Feature activation+2.89e-01

Pos logit contributions
 All+0.048
 who+0.047
Let+0.045
 Stone+0.039
 ties+0.038
Neg logit contributions
 awful-0.166
gh-0.164
ulp-0.162
iri-0.160
arter-0.160
Token named
Token ID3706
Feature activation+8.58e-02

Pos logit contributions
 All+0.301
Let+0.286
 ties+0.282
 Stone+0.282
Find+0.277
Neg logit contributions
 awful-0.607
iri-0.591
 mir-0.591
arter-0.589
ulp-0.585
Token Sarah
Token ID10490
Feature activation-1.73e-02

Pos logit contributions
 All+0.082
 who+0.079
Let+0.075
 ties+0.063
 Stone+0.062
Neg logit contributions
iri-0.208
 awful-0.202
 mir-0.202
gh-0.201
ulp-0.198
Token.
Token ID13
Feature activation-1.16e-03

Pos logit contributions
 awful+0.027
gh+0.025
ulp+0.025
iri+0.025
oto+0.025
Neg logit contributions
 All-0.029
 Stone-0.026
 ties-0.026
Find-0.026
Let-0.025
Token She
Token ID1375
Feature activation+1.31e-02

Pos logit contributions
 awful+0.003
arter+0.002
iri+0.002
ulp+0.002
gh+0.002
Neg logit contributions
 All-0.001
Let-0.001
 who-0.001
 ties-0.001
 Stone-0.001
Token loved
Token ID6151
Feature activation+3.84e-02

Pos logit contributions
 All+0.013
 Stone+0.012
 ties+0.011
 Being+0.011
 Using+0.011
Neg logit contributions
gh-0.029
 awful-0.029
 mir-0.027
ww-0.027
ulp-0.027
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.091
iri+0.089
 mir+0.089
ulp+0.088
gh+0.088
Neg logit contributions
 All-0.072
Let-0.068
 who-0.066
 Stone-0.063
 Being-0.063
Token would
Token ID561
Feature activation+1.79e-02

Pos logit contributions
 All+0.302
Let+0.299
 Stone+0.290
Find+0.276
 ties+0.274
Neg logit contributions
 awful-0.594
iri-0.564
gh-0.559
 mir-0.555
urg-0.549
Token show
Token ID905
Feature activation+4.23e-02

Pos logit contributions
 All+0.024
Let+0.023
 who+0.023
 ties+0.021
 Stone+0.021
Neg logit contributions
 awful-0.034
iri-0.033
ulp-0.033
arter-0.033
gh-0.033
Token her
Token ID607
Feature activation-9.69e-02

Pos logit contributions
 All+0.055
Let+0.053
 who+0.051
 Stone+0.049
 ties+0.049
Neg logit contributions
 awful-0.082
iri-0.079
gh-0.079
ulp-0.079
arter-0.078
Token little
Token ID1310
Feature activation+6.58e-02

Pos logit contributions
 awful+0.186
gh+0.177
ulp+0.176
arter+0.170
iri+0.170
Neg logit contributions
Let-0.127
 All-0.126
 Stone-0.115
 Did-0.115
 Being-0.113
Token sister
Token ID6621
Feature activation+2.89e-01

Pos logit contributions
 All+0.074
Let+0.067
 who+0.065
 Being+0.061
 Stone+0.061
Neg logit contributions
 awful-0.140
ulp-0.135
 mir-0.135
arter-0.134
iri-0.134
Token how
Token ID703
Feature activation+1.06e-01

Pos logit contributions
 All+0.378
Let+0.378
 Stone+0.346
 ties+0.336
 who+0.333
Neg logit contributions
 awful-0.544
gh-0.527
ulp-0.516
iri-0.510
arter-0.508
Token to
Token ID284
Feature activation-5.95e-02

Pos logit contributions
 All+0.179
 who+0.172
Let+0.170
 ties+0.156
 Being+0.154
Neg logit contributions
iri-0.175
ulp-0.170
arter-0.170
 awful-0.167
urg-0.166
Token measure
Token ID3953
Feature activation+1.21e-01

Pos logit contributions
 awful+0.122
iri+0.119
gh+0.119
arter+0.118
ulp+0.118
Neg logit contributions
 All-0.069
Let-0.067
 who-0.063
 Stone-0.061
 Being-0.060
Token the
Token ID262
Feature activation+6.21e-02

Pos logit contributions
 All+0.134
Let+0.128
 who+0.118
 Stone+0.114
 ties+0.112
Neg logit contributions
 awful-0.257
iri-0.246
gh-0.246
arter-0.245
ulp-0.245
Token ingredients
Token ID9391
Feature activation+1.46e-01

Pos logit contributions
 All+0.090
Let+0.085
 who+0.084
 ties+0.080
 Stone+0.080
Neg logit contributions
 awful-0.112
iri-0.110
ulp-0.109
gh-0.108
arter-0.108
Token little
Token ID1310
Feature activation+7.05e-02

Pos logit contributions
 awful+0.122
ulp+0.121
iri+0.120
urg+0.119
arter+0.118
Neg logit contributions
 All-0.086
 who-0.086
Let-0.081
 ties-0.079
 Stone-0.078
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 All+0.077
Let+0.072
 who+0.071
 ties+0.065
 Stone+0.065
Neg logit contributions
 awful-0.152
iri-0.148
ulp-0.147
gh-0.147
arter-0.147
Token!
Token ID0
Feature activation-8.79e-02

Pos logit contributions
 All+0.436
Let+0.427
 Stone+0.402
 ties+0.401
Find+0.389
Neg logit contributions
 awful-0.492
gh-0.468
iri-0.457
arter-0.457
 mir-0.451
Token Jack
Token ID3619
Feature activation-2.90e-02

Pos logit contributions
 awful+0.176
iri+0.170
arter+0.170
 mir+0.168
ulp+0.168
Neg logit contributions
 ties-0.088
 Stone-0.084
Let-0.083
 led-0.082
es-0.080
Token's
Token ID338
Feature activation+9.52e-02

Pos logit contributions
 awful+0.063
gh+0.060
iri+0.059
oto+0.058
urg+0.057
Neg logit contributions
 All-0.028
Let-0.025
 Stone-0.025
 ties-0.025
Find-0.025
Token brother
Token ID3956
Feature activation+2.89e-01

Pos logit contributions
 All+0.133
Let+0.127
 who+0.121
 Stone+0.117
 Being+0.116
Neg logit contributions
 awful-0.178
gh-0.171
iri-0.168
ulp-0.168
arter-0.167
Token wanted
Token ID2227
Feature activation+4.23e-02

Pos logit contributions
 All+0.298
Let+0.294
 Stone+0.285
 ties+0.282
Find+0.269
Neg logit contributions
 awful-0.619
gh-0.595
iri-0.574
 mir-0.564
ulp-0.560
Token a
Token ID257
Feature activation+4.76e-02

Pos logit contributions
 All+0.057
Let+0.053
 who+0.053
 ties+0.049
 Stone+0.048
Neg logit contributions
 awful-0.082
iri-0.081
arter-0.081
ulp-0.081
 mir-0.079
Token piece
Token ID3704
Feature activation+1.75e-01

Pos logit contributions
 All+0.061
 who+0.060
Let+0.059
 Stone+0.055
 ties+0.055
Neg logit contributions
 awful-0.094
iri-0.092
urg-0.091
ulp-0.091
arter-0.090
Token of
Token ID286
Feature activation+8.15e-02

Pos logit contributions
 who+0.144
 All+0.140
Let+0.112
 Stone+0.105
 ties+0.101
Neg logit contributions
iri-0.443
urg-0.430
ulp-0.423
 mir-0.421
arter-0.420
Token the
Token ID262
Feature activation+7.20e-02

Pos logit contributions
 All+0.119
Let+0.114
 who+0.110
 Stone+0.106
 Being+0.104
Neg logit contributions
 awful-0.149
gh-0.141
ulp-0.141
 mir-0.140
arter-0.139
Token little
Token ID1310
Feature activation+7.05e-02

Pos logit contributions
 awful+0.122
ulp+0.121
iri+0.120
urg+0.119
arter+0.118
Neg logit contributions
 All-0.086
 who-0.086
Let-0.081
 ties-0.079
 Stone-0.078
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 All+0.077
Let+0.072
 who+0.071
 ties+0.065
 Stone+0.065
Neg logit contributions
 awful-0.152
iri-0.148
ulp-0.147
gh-0.147
arter-0.147
Token!
Token ID0
Feature activation-8.79e-02

Pos logit contributions
 All+0.436
Let+0.427
 Stone+0.402
 ties+0.401
Find+0.389
Neg logit contributions
 awful-0.492
gh-0.468
iri-0.457
arter-0.457
 mir-0.451
Token Jack
Token ID3619
Feature activation-2.90e-02

Pos logit contributions
 awful+0.176
iri+0.170
arter+0.170
 mir+0.168
ulp+0.168
Neg logit contributions
 ties-0.088
 Stone-0.084
Let-0.083
 led-0.082
es-0.080
Token's
Token ID338
Feature activation+9.52e-02

Pos logit contributions
 awful+0.063
gh+0.060
iri+0.059
oto+0.058
urg+0.057
Neg logit contributions
 All-0.028
Let-0.025
 Stone-0.025
 ties-0.025
Find-0.025
Token brother
Token ID3956
Feature activation+2.89e-01

Pos logit contributions
 All+0.133
Let+0.127
 who+0.121
 Stone+0.117
 Being+0.116
Neg logit contributions
 awful-0.178
gh-0.171
iri-0.168
ulp-0.168
arter-0.167
Token wanted
Token ID2227
Feature activation+4.23e-02

Pos logit contributions
 All+0.298
Let+0.294
 Stone+0.285
 ties+0.282
Find+0.269
Neg logit contributions
 awful-0.619
gh-0.595
iri-0.574
 mir-0.564
ulp-0.560
Token a
Token ID257
Feature activation+4.76e-02

Pos logit contributions
 All+0.057
Let+0.053
 who+0.053
 ties+0.049
 Stone+0.048
Neg logit contributions
 awful-0.082
iri-0.081
arter-0.081
ulp-0.081
 mir-0.079
Token piece
Token ID3704
Feature activation+1.75e-01

Pos logit contributions
 All+0.061
 who+0.060
Let+0.059
 Stone+0.055
 ties+0.055
Neg logit contributions
 awful-0.094
iri-0.092
urg-0.091
ulp-0.091
arter-0.090
Token of
Token ID286
Feature activation+8.15e-02

Pos logit contributions
 who+0.144
 All+0.140
Let+0.112
 Stone+0.105
 ties+0.101
Neg logit contributions
iri-0.443
urg-0.430
ulp-0.423
 mir-0.421
arter-0.420
Token the
Token ID262
Feature activation+7.20e-02

Pos logit contributions
 All+0.119
Let+0.114
 who+0.110
 Stone+0.106
 Being+0.104
Neg logit contributions
 awful-0.149
gh-0.141
ulp-0.141
 mir-0.140
arter-0.139
Token being
Token ID852
Feature activation-2.65e-03

Pos logit contributions
 All+0.005
 Stone+0.004
Let+0.004
 ties+0.004
 who+0.004
Neg logit contributions
 awful-0.006
ulp-0.005
gh-0.005
arter-0.005
oto-0.005
Token kind
Token ID1611
Feature activation+5.48e-02

Pos logit contributions
 awful+0.004
 awesome+0.004
gh+0.004
 enjoyable+0.004
 mir+0.003
Neg logit contributions
 All-0.005
 Stone-0.004
 ties-0.004
 Did-0.004
Let-0.004
Token to
Token ID284
Feature activation-5.83e-02

Pos logit contributions
 All+0.082
 ties+0.078
Let+0.078
Find+0.077
 Stone+0.077
Neg logit contributions
 awful-0.091
gh-0.086
ulp-0.083
oto-0.083
 mir-0.081
Token his
Token ID465
Feature activation-5.91e-02

Pos logit contributions
 awful+0.104
iri+0.101
ulp+0.100
 mir+0.100
gh+0.100
Neg logit contributions
 All-0.087
Let-0.082
 who-0.082
 ties-0.077
 Stone-0.077
Token little
Token ID1310
Feature activation+6.73e-02

Pos logit contributions
 awful+0.109
iri+0.106
ulp+0.105
gh+0.105
arter+0.105
Neg logit contributions
 All-0.084
Let-0.079
 who-0.079
 ties-0.074
 Stone-0.074
Token sister
Token ID6621
Feature activation+2.89e-01

Pos logit contributions
 All+0.064
Let+0.058
 who+0.057
 Stone+0.052
 ties+0.052
Neg logit contributions
 awful-0.154
ulp-0.150
iri-0.150
arter-0.150
 mir-0.149
Token.
Token ID13
Feature activation+2.69e-03

Pos logit contributions
 All+0.433
Let+0.427
 Stone+0.406
Find+0.400
 ties+0.398
Neg logit contributions
 awful-0.487
gh-0.458
ulp-0.449
arter-0.445
 mir-0.439
Token From
Token ID3574
Feature activation-1.24e-02

Pos logit contributions
 ties+0.002
 All+0.002
 who+0.002
Let+0.002
 Stone+0.002
Neg logit contributions
 awful-0.006
ulp-0.006
arter-0.006
iri-0.006
 mir-0.006
Token that
Token ID326
Feature activation-3.37e-02

Pos logit contributions
 mir+0.026
ulp+0.026
gh+0.026
arter+0.026
urg+0.026
Neg logit contributions
 who-0.015
 All-0.014
Let-0.013
 ties-0.012
 Stone-0.012
Token day
Token ID1110
Feature activation-3.24e-02

Pos logit contributions
 awful+0.042
gh+0.041
ulp+0.039
 mir+0.038
arter+0.038
Neg logit contributions
 All-0.066
Let-0.064
 who-0.064
 ties-0.063
 Stone-0.061
Token on
Token ID319
Feature activation+9.58e-02

Pos logit contributions
gh+0.076
 awful+0.075
ulp+0.074
arter+0.073
iri+0.072
Neg logit contributions
 All-0.028
Let-0.027
 who-0.026
 ties-0.025
 Stone-0.025
TokenHe
Token ID1544
Feature activation-3.27e-02

Pos logit contributions
 All+0.054
Let+0.050
 ties+0.050
 who+0.050
 Stone+0.049
Neg logit contributions
 awful-0.068
iri-0.068
gh-0.068
 mir-0.066
oto-0.065
Token said
Token ID531
Feature activation+3.23e-03

Pos logit contributions
 awful+0.068
 mir+0.066
gh+0.066
iri+0.066
ulp+0.063
Neg logit contributions
 All-0.036
 Stone-0.033
Let-0.033
 Being-0.031
 who-0.031
Token to
Token ID284
Feature activation-5.96e-02

Pos logit contributions
 All+0.004
 who+0.004
Let+0.004
 Stone+0.003
 ties+0.003
Neg logit contributions
 awful-0.006
arter-0.006
iri-0.006
ulp-0.006
 mir-0.006
Token his
Token ID465
Feature activation-6.11e-02

Pos logit contributions
iri+0.104
 awful+0.103
oto+0.102
gh+0.102
arter+0.101
Neg logit contributions
Let-0.083
 All-0.081
 ties-0.077
 Did-0.077
 Stone-0.077
Token little
Token ID1310
Feature activation+6.43e-02

Pos logit contributions
 awful+0.104
iri+0.101
arter+0.100
ulp+0.100
gh+0.099
Neg logit contributions
 All-0.096
Let-0.091
 who-0.089
 ties-0.085
 Stone-0.085
Token brother
Token ID3956
Feature activation+2.89e-01

Pos logit contributions
 All+0.066
Let+0.060
 who+0.057
 Stone+0.054
 ties+0.053
Neg logit contributions
 awful-0.142
iri-0.139
 mir-0.139
arter-0.137
ulp-0.137
Token,
Token ID11
Feature activation-5.70e-04

Pos logit contributions
 All+0.445
Let+0.445
 ties+0.420
Find+0.410
 Stone+0.409
Neg logit contributions
 awful-0.460
gh-0.450
iri-0.446
oto-0.444
arter-0.431
Token "
Token ID366
Feature activation+1.61e-01

Pos logit contributions
 awful+0.001
iri+0.001
 mir+0.001
oto+0.001
gh+0.001
Neg logit contributions
 All-0.001
 ties-0.001
Let-0.001
 draws-0.001
 Stone-0.001
TokenI
Token ID40
Feature activation+7.99e-03

Pos logit contributions
 All+0.181
 ties+0.178
 led+0.170
es+0.169
 puzzles+0.165
Neg logit contributions
iri-0.317
oto-0.309
ww-0.309
gh-0.308
 awful-0.306
Token have
Token ID423
Feature activation+1.41e-01

Pos logit contributions
 All+0.010
Let+0.009
 who+0.009
 Stone+0.008
 ties+0.008
Neg logit contributions
 awful-0.016
gh-0.016
 mir-0.016
iri-0.016
arter-0.016
Token to
Token ID284
Feature activation-5.61e-02

Pos logit contributions
 All+0.138
Let+0.128
 who+0.126
 ties+0.115
 Stone+0.113
Neg logit contributions
 awful-0.323
iri-0.317
arter-0.315
ulp-0.315
gh-0.312
Token of
Token ID286
Feature activation+8.19e-02

Pos logit contributions
 awful+0.083
ulp+0.079
 mir+0.078
gh+0.078
iri+0.077
Neg logit contributions
 All-0.050
Let-0.048
 who-0.046
 Stone-0.046
 ties-0.044
Token fun
Token ID1257
Feature activation-2.07e-01

Pos logit contributions
 All+0.159
Let+0.153
 who+0.151
 Stone+0.146
 ties+0.144
Neg logit contributions
 awful-0.109
ulp-0.103
iri-0.102
 mir-0.102
gh-0.102
Token playing
Token ID2712
Feature activation+7.25e-02

Pos logit contributions
ulp+0.393
gh+0.392
 awful+0.386
arter+0.376
urg+0.372
Neg logit contributions
 All-0.263
Let-0.259
 Stone-0.250
 Did-0.234
 Being-0.234
Token with
Token ID351
Feature activation+1.53e-01

Pos logit contributions
 All+0.067
Let+0.063
 Being+0.062
 Stone+0.058
Being+0.056
Neg logit contributions
 awful-0.164
gh-0.163
ulp-0.157
arter-0.155
 mir-0.154
Token her
Token ID607
Feature activation-8.57e-02

Pos logit contributions
 All+0.210
Let+0.202
 who+0.190
 Stone+0.189
 Being+0.188
Neg logit contributions
 awful-0.283
gh-0.272
iri-0.272
ulp-0.270
 mir-0.267
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 awful+0.168
arter+0.163
ulp+0.162
gh+0.161
iri+0.161
Neg logit contributions
 All-0.110
Let-0.103
 Being-0.095
 Did-0.092
 who-0.092
Token in
Token ID287
Feature activation+1.24e-01

Pos logit contributions
 All+0.404
Let+0.403
Find+0.365
 Being+0.360
 Stone+0.359
Neg logit contributions
 awful-0.517
gh-0.489
iri-0.485
arter-0.480
 mir-0.472
Token the
Token ID262
Feature activation+7.40e-02

Pos logit contributions
 All+0.132
Let+0.131
 Stone+0.121
 Being+0.121
 Did+0.120
Neg logit contributions
 awful-0.257
gh-0.245
arter-0.245
ulp-0.244
 mir-0.237
Token backyard
Token ID24296
Feature activation+9.98e-02

Pos logit contributions
 All+0.087
Let+0.082
 who+0.080
 Stone+0.075
 ties+0.075
Neg logit contributions
 awful-0.154
iri-0.149
ulp-0.149
arter-0.148
 mir-0.148
Token.
Token ID13
Feature activation+1.04e-03

Pos logit contributions
 All+0.142
Let+0.141
 ties+0.135
 Stone+0.133
Find+0.130
Neg logit contributions
 awful-0.169
gh-0.163
 mir-0.161
ulp-0.161
arter-0.159
Token She
Token ID1375
Feature activation+1.23e-02

Pos logit contributions
 ties+0.001
es+0.001
 Stone+0.001
 led+0.001
Let+0.001
Neg logit contributions
iri-0.002
 awful-0.002
arter-0.002
ife-0.002
ulp-0.002
TokenL
Token ID43
Feature activation-1.75e-02

Pos logit contributions
 All+0.043
 who+0.040
Let+0.040
 ties+0.039
 Stone+0.039
Neg logit contributions
 awful-0.056
iri-0.055
gh-0.055
 mir-0.054
ulp-0.053
Tokenizz
Token ID6457
Feature activation-1.20e-01

Pos logit contributions
 awful+0.029
iri+0.029
ulp+0.028
gh+0.028
arter+0.028
Neg logit contributions
 All-0.028
Let-0.027
 who-0.026
 ties-0.025
 Stone-0.025
Tokenie
Token ID494
Feature activation-5.31e-02

Pos logit contributions
urg+0.076
iri+0.075
 awful+0.073
oto+0.073
ulp+0.070
Neg logit contributions
 All-0.306
Let-0.290
Find-0.288
 Stone-0.286
 ties-0.284
Token and
Token ID290
Feature activation+1.36e-03

Pos logit contributions
 awful+0.109
 mir+0.108
gh+0.106
iri+0.106
urg+0.105
Neg logit contributions
 All-0.055
 Stone-0.050
Let-0.049
 Being-0.048
Find-0.047
Token her
Token ID607
Feature activation-8.71e-02

Pos logit contributions
 All+0.002
Let+0.002
 Using+0.002
 ties+0.002
 Did+0.002
Neg logit contributions
iri-0.002
oto-0.002
urg-0.002
 awful-0.002
ulp-0.002
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.148
iri+0.144
ulp+0.142
gh+0.142
arter+0.142
Neg logit contributions
 All-0.136
Let-0.129
 who-0.126
 ties-0.121
 Stone-0.120
Token had
Token ID550
Feature activation+8.68e-02

Pos logit contributions
 All+0.333
Let+0.325
 Stone+0.311
 ties+0.306
 Being+0.306
Neg logit contributions
 awful-0.569
gh-0.551
iri-0.541
 mir-0.532
urg-0.532
Token to
Token ID284
Feature activation-5.68e-02

Pos logit contributions
 All+0.094
Let+0.088
 who+0.087
 ties+0.080
 Stone+0.078
Neg logit contributions
 awful-0.190
arter-0.188
iri-0.188
ulp-0.186
 mir-0.184
Token share
Token ID2648
Feature activation+1.13e-01

Pos logit contributions
 awful+0.106
iri+0.103
gh+0.102
ulp+0.102
arter+0.102
Neg logit contributions
 All-0.079
Let-0.074
 who-0.073
 Stone-0.069
 ties-0.069
Token the
Token ID262
Feature activation+6.55e-02

Pos logit contributions
 All+0.163
Let+0.156
 Did+0.146
 Being+0.145
 Stone+0.145
Neg logit contributions
 awful-0.203
ulp-0.195
gh-0.195
iri-0.191
arter-0.189
Token bedroom
Token ID14043
Feature activation+1.12e-01

Pos logit contributions
 All+0.088
Let+0.083
 who+0.083
 Being+0.078
 Stone+0.076
Neg logit contributions
 awful-0.125
gh-0.119
ulp-0.119
arter-0.119
 mir-0.118
Token she
Token ID673
Feature activation-2.16e-02

Pos logit contributions
 All+0.167
Let+0.161
 who+0.158
 Stone+0.153
 ties+0.152
Neg logit contributions
 awful-0.217
gh-0.211
iri-0.210
 mir-0.208
arter-0.207
Token played
Token ID2826
Feature activation+6.75e-02

Pos logit contributions
 awful+0.045
gh+0.045
 mir+0.044
iri+0.043
ulp+0.042
Neg logit contributions
 All-0.023
Let-0.021
 Stone-0.021
 Being-0.020
 Using-0.020
Token with
Token ID351
Feature activation+1.53e-01

Pos logit contributions
 All+0.075
Let+0.074
 who+0.068
 Being+0.068
 Stone+0.067
Neg logit contributions
 awful-0.140
gh-0.138
ulp-0.134
arter-0.133
 mir-0.130
Token her
Token ID607
Feature activation-8.42e-02

Pos logit contributions
 All+0.190
Let+0.182
 who+0.176
 Stone+0.167
 ties+0.164
Neg logit contributions
 awful-0.306
ulp-0.294
iri-0.294
gh-0.293
arter-0.292
Token big
Token ID1263
Feature activation-1.00e-01

Pos logit contributions
 awful+0.163
arter+0.157
ulp+0.155
gh+0.155
 mir+0.152
Neg logit contributions
 All-0.109
Let-0.106
 who-0.099
 Being-0.097
 Did-0.094
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.212
iri+0.208
ulp+0.205
gh+0.205
 mir+0.205
Neg logit contributions
 All-0.115
Let-0.107
 who-0.106
 ties-0.099
 Stone-0.098
Token.
Token ID13
Feature activation+4.72e-03

Pos logit contributions
 All+0.399
Let+0.396
 Being+0.358
 Stone+0.355
 who+0.355
Neg logit contributions
 awful-0.521
gh-0.504
arter-0.491
iri-0.490
ulp-0.487
Token It
Token ID632
Feature activation+3.59e-02

Pos logit contributions
 ties+0.005
es+0.004
 led+0.004
 who+0.004
 kit+0.004
Neg logit contributions
iri-0.010
ife-0.010
arter-0.009
 awful-0.009
ummy-0.009
Token made
Token ID925
Feature activation+8.69e-02

Pos logit contributions
 All+0.026
 ties+0.026
 Stone+0.024
 who+0.024
 Did+0.024
Neg logit contributions
 awful-0.088
gh-0.082
 mir-0.080
 unpleasant-0.078
 awesome-0.078
Token her
Token ID607
Feature activation-8.60e-02

Pos logit contributions
 All+0.119
 who+0.108
Let+0.108
 ties+0.099
 Stone+0.098
Neg logit contributions
 mir-0.167
arter-0.166
 awful-0.165
iri-0.165
gh-0.164
Token very
Token ID845
Feature activation-1.75e-01

Pos logit contributions
 awful+0.152
ulp+0.143
arter+0.142
iri+0.142
gh+0.142
Neg logit contributions
 All-0.126
Let-0.123
 who-0.116
 Stone-0.113
 Being-0.113
Token.
Token ID13
Feature activation+3.32e-04

Pos logit contributions
 All+0.036
Let+0.035
 who+0.034
 Stone+0.032
 ties+0.032
Neg logit contributions
 awful-0.052
gh-0.050
ulp-0.050
iri-0.049
arter-0.049
Token It
Token ID632
Feature activation+4.20e-02

Pos logit contributions
 ties+0.000
Let+0.000
 Stone+0.000
 who+0.000
 led+0.000
Neg logit contributions
 awful-0.001
iri-0.001
 mir-0.001
gh-0.001
ulp-0.001
Token was
Token ID373
Feature activation+5.57e-02

Pos logit contributions
 All+0.042
 Stone+0.038
Let+0.038
 who+0.037
 Being+0.036
Neg logit contributions
 awful-0.097
gh-0.090
 mir-0.089
arter-0.087
iri-0.087
Token his
Token ID465
Feature activation-6.44e-02

Pos logit contributions
 All+0.076
 who+0.075
Let+0.072
 ties+0.069
 led+0.066
Neg logit contributions
iri-0.104
ulp-0.101
oor-0.100
arter-0.100
gh-0.099
Token little
Token ID1310
Feature activation+7.05e-02

Pos logit contributions
 awful+0.122
ulp+0.121
iri+0.120
urg+0.119
arter+0.118
Neg logit contributions
 All-0.086
 who-0.086
Let-0.081
 ties-0.079
 Stone-0.078
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 All+0.077
Let+0.072
 who+0.071
 ties+0.065
 Stone+0.065
Neg logit contributions
 awful-0.152
iri-0.148
ulp-0.147
gh-0.147
arter-0.147
Token!
Token ID0
Feature activation-8.79e-02

Pos logit contributions
 All+0.436
Let+0.427
 Stone+0.402
 ties+0.401
Find+0.389
Neg logit contributions
 awful-0.492
gh-0.468
iri-0.457
arter-0.457
 mir-0.451
Token Jack
Token ID3619
Feature activation-2.90e-02

Pos logit contributions
 awful+0.176
iri+0.170
arter+0.170
 mir+0.168
ulp+0.168
Neg logit contributions
 ties-0.088
 Stone-0.084
Let-0.083
 led-0.082
es-0.080
Token's
Token ID338
Feature activation+9.52e-02

Pos logit contributions
 awful+0.063
gh+0.060
iri+0.059
oto+0.058
urg+0.057
Neg logit contributions
 All-0.028
Let-0.025
 Stone-0.025
 ties-0.025
Find-0.025
Token brother
Token ID3956
Feature activation+2.89e-01

Pos logit contributions
 All+0.133
Let+0.127
 who+0.121
 Stone+0.117
 Being+0.116
Neg logit contributions
 awful-0.178
gh-0.171
iri-0.168
ulp-0.168
arter-0.167
Token wanted
Token ID2227
Feature activation+4.23e-02

Pos logit contributions
 All+0.298
Let+0.294
 Stone+0.285
 ties+0.282
Find+0.269
Neg logit contributions
 awful-0.619
gh-0.595
iri-0.574
 mir-0.564
ulp-0.560
Token.
Token ID13
Feature activation+3.32e-04

Pos logit contributions
 All+0.036
Let+0.035
 who+0.034
 Stone+0.032
 ties+0.032
Neg logit contributions
 awful-0.052
gh-0.050
ulp-0.050
iri-0.049
arter-0.049
Token It
Token ID632
Feature activation+4.20e-02

Pos logit contributions
 ties+0.000
Let+0.000
 Stone+0.000
 who+0.000
 led+0.000
Neg logit contributions
 awful-0.001
iri-0.001
 mir-0.001
gh-0.001
ulp-0.001
Token was
Token ID373
Feature activation+5.57e-02

Pos logit contributions
 All+0.042
 Stone+0.038
Let+0.038
 who+0.037
 Being+0.036
Neg logit contributions
 awful-0.097
gh-0.090
 mir-0.089
arter-0.087
iri-0.087
Token his
Token ID465
Feature activation-6.44e-02

Pos logit contributions
 All+0.076
 who+0.075
Let+0.072
 ties+0.069
 led+0.066
Neg logit contributions
iri-0.104
ulp-0.101
oor-0.100
arter-0.100
gh-0.099
Token little
Token ID1310
Feature activation+7.05e-02

Pos logit contributions
 awful+0.122
ulp+0.121
iri+0.120
urg+0.119
arter+0.118
Neg logit contributions
 All-0.086
 who-0.086
Let-0.081
 ties-0.079
 Stone-0.078
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 All+0.077
Let+0.072
 who+0.071
 ties+0.065
 Stone+0.065
Neg logit contributions
 awful-0.152
iri-0.148
ulp-0.147
gh-0.147
arter-0.147
Token!
Token ID0
Feature activation-8.79e-02

Pos logit contributions
 All+0.436
Let+0.427
 Stone+0.402
 ties+0.401
Find+0.389
Neg logit contributions
 awful-0.492
gh-0.468
iri-0.457
arter-0.457
 mir-0.451
Token Jack
Token ID3619
Feature activation-2.90e-02

Pos logit contributions
 awful+0.176
iri+0.170
arter+0.170
 mir+0.168
ulp+0.168
Neg logit contributions
 ties-0.088
 Stone-0.084
Let-0.083
 led-0.082
es-0.080
Token's
Token ID338
Feature activation+9.52e-02

Pos logit contributions
 awful+0.063
gh+0.060
iri+0.059
oto+0.058
urg+0.057
Neg logit contributions
 All-0.028
Let-0.025
 Stone-0.025
 ties-0.025
Find-0.025
Token brother
Token ID3956
Feature activation+2.89e-01

Pos logit contributions
 All+0.133
Let+0.127
 who+0.121
 Stone+0.117
 Being+0.116
Neg logit contributions
 awful-0.178
gh-0.171
iri-0.168
ulp-0.168
arter-0.167
Token wanted
Token ID2227
Feature activation+4.23e-02

Pos logit contributions
 All+0.298
Let+0.294
 Stone+0.285
 ties+0.282
Find+0.269
Neg logit contributions
 awful-0.619
gh-0.595
iri-0.574
 mir-0.564
ulp-0.560
Token was
Token ID373
Feature activation+5.57e-02

Pos logit contributions
 awful+0.083
gh+0.081
 mir+0.080
iri+0.080
ulp+0.079
Neg logit contributions
 All-0.039
Let-0.036
 Stone-0.034
 who-0.034
 Being-0.033
Token hungry
Token ID14720
Feature activation-9.71e-02

Pos logit contributions
 All+0.085
Let+0.081
 who+0.078
 Stone+0.076
 ties+0.075
Neg logit contributions
 awful-0.097
gh-0.093
iri-0.093
ulp-0.093
 mir-0.092
Token!
Token ID0
Feature activation-9.36e-02

Pos logit contributions
iri+0.221
 awful+0.218
arter+0.217
ulp+0.213
 mir+0.213
Neg logit contributions
 who-0.098
 All-0.098
Let-0.089
 ties-0.077
 led-0.076
Token The
Token ID383
Feature activation+1.01e-01

Pos logit contributions
 awful+0.187
iri+0.186
 mir+0.184
ulp+0.183
gh+0.180
Neg logit contributions
 ties-0.093
 Stone-0.089
Let-0.088
es-0.086
 draws-0.086
Token eldest
Token ID34331
Feature activation+1.23e-01

Pos logit contributions
 All+0.142
Let+0.134
 who+0.132
 ties+0.125
 Stone+0.125
Neg logit contributions
 awful-0.187
iri-0.181
 mir-0.180
ulp-0.179
arter-0.179
Token brother
Token ID3956
Feature activation+2.88e-01

Pos logit contributions
 All+0.151
Let+0.144
 ties+0.143
Find+0.142
Being+0.141
Neg logit contributions
iri-0.226
oto-0.223
 awful-0.219
 mir-0.219
urg-0.216
Token opened
Token ID4721
Feature activation+9.25e-03

Pos logit contributions
 All+0.313
Let+0.301
 Stone+0.285
 ties+0.282
 Being+0.277
Neg logit contributions
 awful-0.601
iri-0.589
 mir-0.578
gh-0.577
urg-0.569
Token the
Token ID262
Feature activation+6.53e-02

Pos logit contributions
 All+0.009
Let+0.008
 who+0.008
 ties+0.007
 Stone+0.007
Neg logit contributions
 awful-0.022
iri-0.021
arter-0.021
ulp-0.021
 mir-0.021
Token box
Token ID3091
Feature activation+1.63e-01

Pos logit contributions
 All+0.096
 who+0.092
Let+0.091
 ties+0.089
 Stone+0.087
Neg logit contributions
iri-0.115
gh-0.113
ulp-0.112
urg-0.111
 awful-0.110
Token of
Token ID286
Feature activation+7.42e-02

Pos logit contributions
 All+0.169
Let+0.157
 who+0.156
 ties+0.142
 Stone+0.141
Neg logit contributions
 awful-0.360
iri-0.352
arter-0.350
 mir-0.349
ulp-0.349
Token meat
Token ID6174
Feature activation-2.23e-02

Pos logit contributions
 All+0.125
Let+0.119
 who+0.115
 Being+0.112
 Stone+0.112
Neg logit contributions
 awful-0.119
ulp-0.112
 mir-0.112
iri-0.111
arter-0.111
Token had
Token ID550
Feature activation+8.22e-02

Pos logit contributions
 awful+0.064
gh+0.062
iri+0.060
 mir+0.060
oto+0.058
Neg logit contributions
 All-0.032
Let-0.029
 ties-0.028
 Stone-0.028
 Being-0.028
Token to
Token ID284
Feature activation-5.84e-02

Pos logit contributions
 All+0.127
Let+0.120
 who+0.118
 Stone+0.113
 ties+0.112
Neg logit contributions
 awful-0.142
gh-0.137
iri-0.136
ulp-0.136
 mir-0.136
Token listen
Token ID6004
Feature activation-3.30e-02

Pos logit contributions
 awful+0.111
arter+0.108
iri+0.108
ulp+0.107
 mir+0.106
Neg logit contributions
 All-0.081
 who-0.075
Let-0.073
 ties-0.071
 Stone-0.069
Token to
Token ID284
Feature activation-5.98e-02

Pos logit contributions
iri+0.071
arter+0.071
 mir+0.069
ulp+0.068
 awful+0.067
Neg logit contributions
 All-0.040
 who-0.040
Let-0.037
 ties-0.034
 Stone-0.032
Token her
Token ID607
Feature activation-9.22e-02

Pos logit contributions
 awful+0.104
ulp+0.099
iri+0.098
arter+0.098
gh+0.098
Neg logit contributions
 All-0.089
Let-0.089
 who-0.083
 Stone-0.082
 ties-0.082
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.149
gh+0.143
iri+0.143
ulp+0.142
arter+0.141
Neg logit contributions
 All-0.152
Let-0.148
 who-0.141
 Stone-0.135
 ties-0.135
Token.
Token ID13
Feature activation-8.24e-03

Pos logit contributions
 All+0.412
Let+0.410
 Stone+0.375
 ties+0.368
Find+0.365
Neg logit contributions
 awful-0.515
gh-0.493
ulp-0.477
arter-0.476
iri-0.469
Token So
Token ID1406
Feature activation+8.93e-02

Pos logit contributions
 awful+0.018
iri+0.017
ulp+0.017
ife+0.017
 mir+0.017
Neg logit contributions
 ties-0.007
Let-0.007
 Stone-0.007
 who-0.007
 All-0.007
Token she
Token ID673
Feature activation-2.79e-02

Pos logit contributions
 All+0.133
Let+0.131
 who+0.125
 ties+0.122
 Stone+0.122
Neg logit contributions
 awful-0.154
gh-0.148
iri-0.148
ulp-0.146
arter-0.145
Token went
Token ID1816
Feature activation-2.26e-02

Pos logit contributions
 awful+0.056
 mir+0.055
iri+0.054
gh+0.053
urg+0.053
Neg logit contributions
 All-0.033
 Being-0.030
Let-0.030
 Stone-0.029
 Trying-0.028
Token to
Token ID284
Feature activation-5.88e-02

Pos logit contributions
 awful+0.042
iri+0.040
ulp+0.040
arter+0.040
gh+0.039
Neg logit contributions
 All-0.033
 who-0.031
Let-0.031
 ties-0.030
 Stone-0.029
Token his
Token ID465
Feature activation-6.40e-02

Pos logit contributions
 All+0.099
Let+0.098
 Did+0.098
 Being+0.094
 who+0.093
Neg logit contributions
 awful-0.189
gh-0.187
 mir-0.179
arter-0.178
alt-0.176
Token bed
Token ID3996
Feature activation-1.66e-02

Pos logit contributions
iri+0.136
igi+0.133
ulp+0.132
 mir+0.131
 awful+0.130
Neg logit contributions
 who-0.079
 All-0.075
 ties-0.072
Let-0.070
 shop-0.068
Token when
Token ID618
Feature activation+1.12e-01

Pos logit contributions
 awful+0.031
gh+0.030
ulp+0.030
iri+0.030
arter+0.030
Neg logit contributions
 All-0.022
Let-0.021
 who-0.021
 ties-0.020
 Stone-0.020
Token the
Token ID262
Feature activation+6.53e-02

Pos logit contributions
 All+0.159
Let+0.151
 who+0.150
 ties+0.144
 Stone+0.144
Neg logit contributions
 awful-0.201
gh-0.196
 mir-0.193
iri-0.192
arter-0.190
Token little
Token ID1310
Feature activation+6.35e-02

Pos logit contributions
 All+0.102
Let+0.098
 who+0.096
 Stone+0.090
 ties+0.090
Neg logit contributions
iri-0.114
ulp-0.110
gh-0.109
urg-0.108
 awful-0.108
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 All+0.079
Being+0.070
 Using+0.069
 Being+0.069
es+0.068
Neg logit contributions
 storm-0.122
 awful-0.121
 rainy-0.114
arter-0.113
 mir-0.112
Token came
Token ID1625
Feature activation-4.83e-02

Pos logit contributions
 All+0.331
Let+0.317
 Being+0.313
Being+0.309
 Stone+0.309
Neg logit contributions
 awful-0.568
gh-0.562
 mir-0.548
iri-0.545
urg-0.532
Token in
Token ID287
Feature activation+1.18e-01

Pos logit contributions
 awful+0.087
iri+0.086
gh+0.085
arter+0.085
ulp+0.085
Neg logit contributions
 All-0.069
Let-0.064
 who-0.062
 ties-0.060
 Stone-0.060
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
 All+0.160
Let+0.148
 who+0.145
 Stone+0.144
 ties+0.141
Neg logit contributions
 awful-0.214
gh-0.210
ulp-0.210
 mir-0.207
iri-0.205
Token She
Token ID1375
Feature activation+5.14e-03

Pos logit contributions
 awful+0.030
arter+0.028
 mir+0.028
ulp+0.028
ife+0.027
Neg logit contributions
 ties-0.015
Let-0.014
 who-0.014
es-0.014
 Stone-0.013
Token was
Token ID373
Feature activation+4.42e-02

Pos logit contributions
 All+0.006
 Stone+0.005
Let+0.005
 ties+0.005
 Being+0.005
Neg logit contributions
 awful-0.010
 mir-0.010
iri-0.010
gh-0.010
oto-0.009
Token!
Token ID0
Feature activation-9.16e-02

Pos logit contributions
 awful+0.130
 mir+0.126
gh+0.124
arter+0.123
ulp+0.122
Neg logit contributions
 All-0.070
Let-0.067
 who-0.063
 Stone-0.061
 ties-0.060
Token\n
Token ID198
Feature activation+3.13e-02

Pos logit contributions
 awful+0.188
iri+0.184
 mir+0.181
gh+0.181
ulp+0.180
Neg logit contributions
 ties-0.089
 Stone-0.086
Let-0.084
 All-0.083
 who-0.081
Token\n
Token ID198
Feature activation+3.03e-02

Pos logit contributions
 All+0.028
Let+0.025
 who+0.025
 ties+0.023
 Stone+0.023
Neg logit contributions
 awful-0.073
iri-0.072
ulp-0.071
gh-0.071
 mir-0.071
TokenThe
Token ID464
Feature activation+4.48e-02

Pos logit contributions
 All+0.042
Let+0.039
 who+0.039
 ties+0.038
 Stone+0.038
Neg logit contributions
 awful-0.055
iri-0.055
gh-0.054
 mir-0.053
ulp-0.053
Token regular
Token ID3218
Feature activation-5.06e-02

Pos logit contributions
 All+0.062
Let+0.058
 who+0.057
 ties+0.054
 Stone+0.054
Neg logit contributions
 awful-0.084
iri-0.082
ulp-0.081
arter-0.081
gh-0.081
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.091
iri+0.089
 mir+0.089
ulp+0.088
gh+0.088
Neg logit contributions
 All-0.072
Let-0.068
 who-0.066
 Stone-0.063
 Being-0.063
Token would
Token ID561
Feature activation+1.79e-02

Pos logit contributions
 All+0.302
Let+0.299
 Stone+0.290
Find+0.276
 ties+0.274
Neg logit contributions
 awful-0.594
iri-0.564
gh-0.559
 mir-0.555
urg-0.549
Token show
Token ID905
Feature activation+4.23e-02

Pos logit contributions
 All+0.024
Let+0.023
 who+0.023
 ties+0.021
 Stone+0.021
Neg logit contributions
 awful-0.034
iri-0.033
ulp-0.033
arter-0.033
gh-0.033
Token her
Token ID607
Feature activation-9.69e-02

Pos logit contributions
 All+0.055
Let+0.053
 who+0.051
 Stone+0.049
 ties+0.049
Neg logit contributions
 awful-0.082
iri-0.079
gh-0.079
ulp-0.079
arter-0.078
Token little
Token ID1310
Feature activation+6.58e-02

Pos logit contributions
 awful+0.186
gh+0.177
ulp+0.176
arter+0.170
iri+0.170
Neg logit contributions
Let-0.127
 All-0.126
 Stone-0.115
 Did-0.115
 Being-0.113
Token sister
Token ID6621
Feature activation+2.89e-01

Pos logit contributions
 All+0.074
Let+0.067
 who+0.065
 Being+0.061
 Stone+0.061
Neg logit contributions
 awful-0.140
ulp-0.135
 mir-0.135
arter-0.134
iri-0.134
Token.
Token ID13
Feature activation+3.31e-03

Pos logit contributions
 awful+0.047
gh+0.045
iri+0.044
ulp+0.044
oto+0.044
Neg logit contributions
 ties-0.015
 All-0.015
Let-0.015
 Stone-0.014
 draws-0.014
Token But
Token ID887
Feature activation+5.02e-02

Pos logit contributions
 ties+0.003
 Stone+0.003
 All+0.003
Let+0.003
 who+0.003
Neg logit contributions
 awful-0.007
iri-0.007
 mir-0.007
arter-0.007
gh-0.007
Token then
Token ID788
Feature activation+2.58e-02

Pos logit contributions
 All+0.068
Let+0.065
 Stone+0.061
 ties+0.060
 who+0.060
Neg logit contributions
 awful-0.093
iri-0.089
gh-0.089
 mir-0.088
arter-0.088
Token her
Token ID607
Feature activation-8.51e-02

Pos logit contributions
 All+0.036
Let+0.035
 who+0.033
 Stone+0.033
 ties+0.032
Neg logit contributions
 awful-0.047
gh-0.045
iri-0.045
 mir-0.045
ulp-0.044
Token big
Token ID1263
Feature activation-1.04e-01

Pos logit contributions
 awful+0.144
gh+0.139
iri+0.138
ulp+0.137
arter+0.136
Neg logit contributions
 All-0.132
Let-0.126
 who-0.122
 Stone-0.118
 ties-0.117
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.238
iri+0.232
arter+0.231
ulp+0.231
gh+0.231
Neg logit contributions
 All-0.098
Let-0.091
 who-0.089
 Stone-0.080
 ties-0.080
Token came
Token ID1625
Feature activation-4.82e-02

Pos logit contributions
 All+0.324
Let+0.313
 who+0.294
 Stone+0.292
 ties+0.285
Neg logit contributions
 awful-0.597
gh-0.581
iri-0.572
 mir-0.569
arter-0.562
Token home
Token ID1363
Feature activation-1.14e-01

Pos logit contributions
 awful+0.104
iri+0.100
ulp+0.100
 mir+0.100
arter+0.100
Neg logit contributions
 All-0.056
 who-0.055
Let-0.054
 Stone-0.048
 ties-0.048
Token from
Token ID422
Feature activation+2.57e-03

Pos logit contributions
iri+0.253
 awful+0.252
gh+0.247
 mir+0.247
ulp+0.245
Neg logit contributions
 All-0.119
 who-0.117
Let-0.109
 Stone-0.097
 ties-0.097
Token school
Token ID1524
Feature activation+1.04e-02

Pos logit contributions
 All+0.003
Let+0.003
 who+0.003
 ties+0.003
 Stone+0.003
Neg logit contributions
 awful-0.005
iri-0.005
gh-0.005
ulp-0.005
 mir-0.005
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 All+0.015
Let+0.015
 Stone+0.014
 ties+0.014
Find+0.013
Neg logit contributions
 awful-0.018
gh-0.017
ulp-0.017
 mir-0.017
oor-0.016
Token the
Token ID262
Feature activation+5.76e-02

Pos logit contributions
 awful+0.013
gh+0.012
 mir+0.012
arter+0.012
ulp+0.012
Neg logit contributions
 All-0.009
 Stone-0.008
Let-0.008
 Being-0.008
 Using-0.008
Token other
Token ID584
Feature activation+4.76e-02

Pos logit contributions
 All+0.073
Let+0.068
 who+0.067
 Stone+0.063
 ties+0.063
Neg logit contributions
 awful-0.115
iri-0.110
 mir-0.110
gh-0.109
arter-0.109
Token was
Token ID373
Feature activation+4.65e-02

Pos logit contributions
 All+0.055
Let+0.052
 Stone+0.050
 who+0.049
 ties+0.049
Neg logit contributions
 awful-0.097
gh-0.093
ulp-0.092
 mir-0.092
arter-0.091
Token the
Token ID262
Feature activation+6.29e-02

Pos logit contributions
 All+0.073
 who+0.071
Let+0.070
 ties+0.065
 Stone+0.063
Neg logit contributions
iri-0.078
 awful-0.077
arter-0.077
ulp-0.076
 mir-0.075
Token younger
Token ID7099
Feature activation+1.49e-01

Pos logit contributions
 All+0.095
Let+0.091
 who+0.090
 ties+0.085
 Stone+0.085
Neg logit contributions
 awful-0.109
iri-0.107
ulp-0.106
arter-0.105
gh-0.105
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 All+0.164
Let+0.145
 Stone+0.145
 Using+0.144
 ties+0.142
Neg logit contributions
 awful-0.305
gh-0.295
 mir-0.291
arter-0.290
iri-0.286
Token.
Token ID13
Feature activation-6.87e-03

Pos logit contributions
 All+0.418
Let+0.407
 Stone+0.389
 ties+0.385
Find+0.374
Neg logit contributions
 awful-0.491
gh-0.476
 mir-0.451
ulp-0.449
arter-0.447
Token They
Token ID1119
Feature activation+2.19e-02

Pos logit contributions
 awful+0.013
ife+0.013
arter+0.013
iri+0.013
 mir+0.013
Neg logit contributions
 ties-0.007
es-0.006
 Stone-0.006
 led-0.006
 puzzles-0.006
Token were
Token ID547
Feature activation+5.02e-02

Pos logit contributions
 Trying+0.025
 Being+0.025
 All+0.025
 Stone+0.024
 Using+0.024
Neg logit contributions
 awful-0.042
 mir-0.037
alt-0.037
iri-0.037
oor-0.036
Token walking
Token ID6155
Feature activation-2.76e-02

Pos logit contributions
 All+0.078
Let+0.075
 who+0.073
 Stone+0.071
 ties+0.070
Neg logit contributions
 awful-0.086
iri-0.082
gh-0.081
arter-0.081
 mir-0.081
Token down
Token ID866
Feature activation+5.44e-02

Pos logit contributions
 awful+0.055
iri+0.054
gh+0.053
 mir+0.053
ulp+0.053
Neg logit contributions
 All-0.034
Let-0.031
 who-0.031
 ties-0.029
 Stone-0.029
Token very
Token ID845
Feature activation-1.79e-01

Pos logit contributions
 awful+0.030
gh+0.028
ulp+0.028
arter+0.028
iri+0.027
Neg logit contributions
 All-0.022
Let-0.021
 Stone-0.020
 ties-0.019
 who-0.019
Token well
Token ID880
Feature activation+2.71e-02

Pos logit contributions
gh+0.362
iri+0.360
 awful+0.357
oto+0.353
urg+0.353
Neg logit contributions
 All-0.228
 who-0.213
Let-0.207
 ties-0.192
 Stone-0.187
Token.
Token ID13
Feature activation+2.98e-03

Pos logit contributions
 who+0.029
 All+0.029
Let+0.027
 ties+0.025
 Stone+0.025
Neg logit contributions
iri-0.059
 awful-0.058
ulp-0.058
arter-0.057
 mir-0.057
Token Her
Token ID2332
Feature activation-2.92e-02

Pos logit contributions
 ties+0.003
es+0.003
 who+0.003
 batteries+0.003
 tapes+0.003
Neg logit contributions
 awful-0.005
iri-0.005
ife-0.005
ulp-0.005
ummy-0.005
Token big
Token ID1263
Feature activation-1.04e-01

Pos logit contributions
 awful+0.053
iri+0.051
gh+0.051
ulp+0.051
arter+0.051
Neg logit contributions
 All-0.043
Let-0.041
 who-0.040
 Stone-0.037
 ties-0.037
Token sister
Token ID6621
Feature activation+2.88e-01

Pos logit contributions
 awful+0.237
arter+0.232
ulp+0.230
iri+0.230
 mir+0.229
Neg logit contributions
 All-0.101
Let-0.095
 who-0.091
 Being-0.082
 Stone-0.081
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
 All+0.301
Let+0.300
 Stone+0.291
 ties+0.289
 Being+0.280
Neg logit contributions
 awful-0.593
gh-0.578
iri-0.565
 mir-0.551
oto-0.549
Token Emma
Token ID18966
Feature activation+2.90e-02

Pos logit contributions
 awful+0.025
iri+0.023
 mir+0.023
oto+0.023
ulp+0.023
Neg logit contributions
 All-0.014
 ties-0.013
 Stone-0.013
Let-0.013
 draws-0.013
Token,
Token ID11
Feature activation-8.60e-03

Pos logit contributions
 All+0.047
Let+0.045
 ties+0.043
Make+0.043
Find+0.043
Neg logit contributions
 awful-0.044
gh-0.043
iri-0.042
 mir-0.041
urg-0.041
Token had
Token ID550
Feature activation+8.44e-02

Pos logit contributions
 awful+0.017
iri+0.017
oto+0.016
 mir+0.016
ulp+0.016
Neg logit contributions
 All-0.010
 ties-0.009
 Stone-0.009
 tapes-0.008
Find-0.008
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 All+0.095
Let+0.088
 who+0.088
 Stone+0.083
 ties+0.081
Neg logit contributions
 awful-0.180
gh-0.174
iri-0.173
 mir-0.173
ulp-0.171
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 All+0.161
 who+0.154
Let+0.153
 ties+0.141
 Stone+0.139
Neg logit contributions
iri-0.249
 awful-0.245
arter-0.244
 mir-0.241
urg-0.239
Token pointed
Token ID6235
Feature activation-8.36e-03

Pos logit contributions
 awful+0.041
 mir+0.038
ulp+0.036
 refreshing+0.035
urg+0.035
Neg logit contributions
 All-0.032
Being-0.030
 Being-0.029
 Stone-0.029
If-0.027
Token at
Token ID379
Feature activation+8.06e-03

Pos logit contributions
arter+0.017
iri+0.017
 awful+0.017
 mir+0.017
urg+0.017
Neg logit contributions
 All-0.010
 who-0.010
Let-0.010
 ties-0.009
 Stone-0.008
Token the
Token ID262
Feature activation+5.47e-02

Pos logit contributions
mun+0.012
 All+0.012
 Did+0.012
Let+0.011
Being+0.011
Neg logit contributions
 awful-0.013
ulp-0.012
iri-0.012
arter-0.011
oto-0.011
Token grass
Token ID8701
Feature activation-9.79e-03

Pos logit contributions
 All+0.085
 Being+0.076
 who+0.075
Let+0.075
 Using+0.073
Neg logit contributions
 awful-0.097
ulp-0.090
gh-0.088
iri-0.088
 mir-0.087
Tokenho
Token ID8873
Feature activation-3.29e-01

Pos logit contributions
 awful+0.019
iri+0.018
ulp+0.018
gh+0.018
arter+0.018
Neg logit contributions
 All-0.013
Let-0.012
 who-0.012
 ties-0.011
 Stone-0.011
Tokenpper
Token ID2848
Feature activation-6.41e-02

Pos logit contributions
 awful+0.418
 mir+0.393
arter+0.392
ulp+0.389
iri+0.380
Neg logit contributions
 All-0.658
 who-0.647
Let-0.638
 ties-0.617
 Stone-0.614
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 awful+0.110
ulp+0.105
gh+0.105
arter+0.104
iri+0.104
Neg logit contributions
 All-0.095
Let-0.094
 Stone-0.084
Being-0.084
 Being-0.084
Token He
Token ID679
Feature activation+1.33e-02

Pos logit contributions
 awful+0.034
 mir+0.034
ulp+0.034
arter+0.033
gh+0.032
Neg logit contributions
 who-0.018
mun-0.017
Let-0.017
 ties-0.016
 All-0.016
Token said
Token ID531
Feature activation-1.92e-02

Pos logit contributions
 All+0.020
 Being+0.018
 Stone+0.018
 Did+0.018
 Trying+0.018
Neg logit contributions
 awful-0.023
iri-0.021
 mir-0.020
oto-0.020
gh-0.019
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
 awful+0.040
iri+0.040
arter+0.039
ulp+0.039
 mir+0.039
Neg logit contributions
 All-0.023
 who-0.022
Let-0.021
 Stone-0.019
es-0.019
Token in
Token ID287
Feature activation+1.09e-01

Pos logit contributions
 All+0.084
Let+0.081
 who+0.076
 ties+0.076
 Stone+0.075
Neg logit contributions
 awful-0.141
gh-0.139
 mir-0.135
ulp-0.133
arter-0.133
Token circles
Token ID13332
Feature activation+6.53e-02

Pos logit contributions
 All+0.090
Let+0.082
 who+0.079
 Stone+0.073
 ties+0.073
Neg logit contributions
 awful-0.264
iri-0.256
ulp-0.256
 mir-0.255
gh-0.255
Token with
Token ID351
Feature activation+1.39e-01

Pos logit contributions
 All+0.087
Let+0.083
 who+0.081
 Stone+0.077
 ties+0.077
Neg logit contributions
 awful-0.124
gh-0.120
iri-0.119
ulp-0.119
 mir-0.118
Token the
Token ID262
Feature activation+6.85e-02

Pos logit contributions
 All+0.162
Let+0.151
 who+0.149
 ties+0.138
 Stone+0.136
Neg logit contributions
 awful-0.293
iri-0.286
arter-0.285
ulp-0.284
gh-0.284
Token grass
Token ID8701
Feature activation-6.29e-03

Pos logit contributions
 All+0.100
Let+0.095
 who+0.094
 ties+0.089
 Stone+0.089
Neg logit contributions
 awful-0.124
iri-0.120
ulp-0.119
gh-0.119
arter-0.119
Tokenho
Token ID8873
Feature activation-3.28e-01

Pos logit contributions
 awful+0.012
iri+0.012
ulp+0.012
arter+0.012
gh+0.012
Neg logit contributions
 All-0.008
Let-0.008
 who-0.008
 ties-0.007
 Stone-0.007
Tokenpper
Token ID2848
Feature activation-6.56e-02

Pos logit contributions
 awful+0.454
 mir+0.421
arter+0.419
ulp+0.410
 storm+0.381
Neg logit contributions
 who-0.638
 All-0.631
Let-0.610
es-0.608
 ties-0.604
Token.
Token ID13
Feature activation-3.61e-03

Pos logit contributions
 awful+0.118
gh+0.114
ulp+0.113
iri+0.112
 mir+0.112
Neg logit contributions
 All-0.090
Let-0.088
 who-0.082
 Being-0.080
 Stone-0.080
Token They
Token ID1119
Feature activation+2.25e-02

Pos logit contributions
 awful+0.007
oor+0.007
gh+0.007
oto+0.007
 mir+0.007
Neg logit contributions
 who-0.004
es-0.004
 Stone-0.004
 ties-0.004
Let-0.004
Token jumped
Token ID11687
Feature activation-1.20e-01

Pos logit contributions
 All+0.030
 Being+0.029
 Using+0.028
 Trying+0.028
 Did+0.028
Neg logit contributions
 awful-0.039
gh-0.038
iri-0.038
 mir-0.038
alt-0.037
Token and
Token ID290
Feature activation-8.74e-03

Pos logit contributions
arter+0.304
iri+0.298
ulp+0.293
 awful+0.293
 mir+0.291
Neg logit contributions
 All-0.110
Let-0.095
 who-0.094
 ties-0.090
 Stone-0.083
Token so
Token ID523
Feature activation-1.34e-01

Pos logit contributions
 All+0.062
Let+0.059
 who+0.058
 Stone+0.057
 ties+0.056
Neg logit contributions
 awful-0.062
iri-0.059
gh-0.059
ulp-0.059
 mir-0.059
Token happy
Token ID3772
Feature activation-5.75e-02

Pos logit contributions
 awful+0.243
arter+0.239
gh+0.238
iri+0.238
ulp+0.236
Neg logit contributions
 All-0.201
 who-0.191
Let-0.190
 ties-0.172
 Stone-0.170
Token that
Token ID326
Feature activation-4.24e-02

Pos logit contributions
 awful+0.104
iri+0.101
ulp+0.100
gh+0.100
arter+0.100
Neg logit contributions
 All-0.083
Let-0.079
 who-0.078
 Stone-0.074
 ties-0.074
Token the
Token ID262
Feature activation+6.89e-02

Pos logit contributions
 awful+0.073
 mir+0.071
gh+0.071
iri+0.070
arter+0.069
Neg logit contributions
 All-0.062
Let-0.061
 who-0.059
 ties-0.057
 Stone-0.057
Token grass
Token ID8701
Feature activation-5.65e-03

Pos logit contributions
 All+0.104
Let+0.099
 who+0.098
 ties+0.093
 Stone+0.093
Neg logit contributions
 awful-0.122
iri-0.118
ulp-0.117
gh-0.116
arter-0.116
Tokenho
Token ID8873
Feature activation-3.28e-01

Pos logit contributions
 awful+0.014
iri+0.013
 mir+0.013
gh+0.013
arter+0.013
Neg logit contributions
 All-0.004
Let-0.004
 who-0.004
 ties-0.004
 Being-0.004
Tokenpper
Token ID2848
Feature activation-6.56e-02

Pos logit contributions
gh+0.330
iri+0.310
 awful+0.294
igi+0.286
oor+0.284
Neg logit contributions
 All-0.760
Let-0.714
 who-0.703
 Being-0.695
 ties-0.683
Token had
Token ID550
Feature activation+7.50e-02

Pos logit contributions
 awful+0.149
 mir+0.143
gh+0.140
iri+0.140
ulp+0.138
Neg logit contributions
 All-0.058
Let-0.053
 Being-0.050
Being-0.049
 Stone-0.048
Token said
Token ID531
Feature activation-6.16e-03

Pos logit contributions
 All+0.090
Let+0.084
 who+0.083
 ties+0.077
 Stone+0.076
Neg logit contributions
 awful-0.155
iri-0.153
arter-0.152
ulp-0.151
gh-0.150
Token yes
Token ID3763
Feature activation-4.82e-02

Pos logit contributions
 awful+0.012
arter+0.011
iri+0.011
ulp+0.011
 mir+0.011
Neg logit contributions
 All-0.009
 who-0.008
Let-0.008
 Stone-0.007
 ties-0.007
Token.
Token ID13
Feature activation-3.16e-03

Pos logit contributions
arter+0.115
ulp+0.114
igi+0.110
iri+0.108
 mir+0.107
Neg logit contributions
 All-0.056
 who-0.056
 was-0.045
 shop-0.041
 "-0.040
Token "
Token ID366
Feature activation+1.45e-01

Pos logit contributions
iri+0.013
ulp+0.013
 awful+0.013
 mir+0.013
arter+0.013
Neg logit contributions
 All-0.005
 who-0.005
Let-0.005
 ties-0.005
 Stone-0.005
TokenIs
Token ID3792
Feature activation+2.46e-02

Pos logit contributions
 All+0.158
 who+0.140
 ties+0.137
 Stone+0.136
es+0.133
Neg logit contributions
 awful-0.305
iri-0.304
 mir-0.298
gh-0.293
oto-0.291
Token it
Token ID340
Feature activation-7.50e-02

Pos logit contributions
 All+0.033
Let+0.031
 who+0.030
 Stone+0.029
 ties+0.028
Neg logit contributions
 awful-0.048
iri-0.046
gh-0.046
 mir-0.045
ulp-0.045
Token a
Token ID257
Feature activation+2.73e-02

Pos logit contributions
 awful+0.169
ulp+0.168
iri+0.167
gh+0.165
 mir+0.165
Neg logit contributions
 All-0.075
 who-0.071
Let-0.066
 ties-0.063
 Stone-0.063
Token grass
Token ID8701
Feature activation-6.70e-03

Pos logit contributions
 who+0.040
 All+0.039
Let+0.038
 Stone+0.037
 ties+0.036
Neg logit contributions
urg-0.049
ulp-0.048
 awful-0.048
iri-0.047
arter-0.046
Tokenho
Token ID8873
Feature activation-3.28e-01

Pos logit contributions
 awful+0.013
ulp+0.013
gh+0.013
iri+0.012
 mir+0.012
Neg logit contributions
 All-0.009
 who-0.008
Let-0.008
 Stone-0.008
 ties-0.008
Tokenpper
Token ID2848
Feature activation-6.29e-02

Pos logit contributions
 awful+0.343
iri+0.331
gh+0.329
ulp+0.322
arter+0.321
Neg logit contributions
 All-0.726
Let-0.702
 who-0.691
 ties-0.666
 Stone-0.666
Token?"
Token ID1701
Feature activation+3.78e-02

Pos logit contributions
 awful+0.108
iri+0.105
arter+0.104
 mir+0.104
gh+0.104
Neg logit contributions
 All-0.096
Let-0.092
 who-0.089
 ties-0.085
 Stone-0.085
Token she
Token ID673
Feature activation-3.82e-02

Pos logit contributions
 All+0.027
 who+0.024
Let+0.024
 Stone+0.023
 ties+0.022
Neg logit contributions
 awful-0.094
iri-0.093
 mir-0.092
ulp-0.092
gh-0.090
Token guesses
Token ID44774
Feature activation-5.49e-03

Pos logit contributions
 awful+0.080
iri+0.074
 mir+0.074
oto+0.073
gh+0.072
Neg logit contributions
 All-0.042
Let-0.040
 who-0.040
 Stone-0.037
Make-0.036
Token.
Token ID13
Feature activation-5.26e-03

Pos logit contributions
 awful+0.010
gh+0.010
ulp+0.010
iri+0.010
arter+0.009
Neg logit contributions
 All-0.008
Let-0.007
 who-0.007
 Stone-0.007
 ties-0.007
TokenWow
Token ID22017
Feature activation-3.70e-02

Pos logit contributions
 All+0.189
 who+0.185
 ties+0.179
 led+0.173
 Stone+0.173
Neg logit contributions
 awful-0.263
iri-0.263
gh-0.259
ww-0.255
oto-0.249
Token!
Token ID0
Feature activation-9.98e-02

Pos logit contributions
 awful+0.063
 mir+0.062
iri+0.061
ulp+0.061
arter+0.060
Neg logit contributions
 All-0.059
 who-0.056
Let-0.054
 Stone-0.052
 ties-0.052
Token These
Token ID2312
Feature activation+1.12e-01

Pos logit contributions
mmm+0.183
ww+0.178
oor+0.177
arter+0.175
mmmm+0.170
Neg logit contributions
 who-0.127
 led-0.124
 ties-0.124
mun-0.120
Let-0.113
Token are
Token ID389
Feature activation+4.35e-02

Pos logit contributions
 All+0.140
Let+0.135
 who+0.130
 Stone+0.127
 Being+0.121
Neg logit contributions
 awful-0.226
 mir-0.212
iri-0.210
gh-0.210
arter-0.208
Token so
Token ID523
Feature activation-1.38e-01

Pos logit contributions
 All+0.083
 who+0.080
Let+0.079
 ties+0.076
 Stone+0.075
Neg logit contributions
ulp-0.057
iri-0.057
 awful-0.057
arter-0.056
gh-0.056
Tokenooo
Token ID34160
Feature activation-3.27e-01

Pos logit contributions
iri+0.168
ulp+0.163
 awful+0.161
arter+0.160
gh+0.160
Neg logit contributions
 All-0.287
Let-0.269
 who-0.268
 ties-0.255
 Stone-0.253
Token spicy
Token ID26880
Feature activation-1.47e-01

Pos logit contributions
iri+0.438
ulp+0.433
arter+0.427
 mir+0.422
 awful+0.415
Neg logit contributions
 All-0.641
 who-0.606
Let-0.596
 ties-0.577
 Stone-0.563
Token".
Token ID1911
Feature activation-4.76e-03

Pos logit contributions
 awful+0.205
iri+0.205
ulp+0.201
arter+0.198
 mir+0.197
Neg logit contributions
 All-0.275
 who-0.261
Let-0.257
 ties-0.245
 Stone-0.243
Token They
Token ID1119
Feature activation+1.99e-02

Pos logit contributions
 awful+0.009
gh+0.009
ulp+0.009
oto+0.009
oor+0.009
Neg logit contributions
 ties-0.005
 who-0.005
Let-0.005
 Stone-0.005
 All-0.004
Token both
Token ID1111
Feature activation-3.72e-02

Pos logit contributions
 All+0.027
 Being+0.025
Let+0.025
 Stone+0.025
 Trying+0.024
Neg logit contributions
 awful-0.037
iri-0.036
ulp-0.034
oto-0.034
gh-0.034
Token laughed
Token ID13818
Feature activation-1.85e-02

Pos logit contributions
 awful+0.074
iri+0.071
ulp+0.070
 mir+0.069
gh+0.069
Neg logit contributions
 All-0.046
Let-0.043
 Stone-0.042
 ties-0.042
 who-0.041
Token an
Token ID281
Feature activation-2.20e-02

Pos logit contributions
 All+0.031
Let+0.028
 Stone+0.028
 who+0.027
 ties+0.027
Neg logit contributions
 awful-0.081
iri-0.078
ulp-0.078
gh-0.077
oto-0.076
Token ant
Token ID1885
Feature activation+4.18e-02

Pos logit contributions
gh+0.038
ulp+0.035
iri+0.035
 awful+0.035
arter+0.034
Neg logit contributions
 who-0.037
 All-0.036
Let-0.033
 Stone-0.032
 ties-0.031
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
 All+0.044
Let+0.044
Find+0.042
 ties+0.041
 Stone+0.041
Neg logit contributions
 awful-0.085
arter-0.084
 mir-0.084
iri-0.083
gh-0.082
Token a
Token ID257
Feature activation+3.05e-02

Pos logit contributions
 awful+0.019
 mir+0.018
iri+0.018
gh+0.018
arter+0.018
Neg logit contributions
 All-0.015
 ties-0.014
Let-0.014
 Stone-0.014
pen-0.014
Token grass
Token ID8701
Feature activation-4.12e-03

Pos logit contributions
 All+0.043
Let+0.040
 ties+0.039
 who+0.038
 Being+0.038
Neg logit contributions
 awful-0.055
 mir-0.054
iri-0.054
arter-0.053
gh-0.052
Tokenho
Token ID8873
Feature activation-3.26e-01

Pos logit contributions
 awful+0.005
iri+0.005
arter+0.005
ulp+0.005
gh+0.005
Neg logit contributions
 All-0.008
Let-0.008
 ties-0.008
 draws-0.008
 Being-0.007
Tokenpper
Token ID2848
Feature activation-6.31e-02

Pos logit contributions
 awful+0.414
 mir+0.384
ulp+0.376
arter+0.376
iri+0.356
Neg logit contributions
 All-0.660
 who-0.660
Let-0.631
 Stone-0.619
es-0.613
Token.
Token ID13
Feature activation-7.34e-03

Pos logit contributions
 mir+0.073
 awful+0.072
iri+0.071
arter+0.066
oto+0.066
Neg logit contributions
Find-0.115
 tapes-0.115
 draws-0.113
 ties-0.113
 Using-0.111
Token The
Token ID383
Feature activation+1.04e-01

Pos logit contributions
iri+0.011
 mir+0.011
arter+0.011
 storm+0.011
 awful+0.011
Neg logit contributions
es-0.011
 kit-0.010
mun-0.010
pen-0.010
 who-0.010
Token ant
Token ID1885
Feature activation+4.21e-02

Pos logit contributions
 All+0.148
Let+0.145
 who+0.139
 Stone+0.132
 ties+0.130
Neg logit contributions
gh-0.196
 awful-0.193
ulp-0.189
arter-0.189
urg-0.188
Token was
Token ID373
Feature activation+3.55e-02

Pos logit contributions
 All+0.040
 Using+0.039
Find+0.038
pen+0.036
 Stone+0.036
Neg logit contributions
 awful-0.090
 mir-0.089
gh-0.087
iri-0.086
oto-0.083
Token lady
Token ID10846
Feature activation+6.57e-02

Pos logit contributions
 All+0.041
Let+0.039
 who+0.038
 ties+0.036
 Stone+0.036
Neg logit contributions
 awful-0.053
iri-0.052
ulp-0.051
arter-0.051
gh-0.051
Tokenbug
Token ID25456
Feature activation+1.01e-01

Pos logit contributions
 All+0.079
Let+0.074
Being+0.065
 Stone+0.065
 Being+0.064
Neg logit contributions
iri-0.134
gh-0.132
 mir-0.129
arter-0.128
 awful-0.125
Token and
Token ID290
Feature activation-8.27e-03

Pos logit contributions
 All+0.151
Let+0.147
Being+0.139
 Being+0.139
 Trying+0.133
Neg logit contributions
iri-0.169
 mir-0.168
gh-0.165
arter-0.164
 awful-0.163
Token a
Token ID257
Feature activation+2.90e-02

Pos logit contributions
 awful+0.015
 mir+0.014
iri+0.014
oto+0.014
ww+0.014
Neg logit contributions
 Stone-0.010
 All-0.010
pen-0.010
 Using-0.010
 Trying-0.010
Token grass
Token ID8701
Feature activation-5.45e-03

Pos logit contributions
 All+0.042
Let+0.039
 who+0.038
 Stone+0.036
 ties+0.036
Neg logit contributions
 awful-0.052
iri-0.051
 mir-0.051
ulp-0.051
gh-0.050
Tokenho
Token ID8873
Feature activation-3.26e-01

Pos logit contributions
iri+0.008
 awful+0.008
arter+0.007
 mir+0.007
gh+0.007
Neg logit contributions
 All-0.010
Let-0.010
 who-0.009
 Being-0.009
 ties-0.009
Tokenpper
Token ID2848
Feature activation-6.32e-02

Pos logit contributions
 awful+0.441
 mir+0.400
arter+0.393
ulp+0.388
 enjoyable+0.375
Neg logit contributions
 who-0.644
 All-0.629
es-0.618
Let-0.608
 ties-0.603
Token.
Token ID13
Feature activation-6.82e-03

Pos logit contributions
iri+0.097
 mir+0.096
 awful+0.096
arter+0.093
ulp+0.093
Neg logit contributions
 All-0.100
Let-0.097
Find-0.091
Being-0.091
Make-0.089
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
ulp+0.014
iri+0.013
 mir+0.013
oto+0.013
 awful+0.013
Neg logit contributions
es-0.007
 who-0.007
 Stone-0.007
 kit-0.007
Let-0.007
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
 All+0.011
Let+0.010
 who+0.010
 Stone+0.009
 ties+0.009
Neg logit contributions
iri-0.041
gh-0.040
 awful-0.040
 mir-0.040
oto-0.040
TokenOne
Token ID3198
Feature activation-6.70e-02

Pos logit contributions
 All+0.023
Let+0.022
 who+0.022
 ties+0.020
 Stone+0.020
Neg logit contributions
 awful-0.032
iri-0.031
ulp-0.031
arter-0.031
gh-0.031
Token
Token ID198
Feature activation+1.95e-02

Pos logit contributions
 All+0.009
Let+0.008
 who+0.007
 ties+0.006
 Stone+0.006
Neg logit contributions
 awful-0.053
iri-0.052
ulp-0.052
arter-0.052
gh-0.052
TokenThe
Token ID464
Feature activation+3.64e-02

Pos logit contributions
 All+0.028
Let+0.026
 who+0.026
 ties+0.024
 Stone+0.024
Neg logit contributions
 awful-0.036
iri-0.035
arter-0.035
ulp-0.035
 mir-0.035
Token nos
Token ID43630
Feature activation-3.28e-02

Pos logit contributions
 All+0.053
 who+0.050
Let+0.050
 Stone+0.047
 ties+0.047
Neg logit contributions
 awful-0.066
iri-0.064
 mir-0.063
gh-0.062
ulp-0.062
Tokeny
Token ID88
Feature activation-4.24e-02

Pos logit contributions
 awful+0.053
iri+0.052
arter+0.051
ulp+0.051
 mir+0.051
Neg logit contributions
 All-0.053
Let-0.051
 who-0.050
 ties-0.048
 Stone-0.048
Token grass
Token ID8701
Feature activation-3.32e-03

Pos logit contributions
 awful+0.089
ulp+0.088
arter+0.087
gh+0.087
iri+0.086
Neg logit contributions
 All-0.050
Let-0.047
 who-0.046
 Stone-0.043
 ties-0.042
Tokenho
Token ID8873
Feature activation-3.26e-01

Pos logit contributions
 awful+0.006
iri+0.006
gh+0.006
 mir+0.006
arter+0.006
Neg logit contributions
 All-0.005
Let-0.004
 who-0.004
 ties-0.004
 Stone-0.004
Tokenpper
Token ID2848
Feature activation-6.37e-02

Pos logit contributions
gh+0.287
iri+0.284
 awful+0.277
ulp+0.258
oto+0.256
Neg logit contributions
 All-0.776
Let-0.745
 who-0.734
 ties-0.714
 Being-0.712
Token hopped
Token ID45230
Feature activation-5.68e-03

Pos logit contributions
 awful+0.130
 mir+0.128
gh+0.126
iri+0.126
ulp+0.123
Neg logit contributions
 All-0.068
Let-0.064
 Being-0.063
Being-0.061
 Stone-0.060
Token over
Token ID625
Feature activation-3.93e-02

Pos logit contributions
arter+0.013
iri+0.013
ulp+0.013
urg+0.013
 awful+0.012
Neg logit contributions
 All-0.006
Let-0.006
 who-0.006
 ties-0.005
 led-0.005
Token to
Token ID284
Feature activation-6.02e-02

Pos logit contributions
 awful+0.076
iri+0.074
ulp+0.073
gh+0.073
 mir+0.073
Neg logit contributions
 All-0.052
Let-0.049
 who-0.048
 Stone-0.045
 ties-0.045
Token Tom
Token ID4186
Feature activation-9.53e-02

Pos logit contributions
 awful+0.115
iri+0.112
 mir+0.111
ulp+0.111
arter+0.111
Neg logit contributions
 All-0.079
Let-0.076
 who-0.072
 Stone-0.072
 ties-0.070
Token€
Token ID26391
Feature activation-1.65e-01

Pos logit contributions
 awful+0.049
iri+0.048
gh+0.047
arter+0.047
ulp+0.047
Neg logit contributions
 All-0.035
Let-0.033
 who-0.033
 ties-0.031
 Stone-0.031
Token�
Token ID129
Feature activation-1.07e-01

Pos logit contributions
gh+0.311
 awful+0.310
iri+0.306
arter+0.301
ulp+0.300
Neg logit contributions
 All-0.211
Let-0.204
 who-0.198
 ties-0.191
 Stone-0.190
Token�
Token ID241
Feature activation-5.22e-02

Pos logit contributions
ulp+0.192
iri+0.181
 mir+0.180
arter+0.180
ww+0.177
Neg logit contributions
 All-0.174
 ties-0.150
 Riding-0.150
 The-0.150
 who-0.150
TokenGr
Token ID8642
Feature activation-1.03e-02

Pos logit contributions
 awful+0.122
iri+0.119
ulp+0.119
gh+0.118
arter+0.118
Neg logit contributions
 All-0.048
Let-0.044
 who-0.043
 ties-0.039
 Stone-0.039
Tokenass
Token ID562
Feature activation-7.15e-02

Pos logit contributions
 awful+0.017
gh+0.016
iri+0.016
 mir+0.016
arter+0.016
Neg logit contributions
 All-0.017
 who-0.016
Let-0.016
 Stone-0.015
 ties-0.015
Tokenho
Token ID8873
Feature activation-3.25e-01

Pos logit contributions
 awful+0.114
ulp+0.113
 mir+0.111
iri+0.110
arter+0.109
Neg logit contributions
 All-0.118
 who-0.117
Let-0.110
 Stone-0.106
es-0.105
Tokenpper
Token ID2848
Feature activation-6.25e-02

Pos logit contributions
 awful+0.335
iri+0.314
ulp+0.313
arter+0.311
 mir+0.310
Neg logit contributions
 All-0.728
Let-0.704
 who-0.700
 Stone-0.674
 ties-0.673
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
 awful+0.077
gh+0.074
arter+0.073
iri+0.073
 mir+0.072
Neg logit contributions
 All-0.124
Let-0.121
 who-0.116
 ties-0.116
 Stone-0.114
Token let
Token ID1309
Feature activation+1.03e-01

Pos logit contributions
 awful+0.020
 mir+0.019
iri+0.019
arter+0.019
gh+0.019
Neg logit contributions
 All-0.014
Let-0.013
 Stone-0.012
 who-0.012
 ties-0.012
Tokenâ
Token ID22940
Feature activation-1.63e-01

Pos logit contributions
 All+0.118
Let+0.107
 who+0.107
 ties+0.098
 Stone+0.097
Neg logit contributions
 awful-0.220
ulp-0.217
arter-0.217
iri-0.216
 mir-0.214
Token€
Token ID26391
Feature activation-1.66e-01

Pos logit contributions
 awful+0.202
iri+0.195
ulp+0.193
arter+0.192
 mir+0.192
Neg logit contributions
 All-0.336
Let-0.319
 who-0.316
 Stone-0.303
 ties-0.301
Token fan
Token ID4336
Feature activation+1.12e-01

Pos logit contributions
 All+0.085
Let+0.080
 who+0.080
 Stone+0.075
 Being+0.074
Neg logit contributions
 awful-0.119
 mir-0.113
ulp-0.113
gh-0.112
iri-0.112
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
 All+0.144
 who+0.135
Let+0.135
 ties+0.126
 Stone+0.125
Neg logit contributions
 awful-0.221
iri-0.216
ulp-0.214
gh-0.213
arter-0.213
Token like
Token ID588
Feature activation+3.03e-03

Pos logit contributions
 awful+0.031
 mir+0.030
iri+0.029
arter+0.029
ulp+0.029
Neg logit contributions
 All-0.023
Let-0.022
 Stone-0.022
 Using-0.020
 Being-0.020
Token "
Token ID366
Feature activation+1.47e-01

Pos logit contributions
 All+0.004
Let+0.003
 Stone+0.003
 who+0.003
 Using+0.003
Neg logit contributions
 awful-0.006
 mir-0.006
gh-0.006
arter-0.006
ulp-0.006
Tokenw
Token ID86
Feature activation-1.86e-01

Pos logit contributions
 All+0.223
 who+0.213
 ties+0.204
 Stone+0.202
Let+0.202
Neg logit contributions
 awful-0.239
gh-0.236
iri-0.236
 mir-0.229
ww-0.227
Tokenooo
Token ID34160
Feature activation-3.25e-01

Pos logit contributions
 awful+0.288
 mir+0.276
arter+0.275
iri+0.275
ulp+0.271
Neg logit contributions
 All-0.323
 who-0.306
Let-0.303
 ties-0.292
 Stone-0.290
Token"
Token ID1
Feature activation+1.02e-01

Pos logit contributions
 awful+0.494
arter+0.481
ulp+0.470
 mir+0.468
iri+0.462
Neg logit contributions
 All-0.591
 who-0.560
Let-0.543
 ties-0.524
 Stone-0.523
Token and
Token ID290
Feature activation-1.19e-02

Pos logit contributions
 All+0.119
Let+0.112
 who+0.110
 Stone+0.102
 ties+0.102
Neg logit contributions
 awful-0.213
iri-0.208
ulp-0.207
arter-0.207
 mir-0.206
Token "
Token ID366
Feature activation+1.46e-01

Pos logit contributions
 mir+0.025
 awful+0.025
gh+0.025
oor+0.024
arter+0.024
Neg logit contributions
 All-0.011
Let-0.010
pen-0.010
 Stone-0.010
es-0.010
Tokenbr
Token ID1671
Feature activation-6.09e-02

Pos logit contributions
 All+0.222
Let+0.211
 who+0.209
 ties+0.198
 Stone+0.198
Neg logit contributions
 awful-0.252
iri-0.245
ulp-0.242
gh-0.242
arter-0.242
Tokenrr
Token ID21062
Feature activation-7.22e-02

Pos logit contributions
 awful+0.104
iri+0.101
ulp+0.100
arter+0.100
gh+0.100
Neg logit contributions
 All-0.095
Let-0.090
 who-0.089
 Stone-0.084
 ties-0.084
Token introduced
Token ID5495
Feature activation+2.52e-02

Pos logit contributions
 All+0.049
 who+0.047
Let+0.046
 ties+0.043
 Stone+0.043
Neg logit contributions
 awful-0.069
iri-0.067
ulp-0.066
 mir-0.066
arter-0.066
Token to
Token ID284
Feature activation-6.41e-02

Pos logit contributions
 All+0.033
 who+0.031
Let+0.031
 ties+0.028
 Stone+0.028
Neg logit contributions
 awful-0.050
arter-0.049
iri-0.049
ulp-0.048
 mir-0.048
Token a
Token ID257
Feature activation+2.81e-02

Pos logit contributions
 awful+0.127
gh+0.120
 mir+0.119
arter+0.119
oto+0.118
Neg logit contributions
Let-0.079
 All-0.074
 Using-0.069
 Did-0.069
 Stone-0.069
Token smooth
Token ID7209
Feature activation-6.98e-02

Pos logit contributions
 All+0.044
 who+0.043
Let+0.042
 Stone+0.040
 ties+0.039
Neg logit contributions
 awful-0.048
ulp-0.048
iri-0.047
urg-0.047
arter-0.046
Token grass
Token ID8701
Feature activation-3.69e-03

Pos logit contributions
 awful+0.133
iri+0.129
gh+0.128
ulp+0.128
arter+0.127
Neg logit contributions
 All-0.095
Let-0.089
 who-0.089
 Stone-0.084
 ties-0.084
Tokenho
Token ID8873
Feature activation-3.25e-01

Pos logit contributions
 awful+0.007
iri+0.007
arter+0.007
ulp+0.007
 mir+0.007
Neg logit contributions
 All-0.005
Let-0.005
 who-0.005
 ties-0.005
 Stone-0.005
Tokenpper
Token ID2848
Feature activation-6.12e-02

Pos logit contributions
 awful+0.448
 mir+0.413
arter+0.403
ulp+0.403
urg+0.377
Neg logit contributions
 who-0.631
 All-0.618
es-0.590
 Stone-0.589
 ties-0.584
Token called
Token ID1444
Feature activation+2.42e-02

Pos logit contributions
 mir+0.092
 awful+0.092
iri+0.090
arter+0.090
oto+0.089
Neg logit contributions
 All-0.095
Let-0.095
Find-0.089
 Using-0.087
Being-0.087
Token Jim
Token ID5395
Feature activation-1.55e-01

Pos logit contributions
Let+0.031
 All+0.030
 who+0.029
 ties+0.028
Being+0.028
Neg logit contributions
 awful-0.045
gh-0.044
arter-0.044
iri-0.044
ulp-0.043
Token.
Token ID13
Feature activation-7.08e-03

Pos logit contributions
 awful+0.266
iri+0.266
gh+0.262
 mir+0.260
arter+0.257
Neg logit contributions
 All-0.229
Let-0.218
 ties-0.204
 Stone-0.199
 who-0.198
Token Ted
Token ID11396
Feature activation-3.75e-02

Pos logit contributions
ulp+0.014
oto+0.014
 awful+0.013
ife+0.013
arter+0.013
Neg logit contributions
 who-0.008
 ties-0.008
 kit-0.007
Let-0.007
es-0.007
Token."
Token ID526
Feature activation+1.30e-02

Pos logit contributions
 awful+0.215
iri+0.213
ulp+0.209
arter+0.207
gh+0.207
Neg logit contributions
 All-0.230
 who-0.217
Let-0.216
 ties-0.205
 Stone-0.205
Token
Token ID198
Feature activation+2.18e-02

Pos logit contributions
 draws+0.013
 puzzles+0.013
 included+0.013
 ties+0.012
 drew+0.012
Neg logit contributions
 awful-0.025
iri-0.025
oor-0.025
ulp-0.024
mmm-0.024
Token
Token ID198
Feature activation+2.21e-02

Pos logit contributions
 All+0.017
 draws+0.017
 ties+0.017
 Stone+0.017
Let+0.017
Neg logit contributions
iri-0.049
gh-0.049
 awful-0.049
oto-0.048
 mir-0.047
TokenThe
Token ID464
Feature activation+3.63e-02

Pos logit contributions
 All+0.028
 draws+0.026
 Stone+0.025
 ties+0.025
 Did+0.025
Neg logit contributions
 awful-0.042
iri-0.041
gh-0.041
 mir-0.041
ww-0.040
Token grass
Token ID8701
Feature activation-2.73e-03

Pos logit contributions
 All+0.060
Let+0.057
 who+0.056
 ties+0.054
 Stone+0.053
Neg logit contributions
 awful-0.060
iri-0.057
 mir-0.056
arter-0.056
ulp-0.055
Tokenho
Token ID8873
Feature activation-3.24e-01

Pos logit contributions
 awful+0.003
iri+0.003
ulp+0.003
arter+0.003
gh+0.003
Neg logit contributions
 All-0.006
Let-0.005
 who-0.005
 ties-0.005
 Stone-0.005
Tokenpper
Token ID2848
Feature activation-6.23e-02

Pos logit contributions
gh+0.258
iri+0.238
 awful+0.228
oto+0.212
igi+0.211
Neg logit contributions
 All-0.825
Let-0.782
 who-0.772
 Being-0.758
 ties-0.748
Token sighed
Token ID21893
Feature activation-3.93e-02

Pos logit contributions
 awful+0.128
gh+0.117
 mir+0.115
iri+0.115
urg+0.113
Neg logit contributions
 All-0.069
Let-0.065
 Being-0.065
 Did-0.064
Make-0.063
Token and
Token ID290
Feature activation-7.25e-03

Pos logit contributions
 awful+0.085
iri+0.085
arter+0.084
 mir+0.084
ulp+0.083
Neg logit contributions
 All-0.043
 who-0.040
Let-0.039
 Stone-0.035
 ties-0.035
Token decided
Token ID3066
Feature activation-6.14e-02

Pos logit contributions
 awful+0.015
iri+0.014
 mir+0.014
ulp+0.014
gh+0.014
Neg logit contributions
 All-0.009
Let-0.008
 who-0.008
 Stone-0.008
 ties-0.008
Token to
Token ID284
Feature activation-6.21e-02

Pos logit contributions
arter+0.121
iri+0.121
ulp+0.120
 awful+0.118
 mir+0.116
Neg logit contributions
 All-0.083
 who-0.080
Let-0.078
 ties-0.071
 Being-0.070
Token and
Token ID290
Feature activation-9.97e-03

Pos logit contributions
 awful+0.075
 mir+0.074
gh+0.073
iri+0.073
ulp+0.072
Neg logit contributions
 All-0.055
 Stone-0.049
 Being-0.049
Let-0.048
 ties-0.048
Token picked
Token ID6497
Feature activation+3.81e-02

Pos logit contributions
 awful+0.017
alt+0.015
 mir+0.015
 erupt+0.015
ulp+0.015
Neg logit contributions
 All-0.014
Being-0.012
 Being-0.012
 Stone-0.012
 acts-0.012
Token up
Token ID510
Feature activation+5.65e-03

Pos logit contributions
 All+0.058
Let+0.055
 Stone+0.055
 who+0.055
 Being+0.055
Neg logit contributions
 awful-0.063
iri-0.058
ulp-0.057
oto-0.057
gh-0.055
Token a
Token ID257
Feature activation+2.99e-02

Pos logit contributions
 draws+0.006
Let+0.006
 Did+0.006
 All+0.006
 Stone+0.006
Neg logit contributions
 awful-0.012
iri-0.011
ulp-0.011
oto-0.011
 mir-0.011
Token grass
Token ID8701
Feature activation-3.30e-03

Pos logit contributions
 All+0.047
Let+0.044
 who+0.043
 Being+0.041
 Stone+0.041
Neg logit contributions
 awful-0.051
gh-0.049
iri-0.049
ulp-0.049
 mir-0.049
Tokenho
Token ID8873
Feature activation-3.24e-01

Pos logit contributions
 awful+0.006
iri+0.006
ulp+0.006
 mir+0.006
gh+0.006
Neg logit contributions
 All-0.004
 who-0.004
Let-0.004
 Stone-0.004
 ties-0.004
Tokenpper
Token ID2848
Feature activation-6.07e-02

Pos logit contributions
 awful+0.445
 mir+0.415
arter+0.404
ulp+0.398
iri+0.377
Neg logit contributions
 who-0.634
 All-0.621
es-0.600
Let-0.591
 Stone-0.590
Token!
Token ID0
Feature activation-9.48e-02

Pos logit contributions
 awful+0.109
gh+0.108
ulp+0.107
iri+0.105
arter+0.104
Neg logit contributions
 All-0.083
Let-0.079
 Being-0.074
Being-0.074
 Did-0.073
Token Both
Token ID5747
Feature activation+5.84e-02

Pos logit contributions
 awful+0.163
ulp+0.162
iri+0.160
oor+0.159
 mir+0.157
Neg logit contributions
 who-0.125
 ties-0.125
es-0.119
 draws-0.119
 Stone-0.118
Token friends
Token ID2460
Feature activation+1.69e-01

Pos logit contributions
 All+0.066
Let+0.063
 who+0.062
 ties+0.058
 Stone+0.058
Neg logit contributions
 awful-0.123
iri-0.119
ulp-0.118
 mir-0.118
arter-0.117
Token were
Token ID547
Feature activation+4.68e-02

Pos logit contributions
 All+0.202
Let+0.197
 Stone+0.194
 ties+0.192
Find+0.191
Neg logit contributions
 awful-0.312
ulp-0.304
oor-0.295
urg-0.294
alt-0.294
Token and
Token ID290
Feature activation-9.58e-03

Pos logit contributions
 All+0.075
Let+0.070
 who+0.069
 ties+0.063
 Stone+0.063
Neg logit contributions
 awful-0.163
iri-0.159
ulp-0.158
arter-0.158
gh-0.158
Token saw
Token ID2497
Feature activation+2.18e-02

Pos logit contributions
 awful+0.021
 mir+0.020
gh+0.020
iri+0.020
ulp+0.020
Neg logit contributions
 All-0.010
Let-0.009
 Stone-0.009
 ties-0.009
 who-0.008
Token a
Token ID257
Feature activation+2.99e-02

Pos logit contributions
Let+0.027
 All+0.026
 Stone+0.025
 draws+0.025
 ties+0.025
Neg logit contributions
 awful-0.045
 mir-0.039
oto-0.039
arter-0.039
iri-0.039
Token huge
Token ID3236
Feature activation-1.31e-01

Pos logit contributions
 All+0.046
Let+0.043
 who+0.042
 ties+0.040
 Being+0.040
Neg logit contributions
 awful-0.052
 mir-0.050
iri-0.050
arter-0.049
gh-0.049
Token grass
Token ID8701
Feature activation-3.68e-03

Pos logit contributions
 awful+0.249
iri+0.241
arter+0.240
ulp+0.239
 mir+0.239
Neg logit contributions
 All-0.178
Let-0.169
 who-0.165
 ties-0.156
 Stone-0.155
Tokenho
Token ID8873
Feature activation-3.24e-01

Pos logit contributions
 awful+0.004
arter+0.004
iri+0.004
gh+0.004
 mir+0.004
Neg logit contributions
 All-0.008
Let-0.007
 who-0.007
 ties-0.007
 Being-0.007
Tokenpper
Token ID2848
Feature activation-5.96e-02

Pos logit contributions
 awful+0.453
 mir+0.427
 enjoyable+0.408
arter+0.407
ulp+0.407
Neg logit contributions
 who-0.615
 All-0.599
es-0.583
Let-0.576
 Stone-0.566
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 awful+0.104
gh+0.100
arter+0.098
iri+0.098
ulp+0.098
Neg logit contributions
 All-0.088
Let-0.085
 Being-0.079
Being-0.078
 Did-0.078
Token who
Token ID508
Feature activation+1.79e-01

Pos logit contributions
 awful+0.026
 mir+0.024
iri+0.023
gh+0.023
ulp+0.023
Neg logit contributions
 All-0.020
 Stone-0.019
Let-0.019
es-0.018
 who-0.018
Token had
Token ID550
Feature activation+7.48e-02

Pos logit contributions
 All+0.179
 ties+0.179
Make+0.176
 draws+0.172
 kit+0.170
Neg logit contributions
 awful-0.369
iri-0.339
 mir-0.332
gh-0.332
oto-0.331
Token been
Token ID587
Feature activation-1.30e-01

Pos logit contributions
 All+0.105
Let+0.100
 who+0.099
 ties+0.093
 Stone+0.092
Neg logit contributions
 awful-0.138
iri-0.135
arter-0.134
ulp-0.134
gh-0.133
TokenIt
Token ID1026
Feature activation-7.40e-02

Pos logit contributions
 All+0.148
 who+0.137
 ties+0.133
 Using+0.132
 Stone+0.132
Neg logit contributions
iri-0.156
oto-0.156
 awful-0.156
 mir-0.152
ww-0.150
Token is
Token ID318
Feature activation-2.18e-03

Pos logit contributions
 awful+0.172
gh+0.164
arter+0.161
 mir+0.160
iri+0.160
Neg logit contributions
 All-0.067
Let-0.067
 who-0.060
 ties-0.056
 Stone-0.055
Token not
Token ID407
Feature activation-1.29e-01

Pos logit contributions
oor+0.005
ulp+0.005
ooky+0.005
urg+0.004
unct+0.004
Neg logit contributions
 who-0.002
 All-0.002
 ties-0.002
 two-0.002
 Stone-0.002
Token a
Token ID257
Feature activation+2.78e-02

Pos logit contributions
iri+0.286
ulp+0.285
gh+0.282
 awful+0.280
 mir+0.279
Neg logit contributions
 who-0.136
 All-0.130
Let-0.117
 ties-0.115
 Stone-0.114
Token grass
Token ID8701
Feature activation-3.21e-04

Pos logit contributions
 who+0.036
Let+0.034
 All+0.033
 Stone+0.031
 ties+0.030
Neg logit contributions
urg-0.057
 awful-0.055
ulp-0.055
oor-0.055
iri-0.054
Tokenho
Token ID8873
Feature activation-3.23e-01

Pos logit contributions
 awful+0.001
ulp+0.001
gh+0.001
iri+0.001
urg+0.001
Neg logit contributions
 All-0.000
 who-0.000
Let-0.000
 Stone-0.000
 ties-0.000
Tokenpper
Token ID2848
Feature activation-5.87e-02

Pos logit contributions
 awful+0.402
 mir+0.382
ulp+0.373
arter+0.366
iri+0.357
Neg logit contributions
 who-0.648
 All-0.646
 ties-0.622
Let-0.620
es-0.617
Token or
Token ID393
Feature activation-4.92e-02

Pos logit contributions
 awful+0.110
ulp+0.108
iri+0.107
arter+0.106
gh+0.106
Neg logit contributions
 All-0.080
 who-0.079
Let-0.075
 ties-0.071
 Stone-0.071
Token a
Token ID257
Feature activation+2.80e-02

Pos logit contributions
 awful+0.101
 mir+0.097
arter+0.096
iri+0.095
ulp+0.093
Neg logit contributions
 All-0.059
Let-0.055
 Stone-0.051
 Using-0.050
 Being-0.049
Token cricket
Token ID18836
Feature activation+7.56e-02

Pos logit contributions
 who+0.036
 All+0.035
Let+0.034
 Stone+0.032
 ties+0.032
Neg logit contributions
 awful-0.055
urg-0.054
ulp-0.054
iri-0.053
oor-0.052
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
 All+0.114
 who+0.109
Let+0.109
 ties+0.102
 Stone+0.102
Neg logit contributions
 awful-0.131
iri-0.128
ulp-0.127
arter-0.126
gh-0.126
Token and
Token ID290
Feature activation-8.44e-03

Pos logit contributions
arter+0.068
 mir+0.067
iri+0.066
gh+0.066
ulp+0.066
Neg logit contributions
 who-0.022
 All-0.021
Let-0.019
 ties-0.015
 Stone-0.015
Token squ
Token ID2809
Feature activation-1.48e-01

Pos logit contributions
 awful+0.016
ulp+0.015
iri+0.015
 mir+0.015
gh+0.015
Neg logit contributions
 All-0.011
Let-0.010
 who-0.010
 Stone-0.010
 Being-0.010
Tokenished
Token ID1348
Feature activation+3.85e-02

Pos logit contributions
 awful+0.232
arter+0.231
 mir+0.231
gh+0.229
iri+0.224
Neg logit contributions
 All-0.249
Let-0.241
 who-0.237
 ties-0.232
 Being-0.219
Token the
Token ID262
Feature activation+6.64e-02

Pos logit contributions
 All+0.044
Let+0.041
 who+0.040
 ties+0.036
 Stone+0.036
Neg logit contributions
arter-0.082
 awful-0.082
iri-0.081
 mir-0.081
ulp-0.080
Token grass
Token ID8701
Feature activation-5.51e-03

Pos logit contributions
 All+0.088
Let+0.087
 who+0.083
 ties+0.080
 Stone+0.078
Neg logit contributions
 awful-0.128
arter-0.126
iri-0.126
gh-0.125
 mir-0.125
Tokenho
Token ID8873
Feature activation-3.23e-01

Pos logit contributions
iri+0.012
 awful+0.012
 mir+0.012
arter+0.012
ulp+0.012
Neg logit contributions
 All-0.006
 who-0.006
Let-0.006
 Stone-0.005
 ties-0.005
Tokenppers
Token ID11799
Feature activation+6.74e-02

Pos logit contributions
 awful+0.400
 mir+0.383
arter+0.381
ulp+0.369
iri+0.352
Neg logit contributions
 All-0.656
 who-0.654
Let-0.645
es-0.625
 ties-0.617
Token with
Token ID351
Feature activation+1.40e-01

Pos logit contributions
 All+0.063
 who+0.060
Let+0.058
 ties+0.053
 led+0.051
Neg logit contributions
iri-0.156
 awful-0.155
arter-0.154
 mir-0.154
oto-0.151
Token her
Token ID607
Feature activation-9.78e-02

Pos logit contributions
 All+0.175
Let+0.163
 who+0.160
 ties+0.149
 Stone+0.146
Neg logit contributions
 awful-0.283
arter-0.280
gh-0.279
 mir-0.277
iri-0.277
Token fingers
Token ID9353
Feature activation+8.95e-02

Pos logit contributions
 awful+0.198
arter+0.193
 mir+0.193
iri+0.192
gh+0.189
Neg logit contributions
 All-0.120
 who-0.118
Let-0.114
 ties-0.109
 Stone-0.106
Token to
Token ID284
Feature activation-6.23e-02

Pos logit contributions
 All+0.098
 who+0.096
Let+0.089
 ties+0.084
 Stone+0.080
Neg logit contributions
 awful-0.193
 mir-0.193
iri-0.193
arter-0.191
ulp-0.186
Token.
Token ID13
Feature activation-9.26e-04

Pos logit contributions
 All+0.124
Let+0.118
 who+0.112
 Stone+0.111
 ties+0.110
Neg logit contributions
 awful-0.186
gh-0.177
ulp-0.177
 mir-0.176
iri-0.176
Token It
Token ID632
Feature activation+2.71e-02

Pos logit contributions
 awful+0.002
gh+0.002
ulp+0.002
oor+0.002
alt+0.002
Neg logit contributions
 who-0.001
es-0.001
 Stone-0.001
mun-0.001
 ties-0.001
Token was
Token ID373
Feature activation+4.45e-02

Pos logit contributions
 All+0.018
 who+0.017
Let+0.016
 Being+0.016
 ties+0.016
Neg logit contributions
 awful-0.069
gh-0.065
 mir-0.063
iri-0.062
oto-0.062
Token a
Token ID257
Feature activation+3.65e-02

Pos logit contributions
 All+0.036
Let+0.033
 who+0.032
 ties+0.029
 Stone+0.028
Neg logit contributions
 awful-0.109
iri-0.107
ulp-0.107
arter-0.107
gh-0.106
Token grass
Token ID8701
Feature activation-6.22e-04

Pos logit contributions
 All+0.055
 who+0.053
Let+0.053
 Stone+0.050
 ties+0.049
Neg logit contributions
 awful-0.063
ulp-0.063
iri-0.062
arter-0.061
urg-0.061
Tokenho
Token ID8873
Feature activation-3.22e-01

Pos logit contributions
 awful+0.001
ulp+0.001
iri+0.001
 mir+0.001
gh+0.001
Neg logit contributions
 All-0.001
 who-0.001
Let-0.001
 Stone-0.001
 ties-0.001
Tokenpper
Token ID2848
Feature activation-5.98e-02

Pos logit contributions
 awful+0.442
 mir+0.421
arter+0.406
ulp+0.402
iri+0.381
Neg logit contributions
 who-0.629
 All-0.616
Let-0.591
es-0.583
 Stone-0.583
Token!
Token ID0
Feature activation-9.05e-02

Pos logit contributions
 awful+0.089
iri+0.086
gh+0.085
ulp+0.085
arter+0.085
Neg logit contributions
 All-0.105
Let-0.101
 who-0.099
 ties-0.095
 Stone-0.095
Token Tom
Token ID4186
Feature activation-9.19e-02

Pos logit contributions
 awful+0.138
gh+0.134
oor+0.130
oto+0.128
ulp+0.125
Neg logit contributions
 who-0.140
 Stone-0.135
es-0.129
mun-0.127
 included-0.126
Token was
Token ID373
Feature activation+4.39e-02

Pos logit contributions
 awful+0.199
gh+0.194
iri+0.192
 mir+0.187
oto+0.187
Neg logit contributions
 All-0.090
Let-0.079
 Stone-0.078
 ties-0.077
 Being-0.077
Token so
Token ID523
Feature activation-1.32e-01

Pos logit contributions
 All+0.074
Let+0.071
 Stone+0.069
 ties+0.068
 draws+0.068
Neg logit contributions
 awful-0.068
gh-0.062
iri-0.062
 mir-0.062
ulp-0.061
Token"
Token ID1
Feature activation+1.11e-01

Pos logit contributions
 All+0.026
 ties+0.026
 Stone+0.025
 who+0.025
Let+0.025
Neg logit contributions
 awful-0.036
gh-0.036
 mir-0.035
iri-0.035
oto-0.034
TokenWh
Token ID1199
Feature activation-1.08e-01

Pos logit contributions
 ties+0.166
 All+0.165
 who+0.161
es+0.158
 Using+0.156
Neg logit contributions
iri-0.188
 awful-0.182
oto-0.179
 mir-0.179
gh-0.174
Tokenooo
Token ID34160
Feature activation-3.22e-01

Pos logit contributions
ulp+0.175
 awful+0.175
 mir+0.170
arter+0.163
iri+0.159
Neg logit contributions
 All-0.181
 who-0.172
 Stone-0.168
 Being-0.159
 ties-0.158
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
 awful+0.463
ulp+0.449
arter+0.431
 mir+0.424
iri+0.422
Neg logit contributions
 All-0.615
 who-0.601
 Stone-0.559
Let-0.547
 ties-0.539
Token Wh
Token ID854
Feature activation-3.51e-02

Pos logit contributions
 awful+0.020
 mir+0.019
iri+0.019
arter+0.019
gh+0.018
Neg logit contributions
 All-0.015
Let-0.015
 ties-0.014
 who-0.013
 Using-0.013
Tokenooo
Token ID34160
Feature activation-3.22e-01

Pos logit contributions
 awful+0.061
ulp+0.058
 mir+0.057
iri+0.054
arter+0.054
Neg logit contributions
 who-0.054
 All-0.054
 Stone-0.052
Let-0.049
es-0.049
Token,"
Token ID553
Feature activation+7.66e-02

Pos logit contributions
ulp+0.497
 awful+0.492
arter+0.471
 mir+0.459
iri+0.458
Neg logit contributions
 All-0.580
 who-0.577
 Stone-0.521
es-0.506
 ties-0.503
Token said
Token ID531
Feature activation-3.80e-03

Pos logit contributions
 All+0.114
Let+0.112
 Stone+0.109
pen+0.108
 Being+0.103
Neg logit contributions
 awful-0.134
iri-0.122
oto-0.120
 mir-0.120
gh-0.117
Token the
Token ID262
Feature activation+7.31e-02

Pos logit contributions
 awful+0.008
gh+0.008
 mir+0.008
iri+0.008
ulp+0.007
Neg logit contributions
Let-0.004
 All-0.004
 who-0.004
 Being-0.004
 ties-0.004
Token noise
Token ID7838
Feature activation-4.65e-02

Pos logit contributions
 All+0.108
Let+0.105
 who+0.099
 ties+0.098
es+0.096
Neg logit contributions
 awful-0.132
iri-0.123
 mir-0.123
gh-0.120
oto-0.118
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
 awful+0.092
iri+0.091
ulp+0.090
arter+0.090
 mir+0.089
Neg logit contributions
 All-0.059
 who-0.056
Let-0.055
 Stone-0.051
 ties-0.051
Token?"
Token ID1701
Feature activation+4.63e-02

Pos logit contributions
Let+0.166
 All+0.166
 Stone+0.155
 who+0.155
 Being+0.153
Neg logit contributions
 awful-0.154
gh-0.153
ulp-0.140
arter-0.138
 mir-0.137
Token
Token ID198
Feature activation+2.40e-02

Pos logit contributions
 puzzles+0.043
 Stone+0.042
 draws+0.042
 ties+0.041
es+0.040
Neg logit contributions
 awful-0.098
 mir-0.091
iri-0.090
gh-0.088
ulp-0.086
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
 All+0.019
 draws+0.018
 ties+0.018
 Stone+0.018
Let+0.018
Neg logit contributions
gh-0.055
iri-0.054
 awful-0.053
 mir-0.053
oto-0.052
TokenThe
Token ID464
Feature activation+3.85e-02

Pos logit contributions
 All+0.029
 ties+0.027
 draws+0.027
Let+0.027
 Stone+0.026
Neg logit contributions
 awful-0.048
gh-0.047
iri-0.047
 mir-0.047
ww-0.046
Token grass
Token ID8701
Feature activation-5.35e-05

Pos logit contributions
 All+0.063
Let+0.060
 who+0.059
 ties+0.057
 Stone+0.056
Neg logit contributions
 awful-0.064
iri-0.061
 mir-0.060
arter-0.060
ulp-0.059
Tokenho
Token ID8873
Feature activation-3.22e-01

Pos logit contributions
 awful+0.000
ulp+0.000
iri+0.000
arter+0.000
 mir+0.000
Neg logit contributions
 All-0.000
Let-0.000
 who-0.000
 Stone-0.000
 ties-0.000
Tokenpper
Token ID2848
Feature activation-6.00e-02

Pos logit contributions
gh+0.266
iri+0.254
 awful+0.245
oto+0.227
igi+0.223
Neg logit contributions
 All-0.803
Let-0.767
 who-0.755
 Being-0.737
 ties-0.734
Token said
Token ID531
Feature activation-6.29e-03

Pos logit contributions
 awful+0.122
iri+0.116
gh+0.115
urg+0.112
 mir+0.112
Neg logit contributions
 All-0.067
Let-0.064
 Being-0.062
 ties-0.062
Make-0.061
Token,
Token ID11
Feature activation-9.43e-03

Pos logit contributions
 awful+0.012
iri+0.012
arter+0.012
ulp+0.012
 mir+0.012
Neg logit contributions
 All-0.008
 who-0.008
Let-0.008
 Stone-0.007
 ties-0.007
Token "
Token ID366
Feature activation+1.51e-01

Pos logit contributions
 awful+0.023
gh+0.023
iri+0.022
ulp+0.022
 mir+0.022
Neg logit contributions
 All-0.007
Let-0.006
 ties-0.006
 who-0.006
 Stone-0.005
TokenI
Token ID40
Feature activation-5.79e-03

Pos logit contributions
 All+0.144
 ties+0.139
 who+0.138
es+0.135
 Stone+0.130
Neg logit contributions
 awful-0.323
iri-0.321
gh-0.319
ww-0.318
 mir-0.312
Token alarm
Token ID10436
Feature activation-8.83e-02

Pos logit contributions
 All+0.092
Let+0.089
 who+0.089
 ties+0.083
 Stone+0.083
Neg logit contributions
 awful-0.115
ulp-0.114
iri-0.114
arter-0.112
gh-0.112
Token.
Token ID13
Feature activation-9.05e-03

Pos logit contributions
iri+0.181
 awful+0.178
ulp+0.177
 mir+0.176
arter+0.175
Neg logit contributions
 All-0.110
 who-0.103
Let-0.101
 ties-0.093
 Stone-0.092
Token It
Token ID632
Feature activation+2.87e-02

Pos logit contributions
 awful+0.020
ulp+0.020
arter+0.019
 mir+0.019
urg+0.019
Neg logit contributions
Let-0.008
mun-0.008
 who-0.008
 ties-0.008
 Stone-0.008
Token goes
Token ID2925
Feature activation-7.33e-02

Pos logit contributions
 All+0.030
Let+0.028
 who+0.027
 Stone+0.027
 Being+0.025
Neg logit contributions
 awful-0.064
 mir-0.061
arter-0.060
gh-0.059
ulp-0.059
Token w
Token ID266
Feature activation-9.84e-02

Pos logit contributions
iri+0.163
ulp+0.163
arter+0.158
 awful+0.153
ife+0.153
Neg logit contributions
Let-0.086
 All-0.086
 who-0.085
 ties-0.076
 led-0.076
Tokenooo
Token ID34160
Feature activation-3.22e-01

Pos logit contributions
iri+0.098
 mir+0.094
igi+0.093
gh+0.086
 possible+0.085
Neg logit contributions
 All-0.224
 who-0.214
 ties-0.209
es-0.208
 tapes-0.201
Token w
Token ID266
Feature activation-9.77e-02

Pos logit contributions
ulp+0.862
arter+0.840
iri+0.840
 mir+0.801
 awful+0.800
Neg logit contributions
 who-0.299
 All-0.246
es-0.203
Let-0.196
 ties-0.196
Tokenooo
Token ID34160
Feature activation-3.21e-01

Pos logit contributions
iri+0.103
 mir+0.099
igi+0.099
 possible+0.091
gh+0.088
Neg logit contributions
 All-0.211
 who-0.201
es-0.201
 ties-0.196
 tapes-0.189
Token w
Token ID266
Feature activation-9.70e-02

Pos logit contributions
ulp+0.844
arter+0.819
iri+0.810
 awful+0.776
 mir+0.776
Neg logit contributions
 who-0.325
 All-0.251
es-0.224
 with-0.221
 ties-0.201
Tokenooo
Token ID34160
Feature activation-3.20e-01

Pos logit contributions
iri+0.105
 mir+0.104
igi+0.102
 possible+0.095
 unforgettable+0.092
Neg logit contributions
 All-0.200
es-0.194
 who-0.190
 ties-0.184
 tapes-0.177
Token.
Token ID13
Feature activation-1.92e-03

Pos logit contributions
ulp+0.777
arter+0.752
iri+0.739
 awful+0.717
 unforgettable+0.716
Neg logit contributions
 who-0.381
 All-0.299
 with-0.285
es-0.283
's-0.249
Token done
Token ID1760
Feature activation-1.49e-01

Pos logit contributions
iri+0.042
 awful+0.041
 mir+0.040
arter+0.039
gh+0.039
Neg logit contributions
 All-0.104
 who-0.101
Let-0.099
 ties-0.097
 Stone-0.095
Token."
Token ID526
Feature activation+1.47e-02

Pos logit contributions
 awful+0.293
iri+0.293
 mir+0.285
arter+0.284
oto+0.280
Neg logit contributions
 All-0.196
 who-0.184
Let-0.181
 ties-0.167
 Stone-0.163
Token
Token ID198
Feature activation+2.09e-02

Pos logit contributions
 ties+0.017
 included+0.016
Find+0.015
 Stone+0.015
 led+0.015
Neg logit contributions
ulp-0.027
oor-0.027
mmm-0.027
ummy-0.026
ife-0.026
Token
Token ID198
Feature activation+2.10e-02

Pos logit contributions
 All+0.026
Let+0.024
 ties+0.024
 Stone+0.024
 who+0.023
Neg logit contributions
 awful-0.040
gh-0.040
iri-0.040
oto-0.040
ulp-0.039
TokenThe
Token ID464
Feature activation+3.32e-02

Pos logit contributions
 All+0.028
 ties+0.026
 Stone+0.026
Let+0.025
 draws+0.025
Neg logit contributions
 awful-0.039
gh-0.038
iri-0.038
oto-0.037
 mir-0.037
Token boy
Token ID2933
Feature activation+1.67e-01

Pos logit contributions
 All+0.038
es+0.037
 Using+0.034
 Being+0.033
 Stone+0.033
Neg logit contributions
 awful-0.072
 storm-0.066
 mir-0.065
oto-0.064
 awesome-0.064
Token smiled
Token ID13541
Feature activation+4.27e-02

Pos logit contributions
 All+0.165
Let+0.153
 Stone+0.153
 ties+0.146
Find+0.142
Neg logit contributions
 awful-0.368
gh-0.351
urg-0.349
ulp-0.346
iri-0.345
Token and
Token ID290
Feature activation-8.99e-03

Pos logit contributions
 All+0.038
 who+0.036
Let+0.034
 Stone+0.030
 ties+0.029
Neg logit contributions
gh-0.103
arter-0.103
 mir-0.103
 awful-0.103
iri-0.103
Token nodded
Token ID14464
Feature activation+1.16e-01

Pos logit contributions
 awful+0.018
ulp+0.017
iri+0.017
 mir+0.017
urg+0.017
Neg logit contributions
 All-0.011
Let-0.010
 who-0.010
 Stone-0.010
 ties-0.010
Token his
Token ID465
Feature activation-7.26e-02

Pos logit contributions
 who+0.116
 All+0.109
Let+0.100
 led+0.084
es+0.082
Neg logit contributions
ww-0.274
arter-0.273
iri-0.273
 awful-0.271
 mir-0.271
Token head
Token ID1182
Feature activation+1.24e-01

Pos logit contributions
 awful+0.156
 mir+0.154
iri+0.153
arter+0.153
igi+0.148
Neg logit contributions
 All-0.082
Let-0.078
 Stone-0.073
 who-0.073
 ties-0.072
Token It
Token ID632
Feature activation+2.47e-02

Pos logit contributions
gh+0.110
iri+0.109
arter+0.106
 awful+0.106
 mir+0.106
Neg logit contributions
Let-0.086
 All-0.084
 Being-0.083
 Did-0.083
 who-0.081
Token was
Token ID373
Feature activation+5.19e-02

Pos logit contributions
 All+0.020
Let+0.017
 who+0.017
 Stone+0.017
 ties+0.016
Neg logit contributions
 awful-0.061
gh-0.057
 mir-0.057
ulp-0.056
arter-0.056
Token a
Token ID257
Feature activation+4.19e-02

Pos logit contributions
 All+0.072
 who+0.069
Let+0.068
 ties+0.064
 Stone+0.063
Neg logit contributions
iri-0.096
 awful-0.095
arter-0.093
ulp-0.093
gh-0.093
Token little
Token ID1310
Feature activation+6.68e-02

Pos logit contributions
 who+0.066
 All+0.065
Let+0.064
 Stone+0.061
 ties+0.060
Neg logit contributions
ulp-0.069
iri-0.069
urg-0.069
 awful-0.069
arter-0.067
Token card
Token ID2657
Feature activation+9.94e-02

Pos logit contributions
 All+0.077
Let+0.070
 who+0.066
 ties+0.063
 Stone+0.063
Neg logit contributions
 awful-0.143
 mir-0.137
arter-0.136
iri-0.135
ulp-0.134
Token with
Token ID351
Feature activation+1.53e-01

Pos logit contributions
 All+0.127
Let+0.121
 who+0.116
 Stone+0.111
 ties+0.111
Neg logit contributions
 awful-0.195
iri-0.189
gh-0.188
ulp-0.188
arter-0.187
Token a
Token ID257
Feature activation+4.02e-02

Pos logit contributions
 All+0.152
Let+0.141
 who+0.138
 ties+0.127
 Stone+0.126
Neg logit contributions
 awful-0.344
iri-0.336
ulp-0.334
gh-0.333
arter-0.333
Token picture
Token ID4286
Feature activation+5.85e-02

Pos logit contributions
 All+0.062
 who+0.060
Let+0.059
 ties+0.057
 Stone+0.055
Neg logit contributions
 awful-0.068
iri-0.067
ulp-0.066
arter-0.066
gh-0.065
Token of
Token ID286
Feature activation+7.42e-02

Pos logit contributions
 All+0.070
Let+0.065
 who+0.064
 ties+0.060
 Stone+0.060
Neg logit contributions
 awful-0.121
iri-0.117
ulp-0.116
gh-0.116
arter-0.116
Token a
Token ID257
Feature activation+3.89e-02

Pos logit contributions
 All+0.069
Let+0.066
 who+0.060
 Stone+0.059
 ties+0.057
Neg logit contributions
 awful-0.171
iri-0.165
ulp-0.163
gh-0.163
 mir-0.163
Token star
Token ID3491
Feature activation+9.97e-02

Pos logit contributions
 All+0.055
 who+0.054
Let+0.053
 ties+0.050
 Stone+0.050
Neg logit contributions
 awful-0.070
iri-0.069
ulp-0.069
arter-0.068
gh-0.068
Token her
Token ID607
Feature activation-9.63e-02

Pos logit contributions
 awful+0.131
gh+0.126
 mir+0.125
iri+0.125
ulp+0.124
Neg logit contributions
 All-0.070
Let-0.068
 Did-0.064
 Being-0.063
es-0.061
Token hand
Token ID1021
Feature activation+1.00e-01

Pos logit contributions
 awful+0.167
gh+0.163
 mir+0.159
iri+0.158
ulp+0.157
Neg logit contributions
 All-0.145
Let-0.138
 Did-0.132
 Being-0.130
 who-0.125
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 All+0.134
Let+0.128
 who+0.122
 Stone+0.117
 Being+0.117
Neg logit contributions
 awful-0.188
gh-0.184
iri-0.180
 mir-0.179
ulp-0.178
Token and
Token ID290
Feature activation-9.72e-03

Pos logit contributions
 awful+0.024
 mir+0.023
ulp+0.022
gh+0.022
oor+0.021
Neg logit contributions
 All-0.019
es-0.018
 Stone-0.018
Let-0.017
 Did-0.017
Token the
Token ID262
Feature activation+6.51e-02

Pos logit contributions
 awful+0.016
 mir+0.015
ww+0.015
gh+0.014
iri+0.014
Neg logit contributions
 All-0.014
 Being-0.013
 Stone-0.013
 Did-0.013
Let-0.012
Token two
Token ID734
Feature activation+1.57e-01

Pos logit contributions
 All+0.096
es+0.095
 Using+0.093
 Did+0.093
 Being+0.090
Neg logit contributions
 awful-0.113
 storm-0.104
oto-0.103
ww-0.100
 mir-0.100
Token of
Token ID286
Feature activation+7.30e-02

Pos logit contributions
 All+0.192
Let+0.190
 Did+0.180
 Stone+0.180
 who+0.177
Neg logit contributions
 awful-0.311
 mir-0.292
gh-0.288
iri-0.286
ulp-0.278
Token them
Token ID606
Feature activation-1.95e-01

Pos logit contributions
 All+0.151
Let+0.148
 Stone+0.144
 Did+0.143
 who+0.143
Neg logit contributions
iri-0.082
 awful-0.082
 mir-0.078
gh-0.077
oto-0.077
Token played
Token ID2826
Feature activation+5.50e-02

Pos logit contributions
 awful+0.366
gh+0.356
 mir+0.343
oor+0.340
iri+0.339
Neg logit contributions
 All-0.249
 Being-0.241
 Did-0.230
Let-0.227
 Trying-0.225
Token a
Token ID257
Feature activation+2.97e-02

Pos logit contributions
 Did+0.072
 All+0.071
 Being+0.070
Let+0.068
Being+0.066
Neg logit contributions
gh-0.102
 awful-0.100
alt-0.096
ulp-0.095
 awesome-0.094
Token noisy
Token ID31210
Feature activation-1.32e-01

Pos logit contributions
 who+0.042
 All+0.042
Let+0.040
 ties+0.039
 Stone+0.038
Neg logit contributions
arter-0.056
 awful-0.054
iri-0.054
ulp-0.054
urg-0.054
Token<|endoftext|>
Token ID50256
Feature activation-7.14e-02

Pos logit contributions
 awful+0.189
iri+0.187
mmm+0.185
ife+0.185
 mir+0.182
Neg logit contributions
 ties-0.103
 puzzles-0.091
 led-0.091
 draws-0.089
 drew-0.087
TokenOnce
Token ID7454
Feature activation-7.52e-02

Pos logit contributions
gh+0.125
oor+0.118
alt+0.118
oto+0.115
iri+0.113
Neg logit contributions
 ties-0.088
nam-0.083
 draws-0.081
Being-0.081
 who-0.081
Token there
Token ID612
Feature activation+1.51e-01

Pos logit contributions
 awful+0.146
iri+0.141
gh+0.140
 mir+0.140
ulp+0.140
Neg logit contributions
 All-0.101
 who-0.094
Let-0.093
 Stone-0.087
 ties-0.086
Token was
Token ID373
Feature activation+4.15e-02

Pos logit contributions
 ties+0.106
 All+0.102
Let+0.100
 who+0.097
 draws+0.092
Neg logit contributions
gh-0.363
 awful-0.359
 mir-0.347
iri-0.347
ulp-0.344
Token a
Token ID257
Feature activation+3.64e-02

Pos logit contributions
 All+0.026
Let+0.024
 who+0.024
 ties+0.018
 Stone+0.016
Neg logit contributions
 awful-0.111
arter-0.110
 mir-0.109
iri-0.109
gh-0.109
Token boy
Token ID2933
Feature activation+1.74e-01

Pos logit contributions
 All+0.048
Let+0.045
 who+0.045
 ties+0.042
 Stone+0.042
Neg logit contributions
 awful-0.070
iri-0.068
ulp-0.068
arter-0.068
 mir-0.068
Token named
Token ID3706
Feature activation+7.68e-02

Pos logit contributions
 All+0.185
Let+0.179
 ties+0.169
 Stone+0.169
Find+0.163
Neg logit contributions
arter-0.362
gh-0.362
iri-0.359
 awful-0.358
 mir-0.357
Token Jack
Token ID3619
Feature activation-4.12e-02

Pos logit contributions
 All+0.101
 who+0.093
Let+0.090
 Stone+0.080
 ties+0.080
Neg logit contributions
 awful-0.160
iri-0.158
 mir-0.156
ulp-0.156
gh-0.153
Token.
Token ID13
Feature activation-7.51e-03

Pos logit contributions
 awful+0.063
iri+0.063
gh+0.062
oto+0.061
arter+0.061
Neg logit contributions
 All-0.068
Let-0.064
 ties-0.063
 Stone-0.061
Find-0.061
Token He
Token ID679
Feature activation+2.51e-02

Pos logit contributions
 awful+0.014
arter+0.014
ulp+0.014
gh+0.014
iri+0.013
Neg logit contributions
 who-0.010
 All-0.009
 ties-0.009
Let-0.009
 Stone-0.009
Token liked
Token ID8288
Feature activation+1.19e-01

Pos logit contributions
 All+0.023
 Using+0.020
 ties+0.020
 Stone+0.020
 Being+0.019
Neg logit contributions
gh-0.058
 awful-0.057
 mir-0.055
iri-0.055
oto-0.054
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
 awful+0.087
iri+0.086
arter+0.086
 mir+0.086
gh+0.086
Neg logit contributions
 All-0.051
 who-0.048
Let-0.045
 led-0.040
 ties-0.040
Token his
Token ID465
Feature activation-6.77e-02

Pos logit contributions
 awful+0.028
iri+0.026
gh+0.026
ulp+0.026
arter+0.026
Neg logit contributions
 All-0.023
Let-0.022
 ties-0.022
 Stone-0.022
 Being-0.022
Token friend
Token ID1545
Feature activation+1.43e-01

Pos logit contributions
 awful+0.135
iri+0.131
ulp+0.131
arter+0.129
 mir+0.128
Neg logit contributions
 All-0.087
 who-0.082
Let-0.080
 ties-0.079
 Stone-0.077
Token asked
Token ID1965
Feature activation+8.14e-02

Pos logit contributions
 All+0.194
Let+0.186
 who+0.179
 ties+0.175
 Stone+0.173
Neg logit contributions
 awful-0.267
iri-0.260
gh-0.258
arter-0.256
 mir-0.256
Token him
Token ID683
Feature activation-1.88e-01

Pos logit contributions
 All+0.144
Let+0.137
 who+0.137
 Stone+0.130
 ties+0.129
Neg logit contributions
 awful-0.125
iri-0.119
ulp-0.119
arter-0.119
 mir-0.118
Token if
Token ID611
Feature activation+1.85e-01

Pos logit contributions
 awful+0.321
gh+0.312
oto+0.310
arter+0.309
iri+0.308
Neg logit contributions
Let-0.276
 All-0.273
 ties-0.249
 Stone-0.247
Find-0.240
Token he
Token ID339
Feature activation-3.02e-02

Pos logit contributions
 All+0.271
Let+0.257
 who+0.250
 ties+0.243
 Stone+0.242
Neg logit contributions
 awful-0.326
iri-0.316
 mir-0.316
gh-0.315
arter-0.308
Token took
Token ID1718
Feature activation+8.17e-02

Pos logit contributions
 awful+0.065
gh+0.062
 mir+0.061
iri+0.061
arter+0.059
Neg logit contributions
 All-0.032
 ties-0.029
 Stone-0.029
 Using-0.028
Let-0.028
Token the
Token ID262
Feature activation+6.35e-02

Pos logit contributions
 All+0.086
Let+0.079
 who+0.078
 ties+0.073
 Stone+0.071
Neg logit contributions
 awful-0.181
iri-0.177
arter-0.177
ulp-0.176
 mir-0.176
Token cookie
Token ID19751
Feature activation+1.35e-01

Pos logit contributions
 All+0.103
 Being+0.097
 who+0.095
Let+0.095
 Did+0.093
Neg logit contributions
 awful-0.106
gh-0.100
ulp-0.096
 ours-0.091
ww-0.091
Token.
Token ID13
Feature activation-3.48e-03

Pos logit contributions
 All+0.159
 who+0.158
Let+0.146
 ties+0.142
 Stone+0.141
Neg logit contributions
iri-0.278
 awful-0.277
 mir-0.274
arter-0.273
ulp-0.272
Token house
Token ID2156
Feature activation+9.51e-02

Pos logit contributions
 who+0.075
 All+0.073
Let+0.071
 ties+0.068
 Stone+0.065
Neg logit contributions
gh-0.105
iri-0.104
arter-0.104
 awful-0.103
ulp-0.103
Token for
Token ID329
Feature activation+1.29e-03

Pos logit contributions
 All+0.132
 who+0.129
Let+0.123
 ties+0.119
 Stone+0.114
Neg logit contributions
iri-0.179
 awful-0.176
arter-0.174
ulp-0.173
gh-0.173
Token it
Token ID340
Feature activation-7.40e-02

Pos logit contributions
 All+0.002
 who+0.001
Let+0.001
 ties+0.001
 Stone+0.001
Neg logit contributions
 awful-0.003
iri-0.003
arter-0.003
gh-0.003
ulp-0.003
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
iri+0.135
 awful+0.132
ulp+0.131
arter+0.131
 mir+0.130
Neg logit contributions
 All-0.108
 who-0.105
Let-0.101
 ties-0.097
 Stone-0.094
Token like
Token ID588
Feature activation+7.14e-03

Pos logit contributions
 awful+0.026
iri+0.025
ulp+0.025
gh+0.025
arter+0.025
Neg logit contributions
 All-0.018
Let-0.017
 who-0.017
 Stone-0.016
 ties-0.016
Token a
Token ID257
Feature activation+2.93e-02

Pos logit contributions
 All+0.008
Let+0.008
 who+0.008
 Stone+0.007
 ties+0.007
Neg logit contributions
 awful-0.015
 mir-0.014
ulp-0.014
gh-0.014
arter-0.014
Token nest
Token ID16343
Feature activation+8.86e-03

Pos logit contributions
 who+0.041
 All+0.041
Let+0.040
 ties+0.038
 Stone+0.037
Neg logit contributions
 awful-0.053
iri-0.053
ulp-0.053
arter-0.052
oto-0.052
Token."
Token ID526
Feature activation+1.20e-02

Pos logit contributions
 All+0.012
 who+0.012
Let+0.012
 ties+0.011
 Stone+0.011
Neg logit contributions
 awful-0.016
iri-0.016
ulp-0.016
arter-0.016
gh-0.016
Token Mom
Token ID11254
Feature activation-9.86e-02

Pos logit contributions
 All+0.018
 Stone+0.018
 ties+0.018
Let+0.018
 who+0.017
Neg logit contributions
 awful-0.019
ulp-0.019
iri-0.018
 mir-0.018
gh-0.017
Tokenmy
Token ID1820
Feature activation+3.01e-02

Pos logit contributions
 awful+0.200
oto+0.197
iri+0.197
gh+0.193
urg+0.191
Neg logit contributions
 All-0.117
Let-0.108
 ties-0.102
 Stone-0.099
 who-0.099
Token said
Token ID531
Feature activation-8.57e-03

Pos logit contributions
 All+0.045
Let+0.043
 ties+0.041
 Stone+0.040
 Being+0.039
Neg logit contributions
 awful-0.052
oto-0.050
gh-0.049
iri-0.049
urg-0.048
Token win
Token ID1592
Feature activation+6.94e-02

Pos logit contributions
 All+0.171
 who+0.158
Let+0.155
 ties+0.151
 Stone+0.150
Neg logit contributions
 awful-0.295
iri-0.288
 mir-0.286
gh-0.285
ulp-0.281
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
 All+0.104
Let+0.103
 Stone+0.096
 who+0.095
 ties+0.094
Neg logit contributions
 awful-0.118
gh-0.115
ulp-0.109
arter-0.109
oor-0.108
Token so
Token ID523
Feature activation-1.33e-01

Pos logit contributions
 awful+0.020
gh+0.019
 mir+0.018
arter+0.018
iri+0.018
Neg logit contributions
 All-0.013
Let-0.013
 Stone-0.012
 ties-0.012
 who-0.012
Token they
Token ID484
Feature activation-2.06e-02

Pos logit contributions
 awful+0.252
iri+0.243
gh+0.242
ulp+0.241
 mir+0.240
Neg logit contributions
 All-0.179
Let-0.171
 who-0.167
 Stone-0.159
 ties-0.159
Token were
Token ID547
Feature activation+5.11e-02

Pos logit contributions
 awful+0.042
gh+0.039
iri+0.039
 mir+0.039
oto+0.038
Neg logit contributions
 All-0.024
Let-0.023
 Being-0.022
 Stone-0.022
 ties-0.021
Token competitive
Token ID7606
Feature activation+5.69e-03

Pos logit contributions
 All+0.079
Let+0.076
 who+0.072
 Stone+0.072
 ties+0.070
Neg logit contributions
 awful-0.088
gh-0.084
iri-0.083
 mir-0.082
ulp-0.082
Token.
Token ID13
Feature activation+1.43e-03

Pos logit contributions
 All+0.009
Let+0.008
 Stone+0.008
 Did+0.008
 Being+0.008
Neg logit contributions
 awful-0.010
gh-0.009
 mir-0.009
arter-0.009
ulp-0.009
Token
Token ID198
Feature activation+2.97e-02

Pos logit contributions
 ties+0.001
 led+0.001
 Stone+0.001
es+0.001
 who+0.001
Neg logit contributions
 awful-0.003
iri-0.003
ife-0.003
arter-0.003
oor-0.003
Token
Token ID198
Feature activation+3.14e-02

Pos logit contributions
 All+0.020
Let+0.018
 who+0.018
 Stone+0.018
 ties+0.018
Neg logit contributions
 awful-0.073
iri-0.073
gh-0.072
ulp-0.072
 mir-0.072
TokenTom
Token ID13787
Feature activation-1.22e-01

Pos logit contributions
 All+0.042
Let+0.040
 who+0.040
 Stone+0.038
 ties+0.038
Neg logit contributions
 awful-0.058
iri-0.057
gh-0.057
 mir-0.056
ulp-0.056
Token started
Token ID2067
Feature activation+5.97e-02

Pos logit contributions
 awful+0.250
gh+0.242
oto+0.234
iri+0.234
alt+0.232
Neg logit contributions
 All-0.127
Let-0.118
 Stone-0.115
Find-0.113
Make-0.111
Token the
Token ID262
Feature activation+7.21e-02

Pos logit contributions
Let+0.018
 All+0.018
 Did+0.016
 Stone+0.016
 ties+0.016
Neg logit contributions
 awful-0.035
iri-0.034
oto-0.034
arter-0.033
ulp-0.033
Token roof
Token ID9753
Feature activation+3.73e-02

Pos logit contributions
 All+0.113
Let+0.108
 who+0.107
 Stone+0.101
 ties+0.101
Neg logit contributions
 awful-0.123
iri-0.118
 mir-0.117
ulp-0.117
gh-0.116
Token and
Token ID290
Feature activation-5.06e-03

Pos logit contributions
 All+0.039
Let+0.036
 who+0.035
 ties+0.032
 Stone+0.032
Neg logit contributions
 awful-0.083
iri-0.081
ulp-0.080
gh-0.080
arter-0.080
Token sighed
Token ID21893
Feature activation-3.04e-02

Pos logit contributions
 awful+0.010
 mir+0.009
iri+0.009
ulp+0.009
urg+0.009
Neg logit contributions
 All-0.006
 Stone-0.006
Let-0.006
 Trying-0.006
 Using-0.006
Token.
Token ID13
Feature activation-6.73e-04

Pos logit contributions
 awful+0.063
iri+0.063
 mir+0.062
arter+0.062
ulp+0.061
Neg logit contributions
 All-0.036
 who-0.034
Let-0.033
 ties-0.030
 Stone-0.030
Token He
Token ID679
Feature activation+2.85e-02

Pos logit contributions
 awful+0.001
 mir+0.001
ulp+0.001
arter+0.001
ife+0.001
Neg logit contributions
 ties-0.001
es-0.001
 who-0.001
mun-0.001
Let-0.001
Token was
Token ID373
Feature activation+4.56e-02

Pos logit contributions
 All+0.034
Let+0.031
 Stone+0.031
 ties+0.029
 Being+0.029
Neg logit contributions
 awful-0.059
 mir-0.056
iri-0.056
gh-0.054
oto-0.054
Token tired
Token ID10032
Feature activation-5.75e-02

Pos logit contributions
 All+0.074
 who+0.072
Let+0.070
 ties+0.066
 Stone+0.065
Neg logit contributions
 awful-0.074
iri-0.073
gh-0.072
arter-0.072
ulp-0.072
Token but
Token ID475
Feature activation+6.65e-02

Pos logit contributions
iri+0.148
ulp+0.143
arter+0.143
gh+0.143
igi+0.142
Neg logit contributions
 who-0.049
 All-0.047
Let-0.037
 shop-0.035
 store-0.032
Token proud
Token ID6613
Feature activation-3.07e-02

Pos logit contributions
 All+0.112
Let+0.108
 ties+0.106
 Stone+0.105
 draws+0.103
Neg logit contributions
 awful-0.106
 mir-0.097
ulp-0.095
arter-0.094
iri-0.094
Token.
Token ID13
Feature activation-1.62e-03

Pos logit contributions
 awful+0.062
ulp+0.060
iri+0.060
arter+0.059
 mir+0.059
Neg logit contributions
 All-0.038
Let-0.037
 who-0.035
 Stone-0.034
 ties-0.034
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 awful+0.033
iri+0.032
 mir+0.032
arter+0.031
ulp+0.031
Neg logit contributions
 All-0.021
 who-0.019
Let-0.019
 ties-0.018
 Stone-0.018
Token "
Token ID366
Feature activation+1.40e-01

Pos logit contributions
 awful+0.046
ulp+0.046
 mir+0.046
arter+0.045
iri+0.044
Neg logit contributions
pen-0.020
 ties-0.020
 drew-0.018
 Stone-0.018
 All-0.018
TokenWe
Token ID1135
Feature activation-2.10e-02

Pos logit contributions
 All+0.170
 ties+0.154
 Stone+0.153
 led+0.153
 Using+0.148
Neg logit contributions
 awful-0.276
gh-0.260
oor-0.253
iri-0.252
 mir-0.252
Token used
Token ID973
Feature activation+1.65e-01

Pos logit contributions
 awful+0.044
gh+0.040
arter+0.039
 mir+0.039
alt+0.039
Neg logit contributions
 All-0.024
 Being-0.023
Let-0.023
 Stone-0.022
 Using-0.021
Token them
Token ID606
Feature activation-2.03e-01

Pos logit contributions
 All+0.195
Let+0.183
 who+0.179
 Stone+0.167
 ties+0.166
Neg logit contributions
 awful-0.344
iri-0.332
ulp-0.332
gh-0.331
arter-0.330
Token all
Token ID477
Feature activation+4.54e-02

Pos logit contributions
iri+0.393
 awful+0.380
ulp+0.379
arter+0.379
 mir+0.377
Neg logit contributions
 All-0.281
 who-0.275
Let-0.260
 ties-0.252
 Stone-0.247
Token."
Token ID526
Feature activation-2.62e-03

Pos logit contributions
 All+0.070
Let+0.066
 who+0.063
 Stone+0.063
 ties+0.062
Neg logit contributions
 awful-0.079
ulp-0.074
gh-0.073
arter-0.073
 mir-0.073
Token
Token ID198
Feature activation+6.74e-03

Pos logit contributions
ulp+0.006
 awful+0.006
iri+0.006
 mir+0.006
arter+0.006
Neg logit contributions
 All-0.002
Let-0.002
 Stone-0.002
 ties-0.002
 who-0.002
Token
Token ID198
Feature activation+7.04e-03

Pos logit contributions
 All+0.003
Let+0.002
 who+0.002
 ties+0.002
 Stone+0.002
Neg logit contributions
 awful-0.019
iri-0.019
gh-0.019
arter-0.019
 mir-0.019
TokenM
Token ID44
Feature activation-1.33e-01

Pos logit contributions
 All+0.009
 who+0.009
Let+0.009
 ties+0.008
 Stone+0.008
Neg logit contributions
 awful-0.015
arter-0.014
gh-0.014
ulp-0.014
iri-0.014
Tokenia
Token ID544
Feature activation-1.44e-01

Pos logit contributions
 awful+0.174
 ours+0.162
 enjoyable+0.156
 unpleasant+0.155
 unforgettable+0.155
Neg logit contributions
 All-0.253
es-0.244
 two-0.239
 ties-0.235
 who-0.230
Token her
Token ID607
Feature activation-9.63e-02

Pos logit contributions
 All+0.214
Let+0.197
 Stone+0.196
 who+0.194
 ties+0.189
Neg logit contributions
 awful-0.187
ulp-0.174
iri-0.169
gh-0.168
oto-0.166
Token a
Token ID257
Feature activation+3.04e-02

Pos logit contributions
 awful+0.181
oto+0.179
ulp+0.172
iri+0.171
 attention+0.167
Neg logit contributions
 All-0.119
 Did-0.118
Find-0.112
Let-0.109
 Being-0.108
Token cookie
Token ID19751
Feature activation+1.36e-01

Pos logit contributions
 All+0.052
Let+0.050
 who+0.049
 ties+0.047
 Stone+0.047
Neg logit contributions
 awful-0.047
iri-0.045
arter-0.045
ulp-0.045
gh-0.045
Token.
Token ID13
Feature activation-8.16e-03

Pos logit contributions
 All+0.161
 who+0.158
Let+0.151
 ties+0.142
 Stone+0.140
Neg logit contributions
 awful-0.282
arter-0.275
iri-0.274
gh-0.272
ulp-0.271
Token Lily
Token ID20037
Feature activation+1.50e-03

Pos logit contributions
 awful+0.017
ulp+0.016
arter+0.016
 mir+0.015
urg+0.015
Neg logit contributions
 who-0.009
es-0.008
 Stone-0.008
 ties-0.008
Let-0.008
Token was
Token ID373
Feature activation+3.84e-02

Pos logit contributions
 All+0.002
 Being+0.002
 Stone+0.002
 Believe+0.002
Make+0.001
Neg logit contributions
 awful-0.003
iri-0.003
gh-0.003
oto-0.003
urg-0.003
Token so
Token ID523
Feature activation-1.38e-01

Pos logit contributions
 All+0.069
Let+0.067
 who+0.064
 Stone+0.063
 ties+0.063
Neg logit contributions
 awful-0.057
iri-0.054
ulp-0.053
gh-0.053
 mir-0.053
Token happy
Token ID3772
Feature activation-5.79e-02

Pos logit contributions
arter+0.237
gh+0.236
 awful+0.234
 mir+0.231
iri+0.230
Neg logit contributions
 All-0.210
Let-0.205
 who-0.204
 ties-0.188
 liked-0.185
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
 awful+0.113
iri+0.110
ulp+0.109
gh+0.109
arter+0.109
Neg logit contributions
 All-0.077
Let-0.072
 who-0.072
 ties-0.067
 Stone-0.067
Token wanted
Token ID2227
Feature activation+1.79e-02

Pos logit contributions
 awful+0.025
iri+0.024
ulp+0.024
 mir+0.024
gh+0.024
Neg logit contributions
 All-0.013
Let-0.012
 who-0.012
 Stone-0.011
 ties-0.011
Token to
Token ID284
Feature activation-6.42e-02

Pos logit contributions
 All+0.027
Let+0.026
 who+0.025
 Stone+0.024
 ties+0.024
Neg logit contributions
 awful-0.032
iri-0.030
gh-0.030
ulp-0.030
 mir-0.030
Token it
Token ID340
Feature activation-7.19e-02

Pos logit contributions
 All+0.093
Let+0.088
 who+0.086
 Stone+0.080
 ties+0.080
Neg logit contributions
 awful-0.161
iri-0.156
ulp-0.155
gh-0.155
arter-0.154
Token.
Token ID13
Feature activation-6.09e-03

Pos logit contributions
iri+0.160
 mir+0.153
arter+0.151
 awful+0.151
ulp+0.150
Neg logit contributions
 All-0.087
 who-0.078
Let-0.077
 ties-0.074
 Stone-0.072
Token He
Token ID679
Feature activation+2.14e-02

Pos logit contributions
 mir+0.009
ulp+0.009
oto+0.009
 awful+0.009
oor+0.009
Neg logit contributions
es-0.009
 puzzles-0.008
mun-0.008
 who-0.008
 dolls-0.007
Token held
Token ID2714
Feature activation+1.03e-01

Pos logit contributions
 All+0.028
 Being+0.027
 Did+0.025
 Stone+0.025
 Trying+0.025
Neg logit contributions
 awful-0.039
 mir-0.038
gh-0.038
urg-0.036
oto-0.035
Token on
Token ID319
Feature activation+8.18e-02

Pos logit contributions
 All+0.141
 Being+0.137
Let+0.134
Being+0.134
 who+0.133
Neg logit contributions
 awful-0.178
ulp-0.173
gh-0.172
iri-0.166
alt-0.163
Token tight
Token ID5381
Feature activation-8.28e-02

Pos logit contributions
 All+0.124
Let+0.118
 who+0.117
 ties+0.110
 Stone+0.110
Neg logit contributions
 awful-0.143
iri-0.138
ulp-0.137
gh-0.137
arter-0.137
Token and
Token ID290
Feature activation-9.65e-03

Pos logit contributions
 awful+0.168
iri+0.162
gh+0.162
ulp+0.162
arter+0.161
Neg logit contributions
 All-0.103
Let-0.097
 who-0.095
 Stone-0.089
 ties-0.089
Token tried
Token ID3088
Feature activation+9.81e-02

Pos logit contributions
 awful+0.019
gh+0.019
 mir+0.018
ulp+0.018
urg+0.017
Neg logit contributions
 All-0.012
 Being-0.011
 Stone-0.011
Let-0.011
Being-0.010
Token to
Token ID284
Feature activation-6.31e-02

Pos logit contributions
 All+0.163
Let+0.159
 who+0.150
 Stone+0.149
 ties+0.147
Neg logit contributions
gh-0.150
 awful-0.148
ulp-0.143
 mir-0.141
iri-0.140
Token pull
Token ID2834
Feature activation+7.72e-02

Pos logit contributions
 awful+0.112
gh+0.108
ulp+0.105
arter+0.104
iri+0.103
Neg logit contributions
 All-0.090
 Did-0.087
Let-0.087
 Being-0.086
es-0.083
Token.
Token ID13
Feature activation-6.20e-03

Pos logit contributions
 All+0.083
Let+0.078
 who+0.076
 Stone+0.071
 ties+0.070
Neg logit contributions
 awful-0.168
iri-0.163
gh-0.162
ulp-0.162
arter-0.161
Token the
Token ID262
Feature activation+5.95e-02

Pos logit contributions
Let+0.020
 All+0.020
 Did+0.020
 Being+0.019
es+0.019
Neg logit contributions
iri-0.036
 awful-0.036
 mir-0.036
ulp-0.035
oto-0.035
Token lake
Token ID13546
Feature activation-5.72e-03

Pos logit contributions
 All+0.061
Let+0.054
 who+0.054
 Stone+0.051
 shop+0.046
Neg logit contributions
ulp-0.140
gh-0.137
 awful-0.136
iri-0.135
oor-0.135
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 awful+0.010
iri+0.010
ulp+0.010
gh+0.010
arter+0.010
Neg logit contributions
 All-0.008
Let-0.008
 who-0.008
 ties-0.007
 Stone-0.007
Token they
Token ID484
Feature activation-2.62e-02

Pos logit contributions
 awful+0.020
 storm+0.020
 mir+0.019
 rainy+0.019
iri+0.018
Neg logit contributions
pen-0.029
 draws-0.029
 Being-0.028
 Using-0.028
 ties-0.028
Token noticed
Token ID6810
Feature activation+1.49e-02

Pos logit contributions
iri+0.053
 awful+0.053
 mir+0.053
gh+0.051
oto+0.051
Neg logit contributions
 All-0.029
 Being-0.026
 Using-0.026
Let-0.026
 ties-0.026
Token that
Token ID326
Feature activation-4.66e-02

Pos logit contributions
 All+0.015
Let+0.015
 Stone+0.014
 ties+0.014
 Using+0.014
Neg logit contributions
 awful-0.033
 mir-0.030
iri-0.030
ulp-0.030
gh-0.030
Token something
Token ID1223
Feature activation-3.38e-03

Pos logit contributions
 awful+0.093
 mir+0.089
 rainy+0.086
 storm+0.086
iri+0.085
Neg logit contributions
 Using-0.054
 All-0.054
pen-0.051
 draws-0.051
 ties-0.049
Token was
Token ID373
Feature activation+3.75e-02

Pos logit contributions
iri+0.006
arter+0.006
ulp+0.006
 awful+0.006
urg+0.006
Neg logit contributions
 All-0.005
 who-0.005
Let-0.005
 ties-0.005
 led-0.005
Token wrong
Token ID2642
Feature activation-8.06e-02

Pos logit contributions
 who+0.059
 All+0.058
Let+0.056
 ties+0.054
 led+0.053
Neg logit contributions
iri-0.063
ulp-0.061
oor-0.060
arter-0.060
urg-0.060
Token.
Token ID13
Feature activation-6.22e-03

Pos logit contributions
 awful+0.161
iri+0.157
ulp+0.156
arter+0.156
gh+0.155
Neg logit contributions
 All-0.102
 who-0.096
Let-0.096
 ties-0.088
 Stone-0.088
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
 awful+0.012
ulp+0.012
 mir+0.011
iri+0.011
alt+0.011
Neg logit contributions
 ties-0.007
es-0.006
pen-0.006
 acts-0.006
mun-0.006
Token it
Token ID340
Feature activation-6.84e-02

Pos logit contributions
 All+0.094
Let+0.086
 who+0.084
 ties+0.075
 Stone+0.074
Neg logit contributions
 awful-0.284
iri-0.278
ulp-0.276
arter-0.276
gh-0.276
Token.
Token ID13
Feature activation-1.28e-03

Pos logit contributions
 awful+0.137
iri+0.134
arter+0.133
 mir+0.132
ulp+0.132
Neg logit contributions
 All-0.086
 who-0.081
Let-0.081
 ties-0.075
 Stone-0.074
Token
Token ID220
Feature activation+3.25e-02

Pos logit contributions
 awful+0.003
iri+0.003
ulp+0.003
 mir+0.003
ife+0.002
Neg logit contributions
 ties-0.001
 Stone-0.001
 led-0.001
 who-0.001
es-0.001
Token
Token ID198
Feature activation+3.00e-02

Pos logit contributions
Let+0.024
 ties+0.024
 All+0.024
 Stone+0.023
 who+0.023
Neg logit contributions
ulp-0.074
 awful-0.073
iri-0.073
gh-0.072
oto-0.072
Token
Token ID198
Feature activation+3.16e-02

Pos logit contributions
 All+0.021
 ties+0.020
 Stone+0.019
 who+0.019
Let+0.019
Neg logit contributions
iri-0.072
 awful-0.072
gh-0.071
oto-0.071
ulp-0.070
TokenShe
Token ID3347
Feature activation-6.70e-02

Pos logit contributions
 All+0.043
 who+0.040
 ties+0.040
Let+0.039
 Stone+0.039
Neg logit contributions
 awful-0.058
iri-0.057
gh-0.057
 mir-0.056
ulp-0.055
Token waited
Token ID13488
Feature activation-6.37e-02

Pos logit contributions
 awful+0.136
 mir+0.132
iri+0.131
urg+0.130
ulp+0.130
Neg logit contributions
 All-0.080
 Stone-0.072
Let-0.070
 Being-0.069
 Trying-0.066
Token and
Token ID290
Feature activation+1.90e-03

Pos logit contributions
iri+0.126
arter+0.124
 awful+0.123
ww+0.123
 mir+0.120
Neg logit contributions
 who-0.089
 All-0.084
Let-0.078
 ties-0.075
 led-0.072
Token waited
Token ID13488
Feature activation-6.47e-02

Pos logit contributions
 All+0.003
Let+0.002
 who+0.002
 Stone+0.002
 ties+0.002
Neg logit contributions
 awful-0.004
iri-0.004
ulp-0.004
gh-0.004
 mir-0.004
Token,
Token ID11
Feature activation+3.17e-03

Pos logit contributions
iri+0.138
arter+0.135
ww+0.134
 awful+0.134
 mir+0.132
Neg logit contributions
 who-0.082
 All-0.078
Let-0.068
 ties-0.066
 led-0.064
Token but
Token ID475
Feature activation+6.90e-02

Pos logit contributions
 All+0.004
Let+0.004
 Stone+0.004
 who+0.004
 ties+0.004
Neg logit contributions
 awful-0.006
iri-0.006
ulp-0.006
 mir-0.005
gh-0.005
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 All+0.071
 who+0.067
Let+0.066
 ties+0.061
 Stone+0.061
Neg logit contributions
 awful-0.119
iri-0.118
arter-0.116
ulp-0.116
 mir-0.116
Token He
Token ID679
Feature activation+2.08e-02

Pos logit contributions
ulp+0.020
 awful+0.019
arter+0.018
 mir+0.018
urg+0.018
Neg logit contributions
mun-0.013
 who-0.013
 Stone-0.013
Let-0.013
 ties-0.013
Token said
Token ID531
Feature activation-1.52e-02

Pos logit contributions
 All+0.029
 Being+0.028
 Trying+0.026
Being+0.026
 Using+0.026
Neg logit contributions
 mir-0.037
 awful-0.036
iri-0.034
urg-0.034
gh-0.033
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
 awful+0.032
arter+0.031
iri+0.031
ulp+0.030
 mir+0.030
Neg logit contributions
 All-0.019
 who-0.018
Let-0.017
 ties-0.016
es-0.016
Token "
Token ID366
Feature activation+1.48e-01

Pos logit contributions
 mir+0.031
 awful+0.031
gh+0.031
iri+0.030
urg+0.030
Neg logit contributions
 All-0.013
 ties-0.012
pen-0.012
Let-0.012
 Stone-0.011
TokenLook
Token ID8567
Feature activation-1.49e-01

Pos logit contributions
 All+0.190
 who+0.181
 ties+0.177
es+0.171
 Stone+0.170
Neg logit contributions
iri-0.273
ww-0.270
 awful-0.270
 mir-0.267
oto-0.265
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
 mir+0.352
 awful+0.340
arter+0.332
ulp+0.330
iri+0.325
Neg logit contributions
 who-0.165
 All-0.152
es-0.133
 ties-0.128
 Stone-0.123
Token the
Token ID262
Feature activation+6.57e-02

Pos logit contributions
 awful+0.027
 mir+0.026
arter+0.025
gh+0.025
iri+0.025
Neg logit contributions
Let-0.020
 All-0.020
 Using-0.019
Being-0.018
 Being-0.018
Token dark
Token ID3223
Feature activation-1.72e-01

Pos logit contributions
 All+0.112
Let+0.106
 who+0.105
 Stone+0.101
 ties+0.099
Neg logit contributions
iri-0.105
gh-0.103
ulp-0.102
 awful-0.101
arter-0.100
Token is
Token ID318
Feature activation+1.13e-03

Pos logit contributions
iri+0.309
 awful+0.308
gh+0.303
ulp+0.298
arter+0.296
Neg logit contributions
 All-0.255
 who-0.242
Let-0.241
 Stone-0.223
 ties-0.220
Token not
Token ID407
Feature activation-1.28e-01

Pos logit contributions
 who+0.001
 All+0.001
 led+0.001
Let+0.001
 ties+0.001
Neg logit contributions
iri-0.002
ulp-0.002
oto-0.002
gh-0.002
igi-0.002
Token a
Token ID257
Feature activation+2.39e-02

Pos logit contributions
 All+0.186
 who+0.177
Let+0.175
 ties+0.164
 Stone+0.162
Neg logit contributions
 awful-0.271
iri-0.267
ulp-0.264
 mir-0.263
arter-0.263
Token gun
Token ID2485
Feature activation+6.75e-02

Pos logit contributions
 All+0.037
Let+0.035
 who+0.034
 ties+0.033
 Stone+0.033
Neg logit contributions
 awful-0.041
iri-0.040
gh-0.040
ulp-0.040
arter-0.040
Token again
Token ID757
Feature activation-2.10e-02

Pos logit contributions
 All+0.119
Let+0.116
 who+0.110
 Stone+0.108
 ties+0.108
Neg logit contributions
 awful-0.100
gh-0.095
iri-0.092
 mir-0.092
arter-0.092
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
 awful+0.042
iri+0.041
ulp+0.041
arter+0.041
gh+0.041
Neg logit contributions
 All-0.026
 who-0.025
Let-0.025
 ties-0.023
 Stone-0.022
Token<|endoftext|>
Token ID50256
Feature activation-7.37e-02

Pos logit contributions
 awful+0.036
arter+0.035
 mir+0.035
ulp+0.035
iri+0.035
Neg logit contributions
 ties-0.017
pen-0.017
es-0.016
mun-0.016
 Stone-0.015
TokenAnna
Token ID31160
Feature activation-2.48e-02

Pos logit contributions
gh+0.110
ummy+0.107
ake+0.105
ulp+0.105
alt+0.104
Neg logit contributions
nam-0.098
 who-0.096
 ties-0.096
 sew-0.092
 dial-0.091
Token and
Token ID290
Feature activation-9.27e-03

Pos logit contributions
 mir+0.044
iri+0.044
oto+0.044
gh+0.043
 awful+0.043
Neg logit contributions
 All-0.032
Find-0.028
 Trying-0.028
 Using-0.028
 Being-0.027
Token Ben
Token ID3932
Feature activation-3.70e-02

Pos logit contributions
 awful+0.015
iri+0.015
ulp+0.014
oto+0.014
arter+0.014
Neg logit contributions
 All-0.014
Let-0.014
 Stone-0.014
 ties-0.013
es-0.013
Token are
Token ID389
Feature activation+4.06e-02

Pos logit contributions
 awful+0.073
iri+0.070
 mir+0.070
oto+0.069
oor+0.068
Neg logit contributions
 All-0.040
 Stone-0.037
 Trying-0.036
 Using-0.036
 ties-0.035
Token friends
Token ID2460
Feature activation+1.69e-01

Pos logit contributions
 All+0.047
Let+0.044
 who+0.042
 Stone+0.041
 ties+0.040
Neg logit contributions
 awful-0.085
iri-0.083
ulp-0.083
arter-0.082
gh-0.082
Token.
Token ID13
Feature activation-6.68e-03

Pos logit contributions
 All+0.260
Let+0.254
 Stone+0.247
 ties+0.240
Find+0.238
Neg logit contributions
 awful-0.277
iri-0.270
ulp-0.265
gh-0.262
oto-0.257
Token his
Token ID465
Feature activation-7.20e-02

Pos logit contributions
 All+0.189
Let+0.179
 who+0.176
 Stone+0.166
 ties+0.165
Neg logit contributions
 awful-0.266
iri-0.257
ulp-0.256
gh-0.256
arter-0.255
Token truck
Token ID7779
Feature activation+5.20e-02

Pos logit contributions
 awful+0.134
arter+0.129
gh+0.126
 mir+0.125
ulp+0.125
Neg logit contributions
 All-0.103
Let-0.097
 Being-0.090
 Stone-0.089
 Did-0.088
Token and
Token ID290
Feature activation-9.03e-03

Pos logit contributions
 All+0.072
Let+0.070
 who+0.065
 Being+0.065
 ties+0.064
Neg logit contributions
 awful-0.095
gh-0.092
arter-0.090
ulp-0.089
iri-0.088
Token they
Token ID484
Feature activation-2.69e-02

Pos logit contributions
 awful+0.015
gh+0.015
 mir+0.014
arter+0.014
ulp+0.014
Neg logit contributions
 All-0.013
 Stone-0.013
Let-0.012
 Being-0.012
 ties-0.012
Token had
Token ID550
Feature activation+7.42e-02

Pos logit contributions
 awful+0.058
gh+0.057
iri+0.055
 mir+0.055
ulp+0.055
Neg logit contributions
 All-0.029
Let-0.027
 Being-0.026
 who-0.025
 Stone-0.025
Token fun
Token ID1257
Feature activation-2.18e-01

Pos logit contributions
 All+0.113
Let+0.107
 who+0.106
 Stone+0.101
 ties+0.100
Neg logit contributions
 awful-0.130
iri-0.124
gh-0.124
ulp-0.124
 mir-0.123
Token together
Token ID1978
Feature activation+1.29e-02

Pos logit contributions
ulp+0.360
gh+0.356
 awful+0.341
oor+0.340
arter+0.328
Neg logit contributions
 Stone-0.339
Let-0.320
 All-0.315
 Did-0.307
Being-0.305
Token.
Token ID13
Feature activation-1.45e-03

Pos logit contributions
 All+0.019
Let+0.019
 Stone+0.018
 Did+0.018
 Being+0.018
Neg logit contributions
gh-0.022
 awful-0.022
ulp-0.022
arter-0.021
oor-0.020
Token<|endoftext|>
Token ID50256
Feature activation-7.24e-02

Pos logit contributions
 awful+0.003
ulp+0.003
arter+0.003
gh+0.003
oor+0.003
Neg logit contributions
 Stone-0.001
 ties-0.001
Let-0.001
 who-0.001
 All-0.001
TokenOnce
Token ID7454
Feature activation-8.10e-02

Pos logit contributions
gh+0.130
oto+0.120
ulp+0.119
alt+0.119
iri+0.116
Neg logit contributions
 ties-0.091
 draws-0.085
 acts-0.083
Being-0.083
Find-0.082
Token upon
Token ID2402
Feature activation+6.79e-02

Pos logit contributions
 awful+0.167
iri+0.162
 mir+0.162
gh+0.160
arter+0.159
Neg logit contributions
 All-0.101
 who-0.094
Let-0.091
 Stone-0.084
 ties-0.083
Token They
Token ID1119
Feature activation+3.28e-02

Pos logit contributions
arter+0.013
iri+0.013
ulp+0.012
 awful+0.012
 mir+0.012
Neg logit contributions
pen-0.008
 who-0.008
 Stone-0.008
 ties-0.008
 All-0.007
Token had
Token ID550
Feature activation+8.06e-02

Pos logit contributions
 Being+0.037
 All+0.037
 Trying+0.035
 Using+0.034
 Believe+0.034
Neg logit contributions
 awful-0.063
gh-0.061
 mir-0.060
alt-0.060
 rainy-0.059
Token a
Token ID257
Feature activation+3.91e-02

Pos logit contributions
 All+0.083
 Stone+0.073
Let+0.072
 who+0.071
 Being+0.070
Neg logit contributions
 awful-0.181
gh-0.175
 mir-0.173
iri-0.171
ulp-0.170
Token lot
Token ID1256
Feature activation-4.85e-02

Pos logit contributions
 All+0.054
 who+0.053
Let+0.053
 ties+0.051
 Stone+0.049
Neg logit contributions
 awful-0.072
iri-0.071
urg-0.071
ulp-0.071
arter-0.069
Token of
Token ID286
Feature activation+7.09e-02

Pos logit contributions
 awful+0.105
iri+0.101
ulp+0.101
gh+0.101
arter+0.101
Neg logit contributions
 All-0.053
Let-0.050
 who-0.049
 Stone-0.046
 ties-0.045
Token fun
Token ID1257
Feature activation-2.14e-01

Pos logit contributions
 All+0.146
Let+0.141
 who+0.139
 Stone+0.136
 Being+0.133
Neg logit contributions
 awful-0.087
ulp-0.080
gh-0.080
 mir-0.079
arter-0.077
Token making
Token ID1642
Feature activation+9.36e-02

Pos logit contributions
ulp+0.386
gh+0.383
 awful+0.372
arter+0.362
urg+0.358
Neg logit contributions
 All-0.297
Let-0.296
 Stone-0.294
 Being-0.271
Find-0.271
Token noises
Token ID26782
Feature activation-1.40e-02

Pos logit contributions
 All+0.137
Let+0.131
 who+0.128
 Stone+0.123
 ties+0.121
Neg logit contributions
 awful-0.168
ulp-0.161
iri-0.161
gh-0.160
arter-0.160
Token and
Token ID290
Feature activation-4.39e-03

Pos logit contributions
 awful+0.030
iri+0.029
ulp+0.029
arter+0.029
gh+0.029
Neg logit contributions
 All-0.016
Let-0.015
 who-0.015
 ties-0.014
 Stone-0.013
Token pretending
Token ID23662
Feature activation+8.66e-02

Pos logit contributions
 awful+0.008
arter+0.008
gh+0.008
 mir+0.008
ulp+0.007
Neg logit contributions
 All-0.006
Let-0.006
 Stone-0.006
 who-0.005
 ties-0.005
Token to
Token ID284
Feature activation-6.03e-02

Pos logit contributions
 All+0.115
 who+0.108
Let+0.106
 led+0.095
 ties+0.095
Neg logit contributions
iri-0.176
ulp-0.169
arter-0.169
 awful-0.169
 mir-0.167
Token with
Token ID351
Feature activation+1.46e-01

Pos logit contributions
 awful+0.049
gh+0.046
ulp+0.045
iri+0.045
 mir+0.045
Neg logit contributions
 All-0.035
Let-0.034
 who-0.032
 Stone-0.032
 Did-0.031
Token Ben
Token ID3932
Feature activation-3.67e-02

Pos logit contributions
 All+0.158
Let+0.151
 who+0.145
 Stone+0.137
 ties+0.136
Neg logit contributions
 awful-0.314
iri-0.306
ulp-0.304
arter-0.302
 mir-0.301
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 awful+0.063
gh+0.060
ulp+0.059
iri+0.059
arter+0.058
Neg logit contributions
 All-0.057
Let-0.055
 Stone-0.050
 Did-0.049
Find-0.049
Token Lily
Token ID20037
Feature activation+1.14e-03

Pos logit contributions
 awful+0.036
ulp+0.036
arter+0.036
 mir+0.035
iri+0.035
Neg logit contributions
 who-0.023
Let-0.023
 Stone-0.022
 ties-0.022
 All-0.021
Token joined
Token ID5399
Feature activation+6.82e-02

Pos logit contributions
 All+0.002
 Using+0.001
 Some+0.001
If+0.001
 Did+0.001
Neg logit contributions
 mir-0.002
iri-0.002
 awful-0.002
gh-0.002
oto-0.002
Token them
Token ID606
Feature activation-2.02e-01

Pos logit contributions
 All+0.089
Let+0.085
 who+0.081
 Did+0.080
 Stone+0.079
Neg logit contributions
iri-0.128
ulp-0.127
 awful-0.124
oto-0.123
gh-0.122
Token with
Token ID351
Feature activation+1.45e-01

Pos logit contributions
 awful+0.387
ulp+0.380
arter+0.376
 mir+0.375
iri+0.375
Neg logit contributions
 All-0.255
Let-0.249
 Being-0.222
 Stone-0.219
 Did-0.217
Token her
Token ID607
Feature activation-9.87e-02

Pos logit contributions
 All+0.181
 who+0.168
Let+0.168
 ties+0.156
 Stone+0.154
Neg logit contributions
 awful-0.293
gh-0.286
arter-0.285
iri-0.284
ulp-0.284
Token doll
Token ID3654
Feature activation+1.78e-01

Pos logit contributions
 awful+0.192
iri+0.186
ulp+0.185
arter+0.185
gh+0.185
Neg logit contributions
 All-0.130
Let-0.122
 who-0.115
 Being-0.111
 Stone-0.110
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
 All+0.240
Let+0.233
 Being+0.212
 Stone+0.205
Find+0.204
Neg logit contributions
 awful-0.331
iri-0.323
gh-0.318
 mir-0.317
arter-0.316
Token They
Token ID1119
Feature activation+1.93e-02

Pos logit contributions
ulp+0.042
iri+0.041
arter+0.041
 awful+0.040
oto+0.040
Neg logit contributions
 who-0.019
Let-0.019
 ties-0.018
 Stone-0.018
 included-0.018
Token laughed
Token ID13818
Feature activation-1.52e-02

Pos logit contributions
 All+0.094
Let+0.089
 who+0.087
 Stone+0.082
 ties+0.082
Neg logit contributions
 awful-0.157
iri-0.152
ulp-0.151
gh-0.151
 mir-0.151
Token and
Token ID290
Feature activation-1.77e-02

Pos logit contributions
 awful+0.036
iri+0.035
arter+0.035
 mir+0.035
gh+0.035
Neg logit contributions
 All-0.014
 who-0.013
Let-0.013
 ties-0.011
 Stone-0.011
Token had
Token ID550
Feature activation+7.34e-02

Pos logit contributions
 awful+0.035
iri+0.035
 mir+0.034
ulp+0.033
oto+0.033
Neg logit contributions
 All-0.021
Let-0.019
 who-0.019
 Stone-0.018
 Being-0.018
Token so
Token ID523
Feature activation-1.35e-01

Pos logit contributions
 All+0.126
Let+0.120
 who+0.119
 ties+0.114
 led+0.110
Neg logit contributions
arter-0.119
iri-0.117
ulp-0.114
 awful-0.113
gh-0.112
Token much
Token ID881
Feature activation-1.89e-01

Pos logit contributions
 awful+0.139
iri+0.130
ulp+0.129
gh+0.128
arter+0.128
Neg logit contributions
 All-0.303
Let-0.294
 who-0.291
 Stone-0.282
 ties-0.281
Token fun
Token ID1257
Feature activation-2.15e-01

Pos logit contributions
iri+0.258
urg+0.246
igi+0.242
arter+0.242
gh+0.241
Neg logit contributions
 All-0.385
 who-0.359
Let-0.350
 ties-0.348
 led-0.335
Token.
Token ID13
Feature activation-9.89e-03

Pos logit contributions
 awful+0.396
ulp+0.390
gh+0.388
arter+0.384
 mir+0.378
Neg logit contributions
 All-0.295
Let-0.286
 Stone-0.273
 who-0.266
 ties-0.261
Token When
Token ID1649
Feature activation+9.86e-04

Pos logit contributions
 awful+0.019
ulp+0.018
arter+0.018
oor+0.018
alt+0.017
Neg logit contributions
pen-0.011
 who-0.011
 ties-0.011
 Stone-0.011
es-0.011
Token it
Token ID340
Feature activation-6.90e-02

Pos logit contributions
 All+0.001
Let+0.001
 who+0.001
 ties+0.001
 led+0.001
Neg logit contributions
 awful-0.002
iri-0.002
arter-0.002
oto-0.002
gh-0.002
Token was
Token ID373
Feature activation+4.22e-02

Pos logit contributions
arter+0.206
 awful+0.206
iri+0.206
ulp+0.204
 mir+0.201
Neg logit contributions
 All-0.022
Let-0.015
 who-0.012
 led-0.012
 was-0.007
Token time
Token ID640
Feature activation-1.04e-01

Pos logit contributions
 All+0.055
 two+0.048
 led+0.047
 included+0.046
 helped+0.046
Neg logit contributions
 mir-0.085
iri-0.083
vern-0.081
ram-0.080
rim-0.080
Token needles
Token ID25209
Feature activation+5.87e-02

Pos logit contributions
 All+0.098
Let+0.091
 Being+0.087
 Stone+0.087
 Did+0.087
Neg logit contributions
 awful-0.090
ulp-0.084
 mir-0.082
urg-0.077
oto-0.077
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
 All+0.066
Let+0.061
 who+0.058
 Stone+0.055
 ties+0.055
Neg logit contributions
 awful-0.125
ulp-0.120
iri-0.120
gh-0.120
arter-0.119
Token threads
Token ID14390
Feature activation+2.23e-01

Pos logit contributions
 awful+0.043
 mir+0.041
 storm+0.040
 attention+0.039
 ours+0.039
Neg logit contributions
 All-0.031
 Stone-0.029
pen-0.029
 Believe-0.027
 Trying-0.027
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
 All+0.286
Let+0.269
 who+0.251
 Stone+0.245
 Being+0.240
Neg logit contributions
 awful-0.438
ulp-0.419
 mir-0.417
gh-0.417
iri-0.414
Token helps
Token ID5419
Feature activation+1.17e-01

Pos logit contributions
 awful+0.044
 mir+0.041
 storm+0.040
 attention+0.039
 unpleasant+0.039
Neg logit contributions
 All-0.031
 Stone-0.029
pen-0.029
 Using-0.028
 Trying-0.028
Token them
Token ID606
Feature activation-2.03e-01

Pos logit contributions
 All+0.172
 who+0.165
Let+0.164
 ties+0.153
 Stone+0.152
Neg logit contributions
 awful-0.212
iri-0.208
arter-0.207
ulp-0.206
 mir-0.205
Token sew
Token ID34249
Feature activation+2.48e-01

Pos logit contributions
 awful+0.316
 happen+0.306
 erupt+0.304
gh+0.303
ulp+0.302
Neg logit contributions
Let-0.289
 Did-0.283
 All-0.282
 Being-0.280
mun-0.273
Token the
Token ID262
Feature activation+5.36e-02

Pos logit contributions
 All+0.342
Let+0.318
 who+0.308
 Being+0.297
 Stone+0.294
Neg logit contributions
 awful-0.469
gh-0.449
 mir-0.439
iri-0.439
ulp-0.438
Token hole
Token ID7604
Feature activation+7.44e-02

Pos logit contributions
 All+0.099
 who+0.092
Let+0.092
 Stone+0.088
 Being+0.088
Neg logit contributions
 awful-0.078
iri-0.074
ulp-0.073
 mir-0.073
gh-0.072
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
 All+0.083
 who+0.082
Let+0.079
 ties+0.073
 Stone+0.071
Neg logit contributions
iri-0.159
 awful-0.156
 mir-0.155
gh-0.154
arter-0.154
Token The
Token ID383
Feature activation+9.71e-02

Pos logit contributions
 awful+0.042
ulp+0.041
arter+0.040
 mir+0.040
iri+0.039
Neg logit contributions
 Stone-0.018
 who-0.018
 All-0.017
Let-0.017
 ties-0.017