TOP ACTIVATIONS
MAX = 0.266

We have different likes and disl ikes . We can be
corn field . The st alks looked happy to see him
She gave Joe some ch alks and showed him how to
both picked up their ch alks and started to draw on
he put all the ch alks back in the box and
the sea of yellow st alks . His Mama smiled and
carefully pulled at the st alks of cel ery until they
painted the leaves and st alks . Then
walked through the tall st alks of corn and discovered a
to draw with her ch alks . But then , she
and dirt . The st alks were hidden and the corn
he saw the long st alks growing there . All different
see were the wheat st alks sway ing in the wind
board with his colorful ch alks . One day , Tim
we might break the st alks ." L ily
She had many colorful ch alks to make her pictures .
take the box of ch alks out and find the best
run through the tall st alks . One day
was full of tall st alks . Each one was loaded
feed it with colorful ch alks . He would draw and

MIN ACTIVATIONS
MIN = -0.185

time , there was a 3 year old called Sarah .
very tired and needed to rest . She flew to a
Lily didn 't want to rest , but her mom my
time , there was a 3 year old girl named Mia
time , there was a 3 year old named Jack .
very scared and begged for mercy . He said , '
very tired and wanted to rest . Suddenly , a little
time , there was a 3 year old named Anna .
time , there lived a 3 - year - old girl
, Tim my went to rest under a big tree .
l le . She was 3 years old and loved to
a Mom my and a 3 year old girl went for
need to let the TV rest for a while ." But
<|endoftext|> Joe y was a 3 - year - old boy
for a cozy spot to rest . He found a soft
a daddy who had a 3 year old child . Each
time there was a little 3 year old who wanted to
In this bedroom lived a 3 year old child and their
loudly . A 3 - year - old toddler
<|endoftext|> One day , a 3 year old girl named Sally

INTERVAL 0.153 - 0.266
CONTAINS 0.045%

the teacher 's sho el aces . Instead , she pe
but then she remembered the chalk . She scoop
a marble . He liked mar bles . He took it
him . They each took turns trying to get the laser
day she got her past els . She always tells everyone

INTERVAL 0.040 - 0.153
CONTAINS 10.792%

that he had trusted his friends and they had trusted him
dangerous !" But Tim didn 't listen and picked up the
new toy to c uddle at night , and told him
both sat down . Sam couldn 't wait to eat his
said . " I came here to find some food .

INTERVAL -0.072 - 0.040
CONTAINS 85.918%

sorry I broke your car . I love you ."
! Once they got home , the little boy
. As he walked nearer , the bells grew in sound
smiled down at the little boy and said , " This
day , Tom went to the woods to play with his

INTERVAL -0.185 - -0.072
CONTAINS 3.245%

found a bug . Sally watched as it gob bled up
hard . One day , Lily 's mom
. Tom had a big field where he grew tasty corn
He decided that he would never forget his shoes again
you ." S ara and Tom sighed . They
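
The four INTERVAL ranges above look like an equal-width split of the range from MIN (-0.185) to MAX (0.266), and the four CONTAINS percentages add up to 100%. The sketch below checks both readings. It is an assumption about how the buckets were derived, not a confirmed description of the tool that produced this page, and the helper name `equal_width_bins` is hypothetical.

```python
def equal_width_bins(lo, hi, n):
    """Split [lo, hi] into n contiguous bins of equal width."""
    width = (hi - lo) / n
    edges = [lo + i * width for i in range(n + 1)]
    return list(zip(edges[:-1], edges[1:]))

# MIN and MAX as reported above (already rounded to three decimals).
bins = equal_width_bins(-0.185, 0.266, 4)

# Reported share of tokens per interval, highest interval first.
shares = [0.045, 10.792, 85.918, 3.245]
total = round(sum(shares), 3)

# Pair each bin (highest first) with its reported share.
for (lo, hi), share in zip(reversed(bins), shares):
    print(f"{lo:+.3f} to {hi:+.3f}: {share:.3f}%")
print(f"total: {total}%")
```

Because MIN and MAX are themselves rounded, the computed edges can differ from the listed ones in the last decimal place.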
Token We
Token ID775
Feature activation+1.43e-02

Pos logit contributions
anks+0.026
wheel+0.026
uts+0.025
vell+0.025
vals+0.025
Neg logit contributions
 year-0.029
abel-0.024
ara-0.021
 barn-0.020
 more-0.020
Token have
Token ID423
Feature activation+1.94e-02

Pos logit contributions
anks+0.030
uts+0.029
alks+0.028
 disl+0.028
vals+0.027
Neg logit contributions
 year-0.024
abel-0.018
 more-0.017
ie-0.016
 often-0.016
Token different
Token ID1180
Feature activation-1.34e-02

Pos logit contributions
anks+0.043
uts+0.042
wheel+0.041
 disl+0.040
vals+0.040
Neg logit contributions
 year-0.029
abel-0.022
ie-0.018
 more-0.018
ara-0.018
Token likes
Token ID7832
Feature activation-2.53e-02

Pos logit contributions
 year+0.026
abel+0.020
ie+0.018
ara+0.018
 barn+0.018
Neg logit contributions
anks-0.024
uts-0.024
wheel-0.023
 disl-0.023
vals-0.022
Token and
Token ID290
Feature activation-5.98e-03

Pos logit contributions
 year+0.041
abel+0.031
ie+0.027
ara+0.026
 barn+0.026
Neg logit contributions
anks-0.053
uts-0.052
wheel-0.051
 disl-0.050
vals-0.049
Token disl
Token ID19621
Feature activation+2.66e-01

Pos logit contributions
 year+0.011
 more+0.009
abel+0.008
 me+0.008
ie+0.008
Neg logit contributions
anks-0.011
uts-0.011
vell-0.010
wheel-0.010
vals-0.010
Tokenikes
Token ID7938
Feature activation+3.94e-02

Pos logit contributions
wheel+0.243
anks+0.239
 disl+0.239
alks+0.223
vals+0.222
Neg logit contributions
 year-0.715
abel-0.612
ie-0.583
ara-0.578
 more-0.552
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
anks+0.086
uts+0.084
wheel+0.082
 disl+0.079
vals+0.079
Neg logit contributions
 year-0.060
abel-0.046
ie-0.038
 barn-0.038
ara-0.037
Token We
Token ID775
Feature activation+1.51e-02

Pos logit contributions
anks+0.028
wheel+0.027
uts+0.027
vell+0.026
 disl+0.026
Neg logit contributions
 year-0.031
abel-0.025
ara-0.022
 more-0.022
 barn-0.022
Token can
Token ID460
Feature activation+4.39e-02

Pos logit contributions
anks+0.033
uts+0.032
alks+0.031
 disl+0.029
vals+0.029
Neg logit contributions
 year-0.024
abel-0.018
 more-0.017
 often-0.017
ie-0.016
Token be
Token ID307
Feature activation+2.77e-02

Pos logit contributions
anks+0.078
uts+0.076
 disl+0.074
wheel+0.073
mons+0.072
Neg logit contributions
 year-0.084
abel-0.068
ie-0.063
 more-0.063
 me-0.063
Token corn
Token ID11676
Feature activation-4.52e-02

Pos logit contributions
 disl+0.027
 supposed+0.027
vals+0.026
anks+0.025
vell+0.025
Neg logit contributions
 year-0.023
abel-0.019
heart-0.016
ie-0.016
ara-0.015
Token field
Token ID2214
Feature activation-5.93e-02

Pos logit contributions
 year+0.072
abel+0.057
ara+0.051
heart+0.047
ie+0.046
Neg logit contributions
anks-0.093
uts-0.091
wheel-0.087
vals-0.086
vell-0.083
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
 year+0.088
abel+0.067
ara+0.057
ie+0.055
 barn+0.053
Neg logit contributions
anks-0.132
uts-0.128
wheel-0.126
 disl-0.122
vals-0.120
Token The
Token ID383
Feature activation+1.34e-02

Pos logit contributions
anks+0.063
wheel+0.061
vell+0.061
uts+0.061
vals+0.060
Neg logit contributions
 year-0.034
abel-0.024
ara-0.020
 shop-0.019
 barn-0.019
Token st
Token ID336
Feature activation+1.99e-02

Pos logit contributions
vals+0.025
 disl+0.025
vell+0.024
anks+0.024
wheel+0.024
Neg logit contributions
 year-0.020
abel-0.018
ara-0.014
heart-0.014
 awoke-0.014
Tokenalks
Token ID23833
Feature activation+2.65e-01

Pos logit contributions
anks+0.029
uts+0.028
wheel+0.027
 disl+0.026
vals+0.026
Neg logit contributions
 year-0.045
abel-0.038
ie-0.034
ara-0.034
 barn-0.033
Token looked
Token ID3114
Feature activation-1.84e-02

Pos logit contributions
anks+0.528
uts+0.524
wheel+0.511
 disl+0.495
vals+0.493
Neg logit contributions
 year-0.437
abel-0.338
ara-0.295
 barn-0.292
ie-0.289
Token happy
Token ID3772
Feature activation+4.32e-02

Pos logit contributions
 year+0.034
abel+0.028
ara+0.025
 barn+0.024
ie+0.023
Neg logit contributions
anks-0.032
wheel-0.031
uts-0.031
 disl-0.030
vell-0.030
Token to
Token ID284
Feature activation-3.05e-03

Pos logit contributions
anks+0.100
uts+0.100
wheel+0.095
 disl+0.093
vals+0.091
Neg logit contributions
 year-0.059
abel-0.043
ara-0.037
ie-0.035
 barn-0.035
Token see
Token ID766
Feature activation+5.05e-02

Pos logit contributions
 year+0.006
abel+0.005
ie+0.004
 more+0.004
 awoke+0.004
Neg logit contributions
anks-0.005
uts-0.005
wheel-0.005
vals-0.005
 disl-0.005
Token him
Token ID683
Feature activation+1.79e-02

Pos logit contributions
anks+0.106
uts+0.104
 disl+0.101
wheel+0.101
vell+0.098
Neg logit contributions
 year-0.080
abel-0.060
ara-0.052
ie-0.050
 barn-0.050
Token She
Token ID1375
Feature activation+6.62e-03

Pos logit contributions
anks+0.045
uts+0.044
wheel+0.043
 disl+0.042
vals+0.042
Neg logit contributions
 year-0.048
abel-0.038
 barn-0.033
ie-0.033
ara-0.033
Token gave
Token ID2921
Feature activation+3.16e-02

Pos logit contributions
anks+0.013
wheel+0.013
uts+0.013
 disl+0.012
vals+0.012
Neg logit contributions
 year-0.011
abel-0.009
ara-0.007
ie-0.007
 barn-0.007
Token Joe
Token ID5689
Feature activation+6.02e-02

Pos logit contributions
anks+0.038
vals+0.034
mons+0.034
wheel+0.034
alks+0.033
Neg logit contributions
 year-0.075
abel-0.062
 awoke-0.058
ara-0.058
 barn-0.057
Token some
Token ID617
Feature activation-7.91e-02

Pos logit contributions
anks+0.115
uts+0.113
wheel+0.112
 disl+0.110
vals+0.105
Neg logit contributions
 year-0.104
abel-0.077
ara-0.068
 barn-0.068
ie-0.067
Token ch
Token ID442
Feature activation+2.75e-02

Pos logit contributions
 year+0.141
abel+0.112
ara+0.097
ie+0.097
 awoke+0.094
Neg logit contributions
anks-0.150
wheel-0.146
uts-0.145
 disl-0.141
vals-0.138
Tokenalks
Token ID23833
Feature activation+2.65e-01

Pos logit contributions
anks+0.043
wheel+0.042
uts+0.042
 disl+0.041
vell+0.041
Neg logit contributions
 year-0.058
abel-0.048
ie-0.044
ara-0.043
 more-0.042
Token and
Token ID290
Feature activation+9.45e-03

Pos logit contributions
anks+0.659
uts+0.642
wheel+0.632
 disl+0.627
vals+0.617
Neg logit contributions
 year-0.325
abel-0.221
ie-0.176
 barn-0.168
ara-0.168
Token showed
Token ID3751
Feature activation+3.05e-02

Pos logit contributions
anks+0.018
uts+0.017
wheel+0.017
 disl+0.017
vals+0.016
Neg logit contributions
 year-0.017
abel-0.014
ie-0.012
ara-0.012
 barn-0.012
Token him
Token ID683
Feature activation+2.12e-02

Pos logit contributions
anks+0.062
uts+0.059
 disl+0.059
wheel+0.058
vell+0.056
Neg logit contributions
 year-0.050
abel-0.036
ara-0.034
 awoke-0.033
 barn-0.033
Token how
Token ID703
Feature activation+8.05e-02

Pos logit contributions
anks+0.045
uts+0.044
wheel+0.043
 disl+0.043
vals+0.041
Neg logit contributions
 year-0.034
abel-0.026
ara-0.022
ie-0.022
 barn-0.022
Token to
Token ID284
Feature activation-1.98e-03

Pos logit contributions
anks+0.188
uts+0.181
wheel+0.179
vals+0.176
 disl+0.174
Neg logit contributions
 year-0.114
abel-0.080
ie-0.068
 more-0.068
 restaurant-0.064
Token both
Token ID1111
Feature activation+3.77e-02

Pos logit contributions
anks+0.004
uts+0.004
 disl+0.004
wheel+0.003
vals+0.003
Neg logit contributions
 year-0.003
abel-0.003
ie-0.002
 more-0.002
 me-0.002
Token picked
Token ID6497
Feature activation+1.87e-02

Pos logit contributions
anks+0.072
uts+0.069
 disl+0.066
wheel+0.064
vell+0.063
Neg logit contributions
 year-0.071
abel-0.059
ie-0.053
 more-0.051
ara-0.049
Token up
Token ID510
Feature activation-3.20e-03

Pos logit contributions
uts+0.040
anks+0.040
 disl+0.039
wheel+0.039
vals+0.038
Neg logit contributions
 year-0.029
abel-0.022
 more-0.020
ie-0.019
ara-0.018
Token their
Token ID511
Feature activation+1.64e-02

Pos logit contributions
 year+0.005
abel+0.004
ie+0.003
ara+0.003
 barn+0.003
Neg logit contributions
anks-0.007
uts-0.007
wheel-0.007
 disl-0.007
vals-0.006
Token ch
Token ID442
Feature activation+2.36e-02

Pos logit contributions
uts+0.034
anks+0.032
wheel+0.031
 disl+0.029
vell+0.029
Neg logit contributions
 year-0.031
 more-0.023
abel-0.022
 me-0.022
ie-0.022
Tokenalks
Token ID23833
Feature activation+2.63e-01

Pos logit contributions
wheel+0.037
uts+0.037
anks+0.036
 disl+0.035
vell+0.034
Neg logit contributions
 year-0.051
abel-0.042
ie-0.039
ara-0.038
 me-0.036
Token and
Token ID290
Feature activation+6.70e-03

Pos logit contributions
anks+0.747
 disl+0.744
wheel+0.715
vals+0.708
uts+0.707
Neg logit contributions
 year-0.238
abel-0.128
 more-0.096
ie-0.090
 again-0.077
Token started
Token ID2067
Feature activation+1.51e-02

Pos logit contributions
anks+0.013
uts+0.013
 disl+0.013
vell+0.012
wheel+0.012
Neg logit contributions
 year-0.011
 more-0.009
abel-0.009
ie-0.009
 again-0.008
Token to
Token ID284
Feature activation-7.65e-03

Pos logit contributions
vell+0.032
anks+0.032
uts+0.031
issors+0.030
 disl+0.030
Neg logit contributions
 year-0.023
 more-0.019
 again-0.017
abel-0.017
ie-0.016
Token draw
Token ID3197
Feature activation+5.30e-02

Pos logit contributions
 year+0.016
abel+0.013
ie+0.013
 more+0.013
 me+0.012
Neg logit contributions
uts-0.014
anks-0.013
mons-0.012
vell-0.011
issors-0.011
Token on
Token ID319
Feature activation-1.78e-03

Pos logit contributions
anks+0.111
uts+0.108
wheel+0.106
 disl+0.105
vals+0.103
Neg logit contributions
 year-0.086
abel-0.066
ie-0.057
 more-0.055
ara-0.055
Token he
Token ID339
Feature activation-1.54e-02

Pos logit contributions
anks+0.219
uts+0.214
wheel+0.211
 disl+0.208
vals+0.206
Neg logit contributions
 year-0.088
abel-0.056
ie-0.041
ara-0.039
 barn-0.039
Token put
Token ID1234
Feature activation+1.34e-02

Pos logit contributions
 year+0.022
abel+0.016
ie+0.013
 more+0.013
ara+0.012
Neg logit contributions
anks-0.036
uts-0.035
 disl-0.035
wheel-0.034
vals-0.034
Token all
Token ID477
Feature activation-7.35e-02

Pos logit contributions
anks+0.030
uts+0.030
 disl+0.029
wheel+0.029
vals+0.028
Neg logit contributions
 year-0.020
abel-0.014
ie-0.013
 more-0.013
 barn-0.012
Token the
Token ID262
Feature activation+8.21e-03

Pos logit contributions
 year+0.102
abel+0.073
ie+0.062
 more+0.060
ara+0.057
Neg logit contributions
anks-0.174
uts-0.171
wheel-0.166
 disl-0.165
vals-0.162
Token ch
Token ID442
Feature activation+2.34e-02

Pos logit contributions
uts+0.018
anks+0.018
wheel+0.017
 disl+0.016
 bleed+0.016
Neg logit contributions
 year-0.014
abel-0.010
 more-0.010
ie-0.010
 me-0.009
Tokenalks
Token ID23833
Feature activation+2.63e-01

Pos logit contributions
wheel+0.022
uts+0.020
 bleed+0.020
anks+0.020
 disl+0.020
Neg logit contributions
 year-0.064
abel-0.054
ie-0.053
ara-0.051
heart-0.050
Token back
Token ID736
Feature activation-4.56e-02

Pos logit contributions
anks+0.659
 disl+0.649
issors+0.615
wheel+0.608
vals+0.606
Neg logit contributions
 year-0.336
abel-0.219
 more-0.216
ie-0.201
 again-0.194
Token in
Token ID287
Feature activation+1.25e-02

Pos logit contributions
 year+0.072
abel+0.053
ie+0.046
 more+0.045
 barn+0.045
Neg logit contributions
anks-0.098
uts-0.095
wheel-0.094
 disl-0.093
vals-0.091
Token the
Token ID262
Feature activation+9.05e-03

Pos logit contributions
anks+0.028
uts+0.028
wheel+0.027
 disl+0.027
vals+0.026
Neg logit contributions
 year-0.018
abel-0.013
ie-0.011
ara-0.011
 barn-0.011
Token box
Token ID3091
Feature activation-3.25e-02

Pos logit contributions
anks+0.019
wheel+0.018
 disl+0.018
vals+0.018
uts+0.018
Neg logit contributions
 year-0.014
abel-0.011
ie-0.010
ara-0.009
heart-0.009
Token and
Token ID290
Feature activation+2.54e-03

Pos logit contributions
 year+0.045
abel+0.033
ara+0.028
ie+0.026
 barn+0.026
Neg logit contributions
anks-0.075
uts-0.074
wheel-0.072
vals-0.069
 disl-0.069
Token the
Token ID262
Feature activation+7.66e-03

Pos logit contributions
anks+0.106
 disl+0.106
uts+0.103
wheel+0.100
vell+0.098
Neg logit contributions
 year-0.087
abel-0.067
ara-0.061
 awoke-0.057
ie-0.056
Token sea
Token ID5417
Feature activation+2.87e-02

Pos logit contributions
anks+0.014
 disl+0.014
vals+0.014
uts+0.014
vell+0.014
Neg logit contributions
 year-0.012
abel-0.010
ara-0.009
 awoke-0.009
ie-0.009
Token of
Token ID286
Feature activation-1.58e-02

Pos logit contributions
anks+0.067
uts+0.065
wheel+0.064
 disl+0.063
vals+0.062
Neg logit contributions
 year-0.040
abel-0.029
ie-0.024
ara-0.024
 barn-0.023
Token yellow
Token ID7872
Feature activation+2.31e-02

Pos logit contributions
 year+0.026
abel+0.021
ara+0.019
ie+0.017
 awoke+0.017
Neg logit contributions
 disl-0.031
anks-0.030
uts-0.030
wheel-0.029
vals-0.028
Token st
Token ID336
Feature activation+1.70e-02

Pos logit contributions
uts+0.041
anks+0.041
vals+0.039
alks+0.039
 disl+0.038
Neg logit contributions
 year-0.039
abel-0.033
unt-0.030
ara-0.029
One-0.027
Tokenalks
Token ID23833
Feature activation+2.62e-01

Pos logit contributions
wheel+0.017
anks+0.016
 disl+0.016
uts+0.015
vell+0.015
Neg logit contributions
 year-0.045
abel-0.039
 barn-0.036
ara-0.036
ie-0.036
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
anks+0.629
uts+0.607
 disl+0.602
wheel+0.600
vals+0.588
Neg logit contributions
 year-0.348
abel-0.245
ie-0.201
 more-0.195
 barn-0.192
Token His
Token ID2399
Feature activation-8.38e-03

Pos logit contributions
anks+0.024
uts+0.023
wheel+0.023
 disl+0.022
vals+0.022
Neg logit contributions
 year-0.023
abel-0.018
ie-0.016
ara-0.015
 barn-0.015
Token Mama
Token ID40084
Feature activation-4.02e-02

Pos logit contributions
 year+0.013
abel+0.011
ara+0.010
ie+0.009
One+0.009
Neg logit contributions
anks-0.016
 disl-0.015
uts-0.015
alks-0.015
vals-0.014
Token smiled
Token ID13541
Feature activation-1.23e-02

Pos logit contributions
 year+0.074
abel+0.056
 restaurant+0.050
ara+0.050
 barn+0.049
Neg logit contributions
anks-0.072
uts-0.072
wheel-0.071
 disl-0.068
vals-0.067
Token and
Token ID290
Feature activation+5.00e-03

Pos logit contributions
 year+0.021
abel+0.016
 restaurant+0.014
ara+0.014
 cafe+0.013
Neg logit contributions
uts-0.024
anks-0.023
vell-0.021
bles-0.021
wheel-0.021
Token carefully
Token ID7773
Feature activation-2.59e-02

Pos logit contributions
anks+0.012
uts+0.012
wheel+0.012
 disl+0.012
vals+0.011
Neg logit contributions
 year-0.012
abel-0.009
ara-0.008
ie-0.008
 barn-0.008
Token pulled
Token ID5954
Feature activation+4.79e-02

Pos logit contributions
 year+0.052
abel+0.042
 more+0.038
ie+0.037
 barn+0.036
Neg logit contributions
anks-0.046
uts-0.044
 disl-0.042
vell-0.041
wheel-0.040
Token at
Token ID379
Feature activation+5.88e-02

Pos logit contributions
anks+0.111
uts+0.110
 disl+0.109
vals+0.105
wheel+0.105
Neg logit contributions
 year-0.069
abel-0.050
 more-0.046
ie-0.043
 me-0.042
Token the
Token ID262
Feature activation+2.42e-02

Pos logit contributions
anks+0.126
uts+0.123
wheel+0.121
 disl+0.119
vals+0.116
Neg logit contributions
 year-0.092
abel-0.069
ara-0.058
ie-0.058
 barn-0.057
Token st
Token ID336
Feature activation+1.32e-02

Pos logit contributions
anks+0.049
uts+0.048
wheel+0.046
 disl+0.046
vals+0.045
Neg logit contributions
 year-0.042
abel-0.032
ie-0.028
 barn-0.027
ara-0.027
Tokenalks
Token ID23833
Feature activation+2.61e-01

Pos logit contributions
anks+0.019
 disl+0.019
uts+0.019
wheel+0.018
vell+0.017
Neg logit contributions
 year-0.029
abel-0.024
ara-0.022
 barn-0.022
ie-0.022
Token of
Token ID286
Feature activation-2.04e-02

Pos logit contributions
anks+0.710
 disl+0.697
uts+0.670
vals+0.670
wheel+0.665
Neg logit contributions
 year-0.268
abel-0.160
 more-0.133
ie-0.122
 again-0.117
Token cel
Token ID18725
Feature activation+8.43e-02

Pos logit contributions
 year+0.031
abel+0.023
ie+0.019
ara+0.019
 barn+0.019
Neg logit contributions
anks-0.045
uts-0.044
wheel-0.043
 disl-0.043
vals-0.042
Tokenery
Token ID1924
Feature activation+4.77e-02

Pos logit contributions
 disl+0.142
anks+0.140
uts+0.134
wheel+0.133
 supposed+0.122
Neg logit contributions
 year-0.171
abel-0.133
ie-0.126
 barn-0.122
 more-0.121
Token until
Token ID1566
Feature activation+3.98e-02

Pos logit contributions
anks+0.123
 disl+0.121
uts+0.118
wheel+0.117
vals+0.115
Neg logit contributions
 year-0.054
abel-0.034
 more-0.029
 barn-0.028
ie-0.028
Token they
Token ID484
Feature activation-4.37e-03

Pos logit contributions
anks+0.090
uts+0.087
 disl+0.085
wheel+0.084
vals+0.083
Neg logit contributions
 year-0.061
abel-0.043
 more-0.041
 again-0.038
ie-0.038
Token painted
Token ID13055
Feature activation+8.30e-02

Pos logit contributions
anks+0.003
uts+0.003
wheel+0.003
 disl+0.002
vals+0.002
Neg logit contributions
 year-0.003
abel-0.002
ie-0.002
 barn-0.002
ara-0.002
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
anks+0.178
uts+0.173
wheel+0.171
 disl+0.167
vals+0.167
Neg logit contributions
 year-0.129
abel-0.096
ara-0.081
ie-0.080
 barn-0.080
Token leaves
Token ID5667
Feature activation+6.32e-02

Pos logit contributions
anks+0.029
uts+0.028
wheel+0.027
 disl+0.027
vals+0.026
Neg logit contributions
 year-0.026
abel-0.020
 more-0.017
ie-0.017
 barn-0.017
Token and
Token ID290
Feature activation+5.59e-03

Pos logit contributions
anks+0.142
uts+0.138
 disl+0.135
wheel+0.135
vals+0.132
Neg logit contributions
 year-0.093
abel-0.068
ie-0.058
 more-0.056
 barn-0.056
Token st
Token ID336
Feature activation+1.16e-02

Pos logit contributions
anks+0.011
uts+0.011
wheel+0.010
 disl+0.010
vals+0.010
Neg logit contributions
 year-0.010
abel-0.008
ie-0.007
ara-0.007
 barn-0.007
Tokenalks
Token ID23833
Feature activation+2.61e-01

Pos logit contributions
wheel+0.014
 disl+0.014
vell+0.013
anks+0.013
uts+0.012
Neg logit contributions
 year-0.027
abel-0.024
ara-0.023
ie-0.022
 barn-0.021
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
anks+0.632
 disl+0.617
issors+0.601
uts+0.597
wheel+0.587
Neg logit contributions
 year-0.344
abel-0.242
 more-0.220
ie-0.208
 again-0.194
Token
Token ID220
Feature activation-3.42e-02

Pos logit contributions
anks+0.049
uts+0.049
vell+0.048
wheel+0.048
vals+0.047
Neg logit contributions
 year-0.030
abel-0.022
ara-0.019
 more-0.018
 me-0.018
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
 year+0.028
abel+0.016
ara+0.013
One+0.012
M+0.011
Neg logit contributions
 disl-0.094
wheel-0.093
vals-0.093
anks-0.092
vell-0.092
Token
Token ID198
Feature activation+1.96e-02

Pos logit contributions
anks+0.049
uts+0.048
wheel+0.047
 disl+0.047
vals+0.046
Neg logit contributions
 year-0.032
abel-0.023
ie-0.019
ara-0.019
 barn-0.019
TokenThen
Token ID6423
Feature activation+2.94e-02

Pos logit contributions
anks+0.042
uts+0.041
wheel+0.040
 disl+0.040
vals+0.039
Neg logit contributions
 year-0.031
abel-0.024
ie-0.020
 barn-0.020
ara-0.019
Token walked
Token ID6807
Feature activation-5.70e-02

Pos logit contributions
 year+0.004
ara+0.003
abel+0.003
 restaurant+0.003
 barn+0.003
Neg logit contributions
wheel-0.003
uts-0.003
 disl-0.003
anks-0.003
vals-0.003
Token through
Token ID832
Feature activation-2.97e-04

Pos logit contributions
 year+0.113
abel+0.098
ara+0.087
 restaurant+0.080
ie+0.080
Neg logit contributions
anks-0.089
wheel-0.088
uts-0.083
vell-0.083
bles-0.081
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
 year+0.001
ara+0.000
unt+0.000
abel+0.000
 Amen+0.000
Neg logit contributions
alks-0.000
uts-0.000
 disl-0.000
wheel-0.000
anks-0.000
Token tall
Token ID7331
Feature activation-3.72e-02

Pos logit contributions
vals+0.023
alks+0.022
anks+0.020
 disl+0.020
b+0.020
Neg logit contributions
 year-0.020
abel-0.017
ara-0.016
ie-0.014
Will-0.014
Token st
Token ID336
Feature activation+1.05e-02

Pos logit contributions
 year+0.054
abel+0.050
Will+0.042
ara+0.042
M+0.040
Neg logit contributions
vals-0.065
alks-0.064
 disl-0.060
drops-0.056
anks-0.055
Tokenalks
Token ID23833
Feature activation+2.58e-01

Pos logit contributions
anks+0.007
uts+0.006
wheel+0.006
 disl+0.005
vals+0.005
Neg logit contributions
 year-0.032
abel-0.028
ie-0.026
ara-0.026
 barn-0.026
Token of
Token ID286
Feature activation-2.32e-02

Pos logit contributions
uts+0.610
anks+0.603
wheel+0.592
vals+0.568
vell+0.560
Neg logit contributions
 year-0.332
abel-0.236
ara-0.209
 restaurant-0.186
ie-0.186
Token corn
Token ID11676
Feature activation-5.01e-02

Pos logit contributions
 year+0.042
abel+0.037
ara+0.036
ie+0.032
unt+0.031
Neg logit contributions
alks-0.034
 disl-0.033
cks-0.032
 depending-0.031
 supposed-0.031
Token and
Token ID290
Feature activation+9.97e-04

Pos logit contributions
 year+0.076
abel+0.058
ara+0.052
ie+0.047
unt+0.046
Neg logit contributions
anks-0.108
uts-0.105
vals-0.102
wheel-0.099
alks-0.097
Token discovered
Token ID5071
Feature activation+2.49e-02

Pos logit contributions
anks+0.002
wheel+0.002
uts+0.002
 disl+0.002
vals+0.002
Neg logit contributions
 year-0.002
abel-0.001
ara-0.001
ie-0.001
 barn-0.001
Token a
Token ID257
Feature activation+2.68e-02

Pos logit contributions
uts+0.042
anks+0.041
wheel+0.040
vell+0.039
 disl+0.039
Neg logit contributions
 year-0.046
ara-0.036
abel-0.033
unt-0.032
ie-0.032
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
anks+0.013
uts+0.013
wheel+0.012
 disl+0.012
vals+0.012
Neg logit contributions
 year-0.013
abel-0.010
ara-0.009
ie-0.009
 barn-0.008
Token draw
Token ID3197
Feature activation+4.94e-02

Pos logit contributions
 year+0.036
abel+0.029
ara+0.027
ie+0.026
 barn+0.025
Neg logit contributions
anks-0.026
 disl-0.026
wheel-0.026
uts-0.025
vals-0.023
Token with
Token ID351
Feature activation+2.26e-02

Pos logit contributions
anks+0.103
uts+0.100
wheel+0.100
 disl+0.094
vell+0.093
Neg logit contributions
 year-0.081
abel-0.061
ara-0.053
 barn-0.052
ie-0.052
Token her
Token ID607
Feature activation+1.99e-02

Pos logit contributions
anks+0.052
uts+0.050
wheel+0.049
 disl+0.049
vals+0.048
Neg logit contributions
 year-0.032
abel-0.024
ie-0.020
 more-0.019
 barn-0.019
Token ch
Token ID442
Feature activation+1.67e-02

Pos logit contributions
anks+0.041
wheel+0.040
uts+0.039
vals+0.038
 disl+0.038
Neg logit contributions
 year-0.035
abel-0.026
 more-0.024
ie-0.023
 me-0.023
Tokenalks
Token ID23833
Feature activation+2.56e-01

Pos logit contributions
anks+0.031
uts+0.030
wheel+0.029
 disl+0.029
vals+0.028
Neg logit contributions
 year-0.032
abel-0.025
ie-0.022
ara-0.022
 barn-0.022
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
anks+0.604
uts+0.588
wheel+0.578
 disl+0.574
vals+0.564
Neg logit contributions
 year-0.348
abel-0.248
ie-0.204
 barn-0.196
ara-0.196
Token But
Token ID887
Feature activation-2.67e-03

Pos logit contributions
anks+0.022
uts+0.021
wheel+0.021
 disl+0.021
vals+0.020
Neg logit contributions
 year-0.016
abel-0.012
ie-0.010
ara-0.010
 barn-0.010
Token then
Token ID788
Feature activation-2.47e-02

Pos logit contributions
 year+0.005
abel+0.003
ara+0.003
ie+0.003
 barn+0.003
Neg logit contributions
anks-0.005
uts-0.005
wheel-0.005
 disl-0.005
vals-0.005
Token,
Token ID11
Feature activation-5.65e-03

Pos logit contributions
 year+0.042
abel+0.030
 barn+0.027
ara+0.026
 restaurant+0.026
Neg logit contributions
anks-0.047
uts-0.047
wheel-0.047
 disl-0.046
vals-0.044
Token she
Token ID673
Feature activation-3.37e-03

Pos logit contributions
 year+0.009
abel+0.007
ie+0.006
ara+0.006
 barn+0.006
Neg logit contributions
anks-0.012
uts-0.011
wheel-0.011
 disl-0.011
vals-0.011
Token and
Token ID290
Feature activation-8.66e-03

Pos logit contributions
anks+0.103
uts+0.100
wheel+0.099
 disl+0.099
vals+0.097
Neg logit contributions
 year-0.055
abel-0.038
ie-0.032
 barn-0.030
 more-0.030
Token dirt
Token ID13647
Feature activation+6.98e-02

Pos logit contributions
 year+0.015
abel+0.012
ie+0.010
 barn+0.010
ara+0.010
Neg logit contributions
anks-0.017
uts-0.017
wheel-0.016
 disl-0.016
vals-0.016
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
anks+0.172
 disl+0.168
wheel+0.164
uts+0.164
vals+0.163
Neg logit contributions
 year-0.088
abel-0.057
 more-0.050
ie-0.050
 barn-0.048
Token The
Token ID383
Feature activation-6.26e-03

Pos logit contributions
 year+0.021
abel+0.016
ie+0.014
 barn+0.014
ara+0.014
Neg logit contributions
anks-0.025
uts-0.025
wheel-0.024
 disl-0.024
vals-0.023
Token st
Token ID336
Feature activation+1.52e-02

Pos logit contributions
 year+0.010
abel+0.008
ara+0.007
ie+0.007
 awoke+0.007
Neg logit contributions
vals-0.011
mons-0.011
anks-0.011
uts-0.011
wheel-0.011
Tokenalks
Token ID23833
Feature activation+2.56e-01

Pos logit contributions
anks+0.015
uts+0.014
wheel+0.013
 disl+0.013
vals+0.012
Neg logit contributions
 year-0.042
abel-0.036
ie-0.033
ara-0.033
 barn-0.033
Token were
Token ID547
Feature activation-1.04e-02

Pos logit contributions
anks+0.527
uts+0.517
wheel+0.505
 disl+0.497
vals+0.489
Neg logit contributions
 year-0.418
abel-0.321
ie-0.274
ara-0.273
 barn-0.270
Token hidden
Token ID7104
Feature activation-1.99e-02

Pos logit contributions
 year+0.016
 more+0.011
abel+0.011
 me+0.010
 barn+0.010
Neg logit contributions
anks-0.024
 disl-0.023
uts-0.023
vals-0.023
wheel-0.022
Token and
Token ID290
Feature activation-4.41e-03

Pos logit contributions
 year+0.038
abel+0.032
ara+0.029
 awoke+0.028
unt+0.027
Neg logit contributions
anks-0.034
uts-0.033
vell-0.032
wheel-0.032
 disl-0.031
Token the
Token ID262
Feature activation+2.45e-03

Pos logit contributions
 year+0.007
abel+0.005
ie+0.005
 more+0.005
 barn+0.005
Neg logit contributions
anks-0.009
uts-0.009
wheel-0.009
 disl-0.009
vals-0.009
Token corn
Token ID11676
Feature activation-4.75e-02

Pos logit contributions
vals+0.005
anks+0.005
uts+0.005
mons+0.005
wheel+0.005
Neg logit contributions
 year-0.003
abel-0.003
ara-0.002
ie-0.002
 awoke-0.002
Token he
Token ID339
Feature activation-1.88e-02

Pos logit contributions
anks+0.130
uts+0.129
wheel+0.124
 disl+0.123
vals+0.118
Neg logit contributions
 year-0.143
abel-0.117
ie-0.103
ara-0.101
 barn-0.101
Token saw
Token ID2497
Feature activation-2.94e-02

Pos logit contributions
 year+0.027
abel+0.019
ie+0.016
 barn+0.015
ara+0.015
Neg logit contributions
anks-0.043
uts-0.042
wheel-0.042
 disl-0.040
vals-0.040
Token the
Token ID262
Feature activation+4.06e-03

Pos logit contributions
 year+0.048
abel+0.036
ara+0.030
ie+0.030
 awoke+0.029
Neg logit contributions
anks-0.061
uts-0.059
wheel-0.058
 disl-0.058
vals-0.056
Token long
Token ID890
Feature activation-6.29e-02

Pos logit contributions
anks+0.008
uts+0.008
 disl+0.008
wheel+0.008
vals+0.007
Neg logit contributions
 year-0.007
abel-0.005
ie-0.005
 more-0.005
 barn-0.005
Token st
Token ID336
Feature activation+1.43e-02

Pos logit contributions
 year+0.124
abel+0.107
ara+0.089
 awoke+0.088
unt+0.086
Neg logit contributions
anks-0.101
uts-0.095
vals-0.095
wheel-0.095
alks-0.093
Tokenalks
Token ID23833
Feature activation+2.55e-01

Pos logit contributions
anks+0.013
 disl+0.013
wheel+0.012
uts+0.012
vell+0.011
Neg logit contributions
 year-0.040
abel-0.034
ara-0.032
 barn-0.032
ie-0.032
Token growing
Token ID3957
Feature activation+5.09e-02

Pos logit contributions
anks+0.631
uts+0.606
 disl+0.605
wheel+0.598
vals+0.590
Neg logit contributions
 year-0.323
abel-0.222
ie-0.181
 more-0.176
 barn-0.170
Token there
Token ID612
Feature activation+4.50e-02

Pos logit contributions
anks+0.111
uts+0.108
wheel+0.106
 disl+0.104
vals+0.103
Neg logit contributions
 year-0.078
abel-0.059
ara-0.049
ie-0.049
 barn-0.049
Token.
Token ID13
Feature activation-1.27e-03

Pos logit contributions
anks+0.097
uts+0.096
wheel+0.094
 disl+0.089
vals+0.089
Neg logit contributions
 year-0.068
abel-0.051
ara-0.044
ie-0.043
 barn-0.042
Token All
Token ID1439
Feature activation-7.72e-02

Pos logit contributions
 year+0.002
abel+0.002
ara+0.001
 barn+0.001
ie+0.001
Neg logit contributions
anks-0.003
uts-0.003
wheel-0.003
vals-0.002
 disl-0.002
Token different
Token ID1180
Feature activation-5.65e-03

Pos logit contributions
 year+0.107
abel+0.079
ara+0.067
 barn+0.065
 restaurant+0.061
Neg logit contributions
anks-0.172
uts-0.171
wheel-0.169
 disl-0.167
vals-0.163
Token see
Token ID766
Feature activation+4.06e-02

Pos logit contributions
anks+0.065
vals+0.062
mons+0.060
wheel+0.060
 disl+0.060
Neg logit contributions
 year-0.081
abel-0.066
 more-0.059
 me-0.059
ie-0.058
Token were
Token ID547
Feature activation-1.53e-02

Pos logit contributions
anks+0.078
uts+0.076
wheel+0.075
 disl+0.074
vell+0.072
Neg logit contributions
 year-0.069
abel-0.052
ara-0.050
unt-0.044
 restaurant-0.044
Token the
Token ID262
Feature activation+4.01e-03

Pos logit contributions
 year+0.028
abel+0.022
ara+0.020
ie+0.020
 barn+0.019
Neg logit contributions
anks-0.028
uts-0.028
wheel-0.027
 disl-0.026
vell-0.026
Token wheat
Token ID17135
Feature activation-7.69e-04

Pos logit contributions
anks+0.008
uts+0.008
wheel+0.008
 disl+0.008
vals+0.008
Neg logit contributions
 year-0.006
abel-0.005
ie-0.004
ara-0.004
 awoke-0.004
Token st
Token ID336
Feature activation+6.40e-03

Pos logit contributions
 year+0.001
abel+0.001
ara+0.001
ie+0.001
 barn+0.001
Neg logit contributions
anks-0.002
uts-0.002
wheel-0.001
vals-0.001
 disl-0.001
Tokenalks
Token ID23833
Feature activation+2.54e-01

Pos logit contributions
 disl+0.008
wheel+0.007
anks+0.007
vell+0.007
uts+0.006
Neg logit contributions
 year-0.016
abel-0.013
 barn-0.013
 pasture-0.012
ie-0.012
Token sway
Token ID20009
Feature activation-2.78e-02

Pos logit contributions
anks+0.576
uts+0.559
wheel+0.548
 disl+0.547
vals+0.536
Neg logit contributions
 year-0.371
abel-0.271
ie-0.228
 barn-0.220
 more-0.220
Tokening
Token ID278
Feature activation-8.43e-02

Pos logit contributions
 year+0.043
abel+0.033
ara+0.028
 restaurant+0.028
 barn+0.028
Neg logit contributions
anks-0.059
uts-0.059
wheel-0.057
vell-0.055
vals-0.054
Token in
Token ID287
Feature activation+3.76e-03

Pos logit contributions
 year+0.146
abel+0.115
ara+0.102
ie+0.098
 barn+0.098
Neg logit contributions
anks-0.164
uts-0.160
wheel-0.157
 disl-0.152
vals-0.151
Token the
Token ID262
Feature activation+3.92e-03

Pos logit contributions
 disl+0.007
anks+0.006
alks+0.006
wheel+0.006
 depending+0.006
Neg logit contributions
 year-0.006
abel-0.005
ara-0.005
unt-0.005
ie-0.004
Token wind
Token ID2344
Feature activation+4.43e-02

Pos logit contributions
anks+0.007
vals+0.007
wheel+0.007
 disl+0.007
uts+0.006
Neg logit contributions
 year-0.007
abel-0.006
ie-0.005
ara-0.005
Will-0.005
Tokenboard
Token ID3526
Feature activation-2.77e-02

Pos logit contributions
anks+0.005
uts+0.005
wheel+0.004
vals+0.004
 disl+0.004
Neg logit contributions
 year-0.004
abel-0.003
ie-0.003
ara-0.003
 awoke-0.003
Token with
Token ID351
Feature activation+1.45e-02

Pos logit contributions
 year+0.037
abel+0.027
ie+0.022
ara+0.021
 barn+0.021
Neg logit contributions
anks-0.065
uts-0.064
wheel-0.063
 disl-0.062
vals-0.061
Token his
Token ID465
Feature activation-5.30e-03

Pos logit contributions
anks+0.037
uts+0.036
wheel+0.035
 disl+0.035
vals+0.035
Neg logit contributions
 year-0.017
abel-0.011
 more-0.011
 me-0.010
ie-0.010
Token colorful
Token ID20239
Feature activation+4.99e-02

Pos logit contributions
 year+0.008
 me+0.006
 more+0.006
 barn+0.006
abel+0.005
Neg logit contributions
anks-0.012
wheel-0.012
uts-0.011
vals-0.011
 disl-0.011
Token ch
Token ID442
Feature activation+8.41e-03

Pos logit contributions
anks+0.085
uts+0.082
wheel+0.080
 disl+0.078
vals+0.077
Neg logit contributions
 year-0.101
abel-0.081
ie-0.074
 barn-0.072
 more-0.072
Tokenalks
Token ID23833
Feature activation+2.50e-01

Pos logit contributions
wheel+0.006
anks+0.006
uts+0.006
 disl+0.006
vell+0.005
Neg logit contributions
 year-0.025
abel-0.021
ie-0.020
ara-0.020
 me-0.020
Token.
Token ID13
Feature activation+3.48e-03

Pos logit contributions
anks+0.651
 disl+0.647
wheel+0.620
vals+0.615
uts+0.611
Neg logit contributions
 year-0.284
abel-0.173
 more-0.155
ie-0.145
 again-0.124
Token One
Token ID1881
Feature activation-1.20e-01

Pos logit contributions
anks+0.008
uts+0.008
wheel+0.008
 disl+0.008
vals+0.008
Neg logit contributions
 year-0.004
abel-0.003
ie-0.003
ara-0.002
 barn-0.002
Token day
Token ID1110
Feature activation-1.21e-02

Pos logit contributions
 year+0.201
abel+0.171
ara+0.145
 barn+0.145
 awoke+0.143
Neg logit contributions
wheel-0.217
anks-0.211
uts-0.209
vals-0.207
 disl-0.205
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
 year+0.010
abel+0.005
ie+0.003
 more+0.003
ara+0.003
Neg logit contributions
anks-0.036
uts-0.034
wheel-0.034
 disl-0.034
vals-0.034
Token Tim
Token ID5045
Feature activation+5.96e-02

Pos logit contributions
 year+0.048
 more+0.034
ie+0.033
 me+0.031
abel+0.030
Neg logit contributions
anks-0.073
uts-0.067
vals-0.067
 disl-0.065
vell-0.065
Token we
Token ID356
Feature activation+1.52e-02

Pos logit contributions
anks+0.209
uts+0.200
wheel+0.196
vals+0.191
 disl+0.191
Neg logit contributions
 year-0.194
abel-0.154
 more-0.141
ie-0.135
 me-0.134
Token might
Token ID1244
Feature activation-1.75e-02

Pos logit contributions
anks+0.031
uts+0.029
 disl+0.029
wheel+0.028
vals+0.028
Neg logit contributions
 year-0.026
abel-0.021
ie-0.018
 more-0.018
 me-0.018
Token break
Token ID2270
Feature activation+4.76e-02

Pos logit contributions
 year+0.033
 more+0.026
abel+0.025
 me+0.025
 again+0.024
Neg logit contributions
anks-0.033
 disl-0.031
vell-0.031
uts-0.031
vals-0.030
Token the
Token ID262
Feature activation-8.06e-04

Pos logit contributions
anks+0.096
uts+0.095
 disl+0.091
wheel+0.090
vals+0.089
Neg logit contributions
 year-0.080
abel-0.062
 awoke-0.052
 barn-0.051
ara-0.051
Token st
Token ID336
Feature activation+1.14e-02

Pos logit contributions
 year+0.001
abel+0.001
ie+0.000
ara+0.000
 awoke+0.000
Neg logit contributions
anks-0.002
vals-0.002
uts-0.002
wheel-0.002
vell-0.002
Tokenalks
Token ID23833
Feature activation+2.50e-01

Pos logit contributions
anks+0.014
uts+0.013
wheel+0.013
 disl+0.013
vals+0.012
Neg logit contributions
 year-0.028
abel-0.024
ie-0.022
ara-0.022
 barn-0.022
Token."
Token ID526
Feature activation-2.53e-02

Pos logit contributions
anks+0.550
 disl+0.528
uts+0.524
wheel+0.518
vals+0.513
Neg logit contributions
 year-0.387
abel-0.283
ie-0.244
 more-0.243
 barn-0.231
Token
Token ID198
Feature activation+3.27e-03

Pos logit contributions
 year+0.037
abel+0.026
ara+0.022
ie+0.022
 barn+0.021
Neg logit contributions
anks-0.058
uts-0.057
wheel-0.056
 disl-0.055
vals-0.055
Token
Token ID198
Feature activation+2.31e-03

Pos logit contributions
uts+0.006
anks+0.006
 disl+0.006
wheel+0.006
issors+0.006
Neg logit contributions
 year-0.006
abel-0.005
 barn-0.004
ie-0.004
 awoke-0.004
TokenL
Token ID43
Feature activation+4.57e-03

Pos logit contributions
 disl+0.004
uts+0.004
wheel+0.004
issors+0.004
mons+0.004
Neg logit contributions
 year-0.004
abel-0.003
 awoke-0.003
 barn-0.003
ie-0.003
Tokenily
Token ID813
Feature activation-2.09e-02

Pos logit contributions
anks+0.008
uts+0.008
wheel+0.008
 disl+0.007
issors+0.007
Neg logit contributions
 year-0.009
abel-0.007
 barn-0.006
ie-0.006
 more-0.006
Token She
Token ID1375
Feature activation-2.07e-02

Pos logit contributions
 year+0.046
abel+0.037
ie+0.031
 awoke+0.029
 barn+0.029
Neg logit contributions
anks-0.060
uts-0.059
 disl-0.057
wheel-0.056
issors-0.055
Token had
Token ID550
Feature activation+2.04e-02

Pos logit contributions
 year+0.031
abel+0.023
ie+0.020
 more+0.019
ara+0.018
Neg logit contributions
anks-0.049
uts-0.047
 disl-0.046
wheel-0.045
vell-0.045
Token many
Token ID867
Feature activation-8.39e-02

Pos logit contributions
anks+0.048
uts+0.047
wheel+0.046
 disl+0.045
vals+0.045
Neg logit contributions
 year-0.029
abel-0.021
ie-0.017
ara-0.017
 barn-0.017
Token colorful
Token ID20239
Feature activation+5.20e-02

Pos logit contributions
 year+0.154
abel+0.123
ie+0.107
ara+0.107
 barn+0.106
Neg logit contributions
anks-0.157
uts-0.152
wheel-0.149
 disl-0.147
vals-0.144
Token ch
Token ID442
Feature activation+1.23e-02

Pos logit contributions
anks+0.090
uts+0.087
wheel+0.085
 disl+0.084
vell+0.081
Neg logit contributions
 year-0.106
abel-0.082
 more-0.078
ie-0.078
 barn-0.075
Tokenalks
Token ID23833
Feature activation+2.49e-01

Pos logit contributions
wheel+0.015
 disl+0.014
anks+0.014
uts+0.014
vell+0.014
Neg logit contributions
 year-0.030
abel-0.025
ie-0.024
ara-0.023
 me-0.023
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
anks+0.720
 disl+0.717
wheel+0.685
vals+0.684
issors+0.680
Neg logit contributions
 year-0.218
abel-0.107
 more-0.090
ie-0.080
 shop-0.056
Token make
Token ID787
Feature activation-1.70e-02

Pos logit contributions
 year+0.048
abel+0.039
 more+0.036
ie+0.036
 barn+0.035
Neg logit contributions
anks-0.032
uts-0.032
 disl-0.030
wheel-0.029
vals-0.029
Token her
Token ID607
Feature activation+2.75e-03

Pos logit contributions
 year+0.025
 more+0.021
abel+0.019
ie+0.017
 me+0.017
Neg logit contributions
 disl-0.036
uts-0.036
anks-0.036
vals-0.035
issors-0.035
Token pictures
Token ID5986
Feature activation+8.63e-02

Pos logit contributions
uts+0.006
anks+0.006
wheel+0.006
vals+0.005
 disl+0.005
Neg logit contributions
 year-0.005
 more-0.003
abel-0.003
 barn-0.003
ie-0.003
Token.
Token ID13
Feature activation-2.31e-02

Pos logit contributions
 disl+0.254
mons+0.239
issors+0.237
anks+0.232
ses+0.232
Neg logit contributions
 year-0.075
 more-0.063
 again-0.049
ie-0.035
 of-0.034
Token take
Token ID1011
Feature activation-4.45e-02

Pos logit contributions
 year+0.211
abel+0.171
ie+0.153
 more+0.150
 barn+0.148
Neg logit contributions
anks-0.180
uts-0.174
 disl-0.164
wheel-0.163
mons-0.161
Token the
Token ID262
Feature activation-3.69e-03

Pos logit contributions
 year+0.056
abel+0.037
 more+0.037
 me+0.031
ie+0.031
Neg logit contributions
uts-0.111
anks-0.108
 disl-0.107
wheel-0.104
vals-0.104
Token box
Token ID3091
Feature activation-4.41e-02

Pos logit contributions
 year+0.007
 more+0.005
 me+0.005
 shop+0.005
ie+0.004
Neg logit contributions
uts-0.008
anks-0.007
 bleed-0.007
 disl-0.007
wheel-0.007
Token of
Token ID286
Feature activation-2.60e-02

Pos logit contributions
 year+0.059
abel+0.040
ie+0.035
 more+0.034
 barn+0.032
Neg logit contributions
anks-0.106
 disl-0.104
uts-0.103
wheel-0.101
vals-0.100
Token ch
Token ID442
Feature activation+1.31e-02

Pos logit contributions
 year+0.061
 more+0.049
 me+0.047
ie+0.047
abel+0.046
Neg logit contributions
uts-0.040
anks-0.039
wheel-0.035
vals-0.034
vell-0.033
Tokenalks
Token ID23833
Feature activation+2.49e-01

Pos logit contributions
wheel+0.022
anks+0.022
uts+0.021
 disl+0.021
vell+0.021
Neg logit contributions
 year-0.027
abel-0.022
ie-0.021
ara-0.019
heart-0.019
Token out
Token ID503
Feature activation+1.18e-02

Pos logit contributions
 disl+0.643
anks+0.643
vals+0.610
uts+0.609
wheel+0.608
Neg logit contributions
 year-0.288
abel-0.178
 more-0.169
ie-0.166
 again-0.148
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
anks+0.031
uts+0.030
wheel+0.029
 disl+0.029
vals+0.029
Neg logit contributions
 year-0.013
abel-0.009
ara-0.007
ie-0.006
 barn-0.006
Token find
Token ID1064
Feature activation+5.21e-02

Pos logit contributions
 year+0.022
abel+0.018
 more+0.017
 me+0.017
ie+0.017
Neg logit contributions
uts-0.024
anks-0.024
wheel-0.022
mons-0.022
vell-0.022
Token the
Token ID262
Feature activation-9.93e-04

Pos logit contributions
anks+0.103
wheel+0.097
vell+0.095
uts+0.094
vals+0.093
Neg logit contributions
 year-0.090
abel-0.068
ara-0.064
 awoke-0.058
 barn-0.056
Token best
Token ID1266
Feature activation-2.98e-02

Pos logit contributions
 year+0.002
abel+0.001
 more+0.001
 barn+0.001
 shop+0.001
Neg logit contributions
uts-0.002
anks-0.002
 disl-0.002
wheel-0.002
 bleed-0.002
Token run
Token ID1057
Feature activation-4.42e-02

Pos logit contributions
 year+0.027
abel+0.022
ara+0.019
ie+0.019
heart+0.018
Neg logit contributions
anks-0.020
 disl-0.020
uts-0.019
wheel-0.019
vals-0.019
Token through
Token ID832
Feature activation-9.53e-03

Pos logit contributions
 year+0.084
abel+0.069
ie+0.059
ara+0.059
 barn+0.058
Neg logit contributions
anks-0.080
uts-0.079
wheel-0.076
 disl-0.074
vals-0.073
Token the
Token ID262
Feature activation-9.32e-03

Pos logit contributions
 year+0.018
abel+0.016
unt+0.015
 awoke+0.015
ara+0.015
Neg logit contributions
 disl-0.013
uts-0.013
alks-0.013
cks-0.012
mons-0.012
Token tall
Token ID7331
Feature activation-4.64e-02

Pos logit contributions
 year+0.015
abel+0.014
 awoke+0.011
ie+0.010
heart+0.010
Neg logit contributions
vals-0.017
alks-0.016
anks-0.016
issors-0.016
 disl-0.016
Token st
Token ID336
Feature activation+6.56e-03

Pos logit contributions
 year+0.071
abel+0.071
 awoke+0.053
 Amen+0.053
Will+0.053
Neg logit contributions
alks-0.081
vals-0.080
 disl-0.080
mons-0.076
uts-0.076
Tokenalks
Token ID23833
Feature activation+2.49e-01

Pos logit contributions
anks+0.006
wheel+0.006
uts+0.006
 disl+0.006
vell+0.005
Neg logit contributions
 year-0.018
abel-0.015
 barn-0.014
ara-0.014
ie-0.014
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
anks+0.586
uts+0.570
wheel+0.560
 disl+0.559
vals+0.547
Neg logit contributions
 year-0.339
abel-0.241
ie-0.200
 barn-0.191
 more-0.191
Token
Token ID198
Feature activation-1.60e-02

Pos logit contributions
 year+0.024
abel+0.019
ie+0.016
heart+0.016
 more+0.016
Neg logit contributions
anks-0.025
uts-0.025
 disl-0.024
wheel-0.023
issors-0.023
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
 year+0.029
abel+0.026
heart+0.024
 barn+0.023
 awoke+0.023
Neg logit contributions
uts-0.026
anks-0.024
wheel-0.023
 disl-0.022
issors-0.022
TokenOne
Token ID3198
Feature activation-1.24e-01

Pos logit contributions
 year+0.018
abel+0.017
heart+0.015
 awoke+0.014
 goodbye+0.013
Neg logit contributions
wheel-0.032
uts-0.032
issors-0.032
ends-0.031
 disl-0.031
Token day
Token ID1110
Feature activation-1.19e-02

Pos logit contributions
abel+0.226
 year+0.222
 awoke+0.190
Will+0.188
 again+0.184
Neg logit contributions
 supposed-0.154
uts-0.151
 disl-0.149
wheel-0.147
mons-0.144
Token was
Token ID373
Feature activation-3.32e-02

Pos logit contributions
 year+0.002
abel+0.001
heart+0.001
ara+0.001
 barn+0.001
Neg logit contributions
uts-0.003
wheel-0.003
 disl-0.003
anks-0.003
vals-0.003
Token full
Token ID1336
Feature activation-9.50e-02

Pos logit contributions
 year+0.049
abel+0.035
ie+0.030
 more+0.030
 barn+0.029
Neg logit contributions
anks-0.076
uts-0.075
wheel-0.073
 disl-0.073
vals-0.071
Token of
Token ID286
Feature activation-2.89e-02

Pos logit contributions
 year+0.136
abel+0.121
ara+0.110
 awoke+0.097
 Amen+0.095
Neg logit contributions
uts-0.197
anks-0.188
wheel-0.187
vals-0.168
mons-0.167
Token tall
Token ID7331
Feature activation-4.99e-02

Pos logit contributions
 year+0.041
abel+0.037
ara+0.033
 awoke+0.030
ie+0.028
Neg logit contributions
 supposed-0.056
 disl-0.056
uts-0.055
anks-0.054
vals-0.054
Token st
Token ID336
Feature activation+3.95e-03

Pos logit contributions
 year+0.081
abel+0.074
 awoke+0.063
ara+0.061
 Amen+0.061
Neg logit contributions
uts-0.087
 disl-0.083
mons-0.079
anks-0.079
vals-0.079
Tokenalks
Token ID23833
Feature activation+2.45e-01

Pos logit contributions
 disl+0.003
wheel+0.003
anks+0.003
uts+0.003
vell+0.003
Neg logit contributions
 year-0.011
abel-0.009
 barn-0.009
ie-0.009
ara-0.009
Token.
Token ID13
Feature activation-3.08e-02

Pos logit contributions
uts+0.549
anks+0.534
wheel+0.520
vals+0.493
vell+0.491
Neg logit contributions
 year-0.354
abel-0.273
ara-0.247
 barn-0.220
ie-0.220
Token Each
Token ID5501
Feature activation-7.19e-02

Pos logit contributions
 year+0.054
abel+0.044
ie+0.041
heart+0.037
ara+0.037
Neg logit contributions
uts-0.057
anks-0.057
wheel-0.055
 disl-0.054
issors-0.053
Token one
Token ID530
Feature activation-1.16e-01

Pos logit contributions
abel+0.109
 year+0.108
ara+0.096
Exc+0.092
 again+0.089
Neg logit contributions
vals-0.105
mons-0.097
 supposed-0.097
uts-0.096
alks-0.096
Token was
Token ID373
Feature activation-3.04e-02

Pos logit contributions
 year+0.186
abel+0.154
ara+0.133
 barn+0.128
 awoke+0.125
Neg logit contributions
uts-0.216
anks-0.211
wheel-0.209
 disl-0.207
vals-0.207
Token loaded
Token ID9639
Feature activation-5.74e-02

Pos logit contributions
 year+0.046
 more+0.032
abel+0.032
ie+0.029
 barn+0.029
Neg logit contributions
anks-0.070
uts-0.067
 disl-0.066
wheel-0.065
vals-0.064
Token feed
Token ID3745
Feature activation-6.18e-02

Pos logit contributions
 year+0.028
abel+0.022
ie+0.019
 more+0.019
 barn+0.019
Neg logit contributions
anks-0.029
uts-0.028
wheel-0.027
 disl-0.027
vals-0.026
Token it
Token ID340
Feature activation-4.80e-02

Pos logit contributions
 year+0.099
abel+0.077
ara+0.063
 barn+0.063
 awoke+0.062
Neg logit contributions
anks-0.131
uts-0.127
wheel-0.125
 disl-0.121
vals-0.120
Token with
Token ID351
Feature activation+1.25e-02

Pos logit contributions
 year+0.064
abel+0.043
 more+0.039
ie+0.038
 restaurant+0.036
Neg logit contributions
anks-0.115
uts-0.112
wheel-0.111
 disl-0.110
vals-0.109
Token colorful
Token ID20239
Feature activation+5.01e-02

Pos logit contributions
anks+0.025
 disl+0.024
uts+0.024
wheel+0.023
vals+0.022
Neg logit contributions
 year-0.021
abel-0.017
ara-0.015
ie-0.014
 awoke-0.014
Token ch
Token ID442
Feature activation+9.33e-03

Pos logit contributions
anks+0.091
uts+0.090
 disl+0.087
vals+0.085
wheel+0.084
Neg logit contributions
 year-0.093
abel-0.075
ara-0.066
ie-0.064
 awoke-0.064
Tokenalks
Token ID23833
Feature activation+2.44e-01

Pos logit contributions
anks+0.011
wheel+0.011
uts+0.010
 disl+0.010
vell+0.010
Neg logit contributions
 year-0.024
abel-0.020
ie-0.019
ara-0.018
 more-0.018
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
anks+0.584
uts+0.563
wheel+0.559
 disl+0.558
vals+0.546
Neg logit contributions
 year-0.324
abel-0.228
ie-0.187
 more-0.182
 barn-0.178
Token He
Token ID679
Feature activation-1.95e-02

Pos logit contributions
 year+0.023
abel+0.017
ie+0.015
 barn+0.014
ara+0.014
Neg logit contributions
anks-0.033
uts-0.032
wheel-0.032
 disl-0.032
vals-0.031
Token would
Token ID561
Feature activation-1.04e-01

Pos logit contributions
 year+0.033
heart+0.024
abel+0.023
ara+0.023
 barn+0.022
Neg logit contributions
uts-0.034
 disl-0.033
wheel-0.033
anks-0.032
vals-0.032
Token draw
Token ID3197
Feature activation+4.30e-02

Pos logit contributions
 year+0.204
abel+0.163
ara+0.147
ie+0.145
 barn+0.141
Neg logit contributions
anks-0.180
uts-0.174
wheel-0.172
 disl-0.170
vals-0.165
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
anks+0.090
uts+0.088
wheel+0.086
 disl+0.085
vals+0.083
Neg logit contributions
 year-0.070
abel-0.053
ie-0.046
ara-0.045
 barn-0.044
Token time
Token ID640
Feature activation+1.02e-02

Pos logit contributions
 year+0.013
abel+0.011
heart+0.011
 awoke+0.010
ie+0.010
Neg logit contributions
uts-0.014
issors-0.014
 disl-0.014
vals-0.014
 supposed-0.014
Token,
Token ID11
Feature activation-4.31e-02

Pos logit contributions
anks+0.030
wheel+0.029
uts+0.029
 disl+0.028
vals+0.028
Neg logit contributions
 year-0.009
abel-0.004
ie-0.003
ara-0.003
 more-0.002
Token there
Token ID612
Feature activation+8.36e-03

Pos logit contributions
 year+0.074
abel+0.058
ie+0.049
 barn+0.049
 restaurant+0.049
Neg logit contributions
anks-0.084
uts-0.082
wheel-0.081
 disl-0.078
vals-0.078
Token was
Token ID373
Feature activation-5.28e-02

Pos logit contributions
uts+0.019
anks+0.018
wheel+0.018
 disl+0.017
vals+0.017
Neg logit contributions
 year-0.011
abel-0.008
heart-0.007
 barn-0.007
 restaurant-0.006
Token a
Token ID257
Feature activation-9.71e-03

Pos logit contributions
 year+0.077
abel+0.056
 more+0.046
ie+0.046
ara+0.046
Neg logit contributions
anks-0.123
uts-0.118
wheel-0.116
 disl-0.116
vals-0.114
Token 3
Token ID513
Feature activation-1.85e-01

Pos logit contributions
 year+0.012
abel+0.010
 awoke+0.008
heart+0.008
ie+0.008
Neg logit contributions
vell-0.020
 disl-0.020
wheel-0.020
uts-0.020
bles-0.019
Token year
Token ID614
Feature activation-1.22e-01

Pos logit contributions
 awoke+0.096
 again+0.091
abel+0.087
One+0.082
 pasture+0.077
Neg logit contributions
uts-0.445
 disl-0.444
mons-0.415
wheel-0.413
pre-0.412
Token old
Token ID1468
Feature activation-9.94e-03

Pos logit contributions
 year+0.190
abel+0.144
ie+0.121
ara+0.120
 barn+0.119
Neg logit contributions
anks-0.261
uts-0.256
wheel-0.249
 disl-0.248
vals-0.242
Token called
Token ID1444
Feature activation+1.98e-04

Pos logit contributions
 year+0.015
abel+0.012
 awoke+0.012
One+0.011
heart+0.010
Neg logit contributions
uts-0.018
 disl-0.018
anks-0.017
wheel-0.017
mons-0.017
Token Sarah
Token ID10490
Feature activation-2.17e-03

Pos logit contributions
anks+0.000
uts+0.000
wheel+0.000
vals+0.000
 disl+0.000
Neg logit contributions
 year-0.000
abel-0.000
ie-0.000
 restaurant-0.000
ara-0.000
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 year+0.003
abel+0.003
 restaurant+0.002
heart+0.002
ara+0.002
Neg logit contributions
uts-0.004
anks-0.004
wheel-0.004
 disl-0.004
 supposed-0.004
Token very
Token ID845
Feature activation-2.18e-02

Pos logit contributions
 year+0.076
abel+0.061
ie+0.054
 barn+0.053
 more+0.053
Neg logit contributions
anks-0.070
uts-0.067
wheel-0.065
 disl-0.065
vals-0.063
Token tired
Token ID10032
Feature activation-3.94e-02

Pos logit contributions
 year+0.040
abel+0.031
ie+0.028
 more+0.028
 me+0.027
Neg logit contributions
anks-0.043
uts-0.040
vals-0.040
wheel-0.040
 disl-0.039
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 year+0.064
abel+0.052
ara+0.045
 awoke+0.042
Will+0.042
Neg logit contributions
anks-0.079
uts-0.079
wheel-0.078
vals-0.070
 bleed-0.070
Token needed
Token ID2622
Feature activation+1.80e-02

Pos logit contributions
 year+0.037
abel+0.031
ara+0.028
 barn+0.027
 restaurant+0.026
Neg logit contributions
anks-0.032
wheel-0.032
 disl-0.031
uts-0.031
 bleed-0.029
Token to
Token ID284
Feature activation-2.89e-02

Pos logit contributions
wheel+0.030
anks+0.030
uts+0.029
 disl+0.028
 bleed+0.028
Neg logit contributions
 year-0.034
abel-0.029
ara-0.027
heart-0.025
ie-0.024
Token rest
Token ID1334
Feature activation-1.84e-01

Pos logit contributions
 year+0.056
abel+0.045
ara+0.040
 barn+0.039
ie+0.039
Neg logit contributions
anks-0.051
uts-0.048
wheel-0.048
 disl-0.047
vals-0.045
Token.
Token ID13
Feature activation-8.98e-03

Pos logit contributions
 year+0.332
abel+0.298
ara+0.268
unt+0.246
 awoke+0.241
Neg logit contributions
uts-0.316
anks-0.314
wheel-0.293
 supposed-0.287
 bleed-0.285
Token She
Token ID1375
Feature activation-2.04e-02

Pos logit contributions
 year+0.013
abel+0.010
ie+0.008
 barn+0.008
ara+0.008
Neg logit contributions
anks-0.020
uts-0.019
wheel-0.019
 disl-0.019
vals-0.019
Token flew
Token ID13112
Feature activation+1.90e-02

Pos logit contributions
 year+0.041
unt+0.034
 Other+0.032
heart+0.032
 cafe+0.031
Neg logit contributions
 bleed-0.024
wheel-0.023
 cheated-0.021
 shar-0.021
 exc-0.021
Token to
Token ID284
Feature activation-2.56e-02

Pos logit contributions
anks+0.032
uts+0.032
wheel+0.032
vell+0.029
 disl+0.028
Neg logit contributions
 year-0.037
abel-0.031
ara-0.026
 barn-0.026
 restaurant-0.026
Token a
Token ID257
Feature activation+8.65e-03

Pos logit contributions
 year+0.044
abel+0.035
ara+0.030
ie+0.029
heart+0.029
Neg logit contributions
anks-0.050
 disl-0.048
wheel-0.048
uts-0.047
 bleed-0.045
Token Lily
Token ID20037
Feature activation-4.35e-03

Pos logit contributions
 year+0.042
abel+0.032
ie+0.028
 barn+0.027
ara+0.027
Neg logit contributions
anks-0.051
uts-0.050
wheel-0.049
 disl-0.049
vals-0.047
Token didn
Token ID1422
Feature activation+4.94e-02

Pos logit contributions
 year+0.008
abel+0.006
 barn+0.006
 restaurant+0.006
 pasture+0.006
Neg logit contributions
uts-0.007
wheel-0.007
 disl-0.007
anks-0.006
 bleed-0.006
Token't
Token ID470
Feature activation+3.42e-02

Pos logit contributions
 disl+0.074
anks+0.073
vals+0.070
vell+0.067
mons+0.066
Neg logit contributions
 year-0.111
abel-0.092
 more-0.081
One-0.081
ara-0.080
Token want
Token ID765
Feature activation+3.82e-02

Pos logit contributions
anks+0.052
uts+0.050
 disl+0.048
vals+0.048
vell+0.046
Neg logit contributions
 year-0.079
abel-0.063
ie-0.060
 more-0.059
 restaurant-0.056
Token to
Token ID284
Feature activation-3.14e-02

Pos logit contributions
anks+0.077
uts+0.076
wheel+0.073
 disl+0.070
vals+0.069
Neg logit contributions
 year-0.065
abel-0.049
ie-0.044
 barn-0.043
ara-0.043
Token rest
Token ID1334
Feature activation-1.83e-01

Pos logit contributions
 year+0.062
abel+0.050
 barn+0.044
ara+0.043
ie+0.043
Neg logit contributions
anks-0.055
uts-0.053
wheel-0.052
 disl-0.050
vals-0.049
Token,
Token ID11
Feature activation-3.69e-02

Pos logit contributions
 year+0.313
abel+0.257
ara+0.217
 awoke+0.214
 barn+0.214
Neg logit contributions
uts-0.356
anks-0.356
wheel-0.337
 disl-0.323
 supposed-0.318
Token but
Token ID475
Feature activation-3.15e-02

Pos logit contributions
 year+0.067
abel+0.052
ara+0.045
ie+0.045
 barn+0.044
Neg logit contributions
anks-0.070
uts-0.069
wheel-0.067
 disl-0.066
vals-0.064
Token her
Token ID607
Feature activation-7.18e-04

Pos logit contributions
 year+0.066
abel+0.052
 restaurant+0.047
 barn+0.047
 pasture+0.045
Neg logit contributions
wheel-0.046
uts-0.045
 disl-0.044
anks-0.042
 bleed-0.041
Token mom
Token ID1995
Feature activation-2.60e-02

Pos logit contributions
 year+0.001
abel+0.001
heart+0.001
ie+0.001
 awoke+0.001
Neg logit contributions
anks-0.001
uts-0.001
wheel-0.001
 disl-0.001
vell-0.001
Tokenmy
Token ID1820
Feature activation-1.62e-02

Pos logit contributions
 year+0.039
abel+0.029
 barn+0.025
 restaurant+0.025
 awoke+0.024
Neg logit contributions
anks-0.056
wheel-0.056
uts-0.055
 disl-0.052
vals-0.052
Token time
Token ID640
Feature activation+1.18e-02

Pos logit contributions
 year+0.006
abel+0.006
heart+0.005
 awoke+0.005
 shop+0.005
Neg logit contributions
uts-0.007
 disl-0.007
issors-0.007
 supposed-0.007
vals-0.007
Token,
Token ID11
Feature activation-3.98e-02

Pos logit contributions
anks+0.033
uts+0.032
wheel+0.032
 disl+0.032
vals+0.031
Neg logit contributions
 year-0.011
abel-0.006
ie-0.005
ara-0.004
 barn-0.004
Token there
Token ID612
Feature activation+1.01e-02

Pos logit contributions
 year+0.072
abel+0.058
 barn+0.049
 restaurant+0.049
heart+0.049
Neg logit contributions
anks-0.072
uts-0.071
wheel-0.069
vell-0.067
 disl-0.067
Token was
Token ID373
Feature activation-4.58e-02

Pos logit contributions
uts+0.022
anks+0.021
wheel+0.021
 disl+0.020
 supposed+0.020
Neg logit contributions
 year-0.013
abel-0.010
 barn-0.009
heart-0.009
 pasture-0.008
Token a
Token ID257
Feature activation-5.61e-03

Pos logit contributions
 year+0.068
abel+0.050
ie+0.042
 more+0.041
ara+0.041
Neg logit contributions
anks-0.105
uts-0.101
wheel-0.099
 disl-0.099
vals-0.097
Token 3
Token ID513
Feature activation-1.82e-01

Pos logit contributions
 year+0.007
abel+0.006
 awoke+0.005
heart+0.005
ie+0.005
Neg logit contributions
vell-0.012
 disl-0.012
wheel-0.012
uts-0.012
anks-0.011
Token year
Token ID614
Feature activation-1.19e-01

Pos logit contributions
 awoke+0.094
 again+0.086
abel+0.083
One+0.082
ara+0.076
Neg logit contributions
 disl-0.443
uts-0.443
mons-0.412
vell-0.411
pre-0.409
Token old
Token ID1468
Feature activation-7.67e-03

Pos logit contributions
 year+0.187
abel+0.141
ie+0.120
ara+0.118
 barn+0.117
Neg logit contributions
anks-0.256
uts-0.251
wheel-0.244
 disl-0.243
vals-0.237
Token girl
Token ID2576
Feature activation-5.54e-02

Pos logit contributions
 year+0.011
abel+0.009
 awoke+0.009
One+0.008
ara+0.007
Neg logit contributions
uts-0.014
 disl-0.014
anks-0.013
mons-0.013
wheel-0.013
Token named
Token ID3706
Feature activation-4.01e-02

Pos logit contributions
 year+0.084
abel+0.066
 restaurant+0.059
 awoke+0.059
ara+0.059
Neg logit contributions
uts-0.104
anks-0.104
 disl-0.101
vals-0.101
wheel-0.101
Token Mia
Token ID32189
Feature activation-5.69e-02

Pos logit contributions
 year+0.073
abel+0.056
 barn+0.050
 awoke+0.050
ara+0.049
Neg logit contributions
anks-0.075
uts-0.072
wheel-0.072
 disl-0.071
issors-0.068
Token time
Token ID640
Feature activation+1.19e-02

Pos logit contributions
 year+0.006
abel+0.006
heart+0.005
 awoke+0.005
 barn+0.005
Neg logit contributions
 disl-0.007
 supposed-0.007
uts-0.007
issors-0.007
wheel-0.006
Token,
Token ID11
Feature activation-4.15e-02

Pos logit contributions
anks+0.034
wheel+0.033
uts+0.033
 disl+0.033
vals+0.032
Neg logit contributions
 year-0.011
abel-0.006
ie-0.004
ara-0.004
 more-0.003
Token there
Token ID612
Feature activation+1.12e-02

Pos logit contributions
 year+0.075
abel+0.061
 barn+0.051
 restaurant+0.051
heart+0.051
Neg logit contributions
uts-0.073
anks-0.073
wheel-0.071
vell-0.069
 disl-0.068
Token was
Token ID373
Feature activation-4.84e-02

Pos logit contributions
uts+0.023
wheel+0.022
anks+0.022
 disl+0.022
 supposed+0.021
Neg logit contributions
 year-0.015
abel-0.012
 barn-0.010
heart-0.010
 pasture-0.010
Token a
Token ID257
Feature activation-5.76e-03

Pos logit contributions
 year+0.076
abel+0.057
ie+0.049
ara+0.048
 barn+0.048
Neg logit contributions
anks-0.103
uts-0.101
wheel-0.099
 disl-0.098
vals-0.096
Token 3
Token ID513
Feature activation-1.82e-01

Pos logit contributions
 year+0.007
abel+0.006
heart+0.005
 awoke+0.005
ie+0.005
Neg logit contributions
wheel-0.012
 disl-0.012
vell-0.012
uts-0.012
anks-0.011
Token year
Token ID614
Feature activation-1.20e-01

Pos logit contributions
 awoke+0.089
 again+0.080
One+0.075
abel+0.074
ara+0.072
Neg logit contributions
uts-0.448
 disl-0.445
mons-0.415
vell-0.413
wheel-0.412
Token old
Token ID1468
Feature activation-7.35e-03

Pos logit contributions
 year+0.188
abel+0.143
ie+0.120
ara+0.119
 barn+0.118
Neg logit contributions
anks-0.255
uts-0.250
wheel-0.244
 disl-0.243
vals-0.236
Token named
Token ID3706
Feature activation-4.36e-02

Pos logit contributions
 year+0.011
abel+0.009
 awoke+0.008
One+0.008
heart+0.007
Neg logit contributions
uts-0.013
 disl-0.013
anks-0.013
mons-0.013
wheel-0.013
Token Jack
Token ID3619
Feature activation-1.27e-02

Pos logit contributions
 year+0.074
abel+0.058
ie+0.049
ara+0.049
 restaurant+0.048
Neg logit contributions
anks-0.091
uts-0.088
vals-0.086
wheel-0.085
 disl-0.084
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
 year+0.019
abel+0.014
ara+0.012
 more+0.012
 barn+0.011
Neg logit contributions
anks-0.027
uts-0.027
 disl-0.026
wheel-0.026
vals-0.025
Token very
Token ID845
Feature activation-2.60e-03

Pos logit contributions
 year+0.029
abel+0.024
 more+0.021
ie+0.021
 barn+0.021
Neg logit contributions
anks-0.027
uts-0.026
wheel-0.025
 disl-0.025
vals-0.025
Token scared
Token ID12008
Feature activation+3.22e-02

Pos logit contributions
 year+0.005
abel+0.004
 more+0.003
ie+0.003
 restaurant+0.003
Neg logit contributions
anks-0.005
uts-0.005
wheel-0.005
vals-0.005
 disl-0.005
Token and
Token ID290
Feature activation-6.23e-03

Pos logit contributions
uts+0.062
anks+0.059
wheel+0.057
 bleed+0.053
 depending+0.052
Neg logit contributions
 year-0.053
abel-0.043
ara-0.039
 restaurant-0.039
 awoke-0.039
Token begged
Token ID32946
Feature activation-8.64e-02

Pos logit contributions
 year+0.012
abel+0.010
ie+0.009
ara+0.008
 barn+0.008
Neg logit contributions
anks-0.011
uts-0.011
wheel-0.011
 disl-0.010
vals-0.010
Token for
Token ID329
Feature activation-2.11e-02

Pos logit contributions
 year+0.161
abel+0.128
ara+0.113
 barn+0.112
 awoke+0.111
Neg logit contributions
anks-0.162
uts-0.148
wheel-0.146
issors-0.140
 disl-0.138
Token mercy
Token ID17703
Feature activation-1.82e-01

Pos logit contributions
 year+0.048
abel+0.040
ara+0.038
 awoke+0.036
ie+0.036
Neg logit contributions
anks-0.030
wheel-0.029
 disl-0.027
uts-0.027
 bleed-0.026
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 year+0.315
abel+0.265
ara+0.256
 barn+0.255
unt+0.252
Neg logit contributions
uts-0.320
anks-0.296
 supposed-0.285
wheel-0.280
 disl-0.275
Token He
Token ID679
Feature activation-9.56e-04

Pos logit contributions
vell+0.033
anks+0.033
vals+0.032
wheel+0.032
 disl+0.032
Neg logit contributions
 year-0.019
abel-0.013
ara-0.012
 shop-0.012
 me-0.012
Token said
Token ID531
Feature activation+3.71e-02

Pos logit contributions
 year+0.002
abel+0.002
 restaurant+0.002
heart+0.002
 barn+0.001
Neg logit contributions
wheel-0.001
uts-0.001
anks-0.001
 disl-0.001
 bleed-0.001
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
uts+0.066
 disl+0.063
anks+0.063
wheel+0.062
mons+0.058
Neg logit contributions
 year-0.071
abel-0.057
 barn-0.053
 pasture-0.052
ara-0.051
Token '
Token ID705
Feature activation+6.17e-02

Pos logit contributions
 year+0.018
abel+0.014
ie+0.012
ara+0.012
 barn+0.012
Neg logit contributions
anks-0.023
uts-0.023
wheel-0.022
 disl-0.022
vals-0.021
Token very
Token ID845
Feature activation-1.99e-02

Pos logit contributions
 year+0.048
abel+0.036
 more+0.032
ie+0.032
 barn+0.031
Neg logit contributions
anks-0.057
uts-0.056
wheel-0.054
vals-0.054
 disl-0.053
Token tired
Token ID10032
Feature activation-3.95e-02

Pos logit contributions
 year+0.036
abel+0.028
ie+0.025
ara+0.024
 more+0.024
Neg logit contributions
anks-0.039
uts-0.037
wheel-0.037
 disl-0.036
vals-0.035
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
 year+0.066
abel+0.055
ara+0.050
Will+0.046
 awoke+0.045
Neg logit contributions
uts-0.076
anks-0.075
wheel-0.075
 supposed-0.066
 bleed-0.065
Token wanted
Token ID2227
Feature activation+4.86e-05

Pos logit contributions
 year+0.037
abel+0.031
ara+0.028
 barn+0.027
 restaurant+0.026
Neg logit contributions
anks-0.030
 disl-0.030
uts-0.030
wheel-0.030
 bleed-0.028
Token to
Token ID284
Feature activation-3.18e-02

Pos logit contributions
uts+0.000
 supposed+0.000
anks+0.000
wheel+0.000
 disl+0.000
Neg logit contributions
 year-0.000
ara-0.000
abel-0.000
heart-0.000
ie-0.000
Token rest
Token ID1334
Feature activation-1.82e-01

Pos logit contributions
 year+0.060
abel+0.049
ara+0.043
 barn+0.042
ie+0.042
Neg logit contributions
anks-0.057
wheel-0.054
uts-0.054
 disl-0.053
vals-0.051
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
 year+0.361
abel+0.331
ara+0.296
unt+0.276
 barn+0.269
Neg logit contributions
anks-0.277
uts-0.272
wheel-0.260
 bleed-0.254
 supposed-0.250
Token Suddenly
Token ID24975
Feature activation-4.95e-02

Pos logit contributions
 year+0.034
abel+0.026
ie+0.023
 more+0.022
 barn+0.022
Neg logit contributions
anks-0.040
uts-0.039
wheel-0.038
 disl-0.037
vals-0.036
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
 year+0.071
abel+0.053
 restaurant+0.043
ara+0.041
 cafe+0.041
Neg logit contributions
uts-0.108
anks-0.107
vals-0.102
wheel-0.102
 disl-0.100
Token a
Token ID257
Feature activation+6.77e-03

Pos logit contributions
 year+0.039
abel+0.029
ie+0.024
ara+0.024
 barn+0.024
Neg logit contributions
anks-0.057
uts-0.056
wheel-0.055
 disl-0.054
vals-0.053
Token little
Token ID1310
Feature activation-4.42e-02

Pos logit contributions
wheel+0.014
vell+0.013
anks+0.013
uts+0.013
 disl+0.013
Neg logit contributions
 year-0.010
abel-0.009
ie-0.007
ara-0.007
heart-0.007
Token time
Token ID640
Feature activation+1.29e-02

Pos logit contributions
 year+0.009
abel+0.008
 awoke+0.007
heart+0.007
 shop+0.007
Neg logit contributions
uts-0.010
issors-0.010
 disl-0.010
 supposed-0.010
vals-0.010
Token,
Token ID11
Feature activation-4.06e-02

Pos logit contributions
anks+0.037
uts+0.036
wheel+0.036
 disl+0.035
vals+0.035
Neg logit contributions
 year-0.012
abel-0.006
ie-0.004
ara-0.004
 more-0.004
Token there
Token ID612
Feature activation+1.00e-02

Pos logit contributions
 year+0.069
abel+0.054
ie+0.046
 barn+0.045
ara+0.045
Neg logit contributions
anks-0.080
uts-0.079
wheel-0.077
 disl-0.075
vals-0.075
Token was
Token ID373
Feature activation-5.15e-02

Pos logit contributions
uts+0.021
anks+0.020
wheel+0.020
 disl+0.020
vals+0.019
Neg logit contributions
 year-0.014
abel-0.010
 barn-0.009
heart-0.009
unt-0.008
Token a
Token ID257
Feature activation-9.18e-03

Pos logit contributions
 year+0.081
abel+0.061
ie+0.052
ara+0.051
 barn+0.051
Neg logit contributions
anks-0.110
uts-0.108
wheel-0.105
 disl-0.104
vals-0.102
Token 3
Token ID513
Feature activation-1.81e-01

Pos logit contributions
 year+0.012
abel+0.010
 awoke+0.008
heart+0.008
ie+0.007
Neg logit contributions
vell-0.019
wheel-0.019
 disl-0.019
uts-0.019
anks-0.018
Token year
Token ID614
Feature activation-1.22e-01

Pos logit contributions
 awoke+0.089
 again+0.086
One+0.078
ara+0.074
abel+0.072
Neg logit contributions
 disl-0.434
uts-0.432
vell-0.407
mons-0.407
 supposed-0.404
Token old
Token ID1468
Feature activation-7.70e-03

Pos logit contributions
 year+0.191
abel+0.144
ie+0.122
ara+0.120
 barn+0.119
Neg logit contributions
anks-0.263
uts-0.257
wheel-0.251
 disl-0.249
vals-0.243
Token named
Token ID3706
Feature activation-4.09e-02

Pos logit contributions
 year+0.011
abel+0.009
 awoke+0.009
One+0.008
ara+0.008
Neg logit contributions
uts-0.014
 disl-0.014
anks-0.013
mons-0.013
wheel-0.013
Token Anna
Token ID11735
Feature activation-9.70e-02

Pos logit contributions
 year+0.067
abel+0.054
ie+0.045
ara+0.045
 me+0.044
Neg logit contributions
anks-0.087
uts-0.084
vals-0.083
wheel-0.081
 disl-0.081
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
 year+0.146
abel+0.109
ara+0.091
 barn+0.090
heart+0.090
Neg logit contributions
uts-0.205
anks-0.205
 disl-0.197
wheel-0.194
vals-0.191
Token time
Token ID640
Feature activation+1.46e-02

Pos logit contributions
 year+0.003
abel+0.003
 cottage+0.003
 barn+0.003
heart+0.003
Neg logit contributions
uts-0.003
 supposed-0.003
 disl-0.003
issors-0.003
vals-0.003
Token,
Token ID11
Feature activation-3.76e-02

Pos logit contributions
anks+0.040
uts+0.039
wheel+0.039
 disl+0.038
vals+0.038
Neg logit contributions
 year-0.014
abel-0.009
ie-0.006
ara-0.006
 barn-0.006
Token there
Token ID612
Feature activation+1.76e-02

Pos logit contributions
 year+0.070
abel+0.058
 barn+0.050
 restaurant+0.050
heart+0.049
Neg logit contributions
uts-0.063
anks-0.061
wheel-0.061
vell-0.059
bles-0.059
Token lived
Token ID5615
Feature activation-6.88e-02

Pos logit contributions
uts+0.033
wheel+0.032
 disl+0.031
anks+0.031
 supposed+0.030
Neg logit contributions
 year-0.025
abel-0.021
 barn-0.019
heart-0.019
 pasture-0.018
Token a
Token ID257
Feature activation-4.98e-03

Pos logit contributions
 year+0.112
abel+0.086
ie+0.075
ara+0.071
 barn+0.071
Neg logit contributions
anks-0.143
uts-0.139
wheel-0.136
 disl-0.133
vals-0.131
Token 3
Token ID513
Feature activation-1.81e-01

Pos logit contributions
 year+0.007
abel+0.005
heart+0.004
 awoke+0.004
ie+0.004
Neg logit contributions
 disl-0.010
wheel-0.010
uts-0.010
vell-0.010
anks-0.010
Token-
Token ID12
Feature activation-2.62e-05

Pos logit contributions
 awoke+0.100
abel+0.091
One+0.090
 again+0.088
ara+0.082
Neg logit contributions
uts-0.435
 disl-0.426
wheel-0.398
mons-0.396
pre-0.395
Tokenyear
Token ID1941
Feature activation-1.27e-01

Pos logit contributions
 year+0.000
abel+0.000
ara+0.000
 barn+0.000
 awoke+0.000
Neg logit contributions
anks-0.000
uts-0.000
wheel-0.000
 disl-0.000
vell-0.000
Token-
Token ID12
Feature activation-9.30e-04

Pos logit contributions
 year+0.134
abel+0.098
ara+0.079
 awoke+0.076
Will+0.075
Neg logit contributions
anks-0.322
uts-0.321
wheel-0.306
 disl-0.304
vals-0.298
Tokenold
Token ID727
Feature activation+1.28e-02

Pos logit contributions
 year+0.002
abel+0.001
 barn+0.001
ara+0.001
 awoke+0.001
Neg logit contributions
anks-0.002
uts-0.002
wheel-0.002
 disl-0.002
vals-0.002
Token girl
Token ID2576
Feature activation-5.46e-02

Pos logit contributions
anks+0.027
wheel+0.027
uts+0.026
 disl+0.026
vell+0.025
Neg logit contributions
 year-0.017
abel-0.013
One-0.011
 awoke-0.011
ara-0.010
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
 year+0.016
abel+0.010
ie+0.007
ara+0.006
 barn+0.006
Neg logit contributions
anks-0.046
uts-0.045
wheel-0.045
 disl-0.044
vals-0.044
Token Tim
Token ID5045
Feature activation+5.91e-02

Pos logit contributions
 year+0.061
 more+0.043
abel+0.043
ie+0.043
 restaurant+0.040
Neg logit contributions
anks-0.068
uts-0.065
vals-0.062
wheel-0.062
 disl-0.061
Tokenmy
Token ID1820
Feature activation-1.70e-02

Pos logit contributions
anks+0.120
wheel+0.116
uts+0.114
 supposed+0.112
 disl+0.109
Neg logit contributions
 year-0.092
abel-0.071
 barn-0.059
 more-0.057
 restaurant-0.057
Token went
Token ID1816
Feature activation+2.04e-02

Pos logit contributions
 year+0.027
abel+0.021
 barn+0.017
heart+0.017
 more+0.017
Neg logit contributions
uts-0.034
anks-0.033
wheel-0.033
 disl-0.032
vals-0.031
Token to
Token ID284
Feature activation-2.82e-02

Pos logit contributions
anks+0.040
wheel+0.039
uts+0.038
vals+0.035
 disl+0.035
Neg logit contributions
 year-0.036
abel-0.030
ara-0.025
heart-0.025
ie-0.024
Token rest
Token ID1334
Feature activation-1.81e-01

Pos logit contributions
 year+0.038
abel+0.026
ie+0.022
 restaurant+0.022
 more+0.022
Neg logit contributions
anks-0.068
uts-0.068
wheel-0.065
vals-0.065
 disl-0.064
Token under
Token ID739
Feature activation-1.93e-02

Pos logit contributions
 year+0.332
abel+0.306
ara+0.248
 awoke+0.243
 barn+0.241
Neg logit contributions
anks-0.306
uts-0.297
wheel-0.284
bles-0.272
 supposed-0.271
Token a
Token ID257
Feature activation+8.42e-03

Pos logit contributions
 year+0.032
abel+0.026
ara+0.022
ie+0.022
heart+0.022
Neg logit contributions
anks-0.037
uts-0.037
 disl-0.036
wheel-0.036
alks-0.034
Token big
Token ID1263
Feature activation-3.11e-02

Pos logit contributions
uts+0.014
wheel+0.014
 disl+0.013
 explode+0.012
issors+0.012
Neg logit contributions
 year-0.014
abel-0.013
heart-0.012
ara-0.011
Will-0.010
Token tree
Token ID5509
Feature activation-5.08e-02

Pos logit contributions
 year+0.046
abel+0.039
Will+0.033
ara+0.032
heart+0.032
Neg logit contributions
wheel-0.060
uts-0.059
anks-0.058
 disl-0.058
alks-0.055
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 year+0.077
abel+0.062
ara+0.054
heart+0.050
ie+0.049
Neg logit contributions
anks-0.106
uts-0.105
wheel-0.105
vals-0.098
 disl-0.097
Tokenl
Token ID75
Feature activation+2.86e-02

Pos logit contributions
 year+0.026
abel+0.019
ie+0.016
ara+0.016
 barn+0.015
Neg logit contributions
anks-0.041
uts-0.039
wheel-0.039
 disl-0.038
vals-0.038
Tokenle
Token ID293
Feature activation-1.47e-01

Pos logit contributions
anks+0.061
uts+0.060
wheel+0.057
vals+0.055
mons+0.055
Neg logit contributions
 year-0.043
abel-0.033
heart-0.029
 restaurant-0.028
 more-0.028
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
 year+0.210
abel+0.171
 awoke+0.155
 restaurant+0.154
 more+0.154
Neg logit contributions
uts-0.281
anks-0.262
mons-0.259
 disl-0.256
wheel-0.255
Token She
Token ID1375
Feature activation-3.33e-02

Pos logit contributions
 year+0.051
abel+0.037
ie+0.031
ara+0.030
 barn+0.030
Neg logit contributions
anks-0.081
uts-0.079
wheel-0.078
 disl-0.077
vals-0.075
Token was
Token ID373
Feature activation-4.89e-02

Pos logit contributions
 year+0.049
abel+0.039
heart+0.034
 barn+0.034
 more+0.032
Neg logit contributions
uts-0.068
wheel-0.066
anks-0.066
 disl-0.066
vals-0.064
Token 3
Token ID513
Feature activation-1.81e-01

Pos logit contributions
 year+0.068
abel+0.049
 more+0.041
ie+0.040
 barn+0.039
Neg logit contributions
anks-0.117
uts-0.115
 disl-0.113
wheel-0.111
vals-0.110
Token years
Token ID812
Feature activation-4.86e-02

Pos logit contributions
 awoke+0.192
ara+0.185
 year+0.179
abel+0.167
One+0.166
Neg logit contributions
uts-0.384
 disl-0.361
anks-0.345
 supposed-0.343
aces-0.341
Token old
Token ID1468
Feature activation-6.75e-03

Pos logit contributions
 year+0.077
abel+0.059
ie+0.050
ara+0.050
 barn+0.049
Neg logit contributions
anks-0.102
uts-0.100
wheel-0.097
 disl-0.097
vals-0.094
Token and
Token ID290
Feature activation-2.33e-02

Pos logit contributions
 year+0.011
abel+0.009
 awoke+0.008
One+0.008
heart+0.008
Neg logit contributions
uts-0.012
anks-0.012
mons-0.011
wheel-0.011
 disl-0.011
Token loved
Token ID6151
Feature activation-5.93e-02

Pos logit contributions
 year+0.034
abel+0.025
ie+0.021
ara+0.021
 barn+0.021
Neg logit contributions
anks-0.052
uts-0.051
wheel-0.050
 disl-0.050
vals-0.049
Token to
Token ID284
Feature activation-3.45e-02

Pos logit contributions
 year+0.092
abel+0.069
ie+0.059
 barn+0.057
ara+0.057
Neg logit contributions
anks-0.129
uts-0.126
wheel-0.123
 disl-0.122
vals-0.120
Token a
Token ID257
Feature activation-7.10e-03

Pos logit contributions
 year+0.070
abel+0.053
ie+0.045
 barn+0.044
ara+0.044
Neg logit contributions
anks-0.091
uts-0.089
wheel-0.087
 disl-0.086
vals-0.085
Token Mom
Token ID11254
Feature activation-1.96e-02

Pos logit contributions
 year+0.009
abel+0.007
ie+0.005
 awoke+0.005
heart+0.005
Neg logit contributions
uts-0.016
anks-0.016
wheel-0.016
 disl-0.016
vell-0.016
Tokenmy
Token ID1820
Feature activation-2.13e-02

Pos logit contributions
 year+0.028
abel+0.021
 more+0.017
 barn+0.017
 restaurant+0.016
Neg logit contributions
anks-0.043
uts-0.042
 disl-0.042
wheel-0.042
vals-0.041
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 year+0.028
abel+0.020
 more+0.016
ara+0.016
ie+0.016
Neg logit contributions
anks-0.050
uts-0.049
wheel-0.048
 disl-0.048
vals-0.047
Token a
Token ID257
Feature activation-7.20e-03

Pos logit contributions
 year+0.035
abel+0.026
ie+0.022
ara+0.022
 barn+0.022
Neg logit contributions
anks-0.049
uts-0.047
wheel-0.046
 disl-0.046
vals-0.045
Token 3
Token ID513
Feature activation-1.81e-01

Pos logit contributions
 year+0.009
abel+0.007
ie+0.005
 awoke+0.005
heart+0.005
Neg logit contributions
 disl-0.016
uts-0.016
wheel-0.016
anks-0.016
vell-0.016
Token year
Token ID614
Feature activation-1.20e-01

Pos logit contributions
ara+0.057
abel+0.053
 again+0.053
 awoke+0.052
One+0.050
Neg logit contributions
 disl-0.468
uts-0.461
mons-0.455
 supposed-0.448
pre-0.441
Token old
Token ID1468
Feature activation-8.38e-03

Pos logit contributions
 year+0.191
abel+0.149
ara+0.125
ie+0.124
 awoke+0.122
Neg logit contributions
anks-0.252
uts-0.247
 disl-0.240
wheel-0.239
vals-0.232
Token girl
Token ID2576
Feature activation-5.56e-02

Pos logit contributions
 year+0.012
abel+0.010
One+0.009
 awoke+0.009
Will+0.008
Neg logit contributions
 disl-0.015
mons-0.015
anks-0.015
uts-0.015
pre-0.014
Token went
Token ID1816
Feature activation+1.35e-02

Pos logit contributions
 year+0.077
abel+0.058
 restaurant+0.049
ara+0.049
heart+0.048
Neg logit contributions
anks-0.119
uts-0.114
vals-0.114
 disl-0.114
wheel-0.112
Token for
Token ID329
Feature activation-4.20e-02

Pos logit contributions
anks+0.029
uts+0.028
wheel+0.027
 disl+0.027
vals+0.026
Neg logit contributions
 year-0.022
abel-0.017
ara-0.014
ie-0.014
 barn-0.013
Token need
Token ID761
Feature activation-1.08e-02

Pos logit contributions
anks+0.037
uts+0.036
 disl+0.035
wheel+0.034
vals+0.034
Neg logit contributions
 year-0.036
abel-0.029
 more-0.025
ie-0.025
 awoke-0.024
Token to
Token ID284
Feature activation-2.87e-02

Pos logit contributions
 year+0.020
abel+0.017
ie+0.015
ara+0.015
heart+0.014
Neg logit contributions
anks-0.019
uts-0.018
wheel-0.018
vals-0.017
 subtract-0.017
Token let
Token ID1309
Feature activation-4.17e-02

Pos logit contributions
 year+0.060
abel+0.049
ie+0.044
ara+0.044
 barn+0.044
Neg logit contributions
anks-0.045
wheel-0.043
uts-0.043
 disl-0.042
vals-0.041
Token the
Token ID262
Feature activation-9.08e-03

Pos logit contributions
 year+0.055
abel+0.039
ie+0.031
ara+0.030
 barn+0.030
Neg logit contributions
anks-0.100
uts-0.098
wheel-0.096
 disl-0.095
vals-0.093
Token TV
Token ID3195
Feature activation-6.90e-03

Pos logit contributions
 year+0.014
abel+0.011
ie+0.010
ara+0.010
 barn+0.009
Neg logit contributions
anks-0.019
uts-0.018
wheel-0.018
 disl-0.018
vals-0.018
Token rest
Token ID1334
Feature activation-1.80e-01

Pos logit contributions
 year+0.012
abel+0.009
 more+0.008
 me+0.008
ie+0.008
Neg logit contributions
anks-0.014
uts-0.014
 disl-0.014
vals-0.013
mons-0.013
Token for
Token ID329
Feature activation-3.10e-02

Pos logit contributions
 year+0.335
abel+0.287
 awoke+0.249
 barn+0.248
ara+0.247
Neg logit contributions
anks-0.319
uts-0.316
wheel-0.295
issors-0.284
mons-0.279
Token a
Token ID257
Feature activation+8.67e-03

Pos logit contributions
 year+0.051
abel+0.040
ie+0.036
ara+0.035
heart+0.035
Neg logit contributions
anks-0.062
uts-0.061
wheel-0.059
 disl-0.059
vals-0.057
Token while
Token ID981
Feature activation-9.70e-02

Pos logit contributions
uts+0.017
wheel+0.017
anks+0.017
 disl+0.017
vell+0.016
Neg logit contributions
 year-0.013
abel-0.011
heart-0.009
ie-0.009
ara-0.009
Token."
Token ID526
Feature activation-3.24e-02

Pos logit contributions
 year+0.179
abel+0.152
ara+0.133
 awoke+0.131
ie+0.129
Neg logit contributions
anks-0.169
uts-0.165
wheel-0.156
issors-0.149
vals-0.149
Token But
Token ID887
Feature activation-1.52e-02

Pos logit contributions
 year+0.049
abel+0.036
ie+0.030
ara+0.030
 barn+0.029
Neg logit contributions
anks-0.072
uts-0.071
wheel-0.069
 disl-0.068
vals-0.067
Token<|endoftext|>
Token ID50256
Feature activation-6.22e-02

Pos logit contributions
anks+0.038
uts+0.037
wheel+0.036
 disl+0.036
vals+0.035
Neg logit contributions
 year-0.028
abel-0.021
ie-0.018
 barn-0.018
ara-0.017
TokenJoe
Token ID19585
Feature activation-2.96e-02

Pos logit contributions
 year+0.103
abel+0.085
ie+0.075
heart+0.074
 barn+0.072
Neg logit contributions
anks-0.121
uts-0.119
wheel-0.114
 disl-0.108
vals-0.108
Tokeny
Token ID88
Feature activation-8.83e-02

Pos logit contributions
 year+0.038
abel+0.027
 more+0.022
 barn+0.022
 restaurant+0.021
Neg logit contributions
anks-0.068
uts-0.067
 disl-0.066
wheel-0.066
vals-0.064
Token was
Token ID373
Feature activation-4.16e-02

Pos logit contributions
 year+0.115
abel+0.080
ie+0.065
 barn+0.063
ara+0.063
Neg logit contributions
anks-0.210
uts-0.206
wheel-0.202
 disl-0.201
vals-0.196
Token a
Token ID257
Feature activation-7.20e-03

Pos logit contributions
 year+0.073
abel+0.055
 more+0.050
 barn+0.048
 restaurant+0.047
Neg logit contributions
anks-0.085
uts-0.084
 disl-0.082
vals-0.081
wheel-0.079
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.011
abel+0.009
ie+0.008
heart+0.008
 awoke+0.008
Neg logit contributions
wheel-0.013
 disl-0.013
vell-0.013
anks-0.012
uts-0.012
Token-
Token ID12
Feature activation+2.36e-03

Pos logit contributions
ara+0.075
 awoke+0.071
One+0.066
abel+0.065
 barn+0.058
Neg logit contributions
uts-0.456
 disl-0.451
wheel-0.433
 supposed-0.421
 exc-0.409
Tokenyear
Token ID1941
Feature activation-1.23e-01

Pos logit contributions
wheel+0.005
anks+0.005
uts+0.005
 disl+0.005
vals+0.005
Neg logit contributions
 year-0.003
abel-0.003
 barn-0.002
ara-0.002
 awoke-0.002
Token-
Token ID12
Feature activation+1.79e-03

Pos logit contributions
 year+0.131
abel+0.102
ara+0.088
 awoke+0.086
Will+0.085
Neg logit contributions
uts-0.308
anks-0.308
wheel-0.291
 disl-0.289
alks-0.282
Tokenold
Token ID727
Feature activation+1.25e-02

Pos logit contributions
anks+0.003
uts+0.003
wheel+0.003
 disl+0.003
vals+0.003
Neg logit contributions
 year-0.003
abel-0.002
 barn-0.002
ara-0.002
 awoke-0.002
Token boy
Token ID2933
Feature activation-4.24e-02

Pos logit contributions
anks+0.027
uts+0.026
wheel+0.026
 disl+0.025
mons+0.024
Neg logit contributions
 year-0.016
abel-0.013
One-0.011
ara-0.010
 awoke-0.010
Token for
Token ID329
Feature activation-3.37e-02

Pos logit contributions
 year+0.143
abel+0.133
ara+0.121
Will+0.115
One+0.111
Neg logit contributions
anks-0.088
uts-0.077
vell-0.074
bles-0.072
issors-0.072
Token a
Token ID257
Feature activation+1.55e-03

Pos logit contributions
 year+0.068
abel+0.061
ara+0.056
unt+0.053
heart+0.052
Neg logit contributions
anks-0.045
 bleed-0.043
 disl-0.041
uts-0.040
 choke-0.039
Token cozy
Token ID37438
Feature activation-4.42e-02

Pos logit contributions
uts+0.002
wheel+0.002
vell+0.002
 bleed+0.002
anks+0.002
Neg logit contributions
 year-0.003
abel-0.003
heart-0.002
ie-0.002
ara-0.002
Token spot
Token ID4136
Feature activation-7.63e-02

Pos logit contributions
abel+0.078
 year+0.072
aw+0.066
heart+0.064
ement+0.062
Neg logit contributions
anks-0.055
uts-0.054
wheel-0.052
 bleed-0.052
 cartridge-0.051
Token to
Token ID284
Feature activation-2.87e-02

Pos logit contributions
 year+0.127
abel+0.103
ara+0.090
 barn+0.085
heart+0.084
Neg logit contributions
anks-0.153
uts-0.150
wheel-0.146
vell-0.137
 disl-0.136
Token rest
Token ID1334
Feature activation-1.80e-01

Pos logit contributions
 year+0.063
abel+0.056
heart+0.052
unt+0.052
ara+0.050
Neg logit contributions
wheel-0.039
anks-0.037
 bleed-0.036
 choke-0.032
 disl-0.031
Token.
Token ID13
Feature activation-2.29e-02

Pos logit contributions
 year+0.330
abel+0.310
ara+0.274
unt+0.266
 awoke+0.264
Neg logit contributions
uts-0.301
anks-0.299
bles-0.267
 supposed-0.265
wheel-0.263
Token He
Token ID679
Feature activation-2.05e-02

Pos logit contributions
 year+0.038
abel+0.030
ie+0.025
ara+0.024
 barn+0.024
Neg logit contributions
anks-0.047
uts-0.046
wheel-0.045
 disl-0.044
vals-0.042
Token found
Token ID1043
Feature activation+3.22e-02

Pos logit contributions
 year+0.042
heart+0.036
unt+0.034
abel+0.034
ement+0.033
Neg logit contributions
wheel-0.024
 bleed-0.022
ends-0.022
 supposed-0.021
uts-0.021
Token a
Token ID257
Feature activation+5.15e-03

Pos logit contributions
anks+0.046
wheel+0.044
uts+0.042
 bleed+0.042
mons+0.040
Neg logit contributions
 year-0.071
abel-0.056
ara-0.054
heart-0.053
unt-0.050
Token soft
Token ID2705
Feature activation-5.53e-02

Pos logit contributions
wheel+0.008
uts+0.008
vell+0.008
 bleed+0.008
anks+0.008
Neg logit contributions
 year-0.009
abel-0.008
ie-0.007
heart-0.007
ara-0.007
Token a
Token ID257
Feature activation-5.13e-03

Pos logit contributions
 year+0.034
abel+0.025
ie+0.021
ara+0.020
 barn+0.020
Neg logit contributions
anks-0.052
uts-0.051
wheel-0.050
 disl-0.049
vals-0.048
Token daddy
Token ID35822
Feature activation-4.10e-03

Pos logit contributions
 year+0.007
abel+0.005
ie+0.004
heart+0.004
 awoke+0.004
Neg logit contributions
 disl-0.011
wheel-0.011
uts-0.011
vell-0.011
anks-0.011
Token who
Token ID508
Feature activation-5.93e-02

Pos logit contributions
 year+0.006
abel+0.004
ara+0.003
ie+0.003
 barn+0.003
Neg logit contributions
anks-0.009
uts-0.009
wheel-0.009
 disl-0.009
vals-0.009
Token had
Token ID550
Feature activation+3.97e-03

Pos logit contributions
 year+0.081
abel+0.059
ie+0.048
ara+0.047
 barn+0.047
Neg logit contributions
anks-0.139
uts-0.136
wheel-0.133
 disl-0.132
vals-0.130
Token a
Token ID257
Feature activation-4.79e-03

Pos logit contributions
uts+0.007
anks+0.007
wheel+0.007
 disl+0.007
 supposed+0.007
Neg logit contributions
 year-0.007
abel-0.006
ie-0.005
ara-0.005
heart-0.005
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.007
abel+0.006
ie+0.006
heart+0.006
 awoke+0.005
Neg logit contributions
 disl-0.009
anks-0.008
wheel-0.008
uts-0.008
 supposed-0.008
Token year
Token ID614
Feature activation-1.15e-01

Pos logit contributions
ara+0.065
abel+0.062
 awoke+0.054
 again+0.047
One+0.044
Neg logit contributions
 disl-0.483
uts-0.480
mons-0.456
 supposed-0.453
wheel-0.445
Token old
Token ID1468
Feature activation-3.29e-03

Pos logit contributions
 year+0.191
abel+0.156
ara+0.138
 awoke+0.136
Will+0.133
Neg logit contributions
anks-0.222
uts-0.221
 disl-0.216
wheel-0.211
issors-0.201
Token child
Token ID1200
Feature activation-7.29e-02

Pos logit contributions
 year+0.005
abel+0.004
 awoke+0.004
One+0.004
ara+0.004
Neg logit contributions
anks-0.006
 disl-0.005
mons-0.005
uts-0.005
drivers-0.005
Token.
Token ID13
Feature activation-2.78e-02

Pos logit contributions
 year+0.103
abel+0.078
ara+0.074
 awoke+0.071
heart+0.068
Neg logit contributions
uts-0.152
anks-0.148
wheel-0.144
 disl-0.144
vals-0.140
Token Each
Token ID5501
Feature activation-8.36e-02

Pos logit contributions
 year+0.050
abel+0.039
ie+0.035
 more+0.033
ara+0.033
Neg logit contributions
anks-0.053
uts-0.052
 disl-0.051
wheel-0.050
vals-0.049
Token time
Token ID640
Feature activation+1.14e-02

Pos logit contributions
 year+0.010
abel+0.009
heart+0.008
 awoke+0.008
 cottage+0.008
Neg logit contributions
uts-0.010
issors-0.010
 supposed-0.010
 disl-0.010
vals-0.010
Token there
Token ID612
Feature activation+1.06e-02

Pos logit contributions
anks+0.033
wheel+0.032
uts+0.032
 disl+0.032
vals+0.032
Neg logit contributions
 year-0.010
abel-0.005
ie-0.003
ara-0.003
 more-0.003
Token was
Token ID373
Feature activation-4.57e-02

Pos logit contributions
uts+0.025
anks+0.025
wheel+0.025
 disl+0.024
vals+0.024
Neg logit contributions
 year-0.012
abel-0.009
 barn-0.007
heart-0.006
 more-0.006
Token a
Token ID257
Feature activation-7.41e-03

Pos logit contributions
 year+0.065
abel+0.046
 more+0.039
ie+0.038
 barn+0.038
Neg logit contributions
anks-0.109
uts-0.105
wheel-0.103
 disl-0.103
vals-0.101
Token little
Token ID1310
Feature activation-5.66e-02

Pos logit contributions
 year+0.010
abel+0.008
 awoke+0.007
heart+0.007
ie+0.006
Neg logit contributions
uts-0.015
wheel-0.015
 disl-0.015
vell-0.015
anks-0.015
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.077
abel+0.064
 awoke+0.056
One+0.049
Will+0.045
Neg logit contributions
wheel-0.112
uts-0.112
mons-0.111
anks-0.111
vell-0.110
Token year
Token ID614
Feature activation-1.18e-01

Pos logit contributions
 awoke+0.112
abel+0.104
ara+0.095
 again+0.094
One+0.094
Neg logit contributions
uts-0.435
 disl-0.423
mons-0.399
wheel-0.392
 supposed-0.389
Token old
Token ID1468
Feature activation-8.13e-03

Pos logit contributions
 year+0.184
abel+0.140
ie+0.118
ara+0.118
 barn+0.116
Neg logit contributions
anks-0.251
uts-0.246
wheel-0.239
 disl-0.238
vals-0.232
Token who
Token ID508
Feature activation-5.79e-02

Pos logit contributions
 year+0.012
abel+0.010
 awoke+0.009
One+0.009
heart+0.008
Neg logit contributions
uts-0.015
anks-0.015
 disl-0.015
wheel-0.014
mons-0.014
Token wanted
Token ID2227
Feature activation-7.54e-03

Pos logit contributions
 year+0.074
abel+0.051
ie+0.040
 awoke+0.037
 more+0.036
Neg logit contributions
anks-0.147
uts-0.143
wheel-0.140
 disl-0.136
vals-0.136
Token to
Token ID284
Feature activation-3.58e-02

Pos logit contributions
 year+0.014
abel+0.011
ie+0.011
ara+0.011
heart+0.010
Neg logit contributions
uts-0.013
anks-0.013
wheel-0.013
 disl-0.012
 supposed-0.012
Token In
Token ID554
Feature activation-2.88e-02

Pos logit contributions
 year+0.054
abel+0.044
ie+0.040
ara+0.038
 more+0.038
Neg logit contributions
anks-0.049
uts-0.048
wheel-0.047
 disl-0.046
vals-0.044
Token this
Token ID428
Feature activation-3.37e-03

Pos logit contributions
 year+0.046
abel+0.036
ie+0.030
ara+0.029
heart+0.029
Neg logit contributions
anks-0.059
uts-0.059
 disl-0.057
wheel-0.057
vals-0.056
Token bedroom
Token ID14043
Feature activation-3.51e-02

Pos logit contributions
 year+0.005
abel+0.005
aw+0.004
heart+0.004
Sun+0.004
Neg logit contributions
anks-0.005
 disl-0.005
wheel-0.005
vell-0.005
 supposed-0.005
Token lived
Token ID5615
Feature activation-6.67e-02

Pos logit contributions
 year+0.048
abel+0.038
 agreement+0.034
 Amen+0.034
One+0.033
Neg logit contributions
uts-0.072
anks-0.071
mons-0.067
vals-0.067
 disl-0.066
Token a
Token ID257
Feature activation-1.63e-03

Pos logit contributions
 year+0.123
abel+0.106
heart+0.097
ie+0.096
ara+0.093
Neg logit contributions
anks-0.108
wheel-0.106
vell-0.106
uts-0.106
 supposed-0.099
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.002
abel+0.002
ie+0.002
heart+0.002
 awoke+0.002
Neg logit contributions
uts-0.003
wheel-0.003
vell-0.003
 disl-0.003
anks-0.003
Token year
Token ID614
Feature activation-1.17e-01

Pos logit contributions
ara+0.067
 again+0.059
abel+0.057
 awoke+0.056
One+0.050
Neg logit contributions
uts-0.479
 disl-0.464
 supposed-0.440
wheel-0.436
vell-0.435
Token old
Token ID1468
Feature activation-3.87e-03

Pos logit contributions
 year+0.188
abel+0.147
ara+0.125
ie+0.123
 awoke+0.122
Neg logit contributions
anks-0.241
uts-0.237
 disl-0.229
wheel-0.229
vals-0.220
Token child
Token ID1200
Feature activation-7.46e-02

Pos logit contributions
 year+0.006
abel+0.005
One+0.004
 awoke+0.004
heart+0.004
Neg logit contributions
uts-0.007
anks-0.007
mons-0.007
 disl-0.007
wheel-0.006
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
 year+0.108
abel+0.083
ara+0.078
 awoke+0.075
heart+0.073
Neg logit contributions
uts-0.152
anks-0.148
wheel-0.144
 disl-0.141
vals-0.139
Token their
Token ID511
Feature activation-8.87e-03

Pos logit contributions
 year+0.031
abel+0.024
ara+0.022
 awoke+0.019
 barn+0.019
Neg logit contributions
anks-0.038
 disl-0.038
uts-0.037
wheel-0.036
 supposed-0.034
Token loudly
Token ID23112
Feature activation-8.58e-02

Pos logit contributions
 year+0.034
abel+0.026
ie+0.022
ara+0.021
 barn+0.021
Neg logit contributions
anks-0.049
uts-0.048
wheel-0.047
 disl-0.046
vals-0.046
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
 year+0.113
abel+0.079
ie+0.064
 barn+0.064
 more+0.063
Neg logit contributions
anks-0.206
uts-0.200
wheel-0.198
 disl-0.197
vals-0.194
Token\n
Token ID198
Feature activation-2.48e-02

Pos logit contributions
 year+0.045
abel+0.035
ie+0.032
ara+0.030
 barn+0.030
Neg logit contributions
anks-0.048
uts-0.047
wheel-0.046
 disl-0.045
vals-0.044
Token\n
Token ID198
Feature activation-2.32e-02

Pos logit contributions
 year+0.045
abel+0.040
 barn+0.037
heart+0.037
ie+0.037
Neg logit contributions
uts-0.041
anks-0.036
wheel-0.036
 disl-0.034
issors-0.034
TokenA
Token ID32
Feature activation-1.60e-02

Pos logit contributions
 year+0.034
abel+0.027
ie+0.025
 more+0.023
 barn+0.022
Neg logit contributions
uts-0.050
wheel-0.048
 disl-0.048
anks-0.046
issors-0.045
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.017
abel+0.015
ie+0.011
 awoke+0.010
 barn+0.010
Neg logit contributions
wheel-0.038
uts-0.037
 disl-0.036
anks-0.036
vell-0.036
Token-
Token ID12
Feature activation-2.67e-03

Pos logit contributions
abel+0.070
 pasture+0.067
ara+0.066
 barn+0.061
One+0.059
Neg logit contributions
uts-0.458
 disl-0.451
wheel-0.442
mons-0.427
cks-0.418
Tokenyear
Token ID1941
Feature activation-1.29e-01

Pos logit contributions
 year+0.004
abel+0.003
ara+0.002
 barn+0.002
 restaurant+0.002
Neg logit contributions
anks-0.006
wheel-0.006
uts-0.006
 disl-0.006
vals-0.006
Token-
Token ID12
Feature activation-2.81e-03

Pos logit contributions
 year+0.143
abel+0.110
Will+0.093
ara+0.092
 awoke+0.089
Neg logit contributions
uts-0.321
anks-0.317
wheel-0.302
 disl-0.299
alks-0.295
Tokenold
Token ID727
Feature activation+1.06e-02

Pos logit contributions
 year+0.005
abel+0.004
 barn+0.003
ara+0.003
 awoke+0.003
Neg logit contributions
anks-0.005
uts-0.005
wheel-0.005
 disl-0.005
vals-0.005
Token toddler
Token ID30773
Feature activation-1.20e-01

Pos logit contributions
anks+0.021
 disl+0.021
wheel+0.021
uts+0.020
mons+0.020
Neg logit contributions
 year-0.015
abel-0.011
One-0.010
 pasture-0.010
 barn-0.009
Token<|endoftext|>
Token ID50256
Feature activation-4.97e-02

Pos logit contributions
anks+0.041
uts+0.040
wheel+0.039
vals+0.038
 disl+0.038
Neg logit contributions
 year-0.031
abel-0.023
ara-0.020
ie-0.019
 barn-0.019
TokenOne
Token ID3198
Feature activation-1.39e-01

Pos logit contributions
 year+0.075
abel+0.056
ie+0.048
 barn+0.046
ara+0.046
Neg logit contributions
anks-0.109
uts-0.106
wheel-0.104
 disl-0.102
vals-0.101
Token day
Token ID1110
Feature activation-1.58e-02

Pos logit contributions
 year+0.222
abel+0.186
 barn+0.165
ara+0.160
 awoke+0.154
Neg logit contributions
wheel-0.259
uts-0.255
 disl-0.254
anks-0.252
vals-0.241
Token,
Token ID11
Feature activation-4.07e-02

Pos logit contributions
 year+0.014
abel+0.007
ie+0.005
ara+0.004
 me+0.004
Neg logit contributions
anks-0.047
 disl-0.045
wheel-0.045
uts-0.044
vals-0.044
Token a
Token ID257
Feature activation-4.07e-03

Pos logit contributions
 year+0.064
abel+0.048
ie+0.041
 barn+0.040
ara+0.040
Neg logit contributions
anks-0.087
uts-0.086
wheel-0.084
 disl-0.083
vals-0.081
Token 3
Token ID513
Feature activation-1.80e-01

Pos logit contributions
 year+0.005
abel+0.004
 awoke+0.003
ie+0.003
heart+0.003
Neg logit contributions
uts-0.009
wheel-0.009
 disl-0.009
vell-0.009
anks-0.009
Token year
Token ID614
Feature activation-1.22e-01

Pos logit contributions
abel+0.071
One+0.068
 again+0.063
 pasture+0.063
 awoke+0.063
Neg logit contributions
 disl-0.447
uts-0.442
wheel-0.430
mons-0.430
pre-0.421
Token old
Token ID1468
Feature activation-5.45e-03

Pos logit contributions
 year+0.186
abel+0.141
ie+0.118
ara+0.117
 barn+0.116
Neg logit contributions
anks-0.264
uts-0.260
wheel-0.253
 disl-0.251
vals-0.245
Token girl
Token ID2576
Feature activation-5.29e-02

Pos logit contributions
 year+0.008
abel+0.006
One+0.005
 awoke+0.005
 more+0.005
Neg logit contributions
 disl-0.011
uts-0.010
wheel-0.010
anks-0.010
mons-0.010
Token named
Token ID3706
Feature activation-3.76e-02

Pos logit contributions
 year+0.071
abel+0.052
 more+0.042
 barn+0.042
ie+0.042
Neg logit contributions
anks-0.117
uts-0.115
 disl-0.115
wheel-0.113
vals-0.113
Token Sally
Token ID25737
Feature activation+6.20e-03

Pos logit contributions
 year+0.063
abel+0.048
ie+0.041
 barn+0.040
ara+0.040
Neg logit contributions
anks-0.078
uts-0.076
wheel-0.075
 disl-0.074
vals-0.072
Token the
Token ID262
Feature activation-2.65e-03

Pos logit contributions
 year+0.001
abel+0.000
 more+0.000
ie+0.000
 me+0.000
Neg logit contributions
anks-0.001
uts-0.001
vals-0.001
 disl-0.001
wheel-0.001
Token teacher
Token ID4701
Feature activation-3.00e-03

Pos logit contributions
 year+0.005
abel+0.004
 more+0.003
 barn+0.003
ie+0.003
Neg logit contributions
anks-0.005
uts-0.005
 disl-0.005
wheel-0.005
vals-0.005
Token's
Token ID338
Feature activation-3.05e-02

Pos logit contributions
 year+0.004
abel+0.003
ie+0.003
 more+0.003
ara+0.003
Neg logit contributions
anks-0.007
uts-0.007
 disl-0.007
vals-0.006
wheel-0.006
Token sho
Token ID7354
Feature activation+3.55e-03

Pos logit contributions
 year+0.054
abel+0.041
 more+0.038
 shop+0.037
 me+0.036
Neg logit contributions
anks-0.058
uts-0.057
 disl-0.056
vals-0.056
wheel-0.055
Tokenel
Token ID417
Feature activation+3.18e-02

Pos logit contributions
anks+0.007
uts+0.007
wheel+0.007
issors+0.006
 disl+0.006
Neg logit contributions
 year-0.006
abel-0.005
 restaurant-0.005
 barn-0.004
 more-0.004
Tokenaces
Token ID2114
Feature activation+1.90e-01

Pos logit contributions
 disl+0.023
anks+0.022
uts+0.021
wheel+0.019
vals+0.018
Neg logit contributions
 year-0.094
abel-0.081
ie-0.079
 more-0.078
 me-0.077
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
anks+0.450
 disl+0.446
vals+0.432
uts+0.427
wheel+0.417
Neg logit contributions
 year-0.259
abel-0.177
 more-0.156
ie-0.156
 again-0.143
Token Instead
Token ID5455
Feature activation-2.91e-02

Pos logit contributions
 year+0.021
abel+0.016
ie+0.013
ara+0.013
 barn+0.013
Neg logit contributions
anks-0.031
uts-0.030
wheel-0.030
 disl-0.029
vals-0.028
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
 year+0.040
abel+0.030
 awoke+0.026
 restaurant+0.026
 Amen+0.026
Neg logit contributions
uts-0.065
anks-0.063
wheel-0.062
issors-0.057
vals-0.057
Token she
Token ID673
Feature activation-1.26e-02

Pos logit contributions
 year+0.030
abel+0.025
ara+0.022
ie+0.022
 awoke+0.022
Neg logit contributions
uts-0.026
anks-0.026
wheel-0.026
 disl-0.025
mons-0.025
Token pe
Token ID613
Feature activation+7.62e-02

Pos logit contributions
 year+0.023
abel+0.018
ie+0.016
ara+0.015
 barn+0.015
Neg logit contributions
anks-0.024
uts-0.023
 disl-0.023
wheel-0.023
vals-0.022
Token but
Token ID475
Feature activation-3.97e-04

Pos logit contributions
 year+0.004
abel+0.003
 more+0.003
 barn+0.003
 me+0.003
Neg logit contributions
anks-0.005
uts-0.005
vals-0.005
 disl-0.005
wheel-0.005
Token then
Token ID788
Feature activation-1.34e-02

Pos logit contributions
 year+0.001
abel+0.001
ara+0.001
 barn+0.001
 restaurant+0.001
Neg logit contributions
wheel-0.001
uts-0.001
anks-0.001
 disl-0.001
mons-0.001
Token she
Token ID673
Feature activation+2.78e-03

Pos logit contributions
 year+0.025
abel+0.019
ie+0.017
 barn+0.017
ara+0.017
Neg logit contributions
anks-0.024
uts-0.024
wheel-0.023
 disl-0.023
vals-0.022
Token remembered
Token ID12086
Feature activation-2.10e-02

Pos logit contributions
uts+0.005
wheel+0.005
anks+0.005
vell+0.005
vals+0.005
Neg logit contributions
 year-0.005
abel-0.003
 barn-0.003
 restaurant-0.003
heart-0.003
Token the
Token ID262
Feature activation+2.02e-02

Pos logit contributions
 year+0.037
abel+0.028
ara+0.026
ie+0.024
 barn+0.024
Neg logit contributions
uts-0.040
 disl-0.039
anks-0.039
wheel-0.038
 supposed-0.035
Token chalk
Token ID30860
Feature activation+1.60e-01

Pos logit contributions
anks+0.040
wheel+0.038
uts+0.038
 disl+0.038
vell+0.036
Neg logit contributions
 year-0.034
abel-0.027
ie-0.024
ara-0.024
 awoke-0.023
Token.
Token ID13
Feature activation+2.54e-02

Pos logit contributions
anks+0.360
 disl+0.344
uts+0.343
vals+0.337
wheel+0.335
Neg logit contributions
 year-0.241
abel-0.178
ie-0.151
 more-0.146
 barn-0.143
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
anks+0.055
uts+0.054
wheel+0.053
 disl+0.052
vals+0.051
Neg logit contributions
 year-0.039
abel-0.029
ie-0.025
 barn-0.024
 more-0.024
Token
Token ID198
Feature activation+1.85e-02

Pos logit contributions
uts+0.042
anks+0.041
wheel+0.040
 disl+0.040
issors+0.038
Neg logit contributions
 year-0.035
abel-0.028
ie-0.025
 barn-0.024
 restaurant-0.024
TokenShe
Token ID3347
Feature activation+1.65e-02

Pos logit contributions
uts+0.041
wheel+0.040
 disl+0.039
anks+0.039
vals+0.037
Neg logit contributions
 year-0.027
abel-0.021
ie-0.019
 barn-0.018
 restaurant-0.017
Token scoop
Token ID23529
Feature activation-2.85e-02

Pos logit contributions
wheel+0.027
uts+0.026
anks+0.026
 disl+0.025
vals+0.024
Neg logit contributions
 year-0.031
abel-0.023
 restaurant-0.023
ie-0.022
 barn-0.022
Token a
Token ID257
Feature activation+2.39e-02

Pos logit contributions
 year+0.015
abel+0.011
 more+0.011
 barn+0.010
 me+0.009
Neg logit contributions
anks-0.028
 disl-0.027
uts-0.027
wheel-0.027
vals-0.026
Token marble
Token ID30623
Feature activation+1.04e-01

Pos logit contributions
anks+0.045
uts+0.043
wheel+0.042
 disl+0.042
vals+0.041
Neg logit contributions
 year-0.044
abel-0.035
ie-0.031
ara-0.030
 barn-0.030
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
anks+0.264
 disl+0.264
uts+0.248
vals+0.246
issors+0.244
Neg logit contributions
 year-0.130
abel-0.085
 more-0.076
ie-0.070
 shop-0.068
Token He
Token ID679
Feature activation+4.20e-04

Pos logit contributions
anks+0.035
vals+0.034
 disl+0.034
wheel+0.034
uts+0.033
Neg logit contributions
 year-0.020
abel-0.014
ara-0.013
 shop-0.013
 more-0.012
Token liked
Token ID8288
Feature activation-5.82e-02

Pos logit contributions
anks+0.001
uts+0.001
wheel+0.001
 disl+0.001
vals+0.001
Neg logit contributions
 year-0.001
abel-0.000
ie-0.000
ara-0.000
 barn-0.000
Token mar
Token ID1667
Feature activation+2.12e-01

Pos logit contributions
 year+0.078
abel+0.055
 more+0.052
 me+0.048
ie+0.048
Neg logit contributions
anks-0.139
uts-0.134
wheel-0.133
vals-0.132
 disl-0.131
Tokenbles
Token ID7689
Feature activation+2.12e-01

Pos logit contributions
 disl+0.253
ses+0.201
 supervision+0.194
anks+0.192
 supposed+0.185
Neg logit contributions
 year-0.539
ie-0.470
abel-0.454
ara-0.439
--0.436
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
anks+0.586
 disl+0.573
vals+0.562
issors+0.560
wheel+0.552
Neg logit contributions
 year-0.205
 more-0.122
abel-0.113
ie-0.094
 again-0.093
Token He
Token ID679
Feature activation+3.13e-04

Pos logit contributions
anks+0.038
vals+0.037
wheel+0.037
 disl+0.037
uts+0.036
Neg logit contributions
 year-0.019
abel-0.013
ara-0.012
 shop-0.011
 more-0.011
Token took
Token ID1718
Feature activation+1.10e-02

Pos logit contributions
anks+0.001
uts+0.001
wheel+0.001
 disl+0.001
vals+0.001
Neg logit contributions
 year-0.001
abel-0.000
ie-0.000
ara-0.000
 more-0.000
Token it
Token ID340
Feature activation-3.96e-02

Pos logit contributions
uts+0.028
anks+0.027
 disl+0.027
wheel+0.026
vals+0.026
Neg logit contributions
 year-0.013
abel-0.009
 more-0.009
 me-0.008
ie-0.008
Token him
Token ID683
Feature activation+2.09e-02

Pos logit contributions
 year+0.054
abel+0.040
ie+0.034
ara+0.033
 barn+0.033
Neg logit contributions
anks-0.078
uts-0.076
wheel-0.075
 disl-0.074
vals-0.073
Token.
Token ID13
Feature activation+2.57e-02

Pos logit contributions
vals+0.053
anks+0.052
 disl+0.052
mons+0.051
issors+0.051
Neg logit contributions
 year-0.024
 more-0.016
 again-0.015
ie-0.014
abel-0.014
Token They
Token ID1119
Feature activation-9.76e-03

Pos logit contributions
anks+0.055
uts+0.053
wheel+0.053
 disl+0.052
vell+0.052
Neg logit contributions
 year-0.041
abel-0.030
 barn-0.026
ara-0.026
 more-0.026
Token each
Token ID1123
Feature activation-8.05e-02

Pos logit contributions
 year+0.018
abel+0.013
ara+0.013
 restaurant+0.012
heart+0.012
Neg logit contributions
wheel-0.018
anks-0.017
uts-0.017
vell-0.017
 disl-0.017
Token took
Token ID1718
Feature activation+1.57e-02

Pos logit contributions
 year+0.150
abel+0.138
 barn+0.128
unt+0.126
 restaurant+0.124
Neg logit contributions
wheel-0.125
 unst-0.103
 cheated-0.098
 supposed-0.098
vell-0.097
Token turns
Token ID4962
Feature activation+1.54e-01

Pos logit contributions
anks+0.035
uts+0.034
 disl+0.033
wheel+0.033
vals+0.033
Neg logit contributions
 year-0.024
abel-0.018
 more-0.016
ie-0.015
 me-0.014
Token trying
Token ID2111
Feature activation+3.95e-02

Pos logit contributions
anks+0.334
vals+0.317
issors+0.314
 disl+0.314
vell+0.303
Neg logit contributions
 year-0.251
 more-0.190
 again-0.188
 shop-0.164
abel-0.164
Token to
Token ID284
Feature activation-1.79e-03

Pos logit contributions
anks+0.086
uts+0.085
wheel+0.082
 disl+0.082
vals+0.079
Neg logit contributions
 year-0.061
abel-0.046
ara-0.039
ie-0.039
 barn-0.037
Token get
Token ID651
Feature activation+4.23e-02

Pos logit contributions
 year+0.003
 more+0.003
 me+0.003
ie+0.003
abel+0.003
Neg logit contributions
anks-0.003
uts-0.003
vals-0.003
mons-0.003
issors-0.003
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
uts+0.116
anks+0.114
vals+0.111
issors+0.108
vell+0.108
Neg logit contributions
 year-0.041
 more-0.034
 again-0.028
 me-0.025
ie-0.022
Token laser
Token ID12855
Feature activation+1.34e-01

Pos logit contributions
uts+0.034
anks+0.031
mons+0.029
vals+0.028
 disl+0.028
Neg logit contributions
 year-0.034
 little-0.028
 more-0.028
 me-0.027
 end-0.027
Token day
Token ID1110
Feature activation+9.51e-03

Pos logit contributions
anks+0.035
uts+0.034
wheel+0.034
 disl+0.034
vals+0.033
Neg logit contributions
 year-0.028
abel-0.022
ie-0.019
ara-0.019
 barn-0.018
Token she
Token ID673
Feature activation-7.00e-04

Pos logit contributions
anks+0.020
uts+0.020
wheel+0.019
 disl+0.019
vals+0.018
Neg logit contributions
 year-0.015
abel-0.011
ara-0.010
 barn-0.010
ie-0.010
Token got
Token ID1392
Feature activation+4.02e-02

Pos logit contributions
 year+0.001
abel+0.001
ie+0.001
ara+0.001
 barn+0.001
Neg logit contributions
anks-0.001
uts-0.001
wheel-0.001
 disl-0.001
vals-0.001
Token her
Token ID607
Feature activation+2.46e-02

Pos logit contributions
anks+0.092
uts+0.090
 disl+0.088
wheel+0.088
vals+0.087
Neg logit contributions
 year-0.059
abel-0.043
 more-0.039
ie-0.037
 me-0.035
Token past
Token ID1613
Feature activation+9.71e-03

Pos logit contributions
anks+0.051
uts+0.051
wheel+0.050
 disl+0.048
vals+0.048
Neg logit contributions
 year-0.042
abel-0.031
ie-0.027
 more-0.027
 barn-0.027
Tokenels
Token ID1424
Feature activation+1.93e-01

Pos logit contributions
anks+0.020
uts+0.020
wheel+0.018
 disl+0.018
vals+0.018
Neg logit contributions
 year-0.015
abel-0.011
ara-0.010
 awoke-0.010
 barn-0.010
Token.
Token ID13
Feature activation+2.53e-02

Pos logit contributions
anks+0.466
uts+0.452
 disl+0.448
wheel+0.444
vals+0.439
Neg logit contributions
 year-0.254
abel-0.178
ie-0.148
 more-0.141
 barn-0.139
Token She
Token ID1375
Feature activation-5.88e-04

Pos logit contributions
anks+0.058
uts+0.056
wheel+0.055
 disl+0.055
vals+0.054
Neg logit contributions
 year-0.036
abel-0.027
ie-0.022
ara-0.022
 barn-0.021
Token always
Token ID1464
Feature activation-4.28e-02

Pos logit contributions
 year+0.001
abel+0.001
ie+0.001
 barn+0.001
ara+0.001
Neg logit contributions
anks-0.001
uts-0.001
wheel-0.001
 disl-0.001
vals-0.001
Token tells
Token ID4952
Feature activation-2.75e-02

Pos logit contributions
 year+0.078
abel+0.060
 barn+0.053
ara+0.053
ie+0.052
Neg logit contributions
anks-0.080
 disl-0.077
uts-0.077
wheel-0.077
vals-0.074
Token everyone
Token ID2506
Feature activation+2.84e-03

Pos logit contributions
 year+0.040
abel+0.028
ara+0.025
 barn+0.024
 awoke+0.024
Neg logit contributions
anks-0.062
uts-0.061
wheel-0.059
 disl-0.059
vals-0.057
Token that
Token ID326
Feature activation+3.06e-02

Pos logit contributions
anks+0.086
uts+0.084
wheel+0.080
 disl+0.079
vals+0.076
Neg logit contributions
 year-0.080
abel-0.064
ara-0.056
ie-0.054
 barn-0.054
Token he
Token ID339
Feature activation-2.27e-03

Pos logit contributions
anks+0.065
uts+0.063
wheel+0.061
 disl+0.061
vals+0.060
Neg logit contributions
 year-0.050
abel-0.037
 more-0.032
ie-0.032
 barn-0.032
Token had
Token ID550
Feature activation+5.68e-02

Pos logit contributions
 year+0.004
abel+0.003
ara+0.003
 restaurant+0.003
 barn+0.003
Neg logit contributions
anks-0.004
wheel-0.004
uts-0.004
vell-0.004
 disl-0.004
Token trusted
Token ID13467
Feature activation-3.32e-02

Pos logit contributions
anks+0.097
uts+0.094
wheel+0.091
 disl+0.090
vals+0.088
Neg logit contributions
 year-0.115
abel-0.093
ie-0.083
ara-0.081
 barn-0.081
Token his
Token ID465
Feature activation+1.74e-02

Pos logit contributions
 year+0.051
abel+0.038
ie+0.032
ara+0.032
 barn+0.031
Neg logit contributions
anks-0.073
uts-0.071
wheel-0.069
 disl-0.069
vals-0.067
Token friends
Token ID2460
Feature activation+7.69e-02

Pos logit contributions
anks+0.037
uts+0.036
 disl+0.035
wheel+0.035
vell+0.034
Neg logit contributions
 year-0.027
abel-0.021
ara-0.018
ie-0.018
 awoke-0.017
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
uts+0.167
anks+0.165
wheel+0.160
vell+0.154
 disl+0.153
Neg logit contributions
 year-0.109
abel-0.085
ara-0.079
ie-0.071
 barn-0.069
Token they
Token ID484
Feature activation+1.21e-02

Pos logit contributions
anks+0.035
uts+0.034
wheel+0.034
 disl+0.033
vals+0.032
Neg logit contributions
 year-0.030
abel-0.024
ara-0.021
ie-0.020
 barn-0.020
Token had
Token ID550
Feature activation+5.89e-02

Pos logit contributions
anks+0.025
uts+0.024
wheel+0.024
vell+0.023
 disl+0.023
Neg logit contributions
 year-0.020
abel-0.015
ara-0.013
 restaurant-0.013
 barn-0.012
Token trusted
Token ID13467
Feature activation-3.00e-02

Pos logit contributions
anks+0.100
uts+0.096
wheel+0.094
 disl+0.091
 supposed+0.091
Neg logit contributions
 year-0.115
abel-0.095
ie-0.086
ara-0.085
heart-0.082
Token him
Token ID683
Feature activation+2.52e-02

Pos logit contributions
 year+0.045
abel+0.034
ie+0.028
ara+0.028
 barn+0.028
Neg logit contributions
anks-0.066
uts-0.065
wheel-0.063
 disl-0.063
vals-0.062
Token dangerous
Token ID4923
Feature activation-9.85e-03

Pos logit contributions
 year+0.040
 more+0.036
 me+0.033
 barn+0.027
 one+0.027
Neg logit contributions
vals-0.067
uts-0.067
anks-0.067
vell-0.065
mons-0.065
Token!"
Token ID2474
Feature activation-1.18e-02

Pos logit contributions
 year+0.013
abel+0.008
ie+0.008
 me+0.008
 more+0.008
Neg logit contributions
anks-0.023
vals-0.023
wheel-0.023
 disl-0.023
uts-0.023
Token But
Token ID887
Feature activation-1.35e-02

Pos logit contributions
 year+0.017
abel+0.011
ara+0.010
 me+0.010
ie+0.010
Neg logit contributions
anks-0.027
vell-0.027
vals-0.026
uts-0.026
wheel-0.026
Token Tim
Token ID5045
Feature activation+6.57e-02

Pos logit contributions
 year+0.029
abel+0.024
ie+0.021
 barn+0.021
ara+0.021
Neg logit contributions
anks-0.021
uts-0.021
wheel-0.020
 disl-0.020
vals-0.019
Token didn
Token ID1422
Feature activation+6.22e-02

Pos logit contributions
anks+0.122
uts+0.119
wheel+0.118
 disl+0.113
vals+0.110
Neg logit contributions
 year-0.121
abel-0.094
 barn-0.083
 restaurant-0.083
ie-0.081
Token't
Token ID470
Feature activation+4.72e-02

Pos logit contributions
 disl+0.105
anks+0.104
vals+0.102
vell+0.098
mons+0.096
Neg logit contributions
 year-0.128
abel-0.098
One-0.094
 more-0.092
 me-0.090
Token listen
Token ID6004
Feature activation-1.36e-01

Pos logit contributions
anks+0.104
vals+0.100
uts+0.099
alks+0.098
mons+0.097
Neg logit contributions
 year-0.072
 more-0.053
ie-0.050
abel-0.046
 again-0.045
Token and
Token ID290
Feature activation-6.84e-03

Pos logit contributions
 year+0.191
abel+0.149
 barn+0.125
ara+0.123
 awoke+0.117
Neg logit contributions
uts-0.312
anks-0.304
wheel-0.295
 disl-0.287
issors-0.273
Token picked
Token ID6497
Feature activation+9.83e-03

Pos logit contributions
 year+0.010
abel+0.008
 more+0.008
ie+0.007
 again+0.007
Neg logit contributions
anks-0.015
uts-0.015
vals-0.014
issors-0.014
 disl-0.014
Token up
Token ID510
Feature activation-1.53e-02

Pos logit contributions
uts+0.023
vals+0.022
anks+0.022
 disl+0.022
wheel+0.022
Neg logit contributions
 year-0.013
 more-0.010
abel-0.009
 again-0.009
ie-0.009
Token the
Token ID262
Feature activation+2.58e-03

Pos logit contributions
 year+0.023
abel+0.017
ie+0.014
ara+0.014
 barn+0.014
Neg logit contributions
anks-0.034
uts-0.033
wheel-0.033
 disl-0.032
vals-0.032
Token new
Token ID649
Feature activation-5.28e-02

Pos logit contributions
anks+0.033
uts+0.032
wheel+0.032
 disl+0.031
vals+0.030
Neg logit contributions
 year-0.025
abel-0.019
ie-0.017
ara-0.016
 barn-0.015
Token toy
Token ID13373
Feature activation+3.11e-02

Pos logit contributions
 year+0.087
abel+0.072
ara+0.063
ie+0.062
heart+0.058
Neg logit contributions
anks-0.099
wheel-0.097
 disl-0.096
uts-0.095
vals-0.089
Token to
Token ID284
Feature activation-1.28e-02

Pos logit contributions
anks+0.076
 disl+0.075
uts+0.074
wheel+0.073
vals+0.073
Neg logit contributions
 year-0.039
abel-0.027
 more-0.023
 barn-0.023
ie-0.022
Token c
Token ID269
Feature activation-2.68e-02

Pos logit contributions
 year+0.029
abel+0.025
ara+0.023
heart+0.022
 barn+0.022
Neg logit contributions
wheel-0.017
anks-0.017
 disl-0.017
 bleed-0.015
uts-0.015
Tokenuddle
Token ID24500
Feature activation+4.07e-02

Pos logit contributions
 year+0.055
abel+0.045
 more+0.042
 awoke+0.042
Will+0.042
Neg logit contributions
anks-0.046
uts-0.045
wheel-0.038
 disl-0.038
vell-0.038
Token at
Token ID379
Feature activation+4.94e-02

Pos logit contributions
anks+0.087
uts+0.085
wheel+0.083
 disl+0.082
vals+0.080
Neg logit contributions
 year-0.064
abel-0.049
ie-0.041
ara-0.041
 barn-0.040
Token night
Token ID1755
Feature activation-2.15e-02

Pos logit contributions
anks+0.107
uts+0.102
wheel+0.102
 disl+0.100
vals+0.097
Neg logit contributions
 year-0.076
abel-0.058
ara-0.048
ie-0.046
heart-0.046
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
 year+0.030
abel+0.022
ie+0.018
ara+0.018
 barn+0.018
Neg logit contributions
anks-0.050
uts-0.049
wheel-0.048
 disl-0.047
vals-0.046
Token and
Token ID290
Feature activation-4.79e-03

Pos logit contributions
 year+0.026
abel+0.019
ie+0.015
 barn+0.015
 more+0.015
Neg logit contributions
anks-0.043
uts-0.042
wheel-0.041
 disl-0.041
vals-0.040
Token told
Token ID1297
Feature activation-4.94e-02

Pos logit contributions
 year+0.009
abel+0.007
ara+0.006
ie+0.006
 barn+0.006
Neg logit contributions
anks-0.009
uts-0.009
wheel-0.009
 disl-0.009
vals-0.008
Token him
Token ID683
Feature activation+1.36e-02

Pos logit contributions
 year+0.088
abel+0.066
ara+0.062
 awoke+0.059
 barn+0.059
Neg logit contributions
anks-0.094
 disl-0.092
uts-0.091
wheel-0.089
issors-0.084
Token both
Token ID1111
Feature activation+3.61e-02

Pos logit contributions
 year+0.004
abel+0.003
heart+0.003
ara+0.003
unt+0.003
Neg logit contributions
wheel-0.003
anks-0.003
uts-0.003
vals-0.003
vell-0.003
Token sat
Token ID3332
Feature activation-5.47e-02

Pos logit contributions
anks+0.059
wheel+0.059
uts+0.058
 disl+0.057
vals+0.057
Neg logit contributions
 year-0.070
abel-0.053
 barn-0.050
ara-0.050
heart-0.049
Token down
Token ID866
Feature activation+2.54e-02

Pos logit contributions
 year+0.129
abel+0.116
unt+0.105
One+0.102
 awoke+0.101
Neg logit contributions
anks-0.070
mons-0.063
bles-0.061
uts-0.060
vell-0.057
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
anks+0.052
uts+0.050
wheel+0.049
 disl+0.048
vals+0.047
Neg logit contributions
 year-0.042
abel-0.033
ie-0.028
ara-0.028
 barn-0.027
Token Sam
Token ID3409
Feature activation+5.00e-02

Pos logit contributions
vell+0.037
anks+0.037
uts+0.036
wheel+0.036
vals+0.035
Neg logit contributions
 year-0.026
abel-0.019
 barn-0.017
ara-0.017
 restaurant-0.017
Token couldn
Token ID3521
Feature activation+9.06e-02

Pos logit contributions
uts+0.098
anks+0.096
wheel+0.096
 disl+0.094
vals+0.091
Neg logit contributions
 year-0.082
abel-0.058
 barn-0.053
ie-0.053
heart-0.052
Token't
Token ID470
Feature activation+5.94e-02

Pos logit contributions
anks+0.113
uts+0.105
 disl+0.103
wheel+0.102
vals+0.099
Neg logit contributions
 year-0.226
abel-0.190
ie-0.173
ara-0.173
 more-0.171
Token wait
Token ID4043
Feature activation-8.04e-02

Pos logit contributions
anks+0.093
uts+0.089
wheel+0.086
 disl+0.085
vals+0.083
Neg logit contributions
 year-0.129
abel-0.106
ie-0.096
 more-0.094
ara-0.094
Token to
Token ID284
Feature activation-6.39e-03

Pos logit contributions
 year+0.129
abel+0.099
ie+0.084
ara+0.084
 barn+0.083
Neg logit contributions
anks-0.168
uts-0.164
wheel-0.160
 disl-0.158
vals-0.155
Token eat
Token ID4483
Feature activation-5.70e-02

Pos logit contributions
 year+0.012
abel+0.009
ie+0.008
 more+0.008
 me+0.008
Neg logit contributions
anks-0.012
uts-0.012
wheel-0.011
 disl-0.011
vals-0.011
Token his
Token ID465
Feature activation+8.20e-03

Pos logit contributions
 year+0.097
abel+0.077
 awoke+0.065
unt+0.065
 barn+0.064
Neg logit contributions
anks-0.111
uts-0.107
wheel-0.106
 disl-0.104
mons-0.100
Token said
Token ID531
Feature activation+3.10e-02

Pos logit contributions
 year+0.043
abel+0.034
 barn+0.031
 restaurant+0.031
ara+0.031
Neg logit contributions
uts-0.035
anks-0.035
wheel-0.034
 disl-0.033
vals-0.032
Token.
Token ID13
Feature activation+1.57e-03

Pos logit contributions
anks+0.073
uts+0.071
wheel+0.070
 disl+0.069
vals+0.068
Neg logit contributions
 year-0.043
abel-0.031
ie-0.025
ara-0.025
 barn-0.025
Token "
Token ID366
Feature activation-4.26e-02

Pos logit contributions
anks+0.003
uts+0.003
wheel+0.003
 disl+0.003
vals+0.003
Neg logit contributions
 year-0.003
abel-0.002
ie-0.002
 restaurant-0.002
 barn-0.002
TokenI
Token ID40
Feature activation-2.35e-02

Pos logit contributions
 year+0.070
abel+0.049
One+0.049
Will+0.049
M+0.047
Neg logit contributions
anks-0.095
uts-0.085
vell-0.084
vals-0.084
issors-0.084
Token came
Token ID1625
Feature activation+1.56e-02

Pos logit contributions
 year+0.047
abel+0.038
ie+0.033
 more+0.033
ara+0.033
Neg logit contributions
anks-0.042
uts-0.041
wheel-0.039
issors-0.038
vals-0.038
Token here
Token ID994
Feature activation+6.64e-02

Pos logit contributions
anks+0.035
uts+0.035
wheel+0.034
 disl+0.034
vals+0.034
Neg logit contributions
 year-0.023
abel-0.016
ie-0.015
 more-0.015
 me-0.014
Token to
Token ID284
Feature activation-2.01e-02

Pos logit contributions
anks+0.137
uts+0.136
wheel+0.130
 disl+0.126
vell+0.122
Neg logit contributions
 year-0.102
abel-0.082
ara-0.070
ie-0.069
 barn-0.068
Token find
Token ID1064
Feature activation+5.00e-02

Pos logit contributions
 year+0.037
abel+0.029
ie+0.026
 more+0.025
 barn+0.025
Neg logit contributions
anks-0.038
uts-0.037
wheel-0.036
vals-0.035
 disl-0.035
Token some
Token ID617
Feature activation-9.28e-02

Pos logit contributions
anks+0.082
 disl+0.080
uts+0.077
wheel+0.076
vell+0.074
Neg logit contributions
 year-0.098
abel-0.078
ara-0.070
 awoke-0.069
 restaurant-0.068
Token food
Token ID2057
Feature activation-6.70e-02

Pos logit contributions
 year+0.149
abel+0.125
ara+0.105
 awoke+0.100
ie+0.099
Neg logit contributions
anks-0.188
uts-0.181
 disl-0.179
wheel-0.178
vals-0.174
Token.
Token ID13
Feature activation+8.18e-03

Pos logit contributions
 year+0.093
abel+0.066
ie+0.056
 more+0.055
 barn+0.054
Neg logit contributions
anks-0.156
uts-0.151
wheel-0.150
 disl-0.149
vals-0.146
Token sorry
Token ID7926
Feature activation+3.91e-03

Pos logit contributions
anks+0.007
vals+0.007
uts+0.007
mons+0.006
issors+0.006
Neg logit contributions
 year-0.007
abel-0.006
 more-0.005
 me-0.005
 shop-0.005
Token I
Token ID314
Feature activation-9.95e-03

Pos logit contributions
anks+0.007
uts+0.007
wheel+0.006
 disl+0.006
vals+0.006
Neg logit contributions
 year-0.008
abel-0.007
 awoke-0.006
 barn-0.006
 restaurant-0.006
Token broke
Token ID6265
Feature activation+5.70e-02

Pos logit contributions
 year+0.016
abel+0.012
 woke+0.011
ie+0.011
 awoke+0.010
Neg logit contributions
anks-0.021
alks-0.021
issors-0.021
vals-0.020
uts-0.020
Token your
Token ID534
Feature activation-1.67e-02

Pos logit contributions
anks+0.124
uts+0.120
wheel+0.119
 disl+0.117
vals+0.116
Neg logit contributions
 year-0.089
abel-0.066
ie-0.057
 more-0.056
ara-0.056
Token car
Token ID1097
Feature activation+1.29e-02

Pos logit contributions
 year+0.027
 barn+0.018
 more+0.018
abel+0.018
 shop+0.017
Neg logit contributions
uts-0.037
wheel-0.036
anks-0.036
vals-0.035
vell-0.033
Token.
Token ID13
Feature activation+2.53e-02

Pos logit contributions
anks+0.032
 disl+0.031
uts+0.031
vals+0.030
wheel+0.030
Neg logit contributions
 year-0.016
abel-0.011
 more-0.009
ie-0.009
 again-0.009
Token I
Token ID314
Feature activation-1.03e-02

Pos logit contributions
anks+0.047
uts+0.046
wheel+0.045
vell+0.043
vals+0.043
Neg logit contributions
 year-0.046
abel-0.037
ara-0.033
 more-0.032
 barn-0.032
Token love
Token ID1842
Feature activation-3.70e-02

Pos logit contributions
 year+0.017
abel+0.013
ie+0.011
 more+0.011
ara+0.011
Neg logit contributions
anks-0.022
uts-0.021
vals-0.020
wheel-0.020
issors-0.020
Token you
Token ID345
Feature activation+3.57e-02

Pos logit contributions
 year+0.069
abel+0.054
ie+0.050
 me+0.050
 more+0.050
Neg logit contributions
anks-0.069
uts-0.066
wheel-0.066
vell-0.064
vals-0.064
Token."
Token ID526
Feature activation+4.03e-03

Pos logit contributions
anks+0.068
uts+0.067
wheel+0.064
issors+0.060
 disl+0.060
Neg logit contributions
 year-0.064
abel-0.053
ara-0.046
 barn-0.046
 restaurant-0.044
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
anks+0.009
uts+0.009
wheel+0.009
 disl+0.009
vals+0.009
Neg logit contributions
 year-0.006
abel-0.004
ie-0.004
ara-0.004
 barn-0.004
Token!
Token ID0
Feature activation+4.32e-02

Pos logit contributions
 year+0.230
abel+0.177
 barn+0.153
ara+0.149
 awoke+0.148
Neg logit contributions
anks-0.319
uts-0.316
wheel-0.306
 disl-0.292
issors-0.286
Token
Token ID198
Feature activation+1.67e-02

Pos logit contributions
anks+0.105
vell+0.104
wheel+0.104
 disl+0.103
uts+0.101
Neg logit contributions
 year-0.055
abel-0.038
ara-0.032
 shop-0.031
 me-0.029
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
anks+0.036
uts+0.036
wheel+0.035
 disl+0.034
vals+0.034
Neg logit contributions
 year-0.025
abel-0.019
ie-0.016
 barn-0.016
 more-0.016
TokenOnce
Token ID7454
Feature activation+4.97e-02

Pos logit contributions
anks+0.033
uts+0.032
wheel+0.031
 disl+0.031
vals+0.031
Neg logit contributions
 year-0.020
abel-0.015
ie-0.012
ara-0.012
 barn-0.012
Token they
Token ID484
Feature activation-8.69e-03

Pos logit contributions
anks+0.092
uts+0.089
wheel+0.087
 disl+0.085
vals+0.084
Neg logit contributions
 year-0.092
abel-0.072
 barn-0.063
ie-0.062
ara-0.062
Token got
Token ID1392
Feature activation+3.61e-02

Pos logit contributions
 year+0.015
abel+0.012
 barn+0.011
ie+0.010
ara+0.010
Neg logit contributions
anks-0.016
wheel-0.016
uts-0.015
 disl-0.015
vals-0.015
Token home
Token ID1363
Feature activation+4.10e-03

Pos logit contributions
anks+0.079
uts+0.077
wheel+0.075
 disl+0.074
vals+0.073
Neg logit contributions
 year-0.056
abel-0.042
ie-0.036
ara-0.035
 barn-0.035
Token,
Token ID11
Feature activation-9.08e-03

Pos logit contributions
anks+0.010
uts+0.010
wheel+0.010
 disl+0.009
vell+0.009
Neg logit contributions
 year-0.005
abel-0.004
ara-0.003
 barn-0.003
ie-0.003
Token the
Token ID262
Feature activation+9.89e-03

Pos logit contributions
 year+0.015
abel+0.011
ie+0.010
ara+0.010
 barn+0.009
Neg logit contributions
anks-0.019
uts-0.018
wheel-0.018
 disl-0.018
vals-0.017
Token little
Token ID1310
Feature activation-2.51e-02

Pos logit contributions
wheel+0.021
anks+0.021
 disl+0.021
uts+0.021
vell+0.020
Neg logit contributions
 year-0.013
abel-0.011
ie-0.009
heart-0.009
 awoke-0.008
Token boy
Token ID2933
Feature activation-1.30e-02

Pos logit contributions
 year+0.034
abel+0.025
ie+0.020
 barn+0.019
ara+0.019
Neg logit contributions
anks-0.059
uts-0.057
wheel-0.056
 disl-0.056
vals-0.055
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
anks+0.111
uts+0.107
 disl+0.106
wheel+0.106
vals+0.104
Neg logit contributions
 year-0.051
abel-0.034
ie-0.028
 more-0.026
 barn-0.026
Token As
Token ID1081
Feature activation-9.31e-03

Pos logit contributions
anks+0.026
uts+0.026
wheel+0.025
 disl+0.025
vell+0.025
Neg logit contributions
 year-0.019
abel-0.014
 barn-0.012
 more-0.012
 restaurant-0.012
Token he
Token ID339
Feature activation-1.40e-02

Pos logit contributions
 year+0.017
abel+0.014
ara+0.012
ie+0.012
 barn+0.012
Neg logit contributions
anks-0.017
uts-0.017
wheel-0.017
 disl-0.016
vals-0.016
Token walked
Token ID6807
Feature activation-5.32e-02

Pos logit contributions
 year+0.024
abel+0.019
 barn+0.017
ara+0.017
 restaurant+0.017
Neg logit contributions
wheel-0.025
uts-0.025
anks-0.025
vals-0.024
 disl-0.024
Token nearer
Token ID40671
Feature activation-2.18e-02

Pos logit contributions
 year+0.079
abel+0.063
ara+0.053
 awoke+0.046
 barn+0.046
Neg logit contributions
anks-0.117
wheel-0.115
uts-0.113
vell-0.107
 disl-0.107
Token,
Token ID11
Feature activation+2.59e-03

Pos logit contributions
 year+0.033
abel+0.024
ara+0.021
 restaurant+0.019
 barn+0.019
Neg logit contributions
uts-0.048
anks-0.047
wheel-0.046
 disl-0.044
vals-0.044
Token the
Token ID262
Feature activation+1.55e-02

Pos logit contributions
anks+0.005
uts+0.005
wheel+0.005
 disl+0.005
vals+0.005
Neg logit contributions
 year-0.005
abel-0.004
ie-0.003
ara-0.003
 barn-0.003
Token bells
Token ID30987
Feature activation+4.14e-02

Pos logit contributions
anks+0.029
wheel+0.029
uts+0.029
 disl+0.028
vals+0.028
Neg logit contributions
 year-0.026
abel-0.022
ie-0.019
ara-0.019
 restaurant-0.018
Token grew
Token ID6348
Feature activation+3.44e-02

Pos logit contributions
anks+0.083
uts+0.079
 disl+0.078
wheel+0.077
vals+0.075
Neg logit contributions
 year-0.072
abel-0.056
ie-0.049
 more-0.048
 barn-0.047
Token in
Token ID287
Feature activation+1.55e-02

Pos logit contributions
anks+0.065
uts+0.064
wheel+0.062
 disl+0.060
vals+0.060
Neg logit contributions
 year-0.062
abel-0.049
ara-0.043
ie-0.042
 restaurant-0.042
Token sound
Token ID2128
Feature activation-6.13e-02

Pos logit contributions
anks+0.028
wheel+0.027
 disl+0.027
uts+0.027
 subtract+0.027
Neg logit contributions
 year-0.025
abel-0.021
ara-0.019
ie-0.019
unt-0.017
Token smiled
Token ID13541
Feature activation-2.08e-02

Pos logit contributions
 year+0.031
abel+0.022
 restaurant+0.020
 barn+0.020
One+0.020
Neg logit contributions
uts-0.033
wheel-0.033
anks-0.033
vals-0.031
 disl-0.031
Token down
Token ID866
Feature activation+2.28e-02

Pos logit contributions
 year+0.034
abel+0.025
ara+0.022
 restaurant+0.021
 barn+0.021
Neg logit contributions
anks-0.043
uts-0.043
wheel-0.040
vell-0.039
 disl-0.038
Token at
Token ID379
Feature activation+4.84e-02

Pos logit contributions
uts+0.039
anks+0.037
wheel+0.035
alks+0.034
vell+0.033
Neg logit contributions
 year-0.039
abel-0.036
ie-0.032
ara-0.031
 awoke-0.030
Token the
Token ID262
Feature activation+5.13e-03

Pos logit contributions
 disl+0.086
anks+0.084
alks+0.078
uts+0.078
wheel+0.077
Neg logit contributions
 year-0.084
abel-0.067
ara-0.061
 awoke-0.060
unt-0.059
Token little
Token ID1310
Feature activation-2.80e-02

Pos logit contributions
anks+0.013
wheel+0.012
uts+0.012
 disl+0.012
vell+0.012
Neg logit contributions
 year-0.006
abel-0.004
ie-0.003
ara-0.003
 awoke-0.003
Token boy
Token ID2933
Feature activation-1.77e-02

Pos logit contributions
 year+0.038
abel+0.028
ie+0.022
ara+0.021
 awoke+0.021
Neg logit contributions
anks-0.064
uts-0.063
wheel-0.062
 disl-0.062
vals-0.060
Token and
Token ID290
Feature activation-1.19e-03

Pos logit contributions
 year+0.020
abel+0.013
ie+0.010
 more+0.010
 barn+0.009
Neg logit contributions
anks-0.046
uts-0.045
 disl-0.045
wheel-0.045
vals-0.044
Token said
Token ID531
Feature activation+2.72e-02

Pos logit contributions
 year+0.002
abel+0.002
ie+0.002
ara+0.002
 barn+0.001
Neg logit contributions
anks-0.002
uts-0.002
wheel-0.002
 disl-0.002
vals-0.002
Token,
Token ID11
Feature activation-9.33e-03

Pos logit contributions
uts+0.062
 disl+0.060
anks+0.059
wheel+0.058
vell+0.055
Neg logit contributions
 year-0.038
abel-0.028
 barn-0.026
ara-0.024
ie-0.024
Token "
Token ID366
Feature activation-3.61e-02

Pos logit contributions
 year+0.015
abel+0.011
ie+0.010
ara+0.009
 barn+0.009
Neg logit contributions
anks-0.020
uts-0.019
wheel-0.019
 disl-0.019
vals-0.018
TokenThis
Token ID1212
Feature activation-7.59e-03

Pos logit contributions
 year+0.066
abel+0.052
 barn+0.046
ie+0.045
 restaurant+0.045
Neg logit contributions
wheel-0.066
anks-0.066
uts-0.065
 disl-0.064
vals-0.063
Token day
Token ID1110
Feature activation-9.05e-03

Pos logit contributions
 year+0.210
abel+0.193
ara+0.169
Will+0.166
 barn+0.164
Neg logit contributions
wheel-0.186
anks-0.180
uts-0.180
 disl-0.176
vals-0.169
Token,
Token ID11
Feature activation-2.72e-02

Pos logit contributions
 year+0.011
abel+0.008
ie+0.006
 barn+0.006
heart+0.006
Neg logit contributions
anks-0.022
uts-0.022
wheel-0.021
vals-0.021
 disl-0.021
Token Tom
Token ID4186
Feature activation-1.39e-03

Pos logit contributions
 year+0.054
abel+0.042
 more+0.039
ara+0.038
 restaurant+0.037
Neg logit contributions
anks-0.049
uts-0.047
 disl-0.046
wheel-0.045
vals-0.045
Token went
Token ID1816
Feature activation+2.96e-02

Pos logit contributions
 year+0.002
abel+0.001
ie+0.001
 barn+0.001
ara+0.001
Neg logit contributions
anks-0.003
uts-0.003
wheel-0.003
 disl-0.003
vals-0.003
Token to
Token ID284
Feature activation-2.18e-02

Pos logit contributions
anks+0.062
uts+0.060
wheel+0.060
 disl+0.058
vals+0.057
Neg logit contributions
 year-0.048
abel-0.038
ie-0.032
ara-0.031
 barn-0.030
Token the
Token ID262
Feature activation-8.37e-03

Pos logit contributions
 year+0.028
abel+0.019
 more+0.016
 restaurant+0.016
 me+0.016
Neg logit contributions
uts-0.054
anks-0.054
vals-0.052
wheel-0.052
vell-0.051
Token woods
Token ID16479
Feature activation-1.96e-02

Pos logit contributions
 year+0.013
abel+0.010
ie+0.009
heart+0.008
 awoke+0.007
Neg logit contributions
anks-0.018
wheel-0.017
uts-0.017
 disl-0.017
vell-0.016
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
 year+0.031
abel+0.025
ara+0.022
ie+0.021
 awoke+0.020
Neg logit contributions
anks-0.040
uts-0.039
wheel-0.039
 disl-0.037
vals-0.036
Token play
Token ID711
Feature activation+3.36e-03

Pos logit contributions
 year+0.041
abel+0.033
ie+0.030
 more+0.029
 barn+0.029
Neg logit contributions
anks-0.036
uts-0.035
wheel-0.033
vals-0.033
 disl-0.033
Token with
Token ID351
Feature activation+1.77e-02

Pos logit contributions
anks+0.007
wheel+0.007
uts+0.006
 disl+0.006
vell+0.006
Neg logit contributions
 year-0.006
abel-0.005
ara-0.004
 awoke-0.004
 barn-0.004
Token his
Token ID465
Feature activation-7.08e-03

Pos logit contributions
anks+0.036
uts+0.035
 disl+0.034
wheel+0.034
issors+0.032
Neg logit contributions
 year-0.029
abel-0.023
ara-0.020
 awoke-0.020
 barn-0.020
Token found
Token ID1043
Feature activation+4.76e-02

Pos logit contributions
anks+0.070
uts+0.069
wheel+0.067
 disl+0.066
vals+0.064
Neg logit contributions
 year-0.082
abel-0.066
ie-0.060
ara-0.059
 barn-0.057
Token a
Token ID257
Feature activation+2.56e-02

Pos logit contributions
anks+0.095
uts+0.093
wheel+0.091
 disl+0.089
mons+0.087
Neg logit contributions
 year-0.080
abel-0.062
ara-0.054
 restaurant-0.053
ie-0.052
Token bug
Token ID5434
Feature activation+4.81e-02

Pos logit contributions
anks+0.047
uts+0.047
wheel+0.047
 disl+0.046
vell+0.045
Neg logit contributions
 year-0.042
abel-0.036
ie-0.031
ara-0.030
heart-0.029
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
anks+0.102
uts+0.099
wheel+0.098
 disl+0.096
vals+0.094
Neg logit contributions
 year-0.077
abel-0.058
ie-0.050
ara-0.049
 barn-0.048
Token Sally
Token ID25737
Feature activation+3.25e-02

Pos logit contributions
anks+0.039
uts+0.037
wheel+0.037
vals+0.036
 disl+0.036
Neg logit contributions
 year-0.032
abel-0.024
ara-0.021
 barn-0.021
ie-0.021
Token watched
Token ID7342
Feature activation-9.89e-02

Pos logit contributions
uts+0.072
anks+0.072
wheel+0.071
 disl+0.069
vals+0.067
Neg logit contributions
 year-0.047
abel-0.034
 barn-0.028
ara-0.028
ie-0.028
Token as
Token ID355
Feature activation+1.05e-01

Pos logit contributions
 year+0.159
abel+0.120
ie+0.103
ara+0.101
 barn+0.100
Neg logit contributions
anks-0.210
uts-0.204
wheel-0.200
 disl-0.198
vals-0.194
Token it
Token ID340
Feature activation-3.58e-02

Pos logit contributions
uts+0.208
anks+0.203
wheel+0.201
 disl+0.200
mons+0.189
Neg logit contributions
 year-0.174
abel-0.136
 barn-0.118
ie-0.117
 awoke-0.116
Token gob
Token ID48484
Feature activation-6.14e-03

Pos logit contributions
 year+0.076
abel+0.061
ara+0.056
heart+0.055
unt+0.054
Neg logit contributions
wheel-0.052
uts-0.052
anks-0.052
 disl-0.049
vals-0.049
Tokenbled
Token ID9342
Feature activation+2.72e-02

Pos logit contributions
 year+0.018
abel+0.016
 barn+0.015
 restaurant+0.015
 awoke+0.015
Neg logit contributions
anks-0.005
uts-0.005
wheel-0.004
vell-0.004
mons-0.004
Token up
Token ID510
Feature activation-9.08e-03

Pos logit contributions
anks+0.055
uts+0.054
wheel+0.052
 disl+0.051
vals+0.051
Neg logit contributions
 year-0.045
abel-0.035
 barn-0.029
ara-0.029
 restaurant-0.029
Token hard
Token ID1327
Feature activation-1.40e-02

Pos logit contributions
 year+0.064
abel+0.045
 more+0.042
ie+0.041
 again+0.037
Neg logit contributions
anks-0.097
 disl-0.094
uts-0.094
vals-0.091
wheel-0.090
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 year+0.019
abel+0.012
 more+0.011
ie+0.011
 again+0.010
Neg logit contributions
anks-0.033
uts-0.033
 disl-0.033
wheel-0.032
vals-0.032
Token (space)
Token ID220
Feature activation-4.37e-02

Pos logit contributions
anks+0.025
uts+0.025
wheel+0.024
 disl+0.024
issors+0.023
Neg logit contributions
 year-0.018
abel-0.013
ie-0.012
 barn-0.011
 awoke-0.011
Token (newline)
Token ID198
Feature activation+9.72e-03

Pos logit contributions
 year+0.051
abel+0.034
ara+0.028
ie+0.026
One+0.026
Neg logit contributions
anks-0.109
 disl-0.108
vals-0.107
wheel-0.106
uts-0.104
Token (newline)
Token ID198
Feature activation+8.56e-03

Pos logit contributions
uts+0.017
anks+0.016
wheel+0.016
issors+0.015
 disl+0.015
Neg logit contributions
 year-0.017
abel-0.015
 barn-0.014
 awoke-0.013
 restaurant-0.013
TokenOne
Token ID3198
Feature activation-1.19e-01

Pos logit contributions
wheel+0.021
uts+0.021
issors+0.020
 disl+0.020
anks+0.019
Neg logit contributions
 year-0.009
abel-0.007
 awoke-0.006
 barn-0.006
 restaurant-0.006
Token day
Token ID1110
Feature activation-9.71e-03

Pos logit contributions
 year+0.198
abel+0.173
 barn+0.156
 awoke+0.150
 pasture+0.146
Neg logit contributions
wheel-0.207
uts-0.199
anks-0.198
 disl-0.197
vals-0.189
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
 year+0.012
abel+0.008
 barn+0.007
ie+0.006
 restaurant+0.006
Neg logit contributions
uts-0.024
anks-0.024
wheel-0.023
 disl-0.023
vals-0.022
Token Lily
Token ID20037
Feature activation+9.60e-03

Pos logit contributions
 year+0.050
abel+0.039
ie+0.034
 barn+0.033
ara+0.033
Neg logit contributions
anks-0.059
uts-0.057
wheel-0.056
 disl-0.056
vals-0.054
Token's
Token ID338
Feature activation-3.69e-02

Pos logit contributions
anks+0.023
uts+0.022
wheel+0.022
 disl+0.021
vals+0.021
Neg logit contributions
 year-0.013
abel-0.010
ie-0.008
ara-0.008
 barn-0.008
Token mom
Token ID1995
Feature activation-2.39e-02

Pos logit contributions
 year+0.043
abel+0.027
 shop+0.023
 more+0.022
 restaurant+0.022
Neg logit contributions
anks-0.095
uts-0.095
 disl-0.094
wheel-0.093
vals-0.093
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
 year+0.023
abel+0.018
heart+0.015
 awoke+0.015
ara+0.015
Neg logit contributions
uts-0.026
anks-0.026
 disl-0.025
wheel-0.025
 supposed-0.024
Token Tom
Token ID4186
Feature activation-1.23e-02

Pos logit contributions
 year+0.070
abel+0.058
ie+0.053
 barn+0.051
ara+0.051
Neg logit contributions
anks-0.048
uts-0.047
wheel-0.045
 disl-0.044
vals-0.043
Token had
Token ID550
Feature activation+1.18e-02

Pos logit contributions
 year+0.017
abel+0.012
ie+0.010
ara+0.010
 barn+0.010
Neg logit contributions
anks-0.029
uts-0.028
wheel-0.028
 disl-0.027
vals-0.027
Token a
Token ID257
Feature activation-3.25e-03

Pos logit contributions
anks+0.025
uts+0.024
wheel+0.023
 disl+0.023
vals+0.023
Neg logit contributions
 year-0.019
abel-0.015
ie-0.013
ara-0.012
heart-0.012
Token big
Token ID1263
Feature activation-3.62e-02

Pos logit contributions
 year+0.005
abel+0.004
ie+0.004
heart+0.004
ara+0.003
Neg logit contributions
anks-0.006
uts-0.006
 disl-0.006
wheel-0.006
vals-0.006
Token field
Token ID2214
Feature activation-8.21e-02

Pos logit contributions
 year+0.058
abel+0.050
ara+0.041
Will+0.040
ie+0.040
Neg logit contributions
anks-0.071
 disl-0.068
uts-0.066
issors-0.065
vals-0.065
Token where
Token ID810
Feature activation-6.23e-03

Pos logit contributions
 year+0.143
abel+0.126
ara+0.108
 awoke+0.107
heart+0.104
Neg logit contributions
anks-0.149
uts-0.148
wheel-0.144
vals-0.136
issors-0.135
Token he
Token ID339
Feature activation-3.82e-02

Pos logit contributions
 year+0.012
abel+0.010
ara+0.009
ie+0.008
 awoke+0.008
Neg logit contributions
anks-0.011
uts-0.011
 disl-0.010
wheel-0.010
vals-0.010
Token grew
Token ID6348
Feature activation+1.60e-02

Pos logit contributions
 year+0.061
abel+0.047
ie+0.040
ara+0.039
 barn+0.039
Neg logit contributions
anks-0.080
uts-0.078
wheel-0.077
 disl-0.076
vals-0.074
Token tasty
Token ID25103
Feature activation-4.63e-02

Pos logit contributions
anks+0.033
uts+0.033
wheel+0.031
 disl+0.031
vell+0.030
Neg logit contributions
 year-0.026
abel-0.021
ara-0.018
 awoke-0.017
ie-0.017
Token corn
Token ID11676
Feature activation-6.14e-02

Pos logit contributions
 year+0.073
abel+0.062
ara+0.053
 awoke+0.053
ie+0.049
Neg logit contributions
anks-0.095
uts-0.095
mons-0.087
wheel-0.085
 disl-0.085
Token (newline)
Token ID198
Feature activation+2.33e-02

Pos logit contributions
anks+0.058
uts+0.056
wheel+0.055
 disl+0.055
vals+0.054
Neg logit contributions
 year-0.036
abel-0.026
ie-0.022
ara-0.021
 barn-0.021
TokenHe
Token ID1544
Feature activation+1.25e-02

Pos logit contributions
anks+0.052
uts+0.051
wheel+0.050
 disl+0.050
vals+0.048
Neg logit contributions
 year-0.035
abel-0.026
ie-0.022
 barn-0.021
ara-0.021
Token decided
Token ID3066
Feature activation+9.56e-03

Pos logit contributions
wheel+0.020
anks+0.019
 disl+0.019
uts+0.018
 cheated+0.018
Neg logit contributions
 year-0.023
abel-0.018
 barn-0.018
 restaurant-0.018
ara-0.017
Token that
Token ID326
Feature activation+1.91e-02

Pos logit contributions
anks+0.021
uts+0.021
wheel+0.020
 disl+0.020
vals+0.020
Neg logit contributions
 year-0.014
abel-0.011
ie-0.009
ara-0.009
 barn-0.009
Token he
Token ID339
Feature activation-1.49e-02

Pos logit contributions
anks+0.038
uts+0.037
wheel+0.036
 disl+0.036
vals+0.035
Neg logit contributions
 year-0.033
abel-0.026
ie-0.023
ara-0.022
 barn-0.022
Token would
Token ID561
Feature activation-8.93e-02

Pos logit contributions
 year+0.023
abel+0.017
 barn+0.015
ara+0.015
ie+0.015
Neg logit contributions
anks-0.032
uts-0.031
wheel-0.031
 disl-0.030
vals-0.029
Token never
Token ID1239
Feature activation-3.56e-03

Pos logit contributions
 year+0.169
abel+0.134
ie+0.119
ara+0.117
 barn+0.116
Neg logit contributions
anks-0.163
uts-0.158
wheel-0.154
 disl-0.152
vals-0.149
Token forget
Token ID6044
Feature activation+8.02e-02

Pos logit contributions
 year+0.007
abel+0.006
ie+0.005
 more+0.005
 again+0.005
Neg logit contributions
anks-0.006
uts-0.006
 disl-0.006
vals-0.006
issors-0.006
Token his
Token ID465
Feature activation+9.65e-03

Pos logit contributions
anks+0.155
uts+0.154
 disl+0.153
wheel+0.149
vell+0.138
Neg logit contributions
 year-0.136
abel-0.106
ara-0.098
ie-0.091
 barn-0.091
Token shoes
Token ID10012
Feature activation+3.89e-02

Pos logit contributions
anks+0.019
uts+0.018
wheel+0.018
vals+0.017
 disl+0.017
Neg logit contributions
 year-0.018
abel-0.014
 more-0.012
ie-0.012
 barn-0.012
Token again
Token ID757
Feature activation-1.31e-01

Pos logit contributions
anks+0.089
uts+0.087
wheel+0.085
 disl+0.085
vals+0.084
Neg logit contributions
 year-0.055
abel-0.040
ie-0.034
 more-0.032
ara-0.032
Token you
Token ID345
Feature activation+3.18e-02

Pos logit contributions
anks+0.035
uts+0.034
 disl+0.034
wheel+0.033
issors+0.032
Neg logit contributions
 year-0.042
abel-0.034
 awoke-0.031
 barn-0.030
ie-0.030
Token."
Token ID526
Feature activation-6.22e-03

Pos logit contributions
anks+0.064
uts+0.062
wheel+0.061
 disl+0.057
vell+0.057
Neg logit contributions
 year-0.052
abel-0.042
ara-0.037
 barn-0.036
 awoke-0.036
Token (newline)
Token ID198
Feature activation+1.49e-02

Pos logit contributions
 year+0.010
abel+0.007
ie+0.006
 barn+0.006
 awoke+0.006
Neg logit contributions
anks-0.014
uts-0.013
wheel-0.013
 disl-0.013
vals-0.012
Token (newline)
Token ID198
Feature activation+1.27e-02

Pos logit contributions
anks+0.029
uts+0.029
 disl+0.028
wheel+0.028
issors+0.026
Neg logit contributions
 year-0.025
abel-0.021
 barn-0.018
 awoke-0.017
ie-0.017
TokenS
Token ID50
Feature activation-1.61e-02

Pos logit contributions
uts+0.024
wheel+0.023
 disl+0.023
anks+0.022
vals+0.022
Neg logit contributions
 year-0.022
abel-0.020
 awoke-0.017
ie-0.017
heart-0.016
Tokenara
Token ID3301
Feature activation-8.92e-02

Pos logit contributions
 year+0.014
abel+0.007
 more+0.004
 awoke+0.004
 barn+0.004
Neg logit contributions
anks-0.046
uts-0.044
vals-0.044
wheel-0.044
issors-0.043
Token and
Token ID290
Feature activation-9.62e-03

Pos logit contributions
 year+0.141
abel+0.103
 barn+0.092
ie+0.091
heart+0.089
Neg logit contributions
uts-0.185
wheel-0.184
anks-0.183
 disl-0.174
mons-0.169
Token Tom
Token ID4186
Feature activation+8.44e-03

Pos logit contributions
 year+0.019
abel+0.016
ara+0.014
 more+0.014
ie+0.014
Neg logit contributions
anks-0.017
uts-0.017
wheel-0.016
 disl-0.016
vell-0.016
Token sighed
Token ID21893
Feature activation-9.68e-02

Pos logit contributions
wheel+0.015
anks+0.014
uts+0.014
vell+0.014
 bleed+0.013
Neg logit contributions
 year-0.016
abel-0.012
 barn-0.012
 restaurant-0.011
heart-0.011
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
 year+0.174
abel+0.148
 barn+0.127
ara+0.126
heart+0.125
Neg logit contributions
uts-0.178
anks-0.178
wheel-0.167
 disl-0.166
 bleed-0.162
Token They
Token ID1119
Feature activation-2.23e-02

Pos logit contributions
anks+0.012
uts+0.011
 disl+0.011
wheel+0.010
issors+0.010
Neg logit contributions
 year-0.013
abel-0.011
ie-0.010
heart-0.009
 awoke-0.009