TOP ACTIVATIONS
MAX = 0.140

son . He is three years old , like you .
Mia . She was three years old and the happiest little
Lucy . She was three years old and loved to explore
. Tom was only three years old , and he loved
Sam . She was three years old . Sam was very
. She was only 3 years old , but she knew
Tim . Tim was three years old . He was always
, who was only two years old . Lily wanted to
Anna . She was three years old and loved to explore
my . He was 3 years old and it was a
. She was only three years old and very excited to
Lisa . I 'm three years old ."
brother . He is six years old and he thinks he
twins . They are three years old . They like to
though he was only three years old , he felt proud
brother . He is six years old . He thinks he
Ann . Ann was three years old . One day ,
a little boy of three years old came up to her
. Tom was only 3 years old and he had just
Sally . She was three years old and very happy .

MIN ACTIVATIONS
MIN = -0.259

saw a dusty old ash tr ay on the shelf .
decided to make an ash tr ay out of clay .
he sp ied an ash tr ay . He picked up
very tired . The ash tr ay watched as the little
backyard there was an ash tr ay . It was the
. It was an ash tr ay . He wanted to
, Lily found an ash tr ay in the world behind
The house had an ash tr ay outside where people put
gave him a small ash tr ay as a gift .
He picked up the ash tr ay and snapped it -
and accidentally dropped her ash tr ay on the ground .
. The ash tr ay was ready to provide
They grabbed an old ash tr ay and put it in
to create her own ash tr ay . She went to
. He took the ash tr ay outside and forgot to
cigarette and using an ash tr ay . L
accidentally knocked over the ash tr ay and it broke into
" That is an ash tr ay . It is not
Tim saw a small ash tr ay outside the door .
was right . The ash tr ay is not for kids

INTERVAL 0.041 - 0.140
CONTAINS 4.015%

tickets to go on a hot air balloon ride . We
and swim . He was tired but happy . He had
on , Sue and the dog were best friends . They
it tried , it couldn 't find its way back .
have an idea . Why don 't we take a picture

INTERVAL -0.059 - 0.041
CONTAINS 95.058%

I could use some help ." So the witch
into a big pot . Then , her mom lit the
was a family that was ready to take a trip to
day onwards , the pink worm always looked forward to adding
and he came across an unusual man . The man had

INTERVAL -0.159 - -0.059
CONTAINS 0.924%

." L ily realized her mistake and quickly cleaned
dog was barking loudly and sc aring everyone .
toys . They have a car , a doll , a
a time , a mysterious rocket flew up in the sky
looked like new again ! Everyone cheered when they saw it

INTERVAL -0.259 - -0.159
CONTAINS 0.003%

Did you destroy my ash tr ay ?" Lily felt bad
uffy playing with the ash tr ay . She told Fl
not played with the ash tr ay . They wish they
. Tim felt even sad der now , and he cried
. Rex sn arl s . He does not
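
NOTE ON THE INTERVALS

The four INTERVAL blocks above look like fixed-width bins of width 0.1 that start at MIN (-0.259), with the last bin cut off at MAX (0.140). Their CONTAINS percentages sum to 100.000%. The sketch below shows one way such a table could be computed. It is an assumption about the binning, not the dashboard's actual code; the function name `bucket_activations` and its parameters are hypothetical.

```python
import math

def bucket_activations(activations, lo, hi, width=0.1):
    """Bucket activations into fixed-width intervals starting at `lo`.

    Assumed scheme (inferred from the interval edges in the dump, not confirmed):
    bins are [lo, lo+width), [lo+width, lo+2*width), ... and the last bin
    is clipped at `hi`. Returns (start, end, percent) tuples in ascending order.
    """
    # Small tolerance so an exact multiple of `width` does not create an extra bin.
    n_bins = max(1, math.ceil((hi - lo) / width - 1e-9))
    counts = [0] * n_bins
    for a in activations:
        idx = int((a - lo) / width)
        idx = min(max(idx, 0), n_bins - 1)  # clamp values sitting on the edges
        counts[idx] += 1
    total = len(activations)
    return [
        (lo + i * width, min(lo + (i + 1) * width, hi), 100.0 * c / total)
        for i, c in enumerate(counts)
    ]

# Illustrative values only; these are not the activations behind the dump.
sample = [0.140, 0.139, 0.0, -0.2, 0.01, -0.1]
for start, end, pct in reversed(bucket_activations(sample, lo=-0.259, hi=0.140)):
    print(f"INTERVAL {start:.3f} - {end:.3f}  CONTAINS {pct:.3f}%")
```

With MIN = -0.259 and MAX = 0.140 this reproduces the same four interval edges as the dump, printed from the highest interval to the lowest.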
Token son
Token ID3367
Feature activation+9.12e-03

Pos logit contributions
utes+0.119
tr+0.107
arl+0.095
itted+0.092
der+0.092
Neg logit contributions
!-0.124
!"-0.115
,-0.100
 years-0.099
 scared-0.098
Token.
Token ID13
Feature activation+3.75e-03

Pos logit contributions
!+0.007
!"+0.006
 years+0.005
 scared+0.004
 fright+0.004
Neg logit contributions
utes-0.025
tr-0.025
itted-0.022
oys-0.021
der-0.021
Token He
Token ID679
Feature activation-1.95e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
."+0.004
.+0.004
Neg logit contributions
utes-0.009
tr-0.008
ute-0.008
Years-0.008
itted-0.008
Token is
Token ID318
Feature activation+2.54e-02

Pos logit contributions
utes+0.047
tr+0.046
der+0.041
itted+0.041
arl+0.041
Neg logit contributions
!-0.024
!"-0.021
 years-0.017
,-0.016
 scared-0.016
Token three
Token ID1115
Feature activation-2.69e-02

Pos logit contributions
!+0.031
!"+0.027
,+0.022
 years+0.021
 scared+0.020
Neg logit contributions
utes-0.067
tr-0.061
arl-0.057
der-0.056
itted-0.056
Token years
Token ID812
Feature activation+1.40e-01

Pos logit contributions
utes+0.066
der+0.063
 chords+0.059
 songs+0.059
tr+0.058
Neg logit contributions
!-0.019
!"-0.016
 free-0.016
?!"-0.015
 fright-0.015
Token old
Token ID1468
Feature activation+5.76e-03

Pos logit contributions
!+0.199
!"+0.179
,+0.153
 years+0.144
.+0.142
Neg logit contributions
utes-0.348
tr-0.324
der-0.293
arl-0.293
itted-0.293
Token,
Token ID11
Feature activation+1.43e-03

Pos logit contributions
!+0.004
!"+0.003
 free+0.002
 scared+0.002
e+0.002
Neg logit contributions
utes-0.016
der-0.016
tr-0.016
arl-0.014
itted-0.014
Token like
Token ID588
Feature activation+2.22e-02

Pos logit contributions
!+0.002
!"+0.001
,+0.001
.+0.001
 years+0.001
Neg logit contributions
utes-0.004
tr-0.004
arl-0.003
itted-0.003
osity-0.003
Token you
Token ID345
Feature activation-2.97e-02

Pos logit contributions
!+0.034
!"+0.031
,+0.027
.+0.026
 years+0.025
Neg logit contributions
utes-0.053
tr-0.049
arl-0.045
itted-0.045
ute-0.044
Token.
Token ID13
Feature activation+1.47e-03

Pos logit contributions
utes+0.089
tr+0.084
arl+0.079
der+0.078
itted+0.076
Neg logit contributions
!-0.019
!"-0.014
 years-0.011
 scared-0.010
,-0.008
Token Mia
Token ID32189
Feature activation+2.44e-03

Pos logit contributions
!+0.011
!"+0.009
,+0.008
.+0.007
 scared+0.007
Neg logit contributions
utes-0.023
tr-0.021
arl-0.019
itted-0.019
der-0.019
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
!+0.002
!"+0.001
 years+0.001
 scared+0.001
 trouble+0.001
Neg logit contributions
utes-0.007
tr-0.007
der-0.006
itted-0.006
 conductor-0.006
Token She
Token ID1375
Feature activation+1.40e-02

Pos logit contributions
!+0.019
!"+0.017
,+0.014
 years+0.013
 scared+0.013
Neg logit contributions
utes-0.032
tr-0.029
arl-0.027
der-0.027
itted-0.027
Token was
Token ID373
Feature activation+1.46e-02

Pos logit contributions
!+0.019
!"+0.018
 years+0.014
!".+0.013
,+0.013
Neg logit contributions
utes-0.032
tr-0.031
der-0.027
 conductor-0.027
itted-0.027
Token three
Token ID1115
Feature activation-3.00e-02

Pos logit contributions
!+0.017
!"+0.014
,+0.012
 scared+0.011
 years+0.011
Neg logit contributions
utes-0.040
tr-0.037
arl-0.035
der-0.035
itted-0.034
Token years
Token ID812
Feature activation+1.40e-01

Pos logit contributions
utes+0.068
 songs+0.061
der+0.060
as+0.059
itted+0.059
Neg logit contributions
!-0.026
!"-0.025
?!"-0.025
't-0.025
!".-0.023
Token old
Token ID1468
Feature activation+5.64e-03

Pos logit contributions
!+0.226
!"+0.208
 years+0.172
,+0.170
e+0.170
Neg logit contributions
utes-0.305
tr-0.267
arl-0.255
der-0.248
itted-0.245
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
!+0.005
!"+0.004
 free+0.003
 scared+0.003
!".+0.003
Neg logit contributions
utes-0.015
tr-0.015
der-0.014
itted-0.013
 conductor-0.013
Token the
Token ID262
Feature activation+4.20e-03

Pos logit contributions
utes+0.022
tr+0.021
itted+0.018
 conductor+0.018
der+0.018
Neg logit contributions
!-0.017
!"-0.016
,-0.013
!".-0.013
 years-0.013
Token happiest
Token ID49414
Feature activation-4.19e-02

Pos logit contributions
!+0.007
!"+0.006
,+0.006
 scared+0.005
.+0.005
Neg logit contributions
utes-0.010
tr-0.008
itted-0.008
arl-0.008
der-0.008
Token little
Token ID1310
Feature activation-2.82e-02

Pos logit contributions
utes+0.087
tr+0.077
arl+0.073
itted+0.069
osity+0.068
Neg logit contributions
!-0.081
!"-0.073
,-0.068
.-0.065
 scared-0.062
Token Lucy
Token ID22162
Feature activation+8.86e-02

Pos logit contributions
!+0.018
!"+0.016
 years+0.014
,+0.014
 scared+0.013
Neg logit contributions
utes-0.029
tr-0.027
arl-0.025
der-0.024
itted-0.024
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
!+0.065
!"+0.059
 years+0.047
 trouble+0.039
 scared+0.036
Neg logit contributions
utes-0.243
tr-0.238
itted-0.219
der-0.218
arl-0.215
Token She
Token ID1375
Feature activation+1.25e-02

Pos logit contributions
!+0.016
!"+0.014
 years+0.012
 scared+0.011
,+0.011
Neg logit contributions
utes-0.024
tr-0.022
arl-0.020
der-0.020
itted-0.020
Token was
Token ID373
Feature activation+1.75e-02

Pos logit contributions
!+0.016
!"+0.015
 years+0.012
,+0.012
 scared+0.011
Neg logit contributions
utes-0.030
tr-0.028
itted-0.025
der-0.025
 conductor-0.025
Token three
Token ID1115
Feature activation-3.08e-02

Pos logit contributions
!+0.022
!"+0.019
,+0.016
 scared+0.015
 years+0.015
Neg logit contributions
utes-0.047
tr-0.043
arl-0.041
der-0.040
itted-0.039
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.071
itted+0.065
 songs+0.064
der+0.064
 chords+0.063
Neg logit contributions
!"-0.026
!-0.026
't-0.024
?!"-0.023
!".-0.023
Token old
Token ID1468
Feature activation+5.41e-03

Pos logit contributions
!+0.224
!"+0.207
 years+0.169
,+0.169
 scared+0.167
Neg logit contributions
utes-0.308
tr-0.271
arl-0.257
der-0.251
itted-0.249
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
!+0.005
!"+0.005
 free+0.003
 scared+0.003
e+0.003
Neg logit contributions
utes-0.014
tr-0.014
der-0.013
itted-0.012
 conductor-0.012
Token loved
Token ID6151
Feature activation+2.64e-02

Pos logit contributions
utes+0.020
tr+0.019
itted+0.016
 conductor+0.016
Years+0.016
Neg logit contributions
!-0.018
!"-0.018
!".-0.014
 years-0.014
!",-0.014
Token to
Token ID284
Feature activation+1.69e-03

Pos logit contributions
!+0.040
!"+0.037
,+0.031
 years+0.030
 scared+0.029
Neg logit contributions
utes-0.062
tr-0.056
arl-0.052
der-0.051
itted-0.051
Token explore
Token ID7301
Feature activation+3.82e-02

Pos logit contributions
!+0.003
!"+0.002
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.004
tr-0.003
arl-0.003
der-0.003
ute-0.003
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
!+0.025
!"+0.023
 years+0.019
 trouble+0.016
 free+0.016
Neg logit contributions
utes-0.091
tr-0.088
der-0.083
 conductor-0.080
arl-0.079
Token Tom
Token ID4186
Feature activation+2.86e-02

Pos logit contributions
!+0.015
!"+0.014
,+0.012
 years+0.011
 scared+0.011
Neg logit contributions
utes-0.028
tr-0.025
arl-0.023
der-0.023
itted-0.023
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
!+0.043
!"+0.043
 fright+0.039
 trouble+0.038
 years+0.037
Neg logit contributions
tr-0.053
utes-0.051
der-0.044
 conductor-0.044
itted-0.043
Token only
Token ID691
Feature activation+6.62e-03

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 scared+0.010
 years+0.009
Neg logit contributions
utes-0.035
tr-0.033
arl-0.031
der-0.031
itted-0.030
Token three
Token ID1115
Feature activation-3.17e-02

Pos logit contributions
!+0.013
!"+0.012
,+0.011
 scared+0.011
.+0.011
Neg logit contributions
utes-0.013
tr-0.012
arl-0.011
der-0.011
osity-0.011
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.076
der+0.073
itted+0.067
asers+0.067
 chords+0.067
Neg logit contributions
!"-0.025
!-0.025
?!"-0.021
!".-0.021
 free-0.020
Token old
Token ID1468
Feature activation+6.73e-03

Pos logit contributions
!+0.214
!"+0.195
,+0.162
 years+0.159
 scared+0.157
Neg logit contributions
utes-0.319
tr-0.288
arl-0.270
der-0.266
itted-0.261
Token,
Token ID11
Feature activation-1.68e-03

Pos logit contributions
!+0.005
!"+0.004
 free+0.002
 scared+0.002
 years+0.002
Neg logit contributions
utes-0.020
tr-0.019
der-0.018
itted-0.017
arl-0.017
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
utes+0.004
tr+0.004
der+0.004
arl+0.004
itted+0.004
Neg logit contributions
!-0.002
!"-0.002
,-0.002
 years-0.002
 scared-0.001
Token he
Token ID339
Feature activation+2.14e-02

Pos logit contributions
utes+0.028
tr+0.026
arl+0.024
der+0.024
itted+0.024
Neg logit contributions
!-0.014
!"-0.012
,-0.010
 years-0.010
 scared-0.010
Token loved
Token ID6151
Feature activation+2.53e-02

Pos logit contributions
!+0.035
!"+0.033
,+0.026
!".+0.026
 years+0.026
Neg logit contributions
utes-0.043
tr-0.042
der-0.036
 conductor-0.036
itted-0.036
Token Sam
Token ID3409
Feature activation-1.23e-02

Pos logit contributions
!+0.015
!"+0.014
,+0.012
 years+0.011
 scared+0.011
Neg logit contributions
utes-0.027
tr-0.025
arl-0.023
der-0.023
itted-0.023
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
utes+0.034
tr+0.033
der+0.031
arl+0.030
 conductor+0.030
Neg logit contributions
!-0.008
!"-0.008
 years-0.006
 trouble-0.005
 scared-0.005
Token She
Token ID1375
Feature activation+1.38e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.012
.+0.011
 scared+0.011
Neg logit contributions
utes-0.027
tr-0.025
der-0.023
arl-0.023
itted-0.023
Token was
Token ID373
Feature activation+1.61e-02

Pos logit contributions
!+0.018
!"+0.017
 years+0.013
!".+0.012
,+0.012
Neg logit contributions
utes-0.033
tr-0.031
 conductor-0.027
itted-0.027
der-0.027
Token three
Token ID1115
Feature activation-3.13e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.013
 scared+0.013
 years+0.012
Neg logit contributions
utes-0.045
tr-0.041
arl-0.039
der-0.039
osity-0.038
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.073
itted+0.065
der+0.064
 songs+0.064
as+0.063
Neg logit contributions
!"-0.026
!-0.026
?!"-0.024
't-0.024
!".-0.024
Token old
Token ID1468
Feature activation+5.73e-03

Pos logit contributions
!+0.220
!"+0.202
,+0.166
 years+0.165
 scared+0.162
Neg logit contributions
utes-0.313
tr-0.277
arl-0.262
der-0.256
itted-0.254
Token.
Token ID13
Feature activation+3.14e-03

Pos logit contributions
!+0.005
!"+0.005
 free+0.003
 scared+0.003
!".+0.003
Neg logit contributions
utes-0.015
tr-0.015
der-0.014
itted-0.013
 conductor-0.013
Token Sam
Token ID3409
Feature activation-1.87e-02

Pos logit contributions
!+0.004
!"+0.004
,+0.003
 years+0.003
 scared+0.003
Neg logit contributions
utes-0.008
tr-0.007
arl-0.007
itted-0.007
der-0.007
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
utes+0.037
tr+0.037
 conductor+0.032
der+0.032
itted+0.032
Neg logit contributions
!-0.026
!"-0.026
 trouble-0.021
 fright-0.021
 free-0.021
Token very
Token ID845
Feature activation+2.09e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.013
 years+0.013
.+0.012
Neg logit contributions
utes-0.033
tr-0.031
itted-0.028
arl-0.028
der-0.028
Token.
Token ID13
Feature activation+1.76e-03

Pos logit contributions
utes+0.097
tr+0.092
der+0.085
arl+0.085
itted+0.084
Neg logit contributions
!-0.027
!"-0.024
 years-0.016
,-0.015
 scared-0.014
Token She
Token ID1375
Feature activation+6.99e-03

Pos logit contributions
!+0.003
!"+0.002
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.004
tr-0.004
arl-0.003
itted-0.003
der-0.003
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
!+0.009
!"+0.008
,+0.006
 years+0.006
 scared+0.006
Neg logit contributions
utes-0.018
tr-0.017
der-0.015
itted-0.015
arl-0.015
Token only
Token ID691
Feature activation+8.09e-03

Pos logit contributions
!+0.017
!"+0.016
!".+0.012
 years+0.012
,+0.012
Neg logit contributions
utes-0.033
tr-0.031
itted-0.029
der-0.027
arl-0.027
Token 3
Token ID513
Feature activation-1.13e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.015
 scared+0.015
 years+0.014
Neg logit contributions
utes-0.014
tr-0.013
arl-0.012
der-0.011
ute-0.011
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
der+0.026
 lear+0.023
 songs+0.023
utes+0.023
 chords+0.023
Neg logit contributions
?!"-0.009
!"-0.009
?",-0.009
?".-0.009
�-0.008
Token old
Token ID1468
Feature activation+4.55e-03

Pos logit contributions
!+0.214
!"+0.194
,+0.162
 years+0.159
 scared+0.155
Neg logit contributions
utes-0.321
tr-0.290
arl-0.273
der-0.267
itted-0.265
Token,
Token ID11
Feature activation-3.54e-04

Pos logit contributions
!"+0.004
!+0.003
 free+0.003
e+0.002
!".+0.002
Neg logit contributions
der-0.012
tr-0.011
utes-0.011
itted-0.010
 lear-0.010
Token but
Token ID475
Feature activation+1.47e-02

Pos logit contributions
utes+0.001
tr+0.001
der+0.001
itted+0.001
arl+0.001
Neg logit contributions
!-0.000
!"-0.000
,-0.000
 years-0.000
 scared-0.000
Token she
Token ID673
Feature activation+2.80e-02

Pos logit contributions
!+0.021
!"+0.020
 years+0.015
!".+0.015
,+0.014
Neg logit contributions
utes-0.033
tr-0.031
der-0.030
itted-0.028
osity-0.028
Token knew
Token ID2993
Feature activation-4.49e-02

Pos logit contributions
!+0.041
!"+0.039
!".+0.030
."+0.030
,+0.030
Neg logit contributions
utes-0.062
tr-0.060
der-0.055
itted-0.052
 conductor-0.052
Token Tim
Token ID5045
Feature activation+4.67e-02

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 years+0.009
 scared+0.009
Neg logit contributions
utes-0.029
tr-0.027
arl-0.025
itted-0.025
der-0.025
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
!+0.061
!"+0.058
 years+0.050
 scared+0.046
,+0.045
Neg logit contributions
utes-0.104
tr-0.100
arl-0.091
der-0.090
itted-0.088
Token Tim
Token ID5045
Feature activation+3.94e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.013
 years+0.013
 scared+0.013
Neg logit contributions
utes-0.034
tr-0.032
arl-0.030
der-0.029
itted-0.029
Token was
Token ID373
Feature activation+1.36e-02

Pos logit contributions
!+0.071
!"+0.069
 fright+0.063
 trouble+0.062
 years+0.060
Neg logit contributions
tr-0.062
utes-0.059
itted-0.051
ashed-0.050
 conductor-0.049
Token three
Token ID1115
Feature activation-2.99e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.011
 scared+0.011
 years+0.010
Neg logit contributions
utes-0.037
tr-0.034
arl-0.032
der-0.032
itted-0.031
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.065
der+0.063
 songs+0.059
itted+0.059
tr+0.059
Neg logit contributions
!"-0.027
!-0.026
!".-0.024
?!"-0.024
't-0.023
Token old
Token ID1468
Feature activation+7.09e-03

Pos logit contributions
!+0.211
!"+0.192
,+0.160
 years+0.156
 scared+0.153
Neg logit contributions
utes-0.325
tr-0.295
arl-0.275
der-0.270
itted-0.268
Token.
Token ID13
Feature activation+5.07e-03

Pos logit contributions
!+0.007
!"+0.006
 years+0.004
 free+0.004
 scared+0.004
Neg logit contributions
utes-0.019
tr-0.018
der-0.017
itted-0.016
arl-0.016
Token He
Token ID679
Feature activation-1.45e-02

Pos logit contributions
!+0.008
!"+0.008
 years+0.007
 scared+0.006
,+0.006
Neg logit contributions
utes-0.011
tr-0.010
arl-0.009
der-0.009
itted-0.009
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
utes+0.032
tr+0.030
der+0.027
itted+0.026
 conductor+0.026
Neg logit contributions
!-0.023
!"-0.021
,-0.017
 years-0.017
!".-0.016
Token always
Token ID1464
Feature activation-4.02e-03

Pos logit contributions
!+0.017
!"+0.015
,+0.013
 years+0.012
 scared+0.012
Neg logit contributions
utes-0.032
tr-0.030
arl-0.028
der-0.027
itted-0.027
Token,
Token ID11
Feature activation+8.25e-03

Pos logit contributions
!+0.001
!"+0.001
 fright+0.001
 years+0.001
 panic+0.001
Neg logit contributions
utes-0.003
tr-0.003
der-0.003
 conductor-0.003
itted-0.003
Token who
Token ID508
Feature activation-1.83e-02

Pos logit contributions
!+0.010
!"+0.009
,+0.008
 years+0.007
.+0.007
Neg logit contributions
utes-0.022
tr-0.020
arl-0.019
itted-0.019
der-0.019
Token was
Token ID373
Feature activation+1.48e-02

Pos logit contributions
utes+0.045
tr+0.042
der+0.038
arl+0.038
itted+0.038
Neg logit contributions
!-0.025
!"-0.023
 years-0.018
,-0.018
 scared-0.017
Token only
Token ID691
Feature activation+8.28e-03

Pos logit contributions
!+0.020
!"+0.017
,+0.015
 scared+0.014
.+0.014
Neg logit contributions
utes-0.038
tr-0.035
arl-0.034
der-0.033
itted-0.033
Token two
Token ID734
Feature activation-5.41e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.014
.+0.013
 scared+0.013
Neg logit contributions
utes-0.017
tr-0.016
arl-0.014
ute-0.014
der-0.014
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.114
tr+0.103
arl+0.095
der+0.092
itted+0.092
Neg logit contributions
!-0.096
!"-0.088
,-0.077
 years-0.074
 scared-0.073
Token old
Token ID1468
Feature activation+4.01e-03

Pos logit contributions
!+0.194
!"+0.170
,+0.149
.+0.137
 years+0.137
Neg logit contributions
utes-0.349
tr-0.326
arl-0.296
itted-0.295
der-0.292
Token.
Token ID13
Feature activation+6.63e-03

Pos logit contributions
!+0.002
!"+0.001
,+0.001
 years+0.000
 scared+0.000
Neg logit contributions
utes-0.013
tr-0.013
arl-0.012
der-0.012
itted-0.012
Token Lily
Token ID20037
Feature activation+9.20e-04

Pos logit contributions
!+0.007
!"+0.006
,+0.005
 years+0.004
 scared+0.004
Neg logit contributions
utes-0.019
tr-0.017
arl-0.016
der-0.016
itted-0.016
Token wanted
Token ID2227
Feature activation+2.61e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
 free+0.001
!".+0.001
Neg logit contributions
utes-0.002
tr-0.002
der-0.002
 conductor-0.002
itted-0.002
Token to
Token ID284
Feature activation+8.53e-04

Pos logit contributions
!+0.027
 the+0.023
,+0.023
!"+0.021
.+0.020
Neg logit contributions
utes-0.077
arl-0.071
tr-0.069
ute-0.068
osity-0.067
Token Anna
Token ID11735
Feature activation+1.57e-02

Pos logit contributions
!+0.018
!"+0.016
 years+0.014
,+0.013
 scared+0.013
Neg logit contributions
utes-0.026
tr-0.024
arl-0.022
osity-0.022
itted-0.021
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
!+0.011
!"+0.010
 trouble+0.009
 free+0.009
 fright+0.008
Neg logit contributions
utes-0.041
tr-0.040
der-0.037
 conductor-0.037
itted-0.036
Token She
Token ID1375
Feature activation+1.18e-02

Pos logit contributions
!+0.017
!"+0.016
 years+0.014
 scared+0.013
 hurt+0.013
Neg logit contributions
utes-0.021
tr-0.019
arl-0.018
der-0.018
itted-0.018
Token was
Token ID373
Feature activation+1.72e-02

Pos logit contributions
!+0.018
!"+0.017
 years+0.013
,+0.013
!".+0.013
Neg logit contributions
utes-0.026
tr-0.025
itted-0.021
der-0.021
 conductor-0.021
Token three
Token ID1115
Feature activation-3.03e-02

Pos logit contributions
!+0.023
!"+0.020
,+0.018
 scared+0.017
 years+0.017
Neg logit contributions
utes-0.044
tr-0.040
arl-0.038
der-0.038
osity-0.037
Token years
Token ID812
Feature activation+1.39e-01

Pos logit contributions
utes+0.071
der+0.066
itted+0.065
 songs+0.064
tr+0.062
Neg logit contributions
!-0.026
!"-0.025
?!"-0.022
!".-0.022
't-0.021
Token old
Token ID1468
Feature activation+4.66e-03

Pos logit contributions
!+0.221
!"+0.202
,+0.166
 years+0.166
 scared+0.162
Neg logit contributions
utes-0.311
tr-0.277
arl-0.263
der-0.256
itted-0.254
Token and
Token ID290
Feature activation-9.47e-03

Pos logit contributions
!+0.005
!"+0.004
 free+0.003
 scared+0.003
e+0.003
Neg logit contributions
utes-0.011
tr-0.011
der-0.011
itted-0.010
 conductor-0.010
Token loved
Token ID6151
Feature activation+2.65e-02

Pos logit contributions
utes+0.018
tr+0.017
itted+0.015
 conductor+0.014
Years+0.014
Neg logit contributions
!-0.016
!"-0.016
!".-0.013
,-0.012
!",-0.012
Token to
Token ID284
Feature activation+1.58e-03

Pos logit contributions
!+0.041
!"+0.037
,+0.031
 years+0.030
 scared+0.029
Neg logit contributions
utes-0.062
tr-0.056
arl-0.052
der-0.051
itted-0.051
Token explore
Token ID7301
Feature activation+3.90e-02

Pos logit contributions
!+0.002
!"+0.002
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.003
tr-0.003
arl-0.003
der-0.003
ute-0.003
Tokenmy
Token ID1820
Feature activation+5.74e-03

Pos logit contributions
!+0.061
!"+0.058
 years+0.052
 scared+0.050
 free+0.049
Neg logit contributions
utes-0.071
tr-0.069
arl-0.061
itted-0.059
der-0.058
Token.
Token ID13
Feature activation+9.01e-03

Pos logit contributions
!+0.004
!"+0.003
 years+0.002
,+0.002
 scared+0.001
Neg logit contributions
utes-0.018
tr-0.017
der-0.016
arl-0.016
itted-0.016
Token He
Token ID679
Feature activation-1.90e-02

Pos logit contributions
!+0.013
!"+0.012
 years+0.011
 scared+0.010
,+0.009
Neg logit contributions
utes-0.021
tr-0.020
arl-0.018
der-0.017
itted-0.017
Token was
Token ID373
Feature activation+1.46e-02

Pos logit contributions
utes+0.038
tr+0.036
itted+0.031
der+0.031
ute+0.030
Neg logit contributions
!-0.033
!"-0.031
,-0.026
 years-0.026
!".-0.025
Token 3
Token ID513
Feature activation-1.08e-02

Pos logit contributions
!+0.014
!"+0.011
 scared+0.010
,+0.010
 the+0.009
Neg logit contributions
utes-0.043
tr-0.040
der-0.039
arl-0.039
osity-0.038
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.026
tr+0.024
der+0.024
oys+0.023
itted+0.023
Neg logit contributions
!-0.010
!"-0.009
!".-0.007
 free-0.007
!",-0.007
Token old
Token ID1468
Feature activation+5.75e-03

Pos logit contributions
!+0.205
!"+0.184
,+0.156
 years+0.150
 scared+0.146
Neg logit contributions
utes-0.332
tr-0.304
arl-0.282
der-0.277
itted-0.276
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
!+0.006
!"+0.005
 free+0.004
 scared+0.004
 years+0.004
Neg logit contributions
utes-0.015
tr-0.015
der-0.014
itted-0.013
arl-0.013
Token it
Token ID340
Feature activation-4.87e-03

Pos logit contributions
utes+0.023
tr+0.022
itted+0.019
arl+0.019
der+0.019
Neg logit contributions
!-0.020
!"-0.019
,-0.015
 years-0.015
!".-0.015
Token was
Token ID373
Feature activation+1.45e-02

Pos logit contributions
tr+0.011
utes+0.010
 conductor+0.009
itted+0.009
Years+0.009
Neg logit contributions
!"-0.007
!-0.007
!",-0.005
!".-0.005
 years-0.005
Token a
Token ID257
Feature activation+4.80e-03

Pos logit contributions
!+0.019
!"+0.018
 years+0.013
,+0.013
!".+0.013
Neg logit contributions
utes-0.035
tr-0.033
itted-0.030
arl-0.030
der-0.029
Token.
Token ID13
Feature activation+8.30e-03

Pos logit contributions
!+0.036
!"+0.028
,+0.017
 years+0.015
 scared+0.014
Neg logit contributions
utes-0.169
tr-0.159
arl-0.150
der-0.148
itted-0.148
Token She
Token ID1375
Feature activation+1.22e-02

Pos logit contributions
!+0.011
!"+0.010
,+0.008
.+0.007
 years+0.007
Neg logit contributions
utes-0.021
tr-0.020
arl-0.018
der-0.018
itted-0.018
Token was
Token ID373
Feature activation+1.57e-02

Pos logit contributions
!+0.014
!"+0.012
,+0.009
 years+0.009
 scared+0.009
Neg logit contributions
utes-0.033
tr-0.030
der-0.028
itted-0.028
arl-0.028
Token only
Token ID691
Feature activation+8.80e-03

Pos logit contributions
!+0.019
!"+0.017
,+0.014
 years+0.013
 scared+0.013
Neg logit contributions
utes-0.042
tr-0.039
arl-0.036
der-0.036
itted-0.035
Token three
Token ID1115
Feature activation-3.23e-02

Pos logit contributions
!+0.019
!"+0.017
,+0.016
 scared+0.015
.+0.015
Neg logit contributions
utes-0.017
tr-0.015
arl-0.014
der-0.013
ute-0.013
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.079
der+0.077
itted+0.073
 chords+0.071
 lear+0.070
Neg logit contributions
!"-0.024
!-0.023
?!"-0.020
!".-0.018
 free-0.017
Token old
Token ID1468
Feature activation+5.14e-03

Pos logit contributions
!+0.214
!"+0.196
,+0.162
 years+0.159
 scared+0.158
Neg logit contributions
utes-0.316
tr-0.282
arl-0.266
der-0.262
itted-0.259
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
!+0.004
!"+0.004
 free+0.002
 scared+0.002
 years+0.002
Neg logit contributions
utes-0.014
tr-0.014
der-0.013
itted-0.012
 conductor-0.012
Token very
Token ID845
Feature activation+2.10e-02

Pos logit contributions
utes+0.031
tr+0.029
der+0.026
itted+0.026
osity+0.026
Neg logit contributions
!-0.019
!"-0.018
,-0.014
 years-0.014
!".-0.013
Token excited
Token ID6568
Feature activation+3.30e-02

Pos logit contributions
!+0.030
!"+0.028
 years+0.022
e+0.021
!".+0.021
Neg logit contributions
utes-0.046
tr-0.040
itted-0.040
der-0.039
arl-0.039
Token to
Token ID284
Feature activation+5.12e-04

Pos logit contributions
!+0.032
!"+0.027
,+0.021
 years+0.019
 scared+0.018
Neg logit contributions
utes-0.096
tr-0.089
arl-0.084
der-0.083
itted-0.082
Token Lisa
Token ID15378
Feature activation-2.44e-02

Pos logit contributions
!+0.037
!"+0.033
,+0.028
 years+0.027
 scared+0.026
Neg logit contributions
utes-0.058
tr-0.053
arl-0.049
der-0.049
itted-0.048
Token.
Token ID13
Feature activation-2.58e-03

Pos logit contributions
utes+0.069
tr+0.067
der+0.061
 conductor+0.060
itted+0.059
Neg logit contributions
!-0.015
!"-0.011
 scared-0.010
 panic-0.010
 years-0.010
Token I
Token ID314
Feature activation+5.67e-03

Pos logit contributions
utes+0.007
tr+0.006
arl+0.006
der+0.006
itted+0.006
Neg logit contributions
!-0.003
!"-0.003
,-0.003
 scared-0.002
 years-0.002
Token'm
Token ID1101
Feature activation+2.35e-02

Pos logit contributions
!+0.008
!"+0.007
 years+0.006
,+0.006
!".+0.005
Neg logit contributions
utes-0.013
tr-0.012
ute-0.011
itted-0.011
arl-0.011
Token three
Token ID1115
Feature activation-2.81e-02

Pos logit contributions
!+0.030
!"+0.027
 years+0.021
,+0.021
!".+0.019
Neg logit contributions
utes-0.059
tr-0.054
arl-0.050
itted-0.050
der-0.050
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.070
der+0.064
 songs+0.062
 chords+0.061
itted+0.059
Neg logit contributions
!-0.016
 panic-0.015
 reach-0.014
 search-0.014
't-0.012
Token old
Token ID1468
Feature activation+3.19e-03

Pos logit contributions
!+0.211
!"+0.190
,+0.160
 years+0.155
 scared+0.152
Neg logit contributions
utes-0.325
tr-0.294
arl-0.275
der-0.269
itted-0.268
Token."
Token ID526
Feature activation+2.49e-02

Pos logit contributions
!+0.002
!"+0.001
 years+0.001
 free+0.001
 scared+0.001
Neg logit contributions
utes-0.009
der-0.009
tr-0.009
osity-0.008
itted-0.008
Token
Token ID220
Feature activation+3.14e-02

Pos logit contributions
!+0.026
!"+0.023
 years+0.017
 scared+0.017
,+0.016
Neg logit contributions
utes-0.068
tr-0.063
itted-0.059
der-0.058
arl-0.058
Token
Token ID198
Feature activation-2.95e-04

Pos logit contributions
!+0.034
!"+0.029
 years+0.026
,+0.022
 scared+0.022
Neg logit contributions
utes-0.087
tr-0.079
arl-0.075
itted-0.074
der-0.072
Token
Token ID198
Feature activation-5.08e-04

Pos logit contributions
utes+0.001
tr+0.001
arl+0.001
itted+0.001
der+0.001
Neg logit contributions
!-0.000
!"-0.000
,-0.000
 years-0.000
 scared-0.000
Token brother
Token ID3956
Feature activation-3.92e-02

Pos logit contributions
!+0.053
!"+0.050
 years+0.044
,+0.043
 scared+0.043
Neg logit contributions
utes-0.036
tr-0.031
arl-0.028
der-0.027
itted-0.027
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
utes+0.110
tr+0.107
der+0.102
arl+0.099
itted+0.098
Neg logit contributions
!-0.024
!"-0.021
 years-0.016
 scared-0.013
 free-0.013
Token He
Token ID679
Feature activation-1.93e-02

Pos logit contributions
!+0.011
!"+0.010
,+0.009
 years+0.008
 scared+0.008
Neg logit contributions
utes-0.017
tr-0.016
arl-0.015
der-0.014
itted-0.014
Token is
Token ID318
Feature activation+2.40e-02

Pos logit contributions
utes+0.042
tr+0.040
der+0.037
arl+0.036
itted+0.035
Neg logit contributions
!-0.028
!"-0.025
 years-0.022
,-0.021
 free-0.020
Token six
Token ID2237
Feature activation-2.38e-02

Pos logit contributions
!+0.031
!"+0.027
,+0.022
 years+0.021
 scared+0.020
Neg logit contributions
utes-0.062
tr-0.057
arl-0.053
itted-0.052
der-0.052
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.060
der+0.055
tr+0.051
arl+0.051
itted+0.050
Neg logit contributions
!-0.022
!"-0.021
 free-0.017
!".-0.017
 scared-0.016
Token old
Token ID1468
Feature activation+3.66e-03

Pos logit contributions
!+0.200
!"+0.179
,+0.153
 years+0.144
.+0.141
Neg logit contributions
utes-0.339
tr-0.314
arl-0.287
itted-0.284
der-0.283
Token and
Token ID290
Feature activation-5.74e-03

Pos logit contributions
!+0.002
!"+0.002
,+0.001
 years+0.001
 scared+0.001
Neg logit contributions
utes-0.012
tr-0.011
der-0.010
arl-0.010
itted-0.010
Token he
Token ID339
Feature activation+2.32e-02

Pos logit contributions
utes+0.015
tr+0.014
arl+0.013
ute+0.013
itted+0.013
Neg logit contributions
!-0.008
!"-0.007
,-0.006
 scared-0.005
.-0.005
Token thinks
Token ID6834
Feature activation-2.62e-03

Pos logit contributions
!+0.038
!"+0.034
 years+0.031
,+0.030
!".+0.029
Neg logit contributions
utes-0.043
tr-0.043
der-0.037
itted-0.036
 lear-0.035
Token he
Token ID339
Feature activation+2.33e-02

Pos logit contributions
utes+0.007
tr+0.006
arl+0.006
itted+0.006
der+0.006
Neg logit contributions
!-0.004
!"-0.003
,-0.003
 years-0.003
.-0.003
Token twins
Token ID20345
Feature activation-5.72e-03

Pos logit contributions
!+0.058
!"+0.053
,+0.046
 years+0.044
 scared+0.043
Neg logit contributions
utes-0.084
tr-0.077
arl-0.071
der-0.070
itted-0.069
Token.
Token ID13
Feature activation+1.78e-04

Pos logit contributions
utes+0.014
der+0.013
tr+0.012
 conductor+0.011
ute+0.011
Neg logit contributions
 trouble-0.005
 panic-0.005
!-0.005
!"-0.004
 fright-0.004
Token They
Token ID1119
Feature activation-6.14e-03

Pos logit contributions
!+0.000
!"+0.000
,+0.000
 years+0.000
 scared+0.000
Neg logit contributions
utes-0.000
tr-0.000
arl-0.000
der-0.000
itted-0.000
Token are
Token ID389
Feature activation+3.67e-02

Pos logit contributions
utes+0.015
tr+0.014
arl+0.013
der+0.013
itted+0.013
Neg logit contributions
!-0.009
!"-0.008
,-0.006
 years-0.006
 scared-0.006
Token three
Token ID1115
Feature activation-2.42e-02

Pos logit contributions
!+0.051
!"+0.044
,+0.042
 scared+0.038
.+0.038
Neg logit contributions
utes-0.091
tr-0.088
der-0.084
arl-0.082
osity-0.080
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.060
der+0.055
 songs+0.051
itted+0.050
 chords+0.050
Neg logit contributions
!"-0.018
!-0.018
?!"-0.017
't-0.016
!".-0.016
Token old
Token ID1468
Feature activation+3.86e-03

Pos logit contributions
!+0.197
!"+0.176
,+0.152
 years+0.142
.+0.139
Neg logit contributions
utes-0.339
tr-0.316
arl-0.289
der-0.286
itted-0.285
Token.
Token ID13
Feature activation-1.31e-04

Pos logit contributions
!+0.003
!"+0.002
 scared+0.001
 years+0.001
 free+0.001
Neg logit contributions
utes-0.012
tr-0.011
der-0.010
arl-0.010
itted-0.010
Token They
Token ID1119
Feature activation-5.96e-03

Pos logit contributions
utes+0.000
tr+0.000
arl+0.000
der+0.000
itted+0.000
Neg logit contributions
!-0.000
!"-0.000
,-0.000
 years-0.000
 scared-0.000
Token like
Token ID588
Feature activation+2.15e-02

Pos logit contributions
utes+0.015
tr+0.013
arl+0.013
der+0.012
itted+0.012
Neg logit contributions
!-0.008
!"-0.007
,-0.006
 years-0.006
 scared-0.006
Token to
Token ID284
Feature activation+3.60e-04

Pos logit contributions
!+0.022
 the+0.018
!"+0.018
,+0.017
.+0.017
Neg logit contributions
utes-0.061
tr-0.059
arl-0.058
der-0.057
uster-0.056
Token though
Token ID996
Feature activation+3.03e-02

Pos logit contributions
utes+0.034
tr+0.031
arl+0.029
itted+0.029
der+0.028
Neg logit contributions
!-0.018
!"-0.016
,-0.014
 scared-0.013
 years-0.013
Token he
Token ID339
Feature activation+2.07e-02

Pos logit contributions
!+0.036
!"+0.036
 years+0.027
!".+0.027
!",+0.026
Neg logit contributions
utes-0.071
der-0.063
tr-0.062
osity-0.057
 conductor-0.056
Token was
Token ID373
Feature activation+1.60e-02

Pos logit contributions
!"+0.028
!+0.027
!".+0.023
,"+0.022
."+0.022
Neg logit contributions
tr-0.043
utes-0.042
der-0.040
ashed-0.038
itted-0.037
Token only
Token ID691
Feature activation+1.13e-02

Pos logit contributions
!+0.022
!"+0.020
 years+0.015
,+0.015
!".+0.015
Neg logit contributions
utes-0.039
tr-0.035
itted-0.033
der-0.033
arl-0.033
Token three
Token ID1115
Feature activation-2.60e-02

Pos logit contributions
!+0.021
 scared+0.018
,+0.018
!"+0.018
.+0.017
Neg logit contributions
utes-0.025
tr-0.023
arl-0.021
ute-0.020
osity-0.019
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
der+0.066
utes+0.063
itted+0.058
 chords+0.057
 lear+0.056
Neg logit contributions
!"-0.022
!-0.021
!".-0.016
 free-0.016
?!"-0.015
Token old
Token ID1468
Feature activation+4.58e-03

Pos logit contributions
!+0.198
!"+0.176
,+0.153
 years+0.142
.+0.140
Neg logit contributions
utes-0.343
tr-0.318
arl-0.289
itted-0.286
der-0.285
Token,
Token ID11
Feature activation-5.29e-04

Pos logit contributions
!+0.002
!"+0.002
 free+0.001
 scared+0.001
 years+0.001
Neg logit contributions
utes-0.014
tr-0.013
der-0.013
itted-0.012
osity-0.012
Token he
Token ID339
Feature activation+2.18e-02

Pos logit contributions
utes+0.001
tr+0.001
arl+0.001
itted+0.001
der+0.001
Neg logit contributions
!-0.001
!"-0.000
,-0.000
 years-0.000
 scared-0.000
Token felt
Token ID2936
Feature activation-2.05e-02

Pos logit contributions
!+0.031
!"+0.029
,+0.023
 years+0.022
!".+0.022
Neg logit contributions
utes-0.051
tr-0.047
der-0.044
itted-0.043
 conductor-0.042
Token proud
Token ID6613
Feature activation-3.46e-03

Pos logit contributions
utes+0.043
der+0.040
itted+0.039
tr+0.038
arl+0.037
Neg logit contributions
!-0.029
!"-0.029
!".-0.021
!",-0.021
 years-0.021
Token brother
Token ID3956
Feature activation-3.77e-02

Pos logit contributions
!+0.053
!"+0.050
 years+0.044
,+0.043
 scared+0.043
Neg logit contributions
utes-0.037
tr-0.033
der-0.029
arl-0.029
itted-0.028
Token.
Token ID13
Feature activation+1.43e-03

Pos logit contributions
tr+0.101
utes+0.098
der+0.094
oys+0.090
 conductor+0.090
Neg logit contributions
!"-0.022
!-0.020
 free-0.016
 years-0.015
!",-0.014
Token He
Token ID679
Feature activation-2.03e-02

Pos logit contributions
!+0.002
!"+0.002
,+0.002
.+0.001
 scared+0.001
Neg logit contributions
utes-0.004
tr-0.003
arl-0.003
ute-0.003
itted-0.003
Token is
Token ID318
Feature activation+2.42e-02

Pos logit contributions
utes+0.045
tr+0.045
der+0.040
arl+0.039
itted+0.038
Neg logit contributions
!-0.027
!"-0.025
 years-0.021
 free-0.020
!".-0.020
Token six
Token ID2237
Feature activation-2.52e-02

Pos logit contributions
!+0.032
!"+0.029
,+0.023
 years+0.022
.+0.021
Neg logit contributions
utes-0.061
tr-0.057
arl-0.053
itted-0.052
der-0.052
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.053
der+0.052
tr+0.046
 lear+0.045
 chords+0.044
Neg logit contributions
!"-0.028
!-0.028
 free-0.025
!".-0.025
!",-0.024
Token old
Token ID1468
Feature activation+4.51e-03

Pos logit contributions
!+0.202
!"+0.181
,+0.155
 years+0.147
 scared+0.143
Neg logit contributions
utes-0.335
tr-0.309
arl-0.284
itted-0.280
der-0.279
Token.
Token ID13
Feature activation+8.46e-04

Pos logit contributions
!+0.003
!"+0.002
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.014
tr-0.013
der-0.013
arl-0.013
itted-0.013
Token He
Token ID679
Feature activation-2.06e-02

Pos logit contributions
!+0.001
!"+0.001
,+0.001
.+0.001
 scared+0.001
Neg logit contributions
utes-0.002
tr-0.002
ute-0.002
arl-0.002
itted-0.002
Token thinks
Token ID6834
Feature activation-3.02e-03

Pos logit contributions
utes+0.048
tr+0.046
der+0.042
arl+0.041
itted+0.040
Neg logit contributions
!-0.028
!"-0.025
 years-0.021
,-0.020
 free-0.019
Token he
Token ID339
Feature activation+2.46e-02

Pos logit contributions
utes+0.008
tr+0.008
arl+0.007
itted+0.007
ute+0.007
Neg logit contributions
!-0.004
!"-0.003
,-0.003
.-0.002
 years-0.002
Token Ann
Token ID5506
Feature activation+3.98e-02

Pos logit contributions
!+0.013
!"+0.012
,+0.010
.+0.009
 scared+0.009
Neg logit contributions
utes-0.027
tr-0.024
arl-0.023
der-0.023
itted-0.023
Token.
Token ID13
Feature activation+5.41e-03

Pos logit contributions
!+0.028
!"+0.022
,+0.014
 years+0.011
.+0.011
Neg logit contributions
utes-0.128
tr-0.119
arl-0.113
itted-0.111
der-0.111
Token Ann
Token ID5506
Feature activation+3.76e-02

Pos logit contributions
!+0.008
!"+0.007
,+0.006
 years+0.005
.+0.005
Neg logit contributions
utes-0.013
tr-0.012
arl-0.011
der-0.011
itted-0.011
Token was
Token ID373
Feature activation+9.74e-03

Pos logit contributions
!+0.047
!"+0.043
 free+0.033
 scared+0.033
 trouble+0.033
Neg logit contributions
utes-0.088
tr-0.085
der-0.076
itted-0.076
 conductor-0.074
Token three
Token ID1115
Feature activation-3.09e-02

Pos logit contributions
!+0.012
!"+0.011
,+0.008
 years+0.008
.+0.007
Neg logit contributions
utes-0.026
tr-0.024
arl-0.022
itted-0.022
der-0.022
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.069
der+0.065
 songs+0.064
 chords+0.063
itted+0.063
Neg logit contributions
!"-0.026
?!"-0.025
!-0.025
!".-0.023
't-0.022
Token old
Token ID1468
Feature activation+3.96e-03

Pos logit contributions
!+0.219
!"+0.203
,+0.165
 years+0.165
 scared+0.163
Neg logit contributions
utes-0.306
tr-0.271
arl-0.257
der-0.251
itted-0.248
Token.
Token ID13
Feature activation+1.40e-03

Pos logit contributions
!+0.003
!"+0.003
 scared+0.002
 years+0.002
 free+0.002
Neg logit contributions
utes-0.011
tr-0.011
der-0.010
itted-0.010
arl-0.009
Token One
Token ID1881
Feature activation+5.26e-03

Pos logit contributions
!+0.002
!"+0.002
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.003
tr-0.003
arl-0.003
itted-0.003
der-0.003
Token day
Token ID1110
Feature activation-7.97e-03

Pos logit contributions
!+0.010
!"+0.010
,+0.008
 years+0.008
 scared+0.008
Neg logit contributions
utes-0.010
tr-0.009
arl-0.008
itted-0.008
der-0.008
Token,
Token ID11
Feature activation-4.74e-03

Pos logit contributions
utes+0.024
tr+0.022
der+0.020
itted+0.020
arl+0.020
Neg logit contributions
!-0.006
!"-0.005
 years-0.004
 scared-0.003
,-0.003
Token a
Token ID257
Feature activation+1.28e-03

Pos logit contributions
utes+0.005
tr+0.004
arl+0.004
itted+0.004
osity+0.004
Neg logit contributions
!-0.002
!"-0.001
,-0.001
 scared-0.001
.-0.001
Token little
Token ID1310
Feature activation-2.75e-02

Pos logit contributions
!+0.002
!"+0.002
,+0.001
 scared+0.001
 years+0.001
Neg logit contributions
utes-0.003
tr-0.003
itted-0.003
arl-0.003
der-0.003
Token boy
Token ID2933
Feature activation+7.88e-04

Pos logit contributions
utes+0.056
tr+0.049
arl+0.045
itted+0.044
der+0.044
Neg logit contributions
!-0.054
!"-0.049
,-0.046
 scared-0.044
.-0.042
Token of
Token ID286
Feature activation-3.75e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
!".+0.001
."+0.001
Neg logit contributions
utes-0.002
tr-0.002
 conductor-0.001
itted-0.001
der-0.001
Token three
Token ID1115
Feature activation-3.05e-02

Pos logit contributions
utes+0.116
tr+0.109
arl+0.102
der+0.101
itted+0.101
Neg logit contributions
!-0.032
!"-0.025
,-0.019
 years-0.017
 scared-0.016
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.066
itted+0.058
der+0.058
 chords+0.055
ashed+0.055
Neg logit contributions
!-0.041
!"-0.040
!".-0.033
 free-0.031
."-0.030
Token old
Token ID1468
Feature activation+4.42e-03

Pos logit contributions
!+0.205
!"+0.184
,+0.156
 years+0.149
 scared+0.146
Neg logit contributions
utes-0.331
tr-0.303
arl-0.280
der-0.275
itted-0.275
Token came
Token ID1625
Feature activation-1.50e-02

Pos logit contributions
!+0.005
!"+0.005
 years+0.004
,+0.004
 scared+0.004
Neg logit contributions
utes-0.011
tr-0.010
der-0.010
itted-0.010
 conductor-0.009
Token up
Token ID510
Feature activation+7.40e-03

Pos logit contributions
utes+0.035
tr+0.031
arl+0.029
itted+0.029
osity+0.028
Neg logit contributions
!-0.022
!"-0.021
 years-0.017
,-0.016
 scared-0.016
Token to
Token ID284
Feature activation+8.42e-04

Pos logit contributions
!+0.009
!"+0.009
 years+0.007
!".+0.007
 scared+0.007
Neg logit contributions
utes-0.017
tr-0.016
itted-0.015
arl-0.015
der-0.015
Token her
Token ID607
Feature activation+1.47e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
 scared+0.001
,+0.001
Neg logit contributions
utes-0.002
tr-0.002
 conductor-0.002
arl-0.002
 lear-0.002
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
!+0.020
!"+0.016
 years+0.008
,+0.007
 scared+0.006
Neg logit contributions
utes-0.116
tr-0.110
arl-0.103
der-0.103
itted-0.102
Token Tom
Token ID4186
Feature activation+3.09e-02

Pos logit contributions
!+0.015
!"+0.014
,+0.012
.+0.011
 scared+0.010
Neg logit contributions
utes-0.027
tr-0.024
arl-0.023
itted-0.023
der-0.023
Token was
Token ID373
Feature activation+1.11e-02

Pos logit contributions
!+0.042
!"+0.041
 years+0.034
!".+0.032
 trouble+0.032
Neg logit contributions
utes-0.065
tr-0.064
der-0.057
 conductor-0.056
itted-0.056
Token only
Token ID691
Feature activation+5.26e-03

Pos logit contributions
!+0.013
!"+0.011
,+0.009
 years+0.008
 scared+0.008
Neg logit contributions
utes-0.031
tr-0.028
arl-0.027
der-0.026
itted-0.026
Token 3
Token ID513
Feature activation-1.42e-02

Pos logit contributions
!+0.011
!"+0.010
,+0.009
 scared+0.009
.+0.009
Neg logit contributions
utes-0.010
tr-0.009
arl-0.009
der-0.008
osity-0.008
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.034
der+0.033
itted+0.031
 chords+0.031
oys+0.031
Neg logit contributions
!"-0.011
!-0.011
?!"-0.009
!".-0.009
 free-0.008
Token old
Token ID1468
Feature activation+6.30e-03

Pos logit contributions
!+0.206
!"+0.186
,+0.157
 years+0.151
 scared+0.147
Neg logit contributions
utes-0.329
tr-0.300
arl-0.279
der-0.274
itted-0.273
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
!+0.005
!"+0.005
 free+0.003
 years+0.003
 scared+0.003
Neg logit contributions
utes-0.017
tr-0.017
der-0.016
itted-0.015
arl-0.015
Token he
Token ID339
Feature activation+2.11e-02

Pos logit contributions
utes+0.025
tr+0.023
der+0.021
itted+0.021
arl+0.021
Neg logit contributions
!-0.016
!"-0.014
,-0.012
 years-0.011
!".-0.011
Token had
Token ID550
Feature activation-1.81e-02

Pos logit contributions
!+0.032
!"+0.031
!".+0.025
,+0.024
."+0.024
Neg logit contributions
tr-0.042
utes-0.041
 conductor-0.036
der-0.036
itted-0.035
Token just
Token ID655
Feature activation-2.13e-02

Pos logit contributions
utes+0.046
tr+0.043
arl+0.040
der+0.039
itted+0.039
Neg logit contributions
!-0.024
!"-0.021
,-0.017
 years-0.017
 scared-0.016
Token Sally
Token ID25737
Feature activation+5.32e-02

Pos logit contributions
utes+0.004
tr+0.004
arl+0.004
itted+0.004
der+0.004
Neg logit contributions
!-0.002
!"-0.002
,-0.002
.-0.002
 scared-0.002
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
!+0.042
!"+0.039
 years+0.037
 trouble+0.031
 fright+0.031
Neg logit contributions
utes-0.139
tr-0.137
der-0.125
itted-0.119
 conductor-0.118
Token She
Token ID1375
Feature activation+1.39e-02

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 years+0.009
.+0.009
Neg logit contributions
utes-0.026
tr-0.024
arl-0.022
der-0.022
itted-0.022
Token was
Token ID373
Feature activation+1.55e-02

Pos logit contributions
!+0.019
!"+0.017
 years+0.014
,+0.013
 scared+0.013
Neg logit contributions
utes-0.033
tr-0.031
der-0.028
itted-0.027
 conductor-0.027
Token three
Token ID1115
Feature activation-3.05e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.013
 scared+0.012
 years+0.012
Neg logit contributions
utes-0.043
tr-0.039
arl-0.037
der-0.037
itted-0.036
Token years
Token ID812
Feature activation+1.38e-01

Pos logit contributions
utes+0.070
itted+0.062
der+0.061
 songs+0.061
as+0.060
Neg logit contributions
!-0.025
!"-0.024
't-0.023
?!"-0.022
!".-0.021
Token old
Token ID1468
Feature activation+5.29e-03

Pos logit contributions
!+0.220
!"+0.201
,+0.166
 years+0.166
 scared+0.163
Neg logit contributions
utes-0.308
tr-0.271
arl-0.258
der-0.252
itted-0.249
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
!+0.004
!"+0.004
 free+0.003
 scared+0.003
e+0.003
Neg logit contributions
utes-0.014
tr-0.014
der-0.013
itted-0.012
 conductor-0.012
Token very
Token ID845
Feature activation+2.02e-02

Pos logit contributions
utes+0.022
tr+0.022
itted+0.019
 conductor+0.018
Years+0.018
Neg logit contributions
!-0.018
!"-0.018
,-0.014
!".-0.014
 years-0.014
Token happy
Token ID3772
Feature activation-1.24e-03

Pos logit contributions
!+0.031
!"+0.028
 years+0.024
e+0.022
!".+0.022
Neg logit contributions
utes-0.042
tr-0.037
itted-0.036
arl-0.036
osity-0.036
Token.
Token ID13
Feature activation+5.58e-04

Pos logit contributions
utes+0.004
tr+0.004
arl+0.004
der+0.004
itted+0.004
Neg logit contributions
!-0.001
!"-0.001
,-0.000
 years-0.000
 scared-0.000
Token saw
Token ID2497
Feature activation-1.94e-02

Pos logit contributions
!+0.033
!"+0.030
 years+0.025
,+0.024
 scared+0.023
Neg logit contributions
utes-0.057
tr-0.053
der-0.048
itted-0.048
arl-0.047
Token a
Token ID257
Feature activation+4.89e-03

Pos logit contributions
utes+0.061
tr+0.056
arl+0.055
itted+0.053
osity+0.053
Neg logit contributions
!-0.018
!"-0.013
,-0.012
 the-0.011
 scared-0.010
Token dusty
Token ID36972
Feature activation+1.29e-02

Pos logit contributions
!+0.007
!"+0.006
,+0.005
 scared+0.005
 years+0.005
Neg logit contributions
utes-0.012
tr-0.011
itted-0.011
arl-0.011
der-0.011
Token old
Token ID1468
Feature activation+3.09e-03

Pos logit contributions
!+0.022
!"+0.020
,+0.018
 years+0.017
 scared+0.016
Neg logit contributions
utes-0.028
tr-0.026
arl-0.024
itted-0.023
der-0.023
Token ash
Token ID12530
Feature activation-4.21e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
 years+0.004
 scared+0.004
Neg logit contributions
utes-0.007
tr-0.006
arl-0.006
itted-0.006
der-0.006
Tokentr
Token ID2213
Feature activation-2.37e-01

Pos logit contributions
utes+0.055
arl+0.040
osity+0.039
uster+0.038
itted+0.037
Neg logit contributions
!-0.122
!"-0.109
.-0.109
,-0.108
 tree-0.095
Tokenay
Token ID323
Feature activation-3.89e-02

Pos logit contributions
utes+0.426
tr+0.346
arl+0.328
osity+0.324
itted+0.321
Neg logit contributions
!-0.515
!"-0.473
,-0.430
.-0.406
 years-0.391
Token on
Token ID319
Feature activation+3.74e-03

Pos logit contributions
utes+0.109
tr+0.098
arl+0.094
itted+0.092
der+0.090
Neg logit contributions
!-0.048
!"-0.040
,-0.035
.-0.032
 years-0.028
Token the
Token ID262
Feature activation+9.21e-03

Pos logit contributions
!+0.003
!"+0.003
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.011
tr-0.010
der-0.010
arl-0.009
 conductor-0.009
Token shelf
Token ID18316
Feature activation+5.29e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.012
 years+0.012
.+0.012
Neg logit contributions
utes-0.021
tr-0.018
arl-0.017
itted-0.017
der-0.017
Token.
Token ID13
Feature activation-1.73e-03

Pos logit contributions
!+0.028
!"+0.022
 years+0.009
,+0.008
 scared+0.008
Neg logit contributions
utes-0.173
tr-0.164
der-0.154
arl-0.153
itted-0.152
Token decided
Token ID3066
Feature activation+8.33e-03

Pos logit contributions
!+0.031
!"+0.029
 years+0.023
!".+0.022
,+0.022
Neg logit contributions
utes-0.053
tr-0.050
ashed-0.045
der-0.044
 conductor-0.044
Token to
Token ID284
Feature activation-2.04e-04

Pos logit contributions
!+0.013
!"+0.012
 years+0.010
,+0.010
 scared+0.010
Neg logit contributions
utes-0.019
tr-0.017
der-0.016
arl-0.016
itted-0.015
Token make
Token ID787
Feature activation-8.57e-03

Pos logit contributions
utes+0.000
tr+0.000
arl+0.000
ute+0.000
der+0.000
Neg logit contributions
!-0.000
!"-0.000
 years-0.000
,-0.000
 scared-0.000
Token an
Token ID281
Feature activation+3.95e-02

Pos logit contributions
utes+0.022
tr+0.019
der+0.018
itted+0.018
ute+0.018
Neg logit contributions
!-0.011
!"-0.010
 years-0.008
,-0.008
 scared-0.007
Token ash
Token ID12530
Feature activation-4.31e-02

Pos logit contributions
!+0.068
!"+0.062
,+0.054
 years+0.053
 scared+0.052
Neg logit contributions
utes-0.085
tr-0.077
arl-0.071
der-0.069
itted-0.069
Tokentr
Token ID2213
Feature activation-2.37e-01

Pos logit contributions
utes+0.044
arl+0.031
itted+0.028
osity+0.028
uster+0.028
Neg logit contributions
!-0.137
!"-0.128
,-0.121
.-0.121
,"-0.111
Tokenay
Token ID323
Feature activation-4.02e-02

Pos logit contributions
utes+0.408
tr+0.339
arl+0.314
der+0.309
itted+0.306
Neg logit contributions
!-0.522
!"-0.485
,-0.436
.-0.413
 scared-0.410
Token out
Token ID503
Feature activation-7.83e-03

Pos logit contributions
utes+0.103
tr+0.093
arl+0.089
itted+0.087
der+0.087
Neg logit contributions
!-0.057
!"-0.050
,-0.042
.-0.038
 scared-0.038
Token of
Token ID286
Feature activation-3.78e-02

Pos logit contributions
utes+0.018
tr+0.016
arl+0.015
der+0.015
itted+0.015
Neg logit contributions
!-0.013
!"-0.011
,-0.010
 years-0.009
 scared-0.009
Token clay
Token ID21558
Feature activation-5.19e-02

Pos logit contributions
utes+0.102
tr+0.095
arl+0.089
der+0.088
itted+0.087
Neg logit contributions
!-0.041
!"-0.036
,-0.028
 years-0.027
 scared-0.026
Token.
Token ID13
Feature activation+7.86e-04

Pos logit contributions
utes+0.160
tr+0.150
arl+0.141
der+0.140
itted+0.139
Neg logit contributions
!-0.040
!"-0.032
,-0.021
 years-0.020
 scared-0.018
Token he
Token ID339
Feature activation+2.37e-02

Pos logit contributions
utes+0.042
tr+0.039
der+0.037
 conductor+0.034
itted+0.034
Neg logit contributions
!-0.024
!"-0.023
 years-0.017
!".-0.017
!",-0.016
Token sp
Token ID599
Feature activation-7.83e-03

Pos logit contributions
!+0.033
!"+0.032
!".+0.026
 years+0.026
!",+0.025
Neg logit contributions
utes-0.051
tr-0.048
ashed-0.045
 conductor-0.042
itted-0.042
Tokenied
Token ID798
Feature activation-5.13e-02

Pos logit contributions
utes+0.011
tr+0.011
osity+0.009
 conductor+0.009
 lear+0.009
Neg logit contributions
!-0.018
!"-0.017
,-0.015
.-0.015
 years-0.015
Token an
Token ID281
Feature activation+4.09e-02

Pos logit contributions
utes+0.134
tr+0.128
der+0.120
itted+0.113
arl+0.113
Neg logit contributions
!-0.054
!"-0.051
 years-0.039
 scared-0.035
,-0.035
Token ash
Token ID12530
Feature activation-4.17e-02

Pos logit contributions
!+0.071
!"+0.064
,+0.057
 years+0.055
 scared+0.054
Neg logit contributions
utes-0.090
tr-0.081
arl-0.076
itted-0.073
osity-0.073
Tokentr
Token ID2213
Feature activation-2.37e-01

Pos logit contributions
utes+0.034
osity+0.019
arl+0.017
uster+0.016
itted+0.016
Neg logit contributions
!-0.141
!"-0.132
.-0.126
,-0.125
."-0.116
Tokenay
Token ID323
Feature activation-3.97e-02

Pos logit contributions
utes+0.400
tr+0.334
arl+0.304
itted+0.298
der+0.298
Neg logit contributions
!-0.530
!"-0.492
,-0.444
.-0.422
 years-0.418
Token.
Token ID13
Feature activation+8.81e-04

Pos logit contributions
utes+0.135
tr+0.122
arl+0.119
itted+0.117
osity+0.116
Neg logit contributions
!-0.026
!"-0.018
,-0.012
.-0.010
."-0.004
Token He
Token ID679
Feature activation-2.27e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.002
tr-0.002
der-0.002
itted-0.002
arl-0.002
Token picked
Token ID6497
Feature activation-1.48e-02

Pos logit contributions
utes+0.061
tr+0.057
arl+0.053
der+0.052
itted+0.052
Neg logit contributions
!-0.027
!"-0.023
,-0.019
 years-0.018
 scared-0.017
Token up
Token ID510
Feature activation+6.56e-03

Pos logit contributions
utes+0.046
tr+0.043
arl+0.040
der+0.040
itted+0.040
Neg logit contributions
!-0.011
!"-0.009
,-0.006
 years-0.006
 scared-0.005
Token very
Token ID845
Feature activation+2.40e-02

Pos logit contributions
!+0.015
!"+0.013
,+0.011
 years+0.010
 scared+0.010
Neg logit contributions
utes-0.030
tr-0.028
arl-0.026
itted-0.026
der-0.025
Token tired
Token ID10032
Feature activation+6.65e-02

Pos logit contributions
!+0.031
!"+0.028
,+0.022
 years+0.022
.+0.020
Neg logit contributions
utes-0.061
tr-0.056
arl-0.053
itted-0.052
der-0.051
Token.
Token ID13
Feature activation-5.88e-03

Pos logit contributions
!+0.037
!"+0.030
 years+0.016
 scared+0.013
,+0.012
Neg logit contributions
utes-0.208
tr-0.198
der-0.191
arl-0.188
itted-0.183
Token The
Token ID383
Feature activation+1.25e-03

Pos logit contributions
utes+0.014
tr+0.013
arl+0.012
der+0.012
itted+0.012
Neg logit contributions
!-0.009
!"-0.008
,-0.007
 years-0.007
 scared-0.006
Token ash
Token ID12530
Feature activation-4.35e-02

Pos logit contributions
!+0.002
!"+0.002
,+0.001
 scared+0.001
.+0.001
Neg logit contributions
utes-0.003
tr-0.003
itted-0.003
osity-0.003
arl-0.003
Tokentr
Token ID2213
Feature activation-2.37e-01

Pos logit contributions
utes+0.113
arl+0.101
osity+0.094
itted+0.092
uster+0.091
Neg logit contributions
!-0.062
!"-0.053
,-0.050
.-0.046
 scared-0.045
Tokenay
Token ID323
Feature activation-3.92e-02

Pos logit contributions
utes+0.441
tr+0.359
arl+0.338
der+0.336
 chords+0.330
Neg logit contributions
!-0.492
!"-0.451
,-0.411
.-0.378
 scared-0.377
Token watched
Token ID7342
Feature activation-3.25e-02

Pos logit contributions
utes+0.109
tr+0.102
arl+0.094
der+0.094
itted+0.093
Neg logit contributions
!-0.042
!"-0.036
,-0.028
 years-0.027
 scared-0.026
Token as
Token ID355
Feature activation+5.08e-03

Pos logit contributions
utes+0.092
tr+0.086
arl+0.080
itted+0.079
der+0.079
Neg logit contributions
!-0.035
!"-0.030
,-0.024
 years-0.021
 scared-0.021
Token the
Token ID262
Feature activation+6.93e-03

Pos logit contributions
!+0.004
!"+0.004
 years+0.002
,+0.002
 scared+0.002
Neg logit contributions
utes-0.015
tr-0.014
der-0.013
arl-0.013
itted-0.013
Token little
Token ID1310
Feature activation-2.53e-02

Pos logit contributions
!+0.011
!"+0.009
 scared+0.009
,+0.009
.+0.008
Neg logit contributions
utes-0.017
tr-0.015
osity-0.015
itted-0.015
der-0.014
Token backyard
Token ID24296
Feature activation-9.19e-04

Pos logit contributions
!+0.014
!"+0.012
,+0.012
 scared+0.011
.+0.011
Neg logit contributions
utes-0.026
tr-0.023
der-0.023
osity-0.023
arl-0.022
Token there
Token ID612
Feature activation+2.06e-02

Pos logit contributions
utes+0.003
tr+0.003
der+0.002
itted+0.002
arl+0.002
Neg logit contributions
!-0.001
!"-0.000
 years-0.000
 scared-0.000
!".-0.000
Token was
Token ID373
Feature activation+1.24e-02

Pos logit contributions
!"+0.035
!+0.035
 years+0.031
 fright+0.030
!".+0.029
Neg logit contributions
utes-0.036
tr-0.035
apped-0.030
der-0.030
itted-0.030
Token an
Token ID281
Feature activation+4.04e-02

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 years+0.009
 scared+0.009
Neg logit contributions
utes-0.034
tr-0.032
arl-0.030
der-0.029
itted-0.029
Token ash
Token ID12530
Feature activation-4.22e-02

Pos logit contributions
!+0.065
!"+0.059
,+0.051
 years+0.049
 scared+0.048
Neg logit contributions
utes-0.092
tr-0.084
arl-0.077
itted-0.076
der-0.076
Tokentr
Token ID2213
Feature activation-2.37e-01

Pos logit contributions
utes+0.037
arl+0.021
osity+0.020
uster+0.018
itted+0.018
Neg logit contributions
!-0.137
!"-0.129
,-0.122
.-0.122
 scared-0.114
Tokenay
Token ID323
Feature activation-3.99e-02

Pos logit contributions
utes+0.389
tr+0.331
arl+0.299
der+0.292
itted+0.289
Neg logit contributions
!-0.535
!"-0.499
,-0.450
 years-0.431
 scared-0.428
Token.
Token ID13
Feature activation-9.05e-04

Pos logit contributions
utes+0.125
tr+0.114
arl+0.110
itted+0.109
osity+0.107
Neg logit contributions
!-0.035
!"-0.028
,-0.021
.-0.018
 scared-0.014
Token It
Token ID632
Feature activation-1.71e-02

Pos logit contributions
utes+0.002
tr+0.001
itted+0.001
arl+0.001
der+0.001
Neg logit contributions
!-0.002
!"-0.002
 years-0.001
 hurt-0.001
 scared-0.001
Token was
Token ID373
Feature activation+1.23e-02

Pos logit contributions
tr+0.042
utes+0.038
der+0.034
 conductor+0.034
itted+0.033
Neg logit contributions
!-0.022
!"-0.021
 years-0.016
!".-0.016
!",-0.015
Token the
Token ID262
Feature activation+8.91e-03

Pos logit contributions
!+0.015
!"+0.013
,+0.010
 years+0.010
 scared+0.009
Neg logit contributions
utes-0.032
tr-0.030
arl-0.028
itted-0.028
der-0.027
Token.
Token ID13
Feature activation+1.19e-03

Pos logit contributions
utes+0.094
tr+0.086
arl+0.083
itted+0.082
der+0.081
Neg logit contributions
!-0.024
!"-0.018
,-0.013
.-0.011
 years-0.010
Token It
Token ID632
Feature activation-1.69e-02

Pos logit contributions
!+0.002
!"+0.002
 years+0.002
e+0.002
,+0.002
Neg logit contributions
utes-0.002
tr-0.002
itted-0.002
arl-0.002
der-0.002
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
tr+0.038
utes+0.031
 conductor+0.030
der+0.028
Years+0.027
Neg logit contributions
!"-0.023
 years-0.022
!-0.021
!".-0.019
."-0.018
Token an
Token ID281
Feature activation+3.97e-02

Pos logit contributions
!+0.014
!"+0.011
,+0.010
 scared+0.009
 years+0.008
Neg logit contributions
utes-0.039
tr-0.036
arl-0.034
itted-0.033
der-0.033
Token ash
Token ID12530
Feature activation-4.30e-02

Pos logit contributions
!+0.065
!"+0.059
,+0.051
 years+0.049
 scared+0.048
Neg logit contributions
utes-0.088
tr-0.080
arl-0.073
der-0.072
itted-0.072
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.031
arl+0.015
itted+0.014
osity+0.013
 chords+0.012
Neg logit contributions
!-0.149
!"-0.139
,-0.132
.-0.130
 scared-0.121
Tokenay
Token ID323
Feature activation-3.92e-02

Pos logit contributions
utes+0.377
tr+0.325
arl+0.289
der+0.282
itted+0.280
Neg logit contributions
!-0.542
!"-0.506
,-0.457
 years-0.444
 scared-0.439
Token.
Token ID13
Feature activation+1.41e-03

Pos logit contributions
utes+0.122
tr+0.116
arl+0.109
der+0.108
itted+0.107
Neg logit contributions
!-0.026
!"-0.021
 years-0.014
,-0.013
 scared-0.012
Token He
Token ID679
Feature activation-2.28e-02

Pos logit contributions
!+0.002
!"+0.002
 years+0.002
e+0.002
,+0.002
Neg logit contributions
utes-0.003
tr-0.003
itted-0.002
arl-0.002
der-0.002
Token wanted
Token ID2227
Feature activation+2.55e-02

Pos logit contributions
utes+0.060
tr+0.056
arl+0.052
der+0.051
itted+0.051
Neg logit contributions
!-0.028
!"-0.025
,-0.020
 years-0.019
 scared-0.018
Token to
Token ID284
Feature activation+5.38e-04

Pos logit contributions
!+0.042
!"+0.039
 years+0.034
 scared+0.031
!".+0.031
Neg logit contributions
utes-0.052
tr-0.048
der-0.043
itted-0.042
 conductor-0.041
Token,
Token ID11
Feature activation+1.75e-03

Pos logit contributions
!+0.001
!"+0.001
 years+0.000
 scared+0.000
!".+0.000
Neg logit contributions
utes-0.003
tr-0.003
arl-0.003
itted-0.003
der-0.003
Token Lily
Token ID20037
Feature activation-1.38e-03

Pos logit contributions
!+0.002
!"+0.002
,+0.002
.+0.002
 scared+0.002
Neg logit contributions
utes-0.005
tr-0.004
arl-0.004
itted-0.004
osity-0.004
Token found
Token ID1043
Feature activation+5.71e-03

Pos logit contributions
utes+0.003
tr+0.003
ashed+0.003
itted+0.003
 conductor+0.003
Neg logit contributions
!-0.002
!"-0.002
 years-0.002
 fright-0.001
 free-0.001
Token an
Token ID281
Feature activation+3.98e-02

Pos logit contributions
!+0.005
!"+0.004
,+0.003
 scared+0.003
 the+0.003
Neg logit contributions
utes-0.017
tr-0.016
arl-0.015
osity-0.015
der-0.015
Token ash
Token ID12530
Feature activation-4.21e-02

Pos logit contributions
!+0.060
!"+0.054
,+0.047
 years+0.044
 scared+0.044
Neg logit contributions
utes-0.096
tr-0.087
arl-0.082
itted-0.079
der-0.079
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.042
osity+0.027
uster+0.026
arl+0.025
 chords+0.025
Neg logit contributions
!-0.137
!"-0.125
,-0.123
.-0.123
 tree-0.113
Tokenay
Token ID323
Feature activation-3.94e-02

Pos logit contributions
utes+0.420
tr+0.343
arl+0.321
der+0.320
osity+0.316
Neg logit contributions
!-0.512
!"-0.474
,-0.430
.-0.402
 scared-0.395
Token in
Token ID287
Feature activation-7.44e-04

Pos logit contributions
utes+0.103
tr+0.091
arl+0.088
itted+0.087
osity+0.085
Neg logit contributions
!-0.057
!"-0.049
,-0.043
.-0.041
 the-0.035
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
utes+0.002
tr+0.001
arl+0.001
der+0.001
Years+0.001
Neg logit contributions
!-0.001
!"-0.001
!",-0.001
 scared-0.001
 years-0.001
Token world
Token ID995
Feature activation+5.52e-02

Pos logit contributions
!+0.015
!"+0.013
,+0.012
.+0.011
 years+0.011
Neg logit contributions
utes-0.028
der-0.024
osity-0.023
itted-0.023
arl-0.023
Token behind
Token ID2157
Feature activation+1.44e-02

Pos logit contributions
!"+0.042
!+0.042
 panic+0.039
 trouble+0.036
 scared+0.036
Neg logit contributions
tr-0.144
utes-0.138
der-0.121
arl-0.120
oys-0.118
Token The
Token ID383
Feature activation+3.66e-03

Pos logit contributions
!+0.009
!"+0.008
 years+0.008
e+0.007
 scared+0.007
Neg logit contributions
utes-0.009
tr-0.008
der-0.008
arl-0.008
itted-0.008
Token house
Token ID2156
Feature activation+1.01e-02

Pos logit contributions
!+0.006
!"+0.006
,+0.005
 years+0.005
 scared+0.005
Neg logit contributions
utes-0.008
tr-0.007
arl-0.007
der-0.006
itted-0.006
Token had
Token ID550
Feature activation-1.77e-02

Pos logit contributions
!+0.012
!"+0.010
 years+0.008
!".+0.007
,+0.007
Neg logit contributions
utes-0.026
tr-0.026
itted-0.023
der-0.023
ute-0.022
Token an
Token ID281
Feature activation+3.92e-02

Pos logit contributions
utes+0.042
tr+0.037
arl+0.035
der+0.034
itted+0.034
Neg logit contributions
!-0.027
!"-0.025
,-0.021
 scared-0.020
 years-0.020
Token ash
Token ID12530
Feature activation-4.29e-02

Pos logit contributions
!+0.066
!"+0.060
,+0.050
 years+0.049
 scared+0.048
Neg logit contributions
utes-0.085
tr-0.077
arl-0.070
der-0.069
itted-0.069
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.034
 chords+0.016
osity+0.016
uster+0.016
arl+0.016
Neg logit contributions
!-0.144
!"-0.135
,-0.133
.-0.130
 tree-0.120
Tokenay
Token ID323
Feature activation-4.00e-02

Pos logit contributions
utes+0.399
tr+0.335
arl+0.305
der+0.303
itted+0.298
Neg logit contributions
!-0.524
!"-0.489
,-0.442
.-0.417
 scared-0.416
Token outside
Token ID2354
Feature activation+2.04e-02

Pos logit contributions
utes+0.116
tr+0.105
arl+0.101
itted+0.099
der+0.099
Neg logit contributions
!-0.043
!"-0.037
,-0.031
.-0.027
,"-0.023
Token where
Token ID810
Feature activation-2.03e-02

Pos logit contributions
!+0.021
!"+0.020
 years+0.016
!",+0.015
 scared+0.015
Neg logit contributions
tr-0.048
utes-0.047
der-0.043
osity-0.041
itted-0.041
Token people
Token ID661
Feature activation+9.92e-03

Pos logit contributions
utes+0.049
tr+0.048
der+0.044
 conductor+0.042
osity+0.041
Neg logit contributions
!-0.024
!"-0.021
 years-0.017
!".-0.015
e-0.014
Token put
Token ID1234
Feature activation+1.24e-02

Pos logit contributions
!+0.015
!"+0.014
 years+0.012
!".+0.011
,+0.011
Neg logit contributions
utes-0.021
tr-0.019
osity-0.018
itted-0.017
arl-0.017
Token gave
Token ID2921
Feature activation+1.38e-02

Pos logit contributions
utes+0.021
tr+0.019
arl+0.018
itted+0.017
der+0.017
Neg logit contributions
!-0.012
!"-0.011
 years-0.009
,-0.009
 scared-0.009
Token him
Token ID683
Feature activation+2.33e-02

Pos logit contributions
!+0.016
!"+0.015
 years+0.011
,+0.011
 scared+0.011
Neg logit contributions
utes-0.036
tr-0.033
arl-0.031
itted-0.031
der-0.030
Token a
Token ID257
Feature activation+2.07e-03

Pos logit contributions
!+0.025
!"+0.022
,+0.017
 years+0.016
 scared+0.015
Neg logit contributions
utes-0.065
tr-0.060
arl-0.057
der-0.056
itted-0.056
Token small
Token ID1402
Feature activation-6.94e-04

Pos logit contributions
!+0.003
!"+0.002
,+0.002
 scared+0.002
.+0.002
Neg logit contributions
utes-0.006
der-0.005
itted-0.005
osity-0.005
tr-0.005
Token ash
Token ID12530
Feature activation-4.18e-02

Pos logit contributions
utes+0.002
tr+0.001
arl+0.001
der+0.001
osity+0.001
Neg logit contributions
!-0.001
!"-0.001
,-0.001
.-0.001
 scared-0.001
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.045
 chords+0.028
osity+0.028
arl+0.026
itted+0.026
Neg logit contributions
!-0.135
!"-0.124
,-0.123
.-0.121
,"-0.109
Tokenay
Token ID323
Feature activation-4.14e-02

Pos logit contributions
utes+0.400
tr+0.334
arl+0.306
der+0.301
itted+0.299
Neg logit contributions
!-0.529
!"-0.491
,-0.445
.-0.420
 years-0.414
Token as
Token ID355
Feature activation+5.41e-03

Pos logit contributions
utes+0.123
tr+0.110
arl+0.106
itted+0.104
ute+0.103
Neg logit contributions
!-0.047
!"-0.039
,-0.034
.-0.030
,"-0.025
Token a
Token ID257
Feature activation+2.39e-03

Pos logit contributions
!+0.005
!"+0.004
,+0.003
.+0.003
 the+0.003
Neg logit contributions
utes-0.016
tr-0.015
arl-0.015
itted-0.014
der-0.014
Token gift
Token ID6979
Feature activation-1.78e-02

Pos logit contributions
!+0.004
!"+0.003
,+0.003
.+0.003
 scared+0.003
Neg logit contributions
utes-0.006
itted-0.005
der-0.005
osity-0.005
angers-0.005
Token.
Token ID13
Feature activation+5.68e-03

Pos logit contributions
utes+0.058
tr+0.051
arl+0.051
itted+0.050
osity+0.049
Neg logit contributions
!-0.016
!"-0.012
,-0.010
.-0.009
."-0.006
Token He
Token ID679
Feature activation-2.27e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.002
tr-0.002
der-0.002
itted-0.002
arl-0.002
Token picked
Token ID6497
Feature activation-1.48e-02

Pos logit contributions
utes+0.061
tr+0.057
arl+0.053
der+0.052
itted+0.052
Neg logit contributions
!-0.027
!"-0.023
,-0.019
 years-0.018
 scared-0.017
Token up
Token ID510
Feature activation+6.56e-03

Pos logit contributions
utes+0.046
tr+0.043
arl+0.040
der+0.040
itted+0.040
Neg logit contributions
!-0.011
!"-0.009
,-0.006
 years-0.006
 scared-0.005
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
!+0.006
!"+0.006
 years+0.004
,+0.004
 scared+0.004
Neg logit contributions
utes-0.019
tr-0.017
der-0.016
arl-0.016
itted-0.016
Token ash
Token ID12530
Feature activation-4.19e-02

Pos logit contributions
!+0.016
!"+0.015
,+0.013
 scared+0.012
.+0.012
Neg logit contributions
utes-0.028
arl-0.023
der-0.023
tr-0.023
itted-0.023
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.035
arl+0.018
itted+0.015
osity+0.015
 chords+0.014
Neg logit contributions
!-0.141
!"-0.132
,-0.131
.-0.126
,"-0.118
Tokenay
Token ID323
Feature activation-3.92e-02

Pos logit contributions
utes+0.407
tr+0.338
arl+0.311
der+0.305
itted+0.304
Neg logit contributions
!-0.520
!"-0.482
,-0.438
.-0.412
 years-0.405
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
utes+0.115
tr+0.107
arl+0.101
der+0.100
itted+0.099
Neg logit contributions
!-0.036
!"-0.031
,-0.022
 years-0.021
 scared-0.020
Token snapped
Token ID20821
Feature activation-4.27e-02

Pos logit contributions
utes+0.028
tr+0.026
arl+0.024
der+0.024
itted+0.024
Neg logit contributions
!-0.015
!"-0.013
,-0.011
 years-0.010
 scared-0.010
Token it
Token ID340
Feature activation-3.66e-03

Pos logit contributions
utes+0.116
der+0.106
tr+0.105
itted+0.101
arl+0.099
Neg logit contributions
!-0.033
!"-0.031
 years-0.030
 scared-0.022
 panic-0.020
Token -
Token ID532
Feature activation+2.44e-02

Pos logit contributions
utes+0.009
tr+0.008
arl+0.008
der+0.008
itted+0.008
Neg logit contributions
!-0.005
!"-0.004
 years-0.004
,-0.004
 scared-0.003
Token and
Token ID290
Feature activation-1.30e-02

Pos logit contributions
utes+0.144
tr+0.131
arl+0.127
itted+0.125
osity+0.124
Neg logit contributions
!-0.034
!"-0.026
,-0.018
.-0.014
 scared-0.013
Token accidentally
Token ID14716
Feature activation-1.04e-02

Pos logit contributions
utes+0.036
tr+0.032
arl+0.032
itted+0.031
ute+0.031
Neg logit contributions
!-0.016
!"-0.014
,-0.012
 scared-0.012
 the-0.011
Token dropped
Token ID5710
Feature activation-1.29e-02

Pos logit contributions
utes+0.023
tr+0.022
der+0.019
arl+0.019
itted+0.019
Neg logit contributions
!-0.016
!"-0.014
 years-0.013
,-0.012
 scared-0.011
Token her
Token ID607
Feature activation+1.55e-02

Pos logit contributions
utes+0.036
tr+0.035
der+0.033
arl+0.032
itted+0.032
Neg logit contributions
!-0.011
!"-0.009
 years-0.007
,-0.007
 scared-0.006
Token ash
Token ID12530
Feature activation-4.32e-02

Pos logit contributions
!+0.028
!"+0.025
,+0.023
.+0.022
 scared+0.021
Neg logit contributions
utes-0.035
osity-0.031
arl-0.030
itted-0.029
tr-0.028
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.032
osity+0.016
 chords+0.013
itted+0.012
arl+0.012
Neg logit contributions
!-0.147
!"-0.138
.-0.131
,-0.131
."-0.122
Tokenay
Token ID323
Feature activation-3.98e-02

Pos logit contributions
utes+0.363
tr+0.319
arl+0.280
der+0.271
itted+0.270
Neg logit contributions
!-0.545
!"-0.511
,-0.462
 years-0.455
 scared-0.448
Token on
Token ID319
Feature activation+2.96e-03

Pos logit contributions
utes+0.128
tr+0.121
arl+0.114
der+0.114
itted+0.113
Neg logit contributions
!-0.023
!"-0.018
 years-0.009
,-0.009
 scared-0.008
Token the
Token ID262
Feature activation+8.29e-03

Pos logit contributions
!+0.003
!"+0.002
 years+0.002
 scared+0.002
!",+0.002
Neg logit contributions
tr-0.007
utes-0.007
der-0.007
arl-0.007
 conductor-0.006
Token ground
Token ID2323
Feature activation-1.50e-02

Pos logit contributions
!+0.013
!"+0.012
,+0.011
.+0.010
 years+0.010
Neg logit contributions
utes-0.020
tr-0.016
itted-0.016
arl-0.016
osity-0.016
Token.
Token ID13
Feature activation+1.12e-03

Pos logit contributions
utes+0.043
tr+0.043
der+0.041
arl+0.039
itted+0.038
Neg logit contributions
!-0.008
!"-0.007
 years-0.007
 scared-0.005
e-0.005
Token.
Token ID13
Feature activation-5.55e-03

Pos logit contributions
utes+0.052
tr+0.049
der+0.046
arl+0.046
itted+0.046
Neg logit contributions
!-0.015
!"-0.013
,-0.009
 years-0.009
 scared-0.008
Token\n
Token ID198
Feature activation-5.43e-05

Pos logit contributions
utes+0.014
tr+0.012
arl+0.012
der+0.011
itted+0.011
Neg logit contributions
!-0.008
!"-0.007
,-0.006
 years-0.006
 scared-0.006
Token\n
Token ID198
Feature activation-1.94e-04

Pos logit contributions
utes+0.000
tr+0.000
arl+0.000
ute+0.000
Years+0.000
Neg logit contributions
!-0.000
!"-0.000
 years-0.000
 scared-0.000
,-0.000
TokenThe
Token ID464
Feature activation-3.21e-03

Pos logit contributions
utes+0.000
tr+0.000
arl+0.000
der+0.000
itted+0.000
Neg logit contributions
!-0.000
!"-0.000
,-0.000
 years-0.000
 scared-0.000
Token ash
Token ID12530
Feature activation-4.35e-02

Pos logit contributions
utes+0.008
tr+0.007
itted+0.007
osity+0.007
der+0.007
Neg logit contributions
!-0.004
!"-0.004
,-0.004
 scared-0.004
.-0.003
Tokentr
Token ID2213
Feature activation-2.36e-01

Pos logit contributions
utes+0.055
arl+0.043
osity+0.034
uster+0.032
itted+0.031
Neg logit contributions
!-0.125
,-0.115
!"-0.113
.-0.110
 scared-0.105
Tokenay
Token ID323
Feature activation-3.88e-02

Pos logit contributions
utes+0.457
tr+0.359
 chords+0.352
arl+0.349
Years+0.343
Neg logit contributions
!-0.479
!"-0.433
,-0.400
.-0.360
 scared-0.358
Token was
Token ID373
Feature activation+1.17e-02

Pos logit contributions
utes+0.107
tr+0.099
arl+0.094
itted+0.092
der+0.091
Neg logit contributions
!-0.044
!"-0.038
,-0.030
 years-0.028
 scared-0.027
Token ready
Token ID3492
Feature activation-1.54e-03

Pos logit contributions
!+0.015
!"+0.014
 years+0.010
,+0.010
 scared+0.010
Neg logit contributions
utes-0.029
tr-0.027
arl-0.025
itted-0.025
der-0.025
Token to
Token ID284
Feature activation-1.31e-03

Pos logit contributions
utes+0.004
tr+0.003
der+0.003
arl+0.003
itted+0.003
Neg logit contributions
!-0.002
!"-0.002
 years-0.002
 scared-0.002
,-0.002
Token provide
Token ID2148
Feature activation+1.45e-02

Pos logit contributions
utes+0.003
tr+0.003
arl+0.002
der+0.002
ute+0.002
Neg logit contributions
!-0.002
!"-0.002
 years-0.002
 scared-0.002
,-0.002
Token They
Token ID1119
Feature activation-1.06e-02

Pos logit contributions
utes+0.006
tr+0.005
arl+0.005
der+0.005
itted+0.005
Neg logit contributions
!-0.006
!"-0.005
,-0.005
 years-0.005
 scared-0.004
Token grabbed
Token ID13176
Feature activation-4.05e-02

Pos logit contributions
utes+0.027
tr+0.025
arl+0.025
der+0.024
itted+0.023
Neg logit contributions
!-0.015
!"-0.013
,-0.011
 scared-0.010
.-0.010
Token an
Token ID281
Feature activation+3.92e-02

Pos logit contributions
utes+0.113
tr+0.104
arl+0.099
der+0.097
osity+0.097
Neg logit contributions
!-0.046
!"-0.040
,-0.034
 scared-0.030
 years-0.029
Token old
Token ID1468
Feature activation+2.65e-03

Pos logit contributions
!+0.065
!"+0.060
,+0.052
 years+0.049
 scared+0.049
Neg logit contributions
utes-0.089
tr-0.082
arl-0.079
osity-0.077
der-0.074
Token ash
Token ID12530
Feature activation-4.29e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
.+0.004
 scared+0.004
Neg logit contributions
utes-0.005
tr-0.005
arl-0.005
itted-0.004
der-0.004
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.059
osity+0.046
itted+0.043
arl+0.043
 chords+0.041
Neg logit contributions
!-0.121
,-0.112
!"-0.110
.-0.107
 tree-0.098
Tokenay
Token ID323
Feature activation-3.93e-02

Pos logit contributions
utes+0.420
tr+0.347
osity+0.323
arl+0.323
der+0.322
Neg logit contributions
!-0.507
!"-0.469
,-0.427
.-0.398
,"-0.385
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
utes+0.126
tr+0.117
arl+0.113
osity+0.109
itted+0.109
Neg logit contributions
!-0.034
!"-0.024
,-0.022
.-0.017
,"-0.011
Token put
Token ID1234
Feature activation+1.17e-02

Pos logit contributions
utes+0.035
arl+0.032
tr+0.031
itted+0.030
osity+0.030
Neg logit contributions
!-0.018
!"-0.015
,-0.014
 scared-0.013
 the-0.013
Token it
Token ID340
Feature activation-6.46e-03

Pos logit contributions
!+0.007
!"+0.005
,+0.003
 years+0.002
 scared+0.002
Neg logit contributions
utes-0.039
tr-0.036
arl-0.034
der-0.034
itted-0.034
Token in
Token ID287
Feature activation-1.52e-03

Pos logit contributions
utes+0.018
tr+0.016
arl+0.016
osity+0.015
der+0.015
Neg logit contributions
!-0.008
!"-0.007
,-0.006
.-0.006
 scared-0.005
Token to
Token ID284
Feature activation-5.66e-04

Pos logit contributions
!+0.008
!"+0.007
,+0.006
 years+0.006
 scared+0.006
Neg logit contributions
utes-0.012
tr-0.011
arl-0.010
der-0.010
itted-0.010
Token create
Token ID2251
Feature activation-4.50e-02

Pos logit contributions
utes+0.001
tr+0.001
arl+0.001
der+0.001
itted+0.001
Neg logit contributions
!-0.001
!"-0.001
,-0.001
 years-0.001
 scared-0.001
Token her
Token ID607
Feature activation+1.49e-02

Pos logit contributions
utes+0.104
tr+0.091
der+0.086
ute+0.083
 chords+0.081
Neg logit contributions
!-0.060
!"-0.057
 years-0.049
 scared-0.047
,-0.046
Token own
Token ID898
Feature activation+3.31e-02

Pos logit contributions
!+0.026
!"+0.024
,+0.021
.+0.020
 years+0.020
Neg logit contributions
utes-0.032
tr-0.029
arl-0.028
der-0.027
osity-0.027
Token ash
Token ID12530
Feature activation-4.21e-02

Pos logit contributions
!+0.063
!"+0.058
,+0.051
.+0.049
 years+0.048
Neg logit contributions
utes-0.066
tr-0.060
arl-0.056
der-0.054
itted-0.054
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.030
arl+0.017
osity+0.015
uster+0.015
itted+0.014
Neg logit contributions
!-0.140
!"-0.133
,-0.125
.-0.125
."-0.118
Tokenay
Token ID323
Feature activation-4.10e-02

Pos logit contributions
utes+0.384
tr+0.326
arl+0.296
der+0.289
itted+0.287
Neg logit contributions
!-0.533
!"-0.497
,-0.449
 years-0.430
.-0.427
Token.
Token ID13
Feature activation+1.33e-03

Pos logit contributions
utes+0.135
tr+0.125
arl+0.120
itted+0.119
der+0.118
Neg logit contributions
!-0.026
!"-0.020
,-0.012
.-0.008
 years-0.007
Token She
Token ID1375
Feature activation+2.97e-03

Pos logit contributions
!+0.002
!"+0.002
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.003
tr-0.003
arl-0.002
itted-0.002
der-0.002
Token went
Token ID1816
Feature activation-1.49e-02

Pos logit contributions
!+0.004
!"+0.004
,+0.003
 years+0.003
 scared+0.003
Neg logit contributions
utes-0.008
arl-0.007
tr-0.007
osity-0.007
der-0.006
Token to
Token ID284
Feature activation+1.18e-04

Pos logit contributions
utes+0.038
tr+0.035
der+0.033
arl+0.033
itted+0.032
Neg logit contributions
!-0.021
!"-0.019
,-0.017
.-0.015
 scared-0.015
Token.
Token ID13
Feature activation+7.62e-04

Pos logit contributions
utes+0.057
tr+0.054
der+0.053
arl+0.050
itted+0.050
Neg logit contributions
!-0.010
!"-0.008
 years-0.005
 scared-0.004
,-0.004
Token He
Token ID679
Feature activation-2.38e-02

Pos logit contributions
!+0.001
!"+0.001
,+0.001
 scared+0.001
 years+0.001
Neg logit contributions
utes-0.002
tr-0.002
arl-0.002
itted-0.002
ute-0.002
Token took
Token ID1718
Feature activation-2.60e-02

Pos logit contributions
utes+0.063
tr+0.057
arl+0.055
itted+0.053
osity+0.053
Neg logit contributions
!-0.031
!"-0.027
,-0.022
 scared-0.021
 years-0.021
Token the
Token ID262
Feature activation+9.40e-03

Pos logit contributions
utes+0.075
tr+0.069
arl+0.065
der+0.064
itted+0.064
Neg logit contributions
!-0.027
!"-0.023
,-0.018
 years-0.016
 scared-0.016
Token ash
Token ID12530
Feature activation-4.35e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.013
 scared+0.012
.+0.012
Neg logit contributions
utes-0.022
tr-0.018
arl-0.018
der-0.018
itted-0.018
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.041
osity+0.022
arl+0.022
 chords+0.020
itted+0.020
Neg logit contributions
!-0.142
!"-0.132
,-0.131
.-0.126
 scared-0.116
Tokenay
Token ID323
Feature activation-3.93e-02

Pos logit contributions
utes+0.418
tr+0.349
arl+0.319
osity+0.317
der+0.317
Neg logit contributions
!-0.509
!"-0.468
,-0.427
.-0.397
 scared-0.390
Token outside
Token ID2354
Feature activation+2.15e-02

Pos logit contributions
utes+0.111
tr+0.103
arl+0.098
der+0.096
itted+0.096
Neg logit contributions
!-0.043
!"-0.037
,-0.030
 years-0.026
.-0.026
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
!+0.025
!"+0.022
 years+0.018
 scared+0.017
,+0.016
Neg logit contributions
utes-0.055
tr-0.051
der-0.048
arl-0.047
itted-0.047
Token forgot
Token ID16453
Feature activation+1.85e-04

Pos logit contributions
utes+0.035
tr+0.032
arl+0.031
itted+0.030
der+0.030
Neg logit contributions
!-0.015
!"-0.013
,-0.011
 scared-0.010
 years-0.010
Token to
Token ID284
Feature activation-7.83e-04

Pos logit contributions
!+0.000
!"+0.000
,+0.000
 years+0.000
 scared+0.000
Neg logit contributions
utes-0.000
tr-0.000
arl-0.000
der-0.000
itted-0.000
Token cigarette
Token ID17779
Feature activation-4.54e-02

Pos logit contributions
!+0.008
!"+0.007
,+0.006
 scared+0.006
.+0.006
Neg logit contributions
utes-0.013
itted-0.011
tr-0.011
osity-0.011
arl-0.011
Token and
Token ID290
Feature activation-9.87e-03

Pos logit contributions
utes+0.141
tr+0.124
arl+0.123
osity+0.122
itted+0.121
Neg logit contributions
!-0.044
!"-0.035
,-0.028
.-0.026
 scared-0.021
Token using
Token ID1262
Feature activation+1.30e-02

Pos logit contributions
utes+0.026
tr+0.023
arl+0.023
ute+0.022
itted+0.022
Neg logit contributions
!-0.013
!"-0.012
,-0.010
 scared-0.009
.-0.009
Token an
Token ID281
Feature activation+4.06e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.011
 years+0.010
 scared+0.010
Neg logit contributions
utes-0.035
tr-0.032
arl-0.030
der-0.029
itted-0.029
Token ash
Token ID12530
Feature activation-4.35e-02

Pos logit contributions
!+0.075
!"+0.069
,+0.061
 scared+0.059
.+0.058
Neg logit contributions
utes-0.086
tr-0.075
arl-0.072
osity-0.070
itted-0.067
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.044
osity+0.030
 chords+0.029
uster+0.023
Years+0.023
Neg logit contributions
!-0.140
.-0.130
!"-0.128
,-0.126
."-0.115
Tokenay
Token ID323
Feature activation-3.91e-02

Pos logit contributions
utes+0.382
tr+0.327
arl+0.292
der+0.285
itted+0.283
Neg logit contributions
!-0.534
!"-0.499
,-0.450
 years-0.432
 scared-0.429
Token.
Token ID13
Feature activation+5.72e-03

Pos logit contributions
utes+0.122
tr+0.111
arl+0.107
osity+0.105
der+0.105
Neg logit contributions
!-0.036
!"-0.028
,-0.022
.-0.021
,"-0.015
Token\n
Token ID198
Feature activation+6.95e-03

Pos logit contributions
!+0.008
!"+0.007
 years+0.007
e+0.006
 scared+0.005
Neg logit contributions
utes-0.013
tr-0.013
der-0.012
arl-0.012
itted-0.011
Token\n
Token ID198
Feature activation+6.65e-03

Pos logit contributions
!+0.010
 years+0.009
!"+0.008
 scared+0.008
 trapped+0.007
Neg logit contributions
utes-0.015
tr-0.014
arl-0.013
ute-0.012
itted-0.012
TokenL
Token ID43
Feature activation-2.94e-02

Pos logit contributions
!+0.008
!"+0.007
,+0.006
 years+0.006
 scared+0.006
Neg logit contributions
utes-0.017
tr-0.016
arl-0.015
der-0.015
itted-0.015
Token accidentally
Token ID14716
Feature activation-8.13e-03

Pos logit contributions
!+0.027
!"+0.023
,+0.018
 years+0.017
 scared+0.016
Neg logit contributions
utes-0.063
tr-0.059
der-0.054
itted-0.054
arl-0.054
Token knocked
Token ID13642
Feature activation+1.59e-03

Pos logit contributions
utes+0.018
tr+0.017
der+0.015
arl+0.015
itted+0.015
Neg logit contributions
!-0.013
!"-0.012
 years-0.010
,-0.010
 scared-0.009
Token over
Token ID625
Feature activation-1.75e-03

Pos logit contributions
!+0.002
!"+0.002
,+0.002
.+0.001
 scared+0.001
Neg logit contributions
utes-0.004
tr-0.004
arl-0.004
itted-0.004
osity-0.004
Token the
Token ID262
Feature activation+9.37e-03

Pos logit contributions
utes+0.005
tr+0.004
der+0.004
arl+0.004
itted+0.004
Neg logit contributions
!-0.002
!"-0.002
 years-0.001
 scared-0.001
,-0.001
Token ash
Token ID12530
Feature activation-4.22e-02

Pos logit contributions
!+0.016
!"+0.014
,+0.013
.+0.012
 scared+0.012
Neg logit contributions
utes-0.022
tr-0.019
itted-0.018
osity-0.018
arl-0.018
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.035
itted+0.016
osity+0.015
arl+0.014
 chords+0.014
Neg logit contributions
!-0.141
!"-0.132
,-0.129
.-0.126
."-0.117
Tokenay
Token ID323
Feature activation-3.90e-02

Pos logit contributions
utes+0.414
tr+0.334
arl+0.314
itted+0.312
osity+0.312
Neg logit contributions
!-0.517
!"-0.475
,-0.436
.-0.411
 scared-0.395
Token and
Token ID290
Feature activation-1.15e-02

Pos logit contributions
utes+0.114
tr+0.106
arl+0.100
der+0.099
itted+0.099
Neg logit contributions
!-0.036
!"-0.031
,-0.023
 years-0.021
 scared-0.020
Token it
Token ID340
Feature activation-4.55e-03

Pos logit contributions
utes+0.032
tr+0.030
der+0.028
arl+0.028
itted+0.027
Neg logit contributions
!-0.011
!"-0.010
 years-0.007
,-0.007
 scared-0.006
Token broke
Token ID6265
Feature activation-4.10e-02

Pos logit contributions
tr+0.010
utes+0.009
der+0.008
itted+0.008
arl+0.008
Neg logit contributions
!-0.007
!"-0.007
 years-0.006
!".-0.005
,-0.005
Token into
Token ID656
Feature activation-2.60e-02

Pos logit contributions
utes+0.104
tr+0.099
der+0.095
itted+0.089
arl+0.089
Neg logit contributions
!-0.038
!"-0.036
 years-0.034
e-0.029
 scared-0.029
Token "
Token ID366
Feature activation-3.15e-02

Pos logit contributions
!+0.003
!"+0.003
,+0.002
 years+0.002
 scared+0.002
Neg logit contributions
utes-0.003
tr-0.003
arl-0.003
der-0.003
itted-0.003
TokenThat
Token ID2504
Feature activation-3.45e-02

Pos logit contributions
utes+0.075
arl+0.068
osity+0.064
tr+0.064
der+0.064
Neg logit contributions
!-0.045
!"-0.041
 scared-0.034
 hurt-0.033
 years-0.033
Token is
Token ID318
Feature activation+2.19e-02

Pos logit contributions
tr+0.092
utes+0.090
der+0.080
ute+0.078
 conductor+0.076
Neg logit contributions
!-0.035
 years-0.029
!"-0.028
 free-0.023
 scared-0.022
Token an
Token ID281
Feature activation+3.93e-02

Pos logit contributions
!+0.025
!"+0.022
,+0.017
 years+0.017
 scared+0.016
Neg logit contributions
utes-0.059
tr-0.055
arl-0.051
der-0.050
itted-0.050
Token ash
Token ID12530
Feature activation-4.21e-02

Pos logit contributions
!+0.068
!"+0.062
,+0.053
 years+0.053
 scared+0.052
Neg logit contributions
utes-0.079
tr-0.074
der-0.067
arl-0.066
itted-0.064
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.026
itted+0.009
arl+0.008
osity+0.007
 chords+0.006
Neg logit contributions
!-0.147
!"-0.141
,-0.132
.-0.129
."-0.125
Tokenay
Token ID323
Feature activation-3.82e-02

Pos logit contributions
utes+0.356
tr+0.316
arl+0.274
der+0.265
itted+0.263
Neg logit contributions
!-0.548
!"-0.513
,-0.465
 years-0.461
 scared-0.454
Token.
Token ID13
Feature activation-1.88e-03

Pos logit contributions
utes+0.119
tr+0.117
arl+0.108
der+0.107
osity+0.105
Neg logit contributions
!-0.019
!"-0.013
 years-0.012
 scared-0.009
 panic-0.005
Token It
Token ID632
Feature activation-1.62e-02

Pos logit contributions
utes+0.005
tr+0.004
arl+0.004
der+0.004
itted+0.004
Neg logit contributions
!-0.003
!"-0.002
,-0.002
 scared-0.002
."-0.002
Token is
Token ID318
Feature activation+2.20e-02

Pos logit contributions
tr+0.039
utes+0.037
der+0.033
osity+0.032
 conductor+0.032
Neg logit contributions
!-0.021
!"-0.018
 years-0.017
,-0.015
!".-0.014
Token not
Token ID407
Feature activation+3.14e-02

Pos logit contributions
!+0.027
!"+0.024
,+0.020
 years+0.019
 scared+0.018
Neg logit contributions
utes-0.058
tr-0.053
arl-0.050
der-0.049
itted-0.049
Token Tim
Token ID5045
Feature activation+3.68e-02

Pos logit contributions
utes+0.002
tr+0.002
arl+0.002
ute+0.002
der+0.002
Neg logit contributions
!-0.001
!"-0.001
,-0.001
 scared-0.001
.-0.001
Token saw
Token ID2497
Feature activation-2.03e-02

Pos logit contributions
!"+0.054
!+0.054
!".+0.046
!",+0.043
 years+0.042
Neg logit contributions
tr-0.070
utes-0.069
 conductor-0.065
ashed-0.063
itted-0.061
Token a
Token ID257
Feature activation+3.72e-03

Pos logit contributions
utes+0.063
tr+0.058
arl+0.057
itted+0.055
osity+0.055
Neg logit contributions
!-0.017
!"-0.013
,-0.012
 scared-0.012
 the-0.012
Token small
Token ID1402
Feature activation+1.29e-03

Pos logit contributions
!+0.005
!"+0.004
,+0.004
 scared+0.003
 years+0.003
Neg logit contributions
utes-0.010
tr-0.009
arl-0.008
itted-0.008
der-0.008
Token ash
Token ID12530
Feature activation-4.25e-02

Pos logit contributions
!+0.002
!"+0.002
,+0.002
 years+0.002
 scared+0.002
Neg logit contributions
utes-0.003
tr-0.003
arl-0.002
der-0.002
itted-0.002
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.044
uster+0.028
osity+0.027
arl+0.026
itted+0.025
Neg logit contributions
!-0.135
!"-0.123
,-0.122
.-0.121
 scared-0.110
Tokenay
Token ID323
Feature activation-3.89e-02

Pos logit contributions
utes+0.404
tr+0.339
arl+0.307
der+0.302
osity+0.302
Neg logit contributions
!-0.521
!"-0.482
,-0.438
.-0.413
 scared-0.409
Token outside
Token ID2354
Feature activation+2.16e-02

Pos logit contributions
utes+0.103
tr+0.091
arl+0.087
itted+0.087
osity+0.084
Neg logit contributions
!-0.055
!"-0.047
,-0.042
.-0.039
 scared-0.035
Token the
Token ID262
Feature activation+8.41e-03

Pos logit contributions
!+0.017
!"+0.017
!",+0.012
 years+0.012
 free+0.011
Neg logit contributions
utes-0.055
tr-0.055
der-0.051
arl-0.049
 conductor-0.049
Token door
Token ID3420
Feature activation-4.08e-02

Pos logit contributions
!+0.015
!"+0.013
,+0.012
 years+0.011
 scared+0.011
Neg logit contributions
utes-0.018
tr-0.016
arl-0.015
itted-0.015
der-0.015
Token.
Token ID13
Feature activation-7.56e-04

Pos logit contributions
tr+0.117
utes+0.115
der+0.106
itted+0.103
arl+0.103
Neg logit contributions
!-0.026
!"-0.026
 years-0.018
!".-0.015
 scared-0.014
Token was
Token ID373
Feature activation+8.12e-03

Pos logit contributions
!+0.007
!"+0.006
 years+0.005
,+0.005
 free+0.005
Neg logit contributions
utes-0.010
tr-0.010
der-0.009
itted-0.009
 conductor-0.009
Token right
Token ID826
Feature activation-1.15e-02

Pos logit contributions
!+0.011
!"+0.010
,+0.008
 years+0.008
!".+0.007
Neg logit contributions
utes-0.019
tr-0.018
arl-0.017
itted-0.016
der-0.016
Token.
Token ID13
Feature activation+2.84e-04

Pos logit contributions
utes+0.036
tr+0.034
arl+0.033
der+0.032
itted+0.032
Neg logit contributions
!-0.008
!"-0.006
,-0.003
 years-0.003
 scared-0.003
Token The
Token ID383
Feature activation+3.09e-03

Pos logit contributions
!+0.000
!"+0.000
,+0.000
 years+0.000
 scared+0.000
Neg logit contributions
utes-0.001
tr-0.001
arl-0.001
der-0.001
itted-0.001
Token ash
Token ID12530
Feature activation-4.27e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
.+0.004
 scared+0.004
Neg logit contributions
utes-0.007
itted-0.006
osity-0.006
tr-0.006
uster-0.006
Tokentr
Token ID2213
Feature activation-2.35e-01

Pos logit contributions
utes+0.074
arl+0.058
itted+0.055
osity+0.055
uster+0.055
Neg logit contributions
!-0.097
!"-0.091
,-0.084
.-0.081
 scared-0.078
Tokenay
Token ID323
Feature activation-4.11e-02

Pos logit contributions
utes+0.418
tr+0.337
arl+0.318
itted+0.317
osity+0.316
Neg logit contributions
!-0.510
!"-0.476
,-0.427
.-0.401
,"-0.391
Token is
Token ID318
Feature activation+2.38e-02

Pos logit contributions
utes+0.117
tr+0.111
arl+0.103
der+0.102
itted+0.101
Neg logit contributions
!-0.039
!"-0.033
 years-0.025
,-0.024
 scared-0.023
Token not
Token ID407
Feature activation+3.41e-02

Pos logit contributions
!+0.028
!"+0.025
,+0.020
 scared+0.019
 years+0.018
Neg logit contributions
utes-0.066
tr-0.059
arl-0.056
itted-0.056
der-0.055
Token for
Token ID329
Feature activation-2.11e-02

Pos logit contributions
!+0.047
!"+0.042
,+0.035
 scared+0.033
.+0.033
Neg logit contributions
utes-0.088
tr-0.079
arl-0.074
itted-0.073
der-0.073
Token kids
Token ID3988
Feature activation+2.84e-04

Pos logit contributions
utes+0.053
tr+0.048
arl+0.047
der+0.046
itted+0.045
Neg logit contributions
!-0.026
!"-0.023
,-0.018
 years-0.018
 scared-0.017
Token tickets
Token ID8587
Feature activation+1.76e-02

Pos logit contributions
!+0.039
!"+0.035
,+0.030
 years+0.029
 scared+0.028
Neg logit contributions
utes-0.063
tr-0.058
arl-0.053
der-0.053
itted-0.052
Token to
Token ID284
Feature activation-1.12e-03

Pos logit contributions
!+0.025
!"+0.022
,+0.018
 years+0.017
 scared+0.017
Neg logit contributions
utes-0.044
tr-0.040
arl-0.037
der-0.037
itted-0.037
Token go
Token ID467
Feature activation+1.36e-02

Pos logit contributions
utes+0.003
tr+0.002
arl+0.002
der+0.002
itted+0.002
Neg logit contributions
!-0.002
!"-0.001
,-0.001
 years-0.001
 scared-0.001
Token on
Token ID319
Feature activation+2.88e-03

Pos logit contributions
!+0.018
!"+0.016
 years+0.014
 scared+0.014
,+0.013
Neg logit contributions
utes-0.033
tr-0.029
arl-0.028
ute-0.027
 conductor-0.026
Token a
Token ID257
Feature activation+5.96e-03

Pos logit contributions
!+0.003
 scared+0.002
!"+0.002
 panic+0.002
 years+0.002
Neg logit contributions
utes-0.007
tr-0.006
der-0.006
 chords-0.006
ute-0.006
Token hot
Token ID3024
Feature activation+4.55e-02

Pos logit contributions
!+0.011
!"+0.010
,+0.009
 years+0.009
 scared+0.008
Neg logit contributions
utes-0.012
tr-0.011
arl-0.010
der-0.010
ute-0.009
Token air
Token ID1633
Feature activation+3.36e-03

Pos logit contributions
!+0.085
!"+0.077
 scared+0.073
 years+0.069
 free+0.067
Neg logit contributions
utes-0.084
tr-0.078
arl-0.069
ute-0.066
 conductor-0.062
Token balloon
Token ID21190
Feature activation+3.56e-02

Pos logit contributions
!+0.003
!"+0.002
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.010
tr-0.009
arl-0.009
der-0.009
itted-0.009
Token ride
Token ID6594
Feature activation+7.82e-03

Pos logit contributions
!+0.024
!"+0.018
,+0.011
 years+0.010
 scared+0.009
Neg logit contributions
utes-0.114
tr-0.106
arl-0.100
der-0.099
itted-0.099
Token.
Token ID13
Feature activation+3.64e-03

Pos logit contributions
!+0.007
!"+0.005
 scared+0.005
 years+0.004
,+0.004
Neg logit contributions
utes-0.023
tr-0.021
arl-0.019
ute-0.019
der-0.019
Token We
Token ID775
Feature activation-1.18e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
."+0.004
.+0.004
Neg logit contributions
utes-0.009
tr-0.008
itted-0.007
der-0.007
arl-0.007
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
!+0.012
!"+0.011
 years+0.008
,+0.008
 scared+0.008
Neg logit contributions
utes-0.033
tr-0.030
der-0.029
arl-0.028
itted-0.028
Token swim
Token ID9422
Feature activation+3.74e-03

Pos logit contributions
utes+0.028
tr+0.025
arl+0.023
der+0.023
itted+0.023
Neg logit contributions
!-0.017
!"-0.015
,-0.013
 years-0.013
 scared-0.012
Token.
Token ID13
Feature activation+2.21e-03

Pos logit contributions
!+0.004
!"+0.003
 years+0.003
,+0.003
 scared+0.002
Neg logit contributions
utes-0.010
tr-0.009
arl-0.009
der-0.009
itted-0.008
Token He
Token ID679
Feature activation-2.21e-02

Pos logit contributions
!+0.003
!"+0.003
,+0.002
.+0.002
 scared+0.002
Neg logit contributions
utes-0.006
tr-0.005
arl-0.005
ute-0.005
itted-0.005
Token was
Token ID373
Feature activation+1.28e-02

Pos logit contributions
utes+0.051
tr+0.046
der+0.042
ashed+0.042
 conductor+0.041
Neg logit contributions
!-0.032
!"-0.029
 years-0.024
,-0.023
!".-0.022
Token tired
Token ID10032
Feature activation+6.88e-02

Pos logit contributions
!+0.019
!"+0.017
 years+0.014
,"+0.014
!",+0.013
Neg logit contributions
utes-0.028
tr-0.025
itted-0.024
ashed-0.023
 conductor-0.023
Token but
Token ID475
Feature activation+1.09e-02

Pos logit contributions
!+0.077
!"+0.065
,+0.054
 years+0.047
.+0.047
Neg logit contributions
utes-0.195
tr-0.180
arl-0.168
itted-0.166
der-0.164
Token happy
Token ID3772
Feature activation+3.68e-03

Pos logit contributions
!+0.014
!"+0.013
,+0.010
 years+0.010
!".+0.009
Neg logit contributions
utes-0.028
tr-0.025
der-0.024
itted-0.023
arl-0.023
Token.
Token ID13
Feature activation+2.63e-03

Pos logit contributions
!+0.003
!"+0.003
 years+0.002
 scared+0.002
,+0.002
Neg logit contributions
utes-0.010
tr-0.010
der-0.009
itted-0.009
arl-0.009
Token He
Token ID679
Feature activation-2.22e-02

Pos logit contributions
!+0.004
!"+0.004
,+0.003
 years+0.003
 scared+0.003
Neg logit contributions
utes-0.006
tr-0.006
arl-0.005
der-0.005
itted-0.005
Token had
Token ID550
Feature activation-2.00e-02

Pos logit contributions
utes+0.050
tr+0.045
der+0.041
itted+0.040
arl+0.040
Neg logit contributions
!-0.035
!"-0.032
 years-0.026
,-0.026
 scared-0.025
Token on
Token ID319
Feature activation-5.69e-05

Pos logit contributions
utes+0.129
tr+0.118
der+0.113
osity+0.111
arl+0.109
Neg logit contributions
!-0.063
!"-0.052
,-0.052
.-0.042
 the-0.039
Token,
Token ID11
Feature activation-4.59e-03

Pos logit contributions
utes+0.000
tr+0.000
der+0.000
arl+0.000
itted+0.000
Neg logit contributions
!-0.000
!"-0.000
 scared-0.000
 years-0.000
!".-0.000
Token Sue
Token ID26089
Feature activation+4.79e-02

Pos logit contributions
utes+0.012
tr+0.011
arl+0.010
der+0.010
itted+0.010
Neg logit contributions
!-0.005
!"-0.005
 years-0.004
,-0.003
 scared-0.003
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
!+0.071
!"+0.066
 years+0.055
 scared+0.052
,+0.051
Neg logit contributions
utes-0.106
tr-0.100
der-0.090
itted-0.089
osity-0.087
Token the
Token ID262
Feature activation+8.15e-03

Pos logit contributions
utes+0.049
tr+0.046
arl+0.043
der+0.043
itted+0.043
Neg logit contributions
!-0.013
!"-0.010
,-0.007
 years-0.006
 scared-0.006
Token dog
Token ID3290
Feature activation+4.56e-02

Pos logit contributions
!+0.012
!"+0.010
 scared+0.010
,+0.009
.+0.009
Neg logit contributions
utes-0.021
tr-0.018
osity-0.018
der-0.017
arl-0.017
Token were
Token ID547
Feature activation+2.59e-02

Pos logit contributions
!+0.073
!"+0.064
,+0.058
 scared+0.054
.+0.053
Neg logit contributions
utes-0.108
tr-0.096
arl-0.092
itted-0.088
der-0.088
Token best
Token ID1266
Feature activation-1.78e-02

Pos logit contributions
!+0.042
!"+0.039
 years+0.032
,+0.031
,"+0.031
Neg logit contributions
utes-0.056
tr-0.049
itted-0.046
arl-0.044
osity-0.044
Token friends
Token ID2460
Feature activation-1.50e-02

Pos logit contributions
utes+0.041
tr+0.039
arl+0.035
der+0.034
osity+0.034
Neg logit contributions
!-0.029
!"-0.026
,-0.023
 years-0.021
.-0.021
Token.
Token ID13
Feature activation-1.59e-03

Pos logit contributions
utes+0.041
tr+0.039
der+0.038
itted+0.037
 lear+0.036
Neg logit contributions
!"-0.010
!-0.010
 scared-0.007
 hurt-0.007
 panic-0.006
Token They
Token ID1119
Feature activation-1.24e-02

Pos logit contributions
utes+0.004
tr+0.004
arl+0.004
der+0.004
itted+0.004
Neg logit contributions
!-0.002
!"-0.002
,-0.001
 years-0.001
 scared-0.001
Token it
Token ID340
Feature activation-3.90e-03

Pos logit contributions
!+0.020
!"+0.018
 years+0.014
 scared+0.013
e+0.012
Neg logit contributions
utes-0.052
tr-0.046
der-0.045
itted-0.043
osity-0.043
Token tried
Token ID3088
Feature activation-1.69e-02

Pos logit contributions
tr+0.007
utes+0.007
 conductor+0.006
ashed+0.006
itted+0.006
Neg logit contributions
!-0.006
!"-0.006
 years-0.005
!".-0.005
!",-0.005
Token,
Token ID11
Feature activation+2.01e-03

Pos logit contributions
utes+0.046
der+0.041
tr+0.041
osity+0.040
itted+0.039
Neg logit contributions
!"-0.011
!-0.010
 years-0.010
e-0.009
 scared-0.007
Token it
Token ID340
Feature activation-4.05e-03

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.006
tr-0.006
der-0.006
arl-0.005
itted-0.005
Token couldn
Token ID3521
Feature activation+2.37e-02

Pos logit contributions
tr+0.008
utes+0.008
 conductor+0.007
der+0.007
itted+0.007
Neg logit contributions
!-0.007
!"-0.006
 years-0.005
!".-0.005
,-0.005
Token't
Token ID470
Feature activation+6.91e-02

Pos logit contributions
!+0.033
 years+0.032
!"+0.030
 scared+0.027
 seen+0.025
Neg logit contributions
tr-0.049
utes-0.047
der-0.042
arl-0.040
ula-0.040
Token find
Token ID1064
Feature activation+2.36e-02

Pos logit contributions
!+0.100
!"+0.097
 years+0.096
 scared+0.082
 trouble+0.081
Neg logit contributions
utes-0.129
tr-0.126
 conductor-0.124
der-0.114
 engineer-0.112
Token its
Token ID663
Feature activation-4.88e-02

Pos logit contributions
!+0.025
!"+0.022
 years+0.020
 scared+0.017
!",+0.017
Neg logit contributions
utes-0.056
tr-0.052
der-0.049
 conductor-0.048
itted-0.047
Token way
Token ID835
Feature activation+2.23e-02

Pos logit contributions
utes+0.119
tr+0.112
arl+0.102
der+0.100
itted+0.100
Neg logit contributions
!-0.066
!"-0.059
 years-0.048
,-0.047
 scared-0.044
Token back
Token ID736
Feature activation-4.51e-02

Pos logit contributions
!+0.036
!"+0.033
 years+0.033
 scared+0.030
e+0.029
Neg logit contributions
utes-0.046
tr-0.040
arl-0.037
ute-0.037
itted-0.036
Token.
Token ID13
Feature activation+1.03e-03

Pos logit contributions
utes+0.099
tr+0.089
arl+0.083
ute+0.083
itted+0.080
Neg logit contributions
!-0.059
!"-0.055
 years-0.055
 scared-0.050
e-0.049
Token have
Token ID423
Feature activation+2.74e-02

Pos logit contributions
!+0.024
!"+0.022
 years+0.021
,+0.017
 free+0.017
Neg logit contributions
utes-0.033
tr-0.032
arl-0.029
itted-0.028
 lear-0.028
Token an
Token ID281
Feature activation+3.85e-02

Pos logit contributions
!+0.034
!"+0.030
,+0.024
 years+0.023
 scared+0.023
Neg logit contributions
utes-0.072
tr-0.066
arl-0.061
itted-0.061
der-0.061
Token idea
Token ID2126
Feature activation+1.56e-02

Pos logit contributions
!+0.066
!"+0.060
,+0.054
 years+0.051
 scared+0.050
Neg logit contributions
utes-0.085
tr-0.078
arl-0.073
osity-0.071
der-0.070
Token.
Token ID13
Feature activation+7.17e-04

Pos logit contributions
 panic+0.009
 years+0.008
 scared+0.008
!+0.008
 free+0.007
Neg logit contributions
utes-0.041
tr-0.040
der-0.038
oys-0.038
arl-0.036
Token Why
Token ID4162
Feature activation-2.63e-02

Pos logit contributions
!+0.001
!"+0.001
,+0.001
.+0.001
."+0.001
Neg logit contributions
utes-0.002
tr-0.002
arl-0.001
itted-0.001
ute-0.001
Token don
Token ID836
Feature activation+7.39e-02

Pos logit contributions
utes+0.062
tr+0.057
der+0.052
arl+0.051
ute+0.050
Neg logit contributions
!-0.035
!"-0.031
 years-0.030
 free-0.028
 scared-0.027
Token't
Token ID470
Feature activation+6.73e-02

Pos logit contributions
!+0.102
 years+0.101
!"+0.092
 seen+0.088
 scared+0.081
Neg logit contributions
utes-0.158
tr-0.132
ula-0.132
ute-0.127
der-0.124
Token we
Token ID356
Feature activation-1.72e-03

Pos logit contributions
!+0.088
 years+0.082
!"+0.080
 scared+0.070
e+0.068
Neg logit contributions
utes-0.140
tr-0.122
der-0.120
ute-0.112
osity-0.111
Token take
Token ID1011
Feature activation+5.21e-03

Pos logit contributions
utes+0.004
tr+0.004
arl+0.003
ute+0.003
itted+0.003
Neg logit contributions
!-0.003
!"-0.002
 years-0.002
 scared-0.002
,-0.002
Token a
Token ID257
Feature activation+3.27e-03

Pos logit contributions
!+0.007
!"+0.006
 years+0.005
,+0.005
 scared+0.005
Neg logit contributions
utes-0.013
tr-0.012
arl-0.011
itted-0.011
der-0.011
Token picture
Token ID4286
Feature activation-2.69e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
 scared+0.004
.+0.004
Neg logit contributions
utes-0.007
der-0.007
tr-0.007
itted-0.007
osity-0.006
Token I
Token ID314
Feature activation+4.08e-03

Pos logit contributions
utes+0.072
tr+0.064
der+0.062
arl+0.061
itted+0.061
Neg logit contributions
!-0.032
!"-0.030
,-0.025
."-0.022
.-0.021
Token could
Token ID714
Feature activation+7.21e-03

Pos logit contributions
!+0.006
!"+0.006
 years+0.005
,+0.005
 scared+0.005
Neg logit contributions
utes-0.009
tr-0.008
arl-0.008
itted-0.008
der-0.007
Token use
Token ID779
Feature activation+8.65e-03

Pos logit contributions
!+0.011
 years+0.010
!"+0.010
 scared+0.009
 bush+0.009
Neg logit contributions
utes-0.014
tr-0.013
 conductor-0.011
ute-0.011
der-0.011
Token some
Token ID617
Feature activation-1.24e-02

Pos logit contributions
!+0.009
!"+0.008
 years+0.006
,+0.006
 scared+0.006
Neg logit contributions
utes-0.024
tr-0.022
arl-0.020
der-0.020
itted-0.020
Token help
Token ID1037
Feature activation+2.01e-02

Pos logit contributions
utes+0.027
tr+0.025
arl+0.023
itted+0.022
der+0.022
Neg logit contributions
!-0.021
!"-0.019
,-0.017
 years-0.016
.-0.016
Token."
Token ID526
Feature activation+2.65e-02

Pos logit contributions
!+0.027
!"+0.024
,+0.019
 years+0.018
.+0.018
Neg logit contributions
utes-0.052
tr-0.048
arl-0.045
der-0.044
itted-0.044
Token
Token ID198
Feature activation-2.59e-04

Pos logit contributions
!+0.022
!"+0.017
,+0.013
.+0.011
 years+0.010
Neg logit contributions
utes-0.082
tr-0.077
arl-0.073
der-0.072
osity-0.071
Token
Token ID198
Feature activation-2.65e-04

Pos logit contributions
utes+0.001
tr+0.001
arl+0.001
der+0.001
itted+0.001
Neg logit contributions
!-0.000
!"-0.000
,-0.000
 years-0.000
 scared-0.000
TokenSo
Token ID2396
Feature activation-9.39e-03

Pos logit contributions
utes+0.001
tr+0.001
arl+0.001
itted+0.001
der+0.001
Neg logit contributions
!-0.000
!"-0.000
 years-0.000
,-0.000
 the-0.000
Token the
Token ID262
Feature activation+7.45e-03

Pos logit contributions
utes+0.031
tr+0.029
arl+0.027
itted+0.027
der+0.027
Neg logit contributions
!-0.006
!"-0.005
,-0.003
 years-0.002
 scared-0.002
Token witch
Token ID16365
Feature activation+2.27e-02

Pos logit contributions
!+0.013
!"+0.012
,+0.011
 scared+0.010
 years+0.010
Neg logit contributions
utes-0.016
tr-0.014
arl-0.014
itted-0.013
der-0.013
Token into
Token ID656
Feature activation-2.84e-02

Pos logit contributions
!+0.008
!"+0.007
 years+0.006
 scared+0.005
,+0.005
Neg logit contributions
utes-0.020
tr-0.018
arl-0.017
der-0.017
itted-0.017
Token a
Token ID257
Feature activation+3.91e-03

Pos logit contributions
utes+0.086
tr+0.080
arl+0.076
der+0.075
itted+0.074
Neg logit contributions
!-0.022
!"-0.019
,-0.012
 years-0.011
 scared-0.011
Token big
Token ID1263
Feature activation+2.18e-02

Pos logit contributions
!+0.006
!"+0.005
,+0.005
 years+0.004
 scared+0.004
Neg logit contributions
utes-0.009
tr-0.008
arl-0.008
der-0.008
itted-0.008
Token pot
Token ID1787
Feature activation+7.98e-03

Pos logit contributions
!+0.031
!"+0.029
 years+0.026
 scared+0.025
!".+0.023
Neg logit contributions
utes-0.045
tr-0.042
arl-0.037
ute-0.036
der-0.036
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
!+0.005
!"+0.004
,+0.002
 years+0.002
.+0.002
Neg logit contributions
utes-0.026
tr-0.024
arl-0.023
itted-0.023
der-0.023
Token Then
Token ID3244
Feature activation-3.50e-02

Pos logit contributions
utes+0.006
tr+0.006
 chords+0.005
ute+0.005
der+0.005
Neg logit contributions
!-0.003
,-0.002
!"-0.002
.-0.002
 the-0.002
Token,
Token ID11
Feature activation-2.38e-03

Pos logit contributions
utes+0.111
tr+0.103
arl+0.098
itted+0.098
der+0.097
Neg logit contributions
!-0.019
!"-0.016
 years-0.009
 scared-0.007
!".-0.004
Token her
Token ID607
Feature activation+1.51e-02

Pos logit contributions
utes+0.008
tr+0.007
arl+0.007
osity+0.007
der+0.007
Neg logit contributions
!-0.002
,-0.002
 the-0.002
!"-0.001
 it-0.001
Token mom
Token ID1995
Feature activation-7.95e-03

Pos logit contributions
!+0.026
!"+0.024
,+0.022
 scared+0.020
.+0.020
Neg logit contributions
utes-0.033
tr-0.029
arl-0.028
der-0.028
osity-0.027
Token lit
Token ID6578
Feature activation-3.40e-03

Pos logit contributions
utes+0.022
tr+0.020
arl+0.020
der+0.019
ute+0.019
Neg logit contributions
!-0.010
!"-0.008
,-0.007
.-0.006
 scared-0.006
Token the
Token ID262
Feature activation+6.71e-03

Pos logit contributions
utes+0.010
tr+0.009
der+0.009
itted+0.009
arl+0.009
Neg logit contributions
!-0.003
!"-0.002
 years-0.002
,-0.002
 scared-0.001
Token was
Token ID373
Feature activation+1.14e-02

Pos logit contributions
!"+0.021
!+0.020
 years+0.019
 free+0.017
!".+0.016
Neg logit contributions
utes-0.033
tr-0.032
apped-0.028
der-0.027
ute-0.027
Token a
Token ID257
Feature activation+3.36e-03

Pos logit contributions
!+0.012
!"+0.010
,+0.008
 years+0.007
 scared+0.007
Neg logit contributions
utes-0.033
tr-0.030
arl-0.029
der-0.028
osity-0.028
Token family
Token ID1641
Feature activation+2.35e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
 years+0.004
 scared+0.004
Neg logit contributions
utes-0.008
tr-0.007
arl-0.007
itted-0.006
der-0.006
Token that
Token ID326
Feature activation-4.31e-02

Pos logit contributions
!+0.035
!"+0.030
,+0.027
.+0.025
 years+0.025
Neg logit contributions
utes-0.058
tr-0.054
arl-0.051
der-0.050
itted-0.049
Token was
Token ID373
Feature activation+1.28e-02

Pos logit contributions
utes+0.104
tr+0.093
arl+0.090
der+0.088
itted+0.086
Neg logit contributions
!-0.067
!"-0.058
,-0.053
.-0.050
 scared-0.050
Token ready
Token ID3492
Feature activation-4.07e-03

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 scared+0.009
.+0.009
Neg logit contributions
utes-0.036
tr-0.033
der-0.032
arl-0.032
osity-0.031
Token to
Token ID284
Feature activation-8.80e-04

Pos logit contributions
utes+0.010
tr+0.009
arl+0.009
itted+0.008
der+0.008
Neg logit contributions
!-0.006
!"-0.006
,-0.005
.-0.005
 years-0.004
Token take
Token ID1011
Feature activation+5.40e-03

Pos logit contributions
utes+0.002
tr+0.002
arl+0.002
der+0.002
itted+0.002
Neg logit contributions
!-0.001
!"-0.001
,-0.001
 years-0.001
 scared-0.001
Token a
Token ID257
Feature activation+3.67e-03

Pos logit contributions
!+0.007
!"+0.006
,+0.005
 years+0.005
.+0.004
Neg logit contributions
utes-0.014
tr-0.013
arl-0.013
der-0.012
osity-0.012
Token trip
Token ID5296
Feature activation+3.05e-02

Pos logit contributions
!+0.006
!"+0.005
,+0.005
 scared+0.004
.+0.004
Neg logit contributions
utes-0.009
der-0.008
tr-0.008
itted-0.008
arl-0.007
Token to
Token ID284
Feature activation-2.26e-04

Pos logit contributions
!+0.042
!"+0.037
,+0.031
 years+0.030
 scared+0.029
Neg logit contributions
utes-0.076
tr-0.070
arl-0.065
der-0.064
itted-0.064
Token day
Token ID1110
Feature activation+3.33e-03

Pos logit contributions
utes+0.068
tr+0.061
arl+0.053
itted+0.053
ute+0.052
Neg logit contributions
!-0.078
!"-0.073
 years-0.064
 scared-0.062
,-0.061
Token onwards
Token ID31719
Feature activation+2.88e-02

Pos logit contributions
!+0.005
!"+0.005
,+0.004
 years+0.004
 scared+0.004
Neg logit contributions
utes-0.008
tr-0.007
arl-0.007
der-0.007
itted-0.007
Token,
Token ID11
Feature activation+1.30e-03

Pos logit contributions
!+0.019
!"+0.017
 years+0.012
!".+0.009
 scared+0.009
Neg logit contributions
utes-0.083
tr-0.076
der-0.074
osity-0.072
itted-0.072
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.004
tr-0.003
der-0.003
osity-0.003
itted-0.003
Token pink
Token ID11398
Feature activation+1.28e-02

Pos logit contributions
!+0.020
!"+0.018
 years+0.015
,+0.015
 scared+0.015
Neg logit contributions
utes-0.027
tr-0.025
arl-0.023
itted-0.023
der-0.022
Token worm
Token ID23849
Feature activation-3.75e-03

Pos logit contributions
!+0.021
!"+0.019
,+0.016
 years+0.015
 scared+0.015
Neg logit contributions
utes-0.029
tr-0.027
arl-0.025
der-0.024
itted-0.024
Token always
Token ID1464
Feature activation+5.45e-04

Pos logit contributions
utes+0.009
tr+0.009
der+0.008
itted+0.008
osity+0.008
Neg logit contributions
!-0.005
!"-0.004
 years-0.003
,-0.003
 scared-0.003
Token looked
Token ID3114
Feature activation-1.51e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.001
tr-0.001
itted-0.001
der-0.001
osity-0.001
Token forward
Token ID2651
Feature activation-1.59e-02

Pos logit contributions
utes+0.032
tr+0.029
arl+0.027
der+0.026
itted+0.026
Neg logit contributions
!-0.026
!"-0.024
,-0.020
 years-0.020
 scared-0.020
Token to
Token ID284
Feature activation+5.74e-04

Pos logit contributions
utes+0.040
tr+0.037
arl+0.034
der+0.033
itted+0.033
Neg logit contributions
!-0.023
!"-0.021
,-0.018
.-0.017
 scared-0.016
Token adding
Token ID4375
Feature activation-4.38e-02

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
 scared+0.001
,+0.001
Neg logit contributions
utes-0.001
tr-0.001
itted-0.001
arl-0.001
der-0.001
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
!+0.028
!"+0.024
,+0.019
 years+0.018
 scared+0.017
Neg logit contributions
utes-0.066
tr-0.061
arl-0.058
der-0.057
itted-0.056
Token he
Token ID339
Feature activation+1.72e-02

Pos logit contributions
utes+0.041
tr+0.038
arl+0.036
der+0.035
itted+0.035
Neg logit contributions
!-0.018
!"-0.015
,-0.012
 years-0.011
 scared-0.011
Token came
Token ID1625
Feature activation-1.66e-02

Pos logit contributions
!+0.022
!"+0.020
 years+0.015
,+0.015
!".+0.014
Neg logit contributions
utes-0.043
tr-0.040
itted-0.037
der-0.036
 conductor-0.036
Token across
Token ID1973
Feature activation-1.78e-02

Pos logit contributions
utes+0.035
tr+0.031
itted+0.028
arl+0.028
der+0.028
Neg logit contributions
!-0.029
!"-0.027
 years-0.023
,-0.023
 scared-0.022
Token an
Token ID281
Feature activation+3.98e-02

Pos logit contributions
utes+0.043
tr+0.040
der+0.037
itted+0.035
arl+0.035
Neg logit contributions
!-0.022
!"-0.021
 years-0.016
,-0.015
 scared-0.015
Token unusual
Token ID8468
Feature activation-1.38e-02

Pos logit contributions
!+0.063
!"+0.057
,+0.049
 years+0.047
 scared+0.046
Neg logit contributions
utes-0.092
tr-0.083
arl-0.077
der-0.076
itted-0.075
Token man
Token ID582
Feature activation+5.00e-02

Pos logit contributions
utes+0.032
tr+0.029
arl+0.027
itted+0.026
der+0.026
Neg logit contributions
!-0.022
!"-0.020
,-0.017
 years-0.016
 scared-0.016
Token.
Token ID13
Feature activation-7.95e-03

Pos logit contributions
!+0.047
!"+0.045
 years+0.032
 scared+0.031
,+0.030
Neg logit contributions
utes-0.133
tr-0.126
der-0.115
itted-0.115
osity-0.113
Token The
Token ID383
Feature activation+7.41e-04

Pos logit contributions
utes+0.019
tr+0.018
arl+0.016
der+0.016
itted+0.016
Neg logit contributions
!-0.011
!"-0.010
,-0.008
 years-0.008
 scared-0.008
Token man
Token ID582
Feature activation+5.07e-02

Pos logit contributions
!+0.001
!"+0.001
,+0.001
 scared+0.001
.+0.001
Neg logit contributions
utes-0.002
tr-0.002
arl-0.001
itted-0.001
osity-0.001
Token had
Token ID550
Feature activation-1.90e-02

Pos logit contributions
!+0.058
!"+0.052
,+0.039
 years+0.038
 scared+0.036
Neg logit contributions
utes-0.135
tr-0.126
der-0.116
itted-0.115
arl-0.115
Token."
Token ID526
Feature activation+1.44e-02

Pos logit contributions
!+0.007
 years+0.005
 scared+0.005
!"+0.004
e+0.004
Neg logit contributions
utes-0.038
tr-0.037
der-0.034
arl-0.034
ute-0.033
Token
Token ID198
Feature activation-6.47e-04

Pos logit contributions
!+0.016
!"+0.014
,+0.011
 years+0.010
 scared+0.010
Neg logit contributions
utes-0.040
tr-0.037
arl-0.034
der-0.034
itted-0.034
Token
Token ID198
Feature activation-2.89e-04

Pos logit contributions
utes+0.002
tr+0.002
arl+0.002
itted+0.002
der+0.002
Neg logit contributions
!-0.001
!"-0.001
,-0.000
 years-0.000
 scared-0.000
TokenL
Token ID43
Feature activation-4.09e-02

Pos logit contributions
utes+0.001
tr+0.001
der+0.001
itted+0.001
arl+0.001
Neg logit contributions
!-0.000
!"-0.000
 years-0.000
,-0.000
.-0.000
Tokenily
Token ID813
Feature activation-5.10e-02

Pos logit contributions
utes+0.074
tr+0.068
osity+0.058
itted+0.058
arl+0.056
Neg logit contributions
!-0.088
!"-0.082
 years-0.072
,-0.072
.-0.071
Token realized
Token ID6939
Feature activation-6.08e-02

Pos logit contributions
utes+0.130
tr+0.123
der+0.113
itted+0.111
arl+0.111
Neg logit contributions
!-0.062
!"-0.055
 years-0.043
,-0.042
 scared-0.041
Token her
Token ID607
Feature activation+1.54e-02

Pos logit contributions
utes+0.164
tr+0.152
arl+0.142
itted+0.142
osity+0.138
Neg logit contributions
!-0.084
!"-0.068
,-0.064
.-0.059
 the-0.054
Token mistake
Token ID7457
Feature activation-1.34e-02

Pos logit contributions
!+0.027
!"+0.024
,+0.022
 scared+0.021
.+0.021
Neg logit contributions
utes-0.034
tr-0.030
arl-0.029
osity-0.028
itted-0.028
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
utes+0.044
tr+0.040
arl+0.038
ute+0.037
itted+0.037
Neg logit contributions
!-0.011
!"-0.008
,-0.007
.-0.006
 scared-0.005
Token quickly
Token ID2952
Feature activation-3.91e-03

Pos logit contributions
utes+0.030
tr+0.027
arl+0.026
itted+0.025
ute+0.024
Neg logit contributions
!-0.017
!"-0.015
,-0.013
 scared-0.012
 years-0.012
Token cleaned
Token ID20750
Feature activation-1.63e-02

Pos logit contributions
utes+0.011
tr+0.009
arl+0.009
ute+0.009
osity+0.009
Neg logit contributions
!-0.005
!"-0.004
,-0.004
.-0.004
 scared-0.004
Token dog
Token ID3290
Feature activation+4.67e-02

Pos logit contributions
!+0.003
!"+0.003
,+0.002
 scared+0.002
 years+0.002
Neg logit contributions
utes-0.004
tr-0.003
arl-0.003
der-0.003
itted-0.003
Token was
Token ID373
Feature activation+1.35e-02

Pos logit contributions
!"+0.057
!+0.056
!".+0.046
 years+0.043
."+0.043
Neg logit contributions
tr-0.107
utes-0.100
 conductor-0.092
der-0.091
itted-0.087
Token barking
Token ID48961
Feature activation-1.27e-02

Pos logit contributions
!+0.014
!"+0.012
,+0.010
 years+0.009
 scared+0.009
Neg logit contributions
utes-0.038
tr-0.035
arl-0.033
der-0.032
itted-0.032
Token loudly
Token ID23112
Feature activation-1.14e-03

Pos logit contributions
utes+0.030
tr+0.028
der+0.027
itted+0.026
arl+0.025
Neg logit contributions
!-0.016
!"-0.016
 years-0.013
!".-0.012
,"-0.012
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
utes+0.003
tr+0.003
der+0.003
arl+0.003
itted+0.003
Neg logit contributions
!-0.001
!"-0.001
 years-0.001
 scared-0.001
,-0.001
Token sc
Token ID629
Feature activation-7.45e-02

Pos logit contributions
utes+0.028
tr+0.026
der+0.025
arl+0.024
osity+0.024
Neg logit contributions
!-0.016
!"-0.015
 years-0.013
!".-0.012
e-0.011
Tokenaring
Token ID1723
Feature activation-6.99e-02

Pos logit contributions
utes+0.129
tr+0.114
arl+0.102
der+0.099
itted+0.099
Neg logit contributions
!-0.160
!"-0.149
,-0.134
 years-0.131
 scared-0.129
Token everyone
Token ID2506
Feature activation-4.16e-02

Pos logit contributions
utes+0.182
tr+0.174
arl+0.165
der+0.164
itted+0.160
Neg logit contributions
!-0.068
!"-0.061
 years-0.048
,-0.042
!".-0.039
Token.
Token ID13
Feature activation+3.29e-03

Pos logit contributions
utes+0.091
tr+0.087
der+0.084
osity+0.080
ashed+0.078
Neg logit contributions
!-0.044
!"-0.043
 years-0.041
 trouble-0.036
 search-0.034
Token
Token ID198
Feature activation+3.97e-03

Pos logit contributions
!+0.005
 years+0.005
!"+0.004
e+0.004
 free+0.004
Neg logit contributions
utes-0.007
tr-0.006
der-0.006
arl-0.005
ashed-0.005
Token
Token ID198
Feature activation+3.93e-03

Pos logit contributions
!+0.006
 years+0.005
!"+0.005
 scared+0.004
 trapped+0.004
Neg logit contributions
utes-0.009
tr-0.008
arl-0.007
ute-0.007
Years-0.007
Token toys
Token ID14958
Feature activation+6.18e-03

Pos logit contributions
utes+0.067
tr+0.060
arl+0.055
der+0.055
itted+0.055
Neg logit contributions
!-0.060
!"-0.055
,-0.049
 years-0.047
 scared-0.046
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
!+0.004
!"+0.003
,+0.002
 years+0.001
 scared+0.001
Neg logit contributions
utes-0.020
tr-0.019
arl-0.018
der-0.018
itted-0.018
Token They
Token ID1119
Feature activation-4.78e-03

Pos logit contributions
!+0.012
!"+0.011
,+0.010
.+0.010
 the+0.009
Neg logit contributions
utes-0.015
tr-0.013
ute-0.012
itted-0.012
arl-0.012
Token have
Token ID423
Feature activation+2.83e-02

Pos logit contributions
utes+0.012
tr+0.011
arl+0.010
der+0.010
itted+0.010
Neg logit contributions
!-0.007
!"-0.006
,-0.005
 years-0.005
 scared-0.004
Token a
Token ID257
Feature activation+2.01e-03

Pos logit contributions
!+0.040
!"+0.036
,+0.031
 years+0.030
.+0.029
Neg logit contributions
utes-0.072
tr-0.065
arl-0.061
ute-0.060
osity-0.060
Token car
Token ID1097
Feature activation-5.99e-02

Pos logit contributions
!+0.003
!"+0.002
,+0.002
 scared+0.002
.+0.002
Neg logit contributions
utes-0.005
der-0.005
itted-0.005
osity-0.005
tr-0.004
Token,
Token ID11
Feature activation+3.60e-03

Pos logit contributions
utes+0.194
osity+0.170
tr+0.170
arl+0.169
itted+0.168
Neg logit contributions
!-0.052
!"-0.041
,-0.038
.-0.031
,"-0.020
Token a
Token ID257
Feature activation+4.09e-03

Pos logit contributions
!+0.003
,+0.003
!"+0.003
 the+0.003
.+0.002
Neg logit contributions
utes-0.011
osity-0.010
arl-0.010
ute-0.010
der-0.010
Token doll
Token ID3654
Feature activation-9.59e-03

Pos logit contributions
!+0.007
!"+0.006
,+0.006
.+0.005
 scared+0.005
Neg logit contributions
utes-0.010
itted-0.009
der-0.009
osity-0.008
tr-0.008
Token,
Token ID11
Feature activation+5.86e-03

Pos logit contributions
utes+0.033
tr+0.030
arl+0.029
itted+0.029
der+0.029
Neg logit contributions
!-0.005
!"-0.004
,-0.002
.-0.001
 years-0.001
Token a
Token ID257
Feature activation+5.66e-03

Pos logit contributions
!+0.004
,+0.004
!"+0.003
 the+0.003
.+0.003
Neg logit contributions
utes-0.019
osity-0.017
arl-0.017
ute-0.017
tr-0.017
Token a
Token ID257
Feature activation-5.09e-05

Pos logit contributions
utes+0.005
tr+0.005
der+0.005
itted+0.005
arl+0.005
Neg logit contributions
!-0.003
!"-0.003
,-0.002
 years-0.002
!".-0.002
Token time
Token ID640
Feature activation+8.89e-03

Pos logit contributions
utes+0.000
tr+0.000
arl+0.000
itted+0.000
Years+0.000
Neg logit contributions
!-0.000
!"-0.000
 scared-0.000
!".-0.000
,"-0.000
Token,
Token ID11
Feature activation-6.40e-03

Pos logit contributions
!+0.006
 years+0.006
!"+0.006
 trouble+0.004
 scared+0.004
Neg logit contributions
utes-0.025
tr-0.023
arl-0.023
ute-0.021
itting-0.021
Token a
Token ID257
Feature activation+1.58e-03

Pos logit contributions
utes+0.021
tr+0.020
der+0.019
 chords+0.019
osity+0.019
Neg logit contributions
!-0.004
!"-0.003
,-0.003
 scared-0.002
 the-0.002
Token mysterious
Token ID11428
Feature activation-5.23e-03

Pos logit contributions
!+0.002
!"+0.002
,+0.002
 scared+0.002
.+0.002
Neg logit contributions
utes-0.004
tr-0.004
der-0.003
itted-0.003
osity-0.003
Token rocket
Token ID10701
Feature activation-6.31e-02

Pos logit contributions
utes+0.012
tr+0.011
itted+0.010
arl+0.010
der+0.010
Neg logit contributions
!-0.009
!"-0.008
,-0.007
.-0.006
 scared-0.006
Token flew
Token ID13112
Feature activation-3.08e-02

Pos logit contributions
utes+0.160
tr+0.154
der+0.136
ute+0.135
 conductor+0.133
Neg logit contributions
!-0.072
!"-0.064
 years-0.053
 scared-0.052
 free-0.049
Token up
Token ID510
Feature activation+3.00e-03

Pos logit contributions
utes+0.078
tr+0.072
arl+0.067
der+0.067
itted+0.066
Neg logit contributions
!-0.043
!"-0.038
,-0.032
.-0.029
 scared-0.029
Token in
Token ID287
Feature activation-8.40e-03

Pos logit contributions
!+0.004
!"+0.004
 years+0.003
,+0.003
 scared+0.003
Neg logit contributions
utes-0.007
tr-0.006
der-0.006
itted-0.006
arl-0.006
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
utes+0.023
tr+0.021
arl+0.019
 conductor+0.019
 chords+0.019
Neg logit contributions
!-0.007
!"-0.006
 scared-0.005
 years-0.005
!",-0.004
Token sky
Token ID6766
Feature activation-1.57e-02

Pos logit contributions
!+0.021
!"+0.020
,+0.017
 years+0.017
 scared+0.016
Neg logit contributions
utes-0.023
tr-0.021
arl-0.019
der-0.019
itted-0.019
Token looked
Token ID3114
Feature activation-1.65e-02

Pos logit contributions
utes+0.040
tr+0.037
arl+0.036
itted+0.034
der+0.034
Neg logit contributions
!-0.016
!"-0.014
,-0.011
 years-0.010
 scared-0.010
Token like
Token ID588
Feature activation+1.92e-02

Pos logit contributions
utes+0.041
tr+0.037
arl+0.035
der+0.035
itted+0.034
Neg logit contributions
!-0.022
!"-0.020
 years-0.016
,-0.016
 scared-0.015
Token new
Token ID649
Feature activation-4.53e-03

Pos logit contributions
!+0.019
!"+0.015
,+0.013
 scared+0.011
 the+0.011
Neg logit contributions
utes-0.057
tr-0.054
arl-0.052
itted-0.050
ute-0.049
Token again
Token ID757
Feature activation+7.40e-03

Pos logit contributions
utes+0.012
tr+0.012
der+0.011
itted+0.011
osity+0.010
Neg logit contributions
!-0.004
!"-0.004
 years-0.003
 scared-0.003
!".-0.002
Token!
Token ID0
Feature activation-2.35e-02

Pos logit contributions
!"+0.004
!+0.004
 scared+0.004
 fright+0.003
 panic+0.003
Neg logit contributions
utes-0.020
tr-0.019
der-0.018
oys-0.017
osity-0.017
Token Everyone
Token ID11075
Feature activation-6.60e-02

Pos logit contributions
utes+0.063
tr+0.058
arl+0.054
der+0.054
itted+0.053
Neg logit contributions
!-0.029
!"-0.025
,-0.020
 years-0.019
 scared-0.019
Token cheered
Token ID34767
Feature activation+1.41e-02

Pos logit contributions
utes+0.177
tr+0.164
arl+0.153
der+0.151
itted+0.151
Neg logit contributions
!-0.078
!"-0.069
,-0.055
 years-0.052
 scared-0.050
Token when
Token ID618
Feature activation-1.72e-02

Pos logit contributions
!+0.018
!"+0.016
,+0.013
.+0.012
 years+0.011
Neg logit contributions
utes-0.038
tr-0.036
arl-0.033
itted-0.032
der-0.032
Token they
Token ID484
Feature activation+3.80e-03

Pos logit contributions
utes+0.038
der+0.032
tr+0.032
 conductor+0.031
osity+0.031
Neg logit contributions
!-0.026
!"-0.024
 years-0.020
!".-0.019
e-0.019
Token saw
Token ID2497
Feature activation-2.08e-02

Pos logit contributions
!+0.005
!"+0.005
 years+0.004
,+0.004
!".+0.004
Neg logit contributions
utes-0.009
tr-0.008
der-0.007
itted-0.007
arl-0.007
Token it
Token ID340
Feature activation-4.79e-03

Pos logit contributions
utes+0.064
tr+0.059
arl+0.056
der+0.055
itted+0.055
Neg logit contributions
!-0.017
!"-0.014
,-0.010
 years-0.009
 scared-0.009
TokenDid
Token ID11633
Feature activation-4.08e-02

Pos logit contributions
utes+0.070
tr+0.060
itted+0.058
arl+0.057
osity+0.057
Neg logit contributions
!-0.053
!"-0.047
 years-0.042
 scared-0.041
,-0.039
Token you
Token ID345
Feature activation-3.34e-02

Pos logit contributions
utes+0.111
tr+0.104
der+0.098
arl+0.096
osity+0.095
Neg logit contributions
!-0.041
!"-0.036
 years-0.027
,-0.025
 scared-0.024
Token destroy
Token ID4117
Feature activation-1.77e-02

Pos logit contributions
utes+0.084
tr+0.079
arl+0.073
der+0.072
itted+0.071
Neg logit contributions
!-0.042
!"-0.037
 years-0.030
,-0.030
 scared-0.027
Token my
Token ID616
Feature activation-5.38e-02

Pos logit contributions
utes+0.046
tr+0.043
der+0.041
arl+0.039
ute+0.038
Neg logit contributions
!-0.018
!"-0.016
 years-0.013
 scared-0.012
,-0.012
Token ash
Token ID12530
Feature activation-4.09e-02

Pos logit contributions
utes+0.112
tr+0.104
arl+0.093
der+0.092
itted+0.091
Neg logit contributions
!-0.094
!"-0.086
,-0.075
 years-0.074
 scared-0.071
Tokentr
Token ID2213
Feature activation-2.30e-01

Pos logit contributions
utes+0.016
itted+0.000
arl+0.000
tr+0.000
osity-0.001
Neg logit contributions
!-0.146
!"-0.140
,-0.132
.-0.128
 scared-0.126
Tokenay
Token ID323
Feature activation-3.93e-02

Pos logit contributions
utes+0.363
tr+0.310
arl+0.277
der+0.271
itted+0.269
Neg logit contributions
!-0.529
!"-0.495
,-0.447
 years-0.433
 scared-0.428
Token?"
Token ID1701
Feature activation+5.87e-03

Pos logit contributions
utes+0.112
tr+0.108
arl+0.099
der+0.097
itted+0.097
Neg logit contributions
!-0.036
!"-0.030
 years-0.024
,-0.022
 scared-0.022
Token Lily
Token ID20037
Feature activation-1.36e-03

Pos logit contributions
!+0.008
!"+0.008
,+0.006
 years+0.006
 scared+0.006
Neg logit contributions
utes-0.014
tr-0.013
arl-0.012
der-0.012
itted-0.012
Token felt
Token ID2936
Feature activation-2.26e-02

Pos logit contributions
tr+0.003
utes+0.003
der+0.002
 conductor+0.002
itted+0.002
Neg logit contributions
!-0.002
!"-0.002
!".-0.002
 years-0.002
 free-0.002
Token bad
Token ID2089
Feature activation-1.58e-02

Pos logit contributions
utes+0.043
tr+0.041
der+0.040
itted+0.039
arl+0.038
Neg logit contributions
!-0.037
!"-0.035
!".-0.029
 years-0.029
!",-0.026
Tokenuffy
Token ID15352
Feature activation+7.63e-03

Pos logit contributions
!+0.084
!"+0.079
,+0.071
 years+0.070
 scared+0.069
Neg logit contributions
utes-0.060
tr-0.052
arl-0.046
itted-0.044
ute-0.044
Token playing
Token ID2712
Feature activation-1.32e-02

Pos logit contributions
!+0.009
!"+0.009
 years+0.007
!".+0.006
!",+0.006
Neg logit contributions
utes-0.018
tr-0.018
der-0.016
itted-0.015
arl-0.015
Token with
Token ID351
Feature activation+3.48e-02

Pos logit contributions
utes+0.034
tr+0.032
der+0.030
arl+0.029
itted+0.029
Neg logit contributions
!-0.015
!"-0.013
 years-0.010
 scared-0.009
,-0.009
Token the
Token ID262
Feature activation+9.38e-03

Pos logit contributions
!+0.044
!"+0.042
!".+0.033
 years+0.032
!",+0.031
Neg logit contributions
utes-0.080
tr-0.077
der-0.071
arl-0.070
itted-0.069
Token ash
Token ID12530
Feature activation-4.16e-02

Pos logit contributions
!+0.015
!"+0.013
,+0.012
 scared+0.011
.+0.011
Neg logit contributions
utes-0.022
tr-0.020
arl-0.019
der-0.018
itted-0.018
Tokentr
Token ID2213
Feature activation-2.31e-01

Pos logit contributions
utes+0.038
osity+0.021
arl+0.020
uster+0.019
 chords+0.018
Neg logit contributions
!-0.139
.-0.127
!"-0.127
,-0.126
."-0.114
Tokenay
Token ID323
Feature activation-3.73e-02

Pos logit contributions
utes+0.374
tr+0.319
arl+0.287
der+0.281
itted+0.278
Neg logit contributions
!-0.525
!"-0.489
,-0.443
 years-0.426
 scared-0.421
Token.
Token ID13
Feature activation+6.29e-04

Pos logit contributions
utes+0.124
tr+0.116
arl+0.111
der+0.109
itted+0.109
Neg logit contributions
!-0.021
!"-0.015
,-0.008
 years-0.006
 scared-0.005
Token She
Token ID1375
Feature activation+5.83e-03

Pos logit contributions
!+0.001
!"+0.001
 years+0.001
,+0.001
 scared+0.001
Neg logit contributions
utes-0.001
tr-0.001
der-0.001
arl-0.001
itted-0.001
Token told
Token ID1297
Feature activation-5.31e-03

Pos logit contributions
!+0.007
!"+0.007
,+0.005
 years+0.005
 scared+0.005
Neg logit contributions
utes-0.015
tr-0.014
arl-0.013
der-0.013
itted-0.013
Token Fl
Token ID1610
Feature activation+3.68e-02

Pos logit contributions
utes+0.012
tr+0.012
arl+0.011
der+0.011
itted+0.010
Neg logit contributions
!-0.007
!"-0.006
 years-0.005
 scared-0.005
!".-0.005
Token not
Token ID407
Feature activation+3.18e-02

Pos logit contributions
utes+0.052
tr+0.047
arl+0.044
der+0.043
itted+0.043
Neg logit contributions
!-0.028
!"-0.025
,-0.021
 years-0.020
 scared-0.019
Token played
Token ID2826
Feature activation-2.23e-02

Pos logit contributions
!+0.041
!"+0.036
,+0.030
 scared+0.029
.+0.028
Neg logit contributions
utes-0.086
tr-0.076
arl-0.073
ute-0.072
der-0.071
Token with
Token ID351
Feature activation+3.50e-02

Pos logit contributions
utes+0.057
tr+0.053
der+0.050
arl+0.049
ute+0.047
Neg logit contributions
!-0.026
!"-0.023
 years-0.019
 scared-0.017
,-0.017
Token the
Token ID262
Feature activation+9.84e-03

Pos logit contributions
!+0.061
!"+0.057
 years+0.047
,+0.047
 scared+0.046
Neg logit contributions
utes-0.072
tr-0.064
der-0.059
arl-0.058
itted-0.055
Token ash
Token ID12530
Feature activation-4.14e-02

Pos logit contributions
!+0.016
!"+0.015
,+0.013
 scared+0.013
.+0.013
Neg logit contributions
utes-0.022
tr-0.020
itted-0.019
osity-0.018
arl-0.018
Tokentr
Token ID2213
Feature activation-2.34e-01

Pos logit contributions
utes+0.129
itted+0.112
arl+0.111
osity+0.110
uster+0.108
Neg logit contributions
!-0.040
!"-0.033
,-0.027
.-0.026
."-0.020
Tokenay
Token ID323
Feature activation-3.94e-02

Pos logit contributions
utes+0.364
tr+0.318
arl+0.280
der+0.271
itted+0.269
Neg logit contributions
!-0.540
!"-0.505
,-0.457
 years-0.448
 scared-0.442
Token.
Token ID13
Feature activation+5.91e-03

Pos logit contributions
utes+0.120
tr+0.114
arl+0.106
der+0.106
itted+0.104
Neg logit contributions
!-0.028
!"-0.023
 years-0.016
 scared-0.014
,-0.014
Token They
Token ID1119
Feature activation-5.76e-03

Pos logit contributions
!+0.010
!"+0.009
,+0.008
.+0.008
 scared+0.008
Neg logit contributions
utes-0.013
tr-0.012
arl-0.011
osity-0.011
der-0.011
Token wish
Token ID4601
Feature activation-8.74e-04

Pos logit contributions
utes+0.013
tr+0.012
arl+0.011
der+0.011
itted+0.010
Neg logit contributions
!-0.009
!"-0.008
,-0.007
 years-0.007
!".-0.006
Token they
Token ID484
Feature activation+5.98e-03

Pos logit contributions
utes+0.002
tr+0.002
itted+0.002
arl+0.002
uster+0.002
Neg logit contributions
!-0.001
,-0.001
!"-0.001
.-0.001
 the-0.001
Token.
Token ID13
Feature activation-3.60e-03

Pos logit contributions
utes+0.060
tr+0.057
der+0.053
arl+0.052
itted+0.052
Neg logit contributions
!-0.016
!"-0.014
 years-0.010
 scared-0.009
,-0.009
Token Tim
Token ID5045
Feature activation+3.60e-02

Pos logit contributions
utes+0.009
tr+0.008
arl+0.008
der+0.008
itted+0.008
Neg logit contributions
!-0.005
!"-0.004
,-0.004
 years-0.004
 scared-0.003
Token felt
Token ID2936
Feature activation-2.32e-02

Pos logit contributions
!+0.045
!"+0.040
 years+0.034
 trouble+0.030
!".+0.030
Neg logit contributions
utes-0.084
tr-0.078
der-0.072
 conductor-0.072
itted-0.071
Token even
Token ID772
Feature activation-2.06e-02

Pos logit contributions
utes+0.046
der+0.042
itted+0.041
tr+0.040
arl+0.040
Neg logit contributions
!-0.034
!"-0.032
 years-0.025
!".-0.024
!",-0.023
Token sad
Token ID6507
Feature activation-2.10e-02

Pos logit contributions
utes+0.049
tr+0.045
arl+0.041
der+0.041
itted+0.041
Neg logit contributions
!-0.031
!"-0.028
,-0.023
 years-0.022
 scared-0.022
Tokender
Token ID1082
Feature activation-1.72e-01

Pos logit contributions
utes+0.063
tr+0.058
arl+0.054
itted+0.053
osity+0.052
Neg logit contributions
!-0.024
!"-0.020
,-0.018
.-0.016
 scared-0.013
Token now
Token ID783
Feature activation-3.98e-03

Pos logit contributions
utes+0.444
der+0.428
tr+0.395
arl+0.387
itted+0.386
Neg logit contributions
!-0.163
!"-0.153
 scared-0.106
 years-0.104
e-0.102
Token,
Token ID11
Feature activation+5.03e-04

Pos logit contributions
utes+0.010
tr+0.009
der+0.009
arl+0.009
itted+0.009
Neg logit contributions
!-0.005
!"-0.004
 years-0.003
,-0.003
 scared-0.003
Token and
Token ID290
Feature activation-1.13e-02

Pos logit contributions
!+0.001
!"+0.001
,+0.000
 years+0.000
 scared+0.000
Neg logit contributions
utes-0.001
tr-0.001
arl-0.001
der-0.001
itted-0.001
Token he
Token ID339
Feature activation+1.91e-02

Pos logit contributions
utes+0.028
tr+0.026
der+0.025
arl+0.024
itted+0.024
Neg logit contributions
!-0.014
!"-0.013
,-0.010
 years-0.010
 scared-0.009
Token cried
Token ID16896
Feature activation+3.08e-02

Pos logit contributions
!+0.030
!"+0.028
,+0.022
 years+0.022
!".+0.022
Neg logit contributions
utes-0.039
tr-0.038
der-0.035
 conductor-0.033
itted-0.033
Token.
Token ID13
Feature activation+5.34e-03

Pos logit contributions
!+0.004
!"+0.003
 years+0.002
,+0.001
 scared+0.001
Neg logit contributions
utes-0.023
tr-0.022
der-0.021
arl-0.021
itted-0.020
Token\n
Token ID198
Feature activation+7.10e-03

Pos logit contributions
!+0.006
!"+0.005
 years+0.004
,+0.004
 scared+0.004
Neg logit contributions
utes-0.015
tr-0.014
arl-0.013
der-0.013
itted-0.013
Token\n
Token ID198
Feature activation+7.01e-03

Pos logit contributions
!+0.010
!"+0.008
 years+0.008
 scared+0.007
e+0.007
Neg logit contributions
utes-0.017
arl-0.015
tr-0.014
der-0.013
ute-0.013
TokenRex
Token ID47389
Feature activation-4.05e-02

Pos logit contributions
!+0.013
!"+0.012
,+0.010
 years+0.010
 scared+0.010
Neg logit contributions
utes-0.014
tr-0.013
arl-0.012
oys-0.011
itted-0.011
Token sn
Token ID3013
Feature activation+2.43e-02

Pos logit contributions
utes+0.083
tr+0.083
der+0.079
arl+0.071
ashed+0.071
Neg logit contributions
!-0.058
!"-0.053
 years-0.049
!".-0.044
!",-0.044
Tokenarl
Token ID7063
Feature activation-1.80e-01

Pos logit contributions
!+0.050
!"+0.047
,+0.042
.+0.040
 years+0.039
Neg logit contributions
utes-0.045
tr-0.040
der-0.036
osity-0.035
itted-0.035
Tokens
Token ID82
Feature activation+1.89e-02

Pos logit contributions
utes+0.451
tr+0.435
osity+0.398
der+0.387
Years+0.384
Neg logit contributions
!-0.236
!"-0.215
,-0.195
.-0.174
."-0.153
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
!+0.027
!"+0.024
,+0.020
 years+0.019
 scared+0.018
Neg logit contributions
utes-0.047
tr-0.043
arl-0.040
der-0.039
itted-0.039
Token He
Token ID679
Feature activation-1.84e-02

Pos logit contributions
!+0.009
!"+0.008
,+0.007
 scared+0.007
.+0.006
Neg logit contributions
utes-0.016
tr-0.015
arl-0.014
itted-0.013
osity-0.013
Token does
Token ID857
Feature activation+4.25e-03

Pos logit contributions
utes+0.041
tr+0.039
der+0.037
arl+0.035
itted+0.034
Neg logit contributions
!-0.028
!"-0.025
 years-0.021
,-0.020
!".-0.020
Token not
Token ID407
Feature activation+3.25e-02

Pos logit contributions
!+0.005
!"+0.005
 years+0.004
,+0.004
 scared+0.004
Neg logit contributions
utes-0.011
tr-0.010
arl-0.009
der-0.009
itted-0.009