TOP ACTIVATIONS
MAX = 5.431

and car away and c umin and cor i ander and
" He was delayed and hasn 't come back yet ."
Daisy â s fingertips . She stopped and watched
working in the fields and doesn 't have time to come
The bread smells delicious , doesn 't it ?" Kim nodded
, " B eware ! Don 't eat the r asp
. Now she doesn 't miss hearing the strange
his eyes and smiled . Pret ending was so much fun
my replied , " He doesn 't have a home to
's always been brilliant and hasn 't let me down .
you always so lazy ? Don 't you want to have
said " go away ! Don 't disturb this lovely little
make must stay outside . Don 't bring in any spoiled
. You can call me Crystal . â
him , John . He doesn 't know any better ,"
a lot of value , doesn 't mean it 's not
she said softly . " Please take care of me and
looks very far away . Don 't forget , you can
sweet heart , Mom my doesn 't have a torch ",
it does look weird , doesn 't it ?" They watched

MIN ACTIVATIONS
MIN = -6.830

dolls around in a wheel bar row she had found in
can spell better than you " or " You spelled that
and " I love you " and " cr acker ".
wrote " We love you " with their cr ay ons
He said " thank you " to Tom and ran home
" and " thank you " a lot .
it said " thank you " to the hipp o .
say " B less you " to anyone who sne ez
writes " I love you " on his paper . He
to sing with you more " They sing more
and said " thank you " to her mom . She
" and " thank you " for everything .
wrote " I love you " on it . He put
likes to say " hello " and " I love you
and " I love you " on their pictures . They
naughty and did not share " and included a smile y
and " I love you " on their pictures .
it away in a wheel bar row . Once
filled up lots of wheel bar rows with the waste and
so he said " yes " and went to find his

INTERVAL 2.366 - 5.431
CONTAINS 1.427%

" Hello , goat . What 's your name ?" Lily
said . " Really ? A witch ?" their
She said goodbye to her t eddy bear and promised to
Tom opened it and saw a new superhero costume . It
a big door that she hadn 't seen before and decided

INTERVAL -0.700 - 2.366
CONTAINS 72.995%

didn 't work , and his mom found out what he
don 't eat me , Mr . Bear !" the hunter
came out . Max and Sam were happy . They played
throws some bread on the ground . Ben does the same
even in the kitchen . But he could not find the

INTERVAL -3.765 - -0.700
CONTAINS 25.498%

The little girl felt much better after that . She
't get out of the cave . The blanket couldn 't
Tim my 's mom then said it was time to go
and yellow . They watch the sunrise for a
looked forward to trying new kinds of cake together . <|endoftext|>

INTERVAL -6.830 - -3.765
CONTAINS 0.080%

have it . He smiled as he saw all the animals
good idea . We should always pick the right word to
did not notice what Max did at first . They were
" Good idea ," Mia says . She helps
She didn 't know what to do ! Suddenly
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
+0.083
 mid+0.059
 dismay+0.059
itz+0.058
 �+0.058
Neg logit contributions
glers-0.116
bel-0.112
tub-0.103
kins-0.102
limits-0.102
Token car
Token ID1097
Feature activation+9.33e-01

Pos logit contributions
+0.026
itz+0.019
 �+0.019
 dismay+0.019
 mid+0.019
Neg logit contributions
glers-0.031
bel-0.030
kins-0.027
limits-0.027
tub-0.027
Tokenaway
Token ID8272
Feature activation+1.67e+00

Pos logit contributions
+0.022
 mid+0.016
 dismay+0.016
 �+0.016
 Will+0.016
Neg logit contributions
glers-0.029
bel-0.029
tub-0.026
kins-0.026
limits-0.026
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
+0.042
 mid+0.031
 dismay+0.030
 �+0.030
 Will+0.030
Neg logit contributions
glers-0.050
bel-0.050
kins-0.045
limits-0.044
tub-0.043
Token c
Token ID269
Feature activation+2.58e+00

Pos logit contributions
+0.025
 �+0.019
itz+0.019
 mid+0.019
 dismay+0.019
Neg logit contributions
glers-0.031
bel-0.030
kins-0.027
limits-0.027
tub-0.027
Tokenumin
Token ID7230
Feature activation+5.43e+00

Pos logit contributions
+0.090
 mid+0.071
 Will+0.071
 �+0.070
 dismay+0.070
Neg logit contributions
glers-0.055
bel-0.053
kins-0.046
tub-0.046
limits-0.045
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
+0.126
 dismay+0.090
 mid+0.089
 �+0.087
itz+0.087
Neg logit contributions
glers-0.173
bel-0.169
tub-0.154
kins-0.154
limits-0.153
Token cor
Token ID1162
Feature activation+2.53e+00

Pos logit contributions
+0.026
 �+0.019
itz+0.019
 mid+0.019
 dismay+0.019
Neg logit contributions
glers-0.033
bel-0.032
kins-0.029
limits-0.029
tub-0.029
Tokeni
Token ID72
Feature activation+2.72e+00

Pos logit contributions
+0.063
 dismay+0.050
itz+0.048
 delight+0.048
 �+0.048
Neg logit contributions
glers-0.072
bel-0.070
akers-0.062
kins-0.060
ards-0.060
Tokenander
Token ID4066
Feature activation+2.30e+00

Pos logit contributions
+0.063
 mid+0.045
 dismay+0.044
 �+0.044
 Will+0.044
Neg logit contributions
glers-0.087
bel-0.084
kins-0.077
tub-0.077
limits-0.077
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
+0.054
 mid+0.039
 dismay+0.039
itz+0.038
 Will+0.038
Neg logit contributions
glers-0.072
bel-0.071
tub-0.064
kins-0.064
limits-0.064
Token "
Token ID366
Feature activation+1.50e+00

Pos logit contributions
+0.024
 fingert+0.018
 mid+0.018
 farewell+0.018
 dismay+0.018
Neg logit contributions
glers-0.030
bel-0.030
tub-0.028
limits-0.027
kins-0.027
TokenHe
Token ID1544
Feature activation+2.40e-01

Pos logit contributions
+0.040
 mid+0.030
 dismay+0.030
 �+0.030
 farewell+0.030
Neg logit contributions
glers-0.042
bel-0.041
limits-0.037
tub-0.037
kins-0.036
Token was
Token ID373
Feature activation+6.13e-01

Pos logit contributions
+0.006
 Will+0.004
 �+0.004
 dismay+0.004
 mid+0.004
Neg logit contributions
glers-0.008
bel-0.008
tub-0.007
kins-0.007
limits-0.007
Token delayed
Token ID11038
Feature activation-3.48e-01

Pos logit contributions
+0.016
 �+0.011
 mid+0.011
 Will+0.011
 dismay+0.011
Neg logit contributions
glers-0.019
bel-0.018
kins-0.017
tub-0.016
limits-0.016
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
glers+0.012
bel+0.011
kins+0.010
tub+0.010
limits+0.010
Neg logit contributions
-0.008
 mid-0.005
 Will-0.005
 �-0.005
 dismay-0.005
Token hasn
Token ID5818
Feature activation+5.04e+00

Pos logit contributions
+0.032
 Will+0.024
 �+0.023
 dismay+0.022
 mid+0.021
Neg logit contributions
glers-0.031
bel-0.029
kins-0.028
tub-0.027
akers-0.026
Token't
Token ID470
Feature activation-2.07e-01

Pos logit contributions
+0.129
 delight+0.117
 fingert+0.116
Mine+0.115
 amaz+0.115
Neg logit contributions
glers-0.119
bel-0.117
limits-0.115
ained-0.106
tub-0.106
Token come
Token ID1282
Feature activation+1.76e+00

Pos logit contributions
glers+0.006
bel+0.006
limits+0.006
tub+0.006
kins+0.006
Neg logit contributions
-0.005
 �-0.004
 mid-0.004
 fingert-0.004
 dismay-0.004
Token back
Token ID736
Feature activation+1.44e+00

Pos logit contributions
+0.041
 Will+0.027
 �+0.027
 mid+0.027
 farewell+0.027
Neg logit contributions
glers-0.059
bel-0.056
kins-0.053
tub-0.052
limits-0.051
Token yet
Token ID1865
Feature activation+1.47e+00

Pos logit contributions
+0.036
 Will+0.026
 mid+0.025
 dismay+0.025
 �+0.025
Neg logit contributions
glers-0.045
bel-0.042
kins-0.040
tub-0.039
limits-0.039
Token."
Token ID526
Feature activation-7.26e-02

Pos logit contributions
+0.038
 Will+0.027
 �+0.027
 mid+0.027
 dismay+0.027
Neg logit contributions
glers-0.045
bel-0.043
kins-0.039
tub-0.039
limits-0.038
Token Daisy
Token ID40355
Feature activation+2.79e+00

Pos logit contributions
glers+0.024
bel+0.022
kins+0.021
tub+0.021
limits+0.020
Neg logit contributions
-0.020
 �-0.014
 farewell-0.014
 fingert-0.014
 Will-0.013
Tokenâ
Token ID22940
Feature activation-9.00e-01

Pos logit contributions
+0.057
 farewell+0.041
Mine+0.041
 mid+0.041
 Will+0.040
Neg logit contributions
glers-0.092
bel-0.089
tub-0.083
kins-0.082
limits-0.081
Token
Token ID26391
Feature activation+2.08e+00

Pos logit contributions
glers+0.023
bel+0.021
limits+0.021
asis+0.020
tub+0.020
Neg logit contributions
-0.026
 Will-0.021
 dismay-0.020
 mid-0.020
 fingert-0.020
Token
Token ID8151
Feature activation+1.15e+00

Pos logit contributions
+0.057
 �+0.042
 mid+0.042
 Will+0.041
 dismay+0.041
Neg logit contributions
glers-0.060
bel-0.057
kins-0.051
tub-0.051
limits-0.051
Tokens
Token ID82
Feature activation+2.08e+00

Pos logit contributions
+0.027
 mid+0.021
 Will+0.021
 delight+0.021
 dismay+0.021
Neg logit contributions
glers-0.035
bel-0.033
tub-0.031
kins-0.031
ained-0.030
Token fingertips
Token ID40142
Feature activation+4.99e+00

Pos logit contributions
+0.042
 Will+0.029
AY+0.028
 Pret+0.027
itz+0.027
Neg logit contributions
glers-0.071
bel-0.071
tub-0.065
kins-0.065
limits-0.062
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
+0.124
 farewell+0.099
 persever+0.099
 Pret+0.098
 dismay+0.097
Neg logit contributions
glers-0.140
bel-0.134
tub-0.124
kins-0.123
limits-0.121
Token She
Token ID1375
Feature activation+3.71e-01

Pos logit contributions
+0.027
 mid+0.021
 fingert+0.021
 farewell+0.020
 dismay+0.020
Neg logit contributions
glers-0.027
bel-0.026
tub-0.025
limits-0.024
kins-0.023
Token stopped
Token ID5025
Feature activation+1.07e+00

Pos logit contributions
+0.009
AY+0.009
 farewell+0.008
Mine+0.008
 Pret+0.008
Neg logit contributions
bel-0.009
glers-0.009
kins-0.008
ards-0.008
limits-0.007
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
+0.028
 farewell+0.022
AY+0.022
Mine+0.022
nown+0.021
Neg logit contributions
glers-0.029
bel-0.029
limits-0.025
tub-0.025
kins-0.025
Token watched
Token ID7342
Feature activation+1.78e+00

Pos logit contributions
+0.035
AY+0.027
 mid+0.027
itz+0.027
 fingert+0.027
Neg logit contributions
glers-0.037
bel-0.036
limits-0.033
kins-0.032
tub-0.032
Token working
Token ID1762
Feature activation-1.67e+00

Pos logit contributions
glers+0.018
bel+0.016
tub+0.015
kins+0.015
limits+0.015
Neg logit contributions
-0.013
 Will-0.009
 three-0.009
 mid-0.009
 eight-0.008
Token in
Token ID287
Feature activation+9.09e-01

Pos logit contributions
glers+0.057
bel+0.053
tub+0.050
asis+0.049
limits+0.049
Neg logit contributions
-0.038
 Will-0.026
 �-0.025
 mid-0.025
 farewell-0.025
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
+0.022
 �+0.016
 mid+0.016
 Will+0.015
 dismay+0.015
Neg logit contributions
glers-0.029
bel-0.027
kins-0.025
tub-0.025
limits-0.025
Token fields
Token ID7032
Feature activation-7.24e-02

Pos logit contributions
glers+0.043
bel+0.042
kins+0.037
tub+0.036
limits+0.035
Neg logit contributions
-0.063
 �-0.049
 mid-0.049
 dismay-0.049
 farewell-0.049
Token and
Token ID290
Feature activation+6.38e-01

Pos logit contributions
glers+0.003
bel+0.003
limits+0.002
kins+0.002
angers+0.002
Neg logit contributions
-0.001
 three-0.001
 �-0.001
 mid-0.001
 Will-0.001
Token doesn
Token ID1595
Feature activation+4.99e+00

Pos logit contributions
+0.018
 Will+0.013
 �+0.013
 mid+0.013
 dismay+0.012
Neg logit contributions
glers-0.019
bel-0.017
kins-0.017
tub-0.016
akers-0.016
Token't
Token ID470
Feature activation-8.76e-01

Pos logit contributions
+0.132
 persever+0.120
 fingert+0.118
 amaz+0.117
 delight+0.117
Neg logit contributions
glers-0.123
bel-0.110
tub-0.106
limits-0.102
ained-0.099
Token have
Token ID423
Feature activation-8.90e-01

Pos logit contributions
glers+0.030
bel+0.029
kins+0.028
tub+0.027
cards+0.026
Neg logit contributions
-0.021
 Will-0.014
 mid-0.013
 dismay-0.013
 farewell-0.013
Token time
Token ID640
Feature activation+1.54e-01

Pos logit contributions
bel+0.023
glers+0.023
limits+0.021
ained+0.020
tub+0.020
Neg logit contributions
-0.024
itz-0.019
 �-0.019
 fingert-0.018
 delight-0.018
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
+0.003
 Will+0.002
 farewell+0.002
 �+0.002
 mid+0.002
Neg logit contributions
glers-0.005
bel-0.005
kins-0.005
tub-0.005
limits-0.005
Token come
Token ID1282
Feature activation+1.85e+00

Pos logit contributions
glers+0.038
bel+0.036
kins+0.033
tub+0.033
limits+0.033
Neg logit contributions
-0.028
 mid-0.020
 �-0.020
 Will-0.020
 dismay-0.020
Token The
Token ID383
Feature activation+5.68e-01

Pos logit contributions
+0.039
 mid+0.027
 dismay+0.027
 fingert+0.026
 �+0.026
Neg logit contributions
glers-0.063
bel-0.061
tub-0.056
kins-0.056
limits-0.055
Token bread
Token ID8509
Feature activation+1.13e+00

Pos logit contributions
+0.017
 �+0.012
 mid+0.012
 delight+0.012
 dismay+0.012
Neg logit contributions
glers-0.016
bel-0.015
limits-0.014
tub-0.013
kins-0.013
Token smells
Token ID25760
Feature activation+9.38e-01

Pos logit contributions
+0.027
 �+0.017
 mid+0.017
 fingert+0.016
 delight+0.016
Neg logit contributions
bel-0.037
glers-0.037
limits-0.035
tub-0.033
ained-0.033
Token delicious
Token ID12625
Feature activation+5.85e-01

Pos logit contributions
+0.023
 mid+0.016
 �+0.016
 Will+0.016
 dismay+0.016
Neg logit contributions
glers-0.029
bel-0.028
kins-0.026
tub-0.026
limits-0.026
Token,
Token ID11
Feature activation-1.68e-01

Pos logit contributions
+0.014
 dismay+0.011
 mid+0.011
 farewell+0.010
itz+0.010
Neg logit contributions
glers-0.018
bel-0.017
kins-0.016
tub-0.016
limits-0.015
Token doesn
Token ID1595
Feature activation+4.89e+00

Pos logit contributions
glers+0.006
bel+0.006
limits+0.005
tub+0.005
kins+0.005
Neg logit contributions
-0.004
 �-0.003
 Will-0.003
 mid-0.003
 fingert-0.002
Token't
Token ID470
Feature activation-6.17e-01

Pos logit contributions
+0.129
 farewell+0.121
 dismay+0.120
 persever+0.119
 amaz+0.118
Neg logit contributions
glers-0.114
bel-0.103
tub-0.102
kins-0.095
limits-0.093
Token it
Token ID340
Feature activation+1.11e+00

Pos logit contributions
glers+0.020
bel+0.019
tub+0.017
kins+0.017
limits+0.017
Neg logit contributions
-0.015
 Will-0.011
 mid-0.011
 �-0.011
 dismay-0.010
Token?"
Token ID1701
Feature activation-1.89e-01

Pos logit contributions
+0.030
 mid+0.022
 �+0.022
 Will+0.022
 dismay+0.022
Neg logit contributions
glers-0.032
bel-0.031
kins-0.028
tub-0.028
limits-0.027
Token Kim
Token ID6502
Feature activation+3.83e-01

Pos logit contributions
glers+0.005
bel+0.005
tub+0.004
kins+0.004
limits+0.004
Neg logit contributions
-0.005
 mid-0.004
 �-0.004
 Will-0.004
 dismay-0.004
Token nodded
Token ID14464
Feature activation-4.36e-01

Pos logit contributions
 persever+0.010
+0.010
 farewell+0.010
AY+0.009
Mine+0.009
Neg logit contributions
glers-0.008
bel-0.008
kins-0.007
ards-0.006
limits-0.006
Token,
Token ID11
Feature activation+5.47e-01

Pos logit contributions
glers+0.011
bel+0.011
tub+0.010
kins+0.010
limits+0.010
Neg logit contributions
-0.010
itz-0.008
 dismay-0.008
 mid-0.007
 fingert-0.007
Token "
Token ID366
Feature activation+1.05e+00

Pos logit contributions
+0.014
 dismay+0.012
 persever+0.012
itz+0.012
AY+0.011
Neg logit contributions
bel-0.014
glers-0.014
tub-0.013
kins-0.013
avers-0.012
TokenB
Token ID33
Feature activation+1.02e+00

Pos logit contributions
+0.027
 dismay+0.023
 fingertips+0.023
 fingert+0.023
 farewell+0.022
Neg logit contributions
glers-0.029
bel-0.027
limits-0.024
tub-0.023
avers-0.023
Tokeneware
Token ID29725
Feature activation+1.77e+00

Pos logit contributions
+0.029
 dismay+0.024
 delight+0.023
 mid+0.023
 �+0.023
Neg logit contributions
glers-0.027
bel-0.025
tub-0.023
akers-0.023
kins-0.023
Token!
Token ID0
Feature activation+2.57e+00

Pos logit contributions
+0.043
 mid+0.031
 �+0.031
 dismay+0.031
 Will+0.030
Neg logit contributions
glers-0.055
bel-0.053
tub-0.049
kins-0.049
limits-0.048
Token Don
Token ID2094
Feature activation+4.86e+00

Pos logit contributions
+0.055
 fingert+0.041
 mid+0.041
 dismay+0.041
itz+0.040
Neg logit contributions
glers-0.082
bel-0.079
tub-0.074
limits-0.073
kins-0.071
Token't
Token ID470
Feature activation+6.95e-02

Pos logit contributions
+0.119
 fingert+0.095
 dismay+0.095
 mid+0.094
 delight+0.094
Neg logit contributions
glers-0.143
bel-0.135
limits-0.123
tub-0.122
kins-0.121
Token eat
Token ID4483
Feature activation+3.19e-01

Pos logit contributions
+0.002
 �+0.001
 farewell+0.001
 fingert+0.001
itz+0.001
Neg logit contributions
glers-0.002
bel-0.002
limits-0.002
tub-0.002
asis-0.002
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
+0.008
 mid+0.006
 �+0.006
 Will+0.006
 dismay+0.006
Neg logit contributions
glers-0.010
bel-0.010
tub-0.009
kins-0.009
limits-0.009
Token r
Token ID374
Feature activation-1.96e+00

Pos logit contributions
glers+0.051
bel+0.051
limits+0.047
kins+0.045
tub+0.045
Neg logit contributions
-0.042
 mid-0.030
 �-0.030
 Will-0.029
 delight-0.029
Tokenasp
Token ID5126
Feature activation+2.45e+00

Pos logit contributions
bel+0.049
glers+0.048
limits+0.045
kins+0.041
tub+0.040
Neg logit contributions
-0.056
itz-0.045
AY-0.044
 fingert-0.043
 mid-0.043
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
glers+0.030
bel+0.029
tub+0.027
limits+0.027
kins+0.027
Neg logit contributions
-0.021
 �-0.014
 mid-0.014
 dismay-0.013
 farewell-0.013
Token
Token ID198
Feature activation+9.57e-01

Pos logit contributions
+0.023
 mid+0.018
 fingert+0.018
 dismay+0.017
 tr+0.017
Neg logit contributions
glers-0.027
bel-0.027
tub-0.025
limits-0.025
kins-0.024
Token
Token ID198
Feature activation+9.94e-01

Pos logit contributions
+0.022
 mid+0.016
 dismay+0.016
AY+0.016
 fingert+0.016
Neg logit contributions
glers-0.030
bel-0.029
tub-0.027
kins-0.027
limits-0.026
TokenNow
Token ID3844
Feature activation+1.85e+00

Pos logit contributions
+0.026
 mid+0.020
 Will+0.020
 dismay+0.020
 fingert+0.020
Neg logit contributions
glers-0.028
bel-0.028
limits-0.025
kins-0.024
tub-0.024
Token she
Token ID673
Feature activation+1.66e+00

Pos logit contributions
+0.047
 �+0.033
 Will+0.032
 dismay+0.032
 farewell+0.032
Neg logit contributions
glers-0.058
bel-0.056
tub-0.052
kins-0.051
limits-0.051
Token doesn
Token ID1595
Feature activation+4.84e+00

Pos logit contributions
+0.041
itz+0.032
 mid+0.032
 �+0.032
 fingert+0.032
Neg logit contributions
glers-0.048
bel-0.046
kins-0.042
tub-0.042
limits-0.041
Token't
Token ID470
Feature activation-4.65e-01

Pos logit contributions
+0.115
 dismay+0.103
 farewell+0.101
 fingertips+0.101
 delight+0.101
Neg logit contributions
glers-0.126
bel-0.118
tub-0.113
limits-0.108
kins-0.107
Token miss
Token ID2051
Feature activation+6.93e-01

Pos logit contributions
glers+0.013
bel+0.013
limits+0.012
tub+0.012
kins+0.011
Neg logit contributions
-0.012
 mid-0.009
 �-0.009
itz-0.009
 fingert-0.009
Token hearing
Token ID4854
Feature activation+7.06e-01

Pos logit contributions
+0.017
 fingert+0.013
itz+0.013
 mid+0.013
AY+0.013
Neg logit contributions
glers-0.020
bel-0.019
kins-0.018
limits-0.018
tub-0.017
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
+0.017
 mid+0.013
itz+0.013
 fingert+0.013
AY+0.013
Neg logit contributions
glers-0.020
bel-0.020
kins-0.018
limits-0.018
tub-0.018
Token strange
Token ID6283
Feature activation+9.26e-01

Pos logit contributions
bel+0.003
glers+0.003
tub+0.003
kins+0.003
cards+0.003
Neg logit contributions
-0.004
 fingert-0.003
itz-0.003
 Will-0.003
 �-0.003
Token his
Token ID465
Feature activation+6.88e-02

Pos logit contributions
+0.025
itz+0.019
 fingert+0.019
 Will+0.019
 dismay+0.018
Neg logit contributions
glers-0.025
bel-0.024
tub-0.021
kins-0.021
limits-0.021
Token eyes
Token ID2951
Feature activation+2.60e+00

Pos logit contributions
+0.002
Mine+0.001
 Will+0.001
itz+0.001
H+0.001
Neg logit contributions
bel-0.002
glers-0.002
asis-0.002
kins-0.002
tub-0.002
Token and
Token ID290
Feature activation+1.69e+00

Pos logit contributions
+0.064
Mine+0.056
 persever+0.055
 Pret+0.054
 dismay+0.054
Neg logit contributions
glers-0.070
bel-0.067
tub-0.062
kins-0.059
limits-0.059
Token smiled
Token ID13541
Feature activation+1.29e+00

Pos logit contributions
+0.047
itz+0.035
 mid+0.035
 dismay+0.035
 �+0.035
Neg logit contributions
glers-0.046
bel-0.045
kins-0.040
tub-0.040
limits-0.039
Token.
Token ID13
Feature activation+4.07e-01

Pos logit contributions
+0.033
 dismay+0.024
Mine+0.024
itz+0.024
 mid+0.023
Neg logit contributions
bel-0.038
glers-0.038
tub-0.034
kins-0.033
limits-0.032
Token Pret
Token ID37123
Feature activation+4.83e+00

Pos logit contributions
+0.013
 dismay+0.011
itz+0.011
 fingert+0.011
 mid+0.011
Neg logit contributions
glers-0.007
bel-0.007
tub-0.007
limits-0.006
ained-0.006
Tokenending
Token ID1571
Feature activation+1.32e+00

Pos logit contributions
+0.112
 dismay+0.102
 t+0.099
 delight+0.096
 ear+0.095
Neg logit contributions
bel-0.143
glers-0.137
akers-0.133
avers-0.128
limits-0.128
Token was
Token ID373
Feature activation+3.29e-01

Pos logit contributions
+0.031
 �+0.022
 Will+0.021
 mid+0.021
 dismay+0.021
Neg logit contributions
glers-0.043
bel-0.042
limits-0.039
tub-0.038
kins-0.038
Token so
Token ID523
Feature activation-3.36e-02

Pos logit contributions
+0.008
 mid+0.005
 Will+0.005
 dismay+0.005
itz+0.005
Neg logit contributions
glers-0.010
bel-0.010
tub-0.009
kins-0.009
limits-0.009
Token much
Token ID881
Feature activation+3.27e-01

Pos logit contributions
glers+0.001
bel+0.001
kins+0.001
tub+0.001
limits+0.001
Neg logit contributions
-0.001
 �-0.001
 mid-0.001
 Will-0.001
 dismay-0.001
Token fun
Token ID1257
Feature activation-8.04e-01

Pos logit contributions
+0.008
 mid+0.006
 Will+0.006
 dismay+0.006
itz+0.006
Neg logit contributions
glers-0.010
bel-0.009
tub-0.008
kins-0.008
limits-0.008
Tokenmy
Token ID1820
Feature activation+6.29e-02

Pos logit contributions
glers+0.032
bel+0.028
kins+0.025
limits+0.025
akers+0.024
Neg logit contributions
-0.047
 farewell-0.039
 fingert-0.039
AY-0.039
 mid-0.039
Token replied
Token ID8712
Feature activation-4.63e-01

Pos logit contributions
+0.002
AY+0.002
 fingert+0.001
 Pret+0.001
 persever+0.001
Neg logit contributions
glers-0.002
bel-0.001
kins-0.001
limits-0.001
ards-0.001
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
bel+0.012
glers+0.012
kins+0.011
tub+0.011
limits+0.011
Neg logit contributions
-0.013
AY-0.010
 mid-0.010
itz-0.010
 dismay-0.010
Token "
Token ID366
Feature activation+9.80e-01

Pos logit contributions
+0.003
 dismay+0.002
AY+0.002
 mid+0.002
 farewell+0.002
Neg logit contributions
glers-0.003
bel-0.003
tub-0.003
kins-0.003
limits-0.003
TokenHe
Token ID1544
Feature activation+2.22e-01

Pos logit contributions
+0.023
 Will+0.015
 �+0.015
itz+0.014
Mine+0.014
Neg logit contributions
glers-0.033
bel-0.032
kins-0.030
tub-0.030
asis-0.029
Token doesn
Token ID1595
Feature activation+4.83e+00

Pos logit contributions
+0.004
 Will+0.003
 dismay+0.002
 �+0.002
 farewell+0.002
Neg logit contributions
glers-0.008
bel-0.008
tub-0.008
kins-0.007
limits-0.007
Token't
Token ID470
Feature activation-5.22e-01

Pos logit contributions
+0.133
 farewell+0.131
 amaz+0.128
 dismay+0.128
 persever+0.128
Neg logit contributions
glers-0.104
tub-0.088
bel-0.084
ics-0.080
limits-0.080
Token have
Token ID423
Feature activation-1.40e-01

Pos logit contributions
glers+0.017
bel+0.016
kins+0.015
tub+0.015
limits+0.014
Neg logit contributions
-0.013
 Will-0.009
 mid-0.009
 dismay-0.009
 �-0.009
Token a
Token ID257
Feature activation-9.30e-01

Pos logit contributions
glers+0.004
bel+0.004
limits+0.003
tub+0.003
akers+0.003
Neg logit contributions
-0.004
itz-0.003
 �-0.003
 fingert-0.003
 mid-0.003
Token home
Token ID1363
Feature activation-1.16e+00

Pos logit contributions
glers+0.026
bel+0.025
tub+0.024
asis+0.022
kins+0.022
Neg logit contributions
-0.025
itz-0.019
 mid-0.018
 Will-0.018
AY-0.018
Token to
Token ID284
Feature activation-9.06e-01

Pos logit contributions
glers+0.040
bel+0.038
limits+0.036
kins+0.036
tub+0.035
Neg logit contributions
-0.025
 Will-0.016
 �-0.016
 mid-0.015
 three-0.015
Token's
Token ID338
Feature activation+9.17e-01

Pos logit contributions
+0.033
 mid+0.022
 �+0.022
 Will+0.022
 dismay+0.022
Neg logit contributions
glers-0.053
bel-0.051
kins-0.047
tub-0.047
limits-0.047
Token always
Token ID1464
Feature activation-7.29e-01

Pos logit contributions
+0.023
 mid+0.016
 Will+0.016
 �+0.016
 dismay+0.016
Neg logit contributions
glers-0.028
bel-0.027
kins-0.025
tub-0.025
limits-0.025
Token been
Token ID587
Feature activation-7.95e-01

Pos logit contributions
glers+0.023
bel+0.022
tub+0.020
kins+0.020
limits+0.020
Neg logit contributions
-0.018
 �-0.013
 mid-0.013
 Will-0.013
 dismay-0.013
Token brilliant
Token ID10457
Feature activation+1.42e+00

Pos logit contributions
glers+0.025
bel+0.025
limits+0.022
tub+0.022
kins+0.022
Neg logit contributions
-0.020
 Will-0.014
 mid-0.014
 �-0.014
 dismay-0.013
Token and
Token ID290
Feature activation+1.32e-01

Pos logit contributions
+0.034
 Will+0.023
 �+0.023
 mid+0.023
 farewell+0.022
Neg logit contributions
glers-0.046
bel-0.045
limits-0.041
kins-0.041
tub-0.040
Token hasn
Token ID5818
Feature activation+4.83e+00

Pos logit contributions
+0.003
 �+0.002
 Will+0.002
 mid+0.002
 dismay+0.002
Neg logit contributions
glers-0.004
bel-0.004
tub-0.004
kins-0.003
limits-0.003
Token't
Token ID470
Feature activation-1.75e-01

Pos logit contributions
+0.121
 amaz+0.116
Mine+0.115
 delight+0.115
 fingert+0.113
Neg logit contributions
glers-0.107
limits-0.106
bel-0.103
ained-0.098
tub-0.097
Token let
Token ID1309
Feature activation+1.88e+00

Pos logit contributions
glers+0.005
bel+0.005
tub+0.005
kins+0.004
limits+0.004
Neg logit contributions
-0.004
itz-0.003
 �-0.003
 fingert-0.003
 dismay-0.003
Token me
Token ID502
Feature activation+7.42e-01

Pos logit contributions
+0.041
 fingert+0.036
 amaz+0.036
 mid+0.036
Mine+0.036
Neg logit contributions
glers-0.050
limits-0.047
bel-0.046
kins-0.045
tub-0.044
Token down
Token ID866
Feature activation+4.33e-02

Pos logit contributions
+0.019
 �+0.014
 mid+0.014
 dismay+0.014
itz+0.014
Neg logit contributions
glers-0.022
bel-0.021
kins-0.019
tub-0.019
limits-0.019
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
+0.001
 mid+0.001
 Will+0.001
 �+0.001
 dismay+0.001
Neg logit contributions
glers-0.001
bel-0.001
limits-0.001
kins-0.001
tub-0.001
Token you
Token ID345
Feature activation+2.25e+00

Pos logit contributions
+0.014
 mid+0.011
 dismay+0.011
itz+0.010
 fingert+0.010
Neg logit contributions
glers-0.016
bel-0.015
kins-0.014
limits-0.014
tub-0.014
Token always
Token ID1464
Feature activation+9.53e-03

Pos logit contributions
+0.064
 mid+0.048
 �+0.048
 dismay+0.047
 Will+0.047
Neg logit contributions
glers-0.062
bel-0.059
tub-0.054
kins-0.053
limits-0.053
Token so
Token ID523
Feature activation-9.13e-01

Pos logit contributions
+0.000
 mid+0.000
 Will+0.000
 dismay+0.000
 �+0.000
Neg logit contributions
glers-0.000
bel-0.000
kins-0.000
limits-0.000
tub-0.000
Token lazy
Token ID16931
Feature activation-2.11e-01

Pos logit contributions
glers+0.025
bel+0.024
kins+0.022
limits+0.022
cards+0.021
Neg logit contributions
-0.026
 Will-0.019
 mid-0.019
 dismay-0.019
 delight-0.019
Token?
Token ID30
Feature activation+1.46e+00

Pos logit contributions
glers+0.005
bel+0.005
limits+0.004
kins+0.004
tub+0.004
Neg logit contributions
-0.007
 Will-0.006
 mid-0.006
 �-0.005
itz-0.005
Token Don
Token ID2094
Feature activation+4.82e+00

Pos logit contributions
+0.033
 farewell+0.027
 dismay+0.027
 mid+0.027
 fingert+0.027
Neg logit contributions
glers-0.042
bel-0.040
tub-0.038
limits-0.037
avers-0.036
Token't
Token ID470
Feature activation+1.50e-01

Pos logit contributions
+0.121
 dismay+0.097
 fingert+0.097
 delight+0.097
 mid+0.097
Neg logit contributions
glers-0.140
bel-0.131
tub-0.119
limits-0.119
ained-0.116
Token you
Token ID345
Feature activation+2.08e+00

Pos logit contributions
+0.004
 �+0.003
 farewell+0.003
 fingert+0.003
itz+0.003
Neg logit contributions
glers-0.005
bel-0.004
tub-0.004
limits-0.004
asis-0.004
Token want
Token ID765
Feature activation+1.31e+00

Pos logit contributions
+0.055
 mid+0.040
 �+0.040
 Will+0.040
 dismay+0.040
Neg logit contributions
glers-0.061
bel-0.059
tub-0.053
kins-0.053
limits-0.053
Token to
Token ID284
Feature activation-5.66e-01

Pos logit contributions
+0.039
 amaz+0.034
itz+0.033
 dismay+0.033
 fingertips+0.033
Neg logit contributions
glers-0.030
bel-0.026
ards-0.025
 thirsty-0.024
tub-0.024
Token have
Token ID423
Feature activation+1.76e-02

Pos logit contributions
glers+0.018
bel+0.017
tub+0.016
kins+0.016
limits+0.016
Neg logit contributions
-0.013
 �-0.009
 mid-0.009
 dismay-0.009
 Will-0.009
Token said
Token ID531
Feature activation-5.77e-02

Pos logit contributions
+0.051
 �+0.039
 Will+0.038
 mid+0.038
 dismay+0.038
Neg logit contributions
glers-0.046
bel-0.044
tub-0.040
kins-0.040
limits-0.039
Token "
Token ID366
Feature activation+3.11e-01

Pos logit contributions
glers+0.002
bel+0.002
kins+0.002
tub+0.002
limits+0.002
Neg logit contributions
-0.001
 mid-0.001
 dismay-0.001
 fingert-0.001
itz-0.001
Tokengo
Token ID2188
Feature activation+1.94e+00

Pos logit contributions
+0.008
 fingert+0.006
 dismay+0.006
 farewell+0.006
 �+0.006
Neg logit contributions
glers-0.009
bel-0.009
limits-0.008
kins-0.008
tub-0.008
Token away
Token ID1497
Feature activation+1.15e+00

Pos logit contributions
+0.048
 farewell+0.032
 Will+0.032
 mid+0.032
 �+0.032
Neg logit contributions
glers-0.063
bel-0.060
tub-0.056
kins-0.055
limits-0.055
Token!
Token ID0
Feature activation+1.58e+00

Pos logit contributions
+0.026
 �+0.017
 farewell+0.017
 mid+0.017
 dismay+0.017
Neg logit contributions
glers-0.039
bel-0.037
limits-0.035
tub-0.034
kins-0.034
Token Don
Token ID2094
Feature activation+4.81e+00

Pos logit contributions
+0.034
 fingert+0.027
 mid+0.027
 dismay+0.026
 fingertips+0.025
Neg logit contributions
glers-0.048
bel-0.047
limits-0.044
tub-0.043
kins-0.041
Token't
Token ID470
Feature activation+2.74e-01

Pos logit contributions
+0.119
 fingert+0.099
 mid+0.098
 dismay+0.097
 delight+0.097
Neg logit contributions
glers-0.136
bel-0.130
limits-0.119
ained-0.113
tub-0.113
Token disturb
Token ID17037
Feature activation+1.21e+00

Pos logit contributions
+0.006
 �+0.004
 fingert+0.004
 mid+0.004
 Pret+0.004
Neg logit contributions
glers-0.009
bel-0.009
limits-0.008
asis-0.008
tub-0.008
Token this
Token ID428
Feature activation+1.73e+00

Pos logit contributions
+0.028
 fingert+0.021
nown+0.021
 mid+0.021
 farewell+0.021
Neg logit contributions
glers-0.035
bel-0.033
limits-0.031
kins-0.031
ards-0.031
Token lovely
Token ID14081
Feature activation+9.66e-01

Pos logit contributions
+0.043
 mid+0.031
 �+0.031
 Will+0.031
 dismay+0.031
Neg logit contributions
glers-0.053
bel-0.051
tub-0.047
kins-0.047
limits-0.046
Token little
Token ID1310
Feature activation+9.50e-02

Pos logit contributions
+0.027
 mid+0.020
 �+0.020
 farewell+0.020
 dismay+0.020
Neg logit contributions
glers-0.027
bel-0.026
limits-0.024
kins-0.023
tub-0.023
Token make
Token ID787
Feature activation-1.01e+00

Pos logit contributions
+0.001
 mid+0.001
 Will+0.001
 �+0.001
 dismay+0.001
Neg logit contributions
glers-0.001
bel-0.001
tub-0.001
kins-0.001
limits-0.001
Token must
Token ID1276
Feature activation-1.04e+00

Pos logit contributions
glers+0.031
bel+0.030
kins+0.027
tub+0.027
limits+0.027
Neg logit contributions
-0.025
 mid-0.018
 �-0.018
 Will-0.018
 dismay-0.018
Token stay
Token ID2652
Feature activation-1.38e+00

Pos logit contributions
glers+0.035
bel+0.034
tub+0.031
kins+0.031
limits+0.030
Neg logit contributions
-0.024
 dismay-0.016
 �-0.016
 delight-0.016
 Will-0.015
Token outside
Token ID2354
Feature activation+9.92e-01

Pos logit contributions
glers+0.048
bel+0.043
asis+0.039
tub+0.039
limits+0.039
Neg logit contributions
-0.035
 dismay-0.022
 mid-0.022
 delight-0.022
 �-0.021
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
+0.023
 mid+0.015
 Will+0.015
 �+0.014
 three+0.014
Neg logit contributions
glers-0.035
bel-0.033
limits-0.031
kins-0.030
asis-0.030
Token Don
Token ID2094
Feature activation+4.80e+00

Pos logit contributions
+0.030
 mid+0.025
 farewell+0.025
 fingert+0.025
 dismay+0.024
Neg logit contributions
glers-0.035
bel-0.033
tub-0.033
limits-0.032
ained-0.032
Token't
Token ID470
Feature activation+1.11e-01

Pos logit contributions
+0.116
 fingert+0.092
 mid+0.091
 dismay+0.091
 delight+0.090
Neg logit contributions
glers-0.142
bel-0.135
limits-0.123
tub-0.123
kins-0.122
Token bring
Token ID2222
Feature activation-5.95e-02

Pos logit contributions
+0.003
 farewell+0.002
 �+0.002
 mid+0.002
 fingert+0.002
Neg logit contributions
glers-0.003
bel-0.003
limits-0.003
tub-0.003
kins-0.003
Token in
Token ID287
Feature activation+1.73e+00

Pos logit contributions
glers+0.001
bel+0.001
kins+0.001
limits+0.001
tub+0.001
Neg logit contributions
-0.002
itz-0.001
 mid-0.001
AY-0.001
 �-0.001
Token any
Token ID597
Feature activation-2.26e-01

Pos logit contributions
+0.042
 mid+0.030
 �+0.030
 Will+0.030
 dismay+0.030
Neg logit contributions
glers-0.054
bel-0.052
kins-0.048
tub-0.048
limits-0.047
Token spoiled
Token ID34781
Feature activation-4.10e-01

Pos logit contributions
glers+0.006
bel+0.006
kins+0.006
tub+0.005
limits+0.005
Neg logit contributions
-0.006
itz-0.005
 Will-0.005
 �-0.005
 mid-0.005
Token.
Token ID13
Feature activation+5.80e-01

Pos logit contributions
+0.015
 �+0.010
 mid+0.010
 Will+0.010
 farewell+0.010
Neg logit contributions
glers-0.022
bel-0.021
tub-0.019
limits-0.019
kins-0.019
Token You
Token ID921
Feature activation+3.12e+00

Pos logit contributions
+0.013
 dismay+0.011
 mid+0.011
 fingert+0.011
 farewell+0.010
Neg logit contributions
glers-0.016
bel-0.016
tub-0.015
limits-0.015
avers-0.014
Token can
Token ID460
Feature activation+1.70e+00

Pos logit contributions
+0.079
 Will+0.055
 mid+0.052
 dismay+0.052
 �+0.051
Neg logit contributions
glers-0.100
bel-0.096
tub-0.088
kins-0.088
limits-0.088
Token call
Token ID869
Feature activation+1.68e+00

Pos logit contributions
+0.041
 mid+0.032
 �+0.032
 dismay+0.031
 farewell+0.031
Neg logit contributions
glers-0.051
bel-0.048
tub-0.044
limits-0.043
kins-0.043
Token me
Token ID502
Feature activation+2.94e+00

Pos logit contributions
+0.043
itz+0.036
Mine+0.035
 dismay+0.034
 fingert+0.034
Neg logit contributions
bel-0.040
glers-0.039
kins-0.035
cards-0.034
tub-0.034
Token Crystal
Token ID12969
Feature activation+4.79e+00

Pos logit contributions
+0.075
 mid+0.054
 dismay+0.054
 �+0.054
itz+0.053
Neg logit contributions
glers-0.088
bel-0.085
kins-0.077
tub-0.077
limits-0.076
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
+0.117
 dismay+0.091
 fingert+0.090
itz+0.089
 mid+0.089
Neg logit contributions
glers-0.141
bel-0.136
kins-0.126
tub-0.125
limits-0.119
Tokenâ
Token ID22940
Feature activation-7.03e-01

Pos logit contributions
+0.015
 dismay+0.013
 fingert+0.013
 mid+0.013
 fingertips+0.013
Neg logit contributions
glers-0.018
bel-0.017
tub-0.016
limits-0.015
avers-0.015
Token
Token ID26391
Feature activation+2.27e+00

Pos logit contributions
glers+0.020
bel+0.020
tub+0.018
kins+0.018
limits+0.018
Neg logit contributions
-0.019
 �-0.014
 mid-0.014
 Will-0.014
 farewell-0.014
Token
Token ID198
Feature activation+7.31e-01

Pos logit contributions
+0.054
 �+0.035
 Will+0.035
 Pret+0.034
 mid+0.034
Neg logit contributions
glers-0.079
bel-0.074
limits-0.068
tub-0.068
kins-0.067
Token
Token ID198
Feature activation+7.11e-01

Pos logit contributions
+0.016
 mid+0.012
 dismay+0.012
 fingert+0.011
 farewell+0.011
Neg logit contributions
glers-0.024
bel-0.023
tub-0.021
kins-0.021
limits-0.021
Token him
Token ID683
Feature activation+2.67e+00

Pos logit contributions
+0.056
 mid+0.040
 �+0.040
 Will+0.039
 dismay+0.039
Neg logit contributions
glers-0.072
bel-0.070
kins-0.064
tub-0.064
limits-0.063
Token,
Token ID11
Feature activation+5.04e-01

Pos logit contributions
+0.065
 �+0.045
 Will+0.045
 mid+0.045
 dismay+0.045
Neg logit contributions
glers-0.085
bel-0.082
tub-0.075
kins-0.075
limits-0.074
Token John
Token ID1757
Feature activation+2.11e+00

Pos logit contributions
+0.012
 mid+0.008
 �+0.008
 Will+0.008
 dismay+0.008
Neg logit contributions
glers-0.017
bel-0.016
kins-0.015
tub-0.015
limits-0.015
Token.
Token ID13
Feature activation+9.73e-01

Pos logit contributions
+0.050
 Will+0.029
 �+0.027
nown+0.025
:+0.025
Neg logit contributions
bel-0.077
glers-0.076
tub-0.071
angers-0.071
ained-0.071
Token He
Token ID679
Feature activation+5.97e-01

Pos logit contributions
+0.023
 mid+0.018
 fingert+0.017
 dismay+0.017
 farewell+0.017
Neg logit contributions
glers-0.029
bel-0.028
tub-0.026
limits-0.026
kins-0.025
Token doesn
Token ID1595
Feature activation+4.78e+00

Pos logit contributions
+0.014
 Will+0.009
 mid+0.009
 �+0.009
 dismay+0.009
Neg logit contributions
glers-0.020
bel-0.019
tub-0.017
kins-0.017
limits-0.017
Token't
Token ID470
Feature activation-7.58e-01

Pos logit contributions
+0.124
 farewell+0.117
 fingertips+0.116
 delight+0.116
 amaz+0.115
Neg logit contributions
glers-0.115
bel-0.100
tub-0.098
limits-0.094
ained-0.092
Token know
Token ID760
Feature activation-6.75e-02

Pos logit contributions
glers+0.023
bel+0.023
kins+0.021
tub+0.021
limits+0.020
Neg logit contributions
-0.020
 Will-0.014
 mid-0.014
 �-0.014
 dismay-0.014
Token any
Token ID597
Feature activation-8.69e-01

Pos logit contributions
glers+0.002
bel+0.002
kins+0.002
limits+0.002
tub+0.002
Neg logit contributions
-0.002
 mid-0.001
 dismay-0.001
itz-0.001
 fingert-0.001
Token better
Token ID1365
Feature activation-1.41e+00

Pos logit contributions
glers+0.024
bel+0.022
kins+0.020
tub+0.020
limits+0.020
Neg logit contributions
-0.022
itz-0.018
 Will-0.017
 fingert-0.017
 mid-0.017
Token,"
Token ID553
Feature activation-2.00e+00

Pos logit contributions
glers+0.042
bel+0.039
tub+0.036
limits+0.035
kins+0.035
Neg logit contributions
-0.039
 �-0.028
 Will-0.028
 dismay-0.027
 farewell-0.027
Token a
Token ID257
Feature activation-1.12e+00

Pos logit contributions
glers+0.041
bel+0.040
limits+0.037
tub+0.036
kins+0.036
Neg logit contributions
-0.036
itz-0.028
 dismay-0.028
 mid-0.028
 �-0.027
Token lot
Token ID1256
Feature activation-9.94e-01

Pos logit contributions
bel+0.028
glers+0.027
tub+0.027
asis+0.025
kins+0.025
Neg logit contributions
-0.031
 mid-0.024
itz-0.024
AY-0.023
 Will-0.023
Token of
Token ID286
Feature activation-2.79e+00

Pos logit contributions
glers+0.026
bel+0.024
limits+0.022
tub+0.022
kins+0.022
Neg logit contributions
-0.027
 dismay-0.023
itz-0.022
 mid-0.022
 Will-0.022
Token value
Token ID1988
Feature activation-9.28e-01

Pos logit contributions
glers+0.076
bel+0.074
kins+0.067
limits+0.067
tub+0.067
Neg logit contributions
-0.076
 mid-0.058
 dismay-0.057
itz-0.057
 �-0.057
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
glers+0.029
bel+0.029
tub+0.026
limits+0.026
kins+0.026
Neg logit contributions
-0.022
 �-0.016
 mid-0.015
 Will-0.015
 dismay-0.015
Token doesn
Token ID1595
Feature activation+4.77e+00

Pos logit contributions
+0.001
 �+0.001
 Will+0.001
 dismay+0.001
 mid+0.001
Neg logit contributions
glers-0.001
bel-0.001
asis-0.001
tub-0.001
kins-0.001
Token't
Token ID470
Feature activation-8.58e-01

Pos logit contributions
+0.124
 farewell+0.115
 dismay+0.114
 delight+0.113
 amaz+0.113
Neg logit contributions
glers-0.112
bel-0.103
tub-0.099
kins-0.097
limits-0.097
Token mean
Token ID1612
Feature activation-1.08e+00

Pos logit contributions
glers+0.027
bel+0.026
kins+0.024
tub+0.024
limits+0.024
Neg logit contributions
-0.020
 mid-0.014
 �-0.014
 Will-0.014
 dismay-0.014
Token it
Token ID340
Feature activation+4.34e-01

Pos logit contributions
glers+0.035
bel+0.034
tub+0.031
kins+0.031
asis+0.030
Neg logit contributions
-0.027
 �-0.018
 Will-0.018
 farewell-0.018
 dismay-0.018
Token's
Token ID338
Feature activation-4.20e-01

Pos logit contributions
+0.011
 �+0.008
 Will+0.008
 mid+0.008
 dismay+0.008
Neg logit contributions
glers-0.013
bel-0.013
kins-0.012
tub-0.012
limits-0.012
Token not
Token ID407
Feature activation-7.41e-01

Pos logit contributions
glers+0.012
bel+0.012
kins+0.011
tub+0.011
limits+0.011
Neg logit contributions
-0.011
 mid-0.008
 �-0.008
 Will-0.008
 dismay-0.008
Token she
Token ID673
Feature activation+7.12e-01

Pos logit contributions
glers+0.026
bel+0.026
kins+0.023
limits+0.023
tub+0.023
Neg logit contributions
-0.025
 mid-0.019
itz-0.019
 dismay-0.019
 fingert-0.019
Token said
Token ID531
Feature activation-8.98e-01

Pos logit contributions
+0.022
 mid+0.018
 dismay+0.018
 fingert+0.018
AY+0.018
Neg logit contributions
glers-0.016
bel-0.016
kins-0.014
tub-0.014
limits-0.013
Token softly
Token ID26625
Feature activation+2.49e+00

Pos logit contributions
bel+0.026
glers+0.026
kins+0.024
limits+0.023
tub+0.023
Neg logit contributions
-0.023
Mine-0.017
 mid-0.017
itz-0.017
 dismay-0.017
Token.
Token ID13
Feature activation+1.98e+00

Pos logit contributions
+0.060
 mid+0.043
 dismay+0.042
 fingert+0.042
 �+0.042
Neg logit contributions
glers-0.078
bel-0.075
kins-0.069
tub-0.069
limits-0.068
Token "
Token ID366
Feature activation+1.88e+00

Pos logit contributions
+0.049
 fingert+0.041
itz+0.040
 dismay+0.040
Mine+0.040
Neg logit contributions
bel-0.052
glers-0.049
limits-0.047
tub-0.046
kins-0.044
TokenPlease
Token ID5492
Feature activation+4.76e+00

Pos logit contributions
+0.051
 dismay+0.042
 fingert+0.040
 tr+0.040
 �+0.040
Neg logit contributions
bel-0.051
glers-0.046
limits-0.043
tub-0.042
kins-0.041
Token take
Token ID1011
Feature activation+7.17e-01

Pos logit contributions
+0.118
 mid+0.089
 dismay+0.088
Mine+0.088
 �+0.087
Neg logit contributions
glers-0.138
bel-0.136
kins-0.124
tub-0.123
limits-0.120
Token care
Token ID1337
Feature activation+1.09e+00

Pos logit contributions
+0.018
 �+0.015
itz+0.015
Mine+0.015
 fingertips+0.015
Neg logit contributions
bel-0.017
limits-0.016
glers-0.016
kins-0.015
akers-0.015
Token of
Token ID286
Feature activation+7.69e-01

Pos logit contributions
+0.027
 mid+0.019
 �+0.019
 Will+0.019
 dismay+0.019
Neg logit contributions
glers-0.034
bel-0.033
kins-0.030
tub-0.030
limits-0.029
Token me
Token ID502
Feature activation+1.49e+00

Pos logit contributions
+0.019
AY+0.015
 fingert+0.015
 �+0.014
itz+0.014
Neg logit contributions
glers-0.021
bel-0.021
kins-0.020
limits-0.020
tub-0.019
Token and
Token ID290
Feature activation+1.78e+00

Pos logit contributions
+0.038
 Will+0.027
 mid+0.027
 �+0.027
 farewell+0.027
Neg logit contributions
glers-0.046
bel-0.044
tub-0.040
kins-0.040
limits-0.040
Token looks
Token ID3073
Feature activation+1.39e-01

Pos logit contributions
+0.003
 Will+0.002
 mid+0.002
 �+0.002
 tr+0.001
Neg logit contributions
bel-0.004
glers-0.004
limits-0.004
tub-0.004
ained-0.004
Token very
Token ID845
Feature activation-4.38e-01

Pos logit contributions
+0.003
 �+0.002
 Will+0.002
 mid+0.002
 dismay+0.002
Neg logit contributions
glers-0.004
bel-0.004
kins-0.004
limits-0.004
tub-0.004
Token far
Token ID1290
Feature activation-1.46e+00

Pos logit contributions
glers+0.014
bel+0.013
limits+0.012
tub+0.012
kins+0.012
Neg logit contributions
-0.011
 �-0.008
 mid-0.008
 Will-0.008
 dismay-0.007
Token away
Token ID1497
Feature activation+9.76e-01

Pos logit contributions
glers+0.050
bel+0.049
tub+0.044
kins+0.044
akers+0.043
Neg logit contributions
-0.034
 �-0.023
 mid-0.022
 farewell-0.022
 Will-0.021
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
+0.024
 �+0.015
 mid+0.015
 Will+0.015
 eight+0.015
Neg logit contributions
glers-0.032
bel-0.031
limits-0.029
kins-0.028
tub-0.028
Token Don
Token ID2094
Feature activation+4.76e+00

Pos logit contributions
+0.031
 mid+0.022
 dismay+0.022
 fingert+0.022
itz+0.022
Neg logit contributions
glers-0.041
bel-0.040
tub-0.037
limits-0.037
kins-0.036
Token't
Token ID470
Feature activation+3.57e-02

Pos logit contributions
+0.115
 mid+0.085
 dismay+0.085
 fingert+0.085
 �+0.084
Neg logit contributions
glers-0.146
bel-0.140
tub-0.128
limits-0.128
kins-0.127
Token forget
Token ID6044
Feature activation+2.21e+00

Pos logit contributions
+0.001
 �+0.001
 farewell+0.001
 mid+0.001
 fingert+0.001
Neg logit contributions
glers-0.001
bel-0.001
limits-0.001
tub-0.001
kins-0.001
Token,
Token ID11
Feature activation+8.74e-01

Pos logit contributions
+0.056
Mine+0.043
 dismay+0.042
 fingert+0.042
itz+0.042
Neg logit contributions
glers-0.062
bel-0.060
limits-0.056
tub-0.055
kins-0.054
Token you
Token ID345
Feature activation+2.53e+00

Pos logit contributions
+0.021
 mid+0.015
 �+0.015
 dismay+0.015
 Will+0.015
Neg logit contributions
glers-0.027
bel-0.026
tub-0.024
kins-0.024
limits-0.024
Token can
Token ID460
Feature activation+1.52e+00

Pos logit contributions
+0.063
 Will+0.043
 farewell+0.041
 dismay+0.041
 mid+0.040
Neg logit contributions
glers-0.083
bel-0.081
tub-0.073
kins-0.073
limits-0.072
Token sweet
Token ID6029
Feature activation+8.62e-01

Pos logit contributions
+0.012
 mid+0.009
 farewell+0.009
AY+0.009
itz+0.009
Neg logit contributions
glers-0.018
bel-0.017
kins-0.016
tub-0.016
limits-0.016
Tokenheart
Token ID11499
Feature activation+6.90e-01

Pos logit contributions
+0.027
itz+0.020
 Will+0.020
 �+0.020
 mid+0.020
Neg logit contributions
bel-0.022
glers-0.021
limits-0.020
tub-0.019
kins-0.019
Token,
Token ID11
Feature activation+8.43e-01

Pos logit contributions
+0.018
 �+0.012
nown+0.011
 Will+0.011
 ear+0.011
Neg logit contributions
glers-0.023
bel-0.022
tub-0.020
ained-0.020
limits-0.019
Token Mom
Token ID11254
Feature activation-8.82e-01

Pos logit contributions
+0.021
 mid+0.015
 dismay+0.015
 farewell+0.015
itz+0.015
Neg logit contributions
glers-0.025
bel-0.024
kins-0.022
tub-0.022
limits-0.022
Tokenmy
Token ID1820
Feature activation+3.93e-01

Pos logit contributions
bel+0.029
glers+0.029
tub+0.028
kins+0.027
ained+0.027
Neg logit contributions
-0.023
 Will-0.013
 �-0.013
 t-0.013
 half-0.012
Token doesn
Token ID1595
Feature activation+4.76e+00

Pos logit contributions
+0.010
 �+0.006
 Will+0.006
nown+0.005
 wo+0.005
Neg logit contributions
glers-0.014
bel-0.013
ained-0.013
tub-0.012
angers-0.012
Token't
Token ID470
Feature activation-1.16e+00

Pos logit contributions
+0.127
 farewell+0.115
 persever+0.114
 dismay+0.113
 fingert+0.111
Neg logit contributions
glers-0.116
tub-0.101
bel-0.100
kins-0.097
limits-0.095
Token have
Token ID423
Feature activation-6.57e-01

Pos logit contributions
glers+0.038
bel+0.037
kins+0.034
tub+0.034
limits+0.033
Neg logit contributions
-0.030
 Will-0.020
 mid-0.020
 �-0.019
 dismay-0.019
Token a
Token ID257
Feature activation-6.45e-01

Pos logit contributions
bel+0.016
glers+0.016
limits+0.015
tub+0.014
akers+0.014
Neg logit contributions
-0.018
 �-0.014
itz-0.014
Mine-0.014
 Pret-0.014
Token torch
Token ID28034
Feature activation-8.78e-02

Pos logit contributions
glers+0.018
bel+0.018
tub+0.017
asis+0.016
kins+0.016
Neg logit contributions
-0.017
 mid-0.013
itz-0.013
Mine-0.012
AY-0.012
Token",
Token ID1600
Feature activation-2.60e+00

Pos logit contributions
glers+0.003
bel+0.002
kins+0.002
tub+0.002
limits+0.002
Neg logit contributions
-0.002
 �-0.002
 Will-0.002
 mid-0.002
 three-0.002
Token it
Token ID340
Feature activation+7.67e-01

Pos logit contributions
glers+0.013
bel+0.013
kins+0.012
tub+0.012
limits+0.011
Neg logit contributions
-0.010
 mid-0.007
 dismay-0.007
 fingert-0.007
itz-0.007
Token does
Token ID857
Feature activation+8.74e-02

Pos logit contributions
+0.016
 mid+0.011
 �+0.011
 dismay+0.010
 Will+0.010
Neg logit contributions
glers-0.027
bel-0.026
kins-0.024
tub-0.024
limits-0.024
Token look
Token ID804
Feature activation+2.49e-01

Pos logit contributions
+0.002
 mid+0.002
 �+0.001
 Will+0.001
 dismay+0.001
Neg logit contributions
glers-0.003
bel-0.003
kins-0.002
tub-0.002
limits-0.002
Token weird
Token ID7650
Feature activation+3.38e-01

Pos logit contributions
+0.006
 mid+0.005
 dismay+0.005
itz+0.005
 Will+0.005
Neg logit contributions
glers-0.007
bel-0.007
tub-0.007
kins-0.007
limits-0.006
Token,
Token ID11
Feature activation-4.58e-01

Pos logit contributions
+0.009
 mid+0.006
 �+0.006
 dismay+0.006
 Will+0.006
Neg logit contributions
glers-0.010
bel-0.010
tub-0.009
kins-0.009
limits-0.009
Token doesn
Token ID1595
Feature activation+4.75e+00

Pos logit contributions
glers+0.013
bel+0.013
tub+0.012
kins+0.012
limits+0.012
Neg logit contributions
-0.012
 mid-0.009
 dismay-0.009
 Will-0.009
 �-0.009
Token't
Token ID470
Feature activation-5.23e-01

Pos logit contributions
+0.125
 farewell+0.124
 dismay+0.122
 persever+0.121
Mine+0.120
Neg logit contributions
glers-0.102
tub-0.092
bel-0.089
kins-0.085
ained-0.084
Token it
Token ID340
Feature activation+8.84e-01

Pos logit contributions
glers+0.015
bel+0.015
tub+0.014
limits+0.013
kins+0.013
Neg logit contributions
-0.013
 farewell-0.010
 �-0.010
 mid-0.010
 dismay-0.010
Token?"
Token ID1701
Feature activation-4.84e-01

Pos logit contributions
+0.024
 mid+0.018
 �+0.018
 dismay+0.018
 Will+0.017
Neg logit contributions
glers-0.025
bel-0.024
tub-0.022
kins-0.022
limits-0.022
Token They
Token ID1119
Feature activation-1.34e+00

Pos logit contributions
glers+0.017
bel+0.016
kins+0.015
tub+0.015
limits+0.015
Neg logit contributions
-0.011
 �-0.007
 Will-0.007
 mid-0.007
 dismay-0.007
Token watched
Token ID7342
Feature activation+1.63e+00

Pos logit contributions
glers+0.035
bel+0.034
kins+0.030
tub+0.028
avers+0.027
Neg logit contributions
-0.037
 Pret-0.030
itz-0.030
AY-0.030
 fingert-0.029
Token dolls
Token ID36062
Feature activation-2.21e-01

Pos logit contributions
+0.012
 mid+0.010
 �+0.010
 dismay+0.010
itz+0.010
Neg logit contributions
glers-0.009
bel-0.009
tub-0.008
kins-0.007
limits-0.007
Token around
Token ID1088
Feature activation+9.50e-01

Pos logit contributions
glers+0.007
bel+0.007
kins+0.006
limits+0.006
tub+0.006
Neg logit contributions
-0.005
 �-0.004
 mid-0.004
 Will-0.004
 dismay-0.004
Token in
Token ID287
Feature activation+8.01e-01

Pos logit contributions
+0.025
 farewell+0.019
itz+0.018
AY+0.018
 fingert+0.018
Neg logit contributions
glers-0.027
bel-0.026
kins-0.024
tub-0.024
limits-0.023
Token a
Token ID257
Feature activation-2.45e-01

Pos logit contributions
+0.020
 farewell+0.017
nown+0.017
itz+0.017
 wo+0.017
Neg logit contributions
glers-0.021
bel-0.020
kins-0.019
tub-0.018
limits-0.018
Token wheel
Token ID7825
Feature activation-2.42e+00

Pos logit contributions
glers+0.007
bel+0.007
tub+0.006
kins+0.006
limits+0.006
Neg logit contributions
-0.006
 Will-0.005
 mid-0.005
itz-0.005
 �-0.005
Tokenbar
Token ID5657
Feature activation-6.26e+00

Pos logit contributions
glers+0.089
bel+0.087
kins+0.086
limits+0.082
tub+0.080
Neg logit contributions
-0.047
 �-0.027
 tr-0.027
 mid-0.026
 three-0.025
Tokenrow
Token ID808
Feature activation-1.64e+00

Pos logit contributions
glers+0.138
bel+0.123
limits+0.123
 thirsty+0.109
tub+0.106
Neg logit contributions
-0.192
 arm-0.167
 tr-0.161
 fingert-0.161
 ear-0.158
Token she
Token ID673
Feature activation-1.41e+00

Pos logit contributions
glers+0.064
bel+0.061
limits+0.059
kins+0.058
asis+0.058
Neg logit contributions
-0.027
 �-0.016
 three-0.015
 tr-0.014
 arm-0.014
Token had
Token ID550
Feature activation-1.17e+00

Pos logit contributions
glers+0.041
bel+0.038
kins+0.035
tub+0.035
limits+0.035
Neg logit contributions
-0.039
 Will-0.029
 mid-0.028
 �-0.028
 fingert-0.028
Token found
Token ID1043
Feature activation-1.72e+00

Pos logit contributions
glers+0.034
bel+0.034
tub+0.030
kins+0.030
limits+0.030
Neg logit contributions
-0.031
 mid-0.023
 �-0.022
 dismay-0.022
 Will-0.022
Token in
Token ID287
Feature activation+7.71e-01

Pos logit contributions
glers+0.051
bel+0.049
tub+0.044
limits+0.044
kins+0.044
Neg logit contributions
-0.045
 �-0.033
 mid-0.033
 dismay-0.032
 Will-0.032
Token can
Token ID460
Feature activation+5.52e-03

Pos logit contributions
+0.028
 mid+0.021
 �+0.021
 dismay+0.021
 Will+0.021
Neg logit contributions
glers-0.030
bel-0.029
tub-0.026
kins-0.026
limits-0.026
Token spell
Token ID4822
Feature activation+1.14e+00

Pos logit contributions
+0.000
 dismay+0.000
 farewell+0.000
 mid+0.000
 �+0.000
Neg logit contributions
glers-0.000
bel-0.000
kins-0.000
tub-0.000
limits-0.000
Token better
Token ID1365
Feature activation-1.06e+00

Pos logit contributions
+0.027
 dismay+0.021
 mid+0.021
itz+0.021
 fingert+0.020
Neg logit contributions
glers-0.034
bel-0.033
kins-0.030
tub-0.030
limits-0.029
Token than
Token ID621
Feature activation-1.14e+00

Pos logit contributions
bel+0.029
glers+0.029
kins+0.028
tub+0.026
akers+0.026
Neg logit contributions
-0.027
 dismay-0.021
itz-0.021
 mid-0.021
 fingert-0.021
Token you
Token ID345
Feature activation+1.49e+00

Pos logit contributions
glers+0.034
bel+0.032
tub+0.029
kins+0.029
limits+0.029
Neg logit contributions
-0.029
itz-0.021
 dismay-0.021
 fingert-0.021
 farewell-0.021
Token"
Token ID1
Feature activation-6.25e+00

Pos logit contributions
+0.038
itz+0.031
 dismay+0.030
 fingert+0.030
 mid+0.029
Neg logit contributions
glers-0.040
bel-0.038
kins-0.037
tub-0.036
akers-0.035
Token or
Token ID393
Feature activation-8.56e-02

Pos logit contributions
glers+0.197
bel+0.187
kins+0.173
tub+0.172
limits+0.172
Neg logit contributions
-0.154
 Will-0.108
 �-0.108
 mid-0.107
 dismay-0.104
Token "
Token ID366
Feature activation-1.27e+00

Pos logit contributions
glers+0.003
bel+0.003
tub+0.002
kins+0.002
limits+0.002
Neg logit contributions
-0.002
 �-0.002
 mid-0.002
 Will-0.002
 dismay-0.002
TokenYou
Token ID1639
Feature activation+3.39e+00

Pos logit contributions
bel+0.034
glers+0.034
limits+0.030
kins+0.030
tub+0.030
Neg logit contributions
-0.035
 dismay-0.028
 �-0.027
 fingert-0.027
 farewell-0.027
Token spelled
Token ID32213
Feature activation+3.30e-01

Pos logit contributions
+0.082
 mid+0.058
 �+0.058
 dismay+0.058
 Will+0.058
Neg logit contributions
glers-0.106
bel-0.102
kins-0.093
tub-0.093
limits-0.092
Token that
Token ID326
Feature activation-8.85e-01

Pos logit contributions
+0.007
 mid+0.006
 dismay+0.006
 fingert+0.006
itz+0.005
Neg logit contributions
glers-0.010
bel-0.010
kins-0.009
tub-0.009
limits-0.009
Token and
Token ID290
Feature activation+6.99e-01

Pos logit contributions
glers+0.191
bel+0.182
kins+0.170
limits+0.169
tub+0.168
Neg logit contributions
-0.130
 �-0.088
 Will-0.088
 mid-0.087
 dismay-0.084
Token "
Token ID366
Feature activation-9.78e-01

Pos logit contributions
+0.016
 dismay+0.013
itz+0.012
 mid+0.012
AY+0.012
Neg logit contributions
glers-0.021
bel-0.021
tub-0.018
kins-0.018
limits-0.018
TokenI
Token ID40
Feature activation+1.50e+00

Pos logit contributions
bel+0.027
glers+0.026
tub+0.023
limits+0.022
ained+0.022
Neg logit contributions
-0.026
 dismay-0.022
 �-0.021
 fingert-0.021
itz-0.021
Token love
Token ID1842
Feature activation+6.82e-01

Pos logit contributions
+0.041
 mid+0.030
 �+0.030
 dismay+0.030
 Will+0.030
Neg logit contributions
glers-0.043
bel-0.041
tub-0.037
kins-0.037
limits-0.037
Token you
Token ID345
Feature activation+1.22e+00

Pos logit contributions
+0.019
 dismay+0.016
itz+0.015
nown+0.015
 fingert+0.015
Neg logit contributions
glers-0.018
bel-0.016
ards-0.014
kins-0.014
ics-0.014
Token"
Token ID1
Feature activation-6.24e+00

Pos logit contributions
+0.030
 dismay+0.026
itz+0.025
 mid+0.024
 fingert+0.024
Neg logit contributions
glers-0.033
bel-0.030
kins-0.030
tub-0.029
akers-0.029
Token and
Token ID290
Feature activation+9.40e-01

Pos logit contributions
glers+0.204
bel+0.192
kins+0.181
tub+0.179
limits+0.178
Neg logit contributions
-0.148
 Will-0.103
 �-0.101
 mid-0.100
itz-0.096
Token "
Token ID366
Feature activation-1.12e+00

Pos logit contributions
+0.023
itz+0.018
 dismay+0.018
 mid+0.018
 fingert+0.018
Neg logit contributions
glers-0.027
bel-0.026
tub-0.024
kins-0.024
limits-0.023
Tokencr
Token ID6098
Feature activation+1.82e+00

Pos logit contributions
bel+0.029
glers+0.028
ained+0.024
tub+0.024
limits+0.024
Neg logit contributions
-0.032
 dismay-0.026
 �-0.025
 fingert-0.025
itz-0.025
Tokenacker
Token ID10735
Feature activation+2.37e+00

Pos logit contributions
+0.054
 dismay+0.043
 �+0.042
 mid+0.042
 farewell+0.042
Neg logit contributions
glers-0.048
bel-0.043
tub-0.040
akers-0.039
kins-0.039
Token".
Token ID1911
Feature activation-8.82e-01

Pos logit contributions
+0.057
 dismay+0.042
 mid+0.042
 �+0.041
 Will+0.041
Neg logit contributions
glers-0.073
bel-0.071
kins-0.065
tub-0.065
limits-0.063
Token wrote
Token ID2630
Feature activation-2.05e-01

Pos logit contributions
+0.011
 Will+0.008
 �+0.008
 mid+0.008
 delight+0.008
Neg logit contributions
glers-0.014
bel-0.013
limits-0.013
asis-0.012
tub-0.012
Token "
Token ID366
Feature activation-1.29e+00

Pos logit contributions
glers+0.006
bel+0.006
kins+0.006
tub+0.005
limits+0.005
Neg logit contributions
-0.005
 mid-0.004
 dismay-0.004
itz-0.004
 fingert-0.004
TokenWe
Token ID1135
Feature activation+7.88e-01

Pos logit contributions
bel+0.037
glers+0.036
limits+0.031
tub+0.031
kins+0.031
Neg logit contributions
-0.033
 dismay-0.027
 delight-0.026
 �-0.026
 farewell-0.026
Token love
Token ID1842
Feature activation-8.06e-02

Pos logit contributions
+0.020
 mid+0.014
 �+0.014
 Will+0.014
 dismay+0.014
Neg logit contributions
glers-0.024
bel-0.023
kins-0.021
tub-0.021
limits-0.021
Token you
Token ID345
Feature activation+8.37e-01

Pos logit contributions
glers+0.002
bel+0.002
kins+0.002
akers+0.002
cards+0.002
Neg logit contributions
-0.002
 dismay-0.002
itz-0.002
 fingert-0.002
 farewell-0.002
Token"
Token ID1
Feature activation-6.24e+00

Pos logit contributions
+0.021
 dismay+0.017
itz+0.017
 fingert+0.016
 mid+0.016
Neg logit contributions
glers-0.023
bel-0.022
kins-0.022
akers-0.021
tub-0.021
Token with
Token ID351
Feature activation+7.88e-01

Pos logit contributions
glers+0.197
bel+0.186
kins+0.172
limits+0.172
tub+0.171
Neg logit contributions
-0.154
 Will-0.110
 �-0.107
 mid-0.107
 dismay-0.103
Token their
Token ID511
Feature activation-5.62e-01

Pos logit contributions
+0.021
AY+0.016
itz+0.016
nown+0.016
 fingert+0.015
Neg logit contributions
glers-0.023
bel-0.021
kins-0.020
tub-0.019
limits-0.018
Token cr
Token ID1067
Feature activation+9.19e-01

Pos logit contributions
glers+0.016
bel+0.016
kins+0.015
tub+0.014
akers+0.014
Neg logit contributions
-0.015
itz-0.011
nown-0.011
AY-0.011
 mid-0.011
Tokenay
Token ID323
Feature activation-1.11e+00

Pos logit contributions
+0.020
 dismay+0.016
 persever+0.014
 hes+0.014
 �+0.014
Neg logit contributions
glers-0.031
bel-0.030
tub-0.028
kins-0.028
akers-0.028
Tokenons
Token ID684
Feature activation-1.09e+00

Pos logit contributions
glers+0.013
kins+0.012
bel+0.012
akers+0.010
avers+0.009
Neg logit contributions
-0.047
 dismay-0.043
 �-0.041
itz-0.041
 mid-0.040
Token He
Token ID679
Feature activation-1.65e+00

Pos logit contributions
glers+0.047
bel+0.045
tub+0.042
limits+0.041
kins+0.041
Neg logit contributions
-0.035
 mid-0.025
 dismay-0.025
 fingert-0.024
itz-0.024
Token said
Token ID531
Feature activation-1.57e+00

Pos logit contributions
glers+0.040
bel+0.037
kins+0.034
tub+0.033
avers+0.031
Neg logit contributions
-0.048
 dismay-0.040
 fingertips-0.040
 mid-0.040
 farewell-0.039
Token "
Token ID366
Feature activation-6.42e-01

Pos logit contributions
glers+0.043
bel+0.043
kins+0.038
tub+0.038
limits+0.037
Neg logit contributions
-0.042
itz-0.033
 dismay-0.033
 mid-0.033
 fingert-0.032
Tokenthank
Token ID40716
Feature activation-2.86e+00

Pos logit contributions
glers+0.016
bel+0.015
kins+0.014
tub+0.014
limits+0.014
Neg logit contributions
-0.020
 mid-0.015
 Will-0.015
 �-0.015
 dismay-0.015
Token you
Token ID345
Feature activation+8.81e-01

Pos logit contributions
glers+0.077
bel+0.075
limits+0.066
kins+0.065
akers+0.063
Neg logit contributions
-0.076
 dismay-0.063
 fingert-0.061
 mid-0.060
itz-0.060
Token"
Token ID1
Feature activation-6.05e+00

Pos logit contributions
+0.021
 dismay+0.016
 mid+0.016
itz+0.015
 fingert+0.015
Neg logit contributions
glers-0.027
bel-0.026
tub-0.024
kins-0.024
limits-0.024
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
glers+0.213
bel+0.200
kins+0.193
limits+0.191
asis+0.189
Neg logit contributions
-0.129
 Will-0.085
 �-0.083
 mid-0.081
AY-0.078
Token Tom
Token ID4186
Feature activation+4.39e-01

Pos logit contributions
glers+0.028
bel+0.026
kins+0.024
tub+0.024
limits+0.023
Neg logit contributions
-0.029
 mid-0.021
 farewell-0.021
 �-0.021
 fingert-0.021
Token and
Token ID290
Feature activation+5.71e-01

Pos logit contributions
+0.010
 Will+0.006
 �+0.006
 mid+0.006
 dismay+0.006
Neg logit contributions
glers-0.015
bel-0.015
limits-0.014
tub-0.014
kins-0.014
Token ran
Token ID4966
Feature activation-1.04e+00

Pos logit contributions
+0.016
 mid+0.012
 dismay+0.012
 �+0.012
itz+0.012
Neg logit contributions
glers-0.016
bel-0.015
kins-0.014
tub-0.014
limits-0.014
Token home
Token ID1363
Feature activation+8.32e-01

Pos logit contributions
glers+0.029
bel+0.029
tub+0.026
limits+0.026
akers+0.025
Neg logit contributions
-0.028
itz-0.021
 mid-0.021
 fingert-0.021
 dismay-0.020
Token"
Token ID1
Feature activation-4.38e+00

Pos logit contributions
glers+0.008
bel+0.007
kins+0.007
tub+0.007
limits+0.007
Neg logit contributions
-0.006
 dismay-0.004
itz-0.004
 fingert-0.004
 mid-0.004
Token and
Token ID290
Feature activation+4.57e-01

Pos logit contributions
glers+0.158
bel+0.149
kins+0.140
limits+0.140
tub+0.139
Neg logit contributions
-0.090
 �-0.059
 Will-0.058
 mid-0.057
 farewell-0.053
Token "
Token ID366
Feature activation-1.22e+00

Pos logit contributions
+0.011
itz+0.008
 dismay+0.008
 fingert+0.008
AY+0.008
Neg logit contributions
bel-0.014
glers-0.013
kins-0.012
limits-0.012
tub-0.012
Tokenthank
Token ID40716
Feature activation-2.71e+00

Pos logit contributions
glers+0.026
bel+0.025
kins+0.022
tub+0.022
limits+0.022
Neg logit contributions
-0.041
 mid-0.033
 �-0.033
 Will-0.033
 dismay-0.032
Token you
Token ID345
Feature activation+1.47e+00

Pos logit contributions
glers+0.077
bel+0.074
kins+0.067
limits+0.066
akers+0.065
Neg logit contributions
-0.070
 fingert-0.055
 dismay-0.055
itz-0.054
 mid-0.054
Token"
Token ID1
Feature activation-5.99e+00

Pos logit contributions
+0.035
 mid+0.025
 dismay+0.025
itz+0.025
 fingert+0.024
Neg logit contributions
glers-0.047
bel-0.045
kins-0.041
tub-0.041
limits-0.041
Token a
Token ID257
Feature activation+7.05e-01

Pos logit contributions
glers+0.203
bel+0.191
kins+0.179
limits+0.179
tub+0.177
Neg logit contributions
-0.136
 Will-0.092
 �-0.092
 mid-0.089
itz-0.085
Token lot
Token ID1256
Feature activation-1.40e+00

Pos logit contributions
+0.017
itz+0.013
 mid+0.013
 fingert+0.013
 fingertips+0.013
Neg logit contributions
bel-0.021
glers-0.021
tub-0.019
kins-0.018
akers-0.018
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
glers+0.041
bel+0.039
kins+0.036
tub+0.036
limits+0.035
Neg logit contributions
-0.036
 dismay-0.027
 mid-0.027
 fingert-0.026
itz-0.026
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
glers+0.013
bel+0.013
kins+0.012
limits+0.012
asis+0.012
Neg logit contributions
-0.008
 �-0.006
 Will-0.005
 Pret-0.005
 mid-0.005
Token
Token ID198
Feature activation-4.42e-01

Pos logit contributions
glers+0.017
bel+0.016
limits+0.015
kins+0.015
tub+0.015
Neg logit contributions
-0.008
 Will-0.005
 �-0.005
 mid-0.005
 dismay-0.005
Token it
Token ID340
Feature activation+5.93e-01

Pos logit contributions
+0.006
 Will+0.004
 �+0.004
 mid+0.004
 delight+0.004
Neg logit contributions
glers-0.009
bel-0.008
limits-0.008
akers-0.008
tub-0.008
Token said
Token ID531
Feature activation-1.24e+00

Pos logit contributions
+0.016
 Will+0.011
 �+0.011
 mid+0.011
 dismay+0.011
Neg logit contributions
glers-0.018
bel-0.017
limits-0.015
tub-0.015
kins-0.015
Token "
Token ID366
Feature activation-8.72e-01

Pos logit contributions
bel+0.033
glers+0.033
kins+0.031
tub+0.030
limits+0.029
Neg logit contributions
-0.033
itz-0.026
 dismay-0.025
 mid-0.025
 fingert-0.025
Tokenthank
Token ID40716
Feature activation-2.55e+00

Pos logit contributions
glers+0.018
bel+0.017
tub+0.015
kins+0.015
limits+0.015
Neg logit contributions
-0.031
 Will-0.024
 mid-0.024
 �-0.024
itz-0.024
Token you
Token ID345
Feature activation+9.88e-01

Pos logit contributions
glers+0.064
bel+0.061
kins+0.055
asis+0.054
limits+0.054
Neg logit contributions
-0.071
 dismay-0.060
 �-0.058
 fingert-0.058
itz-0.057
Token"
Token ID1
Feature activation-5.96e+00

Pos logit contributions
+0.025
itz+0.019
 dismay+0.019
 mid+0.018
Mine+0.018
Neg logit contributions
glers-0.029
bel-0.028
kins-0.026
tub-0.026
akers-0.025
Token to
Token ID284
Feature activation-8.22e-01

Pos logit contributions
glers+0.211
bel+0.199
limits+0.190
kins+0.188
tub+0.186
Neg logit contributions
-0.128
 Will-0.084
 �-0.081
 mid-0.080
 Pret-0.076
Token the
Token ID262
Feature activation+4.62e-01

Pos logit contributions
glers+0.027
bel+0.026
tub+0.024
limits+0.024
kins+0.024
Neg logit contributions
-0.019
 mid-0.013
 Will-0.013
 �-0.013
 dismay-0.013
Token hipp
Token ID18568
Feature activation+9.42e-02

Pos logit contributions
+0.012
 Will+0.008
 mid+0.008
 dismay+0.008
 delight+0.008
Neg logit contributions
glers-0.014
bel-0.014
limits-0.013
kins-0.013
tub-0.012
Tokeno
Token ID78
Feature activation+5.87e-01

Pos logit contributions
+0.003
 Will+0.003
 �+0.003
 mid+0.003
itz+0.002
Neg logit contributions
glers-0.002
bel-0.002
limits-0.002
kins-0.002
tub-0.002
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
+0.015
 mid+0.011
itz+0.011
 dismay+0.011
 �+0.010
Neg logit contributions
glers-0.018
bel-0.017
tub-0.016
kins-0.016
limits-0.015
Token say
Token ID910
Feature activation-8.71e-01

Pos logit contributions
glers+0.029
bel+0.028
kins+0.026
tub+0.026
limits+0.026
Neg logit contributions
-0.021
 mid-0.015
 �-0.015
 Will-0.015
 dismay-0.015
Token "
Token ID366
Feature activation-1.64e+00

Pos logit contributions
glers+0.024
bel+0.024
kins+0.021
limits+0.021
tub+0.021
Neg logit contributions
-0.023
itz-0.017
 mid-0.017
AY-0.017
Mine-0.017
TokenB
Token ID33
Feature activation-5.18e-01

Pos logit contributions
glers+0.035
bel+0.035
limits+0.029
kins+0.029
tub+0.029
Neg logit contributions
-0.055
 �-0.045
 dismay-0.044
 farewell-0.044
 delight-0.043
Tokenless
Token ID1203
Feature activation+9.28e-01

Pos logit contributions
glers+0.013
bel+0.013
akers+0.012
tub+0.012
limits+0.012
Neg logit contributions
-0.015
 delight-0.012
 dismay-0.012
 �-0.012
 mid-0.012
Token you
Token ID345
Feature activation+1.46e+00

Pos logit contributions
+0.024
 dismay+0.020
 farewell+0.019
 delight+0.019
 persever+0.019
Neg logit contributions
glers-0.025
bel-0.023
tub-0.022
kins-0.022
ards-0.021
Token"
Token ID1
Feature activation-5.89e+00

Pos logit contributions
+0.034
 dismay+0.027
 fingert+0.026
Mine+0.026
 �+0.026
Neg logit contributions
glers-0.043
bel-0.041
kins-0.039
tub-0.039
akers-0.038
Token to
Token ID284
Feature activation-7.85e-01

Pos logit contributions
glers+0.183
bel+0.174
kins+0.161
tub+0.160
limits+0.159
Neg logit contributions
-0.147
 mid-0.104
 �-0.104
 Will-0.104
 dismay-0.101
Token anyone
Token ID2687
Feature activation+7.75e-01

Pos logit contributions
glers+0.024
bel+0.023
limits+0.021
tub+0.020
kins+0.020
Neg logit contributions
-0.020
 mid-0.014
 �-0.014
 Will-0.014
AY-0.014
Token who
Token ID508
Feature activation+3.08e-01

Pos logit contributions
+0.019
 mid+0.014
 farewell+0.014
 dismay+0.014
 �+0.014
Neg logit contributions
glers-0.024
bel-0.023
kins-0.021
tub-0.021
limits-0.020
Token sne
Token ID10505
Feature activation-2.42e+00

Pos logit contributions
+0.008
 Will+0.006
 mid+0.006
 dismay+0.006
 �+0.006
Neg logit contributions
glers-0.009
bel-0.008
tub-0.008
kins-0.008
limits-0.008
Tokenez
Token ID8471
Feature activation+6.31e-01

Pos logit contributions
glers+0.091
bel+0.089
tub+0.083
kins+0.082
limits+0.081
Neg logit contributions
-0.044
 Will-0.027
 mid-0.026
 dismay-0.026
 farewell-0.026
Token writes
Token ID6797
Feature activation-3.91e-01

Pos logit contributions
glers+0.034
bel+0.032
kins+0.028
tub+0.028
limits+0.028
Neg logit contributions
-0.036
 dismay-0.029
 fingertips-0.028
itz-0.028
 mid-0.028
Token "
Token ID366
Feature activation-1.15e+00

Pos logit contributions
glers+0.012
bel+0.011
kins+0.010
tub+0.010
limits+0.010
Neg logit contributions
-0.010
 mid-0.007
 �-0.007
 Will-0.007
 dismay-0.007
TokenI
Token ID40
Feature activation+1.22e+00

Pos logit contributions
glers+0.031
bel+0.031
tub+0.027
limits+0.027
kins+0.027
Neg logit contributions
-0.032
 dismay-0.024
 �-0.024
 mid-0.024
 farewell-0.024
Token love
Token ID1842
Feature activation+6.40e-01

Pos logit contributions
+0.026
 Will+0.017
 �+0.016
 mid+0.016
 farewell+0.016
Neg logit contributions
glers-0.043
bel-0.041
limits-0.038
kins-0.038
tub-0.038
Token you
Token ID345
Feature activation+1.23e+00

Pos logit contributions
+0.018
 dismay+0.015
itz+0.014
 farewell+0.014
 fingert+0.014
Neg logit contributions
glers-0.016
bel-0.015
kins-0.014
akers-0.014
tub-0.013
Token"
Token ID1
Feature activation-5.85e+00

Pos logit contributions
+0.031
itz+0.025
 dismay+0.024
 fingert+0.024
 mid+0.024
Neg logit contributions
glers-0.034
bel-0.033
kins-0.032
tub-0.031
akers-0.030
Token on
Token ID319
Feature activation+1.62e+00

Pos logit contributions
glers+0.186
bel+0.173
kins+0.164
tub+0.163
limits+0.162
Neg logit contributions
-0.142
 Will-0.103
 �-0.099
 mid-0.098
itz-0.094
Token his
Token ID465
Feature activation-6.85e-01

Pos logit contributions
+0.041
itz+0.030
 fingert+0.030
AY+0.030
 mid+0.030
Neg logit contributions
glers-0.047
bel-0.046
kins-0.042
tub-0.042
limits-0.041
Token paper
Token ID3348
Feature activation+7.50e-01

Pos logit contributions
bel+0.018
kins+0.016
glers+0.016
tub+0.015
akers+0.015
Neg logit contributions
-0.019
itz-0.016
 �-0.016
 Will-0.015
AY-0.015
Token.
Token ID13
Feature activation-6.70e-01

Pos logit contributions
+0.016
 �+0.011
 Will+0.010
 mid+0.010
 fingert+0.010
Neg logit contributions
glers-0.026
bel-0.025
limits-0.024
ained-0.023
tub-0.023
Token He
Token ID679
Feature activation-2.57e-01

Pos logit contributions
glers+0.016
bel+0.016
tub+0.014
kins+0.014
limits+0.014
Neg logit contributions
-0.021
 mid-0.016
 dismay-0.016
 fingert-0.016
itz-0.016
Token to
Token ID284
Feature activation-1.41e+00

Pos logit contributions
+0.014
nown+0.013
itz+0.012
 Deal+0.012
 fingertips+0.012
Neg logit contributions
glers-0.011
bel-0.010
tub-0.009
ards-0.009
 pies-0.008
Token sing
Token ID1702
Feature activation+1.29e+00

Pos logit contributions
glers+0.044
bel+0.042
tub+0.038
kins+0.038
limits+0.038
Neg logit contributions
-0.034
 mid-0.025
 �-0.025
 dismay-0.024
 Will-0.024
Token with
Token ID351
Feature activation+2.77e-01

Pos logit contributions
+0.035
 dismay+0.028
itz+0.027
 mid+0.027
 fingert+0.027
Neg logit contributions
glers-0.035
bel-0.034
tub-0.031
kins-0.031
limits-0.029
Token you
Token ID345
Feature activation+1.37e+00

Pos logit contributions
+0.008
nown+0.006
 �+0.006
itz+0.006
 fingert+0.006
Neg logit contributions
glers-0.007
bel-0.007
kins-0.006
ards-0.006
tub-0.006
Token more
Token ID517
Feature activation-2.13e+00

Pos logit contributions
+0.034
itz+0.027
 dismay+0.027
 fingert+0.026
 mid+0.026
Neg logit contributions
glers-0.040
bel-0.037
kins-0.036
tub-0.035
akers-0.033
Token"
Token ID1
Feature activation-5.81e+00

Pos logit contributions
glers+0.056
bel+0.054
kins+0.050
tub+0.048
akers+0.046
Neg logit contributions
-0.060
 dismay-0.048
 mid-0.048
itz-0.047
Mine-0.047
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
glers+0.183
bel+0.175
limits+0.163
tub+0.162
kins+0.161
Neg logit contributions
-0.142
 �-0.100
 Will-0.099
 mid-0.098
itz-0.095
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
glers+0.042
bel+0.040
tub+0.038
limits+0.037
kins+0.037
Neg logit contributions
-0.026
 �-0.017
 mid-0.017
 Will-0.017
 dismay-0.017
TokenThey
Token ID2990
Feature activation-4.66e-01

Pos logit contributions
glers+0.018
bel+0.017
tub+0.016
limits+0.015
kins+0.015
Neg logit contributions
-0.017
 �-0.012
 mid-0.012
 dismay-0.012
 Will-0.012
Token sing
Token ID1702
Feature activation+1.51e+00

Pos logit contributions
glers+0.013
bel+0.012
kins+0.011
tub+0.011
limits+0.011
Neg logit contributions
-0.012
 mid-0.009
 Pret-0.009
AY-0.009
itz-0.009
Token more
Token ID517
Feature activation-2.61e+00

Pos logit contributions
+0.039
itz+0.028
 mid+0.028
 dismay+0.028
 fingert+0.028
Neg logit contributions
glers-0.045
bel-0.044
tub-0.040
kins-0.040
limits-0.038
Token and
Token ID290
Feature activation-1.38e+00

Pos logit contributions
+0.015
 Will+0.010
 �+0.010
 mid+0.010
 dismay+0.009
Neg logit contributions
glers-0.027
bel-0.026
limits-0.024
kins-0.024
tub-0.024
Token said
Token ID531
Feature activation-1.32e+00

Pos logit contributions
glers+0.047
bel+0.044
asis+0.042
tub+0.041
ained+0.041
Neg logit contributions
-0.032
 delight-0.022
 Will-0.022
 �-0.022
 fingert-0.021
Token "
Token ID366
Feature activation-1.08e+00

Pos logit contributions
glers+0.038
bel+0.037
kins+0.034
tub+0.033
limits+0.033
Neg logit contributions
-0.034
 mid-0.026
 dismay-0.026
itz-0.026
 fingert-0.025
Tokenthank
Token ID40716
Feature activation-2.55e+00

Pos logit contributions
glers+0.027
bel+0.026
tub+0.023
kins+0.023
limits+0.023
Neg logit contributions
-0.033
 mid-0.025
 Will-0.025
 �-0.025
 dismay-0.025
Token you
Token ID345
Feature activation+7.28e-01

Pos logit contributions
glers+0.060
bel+0.059
kins+0.054
limits+0.053
akers+0.051
Neg logit contributions
-0.070
 dismay-0.060
 farewell-0.058
 fingert-0.058
 �-0.058
Token"
Token ID1
Feature activation-5.81e+00

Pos logit contributions
+0.018
 mid+0.012
 dismay+0.012
 Will+0.012
 �+0.012
Neg logit contributions
glers-0.023
bel-0.022
kins-0.020
tub-0.020
limits-0.020
Token to
Token ID284
Feature activation-8.33e-01

Pos logit contributions
glers+0.211
bel+0.196
kins+0.187
limits+0.187
tub+0.186
Neg logit contributions
-0.120
 �-0.075
 Will-0.075
 mid-0.073
AY-0.069
Token her
Token ID607
Feature activation+2.07e+00

Pos logit contributions
glers+0.024
bel+0.022
kins+0.021
tub+0.020
limits+0.020
Neg logit contributions
-0.023
 farewell-0.017
AY-0.016
itz-0.016
nown-0.016
Token mom
Token ID1995
Feature activation-1.33e+00

Pos logit contributions
+0.034
 Will+0.020
 �+0.020
 dismay+0.019
 mid+0.018
Neg logit contributions
glers-0.082
bel-0.080
limits-0.076
tub-0.074
asis-0.074
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
glers+0.044
bel+0.043
limits+0.040
tub+0.039
kins+0.039
Neg logit contributions
-0.031
 �-0.021
 Will-0.021
 mid-0.020
 dismay-0.020
Token She
Token ID1375
Feature activation-9.30e-01

Pos logit contributions
glers+0.012
bel+0.012
tub+0.011
limits+0.010
kins+0.010
Neg logit contributions
-0.013
 dismay-0.010
 mid-0.010
 fingert-0.010
 persever-0.010
Token"
Token ID1
Feature activation-4.27e+00

Pos logit contributions
glers+0.019
bel+0.018
kins+0.017
tub+0.017
akers+0.016
Neg logit contributions
-0.016
 dismay-0.012
itz-0.012
 fingert-0.011
 mid-0.011
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
glers+0.156
bel+0.148
kins+0.140
limits+0.140
asis+0.138
Neg logit contributions
-0.084
 �-0.055
 Will-0.053
 mid-0.053
 fingert-0.049
Token "
Token ID366
Feature activation-7.19e-01

Pos logit contributions
+0.027
 dismay+0.020
 mid+0.019
itz+0.019
 fingert+0.019
Neg logit contributions
glers-0.037
bel-0.036
kins-0.033
tub-0.033
limits-0.032
Tokenthank
Token ID40716
Feature activation-2.47e+00

Pos logit contributions
glers+0.012
bel+0.011
kins+0.010
limits+0.009
tub+0.009
Neg logit contributions
-0.028
 Will-0.023
 mid-0.023
 �-0.023
itz-0.022
Token you
Token ID345
Feature activation+1.34e+00

Pos logit contributions
glers+0.072
bel+0.069
limits+0.062
kins+0.062
tub+0.061
Neg logit contributions
-0.062
 dismay-0.048
 mid-0.047
 fingert-0.047
 farewell-0.047
Token"
Token ID1
Feature activation-5.79e+00

Pos logit contributions
+0.032
 mid+0.022
 dismay+0.022
 �+0.022
itz+0.022
Neg logit contributions
glers-0.043
bel-0.041
tub-0.038
kins-0.038
limits-0.037
Token for
Token ID329
Feature activation-1.43e+00

Pos logit contributions
glers+0.205
bel+0.191
kins+0.184
asis+0.184
limits+0.183
Neg logit contributions
-0.122
 �-0.079
 Will-0.079
 mid-0.075
AY-0.072
Token everything
Token ID2279
Feature activation-6.10e-01

Pos logit contributions
glers+0.045
bel+0.043
tub+0.040
limits+0.040
kins+0.039
Neg logit contributions
-0.035
 Will-0.025
 mid-0.025
 �-0.025
 dismay-0.024
Token.
Token ID13
Feature activation+5.57e-01

Pos logit contributions
glers+0.016
bel+0.016
limits+0.015
asis+0.015
tub+0.014
Neg logit contributions
-0.018
 �-0.014
 Will-0.013
 mid-0.013
 farewell-0.013
Token
Token ID198
Feature activation-9.60e-02

Pos logit contributions
+0.011
 �+0.007
 Will+0.007
 Pret+0.006
 delight+0.006
Neg logit contributions
glers-0.021
bel-0.020
kins-0.019
limits-0.019
asis-0.019
Token
Token ID198
Feature activation-3.85e-01

Pos logit contributions
glers+0.004
bel+0.004
limits+0.003
kins+0.003
asis+0.003
Neg logit contributions
-0.002
 Will-0.001
 �-0.001
 dismay-0.001
itz-0.001
Token wrote
Token ID2630
Feature activation-2.09e-01

Pos logit contributions
+0.009
 Will+0.007
 �+0.007
 mid+0.006
 dismay+0.006
Neg logit contributions
glers-0.013
bel-0.012
tub-0.011
limits-0.011
kins-0.011
Token "
Token ID366
Feature activation-9.49e-01

Pos logit contributions
glers+0.007
bel+0.006
kins+0.006
tub+0.006
limits+0.006
Neg logit contributions
-0.005
itz-0.004
 mid-0.004
 dismay-0.004
 fingert-0.003
TokenI
Token ID40
Feature activation+1.53e+00

Pos logit contributions
bel+0.027
glers+0.026
limits+0.023
tub+0.023
kins+0.023
Neg logit contributions
-0.024
 dismay-0.021
 delight-0.020
 �-0.020
 farewell-0.020
Token love
Token ID1842
Feature activation+2.77e-01

Pos logit contributions
+0.038
 mid+0.027
 dismay+0.027
 �+0.027
itz+0.027
Neg logit contributions
glers-0.047
bel-0.045
tub-0.041
kins-0.041
limits-0.041
Token you
Token ID345
Feature activation+8.62e-01

Pos logit contributions
+0.007
 dismay+0.006
itz+0.006
 farewell+0.006
 fingert+0.006
Neg logit contributions
glers-0.007
bel-0.007
kins-0.006
akers-0.006
cards-0.006
Token"
Token ID1
Feature activation-5.77e+00

Pos logit contributions
+0.021
 dismay+0.017
itz+0.017
 fingert+0.017
 mid+0.016
Neg logit contributions
glers-0.024
bel-0.023
kins-0.022
tub-0.021
akers-0.021
Token on
Token ID319
Feature activation+1.33e+00

Pos logit contributions
glers+0.180
bel+0.170
kins+0.159
tub+0.158
limits+0.157
Neg logit contributions
-0.143
 Will-0.102
 �-0.101
 mid-0.101
 dismay-0.098
Token it
Token ID340
Feature activation+5.58e-01

Pos logit contributions
+0.035
itz+0.026
AY+0.026
 farewell+0.026
 fingert+0.026
Neg logit contributions
glers-0.037
bel-0.036
kins-0.033
tub-0.033
limits-0.031
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
+0.014
itz+0.011
 farewell+0.011
 mid+0.011
AY+0.011
Neg logit contributions
glers-0.016
bel-0.015
tub-0.014
kins-0.014
limits-0.013
Token He
Token ID679
Feature activation-4.12e-01

Pos logit contributions
glers+0.006
bel+0.006
tub+0.005
limits+0.005
kins+0.005
Neg logit contributions
-0.007
 mid-0.005
itz-0.005
 fingert-0.005
 farewell-0.005
Token put
Token ID1234
Feature activation-1.85e+00

Pos logit contributions
glers+0.011
bel+0.010
kins+0.009
tub+0.009
limits+0.009
Neg logit contributions
-0.012
 farewell-0.009
 mid-0.009
 �-0.009
AY-0.009
Token likes
Token ID7832
Feature activation+1.43e+00

Pos logit contributions
+0.001
 Will+0.000
 �+0.000
 mid+0.000
 dismay+0.000
Neg logit contributions
glers-0.001
bel-0.001
kins-0.001
limits-0.001
tub-0.001
Token to
Token ID284
Feature activation-2.74e-01

Pos logit contributions
+0.037
 dismay+0.027
itz+0.027
 fingert+0.026
 mid+0.026
Neg logit contributions
glers-0.043
bel-0.041
tub-0.037
kins-0.036
limits-0.035
Token say
Token ID910
Feature activation-5.96e-01

Pos logit contributions
glers+0.009
bel+0.008
kins+0.008
tub+0.008
limits+0.007
Neg logit contributions
-0.007
 mid-0.005
 �-0.005
 dismay-0.005
 Will-0.005
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
glers+0.019
bel+0.018
kins+0.016
tub+0.016
limits+0.016
Neg logit contributions
-0.014
 dismay-0.010
itz-0.010
 fingert-0.010
 mid-0.010
Tokenhello
Token ID31373
Feature activation+1.15e+00

Pos logit contributions
bel+0.035
glers+0.034
limits+0.029
tub+0.029
ained+0.029
Neg logit contributions
-0.037
 dismay-0.031
 �-0.030
 delight-0.029
itz-0.029
Token"
Token ID1
Feature activation-5.73e+00

Pos logit contributions
+0.030
 dismay+0.028
 fingert+0.027
 fingertips+0.026
 persever+0.026
Neg logit contributions
glers-0.028
bel-0.027
kins-0.025
tub-0.024
akers-0.023
Token and
Token ID290
Feature activation+6.99e-01

Pos logit contributions
glers+0.191
bel+0.182
kins+0.170
limits+0.169
tub+0.168
Neg logit contributions
-0.130
 �-0.088
 Will-0.088
 mid-0.087
 dismay-0.084
Token "
Token ID366
Feature activation-9.78e-01

Pos logit contributions
+0.016
 dismay+0.013
itz+0.012
 mid+0.012
AY+0.012
Neg logit contributions
glers-0.021
bel-0.021
tub-0.018
kins-0.018
limits-0.018
TokenI
Token ID40
Feature activation+1.50e+00

Pos logit contributions
bel+0.027
glers+0.026
tub+0.023
limits+0.022
ained+0.022
Neg logit contributions
-0.026
 dismay-0.022
 �-0.021
 fingert-0.021
itz-0.021
Token love
Token ID1842
Feature activation+6.82e-01

Pos logit contributions
+0.041
 mid+0.030
 �+0.030
 dismay+0.030
 Will+0.030
Neg logit contributions
glers-0.043
bel-0.041
tub-0.037
kins-0.037
limits-0.037
Token you
Token ID345
Feature activation+1.22e+00

Pos logit contributions
+0.019
 dismay+0.016
itz+0.015
nown+0.015
 fingert+0.015
Neg logit contributions
glers-0.018
bel-0.016
ards-0.014
kins-0.014
ics-0.014
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
+0.044
 �+0.030
 mid+0.030
 Will+0.030
 farewell+0.029
Neg logit contributions
glers-0.063
bel-0.061
limits-0.056
tub-0.056
kins-0.056
Token "
Token ID366
Feature activation-8.19e-01

Pos logit contributions
glers+0.001
bel+0.000
tub+0.000
limits+0.000
kins+0.000
Neg logit contributions
-0.000
 �-0.000
 Will-0.000
 mid-0.000
 dismay-0.000
TokenI
Token ID40
Feature activation+1.12e+00

Pos logit contributions
glers+0.023
bel+0.023
limits+0.020
tub+0.020
kins+0.020
Neg logit contributions
-0.021
 dismay-0.017
 �-0.017
 farewell-0.016
 fingert-0.016
Token love
Token ID1842
Feature activation+4.06e-01

Pos logit contributions
+0.021
 Will+0.013
 �+0.013
 mid+0.012
 dismay+0.012
Neg logit contributions
glers-0.042
bel-0.040
tub-0.038
kins-0.038
limits-0.038
Token you
Token ID345
Feature activation+7.56e-01

Pos logit contributions
+0.011
 dismay+0.009
 farewell+0.009
 fingert+0.009
itz+0.008
Neg logit contributions
glers-0.011
bel-0.011
kins-0.009
akers-0.009
tub-0.009
Token"
Token ID1
Feature activation-5.71e+00

Pos logit contributions
+0.020
 dismay+0.016
itz+0.016
 persever+0.015
 mid+0.015
Neg logit contributions
glers-0.020
bel-0.019
kins-0.019
akers-0.018
tub-0.018
Token on
Token ID319
Feature activation+1.41e+00

Pos logit contributions
glers+0.189
bel+0.177
asis+0.169
kins+0.168
limits+0.167
Neg logit contributions
-0.132
 Will-0.091
 �-0.089
 mid-0.087
itz-0.084
Token their
Token ID511
Feature activation-1.52e+00

Pos logit contributions
+0.035
itz+0.024
 mid+0.024
 farewell+0.024
 dismay+0.024
Neg logit contributions
glers-0.044
bel-0.043
kins-0.039
tub-0.039
limits-0.038
Token pictures
Token ID5986
Feature activation-1.18e-01

Pos logit contributions
bel+0.037
glers+0.037
kins+0.034
akers+0.031
tub+0.031
Neg logit contributions
-0.046
itz-0.037
 Will-0.036
 �-0.036
AY-0.035
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
glers+0.004
bel+0.004
limits+0.003
kins+0.003
tub+0.003
Neg logit contributions
-0.003
 �-0.002
 mid-0.002
 Will-0.002
 dismay-0.002
Token They
Token ID1119
Feature activation+4.66e-02

Pos logit contributions
glers+0.006
bel+0.006
tub+0.005
kins+0.005
limits+0.005
Neg logit contributions
-0.007
 mid-0.006
 dismay-0.006
 fingert-0.006
 farewell-0.006
Token naughty
Token ID45128
Feature activation-7.21e-01

Pos logit contributions
glers+0.034
bel+0.033
tub+0.030
kins+0.030
limits+0.029
Neg logit contributions
-0.031
 mid-0.022
 �-0.022
 Will-0.022
 dismay-0.022
Token and
Token ID290
Feature activation-2.80e-01

Pos logit contributions
glers+0.024
bel+0.023
kins+0.021
limits+0.021
tub+0.021
Neg logit contributions
-0.017
 �-0.012
 Will-0.012
 mid-0.012
 dismay-0.011
Token did
Token ID750
Feature activation-3.10e-01

Pos logit contributions
glers+0.009
bel+0.009
kins+0.008
tub+0.008
asis+0.008
Neg logit contributions
-0.007
 �-0.005
 Will-0.005
 mid-0.004
 dismay-0.004
Token not
Token ID407
Feature activation-5.78e-01

Pos logit contributions
glers+0.007
bel+0.007
kins+0.006
limits+0.006
tub+0.006
Neg logit contributions
-0.010
 mid-0.008
itz-0.008
 Will-0.008
 fingert-0.008
Token share
Token ID2648
Feature activation-9.87e-01

Pos logit contributions
glers+0.017
bel+0.016
limits+0.014
kins+0.014
tub+0.014
Neg logit contributions
-0.015
 �-0.011
 mid-0.011
itz-0.011
 farewell-0.011
Token"
Token ID1
Feature activation-5.71e+00

Pos logit contributions
glers+0.026
bel+0.025
kins+0.024
tub+0.023
limits+0.022
Neg logit contributions
-0.026
itz-0.021
 fingert-0.021
 farewell-0.021
AY-0.021
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
glers+0.173
bel+0.164
kins+0.152
tub+0.151
limits+0.150
Neg logit contributions
-0.147
 Will-0.105
 �-0.105
 mid-0.105
 dismay-0.103
Token included
Token ID3017
Feature activation+3.55e-01

Pos logit contributions
+0.035
 Will+0.026
 �+0.026
 dismay+0.025
 delight+0.024
Neg logit contributions
glers-0.039
bel-0.037
asis-0.035
kins-0.034
tub-0.034
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
+0.008
 Will+0.006
 �+0.006
 mid+0.006
 dismay+0.006
Neg logit contributions
glers-0.012
bel-0.011
limits-0.010
kins-0.010
tub-0.010
Token smile
Token ID8212
Feature activation+3.59e-01

Pos logit contributions
+0.027
 mid+0.019
 Will+0.019
 �+0.019
 dismay+0.019
Neg logit contributions
glers-0.036
bel-0.034
tub-0.032
kins-0.031
limits-0.031
Tokeny
Token ID88
Feature activation-5.19e-01

Pos logit contributions
+0.009
 dismay+0.006
 mid+0.006
itz+0.006
 Will+0.006
Neg logit contributions
glers-0.011
bel-0.011
kins-0.010
tub-0.010
limits-0.009
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
+0.043
 dismay+0.031
itz+0.030
 mid+0.030
 fingert+0.030
Neg logit contributions
glers-0.059
bel-0.057
kins-0.052
tub-0.052
limits-0.051
Token "
Token ID366
Feature activation-1.10e+00

Pos logit contributions
glers+0.005
bel+0.005
kins+0.005
tub+0.005
limits+0.005
Neg logit contributions
-0.005
itz-0.004
 dismay-0.004
 mid-0.004
AY-0.004
TokenI
Token ID40
Feature activation+8.70e-01

Pos logit contributions
bel+0.030
glers+0.029
tub+0.026
limits+0.025
kins+0.025
Neg logit contributions
-0.030
 dismay-0.025
 �-0.024
 delight-0.024
 farewell-0.024
Token love
Token ID1842
Feature activation+6.34e-01

Pos logit contributions
+0.019
 mid+0.013
 �+0.013
 dismay+0.013
 Will+0.013
Neg logit contributions
glers-0.029
bel-0.028
kins-0.026
tub-0.026
limits-0.026
Token you
Token ID345
Feature activation+9.24e-01

Pos logit contributions
+0.017
 dismay+0.014
 farewell+0.013
itz+0.013
nown+0.013
Neg logit contributions
glers-0.018
bel-0.017
kins-0.016
tub-0.015
akers-0.015
Token"
Token ID1
Feature activation-5.70e+00

Pos logit contributions
+0.023
 dismay+0.019
itz+0.019
 persever+0.018
 mid+0.018
Neg logit contributions
glers-0.026
bel-0.024
kins-0.024
tub-0.023
akers-0.022
Token on
Token ID319
Feature activation+1.43e+00

Pos logit contributions
glers+0.185
bel+0.174
asis+0.166
kins+0.164
limits+0.164
Neg logit contributions
-0.136
 Will-0.095
 �-0.092
 mid-0.090
itz-0.087
Token their
Token ID511
Feature activation-1.40e+00

Pos logit contributions
+0.036
itz+0.026
 farewell+0.026
 fingert+0.026
 dismay+0.026
Neg logit contributions
glers-0.042
bel-0.041
kins-0.038
tub-0.037
limits-0.036
Token pictures
Token ID5986
Feature activation-4.76e-02

Pos logit contributions
glers+0.034
bel+0.033
kins+0.031
tub+0.029
asis+0.028
Neg logit contributions
-0.041
itz-0.034
 �-0.033
nown-0.033
 Will-0.032
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
glers+0.002
bel+0.002
limits+0.001
tub+0.001
kins+0.001
Neg logit contributions
-0.001
 �-0.001
 mid-0.001
 Will-0.001
 dismay-0.001
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
glers+0.016
bel+0.015
tub+0.014
kins+0.013
limits+0.013
Neg logit contributions
-0.014
 mid-0.011
 dismay-0.011
 fingert-0.011
 farewell-0.011
Token it
Token ID340
Feature activation+1.26e+00

Pos logit contributions
glers+0.004
bel+0.004
kins+0.004
tub+0.004
limits+0.003
Neg logit contributions
-0.004
itz-0.003
 mid-0.003
 dismay-0.003
AY-0.003
Token away
Token ID1497
Feature activation+1.11e+00

Pos logit contributions
+0.030
 mid+0.021
 �+0.021
 Will+0.021
 dismay+0.021
Neg logit contributions
glers-0.040
bel-0.038
kins-0.035
tub-0.035
limits-0.035
Token in
Token ID287
Feature activation+8.60e-01

Pos logit contributions
+0.027
 mid+0.019
 �+0.019
 Will+0.019
 dismay+0.019
Neg logit contributions
glers-0.035
bel-0.034
tub-0.031
kins-0.031
limits-0.031
Token a
Token ID257
Feature activation+2.25e-01

Pos logit contributions
+0.023
itz+0.018
 farewell+0.018
 wo+0.018
AY+0.017
Neg logit contributions
glers-0.024
bel-0.023
kins-0.022
tub-0.021
limits-0.020
Token wheel
Token ID7825
Feature activation-1.38e+00

Pos logit contributions
+0.006
itz+0.005
 Will+0.005
AY+0.005
 mid+0.005
Neg logit contributions
glers-0.006
bel-0.005
tub-0.005
kins-0.005
asis-0.005
Tokenbar
Token ID5657
Feature activation-5.69e+00

Pos logit contributions
glers+0.048
bel+0.047
kins+0.045
limits+0.045
tub+0.043
Neg logit contributions
-0.029
 �-0.018
 mid-0.018
 Will-0.017
 fingert-0.017
Tokenrow
Token ID808
Feature activation-9.76e-01

Pos logit contributions
glers+0.128
bel+0.116
limits+0.110
 thirsty+0.097
tub+0.094
Neg logit contributions
-0.179
 fingert-0.148
 arm-0.146
 tr-0.145
 half-0.142
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
glers+0.037
bel+0.035
limits+0.034
kins+0.033
asis+0.033
Neg logit contributions
-0.017
 �-0.011
 fingert-0.010
 mid-0.009
 tr-0.009
Token
Token ID198
Feature activation+4.92e-01

Pos logit contributions
+0.011
 mid+0.008
itz+0.008
 dismay+0.008
 fingert+0.008
Neg logit contributions
glers-0.014
bel-0.014
tub-0.013
kins-0.012
limits-0.012
Token
Token ID198
Feature activation+4.12e-01

Pos logit contributions
+0.010
 mid+0.007
 �+0.007
 Will+0.007
 dismay+0.007
Neg logit contributions
glers-0.017
bel-0.016
kins-0.015
tub-0.015
limits-0.015
TokenOnce
Token ID7454
Feature activation+1.12e+00

Pos logit contributions
+0.011
 mid+0.008
itz+0.008
 dismay+0.008
 fingert+0.008
Neg logit contributions
glers-0.012
bel-0.012
tub-0.011
limits-0.011
kins-0.010
Token filled
Token ID5901
Feature activation-1.12e+00

Pos logit contributions
glers+0.025
bel+0.023
kins+0.021
tub+0.021
limits+0.020
Neg logit contributions
-0.027
 dismay-0.021
 mid-0.021
itz-0.021
 farewell-0.021
Token up
Token ID510
Feature activation-5.08e-01

Pos logit contributions
glers+0.039
bel+0.037
limits+0.035
asis+0.034
akers+0.034
Neg logit contributions
-0.024
 mid-0.017
 Will-0.016
 delight-0.016
 �-0.016
Token lots
Token ID6041
Feature activation+1.84e-01

Pos logit contributions
glers+0.017
bel+0.016
limits+0.015
kins+0.015
tub+0.015
Neg logit contributions
-0.011
 mid-0.008
 Will-0.008
 dismay-0.008
 �-0.008
Token of
Token ID286
Feature activation-3.08e+00

Pos logit contributions
+0.005
 dismay+0.004
 Will+0.004
itz+0.004
AY+0.004
Neg logit contributions
glers-0.005
bel-0.005
kins-0.004
tub-0.004
limits-0.004
Token wheel
Token ID7825
Feature activation-1.66e+00

Pos logit contributions
glers+0.091
bel+0.086
limits+0.080
tub+0.078
asis+0.078
Neg logit contributions
-0.081
 delight-0.062
 mid-0.061
 dismay-0.060
Mine-0.059
Tokenbar
Token ID5657
Feature activation-5.69e+00

Pos logit contributions
glers+0.057
bel+0.054
limits+0.054
kins+0.053
asis+0.052
Neg logit contributions
-0.037
 �-0.023
 tr-0.022
 mid-0.022
 fingert-0.020
Tokenrows
Token ID8516
Feature activation-1.06e+00

Pos logit contributions
glers+0.130
bel+0.121
limits+0.115
 thirsty+0.103
tub+0.101
Neg logit contributions
-0.175
 tr-0.143
 half-0.141
 fingert-0.140
 eight-0.140
Token with
Token ID351
Feature activation+3.67e-01

Pos logit contributions
glers+0.037
bel+0.036
limits+0.033
kins+0.033
tub+0.033
Neg logit contributions
-0.024
 �-0.015
 mid-0.015
 Will-0.014
 farewell-0.014
Token the
Token ID262
Feature activation-6.69e-01

Pos logit contributions
+0.010
itz+0.007
 dismay+0.007
 Will+0.007
 mid+0.007
Neg logit contributions
glers-0.011
bel-0.010
kins-0.009
tub-0.009
limits-0.009
Token waste
Token ID7030
Feature activation-4.68e-01

Pos logit contributions
glers+0.019
bel+0.018
limits+0.016
kins+0.016
tub+0.016
Neg logit contributions
-0.019
 mid-0.014
 dismay-0.014
 �-0.014
 delight-0.014
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
glers+0.015
bel+0.014
limits+0.013
kins+0.013
tub+0.013
Neg logit contributions
-0.011
 �-0.008
 mid-0.008
 Will-0.008
 dismay-0.008
Token so
Token ID523
Feature activation-1.53e+00

Pos logit contributions
glers+0.040
bel+0.038
kins+0.035
tub+0.035
limits+0.035
Neg logit contributions
-0.033
 �-0.024
 Will-0.024
 mid-0.023
 fingert-0.022
Token he
Token ID339
Feature activation-1.80e+00

Pos logit contributions
glers+0.033
bel+0.031
tub+0.028
limits+0.027
kins+0.027
Neg logit contributions
-0.053
 �-0.042
 Will-0.042
 mid-0.042
 dismay-0.041
Token said
Token ID531
Feature activation-1.37e+00

Pos logit contributions
glers+0.052
bel+0.050
limits+0.046
tub+0.046
kins+0.045
Neg logit contributions
-0.049
 Will-0.036
 �-0.035
 mid-0.035
 fingert-0.035
Token "
Token ID366
Feature activation-1.40e+00

Pos logit contributions
glers+0.036
bel+0.036
kins+0.032
tub+0.032
limits+0.031
Neg logit contributions
-0.038
itz-0.030
 dismay-0.030
 mid-0.030
AY-0.029
Tokenyes
Token ID8505
Feature activation+1.46e+00

Pos logit contributions
glers+0.033
bel+0.030
kins+0.028
tub+0.027
limits+0.027
Neg logit contributions
-0.046
 Will-0.036
 mid-0.036
 �-0.035
 fingert-0.035
Token"
Token ID1
Feature activation-5.69e+00

Pos logit contributions
+0.040
 mid+0.033
 dismay+0.033
 delight+0.033
itz+0.033
Neg logit contributions
glers-0.037
bel-0.035
akers-0.032
kins-0.032
tub-0.031
Token and
Token ID290
Feature activation-3.36e-01

Pos logit contributions
glers+0.205
bel+0.196
kins+0.184
limits+0.183
tub+0.182
Neg logit contributions
-0.114
 �-0.073
 Will-0.073
 mid-0.072
 fingert-0.068
Token went
Token ID1816
Feature activation-1.54e+00

Pos logit contributions
glers+0.010
bel+0.009
kins+0.008
tub+0.008
limits+0.008
Neg logit contributions
-0.009
 mid-0.007
 dismay-0.007
 �-0.007
 Will-0.007
Token to
Token ID284
Feature activation+3.43e-01

Pos logit contributions
glers+0.053
bel+0.051
kins+0.048
tub+0.047
limits+0.047
Neg logit contributions
-0.033
 Will-0.022
 �-0.022
 mid-0.022
 farewell-0.021
Token find
Token ID1064
Feature activation-2.35e-01

Pos logit contributions
+0.010
 farewell+0.007
itz+0.007
 fingert+0.007
nown+0.007
Neg logit contributions
glers-0.009
bel-0.009
tub-0.008
limits-0.008
kins-0.008
Token his
Token ID465
Feature activation-2.52e-01

Pos logit contributions
glers+0.007
bel+0.007
kins+0.006
tub+0.006
limits+0.006
Neg logit contributions
-0.006
itz-0.004
nown-0.004
AY-0.004
 farewell-0.004
Token "
Token ID366
Feature activation+8.96e-01

Pos logit contributions
+0.008
 �+0.006
 Will+0.006
 mid+0.006
 dismay+0.006
Neg logit contributions
glers-0.011
bel-0.010
kins-0.009
limits-0.009
asis-0.009
TokenHello
Token ID15496
Feature activation+2.66e+00

Pos logit contributions
+0.024
 fingertips+0.021
 dismay+0.021
 fingert+0.020
 farewell+0.020
Neg logit contributions
bel-0.023
glers-0.022
limits-0.019
ained-0.018
avers-0.018
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
+0.069
 mid+0.051
 dismay+0.051
 fingert+0.050
 �+0.050
Neg logit contributions
glers-0.079
bel-0.075
kins-0.069
tub-0.068
limits-0.067
Token goat
Token ID26917
Feature activation+1.02e+00

Pos logit contributions
+0.049
 mid+0.039
 fingert+0.039
 dismay+0.039
 fingertips+0.039
Neg logit contributions
glers-0.051
bel-0.045
tub-0.043
kins-0.042
avers-0.041
Token.
Token ID13
Feature activation+1.97e+00

Pos logit contributions
+0.021
 Will+0.012
 �+0.011
 wo+0.010
 Lizard+0.010
Neg logit contributions
bel-0.039
glers-0.038
ained-0.037
limits-0.037
tub-0.036
Token What
Token ID1867
Feature activation+3.52e+00

Pos logit contributions
+0.048
 dismay+0.040
 mid+0.039
 fingert+0.039
itz+0.038
Neg logit contributions
glers-0.054
bel-0.052
tub-0.048
limits-0.047
avers-0.045
Token's
Token ID338
Feature activation-2.94e-01

Pos logit contributions
+0.093
itz+0.082
 dismay+0.081
 fingert+0.081
 farewell+0.080
Neg logit contributions
glers-0.089
bel-0.083
kins-0.081
tub-0.076
ics-0.072
Token your
Token ID534
Feature activation+7.31e-01

Pos logit contributions
glers+0.008
bel+0.008
kins+0.007
limits+0.007
tub+0.007
Neg logit contributions
-0.008
 dismay-0.006
 mid-0.006
 Will-0.006
 delight-0.006
Token name
Token ID1438
Feature activation+1.67e+00

Pos logit contributions
+0.019
 mid+0.014
 dismay+0.014
itz+0.014
 Will+0.014
Neg logit contributions
glers-0.020
bel-0.020
tub-0.018
asis-0.018
kins-0.017
Token?"
Token ID1701
Feature activation-3.34e-02

Pos logit contributions
+0.061
 mid+0.049
 �+0.049
 Will+0.049
 dismay+0.049
Neg logit contributions
glers-0.032
bel-0.030
tub-0.026
kins-0.026
limits-0.025
Token Lily
Token ID20037
Feature activation+6.92e-01

Pos logit contributions
glers+0.001
bel+0.001
kins+0.001
akers+0.001
asis+0.001
Neg logit contributions
-0.001
 Will-0.001
 �-0.001
 mid-0.001
 dismay-0.001
Token said
Token ID531
Feature activation-2.05e+00

Pos logit contributions
glers+0.014
bel+0.013
kins+0.012
tub+0.012
limits+0.011
Neg logit contributions
-0.017
 mid-0.013
 �-0.013
 dismay-0.013
 Will-0.013
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
glers+0.065
bel+0.063
kins+0.057
tub+0.057
limits+0.057
Neg logit contributions
-0.049
 mid-0.035
 �-0.034
 Will-0.034
 dismay-0.034
Token
Token ID198
Feature activation+7.96e-01

Pos logit contributions
bel+0.006
glers+0.006
tub+0.005
kins+0.005
limits+0.005
Neg logit contributions
-0.004
 mid-0.003
itz-0.003
 fingert-0.003
AY-0.003
Token
Token ID198
Feature activation+8.51e-01

Pos logit contributions
+0.018
 mid+0.013
 dismay+0.013
itz+0.012
 fingert+0.012
Neg logit contributions
glers-0.026
bel-0.025
kins-0.023
tub-0.023
limits-0.023
Token"
Token ID1
Feature activation-8.70e-01

Pos logit contributions
+0.021
 �+0.014
 mid+0.014
 Will+0.014
 dismay+0.014
Neg logit contributions
glers-0.028
bel-0.026
tub-0.024
kins-0.024
limits-0.024
TokenReally
Token ID26392
Feature activation+3.79e+00

Pos logit contributions
glers+0.027
bel+0.027
limits+0.024
kins+0.024
tub+0.024
Neg logit contributions
-0.021
 dismay-0.015
 mid-0.015
 �-0.015
 fingert-0.015
Token?
Token ID30
Feature activation-2.74e-01

Pos logit contributions
+0.126
 fingert+0.108
Mine+0.107
 farewell+0.107
 dismay+0.107
Neg logit contributions
glers-0.075
bel-0.071
kins-0.069
tub-0.063
asis-0.061
Token A
Token ID317
Feature activation+3.10e+00

Pos logit contributions
glers+0.010
bel+0.009
kins+0.009
tub+0.008
limits+0.008
Neg logit contributions
-0.006
 Will-0.004
 �-0.004
 mid-0.004
 Pret-0.004
Token witch
Token ID16365
Feature activation-1.64e-01

Pos logit contributions
+0.102
 mid+0.080
 �+0.079
 Will+0.079
 dismay+0.079
Neg logit contributions
glers-0.070
bel-0.067
tub-0.059
kins-0.059
limits-0.058
Token?"
Token ID1701
Feature activation-1.70e+00

Pos logit contributions
glers+0.005
bel+0.005
tub+0.004
limits+0.004
kins+0.004
Neg logit contributions
-0.005
 �-0.003
 Will-0.003
 mid-0.003
 wo-0.003
Token their
Token ID511
Feature activation+7.62e-01

Pos logit contributions
glers+0.058
bel+0.055
asis+0.054
tub+0.052
kins+0.052
Neg logit contributions
-0.039
 �-0.028
 Will-0.027
 dismay-0.026
 delight-0.025
Token She
Token ID1375
Feature activation+1.05e-02

Pos logit contributions
+0.007
 mid+0.005
 fingert+0.005
 farewell+0.005
itz+0.005
Neg logit contributions
glers-0.005
bel-0.005
tub-0.004
limits-0.004
ained-0.004
Token said
Token ID531
Feature activation-7.08e-01

Pos logit contributions
+0.000
 mid+0.000
AY+0.000
itz+0.000
 �+0.000
Neg logit contributions
glers-0.000
bel-0.000
kins-0.000
tub-0.000
limits-0.000
Token goodbye
Token ID24829
Feature activation+5.03e-01

Pos logit contributions
glers+0.020
bel+0.020
kins+0.018
tub+0.018
limits+0.018
Neg logit contributions
-0.018
 mid-0.014
 dismay-0.014
itz-0.014
AY-0.013
Token to
Token ID284
Feature activation+6.53e-03

Pos logit contributions
+0.011
 �+0.007
 Will+0.007
 farewell+0.007
 mid+0.007
Neg logit contributions
glers-0.018
bel-0.017
kins-0.016
tub-0.016
limits-0.016
Token her
Token ID607
Feature activation+2.19e+00

Pos logit contributions
+0.000
nown+0.000
AY+0.000
 fingert+0.000
itz+0.000
Neg logit contributions
glers-0.000
bel-0.000
kins-0.000
tub-0.000
limits-0.000
Token t
Token ID256
Feature activation+3.06e+00

Pos logit contributions
+0.051
AY+0.036
Mine+0.035
OW+0.034
 mid+0.034
Neg logit contributions
glers-0.072
bel-0.069
kins-0.066
tub-0.065
asis-0.061
Tokeneddy
Token ID21874
Feature activation+7.32e-02

Pos logit contributions
+0.080
 mid+0.058
 Will+0.058
 dismay+0.058
 �+0.058
Neg logit contributions
glers-0.090
bel-0.086
kins-0.079
tub-0.079
limits-0.078
Token bear
Token ID6842
Feature activation+1.52e+00

Pos logit contributions
+0.002
 Will+0.001
 �+0.001
 mid+0.001
 farewell+0.001
Neg logit contributions
glers-0.002
bel-0.002
limits-0.002
tub-0.002
kins-0.002
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
+0.035
 mid+0.025
 dismay+0.025
itz+0.025
 fingert+0.025
Neg logit contributions
glers-0.048
bel-0.047
tub-0.043
kins-0.043
limits-0.042
Token promised
Token ID8072
Feature activation+1.21e+00

Pos logit contributions
+0.024
 Will+0.017
 farewell+0.017
 t+0.016
 dismay+0.016
Neg logit contributions
glers-0.033
bel-0.031
tub-0.029
akers-0.029
ained-0.029
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
+0.028
 mid+0.020
 dismay+0.020
 Will+0.020
 �+0.020
Neg logit contributions
glers-0.039
bel-0.038
limits-0.034
tub-0.034
kins-0.034
Token Tom
Token ID4186
Feature activation+3.75e-01

Pos logit contributions
+0.005
 dismay+0.004
 mid+0.004
 fingert+0.004
itz+0.004
Neg logit contributions
glers-0.005
bel-0.005
tub-0.005
limits-0.004
kins-0.004
Token opened
Token ID4721
Feature activation+3.87e-01

Pos logit contributions
+0.009
itz+0.007
 mid+0.007
 dismay+0.007
AY+0.007
Neg logit contributions
bel-0.011
glers-0.011
kins-0.010
tub-0.010
limits-0.009
Token it
Token ID340
Feature activation+2.07e+00

Pos logit contributions
+0.010
itz+0.007
 fingert+0.007
 dismay+0.007
 mid+0.007
Neg logit contributions
glers-0.012
bel-0.011
kins-0.010
tub-0.010
limits-0.010
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
+0.051
itz+0.040
 dismay+0.039
Mine+0.038
 fingert+0.038
Neg logit contributions
glers-0.061
bel-0.059
tub-0.056
kins-0.055
limits-0.052
Token saw
Token ID2497
Feature activation-3.33e-01

Pos logit contributions
+0.040
 Will+0.030
 �+0.030
 mid+0.030
 dismay+0.030
Neg logit contributions
glers-0.036
bel-0.034
limits-0.030
tub-0.030
kins-0.030
Token a
Token ID257
Feature activation+2.61e+00

Pos logit contributions
glers+0.012
bel+0.011
limits+0.011
kins+0.011
asis+0.010
Neg logit contributions
-0.007
 �-0.005
 Will-0.005
 delight-0.005
 dismay-0.004
Token new
Token ID649
Feature activation+1.00e+00

Pos logit contributions
+0.067
 farewell+0.050
itz+0.050
AY+0.049
 mid+0.049
Neg logit contributions
glers-0.076
bel-0.073
tub-0.069
kins-0.067
asis-0.064
Token superhero
Token ID17343
Feature activation-2.58e-01

Pos logit contributions
+0.028
 Will+0.021
itz+0.021
 farewell+0.021
 mid+0.021
Neg logit contributions
glers-0.027
bel-0.026
tub-0.024
kins-0.023
asis-0.022
Token costume
Token ID16569
Feature activation+6.88e-02

Pos logit contributions
glers+0.009
bel+0.008
limits+0.008
ained+0.008
tub+0.008
Neg logit contributions
-0.006
 �-0.004
 tr-0.004
 Will-0.004
 t-0.004
Token.
Token ID13
Feature activation-6.88e-02

Pos logit contributions
+0.002
 �+0.001
 mid+0.001
 tr+0.001
 Will+0.001
Neg logit contributions
glers-0.002
bel-0.002
limits-0.002
kins-0.002
akers-0.002
Token It
Token ID632
Feature activation+1.60e+00

Pos logit contributions
glers+0.002
bel+0.002
tub+0.002
limits+0.001
kins+0.001
Neg logit contributions
-0.002
itz-0.002
 fingert-0.002
 dismay-0.002
 mid-0.002
Token a
Token ID257
Feature activation+1.28e+00

Pos logit contributions
glers+0.025
bel+0.025
limits+0.023
tub+0.023
kins+0.022
Neg logit contributions
-0.020
 mid-0.014
AY-0.014
 fingertips-0.014
itz-0.014
Token big
Token ID1263
Feature activation-6.38e-01

Pos logit contributions
+0.034
 farewell+0.026
 Will+0.026
itz+0.025
 mid+0.025
Neg logit contributions
glers-0.036
bel-0.035
tub-0.033
kins-0.031
asis-0.030
Token door
Token ID3420
Feature activation-2.18e+00

Pos logit contributions
glers+0.016
bel+0.015
tub+0.014
kins+0.013
asis+0.013
Neg logit contributions
-0.019
 Will-0.015
itz-0.015
 farewell-0.015
 fingert-0.015
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
glers+0.071
bel+0.066
kins+0.063
tub+0.063
asis+0.061
Neg logit contributions
-0.054
 �-0.037
 mid-0.037
 dismay-0.034
 fingert-0.034
Token she
Token ID673
Feature activation-3.14e-01

Pos logit contributions
glers+0.040
bel+0.037
akers+0.036
asis+0.035
kins+0.035
Neg logit contributions
-0.025
 Will-0.016
 �-0.016
 dismay-0.015
 mid-0.015
Token hadn
Token ID8020
Feature activation+2.83e+00

Pos logit contributions
glers+0.008
bel+0.008
kins+0.007
tub+0.007
limits+0.007
Neg logit contributions
-0.009
 mid-0.007
itz-0.007
 farewell-0.007
 �-0.007
Token't
Token ID470
Feature activation-1.01e+00

Pos logit contributions
 delight+0.075
 dismay+0.072
 farewell+0.072
Mine+0.072
+0.072
Neg logit contributions
glers-0.061
bel-0.059
tub-0.056
limits-0.055
ics-0.053
Token seen
Token ID1775
Feature activation-1.10e+00

Pos logit contributions
glers+0.024
bel+0.022
kins+0.020
tub+0.020
limits+0.019
Neg logit contributions
-0.034
 mid-0.026
 �-0.026
 Will-0.026
 dismay-0.026
Token before
Token ID878
Feature activation+9.82e-02

Pos logit contributions
glers+0.030
bel+0.029
tub+0.027
limits+0.027
kins+0.026
Neg logit contributions
-0.028
Mine-0.023
 farewell-0.022
 dismay-0.021
 delight-0.021
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
+0.002
 Will+0.002
 farewell+0.002
itz+0.002
 dismay+0.002
Neg logit contributions
glers-0.003
bel-0.003
tub-0.003
kins-0.003
limits-0.003
Token decided
Token ID3066
Feature activation+1.16e-01

Pos logit contributions
+0.046
 �+0.033
 Will+0.032
 dismay+0.032
 mid+0.032
Neg logit contributions
glers-0.054
bel-0.051
kins-0.047
tub-0.047
akers-0.046
Token didn
Token ID1422
Feature activation+2.34e+00

Pos logit contributions
+0.032
 mid+0.023
 farewell+0.023
 �+0.023
 Will+0.022
Neg logit contributions
glers-0.044
bel-0.043
tub-0.039
kins-0.039
limits-0.039
Token't
Token ID470
Feature activation-7.07e-01

Pos logit contributions
 ear+0.064
 fingertips+0.063
 delight+0.063
 amaz+0.062
+0.062
Neg logit contributions
glers-0.050
bel-0.043
limits-0.042
tub-0.040
ained-0.040
Token work
Token ID670
Feature activation-7.41e-01

Pos logit contributions
glers+0.021
bel+0.020
kins+0.018
tub+0.018
limits+0.018
Neg logit contributions
-0.019
 �-0.014
 mid-0.014
 dismay-0.014
 Will-0.014
Token,
Token ID11
Feature activation-1.78e-01

Pos logit contributions
glers+0.024
bel+0.023
tub+0.021
kins+0.021
limits+0.021
Neg logit contributions
-0.017
 mid-0.012
 Will-0.012
 dismay-0.012
itz-0.012
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
glers+0.006
bel+0.006
kins+0.005
asis+0.005
tub+0.005
Neg logit contributions
-0.005
 �-0.003
 Will-0.003
 mid-0.003
 dismay-0.003
Token his
Token ID465
Feature activation+7.73e-01

Pos logit contributions
+0.030
 �+0.021
 Will+0.021
 dismay+0.020
 mid+0.019
Neg logit contributions
glers-0.043
bel-0.042
kins-0.039
tub-0.039
asis-0.039
Token mom
Token ID1995
Feature activation-1.89e+00

Pos logit contributions
+0.024
 �+0.018
AY+0.018
 mid+0.018
 Will+0.017
Neg logit contributions
glers-0.019
bel-0.019
kins-0.018
tub-0.018
akers-0.016
Token found
Token ID1043
Feature activation-8.48e-01

Pos logit contributions
glers+0.057
bel+0.057
tub+0.055
asis+0.054
kins+0.053
Neg logit contributions
-0.049
 �-0.034
 Will-0.033
 dismay-0.032
 t-0.032
Token out
Token ID503
Feature activation+5.77e-01

Pos logit contributions
glers+0.029
bel+0.028
tub+0.026
kins+0.026
asis+0.026
Neg logit contributions
-0.019
 �-0.013
 mid-0.013
 Will-0.012
 dismay-0.012
Token what
Token ID644
Feature activation+2.03e+00

Pos logit contributions
+0.016
 mid+0.011
 Will+0.011
 �+0.011
 dismay+0.011
Neg logit contributions
glers-0.017
bel-0.016
kins-0.015
tub-0.014
limits-0.014
Token he
Token ID339
Feature activation-3.07e+00

Pos logit contributions
+0.054
 mid+0.040
 dismay+0.040
 farewell+0.040
itz+0.040
Neg logit contributions
glers-0.059
bel-0.057
kins-0.051
tub-0.051
limits-0.051
Token don
Token ID836
Feature activation+3.23e+00

Pos logit contributions
+0.076
 mid+0.057
itz+0.056
 �+0.056
 dismay+0.056
Neg logit contributions
glers-0.090
bel-0.087
kins-0.079
tub-0.078
limits-0.078
Token't
Token ID470
Feature activation-7.38e-01

Pos logit contributions
+0.083
 farewell+0.075
 dismay+0.075
 fingertips+0.074
 fingert+0.074
Neg logit contributions
glers-0.082
bel-0.073
limits-0.069
kins-0.068
ics-0.066
Token eat
Token ID4483
Feature activation+5.39e-01

Pos logit contributions
glers+0.022
bel+0.020
limits+0.019
asis+0.019
kins+0.018
Neg logit contributions
-0.018
 �-0.014
 mid-0.014
 farewell-0.014
 Pret-0.014
Token me
Token ID502
Feature activation+7.75e-01

Pos logit contributions
+0.013
itz+0.010
 fingert+0.010
 mid+0.010
 dismay+0.010
Neg logit contributions
glers-0.016
bel-0.015
kins-0.014
limits-0.014
tub-0.014
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
+0.021
 mid+0.015
 �+0.014
 Will+0.014
 dismay+0.014
Neg logit contributions
glers-0.023
bel-0.023
tub-0.021
kins-0.020
limits-0.020
Token Mr
Token ID1770
Feature activation+5.07e-01

Pos logit contributions
+0.024
 mid+0.018
 dismay+0.017
 fingert+0.017
 farewell+0.017
Neg logit contributions
glers-0.031
bel-0.030
tub-0.027
kins-0.027
limits-0.027
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
+0.012
 Will+0.009
 mid+0.009
 �+0.009
 dismay+0.008
Neg logit contributions
glers-0.016
bel-0.016
limits-0.014
tub-0.014
kins-0.014
Token Bear
Token ID14732
Feature activation+5.44e-01

Pos logit contributions
+0.052
 farewell+0.040
 mid+0.040
 fingert+0.040
Mine+0.040
Neg logit contributions
glers-0.052
bel-0.049
tub-0.047
asis-0.045
kins-0.045
Token!"
Token ID2474
Feature activation-2.14e+00

Pos logit contributions
+0.014
nown+0.009
 �+0.009
 wo+0.008
 Will+0.008
Neg logit contributions
glers-0.017
bel-0.017
angers-0.016
ained-0.016
limits-0.016
Token the
Token ID262
Feature activation+9.77e-01

Pos logit contributions
glers+0.073
bel+0.069
akers+0.066
ained+0.066
asis+0.065
Neg logit contributions
-0.054
 Will-0.041
 �-0.038
 dismay-0.034
 delight-0.034
Token hunter
Token ID19177
Feature activation-1.13e+00

Pos logit contributions
+0.026
 mid+0.019
 �+0.019
 dismay+0.019
 farewell+0.019
Neg logit contributions
glers-0.028
bel-0.027
tub-0.025
kins-0.025
limits-0.024
Token came
Token ID1625
Feature activation+3.28e-01

Pos logit contributions
glers+0.005
bel+0.005
kins+0.005
limits+0.005
akers+0.005
Neg logit contributions
-0.004
 mid-0.003
 �-0.003
 Will-0.003
 farewell-0.003
Token out
Token ID503
Feature activation+1.67e+00

Pos logit contributions
+0.008
 Will+0.006
 dismay+0.006
 mid+0.006
 �+0.006
Neg logit contributions
glers-0.010
bel-0.010
tub-0.009
limits-0.009
kins-0.009
Token.
Token ID13
Feature activation+6.24e-01

Pos logit contributions
+0.038
 mid+0.026
 �+0.026
 dismay+0.025
 Will+0.025
Neg logit contributions
glers-0.056
bel-0.054
kins-0.050
tub-0.050
limits-0.050
Token Max
Token ID5436
Feature activation+1.14e+00

Pos logit contributions
+0.015
 mid+0.011
 dismay+0.010
 �+0.010
 Will+0.010
Neg logit contributions
glers-0.020
bel-0.019
tub-0.017
kins-0.017
limits-0.017
Token and
Token ID290
Feature activation+8.59e-01

Pos logit contributions
+0.027
 mid+0.020
 dismay+0.019
AY+0.019
itz+0.019
Neg logit contributions
glers-0.035
bel-0.034
tub-0.031
kins-0.031
limits-0.031
Token Sam
Token ID3409
Feature activation-5.16e-01

Pos logit contributions
+0.019
 Will+0.014
 �+0.014
 dismay+0.013
 mid+0.013
Neg logit contributions
glers-0.029
bel-0.028
asis-0.026
kins-0.026
limits-0.026
Token were
Token ID547
Feature activation+1.15e-01

Pos logit contributions
glers+0.014
bel+0.013
kins+0.012
tub+0.012
limits+0.012
Neg logit contributions
-0.015
 mid-0.011
 �-0.011
 Will-0.011
 dismay-0.011
Token happy
Token ID3772
Feature activation+6.09e-01

Pos logit contributions
+0.003
 dismay+0.002
 delight+0.002
 �+0.002
 mid+0.002
Neg logit contributions
glers-0.004
bel-0.004
kins-0.003
tub-0.003
limits-0.003
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
+0.015
 Will+0.011
 �+0.011
 mid+0.010
 dismay+0.010
Neg logit contributions
glers-0.020
bel-0.019
limits-0.017
kins-0.017
tub-0.017
Token They
Token ID1119
Feature activation-2.10e+00

Pos logit contributions
glers+0.029
bel+0.027
tub+0.024
kins+0.024
limits+0.024
Neg logit contributions
-0.042
 mid-0.034
 dismay-0.034
itz-0.033
 farewell-0.033
Token played
Token ID2826
Feature activation-1.35e+00

Pos logit contributions
glers+0.061
bel+0.059
tub+0.054
kins+0.054
limits+0.053
Neg logit contributions
-0.056
 mid-0.040
 �-0.040
 Will-0.040
 dismay-0.040
Token throws
Token ID12542
Feature activation-9.27e-01

Pos logit contributions
glers+0.007
bel+0.006
kins+0.006
tub+0.006
limits+0.005
Neg logit contributions
-0.006
 fingertips-0.004
 mid-0.004
 fingert-0.004
AY-0.004
Token some
Token ID617
Feature activation-8.89e-01

Pos logit contributions
glers+0.025
bel+0.024
kins+0.021
limits+0.020
ards+0.020
Neg logit contributions
-0.025
itz-0.021
AY-0.020
 fingertips-0.020
 fingert-0.020
Token bread
Token ID8509
Feature activation-7.83e-02

Pos logit contributions
glers+0.020
bel+0.019
kins+0.017
tub+0.016
akers+0.015
Neg logit contributions
-0.030
 dismay-0.024
itz-0.024
 Will-0.024
 �-0.024
Token on
Token ID319
Feature activation+1.15e+00

Pos logit contributions
glers+0.003
bel+0.002
tub+0.002
limits+0.002
kins+0.002
Neg logit contributions
-0.002
 �-0.001
 mid-0.001
 Will-0.001
 dismay-0.001
Token the
Token ID262
Feature activation-2.20e+00

Pos logit contributions
+0.033
 persever+0.027
nown+0.027
AY+0.027
itz+0.026
Neg logit contributions
glers-0.028
bel-0.025
kins-0.024
tub-0.024
limits-0.022
Token ground
Token ID2323
Feature activation+1.24e-01

Pos logit contributions
glers+0.052
bel+0.050
kins+0.045
tub+0.044
limits+0.044
Neg logit contributions
-0.070
 Will-0.055
 mid-0.055
 �-0.055
 dismay-0.054
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
+0.003
 �+0.002
 mid+0.002
 farewell+0.002
 fingert+0.002
Neg logit contributions
glers-0.004
bel-0.004
limits-0.004
kins-0.004
asis-0.004
Token Ben
Token ID3932
Feature activation+1.10e-01

Pos logit contributions
glers+0.010
bel+0.010
tub+0.009
kins+0.009
limits+0.009
Neg logit contributions
-0.009
 mid-0.006
 dismay-0.006
itz-0.006
 farewell-0.006
Token does
Token ID857
Feature activation+1.80e-01

Pos logit contributions
+0.003
itz+0.002
AY+0.002
 dismay+0.002
 fingertips+0.002
Neg logit contributions
glers-0.003
bel-0.003
kins-0.003
tub-0.003
limits-0.003
Token the
Token ID262
Feature activation-9.92e-01

Pos logit contributions
+0.006
 mid+0.005
AY+0.005
itz+0.005
 dismay+0.005
Neg logit contributions
glers-0.004
bel-0.004
kins-0.004
limits-0.003
tub-0.003
Token same
Token ID976
Feature activation+1.80e-01

Pos logit contributions
glers+0.030
bel+0.029
limits+0.026
tub+0.026
kins+0.026
Neg logit contributions
-0.026
 mid-0.019
 delight-0.019
 dismay-0.019
 farewell-0.019
Token even
Token ID772
Feature activation-1.51e+00

Pos logit contributions
+0.033
 Will+0.025
 �+0.024
 mid+0.023
 dismay+0.023
Neg logit contributions
glers-0.040
bel-0.036
akers-0.034
limits-0.034
asis-0.034
Token in
Token ID287
Feature activation+1.13e+00

Pos logit contributions
glers+0.045
bel+0.041
asis+0.040
limits+0.040
kins+0.040
Neg logit contributions
-0.039
 Will-0.030
 mid-0.029
 �-0.028
 dismay-0.028
Token the
Token ID262
Feature activation-9.17e-01

Pos logit contributions
+0.030
itz+0.023
AY+0.023
 farewell+0.022
nown+0.022
Neg logit contributions
glers-0.032
bel-0.030
kins-0.029
tub-0.028
limits-0.027
Token kitchen
Token ID9592
Feature activation-5.25e-01

Pos logit contributions
glers+0.022
bel+0.021
tub+0.019
kins+0.019
limits+0.019
Neg logit contributions
-0.029
 �-0.022
 mid-0.022
itz-0.022
 Will-0.022
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
glers+0.019
bel+0.019
limits+0.018
ained+0.017
asis+0.017
Neg logit contributions
-0.010
 �-0.006
 Will-0.006
 tr-0.006
 delight-0.006
Token But
Token ID887
Feature activation+9.39e-01

Pos logit contributions
+0.003
 mid+0.002
 dismay+0.002
itz+0.002
 Will+0.002
Neg logit contributions
glers-0.003
bel-0.003
tub-0.002
limits-0.002
kins-0.002
Token he
Token ID339
Feature activation+3.30e-01

Pos logit contributions
+0.024
 �+0.017
 Will+0.017
 mid+0.017
 dismay+0.017
Neg logit contributions
glers-0.029
bel-0.028
tub-0.025
kins-0.025
limits-0.025
Token could
Token ID714
Feature activation+5.54e-01

Pos logit contributions
+0.009
 mid+0.007
 farewell+0.007
 �+0.006
AY+0.006
Neg logit contributions
glers-0.010
bel-0.009
kins-0.008
tub-0.008
limits-0.008
Token not
Token ID407
Feature activation+2.72e-01

Pos logit contributions
+0.014
 dismay+0.010
 Will+0.010
 t+0.009
 delight+0.009
Neg logit contributions
glers-0.019
akers-0.018
cards-0.017
kins-0.017
avers-0.017
Token find
Token ID1064
Feature activation-3.01e-01

Pos logit contributions
+0.009
 farewell+0.008
 �+0.007
AY+0.007
 persever+0.007
Neg logit contributions
glers-0.006
bel-0.005
tub-0.005
kins-0.005
asis-0.005
Token the
Token ID262
Feature activation-8.24e-01

Pos logit contributions
glers+0.008
bel+0.008
limits+0.007
kins+0.007
tub+0.007
Neg logit contributions
-0.008
itz-0.006
AY-0.006
 �-0.006
 mid-0.006
Token
Token ID198
Feature activation+2.95e-02

Pos logit contributions
glers+0.011
bel+0.010
tub+0.010
kins+0.010
limits+0.010
Neg logit contributions
-0.007
 mid-0.005
 dismay-0.005
 Will-0.005
 fingert-0.005
TokenThe
Token ID464
Feature activation-2.45e-01

Pos logit contributions
+0.001
 mid+0.001
 dismay+0.001
 fingert+0.001
itz+0.001
Neg logit contributions
glers-0.001
bel-0.001
limits-0.001
tub-0.001
kins-0.001
Token little
Token ID1310
Feature activation+5.98e-01

Pos logit contributions
glers+0.007
bel+0.007
tub+0.007
kins+0.007
asis+0.006
Neg logit contributions
-0.006
 mid-0.004
 Will-0.004
AY-0.004
 fingert-0.004
Token girl
Token ID2576
Feature activation-4.18e-01

Pos logit contributions
+0.019
 mid+0.014
 �+0.014
 Will+0.014
 dismay+0.014
Neg logit contributions
glers-0.015
bel-0.014
kins-0.013
tub-0.013
limits-0.012
Token felt
Token ID2936
Feature activation-9.11e-01

Pos logit contributions
glers+0.012
bel+0.012
kins+0.010
limits+0.010
tub+0.010
Neg logit contributions
-0.010
AY-0.008
 mid-0.008
itz-0.008
Mine-0.008
Token much
Token ID881
Feature activation-2.68e+00

Pos logit contributions
glers+0.026
bel+0.025
tub+0.023
kins+0.022
limits+0.022
Neg logit contributions
-0.025
 mid-0.018
 Will-0.018
 �-0.018
itz-0.018
Token better
Token ID1365
Feature activation-1.68e+00

Pos logit contributions
glers+0.077
bel+0.073
limits+0.068
kins+0.067
tub+0.067
Neg logit contributions
-0.068
 Will-0.052
 mid-0.052
AY-0.052
itz-0.052
Token after
Token ID706
Feature activation-2.49e-01

Pos logit contributions
glers+0.055
bel+0.053
tub+0.049
kins+0.049
limits+0.049
Neg logit contributions
-0.039
 mid-0.027
 �-0.027
 Will-0.026
 dismay-0.026
Token that
Token ID326
Feature activation-2.61e+00

Pos logit contributions
glers+0.007
bel+0.006
kins+0.006
tub+0.006
limits+0.006
Neg logit contributions
-0.007
Mine-0.005
 farewell-0.005
 fingert-0.005
 dismay-0.005
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
glers+0.076
bel+0.073
kins+0.068
tub+0.067
limits+0.067
Neg logit contributions
-0.066
 Will-0.049
AY-0.049
itz-0.049
 mid-0.049
Token She
Token ID1375
Feature activation-6.77e-01

Pos logit contributions
+0.005
 mid+0.004
 fingert+0.004
itz+0.004
 dismay+0.004
Neg logit contributions
glers-0.003
bel-0.003
tub-0.003
limits-0.003
ained-0.003
Token't
Token ID470
Feature activation-7.64e-01

Pos logit contributions
+0.054
 fingert+0.046
 delight+0.046
 fingertips+0.046
 dismay+0.046
Neg logit contributions
glers-0.056
bel-0.052
limits-0.049
tub-0.048
ained-0.046
Token get
Token ID651
Feature activation-6.43e-01

Pos logit contributions
glers+0.021
bel+0.020
tub+0.019
kins+0.018
limits+0.018
Neg logit contributions
-0.021
 �-0.016
 dismay-0.016
 mid-0.016
itz-0.016
Token out
Token ID503
Feature activation+3.06e-01

Pos logit contributions
glers+0.022
bel+0.021
tub+0.020
kins+0.019
limits+0.019
Neg logit contributions
-0.015
 Will-0.010
 mid-0.010
 dismay-0.010
 farewell-0.010
Token of
Token ID286
Feature activation-7.50e-01

Pos logit contributions
+0.007
 mid+0.005
 �+0.005
 Will+0.005
 dismay+0.005
Neg logit contributions
glers-0.010
bel-0.009
tub-0.009
kins-0.009
limits-0.009
Token the
Token ID262
Feature activation-1.40e+00

Pos logit contributions
glers+0.023
bel+0.022
kins+0.020
tub+0.020
limits+0.020
Neg logit contributions
-0.019
 �-0.014
itz-0.014
 fingert-0.013
 mid-0.013
Token cave
Token ID11527
Feature activation-7.07e-01

Pos logit contributions
bel+0.035
glers+0.034
kins+0.030
limits+0.029
akers+0.029
Neg logit contributions
-0.044
 dismay-0.035
 Will-0.035
 mid-0.035
 t-0.034
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
glers+0.026
bel+0.026
kins+0.024
akers+0.024
limits+0.024
Neg logit contributions
-0.014
 �-0.008
 mid-0.008
 dismay-0.008
 tr-0.007
Token The
Token ID383
Feature activation+6.96e-01

Pos logit contributions
glers+0.008
bel+0.007
kins+0.007
tub+0.007
akers+0.007
Neg logit contributions
-0.005
 �-0.004
 Will-0.004
 dismay-0.004
 farewell-0.004
Token blanket
Token ID18447
Feature activation+2.64e-01

Pos logit contributions
+0.017
 Will+0.012
 dismay+0.012
 mid+0.012
 �+0.012
Neg logit contributions
glers-0.022
bel-0.022
limits-0.020
kins-0.019
tub-0.019
Token couldn
Token ID3521
Feature activation+1.60e+00

Pos logit contributions
+0.006
 dismay+0.004
 t+0.003
 �+0.003
 mid+0.003
Neg logit contributions
glers-0.009
bel-0.009
akers-0.009
limits-0.008
angers-0.008
Token't
Token ID470
Feature activation-1.16e+00

Pos logit contributions
+0.040
 dismay+0.033
 delight+0.033
 fingert+0.033
 farewell+0.033
Neg logit contributions
glers-0.044
bel-0.041
limits-0.038
tub-0.038
kins-0.037
TokenTim
Token ID14967
Feature activation-1.96e-01

Pos logit contributions
glers+0.002
bel+0.002
kins+0.002
tub+0.002
limits+0.002
Neg logit contributions
-0.001
 �-0.001
Mine-0.001
 mid-0.001
 Will-0.001
Tokenmy
Token ID1820
Feature activation+1.94e-02

Pos logit contributions
glers+0.005
bel+0.005
tub+0.004
kins+0.004
limits+0.004
Neg logit contributions
-0.006
 �-0.005
 mid-0.005
 dismay-0.005
 Will-0.005
Token's
Token ID338
Feature activation+1.26e+00

Pos logit contributions
+0.000
itz+0.000
 mid+0.000
 �+0.000
 fingert+0.000
Neg logit contributions
glers-0.001
bel-0.001
kins-0.001
tub-0.001
limits-0.001
Token mom
Token ID1995
Feature activation-1.30e+00

Pos logit contributions
+0.030
itz+0.020
 mid+0.020
 �+0.020
AY+0.019
Neg logit contributions
bel-0.041
glers-0.041
tub-0.037
kins-0.037
akers-0.035
Token then
Token ID788
Feature activation-1.03e+00

Pos logit contributions
glers+0.038
bel+0.037
limits+0.034
tub+0.034
kins+0.033
Neg logit contributions
-0.035
 Will-0.025
 �-0.025
 mid-0.025
 dismay-0.025
Token said
Token ID531
Feature activation-1.22e+00

Pos logit contributions
glers+0.029
bel+0.028
tub+0.026
kins+0.026
limits+0.025
Neg logit contributions
-0.029
 �-0.021
 Will-0.021
 mid-0.021
 dismay-0.021
Token it
Token ID340
Feature activation+1.90e+00

Pos logit contributions
glers+0.033
bel+0.032
tub+0.029
kins+0.029
limits+0.029
Neg logit contributions
-0.035
 �-0.027
 Will-0.026
 mid-0.026
 dismay-0.026
Token was
Token ID373
Feature activation+4.41e-01

Pos logit contributions
+0.042
 mid+0.030
 dismay+0.030
itz+0.029
 fingert+0.029
Neg logit contributions
glers-0.063
bel-0.061
tub-0.056
kins-0.056
limits-0.055
Token time
Token ID640
Feature activation+2.98e-01

Pos logit contributions
+0.009
 �+0.007
 Will+0.006
 mid+0.006
 dismay+0.006
Neg logit contributions
glers-0.016
bel-0.015
limits-0.014
tub-0.014
asis-0.014
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
+0.007
 �+0.005
 mid+0.004
 Will+0.004
 farewell+0.004
Neg logit contributions
glers-0.010
bel-0.010
kins-0.009
tub-0.009
limits-0.009
Token go
Token ID467
Feature activation-1.93e-01

Pos logit contributions
+0.002
 �+0.002
 Will+0.002
 mid+0.002
 fingert+0.002
Neg logit contributions
glers-0.003
bel-0.003
kins-0.003
tub-0.003
limits-0.003
Token and
Token ID290
Feature activation+6.02e-01

Pos logit contributions
+0.037
 dismay+0.029
 farewell+0.028
 persever+0.027
 mid+0.027
Neg logit contributions
glers-0.042
bel-0.039
tub-0.037
kins-0.036
limits-0.035
Token yellow
Token ID7872
Feature activation+1.32e+00

Pos logit contributions
+0.015
AY+0.013
 dismay+0.013
 farewell+0.012
itz+0.012
Neg logit contributions
glers-0.017
bel-0.016
kins-0.015
tub-0.014
limits-0.014
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
+0.037
 dismay+0.030
 persever+0.029
 farewell+0.028
itz+0.028
Neg logit contributions
glers-0.036
bel-0.033
tub-0.031
kins-0.031
ained-0.029
Token
Token ID198
Feature activation-6.07e-01

Pos logit contributions
glers+0.032
bel+0.031
tub+0.028
kins+0.028
limits+0.028
Neg logit contributions
-0.025
 mid-0.018
 dismay-0.018
 farewell-0.017
 fingert-0.017
Token
Token ID198
Feature activation-7.58e-01

Pos logit contributions
glers+0.019
bel+0.019
tub+0.017
kins+0.017
limits+0.017
Neg logit contributions
-0.014
 dismay-0.010
 mid-0.010
itz-0.010
 farewell-0.010
TokenThey
Token ID2990
Feature activation-1.26e+00

Pos logit contributions
glers+0.027
bel+0.026
tub+0.025
asis+0.024
kins+0.024
Neg logit contributions
-0.017
 �-0.010
Mine-0.010
 mid-0.009
itz-0.009
Token watch
Token ID2342
Feature activation+2.28e+00

Pos logit contributions
glers+0.036
bel+0.034
kins+0.031
tub+0.030
limits+0.030
Neg logit contributions
-0.031
nown-0.024
itz-0.024
 �-0.024
 dismay-0.024
Token the
Token ID262
Feature activation+1.72e-01

Pos logit contributions
+0.067
 persever+0.053
itz+0.053
 fingert+0.052
nown+0.052
Neg logit contributions
bel-0.056
glers-0.056
tub-0.049
kins-0.047
ained-0.046
Token sunrise
Token ID40048
Feature activation+1.23e+00

Pos logit contributions
+0.005
 fingert+0.004
 Will+0.004
 dismay+0.004
 persever+0.004
Neg logit contributions
glers-0.005
bel-0.004
tub-0.004
asis-0.004
kins-0.004
Token for
Token ID329
Feature activation-5.12e-01

Pos logit contributions
+0.034
 dismay+0.025
 Will+0.025
 mid+0.025
 �+0.025
Neg logit contributions
glers-0.035
bel-0.034
tub-0.030
kins-0.030
limits-0.030
Token a
Token ID257
Feature activation+5.31e-01

Pos logit contributions
glers+0.013
bel+0.012
kins+0.012
ts+0.011
akers+0.011
Neg logit contributions
-0.015
nown-0.012
 �-0.011
AY-0.010
se-0.010
Token looked
Token ID3114
Feature activation-1.48e+00

Pos logit contributions
glers+0.056
bel+0.052
kins+0.048
tub+0.047
limits+0.047
Neg logit contributions
-0.069
 �-0.054
 farewell-0.053
 mid-0.053
itz-0.053
Token forward
Token ID2651
Feature activation+1.25e+00

Pos logit contributions
glers+0.050
bel+0.048
limits+0.045
tub+0.044
kins+0.044
Neg logit contributions
-0.033
 �-0.023
 mid-0.023
 farewell-0.023
 Will-0.023
Token to
Token ID284
Feature activation-1.62e-01

Pos logit contributions
+0.031
 dismay+0.022
Mine+0.022
itz+0.022
 fingert+0.022
Neg logit contributions
glers-0.038
bel-0.038
tub-0.034
kins-0.033
limits-0.033
Token trying
Token ID2111
Feature activation-7.33e-01

Pos logit contributions
glers+0.004
bel+0.004
kins+0.003
tub+0.003
akers+0.003
Neg logit contributions
-0.005
 fingert-0.004
itz-0.004
 �-0.004
nown-0.004
Token new
Token ID649
Feature activation-1.56e+00

Pos logit contributions
glers+0.019
bel+0.018
kins+0.017
tub+0.016
akers+0.016
Neg logit contributions
-0.021
itz-0.016
nown-0.016
AY-0.016
Mine-0.016
Token kinds
Token ID6982
Feature activation-1.65e+00

Pos logit contributions
glers+0.034
bel+0.033
kins+0.029
tub+0.028
limits+0.028
Neg logit contributions
-0.053
 �-0.042
itz-0.041
 mid-0.041
 Will-0.041
Token of
Token ID286
Feature activation-3.20e+00

Pos logit contributions
glers+0.052
bel+0.050
limits+0.045
tub+0.045
kins+0.045
Neg logit contributions
-0.040
 �-0.028
 mid-0.028
 Will-0.028
 dismay-0.028
Token cake
Token ID12187
Feature activation-4.48e-01

Pos logit contributions
glers+0.082
bel+0.079
kins+0.072
limits+0.069
tub+0.069
Neg logit contributions
-0.096
 �-0.074
itz-0.074
 farewell-0.073
 Will-0.073
Token together
Token ID1978
Feature activation-1.21e-01

Pos logit contributions
glers+0.013
bel+0.013
limits+0.012
tub+0.012
kins+0.012
Neg logit contributions
-0.012
 Will-0.008
 �-0.008
 mid-0.008
 dismay-0.008
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
glers+0.004
bel+0.003
kins+0.003
tub+0.003
akers+0.003
Neg logit contributions
-0.003
itz-0.002
 dismay-0.002
 farewell-0.002
 fingert-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.63e-01

Pos logit contributions
glers+0.017
bel+0.016
tub+0.015
limits+0.015
kins+0.015
Neg logit contributions
-0.012
 mid-0.009
itz-0.009
 dismay-0.009
 fingert-0.009
Token have
Token ID423
Feature activation-4.20e-01

Pos logit contributions
glers+0.024
bel+0.023
tub+0.020
kins+0.020
limits+0.020
Neg logit contributions
-0.026
 �-0.021
itz-0.020
 mid-0.020
 fingert-0.020
Token it
Token ID340
Feature activation+7.84e-01

Pos logit contributions
glers+0.012
bel+0.012
kins+0.011
tub+0.011
limits+0.011
Neg logit contributions
-0.011
itz-0.008
 �-0.008
 mid-0.008
 fingert-0.008
Token.
Token ID13
Feature activation+8.78e-02

Pos logit contributions
+0.018
 mid+0.012
 �+0.012
 Will+0.012
 farewell+0.012
Neg logit contributions
glers-0.026
bel-0.025
kins-0.023
limits-0.023
tub-0.023
Token He
Token ID679
Feature activation-8.99e-01

Pos logit contributions
+0.003
itz+0.002
 mid+0.002
 farewell+0.002
 fingert+0.002
Neg logit contributions
glers-0.002
bel-0.002
tub-0.002
limits-0.002
kins-0.002
Token smiled
Token ID13541
Feature activation+4.49e-01

Pos logit contributions
glers+0.024
bel+0.023
tub+0.021
kins+0.021
limits+0.020
Neg logit contributions
-0.025
 mid-0.019
 �-0.019
itz-0.019
 dismay-0.019
Token as
Token ID355
Feature activation-3.79e+00

Pos logit contributions
+0.010
 mid+0.007
 dismay+0.007
 Will+0.007
 �+0.007
Neg logit contributions
glers-0.014
bel-0.014
tub-0.013
kins-0.013
limits-0.013
Token he
Token ID339
Feature activation-1.32e+00

Pos logit contributions
glers+0.093
bel+0.086
limits+0.080
tub+0.078
asis+0.077
Neg logit contributions
-0.122
 Will-0.097
 �-0.095
 farewell-0.094
 mid-0.092
Token saw
Token ID2497
Feature activation-1.19e+00

Pos logit contributions
glers+0.032
bel+0.031
kins+0.026
tub+0.026
asis+0.026
Neg logit contributions
-0.040
 �-0.033
AY-0.032
 mid-0.032
 dismay-0.032
Token all
Token ID477
Feature activation-5.48e-01

Pos logit contributions
glers+0.038
bel+0.037
kins+0.034
tub+0.034
limits+0.034
Neg logit contributions
-0.028
 mid-0.020
 Will-0.020
 �-0.020
 dismay-0.020
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
bel+0.018
glers+0.018
kins+0.017
limits+0.017
tub+0.017
Neg logit contributions
-0.012
 mid-0.008
 �-0.008
 delight-0.008
 Will-0.008
Token animals
Token ID4695
Feature activation-1.18e+00

Pos logit contributions
glers+0.044
bel+0.042
kins+0.038
tub+0.038
limits+0.037
Neg logit contributions
-0.041
 �-0.030
 mid-0.030
 dismay-0.030
 Will-0.030
Token good
Token ID922
Feature activation-2.99e+00

Pos logit contributions
glers+0.043
bel+0.042
tub+0.038
kins+0.038
limits+0.038
Neg logit contributions
-0.032
 mid-0.022
 �-0.022
 Will-0.022
 dismay-0.022
Token idea
Token ID2126
Feature activation-4.53e+00

Pos logit contributions
glers+0.097
bel+0.091
limits+0.086
kins+0.085
tub+0.081
Neg logit contributions
-0.072
 farewell-0.052
 dismay-0.052
 Will-0.052
 �-0.051
Token.
Token ID13
Feature activation-6.13e-01

Pos logit contributions
glers+0.162
bel+0.157
limits+0.146
tub+0.143
kins+0.142
Neg logit contributions
-0.096
 �-0.063
 Will-0.059
 mid-0.057
 fingert-0.056
Token We
Token ID775
Feature activation-7.93e-01

Pos logit contributions
glers+0.018
bel+0.017
kins+0.016
tub+0.016
limits+0.016
Neg logit contributions
-0.016
 �-0.012
 Will-0.012
 mid-0.012
 dismay-0.012
Token should
Token ID815
Feature activation-1.70e+00

Pos logit contributions
glers+0.024
bel+0.024
limits+0.022
kins+0.022
tub+0.022
Neg logit contributions
-0.021
 Will-0.015
 �-0.014
 dismay-0.014
 farewell-0.014
Token always
Token ID1464
Feature activation-3.83e+00

Pos logit contributions
bel+0.055
glers+0.055
tub+0.050
kins+0.049
limits+0.048
Neg logit contributions
-0.043
 dismay-0.028
 Will-0.028
 delight-0.028
 �-0.028
Token pick
Token ID2298
Feature activation-8.58e-01

Pos logit contributions
glers+0.120
bel+0.117
kins+0.107
limits+0.107
tub+0.106
Neg logit contributions
-0.088
 mid-0.064
 �-0.064
 Will-0.063
itz-0.063
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
glers+0.023
bel+0.022
kins+0.020
tub+0.020
limits+0.020
Neg logit contributions
-0.025
 mid-0.019
 �-0.019
 dismay-0.019
 Will-0.019
Token right
Token ID826
Feature activation-1.08e+00

Pos logit contributions
glers+0.067
bel+0.064
tub+0.058
kins+0.058
limits+0.057
Neg logit contributions
-0.070
 Will-0.052
 mid-0.052
 dismay-0.052
 �-0.052
Token word
Token ID1573
Feature activation+1.47e+00

Pos logit contributions
glers+0.033
bel+0.031
tub+0.028
limits+0.028
kins+0.028
Neg logit contributions
-0.028
 �-0.020
 mid-0.020
 dismay-0.020
 farewell-0.020
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
+0.036
itz+0.027
 dismay+0.027
 mid+0.027
 farewell+0.027
Neg logit contributions
glers-0.044
bel-0.042
tub-0.039
kins-0.039
limits-0.038
Token did
Token ID750
Feature activation-6.79e-01

Pos logit contributions
glers+0.020
bel+0.019
kins+0.017
tub+0.017
limits+0.016
Neg logit contributions
-0.022
itz-0.017
 �-0.017
 mid-0.017
AY-0.017
Token not
Token ID407
Feature activation-9.97e-01

Pos logit contributions
glers+0.014
bel+0.014
kins+0.012
limits+0.012
tub+0.012
Neg logit contributions
-0.023
itz-0.018
 mid-0.018
 Will-0.018
 fingert-0.018
Token notice
Token ID4003
Feature activation-1.17e-01

Pos logit contributions
glers+0.029
bel+0.027
kins+0.025
tub+0.025
limits+0.025
Neg logit contributions
-0.027
 �-0.020
 mid-0.020
 dismay-0.020
 Will-0.020
Token what
Token ID644
Feature activation+1.27e+00

Pos logit contributions
glers+0.003
bel+0.003
tub+0.003
kins+0.003
limits+0.003
Neg logit contributions
-0.003
 mid-0.002
 �-0.002
 dismay-0.002
 fingert-0.002
Token Max
Token ID5436
Feature activation-2.00e-01

Pos logit contributions
+0.036
 mid+0.028
itz+0.027
 fingert+0.027
AY+0.027
Neg logit contributions
glers-0.035
bel-0.033
tub-0.030
kins-0.030
limits-0.029
Token did
Token ID750
Feature activation-3.88e+00

Pos logit contributions
bel+0.007
glers+0.007
limits+0.006
tub+0.006
kins+0.006
Neg logit contributions
-0.005
 Will-0.003
 �-0.003
 dismay-0.003
 mid-0.003
Token at
Token ID379
Feature activation-2.38e+00

Pos logit contributions
glers+0.135
bel+0.129
limits+0.119
tub+0.119
kins+0.117
Neg logit contributions
-0.086
 �-0.056
 mid-0.055
 dismay-0.054
 Will-0.053
Token first
Token ID717
Feature activation-2.15e+00

Pos logit contributions
glers+0.063
kins+0.055
tub+0.053
ics+0.052
ained+0.051
Neg logit contributions
-0.073
 farewell-0.055
nown-0.055
 wo-0.053
Sur-0.053
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
glers+0.069
bel+0.067
tub+0.061
kins+0.061
limits+0.061
Neg logit contributions
-0.051
 mid-0.035
 �-0.035
 Will-0.035
 dismay-0.035
Token They
Token ID1119
Feature activation-8.20e-01

Pos logit contributions
glers+0.043
bel+0.041
kins+0.038
asis+0.038
akers+0.036
Neg logit contributions
-0.054
 �-0.043
 Will-0.042
 dismay-0.041
Mine-0.041
Token were
Token ID547
Feature activation-4.66e-01

Pos logit contributions
glers+0.022
bel+0.021
kins+0.019
tub+0.019
limits+0.019
Neg logit contributions
-0.023
 mid-0.018
 �-0.017
 Will-0.017
itz-0.017
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
glers+0.071
bel+0.067
limits+0.064
asis+0.063
akers+0.063
Neg logit contributions
-0.032
 �-0.021
 Will-0.020
 delight-0.019
 dismay-0.018
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
glers+0.006
bel+0.006
limits+0.006
kins+0.006
tub+0.006
Neg logit contributions
-0.003
 �-0.002
 Will-0.002
 dismay-0.002
 mid-0.002
Token"
Token ID1
Feature activation-1.88e+00

Pos logit contributions
glers+0.004
bel+0.004
limits+0.004
tub+0.004
kins+0.004
Neg logit contributions
-0.002
 �-0.001
 dismay-0.001
Mine-0.001
 tr-0.001
TokenGood
Token ID10248
Feature activation-9.27e-01

Pos logit contributions
bel+0.043
glers+0.042
limits+0.036
kins+0.036
tub+0.034
Neg logit contributions
-0.059
 dismay-0.049
 mid-0.049
 farewell-0.048
 Will-0.047
Token idea
Token ID2126
Feature activation-3.05e+00

Pos logit contributions
glers+0.026
bel+0.024
limits+0.022
kins+0.022
tub+0.021
Neg logit contributions
-0.027
 three-0.020
 tr-0.020
 ear-0.020
 Will-0.020
Token,"
Token ID553
Feature activation-3.91e+00

Pos logit contributions
glers+0.081
bel+0.077
tub+0.069
kins+0.069
limits+0.069
Neg logit contributions
-0.090
 �-0.068
 mid-0.067
 Will-0.067
 dismay-0.067
Token Mia
Token ID32189
Feature activation-4.11e-01

Pos logit contributions
glers+0.133
bel+0.130
limits+0.126
asis+0.124
tub+0.123
Neg logit contributions
-0.089
 Will-0.062
 �-0.060
 delight-0.060
 dismay-0.056
Token says
Token ID1139
Feature activation-3.17e+00

Pos logit contributions
glers+0.010
bel+0.009
kins+0.008
tub+0.008
limits+0.008
Neg logit contributions
-0.012
 dismay-0.010
 fingertips-0.010
 fingert-0.010
 mid-0.010
Token.
Token ID13
Feature activation-9.94e-01

Pos logit contributions
glers+0.092
bel+0.090
kins+0.083
tub+0.081
limits+0.081
Neg logit contributions
-0.081
 dismay-0.061
itz-0.060
 mid-0.060
Mine-0.059
Token She
Token ID1375
Feature activation-1.28e+00

Pos logit contributions
glers+0.031
bel+0.030
tub+0.028
kins+0.027
limits+0.027
Neg logit contributions
-0.024
 dismay-0.017
 mid-0.017
 farewell-0.017
 fingert-0.017
Token helps
Token ID5419
Feature activation-4.14e-01

Pos logit contributions
glers+0.033
bel+0.032
kins+0.031
tub+0.029
avers+0.029
Neg logit contributions
-0.032
 dismay-0.028
 fingertips-0.027
 mid-0.026
 fingert-0.026
Token She
Token ID1375
Feature activation+1.61e-01

Pos logit contributions
+0.016
 �+0.011
 mid+0.011
 Will+0.011
 dismay+0.011
Neg logit contributions
glers-0.024
bel-0.023
kins-0.021
tub-0.021
limits-0.021
Token didn
Token ID1422
Feature activation+3.12e+00

Pos logit contributions
+0.004
 mid+0.003
 �+0.003
 dismay+0.003
 Will+0.003
Neg logit contributions
glers-0.005
bel-0.005
kins-0.004
tub-0.004
limits-0.004
Token't
Token ID470
Feature activation-3.38e-01

Pos logit contributions
+0.079
 delight+0.073
 fingertips+0.072
 dismay+0.071
 farewell+0.071
Neg logit contributions
glers-0.080
bel-0.075
tub-0.069
ained-0.069
limits-0.068
Token know
Token ID760
Feature activation-6.39e-01

Pos logit contributions
glers+0.013
bel+0.012
kins+0.012
cards+0.012
tub+0.011
Neg logit contributions
-0.007
 Will-0.004
 ear-0.004
 three-0.004
 t-0.004
Token what
Token ID644
Feature activation+1.65e+00

Pos logit contributions
glers+0.018
bel+0.017
kins+0.016
tub+0.016
limits+0.016
Neg logit contributions
-0.017
AY-0.013
 farewell-0.013
Mine-0.013
 mid-0.013
Token to
Token ID284
Feature activation-3.92e+00

Pos logit contributions
+0.040
 mid+0.029
 dismay+0.029
 farewell+0.029
itz+0.028
Neg logit contributions
glers-0.051
bel-0.050
tub-0.046
kins-0.046
limits-0.045
Token do
Token ID466
Feature activation-2.99e+00

Pos logit contributions
bel+0.135
glers+0.133
tub+0.121
limits+0.120
cards+0.119
Neg logit contributions
-0.088
 mid-0.057
 delight-0.056
 dismay-0.055
 t-0.055
Token!
Token ID0
Feature activation+2.93e-01

Pos logit contributions
glers+0.104
bel+0.099
limits+0.091
tub+0.090
asis+0.090
Neg logit contributions
-0.067
 Will-0.043
 �-0.042
 mid-0.042
 dismay-0.041
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
+0.007
 mid+0.005
 fingert+0.005
 dismay+0.005
 farewell+0.005
Neg logit contributions
glers-0.009
bel-0.008
tub-0.008
limits-0.008
kins-0.007
Token
Token ID198
Feature activation+2.79e-01

Pos logit contributions
+0.004
 mid+0.003
 �+0.003
 Will+0.003
 dismay+0.003
Neg logit contributions
glers-0.007
bel-0.006
tub-0.006
kins-0.006
limits-0.006
TokenSuddenly
Token ID38582
Feature activation+5.13e-02

Pos logit contributions
+0.006
 mid+0.004
 dismay+0.004
 fingert+0.004
 Will+0.004
Neg logit contributions
glers-0.009
bel-0.009
limits-0.008
tub-0.008
kins-0.008