TOP ACTIVATIONS
MAX = 7.537

! The print was the bug 's new
Suddenly , the print began to move around the
and saw that the purple print had been ruined . They
started to happen . The print started to shine and spark
bird flies down and pe cks Tom on the head .
and laughed even harder . This made the band of fish
When the mixture was done , it looked
their arms and shouting . This made the tiger scared ,
. The card was very cheap . It
. He noticed that the print in his new coloring book
. The symbol worked ! Every time Andy
and sugar . The secret mixture was so good that it
be taller than her . This made Max very sad .
blue bike with t as sels on the handle bars and
. Now , the herb print hangs in the kitchen and
and his mom laughed . This was going to be the
. The symbol said goodbye and took off
bed . The cube was so happy to have
his family that night . This time , instead of running
was also very happy . This was the best day ever

MIN ACTIVATIONS
MIN = -11.038

to never try to catch Twe ety again . From that
who was trying to take Twe ety 's nest . Twe
dare to climb up to Twe ety 's nest .
The cat wanted to eat Twe ety ! Twe
Lucy noticed the weather was uncom fort ably hot . She
Tom was trying to catch Twe ety , but Twe ety
happy that she could help Twe ety and learned that it
! Tom wanted to catch Twe ety and eat her for
, Bugs said , " Twe ety , I have a
a little bird ie like Twe ety . So , Twe
world from up high like Twe ety , but he was
You should take your rain co ats and boots if you
like to pick flowers on Tues days ."
anas available for feeding the gor illas ." She said to
want to share my ice - cream with me ?"
. We cannot eat play - d oh or pr unes
some bananas and feed the gor illas ?" Tim my nodded
Hello ! My name is Twe ety . What are you
for my stickers and play - d ough . They are
that said " No T resp assing " and " Private

INTERVAL 2.893 - 7.537
CONTAINS 8.316%

a loud sound . All the other animals in the woods
fl icked his tail . He didn 't want to be
bike gently , but suddenly the gears got stuck . She
time . But when they looked up , the sun was
The suit was special because it could perform amazing tricks .

INTERVAL -1.751 - 2.893
CONTAINS 73.363%

shelf so it would be safe . Jill read and enjoyed
I love you , Luna . You are my best friend
to go on it . He ran to the slide and
picture was taken and then ran off to find something else
. One day , Tim said he wanted to

INTERVAL -6.394 - -1.751
CONTAINS 18.174%

, I will be your friend ." They played together all
, there was a little boy named Tim my . Tim
was a cute little pig named Pete . Pete had a
, there was a rich man named Tom . Tom had
cook . She wanted to be a good chef when she

INTERVAL -11.038 - -6.394
CONTAINS 0.147%

? You have a boo - bo o on your head
and Ben did not want to do that . They thought
's mom helped her sew Mr . Fl uffy back together
L ily didn 't want to accept mom my 's request
Ben . Do you want to buy some laundry ? I
Token!
Token ID0
Feature activation+9.71e-01

Pos logit contributions
lic+0.016
der+0.016
 is+0.015
 has+0.015
 can+0.015
Neg logit contributions
 argue-0.024
 reason-0.023
 worry-0.023
keeping-0.023
 weep-0.023
Token
Token ID220
Feature activation+6.54e-01

Pos logit contributions
der+0.015
 is+0.014
lic+0.014
 can+0.013
 makes+0.013
Neg logit contributions
keeping-0.022
cle-0.021
 argue-0.021
 moderation-0.020
 mischief-0.020
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
der+0.010
 is+0.009
 can+0.009
 has+0.009
lic+0.009
Neg logit contributions
 argue-0.016
cle-0.015
keeping-0.015
 reason-0.015
 elbows-0.015
Token
Token ID198
Feature activation+9.47e-01

Pos logit contributions
der+0.018
 is+0.016
 can+0.016
 has+0.015
 makes+0.015
Neg logit contributions
 argue-0.025
cle-0.024
keeping-0.024
 reason-0.024
 elbows-0.023
TokenThe
Token ID464
Feature activation+5.11e+00

Pos logit contributions
der+0.018
 is+0.017
 can+0.017
 has+0.017
 looks+0.016
Neg logit contributions
 argue-0.019
cle-0.019
 reason-0.019
aster-0.018
keeping-0.018
Token print
Token ID3601
Feature activation+7.50e+00

Pos logit contributions
 is+0.071
 can+0.067
 has+0.065
der+0.064
 makes+0.058
Neg logit contributions
keeping-0.139
 argue-0.139
cle-0.139
 elbows-0.137
ge-0.131
Token was
Token ID373
Feature activation+2.63e+00

Pos logit contributions
der+0.094
lic+0.088
 is+0.077
 can+0.076
boards+0.075
Neg logit contributions
 argue-0.197
 reason-0.195
 weep-0.191
cle-0.187
keeping-0.186
Token the
Token ID262
Feature activation+3.54e+00

Pos logit contributions
der+0.041
 is+0.038
 can+0.037
lic+0.036
boards+0.035
Neg logit contributions
 argue-0.064
cle-0.061
 reason-0.061
keeping-0.061
 elbows-0.060
Token bug
Token ID5434
Feature activation+1.41e+00

Pos logit contributions
 is+0.045
 can+0.045
 has+0.041
der+0.038
 looks+0.037
Neg logit contributions
keeping-0.101
 argue-0.100
 elbows-0.100
cle-0.100
ge-0.096
Token's
Token ID338
Feature activation+9.17e-01

Pos logit contributions
der+0.020
 is+0.018
lic+0.018
 can+0.018
 has+0.017
Neg logit contributions
 argue-0.035
keeping-0.034
cle-0.034
 reason-0.034
 weep-0.034
Token new
Token ID649
Feature activation+9.31e-01

Pos logit contributions
 is+0.016
 can+0.016
der+0.015
 has+0.015
 makes+0.014
Neg logit contributions
 argue-0.021
cle-0.020
 elbows-0.020
keeping-0.020
aster-0.019
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
der+0.009
 is+0.009
 can+0.008
 has+0.008
lic+0.008
Neg logit contributions
 argue-0.015
cle-0.014
keeping-0.014
 reason-0.014
 elbows-0.014
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
der+0.025
 is+0.021
boards+0.021
 can+0.021
 looks+0.021
Neg logit contributions
 argue-0.025
 reason-0.025
 weep-0.024
cle-0.024
Susan-0.024
TokenSuddenly
Token ID38582
Feature activation+7.68e-01

Pos logit contributions
der+0.027
boards+0.025
 is+0.025
 looks+0.024
lic+0.024
Neg logit contributions
 reason-0.016
 argue-0.016
cle-0.015
Susan-0.015
 weep-0.015
Token,
Token ID11
Feature activation+2.92e-01

Pos logit contributions
der+0.013
lic+0.012
boards+0.012
 is+0.011
 can+0.011
Neg logit contributions
 argue-0.017
 reason-0.016
 weep-0.016
 worry-0.016
 elbows-0.016
Token the
Token ID262
Feature activation+4.46e+00

Pos logit contributions
der+0.005
 is+0.005
lic+0.005
boards+0.005
 can+0.005
Neg logit contributions
 argue-0.006
 reason-0.006
 weep-0.006
 elbows-0.006
 panic-0.006
Token print
Token ID3601
Feature activation+7.32e+00

Pos logit contributions
der+0.077
 is+0.075
 can+0.072
lic+0.069
 has+0.069
Neg logit contributions
 argue-0.101
cle-0.098
keeping-0.096
 reason-0.096
 elbows-0.096
Token began
Token ID2540
Feature activation+1.54e+00

Pos logit contributions
der+0.118
lic+0.104
 is+0.103
 helps+0.100
 can+0.099
Neg logit contributions
 argue-0.162
 reason-0.162
 weep-0.157
cle-0.153
 worry-0.151
Token to
Token ID284
Feature activation-2.94e+00

Pos logit contributions
der+0.023
 is+0.022
 can+0.021
 has+0.020
lic+0.020
Neg logit contributions
 argue-0.039
cle-0.038
keeping-0.037
 elbows-0.037
 reason-0.037
Token move
Token ID1445
Feature activation+1.15e+00

Pos logit contributions
 argue+0.062
cle+0.058
 reason+0.058
keeping+0.057
 elbows+0.057
Neg logit contributions
der-0.056
 is-0.054
 can-0.052
lic-0.052
 has-0.051
Token around
Token ID1088
Feature activation-7.79e-01

Pos logit contributions
der+0.019
lic+0.018
 is+0.017
 has+0.017
 helps+0.017
Neg logit contributions
 argue-0.026
keeping-0.025
 reason-0.025
 worry-0.024
 afar-0.024
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
 argue+0.019
 reason+0.018
keeping+0.018
 weep+0.018
cle+0.017
Neg logit contributions
der-0.012
lic-0.011
 is-0.011
 can-0.011
 has-0.011
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 argue+0.002
 reason+0.002
cle+0.002
 afar+0.002
 weep+0.002
Neg logit contributions
der-0.001
lic-0.001
 is-0.001
 can-0.001
 has-0.001
Token saw
Token ID2497
Feature activation+3.56e+00

Pos logit contributions
 argue+0.028
cle+0.026
keeping+0.026
 elbows+0.026
 reason+0.026
Neg logit contributions
der-0.022
 is-0.021
 can-0.020
lic-0.020
 has-0.020
Token that
Token ID326
Feature activation+1.57e+00

Pos logit contributions
lic+0.058
 has+0.052
boards+0.052
der+0.050
br+0.049
Neg logit contributions
 afar-0.083
 argue-0.080
 worry-0.076
 elbows-0.076
 mischief-0.075
Token the
Token ID262
Feature activation+3.82e+00

Pos logit contributions
der+0.026
lic+0.026
 is+0.025
boards+0.024
 can+0.024
Neg logit contributions
 argue-0.036
 reason-0.034
cle-0.033
 afar-0.033
keeping-0.033
Token purple
Token ID14032
Feature activation+2.42e+00

Pos logit contributions
der+0.070
 is+0.066
 can+0.064
lic+0.063
 has+0.062
Neg logit contributions
 argue-0.083
cle-0.080
 reason-0.079
keeping-0.078
 elbows-0.078
Token print
Token ID3601
Feature activation+7.27e+00

Pos logit contributions
der+0.047
lic+0.043
 is+0.043
 can+0.042
 helps+0.041
Neg logit contributions
 argue-0.049
cle-0.047
 reason-0.047
keeping-0.045
 elbows-0.045
Token had
Token ID550
Feature activation+2.56e+00

Pos logit contributions
der+0.103
lic+0.096
lessly+0.075
 helps+0.075
icked+0.074
Neg logit contributions
 reason-0.178
 argue-0.178
 weep-0.170
keeping-0.167
 harm-0.166
Token been
Token ID587
Feature activation+1.70e+00

Pos logit contributions
der+0.048
 is+0.046
lic+0.046
 has+0.044
 can+0.044
Neg logit contributions
 argue-0.050
 reason-0.048
keeping-0.047
 worry-0.045
 weep-0.045
Token ruined
Token ID20484
Feature activation+2.99e+00

Pos logit contributions
der+0.031
 is+0.030
 can+0.028
lic+0.028
 has+0.028
Neg logit contributions
 argue-0.037
cle-0.035
keeping-0.035
 reason-0.035
 elbows-0.035
Token.
Token ID13
Feature activation+9.48e-01

Pos logit contributions
lic+0.050
der+0.049
 helps+0.045
 is+0.045
 has+0.045
Neg logit contributions
 argue-0.066
keeping-0.064
 reason-0.063
cle-0.062
 worry-0.061
Token They
Token ID1119
Feature activation+1.87e+00

Pos logit contributions
lic+0.017
br+0.017
 can+0.016
 is+0.016
 has+0.016
Neg logit contributions
keeping-0.018
 mischief-0.017
 generation-0.017
 afar-0.017
 argue-0.017
Token started
Token ID2067
Feature activation+6.95e-01

Pos logit contributions
der+0.072
lic+0.068
 is+0.067
 can+0.065
 has+0.065
Neg logit contributions
 argue-0.041
 reason-0.041
keeping-0.039
cle-0.037
 weep-0.037
Token to
Token ID284
Feature activation-3.61e+00

Pos logit contributions
 is+0.009
der+0.009
 can+0.008
 has+0.008
 makes+0.008
Neg logit contributions
 argue-0.019
cle-0.019
 elbows-0.018
 afar-0.018
keeping-0.018
Token happen
Token ID1645
Feature activation+9.44e-01

Pos logit contributions
 argue+0.068
 reason+0.065
 weep+0.064
 worry+0.063
aster+0.063
Neg logit contributions
der-0.082
lic-0.077
boards-0.074
ing-0.071
 has-0.071
Token.
Token ID13
Feature activation+4.16e-01

Pos logit contributions
lic+0.016
der+0.016
 has+0.015
 is+0.015
 can+0.015
Neg logit contributions
 argue-0.020
keeping-0.020
 reason-0.019
cle-0.019
 worry-0.018
Token The
Token ID383
Feature activation+4.17e+00

Pos logit contributions
der+0.006
 is+0.006
 can+0.005
 has+0.005
lic+0.005
Neg logit contributions
 argue-0.010
keeping-0.010
 weep-0.010
 reason-0.010
 elbows-0.010
Token print
Token ID3601
Feature activation+7.27e+00

Pos logit contributions
der+0.076
 is+0.072
 can+0.070
lic+0.068
 has+0.067
Neg logit contributions
 argue-0.091
cle-0.087
 reason-0.086
keeping-0.086
 elbows-0.086
Token started
Token ID2067
Feature activation+1.25e+00

Pos logit contributions
der+0.107
lic+0.096
ows+0.085
arl+0.083
lessly+0.083
Neg logit contributions
 reason-0.172
 argue-0.171
 weep-0.169
 panic-0.161
 worry-0.160
Token to
Token ID284
Feature activation-2.58e+00

Pos logit contributions
der+0.019
 is+0.019
 can+0.018
 has+0.017
lic+0.017
Neg logit contributions
 argue-0.030
cle-0.030
keeping-0.029
 elbows-0.029
 reason-0.029
Token shine
Token ID18340
Feature activation+1.30e+00

Pos logit contributions
 argue+0.056
 reason+0.052
 weep+0.052
 elbows+0.051
cle+0.051
Neg logit contributions
der-0.049
 is-0.046
lic-0.045
 can-0.045
 has-0.044
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
der+0.023
lic+0.022
icked+0.022
 teaches+0.022
boards+0.021
Neg logit contributions
 worry-0.025
 argue-0.023
 afar-0.023
 panic-0.023
keeping-0.023
Token spark
Token ID9009
Feature activation+1.83e+00

Pos logit contributions
 argue+0.046
 weep+0.043
 worry+0.043
 elbows+0.042
 panic+0.042
Neg logit contributions
der-0.037
lic-0.036
boards-0.035
ing-0.032
 is-0.032
Token bird
Token ID6512
Feature activation+1.60e+00

Pos logit contributions
 is+0.077
 can+0.074
der+0.073
 has+0.072
 makes+0.066
Neg logit contributions
 argue-0.106
cle-0.105
keeping-0.104
ge-0.099
 moderation-0.099
Token flies
Token ID17607
Feature activation+2.15e+00

Pos logit contributions
der+0.026
lic+0.023
boards+0.023
lessly+0.022
ing+0.019
Neg logit contributions
 argue-0.038
 weep-0.037
 elbows-0.036
 panic-0.036
 harm-0.035
Token down
Token ID866
Feature activation+1.43e+00

Pos logit contributions
der+0.043
 is+0.037
lic+0.037
 has+0.036
boards+0.036
Neg logit contributions
 argue-0.043
 elbows-0.040
 weep-0.040
 panic-0.040
 reason-0.039
Token and
Token ID290
Feature activation+2.14e-01

Pos logit contributions
der+0.021
 is+0.019
lic+0.019
 can+0.019
 has+0.018
Neg logit contributions
 argue-0.035
 reason-0.034
keeping-0.034
cle-0.034
 elbows-0.033
Token pe
Token ID613
Feature activation-1.04e+00

Pos logit contributions
lic+0.004
der+0.004
boards+0.004
lessly+0.004
ing+0.003
Neg logit contributions
 argue-0.005
 elbows-0.005
 weep-0.004
 afar-0.004
 panic-0.004
Tokencks
Token ID4657
Feature activation+7.20e+00

Pos logit contributions
 argue+0.017
cle+0.016
keeping+0.016
 elbows+0.016
 reason+0.016
Neg logit contributions
der-0.025
 is-0.024
 can-0.023
 has-0.023
lic-0.023
Token Tom
Token ID4186
Feature activation-1.95e+00

Pos logit contributions
lic+0.152
der+0.140
boards+0.129
 contains+0.116
washer+0.115
Neg logit contributions
 argue-0.135
 elbows-0.129
aster-0.125
 mischief-0.124
 panic-0.122
Token on
Token ID319
Feature activation+2.84e-01

Pos logit contributions
 argue+0.051
cle+0.050
keeping+0.049
 reason+0.049
 elbows+0.049
Neg logit contributions
der-0.026
 is-0.025
 can-0.024
 has-0.023
lic-0.022
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
der+0.004
 is+0.004
 can+0.004
 has+0.004
lic+0.004
Neg logit contributions
 argue-0.007
cle-0.007
keeping-0.007
 elbows-0.007
 reason-0.007
Token head
Token ID1182
Feature activation-2.89e-01

Pos logit contributions
der+0.033
 is+0.032
 can+0.031
lic+0.030
 has+0.030
Neg logit contributions
 argue-0.039
cle-0.037
keeping-0.037
 elbows-0.037
 reason-0.037
Token.
Token ID13
Feature activation+7.61e-01

Pos logit contributions
 argue+0.007
cle+0.006
keeping+0.006
 elbows+0.006
 weep+0.006
Neg logit contributions
der-0.005
 is-0.004
lic-0.004
 has-0.004
 can-0.004
Token and
Token ID290
Feature activation-7.67e-01

Pos logit contributions
 argue+0.002
keeping+0.002
 reason+0.002
 elbows+0.002
 worry+0.002
Neg logit contributions
lic-0.001
 is-0.001
der-0.001
 has-0.001
 can-0.001
Token laughed
Token ID13818
Feature activation+3.91e+00

Pos logit contributions
 argue+0.016
 worry+0.015
 weep+0.015
 panic+0.015
 afar+0.015
Neg logit contributions
der-0.014
boards-0.014
lic-0.013
 is-0.013
 lid-0.012
Token even
Token ID772
Feature activation+1.23e-02

Pos logit contributions
lic+0.072
 has+0.070
 is+0.069
boards+0.069
 looks+0.069
Neg logit contributions
 argue-0.072
 worry-0.069
 afar-0.068
 reason-0.067
 weep-0.066
Token harder
Token ID7069
Feature activation-5.94e-01

Pos logit contributions
der+0.000
 is+0.000
boards+0.000
lic+0.000
 can+0.000
Neg logit contributions
 argue-0.000
 reason-0.000
 afar-0.000
 weep-0.000
 elbows-0.000
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
 argue+0.014
keeping+0.013
 reason+0.013
 elbows+0.013
cle+0.013
Neg logit contributions
lic-0.010
 is-0.009
der-0.009
 can-0.009
boards-0.009
Token This
Token ID770
Feature activation+7.17e+00

Pos logit contributions
der+0.023
 is+0.023
 can+0.022
 has+0.021
lic+0.021
Neg logit contributions
 argue-0.025
cle-0.024
keeping-0.024
 reason-0.024
 elbows-0.024
Token made
Token ID925
Feature activation+4.36e+00

Pos logit contributions
der+0.175
lic+0.169
ows+0.157
boards+0.154
 is+0.152
Neg logit contributions
 reason-0.113
 argue-0.111
 panic-0.100
 weep-0.098
 moderation-0.098
Token the
Token ID262
Feature activation+2.80e+00

Pos logit contributions
der+0.087
 is+0.085
 can+0.082
 has+0.080
lic+0.079
Neg logit contributions
 argue-0.087
cle-0.083
keeping-0.082
 elbows-0.081
 reason-0.081
Token band
Token ID4097
Feature activation-4.74e-01

Pos logit contributions
der+0.050
 is+0.049
 can+0.047
 has+0.046
 makes+0.044
Neg logit contributions
 argue-0.062
cle-0.060
keeping-0.059
 elbows-0.058
 reason-0.057
Token of
Token ID286
Feature activation+1.35e-01

Pos logit contributions
 argue+0.010
 reason+0.009
cle+0.009
 weep+0.009
keeping+0.009
Neg logit contributions
der-0.009
 is-0.008
 can-0.008
 has-0.008
lic-0.008
Token fish
Token ID5916
Feature activation-3.87e-01

Pos logit contributions
der+0.002
 is+0.002
 can+0.002
lic+0.002
 has+0.002
Neg logit contributions
 argue-0.003
 reason-0.003
 elbows-0.003
keeping-0.003
cle-0.003
Token
Token ID220
Feature activation+1.60e+00

Pos logit contributions
der+0.039
 is+0.037
 can+0.036
lic+0.036
 has+0.035
Neg logit contributions
 argue-0.040
cle-0.039
keeping-0.038
 elbows-0.037
 reason-0.037
Token
Token ID198
Feature activation+1.77e+00

Pos logit contributions
der+0.026
 is+0.024
 can+0.023
 has+0.023
lic+0.023
Neg logit contributions
 argue-0.036
cle-0.036
keeping-0.035
 elbows-0.035
 reason-0.034
Token
Token ID198
Feature activation+1.59e+00

Pos logit contributions
der+0.023
 is+0.023
 can+0.022
 has+0.021
lic+0.021
Neg logit contributions
 argue-0.047
keeping-0.045
cle-0.045
 elbows-0.045
 reason-0.045
TokenWhen
Token ID2215
Feature activation+9.66e-01

Pos logit contributions
der+0.041
 is+0.040
 can+0.039
 has+0.039
lic+0.038
Neg logit contributions
 argue-0.022
keeping-0.021
cle-0.020
 elbows-0.020
 reason-0.020
Token the
Token ID262
Feature activation+4.65e+00

Pos logit contributions
der+0.018
lic+0.017
 is+0.016
 can+0.015
boards+0.015
Neg logit contributions
 argue-0.021
cle-0.021
 reason-0.020
aster-0.020
 elbows-0.020
Token mixture
Token ID11710
Feature activation+7.17e+00

Pos logit contributions
der+0.098
 is+0.089
lic+0.088
 can+0.086
 has+0.085
Neg logit contributions
 argue-0.092
 reason-0.090
cle-0.085
 worry-0.083
 weep-0.083
Token was
Token ID373
Feature activation+2.26e+00

Pos logit contributions
der+0.124
lic+0.103
arl+0.096
lessly+0.093
ows+0.092
Neg logit contributions
aster-0.160
 argue-0.156
 reason-0.154
 panic-0.148
 weep-0.146
Token done
Token ID1760
Feature activation+1.63e+00

Pos logit contributions
der+0.044
 is+0.043
 can+0.040
 has+0.040
 helps+0.039
Neg logit contributions
 argue-0.047
 reason-0.043
aster-0.042
 worry-0.042
cle-0.041
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
der+0.027
 is+0.026
lic+0.026
 has+0.025
 can+0.025
Neg logit contributions
 argue-0.036
 reason-0.033
 worry-0.033
aster-0.033
 weep-0.032
Token it
Token ID340
Feature activation+4.79e+00

Pos logit contributions
der+0.008
lic+0.007
 is+0.007
 can+0.007
 has+0.007
Neg logit contributions
 argue-0.005
cle-0.005
 reason-0.005
 elbows-0.004
keeping-0.004
Token looked
Token ID3114
Feature activation+5.65e+00

Pos logit contributions
arl+0.080
der+0.080
boards+0.072
lic+0.069
olls+0.069
Neg logit contributions
 argue-0.097
 panic-0.095
 reason-0.095
 weep-0.095
 worry-0.091
Token their
Token ID511
Feature activation+6.12e-01

Pos logit contributions
der+0.053
 is+0.051
lic+0.051
 has+0.048
 can+0.048
Neg logit contributions
 argue-0.077
 elbows-0.073
cle-0.073
keeping-0.072
 reason-0.071
Token arms
Token ID5101
Feature activation+1.79e+00

Pos logit contributions
 is+0.011
 can+0.011
der+0.011
 has+0.010
 makes+0.010
Neg logit contributions
cle-0.013
 argue-0.013
keeping-0.013
aster-0.012
 moderation-0.012
Token and
Token ID290
Feature activation-4.84e-01

Pos logit contributions
 is+0.029
der+0.028
 has+0.027
 can+0.027
 helps+0.027
Neg logit contributions
 argue-0.040
ge-0.037
 weep-0.037
 afar-0.037
 worry-0.036
Token shouting
Token ID19642
Feature activation+4.01e+00

Pos logit contributions
 argue+0.009
cle+0.008
 elbows+0.008
keeping+0.008
 reason+0.008
Neg logit contributions
der-0.011
 is-0.010
 can-0.010
lic-0.010
 has-0.010
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 is+0.063
 has+0.062
lic+0.060
 helps+0.060
 contains+0.059
Neg logit contributions
 argue-0.089
 reason-0.080
 worry-0.078
 weep-0.077
keeping-0.077
Token This
Token ID770
Feature activation+7.14e+00

Pos logit contributions
der+0.020
 is+0.020
 can+0.019
 has+0.018
lic+0.018
Neg logit contributions
 argue-0.035
cle-0.034
keeping-0.034
 elbows-0.033
 reason-0.033
Token made
Token ID925
Feature activation+4.19e+00

Pos logit contributions
der+0.161
lic+0.152
icked+0.139
ows+0.138
arl+0.135
Neg logit contributions
 reason-0.131
 argue-0.123
 quarrel-0.116
 panic-0.115
 afar-0.113
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
der+0.070
 is+0.066
 can+0.064
 has+0.062
lic+0.062
Neg logit contributions
 argue-0.097
cle-0.093
 reason-0.093
keeping-0.092
 elbows-0.091
Token tiger
Token ID26241
Feature activation-1.89e+00

Pos logit contributions
der+0.051
 is+0.049
 can+0.048
 has+0.046
lic+0.046
Neg logit contributions
 argue-0.053
cle-0.051
keeping-0.050
 reason-0.050
 elbows-0.050
Token scared
Token ID12008
Feature activation-1.83e-01

Pos logit contributions
 argue+0.039
cle+0.037
keeping+0.036
 reason+0.036
 elbows+0.036
Neg logit contributions
der-0.037
 is-0.036
 can-0.034
lic-0.034
 has-0.034
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
 argue+0.004
 reason+0.004
keeping+0.004
cle+0.004
 elbows+0.004
Neg logit contributions
der-0.003
 is-0.003
 can-0.003
lic-0.003
 has-0.003
Token.
Token ID13
Feature activation+8.08e-01

Pos logit contributions
 is+0.012
der+0.012
 can+0.012
 has+0.011
 makes+0.010
Neg logit contributions
 argue-0.027
cle-0.026
 elbows-0.026
keeping-0.026
 reason-0.025
Token
Token ID220
Feature activation+7.43e-01

Pos logit contributions
der+0.015
 is+0.014
 can+0.013
 has+0.013
lic+0.013
Neg logit contributions
 argue-0.017
keeping-0.017
cle-0.016
 reason-0.016
 elbows-0.016
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
der+0.010
 is+0.010
 can+0.009
 has+0.009
lic+0.009
Neg logit contributions
 argue-0.019
cle-0.019
keeping-0.019
 elbows-0.019
 reason-0.018
Token
Token ID198
Feature activation+9.08e-01

Pos logit contributions
 is+0.012
 can+0.011
lic+0.011
der+0.011
 has+0.010
Neg logit contributions
 argue-0.032
 elbows-0.031
cle-0.031
keeping-0.031
 afar-0.031
TokenThe
Token ID464
Feature activation+5.12e+00

Pos logit contributions
 is+0.016
 can+0.015
der+0.015
lic+0.015
 has+0.014
Neg logit contributions
 argue-0.020
keeping-0.020
 elbows-0.020
cle-0.019
 afar-0.019
Token card
Token ID2657
Feature activation+7.13e+00

Pos logit contributions
 is+0.069
der+0.067
 can+0.065
 has+0.063
 makes+0.059
Neg logit contributions
 argue-0.136
keeping-0.133
cle-0.132
 elbows-0.131
 afar-0.128
Token was
Token ID373
Feature activation+2.96e+00

Pos logit contributions
der+0.096
 is+0.089
lic+0.086
 can+0.086
 has+0.082
Neg logit contributions
 argue-0.186
 reason-0.179
cle-0.178
 weep-0.176
keeping-0.176
Token very
Token ID845
Feature activation+3.74e-01

Pos logit contributions
der+0.048
 is+0.047
 can+0.045
 has+0.043
lic+0.043
Neg logit contributions
 argue-0.070
cle-0.067
 reason-0.066
keeping-0.066
 elbows-0.066
Token cheap
Token ID7026
Feature activation+1.12e+00

Pos logit contributions
der+0.008
 is+0.008
 can+0.007
lic+0.007
 has+0.007
Neg logit contributions
 argue-0.007
cle-0.007
 reason-0.007
 weep-0.007
 elbows-0.007
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
lic+0.017
der+0.016
 is+0.015
 can+0.015
 has+0.015
Neg logit contributions
 argue-0.027
 reason-0.026
keeping-0.026
cle-0.026
 weep-0.025
Token It
Token ID632
Feature activation+3.82e+00

Pos logit contributions
der+0.031
 is+0.030
 has+0.029
 can+0.029
 makes+0.029
Neg logit contributions
 argue-0.019
keeping-0.018
 worry-0.018
 weep-0.018
 reason-0.018
Token.
Token ID13
Feature activation+7.92e-01

Pos logit contributions
 argue+0.043
cle+0.041
keeping+0.041
 elbows+0.041
 reason+0.041
Neg logit contributions
der-0.023
 is-0.022
 can-0.021
 has-0.020
lic-0.020
Token He
Token ID679
Feature activation+3.78e+00

Pos logit contributions
der+0.009
 is+0.009
 can+0.009
 has+0.008
lic+0.008
Neg logit contributions
 argue-0.022
 elbows-0.022
cle-0.022
 reason-0.022
keeping-0.022
Token noticed
Token ID6810
Feature activation+3.91e+00

Pos logit contributions
lic+0.060
der+0.059
boards+0.054
ing+0.052
 is+0.049
Neg logit contributions
 argue-0.090
 weep-0.086
 afar-0.082
 reason-0.081
 panic-0.081
Token that
Token ID326
Feature activation+2.15e+00

Pos logit contributions
der+0.048
 is+0.047
 can+0.045
 has+0.044
lic+0.044
Neg logit contributions
 argue-0.104
cle-0.101
keeping-0.101
 reason-0.100
 elbows-0.100
Token the
Token ID262
Feature activation+3.81e+00

Pos logit contributions
der+0.032
 is+0.031
 can+0.030
lic+0.029
 has+0.029
Neg logit contributions
 argue-0.053
cle-0.051
keeping-0.051
 reason-0.051
 elbows-0.050
Token print
Token ID3601
Feature activation+7.13e+00

Pos logit contributions
der+0.065
 is+0.064
 can+0.062
 has+0.059
 makes+0.056
Neg logit contributions
 argue-0.087
cle-0.084
keeping-0.083
 elbows-0.082
 reason-0.081
Token in
Token ID287
Feature activation-5.69e-03

Pos logit contributions
der+0.095
lic+0.092
icked+0.072
ows+0.072
 helps+0.071
Neg logit contributions
 argue-0.177
 reason-0.174
 weep-0.173
 worry-0.167
 harm-0.166
Token his
Token ID465
Feature activation+5.50e-03

Pos logit contributions
 argue+0.000
cle+0.000
keeping+0.000
 reason+0.000
 elbows+0.000
Neg logit contributions
der-0.000
 is-0.000
 can-0.000
lic-0.000
 has-0.000
Token new
Token ID649
Feature activation+3.68e-01

Pos logit contributions
 is+0.000
der+0.000
 can+0.000
 has+0.000
 makes+0.000
Neg logit contributions
 argue-0.000
cle-0.000
keeping-0.000
 reason-0.000
 elbows-0.000
Token coloring
Token ID33988
Feature activation+4.67e+00

Pos logit contributions
 is+0.007
der+0.006
 can+0.006
 has+0.006
 looks+0.006
Neg logit contributions
 argue-0.008
cle-0.008
 elbows-0.008
keeping-0.008
 reason-0.007
Token book
Token ID1492
Feature activation+3.88e+00

Pos logit contributions
der+0.076
lic+0.071
 helps+0.063
 is+0.062
 can+0.062
Neg logit contributions
keeping-0.105
 argue-0.102
 reason-0.101
cle-0.100
aster-0.099
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
lic+0.017
der+0.017
boards+0.016
 is+0.015
 has+0.015
Neg logit contributions
keeping-0.022
 argue-0.021
 reason-0.021
cle-0.020
 worry-0.020
Token
Token ID220
Feature activation+1.36e+00

Pos logit contributions
der+0.017
 is+0.016
 can+0.016
 has+0.015
lic+0.015
Neg logit contributions
 argue-0.030
cle-0.029
keeping-0.029
 elbows-0.029
 reason-0.029
Token
Token ID198
Feature activation+1.67e+00

Pos logit contributions
der+0.022
 is+0.020
 can+0.019
 has+0.019
 makes+0.018
Neg logit contributions
 argue-0.032
cle-0.031
keeping-0.031
 reason-0.030
 elbows-0.030
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
der+0.024
 is+0.023
 can+0.022
 has+0.022
lic+0.021
Neg logit contributions
 argue-0.042
cle-0.041
keeping-0.040
 elbows-0.040
 reason-0.040
TokenThe
Token ID464
Feature activation+5.68e+00

Pos logit contributions
der+0.028
 is+0.027
 can+0.026
 has+0.025
lic+0.025
Neg logit contributions
 argue-0.031
cle-0.029
keeping-0.029
 elbows-0.029
 reason-0.029
Token symbol
Token ID6194
Feature activation+7.13e+00

Pos logit contributions
 is+0.084
der+0.082
 can+0.079
 has+0.077
lic+0.072
Neg logit contributions
 argue-0.142
keeping-0.139
cle-0.139
 elbows-0.137
 weep-0.133
Token worked
Token ID3111
Feature activation+4.81e+00

Pos logit contributions
der+0.098
lic+0.088
boards+0.086
 can+0.079
ows+0.078
Neg logit contributions
 reason-0.170
 argue-0.169
cle-0.166
 weep-0.164
 worry-0.162
Token!
Token ID0
Feature activation+2.30e+00

Pos logit contributions
der+0.085
lic+0.079
boards+0.076
 covers+0.075
 helps+0.074
Neg logit contributions
 argue-0.097
 reason-0.097
ge-0.095
 worry-0.092
cle-0.090
Token Every
Token ID3887
Feature activation+3.09e+00

Pos logit contributions
der+0.038
 is+0.036
 can+0.035
 has+0.034
lic+0.034
Neg logit contributions
 argue-0.052
keeping-0.051
cle-0.051
 reason-0.050
 weep-0.048
Token time
Token ID640
Feature activation-1.94e-01

Pos logit contributions
der+0.073
lic+0.068
 is+0.067
 can+0.066
boards+0.066
Neg logit contributions
 argue-0.051
 reason-0.049
cle-0.047
 weep-0.046
keeping-0.046
Token Andy
Token ID12382
Feature activation-1.27e+00

Pos logit contributions
 argue+0.005
 reason+0.004
cle+0.004
keeping+0.004
 worry+0.004
Neg logit contributions
der-0.003
lic-0.003
 is-0.003
 has-0.003
 can-0.003
Token and
Token ID290
Feature activation-1.85e+00

Pos logit contributions
 argue+0.004
keeping+0.004
 reason+0.003
 weep+0.003
 worry+0.003
Neg logit contributions
der-0.003
lic-0.003
 is-0.003
boards-0.003
 has-0.003
Token sugar
Token ID7543
Feature activation+6.06e-01

Pos logit contributions
 argue+0.040
keeping+0.040
 reason+0.039
 worry+0.039
 weep+0.039
Neg logit contributions
lic-0.032
der-0.031
boards-0.029
 is-0.027
ows-0.027
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
lic+0.011
der+0.011
 is+0.010
boards+0.009
 can+0.009
Neg logit contributions
 argue-0.012
keeping-0.012
 reason-0.012
cle-0.011
 worry-0.011
Token The
Token ID383
Feature activation+5.09e+00

Pos logit contributions
der+0.020
 is+0.019
lic+0.018
 can+0.018
 has+0.018
Neg logit contributions
 argue-0.026
cle-0.025
keeping-0.024
 reason-0.024
 weep-0.024
Token secret
Token ID3200
Feature activation+5.20e+00

Pos logit contributions
der+0.083
 is+0.080
 can+0.077
 has+0.074
lic+0.074
Neg logit contributions
 argue-0.120
cle-0.116
keeping-0.114
 reason-0.114
 elbows-0.113
Token mixture
Token ID11710
Feature activation+7.10e+00

Pos logit contributions
der+0.072
lic+0.066
boards+0.059
 can+0.053
 is+0.053
Neg logit contributions
 reason-0.133
 argue-0.130
keeping-0.130
 weep-0.127
 worry-0.126
Token was
Token ID373
Feature activation+2.98e+00

Pos logit contributions
der+0.097
lic+0.091
boards+0.074
arl+0.064
icked+0.063
Neg logit contributions
 argue-0.173
 reason-0.172
keeping-0.171
 weep-0.171
 worry-0.168
Token so
Token ID523
Feature activation+1.54e-01

Pos logit contributions
der+0.051
boards+0.047
lic+0.046
 is+0.044
 helps+0.042
Neg logit contributions
 argue-0.066
 reason-0.064
 worry-0.063
cle-0.063
 afar-0.062
Token good
Token ID922
Feature activation-1.63e-01

Pos logit contributions
der+0.003
boards+0.003
lic+0.003
les+0.003
ows+0.003
Neg logit contributions
 worry-0.003
 panic-0.003
 mischief-0.003
 reason-0.003
 moderation-0.003
Token that
Token ID326
Feature activation+2.37e+00

Pos logit contributions
keeping+0.003
 argue+0.003
 reason+0.003
cle+0.003
 afar+0.003
Neg logit contributions
lic-0.003
der-0.003
boards-0.003
 lid-0.003
 is-0.003
Token it
Token ID340
Feature activation+4.04e+00

Pos logit contributions
der+0.044
lic+0.043
boards+0.038
 is+0.036
ows+0.036
Neg logit contributions
 argue-0.050
cle-0.050
 reason-0.049
 worry-0.048
 moderation-0.048
Token be
Token ID307
Feature activation-1.52e+00

Pos logit contributions
cle+0.041
 argue+0.041
keeping+0.040
 elbows+0.040
 reason+0.038
Neg logit contributions
 is-0.044
der-0.044
 can-0.043
 has-0.041
 makes-0.040
Token taller
Token ID25242
Feature activation-3.23e+00

Pos logit contributions
cle+0.033
 argue+0.033
keeping+0.032
 elbows+0.032
 reason+0.031
Neg logit contributions
 is-0.027
 can-0.026
der-0.025
 all-0.025
 has-0.025
Token than
Token ID621
Feature activation-3.78e+00

Pos logit contributions
 argue+0.067
cle+0.064
keeping+0.063
 elbows+0.063
 reason+0.062
Neg logit contributions
der-0.062
 is-0.061
 can-0.058
 has-0.056
lic-0.055
Token her
Token ID607
Feature activation-2.09e+00

Pos logit contributions
 argue+0.077
cle+0.073
keeping+0.073
 reason+0.072
 elbows+0.072
Neg logit contributions
der-0.073
 is-0.071
 can-0.068
lic-0.067
 has-0.067
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
 argue+0.050
cle+0.049
 elbows+0.049
keeping+0.048
 weep+0.047
Neg logit contributions
 is-0.034
der-0.034
 can-0.033
 has-0.031
 makes-0.029
Token This
Token ID770
Feature activation+7.10e+00

Pos logit contributions
 argue+0.012
cle+0.012
keeping+0.012
 reason+0.012
 elbows+0.012
Neg logit contributions
 is-0.005
der-0.005
 can-0.005
 has-0.005
 makes-0.004
Token made
Token ID925
Feature activation+4.52e+00

Pos logit contributions
der+0.164
lic+0.157
 is+0.142
 can+0.142
 has+0.140
Neg logit contributions
 argue-0.121
 reason-0.121
 weep-0.109
cle-0.108
 moderation-0.108
Token Max
Token ID5436
Feature activation-3.20e+00

Pos logit contributions
der+0.099
 is+0.090
 has+0.089
lic+0.089
 can+0.088
Neg logit contributions
 argue-0.081
 reason-0.080
 elbows-0.077
 panic-0.077
 weep-0.076
Token very
Token ID845
Feature activation-1.41e+00

Pos logit contributions
cle+0.070
 argue+0.069
keeping+0.068
 elbows+0.066
 reason+0.066
Neg logit contributions
 is-0.057
der-0.056
 can-0.054
 has-0.053
 makes-0.051
Token sad
Token ID6507
Feature activation+4.73e-01

Pos logit contributions
 argue+0.028
cle+0.028
keeping+0.027
 elbows+0.027
 reason+0.026
Neg logit contributions
der-0.028
 is-0.028
 can-0.027
 has-0.026
 makes-0.025
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
lic+0.008
 has+0.008
boards+0.007
 is+0.007
 can+0.007
Neg logit contributions
 argue-0.010
 reason-0.010
 weep-0.010
 worry-0.010
keeping-0.009
Token blue
Token ID4171
Feature activation+6.11e-01

Pos logit contributions
 argue+0.046
 reason+0.044
cle+0.044
keeping+0.043
 weep+0.043
Neg logit contributions
der-0.057
 is-0.055
 can-0.053
lic-0.053
 has-0.052
Token bike
Token ID7161
Feature activation-8.98e-01

Pos logit contributions
der+0.011
 is+0.010
lic+0.010
 has+0.010
 can+0.010
Neg logit contributions
 argue-0.013
 reason-0.013
cle-0.013
 weep-0.013
keeping-0.012
Token with
Token ID351
Feature activation+1.31e+00

Pos logit contributions
 argue+0.021
cle+0.021
keeping+0.021
 reason+0.020
 weep+0.020
Neg logit contributions
der-0.014
lic-0.013
 is-0.012
 makes-0.011
boards-0.011
Token t
Token ID256
Feature activation-2.64e+00

Pos logit contributions
der+0.025
lic+0.024
 is+0.023
boards+0.023
 has+0.023
Neg logit contributions
 afar-0.026
 mischief-0.025
keeping-0.025
 reason-0.025
 argue-0.025
Tokenas
Token ID292
Feature activation-2.31e+00

Pos logit contributions
keeping+0.053
cle+0.053
 argue+0.050
ge+0.049
 elbows+0.049
Neg logit contributions
 is-0.052
 has-0.051
 can-0.050
der-0.050
 makes-0.049
Tokensels
Token ID14002
Feature activation+7.10e+00

Pos logit contributions
 argue+0.054
keeping+0.051
cle+0.051
 weep+0.050
 worry+0.050
Neg logit contributions
der-0.037
lic-0.035
 is-0.035
boards-0.032
 makes-0.032
Token on
Token ID319
Feature activation-1.31e+00

Pos logit contributions
lic+0.124
der+0.121
auc+0.108
rooms+0.107
ours+0.104
Neg logit contributions
keeping-0.138
 reason-0.126
 seen-0.124
 perspective-0.121
 worry-0.120
Token the
Token ID262
Feature activation+9.97e-01

Pos logit contributions
 argue+0.029
cle+0.028
 reason+0.027
keeping+0.027
 elbows+0.027
Neg logit contributions
der-0.023
 is-0.022
lic-0.022
 can-0.021
 has-0.021
Token handle
Token ID5412
Feature activation+8.54e-01

Pos logit contributions
der+0.016
 is+0.015
lic+0.015
 can+0.014
 has+0.013
Neg logit contributions
 argue-0.024
 reason-0.023
cle-0.023
keeping-0.023
 elbows-0.023
Tokenbars
Token ID34046
Feature activation-8.61e-01

Pos logit contributions
lic+0.014
der+0.014
 helps+0.012
 is+0.012
 teaches+0.012
Neg logit contributions
keeping-0.018
ge-0.018
cle-0.018
 argue-0.017
 worry-0.017
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
 argue+0.019
ge+0.018
cle+0.018
keeping+0.018
 worry+0.017
Neg logit contributions
lic-0.015
der-0.014
boards-0.013
 is-0.013
 helps-0.013
Token.
Token ID13
Feature activation+8.69e-01

Pos logit contributions
lic+0.020
der+0.017
 teaches+0.017
 helps+0.017
 contains+0.017
Neg logit contributions
keeping-0.018
 argue-0.018
ge-0.018
 harm-0.017
 reason-0.017
Token Now
Token ID2735
Feature activation+1.24e+00

Pos logit contributions
der+0.012
 is+0.011
lic+0.011
 can+0.011
 has+0.010
Neg logit contributions
 argue-0.022
keeping-0.021
cle-0.021
 reason-0.021
 weep-0.021
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
lic+0.023
der+0.022
boards+0.020
auc+0.019
 is+0.019
Neg logit contributions
 argue-0.027
 worry-0.026
 reason-0.026
 weep-0.025
 panic-0.024
Token the
Token ID262
Feature activation+4.27e+00

Pos logit contributions
 argue+0.009
cle+0.009
 reason+0.009
keeping+0.009
 weep+0.009
Neg logit contributions
der-0.006
 is-0.006
lic-0.006
 can-0.006
 has-0.006
Token herb
Token ID21047
Feature activation+3.32e+00

Pos logit contributions
der+0.074
 is+0.069
lic+0.067
 can+0.064
 has+0.064
Neg logit contributions
 argue-0.097
 reason-0.095
cle-0.093
 weep-0.091
keeping-0.090
Token print
Token ID3601
Feature activation+7.08e+00

Pos logit contributions
der+0.051
lic+0.047
lessly+0.041
boards+0.039
arl+0.035
Neg logit contributions
 argue-0.081
 weep-0.078
 reason-0.078
 worry-0.076
 panic-0.075
Token hangs
Token ID28087
Feature activation+2.08e+00

Pos logit contributions
der+0.106
lic+0.094
lessly+0.075
arl+0.075
boards+0.074
Neg logit contributions
 reason-0.167
 argue-0.158
 weep-0.157
 worry-0.157
keeping-0.157
Token in
Token ID287
Feature activation-1.62e+00

Pos logit contributions
der+0.035
lic+0.032
 helps+0.031
 has+0.030
 is+0.030
Neg logit contributions
keeping-0.043
 argue-0.041
 Johnson-0.040
 reason-0.040
 worry-0.039
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
 argue+0.039
cle+0.038
keeping+0.037
 elbows+0.037
 reason+0.037
Neg logit contributions
der-0.026
 is-0.025
 can-0.024
 has-0.023
lic-0.023
Token kitchen
Token ID9592
Feature activation+2.64e+00

Pos logit contributions
der+0.033
lic+0.029
 is+0.029
 can+0.028
 has+0.027
Neg logit contributions
 argue-0.041
cle-0.040
 reason-0.039
 elbows-0.039
 afar-0.039
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
lic+0.048
der+0.048
icked+0.046
lessly+0.043
rooms+0.041
Neg logit contributions
 argue-0.050
keeping-0.049
 reason-0.049
 worry-0.047
aster-0.045
Token and
Token ID290
Feature activation+3.49e-01

Pos logit contributions
 is+0.046
 has+0.046
 can+0.045
lic+0.045
 helps+0.043
Neg logit contributions
 argue-0.056
cle-0.054
 worry-0.053
keeping-0.053
 reason-0.053
Token his
Token ID465
Feature activation+2.04e+00

Pos logit contributions
der+0.007
boards+0.007
lic+0.007
 is+0.006
 can+0.006
Neg logit contributions
 weep-0.007
 worry-0.007
 panic-0.007
 argue-0.007
 elbows-0.006
Token mom
Token ID1995
Feature activation-4.01e-01

Pos logit contributions
 is+0.036
der+0.036
 can+0.035
 has+0.033
 makes+0.032
Neg logit contributions
 argue-0.045
cle-0.044
keeping-0.043
 elbows-0.042
 reason-0.042
Token laughed
Token ID13818
Feature activation+4.85e+00

Pos logit contributions
 argue+0.009
 weep+0.009
 worry+0.009
 panic+0.008
cle+0.008
Neg logit contributions
der-0.007
 is-0.006
 makes-0.006
boards-0.006
 can-0.006
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
 has+0.093
 is+0.092
lic+0.092
 helps+0.090
boards+0.089
Neg logit contributions
 argue-0.089
 worry-0.089
 weep-0.084
 reason-0.082
 panic-0.080
Token This
Token ID770
Feature activation+7.07e+00

Pos logit contributions
der+0.027
 is+0.025
lic+0.024
 can+0.024
 has+0.023
Neg logit contributions
 argue-0.036
cle-0.036
 elbows-0.036
keeping-0.034
 mischief-0.034
Token was
Token ID373
Feature activation+3.24e+00

Pos logit contributions
der+0.139
lic+0.131
ows+0.122
icked+0.117
arl+0.114
Neg logit contributions
 reason-0.147
 argue-0.141
 panic-0.137
 worry-0.135
 mischief-0.133
Token going
Token ID1016
Feature activation-1.03e+00

Pos logit contributions
der+0.055
 is+0.051
 can+0.049
lic+0.049
 has+0.049
Neg logit contributions
 argue-0.072
cle-0.071
 reason-0.070
 afar-0.069
 elbows-0.068
Token to
Token ID284
Feature activation-2.36e+00

Pos logit contributions
 argue+0.024
 reason+0.023
cle+0.023
keeping+0.023
 elbows+0.022
Neg logit contributions
der-0.018
 is-0.016
lic-0.016
 can-0.015
 has-0.015
Token be
Token ID307
Feature activation-2.28e-01

Pos logit contributions
 argue+0.051
cle+0.050
keeping+0.049
 elbows+0.049
 reason+0.048
Neg logit contributions
der-0.042
 is-0.042
 can-0.040
 has-0.038
 makes-0.037
Token the
Token ID262
Feature activation+2.41e+00

Pos logit contributions
 argue+0.006
cle+0.006
keeping+0.005
 weep+0.005
 elbows+0.005
Neg logit contributions
 is-0.003
der-0.003
 can-0.003
 has-0.003
lic-0.003
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
 argue+0.078
cle+0.076
keeping+0.075
 elbows+0.074
 reason+0.074
Neg logit contributions
der-0.043
 is-0.041
 can-0.039
lic-0.038
 has-0.038
Token
Token ID220
Feature activation+1.25e+00

Pos logit contributions
der+0.010
 is+0.010
 can+0.009
 has+0.009
lic+0.009
Neg logit contributions
 argue-0.018
cle-0.017
keeping-0.017
 elbows-0.017
 reason-0.017
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
der+0.020
 is+0.018
 can+0.018
 has+0.017
lic+0.017
Neg logit contributions
 argue-0.029
cle-0.029
keeping-0.029
 elbows-0.028
 reason-0.028
Token
Token ID198
Feature activation+9.57e-01

Pos logit contributions
der+0.016
 is+0.016
 can+0.015
 has+0.014
lic+0.014
Neg logit contributions
 argue-0.030
cle-0.029
keeping-0.029
 elbows-0.029
 reason-0.028
TokenThe
Token ID464
Feature activation+5.33e+00

Pos logit contributions
der+0.016
 is+0.016
 can+0.016
lic+0.015
 has+0.015
Neg logit contributions
 argue-0.022
cle-0.021
keeping-0.021
 elbows-0.021
 reason-0.020
Token symbol
Token ID6194
Feature activation+7.07e+00

Pos logit contributions
der+0.086
 is+0.085
 can+0.081
 has+0.078
lic+0.076
Neg logit contributions
 argue-0.126
cle-0.122
keeping-0.121
 elbows-0.120
 reason-0.119
Token said
Token ID531
Feature activation+4.42e+00

Pos logit contributions
der+0.105
lic+0.094
 is+0.092
 can+0.091
boards+0.090
Neg logit contributions
 reason-0.164
 argue-0.160
cle-0.159
 weep-0.158
 worry-0.154
Token goodbye
Token ID24829
Feature activation+7.00e-01

Pos logit contributions
 has+0.068
boards+0.067
der+0.067
 is+0.066
 makes+0.066
Neg logit contributions
 worry-0.089
 argue-0.088
 weep-0.087
 reason-0.085
 Susan-0.085
Token and
Token ID290
Feature activation+5.03e-01

Pos logit contributions
der+0.011
 is+0.011
lic+0.011
 can+0.011
boards+0.010
Neg logit contributions
 argue-0.015
keeping-0.015
cle-0.015
 reason-0.015
 weep-0.015
Token took
Token ID1718
Feature activation+3.85e+00

Pos logit contributions
der+0.011
boards+0.010
 is+0.010
lic+0.010
 can+0.009
Neg logit contributions
 argue-0.009
 weep-0.009
keeping-0.009
 worry-0.009
 reason-0.008
Token off
Token ID572
Feature activation-1.43e+00

Pos logit contributions
der+0.078
 is+0.068
boards+0.068
 can+0.068
lic+0.068
Neg logit contributions
keeping-0.075
 reason-0.074
 argue-0.074
 elbows-0.073
 mischief-0.072
Token bed
Token ID3996
Feature activation-4.42e-01

Pos logit contributions
 argue+0.000
cle+0.000
 elbows+0.000
ge+0.000
keeping+0.000
Neg logit contributions
 is-0.000
der-0.000
 all-0.000
 that-0.000
 can-0.000
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
keeping+0.010
 argue+0.009
cle+0.009
 reason+0.009
 weep+0.009
Neg logit contributions
 is-0.007
 can-0.007
 has-0.007
der-0.007
lic-0.007
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
der+0.020
 is+0.019
 can+0.018
 has+0.018
lic+0.017
Neg logit contributions
 argue-0.031
cle-0.030
keeping-0.030
 elbows-0.029
 reason-0.029
Token
Token ID198
Feature activation+9.95e-01

Pos logit contributions
 is+0.012
lic+0.011
 can+0.010
der+0.010
 has+0.009
Neg logit contributions
 argue-0.034
 elbows-0.034
keeping-0.033
cle-0.032
 afar-0.032
TokenThe
Token ID464
Feature activation+4.92e+00

Pos logit contributions
der+0.015
 is+0.015
 can+0.015
lic+0.015
 has+0.014
Neg logit contributions
 argue-0.024
keeping-0.023
 elbows-0.023
cle-0.023
 weep-0.022
Tokencube
Token ID40296
Feature activation+7.06e+00

Pos logit contributions
der+0.077
 is+0.075
 can+0.072
 has+0.069
lic+0.068
Neg logit contributions
 argue-0.120
cle-0.116
keeping-0.115
 elbows-0.114
 reason-0.113
Token was
Token ID373
Feature activation+2.87e+00

Pos logit contributions
der+0.097
 is+0.088
lic+0.085
 can+0.085
 has+0.081
Neg logit contributions
 argue-0.182
cle-0.179
 reason-0.176
 weep-0.174
keeping-0.174
Token so
Token ID523
Feature activation-1.72e-01

Pos logit contributions
der+0.047
 is+0.046
 can+0.044
 has+0.043
lic+0.042
Neg logit contributions
 argue-0.067
cle-0.065
keeping-0.064
 elbows-0.064
 reason-0.063
Token happy
Token ID3772
Feature activation+3.55e-01

Pos logit contributions
 argue+0.002
 weep+0.002
 worry+0.002
 panic+0.002
cle+0.002
Neg logit contributions
der-0.004
boards-0.004
 is-0.004
 can-0.004
lic-0.004
Token to
Token ID284
Feature activation-2.17e+00

Pos logit contributions
der+0.004
 is+0.004
lic+0.004
 can+0.004
 has+0.004
Neg logit contributions
 argue-0.009
keeping-0.009
 reason-0.009
cle-0.009
 weep-0.009
Token have
Token ID423
Feature activation+1.23e+00

Pos logit contributions
 argue+0.045
cle+0.045
 elbows+0.044
keeping+0.044
 reason+0.043
Neg logit contributions
 is-0.040
der-0.039
 can-0.039
 has-0.037
 makes-0.036
Token his
Token ID465
Feature activation-9.57e-01

Pos logit contributions
der+0.015
 is+0.015
lic+0.015
 can+0.014
 has+0.014
Neg logit contributions
 argue-0.019
 reason-0.018
cle-0.018
keeping-0.018
 elbows-0.018
Token family
Token ID1641
Feature activation-1.88e+00

Pos logit contributions
 argue+0.021
keeping+0.021
cle+0.020
 elbows+0.020
Susan+0.020
Neg logit contributions
 is-0.018
 can-0.017
der-0.016
 has-0.016
 makes-0.016
Token that
Token ID326
Feature activation+1.89e+00

Pos logit contributions
 argue+0.048
cle+0.047
keeping+0.047
 reason+0.046
 weep+0.045
Neg logit contributions
der-0.026
 is-0.024
lic-0.024
 can-0.023
 has-0.022
Token night
Token ID1755
Feature activation-3.72e-01

Pos logit contributions
der+0.030
lic+0.028
 is+0.026
 can+0.024
boards+0.024
Neg logit contributions
 argue-0.046
 reason-0.045
cle-0.044
 weep-0.044
keeping-0.044
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
cle+0.008
keeping+0.008
 reason+0.008
 argue+0.008
aster+0.008
Neg logit contributions
der-0.006
lic-0.006
 is-0.005
 can-0.005
icked-0.005
Token This
Token ID770
Feature activation+7.05e+00

Pos logit contributions
der+0.018
 is+0.017
 can+0.017
lic+0.016
 has+0.016
Neg logit contributions
 argue-0.037
cle-0.036
keeping-0.036
 reason-0.035
 elbows-0.035
Token time
Token ID640
Feature activation+8.37e-01

Pos logit contributions
der+0.172
lic+0.169
icked+0.155
ows+0.151
arl+0.150
Neg logit contributions
 reason-0.116
 panic-0.105
 mischief-0.105
 worry-0.104
 moderation-0.103
Token,
Token ID11
Feature activation+4.86e-01

Pos logit contributions
der+0.016
lic+0.015
arl+0.012
auc+0.012
boards+0.012
Neg logit contributions
 argue-0.017
 worry-0.017
 reason-0.017
 Susan-0.016
aster-0.016
Token instead
Token ID2427
Feature activation+1.65e+00

Pos logit contributions
der+0.007
lic+0.007
 is+0.007
 can+0.006
 has+0.006
Neg logit contributions
 argue-0.012
cle-0.012
 reason-0.011
keeping-0.011
 elbows-0.011
Token of
Token ID286
Feature activation-8.63e-01

Pos logit contributions
der+0.018
lic+0.015
boards+0.013
 is+0.012
 can+0.012
Neg logit contributions
 reason-0.046
 worry-0.046
 argue-0.045
 weep-0.045
keeping-0.045
Token running
Token ID2491
Feature activation-1.08e+00

Pos logit contributions
 argue+0.015
 reason+0.014
 weep+0.014
 elbows+0.014
keeping+0.014
Neg logit contributions
der-0.019
lic-0.019
 is-0.019
 can-0.018
boards-0.018
Token was
Token ID373
Feature activation+2.63e+00

Pos logit contributions
der+0.042
lic+0.040
boards+0.039
 is+0.038
 can+0.037
Neg logit contributions
 argue-0.053
 weep-0.051
 reason-0.051
cle-0.050
 worry-0.049
Token also
Token ID635
Feature activation+1.23e+00

Pos logit contributions
der+0.049
 is+0.047
boards+0.046
lic+0.045
 can+0.045
Neg logit contributions
 argue-0.056
 weep-0.053
 reason-0.052
 worry-0.052
keeping-0.051
Token very
Token ID845
Feature activation+6.14e-01

Pos logit contributions
der+0.024
 is+0.022
boards+0.022
lic+0.022
 can+0.021
Neg logit contributions
 argue-0.025
 weep-0.024
keeping-0.023
 worry-0.023
 reason-0.023
Token happy
Token ID3772
Feature activation+2.43e-01

Pos logit contributions
der+0.014
boards+0.014
 is+0.013
lic+0.013
 can+0.013
Neg logit contributions
 argue-0.011
 worry-0.010
 weep-0.010
 panic-0.010
 reason-0.010
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
lic+0.004
 is+0.003
 has+0.003
 can+0.003
der+0.003
Neg logit contributions
 argue-0.006
keeping-0.006
 reason-0.006
 worry-0.005
cle-0.005
Token This
Token ID770
Feature activation+7.04e+00

Pos logit contributions
der+0.019
lic+0.017
boards+0.016
 is+0.016
 can+0.016
Neg logit contributions
cle-0.024
keeping-0.024
Susan-0.024
 argue-0.024
 Susan-0.024
Token was
Token ID373
Feature activation+3.34e+00

Pos logit contributions
der+0.138
lic+0.129
ows+0.120
boards+0.115
icked+0.114
Neg logit contributions
 reason-0.147
 argue-0.138
 moderation-0.136
 panic-0.133
cle-0.133
Token the
Token ID262
Feature activation+3.62e+00

Pos logit contributions
der+0.060
boards+0.055
 is+0.055
lic+0.055
 has+0.053
Neg logit contributions
 argue-0.071
cle-0.070
 reason-0.070
 worry-0.068
 afar-0.068
Token best
Token ID1266
Feature activation-1.08e+00

Pos logit contributions
 is+0.043
der+0.042
 can+0.041
 has+0.038
 makes+0.036
Neg logit contributions
 argue-0.101
keeping-0.099
cle-0.099
 elbows-0.097
 weep-0.095
Token day
Token ID1110
Feature activation-6.39e-01

Pos logit contributions
 argue+0.021
cle+0.020
keeping+0.020
 elbows+0.020
 reason+0.019
Neg logit contributions
der-0.022
 is-0.022
 can-0.021
 has-0.021
 makes-0.020
Token ever
Token ID1683
Feature activation-1.34e+00

Pos logit contributions
cle+0.011
keeping+0.011
aster+0.010
 argue+0.010
 reason+0.010
Neg logit contributions
der-0.015
lic-0.014
 is-0.013
 can-0.013
boards-0.013
Token to
Token ID284
Feature activation-2.42e+00

Pos logit contributions
der+0.001
 is+0.001
 can+0.001
 has+0.001
lic+0.001
Neg logit contributions
 argue-0.002
cle-0.002
 elbows-0.002
keeping-0.002
 reason-0.002
Token never
Token ID1239
Feature activation-3.07e+00

Pos logit contributions
 argue+0.051
cle+0.048
keeping+0.047
 reason+0.047
 elbows+0.047
Neg logit contributions
der-0.046
 is-0.045
 can-0.043
lic-0.042
 has-0.042
Token try
Token ID1949
Feature activation-2.13e+00

Pos logit contributions
cle+0.069
keeping+0.067
 elbows+0.066
 argue+0.064
 moderation+0.064
Neg logit contributions
 is-0.055
 can-0.054
der-0.053
 has-0.050
 makes-0.049
Token to
Token ID284
Feature activation-7.08e+00

Pos logit contributions
cle+0.060
 argue+0.057
keeping+0.057
 elbows+0.055
aster+0.055
Neg logit contributions
der-0.028
 is-0.028
 can-0.026
 all-0.025
 makes-0.025
Token catch
Token ID4929
Feature activation-6.35e-01

Pos logit contributions
cle+0.156
keeping+0.152
 elbows+0.151
 moderation+0.144
 mischief+0.140
Neg logit contributions
 is-0.132
 can-0.129
 all-0.123
der-0.118
 that-0.118
Token Twe
Token ID24205
Feature activation-1.10e+01

Pos logit contributions
cle+0.009
 argue+0.009
keeping+0.008
 elbows+0.008
aster+0.008
Neg logit contributions
 is-0.016
der-0.016
 can-0.016
 all-0.015
 that-0.015
Tokenety
Token ID2963
Feature activation-5.10e+00

Pos logit contributions
 mischief+0.158
 argue+0.157
aster+0.149
 elbows+0.148
 weep+0.146
Neg logit contributions
ing-0.289
 is-0.280
lic-0.275
 can-0.268
der-0.265
Token again
Token ID757
Feature activation-4.09e+00

Pos logit contributions
cle+0.157
 elbows+0.150
 argue+0.146
keeping+0.146
 mischief+0.142
Neg logit contributions
 is-0.059
der-0.055
 can-0.052
 that-0.049
 all-0.048
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 argue+0.117
cle+0.115
 elbows+0.115
keeping+0.112
 reason+0.111
Neg logit contributions
der-0.045
 is-0.045
 can-0.042
 has-0.039
 makes-0.037
Token From
Token ID3574
Feature activation+2.01e+00

Pos logit contributions
der+0.003
lic+0.003
 is+0.003
 can+0.003
boards+0.003
Neg logit contributions
 argue-0.004
 Johnson-0.004
cle-0.004
keeping-0.004
 weep-0.004
Token that
Token ID326
Feature activation+1.27e+00

Pos logit contributions
der+0.015
lic+0.013
 is+0.013
 can+0.012
boards+0.012
Neg logit contributions
cle-0.064
 argue-0.064
 afar-0.063
keeping-0.062
 elbows-0.062
Token who
Token ID508
Feature activation-4.14e-01

Pos logit contributions
cle+0.055
 argue+0.054
keeping+0.053
 reason+0.052
 elbows+0.051
Neg logit contributions
 is-0.031
 that-0.028
 can-0.028
der-0.028
 has-0.027
Token was
Token ID373
Feature activation+6.47e-01

Pos logit contributions
 argue+0.012
cle+0.012
keeping+0.012
 elbows+0.012
 reason+0.012
Neg logit contributions
der-0.004
 is-0.004
 can-0.004
 has-0.004
lic-0.004
Token trying
Token ID2111
Feature activation+5.07e-01

Pos logit contributions
 is+0.012
der+0.012
 can+0.011
 all+0.011
 has+0.011
Neg logit contributions
cle-0.014
keeping-0.013
 argue-0.013
 elbows-0.013
 reason-0.013
Token to
Token ID284
Feature activation-5.78e+00

Pos logit contributions
 is+0.005
 can+0.005
der+0.005
 all+0.005
 that+0.005
Neg logit contributions
cle-0.016
keeping-0.015
 elbows-0.015
 argue-0.015
aster-0.015
Token take
Token ID1011
Feature activation-1.15e-01

Pos logit contributions
cle+0.128
keeping+0.122
 elbows+0.120
 argue+0.113
 Johnson+0.112
Neg logit contributions
 is-0.111
 can-0.106
 all-0.104
 that-0.101
 makes-0.101
Token Twe
Token ID24205
Feature activation-1.08e+01

Pos logit contributions
 argue+0.003
cle+0.003
 elbows+0.003
ge+0.003
keeping+0.003
Neg logit contributions
 is-0.001
 can-0.001
 all-0.001
 that-0.001
 makes-0.001
Tokenety
Token ID2963
Feature activation-5.40e+00

Pos logit contributions
 argue+0.171
 mischief+0.158
 weep+0.152
 elbows+0.150
keeping+0.144
Neg logit contributions
ing-0.267
lic-0.252
 is-0.250
 can-0.242
s-0.241
Token's
Token ID338
Feature activation-3.82e+00

Pos logit contributions
cle+0.166
 elbows+0.162
 argue+0.159
 afar+0.158
 mischief+0.155
Neg logit contributions
 is-0.056
 all-0.052
 can-0.052
 that-0.049
ing-0.045
Token nest
Token ID16343
Feature activation-9.85e-01

Pos logit contributions
cle+0.094
 argue+0.092
Susan+0.090
keeping+0.089
ge+0.089
Neg logit contributions
 is-0.064
 can-0.064
 that-0.056
 makes-0.055
 looks-0.055
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 argue+0.026
cle+0.025
keeping+0.025
 elbows+0.025
 reason+0.024
Neg logit contributions
der-0.013
 is-0.013
 can-0.012
 has-0.012
lic-0.012
Token Twe
Token ID24205
Feature activation-6.13e+00

Pos logit contributions
 argue+0.022
cle+0.021
keeping+0.021
 elbows+0.021
 reason+0.021
Neg logit contributions
der-0.024
 is-0.023
 can-0.023
 has-0.022
lic-0.022
Token dare
Token ID16498
Feature activation-2.66e+00

Pos logit contributions
cle+0.067
 elbows+0.067
keeping+0.066
 argue+0.061
 moderation+0.060
Neg logit contributions
 can-0.076
 is-0.074
 has-0.070
der-0.069
lic-0.068
Token to
Token ID284
Feature activation-6.40e+00

Pos logit contributions
keeping+0.076
cle+0.075
 moderation+0.075
days+0.075
 elbows+0.073
Neg logit contributions
 is-0.029
 all-0.029
 can-0.029
 that-0.028
 for-0.026
Token climb
Token ID12080
Feature activation-3.47e+00

Pos logit contributions
cle+0.177
keeping+0.174
 elbows+0.164
 moderation+0.158
 generation+0.154
Neg logit contributions
 all-0.101
 can-0.101
 is-0.099
 that-0.094
 makes-0.088
Token up
Token ID510
Feature activation-2.08e+00

Pos logit contributions
cle+0.109
 argue+0.104
 reason+0.101
 moderation+0.100
aster+0.099
Neg logit contributions
 is-0.038
 all-0.038
 can-0.035
 that-0.034
der-0.032
Token to
Token ID284
Feature activation-4.16e+00

Pos logit contributions
cle+0.065
 argue+0.061
 elbows+0.060
please+0.059
 reason+0.059
Neg logit contributions
 is-0.022
 all-0.021
der-0.020
 can-0.020
 that-0.019
Token Twe
Token ID24205
Feature activation-1.04e+01

Pos logit contributions
cle+0.118
 elbows+0.114
 argue+0.111
keeping+0.110
days+0.107
Neg logit contributions
 is-0.057
 all-0.052
 can-0.051
 that-0.049
 looks-0.044
Tokenety
Token ID2963
Feature activation-4.23e+00

Pos logit contributions
 argue+0.157
 mischief+0.153
 weep+0.147
 reason+0.142
 footing+0.139
Neg logit contributions
ing-0.256
 is-0.254
lic-0.245
s-0.240
der-0.234
Token's
Token ID338
Feature activation-1.96e+00

Pos logit contributions
cle+0.139
 elbows+0.132
 argue+0.131
 reason+0.129
keeping+0.129
Neg logit contributions
 is-0.038
der-0.035
 can-0.034
 all-0.031
 that-0.031
Token nest
Token ID16343
Feature activation-3.11e-01

Pos logit contributions
cle+0.042
 argue+0.040
aster+0.039
keeping+0.038
 reason+0.038
Neg logit contributions
 is-0.038
der-0.038
 can-0.035
 has-0.034
 looks-0.033
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 argue+0.008
keeping+0.008
cle+0.007
 elbows+0.007
 reason+0.007
Neg logit contributions
der-0.005
 is-0.004
 can-0.004
lic-0.004
 has-0.004
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 argue+0.027
cle+0.026
 elbows+0.026
keeping+0.026
 reason+0.026
Neg logit contributions
der-0.014
 is-0.014
 can-0.013
 has-0.013
lic-0.012
Token The
Token ID383
Feature activation+3.82e+00

Pos logit contributions
 argue+0.022
cle+0.021
 elbows+0.021
keeping+0.021
 reason+0.021
Neg logit contributions
der-0.016
 is-0.016
 can-0.015
 has-0.015
lic-0.014
Token cat
Token ID3797
Feature activation-5.88e-02

Pos logit contributions
 is+0.073
 can+0.070
der+0.068
 has+0.068
 makes+0.065
Neg logit contributions
 argue-0.083
keeping-0.083
cle-0.081
ge-0.079
 elbows-0.077
Token wanted
Token ID2227
Feature activation+1.13e+00

Pos logit contributions
 argue+0.002
cle+0.002
 elbows+0.002
 reason+0.002
 weep+0.002
Neg logit contributions
der-0.001
 is-0.001
 can-0.001
lic-0.001
 has-0.001
Token to
Token ID284
Feature activation-5.00e+00

Pos logit contributions
 is+0.012
 can+0.012
 that+0.011
 all+0.011
 helps+0.010
Neg logit contributions
 argue-0.034
 elbows-0.033
cle-0.033
keeping-0.032
ge-0.032
Token eat
Token ID4483
Feature activation+5.73e-01

Pos logit contributions
cle+0.128
keeping+0.122
 elbows+0.119
ge+0.119
 perspective+0.115
Neg logit contributions
 is-0.086
 that-0.079
 all-0.079
 can-0.079
 makes-0.075
Token Twe
Token ID24205
Feature activation-1.04e+01

Pos logit contributions
 is+0.012
der+0.012
 can+0.011
 has+0.011
 all+0.011
Neg logit contributions
 argue-0.011
cle-0.011
keeping-0.011
 elbows-0.010
 reason-0.010
Tokenety
Token ID2963
Feature activation-4.97e+00

Pos logit contributions
 argue+0.164
 mischief+0.150
 elbows+0.145
 weep+0.141
 reason+0.137
Neg logit contributions
ing-0.259
 is-0.251
lic-0.248
 can-0.235
s-0.234
Token!
Token ID0
Feature activation-6.90e-01

Pos logit contributions
cle+0.157
 elbows+0.151
 argue+0.149
keeping+0.148
 afar+0.146
Neg logit contributions
 is-0.056
 all-0.049
 can-0.046
 that-0.046
der-0.046
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
 argue+0.016
cle+0.015
keeping+0.015
 elbows+0.015
 reason+0.015
Neg logit contributions
der-0.012
 is-0.011
 can-0.011
 has-0.010
lic-0.010
Token
Token ID198
Feature activation-6.14e-01

Pos logit contributions
 argue+0.007
 elbows+0.007
keeping+0.007
cle+0.007
 weep+0.007
Neg logit contributions
 is-0.003
 can-0.003
der-0.003
lic-0.003
 has-0.003
TokenTwe
Token ID32665
Feature activation-7.72e+00

Pos logit contributions
 argue+0.007
cle+0.006
keeping+0.006
 elbows+0.006
 reason+0.006
Neg logit contributions
der-0.018
 is-0.018
 can-0.017
 has-0.017
lic-0.017
Token Lucy
Token ID22162
Feature activation-2.55e+00

Pos logit contributions
 argue+0.057
cle+0.057
keeping+0.056
 elbows+0.055
cker+0.053
Neg logit contributions
 is-0.042
 all-0.040
der-0.040
 can-0.040
 that-0.039
Token noticed
Token ID6810
Feature activation+2.36e+00

Pos logit contributions
 argue+0.071
cle+0.070
keeping+0.067
 reason+0.067
 elbows+0.066
Neg logit contributions
 is-0.035
 can-0.032
 has-0.031
 makes-0.029
 looks-0.028
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
der+0.034
 is+0.033
 can+0.031
 has+0.030
lic+0.029
Neg logit contributions
 argue-0.060
cle-0.058
keeping-0.058
 elbows-0.057
 reason-0.057
Token weather
Token ID6193
Feature activation-1.76e+00

Pos logit contributions
der+0.027
 is+0.026
 can+0.025
 has+0.024
lic+0.024
Neg logit contributions
 argue-0.051
cle-0.049
keeping-0.049
 elbows-0.049
 reason-0.049
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
 argue+0.040
 reason+0.038
keeping+0.038
 weep+0.038
 panic+0.038
Neg logit contributions
der-0.028
 can-0.025
lic-0.025
 is-0.024
 helps-0.024
Token uncom
Token ID8820
Feature activation-1.02e+01

Pos logit contributions
der+0.030
 is+0.029
 can+0.028
 has+0.028
lic+0.028
Neg logit contributions
 argue-0.031
 reason-0.029
 elbows-0.029
cle-0.029
 weep-0.028
Tokenfort
Token ID3319
Feature activation-2.71e+00

Pos logit contributions
 mischief+0.208
 elbows+0.198
keeping+0.189
 reason+0.184
 moderation+0.178
Neg logit contributions
lic-0.189
 can-0.178
 looks-0.174
ing-0.169
der-0.169
Tokenably
Token ID1346
Feature activation-1.94e+00

Pos logit contributions
 argue+0.050
 elbows+0.049
cle+0.047
 weep+0.047
 mischief+0.046
Neg logit contributions
der-0.058
 is-0.055
lic-0.052
ing-0.052
 all-0.051
Token hot
Token ID3024
Feature activation-1.69e+00

Pos logit contributions
 argue+0.045
 reason+0.042
 weep+0.042
 worry+0.041
 elbows+0.041
Neg logit contributions
der-0.032
lic-0.031
 is-0.030
 can-0.029
boards-0.029
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
keeping+0.037
 argue+0.034
 reason+0.034
 elbows+0.034
cle+0.034
Neg logit contributions
der-0.029
lic-0.029
 is-0.027
 has-0.026
 can-0.026
Token She
Token ID1375
Feature activation+1.78e+00

Pos logit contributions
 argue+0.016
cle+0.016
keeping+0.016
 elbows+0.015
 reason+0.015
Neg logit contributions
der-0.005
 is-0.005
 can-0.005
 has-0.005
lic-0.004
Token Tom
Token ID4186
Feature activation-1.14e+00

Pos logit contributions
 argue+0.027
cle+0.026
 elbows+0.026
keeping+0.026
 reason+0.026
Neg logit contributions
 is-0.013
der-0.013
 can-0.012
 has-0.012
lic-0.011
Token was
Token ID373
Feature activation+1.80e+00

Pos logit contributions
 argue+0.031
cle+0.030
keeping+0.030
 elbows+0.030
 reason+0.030
Neg logit contributions
 is-0.014
der-0.014
 can-0.013
 has-0.013
lic-0.012
Token trying
Token ID2111
Feature activation+7.98e-01

Pos logit contributions
 is+0.030
 has+0.028
 all+0.028
 can+0.028
der+0.028
Neg logit contributions
cle-0.043
keeping-0.042
 argue-0.042
aster-0.040
 reason-0.040
Token to
Token ID284
Feature activation-4.99e+00

Pos logit contributions
 is+0.008
 can+0.007
 all+0.007
 that+0.007
der+0.007
Neg logit contributions
keeping-0.026
cle-0.025
 elbows-0.024
 argue-0.024
 generation-0.024
Token catch
Token ID4929
Feature activation+6.32e-01

Pos logit contributions
cle+0.105
keeping+0.103
 elbows+0.099
 moderation+0.093
days+0.091
Neg logit contributions
 is-0.096
 all-0.096
 can-0.094
 that-0.093
 makes-0.090
Token Twe
Token ID24205
Feature activation-1.02e+01

Pos logit contributions
 is+0.009
 all+0.008
 that+0.008
 can+0.008
der+0.008
Neg logit contributions
cle-0.018
 argue-0.017
keeping-0.016
 elbows-0.016
days-0.016
Tokenety
Token ID2963
Feature activation-4.04e+00

Pos logit contributions
 argue+0.159
 mischief+0.158
 elbows+0.146
 reason+0.140
 weep+0.139
Neg logit contributions
ing-0.253
lic-0.246
 is-0.244
s-0.236
der-0.230
Token,
Token ID11
Feature activation-3.13e+00

Pos logit contributions
cle+0.126
keeping+0.121
 argue+0.120
 elbows+0.119
 afar+0.116
Neg logit contributions
 is-0.045
 can-0.041
der-0.040
 all-0.037
 that-0.036
Token but
Token ID475
Feature activation-7.02e-01

Pos logit contributions
 argue+0.073
cle+0.073
keeping+0.071
 elbows+0.069
 reason+0.068
Neg logit contributions
 is-0.052
der-0.052
 can-0.051
 has-0.049
 makes-0.046
Token Twe
Token ID24205
Feature activation-6.94e+00

Pos logit contributions
 argue+0.021
keeping+0.021
cle+0.021
 elbows+0.020
 reason+0.020
Neg logit contributions
 is-0.007
der-0.007
 can-0.007
 has-0.006
 makes-0.006
Tokenety
Token ID2963
Feature activation-5.00e-01

Pos logit contributions
 mischief+0.107
 argue+0.105
 reason+0.102
keeping+0.097
aster+0.096
Neg logit contributions
 is-0.171
der-0.165
ing-0.165
 can-0.162
 has-0.161
Token happy
Token ID3772
Feature activation-1.83e-01

Pos logit contributions
 is+0.031
der+0.031
 can+0.030
 has+0.029
 all+0.029
Neg logit contributions
cle-0.041
 argue-0.041
keeping-0.040
 reason-0.038
 elbows-0.038
Token that
Token ID326
Feature activation+1.05e+00

Pos logit contributions
 argue+0.005
cle+0.005
keeping+0.005
 reason+0.005
 elbows+0.005
Neg logit contributions
der-0.002
 is-0.002
 can-0.002
lic-0.002
 has-0.002
Token she
Token ID673
Feature activation+4.59e-01

Pos logit contributions
der+0.016
 is+0.016
 can+0.015
 has+0.014
 makes+0.014
Neg logit contributions
 argue-0.026
cle-0.026
keeping-0.025
 elbows-0.025
 reason-0.025
Token could
Token ID714
Feature activation-2.00e+00

Pos logit contributions
der+0.008
 is+0.006
lic+0.006
boards+0.006
 can+0.006
Neg logit contributions
 argue-0.011
 weep-0.010
keeping-0.010
cle-0.010
 reason-0.010
Token help
Token ID1037
Feature activation-3.73e-01

Pos logit contributions
 argue+0.037
cle+0.034
 reason+0.034
 weep+0.034
keeping+0.034
Neg logit contributions
der-0.044
 is-0.042
lic-0.040
 can-0.040
 has-0.040
Token Twe
Token ID24205
Feature activation-1.02e+01

Pos logit contributions
 argue+0.009
cle+0.009
 elbows+0.008
keeping+0.008
 afar+0.008
Neg logit contributions
 is-0.006
der-0.006
 can-0.006
 all-0.006
 makes-0.006
Tokenety
Token ID2963
Feature activation-5.08e+00

Pos logit contributions
 mischief+0.158
keeping+0.148
 argue+0.142
 elbows+0.139
 afar+0.137
Neg logit contributions
ing-0.246
der-0.238
 is-0.226
lic-0.224
 can-0.222
Token and
Token ID290
Feature activation-2.31e+00

Pos logit contributions
cle+0.106
 argue+0.104
 elbows+0.103
keeping+0.101
 afar+0.099
Neg logit contributions
 is-0.095
der-0.094
 can-0.092
 all-0.086
 has-0.086
Token learned
Token ID4499
Feature activation+2.95e+00

Pos logit contributions
 argue+0.049
 reason+0.046
 weep+0.046
keeping+0.046
cle+0.046
Neg logit contributions
der-0.044
lic-0.042
boards-0.040
 is-0.040
ing-0.038
Token that
Token ID326
Feature activation+1.69e+00

Pos logit contributions
lic+0.041
 has+0.041
 can+0.038
der+0.038
boards+0.037
Neg logit contributions
 argue-0.071
 reason-0.071
keeping-0.069
 afar-0.067
 worry-0.066
Token it
Token ID340
Feature activation+3.23e+00

Pos logit contributions
 is+0.023
der+0.021
 can+0.021
 that+0.021
 all+0.021
Neg logit contributions
cle-0.047
 argue-0.045
keeping-0.045
 elbows-0.045
cker-0.043
Token!
Token ID0
Feature activation+4.41e-01

Pos logit contributions
der+0.037
lic+0.037
 is+0.034
 can+0.033
 makes+0.033
Neg logit contributions
keeping-0.048
 argue-0.047
 weep-0.046
 reason-0.044
 elbows-0.043
Token Tom
Token ID4186
Feature activation-6.29e-01

Pos logit contributions
der+0.010
 is+0.010
 can+0.009
lic+0.009
 has+0.009
Neg logit contributions
 argue-0.008
cle-0.007
keeping-0.007
 elbows-0.007
 reason-0.007
Token wanted
Token ID2227
Feature activation+1.54e+00

Pos logit contributions
 argue+0.017
cle+0.016
 reason+0.016
keeping+0.016
 elbows+0.016
Neg logit contributions
der-0.008
 is-0.008
 can-0.007
lic-0.007
 has-0.007
Token to
Token ID284
Feature activation-5.10e+00

Pos logit contributions
 is+0.018
 can+0.018
 that+0.017
der+0.017
 all+0.016
Neg logit contributions
 argue-0.044
 elbows-0.043
cle-0.043
keeping-0.042
 weep-0.041
Token catch
Token ID4929
Feature activation+6.34e-01

Pos logit contributions
cle+0.129
 elbows+0.127
keeping+0.121
ge+0.120
 perspective+0.117
Neg logit contributions
 is-0.083
 all-0.081
 can-0.080
 that-0.079
 big-0.073
Token Twe
Token ID24205
Feature activation-1.01e+01

Pos logit contributions
 is+0.012
 all+0.011
 can+0.011
 that+0.011
der+0.011
Neg logit contributions
cle-0.014
 argue-0.014
 elbows-0.013
keeping-0.013
days-0.012
Tokenety
Token ID2963
Feature activation-4.20e+00

Pos logit contributions
 argue+0.153
 mischief+0.146
 elbows+0.139
 weep+0.130
keeping+0.128
Neg logit contributions
ing-0.257
 is-0.249
lic-0.249
 can-0.238
der-0.237
Token and
Token ID290
Feature activation-2.66e+00

Pos logit contributions
cle+0.133
 elbows+0.127
keeping+0.126
 argue+0.126
 generation+0.122
Neg logit contributions
 is-0.045
 can-0.041
der-0.039
 all-0.038
 that-0.036
Token eat
Token ID4483
Feature activation+1.69e+00

Pos logit contributions
 argue+0.056
cle+0.055
keeping+0.054
 elbows+0.054
 reason+0.053
Neg logit contributions
 is-0.048
der-0.048
 can-0.047
 has-0.045
 makes-0.044
Token her
Token ID607
Feature activation-3.19e+00

Pos logit contributions
der+0.032
lic+0.031
 is+0.030
 has+0.030
 can+0.029
Neg logit contributions
 argue-0.034
 reason-0.033
keeping-0.032
cle-0.032
 weep-0.032
Token for
Token ID329
Feature activation-1.09e+00

Pos logit contributions
cle+0.090
 argue+0.086
 elbows+0.084
aster+0.084
keeping+0.082
Neg logit contributions
 is-0.048
 all-0.044
 can-0.042
 looks-0.039
der-0.039
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
der+0.004
 is+0.004
lic+0.004
 has+0.004
 can+0.004
Neg logit contributions
 argue-0.006
 reason-0.006
 weep-0.006
 elbows-0.006
keeping-0.006
Token Bugs
Token ID44991
Feature activation+1.01e+00

Pos logit contributions
 argue+0.023
keeping+0.022
cle+0.022
 elbows+0.022
 weep+0.021
Neg logit contributions
der-0.029
 is-0.029
 can-0.029
 all-0.027
 has-0.027
Token said
Token ID531
Feature activation+3.00e+00

Pos logit contributions
der+0.015
 is+0.014
 can+0.013
 has+0.013
 makes+0.013
Neg logit contributions
 argue-0.024
cle-0.024
 reason-0.024
 weep-0.024
keeping-0.023
Token,
Token ID11
Feature activation-4.10e-01

Pos logit contributions
der+0.047
 is+0.045
 has+0.043
lic+0.042
boards+0.042
Neg logit contributions
 argue-0.070
 weep-0.066
 worry-0.066
 reason-0.065
cle-0.064
Token "
Token ID366
Feature activation-1.70e+00

Pos logit contributions
 argue+0.013
cle+0.012
 reason+0.012
 weep+0.012
keeping+0.012
Neg logit contributions
der-0.003
 is-0.003
 can-0.003
 has-0.003
 makes-0.003
TokenTwe
Token ID32665
Feature activation-9.83e+00

Pos logit contributions
 argue+0.037
 elbows+0.035
aster+0.035
keeping+0.034
cle+0.034
Neg logit contributions
 is-0.031
 can-0.031
der-0.030
lic-0.029
 has-0.028
Tokenety
Token ID2963
Feature activation-2.35e+00

Pos logit contributions
 mischief+0.157
 argue+0.154
 reason+0.144
 weep+0.143
 panic+0.142
Neg logit contributions
ing-0.227
der-0.221
 is-0.221
 can-0.220
 has-0.215
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
 argue+0.063
cle+0.061
keeping+0.060
 elbows+0.060
 reason+0.060
Neg logit contributions
der-0.031
 is-0.029
 can-0.028
lic-0.027
 has-0.027
Token I
Token ID314
Feature activation-3.34e+00

Pos logit contributions
 argue+0.024
 worry+0.024
 reason+0.024
 weep+0.024
 panic+0.023
Neg logit contributions
der-0.018
lic-0.017
boards-0.016
ows-0.015
 is-0.015
Token have
Token ID423
Feature activation+2.65e-01

Pos logit contributions
keeping+0.073
cle+0.073
 elbows+0.073
 argue+0.072
Susan+0.068
Neg logit contributions
 is-0.064
 can-0.064
 has-0.058
 makes-0.055
 looks-0.054
Token a
Token ID257
Feature activation-1.58e+00

Pos logit contributions
der+0.004
 is+0.004
 can+0.004
 has+0.004
lic+0.004
Neg logit contributions
 argue-0.006
cle-0.006
keeping-0.006
 elbows-0.006
 reason-0.006
Token a
Token ID257
Feature activation-2.55e+00

Pos logit contributions
 argue+0.011
cle+0.010
keeping+0.010
 elbows+0.010
days+0.010
Neg logit contributions
 all-0.007
 is-0.007
 that-0.007
der-0.007
 can-0.007
Token little
Token ID1310
Feature activation-4.25e+00

Pos logit contributions
 elbows+0.065
keeping+0.065
 moderation+0.061
cle+0.060
 argue+0.060
Neg logit contributions
 can-0.041
 is-0.039
der-0.037
 has-0.035
lic-0.034
Token bird
Token ID6512
Feature activation-3.71e+00

Pos logit contributions
cle+0.095
 elbows+0.093
keeping+0.093
Susan+0.090
 moderation+0.090
Neg logit contributions
 can-0.077
der-0.076
 is-0.076
 looks-0.069
 has-0.069
Tokenie
Token ID494
Feature activation-4.08e+00

Pos logit contributions
cle+0.116
 elbows+0.114
 argue+0.113
 mischief+0.113
keeping+0.112
Neg logit contributions
 can-0.037
 is-0.036
 that-0.035
 of-0.031
der-0.030
Token like
Token ID588
Feature activation-1.22e+00

Pos logit contributions
 argue+0.120
cle+0.120
 elbows+0.119
keeping+0.115
 afar+0.114
Neg logit contributions
 can-0.046
 is-0.046
der-0.041
 has-0.040
 that-0.037
Token Twe
Token ID24205
Feature activation-9.70e+00

Pos logit contributions
 argue+0.031
cle+0.030
 elbows+0.030
keeping+0.030
 reason+0.029
Neg logit contributions
der-0.018
 is-0.018
 can-0.017
 has-0.016
 that-0.015
Tokenety
Token ID2963
Feature activation-4.67e+00

Pos logit contributions
 argue+0.168
 mischief+0.162
keeping+0.152
 reason+0.152
 weep+0.149
Neg logit contributions
ing-0.224
der-0.216
 is-0.211
udd-0.205
 can-0.205
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
cle+0.141
 argue+0.141
 elbows+0.138
keeping+0.137
 reason+0.134
Neg logit contributions
 is-0.054
 can-0.050
der-0.045
 has-0.045
 that-0.041
Token So
Token ID1406
Feature activation+4.44e-01

Pos logit contributions
 argue+0.018
cle+0.018
 elbows+0.018
keeping+0.018
 reason+0.018
Neg logit contributions
der-0.009
 is-0.009
 can-0.009
 has-0.008
lic-0.008
Token,
Token ID11
Feature activation+7.92e-01

Pos logit contributions
der+0.006
 is+0.006
 can+0.005
 has+0.005
lic+0.005
Neg logit contributions
 argue-0.012
cle-0.012
keeping-0.012
 elbows-0.011
 reason-0.011
Token Twe
Token ID24205
Feature activation-5.31e+00

Pos logit contributions
 that+0.016
 all+0.015
 is+0.015
 can+0.015
der+0.015
Neg logit contributions
keeping-0.017
 argue-0.017
 elbows-0.016
cle-0.016
 reason-0.015
Token world
Token ID995
Feature activation-2.44e+00

Pos logit contributions
der+0.019
 is+0.018
 can+0.018
 has+0.017
 makes+0.016
Neg logit contributions
 argue-0.023
cle-0.022
keeping-0.022
 elbows-0.022
 reason-0.021
Token from
Token ID422
Feature activation-1.87e+00

Pos logit contributions
 argue+0.059
 weep+0.055
 reason+0.055
keeping+0.054
 worry+0.054
Neg logit contributions
der-0.040
lic-0.036
 is-0.032
boards-0.032
 can-0.031
Token up
Token ID510
Feature activation-1.66e+00

Pos logit contributions
cle+0.054
 argue+0.053
 elbows+0.051
keeping+0.050
 reason+0.050
Neg logit contributions
 is-0.023
 all-0.023
 that-0.022
der-0.021
 can-0.020
Token high
Token ID1029
Feature activation-3.05e+00

Pos logit contributions
cle+0.042
 argue+0.041
 elbows+0.041
keeping+0.040
 reason+0.039
Neg logit contributions
 is-0.025
 can-0.024
der-0.023
 has-0.022
 all-0.022
Token like
Token ID588
Feature activation-1.16e+00

Pos logit contributions
 argue+0.075
keeping+0.073
cle+0.073
 reason+0.072
 elbows+0.072
Neg logit contributions
der-0.045
 is-0.044
 can-0.042
lic-0.041
 has-0.040
Token Twe
Token ID24205
Feature activation-9.66e+00

Pos logit contributions
 argue+0.026
keeping+0.025
cle+0.025
 reason+0.025
 elbows+0.025
Neg logit contributions
der-0.019
 is-0.019
 can-0.018
lic-0.018
 has-0.018
Tokenety
Token ID2963
Feature activation-4.11e+00

Pos logit contributions
 argue+0.152
 mischief+0.135
 weep+0.133
 reason+0.131
 elbows+0.131
Neg logit contributions
ing-0.250
der-0.244
lic-0.237
 is-0.235
 has-0.226
Token,
Token ID11
Feature activation-3.45e+00

Pos logit contributions
 argue+0.115
cle+0.114
 elbows+0.111
keeping+0.110
 reason+0.109
Neg logit contributions
 is-0.052
der-0.051
 can-0.049
 has-0.046
 makes-0.042
Token but
Token ID475
Feature activation-6.44e-01

Pos logit contributions
 argue+0.081
cle+0.078
keeping+0.077
 elbows+0.076
 reason+0.076
Neg logit contributions
der-0.057
 is-0.055
 can-0.053
 has-0.051
lic-0.051
Token he
Token ID339
Feature activation+2.04e+00

Pos logit contributions
 argue+0.018
cle+0.017
keeping+0.017
 elbows+0.017
 reason+0.017
Neg logit contributions
der-0.008
 is-0.008
 can-0.008
 has-0.007
lic-0.007
Token was
Token ID373
Feature activation+2.09e+00

Pos logit contributions
 is+0.029
der+0.028
 can+0.027
 has+0.026
 makes+0.024
Neg logit contributions
 argue-0.053
keeping-0.052
 elbows-0.052
cle-0.052
 reason-0.050
Token You
Token ID921
Feature activation-4.96e-01

Pos logit contributions
 argue+0.021
cle+0.020
keeping+0.020
 elbows+0.019
 reason+0.019
Neg logit contributions
der-0.011
 is-0.010
 can-0.010
lic-0.010
 has-0.009
Token should
Token ID815
Feature activation-2.41e+00

Pos logit contributions
 argue+0.015
 reason+0.014
cle+0.014
 weep+0.014
keeping+0.014
Neg logit contributions
der-0.006
 is-0.005
lic-0.005
 can-0.004
 has-0.004
Token take
Token ID1011
Feature activation-1.11e+00

Pos logit contributions
 argue+0.050
cle+0.048
keeping+0.047
 elbows+0.047
 reason+0.047
Neg logit contributions
der-0.046
 is-0.045
 can-0.043
 has-0.042
lic-0.042
Token your
Token ID534
Feature activation-2.63e+00

Pos logit contributions
 argue+0.026
cle+0.026
keeping+0.025
 elbows+0.025
 reason+0.025
Neg logit contributions
der-0.018
 is-0.018
 can-0.017
 has-0.016
lic-0.016
Token rain
Token ID6290
Feature activation-3.56e+00

Pos logit contributions
cle+0.064
Susan+0.062
 argue+0.060
keeping+0.059
ge+0.059
Neg logit contributions
 can-0.046
 is-0.046
 looks-0.040
 has-0.040
 helps-0.040
Tokenco
Token ID1073
Feature activation-9.60e+00

Pos logit contributions
 argue+0.078
cle+0.077
 elbows+0.075
keeping+0.074
 reason+0.074
Neg logit contributions
 is-0.062
der-0.062
 can-0.060
 has-0.056
lic-0.056
Tokenats
Token ID1381
Feature activation-2.99e+00

Pos logit contributions
 elbows+0.225
 argue+0.210
 afar+0.209
 worry+0.207
 weep+0.207
Neg logit contributions
 is-0.174
 that-0.172
 of-0.166
ing-0.153
 all-0.152
Token and
Token ID290
Feature activation-4.47e+00

Pos logit contributions
 argue+0.076
cle+0.074
keeping+0.073
 reason+0.072
 elbows+0.072
Neg logit contributions
der-0.042
 is-0.041
 can-0.039
 has-0.037
lic-0.037
Token boots
Token ID14412
Feature activation-2.50e+00

Pos logit contributions
 argue+0.093
cle+0.089
keeping+0.088
 elbows+0.088
 reason+0.087
Neg logit contributions
der-0.085
 is-0.082
 can-0.079
lic-0.077
 has-0.077
Token if
Token ID611
Feature activation-1.16e+00

Pos logit contributions
 argue+0.053
keeping+0.051
 reason+0.048
cle+0.048
 weep+0.047
Neg logit contributions
der-0.044
lic-0.044
 is-0.041
 has-0.041
 looks-0.040
Token you
Token ID345
Feature activation-3.81e+00

Pos logit contributions
 argue+0.027
cle+0.026
 reason+0.026
 elbows+0.025
keeping+0.025
Neg logit contributions
der-0.019
lic-0.018
 is-0.017
 can-0.017
 has-0.016
Token like
Token ID588
Feature activation-1.27e+00

Pos logit contributions
 argue+0.040
 reason+0.037
cle+0.037
 weep+0.036
keeping+0.036
Neg logit contributions
der-0.045
 is-0.042
lic-0.041
 can-0.041
 has-0.040
Token to
Token ID284
Feature activation-5.53e+00

Pos logit contributions
 argue+0.030
cle+0.029
keeping+0.029
 elbows+0.029
 reason+0.028
Neg logit contributions
der-0.020
 is-0.020
 can-0.019
lic-0.018
 has-0.018
Token pick
Token ID2298
Feature activation+1.65e+00

Pos logit contributions
cle+0.099
keeping+0.099
 argue+0.099
 elbows+0.098
 reason+0.093
Neg logit contributions
 is-0.117
der-0.114
 can-0.114
 has-0.109
 makes-0.107
Token flowers
Token ID12734
Feature activation-2.24e+00

Pos logit contributions
der+0.030
lic+0.029
 has+0.026
 can+0.025
 is+0.025
Neg logit contributions
 afar-0.034
keeping-0.033
 argue-0.032
 mischief-0.032
 reason-0.032
Token on
Token ID319
Feature activation-2.60e+00

Pos logit contributions
 argue+0.047
keeping+0.047
cle+0.045
 moderation+0.045
 reason+0.044
Neg logit contributions
der-0.038
lic-0.037
 is-0.033
boards-0.033
 has-0.033
Token Tues
Token ID48496
Feature activation-9.57e+00

Pos logit contributions
 argue+0.062
cle+0.060
keeping+0.059
 reason+0.059
 elbows+0.058
Neg logit contributions
der-0.041
 is-0.039
 can-0.038
lic-0.038
 has-0.037
Tokendays
Token ID12545
Feature activation-1.37e+00

Pos logit contributions
 mischief+0.102
 reason+0.101
 panic+0.091
 worry+0.086
 weep+0.085
Neg logit contributions
 is-0.278
 can-0.260
 has-0.259
 looks-0.247
 makes-0.245
Token."
Token ID526
Feature activation-9.63e-01

Pos logit contributions
 argue+0.027
keeping+0.027
 reason+0.026
cle+0.026
days+0.026
Neg logit contributions
der-0.025
lic-0.023
 is-0.022
icked-0.021
 has-0.021
Token
Token ID220
Feature activation+2.33e-03

Pos logit contributions
 argue+0.028
 reason+0.027
 elbows+0.026
keeping+0.026
cle+0.026
Neg logit contributions
 is-0.012
der-0.011
 can-0.011
 has-0.011
 makes-0.010
Token
Token ID198
Feature activation+1.73e-01

Pos logit contributions
der+0.000
 is+0.000
 can+0.000
 has+0.000
lic+0.000
Neg logit contributions
 argue-0.000
cle-0.000
keeping-0.000
 elbows-0.000
 reason-0.000
Token
Token ID198
Feature activation+5.36e-02

Pos logit contributions
 is+0.002
der+0.001
 can+0.001
lic+0.001
 has+0.001
Neg logit contributions
 argue-0.005
 elbows-0.005
keeping-0.005
 reason-0.005
 weep-0.005
Tokenanas
Token ID15991
Feature activation-1.36e+00

Pos logit contributions
 argue+0.120
 elbows+0.117
 weep+0.113
keeping+0.110
 afar+0.108
Neg logit contributions
 is-0.137
der-0.136
lic-0.130
 can-0.129
 has-0.125
Token available
Token ID1695
Feature activation+5.70e-01

Pos logit contributions
 argue+0.032
cle+0.030
 reason+0.028
 worry+0.028
keeping+0.027
Neg logit contributions
der-0.023
lic-0.020
boards-0.018
 has-0.018
 is-0.017
Token for
Token ID329
Feature activation-7.90e-01

Pos logit contributions
der+0.009
lic+0.008
 is+0.008
 can+0.007
 has+0.007
Neg logit contributions
 argue-0.013
keeping-0.013
cle-0.013
 reason-0.013
 Johnson-0.012
Token feeding
Token ID13017
Feature activation-9.47e-01

Pos logit contributions
 argue+0.019
cle+0.018
 elbows+0.018
keeping+0.018
 weep+0.018
Neg logit contributions
der-0.012
 is-0.012
 can-0.012
 has-0.011
lic-0.011
Token the
Token ID262
Feature activation+4.72e-01

Pos logit contributions
 argue+0.022
cle+0.022
keeping+0.021
 reason+0.021
 elbows+0.021
Neg logit contributions
der-0.015
 is-0.014
 can-0.014
 has-0.014
lic-0.014
Token gor
Token ID30344
Feature activation-9.47e+00

Pos logit contributions
der+0.010
 is+0.010
 can+0.010
lic+0.009
 has+0.009
Neg logit contributions
 argue-0.009
cle-0.008
 reason-0.008
keeping-0.008
 elbows-0.008
Tokenillas
Token ID25314
Feature activation-5.00e+00

Pos logit contributions
 argue+0.106
 reason+0.103
 weep+0.102
 elbows+0.100
keeping+0.100
Neg logit contributions
 is-0.268
der-0.265
 can-0.258
 has-0.257
lic-0.247
Token."
Token ID526
Feature activation-5.21e-01

Pos logit contributions
 argue+0.113
cle+0.109
keeping+0.107
 reason+0.106
 elbows+0.105
Neg logit contributions
der-0.085
 is-0.082
 can-0.079
lic-0.077
 has-0.077
Token She
Token ID1375
Feature activation+1.76e+00

Pos logit contributions
cle+0.011
 argue+0.011
 elbows+0.011
 reason+0.010
keeping+0.010
Neg logit contributions
 is-0.010
der-0.010
 can-0.010
 has-0.009
boards-0.009
Token said
Token ID531
Feature activation+2.74e+00

Pos logit contributions
der+0.033
 is+0.030
 can+0.030
 has+0.028
lessly+0.028
Neg logit contributions
 argue-0.036
cle-0.036
 reason-0.034
please-0.033
 weep-0.033
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
der+0.034
 is+0.031
 has+0.031
lic+0.031
 makes+0.029
Neg logit contributions
 argue-0.071
cle-0.069
 worry-0.068
 reason-0.067
 weep-0.066
Token want
Token ID765
Feature activation+2.43e-01

Pos logit contributions
keeping+0.085
 elbows+0.085
cle+0.084
Susan+0.076
 Johnson+0.076
Neg logit contributions
 can-0.079
 is-0.076
 has-0.073
 makes-0.067
der-0.066
Token to
Token ID284
Feature activation-6.01e+00

Pos logit contributions
der+0.004
lic+0.004
 is+0.004
 can+0.004
 has+0.004
Neg logit contributions
 argue-0.005
 reason-0.005
cle-0.005
keeping-0.005
 weep-0.005
Token share
Token ID2648
Feature activation+1.87e-01

Pos logit contributions
 argue+0.108
cle+0.105
keeping+0.105
 elbows+0.104
 reason+0.101
Neg logit contributions
 is-0.129
der-0.127
 can-0.125
 has-0.120
 makes-0.118
Token my
Token ID616
Feature activation-5.35e+00

Pos logit contributions
der+0.003
lic+0.003
 is+0.003
 can+0.003
 has+0.003
Neg logit contributions
 argue-0.004
cle-0.004
 reason-0.004
keeping-0.004
 elbows-0.004
Token ice
Token ID4771
Feature activation-4.00e+00

Pos logit contributions
keeping+0.147
 argue+0.147
Susan+0.142
 elbows+0.142
cle+0.142
Neg logit contributions
 is-0.078
 can-0.069
 has-0.064
 looks-0.063
 makes-0.058
Token-
Token ID12
Feature activation-9.47e+00

Pos logit contributions
 argue+0.095
 elbows+0.090
cle+0.090
keeping+0.090
 reason+0.089
Neg logit contributions
 is-0.065
der-0.064
 can-0.061
 has-0.058
lic-0.058
Tokencream
Token ID36277
Feature activation-2.69e-01

Pos logit contributions
 argue+0.216
 elbows+0.209
 weep+0.202
 reason+0.202
 mischief+0.199
Neg logit contributions
 is-0.165
der-0.159
 can-0.154
lic-0.152
 has-0.148
Token with
Token ID351
Feature activation-8.90e-01

Pos logit contributions
 argue+0.006
cle+0.006
keeping+0.006
 reason+0.006
 elbows+0.006
Neg logit contributions
der-0.004
lic-0.004
 is-0.004
 has-0.004
 can-0.004
Token me
Token ID502
Feature activation-4.70e+00

Pos logit contributions
 argue+0.023
cle+0.023
 reason+0.022
 elbows+0.022
keeping+0.022
Neg logit contributions
der-0.012
 is-0.011
lic-0.011
 can-0.011
 has-0.010
Token?"
Token ID1701
Feature activation-8.34e-01

Pos logit contributions
 argue+0.103
cle+0.099
keeping+0.097
 elbows+0.097
 reason+0.096
Neg logit contributions
der-0.084
 is-0.081
 can-0.078
 has-0.076
lic-0.076
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
 argue+0.020
cle+0.019
 reason+0.019
keeping+0.019
 elbows+0.019
Neg logit contributions
der-0.013
 is-0.012
 can-0.012
lic-0.012
 has-0.012
Token.
Token ID13
Feature activation+5.59e-01

Pos logit contributions
der+0.032
lic+0.031
 is+0.027
boards+0.027
 can+0.027
Neg logit contributions
 argue-0.046
 reason-0.044
keeping-0.044
cle-0.043
 weep-0.043
Token We
Token ID775
Feature activation-6.85e-01

Pos logit contributions
der+0.007
 is+0.007
 can+0.006
lic+0.006
 has+0.006
Neg logit contributions
 argue-0.015
cle-0.015
keeping-0.014
 elbows-0.014
 reason-0.014
Token cannot
Token ID2314
Feature activation-3.60e+00

Pos logit contributions
 argue+0.016
cle+0.016
keeping+0.015
 reason+0.015
 elbows+0.015
Neg logit contributions
der-0.011
 is-0.011
 can-0.010
lic-0.010
 has-0.010
Token eat
Token ID4483
Feature activation-3.20e-01

Pos logit contributions
 argue+0.067
cle+0.065
keeping+0.064
 elbows+0.064
 reason+0.062
Neg logit contributions
der-0.075
 is-0.074
 can-0.072
 has-0.070
lic-0.068
Token play
Token ID711
Feature activation-1.48e+00

Pos logit contributions
 argue+0.008
cle+0.007
keeping+0.007
 reason+0.007
 elbows+0.007
Neg logit contributions
der-0.005
 is-0.005
 can-0.005
lic-0.005
 has-0.005
Token-
Token ID12
Feature activation-9.46e+00

Pos logit contributions
 argue+0.036
cle+0.034
keeping+0.034
 reason+0.034
aster+0.033
Neg logit contributions
der-0.023
 is-0.021
lic-0.021
 can-0.021
 has-0.020
Tokend
Token ID67
Feature activation-5.78e+00

Pos logit contributions
 argue+0.211
 elbows+0.211
cle+0.200
 weep+0.199
 reason+0.198
Neg logit contributions
 is-0.167
lic-0.162
der-0.159
 can-0.155
boards-0.152
Tokenoh
Token ID1219
Feature activation-4.56e+00

Pos logit contributions
 argue+0.116
cle+0.111
keeping+0.109
 reason+0.108
 elbows+0.106
Neg logit contributions
der-0.115
 is-0.109
 can-0.106
 has-0.104
lic-0.103
Token or
Token ID393
Feature activation-4.81e+00

Pos logit contributions
 elbows+0.101
 argue+0.101
cle+0.101
 reason+0.099
 panic+0.099
Neg logit contributions
 is-0.078
der-0.078
 can-0.071
 has-0.070
 makes-0.069
Token pr
Token ID778
Feature activation-4.40e+00

Pos logit contributions
 argue+0.105
cle+0.098
keeping+0.098
 elbows+0.097
 reason+0.097
Neg logit contributions
der-0.088
 is-0.084
lic-0.081
 can-0.080
 has-0.078
Tokenunes
Token ID4015
Feature activation-2.81e+00

Pos logit contributions
 argue+0.080
 elbows+0.075
keeping+0.075
cle+0.073
aster+0.071
Neg logit contributions
 is-0.090
der-0.089
 has-0.088
 can-0.087
lic-0.085
Token some
Token ID617
Feature activation-1.35e+00

Pos logit contributions
der+0.007
 has+0.006
lic+0.006
 is+0.006
 can+0.006
Neg logit contributions
 argue-0.010
cle-0.009
 afar-0.009
 reason-0.009
keeping-0.009
Token bananas
Token ID35484
Feature activation-3.91e+00

Pos logit contributions
 reason+0.028
 argue+0.028
cle+0.027
aster+0.026
 afar+0.026
Neg logit contributions
der-0.025
 has-0.022
 is-0.022
lic-0.022
 can-0.021
Token and
Token ID290
Feature activation-4.46e+00

Pos logit contributions
 argue+0.081
cle+0.076
 reason+0.075
aster+0.072
 moderation+0.072
Neg logit contributions
der-0.073
lic-0.069
 is-0.066
 has-0.065
 helps-0.064
Token feed
Token ID3745
Feature activation-1.26e+00

Pos logit contributions
 argue+0.085
cle+0.080
 reason+0.078
keeping+0.078
 elbows+0.077
Neg logit contributions
der-0.094
 is-0.090
lic-0.087
 can-0.086
 has-0.085
Token the
Token ID262
Feature activation+6.52e-01

Pos logit contributions
 argue+0.029
cle+0.029
 reason+0.028
keeping+0.028
aster+0.027
Neg logit contributions
der-0.021
 is-0.018
lic-0.018
 has-0.018
 can-0.017
Token gor
Token ID30344
Feature activation-9.43e+00

Pos logit contributions
der+0.015
 is+0.014
 can+0.014
lic+0.013
 has+0.013
Neg logit contributions
 argue-0.011
cle-0.011
 reason-0.011
keeping-0.010
 elbows-0.010
Tokenillas
Token ID25314
Feature activation-4.36e+00

Pos logit contributions
 argue+0.105
 reason+0.103
 elbows+0.101
 weep+0.100
keeping+0.100
Neg logit contributions
 is-0.269
der-0.264
 can-0.259
 has-0.257
lic-0.247
Token?"
Token ID1701
Feature activation-1.30e+00

Pos logit contributions
 argue+0.101
cle+0.095
 reason+0.092
keeping+0.091
 moderation+0.091
Neg logit contributions
der-0.075
lic-0.068
 is-0.067
boards-0.064
 can-0.064
Token Tim
Token ID5045
Feature activation-2.15e+00

Pos logit contributions
keeping+0.036
 elbows+0.036
cle+0.035
 unnecessary+0.035
 argue+0.035
Neg logit contributions
 is-0.020
 "-0.018
 He-0.018
 that-0.017
 can-0.017
Tokenmy
Token ID1820
Feature activation-2.07e-01

Pos logit contributions
keeping+0.046
 argue+0.046
 elbows+0.045
cle+0.044
 generation+0.044
Neg logit contributions
 is-0.041
 can-0.038
der-0.038
 has-0.037
 looks-0.036
Token nodded
Token ID14464
Feature activation+1.69e+00

Pos logit contributions
keeping+0.006
 argue+0.006
 elbows+0.005
cle+0.005
 afar+0.005
Neg logit contributions
der-0.003
 is-0.003
 can-0.002
 has-0.002
 looks-0.002
TokenHello
Token ID15496
Feature activation-1.63e+00

Pos logit contributions
 argue+0.038
 elbows+0.037
keeping+0.035
 mischief+0.035
 reason+0.035
Neg logit contributions
 is-0.028
der-0.026
 can-0.026
 has-0.025
lic-0.025
Token!
Token ID0
Feature activation-1.48e+00

Pos logit contributions
 argue+0.045
keeping+0.043
 elbows+0.043
cle+0.043
 reason+0.042
Neg logit contributions
 is-0.021
der-0.020
 can-0.020
 has-0.018
lic-0.018
Token My
Token ID2011
Feature activation-9.24e-01

Pos logit contributions
 argue+0.037
cle+0.035
 elbows+0.035
keeping+0.035
 reason+0.035
Neg logit contributions
der-0.022
 is-0.022
 can-0.021
 has-0.021
lic-0.020
Token name
Token ID1438
Feature activation+1.85e+00

Pos logit contributions
 argue+0.025
keeping+0.025
cle+0.025
 elbows+0.024
ge+0.024
Neg logit contributions
 is-0.013
 can-0.012
der-0.011
 has-0.011
 looks-0.011
Token is
Token ID318
Feature activation+4.89e-01

Pos logit contributions
der+0.017
lic+0.013
lessly+0.012
boards+0.011
auc+0.009
Neg logit contributions
 weep-0.056
e-0.056
 argue-0.055
 worry-0.055
 panic-0.054
Token Twe
Token ID24205
Feature activation-9.42e+00

Pos logit contributions
 is+0.009
der+0.008
 can+0.008
 has+0.008
lic+0.008
Neg logit contributions
 argue-0.011
 elbows-0.011
cle-0.011
keeping-0.011
 reason-0.010
Tokenety
Token ID2963
Feature activation-4.13e+00

Pos logit contributions
 argue+0.138
 elbows+0.121
 mischief+0.120
 reason+0.114
 weep+0.113
Neg logit contributions
ing-0.250
der-0.243
lic-0.241
 is-0.241
 can-0.239
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
 argue+0.111
 elbows+0.108
cle+0.108
keeping+0.107
 reason+0.106
Neg logit contributions
 is-0.056
der-0.054
 can-0.052
 has-0.050
 makes-0.047
Token What
Token ID1867
Feature activation+1.21e+00

Pos logit contributions
 argue+0.030
 weep+0.029
cle+0.029
 Johnson+0.028
keeping+0.028
Neg logit contributions
der-0.029
 is-0.026
boards-0.025
lic-0.025
les-0.024
Token are
Token ID389
Feature activation+4.08e-01

Pos logit contributions
der+0.012
 is+0.010
lic+0.010
 can+0.009
 has+0.009
Neg logit contributions
 argue-0.036
 reason-0.035
cle-0.035
 weep-0.034
keeping-0.034
Token you
Token ID345
Feature activation-3.52e+00

Pos logit contributions
der+0.008
 is+0.007
 can+0.007
lic+0.007
 has+0.007
Neg logit contributions
 argue-0.009
cle-0.008
keeping-0.008
 reason-0.008
 elbows-0.008
Token for
Token ID329
Feature activation-1.66e+00

Pos logit contributions
cle+0.075
 argue+0.075
 reason+0.073
 Johnson+0.073
 worry+0.072
Neg logit contributions
der-0.046
lic-0.043
boards-0.042
 is-0.039
 makes-0.039
Token my
Token ID616
Feature activation-2.65e+00

Pos logit contributions
 argue+0.032
cle+0.031
keeping+0.030
 elbows+0.030
 reason+0.030
Neg logit contributions
der-0.034
 is-0.033
 can-0.032
lic-0.031
 has-0.031
Token stickers
Token ID28568
Feature activation-2.80e-01

Pos logit contributions
keeping+0.070
 elbows+0.069
 argue+0.067
Susan+0.065
cle+0.065
Neg logit contributions
 is-0.042
 can-0.040
 has-0.037
 looks-0.035
der-0.034
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
keeping+0.006
cle+0.006
 argue+0.006
 reason+0.006
 moderation+0.006
Neg logit contributions
der-0.005
lic-0.005
 is-0.004
boards-0.004
 looks-0.004
Token play
Token ID711
Feature activation-1.13e+00

Pos logit contributions
 argue+0.074
cle+0.070
keeping+0.070
 elbows+0.070
 reason+0.069
Neg logit contributions
der-0.068
lic-0.068
boards-0.063
 is-0.062
 can-0.060
Token-
Token ID12
Feature activation-9.37e+00

Pos logit contributions
 argue+0.025
keeping+0.024
 reason+0.024
cle+0.024
 afar+0.024
Neg logit contributions
der-0.019
lic-0.017
 has-0.017
 is-0.017
 can-0.017
Tokend
Token ID67
Feature activation-4.62e+00

Pos logit contributions
 argue+0.197
 elbows+0.195
 weep+0.183
 mischief+0.183
 reason+0.183
Neg logit contributions
 is-0.176
lic-0.174
der-0.169
 can-0.167
 has-0.162
Tokenough
Token ID619
Feature activation-8.79e-01

Pos logit contributions
 argue+0.096
cle+0.092
keeping+0.090
 reason+0.090
aster+0.089
Neg logit contributions
der-0.091
 is-0.082
 can-0.081
lic-0.080
 has-0.079
Token.
Token ID13
Feature activation-9.23e-02

Pos logit contributions
 argue+0.021
keeping+0.021
cle+0.021
 reason+0.020
 elbows+0.020
Neg logit contributions
der-0.013
lic-0.012
 is-0.012
 can-0.012
 has-0.012
Token They
Token ID1119
Feature activation+1.78e+00

Pos logit contributions
cle+0.002
 argue+0.002
keeping+0.002
 moderation+0.002
 weep+0.002
Neg logit contributions
der-0.002
boards-0.001
lic-0.001
 is-0.001
ing-0.001
Token are
Token ID389
Feature activation+1.89e+00

Pos logit contributions
der+0.029
lic+0.025
boards+0.024
ing+0.021
lights+0.020
Neg logit contributions
 argue-0.047
 reason-0.044
 weep-0.044
 worry-0.043
 panic-0.043
Token that
Token ID326
Feature activation+1.43e+00

Pos logit contributions
 argue+0.006
 reason+0.005
 worry+0.005
cle+0.005
 weep+0.005
Neg logit contributions
lic-0.004
der-0.004
 is-0.004
 has-0.003
 looks-0.003
Token said
Token ID531
Feature activation+3.11e+00

Pos logit contributions
der+0.021
 is+0.019
lic+0.018
 can+0.017
 has+0.017
Neg logit contributions
 argue-0.037
cle-0.035
 reason-0.035
keeping-0.035
 elbows-0.035
Token "
Token ID366
Feature activation-2.11e+00

Pos logit contributions
der+0.044
lic+0.043
 is+0.039
 has+0.039
boards+0.038
Neg logit contributions
 argue-0.073
 worry-0.069
 reason-0.069
cle-0.068
aster-0.067
TokenNo
Token ID2949
Feature activation+1.47e-01

Pos logit contributions
 argue+0.038
 weep+0.037
 elbows+0.037
 mischief+0.036
keeping+0.035
Neg logit contributions
 is-0.044
 can-0.042
der-0.042
ing-0.041
lic-0.040
Token T
Token ID309
Feature activation-2.67e+00

Pos logit contributions
der+0.002
 is+0.002
 can+0.002
lic+0.002
 has+0.002
Neg logit contributions
 argue-0.004
cle-0.003
 reason-0.003
keeping-0.003
 elbows-0.003
Tokenresp
Token ID4363
Feature activation-9.34e+00

Pos logit contributions
 argue+0.050
cle+0.049
keeping+0.048
 reason+0.047
 elbows+0.046
Neg logit contributions
der-0.055
 is-0.054
 can-0.052
 has-0.051
lic-0.051
Tokenassing
Token ID19696
Feature activation-1.57e+00

Pos logit contributions
 reason+0.130
cle+0.128
 elbows+0.123
 worry+0.120
 argue+0.119
Neg logit contributions
 is-0.228
 can-0.227
der-0.217
 has-0.216
 looks-0.211
Token"
Token ID1
Feature activation-2.86e+00

Pos logit contributions
 argue+0.032
keeping+0.031
cle+0.031
 reason+0.030
aster+0.029
Neg logit contributions
der-0.029
lic-0.026
 is-0.026
 can-0.025
 has-0.025
Token and
Token ID290
Feature activation-2.90e+00

Pos logit contributions
 argue+0.071
cle+0.069
keeping+0.068
 reason+0.068
 elbows+0.067
Neg logit contributions
der-0.042
 is-0.040
 can-0.039
lic-0.037
 has-0.037
Token "
Token ID366
Feature activation-3.28e+00

Pos logit contributions
 argue+0.064
 weep+0.058
 worry+0.057
 elbows+0.057
 afar+0.056
Neg logit contributions
lic-0.052
der-0.049
boards-0.049
 is-0.048
ing-0.045
TokenPrivate
Token ID29067
Feature activation-3.13e+00

Pos logit contributions
 weep+0.062
 elbows+0.061
 argue+0.060
keeping+0.059
 mischief+0.059
Neg logit contributions
der-0.066
 is-0.065
 can-0.063
ing-0.060
lic-0.058
Token a
Token ID257
Feature activation-2.19e+00

Pos logit contributions
der+0.026
 is+0.025
 can+0.024
 has+0.024
lic+0.023
Neg logit contributions
 argue-0.041
 reason-0.039
 elbows-0.039
keeping-0.039
cle-0.039
Token loud
Token ID7812
Feature activation-3.07e+00

Pos logit contributions
 argue+0.060
keeping+0.060
cle+0.059
 elbows+0.058
 reason+0.057
Neg logit contributions
 is-0.026
der-0.026
 can-0.024
 has-0.023
lic-0.022
Token sound
Token ID2128
Feature activation+6.60e-01

Pos logit contributions
 argue+0.067
cle+0.064
keeping+0.063
 reason+0.063
 elbows+0.063
Neg logit contributions
der-0.055
 is-0.053
 can-0.051
 has-0.050
lic-0.049
Token.
Token ID13
Feature activation-6.02e-01

Pos logit contributions
der+0.010
 is+0.010
 can+0.009
 has+0.009
lic+0.009
Neg logit contributions
 argue-0.016
 reason-0.015
cle-0.015
 weep-0.015
aster-0.015
Token All
Token ID1439
Feature activation+1.18e+00

Pos logit contributions
 argue+0.016
cle+0.015
keeping+0.015
 elbows+0.015
 reason+0.015
Neg logit contributions
der-0.008
 is-0.008
 can-0.008
 has-0.008
lic-0.007
Token the
Token ID262
Feature activation+2.92e+00

Pos logit contributions
der+0.021
lic+0.018
 is+0.018
boards+0.017
 can+0.017
Neg logit contributions
 argue-0.027
aster-0.025
cle-0.025
 weep-0.025
 reason-0.025
Token other
Token ID584
Feature activation-2.41e-02

Pos logit contributions
 is+0.043
 can+0.041
der+0.039
 has+0.039
 makes+0.038
Neg logit contributions
keeping-0.076
 argue-0.075
cle-0.075
aster-0.071
 elbows-0.071
Token animals
Token ID4695
Feature activation-1.09e+00

Pos logit contributions
 argue+0.001
cle+0.001
keeping+0.001
 elbows+0.000
 reason+0.000
Neg logit contributions
der-0.000
 is-0.000
 can-0.000
 has-0.000
lic-0.000
Token in
Token ID287
Feature activation-2.72e+00

Pos logit contributions
 argue+0.030
 reason+0.028
cle+0.028
 weep+0.028
keeping+0.028
Neg logit contributions
der-0.014
 is-0.013
lic-0.012
 can-0.012
 has-0.011
Token the
Token ID262
Feature activation+7.22e-01

Pos logit contributions
 argue+0.066
cle+0.063
 elbows+0.062
 reason+0.062
keeping+0.062
Neg logit contributions
der-0.044
 is-0.040
lic-0.039
 can-0.039
 has-0.037
Token woods
Token ID16479
Feature activation-4.71e-01

Pos logit contributions
 is+0.013
 can+0.012
 has+0.012
 looks+0.012
 makes+0.012
Neg logit contributions
keeping-0.016
 argue-0.016
cle-0.016
ge-0.015
 reason-0.015
Token fl
Token ID781
Feature activation+1.54e+00

Pos logit contributions
 argue+0.009
 weep+0.008
 worry+0.008
cle+0.008
 reason+0.008
Neg logit contributions
der-0.007
boards-0.006
 is-0.006
lic-0.006
 can-0.006
Tokenicked
Token ID9484
Feature activation+3.51e+00

Pos logit contributions
 is+0.027
der+0.025
 makes+0.025
 can+0.025
 has+0.025
Neg logit contributions
cle-0.034
 elbows-0.033
 argue-0.033
ree-0.032
owing-0.032
Token his
Token ID465
Feature activation-1.65e+00

Pos logit contributions
 is+0.052
der+0.051
 can+0.050
 all+0.047
 that+0.045
Neg logit contributions
 argue-0.088
keeping-0.087
cle-0.086
 elbows-0.084
 reason-0.083
Token tail
Token ID7894
Feature activation+6.79e-01

Pos logit contributions
 argue+0.038
keeping+0.037
cle+0.036
 moderation+0.035
ge+0.035
Neg logit contributions
 is-0.029
der-0.028
 can-0.027
 has-0.026
 looks-0.026
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
der+0.009
 is+0.009
 can+0.009
lic+0.009
 has+0.009
Neg logit contributions
 argue-0.017
 elbows-0.016
keeping-0.016
cle-0.016
 reason-0.016
Token He
Token ID679
Feature activation+3.03e+00

Pos logit contributions
der+0.016
 is+0.016
 can+0.015
lic+0.014
 has+0.014
Neg logit contributions
 argue-0.026
cle-0.025
keeping-0.025
 reason-0.024
 elbows-0.024
Token didn
Token ID1422
Feature activation-1.49e-01

Pos logit contributions
der+0.048
boards+0.044
lic+0.044
 is+0.043
 can+0.042
Neg logit contributions
 argue-0.072
 weep-0.067
 reason-0.066
cle-0.066
 afar-0.065
Token't
Token ID470
Feature activation-2.96e+00

Pos logit contributions
 argue+0.003
cle+0.003
keeping+0.003
 elbows+0.003
 reason+0.003
Neg logit contributions
der-0.003
 is-0.003
 can-0.003
 has-0.003
lic-0.003
Token want
Token ID765
Feature activation+5.91e-01

Pos logit contributions
cle+0.052
 argue+0.051
 elbows+0.051
keeping+0.051
 moderation+0.047
Neg logit contributions
der-0.064
 is-0.064
 can-0.063
 has-0.061
lic-0.060
Token to
Token ID284
Feature activation-6.62e+00

Pos logit contributions
 is+0.008
der+0.008
 can+0.008
 has+0.007
 makes+0.007
Neg logit contributions
 argue-0.015
cle-0.015
 elbows-0.015
keeping-0.015
 weep-0.014
Token be
Token ID307
Feature activation-3.35e+00

Pos logit contributions
cle+0.162
 elbows+0.153
keeping+0.151
 moderation+0.145
 afar+0.142
Neg logit contributions
 all-0.112
 is-0.111
 can-0.108
 that-0.106
der-0.097
Token bike
Token ID7161
Feature activation+3.38e-01

Pos logit contributions
 argue+0.010
keeping+0.010
cle+0.010
aster+0.009
 moderation+0.009
Neg logit contributions
 is-0.011
 can-0.010
der-0.010
 has-0.010
 looks-0.010
Token gently
Token ID15165
Feature activation-1.02e+00

Pos logit contributions
der+0.005
 is+0.004
 can+0.004
 has+0.004
lic+0.004
Neg logit contributions
 argue-0.009
cle-0.009
keeping-0.008
 elbows-0.008
 reason-0.008
Token,
Token ID11
Feature activation-1.86e+00

Pos logit contributions
 argue+0.027
cle+0.026
keeping+0.025
 elbows+0.025
 reason+0.025
Neg logit contributions
der-0.014
 is-0.014
 can-0.013
lic-0.013
 has-0.012
Token but
Token ID475
Feature activation+1.07e+00

Pos logit contributions
 argue+0.042
cle+0.041
keeping+0.040
 reason+0.039
 elbows+0.039
Neg logit contributions
der-0.032
 is-0.032
 can-0.031
 has-0.030
 makes-0.029
Token suddenly
Token ID6451
Feature activation+1.09e+00

Pos logit contributions
der+0.016
 is+0.016
 can+0.015
 has+0.015
 makes+0.014
Neg logit contributions
 argue-0.027
cle-0.026
keeping-0.026
 reason-0.025
 elbows-0.025
Token the
Token ID262
Feature activation+4.56e+00

Pos logit contributions
der+0.015
 is+0.015
 can+0.014
 has+0.014
lic+0.013
Neg logit contributions
 argue-0.028
cle-0.027
keeping-0.027
 reason-0.026
 elbows-0.026
Token gears
Token ID28713
Feature activation+9.10e-01

Pos logit contributions
der+0.075
 is+0.074
 can+0.071
 has+0.069
lic+0.066
Neg logit contributions
 argue-0.107
cle-0.103
keeping-0.102
 elbows-0.100
 reason-0.100
Token got
Token ID1392
Feature activation+2.82e+00

Pos logit contributions
der+0.017
 is+0.016
lic+0.015
 has+0.015
 looks+0.015
Neg logit contributions
 argue-0.019
 reason-0.018
 elbows-0.017
 weep-0.017
ge-0.017
Token stuck
Token ID7819
Feature activation+1.53e+00

Pos logit contributions
der+0.050
 is+0.049
 can+0.048
 has+0.046
lic+0.046
Neg logit contributions
 argue-0.061
cle-0.060
keeping-0.059
 reason-0.058
 elbows-0.058
Token.
Token ID13
Feature activation+7.79e-01

Pos logit contributions
der+0.022
 is+0.021
 can+0.020
lic+0.020
 has+0.019
Neg logit contributions
 argue-0.039
keeping-0.037
cle-0.037
 elbows-0.037
 reason-0.037
Token She
Token ID1375
Feature activation+3.20e+00

Pos logit contributions
der+0.009
 is+0.008
 can+0.008
 has+0.008
lic+0.007
Neg logit contributions
 argue-0.022
cle-0.022
keeping-0.021
 elbows-0.021
 reason-0.021
Token time
Token ID640
Feature activation-2.37e+00

Pos logit contributions
 is+0.033
der+0.033
 can+0.032
 has+0.031
 makes+0.030
Neg logit contributions
 argue-0.046
cle-0.046
keeping-0.045
 elbows-0.044
 weep-0.043
Token.
Token ID13
Feature activation+9.78e-01

Pos logit contributions
 argue+0.058
keeping+0.056
cle+0.055
 reason+0.055
 elbows+0.055
Neg logit contributions
der-0.034
 is-0.032
lic-0.032
 can-0.030
 has-0.029
Token But
Token ID887
Feature activation+3.85e-01

Pos logit contributions
der+0.013
 is+0.012
 can+0.012
lic+0.011
 has+0.011
Neg logit contributions
 argue-0.026
cle-0.025
keeping-0.025
 elbows-0.024
 reason-0.024
Token when
Token ID618
Feature activation-3.56e-02

Pos logit contributions
 is+0.006
der+0.006
 can+0.005
 has+0.005
 that+0.005
Neg logit contributions
keeping-0.010
 argue-0.010
 elbows-0.010
cle-0.010
 reason-0.009
Token they
Token ID484
Feature activation+1.76e+00

Pos logit contributions
cle+0.001
keeping+0.001
 argue+0.001
 elbows+0.001
 weep+0.001
Neg logit contributions
 is-0.000
 can-0.000
der-0.000
 has-0.000
 makes-0.000
Token looked
Token ID3114
Feature activation+3.42e+00

Pos logit contributions
der+0.026
 is+0.025
 can+0.024
 has+0.023
lic+0.023
Neg logit contributions
 argue-0.044
cle-0.043
keeping-0.043
 elbows-0.042
 reason-0.042
Token up
Token ID510
Feature activation+1.65e+00

Pos logit contributions
der+0.060
lic+0.060
 is+0.055
 teaches+0.053
boards+0.052
Neg logit contributions
 argue-0.071
 afar-0.069
 weep-0.066
 reason-0.065
 worry-0.064
Token,
Token ID11
Feature activation-5.20e-02

Pos logit contributions
der+0.027
lic+0.026
 is+0.025
 has+0.023
 can+0.023
Neg logit contributions
 argue-0.038
 reason-0.036
 elbows-0.036
 worry-0.036
cle-0.036
Token the
Token ID262
Feature activation+4.98e+00

Pos logit contributions
 argue+0.001
cle+0.001
keeping+0.001
 elbows+0.001
 reason+0.001
Neg logit contributions
der-0.001
 is-0.001
 can-0.001
 has-0.001
lic-0.001
Token sun
Token ID4252
Feature activation+2.35e+00

Pos logit contributions
der+0.082
 is+0.076
lic+0.075
 can+0.073
 has+0.070
Neg logit contributions
 argue-0.116
 reason-0.112
 elbows-0.111
cle-0.110
 weep-0.108
Token was
Token ID373
Feature activation+3.28e+00

Pos logit contributions
der+0.035
lic+0.030
 is+0.027
boards+0.027
 helps+0.026
Neg logit contributions
 argue-0.058
cle-0.056
 elbows-0.056
 reason-0.055
 weep-0.055
Token The
Token ID383
Feature activation+2.66e+00

Pos logit contributions
 argue+0.027
 elbows+0.027
cle+0.026
 reason+0.026
keeping+0.026
Neg logit contributions
 is-0.010
 can-0.009
der-0.009
 has-0.008
 makes-0.008
Token suit
Token ID6050
Feature activation+2.72e+00

Pos logit contributions
der+0.060
lic+0.047
icked+0.046
 contains+0.045
 teaches+0.045
Neg logit contributions
 reason-0.050
cle-0.050
 argue-0.049
 worry-0.048
 weep-0.047
Token was
Token ID373
Feature activation+1.65e+00

Pos logit contributions
der+0.046
lic+0.037
lessly+0.035
 helps+0.034
 can+0.032
Neg logit contributions
cle-0.062
 argue-0.059
 weep-0.059
keeping-0.058
gy-0.058
Token special
Token ID2041
Feature activation+9.38e-01

Pos logit contributions
der+0.029
boards+0.025
 is+0.025
 can+0.024
lic+0.024
Neg logit contributions
 argue-0.037
cle-0.036
 weep-0.036
 worry-0.036
 reason-0.035
Token because
Token ID780
Feature activation-6.39e-01

Pos logit contributions
der+0.015
lic+0.015
 is+0.015
 has+0.014
 can+0.014
Neg logit contributions
 argue-0.021
keeping-0.020
cle-0.020
 reason-0.020
 weep-0.019
Token it
Token ID340
Feature activation+3.49e+00

Pos logit contributions
 worry+0.011
cle+0.011
 moderation+0.011
aster+0.011
 argue+0.011
Neg logit contributions
lic-0.014
der-0.014
ows-0.012
boards-0.012
les-0.011
Token could
Token ID714
Feature activation-2.14e+00

Pos logit contributions
der+0.062
lic+0.047
arl+0.043
boards+0.042
ing+0.042
Neg logit contributions
 weep-0.083
 panic-0.081
 argue-0.079
cle-0.079
 worry-0.079
Token perform
Token ID1620
Feature activation+1.71e+00

Pos logit contributions
 argue+0.042
 weep+0.039
 worry+0.038
 reason+0.037
 panic+0.037
Neg logit contributions
der-0.049
lic-0.045
boards-0.045
ing-0.044
 is-0.043
Token amazing
Token ID4998
Feature activation-1.93e+00

Pos logit contributions
der+0.031
lic+0.027
boards+0.026
 helps+0.025
 has+0.025
Neg logit contributions
 argue-0.034
 reason-0.034
 mischief-0.033
 afar-0.032
aster-0.032
Token tricks
Token ID15910
Feature activation-2.31e-01

Pos logit contributions
 argue+0.038
 reason+0.037
keeping+0.036
 mischief+0.036
 elbows+0.035
Neg logit contributions
der-0.036
lic-0.034
 has-0.034
 is-0.033
 can-0.033
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
 argue+0.005
cle+0.005
keeping+0.005
 reason+0.005
 elbows+0.005
Neg logit contributions
der-0.004
lic-0.003
 is-0.003
 has-0.003
 looks-0.003
Token shelf
Token ID18316
Feature activation+1.23e+00

Pos logit contributions
der+0.006
 is+0.006
 can+0.005
 has+0.005
lic+0.005
Neg logit contributions
 argue-0.008
cle-0.007
keeping-0.007
 elbows-0.007
 reason-0.007
Token so
Token ID523
Feature activation-4.50e-01

Pos logit contributions
lic+0.019
der+0.019
 is+0.019
 can+0.018
 has+0.018
Neg logit contributions
 argue-0.028
keeping-0.028
cle-0.027
 reason-0.026
 weep-0.026
Token it
Token ID340
Feature activation+3.91e+00

Pos logit contributions
 argue+0.010
cle+0.010
keeping+0.010
aster+0.010
 reason+0.010
Neg logit contributions
der-0.007
lic-0.007
 is-0.007
 can-0.006
 helps-0.006
Token would
Token ID561
Feature activation-6.69e-01

Pos logit contributions
der+0.066
lic+0.065
boards+0.055
ing+0.054
lessly+0.054
Neg logit contributions
 argue-0.088
 weep-0.085
 reason-0.082
 panic-0.081
 worry-0.079
Token be
Token ID307
Feature activation-5.51e-01

Pos logit contributions
 argue+0.013
 weep+0.012
 reason+0.012
 worry+0.012
cle+0.011
Neg logit contributions
der-0.014
lic-0.014
boards-0.013
 is-0.013
ing-0.013
Token safe
Token ID3338
Feature activation-7.47e-01

Pos logit contributions
 argue+0.013
cle+0.013
keeping+0.013
 elbows+0.013
 reason+0.013
Neg logit contributions
 is-0.008
der-0.008
 can-0.008
 has-0.008
lic-0.008
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
keeping+0.017
 argue+0.017
 reason+0.016
cle+0.016
 worry+0.016
Neg logit contributions
lic-0.013
der-0.012
 is-0.011
boards-0.011
 can-0.011
Token Jill
Token ID22772
Feature activation+6.15e-01

Pos logit contributions
der+0.016
 is+0.015
 can+0.015
lic+0.015
 has+0.014
Neg logit contributions
 argue-0.017
keeping-0.016
 reason-0.016
cle-0.015
 weep-0.015
Token read
Token ID1100
Feature activation+2.89e+00

Pos logit contributions
der+0.009
 is+0.008
lic+0.008
 can+0.008
 has+0.007
Neg logit contributions
 argue-0.016
cle-0.015
 weep-0.015
keeping-0.015
 reason-0.015
Token and
Token ID290
Feature activation-5.60e-02

Pos logit contributions
der+0.052
lic+0.047
 is+0.046
 has+0.045
 can+0.045
Neg logit contributions
 argue-0.062
 reason-0.060
keeping-0.059
cle-0.059
 afar-0.058
Token enjoyed
Token ID8359
Feature activation+9.14e-01

Pos logit contributions
 argue+0.001
keeping+0.001
 weep+0.001
 reason+0.001
 elbows+0.001
Neg logit contributions
lic-0.001
der-0.001
boards-0.001
 is-0.001
 can-0.001
TokenI
Token ID40
Feature activation-2.17e+00

Pos logit contributions
 argue+0.005
keeping+0.004
cle+0.004
 reason+0.004
 elbows+0.004
Neg logit contributions
der-0.003
 is-0.003
 can-0.003
 has-0.003
lic-0.003
Token love
Token ID1842
Feature activation-7.75e-01

Pos logit contributions
 argue+0.042
keeping+0.041
cle+0.041
 elbows+0.041
 reason+0.039
Neg logit contributions
 is-0.044
der-0.044
 can-0.043
 has-0.041
lic-0.041
Token you
Token ID345
Feature activation-4.24e+00

Pos logit contributions
 argue+0.018
cle+0.017
keeping+0.017
 elbows+0.017
 reason+0.017
Neg logit contributions
der-0.013
 is-0.013
 can-0.012
 has-0.012
lic-0.012
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
 argue+0.107
cle+0.103
 reason+0.100
 weep+0.100
 worry+0.099
Neg logit contributions
der-0.064
lic-0.058
 is-0.057
boards-0.055
 can-0.054
Token Luna
Token ID23694
Feature activation-2.14e+00

Pos logit contributions
 argue+0.041
 reason+0.039
 worry+0.038
 weep+0.038
 panic+0.036
Neg logit contributions
der-0.060
boards-0.059
lic-0.059
 is-0.054
ing-0.053
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
 argue+0.045
cle+0.044
 reason+0.042
 weep+0.042
keeping+0.042
Neg logit contributions
der-0.038
lic-0.036
boards-0.036
lessly-0.031
 is-0.031
Token You
Token ID921
Feature activation+5.48e-01

Pos logit contributions
cle+0.009
 argue+0.009
 moderation+0.009
 weep+0.009
keeping+0.009
Neg logit contributions
der-0.007
lic-0.007
boards-0.006
 is-0.006
 can-0.006
Token are
Token ID389
Feature activation+2.45e+00

Pos logit contributions
der+0.009
lic+0.007
boards+0.007
ing+0.007
lights+0.007
Neg logit contributions
 argue-0.014
 worry-0.014
 weep-0.014
 panic-0.013
 reason-0.013
Token my
Token ID616
Feature activation-4.53e-01

Pos logit contributions
der+0.045
boards+0.040
lic+0.039
 is+0.038
 can+0.037
Neg logit contributions
 argue-0.054
cle-0.052
 moderation-0.052
 reason-0.052
 worry-0.051
Token best
Token ID1266
Feature activation-2.13e+00

Pos logit contributions
 argue+0.012
keeping+0.011
cle+0.011
 elbows+0.011
aster+0.011
Neg logit contributions
 is-0.007
der-0.006
 can-0.006
 has-0.006
 helps-0.006
Token friend
Token ID1545
Feature activation-1.20e+00

Pos logit contributions
 argue+0.042
keeping+0.041
 elbows+0.041
cle+0.040
 afar+0.039
Neg logit contributions
der-0.043
 is-0.043
 can-0.042
 has-0.040
 makes-0.039
Token to
Token ID284
Feature activation-5.11e+00

Pos logit contributions
 is+0.020
der+0.019
 can+0.019
 has+0.017
 makes+0.017
Neg logit contributions
 argue-0.040
cle-0.040
 elbows-0.039
keeping-0.039
 weep-0.039
Token go
Token ID467
Feature activation-4.50e+00

Pos logit contributions
cle+0.110
keeping+0.106
 elbows+0.106
 moderation+0.099
 afar+0.098
Neg logit contributions
 is-0.104
 can-0.101
 all-0.094
 makes-0.092
 looks-0.092
Token on
Token ID319
Feature activation-3.24e+00

Pos logit contributions
cle+0.123
 argue+0.123
 elbows+0.121
keeping+0.120
 reason+0.118
Neg logit contributions
 is-0.057
 can-0.054
 has-0.051
der-0.050
lic-0.047
Token it
Token ID340
Feature activation+4.45e-01

Pos logit contributions
cle+0.103
cker+0.098
 argue+0.098
 weep+0.097
 reason+0.096
Neg logit contributions
 is-0.034
 all-0.033
 that-0.032
 can-0.026
 looks-0.024
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
der+0.005
 is+0.005
 can+0.005
 has+0.005
 makes+0.004
Neg logit contributions
 argue-0.012
cle-0.012
 elbows-0.012
keeping-0.012
 reason-0.012
Token He
Token ID679
Feature activation+2.80e+00

Pos logit contributions
 argue+0.014
cle+0.014
keeping+0.014
 elbows+0.014
 reason+0.014
Neg logit contributions
der-0.004
 is-0.004
 can-0.003
 has-0.003
lic-0.003
Token ran
Token ID4966
Feature activation+2.02e-01

Pos logit contributions
der+0.051
 is+0.049
 can+0.047
lic+0.045
 has+0.045
Neg logit contributions
 argue-0.061
cle-0.059
 reason-0.058
 elbows-0.057
keeping-0.057
Token to
Token ID284
Feature activation-1.87e+00

Pos logit contributions
 is+0.003
 can+0.002
der+0.002
 all+0.002
 that+0.002
Neg logit contributions
cle-0.006
 elbows-0.006
 moderation-0.006
 argue-0.005
 weep-0.005
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
cle+0.061
 elbows+0.059
keeping+0.059
cker+0.056
 argue+0.055
Neg logit contributions
 all-0.018
 is-0.018
 can-0.015
 that-0.014
 helps-0.013
Token slide
Token ID10649
Feature activation-6.23e-01

Pos logit contributions
der+0.069
lic+0.064
 is+0.062
 can+0.061
 makes+0.060
Neg logit contributions
 argue-0.037
 reason-0.034
 elbows-0.033
 weep-0.032
 panic-0.032
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
 argue+0.015
cle+0.014
 reason+0.013
keeping+0.013
 weep+0.013
Neg logit contributions
der-0.010
lic-0.010
 is-0.009
 can-0.009
 has-0.009
Token picture
Token ID4286
Feature activation+5.42e+00

Pos logit contributions
 is+0.064
 can+0.062
der+0.062
 has+0.060
 makes+0.056
Neg logit contributions
 argue-0.111
keeping-0.110
cle-0.109
 elbows-0.107
 reason-0.104
Token was
Token ID373
Feature activation+3.08e+00

Pos logit contributions
der+0.078
lic+0.072
 is+0.069
 can+0.066
 has+0.064
Neg logit contributions
 argue-0.130
 reason-0.127
 weep-0.126
keeping-0.123
cle-0.123
Token taken
Token ID2077
Feature activation+3.14e+00

Pos logit contributions
der+0.053
 is+0.050
 can+0.048
 has+0.046
boards+0.046
Neg logit contributions
 argue-0.070
cle-0.067
 reason-0.067
keeping-0.066
 weep-0.065
Token and
Token ID290
Feature activation-5.79e-01

Pos logit contributions
der+0.065
lic+0.059
 has+0.059
 is+0.058
boards+0.056
Neg logit contributions
 reason-0.061
 worry-0.058
 argue-0.058
keeping-0.057
 panic-0.056
Token then
Token ID788
Feature activation+8.13e-01

Pos logit contributions
 argue+0.012
keeping+0.012
 weep+0.012
 worry+0.012
 reason+0.012
Neg logit contributions
der-0.011
lic-0.010
boards-0.009
 is-0.009
 can-0.009
Token ran
Token ID4966
Feature activation+8.39e-01

Pos logit contributions
der+0.015
lic+0.015
 is+0.014
boards+0.014
 can+0.013
Neg logit contributions
 argue-0.017
 worry-0.016
 weep-0.016
 reason-0.016
keeping-0.016
Token off
Token ID572
Feature activation-5.39e-01

Pos logit contributions
der+0.015
 is+0.014
 can+0.014
 has+0.013
lic+0.013
Neg logit contributions
 argue-0.018
keeping-0.018
 reason-0.017
 weep-0.017
 elbows-0.017
Token to
Token ID284
Feature activation-2.19e+00

Pos logit contributions
cle+0.014
 argue+0.014
 elbows+0.014
 reason+0.013
keeping+0.013
Neg logit contributions
 is-0.007
der-0.007
 can-0.007
 has-0.007
lic-0.006
Token find
Token ID1064
Feature activation+3.18e+00

Pos logit contributions
 argue+0.044
cle+0.043
keeping+0.042
 elbows+0.041
 reason+0.041
Neg logit contributions
der-0.043
 is-0.043
 can-0.041
 has-0.040
lic-0.039
Token something
Token ID1223
Feature activation+2.92e+00

Pos logit contributions
der+0.067
lic+0.064
 has+0.061
 is+0.060
boards+0.059
Neg logit contributions
 reason-0.059
 moderation-0.057
 afar-0.057
 mischief-0.057
aster-0.056
Token else
Token ID2073
Feature activation-2.88e-01

Pos logit contributions
der+0.050
lic+0.045
 is+0.044
 has+0.044
boards+0.043
Neg logit contributions
 reason-0.065
 argue-0.062
 worry-0.061
 harm-0.059
 weep-0.059
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 argue+0.064
cle+0.062
keeping+0.062
 elbows+0.061
 reason+0.061
Neg logit contributions
der-0.034
 is-0.033
 can-0.032
lic-0.031
 has-0.030
Token
Token ID198
Feature activation+6.62e-01

Pos logit contributions
 is+0.003
der+0.002
 can+0.002
 has+0.002
 makes+0.002
Neg logit contributions
 argue-0.006
 elbows-0.006
cle-0.006
 reason-0.006
keeping-0.006
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
 is+0.011
 can+0.010
der+0.009
 has+0.009
lic+0.009
Neg logit contributions
 argue-0.017
 elbows-0.016
keeping-0.016
cle-0.015
 weep-0.015
TokenOne
Token ID3198
Feature activation+1.33e+00

Pos logit contributions
 is+0.002
 can+0.002
der+0.002
 has+0.002
lic+0.002
Neg logit contributions
 elbows-0.004
keeping-0.004
 argue-0.004
 weep-0.003
 worry-0.003
Token day
Token ID1110
Feature activation+4.60e-01

Pos logit contributions
 is+0.030
 can+0.028
 has+0.028
der+0.027
 makes+0.026
Neg logit contributions
keeping-0.025
 argue-0.025
 elbows-0.025
cle-0.023
 reason-0.022
Token,
Token ID11
Feature activation-8.15e-01

Pos logit contributions
 is+0.005
 can+0.005
der+0.005
 has+0.004
lic+0.004
Neg logit contributions
 argue-0.014
keeping-0.013
 elbows-0.013
cle-0.013
 weep-0.013
Token Tim
Token ID5045
Feature activation-1.03e+00

Pos logit contributions
 argue+0.022
 elbows+0.022
 weep+0.021
iver+0.021
keeping+0.021
Neg logit contributions
 all-0.011
 is-0.010
 that-0.010
 can-0.010
der-0.009
Token said
Token ID531
Feature activation+3.61e+00

Pos logit contributions
 argue+0.024
cle+0.024
 elbows+0.023
keeping+0.023
 reason+0.023
Neg logit contributions
 is-0.018
 can-0.017
 has-0.017
der-0.016
 looks-0.016
Token he
Token ID339
Feature activation+2.72e+00

Pos logit contributions
der+0.059
 is+0.055
 has+0.052
lic+0.051
 can+0.051
Neg logit contributions
 argue-0.086
 reason-0.080
cle-0.079
 weep-0.079
 worry-0.079
Token wanted
Token ID2227
Feature activation+1.63e+00

Pos logit contributions
der+0.042
lic+0.035
 is+0.034
boards+0.033
 helps+0.033
Neg logit contributions
 argue-0.067
 reason-0.064
cle-0.064
 weep-0.063
 worry-0.062
Token to
Token ID284
Feature activation-4.59e+00

Pos logit contributions
 is+0.021
der+0.021
 can+0.020
 makes+0.018
 has+0.018
Neg logit contributions
 argue-0.044
cle-0.044
 elbows-0.043
keeping-0.042
 weep-0.042
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
 argue+0.023
 worry+0.023
cle+0.023
 reason+0.022
 weep+0.022
Neg logit contributions
der-0.019
boards-0.018
 has-0.017
lic-0.017
 is-0.017
Token I
Token ID314
Feature activation-3.19e+00

Pos logit contributions
 worry+0.035
cle+0.034
 argue+0.034
 reason+0.033
 weep+0.033
Neg logit contributions
der-0.035
boards-0.033
lic-0.030
 is-0.029
les-0.029
Token will
Token ID481
Feature activation-2.22e+00

Pos logit contributions
 argue+0.082
cle+0.080
keeping+0.079
 elbows+0.079
 reason+0.078
Neg logit contributions
 is-0.045
der-0.044
 can-0.043
 has-0.041
lic-0.040
Token be
Token ID307
Feature activation-1.57e+00

Pos logit contributions
 argue+0.044
cle+0.041
keeping+0.041
 reason+0.041
 elbows+0.040
Neg logit contributions
der-0.045
 is-0.044
 can-0.043
 has-0.042
lic-0.042
Token your
Token ID534
Feature activation-1.22e+00

Pos logit contributions
 argue+0.041
cle+0.040
keeping+0.039
 elbows+0.039
 reason+0.039
Neg logit contributions
 is-0.022
 can-0.021
 has-0.020
 all-0.020
der-0.020
Token friend
Token ID1545
Feature activation-2.26e+00

Pos logit contributions
aster+0.027
keeping+0.027
 elbows+0.026
 argue+0.026
cle+0.025
Neg logit contributions
 is-0.023
 can-0.022
der-0.021
 has-0.020
 makes-0.020
Token."
Token ID526
Feature activation-4.77e-01

Pos logit contributions
 argue+0.058
cle+0.056
 elbows+0.056
keeping+0.055
 reason+0.055
Neg logit contributions
 is-0.032
der-0.032
 can-0.030
 has-0.029
lic-0.028
Token They
Token ID1119
Feature activation+2.01e+00

Pos logit contributions
 argue+0.010
cle+0.010
 elbows+0.010
keeping+0.010
 weep+0.010
Neg logit contributions
der-0.008
 is-0.008
lic-0.008
 can-0.007
boards-0.007
Token played
Token ID2826
Feature activation+1.12e+00

Pos logit contributions
der+0.040
 is+0.038
 can+0.036
lic+0.036
 has+0.035
Neg logit contributions
 argue-0.042
cle-0.038
 elbows-0.038
 weep-0.038
 reason-0.038
Token together
Token ID1978
Feature activation-5.25e-01

Pos logit contributions
der+0.016
 is+0.015
 can+0.014
lic+0.014
 has+0.014
Neg logit contributions
 argue-0.029
 reason-0.027
keeping-0.027
 elbows-0.027
cle-0.027
Token all
Token ID477
Feature activation-1.58e+00

Pos logit contributions
 argue+0.015
 reason+0.014
cle+0.014
 elbows+0.014
keeping+0.014
Neg logit contributions
der-0.006
 is-0.006
lic-0.006
 can-0.005
 has-0.005
Token,
Token ID11
Feature activation-3.89e+00

Pos logit contributions
 argue+0.039
keeping+0.038
cle+0.038
 elbows+0.037
 reason+0.037
Neg logit contributions
 is-0.016
der-0.015
 can-0.015
 has-0.014
lic-0.013
Token there
Token ID612
Feature activation+2.22e+00

Pos logit contributions
 argue+0.087
cle+0.083
 weep+0.082
 reason+0.081
 elbows+0.080
Neg logit contributions
der-0.069
 is-0.061
boards-0.060
lic-0.060
 helps-0.058
Token was
Token ID373
Feature activation-1.23e+00

Pos logit contributions
der+0.022
lic+0.018
 is+0.018
 can+0.017
 has+0.016
Neg logit contributions
 argue-0.067
cle-0.065
 reason-0.065
keeping-0.064
 weep-0.064
Token a
Token ID257
Feature activation-1.65e+00

Pos logit contributions
 argue+0.046
cle+0.045
ge+0.045
keeping+0.044
 elbows+0.043
Neg logit contributions
 all-0.007
 that-0.006
 is-0.006
 many-0.006
 big-0.005
Token little
Token ID1310
Feature activation-3.46e+00

Pos logit contributions
keeping+0.048
 argue+0.047
 elbows+0.046
cle+0.045
 reason+0.044
Neg logit contributions
 is-0.019
 can-0.019
der-0.018
 has-0.018
lic-0.016
Token boy
Token ID2933
Feature activation-2.46e+00

Pos logit contributions
 argue+0.080
cle+0.077
keeping+0.076
 elbows+0.076
 reason+0.075
Neg logit contributions
der-0.058
 is-0.056
 can-0.054
 has-0.052
lic-0.052
Token named
Token ID3706
Feature activation-3.55e+00

Pos logit contributions
 argue+0.055
keeping+0.054
 reason+0.052
 elbows+0.052
 mischief+0.052
Neg logit contributions
 is-0.049
 that-0.047
 can-0.044
 of-0.044
 has-0.044
Token Tim
Token ID5045
Feature activation-4.20e+00

Pos logit contributions
 argue+0.060
cle+0.057
keeping+0.056
 elbows+0.055
 reason+0.055
Neg logit contributions
der-0.083
 is-0.081
 can-0.079
 has-0.077
lic-0.077
Tokenmy
Token ID1820
Feature activation-2.20e+00

Pos logit contributions
 argue+0.099
 elbows+0.097
keeping+0.096
cle+0.095
 mischief+0.094
Neg logit contributions
 is-0.076
 can-0.070
 has-0.068
 all-0.064
der-0.064
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
 argue+0.069
keeping+0.067
 elbows+0.067
 reason+0.066
 mischief+0.066
Neg logit contributions
 is-0.022
der-0.020
 can-0.019
 has-0.018
 that-0.018
Token Tim
Token ID5045
Feature activation-2.60e+00

Pos logit contributions
 argue+0.026
cle+0.025
keeping+0.025
 elbows+0.024
 reason+0.024
Neg logit contributions
der-0.029
 is-0.029
 can-0.028
 has-0.027
lic-0.027
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
der+0.024
 is+0.020
 can+0.019
lic+0.019
 has+0.018
Neg logit contributions
 argue-0.077
cle-0.075
 reason-0.074
keeping-0.074
 elbows-0.074
Token a
Token ID257
Feature activation-1.45e+00

Pos logit contributions
 argue+0.042
cle+0.041
ge+0.041
keeping+0.041
 elbows+0.039
Neg logit contributions
 all-0.007
 is-0.006
 that-0.006
 big-0.005
 many-0.005
Token cute
Token ID13779
Feature activation-1.08e+00

Pos logit contributions
keeping+0.043
 argue+0.042
 elbows+0.042
cle+0.040
 reason+0.039
Neg logit contributions
 is-0.016
 can-0.016
der-0.015
 has-0.015
lic-0.013
Token little
Token ID1310
Feature activation-3.08e+00

Pos logit contributions
 argue+0.031
 elbows+0.030
keeping+0.030
cle+0.029
ge+0.028
Neg logit contributions
 is-0.014
der-0.013
 can-0.013
 has-0.012
 looks-0.011
Token pig
Token ID12967
Feature activation-1.85e+00

Pos logit contributions
 argue+0.066
cle+0.064
keeping+0.063
 reason+0.062
 elbows+0.062
Neg logit contributions
der-0.056
 is-0.055
 can-0.052
 has-0.051
lic-0.051
Token named
Token ID3706
Feature activation-3.54e+00

Pos logit contributions
 argue+0.037
 reason+0.036
 elbows+0.036
 mischief+0.035
keeping+0.034
Neg logit contributions
 is-0.039
 that-0.037
der-0.036
 can-0.036
 all-0.035
Token Pete
Token ID17542
Feature activation-3.81e+00

Pos logit contributions
 argue+0.071
cle+0.067
 elbows+0.067
keeping+0.067
 reason+0.066
Neg logit contributions
der-0.070
 is-0.070
 can-0.067
 has-0.065
lic-0.065
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 argue+0.104
cle+0.100
 elbows+0.099
keeping+0.099
 reason+0.099
Neg logit contributions
der-0.048
 is-0.048
 can-0.045
 has-0.043
lic-0.041
Token Pete
Token ID17542
Feature activation-1.95e+00

Pos logit contributions
 argue+0.018
 moderation+0.018
 mischief+0.017
 afar+0.017
 panic+0.017
Neg logit contributions
der-0.022
lic-0.021
 is-0.020
ows-0.020
 has-0.019
Token had
Token ID550
Feature activation+8.34e-01

Pos logit contributions
 argue+0.049
cle+0.049
keeping+0.047
 weep+0.047
 reason+0.047
Neg logit contributions
der-0.029
lic-0.025
 is-0.024
 can-0.024
boards-0.023
Token a
Token ID257
Feature activation-3.26e-01

Pos logit contributions
der+0.013
 is+0.012
 can+0.012
lic+0.011
 has+0.011
Neg logit contributions
 argue-0.020
cle-0.020
keeping-0.020
 elbows-0.020
 reason-0.019
Token,
Token ID11
Feature activation-4.04e+00

Pos logit contributions
 argue+0.039
keeping+0.039
cle+0.038
 elbows+0.038
 reason+0.037
Neg logit contributions
 is-0.015
 can-0.013
der-0.013
 has-0.012
lic-0.011
Token there
Token ID612
Feature activation+2.20e+00

Pos logit contributions
 argue+0.097
cle+0.093
 reason+0.092
 elbows+0.091
 weep+0.091
Neg logit contributions
der-0.065
 is-0.058
boards-0.056
lic-0.056
 can-0.055
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
der+0.022
lic+0.018
 is+0.018
 can+0.017
 has+0.017
Neg logit contributions
 argue-0.066
cle-0.064
 reason-0.064
keeping-0.063
 elbows-0.063
Token a
Token ID257
Feature activation-1.55e+00

Pos logit contributions
 argue+0.045
ge+0.044
cle+0.043
keeping+0.043
 elbows+0.041
Neg logit contributions
 all-0.008
 is-0.008
 that-0.007
 big-0.006
 many-0.006
Token rich
Token ID5527
Feature activation+2.66e-01

Pos logit contributions
keeping+0.047
 argue+0.046
 elbows+0.045
cle+0.044
 reason+0.043
Neg logit contributions
 is-0.017
 can-0.016
der-0.015
 has-0.015
lic-0.014
Token man
Token ID582
Feature activation-2.67e+00

Pos logit contributions
der+0.005
 is+0.005
 can+0.005
lic+0.005
 has+0.004
Neg logit contributions
 argue-0.005
cle-0.005
 reason-0.005
keeping-0.005
 weep-0.005
Token named
Token ID3706
Feature activation-3.33e+00

Pos logit contributions
 argue+0.063
 elbows+0.063
 reason+0.062
 mischief+0.060
keeping+0.059
Neg logit contributions
 is-0.047
 that-0.045
 has-0.043
 of-0.042
 can-0.042
Token Tom
Token ID4186
Feature activation-3.38e+00

Pos logit contributions
 argue+0.064
 elbows+0.063
keeping+0.061
cle+0.061
 reason+0.060
Neg logit contributions
 is-0.069
der-0.068
 can-0.067
 has-0.065
lic-0.064
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 argue+0.099
 elbows+0.096
 reason+0.095
keeping+0.095
cle+0.094
Neg logit contributions
 is-0.039
der-0.036
 can-0.035
 has-0.034
 makes-0.031
Token Tom
Token ID4186
Feature activation-1.42e+00

Pos logit contributions
 argue+0.026
cle+0.024
 elbows+0.023
keeping+0.023
 reason+0.023
Neg logit contributions
der-0.030
 is-0.028
 can-0.027
lic-0.027
 has-0.026
Token had
Token ID550
Feature activation+9.70e-01

Pos logit contributions
 argue+0.039
cle+0.039
keeping+0.038
 weep+0.037
 reason+0.037
Neg logit contributions
der-0.018
lic-0.015
 is-0.015
 can-0.014
boards-0.013
Token cook
Token ID4255
Feature activation+1.52e+00

Pos logit contributions
 argue+0.102
cle+0.094
 reason+0.093
 moderation+0.093
 worry+0.091
Neg logit contributions
der-0.087
lic-0.076
 is-0.072
boards-0.070
 can-0.070
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
der+0.025
lic+0.022
 is+0.022
 can+0.022
 has+0.021
Neg logit contributions
 argue-0.036
cle-0.035
 reason-0.034
keeping-0.034
 elbows-0.034
Token She
Token ID1375
Feature activation+1.35e+00

Pos logit contributions
 argue+0.030
cle+0.030
keeping+0.030
 elbows+0.030
 reason+0.029
Neg logit contributions
der-0.016
 is-0.016
 can-0.016
 has-0.015
 makes-0.014
Token wanted
Token ID2227
Feature activation+5.11e-01

Pos logit contributions
der+0.023
lic+0.020
ing+0.019
boards+0.019
lessly+0.019
Neg logit contributions
 argue-0.031
cle-0.030
 reason-0.029
aster-0.029
please-0.029
Token to
Token ID284
Feature activation-4.96e+00

Pos logit contributions
der+0.008
 is+0.008
 can+0.008
 has+0.007
lic+0.007
Neg logit contributions
 argue-0.012
cle-0.012
 reason-0.011
keeping-0.011
 elbows-0.011
Token be
Token ID307
Feature activation-2.38e+00

Pos logit contributions
 argue+0.107
cle+0.103
keeping+0.101
 elbows+0.101
 reason+0.100
Neg logit contributions
der-0.090
 is-0.089
 can-0.085
 has-0.082
lic-0.082
Token a
Token ID257
Feature activation-2.16e+00

Pos logit contributions
 argue+0.059
cle+0.057
keeping+0.057
 elbows+0.057
 weep+0.056
Neg logit contributions
 is-0.035
 can-0.034
der-0.034
lic-0.032
 has-0.032
Token good
Token ID922
Feature activation-2.82e+00

Pos logit contributions
keeping+0.053
 elbows+0.053
 argue+0.053
cle+0.050
 weep+0.049
Neg logit contributions
 is-0.033
 can-0.033
der-0.032
 has-0.030
lic-0.029
Token chef
Token ID21221
Feature activation-1.97e+00

Pos logit contributions
 elbows+0.058
 argue+0.058
keeping+0.057
cle+0.056
 weep+0.053
Neg logit contributions
 is-0.055
 can-0.054
der-0.053
 has-0.051
 helps-0.050
Token when
Token ID618
Feature activation-3.55e+00

Pos logit contributions
 argue+0.051
cle+0.049
keeping+0.049
 reason+0.048
 elbows+0.048
Neg logit contributions
der-0.027
 is-0.026
 can-0.025
lic-0.024
 has-0.024
Token she
Token ID673
Feature activation-9.82e-01

Pos logit contributions
 argue+0.097
cle+0.094
keeping+0.093
 elbows+0.092
 reason+0.091
Neg logit contributions
der-0.046
 is-0.045
 can-0.043
 has-0.041
 makes-0.039
Token?
Token ID30
Feature activation-7.24e-01

Pos logit contributions
keeping+0.061
cle+0.060
 argue+0.060
 generation+0.059
 elbows+0.059
Neg logit contributions
 is-0.037
 can-0.034
der-0.031
 has-0.031
 makes-0.030
Token You
Token ID921
Feature activation+7.60e-02

Pos logit contributions
 argue+0.018
cle+0.017
keeping+0.017
 reason+0.017
 elbows+0.017
Neg logit contributions
der-0.011
 is-0.011
 can-0.010
 has-0.010
 makes-0.009
Token have
Token ID423
Feature activation+1.19e+00

Pos logit contributions
der+0.001
 is+0.001
lic+0.001
 can+0.001
 has+0.001
Neg logit contributions
 argue-0.002
 weep-0.002
 reason-0.002
 elbows-0.002
cle-0.002
Token a
Token ID257
Feature activation-5.92e-01

Pos logit contributions
 is+0.018
der+0.018
 can+0.017
 has+0.016
 makes+0.016
Neg logit contributions
 argue-0.030
cle-0.029
keeping-0.029
 reason-0.028
 elbows-0.028
Token boo
Token ID22513
Feature activation-3.38e+00

Pos logit contributions
 argue+0.012
keeping+0.012
cle+0.012
 elbows+0.011
 reason+0.011
Neg logit contributions
 is-0.012
der-0.012
 can-0.011
 has-0.011
 makes-0.010
Token-
Token ID12
Feature activation-6.70e+00

Pos logit contributions
 argue+0.077
cle+0.074
keeping+0.073
 elbows+0.073
 reason+0.072
Neg logit contributions
der-0.058
 is-0.056
 can-0.054
 has-0.052
lic-0.052
Tokenbo
Token ID2127
Feature activation-4.29e+00

Pos logit contributions
 argue+0.130
 elbows+0.122
 reason+0.121
keeping+0.121
cle+0.121
Neg logit contributions
 is-0.137
der-0.137
 can-0.132
lic-0.128
 has-0.128
Tokeno
Token ID78
Feature activation-2.20e+00

Pos logit contributions
 argue+0.076
 reason+0.073
 elbows+0.073
cle+0.071
 mischief+0.071
Neg logit contributions
 is-0.099
 can-0.094
 has-0.092
der-0.092
lic-0.088
Token on
Token ID319
Feature activation-9.93e-01

Pos logit contributions
 argue+0.058
cle+0.056
keeping+0.055
 elbows+0.055
 reason+0.055
Neg logit contributions
der-0.030
 is-0.030
 can-0.028
 has-0.027
lic-0.026
Token your
Token ID534
Feature activation-2.93e+00

Pos logit contributions
 argue+0.023
cle+0.022
keeping+0.022
 elbows+0.022
 reason+0.022
Neg logit contributions
der-0.017
 is-0.016
 can-0.015
lic-0.015
 has-0.015
Token head
Token ID1182
Feature activation-1.69e+00

Pos logit contributions
 argue+0.050
cle+0.047
keeping+0.046
 reason+0.046
 elbows+0.045
Neg logit contributions
der-0.068
 is-0.067
 can-0.064
 has-0.063
 makes-0.062
Token and
Token ID290
Feature activation+7.93e-01

Pos logit contributions
 argue+0.002
 weep+0.002
cle+0.002
keeping+0.002
 reason+0.002
Neg logit contributions
der-0.001
lic-0.001
 is-0.001
 can-0.001
 has-0.001
Token Ben
Token ID3932
Feature activation-4.95e-01

Pos logit contributions
der+0.013
 is+0.012
lic+0.012
 can+0.012
 has+0.011
Neg logit contributions
 argue-0.019
cle-0.018
 elbows-0.018
 reason-0.018
 weep-0.018
Token did
Token ID750
Feature activation+2.67e+00

Pos logit contributions
 argue+0.012
 weep+0.012
 worry+0.011
 panic+0.011
 reason+0.011
Neg logit contributions
der-0.008
lic-0.008
 is-0.007
br-0.007
boards-0.007
Token not
Token ID407
Feature activation-3.27e+00

Pos logit contributions
 is+0.036
 that+0.034
der+0.034
 all+0.033
 can+0.032
Neg logit contributions
keeping-0.077
cle-0.075
 elbows-0.073
please-0.072
cker-0.070
Token want
Token ID765
Feature activation+5.27e-01

Pos logit contributions
cle+0.056
keeping+0.055
 elbows+0.055
 argue+0.053
 afar+0.051
Neg logit contributions
 is-0.074
 can-0.074
der-0.071
 has-0.070
 makes-0.068
Token to
Token ID284
Feature activation-7.53e+00

Pos logit contributions
der+0.008
 is+0.008
 can+0.008
 has+0.007
lic+0.007
Neg logit contributions
 argue-0.013
cle-0.012
keeping-0.012
 reason-0.012
 elbows-0.012
Token do
Token ID466
Feature activation-1.89e-01

Pos logit contributions
cle+0.165
keeping+0.165
 elbows+0.162
 afar+0.160
 generation+0.155
Neg logit contributions
 is-0.134
 can-0.127
 that-0.126
 makes-0.122
 all-0.120
Token that
Token ID326
Feature activation-9.83e-01

Pos logit contributions
 argue+0.005
keeping+0.005
 reason+0.005
 elbows+0.005
cle+0.005
Neg logit contributions
der-0.003
 is-0.003
lic-0.003
 can-0.002
 has-0.002
Token.
Token ID13
Feature activation+3.97e-01

Pos logit contributions
cle+0.027
 argue+0.027
 elbows+0.026
keeping+0.026
 afar+0.025
Neg logit contributions
der-0.013
 is-0.013
 can-0.011
 has-0.011
 makes-0.011
Token They
Token ID1119
Feature activation+2.83e+00

Pos logit contributions
der+0.004
 is+0.004
lic+0.004
 can+0.004
 has+0.003
Neg logit contributions
 argue-0.012
keeping-0.011
cle-0.011
 elbows-0.011
 reason-0.011
Token thought
Token ID1807
Feature activation+4.09e+00

Pos logit contributions
der+0.051
ing+0.050
lic+0.049
boards+0.047
 is+0.046
Neg logit contributions
 argue-0.065
 panic-0.059
 weep-0.059
 worry-0.058
 reason-0.058
Token's
Token ID338
Feature activation+1.40e+00

Pos logit contributions
 argue+0.002
cle+0.002
 weep+0.002
 elbows+0.002
 reason+0.002
Neg logit contributions
der-0.001
lic-0.001
 is-0.001
 can-0.001
boards-0.001
Token mom
Token ID1995
Feature activation-1.33e+00

Pos logit contributions
der+0.025
 is+0.025
 can+0.024
 has+0.023
 makes+0.022
Neg logit contributions
 argue-0.031
cle-0.029
keeping-0.029
 elbows-0.029
 reason-0.029
Token helped
Token ID4193
Feature activation+1.37e+00

Pos logit contributions
 argue+0.028
cle+0.028
please+0.026
 worry+0.026
 weep+0.026
Neg logit contributions
der-0.026
 is-0.021
lic-0.020
lessly-0.019
boards-0.019
Token her
Token ID607
Feature activation-1.70e+00

Pos logit contributions
 is+0.023
 can+0.022
der+0.021
 all+0.020
 makes+0.020
Neg logit contributions
cle-0.031
 argue-0.031
 elbows-0.031
keeping-0.030
 afar-0.030
Token sew
Token ID34249
Feature activation+1.96e+00

Pos logit contributions
 argue+0.036
cle+0.035
keeping+0.035
 elbows+0.035
 reason+0.034
Neg logit contributions
der-0.031
 is-0.031
 can-0.029
 has-0.028
lic-0.028
Token Mr
Token ID1770
Feature activation-6.80e+00

Pos logit contributions
der+0.036
lic+0.035
 has+0.033
 can+0.031
 is+0.031
Neg logit contributions
 argue-0.040
keeping-0.040
 reason-0.039
 elbows-0.038
 afar-0.038
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
 argue+0.175
cle+0.171
keeping+0.168
 reason+0.166
 elbows+0.166
Neg logit contributions
der-0.095
 is-0.091
 can-0.087
lic-0.086
 has-0.083
Token Fl
Token ID1610
Feature activation-3.51e+00

Pos logit contributions
 argue+0.028
 reason+0.026
 weep+0.026
 elbows+0.026
keeping+0.026
Neg logit contributions
der-0.021
 can-0.020
 is-0.019
 has-0.019
 makes-0.017
Tokenuffy
Token ID15352
Feature activation-1.92e+00

Pos logit contributions
 argue+0.064
keeping+0.061
cle+0.060
 reason+0.059
 elbows+0.059
Neg logit contributions
der-0.077
 is-0.076
 can-0.073
 has-0.071
lic-0.069
Token back
Token ID736
Feature activation-1.31e+00

Pos logit contributions
 argue+0.047
 weep+0.044
 reason+0.043
 elbows+0.043
keeping+0.043
Neg logit contributions
der-0.030
lic-0.027
 is-0.026
 can-0.025
boards-0.025
Token together
Token ID1978
Feature activation+5.55e-01

Pos logit contributions
 argue+0.031
keeping+0.029
 reason+0.029
cle+0.029
 weep+0.029
Neg logit contributions
der-0.021
 is-0.020
lic-0.019
 can-0.019
 has-0.019
TokenL
Token ID43
Feature activation-1.52e+00

Pos logit contributions
 argue+0.013
cle+0.013
 reason+0.012
keeping+0.012
 elbows+0.012
Neg logit contributions
der-0.011
 is-0.011
 can-0.010
 has-0.010
lic-0.010
Tokenily
Token ID813
Feature activation+9.75e-01

Pos logit contributions
 argue+0.040
 reason+0.039
keeping+0.039
cle+0.039
 weep+0.038
Neg logit contributions
der-0.019
 is-0.018
 can-0.018
 has-0.017
lic-0.017
Token didn
Token ID1422
Feature activation+5.00e-04

Pos logit contributions
der+0.018
 is+0.017
 can+0.017
lic+0.016
 has+0.016
Neg logit contributions
 argue-0.021
cle-0.020
 reason-0.019
 weep-0.019
 elbows-0.019
Token't
Token ID470
Feature activation-3.11e+00

Pos logit contributions
der+0.000
 is+0.000
 can+0.000
 has+0.000
lic+0.000
Neg logit contributions
 argue-0.000
cle-0.000
keeping-0.000
 reason-0.000
 elbows-0.000
Token want
Token ID765
Feature activation+6.22e-02

Pos logit contributions
cle+0.049
 argue+0.049
keeping+0.048
 elbows+0.048
 afar+0.045
Neg logit contributions
der-0.073
 is-0.072
 can-0.072
 has-0.069
lic-0.067
Token to
Token ID284
Feature activation-7.22e+00

Pos logit contributions
 is+0.001
der+0.001
 can+0.001
 has+0.001
 makes+0.001
Neg logit contributions
 argue-0.002
cle-0.002
keeping-0.002
 elbows-0.002
 reason-0.001
Token accept
Token ID2453
Feature activation+9.29e-01

Pos logit contributions
cle+0.175
keeping+0.174
 elbows+0.170
 argue+0.170
 afar+0.167
Neg logit contributions
 is-0.110
 can-0.107
der-0.107
 has-0.097
 makes-0.097
Token mom
Token ID1995
Feature activation-5.06e+00

Pos logit contributions
der+0.014
 is+0.014
 can+0.013
 has+0.012
 makes+0.012
Neg logit contributions
 argue-0.023
cle-0.023
keeping-0.022
 elbows-0.022
 reason-0.022
Tokenmy
Token ID1820
Feature activation-4.73e+00

Pos logit contributions
cle+0.105
 argue+0.104
 elbows+0.103
keeping+0.103
 afar+0.102
Neg logit contributions
 is-0.098
 can-0.094
 has-0.090
der-0.088
 makes-0.087
Token's
Token ID338
Feature activation-2.51e+00

Pos logit contributions
cle+0.131
 argue+0.130
 elbows+0.128
keeping+0.127
 reason+0.125
Neg logit contributions
 is-0.058
der-0.055
 can-0.054
 has-0.051
 makes-0.048
Token request
Token ID2581
Feature activation+9.84e-01

Pos logit contributions
cle+0.053
keeping+0.051
 argue+0.050
 afar+0.049
 elbows+0.049
Neg logit contributions
 is-0.049
der-0.047
 can-0.047
 helps-0.044
 makes-0.044
Token Ben
Token ID3932
Feature activation-1.17e+00

Pos logit contributions
 worry+0.026
 reason+0.025
 argue+0.024
aster+0.023
 weep+0.023
Neg logit contributions
der-0.041
boards-0.039
lic-0.039
led-0.038
lessly-0.037
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 argue+0.027
cle+0.027
keeping+0.026
 reason+0.026
 weep+0.025
Neg logit contributions
der-0.019
 is-0.017
lic-0.017
 can-0.017
boards-0.016
Token Do
Token ID2141
Feature activation+1.37e+00

Pos logit contributions
 argue+0.026
cle+0.025
keeping+0.025
 elbows+0.025
 weep+0.025
Neg logit contributions
der-0.020
 is-0.018
lic-0.017
 can-0.017
 has-0.017
Token you
Token ID345
Feature activation-4.50e+00

Pos logit contributions
der+0.022
 is+0.022
 can+0.021
 has+0.020
lic+0.020
Neg logit contributions
 argue-0.032
cle-0.031
keeping-0.031
 elbows-0.031
 reason-0.031
Token want
Token ID765
Feature activation-3.50e-01

Pos logit contributions
cle+0.108
keeping+0.105
 elbows+0.104
 Johnson+0.099
Susan+0.096
Neg logit contributions
 can-0.089
 is-0.083
 has-0.079
 makes-0.073
 of-0.073
Token to
Token ID284
Feature activation-6.60e+00

Pos logit contributions
 argue+0.008
 reason+0.008
cle+0.008
keeping+0.008
 elbows+0.008
Neg logit contributions
der-0.006
 is-0.005
lic-0.005
 can-0.005
 has-0.005
Token buy
Token ID2822
Feature activation+1.15e+00

Pos logit contributions
keeping+0.122
cle+0.122
 elbows+0.120
 argue+0.115
 afar+0.113
Neg logit contributions
 is-0.143
 can-0.141
 makes-0.131
der-0.130
 has-0.130
Token some
Token ID617
Feature activation-1.03e+00

Pos logit contributions
der+0.021
lic+0.019
 has+0.018
 can+0.018
 is+0.018
Neg logit contributions
 argue-0.025
 afar-0.024
cle-0.024
 elbows-0.024
 reason-0.024
Token laundry
Token ID25724
Feature activation-3.87e-01

Pos logit contributions
 argue+0.023
 reason+0.023
cle+0.023
 elbows+0.022
 moderation+0.022
Neg logit contributions
der-0.017
lic-0.016
 is-0.015
 has-0.015
 can-0.015
Token?
Token ID30
Feature activation-8.04e-01

Pos logit contributions
 argue+0.006
cle+0.006
keeping+0.006
 reason+0.006
 moderation+0.006
Neg logit contributions
der-0.009
lic-0.008
lessly-0.008
boards-0.007
 has-0.007
Token I
Token ID314
Feature activation-3.00e+00

Pos logit contributions
 argue+0.019
cle+0.019
keeping+0.018
 elbows+0.018
 weep+0.018
Neg logit contributions
der-0.013
 is-0.011
lic-0.011
 can-0.010
boards-0.010