TOP ACTIVATIONS
MAX = 0.100

boy crying . She shouted out to him , " Don
Max shouted out , " Hey Benny !
" Hello ?" Lily called out . " Who 's there
auc et !" Lily called out . Her
O uch !" She cries out . She pulls back her
sharp pain . She cries out loud . "
friends ?" Tim my called out , running after Lily .
home . She quickly shouted out " Johnny , don 't
this here ," Lily said out loud . Just then ,
his finger . He cries out in pain .
his finger . He cries out loud . "
O uch !" he cried out . His mom quickly came
a shock . He cries out loud . He lets go
is this ?" she said out loud . She tried to
her mom , she shouted out , " Mom , look
's happening ?" Mia called out . The old
mom my !" she called out . " I found something
" Hello ?" she called out nervously . "
large man . He yelled out to her ,
's there ?" She cried out . "

MIN ACTIVATIONS
MIN = -0.087

they went to the sports team ’ s try
ant hill . At the ant hill , the ants welcomed
Dad took him to the judge . The judge asked George
on his way from the ant hill , he noticed a
they went to the little neighbor 's birthday party . Toby
go home . Do you want some yogurt ?" Tom and
of fun . Eventually the children realised that it didn ’
of them went to the closest or chard .
she went back to the sport every day , and she
, Tim went to the contest with his mom . There
more waste . Do you want to help me ?"
mom my went to the neighbor 's house and she used
decided to play in his neighbour 's yard . He scam
One day the mayor of the town had an
so she went to the neighbor 's house to listen .
permit , but you also need to respect and care .
of you . But we need to go to the doctor
She went to the contest and sang her song .
said she should go and have fun at the party .
day of their vacation the family got into their car and

INTERVAL 0.053 - 0.100
CONTAINS 0.478%

bs , which were sticky and white . "
their toys . They said , " Sure , you can
her away , and said , " Go away , you
couldn 't find it . Then , her
ily nodded . She said , " I understand , mom

INTERVAL 0.006 - 0.053
CONTAINS 34.199%

forget about the iron . They are happy . <|endoftext|> Tom
He told her to go away and ignore her .
looked at the pies . They all looked y ummy and
can look at how they act , and how they treat
he was playing in his bedroom when he noticed something strange

INTERVAL -0.041 - 0.006
CONTAINS 64.505%

plan . He asked his big sister , Lily , to
Lily . She loved to go to the park with her
The pig asked the cat , " Can you
ie . We need to pay the bills first ."
They are not happy . " Will the dog

INTERVAL -0.087 - -0.041
CONTAINS 0.818%

reach higher . Finally , Bob was able to grab the
charitable . You don 't need material possessions to support those
So Joey and his teacher went to the garden .
it was sad . They needed the tower to heal them
ily was sick . She had a hot head and a
Token boy
Token ID2933
Feature activation-9.56e-03

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.004
 low+0.004
 vel+0.004
Neg logit contributions
old-0.006
off-0.005
 artist-0.005
 tournament-0.005
 endings-0.005
Token crying
Token ID13774
Feature activation+2.78e-02

Pos logit contributions
old+0.016
 artist+0.014
off+0.013
er+0.013
 example+0.013
Neg logit contributions
ifying-0.019
 upon-0.015
suit-0.015
 low-0.014
 Deal-0.013
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
ifying+0.070
 upon+0.057
 Deal+0.057
suit+0.057
arl+0.053
Neg logit contributions
old-0.029
er-0.029
 artist-0.024
 days-0.024
 that-0.024
Token She
Token ID1375
Feature activation+1.50e-02

Pos logit contributions
ifying+0.021
 upon+0.015
suit+0.015
 Deal+0.014
 low+0.014
Neg logit contributions
old-0.022
 artist-0.020
er-0.019
off-0.019
 example-0.018
Token shouted
Token ID17293
Feature activation+4.04e-02

Pos logit contributions
ifying+0.034
suit+0.027
 upon+0.027
 Deal+0.025
arl+0.025
Neg logit contributions
old-0.022
 artist-0.019
er-0.018
off-0.017
 members-0.017
Token out
Token ID503
Feature activation+9.95e-02

Pos logit contributions
ifying+0.094
suit+0.074
 upon+0.072
 Deal+0.068
arl+0.067
Neg logit contributions
old-0.056
er-0.047
 artist-0.047
off-0.045
 example-0.044
Token to
Token ID284
Feature activation+2.15e-02

Pos logit contributions
ifying+0.252
suit+0.212
 Deal+0.196
 upon+0.196
arl+0.183
Neg logit contributions
old-0.111
 example-0.095
er-0.094
 that-0.091
 artist-0.090
Token him
Token ID683
Feature activation+1.33e-02

Pos logit contributions
ifying+0.062
suit+0.051
istered+0.047
 upon+0.044
arl+0.042
Neg logit contributions
old-0.025
 adventure-0.020
 that-0.019
 artist-0.019
off-0.019
Token,
Token ID11
Feature activation+4.44e-02

Pos logit contributions
ifying+0.031
suit+0.024
 upon+0.024
 Deal+0.022
 low+0.022
Neg logit contributions
old-0.019
er-0.016
 artist-0.016
off-0.015
 example-0.015
Token "
Token ID366
Feature activation-3.77e-02

Pos logit contributions
ifying+0.095
suit+0.075
istered+0.064
 Deal+0.062
kin+0.060
Neg logit contributions
 that-0.071
old-0.067
er-0.063
 example-0.058
 artist-0.057
TokenDon
Token ID3987
Feature activation+2.96e-03

Pos logit contributions
old+0.073
off+0.067
 artist+0.067
er+0.065
 members+0.063
Neg logit contributions
ifying-0.064
suit-0.049
 upon-0.045
 low-0.043
 Deal-0.042
Token
Token ID220
Feature activation+5.24e-02

Pos logit contributions
ifying+0.011
 upon+0.009
suit+0.009
 low+0.008
 Deal+0.008
Neg logit contributions
old-0.011
 artist-0.010
off-0.010
er-0.010
 example-0.009
Token
Token ID198
Feature activation+1.47e-02

Pos logit contributions
ifying+0.119
suit+0.094
 vel+0.086
 upon+0.084
 scr+0.082
Neg logit contributions
old-0.079
off-0.063
er-0.060
 members-0.060
 artist-0.056
Token
Token ID198
Feature activation+7.45e-03

Pos logit contributions
ifying+0.024
 upon+0.019
suit+0.018
 low+0.017
 Deal+0.016
Neg logit contributions
old-0.028
 artist-0.027
off-0.025
er-0.025
 endings-0.024
TokenMax
Token ID11518
Feature activation-2.00e-02

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.011
 low+0.010
 Deal+0.010
Neg logit contributions
old-0.013
 artist-0.012
off-0.011
er-0.011
 example-0.011
Token shouted
Token ID17293
Feature activation+4.07e-02

Pos logit contributions
old+0.032
 artist+0.029
off+0.028
 endings+0.028
 example+0.027
Neg logit contributions
ifying-0.037
 upon-0.033
suit-0.030
 low-0.030
 scr-0.027
Token out
Token ID503
Feature activation+9.86e-02

Pos logit contributions
ifying+0.093
suit+0.072
 upon+0.072
 Deal+0.067
arl+0.066
Neg logit contributions
old-0.057
 artist-0.050
er-0.049
 example-0.047
off-0.047
Token,
Token ID11
Feature activation+5.93e-02

Pos logit contributions
ifying+0.227
suit+0.185
 upon+0.173
 Deal+0.167
 low+0.159
Neg logit contributions
old-0.133
er-0.115
 example-0.115
 artist-0.115
 adventure-0.110
Token "
Token ID366
Feature activation-2.63e-02

Pos logit contributions
ifying+0.131
suit+0.097
istered+0.086
 Deal+0.084
 upon+0.082
Neg logit contributions
 that-0.090
er-0.084
old-0.081
 example-0.077
 adventure-0.072
TokenHey
Token ID10814
Feature activation-3.44e-03

Pos logit contributions
old+0.048
 artist+0.044
off+0.044
er+0.042
 endings+0.041
Neg logit contributions
ifying-0.047
 upon-0.036
suit-0.035
 low-0.033
 tim-0.032
Token Benny
Token ID44275
Feature activation-5.79e-04

Pos logit contributions
old+0.005
 artist+0.004
er+0.004
off+0.004
 tournament+0.004
Neg logit contributions
ifying-0.008
 upon-0.006
suit-0.006
 low-0.006
 Deal-0.006
Token!
Token ID0
Feature activation-1.15e-02

Pos logit contributions
old+0.001
 artist+0.001
er+0.001
off+0.001
 example+0.001
Neg logit contributions
ifying-0.001
 upon-0.001
suit-0.001
 low-0.001
 Deal-0.001
Token "
Token ID366
Feature activation-2.01e-02

Pos logit contributions
ifying+0.044
 upon+0.032
suit+0.030
 Deal+0.030
 tim+0.029
Neg logit contributions
old-0.039
er-0.034
 artist-0.033
off-0.032
 that-0.032
TokenHello
Token ID15496
Feature activation-2.52e-03

Pos logit contributions
old+0.043
 artist+0.039
off+0.039
er+0.038
 example+0.037
Neg logit contributions
ifying-0.030
 upon-0.022
suit-0.021
 low-0.019
 Deal-0.018
Token?"
Token ID1701
Feature activation+3.08e-02

Pos logit contributions
old+0.004
 artist+0.003
er+0.003
off+0.003
 example+0.003
Neg logit contributions
ifying-0.006
 upon-0.004
suit-0.004
 low-0.004
 Deal-0.004
Token Lily
Token ID20037
Feature activation+1.29e-02

Pos logit contributions
ifying+0.058
 upon+0.048
suit+0.047
 low+0.046
 limp+0.040
Neg logit contributions
old-0.050
 artist-0.046
off-0.045
 endings-0.044
 example-0.044
Token called
Token ID1444
Feature activation+4.30e-02

Pos logit contributions
ifying+0.026
 upon+0.023
 low+0.022
suit+0.021
 scr+0.020
Neg logit contributions
old-0.018
 artist-0.017
 endings-0.016
off-0.016
 example-0.016
Token out
Token ID503
Feature activation+9.84e-02

Pos logit contributions
ifying+0.088
suit+0.066
 upon+0.065
 Deal+0.061
arl+0.059
Neg logit contributions
old-0.070
 artist-0.062
er-0.061
off-0.058
 example-0.058
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
ifying+0.239
suit+0.195
 upon+0.185
 Deal+0.183
arl+0.169
Neg logit contributions
old-0.125
er-0.105
 artist-0.102
 that-0.100
 adventure-0.098
Token "
Token ID366
Feature activation-8.79e-03

Pos logit contributions
ifying+0.048
istered+0.033
suit+0.033
 Deal+0.033
arl+0.032
Neg logit contributions
old-0.038
 that-0.034
er-0.033
 artist-0.032
 Everyone-0.031
TokenWho
Token ID8241
Feature activation-1.41e-02

Pos logit contributions
old+0.016
off+0.015
 artist+0.015
 members+0.014
 tournament+0.014
Neg logit contributions
ifying-0.015
suit-0.011
 upon-0.011
 low-0.010
 Deal-0.010
Token's
Token ID338
Feature activation-1.20e-02

Pos logit contributions
old+0.028
 artist+0.027
er+0.026
 example+0.026
off+0.026
Neg logit contributions
ifying-0.021
 upon-0.016
suit-0.014
 low-0.013
 tim-0.013
Token there
Token ID612
Feature activation+8.42e-03

Pos logit contributions
old+0.022
 artist+0.020
off+0.019
er+0.019
 endings+0.019
Neg logit contributions
ifying-0.021
 upon-0.017
 low-0.015
suit-0.015
 tim-0.014
Tokenauc
Token ID14272
Feature activation+1.52e-02

Pos logit contributions
ifying+0.013
 upon+0.011
suit+0.011
arl+0.010
 low+0.010
Neg logit contributions
old-0.006
 artist-0.005
 endings-0.005
off-0.005
 example-0.005
Tokenet
Token ID316
Feature activation+2.07e-02

Pos logit contributions
ifying+0.032
 upon+0.026
suit+0.026
 low+0.024
 Deal+0.023
Neg logit contributions
old-0.023
 artist-0.020
off-0.019
er-0.018
 endings-0.018
Token!"
Token ID2474
Feature activation+2.41e-02

Pos logit contributions
ifying+0.043
 upon+0.035
suit+0.033
 low+0.031
 tim+0.030
Neg logit contributions
old-0.033
 artist-0.028
off-0.027
 example-0.027
 endings-0.027
Token Lily
Token ID20037
Feature activation+6.23e-03

Pos logit contributions
ifying+0.049
 upon+0.042
suit+0.040
 low+0.039
 Deal+0.036
Neg logit contributions
old-0.036
 artist-0.033
off-0.032
 example-0.031
 tournament-0.030
Token called
Token ID1444
Feature activation+4.82e-02

Pos logit contributions
ifying+0.011
 upon+0.010
suit+0.009
 scr+0.009
 low+0.009
Neg logit contributions
old-0.009
off-0.009
 artist-0.009
 endings-0.009
 tournament-0.009
Token out
Token ID503
Feature activation+9.74e-02

Pos logit contributions
ifying+0.105
 upon+0.079
suit+0.079
 Deal+0.076
arl+0.073
Neg logit contributions
old-0.073
 artist-0.066
er-0.064
 adventure-0.060
 example-0.060
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
ifying+0.225
suit+0.182
 upon+0.175
 Deal+0.171
arl+0.159
Neg logit contributions
old-0.132
 artist-0.115
er-0.114
 that-0.107
off-0.106
Token
Token ID220
Feature activation+6.78e-02

Pos logit contributions
ifying+0.099
istered+0.073
suit+0.073
 tim+0.070
 vel+0.070
Neg logit contributions
old-0.040
 that-0.037
er-0.035
 Everyone-0.034
 members-0.032
Token
Token ID198
Feature activation+2.53e-02

Pos logit contributions
ifying+0.148
suit+0.116
 upon+0.106
 vel+0.104
 low+0.102
Neg logit contributions
old-0.108
off-0.088
er-0.086
 artist-0.083
 members-0.083
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
ifying+0.058
suit+0.045
 vel+0.041
 upon+0.041
 Deal+0.040
Neg logit contributions
old-0.038
off-0.031
er-0.030
 members-0.029
 artist-0.028
TokenHer
Token ID9360
Feature activation+2.05e-02

Pos logit contributions
ifying+0.031
suit+0.023
 upon+0.023
 Deal+0.021
 vel+0.021
Neg logit contributions
old-0.023
off-0.019
 artist-0.019
er-0.019
 members-0.018
TokenO
Token ID46
Feature activation+1.18e-02

Feature not active on prev token;
no predictions were affected.
Tokenuch
Token ID794
Feature activation+1.39e-02

Pos logit contributions
ifying+0.023
 upon+0.018
suit+0.018
 low+0.016
 tim+0.016
Neg logit contributions
old-0.020
 artist-0.018
off-0.017
er-0.017
 example-0.017
Token!"
Token ID2474
Feature activation+2.80e-02

Pos logit contributions
ifying+0.035
 upon+0.027
 Deal+0.027
suit+0.027
Me+0.025
Neg logit contributions
old-0.015
er-0.014
off-0.013
 artist-0.013
 that-0.012
Token She
Token ID1375
Feature activation+3.66e-02

Pos logit contributions
ifying+0.057
 upon+0.045
suit+0.044
 low+0.041
 Deal+0.040
Neg logit contributions
old-0.045
 artist-0.039
off-0.038
er-0.037
 endings-0.036
Token cries
Token ID24691
Feature activation+4.90e-02

Pos logit contributions
ifying+0.082
 upon+0.067
suit+0.065
 low+0.062
 Deal+0.059
Neg logit contributions
old-0.051
 artist-0.045
off-0.043
er-0.042
 example-0.040
Token out
Token ID503
Feature activation+9.70e-02

Pos logit contributions
ifying+0.093
 upon+0.076
suit+0.072
 low+0.071
 Deal+0.066
Neg logit contributions
old-0.083
 artist-0.076
off-0.073
 tournament-0.071
 endings-0.069
Token.
Token ID13
Feature activation+4.41e-02

Pos logit contributions
ifying+0.226
suit+0.182
 upon+0.171
 Deal+0.168
arl+0.158
Neg logit contributions
old-0.129
er-0.116
 artist-0.110
 example-0.108
 days-0.107
Token She
Token ID1375
Feature activation+2.96e-02

Pos logit contributions
ifying+0.102
suit+0.077
 upon+0.074
 vel+0.071
istered+0.071
Neg logit contributions
old-0.057
 that-0.054
er-0.050
 example-0.047
 endings-0.047
Token pulls
Token ID16194
Feature activation+1.52e-02

Pos logit contributions
ifying+0.072
suit+0.056
arl+0.056
 upon+0.053
bel+0.051
Neg logit contributions
old-0.036
 wishes-0.031
er-0.029
 prints-0.029
 hospital-0.027
Token back
Token ID736
Feature activation+1.63e-02

Pos logit contributions
ifying+0.031
suit+0.024
 upon+0.024
 Deal+0.021
arl+0.021
Neg logit contributions
old-0.025
er-0.021
 artist-0.021
off-0.021
 example-0.021
Token her
Token ID607
Feature activation+1.27e-02

Pos logit contributions
ifying+0.032
 upon+0.025
suit+0.024
 Deal+0.022
 low+0.022
Neg logit contributions
old-0.028
 artist-0.024
off-0.024
er-0.024
 example-0.023
Token sharp
Token ID7786
Feature activation+3.33e-02

Pos logit contributions
ifying+0.081
 upon+0.067
suit+0.065
 Deal+0.059
bat+0.056
Neg logit contributions
 that-0.055
 days-0.054
old-0.053
is-0.053
 artist-0.051
Token pain
Token ID2356
Feature activation+4.25e-02

Pos logit contributions
ifying+0.066
 upon+0.052
suit+0.050
 Deal+0.046
 low+0.046
Neg logit contributions
old-0.054
 artist-0.050
er-0.048
 example-0.046
off-0.046
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
ifying+0.089
 upon+0.072
 Deal+0.072
suit+0.071
Me+0.064
Neg logit contributions
er-0.058
old-0.058
 artist-0.055
 that-0.054
 days-0.050
Token She
Token ID1375
Feature activation+2.19e-02

Pos logit contributions
ifying+0.030
suit+0.024
 upon+0.023
 vel+0.021
 scr+0.021
Neg logit contributions
old-0.020
 endings-0.017
 artist-0.016
off-0.016
er-0.016
Token cries
Token ID24691
Feature activation+3.84e-02

Pos logit contributions
ifying+0.052
suit+0.043
 upon+0.042
arl+0.039
 tim+0.038
Neg logit contributions
old-0.026
er-0.021
 wishes-0.021
 artist-0.020
 that-0.020
Token out
Token ID503
Feature activation+9.62e-02

Pos logit contributions
ifying+0.088
suit+0.070
 upon+0.068
 Deal+0.064
Me+0.061
Neg logit contributions
old-0.051
er-0.046
 artist-0.043
off-0.042
 that-0.041
Token loud
Token ID7812
Feature activation+2.36e-02

Pos logit contributions
ifying+0.223
suit+0.189
 Deal+0.166
 upon+0.163
 haven+0.152
Neg logit contributions
old-0.127
 that-0.120
er-0.118
 days-0.113
 example-0.111
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
ifying+0.054
 upon+0.044
suit+0.043
 Deal+0.040
 low+0.040
Neg logit contributions
old-0.031
er-0.029
 artist-0.027
off-0.027
 example-0.026
Token
Token ID198
Feature activation+1.94e-02

Pos logit contributions
ifying+0.041
suit+0.033
 upon+0.032
 vel+0.030
 scr+0.029
Neg logit contributions
old-0.022
 that-0.018
 endings-0.018
 artist-0.017
er-0.017
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
ifying+0.037
 upon+0.028
suit+0.028
 low+0.025
 Deal+0.025
Neg logit contributions
old-0.035
 artist-0.030
off-0.030
er-0.029
 example-0.028
Token"
Token ID1
Feature activation-8.39e-03

Pos logit contributions
ifying+0.025
 upon+0.019
suit+0.019
 low+0.017
 Deal+0.016
Neg logit contributions
old-0.023
off-0.020
 artist-0.020
er-0.020
 example-0.019
Token friends
Token ID2460
Feature activation-1.34e-02

Pos logit contributions
old+0.017
 artist+0.015
off+0.014
er+0.014
 example+0.013
Neg logit contributions
ifying-0.036
 upon-0.030
suit-0.030
 low-0.028
 Deal-0.027
Token?"
Token ID1701
Feature activation+2.18e-02

Pos logit contributions
old+0.024
off+0.021
 artist+0.021
 example+0.020
unn+0.020
Neg logit contributions
ifying-0.023
 upon-0.019
 low-0.018
suit-0.017
 Deal-0.016
Token Tim
Token ID5045
Feature activation+4.18e-03

Pos logit contributions
ifying+0.039
 upon+0.034
 low+0.031
suit+0.029
 Deal+0.028
Neg logit contributions
old-0.037
off-0.035
 artist-0.035
er-0.034
 example-0.033
Tokenmy
Token ID1820
Feature activation+1.57e-02

Pos logit contributions
ifying+0.006
 upon+0.005
suit+0.005
 low+0.005
 Deal+0.004
Neg logit contributions
old-0.008
 artist-0.008
off-0.007
 endings-0.007
 example-0.007
Token called
Token ID1444
Feature activation+3.93e-02

Pos logit contributions
 low+0.024
 upon+0.023
ifying+0.023
suit+0.022
 mis+0.021
Neg logit contributions
old-0.026
 endings-0.025
 example-0.024
 artist-0.024
off-0.024
Token out
Token ID503
Feature activation+9.53e-02

Pos logit contributions
ifying+0.081
 upon+0.065
suit+0.063
 low+0.060
 Deal+0.058
Neg logit contributions
old-0.061
 artist-0.055
off-0.053
er-0.051
 example-0.050
Token,
Token ID11
Feature activation+5.25e-02

Pos logit contributions
ifying+0.235
suit+0.192
 upon+0.180
 Deal+0.177
 vel+0.168
Neg logit contributions
old-0.114
er-0.100
 artist-0.095
 that-0.094
 example-0.092
Token running
Token ID2491
Feature activation+7.81e-03

Pos logit contributions
ifying+0.128
suit+0.105
arl+0.095
 Deal+0.093
 upon+0.090
Neg logit contributions
 that-0.061
er-0.059
old-0.058
 more-0.057
 artist-0.057
Token after
Token ID706
Feature activation+1.39e-02

Pos logit contributions
ifying+0.018
 upon+0.014
suit+0.013
 Deal+0.013
 low+0.013
Neg logit contributions
old-0.011
 artist-0.010
off-0.009
er-0.009
 example-0.009
Token Lily
Token ID20037
Feature activation-1.60e-02

Pos logit contributions
ifying+0.027
 upon+0.021
suit+0.021
 low+0.019
 Deal+0.019
Neg logit contributions
old-0.023
 artist-0.021
off-0.020
er-0.020
 example-0.019
Token.
Token ID13
Feature activation+2.65e-02

Pos logit contributions
old+0.025
 artist+0.023
off+0.022
er+0.021
 example+0.021
Neg logit contributions
ifying-0.032
 upon-0.026
suit-0.025
 low-0.024
 Deal-0.023
Token home
Token ID1363
Feature activation+1.79e-03

Pos logit contributions
old+0.010
 artist+0.009
off+0.009
er+0.009
 example+0.008
Neg logit contributions
ifying-0.015
 upon-0.013
suit-0.012
 low-0.012
 Deal-0.011
Token.
Token ID13
Feature activation-9.85e-04

Pos logit contributions
ifying+0.004
 upon+0.003
suit+0.003
 Deal+0.003
 low+0.003
Neg logit contributions
old-0.003
 artist-0.002
er-0.002
off-0.002
 members-0.002
Token She
Token ID1375
Feature activation+5.66e-03

Pos logit contributions
old+0.002
 artist+0.002
er+0.002
off+0.002
 example+0.001
Neg logit contributions
ifying-0.002
suit-0.001
 upon-0.001
 tim-0.001
 low-0.001
Token quickly
Token ID2952
Feature activation-1.15e-02

Pos logit contributions
ifying+0.011
 upon+0.008
suit+0.008
 low+0.008
 Deal+0.008
Neg logit contributions
old-0.010
 artist-0.009
er-0.008
off-0.008
 example-0.008
Token shouted
Token ID17293
Feature activation+3.31e-02

Pos logit contributions
old+0.024
 artist+0.024
 endings+0.023
off+0.023
 example+0.022
Neg logit contributions
ifying-0.015
 upon-0.011
suit-0.011
 low-0.010
 scr-0.009
Token out
Token ID503
Feature activation+9.46e-02

Pos logit contributions
ifying+0.074
suit+0.057
 upon+0.057
 Deal+0.053
arl+0.053
Neg logit contributions
old-0.048
er-0.042
 artist-0.040
 example-0.040
 that-0.039
Token "
Token ID366
Feature activation-2.61e-02

Pos logit contributions
ifying+0.219
suit+0.184
 upon+0.166
 Deal+0.165
 haven+0.156
Neg logit contributions
old-0.124
 example-0.112
er-0.110
 that-0.109
 adventure-0.107
TokenJohnny
Token ID44960
Feature activation-1.59e-02

Pos logit contributions
old+0.044
 artist+0.040
off+0.040
er+0.038
 members+0.037
Neg logit contributions
ifying-0.050
suit-0.039
 upon-0.038
 low-0.037
 Deal-0.036
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
old+0.021
er+0.019
 artist+0.018
off+0.018
 example+0.016
Neg logit contributions
ifying-0.037
 upon-0.029
suit-0.029
 low-0.029
 Deal-0.028
Token don
Token ID836
Feature activation+3.02e-03

Pos logit contributions
ifying+0.025
suit+0.020
 upon+0.020
 low+0.019
 Deal+0.019
Neg logit contributions
old-0.015
 artist-0.013
off-0.013
er-0.012
 that-0.012
Token't
Token ID470
Feature activation-4.06e-03

Pos logit contributions
ifying+0.006
suit+0.005
 upon+0.005
 low+0.005
 vel+0.005
Neg logit contributions
old-0.004
off-0.004
er-0.004
 artist-0.004
 that-0.004
Token this
Token ID428
Feature activation-1.32e-02

Pos logit contributions
old+0.004
 artist+0.004
er+0.004
off+0.004
 example+0.003
Neg logit contributions
ifying-0.007
 upon-0.006
suit-0.006
 low-0.006
 Deal-0.006
Token here
Token ID994
Feature activation+3.53e-03

Pos logit contributions
old+0.019
 artist+0.017
off+0.016
er+0.016
 example+0.016
Neg logit contributions
ifying-0.028
 upon-0.023
suit-0.022
 low-0.021
 Deal-0.021
Token,"
Token ID553
Feature activation+3.51e-02

Pos logit contributions
ifying+0.009
 upon+0.007
suit+0.007
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.004
 artist-0.004
off-0.003
er-0.003
 example-0.003
Token Lily
Token ID20037
Feature activation+5.71e-03

Pos logit contributions
ifying+0.075
 upon+0.062
suit+0.060
 low+0.058
 Deal+0.054
Neg logit contributions
old-0.051
 artist-0.046
off-0.044
er-0.042
 example-0.042
Token said
Token ID531
Feature activation+3.95e-02

Pos logit contributions
 upon+0.009
ifying+0.009
 low+0.008
suit+0.008
 scr+0.008
Neg logit contributions
old-0.009
 endings-0.009
 artist-0.009
off-0.009
 example-0.009
Token out
Token ID503
Feature activation+9.44e-02

Pos logit contributions
ifying+0.083
 upon+0.067
suit+0.065
 low+0.061
 Deal+0.060
Neg logit contributions
old-0.060
 artist-0.053
off-0.051
er-0.051
 example-0.049
Token loud
Token ID7812
Feature activation+1.69e-02

Pos logit contributions
ifying+0.190
suit+0.153
 upon+0.144
 Deal+0.140
arl+0.128
Neg logit contributions
old-0.152
er-0.140
 artist-0.137
 that-0.136
 example-0.130
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
ifying+0.034
 upon+0.027
suit+0.027
 low+0.025
 Deal+0.024
Neg logit contributions
old-0.027
 artist-0.024
off-0.023
er-0.023
 example-0.022
Token Just
Token ID2329
Feature activation+1.10e-02

Pos logit contributions
ifying+0.036
 upon+0.024
suit+0.024
 Deal+0.023
istered+0.021
Neg logit contributions
old-0.036
 artist-0.033
er-0.033
off-0.031
 example-0.030
Token then
Token ID788
Feature activation+2.21e-02

Pos logit contributions
ifying+0.027
 upon+0.022
suit+0.021
 low+0.020
 Deal+0.020
Neg logit contributions
old-0.014
 artist-0.012
er-0.011
off-0.011
 example-0.011
Token,
Token ID11
Feature activation+8.56e-03

Pos logit contributions
ifying+0.057
suit+0.046
arl+0.044
 upon+0.044
 Deal+0.043
Neg logit contributions
old-0.026
 artist-0.021
er-0.020
 that-0.019
 example-0.019
Token his
Token ID465
Feature activation+3.51e-02

Pos logit contributions
ifying+0.056
suit+0.045
 upon+0.043
 Deal+0.043
arl+0.042
Neg logit contributions
old-0.032
er-0.029
 that-0.026
 artist-0.026
 more-0.025
Token finger
Token ID7660
Feature activation+3.86e-02

Pos logit contributions
ifying+0.076
 haven+0.059
 upon+0.057
arl+0.054
suit+0.050
Neg logit contributions
er-0.045
 example-0.043
old-0.043
 more-0.042
 artist-0.042
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
ifying+0.087
 upon+0.071
suit+0.069
 Deal+0.067
arl+0.065
Neg logit contributions
old-0.052
er-0.046
 artist-0.044
off-0.043
 example-0.041
Token He
Token ID679
Feature activation+2.03e-02

Pos logit contributions
ifying+0.032
suit+0.024
 upon+0.024
 vel+0.022
 tim+0.021
Neg logit contributions
old-0.024
er-0.021
 artist-0.021
 example-0.020
off-0.020
Token cries
Token ID24691
Feature activation+3.78e-02

Pos logit contributions
ifying+0.042
 upon+0.033
suit+0.033
arl+0.030
 low+0.030
Neg logit contributions
old-0.032
 artist-0.028
er-0.027
off-0.027
 example-0.026
Token out
Token ID503
Feature activation+9.40e-02

Pos logit contributions
ifying+0.085
 upon+0.066
suit+0.065
 Deal+0.063
 haven+0.059
Neg logit contributions
old-0.051
er-0.047
 artist-0.044
 example-0.042
 days-0.042
Token in
Token ID287
Feature activation+3.94e-02

Pos logit contributions
ifying+0.211
suit+0.176
 Deal+0.161
 upon+0.151
 haven+0.150
Neg logit contributions
 that-0.128
er-0.124
old-0.121
 more-0.119
 adventure-0.118
Token pain
Token ID2356
Feature activation+1.66e-02

Pos logit contributions
ifying+0.087
suit+0.072
 upon+0.070
 haven+0.064
istered+0.062
Neg logit contributions
 example-0.051
old-0.050
 adventure-0.049
 days-0.049
er-0.045
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
ifying+0.041
 upon+0.033
suit+0.033
 Deal+0.032
 low+0.031
Neg logit contributions
er-0.018
old-0.018
 days-0.015
 that-0.015
 artist-0.014
Token
Token ID198
Feature activation+2.07e-02

Pos logit contributions
ifying+0.049
suit+0.038
 upon+0.036
 vel+0.034
 tim+0.033
Neg logit contributions
old-0.032
er-0.028
 example-0.027
 that-0.027
 artist-0.026
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
ifying+0.038
 upon+0.029
suit+0.028
 low+0.026
 Deal+0.025
Neg logit contributions
old-0.038
 artist-0.033
off-0.033
er-0.032
 example-0.031
Token his
Token ID465
Feature activation+1.88e-02

Pos logit contributions
ifying+0.074
suit+0.059
 Deal+0.057
 upon+0.056
arl+0.054
Neg logit contributions
er-0.038
old-0.037
 that-0.032
 more-0.032
 days-0.031
Token finger
Token ID7660
Feature activation+3.71e-02

Pos logit contributions
ifying+0.039
 upon+0.029
suit+0.029
arl+0.029
 haven+0.027
Neg logit contributions
old-0.026
 example-0.026
 mistakes-0.026
 artist-0.025
 days-0.025
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
ifying+0.082
suit+0.067
 Deal+0.067
 upon+0.066
arl+0.062
Neg logit contributions
old-0.049
er-0.048
 that-0.044
off-0.042
 more-0.041
Token He
Token ID679
Feature activation+2.41e-02

Pos logit contributions
ifying+0.040
suit+0.031
 upon+0.030
 vel+0.027
 low+0.026
Neg logit contributions
old-0.031
 artist-0.026
er-0.026
off-0.026
 example-0.025
Token cries
Token ID24691
Feature activation+3.67e-02

Pos logit contributions
ifying+0.049
suit+0.039
 upon+0.039
 low+0.035
arl+0.035
Neg logit contributions
old-0.038
 artist-0.033
er-0.032
off-0.032
 example-0.030
Token out
Token ID503
Feature activation+9.36e-02

Pos logit contributions
ifying+0.081
 upon+0.064
suit+0.064
 Deal+0.059
 low+0.057
Neg logit contributions
old-0.051
er-0.045
 artist-0.045
off-0.043
 example-0.042
Token loud
Token ID7812
Feature activation+2.19e-02

Pos logit contributions
ifying+0.198
suit+0.161
 Deal+0.145
 upon+0.143
 haven+0.133
Neg logit contributions
old-0.140
er-0.135
 that-0.132
 example-0.129
 days-0.127
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
ifying+0.050
 upon+0.041
suit+0.040
 Deal+0.038
 low+0.037
Neg logit contributions
old-0.029
er-0.027
 artist-0.025
off-0.025
 example-0.024
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
ifying+0.051
suit+0.040
 upon+0.038
 vel+0.036
 scr+0.034
Neg logit contributions
old-0.033
er-0.028
 artist-0.028
 example-0.027
off-0.027
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
ifying+0.029
 upon+0.023
suit+0.021
 low+0.020
 Deal+0.019
Neg logit contributions
old-0.031
 artist-0.028
off-0.027
er-0.027
 example-0.026
Token"
Token ID1
Feature activation-4.61e-03

Pos logit contributions
ifying+0.025
 upon+0.019
suit+0.019
 low+0.017
 Deal+0.017
Neg logit contributions
old-0.026
 artist-0.023
off-0.023
er-0.022
 example-0.022
TokenO
Token ID46
Feature activation+5.02e-03

Pos logit contributions
old+0.023
 artist+0.021
off+0.021
er+0.020
 members+0.020
Neg logit contributions
ifying-0.021
suit-0.017
 upon-0.017
arl-0.015
 low-0.014
Tokenuch
Token ID794
Feature activation+1.52e-02

Pos logit contributions
ifying+0.009
 upon+0.007
suit+0.006
 low+0.006
arl+0.006
Neg logit contributions
old-0.009
 artist-0.009
 example-0.008
er-0.008
off-0.008
Token!"
Token ID2474
Feature activation+1.98e-02

Pos logit contributions
ifying+0.038
 Deal+0.031
 upon+0.030
suit+0.030
Me+0.029
Neg logit contributions
old-0.016
er-0.016
 artist-0.014
 that-0.014
off-0.014
Token he
Token ID339
Feature activation+2.81e-02

Pos logit contributions
ifying+0.039
 upon+0.033
suit+0.031
 low+0.030
 Deal+0.028
Neg logit contributions
old-0.031
 artist-0.028
off-0.028
 example-0.026
 tournament-0.026
Token cried
Token ID16896
Feature activation+5.18e-02

Pos logit contributions
ifying+0.062
 upon+0.051
suit+0.049
 low+0.046
 Deal+0.046
Neg logit contributions
old-0.040
 artist-0.035
off-0.033
er-0.033
 example-0.031
Token out
Token ID503
Feature activation+9.32e-02

Pos logit contributions
ifying+0.113
 upon+0.091
suit+0.089
 Deal+0.082
 low+0.081
Neg logit contributions
old-0.075
 artist-0.067
er-0.065
off-0.062
 example-0.061
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
ifying+0.197
suit+0.160
 upon+0.151
 Deal+0.142
arl+0.134
Neg logit contributions
old-0.143
 artist-0.129
er-0.127
 example-0.121
 adventure-0.120
Token His
Token ID2399
Feature activation+1.18e-02

Pos logit contributions
ifying+0.040
suit+0.029
 upon+0.028
 Deal+0.026
arl+0.026
Neg logit contributions
old-0.033
er-0.031
 artist-0.030
 example-0.029
 site-0.029
Token mom
Token ID1995
Feature activation+4.24e-03

Pos logit contributions
ifying+0.025
 upon+0.019
suit+0.019
 low+0.017
 tim+0.017
Neg logit contributions
old-0.019
 artist-0.017
 adventure-0.016
 example-0.016
 days-0.016
Token quickly
Token ID2952
Feature activation+8.56e-03

Pos logit contributions
ifying+0.008
suit+0.006
 upon+0.006
 Deal+0.006
 low+0.006
Neg logit contributions
old-0.007
 artist-0.007
er-0.007
off-0.006
 members-0.006
Token came
Token ID1625
Feature activation+3.56e-03

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.010
 low+0.010
arl+0.009
Neg logit contributions
old-0.017
 artist-0.015
off-0.015
 endings-0.015
 example-0.014
Token a
Token ID257
Feature activation+4.12e-02

Pos logit contributions
ifying+0.025
suit+0.022
 upon+0.020
arl+0.019
 Deal+0.019
Neg logit contributions
old-0.015
 that-0.015
 more-0.014
 mistakes-0.012
 days-0.012
Token shock
Token ID6380
Feature activation+1.94e-02

Pos logit contributions
ifying+0.078
 upon+0.064
suit+0.064
 Deal+0.058
 haven+0.055
Neg logit contributions
is-0.055
 days-0.055
 that-0.055
 more-0.054
old-0.053
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
ifying+0.041
 upon+0.033
suit+0.032
 Deal+0.031
 low+0.030
Neg logit contributions
old-0.028
 artist-0.025
er-0.025
off-0.024
 days-0.023
Token He
Token ID679
Feature activation+1.91e-02

Pos logit contributions
ifying+0.033
suit+0.025
 upon+0.025
 Deal+0.022
 vel+0.022
Neg logit contributions
old-0.026
er-0.023
 artist-0.022
 example-0.022
 days-0.021
Token cries
Token ID24691
Feature activation+3.80e-02

Pos logit contributions
ifying+0.040
suit+0.032
 upon+0.031
arl+0.031
 Deal+0.028
Neg logit contributions
old-0.028
 artist-0.024
er-0.024
off-0.023
 mistakes-0.023
Token out
Token ID503
Feature activation+9.29e-02

Pos logit contributions
ifying+0.081
 upon+0.064
suit+0.063
 Deal+0.059
 low+0.056
Neg logit contributions
old-0.056
er-0.050
 artist-0.049
off-0.047
 example-0.046
Token loud
Token ID7812
Feature activation+2.36e-02

Pos logit contributions
ifying+0.193
suit+0.155
 upon+0.142
 Deal+0.141
arl+0.128
Neg logit contributions
old-0.146
er-0.136
 days-0.130
 adventure-0.129
 that-0.129
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
ifying+0.055
 upon+0.045
suit+0.043
 Deal+0.042
 low+0.040
Neg logit contributions
old-0.030
er-0.028
 artist-0.026
off-0.026
 example-0.025
Token He
Token ID679
Feature activation+1.68e-02

Pos logit contributions
ifying+0.049
suit+0.038
 upon+0.036
 vel+0.035
istered+0.034
Neg logit contributions
old-0.028
er-0.025
 that-0.023
 example-0.023
 days-0.023
Token lets
Token ID8781
Feature activation+1.21e-02

Pos logit contributions
ifying+0.037
suit+0.029
 upon+0.029
arl+0.028
 Deal+0.026
Neg logit contributions
old-0.024
er-0.020
 artist-0.019
 hospital-0.019
off-0.019
Token go
Token ID467
Feature activation-1.69e-02

Pos logit contributions
ifying+0.030
suit+0.023
istered+0.022
 upon+0.022
arl+0.021
Neg logit contributions
old-0.015
er-0.014
 that-0.013
 hospital-0.013
 more-0.013
Token is
Token ID318
Feature activation-1.35e-02

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.010
 low+0.010
 tim+0.009
Neg logit contributions
old-0.017
 artist-0.016
er-0.016
off-0.016
 example-0.016
Token this
Token ID428
Feature activation+1.99e-02

Pos logit contributions
old+0.019
 endings+0.014
 members+0.014
off+0.014
 occasions+0.014
Neg logit contributions
ifying-0.032
 upon-0.027
 low-0.026
suit-0.024
 drop-0.024
Token?"
Token ID1701
Feature activation+3.10e-02

Pos logit contributions
ifying+0.044
 upon+0.035
suit+0.035
 Deal+0.033
 low+0.033
Neg logit contributions
old-0.028
 artist-0.025
er-0.024
off-0.023
 example-0.022
Token she
Token ID673
Feature activation+3.16e-02

Pos logit contributions
ifying+0.060
 upon+0.050
suit+0.048
 low+0.048
 Deal+0.042
Neg logit contributions
old-0.050
 artist-0.046
off-0.044
 endings-0.042
 example-0.042
Token said
Token ID531
Feature activation+4.40e-02

Pos logit contributions
ifying+0.070
 upon+0.062
 low+0.058
suit+0.058
 scr+0.053
Neg logit contributions
old-0.041
 artist-0.035
off-0.035
 endings-0.033
 example-0.032
Token out
Token ID503
Feature activation+9.27e-02

Pos logit contributions
ifying+0.099
 upon+0.079
suit+0.078
 Deal+0.074
arl+0.072
Neg logit contributions
old-0.062
 artist-0.054
er-0.052
off-0.051
 example-0.049
Token loud
Token ID7812
Feature activation+2.50e-02

Pos logit contributions
ifying+0.181
suit+0.148
 upon+0.137
 Deal+0.136
arl+0.125
Neg logit contributions
old-0.151
 that-0.139
 artist-0.137
er-0.136
 example-0.131
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
ifying+0.054
 upon+0.043
suit+0.042
 Deal+0.040
 low+0.039
Neg logit contributions
old-0.037
 artist-0.033
er-0.033
off-0.032
 example-0.030
Token She
Token ID1375
Feature activation+1.73e-02

Pos logit contributions
ifying+0.036
 upon+0.025
suit+0.025
 Deal+0.023
 tim+0.022
Neg logit contributions
old-0.041
 artist-0.037
er-0.037
off-0.036
 example-0.036
Token tried
Token ID3088
Feature activation-2.83e-03

Pos logit contributions
ifying+0.034
suit+0.026
 upon+0.026
arl+0.025
 Deal+0.024
Neg logit contributions
old-0.029
 artist-0.026
er-0.025
off-0.025
 members-0.024
Token to
Token ID284
Feature activation+1.25e-02

Pos logit contributions
old+0.004
 artist+0.004
off+0.004
er+0.004
 endings+0.004
Neg logit contributions
ifying-0.006
 upon-0.005
 low-0.004
suit-0.004
 Deal-0.004
Token her
Token ID607
Feature activation-2.74e-03

Pos logit contributions
old+0.063
 artist+0.062
off+0.062
 tournament+0.061
 endings+0.060
Neg logit contributions
ifying-0.042
 upon-0.037
 low-0.032
 drop-0.026
suit-0.026
Token mom
Token ID1995
Feature activation-9.38e-03

Pos logit contributions
old+0.004
 artist+0.004
 adventure+0.004
 members+0.003
er+0.003
Neg logit contributions
ifying-0.006
suit-0.005
arl-0.004
 upon-0.004
 tim-0.004
Token,
Token ID11
Feature activation+1.07e-02

Pos logit contributions
old+0.014
er+0.012
 artist+0.012
off+0.011
 members+0.011
Neg logit contributions
ifying-0.020
suit-0.016
 upon-0.015
arl-0.015
 Deal-0.015
Token she
Token ID673
Feature activation-3.13e-03

Pos logit contributions
ifying+0.026
suit+0.021
arl+0.020
 upon+0.020
 Deal+0.020
Neg logit contributions
old-0.014
 artist-0.012
er-0.011
 members-0.011
 that-0.011
Token shouted
Token ID17293
Feature activation+3.19e-02

Pos logit contributions
old+0.005
er+0.004
 members+0.004
 artist+0.004
 that+0.004
Neg logit contributions
ifying-0.007
arl-0.006
 Deal-0.005
suit-0.005
 upon-0.005
Token out
Token ID503
Feature activation+9.24e-02

Pos logit contributions
ifying+0.077
suit+0.060
 upon+0.058
 vel+0.056
arl+0.056
Neg logit contributions
old-0.039
er-0.035
 that-0.033
 example-0.032
 artist-0.032
Token,
Token ID11
Feature activation+4.69e-02

Pos logit contributions
ifying+0.222
suit+0.189
 upon+0.169
 Deal+0.166
 haven+0.160
Neg logit contributions
old-0.113
 example-0.100
 that-0.100
er-0.099
 adventure-0.096
Token "
Token ID366
Feature activation-3.27e-02

Pos logit contributions
ifying+0.105
suit+0.081
istered+0.072
 vel+0.071
 Deal+0.070
Neg logit contributions
 that-0.068
er-0.060
old-0.057
 example-0.055
 one-0.052
TokenMom
Token ID29252
Feature activation-8.79e-03

Pos logit contributions
old+0.056
 artist+0.051
off+0.050
er+0.048
 endings+0.047
Neg logit contributions
ifying-0.061
suit-0.049
 upon-0.047
 low-0.045
arl-0.042
Token,
Token ID11
Feature activation+1.69e-02

Pos logit contributions
old+0.011
 artist+0.010
er+0.009
off+0.008
 that+0.008
Neg logit contributions
ifying-0.021
suit-0.018
 low-0.016
Me-0.016
 vel-0.016
Token look
Token ID804
Feature activation+2.84e-02

Pos logit contributions
ifying+0.036
suit+0.030
 low+0.028
 upon+0.028
istered+0.027
Neg logit contributions
old-0.024
 that-0.022
 artist-0.022
 adventure-0.021
off-0.020
Token's
Token ID338
Feature activation-4.83e-03

Pos logit contributions
ifying+0.016
 upon+0.012
suit+0.011
 low+0.011
 Deal+0.010
Neg logit contributions
old-0.019
 artist-0.017
off-0.017
er-0.017
 example-0.016
Token happening
Token ID5836
Feature activation-9.98e-03

Pos logit contributions
old+0.006
 artist+0.005
off+0.005
er+0.005
 endings+0.005
Neg logit contributions
ifying-0.011
 upon-0.009
suit-0.009
 low-0.009
 Deal-0.008
Token?"
Token ID1701
Feature activation+2.29e-02

Pos logit contributions
old+0.013
 artist+0.011
off+0.011
er+0.011
 adventure+0.010
Neg logit contributions
ifying-0.023
suit-0.018
 upon-0.018
 Deal-0.018
istered-0.017
Token Mia
Token ID32189
Feature activation+3.61e-03

Pos logit contributions
ifying+0.042
 upon+0.035
 low+0.034
suit+0.031
 Deal+0.030
Neg logit contributions
old-0.038
 artist-0.035
off-0.035
 tournament-0.033
 endings-0.033
Token called
Token ID1444
Feature activation+4.69e-02

Pos logit contributions
ifying+0.008
 upon+0.007
 low+0.007
suit+0.007
 scr+0.006
Neg logit contributions
old-0.004
 artist-0.004
off-0.004
 endings-0.004
 tournament-0.004
Token out
Token ID503
Feature activation+9.22e-02

Pos logit contributions
ifying+0.104
suit+0.083
 upon+0.081
 Deal+0.076
arl+0.075
Neg logit contributions
old-0.066
er-0.058
 artist-0.058
 example-0.054
off-0.053
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
ifying+0.219
suit+0.184
 upon+0.170
 Deal+0.170
arl+0.163
Neg logit contributions
old-0.117
er-0.103
 that-0.102
 example-0.094
 artist-0.093
Token\n
Token ID198
Feature activation+1.62e-02

Pos logit contributions
ifying+0.042
suit+0.033
 upon+0.030
istered+0.029
arl+0.028
Neg logit contributions
old-0.034
er-0.029
 artist-0.028
 that-0.028
 example-0.028
Token\n
Token ID198
Feature activation+8.55e-03

Pos logit contributions
ifying+0.029
 upon+0.022
suit+0.022
 low+0.020
 Deal+0.019
Neg logit contributions
old-0.030
 artist-0.027
off-0.026
er-0.026
 example-0.025
TokenThe
Token ID464
Feature activation+7.47e-03

Pos logit contributions
ifying+0.015
 upon+0.012
suit+0.012
 low+0.011
 Deal+0.010
Neg logit contributions
old-0.016
 artist-0.014
off-0.014
er-0.014
 example-0.013
Token old
Token ID1468
Feature activation-8.00e-03

Pos logit contributions
ifying+0.015
 upon+0.012
 low+0.011
suit+0.011
 Deal+0.011
Neg logit contributions
old-0.012
 artist-0.010
off-0.010
 endings-0.010
 tournament-0.010
Token mom
Token ID1995
Feature activation-1.44e-02

Pos logit contributions
ifying+0.062
suit+0.048
 upon+0.047
 low+0.045
 Deal+0.044
Neg logit contributions
old-0.055
 artist-0.053
off-0.049
 adventure-0.048
er-0.048
Tokenmy
Token ID1820
Feature activation-2.12e-03

Pos logit contributions
old+0.024
 artist+0.022
er+0.021
off+0.021
 members+0.019
Neg logit contributions
ifying-0.029
suit-0.022
 upon-0.022
 low-0.021
 Deal-0.020
Token!"
Token ID2474
Feature activation+2.29e-02

Pos logit contributions
old+0.003
 artist+0.003
er+0.003
off+0.002
 adventure+0.002
Neg logit contributions
ifying-0.005
 upon-0.004
suit-0.004
 Deal-0.004
 low-0.004
Token she
Token ID673
Feature activation+3.49e-02

Pos logit contributions
ifying+0.045
 upon+0.037
suit+0.035
 low+0.034
 Deal+0.032
Neg logit contributions
old-0.037
 artist-0.034
off-0.033
 example-0.031
er-0.031
Token called
Token ID1444
Feature activation+4.41e-02

Pos logit contributions
ifying+0.076
 upon+0.065
suit+0.063
 low+0.061
 Deal+0.057
Neg logit contributions
old-0.048
 artist-0.042
off-0.041
er-0.039
 example-0.038
Token out
Token ID503
Feature activation+9.18e-02

Pos logit contributions
ifying+0.103
suit+0.080
 Deal+0.076
 upon+0.076
arl+0.074
Neg logit contributions
old-0.060
 artist-0.053
er-0.053
 adventure-0.051
 that-0.050
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
ifying+0.215
suit+0.177
 upon+0.166
 Deal+0.164
arl+0.150
Neg logit contributions
old-0.124
 artist-0.106
er-0.105
 that-0.105
 example-0.100
Token "
Token ID366
Feature activation+9.48e-03

Pos logit contributions
ifying+0.085
suit+0.062
istered+0.060
 vel+0.060
 tim+0.058
Neg logit contributions
old-0.045
 that-0.044
er-0.042
 Everyone-0.040
 artist-0.036
TokenI
Token ID40
Feature activation+1.60e-02

Pos logit contributions
ifying+0.016
suit+0.013
 upon+0.013
 Deal+0.011
arl+0.011
Neg logit contributions
old-0.017
 artist-0.016
off-0.016
 members-0.015
 endings-0.014
Token found
Token ID1043
Feature activation-2.15e-02

Pos logit contributions
ifying+0.027
 upon+0.019
suit+0.019
 Deal+0.019
 low+0.018
Neg logit contributions
old-0.031
 artist-0.029
off-0.027
er-0.026
 members-0.026
Token something
Token ID1223
Feature activation-1.32e-02

Pos logit contributions
old+0.040
 tournament+0.037
 artist+0.037
off+0.035
unn+0.034
Neg logit contributions
ifying-0.037
 upon-0.028
 low-0.024
 tim-0.023
suit-0.023
Token"
Token ID1
Feature activation-1.14e-02

Pos logit contributions
ifying+0.001
suit+0.001
 upon+0.001
 low+0.000
 Deal+0.000
Neg logit contributions
old-0.001
 artist-0.000
off-0.000
er-0.000
 example-0.000
TokenHello
Token ID15496
Feature activation+9.62e-04

Pos logit contributions
old+0.019
 artist+0.018
off+0.017
er+0.017
 members+0.017
Neg logit contributions
ifying-0.021
suit-0.017
 upon-0.017
arl-0.015
 Deal-0.015
Token?"
Token ID1701
Feature activation+2.33e-02

Pos logit contributions
ifying+0.002
suit+0.002
 upon+0.002
 Deal+0.002
 low+0.002
Neg logit contributions
old-0.001
 artist-0.001
er-0.001
off-0.001
 adventure-0.001
Token she
Token ID673
Feature activation+3.49e-02

Pos logit contributions
ifying+0.044
 upon+0.036
 low+0.036
suit+0.033
Me+0.030
Neg logit contributions
old-0.038
 artist-0.035
off-0.035
 tournament-0.033
 example-0.033
Token called
Token ID1444
Feature activation+4.21e-02

Pos logit contributions
ifying+0.074
 upon+0.064
 low+0.063
suit+0.062
 scr+0.056
Neg logit contributions
old-0.048
off-0.042
 artist-0.042
 endings-0.039
 tournament-0.039
Token out
Token ID503
Feature activation+9.18e-02

Pos logit contributions
ifying+0.088
suit+0.067
 upon+0.065
 Deal+0.062
arl+0.061
Neg logit contributions
old-0.066
 artist-0.058
er-0.057
 adventure-0.054
 example-0.054
Token nervously
Token ID43324
Feature activation+3.25e-02

Pos logit contributions
ifying+0.215
suit+0.175
 upon+0.167
 Deal+0.164
arl+0.156
Neg logit contributions
old-0.124
er-0.105
 artist-0.102
 that-0.098
 example-0.097
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
ifying+0.061
 upon+0.050
suit+0.047
 low+0.046
 Deal+0.042
Neg logit contributions
old-0.055
 artist-0.050
off-0.049
 endings-0.047
 tournament-0.047
Token\n
Token ID198
Feature activation+5.54e-03

Pos logit contributions
ifying+0.035
suit+0.025
 upon+0.024
istered+0.023
 Deal+0.022
Neg logit contributions
old-0.032
 that-0.027
er-0.027
 artist-0.026
 example-0.026
Token\n
Token ID198
Feature activation-3.99e-03

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.008
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.008
 example-0.008
Token"
Token ID1
Feature activation-2.48e-03

Pos logit contributions
old+0.007
 artist+0.006
off+0.006
er+0.006
 example+0.006
Neg logit contributions
ifying-0.007
 upon-0.006
suit-0.006
 low-0.005
 Deal-0.005
Token large
Token ID1588
Feature activation-2.63e-03

Pos logit contributions
ifying+0.008
 Deal+0.007
 upon+0.006
suit+0.006
arl+0.006
Neg logit contributions
 artist-0.005
old-0.005
 tournament-0.005
 island-0.005
 adventure-0.005
Token man
Token ID582
Feature activation-2.02e-02

Pos logit contributions
old+0.004
 artist+0.004
off+0.003
er+0.003
 example+0.003
Neg logit contributions
ifying-0.006
 upon-0.005
suit-0.004
 low-0.004
 Deal-0.004
Token.
Token ID13
Feature activation+9.36e-03

Pos logit contributions
old+0.036
 artist+0.033
off+0.032
 endings+0.032
 example+0.032
Neg logit contributions
ifying-0.036
 upon-0.029
suit-0.026
 low-0.025
 tim-0.023
Token He
Token ID679
Feature activation+1.81e-02

Pos logit contributions
ifying+0.020
 upon+0.016
suit+0.015
arl+0.014
istered+0.014
Neg logit contributions
old-0.015
er-0.012
off-0.012
 that-0.012
 artist-0.011
Token yelled
Token ID22760
Feature activation+2.95e-02

Pos logit contributions
ifying+0.039
suit+0.030
 Deal+0.030
arl+0.029
 upon+0.029
Neg logit contributions
old-0.028
 artist-0.024
er-0.023
off-0.023
 members-0.022
Token out
Token ID503
Feature activation+9.16e-02

Pos logit contributions
ifying+0.067
suit+0.051
 upon+0.050
 Deal+0.049
arl+0.048
Neg logit contributions
old-0.042
 artist-0.036
er-0.035
off-0.034
 example-0.034
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
ifying+0.234
suit+0.200
 Deal+0.184
 upon+0.181
 haven+0.172
Neg logit contributions
old-0.100
 that-0.086
 example-0.086
er-0.082
 adventure-0.081
Token her
Token ID607
Feature activation+6.47e-03

Pos logit contributions
ifying+0.042
suit+0.035
istered+0.032
 upon+0.031
arl+0.030
Neg logit contributions
old-0.024
 artist-0.019
off-0.019
 visitors-0.019
 adventure-0.019
Token,
Token ID11
Feature activation+4.16e-02

Pos logit contributions
ifying+0.016
suit+0.013
 upon+0.013
 Deal+0.012
arl+0.012
Neg logit contributions
old-0.008
 artist-0.007
er-0.006
 adventure-0.006
 example-0.006
Token
Token ID6184
Feature activation+1.17e-02

Pos logit contributions
ifying+0.090
suit+0.069
istered+0.063
 Deal+0.062
arl+0.060
Neg logit contributions
 that-0.064
old-0.059
er-0.053
 example-0.050
 one-0.049
Token
Token ID95
Feature activation+6.70e-04

Pos logit contributions
ifying+0.022
 upon+0.018
suit+0.017
 low+0.016
 Deal+0.016
Neg logit contributions
old-0.020
 artist-0.018
off-0.017
er-0.017
 example-0.016
Token's
Token ID338
Feature activation-4.76e-03

Pos logit contributions
old+0.034
 artist+0.032
off+0.031
er+0.031
 example+0.030
Neg logit contributions
ifying-0.034
 upon-0.027
suit-0.025
 low-0.023
 tim-0.022
Token there
Token ID612
Feature activation+8.24e-03

Pos logit contributions
old+0.008
 artist+0.007
off+0.007
er+0.007
 example+0.007
Neg logit contributions
ifying-0.009
 upon-0.007
suit-0.007
 low-0.007
 Deal-0.006
Token?"
Token ID1701
Feature activation+2.04e-02

Pos logit contributions
ifying+0.020
suit+0.017
 Deal+0.015
arl+0.015
istered+0.015
Neg logit contributions
old-0.010
 artist-0.009
off-0.008
 adventure-0.008
er-0.008
Token She
Token ID1375
Feature activation+2.95e-02

Pos logit contributions
ifying+0.034
 upon+0.031
 low+0.030
 mis+0.025
suit+0.025
Neg logit contributions
old-0.034
off-0.033
unn-0.033
 artist-0.032
 island-0.032
Token cried
Token ID16896
Feature activation+4.84e-02

Pos logit contributions
ifying+0.071
 upon+0.059
suit+0.057
 low+0.054
 Deal+0.054
Neg logit contributions
old-0.036
 artist-0.031
off-0.029
er-0.029
 example-0.027
Token out
Token ID503
Feature activation+9.16e-02

Pos logit contributions
ifying+0.095
 upon+0.077
suit+0.073
 low+0.070
 Deal+0.067
Neg logit contributions
old-0.079
 artist-0.071
off-0.069
er-0.067
 example-0.065
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
ifying+0.204
suit+0.170
 upon+0.156
 Deal+0.152
arl+0.146
Neg logit contributions
old-0.130
er-0.115
 artist-0.113
 example-0.110
 adventure-0.108
Token
Token ID220
Feature activation+5.55e-02

Pos logit contributions
ifying+0.036
suit+0.026
 upon+0.026
 tim+0.023
arl+0.023
Neg logit contributions
old-0.036
 artist-0.032
er-0.031
off-0.031
 example-0.031
Token\n
Token ID198
Feature activation+1.24e-02

Pos logit contributions
ifying+0.110
suit+0.085
 upon+0.081
 low+0.076
 Deal+0.073
Neg logit contributions
old-0.096
off-0.081
 artist-0.081
er-0.079
 members-0.076
Token\n
Token ID198
Feature activation+1.97e-03

Pos logit contributions
ifying+0.019
 upon+0.015
 low+0.013
suit+0.013
 Deal+0.013
Neg logit contributions
old-0.025
 artist-0.024
er-0.022
off-0.022
 endings-0.022
Token"
Token ID1
Feature activation-6.96e-03

Pos logit contributions
ifying+0.003
 upon+0.003
 low+0.002
 Deal+0.002
suit+0.002
Neg logit contributions
old-0.004
 artist-0.004
unn-0.003
er-0.003
off-0.003
Token they
Token ID484
Feature activation-2.71e-02

Pos logit contributions
ifying+0.003
 upon+0.002
suit+0.002
 low+0.002
 Deal+0.002
Neg logit contributions
old-0.002
 artist-0.002
off-0.002
er-0.002
 example-0.002
Token went
Token ID1816
Feature activation-2.85e-02

Pos logit contributions
old+0.046
 artist+0.041
off+0.040
er+0.039
 example+0.038
Neg logit contributions
ifying-0.053
 upon-0.042
suit-0.040
 low-0.038
 Deal-0.037
Token to
Token ID284
Feature activation-1.85e-02

Pos logit contributions
old+0.048
 artist+0.045
 endings+0.043
off+0.041
er+0.041
Neg logit contributions
ifying-0.054
 upon-0.045
suit-0.043
 low-0.042
 tim-0.037
Token the
Token ID262
Feature activation-3.08e-02

Pos logit contributions
old+0.031
 artist+0.028
off+0.027
er+0.026
 example+0.026
Neg logit contributions
ifying-0.036
 upon-0.028
suit-0.027
 low-0.025
 Deal-0.025
Token sports
Token ID5701
Feature activation-5.20e-02

Pos logit contributions
old+0.047
 artist+0.040
er+0.039
off+0.039
unn+0.038
Neg logit contributions
ifying-0.066
 upon-0.054
suit-0.051
 low-0.051
 Deal-0.046
Token team
Token ID1074
Feature activation-6.84e-02

Pos logit contributions
old+0.080
unn+0.070
 endings+0.069
 artist+0.068
off+0.066
Neg logit contributions
ifying-0.097
 upon-0.089
suit-0.087
 low-0.079
-0.071
Tokenâ
Token ID22940
Feature activation+5.38e-03

Pos logit contributions
old+0.112
unn+0.112
 artist+0.109
 endings+0.108
oot+0.101
Neg logit contributions
ifying-0.112
 upon-0.104
suit-0.095
 low-0.085
 tim-0.074
Token
Token ID26391
Feature activation-4.76e-02

Pos logit contributions
ifying+0.009
suit+0.007
 upon+0.007
 Deal+0.007
 low+0.006
Neg logit contributions
old-0.010
 artist-0.009
 endings-0.008
er-0.008
off-0.008
Token
Token ID8151
Feature activation-1.69e-02

Pos logit contributions
old+0.080
 artist+0.072
off+0.069
er+0.068
 example+0.066
Neg logit contributions
ifying-0.092
 upon-0.073
suit-0.071
 low-0.066
 Deal-0.064
Tokens
Token ID82
Feature activation-2.27e-02

Pos logit contributions
old+0.029
 artist+0.026
off+0.025
er+0.024
 example+0.023
Neg logit contributions
ifying-0.032
 upon-0.025
suit-0.025
 low-0.023
 Deal-0.023
Token try
Token ID1949
Feature activation-1.33e-02

Pos logit contributions
old+0.030
 example+0.026
er+0.026
 artist+0.026
 adventure+0.026
Neg logit contributions
ifying-0.053
suit-0.041
 upon-0.041
 Deal-0.040
arl-0.039
Token ant
Token ID1885
Feature activation-5.68e-02

Pos logit contributions
old+0.029
off+0.024
 artist+0.024
er+0.024
oot+0.023
Neg logit contributions
ifying-0.032
 upon-0.027
 low-0.026
suit-0.024
Me-0.023
Token hill
Token ID12788
Feature activation-2.07e-02

Pos logit contributions
old+0.087
 artist+0.086
 endings+0.085
unn+0.083
oot+0.082
Neg logit contributions
 upon-0.090
ifying-0.090
 low-0.080
suit-0.070
 scr-0.069
Token.
Token ID13
Feature activation-9.17e-03

Pos logit contributions
old+0.040
 artist+0.039
 endings+0.038
off+0.037
unn+0.036
Neg logit contributions
ifying-0.029
 upon-0.028
 low-0.026
suit-0.024
 tim-0.019
Token At
Token ID1629
Feature activation-3.53e-03

Pos logit contributions
old+0.018
 artist+0.017
off+0.017
er+0.016
 members+0.016
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.010
 low-0.010
 Deal-0.009
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
old+0.007
 artist+0.006
off+0.006
er+0.006
 endings+0.006
Neg logit contributions
ifying-0.006
 upon-0.005
 low-0.005
suit-0.005
 Deal-0.004
Token ant
Token ID1885
Feature activation-6.80e-02

Pos logit contributions
old+0.021
 artist+0.018
off+0.018
er+0.018
 endings+0.017
Neg logit contributions
ifying-0.021
 upon-0.018
 low-0.016
suit-0.016
 Deal-0.014
Token hill
Token ID12788
Feature activation-2.82e-02

Pos logit contributions
old+0.097
 artist+0.093
 endings+0.088
 example+0.087
off+0.084
Neg logit contributions
ifying-0.129
 upon-0.121
 low-0.113
suit-0.106
 tim-0.098
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
old+0.048
 artist+0.046
off+0.044
 endings+0.042
 example+0.041
Neg logit contributions
ifying-0.050
 upon-0.045
 low-0.041
suit-0.040
 tim-0.034
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
old+0.020
 artist+0.018
er+0.017
off+0.017
 days+0.016
Neg logit contributions
ifying-0.034
suit-0.027
 upon-0.026
 Deal-0.026
arl-0.025
Token ants
Token ID27842
Feature activation-2.71e-02

Pos logit contributions
ifying+0.025
 upon+0.020
suit+0.019
 low+0.019
 Deal+0.018
Neg logit contributions
old-0.021
 artist-0.018
off-0.018
er-0.017
 endings-0.017
Token welcomed
Token ID15846
Feature activation-5.43e-03

Pos logit contributions
old+0.052
 artist+0.048
off+0.046
 example+0.045
er+0.045
Neg logit contributions
ifying-0.045
 upon-0.036
suit-0.033
 low-0.032
 tim-0.029
Token Dad
Token ID17415
Feature activation-2.50e-02

Pos logit contributions
ifying+0.062
suit+0.051
 upon+0.047
istered+0.047
arl+0.046
Neg logit contributions
old-0.034
 artist-0.029
 adventure-0.028
off-0.028
 that-0.027
Token took
Token ID1718
Feature activation-2.14e-02

Pos logit contributions
old+0.039
 artist+0.035
off+0.033
er+0.033
 members+0.032
Neg logit contributions
ifying-0.052
 upon-0.041
suit-0.041
 low-0.038
 Deal-0.038
Token him
Token ID683
Feature activation-2.11e-02

Pos logit contributions
old+0.036
 artist+0.034
er+0.032
off+0.031
 tournament+0.031
Neg logit contributions
ifying-0.040
 upon-0.032
 low-0.030
suit-0.030
 tim-0.028
Token to
Token ID284
Feature activation-1.39e-02

Pos logit contributions
old+0.033
 artist+0.032
er+0.030
 endings+0.029
 tournament+0.029
Neg logit contributions
ifying-0.041
 upon-0.034
suit-0.031
 low-0.030
 tim-0.029
Token the
Token ID262
Feature activation-3.42e-02

Pos logit contributions
old+0.023
 artist+0.021
off+0.020
er+0.019
 example+0.019
Neg logit contributions
ifying-0.027
 upon-0.021
suit-0.021
 low-0.019
 Deal-0.019
Token judge
Token ID5052
Feature activation-6.79e-02

Pos logit contributions
old+0.043
 artist+0.035
off+0.035
er+0.034
 endings+0.032
Neg logit contributions
ifying-0.081
 upon-0.067
suit-0.064
 low-0.063
 Deal-0.059
Token.
Token ID13
Feature activation-5.43e-03

Pos logit contributions
 artist+0.092
old+0.091
unn+0.091
 endings+0.089
oot+0.087
Neg logit contributions
ifying-0.133
 upon-0.119
suit-0.108
 low-0.100
 tim-0.091
Token The
Token ID383
Feature activation-5.52e-03

Pos logit contributions
old+0.009
 artist+0.008
off+0.008
er+0.008
 example+0.008
Neg logit contributions
ifying-0.011
 upon-0.008
suit-0.008
 low-0.008
 Deal-0.007
Token judge
Token ID5052
Feature activation-4.63e-02

Pos logit contributions
old+0.007
 artist+0.006
off+0.006
er+0.005
 endings+0.005
Neg logit contributions
ifying-0.013
 upon-0.011
suit-0.011
 low-0.010
 Deal-0.010
Token asked
Token ID1965
Feature activation+2.32e-03

Pos logit contributions
old+0.087
off+0.081
 artist+0.080
 endings+0.078
 example+0.077
Neg logit contributions
ifying-0.074
 upon-0.060
suit-0.055
 low-0.052
 tim-0.046
Token George
Token ID4502
Feature activation-6.41e-03

Pos logit contributions
ifying+0.005
suit+0.004
 upon+0.004
arl+0.004
 Deal+0.004
Neg logit contributions
old-0.003
 artist-0.003
er-0.003
off-0.003
 example-0.003
Token on
Token ID319
Feature activation+2.07e-02

Pos logit contributions
ifying+0.013
suit+0.010
 Deal+0.010
arl+0.010
 upon+0.010
Neg logit contributions
old-0.005
 that-0.005
 days-0.004
 artist-0.004
 adventure-0.004
Token his
Token ID465
Feature activation-3.19e-03

Pos logit contributions
ifying+0.042
 upon+0.033
suit+0.033
 Deal+0.030
arl+0.030
Neg logit contributions
old-0.033
 artist-0.029
er-0.029
 days-0.028
off-0.028
Token way
Token ID835
Feature activation-1.17e-02

Pos logit contributions
old+0.007
er+0.006
 artist+0.006
off+0.006
minded+0.006
Neg logit contributions
ifying-0.005
 upon-0.004
 low-0.004
 Deal-0.003
suit-0.003
Token from
Token ID422
Feature activation-7.57e-03

Pos logit contributions
old+0.018
 artist+0.016
off+0.015
er+0.015
 example+0.015
Neg logit contributions
ifying-0.025
 upon-0.020
suit-0.019
 low-0.018
 Deal-0.018
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
old+0.014
 artist+0.013
off+0.013
er+0.012
 endings+0.012
Neg logit contributions
ifying-0.013
 upon-0.010
suit-0.010
 low-0.010
 tim-0.009
Token ant
Token ID1885
Feature activation-6.70e-02

Pos logit contributions
old+0.028
 artist+0.025
off+0.024
er+0.024
 example+0.023
Neg logit contributions
ifying-0.037
 upon-0.030
suit-0.029
 low-0.027
 Deal-0.027
Token hill
Token ID12788
Feature activation-2.27e-02

Pos logit contributions
 artist+0.102
old+0.101
 example+0.099
is+0.097
oot+0.097
Neg logit contributions
ifying-0.115
 upon-0.113
 low-0.096
suit-0.089
 mis-0.082
Token,
Token ID11
Feature activation-1.53e-02

Pos logit contributions
old+0.039
 artist+0.037
off+0.035
 endings+0.034
 example+0.034
Neg logit contributions
ifying-0.041
 upon-0.036
suit-0.032
 low-0.032
 tim-0.027
Token he
Token ID339
Feature activation-1.84e-02

Pos logit contributions
old+0.018
 that+0.015
 artist+0.015
er+0.014
 adventure+0.014
Neg logit contributions
ifying-0.038
 Deal-0.030
arl-0.030
suit-0.030
 upon-0.029
Token noticed
Token ID6810
Feature activation-2.52e-02

Pos logit contributions
old+0.031
 artist+0.027
off+0.026
er+0.026
 members+0.026
Neg logit contributions
ifying-0.037
suit-0.027
 Deal-0.027
arl-0.027
 upon-0.026
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
old+0.041
 artist+0.039
off+0.038
unn+0.036
 tournament+0.036
Neg logit contributions
ifying-0.048
 upon-0.040
 low-0.036
suit-0.034
 scr-0.032
Token they
Token ID484
Feature activation-3.09e-02

Pos logit contributions
old+0.027
 artist+0.025
off+0.024
 endings+0.023
minded+0.022
Neg logit contributions
ifying-0.032
 upon-0.026
suit-0.024
 low-0.023
 tim-0.021
Token went
Token ID1816
Feature activation-2.96e-02

Pos logit contributions
old+0.050
 artist+0.044
off+0.043
 members+0.042
er+0.041
Neg logit contributions
ifying-0.062
 upon-0.048
 Deal-0.047
suit-0.047
arl-0.045
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
old+0.055
 artist+0.052
 endings+0.049
off+0.048
er+0.047
Neg logit contributions
ifying-0.052
 upon-0.042
suit-0.040
 low-0.038
 tim-0.034
Token the
Token ID262
Feature activation-3.24e-02

Pos logit contributions
old+0.038
 artist+0.035
off+0.033
er+0.033
 example+0.032
Neg logit contributions
ifying-0.041
 upon-0.032
suit-0.031
 low-0.029
 Deal-0.028
Token little
Token ID1310
Feature activation-2.85e-02

Pos logit contributions
old+0.045
 artist+0.038
off+0.037
er+0.036
unn+0.036
Neg logit contributions
ifying-0.074
 upon-0.061
suit-0.058
 low-0.057
Me-0.052
Token neighbor
Token ID4780
Feature activation-6.70e-02

Pos logit contributions
old+0.057
oot+0.049
off+0.048
unn+0.047
rums+0.045
Neg logit contributions
ifying-0.048
 upon-0.042
suit-0.041
 low-0.037
 tim-0.031
Token's
Token ID338
Feature activation-3.56e-02

Pos logit contributions
old+0.081
 artist+0.073
 endings+0.071
off+0.069
 example+0.069
Neg logit contributions
ifying-0.154
 upon-0.133
suit-0.126
 low-0.118
 tim-0.112
Token birthday
Token ID10955
Feature activation-4.17e-02

Pos logit contributions
old+0.056
 artist+0.050
unn+0.048
er+0.048
off+0.047
Neg logit contributions
ifying-0.070
 upon-0.058
suit-0.055
 low-0.053
 Deal-0.048
Token party
Token ID2151
Feature activation-2.72e-02

Pos logit contributions
old+0.053
 artist+0.045
off+0.043
er+0.041
 endings+0.040
Neg logit contributions
ifying-0.097
 upon-0.081
suit-0.080
 low-0.075
 Deal-0.071
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
old+0.043
 artist+0.038
off+0.037
er+0.035
 example+0.035
Neg logit contributions
ifying-0.056
 upon-0.045
suit-0.044
 low-0.041
 Deal-0.039
Token Toby
Token ID37578
Feature activation-3.44e-02

Pos logit contributions
old+0.027
 artist+0.023
off+0.023
 tournament+0.022
 example+0.021
Neg logit contributions
ifying-0.040
suit-0.033
 upon-0.033
 low-0.032
 tim-0.031
Token go
Token ID467
Feature activation-4.10e-02

Pos logit contributions
old+0.010
 artist+0.009
off+0.009
er+0.009
 adventure+0.009
Neg logit contributions
ifying-0.011
 upon-0.009
suit-0.009
 low-0.008
 Deal-0.008
Token home
Token ID1363
Feature activation-7.01e-03

Pos logit contributions
old+0.066
 artist+0.064
 endings+0.062
 tournament+0.056
off+0.056
Neg logit contributions
ifying-0.077
 upon-0.070
suit-0.064
 low-0.061
arl-0.060
Token.
Token ID13
Feature activation-7.31e-03

Pos logit contributions
old+0.012
 artist+0.010
off+0.010
er+0.010
 endings+0.010
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.011
 low-0.010
 Deal-0.010
Token Do
Token ID2141
Feature activation-1.02e-02

Pos logit contributions
old+0.012
 artist+0.010
off+0.010
er+0.010
 example+0.010
Neg logit contributions
ifying-0.015
 upon-0.012
suit-0.011
 low-0.011
 Deal-0.010
Token you
Token ID345
Feature activation-1.21e-02

Pos logit contributions
old+0.010
 artist+0.009
off+0.008
er+0.008
 example+0.007
Neg logit contributions
ifying-0.027
 upon-0.023
suit-0.022
 low-0.021
 Deal-0.021
Token want
Token ID765
Feature activation-6.60e-02

Pos logit contributions
old+0.026
 artist+0.023
off+0.023
er+0.022
 example+0.022
Neg logit contributions
ifying-0.019
 upon-0.013
suit-0.013
 low-0.012
 Deal-0.011
Token some
Token ID617
Feature activation-4.25e-02

Pos logit contributions
old+0.146
 artist+0.145
 site+0.131
 tournament+0.129
 endings+0.128
Neg logit contributions
ifying-0.079
 upon-0.064
 Sn-0.049
 low-0.048
 t-0.047
Token yogurt
Token ID32132
Feature activation-2.23e-03

Pos logit contributions
old+0.068
 artist+0.061
off+0.058
 endings+0.056
er+0.055
Neg logit contributions
ifying-0.084
 upon-0.069
suit-0.065
 low-0.062
 Deal-0.061
Token?"
Token ID1701
Feature activation+8.67e-03

Pos logit contributions
old+0.003
 artist+0.002
er+0.002
off+0.002
 that+0.002
Neg logit contributions
ifying-0.005
 upon-0.004
suit-0.004
 Deal-0.004
 low-0.004
Token Tom
Token ID4186
Feature activation-5.49e-03

Pos logit contributions
ifying+0.015
 upon+0.013
suit+0.012
 low+0.011
 Deal+0.011
Neg logit contributions
old-0.015
 artist-0.014
off-0.014
er-0.013
 example-0.013
Token and
Token ID290
Feature activation+4.28e-02

Pos logit contributions
old+0.009
 artist+0.009
off+0.008
 endings+0.008
er+0.008
Neg logit contributions
ifying-0.010
 upon-0.008
suit-0.008
 low-0.007
 Deal-0.007
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
old+0.033
 artist+0.031
 tournament+0.028
off+0.028
 example+0.027
Neg logit contributions
ifying-0.080
 upon-0.068
suit-0.064
 low-0.063
 scr-0.059
Token fun
Token ID1257
Feature activation-2.31e-02

Pos logit contributions
old+0.015
 artist+0.013
off+0.012
er+0.012
 tournament+0.012
Neg logit contributions
ifying-0.023
 upon-0.019
suit-0.018
 low-0.018
 Deal-0.017
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
old+0.039
 artist+0.036
off+0.034
 tournament+0.034
er+0.032
Neg logit contributions
ifying-0.042
 upon-0.036
suit-0.032
 low-0.030
 tim-0.028
Token Eventually
Token ID16178
Feature activation-5.21e-03

Pos logit contributions
old+0.055
 artist+0.050
off+0.048
er+0.047
 example+0.045
Neg logit contributions
ifying-0.060
 upon-0.048
suit-0.046
 low-0.043
 Deal-0.042
Token the
Token ID262
Feature activation+2.15e-03

Pos logit contributions
old+0.008
 artist+0.008
off+0.007
er+0.007
 endings+0.007
Neg logit contributions
ifying-0.010
 upon-0.009
suit-0.008
 low-0.008
 Deal-0.007
Token children
Token ID1751
Feature activation-6.59e-02

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.004
 low+0.004
 Deal+0.004
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
er-0.002
 endings-0.002
Token realised
Token ID19169
Feature activation-1.76e-02

Pos logit contributions
old+0.123
 artist+0.114
off+0.110
er+0.108
 example+0.107
Neg logit contributions
ifying-0.112
 upon-0.088
suit-0.082
 low-0.078
 Deal-0.071
Token that
Token ID326
Feature activation-2.58e-02

Pos logit contributions
old+0.020
 artist+0.019
off+0.018
unn+0.016
 endings+0.016
Neg logit contributions
ifying-0.042
 upon-0.037
suit-0.033
 low-0.033
 scr-0.031
Token it
Token ID340
Feature activation+1.23e-02

Pos logit contributions
old+0.042
off+0.037
 artist+0.037
unn+0.035
er+0.035
Neg logit contributions
ifying-0.050
 upon-0.043
 low-0.037
suit-0.037
 scr-0.034
Token didn
Token ID1422
Feature activation+1.53e-02

Pos logit contributions
ifying+0.020
 upon+0.016
suit+0.015
 low+0.014
 Deal+0.013
Neg logit contributions
old-0.024
 artist-0.022
off-0.021
er-0.021
 example-0.020
Tokenâ
Token ID22940
Feature activation+9.38e-03

Pos logit contributions
ifying+0.025
 upon+0.019
suit+0.017
 low+0.017
 Deal+0.016
Neg logit contributions
old-0.031
 artist-0.028
off-0.027
er-0.026
 endings-0.026
Token of
Token ID286
Feature activation+6.73e-03

Pos logit contributions
old+0.014
 artist+0.013
off+0.012
 example+0.012
er+0.012
Neg logit contributions
ifying-0.021
 upon-0.018
suit-0.016
 low-0.016
 Deal-0.015
Token them
Token ID606
Feature activation-1.85e-02

Pos logit contributions
ifying+0.011
 upon+0.009
 low+0.007
arl+0.007
 Deal+0.007
Neg logit contributions
 artist-0.012
old-0.012
 endings-0.011
minded-0.011
 tournament-0.011
Token went
Token ID1816
Feature activation-2.22e-02

Pos logit contributions
old+0.032
 artist+0.029
off+0.028
er+0.028
 example+0.027
Neg logit contributions
ifying-0.035
 upon-0.027
suit-0.026
 low-0.025
 Deal-0.024
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
old+0.038
 artist+0.035
off+0.033
 endings+0.033
er+0.032
Neg logit contributions
ifying-0.042
 upon-0.034
suit-0.033
 low-0.032
arl-0.028
Token the
Token ID262
Feature activation-2.39e-02

Pos logit contributions
old+0.034
 artist+0.030
off+0.029
 example+0.028
 adventure+0.028
Neg logit contributions
ifying-0.043
suit-0.033
 upon-0.033
 low-0.030
arl-0.030
Token closest
Token ID11706
Feature activation-6.58e-02

Pos logit contributions
old+0.031
 artist+0.027
off+0.026
er+0.025
 example+0.024
Neg logit contributions
ifying-0.056
 upon-0.046
suit-0.045
 low-0.043
 Deal-0.041
Token or
Token ID393
Feature activation-8.12e-03

Pos logit contributions
unn+0.103
old+0.103
 endings+0.100
rums+0.099
 artist+0.099
Neg logit contributions
ifying-0.104
 upon-0.098
 low-0.090
suit-0.089
 basket-0.081
Tokenchard
Token ID30215
Feature activation-3.88e-02

Pos logit contributions
old+0.018
 artist+0.017
 members+0.016
off+0.016
 example+0.016
Neg logit contributions
ifying-0.011
suit-0.007
arl-0.007
 upon-0.007
 Deal-0.007
Token.
Token ID13
Feature activation-3.00e-02

Pos logit contributions
old+0.066
 artist+0.062
off+0.060
 tournament+0.057
 endings+0.056
Neg logit contributions
ifying-0.070
 upon-0.060
suit-0.054
 low-0.053
 scr-0.047
Token\n
Token ID198
Feature activation-5.18e-03

Pos logit contributions
old+0.053
 artist+0.047
off+0.046
er+0.045
 example+0.044
Neg logit contributions
ifying-0.056
 upon-0.044
suit-0.043
 low-0.040
 Deal-0.038
Token\n
Token ID198
Feature activation-1.33e-02

Pos logit contributions
old+0.010
 artist+0.009
off+0.009
er+0.009
 example+0.009
Neg logit contributions
ifying-0.009
 upon-0.007
suit-0.006
 low-0.006
 Deal-0.006
Token she
Token ID673
Feature activation-2.97e-02

Pos logit contributions
old+0.029
 artist+0.026
er+0.025
off+0.025
 example+0.024
Neg logit contributions
ifying-0.041
 upon-0.033
suit-0.033
 low-0.030
 Deal-0.030
Token went
Token ID1816
Feature activation-3.58e-02

Pos logit contributions
old+0.052
 artist+0.048
off+0.046
er+0.045
 example+0.044
Neg logit contributions
ifying-0.055
 upon-0.043
suit-0.042
 low-0.039
 Deal-0.038
Token back
Token ID736
Feature activation-4.24e-02

Pos logit contributions
old+0.065
 artist+0.061
 endings+0.060
off+0.057
er+0.055
Neg logit contributions
ifying-0.062
 upon-0.052
suit-0.048
 low-0.047
arl-0.042
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
old+0.061
 artist+0.055
off+0.052
er+0.051
 endings+0.051
Neg logit contributions
ifying-0.090
 upon-0.077
suit-0.072
 low-0.069
 tim-0.065
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
old+0.037
 artist+0.033
off+0.032
er+0.031
 example+0.030
Neg logit contributions
ifying-0.046
 upon-0.037
suit-0.036
 low-0.033
 Deal-0.032
Token sport
Token ID6332
Feature activation-6.55e-02

Pos logit contributions
old+0.033
 artist+0.029
off+0.028
er+0.027
 endings+0.026
Neg logit contributions
ifying-0.050
 upon-0.041
suit-0.040
 low-0.038
 Deal-0.036
Token every
Token ID790
Feature activation-2.55e-02

Pos logit contributions
old+0.114
 artist+0.102
unn+0.101
off+0.100
 endings+0.098
Neg logit contributions
ifying-0.108
 upon-0.098
suit-0.092
 low-0.085
 tim-0.072
Token day
Token ID1110
Feature activation-2.63e-02

Pos logit contributions
old+0.031
 artist+0.027
off+0.025
er+0.025
 endings+0.025
Neg logit contributions
ifying-0.061
 upon-0.052
suit-0.050
 low-0.049
 Deal-0.045
Token,
Token ID11
Feature activation-2.56e-02

Pos logit contributions
old+0.045
 artist+0.040
off+0.039
 endings+0.038
er+0.037
Neg logit contributions
ifying-0.049
 upon-0.040
suit-0.038
 low-0.037
 tim-0.033
Token and
Token ID290
Feature activation-3.00e-02

Pos logit contributions
old+0.039
 artist+0.035
er+0.033
off+0.033
 members+0.032
Neg logit contributions
ifying-0.055
suit-0.043
 upon-0.042
 Deal-0.040
arl-0.039
Token she
Token ID673
Feature activation-2.99e-02

Pos logit contributions
old+0.049
 artist+0.044
off+0.043
er+0.041
 example+0.041
Neg logit contributions
ifying-0.058
 upon-0.048
suit-0.044
 low-0.043
 tim-0.039
Token,
Token ID11
Feature activation+6.12e-03

Pos logit contributions
old+0.007
off+0.006
 artist+0.006
minded+0.005
 endings+0.005
Neg logit contributions
ifying-0.007
 upon-0.006
suit-0.005
 low-0.005
 tim-0.005
Token Tim
Token ID5045
Feature activation-3.91e-02

Pos logit contributions
ifying+0.015
suit+0.013
 upon+0.012
 Deal+0.012
arl+0.012
Neg logit contributions
old-0.007
 artist-0.006
er-0.006
 days-0.006
 adventure-0.006
Token went
Token ID1816
Feature activation-2.22e-02

Pos logit contributions
old+0.071
 artist+0.064
off+0.062
er+0.061
 example+0.061
Neg logit contributions
ifying-0.069
 upon-0.055
suit-0.052
 low-0.049
 tim-0.046
Token to
Token ID284
Feature activation-1.59e-02

Pos logit contributions
old+0.034
 artist+0.030
off+0.029
er+0.028
 endings+0.028
Neg logit contributions
ifying-0.046
 upon-0.038
suit-0.037
 low-0.035
 Deal-0.033
Token the
Token ID262
Feature activation-3.23e-02

Pos logit contributions
old+0.029
 artist+0.026
off+0.025
er+0.025
 endings+0.024
Neg logit contributions
ifying-0.028
 upon-0.023
suit-0.021
 low-0.021
 Deal-0.019
Token contest
Token ID8414
Feature activation-6.46e-02

Pos logit contributions
old+0.043
unn+0.035
er+0.035
off+0.034
 artist+0.034
Neg logit contributions
ifying-0.073
 upon-0.062
 low-0.059
suit-0.058
Me-0.054
Token with
Token ID351
Feature activation-1.88e-02

Pos logit contributions
unn+0.125
old+0.120
 endings+0.118
 mistakes+0.115
rums+0.113
Neg logit contributions
ifying-0.088
 upon-0.081
suit-0.072
 low-0.063
 tim-0.055
Token his
Token ID465
Feature activation-1.70e-02

Pos logit contributions
old+0.039
 artist+0.036
off+0.035
er+0.034
 endings+0.034
Neg logit contributions
ifying-0.029
 upon-0.022
 low-0.020
suit-0.020
 Deal-0.018
Token mom
Token ID1995
Feature activation-2.08e-02

Pos logit contributions
old+0.025
 artist+0.023
 example+0.021
off+0.021
er+0.021
Neg logit contributions
ifying-0.037
 upon-0.029
suit-0.029
 low-0.027
arl-0.026
Token.
Token ID13
Feature activation-4.35e-03

Pos logit contributions
old+0.032
 artist+0.028
off+0.027
er+0.026
 example+0.025
Neg logit contributions
ifying-0.044
 upon-0.035
suit-0.034
 low-0.033
 Deal-0.032
Token There
Token ID1318
Feature activation+1.13e-02

Pos logit contributions
old+0.007
 artist+0.006
er+0.006
off+0.006
 example+0.006
Neg logit contributions
ifying-0.009
 upon-0.007
suit-0.007
 Deal-0.006
istered-0.006
Token more
Token ID517
Feature activation-1.78e-02

Pos logit contributions
ifying+0.017
 upon+0.013
suit+0.012
 low+0.012
 tim+0.011
Neg logit contributions
old-0.015
 artist-0.013
off-0.013
 tournament-0.012
er-0.012
Token waste
Token ID7030
Feature activation-1.76e-02

Pos logit contributions
 artist+0.028
old+0.027
 tournament+0.027
 endings+0.027
minded+0.026
Neg logit contributions
ifying-0.027
 upon-0.026
 low-0.023
suit-0.023
 tim-0.021
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
old+0.028
 artist+0.025
off+0.023
 endings+0.023
 example+0.022
Neg logit contributions
ifying-0.036
 upon-0.029
suit-0.028
 low-0.027
arl-0.025
Token Do
Token ID2141
Feature activation-2.23e-02

Pos logit contributions
old+0.033
 artist+0.030
off+0.030
er+0.029
 members+0.028
Neg logit contributions
ifying-0.033
 upon-0.025
suit-0.024
 Deal-0.023
 low-0.023
Token you
Token ID345
Feature activation-6.27e-03

Pos logit contributions
old+0.033
 artist+0.030
er+0.027
off+0.027
 example+0.027
Neg logit contributions
ifying-0.045
 upon-0.041
 Deal-0.036
 low-0.035
Me-0.034
Token want
Token ID765
Feature activation-6.45e-02

Pos logit contributions
old+0.013
 artist+0.012
off+0.011
er+0.011
 example+0.011
Neg logit contributions
ifying-0.010
 upon-0.008
suit-0.007
 low-0.007
 Deal-0.006
Token to
Token ID284
Feature activation-1.27e-02

Pos logit contributions
old+0.152
 artist+0.141
 endings+0.140
unn+0.139
 impression+0.134
Neg logit contributions
ifying-0.064
 upon-0.052
 low-0.038
 t-0.037
 loose-0.031
Token help
Token ID1037
Feature activation-2.44e-02

Pos logit contributions
old+0.025
 artist+0.023
off+0.022
er+0.022
 endings+0.021
Neg logit contributions
ifying-0.021
 upon-0.016
suit-0.015
 low-0.015
 Deal-0.014
Token me
Token ID502
Feature activation-2.65e-02

Pos logit contributions
old+0.046
 endings+0.040
 artist+0.040
off+0.039
 tournament+0.038
Neg logit contributions
ifying-0.038
 upon-0.032
 low-0.030
suit-0.029
 Deal-0.027
Token?"
Token ID1701
Feature activation+1.59e-02

Pos logit contributions
old+0.043
 artist+0.039
off+0.038
er+0.037
 endings+0.036
Neg logit contributions
ifying-0.052
 upon-0.042
suit-0.040
 low-0.038
 Deal-0.036
Token\n
Token ID198
Feature activation+2.42e-02

Pos logit contributions
ifying+0.026
 upon+0.021
suit+0.019
 low+0.018
 Deal+0.017
Neg logit contributions
old-0.031
 artist-0.029
off-0.028
er-0.027
 endings-0.027
Token mom
Token ID1995
Feature activation-2.69e-02

Pos logit contributions
ifying+0.007
 upon+0.005
suit+0.005
 low+0.005
 Deal+0.005
Neg logit contributions
old-0.005
 artist-0.005
er-0.005
 days-0.005
 members-0.005
Tokenmy
Token ID1820
Feature activation-3.00e-02

Pos logit contributions
old+0.046
 artist+0.042
off+0.040
er+0.040
 members+0.038
Neg logit contributions
ifying-0.052
 upon-0.040
suit-0.039
 low-0.037
 Deal-0.036
Token went
Token ID1816
Feature activation-2.36e-02

Pos logit contributions
old+0.052
 artist+0.047
off+0.046
er+0.045
 example+0.044
Neg logit contributions
ifying-0.056
 upon-0.044
suit-0.042
 low-0.040
 Deal-0.038
Token to
Token ID284
Feature activation-1.96e-02

Pos logit contributions
old+0.037
 artist+0.033
off+0.032
er+0.031
 endings+0.030
Neg logit contributions
ifying-0.049
 upon-0.039
suit-0.038
 low-0.036
 Deal-0.034
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
old+0.034
 artist+0.031
off+0.030
er+0.030
 endings+0.029
Neg logit contributions
ifying-0.036
 upon-0.029
suit-0.027
 low-0.026
 Deal-0.025
Token neighbor
Token ID4780
Feature activation-6.45e-02

Pos logit contributions
old+0.031
 artist+0.026
unn+0.025
er+0.025
off+0.025
Neg logit contributions
ifying-0.047
 upon-0.038
 low-0.037
suit-0.036
 vel-0.034
Token's
Token ID338
Feature activation-2.87e-02

Pos logit contributions
old+0.077
 artist+0.070
 endings+0.069
 example+0.067
off+0.065
Neg logit contributions
ifying-0.149
 upon-0.128
suit-0.119
 low-0.113
 tim-0.109
Token house
Token ID2156
Feature activation-2.15e-02

Pos logit contributions
old+0.044
 artist+0.039
unn+0.038
er+0.037
off+0.037
Neg logit contributions
ifying-0.056
 upon-0.047
suit-0.044
 low-0.043
 Deal-0.041
Token and
Token ID290
Feature activation-2.48e-02

Pos logit contributions
old+0.032
 artist+0.028
off+0.027
er+0.027
 example+0.026
Neg logit contributions
ifying-0.046
 upon-0.037
suit-0.036
 low-0.034
 Deal-0.033
Token she
Token ID673
Feature activation-1.83e-02

Pos logit contributions
old+0.040
 artist+0.036
off+0.034
er+0.034
 example+0.033
Neg logit contributions
ifying-0.050
 upon-0.040
suit-0.039
 low-0.036
 Deal-0.035
Token used
Token ID973
Feature activation-2.15e-02

Pos logit contributions
old+0.030
 artist+0.025
er+0.024
 members+0.024
off+0.024
Neg logit contributions
ifying-0.039
suit-0.029
arl-0.029
 Deal-0.029
 upon-0.029
Token decided
Token ID3066
Feature activation-2.13e-02

Pos logit contributions
old+0.071
 artist+0.069
 example+0.067
 endings+0.066
 occasions+0.065
Neg logit contributions
ifying-0.055
 upon-0.050
suit-0.044
 low-0.039
 tim-0.036
Token to
Token ID284
Feature activation-3.35e-03

Pos logit contributions
old+0.044
 artist+0.042
unn+0.042
off+0.040
 occasions+0.039
Neg logit contributions
 upon-0.027
ifying-0.025
 low-0.023
suit-0.019
 drop-0.018
Token play
Token ID711
Feature activation+7.84e-03

Pos logit contributions
old+0.007
 artist+0.007
is+0.007
er+0.006
unn+0.006
Neg logit contributions
ifying-0.005
 upon-0.004
 low-0.003
suit-0.003
 drop-0.003
Token in
Token ID287
Feature activation-4.14e-03

Pos logit contributions
ifying+0.013
 upon+0.011
suit+0.010
 low+0.010
 tim+0.009
Neg logit contributions
old-0.015
 artist-0.013
off-0.013
unn-0.012
 endings-0.012
Token his
Token ID465
Feature activation-6.76e-03

Pos logit contributions
old+0.008
 artist+0.007
off+0.007
er+0.007
 endings+0.007
Neg logit contributions
ifying-0.007
 upon-0.006
 low-0.005
suit-0.005
 tim-0.004
Token neighbour
Token ID12250
Feature activation-6.45e-02

Pos logit contributions
old+0.012
 artist+0.011
er+0.011
off+0.010
minded+0.010
Neg logit contributions
ifying-0.012
 upon-0.010
suit-0.009
 low-0.009
 tim-0.008
Token's
Token ID338
Feature activation-2.02e-02

Pos logit contributions
old+0.082
 artist+0.075
 example+0.072
off+0.072
unn+0.072
Neg logit contributions
ifying-0.138
 upon-0.119
suit-0.114
 low-0.109
 tim-0.100
Token yard
Token ID12699
Feature activation-2.10e-02

Pos logit contributions
old+0.036
 artist+0.032
unn+0.030
er+0.030
 endings+0.030
Neg logit contributions
ifying-0.036
 upon-0.030
suit-0.029
 low-0.028
 Deal-0.024
Token.
Token ID13
Feature activation-7.85e-03

Pos logit contributions
old+0.037
 artist+0.034
off+0.033
unn+0.032
 endings+0.031
Neg logit contributions
ifying-0.038
 upon-0.031
suit-0.029
 low-0.028
 tim-0.025
Token He
Token ID679
Feature activation-5.97e-03

Pos logit contributions
old+0.015
er+0.013
off+0.013
 artist+0.013
 example+0.012
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.010
 Deal-0.010
 low-0.009
Token scam
Token ID19126
Feature activation-4.19e-03

Pos logit contributions
old+0.011
 artist+0.010
off+0.009
er+0.009
 example+0.009
Neg logit contributions
ifying-0.011
 upon-0.008
suit-0.008
 Deal-0.008
 low-0.008
Token\n
Token ID198
Feature activation-9.34e-03

Pos logit contributions
old+0.050
 artist+0.047
off+0.045
er+0.044
 example+0.044
Neg logit contributions
ifying-0.042
 upon-0.033
suit-0.030
 low-0.030
 Deal-0.028
Token\n
Token ID198
Feature activation-1.24e-02

Pos logit contributions
old+0.019
 artist+0.019
er+0.018
off+0.018
 endings+0.018
Neg logit contributions
ifying-0.013
 upon-0.011
 low-0.009
suit-0.009
 tim-0.008
TokenOne
Token ID3198
Feature activation-6.41e-03

Pos logit contributions
old+0.020
 artist+0.018
er+0.017
off+0.017
unn+0.017
Neg logit contributions
ifying-0.024
 upon-0.020
 low-0.019
suit-0.018
Me-0.018
Token day
Token ID1110
Feature activation+1.54e-02

Pos logit contributions
old+0.008
 artist+0.006
off+0.006
er+0.006
 endings+0.006
Neg logit contributions
ifying-0.015
 upon-0.013
suit-0.013
 low-0.012
 tim-0.012
Token the
Token ID262
Feature activation-3.98e-03

Pos logit contributions
ifying+0.035
 upon+0.028
suit+0.028
 Deal+0.026
 low+0.026
Neg logit contributions
old-0.021
 artist-0.018
er-0.018
off-0.017
 days-0.017
Token mayor
Token ID9591
Feature activation-6.44e-02

Pos logit contributions
old+0.006
 artist+0.005
off+0.005
er+0.005
 endings+0.005
Neg logit contributions
ifying-0.008
 upon-0.007
suit-0.007
 low-0.007
 tim-0.006
Token of
Token ID286
Feature activation+1.19e-02

Pos logit contributions
old+0.116
 endings+0.114
 example+0.114
 occasions+0.109
off+0.108
Neg logit contributions
ifying-0.098
 upon-0.088
suit-0.072
 low-0.068
 haven-0.061
Token the
Token ID262
Feature activation-1.32e-02

Pos logit contributions
ifying+0.024
suit+0.019
 upon+0.019
 Deal+0.018
 low+0.017
Neg logit contributions
old-0.019
 artist-0.017
er-0.016
off-0.016
 example-0.016
Token town
Token ID3240
Feature activation-4.60e-02

Pos logit contributions
old+0.022
er+0.018
off+0.018
 endings+0.018
oot+0.018
Neg logit contributions
ifying-0.024
 upon-0.021
 low-0.020
suit-0.019
 mis-0.018
Token had
Token ID550
Feature activation-3.40e-02

Pos logit contributions
old+0.090
 artist+0.083
off+0.083
 example+0.082
 endings+0.081
Neg logit contributions
ifying-0.068
 upon-0.058
suit-0.050
 low-0.048
 tim-0.041
Token an
Token ID281
Feature activation-1.17e-02

Pos logit contributions
old+0.060
 artist+0.055
off+0.052
er+0.052
unn+0.050
Neg logit contributions
ifying-0.063
 upon-0.052
 low-0.046
suit-0.046
 scr-0.041
Token so
Token ID523
Feature activation+6.23e-03

Pos logit contributions
old+0.004
 artist+0.004
er+0.004
off+0.004
 adventure+0.004
Neg logit contributions
ifying-0.007
suit-0.005
 upon-0.005
 Deal-0.005
arl-0.005
Token she
Token ID673
Feature activation-1.09e-02

Pos logit contributions
ifying+0.015
suit+0.012
 upon+0.012
 Deal+0.011
arl+0.011
Neg logit contributions
old-0.008
 artist-0.007
er-0.006
off-0.006
 days-0.006
Token went
Token ID1816
Feature activation-2.56e-02

Pos logit contributions
old+0.019
 artist+0.017
off+0.017
er+0.017
 members+0.016
Neg logit contributions
ifying-0.020
 upon-0.016
suit-0.015
 Deal-0.014
 low-0.014
Token to
Token ID284
Feature activation-1.61e-02

Pos logit contributions
old+0.042
 artist+0.038
 endings+0.036
off+0.036
er+0.035
Neg logit contributions
ifying-0.050
 upon-0.042
suit-0.039
 low-0.038
arl-0.035
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
old+0.027
 artist+0.025
off+0.024
er+0.023
 endings+0.023
Neg logit contributions
ifying-0.031
 upon-0.025
suit-0.023
 low-0.023
 Deal-0.022
Token neighbor
Token ID4780
Feature activation-6.43e-02

Pos logit contributions
old+0.037
unn+0.030
off+0.030
 artist+0.030
 endings+0.030
Neg logit contributions
ifying-0.055
 upon-0.044
suit-0.043
 low-0.042
 vel-0.038
Token's
Token ID338
Feature activation-2.81e-02

Pos logit contributions
old+0.080
 artist+0.072
 example+0.071
 endings+0.071
off+0.069
Neg logit contributions
ifying-0.145
 upon-0.124
suit-0.114
 low-0.110
 tim-0.103
Token house
Token ID2156
Feature activation-1.96e-02

Pos logit contributions
old+0.043
 artist+0.038
er+0.037
off+0.036
unn+0.036
Neg logit contributions
ifying-0.057
 upon-0.047
suit-0.044
 low-0.043
 Deal-0.040
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
old+0.028
 artist+0.025
er+0.024
off+0.024
 example+0.023
Neg logit contributions
ifying-0.043
 upon-0.034
suit-0.033
 Deal-0.032
 low-0.031
Token listen
Token ID6004
Feature activation-1.32e-03

Pos logit contributions
old+0.021
 artist+0.019
off+0.018
 adventure+0.017
er+0.017
Neg logit contributions
ifying-0.028
suit-0.023
 upon-0.022
 low-0.020
 Deal-0.020
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
old+0.002
 artist+0.002
off+0.002
er+0.002
 example+0.002
Neg logit contributions
ifying-0.003
 upon-0.002
suit-0.002
 low-0.002
 Deal-0.002
Token permit
Token ID8749
Feature activation-3.83e-02

Pos logit contributions
old+0.011
 artist+0.010
off+0.010
er+0.010
 example+0.009
Neg logit contributions
ifying-0.011
 upon-0.008
suit-0.008
 low-0.008
 Deal-0.007
Token,
Token ID11
Feature activation-1.20e-02

Pos logit contributions
old+0.054
 artist+0.049
er+0.044
off+0.043
 tournament+0.043
Neg logit contributions
ifying-0.080
 upon-0.067
 low-0.062
suit-0.061
 Deal-0.060
Token but
Token ID475
Feature activation-1.56e-02

Pos logit contributions
old+0.023
 artist+0.020
off+0.020
er+0.019
 tournament+0.019
Neg logit contributions
ifying-0.021
 upon-0.017
suit-0.015
 low-0.015
 Deal-0.014
Token you
Token ID345
Feature activation+2.31e-03

Pos logit contributions
old+0.028
 artist+0.025
off+0.024
er+0.024
 endings+0.023
Neg logit contributions
ifying-0.028
 upon-0.024
 low-0.020
suit-0.020
 Deal-0.020
Token also
Token ID635
Feature activation+5.71e-03

Pos logit contributions
ifying+0.004
 upon+0.003
suit+0.002
 low+0.002
 Deal+0.002
Neg logit contributions
old-0.005
 artist-0.004
off-0.004
er-0.004
 example-0.004
Token need
Token ID761
Feature activation-6.43e-02

Pos logit contributions
ifying+0.011
 upon+0.008
suit+0.008
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.009
 endings-0.008
Token to
Token ID284
Feature activation-1.03e-02

Pos logit contributions
old+0.122
 artist+0.117
unn+0.107
oot+0.105
off+0.105
Neg logit contributions
ifying-0.100
 upon-0.082
 low-0.071
arl-0.064
suit-0.063
Token respect
Token ID2461
Feature activation-1.08e-02

Pos logit contributions
old+0.020
 artist+0.018
off+0.017
er+0.017
 endings+0.017
Neg logit contributions
ifying-0.016
 upon-0.013
suit-0.012
 low-0.012
 Deal-0.012
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
old+0.015
 artist+0.013
off+0.013
er+0.012
 example+0.012
Neg logit contributions
ifying-0.024
 upon-0.020
suit-0.019
 low-0.018
 Deal-0.018
Token care
Token ID1337
Feature activation+2.84e-02

Pos logit contributions
ifying+0.042
 upon+0.034
suit+0.032
 low+0.031
 Deal+0.030
Neg logit contributions
old-0.034
 artist-0.030
off-0.029
er-0.029
 example-0.027
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
ifying+0.063
 upon+0.051
suit+0.050
 low+0.047
 Deal+0.046
Neg logit contributions
old-0.041
 artist-0.035
off-0.035
er-0.034
 example-0.032
Token of
Token ID286
Feature activation-6.37e-03

Pos logit contributions
old+0.048
 tournament+0.048
 artist+0.047
 site+0.045
ican+0.045
Neg logit contributions
ifying-0.062
 upon-0.062
arl-0.048
 low-0.047
 Deal-0.046
Token you
Token ID345
Feature activation+2.47e-02

Pos logit contributions
old+0.006
 artist+0.005
off+0.005
er+0.005
 example+0.004
Neg logit contributions
ifying-0.017
 upon-0.014
suit-0.014
 low-0.013
 Deal-0.013
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
ifying+0.049
 upon+0.040
suit+0.038
 low+0.036
 Deal+0.035
Neg logit contributions
old-0.040
 artist-0.036
off-0.035
er-0.033
 endings-0.033
Token But
Token ID887
Feature activation+9.80e-04

Pos logit contributions
old+0.031
 artist+0.028
off+0.027
er+0.026
 tournament+0.026
Neg logit contributions
ifying-0.034
 upon-0.027
suit-0.026
 Deal-0.024
 low-0.024
Token we
Token ID356
Feature activation+1.27e-02

Pos logit contributions
ifying+0.002
 upon+0.002
suit+0.002
 low+0.002
 Deal+0.002
Neg logit contributions
old-0.001
 artist-0.001
off-0.001
er-0.001
 tournament-0.001
Token need
Token ID761
Feature activation-6.41e-02

Pos logit contributions
ifying+0.021
 upon+0.016
suit+0.015
 low+0.014
 Deal+0.014
Neg logit contributions
old-0.025
 artist-0.023
off-0.022
er-0.022
 example-0.021
Token to
Token ID284
Feature activation-8.05e-03

Pos logit contributions
 artist+0.118
old+0.117
 tournament+0.108
off+0.105
er+0.103
Neg logit contributions
ifying-0.105
 upon-0.086
 low-0.073
 Deal-0.070
 vel-0.069
Token go
Token ID467
Feature activation-4.64e-02

Pos logit contributions
old+0.016
 artist+0.014
off+0.014
er+0.013
 tournament+0.013
Neg logit contributions
ifying-0.013
 upon-0.010
 low-0.009
suit-0.009
 Deal-0.009
Token to
Token ID284
Feature activation-2.31e-02

Pos logit contributions
old+0.084
 artist+0.081
 endings+0.077
 tournament+0.073
off+0.072
Neg logit contributions
ifying-0.081
 upon-0.070
 low-0.064
suit-0.063
arl-0.057
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
old+0.046
 artist+0.044
off+0.042
er+0.042
 tournament+0.040
Neg logit contributions
ifying-0.034
 upon-0.029
 low-0.026
 tim-0.024
 Deal-0.024
Token doctor
Token ID6253
Feature activation-3.61e-02

Pos logit contributions
old+0.027
 artist+0.022
off+0.020
er+0.019
 endings+0.018
Neg logit contributions
ifying-0.080
 upon-0.069
suit-0.067
 low-0.065
 Deal-0.063
Token\n
Token ID198
Feature activation-9.61e-03

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.004
 low+0.004
 Deal+0.004
Neg logit contributions
old-0.006
 artist-0.005
off-0.005
er-0.005
 example-0.005
TokenShe
Token ID3347
Feature activation+5.34e-03

Pos logit contributions
old+0.017
 artist+0.016
er+0.015
off+0.015
 endings+0.015
Neg logit contributions
ifying-0.017
 upon-0.013
suit-0.012
 low-0.012
 Deal-0.012
Token went
Token ID1816
Feature activation-2.37e-02

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.008
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.009
 artist-0.008
off-0.008
er-0.008
 example-0.008
Token to
Token ID284
Feature activation-1.34e-02

Pos logit contributions
old+0.038
 artist+0.034
off+0.033
er+0.032
 endings+0.032
Neg logit contributions
ifying-0.047
 upon-0.038
suit-0.037
 low-0.035
 Deal-0.033
Token the
Token ID262
Feature activation-1.96e-02

Pos logit contributions
old+0.023
 artist+0.021
off+0.020
er+0.020
 example+0.019
Neg logit contributions
ifying-0.025
 upon-0.020
suit-0.019
 low-0.018
 Deal-0.017
Token contest
Token ID8414
Feature activation-6.41e-02

Pos logit contributions
old+0.028
off+0.023
 artist+0.023
er+0.022
unn+0.022
Neg logit contributions
ifying-0.043
 upon-0.036
suit-0.034
 low-0.034
 Deal-0.031
Token and
Token ID290
Feature activation-2.29e-02

Pos logit contributions
unn+0.114
old+0.112
 endings+0.108
 artist+0.104
 mistakes+0.104
Neg logit contributions
ifying-0.096
 upon-0.086
suit-0.081
 low-0.076
 tim-0.062
Token sang
Token ID25889
Feature activation+8.68e-04

Pos logit contributions
old+0.039
 endings+0.035
off+0.035
 artist+0.035
unn+0.034
Neg logit contributions
ifying-0.041
 upon-0.035
 low-0.033
suit-0.032
 tim-0.028
Token her
Token ID607
Feature activation+2.35e-02

Pos logit contributions
ifying+0.002
 upon+0.001
suit+0.001
 Deal+0.001
arl+0.001
Neg logit contributions
old-0.001
 artist-0.001
er-0.001
off-0.001
 example-0.001
Token song
Token ID3496
Feature activation+1.33e-02

Pos logit contributions
ifying+0.048
 upon+0.036
suit+0.035
arl+0.033
 Deal+0.032
Neg logit contributions
old-0.037
 artist-0.036
 example-0.034
 adventure-0.033
off-0.032
Token.
Token ID13
Feature activation+1.70e-03

Pos logit contributions
ifying+0.028
 upon+0.023
suit+0.022
 low+0.021
 Deal+0.020
Neg logit contributions
old-0.020
 artist-0.018
off-0.017
er-0.017
 example-0.016
Token said
Token ID531
Feature activation+7.59e-03

Pos logit contributions
old+0.002
 artist+0.002
off+0.002
er+0.002
 members+0.002
Neg logit contributions
ifying-0.003
 upon-0.002
suit-0.002
 low-0.002
 Deal-0.002
Token she
Token ID673
Feature activation+3.46e-02

Pos logit contributions
ifying+0.016
 upon+0.013
suit+0.012
 low+0.012
 Deal+0.011
Neg logit contributions
old-0.011
 artist-0.010
off-0.010
 tournament-0.009
er-0.009
Token should
Token ID815
Feature activation-1.37e-02

Pos logit contributions
ifying+0.068
arl+0.051
 Deal+0.050
 upon+0.050
suit+0.048
Neg logit contributions
old-0.063
 artist-0.056
er-0.053
 members-0.052
off-0.051
Token go
Token ID467
Feature activation-4.43e-02

Pos logit contributions
old+0.026
 artist+0.024
off+0.023
er+0.022
 endings+0.022
Neg logit contributions
ifying-0.023
 upon-0.018
suit-0.016
 low-0.016
 Deal-0.015
Token and
Token ID290
Feature activation-2.37e-02

Pos logit contributions
old+0.079
 artist+0.076
 endings+0.073
off+0.069
 tournament+0.069
Neg logit contributions
ifying-0.077
 upon-0.064
suit-0.059
 low-0.057
arl-0.051
Token have
Token ID423
Feature activation-6.39e-02

Pos logit contributions
old+0.042
 artist+0.039
off+0.037
er+0.037
 example+0.036
Neg logit contributions
ifying-0.043
 upon-0.034
suit-0.032
 low-0.031
 Deal-0.029
Token fun
Token ID1257
Feature activation-2.60e-02

Pos logit contributions
old+0.117
 artist+0.116
er+0.105
unn+0.105
oot+0.104
Neg logit contributions
ifying-0.101
 upon-0.087
 low-0.081
suit-0.071
 vel-0.070
Token at
Token ID379
Feature activation-2.67e-02

Pos logit contributions
old+0.048
 artist+0.044
off+0.042
 tournament+0.041
unn+0.040
Neg logit contributions
ifying-0.045
 upon-0.037
suit-0.032
 low-0.031
 tim-0.030
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
old+0.060
 artist+0.055
 endings+0.055
er+0.054
off+0.054
Neg logit contributions
ifying-0.030
 upon-0.025
 low-0.024
 tim-0.020
 Sn-0.019
Token party
Token ID2151
Feature activation-3.51e-02

Pos logit contributions
old+0.021
 artist+0.017
off+0.017
er+0.016
 endings+0.015
Neg logit contributions
ifying-0.038
 upon-0.032
suit-0.031
 low-0.030
 Deal-0.028
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
old+0.058
 artist+0.051
off+0.049
 endings+0.048
unn+0.047
Neg logit contributions
ifying-0.067
 upon-0.056
suit-0.054
 low-0.050
 tim-0.046
Token day
Token ID1110
Feature activation-2.91e-02

Pos logit contributions
old+0.028
 artist+0.025
minded+0.023
er+0.023
oot+0.023
Neg logit contributions
ifying-0.028
 upon-0.023
suit-0.022
 low-0.022
 tim-0.019
Token of
Token ID286
Feature activation-9.31e-03

Pos logit contributions
old+0.036
 artist+0.033
off+0.032
minded+0.032
 example+0.030
Neg logit contributions
ifying-0.068
 upon-0.059
suit-0.055
 low-0.052
 tim-0.051
Token their
Token ID511
Feature activation-2.24e-02

Pos logit contributions
old+0.017
 artist+0.015
off+0.015
er+0.014
 endings+0.014
Neg logit contributions
ifying-0.017
 upon-0.014
suit-0.013
 low-0.012
 Deal-0.012
Token vacation
Token ID14600
Feature activation-4.04e-02

Pos logit contributions
old+0.036
 artist+0.032
er+0.031
oot+0.029
off+0.028
Neg logit contributions
ifying-0.045
 upon-0.038
suit-0.036
 low-0.035
 Deal-0.032
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
old+0.069
 artist+0.063
off+0.060
 endings+0.058
 example+0.057
Neg logit contributions
ifying-0.074
 upon-0.064
suit-0.061
 low-0.057
 tim-0.052
Token family
Token ID1641
Feature activation-6.39e-02

Pos logit contributions
old+0.038
 artist+0.032
er+0.031
 endings+0.029
rums+0.029
Neg logit contributions
ifying-0.044
 upon-0.038
suit-0.035
 low-0.034
arl-0.031
Token got
Token ID1392
Feature activation-3.99e-02

Pos logit contributions
old+0.116
 artist+0.110
 endings+0.108
 example+0.107
er+0.106
Neg logit contributions
ifying-0.099
 upon-0.090
suit-0.075
 low-0.070
 tim-0.064
Token into
Token ID656
Feature activation+2.22e-03

Pos logit contributions
old+0.070
 artist+0.064
off+0.062
er+0.061
 endings+0.059
Neg logit contributions
ifying-0.074
 upon-0.060
suit-0.056
 low-0.054
 Deal-0.049
Token their
Token ID511
Feature activation-1.60e-02

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.004
 Deal+0.003
 low+0.003
Neg logit contributions
old-0.004
 artist-0.003
 example-0.003
off-0.003
er-0.003
Token car
Token ID1097
Feature activation-1.34e-02

Pos logit contributions
old+0.024
 artist+0.021
off+0.020
er+0.020
 example+0.019
Neg logit contributions
ifying-0.034
 upon-0.028
suit-0.027
 low-0.025
 Deal-0.025
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
old+0.024
 artist+0.022
off+0.021
 endings+0.020
 example+0.020
Neg logit contributions
ifying-0.025
 upon-0.020
suit-0.019
 low-0.018
 tim-0.017
Tokenbs
Token ID1443
Feature activation+1.80e-02

Pos logit contributions
ifying+0.000
 upon+0.000
suit+0.000
 Deal+0.000
 low+0.000
Neg logit contributions
old-0.000
off-0.000
 artist-0.000
er-0.000
 members-0.000
Token,
Token ID11
Feature activation+2.66e-02

Pos logit contributions
ifying+0.038
 upon+0.031
suit+0.031
 Deal+0.030
 low+0.029
Neg logit contributions
old-0.026
er-0.023
 artist-0.022
off-0.021
 members-0.021
Token which
Token ID543
Feature activation+1.87e-02

Pos logit contributions
ifying+0.055
suit+0.044
 upon+0.044
 Deal+0.042
arl+0.041
Neg logit contributions
old-0.037
 artist-0.035
er-0.035
 that-0.033
 example-0.032
Token were
Token ID547
Feature activation+1.61e-02

Pos logit contributions
ifying+0.031
 upon+0.023
suit+0.022
 low+0.021
 Deal+0.020
Neg logit contributions
old-0.037
 artist-0.034
off-0.033
er-0.032
 example-0.031
Token sticky
Token ID23408
Feature activation+1.45e-02

Pos logit contributions
ifying+0.031
 Deal+0.025
suit+0.024
 upon+0.023
arl+0.023
Neg logit contributions
old-0.024
er-0.024
 adventure-0.023
 days-0.023
 that-0.023
Token and
Token ID290
Feature activation+7.13e-02

Pos logit contributions
ifying+0.028
 upon+0.022
suit+0.021
 low+0.020
 tim+0.018
Neg logit contributions
old-0.025
 artist-0.022
off-0.022
 tournament-0.021
 example-0.021
Token white
Token ID2330
Feature activation+9.77e-03

Pos logit contributions
ifying+0.129
 upon+0.105
 Deal+0.101
arl+0.095
suit+0.094
Neg logit contributions
 days-0.113
 that-0.108
 more-0.107
er-0.105
old-0.104
Token.
Token ID13
Feature activation+5.56e-03

Pos logit contributions
ifying+0.020
 upon+0.017
suit+0.016
 low+0.015
 Deal+0.015
Neg logit contributions
old-0.015
 artist-0.013
er-0.013
off-0.012
 example-0.012
Token
Token ID198
Feature activation+7.67e-03

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.008
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.008
 example-0.008
Token
Token ID198
Feature activation+2.75e-03

Pos logit contributions
ifying+0.012
 upon+0.010
suit+0.008
 low+0.008
 Deal+0.008
Neg logit contributions
old-0.015
 artist-0.015
off-0.014
er-0.014
 endings-0.013
Token"
Token ID1
Feature activation-5.12e-03

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.003
 low+0.003
 Deal+0.003
Neg logit contributions
old-0.005
 artist-0.005
off-0.005
er-0.005
 endings-0.005
Token their
Token ID511
Feature activation+1.23e-02

Pos logit contributions
ifying+0.061
 upon+0.048
suit+0.048
 Deal+0.047
arl+0.045
Neg logit contributions
old-0.029
er-0.028
 artist-0.026
 that-0.025
off-0.024
Token toys
Token ID14958
Feature activation+2.39e-02

Pos logit contributions
ifying+0.027
 upon+0.022
suit+0.021
 low+0.020
 Deal+0.020
Neg logit contributions
old-0.017
 artist-0.016
off-0.015
er-0.015
 example-0.014
Token.
Token ID13
Feature activation+6.81e-03

Pos logit contributions
ifying+0.046
 upon+0.037
suit+0.036
 low+0.034
 Deal+0.031
Neg logit contributions
old-0.040
 artist-0.036
off-0.034
 endings-0.033
 example-0.033
Token They
Token ID1119
Feature activation+1.92e-02

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.011
 low+0.010
 Deal+0.010
Neg logit contributions
old-0.011
 artist-0.010
off-0.009
er-0.009
 example-0.009
Token said
Token ID531
Feature activation+1.47e-02

Pos logit contributions
ifying+0.037
 upon+0.029
suit+0.027
 Deal+0.027
arl+0.026
Neg logit contributions
old-0.032
 artist-0.030
er-0.028
 members-0.028
off-0.028
Token,
Token ID11
Feature activation+5.36e-02

Pos logit contributions
ifying+0.027
 upon+0.022
suit+0.021
 low+0.020
 Deal+0.018
Neg logit contributions
old-0.026
off-0.023
 artist-0.023
 members-0.021
er-0.021
Token "
Token ID366
Feature activation-1.25e-02

Pos logit contributions
ifying+0.111
suit+0.084
 upon+0.080
istered+0.079
 Deal+0.074
Neg logit contributions
 that-0.078
er-0.076
 artist-0.075
old-0.074
 example-0.073
TokenSure
Token ID19457
Feature activation-7.56e-04

Pos logit contributions
old+0.026
 artist+0.023
er+0.022
off+0.022
 adventure+0.022
Neg logit contributions
ifying-0.020
 upon-0.015
suit-0.015
 low-0.014
Me-0.013
Token,
Token ID11
Feature activation-3.38e-03

Pos logit contributions
old+0.001
 artist+0.001
off+0.001
 tournament+0.001
 endings+0.001
Neg logit contributions
ifying-0.001
 upon-0.001
suit-0.001
 low-0.001
 Deal-0.001
Token you
Token ID345
Feature activation+2.49e-02

Pos logit contributions
old+0.005
 artist+0.004
off+0.004
er+0.004
 endings+0.004
Neg logit contributions
ifying-0.007
 upon-0.006
suit-0.006
 low-0.005
 Deal-0.005
Token can
Token ID460
Feature activation-1.24e-02

Pos logit contributions
ifying+0.036
 upon+0.025
suit+0.025
 low+0.023
 tim+0.022
Neg logit contributions
old-0.053
 artist-0.051
off-0.049
 tournament-0.047
er-0.047
Token her
Token ID607
Feature activation+1.89e-02

Pos logit contributions
ifying+0.021
 upon+0.016
suit+0.016
 Deal+0.015
 low+0.014
Neg logit contributions
old-0.016
 artist-0.014
er-0.014
off-0.014
 example-0.014
Token away
Token ID1497
Feature activation+2.65e-02

Pos logit contributions
ifying+0.041
 Deal+0.031
 upon+0.031
suit+0.031
 freshly+0.030
Neg logit contributions
old-0.025
 that-0.024
er-0.024
off-0.023
 members-0.022
Token,
Token ID11
Feature activation+1.70e-02

Pos logit contributions
ifying+0.060
 upon+0.048
suit+0.047
 Deal+0.047
arl+0.044
Neg logit contributions
old-0.034
er-0.032
 artist-0.030
 example-0.029
 adventure-0.028
Token and
Token ID290
Feature activation+3.19e-03

Pos logit contributions
ifying+0.039
suit+0.031
 upon+0.030
 Deal+0.030
arl+0.029
Neg logit contributions
old-0.022
 that-0.020
er-0.020
 artist-0.019
 more-0.018
Token said
Token ID531
Feature activation+2.54e-02

Pos logit contributions
ifying+0.006
 upon+0.005
suit+0.005
 Deal+0.005
 low+0.005
Neg logit contributions
old-0.005
 artist-0.005
er-0.004
off-0.004
 example-0.004
Token,
Token ID11
Feature activation+5.56e-02

Pos logit contributions
ifying+0.054
 upon+0.043
suit+0.042
 low+0.040
 Deal+0.039
Neg logit contributions
old-0.038
 artist-0.034
off-0.033
er-0.032
 example-0.031
Token "
Token ID366
Feature activation-2.13e-02

Pos logit contributions
ifying+0.106
suit+0.082
 upon+0.075
istered+0.070
 Deal+0.069
Neg logit contributions
 that-0.091
er-0.088
old-0.085
 example-0.083
 days-0.082
TokenGo
Token ID5247
Feature activation-2.27e-02

Pos logit contributions
old+0.040
 artist+0.036
off+0.035
er+0.034
 example+0.033
Neg logit contributions
ifying-0.038
 upon-0.029
suit-0.028
 low-0.026
 Deal-0.025
Token away
Token ID1497
Feature activation+1.38e-02

Pos logit contributions
old+0.041
off+0.037
er+0.036
 artist+0.036
 example+0.035
Neg logit contributions
ifying-0.041
 upon-0.032
suit-0.030
 Deal-0.029
 low-0.028
Token,
Token ID11
Feature activation-2.05e-04

Pos logit contributions
ifying+0.033
suit+0.026
 upon+0.026
 low+0.025
 Deal+0.025
Neg logit contributions
old-0.017
er-0.015
 artist-0.015
 example-0.014
off-0.014
Token you
Token ID345
Feature activation+1.78e-02

Pos logit contributions
old+0.000
 artist+0.000
off+0.000
er+0.000
 adventure+0.000
Neg logit contributions
ifying-0.000
suit-0.000
 upon-0.000
 low-0.000
 Deal-0.000
Token couldn
Token ID3521
Feature activation+3.25e-03

Pos logit contributions
ifying+0.011
suit+0.008
 upon+0.008
arl+0.008
 Deal+0.008
Neg logit contributions
old-0.014
 artist-0.013
er-0.012
 members-0.012
off-0.012
Token't
Token ID470
Feature activation-1.64e-02

Pos logit contributions
ifying+0.006
 upon+0.005
suit+0.005
 low+0.004
 Deal+0.004
Neg logit contributions
old-0.006
 artist-0.005
off-0.005
er-0.005
 example-0.005
Token find
Token ID1064
Feature activation-2.47e-02

Pos logit contributions
old+0.026
 artist+0.023
off+0.023
er+0.022
 example+0.022
Neg logit contributions
ifying-0.033
 upon-0.027
suit-0.026
 low-0.025
 Deal-0.023
Token it
Token ID340
Feature activation+4.80e-03

Pos logit contributions
old+0.038
 artist+0.034
off+0.033
er+0.031
 tournament+0.031
Neg logit contributions
ifying-0.051
 upon-0.041
suit-0.040
 low-0.039
 Deal-0.036
Token.
Token ID13
Feature activation-8.15e-04

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.008
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.007
 artist-0.007
off-0.006
er-0.006
 example-0.006
Token
Token ID220
Feature activation+5.46e-02

Pos logit contributions
old+0.001
er+0.001
off+0.001
 artist+0.001
 members+0.001
Neg logit contributions
ifying-0.002
suit-0.001
 upon-0.001
 tim-0.001
 Deal-0.001
Token
Token ID198
Feature activation+6.88e-03

Pos logit contributions
ifying+0.126
suit+0.099
 vel+0.092
 scr+0.088
 Deal+0.088
Neg logit contributions
old-0.082
off-0.065
er-0.062
 members-0.062
 artist-0.058
Token
Token ID198
Feature activation+2.05e-03

Pos logit contributions
ifying+0.011
 upon+0.009
suit+0.008
 low+0.008
 Deal+0.007
Neg logit contributions
old-0.013
 artist-0.012
off-0.012
er-0.012
 example-0.011
TokenThen
Token ID6423
Feature activation+1.04e-02

Pos logit contributions
ifying+0.004
 upon+0.003
suit+0.003
 low+0.003
 Deal+0.003
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
er-0.003
 example-0.003
Token,
Token ID11
Feature activation+1.07e-02

Pos logit contributions
ifying+0.022
 upon+0.018
suit+0.017
 low+0.016
 Deal+0.016
Neg logit contributions
old-0.016
 artist-0.014
off-0.014
er-0.013
 example-0.013
Token her
Token ID607
Feature activation+8.90e-03

Pos logit contributions
ifying+0.024
suit+0.020
arl+0.019
 upon+0.019
 Deal+0.019
Neg logit contributions
old-0.014
 artist-0.013
 days-0.012
er-0.012
 that-0.012
Tokenily
Token ID813
Feature activation+6.21e-03

Pos logit contributions
ifying+0.025
 upon+0.022
suit+0.021
 Deal+0.020
 low+0.020
Neg logit contributions
old-0.009
 artist-0.008
off-0.007
er-0.007
 tournament-0.007
Token nodded
Token ID14464
Feature activation+5.45e-02

Pos logit contributions
ifying+0.011
 upon+0.008
suit+0.008
 low+0.008
 Deal+0.007
Neg logit contributions
old-0.011
 artist-0.010
off-0.010
 endings-0.010
 example-0.010
Token.
Token ID13
Feature activation+2.86e-02

Pos logit contributions
ifying+0.153
suit+0.123
 Deal+0.113
 tim+0.112
 upon+0.112
Neg logit contributions
er-0.052
old-0.046
 artist-0.044
 more-0.041
 that-0.041
Token She
Token ID1375
Feature activation+3.20e-02

Pos logit contributions
ifying+0.056
suit+0.043
 upon+0.042
 tim+0.039
 low+0.037
Neg logit contributions
old-0.048
 artist-0.045
er-0.042
off-0.040
 example-0.040
Token said
Token ID531
Feature activation+2.22e-02

Pos logit contributions
ifying+0.075
 upon+0.060
suit+0.058
 low+0.056
 Deal+0.055
Neg logit contributions
old-0.043
 artist-0.038
er-0.036
off-0.035
 example-0.033
Token,
Token ID11
Feature activation+5.81e-02

Pos logit contributions
ifying+0.041
 upon+0.034
suit+0.031
 low+0.031
arl+0.029
Neg logit contributions
old-0.038
 artist-0.034
off-0.034
 tournament-0.032
 endings-0.032
Token "
Token ID366
Feature activation-8.71e-03

Pos logit contributions
ifying+0.124
suit+0.096
 upon+0.090
istered+0.084
 tim+0.082
Neg logit contributions
 that-0.087
 artist-0.081
er-0.081
old-0.076
 days-0.076
TokenI
Token ID40
Feature activation+3.10e-02

Pos logit contributions
old+0.018
 artist+0.016
er+0.016
off+0.015
 example+0.015
Neg logit contributions
ifying-0.014
 upon-0.010
suit-0.010
 low-0.009
Me-0.009
Token understand
Token ID1833
Feature activation-2.76e-03

Pos logit contributions
ifying+0.070
suit+0.054
 upon+0.052
 low+0.050
 Deal+0.049
Neg logit contributions
old-0.046
 artist-0.040
off-0.038
 example-0.036
 members-0.036
Token,
Token ID11
Feature activation+2.88e-02

Pos logit contributions
old+0.004
 artist+0.003
er+0.003
off+0.003
 example+0.003
Neg logit contributions
ifying-0.006
suit-0.005
 upon-0.005
 low-0.005
 Deal-0.004
Token mom
Token ID1995
Feature activation-1.33e-02

Pos logit contributions
ifying+0.054
suit+0.041
 upon+0.041
 low+0.037
 tim+0.036
Neg logit contributions
old-0.050
 artist-0.047
er-0.044
off-0.044
 example-0.043
Token forget
Token ID6044
Feature activation+9.62e-04

Pos logit contributions
ifying+0.032
 upon+0.024
suit+0.024
 Deal+0.021
arl+0.021
Neg logit contributions
old-0.028
 artist-0.027
off-0.025
 tournament-0.024
er-0.024
Token about
Token ID546
Feature activation+4.72e-04

Pos logit contributions
ifying+0.002
 upon+0.002
 low+0.002
suit+0.002
 tim+0.002
Neg logit contributions
old-0.001
 artist-0.001
off-0.001
unn-0.001
 tournament-0.001
Token the
Token ID262
Feature activation-2.48e-03

Pos logit contributions
ifying+0.001
suit+0.001
 upon+0.001
 Deal+0.001
 low+0.001
Neg logit contributions
old-0.001
 artist-0.001
er-0.001
off-0.001
 example-0.001
Token iron
Token ID6953
Feature activation+7.21e-03

Pos logit contributions
old+0.003
 artist+0.003
off+0.003
er+0.003
 example+0.003
Neg logit contributions
ifying-0.006
 upon-0.005
suit-0.005
 Deal-0.004
 low-0.004
Token.
Token ID13
Feature activation+6.31e-03

Pos logit contributions
ifying+0.013
 upon+0.010
suit+0.010
 low+0.009
 scr+0.008
Neg logit contributions
old-0.013
off-0.012
 endings-0.011
 artist-0.011
 example-0.011
Token They
Token ID1119
Feature activation+1.75e-02

Pos logit contributions
ifying+0.013
 upon+0.010
suit+0.010
 low+0.009
 Deal+0.009
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.009
 example-0.008
Token are
Token ID389
Feature activation-1.07e-02

Pos logit contributions
ifying+0.035
suit+0.026
 upon+0.026
arl+0.023
 Deal+0.023
Neg logit contributions
old-0.029
 artist-0.028
off-0.025
 adventure-0.025
 tournament-0.025
Token happy
Token ID3772
Feature activation-2.56e-02

Pos logit contributions
old+0.019
 artist+0.018
er+0.017
 example+0.017
off+0.016
Neg logit contributions
ifying-0.020
suit-0.015
 upon-0.015
 Deal-0.013
 low-0.013
Token.
Token ID13
Feature activation-2.07e-03

Pos logit contributions
old+0.057
off+0.052
 tournament+0.049
 artist+0.048
unn+0.048
Neg logit contributions
 upon-0.031
ifying-0.031
 low-0.022
suit-0.020
 scr-0.018
Token<|endoftext|>
Token ID50256
Feature activation-3.50e-03

Pos logit contributions
old+0.003
 artist+0.003
off+0.003
er+0.003
 example+0.003
Neg logit contributions
ifying-0.004
 upon-0.003
suit-0.003
 low-0.003
 Deal-0.003
TokenTom
Token ID13787
Feature activation-1.64e-02

Pos logit contributions
old+0.007
off+0.006
 artist+0.006
er+0.006
 days+0.006
Neg logit contributions
ifying-0.006
 upon-0.004
suit-0.004
 Deal-0.004
 low-0.004
Token He
Token ID679
Feature activation+7.03e-03

Pos logit contributions
ifying+0.001
suit+0.001
 upon+0.001
 low+0.001
 tim+0.001
Neg logit contributions
old-0.001
 artist-0.001
off-0.001
er-0.001
 endings-0.001
Token told
Token ID1297
Feature activation+1.40e-02

Pos logit contributions
ifying+0.012
 upon+0.010
suit+0.009
 low+0.009
 scr+0.008
Neg logit contributions
old-0.013
 artist-0.012
off-0.012
 endings-0.011
 example-0.011
Token her
Token ID607
Feature activation+1.37e-03

Pos logit contributions
ifying+0.017
 upon+0.013
 low+0.012
 scr+0.011
 Deal+0.011
Neg logit contributions
old-0.031
off-0.028
 artist-0.028
 tournament-0.028
unn-0.027
Token to
Token ID284
Feature activation+5.46e-03

Pos logit contributions
ifying+0.002
 upon+0.002
suit+0.002
 low+0.002
arl+0.002
Neg logit contributions
old-0.002
off-0.002
 artist-0.002
minded-0.002
 example-0.002
Token go
Token ID467
Feature activation-1.48e-02

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.007
 low+0.007
 Deal+0.007
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.009
 example-0.008
Token away
Token ID1497
Feature activation+9.39e-03

Pos logit contributions
old+0.024
 artist+0.022
 endings+0.022
off+0.021
er+0.020
Neg logit contributions
ifying-0.028
 upon-0.024
suit-0.023
 low-0.022
arl-0.020
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
ifying+0.019
 upon+0.015
suit+0.015
 low+0.014
 Deal+0.013
Neg logit contributions
old-0.015
 artist-0.014
off-0.013
er-0.013
 example-0.012
Token ignore
Token ID8856
Feature activation+1.11e-02

Pos logit contributions
old+0.030
off+0.027
 artist+0.027
is+0.026
unn+0.026
Neg logit contributions
ifying-0.029
 upon-0.024
 low-0.023
suit-0.023
 limp-0.020
Token her
Token ID607
Feature activation+1.27e-02

Pos logit contributions
ifying+0.022
 upon+0.018
suit+0.017
 Deal+0.015
arl+0.015
Neg logit contributions
old-0.018
 artist-0.017
er-0.017
off-0.016
 example-0.016
Token.
Token ID13
Feature activation-7.58e-03

Pos logit contributions
ifying+0.030
suit+0.023
 upon+0.023
 Deal+0.022
 low+0.022
Neg logit contributions
old-0.017
 artist-0.016
er-0.015
 example-0.014
off-0.014
Token
Token ID198
Feature activation+5.15e-03

Pos logit contributions
old+0.014
 artist+0.012
off+0.012
er+0.012
 example+0.011
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.010
 low-0.010
 Deal-0.009
Token looked
Token ID3114
Feature activation+2.40e-02

Pos logit contributions
old+0.018
 artist+0.016
off+0.016
er+0.015
 endings+0.015
Neg logit contributions
ifying-0.019
 upon-0.015
suit-0.014
 low-0.013
 Deal-0.013
Token at
Token ID379
Feature activation+3.43e-03

Pos logit contributions
ifying+0.054
suit+0.042
 Deal+0.042
 upon+0.041
arl+0.041
Neg logit contributions
old-0.033
 artist-0.027
off-0.026
er-0.026
 that-0.026
Token the
Token ID262
Feature activation+1.63e-03

Pos logit contributions
ifying+0.007
suit+0.006
 upon+0.006
 Deal+0.005
arl+0.005
Neg logit contributions
old-0.005
 artist-0.005
er-0.005
off-0.004
 example-0.004
Token pies
Token ID47199
Feature activation+1.34e-03

Pos logit contributions
ifying+0.004
suit+0.003
 upon+0.003
 Deal+0.003
 low+0.003
Neg logit contributions
old-0.002
 artist-0.002
off-0.002
er-0.002
 example-0.001
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
ifying+0.002
 upon+0.002
 low+0.002
suit+0.002
 scr+0.001
Neg logit contributions
old-0.002
 artist-0.002
 endings-0.002
off-0.002
 example-0.002
Token They
Token ID1119
Feature activation+8.32e-03

Pos logit contributions
ifying+0.031
 upon+0.025
suit+0.024
 low+0.022
 Deal+0.022
Neg logit contributions
old-0.024
 artist-0.021
off-0.020
er-0.020
 example-0.020
Token all
Token ID477
Feature activation-3.06e-03

Pos logit contributions
ifying+0.016
suit+0.012
arl+0.012
 Deal+0.012
 upon+0.011
Neg logit contributions
old-0.014
 artist-0.013
 members-0.012
er-0.012
 tournament-0.012
Token looked
Token ID3114
Feature activation+2.43e-02

Pos logit contributions
old+0.006
 artist+0.005
off+0.005
er+0.005
 endings+0.005
Neg logit contributions
ifying-0.006
 upon-0.004
suit-0.004
 low-0.004
 Deal-0.004
Token y
Token ID331
Feature activation+3.99e-03

Pos logit contributions
ifying+0.057
suit+0.045
 Deal+0.045
arl+0.044
 upon+0.043
Neg logit contributions
old-0.030
 artist-0.026
 that-0.026
 more-0.026
er-0.025
Tokenummy
Token ID13513
Feature activation+7.29e-03

Pos logit contributions
ifying+0.009
 upon+0.008
suit+0.007
arl+0.007
 low+0.007
Neg logit contributions
old-0.005
 artist-0.005
 tournament-0.004
 example-0.004
off-0.004
Token and
Token ID290
Feature activation+3.71e-02

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.011
 low+0.010
 Deal+0.010
Neg logit contributions
old-0.012
 artist-0.011
off-0.011
er-0.010
 example-0.010
Token can
Token ID460
Feature activation-9.25e-03

Pos logit contributions
ifying+0.029
 upon+0.020
suit+0.019
 low+0.017
 Deal+0.016
Neg logit contributions
old-0.048
 artist-0.045
off-0.044
er-0.043
 example-0.042
Token look
Token ID804
Feature activation+2.55e-02

Pos logit contributions
old+0.017
 artist+0.015
off+0.015
er+0.014
 endings+0.014
Neg logit contributions
ifying-0.017
 upon-0.013
suit-0.012
 low-0.012
 Deal-0.011
Token at
Token ID379
Feature activation-1.58e-02

Pos logit contributions
ifying+0.053
suit+0.043
 Deal+0.042
 upon+0.041
arl+0.041
Neg logit contributions
old-0.036
 artist-0.033
 that-0.030
er-0.030
off-0.030
Token how
Token ID703
Feature activation+2.86e-03

Pos logit contributions
old+0.026
 artist+0.023
off+0.022
 tournament+0.021
 endings+0.021
Neg logit contributions
ifying-0.031
 upon-0.025
 low-0.023
suit-0.022
 Deal-0.021
Token they
Token ID484
Feature activation+2.65e-02

Pos logit contributions
ifying+0.006
 upon+0.005
suit+0.005
 low+0.005
 Deal+0.005
Neg logit contributions
old-0.004
 artist-0.004
off-0.003
er-0.003
 example-0.003
Token act
Token ID719
Feature activation+7.91e-03

Pos logit contributions
ifying+0.046
suit+0.033
 upon+0.032
arl+0.030
 Deal+0.030
Neg logit contributions
old-0.050
 artist-0.047
off-0.046
 members-0.044
er-0.044
Token,
Token ID11
Feature activation-2.05e-03

Pos logit contributions
ifying+0.016
 upon+0.013
suit+0.012
 low+0.012
 Deal+0.011
Neg logit contributions
old-0.013
 artist-0.011
off-0.011
er-0.011
 example-0.010
Token and
Token ID290
Feature activation-6.18e-03

Pos logit contributions
old+0.003
 artist+0.003
off+0.003
er+0.003
 example+0.003
Neg logit contributions
ifying-0.004
 upon-0.003
suit-0.003
 low-0.003
 Deal-0.003
Token how
Token ID703
Feature activation+6.25e-03

Pos logit contributions
old+0.008
 artist+0.007
off+0.007
er+0.007
 tournament+0.007
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.011
 low-0.010
 Deal-0.010
Token they
Token ID484
Feature activation+1.52e-02

Pos logit contributions
ifying+0.013
 upon+0.011
suit+0.010
 low+0.010
 Deal+0.010
Neg logit contributions
old-0.010
 artist-0.008
off-0.008
er-0.008
 example-0.008
Token treat
Token ID2190
Feature activation+8.97e-03

Pos logit contributions
ifying+0.025
 upon+0.018
suit+0.018
 Deal+0.017
 low+0.016
Neg logit contributions
old-0.029
 artist-0.028
off-0.027
er-0.026
 members-0.026
Token he
Token ID339
Feature activation-2.34e-02

Pos logit contributions
ifying+0.022
suit+0.017
 upon+0.017
 Deal+0.017
arl+0.016
Neg logit contributions
old-0.012
 artist-0.010
er-0.010
off-0.010
 example-0.009
Token was
Token ID373
Feature activation-2.02e-02

Pos logit contributions
old+0.040
 artist+0.036
off+0.035
er+0.034
 example+0.033
Neg logit contributions
ifying-0.045
 upon-0.035
suit-0.034
 low-0.032
 Deal-0.031
Token playing
Token ID2712
Feature activation+9.24e-03

Pos logit contributions
old+0.035
 artist+0.032
off+0.031
er+0.030
 endings+0.030
Neg logit contributions
ifying-0.037
 upon-0.030
suit-0.028
 low-0.027
 Deal-0.026
Token in
Token ID287
Feature activation+1.99e-03

Pos logit contributions
ifying+0.018
 upon+0.014
suit+0.014
 low+0.013
 Deal+0.013
Neg logit contributions
old-0.015
 artist-0.014
er-0.013
off-0.013
 example-0.013
Token his
Token ID465
Feature activation-5.30e-03

Pos logit contributions
ifying+0.004
suit+0.003
 upon+0.003
 Deal+0.003
 low+0.003
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
er-0.003
 example-0.003
Token bedroom
Token ID14043
Feature activation+6.42e-03

Pos logit contributions
old+0.009
 artist+0.008
off+0.007
er+0.007
 endings+0.007
Neg logit contributions
ifying-0.011
 upon-0.009
suit-0.008
 low-0.008
 Deal-0.007
Token when
Token ID618
Feature activation-4.22e-03

Pos logit contributions
ifying+0.013
 upon+0.011
suit+0.010
 low+0.010
 Deal+0.009
Neg logit contributions
old-0.010
 artist-0.009
off-0.009
er-0.008
 example-0.008
Token he
Token ID339
Feature activation-5.03e-03

Pos logit contributions
old+0.005
 artist+0.004
er+0.004
 days+0.004
 endings+0.004
Neg logit contributions
ifying-0.010
suit-0.008
 upon-0.008
arl-0.008
 Deal-0.008
Token noticed
Token ID6810
Feature activation-2.51e-02

Pos logit contributions
old+0.008
 artist+0.007
er+0.007
 members+0.007
off+0.007
Neg logit contributions
ifying-0.010
arl-0.008
 Deal-0.008
suit-0.008
 upon-0.007
Token something
Token ID1223
Feature activation+7.01e-03

Pos logit contributions
old+0.046
off+0.044
 artist+0.043
unn+0.042
 tournament+0.041
Neg logit contributions
ifying-0.042
 upon-0.035
 low-0.030
suit-0.027
 vel-0.027
Token strange
Token ID6283
Feature activation+1.00e-02

Pos logit contributions
ifying+0.014
 upon+0.011
suit+0.010
 Deal+0.010
arl+0.010
Neg logit contributions
old-0.012
 artist-0.010
er-0.010
off-0.010
 adventure-0.010
Token plan
Token ID1410
Feature activation-5.34e-03

Pos logit contributions
old+0.011
 artist+0.010
 adventure+0.010
 example+0.010
 tournament+0.009
Neg logit contributions
ifying-0.018
suit-0.014
 upon-0.013
 Deal-0.012
istered-0.012
Token.
Token ID13
Feature activation-2.56e-05

Pos logit contributions
old+0.009
 artist+0.008
off+0.008
 tournament+0.008
 endings+0.007
Neg logit contributions
ifying-0.010
 upon-0.008
suit-0.008
 low-0.007
 tim-0.007
Token He
Token ID679
Feature activation-3.09e-03

Pos logit contributions
old+0.000
 artist+0.000
off+0.000
 tournament+0.000
er+0.000
Neg logit contributions
ifying-0.000
 upon-0.000
suit-0.000
 low-0.000
 Deal-0.000
Token asked
Token ID1965
Feature activation+1.45e-02

Pos logit contributions
old+0.006
 artist+0.006
 endings+0.006
off+0.005
 example+0.005
Neg logit contributions
ifying-0.004
 upon-0.004
suit-0.004
 low-0.004
 tim-0.003
Token his
Token ID465
Feature activation-4.78e-03

Pos logit contributions
ifying+0.029
suit+0.021
 upon+0.021
 Deal+0.019
 low+0.019
Neg logit contributions
old-0.026
 artist-0.023
er-0.023
off-0.022
 example-0.022
Token big
Token ID1263
Feature activation-2.40e-02

Pos logit contributions
old+0.010
off+0.009
 artist+0.009
 endings+0.009
er+0.009
Neg logit contributions
ifying-0.007
 upon-0.006
suit-0.005
 low-0.005
 Deal-0.005
Token sister
Token ID6621
Feature activation-1.43e-02

Pos logit contributions
old+0.046
off+0.041
 endings+0.040
way+0.040
 artist+0.039
Neg logit contributions
ifying-0.032
 upon-0.031
suit-0.029
 low-0.027
arl-0.025
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
old+0.023
 artist+0.021
off+0.021
 endings+0.020
er+0.019
Neg logit contributions
ifying-0.027
 upon-0.023
suit-0.022
 low-0.021
 Deal-0.020
Token Lily
Token ID20037
Feature activation+7.57e-05

Pos logit contributions
ifying+0.063
suit+0.048
istered+0.042
 upon+0.039
arl+0.038
Neg logit contributions
old-0.037
 that-0.034
 more-0.033
er-0.032
 example-0.032
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
ifying+0.000
 upon+0.000
suit+0.000
 low+0.000
 Deal+0.000
Neg logit contributions
old-0.000
 artist-0.000
off-0.000
er-0.000
 example-0.000
Token to
Token ID284
Feature activation-2.93e-03

Pos logit contributions
ifying+0.053
suit+0.043
istered+0.039
 scr+0.037
 limp+0.037
Neg logit contributions
old-0.027
 that-0.024
er-0.021
 adventure-0.020
 days-0.020
Token Lily
Token ID20037
Feature activation-3.56e-03

Pos logit contributions
ifying+0.022
suit+0.017
 upon+0.017
 Deal+0.016
 low+0.016
Neg logit contributions
old-0.014
 artist-0.013
off-0.012
er-0.012
 example-0.011
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
old+0.006
 artist+0.006
 tournament+0.005
 example+0.005
 endings+0.005
Neg logit contributions
ifying-0.006
 upon-0.005
 low-0.005
suit-0.005
 tim-0.004
Token She
Token ID1375
Feature activation+2.34e-02

Pos logit contributions
ifying+0.032
suit+0.024
 upon+0.023
 tim+0.022
 Deal+0.021
Neg logit contributions
old-0.024
 artist-0.023
er-0.022
off-0.021
 days-0.020
Token loved
Token ID6151
Feature activation-2.17e-02

Pos logit contributions
ifying+0.046
 upon+0.034
suit+0.033
 Deal+0.032
arl+0.031
Neg logit contributions
old-0.042
 artist-0.039
off-0.036
er-0.036
 days-0.036
Token to
Token ID284
Feature activation+5.87e-03

Pos logit contributions
old+0.034
 artist+0.030
er+0.029
off+0.029
 example+0.028
Neg logit contributions
ifying-0.043
 upon-0.036
suit-0.033
 low-0.033
 vel-0.031
Token go
Token ID467
Feature activation-1.10e-02

Pos logit contributions
ifying+0.012
 upon+0.010
suit+0.009
 Deal+0.008
 low+0.008
Neg logit contributions
old-0.009
 artist-0.009
off-0.008
 adventure-0.008
er-0.008
Token to
Token ID284
Feature activation+1.61e-04

Pos logit contributions
old+0.021
 artist+0.019
off+0.018
er+0.018
 endings+0.018
Neg logit contributions
ifying-0.019
 upon-0.015
suit-0.014
 low-0.013
 tim-0.012
Token the
Token ID262
Feature activation-2.83e-02

Pos logit contributions
ifying+0.000
 upon+0.000
 low+0.000
suit+0.000
 Deal+0.000
Neg logit contributions
old-0.000
 artist-0.000
er-0.000
off-0.000
 endings-0.000
Token park
Token ID3952
Feature activation-1.40e-02

Pos logit contributions
old+0.036
 artist+0.030
er+0.029
off+0.028
 endings+0.027
Neg logit contributions
ifying-0.067
 upon-0.056
suit-0.053
 low-0.052
 Deal-0.049
Token with
Token ID351
Feature activation+6.68e-04

Pos logit contributions
old+0.029
 endings+0.027
 artist+0.026
unn+0.025
 mistakes+0.025
Neg logit contributions
ifying-0.020
 upon-0.018
 low-0.016
suit-0.016
 tim-0.014
Token her
Token ID607
Feature activation-3.67e-04

Pos logit contributions
ifying+0.001
 upon+0.001
suit+0.001
 low+0.001
 Deal+0.001
Neg logit contributions
old-0.001
 artist-0.001
off-0.001
er-0.001
 example-0.001
Token
Token ID198
Feature activation-5.30e-03

Pos logit contributions
old+0.035
 artist+0.031
off+0.030
er+0.030
 example+0.029
Neg logit contributions
ifying-0.037
 upon-0.029
suit-0.028
 low-0.026
 Deal-0.025
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
old+0.010
 artist+0.009
off+0.009
er+0.008
 example+0.008
Neg logit contributions
ifying-0.009
 upon-0.007
suit-0.007
 low-0.007
 Deal-0.006
TokenThe
Token ID464
Feature activation+1.45e-02

Pos logit contributions
old+0.019
 artist+0.017
off+0.016
er+0.016
 example+0.016
Neg logit contributions
ifying-0.020
 upon-0.016
suit-0.015
 low-0.014
 Deal-0.014
Token pig
Token ID12967
Feature activation-2.60e-02

Pos logit contributions
ifying+0.030
 upon+0.024
suit+0.023
 low+0.022
 Deal+0.021
Neg logit contributions
old-0.022
 artist-0.020
off-0.019
er-0.019
 example-0.018
Token asked
Token ID1965
Feature activation+9.98e-03

Pos logit contributions
old+0.049
 artist+0.045
off+0.044
 endings+0.043
 example+0.042
Neg logit contributions
ifying-0.042
 upon-0.035
suit-0.032
 low-0.030
 tim-0.027
Token the
Token ID262
Feature activation-6.18e-03

Pos logit contributions
ifying+0.022
suit+0.017
 upon+0.017
 Deal+0.016
 low+0.015
Neg logit contributions
old-0.016
er-0.014
 artist-0.013
 example-0.013
off-0.012
Token cat
Token ID3797
Feature activation-1.44e-02

Pos logit contributions
old+0.008
 artist+0.008
er+0.007
off+0.007
 example+0.007
Neg logit contributions
ifying-0.014
 upon-0.011
suit-0.011
 low-0.010
 Deal-0.010
Token,
Token ID11
Feature activation+2.72e-02

Pos logit contributions
old+0.023
 artist+0.021
off+0.020
 endings+0.020
 tournament+0.019
Neg logit contributions
ifying-0.028
 upon-0.024
suit-0.021
 low-0.021
 Deal-0.019
Token "
Token ID366
Feature activation-1.92e-02

Pos logit contributions
ifying+0.052
suit+0.040
istered+0.035
arl+0.034
 Deal+0.034
Neg logit contributions
old-0.049
 that-0.045
er-0.043
 example-0.042
 artist-0.042
TokenCan
Token ID6090
Feature activation-1.90e-02

Pos logit contributions
old+0.037
off+0.035
 artist+0.035
er+0.034
minded+0.033
Neg logit contributions
ifying-0.031
suit-0.024
 upon-0.023
 low-0.021
 tim-0.021
Token you
Token ID345
Feature activation-1.85e-02

Pos logit contributions
old+0.025
 artist+0.022
off+0.020
er+0.019
 example+0.019
Neg logit contributions
ifying-0.044
 upon-0.037
suit-0.035
 low-0.034
 Deal-0.033
Tokenie
Token ID494
Feature activation+2.29e-03

Pos logit contributions
ifying+0.013
 upon+0.011
suit+0.011
 low+0.010
arl+0.010
Neg logit contributions
old-0.006
 artist-0.005
 example-0.005
 tournament-0.005
off-0.005
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
ifying+0.005
 upon+0.004
suit+0.004
 low+0.004
 Deal+0.003
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
er-0.003
 example-0.003
Token We
Token ID775
Feature activation+6.60e-03

Pos logit contributions
old+0.024
 artist+0.021
off+0.021
er+0.020
 endings+0.020
Neg logit contributions
ifying-0.026
 upon-0.020
suit-0.019
 Deal-0.018
 low-0.018
Token need
Token ID761
Feature activation-5.72e-02

Pos logit contributions
ifying+0.010
 upon+0.007
suit+0.007
 low+0.006
 Deal+0.006
Neg logit contributions
old-0.014
 artist-0.013
off-0.013
er-0.012
 example-0.012
Token to
Token ID284
Feature activation-1.58e-02

Pos logit contributions
 artist+0.105
old+0.105
 tournament+0.096
 endings+0.093
ican+0.093
Neg logit contributions
ifying-0.093
 upon-0.077
 low-0.064
suit-0.059
 total-0.058
Token pay
Token ID1414
Feature activation-1.39e-02

Pos logit contributions
old+0.027
 artist+0.025
off+0.024
er+0.023
 example+0.023
Neg logit contributions
ifying-0.030
 upon-0.024
suit-0.023
 low-0.021
 Deal-0.021
Token the
Token ID262
Feature activation-1.86e-03

Pos logit contributions
old+0.022
 artist+0.020
off+0.019
er+0.019
 tournament+0.019
Neg logit contributions
ifying-0.028
 upon-0.022
suit-0.021
 low-0.020
 Deal-0.020
Token bills
Token ID9024
Feature activation+2.26e-02

Pos logit contributions
old+0.003
 artist+0.003
off+0.002
er+0.002
 endings+0.002
Neg logit contributions
ifying-0.004
 upon-0.003
suit-0.003
 low-0.003
 Deal-0.003
Token first
Token ID717
Feature activation-1.22e-02

Pos logit contributions
ifying+0.050
 upon+0.040
suit+0.039
 Deal+0.038
 low+0.038
Neg logit contributions
old-0.031
 artist-0.027
er-0.027
off-0.027
 members-0.025
Token."
Token ID526
Feature activation-5.81e-03

Pos logit contributions
old+0.018
 artist+0.016
off+0.015
er+0.014
 endings+0.014
Neg logit contributions
ifying-0.026
 upon-0.022
suit-0.021
 low-0.020
 Deal-0.019
Token
Token ID198
Feature activation+8.35e-03

Pos logit contributions
old+0.011
 artist+0.011
off+0.010
 example+0.010
 tournament+0.010
Neg logit contributions
ifying-0.009
 upon-0.007
suit-0.007
 low-0.006
 Deal-0.006
Token They
Token ID1119
Feature activation+3.87e-03

Pos logit contributions
ifying+0.009
 upon+0.007
suit+0.006
 low+0.006
 Deal+0.006
Neg logit contributions
old-0.007
 artist-0.006
off-0.006
er-0.006
 example-0.006
Token are
Token ID389
Feature activation-9.01e-03

Pos logit contributions
ifying+0.008
suit+0.006
 upon+0.006
arl+0.005
 Deal+0.005
Neg logit contributions
old-0.006
 adventure-0.006
 artist-0.006
off-0.005
 members-0.005
Token not
Token ID407
Feature activation-1.65e-03

Pos logit contributions
old+0.015
 artist+0.013
er+0.013
 members+0.013
 adventure+0.013
Neg logit contributions
ifying-0.018
suit-0.014
 upon-0.013
 Deal-0.012
arl-0.012
Token happy
Token ID3772
Feature activation-2.10e-02

Pos logit contributions
old+0.003
 artist+0.003
er+0.002
off+0.002
 members+0.002
Neg logit contributions
ifying-0.003
suit-0.002
 upon-0.002
 Deal-0.002
arl-0.002
Token.
Token ID13
Feature activation-7.10e-04

Pos logit contributions
old+0.039
 artist+0.034
off+0.033
 tournament+0.032
is+0.032
Neg logit contributions
ifying-0.036
 upon-0.031
suit-0.027
 low-0.026
arl-0.024
Token
Token ID198
Feature activation+3.77e-03

Pos logit contributions
old+0.001
 artist+0.001
off+0.001
er+0.001
 example+0.001
Neg logit contributions
ifying-0.001
 upon-0.001
suit-0.001
 low-0.001
 Deal-0.001
Token
Token ID198
Feature activation+1.67e-03

Pos logit contributions
ifying+0.006
 upon+0.005
suit+0.004
 low+0.004
 Deal+0.004
Neg logit contributions
old-0.007
 artist-0.007
off-0.007
er-0.007
 example-0.006
Token"
Token ID1
Feature activation-2.30e-02

Pos logit contributions
ifying+0.003
 upon+0.002
suit+0.002
 low+0.002
arl+0.002
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
 endings-0.003
er-0.003
TokenWill
Token ID8743
Feature activation-8.60e-03

Pos logit contributions
old+0.045
 artist+0.042
off+0.040
er+0.040
 example+0.039
Neg logit contributions
ifying-0.038
 upon-0.029
suit-0.028
 low-0.025
 Deal-0.025
Token the
Token ID262
Feature activation-1.08e-03

Pos logit contributions
old+0.009
off+0.008
er+0.008
 artist+0.007
 that+0.007
Neg logit contributions
ifying-0.022
suit-0.018
 upon-0.018
 Deal-0.017
arl-0.017
Token dog
Token ID3290
Feature activation-1.31e-02

Pos logit contributions
old+0.002
 artist+0.001
 adventure+0.001
off+0.001
er+0.001
Neg logit contributions
ifying-0.002
 upon-0.002
suit-0.002
 Deal-0.002
 low-0.002
Token reach
Token ID3151
Feature activation-6.61e-03

Pos logit contributions
old+0.024
 artist+0.022
off+0.021
er+0.020
 example+0.020
Neg logit contributions
ifying-0.026
 upon-0.020
suit-0.020
 low-0.019
 vel-0.017
Token higher
Token ID2440
Feature activation+7.41e-03

Pos logit contributions
old+0.009
er+0.008
 artist+0.008
 example+0.008
 adventure+0.008
Neg logit contributions
ifying-0.015
suit-0.012
 upon-0.011
 Deal-0.011
arl-0.011
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
ifying+0.016
 upon+0.012
suit+0.012
 low+0.011
 Deal+0.011
Neg logit contributions
old-0.011
 artist-0.010
er-0.010
off-0.010
 example-0.009
Token Finally
Token ID9461
Feature activation+5.12e-03

Pos logit contributions
old+0.034
 artist+0.030
off+0.029
er+0.029
 example+0.028
Neg logit contributions
ifying-0.036
 upon-0.028
suit-0.027
 low-0.025
 Deal-0.024
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
ifying+0.011
 upon+0.009
suit+0.009
 low+0.008
 Deal+0.008
Neg logit contributions
old-0.008
 artist-0.007
off-0.007
er-0.006
 example-0.006
Token Bob
Token ID5811
Feature activation-4.53e-02

Pos logit contributions
old+0.021
 artist+0.018
off+0.017
er+0.017
 example+0.017
Neg logit contributions
ifying-0.038
suit-0.031
 upon-0.031
 Deal-0.029
 low-0.029
Token was
Token ID373
Feature activation-1.57e-02

Pos logit contributions
old+0.087
 artist+0.080
off+0.078
 endings+0.075
er+0.075
Neg logit contributions
ifying-0.075
 upon-0.059
suit-0.056
 low-0.053
 tim-0.048
Token able
Token ID1498
Feature activation-2.73e-02

Pos logit contributions
old+0.022
 artist+0.018
 example+0.018
er+0.017
 members+0.017
Neg logit contributions
ifying-0.037
suit-0.029
 upon-0.028
arl-0.027
 Deal-0.026
Token to
Token ID284
Feature activation-2.82e-03

Pos logit contributions
old+0.039
 artist+0.035
off+0.033
er+0.033
 example+0.031
Neg logit contributions
ifying-0.059
 upon-0.049
suit-0.047
 low-0.045
 Deal-0.043
Token grab
Token ID5552
Feature activation-2.03e-03

Pos logit contributions
old+0.005
 artist+0.005
off+0.005
er+0.005
 endings+0.005
Neg logit contributions
ifying-0.005
 upon-0.004
suit-0.003
 low-0.003
 Deal-0.003
Token the
Token ID262
Feature activation-3.66e-04

Pos logit contributions
old+0.003
 artist+0.003
er+0.003
off+0.003
 example+0.003
Neg logit contributions
ifying-0.004
 upon-0.003
suit-0.003
 Deal-0.003
arl-0.003
Token charitable
Token ID21803
Feature activation-1.07e-02

Pos logit contributions
ifying+0.002
 upon+0.002
suit+0.001
 low+0.001
 Deal+0.001
Neg logit contributions
old-0.001
 artist-0.001
off-0.001
er-0.001
 example-0.001
Token.
Token ID13
Feature activation-2.83e-02

Pos logit contributions
old+0.015
 artist+0.013
off+0.013
er+0.013
 example+0.012
Neg logit contributions
ifying-0.023
 upon-0.019
suit-0.019
 low-0.018
 Deal-0.017
Token You
Token ID921
Feature activation+1.17e-02

Pos logit contributions
old+0.051
 artist+0.047
off+0.045
er+0.044
 example+0.043
Neg logit contributions
ifying-0.051
 upon-0.040
suit-0.038
 low-0.035
 Deal-0.034
Token don
Token ID836
Feature activation+1.76e-03

Pos logit contributions
ifying+0.018
 upon+0.013
suit+0.013
 low+0.012
 Deal+0.011
Neg logit contributions
old-0.024
 artist-0.022
off-0.021
er-0.021
 example-0.021
Token't
Token ID470
Feature activation+6.13e-03

Pos logit contributions
ifying+0.003
 upon+0.002
suit+0.002
 low+0.002
 Deal+0.002
Neg logit contributions
old-0.003
 artist-0.003
off-0.003
er-0.003
 example-0.003
Token need
Token ID761
Feature activation-4.67e-02

Pos logit contributions
ifying+0.010
 upon+0.008
suit+0.007
 low+0.007
 Deal+0.006
Neg logit contributions
old-0.012
 artist-0.011
off-0.011
er-0.011
 example-0.010
Token material
Token ID2587
Feature activation+1.73e-02

Pos logit contributions
old+0.079
 artist+0.073
off+0.068
er+0.068
 tournament+0.066
Neg logit contributions
ifying-0.086
 upon-0.071
 low-0.063
suit-0.059
 Deal-0.058
Token possessions
Token ID23309
Feature activation+2.68e-02

Pos logit contributions
ifying+0.040
 upon+0.033
suit+0.033
 low+0.031
 Deal+0.031
Neg logit contributions
old-0.022
 artist-0.019
off-0.019
er-0.018
 endings-0.017
Token to
Token ID284
Feature activation+2.17e-02

Pos logit contributions
ifying+0.060
suit+0.048
 upon+0.048
 Deal+0.047
 low+0.045
Neg logit contributions
old-0.035
er-0.031
 artist-0.031
off-0.031
 adventure-0.028
Token support
Token ID1104
Feature activation+4.85e-03

Pos logit contributions
ifying+0.043
 upon+0.034
suit+0.033
 low+0.030
 Deal+0.030
Neg logit contributions
old-0.036
 artist-0.032
off-0.031
er-0.031
 example-0.030
Token those
Token ID883
Feature activation-5.75e-03

Pos logit contributions
ifying+0.012
 upon+0.010
suit+0.010
 low+0.009
 Deal+0.009
Neg logit contributions
old-0.006
 artist-0.005
off-0.005
er-0.005
 endings-0.004
Token
Token ID198
Feature activation-6.86e-03

Pos logit contributions
ifying+0.003
 upon+0.003
 low+0.002
suit+0.002
 Deal+0.002
Neg logit contributions
old-0.005
 artist-0.005
er-0.004
off-0.004
 endings-0.004
TokenSo
Token ID2396
Feature activation-1.39e-03

Pos logit contributions
old+0.012
 artist+0.011
er+0.010
off+0.010
 endings+0.010
Neg logit contributions
ifying-0.013
 upon-0.011
 low-0.010
suit-0.009
Me-0.009
Token Joey
Token ID26154
Feature activation-3.82e-02

Pos logit contributions
old+0.002
 artist+0.002
off+0.002
 endings+0.002
er+0.002
Neg logit contributions
ifying-0.003
 upon-0.002
 low-0.002
suit-0.002
 Deal-0.002
Token and
Token ID290
Feature activation+1.85e-02

Pos logit contributions
old+0.067
 artist+0.062
 endings+0.060
off+0.060
 example+0.058
Neg logit contributions
ifying-0.067
 upon-0.054
suit-0.052
 low-0.048
 tim-0.044
Token his
Token ID465
Feature activation-5.26e-03

Pos logit contributions
ifying+0.026
 upon+0.022
 low+0.018
suit+0.018
 Deal+0.016
Neg logit contributions
old-0.038
off-0.035
 artist-0.035
er-0.034
 endings-0.034
Token teacher
Token ID4701
Feature activation-4.55e-02

Pos logit contributions
old+0.008
off+0.007
er+0.007
 artist+0.007
 endings+0.007
Neg logit contributions
ifying-0.010
 upon-0.008
suit-0.008
 low-0.008
 Deal-0.007
Token went
Token ID1816
Feature activation-2.90e-02

Pos logit contributions
old+0.083
 artist+0.080
 endings+0.078
off+0.076
er+0.074
Neg logit contributions
ifying-0.075
 upon-0.059
suit-0.057
 low-0.053
 tim-0.048
Token to
Token ID284
Feature activation-2.13e-02

Pos logit contributions
old+0.053
 artist+0.050
 endings+0.048
off+0.046
er+0.045
Neg logit contributions
ifying-0.051
 upon-0.042
suit-0.040
 low-0.038
 tim-0.034
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
old+0.037
 artist+0.033
off+0.032
er+0.031
 example+0.030
Neg logit contributions
ifying-0.041
 upon-0.032
suit-0.031
 low-0.029
 Deal-0.028
Token garden
Token ID11376
Feature activation-3.18e-02

Pos logit contributions
old+0.038
oot+0.033
off+0.032
unn+0.032
 artist+0.032
Neg logit contributions
ifying-0.043
 upon-0.036
 low-0.034
suit-0.033
Me-0.029
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
old+0.059
 artist+0.055
off+0.054
 endings+0.053
unn+0.052
Neg logit contributions
ifying-0.050
 upon-0.047
 low-0.040
suit-0.040
 scr-0.034
Token it
Token ID340
Feature activation-7.55e-03

Pos logit contributions
ifying+0.020
suit+0.015
 upon+0.015
 Deal+0.015
arl+0.014
Neg logit contributions
old-0.018
 artist-0.016
er-0.015
off-0.015
 days-0.015
Token was
Token ID373
Feature activation+1.33e-03

Pos logit contributions
old+0.016
 artist+0.014
off+0.014
 example+0.013
er+0.013
Neg logit contributions
ifying-0.011
 upon-0.009
suit-0.008
 low-0.008
 scr-0.007
Token sad
Token ID6507
Feature activation-2.94e-04

Pos logit contributions
ifying+0.002
suit+0.002
 upon+0.002
 Deal+0.002
arl+0.002
Neg logit contributions
old-0.002
 artist-0.002
er-0.002
 example-0.002
off-0.002
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
old+0.001
 artist+0.000
off+0.000
 tournament+0.000
er+0.000
Neg logit contributions
ifying-0.001
 upon-0.000
suit-0.000
 low-0.000
arl-0.000
Token They
Token ID1119
Feature activation+3.46e-03

Pos logit contributions
old+0.021
 artist+0.019
off+0.018
er+0.018
is+0.018
Neg logit contributions
ifying-0.017
 upon-0.014
 low-0.012
suit-0.012
 Deal-0.011
Token needed
Token ID2622
Feature activation-4.71e-02

Pos logit contributions
ifying+0.006
 upon+0.004
suit+0.004
 low+0.004
 Deal+0.004
Neg logit contributions
old-0.007
 artist-0.006
off-0.006
er-0.006
 example-0.006
Token the
Token ID262
Feature activation-1.84e-02

Pos logit contributions
old+0.092
 artist+0.090
off+0.087
unn+0.086
 tournament+0.083
Neg logit contributions
ifying-0.068
 upon-0.058
 low-0.056
 limp-0.045
 loose-0.045
Token tower
Token ID10580
Feature activation-8.50e-04

Pos logit contributions
old+0.027
 artist+0.023
off+0.023
er+0.022
 endings+0.021
Neg logit contributions
ifying-0.040
 upon-0.033
 low-0.031
suit-0.031
 Deal-0.029
Token to
Token ID284
Feature activation+9.40e-03

Pos logit contributions
old+0.001
 artist+0.001
off+0.001
er+0.001
 example+0.001
Neg logit contributions
ifying-0.002
 upon-0.001
suit-0.001
 low-0.001
 Deal-0.001
Token heal
Token ID12035
Feature activation+1.34e-02

Pos logit contributions
ifying+0.016
 upon+0.013
 low+0.012
suit+0.012
 Deal+0.011
Neg logit contributions
old-0.018
 artist-0.016
off-0.015
er-0.015
 endings-0.015
Token them
Token ID606
Feature activation+2.50e-02

Pos logit contributions
ifying+0.031
 upon+0.025
suit+0.025
 Deal+0.023
arl+0.023
Neg logit contributions
old-0.017
 artist-0.016
er-0.016
 example-0.015
off-0.015
Tokenily
Token ID813
Feature activation-3.12e-02

Pos logit contributions
old+0.020
 artist+0.020
 endings+0.018
minded+0.018
off+0.018
Neg logit contributions
ifying-0.021
 upon-0.019
 Deal-0.016
suit-0.015
 low-0.015
Token was
Token ID373
Feature activation-1.07e-02

Pos logit contributions
old+0.055
 artist+0.050
off+0.048
er+0.048
 example+0.046
Neg logit contributions
ifying-0.058
 upon-0.045
suit-0.043
 low-0.041
 Deal-0.040
Token sick
Token ID6639
Feature activation-9.44e-03

Pos logit contributions
old+0.017
 artist+0.016
 example+0.014
off+0.014
er+0.014
Neg logit contributions
ifying-0.022
suit-0.017
 upon-0.016
 Deal-0.015
arl-0.015
Token.
Token ID13
Feature activation+3.29e-03

Pos logit contributions
old+0.015
 artist+0.014
off+0.013
er+0.013
 example+0.012
Neg logit contributions
ifying-0.019
 upon-0.015
suit-0.015
 low-0.014
 Deal-0.013
Token She
Token ID1375
Feature activation+2.33e-02

Pos logit contributions
ifying+0.006
suit+0.005
 upon+0.005
 low+0.004
arl+0.004
Neg logit contributions
old-0.005
 artist-0.005
er-0.005
 endings-0.005
off-0.004
Token had
Token ID550
Feature activation-4.23e-02

Pos logit contributions
ifying+0.045
arl+0.034
 upon+0.033
suit+0.033
 Deal+0.033
Neg logit contributions
old-0.040
 days-0.036
 artist-0.035
er-0.034
 members-0.034
Token a
Token ID257
Feature activation-1.95e-02

Pos logit contributions
old+0.059
 artist+0.052
off+0.050
er+0.049
 days+0.049
Neg logit contributions
ifying-0.092
suit-0.075
 upon-0.074
 Deal-0.072
arl-0.069
Token hot
Token ID3024
Feature activation-1.80e-02

Pos logit contributions
old+0.031
 artist+0.031
 adventure+0.030
 hospital+0.029
 talent+0.028
Neg logit contributions
ifying-0.037
 upon-0.027
suit-0.027
 Deal-0.026
arl-0.025
Token head
Token ID1182
Feature activation+2.98e-02

Pos logit contributions
old+0.027
 artist+0.025
er+0.023
off+0.023
 days+0.023
Neg logit contributions
ifying-0.038
 upon-0.030
suit-0.029
 low-0.028
 Deal-0.028
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
ifying+0.068
 Deal+0.064
suit+0.058
 upon+0.058
 vel+0.055
Neg logit contributions
old-0.034
 that-0.032
er-0.032
 artist-0.029
 members-0.027
Token a
Token ID257
Feature activation-1.22e-03

Pos logit contributions
ifying+0.036
suit+0.029
 upon+0.028
 Deal+0.027
arl+0.027
Neg logit contributions
old-0.024
 days-0.022
 artist-0.021
er-0.020
 that-0.020