TOP ACTIVATIONS
MAX = 8.827

's so special about Wed nes days ?" Max
much . Please don 't ever wander again ." She said
Both of them put their anx ieties aside . From then
Tim grabbed the hook and anx iously put it in the
hunter came across a deep rav ine . He thought he
slipped and fell into the rav ine below .
the same urge to explore Ur ge Land everyday , and
M ummy watched him anx iously as Ali waited for
is to always shout ' fore ' when you hit the
to never try to catch Twe ety again . From that
Lucy noticed the weather was uncom fort ably hot . She
was always excited for Wed nes days because it was the
are you ?" she shouted anx iously .
in the park on a bree zy day . Mary was
dare to climb up to Twe ety 's nest .
don 't think you can ever take it off !"
to give the sheet to Nem o and D ory .
I promise you won 't ever have to be scared again
a special hat for Wed nes days ." L
you . I won 't ever do it again . Next

MIN ACTIVATIONS
MIN = -7.581

with his trunk . The label fell off and the elephant
destroy the card . The card was very scared . But
stepped carefully , because the bottle was very delicate .
ground . The label said " save me !"
pped and fell . The wine glass flew out of her
. The card was incredible . It had
wherever she went . The label was very attractive and was
started to paint . The wine made the colors look different
and a note . The note said , " If you
was too small ! His photo had shrunk .
was also wrong . The photo is not worth our happiness
the river bank . The sign said " Bridge this way
Mom my explained that the box was like a truck .
back and smiled . The bucket was all clean and shiny
on her slice . The note said ' Happy Birthday '.
note from Sammy . The note said that Sammy had left
said to Max " This wine is so hot ! Let
year old child . The card had a picture of an
and said , " This card made me happy , maybe
started to happen . The print started to shine and spark

INTERVAL 4.725 - 8.827
CONTAINS 0.632%

said , " I want to go on a boat adventure
. They did not want to get in trouble . They
too . Do you want to help me get ready ?"
. He did not want to make his mom angry .
smiling , but he couldn 't bring himself to smile back

INTERVAL 0.623 - 4.725
CONTAINS 34.843%

She said , � � € � � Jimmy , watch
she never forgot the lesson she had learnt in the park
, but the dragon smiled and said hello .
she had been so silly and crazy .
day G ia would explore her backyard and find something new

INTERVAL -3.479 - 0.623
CONTAINS 60.804%

, Lily 's mom gave her a snack to feed her
could go near the beast because it was too fierce .
best friends , and they played together every day . <|endoftext|>
. Finally , the soldier spotted some candy .
ite in the tree . He was not happy .

INTERVAL -7.581 - -3.479
CONTAINS 3.721%

friendly bug . The bug said , " Hi , lizard
they were so impressed with what Bob had done . They
big slide and they all took turns sliding down it .
fisher s heard Jack and saw Tom 's boat . They
clown is not real . It is just a picture .
Token's
Token ID338
Feature activation-2.31e-01

Pos logit contributions
aw+0.037
It+0.035
 living+0.035
 that+0.031
 brand+0.030
Neg logit contributions
 argue-0.093
sing-0.088
 whine-0.084
days-0.082
 breath-0.079
Token so
Token ID523
Feature activation+1.52e+00

Pos logit contributions
aw+0.003
 living+0.003
It+0.003
 that+0.002
That+0.002
Neg logit contributions
 argue-0.007
sing-0.007
 spine-0.006
 whine-0.006
 breath-0.006
Token special
Token ID2041
Feature activation-3.56e-01

Pos logit contributions
 argue+0.045
sing+0.043
 whine+0.041
 spine+0.040
 breath+0.040
Neg logit contributions
aw-0.019
 living-0.017
It-0.015
 that-0.013
ing-0.013
Token about
Token ID546
Feature activation+5.80e-01

Pos logit contributions
aw+0.008
It+0.008
 living+0.007
That+0.007
hes+0.007
Neg logit contributions
 argue-0.007
sing-0.006
 spine-0.006
 whine-0.006
days-0.006
Token Wed
Token ID3300
Feature activation+5.01e+00

Pos logit contributions
 argue+0.014
sing+0.014
 breath+0.012
 spine+0.012
 whine+0.012
Neg logit contributions
aw-0.010
It-0.009
 living-0.008
That-0.008
hes-0.008
Tokennes
Token ID2516
Feature activation+8.83e+00

Pos logit contributions
 argue+0.065
sing+0.065
 spine+0.058
 breath+0.058
 whine+0.056
Neg logit contributions
 living-0.141
It-0.141
aw-0.139
 that-0.137
 all-0.130
Tokendays
Token ID12545
Feature activation+1.79e+00

Pos logit contributions
sing+0.182
 argue+0.180
 spine+0.173
 breath+0.166
 whine+0.164
Neg logit contributions
 living-0.190
 that-0.182
It-0.178
aw-0.175
 all-0.169
Token?"
Token ID1701
Feature activation+1.52e+00

Pos logit contributions
 argue+0.034
days+0.032
sing+0.031
 whine+0.029
 spine+0.028
Neg logit contributions
aw-0.041
It-0.039
 living-0.039
That-0.035
See-0.035
Token
Token ID198
Feature activation-6.23e-01

Pos logit contributions
 argue+0.045
sing+0.044
 spine+0.043
 whine+0.041
 breath+0.041
Neg logit contributions
 living-0.018
aw-0.018
It-0.016
 that-0.016
 all-0.014
Token
Token ID198
Feature activation-1.70e-01

Pos logit contributions
 living+0.007
aw+0.007
It+0.007
 that+0.006
 all+0.006
Neg logit contributions
 argue-0.019
sing-0.018
 spine-0.017
 breath-0.017
 whine-0.017
TokenMax
Token ID11518
Feature activation+3.17e-01

Pos logit contributions
 living+0.003
aw+0.003
 that+0.003
It+0.003
 all+0.002
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.004
 breath-0.004
Token much
Token ID881
Feature activation+1.44e+00

Pos logit contributions
 argue+0.064
sing+0.063
 spine+0.061
 whine+0.059
 breath+0.059
Neg logit contributions
 living-0.014
aw-0.013
It-0.012
 that-0.012
 all-0.009
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 argue+0.034
sing+0.033
 spine+0.031
 whine+0.030
 breath+0.030
Neg logit contributions
 living-0.027
aw-0.026
It-0.026
 that-0.026
 all-0.024
Token Please
Token ID4222
Feature activation+3.56e+00

Pos logit contributions
aw+0.010
 living+0.009
It+0.009
 that+0.009
 all+0.008
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.011
 whine-0.011
days-0.011
Token don
Token ID836
Feature activation+2.43e+00

Pos logit contributions
sing+0.083
 argue+0.083
 spine+0.079
 whine+0.075
 breath+0.074
Neg logit contributions
 living-0.066
It-0.064
 that-0.063
aw-0.063
 all-0.058
Token't
Token ID470
Feature activation+5.14e+00

Pos logit contributions
 argue+0.048
sing+0.041
 breath+0.040
 whine+0.039
days+0.036
Neg logit contributions
aw-0.056
 living-0.055
 prett-0.048
 biggest-0.047
 brand-0.047
Token ever
Token ID1683
Feature activation+8.08e+00

Pos logit contributions
sing+0.105
 spine+0.098
 argue+0.097
 breath+0.090
days+0.087
Neg logit contributions
 that-0.112
It-0.108
 living-0.107
 all-0.105
aw-0.101
Token wander
Token ID27776
Feature activation+7.15e-01

Pos logit contributions
sing+0.159
 spine+0.148
 argue+0.145
 breath+0.137
irds+0.137
Neg logit contributions
 that-0.176
 living-0.168
 all-0.164
It-0.164
aw-0.156
Token again
Token ID757
Feature activation+4.35e+00

Pos logit contributions
 argue+0.017
sing+0.016
 spine+0.015
 breath+0.015
 whine+0.015
Neg logit contributions
 living-0.013
aw-0.013
It-0.013
 that-0.012
 all-0.011
Token."
Token ID526
Feature activation+1.13e+00

Pos logit contributions
 argue+0.096
sing+0.094
 spine+0.088
 whine+0.085
 breath+0.084
Neg logit contributions
 living-0.087
aw-0.085
It-0.084
 that-0.083
 all-0.077
Token She
Token ID1375
Feature activation-1.77e+00

Pos logit contributions
 argue+0.041
sing+0.041
 breath+0.039
 spine+0.039
cker+0.037
Neg logit contributions
 that-0.008
aw-0.008
 living-0.008
It-0.008
 all-0.005
Token said
Token ID531
Feature activation-2.68e+00

Pos logit contributions
 living+0.032
aw+0.030
It+0.030
 that+0.028
ay+0.027
Neg logit contributions
 argue-0.041
sing-0.039
 whine-0.037
 breath-0.037
 spine-0.036
Token Both
Token ID5747
Feature activation-1.06e+00

Pos logit contributions
aw+0.002
 living+0.002
 that+0.002
It+0.001
 all+0.001
Neg logit contributions
 argue-0.003
sing-0.003
 spine-0.003
 whine-0.003
days-0.003
Token of
Token ID286
Feature activation+1.07e+00

Pos logit contributions
aw+0.020
It+0.019
 living+0.018
hes+0.016
 brand+0.016
Neg logit contributions
 argue-0.024
 breath-0.023
 whine-0.022
days-0.021
sing-0.021
Token them
Token ID606
Feature activation-1.41e+00

Pos logit contributions
 argue+0.024
 breath+0.022
sing+0.021
 whine+0.021
days+0.020
Neg logit contributions
aw-0.020
 living-0.019
It-0.019
 biggest-0.018
 brand-0.018
Token put
Token ID1234
Feature activation-2.99e+00

Pos logit contributions
 living+0.021
aw+0.020
It+0.020
 that+0.019
 all+0.017
Neg logit contributions
 argue-0.038
sing-0.038
 spine-0.035
 whine-0.035
 breath-0.035
Token their
Token ID511
Feature activation-4.59e-01

Pos logit contributions
 living+0.052
It+0.050
aw+0.049
 brand+0.045
That+0.044
Neg logit contributions
 argue-0.067
 breath-0.066
sing-0.061
 whine-0.060
days-0.058
Token anx
Token ID7296
Feature activation+7.95e+00

Pos logit contributions
 living+0.008
aw+0.008
It+0.008
 that+0.008
 all+0.007
Neg logit contributions
 argue-0.011
sing-0.011
 spine-0.010
 whine-0.010
 breath-0.010
Tokenieties
Token ID9545
Feature activation-5.98e-01

Pos logit contributions
 argue+0.115
days+0.104
sing+0.104
 spine+0.102
 drown+0.101
Neg logit contributions
aw-0.204
 that-0.190
 living-0.185
 all-0.174
 best-0.173
Token aside
Token ID7263
Feature activation+1.10e+00

Pos logit contributions
 living+0.009
aw+0.009
It+0.009
 that+0.008
That+0.008
Neg logit contributions
 argue-0.016
sing-0.015
 breath-0.015
 spine-0.014
 whine-0.014
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
 argue+0.028
sing+0.027
 spine+0.026
 breath+0.025
 whine+0.025
Neg logit contributions
aw-0.018
 living-0.018
It-0.018
 that-0.016
That-0.015
Token From
Token ID3574
Feature activation-1.33e+00

Pos logit contributions
aw+0.009
 living+0.008
 that+0.008
It+0.007
 all+0.007
Neg logit contributions
 argue-0.015
sing-0.015
 spine-0.014
 whine-0.014
days-0.014
Token then
Token ID788
Feature activation+2.03e+00

Pos logit contributions
 living+0.014
It+0.014
aw+0.013
hes+0.011
 biggest+0.011
Neg logit contributions
 argue-0.038
sing-0.038
 breath-0.037
 whine-0.035
 spine-0.035
TokenTim
Token ID14967
Feature activation+9.67e-01

Pos logit contributions
 living+0.010
aw+0.010
 that+0.009
It+0.009
 all+0.009
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Token grabbed
Token ID13176
Feature activation-2.97e+00

Pos logit contributions
 argue+0.028
sing+0.027
 spine+0.026
 whine+0.025
 breath+0.025
Neg logit contributions
 living-0.013
aw-0.013
It-0.012
 that-0.012
 all-0.011
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 living+0.051
aw+0.050
It+0.049
 that+0.047
 all+0.043
Neg logit contributions
 argue-0.074
sing-0.072
 spine-0.068
 breath-0.067
 whine-0.067
Token hook
Token ID8011
Feature activation-1.53e+00

Pos logit contributions
 living+0.027
aw+0.027
It+0.026
 that+0.026
 all+0.025
Neg logit contributions
sing-0.021
 argue-0.021
 spine-0.020
 breath-0.019
 whine-0.018
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
 living+0.020
aw+0.020
It+0.019
 that+0.019
 all+0.017
Neg logit contributions
 argue-0.044
sing-0.043
 spine-0.041
 whine-0.040
 breath-0.040
Token anx
Token ID7296
Feature activation+7.86e+00

Pos logit contributions
 argue+0.052
sing+0.051
 spine+0.048
 breath+0.047
 whine+0.047
Neg logit contributions
 living-0.037
aw-0.036
It-0.035
 that-0.034
 all-0.031
Tokeniously
Token ID6819
Feature activation+3.34e+00

Pos logit contributions
 drown+0.132
 seek+0.128
days+0.123
sing+0.122
unny+0.122
Neg logit contributions
 that-0.184
aw-0.165
 all-0.161
 living-0.150
 so-0.147
Token put
Token ID1234
Feature activation-3.39e+00

Pos logit contributions
 argue+0.087
sing+0.087
 spine+0.082
 whine+0.079
days+0.077
Neg logit contributions
 that-0.053
 living-0.052
aw-0.051
It-0.049
 all-0.048
Token it
Token ID340
Feature activation-1.90e+00

Pos logit contributions
 living+0.065
aw+0.064
It+0.056
pill+0.054
That+0.053
Neg logit contributions
 breath-0.078
 argue-0.072
sing-0.067
 spine-0.064
 whine-0.063
Token in
Token ID287
Feature activation+2.52e+00

Pos logit contributions
 living+0.022
aw+0.022
It+0.021
 that+0.020
 all+0.017
Neg logit contributions
 argue-0.057
sing-0.056
 spine-0.054
 breath-0.053
 whine-0.052
Token the
Token ID262
Feature activation-7.97e-01

Pos logit contributions
sing+0.064
 argue+0.064
 spine+0.060
 whine+0.058
days+0.055
Neg logit contributions
 living-0.042
It-0.041
 that-0.041
aw-0.039
 all-0.038
Token hunter
Token ID19177
Feature activation-5.19e-02

Pos logit contributions
aw+0.049
 living+0.046
hes+0.045
It+0.044
pill+0.044
Neg logit contributions
 spine-0.032
sing-0.032
 argue-0.030
 breath-0.029
cker-0.027
Token came
Token ID1625
Feature activation-2.78e-01

Pos logit contributions
 living+0.001
aw+0.001
It+0.001
 that+0.001
 all+0.001
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 whine-0.001
 breath-0.001
Token across
Token ID1973
Feature activation-2.23e-01

Pos logit contributions
 living+0.004
It+0.004
aw+0.004
 that+0.004
 all+0.004
Neg logit contributions
 argue-0.008
sing-0.007
 spine-0.007
 whine-0.007
 breath-0.007
Token a
Token ID257
Feature activation-6.34e-01

Pos logit contributions
 living+0.002
 that+0.002
It+0.002
aw+0.002
 all+0.002
Neg logit contributions
 argue-0.008
sing-0.008
 spine-0.007
 whine-0.007
days-0.007
Token deep
Token ID2769
Feature activation+3.98e+00

Pos logit contributions
aw+0.009
hes+0.008
 living+0.008
It+0.008
 that+0.008
Neg logit contributions
 argue-0.017
sing-0.017
 breath-0.016
 spine-0.016
 whine-0.015
Token rav
Token ID24343
Feature activation+7.85e+00

Pos logit contributions
 argue+0.092
sing+0.090
 spine+0.086
 breath+0.085
 whine+0.082
Neg logit contributions
 living-0.074
aw-0.073
It-0.072
 that-0.068
That-0.063
Tokenine
Token ID500
Feature activation+1.99e-01

Pos logit contributions
 argue+0.141
 whine+0.125
sing+0.123
days+0.115
 breath+0.112
Neg logit contributions
aw-0.194
 living-0.179
 that-0.173
It-0.160
 all-0.157
Token.
Token ID13
Feature activation-7.74e-01

Pos logit contributions
sing+0.004
 spine+0.004
 argue+0.004
 breath+0.004
 whine+0.003
Neg logit contributions
It-0.004
aw-0.004
That-0.004
 living-0.004
ay-0.003
Token He
Token ID679
Feature activation-1.91e+00

Pos logit contributions
 living+0.010
aw+0.010
It+0.010
 that+0.009
 all+0.008
Neg logit contributions
 argue-0.022
sing-0.022
 spine-0.021
 whine-0.020
 breath-0.020
Token thought
Token ID1807
Feature activation-2.72e+00

Pos logit contributions
 living+0.028
aw+0.027
It+0.026
 that+0.025
 all+0.022
Neg logit contributions
 argue-0.052
sing-0.051
 spine-0.048
 breath-0.047
 whine-0.047
Token he
Token ID339
Feature activation-2.57e+00

Pos logit contributions
aw+0.039
 living+0.039
It+0.038
 that+0.033
That+0.032
Neg logit contributions
 argue-0.073
sing-0.072
 spine-0.069
 breath-0.068
 whine-0.066
Token slipped
Token ID18859
Feature activation+1.10e+00

Pos logit contributions
 living+0.029
It+0.027
aw+0.027
 that+0.025
 biggest+0.024
Neg logit contributions
 argue-0.051
sing-0.049
 breath-0.046
 whine-0.046
 spine-0.046
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
 argue+0.032
sing+0.032
 spine+0.030
 breath+0.029
 whine+0.029
Neg logit contributions
 living-0.014
aw-0.014
It-0.013
 that-0.013
 all-0.011
Token fell
Token ID3214
Feature activation+7.15e-01

Pos logit contributions
 argue+0.031
sing+0.030
 spine+0.028
 whine+0.027
 breath+0.027
Neg logit contributions
 living-0.031
aw-0.031
It-0.030
 that-0.030
 all-0.028
Token into
Token ID656
Feature activation+3.54e-01

Pos logit contributions
 argue+0.017
sing+0.017
 spine+0.016
 breath+0.016
 whine+0.016
Neg logit contributions
aw-0.012
 living-0.012
It-0.012
 that-0.011
That-0.010
Token the
Token ID262
Feature activation-6.90e-01

Pos logit contributions
 argue+0.010
sing+0.010
 spine+0.009
 whine+0.009
 breath+0.009
Neg logit contributions
 living-0.005
It-0.005
aw-0.005
 that-0.005
 all-0.004
Token rav
Token ID24343
Feature activation+7.85e+00

Pos logit contributions
aw+0.014
 living+0.013
It+0.013
 that+0.012
hes+0.012
Neg logit contributions
sing-0.014
 argue-0.014
 breath-0.014
 spine-0.014
 whine-0.013
Tokenine
Token ID500
Feature activation-6.27e-02

Pos logit contributions
 argue+0.150
sing+0.139
 whine+0.134
days+0.127
 spine+0.126
Neg logit contributions
aw-0.183
 living-0.163
 that-0.157
It-0.151
 is-0.145
Token below
Token ID2174
Feature activation+3.99e-01

Pos logit contributions
It+0.001
aw+0.001
 living+0.001
 that+0.001
That+0.001
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 breath-0.001
 whine-0.001
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 argue+0.011
sing+0.011
 spine+0.010
 whine+0.010
 breath+0.009
Neg logit contributions
 living-0.006
aw-0.006
 that-0.006
It-0.006
 all-0.006
Token
Token ID198
Feature activation-1.25e+00

Pos logit contributions
 living+0.004
aw+0.004
It+0.004
 that+0.004
 all+0.004
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Token
Token ID198
Feature activation-5.06e-01

Pos logit contributions
 living+0.014
aw+0.013
It+0.013
 that+0.013
 all+0.011
Neg logit contributions
 argue-0.039
sing-0.038
 spine-0.036
 whine-0.035
 breath-0.035
Token the
Token ID262
Feature activation-2.05e+00

Pos logit contributions
 living+0.049
aw+0.048
It+0.044
ay+0.044
ing+0.043
Neg logit contributions
 argue-0.068
 breath-0.067
sing-0.066
 whine-0.065
 resist-0.065
Token same
Token ID976
Feature activation-3.95e-02

Pos logit contributions
aw+0.028
 living+0.028
It+0.027
 that+0.026
 all+0.022
Neg logit contributions
 argue-0.058
sing-0.057
 spine-0.055
 breath-0.053
 whine-0.053
Token urge
Token ID14960
Feature activation-8.69e-02

Pos logit contributions
aw+0.001
It+0.001
 living+0.001
 that+0.001
hes+0.001
Neg logit contributions
sing-0.001
 argue-0.001
 spine-0.001
 breath-0.001
 whine-0.001
Token to
Token ID284
Feature activation+4.01e+00

Pos logit contributions
aw+0.002
 living+0.002
It+0.002
 that+0.001
 brand+0.001
Neg logit contributions
 argue-0.002
sing-0.002
 spine-0.002
 whine-0.002
 breath-0.002
Token explore
Token ID7301
Feature activation-3.52e-01

Pos logit contributions
 argue+0.082
sing+0.082
 spine+0.077
 whine+0.072
 breath+0.071
Neg logit contributions
 living-0.086
 that-0.083
It-0.083
aw-0.082
 all-0.079
Token Ur
Token ID8799
Feature activation+7.84e+00

Pos logit contributions
It+0.006
aw+0.006
 living+0.006
hes+0.006
That+0.005
Neg logit contributions
 argue-0.008
sing-0.008
 breath-0.007
 spine-0.007
 whine-0.007
Tokenge
Token ID469
Feature activation+1.42e-01

Pos logit contributions
sing+0.097
 argue+0.083
 spine+0.082
 whine+0.080
 breath+0.078
Neg logit contributions
 living-0.225
 that-0.224
aw-0.224
ing-0.217
It-0.215
Token Land
Token ID6379
Feature activation-6.54e-01

Pos logit contributions
 argue+0.003
sing+0.003
 spine+0.003
 breath+0.003
 whine+0.003
Neg logit contributions
It-0.003
 living-0.003
aw-0.003
 that-0.002
That-0.002
Token everyday
Token ID10908
Feature activation+1.61e+00

Pos logit contributions
 living+0.012
It+0.012
aw+0.012
 that+0.011
That+0.010
Neg logit contributions
 argue-0.015
sing-0.015
 spine-0.014
 breath-0.014
 whine-0.013
Token,
Token ID11
Feature activation+2.12e+00

Pos logit contributions
 argue+0.037
sing+0.036
 spine+0.034
 breath+0.033
 whine+0.033
Neg logit contributions
 living-0.029
aw-0.029
It-0.029
 that-0.027
That-0.025
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
 argue+0.051
sing+0.050
 spine+0.048
 breath+0.046
 whine+0.045
Neg logit contributions
aw-0.037
It-0.034
 living-0.034
hes-0.031
ay-0.030
Token
Token ID198
Feature activation-3.95e-01

Pos logit contributions
 living+0.011
aw+0.011
 that+0.010
It+0.010
 all+0.009
Neg logit contributions
 argue-0.027
sing-0.026
 spine-0.025
 whine-0.025
 breath-0.025
TokenM
Token ID44
Feature activation+3.78e-01

Pos logit contributions
 living+0.006
aw+0.006
 that+0.006
It+0.006
 all+0.005
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Tokenummy
Token ID13513
Feature activation-1.41e+00

Pos logit contributions
 argue+0.007
sing+0.007
 spine+0.006
 whine+0.006
 breath+0.006
Neg logit contributions
aw-0.009
 living-0.009
It-0.009
 that-0.009
ay-0.008
Token watched
Token ID7342
Feature activation-1.27e+00

Pos logit contributions
 living+0.023
aw+0.022
It+0.022
 that+0.021
 biggest+0.019
Neg logit contributions
 argue-0.036
sing-0.034
 whine-0.032
 spine-0.032
 breath-0.032
Token him
Token ID683
Feature activation+3.14e+00

Pos logit contributions
 that+0.021
 living+0.020
 all+0.019
It+0.018
aw+0.018
Neg logit contributions
 argue-0.035
sing-0.033
 spine-0.032
 whine-0.031
days-0.030
Token anx
Token ID7296
Feature activation+7.71e+00

Pos logit contributions
 argue+0.076
sing+0.074
 spine+0.071
 whine+0.068
 breath+0.067
Neg logit contributions
 living-0.056
 that-0.054
It-0.052
aw-0.052
 all-0.050
Tokeniously
Token ID6819
Feature activation+3.08e+00

Pos logit contributions
days+0.138
 drown+0.130
 argue+0.124
unny+0.118
sing+0.116
Neg logit contributions
 that-0.176
aw-0.170
 all-0.146
 so-0.143
 living-0.142
Token as
Token ID355
Feature activation+7.00e-01

Pos logit contributions
 argue+0.076
sing+0.074
 spine+0.070
 whine+0.068
 breath+0.067
Neg logit contributions
 living-0.054
aw-0.052
It-0.052
 that-0.051
 all-0.046
Token Ali
Token ID12104
Feature activation+2.02e+00

Pos logit contributions
 argue+0.018
sing+0.017
 spine+0.016
 whine+0.016
 breath+0.015
Neg logit contributions
 living-0.012
 that-0.012
It-0.012
aw-0.011
 all-0.011
Token waited
Token ID13488
Feature activation-2.31e-01

Pos logit contributions
 argue+0.050
sing+0.049
 spine+0.046
 whine+0.045
 breath+0.045
Neg logit contributions
 living-0.035
aw-0.034
It-0.034
 that-0.033
 all-0.030
Token for
Token ID329
Feature activation-2.16e-01

Pos logit contributions
 living+0.004
aw+0.004
It+0.003
 that+0.003
 all+0.003
Neg logit contributions
 argue-0.006
sing-0.006
 spine-0.006
 whine-0.005
 breath-0.005
Token is
Token ID318
Feature activation-2.25e+00

Pos logit contributions
 living+0.023
aw+0.022
It+0.022
 that+0.018
 brand+0.017
Neg logit contributions
 argue-0.081
sing-0.078
 whine-0.074
 spine-0.074
 breath-0.073
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
aw+0.038
It+0.036
 living+0.035
That+0.032
 that+0.031
Neg logit contributions
 argue-0.057
sing-0.055
 whine-0.051
 breath-0.051
 spine-0.050
Token always
Token ID1464
Feature activation+2.79e+00

Pos logit contributions
 argue+0.044
sing+0.044
 spine+0.041
 whine+0.039
 breath+0.039
Neg logit contributions
 living-0.045
aw-0.044
It-0.044
 that-0.043
 all-0.040
Token shout
Token ID9191
Feature activation-3.46e-01

Pos logit contributions
 argue+0.063
sing+0.061
 spine+0.057
 whine+0.056
 breath+0.055
Neg logit contributions
 living-0.054
aw-0.054
It-0.053
 that-0.051
 all-0.047
Token '
Token ID705
Feature activation+1.65e+00

Pos logit contributions
 living+0.005
 that+0.005
aw+0.005
It+0.005
 all+0.004
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Tokenfore
Token ID754
Feature activation+7.69e+00

Pos logit contributions
 argue+0.039
sing+0.038
 spine+0.036
 whine+0.035
 breath+0.034
Neg logit contributions
 living-0.031
aw-0.030
It-0.029
 that-0.029
 all-0.026
Token'
Token ID6
Feature activation+8.04e-01

Pos logit contributions
sing+0.212
 spine+0.202
 argue+0.198
 breath+0.190
 whine+0.189
Neg logit contributions
 that-0.105
 living-0.103
It-0.102
aw-0.101
 all-0.090
Token when
Token ID618
Feature activation+1.69e+00

Pos logit contributions
 argue+0.020
sing+0.020
 spine+0.018
 breath+0.018
 whine+0.018
Neg logit contributions
 living-0.014
aw-0.014
It-0.013
 that-0.013
 all-0.012
Token you
Token ID345
Feature activation+1.75e+00

Pos logit contributions
 argue+0.045
sing+0.045
 spine+0.042
 whine+0.041
 breath+0.041
Neg logit contributions
 living-0.026
aw-0.025
It-0.025
 that-0.024
 all-0.022
Token hit
Token ID2277
Feature activation-6.23e-01

Pos logit contributions
 argue+0.043
sing+0.042
 spine+0.040
 whine+0.038
 breath+0.038
Neg logit contributions
 living-0.031
aw-0.030
It-0.029
 that-0.029
 all-0.027
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
 living+0.010
aw+0.009
It+0.009
 that+0.009
 all+0.008
Neg logit contributions
 argue-0.016
sing-0.016
 spine-0.015
 breath-0.015
 whine-0.015
Token to
Token ID284
Feature activation+4.36e+00

Pos logit contributions
 argue+0.035
sing+0.035
 spine+0.032
 whine+0.032
 breath+0.031
Neg logit contributions
 living-0.024
 that-0.024
It-0.024
aw-0.024
 all-0.022
Token never
Token ID1239
Feature activation+5.23e+00

Pos logit contributions
sing+0.099
 argue+0.096
 spine+0.095
 whine+0.087
days+0.087
Neg logit contributions
 living-0.083
 that-0.083
It-0.080
 all-0.079
aw-0.078
Token try
Token ID1949
Feature activation+1.60e+00

Pos logit contributions
sing+0.115
 spine+0.108
 argue+0.107
 breath+0.102
 whine+0.099
Neg logit contributions
 living-0.104
 that-0.104
It-0.098
 all-0.095
aw-0.093
Token to
Token ID284
Feature activation+6.36e+00

Pos logit contributions
 argue+0.041
sing+0.041
 spine+0.039
 whine+0.037
 breath+0.037
Neg logit contributions
 living-0.027
 that-0.026
aw-0.026
It-0.025
 all-0.024
Token catch
Token ID4929
Feature activation-8.90e-01

Pos logit contributions
sing+0.140
 spine+0.135
 argue+0.125
 whine+0.121
days+0.115
Neg logit contributions
 that-0.135
 all-0.129
It-0.127
 living-0.125
aw-0.118
Token Twe
Token ID24205
Feature activation+7.68e+00

Pos logit contributions
 living+0.024
aw+0.023
It+0.023
 that+0.021
 brand+0.021
Neg logit contributions
 argue-0.014
sing-0.012
 breath-0.011
 whine-0.011
 spine-0.010
Tokenety
Token ID2963
Feature activation+3.64e+00

Pos logit contributions
 argue+0.134
sing+0.132
 whine+0.122
 spine+0.121
 breath+0.120
Neg logit contributions
aw-0.197
It-0.188
 living-0.185
 that-0.176
ing-0.172
Token again
Token ID757
Feature activation+4.49e+00

Pos logit contributions
sing+0.102
 argue+0.099
 spine+0.097
 whine+0.094
 breath+0.092
Neg logit contributions
 that-0.056
 living-0.053
aw-0.052
 all-0.050
It-0.046
Token.
Token ID13
Feature activation+6.22e-01

Pos logit contributions
sing+0.120
 argue+0.119
 spine+0.113
 whine+0.110
 breath+0.108
Neg logit contributions
 that-0.070
 living-0.069
aw-0.067
It-0.065
 all-0.063
Token From
Token ID3574
Feature activation-1.19e+00

Pos logit contributions
 argue+0.015
sing+0.015
 spine+0.014
 whine+0.013
 breath+0.013
Neg logit contributions
 living-0.011
aw-0.011
It-0.010
 that-0.009
hes-0.009
Token that
Token ID326
Feature activation-8.76e-01

Pos logit contributions
 living+0.008
It+0.006
aw+0.006
ay+0.004
hes+0.004
Neg logit contributions
sing-0.041
 argue-0.040
 breath-0.039
 spine-0.038
cker-0.038
Token Lucy
Token ID22162
Feature activation+1.56e+00

Pos logit contributions
 argue+0.021
sing+0.020
 spine+0.019
 whine+0.018
 breath+0.018
Neg logit contributions
 that-0.018
 living-0.018
It-0.018
 all-0.017
aw-0.017
Token noticed
Token ID6810
Feature activation-3.43e+00

Pos logit contributions
 argue+0.046
sing+0.046
 spine+0.043
 breath+0.042
 whine+0.042
Neg logit contributions
 living-0.020
aw-0.020
It-0.019
 that-0.018
 all-0.016
Token the
Token ID262
Feature activation-2.17e+00

Pos logit contributions
 living+0.045
aw+0.044
It+0.043
ay+0.039
 biggest+0.037
Neg logit contributions
 argue-0.089
sing-0.088
 breath-0.083
 whine-0.082
 spine-0.081
Token weather
Token ID6193
Feature activation+6.85e-01

Pos logit contributions
aw+0.027
It+0.026
 living+0.026
 that+0.025
That+0.021
Neg logit contributions
sing-0.062
 argue-0.062
 spine-0.059
 whine-0.057
 breath-0.057
Token was
Token ID373
Feature activation-1.81e+00

Pos logit contributions
 argue+0.016
sing+0.016
 whine+0.015
 spine+0.015
days+0.015
Neg logit contributions
aw-0.011
 living-0.010
It-0.010
 brand-0.010
 biggest-0.010
Token uncom
Token ID8820
Feature activation+7.62e+00

Pos logit contributions
aw+0.032
 living+0.028
It+0.027
 that+0.026
ay+0.024
Neg logit contributions
 argue-0.045
sing-0.045
 whine-0.042
 breath-0.041
 spine-0.041
Tokenfort
Token ID3319
Feature activation+3.10e+00

Pos logit contributions
sing+0.153
 argue+0.149
 breath+0.143
 spine+0.142
 whine+0.140
Neg logit contributions
aw-0.157
It-0.144
 that-0.142
 living-0.133
 brand-0.130
Tokenably
Token ID1346
Feature activation+2.36e+00

Pos logit contributions
 argue+0.065
sing+0.062
 whine+0.058
 spine+0.057
 breath+0.056
Neg logit contributions
 that-0.065
aw-0.065
 living-0.065
 all-0.061
It-0.061
Token hot
Token ID3024
Feature activation+1.72e+00

Pos logit contributions
 argue+0.063
sing+0.061
 spine+0.058
 breath+0.057
 whine+0.057
Neg logit contributions
aw-0.036
 living-0.035
It-0.034
 that-0.031
That-0.029
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
 argue+0.032
sing+0.031
 spine+0.031
 breath+0.029
 whine+0.029
Neg logit contributions
It-0.036
aw-0.034
 brand-0.034
 living-0.033
That-0.032
Token She
Token ID1375
Feature activation-1.72e+00

Pos logit contributions
aw+0.001
 living+0.001
It+0.001
 that+0.001
 all+0.001
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.003
 breath-0.003
Token was
Token ID373
Feature activation-9.22e-01

Pos logit contributions
 argue+0.096
sing+0.094
 spine+0.089
 whine+0.088
 breath+0.087
Neg logit contributions
 living-0.042
aw-0.041
It-0.040
 that-0.038
 all-0.034
Token always
Token ID1464
Feature activation-1.29e-01

Pos logit contributions
aw+0.016
It+0.015
 that+0.014
That+0.014
 living+0.014
Neg logit contributions
sing-0.023
 argue-0.022
 breath-0.021
 whine-0.020
 spine-0.020
Token excited
Token ID6568
Feature activation+1.91e+00

Pos logit contributions
aw+0.002
It+0.002
 that+0.002
 living+0.002
That+0.002
Neg logit contributions
sing-0.003
 argue-0.003
 breath-0.003
 spine-0.003
 whine-0.003
Token for
Token ID329
Feature activation+2.04e+00

Pos logit contributions
 argue+0.045
sing+0.044
 spine+0.042
 breath+0.040
 whine+0.040
Neg logit contributions
It-0.034
aw-0.034
 living-0.034
 that-0.031
That-0.029
Token Wed
Token ID3300
Feature activation+4.24e+00

Pos logit contributions
 argue+0.052
sing+0.050
 spine+0.047
 breath+0.046
 whine+0.046
Neg logit contributions
 living-0.034
aw-0.034
It-0.033
 that-0.031
 all-0.028
Tokennes
Token ID2516
Feature activation+7.60e+00

Pos logit contributions
sing+0.063
 argue+0.062
 breath+0.060
 spine+0.057
 whine+0.056
Neg logit contributions
It-0.113
 living-0.109
 that-0.108
aw-0.105
 all-0.102
Tokendays
Token ID12545
Feature activation+1.36e+00

Pos logit contributions
sing+0.079
 argue+0.077
 breath+0.075
 spine+0.072
 whine+0.066
Neg logit contributions
 living-0.238
 that-0.233
 all-0.221
 of-0.215
 so-0.215
Token because
Token ID780
Feature activation+9.68e-01

Pos logit contributions
 argue+0.028
sing+0.027
 spine+0.026
days+0.026
 whine+0.025
Neg logit contributions
It-0.027
aw-0.027
 living-0.026
hes-0.023
That-0.023
Token it
Token ID340
Feature activation-4.81e+00

Pos logit contributions
 argue+0.020
days+0.019
 breath+0.019
sing+0.018
 whine+0.018
Neg logit contributions
aw-0.020
It-0.017
 living-0.017
hes-0.016
br-0.015
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
aw+0.090
ay+0.068
 living+0.066
It+0.064
hes+0.062
Neg logit contributions
 argue-0.106
days-0.103
 seek-0.102
sing-0.099
 whine-0.099
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
aw+0.027
It+0.024
 living+0.022
ing+0.021
That+0.021
Neg logit contributions
 argue-0.036
sing-0.036
 whine-0.034
 breath-0.033
days-0.032
Token are
Token ID389
Feature activation-1.34e+00

Pos logit contributions
 living+0.036
aw+0.036
It+0.034
 that+0.033
 all+0.029
Neg logit contributions
 argue-0.083
sing-0.080
 spine-0.075
 whine-0.075
 breath-0.074
Token you
Token ID345
Feature activation+2.54e+00

Pos logit contributions
 living+0.022
aw+0.021
It+0.021
 that+0.020
 all+0.018
Neg logit contributions
 argue-0.034
sing-0.033
 spine-0.031
 breath-0.031
 whine-0.030
Token?"
Token ID1701
Feature activation+2.25e+00

Pos logit contributions
 argue+0.069
sing+0.067
 spine+0.063
 breath+0.062
 whine+0.062
Neg logit contributions
aw-0.038
 living-0.038
It-0.037
 that-0.036
 all-0.031
Token she
Token ID673
Feature activation+5.60e-01

Pos logit contributions
 argue+0.069
sing+0.069
 spine+0.064
days+0.062
 breath+0.062
Neg logit contributions
 that-0.029
 living-0.027
It-0.027
aw-0.024
 all-0.024
Token shouted
Token ID17293
Feature activation-1.41e+00

Pos logit contributions
sing+0.017
 spine+0.017
 argue+0.017
days+0.016
 breath+0.015
Neg logit contributions
It-0.007
 that-0.007
 living-0.007
 all-0.006
aw-0.006
Token anx
Token ID7296
Feature activation+7.60e+00

Pos logit contributions
 that+0.017
 all+0.016
 living+0.014
 so+0.012
 about+0.011
Neg logit contributions
 argue-0.045
ifi-0.045
cker-0.044
days-0.044
sing-0.044
Tokeniously
Token ID6819
Feature activation+3.24e+00

Pos logit contributions
days+0.140
 argue+0.119
 drown+0.117
packs+0.115
 seek+0.112
Neg logit contributions
 that-0.191
aw-0.170
 all-0.156
 so-0.150
 living-0.140
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
sing+0.095
 argue+0.094
days+0.090
ifi+0.089
 spine+0.089
Neg logit contributions
 that-0.043
 living-0.042
 all-0.038
 so-0.031
aw-0.031
Token
Token ID220
Feature activation-2.76e-01

Pos logit contributions
 argue+0.007
sing+0.007
 breath+0.007
 spine+0.007
 whine+0.007
Neg logit contributions
 living-0.003
It-0.002
 that-0.002
aw-0.002
 all-0.002
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
 living+0.003
aw+0.003
It+0.003
 that+0.003
 all+0.002
Neg logit contributions
 argue-0.008
sing-0.008
 spine-0.008
 whine-0.008
 breath-0.008
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 living+0.005
It+0.005
 that+0.005
aw+0.005
 all+0.004
Neg logit contributions
 argue-0.019
sing-0.019
 spine-0.018
 whine-0.018
 breath-0.017
Token in
Token ID287
Feature activation+1.81e+00

Pos logit contributions
sing+0.027
 argue+0.026
 drown+0.026
 spine+0.026
 whine+0.025
Neg logit contributions
 all-0.010
 living-0.010
 that-0.009
 many-0.007
 on-0.007
Token the
Token ID262
Feature activation+3.31e-01

Pos logit contributions
sing+0.053
 argue+0.052
 spine+0.050
 whine+0.048
cker+0.046
Neg logit contributions
 living-0.025
 that-0.023
It-0.023
aw-0.021
 all-0.021
Token park
Token ID3952
Feature activation+1.86e+00

Pos logit contributions
 argue+0.009
sing+0.009
 spine+0.008
 whine+0.008
 breath+0.008
Neg logit contributions
aw-0.005
It-0.004
hes-0.004
 living-0.004
 that-0.004
Token on
Token ID319
Feature activation+2.65e+00

Pos logit contributions
 argue+0.053
sing+0.052
 spine+0.050
 whine+0.049
 breath+0.048
Neg logit contributions
 living-0.028
 that-0.027
aw-0.025
 all-0.024
It-0.023
Token a
Token ID257
Feature activation+1.97e+00

Pos logit contributions
 argue+0.075
sing+0.073
 spine+0.070
 whine+0.068
 breath+0.068
Neg logit contributions
 living-0.037
aw-0.036
It-0.035
 that-0.034
 all-0.031
Token bree
Token ID20780
Feature activation+7.49e+00

Pos logit contributions
 argue+0.053
sing+0.051
 spine+0.048
 whine+0.048
 breath+0.048
Neg logit contributions
aw-0.029
 living-0.029
It-0.028
 that-0.027
 all-0.024
Tokenzy
Token ID7357
Feature activation+4.61e+00

Pos logit contributions
sing+0.144
 spine+0.137
 argue+0.128
days+0.127
irds+0.126
Neg logit contributions
 that-0.173
 living-0.152
aw-0.150
 all-0.144
It-0.142
Token day
Token ID1110
Feature activation+4.61e+00

Pos logit contributions
 argue+0.123
sing+0.120
 spine+0.114
 whine+0.111
 breath+0.110
Neg logit contributions
 living-0.072
aw-0.070
It-0.068
 that-0.067
 all-0.060
Token.
Token ID13
Feature activation+7.46e-02

Pos logit contributions
 argue+0.115
sing+0.112
 spine+0.106
 whine+0.103
 breath+0.102
Neg logit contributions
 living-0.080
aw-0.078
It-0.077
 that-0.075
 all-0.069
Token Mary
Token ID5335
Feature activation+8.74e-01

Pos logit contributions
 argue+0.001
sing+0.001
 spine+0.001
 whine+0.001
 breath+0.001
Neg logit contributions
 living-0.002
aw-0.002
It-0.002
 that-0.002
 brand-0.001
Token was
Token ID373
Feature activation-1.63e+00

Pos logit contributions
 argue+0.024
sing+0.024
 spine+0.022
 whine+0.022
 breath+0.022
Neg logit contributions
 living-0.011
aw-0.011
It-0.011
 that-0.011
 brand-0.009
Token dare
Token ID16498
Feature activation+2.81e+00

Pos logit contributions
sing+0.083
 argue+0.076
 spine+0.074
 breath+0.069
 whine+0.068
Neg logit contributions
It-0.082
 that-0.081
 living-0.079
 all-0.078
aw-0.077
Token to
Token ID284
Feature activation+5.30e+00

Pos logit contributions
sing+0.079
days+0.075
 argue+0.075
 spine+0.073
 whine+0.070
Neg logit contributions
 that-0.043
 all-0.040
 living-0.037
It-0.037
aw-0.036
Token climb
Token ID12080
Feature activation+1.27e+00

Pos logit contributions
sing+0.129
 spine+0.125
days+0.117
 argue+0.111
cle+0.110
Neg logit contributions
 that-0.106
 all-0.102
It-0.095
 living-0.091
 big-0.089
Token up
Token ID510
Feature activation+1.70e+00

Pos logit contributions
 argue+0.038
sing+0.038
 spine+0.037
 whine+0.036
days+0.035
Neg logit contributions
 that-0.016
 living-0.015
 all-0.015
aw-0.014
It-0.014
Token to
Token ID284
Feature activation+3.12e+00

Pos logit contributions
sing+0.053
 argue+0.051
 spine+0.050
 whine+0.048
days+0.048
Neg logit contributions
 that-0.023
 all-0.021
 living-0.019
It-0.019
aw-0.018
Token Twe
Token ID24205
Feature activation+7.48e+00

Pos logit contributions
sing+0.084
 argue+0.083
 spine+0.081
 whine+0.077
days+0.075
Neg logit contributions
 that-0.049
 living-0.048
It-0.046
 all-0.045
aw-0.044
Tokenety
Token ID2963
Feature activation+4.34e+00

Pos logit contributions
 argue+0.126
sing+0.123
 breath+0.112
 whine+0.111
 spine+0.111
Neg logit contributions
aw-0.197
 living-0.188
It-0.188
 that-0.176
ing-0.167
Token's
Token ID338
Feature activation+1.76e+00

Pos logit contributions
sing+0.120
 argue+0.119
 spine+0.112
 whine+0.111
 breath+0.109
Neg logit contributions
 living-0.065
 that-0.064
aw-0.061
 all-0.058
It-0.057
Token nest
Token ID16343
Feature activation+5.98e-01

Pos logit contributions
 argue+0.042
sing+0.040
 whine+0.039
days+0.038
 spine+0.038
Neg logit contributions
 living-0.033
 that-0.031
 all-0.030
aw-0.028
 best-0.028
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
 argue+0.015
sing+0.015
 spine+0.014
 whine+0.014
 breath+0.013
Neg logit contributions
 living-0.010
aw-0.010
 that-0.010
It-0.010
 all-0.009
Token
Token ID198
Feature activation-9.17e-01

Pos logit contributions
 argue+0.025
sing+0.023
 spine+0.022
 whine+0.022
 breath+0.021
Neg logit contributions
 living-0.015
aw-0.014
It-0.013
 that-0.012
hes-0.012
Token don
Token ID836
Feature activation+2.97e+00

Pos logit contributions
 argue+0.021
sing+0.020
 spine+0.019
 whine+0.019
 breath+0.019
Neg logit contributions
 living-0.016
aw-0.016
It-0.016
 that-0.015
 all-0.014
Token't
Token ID470
Feature activation+4.03e+00

Pos logit contributions
 argue+0.056
sing+0.046
 breath+0.046
 whine+0.044
 drown+0.043
Neg logit contributions
aw-0.070
 living-0.069
 brand-0.061
 prett-0.060
 biggest-0.060
Token think
Token ID892
Feature activation-3.76e-02

Pos logit contributions
sing+0.081
 argue+0.080
 spine+0.075
 whine+0.070
 breath+0.070
Neg logit contributions
 living-0.087
It-0.086
 that-0.086
aw-0.085
 all-0.080
Token you
Token ID345
Feature activation+9.30e-01

Pos logit contributions
 living+0.001
aw+0.001
It+0.001
That+0.000
 brand+0.000
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 breath-0.001
 whine-0.001
Token can
Token ID460
Feature activation+4.27e+00

Pos logit contributions
 argue+0.025
sing+0.025
 spine+0.024
 whine+0.023
 breath+0.023
Neg logit contributions
 living-0.014
aw-0.013
It-0.013
 that-0.013
 all-0.012
Token ever
Token ID1683
Feature activation+7.41e+00

Pos logit contributions
 argue+0.089
sing+0.088
 spine+0.082
 whine+0.079
 breath+0.078
Neg logit contributions
 living-0.089
aw-0.087
It-0.087
 that-0.086
 all-0.080
Token take
Token ID1011
Feature activation-1.03e+00

Pos logit contributions
sing+0.139
 argue+0.135
 spine+0.128
 whine+0.122
 breath+0.119
Neg logit contributions
 living-0.167
 that-0.166
It-0.165
aw-0.161
 all-0.158
Token it
Token ID340
Feature activation+7.59e-01

Pos logit contributions
 living+0.018
It+0.017
aw+0.016
That+0.015
 that+0.014
Neg logit contributions
 argue-0.026
 breath-0.024
sing-0.024
 spine-0.023
 whine-0.023
Token off
Token ID572
Feature activation+3.48e+00

Pos logit contributions
 argue+0.019
sing+0.019
 spine+0.017
 whine+0.017
 breath+0.017
Neg logit contributions
 living-0.013
 that-0.013
aw-0.013
It-0.012
 all-0.012
Token!"
Token ID2474
Feature activation+8.86e-01

Pos logit contributions
 argue+0.081
sing+0.079
 spine+0.075
 whine+0.072
 breath+0.072
Neg logit contributions
 living-0.064
aw-0.063
It-0.062
 that-0.060
 all-0.055
Token
Token ID198
Feature activation-6.94e-01

Pos logit contributions
 argue+0.023
sing+0.022
 spine+0.021
 whine+0.020
 breath+0.020
Neg logit contributions
 living-0.015
aw-0.014
It-0.014
 that-0.014
 all-0.013
Token to
Token ID284
Feature activation+5.49e+00

Pos logit contributions
 living+0.028
aw+0.028
It+0.027
 that+0.026
 all+0.024
Neg logit contributions
 argue-0.038
sing-0.037
 spine-0.035
 whine-0.034
 breath-0.034
Token give
Token ID1577
Feature activation-2.09e+00

Pos logit contributions
sing+0.131
 spine+0.129
irds+0.121
cle+0.116
packs+0.112
Neg logit contributions
 that-0.122
 all-0.120
 so-0.098
 made-0.098
 big-0.096
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 that+0.027
 all+0.026
It+0.026
 living+0.025
aw+0.025
Neg logit contributions
sing-0.064
 argue-0.062
 spine-0.062
 whine-0.058
days-0.057
Token sheet
Token ID9629
Feature activation-8.00e-01

Pos logit contributions
 living+0.022
It+0.021
 that+0.021
aw+0.021
 all+0.020
Neg logit contributions
 argue-0.026
sing-0.025
 spine-0.024
 whine-0.023
 breath-0.022
Token to
Token ID284
Feature activation+2.86e+00

Pos logit contributions
 living+0.012
 that+0.012
It+0.011
aw+0.011
 all+0.011
Neg logit contributions
 argue-0.022
sing-0.021
 spine-0.020
 whine-0.020
 breath-0.019
Token Nem
Token ID22547
Feature activation+7.40e+00

Pos logit contributions
 argue+0.063
sing+0.062
 spine+0.059
 whine+0.056
 breath+0.054
Neg logit contributions
 that-0.058
 living-0.057
It-0.057
aw-0.056
 all-0.055
Tokeno
Token ID78
Feature activation+3.36e+00

Pos logit contributions
 argue+0.184
sing+0.178
 spine+0.170
 whine+0.170
 breath+0.169
Neg logit contributions
 living-0.126
 that-0.123
It-0.121
aw-0.121
ing-0.112
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
sing+0.097
 argue+0.093
 spine+0.092
 whine+0.088
 breath+0.086
Neg logit contributions
 that-0.050
 living-0.045
 all-0.044
aw-0.042
 is-0.038
Token D
Token ID360
Feature activation+1.94e+00

Pos logit contributions
 argue+0.044
sing+0.044
 spine+0.041
 whine+0.039
 breath+0.039
Neg logit contributions
 living-0.042
aw-0.041
 that-0.041
It-0.041
 all-0.038
Tokenory
Token ID652
Feature activation+2.96e-01

Pos logit contributions
sing+0.036
 argue+0.035
 spine+0.031
 whine+0.031
 breath+0.030
Neg logit contributions
aw-0.048
 living-0.047
It-0.047
 that-0.046
 all-0.044
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
sing+0.009
 argue+0.008
 spine+0.008
 whine+0.008
 breath+0.008
Neg logit contributions
 that-0.004
 living-0.004
 all-0.004
aw-0.004
It-0.004
Token I
Token ID314
Feature activation+6.51e-01

Pos logit contributions
 argue+0.007
sing+0.007
 spine+0.006
 breath+0.006
 whine+0.006
Neg logit contributions
 living-0.004
It-0.004
aw-0.004
 that-0.003
 all-0.003
Token promise
Token ID6991
Feature activation+5.64e-01

Pos logit contributions
 argue+0.017
sing+0.016
 spine+0.016
 breath+0.015
 whine+0.015
Neg logit contributions
 living-0.011
It-0.010
aw-0.010
 that-0.010
 all-0.009
Token you
Token ID345
Feature activation+2.17e+00

Pos logit contributions
 argue+0.015
sing+0.015
 spine+0.014
 whine+0.014
 breath+0.014
Neg logit contributions
 that-0.009
 living-0.009
It-0.008
aw-0.008
 all-0.008
Token won
Token ID1839
Feature activation+2.67e+00

Pos logit contributions
 argue+0.061
sing+0.060
 spine+0.057
 whine+0.055
 breath+0.055
Neg logit contributions
 living-0.030
aw-0.029
It-0.029
 that-0.028
 all-0.025
Token't
Token ID470
Feature activation+5.55e+00

Pos logit contributions
 argue+0.066
sing+0.063
 spine+0.059
 whine+0.058
 breath+0.058
Neg logit contributions
 living-0.047
aw-0.047
It-0.045
 that-0.043
 all-0.040
Token ever
Token ID1683
Feature activation+7.32e+00

Pos logit contributions
sing+0.111
 argue+0.109
 spine+0.104
 whine+0.097
 breath+0.097
Neg logit contributions
 living-0.120
 that-0.119
It-0.119
aw-0.117
 all-0.111
Token have
Token ID423
Feature activation-3.96e-01

Pos logit contributions
sing+0.130
 argue+0.123
 spine+0.121
 whine+0.112
 breath+0.111
Neg logit contributions
 that-0.172
 living-0.170
It-0.168
aw-0.167
 all-0.161
Token to
Token ID284
Feature activation+5.81e+00

Pos logit contributions
 living+0.006
aw+0.006
It+0.005
 that+0.005
 all+0.005
Neg logit contributions
 argue-0.011
sing-0.010
 breath-0.010
 whine-0.010
 spine-0.010
Token be
Token ID307
Feature activation+1.73e+00

Pos logit contributions
sing+0.120
 spine+0.114
 argue+0.111
 whine+0.101
 breath+0.101
Neg logit contributions
 that-0.126
 living-0.123
It-0.121
aw-0.118
 all-0.117
Token scared
Token ID12008
Feature activation+2.65e+00

Pos logit contributions
 argue+0.043
sing+0.041
 breath+0.039
 whine+0.039
 spine+0.038
Neg logit contributions
 living-0.029
aw-0.028
It-0.028
 that-0.026
That-0.025
Token again
Token ID757
Feature activation+4.44e+00

Pos logit contributions
 argue+0.055
sing+0.052
 spine+0.049
 breath+0.048
 whine+0.048
Neg logit contributions
 living-0.054
aw-0.054
It-0.053
 that-0.050
 brand-0.049
Token a
Token ID257
Feature activation-2.38e+00

Pos logit contributions
aw+0.033
 living+0.033
It+0.032
 that+0.030
That+0.027
Neg logit contributions
 argue-0.058
sing-0.058
 spine-0.054
 whine-0.053
 breath-0.053
Token special
Token ID2041
Feature activation-1.46e+00

Pos logit contributions
aw+0.032
 living+0.032
It+0.031
 that+0.030
 all+0.027
Neg logit contributions
 argue-0.067
sing-0.066
 spine-0.063
 whine-0.061
 breath-0.061
Token hat
Token ID6877
Feature activation-2.68e+00

Pos logit contributions
 living+0.027
aw+0.026
It+0.026
 that+0.026
 all+0.023
Neg logit contributions
 argue-0.035
sing-0.034
 spine-0.032
 whine-0.031
 breath-0.030
Token for
Token ID329
Feature activation+7.38e-01

Pos logit contributions
aw+0.048
It+0.048
 living+0.047
That+0.041
 that+0.041
Neg logit contributions
 argue-0.062
sing-0.061
 spine-0.058
 whine-0.054
 breath-0.054
Token Wed
Token ID3300
Feature activation+4.83e+00

Pos logit contributions
 argue+0.019
sing+0.019
 spine+0.018
 breath+0.017
 whine+0.017
Neg logit contributions
 living-0.011
aw-0.011
It-0.011
 that-0.011
 all-0.009
Tokennes
Token ID2516
Feature activation+7.29e+00

Pos logit contributions
sing+0.066
 argue+0.065
 breath+0.065
 spine+0.060
 whine+0.059
Neg logit contributions
It-0.130
 living-0.128
aw-0.127
 that-0.126
 all-0.119
Tokendays
Token ID12545
Feature activation+2.86e-01

Pos logit contributions
sing+0.088
 argue+0.087
 breath+0.083
 spine+0.082
 whine+0.077
Neg logit contributions
 that-0.211
 living-0.211
 of-0.197
 all-0.196
aw-0.193
Token."
Token ID526
Feature activation+7.97e-01

Pos logit contributions
 argue+0.006
sing+0.006
 spine+0.005
days+0.005
 whine+0.005
Neg logit contributions
aw-0.006
It-0.006
 living-0.006
 that-0.005
That-0.005
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 argue+0.022
sing+0.022
 spine+0.021
 whine+0.021
 breath+0.021
Neg logit contributions
aw-0.010
 living-0.010
It-0.009
 that-0.009
 all-0.008
Token
Token ID198
Feature activation-6.53e-01

Pos logit contributions
aw+0.018
 living+0.017
 that+0.015
It+0.015
 all+0.014
Neg logit contributions
 argue-0.035
sing-0.034
 breath-0.032
 spine-0.032
 whine-0.032
TokenL
Token ID43
Feature activation-4.59e-01

Pos logit contributions
aw+0.014
 living+0.013
 that+0.013
 all+0.012
 biggest+0.012
Neg logit contributions
 argue-0.013
sing-0.011
 seek-0.011
 whine-0.011
days-0.011
Token you
Token ID345
Feature activation+3.96e+00

Pos logit contributions
 argue+0.009
sing+0.009
 breath+0.008
 spine+0.008
 whine+0.008
Neg logit contributions
aw-0.006
It-0.006
 living-0.006
That-0.005
 that-0.005
Token.
Token ID13
Feature activation-7.73e-04

Pos logit contributions
 argue+0.104
sing+0.103
 spine+0.098
 whine+0.096
 breath+0.095
Neg logit contributions
 living-0.063
 that-0.062
aw-0.059
 all-0.058
It-0.055
Token I
Token ID314
Feature activation+7.04e-01

Pos logit contributions
aw+0.000
 living+0.000
It+0.000
 that+0.000
 all+0.000
Neg logit contributions
 argue-0.000
sing-0.000
 spine-0.000
 whine-0.000
 breath-0.000
Token won
Token ID1839
Feature activation+1.41e+00

Pos logit contributions
 argue+0.018
sing+0.018
 spine+0.017
 whine+0.017
 breath+0.016
Neg logit contributions
 living-0.011
aw-0.011
It-0.011
 that-0.011
 all-0.010
Token't
Token ID470
Feature activation+5.09e+00

Pos logit contributions
 argue+0.034
sing+0.032
 breath+0.030
 spine+0.030
 whine+0.030
Neg logit contributions
aw-0.026
 living-0.026
It-0.024
 that-0.023
 all-0.021
Token ever
Token ID1683
Feature activation+7.28e+00

Pos logit contributions
sing+0.113
 argue+0.109
 spine+0.106
 whine+0.100
 breath+0.098
Neg logit contributions
 that-0.101
It-0.100
 living-0.099
aw-0.096
 all-0.094
Token do
Token ID466
Feature activation+1.16e+00

Pos logit contributions
sing+0.143
 argue+0.136
 spine+0.133
 whine+0.127
 breath+0.121
Neg logit contributions
 that-0.158
 living-0.157
It-0.153
aw-0.153
 all-0.150
Token it
Token ID340
Feature activation+1.18e+00

Pos logit contributions
 argue+0.033
sing+0.031
 spine+0.030
 breath+0.030
 whine+0.029
Neg logit contributions
 living-0.016
aw-0.015
It-0.015
 that-0.013
That-0.012
Token again
Token ID757
Feature activation+4.10e+00

Pos logit contributions
 argue+0.025
sing+0.024
 spine+0.022
 breath+0.022
 whine+0.021
Neg logit contributions
 living-0.025
aw-0.025
It-0.024
 that-0.023
 all-0.021
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
 argue+0.090
sing+0.087
 spine+0.081
 whine+0.079
 breath+0.078
Neg logit contributions
 living-0.083
aw-0.082
It-0.080
 that-0.077
 all-0.072
Token Next
Token ID7406
Feature activation-7.62e-01

Pos logit contributions
aw+0.005
 living+0.005
It+0.005
 that+0.005
 all+0.005
Neg logit contributions
sing-0.008
 argue-0.008
 spine-0.008
 whine-0.007
days-0.007
Token with
Token ID351
Feature activation-1.66e+00

Pos logit contributions
sing+0.044
 argue+0.044
 spine+0.043
 whine+0.042
cle+0.042
Neg logit contributions
 that-0.023
 all-0.022
 living-0.021
It-0.018
aw-0.018
Token his
Token ID465
Feature activation+5.49e-01

Pos logit contributions
 living+0.032
aw+0.031
It+0.030
 that+0.028
That+0.026
Neg logit contributions
 argue-0.039
sing-0.037
 breath-0.036
 spine-0.034
 whine-0.034
Token trunk
Token ID21427
Feature activation-2.35e+00

Pos logit contributions
 argue+0.013
sing+0.013
 spine+0.012
 whine+0.012
 breath+0.012
Neg logit contributions
 living-0.010
aw-0.010
It-0.009
 that-0.009
 all-0.009
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
 living+0.041
aw+0.040
It+0.039
 that+0.039
 all+0.036
Neg logit contributions
 argue-0.058
sing-0.057
 spine-0.053
 whine-0.052
 breath-0.051
Token The
Token ID383
Feature activation-2.36e+00

Pos logit contributions
 argue+0.015
sing+0.014
 spine+0.014
 whine+0.013
 breath+0.013
Neg logit contributions
 living-0.007
aw-0.006
It-0.006
 that-0.006
 all-0.005
Token label
Token ID6167
Feature activation-7.58e+00

Pos logit contributions
aw+0.049
 living+0.045
It+0.044
 that+0.042
hes+0.040
Neg logit contributions
sing-0.051
 argue-0.051
 spine-0.046
 breath-0.045
 whine-0.044
Token fell
Token ID3214
Feature activation+4.75e-01

Pos logit contributions
 living+0.122
aw+0.116
It+0.112
 brand+0.101
 biggest+0.099
Neg logit contributions
 argue-0.191
sing-0.182
 breath-0.175
 whine-0.168
 spine-0.163
Token off
Token ID572
Feature activation+2.00e+00

Pos logit contributions
 argue+0.012
sing+0.012
 spine+0.011
 breath+0.011
 whine+0.011
Neg logit contributions
aw-0.008
 living-0.008
It-0.008
 that-0.007
That-0.007
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
 argue+0.060
sing+0.059
 spine+0.056
 whine+0.055
days+0.053
Neg logit contributions
 living-0.026
 that-0.026
aw-0.024
It-0.024
 all-0.023
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 argue+0.015
sing+0.015
 spine+0.014
 breath+0.014
 whine+0.014
Neg logit contributions
 living-0.009
aw-0.009
It-0.008
 that-0.008
 all-0.007
Token elephant
Token ID20950
Feature activation-2.21e+00

Pos logit contributions
aw+0.025
It+0.024
 living+0.024
 that+0.023
hes+0.021
Neg logit contributions
sing-0.027
 argue-0.027
 spine-0.026
 breath-0.024
 whine-0.024
Token destroy
Token ID4117
Feature activation-1.46e+00

Pos logit contributions
 spine+0.079
sing+0.079
 argue+0.076
days+0.071
 whine+0.070
Neg logit contributions
 that-0.085
 all-0.083
It-0.081
 living-0.080
aw-0.073
Token the
Token ID262
Feature activation-8.88e-01

Pos logit contributions
 living+0.023
aw+0.022
It+0.022
 that+0.020
 brand+0.018
Neg logit contributions
 argue-0.038
sing-0.037
 spine-0.035
 breath-0.035
 whine-0.034
Token card
Token ID2657
Feature activation-2.55e+00

Pos logit contributions
aw+0.015
It+0.015
 living+0.015
 that+0.014
That+0.013
Neg logit contributions
 argue-0.021
sing-0.021
 spine-0.020
 breath-0.019
 whine-0.019
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
 living+0.044
aw+0.043
It+0.043
 that+0.042
 all+0.038
Neg logit contributions
 argue-0.063
sing-0.061
 spine-0.058
 whine-0.056
 breath-0.056
Token The
Token ID383
Feature activation-2.22e+00

Pos logit contributions
 argue+0.011
sing+0.011
 spine+0.010
 breath+0.010
 whine+0.010
Neg logit contributions
It-0.006
 living-0.006
aw-0.006
 that-0.006
 all-0.005
Token card
Token ID2657
Feature activation-7.30e+00

Pos logit contributions
 living+0.035
aw+0.035
It+0.034
 that+0.032
 all+0.029
Neg logit contributions
 argue-0.058
sing-0.057
 spine-0.053
 breath-0.052
 whine-0.052
Token was
Token ID373
Feature activation-2.77e+00

Pos logit contributions
 living+0.091
aw+0.089
It+0.086
 that+0.080
 all+0.070
Neg logit contributions
 argue-0.214
sing-0.209
 spine-0.198
 breath-0.195
 whine-0.194
Token very
Token ID845
Feature activation-1.10e+00

Pos logit contributions
aw+0.046
 living+0.039
It+0.039
That+0.035
ing+0.035
Neg logit contributions
sing-0.075
 argue-0.074
 breath-0.070
 whine-0.066
 spine-0.065
Token scared
Token ID12008
Feature activation+8.91e-01

Pos logit contributions
 living+0.015
aw+0.014
It+0.014
 that+0.014
 all+0.012
Neg logit contributions
 argue-0.032
sing-0.031
 spine-0.030
 whine-0.029
 breath-0.028
Token.
Token ID13
Feature activation+5.93e-01

Pos logit contributions
 argue+0.020
sing+0.020
 breath+0.019
 spine+0.019
 whine+0.018
Neg logit contributions
 living-0.015
aw-0.015
It-0.015
hes-0.014
 biggest-0.014
Token But
Token ID887
Feature activation+2.20e+00

Pos logit contributions
 argue+0.017
sing+0.017
 spine+0.016
 whine+0.016
 breath+0.016
Neg logit contributions
 living-0.008
aw-0.007
It-0.007
 that-0.007
 all-0.006
Token stepped
Token ID10764
Feature activation-5.41e-01

Pos logit contributions
 argue+0.052
sing+0.052
 spine+0.050
 breath+0.048
 whine+0.048
Neg logit contributions
 living-0.021
 that-0.020
It-0.019
aw-0.019
 all-0.017
Token carefully
Token ID7773
Feature activation+2.27e+00

Pos logit contributions
aw+0.009
 living+0.008
It+0.008
 that+0.008
That+0.007
Neg logit contributions
 argue-0.014
sing-0.013
 breath-0.013
 spine-0.013
 whine-0.012
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
 argue+0.061
sing+0.059
 spine+0.057
 breath+0.055
 whine+0.055
Neg logit contributions
aw-0.034
It-0.034
 living-0.034
 that-0.030
That-0.028
Token because
Token ID780
Feature activation+7.50e-01

Pos logit contributions
sing+0.039
 argue+0.039
 spine+0.037
 whine+0.035
 breath+0.035
Neg logit contributions
aw-0.034
 living-0.032
It-0.031
ay-0.030
 that-0.029
Token the
Token ID262
Feature activation-2.51e+00

Pos logit contributions
 argue+0.016
sing+0.015
 breath+0.015
 whine+0.015
 spine+0.015
Neg logit contributions
aw-0.014
 living-0.013
It-0.013
 brand-0.012
 biggest-0.011
Token bottle
Token ID9294
Feature activation-7.30e+00

Pos logit contributions
aw+0.042
It+0.035
 that+0.031
hes+0.031
ing+0.031
Neg logit contributions
 argue-0.068
sing-0.067
 whine-0.064
 spine-0.064
 breath-0.062
Token was
Token ID373
Feature activation-2.16e+00

Pos logit contributions
aw+0.097
 living+0.094
It+0.084
 brand+0.074
 that+0.073
Neg logit contributions
 argue-0.193
sing-0.190
 breath-0.182
 whine-0.178
 spine-0.176
Token very
Token ID845
Feature activation+2.18e-02

Pos logit contributions
aw+0.033
 living+0.029
It+0.029
 that+0.025
That+0.025
Neg logit contributions
 argue-0.058
sing-0.058
 breath-0.055
 whine-0.054
 spine-0.053
Token delicate
Token ID19217
Feature activation-3.59e-01

Pos logit contributions
 argue+0.001
sing+0.001
 spine+0.001
 whine+0.001
 breath+0.001
Neg logit contributions
aw-0.000
 living-0.000
It-0.000
 that-0.000
 all-0.000
Token.
Token ID13
Feature activation+9.72e-01

Pos logit contributions
It+0.007
aw+0.007
 living+0.007
hes+0.006
That+0.006
Neg logit contributions
 argue-0.008
sing-0.007
 spine-0.007
 breath-0.007
 whine-0.007
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
 argue+0.025
sing+0.024
 spine+0.024
 whine+0.024
 breath+0.022
Neg logit contributions
 living-0.015
aw-0.014
 that-0.013
 brand-0.013
It-0.012
Token ground
Token ID2323
Feature activation+4.43e-01

Pos logit contributions
aw+0.006
It+0.006
 living+0.006
 that+0.006
 all+0.005
Neg logit contributions
sing-0.009
 argue-0.009
 spine-0.009
 whine-0.008
 breath-0.008
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
 argue+0.009
sing+0.009
 spine+0.009
 whine+0.008
 breath+0.008
Neg logit contributions
aw-0.009
It-0.009
 living-0.009
 biggest-0.008
 brand-0.008
Token
Token ID198
Feature activation-4.41e-01

Pos logit contributions
 argue+0.000
sing+0.000
 spine+0.000
 whine+0.000
 breath+0.000
Neg logit contributions
 living-0.000
aw-0.000
It-0.000
 that-0.000
 all-0.000
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
 living+0.004
It+0.004
aw+0.004
 that+0.004
 all+0.003
Neg logit contributions
 argue-0.014
sing-0.014
 spine-0.014
 whine-0.013
 breath-0.013
TokenThe
Token ID464
Feature activation-2.45e+00

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.007
 breath+0.007
 whine+0.007
Neg logit contributions
It-0.003
 living-0.003
aw-0.003
 that-0.003
That-0.003
Token label
Token ID6167
Feature activation-7.29e+00

Pos logit contributions
aw+0.043
It+0.041
 living+0.041
 that+0.039
 all+0.037
Neg logit contributions
sing-0.059
 argue-0.059
 spine-0.054
 whine-0.053
 breath-0.053
Token said
Token ID531
Feature activation-4.98e+00

Pos logit contributions
 living+0.121
aw+0.116
It+0.110
 biggest+0.109
 loads+0.101
Neg logit contributions
 argue-0.170
sing-0.163
 breath-0.155
 whine-0.152
 weep-0.144
Token "
Token ID366
Feature activation-2.88e+00

Pos logit contributions
 living+0.080
aw+0.078
It+0.078
 that+0.077
 all+0.069
Neg logit contributions
 argue-0.130
sing-0.127
 spine-0.121
 whine-0.117
 breath-0.116
Tokensave
Token ID21928
Feature activation-4.70e-01

Pos logit contributions
It+0.042
aw+0.037
 living+0.036
 that+0.035
That+0.034
Neg logit contributions
sing-0.084
 argue-0.082
 spine-0.079
 whine-0.076
 breath-0.075
Token me
Token ID502
Feature activation+1.55e+00

Pos logit contributions
aw+0.008
 living+0.007
It+0.007
 that+0.007
 brand+0.006
Neg logit contributions
 argue-0.012
sing-0.011
 breath-0.011
 whine-0.010
 spine-0.010
Token!"
Token ID2474
Feature activation+5.41e-01

Pos logit contributions
 argue+0.038
sing+0.037
 spine+0.035
 whine+0.034
 breath+0.033
Neg logit contributions
 living-0.027
aw-0.027
It-0.026
 that-0.025
 all-0.023
Tokenpped
Token ID1496
Feature activation-7.50e-01

Pos logit contributions
 argue+0.068
sing+0.068
 spine+0.065
 whine+0.063
days+0.061
Neg logit contributions
 living-0.021
aw-0.020
It-0.020
 that-0.019
 all-0.017
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
 living+0.010
aw+0.009
It+0.009
 that+0.009
 all+0.007
Neg logit contributions
 argue-0.022
sing-0.021
 spine-0.020
 breath-0.020
 whine-0.020
Token fell
Token ID3214
Feature activation+6.48e-01

Pos logit contributions
 argue+0.036
sing+0.034
 spine+0.032
 whine+0.031
 breath+0.031
Neg logit contributions
 living-0.044
aw-0.044
It-0.042
 that-0.041
 all-0.038
Token.
Token ID13
Feature activation-3.73e-01

Pos logit contributions
 argue+0.016
sing+0.016
 spine+0.015
 whine+0.015
 breath+0.015
Neg logit contributions
 living-0.011
aw-0.011
It-0.010
 that-0.010
 all-0.009
Token The
Token ID383
Feature activation-2.99e+00

Pos logit contributions
 living+0.006
aw+0.005
It+0.005
 that+0.005
 all+0.005
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Token wine
Token ID8237
Feature activation-7.27e+00

Pos logit contributions
 living+0.052
aw+0.051
It+0.051
 that+0.049
 all+0.045
Neg logit contributions
 argue-0.073
sing-0.072
 spine-0.068
 whine-0.066
 breath-0.066
Token glass
Token ID5405
Feature activation-6.31e+00

Pos logit contributions
aw+0.130
 living+0.124
It+0.115
 brand+0.109
 that+0.108
Neg logit contributions
 argue-0.167
 breath-0.162
sing-0.161
 whine-0.150
 spine-0.150
Token flew
Token ID13112
Feature activation-1.87e+00

Pos logit contributions
 living+0.114
aw+0.113
It+0.107
 brand+0.099
 biggest+0.098
Neg logit contributions
 argue-0.138
sing-0.137
 breath-0.137
 spine-0.128
 whine-0.124
Token out
Token ID503
Feature activation+1.63e+00

Pos logit contributions
 living+0.035
aw+0.033
It+0.032
 that+0.030
hes+0.029
Neg logit contributions
 argue-0.042
sing-0.041
 breath-0.040
 spine-0.039
 whine-0.037
Token of
Token ID286
Feature activation+1.54e-01

Pos logit contributions
 argue+0.049
sing+0.048
 spine+0.046
 whine+0.045
 breath+0.045
Neg logit contributions
 living-0.019
aw-0.019
It-0.018
 that-0.018
 all-0.016
Token her
Token ID607
Feature activation+1.35e+00

Pos logit contributions
 argue+0.004
sing+0.003
 breath+0.003
 spine+0.003
 whine+0.003
Neg logit contributions
aw-0.003
 living-0.003
It-0.003
 that-0.003
ing-0.002
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 argue+0.122
sing+0.122
 spine+0.115
 whine+0.110
 breath+0.110
Neg logit contributions
 living-0.075
 that-0.072
aw-0.069
It-0.068
 all-0.065
Token
Token ID220
Feature activation-4.55e-01

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.007
 breath+0.007
 whine+0.007
Neg logit contributions
 living-0.005
It-0.005
aw-0.005
 that-0.005
 all-0.005
Token
Token ID198
Feature activation-3.62e-01

Pos logit contributions
 living+0.006
aw+0.006
 that+0.006
It+0.005
 all+0.005
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.011
 whine-0.011
days-0.011
Token
Token ID198
Feature activation-5.67e-01

Pos logit contributions
 living+0.005
aw+0.005
It+0.004
 that+0.004
 all+0.004
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.010
 whine-0.010
 breath-0.009
TokenThe
Token ID464
Feature activation-2.64e+00

Pos logit contributions
 living+0.012
aw+0.012
It+0.012
 that+0.012
 all+0.011
Neg logit contributions
 argue-0.012
sing-0.011
 spine-0.011
 whine-0.010
 breath-0.010
Token card
Token ID2657
Feature activation-7.23e+00

Pos logit contributions
 living+0.043
aw+0.043
It+0.042
 that+0.040
 all+0.037
Neg logit contributions
 argue-0.067
sing-0.066
 spine-0.062
 whine-0.061
 breath-0.061
Token was
Token ID373
Feature activation-2.72e+00

Pos logit contributions
aw+0.097
 living+0.097
It+0.093
 that+0.081
That+0.076
Neg logit contributions
 argue-0.197
sing-0.194
 breath-0.182
 spine-0.180
 whine-0.178
Token incredible
Token ID8082
Feature activation+4.17e-01

Pos logit contributions
aw+0.036
 living+0.032
It+0.031
ing+0.027
That+0.027
Neg logit contributions
sing-0.079
 argue-0.077
 breath-0.073
 whine-0.069
 spine-0.069
Token.
Token ID13
Feature activation-6.63e-01

Pos logit contributions
sing+0.010
 argue+0.009
 spine+0.009
 breath+0.009
 whine+0.008
Neg logit contributions
It-0.008
aw-0.007
 living-0.007
hes-0.007
That-0.006
Token It
Token ID632
Feature activation-4.20e+00

Pos logit contributions
 living+0.011
aw+0.010
 that+0.009
It+0.009
 all+0.009
Neg logit contributions
sing-0.017
 argue-0.017
 spine-0.015
 whine-0.015
 breath-0.015
Token had
Token ID550
Feature activation-3.70e+00

Pos logit contributions
aw+0.062
 living+0.058
It+0.054
ay+0.048
hes+0.047
Neg logit contributions
 argue-0.109
sing-0.108
 breath-0.100
cker-0.098
 whine-0.097
Token wherever
Token ID14530
Feature activation+7.11e-01

Pos logit contributions
 argue+0.007
sing+0.007
 spine+0.007
 whine+0.006
 breath+0.006
Neg logit contributions
 living-0.007
aw-0.007
It-0.007
 that-0.007
 all-0.007
Token she
Token ID673
Feature activation+4.35e+00

Pos logit contributions
sing+0.021
 argue+0.021
 spine+0.020
 whine+0.019
 breath+0.019
Neg logit contributions
 living-0.010
 that-0.009
It-0.009
aw-0.008
 all-0.008
Token went
Token ID1816
Feature activation+2.05e+00

Pos logit contributions
 argue+0.109
sing+0.105
 spine+0.100
 whine+0.099
 breath+0.098
Neg logit contributions
 living-0.071
aw-0.069
It-0.068
 that-0.067
 all-0.060
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
 argue+0.046
sing+0.044
 spine+0.042
 whine+0.041
 breath+0.041
Neg logit contributions
aw-0.039
 living-0.039
It-0.038
 that-0.036
 brand-0.033
Token The
Token ID383
Feature activation-2.71e+00

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.007
 breath+0.007
 whine+0.007
Neg logit contributions
 living-0.004
aw-0.004
It-0.004
 that-0.003
 all-0.003
Token label
Token ID6167
Feature activation-7.17e+00

Pos logit contributions
aw+0.048
It+0.043
 living+0.042
 that+0.041
 all+0.039
Neg logit contributions
 argue-0.065
sing-0.065
 spine-0.062
 whine-0.059
 breath-0.058
Token was
Token ID373
Feature activation-2.25e+00

Pos logit contributions
aw+0.098
It+0.093
 living+0.090
 brand+0.077
That+0.076
Neg logit contributions
 argue-0.194
sing-0.185
 whine-0.176
 breath-0.176
 spine-0.174
Token very
Token ID845
Feature activation-7.39e-01

Pos logit contributions
aw+0.038
It+0.029
 living+0.028
ing+0.027
That+0.026
Neg logit contributions
 argue-0.061
sing-0.060
 breath-0.057
 whine-0.056
 spine-0.054
Token attractive
Token ID10966
Feature activation-4.44e-01

Pos logit contributions
aw+0.008
 living+0.008
It+0.008
 that+0.007
 all+0.006
Neg logit contributions
 argue-0.023
sing-0.022
 spine-0.021
 whine-0.021
 breath-0.021
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
It+0.009
aw+0.008
 living+0.007
hes+0.007
That+0.007
Neg logit contributions
sing-0.010
 argue-0.009
 spine-0.009
 breath-0.009
 whine-0.008
Token was
Token ID373
Feature activation-2.34e+00

Pos logit contributions
aw+0.013
 living+0.011
arre+0.010
It+0.010
ing+0.010
Neg logit contributions
sing-0.015
 argue-0.015
 whine-0.014
 seek-0.014
 breath-0.014
Token started
Token ID2067
Feature activation-1.74e-01

Pos logit contributions
 argue+0.034
sing+0.033
 whine+0.031
 breath+0.030
 spine+0.030
Neg logit contributions
aw-0.021
 living-0.021
It-0.020
 that-0.019
ay-0.018
Token to
Token ID284
Feature activation+4.37e+00

Pos logit contributions
 living+0.003
 that+0.003
aw+0.003
It+0.003
 all+0.003
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.004
 breath-0.004
Token paint
Token ID7521
Feature activation-2.47e+00

Pos logit contributions
 argue+0.089
sing+0.088
 spine+0.085
 breath+0.078
 whine+0.078
Neg logit contributions
 living-0.093
It-0.089
 that-0.089
aw-0.087
 all-0.084
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
 living+0.047
aw+0.044
It+0.044
That+0.041
 that+0.039
Neg logit contributions
 argue-0.052
sing-0.049
 breath-0.049
 whine-0.045
 spine-0.045
Token The
Token ID383
Feature activation-2.59e+00

Pos logit contributions
aw+0.002
 living+0.002
 that+0.002
It+0.002
ing+0.002
Neg logit contributions
 argue-0.003
sing-0.003
 spine-0.003
 whine-0.003
days-0.002
Token wine
Token ID8237
Feature activation-7.10e+00

Pos logit contributions
aw+0.045
It+0.043
 living+0.042
 that+0.040
 all+0.037
Neg logit contributions
 argue-0.064
sing-0.063
 spine-0.058
 whine-0.057
 breath-0.057
Token made
Token ID925
Feature activation-3.51e+00

Pos logit contributions
aw+0.106
 living+0.094
It+0.085
 brand+0.081
 that+0.074
Neg logit contributions
 argue-0.184
sing-0.178
 breath-0.176
 whine-0.166
comb-0.161
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
 living+0.065
aw+0.063
It+0.057
That+0.051
 that+0.050
Neg logit contributions
 argue-0.075
sing-0.074
 breath-0.074
 whine-0.066
 Suz-0.063
Token colors
Token ID7577
Feature activation-1.96e+00

Pos logit contributions
aw+0.027
 living+0.027
It+0.027
 that+0.025
 all+0.023
Neg logit contributions
 argue-0.043
sing-0.043
 spine-0.040
 breath-0.038
 whine-0.038
Token look
Token ID804
Feature activation-4.09e+00

Pos logit contributions
aw+0.033
 living+0.033
It+0.030
That+0.026
 that+0.026
Neg logit contributions
 argue-0.049
sing-0.049
 breath-0.046
 spine-0.044
 whine-0.043
Token different
Token ID1180
Feature activation-1.04e+00

Pos logit contributions
 living+0.065
aw+0.058
It+0.052
rob+0.049
 Now+0.049
Neg logit contributions
 argue-0.106
 breath-0.100
sing-0.099
 whine-0.092
 resist-0.090
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
It+0.034
aw+0.033
 living+0.031
That+0.030
 brand+0.029
Neg logit contributions
 argue-0.056
sing-0.055
 spine-0.052
 breath-0.051
 whine-0.048
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
 argue+0.025
sing+0.024
 breath+0.023
 whine+0.022
 seek+0.021
Neg logit contributions
aw-0.019
 living-0.019
It-0.017
ay-0.016
ing-0.016
Token note
Token ID3465
Feature activation-4.12e+00

Pos logit contributions
aw+0.028
 living+0.025
 that+0.024
It+0.024
 all+0.022
Neg logit contributions
sing-0.043
 argue-0.043
 spine-0.040
 breath-0.039
 whine-0.039
Token.
Token ID13
Feature activation-4.16e-01

Pos logit contributions
aw+0.070
It+0.068
 living+0.064
 loads+0.060
That+0.060
Neg logit contributions
 argue-0.096
sing-0.093
 spine-0.088
 breath-0.086
 whine-0.084
Token The
Token ID383
Feature activation-3.05e+00

Pos logit contributions
 living+0.007
aw+0.007
It+0.007
 that+0.006
 all+0.006
Neg logit contributions
 argue-0.010
sing-0.010
 spine-0.009
 whine-0.009
 breath-0.009
Token note
Token ID3465
Feature activation-7.09e+00

Pos logit contributions
aw+0.064
 living+0.062
It+0.060
 that+0.057
 all+0.054
Neg logit contributions
 argue-0.065
sing-0.062
 breath-0.058
 spine-0.056
 whine-0.055
Token said
Token ID531
Feature activation-5.05e+00

Pos logit contributions
aw+0.135
 living+0.129
It+0.122
 biggest+0.112
 loads+0.112
Neg logit contributions
 argue-0.154
sing-0.144
 breath-0.137
 whine-0.133
 spine-0.126
Token,
Token ID11
Feature activation+2.58e-01

Pos logit contributions
 living+0.068
aw+0.066
It+0.065
 that+0.064
 all+0.056
Neg logit contributions
 argue-0.146
sing-0.143
 spine-0.137
 whine-0.133
 breath-0.132
Token "
Token ID366
Feature activation-3.55e+00

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.007
 whine+0.007
 breath+0.007
Neg logit contributions
aw-0.003
 living-0.003
It-0.003
 that-0.003
 biggest-0.002
TokenIf
Token ID1532
Feature activation-1.11e-01

Pos logit contributions
 living+0.044
aw+0.043
It+0.043
 that+0.041
 all+0.035
Neg logit contributions
 argue-0.105
sing-0.104
 spine-0.099
 whine-0.096
 breath-0.096
Token you
Token ID345
Feature activation+2.45e+00

Pos logit contributions
 that+0.002
 living+0.002
It+0.001
aw+0.001
 all+0.001
Neg logit contributions
 argue-0.003
sing-0.003
 spine-0.003
 whine-0.003
 breath-0.003
Token was
Token ID373
Feature activation-2.45e+00

Pos logit contributions
aw+0.010
It+0.010
 living+0.009
That+0.008
hes+0.008
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.011
 breath-0.011
 whine-0.011
Token too
Token ID1165
Feature activation+7.77e-01

Pos logit contributions
aw+0.038
It+0.035
 living+0.035
That+0.031
 that+0.031
Neg logit contributions
 argue-0.065
sing-0.064
 breath-0.060
 spine-0.059
 whine-0.059
Token small
Token ID1402
Feature activation+1.75e+00

Pos logit contributions
 argue+0.022
sing+0.021
 breath+0.020
 spine+0.020
 whine+0.020
Neg logit contributions
aw-0.011
 living-0.010
It-0.010
 that-0.009
That-0.008
Token!
Token ID0
Feature activation-9.62e-02

Pos logit contributions
 argue+0.039
sing+0.037
 spine+0.036
 breath+0.035
 whine+0.034
Neg logit contributions
It-0.035
aw-0.034
 living-0.031
That-0.030
hes-0.029
Token His
Token ID2399
Feature activation-5.88e-01

Pos logit contributions
 living+0.001
aw+0.001
It+0.001
 that+0.001
 all+0.001
Neg logit contributions
 argue-0.003
sing-0.003
 spine-0.003
 whine-0.003
 breath-0.003
Token photo
Token ID4590
Feature activation-7.04e+00

Pos logit contributions
 living+0.013
aw+0.013
It+0.013
 that+0.012
 all+0.012
Neg logit contributions
 argue-0.012
sing-0.011
 spine-0.011
 breath-0.010
 whine-0.010
Token had
Token ID550
Feature activation-3.91e+00

Pos logit contributions
aw+0.090
 living+0.089
It+0.084
 that+0.073
 brand+0.072
Neg logit contributions
 argue-0.190
sing-0.188
 spine-0.180
 breath-0.180
 whine-0.175
Token shrunk
Token ID38900
Feature activation-4.79e-01

Pos logit contributions
aw+0.067
 living+0.066
It+0.062
 that+0.058
ing+0.057
Neg logit contributions
 argue-0.092
sing-0.088
 breath-0.086
 spine-0.084
 whine-0.080
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
It+0.009
 living+0.009
aw+0.009
That+0.008
hes+0.008
Neg logit contributions
sing-0.011
 argue-0.010
 breath-0.010
 spine-0.010
 whine-0.009
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 living+0.007
aw+0.007
It+0.006
 that+0.006
 all+0.006
Neg logit contributions
 argue-0.015
sing-0.015
 spine-0.014
 whine-0.014
 breath-0.013
Token
Token ID198
Feature activation-8.78e-01

Pos logit contributions
 living+0.018
aw+0.017
It+0.017
 that+0.016
 all+0.014
Neg logit contributions
 argue-0.051
sing-0.050
 spine-0.048
 whine-0.046
 breath-0.046
Token was
Token ID373
Feature activation-1.90e+00

Pos logit contributions
 argue+0.029
sing+0.028
 spine+0.026
 whine+0.025
 breath+0.025
Neg logit contributions
 living-0.024
aw-0.023
It-0.023
 that-0.022
 all-0.021
Token also
Token ID635
Feature activation-1.67e+00

Pos logit contributions
aw+0.026
It+0.025
 living+0.024
 that+0.022
That+0.021
Neg logit contributions
sing-0.053
 argue-0.053
 breath-0.049
 whine-0.049
 spine-0.049
Token wrong
Token ID2642
Feature activation+5.14e-01

Pos logit contributions
aw+0.026
It+0.026
 living+0.025
 that+0.024
That+0.022
Neg logit contributions
 argue-0.043
sing-0.043
 whine-0.039
 spine-0.039
 breath-0.039
Token.
Token ID13
Feature activation+1.41e-01

Pos logit contributions
 argue+0.013
sing+0.013
 spine+0.012
 whine+0.012
 breath+0.012
Neg logit contributions
 living-0.009
aw-0.008
 that-0.008
It-0.008
 all-0.007
Token The
Token ID383
Feature activation-2.60e+00

Pos logit contributions
 argue+0.004
sing+0.004
 spine+0.004
 breath+0.004
 whine+0.004
Neg logit contributions
 living-0.002
It-0.002
aw-0.002
 that-0.002
 all-0.001
Token photo
Token ID4590
Feature activation-6.98e+00

Pos logit contributions
 living+0.043
aw+0.042
It+0.041
 that+0.040
 all+0.037
Neg logit contributions
 argue-0.066
sing-0.065
 spine-0.061
 whine-0.059
 breath-0.059
Token is
Token ID318
Feature activation-2.80e+00

Pos logit contributions
 living+0.082
It+0.080
aw+0.080
 brand+0.071
 that+0.066
Neg logit contributions
 argue-0.201
sing-0.191
 whine-0.182
 breath-0.180
 spine-0.180
Token not
Token ID407
Feature activation+7.43e-02

Pos logit contributions
aw+0.046
It+0.043
 living+0.041
That+0.037
 that+0.037
Neg logit contributions
sing-0.073
 argue-0.072
 whine-0.066
 breath-0.065
 spine-0.065
Token worth
Token ID2861
Feature activation+9.09e-01

Pos logit contributions
 argue+0.002
sing+0.002
 breath+0.002
 spine+0.002
 whine+0.002
Neg logit contributions
aw-0.001
It-0.001
 living-0.001
That-0.001
 that-0.001
Token our
Token ID674
Feature activation+1.86e+00

Pos logit contributions
 argue+0.023
sing+0.022
 breath+0.021
 spine+0.020
 whine+0.020
Neg logit contributions
aw-0.015
 living-0.015
It-0.015
ay-0.013
 that-0.013
Token happiness
Token ID12157
Feature activation+1.81e+00

Pos logit contributions
 argue+0.037
sing+0.036
 spine+0.034
 whine+0.033
days+0.032
Neg logit contributions
 living-0.041
 that-0.040
aw-0.038
It-0.037
 all-0.036
Token the
Token ID262
Feature activation+5.26e-01

Pos logit contributions
 argue+0.019
sing+0.019
 breath+0.018
 spine+0.017
 whine+0.017
Neg logit contributions
aw-0.014
 living-0.014
It-0.014
 that-0.013
That-0.012
Token river
Token ID7850
Feature activation+4.31e-01

Pos logit contributions
 argue+0.011
sing+0.011
 spine+0.010
 breath+0.010
 whine+0.010
Neg logit contributions
aw-0.010
It-0.010
 living-0.010
 that-0.010
That-0.010
Tokenbank
Token ID17796
Feature activation+7.69e-01

Pos logit contributions
 argue+0.010
sing+0.009
 spine+0.009
 whine+0.008
 breath+0.008
Neg logit contributions
It-0.008
 living-0.008
aw-0.008
 that-0.008
That-0.007
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 argue+0.017
sing+0.017
 spine+0.016
 whine+0.015
 breath+0.015
Neg logit contributions
It-0.015
 living-0.015
aw-0.014
 that-0.014
That-0.013
Token The
Token ID383
Feature activation-2.20e+00

Pos logit contributions
 argue+0.004
sing+0.004
 spine+0.004
 whine+0.004
 breath+0.004
Neg logit contributions
aw-0.002
 living-0.002
 that-0.002
It-0.002
 all-0.002
Token sign
Token ID1051
Feature activation-6.98e+00

Pos logit contributions
 living+0.036
aw+0.036
It+0.036
 that+0.034
 all+0.031
Neg logit contributions
 argue-0.056
sing-0.055
 spine-0.052
 whine-0.050
 breath-0.050
Token said
Token ID531
Feature activation-4.68e+00

Pos logit contributions
aw+0.100
 living+0.096
It+0.094
 brand+0.083
 biggest+0.083
Neg logit contributions
 argue-0.185
sing-0.171
 breath-0.165
 whine-0.163
 spine-0.155
Token "
Token ID366
Feature activation-2.62e+00

Pos logit contributions
 living+0.076
aw+0.074
It+0.072
 that+0.067
 all+0.063
Neg logit contributions
 argue-0.119
sing-0.117
 spine-0.109
 whine-0.106
 breath-0.106
TokenBridge
Token ID37385
Feature activation-4.48e-01

Pos logit contributions
It+0.045
 living+0.045
aw+0.045
 that+0.042
 all+0.039
Neg logit contributions
 argue-0.065
sing-0.064
 spine-0.060
 whine-0.058
 breath-0.058
Token this
Token ID428
Feature activation-1.69e+00

Pos logit contributions
 living+0.008
aw+0.008
It+0.008
 that+0.007
That+0.007
Neg logit contributions
 argue-0.010
sing-0.010
 breath-0.010
 spine-0.009
 whine-0.009
Token way
Token ID835
Feature activation+2.09e+00

Pos logit contributions
 living+0.032
aw+0.032
It+0.032
 that+0.030
 all+0.028
Neg logit contributions
 argue-0.038
sing-0.037
 spine-0.035
 breath-0.034
 whine-0.034
TokenMom
Token ID29252
Feature activation+1.68e-01

Pos logit contributions
 living+0.008
aw+0.008
 that+0.008
It+0.008
 all+0.007
Neg logit contributions
 argue-0.014
sing-0.013
 spine-0.013
 whine-0.012
 breath-0.012
Tokenmy
Token ID1820
Feature activation-1.60e+00

Pos logit contributions
 argue+0.004
sing+0.004
 whine+0.004
 breath+0.004
days+0.004
Neg logit contributions
aw-0.003
 living-0.003
It-0.003
 that-0.003
 biggest-0.003
Token explained
Token ID4893
Feature activation-3.73e+00

Pos logit contributions
 living+0.027
aw+0.026
It+0.026
 that+0.024
 all+0.022
Neg logit contributions
 argue-0.040
sing-0.039
 spine-0.036
 whine-0.036
 breath-0.036
Token that
Token ID326
Feature activation-4.00e+00

Pos logit contributions
 living+0.036
aw+0.035
It+0.035
 that+0.028
That+0.026
Neg logit contributions
 argue-0.119
sing-0.115
 whine-0.110
 spine-0.109
 breath-0.109
Token the
Token ID262
Feature activation-3.92e+00

Pos logit contributions
aw+0.071
It+0.067
 living+0.066
 that+0.061
 brand+0.057
Neg logit contributions
 argue-0.098
sing-0.094
 whine-0.088
 spine-0.087
 breath-0.086
Token box
Token ID3091
Feature activation-6.98e+00

Pos logit contributions
aw+0.084
It+0.073
 that+0.069
ocked+0.068
ing+0.068
Neg logit contributions
 argue-0.089
sing-0.084
 whine-0.080
 spine-0.078
 breath-0.073
Token was
Token ID373
Feature activation-2.72e+00

Pos logit contributions
aw+0.088
It+0.080
 living+0.079
That+0.064
 brand+0.061
Neg logit contributions
 argue-0.201
sing-0.197
 spine-0.182
 whine-0.182
days-0.182
Token like
Token ID588
Feature activation-3.03e+00

Pos logit contributions
aw+0.038
 living+0.037
It+0.036
 that+0.033
That+0.030
Neg logit contributions
 argue-0.076
sing-0.075
 spine-0.070
 whine-0.069
 breath-0.069
Token a
Token ID257
Feature activation-8.52e-01

Pos logit contributions
aw+0.047
It+0.043
 living+0.042
That+0.037
ay+0.036
Neg logit contributions
 argue-0.078
sing-0.077
 breath-0.072
 spine-0.071
 whine-0.071
Token truck
Token ID7779
Feature activation-7.19e-01

Pos logit contributions
aw+0.015
 living+0.013
It+0.013
 that+0.013
 all+0.012
Neg logit contributions
 argue-0.022
sing-0.021
 spine-0.020
 whine-0.020
 breath-0.019
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
aw+0.012
It+0.012
 living+0.012
That+0.010
 biggest+0.010
Neg logit contributions
 argue-0.018
sing-0.017
 spine-0.016
 whine-0.016
days-0.015
Token back
Token ID736
Feature activation+1.80e+00

Pos logit contributions
aw+0.015
 living+0.014
It+0.013
That+0.013
 that+0.012
Neg logit contributions
 breath-0.025
 argue-0.025
sing-0.024
 whine-0.023
 spine-0.023
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
 argue+0.048
sing+0.047
 spine+0.044
 breath+0.044
 whine+0.043
Neg logit contributions
aw-0.028
It-0.027
 living-0.026
 that-0.024
That-0.023
Token smiled
Token ID13541
Feature activation-9.48e-01

Pos logit contributions
 argue+0.049
sing+0.047
 whine+0.045
 breath+0.045
 spine+0.044
Neg logit contributions
aw-0.040
 living-0.038
It-0.037
ay-0.035
 that-0.035
Token.
Token ID13
Feature activation-5.82e-01

Pos logit contributions
 living+0.017
aw+0.017
It+0.016
 that+0.016
 all+0.014
Neg logit contributions
 argue-0.023
sing-0.022
 spine-0.021
 breath-0.020
 whine-0.020
Token The
Token ID383
Feature activation-2.92e+00

Pos logit contributions
aw+0.009
 living+0.009
It+0.008
 that+0.008
hes+0.007
Neg logit contributions
sing-0.015
 argue-0.014
 spine-0.014
 whine-0.013
 breath-0.013
Token bucket
Token ID19236
Feature activation-6.95e+00

Pos logit contributions
aw+0.053
It+0.049
 living+0.049
 that+0.048
hes+0.043
Neg logit contributions
 argue-0.070
sing-0.069
 spine-0.065
 whine-0.064
 breath-0.064
Token was
Token ID373
Feature activation-2.31e+00

Pos logit contributions
aw+0.094
 living+0.086
It+0.081
 biggest+0.069
 that+0.067
Neg logit contributions
 argue-0.189
sing-0.189
 breath-0.184
 spine-0.175
 whine-0.174
Token all
Token ID477
Feature activation-2.00e+00

Pos logit contributions
aw+0.041
 living+0.033
pill+0.032
It+0.031
ing+0.030
Neg logit contributions
sing-0.061
 argue-0.059
 breath-0.059
 whine-0.054
 spine-0.053
Token clean
Token ID3424
Feature activation+3.91e-02

Pos logit contributions
aw+0.041
It+0.035
 living+0.032
That+0.031
 that+0.030
Neg logit contributions
 argue-0.045
 breath-0.044
sing-0.044
 whine-0.041
 spine-0.041
Token and
Token ID290
Feature activation+8.04e-01

Pos logit contributions
 argue+0.001
sing+0.001
 breath+0.001
 spine+0.001
 whine+0.001
Neg logit contributions
aw-0.001
It-0.001
 living-0.001
hes-0.001
That-0.001
Token shiny
Token ID22441
Feature activation-1.58e+00

Pos logit contributions
sing+0.017
 breath+0.017
 argue+0.016
 Suz+0.016
 whine+0.015
Neg logit contributions
aw-0.017
 living-0.014
pill-0.013
It-0.013
br-0.013
Token on
Token ID319
Feature activation+1.22e+00

Pos logit contributions
aw+0.047
It+0.045
 living+0.045
 loads+0.041
 biggest+0.040
Neg logit contributions
 argue-0.075
sing-0.072
 spine-0.070
 whine-0.068
 breath-0.067
Token her
Token ID607
Feature activation+2.48e+00

Pos logit contributions
 argue+0.031
sing+0.030
 spine+0.028
 breath+0.028
 whine+0.027
Neg logit contributions
 living-0.020
aw-0.020
It-0.018
 that-0.018
 brand-0.016
Token slice
Token ID16416
Feature activation-4.57e+00

Pos logit contributions
 argue+0.054
sing+0.051
 spine+0.049
 whine+0.048
 breath+0.046
Neg logit contributions
 living-0.052
 that-0.050
aw-0.050
It-0.049
 all-0.046
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 living+0.079
aw+0.077
It+0.076
 that+0.070
 brand+0.067
Neg logit contributions
 argue-0.107
sing-0.105
 spine-0.100
 whine-0.096
 breath-0.095
Token The
Token ID383
Feature activation-2.66e+00

Pos logit contributions
 argue+0.005
sing+0.005
 spine+0.005
 whine+0.005
 breath+0.005
Neg logit contributions
 living-0.002
aw-0.002
It-0.002
 that-0.002
 all-0.002
Token note
Token ID3465
Feature activation-6.94e+00

Pos logit contributions
aw+0.046
 living+0.045
It+0.044
 that+0.042
 all+0.039
Neg logit contributions
 argue-0.065
sing-0.064
 spine-0.060
 breath-0.058
 whine-0.058
Token said
Token ID531
Feature activation-4.98e+00

Pos logit contributions
aw+0.123
 living+0.119
It+0.113
 biggest+0.103
 loads+0.101
Neg logit contributions
 argue-0.157
sing-0.140
 breath-0.139
 whine-0.137
 weep-0.131
Token '
Token ID705
Feature activation-1.01e+00

Pos logit contributions
 living+0.071
aw+0.070
It+0.065
 that+0.058
 all+0.056
Neg logit contributions
 argue-0.130
sing-0.129
 spine-0.120
 breath-0.118
 whine-0.118
TokenHappy
Token ID25082
Feature activation-6.36e-02

Pos logit contributions
 living+0.016
aw+0.016
It+0.015
 that+0.015
 all+0.013
Neg logit contributions
 argue-0.027
sing-0.026
 spine-0.025
 whine-0.024
 breath-0.024
Token Birthday
Token ID33511
Feature activation-3.52e-01

Pos logit contributions
 living+0.001
aw+0.001
It+0.001
 that+0.001
 all+0.001
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 whine-0.001
 breath-0.001
Token'.
Token ID4458
Feature activation-3.01e-01

Pos logit contributions
 living+0.007
 that+0.007
aw+0.007
It+0.007
 all+0.006
Neg logit contributions
 argue-0.008
sing-0.008
 spine-0.007
 whine-0.007
 breath-0.007
Token note
Token ID3465
Feature activation-4.19e+00

Pos logit contributions
aw+0.019
 living+0.018
It+0.018
 that+0.017
 all+0.015
Neg logit contributions
 argue-0.034
sing-0.033
 spine-0.031
 breath-0.031
 whine-0.030
Token from
Token ID422
Feature activation+2.82e-01

Pos logit contributions
aw+0.062
It+0.060
 living+0.057
 loads+0.053
That+0.053
Neg logit contributions
 argue-0.102
sing-0.100
 spine-0.098
 breath-0.094
 whine-0.090
Token Sammy
Token ID44067
Feature activation+1.93e+00

Pos logit contributions
 argue+0.007
sing+0.007
 spine+0.007
 whine+0.007
 breath+0.007
Neg logit contributions
 living-0.005
aw-0.004
It-0.004
 that-0.004
 all-0.004
Token.
Token ID13
Feature activation-7.32e-02

Pos logit contributions
 argue+0.051
sing+0.050
 spine+0.047
 whine+0.046
 breath+0.045
Neg logit contributions
 living-0.031
aw-0.030
It-0.029
 that-0.029
 all-0.026
Token The
Token ID383
Feature activation-2.33e+00

Pos logit contributions
aw+0.001
 living+0.001
It+0.001
 that+0.001
hes+0.001
Neg logit contributions
sing-0.002
 argue-0.002
 spine-0.002
 breath-0.002
 whine-0.002
Token note
Token ID3465
Feature activation-6.94e+00

Pos logit contributions
aw+0.049
 living+0.048
It+0.047
 that+0.044
hes+0.044
Neg logit contributions
 argue-0.047
sing-0.046
 spine-0.043
 breath-0.043
 whine-0.040
Token said
Token ID531
Feature activation-4.52e+00

Pos logit contributions
aw+0.123
 living+0.118
It+0.113
 biggest+0.105
 loads+0.101
Neg logit contributions
 argue-0.155
sing-0.146
 breath-0.142
 spine-0.134
 whine-0.133
Token that
Token ID326
Feature activation-3.26e+00

Pos logit contributions
 living+0.064
aw+0.062
It+0.058
That+0.049
 that+0.049
Neg logit contributions
 argue-0.120
sing-0.118
 breath-0.111
 spine-0.110
 whine-0.107
Token Sammy
Token ID44067
Feature activation-1.20e+00

Pos logit contributions
aw+0.070
It+0.064
 living+0.063
 that+0.057
That+0.056
Neg logit contributions
 argue-0.069
sing-0.066
 spine-0.060
 breath-0.060
 whine-0.059
Token had
Token ID550
Feature activation-3.32e+00

Pos logit contributions
aw+0.014
 living+0.014
It+0.013
 that+0.011
 brand+0.010
Neg logit contributions
 argue-0.036
sing-0.034
 spine-0.033
 breath-0.032
 whine-0.032
Token left
Token ID1364
Feature activation-6.43e-01

Pos logit contributions
aw+0.055
 living+0.054
It+0.053
 that+0.048
That+0.046
Neg logit contributions
 argue-0.083
sing-0.080
 spine-0.076
 breath-0.075
 whine-0.073
Token said
Token ID531
Feature activation-3.26e+00

Pos logit contributions
 argue+0.028
sing+0.026
 whine+0.025
 breath+0.024
 spine+0.024
Neg logit contributions
 living-0.019
It-0.018
aw-0.018
 that-0.017
 brand-0.016
Token to
Token ID284
Feature activation+1.74e+00

Pos logit contributions
 that+0.053
 living+0.050
It+0.049
aw+0.048
 all+0.046
Neg logit contributions
sing-0.089
 argue-0.087
 spine-0.086
 whine-0.079
 breath-0.079
Token Max
Token ID5436
Feature activation+2.46e+00

Pos logit contributions
 argue+0.041
sing+0.040
 spine+0.038
 whine+0.037
 breath+0.036
Neg logit contributions
 living-0.032
aw-0.031
It-0.031
 that-0.031
 all-0.028
Token "
Token ID366
Feature activation-2.61e+00

Pos logit contributions
 argue+0.063
sing+0.062
 spine+0.059
 breath+0.056
 whine+0.056
Neg logit contributions
 living-0.041
 that-0.039
aw-0.039
It-0.039
 all-0.036
TokenThis
Token ID1212
Feature activation-4.33e+00

Pos logit contributions
 living+0.040
 that+0.038
aw+0.038
 brand+0.034
It+0.034
Neg logit contributions
 argue-0.070
sing-0.065
 spine-0.062
 whine-0.062
days-0.062
Token wine
Token ID8237
Feature activation-6.93e+00

Pos logit contributions
aw+0.070
It+0.063
 living+0.060
 that+0.054
 brand+0.052
Neg logit contributions
 argue-0.112
sing-0.104
 whine-0.098
 breath-0.096
days-0.094
Token is
Token ID318
Feature activation-3.17e+00

Pos logit contributions
aw+0.083
It+0.072
 living+0.066
 brand+0.058
hes+0.053
Neg logit contributions
 argue-0.193
sing-0.188
 breath-0.181
days-0.180
 whine-0.175
Token so
Token ID523
Feature activation-2.18e-01

Pos logit contributions
aw+0.046
It+0.038
 living+0.038
ing+0.032
ay+0.032
Neg logit contributions
 argue-0.088
sing-0.087
 breath-0.081
 whine-0.079
 spine-0.077
Token hot
Token ID3024
Feature activation+1.44e+00

Pos logit contributions
aw+0.003
 living+0.003
It+0.003
 sky+0.002
ing+0.002
Neg logit contributions
sing-0.006
 argue-0.006
 whine-0.005
days-0.005
 breath-0.005
Token!
Token ID0
Feature activation-1.55e-01

Pos logit contributions
 argue+0.030
sing+0.028
 spine+0.027
 breath+0.026
 whine+0.026
Neg logit contributions
It-0.027
aw-0.026
 living-0.026
 brand-0.025
 biggest-0.024
Token Let
Token ID3914
Feature activation-9.91e-01

Pos logit contributions
 living+0.002
aw+0.002
It+0.002
 that+0.002
 all+0.002
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.004
 breath-0.004
Token year
Token ID614
Feature activation+2.94e-01

Pos logit contributions
 argue+0.010
sing+0.010
 spine+0.010
 whine+0.009
 breath+0.009
Neg logit contributions
 living-0.008
aw-0.007
It-0.007
 that-0.007
 all-0.007
Token old
Token ID1468
Feature activation+3.18e-01

Pos logit contributions
 argue+0.009
sing+0.009
 spine+0.009
 whine+0.008
 breath+0.008
Neg logit contributions
It-0.003
 living-0.003
aw-0.003
 that-0.003
 all-0.002
Token child
Token ID1200
Feature activation+1.95e+00

Pos logit contributions
 argue+0.007
sing+0.007
 spine+0.007
 whine+0.006
 breath+0.006
Neg logit contributions
 living-0.006
aw-0.006
 that-0.006
It-0.006
 all-0.005
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
 argue+0.051
sing+0.050
 spine+0.048
 whine+0.046
 breath+0.046
Neg logit contributions
 living-0.031
aw-0.030
It-0.029
 that-0.029
 all-0.026
Token The
Token ID383
Feature activation-1.85e+00

Pos logit contributions
 argue+0.004
sing+0.004
 spine+0.004
 whine+0.004
 breath+0.004
Neg logit contributions
 living-0.003
aw-0.003
It-0.002
 that-0.002
 all-0.002
Token card
Token ID2657
Feature activation-6.90e+00

Pos logit contributions
 living+0.034
aw+0.034
It+0.033
 that+0.032
 all+0.030
Neg logit contributions
 argue-0.043
sing-0.042
 spine-0.040
 whine-0.038
 breath-0.038
Token had
Token ID550
Feature activation-3.61e+00

Pos logit contributions
aw+0.089
It+0.081
 living+0.080
That+0.065
 biggest+0.065
Neg logit contributions
 argue-0.191
sing-0.187
 breath-0.175
 spine-0.174
 whine-0.174
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
aw+0.054
ing+0.053
 living+0.052
It+0.049
That+0.048
Neg logit contributions
 argue-0.080
 breath-0.075
sing-0.075
 whine-0.069
 spine-0.068
Token picture
Token ID4286
Feature activation-4.65e+00

Pos logit contributions
aw+0.026
 living+0.021
It+0.021
ing+0.021
hes+0.021
Neg logit contributions
sing-0.037
 argue-0.037
 breath-0.034
 spine-0.033
 whine-0.033
Token of
Token ID286
Feature activation-1.43e+00

Pos logit contributions
It+0.057
aw+0.056
That+0.050
 living+0.049
 loads+0.046
Neg logit contributions
sing-0.120
 spine-0.118
 argue-0.113
 breath-0.111
days-0.108
Token an
Token ID281
Feature activation-8.08e-02

Pos logit contributions
aw+0.023
 living+0.021
It+0.021
ing+0.019
 that+0.018
Neg logit contributions
 argue-0.035
sing-0.035
 breath-0.032
 spine-0.031
 whine-0.031
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
 living+0.014
aw+0.014
It+0.014
 that+0.013
 all+0.012
Neg logit contributions
 argue-0.019
sing-0.019
 spine-0.018
 whine-0.017
 breath-0.017
Token said
Token ID531
Feature activation-3.85e+00

Pos logit contributions
 argue+0.041
sing+0.038
 whine+0.037
 breath+0.036
 spine+0.035
Neg logit contributions
aw-0.029
 living-0.029
It-0.028
ay-0.026
 that-0.024
Token,
Token ID11
Feature activation+2.65e-01

Pos logit contributions
 living+0.065
aw+0.063
 that+0.063
It+0.062
 all+0.056
Neg logit contributions
 argue-0.098
sing-0.097
 spine-0.092
 whine-0.088
 breath-0.088
Token "
Token ID366
Feature activation-3.34e+00

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.008
 whine+0.007
 breath+0.007
Neg logit contributions
aw-0.003
 living-0.003
It-0.003
 that-0.002
 all-0.002
TokenThis
Token ID1212
Feature activation-4.49e+00

Pos logit contributions
 living+0.047
aw+0.044
 that+0.043
 all+0.040
It+0.038
Neg logit contributions
 argue-0.093
sing-0.088
 spine-0.085
 whine-0.085
 breath-0.082
Token card
Token ID2657
Feature activation-6.90e+00

Pos logit contributions
aw+0.039
It+0.037
 living+0.036
 that+0.028
 brand+0.028
Neg logit contributions
 argue-0.144
sing-0.138
 whine-0.131
 breath-0.129
 spine-0.127
Token made
Token ID925
Feature activation-3.03e+00

Pos logit contributions
aw+0.055
It+0.053
 living+0.052
 that+0.038
That+0.036
Neg logit contributions
 argue-0.225
sing-0.223
 breath-0.209
 whine-0.209
 spine-0.209
Token me
Token ID502
Feature activation+2.99e+00

Pos logit contributions
aw+0.054
 living+0.053
It+0.051
That+0.045
 that+0.044
Neg logit contributions
 argue-0.068
sing-0.065
 breath-0.063
 whine-0.060
 spine-0.058
Token happy
Token ID3772
Feature activation+6.67e-01

Pos logit contributions
 argue+0.073
sing+0.071
 spine+0.069
 whine+0.065
 breath+0.064
Neg logit contributions
 living-0.052
 that-0.052
aw-0.051
It-0.050
 all-0.048
Token,
Token ID11
Feature activation+8.20e-01

Pos logit contributions
 argue+0.016
sing+0.015
 spine+0.015
 breath+0.014
 whine+0.014
Neg logit contributions
aw-0.012
 living-0.012
It-0.012
 that-0.011
 all-0.010
Token maybe
Token ID3863
Feature activation+9.79e-01

Pos logit contributions
 argue+0.017
sing+0.016
 spine+0.015
 whine+0.015
 breath+0.014
Neg logit contributions
aw-0.017
 living-0.017
It-0.017
 that-0.016
hes-0.015
Token started
Token ID2067
Feature activation-2.09e-02

Pos logit contributions
 living+0.048
aw+0.047
It+0.046
 that+0.044
 all+0.041
Neg logit contributions
 argue-0.060
sing-0.058
 spine-0.055
 breath-0.053
 whine-0.053
Token to
Token ID284
Feature activation+2.83e+00

Pos logit contributions
 that+0.000
 living+0.000
 all+0.000
aw+0.000
It+0.000
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 whine-0.001
 drown-0.001
Token happen
Token ID1645
Feature activation+4.59e-01

Pos logit contributions
 argue+0.052
sing+0.049
 spine+0.045
 breath+0.045
 whine+0.044
Neg logit contributions
aw-0.069
 living-0.068
It-0.065
 that-0.063
 brand-0.059
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
 argue+0.011
sing+0.011
 spine+0.010
 whine+0.010
 breath+0.010
Neg logit contributions
 living-0.008
aw-0.008
It-0.008
 that-0.008
 all-0.007
Token The
Token ID383
Feature activation-2.44e+00

Pos logit contributions
 living+0.002
aw+0.002
It+0.002
 that+0.002
 all+0.002
Neg logit contributions
 argue-0.005
sing-0.005
 spine-0.005
 whine-0.005
 breath-0.005
Token print
Token ID3601
Feature activation-6.90e+00

Pos logit contributions
aw+0.046
 living+0.044
It+0.044
 that+0.042
 all+0.039
Neg logit contributions
 argue-0.057
sing-0.056
 spine-0.052
 breath-0.050
 whine-0.050
Token started
Token ID2067
Feature activation-6.42e-01

Pos logit contributions
aw+0.088
 living+0.083
It+0.079
 brand+0.070
 that+0.068
Neg logit contributions
 argue-0.192
sing-0.185
 breath-0.178
 spine-0.174
 whine-0.173
Token to
Token ID284
Feature activation+2.66e+00

Pos logit contributions
 living+0.011
 that+0.010
aw+0.010
It+0.010
 all+0.009
Neg logit contributions
 argue-0.017
sing-0.016
 spine-0.016
 whine-0.015
 breath-0.015
Token shine
Token ID18340
Feature activation-1.19e+00

Pos logit contributions
 argue+0.059
sing+0.057
 spine+0.055
 whine+0.052
 breath+0.051
Neg logit contributions
 living-0.053
It-0.051
 that-0.051
aw-0.049
 all-0.048
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
 living+0.020
It+0.019
aw+0.019
That+0.017
 that+0.017
Neg logit contributions
 argue-0.028
sing-0.028
 breath-0.027
 spine-0.026
 whine-0.024
Token spark
Token ID9009
Feature activation-7.68e-01

Pos logit contributions
 argue+0.054
sing+0.052
 whine+0.049
 breath+0.049
 spine+0.047
Neg logit contributions
aw-0.041
It-0.035
 living-0.035
ay-0.033
ing-0.032
Token said
Token ID531
Feature activation-3.13e+00

Pos logit contributions
 argue+0.055
sing+0.054
 spine+0.051
 whine+0.049
 breath+0.049
Neg logit contributions
 living-0.035
 that-0.034
It-0.034
aw-0.033
 all-0.031
Token,
Token ID11
Feature activation+4.07e-01

Pos logit contributions
 living+0.051
 that+0.050
aw+0.050
It+0.049
 all+0.045
Neg logit contributions
 argue-0.081
sing-0.080
 spine-0.077
 whine-0.073
 breath-0.072
Token "
Token ID366
Feature activation-2.66e+00

Pos logit contributions
 argue+0.012
sing+0.012
 spine+0.012
 whine+0.011
 breath+0.011
Neg logit contributions
aw-0.005
 living-0.004
It-0.004
 that-0.004
 all-0.003
TokenI
Token ID40
Feature activation+1.42e+00

Pos logit contributions
 living+0.057
 that+0.054
aw+0.054
 all+0.050
It+0.048
Neg logit contributions
 argue-0.056
sing-0.053
 spine-0.049
 whine-0.048
days-0.048
Token want
Token ID765
Feature activation-1.60e+00

Pos logit contributions
 argue+0.029
sing+0.028
 spine+0.026
 whine+0.025
 breath+0.025
Neg logit contributions
 living-0.032
aw-0.031
It-0.030
 that-0.030
 all-0.028
Token to
Token ID284
Feature activation+4.89e+00

Pos logit contributions
aw+0.032
 living+0.031
It+0.031
 that+0.028
That+0.027
Neg logit contributions
 argue-0.035
sing-0.033
 breath-0.031
 spine-0.031
 whine-0.030
Token go
Token ID467
Feature activation+3.42e+00

Pos logit contributions
sing+0.106
 spine+0.101
 argue+0.099
 whine+0.093
 breath+0.088
Neg logit contributions
 that-0.102
 all-0.098
It-0.097
 living-0.096
aw-0.092
Token on
Token ID319
Feature activation+2.78e+00

Pos logit contributions
 argue+0.093
sing+0.090
 spine+0.085
 breath+0.084
 whine+0.083
Neg logit contributions
 living-0.051
aw-0.051
It-0.049
 that-0.045
That-0.041
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
sing+0.083
 argue+0.083
 spine+0.080
 whine+0.077
days+0.074
Neg logit contributions
 that-0.035
 living-0.035
aw-0.034
It-0.033
 all-0.031
Token boat
Token ID8848
Feature activation+1.94e+00

Pos logit contributions
 argue+0.041
sing+0.041
 spine+0.039
 whine+0.038
 breath+0.035
Neg logit contributions
 living-0.042
 that-0.042
It-0.040
aw-0.039
 all-0.039
Token adventure
Token ID8855
Feature activation+2.26e+00

Pos logit contributions
 argue+0.049
sing+0.048
 spine+0.046
 whine+0.045
 breath+0.044
Neg logit contributions
 living-0.032
aw-0.032
 that-0.031
It-0.031
 all-0.029
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
 spine+0.054
sing+0.054
cle+0.054
cker+0.053
comb+0.052
Neg logit contributions
 all-0.016
 living-0.015
 that-0.014
 on-0.012
 of-0.011
Token They
Token ID1119
Feature activation-9.07e-01

Pos logit contributions
 living+0.006
aw+0.006
It+0.006
 that+0.006
 all+0.005
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.011
 whine-0.011
 breath-0.011
Token did
Token ID750
Feature activation-5.89e-01

Pos logit contributions
aw+0.018
 living+0.017
It+0.016
 biggest+0.016
 that+0.016
Neg logit contributions
 argue-0.021
 breath-0.019
 whine-0.019
sing-0.018
 weep-0.017
Token not
Token ID407
Feature activation+4.09e+00

Pos logit contributions
 that+0.008
It+0.007
 all+0.006
 living+0.006
 many+0.005
Neg logit contributions
 spine-0.020
sing-0.018
irds-0.018
cker-0.017
days-0.017
Token want
Token ID765
Feature activation-1.62e+00

Pos logit contributions
sing+0.094
 spine+0.088
 argue+0.082
 footing+0.075
irds+0.075
Neg logit contributions
It-0.083
 living-0.082
 that-0.081
 all-0.077
aw-0.075
Token to
Token ID284
Feature activation+5.68e+00

Pos logit contributions
 that+0.025
 living+0.023
 all+0.021
It+0.020
aw+0.020
Neg logit contributions
sing-0.047
 argue-0.046
 spine-0.045
days-0.043
 whine-0.042
Token get
Token ID651
Feature activation-9.17e-01

Pos logit contributions
sing+0.137
 spine+0.134
irds+0.118
cker+0.113
 argue+0.113
Neg logit contributions
 that-0.118
It-0.104
 all-0.103
 living-0.099
 big-0.095
Token in
Token ID287
Feature activation+2.60e+00

Pos logit contributions
 that+0.013
 living+0.013
 all+0.012
It+0.011
aw+0.010
Neg logit contributions
sing-0.027
 spine-0.027
 argue-0.026
irds-0.024
comb-0.024
Token trouble
Token ID5876
Feature activation+2.69e+00

Pos logit contributions
sing+0.071
 argue+0.070
 spine+0.067
 whine+0.063
days+0.062
Neg logit contributions
 living-0.040
 that-0.039
It-0.037
aw-0.036
 all-0.035
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
 argue+0.068
sing+0.068
 spine+0.064
 whine+0.062
 breath+0.061
Neg logit contributions
 living-0.045
 that-0.043
aw-0.043
It-0.042
 all-0.039
Token They
Token ID1119
Feature activation-9.39e-01

Pos logit contributions
 living+0.002
aw+0.002
It+0.002
 that+0.002
 all+0.002
Neg logit contributions
 argue-0.005
sing-0.005
 spine-0.004
 whine-0.004
 breath-0.004
Token too
Token ID1165
Feature activation+2.89e+00

Pos logit contributions
 argue+0.083
sing+0.080
 spine+0.078
 whine+0.076
 breath+0.075
Neg logit contributions
 living-0.035
 that-0.034
It-0.033
aw-0.033
 all-0.030
Token.
Token ID13
Feature activation-7.72e-01

Pos logit contributions
 argue+0.074
sing+0.073
 spine+0.069
 whine+0.067
 breath+0.065
Neg logit contributions
 living-0.049
 that-0.048
aw-0.046
It-0.045
 all-0.043
Token Do
Token ID2141
Feature activation-2.09e+00

Pos logit contributions
 living+0.014
aw+0.013
It+0.013
 that+0.013
 all+0.012
Neg logit contributions
 argue-0.019
sing-0.018
 spine-0.017
 whine-0.017
 breath-0.016
Token you
Token ID345
Feature activation+3.40e+00

Pos logit contributions
 living+0.030
aw+0.029
 that+0.029
It+0.029
 all+0.026
Neg logit contributions
 argue-0.058
sing-0.058
 spine-0.055
 whine-0.053
 breath-0.053
Token want
Token ID765
Feature activation-7.83e-01

Pos logit contributions
sing+0.081
 spine+0.077
 argue+0.075
 breath+0.069
 whine+0.069
Neg logit contributions
It-0.060
 that-0.059
 all-0.059
aw-0.058
 living-0.057
Token to
Token ID284
Feature activation+5.74e+00

Pos logit contributions
 living+0.013
aw+0.013
It+0.013
 that+0.013
 all+0.012
Neg logit contributions
 argue-0.020
sing-0.019
 spine-0.018
 whine-0.018
 breath-0.018
Token help
Token ID1037
Feature activation+1.64e+00

Pos logit contributions
sing+0.130
 spine+0.125
 argue+0.123
 whine+0.114
days+0.111
Neg logit contributions
 that-0.116
It-0.109
 living-0.108
 all-0.107
aw-0.102
Token me
Token ID502
Feature activation+5.75e+00

Pos logit contributions
 argue+0.046
sing+0.046
 spine+0.043
 whine+0.042
 breath+0.042
Neg logit contributions
 living-0.023
aw-0.022
It-0.022
 that-0.022
 all-0.020
Token get
Token ID651
Feature activation-5.96e-01

Pos logit contributions
 argue+0.128
sing+0.125
 spine+0.117
 whine+0.113
 breath+0.113
Neg logit contributions
 living-0.114
aw-0.112
It-0.110
 that-0.107
 all-0.099
Token ready
Token ID3492
Feature activation+3.47e+00

Pos logit contributions
 living+0.008
aw+0.008
It+0.008
 that+0.008
 all+0.007
Neg logit contributions
 argue-0.017
sing-0.016
 spine-0.016
 whine-0.015
 breath-0.015
Token?"
Token ID1701
Feature activation+1.86e+00

Pos logit contributions
 argue+0.077
sing+0.074
 spine+0.070
 breath+0.068
 whine+0.068
Neg logit contributions
aw-0.068
 living-0.068
It-0.067
 that-0.062
That-0.059
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
 argue+0.047
sing+0.046
 spine+0.042
 whine+0.041
 breath+0.041
Neg logit contributions
aw-0.036
 living-0.036
It-0.035
 that-0.033
 brand-0.031
Token He
Token ID679
Feature activation-2.33e+00

Pos logit contributions
 argue+0.012
sing+0.012
 spine+0.011
 whine+0.011
 breath+0.011
Neg logit contributions
 living-0.005
aw-0.005
It-0.005
 that-0.004
 all-0.004
Token did
Token ID750
Feature activation-3.91e-01

Pos logit contributions
 living+0.044
It+0.039
ay+0.039
aw+0.039
 that+0.037
Neg logit contributions
 argue-0.054
sing-0.050
 whine-0.049
 breath-0.046
 spine-0.045
Token not
Token ID407
Feature activation+3.93e+00

Pos logit contributions
 that+0.007
 living+0.006
It+0.006
aw+0.006
 all+0.006
Neg logit contributions
sing-0.010
 argue-0.010
 spine-0.010
 whine-0.009
 breath-0.009
Token want
Token ID765
Feature activation-1.79e+00

Pos logit contributions
sing+0.083
 argue+0.082
 spine+0.078
 breath+0.073
 whine+0.072
Neg logit contributions
 living-0.082
It-0.079
aw-0.079
 that-0.079
 all-0.074
Token to
Token ID284
Feature activation+5.67e+00

Pos logit contributions
 living+0.031
 that+0.030
aw+0.030
It+0.029
 all+0.027
Neg logit contributions
 argue-0.044
sing-0.044
 spine-0.041
 whine-0.040
 breath-0.039
Token make
Token ID787
Feature activation-2.07e+00

Pos logit contributions
sing+0.122
 spine+0.118
 argue+0.107
irds+0.104
 breath+0.101
Neg logit contributions
 that-0.125
 living-0.117
It-0.114
 all-0.113
aw-0.105
Token his
Token ID465
Feature activation+2.22e+00

Pos logit contributions
 living+0.039
aw+0.038
It+0.038
 that+0.038
 all+0.035
Neg logit contributions
 argue-0.048
sing-0.048
 spine-0.045
 whine-0.043
 breath-0.043
Token mom
Token ID1995
Feature activation+4.73e+00

Pos logit contributions
 argue+0.040
 spine+0.040
sing+0.039
cle+0.037
cker+0.037
Neg logit contributions
 living-0.051
 that-0.050
 all-0.049
 best-0.048
 big-0.047
Token angry
Token ID7954
Feature activation+2.92e+00

Pos logit contributions
 argue+0.094
sing+0.093
 spine+0.087
 breath+0.082
 whine+0.082
Neg logit contributions
 living-0.104
aw-0.102
It-0.100
 that-0.100
 all-0.093
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
 argue+0.075
sing+0.074
 spine+0.070
 whine+0.067
 breath+0.067
Neg logit contributions
 living-0.048
aw-0.047
It-0.046
 that-0.046
 all-0.042
Token smiling
Token ID16755
Feature activation-6.92e-01

Pos logit contributions
aw+0.032
It+0.029
ay+0.028
 living+0.028
boards+0.026
Neg logit contributions
 argue-0.050
sing-0.050
 whine-0.047
 breath-0.046
 weep-0.044
Token,
Token ID11
Feature activation+9.65e-01

Pos logit contributions
 living+0.011
It+0.011
aw+0.011
hes+0.010
 that+0.010
Neg logit contributions
sing-0.017
 argue-0.017
 spine-0.016
 breath-0.016
 whine-0.015
Token but
Token ID475
Feature activation+1.44e+00

Pos logit contributions
sing+0.024
 argue+0.024
 spine+0.023
 whine+0.022
 breath+0.022
Neg logit contributions
aw-0.016
 living-0.015
It-0.014
ay-0.014
 that-0.013
Token he
Token ID339
Feature activation-1.69e+00

Pos logit contributions
 argue+0.035
sing+0.034
 spine+0.033
 whine+0.032
 breath+0.032
Neg logit contributions
 living-0.024
aw-0.024
It-0.022
 that-0.021
ay-0.021
Token couldn
Token ID3521
Feature activation+1.68e+00

Pos logit contributions
 living+0.027
aw+0.025
It+0.024
 that+0.023
 biggest+0.022
Neg logit contributions
 argue-0.043
sing-0.042
 whine-0.040
 spine-0.039
 breath-0.039
Token't
Token ID470
Feature activation+5.38e+00

Pos logit contributions
 argue+0.038
sing+0.036
 whine+0.033
 spine+0.033
 breath+0.033
Neg logit contributions
 living-0.033
aw-0.032
It-0.029
 that-0.029
 brand-0.028
Token bring
Token ID2222
Feature activation-2.16e+00

Pos logit contributions
sing+0.102
 argue+0.095
 spine+0.093
 whine+0.087
 breath+0.085
Neg logit contributions
 that-0.129
It-0.124
 all-0.122
 living-0.120
aw-0.112
Token himself
Token ID2241
Feature activation+1.62e+00

Pos logit contributions
 living+0.037
aw+0.037
It+0.036
 that+0.032
That+0.031
Neg logit contributions
 argue-0.052
sing-0.050
 breath-0.049
 spine-0.047
 whine-0.047
Token to
Token ID284
Feature activation+2.74e+00

Pos logit contributions
 argue+0.043
sing+0.042
 spine+0.040
 whine+0.039
 breath+0.038
Neg logit contributions
 living-0.026
aw-0.025
 that-0.024
It-0.024
 all-0.022
Token smile
Token ID8212
Feature activation+1.25e+00

Pos logit contributions
sing+0.070
 argue+0.070
 spine+0.066
 whine+0.062
 breath+0.062
Neg logit contributions
 living-0.046
 that-0.044
It-0.043
aw-0.043
 all-0.041
Token back
Token ID736
Feature activation+2.08e+00

Pos logit contributions
 argue+0.029
sing+0.028
 spine+0.027
 whine+0.026
 breath+0.026
Neg logit contributions
 living-0.022
aw-0.022
It-0.022
 that-0.020
That-0.019
Token She
Token ID1375
Feature activation-1.53e+00

Pos logit contributions
 argue+0.001
 whine+0.001
sing+0.000
 spine+0.000
 breath+0.000
Neg logit contributions
aw-0.001
 living-0.001
 that-0.001
It-0.000
 best-0.000
Token said
Token ID531
Feature activation-4.41e+00

Pos logit contributions
aw+0.025
 living+0.025
It+0.024
 that+0.023
 biggest+0.022
Neg logit contributions
 argue-0.038
sing-0.037
 whine-0.035
 breath-0.034
 spine-0.033
Token,
Token ID11
Feature activation-2.35e-02

Pos logit contributions
 living+0.070
aw+0.069
It+0.067
 that+0.063
 all+0.059
Neg logit contributions
 argue-0.114
sing-0.110
 spine-0.104
 whine-0.103
 breath-0.102
Token �
Token ID6184
Feature activation+2.41e+00

Pos logit contributions
aw+0.000
 living+0.000
It+0.000
 that+0.000
 all+0.000
Neg logit contributions
 argue-0.001
sing-0.001
 spine-0.001
 whine-0.001
 breath-0.001
Token�
Token ID95
Feature activation+1.45e+00

Pos logit contributions
 argue+0.054
sing+0.052
 spine+0.049
 breath+0.049
 whine+0.048
Neg logit contributions
aw-0.048
 living-0.047
It-0.047
 that-0.047
 all-0.041
Token€
Token ID26391
Feature activation+3.19e+00

Pos logit contributions
 argue+0.034
sing+0.034
 spine+0.032
 whine+0.031
 breath+0.031
Neg logit contributions
 living-0.027
aw-0.026
It-0.026
 that-0.025
 all-0.023
Token�
Token ID129
Feature activation+2.18e+00

Pos logit contributions
 argue+0.073
sing+0.071
 spine+0.067
 whine+0.064
 breath+0.064
Neg logit contributions
 living-0.060
aw-0.058
It-0.057
 that-0.056
 all-0.053
Token�
Token ID241
Feature activation-1.59e+00

Pos logit contributions
 argue+0.055
sing+0.054
 spine+0.051
 breath+0.050
 whine+0.049
Neg logit contributions
 living-0.036
aw-0.036
It-0.035
 that-0.035
 all-0.032
TokenJimmy
Token ID40335
Feature activation+1.15e+00

Pos logit contributions
 living+0.021
aw+0.021
 that+0.019
It+0.019
 all+0.018
Neg logit contributions
 argue-0.045
sing-0.044
 spine-0.042
 whine-0.041
 breath-0.041
Token,
Token ID11
Feature activation-1.11e+00

Pos logit contributions
 argue+0.031
sing+0.030
 spine+0.028
 whine+0.028
 breath+0.028
Neg logit contributions
 living-0.017
aw-0.017
It-0.016
 that-0.015
 all-0.014
Token watch
Token ID2342
Feature activation-1.47e+00

Pos logit contributions
aw+0.020
 living+0.018
It+0.018
 biggest+0.017
ing+0.016
Neg logit contributions
 argue-0.026
sing-0.025
 whine-0.024
 spine-0.024
 breath-0.023
Token she
Token ID673
Feature activation-1.19e+00

Pos logit contributions
 argue+0.026
sing+0.024
 whine+0.023
 breath+0.022
days+0.022
Neg logit contributions
aw-0.021
It-0.020
 living-0.020
 brand-0.018
ay-0.018
Token never
Token ID1239
Feature activation+2.97e+00

Pos logit contributions
aw+0.016
 living+0.016
It+0.015
 brand+0.014
 that+0.013
Neg logit contributions
 argue-0.033
sing-0.031
 whine-0.029
 breath-0.029
days-0.029
Token forgot
Token ID16453
Feature activation-2.31e+00

Pos logit contributions
 argue+0.070
sing+0.068
 spine+0.064
 whine+0.062
 breath+0.062
Neg logit contributions
 living-0.055
aw-0.054
It-0.053
 that-0.052
 all-0.048
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
It+0.036
aw+0.036
 living+0.035
ay+0.031
That+0.031
Neg logit contributions
 argue-0.057
sing-0.053
 whine-0.051
 breath-0.051
 spine-0.050
Token lesson
Token ID11483
Feature activation+1.82e-01

Pos logit contributions
 living+0.028
It+0.025
 that+0.025
aw+0.025
 all+0.024
Neg logit contributions
 argue-0.044
sing-0.043
 spine-0.041
 whine-0.040
 breath-0.039
Token she
Token ID673
Feature activation+7.37e-01

Pos logit contributions
 argue+0.005
sing+0.005
 spine+0.004
 whine+0.004
 breath+0.004
Neg logit contributions
 living-0.003
aw-0.003
It-0.003
 that-0.002
 brand-0.002
Token had
Token ID550
Feature activation-1.16e+00

Pos logit contributions
 argue+0.020
sing+0.019
 spine+0.018
 whine+0.018
 breath+0.017
Neg logit contributions
 living-0.011
aw-0.011
It-0.011
 that-0.010
 brand-0.009
Token learnt
Token ID26338
Feature activation-1.50e+00

Pos logit contributions
 living+0.018
aw+0.017
It+0.017
 that+0.017
 all+0.015
Neg logit contributions
 argue-0.031
sing-0.031
 spine-0.029
 whine-0.028
 breath-0.028
Token in
Token ID287
Feature activation+2.90e+00

Pos logit contributions
 living+0.026
aw+0.026
It+0.025
 brand+0.023
hes+0.022
Neg logit contributions
 argue-0.034
sing-0.030
 whine-0.030
 breath-0.030
 spine-0.030
Token the
Token ID262
Feature activation-4.72e-01

Pos logit contributions
 argue+0.069
sing+0.062
 breath+0.061
 whine+0.060
 spine+0.057
Neg logit contributions
aw-0.058
It-0.052
 living-0.049
 that-0.047
 brand-0.047
Token park
Token ID3952
Feature activation+1.03e+00

Pos logit contributions
 living+0.007
aw+0.007
It+0.007
 that+0.007
 all+0.006
Neg logit contributions
 argue-0.013
sing-0.012
 spine-0.012
 whine-0.011
 breath-0.011
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
sing+0.037
 argue+0.037
 spine+0.036
 breath+0.035
 whine+0.034
Neg logit contributions
aw-0.031
 living-0.031
It-0.031
hes-0.029
 brand-0.028
Token but
Token ID475
Feature activation+1.65e+00

Pos logit contributions
 argue+0.028
sing+0.027
 spine+0.026
 whine+0.025
 breath+0.025
Neg logit contributions
aw-0.023
 living-0.023
It-0.021
 that-0.020
 brand-0.019
Token the
Token ID262
Feature activation-2.52e+00

Pos logit contributions
 argue+0.046
sing+0.045
 spine+0.042
 whine+0.041
 breath+0.041
Neg logit contributions
 living-0.024
aw-0.023
It-0.023
 that-0.023
 all-0.021
Token dragon
Token ID10441
Feature activation-2.35e+00

Pos logit contributions
aw+0.056
It+0.052
 living+0.052
 that+0.050
 brand+0.047
Neg logit contributions
 argue-0.050
sing-0.050
 spine-0.047
 breath-0.046
 whine-0.044
Token smiled
Token ID13541
Feature activation-1.09e+00

Pos logit contributions
 living+0.030
aw+0.029
It+0.028
 that+0.027
 all+0.024
Neg logit contributions
 argue-0.069
sing-0.068
 spine-0.064
 whine-0.063
 breath-0.063
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
 living+0.017
aw+0.017
It+0.017
 that+0.015
hes+0.014
Neg logit contributions
sing-0.028
 argue-0.027
 spine-0.026
 breath-0.025
 whine-0.024
Token said
Token ID531
Feature activation-4.19e+00

Pos logit contributions
 argue+0.024
sing+0.023
 breath+0.022
 spine+0.022
 whine+0.022
Neg logit contributions
 living-0.018
aw-0.018
It-0.017
 that-0.016
ay-0.015
Token hello
Token ID23748
Feature activation+1.05e+00

Pos logit contributions
 living+0.070
aw+0.069
It+0.068
 that+0.067
 all+0.061
Neg logit contributions
 argue-0.107
sing-0.105
 spine-0.099
 whine-0.096
 breath-0.095
Token.
Token ID13
Feature activation+5.77e-02

Pos logit contributions
 argue+0.031
sing+0.029
 whine+0.028
 spine+0.028
 breath+0.027
Neg logit contributions
 that-0.015
aw-0.014
 living-0.014
It-0.014
 all-0.013
Token
Token ID220
Feature activation-6.46e-01

Pos logit contributions
 argue+0.001
sing+0.001
 spine+0.001
 whine+0.001
 breath+0.001
Neg logit contributions
aw-0.001
 living-0.001
It-0.001
 that-0.001
 all-0.001
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
aw+0.009
 living+0.009
 that+0.008
It+0.008
 brand+0.008
Neg logit contributions
 argue-0.017
sing-0.016
 spine-0.016
 whine-0.015
days-0.015
Token she
Token ID673
Feature activation-1.61e+00

Pos logit contributions
 argue+0.034
sing+0.033
 whine+0.031
 breath+0.031
 spine+0.031
Neg logit contributions
aw-0.025
 living-0.023
It-0.022
 brand-0.020
 biggest-0.020
Token had
Token ID550
Feature activation-2.59e+00

Pos logit contributions
aw+0.030
 living+0.029
It+0.027
 biggest+0.026
 that+0.026
Neg logit contributions
 argue-0.037
sing-0.035
 whine-0.033
 breath-0.033
 spine-0.032
Token been
Token ID587
Feature activation-1.07e+00

Pos logit contributions
aw+0.043
 living+0.043
It+0.042
 that+0.040
 all+0.036
Neg logit contributions
 argue-0.065
sing-0.063
 spine-0.059
 whine-0.058
 breath-0.058
Token so
Token ID523
Feature activation+9.96e-01

Pos logit contributions
aw+0.016
It+0.016
 living+0.015
ay+0.015
That+0.014
Neg logit contributions
 argue-0.028
sing-0.027
 breath-0.026
 whine-0.025
 spine-0.024
Token silly
Token ID14397
Feature activation+1.56e+00

Pos logit contributions
 argue+0.026
sing+0.025
 whine+0.024
 breath+0.024
 spine+0.024
Neg logit contributions
 living-0.015
aw-0.015
It-0.014
 that-0.014
 brand-0.012
Token and
Token ID290
Feature activation+1.98e+00

Pos logit contributions
 argue+0.037
sing+0.035
 spine+0.034
 breath+0.034
 whine+0.033
Neg logit contributions
It-0.027
aw-0.026
 living-0.025
That-0.023
hes-0.023
Token crazy
Token ID7165
Feature activation+1.13e+00

Pos logit contributions
 argue+0.047
sing+0.046
 whine+0.044
 breath+0.043
 spine+0.043
Neg logit contributions
aw-0.035
It-0.031
 living-0.031
ay-0.030
br-0.027
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
 argue+0.026
sing+0.025
 spine+0.024
 breath+0.024
 whine+0.024
Neg logit contributions
It-0.020
aw-0.020
 living-0.020
 that-0.018
That-0.017
Token
Token ID220
Feature activation-2.63e-01

Pos logit contributions
 argue+0.006
sing+0.006
 spine+0.006
 whine+0.005
 breath+0.005
Neg logit contributions
aw-0.004
 living-0.004
 that-0.003
It-0.003
 brand-0.003
Token
Token ID198
Feature activation-2.40e-01

Pos logit contributions
aw+0.004
 living+0.004
 that+0.003
It+0.003
 brand+0.003
Neg logit contributions
 argue-0.007
sing-0.007
 spine-0.006
 whine-0.006
days-0.006
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
aw+0.003
 living+0.003
 that+0.003
It+0.003
 biggest+0.003
Neg logit contributions
 argue-0.006
sing-0.006
 spine-0.006
 whine-0.006
 breath-0.006
Token day
Token ID1110
Feature activation+9.43e-01

Pos logit contributions
 living+0.020
aw+0.020
It+0.019
 that+0.019
 all+0.017
Neg logit contributions
 argue-0.026
sing-0.025
 spine-0.023
 whine-0.023
 breath-0.023
Token G
Token ID402
Feature activation+1.78e+00

Pos logit contributions
 argue+0.022
sing+0.021
 spine+0.020
 breath+0.020
 whine+0.020
Neg logit contributions
 living-0.018
aw-0.017
It-0.017
 that-0.016
 brand-0.015
Tokenia
Token ID544
Feature activation-5.14e-01

Pos logit contributions
 argue+0.035
sing+0.032
 whine+0.031
 spine+0.029
 breath+0.028
Neg logit contributions
aw-0.043
 living-0.043
It-0.042
 that-0.041
 all-0.039
Token would
Token ID561
Feature activation+2.21e+00

Pos logit contributions
 living+0.006
aw+0.006
It+0.006
 that+0.005
 all+0.005
Neg logit contributions
 argue-0.016
sing-0.015
 spine-0.015
 whine-0.014
 breath-0.014
Token explore
Token ID7301
Feature activation-7.17e-01

Pos logit contributions
sing+0.043
 argue+0.043
 spine+0.042
 breath+0.038
 whine+0.037
Neg logit contributions
 living-0.050
 that-0.047
It-0.046
 all-0.046
aw-0.044
Token her
Token ID607
Feature activation+2.64e+00

Pos logit contributions
aw+0.013
It+0.012
 living+0.012
 that+0.011
That+0.011
Neg logit contributions
 argue-0.016
sing-0.015
 breath-0.015
 spine-0.014
 whine-0.014
Token backyard
Token ID24296
Feature activation+3.49e-01

Pos logit contributions
 argue+0.073
sing+0.071
 spine+0.068
 whine+0.066
 breath+0.066
Neg logit contributions
 living-0.038
aw-0.037
It-0.037
 that-0.035
 all-0.032
Token and
Token ID290
Feature activation+3.15e+00

Pos logit contributions
 argue+0.008
sing+0.008
 spine+0.007
 breath+0.007
days+0.007
Neg logit contributions
aw-0.006
It-0.006
 living-0.005
That-0.005
 brand-0.005
Token find
Token ID1064
Feature activation-4.21e+00

Pos logit contributions
 argue+0.053
sing+0.051
 spine+0.046
 whine+0.045
 breath+0.044
Neg logit contributions
aw-0.080
 living-0.079
It-0.078
 that-0.076
 all-0.071
Token something
Token ID1223
Feature activation-2.60e+00

Pos logit contributions
It+0.065
aw+0.063
That+0.062
 living+0.058
ay+0.057
Neg logit contributions
 breath-0.095
 argue-0.090
sing-0.089
 whine-0.086
 resist-0.084
Token new
Token ID649
Feature activation+5.19e-01

Pos logit contributions
aw+0.029
It+0.026
 living+0.026
 that+0.023
That+0.020
Neg logit contributions
 argue-0.079
sing-0.078
 whine-0.073
 spine-0.073
 breath-0.073
Token,
Token ID11
Feature activation+2.21e-01

Pos logit contributions
 living+0.005
aw+0.005
It+0.005
 that+0.005
 all+0.005
Neg logit contributions
 argue-0.009
sing-0.009
 spine-0.008
 whine-0.008
 breath-0.008
Token Lily
Token ID20037
Feature activation-4.02e-01

Pos logit contributions
 argue+0.006
sing+0.006
 spine+0.006
 whine+0.005
 breath+0.005
Neg logit contributions
 that-0.004
 living-0.004
It-0.003
 all-0.003
aw-0.003
Token's
Token ID338
Feature activation-8.43e-01

Pos logit contributions
 living+0.007
It+0.007
aw+0.006
 that+0.006
ay+0.006
Neg logit contributions
 argue-0.010
sing-0.009
 spine-0.009
 whine-0.009
 breath-0.009
Token mom
Token ID1995
Feature activation+3.07e-01

Pos logit contributions
 living+0.020
aw+0.020
It+0.020
 that+0.020
 all+0.018
Neg logit contributions
 argue-0.015
sing-0.015
 spine-0.014
 whine-0.013
 breath-0.013
Token gave
Token ID2921
Feature activation-5.36e+00

Pos logit contributions
 argue+0.007
sing+0.006
 whine+0.006
cker+0.006
days+0.005
Neg logit contributions
aw-0.006
 living-0.006
 biggest-0.006
It-0.006
 that-0.005
Token her
Token ID607
Feature activation+5.79e-01

Pos logit contributions
 living+0.114
aw+0.113
It+0.111
 that+0.106
 brand+0.100
Neg logit contributions
 argue-0.110
sing-0.102
 breath-0.098
 whine-0.097
 spine-0.095
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
 argue+0.018
sing+0.018
 spine+0.017
 whine+0.016
 breath+0.016
Neg logit contributions
 living-0.007
aw-0.006
It-0.006
 that-0.006
 all-0.005
Token snack
Token ID26906
Feature activation-1.93e+00

Pos logit contributions
 living+0.024
aw+0.023
It+0.023
 that+0.022
 all+0.020
Neg logit contributions
 argue-0.051
sing-0.050
 spine-0.047
 whine-0.046
 breath-0.046
Token to
Token ID284
Feature activation+3.60e+00

Pos logit contributions
 living+0.031
aw+0.031
It+0.030
 that+0.029
 all+0.027
Neg logit contributions
 argue-0.049
sing-0.048
 spine-0.046
 whine-0.044
 breath-0.044
Token feed
Token ID3745
Feature activation-2.42e+00

Pos logit contributions
 argue+0.063
sing+0.061
 spine+0.056
 whine+0.054
 breath+0.053
Neg logit contributions
 living-0.089
aw-0.087
It-0.086
 that-0.085
 all-0.080
Token her
Token ID607
Feature activation+2.10e+00

Pos logit contributions
 living+0.039
aw+0.038
It+0.038
 that+0.037
 all+0.033
Neg logit contributions
 argue-0.062
sing-0.061
 spine-0.057
 whine-0.056
 breath-0.056
Token could
Token ID714
Feature activation+3.32e+00

Pos logit contributions
aw+0.011
 living+0.011
It+0.011
 that+0.010
 brand+0.009
Neg logit contributions
 argue-0.021
sing-0.021
 spine-0.020
 whine-0.020
 breath-0.019
Token go
Token ID467
Feature activation+2.57e+00

Pos logit contributions
sing+0.066
 spine+0.063
 argue+0.063
 whine+0.058
days+0.056
Neg logit contributions
 that-0.074
 living-0.071
It-0.071
 all-0.070
aw-0.064
Token near
Token ID1474
Feature activation+5.89e-01

Pos logit contributions
 argue+0.068
sing+0.067
 spine+0.063
 whine+0.061
 breath+0.061
Neg logit contributions
 living-0.041
aw-0.039
 that-0.039
It-0.039
 all-0.035
Token the
Token ID262
Feature activation+2.76e-01

Pos logit contributions
 argue+0.016
sing+0.016
 spine+0.015
 whine+0.015
 breath+0.015
Neg logit contributions
 living-0.009
aw-0.008
It-0.008
 that-0.008
 all-0.007
Token beast
Token ID13824
Feature activation+3.10e+00

Pos logit contributions
 argue+0.006
sing+0.006
 spine+0.006
 breath+0.006
 whine+0.005
Neg logit contributions
It-0.005
aw-0.005
 living-0.005
hes-0.005
 brand-0.005
Token because
Token ID780
Feature activation+3.64e-01

Pos logit contributions
 argue+0.081
sing+0.080
 spine+0.076
 whine+0.074
 breath+0.074
Neg logit contributions
 living-0.049
 that-0.047
aw-0.047
It-0.045
 all-0.043
Token it
Token ID340
Feature activation-4.77e+00

Pos logit contributions
 argue+0.009
sing+0.009
 spine+0.009
 breath+0.009
 whine+0.009
Neg logit contributions
aw-0.005
 living-0.005
It-0.005
 brand-0.004
That-0.004
Token was
Token ID373
Feature activation-2.02e+00

Pos logit contributions
aw+0.063
It+0.058
 living+0.057
 brand+0.054
ay+0.051
Neg logit contributions
 argue-0.136
sing-0.130
 whine-0.123
 breath-0.122
 spine-0.121
Token too
Token ID1165
Feature activation+1.08e+00

Pos logit contributions
aw+0.030
It+0.028
 living+0.027
ay+0.024
 that+0.024
Neg logit contributions
 argue-0.055
sing-0.055
 breath-0.051
 whine-0.050
 spine-0.050
Token fierce
Token ID14800
Feature activation+2.08e+00

Pos logit contributions
 argue+0.030
sing+0.029
 spine+0.028
 whine+0.027
 breath+0.027
Neg logit contributions
 living-0.016
aw-0.015
It-0.015
 that-0.015
 all-0.013
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 argue+0.047
sing+0.046
 spine+0.044
 breath+0.042
 whine+0.042
Neg logit contributions
It-0.039
aw-0.038
 living-0.037
That-0.034
 that-0.033
Token best
Token ID1266
Feature activation+2.15e+00

Pos logit contributions
 living+0.041
aw+0.039
It+0.039
 that+0.037
 all+0.033
Neg logit contributions
 argue-0.119
sing-0.117
 spine-0.113
 whine-0.109
 breath-0.108
Token friends
Token ID2460
Feature activation-2.47e-01

Pos logit contributions
 argue+0.066
sing+0.064
 spine+0.062
 whine+0.060
 breath+0.058
Neg logit contributions
 living-0.027
aw-0.026
 that-0.025
It-0.024
 all-0.023
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 living+0.004
aw+0.004
It+0.004
 that+0.004
 all+0.004
Neg logit contributions
 argue-0.006
sing-0.006
 spine-0.006
 whine-0.005
 breath-0.005
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
 argue+0.052
sing+0.051
 spine+0.049
 whine+0.048
 breath+0.047
Neg logit contributions
 living-0.020
aw-0.019
It-0.019
 that-0.019
 all-0.017
Token they
Token ID484
Feature activation-4.48e-01

Pos logit contributions
 argue+0.027
sing+0.026
 spine+0.024
 whine+0.024
 breath+0.024
Neg logit contributions
 living-0.023
aw-0.023
It-0.022
 that-0.021
 brand-0.020
Token played
Token ID2826
Feature activation-2.01e+00

Pos logit contributions
 living+0.006
aw+0.006
It+0.006
 that+0.006
 all+0.005
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.012
 whine-0.011
 breath-0.011
Token together
Token ID1978
Feature activation+2.35e+00

Pos logit contributions
 living+0.029
aw+0.028
It+0.028
 that+0.027
 brand+0.024
Neg logit contributions
 argue-0.054
sing-0.053
 breath-0.050
 spine-0.049
 whine-0.048
Token every
Token ID790
Feature activation+5.08e-01

Pos logit contributions
 argue+0.069
sing+0.068
 spine+0.065
 whine+0.063
 breath+0.063
Neg logit contributions
 living-0.029
aw-0.029
It-0.028
 that-0.027
 all-0.024
Token day
Token ID1110
Feature activation+3.85e+00

Pos logit contributions
 argue+0.013
sing+0.012
 spine+0.012
 whine+0.011
 breath+0.011
Neg logit contributions
 living-0.009
It-0.009
aw-0.009
 that-0.008
 all-0.008
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 argue+0.090
sing+0.088
 spine+0.083
 breath+0.080
 whine+0.080
Neg logit contributions
 living-0.070
aw-0.069
It-0.069
 that-0.065
 all-0.060
Token<|endoftext|>
Token ID50256
Feature activation-1.16e+00

Pos logit contributions
aw+0.016
 living+0.016
It+0.015
 brand+0.015
 that+0.014
Neg logit contributions
 argue-0.025
sing-0.025
 spine-0.023
 whine-0.023
days-0.022
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 living+0.030
aw+0.030
It+0.030
 that+0.027
 all+0.025
Neg logit contributions
 argue-0.038
sing-0.038
 spine-0.035
 breath-0.034
 whine-0.034
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
aw+0.002
 living+0.002
 that+0.002
It+0.002
 all+0.002
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.004
 breath-0.004
Token
Token ID198
Feature activation-6.15e-01

Pos logit contributions
 living+0.016
aw+0.016
It+0.015
 that+0.015
 all+0.013
Neg logit contributions
 argue-0.037
sing-0.036
 spine-0.034
 whine-0.034
 breath-0.033
TokenFinally
Token ID11158
Feature activation+9.00e-01

Pos logit contributions
 living+0.008
aw+0.008
It+0.008
 that+0.008
 all+0.007
Neg logit contributions
 argue-0.017
sing-0.017
 spine-0.016
 whine-0.016
 breath-0.016
Token,
Token ID11
Feature activation+3.29e-01

Pos logit contributions
 argue+0.023
sing+0.022
 spine+0.021
 whine+0.020
 breath+0.020
Neg logit contributions
 living-0.015
aw-0.015
It-0.015
 that-0.015
 all-0.014
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
 argue+0.009
sing+0.009
 spine+0.009
 whine+0.008
 breath+0.008
Neg logit contributions
 that-0.005
 living-0.005
It-0.005
aw-0.005
 all-0.004
Token soldier
Token ID10686
Feature activation-1.39e+00

Pos logit contributions
aw+0.049
It+0.045
 living+0.043
hes+0.042
 that+0.041
Neg logit contributions
 argue-0.032
 breath-0.031
sing-0.031
 spine-0.030
 whine-0.028
Token spotted
Token ID13489
Feature activation-3.72e+00

Pos logit contributions
 living+0.019
aw+0.019
It+0.018
 that+0.018
 all+0.016
Neg logit contributions
 argue-0.039
sing-0.039
 spine-0.037
 whine-0.036
 breath-0.036
Token some
Token ID617
Feature activation-2.14e+00

Pos logit contributions
 living+0.058
It+0.057
aw+0.054
That+0.049
 that+0.048
Neg logit contributions
 argue-0.091
sing-0.091
 breath-0.085
 spine-0.085
 whine-0.082
Token candy
Token ID18550
Feature activation-2.00e+00

Pos logit contributions
It+0.029
 living+0.027
aw+0.025
That+0.025
 that+0.024
Neg logit contributions
 argue-0.054
 breath-0.054
sing-0.053
 whine-0.052
 resist-0.049
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
It+0.033
aw+0.032
 living+0.031
 Everything+0.029
That+0.029
Neg logit contributions
 argue-0.045
sing-0.044
 breath-0.043
 spine-0.042
comb-0.041
Tokenite
Token ID578
Feature activation-1.12e+00

Pos logit contributions
 argue+0.015
sing+0.015
 spine+0.014
 breath+0.014
 whine+0.013
Neg logit contributions
 living-0.015
aw-0.015
It-0.015
 that-0.015
 all-0.014
Token in
Token ID287
Feature activation+1.74e+00

Pos logit contributions
 living+0.018
aw+0.018
It+0.018
 that+0.016
 brand+0.016
Neg logit contributions
 argue-0.028
sing-0.027
 spine-0.026
 breath-0.025
 whine-0.025
Token the
Token ID262
Feature activation-4.78e-01

Pos logit contributions
 argue+0.047
sing+0.046
 spine+0.043
 whine+0.042
 breath+0.042
Neg logit contributions
 living-0.027
aw-0.026
It-0.025
 that-0.025
 all-0.022
Token tree
Token ID5509
Feature activation+1.21e+00

Pos logit contributions
aw+0.007
It+0.007
 that+0.007
 living+0.006
ing+0.006
Neg logit contributions
 argue-0.012
sing-0.012
 spine-0.012
 whine-0.011
 breath-0.011
Token.
Token ID13
Feature activation-6.87e-01

Pos logit contributions
 argue+0.027
sing+0.026
 spine+0.025
 whine+0.024
 breath+0.023
Neg logit contributions
It-0.023
 living-0.023
aw-0.023
 that-0.021
 brand-0.021
Token He
Token ID679
Feature activation-2.49e+00

Pos logit contributions
 living+0.009
aw+0.009
 that+0.008
It+0.008
 all+0.007
Neg logit contributions
 argue-0.019
sing-0.019
 spine-0.018
 whine-0.018
 breath-0.017
Token was
Token ID373
Feature activation-3.44e+00

Pos logit contributions
 living+0.037
 biggest+0.035
aw+0.034
ay+0.034
 brand+0.032
Neg logit contributions
 argue-0.063
sing-0.060
 whine-0.058
 breath-0.058
cker-0.054
Token not
Token ID407
Feature activation+5.23e-01

Pos logit contributions
aw+0.061
 living+0.058
It+0.057
ay+0.053
That+0.053
Neg logit contributions
sing-0.083
 argue-0.080
 breath-0.076
 whine-0.074
 spine-0.071
Token happy
Token ID3772
Feature activation+1.66e+00

Pos logit contributions
 argue+0.014
sing+0.014
 spine+0.013
 whine+0.013
 breath+0.012
Neg logit contributions
 living-0.008
aw-0.008
It-0.008
 that-0.008
 all-0.007
Token.
Token ID13
Feature activation-7.48e-01

Pos logit contributions
 argue+0.043
sing+0.043
 spine+0.041
 whine+0.039
 breath+0.039
Neg logit contributions
 living-0.028
 that-0.027
aw-0.026
It-0.025
 all-0.024
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
 living+0.010
aw+0.010
 that+0.009
It+0.009
 all+0.009
Neg logit contributions
 argue-0.020
sing-0.020
 spine-0.019
 whine-0.018
days-0.018
Token friendly
Token ID8030
Feature activation+5.29e-01

Pos logit contributions
 living+0.010
It+0.009
aw+0.009
 that+0.009
 all+0.007
Neg logit contributions
 argue-0.023
sing-0.023
 spine-0.022
 whine-0.021
 breath-0.021
Token bug
Token ID5434
Feature activation+1.47e-01

Pos logit contributions
 argue+0.014
sing+0.013
 spine+0.013
 whine+0.012
days+0.012
Neg logit contributions
 living-0.009
 that-0.009
aw-0.009
It-0.009
 all-0.008
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 argue+0.004
sing+0.004
 spine+0.004
 whine+0.004
 breath+0.004
Neg logit contributions
 living-0.003
 that-0.002
aw-0.002
It-0.002
 all-0.002
Token The
Token ID383
Feature activation-2.29e+00

Pos logit contributions
 living+0.003
 that+0.003
aw+0.003
It+0.003
 all+0.003
Neg logit contributions
 argue-0.007
sing-0.007
 spine-0.007
 breath-0.006
 whine-0.006
Token bug
Token ID5434
Feature activation-3.36e+00

Pos logit contributions
 living+0.048
It+0.047
 that+0.046
aw+0.046
 all+0.043
Neg logit contributions
 argue-0.050
sing-0.048
 spine-0.045
 whine-0.044
days-0.042
Token said
Token ID531
Feature activation-4.40e+00

Pos logit contributions
 living+0.045
aw+0.045
It+0.044
 that+0.041
 brand+0.036
Neg logit contributions
 argue-0.093
sing-0.091
 spine-0.085
 whine-0.085
 breath-0.084
Token,
Token ID11
Feature activation+7.85e-02

Pos logit contributions
 that+0.065
 living+0.056
 all+0.052
aw+0.051
It+0.050
Neg logit contributions
 argue-0.136
 spine-0.133
sing-0.133
 whine-0.125
days-0.124
Token "
Token ID366
Feature activation-3.41e+00

Pos logit contributions
 argue+0.003
sing+0.003
 spine+0.002
 whine+0.002
 breath+0.002
Neg logit contributions
 living-0.001
aw-0.001
It-0.001
 that-0.001
 all-0.001
TokenHi
Token ID17250
Feature activation+8.94e-02

Pos logit contributions
 living+0.051
It+0.051
aw+0.051
 that+0.048
 all+0.043
Neg logit contributions
 argue-0.093
sing-0.092
 spine-0.087
 whine-0.084
 breath-0.084
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
 argue+0.002
sing+0.002
 spine+0.002
 whine+0.002
days+0.002
Neg logit contributions
 living-0.001
aw-0.001
 that-0.001
It-0.001
 all-0.001
Token lizard
Token ID42406
Feature activation+7.49e-01

Pos logit contributions
sing+0.008
 argue+0.008
 breath+0.007
 spine+0.007
 whine+0.007
Neg logit contributions
aw-0.006
 living-0.005
It-0.005
hes-0.005
 brand-0.005
Token they
Token ID484
Feature activation-1.31e+00

Pos logit contributions
 argue+0.026
sing+0.025
 whine+0.023
 breath+0.023
 spine+0.023
Neg logit contributions
aw-0.019
It-0.019
 living-0.019
 that-0.017
ay-0.016
Token were
Token ID547
Feature activation-2.80e+00

Pos logit contributions
aw+0.015
 living+0.014
It+0.014
 that+0.012
 biggest+0.011
Neg logit contributions
 argue-0.040
sing-0.038
 breath-0.037
 whine-0.036
days-0.035
Token so
Token ID523
Feature activation+5.75e-01

Pos logit contributions
aw+0.032
It+0.031
 living+0.030
That+0.026
 that+0.026
Neg logit contributions
 argue-0.083
sing-0.081
 breath-0.078
 whine-0.075
days-0.073
Token impressed
Token ID12617
Feature activation-2.29e+00

Pos logit contributions
 argue+0.014
sing+0.013
 breath+0.012
 whine+0.012
days+0.012
Neg logit contributions
aw-0.010
 living-0.010
It-0.010
 that-0.009
 sky-0.009
Token with
Token ID351
Feature activation-2.76e+00

Pos logit contributions
 living+0.035
It+0.033
aw+0.032
 that+0.029
 brand+0.028
Neg logit contributions
 argue-0.059
sing-0.058
 spine-0.055
 breath-0.054
 whine-0.053
Token what
Token ID644
Feature activation-3.57e+00

Pos logit contributions
aw+0.053
 living+0.052
It+0.052
br+0.049
ing+0.047
Neg logit contributions
 argue-0.058
 breath-0.056
sing-0.052
 whine-0.052
 Jen-0.047
Token Bob
Token ID5811
Feature activation+1.37e+00

Pos logit contributions
aw+0.058
 living+0.057
It+0.055
 that+0.052
 brand+0.048
Neg logit contributions
 argue-0.091
sing-0.086
 whine-0.082
 spine-0.081
 breath-0.081
Token had
Token ID550
Feature activation-1.17e+00

Pos logit contributions
 argue+0.039
sing+0.038
 whine+0.036
 spine+0.036
 breath+0.036
Neg logit contributions
 living-0.017
It-0.016
aw-0.016
 that-0.015
 brand-0.014
Token done
Token ID1760
Feature activation+2.87e-02

Pos logit contributions
aw+0.019
 living+0.019
It+0.018
 that+0.017
 biggest+0.017
Neg logit contributions
 argue-0.029
sing-0.026
 breath-0.026
 spine-0.025
days-0.024
Token.
Token ID13
Feature activation-3.00e-01

Pos logit contributions
 argue+0.001
sing+0.001
 spine+0.001
 breath+0.001
 whine+0.001
Neg logit contributions
aw-0.001
It-0.001
 living-0.001
 that-0.000
 biggest-0.000
Token They
Token ID1119
Feature activation-1.41e+00

Pos logit contributions
 living+0.004
aw+0.004
It+0.004
 that+0.004
 all+0.004
Neg logit contributions
 argue-0.008
sing-0.008
 spine-0.008
 whine-0.007
 breath-0.007
Token big
Token ID1263
Feature activation-5.29e-01

Pos logit contributions
aw+0.027
 living+0.023
It+0.022
 that+0.021
hes+0.019
Neg logit contributions
sing-0.074
 argue-0.074
 breath-0.069
 spine-0.069
 whine-0.068
Token slide
Token ID10649
Feature activation-8.26e-01

Pos logit contributions
aw+0.009
It+0.009
 living+0.009
 that+0.008
hes+0.008
Neg logit contributions
 argue-0.013
sing-0.013
 spine-0.012
 breath-0.012
 whine-0.012
Token and
Token ID290
Feature activation+1.71e+00

Pos logit contributions
 living+0.014
It+0.014
 brand+0.014
aw+0.014
 biggest+0.013
Neg logit contributions
 argue-0.018
sing-0.018
 spine-0.017
 whine-0.016
 breath-0.016
Token they
Token ID484
Feature activation-7.23e-01

Pos logit contributions
 argue+0.041
sing+0.038
 whine+0.037
 breath+0.037
 spine+0.035
Neg logit contributions
 living-0.031
aw-0.030
It-0.028
 that-0.027
ay-0.026
Token all
Token ID477
Feature activation-2.59e-01

Pos logit contributions
 living+0.011
aw+0.011
It+0.011
 that+0.011
 all+0.009
Neg logit contributions
 argue-0.019
sing-0.019
 spine-0.018
 whine-0.017
 breath-0.017
Token took
Token ID1718
Feature activation-3.67e+00

Pos logit contributions
 living+0.004
aw+0.004
It+0.004
 that+0.004
 all+0.003
Neg logit contributions
 argue-0.007
sing-0.007
 spine-0.006
 whine-0.006
 breath-0.006
Token turns
Token ID4962
Feature activation+1.28e+00

Pos logit contributions
 living+0.045
aw+0.045
It+0.043
 that+0.042
 all+0.038
Neg logit contributions
 argue-0.110
sing-0.109
 spine-0.104
 whine-0.100
 breath-0.099
Token sliding
Token ID22292
Feature activation+5.17e-01

Pos logit contributions
 argue+0.032
sing+0.031
 spine+0.029
 whine+0.028
 breath+0.028
Neg logit contributions
 living-0.022
aw-0.021
It-0.021
 that-0.021
 all-0.019
Token down
Token ID866
Feature activation+7.61e-01

Pos logit contributions
 argue+0.013
sing+0.012
 breath+0.011
 spine+0.011
 whine+0.011
Neg logit contributions
 living-0.009
aw-0.009
It-0.009
 that-0.008
 brand-0.008
Token it
Token ID340
Feature activation-7.45e-01

Pos logit contributions
 argue+0.020
sing+0.020
 spine+0.019
 whine+0.018
 breath+0.018
Neg logit contributions
 living-0.012
aw-0.012
It-0.012
 that-0.012
 all-0.011
Token.
Token ID13
Feature activation-9.84e-03

Pos logit contributions
 living+0.015
aw+0.014
It+0.014
 that+0.013
 brand+0.013
Neg logit contributions
 argue-0.016
sing-0.016
 spine-0.015
 breath-0.014
 whine-0.014
Token fisher
Token ID17685
Feature activation-5.39e-01

Pos logit contributions
 living+0.036
aw+0.036
It+0.035
 that+0.034
 all+0.032
Neg logit contributions
 argue-0.049
sing-0.048
 spine-0.045
 whine-0.044
 breath-0.044
Tokens
Token ID82
Feature activation-1.73e+00

Pos logit contributions
 living+0.010
aw+0.009
It+0.009
 that+0.009
 all+0.008
Neg logit contributions
 argue-0.013
sing-0.013
 spine-0.012
 whine-0.011
 breath-0.011
Token heard
Token ID2982
Feature activation-2.50e+00

Pos logit contributions
 living+0.030
It+0.029
aw+0.029
 that+0.028
That+0.025
Neg logit contributions
 argue-0.042
sing-0.042
 spine-0.038
 whine-0.038
 breath-0.038
Token Jack
Token ID3619
Feature activation+4.15e+00

Pos logit contributions
aw+0.039
 living+0.039
 that+0.039
It+0.038
 all+0.035
Neg logit contributions
 argue-0.066
sing-0.066
 spine-0.063
 whine-0.060
 breath-0.059
Token and
Token ID290
Feature activation+2.58e+00

Pos logit contributions
sing+0.137
 argue+0.137
 spine+0.132
 breath+0.127
 whine+0.127
Neg logit contributions
 living-0.036
 that-0.033
aw-0.031
It-0.030
 all-0.026
Token saw
Token ID2497
Feature activation-4.57e+00

Pos logit contributions
 argue+0.055
sing+0.053
 spine+0.050
 whine+0.048
 breath+0.048
Neg logit contributions
 living-0.054
aw-0.053
It-0.052
 that-0.051
 all-0.048
Token Tom
Token ID4186
Feature activation+3.06e+00

Pos logit contributions
 living+0.089
It+0.087
aw+0.087
 biggest+0.080
br+0.080
Neg logit contributions
 argue-0.091
 breath-0.086
sing-0.085
 whine-0.080
 Jen-0.075
Token's
Token ID338
Feature activation+7.07e-01

Pos logit contributions
 argue+0.091
sing+0.091
 spine+0.087
 whine+0.084
 breath+0.084
Neg logit contributions
 living-0.038
aw-0.035
 that-0.035
It-0.034
 all-0.030
Token boat
Token ID8848
Feature activation-9.44e-01

Pos logit contributions
 argue+0.016
sing+0.016
 spine+0.015
 whine+0.015
days+0.015
Neg logit contributions
 living-0.014
 that-0.013
aw-0.012
It-0.012
 all-0.012
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
 living+0.016
It+0.016
aw+0.016
 that+0.015
That+0.014
Neg logit contributions
 argue-0.023
sing-0.023
 spine-0.021
 breath-0.021
 whine-0.021
Token They
Token ID1119
Feature activation-1.04e+00

Pos logit contributions
 living+0.001
aw+0.001
It+0.001
 that+0.001
 all+0.001
Neg logit contributions
 argue-0.004
sing-0.004
 spine-0.004
 whine-0.004
 breath-0.004
Token clown
Token ID25573
Feature activation-2.01e+00

Pos logit contributions
aw+0.063
It+0.061
 living+0.060
 that+0.057
hes+0.055
Neg logit contributions
 argue-0.070
sing-0.068
 breath-0.064
 whine-0.063
 spine-0.063
Token is
Token ID318
Feature activation-2.04e+00

Pos logit contributions
 living+0.021
It+0.020
aw+0.020
 that+0.018
 brand+0.016
Neg logit contributions
 argue-0.062
sing-0.061
 spine-0.057
 whine-0.057
 breath-0.057
Token not
Token ID407
Feature activation+8.86e-01

Pos logit contributions
aw+0.030
It+0.028
 living+0.027
That+0.025
ing+0.024
Neg logit contributions
sing-0.054
 argue-0.053
 whine-0.049
 spine-0.049
 breath-0.048
Token real
Token ID1103
Feature activation-3.39e-01

Pos logit contributions
 argue+0.025
sing+0.024
 spine+0.023
 breath+0.023
 whine+0.023
Neg logit contributions
aw-0.012
 living-0.012
It-0.012
 that-0.011
That-0.010
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 living+0.006
It+0.006
aw+0.006
 that+0.006
 all+0.005
Neg logit contributions
 argue-0.008
sing-0.008
 spine-0.007
 whine-0.007
 breath-0.007
Token It
Token ID632
Feature activation-5.02e+00

Pos logit contributions
 living+0.013
aw+0.012
It+0.012
 that+0.011
 all+0.010
Neg logit contributions
 argue-0.030
sing-0.029
 spine-0.028
 whine-0.027
 breath-0.026
Token is
Token ID318
Feature activation-2.45e+00

Pos logit contributions
aw+0.046
It+0.045
 living+0.041
 brand+0.041
ay+0.037
Neg logit contributions
 argue-0.160
sing-0.153
 whine-0.145
 drown-0.141
 breath-0.141
Token just
Token ID655
Feature activation+6.43e-01

Pos logit contributions
aw+0.036
It+0.035
 living+0.033
That+0.030
 that+0.029
Neg logit contributions
sing-0.067
 argue-0.065
 spine-0.060
 whine-0.060
 breath-0.059
Token a
Token ID257
Feature activation-2.33e+00

Pos logit contributions
sing+0.018
 argue+0.017
 spine+0.016
 breath+0.016
 whine+0.016
Neg logit contributions
It-0.009
aw-0.009
 living-0.009
That-0.008
 that-0.008
Token picture
Token ID4286
Feature activation-3.76e+00

Pos logit contributions
aw+0.036
 living+0.036
It+0.035
 that+0.033
 all+0.030
Neg logit contributions
 argue-0.062
sing-0.060
 spine-0.057
 whine-0.056
 breath-0.055
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
It+0.048
aw+0.046
 living+0.044
 brand+0.040
That+0.039
Neg logit contributions
 argue-0.103
sing-0.103
 spine-0.099
 breath-0.095
days-0.095