TOP ACTIVATIONS
MAX = 13.133

and so she threw a tant rum when her parents said
She cried and threw a tant rum . Her mom tried
to cry and threw a tant rum . She didn 't
her feet and threw a tant rum . She grabbed the
listen . She threw a tant rum in the store .
sad . She threw a tant rum and cried loudly .
and started to throw a tant rum . Suddenly , she
kept crying and throwing a tant rum . Finally
. He kept throwing a tant rum . His mom was
them and never threw a tant rum again . <|endoftext|> Once
his feet and threw a tant rum . "
temper and often threw temper tant rums . One day ,
surprise . He threw a tant rum . He kicked and
stubborn . She throws a tant rum . She kicks and
she could carry the or nam ents back with her .
a big box of or nam ents . They were so
branches . But the or nam ents were slippery and some
They reached for the or nam ents and touched them softly
here and explore the or nam ents ." Da
the lock and the or nam ents . "

MIN ACTIVATIONS
MIN = -12.145

, Pete 's family was planning to go on a special
Finally , after much planning , the group set off
ens . Mitt ens was planning to catch Tim my and
M ummy and Daddy were planning a picnic . "
excited . He had been planning this trip for months .
So the creative club started planning for their game . They
it took a lot of planning and hard work , things
and her mom my were planning treatment for Buster . Mia
S ue and Mike were planning a picnic in the park
birthday . Her mom was planning to make a special banana
know , Grand ma was planning a surprise for him .
. Tim my realized that planning and hard work can help
mom said that she was planning to sell some of Emily
and her mom my were planning a very special day .
. The family had been planning for months . They mailed
Mary and her family were planning a trip to visit far
lesson . With a little planning and compassion , you can
'd had lots of fun planning all the way . <|endoftext|>
pa and Joe had been planning for a long time .
took out her map , planning out the next place to

INTERVAL 6.814 - 13.133
CONTAINS 0.238%

book . Even after looking everywhere , she never got it
, let 's play something else ." The oct
� Let â € ™ s fight to see who can
their feathers and the squirrel s build nests in the trees
, her mom lit the candles on the cake and said

INTERVAL 0.494 - 6.814
CONTAINS 31.169%

carried a basket and her grandma carried a big bucket .
need something new . We can pick a new game ,
and he wanted to have lots of fun with them .
had long legs . " Look , mom my ! The
is perfect ." The little boy smiled at the

INTERVAL -5.825 - 0.494
CONTAINS 67.680%

back to play with her rope . <|endoftext|> One summer day
and dad got mad at them for losing the juice and
best friends . They liked to play in the park every
. He does not see the car . The car is
was thankful for the special lesson about being unique . <|endoftext|>

INTERVAL -12.145 - -5.825
CONTAINS 0.912%

was so happy with her creation . But then
would never be so angry again . The moral
saw some children playing and having fun . Joe asked Jane
in her pocket and kept exploring . She looked under leaves
explain that he was only trying to look at what was
Token and
Token ID290
Feature activation-6.36e-01

Pos logit contributions
 planning+0.036
reetings+0.036
alis+0.033
 solution+0.033
 possible+0.032
Neg logit contributions
s-0.020
 else-0.020
 are-0.019
bs-0.018
nam-0.018
Token so
Token ID523
Feature activation+6.04e-01

Pos logit contributions
 planning+0.056
reetings+0.050
 possible+0.048
alis+0.048
 solution+0.048
Neg logit contributions
s-0.030
 else-0.027
 tant-0.027
 are-0.026
cream-0.026
Token she
Token ID673
Feature activation-3.25e-01

Pos logit contributions
cream+0.038
 tant+0.036
 trash+0.034
controlled+0.034
s+0.033
Neg logit contributions
 planning-0.043
reetings-0.038
 possible-0.037
joy-0.036
 spending-0.036
Token threw
Token ID9617
Feature activation+9.01e-01

Pos logit contributions
 planning+0.027
 example+0.023
alis+0.023
 solution+0.023
reetings+0.022
Neg logit contributions
nam-0.017
 are-0.016
bs-0.016
s-0.016
 tant-0.015
Token a
Token ID257
Feature activation-8.92e-01

Pos logit contributions
s+0.065
 else+0.062
 Sorry+0.061
cream+0.059
bs+0.058
Neg logit contributions
 planning-0.051
joy-0.049
 example-0.048
 possible-0.046
 addition-0.046
Token tant
Token ID24246
Feature activation+1.31e+01

Pos logit contributions
 solution+0.069
 planning+0.060
 addition+0.058
joy+0.058
 possible+0.055
Neg logit contributions
cream-0.049
nam-0.047
 are-0.044
 Sorry-0.044
 were-0.044
Tokenrum
Token ID6582
Feature activation+9.24e-02

Pos logit contributions
s+0.978
 else+0.965
nam+0.936
bs+0.934
 are+0.920
Neg logit contributions
 planning-0.799
reetings-0.751
 solution-0.680
alis-0.678
joy-0.675
Token when
Token ID618
Feature activation+2.87e+00

Pos logit contributions
 else+0.004
s+0.004
nam+0.004
bs+0.004
 are+0.004
Neg logit contributions
 planning-0.008
reetings-0.008
joy-0.007
 possible-0.007
 example-0.007
Token her
Token ID607
Feature activation+9.01e-02

Pos logit contributions
s+0.145
 else+0.140
bs+0.134
nam+0.133
 are+0.131
Neg logit contributions
 planning-0.245
reetings-0.230
joy-0.218
 possible-0.217
 solution-0.215
Token parents
Token ID3397
Feature activation+3.28e+00

Pos logit contributions
nam+0.004
s+0.004
bs+0.004
 are+0.004
 else+0.004
Neg logit contributions
 planning-0.008
 solution-0.007
joy-0.007
 example-0.007
 self-0.007
Token said
Token ID531
Feature activation+1.07e+00

Pos logit contributions
nam+0.161
 tant+0.140
bs+0.138
s+0.137
apy+0.136
Neg logit contributions
 planning-0.293
reetings-0.257
 solution-0.256
 known-0.242
 joining-0.240
Token She
Token ID1375
Feature activation-6.68e-01

Pos logit contributions
s+0.015
bs+0.013
 else+0.013
nam+0.013
 are+0.012
Neg logit contributions
 planning-0.025
reetings-0.025
 possible-0.024
 solution-0.023
alis-0.022
Token cried
Token ID16896
Feature activation+5.61e+00

Pos logit contributions
 planning+0.061
reetings+0.056
 solution+0.054
 example+0.054
alis+0.053
Neg logit contributions
s-0.029
nam-0.029
 else-0.028
bs-0.027
 are-0.027
Token and
Token ID290
Feature activation-6.53e-01

Pos logit contributions
s+0.294
 else+0.291
nam+0.280
bs+0.274
 are+0.272
Neg logit contributions
 planning-0.465
reetings-0.443
 solution-0.414
joy-0.412
alis-0.409
Token threw
Token ID9617
Feature activation+8.77e-01

Pos logit contributions
 planning+0.060
 self+0.053
 example+0.053
 solution+0.052
 spending+0.052
Neg logit contributions
s-0.028
 else-0.026
 are-0.025
cream-0.024
bs-0.023
Token a
Token ID257
Feature activation-9.18e-01

Pos logit contributions
s+0.058
 else+0.054
cream+0.053
nam+0.051
 Sorry+0.051
Neg logit contributions
 planning-0.058
 solution-0.053
 example-0.053
joy-0.053
alis-0.051
Token tant
Token ID24246
Feature activation+1.31e+01

Pos logit contributions
 solution+0.082
 planning+0.076
joy+0.074
 addition+0.070
reetings+0.068
Neg logit contributions
cream-0.040
nam-0.035
 are-0.034
 Sorry-0.034
 both-0.033
Tokenrum
Token ID6582
Feature activation+7.34e-02

Pos logit contributions
s+0.987
 else+0.975
nam+0.950
bs+0.943
 are+0.935
Neg logit contributions
 planning-0.783
reetings-0.739
 solution-0.662
joy-0.656
alis-0.655
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
s+0.003
 else+0.003
nam+0.003
bs+0.003
 are+0.003
Neg logit contributions
 planning-0.007
reetings-0.007
 solution-0.006
joy-0.006
 possible-0.006
Token Her
Token ID2332
Feature activation-7.32e-01

Pos logit contributions
s+0.017
bs+0.016
 else+0.016
nam+0.015
 are+0.015
Neg logit contributions
 planning-0.025
reetings-0.024
 possible-0.023
 solution-0.022
joy-0.022
Token mom
Token ID1995
Feature activation+5.33e+00

Pos logit contributions
 planning+0.064
 solution+0.060
joy+0.059
 self+0.058
 example+0.056
Neg logit contributions
s-0.035
cream-0.034
bs-0.033
 else-0.033
 are-0.032
Token tried
Token ID3088
Feature activation-1.29e+00

Pos logit contributions
nam+0.265
s+0.241
bs+0.239
 else+0.235
 are+0.231
Neg logit contributions
 planning-0.464
reetings-0.419
 solution-0.413
 example-0.400
alis-0.397
Token to
Token ID284
Feature activation-7.68e-01

Pos logit contributions
s+0.020
 else+0.019
 tug+0.018
nam+0.018
bs+0.017
Neg logit contributions
reetings-0.051
 possible-0.043
 Fairy-0.042
alis-0.042
ator-0.041
Token cry
Token ID3960
Feature activation+5.38e+00

Pos logit contributions
 planning+0.070
reetings+0.065
 solution+0.064
alis+0.061
 example+0.061
Neg logit contributions
s-0.036
 else-0.033
bs-0.033
nam-0.031
cream-0.031
Token and
Token ID290
Feature activation-6.99e-01

Pos logit contributions
 else+0.303
nam+0.299
s+0.298
bs+0.286
 are+0.285
Neg logit contributions
 planning-0.416
reetings-0.383
 solution-0.369
joy-0.366
 example-0.365
Token threw
Token ID9617
Feature activation+8.23e-01

Pos logit contributions
 planning+0.061
reetings+0.055
 solution+0.054
joy+0.053
alis+0.052
Neg logit contributions
s-0.035
 else-0.033
bs-0.032
nam-0.032
cream-0.031
Token a
Token ID257
Feature activation-9.62e-01

Pos logit contributions
s+0.054
 else+0.052
cream+0.051
 Sorry+0.050
nam+0.050
Neg logit contributions
 planning-0.052
 example-0.049
 solution-0.049
joy-0.048
 possible-0.047
Token tant
Token ID24246
Feature activation+1.31e+01

Pos logit contributions
 solution+0.084
 planning+0.077
joy+0.074
 addition+0.074
reetings+0.071
Neg logit contributions
cream-0.044
nam-0.042
 Sorry-0.038
 are-0.038
bs-0.037
Tokenrum
Token ID6582
Feature activation+7.53e-03

Pos logit contributions
s+0.979
 else+0.966
nam+0.939
bs+0.935
 are+0.924
Neg logit contributions
 planning-0.787
reetings-0.740
 solution-0.666
joy-0.661
alis-0.661
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 else+0.000
s+0.000
nam+0.000
bs+0.000
 are+0.000
Neg logit contributions
 planning-0.001
reetings-0.001
joy-0.001
 possible-0.001
 solution-0.001
Token She
Token ID1375
Feature activation-7.05e-01

Pos logit contributions
s+0.013
nam+0.012
bs+0.012
 else+0.012
 trash+0.011
Neg logit contributions
reetings-0.021
 planning-0.020
 possible-0.019
 solution-0.018
joy-0.018
Token didn
Token ID1422
Feature activation+1.08e+00

Pos logit contributions
 planning+0.050
 self+0.043
 known+0.042
alis+0.041
 solution+0.041
Neg logit contributions
nam-0.042
ifying-0.040
 are-0.040
 Sorry-0.040
bs-0.040
Token't
Token ID470
Feature activation-2.48e+00

Pos logit contributions
s+0.073
 else+0.071
 are+0.068
bs+0.067
nam+0.066
Neg logit contributions
 planning-0.075
reetings-0.072
joy-0.067
 solution-0.065
alis-0.065
Token her
Token ID607
Feature activation-5.51e-02

Pos logit contributions
s+0.094
 else+0.093
nam+0.090
 are+0.088
bs+0.088
Neg logit contributions
 planning-0.131
reetings-0.118
joy-0.118
 solution-0.116
 example-0.116
Token feet
Token ID3625
Feature activation+6.18e+00

Pos logit contributions
 planning+0.005
 solution+0.004
joy+0.004
 self+0.004
 example+0.004
Neg logit contributions
cream-0.003
w-0.003
bs-0.003
 else-0.002
s-0.002
Token and
Token ID290
Feature activation-7.52e-01

Pos logit contributions
s+0.321
 else+0.318
nam+0.312
bs+0.305
 are+0.301
Neg logit contributions
 planning-0.505
reetings-0.476
 solution-0.451
joy-0.444
 possible-0.444
Token threw
Token ID9617
Feature activation+7.81e-01

Pos logit contributions
 planning+0.063
joy+0.055
 spending+0.053
 creating+0.053
 solution+0.053
Neg logit contributions
s-0.033
cream-0.033
 are-0.032
 else-0.031
w-0.030
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
s+0.048
 else+0.046
bs+0.045
cream+0.044
nam+0.044
Neg logit contributions
 planning-0.053
 example-0.049
joy-0.049
 solution-0.048
 possible-0.048
Token tant
Token ID24246
Feature activation+1.30e+01

Pos logit contributions
 solution+0.097
joy+0.090
 planning+0.090
 addition+0.086
 possible+0.083
Neg logit contributions
cream-0.036
nam-0.031
 Sorry-0.029
 are-0.028
 were-0.027
Tokenrum
Token ID6582
Feature activation-3.77e-02

Pos logit contributions
s+1.009
 else+0.998
nam+0.978
 are+0.964
bs+0.963
Neg logit contributions
 planning-0.745
reetings-0.708
 solution-0.623
 possible-0.618
joy-0.615
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 planning+0.003
reetings+0.003
joy+0.003
 possible+0.003
 solution+0.003
Neg logit contributions
 else-0.002
nam-0.002
s-0.002
bs-0.002
 tant-0.002
Token She
Token ID1375
Feature activation-7.54e-01

Pos logit contributions
s+0.011
bs+0.010
 else+0.010
nam+0.010
 are+0.010
Neg logit contributions
 planning-0.016
reetings-0.016
 possible-0.015
 solution-0.015
alis-0.014
Token grabbed
Token ID13176
Feature activation+2.77e+00

Pos logit contributions
 planning+0.056
 known+0.047
 self+0.047
 shown+0.046
 actually+0.046
Neg logit contributions
nam-0.040
 Sorry-0.037
bs-0.037
 are-0.036
s-0.036
Token the
Token ID262
Feature activation-5.86e-01

Pos logit contributions
s+0.179
 else+0.172
 are+0.161
cream+0.160
 Sorry+0.159
Neg logit contributions
 planning-0.184
 example-0.171
joy-0.166
 possible-0.163
alis-0.161
Token listen
Token ID6004
Feature activation-8.70e-01

Pos logit contributions
 planning+0.129
reetings+0.126
 solution+0.118
 possible+0.116
 example+0.116
Neg logit contributions
s-0.078
nam-0.074
 else-0.072
cream-0.068
bs-0.067
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
 planning+0.074
reetings+0.073
alis+0.069
 solution+0.066
 possible+0.065
Neg logit contributions
s-0.049
 else-0.045
bs-0.043
 are-0.041
 were-0.039
Token She
Token ID1375
Feature activation-7.74e-01

Pos logit contributions
s+0.008
bs+0.007
 are+0.007
nam+0.007
 else+0.007
Neg logit contributions
 planning-0.018
reetings-0.017
 possible-0.016
 solution-0.016
alis-0.016
Token threw
Token ID9617
Feature activation+7.75e-01

Pos logit contributions
 planning+0.068
reetings+0.063
 solution+0.061
alis+0.060
 example+0.060
Neg logit contributions
nam-0.036
s-0.035
bs-0.033
 else-0.033
 are-0.031
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
s+0.045
 else+0.045
cream+0.043
nam+0.042
bs+0.042
Neg logit contributions
 planning-0.055
 example-0.051
joy-0.051
 solution-0.050
 possible-0.049
Token tant
Token ID24246
Feature activation+1.30e+01

Pos logit contributions
 solution+0.092
 planning+0.088
joy+0.084
 addition+0.083
reetings+0.082
Neg logit contributions
cream-0.039
nam-0.038
 are-0.032
bs-0.031
 both-0.031
Tokenrum
Token ID6582
Feature activation-3.65e-02

Pos logit contributions
s+0.998
 else+0.987
nam+0.970
 are+0.955
bs+0.951
Neg logit contributions
 planning-0.750
reetings-0.717
 solution-0.629
 possible-0.623
 example-0.622
Token in
Token ID287
Feature activation+2.78e+00

Pos logit contributions
 planning+0.003
reetings+0.003
joy+0.003
 solution+0.003
 possible+0.003
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
bs-0.002
 are-0.002
Token the
Token ID262
Feature activation-5.96e-01

Pos logit contributions
s+0.147
 else+0.143
nam+0.137
bs+0.137
cream+0.133
Neg logit contributions
 planning-0.229
reetings-0.215
 solution-0.206
joy-0.204
 example-0.203
Token store
Token ID3650
Feature activation-1.50e+00

Pos logit contributions
 planning+0.047
 solution+0.045
 example+0.042
reetings+0.042
joy+0.041
Neg logit contributions
bs-0.035
nam-0.035
 else-0.034
s-0.032
cream-0.032
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
reetings+0.141
 planning+0.138
alis+0.128
 possible+0.127
 solution+0.124
Neg logit contributions
s-0.067
 else-0.064
 are-0.060
nam-0.059
bs-0.058
Token sad
Token ID6507
Feature activation+1.68e+00

Pos logit contributions
 planning+0.076
reetings+0.071
 solution+0.068
joy+0.067
alis+0.067
Neg logit contributions
s-0.034
 else-0.033
nam-0.032
 are-0.032
bs-0.031
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
nam+0.077
 else+0.077
s+0.077
bs+0.073
 are+0.072
Neg logit contributions
 planning-0.148
reetings-0.139
 solution-0.132
joy-0.132
 possible-0.131
Token She
Token ID1375
Feature activation-8.33e-01

Pos logit contributions
s+0.007
 else+0.006
nam+0.006
bs+0.006
 are+0.006
Neg logit contributions
 planning-0.010
reetings-0.010
 possible-0.009
 solution-0.009
alis-0.009
Token threw
Token ID9617
Feature activation+7.20e-01

Pos logit contributions
 planning+0.069
reetings+0.061
 example+0.061
alis+0.060
 solution+0.060
Neg logit contributions
s-0.042
nam-0.041
bs-0.040
 are-0.040
 else-0.039
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
s+0.045
 else+0.044
cream+0.042
 Sorry+0.041
nam+0.041
Neg logit contributions
 planning-0.049
 example-0.045
 solution-0.045
joy-0.045
alis-0.043
Token tant
Token ID24246
Feature activation+1.30e+01

Pos logit contributions
 solution+0.096
 planning+0.087
joy+0.085
 addition+0.084
 transformation+0.079
Neg logit contributions
cream-0.047
nam-0.043
 Sorry-0.042
 are-0.038
 elbows-0.037
Tokenrum
Token ID6582
Feature activation-9.89e-02

Pos logit contributions
s+0.967
 else+0.954
nam+0.926
bs+0.925
 are+0.911
Neg logit contributions
 planning-0.786
reetings-0.737
 solution-0.667
alis-0.662
joy-0.661
Token and
Token ID290
Feature activation-7.95e-01

Pos logit contributions
 planning+0.008
reetings+0.008
joy+0.007
 example+0.007
 possible+0.007
Neg logit contributions
 else-0.005
s-0.005
nam-0.005
bs-0.005
 are-0.005
Token cried
Token ID16896
Feature activation+5.47e+00

Pos logit contributions
 planning+0.069
joy+0.060
 solution+0.059
reetings+0.058
 spending+0.058
Neg logit contributions
s-0.039
 else-0.037
 are-0.036
bs-0.036
cream-0.035
Token loudly
Token ID23112
Feature activation+1.74e+00

Pos logit contributions
s+0.275
 else+0.272
nam+0.261
bs+0.255
 are+0.254
Neg logit contributions
 planning-0.464
reetings-0.440
 solution-0.415
joy-0.412
 possible-0.409
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
s+0.069
 else+0.061
bs+0.060
 are+0.058
 were+0.055
Neg logit contributions
reetings-0.170
 planning-0.168
alis-0.158
 possible-0.155
 solution-0.155
Token and
Token ID290
Feature activation-8.20e-01

Pos logit contributions
s+0.139
 else+0.133
 are+0.125
bs+0.124
nam+0.120
Neg logit contributions
 planning-0.265
reetings-0.263
alis-0.241
 possible-0.240
 solution-0.239
Token started
Token ID2067
Feature activation+3.60e-01

Pos logit contributions
 planning+0.074
 solution+0.065
reetings+0.064
 example+0.064
alis+0.064
Neg logit contributions
s-0.035
 else-0.034
 are-0.034
nam-0.033
bs-0.032
Token to
Token ID284
Feature activation-8.71e-01

Pos logit contributions
s+0.019
 else+0.018
bs+0.016
 trash+0.016
nam+0.016
Neg logit contributions
reetings-0.034
alis-0.030
 possible-0.029
 Fairy-0.029
 planning-0.029
Token throw
Token ID3714
Feature activation+5.71e-01

Pos logit contributions
 planning+0.083
 solution+0.076
reetings+0.074
alis+0.072
 example+0.071
Neg logit contributions
s-0.035
 else-0.034
nam-0.034
bs-0.033
cream-0.032
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
 else+0.034
nam+0.034
s+0.032
cream+0.032
 Sorry+0.031
Neg logit contributions
 planning-0.039
 solution-0.036
 example-0.036
joy-0.035
alis-0.035
Token tant
Token ID24246
Feature activation+1.29e+01

Pos logit contributions
 solution+0.083
 planning+0.078
joy+0.071
 addition+0.071
 recipe+0.070
Neg logit contributions
nam-0.060
cream-0.058
 bicycles-0.054
 are-0.053
 Sorry-0.053
Tokenrum
Token ID6582
Feature activation-1.03e-01

Pos logit contributions
s+0.977
 else+0.965
nam+0.941
bs+0.933
 are+0.926
Neg logit contributions
 planning-0.770
reetings-0.728
 solution-0.651
joy-0.644
alis-0.643
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 planning+0.010
reetings+0.009
 solution+0.009
joy+0.009
 possible+0.009
Neg logit contributions
s-0.004
 else-0.004
nam-0.004
bs-0.004
 are-0.004
Token Suddenly
Token ID24975
Feature activation-1.34e+00

Pos logit contributions
s+0.007
 else+0.007
bs+0.006
nam+0.006
 are+0.006
Neg logit contributions
 planning-0.009
reetings-0.009
 possible-0.008
 solution-0.008
joy-0.008
Token,
Token ID11
Feature activation+2.43e-01

Pos logit contributions
 planning+0.118
reetings+0.114
alis+0.106
 solution+0.106
 possible+0.105
Neg logit contributions
s-0.064
 else-0.062
bs-0.059
nam-0.059
 are-0.058
Token she
Token ID673
Feature activation-4.98e-01

Pos logit contributions
s+0.014
 else+0.013
bs+0.013
cream+0.013
 are+0.012
Neg logit contributions
 planning-0.019
reetings-0.017
joy-0.017
 possible-0.017
 solution-0.017
Token kept
Token ID4030
Feature activation-8.38e-01

Pos logit contributions
 planning+0.069
alis+0.060
 solution+0.059
 example+0.059
 Fairy+0.058
Neg logit contributions
nam-0.045
 are-0.043
s-0.042
 tant-0.041
bs-0.040
Token crying
Token ID13774
Feature activation-6.56e-01

Pos logit contributions
reetings+0.061
alis+0.049
 possible+0.049
 planning+0.049
 Fairy+0.048
Neg logit contributions
nam-0.058
 else-0.056
 both-0.055
s-0.055
 trash-0.055
Token and
Token ID290
Feature activation-8.29e-01

Pos logit contributions
 planning+0.044
reetings+0.040
 example+0.040
 solution+0.040
joy+0.039
Neg logit contributions
nam-0.042
cream-0.042
 else-0.040
s-0.040
bs-0.039
Token throwing
Token ID9644
Feature activation-3.61e+00

Pos logit contributions
 planning+0.057
reetings+0.053
 solution+0.049
joy+0.049
alis+0.048
Neg logit contributions
s-0.056
 else-0.055
bs-0.053
nam-0.053
 are-0.052
Token a
Token ID257
Feature activation-1.09e+00

Pos logit contributions
 planning+0.248
 solution+0.233
joy+0.224
 example+0.224
 addition+0.219
Neg logit contributions
s-0.215
 else-0.213
nam-0.206
cream-0.202
 Sorry-0.201
Token tant
Token ID24246
Feature activation+1.29e+01

Pos logit contributions
 solution+0.097
 planning+0.089
 addition+0.084
 recipe+0.079
 transformation+0.079
Neg logit contributions
nam-0.051
cream-0.046
 Sorry-0.044
 are-0.043
w-0.040
Tokenrum
Token ID6582
Feature activation-1.20e-01

Pos logit contributions
s+0.974
 else+0.962
nam+0.938
bs+0.930
 are+0.923
Neg logit contributions
 planning-0.772
reetings-0.729
 solution-0.653
joy-0.647
alis-0.646
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 planning+0.011
reetings+0.010
joy+0.010
 solution+0.010
 possible+0.010
Neg logit contributions
s-0.005
 else-0.005
nam-0.005
bs-0.005
 are-0.005
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
s+0.006
 else+0.006
bs+0.005
nam+0.005
 are+0.005
Neg logit contributions
 planning-0.010
reetings-0.010
 possible-0.009
 solution-0.009
alis-0.009
Token
Token ID198
Feature activation+3.17e-01

Pos logit contributions
s+0.016
 else+0.016
nam+0.015
bs+0.015
 are+0.015
Neg logit contributions
 planning-0.027
reetings-0.026
 solution-0.024
joy-0.024
alis-0.023
TokenFinally
Token ID11158
Feature activation+1.71e+00

Pos logit contributions
s+0.020
nam+0.019
 else+0.019
 are+0.019
bs+0.019
Neg logit contributions
 planning-0.023
reetings-0.022
 solution-0.020
alis-0.020
 example-0.019
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
 planning+0.089
reetings+0.086
alis+0.081
 solution+0.080
joy+0.080
Neg logit contributions
s-0.040
 else-0.039
nam-0.036
bs-0.036
 are-0.036
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
s+0.004
 else+0.004
nam+0.003
bs+0.003
 are+0.003
Neg logit contributions
 planning-0.007
reetings-0.007
joy-0.006
 solution-0.006
 example-0.006
Token kept
Token ID4030
Feature activation-8.64e-01

Pos logit contributions
nam+0.046
s+0.043
bs+0.042
 else+0.040
 are+0.039
Neg logit contributions
 planning-0.090
reetings-0.083
 solution-0.080
alis-0.079
 example-0.079
Token throwing
Token ID9644
Feature activation-3.64e+00

Pos logit contributions
reetings+0.056
 planning+0.054
alis+0.048
 solution+0.047
 possible+0.047
Neg logit contributions
 else-0.061
s-0.061
nam-0.060
bs-0.058
 are-0.058
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
 solution+0.246
 planning+0.245
joy+0.244
 example+0.239
 addition+0.222
Neg logit contributions
s-0.204
 else-0.202
bs-0.192
 Sorry-0.192
cream-0.187
Token tant
Token ID24246
Feature activation+1.29e+01

Pos logit contributions
 solution+0.085
 planning+0.075
 addition+0.073
joy+0.072
reetings+0.071
Neg logit contributions
nam-0.060
cream-0.056
 were-0.055
 are-0.055
 elbows-0.055
Tokenrum
Token ID6582
Feature activation-1.27e-01

Pos logit contributions
s+0.998
 else+0.990
nam+0.972
bs+0.955
 are+0.955
Neg logit contributions
 planning-0.735
reetings-0.703
 solution-0.618
 possible-0.612
joy-0.608
Token.
Token ID13
Feature activation+9.78e-02

Pos logit contributions
 planning+0.011
reetings+0.010
joy+0.010
 possible+0.010
 example+0.010
Neg logit contributions
bs-0.006
 else-0.006
nam-0.006
s-0.006
 tant-0.006
Token His
Token ID2399
Feature activation+1.30e+00

Pos logit contributions
s+0.004
 else+0.004
nam+0.004
bs+0.004
 are+0.004
Neg logit contributions
 planning-0.009
reetings-0.009
 solution-0.008
joy-0.008
alis-0.008
Token mom
Token ID1995
Feature activation+5.11e+00

Pos logit contributions
s+0.076
bs+0.073
 else+0.073
cream+0.072
nam+0.067
Neg logit contributions
 planning-0.098
 solution-0.097
 self-0.091
joy-0.091
 example-0.087
Token was
Token ID373
Feature activation-8.54e-01

Pos logit contributions
nam+0.281
bs+0.240
s+0.238
 Sorry+0.235
 tant+0.233
Neg logit contributions
 planning-0.427
 solution-0.380
reetings-0.368
joy-0.365
 Fairy-0.362
Token them
Token ID606
Feature activation-3.08e+00

Pos logit contributions
reetings+0.202
 planning+0.199
alis+0.187
 solution+0.183
joy+0.181
Neg logit contributions
 else-0.117
nam-0.112
s-0.110
 are-0.108
bs-0.105
Token and
Token ID290
Feature activation-8.70e-01

Pos logit contributions
 planning+0.276
reetings+0.270
joy+0.251
alis+0.251
 solution+0.247
Neg logit contributions
s-0.142
 else-0.140
nam-0.131
 are-0.130
bs-0.130
Token never
Token ID1239
Feature activation-1.37e-01

Pos logit contributions
 planning+0.073
reetings+0.069
 solution+0.064
 possible+0.064
joy+0.064
Neg logit contributions
s-0.045
 else-0.044
nam-0.043
bs-0.042
 are-0.041
Token threw
Token ID9617
Feature activation+6.34e-01

Pos logit contributions
 planning+0.013
reetings+0.012
 solution+0.011
joy+0.011
alis+0.011
Neg logit contributions
s-0.006
 else-0.006
nam-0.006
bs-0.006
 are-0.006
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
s+0.035
 else+0.034
nam+0.033
bs+0.033
 are+0.032
Neg logit contributions
 planning-0.050
reetings-0.047
 solution-0.044
 possible-0.044
joy-0.044
Token tant
Token ID24246
Feature activation+1.28e+01

Pos logit contributions
 planning+0.097
 solution+0.097
reetings+0.093
joy+0.092
 possible+0.089
Neg logit contributions
nam-0.055
cream-0.052
s-0.051
 are-0.050
 were-0.048
Tokenrum
Token ID6582
Feature activation-2.08e-01

Pos logit contributions
s+0.855
 else+0.845
bs+0.819
nam+0.798
 tant+0.793
Neg logit contributions
 planning-0.891
reetings-0.809
alis-0.797
 solution-0.777
joy-0.771
Token again
Token ID757
Feature activation-7.41e+00

Pos logit contributions
 planning+0.020
reetings+0.020
alis+0.019
 solution+0.019
joy+0.018
Neg logit contributions
s-0.008
 else-0.008
 are-0.007
bs-0.007
nam-0.007
Token.
Token ID13
Feature activation+3.19e-03

Pos logit contributions
 planning+0.766
reetings+0.760
alis+0.727
joy+0.704
 example+0.700
Neg logit contributions
s-0.256
 else-0.239
bs-0.215
 are-0.211
nam-0.196
Token<|endoftext|>
Token ID50256
Feature activation+2.88e+00

Pos logit contributions
s+0.000
 else+0.000
bs+0.000
nam+0.000
 are+0.000
Neg logit contributions
 planning-0.000
reetings-0.000
 possible-0.000
 solution-0.000
alis-0.000
TokenOnce
Token ID7454
Feature activation+9.42e-01

Pos logit contributions
s+0.115
 are+0.106
 else+0.102
 were+0.099
 trash+0.083
Neg logit contributions
 planning-0.285
reetings-0.276
 example-0.268
alis-0.266
 possible-0.266
Token his
Token ID465
Feature activation+2.16e+00

Pos logit contributions
s+0.126
 else+0.124
nam+0.122
bs+0.114
 are+0.112
Neg logit contributions
 planning-0.292
reetings-0.282
joy-0.267
 solution-0.267
 example-0.267
Token feet
Token ID3625
Feature activation+5.99e+00

Pos logit contributions
 else+0.108
s+0.108
nam+0.105
bs+0.104
cream+0.103
Neg logit contributions
 planning-0.180
 solution-0.167
reetings-0.164
joy-0.163
 example-0.163
Token and
Token ID290
Feature activation-9.38e-01

Pos logit contributions
s+0.310
 else+0.310
nam+0.304
bs+0.297
 are+0.292
Neg logit contributions
 planning-0.490
reetings-0.461
 solution-0.436
 possible-0.432
joy-0.429
Token threw
Token ID9617
Feature activation+5.88e-01

Pos logit contributions
 planning+0.084
joy+0.076
reetings+0.073
 solution+0.073
 example+0.072
Neg logit contributions
s-0.041
 else-0.040
 are-0.038
bs-0.038
cream-0.038
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
s+0.034
 else+0.033
bs+0.032
nam+0.032
cream+0.030
Neg logit contributions
 planning-0.042
 example-0.040
joy-0.040
 solution-0.040
 possible-0.039
Token tant
Token ID24246
Feature activation+1.28e+01

Pos logit contributions
 solution+0.098
 planning+0.095
reetings+0.092
joy+0.089
 addition+0.087
Neg logit contributions
nam-0.058
cream-0.056
 else-0.053
 are-0.052
s-0.051
Tokenrum
Token ID6582
Feature activation-2.39e-01

Pos logit contributions
s+0.938
 else+0.925
bs+0.895
nam+0.893
 are+0.879
Neg logit contributions
 planning-0.798
reetings-0.743
alis-0.682
 solution-0.680
joy-0.675
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 planning+0.021
reetings+0.019
joy+0.018
 possible+0.018
 solution+0.018
Neg logit contributions
nam-0.011
 else-0.011
bs-0.011
s-0.011
 tant-0.010
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
 planning+0.001
reetings+0.001
 possible+0.001
 solution+0.001
alis+0.001
Neg logit contributions
s-0.001
 else-0.001
bs-0.000
nam-0.000
 are-0.000
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
s+0.009
 else+0.009
nam+0.009
bs+0.008
 are+0.008
Neg logit contributions
 planning-0.015
reetings-0.014
 solution-0.013
joy-0.013
 possible-0.013
Token"
Token ID1
Feature activation-2.64e+00

Pos logit contributions
nam+0.010
 are+0.009
s+0.009
 else+0.009
bs+0.009
Neg logit contributions
 planning-0.013
reetings-0.013
 example-0.012
 solution-0.012
 possible-0.012
Token temper
Token ID4124
Feature activation+1.69e+00

Pos logit contributions
 planning+0.127
reetings+0.119
 solution+0.115
joy+0.113
 example+0.112
Neg logit contributions
s-0.075
 else-0.073
nam-0.072
bs-0.069
 are-0.068
Token and
Token ID290
Feature activation-9.62e-01

Pos logit contributions
s+0.086
 else+0.086
nam+0.085
bs+0.082
 are+0.080
Neg logit contributions
 planning-0.139
reetings-0.131
 solution-0.124
joy-0.121
 example-0.121
Token often
Token ID1690
Feature activation-2.54e+00

Pos logit contributions
 planning+0.089
reetings+0.083
 solution+0.079
 possible+0.078
 example+0.078
Neg logit contributions
s-0.042
 else-0.039
bs-0.039
nam-0.038
 are-0.037
Token threw
Token ID9617
Feature activation+5.77e-01

Pos logit contributions
 planning+0.232
reetings+0.220
 solution+0.207
 possible+0.204
joy+0.204
Neg logit contributions
s-0.113
 else-0.108
nam-0.106
bs-0.103
 are-0.100
Token temper
Token ID4124
Feature activation+1.70e+00

Pos logit contributions
s+0.030
nam+0.028
 else+0.028
bs+0.028
 Sorry+0.028
Neg logit contributions
 planning-0.045
 solution-0.042
joy-0.041
 example-0.041
 possible-0.040
Token tant
Token ID24246
Feature activation+1.28e+01

Pos logit contributions
nam+0.084
 else+0.077
 Sorry+0.075
 are+0.075
s+0.073
Neg logit contributions
 planning-0.139
reetings-0.133
 solution-0.128
 example-0.128
joy-0.127
Tokenrums
Token ID45241
Feature activation+7.25e-01

Pos logit contributions
s+0.825
 else+0.819
nam+0.784
bs+0.784
cream+0.758
Neg logit contributions
 planning-0.901
reetings-0.832
alis-0.810
 solution-0.800
joy-0.789
Token.
Token ID13
Feature activation-2.02e-02

Pos logit contributions
s+0.036
 else+0.035
bs+0.034
nam+0.034
 are+0.034
Neg logit contributions
 planning-0.061
reetings-0.058
 solution-0.054
 possible-0.054
 example-0.053
Token One
Token ID1881
Feature activation-1.48e+00

Pos logit contributions
 planning+0.002
reetings+0.002
alis+0.001
 solution+0.001
 possible+0.001
Neg logit contributions
s-0.001
 else-0.001
nam-0.001
 are-0.001
bs-0.001
Token day
Token ID1110
Feature activation-6.89e-01

Pos logit contributions
 planning+0.100
reetings+0.095
 solution+0.086
joy+0.085
alis+0.085
Neg logit contributions
s-0.102
 else-0.100
nam-0.097
bs-0.097
 are-0.096
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 planning+0.058
reetings+0.054
 solution+0.051
joy+0.051
 possible+0.050
Neg logit contributions
s-0.036
 else-0.035
bs-0.035
nam-0.035
 are-0.033
Token surprise
Token ID5975
Feature activation+4.21e-02

Pos logit contributions
 planning+0.066
reetings+0.062
 solution+0.060
joy+0.058
 possible+0.058
Neg logit contributions
s-0.048
 else-0.046
bs-0.046
nam-0.044
 are-0.043
Token.
Token ID13
Feature activation-3.30e-02

Pos logit contributions
s+0.002
nam+0.002
bs+0.002
 else+0.002
 are+0.002
Neg logit contributions
 planning-0.004
reetings-0.004
 solution-0.003
 possible-0.003
 example-0.003
Token He
Token ID679
Feature activation+9.06e-01

Pos logit contributions
 planning+0.003
reetings+0.003
joy+0.002
 example+0.002
 solution+0.002
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
bs-0.002
 are-0.002
Token threw
Token ID9617
Feature activation+5.61e-01

Pos logit contributions
nam+0.047
s+0.043
bs+0.043
 are+0.040
apy+0.040
Neg logit contributions
 planning-0.075
 example-0.067
 known-0.065
alis-0.065
 solution-0.065
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
 else+0.037
s+0.036
bs+0.036
nam+0.035
 Sorry+0.033
Neg logit contributions
 planning-0.033
 example-0.033
 solution-0.033
joy-0.033
 possible-0.031
Token tant
Token ID24246
Feature activation+1.28e+01

Pos logit contributions
 solution+0.097
 planning+0.090
joy+0.086
reetings+0.085
 addition+0.084
Neg logit contributions
cream-0.062
nam-0.061
 were-0.056
 else-0.056
 are-0.056
Tokenrum
Token ID6582
Feature activation-2.53e-01

Pos logit contributions
s+1.013
 else+1.006
nam+0.996
 are+0.981
bs+0.971
Neg logit contributions
 planning-0.695
reetings-0.675
 example-0.580
 solution-0.580
 possible-0.579
Token.
Token ID13
Feature activation-2.57e-02

Pos logit contributions
 planning+0.021
reetings+0.019
joy+0.018
 possible+0.018
 example+0.018
Neg logit contributions
nam-0.013
bs-0.013
 else-0.013
 tant-0.012
s-0.012
Token He
Token ID679
Feature activation+9.22e-01

Pos logit contributions
 planning+0.002
reetings+0.002
joy+0.002
 solution+0.002
 example+0.002
Neg logit contributions
s-0.001
 else-0.001
nam-0.001
bs-0.001
 are-0.001
Token kicked
Token ID12165
Feature activation+6.14e-01

Pos logit contributions
nam+0.054
 Sorry+0.049
bs+0.048
 bicycle+0.046
s+0.046
Neg logit contributions
 planning-0.064
 known-0.057
 shown-0.057
 self-0.055
 example-0.053
Token and
Token ID290
Feature activation-9.60e-01

Pos logit contributions
bs+0.040
 else+0.040
nam+0.040
s+0.039
 Sorry+0.039
Neg logit contributions
 planning-0.036
 example-0.036
 solution-0.036
joy-0.036
 possible-0.033
Token stubborn
Token ID24444
Feature activation-7.29e-01

Pos logit contributions
reetings+0.030
 planning+0.028
alis+0.027
 Fairy+0.026
joy+0.026
Neg logit contributions
 else-0.011
nam-0.010
 both-0.010
 trash-0.009
bs-0.009
Token.
Token ID13
Feature activation-3.49e-02

Pos logit contributions
 planning+0.072
reetings+0.071
alis+0.067
 solution+0.065
 possible+0.065
Neg logit contributions
s-0.028
 else-0.027
 are-0.025
bs-0.024
nam-0.023
Token She
Token ID1375
Feature activation-9.71e-01

Pos logit contributions
 planning+0.003
reetings+0.003
 solution+0.002
joy+0.002
alis+0.002
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
bs-0.002
 are-0.002
Token throws
Token ID12542
Feature activation-1.08e+00

Pos logit contributions
 planning+0.077
reetings+0.071
alis+0.070
 example+0.068
 solution+0.067
Neg logit contributions
nam-0.054
s-0.048
 else-0.046
cream-0.046
 tant-0.045
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
 planning+0.072
 example+0.068
joy+0.067
 solution+0.067
 possible+0.065
Neg logit contributions
s-0.066
 else-0.066
nam-0.063
cream-0.062
bs-0.061
Token tant
Token ID24246
Feature activation+1.28e+01

Pos logit contributions
 solution+0.111
 planning+0.108
joy+0.105
reetings+0.104
 addition+0.102
Neg logit contributions
cream-0.048
nam-0.047
 are-0.042
 were-0.041
 else-0.040
Tokenrum
Token ID6582
Feature activation-2.70e-01

Pos logit contributions
s+0.964
 else+0.952
nam+0.926
bs+0.921
 are+0.912
Neg logit contributions
 planning-0.764
reetings-0.720
 solution-0.646
joy-0.640
alis-0.640
Token.
Token ID13
Feature activation-4.47e-02

Pos logit contributions
 planning+0.023
reetings+0.022
 possible+0.021
joy+0.021
 example+0.021
Neg logit contributions
 else-0.013
nam-0.013
s-0.012
bs-0.012
 tant-0.012
Token She
Token ID1375
Feature activation-9.88e-01

Pos logit contributions
 planning+0.004
reetings+0.003
 solution+0.003
joy+0.003
alis+0.003
Neg logit contributions
s-0.003
 else-0.002
nam-0.002
bs-0.002
 are-0.002
Token kicks
Token ID17364
Feature activation+7.09e-01

Pos logit contributions
 planning+0.065
alis+0.060
 example+0.057
 winning+0.055
 seeking+0.055
Neg logit contributions
nam-0.066
cream-0.056
apy-0.053
bs-0.052
 else-0.052
Token and
Token ID290
Feature activation-9.72e-01

Pos logit contributions
cream+0.043
nam+0.042
 else+0.042
s+0.041
bs+0.040
Neg logit contributions
 planning-0.046
 example-0.044
joy-0.043
 solution-0.043
reetings-0.042
Token she
Token ID673
Feature activation-3.49e-01

Pos logit contributions
 else+0.103
s+0.096
 are+0.092
 were+0.091
nam+0.087
Neg logit contributions
alis-0.283
reetings-0.280
inger-0.266
 solution-0.265
iddle-0.262
Token could
Token ID714
Feature activation+3.75e+00

Pos logit contributions
 planning+0.034
reetings+0.033
 solution+0.031
 possible+0.030
joy+0.030
Neg logit contributions
s-0.014
 else-0.013
nam-0.012
bs-0.012
 are-0.012
Token carry
Token ID3283
Feature activation+1.02e+00

Pos logit contributions
 are+0.154
 else+0.149
s+0.147
 were+0.137
bs+0.133
Neg logit contributions
 planning-0.357
reetings-0.353
joy-0.337
alis-0.331
 example-0.326
Token the
Token ID262
Feature activation-4.92e-01

Pos logit contributions
s+0.050
 else+0.049
nam+0.047
bs+0.047
 are+0.046
Neg logit contributions
 planning-0.087
reetings-0.083
 solution-0.078
 possible-0.078
joy-0.077
Token or
Token ID393
Feature activation+1.95e+00

Pos logit contributions
 planning+0.042
reetings+0.041
joy+0.037
alis+0.037
 possible+0.037
Neg logit contributions
nam-0.025
s-0.024
 else-0.024
 are-0.024
bs-0.022
Tokennam
Token ID7402
Feature activation+1.24e+01

Pos logit contributions
s+0.061
bs+0.052
 else+0.050
 are+0.048
cream+0.047
Neg logit contributions
 planning-0.205
reetings-0.193
 possible-0.189
 solution-0.188
alis-0.183
Tokenents
Token ID658
Feature activation+8.74e-01

Pos logit contributions
s+0.709
 else+0.692
nam+0.670
bs+0.661
 are+0.651
Neg logit contributions
 planning-0.972
reetings-0.932
 solution-0.857
joy-0.852
alis-0.851
Token back
Token ID736
Feature activation-5.05e-01

Pos logit contributions
s+0.037
 else+0.036
 are+0.034
bs+0.033
nam+0.032
Neg logit contributions
 planning-0.083
reetings-0.081
joy-0.076
alis-0.076
 example-0.075
Token with
Token ID351
Feature activation+1.58e+00

Pos logit contributions
 planning+0.042
reetings+0.042
alis+0.038
joy+0.037
 example+0.037
Neg logit contributions
s-0.027
 else-0.026
 are-0.025
nam-0.025
bs-0.024
Token her
Token ID607
Feature activation+7.92e-02

Pos logit contributions
s+0.083
 else+0.080
nam+0.077
bs+0.077
 are+0.075
Neg logit contributions
 planning-0.132
reetings-0.125
 solution-0.117
 possible-0.117
joy-0.116
Token.
Token ID13
Feature activation+3.16e-01

Pos logit contributions
s+0.004
 else+0.004
nam+0.004
bs+0.004
 are+0.004
Neg logit contributions
 planning-0.007
reetings-0.007
alis-0.006
 solution-0.006
joy-0.006
Token a
Token ID257
Feature activation-9.27e-01

Pos logit contributions
 planning+0.199
reetings+0.191
 solution+0.177
joy+0.175
 possible+0.175
Neg logit contributions
s-0.130
 else-0.128
nam-0.123
bs-0.122
 are-0.121
Token big
Token ID1263
Feature activation+3.61e+00

Pos logit contributions
 planning+0.071
 solution+0.065
 addition+0.060
reetings+0.060
 possible+0.059
Neg logit contributions
bs-0.055
s-0.054
 are-0.052
nam-0.051
w-0.051
Token box
Token ID3091
Feature activation+1.50e+00

Pos logit contributions
s+0.185
 else+0.183
bs+0.179
nam+0.176
 are+0.165
Neg logit contributions
 planning-0.307
reetings-0.276
 solution-0.275
joy-0.271
 example-0.268
Token of
Token ID286
Feature activation-5.81e-01

Pos logit contributions
s+0.077
 else+0.075
bs+0.071
nam+0.071
 are+0.070
Neg logit contributions
 planning-0.126
reetings-0.124
joy-0.114
alis-0.113
 solution-0.113
Token or
Token ID393
Feature activation+1.93e+00

Pos logit contributions
 planning+0.046
 Fairy+0.039
 solution+0.039
 addition+0.039
 example+0.037
Neg logit contributions
s-0.035
bs-0.035
w-0.031
 else-0.030
 tant-0.029
Tokennam
Token ID7402
Feature activation+1.24e+01

Pos logit contributions
s+0.044
 are+0.032
bs+0.031
 tant+0.030
 else+0.029
Neg logit contributions
 planning-0.225
 possible-0.205
reetings-0.204
 solution-0.202
 example-0.201
Tokenents
Token ID658
Feature activation+8.50e-01

Pos logit contributions
 else+0.738
s+0.732
 are+0.714
bs+0.713
nam+0.705
Neg logit contributions
 planning-0.916
reetings-0.858
 solution-0.798
 possible-0.793
alis-0.788
Token.
Token ID13
Feature activation+2.84e-01

Pos logit contributions
s+0.034
 else+0.031
 are+0.030
bs+0.029
nam+0.028
Neg logit contributions
 planning-0.083
reetings-0.083
joy-0.078
alis-0.077
 solution-0.076
Token They
Token ID1119
Feature activation-3.60e-01

Pos logit contributions
s+0.015
 else+0.014
bs+0.013
 are+0.013
nam+0.013
Neg logit contributions
reetings-0.025
 planning-0.025
 possible-0.023
 example-0.022
 believing-0.022
Token were
Token ID547
Feature activation+5.43e+00

Pos logit contributions
 planning+0.027
 addition+0.022
 solving+0.022
 known+0.021
 Fairy+0.021
Neg logit contributions
 tant-0.022
 elbows-0.020
 trash-0.020
bs-0.020
w-0.019
Token so
Token ID523
Feature activation+6.04e-01

Pos logit contributions
s+0.246
 else+0.245
nam+0.239
 are+0.227
bs+0.226
Neg logit contributions
 planning-0.482
reetings-0.480
joy-0.440
 solution-0.440
alis-0.437
Token branches
Token ID13737
Feature activation+2.86e+00

Pos logit contributions
 planning+0.043
reetings+0.040
 solution+0.039
 possible+0.038
alis+0.038
Neg logit contributions
s-0.032
 else-0.031
bs-0.030
 are-0.028
nam-0.028
Token.
Token ID13
Feature activation+2.54e-01

Pos logit contributions
s+0.127
 else+0.122
 are+0.114
bs+0.113
nam+0.111
Neg logit contributions
 planning-0.266
reetings-0.258
joy-0.240
alis-0.238
 solution-0.237
Token But
Token ID887
Feature activation+2.68e+00

Pos logit contributions
nam+0.015
 else+0.014
bs+0.014
s+0.014
 are+0.014
Neg logit contributions
 planning-0.020
reetings-0.017
joy-0.017
 solution-0.017
 example-0.016
Token the
Token ID262
Feature activation-5.25e-01

Pos logit contributions
s+0.136
bs+0.133
nam+0.132
cream+0.131
 else+0.130
Neg logit contributions
 planning-0.224
reetings-0.195
 possible-0.194
 example-0.193
 solution-0.193
Token or
Token ID393
Feature activation+1.92e+00

Pos logit contributions
 planning+0.043
reetings+0.041
 solution+0.039
 possible+0.038
alis+0.038
Neg logit contributions
s-0.028
 else-0.027
bs-0.026
nam-0.025
 are-0.025
Tokennam
Token ID7402
Feature activation+1.24e+01

Pos logit contributions
s+0.051
bs+0.046
 else+0.041
 are+0.038
cream+0.038
Neg logit contributions
 planning-0.210
 possible-0.195
reetings-0.193
 solution-0.191
 example-0.190
Tokenents
Token ID658
Feature activation+8.36e-01

Pos logit contributions
 else+0.746
s+0.733
nam+0.718
bs+0.714
 are+0.711
Neg logit contributions
 planning-0.907
reetings-0.854
 solution-0.791
alis-0.788
 possible-0.785
Token were
Token ID547
Feature activation+5.41e+00

Pos logit contributions
s+0.018
nam+0.018
 else+0.018
bs+0.016
 are+0.015
Neg logit contributions
 planning-0.093
reetings-0.090
 solution-0.085
 possible-0.085
alis-0.084
Token slippery
Token ID32911
Feature activation+3.08e+00

Pos logit contributions
 else+0.289
nam+0.263
s+0.257
 both+0.255
 are+0.239
Neg logit contributions
reetings-0.493
 planning-0.451
joy-0.432
alis-0.429
 Fairy-0.426
Token and
Token ID290
Feature activation-6.50e-01

Pos logit contributions
s+0.141
 else+0.134
 are+0.130
bs+0.123
nam+0.122
Neg logit contributions
 planning-0.278
reetings-0.274
alis-0.252
 solution-0.249
joy-0.249
Token some
Token ID617
Feature activation-7.68e-01

Pos logit contributions
 planning+0.054
 possible+0.050
reetings+0.049
 solution+0.048
alis+0.047
Neg logit contributions
s-0.032
 else-0.030
bs-0.029
nam-0.029
cream-0.029
Token They
Token ID1119
Feature activation-4.10e-01

Pos logit contributions
s+0.008
 are+0.006
nam+0.006
bs+0.006
 were+0.006
Neg logit contributions
reetings-0.024
 planning-0.024
 possible-0.023
alis-0.023
 solution-0.023
Token reached
Token ID4251
Feature activation-1.16e+00

Pos logit contributions
 planning+0.031
 known+0.029
 Fairy+0.029
 shown+0.028
 solution+0.027
Neg logit contributions
 tant-0.021
nam-0.020
s-0.020
controlled-0.019
cream-0.019
Token for
Token ID329
Feature activation-2.20e+00

Pos logit contributions
 planning+0.091
reetings+0.087
 solution+0.080
joy+0.079
alis+0.079
Neg logit contributions
s-0.067
 else-0.065
nam-0.063
bs-0.063
 are-0.062
Token the
Token ID262
Feature activation-5.32e-01

Pos logit contributions
 planning+0.175
joy+0.153
 example+0.151
 possible+0.150
 solution+0.149
Neg logit contributions
s-0.130
bs-0.121
 else-0.120
w-0.117
 tant-0.114
Token or
Token ID393
Feature activation+1.91e+00

Pos logit contributions
 planning+0.044
reetings+0.042
 solution+0.040
 possible+0.039
joy+0.039
Neg logit contributions
s-0.028
 else-0.028
bs-0.027
nam-0.026
 are-0.026
Tokennam
Token ID7402
Feature activation+1.24e+01

Pos logit contributions
s+0.049
w+0.041
 are+0.040
cream+0.038
haw+0.035
Neg logit contributions
 planning-0.207
 possible-0.193
 solution-0.190
 example-0.186
 creating-0.185
Tokenents
Token ID658
Feature activation+8.37e-01

Pos logit contributions
 else+0.739
s+0.730
 are+0.715
nam+0.713
bs+0.711
Neg logit contributions
 planning-0.909
reetings-0.859
 solution-0.794
 possible-0.793
joy-0.785
Token and
Token ID290
Feature activation-6.63e-01

Pos logit contributions
s+0.039
 else+0.037
 are+0.036
bs+0.035
 were+0.033
Neg logit contributions
 planning-0.077
reetings-0.076
alis-0.070
joy-0.070
 example-0.069
Token touched
Token ID12615
Feature activation+2.19e+00

Pos logit contributions
 planning+0.054
 solution+0.047
 Fairy+0.047
reetings+0.047
alis+0.047
Neg logit contributions
s-0.035
 else-0.032
 tant-0.031
 are-0.031
bs-0.031
Token them
Token ID606
Feature activation-2.88e+00

Pos logit contributions
 else+0.128
s+0.126
nam+0.122
bs+0.121
 are+0.115
Neg logit contributions
 planning-0.163
reetings-0.153
 possible-0.144
 solution-0.144
 example-0.143
Token softly
Token ID26625
Feature activation-1.55e+00

Pos logit contributions
 planning+0.223
 solution+0.206
 possible+0.202
reetings+0.200
 example+0.196
Neg logit contributions
nam-0.148
 else-0.140
bs-0.140
 tant-0.139
w-0.138
Token here
Token ID994
Feature activation-3.68e+00

Pos logit contributions
reetings+0.042
 planning+0.041
alis+0.039
 solution+0.038
joy+0.038
Neg logit contributions
s-0.022
 else-0.021
 are-0.021
nam-0.021
 both-0.020
Token and
Token ID290
Feature activation-6.96e-01

Pos logit contributions
reetings+0.319
 planning+0.318
alis+0.303
joy+0.295
 solution+0.287
Neg logit contributions
s-0.180
 else-0.173
 are-0.162
 both-0.160
nam-0.157
Token explore
Token ID7301
Feature activation-2.79e+00

Pos logit contributions
 planning+0.058
reetings+0.055
 possible+0.051
 solution+0.051
alis+0.050
Neg logit contributions
s-0.037
 else-0.035
nam-0.034
bs-0.034
 tant-0.033
Token the
Token ID262
Feature activation-5.49e-01

Pos logit contributions
 planning+0.244
reetings+0.236
alis+0.221
 solution+0.218
joy+0.218
Neg logit contributions
 else-0.139
s-0.137
 are-0.128
nam-0.126
bs-0.124
Token or
Token ID393
Feature activation+1.89e+00

Pos logit contributions
 planning+0.037
reetings+0.036
alis+0.032
joy+0.032
 possible+0.032
Neg logit contributions
 else-0.037
s-0.036
 are-0.036
nam-0.035
 trash-0.034
Tokennam
Token ID7402
Feature activation+1.23e+01

Pos logit contributions
s+0.094
bs+0.087
 else+0.085
cream+0.083
 tant+0.083
Neg logit contributions
 planning-0.162
reetings-0.153
 possible-0.147
 solution-0.144
alis-0.141
Tokenents
Token ID658
Feature activation+8.23e-01

Pos logit contributions
s+0.710
 else+0.696
nam+0.673
bs+0.665
 are+0.657
Neg logit contributions
 planning-0.960
reetings-0.919
 solution-0.846
joy-0.841
alis-0.839
Token."
Token ID526
Feature activation-1.38e+00

Pos logit contributions
s+0.041
 else+0.040
 are+0.039
bs+0.037
nam+0.036
Neg logit contributions
 planning-0.072
reetings-0.069
joy-0.067
alis-0.065
 solution-0.065
Token
Token ID198
Feature activation+4.65e-01

Pos logit contributions
 planning+0.120
reetings+0.117
 possible+0.108
 solution+0.108
alis+0.107
Neg logit contributions
s-0.068
 else-0.066
bs-0.062
nam-0.062
 are-0.061
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
s+0.023
 else+0.022
bs+0.020
 are+0.020
nam+0.020
Neg logit contributions
 planning-0.041
reetings-0.040
alis-0.037
 solution-0.037
 possible-0.037
TokenDa
Token ID26531
Feature activation-5.08e+00

Pos logit contributions
 else+0.023
s+0.022
bs+0.018
nam+0.018
 are+0.018
Neg logit contributions
 planning-0.043
reetings-0.041
alis-0.039
joy-0.039
 possible-0.038
Token the
Token ID262
Feature activation-6.03e-01

Pos logit contributions
s+0.089
 else+0.084
cream+0.081
bs+0.081
 Sorry+0.077
Neg logit contributions
 planning-0.107
 possible-0.097
 example-0.096
 solution-0.095
 addition-0.093
Token lock
Token ID5793
Feature activation+4.06e+00

Pos logit contributions
 planning+0.045
 solution+0.045
 possible+0.042
reetings+0.041
 Fairy+0.041
Neg logit contributions
s-0.035
cream-0.034
 else-0.034
bs-0.032
apy-0.030
Token and
Token ID290
Feature activation-7.40e-01

Pos logit contributions
 else+0.184
s+0.182
nam+0.182
bs+0.175
 are+0.169
Neg logit contributions
 planning-0.356
reetings-0.340
 possible-0.320
 solution-0.319
alis-0.314
Token the
Token ID262
Feature activation-5.87e-01

Pos logit contributions
 planning+0.060
 solution+0.051
 example+0.050
 possible+0.050
reetings+0.050
Neg logit contributions
s-0.041
bs-0.038
 else-0.037
 tant-0.037
w-0.037
Token or
Token ID393
Feature activation+1.85e+00

Pos logit contributions
 planning+0.043
 solution+0.043
reetings+0.039
 Fairy+0.039
joy+0.039
Neg logit contributions
s-0.036
cream-0.034
 else-0.033
bs-0.032
apy-0.029
Tokennam
Token ID7402
Feature activation+1.23e+01

Pos logit contributions
s+0.113
w+0.103
cream+0.102
 tant+0.099
bs+0.098
Neg logit contributions
 planning-0.138
 possible-0.123
 solution-0.118
 solving-0.117
 creating-0.116
Tokenents
Token ID658
Feature activation+7.76e-01

Pos logit contributions
 else+0.774
 are+0.751
s+0.743
bs+0.743
nam+0.740
Neg logit contributions
 planning-0.856
reetings-0.795
 possible-0.750
 solution-0.748
 example-0.728
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
s+0.031
 else+0.029
 are+0.028
bs+0.027
nam+0.026
Neg logit contributions
 planning-0.076
reetings-0.075
alis-0.070
joy-0.070
 example-0.069
Token
Token ID198
Feature activation+4.11e-01

Pos logit contributions
s+0.011
 else+0.011
nam+0.010
bs+0.010
 are+0.010
Neg logit contributions
 planning-0.019
reetings-0.018
 solution-0.017
joy-0.017
 possible-0.017
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
nam+0.023
s+0.022
 else+0.022
bs+0.022
 are+0.021
Neg logit contributions
 planning-0.032
reetings-0.031
 solution-0.028
 possible-0.028
joy-0.028
Token"
Token ID1
Feature activation-2.38e+00

Pos logit contributions
nam+0.031
 tant+0.031
 are+0.030
bs+0.030
cream+0.030
Neg logit contributions
 planning-0.024
reetings-0.023
 example-0.019
 solution-0.019
 possible-0.019
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
 planning+0.061
reetings+0.058
 solution+0.054
alis+0.054
 possible+0.054
Neg logit contributions
s-0.037
 else-0.036
bs-0.034
nam-0.034
 are-0.033
Token Pete
Token ID17542
Feature activation-4.10e+00

Pos logit contributions
s+0.008
 else+0.007
nam+0.007
bs+0.007
 are+0.007
Neg logit contributions
 planning-0.006
reetings-0.006
joy-0.005
 solution-0.005
 possible-0.005
Token's
Token ID338
Feature activation+1.61e-01

Pos logit contributions
 planning+0.400
reetings+0.392
alis+0.371
 possible+0.371
 solution+0.366
Neg logit contributions
s-0.162
 else-0.151
 are-0.140
bs-0.136
 were-0.132
Token family
Token ID1641
Feature activation+3.58e-01

Pos logit contributions
bs+0.009
s+0.009
nam+0.009
 else+0.009
w+0.008
Neg logit contributions
 planning-0.013
 solution-0.012
joy-0.012
 example-0.011
 addition-0.011
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
s+0.016
 else+0.016
 are+0.015
 were+0.015
bs+0.014
Neg logit contributions
 planning-0.032
reetings-0.032
alis-0.030
 possible-0.030
joy-0.029
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.080
alis+0.069
brance+0.067
lad+0.066
 Fairy+0.066
Neg logit contributions
 else-0.062
 both-0.060
 were-0.057
s-0.057
 are-0.055
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
reetings+1.107
alis+1.063
 planning+1.029
 Fairy+0.983
inger+0.966
Neg logit contributions
 else-0.557
s-0.534
 are-0.516
 were-0.492
 both-0.482
Token go
Token ID467
Feature activation+1.44e+00

Pos logit contributions
 planning+0.091
reetings+0.090
joy+0.087
alis+0.084
 possible+0.081
Neg logit contributions
 else-0.047
 are-0.047
s-0.044
nam-0.042
 were-0.042
Token on
Token ID319
Feature activation+1.66e+00

Pos logit contributions
s+0.072
 are+0.065
 else+0.064
bs+0.060
 both+0.058
Neg logit contributions
reetings-0.126
 planning-0.120
alis-0.120
 Fairy-0.116
joy-0.115
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
s+0.072
 else+0.064
 both+0.063
 are+0.062
nam+0.060
Neg logit contributions
reetings-0.159
 planning-0.146
alis-0.144
 Fairy-0.139
ross-0.137
Token special
Token ID2041
Feature activation-1.79e-01

Pos logit contributions
 planning+0.094
reetings+0.089
 solution+0.083
 possible+0.082
alis+0.082
Neg logit contributions
s-0.072
 else-0.070
nam-0.069
bs-0.069
 are-0.068
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
s+0.006
 else+0.006
bs+0.005
 were+0.005
 are+0.005
Neg logit contributions
 planning-0.018
reetings-0.017
alis-0.017
 solution-0.017
 example-0.017
TokenFinally
Token ID11158
Feature activation+1.58e+00

Pos logit contributions
 else+0.010
s+0.010
bs+0.009
 were+0.009
 both+0.009
Neg logit contributions
 planning-0.012
reetings-0.011
joy-0.011
 solution-0.010
alis-0.010
Token,
Token ID11
Feature activation+8.79e-02

Pos logit contributions
 are+0.048
s+0.048
 were+0.047
 else+0.046
bs+0.044
Neg logit contributions
reetings-0.166
 planning-0.161
alis-0.159
iddles-0.155
iddle-0.153
Token after
Token ID706
Feature activation+3.67e+00

Pos logit contributions
s+0.007
 else+0.007
bs+0.007
 both+0.007
nam+0.007
Neg logit contributions
reetings-0.005
 planning-0.005
joy-0.004
alis-0.004
 solution-0.004
Token much
Token ID881
Feature activation+2.59e+00

Pos logit contributions
s+0.153
 were+0.149
 else+0.149
 both+0.144
 are+0.143
Neg logit contributions
reetings-0.350
 solution-0.322
alis-0.316
 planning-0.311
iddle-0.299
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
 else+0.161
s+0.159
 are+0.153
bs+0.152
 were+0.147
Neg logit contributions
reetings-0.193
alis-0.172
 Fairy-0.167
 planning-0.166
inger-0.166
Token,
Token ID11
Feature activation+7.76e-02

Pos logit contributions
reetings+1.163
alis+1.124
 planning+1.121
 Fairy+1.047
joy+1.033
Neg logit contributions
 are-0.497
 else-0.469
s-0.463
 were-0.456
bs-0.413
Token the
Token ID262
Feature activation-8.30e-01

Pos logit contributions
s+0.007
 else+0.007
 were+0.007
 both+0.007
 are+0.007
Neg logit contributions
reetings-0.004
 planning-0.004
alis-0.003
iddle-0.003
joy-0.003
Token group
Token ID1448
Feature activation-6.35e+00

Pos logit contributions
reetings+0.066
 planning+0.066
alis+0.059
joy+0.059
 possible+0.057
Neg logit contributions
 else-0.044
 are-0.043
s-0.043
nam-0.042
 trash-0.041
Token set
Token ID900
Feature activation-5.66e+00

Pos logit contributions
reetings+0.584
 planning+0.575
alis+0.546
joy+0.543
 possible+0.541
Neg logit contributions
s-0.298
 else-0.287
 were-0.281
 are-0.270
bs-0.254
Token off
Token ID572
Feature activation+1.39e+00

Pos logit contributions
reetings+0.578
 planning+0.539
alis+0.537
joy+0.510
Gu+0.506
Neg logit contributions
s-0.256
 were-0.204
 are-0.204
 both-0.196
bs-0.194
Tokenens
Token ID641
Feature activation+3.92e+00

Pos logit contributions
s+0.055
 else+0.048
bs+0.046
 are+0.046
nam+0.040
Neg logit contributions
 planning-0.233
reetings-0.225
 possible-0.214
 solution-0.214
 example-0.213
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
s+0.120
 else+0.110
 are+0.104
 were+0.103
 both+0.093
Neg logit contributions
reetings-0.420
 planning-0.416
 solution-0.397
 example-0.396
 possible-0.395
Token Mitt
Token ID16627
Feature activation+2.09e+00

Pos logit contributions
 planning+0.003
reetings+0.003
 possible+0.002
 solution+0.002
 example+0.002
Neg logit contributions
s-0.002
bs-0.001
 else-0.001
 are-0.001
nam-0.001
Tokenens
Token ID641
Feature activation+3.92e+00

Pos logit contributions
s+0.047
 are+0.039
bs+0.038
 else+0.038
 were+0.033
Neg logit contributions
 planning-0.241
reetings-0.232
 possible-0.224
 example-0.223
 solution-0.223
Token was
Token ID373
Feature activation-1.01e+00

Pos logit contributions
s+0.151
 else+0.146
 are+0.133
nam+0.133
bs+0.131
Neg logit contributions
 planning-0.383
reetings-0.374
 possible-0.351
 solution-0.349
joy-0.349
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.120
alk+0.106
brance+0.105
alis+0.105
lad+0.105
Neg logit contributions
 both-0.030
 else-0.026
nam-0.025
 were-0.022
 dirty-0.022
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
reetings+1.095
 planning+1.018
alis+1.004
 possible+0.942
 Fairy+0.941
Neg logit contributions
 else-0.579
nam-0.550
s-0.543
 are-0.529
 trash-0.521
Token catch
Token ID4929
Feature activation-1.80e+00

Pos logit contributions
 planning+0.090
reetings+0.087
joy+0.082
alis+0.080
 possible+0.080
Neg logit contributions
 else-0.048
s-0.048
 are-0.046
nam-0.046
bs-0.044
Token Tim
Token ID5045
Feature activation+1.59e+00

Pos logit contributions
 planning+0.140
reetings+0.133
 solution+0.123
joy+0.123
alis+0.122
Neg logit contributions
s-0.102
 else-0.101
nam-0.099
bs-0.097
 are-0.096
Tokenmy
Token ID1820
Feature activation+5.00e-01

Pos logit contributions
s+0.084
 else+0.076
 are+0.075
nam+0.069
 were+0.069
Neg logit contributions
 planning-0.140
reetings-0.137
joy-0.128
 possible-0.127
 example-0.125
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
s+0.022
 else+0.021
nam+0.020
 are+0.020
bs+0.020
Neg logit contributions
 planning-0.046
reetings-0.045
joy-0.042
alis-0.041
 solution-0.041
TokenM
Token ID44
Feature activation+2.35e+00

Pos logit contributions
s+0.131
 are+0.118
 were+0.114
 else+0.108
bs+0.091
Neg logit contributions
 planning-0.269
 example-0.260
reetings-0.257
 possible-0.256
alis-0.252
Tokenummy
Token ID13513
Feature activation-5.49e-01

Pos logit contributions
nam+0.128
s+0.119
 else+0.118
bs+0.117
 tant+0.115
Neg logit contributions
 planning-0.197
reetings-0.187
joy-0.172
 example-0.169
 solution-0.169
Token and
Token ID290
Feature activation-9.86e-01

Pos logit contributions
 planning+0.049
reetings+0.049
 possible+0.046
 solution+0.046
alis+0.046
Neg logit contributions
s-0.025
 else-0.023
 are-0.023
bs-0.022
 were-0.021
Token Daddy
Token ID26926
Feature activation+6.11e+00

Pos logit contributions
 planning+0.066
reetings+0.059
joy+0.055
 spending+0.055
 possible+0.053
Neg logit contributions
s-0.068
 else-0.068
bs-0.065
 tant-0.064
 trash-0.062
Token were
Token ID547
Feature activation+5.09e+00

Pos logit contributions
 else+0.261
s+0.260
nam+0.258
bs+0.245
 tant+0.235
Neg logit contributions
 planning-0.566
reetings-0.539
 solution-0.499
joy-0.498
alis-0.493
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.279
 else+0.264
 both+0.260
 are+0.256
nam+0.252
Neg logit contributions
reetings-0.417
 possible-0.378
 planning-0.378
alis-0.377
 solution-0.369
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
reetings+1.057
 planning+1.028
alis+0.987
 solution+0.939
joy+0.939
Neg logit contributions
 else-0.582
s-0.580
 are-0.548
nam-0.540
bs-0.524
Token picnic
Token ID35715
Feature activation-2.17e+00

Pos logit contributions
 planning+0.094
reetings+0.087
 solution+0.083
 possible+0.081
 example+0.080
Neg logit contributions
s-0.073
nam-0.072
bs-0.071
 else-0.071
 are-0.070
Token.
Token ID13
Feature activation-3.47e-02

Pos logit contributions
reetings+0.224
 planning+0.215
alis+0.205
joy+0.203
 example+0.198
Neg logit contributions
s-0.078
 else-0.073
 are-0.067
 were-0.063
nam-0.061
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
 planning+0.003
reetings+0.003
 solution+0.003
 possible+0.003
alis+0.003
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
bs-0.001
 are-0.001
Token"
Token ID1
Feature activation-2.64e+00

Pos logit contributions
s+0.007
 else+0.007
 are+0.006
bs+0.006
nam+0.006
Neg logit contributions
 planning-0.014
reetings-0.013
alis-0.013
 example-0.012
 solution-0.012
Token excited
Token ID6568
Feature activation-9.88e-01

Pos logit contributions
 planning+0.285
reetings+0.270
 solution+0.251
joy+0.250
 possible+0.249
Neg logit contributions
s-0.191
 else-0.187
nam-0.182
bs-0.180
 are-0.180
Token.
Token ID13
Feature activation-4.05e-02

Pos logit contributions
 planning+0.089
reetings+0.086
 solution+0.080
alis+0.080
joy+0.080
Neg logit contributions
s-0.045
 else-0.044
nam-0.042
bs-0.041
 are-0.041
Token He
Token ID679
Feature activation+9.01e-01

Pos logit contributions
 planning+0.004
reetings+0.004
 possible+0.003
 solution+0.003
alis+0.003
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
 are-0.002
bs-0.002
Token had
Token ID550
Feature activation-1.18e+00

Pos logit contributions
s+0.038
 else+0.037
nam+0.037
bs+0.035
 are+0.035
Neg logit contributions
 planning-0.084
reetings-0.080
 solution-0.075
joy-0.074
alis-0.074
Token been
Token ID587
Feature activation-4.23e+00

Pos logit contributions
reetings+0.099
 planning+0.095
alis+0.092
 Fairy+0.088
 solution+0.088
Neg logit contributions
 else-0.062
s-0.061
 were-0.057
 are-0.057
nam-0.055
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.328
alis+0.302
alk+0.296
 Fairy+0.286
 former+0.281
Neg logit contributions
 else-0.261
nam-0.247
 both-0.239
s-0.237
 trash-0.216
Token this
Token ID428
Feature activation+2.20e-01

Pos logit contributions
reetings+1.041
 planning+1.034
alis+0.976
 solution+0.935
joy+0.933
Neg logit contributions
 else-0.584
s-0.575
nam-0.547
 are-0.545
bs-0.525
Token trip
Token ID5296
Feature activation+1.42e+00

Pos logit contributions
 else+0.012
s+0.012
 are+0.011
nam+0.011
 trash+0.011
Neg logit contributions
reetings-0.017
 planning-0.017
alis-0.016
joy-0.016
 Fairy-0.015
Token for
Token ID329
Feature activation-2.49e+00

Pos logit contributions
s+0.070
 else+0.066
 are+0.062
 were+0.060
bs+0.058
Neg logit contributions
 planning-0.123
reetings-0.121
alis-0.116
joy-0.115
 example-0.110
Token months
Token ID1933
Feature activation-2.02e+00

Pos logit contributions
reetings+0.224
 planning+0.218
alis+0.212
 solution+0.206
joy+0.202
Neg logit contributions
 else-0.106
 are-0.101
nam-0.101
s-0.100
 were-0.098
Token.
Token ID13
Feature activation-4.15e-02

Pos logit contributions
reetings+0.183
 planning+0.182
alis+0.173
joy+0.170
 solution+0.169
Neg logit contributions
s-0.092
 else-0.085
 are-0.080
bs-0.078
nam-0.076
Token So
Token ID1406
Feature activation+3.51e+00

Pos logit contributions
 planning+0.306
reetings+0.301
 possible+0.277
alis+0.274
 solution+0.273
Neg logit contributions
s-0.177
 else-0.167
bs-0.159
nam-0.156
 are-0.156
Token the
Token ID262
Feature activation-8.21e-01

Pos logit contributions
s+0.140
 else+0.135
 are+0.126
nam+0.124
bs+0.123
Neg logit contributions
 planning-0.338
reetings-0.332
alis-0.312
joy-0.308
 solution-0.305
Token creative
Token ID7325
Feature activation+4.01e-02

Pos logit contributions
reetings+0.076
alis+0.070
 planning+0.070
joy+0.069
 solution+0.065
Neg logit contributions
 else-0.037
s-0.037
 are-0.037
 both-0.035
 were-0.034
Token club
Token ID3430
Feature activation-3.25e+00

Pos logit contributions
s+0.002
 else+0.002
 are+0.002
 were+0.002
bs+0.002
Neg logit contributions
reetings-0.004
 planning-0.004
alis-0.003
 possible-0.003
joy-0.003
Token started
Token ID2067
Feature activation+2.23e-01

Pos logit contributions
reetings+0.314
 planning+0.308
joy+0.293
alis+0.287
 possible+0.279
Neg logit contributions
s-0.142
 else-0.137
 are-0.127
 were-0.126
bs-0.115
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.013
 else+0.012
 were+0.012
 both+0.011
 are+0.011
Neg logit contributions
reetings-0.020
alis-0.017
brance-0.016
 Fairy-0.016
inger-0.016
Token for
Token ID329
Feature activation-2.47e+00

Pos logit contributions
reetings+1.143
alis+1.069
 planning+1.061
 Fairy+0.995
brance+0.984
Neg logit contributions
 else-0.538
 are-0.519
s-0.518
 were-0.488
nam-0.475
Token their
Token ID511
Feature activation+5.08e-01

Pos logit contributions
reetings+0.244
 planning+0.223
alis+0.223
iddle+0.207
ross+0.206
Neg logit contributions
 else-0.105
 are-0.097
 both-0.097
nam-0.095
 were-0.093
Token game
Token ID983
Feature activation-2.71e+00

Pos logit contributions
 else+0.031
s+0.030
 are+0.028
nam+0.028
bs+0.027
Neg logit contributions
reetings-0.040
 planning-0.037
alis-0.035
joy-0.034
 Fairy-0.033
Token.
Token ID13
Feature activation-1.87e-02

Pos logit contributions
reetings+0.268
 planning+0.263
alis+0.250
joy+0.247
 Fairy+0.239
Neg logit contributions
s-0.106
 else-0.105
 are-0.094
 were-0.088
bs-0.085
Token They
Token ID1119
Feature activation-6.73e-01

Pos logit contributions
reetings+0.002
 planning+0.002
 possible+0.002
 solution+0.002
 believing+0.002
Neg logit contributions
s-0.001
 else-0.001
 are-0.001
bs-0.001
nam-0.001
Token it
Token ID340
Feature activation-3.29e-01

Pos logit contributions
 else+0.077
s+0.074
 are+0.072
 were+0.069
nam+0.067
Neg logit contributions
reetings-0.194
 planning-0.187
alis-0.183
joy-0.175
 example-0.174
Token took
Token ID1718
Feature activation-2.84e-01

Pos logit contributions
reetings+0.032
 planning+0.032
joy+0.031
 example+0.030
alis+0.030
Neg logit contributions
s-0.013
 else-0.013
 are-0.012
 were-0.011
bs-0.010
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
 planning+0.025
reetings+0.024
joy+0.022
alis+0.022
 possible+0.022
Neg logit contributions
 else-0.014
s-0.014
bs-0.013
 are-0.013
nam-0.013
Token lot
Token ID1256
Feature activation+6.04e+00

Pos logit contributions
 planning+0.087
 solution+0.082
reetings+0.081
 possible+0.077
joy+0.076
Neg logit contributions
nam-0.077
s-0.076
cream-0.075
 are-0.073
 else-0.072
Token of
Token ID286
Feature activation-9.04e-01

Pos logit contributions
s+0.336
 else+0.332
bs+0.317
 are+0.312
nam+0.305
Neg logit contributions
 planning-0.490
reetings-0.467
joy-0.435
alis-0.433
 solution-0.429
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
 planning+0.072
reetings+0.068
 solution+0.063
joy+0.062
 possible+0.062
Neg logit contributions
s-0.052
 else-0.050
nam-0.049
bs-0.048
 are-0.047
Token and
Token ID290
Feature activation-9.69e-01

Pos logit contributions
 planning+1.061
reetings+1.057
alis+0.990
joy+0.959
 solution+0.947
Neg logit contributions
 else-0.572
s-0.562
 are-0.546
nam-0.528
bs-0.525
Token hard
Token ID1327
Feature activation+1.59e+00

Pos logit contributions
reetings+0.078
 planning+0.076
joy+0.072
alis+0.072
 solution+0.069
Neg logit contributions
 else-0.053
s-0.051
 are-0.051
nam-0.049
bs-0.049
Token work
Token ID670
Feature activation-2.77e+00

Pos logit contributions
s+0.088
bs+0.081
 else+0.081
 are+0.080
 were+0.077
Neg logit contributions
reetings-0.130
 planning-0.122
alis-0.119
 Fairy-0.116
 solution-0.114
Token,
Token ID11
Feature activation+9.68e-02

Pos logit contributions
 planning+0.255
reetings+0.254
alis+0.240
joy+0.238
 example+0.230
Neg logit contributions
s-0.122
 else-0.119
 are-0.113
bs-0.109
 were-0.107
Token things
Token ID1243
Feature activation+2.96e+00

Pos logit contributions
 else+0.004
s+0.004
 are+0.004
nam+0.003
bs+0.003
Neg logit contributions
reetings-0.009
 planning-0.009
alis-0.009
joy-0.008
 solution-0.008
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
 planning+0.564
reetings+0.555
joy+0.523
alis+0.521
 possible+0.520
Neg logit contributions
s-0.210
 else-0.205
bs-0.185
 are-0.183
 were-0.173
Token her
Token ID607
Feature activation-2.80e-01

Pos logit contributions
 planning+0.072
reetings+0.071
alis+0.064
joy+0.064
 possible+0.064
Neg logit contributions
s-0.059
 else-0.058
nam-0.057
bs-0.056
 are-0.056
Token mom
Token ID1995
Feature activation+4.99e+00

Pos logit contributions
 planning+0.023
reetings+0.022
 solution+0.021
joy+0.020
alis+0.020
Neg logit contributions
s-0.015
 else-0.015
nam-0.014
bs-0.014
 are-0.014
Tokenmy
Token ID1820
Feature activation+5.00e-01

Pos logit contributions
 else+0.294
s+0.293
nam+0.292
bs+0.283
 are+0.273
Neg logit contributions
 planning-0.377
reetings-0.353
 solution-0.327
joy-0.319
alis-0.318
Token were
Token ID547
Feature activation+5.09e+00

Pos logit contributions
nam+0.022
 else+0.021
s+0.021
bs+0.020
 tant+0.020
Neg logit contributions
 planning-0.046
reetings-0.042
 solution-0.040
joy-0.040
 Fairy-0.040
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
 else+0.248
 both+0.234
nam+0.223
 were+0.222
s+0.219
Neg logit contributions
reetings-0.462
alis-0.415
 planning-0.402
joy-0.397
 possible-0.391
Token treatment
Token ID3513
Feature activation-2.48e+00

Pos logit contributions
reetings+1.039
 planning+1.005
alis+0.969
joy+0.911
 solution+0.910
Neg logit contributions
 else-0.600
s-0.596
 are-0.568
nam-0.558
bs-0.550
Token for
Token ID329
Feature activation-2.49e+00

Pos logit contributions
 planning+0.209
reetings+0.205
alis+0.190
joy+0.189
 solution+0.187
Neg logit contributions
s-0.126
 else-0.122
bs-0.116
 are-0.116
nam-0.115
Token Buster
Token ID34452
Feature activation-1.09e+00

Pos logit contributions
reetings+0.221
 planning+0.208
alis+0.199
 solution+0.195
joy+0.194
Neg logit contributions
 else-0.117
nam-0.115
s-0.113
 are-0.111
bs-0.109
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
reetings+0.102
 planning+0.102
joy+0.095
alis+0.094
 solution+0.092
Neg logit contributions
s-0.047
 else-0.044
 are-0.042
bs-0.041
 were-0.039
Token Mia
Token ID32189
Feature activation-2.69e+00

Pos logit contributions
reetings+0.002
 planning+0.002
 possible+0.002
 solution+0.002
 believing+0.002
Neg logit contributions
s-0.001
 else-0.001
bs-0.001
 were-0.001
 are-0.001
TokenS
Token ID50
Feature activation+6.43e-01

Pos logit contributions
s+0.121
 are+0.112
 were+0.109
 else+0.099
 trash+0.085
Neg logit contributions
 planning-0.274
reetings-0.265
 example-0.264
 possible-0.262
alis-0.257
Tokenue
Token ID518
Feature activation-5.05e+00

Pos logit contributions
s+0.052
 else+0.050
 are+0.049
bs+0.048
nam+0.047
Neg logit contributions
 planning-0.037
reetings-0.034
 possible-0.031
 example-0.031
 solution-0.030
Token and
Token ID290
Feature activation-9.68e-01

Pos logit contributions
 planning+0.439
reetings+0.420
 solution+0.389
joy+0.388
alis+0.383
Neg logit contributions
nam-0.242
 else-0.235
s-0.234
bs-0.222
 tant-0.214
Token Mike
Token ID4995
Feature activation+7.51e-01

Pos logit contributions
 planning+0.064
reetings+0.059
 spending+0.055
joy+0.055
 possible+0.055
Neg logit contributions
 tant-0.065
s-0.064
 else-0.063
bs-0.062
nam-0.061
Token were
Token ID547
Feature activation+5.09e+00

Pos logit contributions
nam+0.033
 else+0.028
 tant+0.028
bs+0.027
s+0.026
Neg logit contributions
 planning-0.071
reetings-0.067
 solution-0.061
joy-0.061
 known-0.061
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.298
 else+0.289
 are+0.285
 both+0.285
nam+0.282
Neg logit contributions
reetings-0.393
 planning-0.368
alis-0.351
 solution-0.345
 possible-0.345
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
reetings+1.093
 planning+1.043
alis+1.022
 Fairy+0.968
joy+0.960
Neg logit contributions
 else-0.568
s-0.555
 are-0.523
nam-0.518
bs-0.491
Token picnic
Token ID35715
Feature activation-2.16e+00

Pos logit contributions
 planning+0.092
reetings+0.085
 solution+0.080
 possible+0.079
 example+0.078
Neg logit contributions
s-0.075
nam-0.073
bs-0.073
 else-0.073
 are-0.072
Token in
Token ID287
Feature activation+2.56e+00

Pos logit contributions
reetings+0.217
 planning+0.210
alis+0.198
joy+0.197
 example+0.191
Neg logit contributions
s-0.082
 else-0.078
 are-0.071
nam-0.068
 were-0.066
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
s+0.111
 else+0.109
nam+0.104
 are+0.104
bs+0.097
Neg logit contributions
 planning-0.233
reetings-0.231
alis-0.216
 solution-0.212
joy-0.211
Token park
Token ID3952
Feature activation-3.48e+00

Pos logit contributions
 planning+0.058
reetings+0.056
joy+0.050
 solution+0.050
alis+0.050
Neg logit contributions
s-0.053
 else-0.052
nam-0.050
 are-0.050
bs-0.049
Token birthday
Token ID10955
Feature activation+1.29e+00

Pos logit contributions
 planning+0.222
reetings+0.215
alis+0.195
 possible+0.194
 solution+0.193
Neg logit contributions
s-0.202
 else-0.197
 are-0.191
nam-0.190
bs-0.189
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
s+0.068
 else+0.067
 are+0.062
bs+0.060
nam+0.059
Neg logit contributions
reetings-0.106
 planning-0.106
alis-0.098
joy-0.095
 possible-0.095
Token Her
Token ID2332
Feature activation-1.06e+00

Pos logit contributions
 planning+0.001
reetings+0.001
 possible+0.001
 solution+0.001
alis+0.001
Neg logit contributions
s-0.001
 else-0.001
bs-0.001
nam-0.001
 are-0.001
Token mom
Token ID1995
Feature activation+5.01e+00

Pos logit contributions
 planning+0.107
reetings+0.099
 solution+0.097
joy+0.097
 example+0.096
Neg logit contributions
s-0.038
 else-0.036
nam-0.035
bs-0.035
 are-0.033
Token was
Token ID373
Feature activation-9.71e-01

Pos logit contributions
s+0.204
 else+0.197
 are+0.182
bs+0.174
 were+0.174
Neg logit contributions
 planning-0.476
reetings-0.472
 possible-0.444
alis-0.443
joy-0.440
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.094
alis+0.086
 planning+0.082
 possible+0.081
brance+0.081
Neg logit contributions
 else-0.045
 both-0.039
nam-0.039
 were-0.038
 are-0.038
Token to
Token ID284
Feature activation-9.99e-01

Pos logit contributions
reetings+1.061
 planning+1.013
alis+0.994
joy+0.927
 solution+0.926
Neg logit contributions
 else-0.597
s-0.571
 are-0.559
nam-0.539
bs-0.527
Token make
Token ID787
Feature activation+7.68e-01

Pos logit contributions
 planning+0.084
reetings+0.081
joy+0.075
alis+0.075
 solution+0.075
Neg logit contributions
s-0.050
 else-0.050
nam-0.048
 are-0.047
bs-0.047
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
 else+0.036
s+0.035
 are+0.032
bs+0.030
 both+0.030
Neg logit contributions
reetings-0.073
 planning-0.068
alis-0.065
joy-0.063
 solution-0.062
Token special
Token ID2041
Feature activation-1.42e-01

Pos logit contributions
 planning+0.088
 solution+0.081
reetings+0.077
 example+0.076
 possible+0.076
Neg logit contributions
s-0.075
bs-0.074
nam-0.073
 are-0.070
w-0.069
Token banana
Token ID25996
Feature activation+2.46e+00

Pos logit contributions
 planning+0.011
reetings+0.011
alis+0.010
joy+0.010
 possible+0.010
Neg logit contributions
 else-0.008
s-0.008
 are-0.007
nam-0.007
bs-0.007
Token know
Token ID760
Feature activation+8.46e-01

Pos logit contributions
 planning+0.316
reetings+0.302
 solution+0.282
 possible+0.280
alis+0.276
Neg logit contributions
s-0.178
nam-0.174
 else-0.173
bs-0.166
 are-0.161
Token,
Token ID11
Feature activation+1.04e-01

Pos logit contributions
s+0.034
 else+0.032
nam+0.032
bs+0.032
 are+0.030
Neg logit contributions
 planning-0.081
reetings-0.077
 solution-0.072
 possible-0.072
joy-0.072
Token Grand
Token ID5675
Feature activation+5.38e+00

Pos logit contributions
 else+0.004
s+0.004
 are+0.003
nam+0.003
 were+0.003
Neg logit contributions
reetings-0.010
 planning-0.010
alis-0.010
 solution-0.009
 possible-0.009
Tokenma
Token ID2611
Feature activation-1.01e-01

Pos logit contributions
s+0.298
 else+0.292
 are+0.278
cream+0.267
bs+0.260
Neg logit contributions
 planning-0.431
reetings-0.414
alis-0.395
 possible-0.388
 solution-0.385
Token was
Token ID373
Feature activation-9.70e-01

Pos logit contributions
 planning+0.009
reetings+0.009
alis+0.009
joy+0.009
 solution+0.008
Neg logit contributions
s-0.005
 else-0.005
 are-0.004
bs-0.004
 were-0.004
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.083
alis+0.077
brance+0.073
alk+0.073
joy+0.071
Neg logit contributions
 else-0.055
 both-0.049
nam-0.047
 are-0.044
 were-0.043
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
 planning+1.039
reetings+1.039
alis+0.970
joy+0.936
 solution+0.936
Neg logit contributions
 else-0.585
s-0.568
nam-0.545
 are-0.540
bs-0.527
Token surprise
Token ID5975
Feature activation+5.54e-02

Pos logit contributions
 planning+0.096
reetings+0.092
alis+0.084
joy+0.084
 possible+0.084
Neg logit contributions
 else-0.069
s-0.068
nam-0.065
bs-0.064
 are-0.063
Token for
Token ID329
Feature activation-2.47e+00

Pos logit contributions
 else+0.003
s+0.003
 are+0.003
bs+0.003
nam+0.003
Neg logit contributions
 planning-0.004
reetings-0.004
alis-0.004
joy-0.004
 solution-0.004
Token him
Token ID683
Feature activation+4.58e+00

Pos logit contributions
 planning+0.244
reetings+0.241
alis+0.227
 solution+0.224
joy+0.223
Neg logit contributions
 else-0.087
s-0.085
nam-0.081
 are-0.078
bs-0.076
Token.
Token ID13
Feature activation-8.72e-03

Pos logit contributions
 else+0.205
s+0.200
 are+0.184
bs+0.176
 were+0.169
Neg logit contributions
 planning-0.415
reetings-0.411
alis-0.393
joy-0.385
 solution-0.378
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
s+0.212
 else+0.207
nam+0.197
bs+0.194
 are+0.194
Neg logit contributions
 planning-0.408
reetings-0.394
 solution-0.366
alis-0.364
joy-0.363
Token Tim
Token ID5045
Feature activation+1.60e+00

Pos logit contributions
 planning+0.002
reetings+0.002
joy+0.002
 solution+0.002
alis+0.002
Neg logit contributions
s-0.001
 else-0.001
nam-0.001
bs-0.001
 are-0.001
Tokenmy
Token ID1820
Feature activation+5.11e-01

Pos logit contributions
s+0.088
 else+0.084
 are+0.082
 were+0.077
bs+0.077
Neg logit contributions
 planning-0.131
reetings-0.125
 possible-0.118
alis-0.116
joy-0.115
Token realized
Token ID6939
Feature activation+1.15e+00

Pos logit contributions
s+0.021
 else+0.021
nam+0.019
bs+0.019
 are+0.019
Neg logit contributions
 planning-0.048
reetings-0.047
 solution-0.044
 possible-0.043
alis-0.043
Token that
Token ID326
Feature activation+4.59e+00

Pos logit contributions
s+0.031
 else+0.030
nam+0.028
bs+0.027
 are+0.026
Neg logit contributions
 planning-0.125
reetings-0.122
 solution-0.115
alis-0.115
joy-0.114
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.215
 else+0.214
nam+0.206
 are+0.201
bs+0.196
Neg logit contributions
 planning-0.402
reetings-0.393
alis-0.364
 solution-0.362
joy-0.361
Token and
Token ID290
Feature activation-9.60e-01

Pos logit contributions
 planning+1.089
reetings+1.059
alis+1.000
joy+0.987
 solution+0.973
Neg logit contributions
 else-0.547
s-0.540
 are-0.520
nam-0.507
bs-0.488
Token hard
Token ID1327
Feature activation+1.60e+00

Pos logit contributions
 planning+0.085
reetings+0.084
alis+0.079
joy+0.078
 solution+0.077
Neg logit contributions
 else-0.043
s-0.043
 are-0.041
nam-0.040
bs-0.039
Token work
Token ID670
Feature activation-2.76e+00

Pos logit contributions
s+0.089
 else+0.082
bs+0.081
 are+0.081
 were+0.077
Neg logit contributions
reetings-0.127
 planning-0.124
alis-0.120
 Fairy-0.117
 solution-0.116
Token can
Token ID460
Feature activation+2.68e+00

Pos logit contributions
 planning+0.246
reetings+0.239
alis+0.228
joy+0.225
 example+0.219
Neg logit contributions
s-0.130
 else-0.127
 are-0.120
bs-0.115
nam-0.112
Token help
Token ID1037
Feature activation+5.63e-01

Pos logit contributions
 are+0.136
 else+0.134
s+0.134
 were+0.123
 both+0.115
Neg logit contributions
 planning-0.233
joy-0.221
reetings-0.220
alis-0.207
 example-0.206
Token mom
Token ID1995
Feature activation+5.01e+00

Pos logit contributions
 planning+0.112
reetings+0.108
 solution+0.103
joy+0.102
 possible+0.101
Neg logit contributions
s-0.032
 else-0.031
nam-0.029
bs-0.028
 are-0.028
Token said
Token ID531
Feature activation+7.39e-01

Pos logit contributions
s+0.200
 else+0.197
 were+0.172
 are+0.169
bs+0.164
Neg logit contributions
reetings-0.487
 planning-0.485
 possible-0.460
alis-0.454
joy-0.450
Token that
Token ID326
Feature activation+4.60e+00

Pos logit contributions
 else+0.030
s+0.029
 are+0.026
bs+0.026
nam+0.025
Neg logit contributions
 planning-0.072
reetings-0.072
alis-0.070
 solution-0.066
 possible-0.065
Token she
Token ID673
Feature activation-6.34e-01

Pos logit contributions
 else+0.207
s+0.196
 are+0.191
 were+0.186
nam+0.180
Neg logit contributions
reetings-0.435
 planning-0.421
alis-0.410
joy-0.406
inger-0.399
Token was
Token ID373
Feature activation-9.58e-01

Pos logit contributions
 planning+0.062
reetings+0.061
joy+0.057
alis+0.057
 possible+0.057
Neg logit contributions
s-0.024
 else-0.023
 are-0.022
bs-0.021
nam-0.021
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.092
alis+0.086
inger+0.085
alk+0.085
eway+0.083
Neg logit contributions
 both-0.049
 else-0.046
nam-0.040
 very-0.038
 are-0.038
Token to
Token ID284
Feature activation-9.88e-01

Pos logit contributions
reetings+1.087
 planning+1.047
alis+1.028
joy+0.965
 solution+0.949
Neg logit contributions
 else-0.579
s-0.532
 are-0.529
nam-0.522
bs-0.500
Token sell
Token ID3677
Feature activation+1.06e+00

Pos logit contributions
reetings+0.090
 planning+0.089
joy+0.087
alis+0.084
 possible+0.081
Neg logit contributions
 are-0.043
 else-0.043
 were-0.038
s-0.038
 both-0.038
Token some
Token ID617
Feature activation-1.04e+00

Pos logit contributions
s+0.056
 else+0.055
bs+0.053
nam+0.053
 are+0.052
Neg logit contributions
 planning-0.087
reetings-0.082
 solution-0.077
 possible-0.077
joy-0.076
Token of
Token ID286
Feature activation-8.47e-01

Pos logit contributions
 planning+0.082
reetings+0.075
 solution+0.074
 possible+0.072
 example+0.072
Neg logit contributions
s-0.056
nam-0.055
bs-0.053
 else-0.052
w-0.050
Token Emily
Token ID17608
Feature activation-6.90e+00

Pos logit contributions
reetings+0.082
 planning+0.078
joy+0.073
alis+0.073
 possible+0.069
Neg logit contributions
 else-0.038
nam-0.037
 are-0.036
s-0.033
 trash-0.033
Token and
Token ID290
Feature activation-9.34e-01

Pos logit contributions
 planning+0.270
reetings+0.264
 possible+0.248
 solution+0.246
alis+0.244
Neg logit contributions
s-0.127
 else-0.121
 are-0.114
bs-0.114
nam-0.109
Token her
Token ID607
Feature activation-2.37e-01

Pos logit contributions
 planning+0.068
reetings+0.061
joy+0.057
 spending+0.057
 Fairy+0.056
Neg logit contributions
s-0.059
 else-0.057
 tant-0.056
bs-0.055
nam-0.055
Token mom
Token ID1995
Feature activation+5.02e+00

Pos logit contributions
 planning+0.020
joy+0.018
 solution+0.018
 Fairy+0.018
 example+0.018
Neg logit contributions
s-0.011
nam-0.011
cream-0.011
 else-0.011
bs-0.011
Tokenmy
Token ID1820
Feature activation+5.39e-01

Pos logit contributions
 else+0.306
s+0.304
nam+0.304
bs+0.297
 tant+0.286
Neg logit contributions
 planning-0.375
reetings-0.350
 solution-0.319
joy-0.317
alis-0.312
Token were
Token ID547
Feature activation+5.13e+00

Pos logit contributions
 else+0.020
nam+0.020
s+0.020
bs+0.019
 tant+0.018
Neg logit contributions
 planning-0.053
reetings-0.050
joy-0.047
 solution-0.047
alis-0.046
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
 both+0.277
s+0.272
 else+0.270
nam+0.265
 are+0.253
Neg logit contributions
reetings-0.440
 possible-0.387
alis-0.378
 Fairy-0.375
 solution-0.369
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
reetings+1.175
alis+1.085
 planning+1.064
 Fairy+1.024
inger+1.011
Neg logit contributions
 else-0.528
s-0.499
 are-0.481
 both-0.472
nam-0.466
Token very
Token ID845
Feature activation+3.85e+00

Pos logit contributions
 planning+0.090
reetings+0.084
 solution+0.079
 possible+0.078
joy+0.077
Neg logit contributions
s-0.071
 else-0.069
nam-0.068
bs-0.067
 are-0.067
Token special
Token ID2041
Feature activation-1.30e-01

Pos logit contributions
s+0.225
 else+0.217
nam+0.217
 are+0.216
bs+0.213
Neg logit contributions
 planning-0.297
reetings-0.277
 solution-0.260
 possible-0.258
 example-0.255
Token day
Token ID1110
Feature activation-6.69e-01

Pos logit contributions
reetings+0.010
 planning+0.009
alis+0.009
joy+0.009
 possible+0.009
Neg logit contributions
 else-0.008
s-0.007
 trash-0.007
 are-0.007
bs-0.007
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
reetings+0.069
 planning+0.065
alis+0.064
 solution+0.063
joy+0.061
Neg logit contributions
s-0.024
 else-0.023
bs-0.021
nam-0.019
 are-0.019
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
reetings+0.053
 planning+0.052
alis+0.049
joy+0.049
 example+0.047
Neg logit contributions
s-0.018
 else-0.016
 are-0.013
 were-0.013
nam-0.013
Token The
Token ID383
Feature activation+1.29e-01

Pos logit contributions
s+0.001
 else+0.001
nam+0.001
 are+0.001
bs+0.001
Neg logit contributions
 planning-0.001
reetings-0.001
 possible-0.001
 solution-0.001
joy-0.001
Token family
Token ID1641
Feature activation+4.10e-01

Pos logit contributions
bs+0.008
 else+0.008
s+0.008
nam+0.008
w+0.007
Neg logit contributions
 planning-0.010
 solution-0.008
 example-0.008
joy-0.008
 addition-0.008
Token had
Token ID550
Feature activation-1.13e+00

Pos logit contributions
nam+0.020
bs+0.019
 else+0.019
s+0.019
 tant+0.018
Neg logit contributions
 planning-0.037
reetings-0.034
 solution-0.032
 example-0.031
 addition-0.031
Token been
Token ID587
Feature activation-4.17e+00

Pos logit contributions
reetings+0.108
 planning+0.100
alis+0.098
 solution+0.095
 Fairy+0.094
Neg logit contributions
s-0.048
 else-0.048
 are-0.046
 were-0.046
nam-0.044
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.356
alis+0.338
alk+0.318
 Fairy+0.315
 phrase+0.310
Neg logit contributions
 else-0.219
s-0.211
nam-0.209
 both-0.207
 were-0.195
Token for
Token ID329
Feature activation-2.44e+00

Pos logit contributions
reetings+1.065
 planning+1.017
alis+1.008
 Fairy+0.946
joy+0.944
Neg logit contributions
 else-0.572
s-0.568
 are-0.544
nam-0.530
 were-0.518
Token months
Token ID1933
Feature activation-1.97e+00

Pos logit contributions
reetings+0.216
 planning+0.209
alis+0.200
 solution+0.194
 possible+0.191
Neg logit contributions
 else-0.113
s-0.112
 are-0.108
nam-0.108
 were-0.102
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
reetings+0.179
alis+0.169
 planning+0.168
joy+0.163
 solution+0.161
Neg logit contributions
s-0.095
 else-0.088
 are-0.081
bs-0.081
 were-0.080
Token They
Token ID1119
Feature activation-6.27e-01

Pos logit contributions
s+0.002
 else+0.002
nam+0.002
bs+0.002
 are+0.002
Neg logit contributions
 planning-0.003
reetings-0.003
 solution-0.003
 possible-0.003
joy-0.003
Token mailed
Token ID37789
Feature activation-1.68e+00

Pos logit contributions
 planning+0.056
reetings+0.048
 example+0.046
 known+0.046
 solution+0.046
Neg logit contributions
s-0.031
 tant-0.031
 else-0.030
bs-0.029
nam-0.029
Token Mary
Token ID5335
Feature activation-4.91e+00

Pos logit contributions
s+0.001
 are+0.001
 were+0.001
bs+0.001
 else+0.001
Neg logit contributions
 planning-0.001
reetings-0.001
 possible-0.001
 solution-0.001
alis-0.001
Token and
Token ID290
Feature activation-9.01e-01

Pos logit contributions
 planning+0.378
reetings+0.328
 Fairy+0.326
 solution+0.314
 addition+0.309
Neg logit contributions
 tant-0.293
nam-0.284
bs-0.255
s-0.246
apy-0.243
Token her
Token ID607
Feature activation-2.15e-01

Pos logit contributions
 planning+0.062
reetings+0.055
 spending+0.053
 possible+0.053
 addition+0.053
Neg logit contributions
s-0.061
 else-0.059
 tant-0.058
bs-0.058
cream-0.055
Token family
Token ID1641
Feature activation+4.31e-01

Pos logit contributions
 planning+0.019
reetings+0.018
 solution+0.018
joy+0.018
 Fairy+0.017
Neg logit contributions
s-0.010
 else-0.009
nam-0.009
bs-0.009
 are-0.008
Token were
Token ID547
Feature activation+5.16e+00

Pos logit contributions
 tant+0.020
nam+0.019
bs+0.019
 else+0.019
s+0.018
Neg logit contributions
 planning-0.040
 addition-0.035
 spending-0.034
reetings-0.034
 solution-0.034
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.234
nam+0.229
 are+0.226
 else+0.226
 were+0.216
Neg logit contributions
reetings-0.473
 planning-0.444
alis-0.425
joy-0.424
 solution-0.421
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
reetings+1.148
 planning+1.068
alis+1.064
 Fairy+1.014
joy+1.000
Neg logit contributions
 else-0.532
s-0.512
nam-0.491
 are-0.489
bs-0.468
Token trip
Token ID5296
Feature activation+1.49e+00

Pos logit contributions
 planning+0.085
reetings+0.080
 solution+0.074
 possible+0.073
joy+0.073
Neg logit contributions
s-0.073
 else-0.072
nam-0.070
bs-0.069
 are-0.068
Token to
Token ID284
Feature activation-9.48e-01

Pos logit contributions
s+0.072
 else+0.066
 are+0.060
bs+0.059
 were+0.058
Neg logit contributions
reetings-0.135
 planning-0.131
joy-0.127
alis-0.127
 Fairy-0.122
Token visit
Token ID3187
Feature activation-3.04e+00

Pos logit contributions
reetings+0.085
 planning+0.085
joy+0.079
alis+0.079
 solution+0.076
Neg logit contributions
 else-0.042
 are-0.042
s-0.040
nam-0.040
 were-0.037
Token far
Token ID1290
Feature activation+3.97e+00

Pos logit contributions
 planning+0.276
reetings+0.268
alis+0.249
 solution+0.249
joy+0.247
Neg logit contributions
s-0.137
 else-0.134
nam-0.128
bs-0.125
 are-0.125
Token lesson
Token ID11483
Feature activation-4.33e+00

Pos logit contributions
reetings+0.179
 planning+0.178
joy+0.171
alis+0.164
brance+0.158
Neg logit contributions
s-0.163
 else-0.156
 tant-0.145
 are-0.144
 trash-0.144
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
 planning+0.399
alis+0.396
reetings+0.392
joy+0.384
 solution+0.373
Neg logit contributions
s-0.193
 else-0.183
 are-0.176
 were-0.168
bs-0.151
Token With
Token ID2080
Feature activation-2.57e+00

Pos logit contributions
s+0.001
 else+0.001
 are+0.001
bs+0.001
 both+0.001
Neg logit contributions
 planning-0.003
reetings-0.003
 possible-0.003
 solution-0.003
alis-0.003
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
 planning+0.207
reetings+0.193
 example+0.183
 possible+0.183
joy+0.181
Neg logit contributions
s-0.143
 else-0.137
nam-0.133
bs-0.131
 are-0.128
Token little
Token ID1310
Feature activation+7.08e-01

Pos logit contributions
 planning+0.092
reetings+0.086
 solution+0.084
joy+0.082
 possible+0.081
Neg logit contributions
s-0.065
nam-0.064
 else-0.062
cream-0.061
 are-0.060
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
s+0.035
 else+0.035
bs+0.033
 are+0.033
nam+0.032
Neg logit contributions
 planning-0.060
reetings-0.059
alis-0.054
 possible-0.053
joy-0.053
Token and
Token ID290
Feature activation-9.00e-01

Pos logit contributions
reetings+1.054
 planning+1.052
alis+0.997
joy+0.956
 possible+0.950
Neg logit contributions
s-0.560
 else-0.555
 are-0.543
bs-0.526
 were-0.507
Token compassion
Token ID15213
Feature activation-5.27e-01

Pos logit contributions
reetings+0.072
 planning+0.071
alis+0.068
joy+0.068
 solution+0.065
Neg logit contributions
 else-0.049
s-0.047
 are-0.047
bs-0.045
 were-0.044
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
 planning+0.046
reetings+0.045
alis+0.042
joy+0.042
 solution+0.041
Neg logit contributions
s-0.026
 else-0.025
 are-0.024
bs-0.023
nam-0.022
Token you
Token ID345
Feature activation+2.94e+00

Pos logit contributions
 else+0.008
s+0.008
 are+0.007
bs+0.007
nam+0.007
Neg logit contributions
 planning-0.014
reetings-0.014
alis-0.013
joy-0.013
 solution-0.013
Token can
Token ID460
Feature activation+2.74e+00

Pos logit contributions
s+0.104
 else+0.101
 are+0.094
bs+0.094
nam+0.093
Neg logit contributions
 planning-0.295
reetings-0.284
joy-0.268
 solution-0.267
alis-0.266
Token'd
Token ID1549
Feature activation-5.78e+00

Pos logit contributions
 planning+0.086
reetings+0.084
 solution+0.078
 possible+0.078
joy+0.077
Neg logit contributions
s-0.040
 else-0.039
nam-0.036
bs-0.036
 are-0.036
Token had
Token ID550
Feature activation-1.12e+00

Pos logit contributions
reetings+0.518
 planning+0.515
alis+0.479
joy+0.470
 possible+0.463
Neg logit contributions
 are-0.262
 else-0.255
s-0.237
 were-0.236
bs-0.232
Token lots
Token ID6041
Feature activation+2.13e+00

Pos logit contributions
reetings+0.115
alis+0.102
ator+0.102
 Fairy+0.100
 planning+0.100
Neg logit contributions
 else-0.046
 both-0.044
 were-0.043
 are-0.041
s-0.039
Token of
Token ID286
Feature activation-8.28e-01

Pos logit contributions
s+0.119
 else+0.118
bs+0.112
nam+0.111
 are+0.111
Neg logit contributions
 planning-0.169
reetings-0.163
 solution-0.150
alis-0.150
joy-0.149
Token fun
Token ID1257
Feature activation-9.88e-01

Pos logit contributions
 planning+0.070
reetings+0.070
alis+0.065
 possible+0.063
 solution+0.063
Neg logit contributions
 else-0.041
s-0.039
 are-0.039
nam-0.038
bs-0.038
Token planning
Token ID5410
Feature activation-1.21e+01

Pos logit contributions
reetings+0.076
 planning+0.075
alis+0.071
 solution+0.069
 possible+0.068
Neg logit contributions
s-0.057
 else-0.057
 are-0.054
bs-0.052
nam-0.052
Token all
Token ID477
Feature activation+2.28e+00

Pos logit contributions
reetings+1.084
 planning+1.050
alis+1.018
 Fairy+0.955
 solution+0.955
Neg logit contributions
 else-0.570
s-0.539
 are-0.537
nam-0.512
 were-0.496
Token the
Token ID262
Feature activation-7.61e-01

Pos logit contributions
s+0.083
 are+0.081
 else+0.078
 were+0.070
 both+0.060
Neg logit contributions
reetings-0.235
alis-0.230
 planning-0.221
brance-0.218
joy-0.215
Token way
Token ID835
Feature activation-3.13e+00

Pos logit contributions
reetings+0.063
 planning+0.062
alis+0.060
joy+0.058
 Fairy+0.058
Neg logit contributions
 else-0.039
 are-0.038
s-0.037
 trash-0.035
nam-0.035
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
 planning+0.257
reetings+0.255
alis+0.237
 example+0.232
joy+0.230
Neg logit contributions
s-0.176
 else-0.165
 are-0.156
bs-0.155
nam-0.152
Token<|endoftext|>
Token ID50256
Feature activation+2.93e+00

Pos logit contributions
s+0.001
 are+0.001
 else+0.001
bs+0.001
 were+0.001
Neg logit contributions
 planning-0.003
reetings-0.003
 possible-0.003
alis-0.003
 solution-0.003
Tokenpa
Token ID8957
Feature activation+3.16e+00

Pos logit contributions
s+0.216
 else+0.206
 are+0.202
nam+0.193
 were+0.190
Neg logit contributions
 planning-0.354
reetings-0.350
alis-0.330
 possible-0.325
 solution-0.324
Token and
Token ID290
Feature activation-8.76e-01

Pos logit contributions
s+0.134
 else+0.124
 are+0.121
 were+0.116
bs+0.115
Neg logit contributions
 planning-0.294
reetings-0.290
 possible-0.277
alis-0.276
 solution-0.273
Token Joe
Token ID5689
Feature activation-2.99e+00

Pos logit contributions
 planning+0.070
reetings+0.061
 spending+0.060
joy+0.058
 example+0.058
Neg logit contributions
s-0.051
 else-0.049
bs-0.047
 tant-0.046
 are-0.043
Token had
Token ID550
Feature activation-1.09e+00

Pos logit contributions
 planning+0.284
reetings+0.269
joy+0.249
 solution+0.246
 addition+0.245
Neg logit contributions
nam-0.126
 else-0.111
 tant-0.107
s-0.100
bs-0.100
Token been
Token ID587
Feature activation-4.13e+00

Pos logit contributions
reetings+0.098
 planning+0.095
alis+0.090
 solution+0.088
 Fairy+0.087
Neg logit contributions
s-0.050
 else-0.049
 are-0.047
nam-0.046
 were-0.046
Token planning
Token ID5410
Feature activation-1.20e+01

Pos logit contributions
reetings+0.345
alis+0.312
 Fairy+0.300
alk+0.299
brance+0.299
Neg logit contributions
 else-0.236
nam-0.225
 both-0.216
s-0.208
 are-0.200
Token for
Token ID329
Feature activation-2.40e+00

Pos logit contributions
reetings+1.031
 planning+0.985
alis+0.967
 Fairy+0.911
joy+0.901
Neg logit contributions
 else-0.608
s-0.592
 are-0.562
nam-0.548
 were-0.537
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
reetings+0.226
alis+0.210
 planning+0.207
 solution+0.200
 Fairy+0.197
Neg logit contributions
 else-0.105
 are-0.097
s-0.097
 both-0.094
 were-0.093
Token long
Token ID890
Feature activation+1.41e+00

Pos logit contributions
 planning+0.088
reetings+0.085
 solution+0.078
alis+0.078
joy+0.077
Neg logit contributions
s-0.065
 else-0.064
nam-0.061
bs-0.061
 are-0.060
Token time
Token ID640
Feature activation-2.84e+00

Pos logit contributions
s+0.095
 else+0.094
bs+0.091
 are+0.091
 tant+0.088
Neg logit contributions
reetings-0.093
 planning-0.090
 possible-0.081
alis-0.080
joy-0.079
Token.
Token ID13
Feature activation+7.80e-02

Pos logit contributions
reetings+0.296
alis+0.279
 planning+0.278
 solution+0.268
 Fairy+0.268
Neg logit contributions
 else-0.096
s-0.095
 are-0.080
bs-0.079
 were-0.077
Token took
Token ID1718
Feature activation-7.26e-02

Pos logit contributions
 planning+0.041
reetings+0.040
joy+0.038
 solution+0.037
 possible+0.037
Neg logit contributions
s-0.018
 else-0.017
bs-0.016
 are-0.016
 were-0.015
Token out
Token ID503
Feature activation+2.56e+00

Pos logit contributions
 planning+0.006
reetings+0.006
 solution+0.006
alis+0.006
joy+0.006
Neg logit contributions
s-0.003
 else-0.003
bs-0.003
 are-0.003
nam-0.003
Token her
Token ID607
Feature activation-1.25e-01

Pos logit contributions
s+0.123
 else+0.121
nam+0.117
bs+0.117
 are+0.115
Neg logit contributions
 planning-0.223
reetings-0.222
alis-0.203
 solution-0.201
 possible-0.200
Token map
Token ID3975
Feature activation+8.48e-01

Pos logit contributions
 planning+0.010
reetings+0.010
 solution+0.009
joy+0.009
 possible+0.009
Neg logit contributions
s-0.006
 else-0.006
nam-0.006
bs-0.006
 are-0.006
Token,
Token ID11
Feature activation+2.04e-01

Pos logit contributions
s+0.041
 else+0.038
bs+0.036
nam+0.036
 are+0.036
Neg logit contributions
 planning-0.076
reetings-0.075
alis-0.069
joy-0.068
 example-0.068
Token planning
Token ID5410
Feature activation-1.20e+01

Pos logit contributions
s+0.010
 else+0.010
nam+0.010
bs+0.009
 are+0.009
Neg logit contributions
reetings-0.018
 planning-0.017
alis-0.016
 solution-0.016
 possible-0.016
Token out
Token ID503
Feature activation+2.48e+00

Pos logit contributions
reetings+1.150
 planning+1.101
alis+1.074
joy+1.011
 solution+1.002
Neg logit contributions
 else-0.514
 are-0.477
s-0.475
nam-0.460
bs-0.440
Token the
Token ID262
Feature activation-7.48e-01

Pos logit contributions
 else+0.113
s+0.106
 are+0.103
bs+0.101
nam+0.100
Neg logit contributions
reetings-0.223
 planning-0.219
alis-0.213
 solution-0.201
brance-0.199
Token next
Token ID1306
Feature activation-3.11e+00

Pos logit contributions
 planning+0.050
reetings+0.048
alis+0.043
joy+0.043
 solution+0.043
Neg logit contributions
s-0.050
 else-0.050
nam-0.049
 are-0.048
bs-0.048
Token place
Token ID1295
Feature activation-4.30e+00

Pos logit contributions
reetings+0.218
 planning+0.217
joy+0.191
alis+0.190
 possible+0.187
Neg logit contributions
 else-0.198
s-0.196
 are-0.182
bs-0.179
nam-0.178
Token to
Token ID284
Feature activation-9.71e-01

Pos logit contributions
reetings+0.424
 planning+0.411
alis+0.405
joy+0.389
 example+0.387
Neg logit contributions
s-0.175
 else-0.168
 are-0.151
 were-0.146
bs-0.141
Token book
Token ID1492
Feature activation+1.17e+00

Pos logit contributions
 planning+0.166
reetings+0.161
joy+0.149
alis+0.149
 solution+0.148
Neg logit contributions
s-0.078
 else-0.078
nam-0.072
 are-0.072
bs-0.072
Token.
Token ID13
Feature activation-5.03e-02

Pos logit contributions
s+0.049
 else+0.048
 are+0.044
bs+0.043
nam+0.043
Neg logit contributions
 planning-0.111
reetings-0.108
joy-0.101
alis-0.100
 example-0.100
Token Even
Token ID3412
Feature activation-2.11e+00

Pos logit contributions
 planning+0.004
reetings+0.004
 possible+0.004
 solution+0.004
alis+0.004
Neg logit contributions
s-0.002
 else-0.002
bs-0.002
nam-0.002
 are-0.002
Token after
Token ID706
Feature activation+3.68e+00

Pos logit contributions
 planning+0.169
reetings+0.166
alis+0.152
 solution+0.151
joy+0.150
Neg logit contributions
 else-0.117
s-0.116
nam-0.112
bs-0.109
 are-0.107
Token looking
Token ID2045
Feature activation-5.11e+00

Pos logit contributions
 else+0.177
s+0.171
nam+0.170
 are+0.167
 were+0.166
Neg logit contributions
reetings-0.330
 planning-0.303
 solution-0.301
alis-0.296
 possible-0.282
Token everywhere
Token ID8347
Feature activation+8.86e+00

Pos logit contributions
 planning+0.429
reetings+0.400
joy+0.399
 solution+0.388
 example+0.382
Neg logit contributions
nam-0.245
cream-0.245
 else-0.227
s-0.223
w-0.219
Token,
Token ID11
Feature activation+8.61e-02

Pos logit contributions
 else+0.409
s+0.406
 are+0.380
bs+0.365
 were+0.357
Neg logit contributions
 planning-0.802
reetings-0.781
alis-0.740
 example-0.724
joy-0.721
Token she
Token ID673
Feature activation-6.70e-01

Pos logit contributions
s+0.004
 else+0.004
nam+0.004
 are+0.004
bs+0.004
Neg logit contributions
 planning-0.008
reetings-0.008
 solution-0.007
alis-0.007
 possible-0.007
Token never
Token ID1239
Feature activation-2.23e-01

Pos logit contributions
 planning+0.063
reetings+0.061
 solution+0.057
 possible+0.057
alis+0.056
Neg logit contributions
s-0.028
 else-0.028
nam-0.026
bs-0.026
 are-0.026
Token got
Token ID1392
Feature activation+2.69e+00

Pos logit contributions
 planning+0.021
reetings+0.020
alis+0.019
 solution+0.019
 possible+0.019
Neg logit contributions
 else-0.009
s-0.009
 are-0.009
bs-0.009
nam-0.009
Token it
Token ID340
Feature activation-3.30e-01

Pos logit contributions
 else+0.128
s+0.124
nam+0.120
bs+0.114
 are+0.112
Neg logit contributions
reetings-0.240
 planning-0.240
alis-0.220
 example-0.216
 solution-0.214
Token,
Token ID11
Feature activation+3.76e-01

Pos logit contributions
 planning+0.277
reetings+0.270
 possible+0.253
 solution+0.252
alis+0.250
Neg logit contributions
s-0.113
 else-0.104
 are-0.100
bs-0.098
nam-0.092
Token let
Token ID1309
Feature activation-1.31e+00

Pos logit contributions
 else+0.016
s+0.016
 are+0.015
bs+0.015
nam+0.014
Neg logit contributions
 planning-0.035
reetings-0.034
joy-0.032
alis-0.032
 solution-0.032
Token's
Token ID338
Feature activation+4.66e-01

Pos logit contributions
 planning+0.116
reetings+0.111
 solution+0.105
alis+0.104
 example+0.103
Neg logit contributions
nam-0.060
 else-0.058
s-0.058
bs-0.056
 are-0.053
Token play
Token ID711
Feature activation+1.27e+00

Pos logit contributions
s+0.016
 else+0.016
 are+0.015
nam+0.014
bs+0.014
Neg logit contributions
 planning-0.047
reetings-0.046
joy-0.043
 solution-0.043
 possible-0.043
Token something
Token ID1223
Feature activation+6.09e-01

Pos logit contributions
 else+0.061
s+0.061
 are+0.059
bs+0.053
 were+0.052
Neg logit contributions
 planning-0.116
reetings-0.113
 possible-0.106
 solution-0.103
joy-0.103
Token else
Token ID2073
Feature activation+8.73e+00

Pos logit contributions
s+0.006
 else+0.005
nam+0.004
bs+0.004
 are+0.004
Neg logit contributions
 planning-0.076
reetings-0.074
 solution-0.071
 possible-0.071
alis-0.070
Token."
Token ID526
Feature activation-1.36e+00

Pos logit contributions
 else+0.447
s+0.446
bs+0.393
 are+0.393
nam+0.385
Neg logit contributions
 planning-0.769
reetings-0.750
joy-0.699
alis-0.699
 solution-0.687
Token
Token ID198
Feature activation+4.75e-01

Pos logit contributions
 planning+0.110
reetings+0.105
joy+0.098
 solution+0.097
alis+0.096
Neg logit contributions
 else-0.072
s-0.071
nam-0.071
bs-0.068
 are-0.067
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
nam+0.029
 else+0.027
bs+0.027
s+0.026
 tant+0.026
Neg logit contributions
 planning-0.035
reetings-0.034
 solution-0.031
joy-0.031
 example-0.030
TokenThe
Token ID464
Feature activation+1.07e+00

Pos logit contributions
s+0.022
 else+0.021
nam+0.021
bs+0.021
 are+0.020
Neg logit contributions
 planning-0.043
reetings-0.041
 solution-0.039
joy-0.038
 possible-0.038
Token oct
Token ID19318
Feature activation+1.69e+00

Pos logit contributions
bs+0.060
s+0.060
 else+0.060
nam+0.059
 are+0.055
Neg logit contributions
 planning-0.085
reetings-0.080
 solution-0.075
 example-0.075
joy-0.074
Token�
Token ID241
Feature activation-1.93e+00

Pos logit contributions
 planning+0.025
reetings+0.025
joy+0.024
 solution+0.023
 possible+0.022
Neg logit contributions
s-0.019
 both-0.019
 else-0.018
bs-0.018
 are-0.018
TokenLet
Token ID5756
Feature activation-1.02e+00

Pos logit contributions
 planning+0.179
reetings+0.173
 solution+0.163
alis+0.162
 possible+0.158
Neg logit contributions
s-0.082
 else-0.081
bs-0.075
 are-0.074
nam-0.074
Tokenâ
Token ID22940
Feature activation-1.81e+00

Pos logit contributions
 planning+0.085
reetings+0.082
 solution+0.077
 possible+0.075
alis+0.075
Neg logit contributions
nam-0.053
s-0.052
 else-0.052
bs-0.050
 are-0.048
Token€
Token ID26391
Feature activation-4.59e+00

Pos logit contributions
 planning+0.158
reetings+0.144
joy+0.143
 possible+0.139
 solution+0.137
Neg logit contributions
 are-0.089
s-0.086
 else-0.085
bs-0.084
cream-0.078
Tokenâ„¢
Token ID8151
Feature activation+1.93e-01

Pos logit contributions
 planning+0.348
reetings+0.332
 solution+0.305
 possible+0.303
alis+0.303
Neg logit contributions
s-0.275
 else-0.270
nam-0.260
bs-0.259
 are-0.256
Tokens
Token ID82
Feature activation+6.90e+00

Pos logit contributions
nam+0.010
 else+0.009
 trash+0.009
bs+0.009
s+0.009
Neg logit contributions
 planning-0.016
reetings-0.015
joy-0.014
 possible-0.014
 solution-0.014
Token fight
Token ID1907
Feature activation+4.09e-01

Pos logit contributions
s+0.302
 else+0.297
nam+0.281
 are+0.280
bs+0.280
Neg logit contributions
 planning-0.630
reetings-0.608
joy-0.566
 solution-0.565
 possible-0.563
Token to
Token ID284
Feature activation-8.36e-01

Pos logit contributions
s+0.021
 else+0.020
 are+0.019
nam+0.019
bs+0.019
Neg logit contributions
 planning-0.035
reetings-0.034
alis-0.032
joy-0.032
 solution-0.032
Token see
Token ID766
Feature activation+1.43e+00

Pos logit contributions
 planning+0.070
reetings+0.066
 solution+0.062
 possible+0.062
 example+0.062
Neg logit contributions
s-0.043
 else-0.041
bs-0.041
nam-0.041
 are-0.039
Token who
Token ID508
Feature activation+4.86e+00

Pos logit contributions
s+0.071
nam+0.069
bs+0.068
 tant+0.067
 else+0.066
Neg logit contributions
 planning-0.117
reetings-0.113
 possible-0.107
 example-0.105
 solution-0.104
Token can
Token ID460
Feature activation+2.85e+00

Pos logit contributions
nam+0.188
s+0.175
 else+0.169
 tant+0.167
bs+0.165
Neg logit contributions
 planning-0.458
reetings-0.445
 solution-0.410
 possible-0.410
alis-0.405
Token their
Token ID511
Feature activation+6.32e-01

Pos logit contributions
nam+0.018
 else+0.017
s+0.017
bs+0.016
 are+0.016
Neg logit contributions
 planning-0.049
reetings-0.048
 possible-0.044
 solution-0.044
joy-0.044
Token feathers
Token ID27400
Feature activation+1.08e+00

Pos logit contributions
nam+0.039
 else+0.038
s+0.038
bs+0.037
cream+0.036
Neg logit contributions
 planning-0.046
joy-0.041
reetings-0.041
 solution-0.041
 example-0.040
Token and
Token ID290
Feature activation-8.29e-01

Pos logit contributions
nam+0.034
 else+0.034
s+0.031
bs+0.030
 tant+0.029
Neg logit contributions
 planning-0.109
reetings-0.104
 solution-0.099
 possible-0.098
alis-0.098
Token the
Token ID262
Feature activation-6.75e-01

Pos logit contributions
 planning+0.069
reetings+0.064
 solution+0.061
joy+0.060
 possible+0.060
Neg logit contributions
s-0.042
 else-0.042
bs-0.040
nam-0.039
 are-0.038
Token squirrel
Token ID33039
Feature activation+4.06e-01

Pos logit contributions
 planning+0.053
reetings+0.049
 solution+0.047
joy+0.046
alis+0.046
Neg logit contributions
s-0.039
 else-0.038
bs-0.037
nam-0.037
 are-0.035
Tokens
Token ID82
Feature activation+6.87e+00

Pos logit contributions
nam+0.013
 tant+0.011
 trash+0.011
 else+0.011
 Sorry+0.010
Neg logit contributions
 planning-0.041
reetings-0.038
 solution-0.037
alis-0.036
 Fairy-0.036
Token build
Token ID1382
Feature activation-3.40e+00

Pos logit contributions
nam+0.247
s+0.245
 else+0.243
bs+0.222
 tant+0.215
Neg logit contributions
 planning-0.676
reetings-0.633
 solution-0.611
joy-0.605
 possible-0.596
Token nests
Token ID44382
Feature activation+5.83e-01

Pos logit contributions
 planning+0.301
reetings+0.297
alis+0.269
joy+0.268
 solution+0.267
Neg logit contributions
s-0.161
 else-0.159
 are-0.153
bs-0.148
nam-0.147
Token in
Token ID287
Feature activation+2.71e+00

Pos logit contributions
s+0.023
 else+0.021
 are+0.020
bs+0.019
 were+0.019
Neg logit contributions
reetings-0.058
 planning-0.058
joy-0.054
 example-0.053
alis-0.053
Token the
Token ID262
Feature activation-6.55e-01

Pos logit contributions
 are+0.112
 else+0.107
s+0.106
 were+0.105
nam+0.099
Neg logit contributions
reetings-0.264
 planning-0.253
alis-0.239
ross-0.228
joy-0.228
Token trees
Token ID7150
Feature activation+6.81e-01

Pos logit contributions
 planning+0.039
reetings+0.036
alis+0.035
 solution+0.034
joy+0.034
Neg logit contributions
s-0.049
nam-0.049
 else-0.048
bs-0.048
 tant-0.046
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
s+0.105
 else+0.103
nam+0.101
 are+0.099
bs+0.098
Neg logit contributions
 planning-0.183
reetings-0.182
alis-0.168
 solution-0.165
joy-0.163
Token her
Token ID607
Feature activation-2.14e-01

Pos logit contributions
s+0.007
 else+0.007
nam+0.007
bs+0.007
 are+0.007
Neg logit contributions
 planning-0.013
reetings-0.013
alis-0.012
 solution-0.012
joy-0.011
Token mom
Token ID1995
Feature activation+5.04e+00

Pos logit contributions
 planning+0.020
reetings+0.018
 solution+0.018
joy+0.018
 example+0.018
Neg logit contributions
s-0.009
 else-0.009
nam-0.009
bs-0.009
 are-0.009
Token lit
Token ID6578
Feature activation-2.82e+00

Pos logit contributions
s+0.226
 else+0.206
 were+0.204
 are+0.199
bs+0.187
Neg logit contributions
reetings-0.462
 planning-0.456
 possible-0.447
alis-0.439
joy-0.434
Token the
Token ID262
Feature activation-7.89e-01

Pos logit contributions
reetings+0.255
 planning+0.252
 solution+0.228
alis+0.228
joy+0.226
Neg logit contributions
s-0.130
 else-0.127
nam-0.124
 are-0.115
bs-0.113
Token candles
Token ID32268
Feature activation+7.03e+00

Pos logit contributions
 planning+0.060
reetings+0.056
 solution+0.053
joy+0.052
alis+0.052
Neg logit contributions
s-0.048
 else-0.046
bs-0.046
nam-0.044
 are-0.044
Token on
Token ID319
Feature activation+1.69e+00

Pos logit contributions
s+0.324
 else+0.311
nam+0.295
 are+0.294
bs+0.293
Neg logit contributions
 planning-0.632
reetings-0.614
joy-0.568
 solution-0.565
alis-0.564
Token the
Token ID262
Feature activation-8.02e-01

Pos logit contributions
s+0.070
 else+0.069
 are+0.068
nam+0.067
 were+0.064
Neg logit contributions
reetings-0.158
 planning-0.158
alis-0.143
 possible-0.142
joy-0.141
Token cake
Token ID12187
Feature activation+2.52e+00

Pos logit contributions
 planning+0.059
reetings+0.053
 solution+0.053
 Fairy+0.051
 possible+0.051
Neg logit contributions
s-0.050
bs-0.049
 else-0.047
nam-0.046
 are-0.045
Token and
Token ID290
Feature activation-9.48e-01

Pos logit contributions
s+0.117
 else+0.116
nam+0.113
bs+0.111
 are+0.108
Neg logit contributions
 planning-0.221
reetings-0.210
 solution-0.199
 possible-0.196
alis-0.196
Token said
Token ID531
Feature activation+7.30e-01

Pos logit contributions
 planning+0.085
reetings+0.084
alis+0.077
 solution+0.077
joy+0.077
Neg logit contributions
s-0.042
 else-0.042
nam-0.040
bs-0.039
 are-0.039
Token carried
Token ID5281
Feature activation-2.90e+00

Pos logit contributions
 planning+0.226
reetings+0.226
 possible+0.212
joy+0.208
 solution+0.207
Neg logit contributions
s-0.085
 else-0.081
bs-0.073
 are-0.072
nam-0.072
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
 planning+0.228
reetings+0.213
 solution+0.200
 example+0.200
alis+0.200
Neg logit contributions
s-0.161
 else-0.161
nam-0.152
 are-0.151
bs-0.150
Token basket
Token ID7988
Feature activation+5.99e-01

Pos logit contributions
 planning+0.093
 solution+0.090
reetings+0.085
 addition+0.084
alis+0.082
Neg logit contributions
cream-0.063
 are-0.061
nam-0.061
 else-0.058
bs-0.058
Token and
Token ID290
Feature activation-9.50e-01

Pos logit contributions
s+0.032
 else+0.032
nam+0.031
bs+0.031
 are+0.030
Neg logit contributions
 planning-0.048
reetings-0.046
 solution-0.043
 possible-0.042
joy-0.042
Token her
Token ID607
Feature activation-2.49e-01

Pos logit contributions
 planning+0.087
reetings+0.079
 solution+0.077
joy+0.076
alis+0.076
Neg logit contributions
s-0.040
 else-0.040
 are-0.038
nam-0.038
bs-0.037
Token grandma
Token ID49890
Feature activation+4.07e+00

Pos logit contributions
 planning+0.023
 solution+0.021
joy+0.021
 addition+0.020
 example+0.020
Neg logit contributions
s-0.010
bs-0.010
 else-0.010
w-0.009
 are-0.009
Token carried
Token ID5281
Feature activation-2.91e+00

Pos logit contributions
s+0.196
 else+0.189
bs+0.177
 are+0.175
nam+0.173
Neg logit contributions
reetings-0.363
 planning-0.362
 possible-0.332
alis-0.332
joy-0.331
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
 planning+0.232
reetings+0.219
 solution+0.204
alis+0.203
joy+0.203
Neg logit contributions
s-0.160
 else-0.158
nam-0.152
bs-0.150
 are-0.150
Token big
Token ID1263
Feature activation+3.33e+00

Pos logit contributions
 planning+0.094
 solution+0.091
 addition+0.084
reetings+0.083
 possible+0.081
Neg logit contributions
cream-0.063
 are-0.063
nam-0.062
bs-0.061
w-0.058
Token bucket
Token ID19236
Feature activation+2.02e+00

Pos logit contributions
 else+0.170
s+0.168
nam+0.167
bs+0.160
 are+0.157
Neg logit contributions
 planning-0.274
reetings-0.253
 solution-0.250
joy-0.244
alis-0.243
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
 else+0.117
s+0.117
nam+0.115
bs+0.112
 are+0.111
Neg logit contributions
 planning-0.153
reetings-0.144
 solution-0.135
 possible-0.134
alis-0.133
Token need
Token ID761
Feature activation+1.46e+00

Pos logit contributions
s+0.099
 else+0.097
 are+0.094
bs+0.090
 were+0.084
Neg logit contributions
 planning-0.223
reetings-0.215
joy-0.205
alis-0.205
 possible-0.202
Token something
Token ID1223
Feature activation+5.75e-01

Pos logit contributions
 else+0.078
s+0.075
 are+0.073
bs+0.070
nam+0.068
Neg logit contributions
 planning-0.121
reetings-0.120
alis-0.111
joy-0.110
 solution-0.108
Token new
Token ID649
Feature activation-4.09e+00

Pos logit contributions
s+0.036
nam+0.035
 else+0.035
bs+0.034
 are+0.034
Neg logit contributions
 planning-0.041
reetings-0.039
 solution-0.036
 possible-0.036
 example-0.035
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 planning+0.343
reetings+0.342
alis+0.322
joy+0.311
 solution+0.309
Neg logit contributions
 else-0.211
s-0.206
 are-0.199
bs-0.189
 were-0.179
Token We
Token ID775
Feature activation-4.56e-01

Pos logit contributions
 else+0.012
s+0.011
 are+0.011
nam+0.011
bs+0.011
Neg logit contributions
 planning-0.021
reetings-0.020
 solution-0.019
 possible-0.019
joy-0.019
Token can
Token ID460
Feature activation+2.95e+00

Pos logit contributions
 planning+0.043
reetings+0.041
alis+0.039
 solution+0.039
 possible+0.038
Neg logit contributions
nam-0.018
s-0.017
cream-0.016
 else-0.016
 tant-0.015
Token pick
Token ID2298
Feature activation+6.38e-01

Pos logit contributions
s+0.150
nam+0.150
 else+0.146
bs+0.145
 tant+0.141
Neg logit contributions
 planning-0.245
reetings-0.232
 solution-0.219
alis-0.217
 possible-0.216
Token a
Token ID257
Feature activation-9.35e-01

Pos logit contributions
s+0.033
 else+0.033
 are+0.031
nam+0.031
bs+0.031
Neg logit contributions
 planning-0.054
reetings-0.052
 solution-0.048
joy-0.047
alis-0.047
Token new
Token ID649
Feature activation-4.06e+00

Pos logit contributions
reetings+0.076
 planning+0.076
joy+0.070
ross+0.065
alis+0.065
Neg logit contributions
 else-0.054
s-0.054
 trash-0.048
 are-0.048
bs-0.047
Token game
Token ID983
Feature activation-2.41e+00

Pos logit contributions
reetings+0.361
 planning+0.343
joy+0.310
ross+0.309
 known+0.305
Neg logit contributions
s-0.222
 are-0.199
 else-0.198
 trash-0.183
 tant-0.177
Token,
Token ID11
Feature activation+3.95e-01

Pos logit contributions
 planning+0.216
reetings+0.213
joy+0.205
alis+0.198
 solution+0.196
Neg logit contributions
 else-0.118
s-0.114
 are-0.108
bs-0.100
 were-0.096
Token and
Token ID290
Feature activation-6.55e-01

Pos logit contributions
s+0.024
 else+0.023
nam+0.022
bs+0.022
 are+0.022
Neg logit contributions
 planning-0.038
reetings-0.036
 solution-0.034
joy-0.033
 possible-0.033
Token he
Token ID339
Feature activation+2.90e+00

Pos logit contributions
 planning+0.056
reetings+0.054
 solution+0.050
 possible+0.050
joy+0.050
Neg logit contributions
s-0.032
 else-0.032
nam-0.030
bs-0.030
 are-0.030
Token wanted
Token ID2227
Feature activation+1.59e+00

Pos logit contributions
s+0.120
nam+0.120
 else+0.117
bs+0.112
 tant+0.108
Neg logit contributions
 planning-0.267
reetings-0.255
 solution-0.239
 possible-0.238
alis-0.235
Token to
Token ID284
Feature activation-8.19e-01

Pos logit contributions
 else+0.091
s+0.089
nam+0.084
 are+0.083
bs+0.082
Neg logit contributions
 planning-0.126
reetings-0.124
alis-0.114
joy-0.112
 solution-0.111
Token have
Token ID423
Feature activation-2.84e+00

Pos logit contributions
 planning+0.071
reetings+0.068
 solution+0.064
joy+0.063
alis+0.063
Neg logit contributions
s-0.040
 else-0.039
nam-0.037
bs-0.037
 are-0.037
Token lots
Token ID6041
Feature activation+2.23e+00

Pos logit contributions
 planning+0.256
reetings+0.255
alis+0.233
 solution+0.231
 Fairy+0.227
Neg logit contributions
 else-0.136
s-0.127
 are-0.121
nam-0.121
bs-0.118
Token of
Token ID286
Feature activation-7.56e-01

Pos logit contributions
s+0.132
nam+0.130
 else+0.128
bs+0.126
 are+0.124
Neg logit contributions
 planning-0.167
reetings-0.159
 possible-0.147
 solution-0.146
joy-0.146
Token fun
Token ID1257
Feature activation-9.32e-01

Pos logit contributions
 planning+0.064
reetings+0.063
alis+0.058
 solution+0.057
joy+0.057
Neg logit contributions
 else-0.039
s-0.038
nam-0.037
 are-0.036
bs-0.036
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
 planning+0.088
reetings+0.085
 solution+0.079
alis+0.079
joy+0.079
Neg logit contributions
s-0.039
 else-0.038
nam-0.036
bs-0.035
 are-0.035
Token them
Token ID606
Feature activation-3.11e+00

Pos logit contributions
s+0.057
 else+0.054
bs+0.054
nam+0.053
 are+0.052
Neg logit contributions
 planning-0.122
reetings-0.116
 possible-0.110
joy-0.110
 solution-0.109
Token.
Token ID13
Feature activation+4.40e-02

Pos logit contributions
 planning+0.306
reetings+0.299
alis+0.280
 solution+0.277
joy+0.277
Neg logit contributions
s-0.121
 else-0.117
 are-0.107
nam-0.106
bs-0.104
Token had
Token ID550
Feature activation-1.18e+00

Pos logit contributions
 planning+0.086
reetings+0.081
 possible+0.076
 solution+0.076
alis+0.076
Neg logit contributions
s-0.046
 else-0.045
bs-0.043
nam-0.043
 are-0.043
Token long
Token ID890
Feature activation+1.31e+00

Pos logit contributions
 planning+0.105
reetings+0.101
 solution+0.094
joy+0.093
alis+0.093
Neg logit contributions
s-0.055
 else-0.053
nam-0.051
bs-0.051
 are-0.050
Token legs
Token ID7405
Feature activation+3.96e+00

Pos logit contributions
s+0.059
 else+0.054
 are+0.053
bs+0.053
nam+0.050
Neg logit contributions
 planning-0.121
reetings-0.118
 possible-0.110
 solution-0.110
alis-0.109
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
s+0.189
 else+0.182
bs+0.173
nam+0.173
 are+0.172
Neg logit contributions
 planning-0.350
reetings-0.338
 solution-0.314
joy-0.313
alis-0.311
Token "
Token ID366
Feature activation-4.87e+00

Pos logit contributions
 planning+0.003
reetings+0.003
 possible+0.003
 solution+0.002
 example+0.002
Neg logit contributions
s-0.001
 else-0.001
bs-0.001
nam-0.001
 are-0.001
TokenLook
Token ID8567
Feature activation+8.17e-01

Pos logit contributions
 planning+0.387
reetings+0.385
alis+0.359
 Fairy+0.352
joy+0.351
Neg logit contributions
 else-0.244
s-0.234
bs-0.233
 are-0.230
nam-0.222
Token,
Token ID11
Feature activation+7.20e-02

Pos logit contributions
 tant+0.049
nam+0.049
 trash+0.048
cream+0.047
s+0.046
Neg logit contributions
 planning-0.051
reetings-0.049
 solution-0.047
 Fairy-0.046
joy-0.046
Token mom
Token ID1995
Feature activation+4.98e+00

Pos logit contributions
s+0.003
bs+0.003
 else+0.003
 tant+0.003
nam+0.003
Neg logit contributions
 planning-0.006
reetings-0.006
 Fairy-0.006
 solution-0.006
 possible-0.006
Tokenmy
Token ID1820
Feature activation+5.01e-01

Pos logit contributions
s+0.280
 else+0.266
 are+0.259
nam+0.247
bs+0.244
Neg logit contributions
 planning-0.407
reetings-0.387
joy-0.365
alis-0.364
 solution-0.363
Token!
Token ID0
Feature activation-1.12e+00

Pos logit contributions
s+0.025
 else+0.024
 are+0.023
bs+0.023
nam+0.023
Neg logit contributions
 planning-0.043
reetings-0.042
alis-0.039
 solution-0.038
joy-0.038
Token The
Token ID383
Feature activation+7.55e-02

Pos logit contributions
 planning+0.090
reetings+0.085
 solution+0.080
 possible+0.079
joy+0.079
Neg logit contributions
s-0.062
 else-0.060
nam-0.058
 are-0.058
bs-0.057
Token is
Token ID318
Feature activation-2.78e-01

Pos logit contributions
 planning+0.462
reetings+0.444
joy+0.424
alis+0.422
 solution+0.416
Neg logit contributions
 else-0.192
s-0.190
 are-0.177
bs-0.172
nam-0.156
Token perfect
Token ID2818
Feature activation-2.98e+00

Pos logit contributions
reetings+0.022
joy+0.021
alis+0.021
 planning+0.020
brance+0.020
Neg logit contributions
 else-0.016
 both-0.015
 are-0.014
nam-0.014
 trash-0.014
Token."
Token ID526
Feature activation-1.66e+00

Pos logit contributions
alis+0.271
reetings+0.270
 planning+0.263
brance+0.258
ydia+0.252
Neg logit contributions
 are-0.116
s-0.116
 else-0.112
 both-0.102
 were-0.098
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
 planning+0.147
reetings+0.143
 possible+0.133
 solution+0.133
alis+0.132
Neg logit contributions
s-0.081
 else-0.077
nam-0.073
bs-0.073
 are-0.072
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
s+0.008
 else+0.008
nam+0.007
bs+0.007
 are+0.007
Neg logit contributions
 planning-0.014
reetings-0.013
 solution-0.012
alis-0.012
 possible-0.012
TokenThe
Token ID464
Feature activation+7.53e-01

Pos logit contributions
 else+0.008
s+0.008
nam+0.008
bs+0.008
 are+0.007
Neg logit contributions
 planning-0.015
reetings-0.015
 solution-0.014
joy-0.014
alis-0.014
Token little
Token ID1310
Feature activation+6.40e-01

Pos logit contributions
 else+0.046
s+0.044
nam+0.044
 are+0.043
bs+0.042
Neg logit contributions
 planning-0.056
reetings-0.055
alis-0.050
joy-0.049
 possible-0.049
Token boy
Token ID2933
Feature activation+1.84e+00

Pos logit contributions
s+0.029
 else+0.028
nam+0.027
 are+0.026
bs+0.026
Neg logit contributions
 planning-0.058
reetings-0.058
alis-0.052
 possible-0.052
brance-0.051
Token smiled
Token ID13541
Feature activation-1.26e+00

Pos logit contributions
s+0.077
 else+0.075
 are+0.068
bs+0.068
nam+0.067
Neg logit contributions
 planning-0.174
reetings-0.171
 possible-0.160
 solution-0.158
alis-0.158
Token at
Token ID379
Feature activation+4.10e+00

Pos logit contributions
 planning+0.115
reetings+0.113
alis+0.105
 possible+0.102
 solution+0.102
Neg logit contributions
s-0.059
 else-0.054
nam-0.051
bs-0.051
 are-0.051
Token the
Token ID262
Feature activation-8.30e-01

Pos logit contributions
 else+0.162
s+0.160
nam+0.159
 are+0.153
bs+0.148
Neg logit contributions
 planning-0.391
reetings-0.382
alis-0.355
 solution-0.354
 possible-0.346
Token back
Token ID736
Feature activation-6.71e-01

Pos logit contributions
s+0.102
bs+0.092
 else+0.087
 both+0.084
 are+0.082
Neg logit contributions
reetings-0.212
 planning-0.192
 possible-0.188
 Fairy-0.183
alis-0.183
Token to
Token ID284
Feature activation-8.43e-01

Pos logit contributions
reetings+0.062
alis+0.057
 planning+0.055
 possible+0.054
ross+0.053
Neg logit contributions
s-0.035
 else-0.030
bs-0.030
 both-0.028
 are-0.028
Token play
Token ID711
Feature activation+1.16e+00

Pos logit contributions
reetings+0.083
alis+0.073
inger+0.073
iddle+0.071
ydia+0.068
Neg logit contributions
 are-0.038
 both-0.036
 else-0.036
 him-0.035
 were-0.034
Token with
Token ID351
Feature activation+1.43e+00

Pos logit contributions
 else+0.047
 are+0.042
s+0.042
 were+0.040
bs+0.039
Neg logit contributions
reetings-0.117
 planning-0.111
 possible-0.105
alis-0.102
inger-0.100
Token her
Token ID607
Feature activation-7.28e-02

Pos logit contributions
 else+0.070
s+0.069
nam+0.066
bs+0.066
 are+0.065
Neg logit contributions
 planning-0.125
reetings-0.124
alis-0.112
 solution-0.111
 possible-0.111
Token rope
Token ID17182
Feature activation-7.37e-01

Pos logit contributions
reetings+0.006
 planning+0.006
alis+0.005
 possible+0.005
 example+0.005
Neg logit contributions
s-0.004
 else-0.004
nam-0.004
bs-0.004
 are-0.004
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
reetings+0.071
 planning+0.070
 example+0.064
joy+0.063
alis+0.063
Neg logit contributions
s-0.031
 else-0.029
 are-0.027
bs-0.026
 were-0.026
Token<|endoftext|>
Token ID50256
Feature activation+3.05e+00

Pos logit contributions
s+0.008
bs+0.007
 else+0.007
 are+0.007
nam+0.007
Neg logit contributions
 planning-0.015
reetings-0.014
 solution-0.013
alis-0.013
 possible-0.013
TokenOne
Token ID3198
Feature activation-6.70e-01

Pos logit contributions
s+0.124
 are+0.110
 were+0.109
 else+0.101
 both+0.085
Neg logit contributions
 planning-0.300
reetings-0.294
 example-0.289
 possible-0.286
alis-0.282
Token summer
Token ID3931
Feature activation-2.91e-01

Pos logit contributions
 planning+0.046
reetings+0.044
alis+0.040
 solution+0.040
joy+0.040
Neg logit contributions
s-0.045
 else-0.044
bs-0.042
 are-0.042
nam-0.042
Token day
Token ID1110
Feature activation-4.96e-01

Pos logit contributions
reetings+0.026
 planning+0.025
alis+0.025
 possible+0.024
 example+0.024
Neg logit contributions
s-0.015
 were-0.013
 are-0.013
 else-0.012
 both-0.011
Token and
Token ID290
Feature activation-9.57e-01

Pos logit contributions
s+0.245
 else+0.237
 are+0.226
bs+0.221
nam+0.217
Neg logit contributions
 planning-0.432
reetings-0.417
joy-0.395
 example-0.390
 possible-0.390
Token dad
Token ID9955
Feature activation+6.22e+00

Pos logit contributions
 planning+0.101
reetings+0.095
joy+0.093
 solution+0.092
 possible+0.091
Neg logit contributions
s-0.029
 else-0.027
bs-0.025
 are-0.023
nam-0.023
Token got
Token ID1392
Feature activation+2.68e+00

Pos logit contributions
s+0.253
nam+0.249
 else+0.248
bs+0.238
 tant+0.225
Neg logit contributions
 planning-0.583
reetings-0.560
 solution-0.522
 possible-0.516
alis-0.513
Token mad
Token ID8805
Feature activation+1.93e+00

Pos logit contributions
s+0.145
nam+0.139
 else+0.138
bs+0.136
 are+0.134
Neg logit contributions
 planning-0.220
reetings-0.201
 possible-0.193
 solution-0.190
joy-0.187
Token at
Token ID379
Feature activation+4.10e+00

Pos logit contributions
s+0.089
 else+0.085
 are+0.081
bs+0.079
nam+0.077
Neg logit contributions
 planning-0.174
reetings-0.168
alis-0.160
joy-0.158
 solution-0.157
Token them
Token ID606
Feature activation-3.20e+00

Pos logit contributions
s+0.195
 else+0.192
nam+0.183
bs+0.180
 are+0.180
Neg logit contributions
 planning-0.359
reetings-0.347
 solution-0.322
alis-0.321
joy-0.321
Token for
Token ID329
Feature activation-2.50e+00

Pos logit contributions
 planning+0.290
reetings+0.281
joy+0.269
alis+0.268
 possible+0.261
Neg logit contributions
s-0.148
 else-0.141
 are-0.140
 were-0.127
bs-0.124
Token losing
Token ID6078
Feature activation-3.31e+00

Pos logit contributions
reetings+0.215
 planning+0.205
alis+0.202
joy+0.198
 solution+0.193
Neg logit contributions
 else-0.124
nam-0.119
 are-0.119
 both-0.113
s-0.109
Token the
Token ID262
Feature activation-8.28e-01

Pos logit contributions
 planning+0.286
reetings+0.274
 solution+0.255
joy+0.253
 possible+0.253
Neg logit contributions
s-0.162
 else-0.159
nam-0.153
bs-0.151
 are-0.149
Token juice
Token ID13135
Feature activation+1.69e+00

Pos logit contributions
 planning+0.074
reetings+0.069
 solution+0.067
joy+0.066
 possible+0.065
Neg logit contributions
s-0.040
 else-0.038
bs-0.037
nam-0.037
cream-0.034
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
s+0.088
 else+0.087
nam+0.085
bs+0.084
 are+0.081
Neg logit contributions
 planning-0.139
reetings-0.133
 solution-0.124
 possible-0.123
alis-0.122
Token best
Token ID1266
Feature activation-2.86e+00

Pos logit contributions
 else+0.283
nam+0.272
 both+0.259
 are+0.250
s+0.250
Neg logit contributions
reetings-0.513
 planning-0.469
alis-0.455
brance-0.441
 Fairy-0.441
Token friends
Token ID2460
Feature activation-2.68e+00

Pos logit contributions
reetings+0.219
 planning+0.218
alis+0.198
 possible+0.195
 solution+0.194
Neg logit contributions
s-0.170
 else-0.168
bs-0.160
 are-0.157
nam-0.155
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
reetings+0.278
 planning+0.274
alis+0.259
 solution+0.257
 possible+0.251
Neg logit contributions
s-0.093
 else-0.090
 are-0.082
 were-0.078
bs-0.075
Token They
Token ID1119
Feature activation-4.22e-01

Pos logit contributions
s+0.012
 else+0.011
 are+0.011
bs+0.011
nam+0.010
Neg logit contributions
 planning-0.027
reetings-0.025
 solution-0.025
 possible-0.024
alis-0.024
Token liked
Token ID8288
Feature activation+3.15e+00

Pos logit contributions
 planning+0.039
reetings+0.037
alis+0.035
 solution+0.035
joy+0.035
Neg logit contributions
s-0.017
 else-0.017
nam-0.016
bs-0.015
 tant-0.015
Token to
Token ID284
Feature activation-8.19e-01

Pos logit contributions
 else+0.155
s+0.150
nam+0.141
 are+0.137
 were+0.136
Neg logit contributions
reetings-0.292
alis-0.265
 planning-0.264
 solution-0.254
 Fairy-0.250
Token play
Token ID711
Feature activation+1.14e+00

Pos logit contributions
 planning+0.085
reetings+0.082
joy+0.078
 solution+0.077
 possible+0.077
Neg logit contributions
s-0.026
 else-0.026
 are-0.025
bs-0.024
nam-0.024
Token in
Token ID287
Feature activation+2.71e+00

Pos logit contributions
 else+0.048
s+0.048
 are+0.047
 were+0.043
 both+0.041
Neg logit contributions
reetings-0.111
 planning-0.109
 possible-0.104
joy-0.100
 solution-0.099
Token the
Token ID262
Feature activation-6.85e-01

Pos logit contributions
 else+0.094
 are+0.093
s+0.090
 were+0.083
nam+0.083
Neg logit contributions
reetings-0.289
 planning-0.273
alis-0.262
 solution-0.252
joy-0.251
Token park
Token ID3952
Feature activation-3.37e+00

Pos logit contributions
 planning+0.047
reetings+0.045
 solution+0.041
joy+0.041
alis+0.041
Neg logit contributions
s-0.046
 else-0.045
nam-0.044
bs-0.044
 are-0.043
Token every
Token ID790
Feature activation+2.41e+00

Pos logit contributions
reetings+0.357
 planning+0.345
alis+0.320
joy+0.318
 solution+0.317
Neg logit contributions
s-0.128
 are-0.118
 else-0.117
 were-0.114
 both-0.104
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 planning+0.377
reetings+0.350
 solution+0.343
joy+0.340
 example+0.339
Neg logit contributions
nam-0.236
 else-0.228
bs-0.225
s-0.223
cream-0.222
Token He
Token ID679
Feature activation+1.07e+00

Pos logit contributions
s+0.004
 else+0.004
bs+0.004
 are+0.004
nam+0.004
Neg logit contributions
 planning-0.012
reetings-0.011
 solution-0.011
 possible-0.011
alis-0.010
Token does
Token ID857
Feature activation+5.33e-02

Pos logit contributions
nam+0.061
bs+0.052
cream+0.052
 else+0.051
s+0.051
Neg logit contributions
 planning-0.084
reetings-0.077
alis-0.076
 example-0.075
 solution-0.074
Token not
Token ID407
Feature activation-1.55e+00

Pos logit contributions
 else+0.003
s+0.003
 are+0.003
bs+0.003
nam+0.003
Neg logit contributions
 planning-0.004
reetings-0.004
alis-0.004
joy-0.004
 solution-0.004
Token see
Token ID766
Feature activation+1.42e+00

Pos logit contributions
 planning+0.120
reetings+0.117
 possible+0.114
 solution+0.112
alis+0.107
Neg logit contributions
s-0.088
nam-0.085
cream-0.083
apy-0.079
 tant-0.076
Token the
Token ID262
Feature activation-6.52e-01

Pos logit contributions
s+0.087
bs+0.081
nam+0.081
w+0.077
apy+0.077
Neg logit contributions
 planning-0.102
 possible-0.096
 example-0.093
joy-0.092
reetings-0.090
Token car
Token ID1097
Feature activation+2.98e+00

Pos logit contributions
 planning+0.052
joy+0.051
 solution+0.050
 importance+0.047
 example+0.046
Neg logit contributions
bs-0.039
apy-0.037
s-0.036
 else-0.036
nam-0.033
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
nam+0.144
s+0.142
bs+0.140
 else+0.139
 tant+0.130
Neg logit contributions
 planning-0.255
reetings-0.240
 solution-0.228
 possible-0.227
alis-0.224
Token The
Token ID383
Feature activation+2.68e-01

Pos logit contributions
s+0.006
 else+0.005
 are+0.005
bs+0.005
nam+0.005
Neg logit contributions
 planning-0.015
reetings-0.014
 solution-0.013
 possible-0.013
alis-0.013
Token car
Token ID1097
Feature activation+2.98e+00

Pos logit contributions
bs+0.014
s+0.014
 else+0.013
cream+0.013
nam+0.013
Neg logit contributions
 planning-0.022
 solution-0.022
joy-0.021
 example-0.021
 importance-0.020
Token is
Token ID318
Feature activation-1.19e-01

Pos logit contributions
nam+0.167
bs+0.150
 else+0.146
cream+0.143
s+0.141
Neg logit contributions
 planning-0.235
reetings-0.218
 solution-0.213
alis-0.211
 example-0.211
Token was
Token ID373
Feature activation-7.94e-01

Pos logit contributions
 planning+0.070
reetings+0.067
 solution+0.062
joy+0.062
 possible+0.062
Neg logit contributions
s-0.037
 else-0.036
nam-0.034
bs-0.034
 are-0.034
Token thankful
Token ID25535
Feature activation-2.82e+00

Pos logit contributions
reetings+0.084
alk+0.076
alis+0.076
brance+0.075
iddles+0.072
Neg logit contributions
 else-0.032
 both-0.031
nam-0.025
 are-0.025
 were-0.024
Token for
Token ID329
Feature activation-2.27e+00

Pos logit contributions
 planning+0.257
reetings+0.253
alis+0.237
 solution+0.232
joy+0.231
Neg logit contributions
s-0.128
 else-0.124
bs-0.115
 are-0.115
nam-0.108
Token the
Token ID262
Feature activation-6.16e-01

Pos logit contributions
reetings+0.218
 planning+0.211
alis+0.200
 solution+0.196
joy+0.192
Neg logit contributions
 else-0.093
nam-0.093
 are-0.089
 both-0.085
s-0.083
Token special
Token ID2041
Feature activation+4.11e-02

Pos logit contributions
 planning+0.050
reetings+0.049
alis+0.045
 possible+0.043
joy+0.043
Neg logit contributions
 else-0.034
 are-0.032
s-0.032
nam-0.031
 trash-0.031
Token lesson
Token ID11483
Feature activation-4.15e+00

Pos logit contributions
 else+0.002
s+0.002
 are+0.002
 trash+0.001
 tant+0.001
Neg logit contributions
 planning-0.004
reetings-0.004
alis-0.004
brance-0.004
 possible-0.004
Token about
Token ID546
Feature activation-3.99e+00

Pos logit contributions
reetings+0.421
 planning+0.416
alis+0.410
 solution+0.393
joy+0.387
Neg logit contributions
s-0.151
 else-0.144
 are-0.128
 were-0.126
bs-0.117
Token being
Token ID852
Feature activation-8.09e+00

Pos logit contributions
reetings+0.317
 planning+0.304
alis+0.297
joy+0.284
 solution+0.281
Neg logit contributions
 are-0.223
 else-0.222
nam-0.221
s-0.208
 both-0.207
Token unique
Token ID3748
Feature activation-1.54e+00

Pos logit contributions
reetings+0.711
 planning+0.684
alis+0.674
joy+0.639
brance+0.634
Neg logit contributions
 else-0.443
nam-0.372
bs-0.360
 both-0.356
s-0.349
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
 planning+0.141
reetings+0.140
alis+0.132
 possible+0.127
 solution+0.126
Neg logit contributions
s-0.067
 else-0.066
 are-0.063
bs-0.059
 were-0.059
Token<|endoftext|>
Token ID50256
Feature activation+3.08e+00

Pos logit contributions
s+0.008
bs+0.007
 are+0.007
 were+0.007
 else+0.006
Neg logit contributions
 planning-0.019
reetings-0.018
 solution-0.018
alis-0.018
 possible-0.017
Token was
Token ID373
Feature activation-7.40e-01

Pos logit contributions
 planning+0.067
reetings+0.061
 solution+0.059
 example+0.058
alis+0.058
Neg logit contributions
s-0.032
nam-0.032
 else-0.031
bs-0.031
 are-0.030
Token so
Token ID523
Feature activation+5.20e-01

Pos logit contributions
reetings+0.092
alk+0.080
brance+0.079
alis+0.078
joy+0.078
Neg logit contributions
nam-0.020
 both-0.020
 else-0.017
 were-0.015
 are-0.013
Token happy
Token ID3772
Feature activation-1.18e+00

Pos logit contributions
s+0.032
cream+0.031
 tant+0.031
 else+0.030
bs+0.030
Neg logit contributions
 planning-0.039
reetings-0.033
 possible-0.032
 solution-0.032
 example-0.031
Token with
Token ID351
Feature activation+1.49e+00

Pos logit contributions
reetings+0.121
 planning+0.111
alis+0.110
 solution+0.108
 example+0.105
Neg logit contributions
s-0.051
 else-0.045
 are-0.039
 were-0.039
bs-0.037
Token her
Token ID607
Feature activation-3.17e-03

Pos logit contributions
s+0.075
 else+0.075
nam+0.072
bs+0.070
 are+0.070
Neg logit contributions
 planning-0.127
reetings-0.124
 solution-0.114
alis-0.113
joy-0.113
Token creation
Token ID6282
Feature activation-6.53e+00

Pos logit contributions
reetings+0.000
 planning+0.000
alis+0.000
joy+0.000
 possible+0.000
Neg logit contributions
s-0.000
 else-0.000
bs-0.000
 are-0.000
 trash-0.000
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
reetings+0.641
 planning+0.638
alis+0.580
joy+0.580
 possible+0.575
Neg logit contributions
s-0.248
 else-0.244
 are-0.222
bs-0.219
nam-0.211
Token
Token ID198
Feature activation+4.34e-01

Pos logit contributions
s+0.011
 else+0.010
 are+0.010
bs+0.009
nam+0.009
Neg logit contributions
 planning-0.022
reetings-0.022
 possible-0.021
 solution-0.021
 example-0.020
Token
Token ID198
Feature activation+4.28e-01

Pos logit contributions
s+0.023
 else+0.022
nam+0.022
bs+0.021
 are+0.021
Neg logit contributions
 planning-0.036
reetings-0.034
 solution-0.032
joy-0.031
 possible-0.031
TokenBut
Token ID1537
Feature activation+1.25e+00

Pos logit contributions
s+0.025
 else+0.024
nam+0.024
bs+0.024
 are+0.024
Neg logit contributions
 planning-0.033
reetings-0.031
alis-0.029
 solution-0.029
 example-0.028
Token then
Token ID788
Feature activation+1.47e+00

Pos logit contributions
s+0.051
 else+0.050
nam+0.049
bs+0.047
 are+0.046
Neg logit contributions
 planning-0.118
reetings-0.115
 solution-0.107
 possible-0.106
joy-0.106
Token would
Token ID561
Feature activation-2.55e-01

Pos logit contributions
s+0.108
 else+0.105
nam+0.099
bs+0.098
 are+0.097
Neg logit contributions
 planning-0.267
reetings-0.258
 solution-0.242
joy-0.241
alis-0.240
Token never
Token ID1239
Feature activation-7.89e-02

Pos logit contributions
 planning+0.021
reetings+0.020
joy+0.019
alis+0.019
 possible+0.018
Neg logit contributions
 are-0.013
 else-0.013
s-0.013
 were-0.012
bs-0.012
Token be
Token ID307
Feature activation-4.02e+00

Pos logit contributions
 planning+0.007
reetings+0.007
 solution+0.006
 possible+0.006
alis+0.006
Neg logit contributions
s-0.004
 else-0.004
nam-0.004
bs-0.004
 are-0.003
Token so
Token ID523
Feature activation+4.09e-01

Pos logit contributions
reetings+0.304
 planning+0.294
alis+0.273
 Fairy+0.265
brance+0.259
Neg logit contributions
 else-0.261
nam-0.241
s-0.230
 trash-0.228
 both-0.227
Token angry
Token ID7954
Feature activation+2.96e+00

Pos logit contributions
s+0.021
 else+0.020
 are+0.019
nam+0.019
bs+0.019
Neg logit contributions
 planning-0.035
reetings-0.033
 possible-0.031
 solution-0.031
joy-0.031
Token again
Token ID757
Feature activation-7.31e+00

Pos logit contributions
 else+0.161
s+0.156
 are+0.145
 were+0.136
bs+0.136
Neg logit contributions
 planning-0.242
reetings-0.238
alis-0.229
brance-0.217
 solution-0.217
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
reetings+0.717
alis+0.717
 planning+0.711
 solution+0.667
 Fairy+0.658
Neg logit contributions
s-0.263
 else-0.259
 are-0.229
 were-0.217
 both-0.209
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
s+0.006
 else+0.005
 are+0.005
nam+0.005
bs+0.005
Neg logit contributions
 planning-0.011
reetings-0.010
alis-0.010
 possible-0.010
 solution-0.010
Token
Token ID198
Feature activation+3.10e-01

Pos logit contributions
nam+0.015
s+0.014
 else+0.014
bs+0.014
 are+0.013
Neg logit contributions
 planning-0.026
reetings-0.025
 solution-0.023
joy-0.023
 possible-0.023
TokenThe
Token ID464
Feature activation+9.10e-01

Pos logit contributions
s+0.016
 else+0.015
nam+0.015
bs+0.015
 are+0.014
Neg logit contributions
 planning-0.027
reetings-0.026
 solution-0.024
 possible-0.024
joy-0.023
Token moral
Token ID6573
Feature activation-5.26e+00

Pos logit contributions
 else+0.065
nam+0.064
s+0.063
 are+0.062
bs+0.060
Neg logit contributions
 planning-0.057
reetings-0.054
alis-0.051
joy-0.049
 solution-0.048
Token saw
Token ID2497
Feature activation+2.62e+00

Pos logit contributions
cream+0.015
s+0.015
 tant+0.015
nam+0.014
 else+0.014
Neg logit contributions
 planning-0.034
reetings-0.031
joy-0.031
 known-0.030
 solution-0.030
Token some
Token ID617
Feature activation-8.85e-01

Pos logit contributions
s+0.157
bs+0.153
nam+0.152
 else+0.147
 tant+0.145
Neg logit contributions
 planning-0.194
reetings-0.177
joy-0.170
 possible-0.170
 example-0.169
Token children
Token ID1751
Feature activation-9.63e-01

Pos logit contributions
 planning+0.069
reetings+0.065
 solution+0.061
joy+0.061
 possible+0.060
Neg logit contributions
s-0.049
 else-0.047
nam-0.047
bs-0.047
 are-0.045
Token playing
Token ID2712
Feature activation-2.64e+00

Pos logit contributions
 planning+0.062
reetings+0.061
 solution+0.055
alis+0.055
 example+0.054
Neg logit contributions
s-0.068
 else-0.067
 are-0.064
nam-0.064
bs-0.063
Token and
Token ID290
Feature activation-7.69e-01

Pos logit contributions
 planning+0.246
reetings+0.242
 solution+0.223
 possible+0.223
alis+0.222
Neg logit contributions
s-0.113
 else-0.112
 are-0.106
nam-0.103
bs-0.102
Token having
Token ID1719
Feature activation-7.15e+00

Pos logit contributions
reetings+0.058
 planning+0.056
alis+0.053
 solution+0.052
 example+0.052
Neg logit contributions
s-0.047
 else-0.047
nam-0.045
 were-0.045
 are-0.045
Token fun
Token ID1257
Feature activation-8.28e-01

Pos logit contributions
reetings+0.685
 planning+0.654
 solution+0.609
 Fairy+0.604
alis+0.604
Neg logit contributions
 else-0.322
 are-0.312
 were-0.305
s-0.297
 both-0.283
Token.
Token ID13
Feature activation+1.76e-01

Pos logit contributions
reetings+0.084
 planning+0.083
alis+0.079
 Fairy+0.078
 solution+0.078
Neg logit contributions
s-0.031
 else-0.029
 are-0.026
 were-0.025
bs-0.025
Token Joe
Token ID5689
Feature activation-2.87e+00

Pos logit contributions
s+0.009
 else+0.009
 were+0.008
 are+0.008
bs+0.008
Neg logit contributions
reetings-0.015
 planning-0.015
 possible-0.014
 solution-0.014
alis-0.013
Token asked
Token ID1965
Feature activation-2.18e+00

Pos logit contributions
 planning+0.285
reetings+0.282
 possible+0.265
 solution+0.263
alis+0.262
Neg logit contributions
s-0.110
 else-0.104
 are-0.092
bs-0.091
 were-0.088
Token Jane
Token ID12091
Feature activation-5.72e+00

Pos logit contributions
alis+0.190
 planning+0.189
reetings+0.185
brance+0.173
 possible+0.171
Neg logit contributions
 else-0.110
 are-0.107
 were-0.106
s-0.106
 both-0.093
Token in
Token ID287
Feature activation+2.82e+00

Pos logit contributions
 else+0.069
s+0.067
nam+0.067
bs+0.065
 are+0.064
Neg logit contributions
 planning-0.135
reetings-0.127
 possible-0.124
 solution-0.123
 example-0.120
Token her
Token ID607
Feature activation-3.36e-02

Pos logit contributions
 are+0.105
 else+0.102
s+0.101
nam+0.101
 both+0.100
Neg logit contributions
reetings-0.283
 planning-0.272
alis-0.255
 solution-0.244
 possible-0.242
Token pocket
Token ID10000
Feature activation+2.44e+00

Pos logit contributions
 planning+0.003
 solution+0.002
joy+0.002
 example+0.002
 addition+0.002
Neg logit contributions
s-0.002
 else-0.002
nam-0.002
bs-0.002
w-0.002
Token and
Token ID290
Feature activation-8.02e-01

Pos logit contributions
 else+0.125
s+0.122
nam+0.121
bs+0.119
 are+0.117
Neg logit contributions
 planning-0.200
reetings-0.190
 solution-0.179
 possible-0.176
joy-0.175
Token kept
Token ID4030
Feature activation-8.26e-01

Pos logit contributions
 planning+0.066
 solution+0.057
 Fairy+0.056
 possible+0.056
 addition+0.055
Neg logit contributions
s-0.041
 else-0.039
 tant-0.039
nam-0.036
bs-0.036
Token exploring
Token ID13504
Feature activation-6.75e+00

Pos logit contributions
reetings+0.073
 planning+0.069
alis+0.065
 solution+0.065
 possible+0.063
Neg logit contributions
s-0.039
 else-0.039
nam-0.038
bs-0.037
 are-0.036
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
 planning+0.653
reetings+0.648
alis+0.604
 solution+0.588
joy+0.582
Neg logit contributions
s-0.267
 else-0.266
nam-0.249
 are-0.241
bs-0.231
Token She
Token ID1375
Feature activation-8.85e-01

Pos logit contributions
s+0.003
bs+0.003
 both+0.003
 else+0.003
 were+0.003
Neg logit contributions
reetings-0.007
 planning-0.007
 possible-0.007
 solution-0.007
 believing-0.007
Token looked
Token ID3114
Feature activation+2.10e+00

Pos logit contributions
 planning+0.080
reetings+0.076
 solution+0.071
joy+0.071
alis+0.071
Neg logit contributions
s-0.040
 else-0.039
nam-0.038
bs-0.037
 are-0.037
Token under
Token ID739
Feature activation+2.98e+00

Pos logit contributions
s+0.096
 else+0.093
nam+0.091
 are+0.089
bs+0.088
Neg logit contributions
 planning-0.188
reetings-0.175
joy-0.170
 solution-0.167
 possible-0.165
Token leaves
Token ID5667
Feature activation+7.80e-01

Pos logit contributions
s+0.142
 else+0.140
nam+0.134
bs+0.132
 are+0.131
Neg logit contributions
 planning-0.261
reetings-0.253
 solution-0.234
alis-0.233
 possible-0.232
Token explain
Token ID4727
Feature activation-1.40e+00

Pos logit contributions
 planning+0.074
reetings+0.071
 solution+0.068
 possible+0.066
 example+0.065
Neg logit contributions
s-0.046
 else-0.043
nam-0.043
bs-0.043
cream-0.040
Token that
Token ID326
Feature activation+4.72e+00

Pos logit contributions
 planning+0.124
reetings+0.118
 solution+0.112
 possible+0.111
joy+0.111
Neg logit contributions
s-0.064
 else-0.061
nam-0.060
bs-0.060
 are-0.058
Token he
Token ID339
Feature activation+2.80e+00

Pos logit contributions
s+0.155
 else+0.153
nam+0.145
 are+0.139
bs+0.138
Neg logit contributions
 planning-0.482
reetings-0.469
alis-0.442
 solution-0.439
joy-0.437
Token was
Token ID373
Feature activation-8.44e-01

Pos logit contributions
nam+0.118
s+0.116
 else+0.113
bs+0.111
 tant+0.107
Neg logit contributions
 planning-0.258
reetings-0.246
 solution-0.231
 possible-0.228
alis-0.226
Token only
Token ID691
Feature activation-6.69e-01

Pos logit contributions
reetings+0.071
alis+0.066
joy+0.063
 planning+0.063
 addition+0.063
Neg logit contributions
 else-0.046
 both-0.042
 trash-0.040
nam-0.039
 are-0.038
Token trying
Token ID2111
Feature activation-6.13e+00

Pos logit contributions
reetings+0.051
alis+0.048
 planning+0.048
 Fairy+0.045
brance+0.045
Neg logit contributions
 else-0.039
nam-0.038
 are-0.037
s-0.037
bs-0.036
Token to
Token ID284
Feature activation-8.53e-01

Pos logit contributions
reetings+0.512
 planning+0.500
alis+0.482
 Fairy+0.463
joy+0.449
Neg logit contributions
s-0.336
 else-0.328
bs-0.294
 trash-0.291
 are-0.290
Token look
Token ID804
Feature activation+2.34e+00

Pos logit contributions
 planning+0.072
reetings+0.069
 solution+0.064
joy+0.064
alis+0.064
Neg logit contributions
s-0.043
 else-0.043
nam-0.041
bs-0.040
 are-0.040
Token at
Token ID379
Feature activation+4.28e+00

Pos logit contributions
s+0.140
nam+0.134
w+0.132
cream+0.131
bs+0.131
Neg logit contributions
 planning-0.173
reetings-0.158
 possible-0.153
 solution-0.153
joy-0.152
Token what
Token ID644
Feature activation+4.52e+00

Pos logit contributions
s+0.218
bs+0.211
 else+0.205
nam+0.197
 are+0.189
Neg logit contributions
 planning-0.361
reetings-0.335
 possible-0.326
 solution-0.323
joy-0.316
Token was
Token ID373
Feature activation-8.13e-01

Pos logit contributions
nam+0.191
bs+0.190
s+0.188
 tant+0.180
cream+0.168
Neg logit contributions
 planning-0.410
reetings-0.386
 possible-0.374
 solution-0.374
joy-0.354