TOP ACTIVATIONS
MAX = 0.178

sc ooter . She tr und led along the sidewalk all
The storm th und ered and roared above ,
Every time it th und ered afterwards , the famous
a tall tree . Un bek now st to him ,
long tail and a sharp spine . Can we stay here
songs and shaking her t amb our ine . Whenever she
an incredibly big plane flying overhead . The lion cub was
Jack asked , looking around ignor antly . "
for air , he cl amb ered out of the p
sh iver go down her spine . It tasted different than
a big dark cloud appeared overhead . It started to rain
saw a distant bird flying overhead . " Look , look
saw a brown bird flying overhead with something in its mouth
felt a chill down his spine . He suddenly remembered the
saw a big airplane flying overhead . He was so excited
a chill run down his spine . He could not believe
sh iver go down her spine . It was very cold
castle with a dragon flying overhead . She showed the picture
big , adventurous bird flying overhead . They both forgot about
saw a big plane flying overhead . Lily felt dizz y

MIN ACTIVATIONS
MIN = -0.172

. She wanted to say ' hello ' to her child
um said , " Say ' please show me a surprise
. You have to say ' please ' and ' thank
would whisper " thank you " to whoever put it there
it , like you learned ' cat '. It is not
family said " thank you " before they left . They
He said " thank you " to Tina and told her
. He wanted to say ' hello ', so he waved
. He would always say ' thank - you ' to
. The next task said ' clean the floor '. He
illy was known as the ' s illy re ind eer
child said " Thank you " to John and ran home
all said " thank you " to Lily and her friends
It said " thank you " by ticking louder . The
also made the robot say ' thank you ' after it
he said " thank you " to his mom .
prince said " thank you " over and over again .
bird said " thank you " to its friends . They
and said " thank you " to her mom my for
and said " thank you " to its friends . The

INTERVAL 0.090 - 0.178
CONTAINS 0.895%

other and the doctor said goodbye to the patient .
both hands . Then he said the same magic word .
little girl loved it right away . Every night
Dad likes chocolate ," she says . Anna and
J ill cheered and said , " You did it

INTERVAL 0.003 - 0.090
CONTAINS 50.335%

flew all around , visiting different places and meeting new friends
filled with gold coins . The plate was so
The tree grew very tall and was covered in sweet p
Anna runs after the bubbles . She tries to catch them
little girl named Lily . She loved playing outside , especially

INTERVAL -0.085 - 0.003
CONTAINS 46.946%

wanted to know how big the cage was .
boy named Tim . He liked to play outside and look
warm !" " Can I try it ?" Ben
very happy . One day , the red ball saw a
But then they heard a whistle . It was a

INTERVAL -0.172 - -0.085
CONTAINS 1.825%

She said , " Let 's go on a
amp oline and shouted , " W oo - h oo
in ," says Molly . " I have toys and books
at them . " Good bye !" Anna says
think I can help you get down !"
Token sc
Token ID629
Feature activation+2.94e-02

Pos logit contributions
ently+0.061
ered+0.060
eway+0.060
anced+0.060
am+0.058
Neg logit contributions
please-0.050
 especially-0.044
Why-0.041
 responsible-0.040
 chefs-0.040
Tokenooter
Token ID25141
Feature activation+6.81e-02

Pos logit contributions
eway+0.049
heit+0.046
ently+0.046
ered+0.044
inct+0.043
Neg logit contributions
 especially-0.057
please-0.057
 responsible-0.055
 bored-0.055
 boss-0.055
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
eway+0.146
ently+0.142
ered+0.140
ides+0.138
ogo+0.137
Neg logit contributions
please-0.107
 especially-0.104
 boss-0.097
 busy-0.093
 bored-0.093
Token She
Token ID1375
Feature activation+2.07e-02

Pos logit contributions
eway+0.050
ered+0.050
ently+0.050
aned+0.047
anced+0.047
Neg logit contributions
please-0.049
 especially-0.045
 boss-0.042
 responsible-0.042
 busy-0.041
Token tr
Token ID491
Feature activation+5.44e-02

Pos logit contributions
ered+0.043
eway+0.042
ently+0.042
aned+0.041
anced+0.040
Neg logit contributions
please-0.033
 especially-0.030
 boss-0.028
 busy-0.028
 responsible-0.028
Tokenund
Token ID917
Feature activation+1.78e-01

Pos logit contributions
eway+0.078
ently+0.076
ered+0.074
ides+0.073
acle+0.071
Neg logit contributions
please-0.120
 especially-0.118
 busy-0.106
 bored-0.105
 boss-0.105
Tokenled
Token ID992
Feature activation+2.85e-02

Pos logit contributions
eway+0.253
ides+0.233
utt+0.229
ogo+0.223
acle+0.221
Neg logit contributions
please-0.391
 especially-0.386
 creative-0.363
 boss-0.362
 different-0.358
Token along
Token ID1863
Feature activation+6.22e-02

Pos logit contributions
ered+0.074
eway+0.073
ently+0.072
aned+0.070
anced+0.070
Neg logit contributions
please-0.032
 especially-0.028
 boss-0.025
 responsible-0.025
 busy-0.024
Token the
Token ID262
Feature activation-9.08e-04

Pos logit contributions
ered+0.145
eway+0.143
ently+0.142
aned+0.135
anced+0.133
Neg logit contributions
please-0.088
 especially-0.074
 chefs-0.071
 responsible-0.069
 thirsty-0.069
Token sidewalk
Token ID24058
Feature activation+3.22e-02

Pos logit contributions
please+0.002
 especially+0.001
 chefs+0.001
 bored+0.001
 thirsty+0.001
Neg logit contributions
eway-0.002
ently-0.002
ered-0.002
aned-0.002
ogo-0.002
Token all
Token ID477
Feature activation+6.81e-03

Pos logit contributions
eway+0.073
ently+0.072
ered+0.072
ides+0.069
heit+0.069
Neg logit contributions
please-0.046
 especially-0.044
 boss-0.039
 busy-0.039
 bored-0.038
Token
Token ID198
Feature activation-2.53e-02

Pos logit contributions
ered+0.041
eway+0.041
heit+0.040
ently+0.040
aned+0.040
Neg logit contributions
please-0.016
 especially-0.015
 busy-0.013
 boss-0.013
 bored-0.013
Token
Token ID198
Feature activation-2.76e-02

Pos logit contributions
please+0.037
 especially+0.035
 boss+0.032
 bored+0.032
 responsible+0.031
Neg logit contributions
eway-0.056
ently-0.056
ered-0.056
aned-0.053
ides-0.053
TokenThe
Token ID464
Feature activation+2.42e-02

Pos logit contributions
please+0.032
 especially+0.029
 bored+0.028
 boss+0.027
 chefs+0.026
Neg logit contributions
eway-0.070
ered-0.069
ently-0.068
aned-0.066
rum-0.065
Token storm
Token ID6388
Feature activation-3.69e-03

Pos logit contributions
ered+0.048
ently+0.048
eway+0.047
aned+0.046
heit+0.045
Neg logit contributions
please-0.039
 especially-0.035
 chefs-0.033
 responsible-0.031
 thirsty-0.031
Token th
Token ID294
Feature activation-2.61e-02

Pos logit contributions
 boss+0.005
 especially+0.005
please+0.005
 busy+0.005
 spoiled+0.005
Neg logit contributions
ogo-0.008
heit-0.008
rum-0.008
eway-0.008
ently-0.008
Tokenund
Token ID917
Feature activation+1.76e-01

Pos logit contributions
please+0.049
 especially+0.047
 boss+0.044
 bored+0.043
 busy+0.043
Neg logit contributions
ently-0.048
ered-0.048
eway-0.048
aned-0.046
anced-0.045
Tokenered
Token ID1068
Feature activation+5.83e-02

Pos logit contributions
ides+0.229
eway+0.227
utt+0.222
ogo+0.217
oor+0.209
Neg logit contributions
please-0.367
 different-0.362
 boss-0.359
 especially-0.358
 creative-0.357
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
eway+0.132
ently+0.132
ered+0.130
ides+0.128
aned+0.128
Neg logit contributions
please-0.079
 especially-0.078
 busy-0.071
 boss-0.071
 bored-0.069
Token roared
Token ID45615
Feature activation+7.33e-02

Pos logit contributions
eway+0.057
ently+0.057
ered+0.057
aned+0.055
ides+0.054
Neg logit contributions
please-0.027
 especially-0.025
 boss-0.022
 busy-0.022
 bored-0.021
Token above
Token ID2029
Feature activation+7.15e-02

Pos logit contributions
eway+0.158
ently+0.158
ered+0.158
aned+0.151
ides+0.150
Neg logit contributions
please-0.111
 especially-0.105
 boss-0.097
 busy-0.095
 bored-0.094
Token,
Token ID11
Feature activation-3.23e-02

Pos logit contributions
ogo+0.190
eway+0.190
aned+0.189
ently+0.188
ides+0.186
Neg logit contributions
 especially-0.067
please-0.064
 different-0.060
 boss-0.060
 busy-0.058
Token
Token ID198
Feature activation-2.44e-02

Pos logit contributions
please+0.025
 especially+0.023
 busy+0.020
 responsible+0.019
 boss+0.019
Neg logit contributions
ered-0.057
ently-0.057
anced-0.056
eway-0.056
ides-0.056
TokenEvery
Token ID6109
Feature activation+5.88e-05

Pos logit contributions
please+0.031
 especially+0.029
 boss+0.026
 responsible+0.026
 busy+0.026
Neg logit contributions
ently-0.059
ered-0.059
eway-0.059
anced-0.057
ides-0.057
Token time
Token ID640
Feature activation+1.77e-02

Pos logit contributions
aned+0.000
ently+0.000
ides+0.000
ered+0.000
anced+0.000
Neg logit contributions
 especially-0.000
please-0.000
 busy-0.000
 boss-0.000
 different-0.000
Token it
Token ID340
Feature activation+3.01e-02

Pos logit contributions
ently+0.046
eway+0.046
aned+0.045
ides+0.045
ogo+0.045
Neg logit contributions
 especially-0.019
please-0.019
 boss-0.018
 bored-0.017
 busy-0.017
Token th
Token ID294
Feature activation-2.23e-02

Pos logit contributions
ently+0.061
eway+0.061
ered+0.058
ides+0.058
aned+0.058
Neg logit contributions
please-0.051
 especially-0.049
 boss-0.046
 busy-0.045
 bored-0.044
Tokenund
Token ID917
Feature activation+1.73e-01

Pos logit contributions
please+0.052
 especially+0.050
 boss+0.048
 bored+0.047
 busy+0.047
Neg logit contributions
ered-0.031
eway-0.030
ently-0.030
aned-0.028
anced-0.028
Tokenered
Token ID1068
Feature activation+5.87e-02

Pos logit contributions
ides+0.150
eway+0.141
utt+0.131
ogo+0.129
acle+0.127
Neg logit contributions
 boss-0.432
please-0.431
 especially-0.428
 creative-0.426
 different-0.426
Token afterwards
Token ID12979
Feature activation+4.62e-04

Pos logit contributions
eway+0.125
ently+0.125
ered+0.124
aned+0.121
ides+0.121
Neg logit contributions
please-0.089
 especially-0.086
 boss-0.079
 busy-0.078
 bored-0.077
Token,
Token ID11
Feature activation-3.94e-02

Pos logit contributions
eway+0.001
ently+0.001
ered+0.001
aned+0.001
ogo+0.001
Neg logit contributions
please-0.001
 especially-0.001
 boss-0.001
 busy-0.001
 bored-0.001
Token the
Token ID262
Feature activation+5.70e-03

Pos logit contributions
please+0.047
 especially+0.044
 boss+0.040
 busy+0.039
 bored+0.038
Neg logit contributions
eway-0.098
ently-0.098
ered-0.098
aned-0.095
ides-0.095
Token famous
Token ID5863
Feature activation-2.26e-03

Pos logit contributions
eway+0.010
ered+0.010
ently+0.010
ield+0.009
inct+0.009
Neg logit contributions
please-0.010
 especially-0.009
nam-0.009
 chefs-0.009
 responsible-0.008
Token a
Token ID257
Feature activation-2.14e-02

Pos logit contributions
please+0.017
 especially+0.014
 responsible+0.013
 boss+0.013
 busy+0.013
Neg logit contributions
ered-0.032
eway-0.031
ently-0.030
anced-0.030
am-0.029
Token tall
Token ID7331
Feature activation-3.62e-03

Pos logit contributions
please+0.029
 especially+0.025
 bored+0.022
Why+0.022
 responsible+0.022
Neg logit contributions
eway-0.047
heit-0.045
acle-0.045
ield-0.045
 replied-0.045
Token tree
Token ID5509
Feature activation+3.42e-02

Pos logit contributions
please+0.005
 especially+0.005
 boss+0.005
 busy+0.005
 bored+0.005
Neg logit contributions
ered-0.008
ently-0.008
eway-0.008
aned-0.008
ides-0.008
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
heit+0.088
ently+0.087
anced+0.087
ered+0.086
eway+0.086
Neg logit contributions
 especially-0.039
please-0.036
 busy-0.035
 boss-0.035
 thirsty-0.034
Token Un
Token ID791
Feature activation-2.01e-02

Pos logit contributions
ently+0.055
rum+0.054
eway+0.053
ered+0.052
aned+0.052
Neg logit contributions
 especially-0.027
I-0.026
 dizz-0.026
please-0.026
 bored-0.025
Tokenbek
Token ID47083
Feature activation+1.68e-01

Pos logit contributions
please+0.040
 especially+0.037
 boss+0.035
 busy+0.035
 responsible+0.034
Neg logit contributions
eway-0.035
ently-0.034
ered-0.034
aned-0.033
rum-0.032
Tokennow
Token ID2197
Feature activation+7.73e-02

Pos logit contributions
ides+0.213
eway+0.209
ogo+0.193
ently+0.179
rum+0.176
Neg logit contributions
please-0.398
 boss-0.378
 nos-0.355
 creative-0.348
 competitive-0.345
Tokenst
Token ID301
Feature activation+4.87e-02

Pos logit contributions
eway+0.189
ently+0.187
ides+0.185
rum+0.184
aned+0.184
Neg logit contributions
 especially-0.105
please-0.102
 boss-0.096
 busy-0.093
 restless-0.091
Token to
Token ID284
Feature activation+3.06e-02

Pos logit contributions
acle+0.064
iously+0.064
osaurs+0.063
aned+0.062
ides+0.062
Neg logit contributions
 different-0.097
 especially-0.094
 pretty-0.086
 like-0.085
 clean-0.085
Token him
Token ID683
Feature activation+3.19e-02

Pos logit contributions
ently+0.079
eway+0.079
ered+0.078
aned+0.076
ides+0.076
Neg logit contributions
please-0.035
 especially-0.032
 boss-0.028
 busy-0.027
 responsible-0.027
Token,
Token ID11
Feature activation-4.30e-02

Pos logit contributions
eway+0.075
aned+0.074
ently+0.073
ides+0.072
anced+0.072
Neg logit contributions
 especially-0.044
please-0.041
 boss-0.038
 busy-0.037
 different-0.036
Token long
Token ID890
Feature activation+2.39e-02

Pos logit contributions
please+0.043
 especially+0.040
 boss+0.039
 busy+0.037
 bored+0.037
Neg logit contributions
ently-0.059
ered-0.058
eway-0.058
aned-0.056
anced-0.055
Token tail
Token ID7894
Feature activation+5.96e-02

Pos logit contributions
ently+0.056
eway+0.053
anced+0.053
utt+0.052
ered+0.052
Neg logit contributions
 boss-0.033
 especially-0.032
please-0.032
 different-0.031
 thirsty-0.030
Token and
Token ID290
Feature activation+1.96e-02

Pos logit contributions
ently+0.136
ered+0.136
eway+0.136
aned+0.131
heit+0.130
Neg logit contributions
please-0.084
 especially-0.080
 boss-0.074
 bored-0.071
 busy-0.070
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
ently+0.053
eway+0.053
ered+0.053
aned+0.052
ides+0.051
Neg logit contributions
please-0.019
 especially-0.018
 boss-0.016
 bored-0.015
 busy-0.015
Token sharp
Token ID7786
Feature activation+6.27e-02

Pos logit contributions
please+0.021
 especially+0.020
 boss+0.019
 busy+0.018
 bored+0.018
Neg logit contributions
ently-0.029
ered-0.029
eway-0.029
aned-0.028
anced-0.027
Token spine
Token ID19656
Feature activation+1.66e-01

Pos logit contributions
eway+0.120
ently+0.119
ogo+0.114
utt+0.111
anced+0.111
Neg logit contributions
 boss-0.111
please-0.110
 especially-0.103
 like-0.102
 shy-0.102
Token.
Token ID13
Feature activation+3.64e-02

Pos logit contributions
heit+0.416
eway+0.411
ogo+0.406
ently+0.402
utt+0.401
Neg logit contributions
 especially-0.169
please-0.166
 boss-0.154
 bored-0.151
 like-0.147
Token Can
Token ID1680
Feature activation-3.22e-02

Pos logit contributions
ered+0.073
ently+0.073
eway+0.071
aned+0.070
ogo+0.068
Neg logit contributions
please-0.063
 especially-0.060
 boss-0.058
Why-0.055
 bored-0.055
Token we
Token ID356
Feature activation-6.29e-02

Pos logit contributions
please+0.047
 especially+0.046
 boss+0.042
 bored+0.040
 busy+0.040
Neg logit contributions
eway-0.073
ently-0.072
ered-0.071
ides-0.069
aned-0.069
Token stay
Token ID2652
Feature activation-2.07e-02

Pos logit contributions
please+0.100
 especially+0.092
 boss+0.084
 busy+0.084
 responsible+0.084
Neg logit contributions
ered-0.132
eway-0.131
ently-0.129
ides-0.125
aned-0.124
Token here
Token ID994
Feature activation+4.83e-02

Pos logit contributions
please+0.030
 especially+0.027
 boss+0.025
 responsible+0.024
 bored+0.024
Neg logit contributions
ered-0.047
eway-0.046
ently-0.046
am-0.044
ides-0.043
Token songs
Token ID7259
Feature activation+5.70e-02

Pos logit contributions
eway+0.101
ered+0.098
ently+0.097
anced+0.092
ided+0.091
Neg logit contributions
please-0.095
 especially-0.085
 thirsty-0.082
 bored-0.080
 responsible-0.079
Token and
Token ID290
Feature activation+2.65e-02

Pos logit contributions
eway+0.116
ered+0.115
ently+0.115
aned+0.110
ides+0.109
Neg logit contributions
please-0.095
 especially-0.088
 boss-0.082
 busy-0.080
 bored-0.080
Token shaking
Token ID17275
Feature activation+2.86e-02

Pos logit contributions
ered+0.054
eway+0.054
ently+0.051
anced+0.050
zed+0.049
Neg logit contributions
please-0.041
 responsible-0.035
 especially-0.035
 chefs-0.034
 boss-0.034
Token her
Token ID607
Feature activation+3.20e-02

Pos logit contributions
ered+0.061
eway+0.060
ently+0.058
am+0.055
anced+0.055
Neg logit contributions
please-0.046
 especially-0.039
 thirsty-0.038
 responsible-0.038
 chefs-0.038
Token t
Token ID256
Feature activation-1.37e-02

Pos logit contributions
ered+0.052
eway+0.052
ently+0.052
aned+0.049
ides+0.049
Neg logit contributions
please-0.067
 especially-0.063
 boss-0.058
 busy-0.057
 responsible-0.057
Tokenamb
Token ID4131
Feature activation+1.63e-01

Pos logit contributions
please+0.025
 especially+0.024
 boss+0.022
 busy+0.021
 responsible+0.021
Neg logit contributions
eway-0.026
ered-0.026
ently-0.025
heit-0.024
aned-0.024
Tokenour
Token ID454
Feature activation+4.82e-02

Pos logit contributions
eway+0.313
rum+0.262
 jaws+0.247
ool+0.243
acle+0.237
Neg logit contributions
please-0.300
 especially-0.282
 bored-0.275
 boss-0.255
 creative-0.251
Tokenine
Token ID500
Feature activation+2.05e-02

Pos logit contributions
eway+0.104
ered+0.103
ently+0.102
heit+0.101
rum+0.098
Neg logit contributions
please-0.072
 especially-0.070
 boss-0.064
 busy-0.061
 bored-0.060
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
eway+0.048
ered+0.047
ently+0.046
ogo+0.045
rum+0.045
Neg logit contributions
please-0.028
 especially-0.027
 boss-0.025
 busy-0.025
 bored-0.024
Token Whenever
Token ID21326
Feature activation+2.19e-03

Pos logit contributions
eway+0.052
ently+0.051
ered+0.051
aned+0.049
ides+0.048
Neg logit contributions
please-0.052
 especially-0.049
 boss-0.046
 busy-0.045
 responsible-0.045
Token she
Token ID673
Feature activation+1.95e-04

Pos logit contributions
ered+0.005
eway+0.005
ently+0.005
anced+0.004
am+0.004
Neg logit contributions
please-0.003
 especially-0.003
 responsible-0.003
 chefs-0.002
 busy-0.002
Token an
Token ID281
Feature activation+2.47e-02

Pos logit contributions
please+0.009
 especially+0.008
 responsible+0.007
 boss+0.007
 busy+0.007
Neg logit contributions
eway-0.020
ered-0.019
ently-0.019
anced-0.018
rum-0.018
Token incredibly
Token ID8131
Feature activation-2.02e-02

Pos logit contributions
ered+0.054
eway+0.053
ently+0.053
aned+0.051
rum+0.050
Neg logit contributions
please-0.038
 especially-0.034
 boss-0.031
 responsible-0.031
 bored-0.030
Token big
Token ID1263
Feature activation-7.28e-03

Pos logit contributions
please+0.017
 especially+0.015
 boss+0.013
 busy+0.013
 bored+0.012
Neg logit contributions
ently-0.057
ered-0.057
eway-0.057
aned-0.056
ides-0.055
Token plane
Token ID6614
Feature activation+5.51e-02

Pos logit contributions
please+0.013
 especially+0.012
 boss+0.011
 busy+0.011
 bored+0.011
Neg logit contributions
ently-0.014
ered-0.014
eway-0.014
aned-0.014
ides-0.014
Token flying
Token ID7348
Feature activation+2.57e-02

Pos logit contributions
heit+0.153
ently+0.149
ogo+0.148
anced+0.147
rum+0.146
Neg logit contributions
 busy-0.048
 especially-0.047
 nearby-0.047
 boss-0.047
 like-0.046
Token overhead
Token ID16965
Feature activation+1.63e-01

Pos logit contributions
ently+0.071
ides+0.071
ogo+0.068
rum+0.068
heit+0.068
Neg logit contributions
 boss-0.023
 different-0.023
 especially-0.023
 busy-0.022
 very-0.022
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
ogo+0.481
heit+0.473
aned+0.472
ently+0.458
ered+0.454
Neg logit contributions
 different-0.121
 like-0.113
 especially-0.109
 busy-0.107
 boss-0.103
Token The
Token ID383
Feature activation+2.55e-02

Pos logit contributions
ered+0.050
ently+0.050
ides+0.050
aned+0.048
heit+0.048
Neg logit contributions
please-0.023
 especially-0.022
 bored-0.022
 thirsty-0.022
 busy-0.021
Token lion
Token ID18744
Feature activation+3.45e-03

Pos logit contributions
ently+0.055
eway+0.055
ered+0.055
aned+0.052
anced+0.052
Neg logit contributions
please-0.041
 especially-0.039
 boss-0.038
 busy-0.038
 wealthy-0.036
Token cub
Token ID13617
Feature activation+2.19e-02

Pos logit contributions
eway+0.009
ently+0.008
ides+0.008
rum+0.008
ogo+0.008
Neg logit contributions
 especially-0.005
please-0.004
 boss-0.004
 busy-0.004
 shy-0.004
Token was
Token ID373
Feature activation+7.89e-03

Pos logit contributions
eway+0.049
ently+0.048
ered+0.045
ides+0.045
ogo+0.045
Neg logit contributions
please-0.034
 especially-0.033
 boss-0.032
 busy-0.031
 different-0.030
Token Jack
Token ID3619
Feature activation+8.40e-03

Pos logit contributions
please+0.057
 especially+0.052
 responsible+0.050
 chefs+0.049
 boss+0.048
Neg logit contributions
ered-0.066
ently-0.064
eway-0.064
 replied-0.062
aned-0.061
Token asked
Token ID1965
Feature activation+3.22e-02

Pos logit contributions
eway+0.022
ently+0.021
ered+0.021
anced+0.021
ides+0.020
Neg logit contributions
please-0.010
 especially-0.009
 boss-0.008
 bored-0.008
 busy-0.008
Token,
Token ID11
Feature activation-4.64e-02

Pos logit contributions
ered+0.063
ently+0.061
eway+0.061
ided+0.058
aned+0.058
Neg logit contributions
please-0.058
 especially-0.053
 responsible-0.049
 chefs-0.048
 busy-0.048
Token looking
Token ID2045
Feature activation+5.75e-04

Pos logit contributions
please+0.074
 especially+0.066
 responsible+0.063
 chefs+0.063
 creative+0.060
Neg logit contributions
ered-0.093
aned-0.089
ently-0.089
eway-0.088
rum-0.086
Token around
Token ID1088
Feature activation+2.16e-02

Pos logit contributions
ered+0.001
eway+0.001
ently+0.001
am+0.001
aned+0.001
Neg logit contributions
please-0.001
 especially-0.001
 responsible-0.001
 chefs-0.001
 boss-0.001
Token ignor
Token ID11477
Feature activation+1.62e-01

Pos logit contributions
eway+0.046
ered+0.046
ently+0.046
aned+0.043
rum+0.043
Neg logit contributions
please-0.034
 especially-0.031
 boss-0.028
 responsible-0.028
 bored-0.027
Tokenantly
Token ID3875
Feature activation+2.01e-02

Pos logit contributions
cases+0.180
ogo+0.180
eway+0.178
utt+0.175
heit+0.174
Neg logit contributions
please-0.382
 boss-0.353
Why-0.347
I-0.339
 shy-0.323
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
eway+0.046
ently+0.046
ered+0.045
ogo+0.045
aned+0.044
Neg logit contributions
please-0.028
 especially-0.027
 boss-0.025
 busy-0.025
 bored-0.024
Token
Token ID198
Feature activation-2.25e-02

Pos logit contributions
ered+0.048
eway+0.047
ently+0.047
aned+0.045
ides+0.045
Neg logit contributions
please-0.030
 especially-0.028
 boss-0.026
 busy-0.025
 bored-0.025
Token
Token ID198
Feature activation-2.48e-02

Pos logit contributions
please+0.030
 especially+0.028
 boss+0.025
 busy+0.025
 responsible+0.024
Neg logit contributions
ered-0.053
eway-0.053
ently-0.053
aned-0.051
ides-0.051
Token"
Token ID1
Feature activation-1.53e-01

Pos logit contributions
please+0.045
 especially+0.042
 bored+0.040
 thirsty+0.039
 boss+0.038
Neg logit contributions
eway-0.047
ered-0.046
ently-0.046
aned-0.043
ided-0.043
Token for
Token ID329
Feature activation-2.54e-03

Pos logit contributions
ently+0.168
eway+0.168
ered+0.166
ides+0.161
aned+0.160
Neg logit contributions
please-0.114
 especially-0.109
 boss-0.101
 busy-0.098
 bored-0.097
Token air
Token ID1633
Feature activation+5.27e-02

Pos logit contributions
please+0.004
 especially+0.004
 boss+0.004
 bored+0.004
 responsible+0.004
Neg logit contributions
ered-0.005
eway-0.005
ently-0.005
ides-0.005
aned-0.005
Token,
Token ID11
Feature activation-3.39e-02

Pos logit contributions
eway+0.117
ently+0.117
ered+0.116
aned+0.112
ides+0.112
Neg logit contributions
please-0.074
 especially-0.073
 boss-0.067
 busy-0.065
 bored-0.064
Token he
Token ID339
Feature activation+1.73e-02

Pos logit contributions
please+0.054
 especially+0.049
 boss+0.046
 busy+0.045
 responsible+0.045
Neg logit contributions
eway-0.071
ered-0.071
ently-0.071
aned-0.068
ides-0.067
Token cl
Token ID537
Feature activation+2.21e-02

Pos logit contributions
eway+0.035
ently+0.035
ered+0.034
ides+0.033
rum+0.033
Neg logit contributions
please-0.029
 especially-0.029
 boss-0.026
 busy-0.025
 bored-0.025
Tokenamb
Token ID4131
Feature activation+1.61e-01

Pos logit contributions
eway+0.035
ently+0.034
heit+0.033
ered+0.032
ogo+0.032
Neg logit contributions
please-0.047
 especially-0.046
 boss-0.043
 bored-0.042
 thirsty-0.042
Tokenered
Token ID1068
Feature activation+6.15e-02

Pos logit contributions
eway+0.214
 trailer+0.165
inct+0.151
acle+0.150
lor+0.146
Neg logit contributions
please-0.361
 especially-0.357
 creative-0.338
 careless-0.334
 bored-0.333
Token out
Token ID503
Feature activation+4.45e-03

Pos logit contributions
eway+0.132
ered+0.131
ently+0.131
aned+0.126
ides+0.125
Neg logit contributions
please-0.095
 especially-0.089
 boss-0.082
 busy-0.080
 bored-0.080
Token of
Token ID286
Feature activation-5.13e-02

Pos logit contributions
ently+0.010
eway+0.010
ered+0.010
ides+0.009
aned+0.009
Neg logit contributions
please-0.007
 especially-0.006
 boss-0.006
 busy-0.006
 bored-0.006
Token the
Token ID262
Feature activation-6.65e-03

Pos logit contributions
please+0.088
 especially+0.077
 boss+0.071
 responsible+0.070
 bored+0.069
Neg logit contributions
ered-0.108
eway-0.106
ently-0.101
anced-0.099
rum-0.098
Token p
Token ID279
Feature activation-3.32e-02

Pos logit contributions
please+0.009
 especially+0.008
 responsible+0.007
 chefs+0.006
 boss+0.006
Neg logit contributions
ered-0.016
eway-0.016
ently-0.016
aned-0.015
am-0.015
Token sh
Token ID427
Feature activation-1.33e-02

Pos logit contributions
please+0.038
 especially+0.033
 chefs+0.032
 bored+0.030
Why+0.029
Neg logit contributions
ered-0.056
ently-0.055
aned-0.054
eway-0.054
ides-0.054
Tokeniver
Token ID1428
Feature activation+1.14e-02

Pos logit contributions
please+0.024
 especially+0.023
 boss+0.021
 busy+0.021
 bored+0.021
Neg logit contributions
ered-0.025
eway-0.025
ently-0.025
aned-0.024
ides-0.023
Token go
Token ID467
Feature activation-3.66e-02

Pos logit contributions
ered+0.025
eway+0.025
ently+0.024
aned+0.023
ides+0.023
Neg logit contributions
please-0.018
 especially-0.016
 boss-0.015
 busy-0.015
 responsible-0.015
Token down
Token ID866
Feature activation+3.81e-02

Pos logit contributions
please+0.051
 especially+0.047
 boss+0.044
 busy+0.044
 bored+0.043
Neg logit contributions
ently-0.086
eway-0.085
ered-0.085
anced-0.082
ides-0.082
Token her
Token ID607
Feature activation+1.75e-02

Pos logit contributions
eway+0.075
ently+0.073
ered+0.073
aned+0.068
anced+0.066
Neg logit contributions
please-0.067
 especially-0.061
 responsible-0.057
 boss-0.056
 chefs-0.056
Token spine
Token ID19656
Feature activation+1.61e-01

Pos logit contributions
ently+0.029
eway+0.027
ered+0.027
aned+0.027
am+0.026
Neg logit contributions
please-0.036
 especially-0.034
 boss-0.031
Why-0.031
 responsible-0.031
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
heit+0.406
eway+0.400
ered+0.393
ogo+0.392
anced+0.383
Neg logit contributions
 especially-0.176
please-0.175
 busy-0.165
 bored-0.160
 boss-0.156
Token It
Token ID632
Feature activation+3.31e-02

Pos logit contributions
eway+0.033
ered+0.033
ently+0.033
aned+0.031
ides+0.031
Neg logit contributions
please-0.030
 especially-0.028
 boss-0.026
 bored-0.026
 busy-0.026
Token tasted
Token ID29187
Feature activation-2.41e-02

Pos logit contributions
ered+0.061
eway+0.060
ently+0.059
aned+0.057
anced+0.055
Neg logit contributions
please-0.062
 especially-0.058
 boss-0.054
 responsible-0.054
 busy-0.054
Token different
Token ID1180
Feature activation+1.09e-02

Pos logit contributions
please+0.032
 responsible+0.023
 chefs+0.023
 especially+0.022
nam+0.022
Neg logit contributions
ered-0.060
ently-0.056
eway-0.056
heit-0.054
aned-0.054
Token than
Token ID621
Feature activation-3.05e-02

Pos logit contributions
ered+0.020
ently+0.020
eway+0.020
iously+0.019
rum+0.018
Neg logit contributions
please-0.021
 especially-0.018
 responsible-0.018
 bored-0.017
 busy-0.017
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
please+0.051
 especially+0.048
 busy+0.045
 boss+0.045
 bored+0.045
Neg logit contributions
eway-0.096
ently-0.096
ides-0.096
ered-0.095
aned-0.094
Token big
Token ID1263
Feature activation-6.89e-03

Pos logit contributions
please+0.014
 especially+0.013
 bored+0.011
 chefs+0.011
 responsible+0.011
Neg logit contributions
eway-0.027
ered-0.027
ently-0.026
heit-0.026
aned-0.026
Token dark
Token ID3223
Feature activation+3.66e-02

Pos logit contributions
please+0.010
 especially+0.009
 boss+0.009
 busy+0.009
 bored+0.008
Neg logit contributions
ently-0.016
eway-0.016
ered-0.016
aned-0.015
ides-0.015
Token cloud
Token ID6279
Feature activation+1.13e-02

Pos logit contributions
ently+0.079
ered+0.079
eway+0.078
aned+0.075
ides+0.075
Neg logit contributions
please-0.057
 especially-0.053
 boss-0.050
 busy-0.048
 bored-0.048
Token appeared
Token ID4120
Feature activation+2.93e-02

Pos logit contributions
ently+0.026
ogo+0.025
heit+0.025
eway+0.025
ides+0.025
Neg logit contributions
 especially-0.016
please-0.016
Why-0.015
 boss-0.014
 busy-0.014
Token overhead
Token ID16965
Feature activation+1.61e-01

Pos logit contributions
heit+0.075
ered+0.074
eway+0.073
aned+0.073
ides+0.072
Neg logit contributions
 especially-0.030
please-0.029
 busy-0.028
 boss-0.027
 restless-0.026
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
ogo+0.457
ently+0.453
aned+0.453
heit+0.450
anced+0.449
Neg logit contributions
 different-0.133
 especially-0.131
 like-0.129
 boss-0.127
 busy-0.127
Token It
Token ID632
Feature activation+3.01e-02

Pos logit contributions
ered+0.044
ogo+0.044
eway+0.044
ently+0.043
ides+0.043
Neg logit contributions
 bored-0.021
please-0.021
 restless-0.020
 especially-0.020
 boss-0.020
Token started
Token ID2067
Feature activation-4.00e-02

Pos logit contributions
ently+0.065
eway+0.063
ides+0.063
rum+0.061
ered+0.061
Neg logit contributions
please-0.049
 especially-0.048
 boss-0.044
 busy-0.043
 bored-0.042
Token to
Token ID284
Feature activation+2.72e-02

Pos logit contributions
please+0.049
 especially+0.044
 boss+0.039
 responsible+0.039
 busy+0.038
Neg logit contributions
eway-0.099
ered-0.099
ently-0.097
aned-0.095
anced-0.094
Token rain
Token ID6290
Feature activation-6.49e-03

Pos logit contributions
ered+0.055
eway+0.054
ently+0.051
aned+0.051
heit+0.050
Neg logit contributions
please-0.046
 especially-0.042
 responsible-0.039
 chefs-0.039
 bored-0.038
Token saw
Token ID2497
Feature activation-6.56e-03

Pos logit contributions
ered+0.059
eway+0.057
ently+0.057
aned+0.055
anced+0.055
Neg logit contributions
please-0.038
 especially-0.034
 boss-0.032
 responsible-0.031
 chefs-0.031
Token a
Token ID257
Feature activation-2.34e-02

Pos logit contributions
please+0.011
 especially+0.009
 responsible+0.008
 boss+0.008
 chefs+0.008
Neg logit contributions
ered-0.014
eway-0.014
ently-0.013
anced-0.013
am-0.013
Token distant
Token ID12899
Feature activation+3.76e-02

Pos logit contributions
please+0.032
 especially+0.028
 chefs+0.025
 responsible+0.024
 bored+0.024
Neg logit contributions
eway-0.052
acle-0.049
ered-0.049
heit-0.048
 replied-0.048
Token bird
Token ID6512
Feature activation-4.97e-03

Pos logit contributions
eway+0.076
ered+0.076
ently+0.076
aned+0.072
ides+0.072
Neg logit contributions
please-0.063
 especially-0.059
 boss-0.054
 busy-0.053
 bored-0.053
Token flying
Token ID7348
Feature activation+2.27e-02

Pos logit contributions
please+0.007
 especially+0.007
 boss+0.006
 busy+0.006
 thirsty+0.006
Neg logit contributions
ently-0.012
eway-0.012
heit-0.011
anced-0.011
ered-0.011
Token overhead
Token ID16965
Feature activation+1.61e-01

Pos logit contributions
ently+0.050
ered+0.050
eway+0.050
aned+0.048
ides+0.048
Neg logit contributions
please-0.033
 especially-0.032
 boss-0.029
 busy-0.028
 bored-0.028
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
ogo+0.395
aned+0.388
ered+0.388
ently+0.388
eway+0.388
Neg logit contributions
 especially-0.185
please-0.183
 boss-0.171
 busy-0.170
 different-0.170
Token "
Token ID366
Feature activation-1.41e-01

Pos logit contributions
eway+0.031
ered+0.031
ently+0.030
aned+0.029
ides+0.029
Neg logit contributions
please-0.027
 especially-0.026
 boss-0.024
 bored-0.024
 busy-0.023
TokenLook
Token ID8567
Feature activation+7.28e-02

Pos logit contributions
please+0.184
 bored+0.178
 chefs+0.174
 especially+0.172
 busy+0.171
Neg logit contributions
eway-0.315
ered-0.302
ently-0.295
anced-0.293
ides-0.289
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
eway+0.157
ently+0.152
ered+0.149
aned+0.148
anced+0.147
Neg logit contributions
please-0.115
 especially-0.108
 boss-0.103
 busy-0.100
 bored-0.099
Token look
Token ID804
Feature activation-8.80e-04

Pos logit contributions
please+0.074
 especially+0.070
 boss+0.065
 busy+0.062
 bored+0.062
Neg logit contributions
ently-0.141
eway-0.140
ered-0.139
aned-0.137
ides-0.136
Token saw
Token ID2497
Feature activation-6.68e-03

Pos logit contributions
eway+0.070
ently+0.070
ered+0.069
ides+0.068
rum+0.068
Neg logit contributions
please-0.026
 especially-0.023
 boss-0.020
 bored-0.020
 busy-0.020
Token a
Token ID257
Feature activation-2.29e-02

Pos logit contributions
please+0.010
 especially+0.008
 responsible+0.008
 boss+0.007
 thirsty+0.007
Neg logit contributions
eway-0.016
ered-0.015
ently-0.014
anced-0.014
ield-0.014
Token brown
Token ID7586
Feature activation+1.98e-02

Pos logit contributions
please+0.032
 especially+0.029
 bored+0.024
 chefs+0.023
 responsible+0.023
Neg logit contributions
eway-0.050
ield-0.047
acle-0.047
 replied-0.047
heit-0.047
Token bird
Token ID6512
Feature activation-4.00e-03

Pos logit contributions
ently+0.049
ered+0.049
eway+0.048
aned+0.048
ides+0.047
Neg logit contributions
please-0.025
 especially-0.023
 boss-0.023
 busy-0.021
 restless-0.021
Token flying
Token ID7348
Feature activation+2.31e-02

Pos logit contributions
please+0.005
 especially+0.005
 boss+0.004
 busy+0.004
Why+0.004
Neg logit contributions
ently-0.010
anced-0.010
eway-0.010
heit-0.010
aned-0.010
Token overhead
Token ID16965
Feature activation+1.61e-01

Pos logit contributions
ently+0.058
ides+0.056
ered+0.056
eway+0.056
aned+0.055
Neg logit contributions
please-0.027
 especially-0.026
 boss-0.024
 busy-0.023
 bored-0.023
Token with
Token ID351
Feature activation+7.78e-03

Pos logit contributions
ogo+0.456
heit+0.453
aned+0.446
ently+0.443
ered+0.434
Neg logit contributions
 especially-0.134
 different-0.129
 like-0.126
 boss-0.124
 busy-0.119
Token something
Token ID1223
Feature activation-3.18e-02

Pos logit contributions
ered+0.019
eway+0.018
anced+0.018
ently+0.018
rum+0.017
Neg logit contributions
please-0.010
 especially-0.009
 responsible-0.008
 boss-0.008
 busy-0.008
Token in
Token ID287
Feature activation+2.76e-02

Pos logit contributions
please+0.042
 especially+0.040
 boss+0.036
 busy+0.036
 bored+0.035
Neg logit contributions
eway-0.075
ently-0.075
ered-0.074
aned-0.072
ides-0.071
Token its
Token ID663
Feature activation+7.95e-02

Pos logit contributions
ently+0.060
eway+0.058
ides+0.057
ered+0.057
aned+0.056
Neg logit contributions
please-0.042
 especially-0.040
 boss-0.038
 busy-0.037
 different-0.037
Token mouth
Token ID5422
Feature activation+3.41e-02

Pos logit contributions
eway+0.138
ently+0.138
ered+0.135
ides+0.130
aned+0.130
Neg logit contributions
please-0.158
 especially-0.150
 boss-0.145
 busy-0.141
 bored-0.140
Token felt
Token ID2936
Feature activation-3.51e-02

Pos logit contributions
eway+0.043
ently+0.041
ered+0.040
ides+0.040
rum+0.039
Neg logit contributions
please-0.028
 especially-0.027
 boss-0.025
 busy-0.024
 bored-0.024
Token a
Token ID257
Feature activation-2.49e-02

Pos logit contributions
please+0.045
 chefs+0.035
 especially+0.034
 boss+0.030
 fashion+0.030
Neg logit contributions
ered-0.089
eway-0.084
ently-0.083
zed-0.081
aned-0.081
Token chill
Token ID20493
Feature activation+4.06e-03

Pos logit contributions
please+0.033
 especially+0.029
 chefs+0.026
 bored+0.025
 boss+0.025
Neg logit contributions
ered-0.059
ently-0.058
eway-0.058
aned-0.057
heit-0.056
Token down
Token ID866
Feature activation+5.18e-02

Pos logit contributions
eway+0.009
ently+0.009
ered+0.009
aned+0.009
heit+0.009
Neg logit contributions
please-0.006
 especially-0.005
 boss-0.005
 busy-0.005
 bored-0.005
Token his
Token ID465
Feature activation+4.93e-02

Pos logit contributions
eway+0.126
heit+0.125
ered+0.125
ides+0.124
ently+0.123
Neg logit contributions
please-0.062
 especially-0.062
 busy-0.058
 boss-0.057
 bored-0.057
Token spine
Token ID19656
Feature activation+1.60e-01

Pos logit contributions
eway+0.065
ered+0.064
ently+0.063
aned+0.059
anced+0.058
Neg logit contributions
please-0.118
 especially-0.114
 boss-0.109
 busy-0.108
 responsible-0.107
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
heit+0.412
eway+0.402
ogo+0.398
osaurs+0.392
ered+0.391
Neg logit contributions
 especially-0.171
please-0.158
 busy-0.156
 boss-0.147
 bored-0.146
Token He
Token ID679
Feature activation+4.69e-02

Pos logit contributions
ered+0.047
ently+0.045
heit+0.045
ides+0.045
eway+0.045
Neg logit contributions
please-0.026
 especially-0.023
 bored-0.022
 boss-0.022
 busy-0.022
Token suddenly
Token ID6451
Feature activation+2.09e-03

Pos logit contributions
eway+0.113
ently+0.110
ides+0.107
rum+0.107
ogo+0.103
Neg logit contributions
 especially-0.068
please-0.066
 boss-0.060
 busy-0.057
 restless-0.056
Token remembered
Token ID12086
Feature activation-1.40e-02

Pos logit contributions
eway+0.005
ently+0.005
ered+0.005
ides+0.005
rum+0.005
Neg logit contributions
please-0.003
 especially-0.003
 boss-0.002
 bored-0.002
 busy-0.002
Token the
Token ID262
Feature activation-5.57e-03

Pos logit contributions
please+0.018
 especially+0.018
 boss+0.016
 busy+0.016
 bored+0.016
Neg logit contributions
ently-0.033
eway-0.033
ides-0.033
aned-0.033
ered-0.032
Token saw
Token ID2497
Feature activation-5.50e-03

Pos logit contributions
ently+0.085
eway+0.085
ered+0.083
ides+0.083
aned+0.082
Neg logit contributions
please-0.028
 especially-0.026
 bored-0.022
 boss-0.022
 busy-0.022
Token a
Token ID257
Feature activation-2.32e-02

Pos logit contributions
please+0.007
 especially+0.006
 responsible+0.006
 boss+0.006
 busy+0.005
Neg logit contributions
eway-0.013
ered-0.013
ently-0.013
ogo-0.012
anced-0.012
Token big
Token ID1263
Feature activation-1.14e-02

Pos logit contributions
please+0.030
 especially+0.026
 bored+0.022
 responsible+0.022
 chefs+0.022
Neg logit contributions
eway-0.055
ered-0.052
heit-0.051
acle-0.051
ogo-0.051
Token airplane
Token ID19401
Feature activation+5.68e-02

Pos logit contributions
please+0.020
 especially+0.019
 boss+0.017
 busy+0.017
 bored+0.017
Neg logit contributions
eway-0.022
ered-0.022
ently-0.022
aned-0.021
ides-0.021
Token flying
Token ID7348
Feature activation+2.27e-02

Pos logit contributions
heit+0.156
ently+0.152
anced+0.151
rum+0.149
ogo+0.149
Neg logit contributions
 especially-0.056
 busy-0.052
 boss-0.051
 like-0.051
please-0.049
Token overhead
Token ID16965
Feature activation+1.60e-01

Pos logit contributions
ently+0.061
ides+0.060
rum+0.059
ered+0.059
aned+0.059
Neg logit contributions
 especially-0.023
 boss-0.022
 different-0.021
please-0.021
 busy-0.021
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
aned+0.479
ogo+0.475
heit+0.473
ently+0.459
ered+0.457
Neg logit contributions
 different-0.120
 like-0.112
 especially-0.101
 busy-0.100
 pretty-0.097
Token He
Token ID679
Feature activation+3.92e-02

Pos logit contributions
ently+0.048
ered+0.048
ides+0.047
ogo+0.047
heit+0.046
Neg logit contributions
 boss-0.030
please-0.030
 especially-0.030
 restless-0.029
 bored-0.029
Token was
Token ID373
Feature activation+4.15e-03

Pos logit contributions
ently+0.097
eway+0.097
ides+0.095
rum+0.092
ancing+0.090
Neg logit contributions
please-0.054
 especially-0.053
 boss-0.048
 different-0.046
 busy-0.045
Token so
Token ID523
Feature activation-2.25e-02

Pos logit contributions
ered+0.011
eway+0.011
ently+0.011
aned+0.011
anced+0.011
Neg logit contributions
please-0.004
 especially-0.003
 responsible-0.003
 boss-0.003
 chefs-0.003
Token excited
Token ID6568
Feature activation+2.86e-02

Pos logit contributions
please+0.021
 especially+0.017
 chefs+0.014
Why+0.013
 boss+0.013
Neg logit contributions
ered-0.064
eway-0.063
ently-0.062
aned-0.060
am-0.060
Token a
Token ID257
Feature activation-2.62e-02

Pos logit contributions
please+0.051
 chefs+0.039
 especially+0.038
I+0.035
Girls+0.035
Neg logit contributions
ered-0.087
eway-0.082
ently-0.079
zed-0.078
aned-0.076
Token chill
Token ID20493
Feature activation+3.36e-03

Pos logit contributions
please+0.037
 especially+0.032
 chefs+0.031
 bored+0.028
 thirsty+0.027
Neg logit contributions
ered-0.059
ently-0.058
eway-0.057
aned-0.056
ides-0.056
Token run
Token ID1057
Feature activation+6.92e-03

Pos logit contributions
eway+0.008
ently+0.008
ered+0.008
heit+0.008
aned+0.007
Neg logit contributions
please-0.005
 especially-0.004
 boss-0.004
 busy-0.004
 bored-0.004
Token down
Token ID866
Feature activation+4.23e-02

Pos logit contributions
ently+0.016
ered+0.016
eway+0.016
heit+0.015
ogo+0.015
Neg logit contributions
 boss-0.009
 especially-0.009
please-0.009
 busy-0.009
 restless-0.009
Token his
Token ID465
Feature activation+4.24e-02

Pos logit contributions
ered+0.098
ently+0.098
eway+0.098
heit+0.097
ides+0.097
Neg logit contributions
please-0.055
 especially-0.054
 busy-0.050
 boss-0.050
 bored-0.048
Token spine
Token ID19656
Feature activation+1.60e-01

Pos logit contributions
eway+0.055
ered+0.055
ently+0.054
aned+0.050
anced+0.050
Neg logit contributions
please-0.101
 especially-0.098
 boss-0.094
 busy-0.093
 responsible-0.092
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
heit+0.411
eway+0.399
ogo+0.397
haw+0.393
ered+0.390
Neg logit contributions
 especially-0.170
 busy-0.159
please-0.153
 different-0.149
 bored-0.149
Token He
Token ID679
Feature activation+4.37e-02

Pos logit contributions
ered+0.034
ently+0.033
eway+0.033
heit+0.033
rum+0.032
Neg logit contributions
please-0.034
 especially-0.034
 boss-0.033
 busy-0.032
 bored-0.032
Token could
Token ID714
Feature activation-1.15e-02

Pos logit contributions
eway+0.098
ently+0.097
ides+0.093
rum+0.092
ogo+0.091
Neg logit contributions
please-0.067
 especially-0.066
 boss-0.059
 busy-0.057
 bored-0.056
Token not
Token ID407
Feature activation+1.47e-02

Pos logit contributions
please+0.016
 especially+0.014
 boss+0.013
 chefs+0.012
 responsible+0.012
Neg logit contributions
ered-0.027
eway-0.027
ently-0.027
aned-0.026
ides-0.025
Token believe
Token ID1975
Feature activation+1.11e-02

Pos logit contributions
ered+0.031
eway+0.029
ides+0.029
ently+0.028
acle+0.027
Neg logit contributions
please-0.024
 chefs-0.021
 responsible-0.020
 bored-0.020
 especially-0.020
Token sh
Token ID427
Feature activation-1.04e-02

Pos logit contributions
please+0.033
 chefs+0.028
 especially+0.028
Why+0.025
 pige+0.025
Neg logit contributions
ered-0.045
ently-0.044
zed-0.043
heit-0.043
iously-0.043
Tokeniver
Token ID1428
Feature activation+1.24e-02

Pos logit contributions
please+0.019
 especially+0.018
 boss+0.017
 chefs+0.017
 bored+0.016
Neg logit contributions
ered-0.019
eway-0.019
ently-0.019
aned-0.018
ides-0.018
Token go
Token ID467
Feature activation-3.58e-02

Pos logit contributions
ered+0.026
eway+0.026
ently+0.026
aned+0.025
ides+0.025
Neg logit contributions
please-0.020
 especially-0.018
 boss-0.017
 responsible-0.016
 busy-0.016
Token down
Token ID866
Feature activation+3.95e-02

Pos logit contributions
please+0.041
 especially+0.037
 boss+0.033
 busy+0.032
 responsible+0.032
Neg logit contributions
eway-0.091
ered-0.091
ently-0.091
aned-0.088
ides-0.087
Token her
Token ID607
Feature activation+1.75e-02

Pos logit contributions
eway+0.071
ered+0.070
ently+0.069
aned+0.062
anced+0.062
Neg logit contributions
please-0.074
 especially-0.066
 responsible-0.065
 boss-0.063
 chefs-0.063
Token spine
Token ID19656
Feature activation+1.60e-01

Pos logit contributions
ently+0.027
eway+0.026
ered+0.026
aned+0.025
am+0.025
Neg logit contributions
please-0.038
 especially-0.035
 chefs-0.033
 boss-0.032
Why-0.032
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
eway+0.388
heit+0.387
ered+0.383
ogo+0.375
ently+0.373
Neg logit contributions
please-0.189
 especially-0.188
 busy-0.175
 bored-0.172
 boss-0.169
Token It
Token ID632
Feature activation+3.25e-02

Pos logit contributions
eway+0.044
ered+0.044
ently+0.043
aned+0.041
ides+0.041
Neg logit contributions
please-0.039
 especially-0.036
 boss-0.034
 busy-0.033
 bored-0.033
Token was
Token ID373
Feature activation+1.38e-02

Pos logit contributions
ered+0.058
eway+0.056
ently+0.055
aned+0.054
anced+0.052
Neg logit contributions
please-0.063
 especially-0.058
 chefs-0.055
 responsible-0.055
 boss-0.055
Token very
Token ID845
Feature activation-3.52e-02

Pos logit contributions
ered+0.032
eway+0.031
aned+0.030
ogo+0.029
ently+0.029
Neg logit contributions
please-0.019
 chefs-0.015
 responsible-0.014
 pige-0.013
 especially-0.013
Token cold
Token ID4692
Feature activation+8.31e-03

Pos logit contributions
please+0.033
 especially+0.028
 boss+0.023
 chefs+0.023
 responsible+0.022
Neg logit contributions
eway-0.097
ered-0.097
ently-0.096
aned-0.094
rum-0.092
Token castle
Token ID16669
Feature activation+7.16e-03

Pos logit contributions
please+0.024
 especially+0.022
 responsible+0.020
 bored+0.020
Why+0.019
Neg logit contributions
eway-0.025
ered-0.025
ently-0.025
ield-0.024
 replied-0.024
Token with
Token ID351
Feature activation+3.25e-03

Pos logit contributions
ered+0.018
eway+0.018
ently+0.018
heit+0.017
aned+0.017
Neg logit contributions
please-0.008
 especially-0.008
 boss-0.007
 busy-0.007
 bored-0.007
Token a
Token ID257
Feature activation-3.03e-02

Pos logit contributions
ered+0.007
eway+0.007
anced+0.007
am+0.007
ently+0.006
Neg logit contributions
please-0.004
 especially-0.004
 responsible-0.004
 thirsty-0.004
 bored-0.004
Token dragon
Token ID10441
Feature activation+5.98e-03

Pos logit contributions
please+0.041
 especially+0.037
 responsible+0.033
 bored+0.032
 chefs+0.031
Neg logit contributions
eway-0.069
ered-0.069
ently-0.068
heit-0.066
aned-0.066
Token flying
Token ID7348
Feature activation+1.97e-02

Pos logit contributions
ently+0.015
ered+0.015
eway+0.015
heit+0.014
aned+0.014
Neg logit contributions
please-0.008
 especially-0.007
 boss-0.007
 busy-0.006
 bored-0.006
Token overhead
Token ID16965
Feature activation+1.60e-01

Pos logit contributions
ently+0.053
ides+0.052
ered+0.052
rum+0.051
ogo+0.051
Neg logit contributions
please-0.019
 boss-0.018
 especially-0.018
 different-0.017
 busy-0.016
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
ogo+0.440
heit+0.434
eway+0.426
utt+0.425
ered+0.425
Neg logit contributions
 especially-0.135
 different-0.134
please-0.129
 boss-0.128
 like-0.125
Token She
Token ID1375
Feature activation+1.91e-02

Pos logit contributions
eway+0.044
ently+0.044
ered+0.044
aned+0.042
anced+0.041
Neg logit contributions
please-0.038
 especially-0.036
 boss-0.033
 responsible-0.033
 busy-0.032
Token showed
Token ID3751
Feature activation+1.29e-02

Pos logit contributions
eway+0.038
ered+0.038
ently+0.038
aned+0.036
ides+0.036
Neg logit contributions
please-0.032
 especially-0.030
 boss-0.028
 busy-0.027
 bored-0.027
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
ered+0.030
eway+0.030
ently+0.030
aned+0.028
anced+0.028
Neg logit contributions
please-0.018
 especially-0.016
 boss-0.015
 busy-0.014
 bored-0.014
Token picture
Token ID4286
Feature activation+1.27e-02

Pos logit contributions
please+0.039
 especially+0.036
 boss+0.033
 responsible+0.033
 busy+0.033
Neg logit contributions
eway-0.042
ered-0.042
ently-0.042
aned-0.040
ides-0.039
Token big
Token ID1263
Feature activation-9.61e-03

Pos logit contributions
please+0.027
 especially+0.026
 chefs+0.022
 bored+0.021
 responsible+0.021
Neg logit contributions
eway-0.042
ield-0.040
ered-0.040
heit-0.040
acle-0.040
Token,
Token ID11
Feature activation-4.37e-02

Pos logit contributions
please+0.017
 especially+0.016
 boss+0.014
 responsible+0.014
 bored+0.014
Neg logit contributions
eway-0.019
ered-0.019
ently-0.019
aned-0.018
ides-0.018
Token adventurous
Token ID39770
Feature activation+1.50e-03

Pos logit contributions
please+0.053
 especially+0.045
 chefs+0.039
 responsible+0.037
 bored+0.036
Neg logit contributions
eway-0.107
ered-0.105
ently-0.103
heit-0.101
aned-0.101
Token bird
Token ID6512
Feature activation+4.84e-04

Pos logit contributions
ered+0.003
ently+0.003
eway+0.003
aned+0.003
ides+0.003
Neg logit contributions
please-0.002
 especially-0.002
 boss-0.002
 busy-0.002
 bored-0.002
Token flying
Token ID7348
Feature activation+2.47e-02

Pos logit contributions
ently+0.001
anced+0.001
heit+0.001
eway+0.001
ides+0.001
Neg logit contributions
please-0.001
 especially-0.001
 boss-0.000
 sick-0.000
 thirsty-0.000
Token overhead
Token ID16965
Feature activation+1.60e-01

Pos logit contributions
ently+0.059
ides+0.059
ogo+0.056
ered+0.056
rum+0.056
Neg logit contributions
please-0.032
 especially-0.031
 boss-0.030
 busy-0.029
 different-0.028
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
ogo+0.448
heit+0.439
aned+0.434
ently+0.432
ered+0.428
Neg logit contributions
 different-0.136
 especially-0.135
 busy-0.133
 like-0.128
 boss-0.126
Token They
Token ID1119
Feature activation+1.73e-03

Pos logit contributions
ently+0.037
eway+0.037
ered+0.036
ides+0.036
ogo+0.036
Neg logit contributions
 especially-0.024
please-0.024
 busy-0.023
 bored-0.023
 thirsty-0.023
Token both
Token ID1111
Feature activation+1.20e-02

Pos logit contributions
eway+0.004
ently+0.004
ered+0.004
aned+0.004
ides+0.004
Neg logit contributions
please-0.003
 especially-0.003
 boss-0.002
 busy-0.002
 bored-0.002
Token forgot
Token ID16453
Feature activation-3.09e-02

Pos logit contributions
ered+0.024
eway+0.023
ently+0.023
aned+0.022
anced+0.022
Neg logit contributions
please-0.021
 especially-0.019
 boss-0.018
 busy-0.018
 responsible-0.018
Token about
Token ID546
Feature activation-7.27e-03

Pos logit contributions
 especially+0.050
please+0.047
 different+0.046
 boss+0.046
 busy+0.046
Neg logit contributions
ently-0.065
ides-0.064
aned-0.063
eway-0.062
ogo-0.061
Token saw
Token ID2497
Feature activation-1.08e-02

Pos logit contributions
eway+0.055
ently+0.055
ered+0.055
aned+0.053
ides+0.053
Neg logit contributions
please-0.023
 especially-0.021
 boss-0.019
 busy-0.018
 bored-0.018
Token a
Token ID257
Feature activation-2.33e-02

Pos logit contributions
please+0.018
 especially+0.015
 responsible+0.014
 boss+0.013
 bored+0.013
Neg logit contributions
ered-0.024
eway-0.023
ently-0.022
anced-0.022
heit-0.021
Token big
Token ID1263
Feature activation-1.73e-02

Pos logit contributions
please+0.032
 especially+0.029
Why+0.025
 responsible+0.024
 chefs+0.024
Neg logit contributions
eway-0.052
acle-0.049
ield-0.049
heit-0.048
ered-0.048
Token plane
Token ID6614
Feature activation+5.19e-02

Pos logit contributions
please+0.028
 especially+0.025
 responsible+0.023
 boss+0.022
 bored+0.022
Neg logit contributions
eway-0.036
ered-0.036
ently-0.036
aned-0.034
ides-0.034
Token flying
Token ID7348
Feature activation+2.26e-02

Pos logit contributions
heit+0.139
rum+0.137
ently+0.137
ogo+0.137
anced+0.136
Neg logit contributions
 busy-0.049
 especially-0.049
 boss-0.047
 like-0.045
 different-0.045
Token overhead
Token ID16965
Feature activation+1.59e-01

Pos logit contributions
ently+0.058
ides+0.057
rum+0.056
ogo+0.056
ered+0.056
Neg logit contributions
 especially-0.024
 boss-0.024
please-0.024
 busy-0.023
 different-0.023
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
ogo+0.435
aned+0.422
ently+0.419
heit+0.415
rum+0.411
Neg logit contributions
 different-0.148
 especially-0.146
 boss-0.141
 busy-0.140
 like-0.133
Token Lily
Token ID20037
Feature activation-2.95e-02

Pos logit contributions
ered+0.038
ently+0.038
eway+0.038
aned+0.036
ides+0.036
Neg logit contributions
please-0.031
 especially-0.029
 boss-0.027
 bored-0.027
 busy-0.027
Token felt
Token ID2936
Feature activation-3.77e-02

Pos logit contributions
please+0.046
 especially+0.045
 boss+0.042
 busy+0.040
 bored+0.039
Neg logit contributions
eway-0.066
ently-0.064
ered-0.062
ides-0.061
anced-0.060
Token dizz
Token ID29527
Feature activation-3.03e-02

Pos logit contributions
please+0.048
 especially+0.036
 chefs+0.034
 responsible+0.031
 boss+0.031
Neg logit contributions
ered-0.098
eway-0.094
ently-0.090
rum-0.090
heit-0.089
Tokeny
Token ID88
Feature activation+1.38e-02

Pos logit contributions
please+0.058
 especially+0.053
 boss+0.050
 responsible+0.050
 creative+0.050
Neg logit contributions
ently-0.054
ered-0.054
aned-0.053
eway-0.053
rum-0.052
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
please+0.003
 especially+0.003
 boss+0.002
 responsible+0.002
 bored+0.002
Neg logit contributions
ently-0.003
eway-0.003
ered-0.003
ides-0.003
aned-0.003
Token She
Token ID1375
Feature activation+1.77e-02

Pos logit contributions
eway+0.041
ered+0.041
ently+0.041
aned+0.039
anced+0.039
Neg logit contributions
please-0.038
 especially-0.035
 boss-0.033
 responsible-0.032
 busy-0.032
Token wanted
Token ID2227
Feature activation-9.96e-03

Pos logit contributions
eway+0.038
ently+0.038
ered+0.038
aned+0.036
ides+0.036
Neg logit contributions
please-0.027
 especially-0.026
 boss-0.024
 busy-0.023
 bored-0.023
Token to
Token ID284
Feature activation+1.85e-02

Pos logit contributions
please+0.016
 responsible+0.014
 thirsty+0.014
 restless+0.014
 especially+0.013
Neg logit contributions
ered-0.020
eway-0.019
ently-0.018
anced-0.017
am-0.017
Token say
Token ID910
Feature activation+7.77e-02

Pos logit contributions
ered+0.040
eway+0.039
ently+0.037
ides+0.036
rum+0.036
Neg logit contributions
please-0.029
 especially-0.025
 bored-0.025
 busy-0.024
 chefs-0.024
Token '
Token ID705
Feature activation-1.70e-01

Pos logit contributions
eway+0.114
ently+0.113
ered+0.112
aned+0.108
ides+0.106
Neg logit contributions
please-0.173
 especially-0.166
 boss-0.160
 busy-0.156
 bored-0.156
Tokenhello
Token ID31373
Feature activation+1.19e-02

Pos logit contributions
 chefs+0.209
 especially+0.206
 bored+0.203
 thirsty+0.199
 responsible+0.195
Neg logit contributions
ered-0.367
ides-0.357
ield-0.353
eway-0.353
anced-0.352
Token'
Token ID6
Feature activation-7.93e-02

Pos logit contributions
eway+0.019
ered+0.019
ently+0.019
aned+0.018
ides+0.018
Neg logit contributions
please-0.025
 especially-0.024
 boss-0.022
 busy-0.022
 bored-0.022
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
please+0.106
 especially+0.100
 boss+0.091
 busy+0.088
 responsible+0.086
Neg logit contributions
eway-0.187
ered-0.187
ently-0.186
aned-0.180
heit-0.179
Token her
Token ID607
Feature activation+2.02e-02

Pos logit contributions
ered+0.037
eway+0.035
ently+0.035
heit+0.033
ides+0.033
Neg logit contributions
please-0.026
 especially-0.023
 thirsty-0.022
 chefs-0.022
 bored-0.022
Token child
Token ID1200
Feature activation-1.08e-02

Pos logit contributions
eway+0.040
ently+0.040
ered+0.040
ides+0.037
rum+0.037
Neg logit contributions
please-0.034
 especially-0.031
 boss-0.029
 responsible-0.029
 bored-0.029
Tokenum
Token ID388
Feature activation-5.70e-02

Pos logit contributions
please+0.064
 boss+0.063
 especially+0.060
 busy+0.059
 competitive+0.057
Neg logit contributions
eway-0.045
zed-0.043
ered-0.043
anced-0.042
heit-0.042
Token said
Token ID531
Feature activation+9.08e-02

Pos logit contributions
please+0.098
 especially+0.094
 boss+0.088
 busy+0.086
 different+0.082
Neg logit contributions
eway-0.121
rum-0.111
ently-0.110
ides-0.109
heit-0.109
Token,
Token ID11
Feature activation-5.75e-02

Pos logit contributions
eway+0.187
ently+0.183
ered+0.183
aned+0.178
ides+0.177
Neg logit contributions
please-0.148
 especially-0.140
 boss-0.133
 busy-0.128
 bored-0.127
Token "
Token ID366
Feature activation-1.43e-01

Pos logit contributions
please+0.095
 especially+0.089
 boss+0.084
 busy+0.082
 bored+0.080
Neg logit contributions
eway-0.120
ered-0.119
ently-0.117
ides-0.115
ogo-0.114
TokenSay
Token ID25515
Feature activation+9.09e-02

Pos logit contributions
please+0.168
 especially+0.157
 bored+0.152
 boss+0.147
 thirsty+0.147
Neg logit contributions
eway-0.350
ently-0.340
ered-0.338
anced-0.328
ides-0.326
Token '
Token ID705
Feature activation-1.70e-01

Pos logit contributions
eway+0.180
ered+0.180
ently+0.179
aned+0.171
ides+0.170
Neg logit contributions
please-0.155
 especially-0.145
 boss-0.135
 busy-0.132
 bored-0.131
Tokenplease
Token ID29688
Feature activation-7.83e-02

Pos logit contributions
 especially+0.238
please+0.231
 bored+0.224
 chefs+0.222
 responsible+0.220
Neg logit contributions
ered-0.356
eway-0.347
ently-0.341
ides-0.340
ield-0.339
Token show
Token ID905
Feature activation+1.84e-02

Pos logit contributions
please+0.127
 especially+0.116
 boss+0.113
 busy+0.107
 responsible+0.105
Neg logit contributions
eway-0.169
ently-0.162
ered-0.161
aned-0.160
ogo-0.160
Token me
Token ID502
Feature activation-4.25e-02

Pos logit contributions
eway+0.041
aned+0.041
ently+0.040
ogo+0.040
ered+0.040
Neg logit contributions
please-0.025
 especially-0.024
 boss-0.023
 different-0.022
 busy-0.022
Token a
Token ID257
Feature activation-5.53e-02

Pos logit contributions
please+0.060
 especially+0.059
 boss+0.054
 busy+0.052
 responsible+0.052
Neg logit contributions
eway-0.092
aned-0.092
ently-0.091
ered-0.091
anced-0.090
Token surprise
Token ID5975
Feature activation-5.64e-04

Pos logit contributions
please+0.098
 especially+0.092
 boss+0.086
 busy+0.085
 bored+0.084
Neg logit contributions
eway-0.106
ered-0.106
ently-0.105
aned-0.101
ides-0.099
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
ently+0.042
eway+0.042
ered+0.041
aned+0.041
ogo+0.040
Neg logit contributions
please-0.026
 especially-0.026
 boss-0.023
 bored-0.023
 busy-0.023
Token You
Token ID921
Feature activation+2.75e-02

Pos logit contributions
ently+0.051
ered+0.050
aned+0.050
ogo+0.048
eway+0.048
Neg logit contributions
 especially-0.040
please-0.038
 boss-0.036
I-0.036
Why-0.036
Token have
Token ID423
Feature activation-5.22e-02

Pos logit contributions
ently+0.060
eway+0.060
ered+0.057
aned+0.057
rum+0.056
Neg logit contributions
please-0.042
 especially-0.041
 boss-0.038
 bored-0.037
 busy-0.036
Token to
Token ID284
Feature activation+9.32e-03

Pos logit contributions
please+0.067
 especially+0.061
 boss+0.053
 busy+0.052
 responsible+0.052
Neg logit contributions
ered-0.123
ently-0.120
eway-0.119
anced-0.117
am-0.116
Token say
Token ID910
Feature activation+7.53e-02

Pos logit contributions
ently+0.021
eway+0.021
ered+0.020
aned+0.020
anced+0.020
Neg logit contributions
please-0.014
 especially-0.013
 boss-0.013
 responsible-0.012
 busy-0.012
Token '
Token ID705
Feature activation-1.70e-01

Pos logit contributions
eway+0.159
ently+0.156
ered+0.155
aned+0.152
ides+0.150
Neg logit contributions
please-0.119
 especially-0.115
 boss-0.110
 bored-0.106
 busy-0.106
Tokenplease
Token ID29688
Feature activation-8.23e-02

Pos logit contributions
 especially+0.224
please+0.211
 bored+0.204
 chefs+0.204
 thirsty+0.201
Neg logit contributions
ered-0.373
ently-0.363
eway-0.363
ides-0.356
ield-0.356
Token'
Token ID6
Feature activation-7.41e-02

Pos logit contributions
please+0.126
 especially+0.114
 boss+0.109
 busy+0.103
 responsible+0.102
Neg logit contributions
eway-0.186
ered-0.178
ently-0.178
aned-0.175
anced-0.175
Token and
Token ID290
Feature activation+3.63e-03

Pos logit contributions
please+0.121
 especially+0.113
 boss+0.107
 busy+0.104
 responsible+0.103
Neg logit contributions
eway-0.155
ered-0.154
ently-0.151
anced-0.148
ogo-0.147
Token '
Token ID705
Feature activation-1.52e-01

Pos logit contributions
ered+0.006
eway+0.006
ently+0.006
aned+0.006
ides+0.006
Neg logit contributions
please-0.007
 especially-0.007
 boss-0.006
 busy-0.006
 bored-0.006
Tokenthank
Token ID40716
Feature activation-4.21e-02

Pos logit contributions
 especially+0.188
please+0.185
 bored+0.169
 chefs+0.168
 boss+0.166
Neg logit contributions
ered-0.349
ently-0.343
eway-0.340
ides-0.333
ield-0.331
Token would
Token ID561
Feature activation-7.08e-03

Pos logit contributions
eway+0.003
ently+0.003
ered+0.003
aned+0.003
ides+0.003
Neg logit contributions
please-0.002
 especially-0.002
 boss-0.002
 busy-0.002
 bored-0.002
Token whisper
Token ID31992
Feature activation+8.21e-02

Pos logit contributions
 especially+0.010
please+0.009
 boss+0.008
 busy+0.008
 tidy+0.008
Neg logit contributions
ently-0.017
eway-0.017
aned-0.016
ered-0.016
rum-0.016
Token "
Token ID366
Feature activation-1.43e-01

Pos logit contributions
eway+0.192
ered+0.192
ently+0.191
aned+0.184
ides+0.183
Neg logit contributions
please-0.111
 especially-0.101
 boss-0.092
 busy-0.090
 bored-0.089
Tokenthank
Token ID40716
Feature activation-4.83e-02

Pos logit contributions
please+0.207
 especially+0.202
 chefs+0.200
 responsible+0.198
 bored+0.197
Neg logit contributions
ered-0.299
eway-0.298
ently-0.293
ides-0.286
anced-0.284
Token you
Token ID345
Feature activation-3.93e-02

Pos logit contributions
please+0.078
 especially+0.073
 bored+0.068
 thirsty+0.067
 responsible+0.066
Neg logit contributions
ered-0.098
eway-0.098
ently-0.098
ides-0.093
heit-0.091
Token"
Token ID1
Feature activation-1.69e-01

Pos logit contributions
please+0.068
 especially+0.066
 boss+0.062
 responsible+0.059
 busy+0.059
Neg logit contributions
ered-0.076
eway-0.075
aned-0.074
ently-0.073
anced-0.073
Token to
Token ID284
Feature activation+6.25e-03

Pos logit contributions
please+0.239
 especially+0.225
 chefs+0.214
 thirsty+0.208
 bored+0.207
Neg logit contributions
ently-0.366
eway-0.361
ered-0.359
aned-0.344
ides-0.343
Token whoever
Token ID16958
Feature activation-3.33e-02

Pos logit contributions
ered+0.016
eway+0.016
ently+0.016
aned+0.015
anced+0.015
Neg logit contributions
please-0.007
 especially-0.007
 boss-0.006
 bored-0.006
 busy-0.006
Token put
Token ID1234
Feature activation-6.77e-02

Pos logit contributions
please+0.055
 especially+0.052
 boss+0.048
 busy+0.047
 bored+0.047
Neg logit contributions
ered-0.068
eway-0.067
ently-0.067
aned-0.064
ides-0.064
Token it
Token ID340
Feature activation+2.17e-02

Pos logit contributions
please+0.089
 especially+0.081
 boss+0.073
 responsible+0.071
 busy+0.071
Neg logit contributions
ered-0.162
eway-0.161
ently-0.160
aned-0.154
anced-0.153
Token there
Token ID612
Feature activation+3.91e-02

Pos logit contributions
eway+0.054
ently+0.053
ered+0.053
aned+0.052
ides+0.052
Neg logit contributions
please-0.026
 especially-0.025
 boss-0.022
 busy-0.022
 bored-0.021
Token it
Token ID340
Feature activation+1.21e-02

Pos logit contributions
please+0.018
 especially+0.016
 boss+0.014
 bored+0.014
 busy+0.014
Neg logit contributions
eway-0.026
ently-0.026
ered-0.026
aned-0.024
rum-0.024
Token,
Token ID11
Feature activation-5.41e-02

Pos logit contributions
eway+0.027
ered+0.026
ently+0.026
aned+0.026
ides+0.025
Neg logit contributions
please-0.018
 especially-0.017
 boss-0.016
 busy-0.015
 bored-0.015
Token like
Token ID588
Feature activation-5.83e-02

Pos logit contributions
please+0.071
 especially+0.066
 boss+0.059
 busy+0.058
 bored+0.057
Neg logit contributions
eway-0.129
ered-0.129
ently-0.128
aned-0.124
ides-0.123
Token you
Token ID345
Feature activation-6.69e-03

Pos logit contributions
please+0.087
 especially+0.079
 boss+0.072
 bored+0.072
 busy+0.071
Neg logit contributions
ered-0.128
eway-0.128
ently-0.127
aned-0.120
ides-0.120
Token learned
Token ID4499
Feature activation+1.46e-02

Pos logit contributions
please+0.010
 especially+0.010
 boss+0.009
 busy+0.009
 bored+0.008
Neg logit contributions
eway-0.015
ently-0.015
ered-0.015
aned-0.014
anced-0.014
Token '
Token ID705
Feature activation-1.69e-01

Pos logit contributions
eway+0.032
ered+0.032
ently+0.032
aned+0.031
ides+0.031
Neg logit contributions
please-0.021
 especially-0.020
 boss-0.019
 busy-0.018
 responsible-0.018
Tokencat
Token ID9246
Feature activation+2.61e-02

Pos logit contributions
 especially+0.224
 chefs+0.216
 thirsty+0.213
 bored+0.210
please+0.202
Neg logit contributions
ered-0.360
ield-0.349
ently-0.346
ides-0.345
eway-0.340
Token'.
Token ID4458
Feature activation+1.06e-02

Pos logit contributions
eway+0.047
ered+0.046
ently+0.046
aned+0.044
ogo+0.044
Neg logit contributions
please-0.050
 especially-0.047
 boss-0.045
 busy-0.043
 responsible-0.043
Token It
Token ID632
Feature activation+3.40e-02

Pos logit contributions
ered+0.019
eway+0.019
ently+0.019
aned+0.019
ogo+0.018
Neg logit contributions
please-0.020
 especially-0.019
 boss-0.018
 busy-0.018
I-0.018
Token is
Token ID318
Feature activation+9.89e-03

Pos logit contributions
eway+0.060
ogo+0.059
ently+0.059
ides+0.057
aned+0.056
Neg logit contributions
 especially-0.064
please-0.064
 boss-0.063
 creative-0.059
 different-0.059
Token not
Token ID407
Feature activation+1.14e-02

Pos logit contributions
eway+0.026
ered+0.026
ently+0.026
aned+0.025
ides+0.025
Neg logit contributions
please-0.011
 especially-0.010
 boss-0.009
 busy-0.008
 bored-0.008
Token family
Token ID1641
Feature activation-5.94e-02

Pos logit contributions
ered+0.015
ently+0.014
eway+0.014
am+0.014
iously+0.014
Neg logit contributions
please-0.010
 especially-0.009
 responsible-0.008
Why-0.008
 bored-0.008
Token said
Token ID531
Feature activation+8.50e-02

Pos logit contributions
please+0.101
 especially+0.097
 boss+0.089
 busy+0.087
 responsible+0.087
Neg logit contributions
eway-0.120
ently-0.118
ered-0.117
aned-0.113
ides-0.112
Token "
Token ID366
Feature activation-1.53e-01

Pos logit contributions
eway+0.174
ently+0.172
ered+0.170
aned+0.168
ogo+0.166
Neg logit contributions
please-0.139
 especially-0.135
 boss-0.126
 busy-0.122
 bored-0.121
Tokenthank
Token ID40716
Feature activation-4.56e-02

Pos logit contributions
please+0.163
 especially+0.157
 bored+0.149
 boss+0.146
 busy+0.142
Neg logit contributions
eway-0.379
ered-0.378
ently-0.378
ides-0.366
anced-0.365
Token you
Token ID345
Feature activation-3.71e-02

Pos logit contributions
please+0.072
 especially+0.068
 bored+0.062
 boss+0.061
 thirsty+0.061
Neg logit contributions
ered-0.095
ently-0.094
eway-0.093
ides-0.090
heit-0.088
Token"
Token ID1
Feature activation-1.69e-01

Pos logit contributions
please+0.063
 especially+0.061
 boss+0.058
 busy+0.056
 responsible+0.056
Neg logit contributions
eway-0.074
ered-0.074
ently-0.072
aned-0.071
anced-0.071
Token before
Token ID878
Feature activation-8.06e-03

Pos logit contributions
please+0.239
 especially+0.222
 boss+0.202
 bored+0.202
 responsible+0.199
Neg logit contributions
ently-0.371
eway-0.368
ered-0.365
aned-0.351
ides-0.350
Token they
Token ID484
Feature activation-2.44e-02

Pos logit contributions
please+0.012
 especially+0.011
 boss+0.010
 busy+0.010
 bored+0.010
Neg logit contributions
eway-0.018
ered-0.018
ently-0.018
aned-0.017
ides-0.017
Token left
Token ID1364
Feature activation-1.52e-02

Pos logit contributions
please+0.041
 especially+0.038
 boss+0.035
 busy+0.035
 bored+0.035
Neg logit contributions
eway-0.050
ently-0.050
ered-0.049
aned-0.047
ides-0.047
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
please+0.023
 especially+0.021
 boss+0.019
 responsible+0.018
 bored+0.018
Neg logit contributions
ered-0.034
ently-0.033
eway-0.033
aned-0.032
anced-0.031
Token They
Token ID1119
Feature activation+3.60e-03

Pos logit contributions
ered+0.040
eway+0.040
ently+0.040
aned+0.038
ides+0.038
Neg logit contributions
please-0.028
 especially-0.027
 busy-0.025
 bored-0.024
 boss-0.024
Token He
Token ID679
Feature activation+4.20e-02

Pos logit contributions
ered+0.027
ently+0.026
ides+0.026
eway+0.026
aned+0.024
Neg logit contributions
 especially-0.030
please-0.029
 bored-0.028
 boss-0.028
 busy-0.027
Token said
Token ID531
Feature activation+8.46e-02

Pos logit contributions
eway+0.098
ently+0.097
ides+0.093
rum+0.092
ered+0.090
Neg logit contributions
 especially-0.064
please-0.063
 boss-0.056
 busy-0.055
 creative-0.053
Token "
Token ID366
Feature activation-1.52e-01

Pos logit contributions
eway+0.161
ently+0.157
ered+0.155
ides+0.155
aned+0.154
Neg logit contributions
please-0.152
 especially-0.148
 boss-0.143
 busy-0.137
 bored-0.137
Tokenthank
Token ID40716
Feature activation-4.64e-02

Pos logit contributions
please+0.145
 especially+0.137
 bored+0.125
 boss+0.125
 busy+0.122
Neg logit contributions
eway-0.400
ered-0.398
ently-0.397
aned-0.385
anced-0.384
Token you
Token ID345
Feature activation-3.82e-02

Pos logit contributions
please+0.070
 especially+0.065
 boss+0.059
 bored+0.058
 busy+0.058
Neg logit contributions
ered-0.101
eway-0.101
ently-0.101
ides-0.096
aned-0.095
Token"
Token ID1
Feature activation-1.69e-01

Pos logit contributions
please+0.058
 especially+0.056
 boss+0.054
 responsible+0.052
 busy+0.051
Neg logit contributions
ered-0.084
eway-0.082
aned-0.082
ently-0.081
anced-0.081
Token to
Token ID284
Feature activation+5.12e-03

Pos logit contributions
please+0.242
 especially+0.226
 boss+0.206
 bored+0.202
 busy+0.202
Neg logit contributions
ently-0.371
eway-0.370
ered-0.369
aned-0.354
ides-0.351
Token Tina
Token ID35903
Feature activation+2.38e-03

Pos logit contributions
eway+0.013
ently+0.013
ered+0.013
ides+0.013
aned+0.013
Neg logit contributions
please-0.006
 especially-0.005
 boss-0.005
 busy-0.005
 bored-0.005
Token and
Token ID290
Feature activation+9.73e-03

Pos logit contributions
eway+0.006
ered+0.006
ently+0.006
heit+0.005
anced+0.005
Neg logit contributions
please-0.003
 especially-0.003
 boss-0.003
 busy-0.003
 responsible-0.003
Token told
Token ID1297
Feature activation+7.00e-02

Pos logit contributions
ently+0.023
eway+0.023
ides+0.023
ered+0.023
aned+0.023
Neg logit contributions
please-0.013
 especially-0.012
 boss-0.011
 responsible-0.011
 busy-0.011
Token her
Token ID607
Feature activation+3.90e-03

Pos logit contributions
eway+0.189
ently+0.188
ered+0.185
ides+0.184
aned+0.183
Neg logit contributions
please-0.067
 especially-0.064
 boss-0.057
 busy-0.053
 responsible-0.053
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
eway+0.053
ered+0.053
ently+0.052
aned+0.050
ides+0.050
Neg logit contributions
please-0.038
 especially-0.035
 boss-0.032
 busy-0.032
 bored-0.032
Token He
Token ID679
Feature activation+4.32e-02

Pos logit contributions
ered+0.040
ently+0.039
eway+0.038
ides+0.038
aned+0.037
Neg logit contributions
please-0.030
 especially-0.029
 boss-0.026
 busy-0.026
 bored-0.026
Token wanted
Token ID2227
Feature activation-8.48e-03

Pos logit contributions
eway+0.087
ently+0.087
ered+0.085
ides+0.082
aned+0.082
Neg logit contributions
please-0.074
 especially-0.070
 boss-0.065
 busy-0.063
 bored-0.062
Token to
Token ID284
Feature activation+1.90e-02

Pos logit contributions
please+0.013
 especially+0.011
 responsible+0.011
 bored+0.010
 chefs+0.010
Neg logit contributions
ered-0.019
eway-0.018
ently-0.017
ogo-0.017
am-0.017
Token say
Token ID910
Feature activation+8.00e-02

Pos logit contributions
ered+0.041
eway+0.039
ently+0.038
rum+0.037
aned+0.037
Neg logit contributions
please-0.031
 especially-0.028
 bored-0.026
 busy-0.026
 chefs-0.026
Token '
Token ID705
Feature activation-1.68e-01

Pos logit contributions
ently+0.127
eway+0.125
aned+0.124
ered+0.122
ides+0.121
Neg logit contributions
please-0.167
 especially-0.163
 boss-0.159
 bored-0.152
 busy-0.152
Tokenhello
Token ID31373
Feature activation+9.50e-03

Pos logit contributions
 especially+0.221
 chefs+0.218
 bored+0.213
 responsible+0.212
please+0.204
Neg logit contributions
ered-0.363
eway-0.357
ently-0.347
ides-0.346
anced-0.343
Token',
Token ID3256
Feature activation-2.15e-02

Pos logit contributions
eway+0.015
ered+0.015
ently+0.015
aned+0.014
ides+0.014
Neg logit contributions
please-0.020
 especially-0.019
 boss-0.018
 busy-0.017
 bored-0.017
Token so
Token ID523
Feature activation-1.25e-02

Pos logit contributions
 especially+0.024
please+0.023
 boss+0.021
 busy+0.019
 bored+0.019
Neg logit contributions
eway-0.056
ently-0.055
ogo-0.055
ides-0.055
aned-0.054
Token he
Token ID339
Feature activation+1.66e-02

Pos logit contributions
 especially+0.020
 boss+0.020
please+0.020
 busy+0.019
 bored+0.019
Neg logit contributions
ides-0.026
eway-0.025
ently-0.025
ogo-0.025
aned-0.024
Token waved
Token ID26834
Feature activation+4.33e-02

Pos logit contributions
ently+0.034
eway+0.034
rum+0.033
ides+0.032
ered+0.032
Neg logit contributions
please-0.029
 especially-0.028
 boss-0.026
 busy-0.024
 creative-0.023
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
ered+0.071
eway+0.071
ently+0.070
aned+0.068
heit+0.068
Neg logit contributions
please-0.048
 especially-0.046
 boss-0.041
 busy-0.041
 bored-0.040
Token He
Token ID679
Feature activation+4.08e-02

Pos logit contributions
ered+0.037
ently+0.036
aned+0.036
eway+0.035
ogo+0.035
Neg logit contributions
 especially-0.030
please-0.030
 busy-0.026
Why-0.026
 boss-0.026
Token would
Token ID561
Feature activation-7.80e-03

Pos logit contributions
eway+0.088
ently+0.087
ered+0.087
aned+0.083
ides+0.083
Neg logit contributions
please-0.064
 especially-0.060
 boss-0.055
 busy-0.053
 bored-0.053
Token always
Token ID1464
Feature activation-8.15e-04

Pos logit contributions
please+0.011
 especially+0.010
 boss+0.009
 busy+0.009
 bored+0.009
Neg logit contributions
eway-0.018
ently-0.018
ered-0.018
aned-0.017
ides-0.017
Token say
Token ID910
Feature activation+8.41e-02

Pos logit contributions
please+0.001
 especially+0.001
 boss+0.001
 busy+0.001
 bored+0.001
Neg logit contributions
eway-0.002
ered-0.002
ently-0.002
aned-0.002
ides-0.002
Token '
Token ID705
Feature activation-1.67e-01

Pos logit contributions
eway+0.170
ered+0.168
ently+0.168
aned+0.163
ides+0.160
Neg logit contributions
please-0.141
 especially-0.134
 boss-0.125
 busy-0.122
 bored-0.121
Tokenthank
Token ID40716
Feature activation-4.67e-02

Pos logit contributions
 especially+0.220
 bored+0.219
 thirsty+0.210
 chefs+0.210
 boss+0.208
Neg logit contributions
ered-0.357
eway-0.346
ently-0.346
ides-0.345
ield-0.336
Token-
Token ID12
Feature activation-1.08e-01

Pos logit contributions
please+0.066
 especially+0.061
 boss+0.056
 busy+0.054
 responsible+0.054
Neg logit contributions
eway-0.106
ered-0.106
ently-0.106
aned-0.102
anced-0.101
Tokenyou
Token ID5832
Feature activation-4.74e-03

Pos logit contributions
please+0.223
 especially+0.206
 boss+0.195
 responsible+0.193
 busy+0.191
Neg logit contributions
eway-0.181
ered-0.180
ently-0.180
aned-0.173
ides-0.171
Token'
Token ID6
Feature activation-9.27e-02

Pos logit contributions
please+0.009
 especially+0.009
 boss+0.008
 responsible+0.008
 busy+0.008
Neg logit contributions
eway-0.008
aned-0.008
ogo-0.008
ered-0.008
ently-0.008
Token to
Token ID284
Feature activation+8.61e-03

Pos logit contributions
please+0.119
 especially+0.111
 boss+0.102
 responsible+0.099
 busy+0.098
Neg logit contributions
eway-0.229
ered-0.222
ogo-0.221
anced-0.217
aned-0.217
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
eway+0.081
ered+0.081
heit+0.080
ently+0.079
anced+0.079
Neg logit contributions
 especially-0.044
please-0.043
 busy-0.040
 boss-0.038
 responsible-0.038
Token The
Token ID383
Feature activation+2.13e-02

Pos logit contributions
ently+0.025
ered+0.025
heit+0.024
aned+0.024
ides+0.024
Neg logit contributions
 especially-0.024
please-0.023
I-0.023
 bored-0.022
 boss-0.022
Token next
Token ID1306
Feature activation-2.48e-02

Pos logit contributions
ered+0.042
ently+0.041
eway+0.040
ides+0.039
ield+0.039
Neg logit contributions
please-0.037
 especially-0.033
 responsible-0.029
 chefs-0.029
 bored-0.028
Token task
Token ID4876
Feature activation+6.88e-03

Pos logit contributions
please+0.042
 especially+0.039
 bored+0.036
 boss+0.035
 thirsty+0.035
Neg logit contributions
eway-0.049
ered-0.049
ently-0.049
ides-0.046
am-0.046
Token said
Token ID531
Feature activation+8.70e-02

Pos logit contributions
heit+0.015
ently+0.015
eway+0.015
aned+0.015
ogo+0.015
Neg logit contributions
 especially-0.010
 boss-0.009
please-0.009
 busy-0.009
 tidy-0.008
Token '
Token ID705
Feature activation-1.67e-01

Pos logit contributions
aned+0.180
eway+0.179
ently+0.179
ides+0.176
ered+0.176
Neg logit contributions
please-0.135
 especially-0.135
 boss-0.127
 responsible-0.123
 creative-0.123
Tokenclean
Token ID27773
Feature activation-1.17e-02

Pos logit contributions
 especially+0.194
please+0.191
 bored+0.182
 boss+0.181
 chefs+0.175
Neg logit contributions
ered-0.394
eway-0.386
ently-0.384
ides-0.377
anced-0.375
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
please+0.020
 especially+0.020
 boss+0.018
 responsible+0.018
 busy+0.018
Neg logit contributions
eway-0.022
ogo-0.022
ered-0.022
aned-0.021
ently-0.021
Token floor
Token ID4314
Feature activation+1.36e-02

Pos logit contributions
please+0.024
 especially+0.022
 bored+0.019
 thirsty+0.019
Why+0.019
Neg logit contributions
ently-0.021
eway-0.021
ered-0.021
ides-0.020
ield-0.020
Token'.
Token ID4458
Feature activation+1.18e-02

Pos logit contributions
eway+0.028
ogo+0.028
ered+0.028
ently+0.027
aned+0.027
Neg logit contributions
 especially-0.021
please-0.021
 boss-0.019
 busy-0.019
 responsible-0.019
Token He
Token ID679
Feature activation+4.06e-02

Pos logit contributions
ered+0.022
ently+0.022
aned+0.022
ogo+0.022
eway+0.021
Neg logit contributions
 especially-0.020
please-0.020
I-0.019
 boss-0.019
Why-0.019
Tokenilly
Token ID6548
Feature activation-1.69e-02

Pos logit contributions
please+0.003
 especially+0.003
 boss+0.003
 busy+0.003
 responsible+0.003
Neg logit contributions
eway-0.004
ered-0.004
aned-0.004
heit-0.004
rum-0.004
Token was
Token ID373
Feature activation-5.64e-04

Pos logit contributions
please+0.026
 especially+0.025
 boss+0.023
 busy+0.022
 bored+0.022
Neg logit contributions
eway-0.037
ently-0.037
ered-0.036
ides-0.035
aned-0.035
Token known
Token ID1900
Feature activation-3.89e-02

Pos logit contributions
please+0.001
 chefs+0.001
 especially+0.001
nam+0.001
 boss+0.001
Neg logit contributions
ered-0.001
zed-0.001
eway-0.001
ently-0.001
aned-0.001
Token as
Token ID355
Feature activation-7.26e-02

Pos logit contributions
please+0.063
 especially+0.060
 boss+0.056
 busy+0.055
 bored+0.054
Neg logit contributions
eway-0.079
ently-0.079
ered-0.079
aned-0.076
ides-0.076
Token the
Token ID262
Feature activation-3.94e-03

Pos logit contributions
please+0.104
 especially+0.095
 boss+0.085
 responsible+0.084
 bored+0.083
Neg logit contributions
ered-0.163
eway-0.162
ently-0.161
anced-0.152
ogo-0.151
Token '
Token ID705
Feature activation-1.67e-01

Pos logit contributions
please+0.006
 especially+0.005
 responsible+0.005
 boss+0.005
 chefs+0.005
Neg logit contributions
eway-0.009
ered-0.009
ently-0.008
ogo-0.008
aned-0.008
Tokens
Token ID82
Feature activation+1.67e-02

Pos logit contributions
 especially+0.226
please+0.225
 chefs+0.211
 responsible+0.207
 boss+0.207
Neg logit contributions
ered-0.363
eway-0.355
ently-0.354
anced-0.345
aned-0.343
Tokenilly
Token ID6548
Feature activation-1.71e-02

Pos logit contributions
ently+0.039
eway+0.038
ered+0.038
aned+0.037
ides+0.037
Neg logit contributions
please-0.023
 especially-0.022
 boss-0.020
 busy-0.020
 bored-0.019
Token re
Token ID302
Feature activation-1.03e-02

Pos logit contributions
please+0.028
 especially+0.027
 boss+0.025
 busy+0.024
 bored+0.024
Neg logit contributions
eway-0.036
ered-0.035
ently-0.035
aned-0.035
heit-0.034
Tokenind
Token ID521
Feature activation+6.07e-04

Pos logit contributions
please+0.016
 especially+0.014
 boss+0.013
 responsible+0.012
 busy+0.012
Neg logit contributions
eway-0.023
ently-0.023
ered-0.022
anced-0.022
ides-0.022
Tokeneer
Token ID28153
Feature activation+6.96e-03

Pos logit contributions
eway+0.001
ered+0.001
ently+0.001
aned+0.001
heit+0.001
Neg logit contributions
please-0.001
 especially-0.001
 boss-0.001
 bored-0.001
 busy-0.001
Token child
Token ID1200
Feature activation-1.03e-02

Pos logit contributions
please+0.058
 especially+0.050
 responsible+0.043
 bored+0.040
 busy+0.040
Neg logit contributions
ered-0.121
eway-0.121
ently-0.119
anced-0.118
am-0.116
Token said
Token ID531
Feature activation+8.61e-02

Pos logit contributions
please+0.016
 especially+0.016
 boss+0.014
 busy+0.014
 bored+0.014
Neg logit contributions
eway-0.023
ently-0.022
rum-0.022
ered-0.021
ides-0.021
Token "
Token ID366
Feature activation-1.52e-01

Pos logit contributions
eway+0.155
ently+0.155
ered+0.154
aned+0.150
ides+0.149
Neg logit contributions
please-0.161
 especially-0.154
 boss-0.146
 busy-0.142
 responsible-0.141
TokenThank
Token ID10449
Feature activation-2.87e-02

Pos logit contributions
please+0.154
 especially+0.147
 bored+0.139
 boss+0.136
 busy+0.135
Neg logit contributions
eway-0.386
ered-0.382
ently-0.382
anced-0.372
aned-0.370
Token you
Token ID345
Feature activation-4.51e-02

Pos logit contributions
please+0.038
 especially+0.035
 boss+0.032
 busy+0.031
 responsible+0.030
Neg logit contributions
eway-0.068
ered-0.068
ently-0.067
aned-0.066
anced-0.065
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
 especially+0.068
please+0.065
 boss+0.062
 busy+0.062
 responsible+0.060
Neg logit contributions
ered-0.098
eway-0.097
aned-0.096
ently-0.095
anced-0.094
Token to
Token ID284
Feature activation+1.19e-02

Pos logit contributions
please+0.255
 especially+0.239
 thirsty+0.223
 responsible+0.222
 chefs+0.222
Neg logit contributions
ently-0.343
eway-0.337
ered-0.337
anced-0.322
aned-0.321
Token John
Token ID1757
Feature activation+7.96e-03

Pos logit contributions
ently+0.029
eway+0.028
ides+0.028
aned+0.028
am+0.027
Neg logit contributions
 especially-0.015
please-0.015
 boss-0.013
 tidy-0.013
 busy-0.013
Token and
Token ID290
Feature activation+7.30e-03

Pos logit contributions
eway+0.019
ently+0.018
ered+0.018
heit+0.018
aned+0.018
Neg logit contributions
please-0.011
 especially-0.011
 boss-0.010
 busy-0.009
 bored-0.009
Token ran
Token ID4966
Feature activation-2.06e-03

Pos logit contributions
ently+0.014
ides+0.013
eway+0.013
aned+0.013
ered+0.013
Neg logit contributions
please-0.014
 especially-0.013
 boss-0.012
 busy-0.012
 responsible-0.012
Token home
Token ID1363
Feature activation+4.64e-02

Pos logit contributions
please+0.003
 especially+0.003
 boss+0.003
 busy+0.003
 bored+0.003
Neg logit contributions
ently-0.004
eway-0.004
ered-0.004
ides-0.004
aned-0.004
Token all
Token ID477
Feature activation+8.03e-03

Pos logit contributions
ered+0.008
eway+0.008
ently+0.008
anced+0.008
am+0.008
Neg logit contributions
please-0.008
 especially-0.007
 responsible-0.006
 boss-0.006
 busy-0.006
Token said
Token ID531
Feature activation+8.24e-02

Pos logit contributions
ered+0.017
eway+0.017
ently+0.017
aned+0.016
ides+0.016
Neg logit contributions
please-0.013
 especially-0.012
 boss-0.011
 busy-0.011
 bored-0.011
Token "
Token ID366
Feature activation-1.55e-01

Pos logit contributions
eway+0.168
ently+0.167
ered+0.166
aned+0.162
ides+0.160
Neg logit contributions
please-0.137
 especially-0.131
 boss-0.123
 bored-0.120
 busy-0.120
Tokenthank
Token ID40716
Feature activation-4.13e-02

Pos logit contributions
 especially+0.171
 bored+0.168
 thirsty+0.161
 boss+0.160
please+0.160
Neg logit contributions
eway-0.366
ered-0.365
ently-0.362
ides-0.361
anced-0.358
Token you
Token ID345
Feature activation-3.93e-02

Pos logit contributions
please+0.077
 especially+0.073
 bored+0.066
 thirsty+0.066
 responsible+0.065
Neg logit contributions
ered-0.074
eway-0.072
ently-0.072
ield-0.071
ides-0.069
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.070
 especially+0.066
 boss+0.063
 busy+0.062
 responsible+0.060
Neg logit contributions
ered-0.077
eway-0.076
ently-0.075
aned-0.073
anced-0.072
Token to
Token ID284
Feature activation+7.02e-03

Pos logit contributions
please+0.233
 especially+0.224
 thirsty+0.204
 bored+0.200
 responsible+0.199
Neg logit contributions
ently-0.363
eway-0.353
ered-0.352
am-0.343
anced-0.340
Token Lily
Token ID20037
Feature activation-5.00e-02

Pos logit contributions
ered+0.016
ently+0.015
eway+0.015
anced+0.015
aned+0.015
Neg logit contributions
please-0.010
 especially-0.009
 responsible-0.008
 bored-0.008
 chefs-0.008
Token and
Token ID290
Feature activation-4.13e-03

Pos logit contributions
please+0.070
 especially+0.066
 boss+0.062
 busy+0.060
 bored+0.060
Neg logit contributions
eway-0.116
ered-0.115
ently-0.113
heit-0.110
anced-0.109
Token her
Token ID607
Feature activation+2.00e-02

Pos logit contributions
please+0.006
 especially+0.006
 responsible+0.005
 busy+0.005
 boss+0.005
Neg logit contributions
ered-0.009
anced-0.009
 replied-0.008
eway-0.008
heit-0.008
Token friends
Token ID2460
Feature activation-6.68e-02

Pos logit contributions
ered+0.046
eway+0.046
ently+0.046
am+0.044
anced+0.044
Neg logit contributions
please-0.028
 especially-0.025
 responsible-0.022
Why-0.022
 boss-0.021
Token It
Token ID632
Feature activation+2.92e-02

Pos logit contributions
ered+0.043
eway+0.042
ently+0.042
ides+0.041
aned+0.041
Neg logit contributions
please-0.030
 especially-0.029
 bored-0.027
 busy-0.027
 boss-0.026
Token said
Token ID531
Feature activation+9.02e-02

Pos logit contributions
eway+0.058
ered+0.058
ently+0.058
aned+0.055
ides+0.055
Neg logit contributions
please-0.050
 especially-0.047
 boss-0.044
 busy-0.043
 bored-0.043
Token "
Token ID366
Feature activation-1.52e-01

Pos logit contributions
eway+0.178
ered+0.174
ently+0.173
aned+0.171
ides+0.169
Neg logit contributions
please-0.156
 especially-0.149
 boss-0.142
 busy-0.137
 bored-0.136
Tokenthank
Token ID40716
Feature activation-4.08e-02

Pos logit contributions
 bored+0.153
 especially+0.151
please+0.146
 chefs+0.145
 thirsty+0.145
Neg logit contributions
eway-0.372
ered-0.370
ently-0.368
ides-0.363
anced-0.361
Token you
Token ID345
Feature activation-3.80e-02

Pos logit contributions
please+0.069
 especially+0.066
 bored+0.061
 thirsty+0.060
 responsible+0.059
Neg logit contributions
ered-0.079
ently-0.079
eway-0.078
ield-0.075
ides-0.074
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.068
 especially+0.064
 boss+0.061
 busy+0.059
 responsible+0.058
Neg logit contributions
ered-0.074
eway-0.073
ently-0.072
aned-0.071
anced-0.070
Token by
Token ID416
Feature activation+6.10e-02

Pos logit contributions
please+0.227
 especially+0.219
 chefs+0.201
 thirsty+0.198
 bored+0.196
Neg logit contributions
ently-0.365
ered-0.358
eway-0.356
aned-0.343
am-0.342
Token ticking
Token ID40212
Feature activation+1.66e-02

Pos logit contributions
eway+0.146
ently+0.145
ides+0.142
ered+0.142
ogo+0.141
Neg logit contributions
please-0.077
 especially-0.074
 boss-0.070
 busy-0.068
 responsible-0.065
Token louder
Token ID27089
Feature activation+2.45e-03

Pos logit contributions
eway+0.039
ently+0.039
ered+0.039
ides+0.038
ogo+0.038
Neg logit contributions
please-0.022
 especially-0.020
 boss-0.019
 busy-0.018
 responsible-0.018
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
ered+0.006
eway+0.006
ently+0.006
anced+0.006
ogo+0.006
Neg logit contributions
please-0.003
 especially-0.003
 boss-0.003
 busy-0.003
 responsible-0.003
Token The
Token ID383
Feature activation+1.84e-02

Pos logit contributions
ered+0.037
eway+0.036
ogo+0.036
ently+0.036
ides+0.036
Neg logit contributions
please-0.026
 especially-0.025
 busy-0.023
 boss-0.023
 bored-0.022
Token also
Token ID635
Feature activation+2.28e-02

Pos logit contributions
eway+0.094
ently+0.092
ered+0.089
ides+0.089
rum+0.088
Neg logit contributions
please-0.064
 especially-0.062
 boss-0.055
 busy-0.053
 bored-0.052
Token made
Token ID925
Feature activation-2.40e-02

Pos logit contributions
ered+0.049
eway+0.048
ently+0.048
aned+0.046
anced+0.046
Neg logit contributions
please-0.035
 especially-0.032
 boss-0.030
 busy-0.029
 bored-0.029
Token the
Token ID262
Feature activation-4.18e-04

Pos logit contributions
please+0.034
 especially+0.032
 boss+0.029
 busy+0.028
 bored+0.028
Neg logit contributions
eway-0.054
ered-0.054
ently-0.054
aned-0.052
ides-0.052
Token robot
Token ID9379
Feature activation+5.57e-03

Pos logit contributions
please+0.001
 especially+0.001
 thirsty+0.001
 chefs+0.001
 bored+0.001
Neg logit contributions
ently-0.001
eway-0.001
ered-0.001
ield-0.001
acle-0.001
Token say
Token ID910
Feature activation+8.21e-02

Pos logit contributions
eway+0.013
ently+0.013
ered+0.013
aned+0.013
ides+0.013
Neg logit contributions
please-0.007
 especially-0.007
 boss-0.006
 busy-0.006
 responsible-0.006
Token '
Token ID705
Feature activation-1.67e-01

Pos logit contributions
eway+0.148
ered+0.145
ently+0.143
aned+0.143
ogo+0.142
Neg logit contributions
please-0.153
 especially-0.147
 boss-0.145
 responsible-0.139
 busy-0.137
Tokenthank
Token ID40716
Feature activation-4.54e-02

Pos logit contributions
 especially+0.213
please+0.203
 bored+0.201
 chefs+0.200
 thirsty+0.196
Neg logit contributions
ered-0.368
ently-0.360
ides-0.356
eway-0.356
ield-0.350
Token you
Token ID345
Feature activation-3.75e-02

Pos logit contributions
please+0.066
 especially+0.061
 boss+0.056
 busy+0.055
 bored+0.055
Neg logit contributions
eway-0.101
ered-0.101
ently-0.101
aned-0.097
ides-0.096
Token'
Token ID6
Feature activation-9.38e-02

Pos logit contributions
please+0.068
 especially+0.065
 boss+0.062
 responsible+0.061
 busy+0.060
Neg logit contributions
ered-0.070
eway-0.070
aned-0.068
anced-0.067
ently-0.067
Token after
Token ID706
Feature activation+2.53e-02

Pos logit contributions
please+0.122
 especially+0.111
 boss+0.105
 responsible+0.103
 busy+0.100
Neg logit contributions
eway-0.233
ered-0.227
ogo-0.222
heit-0.220
ently-0.220
Token it
Token ID340
Feature activation+1.32e-02

Pos logit contributions
eway+0.062
ogo+0.060
anced+0.060
aned+0.060
ered+0.060
Neg logit contributions
please-0.031
 especially-0.030
 boss-0.027
 busy-0.027
 responsible-0.026
Token he
Token ID339
Feature activation+1.10e-02

Pos logit contributions
please+0.002
 especially+0.002
 busy+0.002
 boss+0.002
 chefs+0.002
Neg logit contributions
ered-0.004
heit-0.004
ently-0.004
aned-0.004
eway-0.004
Token said
Token ID531
Feature activation+8.44e-02

Pos logit contributions
eway+0.023
ently+0.022
ered+0.022
ides+0.021
aned+0.021
Neg logit contributions
please-0.019
 especially-0.018
 boss-0.016
 busy-0.016
 bored-0.016
Token "
Token ID366
Feature activation-1.54e-01

Pos logit contributions
eway+0.127
ently+0.125
ered+0.123
aned+0.123
ides+0.121
Neg logit contributions
please-0.186
 especially-0.180
 boss-0.174
 responsible-0.169
 busy-0.169
Tokenthank
Token ID40716
Feature activation-4.73e-02

Pos logit contributions
please+0.126
 especially+0.124
 bored+0.123
 boss+0.118
 busy+0.117
Neg logit contributions
ered-0.407
ently-0.407
eway-0.407
ides-0.397
anced-0.396
Token you
Token ID345
Feature activation-4.25e-02

Pos logit contributions
please+0.071
 especially+0.066
 boss+0.060
 bored+0.060
 busy+0.059
Neg logit contributions
eway-0.103
ered-0.103
ently-0.102
ides-0.097
aned-0.097
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.072
 especially+0.068
 boss+0.065
 busy+0.062
 responsible+0.061
Neg logit contributions
ered-0.089
eway-0.086
ently-0.086
aned-0.085
anced-0.085
Token to
Token ID284
Feature activation+6.67e-03

Pos logit contributions
please+0.235
 especially+0.219
 boss+0.200
 bored+0.197
 busy+0.196
Neg logit contributions
eway-0.370
ently-0.370
ered-0.368
aned-0.354
ides-0.352
Token his
Token ID465
Feature activation+2.59e-02

Pos logit contributions
eway+0.015
ently+0.015
ered+0.015
ides+0.015
aned+0.014
Neg logit contributions
please-0.010
 especially-0.009
 boss-0.008
 busy-0.008
 bored-0.008
Token mom
Token ID1995
Feature activation-8.41e-02

Pos logit contributions
ered+0.048
eway+0.047
ently+0.047
am+0.046
rum+0.045
Neg logit contributions
please-0.046
 especially-0.042
Why-0.039
 responsible-0.038
 bored-0.037
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
please+0.115
 especially+0.111
 boss+0.099
 busy+0.097
 bored+0.095
Neg logit contributions
eway-0.202
ered-0.197
ently-0.196
heit-0.193
aned-0.192
Token
Token ID198
Feature activation-3.04e-02

Pos logit contributions
ered+0.029
eway+0.027
ently+0.027
ides+0.027
heit+0.027
Neg logit contributions
please-0.023
 especially-0.022
 busy-0.020
 bored-0.020
I-0.020
Token prince
Token ID19716
Feature activation-2.19e-02

Pos logit contributions
ered+0.055
ently+0.055
eway+0.055
ides+0.053
aned+0.052
Neg logit contributions
please-0.038
 especially-0.035
 boss-0.031
 responsible-0.031
 busy-0.030
Token said
Token ID531
Feature activation+8.66e-02

Pos logit contributions
 especially+0.032
please+0.032
 boss+0.029
 busy+0.027
 creative+0.027
Neg logit contributions
eway-0.050
ently-0.049
ides-0.048
ogo-0.047
rum-0.047
Token "
Token ID366
Feature activation-1.55e-01

Pos logit contributions
eway+0.186
ently+0.180
ides+0.179
aned+0.179
ered+0.176
Neg logit contributions
please-0.138
 especially-0.132
 boss-0.130
 busy-0.122
 responsible-0.121
Tokenthank
Token ID40716
Feature activation-4.43e-02

Pos logit contributions
please+0.158
 especially+0.148
 bored+0.134
 boss+0.134
 busy+0.131
Neg logit contributions
ered-0.400
eway-0.399
ently-0.397
aned-0.385
ides-0.383
Token you
Token ID345
Feature activation-4.24e-02

Pos logit contributions
please+0.067
 especially+0.062
 boss+0.057
 bored+0.056
 busy+0.056
Neg logit contributions
ered-0.096
eway-0.096
ently-0.095
ides-0.091
aned-0.090
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.064
 boss+0.061
 especially+0.061
 responsible+0.058
 busy+0.057
Neg logit contributions
ered-0.092
eway-0.091
anced-0.090
aned-0.090
ently-0.089
Token over
Token ID625
Feature activation-3.98e-02

Pos logit contributions
please+0.235
 especially+0.220
 boss+0.200
 bored+0.196
 responsible+0.196
Neg logit contributions
ently-0.371
eway-0.369
ered-0.369
aned-0.354
ides-0.351
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
please+0.053
 especially+0.048
 boss+0.044
 busy+0.043
 bored+0.042
Neg logit contributions
eway-0.095
ered-0.095
ently-0.095
aned-0.091
ides-0.091
Token over
Token ID625
Feature activation-2.77e-02

Pos logit contributions
eway+0.027
ently+0.026
ides+0.025
ogo+0.025
rum+0.024
Neg logit contributions
please-0.017
 especially-0.016
 boss-0.015
 bored-0.015
 busy-0.015
Token again
Token ID757
Feature activation-2.08e-02

Pos logit contributions
please+0.046
 especially+0.042
 boss+0.039
 responsible+0.038
 bored+0.038
Neg logit contributions
ered-0.057
ently-0.057
eway-0.057
rum-0.053
anced-0.053
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
please+0.035
 especially+0.032
 boss+0.030
 bored+0.029
 responsible+0.029
Neg logit contributions
ered-0.042
eway-0.042
ently-0.042
ides-0.040
aned-0.040
Token bird
Token ID6512
Feature activation-2.23e-02

Pos logit contributions
ered+0.054
ently+0.053
am+0.052
eway+0.051
ogo+0.050
Neg logit contributions
please-0.036
 especially-0.032
 responsible-0.030
 chefs-0.030
nam-0.029
Token said
Token ID531
Feature activation+8.62e-02

Pos logit contributions
please+0.038
 especially+0.035
 boss+0.033
 busy+0.032
 responsible+0.032
Neg logit contributions
ered-0.044
ently-0.044
eway-0.044
aned-0.042
anced-0.041
Token "
Token ID366
Feature activation-1.57e-01

Pos logit contributions
eway+0.177
ered+0.170
ently+0.170
aned+0.170
ogo+0.169
Neg logit contributions
please-0.143
 especially-0.137
 boss-0.131
 busy-0.126
 creative-0.125
Tokenthank
Token ID40716
Feature activation-4.34e-02

Pos logit contributions
 bored+0.167
 chefs+0.161
 thirsty+0.161
 busy+0.155
 boss+0.150
Neg logit contributions
eway-0.371
ently-0.371
ered-0.367
anced-0.366
ides-0.363
Token you
Token ID345
Feature activation-4.19e-02

Pos logit contributions
please+0.082
 especially+0.079
 responsible+0.072
 thirsty+0.072
 bored+0.072
Neg logit contributions
ered-0.074
ently-0.072
eway-0.071
ield-0.070
am-0.068
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.072
 especially+0.068
 boss+0.065
 busy+0.063
 responsible+0.062
Neg logit contributions
ered-0.084
eway-0.083
ently-0.082
aned-0.081
anced-0.080
Token to
Token ID284
Feature activation+5.28e-03

Pos logit contributions
please+0.224
 especially+0.215
 bored+0.196
 thirsty+0.196
 chefs+0.193
Neg logit contributions
ently-0.362
eway-0.358
ered-0.356
am-0.347
ides-0.343
Token its
Token ID663
Feature activation+5.02e-02

Pos logit contributions
eway+0.014
ered+0.014
ently+0.014
aned+0.013
ides+0.013
Neg logit contributions
please-0.006
 especially-0.005
 boss-0.005
 busy-0.005
 bored-0.005
Token friends
Token ID2460
Feature activation-6.96e-02

Pos logit contributions
ered+0.105
eway+0.102
ently+0.101
am+0.101
ogo+0.097
Neg logit contributions
please-0.076
 especially-0.071
Why-0.067
I-0.064
 responsible-0.063
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
please+0.098
 especially+0.095
 boss+0.085
 busy+0.084
 bored+0.084
Neg logit contributions
eway-0.161
ered-0.159
ently-0.157
aned-0.153
heit-0.153
Token They
Token ID1119
Feature activation+1.47e-03

Pos logit contributions
ered+0.049
eway+0.048
heit+0.048
ogo+0.047
ides+0.046
Neg logit contributions
please-0.034
 especially-0.033
I-0.031
 bored-0.029
 boss-0.029
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 especially+0.000
please+0.000
 boss+0.000
 bored+0.000
 busy+0.000
Neg logit contributions
eway-0.001
ered-0.001
heit-0.001
ogo-0.001
aned-0.001
Token said
Token ID531
Feature activation+8.60e-02

Pos logit contributions
ered+0.036
ently+0.035
eway+0.035
aned+0.034
am+0.033
Neg logit contributions
please-0.028
 especially-0.025
 boss-0.023
 busy-0.023
 chefs-0.022
Token "
Token ID366
Feature activation-1.50e-01

Pos logit contributions
ered+0.165
ently+0.164
eway+0.163
aned+0.155
ides+0.153
Neg logit contributions
please-0.156
 especially-0.146
 boss-0.134
 busy-0.132
 bored-0.131
Tokenthank
Token ID40716
Feature activation-4.39e-02

Pos logit contributions
 bored+0.159
 thirsty+0.156
 chefs+0.149
 busy+0.146
 creative+0.145
Neg logit contributions
ently-0.364
ered-0.353
aned-0.350
eway-0.341
ulously-0.340
Token you
Token ID345
Feature activation-4.00e-02

Pos logit contributions
please+0.085
 especially+0.083
 thirsty+0.077
 bored+0.077
 restless+0.073
Neg logit contributions
ered-0.074
ently-0.073
ield-0.072
eway-0.067
heit-0.066
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.069
 especially+0.065
 boss+0.061
 busy+0.060
 responsible+0.060
Neg logit contributions
eway-0.080
ered-0.079
ently-0.078
aned-0.075
anced-0.075
Token to
Token ID284
Feature activation+9.59e-03

Pos logit contributions
please+0.242
 especially+0.232
 chefs+0.221
 thirsty+0.220
 creative+0.212
Neg logit contributions
ently-0.345
ered-0.326
aned-0.322
eway-0.321
am-0.314
Token her
Token ID607
Feature activation+6.91e-03

Pos logit contributions
ered+0.024
ently+0.023
eway+0.023
aned+0.022
rum+0.022
Neg logit contributions
please-0.012
 especially-0.010
 boss-0.009
 chefs-0.009
 bored-0.009
Token mom
Token ID1995
Feature activation-8.45e-02

Pos logit contributions
ered+0.011
ently+0.011
ided+0.011
am+0.010
rum+0.010
Neg logit contributions
please-0.014
 especially-0.012
Why-0.012
 responsible-0.011
P-0.011
Tokenmy
Token ID1820
Feature activation-5.62e-02

Pos logit contributions
please+0.139
 especially+0.126
 chefs+0.116
 boss+0.116
 busy+0.115
Neg logit contributions
ered-0.170
ently-0.169
eway-0.162
rum-0.160
aned-0.160
Token for
Token ID329
Feature activation-2.72e-02

Pos logit contributions
please+0.091
 especially+0.085
 boss+0.078
 busy+0.077
 bored+0.076
Neg logit contributions
ered-0.116
eway-0.116
ently-0.116
aned-0.111
ides-0.110
Token and
Token ID290
Feature activation+2.06e-02

Pos logit contributions
please+0.006
 especially+0.005
 boss+0.005
 busy+0.005
 bored+0.005
Neg logit contributions
eway-0.009
ered-0.009
ently-0.009
aned-0.008
ides-0.008
Token said
Token ID531
Feature activation+8.84e-02

Pos logit contributions
eway+0.053
ently+0.053
ered+0.051
ides+0.051
aned+0.050
Neg logit contributions
please-0.024
 especially-0.023
 boss-0.021
 busy-0.020
 bored-0.020
Token "
Token ID366
Feature activation-1.50e-01

Pos logit contributions
eway+0.173
ently+0.171
ered+0.170
aned+0.168
ides+0.166
Neg logit contributions
please-0.153
 especially-0.145
 boss-0.139
 busy-0.134
 responsible-0.133
Tokenthank
Token ID40716
Feature activation-4.33e-02

Pos logit contributions
 especially+0.136
please+0.133
 bored+0.133
 chefs+0.128
 busy+0.127
Neg logit contributions
eway-0.383
ered-0.381
ently-0.379
anced-0.372
ides-0.371
Token you
Token ID345
Feature activation-3.83e-02

Pos logit contributions
please+0.069
 especially+0.064
 bored+0.059
 boss+0.058
 responsible+0.058
Neg logit contributions
ered-0.090
eway-0.090
ently-0.089
ides-0.085
rum-0.084
Token"
Token ID1
Feature activation-1.67e-01

Pos logit contributions
please+0.066
 especially+0.063
 boss+0.060
 busy+0.058
 responsible+0.057
Neg logit contributions
ered-0.077
eway-0.076
ently-0.074
aned-0.074
anced-0.073
Token to
Token ID284
Feature activation+7.90e-03

Pos logit contributions
please+0.227
 especially+0.215
 boss+0.194
 responsible+0.194
 thirsty+0.194
Neg logit contributions
ently-0.366
eway-0.362
ered-0.360
anced-0.347
ides-0.346
Token its
Token ID663
Feature activation+5.66e-02

Pos logit contributions
eway+0.020
ently+0.020
ered+0.020
ides+0.020
aned+0.019
Neg logit contributions
please-0.009
 especially-0.009
 boss-0.008
 busy-0.007
 bored-0.007
Token friends
Token ID2460
Feature activation-7.21e-02

Pos logit contributions
ered+0.116
eway+0.116
ently+0.113
ogo+0.108
rum+0.108
Neg logit contributions
please-0.091
 especially-0.081
Why-0.075
 bored-0.071
 responsible-0.071
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
please+0.101
 especially+0.099
 boss+0.088
 busy+0.087
 bored+0.086
Neg logit contributions
eway-0.167
ered-0.165
ently-0.162
aned-0.159
heit-0.158
Token The
Token ID383
Feature activation+2.35e-02

Pos logit contributions
ered+0.035
ently+0.034
eway+0.034
ides+0.034
heit+0.033
Neg logit contributions
 especially-0.024
please-0.024
 boss-0.022
I-0.022
 bored-0.022
Token other
Token ID584
Feature activation-5.30e-03

Pos logit contributions
please+0.055
 especially+0.052
 boss+0.046
 busy+0.045
 bored+0.044
Neg logit contributions
eway-0.123
ently-0.123
ered-0.120
aned-0.119
ides-0.119
Token and
Token ID290
Feature activation+9.22e-03

Pos logit contributions
please+0.008
 especially+0.007
 boss+0.007
 busy+0.007
 bored+0.007
Neg logit contributions
eway-0.012
ently-0.012
ered-0.012
aned-0.011
ides-0.011
Token the
Token ID262
Feature activation+6.91e-03

Pos logit contributions
eway+0.021
ently+0.021
ered+0.020
ides+0.020
aned+0.020
Neg logit contributions
please-0.014
 especially-0.013
 boss-0.012
 busy-0.011
 bored-0.011
Token doctor
Token ID6253
Feature activation-5.67e-02

Pos logit contributions
eway+0.015
ered+0.015
ently+0.015
heit+0.014
aned+0.014
Neg logit contributions
please-0.011
 especially-0.010
 boss-0.009
 responsible-0.009
 bored-0.009
Token said
Token ID531
Feature activation+8.81e-02

Pos logit contributions
please+0.087
 especially+0.083
 boss+0.078
 bored+0.075
 busy+0.074
Neg logit contributions
eway-0.128
ently-0.123
ogo-0.120
aned-0.118
ered-0.117
Token goodbye
Token ID24829
Feature activation+9.10e-02

Pos logit contributions
eway+0.195
ently+0.190
aned+0.188
ered+0.187
ogo+0.187
Neg logit contributions
please-0.130
 especially-0.126
 boss-0.120
 busy-0.115
 responsible-0.112
Token to
Token ID284
Feature activation+1.52e-02

Pos logit contributions
eway+0.239
ered+0.237
heit+0.235
ently+0.232
anced+0.230
Neg logit contributions
please-0.101
 especially-0.094
 boss-0.085
 busy-0.085
 responsible-0.082
Token the
Token ID262
Feature activation-5.45e-03

Pos logit contributions
eway+0.036
ently+0.035
ered+0.035
aned+0.034
ides+0.034
Neg logit contributions
please-0.021
 especially-0.019
 boss-0.017
 busy-0.017
 bored-0.017
Token patient
Token ID5827
Feature activation-4.28e-02

Pos logit contributions
please+0.008
 especially+0.007
 bored+0.007
 boss+0.006
 responsible+0.006
Neg logit contributions
eway-0.012
ently-0.012
ered-0.012
aned-0.011
heit-0.011
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
please+0.061
 especially+0.059
 boss+0.055
 busy+0.053
 responsible+0.052
Neg logit contributions
eway-0.096
ered-0.096
ently-0.095
aned-0.092
ogo-0.092
Token
Token ID220
Feature activation+2.53e-02

Pos logit contributions
ered+0.043
ently+0.042
eway+0.041
ides+0.041
aned+0.041
Neg logit contributions
please-0.025
 especially-0.025
 busy-0.022
 bored-0.021
Why-0.021
Token both
Token ID1111
Feature activation+6.98e-03

Pos logit contributions
ently+0.038
eway+0.038
ides+0.037
aned+0.037
ered+0.037
Neg logit contributions
please-0.023
 especially-0.023
 different-0.022
 busy-0.022
 boss-0.021
Token hands
Token ID2832
Feature activation+3.83e-02

Pos logit contributions
eway+0.014
ides+0.014
ogo+0.014
aned+0.014
ently+0.014
Neg logit contributions
 especially-0.011
please-0.011
 busy-0.011
 different-0.010
 boss-0.010
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
eway+0.092
ered+0.090
ently+0.089
heit+0.088
aned+0.087
Neg logit contributions
please-0.050
 especially-0.050
 busy-0.044
 boss-0.043
 bored-0.043
Token Then
Token ID3244
Feature activation-9.57e-03

Pos logit contributions
ogo+0.044
eway+0.042
ered+0.042
ently+0.042
ides+0.041
Neg logit contributions
 especially-0.033
please-0.031
 boss-0.031
 bored-0.031
 different-0.030
Token he
Token ID339
Feature activation+1.17e-02

Pos logit contributions
 boss+0.011
 especially+0.011
 healthy+0.011
 bored+0.011
please+0.011
Neg logit contributions
aned-0.025
ides-0.025
eway-0.025
ently-0.025
ogo-0.024
Token said
Token ID531
Feature activation+9.32e-02

Pos logit contributions
eway+0.026
ently+0.025
rum+0.025
ered+0.025
aned+0.024
Neg logit contributions
please-0.019
 especially-0.018
 boss-0.016
 busy-0.015
 bored-0.015
Token the
Token ID262
Feature activation+3.19e-03

Pos logit contributions
eway+0.205
ogo+0.203
ides+0.201
aned+0.200
ently+0.197
Neg logit contributions
please-0.139
 especially-0.134
 boss-0.134
 creative-0.128
 busy-0.126
Token same
Token ID976
Feature activation-1.82e-02

Pos logit contributions
ered+0.007
eway+0.007
ently+0.007
anced+0.007
ides+0.007
Neg logit contributions
please-0.004
 especially-0.004
 bored-0.004
 boss-0.004
 responsible-0.004
Token magic
Token ID5536
Feature activation+6.04e-03

Pos logit contributions
 boss+0.027
 especially+0.026
please+0.026
 different+0.025
 creative+0.024
Neg logit contributions
ently-0.040
aned-0.040
eway-0.039
ogo-0.039
ered-0.039
Token word
Token ID1573
Feature activation+7.74e-02

Pos logit contributions
ently+0.013
aned+0.013
ered+0.013
eway+0.013
heit+0.013
Neg logit contributions
 boss-0.009
 especially-0.009
please-0.009
 different-0.009
 creative-0.008
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
heit+0.180
aned+0.179
ently+0.178
eway+0.178
ogo+0.177
Neg logit contributions
 especially-0.106
 boss-0.099
 different-0.095
please-0.095
 like-0.092
Token little
Token ID1310
Feature activation-4.12e-02

Pos logit contributions
eway+0.016
ered+0.016
ently+0.016
heit+0.015
anced+0.015
Neg logit contributions
please-0.010
 especially-0.009
 responsible-0.008
 bored-0.008
 boss-0.008
Token girl
Token ID2576
Feature activation-3.11e-02

Pos logit contributions
please+0.050
 especially+0.045
 responsible+0.040
 bored+0.038
 busy+0.038
Neg logit contributions
eway-0.100
ered-0.099
ently-0.098
anced-0.095
heit-0.094
Token loved
Token ID6151
Feature activation-6.37e-02

Pos logit contributions
please+0.049
 especially+0.048
 boss+0.043
 bored+0.042
 busy+0.042
Neg logit contributions
eway-0.068
ently-0.066
ered-0.065
aned-0.063
ogo-0.063
Token it
Token ID340
Feature activation+2.93e-02

Pos logit contributions
please+0.094
 especially+0.082
 boss+0.079
 responsible+0.079
 bored+0.079
Neg logit contributions
eway-0.136
ered-0.135
ently-0.132
rum-0.128
anced-0.125
Token right
Token ID826
Feature activation+5.19e-02

Pos logit contributions
ently+0.059
eway+0.058
ered+0.058
rum+0.055
ides+0.054
Neg logit contributions
please-0.050
 especially-0.043
 boss-0.042
 chefs-0.041
 bored-0.041
Token away
Token ID1497
Feature activation+9.65e-02

Pos logit contributions
eway+0.114
ered+0.114
ently+0.113
aned+0.109
ides+0.108
Neg logit contributions
please-0.078
 especially-0.072
 boss-0.066
 busy-0.065
 bored-0.064
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
eway+0.195
ently+0.194
ered+0.193
rum+0.183
aned+0.181
Neg logit contributions
please-0.165
 especially-0.149
 boss-0.138
 bored-0.137
 busy-0.135
Token
Token ID198
Feature activation-1.83e-02

Pos logit contributions
ered+0.038
ently+0.037
eway+0.036
aned+0.036
ides+0.036
Neg logit contributions
please-0.025
 especially-0.025
 boss-0.021
Why-0.021
 bored-0.021
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
please+0.025
 especially+0.024
 busy+0.021
 boss+0.021
 different+0.020
Neg logit contributions
ered-0.042
ently-0.042
eway-0.041
ides-0.041
anced-0.041
TokenEvery
Token ID6109
Feature activation-4.69e-03

Pos logit contributions
please+0.028
 especially+0.026
 boss+0.023
 busy+0.022
 bored+0.022
Neg logit contributions
eway-0.057
ered-0.057
ently-0.057
aned-0.055
ides-0.055
Token night
Token ID1755
Feature activation+4.94e-02

Pos logit contributions
please+0.007
 especially+0.007
 boss+0.006
 busy+0.006
 bored+0.006
Neg logit contributions
ently-0.011
ered-0.011
eway-0.011
aned-0.010
ides-0.010
Token Dad
Token ID17415
Feature activation-3.84e-02

Pos logit contributions
eway+0.055
ered+0.055
ently+0.054
aned+0.051
ides+0.050
Neg logit contributions
please-0.070
 especially-0.067
 boss-0.063
 busy-0.062
 bored-0.062
Token likes
Token ID7832
Feature activation-3.33e-02

Pos logit contributions
please+0.071
 especially+0.068
 boss+0.064
 busy+0.063
 bored+0.062
Neg logit contributions
eway-0.073
ently-0.070
ered-0.070
aned-0.066
anced-0.066
Token chocolate
Token ID11311
Feature activation+1.58e-02

Pos logit contributions
please+0.056
 especially+0.049
 responsible+0.048
 bored+0.047
 restless+0.047
Neg logit contributions
ered-0.061
ently-0.060
eway-0.059
am-0.057
ield-0.056
Token,"
Token ID553
Feature activation-5.33e-02

Pos logit contributions
ently+0.033
ered+0.032
eway+0.031
aned+0.031
ides+0.031
Neg logit contributions
please-0.025
 especially-0.023
 responsible-0.021
 boss-0.021
 bored-0.021
Token she
Token ID673
Feature activation+1.31e-03

Pos logit contributions
please+0.080
 especially+0.072
 responsible+0.067
 boss+0.065
 bored+0.064
Neg logit contributions
ered-0.115
ently-0.114
eway-0.113
rum-0.107
am-0.107
Token says
Token ID1139
Feature activation+9.88e-02

Pos logit contributions
eway+0.002
ently+0.002
ides+0.002
ered+0.002
rum+0.002
Neg logit contributions
please-0.003
 boss-0.003
 especially-0.003
 busy-0.003
 bored-0.003
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
ered+0.204
ently+0.204
eway+0.203
aned+0.195
ides+0.193
Neg logit contributions
please-0.161
 especially-0.150
 boss-0.138
 busy-0.135
 bored-0.134
Token
Token ID198
Feature activation-2.02e-02

Pos logit contributions
eway+0.044
ently+0.043
ered+0.041
rum+0.041
anced+0.040
Neg logit contributions
please-0.040
 especially-0.037
 responsible-0.035
 boss-0.034
 bored-0.034
Token
Token ID198
Feature activation-2.23e-02

Pos logit contributions
please+0.027
 especially+0.024
 boss+0.022
 busy+0.022
 responsible+0.021
Neg logit contributions
ered-0.048
eway-0.048
ently-0.048
aned-0.046
ides-0.046
TokenAnna
Token ID31160
Feature activation-1.37e-02

Pos logit contributions
please+0.029
 especially+0.027
 bored+0.024
 boss+0.024
 busy+0.023
Neg logit contributions
eway-0.053
ered-0.053
ently-0.053
rum-0.051
aned-0.051
Token and
Token ID290
Feature activation+4.03e-03

Pos logit contributions
please+0.022
 especially+0.021
 boss+0.019
 bored+0.019
 busy+0.019
Neg logit contributions
eway-0.029
ered-0.028
ently-0.028
ides-0.027
aned-0.027
Token
Token ID198
Feature activation-2.46e-02

Pos logit contributions
please+0.031
 especially+0.030
 busy+0.027
 creative+0.026
 boss+0.026
Neg logit contributions
ered-0.048
ently-0.048
eway-0.047
ides-0.047
anced-0.047
TokenJ
Token ID41
Feature activation-2.27e-02

Pos logit contributions
please+0.031
 especially+0.028
 boss+0.026
 busy+0.025
 bored+0.025
Neg logit contributions
eway-0.060
ered-0.060
ently-0.060
aned-0.058
ides-0.057
Tokenill
Token ID359
Feature activation-1.63e-02

Pos logit contributions
please+0.028
 especially+0.026
 boss+0.024
 chefs+0.023
 creative+0.023
Neg logit contributions
heit-0.052
ered-0.051
aned-0.050
eway-0.050
zed-0.049
Token cheered
Token ID34767
Feature activation+1.11e-02

Pos logit contributions
 especially+0.025
please+0.025
 boss+0.022
 busy+0.021
 creative+0.021
Neg logit contributions
eway-0.036
ently-0.034
ides-0.034
ogo-0.033
rum-0.033
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
eway+0.028
ered+0.028
ides+0.028
ently+0.027
anced+0.027
Neg logit contributions
 especially-0.013
please-0.013
 boss-0.012
 busy-0.011
 creative-0.011
Token said
Token ID531
Feature activation+9.10e-02

Pos logit contributions
eway+0.046
ently+0.045
ides+0.045
ogo+0.043
rum+0.043
Neg logit contributions
please-0.030
 especially-0.029
 busy-0.027
 boss-0.027
 creative-0.027
Token,
Token ID11
Feature activation-4.75e-02

Pos logit contributions
eway+0.203
ides+0.199
aned+0.199
ogo+0.198
ently+0.197
Neg logit contributions
please-0.125
 boss-0.120
 especially-0.120
 creative-0.115
 responsible-0.111
Token "
Token ID366
Feature activation-1.44e-01

Pos logit contributions
please+0.074
 especially+0.071
 boss+0.065
 busy+0.064
 bored+0.061
Neg logit contributions
eway-0.105
ered-0.104
ently-0.102
ogo-0.100
ides-0.100
TokenYou
Token ID1639
Feature activation+1.70e-02

Pos logit contributions
 bored+0.169
please+0.165
 thirsty+0.161
 especially+0.160
 busy+0.159
Neg logit contributions
eway-0.338
ered-0.330
ently-0.329
anced-0.326
aned-0.318
Token did
Token ID750
Feature activation+7.08e-02

Pos logit contributions
eway+0.036
ently+0.036
aned+0.034
ered+0.034
rum+0.034
Neg logit contributions
please-0.027
 especially-0.026
 boss-0.025
 bored-0.024
 creative-0.024
Token it
Token ID340
Feature activation+2.81e-02

Pos logit contributions
eway+0.158
ently+0.157
ered+0.156
aned+0.151
ides+0.150
Neg logit contributions
please-0.104
 especially-0.098
 boss-0.090
 busy-0.088
 bored-0.087
Token flew
Token ID13112
Feature activation+1.66e-02

Pos logit contributions
eway+0.042
ently+0.041
ered+0.040
ides+0.039
rum+0.039
Neg logit contributions
please-0.035
 especially-0.034
 boss-0.031
 busy-0.030
 bored-0.030
Token all
Token ID477
Feature activation+5.03e-03

Pos logit contributions
ently+0.040
ides+0.040
ered+0.039
rum+0.039
eway+0.038
Neg logit contributions
please-0.021
 especially-0.021
 busy-0.019
 boss-0.019
 different-0.019
Token around
Token ID1088
Feature activation+5.19e-03

Pos logit contributions
ently+0.012
eway+0.012
ered+0.012
ides+0.012
aned+0.011
Neg logit contributions
please-0.007
 especially-0.006
 busy-0.006
 boss-0.006
 bored-0.006
Token,
Token ID11
Feature activation-4.16e-02

Pos logit contributions
eway+0.012
ered+0.012
ently+0.012
aned+0.012
ides+0.011
Neg logit contributions
please-0.007
 especially-0.006
 boss-0.006
 bored-0.006
 responsible-0.006
Token visiting
Token ID10013
Feature activation+1.34e-02

Pos logit contributions
please+0.055
 especially+0.052
 boss+0.047
 busy+0.046
 bored+0.045
Neg logit contributions
eway-0.099
ered-0.098
ently-0.098
ides-0.095
aned-0.095
Token different
Token ID1180
Feature activation+1.32e-02

Pos logit contributions
eway+0.031
ered+0.031
ently+0.031
aned+0.031
ides+0.031
Neg logit contributions
please-0.017
 especially-0.017
 boss-0.015
 busy-0.015
 bored-0.014
Token places
Token ID4113
Feature activation+1.14e-02

Pos logit contributions
eway+0.028
ered+0.028
ently+0.028
heit+0.026
am+0.026
Neg logit contributions
please-0.021
 especially-0.018
 responsible-0.017
 bored-0.017
 thirsty-0.017
Token and
Token ID290
Feature activation+2.57e-02

Pos logit contributions
eway+0.025
ered+0.025
ently+0.025
aned+0.024
heit+0.024
Neg logit contributions
please-0.017
 especially-0.016
 boss-0.015
 busy-0.015
 bored-0.014
Token meeting
Token ID3249
Feature activation-1.99e-02

Pos logit contributions
eway+0.057
ered+0.057
ently+0.057
aned+0.055
ides+0.054
Neg logit contributions
please-0.038
 especially-0.035
 boss-0.032
 busy-0.031
 bored-0.031
Token new
Token ID649
Feature activation-2.68e-02

Pos logit contributions
please+0.028
 especially+0.026
 boss+0.024
 busy+0.023
 bored+0.023
Neg logit contributions
eway-0.045
ered-0.045
ently-0.045
aned-0.043
ides-0.043
Token friends
Token ID2460
Feature activation-4.45e-02

Pos logit contributions
please+0.039
 especially+0.035
 bored+0.032
 boss+0.031
 responsible+0.031
Neg logit contributions
eway-0.061
ered-0.060
ently-0.060
am-0.057
rum-0.056
Token filled
Token ID5901
Feature activation-1.21e-02

Pos logit contributions
ered+0.038
eway+0.037
ently+0.036
aned+0.036
ided+0.035
Neg logit contributions
please-0.025
 especially-0.022
 boss-0.020
 responsible-0.020
Why-0.020
Token with
Token ID351
Feature activation+8.97e-03

Pos logit contributions
please+0.019
 especially+0.016
 bored+0.016
 boss+0.015
 thirsty+0.015
Neg logit contributions
eway-0.027
ered-0.027
ently-0.026
am-0.024
aned-0.024
Token gold
Token ID3869
Feature activation+2.86e-02

Pos logit contributions
ered+0.020
eway+0.019
ently+0.019
am+0.019
anced+0.018
Neg logit contributions
please-0.014
 especially-0.011
Why-0.011
 responsible-0.010
 boss-0.010
Token coins
Token ID10796
Feature activation+2.83e-02

Pos logit contributions
eway+0.064
ered+0.063
ently+0.062
aned+0.062
heit+0.061
Neg logit contributions
please-0.041
 especially-0.040
 boss-0.037
 busy-0.036
 chefs-0.036
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
eway+0.067
ered+0.065
ently+0.065
heit+0.064
aned+0.064
Neg logit contributions
please-0.037
 especially-0.037
 busy-0.034
 boss-0.033
 bored-0.033
Token
Token ID220
Feature activation+1.63e-02

Pos logit contributions
ogo+0.044
ered+0.042
ently+0.042
eway+0.042
rum+0.041
Neg logit contributions
 especially-0.034
please-0.032
 bored-0.031
 busy-0.031
 different-0.030
Token
Token ID198
Feature activation-2.19e-02

Pos logit contributions
ered+0.042
eway+0.042
ently+0.041
heit+0.041
aned+0.041
Neg logit contributions
please-0.017
 especially-0.017
 busy-0.014
 boss-0.014
 bored-0.014
TokenThe
Token ID464
Feature activation+2.80e-02

Pos logit contributions
please+0.032
 especially+0.029
 boss+0.027
 bored+0.027
 busy+0.026
Neg logit contributions
eway-0.050
ered-0.049
ently-0.049
aned-0.047
ides-0.046
Token plate
Token ID7480
Feature activation+1.57e-02

Pos logit contributions
ently+0.044
acle+0.042
ered+0.042
aned+0.041
zed+0.041
Neg logit contributions
please-0.056
 especially-0.048
 responsible-0.044
Why-0.043
 boss-0.043
Token was
Token ID373
Feature activation+6.48e-03

Pos logit contributions
eway+0.034
ides+0.034
ogo+0.034
rum+0.034
ently+0.034
Neg logit contributions
 especially-0.024
please-0.023
 boss-0.022
 busy-0.020
 creative-0.020
Token so
Token ID523
Feature activation-1.86e-02

Pos logit contributions
ered+0.018
eway+0.018
ently+0.018
aned+0.017
anced+0.017
Neg logit contributions
please-0.006
 especially-0.005
 boss-0.004
 responsible-0.004
 chefs-0.004
Token The
Token ID383
Feature activation+2.61e-02

Pos logit contributions
eway+0.057
ered+0.057
ently+0.057
aned+0.054
ides+0.054
Neg logit contributions
please-0.058
 especially-0.054
 boss-0.051
 busy-0.050
 bored-0.050
Token tree
Token ID5509
Feature activation+1.93e-02

Pos logit contributions
ered+0.051
eway+0.050
ently+0.049
am+0.047
aned+0.046
Neg logit contributions
please-0.045
 especially-0.040
 responsible-0.038
 chefs-0.036
 bored-0.035
Token grew
Token ID6348
Feature activation-6.02e-02

Pos logit contributions
eway+0.042
ently+0.041
ides+0.041
heit+0.040
ered+0.040
Neg logit contributions
 especially-0.030
please-0.030
 boss-0.027
 busy-0.026
 bored-0.026
Token very
Token ID845
Feature activation-3.74e-02

Pos logit contributions
please+0.077
 especially+0.063
 fashion+0.060
 chefs+0.059
 responsible+0.059
Neg logit contributions
ered-0.149
ently-0.144
am-0.141
eway-0.138
aned-0.135
Token tall
Token ID7331
Feature activation+2.37e-02

Pos logit contributions
please+0.040
 especially+0.034
 chefs+0.030
Why+0.028
 responsible+0.027
Neg logit contributions
ered-0.099
ently-0.097
eway-0.097
am-0.094
aned-0.094
Token and
Token ID290
Feature activation+2.01e-02

Pos logit contributions
eway+0.051
ered+0.051
ently+0.051
aned+0.049
ides+0.048
Neg logit contributions
please-0.036
 especially-0.033
 boss-0.030
 busy-0.030
 responsible-0.030
Token was
Token ID373
Feature activation+1.50e-02

Pos logit contributions
ered+0.050
eway+0.048
ently+0.047
heit+0.046
aned+0.046
Neg logit contributions
please-0.024
 chefs-0.020
 especially-0.019
 responsible-0.017
 fashion-0.017
Token covered
Token ID5017
Feature activation+2.26e-02

Pos logit contributions
ered+0.042
eway+0.041
ently+0.041
aned+0.040
rum+0.039
Neg logit contributions
please-0.013
 especially-0.011
 chefs-0.009
 boss-0.009
 responsible-0.009
Token in
Token ID287
Feature activation+2.36e-02

Pos logit contributions
eway+0.056
ered+0.056
ently+0.056
aned+0.054
ides+0.053
Neg logit contributions
please-0.028
 especially-0.025
 boss-0.022
 busy-0.022
 bored-0.022
Token sweet
Token ID6029
Feature activation-5.94e-03

Pos logit contributions
ered+0.052
eway+0.050
ently+0.048
rum+0.047
heit+0.047
Neg logit contributions
please-0.037
 especially-0.029
 responsible-0.028
 bored-0.028
 boss-0.027
Token p
Token ID279
Feature activation-1.22e-03

Pos logit contributions
please+0.009
 especially+0.009
 responsible+0.008
 bored+0.008
 boss+0.008
Neg logit contributions
ered-0.013
ently-0.012
eway-0.012
am-0.012
ides-0.012
TokenAnna
Token ID31160
Feature activation-1.81e-02

Pos logit contributions
please+0.025
 especially+0.022
 bored+0.021
 boss+0.021
 chefs+0.021
Neg logit contributions
eway-0.034
ently-0.033
ered-0.033
aned-0.032
rum-0.031
Token runs
Token ID4539
Feature activation-3.37e-03

Pos logit contributions
please+0.037
 especially+0.034
 boss+0.032
 responsible+0.032
 busy+0.032
Neg logit contributions
ered-0.030
ently-0.029
eway-0.029
aned-0.028
ides-0.028
Token after
Token ID706
Feature activation+3.78e-02

Pos logit contributions
please+0.006
 especially+0.005
nam+0.005
 chefs+0.005
 thirsty+0.005
Neg logit contributions
eway-0.006
ered-0.006
aned-0.006
am-0.006
ently-0.006
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
ered+0.069
ently+0.069
eway+0.066
zed+0.066
am+0.064
Neg logit contributions
please-0.067
 responsible-0.061
 especially-0.060
 ju-0.056
 boss-0.056
Token bubbles
Token ID25037
Feature activation-2.27e-02

Pos logit contributions
please+0.028
 especially+0.025
 responsible+0.023
nam+0.022
 bored+0.021
Neg logit contributions
eway-0.034
ently-0.034
ogo-0.033
ides-0.033
ered-0.033
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
please+0.036
 especially+0.033
 boss+0.031
 busy+0.030
 bored+0.030
Neg logit contributions
eway-0.048
ered-0.048
ently-0.048
aned-0.046
ides-0.045
Token She
Token ID1375
Feature activation+2.56e-02

Pos logit contributions
ently+0.045
eway+0.045
ered+0.045
anced+0.044
aned+0.043
Neg logit contributions
please-0.043
 especially-0.039
 responsible-0.036
 busy-0.035
 chefs-0.035
Token tries
Token ID8404
Feature activation+1.30e-02

Pos logit contributions
ered+0.040
ently+0.039
eway+0.039
aned+0.037
anced+0.037
Neg logit contributions
please-0.055
 especially-0.051
 responsible-0.048
 boss-0.048
 busy-0.048
Token to
Token ID284
Feature activation+2.97e-02

Pos logit contributions
ered+0.021
ently+0.021
eway+0.020
am+0.020
aned+0.019
Neg logit contributions
please-0.024
 responsible-0.021
 chefs-0.021
 ju-0.021
nam-0.021
Token catch
Token ID4929
Feature activation+2.98e-03

Pos logit contributions
ered+0.057
aned+0.051
ided+0.050
eway+0.049
ides+0.048
Neg logit contributions
please-0.052
 responsible-0.048
 wealthy-0.047
 bored-0.047
 chefs-0.046
Token them
Token ID606
Feature activation-1.18e-02

Pos logit contributions
ered+0.005
eway+0.004
ently+0.004
am+0.004
aned+0.004
Neg logit contributions
please-0.007
 responsible-0.006
 chefs-0.006
 especially-0.006
 thirsty-0.005
Token little
Token ID1310
Feature activation-3.86e-02

Pos logit contributions
please+0.039
 especially+0.035
Why+0.033
These+0.032
organ+0.030
Neg logit contributions
 replied-0.045
 chest-0.044
ia-0.043
eros-0.043
ogo-0.042
Token girl
Token ID2576
Feature activation-2.54e-02

Pos logit contributions
please+0.061
 especially+0.055
Why+0.050
 responsible+0.048
organ+0.045
Neg logit contributions
eros-0.073
 replied-0.071
ield-0.071
 chest-0.069
osaurs-0.068
Token named
Token ID3706
Feature activation-2.53e-02

Pos logit contributions
please+0.051
 ju+0.046
 especially+0.045
 bored+0.045
 chefs+0.045
Neg logit contributions
ered-0.038
am-0.036
rum-0.036
eway-0.036
ently-0.035
Token Lily
Token ID20037
Feature activation-2.76e-02

Pos logit contributions
please+0.040
 especially+0.034
 chefs+0.032
 responsible+0.031
 boss+0.030
Neg logit contributions
ered-0.054
eway-0.052
ently-0.050
ided-0.049
aned-0.049
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
please+0.051
 especially+0.047
 boss+0.045
 bored+0.045
 chefs+0.044
Neg logit contributions
ered-0.049
eway-0.048
ently-0.047
rum-0.047
am-0.046
Token She
Token ID1375
Feature activation+1.75e-02

Pos logit contributions
eway+0.036
ently+0.036
ered+0.035
anced+0.035
ides+0.034
Neg logit contributions
please-0.045
 responsible-0.038
 especially-0.038
 chefs-0.037
 boss-0.036
Token loved
Token ID6151
Feature activation-6.97e-02

Pos logit contributions
ered+0.040
eway+0.039
ently+0.039
aned+0.038
am+0.038
Neg logit contributions
please-0.024
 especially-0.021
 boss-0.019
 chefs-0.019
 bored-0.019
Token playing
Token ID2712
Feature activation-4.09e-02

Pos logit contributions
 especially+0.099
please+0.099
 responsible+0.097
 boss+0.093
 happiest+0.093
Neg logit contributions
am-0.120
ered-0.117
anced-0.115
zed-0.115
ield-0.113
Token outside
Token ID2354
Feature activation-1.94e-02

Pos logit contributions
please+0.066
 bored+0.062
 especially+0.059
 thirsty+0.058
 happiest+0.058
Neg logit contributions
ered-0.075
eway-0.073
ently-0.068
 replied-0.068
am-0.067
Token,
Token ID11
Feature activation-3.87e-02

Pos logit contributions
please+0.041
 responsible+0.036
 ju+0.033
 wealthy+0.032
 talented+0.032
Neg logit contributions
ently-0.029
ered-0.029
eway-0.028
am-0.028
zed-0.025
Token especially
Token ID2592
Feature activation-7.75e-02

Pos logit contributions
please+0.053
 responsible+0.045
 chefs+0.043
Girls+0.039
 boss+0.038
Neg logit contributions
ered-0.086
eway-0.083
zed-0.080
ently-0.079
ided-0.077
Token wanted
Token ID2227
Feature activation-9.61e-03

Pos logit contributions
eway+0.083
ently+0.082
ered+0.081
ides+0.078
aned+0.078
Neg logit contributions
please-0.064
 especially-0.061
 boss-0.056
 busy-0.054
 bored-0.054
Token to
Token ID284
Feature activation+1.52e-02

Pos logit contributions
please+0.015
 responsible+0.013
 especially+0.013
 restless+0.012
 bored+0.012
Neg logit contributions
ered-0.020
eway-0.019
ently-0.018
am-0.018
ogo-0.018
Token know
Token ID760
Feature activation+2.55e-02

Pos logit contributions
ered+0.032
eway+0.032
ently+0.030
rum+0.030
ides+0.029
Neg logit contributions
please-0.024
 especially-0.022
 bored-0.021
 busy-0.021
 responsible-0.021
Token how
Token ID703
Feature activation-1.19e-01

Pos logit contributions
eway+0.057
ered+0.057
ently+0.057
aned+0.055
ides+0.054
Neg logit contributions
please-0.037
 especially-0.035
 boss-0.032
 busy-0.031
 bored-0.031
Token big
Token ID1263
Feature activation-2.31e-02

Pos logit contributions
please+0.163
 especially+0.150
 boss+0.136
 busy+0.133
 bored+0.132
Neg logit contributions
eway-0.278
ered-0.278
ently-0.277
aned-0.266
ides-0.264
Token the
Token ID262
Feature activation-1.29e-02

Pos logit contributions
please+0.038
 especially+0.034
 responsible+0.032
 bored+0.031
 boss+0.031
Neg logit contributions
ered-0.048
eway-0.047
ently-0.046
anced-0.044
am-0.044
Token cage
Token ID17186
Feature activation+3.63e-02

Pos logit contributions
please+0.019
 especially+0.018
 responsible+0.016
 bored+0.016
 chefs+0.016
Neg logit contributions
ered-0.028
eway-0.027
ently-0.027
am-0.026
anced-0.025
Token was
Token ID373
Feature activation-2.08e-03

Pos logit contributions
ently+0.075
eway+0.075
heit+0.074
aned+0.073
rum+0.073
Neg logit contributions
 especially-0.056
please-0.055
 boss-0.053
 busy-0.050
 bored-0.049
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
please+0.003
 especially+0.003
 boss+0.003
 busy+0.003
 bored+0.002
Neg logit contributions
eway-0.005
ently-0.005
ered-0.005
aned-0.005
heit-0.005
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
ered+0.044
ently+0.043
eway+0.043
ides+0.042
aned+0.042
Neg logit contributions
please-0.032
 especially-0.029
 boss-0.028
 busy-0.027
 bored-0.026
Token
Token ID198
Feature activation-1.99e-02

Pos logit contributions
please+0.021
 especially+0.020
 boss+0.017
 busy+0.017
 responsible+0.016
Neg logit contributions
ered-0.040
ently-0.040
eway-0.040
anced-0.039
aned-0.039
Token boy
Token ID2933
Feature activation-2.49e-02

Pos logit contributions
please+0.079
 especially+0.071
Why+0.068
 bored+0.064
 responsible+0.063
Neg logit contributions
eros-0.094
 replied-0.090
 chest-0.089
ield-0.087
acle-0.086
Token named
Token ID3706
Feature activation-2.44e-02

Pos logit contributions
please+0.045
 especially+0.040
 bored+0.040
 ju+0.039
 busy+0.039
Neg logit contributions
ered-0.043
rum-0.042
am-0.042
eway-0.041
ently-0.041
Token Tim
Token ID5045
Feature activation+1.67e-03

Pos logit contributions
please+0.040
 especially+0.038
 boss+0.035
 busy+0.035
 bored+0.035
Neg logit contributions
ently-0.052
ered-0.052
eway-0.051
aned-0.049
ides-0.049
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
ered+0.003
ently+0.003
eway+0.003
rum+0.003
aned+0.003
Neg logit contributions
please-0.003
 especially-0.003
 boss-0.002
 busy-0.002
 bored-0.002
Token He
Token ID679
Feature activation+3.84e-02

Pos logit contributions
ently+0.059
ered+0.059
eway+0.057
aned+0.056
heit+0.056
Neg logit contributions
please-0.043
 especially-0.042
 boss-0.038
 busy-0.037
 bored-0.037
Token liked
Token ID8288
Feature activation-4.88e-02

Pos logit contributions
ered+0.081
eway+0.080
ently+0.078
aned+0.077
am+0.076
Neg logit contributions
please-0.057
 especially-0.051
 chefs-0.048
 responsible-0.048
 boss-0.048
Token to
Token ID284
Feature activation+2.51e-02

Pos logit contributions
please+0.072
 especially+0.067
 responsible+0.064
 boss+0.061
 bored+0.061
Neg logit contributions
eway-0.091
am-0.090
ered-0.088
ield-0.087
ently-0.087
Token play
Token ID711
Feature activation-2.24e-02

Pos logit contributions
ered+0.061
eway+0.060
ently+0.059
ides+0.057
aned+0.056
Neg logit contributions
please-0.032
 especially-0.029
 bored-0.026
 busy-0.026
 boss-0.026
Token outside
Token ID2354
Feature activation-2.21e-02

Pos logit contributions
please+0.032
 especially+0.027
 bored+0.027
 responsible+0.027
 thirsty+0.026
Neg logit contributions
eway-0.049
ered-0.049
ently-0.048
am-0.046
zed-0.045
Token and
Token ID290
Feature activation+2.14e-02

Pos logit contributions
please+0.037
 responsible+0.031
 especially+0.029
 boss+0.028
 wealthy+0.028
Neg logit contributions
eway-0.045
ered-0.045
ently-0.044
am-0.043
rum-0.042
Token look
Token ID804
Feature activation+9.62e-03

Pos logit contributions
ered+0.043
eway+0.043
ently+0.040
heit+0.040
am+0.039
Neg logit contributions
please-0.034
 especially-0.030
 responsible-0.030
 chefs-0.029
 boss-0.029
Token warm
Token ID5814
Feature activation+2.51e-02

Pos logit contributions
eway+0.032
ered+0.032
ently+0.032
aned+0.031
ides+0.031
Neg logit contributions
please-0.010
 especially-0.009
 boss-0.008
 busy-0.007
 bored-0.007
Token!"
Token ID2474
Feature activation-2.23e-02

Pos logit contributions
eway+0.044
ently+0.044
ered+0.044
aned+0.041
ides+0.041
Neg logit contributions
please-0.048
 especially-0.046
 boss-0.043
 bored-0.042
 busy-0.042
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
please+0.036
 especially+0.032
 responsible+0.030
 boss+0.030
 chefs+0.029
Neg logit contributions
ently-0.047
ered-0.046
eway-0.046
am-0.044
aned-0.044
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
please+0.023
 especially+0.021
 boss+0.019
 busy+0.019
 bored+0.019
Neg logit contributions
eway-0.042
ered-0.042
ently-0.042
aned-0.040
ides-0.040
Token"
Token ID1
Feature activation-1.50e-01

Pos logit contributions
please+0.039
 especially+0.037
 bored+0.035
 boss+0.034
 thirsty+0.034
Neg logit contributions
ently-0.035
ered-0.035
eway-0.035
ides-0.033
ided-0.033
TokenCan
Token ID6090
Feature activation-2.50e-02

Pos logit contributions
 bored+0.249
 sick+0.236
please+0.221
 chefs+0.218
 thirsty+0.218
Neg logit contributions
eway-0.260
atively-0.242
ided-0.235
acle-0.234
anced-0.233
Token I
Token ID314
Feature activation-1.02e-01

Pos logit contributions
please+0.044
 especially+0.041
 boss+0.038
 busy+0.037
 bored+0.037
Neg logit contributions
eway-0.049
ered-0.049
ently-0.049
aned-0.046
ides-0.046
Token try
Token ID1949
Feature activation+1.06e-02

Pos logit contributions
please+0.154
 especially+0.145
 boss+0.132
 busy+0.129
 bored+0.127
Neg logit contributions
eway-0.225
ently-0.224
ered-0.222
aned-0.215
ides-0.212
Token it
Token ID340
Feature activation+2.12e-02

Pos logit contributions
ered+0.019
ently+0.018
eway+0.018
ogo+0.017
aned+0.017
Neg logit contributions
please-0.021
 especially-0.019
 responsible-0.019
 boss-0.018
 bored-0.018
Token?"
Token ID1701
Feature activation-2.73e-02

Pos logit contributions
ered+0.041
eway+0.041
ently+0.041
aned+0.039
ides+0.039
Neg logit contributions
please-0.037
 especially-0.035
 boss-0.032
 busy-0.032
 responsible-0.032
Token Ben
Token ID3932
Feature activation-2.67e-03

Pos logit contributions
please+0.048
 especially+0.043
 responsible+0.042
 chefs+0.040
 boss+0.039
Neg logit contributions
ered-0.052
ently-0.051
eway-0.050
am-0.049
aned-0.049
Token very
Token ID845
Feature activation-4.05e-02

Pos logit contributions
ently+0.016
eway+0.016
ered+0.016
ides+0.016
aned+0.015
Neg logit contributions
please-0.004
 especially-0.004
 boss-0.003
 busy-0.003
 bored-0.003
Token happy
Token ID3772
Feature activation+5.24e-03

Pos logit contributions
please+0.028
 especially+0.024
 boss+0.020
 busy+0.019
 bored+0.019
Neg logit contributions
eway-0.122
ently-0.122
ered-0.122
aned-0.118
ides-0.118
Token.
Token ID13
Feature activation+3.56e-02

Pos logit contributions
eway+0.011
ently+0.011
ered+0.011
rum+0.010
aned+0.010
Neg logit contributions
please-0.009
 especially-0.008
 boss-0.007
 responsible-0.007
 busy-0.007
Token One
Token ID1881
Feature activation+1.94e-02

Pos logit contributions
ently+0.068
ered+0.068
eway+0.067
aned+0.065
ogo+0.064
Neg logit contributions
please-0.064
 especially-0.062
 boss-0.059
 busy-0.057
 bored-0.057
Token day
Token ID1110
Feature activation+3.87e-02

Pos logit contributions
eway+0.041
ered+0.041
ently+0.040
aned+0.038
rum+0.038
Neg logit contributions
please-0.031
 especially-0.028
 boss-0.026
 bored-0.025
 responsible-0.025
Token,
Token ID11
Feature activation-4.63e-02

Pos logit contributions
eway+0.081
ently+0.081
ered+0.080
aned+0.078
ides+0.077
Neg logit contributions
please-0.062
 especially-0.059
 boss-0.056
 busy-0.054
 bored-0.054
Token the
Token ID262
Feature activation+7.89e-03

Pos logit contributions
please+0.061
 especially+0.053
 responsible+0.049
 boss+0.047
 bored+0.046
Neg logit contributions
ered-0.111
eway-0.111
ently-0.108
anced-0.104
ided-0.103
Token red
Token ID2266
Feature activation+1.24e-02

Pos logit contributions
acle+0.015
am+0.014
eway+0.014
ield+0.014
anced+0.014
Neg logit contributions
please-0.014
 especially-0.011
 responsible-0.011
 chefs-0.010
Why-0.010
Token ball
Token ID2613
Feature activation+3.20e-03

Pos logit contributions
ently+0.033
ered+0.033
eway+0.033
ides+0.032
aned+0.032
Neg logit contributions
please-0.012
 especially-0.012
 boss-0.012
 busy-0.011
 different-0.011
Token saw
Token ID2497
Feature activation-7.06e-03

Pos logit contributions
ently+0.007
eway+0.007
ered+0.007
ides+0.007
rum+0.007
Neg logit contributions
please-0.005
 especially-0.005
 boss-0.004
 bored-0.004
 busy-0.004
Token a
Token ID257
Feature activation-2.39e-02

Pos logit contributions
please+0.008
 especially+0.007
 boss+0.006
 responsible+0.006
 busy+0.006
Neg logit contributions
ered-0.019
eway-0.019
ently-0.018
anced-0.018
aned-0.018
Token
Token ID198
Feature activation-2.29e-02

Pos logit contributions
please+0.023
 especially+0.022
 busy+0.020
 boss+0.019
 responsible+0.019
Neg logit contributions
eway-0.052
ered-0.052
ently-0.052
ides-0.051
anced-0.051
TokenBut
Token ID1537
Feature activation+4.23e-02

Pos logit contributions
please+0.035
 especially+0.032
 boss+0.029
 bored+0.029
 chefs+0.028
Neg logit contributions
ered-0.050
eway-0.049
ently-0.049
aned-0.048
rum-0.047
Token then
Token ID788
Feature activation-4.05e-02

Pos logit contributions
eway+0.088
ered+0.088
ently+0.088
aned+0.084
ides+0.083
Neg logit contributions
please-0.068
 especially-0.064
 boss-0.059
 busy-0.058
 bored-0.057
Token they
Token ID484
Feature activation-5.05e-03

Pos logit contributions
please+0.060
 especially+0.057
 boss+0.054
 bored+0.052
 busy+0.052
Neg logit contributions
ently-0.090
eway-0.090
ered-0.088
aned-0.087
ides-0.086
Token heard
Token ID2982
Feature activation+2.25e-02

Pos logit contributions
please+0.007
 especially+0.007
 responsible+0.006
 boss+0.006
 chefs+0.006
Neg logit contributions
ered-0.011
eway-0.011
ently-0.011
aned-0.011
ogo-0.011
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
ently+0.063
eway+0.063
ered+0.063
aned+0.061
ides+0.061
Neg logit contributions
please-0.020
 especially-0.018
 boss-0.015
 busy-0.015
 bored-0.014
Token whistle
Token ID16121
Feature activation+4.08e-02

Pos logit contributions
please+0.024
 especially+0.022
 boss+0.021
 busy+0.021
 responsible+0.020
Neg logit contributions
ently-0.036
eway-0.035
ered-0.035
anced-0.034
aned-0.034
Token.
Token ID13
Feature activation+2.42e-02

Pos logit contributions
heit+0.115
ently+0.111
anced+0.109
ides+0.108
ogo+0.108
Neg logit contributions
 boss-0.039
 especially-0.039
Why-0.038
 different-0.036
please-0.036
Token It
Token ID632
Feature activation+3.09e-02

Pos logit contributions
ently+0.043
ered+0.042
eway+0.041
ogo+0.040
aned+0.040
Neg logit contributions
please-0.047
 especially-0.045
 boss-0.045
 bored-0.044
 busy-0.044
Token was
Token ID373
Feature activation+1.48e-02

Pos logit contributions
ently+0.070
eway+0.067
ides+0.066
ered+0.064
rum+0.064
Neg logit contributions
please-0.049
 especially-0.047
 boss-0.045
 busy-0.042
 different-0.042
Token a
Token ID257
Feature activation-2.21e-02

Pos logit contributions
ered+0.035
eway+0.035
ently+0.035
aned+0.034
ogo+0.034
Neg logit contributions
please-0.019
 especially-0.017
 boss-0.015
 responsible-0.015
 bored-0.015
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
ered+0.039
ently+0.038
ides+0.038
eway+0.038
aned+0.038
Neg logit contributions
please-0.022
 especially-0.020
 boss-0.019
 bored-0.018
 busy-0.018
Token
Token ID198
Feature activation-2.74e-02

Pos logit contributions
please+0.029
 especially+0.027
 busy+0.024
 boss+0.024
 creative+0.023
Neg logit contributions
ered-0.064
aned-0.063
anced-0.063
ides-0.063
ently-0.063
TokenShe
Token ID3347
Feature activation+1.90e-02

Pos logit contributions
please+0.047
 especially+0.044
 boss+0.041
 busy+0.040
 responsible+0.040
Neg logit contributions
ered-0.054
ently-0.054
eway-0.053
aned-0.052
ides-0.051
Token said
Token ID531
Feature activation+9.14e-02

Pos logit contributions
eway+0.046
ently+0.045
rum+0.043
ides+0.043
ogo+0.043
Neg logit contributions
please-0.027
 especially-0.027
 boss-0.024
 busy-0.023
 creative-0.023
Token,
Token ID11
Feature activation-5.30e-02

Pos logit contributions
eway+0.191
ered+0.190
ently+0.189
aned+0.183
ides+0.182
Neg logit contributions
please-0.146
 especially-0.137
 boss-0.128
 busy-0.124
 bored-0.124
Token "
Token ID366
Feature activation-1.46e-01

Pos logit contributions
please+0.075
 especially+0.073
 boss+0.069
 busy+0.067
 bored+0.064
Neg logit contributions
eway-0.121
ered-0.119
ogo-0.118
ently-0.118
ides-0.117
TokenLet
Token ID5756
Feature activation-2.21e-02

Pos logit contributions
please+0.131
 bored+0.128
 especially+0.126
 thirsty+0.123
 busy+0.119
Neg logit contributions
eway-0.393
ered-0.383
ently-0.382
anced-0.368
aned-0.366
Token's
Token ID338
Feature activation-1.96e-03

Pos logit contributions
please+0.039
 especially+0.037
 boss+0.035
 busy+0.034
 bored+0.033
Neg logit contributions
eway-0.043
ently-0.042
ered-0.042
aned-0.041
ides-0.040
Token go
Token ID467
Feature activation-5.78e-02

Pos logit contributions
please+0.003
 especially+0.003
 boss+0.002
 busy+0.002
 bored+0.002
Neg logit contributions
ered-0.004
eway-0.004
ently-0.004
aned-0.004
ides-0.004
Token on
Token ID319
Feature activation+5.39e-02

Pos logit contributions
please+0.073
 especially+0.068
 boss+0.060
 busy+0.059
 bored+0.058
Neg logit contributions
ently-0.141
eway-0.141
ered-0.140
aned-0.136
ides-0.135
Token a
Token ID257
Feature activation-3.45e-02

Pos logit contributions
ered+0.127
eway+0.126
ently+0.125
heit+0.117
am+0.117
Neg logit contributions
please-0.076
 especially-0.065
 boss-0.059
 responsible-0.057
 chefs-0.055
Tokenamp
Token ID696
Feature activation+6.30e-02

Pos logit contributions
ered+0.071
eway+0.071
ently+0.070
ides+0.066
anced+0.065
Neg logit contributions
please-0.091
 especially-0.087
 busy-0.079
 boss-0.079
 bored-0.078
Tokenoline
Token ID14453
Feature activation+5.09e-02

Pos logit contributions
ered+0.075
ently+0.072
eway+0.072
heit+0.071
ides+0.071
Neg logit contributions
please-0.151
 especially-0.146
 boss-0.138
Why-0.136
 busy-0.135
Token and
Token ID290
Feature activation+2.87e-02

Pos logit contributions
eway+0.115
ered+0.115
ently+0.114
aned+0.110
heit+0.109
Neg logit contributions
please-0.072
 especially-0.069
 boss-0.062
 busy-0.061
 bored-0.060
Token shouted
Token ID17293
Feature activation+9.58e-02

Pos logit contributions
eway+0.056
ered+0.056
ently+0.056
aned+0.053
ides+0.053
Neg logit contributions
please-0.050
 especially-0.047
 boss-0.044
 busy-0.043
 bored-0.043
Token,
Token ID11
Feature activation-3.36e-02

Pos logit contributions
eway+0.192
ently+0.191
ered+0.191
aned+0.183
ides+0.182
Neg logit contributions
please-0.161
 especially-0.153
 boss-0.144
 busy-0.139
 bored-0.139
Token "
Token ID366
Feature activation-1.37e-01

Pos logit contributions
please+0.054
 especially+0.051
 boss+0.049
 busy+0.047
 bored+0.047
Neg logit contributions
eway-0.072
ered-0.072
ently-0.071
ides-0.069
ogo-0.068
TokenW
Token ID54
Feature activation+4.22e-02

Pos logit contributions
 chefs+0.172
 bored+0.164
please+0.163
 busy+0.162
 thirsty+0.161
Neg logit contributions
eway-0.305
ered-0.298
aned-0.294
anced-0.290
ently-0.289
Tokenoo
Token ID2238
Feature activation+3.63e-02

Pos logit contributions
eway+0.051
ered+0.050
ently+0.049
aned+0.046
anced+0.045
Neg logit contributions
please-0.106
 especially-0.101
 boss-0.096
 busy-0.095
 responsible-0.095
Token-
Token ID12
Feature activation-6.20e-02

Pos logit contributions
eway+0.071
ently+0.071
ered+0.067
ides+0.067
anced+0.065
Neg logit contributions
please-0.064
 boss-0.061
 especially-0.058
 busy-0.055
 creative-0.055
Token h
Token ID289
Feature activation+5.39e-02

Pos logit contributions
please+0.111
 especially+0.105
 boss+0.097
 busy+0.096
 bored+0.095
Neg logit contributions
eway-0.118
ered-0.118
ently-0.117
aned-0.112
ides-0.111
Tokenoo
Token ID2238
Feature activation+4.29e-02

Pos logit contributions
eway+0.087
ered+0.086
ently+0.085
rum+0.080
heit+0.079
Neg logit contributions
please-0.111
 especially-0.105
 boss-0.100
 bored-0.097
 busy-0.097
Token in
Token ID287
Feature activation+5.88e-03

Pos logit contributions
please+0.068
 especially+0.063
 boss+0.058
 busy+0.056
 bored+0.055
Neg logit contributions
ently-0.107
eway-0.106
ered-0.105
aned-0.103
ides-0.101
Token,"
Token ID553
Feature activation-5.98e-02

Pos logit contributions
eway+0.011
ered+0.010
ently+0.010
rum+0.010
ides+0.010
Neg logit contributions
please-0.011
 especially-0.010
 boss-0.010
 bored-0.010
 responsible-0.009
Token says
Token ID1139
Feature activation+9.78e-02

Pos logit contributions
please+0.100
 especially+0.088
 responsible+0.082
 boss+0.080
 bored+0.078
Neg logit contributions
ered-0.120
ently-0.119
eway-0.118
heit-0.112
anced-0.112
Token Molly
Token ID30236
Feature activation-7.40e-03

Pos logit contributions
ered+0.197
ently+0.196
eway+0.191
am+0.185
anced+0.184
Neg logit contributions
please-0.167
 especially-0.153
 responsible-0.141
 boss-0.132
 busy-0.131
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
please+0.009
 boss+0.008
 especially+0.008
 bored+0.008
 busy+0.008
Neg logit contributions
eway-0.019
ently-0.018
heit-0.018
ered-0.018
ides-0.018
Token "
Token ID366
Feature activation-1.36e-01

Pos logit contributions
eway+0.038
ently+0.038
ered+0.036
anced+0.036
aned+0.035
Neg logit contributions
please-0.044
 especially-0.041
 responsible-0.039
 boss-0.038
 creative-0.037
TokenI
Token ID40
Feature activation-7.85e-02

Pos logit contributions
please+0.137
 especially+0.124
 bored+0.124
 chefs+0.123
 boss+0.123
Neg logit contributions
eway-0.330
ently-0.324
oor-0.316
ered-0.316
ided-0.314
Token have
Token ID423
Feature activation-4.71e-02

Pos logit contributions
please+0.116
 especially+0.103
 chefs+0.095
 bored+0.095
 boss+0.094
Neg logit contributions
ered-0.172
eway-0.169
ides-0.164
ently-0.163
ogo-0.160
Token toys
Token ID14958
Feature activation-6.61e-03

Pos logit contributions
please+0.063
 especially+0.054
Why+0.049
 boss+0.049
 responsible+0.048
Neg logit contributions
ered-0.107
am-0.103
eway-0.103
anced-0.102
ently-0.102
Token and
Token ID290
Feature activation+5.48e-03

Pos logit contributions
please+0.011
 especially+0.010
 boss+0.009
 bored+0.009
 responsible+0.009
Neg logit contributions
ered-0.013
ently-0.013
eway-0.013
ides-0.013
aned-0.013
Token books
Token ID3835
Feature activation+4.57e-02

Pos logit contributions
ered+0.012
eway+0.012
ently+0.012
anced+0.011
aned+0.011
Neg logit contributions
please-0.008
 especially-0.007
 boss-0.007
 busy-0.007
 bored-0.007
Token at
Token ID379
Feature activation+5.11e-02

Pos logit contributions
ently+0.041
eway+0.041
ered+0.041
aned+0.039
ides+0.039
Neg logit contributions
please-0.022
 especially-0.021
 boss-0.019
 busy-0.018
 bored-0.018
Token them
Token ID606
Feature activation-1.61e-02

Pos logit contributions
ered+0.112
eway+0.108
ently+0.105
anced+0.103
am+0.102
Neg logit contributions
please-0.080
 responsible-0.067
 especially-0.067
 boss-0.062
 chefs-0.062
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
 especially+0.019
please+0.018
 busy+0.017
 boss+0.017
 bored+0.016
Neg logit contributions
eway-0.040
heit-0.039
ently-0.039
ered-0.039
aned-0.039
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
ered+0.054
eway+0.054
ently+0.054
aned+0.051
ides+0.051
Neg logit contributions
please-0.047
 especially-0.045
 boss-0.041
 busy-0.041
 bored-0.040
Token
Token ID198
Feature activation-1.68e-02

Pos logit contributions
please+0.016
 especially+0.015
 boss+0.014
 busy+0.014
 responsible+0.013
Neg logit contributions
ered-0.030
ently-0.030
eway-0.030
aned-0.029
ides-0.029
Token"
Token ID1
Feature activation-1.53e-01

Pos logit contributions
please+0.028
 especially+0.026
 boss+0.024
 bored+0.024
 busy+0.024
Neg logit contributions
eway-0.034
ered-0.034
ently-0.034
aned-0.032
ides-0.032
TokenGood
Token ID10248
Feature activation-5.10e-03

Pos logit contributions
please+0.216
 especially+0.213
 bored+0.207
 responsible+0.202
 thirsty+0.202
Neg logit contributions
eway-0.333
ered-0.317
anced-0.314
ield-0.305
ides-0.303
Tokenbye
Token ID16390
Feature activation+4.49e-02

Pos logit contributions
please+0.008
 especially+0.008
 boss+0.008
 bored+0.007
 busy+0.007
Neg logit contributions
ently-0.011
eway-0.011
ered-0.011
aned-0.011
rum-0.010
Token!"
Token ID2474
Feature activation-4.22e-02

Pos logit contributions
ently+0.095
ered+0.094
eway+0.094
aned+0.092
anced+0.089
Neg logit contributions
please-0.073
 especially-0.067
 boss-0.066
 busy-0.063
 bored-0.062
Token Anna
Token ID11735
Feature activation-1.43e-02

Pos logit contributions
 especially+0.049
please+0.049
 boss+0.048
 bored+0.047
 busy+0.046
Neg logit contributions
ently-0.108
eway-0.105
ides-0.104
aned-0.103
ered-0.103
Token says
Token ID1139
Feature activation+8.88e-02

Pos logit contributions
 especially+0.023
please+0.023
 bored+0.023
 boss+0.022
 busy+0.022
Neg logit contributions
eway-0.030
ently-0.029
ered-0.029
heit-0.028
rum-0.028
Token think
Token ID892
Feature activation+5.57e-02

Pos logit contributions
please+0.144
 especially+0.136
 boss+0.125
 busy+0.122
 bored+0.121
Neg logit contributions
eway-0.201
ently-0.200
ered-0.199
aned-0.191
ides-0.189
Token I
Token ID314
Feature activation-9.74e-02

Pos logit contributions
eway+0.116
ered+0.115
ently+0.114
aned+0.110
ides+0.109
Neg logit contributions
please-0.090
 especially-0.086
 boss-0.080
 busy-0.078
 bored-0.077
Token can
Token ID460
Feature activation-5.58e-02

Pos logit contributions
please+0.155
 especially+0.149
 boss+0.136
 bored+0.131
 busy+0.131
Neg logit contributions
ently-0.208
eway-0.208
ered-0.203
aned-0.197
heit-0.194
Token help
Token ID1037
Feature activation-4.67e-02

Pos logit contributions
please+0.079
 especially+0.073
 boss+0.067
 busy+0.065
 bored+0.064
Neg logit contributions
eway-0.128
ered-0.127
ently-0.127
aned-0.122
ides-0.121
Token you
Token ID345
Feature activation-1.65e-02

Pos logit contributions
please+0.075
 especially+0.069
 responsible+0.064
 bored+0.064
 busy+0.064
Neg logit contributions
ered-0.095
ently-0.095
eway-0.092
rum-0.090
aned-0.089
Token get
Token ID651
Feature activation-9.04e-02

Pos logit contributions
please+0.028
 especially+0.026
 busy+0.024
 bored+0.024
 boss+0.024
Neg logit contributions
ered-0.033
ently-0.033
eway-0.032
rum-0.031
ides-0.031
Token down
Token ID866
Feature activation+2.44e-02

Pos logit contributions
please+0.145
 especially+0.135
 responsible+0.121
 boss+0.118
 chefs+0.118
Neg logit contributions
ered-0.189
ently-0.177
am-0.175
eway-0.175
anced-0.172
Token!"
Token ID2474
Feature activation-2.32e-02

Pos logit contributions
ered+0.054
eway+0.054
ently+0.054
aned+0.052
ides+0.052
Neg logit contributions
please-0.035
 especially-0.034
 boss-0.031
 busy-0.031
 bored-0.030
Token
Token ID220
Feature activation+1.41e-02

Pos logit contributions
please+0.036
 especially+0.034
 boss+0.031
 bored+0.031
 busy+0.030
Neg logit contributions
eway-0.052
ered-0.051
ently-0.050
ides-0.048
aned-0.048
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
eway+0.035
ered+0.035
ently+0.034
ides+0.033
aned+0.033
Neg logit contributions
please-0.017
 especially-0.015
 boss-0.014
 busy-0.014
 bored-0.014
Token
Token ID198
Feature activation-2.78e-02

Pos logit contributions
please+0.037
 especially+0.034
 boss+0.031
 bored+0.030
 busy+0.030
Neg logit contributions
eway-0.058
ently-0.058
ered-0.057
aned-0.055
ides-0.055