TOP ACTIVATIONS
MAX = 8.504

?" The rh in oc eros nodded and the
home . The puppet made sure the little girl
?" The p anda shook his head and said
. The flaming o nodded its head and
!" The rh in oc eros looked up ,
and smiled . As the puppet watched the little girl walk
." The tooth was so happy and thanked
eat . The gir affe was huge and Sally
. The rh in oc eros decided to give
present ." The tooth was so excited ! It
. The st eg osaurus came out of the
. The rh in oc eros smiled and flew
do ." The puppet and girl laughed and gigg
The rad ish thought for a moment and
The rh in oc eros was so excited
, there was a tall gir affe named Jerry . Jerry
from the tank . Hel io was thankful and thanked T
â The gir affe said that they should
said hello . The cel ery were so happy that they
." The beetle laughed and said , "

MIN ACTIVATIONS
MIN = -8.215

Look at me ! I figured out how to fold my
" What happened here ? Did you hurt yourself ?"
and asked if he had seen their orange ball , and
ally 's mom my had figured it out ! The hidden
did you do ? How could you be so careless and
, " Hey , where did the symbol go ?" It
and asked if she had seen a little boy . Lily
mom my accidentally let go of her hand . Lily lost
out a needle and carefully inserted it into the toy car
realized that Jimmy had already figured out what was about to
to learn to let go of things they do not need
He wished he could have figured out a way to help
my little mud monsters . Did you have fun in the
all lined up and waited for someone to say " go
. I don 't . Did I do something wrong ?"
't want to let go of her dream . She wanted
love her too . How would you feel if she did
y that she fell down onto the floor . But that
take a closer look . Did you manage ok ?"
shouted . Mom heard the noise and ran to

INTERVAL 4.325 - 8.504
CONTAINS 4.152%

for everyone . Mary was so happy to see
my opened the door and Lily ran inside . She saw
You have your truck ," Lily said . "
" Fish are better than ducks ." They stop
they both went flying . Tim my fell down and hurt

INTERVAL 0.145 - 4.325
CONTAINS 48.878%

basket . He looked friendly . They ran to him and
and think about what he did . Tom and
go and release the word . <|endoftext|> Once there was a
whistle and was very excited . She put the whistle to
Together , they reached the top of the tree . Tim

INTERVAL -4.035 - 0.145
CONTAINS 39.175%

the light go off . " I 'm sorry
I like this one more ," Tom says , showing a
, it â s fragile . You don â
opard . They said it was the prince of the jungle
. He always wanted to do fun things , even when

INTERVAL -8.215 - -4.035
CONTAINS 7.795%

get up and run to find mom . They show her
night after she had gone to bed , she heard a
was so good that it made her start applaud ing again
. It is a story about a princess and a dragon
for school . They both needed a uniform . Molly 's
Token?"
Token ID1701
Feature activation+1.82e+00

Pos logit contributions
anda+0.128
oc+0.126
ouse+0.124
umpy+0.123
uer+0.120
Neg logit contributions
 the-0.049
 it-0.046
 your-0.039
 this-0.039
 them-0.039
Token
Token ID198
Feature activation+4.59e-01

Pos logit contributions
anda+0.061
ouse+0.060
umpy+0.059
oc+0.057
uer+0.057
Neg logit contributions
 the-0.042
 it-0.040
 them-0.036
 her-0.035
 this-0.034
Token
Token ID198
Feature activation+3.06e-01

Pos logit contributions
ouse+0.014
anda+0.013
oc+0.013
umpy+0.013
ney+0.013
Neg logit contributions
 the-0.011
 it-0.011
 them-0.010
 your-0.010
 her-0.009
TokenThe
Token ID464
Feature activation+2.43e+00

Pos logit contributions
ouse+0.011
anda+0.011
oc+0.010
umpy+0.010
ney+0.010
Neg logit contributions
 the-0.006
 it-0.005
 them-0.005
 your-0.005
 this-0.005
Token rh
Token ID9529
Feature activation+5.97e+00

Pos logit contributions
ouse+0.066
umpy+0.061
anda+0.060
ables+0.060
jo+0.059
Neg logit contributions
 the-0.072
 it-0.066
 this-0.061
 a-0.060
 them-0.060
Tokenin
Token ID259
Feature activation+8.50e+00

Pos logit contributions
ouse+0.132
anda+0.130
ables+0.124
jo+0.122
umpy+0.120
Neg logit contributions
 the-0.183
 it-0.180
 them-0.165
 this-0.161
 your-0.160
Tokenoc
Token ID420
Feature activation+3.91e+00

Pos logit contributions
umpy+0.128
ouses+0.107
uddle+0.107
ouse+0.101
ables+0.100
Neg logit contributions
 the-0.340
 it-0.321
 this-0.303
 a-0.300
 what-0.286
Tokeneros
Token ID27498
Feature activation+6.65e+00

Pos logit contributions
ouse+0.077
umpy+0.073
anda+0.071
jo+0.066
ables+0.066
Neg logit contributions
 the-0.132
 it-0.130
 them-0.124
 your-0.121
 this-0.121
Token nodded
Token ID14464
Feature activation-7.53e-01

Pos logit contributions
umpy+0.205
ouse+0.196
ables+0.192
anda+0.188
ini+0.182
Neg logit contributions
 the-0.182
 it-0.169
 this-0.151
 a-0.149
 her-0.145
Token and
Token ID290
Feature activation+4.65e-01

Pos logit contributions
 the+0.018
 it+0.017
 them+0.017
 your+0.016
 her+0.015
Neg logit contributions
ouse-0.022
anda-0.022
oc-0.022
umpy-0.021
ney-0.021
Token the
Token ID262
Feature activation+1.96e+00

Pos logit contributions
ouse+0.012
anda+0.012
umpy+0.011
oc+0.011
ney+0.011
Neg logit contributions
 the-0.014
 it-0.013
 them-0.012
 your-0.012
 her-0.012
Token home
Token ID1363
Feature activation+8.53e-01

Pos logit contributions
 the+0.013
 it+0.013
 them+0.012
 your+0.011
 this+0.011
Neg logit contributions
ouse-0.022
anda-0.022
oc-0.021
umpy-0.021
ney-0.020
Token.
Token ID13
Feature activation+2.21e+00

Pos logit contributions
ouse+0.026
anda+0.025
oc+0.025
umpy+0.024
ney+0.024
Neg logit contributions
 the-0.019
 it-0.018
 them-0.018
 your-0.017
 anything-0.016
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
anda+0.071
ouse+0.071
oc+0.069
umpy+0.069
uer+0.068
Neg logit contributions
 the-0.051
 it-0.049
 them-0.044
 her-0.042
 your-0.042
Token
Token ID198
Feature activation+9.20e-01

Pos logit contributions
ouse+0.036
anda+0.035
oc+0.035
umpy+0.034
ney+0.033
Neg logit contributions
 the-0.019
 it-0.018
 them-0.017
 your-0.016
 her-0.015
TokenThe
Token ID464
Feature activation+3.09e+00

Pos logit contributions
ouse+0.028
oc+0.028
umpy+0.027
anda+0.027
ney+0.026
Neg logit contributions
 the-0.021
 it-0.020
 them-0.019
 anything-0.019
 your-0.018
Token puppet
Token ID30095
Feature activation+8.48e+00

Pos logit contributions
ouse+0.097
ney+0.094
oc+0.093
uer+0.093
anda+0.092
Neg logit contributions
 the-0.076
 it-0.070
 them-0.066
 this-0.064
 a-0.062
Token made
Token ID925
Feature activation-3.97e+00

Pos logit contributions
umpy+0.249
ouse+0.241
oc+0.239
anda+0.235
ney+0.227
Neg logit contributions
 the-0.234
 it-0.219
 this-0.201
 them-0.195
 her-0.194
Token sure
Token ID1654
Feature activation-3.03e+00

Pos logit contributions
 the+0.029
 it+0.026
 them+0.021
 your+0.020
 anything+0.018
Neg logit contributions
ouse-0.173
anda-0.173
oc-0.171
umpy-0.168
ney-0.164
Token the
Token ID262
Feature activation+2.64e+00

Pos logit contributions
wered+0.037
 them+0.037
 the+0.036
 your+0.033
 it+0.033
Neg logit contributions
anda-0.107
ouse-0.107
ney-0.101
umpy-0.100
ables-0.099
Token little
Token ID1310
Feature activation+1.43e+00

Pos logit contributions
ouse+0.077
oc+0.076
ney+0.073
anda+0.073
umpy+0.073
Neg logit contributions
 the-0.070
 it-0.064
 them-0.060
 this-0.059
 a-0.058
Token girl
Token ID2576
Feature activation+4.24e+00

Pos logit contributions
ouse+0.030
oc+0.029
umpy+0.029
ney+0.029
anda+0.028
Neg logit contributions
 the-0.050
 it-0.048
 them-0.045
 her-0.045
 this-0.044
Token?"
Token ID1701
Feature activation+1.88e+00

Pos logit contributions
ouse+0.025
anda+0.024
oc+0.024
umpy+0.024
ney+0.023
Neg logit contributions
 the-0.014
 it-0.013
 them-0.012
 your-0.012
 her-0.011
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
anda+0.061
ouse+0.061
umpy+0.059
uer+0.057
oc+0.056
Neg logit contributions
 the-0.044
 it-0.042
 them-0.038
 this-0.037
 your-0.036
Token
Token ID198
Feature activation+9.24e-01

Pos logit contributions
ouse+0.033
anda+0.032
oc+0.032
umpy+0.031
ney+0.030
Neg logit contributions
 the-0.028
 it-0.027
 them-0.025
 your-0.024
 her-0.024
TokenThe
Token ID464
Feature activation+1.34e+00

Pos logit contributions
ouse+0.034
anda+0.033
oc+0.032
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.017
 it-0.016
 them-0.014
 your-0.014
 this-0.013
Token p
Token ID279
Feature activation+5.45e+00

Pos logit contributions
ables+0.044
ouse+0.044
jo+0.041
uer+0.041
oc+0.041
Neg logit contributions
 the-0.034
 it-0.031
 them-0.030
 her-0.029
 this-0.028
Tokenanda
Token ID5282
Feature activation+8.43e+00

Pos logit contributions
oc+0.064
est+0.054
ouse+0.053
affe+0.053
uer+0.051
Neg logit contributions
 it-0.193
 them-0.189
 your-0.186
 the-0.186
 what-0.182
Token shook
Token ID14682
Feature activation-2.94e+00

Pos logit contributions
umpy+0.289
ouse+0.264
ables+0.261
uer+0.257
ouses+0.253
Neg logit contributions
 the-0.204
 it-0.190
 her-0.167
 a-0.161
 this-0.160
Token his
Token ID465
Feature activation+2.22e+00

Pos logit contributions
 the+0.038
 it+0.037
 anything+0.036
 them+0.036
 your+0.035
Neg logit contributions
oc-0.103
anda-0.101
ouse-0.100
jo-0.098
umpy-0.098
Token head
Token ID1182
Feature activation+4.16e+00

Pos logit contributions
oc+0.063
ouse+0.061
umpy+0.061
anda+0.060
ari+0.059
Neg logit contributions
 the-0.059
 it-0.054
 her-0.051
 your-0.051
 a-0.050
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
anda+0.138
ouse+0.137
oc+0.134
umpy+0.131
ney+0.128
Neg logit contributions
 the-0.092
 it-0.089
 them-0.080
 her-0.078
 an-0.076
Token said
Token ID531
Feature activation-7.09e-01

Pos logit contributions
ouse+0.024
anda+0.023
oc+0.023
umpy+0.022
ney+0.021
Neg logit contributions
 the-0.032
 it-0.031
 them-0.030
 your-0.029
 her-0.029
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
 the+0.031
 it+0.030
 them+0.029
 your+0.027
 anything+0.026
Neg logit contributions
ouse-0.047
anda-0.046
oc-0.045
umpy-0.045
ney-0.043
Token
Token ID220
Feature activation-9.01e-02

Pos logit contributions
ouse+0.010
anda+0.009
oc+0.009
umpy+0.009
uer+0.009
Neg logit contributions
 the-0.008
 it-0.008
 them-0.007
 your-0.007
 this-0.007
Token
Token ID198
Feature activation+3.92e-01

Pos logit contributions
 the+0.002
 it+0.002
 them+0.002
 your+0.002
 this+0.002
Neg logit contributions
ouse-0.003
anda-0.003
oc-0.003
umpy-0.003
uer-0.003
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
ouse+0.012
anda+0.012
oc+0.011
umpy+0.011
uer+0.011
Neg logit contributions
 the-0.009
 it-0.009
 them-0.009
 your-0.008
 this-0.008
TokenThe
Token ID464
Feature activation+2.30e+00

Pos logit contributions
ouse+0.008
anda+0.008
oc+0.008
umpy+0.008
ney+0.008
Neg logit contributions
 the-0.007
 it-0.006
 them-0.006
 your-0.006
 this-0.006
Token flaming
Token ID39518
Feature activation+8.39e+00

Pos logit contributions
ouse+0.073
ables+0.070
uer+0.067
oc+0.067
umpy+0.066
Neg logit contributions
 the-0.060
 it-0.056
 them-0.053
 this-0.052
 your-0.050
Tokeno
Token ID78
Feature activation+6.58e+00

Pos logit contributions
uer+0.178
ouse+0.161
suit+0.157
umpy+0.152
cases+0.149
Neg logit contributions
 the-0.258
 it-0.240
 them-0.235
 what-0.233
 her-0.232
Token nodded
Token ID14464
Feature activation-1.68e+00

Pos logit contributions
umpy+0.201
ouse+0.200
uer+0.185
anda+0.183
ables+0.182
Neg logit contributions
 the-0.171
 it-0.162
 them-0.145
 this-0.141
 her-0.135
Token its
Token ID663
Feature activation+1.87e+00

Pos logit contributions
 the+0.036
 it+0.035
 them+0.033
 your+0.032
 anything+0.031
Neg logit contributions
ouse-0.052
anda-0.051
oc-0.051
umpy-0.050
ney-0.048
Token head
Token ID1182
Feature activation+3.04e+00

Pos logit contributions
ouse+0.052
oc+0.051
ari+0.051
anda+0.050
umpy+0.049
Neg logit contributions
 the-0.050
 your-0.046
 it-0.046
 her-0.044
 an-0.044
Token and
Token ID290
Feature activation-5.85e-01

Pos logit contributions
anda+0.100
ouse+0.099
oc+0.098
umpy+0.096
ney+0.093
Neg logit contributions
 the-0.067
 it-0.065
 them-0.059
 her-0.057
 this-0.055
Token!"
Token ID2474
Feature activation+1.71e+00

Pos logit contributions
wered+0.063
 the+0.063
 it+0.061
 them+0.060
 her+0.059
Neg logit contributions
oc-0.091
ouse-0.083
anda-0.083
umpy-0.082
ables-0.081
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
anda+0.056
ouse+0.056
umpy+0.055
uer+0.052
oc+0.051
Neg logit contributions
 the-0.041
 it-0.039
 them-0.035
 this-0.033
 your-0.033
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
ouse+0.038
anda+0.038
umpy+0.036
oc+0.036
uer+0.036
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.022
 this-0.021
TokenThe
Token ID464
Feature activation+2.25e+00

Pos logit contributions
ouse+0.042
anda+0.042
umpy+0.040
int+0.039
oc+0.039
Neg logit contributions
 the-0.023
 it-0.021
 them-0.020
 your-0.019
 this-0.019
Token rh
Token ID9529
Feature activation+6.22e+00

Pos logit contributions
ouse+0.059
ables+0.055
umpy+0.055
anda+0.055
int+0.053
Neg logit contributions
 the-0.070
 it-0.065
 this-0.060
 them-0.060
 your-0.060
Tokenin
Token ID259
Feature activation+8.26e+00

Pos logit contributions
ouse+0.140
anda+0.134
ables+0.128
jo+0.128
umpy+0.126
Neg logit contributions
 the-0.189
 it-0.184
 them-0.173
 your-0.168
 this-0.166
Tokenoc
Token ID420
Feature activation+4.05e+00

Pos logit contributions
umpy+0.145
uddle+0.143
ouses+0.130
ouse+0.129
attered+0.127
Neg logit contributions
 the-0.305
 it-0.281
 a-0.274
 this-0.269
 what-0.256
Tokeneros
Token ID27498
Feature activation+5.98e+00

Pos logit contributions
ouse+0.077
umpy+0.072
anda+0.071
oc+0.066
ney+0.065
Neg logit contributions
 the-0.141
 it-0.138
 them-0.133
 your-0.129
 this-0.128
Token looked
Token ID3114
Feature activation-2.17e+00

Pos logit contributions
umpy+0.194
ouse+0.190
anda+0.186
ables+0.178
jo+0.177
Neg logit contributions
 the-0.151
 it-0.135
 a-0.122
 this-0.120
 an-0.116
Token up
Token ID510
Feature activation-5.74e-01

Pos logit contributions
 the+0.049
 it+0.047
 them+0.044
 your+0.041
 her+0.041
Neg logit contributions
ouse-0.070
anda-0.069
oc-0.067
umpy-0.067
ney-0.066
Token,
Token ID11
Feature activation+9.66e-01

Pos logit contributions
 the+0.014
 it+0.014
 them+0.013
 your+0.013
 anything+0.013
Neg logit contributions
ouse-0.015
anda-0.015
oc-0.015
umpy-0.015
ney-0.014
Token and
Token ID290
Feature activation-4.92e-01

Pos logit contributions
anda+0.049
ouse+0.049
oc+0.048
umpy+0.048
ney+0.046
Neg logit contributions
 the-0.025
 it-0.024
 them-0.021
 her-0.020
 this-0.020
Token smiled
Token ID13541
Feature activation+7.37e-01

Pos logit contributions
 the+0.012
 it+0.012
 them+0.011
 your+0.011
 her+0.011
Neg logit contributions
ouse-0.014
anda-0.014
oc-0.014
umpy-0.014
ney-0.013
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
anda+0.026
ouse+0.026
oc+0.025
umpy+0.025
ney+0.024
Neg logit contributions
 the-0.015
 it-0.014
 them-0.013
 her-0.012
 your-0.012
Token As
Token ID1081
Feature activation-3.52e+00

Pos logit contributions
anda+0.040
ouse+0.040
oc+0.038
umpy+0.038
uer+0.037
Neg logit contributions
 the-0.038
 it-0.036
 them-0.033
 her-0.032
 your-0.032
Token the
Token ID262
Feature activation+3.05e+00

Pos logit contributions
 them+0.046
 the+0.043
 yourselves+0.042
 your+0.041
 it+0.041
Neg logit contributions
oc-0.124
ouse-0.122
anda-0.120
umpy-0.115
jo-0.115
Token puppet
Token ID30095
Feature activation+8.25e+00

Pos logit contributions
ouse+0.091
oc+0.091
ney+0.090
uer+0.089
ables+0.088
Neg logit contributions
 the-0.080
 it-0.074
 them-0.068
 this-0.068
 a-0.066
Token watched
Token ID7342
Feature activation-3.56e+00

Pos logit contributions
ouse+0.246
umpy+0.244
anda+0.239
oc+0.237
ney+0.231
Neg logit contributions
 the-0.224
 it-0.207
 this-0.190
 them-0.188
 her-0.183
Token the
Token ID262
Feature activation+3.32e+00

Pos logit contributions
 the+0.032
 it+0.032
 yourselves+0.031
 your+0.031
 yours+0.031
Neg logit contributions
ouse-0.132
oc-0.132
anda-0.131
umpy-0.126
jo-0.125
Token little
Token ID1310
Feature activation+1.51e+00

Pos logit contributions
ouse+0.100
ney+0.097
oc+0.097
uer+0.095
anda+0.095
Neg logit contributions
 the-0.087
 it-0.079
 them-0.075
 this-0.072
 a-0.071
Token girl
Token ID2576
Feature activation+3.99e+00

Pos logit contributions
ouse+0.031
oc+0.030
anda+0.030
ney+0.029
umpy+0.029
Neg logit contributions
 the-0.055
 it-0.052
 them-0.049
 her-0.048
 your-0.048
Token walk
Token ID2513
Feature activation-4.54e-01

Pos logit contributions
umpy+0.130
ouse+0.130
anda+0.125
oc+0.123
ney+0.122
Neg logit contributions
 the-0.097
 it-0.091
 them-0.081
 this-0.079
 her-0.078
Token."
Token ID526
Feature activation+1.93e+00

Pos logit contributions
ouse+0.057
anda+0.055
oc+0.054
umpy+0.054
ney+0.053
Neg logit contributions
 the-0.038
 it-0.037
 them-0.036
 her-0.033
 your-0.033
Token
Token ID220
Feature activation+7.74e-01

Pos logit contributions
ouse+0.056
anda+0.055
umpy+0.054
oc+0.054
uer+0.052
Neg logit contributions
 the-0.049
 it-0.047
 them-0.044
 your-0.042
 this-0.042
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
ouse+0.023
anda+0.023
oc+0.023
umpy+0.022
ney+0.021
Neg logit contributions
 the-0.019
 it-0.018
 them-0.017
 your-0.016
 her-0.016
Token
Token ID198
Feature activation+7.97e-01

Pos logit contributions
ouse+0.034
anda+0.034
oc+0.033
umpy+0.033
uer+0.032
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.022
 this-0.022
TokenThe
Token ID464
Feature activation+1.90e+00

Pos logit contributions
ouse+0.026
oc+0.025
anda+0.025
umpy+0.025
ney+0.024
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.015
 anything-0.014
Token tooth
Token ID16162
Feature activation+8.24e+00

Pos logit contributions
ouse+0.052
anda+0.052
umpy+0.051
oc+0.050
ables+0.049
Neg logit contributions
 the-0.052
 it-0.049
 this-0.045
 your-0.044
 her-0.044
Token was
Token ID373
Feature activation-1.55e+00

Pos logit contributions
umpy+0.250
ables+0.229
anda+0.226
ashing+0.222
oc+0.221
Neg logit contributions
 it-0.209
 the-0.207
 this-0.183
 her-0.177
 a-0.177
Token so
Token ID523
Feature activation-1.35e+00

Pos logit contributions
 the+0.036
 it+0.036
 them+0.034
 your+0.033
 anything+0.032
Neg logit contributions
ouse-0.045
anda-0.045
oc-0.044
umpy-0.043
ney-0.042
Token happy
Token ID3772
Feature activation-2.09e-01

Pos logit contributions
 the+0.037
 it+0.035
 her+0.032
 your+0.031
 them+0.031
Neg logit contributions
oc-0.039
anda-0.039
ouse-0.039
ney-0.036
ables-0.036
Token and
Token ID290
Feature activation+8.00e-01

Pos logit contributions
 the+0.004
 them+0.004
 it+0.004
 your+0.004
 anything+0.004
Neg logit contributions
ouse-0.006
anda-0.006
oc-0.006
umpy-0.006
ney-0.006
Token thanked
Token ID26280
Feature activation-4.29e+00

Pos logit contributions
anda+0.023
ouse+0.023
oc+0.022
umpy+0.022
ney+0.021
Neg logit contributions
 the-0.021
 it-0.021
 them-0.019
 your-0.019
 this-0.018
Token eat
Token ID4483
Feature activation-2.99e+00

Pos logit contributions
 it+0.049
 the+0.049
 anything+0.046
 them+0.046
 your+0.045
Neg logit contributions
oc-0.110
ouse-0.107
anda-0.104
affe-0.100
umpy-0.098
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 the+0.061
 it+0.058
 your+0.057
wered+0.057
 anything+0.056
Neg logit contributions
oc-0.085
ouse-0.082
anda-0.081
umpy-0.077
affe-0.075
Token
Token ID198
Feature activation+4.88e-01

Pos logit contributions
ouse+0.046
anda+0.045
oc+0.044
umpy+0.044
ney+0.042
Neg logit contributions
 the-0.036
 it-0.035
 them-0.034
 your-0.032
 anything-0.031
Token
Token ID198
Feature activation+3.94e-01

Pos logit contributions
ouse+0.014
anda+0.014
oc+0.014
umpy+0.013
ney+0.013
Neg logit contributions
 the-0.012
 it-0.012
 them-0.011
 your-0.011
 her-0.010
TokenThe
Token ID464
Feature activation+2.73e+00

Pos logit contributions
ouse+0.011
anda+0.010
oc+0.010
umpy+0.010
ney+0.010
Neg logit contributions
 the-0.011
 it-0.010
 them-0.010
 your-0.009
 her-0.009
Token gir
Token ID37370
Feature activation+8.23e+00

Pos logit contributions
ouse+0.088
uer+0.080
umpy+0.079
int+0.078
ables+0.078
Neg logit contributions
 the-0.074
 it-0.065
 them-0.064
 this-0.062
 a-0.059
Tokenaffe
Token ID21223
Feature activation+6.21e+00

Pos logit contributions
ouse+0.147
suit+0.117
uer+0.113
ables+0.106
umpy+0.106
Neg logit contributions
 the-0.315
 it-0.289
 this-0.285
 them-0.272
 that-0.269
Token was
Token ID373
Feature activation-1.66e+00

Pos logit contributions
umpy+0.216
ouse+0.206
uer+0.197
ini+0.192
anda+0.185
Neg logit contributions
 the-0.161
 it-0.150
 this-0.130
 her-0.125
 a-0.123
Token huge
Token ID3236
Feature activation+1.96e+00

Pos logit contributions
 the+0.037
 it+0.035
 them+0.032
 her+0.031
 your+0.031
Neg logit contributions
ouse-0.055
oc-0.055
anda-0.054
umpy-0.052
uer-0.052
Token and
Token ID290
Feature activation-6.12e-01

Pos logit contributions
ouse+0.066
umpy+0.062
anda+0.062
uer+0.061
oc+0.061
Neg logit contributions
 the-0.045
 it-0.042
 them-0.037
 this-0.036
 her-0.036
Token Sally
Token ID25737
Feature activation+5.79e+00

Pos logit contributions
 the+0.015
 it+0.015
 them+0.014
 your+0.013
 her+0.013
Neg logit contributions
ouse-0.018
anda-0.018
oc-0.017
umpy-0.017
ney-0.017
Token.
Token ID13
Feature activation+2.92e+00

Pos logit contributions
 the+0.015
 it+0.014
 them+0.014
 your+0.013
 their+0.013
Neg logit contributions
ouse-0.022
oc-0.022
anda-0.022
umpy-0.022
ney-0.021
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
ouse+0.108
anda+0.106
uer+0.106
umpy+0.104
int+0.096
Neg logit contributions
 the-0.062
 it-0.059
 this-0.048
 them-0.046
 your-0.045
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
ouse+0.040
anda+0.039
oc+0.038
umpy+0.038
uer+0.037
Neg logit contributions
 the-0.033
 it-0.032
 them-0.030
 your-0.029
 this-0.028
TokenThe
Token ID464
Feature activation+2.67e+00

Pos logit contributions
ouse+0.045
anda+0.045
oc+0.042
umpy+0.042
uer+0.041
Neg logit contributions
 the-0.030
 it-0.028
 them-0.026
 your-0.025
 this-0.025
Token rh
Token ID9529
Feature activation+6.24e+00

Pos logit contributions
ouse+0.076
ables+0.072
jo+0.069
anda+0.069
umpy+0.069
Neg logit contributions
 the-0.078
 it-0.072
 this-0.067
 them-0.065
 a-0.065
Tokenin
Token ID259
Feature activation+8.22e+00

Pos logit contributions
ouse+0.143
anda+0.137
ables+0.135
jo+0.135
suit+0.133
Neg logit contributions
 the-0.190
 it-0.185
 them-0.173
 your-0.166
 this-0.164
Tokenoc
Token ID420
Feature activation+4.07e+00

Pos logit contributions
ouses+0.154
ouse+0.148
suit+0.144
uddle+0.142
ables+0.136
Neg logit contributions
 the-0.300
 it-0.275
 this-0.262
 a-0.256
 what-0.250
Tokeneros
Token ID27498
Feature activation+6.11e+00

Pos logit contributions
ouse+0.080
umpy+0.074
anda+0.073
oc+0.068
jo+0.068
Neg logit contributions
 the-0.140
 it-0.137
 them-0.132
 your-0.128
 this-0.127
Token decided
Token ID3066
Feature activation-1.85e+00

Pos logit contributions
ouse+0.211
umpy+0.206
anda+0.196
ables+0.195
ouses+0.194
Neg logit contributions
 the-0.154
 it-0.133
 this-0.117
 a-0.117
 them-0.110
Token to
Token ID284
Feature activation-2.46e+00

Pos logit contributions
wered+0.030
 them+0.030
 the+0.029
 yourselves+0.027
 such+0.027
Neg logit contributions
ouse-0.058
anda-0.056
oc-0.055
uer-0.054
ables-0.054
Token give
Token ID1577
Feature activation-3.57e+00

Pos logit contributions
 the+0.040
 it+0.040
 them+0.037
 anything+0.035
 your+0.035
Neg logit contributions
ouse-0.084
oc-0.084
anda-0.082
umpy-0.079
ney-0.079
Token present
Token ID1944
Feature activation+3.20e+00

Pos logit contributions
ouse+0.070
anda+0.069
oc+0.068
umpy+0.067
ney+0.064
Neg logit contributions
 the-0.062
 it-0.060
 them-0.057
 your-0.055
 her-0.054
Token."
Token ID526
Feature activation+1.94e+00

Pos logit contributions
ouse+0.111
anda+0.111
oc+0.110
umpy+0.108
ney+0.105
Neg logit contributions
 the-0.060
 it-0.058
 them-0.053
 your-0.051
 this-0.049
Token
Token ID198
Feature activation+9.24e-01

Pos logit contributions
ouse+0.057
anda+0.056
oc+0.055
umpy+0.054
ney+0.052
Neg logit contributions
 the-0.048
 it-0.047
 them-0.044
 your-0.042
 this-0.041
Token
Token ID198
Feature activation+7.02e-01

Pos logit contributions
ouse+0.026
anda+0.026
oc+0.026
umpy+0.025
ney+0.024
Neg logit contributions
 the-0.023
 it-0.022
 them-0.021
 your-0.021
 her-0.020
TokenThe
Token ID464
Feature activation+1.82e+00

Pos logit contributions
ouse+0.023
oc+0.023
anda+0.023
umpy+0.022
ney+0.022
Neg logit contributions
 the-0.014
 it-0.014
 them-0.013
 your-0.012
 anything-0.012
Token tooth
Token ID16162
Feature activation+8.20e+00

Pos logit contributions
ouse+0.049
anda+0.048
umpy+0.047
oc+0.047
ables+0.046
Neg logit contributions
 the-0.050
 it-0.048
 this-0.044
 your-0.043
 her-0.043
Token was
Token ID373
Feature activation-1.82e+00

Pos logit contributions
umpy+0.252
ables+0.231
anda+0.228
ang+0.225
oc+0.225
Neg logit contributions
 it-0.203
 the-0.203
 a-0.176
 this-0.176
 her-0.174
Token so
Token ID523
Feature activation-1.54e+00

Pos logit contributions
 the+0.041
 it+0.040
 them+0.038
 your+0.036
 anything+0.035
Neg logit contributions
ouse-0.055
anda-0.054
oc-0.054
umpy-0.053
ney-0.051
Token excited
Token ID6568
Feature activation+5.51e-01

Pos logit contributions
 the+0.042
 it+0.039
 her+0.036
 your+0.036
 their+0.034
Neg logit contributions
oc-0.045
anda-0.044
ouse-0.044
ables-0.042
affe-0.042
Token!
Token ID0
Feature activation+1.41e+00

Pos logit contributions
ouse+0.018
anda+0.018
oc+0.017
umpy+0.017
ney+0.016
Neg logit contributions
 the-0.011
 it-0.011
 them-0.010
 your-0.010
 anything-0.009
Token It
Token ID632
Feature activation+3.38e+00

Pos logit contributions
ouse+0.041
anda+0.041
oc+0.039
umpy+0.037
uer+0.037
Neg logit contributions
 the-0.038
 it-0.036
 your-0.033
 them-0.032
 this-0.032
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
ouse+0.108
anda+0.107
oc+0.106
umpy+0.104
ney+0.101
Neg logit contributions
 the-0.067
 it-0.064
 them-0.061
 your-0.057
 her-0.055
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
ouse+0.036
anda+0.036
oc+0.035
umpy+0.034
ney+0.033
Neg logit contributions
 the-0.031
 it-0.030
 them-0.029
 your-0.027
 her-0.027
Token
Token ID198
Feature activation+5.57e-02

Pos logit contributions
ouse+0.007
anda+0.007
oc+0.007
umpy+0.007
ney+0.007
Neg logit contributions
 the-0.007
 it-0.007
 them-0.007
 your-0.006
 anything-0.006
TokenThe
Token ID464
Feature activation+2.16e+00

Pos logit contributions
ouse+0.002
oc+0.001
anda+0.001
umpy+0.001
ney+0.001
Neg logit contributions
 the-0.001
 it-0.001
 them-0.001
 anything-0.001
 your-0.001
Token st
Token ID336
Feature activation+5.58e+00

Pos logit contributions
ouse+0.062
uer+0.059
anda+0.058
ables+0.057
ney+0.057
Neg logit contributions
 the-0.061
 it-0.056
 this-0.053
 them-0.053
 your-0.051
Tokeneg
Token ID1533
Feature activation+8.19e+00

Pos logit contributions
umpy+0.085
anda+0.079
ouse+0.079
int+0.077
uer+0.076
Neg logit contributions
 the-0.187
 it-0.183
 your-0.171
 them-0.170
 this-0.168
Tokenosaurus
Token ID47650
Feature activation+7.48e+00

Pos logit contributions
umpy+0.149
uer+0.148
uddle+0.136
int+0.135
ishy+0.133
Neg logit contributions
 it-0.255
 the-0.254
 him-0.247
 them-0.235
 this-0.232
Token came
Token ID1625
Feature activation-1.51e+00

Pos logit contributions
umpy+0.240
ouse+0.227
anda+0.219
ini+0.214
uer+0.212
Neg logit contributions
 the-0.191
 it-0.179
 this-0.158
 them-0.156
 a-0.153
Token out
Token ID503
Feature activation-6.20e-01

Pos logit contributions
 the+0.027
 it+0.025
 them+0.024
 your+0.022
 her+0.021
Neg logit contributions
ouse-0.056
anda-0.055
oc-0.054
umpy-0.053
ney-0.052
Token of
Token ID286
Feature activation-4.77e+00

Pos logit contributions
 the+0.015
 it+0.015
 them+0.014
 your+0.013
 her+0.013
Neg logit contributions
ouse-0.018
anda-0.018
oc-0.018
umpy-0.017
ney-0.017
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
 happen+0.045
 respect+0.040
 listen+0.038
 ourselves+0.037
 anything+0.036
Neg logit contributions
oc-0.168
ney-0.162
man-0.155
suit-0.151
pit-0.150
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
ouse+0.118
anda+0.116
oc+0.115
umpy+0.113
ney+0.110
Neg logit contributions
 the-0.086
 it-0.083
 them-0.079
 your-0.077
 her-0.073
Token
Token ID198
Feature activation+3.17e-01

Pos logit contributions
ouse+0.055
anda+0.054
umpy+0.052
uer+0.052
oc+0.051
Neg logit contributions
 the-0.043
 it-0.041
 them-0.038
 this-0.035
 your-0.035
Token
Token ID198
Feature activation+4.27e-01

Pos logit contributions
ouse+0.009
anda+0.009
oc+0.009
umpy+0.009
ney+0.009
Neg logit contributions
 the-0.008
 it-0.007
 them-0.007
 your-0.007
 her-0.006
TokenThe
Token ID464
Feature activation+2.16e+00

Pos logit contributions
ouse+0.013
oc+0.013
anda+0.013
umpy+0.013
ney+0.013
Neg logit contributions
 the-0.009
 it-0.009
 them-0.009
 your-0.008
 her-0.008
Token rh
Token ID9529
Feature activation+5.98e+00

Pos logit contributions
ouse+0.059
anda+0.054
umpy+0.053
ables+0.052
jo+0.052
Neg logit contributions
 the-0.066
 it-0.061
 this-0.056
 them-0.056
 a-0.055
Tokenin
Token ID259
Feature activation+8.18e+00

Pos logit contributions
ouse+0.136
anda+0.126
jo+0.125
ables+0.124
suit+0.123
Neg logit contributions
 the-0.185
 it-0.180
 them-0.169
 this-0.160
 your-0.159
Tokenoc
Token ID420
Feature activation+3.93e+00

Pos logit contributions
ouse+0.131
umpy+0.127
ouses+0.126
uddle+0.120
suit+0.118
Neg logit contributions
 the-0.313
 it-0.291
 this-0.278
 a-0.268
 what-0.265
Tokeneros
Token ID27498
Feature activation+6.14e+00

Pos logit contributions
ouse+0.084
umpy+0.078
anda+0.074
jo+0.071
ables+0.071
Neg logit contributions
 the-0.129
 it-0.126
 them-0.121
 your-0.118
 this-0.117
Token smiled
Token ID13541
Feature activation+1.08e+00

Pos logit contributions
ouse+0.192
umpy+0.189
anda+0.183
uer+0.177
ables+0.174
Neg logit contributions
 the-0.161
 it-0.147
 this-0.131
 them-0.124
 her-0.123
Token and
Token ID290
Feature activation+7.65e-01

Pos logit contributions
anda+0.037
ouse+0.036
oc+0.036
umpy+0.035
ney+0.035
Neg logit contributions
 the-0.024
 it-0.023
 them-0.020
 her-0.020
 an-0.020
Token flew
Token ID13112
Feature activation-1.81e+00

Pos logit contributions
anda+0.021
ouse+0.020
umpy+0.020
oc+0.020
ney+0.019
Neg logit contributions
 the-0.022
 it-0.021
 them-0.019
 her-0.019
 this-0.019
Token do
Token ID466
Feature activation-1.32e+00

Pos logit contributions
ouse+0.005
anda+0.004
oc+0.004
umpy+0.004
ney+0.004
Neg logit contributions
 the-0.003
 it-0.003
 them-0.003
 your-0.003
 this-0.003
Token."
Token ID526
Feature activation+2.00e+00

Pos logit contributions
 the+0.029
 them+0.028
wered+0.028
 it+0.027
 her+0.027
Neg logit contributions
ouse-0.037
oc-0.036
anda-0.036
uer-0.034
umpy-0.034
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
anda+0.069
ouse+0.068
umpy+0.067
oc+0.066
uer+0.064
Neg logit contributions
 the-0.046
 it-0.043
 them-0.039
 her-0.038
 their-0.036
Token
Token ID198
Feature activation+8.35e-01

Pos logit contributions
ouse+0.037
anda+0.037
oc+0.036
umpy+0.036
uer+0.035
Neg logit contributions
 the-0.027
 it-0.026
 them-0.024
 your-0.023
 her-0.022
TokenThe
Token ID464
Feature activation+2.66e+00

Pos logit contributions
ouse+0.029
anda+0.028
oc+0.028
umpy+0.027
ney+0.027
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.014
 this-0.014
Token puppet
Token ID30095
Feature activation+8.17e+00

Pos logit contributions
ouse+0.064
oc+0.062
ney+0.061
anda+0.061
uer+0.060
Neg logit contributions
 the-0.084
 it-0.080
 them-0.076
 this-0.074
 your-0.072
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
umpy+0.258
oc+0.243
ouse+0.243
anda+0.236
ney+0.231
Neg logit contributions
 the-0.219
 it-0.202
 this-0.184
 them-0.184
 her-0.178
Token girl
Token ID2576
Feature activation+3.63e+00

Pos logit contributions
 the+0.004
 it+0.003
 them+0.002
 your+0.001
 anything+0.000
Neg logit contributions
ouse-0.089
anda-0.089
oc-0.089
umpy-0.086
ney-0.085
Token laughed
Token ID13818
Feature activation+1.15e-01

Pos logit contributions
ouse+0.114
umpy+0.114
oc+0.112
anda+0.110
uer+0.107
Neg logit contributions
 the-0.091
 it-0.083
 them-0.079
 their-0.075
 her-0.075
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
ouse+0.004
anda+0.004
oc+0.004
umpy+0.004
ney+0.004
Neg logit contributions
 the-0.003
 it-0.003
 them-0.002
 her-0.002
 their-0.002
Token gigg
Token ID30442
Feature activation+1.32e+00

Pos logit contributions
ouse+0.006
anda+0.006
umpy+0.006
oc+0.006
ney+0.006
Neg logit contributions
 the-0.006
 it-0.006
 them-0.005
 her-0.005
 your-0.005
Token
Token ID220
Feature activation+1.10e+00

Pos logit contributions
ouse+0.056
anda+0.056
umpy+0.054
oc+0.053
ney+0.052
Neg logit contributions
 the-0.044
 it-0.042
 them-0.039
 your-0.037
 this-0.037
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
ouse+0.034
anda+0.033
oc+0.033
umpy+0.032
uer+0.031
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.022
 her-0.022
Token
Token ID198
Feature activation+9.20e-01

Pos logit contributions
ouse+0.036
anda+0.035
oc+0.034
umpy+0.034
uer+0.033
Neg logit contributions
 the-0.027
 it-0.026
 them-0.025
 your-0.023
 this-0.023
TokenThe
Token ID464
Feature activation+9.44e-01

Pos logit contributions
ouse+0.033
anda+0.032
oc+0.031
umpy+0.031
ney+0.030
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.014
 this-0.014
Token rad
Token ID2511
Feature activation+6.04e+00

Pos logit contributions
ouse+0.028
anda+0.027
oc+0.027
umpy+0.026
ney+0.026
Neg logit contributions
 the-0.024
 it-0.023
 them-0.021
 your-0.020
 her-0.020
Tokenish
Token ID680
Feature activation+8.13e+00

Pos logit contributions
umpy+0.096
ouse+0.095
jo+0.092
ari+0.089
uer+0.082
Neg logit contributions
 it-0.227
 the-0.226
 them-0.211
 your-0.209
 what-0.206
Token thought
Token ID1807
Feature activation-2.28e+00

Pos logit contributions
umpy+0.242
ouse+0.231
ari+0.228
ables+0.225
anda+0.225
Neg logit contributions
 the-0.218
 it-0.213
 her-0.186
 this-0.185
 them-0.176
Token for
Token ID329
Feature activation-3.04e+00

Pos logit contributions
 the+0.038
 yourselves+0.038
 them+0.037
 it+0.036
 your+0.035
Neg logit contributions
anda-0.073
oc-0.071
ouse-0.071
affe-0.070
umpy-0.069
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
 your+0.039
 yourselves+0.039
 yours+0.038
 them+0.037
 these+0.033
Neg logit contributions
oc-0.095
ney-0.088
jo-0.085
affe-0.083
est-0.083
Token moment
Token ID2589
Feature activation+1.81e+00

Pos logit contributions
ouse+0.034
anda+0.034
umpy+0.032
oc+0.032
ney+0.031
Neg logit contributions
 the-0.035
 it-0.034
 them-0.031
 your-0.030
 this-0.030
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
ouse+0.048
oc+0.047
anda+0.046
umpy+0.045
ney+0.044
Neg logit contributions
 the-0.047
 it-0.045
 them-0.044
 your-0.043
 anything-0.042
Token
Token ID220
Feature activation+4.11e-01

Pos logit contributions
ouse+0.053
anda+0.052
umpy+0.050
uer+0.049
ney+0.047
Neg logit contributions
 the-0.038
 it-0.036
 them-0.032
 this-0.031
 their-0.030
Token
Token ID198
Feature activation+5.92e-01

Pos logit contributions
ouse+0.012
anda+0.012
oc+0.012
umpy+0.012
uer+0.011
Neg logit contributions
 the-0.010
 it-0.009
 them-0.009
 your-0.008
 this-0.008
Token
Token ID198
Feature activation+7.07e-01

Pos logit contributions
anda+0.019
ouse+0.018
oc+0.018
umpy+0.018
uer+0.017
Neg logit contributions
 the-0.014
 it-0.013
 them-0.012
 your-0.012
 this-0.011
TokenThe
Token ID464
Feature activation+2.35e+00

Pos logit contributions
ouse+0.024
anda+0.023
oc+0.023
umpy+0.023
ney+0.022
Neg logit contributions
 the-0.014
 it-0.014
 them-0.013
 your-0.012
 this-0.012
Token rh
Token ID9529
Feature activation+5.95e+00

Pos logit contributions
ouse+0.069
jo+0.063
anda+0.063
umpy+0.063
ables+0.062
Neg logit contributions
 the-0.067
 it-0.061
 this-0.056
 a-0.056
 them-0.055
Tokenin
Token ID259
Feature activation+8.12e+00

Pos logit contributions
ouse+0.135
anda+0.130
jo+0.128
ables+0.122
suit+0.120
Neg logit contributions
 the-0.183
 it-0.179
 them-0.167
 your-0.160
 this-0.158
Tokenoc
Token ID420
Feature activation+4.00e+00

Pos logit contributions
ouses+0.140
ouse+0.138
suit+0.135
cases+0.133
umpy+0.130
Neg logit contributions
 the-0.295
 it-0.271
 this-0.261
 a-0.250
 help-0.240
Tokeneros
Token ID27498
Feature activation+6.13e+00

Pos logit contributions
ouse+0.084
umpy+0.078
anda+0.075
jo+0.072
ables+0.071
Neg logit contributions
 the-0.133
 it-0.130
 them-0.124
 your-0.121
 this-0.120
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
ouse+0.197
umpy+0.194
anda+0.188
uer+0.186
ables+0.179
Neg logit contributions
 the-0.158
 it-0.140
 this-0.127
 a-0.119
 her-0.114
Token so
Token ID523
Feature activation-1.07e+00

Pos logit contributions
 the+0.028
 it+0.027
 them+0.025
 your+0.024
 her+0.024
Neg logit contributions
ouse-0.040
anda-0.039
oc-0.039
umpy-0.038
ney-0.037
Token excited
Token ID6568
Feature activation+3.85e-02

Pos logit contributions
 the+0.028
 it+0.026
 your+0.023
 their+0.023
 her+0.023
Neg logit contributions
ouse-0.033
anda-0.033
oc-0.033
affe-0.031
ari-0.031
Token,
Token ID11
Feature activation-9.02e-01

Pos logit contributions
anda+0.031
ouse+0.031
oc+0.030
umpy+0.029
ney+0.028
Neg logit contributions
 the-0.025
 it-0.025
 them-0.024
 your-0.023
 anything-0.023
Token there
Token ID612
Feature activation+2.15e+00

Pos logit contributions
 the+0.013
 it+0.013
 them+0.012
 anything+0.011
 your+0.011
Neg logit contributions
ouse-0.033
anda-0.033
oc-0.032
umpy-0.032
ney-0.031
Token was
Token ID373
Feature activation-4.06e+00

Pos logit contributions
ouse+0.048
oc+0.048
anda+0.047
umpy+0.045
ney+0.044
Neg logit contributions
 the-0.063
 it-0.061
 them-0.060
 your-0.059
 anything-0.058
Token a
Token ID257
Feature activation+2.34e+00

Pos logit contributions
 it+0.036
 them+0.036
 anything+0.034
 permission+0.033
 the+0.033
Neg logit contributions
oc-0.154
ouse-0.151
umpy-0.151
anda-0.146
ney-0.146
Token tall
Token ID7331
Feature activation+4.71e+00

Pos logit contributions
ouse+0.071
anda+0.071
ables+0.070
jo+0.070
umpy+0.069
Neg logit contributions
 the-0.058
 it-0.054
 your-0.048
 a-0.046
 their-0.046
Token gir
Token ID37370
Feature activation+8.12e+00

Pos logit contributions
ouse+0.121
anda+0.116
umpy+0.113
uer+0.111
ables+0.109
Neg logit contributions
 the-0.145
 it-0.136
 this-0.124
 your-0.122
 them-0.118
Tokenaffe
Token ID21223
Feature activation+5.07e+00

Pos logit contributions
ouse+0.114
suit+0.087
ouses+0.086
uer+0.085
ney+0.083
Neg logit contributions
 the-0.322
 it-0.308
 this-0.289
 your-0.281
 them-0.280
Token named
Token ID3706
Feature activation-2.84e+00

Pos logit contributions
ouse+0.170
anda+0.166
uer+0.165
umpy+0.163
uddle+0.156
Neg logit contributions
 the-0.113
 it-0.106
 this-0.092
 your-0.091
 her-0.090
Token Jerry
Token ID13075
Feature activation+4.60e+00

Pos logit contributions
 the+0.049
 it+0.048
 them+0.046
 your+0.043
 anything+0.042
Neg logit contributions
anda-0.097
ouse-0.097
oc-0.096
umpy-0.093
ney-0.091
Token.
Token ID13
Feature activation+3.67e-01

Pos logit contributions
ouse+0.151
anda+0.149
oc+0.147
umpy+0.145
ney+0.141
Neg logit contributions
 the-0.095
 it-0.091
 them-0.086
 your-0.081
 her-0.079
Token Jerry
Token ID13075
Feature activation+4.99e+00

Pos logit contributions
ouse+0.013
anda+0.013
oc+0.012
umpy+0.012
uer+0.012
Neg logit contributions
 the-0.007
 it-0.007
 them-0.006
 your-0.006
 her-0.006
Token from
Token ID422
Feature activation-4.08e+00

Pos logit contributions
ouse+0.115
anda+0.114
oc+0.113
umpy+0.112
ney+0.107
Neg logit contributions
 the-0.074
 it-0.071
 them-0.066
 your-0.063
 this-0.061
Token the
Token ID262
Feature activation+1.72e+00

Pos logit contributions
 it+0.022
 the+0.019
 your+0.017
 anything+0.017
 them+0.016
Neg logit contributions
oc-0.167
anda-0.162
jo-0.160
umpy-0.156
ouse-0.155
Token tank
Token ID6873
Feature activation+4.32e+00

Pos logit contributions
oc+0.045
ouse+0.044
anda+0.044
umpy+0.043
ney+0.043
Neg logit contributions
 the-0.042
 it-0.042
 them-0.041
 her-0.039
 your-0.039
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
ouse+0.147
anda+0.146
oc+0.142
umpy+0.141
uer+0.136
Neg logit contributions
 the-0.088
 it-0.083
 them-0.076
 this-0.072
 your-0.072
Token Hel
Token ID5053
Feature activation+2.15e+00

Pos logit contributions
anda+0.052
ouse+0.051
oc+0.049
umpy+0.049
ney+0.048
Neg logit contributions
 the-0.044
 it-0.042
 them-0.039
 her-0.038
 your-0.037
Tokenio
Token ID952
Feature activation+8.12e+00

Pos logit contributions
ouse+0.044
oc+0.041
umpy+0.041
anda+0.040
ney+0.039
Neg logit contributions
 the-0.072
 it-0.070
 them-0.068
 your-0.065
 this-0.064
Token was
Token ID373
Feature activation-1.29e+00

Pos logit contributions
umpy+0.248
ouse+0.230
uddle+0.224
uer+0.223
anda+0.219
Neg logit contributions
 the-0.208
 it-0.192
 a-0.171
 them-0.170
 this-0.169
Token thankful
Token ID25535
Feature activation-7.10e-01

Pos logit contributions
 the+0.032
 it+0.031
 them+0.029
 your+0.028
 her+0.027
Neg logit contributions
ouse-0.037
anda-0.037
oc-0.036
umpy-0.035
ney-0.034
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
 the+0.013
 it+0.013
 them+0.012
 your+0.012
 anything+0.011
Neg logit contributions
ouse-0.024
anda-0.023
oc-0.023
umpy-0.022
ney-0.022
Token thanked
Token ID26280
Feature activation-4.22e+00

Pos logit contributions
ouse+0.005
anda+0.005
oc+0.005
umpy+0.005
ney+0.004
Neg logit contributions
 the-0.005
 it-0.005
 them-0.005
 your-0.005
 her-0.005
Token T
Token ID309
Feature activation+3.95e+00

Pos logit contributions
 the+0.059
 it+0.058
 your+0.055
 them+0.052
 anything+0.052
Neg logit contributions
oc-0.147
ouse-0.143
anda-0.143
jo-0.139
ari-0.136
Tokenâ
Token ID22940
Feature activation+3.00e+00

Pos logit contributions
anda+0.019
ouse+0.019
uer+0.018
umpy+0.018
ney+0.018
Neg logit contributions
 the-0.016
 it-0.015
 your-0.013
 them-0.013
 this-0.013
Token
Token ID26391
Feature activation+3.79e+00

Pos logit contributions
ouse+0.078
anda+0.078
oc+0.074
umpy+0.073
uer+0.071
Neg logit contributions
 the-0.082
 it-0.079
 them-0.075
 your-0.074
 this-0.073
Token
Token ID198
Feature activation+1.26e+00

Pos logit contributions
anda+0.106
ouse+0.103
umpy+0.101
uer+0.097
int+0.096
Neg logit contributions
 the-0.100
 it-0.098
 this-0.086
 your-0.085
 them-0.083
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
ouse+0.035
anda+0.035
oc+0.034
umpy+0.034
ney+0.033
Neg logit contributions
 the-0.032
 it-0.031
 them-0.029
 your-0.028
 her-0.027
TokenThe
Token ID464
Feature activation+2.59e+00

Pos logit contributions
ouse+0.035
anda+0.034
oc+0.033
umpy+0.033
ney+0.032
Neg logit contributions
 the-0.022
 it-0.021
 them-0.020
 your-0.019
 this-0.018
Token gir
Token ID37370
Feature activation+8.10e+00

Pos logit contributions
ouse+0.064
jo+0.058
ables+0.056
ushed+0.055
int+0.054
Neg logit contributions
 the-0.089
 it-0.079
 a-0.078
 your-0.077
 this-0.077
Tokenaffe
Token ID21223
Feature activation+6.71e+00

Pos logit contributions
ouse+0.148
suit+0.136
ouses+0.119
aj+0.116
uer+0.116
Neg logit contributions
 the-0.304
 it-0.280
 this-0.270
 her-0.254
 them-0.253
Token said
Token ID531
Feature activation-1.54e+00

Pos logit contributions
umpy+0.219
ouse+0.210
uer+0.206
ini+0.203
anda+0.202
Neg logit contributions
 the-0.170
 it-0.163
 this-0.142
 a-0.134
 her-0.127
Token that
Token ID326
Feature activation-2.02e+00

Pos logit contributions
 the+0.035
 it+0.034
 them+0.033
 your+0.031
 her+0.030
Neg logit contributions
ouse-0.046
anda-0.045
oc-0.044
umpy-0.043
uer-0.042
Token they
Token ID484
Feature activation-4.54e-01

Pos logit contributions
 the+0.026
 them+0.026
 it+0.025
 your+0.022
 such+0.021
Neg logit contributions
oc-0.075
ouse-0.074
anda-0.074
umpy-0.072
ney-0.071
Token should
Token ID815
Feature activation-3.53e+00

Pos logit contributions
 the+0.013
 it+0.012
 them+0.012
 your+0.011
 her+0.011
Neg logit contributions
ouse-0.012
anda-0.012
oc-0.011
umpy-0.011
ney-0.011
Token said
Token ID531
Feature activation-1.39e+00

Pos logit contributions
ouse+0.018
anda+0.018
oc+0.018
umpy+0.018
ney+0.017
Neg logit contributions
 the-0.020
 it-0.019
 them-0.018
 your-0.017
 her-0.017
Token hello
Token ID23748
Feature activation+8.62e-01

Pos logit contributions
 the+0.032
 it+0.031
 them+0.031
 your+0.029
 such+0.029
Neg logit contributions
anda-0.038
ouse-0.038
oc-0.037
umpy-0.035
uer-0.035
Token.
Token ID13
Feature activation+2.09e+00

Pos logit contributions
ouse+0.032
umpy+0.032
oc+0.032
anda+0.032
ney+0.030
Neg logit contributions
 the-0.015
 it-0.014
 them-0.013
 her-0.012
 their-0.012
Token The
Token ID383
Feature activation+2.09e+00

Pos logit contributions
ouse+0.068
anda+0.067
oc+0.065
umpy+0.065
uer+0.063
Neg logit contributions
 the-0.049
 it-0.046
 them-0.043
 your-0.040
 her-0.040
Token cel
Token ID18725
Feature activation+6.68e+00

Pos logit contributions
ouse+0.055
anda+0.055
umpy+0.053
jo+0.053
uer+0.052
Neg logit contributions
 the-0.061
 it-0.056
 this-0.052
 a-0.051
 help-0.050
Tokenery
Token ID1924
Feature activation+8.10e+00

Pos logit contributions
anda+0.145
ouse+0.142
umpy+0.142
oc+0.133
uer+0.133
Neg logit contributions
 it-0.206
 the-0.205
 them-0.197
 this-0.182
 your-0.182
Token were
Token ID547
Feature activation-1.52e+00

Pos logit contributions
umpy+0.266
anda+0.247
ouse+0.247
ables+0.240
uer+0.237
Neg logit contributions
 the-0.200
 it-0.194
 this-0.167
 her-0.163
 them-0.160
Token so
Token ID523
Feature activation-1.32e+00

Pos logit contributions
 the+0.036
 it+0.034
 them+0.032
 her+0.031
 your+0.030
Neg logit contributions
ouse-0.049
oc-0.049
anda-0.048
umpy-0.046
ney-0.046
Token happy
Token ID3772
Feature activation+1.73e-01

Pos logit contributions
 the+0.030
 it+0.027
 their+0.025
 her+0.025
 them+0.024
Neg logit contributions
oc-0.047
ouse-0.046
int-0.046
ables-0.045
anda-0.045
Token that
Token ID326
Feature activation-2.26e+00

Pos logit contributions
ouse+0.005
anda+0.005
oc+0.005
umpy+0.005
ney+0.005
Neg logit contributions
 the-0.003
 them-0.003
 it-0.003
 your-0.003
 anything-0.003
Token they
Token ID484
Feature activation+2.00e+00

Pos logit contributions
 yourselves+0.030
 them+0.029
 permission+0.029
wered+0.029
 your+0.029
Neg logit contributions
oc-0.075
ouse-0.072
anda-0.071
ney-0.068
umpy-0.068
Token."
Token ID526
Feature activation+2.46e+00

Pos logit contributions
 the+0.044
 it+0.042
 them+0.041
wered+0.039
 help+0.038
Neg logit contributions
anda-0.074
ouse-0.073
oc-0.073
umpy-0.073
ney-0.069
Token
Token ID220
Feature activation+1.24e+00

Pos logit contributions
anda+0.074
umpy+0.073
ouse+0.073
oc+0.071
uer+0.070
Neg logit contributions
 the-0.063
 it-0.060
 them-0.056
 her-0.054
 your-0.052
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
ouse+0.039
anda+0.038
oc+0.038
umpy+0.037
uer+0.036
Neg logit contributions
 the-0.029
 it-0.027
 them-0.026
 your-0.024
 her-0.024
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
ouse+0.047
anda+0.047
oc+0.046
umpy+0.045
uer+0.044
Neg logit contributions
 the-0.034
 it-0.033
 them-0.031
 your-0.029
 her-0.029
TokenThe
Token ID464
Feature activation+2.03e+00

Pos logit contributions
ouse+0.042
anda+0.042
oc+0.041
umpy+0.040
ney+0.039
Neg logit contributions
 the-0.031
 it-0.030
 them-0.028
 your-0.027
 her-0.026
Token beetle
Token ID45489
Feature activation+8.08e+00

Pos logit contributions
ouse+0.050
anda+0.050
oc+0.050
umpy+0.048
ney+0.047
Neg logit contributions
 the-0.060
 it-0.057
 them-0.054
 your-0.053
 her-0.052
Token laughed
Token ID13818
Feature activation+6.26e-01

Pos logit contributions
umpy+0.275
ouse+0.258
ables+0.252
ari+0.246
anda+0.246
Neg logit contributions
 the-0.198
 it-0.190
 her-0.176
 this-0.168
 a-0.159
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
anda+0.020
ouse+0.020
oc+0.019
ney+0.019
umpy+0.019
Neg logit contributions
 the-0.015
 it-0.015
 them-0.014
 her-0.013
 your-0.013
Token said
Token ID531
Feature activation-1.37e+00

Pos logit contributions
ouse+0.032
anda+0.032
umpy+0.032
oc+0.032
ney+0.031
Neg logit contributions
 the-0.037
 it-0.037
 them-0.033
 her-0.033
 this-0.032
Token,
Token ID11
Feature activation+7.93e-01

Pos logit contributions
 the+0.031
 it+0.030
 them+0.029
 your+0.027
 this+0.026
Neg logit contributions
ouse-0.041
anda-0.041
oc-0.040
umpy-0.039
uer-0.038
Token "
Token ID366
Feature activation+3.80e-01

Pos logit contributions
ouse+0.026
anda+0.025
oc+0.025
umpy+0.025
ney+0.024
Neg logit contributions
 the-0.018
 it-0.017
 them-0.016
 your-0.015
 her-0.015
Token Look
Token ID6803
Feature activation-4.40e+00

Pos logit contributions
 the+0.049
 it+0.047
 them+0.043
 your+0.042
 this+0.041
Neg logit contributions
ouse-0.071
anda-0.070
oc-0.069
umpy-0.068
ney-0.067
Token at
Token ID379
Feature activation-5.19e+00

Pos logit contributions
 the+0.097
 it+0.092
 them+0.092
 their+0.085
 your+0.084
Neg logit contributions
anda-0.131
ouse-0.131
oc-0.126
umpy-0.120
uer-0.118
Token me
Token ID502
Feature activation-3.98e-01

Pos logit contributions
 them+0.046
 their+0.045
 it+0.044
 the+0.039
 her+0.039
Neg logit contributions
anda-0.196
oc-0.195
ouse-0.189
umpy-0.185
affe-0.184
Token!
Token ID0
Feature activation-2.05e+00

Pos logit contributions
 the+0.008
 it+0.008
 them+0.008
 your+0.007
 their+0.007
Neg logit contributions
ouse-0.013
anda-0.012
oc-0.012
umpy-0.012
ney-0.012
Token I
Token ID314
Feature activation+1.82e-02

Pos logit contributions
 the+0.040
 it+0.038
 them+0.034
 your+0.034
 this+0.033
Neg logit contributions
ouse-0.070
anda-0.070
oc-0.068
umpy-0.067
uer-0.066
Token figured
Token ID11638
Feature activation-8.21e+00

Pos logit contributions
ouse+0.001
anda+0.001
oc+0.001
umpy+0.001
ney+0.001
Neg logit contributions
 the-0.000
 it-0.000
 them-0.000
 your-0.000
 this-0.000
Token out
Token ID503
Feature activation-3.73e+00

Pos logit contributions
wered+0.087
 yourselves+0.087
 their+0.086
 the+0.083
 them+0.077
Neg logit contributions
anda-0.287
ouse-0.282
affe-0.277
oc-0.263
int-0.260
Token how
Token ID703
Feature activation-2.85e+00

Pos logit contributions
 them+0.029
 the+0.026
 it+0.026
 their+0.023
wered+0.021
Neg logit contributions
anda-0.153
oc-0.152
ouse-0.150
umpy-0.144
affe-0.143
Token to
Token ID284
Feature activation-4.09e+00

Pos logit contributions
 them+0.046
 the+0.043
 it+0.040
 their+0.038
wered+0.038
Neg logit contributions
ouse-0.100
umpy-0.099
anda-0.097
ables-0.094
ney-0.094
Token fold
Token ID5591
Feature activation-3.69e+00

Pos logit contributions
 them+0.078
 their+0.074
 anything+0.074
 the+0.074
 it+0.072
Neg logit contributions
ouse-0.120
uer-0.113
oc-0.113
zag-0.107
anda-0.106
Token my
Token ID616
Feature activation+2.26e+00

Pos logit contributions
 the+0.029
wered+0.027
 them+0.027
 it+0.027
 her+0.023
Neg logit contributions
oc-0.149
ouse-0.147
anda-0.143
ables-0.141
umpy-0.139
Token "
Token ID366
Feature activation-7.82e-02

Pos logit contributions
ouse+0.015
anda+0.015
oc+0.015
umpy+0.014
ney+0.014
Neg logit contributions
 the-0.012
 it-0.011
 them-0.011
 your-0.010
 her-0.010
TokenWhat
Token ID2061
Feature activation-1.68e+00

Pos logit contributions
 the+0.002
 it+0.002
 them+0.002
 your+0.002
 her+0.002
Neg logit contributions
ouse-0.002
anda-0.002
oc-0.002
umpy-0.002
ney-0.002
Token happened
Token ID3022
Feature activation-2.06e+00

Pos logit contributions
 them+0.042
 the+0.039
 it+0.038
wered+0.037
 this+0.036
Neg logit contributions
ouse-0.045
umpy-0.042
oc-0.041
anda-0.040
ney-0.040
Token here
Token ID994
Feature activation-9.08e-01

Pos logit contributions
wered+0.044
 them+0.043
 themselves+0.042
 the+0.041
 their+0.040
Neg logit contributions
ouse-0.057
umpy-0.053
anda-0.053
oc-0.052
uer-0.049
Token?
Token ID30
Feature activation-7.64e-01

Pos logit contributions
 them+0.017
 it+0.017
 the+0.017
 anything+0.015
 their+0.015
Neg logit contributions
ouse-0.029
anda-0.028
umpy-0.028
oc-0.027
ney-0.027
Token Did
Token ID7731
Feature activation-8.21e+00

Pos logit contributions
 the+0.017
 it+0.016
 your+0.015
 them+0.015
 this+0.014
Neg logit contributions
ouse-0.024
anda-0.024
oc-0.023
umpy-0.023
uer-0.023
Token you
Token ID345
Feature activation-2.15e+00

Pos logit contributions
 them+0.090
wered+0.089
 the+0.089
 their+0.080
 such+0.079
Neg logit contributions
anda-0.301
ouse-0.276
affe-0.261
oc-0.259
ari-0.254
Token hurt
Token ID5938
Feature activation-4.72e+00

Pos logit contributions
 the+0.036
 them+0.036
 it+0.035
 their+0.032
 anything+0.032
Neg logit contributions
ouse-0.075
umpy-0.072
anda-0.070
oc-0.069
ney-0.067
Token yourself
Token ID3511
Feature activation-3.87e-02

Pos logit contributions
wered+0.053
 help+0.050
 respect+0.047
 the+0.046
 happen+0.045
Neg logit contributions
anda-0.163
ouse-0.161
umpy-0.159
oc-0.155
ney-0.150
Token?"
Token ID1701
Feature activation+1.75e+00

Pos logit contributions
 the+0.001
 it+0.001
 them+0.001
 their+0.001
 your+0.001
Neg logit contributions
ouse-0.001
umpy-0.001
anda-0.001
oc-0.001
ney-0.001
Token
Token ID220
Feature activation+1.10e+00

Pos logit contributions
anda+0.068
umpy+0.067
ouse+0.066
oc+0.065
uer+0.065
Neg logit contributions
 the-0.032
 it-0.030
 her-0.027
 them-0.026
 this-0.024
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 the+0.023
 it+0.022
 them+0.022
 anything+0.021
 your+0.021
Neg logit contributions
ouse-0.043
anda-0.042
oc-0.042
umpy-0.041
ney-0.040
Token asked
Token ID1965
Feature activation-3.44e+00

Pos logit contributions
 it+0.033
 the+0.033
 them+0.033
 your+0.032
 anything+0.031
Neg logit contributions
anda-0.028
ouse-0.028
oc-0.028
umpy-0.026
ney-0.026
Token if
Token ID611
Feature activation-4.61e+00

Pos logit contributions
 yours+0.041
 yourselves+0.041
 anything+0.039
 your+0.039
 such+0.039
Neg logit contributions
anda-0.120
oc-0.117
jo-0.114
ouse-0.110
ney-0.108
Token he
Token ID339
Feature activation+4.77e-01

Pos logit contributions
 yours+0.089
 yourselves+0.088
wered+0.088
 them+0.085
 your+0.084
Neg logit contributions
anda-0.118
oc-0.109
ouse-0.108
jo-0.106
s-0.105
Token had
Token ID550
Feature activation-4.86e+00

Pos logit contributions
ouse+0.013
oc+0.012
anda+0.012
umpy+0.012
ney+0.012
Neg logit contributions
 the-0.013
 it-0.013
 them-0.012
 this-0.011
 what-0.011
Token seen
Token ID1775
Feature activation-8.18e+00

Pos logit contributions
 them+0.082
wered+0.080
 it+0.079
 the+0.077
 ourselves+0.076
Neg logit contributions
ouse-0.151
umpy-0.137
anda-0.135
ables-0.133
int-0.132
Token their
Token ID511
Feature activation+1.11e+00

Pos logit contributions
wered+0.054
 respect+0.042
 yourselves+0.040
 happen+0.036
 help+0.035
Neg logit contributions
anda-0.296
oc-0.290
jo-0.284
affe-0.278
lers-0.277
Token orange
Token ID10912
Feature activation+3.21e+00

Pos logit contributions
ouse+0.030
anda+0.030
oc+0.029
umpy+0.029
ney+0.028
Neg logit contributions
 the-0.028
 it-0.028
 them-0.027
 your-0.026
 anything-0.025
Token ball
Token ID2613
Feature activation+3.15e+00

Pos logit contributions
ouse+0.088
anda+0.088
oc+0.087
umpy+0.086
ney+0.082
Neg logit contributions
 the-0.085
 it-0.082
 them-0.077
 your-0.075
 her-0.074
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
ouse+0.104
anda+0.103
oc+0.102
umpy+0.100
ney+0.098
Neg logit contributions
 the-0.064
 it-0.061
 them-0.058
 your-0.055
 her-0.053
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
 the+0.041
 them+0.041
 it+0.041
 your+0.038
 anything+0.038
Neg logit contributions
ouse-0.049
anda-0.048
oc-0.048
umpy-0.047
ney-0.046
Tokenally
Token ID453
Feature activation+5.43e+00

Pos logit contributions
ouse+0.061
oc+0.060
anda+0.059
umpy+0.057
uer+0.057
Neg logit contributions
 the-0.073
 it-0.070
 them-0.068
 your-0.065
 her-0.065
Token's
Token ID338
Feature activation+8.23e-01

Pos logit contributions
umpy+0.167
ouse+0.164
uer+0.160
anda+0.159
oc+0.156
Neg logit contributions
 the-0.137
 it-0.126
 this-0.111
 her-0.108
 a-0.106
Token mom
Token ID1995
Feature activation+4.02e+00

Pos logit contributions
ouse+0.027
anda+0.027
oc+0.027
umpy+0.027
ney+0.026
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.014
 her-0.014
Tokenmy
Token ID1820
Feature activation+4.13e+00

Pos logit contributions
umpy+0.107
ouse+0.106
uer+0.102
oc+0.099
ables+0.099
Neg logit contributions
 the-0.123
 it-0.115
 her-0.104
 what-0.104
 them-0.103
Token had
Token ID550
Feature activation-2.95e+00

Pos logit contributions
umpy+0.119
ouse+0.119
anda+0.115
oc+0.112
uer+0.110
Neg logit contributions
 the-0.113
 it-0.107
 them-0.096
 this-0.094
 her-0.093
Token figured
Token ID11638
Feature activation-8.02e+00

Pos logit contributions
 them+0.045
 it+0.043
 the+0.042
 yourselves+0.042
wered+0.041
Neg logit contributions
ouse-0.094
ables-0.089
ney-0.088
umpy-0.088
oc-0.087
Token it
Token ID340
Feature activation-3.96e-01

Pos logit contributions
 yourselves+0.097
wered+0.088
 ourselves+0.082
 the+0.072
 yourself+0.072
Neg logit contributions
anda-0.278
ouse-0.273
affe-0.257
ip-0.253
ables-0.249
Token out
Token ID503
Feature activation-1.03e+00

Pos logit contributions
 the+0.007
 it+0.007
 them+0.007
 your+0.006
 this+0.006
Neg logit contributions
ouse-0.014
anda-0.013
oc-0.013
umpy-0.013
ney-0.013
Token!
Token ID0
Feature activation+6.70e-01

Pos logit contributions
 the+0.020
 it+0.020
 them+0.020
 your+0.018
 anything+0.018
Neg logit contributions
oc-0.031
ouse-0.031
anda-0.031
umpy-0.030
ney-0.029
Token The
Token ID383
Feature activation+1.51e+00

Pos logit contributions
ouse+0.019
anda+0.019
oc+0.019
umpy+0.018
ney+0.018
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.015
 her-0.014
Token hidden
Token ID7104
Feature activation+2.23e+00

Pos logit contributions
ouse+0.039
anda+0.038
oc+0.038
umpy+0.037
ney+0.036
Neg logit contributions
 the-0.042
 it-0.041
 them-0.039
 your-0.038
 anything-0.037
Token did
Token ID750
Feature activation-4.65e+00

Pos logit contributions
 them+0.039
 the+0.038
 it+0.038
 your+0.035
 anything+0.034
Neg logit contributions
ouse-0.047
umpy-0.045
oc-0.045
anda-0.044
ney-0.043
Token you
Token ID345
Feature activation-1.24e+00

Pos logit contributions
wered+0.072
 them+0.069
 permission+0.065
asted+0.060
 such+0.055
Neg logit contributions
anda-0.131
s-0.128
ouse-0.127
oc-0.122
jo-0.119
Token do
Token ID466
Feature activation-8.03e-01

Pos logit contributions
 it+0.024
 them+0.024
 the+0.024
 anything+0.021
 their+0.021
Neg logit contributions
ouse-0.041
umpy-0.040
anda-0.037
oc-0.037
ney-0.037
Token?
Token ID30
Feature activation-1.31e+00

Pos logit contributions
wered+0.018
 them+0.018
 the+0.017
 permission+0.017
 it+0.016
Neg logit contributions
ouse-0.021
umpy-0.020
oc-0.020
ney-0.018
ables-0.018
Token How
Token ID1374
Feature activation-2.50e+00

Pos logit contributions
 the+0.034
 it+0.033
 them+0.032
 your+0.030
 her+0.030
Neg logit contributions
ouse-0.035
anda-0.035
oc-0.035
umpy-0.034
ney-0.033
Token could
Token ID714
Feature activation-7.97e+00

Pos logit contributions
 the+0.069
 it+0.067
 them+0.066
 your+0.063
 her+0.062
Neg logit contributions
ouse-0.063
anda-0.061
umpy-0.059
oc-0.059
ney-0.057
Token you
Token ID345
Feature activation-2.04e+00

Pos logit contributions
 them+0.111
 such+0.096
wered+0.091
 respect+0.091
 it+0.090
Neg logit contributions
ouse-0.248
anda-0.248
umpy-0.219
ney-0.219
zag-0.218
Token be
Token ID307
Feature activation-2.06e+00

Pos logit contributions
 it+0.033
 them+0.033
 the+0.033
 anything+0.030
 their+0.030
Neg logit contributions
ouse-0.069
oc-0.066
umpy-0.065
anda-0.064
ney-0.064
Token so
Token ID523
Feature activation-2.56e+00

Pos logit contributions
 them+0.038
 it+0.038
 the+0.036
wered+0.035
 him+0.033
Neg logit contributions
ouse-0.065
umpy-0.063
anda-0.062
oc-0.061
ney-0.060
Token careless
Token ID36138
Feature activation-1.11e+00

Pos logit contributions
 the+0.070
 it+0.068
 them+0.066
 your+0.062
 her+0.061
Neg logit contributions
ouse-0.066
anda-0.066
umpy-0.064
oc-0.063
uer-0.060
Token and
Token ID290
Feature activation-3.59e+00

Pos logit contributions
wered+0.024
 the+0.024
 them+0.023
 it+0.023
 her+0.023
Neg logit contributions
umpy-0.029
ouse-0.028
oc-0.027
ney-0.026
anda-0.026
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
 the+0.034
 yourselves+0.034
 them+0.033
wered+0.033
 your+0.032
Neg logit contributions
ouse-0.037
anda-0.036
oc-0.034
umpy-0.033
uer-0.033
Token "
Token ID366
Feature activation-2.29e-01

Pos logit contributions
 the+0.003
 it+0.003
 them+0.003
 your+0.003
 anything+0.003
Neg logit contributions
ouse-0.003
anda-0.003
oc-0.003
umpy-0.003
ney-0.003
TokenHey
Token ID10814
Feature activation-7.72e-01

Pos logit contributions
 the+0.005
 it+0.005
 them+0.005
 your+0.004
 her+0.004
Neg logit contributions
anda-0.008
ouse-0.008
oc-0.007
uer-0.007
umpy-0.007
Token,
Token ID11
Feature activation-4.39e+00

Pos logit contributions
 the+0.015
 them+0.014
 it+0.014
 her+0.013
 your+0.013
Neg logit contributions
ouse-0.025
anda-0.025
oc-0.025
umpy-0.024
uer-0.023
Token where
Token ID810
Feature activation-3.57e+00

Pos logit contributions
 her+0.052
 the+0.051
 them+0.050
wered+0.049
 anything+0.048
Neg logit contributions
anda-0.155
ouse-0.143
oc-0.143
affe-0.140
umpy-0.139
Token did
Token ID750
Feature activation-7.91e+00

Pos logit contributions
 them+0.072
 the+0.066
 it+0.066
 her+0.063
wered+0.059
Neg logit contributions
ouse-0.108
anda-0.105
umpy-0.103
oc-0.100
ney-0.098
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
wered+0.101
 permission+0.096
 them+0.091
 herself+0.088
 respect+0.082
Neg logit contributions
anda-0.245
oc-0.238
s-0.233
ouse-0.229
affe-0.228
Token symbol
Token ID6194
Feature activation+1.71e+00

Pos logit contributions
ouse+0.026
anda+0.025
oc+0.025
umpy+0.024
ney+0.024
Neg logit contributions
 the-0.031
 it-0.030
 them-0.029
 your-0.028
 her-0.027
Token go
Token ID467
Feature activation-1.88e+00

Pos logit contributions
ouse+0.061
anda+0.061
oc+0.060
umpy+0.059
ney+0.058
Neg logit contributions
 the-0.030
 it-0.028
 them-0.026
 your-0.025
 her-0.024
Token?"
Token ID1701
Feature activation+1.14e+00

Pos logit contributions
 it+0.034
 the+0.033
 them+0.031
 anything+0.030
 her+0.029
Neg logit contributions
ouse-0.062
anda-0.061
oc-0.061
umpy-0.060
ney-0.058
Token It
Token ID632
Feature activation+1.95e+00

Pos logit contributions
ouse+0.041
oc+0.040
anda+0.040
umpy+0.040
ney+0.038
Neg logit contributions
 the-0.018
 it-0.017
 them-0.016
 your-0.016
 anything-0.015
Token and
Token ID290
Feature activation-2.64e-01

Pos logit contributions
ouse+0.046
umpy+0.044
anda+0.044
oc+0.043
ney+0.043
Neg logit contributions
 the-0.026
 it-0.023
 them-0.022
 what-0.020
 their-0.020
Token asked
Token ID1965
Feature activation-3.57e+00

Pos logit contributions
 the+0.007
 it+0.007
 them+0.007
 your+0.006
 anything+0.006
Neg logit contributions
ouse-0.007
oc-0.007
anda-0.006
uer-0.006
ney-0.006
Token if
Token ID611
Feature activation-4.63e+00

Pos logit contributions
 yourselves+0.038
wered+0.034
 such+0.032
 ourselves+0.031
 yours+0.031
Neg logit contributions
oc-0.129
anda-0.124
ouse-0.116
affe-0.116
ari-0.116
Token she
Token ID673
Feature activation-2.89e-02

Pos logit contributions
 yourselves+0.096
 yours+0.095
wered+0.095
 such+0.094
 respect+0.093
Neg logit contributions
anda-0.108
ouse-0.104
ari-0.100
uer-0.099
s-0.099
Token had
Token ID550
Feature activation-4.83e+00

Pos logit contributions
 the+0.001
 it+0.001
 them+0.001
 what+0.001
 her+0.001
Neg logit contributions
ouse-0.001
umpy-0.001
anda-0.001
oc-0.001
ables-0.001
Token seen
Token ID1775
Feature activation-7.87e+00

Pos logit contributions
wered+0.087
 yourselves+0.086
 it+0.086
 ourselves+0.085
 the+0.084
Neg logit contributions
ouse-0.139
anda-0.129
int-0.122
ables-0.121
umpy-0.120
Token a
Token ID257
Feature activation+1.55e+00

Pos logit contributions
 the+0.053
 yourselves+0.053
 your+0.052
wered+0.052
 it+0.051
Neg logit contributions
anda-0.301
oc-0.286
ouse-0.280
jo-0.274
affe-0.272
Token little
Token ID1310
Feature activation+2.10e+00

Pos logit contributions
ouse+0.044
anda+0.044
oc+0.043
umpy+0.043
ney+0.042
Neg logit contributions
 the-0.039
 it-0.038
 them-0.036
 your-0.034
 their-0.033
Token boy
Token ID2933
Feature activation+2.00e+00

Pos logit contributions
ouse+0.056
anda+0.054
umpy+0.053
oc+0.052
ney+0.051
Neg logit contributions
 the-0.058
 it-0.057
 them-0.053
 your-0.051
 this-0.050
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ouse+0.067
anda+0.066
oc+0.065
umpy+0.064
ney+0.062
Neg logit contributions
 the-0.040
 it-0.039
 them-0.036
 your-0.034
 her-0.033
Token Lily
Token ID20037
Feature activation+5.12e+00

Pos logit contributions
ouse+0.075
anda+0.074
umpy+0.073
oc+0.071
uer+0.070
Neg logit contributions
 the-0.049
 it-0.046
 them-0.043
 her-0.041
 your-0.040
Token mom
Token ID1995
Feature activation+3.23e+00

Pos logit contributions
oc+0.044
ouse+0.043
umpy+0.043
anda+0.042
ney+0.041
Neg logit contributions
 the-0.029
 it-0.027
 them-0.025
 her-0.024
 your-0.024
Tokenmy
Token ID1820
Feature activation+3.30e+00

Pos logit contributions
ouse+0.094
oc+0.092
umpy+0.092
anda+0.091
uer+0.089
Neg logit contributions
 the-0.086
 it-0.081
 them-0.077
 her-0.074
 what-0.070
Token accidentally
Token ID14716
Feature activation-4.25e+00

Pos logit contributions
ouse+0.096
umpy+0.095
anda+0.095
oc+0.094
ney+0.091
Neg logit contributions
 the-0.086
 it-0.083
 them-0.077
 her-0.073
 this-0.071
Token let
Token ID1309
Feature activation-4.68e+00

Pos logit contributions
 them+0.107
 the+0.106
 ourselves+0.104
wered+0.104
 your+0.103
Neg logit contributions
ouse-0.104
anda-0.094
oc-0.089
ney-0.084
zag-0.082
Token go
Token ID467
Feature activation-1.72e+00

Pos logit contributions
 the+0.038
 it+0.036
 yourselves+0.034
wered+0.033
 your+0.032
Neg logit contributions
ouse-0.180
anda-0.180
umpy-0.177
oc-0.175
ables-0.170
Token of
Token ID286
Feature activation-7.83e+00

Pos logit contributions
 the+0.037
 it+0.036
 them+0.034
 your+0.033
 anything+0.032
Neg logit contributions
anda-0.053
ouse-0.053
oc-0.052
umpy-0.051
ney-0.049
Token her
Token ID607
Feature activation+1.03e+00

Pos logit contributions
wered+0.064
 respect+0.064
 listen+0.045
 happen+0.044
 yours+0.044
Neg logit contributions
anda-0.251
oc-0.240
ouse-0.230
s-0.230
uddle-0.228
Token hand
Token ID1021
Feature activation+1.22e+00

Pos logit contributions
ouse+0.033
anda+0.033
oc+0.033
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.023
 it-0.021
 them-0.020
 your-0.019
 her-0.019
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
ouse+0.039
anda+0.038
oc+0.038
umpy+0.037
ney+0.036
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.023
 her-0.022
Token Lily
Token ID20037
Feature activation+4.52e+00

Pos logit contributions
ouse+0.034
anda+0.034
oc+0.033
umpy+0.032
ney+0.032
Neg logit contributions
 the-0.023
 it-0.022
 them-0.021
 your-0.020
 her-0.019
Token lost
Token ID2626
Feature activation-4.37e+00

Pos logit contributions
anda+0.130
ari+0.128
oc+0.128
ouse+0.128
umpy+0.127
Neg logit contributions
 the-0.120
 it-0.116
 her-0.101
 them-0.097
 this-0.097
Token out
Token ID503
Feature activation-3.10e+00

Pos logit contributions
 it+0.037
 the+0.036
 yourselves+0.033
 them+0.033
 your+0.032
Neg logit contributions
ouse-0.207
oc-0.206
anda-0.203
umpy-0.197
int-0.191
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
 it+0.024
 your+0.020
 the+0.020
 them+0.020
 anything+0.020
Neg logit contributions
ouse-0.127
anda-0.125
oc-0.125
umpy-0.121
ney-0.119
Token needle
Token ID17598
Feature activation+3.85e+00

Pos logit contributions
anda+0.059
ouse+0.058
umpy+0.057
oc+0.057
ney+0.055
Neg logit contributions
 the-0.049
 it-0.047
 them-0.044
 your-0.042
 this-0.042
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
ouse+0.106
anda+0.104
oc+0.103
umpy+0.100
ney+0.098
Neg logit contributions
 the-0.099
 it-0.096
 them-0.092
 your-0.089
 anything-0.086
Token carefully
Token ID7773
Feature activation-8.60e-01

Pos logit contributions
 the+0.020
 them+0.020
 it+0.019
 your+0.018
 anything+0.018
Neg logit contributions
ouse-0.030
oc-0.029
anda-0.028
ney-0.027
uer-0.027
Token inserted
Token ID18846
Feature activation-7.73e+00

Pos logit contributions
 the+0.020
 it+0.019
 your+0.019
 them+0.019
 anything+0.018
Neg logit contributions
ouse-0.025
oc-0.024
anda-0.024
ney-0.023
umpy-0.023
Token it
Token ID340
Feature activation+1.52e+00

Pos logit contributions
 yourselves+0.037
wered+0.031
 listen+0.030
 ourselves+0.028
 happen+0.028
Neg logit contributions
oc-0.306
anda-0.303
ouse-0.294
lers-0.291
s-0.289
Token into
Token ID656
Feature activation-6.25e+00

Pos logit contributions
ouse+0.053
anda+0.053
oc+0.053
umpy+0.052
ney+0.050
Neg logit contributions
 the-0.030
 it-0.029
 them-0.026
 her-0.024
 this-0.024
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
 yourselves+0.034
 yours+0.034
 respect+0.033
 your+0.032
 listen+0.031
Neg logit contributions
oc-0.250
ouse-0.240
anda-0.234
ney-0.229
jo-0.225
Token toy
Token ID13373
Feature activation+2.91e+00

Pos logit contributions
ouse+0.042
anda+0.042
oc+0.041
umpy+0.040
ney+0.039
Neg logit contributions
 the-0.054
 it-0.052
 them-0.051
 your-0.048
 her-0.048
Token car
Token ID1097
Feature activation+3.57e+00

Pos logit contributions
ouse+0.067
anda+0.066
oc+0.065
umpy+0.063
ney+0.061
Neg logit contributions
 the-0.088
 it-0.086
 them-0.083
 your-0.080
 her-0.078
Token realized
Token ID6939
Feature activation-4.05e+00

Pos logit contributions
 the+0.031
 it+0.030
 them+0.029
 your+0.027
 anything+0.026
Neg logit contributions
ouse-0.037
anda-0.037
oc-0.036
umpy-0.036
ney-0.034
Token that
Token ID326
Feature activation-3.92e+00

Pos logit contributions
 the+0.054
 them+0.052
 your+0.052
 it+0.050
wered+0.049
Neg logit contributions
oc-0.136
anda-0.135
ouse-0.134
umpy-0.127
jo-0.126
Token Jimmy
Token ID12963
Feature activation+4.27e+00

Pos logit contributions
 the+0.057
 them+0.057
 it+0.054
 your+0.053
wered+0.051
Neg logit contributions
oc-0.131
ouse-0.128
anda-0.124
umpy-0.123
ney-0.120
Token had
Token ID550
Feature activation-2.89e+00

Pos logit contributions
ouse+0.125
anda+0.121
oc+0.120
uer+0.118
umpy+0.117
Neg logit contributions
 the-0.112
 it-0.109
 them-0.097
 this-0.094
 her-0.092
Token already
Token ID1541
Feature activation-1.89e+00

Pos logit contributions
 it+0.059
 the+0.058
 them+0.058
 your+0.056
 this+0.053
Neg logit contributions
ouse-0.083
umpy-0.080
anda-0.078
ney-0.077
ables-0.076
Token figured
Token ID11638
Feature activation-7.68e+00

Pos logit contributions
 it+0.041
 the+0.040
 them+0.039
 your+0.037
 anything+0.036
Neg logit contributions
ouse-0.057
anda-0.055
umpy-0.054
oc-0.053
ney-0.053
Token out
Token ID503
Feature activation-3.03e+00

Pos logit contributions
 the+0.105
 yourselves+0.104
wered+0.102
 it+0.098
 them+0.097
Neg logit contributions
ouse-0.248
anda-0.243
oc-0.234
umpy-0.232
jo-0.230
Token what
Token ID644
Feature activation-1.82e+00

Pos logit contributions
 it+0.023
 them+0.023
 the+0.021
wered+0.019
 your+0.017
Neg logit contributions
oc-0.122
anda-0.120
ouse-0.120
umpy-0.118
ables-0.114
Token was
Token ID373
Feature activation-1.52e+00

Pos logit contributions
 the+0.044
 them+0.043
 it+0.042
 your+0.040
wered+0.039
Neg logit contributions
ouse-0.047
oc-0.046
anda-0.045
umpy-0.045
ney-0.044
Token about
Token ID546
Feature activation-1.75e+00

Pos logit contributions
 the+0.027
 it+0.026
 them+0.026
wered+0.024
 your+0.024
Neg logit contributions
umpy-0.046
ouse-0.046
anda-0.046
oc-0.044
ney-0.043
Token to
Token ID284
Feature activation-3.59e+00

Pos logit contributions
 the+0.035
 it+0.034
 them+0.033
 your+0.031
 anything+0.031
Neg logit contributions
oc-0.053
ouse-0.053
anda-0.052
umpy-0.051
ney-0.050
Token to
Token ID284
Feature activation-4.60e+00

Pos logit contributions
wered+0.058
 it+0.055
 the+0.053
 happen+0.052
 them+0.049
Neg logit contributions
anda-0.102
ouse-0.100
ables-0.098
ney-0.096
umpy-0.096
Token learn
Token ID2193
Feature activation-3.94e+00

Pos logit contributions
 it+0.089
 anything+0.084
 such+0.081
 the+0.079
 them+0.078
Neg logit contributions
ouse-0.116
oc-0.111
age-0.109
ney-0.108
umpy-0.107
Token to
Token ID284
Feature activation-4.02e+00

Pos logit contributions
wered+0.060
 it+0.059
 the+0.057
 anything+0.052
 them+0.052
Neg logit contributions
anda-0.126
ouse-0.125
umpy-0.121
oc-0.119
ney-0.118
Token let
Token ID1309
Feature activation-5.19e+00

Pos logit contributions
 anything+0.075
 it+0.073
 such+0.073
 this+0.067
 the+0.066
Neg logit contributions
oc-0.100
ouse-0.098
ney-0.095
anda-0.094
 driver-0.090
Token go
Token ID467
Feature activation-2.10e-01

Pos logit contributions
 the+0.022
 it+0.022
wered+0.019
 yourselves+0.018
 such+0.018
Neg logit contributions
anda-0.216
oc-0.211
ouse-0.211
umpy-0.208
ables-0.207
Token of
Token ID286
Feature activation-7.64e+00

Pos logit contributions
 the+0.004
 it+0.004
 them+0.004
 your+0.004
 her+0.004
Neg logit contributions
ouse-0.007
anda-0.007
oc-0.007
umpy-0.007
uer-0.007
Token things
Token ID1243
Feature activation+2.23e-01

Pos logit contributions
wered+0.057
 respect+0.043
 such+0.042
 yours+0.039
pped+0.038
Neg logit contributions
anda-0.255
oc-0.239
s-0.236
ouse-0.232
uddle-0.230
Token they
Token ID484
Feature activation+9.09e-01

Pos logit contributions
ouse+0.007
anda+0.007
umpy+0.007
oc+0.007
uer+0.006
Neg logit contributions
 the-0.004
 it-0.004
wered-0.004
 them-0.004
 such-0.004
Token do
Token ID466
Feature activation-4.92e-01

Pos logit contributions
ouse+0.032
anda+0.032
oc+0.032
umpy+0.031
ney+0.030
Neg logit contributions
 the-0.016
 it-0.015
 them-0.014
 your-0.014
 her-0.013
Token not
Token ID407
Feature activation-2.34e+00

Pos logit contributions
 the+0.011
 it+0.010
wered+0.010
 them+0.010
 your+0.009
Neg logit contributions
ouse-0.015
umpy-0.014
oc-0.013
anda-0.013
ney-0.013
Token need
Token ID761
Feature activation-3.55e+00

Pos logit contributions
 the+0.048
 it+0.048
 them+0.045
 this+0.044
 such+0.044
Neg logit contributions
umpy-0.072
ouse-0.069
oc-0.066
anda-0.065
ney-0.065
Token He
Token ID679
Feature activation+2.97e+00

Pos logit contributions
ouse+0.083
anda+0.080
uer+0.080
umpy+0.077
int+0.076
Neg logit contributions
 the-0.058
 it-0.056
 them-0.050
 your-0.049
 this-0.048
Token wished
Token ID16555
Feature activation-4.36e+00

Pos logit contributions
ouse+0.091
umpy+0.089
uer+0.085
anda+0.085
oc+0.084
Neg logit contributions
 the-0.077
 it-0.073
 them-0.067
 her-0.064
 their-0.063
Token he
Token ID339
Feature activation+9.46e-01

Pos logit contributions
 the+0.096
 your+0.094
 it+0.094
 them+0.092
 yourselves+0.090
Neg logit contributions
anda-0.109
ouse-0.107
oc-0.106
ney-0.100
umpy-0.100
Token could
Token ID714
Feature activation-3.29e+00

Pos logit contributions
ouse+0.024
anda+0.023
oc+0.023
umpy+0.023
ney+0.022
Neg logit contributions
 the-0.027
 it-0.027
 them-0.025
 your-0.024
 this-0.024
Token have
Token ID423
Feature activation-4.31e+00

Pos logit contributions
 the+0.050
 it+0.047
 them+0.043
 your+0.041
 this+0.040
Neg logit contributions
ouse-0.122
anda-0.120
oc-0.119
umpy-0.116
ney-0.116
Token figured
Token ID11638
Feature activation-7.61e+00

Pos logit contributions
 the+0.086
 it+0.084
wered+0.081
 your+0.081
 them+0.080
Neg logit contributions
ouse-0.118
oc-0.116
anda-0.111
uer-0.108
ney-0.108
Token out
Token ID503
Feature activation-3.13e+00

Pos logit contributions
 the+0.100
 yourselves+0.098
wered+0.093
 it+0.087
 your+0.086
Neg logit contributions
ouse-0.251
anda-0.249
oc-0.243
affe-0.238
umpy-0.232
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
 it+0.023
wered+0.022
 the+0.021
 them+0.021
 yourselves+0.017
Neg logit contributions
oc-0.123
anda-0.121
ouse-0.120
umpy-0.116
affe-0.114
Token way
Token ID835
Feature activation-1.21e+00

Pos logit contributions
ouse+0.055
anda+0.054
oc+0.053
umpy+0.052
ney+0.051
Neg logit contributions
 the-0.042
 it-0.040
 them-0.038
 your-0.037
 this-0.035
Token to
Token ID284
Feature activation-4.08e+00

Pos logit contributions
 the+0.020
 it+0.019
 them+0.018
 your+0.017
 anything+0.017
Neg logit contributions
ouse-0.042
anda-0.042
umpy-0.041
oc-0.041
ney-0.040
Token help
Token ID1037
Feature activation-3.09e+00

Pos logit contributions
 it+0.061
 them+0.055
 anything+0.054
 the+0.054
 your+0.053
Neg logit contributions
oc-0.129
ouse-0.119
ney-0.115
uer-0.115
anda-0.113
Token my
Token ID616
Feature activation+1.33e+00

Pos logit contributions
 the+0.061
 it+0.060
 them+0.057
 anything+0.055
 your+0.052
Neg logit contributions
anda-0.101
ouse-0.097
umpy-0.094
oc-0.093
ari-0.091
Token little
Token ID1310
Feature activation+1.10e+00

Pos logit contributions
ouse+0.036
oc+0.035
umpy+0.035
anda+0.034
ney+0.034
Neg logit contributions
 the-0.036
 it-0.035
 them-0.032
 your-0.032
 what-0.031
Token mud
Token ID17492
Feature activation+3.54e+00

Pos logit contributions
ouse+0.024
oc+0.024
anda+0.024
umpy+0.023
ney+0.022
Neg logit contributions
 the-0.035
 it-0.034
 them-0.033
 your-0.032
 her-0.031
Token monsters
Token ID10963
Feature activation+4.27e+00

Pos logit contributions
ouse+0.106
oc+0.105
umpy+0.104
anda+0.102
uer+0.102
Neg logit contributions
 the-0.084
 it-0.081
 them-0.077
 your-0.074
 her-0.071
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
ouse+0.139
anda+0.138
oc+0.136
umpy+0.134
ney+0.130
Neg logit contributions
 the-0.088
 it-0.084
 them-0.079
 your-0.075
 her-0.073
Token Did
Token ID7731
Feature activation-7.61e+00

Pos logit contributions
 the+0.040
 it+0.039
 them+0.037
 your+0.035
 her+0.035
Neg logit contributions
ouse-0.051
anda-0.051
oc-0.051
umpy-0.049
ney-0.048
Token you
Token ID345
Feature activation-2.61e+00

Pos logit contributions
 the+0.078
 such+0.072
 them+0.070
wered+0.069
 it+0.066
Neg logit contributions
anda-0.289
ouse-0.261
affe-0.252
oc-0.251
jo-0.246
Token have
Token ID423
Feature activation-4.70e+00

Pos logit contributions
 the+0.041
 it+0.041
 them+0.038
 your+0.036
 this+0.035
Neg logit contributions
ouse-0.091
anda-0.088
umpy-0.087
oc-0.086
ney-0.084
Token fun
Token ID1257
Feature activation-3.28e-01

Pos logit contributions
wered+0.079
 the+0.075
 it+0.073
 ourselves+0.068
 this+0.066
Neg logit contributions
anda-0.136
ouse-0.131
umpy-0.120
lers-0.120
ables-0.120
Token in
Token ID287
Feature activation-4.17e+00

Pos logit contributions
 the+0.006
 it+0.006
 them+0.006
 anything+0.005
 your+0.005
Neg logit contributions
ouse-0.011
anda-0.011
umpy-0.010
oc-0.010
ney-0.010
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
wered+0.039
 permission+0.031
 respect+0.031
 listen+0.030
 help+0.025
Neg logit contributions
anda-0.144
oc-0.143
ney-0.142
umpy-0.140
ouse-0.139
Token all
Token ID477
Feature activation+6.24e-01

Pos logit contributions
ouse+0.038
anda+0.037
oc+0.036
umpy+0.036
uer+0.035
Neg logit contributions
 the-0.036
 it-0.034
 them-0.032
 her-0.030
 their-0.030
Token lined
Token ID16566
Feature activation-5.37e+00

Pos logit contributions
ouse+0.018
anda+0.017
oc+0.017
uer+0.017
umpy+0.017
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 their-0.014
 her-0.014
Token up
Token ID510
Feature activation-2.27e+00

Pos logit contributions
 the+0.108
 anything+0.105
 it+0.103
 your+0.103
 help+0.102
Neg logit contributions
anda-0.147
ouse-0.144
oc-0.142
ney-0.134
uer-0.131
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
 the+0.045
 it+0.045
 them+0.042
 your+0.040
 anything+0.039
Neg logit contributions
ouse-0.073
anda-0.073
oc-0.071
umpy-0.070
ney-0.069
Token waited
Token ID13488
Feature activation-2.51e+00

Pos logit contributions
 the+0.036
 it+0.035
 them+0.035
 your+0.034
 anything+0.033
Neg logit contributions
ouse-0.032
anda-0.031
oc-0.031
umpy-0.029
ney-0.029
Token for
Token ID329
Feature activation-7.61e+00

Pos logit contributions
 it+0.054
 such+0.054
 the+0.054
 anything+0.053
 respect+0.053
Neg logit contributions
ouse-0.059
anda-0.056
uer-0.055
umpy-0.055
oc-0.053
Token someone
Token ID2130
Feature activation+1.74e+00

Pos logit contributions
 yourselves+0.080
 yourself+0.073
 your+0.072
 yours+0.067
 happen+0.066
Neg logit contributions
oc-0.237
anda-0.222
ahs-0.218
 driver-0.216
affe-0.211
Token to
Token ID284
Feature activation-3.66e+00

Pos logit contributions
ouse+0.071
oc+0.070
anda+0.069
umpy+0.069
ney+0.067
Neg logit contributions
 the-0.024
 it-0.022
 them-0.020
 your-0.018
 this-0.017
Token say
Token ID910
Feature activation-2.86e+00

Pos logit contributions
 it+0.063
 your+0.062
 yourselves+0.061
 yours+0.058
 anything+0.057
Neg logit contributions
 driver-0.097
umpy-0.096
oc-0.094
ari-0.094
uer-0.093
Token "
Token ID366
Feature activation-3.30e-01

Pos logit contributions
 the+0.058
 it+0.056
 them+0.053
 your+0.051
 her+0.050
Neg logit contributions
ouse-0.090
anda-0.089
oc-0.087
umpy-0.085
uer-0.083
Tokengo
Token ID2188
Feature activation+3.85e-01

Pos logit contributions
 the+0.009
 it+0.008
 them+0.008
 him+0.008
 her+0.007
Neg logit contributions
anda-0.009
ouse-0.009
oc-0.008
umpy-0.008
uddle-0.008
Token.
Token ID13
Feature activation-2.42e+00

Pos logit contributions
ouse+0.087
anda+0.083
umpy+0.083
oc+0.083
ney+0.081
Neg logit contributions
 the-0.055
 it-0.052
 them-0.048
 your-0.047
 this-0.045
Token I
Token ID314
Feature activation+1.74e-01

Pos logit contributions
 the+0.048
 it+0.046
 them+0.043
 your+0.041
 this+0.039
Neg logit contributions
ouse-0.082
anda-0.080
oc-0.080
umpy-0.078
ney-0.076
Token don
Token ID836
Feature activation-2.09e+00

Pos logit contributions
ouse+0.006
anda+0.006
umpy+0.006
oc+0.006
ney+0.006
Neg logit contributions
 the-0.003
 it-0.003
 them-0.003
 your-0.003
 an-0.003
Token't
Token ID470
Feature activation-2.40e+00

Pos logit contributions
 the+0.039
 them+0.037
 it+0.037
 anything+0.036
 your+0.035
Neg logit contributions
umpy-0.066
ouse-0.066
oc-0.064
anda-0.064
ney-0.062
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 the+0.047
 it+0.045
 them+0.042
 your+0.039
 her+0.038
Neg logit contributions
ouse-0.083
anda-0.082
oc-0.081
umpy-0.079
ney-0.077
Token Did
Token ID7731
Feature activation-7.60e+00

Pos logit contributions
 the+0.041
 it+0.039
 them+0.036
 your+0.034
 her+0.032
Neg logit contributions
ouse-0.083
anda-0.082
oc-0.081
umpy-0.080
ney-0.078
Token I
Token ID314
Feature activation-2.16e+00

Pos logit contributions
 the+0.073
 them+0.071
 it+0.064
 such+0.061
wered+0.060
Neg logit contributions
anda-0.301
ouse-0.281
oc-0.273
affe-0.271
jo-0.265
Token do
Token ID466
Feature activation-4.73e+00

Pos logit contributions
 the+0.037
 it+0.035
 them+0.032
 your+0.030
 her+0.029
Neg logit contributions
ouse-0.079
anda-0.078
oc-0.077
umpy-0.076
ney-0.074
Token something
Token ID1223
Feature activation-1.64e-01

Pos logit contributions
 them+0.034
 the+0.033
wered+0.030
 their+0.027
 it+0.026
Neg logit contributions
ouse-0.185
anda-0.182
oc-0.174
uer-0.171
umpy-0.170
Token wrong
Token ID2642
Feature activation-1.85e-01

Pos logit contributions
 the+0.003
 it+0.003
 them+0.003
 their+0.003
 such+0.003
Neg logit contributions
anda-0.005
ouse-0.005
umpy-0.005
oc-0.005
uer-0.005
Token?"
Token ID1701
Feature activation-2.38e-01

Pos logit contributions
 the+0.003
 it+0.003
 them+0.003
 your+0.003
 anything+0.003
Neg logit contributions
ouse-0.006
anda-0.006
umpy-0.006
oc-0.006
ney-0.006
Token't
Token ID470
Feature activation-2.12e+00

Pos logit contributions
 the+0.058
 it+0.055
 them+0.053
 your+0.053
 anything+0.052
Neg logit contributions
ouse-0.096
umpy-0.096
anda-0.092
oc-0.090
zag-0.089
Token want
Token ID765
Feature activation-5.25e+00

Pos logit contributions
 the+0.043
 it+0.042
 them+0.039
 your+0.037
 this+0.036
Neg logit contributions
ouse-0.070
anda-0.068
oc-0.068
umpy-0.068
ney-0.065
Token to
Token ID284
Feature activation-3.47e+00

Pos logit contributions
 it+0.074
 the+0.073
 yourselves+0.072
wered+0.072
 ourselves+0.070
Neg logit contributions
anda-0.166
ouse-0.156
umpy-0.152
jo-0.150
ables-0.147
Token let
Token ID1309
Feature activation-5.35e+00

Pos logit contributions
 the+0.059
 it+0.058
 them+0.057
 such+0.055
 this+0.054
Neg logit contributions
ouse-0.111
anda-0.107
umpy-0.107
oc-0.105
ney-0.101
Token go
Token ID467
Feature activation-2.52e-01

Pos logit contributions
 the+0.049
 yourselves+0.047
 it+0.047
wered+0.043
 such+0.043
Neg logit contributions
ouse-0.195
anda-0.195
oc-0.184
ables-0.183
jo-0.182
Token of
Token ID286
Feature activation-7.60e+00

Pos logit contributions
 the+0.006
 it+0.006
 them+0.006
 your+0.005
 her+0.005
Neg logit contributions
ouse-0.007
anda-0.007
oc-0.007
umpy-0.007
ney-0.007
Token her
Token ID607
Feature activation+1.07e+00

Pos logit contributions
 respect+0.053
wered+0.047
 yours+0.040
 yourselves+0.035
 it+0.034
Neg logit contributions
anda-0.254
ouse-0.235
lers-0.233
s-0.232
ahs-0.231
Token dream
Token ID4320
Feature activation+4.08e+00

Pos logit contributions
ouse+0.033
oc+0.033
umpy+0.033
anda+0.032
ney+0.031
Neg logit contributions
 the-0.025
 it-0.023
 them-0.022
 your-0.021
 her-0.021
Token.
Token ID13
Feature activation+6.91e-01

Pos logit contributions
ouse+0.137
anda+0.136
oc+0.135
umpy+0.131
ney+0.128
Neg logit contributions
 the-0.083
 it-0.079
 them-0.074
 your-0.069
 her-0.068
Token She
Token ID1375
Feature activation+3.49e+00

Pos logit contributions
ouse+0.018
anda+0.018
oc+0.018
umpy+0.017
ney+0.017
Neg logit contributions
 the-0.019
 it-0.019
 them-0.018
 your-0.017
 her-0.017
Token wanted
Token ID2227
Feature activation-5.40e+00

Pos logit contributions
ouse+0.109
umpy+0.109
oc+0.107
anda+0.106
ney+0.103
Neg logit contributions
 the-0.084
 it-0.081
 her-0.074
 them-0.073
 a-0.070
Token love
Token ID1842
Feature activation-4.81e+00

Pos logit contributions
 the+0.076
 it+0.072
 anything+0.070
 such+0.069
 an+0.065
Neg logit contributions
anda-0.140
ouse-0.139
umpy-0.135
oc-0.133
affe-0.126
Token her
Token ID607
Feature activation+7.28e-01

Pos logit contributions
 the+0.049
 it+0.045
wered+0.041
 anything+0.040
 them+0.037
Neg logit contributions
anda-0.187
ouse-0.183
oc-0.183
umpy-0.176
ney-0.176
Token too
Token ID1165
Feature activation+5.66e-01

Pos logit contributions
ouse+0.022
anda+0.022
oc+0.021
umpy+0.021
ney+0.020
Neg logit contributions
 the-0.017
 it-0.017
 them-0.016
 your-0.015
 her-0.015
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
umpy+0.017
oc+0.017
ouse+0.016
anda+0.016
ney+0.015
Neg logit contributions
 the-0.012
 it-0.012
 them-0.012
 your-0.011
 anything-0.011
Token How
Token ID1374
Feature activation-2.47e+00

Pos logit contributions
 the+0.004
 it+0.004
 them+0.004
 your+0.004
 this+0.004
Neg logit contributions
ouse-0.005
anda-0.005
oc-0.005
umpy-0.004
ney-0.004
Token would
Token ID561
Feature activation-7.59e+00

Pos logit contributions
 the+0.051
 it+0.049
 them+0.047
 your+0.044
 this+0.043
Neg logit contributions
ouse-0.080
anda-0.078
umpy-0.077
oc-0.076
ney-0.074
Token you
Token ID345
Feature activation-1.28e+00

Pos logit contributions
 the+0.089
wered+0.089
 it+0.087
 them+0.083
 such+0.081
Neg logit contributions
anda-0.237
ouse-0.229
umpy-0.209
s-0.209
affe-0.206
Token feel
Token ID1254
Feature activation-2.50e+00

Pos logit contributions
 the+0.022
 it+0.022
 them+0.021
 your+0.020
 their+0.019
Neg logit contributions
ouse-0.043
umpy-0.041
oc-0.041
uer-0.040
anda-0.040
Token if
Token ID611
Feature activation-5.05e+00

Pos logit contributions
 the+0.045
 them+0.044
 it+0.043
wered+0.043
 their+0.040
Neg logit contributions
umpy-0.077
ouse-0.076
oc-0.072
anda-0.071
uer-0.067
Token she
Token ID673
Feature activation+5.54e-01

Pos logit contributions
wered+0.067
 the+0.060
 such+0.058
 them+0.057
 respect+0.053
Neg logit contributions
anda-0.163
ouse-0.159
umpy-0.151
s-0.148
oc-0.143
Token did
Token ID750
Feature activation-4.19e+00

Pos logit contributions
ouse+0.016
anda+0.016
oc+0.016
umpy+0.015
ney+0.015
Neg logit contributions
 the-0.014
 it-0.013
 them-0.013
 your-0.012
 her-0.012
Tokeny
Token ID88
Feature activation+7.23e-01

Pos logit contributions
oc+0.025
ouse+0.025
anda+0.024
ney+0.021
uer+0.021
Neg logit contributions
 the-0.091
 it-0.089
 them-0.086
 her-0.084
 your-0.083
Token that
Token ID326
Feature activation-1.55e+00

Pos logit contributions
ouse+0.022
anda+0.022
oc+0.022
umpy+0.022
uer+0.021
Neg logit contributions
 the-0.013
 them-0.013
 it-0.013
 your-0.012
wered-0.012
Token she
Token ID673
Feature activation+2.12e+00

Pos logit contributions
 them+0.032
 our+0.029
 yourselves+0.028
 the+0.028
 your+0.028
Neg logit contributions
ouse-0.043
umpy-0.041
anda-0.040
oc-0.040
ney-0.039
Token fell
Token ID3214
Feature activation-1.03e+00

Pos logit contributions
oc+0.056
ouse+0.056
anda+0.055
umpy+0.054
ney+0.052
Neg logit contributions
 the-0.059
 it-0.058
 them-0.053
 your-0.051
 her-0.050
Token down
Token ID866
Feature activation-2.05e+00

Pos logit contributions
 the+0.024
 it+0.024
 them+0.022
 your+0.021
 her+0.021
Neg logit contributions
ouse-0.031
anda-0.031
oc-0.030
umpy-0.030
ney-0.029
Token onto
Token ID4291
Feature activation-7.59e+00

Pos logit contributions
 them+0.046
 your+0.045
 it+0.045
 the+0.045
 yours+0.043
Neg logit contributions
umpy-0.052
ouse-0.052
oc-0.051
anda-0.050
ney-0.049
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 yours+0.092
 respect+0.089
 yourselves+0.088
 ours+0.084
 ourselves+0.076
Neg logit contributions
oc-0.180
pillar-0.171
hog-0.171
 bear-0.170
est-0.168
Token floor
Token ID4314
Feature activation+2.55e+00

Pos logit contributions
ouse+0.057
anda+0.056
umpy+0.055
oc+0.054
ney+0.053
Neg logit contributions
 the-0.052
 it-0.052
 them-0.051
 your-0.048
 their-0.046
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ouse+0.081
anda+0.080
oc+0.079
umpy+0.078
ney+0.076
Neg logit contributions
 the-0.054
 it-0.052
 them-0.049
 your-0.047
 this-0.045
Token But
Token ID887
Feature activation-1.87e+00

Pos logit contributions
ouse+0.031
anda+0.030
oc+0.030
umpy+0.029
ney+0.028
Neg logit contributions
 the-0.032
 it-0.031
 them-0.029
 your-0.028
 this-0.028
Token that
Token ID326
Feature activation-6.78e-01

Pos logit contributions
 them+0.032
 yourselves+0.031
 yours+0.029
 your+0.028
 the+0.028
Neg logit contributions
oc-0.055
anda-0.055
ouse-0.055
uer-0.052
umpy-0.051
Token take
Token ID1011
Feature activation-5.39e+00

Pos logit contributions
 the+0.067
 it+0.066
 them+0.066
 anything+0.063
 their+0.062
Neg logit contributions
oc-0.132
anda-0.124
ouse-0.123
ney-0.120
affe-0.120
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
wered+0.031
 the+0.030
 it+0.022
 them+0.021
 their+0.020
Neg logit contributions
oc-0.225
anda-0.218
ouse-0.215
ables-0.208
affe-0.208
Token closer
Token ID5699
Feature activation+2.28e+00

Pos logit contributions
ouse+0.061
anda+0.060
oc+0.060
umpy+0.059
ney+0.057
Neg logit contributions
 the-0.033
 it-0.032
 them-0.029
 your-0.028
 this-0.027
Token look
Token ID804
Feature activation+4.16e-01

Pos logit contributions
ouse+0.095
anda+0.094
oc+0.094
umpy+0.093
uer+0.090
Neg logit contributions
 the-0.028
 it-0.027
 them-0.023
 your-0.021
 this-0.020
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
ouse+0.014
anda+0.014
oc+0.014
umpy+0.014
ney+0.014
Neg logit contributions
 the-0.008
 it-0.008
 them-0.007
 your-0.007
 this-0.007
Token Did
Token ID7731
Feature activation-7.58e+00

Pos logit contributions
ouse+0.007
anda+0.007
umpy+0.006
oc+0.006
uer+0.006
Neg logit contributions
 the-0.005
 it-0.005
 them-0.004
 your-0.004
 this-0.004
Token you
Token ID345
Feature activation-1.43e+00

Pos logit contributions
 the+0.072
 them+0.068
 it+0.060
 their+0.060
 such+0.057
Neg logit contributions
anda-0.299
ouse-0.285
oc-0.283
affe-0.275
uer-0.267
Token manage
Token ID6687
Feature activation-5.07e+00

Pos logit contributions
 the+0.023
 it+0.022
 them+0.021
 your+0.020
 their+0.019
Neg logit contributions
ouse-0.052
anda-0.051
oc-0.051
umpy-0.051
ney-0.049
Token ok
Token ID12876
Feature activation-1.86e+00

Pos logit contributions
wered+0.065
 the+0.056
 their+0.053
 themselves+0.053
 them+0.052
Neg logit contributions
ouse-0.170
anda-0.163
ables-0.162
umpy-0.159
oc-0.157
Token?"
Token ID1701
Feature activation+1.43e+00

Pos logit contributions
 the+0.035
 it+0.034
 them+0.033
 their+0.030
 your+0.030
Neg logit contributions
ouse-0.061
anda-0.059
oc-0.058
umpy-0.057
ney-0.057
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
anda+0.045
ouse+0.045
umpy+0.043
oc+0.043
uer+0.042
Neg logit contributions
 the-0.035
 it-0.034
 them-0.030
 her-0.029
 your-0.029
Token shouted
Token ID17293
Feature activation-1.35e+00

Pos logit contributions
ouse+0.169
umpy+0.157
ables+0.156
anda+0.155
uer+0.151
Neg logit contributions
 the-0.134
 it-0.127
 them-0.115
 a-0.105
 her-0.105
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 the+0.028
 it+0.027
 them+0.025
 your+0.023
 her+0.023
Neg logit contributions
ouse-0.045
anda-0.045
oc-0.044
umpy-0.043
ney-0.043
Token
Token ID198
Feature activation-6.54e-01

Pos logit contributions
anda+0.003
ouse+0.003
oc+0.003
uer+0.003
ney+0.003
Neg logit contributions
 the-0.003
 it-0.003
 them-0.003
 her-0.003
 an-0.003
Token
Token ID198
Feature activation-6.63e-01

Pos logit contributions
 the+0.016
 it+0.016
 them+0.015
 your+0.015
 her+0.014
Neg logit contributions
ouse-0.018
anda-0.018
oc-0.018
umpy-0.018
ney-0.017
TokenMom
Token ID29252
Feature activation+3.21e+00

Pos logit contributions
 the+0.016
 it+0.016
 them+0.015
 your+0.014
 this+0.014
Neg logit contributions
ouse-0.020
anda-0.020
oc-0.019
umpy-0.019
ney-0.018
Token heard
Token ID2982
Feature activation-7.58e+00

Pos logit contributions
ouse+0.098
oc+0.096
anda+0.095
umpy+0.093
uer+0.092
Neg logit contributions
 the-0.078
 it-0.074
 them-0.071
 her-0.066
 this-0.064
Token the
Token ID262
Feature activation+1.72e+00

Pos logit contributions
 yours+0.085
 respect+0.067
 happen+0.066
 yourself+0.064
 yourselves+0.064
Neg logit contributions
anda-0.250
s-0.248
oc-0.244
affe-0.242
jo-0.231
Token noise
Token ID7838
Feature activation+1.34e+00

Pos logit contributions
ouse+0.052
anda+0.051
oc+0.051
umpy+0.049
ney+0.048
Neg logit contributions
 the-0.041
 it-0.039
 them-0.037
 your-0.035
 this-0.035
Token and
Token ID290
Feature activation-4.87e-01

Pos logit contributions
ouse+0.040
anda+0.040
oc+0.039
umpy+0.038
uer+0.037
Neg logit contributions
 the-0.033
 it-0.032
 them-0.030
 her-0.028
 your-0.028
Token ran
Token ID4966
Feature activation-2.71e+00

Pos logit contributions
 your+0.013
 yours+0.013
 the+0.013
 it+0.013
 such+0.013
Neg logit contributions
anda-0.011
oc-0.010
ouse-0.010
umpy-0.010
ney-0.009
Token to
Token ID284
Feature activation-5.61e+00

Pos logit contributions
 it+0.048
 yours+0.048
 your+0.048
 ourselves+0.047
 anything+0.047
Neg logit contributions
umpy-0.077
anda-0.076
ny-0.075
oc-0.073
ney-0.072
Token for
Token ID329
Feature activation-5.19e+00

Pos logit contributions
ouse+0.034
anda+0.033
oc+0.032
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.031
 it-0.030
 them-0.029
 your-0.028
 anything-0.028
Token everyone
Token ID2506
Feature activation+1.55e+00

Pos logit contributions
 yours+0.050
 your+0.047
 respect+0.045
 it+0.041
wered+0.039
Neg logit contributions
anda-0.168
oc-0.160
ouse-0.158
uer-0.156
int-0.155
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
ouse+0.052
anda+0.051
oc+0.050
umpy+0.049
ney+0.048
Neg logit contributions
 the-0.030
 it-0.029
 them-0.028
 your-0.026
 anything-0.025
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
ouse+0.067
anda+0.066
oc+0.065
umpy+0.064
ney+0.062
Neg logit contributions
 the-0.056
 it-0.054
 them-0.051
 your-0.049
 her-0.048
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
ouse+0.033
oc+0.033
anda+0.033
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.033
 it-0.032
 them-0.031
 anything-0.030
 your-0.030
TokenMary
Token ID24119
Feature activation+5.21e+00

Pos logit contributions
ouse+0.041
anda+0.040
oc+0.040
umpy+0.039
ney+0.038
Neg logit contributions
 the-0.038
 it-0.037
 them-0.035
 your-0.034
 anything-0.033
Token was
Token ID373
Feature activation-1.57e+00

Pos logit contributions
umpy+0.155
ouse+0.153
anda+0.152
uer+0.151
ari+0.145
Neg logit contributions
 the-0.138
 it-0.130
 this-0.114
 her-0.113
 them-0.109
Token so
Token ID523
Feature activation-1.56e+00

Pos logit contributions
 them+0.039
 it+0.039
 the+0.038
 your+0.036
 anything+0.035
Neg logit contributions
ouse-0.040
umpy-0.039
anda-0.038
ney-0.036
oc-0.036
Token happy
Token ID3772
Feature activation-2.35e-01

Pos logit contributions
 the+0.043
 it+0.042
 them+0.040
 your+0.038
 her+0.038
Neg logit contributions
ouse-0.041
anda-0.040
oc-0.040
umpy-0.039
ney-0.038
Token to
Token ID284
Feature activation-2.66e+00

Pos logit contributions
 the+0.005
 them+0.005
 it+0.004
 your+0.004
 anything+0.004
Neg logit contributions
ouse-0.007
anda-0.007
oc-0.007
umpy-0.007
ney-0.007
Token see
Token ID766
Feature activation-5.15e+00

Pos logit contributions
 the+0.042
 it+0.042
 them+0.041
 anything+0.039
 your+0.037
Neg logit contributions
ouse-0.089
oc-0.089
umpy-0.086
ney-0.082
anda-0.082
Tokenmy
Token ID1820
Feature activation+4.43e+00

Pos logit contributions
uer+0.133
umpy+0.132
ouse+0.131
ables+0.129
oc+0.126
Neg logit contributions
 the-0.124
 it-0.116
 her-0.110
 what-0.107
 them-0.106
Token opened
Token ID4721
Feature activation-4.48e+00

Pos logit contributions
umpy+0.139
ouse+0.134
ables+0.131
oc+0.128
uer+0.127
Neg logit contributions
 the-0.117
 it-0.112
 them-0.100
 her-0.099
 this-0.096
Token the
Token ID262
Feature activation+2.50e+00

Pos logit contributions
 happen+0.041
 listen+0.036
 respect+0.036
 anything+0.035
 yourselves+0.035
Neg logit contributions
anda-0.160
s-0.159
oc-0.156
acked-0.149
ouse-0.147
Token door
Token ID3420
Feature activation+2.24e+00

Pos logit contributions
ouse+0.069
oc+0.067
anda+0.067
umpy+0.066
ney+0.064
Neg logit contributions
 the-0.066
 it-0.063
 them-0.060
 your-0.058
 her-0.057
Token and
Token ID290
Feature activation-5.67e-01

Pos logit contributions
ouse+0.063
anda+0.062
oc+0.062
umpy+0.060
ney+0.058
Neg logit contributions
 the-0.057
 it-0.055
 them-0.052
 your-0.050
 her-0.049
Token Lily
Token ID20037
Feature activation+4.73e+00

Pos logit contributions
 the+0.014
 anything+0.014
 them+0.013
 it+0.013
 your+0.013
Neg logit contributions
ouse-0.015
anda-0.014
oc-0.014
ney-0.013
ari-0.013
Token ran
Token ID4966
Feature activation-2.63e+00

Pos logit contributions
umpy+0.149
ouse+0.143
oc+0.141
anda+0.140
uer+0.137
Neg logit contributions
 the-0.118
 it-0.112
 her-0.099
 them-0.098
 this-0.096
Token inside
Token ID2641
Feature activation-8.77e-01

Pos logit contributions
 anything+0.051
wered+0.050
 yourselves+0.049
 your+0.049
 the+0.049
Neg logit contributions
ouse-0.073
anda-0.072
suit-0.069
ney-0.068
ari-0.067
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 yourselves+0.022
 anything+0.022
wered+0.022
 yours+0.021
 yourself+0.021
Neg logit contributions
s-0.019
ouse-0.018
oc-0.018
ney-0.017
umpy-0.017
Token She
Token ID1375
Feature activation+2.70e+00

Pos logit contributions
ouse+0.025
anda+0.024
umpy+0.024
oc+0.023
ney+0.023
Neg logit contributions
 the-0.028
 it-0.028
 them-0.027
 your-0.026
 anything-0.026
Token saw
Token ID2497
Feature activation-4.60e+00

Pos logit contributions
umpy+0.078
ouse+0.078
oc+0.077
anda+0.076
uer+0.073
Neg logit contributions
 the-0.070
 it-0.067
 them-0.061
 her-0.060
 this-0.059
Token You
Token ID921
Feature activation-1.57e+00

Pos logit contributions
 the+0.025
 it+0.025
 your+0.023
 them+0.022
 her+0.022
Neg logit contributions
ouse-0.043
anda-0.041
oc-0.041
umpy-0.041
uer-0.041
Token have
Token ID423
Feature activation-3.04e+00

Pos logit contributions
 the+0.030
 it+0.029
 them+0.027
 your+0.025
 her+0.025
Neg logit contributions
ouse-0.054
anda-0.054
oc-0.053
umpy-0.052
ney-0.050
Token your
Token ID534
Feature activation+9.44e-01

Pos logit contributions
wered+0.028
 the+0.026
 them+0.025
 it+0.024
 happen+0.023
Neg logit contributions
anda-0.121
ouse-0.118
umpy-0.111
ables-0.111
oc-0.109
Token truck
Token ID7779
Feature activation+4.50e+00

Pos logit contributions
oc+0.025
ouse+0.025
umpy+0.025
ney+0.024
anda+0.024
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.024
 her-0.023
Token,"
Token ID553
Feature activation-7.98e-02

Pos logit contributions
ouse+0.157
oc+0.156
umpy+0.155
anda+0.152
uer+0.151
Neg logit contributions
 the-0.087
 it-0.079
 your-0.073
 this-0.070
 them-0.070
Token Lily
Token ID20037
Feature activation+5.33e+00

Pos logit contributions
 the+0.001
 it+0.001
 them+0.001
 your+0.001
 her+0.001
Neg logit contributions
ouse-0.003
anda-0.003
oc-0.003
umpy-0.003
ney-0.003
Token said
Token ID531
Feature activation-3.83e-01

Pos logit contributions
ouse+0.159
anda+0.159
umpy+0.158
oc+0.154
ney+0.154
Neg logit contributions
 the-0.139
 it-0.131
 her-0.122
 them-0.117
 an-0.116
Token.
Token ID13
Feature activation+9.55e-01

Pos logit contributions
 the+0.008
 it+0.008
 them+0.008
 your+0.007
 anything+0.007
Neg logit contributions
ouse-0.012
anda-0.012
oc-0.011
umpy-0.011
uer-0.011
Token
Token ID198
Feature activation+4.47e-01

Pos logit contributions
ouse+0.030
anda+0.030
oc+0.029
umpy+0.028
uer+0.028
Neg logit contributions
 the-0.023
 it-0.022
 them-0.020
 her-0.019
 your-0.019
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
ouse+0.013
anda+0.012
oc+0.012
umpy+0.012
ney+0.012
Neg logit contributions
 the-0.011
 it-0.011
 them-0.010
 your-0.010
 this-0.010
Token"
Token ID1
Feature activation-1.33e+00

Pos logit contributions
ouse+0.005
oc+0.005
anda+0.005
umpy+0.005
ney+0.005
Neg logit contributions
 the-0.004
 it-0.004
 them-0.004
 your-0.004
 anything-0.004
Token "
Token ID366
Feature activation-1.21e+00

Pos logit contributions
anda+0.020
ouse+0.020
oc+0.019
uer+0.019
ney+0.019
Neg logit contributions
 the-0.012
 it-0.011
 them-0.010
 her-0.010
 their-0.009
TokenFish
Token ID39428
Feature activation+3.12e+00

Pos logit contributions
 the+0.025
 it+0.024
 them+0.023
 your+0.021
 her+0.020
Neg logit contributions
anda-0.040
ouse-0.040
oc-0.039
umpy-0.038
ney-0.037
Token are
Token ID389
Feature activation-2.85e+00

Pos logit contributions
ouse+0.113
anda+0.111
oc+0.110
umpy+0.108
ney+0.106
Neg logit contributions
 the-0.055
 it-0.053
 them-0.049
 your-0.046
 this-0.044
Token better
Token ID1365
Feature activation+2.94e-01

Pos logit contributions
 it+0.050
 the+0.050
 them+0.048
 anything+0.043
 her+0.043
Neg logit contributions
anda-0.091
umpy-0.088
oc-0.088
ouse-0.088
ney-0.083
Token than
Token ID621
Feature activation-4.30e+00

Pos logit contributions
ouse+0.007
anda+0.007
oc+0.007
umpy+0.007
ney+0.007
Neg logit contributions
 the-0.008
 it-0.008
 them-0.008
 your-0.007
 her-0.007
Token ducks
Token ID39694
Feature activation+4.48e+00

Pos logit contributions
 it+0.047
 the+0.043
wered+0.040
 her+0.038
 such+0.037
Neg logit contributions
anda-0.153
umpy-0.149
ouse-0.142
affe-0.142
ari-0.139
Token."
Token ID526
Feature activation+8.47e-01

Pos logit contributions
ouse+0.147
anda+0.145
oc+0.144
umpy+0.141
ney+0.138
Neg logit contributions
 the-0.092
 it-0.089
 them-0.083
 your-0.079
 her-0.076
Token
Token ID198
Feature activation+5.82e-01

Pos logit contributions
anda+0.027
ouse+0.027
oc+0.026
umpy+0.025
uer+0.025
Neg logit contributions
 the-0.020
 it-0.019
 them-0.018
 your-0.017
 her-0.016
Token
Token ID198
Feature activation+4.09e-01

Pos logit contributions
ouse+0.016
anda+0.016
oc+0.016
umpy+0.016
ney+0.015
Neg logit contributions
 the-0.015
 it-0.014
 them-0.013
 your-0.013
 anything-0.013
TokenThey
Token ID2990
Feature activation-2.51e-02

Pos logit contributions
ouse+0.012
anda+0.012
oc+0.012
umpy+0.011
uer+0.011
Neg logit contributions
 the-0.010
 it-0.010
 them-0.009
 your-0.009
 her-0.009
Token stop
Token ID2245
Feature activation-2.92e+00

Pos logit contributions
 it+0.000
 your+0.000
 the+0.000
 them+0.000
 such+0.000
Neg logit contributions
anda-0.001
ouse-0.001
oc-0.001
umpy-0.001
affe-0.001
Token they
Token ID484
Feature activation+1.55e+00

Pos logit contributions
 the+0.023
 it+0.022
 them+0.022
 your+0.021
 anything+0.020
Neg logit contributions
ouse-0.035
oc-0.034
anda-0.034
umpy-0.033
ney-0.032
Token both
Token ID1111
Feature activation+9.77e-01

Pos logit contributions
ouse+0.048
anda+0.048
oc+0.047
umpy+0.047
ney+0.045
Neg logit contributions
 the-0.036
 it-0.035
 them-0.032
 her-0.030
 your-0.030
Token went
Token ID1816
Feature activation-2.07e+00

Pos logit contributions
ouse+0.026
anda+0.026
oc+0.025
umpy+0.025
uer+0.024
Neg logit contributions
 the-0.027
 it-0.027
 them-0.024
 your-0.023
 her-0.023
Token flying
Token ID7348
Feature activation-2.03e+00

Pos logit contributions
 the+0.042
 it+0.041
 them+0.039
 anything+0.038
 your+0.038
Neg logit contributions
anda-0.061
ouse-0.060
umpy-0.058
oc-0.057
uer-0.056
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
 the+0.039
 it+0.038
 your+0.038
 them+0.037
 what+0.036
Neg logit contributions
oc-0.060
anda-0.059
ouse-0.059
umpy-0.058
ney-0.058
Token Tim
Token ID5045
Feature activation+4.97e+00

Pos logit contributions
ouse+0.042
anda+0.042
oc+0.041
umpy+0.040
uer+0.040
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.013
 their-0.013
Tokenmy
Token ID1820
Feature activation+4.87e+00

Pos logit contributions
uer+0.163
ouse+0.152
ouses+0.152
oc+0.150
int+0.141
Neg logit contributions
 the-0.123
 it-0.107
 her-0.101
 him-0.096
 your-0.095
Token fell
Token ID3214
Feature activation-1.51e+00

Pos logit contributions
ouse+0.149
umpy+0.147
anda+0.145
ari+0.144
uer+0.142
Neg logit contributions
 the-0.129
 it-0.128
 this-0.104
 them-0.101
 a-0.098
Token down
Token ID866
Feature activation-2.07e+00

Pos logit contributions
 the+0.034
 it+0.033
 them+0.030
 your+0.029
 her+0.029
Neg logit contributions
ouse-0.049
oc-0.048
anda-0.048
umpy-0.046
ney-0.046
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
 the+0.047
 it+0.047
 them+0.047
 your+0.046
 anything+0.044
Neg logit contributions
oc-0.054
umpy-0.054
ouse-0.054
anda-0.053
ney-0.051
Token hurt
Token ID5938
Feature activation-3.58e+00

Pos logit contributions
 the+0.004
 it+0.004
 them+0.004
 your+0.004
 anything+0.003
Neg logit contributions
ouse-0.005
anda-0.005
oc-0.005
umpy-0.005
ney-0.005
Token basket
Token ID7988
Feature activation+3.36e+00

Pos logit contributions
anda+0.021
ouse+0.021
oc+0.021
umpy+0.021
ney+0.020
Neg logit contributions
 the-0.021
 it-0.020
 them-0.019
 your-0.019
 her-0.018
Token.
Token ID13
Feature activation+7.32e-01

Pos logit contributions
ouse+0.106
anda+0.105
oc+0.104
umpy+0.102
ney+0.099
Neg logit contributions
 the-0.073
 it-0.070
 them-0.066
 your-0.063
 her-0.061
Token He
Token ID679
Feature activation+2.48e+00

Pos logit contributions
ouse+0.016
anda+0.015
oc+0.015
umpy+0.014
uer+0.014
Neg logit contributions
 the-0.025
 it-0.024
 them-0.023
 your-0.022
 their-0.022
Token looked
Token ID3114
Feature activation-2.12e+00

Pos logit contributions
ouse+0.073
umpy+0.072
anda+0.070
oc+0.068
uer+0.068
Neg logit contributions
 the-0.067
 it-0.064
 them-0.058
 her-0.055
 their-0.055
Token friendly
Token ID8030
Feature activation+7.09e-01

Pos logit contributions
 the+0.055
 it+0.053
 them+0.051
 your+0.048
 her+0.047
Neg logit contributions
ouse-0.060
anda-0.059
oc-0.058
umpy-0.057
ney-0.056
Token.
Token ID13
Feature activation+8.43e-01

Pos logit contributions
ouse+0.021
anda+0.020
oc+0.020
umpy+0.020
ney+0.019
Neg logit contributions
 the-0.017
 it-0.016
 them-0.015
 your-0.015
 this-0.014
Token They
Token ID1119
Feature activation+2.05e+00

Pos logit contributions
ouse+0.021
anda+0.021
oc+0.020
umpy+0.019
uer+0.019
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.023
 their-0.023
Token ran
Token ID4966
Feature activation-2.04e+00

Pos logit contributions
ouse+0.055
anda+0.053
oc+0.053
umpy+0.052
ney+0.050
Neg logit contributions
 the-0.058
 it-0.055
 them-0.053
 her-0.050
 your-0.049
Token to
Token ID284
Feature activation-5.15e+00

Pos logit contributions
 the+0.032
 it+0.031
 them+0.028
 your+0.027
 anything+0.027
Neg logit contributions
ouse-0.073
anda-0.073
oc-0.072
umpy-0.071
ney-0.069
Token him
Token ID683
Feature activation+2.31e+00

Pos logit contributions
 yourselves+0.049
 yourself+0.046
 yours+0.045
 anything+0.041
 such+0.041
Neg logit contributions
oc-0.165
 driver-0.160
s-0.153
anda-0.148
ny-0.146
Token and
Token ID290
Feature activation-2.64e-01

Pos logit contributions
ouse+0.073
anda+0.073
umpy+0.072
oc+0.070
uer+0.069
Neg logit contributions
 the-0.054
 it-0.051
 them-0.047
 her-0.045
 this-0.045
Token and
Token ID290
Feature activation-2.18e+00

Pos logit contributions
ouse+0.071
anda+0.070
oc+0.069
umpy+0.067
ney+0.065
Neg logit contributions
 the-0.057
 it-0.055
 them-0.052
 your-0.050
 her-0.049
Token think
Token ID892
Feature activation-2.67e+00

Pos logit contributions
 the+0.034
 them+0.033
 it+0.032
 such+0.028
 anything+0.028
Neg logit contributions
ouse-0.075
anda-0.073
oc-0.072
ney-0.071
umpy-0.070
Token about
Token ID546
Feature activation-5.61e+00

Pos logit contributions
 the+0.060
 it+0.058
 them+0.056
 your+0.055
 such+0.053
Neg logit contributions
ouse-0.074
anda-0.073
umpy-0.072
oc-0.070
ney-0.067
Token what
Token ID644
Feature activation-2.25e+00

Pos logit contributions
wered+0.042
 the+0.037
 it+0.036
 them+0.034
 such+0.033
Neg logit contributions
oc-0.208
anda-0.205
ouse-0.195
jo-0.194
ney-0.190
Token he
Token ID339
Feature activation+2.13e+00

Pos logit contributions
 the+0.048
 them+0.048
 it+0.046
 your+0.044
 such+0.043
Neg logit contributions
ouse-0.063
anda-0.062
umpy-0.060
ney-0.059
oc-0.059
Token did
Token ID750
Feature activation+6.14e-01

Pos logit contributions
ouse+0.063
anda+0.063
oc+0.060
umpy+0.059
uer+0.059
Neg logit contributions
 the-0.054
 it-0.051
 them-0.046
 her-0.045
 this-0.045
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
umpy+0.016
ouse+0.016
oc+0.015
ney+0.014
ables+0.014
Neg logit contributions
 the-0.014
 it-0.014
wered-0.014
 them-0.014
 your-0.013
Token
Token ID198
Feature activation-9.73e-02

Pos logit contributions
ouse+0.049
anda+0.048
oc+0.047
uer+0.046
umpy+0.045
Neg logit contributions
 the-0.035
 it-0.034
 them-0.031
 her-0.030
 your-0.030
Token
Token ID198
Feature activation-3.66e-02

Pos logit contributions
 the+0.002
 it+0.002
 them+0.002
 your+0.002
 anything+0.002
Neg logit contributions
ouse-0.003
anda-0.003
oc-0.003
umpy-0.003
ney-0.002
TokenTom
Token ID13787
Feature activation+5.31e+00

Pos logit contributions
 the+0.001
 it+0.001
 them+0.001
 your+0.001
 her+0.001
Neg logit contributions
ouse-0.001
anda-0.001
oc-0.001
umpy-0.001
uer-0.001
Token and
Token ID290
Feature activation-2.58e+00

Pos logit contributions
ouse+0.168
uer+0.167
oc+0.165
umpy+0.162
ouses+0.162
Neg logit contributions
 the-0.132
 it-0.118
 a-0.108
 her-0.107
 them-0.104
Token go
Token ID467
Feature activation-9.54e-01

Pos logit contributions
 the+0.027
 it+0.025
 them+0.023
 your+0.022
 yourselves+0.018
Neg logit contributions
oc-0.181
ouse-0.178
umpy-0.173
anda-0.173
ney-0.172
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
 the+0.021
 it+0.020
 them+0.019
 your+0.018
 her+0.018
Neg logit contributions
ouse-0.030
anda-0.029
oc-0.029
umpy-0.029
ney-0.028
Token release
Token ID2650
Feature activation-5.03e+00

Pos logit contributions
 the+0.028
 them+0.028
 it+0.027
 anything+0.024
 your+0.024
Neg logit contributions
ouse-0.069
oc-0.068
anda-0.066
ney-0.065
uer-0.064
Token the
Token ID262
Feature activation+2.22e+00

Pos logit contributions
 the+0.018
 it+0.017
 them+0.014
 your+0.011
 her+0.007
Neg logit contributions
oc-0.230
ouse-0.226
anda-0.224
umpy-0.221
ney-0.218
Token word
Token ID1573
Feature activation+1.93e+00

Pos logit contributions
ouse+0.065
anda+0.064
oc+0.062
umpy+0.062
ney+0.060
Neg logit contributions
 the-0.057
 it-0.055
 them-0.051
 your-0.049
 this-0.049
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
ouse+0.066
anda+0.066
oc+0.065
umpy+0.064
ney+0.062
Neg logit contributions
 the-0.039
 it-0.038
 them-0.034
 your-0.032
 her-0.032
Token<|endoftext|>
Token ID50256
Feature activation+2.48e+00

Pos logit contributions
ouse+0.059
anda+0.059
umpy+0.056
uer+0.056
oc+0.054
Neg logit contributions
 the-0.046
 it-0.044
 them-0.039
 this-0.038
 your-0.038
TokenOnce
Token ID7454
Feature activation+9.97e-01

Pos logit contributions
ouse+0.069
anda+0.068
oc+0.066
umpy+0.065
uer+0.064
Neg logit contributions
 the-0.066
 it-0.063
 them-0.060
 your-0.056
 her-0.056
Token there
Token ID612
Feature activation+2.94e+00

Pos logit contributions
ouse+0.028
oc+0.028
anda+0.027
umpy+0.027
uer+0.026
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.023
 her-0.022
Token was
Token ID373
Feature activation-4.04e+00

Pos logit contributions
ouse+0.071
anda+0.070
oc+0.069
umpy+0.067
ney+0.065
Neg logit contributions
 the-0.084
 it-0.081
 them-0.079
 your-0.076
 anything-0.074
Token a
Token ID257
Feature activation+2.87e+00

Pos logit contributions
 the+0.028
 it+0.026
 them+0.024
 anything+0.020
 your+0.020
Neg logit contributions
oc-0.173
anda-0.171
ouse-0.170
umpy-0.170
ney-0.164
Token whistle
Token ID16121
Feature activation+4.09e+00

Pos logit contributions
ouse+0.066
anda+0.066
oc+0.066
umpy+0.065
ney+0.062
Neg logit contributions
 the-0.056
 it-0.055
 them-0.051
 your-0.050
 her-0.048
Token and
Token ID290
Feature activation+1.79e-01

Pos logit contributions
anda+0.142
ouse+0.140
oc+0.138
umpy+0.136
uer+0.134
Neg logit contributions
 the-0.085
 it-0.080
 them-0.071
 her-0.069
 this-0.069
Token was
Token ID373
Feature activation-1.96e+00

Pos logit contributions
ouse+0.005
anda+0.005
oc+0.005
umpy+0.005
ney+0.005
Neg logit contributions
 the-0.004
 it-0.004
 them-0.004
 your-0.004
 anything-0.004
Token very
Token ID845
Feature activation-1.32e+00

Pos logit contributions
 the+0.046
 it+0.044
 them+0.041
 your+0.039
 her+0.039
Neg logit contributions
ouse-0.061
oc-0.060
anda-0.060
umpy-0.058
ney-0.057
Token excited
Token ID6568
Feature activation+5.19e-01

Pos logit contributions
 the+0.032
 it+0.031
 your+0.029
 her+0.028
 their+0.027
Neg logit contributions
oc-0.042
ari-0.041
anda-0.041
ouse-0.040
ney-0.040
Token.
Token ID13
Feature activation+2.27e+00

Pos logit contributions
ouse+0.015
anda+0.015
umpy+0.014
oc+0.014
ney+0.014
Neg logit contributions
 the-0.011
 it-0.011
 them-0.011
 your-0.010
 anything-0.010
Token She
Token ID1375
Feature activation+3.13e+00

Pos logit contributions
anda+0.073
ouse+0.072
oc+0.072
uer+0.070
umpy+0.070
Neg logit contributions
 the-0.053
 it-0.052
 them-0.047
 her-0.046
 your-0.045
Token put
Token ID1234
Feature activation-3.80e+00

Pos logit contributions
umpy+0.094
oc+0.092
ouse+0.092
anda+0.092
uer+0.090
Neg logit contributions
 the-0.079
 it-0.079
 them-0.070
 her-0.066
 your-0.066
Token the
Token ID262
Feature activation+2.31e+00

Pos logit contributions
 the+0.043
 them+0.041
 it+0.040
 your+0.039
 anything+0.038
Neg logit contributions
ouse-0.142
oc-0.137
anda-0.137
umpy-0.131
affe-0.129
Token whistle
Token ID16121
Feature activation+3.64e+00

Pos logit contributions
anda+0.068
oc+0.066
ouse+0.066
umpy+0.063
ney+0.063
Neg logit contributions
 the-0.059
 it-0.057
 them-0.053
 your-0.051
 this-0.050
Token to
Token ID284
Feature activation-5.05e+00

Pos logit contributions
anda+0.134
oc+0.133
ouse+0.129
umpy+0.128
uer+0.126
Neg logit contributions
 the-0.071
 it-0.066
 her-0.058
 them-0.057
 this-0.056
Token Together
Token ID17083
Feature activation-1.83e+00

Pos logit contributions
anda+0.031
ouse+0.031
oc+0.030
umpy+0.029
ney+0.029
Neg logit contributions
 the-0.022
 it-0.021
 them-0.019
 your-0.018
 this-0.018
Token,
Token ID11
Feature activation-2.02e+00

Pos logit contributions
 the+0.039
 it+0.038
 them+0.037
 your+0.036
 anything+0.036
Neg logit contributions
ouse-0.053
anda-0.052
oc-0.052
umpy-0.049
ney-0.049
Token they
Token ID484
Feature activation+1.48e+00

Pos logit contributions
 the+0.035
 it+0.034
 them+0.031
 your+0.030
 her+0.028
Neg logit contributions
ouse-0.072
anda-0.071
oc-0.071
umpy-0.070
ney-0.068
Token reached
Token ID4251
Feature activation-5.53e+00

Pos logit contributions
ouse+0.043
umpy+0.043
anda+0.043
oc+0.042
uer+0.040
Neg logit contributions
 the-0.038
 it-0.036
 them-0.034
 her-0.032
 this-0.031
Token the
Token ID262
Feature activation+2.43e+00

Pos logit contributions
 yourselves+0.036
wered+0.031
 it+0.030
 respect+0.029
 them+0.029
Neg logit contributions
anda-0.215
oc-0.211
ney-0.210
ouse-0.209
ables-0.207
Token top
Token ID1353
Feature activation+3.07e+00

Pos logit contributions
ouse+0.076
anda+0.074
oc+0.074
umpy+0.072
uer+0.070
Neg logit contributions
 the-0.056
 it-0.053
 them-0.050
 your-0.048
 this-0.047
Token of
Token ID286
Feature activation-5.61e+00

Pos logit contributions
ouse+0.103
anda+0.099
oc+0.096
umpy+0.095
uer+0.094
Neg logit contributions
 the-0.071
 it-0.067
 them-0.060
 this-0.059
 her-0.057
Token the
Token ID262
Feature activation+3.19e+00

Pos logit contributions
 yourselves+0.053
 happen+0.051
 respect+0.048
 yourself+0.046
 listen+0.044
Neg logit contributions
oc-0.173
affe-0.156
s-0.156
ny-0.154
ang-0.153
Token tree
Token ID5509
Feature activation+3.25e+00

Pos logit contributions
ouse+0.100
anda+0.094
oc+0.094
umpy+0.093
uer+0.091
Neg logit contributions
 the-0.079
 it-0.074
 this-0.066
 your-0.066
 them-0.065
Token.
Token ID13
Feature activation+9.86e-01

Pos logit contributions
ouse+0.112
anda+0.110
umpy+0.108
oc+0.107
uer+0.103
Neg logit contributions
 the-0.068
 it-0.064
 them-0.057
 this-0.055
 her-0.054
Token Tim
Token ID5045
Feature activation+5.22e+00

Pos logit contributions
anda+0.033
ouse+0.033
oc+0.032
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.022
 it-0.021
 them-0.019
 your-0.018
 this-0.017
Token the
Token ID262
Feature activation+7.19e-01

Pos logit contributions
 them+0.038
 the+0.036
 it+0.035
 your+0.035
 such+0.034
Neg logit contributions
ouse-0.049
anda-0.048
oc-0.047
umpy-0.045
ney-0.045
Token light
Token ID1657
Feature activation+2.46e+00

Pos logit contributions
ouse+0.019
anda+0.018
oc+0.018
umpy+0.018
ney+0.017
Neg logit contributions
 the-0.020
 it-0.019
 them-0.018
 your-0.018
 her-0.017
Token go
Token ID467
Feature activation-4.26e-01

Pos logit contributions
ouse+0.076
oc+0.075
anda+0.075
umpy+0.073
uer+0.071
Neg logit contributions
 the-0.059
 it-0.057
 them-0.053
 your-0.049
 her-0.049
Token off
Token ID572
Feature activation-1.88e+00

Pos logit contributions
 the+0.009
 it+0.009
 them+0.008
 your+0.008
 her+0.008
Neg logit contributions
ouse-0.014
anda-0.013
oc-0.013
umpy-0.013
ney-0.013
Token.
Token ID13
Feature activation+6.53e-01

Pos logit contributions
 it+0.037
 the+0.037
 them+0.037
 your+0.036
 what+0.035
Neg logit contributions
oc-0.054
ouse-0.053
umpy-0.053
anda-0.052
ney-0.051
Token
Token ID198
Feature activation-2.04e-01

Pos logit contributions
anda+0.017
ouse+0.017
oc+0.017
umpy+0.016
ney+0.016
Neg logit contributions
 the-0.018
 it-0.018
 them-0.017
 your-0.016
 her-0.016
Token
Token ID198
Feature activation-4.06e-01

Pos logit contributions
 the+0.005
 it+0.005
 them+0.005
 your+0.005
 anything+0.005
Neg logit contributions
ouse-0.006
anda-0.005
oc-0.005
umpy-0.005
ney-0.005
Token"
Token ID1
Feature activation-4.63e-01

Pos logit contributions
 the+0.010
 it+0.010
 them+0.009
 your+0.009
 her+0.008
Neg logit contributions
ouse-0.012
anda-0.012
oc-0.012
umpy-0.011
ney-0.011
TokenI
Token ID40
Feature activation+3.38e-01

Pos logit contributions
 the+0.011
 it+0.011
 them+0.010
 your+0.010
 this+0.010
Neg logit contributions
ouse-0.013
oc-0.013
umpy-0.013
anda-0.013
ney-0.012
Token'm
Token ID1101
Feature activation-2.91e+00

Pos logit contributions
ouse+0.011
anda+0.011
oc+0.010
uer+0.010
ney+0.010
Neg logit contributions
 the-0.008
 it-0.008
 them-0.007
 your-0.007
 an-0.006
Token sorry
Token ID7926
Feature activation-2.79e+00

Pos logit contributions
 them+0.080
 respect+0.074
 it+0.073
 the+0.071
 their+0.070
Neg logit contributions
anda-0.062
umpy-0.061
ouse-0.058
ney-0.051
ang-0.050
Token I
Token ID314
Feature activation+5.19e-01

Pos logit contributions
 them+0.066
 such+0.063
 him+0.062
wered+0.062
 the+0.057
Neg logit contributions
anda-0.134
ouse-0.119
oc-0.118
ney-0.115
uer-0.112
Token like
Token ID588
Feature activation-4.59e+00

Pos logit contributions
ouse+0.018
anda+0.018
oc+0.018
umpy+0.018
ney+0.017
Neg logit contributions
 the-0.010
 it-0.009
 them-0.009
 your-0.008
 her-0.008
Token this
Token ID428
Feature activation+1.28e+00

Pos logit contributions
wered+0.041
 him+0.035
 such+0.033
 them+0.033
 permission+0.031
Neg logit contributions
anda-0.177
ouse-0.162
oc-0.160
umpy-0.160
ney-0.158
Token one
Token ID530
Feature activation+3.17e+00

Pos logit contributions
ouse+0.041
anda+0.040
oc+0.040
umpy+0.039
ney+0.038
Neg logit contributions
 the-0.028
 it-0.027
 them-0.025
 your-0.024
 her-0.023
Token more
Token ID517
Feature activation+1.09e+00

Pos logit contributions
ouse+0.117
oc+0.111
umpy+0.111
anda+0.111
uer+0.110
Neg logit contributions
 the-0.061
 it-0.059
 your-0.051
 this-0.050
 them-0.050
Token,"
Token ID553
Feature activation-5.86e-01

Pos logit contributions
ouse+0.035
oc+0.035
anda+0.035
umpy+0.034
uer+0.033
Neg logit contributions
 the-0.024
 it-0.023
 them-0.021
 your-0.021
 her-0.020
Token Tom
Token ID4186
Feature activation+3.91e+00

Pos logit contributions
 them+0.008
 it+0.008
 anything+0.008
 the+0.007
 such+0.007
Neg logit contributions
anda-0.020
oc-0.020
ouse-0.019
affe-0.019
umpy-0.018
Token says
Token ID1139
Feature activation-4.78e-01

Pos logit contributions
ouse+0.120
oc+0.110
uer+0.110
ney+0.110
umpy+0.109
Neg logit contributions
 the-0.108
 it-0.098
 them-0.097
 her-0.094
 this-0.091
Token,
Token ID11
Feature activation-5.19e-01

Pos logit contributions
 the+0.010
 it+0.009
 them+0.008
 her+0.008
 your+0.008
Neg logit contributions
ouse-0.017
oc-0.017
anda-0.016
umpy-0.016
ney-0.016
Token showing
Token ID4478
Feature activation-2.82e+00

Pos logit contributions
 the+0.010
 it+0.010
 them+0.010
 your+0.009
 anything+0.009
Neg logit contributions
ouse-0.016
oc-0.016
anda-0.016
umpy-0.016
uer-0.015
Token a
Token ID257
Feature activation+7.10e-01

Pos logit contributions
 the+0.032
 it+0.029
 them+0.027
 anything+0.026
wered+0.025
Neg logit contributions
anda-0.107
ouse-0.105
oc-0.102
umpy-0.100
affe-0.099
Token,
Token ID11
Feature activation-2.91e+00

Pos logit contributions
 the+0.015
 it+0.014
 them+0.014
 anything+0.013
 your+0.013
Neg logit contributions
ouse-0.023
anda-0.023
umpy-0.022
oc-0.022
ney-0.022
Token it
Token ID340
Feature activation+1.84e-01

Pos logit contributions
 the+0.036
 them+0.034
 it+0.032
 her+0.030
 anything+0.029
Neg logit contributions
anda-0.109
oc-0.109
ouse-0.108
umpy-0.107
affe-0.103
Tokenâ
Token ID22940
Feature activation+1.28e+00

Pos logit contributions
ouse+0.005
anda+0.005
umpy+0.005
uer+0.005
oc+0.005
Neg logit contributions
 the-0.005
 it-0.005
 your-0.004
 them-0.004
 this-0.004
Token
Token ID26391
Feature activation+2.81e+00

Pos logit contributions
ouse+0.032
anda+0.032
oc+0.031
umpy+0.030
uer+0.029
Neg logit contributions
 the-0.037
 it-0.036
 them-0.034
 your-0.033
 this-0.032
Token
Token ID8151
Feature activation+3.56e+00

Pos logit contributions
anda+0.068
ouse+0.067
umpy+0.065
oc+0.064
uer+0.061
Neg logit contributions
 the-0.082
 it-0.081
 them-0.073
 your-0.073
 this-0.072
Tokens
Token ID82
Feature activation-3.07e+00

Pos logit contributions
ouse+0.073
uer+0.070
oc+0.069
anda+0.067
int+0.066
Neg logit contributions
 the-0.121
 it-0.117
 them-0.110
 your-0.107
 this-0.104
Token fragile
Token ID21049
Feature activation+3.61e-01

Pos logit contributions
 the+0.055
 them+0.053
 it+0.053
 her+0.048
 their+0.047
Neg logit contributions
anda-0.101
ouse-0.099
umpy-0.098
oc-0.098
ney-0.097
Token.
Token ID13
Feature activation-7.34e-01

Pos logit contributions
ouse+0.011
anda+0.011
oc+0.011
umpy+0.011
ney+0.011
Neg logit contributions
 the-0.008
 it-0.007
 them-0.007
 your-0.007
 her-0.007
Token You
Token ID921
Feature activation-1.51e+00

Pos logit contributions
 the+0.017
 it+0.017
 them+0.016
 your+0.015
 this+0.015
Neg logit contributions
ouse-0.022
anda-0.022
oc-0.021
umpy-0.021
uer-0.020
Token don
Token ID836
Feature activation-1.90e+00

Pos logit contributions
 the+0.034
 it+0.032
 them+0.031
 your+0.030
 her+0.029
Neg logit contributions
ouse-0.047
anda-0.046
oc-0.045
umpy-0.045
ney-0.043
Tokenâ
Token ID22940
Feature activation+1.36e+00

Pos logit contributions
 the+0.035
 it+0.033
 them+0.032
 your+0.031
 anything+0.031
Neg logit contributions
umpy-0.064
ouse-0.062
oc-0.061
anda-0.061
ney-0.058
Tokenopard
Token ID15478
Feature activation+4.14e+00

Pos logit contributions
ouse+0.071
oc+0.070
anda+0.069
jo+0.068
umpy+0.068
Neg logit contributions
 the-0.170
 it-0.168
 them-0.161
 this-0.157
 your-0.156
Token.
Token ID13
Feature activation+8.31e-01

Pos logit contributions
ouse+0.139
anda+0.137
oc+0.135
umpy+0.134
ney+0.130
Neg logit contributions
 the-0.083
 it-0.080
 them-0.074
 your-0.070
 this-0.069
Token They
Token ID1119
Feature activation+1.96e+00

Pos logit contributions
ouse+0.024
anda+0.024
oc+0.023
umpy+0.023
uer+0.023
Neg logit contributions
 the-0.022
 it-0.021
 them-0.019
 your-0.019
 this-0.018
Token said
Token ID531
Feature activation-2.10e+00

Pos logit contributions
ouse+0.055
anda+0.054
oc+0.053
umpy+0.052
ney+0.051
Neg logit contributions
 the-0.050
 it-0.048
 them-0.046
 your-0.044
 her-0.043
Token it
Token ID340
Feature activation+2.78e+00

Pos logit contributions
 the+0.038
 it+0.035
 them+0.035
 your+0.034
 such+0.033
Neg logit contributions
anda-0.068
ouse-0.067
oc-0.065
umpy-0.063
uer-0.062
Token was
Token ID373
Feature activation-3.24e+00

Pos logit contributions
umpy+0.092
ouse+0.090
anda+0.086
int+0.085
oc+0.085
Neg logit contributions
 the-0.067
 it-0.067
 them-0.059
 her-0.057
 a-0.056
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 it+0.030
 the+0.030
 them+0.027
 your+0.023
 this+0.023
Neg logit contributions
anda-0.127
ouse-0.125
oc-0.123
umpy-0.121
ney-0.118
Token prince
Token ID19716
Feature activation+6.34e+00

Pos logit contributions
ouse+0.048
oc+0.046
anda+0.046
umpy+0.045
jo+0.044
Neg logit contributions
 the-0.048
 it-0.046
 them-0.043
 your-0.043
 anything-0.042
Token of
Token ID286
Feature activation-3.44e+00

Pos logit contributions
ouse+0.203
anda+0.200
umpy+0.199
oc+0.194
uer+0.192
Neg logit contributions
 the-0.144
 it-0.138
 them-0.123
 this-0.121
 her-0.119
Token the
Token ID262
Feature activation+2.85e+00

Pos logit contributions
 it+0.026
 the+0.021
 anything+0.021
 happen+0.021
 ourselves+0.021
Neg logit contributions
anda-0.134
oc-0.133
affe-0.128
ouse-0.126
umpy-0.125
Token jungle
Token ID20712
Feature activation+4.87e+00

Pos logit contributions
ouse+0.076
umpy+0.072
anda+0.071
oc+0.070
uer+0.070
Neg logit contributions
 the-0.080
 it-0.076
 your-0.072
 them-0.071
 this-0.070
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
ouse+0.094
anda+0.091
oc+0.090
umpy+0.087
uer+0.086
Neg logit contributions
 the-0.124
 it-0.119
 them-0.115
 your-0.111
 her-0.110
Token He
Token ID679
Feature activation+2.44e+00

Pos logit contributions
ouse+0.034
anda+0.033
oc+0.032
uer+0.032
umpy+0.031
Neg logit contributions
 the-0.024
 it-0.022
 them-0.021
 her-0.019
 your-0.019
Token always
Token ID1464
Feature activation+2.43e-01

Pos logit contributions
ouse+0.072
anda+0.071
uer+0.070
oc+0.069
ari+0.068
Neg logit contributions
 the-0.063
 it-0.060
 her-0.054
 them-0.053
 an-0.053
Token wanted
Token ID2227
Feature activation-4.86e+00

Pos logit contributions
ouse+0.006
anda+0.006
oc+0.006
umpy+0.006
uer+0.006
Neg logit contributions
 the-0.007
 it-0.007
 them-0.006
 your-0.006
 her-0.006
Token to
Token ID284
Feature activation-3.79e+00

Pos logit contributions
 ourselves+0.085
 yourselves+0.083
wered+0.081
 it+0.080
 them+0.079
Neg logit contributions
anda-0.126
ouse-0.124
ables-0.124
ney-0.124
umpy-0.122
Token do
Token ID466
Feature activation-3.88e+00

Pos logit contributions
 them+0.070
 it+0.069
 the+0.064
 what+0.064
 anything+0.060
Neg logit contributions
ouse-0.116
oc-0.111
ney-0.109
umpy-0.106
anda-0.105
Token fun
Token ID1257
Feature activation-5.44e-01

Pos logit contributions
 the+0.044
 them+0.044
 it+0.041
 yours+0.037
wered+0.037
Neg logit contributions
ouse-0.144
oc-0.143
anda-0.141
ables-0.139
umpy-0.134
Token things
Token ID1243
Feature activation+1.73e+00

Pos logit contributions
 the+0.011
 it+0.011
 them+0.010
 your+0.010
 her+0.009
Neg logit contributions
ouse-0.018
oc-0.017
anda-0.017
umpy-0.017
ney-0.016
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
ouse+0.051
anda+0.050
oc+0.049
umpy+0.049
ney+0.047
Neg logit contributions
 the-0.036
 it-0.035
 them-0.034
 your-0.032
wered-0.032
Token even
Token ID772
Feature activation-3.78e+00

Pos logit contributions
 the+0.035
 it+0.034
 them+0.034
 your+0.032
 her+0.031
Neg logit contributions
ouse-0.038
anda-0.037
oc-0.037
umpy-0.037
ney-0.035
Token when
Token ID618
Feature activation-2.99e+00

Pos logit contributions
 it+0.067
 them+0.065
 the+0.063
 this+0.057
 your+0.056
Neg logit contributions
oc-0.118
ouse-0.114
umpy-0.113
anda-0.112
ney-0.107
Token get
Token ID651
Feature activation-3.35e+00

Pos logit contributions
anda+0.011
ouse+0.011
oc+0.011
ari+0.011
ney+0.010
Neg logit contributions
 it-0.006
 your-0.006
 the-0.006
 such-0.006
 anything-0.006
Token up
Token ID510
Feature activation-2.13e+00

Pos logit contributions
 it+0.060
 the+0.059
 your+0.058
 anything+0.056
 this+0.056
Neg logit contributions
anda-0.104
ouse-0.103
ney-0.098
oc-0.097
umpy-0.094
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
 it+0.046
 the+0.045
 them+0.044
 your+0.044
 anything+0.043
Neg logit contributions
ouse-0.061
anda-0.060
oc-0.059
umpy-0.058
ney-0.058
Token run
Token ID1057
Feature activation-9.61e-01

Pos logit contributions
 them+0.019
 yours+0.019
 your+0.019
 such+0.019
 it+0.018
Neg logit contributions
ouse-0.035
oc-0.034
anda-0.034
ney-0.032
uer-0.032
Token to
Token ID284
Feature activation-4.43e+00

Pos logit contributions
 the+0.015
 it+0.015
 your+0.014
 them+0.014
 anything+0.013
Neg logit contributions
ouse-0.035
anda-0.034
oc-0.034
umpy-0.033
ney-0.033
Token find
Token ID1064
Feature activation-5.00e+00

Pos logit contributions
 yours+0.047
 yourself+0.046
 yourselves+0.044
pped+0.044
 anything+0.038
Neg logit contributions
oc-0.139
 driver-0.130
anda-0.129
jo-0.129
s-0.129
Token mom
Token ID1995
Feature activation+1.50e+00

Pos logit contributions
 yourselves+0.032
 yourself+0.030
wered+0.029
 it+0.029
 happen+0.027
Neg logit contributions
oc-0.192
anda-0.189
ouse-0.177
affe-0.177
ney-0.177
Token.
Token ID13
Feature activation+5.57e-01

Pos logit contributions
ouse+0.050
anda+0.049
oc+0.049
umpy+0.048
uer+0.047
Neg logit contributions
 the-0.031
 it-0.030
 them-0.028
 your-0.026
 her-0.026
Token They
Token ID1119
Feature activation+5.65e-01

Pos logit contributions
ouse+0.015
anda+0.015
oc+0.015
umpy+0.015
ney+0.014
Neg logit contributions
 the-0.014
 it-0.014
 them-0.013
 your-0.012
 anything-0.012
Token show
Token ID905
Feature activation-4.28e+00

Pos logit contributions
anda+0.019
oc+0.019
ouse+0.019
ari+0.018
ney+0.017
Neg logit contributions
 it-0.010
 your-0.009
 the-0.009
 them-0.009
 anything-0.009
Token her
Token ID607
Feature activation-6.23e-01

Pos logit contributions
 yourselves+0.029
wered+0.028
 it+0.027
 yourself+0.027
 anything+0.026
Neg logit contributions
anda-0.163
jo-0.153
oc-0.150
ouse-0.150
affe-0.149
Token night
Token ID1755
Feature activation+4.21e-01

Pos logit contributions
anda+0.047
ouse+0.047
uer+0.046
oc+0.046
umpy+0.044
Neg logit contributions
 the-0.042
 it-0.039
 her-0.036
 your-0.035
 their-0.035
Token after
Token ID706
Feature activation-3.67e+00

Pos logit contributions
ouse+0.011
oc+0.011
anda+0.011
umpy+0.011
ney+0.010
Neg logit contributions
 the-0.010
 it-0.010
 them-0.009
 your-0.009
 anything-0.009
Token she
Token ID673
Feature activation+2.01e+00

Pos logit contributions
 it+0.048
 them+0.048
 the+0.048
 respect+0.044
 your+0.043
Neg logit contributions
ouse-0.124
anda-0.122
jo-0.120
oc-0.120
umpy-0.118
Token had
Token ID550
Feature activation-3.52e+00

Pos logit contributions
ouse+0.052
anda+0.051
oc+0.050
umpy+0.049
uer+0.048
Neg logit contributions
 the-0.057
 it-0.055
 them-0.052
 your-0.050
 her-0.050
Token gone
Token ID3750
Feature activation-2.63e+00

Pos logit contributions
wered+0.040
 respect+0.040
 yours+0.038
 them+0.038
 what+0.037
Neg logit contributions
ouse-0.117
anda-0.111
ables-0.110
umpy-0.109
ney-0.107
Token to
Token ID284
Feature activation-4.71e+00

Pos logit contributions
 it+0.051
 them+0.050
 the+0.049
wered+0.046
 anything+0.045
Neg logit contributions
ouse-0.079
anda-0.077
umpy-0.072
ney-0.071
uer-0.070
Token bed
Token ID3996
Feature activation+1.21e+00

Pos logit contributions
 them+0.082
 yours+0.076
 it+0.072
 respect+0.072
 these+0.070
Neg logit contributions
ouse-0.118
umpy-0.104
oc-0.103
zag-0.101
ney-0.101
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
ouse+0.038
anda+0.037
oc+0.037
umpy+0.037
ney+0.035
Neg logit contributions
 the-0.026
 it-0.025
 them-0.024
 your-0.023
 this-0.022
Token she
Token ID673
Feature activation+2.62e+00

Pos logit contributions
 them+0.022
 respect+0.022
 yours+0.022
 the+0.021
 your+0.021
Neg logit contributions
ouse-0.037
anda-0.033
jo-0.032
oc-0.032
ney-0.032
Token heard
Token ID2982
Feature activation-3.65e+00

Pos logit contributions
anda+0.071
ouse+0.071
oc+0.070
umpy+0.068
uer+0.067
Neg logit contributions
 the-0.072
 it-0.069
 them-0.063
 her-0.063
 your-0.062
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
 the+0.027
 it+0.027
 them+0.026
 your+0.026
 yours+0.025
Neg logit contributions
ouse-0.145
anda-0.142
jo-0.141
umpy-0.140
oc-0.140
Token was
Token ID373
Feature activation-2.80e+00

Pos logit contributions
ouse+0.059
anda+0.058
oc+0.057
umpy+0.057
uer+0.055
Neg logit contributions
 the-0.054
 it-0.051
 them-0.047
 her-0.046
 this-0.045
Token so
Token ID523
Feature activation-1.78e+00

Pos logit contributions
 it+0.045
 them+0.044
 the+0.043
 anything+0.041
 your+0.040
Neg logit contributions
ouse-0.087
anda-0.087
umpy-0.086
oc-0.083
ney-0.082
Token good
Token ID922
Feature activation+5.27e-01

Pos logit contributions
 the+0.043
 it+0.041
 them+0.039
 your+0.037
 her+0.036
Neg logit contributions
ouse-0.053
anda-0.052
oc-0.051
umpy-0.050
ney-0.049
Token that
Token ID326
Feature activation-2.36e+00

Pos logit contributions
ouse+0.018
anda+0.017
oc+0.017
umpy+0.017
ney+0.016
Neg logit contributions
 the-0.010
 it-0.010
 them-0.009
 your-0.009
 anything-0.009
Token it
Token ID340
Feature activation+1.92e+00

Pos logit contributions
 them+0.034
 the+0.031
 it+0.029
 your+0.029
 yourselves+0.028
Neg logit contributions
ouse-0.080
oc-0.078
anda-0.077
umpy-0.075
ney-0.075
Token made
Token ID925
Feature activation-4.58e+00

Pos logit contributions
ouse+0.053
anda+0.053
oc+0.053
umpy+0.051
uer+0.050
Neg logit contributions
 the-0.051
 it-0.049
 them-0.046
 her-0.044
 your-0.044
Token her
Token ID607
Feature activation-1.46e-01

Pos logit contributions
 the+0.022
 it+0.020
 your+0.017
 them+0.015
 this+0.015
Neg logit contributions
ouse-0.193
anda-0.192
oc-0.188
umpy-0.187
jo-0.184
Token start
Token ID923
Feature activation-1.19e+00

Pos logit contributions
 the+0.003
 it+0.003
 them+0.003
 your+0.003
 her+0.003
Neg logit contributions
ouse-0.005
anda-0.005
oc-0.005
umpy-0.004
uer-0.004
Token applaud
Token ID22517
Feature activation+2.36e-01

Pos logit contributions
 the+0.021
 it+0.021
 them+0.019
 your+0.018
 this+0.017
Neg logit contributions
ouse-0.042
anda-0.041
oc-0.040
umpy-0.040
ney-0.039
Tokening
Token ID278
Feature activation-1.21e+00

Pos logit contributions
ouse+0.008
anda+0.008
oc+0.008
umpy+0.007
ney+0.007
Neg logit contributions
 the-0.005
 it-0.005
 them-0.004
 your-0.004
 her-0.004
Token again
Token ID757
Feature activation+3.11e-01

Pos logit contributions
 the+0.022
 it+0.021
 them+0.019
 your+0.018
 her+0.018
Neg logit contributions
ouse-0.043
anda-0.043
oc-0.042
umpy-0.042
ney-0.041
Token.
Token ID13
Feature activation+6.61e-02

Pos logit contributions
anda+0.144
ouse+0.144
oc+0.144
umpy+0.141
uer+0.135
Neg logit contributions
 the-0.082
 it-0.079
 them-0.070
 her-0.068
 this-0.067
Token It
Token ID632
Feature activation+1.68e+00

Pos logit contributions
ouse+0.002
anda+0.002
oc+0.002
umpy+0.002
ney+0.002
Neg logit contributions
 the-0.002
 it-0.002
 them-0.002
 your-0.001
 her-0.001
Token is
Token ID318
Feature activation-2.44e+00

Pos logit contributions
ouse+0.056
umpy+0.054
uer+0.052
anda+0.052
oc+0.051
Neg logit contributions
 the-0.042
 it-0.040
 them-0.035
 her-0.035
 this-0.035
Token a
Token ID257
Feature activation+2.84e+00

Pos logit contributions
 the+0.049
 it+0.048
 them+0.046
 your+0.043
 this+0.042
Neg logit contributions
ouse-0.078
anda-0.077
umpy-0.075
oc-0.075
ney-0.073
Token story
Token ID1621
Feature activation+2.89e+00

Pos logit contributions
anda+0.081
umpy+0.080
ouse+0.079
int+0.079
oc+0.078
Neg logit contributions
 the-0.073
 it-0.072
 your-0.064
 this-0.063
 her-0.062
Token about
Token ID546
Feature activation-4.06e+00

Pos logit contributions
ouse+0.094
anda+0.092
oc+0.091
umpy+0.090
ney+0.088
Neg logit contributions
 the-0.060
 it-0.057
 them-0.055
 your-0.052
 anything-0.050
Token a
Token ID257
Feature activation+2.69e+00

Pos logit contributions
 it+0.034
 the+0.033
 them+0.032
 your+0.031
 help+0.027
Neg logit contributions
oc-0.158
anda-0.158
ouse-0.156
ney-0.153
umpy-0.150
Token princess
Token ID21752
Feature activation+5.45e+00

Pos logit contributions
anda+0.077
ouse+0.076
int+0.076
umpy+0.076
ney+0.073
Neg logit contributions
 the-0.072
 it-0.071
 your-0.063
 this-0.061
 them-0.061
Token and
Token ID290
Feature activation-2.30e+00

Pos logit contributions
ouse+0.189
anda+0.187
umpy+0.187
oc+0.180
ney+0.178
Neg logit contributions
 the-0.108
 it-0.104
 them-0.092
 this-0.088
 her-0.087
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
 the+0.014
 it+0.013
 them+0.012
 your+0.009
 anything+0.007
Neg logit contributions
ouse-0.104
anda-0.103
oc-0.103
umpy-0.100
ney-0.099
Token dragon
Token ID10441
Feature activation+5.38e+00

Pos logit contributions
int+0.055
umpy+0.055
anda+0.054
ouse+0.054
ney+0.053
Neg logit contributions
 the-0.053
 it-0.051
 your-0.046
 them-0.045
 her-0.045
Token for
Token ID329
Feature activation-4.33e+00

Pos logit contributions
 yourselves+0.040
 the+0.039
 anything+0.039
 them+0.039
 it+0.038
Neg logit contributions
ouse-0.069
anda-0.063
umpy-0.062
jo-0.061
ables-0.061
Token school
Token ID1524
Feature activation+1.41e+00

Pos logit contributions
wered+0.050
 yours+0.049
 yourselves+0.044
 respect+0.043
 anything+0.042
Neg logit contributions
jo-0.130
ouse-0.130
oc-0.128
s-0.126
anda-0.126
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
ouse+0.049
anda+0.049
oc+0.048
umpy+0.047
ney+0.046
Neg logit contributions
 the-0.027
 it-0.026
 them-0.024
 her-0.022
 your-0.022
Token They
Token ID1119
Feature activation+1.01e+00

Pos logit contributions
anda+0.015
ouse+0.015
oc+0.015
uer+0.015
ney+0.015
Neg logit contributions
 the-0.014
 it-0.013
 them-0.012
 her-0.012
 their-0.012
Token both
Token ID1111
Feature activation+1.04e+00

Pos logit contributions
ouse+0.028
anda+0.027
oc+0.027
umpy+0.026
ney+0.025
Neg logit contributions
 the-0.027
 it-0.026
 them-0.025
 your-0.024
 her-0.024
Token needed
Token ID2622
Feature activation-5.10e+00

Pos logit contributions
ouse+0.029
oc+0.028
anda+0.028
umpy+0.027
uer+0.027
Neg logit contributions
 the-0.029
 it-0.027
 them-0.025
 her-0.025
 your-0.024
Token a
Token ID257
Feature activation+1.34e+00

Pos logit contributions
wered+0.080
 yourselves+0.070
 happen+0.064
 ourselves+0.063
 your+0.062
Neg logit contributions
ouse-0.141
ari-0.134
anda-0.131
s-0.131
ables-0.130
Token uniform
Token ID8187
Feature activation+3.52e+00

Pos logit contributions
ouse+0.040
oc+0.040
anda+0.040
umpy+0.039
ney+0.038
Neg logit contributions
 the-0.032
 it-0.031
 them-0.029
 your-0.028
 this-0.027
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
oc+0.140
anda+0.139
ouse+0.139
umpy+0.137
ney+0.132
Neg logit contributions
 the-0.050
 it-0.047
 them-0.042
 your-0.038
 her-0.038
Token Molly
Token ID30236
Feature activation+5.48e+00

Pos logit contributions
ouse+0.033
anda+0.033
oc+0.033
umpy+0.032
ney+0.031
Neg logit contributions
 the-0.025
 it-0.024
 them-0.023
 your-0.021
 her-0.021
Token's
Token ID338
Feature activation+1.29e+00

Pos logit contributions
umpy+0.160
ouse+0.158
oc+0.157
anda+0.155
uer+0.149
Neg logit contributions
 the-0.144
 it-0.139
 them-0.123
 her-0.123
 this-0.118