TOP ACTIVATIONS
MAX = 4.919

she 'd never seen a l oll ipop before .
she never stopped dreaming of l oll ip ops . <|endoftext|>
ados . They were never jealous of each other again and
. Lily forgot about her l oll ipop and had so
He 'd never had a l oll ipop before ! But
And Freddy was never helpless again . He flew high
games . They were never lonely again . The
. That 's not a l oll ipop . That 's
every day and was never hungry again . <|endoftext|> Once there
just in case another great l oll ipop would rise in
managed to number all her l oll ip ops . She
a little bit of the l oll ipop . That way
a very special toy for L ott ie . It was
no , where did Mr . Beetle go ?" she said
The old man was never hungry again , and he forever
Jane had never eaten a tomato before , but when she
happy that she had her l oll ipop and it tasted
ground . At last the l oll ipop stopped rising and
if she could have the l oll ipop . The shop
the little rabbit was never sad again . He was always

MIN ACTIVATIONS
MIN = -4.332

" Hello , cloud !" Ellie said .
" Let 's help !" said Max , who was
Hi , I 'm Mouse ," said the mouse . "
. She got on a different train with her ticket .
. He put on his favourite party hat and hopped along
." He put on a different song and Lily started to
you to climb the wall !" Jimmy said .
the princess of the castle ," said the voice . "
the switches and pressed the big red button . The radio
. It was a big , shiny radio and Katie was
Let 's go , Emmy !" said her mom . "
wide and they gave a loud roar when they started the
" It 's an earthquake !" said her friend , the
. Suddenly they heard a loud noise . It was a
" Hi , bird !" Max said . " Do
" Let 's go inside ," Max said .
a swim , my darling ," her mom my said .
's see what 's inside !" said Bun 's friend ,
The baby had on a beautiful dress , with a bon
. He was wearing his favourite yellow shirt and smiling .

INTERVAL 2.607 - 4.919
CONTAINS 0.082%

. He added cheese , ham , and lettuce to his
She could finally cross the sm elly swamp . After that
time , there was an oct opus named Tim my .
nose and ran as fast as she could . She made
very still , watching the bar ber in the mirror .

INTERVAL 0.294 - 2.607
CONTAINS 34.818%

play baseball . Tim my 's friend , Billy , hit
She thinks Sam is silly . She says , " No
But Sarah liked the furry costume and really wanted to keep
a hug and a rib bet , and then off they
The little girl loved her book , so she

INTERVAL -2.019 - 0.294
CONTAINS 64.576%

the big fish away . From that day on
oof , wo of !" the dolls say .
very sad . So , they sat down to discuss their
look ! An aer opl ane !" Her mum
showed it to their parents when they came outside to see

INTERVAL -4.332 - -2.019
CONTAINS 0.523%

black dog named Max . Max loved to play outside in
her , � � € � � What is wrong ,
so scared , so they held on tight .
said , " No , that 's mine . Give it
" Yes , please . That sounds fun . Thank you
Token she
Token ID673
Feature activation+1.35e+00

Pos logit contributions
ubb+0.081
 servant+0.078
aph+0.074
Col+0.074
ascal+0.073
Neg logit contributions
 automobile-0.041
 tracks-0.041
 package-0.041
 radio-0.040
 track-0.039
Token'd
Token ID1549
Feature activation+1.05e+00

Pos logit contributions
ubb+0.090
aph+0.085
 queens+0.084
 servant+0.084
ill+0.083
Neg logit contributions
 package-0.028
 tracks-0.027
 belongs-0.027
 radio-0.026
cycles-0.025
Token never
Token ID1239
Feature activation+2.02e+00

Pos logit contributions
ubb+0.058
 queens+0.054
 servant+0.054
aph+0.053
oll+0.053
Neg logit contributions
 tracks-0.034
 package-0.032
 radio-0.032
 track-0.031
 stret-0.031
Token seen
Token ID1775
Feature activation+2.37e+00

Pos logit contributions
ubb+0.120
acy+0.111
umber+0.109
 servant+0.109
oll+0.109
Neg logit contributions
 tracks-0.051
 receiver-0.050
 belongs-0.050
's-0.048
 package-0.047
Token a
Token ID257
Feature activation+2.02e+00

Pos logit contributions
ubb+0.143
 servant+0.134
 queens+0.133
ill+0.133
umber+0.132
Neg logit contributions
 tracks-0.061
 package-0.059
 radio-0.055
 automobile-0.053
 track-0.052
Token l
Token ID300
Feature activation+4.92e+00

Pos logit contributions
ubb+0.116
ill+0.114
 queens+0.107
acy+0.106
aph+0.105
Neg logit contributions
 package-0.065
 radio-0.058
 tracks-0.056
 track-0.056
 automobile-0.055
Tokenoll
Token ID692
Feature activation+2.75e+00

Pos logit contributions
 queens+0.132
ubb+0.122
ory+0.110
ill+0.102
Fre+0.099
Neg logit contributions
nob-0.221
 package-0.219
shirt-0.215
c-0.214
chair-0.212
Tokenipop
Token ID42800
Feature activation+3.72e+00

Pos logit contributions
ubb+0.120
 queens+0.120
ill+0.116
aph+0.115
Col+0.112
Neg logit contributions
 package-0.114
 belongs-0.102
 stret-0.096
cycles-0.095
 automobile-0.095
Token before
Token ID878
Feature activation+6.78e-01

Pos logit contributions
ubb+0.210
unn+0.194
Col+0.189
ascal+0.188
add+0.188
Neg logit contributions
 package-0.108
 track-0.103
 away-0.102
's-0.099
 radio-0.096
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
ubb+0.041
 servant+0.041
 queens+0.040
oll+0.039
umbo+0.037
Neg logit contributions
 tracks-0.018
 radio-0.018
 package-0.017
 track-0.017
 automobile-0.017
Token
Token ID198
Feature activation+2.87e-01

Pos logit contributions
ubb+0.034
 queens+0.033
 servant+0.032
ier+0.031
ill+0.030
Neg logit contributions
 tracks-0.021
 radio-0.020
 package-0.020
 automobile-0.019
 track-0.019
Token she
Token ID673
Feature activation+2.01e+00

Pos logit contributions
ubb+0.046
 servant+0.046
oll+0.045
 queens+0.044
ill+0.043
Neg logit contributions
 tracks-0.037
 track-0.036
 automobile-0.036
 radio-0.035
 package-0.034
Token never
Token ID1239
Feature activation+1.70e+00

Pos logit contributions
ubb+0.123
 queens+0.121
aph+0.116
 servant+0.116
oll+0.112
Neg logit contributions
 tracks-0.050
 track-0.048
 package-0.047
 automobile-0.047
 belongs-0.047
Token stopped
Token ID5025
Feature activation+1.95e+00

Pos logit contributions
ubb+0.093
oll+0.091
 servant+0.089
acy+0.087
 queens+0.087
Neg logit contributions
 tracks-0.048
 belongs-0.046
 receiver-0.046
 automobile-0.046
 package-0.045
Token dreaming
Token ID28368
Feature activation+1.47e+00

Pos logit contributions
oll+0.101
ubb+0.099
 queens+0.098
 servant+0.096
acy+0.096
Neg logit contributions
 tracks-0.065
 automobile-0.064
 track-0.063
 radio-0.062
 receiver-0.059
Token of
Token ID286
Feature activation+1.41e+00

Pos logit contributions
ubb+0.073
 servant+0.071
 queens+0.070
Col+0.067
oll+0.067
Neg logit contributions
 tracks-0.052
 stret-0.051
 radio-0.051
 automobile-0.050
 package-0.050
Token l
Token ID300
Feature activation+4.57e+00

Pos logit contributions
ubb+0.075
ill+0.075
 queens+0.074
 servant+0.073
oll+0.071
Neg logit contributions
 tracks-0.046
 track-0.044
 radio-0.044
 automobile-0.044
 package-0.043
Tokenoll
Token ID692
Feature activation+2.30e+00

Pos logit contributions
 queens+0.135
 diver+0.122
ill+0.119
aph+0.116
ubb+0.111
Neg logit contributions
g-0.177
c-0.177
 around-0.173
 belongs-0.171
shirt-0.170
Tokenip
Token ID541
Feature activation+2.06e+00

Pos logit contributions
ill+0.093
ubb+0.093
 queens+0.091
aph+0.089
Col+0.084
Neg logit contributions
 package-0.099
 belongs-0.097
cycles-0.092
 stret-0.089
ci-0.088
Tokenops
Token ID2840
Feature activation+3.08e+00

Pos logit contributions
ubb+0.093
 queens+0.093
ill+0.092
 servant+0.088
aph+0.087
Neg logit contributions
 package-0.078
 stret-0.077
 belongs-0.077
cycles-0.076
 tracks-0.075
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
ubb+0.157
 servant+0.145
Col+0.142
aph+0.138
 queens+0.137
Neg logit contributions
 track-0.098
 package-0.097
 automobile-0.094
's-0.093
 belongs-0.093
Token<|endoftext|>
Token ID50256
Feature activation+4.83e-01

Pos logit contributions
ubb+0.038
 servant+0.038
 queens+0.037
oll+0.036
ill+0.035
Neg logit contributions
 tracks-0.021
 automobile-0.021
 radio-0.020
 track-0.020
 package-0.020
Tokenados
Token ID22484
Feature activation+9.58e-01

Pos logit contributions
ubb+0.081
 servant+0.079
acy+0.078
ill+0.078
 diver+0.078
Neg logit contributions
 automobile-0.057
 package-0.052
cycles-0.051
 receiver-0.051
obiles-0.051
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
ubb+0.058
 servant+0.055
 queens+0.054
oll+0.053
acy+0.053
Neg logit contributions
 tracks-0.026
 automobile-0.025
 radio-0.025
 track-0.025
 package-0.024
Token They
Token ID1119
Feature activation+1.22e+00

Pos logit contributions
 stret+0.017
 package+0.016
 radio+0.016
 automobile+0.016
 tracks+0.016
Neg logit contributions
ubb-0.028
 servant-0.028
 queens-0.028
oll-0.027
ill-0.027
Token were
Token ID547
Feature activation+3.78e-01

Pos logit contributions
ubb+0.070
 queens+0.066
 servant+0.065
oll+0.064
ill+0.062
Neg logit contributions
 automobile-0.036
 tracks-0.036
 radio-0.035
 track-0.034
 package-0.034
Token never
Token ID1239
Feature activation+1.95e+00

Pos logit contributions
ubb+0.020
 servant+0.019
 queens+0.019
oll+0.019
ill+0.018
Neg logit contributions
 tracks-0.013
 automobile-0.013
 track-0.012
 radio-0.012
 package-0.012
Token jealous
Token ID19354
Feature activation+4.56e+00

Pos logit contributions
ubb+0.095
umber+0.091
acy+0.091
oll+0.091
 servant+0.089
Neg logit contributions
 automobile-0.063
 tracks-0.062
 track-0.061
 receiver-0.061
 stret-0.058
Token of
Token ID286
Feature activation+4.30e-01

Pos logit contributions
agg+0.270
acy+0.268
Fre+0.265
ona+0.264
ubb+0.262
Neg logit contributions
 around-0.089
's-0.088
 language-0.082
 away-0.079
 over-0.077
Token each
Token ID1123
Feature activation+1.62e+00

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.013
 automobile-0.013
 radio-0.013
 track-0.013
 package-0.013
Token other
Token ID584
Feature activation+1.45e+00

Pos logit contributions
ubb+0.092
oll+0.086
 queens+0.084
acy+0.084
ectar+0.082
Neg logit contributions
 automobile-0.050
 package-0.047
 receiver-0.045
 track-0.044
 radio-0.043
Token again
Token ID757
Feature activation+4.10e-01

Pos logit contributions
ubb+0.071
 queens+0.064
 servant+0.064
oll+0.063
aph+0.063
Neg logit contributions
 radio-0.055
 automobile-0.054
 tracks-0.054
 track-0.053
 package-0.052
Token and
Token ID290
Feature activation+4.83e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.012
 radio-0.011
 automobile-0.011
 track-0.011
 package-0.011
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 radio+0.032
 receiver+0.031
 Amen+0.031
cycles+0.031
 package+0.031
Neg logit contributions
 queens-0.038
 servant-0.038
ubb-0.038
ill-0.038
oll-0.037
Token Lily
Token ID20037
Feature activation+7.76e-02

Pos logit contributions
 radio+0.043
 package+0.042
 stret+0.042
cycles+0.041
 controller+0.041
Neg logit contributions
oll-0.044
 servant-0.044
ubb-0.044
 queens-0.044
ill-0.044
Token forgot
Token ID16453
Feature activation-3.80e-01

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.002
 radio-0.002
 automobile-0.002
 stret-0.002
 track-0.002
Token about
Token ID546
Feature activation+4.33e-01

Pos logit contributions
 tracks+0.015
 automobile+0.015
 stret+0.015
 package+0.015
 radio+0.015
Neg logit contributions
 servant-0.017
ubb-0.017
 queens-0.017
oll-0.016
ill-0.016
Token her
Token ID607
Feature activation+1.03e+00

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.012
 radio-0.012
 track-0.012
 automobile-0.012
 package-0.011
Token l
Token ID300
Feature activation+4.30e+00

Pos logit contributions
ubb+0.054
 queens+0.053
ill+0.052
acy+0.050
 servant+0.050
Neg logit contributions
 package-0.037
 radio-0.036
 tracks-0.036
 automobile-0.036
 track-0.034
Tokenoll
Token ID692
Feature activation+1.97e+00

Pos logit contributions
 queens+0.148
ubb+0.127
aph+0.124
ill+0.123
 Mon+0.123
Neg logit contributions
 belongs-0.172
c-0.164
shirt-0.162
 package-0.161
ag-0.152
Tokenipop
Token ID42800
Feature activation+1.34e+00

Pos logit contributions
ubb+0.080
 queens+0.078
ill+0.077
aph+0.076
Col+0.071
Neg logit contributions
 package-0.090
 belongs-0.087
cycles-0.082
 automobile-0.081
 radio-0.079
Token and
Token ID290
Feature activation-1.40e-01

Pos logit contributions
ubb+0.083
 queens+0.077
 servant+0.077
Col+0.075
ill+0.073
Neg logit contributions
 track-0.037
 tracks-0.037
 package-0.036
 radio-0.036
 automobile-0.036
Token had
Token ID550
Feature activation+2.81e-01

Pos logit contributions
 tracks+0.005
 stret+0.005
 automobile+0.005
 radio+0.005
 package+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token so
Token ID523
Feature activation+1.24e+00

Pos logit contributions
ubb+0.016
 servant+0.015
 queens+0.015
oll+0.015
ill+0.014
Neg logit contributions
 tracks-0.009
 radio-0.009
 automobile-0.009
 track-0.009
 package-0.008
Token He
Token ID679
Feature activation+5.89e-01

Pos logit contributions
ubb+0.029
 servant+0.027
 queens+0.027
oll+0.027
ill+0.025
Neg logit contributions
 tracks-0.018
 automobile-0.018
 track-0.018
 radio-0.017
 package-0.017
Token'd
Token ID1549
Feature activation+5.21e-01

Pos logit contributions
ubb+0.033
 queens+0.032
 servant+0.032
oll+0.031
ill+0.031
Neg logit contributions
 tracks-0.018
 radio-0.017
 automobile-0.017
 track-0.017
 stret-0.017
Token never
Token ID1239
Feature activation+9.44e-01

Pos logit contributions
ubb+0.027
 queens+0.026
 servant+0.026
oll+0.025
ill+0.024
Neg logit contributions
 tracks-0.019
 radio-0.018
 automobile-0.018
 stret-0.018
 track-0.018
Token had
Token ID550
Feature activation+1.53e+00

Pos logit contributions
ubb+0.046
 servant+0.043
oll+0.043
 queens+0.043
acy+0.041
Neg logit contributions
 tracks-0.035
 automobile-0.034
 receiver-0.034
 package-0.033
 track-0.033
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
ubb+0.088
ill+0.082
 servant+0.082
aph+0.081
 queens+0.080
Neg logit contributions
 tracks-0.044
 package-0.042
 automobile-0.041
 radio-0.040
 track-0.039
Token l
Token ID300
Feature activation+4.28e+00

Pos logit contributions
ubb+0.073
 queens+0.070
ill+0.068
acy+0.065
aph+0.065
Neg logit contributions
 package-0.050
 stret-0.047
 radio-0.047
 tracks-0.047
 automobile-0.046
Tokenoll
Token ID692
Feature activation+2.25e+00

Pos logit contributions
 queens+0.133
ubb+0.109
aph+0.105
 kingdoms+0.100
Fre+0.100
Neg logit contributions
shirt-0.193
 package-0.182
 away-0.174
c-0.173
g-0.172
Tokenipop
Token ID42800
Feature activation+3.09e+00

Pos logit contributions
ubb+0.096
 queens+0.094
aph+0.090
ill+0.089
Col+0.086
Neg logit contributions
 package-0.096
 belongs-0.087
cycles-0.087
 stret-0.086
 automobile-0.084
Token before
Token ID878
Feature activation+4.03e-01

Pos logit contributions
ubb+0.178
 queens+0.158
acy+0.158
Col+0.156
aph+0.156
Neg logit contributions
 package-0.091
 away-0.087
 track-0.087
 automobile-0.084
 stret-0.082
Token!
Token ID0
Feature activation+3.34e-02

Pos logit contributions
ubb+0.023
 servant+0.023
 queens+0.022
oll+0.022
ill+0.021
Neg logit contributions
 tracks-0.012
 radio-0.012
 track-0.012
 automobile-0.011
 package-0.011
Token But
Token ID887
Feature activation-1.43e-01

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 radio-0.001
 automobile-0.001
 stret-0.001
 package-0.001
Token
Token ID198
Feature activation+8.95e-01

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.015
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.005
 radio-0.005
 automobile-0.005
 track-0.005
 stret-0.005
TokenAnd
Token ID1870
Feature activation+8.41e-01

Pos logit contributions
 servant+0.049
ubb+0.048
 queens+0.046
oll+0.045
ill+0.044
Neg logit contributions
 tracks-0.027
 track-0.027
cycles-0.027
 automobile-0.026
obiles-0.026
Token Freddy
Token ID45437
Feature activation+9.04e-01

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.045
oll+0.045
ill+0.044
Neg logit contributions
 tracks-0.025
 track-0.023
 package-0.023
 automobile-0.023
 radio-0.022
Token was
Token ID373
Feature activation-2.19e-02

Pos logit contributions
ubb+0.053
 servant+0.052
 queens+0.052
oll+0.050
ill+0.049
Neg logit contributions
 tracks-0.025
 track-0.023
 radio-0.023
 package-0.023
 automobile-0.023
Token never
Token ID1239
Feature activation+1.11e+00

Pos logit contributions
 tracks+0.001
 radio+0.001
 automobile+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token helpless
Token ID21144
Feature activation+4.28e+00

Pos logit contributions
ubb+0.056
oll+0.054
 servant+0.054
acy+0.052
umber+0.051
Neg logit contributions
 tracks-0.038
 track-0.037
 automobile-0.036
 stret-0.036
 radio-0.036
Token again
Token ID757
Feature activation+1.52e+00

Pos logit contributions
umber+0.276
Col+0.266
uka+0.260
olo+0.260
Dan+0.259
Neg logit contributions
 away-0.075
 around-0.073
's-0.068
 noise-0.061
 language-0.058
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
ubb+0.099
 servant+0.099
oll+0.097
 queens+0.095
umbo+0.095
Neg logit contributions
 tracks-0.030
 radio-0.028
 track-0.028
 automobile-0.025
 package-0.025
Token He
Token ID679
Feature activation+3.68e-01

Pos logit contributions
ubb+0.119
 servant+0.115
 queens+0.114
ier+0.110
acy+0.108
Neg logit contributions
 -0.037
 tracks-0.037
cycles-0.036
 automobile-0.034
 belongs-0.033
Token flew
Token ID13112
Feature activation+7.04e-01

Pos logit contributions
ubb+0.019
 servant+0.019
 queens+0.019
oll+0.018
ill+0.018
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 track-0.012
 package-0.012
Token high
Token ID1029
Feature activation+1.70e+00

Pos logit contributions
ubb+0.044
 servant+0.043
oll+0.043
 queens+0.043
acy+0.041
Neg logit contributions
 tracks-0.017
 track-0.016
 radio-0.016
 automobile-0.016
 stret-0.015
Token games
Token ID1830
Feature activation+6.88e-01

Pos logit contributions
 package+0.049
 stret+0.049
 controller+0.048
 Amen+0.048
 radio+0.047
Neg logit contributions
 queens-0.059
ubb-0.055
 servant-0.055
oll-0.052
umbo-0.052
Token.
Token ID13
Feature activation+6.62e-01

Pos logit contributions
ubb+0.043
 servant+0.041
 queens+0.040
oll+0.040
acy+0.038
Neg logit contributions
 tracks-0.018
 track-0.017
 radio-0.017
 automobile-0.017
 stret-0.016
Token They
Token ID1119
Feature activation+4.25e-01

Pos logit contributions
ubb+0.042
 servant+0.041
 queens+0.041
oll+0.039
ier+0.039
Neg logit contributions
 tracks-0.017
 automobile-0.016
 radio-0.015
 track-0.015
 package-0.014
Token were
Token ID547
Feature activation+4.02e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.013
 track-0.012
 automobile-0.012
 radio-0.012
 package-0.012
Token never
Token ID1239
Feature activation+1.64e+00

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.014
 track-0.014
 automobile-0.014
 radio-0.014
 package-0.013
Token lonely
Token ID21757
Feature activation+4.28e+00

Pos logit contributions
ubb+0.082
oll+0.081
 servant+0.078
umber+0.076
acy+0.075
Neg logit contributions
 tracks-0.056
 track-0.056
 automobile-0.054
 receiver-0.053
 stret-0.052
Token again
Token ID757
Feature activation+1.07e+00

Pos logit contributions
ubb+0.254
umber+0.252
acy+0.247
add+0.244
Col+0.242
Neg logit contributions
 track-0.084
 language-0.083
 away-0.083
 around-0.082
's-0.081
Token.
Token ID13
Feature activation+5.32e-02

Pos logit contributions
ubb+0.066
 servant+0.065
oll+0.063
 queens+0.062
umbo+0.061
Neg logit contributions
 tracks-0.027
 track-0.026
 radio-0.026
 automobile-0.025
 package-0.024
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token
Token ID198
Feature activation+7.54e-02

Pos logit contributions
 automobile+0.029
 package+0.028
 stret+0.028
 receiver+0.028
 radio+0.028
Neg logit contributions
oll-0.054
ubb-0.054
 servant-0.054
 queens-0.053
ill-0.052
TokenThe
Token ID464
Feature activation-1.66e-02

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.003
 radio-0.003
 automobile-0.003
 stret-0.003
 track-0.003
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
ubb+0.046
 queens+0.045
 servant+0.044
ill+0.042
oll+0.042
Neg logit contributions
 package-0.023
 tracks-0.023
 automobile-0.023
 radio-0.022
 track-0.022
Token That
Token ID1320
Feature activation+3.37e-01

Pos logit contributions
ubb+0.017
 servant+0.017
 queens+0.017
oll+0.016
ill+0.016
Neg logit contributions
 tracks-0.007
 automobile-0.006
 radio-0.006
 package-0.006
 track-0.006
Token's
Token ID338
Feature activation-6.19e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.023
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 package-0.005
 track-0.005
Token not
Token ID407
Feature activation-1.16e-01

Pos logit contributions
 stret+0.025
 tracks+0.025
 radio+0.025
 automobile+0.024
 controller+0.024
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.028
oll-0.027
ill-0.025
Token a
Token ID257
Feature activation+1.68e+00

Pos logit contributions
 tracks+0.004
 radio+0.003
 automobile+0.003
 stret+0.003
 track+0.003
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token l
Token ID300
Feature activation+4.25e+00

Pos logit contributions
ubb+0.090
ill+0.089
 queens+0.086
aph+0.084
umbo+0.083
Neg logit contributions
 package-0.058
 radio-0.056
 stret-0.053
 tracks-0.053
 automobile-0.053
Tokenoll
Token ID692
Feature activation+2.39e+00

Pos logit contributions
 queens+0.124
ubb+0.116
ill+0.115
aph+0.105
Col+0.105
Neg logit contributions
 belongs-0.200
 package-0.193
shirt-0.176
 is-0.174
c-0.171
Tokenipop
Token ID42800
Feature activation+1.65e+00

Pos logit contributions
ubb+0.100
 queens+0.099
ill+0.097
aph+0.096
Col+0.091
Neg logit contributions
 package-0.107
 belongs-0.103
cycles-0.094
 automobile-0.093
 stret-0.093
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
ubb+0.109
 queens+0.102
Col+0.101
 servant+0.100
aph+0.100
Neg logit contributions
 package-0.036
 belongs-0.035
 automobile-0.034
 radio-0.034
 tracks-0.034
Token That
Token ID1320
Feature activation+3.53e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.009
 automobile-0.008
 radio-0.008
 package-0.008
 stret-0.008
Token's
Token ID338
Feature activation-6.70e-01

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 package-0.005
 track-0.005
Token every
Token ID790
Feature activation+1.81e-02

Pos logit contributions
ubb+0.106
 servant+0.100
 queens+0.100
aph+0.097
umbo+0.096
Neg logit contributions
 tracks-0.049
 track-0.048
 radio-0.047
 stret-0.047
 package-0.046
Token day
Token ID1110
Feature activation+1.43e+00

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 track-0.001
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
ubb+0.084
 servant+0.083
 queens+0.081
aph+0.078
oll+0.077
Neg logit contributions
 tracks-0.041
 track-0.039
 radio-0.037
 package-0.037
 automobile-0.036
Token was
Token ID373
Feature activation+5.83e-01

Pos logit contributions
ubb+0.061
 servant+0.060
 queens+0.059
oll+0.058
aph+0.057
Neg logit contributions
 tracks-0.035
 track-0.032
 radio-0.032
 automobile-0.032
 package-0.031
Token never
Token ID1239
Feature activation+1.57e+00

Pos logit contributions
ubb+0.028
 servant+0.028
 queens+0.028
oll+0.026
ill+0.026
Neg logit contributions
 tracks-0.022
 track-0.021
 automobile-0.021
 radio-0.021
 stret-0.021
Token hungry
Token ID14720
Feature activation+4.25e+00

Pos logit contributions
ubb+0.075
oll+0.074
 servant+0.073
umber+0.072
acy+0.071
Neg logit contributions
 tracks-0.054
 track-0.052
 receiver-0.052
 stret-0.051
 automobile-0.051
Token again
Token ID757
Feature activation+1.36e+00

Pos logit contributions
acy+0.249
umber+0.246
ubb+0.245
Col+0.243
ascal+0.237
Neg logit contributions
 away-0.092
's-0.091
 package-0.086
,"-0.081
 automobile-0.081
Token.
Token ID13
Feature activation+9.85e-01

Pos logit contributions
ubb+0.088
 servant+0.088
oll+0.085
 queens+0.084
umbo+0.084
Neg logit contributions
 tracks-0.027
 track-0.025
 radio-0.024
 package-0.024
 automobile-0.023
Token<|endoftext|>
Token ID50256
Feature activation+4.57e-01

Pos logit contributions
ubb+0.059
 servant+0.057
 queens+0.057
oll+0.054
ier+0.054
Neg logit contributions
 tracks-0.027
 automobile-0.026
 track-0.025
cycles-0.025
 radio-0.025
TokenOnce
Token ID7454
Feature activation+7.05e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.022
ill+0.021
Neg logit contributions
 tracks-0.016
 radio-0.015
 package-0.015
 automobile-0.015
 track-0.015
Token there
Token ID612
Feature activation-4.94e-01

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.021
ill+0.019
Neg logit contributions
 tracks-0.037
 automobile-0.036
 radio-0.036
 track-0.036
 package-0.036
Token just
Token ID655
Feature activation+8.88e-01

Pos logit contributions
 servant+0.081
ubb+0.081
 queens+0.077
oll+0.075
umbo+0.074
Neg logit contributions
 tracks-0.047
 track-0.045
 radio-0.044
 automobile-0.043
 package-0.043
Token in
Token ID287
Feature activation+5.90e-01

Pos logit contributions
ubb+0.043
 servant+0.043
 queens+0.042
oll+0.041
umbo+0.040
Neg logit contributions
 tracks-0.033
 radio-0.032
 track-0.031
 package-0.031
 automobile-0.031
Token case
Token ID1339
Feature activation+1.59e+00

Pos logit contributions
 servant+0.035
ubb+0.035
 queens+0.034
oll+0.034
umbo+0.032
Neg logit contributions
 tracks-0.016
 track-0.016
 radio-0.015
 package-0.015
 automobile-0.015
Token another
Token ID1194
Feature activation+8.51e-01

Pos logit contributions
ubb+0.076
 servant+0.076
 queens+0.073
oll+0.070
Col+0.069
Neg logit contributions
 tracks-0.063
 track-0.060
 radio-0.060
 automobile-0.058
 package-0.057
Token great
Token ID1049
Feature activation+1.74e+00

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.046
oll+0.044
ill+0.044
Neg logit contributions
 tracks-0.026
 package-0.026
 radio-0.025
 track-0.025
 automobile-0.025
Token l
Token ID300
Feature activation+4.25e+00

Pos logit contributions
ubb+0.108
 queens+0.104
oll+0.103
ier+0.103
ill+0.103
Neg logit contributions
 package-0.043
 tracks-0.041
 track-0.041
 radio-0.040
 automobile-0.040
Tokenoll
Token ID692
Feature activation+2.52e+00

Pos logit contributions
 queens+0.122
ubb+0.115
 diver+0.094
ory+0.092
 servant+0.091
Neg logit contributions
c-0.201
 belongs-0.198
 package-0.193
nob-0.190
shirt-0.189
Tokenipop
Token ID42800
Feature activation+1.77e+00

Pos logit contributions
 queens+0.111
ubb+0.110
aph+0.106
ill+0.104
Col+0.099
Neg logit contributions
 package-0.103
 belongs-0.098
cycles-0.091
 replied-0.087
 stret-0.087
Token would
Token ID561
Feature activation+1.70e+00

Pos logit contributions
ubb+0.107
 queens+0.101
 servant+0.101
Col+0.096
aph+0.094
Neg logit contributions
 tracks-0.047
 package-0.047
 track-0.047
 radio-0.045
 automobile-0.045
Token rise
Token ID4485
Feature activation+8.57e-01

Pos logit contributions
ubb+0.096
 servant+0.095
oll+0.095
 queens+0.094
acy+0.089
Neg logit contributions
 tracks-0.047
 track-0.046
 automobile-0.045
 radio-0.045
 package-0.044
Token in
Token ID287
Feature activation+2.25e-01

Pos logit contributions
ubb+0.048
 servant+0.048
 queens+0.046
oll+0.046
umbo+0.043
Neg logit contributions
 tracks-0.026
 track-0.025
 radio-0.025
 automobile-0.025
 package-0.024
Token managed
Token ID5257
Feature activation+1.48e+00

Pos logit contributions
ubb+0.115
oll+0.104
ancer+0.103
ascal+0.103
ectar+0.103
Neg logit contributions
 tracks-0.059
 track-0.058
 package-0.054
 receiver-0.052
's-0.052
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
ubb+0.080
 servant+0.074
oll+0.074
 queens+0.074
acy+0.070
Neg logit contributions
 tracks-0.049
 track-0.048
 package-0.047
 radio-0.046
 receiver-0.046
Token number
Token ID1271
Feature activation+1.85e+00

Pos logit contributions
ubb+0.078
oll+0.075
 queens+0.075
 servant+0.074
ectar+0.071
Neg logit contributions
 track-0.044
 tracks-0.043
 automobile-0.041
 stret-0.040
 package-0.040
Token all
Token ID477
Feature activation-4.94e-01

Pos logit contributions
ubb+0.105
 servant+0.105
 queens+0.103
oll+0.100
 diver+0.098
Neg logit contributions
 tracks-0.051
 track-0.048
 radio-0.047
 package-0.046
 automobile-0.046
Token her
Token ID607
Feature activation+2.14e+00

Pos logit contributions
 stret+0.018
 radio+0.017
 tracks+0.017
 automobile+0.017
 package+0.017
Neg logit contributions
ubb-0.025
 servant-0.024
 queens-0.024
ill-0.024
oll-0.023
Token l
Token ID300
Feature activation+4.24e+00

Pos logit contributions
ubb+0.113
ill+0.111
 queens+0.110
 diver+0.107
aph+0.106
Neg logit contributions
 tracks-0.068
 package-0.066
 belongs-0.063
 track-0.062
 radio-0.060
Tokenoll
Token ID692
Feature activation+2.60e+00

Pos logit contributions
 queens+0.132
ubb+0.112
 diver+0.107
 servant+0.106
aph+0.106
Neg logit contributions
shirt-0.182
c-0.175
g-0.174
 belongs-0.170
ches-0.161
Tokenip
Token ID541
Feature activation+2.22e+00

Pos logit contributions
 queens+0.108
ubb+0.106
aph+0.105
ill+0.105
 gard+0.099
Neg logit contributions
 belongs-0.107
 package-0.106
cycles-0.097
 coats-0.097
ci-0.095
Tokenops
Token ID2840
Feature activation+1.87e+00

Pos logit contributions
 queens+0.103
ubb+0.102
 servant+0.096
ill+0.095
aph+0.093
Neg logit contributions
 tracks-0.083
 belongs-0.082
 package-0.082
 track-0.081
 stret-0.079
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
ubb+0.119
 servant+0.116
 queens+0.114
Col+0.109
aph+0.107
Neg logit contributions
 track-0.042
 tracks-0.041
 package-0.039
 automobile-0.039
 radio-0.038
Token She
Token ID1375
Feature activation+1.15e+00

Pos logit contributions
ubb+0.090
 queens+0.090
ier+0.087
 servant+0.086
 diver+0.082
Neg logit contributions
 tracks-0.041
 -0.039
 radio-0.037
 receiver-0.036
 package-0.036
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
ubb+0.028
 servant+0.027
 queens+0.026
oll+0.025
ill+0.025
Neg logit contributions
 tracks-0.020
 package-0.020
 automobile-0.020
 radio-0.019
 track-0.019
Token little
Token ID1310
Feature activation+1.09e+00

Pos logit contributions
ubb+0.074
 queens+0.073
ill+0.071
ier+0.068
oll+0.068
Neg logit contributions
 package-0.049
 stret-0.046
 radio-0.045
 automobile-0.044
 tracks-0.043
Token bit
Token ID1643
Feature activation+1.86e+00

Pos logit contributions
ubb+0.066
 queens+0.065
 servant+0.063
ill+0.063
acy+0.062
Neg logit contributions
 package-0.028
 tracks-0.027
 automobile-0.026
 radio-0.025
 stret-0.025
Token of
Token ID286
Feature activation+9.20e-01

Pos logit contributions
 servant+0.098
ubb+0.097
Col+0.096
aph+0.096
 queens+0.095
Neg logit contributions
 package-0.054
 tracks-0.054
 automobile-0.052
 away-0.051
 receiver-0.049
Token the
Token ID262
Feature activation+1.53e+00

Pos logit contributions
ubb+0.049
ill+0.047
 queens+0.047
 servant+0.047
aph+0.046
Neg logit contributions
 tracks-0.030
 package-0.029
 automobile-0.029
 track-0.028
 radio-0.028
Token l
Token ID300
Feature activation+4.23e+00

Pos logit contributions
ubb+0.080
ill+0.078
 queens+0.077
aph+0.075
acy+0.074
Neg logit contributions
 package-0.054
 tracks-0.050
 automobile-0.050
 radio-0.049
 stret-0.049
Tokenoll
Token ID692
Feature activation+2.36e+00

Pos logit contributions
 queens+0.125
ubb+0.114
ill+0.107
Col+0.106
aph+0.104
Neg logit contributions
 belongs-0.194
c-0.193
shirt-0.187
 package-0.180
g-0.173
Tokenipop
Token ID42800
Feature activation+1.46e+00

Pos logit contributions
ubb+0.100
 queens+0.098
ill+0.098
aph+0.095
Col+0.095
Neg logit contributions
 package-0.099
 belongs-0.098
cycles-0.089
 automobile-0.088
 stret-0.086
Token.
Token ID13
Feature activation+7.31e-01

Pos logit contributions
ubb+0.090
 queens+0.084
 servant+0.083
Col+0.082
acy+0.080
Neg logit contributions
 package-0.037
 automobile-0.037
 track-0.036
 tracks-0.036
 radio-0.035
Token That
Token ID1320
Feature activation-9.81e-02

Pos logit contributions
ubb+0.043
 servant+0.041
 queens+0.041
oll+0.040
ill+0.039
Neg logit contributions
 tracks-0.020
 automobile-0.020
 radio-0.020
 package-0.020
 track-0.019
Token way
Token ID835
Feature activation-3.52e-01

Pos logit contributions
 tracks+0.002
 stret+0.002
 radio+0.002
 automobile+0.002
 track+0.002
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token a
Token ID257
Feature activation-5.43e-01

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.046
ill+0.044
oll+0.044
Neg logit contributions
 tracks-0.024
 package-0.023
 automobile-0.023
 radio-0.022
 stret-0.022
Token very
Token ID845
Feature activation+6.07e-01

Pos logit contributions
 tracks+0.020
cycles+0.020
 automobile+0.019
 track+0.019
 radio+0.019
Neg logit contributions
 servant-0.026
ubb-0.026
 queens-0.025
oll-0.024
ill-0.023
Token special
Token ID2041
Feature activation-1.52e+00

Pos logit contributions
ubb+0.035
 queens+0.034
 servant+0.034
ill+0.033
oll+0.033
Neg logit contributions
 tracks-0.017
 automobile-0.017
 package-0.017
 radio-0.017
 track-0.016
Token toy
Token ID13373
Feature activation+8.52e-02

Pos logit contributions
cycles+0.057
obiles+0.056
 tracks+0.055
 stret+0.053
 track+0.052
Neg logit contributions
 servant-0.073
ubb-0.069
 queens-0.067
oll-0.066
 diver-0.065
Token for
Token ID329
Feature activation+1.98e+00

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.003
 package-0.003
Token L
Token ID406
Feature activation+4.22e+00

Pos logit contributions
ubb+0.117
 servant+0.115
umbo+0.114
ill+0.113
oll+0.112
Neg logit contributions
 tracks-0.053
 track-0.048
 radio-0.046
 automobile-0.046
 their-0.045
Tokenott
Token ID1252
Feature activation-7.44e-02

Pos logit contributions
 goalie+0.160
 diver+0.158
 queens+0.158
 servant+0.156
ubb+0.152
Neg logit contributions
 belongs-0.176
c-0.168
 package-0.164
 receiver-0.160
 automobile-0.159
Tokenie
Token ID494
Feature activation-6.08e-01

Pos logit contributions
 tracks+0.003
 radio+0.003
 automobile+0.003
 stret+0.003
 track+0.003
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 tracks+0.021
 radio+0.021
 stret+0.021
 automobile+0.021
cycles+0.021
Neg logit contributions
 servant-0.030
ubb-0.030
 queens-0.029
oll-0.029
ill-0.028
Token It
Token ID632
Feature activation-9.21e-01

Pos logit contributions
ubb+0.013
 servant+0.012
 queens+0.012
oll+0.012
ill+0.012
Neg logit contributions
 tracks-0.008
 automobile-0.008
 radio-0.008
 track-0.008
 package-0.008
Token was
Token ID373
Feature activation-4.57e-02

Pos logit contributions
 stret+0.031
 radio+0.029
 automobile+0.029
 tracks+0.029
 track+0.028
Neg logit contributions
 servant-0.047
ubb-0.047
 queens-0.046
oll-0.046
ill-0.045
Token no
Token ID645
Feature activation+1.19e+00

Pos logit contributions
 tracks+0.006
 radio+0.006
 stret+0.006
 automobile+0.006
 track+0.006
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.004
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
ubb+0.078
 queens+0.074
 servant+0.074
oll+0.074
acy+0.074
Neg logit contributions
 tracks-0.024
 radio-0.023
 automobile-0.023
 track-0.023
 package-0.023
Token where
Token ID810
Feature activation+3.82e-01

Pos logit contributions
ubb+0.068
ill+0.066
 queens+0.064
 servant+0.064
Col+0.063
Neg logit contributions
 package-0.042
 radio-0.041
 automobile-0.039
 tracks-0.038
 receiver-0.037
Token did
Token ID750
Feature activation-1.01e+00

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 package-0.012
 track-0.012
Token Mr
Token ID1770
Feature activation+2.65e+00

Pos logit contributions
 stret+0.046
 tracks+0.044
 controller+0.044
 automobile+0.044
 track+0.044
Neg logit contributions
 queens-0.041
 servant-0.041
ubb-0.039
oll-0.037
 beetles-0.034
Token.
Token ID13
Feature activation+4.18e+00

Pos logit contributions
ubb+0.167
acy+0.138
Col+0.137
years+0.137
 queens+0.137
Neg logit contributions
 belongs-0.070
 package-0.064
's-0.058
 language-0.056
 radio-0.055
Token Beetle
Token ID47464
Feature activation+4.10e-01

Pos logit contributions
ier+0.178
add+0.162
ubb+0.158
oll+0.158
unn+0.152
Neg logit contributions
 -0.145
's-0.142
c-0.141
 That-0.139
 receiver-0.136
Token go
Token ID467
Feature activation+3.63e-01

Pos logit contributions
ubb+0.023
 servant+0.023
 queens+0.022
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 package-0.012
 track-0.011
Token?"
Token ID1701
Feature activation-3.38e-01

Pos logit contributions
ubb+0.020
 servant+0.019
oll+0.019
 queens+0.019
ill+0.018
Neg logit contributions
 tracks-0.012
 automobile-0.012
 track-0.012
 radio-0.011
 package-0.011
Token she
Token ID673
Feature activation-5.94e-01

Pos logit contributions
 tracks+0.016
 track+0.015
 automobile+0.015
 radio+0.015
 stret+0.015
Neg logit contributions
 servant-0.013
ubb-0.013
 queens-0.013
oll-0.012
ill-0.012
Token said
Token ID531
Feature activation-4.05e-01

Pos logit contributions
 automobile+0.022
 stret+0.022
 tracks+0.021
 package+0.021
 radio+0.021
Neg logit contributions
 servant-0.029
ubb-0.029
 queens-0.028
oll-0.027
ill-0.026
TokenThe
Token ID464
Feature activation-2.88e-01

Pos logit contributions
ubb+0.016
 servant+0.015
 queens+0.015
oll+0.015
ill+0.014
Neg logit contributions
 tracks-0.009
 radio-0.009
 stret-0.009
 track-0.009
 automobile-0.009
Token old
Token ID1468
Feature activation-5.19e-01

Pos logit contributions
 tracks+0.012
 stret+0.012
 radio+0.012
 automobile+0.012
 track+0.012
Neg logit contributions
 servant-0.012
ubb-0.012
 queens-0.012
oll-0.012
ill-0.011
Token man
Token ID582
Feature activation+3.74e-01

Pos logit contributions
 tracks+0.021
 radio+0.020
 stret+0.020
 automobile+0.020
 track+0.020
Neg logit contributions
 servant-0.024
ubb-0.023
 queens-0.023
oll-0.022
ill-0.021
Token was
Token ID373
Feature activation+1.86e-01

Pos logit contributions
ubb+0.022
 servant+0.021
 queens+0.021
oll+0.020
ill+0.020
Neg logit contributions
 tracks-0.011
 automobile-0.010
 radio-0.010
 package-0.010
 stret-0.010
Token never
Token ID1239
Feature activation+1.08e+00

Pos logit contributions
ubb+0.010
 servant+0.009
 queens+0.009
oll+0.009
ill+0.009
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 stret-0.006
 package-0.006
Token hungry
Token ID14720
Feature activation+4.17e+00

Pos logit contributions
ubb+0.053
 servant+0.051
oll+0.050
aph+0.048
 queens+0.048
Neg logit contributions
 tracks-0.039
 stret-0.038
 receiver-0.037
 package-0.037
 automobile-0.037
Token again
Token ID757
Feature activation+1.74e+00

Pos logit contributions
acy+0.249
ubb+0.247
umber+0.242
aph+0.242
ascal+0.240
Neg logit contributions
 away-0.091
 package-0.086
's-0.085
 track-0.081
 automobile-0.078
Token,
Token ID11
Feature activation+8.13e-01

Pos logit contributions
ubb+0.114
 servant+0.111
umbo+0.109
oll+0.109
aph+0.108
Neg logit contributions
 tracks-0.033
 track-0.032
 radio-0.031
 package-0.031
 automobile-0.029
Token and
Token ID290
Feature activation+7.92e-01

Pos logit contributions
ubb+0.047
 servant+0.046
oll+0.045
 queens+0.045
acy+0.044
Neg logit contributions
 tracks-0.024
 track-0.023
 automobile-0.023
 radio-0.023
 package-0.022
Token he
Token ID339
Feature activation+1.09e+00

Pos logit contributions
ubb+0.039
 servant+0.037
oll+0.037
 queens+0.036
ill+0.036
Neg logit contributions
 tracks-0.030
 track-0.028
 automobile-0.028
 package-0.028
 radio-0.028
Token forever
Token ID8097
Feature activation+1.11e+00

Pos logit contributions
ubb+0.065
 queens+0.063
 servant+0.062
aph+0.060
oll+0.059
Neg logit contributions
 tracks-0.030
 package-0.028
 radio-0.028
 automobile-0.028
 belongs-0.027
Token Jane
Token ID12091
Feature activation-4.06e-01

Pos logit contributions
 radio+0.026
 stret+0.026
 tracks+0.026
 track+0.025
 automobile+0.025
Neg logit contributions
ill-0.032
ubb-0.031
 servant-0.031
 queens-0.031
oll-0.029
Token had
Token ID550
Feature activation+6.35e-01

Pos logit contributions
 tracks+0.012
 radio+0.012
 automobile+0.012
 stret+0.012
 track+0.012
Neg logit contributions
ubb-0.022
 servant-0.022
 queens-0.021
oll-0.021
ill-0.021
Token never
Token ID1239
Feature activation+1.59e+00

Pos logit contributions
ubb+0.035
 servant+0.034
 queens+0.033
oll+0.032
aph+0.032
Neg logit contributions
 tracks-0.021
 package-0.020
 radio-0.020
 automobile-0.019
 stret-0.019
Token eaten
Token ID17065
Feature activation+3.01e+00

Pos logit contributions
ubb+0.084
 servant+0.078
aph+0.077
 queens+0.076
oll+0.076
Neg logit contributions
 tracks-0.051
 package-0.051
 receiver-0.050
 belongs-0.049
 stret-0.048
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
ubb+0.177
 servant+0.176
oll+0.174
aph+0.173
acy+0.173
Neg logit contributions
 our-0.066
 away-0.065
 their-0.062
 track-0.057
 package-0.057
Token tomato
Token ID24240
Feature activation+4.16e+00

Pos logit contributions
ubb+0.054
oll+0.052
 queens+0.051
 servant+0.050
ill+0.048
Neg logit contributions
 package-0.053
 radio-0.052
 track-0.051
 tracks-0.050
 automobile-0.050
Token before
Token ID878
Feature activation+9.77e-01

Pos logit contributions
ubb+0.232
aph+0.224
add+0.218
unn+0.217
ancer+0.213
Neg logit contributions
 away-0.103
 package-0.102
's-0.101
 track-0.099
house-0.087
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
ubb+0.060
 servant+0.059
 queens+0.058
oll+0.056
umbo+0.055
Neg logit contributions
 tracks-0.025
 radio-0.025
 package-0.024
 track-0.024
 automobile-0.024
Token but
Token ID475
Feature activation-8.60e-02

Pos logit contributions
ubb+0.076
 servant+0.075
 queens+0.072
oll+0.072
aph+0.070
Neg logit contributions
 automobile-0.029
 tracks-0.029
 track-0.028
 radio-0.028
 package-0.027
Token when
Token ID618
Feature activation+1.38e+00

Pos logit contributions
 tracks+0.004
 stret+0.004
 radio+0.004
 automobile+0.004
 track+0.004
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.004
ill-0.003
Token she
Token ID673
Feature activation+1.19e+00

Pos logit contributions
ubb+0.082
oll+0.079
 servant+0.078
ascal+0.074
umbo+0.074
Neg logit contributions
 package-0.037
 automobile-0.037
 tracks-0.037
 radio-0.036
 their-0.036
Token happy
Token ID3772
Feature activation+5.91e-01

Pos logit contributions
 servant+0.064
 queens+0.062
ubb+0.060
oll+0.059
umbo+0.058
Neg logit contributions
 tracks-0.026
 automobile-0.025
 package-0.024
 receiver-0.024
 track-0.024
Token that
Token ID326
Feature activation+6.34e-01

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.032
oll+0.032
acy+0.030
Neg logit contributions
 tracks-0.018
 radio-0.018
 track-0.017
 automobile-0.017
 package-0.017
Token she
Token ID673
Feature activation+1.20e-01

Pos logit contributions
ubb+0.027
 servant+0.026
 queens+0.026
oll+0.025
ill+0.024
Neg logit contributions
 tracks-0.028
 radio-0.027
 automobile-0.027
 track-0.027
 package-0.027
Token had
Token ID550
Feature activation+3.30e-01

Pos logit contributions
ubb+0.006
 servant+0.006
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.004
 radio-0.004
 automobile-0.004
 stret-0.004
 track-0.004
Token her
Token ID607
Feature activation+1.58e+00

Pos logit contributions
ubb+0.017
 servant+0.016
 queens+0.016
oll+0.016
ill+0.015
Neg logit contributions
 tracks-0.012
 package-0.012
 track-0.012
 automobile-0.012
 radio-0.012
Token l
Token ID300
Feature activation+4.14e+00

Pos logit contributions
ubb+0.083
 queens+0.083
ill+0.081
ier+0.079
acy+0.075
Neg logit contributions
 package-0.057
 radio-0.053
 tracks-0.053
 automobile-0.052
 track-0.051
Tokenoll
Token ID692
Feature activation+1.88e+00

Pos logit contributions
 queens+0.130
ubb+0.105
 Mon+0.102
aph+0.098
 diver+0.098
Neg logit contributions
 package-0.175
c-0.175
shirt-0.171
 belongs-0.170
age-0.163
Tokenipop
Token ID42800
Feature activation+1.24e+00

Pos logit contributions
ubb+0.077
 queens+0.075
aph+0.073
ill+0.071
Col+0.068
Neg logit contributions
 package-0.083
 belongs-0.079
cycles-0.075
 automobile-0.073
 coats-0.072
Token and
Token ID290
Feature activation+6.11e-01

Pos logit contributions
ubb+0.073
 queens+0.069
 servant+0.068
acy+0.065
Col+0.065
Neg logit contributions
 tracks-0.036
 track-0.036
 package-0.036
 automobile-0.036
 radio-0.035
Token it
Token ID340
Feature activation+4.24e-01

Pos logit contributions
ubb+0.032
 servant+0.030
 queens+0.030
oll+0.030
ill+0.029
Neg logit contributions
 tracks-0.022
 automobile-0.021
 radio-0.021
 track-0.021
 package-0.020
Token tasted
Token ID29187
Feature activation+6.43e-01

Pos logit contributions
ubb+0.026
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.011
 radio-0.011
 automobile-0.011
 track-0.010
 package-0.010
Token ground
Token ID2323
Feature activation+1.61e+00

Pos logit contributions
ill+0.157
ubb+0.154
enny+0.146
umber+0.146
acy+0.144
Neg logit contributions
 package-0.086
 radio-0.076
 tracks-0.074
 automobile-0.074
 track-0.071
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
 servant+0.090
ubb+0.088
oll+0.087
 queens+0.085
Col+0.084
Neg logit contributions
 tracks-0.052
 track-0.050
 automobile-0.049
 radio-0.047
 package-0.046
Token At
Token ID1629
Feature activation+6.13e-01

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.011
ill+0.010
Neg logit contributions
 tracks-0.005
 radio-0.005
 automobile-0.005
 stret-0.005
 package-0.005
Token last
Token ID938
Feature activation-2.53e-01

Pos logit contributions
ubb+0.037
 servant+0.037
oll+0.036
 queens+0.036
ill+0.034
Neg logit contributions
 tracks-0.017
 track-0.016
 automobile-0.016
 radio-0.016
 package-0.016
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
 tracks+0.009
 automobile+0.009
 radio+0.009
 stret+0.009
 package+0.009
Neg logit contributions
ubb-0.013
 servant-0.012
 queens-0.012
oll-0.012
ill-0.012
Token l
Token ID300
Feature activation+4.11e+00

Pos logit contributions
ill+0.070
ubb+0.070
 queens+0.066
 servant+0.065
ier+0.065
Neg logit contributions
 package-0.047
 tracks-0.043
 radio-0.043
 automobile-0.042
 track-0.041
Tokenoll
Token ID692
Feature activation+2.05e+00

Pos logit contributions
 queens+0.119
ubb+0.104
ory+0.099
 servant+0.093
 diver+0.085
Neg logit contributions
 tracks-0.200
 package-0.197
c-0.197
chair-0.196
 slot-0.196
Tokenipop
Token ID42800
Feature activation+1.23e+00

Pos logit contributions
ubb+0.088
 queens+0.087
ill+0.084
aph+0.082
Col+0.081
Neg logit contributions
 package-0.087
 belongs-0.082
 replied-0.079
 coats-0.076
 tracks-0.076
Token stopped
Token ID5025
Feature activation+1.40e+00

Pos logit contributions
ubb+0.072
 servant+0.070
 queens+0.069
oll+0.067
Col+0.065
Neg logit contributions
 tracks-0.037
 track-0.035
 package-0.035
 radio-0.035
 automobile-0.034
Token rising
Token ID7396
Feature activation+6.60e-01

Pos logit contributions
 servant+0.087
oll+0.087
 queens+0.085
ubb+0.084
umbo+0.081
Neg logit contributions
 tracks-0.036
 radio-0.035
 automobile-0.035
 track-0.035
 stret-0.032
Token and
Token ID290
Feature activation-9.86e-02

Pos logit contributions
ubb+0.038
 servant+0.038
 queens+0.037
oll+0.036
acy+0.034
Neg logit contributions
 tracks-0.020
 radio-0.019
 track-0.019
 automobile-0.019
 package-0.018
Token if
Token ID611
Feature activation+1.58e+00

Pos logit contributions
ubb+0.050
 queens+0.047
 servant+0.046
oll+0.046
ill+0.044
Neg logit contributions
 tracks-0.032
 package-0.031
 radio-0.031
 automobile-0.031
 track-0.030
Token she
Token ID673
Feature activation+1.51e+00

Pos logit contributions
ubb+0.076
oll+0.069
ill+0.069
umbo+0.067
 queens+0.067
Neg logit contributions
 package-0.063
 radio-0.060
 receiver-0.059
 automobile-0.059
 tracks-0.058
Token could
Token ID714
Feature activation+1.53e+00

Pos logit contributions
ubb+0.085
 queens+0.080
 servant+0.075
aph+0.075
ectar+0.074
Neg logit contributions
 package-0.049
 belongs-0.048
 automobile-0.045
 tracks-0.045
 receiver-0.044
Token have
Token ID423
Feature activation+2.43e-01

Pos logit contributions
ubb+0.096
 queens+0.094
 servant+0.090
oll+0.087
aph+0.086
Neg logit contributions
 automobile-0.039
 tracks-0.039
 package-0.038
 radio-0.037
 track-0.036
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
ubb+0.014
 servant+0.013
 queens+0.013
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.007
 automobile-0.007
 radio-0.007
 package-0.007
 stret-0.007
Token l
Token ID300
Feature activation+4.10e+00

Pos logit contributions
ubb+0.085
 queens+0.082
ill+0.079
aph+0.075
acy+0.075
Neg logit contributions
 package-0.054
 radio-0.048
 automobile-0.047
 tracks-0.047
 receiver-0.046
Tokenoll
Token ID692
Feature activation+2.20e+00

Pos logit contributions
 queens+0.123
ubb+0.111
aph+0.099
ipples+0.095
ill+0.091
Neg logit contributions
 package-0.198
shirt-0.198
c-0.189
 belongs-0.187
nob-0.178
Tokenipop
Token ID42800
Feature activation+5.39e-01

Pos logit contributions
ubb+0.092
 queens+0.091
aph+0.088
ill+0.087
Col+0.083
Neg logit contributions
 package-0.097
 belongs-0.092
 automobile-0.082
 replied-0.082
cycles-0.082
Token.
Token ID13
Feature activation+9.76e-01

Pos logit contributions
ubb+0.030
 servant+0.029
 queens+0.029
oll+0.027
ill+0.027
Neg logit contributions
 tracks-0.017
 automobile-0.016
 package-0.016
 radio-0.016
 track-0.016
Token The
Token ID383
Feature activation+6.72e-01

Pos logit contributions
ubb+0.063
 queens+0.062
 servant+0.060
oll+0.059
ier+0.057
Neg logit contributions
 automobile-0.023
 tracks-0.022
 package-0.022
 radio-0.022
 stret-0.021
Token shop
Token ID6128
Feature activation-3.82e-01

Pos logit contributions
ubb+0.040
ill+0.039
 queens+0.038
 servant+0.037
aph+0.036
Neg logit contributions
 package-0.020
 automobile-0.019
 radio-0.019
 tracks-0.018
 receiver-0.018
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
oll+0.090
 servant+0.086
ubb+0.086
ill+0.086
acy+0.084
Neg logit contributions
 tracks-0.026
 track-0.025
 automobile-0.022
 radio-0.021
cycles-0.019
Token little
Token ID1310
Feature activation+1.51e-01

Pos logit contributions
 stret+0.014
 tracks+0.013
 automobile+0.013
 radio+0.013
 package+0.013
Neg logit contributions
 servant-0.014
ubb-0.013
 queens-0.013
oll-0.012
ill-0.012
Token rabbit
Token ID22746
Feature activation+1.78e+00

Pos logit contributions
ubb+0.008
 servant+0.007
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.006
 automobile-0.005
 radio-0.005
 track-0.005
 package-0.005
Token was
Token ID373
Feature activation+7.26e-01

Pos logit contributions
ubb+0.109
 queens+0.104
oll+0.102
 servant+0.102
aph+0.099
Neg logit contributions
 tracks-0.048
 track-0.044
 radio-0.042
 automobile-0.041
cycles-0.041
Token never
Token ID1239
Feature activation+1.03e+00

Pos logit contributions
ubb+0.033
 servant+0.031
 queens+0.031
oll+0.030
aph+0.030
Neg logit contributions
 tracks-0.030
 track-0.029
 stret-0.028
 automobile-0.028
 radio-0.028
Token sad
Token ID6507
Feature activation+4.10e+00

Pos logit contributions
ubb+0.050
oll+0.048
 servant+0.048
acy+0.046
 queens+0.045
Neg logit contributions
 tracks-0.038
 track-0.037
 receiver-0.036
 automobile-0.035
 stret-0.035
Token again
Token ID757
Feature activation+1.40e+00

Pos logit contributions
umber+0.273
acy+0.250
uka+0.249
Col+0.249
ona+0.248
Neg logit contributions
 around-0.068
 away-0.064
's-0.061
 language-0.060
 up-0.058
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
ubb+0.088
 servant+0.088
oll+0.087
 queens+0.085
umbo+0.084
Neg logit contributions
 tracks-0.031
 track-0.030
 radio-0.029
 automobile-0.027
 receiver-0.026
Token He
Token ID679
Feature activation+1.32e+00

Pos logit contributions
ubb+0.080
 servant+0.078
 queens+0.077
oll+0.075
ier+0.074
Neg logit contributions
 tracks-0.032
 automobile-0.029
cycles-0.029
 track-0.028
 radio-0.028
Token was
Token ID373
Feature activation+2.82e-01

Pos logit contributions
ubb+0.077
 queens+0.075
 servant+0.073
oll+0.072
ill+0.070
Neg logit contributions
 tracks-0.038
 track-0.036
 radio-0.035
 belongs-0.035
 automobile-0.034
Token always
Token ID1464
Feature activation-4.19e-01

Pos logit contributions
ubb+0.014
 servant+0.013
 queens+0.013
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.010
 automobile-0.010
 track-0.010
 radio-0.010
 package-0.010
Token
Token ID198
Feature activation-1.52e+00

Pos logit contributions
 stret+0.032
 automobile+0.032
 radio+0.032
 tracks+0.032
 package+0.032
Neg logit contributions
 queens-0.034
 servant-0.034
ubb-0.033
ill-0.033
oll-0.032
Token"
Token ID1
Feature activation-1.58e-01

Pos logit contributions
 stret+0.056
 controller+0.054
cycles+0.053
 tracks+0.053
 package+0.053
Neg logit contributions
 queens-0.075
 servant-0.070
umbo-0.069
 gard-0.064
aph-0.064
TokenHello
Token ID15496
Feature activation+8.98e-03

Pos logit contributions
 tracks+0.005
 stret+0.005
 automobile+0.005
 radio+0.005
 track+0.005
Neg logit contributions
ubb-0.008
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token,
Token ID11
Feature activation-7.30e-01

Pos logit contributions
ubb+0.000
 servant+0.000
 queens+0.000
oll+0.000
ill+0.000
Neg logit contributions
 tracks-0.000
 radio-0.000
 automobile-0.000
 stret-0.000
 track-0.000
Token cloud
Token ID6279
Feature activation-1.03e+00

Pos logit contributions
 stret+0.028
 tracks+0.028
cycles+0.027
 track+0.026
 controller+0.026
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.033
oll-0.031
umbo-0.031
Token!"
Token ID2474
Feature activation-4.33e+00

Pos logit contributions
 stret+0.040
 radio+0.039
 tracks+0.039
obiles+0.038
cycles+0.038
Neg logit contributions
 queens-0.048
 servant-0.048
ubb-0.048
ill-0.046
oll-0.045
Token Ellie
Token ID44801
Feature activation-7.99e-01

Pos logit contributions
 stret+0.217
cycles+0.190
 controller+0.184
izer+0.184
 trailer+0.179
Neg logit contributions
 servant-0.130
 she-0.128
 queens-0.127
 gard-0.116
 beetles-0.114
Token said
Token ID531
Feature activation-4.31e-01

Pos logit contributions
 stret+0.026
 radio+0.025
 package+0.025
 automobile+0.025
cycles+0.024
Neg logit contributions
 servant-0.043
 queens-0.042
ubb-0.041
ill-0.041
oll-0.040
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
 tracks+0.016
 automobile+0.016
 package+0.016
 stret+0.015
 radio+0.015
Neg logit contributions
 servant-0.021
ubb-0.021
 queens-0.020
ill-0.020
oll-0.020
Token
Token ID220
Feature activation+4.65e-01

Pos logit contributions
 stret+0.047
 package+0.047
 automobile+0.044
cycles+0.044
 radio+0.044
Neg logit contributions
 servant-0.059
 queens-0.055
ill-0.055
ubb-0.053
umbo-0.052
Token
Token ID198
Feature activation-5.63e-01

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.023
Neg logit contributions
 tracks-0.015
 automobile-0.014
 radio-0.014
 track-0.014
 stret-0.014
Token
Token ID198
Feature activation-2.74e+00

Pos logit contributions
 stret+0.140
 automobile+0.138
 receiver+0.136
 belongs+0.134
 package+0.134
Neg logit contributions
enny-0.061
ill-0.060
Col-0.056
olly-0.054
 queens-0.054
Token"
Token ID1
Feature activation-6.56e-01

Pos logit contributions
 cigarettes+0.127
 belongs+0.127
 stret+0.126
ci+0.124
 receiver+0.123
Neg logit contributions
enny-0.083
 queens-0.083
olly-0.081
Col-0.078
Ben-0.071
TokenLet
Token ID5756
Feature activation-2.01e+00

Pos logit contributions
 tracks+0.028
 belongs+0.026
 stret+0.026
 track+0.026
cycles+0.026
Neg logit contributions
 queens-0.028
ubb-0.028
 servant-0.028
ill-0.027
Col-0.026
Token's
Token ID338
Feature activation-5.90e-01

Pos logit contributions
 track+0.069
 controller+0.067
 tracks+0.067
cycles+0.066
 stret+0.066
Neg logit contributions
 servant-0.088
ax-0.086
olly-0.086
ory-0.085
ill-0.085
Token help
Token ID1037
Feature activation-1.08e+00

Pos logit contributions
 stret+0.021
 tracks+0.021
 radio+0.021
 receiver+0.021
 automobile+0.020
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.027
oll-0.026
ill-0.026
Token!"
Token ID2474
Feature activation-4.27e+00

Pos logit contributions
 stret+0.035
 receiver+0.034
 automobile+0.034
 Amen+0.034
obiles+0.034
Neg logit contributions
 queens-0.054
 servant-0.053
ubb-0.051
ill-0.049
 diver-0.048
Token said
Token ID531
Feature activation-3.18e+00

Pos logit contributions
 stret+0.228
obiles+0.211
rated+0.207
 Amen+0.201
esh+0.200
Neg logit contributions
 Ben-0.101
 she-0.082
olly-0.082
 he-0.076
enny-0.072
Token Max
Token ID5436
Feature activation-1.16e+00

Pos logit contributions
obiles+0.143
 stret+0.138
puters+0.135
cycles+0.131
 belongs+0.130
Neg logit contributions
 servant-0.122
 diver-0.102
 Mon-0.102
 Colin-0.101
acy-0.096
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
 automobile+0.045
 radio+0.045
obiles+0.045
 package+0.045
 tracks+0.044
Neg logit contributions
 servant-0.055
ubb-0.051
ill-0.051
oll-0.051
 queens-0.049
Token who
Token ID508
Feature activation-1.63e+00

Pos logit contributions
 package+0.057
 belongs+0.057
obiles+0.056
 automobile+0.055
 radio+0.054
Neg logit contributions
 servant-0.056
ubb-0.055
 queens-0.052
ill-0.051
 gard-0.050
Token was
Token ID373
Feature activation-7.44e-01

Pos logit contributions
 stret+0.067
 tracks+0.064
 radio+0.062
obiles+0.061
 track+0.061
Neg logit contributions
 servant-0.070
oll-0.068
ill-0.066
ubb-0.066
 queens-0.064
TokenHi
Token ID17250
Feature activation-1.10e+00

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token,
Token ID11
Feature activation-7.44e-01

Pos logit contributions
 tracks+0.041
 stret+0.040
 track+0.039
 controller+0.039
 radio+0.039
Neg logit contributions
 servant-0.053
 queens-0.053
ubb-0.050
oll-0.048
umbo-0.048
Token I
Token ID314
Feature activation+6.58e-01

Pos logit contributions
 tracks+0.030
 stret+0.029
 track+0.028
 controller+0.028
 automobile+0.028
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.033
oll-0.031
umbo-0.031
Token'm
Token ID1101
Feature activation+3.75e-01

Pos logit contributions
ubb+0.029
 servant+0.029
 queens+0.028
oll+0.028
ill+0.026
Neg logit contributions
 tracks-0.027
 radio-0.027
 package-0.026
 automobile-0.026
 track-0.026
Token Mouse
Token ID21839
Feature activation-1.00e+00

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.019
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.012
 track-0.012
 radio-0.012
 package-0.012
Token,"
Token ID553
Feature activation-4.18e+00

Pos logit contributions
 stret+0.033
 tracks+0.033
 radio+0.032
obiles+0.032
 automobile+0.032
Neg logit contributions
 servant-0.052
 queens-0.051
ubb-0.051
ill-0.049
oll-0.048
Token said
Token ID531
Feature activation-2.20e+00

Pos logit contributions
 stret+0.198
 track+0.197
obiles+0.188
 tracks+0.187
 controller+0.183
Neg logit contributions
 servant-0.118
 gard-0.115
 beetle-0.114
 queens-0.112
 beetles-0.111
Token the
Token ID262
Feature activation+2.08e-01

Pos logit contributions
obiles+0.090
 tracks+0.088
 stret+0.087
 track+0.086
cycles+0.085
Neg logit contributions
 servant-0.098
 queens-0.089
ubb-0.087
aph-0.081
 diver-0.081
Token mouse
Token ID10211
Feature activation-9.41e-01

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.007
 radio-0.007
 automobile-0.007
 package-0.007
 track-0.007
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 stret+0.037
 radio+0.036
 package+0.036
 automobile+0.036
obiles+0.035
Neg logit contributions
 servant-0.045
ubb-0.044
 queens-0.042
ill-0.042
oll-0.042
Token "
Token ID366
Feature activation+1.31e+00

Pos logit contributions
 package+0.057
 stret+0.056
 automobile+0.056
 radio+0.055
 tracks+0.055
Neg logit contributions
 servant-0.058
 queens-0.056
ubb-0.052
umbo-0.051
ill-0.050
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 tracks+0.013
 stret+0.013
 track+0.013
 automobile+0.013
 radio+0.013
Neg logit contributions
 servant-0.020
ubb-0.020
 queens-0.020
oll-0.019
ill-0.019
Token She
Token ID1375
Feature activation-6.31e-02

Pos logit contributions
ubb+0.008
 servant+0.008
 queens+0.008
oll+0.008
ill+0.007
Neg logit contributions
 tracks-0.007
 automobile-0.007
 radio-0.007
 stret-0.007
 package-0.007
Token got
Token ID1392
Feature activation-3.35e-01

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 track+0.002
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token on
Token ID319
Feature activation-5.20e-01

Pos logit contributions
 tracks+0.011
 radio+0.011
 automobile+0.011
cycles+0.011
 stret+0.011
Neg logit contributions
ubb-0.017
 servant-0.017
 queens-0.017
oll-0.017
ill-0.016
Token a
Token ID257
Feature activation-3.05e+00

Pos logit contributions
cycles+0.016
 tracks+0.016
 receiver+0.016
 automobile+0.016
 belongs+0.016
Neg logit contributions
 servant-0.027
ubb-0.027
 queens-0.026
oll-0.025
ill-0.025
Token different
Token ID1180
Feature activation-4.15e+00

Pos logit contributions
cycles+0.120
agree+0.114
adoes+0.111
obiles+0.109
esh+0.107
Neg logit contributions
 servant-0.132
onna-0.122
 queens-0.116
 diver-0.116
 begg-0.115
Token train
Token ID4512
Feature activation-1.97e+00

Pos logit contributions
cycles+0.165
obiles+0.158
 Amen+0.155
agree+0.151
 Rusty+0.147
Neg logit contributions
 servant-0.153
 king-0.139
 diver-0.135
 begg-0.129
 sur-0.129
Token with
Token ID351
Feature activation+6.02e-02

Pos logit contributions
 Amen+0.074
cycles+0.073
obiles+0.073
 receiver+0.072
 belongs+0.071
Neg logit contributions
 servant-0.082
 queens-0.078
ill-0.077
oll-0.077
ier-0.076
Token her
Token ID607
Feature activation-7.60e-01

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Token ticket
Token ID7846
Feature activation-1.10e+00

Pos logit contributions
 stret+0.026
 automobile+0.026
obiles+0.026
 tracks+0.026
 receiver+0.026
Neg logit contributions
 servant-0.037
ubb-0.037
 queens-0.035
oll-0.035
ill-0.034
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 Amen+0.038
cycles+0.038
obiles+0.038
 stret+0.038
 tracks+0.038
Neg logit contributions
 servant-0.052
oll-0.050
 queens-0.050
ubb-0.050
ill-0.049
Token.
Token ID13
Feature activation-6.75e-02

Pos logit contributions
ubb+0.021
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.011
 radio-0.011
 track-0.011
 stret-0.011
Token He
Token ID679
Feature activation-3.21e-01

Pos logit contributions
 tracks+0.002
 radio+0.002
 stret+0.002
 automobile+0.002
 package+0.002
Neg logit contributions
ubb-0.004
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token put
Token ID1234
Feature activation-1.22e+00

Pos logit contributions
 tracks+0.011
 automobile+0.011
 stret+0.011
 package+0.011
 radio+0.011
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.015
ill-0.015
Token on
Token ID319
Feature activation-8.07e-01

Pos logit contributions
 stret+0.049
 tracks+0.047
 automobile+0.047
obiles+0.047
 radio+0.047
Neg logit contributions
 servant-0.053
ubb-0.052
 queens-0.052
oll-0.051
Col-0.046
Token his
Token ID465
Feature activation-2.52e+00

Pos logit contributions
 package+0.029
 stret+0.029
 receiver+0.028
cycles+0.028
 automobile+0.028
Neg logit contributions
 servant-0.038
ubb-0.038
 queens-0.038
oll-0.035
ill-0.035
Token favourite
Token ID12507
Feature activation-4.09e+00

Pos logit contributions
 receiver+0.093
esh+0.093
agree+0.092
 package+0.092
obiles+0.091
Neg logit contributions
 servant-0.110
 queens-0.101
ubb-0.099
oll-0.098
 diver-0.095
Token party
Token ID2151
Feature activation-4.62e-01

Pos logit contributions
agree+0.180
cycles+0.170
obiles+0.165
rated+0.164
limits+0.163
Neg logit contributions
 servant-0.154
 crabs-0.135
 queens-0.135
 diver-0.133
 crab-0.130
Token hat
Token ID6877
Feature activation+4.47e-02

Pos logit contributions
 tracks+0.016
 automobile+0.016
 stret+0.016
 radio+0.016
 package+0.016
Neg logit contributions
 servant-0.023
ubb-0.022
 queens-0.022
oll-0.022
ill-0.021
Token and
Token ID290
Feature activation-7.92e-01

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token hopped
Token ID45230
Feature activation-7.28e-01

Pos logit contributions
 stret+0.028
 package+0.027
 automobile+0.027
obiles+0.027
cycles+0.027
Neg logit contributions
 servant-0.039
 queens-0.038
ubb-0.037
oll-0.037
ill-0.035
Token along
Token ID1863
Feature activation-3.88e-01

Pos logit contributions
 automobile+0.027
 tracks+0.026
 package+0.026
cycles+0.026
 stret+0.026
Neg logit contributions
 servant-0.035
 queens-0.034
ubb-0.034
oll-0.033
ill-0.033
Token."
Token ID526
Feature activation-3.66e-01

Pos logit contributions
 stret+0.025
 automobile+0.024
 package+0.024
 tracks+0.024
 receiver+0.024
Neg logit contributions
 queens-0.033
ubb-0.033
 servant-0.033
oll-0.031
ill-0.031
Token He
Token ID679
Feature activation-2.11e-01

Pos logit contributions
 radio+0.011
 tracks+0.011
 package+0.011
 automobile+0.011
 stret+0.011
Neg logit contributions
 servant-0.019
ubb-0.019
 queens-0.019
ill-0.019
oll-0.019
Token put
Token ID1234
Feature activation-8.40e-01

Pos logit contributions
 automobile+0.007
 tracks+0.007
 stret+0.007
 radio+0.007
 package+0.007
Neg logit contributions
ubb-0.011
 servant-0.010
 queens-0.010
oll-0.010
ill-0.010
Token on
Token ID319
Feature activation-7.89e-01

Pos logit contributions
 stret+0.030
 automobile+0.030
 tracks+0.030
 radio+0.029
obiles+0.029
Neg logit contributions
ubb-0.041
 servant-0.040
oll-0.040
 queens-0.039
ill-0.038
Token a
Token ID257
Feature activation-3.15e+00

Pos logit contributions
 package+0.029
 stret+0.028
 automobile+0.028
 receiver+0.027
obiles+0.027
Neg logit contributions
ubb-0.038
 servant-0.037
 queens-0.036
ill-0.035
acy-0.035
Token different
Token ID1180
Feature activation-4.07e+00

Pos logit contributions
esh+0.132
cycles+0.128
obiles+0.123
 Amen+0.121
agree+0.121
Neg logit contributions
 diver-0.121
olly-0.118
acy-0.117
 queens-0.115
 servant-0.115
Token song
Token ID3496
Feature activation-4.27e-01

Pos logit contributions
 stret+0.153
obiles+0.151
cycles+0.150
 Amen+0.148
agree+0.137
Neg logit contributions
 diver-0.159
 servant-0.157
 crabs-0.148
ubb-0.148
ill-0.148
Token and
Token ID290
Feature activation-8.52e-01

Pos logit contributions
 tracks+0.015
 automobile+0.015
 stret+0.015
 radio+0.015
 package+0.015
Neg logit contributions
ubb-0.021
 servant-0.020
 queens-0.020
oll-0.020
ill-0.019
Token Lily
Token ID20037
Feature activation-1.09e-01

Pos logit contributions
 stret+0.031
 automobile+0.031
 package+0.030
 radio+0.030
obiles+0.029
Neg logit contributions
ubb-0.041
 servant-0.041
 queens-0.041
oll-0.039
ill-0.038
Token started
Token ID2067
Feature activation+2.34e-01

Pos logit contributions
 tracks+0.004
 automobile+0.004
 radio+0.004
 stret+0.004
 package+0.004
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token to
Token ID284
Feature activation-4.70e-01

Pos logit contributions
ubb+0.010
 servant+0.010
 queens+0.010
oll+0.010
ill+0.009
Neg logit contributions
 tracks-0.010
 radio-0.010
 track-0.010
 automobile-0.010
 stret-0.009
Token you
Token ID345
Feature activation+1.12e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.002
 radio-0.002
 automobile-0.002
 stret-0.002
 track-0.002
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
ubb+0.058
 servant+0.054
oll+0.053
 queens+0.053
ill+0.050
Neg logit contributions
 tracks-0.040
 automobile-0.040
 track-0.039
 package-0.039
 radio-0.038
Token climb
Token ID12080
Feature activation-6.35e-01

Pos logit contributions
ubb+0.087
oll+0.084
 queens+0.083
 servant+0.083
ill+0.078
Neg logit contributions
 track-0.046
 tracks-0.046
 automobile-0.045
 radio-0.043
 package-0.043
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 stret+0.019
 tracks+0.019
 receiver+0.019
 radio+0.019
 package+0.019
Neg logit contributions
ubb-0.034
 queens-0.034
 servant-0.034
oll-0.032
ill-0.032
Token wall
Token ID3355
Feature activation-4.56e-01

Pos logit contributions
ubb+0.067
ill+0.062
 queens+0.062
 servant+0.062
oll+0.061
Neg logit contributions
 tracks-0.043
 track-0.043
 automobile-0.042
 radio-0.042
 package-0.041
Token!"
Token ID2474
Feature activation-4.05e+00

Pos logit contributions
 tracks+0.012
 radio+0.012
 stret+0.012
 automobile+0.012
 package+0.012
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
ill-0.025
oll-0.025
Token Jimmy
Token ID12963
Feature activation-5.69e-01

Pos logit contributions
 stret+0.223
 trailer+0.221
cycles+0.221
izer+0.214
rated+0.208
Neg logit contributions
 she-0.102
ill-0.081
 Mon-0.080
 J-0.075
 Av-0.071
Token said
Token ID531
Feature activation-7.60e-01

Pos logit contributions
 radio+0.017
 automobile+0.017
 tracks+0.017
 stret+0.017
 package+0.016
Neg logit contributions
 servant-0.031
ubb-0.031
oll-0.030
ill-0.030
 queens-0.030
Token.
Token ID13
Feature activation-8.16e-01

Pos logit contributions
 tracks+0.029
 automobile+0.029
 package+0.029
 radio+0.029
obiles+0.029
Neg logit contributions
 servant-0.035
ubb-0.035
ill-0.034
oll-0.034
 queens-0.033
Token
Token ID220
Feature activation+2.70e-01

Pos logit contributions
 package+0.027
 stret+0.027
 radio+0.027
 automobile+0.027
 tracks+0.026
Neg logit contributions
 servant-0.042
ubb-0.040
ill-0.040
 queens-0.040
oll-0.039
Token
Token ID198
Feature activation-6.23e-01

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.014
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 stret-0.009
 track-0.009
Token the
Token ID262
Feature activation+2.18e+00

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.012
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.008
 radio-0.008
 automobile-0.008
 stret-0.007
 package-0.007
Token princess
Token ID21752
Feature activation-4.43e-01

Pos logit contributions
unch+0.093
oll+0.088
ubb+0.086
enny+0.084
ectar+0.084
Neg logit contributions
 radio-0.101
 track-0.101
 package-0.100
 automobile-0.100
 tracks-0.099
Token of
Token ID286
Feature activation+1.40e+00

Pos logit contributions
 tracks+0.015
 radio+0.014
 stret+0.014
 automobile+0.014
 track+0.014
Neg logit contributions
 servant-0.023
ubb-0.023
 queens-0.023
oll-0.022
ill-0.022
Token the
Token ID262
Feature activation+2.25e+00

Pos logit contributions
oll+0.093
ubb+0.092
 servant+0.091
ill+0.091
Col+0.089
Neg logit contributions
 track-0.026
 tracks-0.026
 automobile-0.026
 radio-0.025
 package-0.024
Token castle
Token ID16669
Feature activation-1.68e-01

Pos logit contributions
oll+0.120
unch+0.119
ubb+0.116
ectar+0.114
ill+0.112
Neg logit contributions
 track-0.078
 tracks-0.076
 automobile-0.076
 radio-0.075
 package-0.074
Token,"
Token ID553
Feature activation-4.03e+00

Pos logit contributions
 tracks+0.005
 radio+0.005
 automobile+0.005
 stret+0.005
 track+0.005
Neg logit contributions
ubb-0.009
 servant-0.009
 queens-0.009
oll-0.009
ill-0.009
Token said
Token ID531
Feature activation-2.53e+00

Pos logit contributions
 stret+0.222
 track+0.205
 controller+0.203
 DVD+0.202
izer+0.197
Neg logit contributions
 gard-0.109
 she-0.101
 queens-0.101
 Mon-0.087
 ever-0.086
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
cycles+0.110
 stret+0.110
 track+0.107
 tracks+0.107
obiles+0.104
Neg logit contributions
 servant-0.102
 queens-0.095
 gard-0.094
ier-0.094
ill-0.093
Token voice
Token ID3809
Feature activation-1.91e+00

Pos logit contributions
ubb+0.060
oll+0.058
ill+0.058
unch+0.056
ectar+0.054
Neg logit contributions
 tracks-0.047
 radio-0.046
 receiver-0.045
 package-0.044
 automobile-0.044
Token.
Token ID13
Feature activation-3.24e+00

Pos logit contributions
obiles+0.085
 track+0.083
cycles+0.083
 radio+0.082
 tracks+0.082
Neg logit contributions
 servant-0.077
ier-0.077
oll-0.075
ubb-0.075
ill-0.074
Token "
Token ID366
Feature activation+5.81e-01

Pos logit contributions
 stret+0.155
 track+0.147
 tracks+0.145
 instrument+0.143
 coats+0.143
Neg logit contributions
 gard-0.107
 servant-0.104
 queens-0.102
ill-0.098
 Mon-0.096
Token the
Token ID262
Feature activation-3.24e+00

Pos logit contributions
 stret+0.096
cycles+0.088
 Amen+0.087
obiles+0.086
 radio+0.081
Neg logit contributions
ill-0.074
ubb-0.072
acy-0.069
umbo-0.066
 queens-0.064
Token switches
Token ID18225
Feature activation-4.60e-01

Pos logit contributions
house+0.140
seed+0.139
 Amen+0.135
obiles+0.135
ishly+0.134
Neg logit contributions
ier-0.097
acy-0.096
ubb-0.092
 queens-0.086
ancer-0.083
Token and
Token ID290
Feature activation+4.35e-02

Pos logit contributions
 stret+0.016
 tracks+0.016
 automobile+0.016
 radio+0.016
 package+0.016
Neg logit contributions
ubb-0.023
 servant-0.022
 queens-0.022
oll-0.021
ill-0.021
Token pressed
Token ID12070
Feature activation-5.19e-01

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token the
Token ID262
Feature activation-3.06e+00

Pos logit contributions
 stret+0.016
 automobile+0.016
 package+0.016
 radio+0.016
cycles+0.016
Neg logit contributions
ubb-0.028
 queens-0.027
 servant-0.027
ill-0.026
oll-0.026
Token big
Token ID1263
Feature activation-4.02e+00

Pos logit contributions
seed+0.125
house+0.121
obiles+0.116
c+0.113
 Amen+0.110
Neg logit contributions
ier-0.110
ubb-0.109
acy-0.107
 servant-0.096
ancer-0.096
Token red
Token ID2266
Feature activation-3.03e+00

Pos logit contributions
obiles+0.162
seed+0.148
limits+0.145
cycles+0.143
agree+0.143
Neg logit contributions
ubb-0.127
 bud-0.126
 servant-0.123
ier-0.118
acy-0.115
Token button
Token ID4936
Feature activation-3.17e-01

Pos logit contributions
obiles+0.108
 Amen+0.100
cycles+0.099
 belongs+0.097
iday+0.096
Neg logit contributions
ier-0.123
ubb-0.118
 servant-0.118
acy-0.112
ancer-0.110
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
 tracks+0.010
 stret+0.010
 automobile+0.010
 radio+0.010
 package+0.010
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.015
ill-0.015
Token The
Token ID383
Feature activation-2.54e+00

Pos logit contributions
ubb+0.067
 servant+0.067
 queens+0.066
oll+0.063
ill+0.061
Neg logit contributions
 tracks-0.030
 radio-0.028
 automobile-0.027
 track-0.027
 package-0.026
Token radio
Token ID5243
Feature activation-1.13e+00

Pos logit contributions
seed+0.081
 belongs+0.080
c+0.078
 stret+0.075
obiles+0.075
Neg logit contributions
ubb-0.110
acy-0.104
 queens-0.104
ier-0.103
 servant-0.102
Token.
Token ID13
Feature activation-9.47e-02

Pos logit contributions
ubb+0.081
 servant+0.080
 queens+0.077
oll+0.075
Col+0.073
Neg logit contributions
 track-0.037
 tracks-0.037
 radio-0.036
 automobile-0.035
 package-0.035
Token It
Token ID632
Feature activation-2.01e+00

Pos logit contributions
 tracks+0.003
 automobile+0.003
 stret+0.003
 radio+0.003
 track+0.003
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token was
Token ID373
Feature activation-7.71e-01

Pos logit contributions
 stret+0.094
obiles+0.085
cycles+0.083
 automobile+0.083
 Amen+0.082
Neg logit contributions
ubb-0.071
ill-0.070
 queens-0.070
 servant-0.067
 diver-0.065
Token a
Token ID257
Feature activation-2.13e+00

Pos logit contributions
 stret+0.029
cycles+0.029
obiles+0.028
 automobile+0.028
 tracks+0.028
Neg logit contributions
ubb-0.036
 servant-0.035
 queens-0.035
oll-0.034
ill-0.033
Token big
Token ID1263
Feature activation-3.03e+00

Pos logit contributions
cycles+0.089
obiles+0.089
 belongs+0.084
esh+0.084
agree+0.081
Neg logit contributions
 servant-0.086
 queens-0.084
ubb-0.080
 diver-0.079
 gard-0.075
Token,
Token ID11
Feature activation-4.01e+00

Pos logit contributions
obiles+0.127
cycles+0.122
cks+0.116
 belongs+0.114
 stret+0.113
Neg logit contributions
 servant-0.112
 diver-0.109
 queens-0.105
ubb-0.103
 king-0.102
Token shiny
Token ID22441
Feature activation-2.86e+00

Pos logit contributions
esh+0.195
cycles+0.185
obiles+0.185
ishly+0.185
heit+0.182
Neg logit contributions
 queens-0.114
oll-0.094
ubb-0.091
 beetle-0.088
 king-0.086
Token radio
Token ID5243
Feature activation-1.82e+00

Pos logit contributions
obiles+0.093
cycles+0.087
agree+0.080
cks+0.078
osc+0.077
Neg logit contributions
 servant-0.143
 diver-0.136
ier-0.132
 chef-0.129
ubb-0.128
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
 stret+0.070
obiles+0.069
cycles+0.068
 tracks+0.067
 Amen+0.066
Neg logit contributions
 servant-0.078
 queens-0.076
oll-0.074
ubb-0.074
ill-0.073
Token Katie
Token ID24721
Feature activation-7.72e-01

Pos logit contributions
 stret+0.050
cycles+0.045
 automobile+0.045
obiles+0.045
 controller+0.044
Neg logit contributions
 queens-0.061
ubb-0.057
 servant-0.056
oll-0.055
ill-0.053
Token was
Token ID373
Feature activation-4.32e-01

Pos logit contributions
 stret+0.026
 automobile+0.025
 tracks+0.025
 radio+0.025
 track+0.025
Neg logit contributions
 servant-0.039
ubb-0.039
 queens-0.038
oll-0.038
ill-0.038
TokenLet
Token ID5756
Feature activation-1.89e+00

Pos logit contributions
 tracks+0.004
 stret+0.004
 radio+0.004
 automobile+0.004
 track+0.004
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token's
Token ID338
Feature activation-5.79e-02

Pos logit contributions
cycles+0.062
 stret+0.062
 controller+0.060
 trailer+0.060
 track+0.060
Neg logit contributions
olly-0.087
 queens-0.087
 servant-0.086
ill-0.085
oll-0.084
Token go
Token ID467
Feature activation-3.70e-01

Pos logit contributions
 tracks+0.002
 radio+0.002
 automobile+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token,
Token ID11
Feature activation-1.89e+00

Pos logit contributions
 tracks+0.016
 automobile+0.015
 radio+0.015
 stret+0.015
 package+0.015
Neg logit contributions
 servant-0.016
 queens-0.015
ubb-0.015
oll-0.015
ill-0.014
Token Emmy
Token ID33953
Feature activation-1.13e+00

Pos logit contributions
 stret+0.077
 tracks+0.074
 controller+0.073
cycles+0.069
obiles+0.067
Neg logit contributions
 queens-0.086
 servant-0.080
ubb-0.076
ill-0.073
ory-0.072
Token!"
Token ID2474
Feature activation-3.99e+00

Pos logit contributions
 tracks+0.032
 radio+0.032
 stret+0.031
 controller+0.031
obiles+0.031
Neg logit contributions
 servant-0.064
 queens-0.061
ubb-0.061
ill-0.059
oll-0.059
Token said
Token ID531
Feature activation-1.71e+00

Pos logit contributions
cks+0.197
 stret+0.190
cycles+0.182
igators+0.182
 snowball+0.182
Neg logit contributions
enny-0.113
 she-0.107
ory-0.106
 Av-0.105
 servant-0.105
Token her
Token ID607
Feature activation+2.26e-01

Pos logit contributions
cycles+0.080
obiles+0.077
 tracks+0.076
 belongs+0.074
 stret+0.073
Neg logit contributions
 servant-0.068
ill-0.062
ubb-0.061
 queens-0.061
acy-0.061
Token mom
Token ID1995
Feature activation-1.30e+00

Pos logit contributions
ubb+0.011
 servant+0.010
 queens+0.010
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 package-0.009
 track-0.009
Token.
Token ID13
Feature activation-2.34e+00

Pos logit contributions
 radio+0.051
 tracks+0.050
 stret+0.050
obiles+0.050
cycles+0.049
Neg logit contributions
 servant-0.059
ill-0.057
oll-0.057
ubb-0.057
 queens-0.056
Token "
Token ID366
Feature activation+7.55e-01

Pos logit contributions
 stret+0.102
cycles+0.101
 tracks+0.101
 belongs+0.100
 controller+0.099
Neg logit contributions
 queens-0.083
 servant-0.082
ill-0.079
enny-0.079
umbo-0.076
Token wide
Token ID3094
Feature activation-2.93e-02

Pos logit contributions
 stret+0.036
 package+0.035
 radio+0.035
 receiver+0.034
 automobile+0.034
Neg logit contributions
 servant-0.039
 queens-0.038
ubb-0.038
oll-0.034
ill-0.034
Token and
Token ID290
Feature activation-1.92e+00

Pos logit contributions
 tracks+0.001
 radio+0.001
 automobile+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.002
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token they
Token ID484
Feature activation-8.43e-01

Pos logit contributions
 stret+0.093
 package+0.084
 receiver+0.080
 radio+0.080
 Amen+0.078
Neg logit contributions
 queens-0.081
ubb-0.076
 servant-0.072
ill-0.067
 crabs-0.065
Token gave
Token ID2921
Feature activation-1.69e+00

Pos logit contributions
 stret+0.036
 package+0.035
 radio+0.035
 tracks+0.034
 receiver+0.034
Neg logit contributions
 servant-0.036
ubb-0.036
 queens-0.035
ill-0.033
oll-0.033
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
 stret+0.076
 Amen+0.071
 package+0.069
cycles+0.068
 radio+0.068
Neg logit contributions
 servant-0.065
ubb-0.065
 queens-0.065
aph-0.063
ill-0.060
Token loud
Token ID7812
Feature activation-3.96e+00

Pos logit contributions
cycles+0.045
obiles+0.043
 receiver+0.043
 tracks+0.042
esh+0.042
Neg logit contributions
 servant-0.050
 queens-0.047
ubb-0.046
aph-0.044
 diver-0.043
Token roar
Token ID34532
Feature activation-1.10e+00

Pos logit contributions
ci+0.152
 receiver+0.144
obiles+0.139
cycles+0.132
 belongs+0.132
Neg logit contributions
 servant-0.150
 diver-0.144
ubb-0.139
aph-0.137
 king-0.133
Token when
Token ID618
Feature activation-2.65e-01

Pos logit contributions
 stret+0.041
 package+0.040
 automobile+0.040
 receiver+0.040
obiles+0.040
Neg logit contributions
 servant-0.052
ubb-0.051
 queens-0.050
oll-0.049
ill-0.049
Token they
Token ID484
Feature activation+9.95e-01

Pos logit contributions
 stret+0.009
 tracks+0.009
 automobile+0.008
 package+0.008
 radio+0.008
Neg logit contributions
ubb-0.014
 servant-0.014
 queens-0.014
oll-0.013
ill-0.013
Token started
Token ID2067
Feature activation-6.58e-01

Pos logit contributions
ubb+0.056
 servant+0.055
oll+0.054
 queens+0.054
ectar+0.051
Neg logit contributions
 tracks-0.031
 track-0.030
 automobile-0.029
 controller-0.029
 radio-0.028
Token the
Token ID262
Feature activation-9.38e-02

Pos logit contributions
 package+0.031
 stret+0.031
 radio+0.031
 automobile+0.031
 receiver+0.030
Neg logit contributions
ubb-0.024
 servant-0.024
 queens-0.024
ill-0.021
aph-0.021
Token"
Token ID1
Feature activation-1.71e-01

Pos logit contributions
 stret+0.048
 package+0.048
 tracks+0.048
cycles+0.046
 radio+0.046
Neg logit contributions
 queens-0.052
 servant-0.051
aph-0.046
ubb-0.046
ill-0.046
TokenIt
Token ID1026
Feature activation-1.73e+00

Pos logit contributions
 tracks+0.008
 stret+0.008
 automobile+0.008
 radio+0.008
 package+0.008
Neg logit contributions
ubb-0.007
 queens-0.006
 servant-0.006
oll-0.006
ill-0.006
Token's
Token ID338
Feature activation+7.60e-02

Pos logit contributions
 stret+0.055
 automobile+0.045
 radio+0.043
cycles+0.042
obiles+0.041
Neg logit contributions
 servant-0.096
 queens-0.095
ubb-0.093
oll-0.091
ill-0.090
Token an
Token ID281
Feature activation+1.41e-01

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.003
 radio-0.003
 automobile-0.003
 track-0.003
 stret-0.003
Token earthquake
Token ID16295
Feature activation-9.66e-01

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 track-0.005
 stret-0.005
Token!"
Token ID2474
Feature activation-3.95e+00

Pos logit contributions
 tracks+0.029
 radio+0.029
 stret+0.028
cycles+0.028
obiles+0.028
Neg logit contributions
 servant-0.052
 queens-0.052
ubb-0.051
oll-0.050
ill-0.049
Token said
Token ID531
Feature activation-1.70e+00

Pos logit contributions
 stret+0.166
cycles+0.152
obiles+0.143
 controller+0.139
 track+0.137
Neg logit contributions
 servant-0.153
aph-0.141
 queens-0.140
 gard-0.136
umbo-0.130
Token her
Token ID607
Feature activation+1.83e+00

Pos logit contributions
cycles+0.060
 stret+0.058
obiles+0.057
 tracks+0.055
 package+0.054
Neg logit contributions
 servant-0.089
 queens-0.082
aph-0.081
ubb-0.079
ill-0.077
Token friend
Token ID1545
Feature activation-1.46e+00

Pos logit contributions
ubb+0.097
 queens+0.090
Col+0.090
ill+0.089
acy+0.087
Neg logit contributions
 automobile-0.067
 tracks-0.067
 radio-0.067
 track-0.067
 horn-0.064
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
obiles+0.059
cycles+0.059
 radio+0.058
 stret+0.057
 package+0.057
Neg logit contributions
 servant-0.067
ill-0.063
oll-0.063
ubb-0.062
 queens-0.060
Token the
Token ID262
Feature activation+6.00e-01

Pos logit contributions
 belongs+0.047
 package+0.047
obiles+0.046
 tracks+0.046
cycles+0.046
Neg logit contributions
 servant-0.061
ubb-0.058
 queens-0.058
ill-0.056
oll-0.054
Token.
Token ID13
Feature activation-2.35e+00

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.032
oll+0.031
ill+0.031
Neg logit contributions
 tracks-0.019
 radio-0.019
 automobile-0.019
 track-0.018
 stret-0.018
Token Suddenly
Token ID24975
Feature activation-1.57e+00

Pos logit contributions
 stret+0.093
cycles+0.093
 track+0.092
 controller+0.092
ci+0.090
Neg logit contributions
 servant-0.089
ill-0.089
 Mon-0.084
Col-0.082
 queens-0.082
Token they
Token ID484
Feature activation+1.27e+00

Pos logit contributions
cycles+0.067
obiles+0.066
 tracks+0.066
 stret+0.066
 track+0.065
Neg logit contributions
 servant-0.065
 queens-0.063
ubb-0.060
ill-0.059
oll-0.059
Token heard
Token ID2982
Feature activation-1.98e+00

Pos logit contributions
ubb+0.074
 servant+0.072
oll+0.068
 queens+0.068
ectar+0.066
Neg logit contributions
 tracks-0.038
 radio-0.037
 automobile-0.036
 package-0.036
 track-0.035
Token a
Token ID257
Feature activation-1.61e+00

Pos logit contributions
cycles+0.083
 belongs+0.080
 track+0.080
obiles+0.078
 tracks+0.077
Neg logit contributions
 queens-0.083
 servant-0.080
ill-0.077
oll-0.076
ubb-0.075
Token loud
Token ID7812
Feature activation-3.93e+00

Pos logit contributions
cycles+0.055
 tracks+0.052
obiles+0.052
 belongs+0.051
 automobile+0.050
Neg logit contributions
ubb-0.079
 servant-0.078
 queens-0.077
 diver-0.071
aph-0.071
Token noise
Token ID7838
Feature activation-1.33e+00

Pos logit contributions
cycles+0.134
obiles+0.133
 belongs+0.130
ci+0.126
 ignition+0.126
Neg logit contributions
ubb-0.161
 servant-0.153
 diver-0.149
umble-0.148
 gard-0.147
Token.
Token ID13
Feature activation-2.79e+00

Pos logit contributions
obiles+0.052
cycles+0.051
 tracks+0.050
 automobile+0.050
 receiver+0.050
Neg logit contributions
ubb-0.056
ill-0.056
 queens-0.054
oll-0.054
 servant-0.054
Token It
Token ID632
Feature activation-3.25e+00

Pos logit contributions
 track+0.109
ci+0.107
 stret+0.105
 engine+0.103
 automobile+0.103
Neg logit contributions
ill-0.107
 servant-0.105
 Mon-0.104
Col-0.104
 queens-0.103
Token was
Token ID373
Feature activation-7.58e-01

Pos logit contributions
 stret+0.154
obiles+0.145
 automobile+0.143
cycles+0.137
 trailer+0.134
Neg logit contributions
ubb-0.093
ill-0.091
ier-0.090
 queens-0.088
 ever-0.087
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
cycles+0.026
obiles+0.025
 automobile+0.025
 tracks+0.025
 package+0.025
Neg logit contributions
ubb-0.037
 servant-0.036
 queens-0.036
ill-0.035
oll-0.034
Token
Token ID198
Feature activation-1.91e+00

Pos logit contributions
 stret+0.058
 automobile+0.058
 receiver+0.057
 package+0.056
 radio+0.056
Neg logit contributions
ill-0.048
 queens-0.047
 servant-0.047
oll-0.046
ubb-0.045
Token"
Token ID1
Feature activation-1.69e-01

Pos logit contributions
 stret+0.082
 receiver+0.079
ci+0.078
 belongs+0.078
 package+0.078
Neg logit contributions
 queens-0.072
enny-0.066
umbo-0.066
olly-0.066
 servant-0.064
TokenHi
Token ID17250
Feature activation-1.17e+00

Pos logit contributions
 tracks+0.006
 radio+0.006
 track+0.006
 automobile+0.006
 stret+0.006
Neg logit contributions
ubb-0.009
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token,
Token ID11
Feature activation-8.53e-01

Pos logit contributions
 stret+0.047
 tracks+0.046
 receiver+0.045
cycles+0.044
 radio+0.044
Neg logit contributions
 servant-0.052
 queens-0.051
ubb-0.048
oll-0.048
ill-0.047
Token bird
Token ID6512
Feature activation-7.23e-01

Pos logit contributions
 stret+0.035
 tracks+0.034
cycles+0.033
 track+0.033
 controller+0.032
Neg logit contributions
 servant-0.037
 queens-0.037
ubb-0.036
oll-0.035
umbo-0.033
Token!"
Token ID2474
Feature activation-3.92e+00

Pos logit contributions
 tracks+0.027
 radio+0.027
 stret+0.027
 controller+0.026
 package+0.026
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.034
ill-0.033
oll-0.032
Token Max
Token ID5436
Feature activation-1.29e+00

Pos logit contributions
 stret+0.209
esh+0.188
rated+0.186
obiles+0.179
ci+0.172
Neg logit contributions
 Ben-0.092
 beetles-0.083
 he-0.080
 queens-0.077
 servant-0.077
Token said
Token ID531
Feature activation-7.90e-01

Pos logit contributions
 stret+0.046
 package+0.045
 automobile+0.044
obiles+0.044
 radio+0.044
Neg logit contributions
 servant-0.067
 queens-0.062
ill-0.061
aph-0.059
oll-0.058
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
obiles+0.031
 package+0.031
 stret+0.031
 automobile+0.031
 tracks+0.031
Neg logit contributions
 servant-0.036
ubb-0.035
 queens-0.034
ill-0.033
oll-0.033
Token "
Token ID366
Feature activation+7.36e-01

Pos logit contributions
 stret+0.058
 package+0.057
 automobile+0.056
 radio+0.056
 receiver+0.055
Neg logit contributions
 servant-0.055
 queens-0.052
ill-0.049
ubb-0.049
onna-0.048
TokenDo
Token ID5211
Feature activation-2.84e-01

Pos logit contributions
ubb+0.038
 servant+0.037
oll+0.036
 queens+0.035
ier+0.034
Neg logit contributions
 automobile-0.025
 radio-0.025
 tracks-0.025
 track-0.025
 receiver-0.025
Token"
Token ID1
Feature activation-9.29e-01

Pos logit contributions
 package+0.123
 belongs+0.122
 controller+0.117
 Amen+0.116
 automobile+0.115
Neg logit contributions
 queens-0.048
enny-0.044
olly-0.040
Col-0.039
J-0.039
TokenLet
Token ID5756
Feature activation-1.80e+00

Pos logit contributions
 tracks+0.036
 belongs+0.035
 package+0.034
cycles+0.033
 automobile+0.033
Neg logit contributions
 queens-0.044
ubb-0.042
 servant-0.041
ill-0.040
Col-0.039
Token's
Token ID338
Feature activation-4.89e-01

Pos logit contributions
cycles+0.059
 track+0.059
 controller+0.058
 tracks+0.057
 stret+0.056
Neg logit contributions
 servant-0.083
olly-0.082
ill-0.080
oll-0.080
andy-0.080
Token go
Token ID467
Feature activation-4.76e-01

Pos logit contributions
 tracks+0.017
 radio+0.017
 automobile+0.017
 package+0.017
 stret+0.016
Neg logit contributions
ubb-0.024
 servant-0.024
 queens-0.023
oll-0.023
ill-0.022
Token inside
Token ID2641
Feature activation-3.62e-01

Pos logit contributions
 radio+0.017
 automobile+0.017
 tracks+0.017
 stret+0.017
 package+0.017
Neg logit contributions
 servant-0.023
 queens-0.022
ubb-0.022
ill-0.021
oll-0.021
Token,"
Token ID553
Feature activation-3.91e+00

Pos logit contributions
 tracks+0.013
 automobile+0.012
 stret+0.012
 radio+0.012
 package+0.012
Neg logit contributions
ubb-0.018
 servant-0.018
 queens-0.018
ill-0.017
oll-0.017
Token Max
Token ID5436
Feature activation-1.96e+00

Pos logit contributions
 stret+0.217
obiles+0.215
esh+0.214
 trailer+0.207
cycles+0.206
Neg logit contributions
 she-0.079
 Ben-0.076
 servant-0.068
 queens-0.066
 beetles-0.066
Token said
Token ID531
Feature activation-1.44e+00

Pos logit contributions
obiles+0.079
 automobile+0.078
 radio+0.077
 stret+0.076
 tracks+0.075
Neg logit contributions
 servant-0.091
ill-0.080
oll-0.080
ona-0.078
ory-0.077
Token.
Token ID13
Feature activation-1.59e+00

Pos logit contributions
obiles+0.064
cycles+0.061
 package+0.060
 tracks+0.060
 automobile+0.060
Neg logit contributions
 servant-0.063
ubb-0.057
oll-0.057
ill-0.057
 queens-0.055
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
 package+0.070
obiles+0.069
 radio+0.068
 automobile+0.068
 belongs+0.067
Neg logit contributions
 servant-0.063
 queens-0.059
ill-0.055
enny-0.054
umbo-0.053
Token
Token ID198
Feature activation-2.11e+00

Pos logit contributions
 automobile+0.092
 package+0.091
 stret+0.091
 radio+0.091
cycles+0.090
Neg logit contributions
ill-0.057
enny-0.055
 queens-0.053
olly-0.052
 servant-0.052
Token a
Token ID257
Feature activation+1.23e-01

Pos logit contributions
 stret+0.030
 automobile+0.030
 receiver+0.030
 radio+0.029
 tracks+0.029
Neg logit contributions
 queens-0.038
 servant-0.036
ubb-0.036
oll-0.035
ill-0.033
Token swim
Token ID9422
Feature activation-7.76e-01

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.004
 radio-0.004
 automobile-0.004
 stret-0.004
 package-0.004
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
 tracks+0.024
 receiver+0.024
 stret+0.024
 radio+0.024
 automobile+0.024
Neg logit contributions
 servant-0.041
 queens-0.041
ubb-0.040
ill-0.039
oll-0.038
Token my
Token ID616
Feature activation+5.36e-01

Pos logit contributions
 tracks+0.040
 stret+0.040
 automobile+0.039
 controller+0.039
cycles+0.039
Neg logit contributions
 queens-0.049
 servant-0.048
ill-0.046
ubb-0.046
oll-0.045
Token darling
Token ID40003
Feature activation-8.99e-01

Pos logit contributions
ubb+0.025
 queens+0.024
 servant+0.024
ill+0.023
oll+0.023
Neg logit contributions
 radio-0.022
 tracks-0.022
 automobile-0.021
 package-0.021
 track-0.021
Token,"
Token ID553
Feature activation-3.86e+00

Pos logit contributions
 tracks+0.022
 radio+0.020
obiles+0.020
 track+0.020
 receiver+0.020
Neg logit contributions
 servant-0.056
 queens-0.054
ubb-0.053
ill-0.052
oll-0.052
Token her
Token ID607
Feature activation-3.90e-01

Pos logit contributions
 track+0.227
obiles+0.222
 controller+0.219
 stret+0.215
izer+0.214
Neg logit contributions
 she-0.097
 gard-0.074
 Mon-0.072
 servant-0.071
oll-0.069
Token mom
Token ID1995
Feature activation-1.26e+00

Pos logit contributions
 tracks+0.018
 stret+0.017
cycles+0.017
 track+0.017
obiles+0.017
Neg logit contributions
 servant-0.016
ubb-0.015
 queens-0.015
oll-0.015
ill-0.014
Tokenmy
Token ID1820
Feature activation-1.81e+00

Pos logit contributions
 radio+0.055
 tracks+0.054
 stret+0.054
 track+0.053
obiles+0.052
Neg logit contributions
ill-0.053
 servant-0.052
oll-0.051
 queens-0.050
umbo-0.049
Token said
Token ID531
Feature activation-1.49e+00

Pos logit contributions
 radio+0.072
 tracks+0.069
 automobile+0.069
obiles+0.068
 stret+0.067
Neg logit contributions
 servant-0.081
ill-0.080
oll-0.079
ax-0.075
ory-0.075
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
obiles+0.064
 automobile+0.063
 tracks+0.063
 radio+0.062
 package+0.061
Neg logit contributions
oll-0.062
 servant-0.061
ubb-0.061
ill-0.059
ier-0.058
Token's
Token ID338
Feature activation+6.00e-01

Pos logit contributions
 stret+0.050
 track+0.050
 controller+0.050
 tracks+0.049
 receiver+0.048
Neg logit contributions
 servant-0.079
ubb-0.077
 queens-0.077
olly-0.075
oll-0.075
Token see
Token ID766
Feature activation-4.59e-02

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.033
oll+0.032
ill+0.031
Neg logit contributions
 tracks-0.019
 track-0.018
 radio-0.018
 automobile-0.018
 package-0.017
Token what
Token ID644
Feature activation-1.34e+00

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.002
ill-0.002
Token's
Token ID338
Feature activation-1.15e+00

Pos logit contributions
 stret+0.044
 track+0.039
 receiver+0.039
cycles+0.038
obiles+0.038
Neg logit contributions
 queens-0.073
 servant-0.071
ubb-0.068
oll-0.068
 diver-0.065
Token inside
Token ID2641
Feature activation-7.28e-01

Pos logit contributions
 stret+0.044
 Amen+0.043
cycles+0.043
obiles+0.043
 automobile+0.043
Neg logit contributions
 servant-0.051
 queens-0.050
ubb-0.050
oll-0.048
ier-0.046
Token!"
Token ID2474
Feature activation-3.83e+00

Pos logit contributions
 stret+0.023
 tracks+0.023
 radio+0.022
 automobile+0.022
 track+0.022
Neg logit contributions
 queens-0.039
ubb-0.038
 servant-0.038
ill-0.037
oll-0.036
Token said
Token ID531
Feature activation-2.57e+00

Pos logit contributions
obiles+0.183
 stret+0.173
rated+0.172
initely+0.169
cycles+0.164
Neg logit contributions
 servant-0.107
oll-0.096
unn-0.096
 she-0.095
 Po-0.095
Token Bun
Token ID28515
Feature activation-7.62e-01

Pos logit contributions
obiles+0.120
 stret+0.111
cycles+0.109
 belongs+0.106
 receiver+0.105
Neg logit contributions
 servant-0.102
ubb-0.089
oll-0.086
 diver-0.085
aph-0.085
Token's
Token ID338
Feature activation+1.23e+00

Pos logit contributions
 radio+0.025
 automobile+0.025
 tracks+0.025
 stret+0.025
obiles+0.025
Neg logit contributions
 servant-0.040
 queens-0.038
ubb-0.038
oll-0.037
ill-0.036
Token friend
Token ID1545
Feature activation-9.47e-01

Pos logit contributions
ubb+0.064
ill+0.061
 queens+0.060
acy+0.059
Col+0.058
Neg logit contributions
 radio-0.048
 tracks-0.047
 track-0.046
 package-0.045
 automobile-0.045
Token,
Token ID11
Feature activation-7.97e-01

Pos logit contributions
 stret+0.036
obiles+0.036
 radio+0.036
cycles+0.035
 tracks+0.035
Neg logit contributions
 servant-0.045
ill-0.043
ubb-0.043
oll-0.043
 queens-0.042
Token The
Token ID383
Feature activation-4.85e-01

Pos logit contributions
 stret+0.019
 tracks+0.019
 radio+0.018
 package+0.018
 track+0.018
Neg logit contributions
ubb-0.019
 servant-0.019
 queens-0.018
ill-0.018
oll-0.018
Token baby
Token ID5156
Feature activation-4.76e-01

Pos logit contributions
 tracks+0.021
 track+0.020
 stret+0.020
cycles+0.020
obiles+0.020
Neg logit contributions
ubb-0.021
 servant-0.021
 queens-0.020
oll-0.019
ill-0.018
Token had
Token ID550
Feature activation+2.31e-01

Pos logit contributions
 tracks+0.014
 stret+0.014
 track+0.014
 radio+0.014
 automobile+0.013
Neg logit contributions
 servant-0.026
ubb-0.026
 queens-0.026
oll-0.025
ill-0.025
Token on
Token ID319
Feature activation-1.70e+00

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.012
oll+0.012
ill+0.011
Neg logit contributions
 tracks-0.008
 radio-0.007
 automobile-0.007
 package-0.007
 stret-0.007
Token a
Token ID257
Feature activation-2.51e+00

Pos logit contributions
cycles+0.070
obiles+0.068
 controller+0.068
 receiver+0.068
 Amen+0.067
Neg logit contributions
 servant-0.067
ubb-0.066
 queens-0.063
oll-0.063
Col-0.061
Token beautiful
Token ID4950
Feature activation-3.83e+00

Pos logit contributions
cycles+0.109
 receiver+0.103
adoes+0.103
obiles+0.101
agree+0.101
Neg logit contributions
 servant-0.097
oll-0.092
olly-0.091
ory-0.088
 queens-0.088
Token dress
Token ID6576
Feature activation-1.36e-01

Pos logit contributions
cycles+0.168
obiles+0.162
 controller+0.161
agree+0.155
 receiver+0.154
Neg logit contributions
 servant-0.130
 diver-0.118
 chef-0.116
ona-0.115
andy-0.115
Token,
Token ID11
Feature activation-5.53e-01

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 stret+0.005
 package+0.004
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.007
ill-0.006
Token with
Token ID351
Feature activation+2.26e-01

Pos logit contributions
 stret+0.020
 automobile+0.020
 tracks+0.020
 package+0.020
 radio+0.020
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
oll-0.025
ill-0.024
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.012
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.008
 radio-0.007
 automobile-0.007
 package-0.007
 track-0.007
Token bon
Token ID5351
Feature activation-9.61e-01

Pos logit contributions
cycles+0.043
 tracks+0.043
obiles+0.042
 automobile+0.042
 belongs+0.042
Neg logit contributions
 servant-0.049
ubb-0.049
 queens-0.047
oll-0.046
Col-0.044
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.007
 automobile-0.007
 radio-0.007
 track-0.007
 package-0.007
Token He
Token ID679
Feature activation-1.56e-01

Pos logit contributions
ubb+0.021
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.012
 radio-0.011
 track-0.011
 package-0.011
Token was
Token ID373
Feature activation+3.13e-01

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 stret+0.005
 package+0.004
Neg logit contributions
ubb-0.009
 servant-0.009
 queens-0.008
oll-0.008
ill-0.008
Token wearing
Token ID5762
Feature activation-1.64e+00

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.016
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.011
 radio-0.010
 automobile-0.010
 track-0.010
 stret-0.010
Token his
Token ID465
Feature activation-2.41e+00

Pos logit contributions
 belongs+0.071
 Amen+0.070
 stret+0.069
obiles+0.067
cycles+0.067
Neg logit contributions
 servant-0.065
 queens-0.065
ubb-0.062
oll-0.059
Col-0.056
Token favourite
Token ID12507
Feature activation-3.82e+00

Pos logit contributions
 Amen+0.095
agree+0.094
esh+0.092
obiles+0.090
 controller+0.089
Neg logit contributions
 servant-0.103
oll-0.093
ubb-0.089
 queens-0.087
ory-0.083
Token yellow
Token ID7872
Feature activation-2.87e+00

Pos logit contributions
agree+0.187
obiles+0.168
cycles+0.166
 Amen+0.162
rated+0.158
Neg logit contributions
 servant-0.140
ory-0.112
 crab-0.109
 chef-0.109
 queens-0.107
Token shirt
Token ID10147
Feature activation-2.58e-01

Pos logit contributions
agree+0.108
obiles+0.105
cycles+0.104
 belongs+0.102
 Amen+0.101
Neg logit contributions
 servant-0.126
ory-0.107
oll-0.107
 queens-0.103
ubb-0.103
Token and
Token ID290
Feature activation-4.62e-01

Pos logit contributions
 tracks+0.009
 radio+0.009
 stret+0.009
 automobile+0.009
 package+0.008
Neg logit contributions
ubb-0.013
 servant-0.013
 queens-0.013
oll-0.013
ill-0.012
Token smiling
Token ID16755
Feature activation+1.02e-01

Pos logit contributions
 stret+0.016
 automobile+0.016
 radio+0.016
 package+0.016
 tracks+0.016
Neg logit contributions
 servant-0.023
ubb-0.023
 queens-0.023
oll-0.022
ill-0.021
Token.
Token ID13
Feature activation+4.70e-01

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.003
 radio-0.003
 automobile-0.003
 stret-0.003
 track-0.003
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
ubb+0.120
Col+0.108
 queens+0.108
acy+0.107
 servant+0.105
Neg logit contributions
 track-0.040
 tracks-0.036
 package-0.036
 belongs-0.035
 automobile-0.035
Token He
Token ID679
Feature activation+4.58e-01

Pos logit contributions
 stret+0.018
 radio+0.018
 tracks+0.018
 automobile+0.018
 package+0.018
Neg logit contributions
 servant-0.018
ubb-0.018
 queens-0.017
oll-0.017
ill-0.017
Token added
Token ID2087
Feature activation+9.82e-01

Pos logit contributions
ubb+0.026
 queens+0.024
 servant+0.024
oll+0.023
ill+0.023
Neg logit contributions
 tracks-0.015
 automobile-0.014
 package-0.014
 radio-0.014
 track-0.014
Token cheese
Token ID9891
Feature activation+2.34e+00

Pos logit contributions
ubb+0.063
 servant+0.062
 queens+0.060
aph+0.059
oll+0.059
Neg logit contributions
 tracks-0.022
 package-0.021
 track-0.020
 radio-0.019
 automobile-0.019
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
ubb+0.143
 queens+0.132
 servant+0.131
aph+0.128
acy+0.127
Neg logit contributions
 tracks-0.060
 track-0.060
 package-0.056
 controller-0.055
 belongs-0.054
Token ham
Token ID8891
Feature activation+3.41e+00

Pos logit contributions
ubb+0.051
 servant+0.047
oll+0.045
aph+0.045
 queens+0.045
Neg logit contributions
 tracks-0.044
 track-0.042
 automobile-0.042
 radio-0.042
 stret-0.041
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ubb+0.241
aph+0.232
umbo+0.224
 queens+0.223
acy+0.218
Neg logit contributions
's-0.044
 track-0.043
 tracks-0.042
 package-0.041
 their-0.040
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
ubb+0.078
 servant+0.072
aph+0.072
oll+0.070
acy+0.070
Neg logit contributions
 tracks-0.039
 track-0.036
 automobile-0.036
 package-0.035
 radio-0.035
Token lettuce
Token ID39406
Feature activation+2.12e+00

Pos logit contributions
ubb+0.056
 servant+0.052
aph+0.051
ill+0.051
oll+0.050
Neg logit contributions
 tracks-0.042
 automobile-0.040
 package-0.040
 track-0.040
 radio-0.040
Token to
Token ID284
Feature activation+8.82e-01

Pos logit contributions
ubb+0.137
 queens+0.133
aph+0.130
 servant+0.130
Col+0.128
Neg logit contributions
 tracks-0.044
 belongs-0.042
 track-0.040
 package-0.039
cycles-0.038
Token his
Token ID465
Feature activation+8.73e-01

Pos logit contributions
ubb+0.047
 servant+0.046
 queens+0.045
oll+0.045
aph+0.043
Neg logit contributions
 tracks-0.029
 track-0.029
 stret-0.028
 automobile-0.028
 radio-0.028
Token She
Token ID1375
Feature activation+5.37e-01

Pos logit contributions
ubb+0.072
 queens+0.070
 servant+0.069
ier+0.066
 diver+0.065
Neg logit contributions
 tracks-0.042
 automobile-0.040
 radio-0.038
 package-0.037
 track-0.036
Token could
Token ID714
Feature activation+1.21e+00

Pos logit contributions
ubb+0.032
 servant+0.031
 queens+0.031
oll+0.030
ill+0.029
Neg logit contributions
 tracks-0.015
 track-0.014
 automobile-0.014
 radio-0.014
 package-0.014
Token finally
Token ID3443
Feature activation+1.21e+00

Pos logit contributions
ubb+0.064
oll+0.062
 queens+0.062
 servant+0.061
umbo+0.057
Neg logit contributions
 tracks-0.044
 track-0.044
 radio-0.042
 automobile-0.041
 stret-0.041
Token cross
Token ID3272
Feature activation+1.82e-01

Pos logit contributions
ubb+0.066
 queens+0.065
oll+0.065
 servant+0.064
acy+0.060
Neg logit contributions
 tracks-0.041
 track-0.039
 automobile-0.037
 stret-0.037
 radio-0.036
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 track-0.005
 stret-0.005
Token sm
Token ID895
Feature activation+2.73e+00

Pos logit contributions
 stret+0.006
 automobile+0.006
 tracks+0.006
 radio+0.006
 package+0.006
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Tokenelly
Token ID6148
Feature activation-4.21e-01

Pos logit contributions
 servant+0.105
ubb+0.090
 queens+0.090
acy+0.084
ona+0.083
Neg logit contributions
 automobile-0.124
 tracks-0.123
 track-0.121
 belongs-0.111
cycles-0.111
Token swamp
Token ID26837
Feature activation+6.69e-01

Pos logit contributions
 stret+0.017
 automobile+0.016
 radio+0.016
 package+0.016
 tracks+0.016
Neg logit contributions
ubb-0.020
 servant-0.019
 queens-0.019
ill-0.018
oll-0.018
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
ubb+0.037
 servant+0.036
 queens+0.035
oll+0.035
acy+0.033
Neg logit contributions
 tracks-0.022
 track-0.021
 automobile-0.020
 radio-0.020
 stret-0.020
Token After
Token ID2293
Feature activation+1.46e+00

Pos logit contributions
ubb+0.062
 queens+0.060
 servant+0.058
ier+0.056
 diver+0.056
Neg logit contributions
 tracks-0.032
 automobile-0.031
 package-0.029
 radio-0.029
 receiver-0.029
Token that
Token ID326
Feature activation-1.13e+00

Pos logit contributions
ubb+0.091
 servant+0.089
ill+0.086
oll+0.086
ory+0.086
Neg logit contributions
 tracks-0.041
 track-0.036
 radio-0.035
 automobile-0.033
 package-0.032
Token time
Token ID640
Feature activation+5.62e-02

Pos logit contributions
 tracks+0.029
 receiver+0.029
 automobile+0.029
 radio+0.028
 package+0.028
Neg logit contributions
ubb-0.033
oll-0.033
 servant-0.033
ill-0.032
 queens-0.032
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token there
Token ID612
Feature activation-5.32e-01

Pos logit contributions
 stret+0.051
 tracks+0.049
 radio+0.048
 package+0.048
 automobile+0.048
Neg logit contributions
ill-0.040
ubb-0.040
 servant-0.039
 queens-0.039
oll-0.039
Token was
Token ID373
Feature activation+2.55e-01

Pos logit contributions
 stret+0.016
 tracks+0.015
 radio+0.015
 automobile+0.015
 track+0.015
Neg logit contributions
ubb-0.029
 servant-0.029
ill-0.028
oll-0.028
 queens-0.028
Token an
Token ID281
Feature activation+6.68e-02

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.014
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.008
 radio-0.008
 automobile-0.008
 track-0.007
 package-0.007
Token oct
Token ID19318
Feature activation+3.83e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Tokenopus
Token ID25790
Feature activation-5.38e-01

Pos logit contributions
ubb+0.168
ill+0.156
 servant+0.148
ier+0.142
 queens+0.140
Neg logit contributions
 radio-0.143
 package-0.142
 track-0.139
 instrument-0.137
 automobile-0.137
Token named
Token ID3706
Feature activation+4.12e-01

Pos logit contributions
 stret+0.015
 package+0.015
 tracks+0.015
 radio+0.015
 automobile+0.014
Neg logit contributions
ubb-0.030
 servant-0.030
 queens-0.030
oll-0.029
ill-0.029
Token Tim
Token ID5045
Feature activation-3.23e-01

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.019
oll+0.019
ill+0.018
Neg logit contributions
 tracks-0.016
 track-0.015
 radio-0.015
 automobile-0.015
 package-0.015
Tokenmy
Token ID1820
Feature activation-8.22e-01

Pos logit contributions
 stret+0.011
 tracks+0.011
 radio+0.011
 package+0.011
 automobile+0.011
Neg logit contributions
 servant-0.016
ubb-0.016
 queens-0.016
ill-0.016
oll-0.015
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
 stret+0.031
 package+0.030
obiles+0.030
 receiver+0.030
 tracks+0.030
Neg logit contributions
 servant-0.038
ubb-0.037
 queens-0.037
oll-0.036
ill-0.036
Token nose
Token ID9686
Feature activation+6.99e-01

Pos logit contributions
ubb+0.034
 queens+0.032
ill+0.032
oll+0.032
 servant+0.031
Neg logit contributions
 tracks-0.024
 radio-0.024
 track-0.023
 automobile-0.022
 package-0.022
Token and
Token ID290
Feature activation-3.63e-01

Pos logit contributions
ubb+0.039
 servant+0.038
 queens+0.038
oll+0.037
acy+0.035
Neg logit contributions
 tracks-0.022
 radio-0.022
 track-0.021
 automobile-0.021
 stret-0.021
Token ran
Token ID4966
Feature activation+1.43e-01

Pos logit contributions
 automobile+0.014
 stret+0.013
 radio+0.013
 package+0.013
 tracks+0.013
Neg logit contributions
 servant-0.017
ubb-0.017
 queens-0.017
oll-0.016
ill-0.016
Token as
Token ID355
Feature activation+1.83e+00

Pos logit contributions
ubb+0.008
 servant+0.008
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.005
 radio-0.005
 automobile-0.005
 track-0.004
 stret-0.004
Token fast
Token ID3049
Feature activation+1.14e+00

Pos logit contributions
ubb+0.116
oll+0.116
 servant+0.114
ory+0.109
ill+0.109
Neg logit contributions
 tracks-0.047
 track-0.046
 radio-0.044
 automobile-0.041
 wire-0.039
Token as
Token ID355
Feature activation+2.64e+00

Pos logit contributions
ubb+0.065
 servant+0.064
 queens+0.063
ill+0.063
acy+0.062
Neg logit contributions
 tracks-0.034
 stret-0.032
 track-0.032
 automobile-0.031
 radio-0.031
Token she
Token ID673
Feature activation+1.58e+00

Pos logit contributions
ubb+0.141
 servant+0.138
oll+0.137
ascal+0.136
Col+0.134
Neg logit contributions
 tracks-0.093
 track-0.087
 their-0.080
 radio-0.079
 wire-0.079
Token could
Token ID714
Feature activation+1.53e+00

Pos logit contributions
ubb+0.085
 queens+0.081
 servant+0.080
umbo+0.077
ill+0.076
Neg logit contributions
 tracks-0.054
 package-0.050
 receiver-0.049
 belongs-0.048
 radio-0.048
Token.
Token ID13
Feature activation+7.69e-01

Pos logit contributions
 queens+0.103
ubb+0.103
 servant+0.100
oll+0.096
Col+0.094
Neg logit contributions
 tracks-0.034
 track-0.031
 radio-0.029
 stret-0.028
 automobile-0.026
Token She
Token ID1375
Feature activation+5.88e-01

Pos logit contributions
ubb+0.046
 queens+0.045
 servant+0.045
oll+0.042
ill+0.042
Neg logit contributions
 tracks-0.022
 automobile-0.021
 radio-0.021
 track-0.020
 receiver-0.020
Token made
Token ID925
Feature activation+8.80e-01

Pos logit contributions
ubb+0.032
 servant+0.031
 queens+0.031
oll+0.030
ill+0.029
Neg logit contributions
 tracks-0.020
 track-0.019
 radio-0.018
 automobile-0.018
 package-0.018
Token very
Token ID845
Feature activation+1.37e+00

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Token still
Token ID991
Feature activation-7.06e-01

Pos logit contributions
 servant+0.072
 queens+0.072
ubb+0.072
ill+0.068
oll+0.067
Neg logit contributions
 tracks-0.045
 track-0.045
 radio-0.043
 automobile-0.043
 stret-0.042
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
 automobile+0.027
 tracks+0.027
 package+0.027
 stret+0.026
 radio+0.026
Neg logit contributions
ubb-0.033
 servant-0.032
 queens-0.031
oll-0.031
ill-0.030
Token watching
Token ID4964
Feature activation+2.24e-01

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 package+0.005
 stret+0.005
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token the
Token ID262
Feature activation+6.63e-01

Pos logit contributions
ubb+0.013
 servant+0.013
 queens+0.012
oll+0.012
ill+0.012
Neg logit contributions
 tracks-0.006
 radio-0.006
 automobile-0.006
 stret-0.006
 track-0.006
Token bar
Token ID2318
Feature activation+2.65e+00

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.033
oll+0.032
ill+0.032
Neg logit contributions
 tracks-0.023
 radio-0.023
 automobile-0.023
 package-0.023
 track-0.023
Tokenber
Token ID527
Feature activation-1.28e-01

Pos logit contributions
ubb+0.116
 queens+0.111
umbo+0.103
 servant+0.100
Col+0.098
Neg logit contributions
 tracks-0.107
 automobile-0.105
 radio-0.103
 track-0.101
 package-0.101
Token in
Token ID287
Feature activation-5.67e-01

Pos logit contributions
 tracks+0.004
 automobile+0.004
 radio+0.004
 stret+0.004
 package+0.004
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token the
Token ID262
Feature activation-8.99e-01

Pos logit contributions
 automobile+0.019
 stret+0.019
 tracks+0.019
cycles+0.018
 radio+0.018
Neg logit contributions
ubb-0.029
 servant-0.029
 queens-0.028
ill-0.028
oll-0.027
Token mirror
Token ID10162
Feature activation+8.87e-01

Pos logit contributions
cycles+0.032
obiles+0.031
 tracks+0.031
 track+0.031
 controller+0.031
Neg logit contributions
ubb-0.043
 servant-0.042
oll-0.041
 queens-0.040
Col-0.038
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
ubb+0.051
 servant+0.050
 queens+0.050
oll+0.047
umbo+0.046
Neg logit contributions
 tracks-0.026
 radio-0.025
 automobile-0.025
 track-0.025
 stret-0.025
Token play
Token ID711
Feature activation-1.92e+00

Pos logit contributions
 stret+0.057
 receiver+0.056
 radio+0.056
 package+0.055
cycles+0.055
Neg logit contributions
 servant-0.057
 queens-0.054
ill-0.054
 diver-0.054
Col-0.053
Token baseball
Token ID9283
Feature activation-6.10e-01

Pos logit contributions
 receiver+0.091
 package+0.090
 Amen+0.089
 stret+0.089
 automobile+0.086
Neg logit contributions
oll-0.065
 servant-0.065
 queens-0.064
 diver-0.063
ill-0.063
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
 stret+0.022
 package+0.022
 radio+0.021
 automobile+0.021
 receiver+0.021
Neg logit contributions
 servant-0.030
 queens-0.029
ubb-0.029
oll-0.028
ill-0.028
Token Tim
Token ID5045
Feature activation-6.25e-01

Pos logit contributions
 stret+0.080
 radio+0.077
 package+0.076
 receiver+0.075
 Amen+0.074
Neg logit contributions
ill-0.072
umbo-0.068
 servant-0.064
aph-0.064
unch-0.063
Tokenmy
Token ID1820
Feature activation-1.10e+00

Pos logit contributions
 stret+0.024
 radio+0.024
 tracks+0.024
 package+0.023
 automobile+0.023
Neg logit contributions
 servant-0.029
ill-0.028
ubb-0.028
 queens-0.028
oll-0.027
Token's
Token ID338
Feature activation+6.90e-01

Pos logit contributions
 stret+0.040
 package+0.038
 automobile+0.038
 Amen+0.038
 radio+0.037
Neg logit contributions
 servant-0.053
 queens-0.051
ubb-0.051
ill-0.050
oll-0.050
Token friend
Token ID1545
Feature activation-8.75e-01

Pos logit contributions
ubb+0.032
 queens+0.031
 servant+0.030
oll+0.029
ill+0.029
Neg logit contributions
 tracks-0.029
 track-0.029
 automobile-0.029
 radio-0.029
 package-0.028
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
 stret+0.034
 radio+0.033
 tracks+0.033
obiles+0.033
 package+0.033
Neg logit contributions
 servant-0.041
ill-0.038
 queens-0.038
ubb-0.038
oll-0.038
Token Billy
Token ID15890
Feature activation-9.52e-01

Pos logit contributions
 tracks+0.032
 stret+0.032
 automobile+0.032
 radio+0.032
cycles+0.032
Neg logit contributions
 servant-0.041
 queens-0.040
ill-0.039
ubb-0.037
oll-0.037
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
 stret+0.040
 tracks+0.039
 radio+0.039
 automobile+0.039
 package+0.039
Neg logit contributions
 servant-0.040
ubb-0.039
ill-0.038
 queens-0.038
umbo-0.037
Token hit
Token ID2277
Feature activation-8.27e-01

Pos logit contributions
 tracks+0.006
 radio+0.005
 automobile+0.005
 stret+0.005
 track+0.005
Neg logit contributions
ubb-0.008
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token She
Token ID1375
Feature activation-7.54e-02

Pos logit contributions
ubb+0.073
 queens+0.072
 servant+0.069
oll+0.066
ier+0.065
Neg logit contributions
 tracks-0.042
 radio-0.041
 automobile-0.041
 track-0.039
 package-0.036
Token thinks
Token ID6834
Feature activation+1.58e+00

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token Sam
Token ID3409
Feature activation-1.21e+00

Pos logit contributions
ubb+0.092
 queens+0.091
Col+0.090
 servant+0.089
ory+0.085
Neg logit contributions
 tracks-0.048
 track-0.045
 radio-0.045
 automobile-0.043
 controller-0.040
Token is
Token ID318
Feature activation-4.19e-01

Pos logit contributions
 stret+0.036
 tracks+0.035
 track+0.034
 radio+0.034
 automobile+0.033
Neg logit contributions
 servant-0.067
oll-0.066
ill-0.065
ubb-0.065
 queens-0.063
Token silly
Token ID14397
Feature activation+3.99e-01

Pos logit contributions
 tracks+0.016
 stret+0.016
 automobile+0.016
 radio+0.016
 package+0.016
Neg logit contributions
 servant-0.020
ubb-0.020
 queens-0.020
oll-0.019
ill-0.018
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
ubb+0.023
 servant+0.023
 queens+0.023
oll+0.022
ill+0.021
Neg logit contributions
 tracks-0.011
 automobile-0.011
 radio-0.011
 package-0.011
 track-0.011
Token She
Token ID1375
Feature activation+2.36e-01

Pos logit contributions
ubb+0.057
 queens+0.054
 servant+0.054
oll+0.051
ier+0.050
Neg logit contributions
 tracks-0.035
 automobile-0.035
 radio-0.035
 track-0.033
 receiver-0.032
Token says
Token ID1139
Feature activation+1.19e+00

Pos logit contributions
ubb+0.015
 servant+0.015
 queens+0.015
oll+0.014
ill+0.014
Neg logit contributions
 tracks-0.005
 radio-0.005
 automobile-0.005
 track-0.005
 stret-0.005
Token,
Token ID11
Feature activation-3.89e-01

Pos logit contributions
ubb+0.077
 queens+0.073
 servant+0.073
oll+0.072
Col+0.072
Neg logit contributions
 radio-0.024
 tracks-0.024
 automobile-0.024
 track-0.022
 package-0.022
Token "
Token ID366
Feature activation+2.49e-02

Pos logit contributions
 tracks+0.013
 automobile+0.013
 package+0.012
 stret+0.012
 radio+0.012
Neg logit contributions
ubb-0.020
 servant-0.020
 queens-0.020
oll-0.019
ill-0.019
TokenNo
Token ID2949
Feature activation-9.12e-01

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 radio-0.001
 automobile-0.001
 stret-0.001
 package-0.001
TokenBut
Token ID1537
Feature activation-1.65e+00

Pos logit contributions
 stret+0.074
 tracks+0.072
 radio+0.071
cycles+0.070
 controller+0.070
Neg logit contributions
 queens-0.065
ill-0.062
enny-0.059
umbo-0.057
Col-0.056
Token Sarah
Token ID10490
Feature activation-2.31e-01

Pos logit contributions
 stret+0.063
cycles+0.058
 controller+0.056
 tracks+0.056
obiles+0.055
Neg logit contributions
 queens-0.078
 servant-0.077
ill-0.076
ubb-0.073
oll-0.072
Token liked
Token ID8288
Feature activation+9.30e-01

Pos logit contributions
 tracks+0.007
 radio+0.007
 automobile+0.007
 stret+0.007
 track+0.007
Neg logit contributions
ubb-0.013
 servant-0.013
 queens-0.012
oll-0.012
ill-0.012
Token the
Token ID262
Feature activation-3.71e-01

Pos logit contributions
ubb+0.055
 servant+0.053
oll+0.052
ill+0.051
 queens+0.051
Neg logit contributions
 tracks-0.028
 radio-0.027
 automobile-0.026
 track-0.026
 package-0.025
Token furry
Token ID40857
Feature activation-1.18e-01

Pos logit contributions
 stret+0.012
 automobile+0.012
 tracks+0.012
 radio+0.012
 package+0.012
Neg logit contributions
 servant-0.019
ubb-0.019
 queens-0.019
oll-0.018
ill-0.018
Token costume
Token ID16569
Feature activation+4.16e-01

Pos logit contributions
 tracks+0.004
 stret+0.004
 automobile+0.004
 radio+0.004
 package+0.003
Neg logit contributions
ubb-0.007
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token and
Token ID290
Feature activation-5.88e-01

Pos logit contributions
ubb+0.022
 servant+0.021
 queens+0.021
oll+0.020
ill+0.020
Neg logit contributions
 tracks-0.014
 automobile-0.014
 radio-0.014
 track-0.014
 package-0.014
Token really
Token ID1107
Feature activation-1.21e+00

Pos logit contributions
 stret+0.024
 radio+0.023
 tracks+0.023
 controller+0.023
 package+0.023
Neg logit contributions
 servant-0.026
ubb-0.026
 queens-0.026
oll-0.025
ill-0.024
Token wanted
Token ID2227
Feature activation+2.97e-01

Pos logit contributions
 tracks+0.047
 package+0.047
 stret+0.047
 radio+0.047
 track+0.046
Neg logit contributions
 servant-0.055
ubb-0.054
oll-0.054
 queens-0.051
ill-0.049
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.014
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.011
 automobile-0.011
 radio-0.011
 package-0.011
 track-0.011
Token keep
Token ID1394
Feature activation-7.14e-01

Pos logit contributions
 tracks+0.001
 radio+0.000
 automobile+0.000
 stret+0.000
 package+0.000
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token a
Token ID257
Feature activation+5.65e-01

Pos logit contributions
 stret+0.022
 tracks+0.022
 radio+0.022
 track+0.021
 automobile+0.021
Neg logit contributions
 servant-0.024
ubb-0.024
 queens-0.024
oll-0.023
ill-0.022
Token hug
Token ID16225
Feature activation-3.33e-01

Pos logit contributions
ubb+0.028
 servant+0.027
 queens+0.027
oll+0.026
ill+0.026
Neg logit contributions
 tracks-0.021
 radio-0.021
 package-0.020
 automobile-0.020
 stret-0.020
Token and
Token ID290
Feature activation-3.81e-01

Pos logit contributions
 tracks+0.012
 stret+0.012
 automobile+0.012
 radio+0.012
 track+0.012
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.016
ill-0.015
Token a
Token ID257
Feature activation+1.96e-01

Pos logit contributions
 stret+0.012
 automobile+0.012
 tracks+0.012
 radio+0.012
 track+0.012
Neg logit contributions
ubb-0.020
 servant-0.020
 queens-0.019
oll-0.019
ill-0.018
Token rib
Token ID12183
Feature activation-1.22e+00

Pos logit contributions
ubb+0.010
 servant+0.010
 queens+0.009
oll+0.009
ill+0.009
Neg logit contributions
 tracks-0.007
 radio-0.007
 automobile-0.007
 stret-0.007
 package-0.007
Tokenbet
Token ID11181
Feature activation+1.37e+00

Pos logit contributions
 radio+0.057
 stret+0.055
 package+0.055
 track+0.054
 receiver+0.054
Neg logit contributions
ubb-0.048
oll-0.045
 servant-0.045
 queens-0.044
ill-0.042
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
ubb+0.071
 queens+0.070
 servant+0.067
Col+0.064
oll+0.064
Neg logit contributions
 tracks-0.048
 automobile-0.047
 radio-0.047
 package-0.046
 stret-0.045
Token and
Token ID290
Feature activation-3.89e-01

Pos logit contributions
ubb+0.006
 servant+0.006
 queens+0.006
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.004
 radio-0.004
 automobile-0.004
 stret-0.004
 track-0.004
Token then
Token ID788
Feature activation+6.76e-01

Pos logit contributions
 tracks+0.013
 stret+0.013
 radio+0.013
 automobile+0.013
 package+0.013
Neg logit contributions
 servant-0.020
ubb-0.020
 queens-0.019
oll-0.019
ill-0.018
Token off
Token ID572
Feature activation+6.48e-02

Pos logit contributions
ubb+0.038
 servant+0.036
 queens+0.036
oll+0.036
ill+0.035
Neg logit contributions
 tracks-0.021
 automobile-0.020
 radio-0.020
 track-0.020
 package-0.020
Token they
Token ID484
Feature activation+1.21e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 radio-0.002
 automobile-0.002
 stret-0.002
 track-0.002
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
 tracks+0.007
 radio+0.007
 automobile+0.007
 stret+0.007
 package+0.007
Neg logit contributions
ubb-0.010
 servant-0.010
 queens-0.010
oll-0.010
ill-0.010
Token
Token ID198
Feature activation+6.50e-02

Pos logit contributions
ubb+0.019
 servant+0.019
 queens+0.019
oll+0.018
ill+0.017
Neg logit contributions
 tracks-0.012
 automobile-0.012
 stret-0.011
 radio-0.011
 track-0.011
TokenThe
Token ID464
Feature activation+1.04e+00

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Token little
Token ID1310
Feature activation+5.31e-01

Pos logit contributions
ubb+0.050
ill+0.048
 queens+0.047
 servant+0.047
oll+0.046
Neg logit contributions
 tracks-0.040
 radio-0.040
 automobile-0.040
 package-0.039
 track-0.039
Token girl
Token ID2576
Feature activation-3.54e-01

Pos logit contributions
ubb+0.028
 servant+0.027
 queens+0.027
oll+0.027
ill+0.026
Neg logit contributions
 tracks-0.017
 automobile-0.017
 radio-0.017
 package-0.017
 track-0.017
Token loved
Token ID6151
Feature activation+5.95e-01

Pos logit contributions
 radio+0.011
 tracks+0.011
 automobile+0.010
 stret+0.010
 track+0.010
Neg logit contributions
ubb-0.019
 servant-0.019
 queens-0.019
oll-0.019
ill-0.018
Token her
Token ID607
Feature activation-1.17e+00

Pos logit contributions
ubb+0.033
 servant+0.032
oll+0.032
 queens+0.031
ill+0.031
Neg logit contributions
 tracks-0.019
 radio-0.019
 automobile-0.019
 track-0.018
 package-0.018
Token book
Token ID1492
Feature activation-2.07e-01

Pos logit contributions
cycles+0.044
obiles+0.043
 stret+0.042
 receiver+0.040
 Amen+0.040
Neg logit contributions
 servant-0.058
ubb-0.055
 queens-0.051
oll-0.051
ill-0.049
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
 tracks+0.007
 radio+0.007
 stret+0.007
 automobile+0.007
 package+0.007
Neg logit contributions
ubb-0.010
 servant-0.010
 queens-0.010
oll-0.010
ill-0.010
Token so
Token ID523
Feature activation+5.85e-01

Pos logit contributions
 tracks+0.004
 automobile+0.004
 radio+0.004
 stret+0.004
 package+0.004
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token she
Token ID673
Feature activation+4.90e-01

Pos logit contributions
ubb+0.023
 servant+0.022
 queens+0.022
oll+0.021
ill+0.021
Neg logit contributions
 tracks-0.028
 automobile-0.027
 radio-0.027
 track-0.027
 package-0.027
Token the
Token ID262
Feature activation+7.33e-01

Pos logit contributions
ubb+0.051
 servant+0.050
oll+0.049
 queens+0.048
Col+0.047
Neg logit contributions
 tracks-0.036
 automobile-0.034
 radio-0.034
 track-0.033
 receiver-0.033
Token big
Token ID1263
Feature activation-3.51e-01

Pos logit contributions
ubb+0.036
 servant+0.035
 queens+0.035
oll+0.035
ill+0.034
Neg logit contributions
 tracks-0.027
 radio-0.027
 automobile-0.026
 package-0.026
 track-0.026
Token fish
Token ID5916
Feature activation-3.15e-01

Pos logit contributions
 tracks+0.013
 stret+0.013
 radio+0.013
 automobile+0.013
 track+0.013
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.015
ill-0.015
Token away
Token ID1497
Feature activation-3.63e-01

Pos logit contributions
 tracks+0.011
 automobile+0.011
 radio+0.011
 stret+0.011
 package+0.011
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.015
oll-0.015
ill-0.015
Token.
Token ID13
Feature activation+6.64e-01

Pos logit contributions
 tracks+0.012
 automobile+0.012
 stret+0.012
 radio+0.012
 package+0.012
Neg logit contributions
ubb-0.019
 servant-0.018
 queens-0.018
oll-0.017
ill-0.017
Token
Token ID198
Feature activation+1.12e-01

Pos logit contributions
ubb+0.037
 queens+0.036
 servant+0.036
oll+0.035
ier+0.033
Neg logit contributions
 tracks-0.021
 automobile-0.020
 track-0.020
 radio-0.020
 receiver-0.019
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
ubb+0.006
 servant+0.006
 queens+0.006
oll+0.006
ill+0.005
Neg logit contributions
 tracks-0.004
 radio-0.004
 automobile-0.004
 stret-0.004
 track-0.004
TokenFrom
Token ID4863
Feature activation-1.22e+00

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.006
 radio-0.006
 automobile-0.006
 stret-0.006
 track-0.006
Token that
Token ID326
Feature activation-8.04e-01

Pos logit contributions
 stret+0.053
 controller+0.051
 radio+0.050
 tracks+0.050
 automobile+0.049
Neg logit contributions
oll-0.049
 servant-0.048
ubb-0.048
 queens-0.048
aph-0.046
Token day
Token ID1110
Feature activation+5.32e-01

Pos logit contributions
 stret+0.036
 automobile+0.032
 tracks+0.032
obiles+0.032
 radio+0.031
Neg logit contributions
ubb-0.036
 servant-0.036
 queens-0.034
oll-0.033
acy-0.033
Token on
Token ID319
Feature activation-3.38e-01

Pos logit contributions
ubb+0.031
 queens+0.030
 servant+0.030
ill+0.029
oll+0.029
Neg logit contributions
 tracks-0.015
 track-0.014
 radio-0.014
 package-0.014
 automobile-0.014
Tokenoof
Token ID37711
Feature activation+1.03e+00

Pos logit contributions
 tracks+0.007
 stret+0.007
 radio+0.007
 automobile+0.007
 track+0.006
Neg logit contributions
ubb-0.018
 servant-0.018
oll-0.017
 queens-0.017
ill-0.017
Token,
Token ID11
Feature activation+2.93e-01

Pos logit contributions
ubb+0.064
 servant+0.060
 queens+0.060
Col+0.058
oll+0.057
Neg logit contributions
 tracks-0.027
 radio-0.026
 automobile-0.026
 package-0.026
 track-0.024
Token wo
Token ID24486
Feature activation+1.54e+00

Pos logit contributions
ubb+0.015
 servant+0.014
 queens+0.014
oll+0.014
ill+0.014
Neg logit contributions
 tracks-0.011
 radio-0.010
 automobile-0.010
 stret-0.010
 package-0.010
Tokenof
Token ID1659
Feature activation-1.29e-01

Pos logit contributions
ubb+0.078
 servant+0.078
 queens+0.074
oll+0.071
ier+0.067
Neg logit contributions
 automobile-0.054
 package-0.054
 tracks-0.053
 radio-0.052
 track-0.051
Token!"
Token ID2474
Feature activation-1.63e-01

Pos logit contributions
 tracks+0.004
 radio+0.004
 automobile+0.004
 stret+0.004
 track+0.004
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
 tracks+0.008
 automobile+0.008
 package+0.008
 radio+0.008
 track+0.008
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.005
ill-0.005
Token dolls
Token ID36062
Feature activation+6.11e-01

Pos logit contributions
 tracks+0.006
 stret+0.005
 automobile+0.005
 radio+0.005
 package+0.005
Neg logit contributions
 servant-0.009
ubb-0.009
 queens-0.009
oll-0.009
ill-0.008
Token say
Token ID910
Feature activation-3.44e-01

Pos logit contributions
ubb+0.036
 queens+0.035
 servant+0.035
oll+0.034
ill+0.032
Neg logit contributions
 tracks-0.017
 radio-0.017
 stret-0.017
 automobile-0.017
 track-0.017
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
 automobile+0.012
 tracks+0.012
 package+0.012
 radio+0.012
 stret+0.012
Neg logit contributions
ubb-0.018
 servant-0.017
 queens-0.017
oll-0.017
ill-0.016
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
 automobile+0.012
 package+0.012
 tracks+0.012
 stret+0.012
 track+0.012
Neg logit contributions
 servant-0.019
ubb-0.019
 queens-0.018
oll-0.018
ill-0.018
Token
Token ID198
Feature activation-6.79e-01

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token very
Token ID845
Feature activation+5.83e-01

Pos logit contributions
 stret+0.023
 radio+0.023
 package+0.023
 automobile+0.022
 tracks+0.022
Neg logit contributions
ubb-0.027
 servant-0.027
 queens-0.026
oll-0.025
ill-0.024
Token sad
Token ID6507
Feature activation+6.28e-01

Pos logit contributions
ubb+0.032
 servant+0.032
 queens+0.031
ill+0.031
oll+0.031
Neg logit contributions
 tracks-0.018
 automobile-0.018
 track-0.018
 radio-0.017
 stret-0.017
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
ubb+0.039
 servant+0.037
 queens+0.037
oll+0.036
Col+0.035
Neg logit contributions
 tracks-0.017
 radio-0.017
 track-0.016
 automobile-0.016
 package-0.016
Token So
Token ID1406
Feature activation-1.43e+00

Pos logit contributions
 stret+0.062
 package+0.059
 radio+0.058
 Amen+0.057
esh+0.055
Neg logit contributions
 servant-0.060
 queens-0.059
ill-0.058
oll-0.057
olly-0.056
Token,
Token ID11
Feature activation-4.88e-01

Pos logit contributions
 stret+0.079
obiles+0.070
 receiver+0.070
 controller+0.070
 radio+0.069
Neg logit contributions
 servant-0.049
 queens-0.048
oll-0.043
ill-0.043
olly-0.042
Token they
Token ID484
Feature activation-3.60e-01

Pos logit contributions
 stret+0.019
 tracks+0.019
 package+0.018
 radio+0.018
cycles+0.018
Neg logit contributions
 queens-0.023
 servant-0.023
ubb-0.022
oll-0.021
ill-0.021
Token sat
Token ID3332
Feature activation-5.17e-01

Pos logit contributions
 tracks+0.014
 stret+0.014
 radio+0.014
 automobile+0.014
 package+0.014
Neg logit contributions
 servant-0.017
 queens-0.016
ubb-0.016
oll-0.015
ill-0.015
Token down
Token ID866
Feature activation+7.65e-01

Pos logit contributions
 tracks+0.017
 stret+0.017
 radio+0.016
 package+0.016
 automobile+0.016
Neg logit contributions
 servant-0.027
 queens-0.027
ubb-0.027
oll-0.026
ill-0.025
Token to
Token ID284
Feature activation-9.08e-01

Pos logit contributions
ubb+0.045
 servant+0.043
 queens+0.042
oll+0.042
ill+0.040
Neg logit contributions
 tracks-0.022
 track-0.022
 radio-0.022
 automobile-0.022
 package-0.021
Token discuss
Token ID2112
Feature activation-1.95e+00

Pos logit contributions
 package+0.034
cycles+0.034
 automobile+0.034
 tracks+0.034
 receiver+0.033
Neg logit contributions
 servant-0.041
 queens-0.040
ubb-0.039
ill-0.039
aph-0.037
Token their
Token ID511
Feature activation-3.35e-01

Pos logit contributions
 Amen+0.075
cycles+0.071
 receiver+0.068
 stret+0.068
obiles+0.068
Neg logit contributions
 queens-0.089
 servant-0.087
ill-0.083
oll-0.080
 beetles-0.080
Token look
Token ID804
Feature activation-1.21e+00

Pos logit contributions
 tracks+0.007
 automobile+0.006
 radio+0.006
 stret+0.006
 track+0.006
Neg logit contributions
ubb-0.009
 servant-0.009
 queens-0.009
oll-0.009
ill-0.009
Token!
Token ID0
Feature activation-1.51e-01

Pos logit contributions
 tracks+0.052
 automobile+0.051
 radio+0.051
 controller+0.050
cycles+0.050
Neg logit contributions
ubb-0.050
 queens-0.049
 servant-0.049
oll-0.049
ill-0.046
Token An
Token ID1052
Feature activation-4.26e-01

Pos logit contributions
 tracks+0.005
 radio+0.005
 stret+0.005
 automobile+0.005
 track+0.005
Neg logit contributions
ubb-0.008
 servant-0.008
 queens-0.008
oll-0.007
ill-0.007
Token aer
Token ID9551
Feature activation+1.10e-01

Pos logit contributions
 tracks+0.016
 radio+0.016
 stret+0.016
 receiver+0.016
 track+0.016
Neg logit contributions
ubb-0.020
 servant-0.019
 queens-0.019
oll-0.019
ill-0.018
Tokenopl
Token ID20106
Feature activation+1.77e+00

Pos logit contributions
ubb+0.006
 servant+0.006
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.003
 track-0.003
Tokenane
Token ID1531
Feature activation-1.50e+00

Pos logit contributions
ubb+0.072
 servant+0.072
 queens+0.065
ory+0.062
oll+0.061
Neg logit contributions
 automobile-0.084
 package-0.081
 radio-0.079
 receiver-0.078
obiles-0.077
Token!"
Token ID2474
Feature activation-8.16e-01

Pos logit contributions
 stret+0.055
 radio+0.053
 tracks+0.053
 automobile+0.052
cycles+0.050
Neg logit contributions
ubb-0.072
 queens-0.071
 servant-0.071
ill-0.068
oll-0.067
Token
Token ID198
Feature activation-3.51e-01

Pos logit contributions
 automobile+0.033
 track+0.032
 tracks+0.031
 controller+0.031
 package+0.031
Neg logit contributions
 servant-0.037
ubb-0.036
oll-0.036
 queens-0.034
acy-0.034
Token
Token ID198
Feature activation-4.87e-01

Pos logit contributions
 automobile+0.013
 tracks+0.013
 track+0.013
 package+0.013
 radio+0.013
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.016
ill-0.015
TokenHer
Token ID9360
Feature activation+1.79e+00

Pos logit contributions
 automobile+0.022
 tracks+0.022
 track+0.022
 package+0.021
 stret+0.021
Neg logit contributions
 servant-0.019
ubb-0.019
 queens-0.019
oll-0.018
acy-0.017
Token mum
Token ID25682
Feature activation-4.80e-01

Pos logit contributions
 queens+0.087
ubb+0.081
ier+0.080
ill+0.079
umbo+0.077
Neg logit contributions
 package-0.071
 radio-0.071
 tracks-0.071
 automobile-0.067
 track-0.066
Token showed
Token ID3751
Feature activation-1.73e+00

Pos logit contributions
 stret+0.016
 package+0.015
 radio+0.015
 automobile+0.015
 tracks+0.015
Neg logit contributions
ubb-0.021
 queens-0.021
 servant-0.021
oll-0.020
ill-0.019
Token it
Token ID340
Feature activation-3.96e-01

Pos logit contributions
 stret+0.074
cycles+0.068
 Amen+0.067
obiles+0.067
 package+0.065
Neg logit contributions
 servant-0.073
ubb-0.071
 queens-0.070
oll-0.067
aph-0.067
Token to
Token ID284
Feature activation-2.11e-01

Pos logit contributions
 stret+0.019
 automobile+0.018
 radio+0.018
 package+0.018
 tracks+0.018
Neg logit contributions
 servant-0.015
ubb-0.015
 queens-0.015
oll-0.014
ill-0.014
Token their
Token ID511
Feature activation+6.05e-01

Pos logit contributions
 tracks+0.007
 stret+0.007
 package+0.007
 automobile+0.007
 radio+0.007
Neg logit contributions
ubb-0.011
 servant-0.011
 queens-0.011
oll-0.010
ill-0.010
Token parents
Token ID3397
Feature activation-6.71e-01

Pos logit contributions
ubb+0.029
 servant+0.027
 queens+0.027
oll+0.027
ill+0.027
Neg logit contributions
 tracks-0.025
 automobile-0.024
 radio-0.024
 track-0.024
 package-0.024
Token when
Token ID618
Feature activation+9.92e-02

Pos logit contributions
 tracks+0.024
 stret+0.024
 automobile+0.024
 radio+0.024
 package+0.024
Neg logit contributions
 servant-0.033
ubb-0.033
 queens-0.032
ill-0.031
oll-0.031
Token they
Token ID484
Feature activation+1.12e+00

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.003
 radio-0.003
 automobile-0.003
 stret-0.003
 track-0.003
Token came
Token ID1625
Feature activation+2.54e-01

Pos logit contributions
ubb+0.065
 servant+0.064
oll+0.062
 queens+0.061
ectar+0.058
Neg logit contributions
 automobile-0.031
 package-0.031
 tracks-0.030
 radio-0.029
 receiver-0.029
Token outside
Token ID2354
Feature activation-1.64e-01

Pos logit contributions
ubb+0.013
 servant+0.013
 queens+0.013
oll+0.012
ill+0.012
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 track-0.009
 package-0.009
Token to
Token ID284
Feature activation-4.82e-01

Pos logit contributions
 tracks+0.006
 automobile+0.006
 radio+0.006
 stret+0.006
 package+0.006
Neg logit contributions
ubb-0.008
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token see
Token ID766
Feature activation-1.30e+00

Pos logit contributions
 tracks+0.017
 radio+0.016
 stret+0.016
 package+0.016
 automobile+0.016
Neg logit contributions
ubb-0.024
 servant-0.024
 queens-0.024
ill-0.023
oll-0.022
Token black
Token ID2042
Feature activation+8.76e-01

Pos logit contributions
 tracks+0.021
 stret+0.020
 radio+0.020
 automobile+0.020
cycles+0.020
Neg logit contributions
ubb-0.023
 servant-0.023
 queens-0.023
oll-0.022
ill-0.021
Token dog
Token ID3290
Feature activation-1.69e+00

Pos logit contributions
ubb+0.052
 queens+0.049
 servant+0.048
oll+0.047
ill+0.047
Neg logit contributions
 automobile-0.027
 radio-0.027
 tracks-0.026
 package-0.026
 track-0.025
Token named
Token ID3706
Feature activation-1.07e+00

Pos logit contributions
 stret+0.065
obiles+0.064
 controller+0.062
 package+0.062
 receiver+0.062
Neg logit contributions
 servant-0.077
 queens-0.074
oll-0.072
umbo-0.068
ill-0.067
Token Max
Token ID5436
Feature activation-1.37e+00

Pos logit contributions
 stret+0.033
 receiver+0.032
 automobile+0.031
cycles+0.031
 tracks+0.031
Neg logit contributions
 servant-0.056
oll-0.054
 queens-0.054
umbo-0.053
ubb-0.052
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 stret+0.064
 controller+0.063
 receiver+0.063
obiles+0.062
 package+0.062
Neg logit contributions
 servant-0.052
ill-0.049
 queens-0.048
oll-0.048
ubb-0.044
Token Max
Token ID5436
Feature activation-2.06e+00

Pos logit contributions
 stret+0.046
 receiver+0.041
 package+0.041
 tracks+0.039
 controller+0.038
Neg logit contributions
 servant-0.066
 queens-0.065
umbo-0.064
oll-0.064
ill-0.062
Token loved
Token ID6151
Feature activation+2.81e-01

Pos logit contributions
 stret+0.094
 controller+0.089
 receiver+0.086
 package+0.085
obiles+0.082
Neg logit contributions
 servant-0.081
oll-0.079
Col-0.075
 queens-0.073
ill-0.069
Token to
Token ID284
Feature activation-1.79e-01

Pos logit contributions
ubb+0.013
 servant+0.012
 queens+0.012
oll+0.012
ill+0.012
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 track-0.012
 package-0.012
Token play
Token ID711
Feature activation-8.57e-01

Pos logit contributions
 tracks+0.006
 automobile+0.005
 package+0.005
 radio+0.005
 stret+0.005
Neg logit contributions
ubb-0.010
 servant-0.010
 queens-0.009
oll-0.009
ill-0.009
Token outside
Token ID2354
Feature activation+6.24e-02

Pos logit contributions
 receiver+0.033
 package+0.032
 tracks+0.032
 automobile+0.031
 stret+0.031
Neg logit contributions
 servant-0.039
 queens-0.038
oll-0.037
ubb-0.037
 diver-0.035
Token in
Token ID287
Feature activation-7.89e-01

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Token her
Token ID607
Feature activation+3.11e-01

Pos logit contributions
 tracks+0.008
 automobile+0.008
 radio+0.008
 stret+0.008
 track+0.008
Neg logit contributions
ubb-0.012
 servant-0.012
 queens-0.012
oll-0.011
ill-0.011
Token,
Token ID11
Feature activation-4.50e-01

Pos logit contributions
ubb+0.019
 queens+0.018
 servant+0.018
oll+0.017
ill+0.017
Neg logit contributions
 tracks-0.009
 radio-0.008
 automobile-0.008
 package-0.008
 track-0.008
Token �
Token ID6184
Feature activation-5.64e-01

Pos logit contributions
 tracks+0.015
 automobile+0.015
 radio+0.014
 stret+0.014
 package+0.014
Neg logit contributions
ubb-0.024
 servant-0.023
 queens-0.023
oll-0.022
ill-0.022
Token�
Token ID95
Feature activation-1.73e+00

Pos logit contributions
 tracks+0.021
 radio+0.021
 automobile+0.021
 stret+0.020
 package+0.020
Neg logit contributions
ubb-0.027
 servant-0.027
 queens-0.026
oll-0.026
ill-0.025
Token€
Token ID26391
Feature activation-7.03e-01

Pos logit contributions
 automobile+0.057
 radio+0.056
 tracks+0.056
 stret+0.053
 package+0.053
Neg logit contributions
oll-0.084
ill-0.084
 servant-0.084
acy-0.083
 queens-0.082
Token�
Token ID129
Feature activation-2.11e+00

Pos logit contributions
 tracks+0.024
 radio+0.024
 stret+0.023
 automobile+0.023
 track+0.023
Neg logit contributions
 queens-0.035
 servant-0.035
ubb-0.035
ill-0.034
oll-0.034
Token�
Token ID241
Feature activation+7.63e-01

Pos logit contributions
 package+0.072
 radio+0.069
 automobile+0.069
 controller+0.068
 stret+0.066
Neg logit contributions
ubb-0.097
 servant-0.097
oll-0.095
umble-0.094
acy-0.092
TokenWhat
Token ID2061
Feature activation+2.91e-01

Pos logit contributions
ubb+0.038
 servant+0.038
 queens+0.036
oll+0.035
umbo+0.034
Neg logit contributions
 tracks-0.028
 radio-0.028
 automobile-0.027
 package-0.027
 track-0.027
Token is
Token ID318
Feature activation-1.40e+00

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.015
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.009
 radio-0.009
 automobile-0.009
 package-0.009
 track-0.009
Token wrong
Token ID2642
Feature activation+4.79e-01

Pos logit contributions
 stret+0.052
 tracks+0.050
 Amen+0.050
cycles+0.049
obiles+0.049
Neg logit contributions
 servant-0.064
ubb-0.063
oll-0.063
 queens-0.062
acy-0.057
Token,
Token ID11
Feature activation+2.96e-02

Pos logit contributions
ubb+0.027
 servant+0.026
 queens+0.026
oll+0.025
ill+0.024
Neg logit contributions
 radio-0.015
 tracks-0.015
 automobile-0.014
 package-0.014
 track-0.014
Token so
Token ID523
Feature activation+1.14e+00

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.002
 servant-0.002
 queens-0.001
oll-0.001
ill-0.001
Token scared
Token ID12008
Feature activation+2.41e-01

Pos logit contributions
 servant+0.066
oll+0.065
ubb+0.064
 queens+0.062
ill+0.060
Neg logit contributions
 tracks-0.033
 track-0.033
 automobile-0.032
 radio-0.032
 controller-0.031
Token,
Token ID11
Feature activation-9.32e-01

Pos logit contributions
ubb+0.013
 servant+0.013
 queens+0.013
oll+0.013
ill+0.012
Neg logit contributions
 tracks-0.007
 radio-0.007
 automobile-0.007
 track-0.007
 package-0.007
Token so
Token ID523
Feature activation+2.48e-01

Pos logit contributions
 package+0.037
 automobile+0.036
 stret+0.036
 radio+0.035
 tracks+0.035
Neg logit contributions
ubb-0.042
 queens-0.041
 servant-0.041
ill-0.038
aph-0.037
Token they
Token ID484
Feature activation-8.14e-01

Pos logit contributions
ubb+0.014
 servant+0.014
oll+0.014
 queens+0.014
ill+0.013
Neg logit contributions
 tracks-0.007
 radio-0.007
 automobile-0.007
 track-0.007
 package-0.007
Token held
Token ID2714
Feature activation-2.07e+00

Pos logit contributions
 automobile+0.032
 tracks+0.032
 package+0.032
 radio+0.031
 stret+0.031
Neg logit contributions
 servant-0.037
ubb-0.036
 queens-0.036
oll-0.034
ill-0.033
Token on
Token ID319
Feature activation-1.77e+00

Pos logit contributions
 stret+0.082
obiles+0.077
 automobile+0.075
 belongs+0.073
cycles+0.073
Neg logit contributions
ubb-0.092
oll-0.086
 servant-0.083
 Mon-0.081
 Colin-0.081
Token tight
Token ID5381
Feature activation-7.28e-01

Pos logit contributions
 belongs+0.081
 receiver+0.081
 package+0.080
 Amen+0.080
 automobile+0.079
Neg logit contributions
 servant-0.058
ubb-0.057
Col-0.052
aph-0.052
 queens-0.052
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 automobile+0.030
 stret+0.029
obiles+0.029
 package+0.029
 radio+0.029
Neg logit contributions
ubb-0.032
 servant-0.031
 queens-0.031
oll-0.030
ill-0.029
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
 stret+0.038
 package+0.038
 automobile+0.038
 radio+0.037
 receiver+0.035
Neg logit contributions
ubb-0.048
 servant-0.047
ill-0.047
 queens-0.046
Col-0.046
Token
Token ID198
Feature activation-8.82e-01

Pos logit contributions
 automobile+0.015
 tracks+0.015
 package+0.015
 radio+0.015
 stret+0.015
Neg logit contributions
ubb-0.018
 servant-0.018
 queens-0.018
ill-0.017
oll-0.017
Token said
Token ID531
Feature activation+8.97e-01

Pos logit contributions
 automobile+0.059
 package+0.058
 radio+0.058
obiles+0.057
 controller+0.057
Neg logit contributions
 servant-0.065
ubb-0.062
oll-0.060
 queens-0.060
 gard-0.057
Token,
Token ID11
Feature activation-2.36e-01

Pos logit contributions
ubb+0.055
 queens+0.053
 servant+0.053
ill+0.051
oll+0.051
Neg logit contributions
 tracks-0.022
 automobile-0.022
 radio-0.021
 stret-0.021
 package-0.020
Token "
Token ID366
Feature activation+3.02e-02

Pos logit contributions
 tracks+0.007
 automobile+0.007
 radio+0.007
 package+0.007
 stret+0.007
Neg logit contributions
ubb-0.013
 servant-0.013
 queens-0.012
oll-0.012
ill-0.012
TokenNo
Token ID2949
Feature activation-1.16e+00

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 radio-0.001
 automobile-0.001
 stret-0.001
 track-0.001
Token,
Token ID11
Feature activation-4.01e-01

Pos logit contributions
 receiver+0.052
 stret+0.052
obiles+0.051
 tracks+0.050
 radio+0.050
Neg logit contributions
 servant-0.045
ubb-0.042
 queens-0.041
oll-0.040
ill-0.040
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 tracks+0.017
 radio+0.016
 stret+0.016
 automobile+0.016
 track+0.016
Neg logit contributions
ubb-0.017
 servant-0.017
 queens-0.017
oll-0.017
ill-0.016
Token's
Token ID338
Feature activation-1.15e+00

Pos logit contributions
 stret+0.063
 Amen+0.061
obiles+0.060
 receiver+0.058
cycles+0.058
Neg logit contributions
 servant-0.098
 queens-0.095
ubb-0.094
oll-0.093
ier-0.092
Token mine
Token ID6164
Feature activation+3.82e-01

Pos logit contributions
 stret+0.052
 radio+0.051
 receiver+0.051
 tracks+0.050
 Amen+0.050
Neg logit contributions
 queens-0.044
 servant-0.044
oll-0.044
ubb-0.043
aph-0.039
Token.
Token ID13
Feature activation-6.38e-01

Pos logit contributions
ubb+0.022
 servant+0.021
 queens+0.021
oll+0.020
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.011
 radio-0.011
 track-0.011
 package-0.011
Token Give
Token ID13786
Feature activation-1.01e+00

Pos logit contributions
 stret+0.025
 tracks+0.024
 radio+0.024
 track+0.024
 package+0.024
Neg logit contributions
ubb-0.030
 servant-0.029
oll-0.029
 queens-0.029
ill-0.027
Token it
Token ID340
Feature activation-1.06e+00

Pos logit contributions
 stret+0.037
 tracks+0.036
 radio+0.036
 receiver+0.035
cycles+0.035
Neg logit contributions
 servant-0.048
 queens-0.047
ubb-0.045
oll-0.044
ill-0.043
Token"
Token ID1
Feature activation+1.28e+00

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.004
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.003
 track-0.003
TokenYes
Token ID5297
Feature activation-8.01e-02

Pos logit contributions
ubb+0.074
 servant+0.071
oll+0.070
ill+0.069
 queens+0.068
Neg logit contributions
 radio-0.037
 automobile-0.037
 controller-0.036
 track-0.035
 tracks-0.034
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
 tracks+0.003
 radio+0.003
 automobile+0.003
 stret+0.003
 track+0.003
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.004
ill-0.004
Token please
Token ID3387
Feature activation-1.13e+00

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.010
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.011
 radio-0.011
 automobile-0.011
 package-0.011
 track-0.011
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 stret+0.048
 tracks+0.046
 automobile+0.046
 radio+0.045
 receiver+0.044
Neg logit contributions
 servant-0.049
 queens-0.048
ill-0.047
ubb-0.046
oll-0.045
Token That
Token ID1320
Feature activation-2.30e+00

Pos logit contributions
 stret+0.045
 tracks+0.042
 package+0.042
 radio+0.042
 automobile+0.042
Neg logit contributions
 servant-0.051
oll-0.049
 queens-0.049
ubb-0.048
ill-0.045
Token sounds
Token ID5238
Feature activation-1.87e+00

Pos logit contributions
 stret+0.081
obiles+0.073
cycles+0.066
 Amen+0.061
 Ellen+0.061
Neg logit contributions
 servant-0.116
oll-0.100
 diver-0.099
 queens-0.098
 gard-0.097
Token fun
Token ID1257
Feature activation-1.68e+00

Pos logit contributions
 stret+0.098
 automobile+0.096
ci+0.095
 receiver+0.092
 tracks+0.092
Neg logit contributions
 queens-0.063
 servant-0.061
ubb-0.059
oll-0.059
ier-0.055
Token.
Token ID13
Feature activation-9.62e-01

Pos logit contributions
 stret+0.076
obiles+0.069
 automobile+0.068
 receiver+0.067
 radio+0.067
Neg logit contributions
 queens-0.073
 servant-0.070
oll-0.067
ill-0.065
ier-0.065
Token Thank
Token ID6952
Feature activation-7.17e-01

Pos logit contributions
 stret+0.036
 package+0.034
 automobile+0.034
 radio+0.034
 tracks+0.033
Neg logit contributions
 servant-0.047
ubb-0.046
 queens-0.046
oll-0.046
ill-0.043
Token you
Token ID345
Feature activation-4.52e-02

Pos logit contributions
 stret+0.035
 tracks+0.034
 radio+0.034
 automobile+0.034
 track+0.033
Neg logit contributions
 queens-0.028
ubb-0.028
 servant-0.027
oll-0.026
ill-0.024