TOP ACTIVATIONS
MAX = 0.121

she 'd never seen a l oll ipop before .
she never stopped dreaming of l oll ip ops . <|endoftext|>
ados . They were never jealous of each other again and
. Lily forgot about her l oll ipop and had so
And Freddy was never helpless again . He flew high
games . They were never lonely again . The
He 'd never had a l oll ipop before ! But
. That 's not a l oll ipop . That 's
every day and was never hungry again . <|endoftext|> Once there
just in case another great l oll ipop would rise in
managed to number all her l oll ip ops . She
a very special toy for L ott ie . It was
a little bit of the l oll ipop . That way
no , where did Mr . Beetle go ?" she said
The old man was never hungry again , and he forever
Jane had never eaten a tomato before , but when she
happy that she had her l oll ipop and it tasted
ground . At last the l oll ipop stopped rising and
the little rabbit was never sad again . He was always
if she could have the l oll ipop . The shop

MIN ACTIVATIONS
MIN = -0.106

" Hello , cloud !" Ellie said .
" Let 's help !" said Max , who was
Hi , I 'm Mouse ," said the mouse . "
. She got on a different train with her ticket .
. He put on his favourite party hat and hopped along
." He put on a different song and Lily started to
you to climb the wall !" Jimmy said .
the princess of the castle ," said the voice . "
the switches and pressed the big red button . The radio
. It was a big , shiny radio and Katie was
Let 's go , Emmy !" said her mom . "
wide and they gave a loud roar when they started the
" It 's an earthquake !" said her friend , the
. Suddenly they heard a loud noise . It was a
" Hi , bird !" Max said . " Do
" Let 's go inside ," Max said .
a swim , my darling ," her mom my said .
's see what 's inside !" said Bun 's friend ,
The baby had on a beautiful dress , with a bon
train when they heard a loud noise . "

INTERVAL 0.064 - 0.121
CONTAINS 0.083%

fridge because there were always y ummy treats . After he
thing anyway . M andy 's mother was angry
so happy and he ate as much as he could before
him . Tom never felt sad for long because of his
He would always remember the oy ster and the pearl it

INTERVAL 0.007 - 0.064
CONTAINS 34.878%

she brave ly opened her eyes and reached for the big
next time , be careful when you twist things . You
a magnet ?" Sam asked . Ben gave him
took her outside to see the rainbow in the sky .
Hello , Tom ! Can you help me ? I need

INTERVAL -0.050 - 0.007
CONTAINS 64.514%

They 're just playing . They 're not bad guys .
a small town with her parents . One day , Lisa
fun day at the stadium . <|endoftext|> S ara and Ben
tease Tim my and take his toys away .
tree in his backyard . He decided to raise pigs under

INTERVAL -0.106 - -0.050
CONTAINS 0.526%

day , she heard a loud noise coming from the chim
he was sad and held him close in her arms and
One , two , three !" they shouted , as they
animals . Lily had a soft bunny that she liked to
" Let 's go explore !" said Jake 's mom .
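
The four INTERVAL boundaries listed above are consistent with splitting the range from MIN (-0.106) to MAX (0.121) into four equal-width bins, and each CONTAINS figure reads as the share of token activations that fall in that bin. A minimal sketch under that assumption follows. The function names and the sample activations are hypothetical and are not taken from the dump, and the dashboard's actual binning code is not shown in the source.

```python
from typing import List, Sequence


def equal_width_edges(lo: float, hi: float, n_bins: int) -> List[float]:
    """Return n_bins + 1 edges splitting [lo, hi] into equal-width bins."""
    width = (hi - lo) / n_bins
    # Pin the last edge to hi exactly so rounding cannot exclude the maximum.
    return [lo + i * width for i in range(n_bins)] + [hi]


def bin_percentages(values: Sequence[float], edges: Sequence[float]) -> List[float]:
    """Percentage of values per bin; only the last bin includes its upper edge."""
    n_bins = len(edges) - 1
    counts = [0] * n_bins
    for v in values:
        for i in range(n_bins):
            is_last = i == n_bins - 1
            below_upper = v < edges[i + 1] or (is_last and v <= edges[i + 1])
            if edges[i] <= v and below_upper:
                counts[i] += 1
                break
    total = len(values) or 1  # avoid division by zero on empty input
    return [100.0 * c / total for c in counts]


# MIN and MAX as reported in the dump (both already rounded to 3 decimals).
edges = equal_width_edges(-0.106, 0.121, 4)

# Hypothetical activations, for illustration only.
sample = [-0.08, -0.02, 0.0, 0.003, 0.01, 0.05, 0.10, 0.121]
percentages = bin_percentages(sample, edges)

for i in range(len(percentages)):
    print(f"INTERVAL {edges[i]:.3f} - {edges[i + 1]:.3f}  CONTAINS {percentages[i]:.3f}%")
```

The computed edges agree with the listed boundaries only to within about 0.001, because MIN and MAX are themselves rounded in the dump. The sketch prints bins from lowest to highest, whereas the dump lists them from highest to lowest.
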
Token she
Token ID673
Feature activation+3.32e-02

Pos logit contributions
ubb+0.081
 servant+0.078
Col+0.074
aph+0.074
ascal+0.073
Neg logit contributions
 automobile-0.041
 tracks-0.041
 package-0.041
 radio-0.040
 track-0.039
Token'd
Token ID1549
Feature activation+2.57e-02

Pos logit contributions
ubb+0.091
aph+0.085
 queens+0.084
 servant+0.084
ill+0.083
Neg logit contributions
 package-0.028
 tracks-0.027
 belongs-0.027
 radio-0.026
cycles-0.025
Token never
Token ID1239
Feature activation+4.94e-02

Pos logit contributions
ubb+0.058
 queens+0.054
 servant+0.054
aph+0.053
oll+0.053
Neg logit contributions
 tracks-0.034
 package-0.032
 radio-0.032
 track-0.031
 stret-0.031
Token seen
Token ID1775
Feature activation+5.81e-02

Pos logit contributions
ubb+0.120
acy+0.111
umber+0.109
 servant+0.109
oll+0.109
Neg logit contributions
 tracks-0.051
 receiver-0.050
 belongs-0.050
's-0.047
 package-0.047
Token a
Token ID257
Feature activation+4.95e-02

Pos logit contributions
ubb+0.143
 servant+0.134
 queens+0.133
ill+0.132
umber+0.132
Neg logit contributions
 tracks-0.061
 package-0.059
 radio-0.055
 automobile-0.053
 track-0.052
Token l
Token ID300
Feature activation+1.21e-01

Pos logit contributions
ubb+0.116
ill+0.114
 queens+0.107
acy+0.106
aph+0.105
Neg logit contributions
 package-0.064
 radio-0.058
 tracks-0.056
 track-0.056
 automobile-0.055
Tokenoll
Token ID692
Feature activation+6.75e-02

Pos logit contributions
 queens+0.132
ubb+0.122
ory+0.110
ill+0.101
Fre+0.099
Neg logit contributions
nob-0.221
 package-0.219
shirt-0.214
c-0.214
chair-0.212
Tokenipop
Token ID42800
Feature activation+9.14e-02

Pos logit contributions
ubb+0.121
 queens+0.120
ill+0.116
aph+0.115
Col+0.112
Neg logit contributions
 package-0.114
 belongs-0.102
 stret-0.096
 automobile-0.095
cycles-0.095
Token before
Token ID878
Feature activation+1.67e-02

Pos logit contributions
ubb+0.210
unn+0.194
Col+0.190
ascal+0.189
add+0.188
Neg logit contributions
 package-0.108
 track-0.103
 away-0.102
's-0.099
 radio-0.096
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
ubb+0.041
 servant+0.041
 queens+0.040
oll+0.039
Col+0.038
Neg logit contributions
 tracks-0.018
 radio-0.018
 package-0.017
 track-0.017
 automobile-0.017
Token
Token ID198
Feature activation+7.02e-03

Pos logit contributions
ubb+0.034
 queens+0.033
 servant+0.032
ier+0.030
ill+0.030
Neg logit contributions
 tracks-0.021
 radio-0.020
 package-0.019
 automobile-0.019
 track-0.019
Token she
Token ID673
Feature activation+4.95e-02

Pos logit contributions
ubb+0.046
 servant+0.046
oll+0.045
 queens+0.044
ill+0.043
Neg logit contributions
 tracks-0.037
 track-0.036
 automobile-0.036
 radio-0.035
 package-0.034
Token never
Token ID1239
Feature activation+4.16e-02

Pos logit contributions
ubb+0.123
 queens+0.121
aph+0.116
 servant+0.116
oll+0.112
Neg logit contributions
 tracks-0.050
 track-0.048
 package-0.048
 automobile-0.047
 belongs-0.047
Token stopped
Token ID5025
Feature activation+4.80e-02

Pos logit contributions
ubb+0.093
oll+0.091
 servant+0.089
acy+0.087
 queens+0.087
Neg logit contributions
 tracks-0.048
 belongs-0.046
 receiver-0.046
 automobile-0.045
 package-0.045
Token dreaming
Token ID28368
Feature activation+3.61e-02

Pos logit contributions
oll+0.101
ubb+0.099
 queens+0.098
acy+0.096
 servant+0.096
Neg logit contributions
 tracks-0.065
 automobile-0.064
 track-0.063
 radio-0.062
 receiver-0.059
Token of
Token ID286
Feature activation+3.47e-02

Pos logit contributions
ubb+0.073
 servant+0.071
 queens+0.070
Col+0.067
oll+0.067
Neg logit contributions
 tracks-0.052
 stret-0.051
 radio-0.050
 automobile-0.050
 package-0.050
Token l
Token ID300
Feature activation+1.12e-01

Pos logit contributions
ubb+0.075
ill+0.074
 queens+0.074
 servant+0.073
oll+0.071
Neg logit contributions
 tracks-0.046
 track-0.044
 radio-0.044
 automobile-0.044
 package-0.043
Tokenoll
Token ID692
Feature activation+5.66e-02

Pos logit contributions
 queens+0.135
 diver+0.122
ill+0.119
aph+0.116
ubb+0.111
Neg logit contributions
g-0.177
c-0.177
 around-0.173
 belongs-0.171
shirt-0.170
Tokenip
Token ID541
Feature activation+5.06e-02

Pos logit contributions
ill+0.093
ubb+0.093
 queens+0.091
aph+0.089
Col+0.085
Neg logit contributions
 package-0.099
 belongs-0.097
cycles-0.092
 stret-0.088
ci-0.088
Tokenops
Token ID2840
Feature activation+7.58e-02

Pos logit contributions
ubb+0.093
 queens+0.093
ill+0.092
 servant+0.088
aph+0.087
Neg logit contributions
 package-0.078
 stret-0.077
 belongs-0.077
cycles-0.076
 tracks-0.075
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
ubb+0.157
 servant+0.145
Col+0.142
aph+0.138
ascal+0.137
Neg logit contributions
 track-0.097
 package-0.097
 automobile-0.094
 belongs-0.093
's-0.093
Token<|endoftext|>
Token ID50256
Feature activation+1.19e-02

Pos logit contributions
ubb+0.038
 servant+0.038
 queens+0.037
oll+0.036
ill+0.035
Neg logit contributions
 tracks-0.021
 automobile-0.021
 radio-0.020
 track-0.020
 package-0.020
Tokenados
Token ID22484
Feature activation+2.35e-02

Pos logit contributions
ubb+0.081
 servant+0.079
acy+0.078
ill+0.078
 diver+0.078
Neg logit contributions
 automobile-0.057
 package-0.052
cycles-0.051
obiles-0.051
 receiver-0.051
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
ubb+0.058
 servant+0.055
 queens+0.054
oll+0.053
acy+0.053
Neg logit contributions
 tracks-0.026
 automobile-0.025
 radio-0.025
 track-0.024
 package-0.024
Token They
Token ID1119
Feature activation+3.02e-02

Pos logit contributions
 stret+0.017
 package+0.016
 radio+0.016
 automobile+0.016
 tracks+0.016
Neg logit contributions
ubb-0.028
 servant-0.028
 queens-0.027
oll-0.027
ill-0.027
Token were
Token ID547
Feature activation+9.27e-03

Pos logit contributions
ubb+0.070
 queens+0.066
 servant+0.065
oll+0.064
ill+0.062
Neg logit contributions
 automobile-0.037
 tracks-0.036
 radio-0.035
 package-0.034
 track-0.034
Token never
Token ID1239
Feature activation+4.78e-02

Pos logit contributions
ubb+0.020
 servant+0.019
 queens+0.019
oll+0.019
ill+0.018
Neg logit contributions
 tracks-0.013
 automobile-0.013
 track-0.012
 radio-0.012
 package-0.012
Token jealous
Token ID19354
Feature activation+1.12e-01

Pos logit contributions
ubb+0.095
umber+0.091
acy+0.091
oll+0.091
 servant+0.089
Neg logit contributions
 automobile-0.063
 tracks-0.062
 track-0.061
 receiver-0.060
 stret-0.058
Token of
Token ID286
Feature activation+1.06e-02

Pos logit contributions
agg+0.269
acy+0.268
Fre+0.265
ona+0.264
ubb+0.262
Neg logit contributions
 around-0.089
's-0.088
 language-0.082
 away-0.079
 over-0.077
Token each
Token ID1123
Feature activation+3.97e-02

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.013
 automobile-0.013
 radio-0.013
 track-0.013
 package-0.013
Token other
Token ID584
Feature activation+3.58e-02

Pos logit contributions
ubb+0.092
oll+0.086
 queens+0.084
acy+0.084
ectar+0.082
Neg logit contributions
 automobile-0.050
 package-0.047
 receiver-0.045
 track-0.044
 radio-0.043
Token again
Token ID757
Feature activation+1.02e-02

Pos logit contributions
ubb+0.071
 servant+0.064
 queens+0.064
oll+0.063
acy+0.063
Neg logit contributions
 radio-0.055
 automobile-0.054
 tracks-0.054
 track-0.053
 package-0.052
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
ubb+0.024
 servant+0.024
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.012
 radio-0.011
 automobile-0.011
 track-0.011
 package-0.011
Token.
Token ID13
Feature activation-2.60e-02

Pos logit contributions
 radio+0.032
 receiver+0.032
cycles+0.031
 Amen+0.031
 package+0.031
Neg logit contributions
 queens-0.038
 servant-0.038
ubb-0.038
ill-0.038
oll-0.037
Token Lily
Token ID20037
Feature activation+1.93e-03

Pos logit contributions
 radio+0.043
 package+0.042
 stret+0.042
cycles+0.041
 controller+0.041
Neg logit contributions
oll-0.044
 servant-0.044
ubb-0.044
 queens-0.044
ill-0.044
Token forgot
Token ID16453
Feature activation-9.30e-03

Pos logit contributions
ubb+0.005
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 track-0.002
Token about
Token ID546
Feature activation+1.06e-02

Pos logit contributions
 tracks+0.015
 automobile+0.015
 stret+0.015
 package+0.015
 radio+0.015
Neg logit contributions
 servant-0.017
ubb-0.017
 queens-0.017
oll-0.016
ill-0.016
Token her
Token ID607
Feature activation+2.54e-02

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 track-0.012
 package-0.011
Token l
Token ID300
Feature activation+1.06e-01

Pos logit contributions
ubb+0.054
 queens+0.052
ill+0.052
acy+0.050
 servant+0.050
Neg logit contributions
 package-0.037
 tracks-0.036
 radio-0.036
 automobile-0.036
 track-0.034
Tokenoll
Token ID692
Feature activation+4.85e-02

Pos logit contributions
 queens+0.148
ubb+0.127
aph+0.124
ill+0.123
 Mon+0.123
Neg logit contributions
 belongs-0.172
c-0.164
shirt-0.162
 package-0.161
ag-0.152
Tokenipop
Token ID42800
Feature activation+3.31e-02

Pos logit contributions
ubb+0.080
 queens+0.078
ill+0.077
aph+0.076
Col+0.071
Neg logit contributions
 package-0.090
 belongs-0.087
cycles-0.082
 automobile-0.081
 radio-0.079
Token and
Token ID290
Feature activation-3.37e-03

Pos logit contributions
ubb+0.083
 queens+0.078
 servant+0.077
Col+0.075
ill+0.073
Neg logit contributions
 track-0.037
 tracks-0.037
 package-0.036
 radio-0.036
 automobile-0.036
Token had
Token ID550
Feature activation+6.85e-03

Pos logit contributions
 tracks+0.005
 stret+0.005
 automobile+0.005
 radio+0.005
 package+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.006
oll-0.006
ill-0.006
Token so
Token ID523
Feature activation+3.05e-02

Pos logit contributions
ubb+0.015
 servant+0.015
 queens+0.015
oll+0.014
ill+0.014
Neg logit contributions
 tracks-0.009
 radio-0.009
 automobile-0.008
 track-0.008
 package-0.008
Token
Token ID198
Feature activation+2.20e-02

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.015
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 stret-0.005
 track-0.005
TokenAnd
Token ID1870
Feature activation+2.07e-02

Pos logit contributions
 servant+0.049
ubb+0.048
 queens+0.046
oll+0.045
ill+0.044
Neg logit contributions
 tracks-0.027
 track-0.027
cycles-0.027
 automobile-0.026
obiles-0.026
Token Freddy
Token ID45437
Feature activation+2.22e-02

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.045
oll+0.045
ill+0.044
Neg logit contributions
 tracks-0.025
 track-0.023
 package-0.023
 automobile-0.023
 radio-0.022
Token was
Token ID373
Feature activation-5.55e-04

Pos logit contributions
ubb+0.053
 servant+0.052
 queens+0.052
oll+0.050
ill+0.049
Neg logit contributions
 tracks-0.025
 track-0.023
 radio-0.023
 package-0.023
 automobile-0.023
Token never
Token ID1239
Feature activation+2.71e-02

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token helpless
Token ID21144
Feature activation+1.05e-01

Pos logit contributions
ubb+0.055
 servant+0.054
oll+0.054
acy+0.052
umber+0.051
Neg logit contributions
 tracks-0.038
 track-0.037
 automobile-0.036
 stret-0.036
 radio-0.036
Token again
Token ID757
Feature activation+3.76e-02

Pos logit contributions
umber+0.276
Col+0.266
uka+0.260
olo+0.260
Dan+0.258
Neg logit contributions
 away-0.075
 around-0.073
's-0.068
 noise-0.061
 language-0.058
Token.
Token ID13
Feature activation+4.56e-02

Pos logit contributions
ubb+0.100
 servant+0.099
oll+0.097
 queens+0.096
umbo+0.096
Neg logit contributions
 tracks-0.030
 radio-0.028
 track-0.028
 automobile-0.025
 package-0.025
Token He
Token ID679
Feature activation+9.03e-03

Pos logit contributions
ubb+0.119
 servant+0.115
 queens+0.114
ier+0.110
acy+0.108
Neg logit contributions
 -0.037
 tracks-0.037
cycles-0.036
 automobile-0.034
 belongs-0.033
Token flew
Token ID13112
Feature activation+1.73e-02

Pos logit contributions
ubb+0.019
 servant+0.019
 queens+0.019
oll+0.018
ill+0.018
Neg logit contributions
 tracks-0.012
 radio-0.012
 automobile-0.012
 track-0.012
 package-0.012
Token high
Token ID1029
Feature activation+4.19e-02

Pos logit contributions
ubb+0.044
 servant+0.043
oll+0.043
 queens+0.043
acy+0.041
Neg logit contributions
 tracks-0.017
 track-0.016
 radio-0.016
 automobile-0.016
 stret-0.015
Token games
Token ID1830
Feature activation+1.69e-02

Pos logit contributions
 package+0.049
 stret+0.049
 controller+0.048
 Amen+0.048
 radio+0.048
Neg logit contributions
 queens-0.059
 servant-0.055
ubb-0.055
oll-0.052
umbo-0.052
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
ubb+0.043
 servant+0.041
 queens+0.040
oll+0.040
acy+0.038
Neg logit contributions
 tracks-0.018
 track-0.017
 automobile-0.017
 radio-0.017
 stret-0.016
Token They
Token ID1119
Feature activation+1.04e-02

Pos logit contributions
ubb+0.042
 servant+0.041
 queens+0.041
oll+0.039
ier+0.039
Neg logit contributions
 tracks-0.017
 automobile-0.016
 radio-0.015
 track-0.015
 package-0.014
Token were
Token ID547
Feature activation+9.80e-03

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.013
 automobile-0.012
 track-0.012
 radio-0.012
 package-0.012
Token never
Token ID1239
Feature activation+4.03e-02

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.019
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.014
 track-0.014
 automobile-0.014
 radio-0.014
 package-0.013
Token lonely
Token ID21757
Feature activation+1.05e-01

Pos logit contributions
ubb+0.081
oll+0.080
 servant+0.078
umber+0.076
acy+0.075
Neg logit contributions
 tracks-0.056
 track-0.055
 automobile-0.054
 receiver-0.053
 stret-0.052
Token again
Token ID757
Feature activation+2.65e-02

Pos logit contributions
ubb+0.254
umber+0.252
acy+0.247
add+0.244
Col+0.242
Neg logit contributions
 track-0.084
 language-0.083
 away-0.083
 around-0.082
's-0.081
Token.
Token ID13
Feature activation+1.28e-03

Pos logit contributions
ubb+0.067
 servant+0.065
oll+0.063
 queens+0.062
umbo+0.062
Neg logit contributions
 tracks-0.027
 track-0.027
 radio-0.026
 automobile-0.025
 package-0.024
Token
Token ID198
Feature activation-2.47e-02

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token
Token ID198
Feature activation+1.88e-03

Pos logit contributions
 automobile+0.029
 package+0.028
 stret+0.028
 receiver+0.028
 radio+0.028
Neg logit contributions
oll-0.054
ubb-0.054
 servant-0.054
 queens-0.053
ill-0.052
TokenThe
Token ID464
Feature activation-4.96e-04

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.003
 track-0.003
Token He
Token ID679
Feature activation+1.44e-02

Pos logit contributions
ubb+0.028
 servant+0.027
 queens+0.027
oll+0.027
ill+0.025
Neg logit contributions
 tracks-0.018
 automobile-0.018
 track-0.018
 radio-0.017
 package-0.017
Token'd
Token ID1549
Feature activation+1.26e-02

Pos logit contributions
ubb+0.033
 queens+0.032
 servant+0.032
oll+0.031
ill+0.030
Neg logit contributions
 tracks-0.018
 radio-0.017
 automobile-0.017
 track-0.017
 stret-0.017
Token never
Token ID1239
Feature activation+2.31e-02

Pos logit contributions
ubb+0.026
 queens+0.025
 servant+0.025
oll+0.025
ill+0.024
Neg logit contributions
 tracks-0.018
 radio-0.018
 automobile-0.018
 stret-0.018
 track-0.018
Token had
Token ID550
Feature activation+3.74e-02

Pos logit contributions
ubb+0.046
 servant+0.043
oll+0.043
 queens+0.043
acy+0.041
Neg logit contributions
 tracks-0.035
 automobile-0.034
 receiver-0.033
 package-0.033
 track-0.033
Token a
Token ID257
Feature activation+3.40e-02

Pos logit contributions
ubb+0.088
 servant+0.082
ill+0.082
aph+0.080
 queens+0.080
Neg logit contributions
 tracks-0.044
 package-0.042
 automobile-0.041
 radio-0.040
 track-0.039
Token l
Token ID300
Feature activation+1.05e-01

Pos logit contributions
ubb+0.073
 queens+0.070
ill+0.068
acy+0.065
 servant+0.065
Neg logit contributions
 package-0.050
 stret-0.047
 tracks-0.046
 radio-0.046
 automobile-0.046
Tokenoll
Token ID692
Feature activation+5.53e-02

Pos logit contributions
 queens+0.133
ubb+0.109
aph+0.105
 kingdoms+0.100
Fre+0.100
Neg logit contributions
shirt-0.193
 package-0.182
 away-0.174
c-0.173
g-0.172
Tokenipop
Token ID42800
Feature activation+7.59e-02

Pos logit contributions
ubb+0.096
 queens+0.094
aph+0.090
ill+0.089
Col+0.086
Neg logit contributions
 package-0.096
 belongs-0.087
cycles-0.086
 stret-0.086
 automobile-0.084
Token before
Token ID878
Feature activation+9.92e-03

Pos logit contributions
ubb+0.178
 queens+0.158
acy+0.158
Col+0.156
aph+0.156
Neg logit contributions
 package-0.091
 away-0.087
 track-0.086
 automobile-0.084
 stret-0.082
Token!
Token ID0
Feature activation+7.43e-04

Pos logit contributions
ubb+0.023
 servant+0.023
 queens+0.022
oll+0.022
ill+0.021
Neg logit contributions
 tracks-0.012
 radio-0.012
 track-0.012
 automobile-0.012
 package-0.011
Token But
Token ID887
Feature activation-3.50e-03

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token.
Token ID13
Feature activation+6.77e-03

Pos logit contributions
ubb+0.046
 queens+0.045
 servant+0.044
ill+0.042
oll+0.042
Neg logit contributions
 package-0.023
 tracks-0.023
 automobile-0.023
 radio-0.022
 track-0.022
Token That
Token ID1320
Feature activation+8.23e-03

Pos logit contributions
ubb+0.017
 servant+0.017
 queens+0.017
oll+0.016
ill+0.016
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 package-0.006
 track-0.006
Token's
Token ID338
Feature activation-1.53e-02

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.023
ill+0.022
Neg logit contributions
 tracks-0.005
 automobile-0.005
 package-0.005
 radio-0.005
 track-0.005
Token not
Token ID407
Feature activation-2.99e-03

Pos logit contributions
 stret+0.025
 tracks+0.025
 radio+0.025
 automobile+0.024
 controller+0.024
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.028
oll-0.028
ill-0.025
Token a
Token ID257
Feature activation+4.12e-02

Pos logit contributions
 tracks+0.004
 radio+0.004
 automobile+0.004
 stret+0.004
 track+0.003
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token l
Token ID300
Feature activation+1.04e-01

Pos logit contributions
ubb+0.090
ill+0.089
 queens+0.085
aph+0.084
acy+0.083
Neg logit contributions
 package-0.058
 radio-0.055
 stret-0.053
 tracks-0.053
 automobile-0.053
Tokenoll
Token ID692
Feature activation+5.87e-02

Pos logit contributions
 queens+0.124
ubb+0.116
ill+0.115
Col+0.105
aph+0.105
Neg logit contributions
 belongs-0.200
 package-0.193
shirt-0.176
 is-0.174
c-0.171
Tokenipop
Token ID42800
Feature activation+4.06e-02

Pos logit contributions
ubb+0.101
 queens+0.100
ill+0.097
aph+0.096
Col+0.091
Neg logit contributions
 package-0.107
 belongs-0.103
cycles-0.094
 automobile-0.093
 stret-0.093
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
ubb+0.109
 queens+0.102
Col+0.101
 servant+0.100
aph+0.100
Neg logit contributions
 package-0.036
 belongs-0.035
 automobile-0.034
 tracks-0.034
 radio-0.034
Token That
Token ID1320
Feature activation+8.65e-03

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.009
 automobile-0.008
 radio-0.008
 package-0.008
 stret-0.008
Token's
Token ID338
Feature activation-1.66e-02

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.005
 automobile-0.005
 package-0.005
 radio-0.005
 track-0.005
Token every
Token ID790
Feature activation+3.66e-04

Pos logit contributions
ubb+0.107
 servant+0.100
 queens+0.100
aph+0.097
umbo+0.096
Neg logit contributions
 tracks-0.049
 track-0.048
 stret-0.047
 radio-0.047
 package-0.047
Token day
Token ID1110
Feature activation+3.52e-02

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.000
 automobile-0.000
 radio-0.000
 stret-0.000
 track-0.000
Token and
Token ID290
Feature activation+2.75e-02

Pos logit contributions
ubb+0.084
 servant+0.083
 queens+0.081
aph+0.078
oll+0.077
Neg logit contributions
 tracks-0.041
 track-0.039
 radio-0.037
 package-0.037
 automobile-0.036
Token was
Token ID373
Feature activation+1.43e-02

Pos logit contributions
ubb+0.061
 servant+0.060
 queens+0.059
oll+0.058
aph+0.058
Neg logit contributions
 tracks-0.035
 track-0.032
 radio-0.032
 automobile-0.032
 package-0.031
Token never
Token ID1239
Feature activation+3.85e-02

Pos logit contributions
ubb+0.028
 servant+0.028
 queens+0.027
oll+0.026
ill+0.026
Neg logit contributions
 tracks-0.022
 automobile-0.021
 track-0.021
 radio-0.021
 stret-0.020
Token hungry
Token ID14720
Feature activation+1.04e-01

Pos logit contributions
ubb+0.075
oll+0.074
 servant+0.073
umber+0.072
acy+0.071
Neg logit contributions
 tracks-0.054
 track-0.052
 receiver-0.051
 automobile-0.051
 stret-0.051
Token again
Token ID757
Feature activation+3.35e-02

Pos logit contributions
acy+0.249
umber+0.246
ubb+0.245
Col+0.243
ascal+0.238
Neg logit contributions
 away-0.092
's-0.091
 package-0.086
,"-0.081
 automobile-0.081
Token.
Token ID13
Feature activation+2.42e-02

Pos logit contributions
ubb+0.089
 servant+0.088
oll+0.086
 queens+0.085
umbo+0.084
Neg logit contributions
 tracks-0.027
 track-0.025
 package-0.024
 radio-0.024
 automobile-0.023
Token<|endoftext|>
Token ID50256
Feature activation+1.13e-02

Pos logit contributions
ubb+0.058
 servant+0.057
 queens+0.057
oll+0.054
ier+0.054
Neg logit contributions
 tracks-0.027
 automobile-0.026
cycles-0.025
 track-0.025
 radio-0.025
TokenOnce
Token ID7454
Feature activation+1.73e-02

Pos logit contributions
ubb+0.024
 servant+0.024
 queens+0.023
oll+0.022
ill+0.021
Neg logit contributions
 tracks-0.016
 radio-0.015
 package-0.015
 automobile-0.015
 track-0.015
Token there
Token ID612
Feature activation-1.20e-02

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.021
ill+0.019
Neg logit contributions
 tracks-0.037
 automobile-0.036
 radio-0.036
 track-0.036
 package-0.036
Token just
Token ID655
Feature activation+2.17e-02

Pos logit contributions
 servant+0.081
ubb+0.081
 queens+0.077
oll+0.075
umbo+0.074
Neg logit contributions
 tracks-0.047
 track-0.045
 radio-0.044
 automobile-0.043
 package-0.043
Token in
Token ID287
Feature activation+1.45e-02

Pos logit contributions
ubb+0.043
 servant+0.043
 queens+0.042
oll+0.040
umbo+0.039
Neg logit contributions
 tracks-0.033
 radio-0.031
 track-0.031
 package-0.031
 automobile-0.031
Token case
Token ID1339
Feature activation+3.91e-02

Pos logit contributions
 servant+0.035
ubb+0.035
 queens+0.034
oll+0.034
umbo+0.032
Neg logit contributions
 tracks-0.016
 track-0.015
 radio-0.015
 package-0.015
 automobile-0.015
Token another
Token ID1194
Feature activation+2.08e-02

Pos logit contributions
ubb+0.076
 servant+0.076
 queens+0.073
oll+0.070
Col+0.069
Neg logit contributions
 tracks-0.063
 track-0.060
 radio-0.059
 automobile-0.058
 package-0.057
Token great
Token ID1049
Feature activation+4.26e-02

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.046
oll+0.044
ill+0.044
Neg logit contributions
 tracks-0.026
 package-0.026
 radio-0.025
 track-0.025
 automobile-0.025
Token l
Token ID300
Feature activation+1.04e-01

Pos logit contributions
ubb+0.108
 queens+0.104
oll+0.103
ier+0.103
ill+0.103
Neg logit contributions
 package-0.043
 tracks-0.041
 track-0.041
 radio-0.040
 automobile-0.040
Tokenoll
Token ID692
Feature activation+6.20e-02

Pos logit contributions
 queens+0.122
ubb+0.115
 diver+0.094
ory+0.092
 servant+0.091
Neg logit contributions
c-0.201
 belongs-0.198
 package-0.192
nob-0.190
shirt-0.189
Tokenipop
Token ID42800
Feature activation+4.35e-02

Pos logit contributions
 queens+0.111
ubb+0.110
aph+0.106
ill+0.104
Col+0.099
Neg logit contributions
 package-0.102
 belongs-0.098
cycles-0.090
 replied-0.086
 stret-0.086
Token would
Token ID561
Feature activation+4.17e-02

Pos logit contributions
ubb+0.107
 queens+0.101
 servant+0.101
Col+0.096
aph+0.094
Neg logit contributions
 tracks-0.047
 package-0.047
 track-0.047
 radio-0.045
 automobile-0.045
Token rise
Token ID4485
Feature activation+2.11e-02

Pos logit contributions
ubb+0.096
 servant+0.095
oll+0.094
 queens+0.094
acy+0.089
Neg logit contributions
 tracks-0.047
 track-0.046
 automobile-0.045
 radio-0.045
 package-0.044
Token in
Token ID287
Feature activation+5.56e-03

Pos logit contributions
ubb+0.048
 servant+0.048
 queens+0.046
oll+0.046
umbo+0.043
Neg logit contributions
 tracks-0.026
 track-0.026
 radio-0.025
 automobile-0.025
 package-0.024
Token managed
Token ID5257
Feature activation+3.63e-02

Pos logit contributions
ubb+0.115
oll+0.104
ancer+0.103
ascal+0.103
ectar+0.103
Neg logit contributions
 tracks-0.058
 track-0.058
 package-0.054
 receiver-0.052
's-0.052
Token to
Token ID284
Feature activation+3.42e-02

Pos logit contributions
ubb+0.080
 servant+0.074
oll+0.074
 queens+0.073
acy+0.070
Neg logit contributions
 tracks-0.049
 track-0.048
 package-0.047
 radio-0.046
 automobile-0.046
Token number
Token ID1271
Feature activation+4.54e-02

Pos logit contributions
ubb+0.078
oll+0.075
 queens+0.075
 servant+0.074
ectar+0.071
Neg logit contributions
 track-0.044
 tracks-0.043
 automobile-0.041
 stret-0.040
 package-0.040
Token all
Token ID477
Feature activation-1.23e-02

Pos logit contributions
ubb+0.105
 servant+0.105
 queens+0.103
oll+0.099
 diver+0.098
Neg logit contributions
 tracks-0.051
 track-0.048
 radio-0.046
 package-0.046
 automobile-0.046
Token her
Token ID607
Feature activation+5.26e-02

Pos logit contributions
 stret+0.018
 radio+0.017
 tracks+0.017
 automobile+0.017
 package+0.017
Neg logit contributions
ubb-0.026
 servant-0.025
 queens-0.025
ill-0.024
oll-0.024
Token l
Token ID300
Feature activation+1.04e-01

Pos logit contributions
ubb+0.113
ill+0.111
 queens+0.110
 diver+0.107
aph+0.106
Neg logit contributions
 tracks-0.068
 package-0.066
 belongs-0.063
 track-0.062
 radio-0.059
Tokenoll
Token ID692
Feature activation+6.39e-02

Pos logit contributions
 queens+0.132
ubb+0.113
 diver+0.107
 servant+0.106
aph+0.106
Neg logit contributions
shirt-0.182
c-0.175
g-0.174
 belongs-0.170
ches-0.160
Tokenip
Token ID541
Feature activation+5.47e-02

Pos logit contributions
 queens+0.108
ubb+0.106
aph+0.105
ill+0.105
 gard+0.099
Neg logit contributions
 belongs-0.107
 package-0.106
cycles-0.097
 coats-0.097
ci-0.095
Tokenops
Token ID2840
Feature activation+4.61e-02

Pos logit contributions
ubb+0.103
 queens+0.103
 servant+0.097
ill+0.095
aph+0.094
Neg logit contributions
 tracks-0.083
 belongs-0.082
 package-0.082
 track-0.081
 stret-0.079
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
ubb+0.119
 servant+0.116
 queens+0.114
Col+0.109
aph+0.107
Neg logit contributions
 track-0.042
 tracks-0.041
 package-0.039
 automobile-0.039
 radio-0.038
Token She
Token ID1375
Feature activation+2.83e-02

Pos logit contributions
ubb+0.091
 queens+0.090
ier+0.087
 servant+0.087
 diver+0.082
Neg logit contributions
 tracks-0.041
 -0.039
 radio-0.037
 receiver-0.036
 package-0.036
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
ubb+0.048
 servant+0.046
 queens+0.046
ill+0.044
oll+0.043
Neg logit contributions
 tracks-0.024
 package-0.023
 automobile-0.023
 radio-0.022
 stret-0.022
Token very
Token ID845
Feature activation+1.50e-02

Pos logit contributions
 tracks+0.020
cycles+0.020
 automobile+0.019
 track+0.019
 stret+0.019
Neg logit contributions
 servant-0.026
ubb-0.025
 queens-0.025
oll-0.024
ill-0.023
Token special
Token ID2041
Feature activation-3.73e-02

Pos logit contributions
ubb+0.035
 queens+0.034
 servant+0.034
ill+0.034
oll+0.033
Neg logit contributions
 tracks-0.017
 automobile-0.017
 package-0.017
 radio-0.017
 track-0.016
Token toy
Token ID13373
Feature activation+2.29e-03

Pos logit contributions
cycles+0.056
obiles+0.056
 tracks+0.055
 stret+0.053
 track+0.052
Neg logit contributions
 servant-0.073
ubb-0.069
 queens-0.067
oll-0.066
 diver-0.065
Token for
Token ID329
Feature activation+4.86e-02

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.003
 package-0.003
Token L
Token ID406
Feature activation+1.04e-01

Pos logit contributions
ubb+0.117
 servant+0.115
umbo+0.114
ill+0.113
oll+0.112
Neg logit contributions
 tracks-0.053
 track-0.048
 radio-0.046
 automobile-0.046
 their-0.044
Tokenott
Token ID1252
Feature activation-1.84e-03

Pos logit contributions
 goalie+0.160
 diver+0.158
 queens+0.158
 servant+0.156
ubb+0.152
Neg logit contributions
 belongs-0.176
c-0.169
 package-0.164
 receiver-0.160
 automobile-0.160
Tokenie
Token ID494
Feature activation-1.49e-02

Pos logit contributions
 tracks+0.003
 radio+0.003
 automobile+0.003
 stret+0.003
 track+0.003
Neg logit contributions
ubb-0.004
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token.
Token ID13
Feature activation+5.89e-03

Pos logit contributions
 tracks+0.021
 radio+0.021
 stret+0.021
 automobile+0.021
cycles+0.020
Neg logit contributions
 servant-0.030
ubb-0.030
 queens-0.029
oll-0.029
ill-0.028
Token It
Token ID632
Feature activation-2.25e-02

Pos logit contributions
ubb+0.013
 servant+0.012
 queens+0.012
oll+0.012
ill+0.012
Neg logit contributions
 tracks-0.008
 automobile-0.008
 radio-0.008
 track-0.008
 package-0.008
Token was
Token ID373
Feature activation-1.09e-03

Pos logit contributions
 stret+0.031
 radio+0.029
 automobile+0.029
 tracks+0.029
 track+0.028
Neg logit contributions
 servant-0.047
ubb-0.046
 queens-0.046
oll-0.045
ill-0.045
Token a
Token ID257
Feature activation+3.39e-02

Pos logit contributions
ubb+0.027
 servant+0.026
 queens+0.026
oll+0.025
ill+0.025
Neg logit contributions
 tracks-0.020
 package-0.019
 automobile-0.019
 radio-0.019
 track-0.019
Token little
Token ID1310
Feature activation+2.67e-02

Pos logit contributions
ubb+0.074
 queens+0.072
ill+0.070
ier+0.068
oll+0.067
Neg logit contributions
 package-0.048
 stret-0.045
 radio-0.044
 automobile-0.044
 tracks-0.043
Token bit
Token ID1643
Feature activation+4.57e-02

Pos logit contributions
ubb+0.066
 queens+0.064
 servant+0.063
ill+0.063
acy+0.062
Neg logit contributions
 package-0.028
 tracks-0.027
 automobile-0.026
 radio-0.025
 stret-0.025
Token of
Token ID286
Feature activation+2.25e-02

Pos logit contributions
 servant+0.098
ubb+0.097
Col+0.096
aph+0.095
 queens+0.095
Neg logit contributions
 package-0.054
 tracks-0.053
 automobile-0.052
 away-0.050
 receiver-0.049
Token the
Token ID262
Feature activation+3.74e-02

Pos logit contributions
ubb+0.048
ill+0.047
 queens+0.047
 servant+0.047
aph+0.045
Neg logit contributions
 tracks-0.030
 package-0.029
 automobile-0.029
 track-0.028
 radio-0.028
Token l
Token ID300
Feature activation+1.04e-01

Pos logit contributions
ubb+0.079
ill+0.078
 queens+0.077
aph+0.074
acy+0.074
Neg logit contributions
 package-0.054
 tracks-0.050
 automobile-0.049
 radio-0.049
 stret-0.048
Tokenoll
Token ID692
Feature activation+5.80e-02

Pos logit contributions
 queens+0.125
ubb+0.114
ill+0.107
Col+0.106
aph+0.104
Neg logit contributions
 belongs-0.193
c-0.192
shirt-0.187
 package-0.179
g-0.172
Tokenipop
Token ID42800
Feature activation+3.59e-02

Pos logit contributions
ubb+0.100
 queens+0.098
ill+0.098
aph+0.095
Col+0.095
Neg logit contributions
 package-0.099
 belongs-0.097
cycles-0.089
 automobile-0.088
 stret-0.086
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
ubb+0.090
 queens+0.084
 servant+0.083
Col+0.082
acy+0.080
Neg logit contributions
 package-0.037
 automobile-0.037
 track-0.036
 tracks-0.036
 radio-0.035
Token That
Token ID1320
Feature activation-2.43e-03

Pos logit contributions
ubb+0.043
 servant+0.042
 queens+0.041
oll+0.040
ill+0.039
Neg logit contributions
 tracks-0.021
 automobile-0.020
 radio-0.020
 package-0.020
 track-0.019
Token way
Token ID835
Feature activation-8.62e-03

Pos logit contributions
 tracks+0.002
 stret+0.002
 radio+0.002
 automobile+0.002
 track+0.002
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token no
Token ID645
Feature activation+2.92e-02

Pos logit contributions
 tracks+0.006
 radio+0.006
 stret+0.006
 automobile+0.006
 track+0.006
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token,
Token ID11
Feature activation+3.15e-02

Pos logit contributions
ubb+0.078
 queens+0.074
 servant+0.074
acy+0.073
oll+0.073
Neg logit contributions
 tracks-0.024
 radio-0.023
 automobile-0.023
 package-0.023
 track-0.023
Token where
Token ID810
Feature activation+9.26e-03

Pos logit contributions
ubb+0.068
ill+0.066
 queens+0.064
 servant+0.064
Col+0.063
Neg logit contributions
 package-0.042
 radio-0.041
 automobile-0.039
 tracks-0.038
 receiver-0.037
Token did
Token ID750
Feature activation-2.49e-02

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.019
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.012
 radio-0.012
 package-0.012
 track-0.012
Token Mr
Token ID1770
Feature activation+6.53e-02

Pos logit contributions
 stret+0.046
 tracks+0.045
 controller+0.045
 automobile+0.044
 track+0.044
Neg logit contributions
 queens-0.041
 servant-0.041
ubb-0.040
oll-0.037
 beetles-0.034
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
ubb+0.167
acy+0.139
years+0.137
Col+0.137
 queens+0.137
Neg logit contributions
 belongs-0.070
 package-0.064
's-0.058
 language-0.056
 radio-0.055
Token Beetle
Token ID47464
Feature activation+1.01e-02

Pos logit contributions
ier+0.177
add+0.162
ubb+0.158
oll+0.157
unn+0.152
Neg logit contributions
 -0.146
's-0.142
c-0.142
 That-0.139
 receiver-0.136
Token go
Token ID467
Feature activation+8.91e-03

Pos logit contributions
ubb+0.023
 servant+0.023
 queens+0.022
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.012
 automobile-0.012
 radio-0.012
 package-0.012
 stret-0.011
Token?"
Token ID1701
Feature activation-8.33e-03

Pos logit contributions
ubb+0.020
 servant+0.019
oll+0.019
 queens+0.019
ill+0.018
Neg logit contributions
 tracks-0.012
 automobile-0.012
 track-0.012
 radio-0.011
 package-0.011
Token she
Token ID673
Feature activation-1.46e-02

Pos logit contributions
 tracks+0.016
 automobile+0.015
 track+0.015
 radio+0.015
 stret+0.015
Neg logit contributions
 servant-0.013
ubb-0.013
 queens-0.013
oll-0.012
ill-0.012
Token said
Token ID531
Feature activation-9.94e-03

Pos logit contributions
 automobile+0.022
 stret+0.021
 tracks+0.021
 package+0.021
 radio+0.021
Neg logit contributions
 servant-0.029
ubb-0.029
 queens-0.028
oll-0.027
ill-0.026
TokenThe
Token ID464
Feature activation-7.18e-03

Pos logit contributions
ubb+0.016
 servant+0.015
 queens+0.015
oll+0.015
ill+0.014
Neg logit contributions
 tracks-0.009
 radio-0.009
 stret-0.009
 track-0.009
 automobile-0.009
Token old
Token ID1468
Feature activation-1.28e-02

Pos logit contributions
 tracks+0.012
 stret+0.012
 radio+0.012
 automobile+0.012
 track+0.012
Neg logit contributions
 servant-0.013
ubb-0.013
 queens-0.012
oll-0.012
ill-0.011
Token man
Token ID582
Feature activation+9.26e-03

Pos logit contributions
 tracks+0.021
 radio+0.020
 stret+0.020
 automobile+0.020
 controller+0.020
Neg logit contributions
 servant-0.025
ubb-0.023
 queens-0.023
oll-0.022
ill-0.021
Token was
Token ID373
Feature activation+4.53e-03

Pos logit contributions
ubb+0.022
 servant+0.021
 queens+0.021
oll+0.020
ill+0.020
Neg logit contributions
 tracks-0.011
 automobile-0.010
 radio-0.010
 package-0.010
 stret-0.010
Token never
Token ID1239
Feature activation+2.66e-02

Pos logit contributions
ubb+0.010
 servant+0.009
 queens+0.009
oll+0.009
ill+0.009
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 stret-0.006
 package-0.006
Token hungry
Token ID14720
Feature activation+1.02e-01

Pos logit contributions
ubb+0.053
 servant+0.051
oll+0.050
aph+0.048
 queens+0.048
Neg logit contributions
 tracks-0.039
 stret-0.038
 receiver-0.037
 package-0.037
 automobile-0.037
Token again
Token ID757
Feature activation+4.28e-02

Pos logit contributions
acy+0.249
ubb+0.247
umber+0.242
aph+0.241
ascal+0.240
Neg logit contributions
 away-0.091
 package-0.086
's-0.085
 track-0.081
 automobile-0.078
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
ubb+0.115
 servant+0.111
umbo+0.109
oll+0.109
aph+0.108
Neg logit contributions
 tracks-0.033
 track-0.032
 radio-0.031
 package-0.031
 automobile-0.029
Token and
Token ID290
Feature activation+1.95e-02

Pos logit contributions
ubb+0.047
 servant+0.046
oll+0.044
 queens+0.044
acy+0.044
Neg logit contributions
 tracks-0.024
 track-0.023
 automobile-0.023
 radio-0.022
 package-0.022
Token he
Token ID339
Feature activation+2.69e-02

Pos logit contributions
ubb+0.039
 servant+0.038
oll+0.037
 queens+0.036
ill+0.036
Neg logit contributions
 tracks-0.030
 track-0.028
 automobile-0.028
 package-0.028
 radio-0.028
Token forever
Token ID8097
Feature activation+2.73e-02

Pos logit contributions
ubb+0.066
 queens+0.063
 servant+0.062
aph+0.060
oll+0.059
Neg logit contributions
 tracks-0.030
 package-0.028
 automobile-0.028
 radio-0.028
 belongs-0.027
Token Jane
Token ID12091
Feature activation-9.89e-03

Pos logit contributions
 radio+0.026
 stret+0.026
 tracks+0.026
 automobile+0.025
 track+0.025
Neg logit contributions
ill-0.032
ubb-0.031
 servant-0.031
 queens-0.031
oll-0.030
Token had
Token ID550
Feature activation+1.55e-02

Pos logit contributions
 tracks+0.012
 radio+0.012
 automobile+0.012
 stret+0.012
 track+0.012
Neg logit contributions
ubb-0.022
 servant-0.022
 queens-0.021
oll-0.021
ill-0.021
Token never
Token ID1239
Feature activation+3.89e-02

Pos logit contributions
ubb+0.035
 servant+0.033
 queens+0.033
oll+0.032
aph+0.031
Neg logit contributions
 tracks-0.021
 package-0.020
 radio-0.019
 automobile-0.019
 stret-0.019
Token eaten
Token ID17065
Feature activation+7.38e-02

Pos logit contributions
ubb+0.084
 servant+0.078
aph+0.077
 queens+0.076
oll+0.076
Neg logit contributions
 tracks-0.051
 package-0.050
 receiver-0.049
 belongs-0.049
 stret-0.048
Token a
Token ID257
Feature activation+3.04e-02

Pos logit contributions
ubb+0.177
 servant+0.176
oll+0.174
acy+0.173
aph+0.173
Neg logit contributions
 our-0.065
 away-0.065
 their-0.062
 track-0.057
 package-0.057
Token tomato
Token ID24240
Feature activation+1.02e-01

Pos logit contributions
ubb+0.054
oll+0.052
 queens+0.051
 servant+0.051
ill+0.048
Neg logit contributions
 package-0.053
 radio-0.052
 track-0.051
 tracks-0.050
 automobile-0.050
Token before
Token ID878
Feature activation+2.40e-02

Pos logit contributions
ubb+0.232
aph+0.224
add+0.219
unn+0.217
ancer+0.214
Neg logit contributions
 away-0.103
 package-0.102
's-0.101
 track-0.098
house-0.087
Token,
Token ID11
Feature activation+2.97e-02

Pos logit contributions
ubb+0.060
 servant+0.059
 queens+0.058
oll+0.056
umbo+0.055
Neg logit contributions
 tracks-0.025
 radio-0.025
 package-0.024
 track-0.024
 automobile-0.024
Token but
Token ID475
Feature activation-2.06e-03

Pos logit contributions
ubb+0.075
 servant+0.074
 queens+0.072
oll+0.071
aph+0.069
Neg logit contributions
 automobile-0.029
 tracks-0.029
 track-0.028
 radio-0.027
 package-0.027
Token when
Token ID618
Feature activation+3.39e-02

Pos logit contributions
 tracks+0.004
 stret+0.004
 radio+0.003
 automobile+0.003
 track+0.003
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.003
ill-0.003
Token she
Token ID673
Feature activation+2.94e-02

Pos logit contributions
ubb+0.082
oll+0.079
 servant+0.078
ascal+0.074
umbo+0.074
Neg logit contributions
 package-0.037
 automobile-0.037
 tracks-0.037
 radio-0.036
 their-0.036
Token happy
Token ID3772
Feature activation+1.45e-02

Pos logit contributions
 servant+0.063
 queens+0.062
ubb+0.060
oll+0.058
umbo+0.057
Neg logit contributions
 tracks-0.026
 automobile-0.025
 package-0.024
 receiver-0.024
 track-0.024
Token that
Token ID326
Feature activation+1.56e-02

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.032
oll+0.032
acy+0.030
Neg logit contributions
 tracks-0.018
 radio-0.018
 track-0.017
 automobile-0.017
 package-0.017
Token she
Token ID673
Feature activation+2.91e-03

Pos logit contributions
ubb+0.027
 servant+0.027
 queens+0.026
oll+0.025
ill+0.024
Neg logit contributions
 tracks-0.028
 radio-0.027
 automobile-0.027
 track-0.027
 package-0.027
Token had
Token ID550
Feature activation+8.00e-03

Pos logit contributions
ubb+0.006
 servant+0.006
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.004
 automobile-0.004
 radio-0.004
 stret-0.004
 package-0.004
Token her
Token ID607
Feature activation+3.89e-02

Pos logit contributions
ubb+0.017
 servant+0.016
 queens+0.016
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.012
 package-0.011
 track-0.011
 automobile-0.011
 radio-0.011
Token l
Token ID300
Feature activation+1.02e-01

Pos logit contributions
ubb+0.083
 queens+0.083
ill+0.080
ier+0.079
acy+0.075
Neg logit contributions
 package-0.057
 radio-0.053
 tracks-0.053
 automobile-0.052
 track-0.051
Tokenoll
Token ID692
Feature activation+4.61e-02

Pos logit contributions
 queens+0.130
ubb+0.105
 Mon+0.102
 chef+0.098
 diver+0.098
Neg logit contributions
 package-0.175
c-0.175
shirt-0.171
 belongs-0.170
age-0.163
Tokenipop
Token ID42800
Feature activation+3.06e-02

Pos logit contributions
ubb+0.077
 queens+0.075
aph+0.073
ill+0.071
Col+0.068
Neg logit contributions
 package-0.083
 belongs-0.079
cycles-0.074
 automobile-0.073
 coats-0.072
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
ubb+0.073
 queens+0.069
 servant+0.068
acy+0.065
Col+0.065
Neg logit contributions
 tracks-0.036
 package-0.036
 track-0.036
 automobile-0.036
 radio-0.035
Token it
Token ID340
Feature activation+1.04e-02

Pos logit contributions
ubb+0.032
 servant+0.030
 queens+0.030
oll+0.030
ill+0.029
Neg logit contributions
 tracks-0.022
 automobile-0.021
 track-0.021
 radio-0.021
 package-0.020
Token tasted
Token ID29187
Feature activation+1.59e-02

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.023
Neg logit contributions
 tracks-0.011
 automobile-0.010
 radio-0.010
 track-0.010
 package-0.010
Token ground
Token ID2323
Feature activation+3.97e-02

Pos logit contributions
ill+0.156
ubb+0.154
enny+0.146
umber+0.146
acy+0.145
Neg logit contributions
 package-0.086
 radio-0.075
 tracks-0.074
 automobile-0.073
 track-0.071
Token.
Token ID13
Feature activation+4.75e-03

Pos logit contributions
 servant+0.090
ubb+0.088
oll+0.087
 queens+0.085
Col+0.084
Neg logit contributions
 tracks-0.052
 track-0.050
 automobile-0.049
 radio-0.047
 package-0.046
Token At
Token ID1629
Feature activation+1.49e-02

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 package-0.005
 stret-0.005
Token last
Token ID938
Feature activation-6.35e-03

Pos logit contributions
ubb+0.037
 servant+0.037
oll+0.036
 queens+0.036
ill+0.033
Neg logit contributions
 tracks-0.016
 track-0.016
 automobile-0.016
 radio-0.016
 package-0.015
Token the
Token ID262
Feature activation+3.20e-02

Pos logit contributions
 automobile+0.009
 tracks+0.009
 radio+0.009
 stret+0.009
 package+0.009
Neg logit contributions
ubb-0.013
 servant-0.013
 queens-0.012
oll-0.012
ill-0.012
Token l
Token ID300
Feature activation+1.01e-01

Pos logit contributions
ill+0.070
ubb+0.070
 queens+0.066
 servant+0.065
ier+0.064
Neg logit contributions
 package-0.047
 tracks-0.043
 radio-0.042
 automobile-0.042
 track-0.040
Tokenoll
Token ID692
Feature activation+5.03e-02

Pos logit contributions
 queens+0.119
ubb+0.104
ory+0.099
 servant+0.093
 diver+0.085
Neg logit contributions
 tracks-0.200
 package-0.197
c-0.197
chair-0.196
 slot-0.196
Tokenipop
Token ID42800
Feature activation+3.02e-02

Pos logit contributions
ubb+0.088
 queens+0.087
ill+0.084
aph+0.082
Col+0.081
Neg logit contributions
 package-0.086
 belongs-0.082
 replied-0.078
 coats-0.076
 tracks-0.075
Token stopped
Token ID5025
Feature activation+3.44e-02

Pos logit contributions
ubb+0.072
 servant+0.070
 queens+0.069
oll+0.067
Col+0.065
Neg logit contributions
 tracks-0.037
 track-0.035
 package-0.035
 radio-0.035
 automobile-0.034
Token rising
Token ID7396
Feature activation+1.62e-02

Pos logit contributions
 servant+0.087
oll+0.087
 queens+0.085
ubb+0.084
umbo+0.081
Neg logit contributions
 tracks-0.036
 radio-0.035
 automobile-0.035
 track-0.035
 stret-0.032
Token and
Token ID290
Feature activation-2.40e-03

Pos logit contributions
ubb+0.038
 servant+0.038
 queens+0.037
oll+0.036
acy+0.034
Neg logit contributions
 tracks-0.020
 radio-0.019
 automobile-0.019
 track-0.019
 package-0.018
Token the
Token ID262
Feature activation-7.79e-03

Pos logit contributions
oll+0.090
 servant+0.086
ubb+0.086
ill+0.086
acy+0.084
Neg logit contributions
 tracks-0.026
 track-0.025
 automobile-0.022
 radio-0.021
cycles-0.019
Token little
Token ID1310
Feature activation+3.67e-03

Pos logit contributions
 stret+0.014
 tracks+0.013
 automobile+0.013
 radio+0.013
 package+0.013
Neg logit contributions
 servant-0.014
ubb-0.013
 queens-0.013
oll-0.012
ill-0.012
Token rabbit
Token ID22746
Feature activation+4.38e-02

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.006
 automobile-0.005
 radio-0.005
 track-0.005
 package-0.005
Token was
Token ID373
Feature activation+1.78e-02

Pos logit contributions
ubb+0.109
 queens+0.104
oll+0.102
 servant+0.102
aph+0.099
Neg logit contributions
 tracks-0.048
 track-0.044
 radio-0.042
 automobile-0.041
cycles-0.041
Token never
Token ID1239
Feature activation+2.52e-02

Pos logit contributions
ubb+0.033
 servant+0.031
 queens+0.031
oll+0.030
aph+0.029
Neg logit contributions
 tracks-0.030
 track-0.029
 stret-0.028
 automobile-0.028
 radio-0.028
Token sad
Token ID6507
Feature activation+1.01e-01

Pos logit contributions
ubb+0.049
oll+0.048
 servant+0.048
acy+0.045
 queens+0.045
Neg logit contributions
 tracks-0.038
 track-0.037
 receiver-0.036
 automobile-0.035
 stret-0.035
Token again
Token ID757
Feature activation+3.44e-02

Pos logit contributions
umber+0.273
acy+0.251
uka+0.249
Col+0.249
ona+0.248
Neg logit contributions
 around-0.068
 away-0.064
's-0.061
 language-0.060
 up-0.058
Token.
Token ID13
Feature activation+3.15e-02

Pos logit contributions
ubb+0.089
 servant+0.088
oll+0.087
 queens+0.085
umbo+0.084
Neg logit contributions
 tracks-0.031
 track-0.030
 radio-0.028
 automobile-0.027
 receiver-0.026
Token He
Token ID679
Feature activation+3.25e-02

Pos logit contributions
ubb+0.080
 servant+0.077
 queens+0.077
oll+0.075
ier+0.073
Neg logit contributions
 tracks-0.032
 automobile-0.029
cycles-0.029
 track-0.028
 radio-0.028
Token was
Token ID373
Feature activation+6.86e-03

Pos logit contributions
ubb+0.077
 queens+0.075
 servant+0.073
oll+0.072
ill+0.070
Neg logit contributions
 tracks-0.038
 track-0.036
 radio-0.035
 belongs-0.035
 automobile-0.034
Token always
Token ID1464
Feature activation-1.03e-02

Pos logit contributions
ubb+0.014
 servant+0.013
 queens+0.013
oll+0.013
ill+0.012
Neg logit contributions
 tracks-0.010
 automobile-0.010
 track-0.010
 radio-0.010
 package-0.010
Token if
Token ID611
Feature activation+3.88e-02

Pos logit contributions
ubb+0.050
 queens+0.047
 servant+0.046
oll+0.046
ill+0.044
Neg logit contributions
 tracks-0.032
 package-0.031
 radio-0.031
 automobile-0.031
 track-0.030
Token she
Token ID673
Feature activation+3.71e-02

Pos logit contributions
ubb+0.076
oll+0.069
ill+0.069
umbo+0.067
 queens+0.067
Neg logit contributions
 package-0.063
 radio-0.060
 receiver-0.059
 automobile-0.059
 tracks-0.058
Token could
Token ID714
Feature activation+3.76e-02

Pos logit contributions
ubb+0.086
 queens+0.080
 servant+0.076
aph+0.075
ectar+0.074
Neg logit contributions
 package-0.049
 belongs-0.048
 automobile-0.045
 tracks-0.045
 receiver-0.044
Token have
Token ID423
Feature activation+5.89e-03

Pos logit contributions
ubb+0.096
 queens+0.094
 servant+0.090
oll+0.087
aph+0.086
Neg logit contributions
 automobile-0.039
 tracks-0.038
 package-0.038
 radio-0.036
 track-0.036
Token the
Token ID262
Feature activation+3.78e-02

Pos logit contributions
ubb+0.013
 servant+0.013
 queens+0.013
oll+0.013
ill+0.012
Neg logit contributions
 tracks-0.007
 automobile-0.007
 package-0.007
 radio-0.007
 stret-0.007
Token l
Token ID300
Feature activation+1.01e-01

Pos logit contributions
ubb+0.085
 queens+0.082
ill+0.079
aph+0.075
acy+0.075
Neg logit contributions
 package-0.054
 radio-0.047
 automobile-0.047
 tracks-0.047
 receiver-0.045
Tokenoll
Token ID692
Feature activation+5.40e-02

Pos logit contributions
 queens+0.123
ubb+0.111
aph+0.099
ipples+0.095
 diver+0.091
Neg logit contributions
 package-0.198
shirt-0.197
c-0.189
 belongs-0.187
nob-0.178
Tokenipop
Token ID42800
Feature activation+1.33e-02

Pos logit contributions
ubb+0.093
 queens+0.091
aph+0.088
ill+0.087
Col+0.083
Neg logit contributions
 package-0.096
 belongs-0.092
 automobile-0.082
 replied-0.082
cycles-0.081
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
ubb+0.030
 servant+0.029
 queens+0.029
oll+0.028
ill+0.027
Neg logit contributions
 tracks-0.017
 automobile-0.017
 package-0.016
 radio-0.016
 track-0.016
Token The
Token ID383
Feature activation+1.65e-02

Pos logit contributions
ubb+0.063
 queens+0.062
 servant+0.060
oll+0.059
ier+0.057
Neg logit contributions
 automobile-0.023
 tracks-0.022
 package-0.022
 radio-0.021
 stret-0.021
Token shop
Token ID6128
Feature activation-9.34e-03

Pos logit contributions
ubb+0.040
ill+0.039
 queens+0.038
 servant+0.037
aph+0.036
Neg logit contributions
 package-0.020
 automobile-0.019
 radio-0.018
 tracks-0.018
 receiver-0.018
Token
Token ID198
Feature activation-3.73e-02

Pos logit contributions
 stret+0.032
 automobile+0.032
 radio+0.032
 tracks+0.032
 package+0.032
Neg logit contributions
 servant-0.033
 queens-0.033
ubb-0.033
ill-0.033
oll-0.032
Token"
Token ID1
Feature activation-3.86e-03

Pos logit contributions
 stret+0.056
 controller+0.054
cycles+0.053
 tracks+0.053
 package+0.053
Neg logit contributions
 queens-0.075
 servant-0.070
umbo-0.069
 gard-0.064
aph-0.063
TokenHello
Token ID15496
Feature activation+2.03e-04

Pos logit contributions
 tracks+0.005
 stret+0.005
 automobile+0.005
 radio+0.005
 track+0.005
Neg logit contributions
ubb-0.008
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
ubb+0.000
 servant+0.000
 queens+0.000
oll+0.000
ill+0.000
Neg logit contributions
 tracks-0.000
 automobile-0.000
 radio-0.000
 stret-0.000
 track-0.000
Token cloud
Token ID6279
Feature activation-2.54e-02

Pos logit contributions
 stret+0.029
 tracks+0.028
cycles+0.027
 track+0.026
 controller+0.026
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.033
oll-0.031
umbo-0.031
Token!"
Token ID2474
Feature activation-1.06e-01

Pos logit contributions
 stret+0.040
 radio+0.039
 tracks+0.039
obiles+0.038
cycles+0.038
Neg logit contributions
 queens-0.048
 servant-0.048
ubb-0.048
ill-0.046
oll-0.045
Token Ellie
Token ID44801
Feature activation-1.97e-02

Pos logit contributions
 stret+0.217
cycles+0.189
 controller+0.184
izer+0.183
 trailer+0.179
Neg logit contributions
 servant-0.130
 she-0.128
 queens-0.127
 gard-0.116
 beetles-0.114
Token said
Token ID531
Feature activation-1.05e-02

Pos logit contributions
 stret+0.026
 radio+0.025
 package+0.025
 automobile+0.025
obiles+0.024
Neg logit contributions
 servant-0.043
 queens-0.042
ubb-0.041
ill-0.041
oll-0.040
Token.
Token ID13
Feature activation-3.06e-02

Pos logit contributions
 tracks+0.016
 automobile+0.016
 package+0.016
 stret+0.015
 radio+0.015
Neg logit contributions
 servant-0.021
ubb-0.021
 queens-0.020
ill-0.019
oll-0.019
Token
Token ID220
Feature activation+1.14e-02

Pos logit contributions
 stret+0.047
 package+0.047
 automobile+0.044
cycles+0.044
 radio+0.044
Neg logit contributions
 servant-0.059
 queens-0.055
ill-0.055
ubb-0.053
umbo-0.052
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
ubb+0.025
 servant+0.025
 queens+0.025
oll+0.024
ill+0.023
Neg logit contributions
 tracks-0.015
 automobile-0.014
 radio-0.014
 track-0.014
 stret-0.014
Token
Token ID198
Feature activation-6.73e-02

Pos logit contributions
 stret+0.140
 automobile+0.138
 receiver+0.136
 belongs+0.134
 package+0.134
Neg logit contributions
enny-0.061
ill-0.060
Col-0.056
olly-0.054
 queens-0.053
Token"
Token ID1
Feature activation-1.60e-02

Pos logit contributions
 cigarettes+0.127
 belongs+0.127
 stret+0.126
ci+0.123
 receiver+0.123
Neg logit contributions
enny-0.083
 queens-0.083
olly-0.081
Col-0.078
Ben-0.071
TokenLet
Token ID5756
Feature activation-4.94e-02

Pos logit contributions
 tracks+0.027
 belongs+0.026
 stret+0.026
 track+0.026
cycles+0.026
Neg logit contributions
 queens-0.028
ubb-0.028
 servant-0.027
ill-0.027
Col-0.026
Token's
Token ID338
Feature activation-1.43e-02

Pos logit contributions
 track+0.069
 controller+0.067
 tracks+0.067
cycles+0.067
 stret+0.066
Neg logit contributions
 servant-0.088
ax-0.086
olly-0.086
ory-0.085
ill-0.084
Token help
Token ID1037
Feature activation-2.65e-02

Pos logit contributions
 stret+0.021
 tracks+0.021
 radio+0.021
 receiver+0.020
 automobile+0.020
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.027
oll-0.026
ill-0.026
Token!"
Token ID2474
Feature activation-1.05e-01

Pos logit contributions
 stret+0.035
 automobile+0.034
 receiver+0.034
 Amen+0.034
obiles+0.034
Neg logit contributions
 queens-0.054
 servant-0.053
ubb-0.051
ill-0.049
 diver-0.048
Token said
Token ID531
Feature activation-7.80e-02

Pos logit contributions
 stret+0.227
obiles+0.211
rated+0.207
 Amen+0.200
initely+0.200
Neg logit contributions
 Ben-0.101
 she-0.082
olly-0.082
 he-0.076
enny-0.073
Token Max
Token ID5436
Feature activation-2.85e-02

Pos logit contributions
obiles+0.143
 stret+0.138
puters+0.135
cycles+0.131
 belongs+0.130
Neg logit contributions
 servant-0.122
 diver-0.102
 Mon-0.102
 Colin-0.101
acy-0.096
Token,
Token ID11
Feature activation-3.30e-02

Pos logit contributions
 automobile+0.045
obiles+0.045
 radio+0.045
 package+0.044
 tracks+0.044
Neg logit contributions
 servant-0.055
ubb-0.051
ill-0.051
oll-0.051
 queens-0.049
Token who
Token ID508
Feature activation-4.00e-02

Pos logit contributions
 package+0.057
 belongs+0.057
obiles+0.056
 automobile+0.056
 radio+0.054
Neg logit contributions
 servant-0.056
ubb-0.055
 queens-0.052
ill-0.051
 gard-0.050
Token was
Token ID373
Feature activation-1.82e-02

Pos logit contributions
 stret+0.067
 tracks+0.064
 radio+0.062
obiles+0.061
 track+0.061
Neg logit contributions
 servant-0.070
oll-0.068
ubb-0.066
ill-0.066
 queens-0.064
TokenHi
Token ID17250
Feature activation-2.69e-02

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token,
Token ID11
Feature activation-1.83e-02

Pos logit contributions
 tracks+0.041
 stret+0.040
 track+0.039
 controller+0.039
 radio+0.038
Neg logit contributions
 servant-0.053
 queens-0.053
ubb-0.050
oll-0.048
umbo-0.048
Token I
Token ID314
Feature activation+1.61e-02

Pos logit contributions
 tracks+0.030
 stret+0.029
 track+0.028
 controller+0.028
 automobile+0.028
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.033
oll-0.031
umbo-0.031
Token'm
Token ID1101
Feature activation+9.17e-03

Pos logit contributions
ubb+0.029
 servant+0.029
 queens+0.028
oll+0.028
ill+0.026
Neg logit contributions
 tracks-0.027
 radio-0.026
 package-0.026
 automobile-0.026
 track-0.026
Token Mouse
Token ID21839
Feature activation-2.45e-02

Pos logit contributions
ubb+0.020
 servant+0.020
 queens+0.019
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.012
 track-0.012
 radio-0.012
 package-0.011
Token,"
Token ID553
Feature activation-1.03e-01

Pos logit contributions
 stret+0.033
 tracks+0.033
obiles+0.032
 radio+0.032
 automobile+0.032
Neg logit contributions
 servant-0.052
 queens-0.051
ubb-0.050
ill-0.049
oll-0.048
Token said
Token ID531
Feature activation-5.40e-02

Pos logit contributions
 stret+0.198
 track+0.196
obiles+0.188
 tracks+0.186
 controller+0.183
Neg logit contributions
 servant-0.118
 gard-0.115
 beetle-0.114
 queens-0.112
 beetles-0.112
Token the
Token ID262
Feature activation+5.17e-03

Pos logit contributions
obiles+0.090
 tracks+0.088
 stret+0.087
 track+0.086
cycles+0.085
Neg logit contributions
 servant-0.098
 queens-0.089
ubb-0.087
 Mon-0.081
aph-0.081
Token mouse
Token ID10211
Feature activation-2.31e-02

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.007
 radio-0.007
 automobile-0.007
 package-0.007
 track-0.007
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
 stret+0.036
 package+0.036
 radio+0.036
 automobile+0.035
obiles+0.035
Neg logit contributions
 servant-0.045
ubb-0.044
 queens-0.042
ill-0.042
oll-0.041
Token "
Token ID366
Feature activation+3.22e-02

Pos logit contributions
 package+0.057
 stret+0.056
 automobile+0.056
 radio+0.055
 tracks+0.055
Neg logit contributions
 servant-0.058
 queens-0.056
ubb-0.052
umbo-0.051
ill-0.050
Token.
Token ID13
Feature activation+4.40e-03

Pos logit contributions
 tracks+0.013
 stret+0.013
 automobile+0.013
 track+0.013
 radio+0.013
Neg logit contributions
 servant-0.020
ubb-0.020
 queens-0.020
oll-0.019
ill-0.019
Token She
Token ID1375
Feature activation-1.60e-03

Pos logit contributions
ubb+0.008
 servant+0.008
 queens+0.008
oll+0.008
ill+0.007
Neg logit contributions
 tracks-0.007
 automobile-0.007
 radio-0.007
 stret-0.007
 package-0.007
Token got
Token ID1392
Feature activation-8.17e-03

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 track+0.002
Neg logit contributions
ubb-0.004
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token on
Token ID319
Feature activation-1.28e-02

Pos logit contributions
 tracks+0.011
 automobile+0.011
 radio+0.011
 stret+0.011
cycles+0.011
Neg logit contributions
ubb-0.017
 servant-0.017
 queens-0.017
oll-0.017
ill-0.016
Token a
Token ID257
Feature activation-7.51e-02

Pos logit contributions
cycles+0.016
 tracks+0.016
 receiver+0.016
 automobile+0.016
 belongs+0.016
Neg logit contributions
 servant-0.027
ubb-0.027
 queens-0.026
oll-0.025
ill-0.025
Token different
Token ID1180
Feature activation-1.02e-01

Pos logit contributions
cycles+0.120
agree+0.114
adoes+0.111
obiles+0.109
esh+0.106
Neg logit contributions
 servant-0.132
onna-0.122
 queens-0.116
 diver-0.116
 begg-0.115
Token train
Token ID4512
Feature activation-4.85e-02

Pos logit contributions
cycles+0.165
obiles+0.158
 Amen+0.155
agree+0.152
 Rusty+0.147
Neg logit contributions
 servant-0.153
 king-0.140
 diver-0.135
 begg-0.129
 sur-0.129
Token with
Token ID351
Feature activation+1.39e-03

Pos logit contributions
 Amen+0.074
cycles+0.074
obiles+0.073
 receiver+0.072
 belongs+0.071
Neg logit contributions
 servant-0.082
 queens-0.078
ill-0.078
oll-0.077
ier-0.076
Token her
Token ID607
Feature activation-1.87e-02

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token ticket
Token ID7846
Feature activation-2.72e-02

Pos logit contributions
 stret+0.026
 automobile+0.026
obiles+0.026
 tracks+0.026
 receiver+0.026
Neg logit contributions
 servant-0.037
ubb-0.037
 queens-0.035
oll-0.035
ill-0.034
Token.
Token ID13
Feature activation+8.95e-03

Pos logit contributions
 Amen+0.038
obiles+0.038
cycles+0.038
 stret+0.038
 tracks+0.038
Neg logit contributions
 servant-0.052
oll-0.050
 queens-0.050
ubb-0.050
ill-0.049
Token.
Token ID13
Feature activation-1.63e-03

Pos logit contributions
ubb+0.021
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.012
 automobile-0.011
 track-0.011
 radio-0.011
 stret-0.011
Token He
Token ID679
Feature activation-7.88e-03

Pos logit contributions
 tracks+0.002
 stret+0.002
 radio+0.002
 automobile+0.002
 package+0.002
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token put
Token ID1234
Feature activation-3.01e-02

Pos logit contributions
 tracks+0.011
 automobile+0.011
 stret+0.011
 package+0.011
 radio+0.011
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.015
ill-0.015
Token on
Token ID319
Feature activation-1.99e-02

Pos logit contributions
 stret+0.049
 tracks+0.047
 automobile+0.047
obiles+0.047
 radio+0.047
Neg logit contributions
 servant-0.053
ubb-0.052
 queens-0.052
oll-0.051
Col-0.046
Token his
Token ID465
Feature activation-6.20e-02

Pos logit contributions
 package+0.029
 stret+0.029
 receiver+0.028
cycles+0.028
 automobile+0.028
Neg logit contributions
 servant-0.039
ubb-0.038
 queens-0.038
oll-0.035
ill-0.035
Token favourite
Token ID12507
Feature activation-1.01e-01

Pos logit contributions
 receiver+0.093
esh+0.093
agree+0.093
 package+0.092
obiles+0.091
Neg logit contributions
 servant-0.111
 queens-0.102
ubb-0.099
oll-0.098
 diver-0.095
Token party
Token ID2151
Feature activation-1.14e-02

Pos logit contributions
agree+0.180
cycles+0.170
obiles+0.165
rated+0.164
limits+0.163
Neg logit contributions
 servant-0.154
 crabs-0.135
 queens-0.135
 diver-0.133
 crab-0.130
Token hat
Token ID6877
Feature activation+1.10e-03

Pos logit contributions
 tracks+0.016
 automobile+0.016
 stret+0.016
 radio+0.016
 package+0.016
Neg logit contributions
 servant-0.023
ubb-0.022
 queens-0.022
oll-0.022
ill-0.021
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token hopped
Token ID45230
Feature activation-1.79e-02

Pos logit contributions
 stret+0.028
 package+0.027
 automobile+0.027
obiles+0.027
cycles+0.027
Neg logit contributions
 servant-0.039
 queens-0.038
ubb-0.037
oll-0.037
ill-0.035
Token along
Token ID1863
Feature activation-9.50e-03

Pos logit contributions
 automobile+0.027
 tracks+0.026
 package+0.026
cycles+0.026
 stret+0.026
Neg logit contributions
 servant-0.035
 queens-0.034
ubb-0.034
oll-0.033
ill-0.033
Token."
Token ID526
Feature activation-8.96e-03

Pos logit contributions
 stret+0.025
 automobile+0.024
 package+0.024
 tracks+0.024
 receiver+0.023
Neg logit contributions
 queens-0.033
ubb-0.033
 servant-0.033
oll-0.031
ill-0.031
Token He
Token ID679
Feature activation-5.25e-03

Pos logit contributions
 radio+0.011
 tracks+0.011
 automobile+0.011
 package+0.011
 stret+0.011
Neg logit contributions
 servant-0.019
ubb-0.019
 queens-0.019
ill-0.018
oll-0.018
Token put
Token ID1234
Feature activation-2.07e-02

Pos logit contributions
 automobile+0.007
 tracks+0.007
 stret+0.007
 radio+0.007
 package+0.007
Neg logit contributions
ubb-0.011
 servant-0.011
 queens-0.010
oll-0.010
ill-0.010
Token on
Token ID319
Feature activation-1.94e-02

Pos logit contributions
 stret+0.030
 automobile+0.030
 tracks+0.030
 radio+0.029
obiles+0.029
Neg logit contributions
ubb-0.041
 servant-0.040
oll-0.040
 queens-0.039
ill-0.038
Token a
Token ID257
Feature activation-7.74e-02

Pos logit contributions
 package+0.029
 stret+0.028
 automobile+0.028
 receiver+0.027
obiles+0.027
Neg logit contributions
ubb-0.038
 servant-0.037
 queens-0.036
ill-0.035
acy-0.035
Token different
Token ID1180
Feature activation-1.00e-01

Pos logit contributions
esh+0.132
cycles+0.128
obiles+0.124
agree+0.121
 Amen+0.121
Neg logit contributions
 diver-0.121
olly-0.118
acy-0.117
 queens-0.115
 servant-0.115
Token song
Token ID3496
Feature activation-1.05e-02

Pos logit contributions
 stret+0.153
obiles+0.151
cycles+0.150
 Amen+0.148
agree+0.137
Neg logit contributions
 diver-0.159
 servant-0.157
 crabs-0.148
ubb-0.148
ill-0.148
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
 tracks+0.015
 automobile+0.015
 stret+0.015
 radio+0.015
 package+0.015
Neg logit contributions
ubb-0.021
 servant-0.020
 queens-0.020
oll-0.020
ill-0.019
Token Lily
Token ID20037
Feature activation-2.74e-03

Pos logit contributions
 stret+0.031
 automobile+0.031
 package+0.030
 radio+0.029
obiles+0.029
Neg logit contributions
ubb-0.041
 servant-0.041
 queens-0.041
oll-0.039
ill-0.038
Token started
Token ID2067
Feature activation+5.74e-03

Pos logit contributions
 tracks+0.004
 automobile+0.004
 stret+0.004
 radio+0.004
 package+0.004
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
ubb+0.010
 servant+0.010
 queens+0.010
oll+0.010
ill+0.009
Neg logit contributions
 tracks-0.010
 radio-0.010
 track-0.010
 automobile-0.010
 stret-0.009
Token you
Token ID345
Feature activation+2.73e-02

Pos logit contributions
ubb+0.003
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token to
Token ID284
Feature activation+3.67e-02

Pos logit contributions
ubb+0.058
 servant+0.053
oll+0.053
 queens+0.053
acy+0.050
Neg logit contributions
 tracks-0.040
 automobile-0.039
 track-0.039
 package-0.039
 radio-0.038
Token climb
Token ID12080
Feature activation-1.57e-02

Pos logit contributions
ubb+0.087
oll+0.084
 queens+0.083
 servant+0.083
ill+0.078
Neg logit contributions
 track-0.046
 tracks-0.046
 automobile-0.045
 radio-0.043
 package-0.043
Token the
Token ID262
Feature activation+2.94e-02

Pos logit contributions
 stret+0.019
 tracks+0.019
 receiver+0.019
 radio+0.019
 package+0.019
Neg logit contributions
ubb-0.034
 queens-0.034
 servant-0.034
oll-0.032
ill-0.032
Token wall
Token ID3355
Feature activation-1.13e-02

Pos logit contributions
ubb+0.067
ill+0.062
 queens+0.062
 servant+0.061
oll+0.061
Neg logit contributions
 tracks-0.043
 track-0.042
 automobile-0.042
 radio-0.042
 package-0.041
Token!"
Token ID2474
Feature activation-9.96e-02

Pos logit contributions
 tracks+0.012
 radio+0.012
 stret+0.012
 automobile+0.012
 package+0.012
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
ill-0.025
oll-0.025
Token Jimmy
Token ID12963
Feature activation-1.40e-02

Pos logit contributions
 stret+0.223
 trailer+0.221
cycles+0.221
izer+0.214
rated+0.208
Neg logit contributions
 she-0.102
ill-0.081
 Mon-0.080
 J-0.075
 Av-0.071
Token said
Token ID531
Feature activation-1.86e-02

Pos logit contributions
 radio+0.017
 automobile+0.017
 tracks+0.017
 stret+0.017
 package+0.016
Neg logit contributions
 servant-0.031
ubb-0.031
oll-0.030
ill-0.030
 queens-0.030
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
 tracks+0.029
 automobile+0.029
 package+0.029
 radio+0.029
obiles+0.028
Neg logit contributions
 servant-0.035
ubb-0.035
ill-0.034
oll-0.033
 queens-0.033
Token
Token ID220
Feature activation+6.66e-03

Pos logit contributions
 package+0.027
 stret+0.027
 automobile+0.027
 radio+0.027
 tracks+0.026
Neg logit contributions
 servant-0.042
ubb-0.040
ill-0.040
 queens-0.040
oll-0.039
Token
Token ID198
Feature activation-1.53e-02

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.014
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 stret-0.009
 track-0.009
Token the
Token ID262
Feature activation+5.37e-02

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.011
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.008
 automobile-0.007
 radio-0.007
 stret-0.007
 package-0.007
Token princess
Token ID21752
Feature activation-1.09e-02

Pos logit contributions
unch+0.093
oll+0.088
ubb+0.086
enny+0.084
ectar+0.084
Neg logit contributions
 radio-0.101
 track-0.101
 package-0.100
 automobile-0.100
 tracks-0.099
Token of
Token ID286
Feature activation+3.43e-02

Pos logit contributions
 tracks+0.015
 radio+0.014
 stret+0.014
 automobile+0.014
 track+0.014
Neg logit contributions
 servant-0.023
ubb-0.023
 queens-0.023
oll-0.022
ill-0.022
Token the
Token ID262
Feature activation+5.54e-02

Pos logit contributions
oll+0.093
ubb+0.092
 servant+0.091
ill+0.091
Col+0.089
Neg logit contributions
 track-0.026
 tracks-0.026
 automobile-0.026
 radio-0.024
 package-0.024
Token castle
Token ID16669
Feature activation-4.19e-03

Pos logit contributions
oll+0.120
unch+0.118
ubb+0.116
ectar+0.114
ill+0.111
Neg logit contributions
 track-0.078
 tracks-0.077
 automobile-0.076
 radio-0.075
 package-0.074
Token,"
Token ID553
Feature activation-9.90e-02

Pos logit contributions
 tracks+0.005
 radio+0.005
 automobile+0.005
 stret+0.005
 track+0.005
Neg logit contributions
ubb-0.009
 servant-0.009
 queens-0.009
oll-0.009
ill-0.009
Token said
Token ID531
Feature activation-6.22e-02

Pos logit contributions
 stret+0.222
 track+0.205
 controller+0.203
 DVD+0.202
izer+0.197
Neg logit contributions
 gard-0.109
 she-0.101
 queens-0.101
 Mon-0.087
 ever-0.086
Token the
Token ID262
Feature activation+3.00e-02

Pos logit contributions
cycles+0.110
 stret+0.110
 track+0.107
 tracks+0.107
obiles+0.104
Neg logit contributions
 servant-0.102
 queens-0.095
 gard-0.094
ier-0.094
ill-0.093
Token voice
Token ID3809
Feature activation-4.71e-02

Pos logit contributions
ubb+0.060
oll+0.058
ill+0.057
unch+0.056
ectar+0.054
Neg logit contributions
 tracks-0.047
 radio-0.046
 receiver-0.045
 package-0.044
 automobile-0.044
Token.
Token ID13
Feature activation-7.98e-02

Pos logit contributions
obiles+0.086
 track+0.083
cycles+0.083
 tracks+0.082
 stret+0.082
Neg logit contributions
 servant-0.077
ier-0.077
oll-0.075
ubb-0.075
ill-0.074
Token "
Token ID366
Feature activation+1.42e-02

Pos logit contributions
 stret+0.155
 track+0.147
 tracks+0.145
 coats+0.144
 instrument+0.143
Neg logit contributions
 gard-0.107
 servant-0.104
 queens-0.102
ill-0.098
 Mon-0.096
Token the
Token ID262
Feature activation-7.96e-02

Pos logit contributions
 stret+0.096
cycles+0.088
 Amen+0.088
obiles+0.087
 radio+0.081
Neg logit contributions
ill-0.074
ubb-0.073
acy-0.069
umbo-0.066
 queens-0.064
Token switches
Token ID18225
Feature activation-1.13e-02

Pos logit contributions
house+0.140
seed+0.138
 Amen+0.135
obiles+0.135
ishly+0.134
Neg logit contributions
ier-0.097
acy-0.096
ubb-0.092
 queens-0.086
ancer-0.083
Token and
Token ID290
Feature activation+1.05e-03

Pos logit contributions
 stret+0.016
 tracks+0.016
 automobile+0.016
 radio+0.016
 package+0.016
Neg logit contributions
ubb-0.023
 servant-0.022
 queens-0.022
oll-0.021
ill-0.021
Token pressed
Token ID12070
Feature activation-1.28e-02

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token the
Token ID262
Feature activation-7.53e-02

Pos logit contributions
 stret+0.016
 automobile+0.016
 package+0.016
 radio+0.016
cycles+0.016
Neg logit contributions
ubb-0.028
 queens-0.027
 servant-0.027
ill-0.026
oll-0.026
Token big
Token ID1263
Feature activation-9.89e-02

Pos logit contributions
seed+0.125
house+0.121
obiles+0.116
c+0.113
 Amen+0.110
Neg logit contributions
ier-0.110
ubb-0.109
acy-0.107
 servant-0.097
ancer-0.096
Token red
Token ID2266
Feature activation-7.44e-02

Pos logit contributions
obiles+0.162
seed+0.148
limits+0.145
agree+0.144
cycles+0.143
Neg logit contributions
ubb-0.127
 bud-0.126
 servant-0.123
ier-0.118
acy-0.116
Token button
Token ID4936
Feature activation-7.84e-03

Pos logit contributions
obiles+0.108
 Amen+0.100
cycles+0.099
 belongs+0.097
iday+0.096
Neg logit contributions
ier-0.123
ubb-0.118
 servant-0.118
acy-0.113
ancer-0.110
Token.
Token ID13
Feature activation+2.67e-02

Pos logit contributions
 tracks+0.011
 stret+0.010
 automobile+0.010
 radio+0.010
 package+0.010
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
oll-0.016
ill-0.015
Token The
Token ID383
Feature activation-6.25e-02

Pos logit contributions
ubb+0.067
 servant+0.067
 queens+0.066
oll+0.063
ill+0.061
Neg logit contributions
 tracks-0.030
 radio-0.028
 automobile-0.027
 track-0.027
 package-0.026
Token radio
Token ID5243
Feature activation-2.78e-02

Pos logit contributions
seed+0.081
 belongs+0.080
c+0.078
 stret+0.075
obiles+0.075
Neg logit contributions
ubb-0.110
acy-0.104
 queens-0.103
ier-0.103
 servant-0.103
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
ubb+0.081
 servant+0.080
 queens+0.077
oll+0.075
Col+0.073
Neg logit contributions
 track-0.037
 tracks-0.037
 radio-0.036
 automobile-0.036
 package-0.035
Token It
Token ID632
Feature activation-4.95e-02

Pos logit contributions
 tracks+0.003
 automobile+0.003
 stret+0.003
 radio+0.003
 track+0.003
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token was
Token ID373
Feature activation-1.90e-02

Pos logit contributions
 stret+0.094
obiles+0.085
cycles+0.083
 automobile+0.083
 Amen+0.082
Neg logit contributions
ubb-0.071
ill-0.070
 queens-0.070
 servant-0.067
 diver-0.065
Token a
Token ID257
Feature activation-5.23e-02

Pos logit contributions
 stret+0.029
cycles+0.029
obiles+0.028
 automobile+0.028
 tracks+0.028
Neg logit contributions
ubb-0.036
 servant-0.035
 queens-0.035
oll-0.034
ill-0.033
Token big
Token ID1263
Feature activation-7.46e-02

Pos logit contributions
cycles+0.089
obiles+0.088
 belongs+0.084
esh+0.084
agree+0.081
Neg logit contributions
 servant-0.086
 queens-0.084
ubb-0.080
 diver-0.079
 gard-0.075
Token,
Token ID11
Feature activation-9.87e-02

Pos logit contributions
obiles+0.127
cycles+0.122
cks+0.116
 belongs+0.114
 stret+0.113
Neg logit contributions
 servant-0.113
 diver-0.109
 queens-0.105
ubb-0.103
 king-0.102
Token shiny
Token ID22441
Feature activation-7.02e-02

Pos logit contributions
esh+0.195
cycles+0.185
obiles+0.185
ishly+0.184
heit+0.182
Neg logit contributions
 queens-0.115
oll-0.094
ubb-0.091
 beetle-0.088
 king-0.086
Token radio
Token ID5243
Feature activation-4.46e-02

Pos logit contributions
obiles+0.093
cycles+0.087
agree+0.081
cks+0.078
osc+0.077
Neg logit contributions
 servant-0.143
 diver-0.136
ier-0.132
 chef-0.129
ubb-0.128
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
 stret+0.070
obiles+0.069
cycles+0.068
 tracks+0.066
 Amen+0.066
Neg logit contributions
 servant-0.078
 queens-0.076
oll-0.074
ubb-0.074
ill-0.073
Token Katie
Token ID24721
Feature activation-1.89e-02

Pos logit contributions
 stret+0.050
cycles+0.045
 automobile+0.045
obiles+0.045
 controller+0.044
Neg logit contributions
 queens-0.061
ubb-0.057
 servant-0.057
oll-0.055
ill-0.053
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
 stret+0.026
 automobile+0.025
 tracks+0.025
 track+0.025
 radio+0.025
Neg logit contributions
 servant-0.039
ubb-0.039
 queens-0.038
oll-0.038
ill-0.037
TokenLet
Token ID5756
Feature activation-4.65e-02

Pos logit contributions
 tracks+0.004
 stret+0.004
 automobile+0.004
 radio+0.004
 track+0.004
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.005
oll-0.005
ill-0.005
Token's
Token ID338
Feature activation-1.34e-03

Pos logit contributions
cycles+0.062
 stret+0.062
 controller+0.060
 trailer+0.060
 track+0.060
Neg logit contributions
 queens-0.087
olly-0.087
 servant-0.086
ill-0.085
acy-0.084
Token go
Token ID467
Feature activation-9.08e-03

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.003
ill-0.003
Token,
Token ID11
Feature activation-4.64e-02

Pos logit contributions
 tracks+0.016
 automobile+0.015
 stret+0.015
 radio+0.015
 package+0.015
Neg logit contributions
 servant-0.016
 queens-0.015
ubb-0.015
oll-0.015
ill-0.014
Token Emmy
Token ID33953
Feature activation-2.79e-02

Pos logit contributions
 stret+0.077
 tracks+0.074
 controller+0.073
cycles+0.069
obiles+0.067
Neg logit contributions
 queens-0.086
 servant-0.080
ubb-0.076
ill-0.073
ory-0.072
Token!"
Token ID2474
Feature activation-9.80e-02

Pos logit contributions
 tracks+0.032
 radio+0.032
 stret+0.031
 controller+0.031
obiles+0.031
Neg logit contributions
 servant-0.064
 queens-0.061
ubb-0.061
ill-0.059
oll-0.058
Token said
Token ID531
Feature activation-4.20e-02

Pos logit contributions
cks+0.197
 stret+0.190
cycles+0.182
igators+0.182
 snowball+0.181
Neg logit contributions
enny-0.113
 she-0.107
ory-0.106
 Av-0.105
 servant-0.105
Token her
Token ID607
Feature activation+5.60e-03

Pos logit contributions
cycles+0.079
obiles+0.077
 tracks+0.075
 belongs+0.074
 stret+0.073
Neg logit contributions
 servant-0.068
ill-0.062
ubb-0.061
acy-0.061
 queens-0.061
Token mom
Token ID1995
Feature activation-3.19e-02

Pos logit contributions
ubb+0.011
 servant+0.010
 queens+0.010
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 package-0.009
 track-0.009
Token.
Token ID13
Feature activation-5.75e-02

Pos logit contributions
 radio+0.051
 tracks+0.050
 stret+0.050
obiles+0.050
cycles+0.049
Neg logit contributions
 servant-0.059
ill-0.057
oll-0.057
ubb-0.057
 queens-0.056
Token "
Token ID366
Feature activation+1.86e-02

Pos logit contributions
 stret+0.102
cycles+0.101
 tracks+0.101
 belongs+0.100
 controller+0.098
Neg logit contributions
 queens-0.083
 servant-0.082
ill-0.079
enny-0.079
umbo-0.076
Token wide
Token ID3094
Feature activation-6.81e-04

Pos logit contributions
 stret+0.036
 package+0.036
 radio+0.035
 receiver+0.035
 automobile+0.034
Neg logit contributions
 servant-0.039
 queens-0.039
ubb-0.038
oll-0.035
ill-0.034
Token and
Token ID290
Feature activation-4.72e-02

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token they
Token ID484
Feature activation-2.07e-02

Pos logit contributions
 stret+0.094
 package+0.084
 receiver+0.080
 radio+0.080
 Amen+0.078
Neg logit contributions
 queens-0.081
ubb-0.076
 servant-0.072
ill-0.067
 crabs-0.065
Token gave
Token ID2921
Feature activation-4.17e-02

Pos logit contributions
 stret+0.036
 package+0.035
 radio+0.035
 tracks+0.034
 receiver+0.034
Neg logit contributions
 servant-0.036
ubb-0.036
 queens-0.035
ill-0.033
oll-0.033
Token a
Token ID257
Feature activation-2.74e-02

Pos logit contributions
 stret+0.076
 Amen+0.071
 package+0.069
cycles+0.068
 radio+0.068
Neg logit contributions
 servant-0.065
ubb-0.065
 queens-0.065
aph-0.063
ill-0.060
Token loud
Token ID7812
Feature activation-9.76e-02

Pos logit contributions
cycles+0.045
obiles+0.043
 receiver+0.043
 tracks+0.042
esh+0.042
Neg logit contributions
 servant-0.050
 queens-0.047
ubb-0.047
aph-0.044
 diver-0.043
Token roar
Token ID34532
Feature activation-2.71e-02

Pos logit contributions
ci+0.152
 receiver+0.144
obiles+0.140
 belongs+0.133
cycles+0.132
Neg logit contributions
 servant-0.151
 diver-0.144
ubb-0.139
aph-0.137
 king-0.134
Token when
Token ID618
Feature activation-6.47e-03

Pos logit contributions
 stret+0.041
 package+0.040
 automobile+0.040
obiles+0.040
 receiver+0.040
Neg logit contributions
 servant-0.052
ubb-0.051
 queens-0.050
oll-0.049
ill-0.049
Token they
Token ID484
Feature activation+2.44e-02

Pos logit contributions
 stret+0.009
 tracks+0.008
 automobile+0.008
 package+0.008
 radio+0.008
Neg logit contributions
ubb-0.014
 servant-0.014
 queens-0.014
oll-0.013
ill-0.013
Token started
Token ID2067
Feature activation-1.62e-02

Pos logit contributions
ubb+0.056
 servant+0.054
oll+0.054
 queens+0.053
ectar+0.051
Neg logit contributions
 tracks-0.031
 track-0.030
 automobile-0.029
 controller-0.029
 radio-0.028
Token the
Token ID262
Feature activation-2.38e-03

Pos logit contributions
 package+0.031
 stret+0.031
 automobile+0.031
 radio+0.031
 receiver+0.031
Neg logit contributions
ubb-0.025
 servant-0.024
 queens-0.024
ill-0.022
aph-0.021
Token"
Token ID1
Feature activation-4.21e-03

Pos logit contributions
 stret+0.048
 package+0.048
 tracks+0.047
cycles+0.046
 radio+0.046
Neg logit contributions
 queens-0.052
 servant-0.051
aph-0.046
ubb-0.046
ill-0.046
TokenIt
Token ID1026
Feature activation-4.25e-02

Pos logit contributions
 tracks+0.008
 stret+0.008
 automobile+0.008
 radio+0.008
 package+0.008
Neg logit contributions
ubb-0.007
 queens-0.006
 servant-0.006
oll-0.006
ill-0.006
Token's
Token ID338
Feature activation+1.84e-03

Pos logit contributions
 stret+0.055
 automobile+0.045
 radio+0.043
cycles+0.042
obiles+0.042
Neg logit contributions
 servant-0.096
 queens-0.095
ubb-0.092
oll-0.091
ill-0.090
Token an
Token ID281
Feature activation+3.42e-03

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.003
 automobile-0.003
 radio-0.003
 stret-0.002
 track-0.002
Token earthquake
Token ID16295
Feature activation-2.38e-02

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 track-0.004
 stret-0.004
Token!"
Token ID2474
Feature activation-9.71e-02

Pos logit contributions
 tracks+0.029
 radio+0.029
 stret+0.028
cycles+0.028
obiles+0.028
Neg logit contributions
 servant-0.052
 queens-0.052
ubb-0.052
oll-0.050
ill-0.050
Token said
Token ID531
Feature activation-4.17e-02

Pos logit contributions
 stret+0.165
cycles+0.152
obiles+0.143
 controller+0.139
 track+0.136
Neg logit contributions
 servant-0.154
aph-0.141
 queens-0.140
 gard-0.136
umbo-0.130
Token her
Token ID607
Feature activation+4.50e-02

Pos logit contributions
cycles+0.060
 stret+0.058
obiles+0.057
 tracks+0.055
 package+0.054
Neg logit contributions
 servant-0.089
 queens-0.082
aph-0.080
ubb-0.079
ill-0.077
Token friend
Token ID1545
Feature activation-3.59e-02

Pos logit contributions
ubb+0.097
 queens+0.090
Col+0.090
ill+0.089
acy+0.087
Neg logit contributions
 automobile-0.068
 tracks-0.067
 radio-0.067
 track-0.067
 horn-0.064
Token,
Token ID11
Feature activation-3.15e-02

Pos logit contributions
obiles+0.059
cycles+0.058
 radio+0.058
 stret+0.057
 package+0.057
Neg logit contributions
 servant-0.067
ill-0.063
oll-0.063
ubb-0.062
 queens-0.060
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
 belongs+0.047
 package+0.047
obiles+0.046
 tracks+0.046
cycles+0.046
Neg logit contributions
 servant-0.061
ubb-0.058
 queens-0.058
ill-0.056
oll-0.054
Token.
Token ID13
Feature activation-5.79e-02

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.032
oll+0.031
ill+0.031
Neg logit contributions
 tracks-0.019
 radio-0.019
 automobile-0.019
 stret-0.018
 track-0.018
Token Suddenly
Token ID24975
Feature activation-3.87e-02

Pos logit contributions
 stret+0.093
cycles+0.093
 track+0.092
 controller+0.092
ci+0.090
Neg logit contributions
 servant-0.089
ill-0.089
 Mon-0.084
Col-0.083
 queens-0.082
Token they
Token ID484
Feature activation+3.14e-02

Pos logit contributions
cycles+0.067
obiles+0.066
 tracks+0.066
 stret+0.066
 track+0.065
Neg logit contributions
 servant-0.065
 queens-0.063
ubb-0.060
ill-0.059
oll-0.059
Token heard
Token ID2982
Feature activation-4.89e-02

Pos logit contributions
ubb+0.074
 servant+0.072
oll+0.068
 queens+0.068
ectar+0.067
Neg logit contributions
 tracks-0.038
 radio-0.037
 automobile-0.036
 package-0.036
 track-0.035
Token a
Token ID257
Feature activation-3.95e-02

Pos logit contributions
cycles+0.083
 belongs+0.080
 track+0.080
obiles+0.078
 tracks+0.077
Neg logit contributions
 queens-0.083
 servant-0.080
ill-0.077
oll-0.076
ubb-0.075
Token loud
Token ID7812
Feature activation-9.68e-02

Pos logit contributions
cycles+0.055
 tracks+0.052
obiles+0.052
 belongs+0.051
 automobile+0.050
Neg logit contributions
ubb-0.079
 servant-0.078
 queens-0.077
 diver-0.071
aph-0.070
Token noise
Token ID7838
Feature activation-3.26e-02

Pos logit contributions
cycles+0.134
obiles+0.133
 belongs+0.131
 ignition+0.126
 package+0.126
Neg logit contributions
ubb-0.161
 servant-0.153
 diver-0.150
umble-0.148
 gard-0.147
Token.
Token ID13
Feature activation-6.85e-02

Pos logit contributions
obiles+0.052
cycles+0.051
 tracks+0.050
 automobile+0.050
 receiver+0.050
Neg logit contributions
ubb-0.056
ill-0.056
 queens-0.054
 servant-0.054
oll-0.054
Token It
Token ID632
Feature activation-7.99e-02

Pos logit contributions
 track+0.109
ci+0.107
 stret+0.105
 engine+0.103
 automobile+0.103
Neg logit contributions
ill-0.107
 servant-0.105
 Mon-0.104
Col-0.104
 queens-0.103
Token was
Token ID373
Feature activation-1.86e-02

Pos logit contributions
 stret+0.154
obiles+0.145
 automobile+0.143
cycles+0.136
 trailer+0.134
Neg logit contributions
ubb-0.092
ill-0.091
ier-0.090
 queens-0.088
 ever-0.087
Token a
Token ID257
Feature activation-3.18e-02

Pos logit contributions
cycles+0.026
obiles+0.025
 automobile+0.025
 tracks+0.025
 package+0.025
Neg logit contributions
ubb-0.037
 servant-0.036
 queens-0.036
ill-0.035
oll-0.034
Token
Token ID198
Feature activation-4.70e-02

Pos logit contributions
 stret+0.058
 automobile+0.058
 receiver+0.057
 package+0.056
 radio+0.056
Neg logit contributions
ill-0.048
 queens-0.047
 servant-0.047
oll-0.046
ubb-0.045
Token"
Token ID1
Feature activation-4.13e-03

Pos logit contributions
 stret+0.082
 receiver+0.079
 belongs+0.078
ci+0.078
 package+0.078
Neg logit contributions
 queens-0.072
enny-0.066
umbo-0.066
olly-0.066
 servant-0.064
TokenHi
Token ID17250
Feature activation-2.87e-02

Pos logit contributions
 tracks+0.006
 automobile+0.006
 radio+0.006
 track+0.006
 stret+0.006
Neg logit contributions
ubb-0.009
 servant-0.008
 queens-0.008
oll-0.008
ill-0.008
Token,
Token ID11
Feature activation-2.11e-02

Pos logit contributions
 stret+0.047
 tracks+0.046
 receiver+0.045
cycles+0.044
 controller+0.044
Neg logit contributions
 servant-0.052
 queens-0.051
ubb-0.048
oll-0.048
ill-0.047
Token bird
Token ID6512
Feature activation-1.78e-02

Pos logit contributions
 stret+0.035
 tracks+0.035
cycles+0.034
 track+0.033
 controller+0.033
Neg logit contributions
 servant-0.038
 queens-0.037
ubb-0.036
oll-0.035
umbo-0.033
Token!"
Token ID2474
Feature activation-9.63e-02

Pos logit contributions
 tracks+0.027
 stret+0.027
 radio+0.027
 controller+0.026
 package+0.026
Neg logit contributions
 servant-0.034
 queens-0.034
ubb-0.034
ill-0.033
oll-0.032
Token Max
Token ID5436
Feature activation-3.17e-02

Pos logit contributions
 stret+0.209
esh+0.187
rated+0.186
obiles+0.179
ci+0.172
Neg logit contributions
 Ben-0.092
 beetles-0.083
 he-0.080
 queens-0.077
 servant-0.077
Token said
Token ID531
Feature activation-1.94e-02

Pos logit contributions
 stret+0.046
 package+0.045
obiles+0.044
 automobile+0.044
 radio+0.043
Neg logit contributions
 servant-0.067
 queens-0.062
ill-0.061
aph-0.059
oll-0.058
Token.
Token ID13
Feature activation-3.35e-02

Pos logit contributions
obiles+0.031
 package+0.031
 stret+0.031
 automobile+0.031
 tracks+0.031
Neg logit contributions
 servant-0.036
ubb-0.035
 queens-0.034
ill-0.033
oll-0.033
Token "
Token ID366
Feature activation+1.81e-02

Pos logit contributions
 stret+0.058
 package+0.057
 automobile+0.056
 radio+0.056
 receiver+0.055
Neg logit contributions
 servant-0.055
 queens-0.052
ill-0.049
ubb-0.049
onna-0.048
TokenDo
Token ID5211
Feature activation-7.15e-03

Pos logit contributions
ubb+0.038
 servant+0.037
oll+0.036
 queens+0.035
ier+0.034
Neg logit contributions
 automobile-0.025
 radio-0.025
 tracks-0.025
 track-0.025
 receiver-0.025
Token"
Token ID1
Feature activation-2.28e-02

Pos logit contributions
 package+0.123
 belongs+0.122
 controller+0.116
 Amen+0.116
 automobile+0.115
Neg logit contributions
 queens-0.048
enny-0.044
olly-0.039
Col-0.039
J-0.039
TokenLet
Token ID5756
Feature activation-4.44e-02

Pos logit contributions
 tracks+0.035
 belongs+0.035
 package+0.033
cycles+0.033
 automobile+0.033
Neg logit contributions
 queens-0.044
ubb-0.042
 servant-0.041
ill-0.040
Col-0.039
Token's
Token ID338
Feature activation-1.19e-02

Pos logit contributions
cycles+0.059
 track+0.059
 controller+0.058
 tracks+0.057
 stret+0.057
Neg logit contributions
 servant-0.083
olly-0.081
andy-0.080
ill-0.080
oll-0.080
Token go
Token ID467
Feature activation-1.17e-02

Pos logit contributions
 tracks+0.017
 automobile+0.016
 radio+0.016
 package+0.016
 stret+0.016
Neg logit contributions
ubb-0.024
 servant-0.024
 queens-0.023
oll-0.022
ill-0.022
Token inside
Token ID2641
Feature activation-8.81e-03

Pos logit contributions
 radio+0.017
 automobile+0.017
 tracks+0.017
 stret+0.017
 package+0.017
Neg logit contributions
 servant-0.023
 queens-0.023
ubb-0.022
ill-0.021
oll-0.021
Token,"
Token ID553
Feature activation-9.62e-02

Pos logit contributions
 tracks+0.012
 automobile+0.012
 stret+0.012
 radio+0.012
 package+0.012
Neg logit contributions
ubb-0.018
 servant-0.018
 queens-0.018
ill-0.017
oll-0.017
Token Max
Token ID5436
Feature activation-4.81e-02

Pos logit contributions
 stret+0.216
obiles+0.215
esh+0.214
 trailer+0.207
cycles+0.205
Neg logit contributions
 she-0.080
 Ben-0.076
 servant-0.068
 queens-0.066
 beetles-0.066
Token said
Token ID531
Feature activation-3.54e-02

Pos logit contributions
obiles+0.079
 automobile+0.078
 radio+0.077
 stret+0.076
 tracks+0.075
Neg logit contributions
 servant-0.091
ill-0.080
oll-0.080
ona-0.078
ory-0.077
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
obiles+0.064
cycles+0.061
 package+0.060
 tracks+0.060
 automobile+0.060
Neg logit contributions
 servant-0.063
ubb-0.057
oll-0.057
ill-0.056
 queens-0.055
Token
Token ID198
Feature activation-4.49e-02

Pos logit contributions
 package+0.070
obiles+0.069
 radio+0.068
 automobile+0.068
 belongs+0.067
Neg logit contributions
 servant-0.063
 queens-0.059
ill-0.055
enny-0.054
umbo-0.053
Token
Token ID198
Feature activation-5.18e-02

Pos logit contributions
 automobile+0.092
 package+0.091
 stret+0.091
 radio+0.091
cycles+0.090
Neg logit contributions
ill-0.056
enny-0.055
 queens-0.053
olly-0.052
 servant-0.052
Token a
Token ID257
Feature activation+2.94e-03

Pos logit contributions
 stret+0.030
 automobile+0.030
 receiver+0.030
 radio+0.030
 tracks+0.029
Neg logit contributions
 queens-0.038
 servant-0.036
ubb-0.036
oll-0.035
ill-0.033
Token swim
Token ID9422
Feature activation-1.91e-02

Pos logit contributions
ubb+0.007
 servant+0.006
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.004
 automobile-0.004
 radio-0.004
 stret-0.004
 package-0.004
Token,
Token ID11
Feature activation-2.61e-02

Pos logit contributions
 tracks+0.024
 receiver+0.024
 stret+0.024
 radio+0.024
 automobile+0.024
Neg logit contributions
 servant-0.041
 queens-0.041
ubb-0.040
ill-0.039
oll-0.038
Token my
Token ID616
Feature activation+1.32e-02

Pos logit contributions
 tracks+0.040
 stret+0.040
 automobile+0.039
 controller+0.039
cycles+0.039
Neg logit contributions
 queens-0.048
 servant-0.048
ill-0.046
ubb-0.046
oll-0.045
Token darling
Token ID40003
Feature activation-2.20e-02

Pos logit contributions
ubb+0.025
 queens+0.025
 servant+0.024
ill+0.024
oll+0.024
Neg logit contributions
 tracks-0.022
 radio-0.022
 automobile-0.021
 package-0.021
 track-0.021
Token,"
Token ID553
Feature activation-9.49e-02

Pos logit contributions
 tracks+0.022
 radio+0.020
obiles+0.020
 track+0.020
 receiver+0.020
Neg logit contributions
 servant-0.056
 queens-0.054
ubb-0.053
ill-0.052
oll-0.051
Token her
Token ID607
Feature activation-9.49e-03

Pos logit contributions
 track+0.226
obiles+0.222
 controller+0.218
 stret+0.215
izer+0.213
Neg logit contributions
 she-0.097
 gard-0.074
 Mon-0.073
 servant-0.071
oll-0.069
Token mom
Token ID1995
Feature activation-3.09e-02

Pos logit contributions
 tracks+0.018
 stret+0.017
cycles+0.017
 track+0.017
obiles+0.017
Neg logit contributions
 servant-0.016
ubb-0.015
 queens-0.015
oll-0.014
ill-0.014
Tokenmy
Token ID1820
Feature activation-4.44e-02

Pos logit contributions
 radio+0.055
 tracks+0.054
 stret+0.054
 track+0.053
obiles+0.052
Neg logit contributions
ill-0.053
 servant-0.052
oll-0.051
 queens-0.050
umbo-0.049
Token said
Token ID531
Feature activation-3.65e-02

Pos logit contributions
 radio+0.072
 tracks+0.069
 automobile+0.069
obiles+0.068
 stret+0.067
Neg logit contributions
 servant-0.081
ill-0.080
oll-0.079
ory-0.075
ubb-0.075
Token.
Token ID13
Feature activation-4.34e-02

Pos logit contributions
obiles+0.064
 automobile+0.063
 tracks+0.063
 radio+0.061
 package+0.061
Neg logit contributions
oll-0.062
 servant-0.061
ubb-0.060
ill-0.059
ier-0.058
Token's
Token ID338
Feature activation+1.49e-02

Pos logit contributions
 stret+0.051
 track+0.050
 controller+0.050
 tracks+0.049
 receiver+0.048
Neg logit contributions
 servant-0.079
ubb-0.077
 queens-0.077
ory-0.075
olly-0.075
Token see
Token ID766
Feature activation-1.03e-03

Pos logit contributions
ubb+0.034
 servant+0.033
 queens+0.033
oll+0.033
ill+0.032
Neg logit contributions
 tracks-0.019
 track-0.018
 radio-0.018
 automobile-0.018
 package-0.018
Token what
Token ID644
Feature activation-3.29e-02

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.002
 servant-0.002
 queens-0.002
oll-0.002
ill-0.002
Token's
Token ID338
Feature activation-2.83e-02

Pos logit contributions
 stret+0.044
 track+0.039
 receiver+0.039
cycles+0.038
obiles+0.038
Neg logit contributions
 queens-0.073
 servant-0.071
ubb-0.068
oll-0.068
 diver-0.065
Token inside
Token ID2641
Feature activation-1.79e-02

Pos logit contributions
 stret+0.044
cycles+0.043
 Amen+0.043
obiles+0.043
 automobile+0.043
Neg logit contributions
 servant-0.051
 queens-0.050
ubb-0.050
oll-0.048
ier-0.046
Token!"
Token ID2474
Feature activation-9.41e-02

Pos logit contributions
 stret+0.023
 tracks+0.022
 radio+0.022
 automobile+0.022
 track+0.022
Neg logit contributions
 queens-0.039
ubb-0.038
 servant-0.038
ill-0.037
oll-0.036
Token said
Token ID531
Feature activation-6.31e-02

Pos logit contributions
obiles+0.183
 stret+0.173
rated+0.171
initely+0.169
cycles+0.164
Neg logit contributions
 servant-0.107
oll-0.096
unn-0.095
 she-0.095
 Po-0.095
Token Bun
Token ID28515
Feature activation-1.87e-02

Pos logit contributions
obiles+0.120
 stret+0.111
cycles+0.109
 belongs+0.106
 receiver+0.105
Neg logit contributions
 servant-0.102
ubb-0.089
oll-0.086
 diver-0.085
aph-0.085
Token's
Token ID338
Feature activation+3.04e-02

Pos logit contributions
 automobile+0.025
 tracks+0.025
 radio+0.025
 stret+0.025
obiles+0.025
Neg logit contributions
 servant-0.040
 queens-0.038
ubb-0.038
oll-0.037
ill-0.036
Token friend
Token ID1545
Feature activation-2.32e-02

Pos logit contributions
ubb+0.064
ill+0.061
 queens+0.061
acy+0.059
Col+0.058
Neg logit contributions
 radio-0.048
 tracks-0.047
 track-0.046
 package-0.046
 automobile-0.045
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
 stret+0.036
obiles+0.036
 radio+0.035
cycles+0.035
 tracks+0.035
Neg logit contributions
 servant-0.045
ill-0.043
ubb-0.043
oll-0.042
 queens-0.042
Token The
Token ID383
Feature activation-1.18e-02

Pos logit contributions
 stret+0.019
 tracks+0.018
 radio+0.018
 package+0.018
 track+0.018
Neg logit contributions
ubb-0.019
 servant-0.019
 queens-0.018
ill-0.018
oll-0.018
Token baby
Token ID5156
Feature activation-1.16e-02

Pos logit contributions
 tracks+0.020
 track+0.020
 stret+0.020
cycles+0.020
obiles+0.020
Neg logit contributions
ubb-0.021
 servant-0.021
 queens-0.020
oll-0.019
acy-0.018
Token had
Token ID550
Feature activation+5.63e-03

Pos logit contributions
 tracks+0.014
 stret+0.014
 track+0.014
 radio+0.013
 automobile+0.013
Neg logit contributions
 servant-0.026
ubb-0.026
 queens-0.025
oll-0.025
ill-0.024
Token on
Token ID319
Feature activation-4.18e-02

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.012
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.008
 radio-0.007
 automobile-0.007
 package-0.007
 stret-0.007
Token a
Token ID257
Feature activation-6.17e-02

Pos logit contributions
cycles+0.070
obiles+0.068
 controller+0.068
 receiver+0.067
 Amen+0.067
Neg logit contributions
 servant-0.067
ubb-0.066
 queens-0.063
oll-0.063
Col-0.061
Token beautiful
Token ID4950
Feature activation-9.40e-02

Pos logit contributions
cycles+0.108
 receiver+0.103
adoes+0.102
obiles+0.101
agree+0.101
Neg logit contributions
 servant-0.097
oll-0.092
olly-0.091
ory-0.088
 queens-0.088
Token dress
Token ID6576
Feature activation-3.39e-03

Pos logit contributions
cycles+0.168
obiles+0.162
 controller+0.161
agree+0.155
 receiver+0.154
Neg logit contributions
 servant-0.130
 diver-0.118
 chef-0.116
ona-0.115
andy-0.115
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 stret+0.005
 package+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.007
ill-0.007
Token with
Token ID351
Feature activation+5.58e-03

Pos logit contributions
 stret+0.020
 automobile+0.020
 tracks+0.020
 package+0.020
 radio+0.020
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
oll-0.025
ill-0.024
Token a
Token ID257
Feature activation-2.72e-02

Pos logit contributions
ubb+0.012
 servant+0.012
 queens+0.012
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.008
 radio-0.007
 automobile-0.007
 package-0.007
 track-0.007
Token bon
Token ID5351
Feature activation-2.36e-02

Pos logit contributions
cycles+0.043
 tracks+0.043
obiles+0.042
 automobile+0.042
 belongs+0.041
Neg logit contributions
 servant-0.049
ubb-0.049
 queens-0.047
oll-0.046
Col-0.044
Token train
Token ID4512
Feature activation-4.01e-02

Pos logit contributions
 stret+0.070
 Amen+0.069
cycles+0.067
 package+0.067
obiles+0.064
Neg logit contributions
 servant-0.075
 queens-0.074
aph-0.073
ill-0.072
ier-0.069
Token when
Token ID618
Feature activation-1.76e-02

Pos logit contributions
 stret+0.062
 package+0.062
 Amen+0.061
cycles+0.059
 radio+0.059
Neg logit contributions
 servant-0.071
 queens-0.070
aph-0.067
ier-0.067
ill-0.067
Token they
Token ID484
Feature activation+3.05e-02

Pos logit contributions
 stret+0.026
 receiver+0.025
 package+0.025
 tracks+0.024
 radio+0.024
Neg logit contributions
 queens-0.035
 servant-0.034
ubb-0.034
ill-0.032
aph-0.031
Token heard
Token ID2982
Feature activation-4.14e-02

Pos logit contributions
ubb+0.079
 servant+0.077
 queens+0.075
oll+0.075
ectar+0.072
Neg logit contributions
 tracks-0.031
 automobile-0.030
 radio-0.029
 track-0.028
 package-0.028
Token a
Token ID257
Feature activation-4.67e-02

Pos logit contributions
cycles+0.068
 belongs+0.067
 Amen+0.067
 stret+0.066
obiles+0.065
Neg logit contributions
 queens-0.069
ubb-0.065
 servant-0.065
ill-0.063
oll-0.062
Token loud
Token ID7812
Feature activation-9.39e-02

Pos logit contributions
cycles+0.070
 belongs+0.069
obiles+0.066
 tracks+0.065
ci+0.065
Neg logit contributions
 queens-0.082
ubb-0.082
 servant-0.082
aph-0.076
 diver-0.074
Token noise
Token ID7838
Feature activation-2.79e-02

Pos logit contributions
 belongs+0.130
obiles+0.127
ci+0.126
 package+0.125
cycles+0.125
Neg logit contributions
ubb-0.148
aph-0.136
 servant-0.136
umble-0.136
 diver-0.135
Token.
Token ID13
Feature activation-5.07e-02

Pos logit contributions
obiles+0.044
 tracks+0.043
 package+0.043
cycles+0.043
 receiver+0.043
Neg logit contributions
ubb-0.049
ill-0.047
 queens-0.047
ier-0.046
 servant-0.046
Token\n
Token ID198
Feature activation-4.23e-02

Pos logit contributions
 radio+0.091
 stret+0.091
 package+0.090
 instrument+0.087
 Amen+0.086
Neg logit contributions
ill-0.067
unch-0.067
umbo-0.063
 Mon-0.063
aph-0.062
Token\n
Token ID198
Feature activation-5.15e-02

Pos logit contributions
 radio+0.085
 stret+0.085
 automobile+0.084
 package+0.084
 receiver+0.084
Neg logit contributions
ill-0.055
 queens-0.051
Col-0.050
enny-0.050
unch-0.049
Token"
Token ID1
Feature activation-1.76e-02

Pos logit contributions
 radio+0.103
 package+0.102
 belongs+0.100
 tracks+0.100
 receiver+0.098
Neg logit contributions
 queens-0.064
ill-0.060
Col-0.060
unch-0.059
umber-0.058
Token fridge
Token ID25772
Feature activation+2.26e-02

Pos logit contributions
ubb+0.055
 servant+0.051
ill+0.050
oll+0.050
 queens+0.050
Neg logit contributions
 package-0.029
 automobile-0.028
 radio-0.027
 tracks-0.027
 receiver-0.026
Token because
Token ID780
Feature activation+7.51e-03

Pos logit contributions
ubb+0.055
 servant+0.052
 queens+0.052
oll+0.049
Col+0.049
Neg logit contributions
 tracks-0.027
 package-0.027
 automobile-0.026
 track-0.026
 radio-0.026
Token there
Token ID612
Feature activation-1.95e-02

Pos logit contributions
ubb+0.016
 servant+0.015
 queens+0.015
oll+0.015
ill+0.014
Neg logit contributions
 tracks-0.011
 automobile-0.011
 radio-0.011
 package-0.010
 track-0.010
Token were
Token ID547
Feature activation+3.84e-02

Pos logit contributions
 stret+0.030
 tracks+0.028
 radio+0.028
 track+0.028
 automobile+0.028
Neg logit contributions
 servant-0.038
ubb-0.038
 queens-0.037
oll-0.036
ill-0.036
Token always
Token ID1464
Feature activation-8.33e-03

Pos logit contributions
ubb+0.085
aph+0.079
 servant+0.078
umbo+0.077
oll+0.076
Neg logit contributions
 tracks-0.055
 package-0.052
 automobile-0.052
 radio-0.051
 track-0.049
Token y
Token ID331
Feature activation+6.60e-02

Pos logit contributions
 tracks+0.012
 stret+0.011
 radio+0.011
 automobile+0.011
 track+0.011
Neg logit contributions
 servant-0.017
ubb-0.017
 queens-0.017
oll-0.016
ill-0.016
Tokenummy
Token ID13513
Feature activation+1.93e-02

Pos logit contributions
 servant+0.108
ubb+0.101
 diver+0.099
Col+0.097
acy+0.097
Neg logit contributions
 package-0.123
 automobile-0.122
 coats-0.120
 tracks-0.116
 track-0.107
Token treats
Token ID18432
Feature activation+1.58e-02

Pos logit contributions
ubb+0.047
 servant+0.044
 queens+0.043
ill+0.043
aph+0.043
Neg logit contributions
 tracks-0.022
 package-0.022
 automobile-0.021
 radio-0.021
 stret-0.021
Token.
Token ID13
Feature activation-2.23e-02

Pos logit contributions
ubb+0.037
 servant+0.036
 queens+0.035
oll+0.034
ill+0.033
Neg logit contributions
 package-0.019
 tracks-0.019
 automobile-0.019
 radio-0.019
 stret-0.018
Token After
Token ID2293
Feature activation+1.67e-02

Pos logit contributions
 stret+0.037
 radio+0.036
 controller+0.035
 package+0.034
 tracks+0.034
Neg logit contributions
 servant-0.041
oll-0.038
umbo-0.038
ubb-0.038
ill-0.037
Token he
Token ID339
Feature activation+3.06e-02

Pos logit contributions
ubb+0.038
 servant+0.036
 queens+0.035
ill+0.035
oll+0.035
Neg logit contributions
 tracks-0.022
 package-0.021
 radio-0.021
 automobile-0.021
 track-0.021
Token thing
Token ID1517
Feature activation+4.95e-03

Pos logit contributions
 stret+0.028
cycles+0.027
 Amen+0.027
 tracks+0.026
 automobile+0.026
Neg logit contributions
 servant-0.035
ubb-0.035
 queens-0.033
oll-0.031
 diver-0.030
Token anyway
Token ID6949
Feature activation+7.74e-03

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.011
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 package-0.006
 track-0.006
Token.
Token ID13
Feature activation+3.52e-02

Pos logit contributions
ubb+0.018
 servant+0.018
 queens+0.017
oll+0.017
ill+0.016
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 package-0.009
 track-0.009
Token\n
Token ID198
Feature activation+1.94e-02

Pos logit contributions
 queens+0.093
ubb+0.091
 servant+0.091
ier+0.089
 diver+0.085
Neg logit contributions
 tracks-0.033
 -0.030
 receiver-0.028
 package-0.027
 automobile-0.026
Token\n
Token ID198
Feature activation+2.52e-02

Pos logit contributions
ubb+0.047
 servant+0.047
 queens+0.046
oll+0.043
ier+0.042
Neg logit contributions
 tracks-0.022
 radio-0.020
 stret-0.020
 track-0.020
 package-0.020
TokenM
Token ID44
Feature activation+7.72e-02

Pos logit contributions
 servant+0.052
ubb+0.050
 queens+0.049
oll+0.047
ier+0.046
Neg logit contributions
 tracks-0.036
 track-0.035
 radio-0.035
 stret-0.034
cycles-0.034
Tokenandy
Token ID10757
Feature activation+6.07e-04

Pos logit contributions
 queens+0.134
 servant+0.120
ancer+0.101
 goalie+0.100
 diver+0.095
Neg logit contributions
 replied-0.141
 package-0.126
 tracks-0.125
 belongs-0.123
 rail-0.120
Token's
Token ID338
Feature activation+5.78e-03

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token mother
Token ID2802
Feature activation+3.81e-03

Pos logit contributions
ubb+0.010
 queens+0.010
 servant+0.010
oll+0.009
ill+0.009
Neg logit contributions
 tracks-0.010
 radio-0.010
 automobile-0.010
 package-0.010
 track-0.010
Token was
Token ID373
Feature activation-3.45e-03

Pos logit contributions
ubb+0.009
 servant+0.009
 queens+0.008
oll+0.008
ill+0.008
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 stret-0.005
 package-0.005
Token angry
Token ID7954
Feature activation+1.99e-02

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 package+0.005
 track+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token so
Token ID523
Feature activation+3.11e-02

Pos logit contributions
ubb+0.018
 servant+0.018
 queens+0.018
oll+0.017
ill+0.017
Neg logit contributions
 tracks-0.010
 track-0.010
 automobile-0.010
 radio-0.010
 package-0.010
Token happy
Token ID3772
Feature activation+2.70e-02

Pos logit contributions
 servant+0.083
ubb+0.081
 queens+0.081
aph+0.080
oll+0.078
Neg logit contributions
 tracks-0.023
 track-0.022
 automobile-0.022
 package-0.022
 receiver-0.022
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
ubb+0.070
 servant+0.065
oll+0.065
acy+0.065
 queens+0.063
Neg logit contributions
 tracks-0.029
 track-0.028
 automobile-0.026
 radio-0.026
 package-0.026
Token he
Token ID339
Feature activation+2.09e-02

Pos logit contributions
ubb+0.021
 servant+0.020
 queens+0.020
oll+0.020
ill+0.019
Neg logit contributions
 tracks-0.015
 track-0.015
 automobile-0.015
 radio-0.015
 package-0.014
Token ate
Token ID15063
Feature activation+1.38e-02

Pos logit contributions
ubb+0.049
 servant+0.046
 queens+0.046
oll+0.045
ill+0.045
Neg logit contributions
 tracks-0.025
 track-0.024
 package-0.024
 radio-0.024
 automobile-0.023
Token as
Token ID355
Feature activation+7.88e-02

Pos logit contributions
ubb+0.037
 servant+0.036
 queens+0.036
oll+0.035
ill+0.034
Neg logit contributions
 tracks-0.012
 track-0.011
 package-0.011
 radio-0.011
 stret-0.011
Token much
Token ID881
Feature activation+3.36e-02

Pos logit contributions
ubb+0.198
 servant+0.197
ascal+0.195
umm+0.193
alia+0.188
Neg logit contributions
 their-0.076
 its-0.067
 our-0.058
 tracks-0.052
 it-0.051
Token as
Token ID355
Feature activation+7.96e-02

Pos logit contributions
ubb+0.080
 servant+0.079
 queens+0.078
ill+0.078
aph+0.076
Neg logit contributions
 tracks-0.037
 package-0.035
 track-0.034
cycles-0.033
 radio-0.033
Token he
Token ID339
Feature activation+5.15e-02

Pos logit contributions
ascal+0.207
ubb+0.204
umber+0.203
umm+0.200
aph+0.198
Neg logit contributions
 their-0.074
 our-0.069
 its-0.062
 tracks-0.058
 it-0.056
Token could
Token ID714
Feature activation+4.44e-02

Pos logit contributions
ubb+0.132
 queens+0.123
aph+0.122
 servant+0.121
umbo+0.119
Neg logit contributions
 tracks-0.050
 package-0.048
 replied-0.048
 belongs-0.048
cycles-0.047
Token before
Token ID878
Feature activation+2.09e-02

Pos logit contributions
ubb+0.124
 queens+0.124
aph+0.117
 servant+0.115
umbo+0.113
Neg logit contributions
 tracks-0.035
 track-0.030
 stret-0.029
 belongs-0.027
 lap-0.027
Token him
Token ID683
Feature activation-5.59e-03

Pos logit contributions
ubb+0.015
 servant+0.015
 queens+0.015
oll+0.014
ill+0.014
Neg logit contributions
 tracks-0.010
 radio-0.010
 automobile-0.010
 track-0.010
 stret-0.010
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
 tracks+0.008
 automobile+0.008
 stret+0.008
 radio+0.007
 package+0.007
Neg logit contributions
ubb-0.012
 servant-0.012
 queens-0.011
oll-0.011
ill-0.011
Token Tom
Token ID4186
Feature activation-5.33e-03

Pos logit contributions
 stret+0.010
 radio+0.009
 tracks+0.009
 automobile+0.009
 package+0.009
Neg logit contributions
 servant-0.013
ubb-0.013
 queens-0.013
oll-0.013
ill-0.012
Token never
Token ID1239
Feature activation+2.37e-02

Pos logit contributions
 tracks+0.007
 stret+0.007
 automobile+0.007
 radio+0.007
 track+0.007
Neg logit contributions
 servant-0.012
ubb-0.012
 queens-0.011
oll-0.011
ill-0.011
Token felt
Token ID2936
Feature activation+2.45e-02

Pos logit contributions
ubb+0.055
 servant+0.053
 queens+0.052
oll+0.051
ill+0.050
Neg logit contributions
 tracks-0.029
 automobile-0.028
 radio-0.028
 receiver-0.027
 package-0.027
Token sad
Token ID6507
Feature activation+8.13e-02

Pos logit contributions
 servant+0.052
 queens+0.052
ubb+0.052
ill+0.050
aph+0.050
Neg logit contributions
 tracks-0.032
 radio-0.031
 package-0.030
 automobile-0.030
 stret-0.030
Token for
Token ID329
Feature activation+3.92e-02

Pos logit contributions
umber+0.228
Col+0.224
ubb+0.220
ona+0.216
acy+0.216
Neg logit contributions
 radio-0.044
 away-0.041
 around-0.041
 tracks-0.040
 language-0.039
Token long
Token ID890
Feature activation+3.81e-02

Pos logit contributions
ubb+0.102
oll+0.100
 servant+0.098
olly+0.098
umbo+0.098
Neg logit contributions
 tracks-0.040
 track-0.037
 radio-0.036
 automobile-0.033
 package-0.031
Token because
Token ID780
Feature activation+3.78e-02

Pos logit contributions
ubb+0.079
 servant+0.076
aph+0.076
 queens+0.075
oll+0.075
Neg logit contributions
 tracks-0.053
 radio-0.049
 automobile-0.048
 package-0.048
 track-0.048
Token of
Token ID286
Feature activation+2.66e-02

Pos logit contributions
ubb+0.079
 servant+0.078
ory+0.072
 queens+0.072
ill+0.072
Neg logit contributions
 tracks-0.054
 automobile-0.054
 radio-0.053
 track-0.052
 package-0.050
Token his
Token ID465
Feature activation+3.06e-02

Pos logit contributions
 servant+0.067
ubb+0.066
oll+0.066
 queens+0.065
ill+0.065
Neg logit contributions
 tracks-0.027
 track-0.026
 radio-0.025
 automobile-0.024
 package-0.023
Token He
Token ID679
Feature activation+1.46e-02

Pos logit contributions
 stret+0.023
 radio+0.022
 automobile+0.022
 package+0.022
 tracks+0.022
Neg logit contributions
 servant-0.034
ubb-0.034
 queens-0.033
oll-0.033
ill-0.032
Token would
Token ID561
Feature activation+1.81e-02

Pos logit contributions
ubb+0.034
 queens+0.033
 servant+0.033
oll+0.031
ill+0.031
Neg logit contributions
 tracks-0.018
 radio-0.017
 track-0.017
 automobile-0.017
 package-0.017
Token always
Token ID1464
Feature activation-1.64e-02

Pos logit contributions
ubb+0.037
 queens+0.037
 servant+0.036
oll+0.035
ill+0.034
Neg logit contributions
 tracks-0.027
 track-0.026
 radio-0.026
 automobile-0.026
 stret-0.025
Token remember
Token ID3505
Feature activation+6.97e-03

Pos logit contributions
 tracks+0.027
 automobile+0.027
 stret+0.027
 radio+0.027
 package+0.026
Neg logit contributions
 servant-0.029
ubb-0.029
 queens-0.028
oll-0.027
ill-0.027
Token the
Token ID262
Feature activation+2.05e-02

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.016
oll+0.016
ill+0.015
Neg logit contributions
 tracks-0.008
 radio-0.008
 track-0.008
 automobile-0.008
 package-0.008
Token oy
Token ID35104
Feature activation+6.72e-02

Pos logit contributions
ubb+0.043
 queens+0.041
ill+0.041
oll+0.041
 servant+0.041
Neg logit contributions
 package-0.029
 radio-0.029
 tracks-0.028
 automobile-0.028
 track-0.027
Tokenster
Token ID1706
Feature activation+9.84e-03

Pos logit contributions
ubb+0.115
acy+0.108
ill+0.107
aph+0.103
umbo+0.101
Neg logit contributions
cycles-0.113
 package-0.113
 belongs-0.108
 horn-0.108
ci-0.104
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
ubb+0.021
 servant+0.020
 queens+0.020
oll+0.019
ill+0.019
Neg logit contributions
 tracks-0.014
 radio-0.013
 automobile-0.013
 track-0.013
 package-0.013
Token the
Token ID262
Feature activation+3.30e-02

Pos logit contributions
ubb+0.028
 servant+0.027
 queens+0.027
oll+0.027
ill+0.026
Neg logit contributions
 tracks-0.017
 track-0.016
 radio-0.016
 automobile-0.016
 package-0.016
Token pearl
Token ID43836
Feature activation+2.04e-02

Pos logit contributions
oll+0.065
ill+0.064
ubb+0.064
 queens+0.061
umbo+0.061
Neg logit contributions
 package-0.050
 radio-0.049
 tracks-0.049
 track-0.048
 automobile-0.048
Token it
Token ID340
Feature activation+4.92e-03

Pos logit contributions
ubb+0.041
 queens+0.039
 servant+0.039
oll+0.037
ill+0.036
Neg logit contributions
 tracks-0.031
 radio-0.030
 track-0.030
 automobile-0.030
 package-0.030
Token she
Token ID673
Feature activation+1.68e-02

Pos logit contributions
ubb+0.041
 servant+0.040
oll+0.040
 queens+0.039
ill+0.038
Neg logit contributions
 tracks-0.029
 radio-0.029
 package-0.028
 automobile-0.028
 track-0.028
Token brave
Token ID14802
Feature activation+2.46e-02

Pos logit contributions
ubb+0.039
 servant+0.038
 queens+0.037
oll+0.036
ill+0.035
Neg logit contributions
 tracks-0.021
 radio-0.020
 package-0.020
 automobile-0.019
 stret-0.019
Tokenly
Token ID306
Feature activation+1.09e-02

Pos logit contributions
ubb+0.057
 queens+0.055
oll+0.054
 servant+0.054
ill+0.053
Neg logit contributions
 tracks-0.030
 radio-0.030
 package-0.029
 automobile-0.029
 track-0.027
Token opened
Token ID4721
Feature activation-2.49e-02

Pos logit contributions
ubb+0.024
 servant+0.023
 queens+0.023
oll+0.022
ill+0.022
Neg logit contributions
 tracks-0.015
 radio-0.014
 package-0.014
 automobile-0.014
 stret-0.014
Token her
Token ID607
Feature activation-1.09e-02

Pos logit contributions
 stret+0.039
cycles+0.038
 automobile+0.037
obiles+0.037
 controller+0.037
Neg logit contributions
ubb-0.047
 servant-0.046
 queens-0.045
ill-0.045
acy-0.043
Token eyes
Token ID2951
Feature activation+1.43e-02

Pos logit contributions
 stret+0.018
 automobile+0.018
 tracks+0.018
 track+0.018
 radio+0.018
Neg logit contributions
 servant-0.020
ubb-0.020
 queens-0.019
oll-0.018
ill-0.018
Token and
Token ID290
Feature activation-6.19e-04

Pos logit contributions
ubb+0.032
 servant+0.032
 queens+0.031
oll+0.030
umbo+0.029
Neg logit contributions
 tracks-0.018
 radio-0.018
 package-0.018
 automobile-0.018
 track-0.017
Token reached
Token ID4251
Feature activation+9.48e-03

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 track+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token for
Token ID329
Feature activation+1.55e-02

Pos logit contributions
ubb+0.023
 servant+0.022
 queens+0.022
oll+0.021
ill+0.021
Neg logit contributions
 tracks-0.011
 radio-0.011
 track-0.011
 package-0.010
 automobile-0.010
Token the
Token ID262
Feature activation+9.57e-03

Pos logit contributions
ubb+0.040
 servant+0.039
oll+0.038
 queens+0.037
ill+0.037
Neg logit contributions
 tracks-0.015
 radio-0.015
 package-0.014
 track-0.014
 automobile-0.014
Token big
Token ID1263
Feature activation-1.11e-02

Pos logit contributions
ubb+0.018
 servant+0.018
 queens+0.017
oll+0.017
ill+0.017
Neg logit contributions
 radio-0.015
 tracks-0.015
 package-0.015
 automobile-0.015
 track-0.015
Token next
Token ID1306
Feature activation-1.28e-02

Pos logit contributions
ubb+0.026
 servant+0.025
 queens+0.025
oll+0.024
ill+0.024
Neg logit contributions
 tracks-0.017
 radio-0.017
 automobile-0.017
 package-0.017
 track-0.017
Token time
Token ID640
Feature activation-5.06e-04

Pos logit contributions
 stret+0.021
 tracks+0.020
 automobile+0.020
 receiver+0.020
 radio+0.020
Neg logit contributions
 servant-0.023
ubb-0.023
oll-0.023
 queens-0.023
ill-0.022
Token,
Token ID11
Feature activation+5.36e-03

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token be
Token ID307
Feature activation-2.80e-02

Pos logit contributions
ubb+0.011
 servant+0.011
 queens+0.010
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.008
 automobile-0.008
 radio-0.008
 package-0.008
 track-0.008
Token careful
Token ID8161
Feature activation-1.81e-02

Pos logit contributions
 radio+0.064
 automobile+0.063
 controller+0.062
 tracks+0.062
 package+0.061
Neg logit contributions
 queens-0.033
 servant-0.033
ubb-0.031
oll-0.030
Col-0.029
Token when
Token ID618
Feature activation+3.46e-02

Pos logit contributions
 stret+0.030
 tracks+0.029
 package+0.029
 radio+0.029
 automobile+0.029
Neg logit contributions
ubb-0.032
 queens-0.032
ill-0.032
 servant-0.032
oll-0.031
Token you
Token ID345
Feature activation+2.14e-02

Pos logit contributions
ubb+0.078
 servant+0.076
oll+0.076
ill+0.076
umbo+0.075
Neg logit contributions
 tracks-0.045
 track-0.043
 radio-0.042
 automobile-0.041
 belongs-0.040
Token twist
Token ID14528
Feature activation-2.48e-02

Pos logit contributions
ubb+0.048
 servant+0.047
 queens+0.046
oll+0.044
ill+0.043
Neg logit contributions
 tracks-0.027
 automobile-0.027
 package-0.027
 radio-0.027
 stret-0.026
Token things
Token ID1243
Feature activation-8.67e-03

Pos logit contributions
 stret+0.033
 package+0.031
 tracks+0.031
 automobile+0.031
 Amen+0.031
Neg logit contributions
ubb-0.053
 queens-0.052
 servant-0.050
ill-0.050
oll-0.049
Token.
Token ID13
Feature activation-1.66e-03

Pos logit contributions
 tracks+0.012
 automobile+0.012
 radio+0.012
 stret+0.012
 package+0.012
Neg logit contributions
ubb-0.018
 servant-0.017
 queens-0.017
oll-0.017
ill-0.017
Token You
Token ID921
Feature activation+9.36e-03

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.004
ill-0.004
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
 stret+0.042
 receiver+0.040
 radio+0.040
 package+0.040
 automobile+0.040
Neg logit contributions
 queens-0.061
 servant-0.057
ubb-0.054
oll-0.052
ill-0.051
Token magnet
Token ID19972
Feature activation+1.46e-02

Pos logit contributions
 tracks+0.024
 automobile+0.023
 stret+0.023
 receiver+0.023
obiles+0.023
Neg logit contributions
 servant-0.034
ubb-0.033
 queens-0.033
oll-0.031
ill-0.030
Token?"
Token ID1701
Feature activation+9.97e-03

Pos logit contributions
ubb+0.033
 servant+0.031
 queens+0.031
oll+0.030
ill+0.029
Neg logit contributions
 automobile-0.019
 radio-0.019
 package-0.019
 tracks-0.018
 track-0.018
Token Sam
Token ID3409
Feature activation-1.68e-02

Pos logit contributions
ubb+0.018
 servant+0.018
 queens+0.017
oll+0.017
ill+0.016
Neg logit contributions
 tracks-0.018
 automobile-0.017
 radio-0.017
 track-0.017
 stret-0.016
Token asked
Token ID1965
Feature activation+2.04e-02

Pos logit contributions
 tracks+0.024
 track+0.024
 stret+0.024
 automobile+0.024
 package+0.023
Neg logit contributions
 servant-0.034
 queens-0.033
oll-0.033
ubb-0.033
ill-0.032
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
ubb+0.052
 queens+0.050
 servant+0.050
oll+0.049
acy+0.047
Neg logit contributions
 tracks-0.020
 automobile-0.019
 radio-0.019
 track-0.019
 stret-0.018
Token\n
Token ID198
Feature activation-3.46e-03

Pos logit contributions
ubb+0.053
 servant+0.050
 queens+0.050
oll+0.049
acy+0.047
Neg logit contributions
 tracks-0.022
 radio-0.021
 automobile-0.021
 track-0.020
 stret-0.020
Token\n
Token ID198
Feature activation-5.97e-03

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 package+0.005
 stret+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
TokenBen
Token ID11696
Feature activation-2.78e-03

Pos logit contributions
 tracks+0.009
 radio+0.009
 package+0.009
 automobile+0.009
 stret+0.009
Neg logit contributions
 queens-0.012
ubb-0.012
 servant-0.012
ill-0.011
oll-0.011
Token gave
Token ID2921
Feature activation+1.38e-02

Pos logit contributions
 tracks+0.004
 automobile+0.004
 radio+0.004
 stret+0.004
 track+0.004
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token him
Token ID683
Feature activation-3.41e-03

Pos logit contributions
ubb+0.033
 servant+0.032
 queens+0.032
oll+0.032
acy+0.030
Neg logit contributions
 tracks-0.016
 automobile-0.015
 track-0.015
 package-0.014
 radio-0.014
Token took
Token ID1718
Feature activation-2.19e-03

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token her
Token ID607
Feature activation-3.48e-02

Pos logit contributions
 tracks+0.003
 automobile+0.003
 radio+0.003
 stret+0.003
 track+0.003
Neg logit contributions
ubb-0.005
 servant-0.005
 queens-0.004
oll-0.004
ill-0.004
Token outside
Token ID2354
Feature activation-1.70e-02

Pos logit contributions
obiles+0.071
cycles+0.071
 Amen+0.071
 automobile+0.067
agree+0.067
Neg logit contributions
 servant-0.045
oll-0.043
ubb-0.042
 queens-0.041
ill-0.038
Token to
Token ID284
Feature activation-2.37e-02

Pos logit contributions
cycles+0.030
obiles+0.029
 stret+0.029
 tracks+0.029
 automobile+0.029
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.028
ill-0.028
oll-0.027
Token see
Token ID766
Feature activation-1.60e-02

Pos logit contributions
cycles+0.038
 automobile+0.038
 controller+0.037
 tracks+0.037
 package+0.037
Neg logit contributions
ubb-0.041
 servant-0.041
 queens-0.040
ill-0.039
aph-0.037
Token the
Token ID262
Feature activation+2.67e-02

Pos logit contributions
 tracks+0.024
cycles+0.024
 controller+0.024
 stret+0.024
 track+0.024
Neg logit contributions
ubb-0.030
 servant-0.030
 queens-0.030
oll-0.029
ill-0.028
Token rainbow
Token ID27223
Feature activation-7.69e-03

Pos logit contributions
ubb+0.058
oll+0.057
 servant+0.057
ill+0.055
 queens+0.055
Neg logit contributions
 automobile-0.038
 radio-0.038
 tracks-0.037
 package-0.037
 track-0.036
Token in
Token ID287
Feature activation-1.88e-03

Pos logit contributions
 tracks+0.011
 stret+0.011
 radio+0.011
 automobile+0.011
 package+0.011
Neg logit contributions
ubb-0.016
 servant-0.015
 queens-0.015
oll-0.015
ill-0.015
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
 tracks+0.002
 automobile+0.002
 stret+0.002
 radio+0.002
 package+0.002
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.004
ill-0.004
Token sky
Token ID6766
Feature activation+3.29e-04

Pos logit contributions
cycles+0.036
 stret+0.035
obiles+0.034
 radio+0.034
 track+0.034
Neg logit contributions
ubb-0.026
 queens-0.026
 servant-0.025
ill-0.023
oll-0.023
Token.
Token ID13
Feature activation-4.93e-03

Pos logit contributions
ubb+0.001
 servant+0.001
 queens+0.001
oll+0.001
ill+0.001
Neg logit contributions
 tracks-0.000
 automobile-0.000
 radio-0.000
 stret-0.000
 package-0.000
TokenHello
Token ID15496
Feature activation+1.83e-02

Pos logit contributions
 tracks+0.005
 radio+0.005
 stret+0.005
 track+0.005
 automobile+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.007
oll-0.006
ill-0.006
Token,
Token ID11
Feature activation+8.37e-03

Pos logit contributions
ubb+0.046
 servant+0.043
 queens+0.042
oll+0.042
acy+0.042
Neg logit contributions
 automobile-0.020
 package-0.020
 tracks-0.020
 radio-0.019
 track-0.019
Token Tom
Token ID4186
Feature activation-2.80e-02

Pos logit contributions
ubb+0.017
 servant+0.017
 queens+0.016
oll+0.016
ill+0.016
Neg logit contributions
 automobile-0.012
 tracks-0.012
 radio-0.012
 package-0.012
 track-0.012
Token!
Token ID0
Feature activation+6.59e-03

Pos logit contributions
 stret+0.049
 tracks+0.048
 radio+0.047
 controller+0.047
 automobile+0.046
Neg logit contributions
 servant-0.048
 queens-0.047
ill-0.047
ubb-0.045
oll-0.044
Token Can
Token ID1680
Feature activation-8.43e-03

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.014
oll+0.014
ill+0.013
Neg logit contributions
 tracks-0.009
 automobile-0.008
 radio-0.008
 package-0.008
 track-0.008
Token you
Token ID345
Feature activation+1.44e-02

Pos logit contributions
 tracks+0.015
 stret+0.015
 radio+0.015
 package+0.015
 automobile+0.015
Neg logit contributions
ubb-0.014
 servant-0.014
 queens-0.014
oll-0.013
ill-0.013
Token help
Token ID1037
Feature activation+9.40e-03

Pos logit contributions
ubb+0.032
 servant+0.031
 queens+0.031
oll+0.031
ill+0.029
Neg logit contributions
 tracks-0.019
 automobile-0.019
 track-0.018
 radio-0.018
 stret-0.018
Token me
Token ID502
Feature activation+2.15e-02

Pos logit contributions
ubb+0.025
 servant+0.024
oll+0.024
 queens+0.023
ill+0.023
Neg logit contributions
 tracks-0.009
 automobile-0.009
 radio-0.009
 track-0.008
 package-0.008
Token?
Token ID30
Feature activation-1.79e-02

Pos logit contributions
ubb+0.049
 servant+0.048
 queens+0.047
oll+0.046
ill+0.045
Neg logit contributions
 tracks-0.027
 track-0.027
 automobile-0.027
 stret-0.026
 radio-0.026
Token I
Token ID314
Feature activation+3.50e-03

Pos logit contributions
 stret+0.030
 tracks+0.029
 package+0.029
 radio+0.029
 automobile+0.029
Neg logit contributions
ubb-0.032
 servant-0.032
 queens-0.031
oll-0.030
ill-0.029
Token need
Token ID761
Feature activation+4.97e-03

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.007
ill+0.006
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 package-0.005
 stret-0.005
Token They
Token ID1119
Feature activation-2.49e-02

Pos logit contributions
 stret+0.014
 tracks+0.014
 automobile+0.014
 package+0.014
 track+0.014
Neg logit contributions
ubb-0.021
 servant-0.020
 queens-0.020
oll-0.020
ill-0.019
Token're
Token ID821
Feature activation-4.94e-03

Pos logit contributions
 stret+0.039
 tracks+0.039
 automobile+0.039
 track+0.038
 radio+0.038
Neg logit contributions
 servant-0.046
ubb-0.046
 queens-0.045
oll-0.043
ill-0.042
Token just
Token ID655
Feature activation-9.37e-03

Pos logit contributions
 tracks+0.007
 automobile+0.007
 radio+0.007
 stret+0.007
 package+0.007
Neg logit contributions
ubb-0.010
 servant-0.010
 queens-0.010
oll-0.009
ill-0.009
Token playing
Token ID2712
Feature activation-2.88e-02

Pos logit contributions
 tracks+0.015
 automobile+0.015
 stret+0.015
 radio+0.015
 package+0.015
Neg logit contributions
ubb-0.018
 servant-0.017
 queens-0.017
oll-0.016
ill-0.016
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 tracks+0.044
 package+0.044
 automobile+0.043
 receiver+0.043
 stret+0.043
Neg logit contributions
 queens-0.054
 servant-0.054
ubb-0.054
ill-0.050
oll-0.050
Token They
Token ID1119
Feature activation-1.77e-02

Pos logit contributions
 stret+0.014
 tracks+0.014
 automobile+0.014
 package+0.014
 radio+0.014
Neg logit contributions
ubb-0.024
 servant-0.024
 queens-0.023
oll-0.023
ill-0.022
Token're
Token ID821
Feature activation-4.33e-03

Pos logit contributions
 stret+0.027
 tracks+0.027
 automobile+0.027
 radio+0.026
 track+0.026
Neg logit contributions
ubb-0.034
 servant-0.034
 queens-0.034
oll-0.032
ill-0.032
Token not
Token ID407
Feature activation-3.33e-03

Pos logit contributions
 tracks+0.006
 automobile+0.006
 radio+0.006
 package+0.006
 stret+0.006
Neg logit contributions
ubb-0.009
 servant-0.009
 queens-0.009
oll-0.008
ill-0.008
Token bad
Token ID2089
Feature activation+3.07e-02

Pos logit contributions
 tracks+0.005
 automobile+0.005
 radio+0.005
 stret+0.005
 package+0.005
Neg logit contributions
ubb-0.007
 servant-0.007
 queens-0.006
oll-0.006
ill-0.006
Token guys
Token ID3730
Feature activation+5.10e-03

Pos logit contributions
ubb+0.080
 servant+0.075
oll+0.074
 queens+0.074
Col+0.074
Neg logit contributions
 radio-0.029
 tracks-0.028
 automobile-0.027
 package-0.027
 track-0.026
Token.
Token ID13
Feature activation-5.92e-03

Pos logit contributions
ubb+0.012
 servant+0.011
 queens+0.011
oll+0.011
ill+0.011
Neg logit contributions
 tracks-0.006
 radio-0.006
 automobile-0.006
 stret-0.006
 track-0.006
Token a
Token ID257
Feature activation-5.92e-03

Pos logit contributions
 tracks+0.013
 stret+0.013
 automobile+0.013
 radio+0.012
cycles+0.012
Neg logit contributions
ubb-0.019
 servant-0.018
 queens-0.018
oll-0.017
ill-0.017
Token small
Token ID1402
Feature activation-2.95e-03

Pos logit contributions
 tracks+0.009
 stret+0.009
 automobile+0.009
 radio+0.009
cycles+0.009
Neg logit contributions
ubb-0.011
 servant-0.011
 queens-0.011
oll-0.010
ill-0.010
Token town
Token ID3240
Feature activation-2.18e-02

Pos logit contributions
 tracks+0.004
 stret+0.004
 radio+0.004
 automobile+0.004
 package+0.004
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.005
ill-0.005
Token with
Token ID351
Feature activation+8.59e-03

Pos logit contributions
 stret+0.034
cycles+0.033
 radio+0.033
 package+0.032
 tracks+0.032
Neg logit contributions
ubb-0.042
 queens-0.041
 servant-0.041
ill-0.039
oll-0.038
Token her
Token ID607
Feature activation-3.55e-04

Pos logit contributions
ubb+0.018
 servant+0.018
 queens+0.017
oll+0.017
ill+0.017
Neg logit contributions
 tracks-0.012
 automobile-0.012
 radio-0.012
 track-0.011
 package-0.011
Token parents
Token ID3397
Feature activation-2.04e-02

Pos logit contributions
 tracks+0.001
 automobile+0.001
 radio+0.001
 stret+0.001
 package+0.001
Neg logit contributions
ubb-0.001
 servant-0.001
 queens-0.001
oll-0.001
ill-0.001
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 stret+0.032
 tracks+0.031
cycles+0.031
obiles+0.030
 radio+0.030
Neg logit contributions
 servant-0.039
ubb-0.038
 queens-0.037
oll-0.037
ill-0.036
Token One
Token ID1881
Feature activation-1.39e-02

Pos logit contributions
 stret+0.032
 package+0.030
 automobile+0.030
 radio+0.030
 tracks+0.029
Neg logit contributions
ill-0.035
 servant-0.034
 queens-0.033
ubb-0.033
acy-0.032
Token day
Token ID1110
Feature activation-7.13e-03

Pos logit contributions
 stret+0.022
 tracks+0.021
cycles+0.020
obiles+0.020
 controller+0.020
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
oll-0.026
ill-0.025
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
 stret+0.010
 tracks+0.010
 automobile+0.010
 radio+0.010
 package+0.010
Neg logit contributions
 servant-0.014
ubb-0.014
 queens-0.014
oll-0.014
ill-0.014
Token Lisa
Token ID15378
Feature activation-6.18e-03

Pos logit contributions
 stret+0.016
 tracks+0.016
 automobile+0.016
 package+0.016
 radio+0.015
Neg logit contributions
 servant-0.019
 queens-0.019
ubb-0.019
ill-0.018
oll-0.017
Token fun
Token ID1257
Feature activation-2.26e-02

Pos logit contributions
ubb+0.026
 servant+0.025
 queens+0.024
oll+0.024
ill+0.024
Neg logit contributions
 radio-0.016
 tracks-0.016
 automobile-0.016
 package-0.016
 stret-0.016
Token day
Token ID1110
Feature activation+1.55e-02

Pos logit contributions
 receiver+0.036
 stret+0.035
cycles+0.035
obiles+0.035
 tracks+0.034
Neg logit contributions
 servant-0.041
ubb-0.039
 queens-0.039
oll-0.039
ill-0.037
Token at
Token ID379
Feature activation+6.00e-03

Pos logit contributions
ubb+0.036
 servant+0.035
 queens+0.034
oll+0.033
ill+0.033
Neg logit contributions
 tracks-0.019
 automobile-0.018
 radio-0.018
 track-0.018
 package-0.018
Token the
Token ID262
Feature activation+3.62e-02

Pos logit contributions
ubb+0.015
 servant+0.015
 queens+0.014
oll+0.014
ill+0.014
Neg logit contributions
 tracks-0.006
 radio-0.006
 automobile-0.006
 track-0.006
 package-0.006
Token stadium
Token ID10308
Feature activation+2.36e-02

Pos logit contributions
ubb+0.088
oll+0.083
ill+0.082
ier+0.081
 queens+0.081
Neg logit contributions
 track-0.048
 radio-0.047
 tracks-0.047
 automobile-0.045
 package-0.044
Token.
Token ID13
Feature activation+4.65e-03

Pos logit contributions
ubb+0.058
 servant+0.055
oll+0.054
 queens+0.054
acy+0.051
Neg logit contributions
 tracks-0.027
 track-0.026
 automobile-0.026
 radio-0.026
 package-0.025
Token<|endoftext|>
Token ID50256
Feature activation+2.81e-02

Pos logit contributions
ubb+0.010
 servant+0.010
 queens+0.010
oll+0.010
ill+0.010
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 track-0.006
 package-0.006
TokenS
Token ID50
Feature activation+3.59e-02

Pos logit contributions
ubb+0.063
 servant+0.062
 queens+0.059
oll+0.058
ill+0.054
Neg logit contributions
 tracks-0.037
 radio-0.036
 package-0.036
 automobile-0.035
 track-0.035
Tokenara
Token ID3301
Feature activation+3.44e-03

Pos logit contributions
ubb+0.096
 queens+0.091
 servant+0.090
oll+0.085
 diver+0.084
Neg logit contributions
 package-0.037
 automobile-0.034
 tracks-0.034
 radio-0.031
obiles-0.031
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
ubb+0.008
 servant+0.008
 queens+0.007
oll+0.007
ill+0.007
Neg logit contributions
 tracks-0.004
 automobile-0.004
 radio-0.004
 package-0.004
 stret-0.004
Token Ben
Token ID3932
Feature activation+8.77e-03

Pos logit contributions
 tracks+0.024
 stret+0.024
 track+0.024
 radio+0.024
 controller+0.024
Neg logit contributions
 servant-0.027
ubb-0.026
 queens-0.026
oll-0.026
ill-0.026
Token tease
Token ID34691
Feature activation-2.60e-02

Pos logit contributions
 package+0.017
 tracks+0.017
 receiver+0.016
 stret+0.016
 radio+0.016
Neg logit contributions
 servant-0.028
ubb-0.027
 queens-0.027
ill-0.026
oll-0.026
Token Tim
Token ID5045
Feature activation-2.80e-02

Pos logit contributions
 stret+0.043
 package+0.040
 Amen+0.039
 radio+0.039
 tracks+0.039
Neg logit contributions
 queens-0.049
ubb-0.047
 servant-0.047
ill-0.045
aph-0.045
Tokenmy
Token ID1820
Feature activation-3.62e-02

Pos logit contributions
 stret+0.050
 package+0.048
 tracks+0.048
 radio+0.047
cycles+0.047
Neg logit contributions
ill-0.048
 servant-0.047
 queens-0.046
umbo-0.045
ubb-0.044
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
 stret+0.051
 package+0.050
cycles+0.050
obiles+0.049
 Amen+0.049
Neg logit contributions
 servant-0.073
oll-0.071
ubb-0.070
 queens-0.070
ill-0.068
Token take
Token ID1011
Feature activation-1.25e-02

Pos logit contributions
 stret+0.028
 package+0.028
 tracks+0.027
 controller+0.027
 radio+0.027
Neg logit contributions
 servant-0.038
 queens-0.037
ubb-0.036
ill-0.033
aph-0.033
Token his
Token ID465
Feature activation-2.83e-02

Pos logit contributions
 radio+0.017
 stret+0.017
 package+0.017
 automobile+0.017
 tracks+0.017
Neg logit contributions
ubb-0.026
 servant-0.026
 queens-0.026
oll-0.025
ill-0.023
Token toys
Token ID14958
Feature activation-1.80e-03

Pos logit contributions
cycles+0.040
 stret+0.039
obiles+0.039
 Amen+0.039
 receiver+0.037
Neg logit contributions
 servant-0.059
 queens-0.056
ubb-0.055
oll-0.053
aph-0.050
Token away
Token ID1497
Feature activation+9.08e-04

Pos logit contributions
 tracks+0.002
 radio+0.002
 automobile+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.004
 servant-0.004
 queens-0.004
oll-0.004
ill-0.004
Token.
Token ID13
Feature activation-4.37e-02

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.002
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
 stret+0.087
 radio+0.086
 package+0.084
 receiver+0.082
 controller+0.082
Neg logit contributions
umbo-0.061
ill-0.060
 servant-0.056
aph-0.056
ory-0.055
Token
Token ID198
Feature activation-4.33e-02

Pos logit contributions
 stret+0.044
 tracks+0.044
 radio+0.044
 automobile+0.044
 package+0.044
Neg logit contributions
ill-0.042
 servant-0.042
ubb-0.041
 queens-0.041
oll-0.040
Token tree
Token ID5509
Feature activation-2.33e-02

Pos logit contributions
 stret+0.035
 radio+0.034
cycles+0.033
 tracks+0.033
 receiver+0.033
Neg logit contributions
 queens-0.039
 servant-0.038
ubb-0.038
oll-0.036
ill-0.035
Token in
Token ID287
Feature activation-2.43e-02

Pos logit contributions
 stret+0.036
 radio+0.036
 receiver+0.034
 package+0.034
 tracks+0.034
Neg logit contributions
 servant-0.043
ubb-0.042
 queens-0.042
oll-0.042
ill-0.041
Token his
Token ID465
Feature activation-2.77e-02

Pos logit contributions
 stret+0.039
 radio+0.036
 automobile+0.036
 receiver+0.035
 package+0.034
Neg logit contributions
ubb-0.047
 queens-0.045
 servant-0.045
ill-0.045
Col-0.042
Token backyard
Token ID24296
Feature activation-7.65e-03

Pos logit contributions
 stret+0.045
cycles+0.045
 controller+0.045
 receiver+0.044
obiles+0.042
Neg logit contributions
 servant-0.051
 queens-0.046
ubb-0.045
oll-0.044
ill-0.041
Token.
Token ID13
Feature activation+4.36e-03

Pos logit contributions
 stret+0.010
 radio+0.010
 tracks+0.010
 automobile+0.010
 package+0.010
Neg logit contributions
ubb-0.016
 servant-0.016
 queens-0.016
ill-0.015
oll-0.015
Token He
Token ID679
Feature activation+1.36e-04

Pos logit contributions
ubb+0.009
 servant+0.009
 queens+0.009
oll+0.009
ill+0.009
Neg logit contributions
 tracks-0.006
 automobile-0.006
 radio-0.006
 track-0.006
 stret-0.006
Token decided
Token ID3066
Feature activation+3.05e-03

Pos logit contributions
ubb+0.000
 servant+0.000
 queens+0.000
oll+0.000
ill+0.000
Neg logit contributions
 tracks-0.000
 automobile-0.000
 radio-0.000
 stret-0.000
 package-0.000
Token to
Token ID284
Feature activation-2.79e-03

Pos logit contributions
ubb+0.005
 servant+0.005
 queens+0.005
oll+0.005
ill+0.005
Neg logit contributions
 tracks-0.005
 automobile-0.005
 radio-0.005
 stret-0.005
 track-0.005
Token raise
Token ID5298
Feature activation-1.42e-02

Pos logit contributions
 tracks+0.004
 automobile+0.004
 radio+0.004
 stret+0.004
 package+0.004
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.005
oll-0.005
ill-0.005
Token pigs
Token ID22333
Feature activation-2.28e-02

Pos logit contributions
 stret+0.020
 automobile+0.019
 radio+0.019
 package+0.019
cycles+0.019
Neg logit contributions
ubb-0.030
 servant-0.029
 queens-0.029
ill-0.028
oll-0.027
Token under
Token ID739
Feature activation-1.16e-02

Pos logit contributions
 stret+0.037
 package+0.037
 automobile+0.037
 radio+0.037
 tracks+0.036
Neg logit contributions
ubb-0.040
 servant-0.040
 queens-0.040
ill-0.038
oll-0.036
Token day
Token ID1110
Feature activation+7.41e-04

Pos logit contributions
 stret+0.024
obiles+0.022
cycles+0.022
 tracks+0.022
 controller+0.022
Neg logit contributions
 servant-0.028
ubb-0.028
 queens-0.028
oll-0.028
ill-0.027
Token,
Token ID11
Feature activation-3.35e-03

Pos logit contributions
ubb+0.002
 servant+0.002
 queens+0.002
oll+0.002
ill+0.001
Neg logit contributions
 tracks-0.001
 automobile-0.001
 radio-0.001
 stret-0.001
 package-0.001
Token she
Token ID673
Feature activation+8.59e-03

Pos logit contributions
 tracks+0.006
 stret+0.006
 radio+0.006
 automobile+0.006
 package+0.005
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.005
Token heard
Token ID2982
Feature activation-1.10e-02

Pos logit contributions
ubb+0.020
 servant+0.019
 queens+0.019
oll+0.018
ill+0.018
Neg logit contributions
 tracks-0.011
 automobile-0.011
 radio-0.011
 package-0.010
 track-0.010
Token a
Token ID257
Feature activation-2.30e-02

Pos logit contributions
 stret+0.016
cycles+0.016
 track+0.016
 tracks+0.016
 automobile+0.015
Neg logit contributions
 queens-0.022
 servant-0.022
ubb-0.022
oll-0.021
ill-0.020
Token loud
Token ID7812
Feature activation-7.10e-02

Pos logit contributions
cycles+0.028
 tracks+0.028
 automobile+0.027
 track+0.027
obiles+0.027
Neg logit contributions
 servant-0.050
ubb-0.050
 queens-0.049
oll-0.045
aph-0.045
Token noise
Token ID7838
Feature activation-1.13e-02

Pos logit contributions
 package+0.088
obiles+0.086
cycles+0.086
 belongs+0.084
 receiver+0.083
Neg logit contributions
ubb-0.138
 servant-0.134
 queens-0.130
aph-0.126
umble-0.124
Token coming
Token ID2406
Feature activation+2.23e-03

Pos logit contributions
 tracks+0.015
 automobile+0.015
 package+0.015
 stret+0.015
 radio+0.015
Neg logit contributions
ubb-0.023
 servant-0.023
 queens-0.023
oll-0.022
ill-0.022
Token from
Token ID422
Feature activation-2.49e-02

Pos logit contributions
ubb+0.004
 servant+0.004
 queens+0.004
oll+0.004
ill+0.004
Neg logit contributions
 tracks-0.004
 automobile-0.004
 radio-0.004
 stret-0.004
 track-0.004
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
cycles+0.035
 stret+0.034
obiles+0.034
 receiver+0.034
 controller+0.033
Neg logit contributions
 servant-0.049
 queens-0.049
ubb-0.048
ill-0.045
aph-0.044
Token chim
Token ID18205
Feature activation+1.59e-02

Pos logit contributions
ubb+0.034
 servant+0.033
oll+0.033
 queens+0.032
ill+0.032
Neg logit contributions
 tracks-0.016
 automobile-0.016
 radio-0.016
 package-0.016
 track-0.015
Token he
Token ID339
Feature activation-2.96e-02

Pos logit contributions
 servant+0.077
ubb+0.077
 queens+0.076
aph+0.071
ill+0.071
Neg logit contributions
 radio-0.030
 package-0.030
 tracks-0.030
 automobile-0.029
cycles-0.027
Token was
Token ID373
Feature activation+7.58e-03

Pos logit contributions
 stret+0.044
 track+0.043
 controller+0.042
 tracks+0.042
 automobile+0.041
Neg logit contributions
oll-0.056
 servant-0.056
ubb-0.055
ill-0.054
 queens-0.052
Token sad
Token ID6507
Feature activation+1.27e-02

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.015
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.011
 radio-0.011
 automobile-0.011
 stret-0.011
 track-0.010
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
ubb+0.030
 servant+0.029
 queens+0.029
oll+0.028
ill+0.027
Neg logit contributions
 tracks-0.015
 radio-0.015
 automobile-0.015
 track-0.015
 package-0.015
Token held
Token ID2714
Feature activation-7.11e-03

Pos logit contributions
 stret+0.035
 tracks+0.035
 radio+0.035
 track+0.035
 controller+0.034
Neg logit contributions
 servant-0.037
ubb-0.037
 queens-0.037
oll-0.034
ill-0.033
Token him
Token ID683
Feature activation-5.68e-02

Pos logit contributions
 tracks+0.011
 automobile+0.011
 stret+0.011
 radio+0.011
 track+0.011
Neg logit contributions
ubb-0.014
 servant-0.014
oll-0.013
 queens-0.013
ill-0.013
Token close
Token ID1969
Feature activation-2.01e-02

Pos logit contributions
obiles+0.095
cycles+0.090
 stret+0.090
 MacDonald+0.088
 controller+0.087
Neg logit contributions
 servant-0.098
oll-0.098
ubb-0.089
ill-0.085
 queens-0.083
Token in
Token ID287
Feature activation-2.01e-02

Pos logit contributions
 stret+0.036
obiles+0.036
cycles+0.035
 tracks+0.035
 track+0.035
Neg logit contributions
 servant-0.033
ubb-0.033
oll-0.031
ill-0.031
 queens-0.030
Token her
Token ID607
Feature activation-3.85e-02

Pos logit contributions
cycles+0.031
 tracks+0.030
 stret+0.030
obiles+0.030
 automobile+0.030
Neg logit contributions
ubb-0.039
 servant-0.037
ill-0.037
 queens-0.036
Col-0.035
Token arms
Token ID5101
Feature activation-1.80e-02

Pos logit contributions
obiles+0.063
cycles+0.063
esh+0.061
 controller+0.060
 Amen+0.060
Neg logit contributions
 servant-0.070
oll-0.066
ubb-0.063
 queens-0.058
ill-0.057
Token and
Token ID290
Feature activation-3.96e-02

Pos logit contributions
 stret+0.026
 tracks+0.025
 automobile+0.025
 package+0.025
 radio+0.025
Neg logit contributions
ubb-0.036
 servant-0.035
ill-0.035
oll-0.034
 queens-0.034
TokenOne
Token ID3198
Feature activation-2.00e-02

Pos logit contributions
ubb+0.014
 servant+0.014
 queens+0.013
oll+0.013
ill+0.013
Neg logit contributions
 tracks-0.007
 automobile-0.007
 radio-0.007
 track-0.007
 stret-0.007
Token,
Token ID11
Feature activation-2.08e-02

Pos logit contributions
 stret+0.035
 radio+0.031
 receiver+0.031
 package+0.031
 automobile+0.031
Neg logit contributions
ubb-0.036
 queens-0.036
ill-0.036
 servant-0.035
oll-0.034
Token two
Token ID734
Feature activation+1.33e-03

Pos logit contributions
 stret+0.042
 tracks+0.041
 automobile+0.039
 radio+0.039
 receiver+0.039
Neg logit contributions
 queens-0.032
 servant-0.031
ubb-0.030
oll-0.028
ill-0.027
Token,
Token ID11
Feature activation-8.43e-03

Pos logit contributions
ubb+0.003
 servant+0.003
 queens+0.003
oll+0.003
ill+0.003
Neg logit contributions
 tracks-0.002
 automobile-0.002
 radio-0.002
 stret-0.002
 package-0.002
Token three
Token ID1115
Feature activation-2.92e-02

Pos logit contributions
 tracks+0.018
 stret+0.018
 radio+0.018
 automobile+0.018
 package+0.018
Neg logit contributions
ubb-0.011
 servant-0.011
 queens-0.011
oll-0.010
ill-0.010
Token!"
Token ID2474
Feature activation-8.26e-02

Pos logit contributions
 radio+0.045
 stret+0.044
 package+0.044
 receiver+0.044
 automobile+0.043
Neg logit contributions
 queens-0.058
 servant-0.056
ubb-0.056
oll-0.052
ill-0.052
Token they
Token ID484
Feature activation-2.31e-02

Pos logit contributions
 stret+0.151
obiles+0.141
 belongs+0.140
 track+0.140
 package+0.138
Neg logit contributions
 Colin-0.103
ill-0.100
 Mon-0.098
 servant-0.095
 queens-0.095
Token shouted
Token ID17293
Feature activation-2.95e-02

Pos logit contributions
 stret+0.030
 package+0.029
 tracks+0.029
 radio+0.029
 automobile+0.029
Neg logit contributions
 servant-0.050
ubb-0.049
 queens-0.048
ill-0.048
oll-0.047
Token,
Token ID11
Feature activation-1.61e-02

Pos logit contributions
 package+0.048
 tracks+0.048
obiles+0.047
 automobile+0.047
cycles+0.047
Neg logit contributions
 servant-0.053
ubb-0.052
ill-0.049
 queens-0.049
oll-0.048
Token as
Token ID355
Feature activation+2.93e-03

Pos logit contributions
 package+0.026
 tracks+0.025
 automobile+0.025
 radio+0.025
 stret+0.024
Neg logit contributions
 servant-0.030
ubb-0.030
 queens-0.029
ill-0.028
oll-0.027
Token they
Token ID484
Feature activation+4.34e-02

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.006
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.004
 automobile-0.004
 radio-0.004
 track-0.003
 stret-0.003
Token animals
Token ID4695
Feature activation+2.29e-02

Pos logit contributions
ubb+0.027
 servant+0.026
oll+0.026
 queens+0.025
ill+0.025
Neg logit contributions
 tracks-0.018
 track-0.018
 radio-0.017
 automobile-0.017
 stret-0.017
Token.
Token ID13
Feature activation-4.32e-02

Pos logit contributions
ubb+0.056
 servant+0.054
 queens+0.053
oll+0.053
ill+0.051
Neg logit contributions
 tracks-0.025
 radio-0.025
 track-0.025
 automobile-0.024
 stret-0.023
Token Lily
Token ID20037
Feature activation-2.69e-03

Pos logit contributions
 package+0.080
 stret+0.079
 receiver+0.079
 radio+0.077
 belongs+0.076
Neg logit contributions
ill-0.062
 servant-0.062
umbo-0.059
 queens-0.058
oll-0.058
Token had
Token ID550
Feature activation-1.17e-03

Pos logit contributions
 tracks+0.003
 automobile+0.003
 radio+0.003
 stret+0.003
 package+0.003
Neg logit contributions
ubb-0.006
 servant-0.006
 queens-0.006
oll-0.006
ill-0.006
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.003
 servant-0.002
 queens-0.002
oll-0.002
ill-0.002
Token soft
Token ID2705
Feature activation-5.99e-02

Pos logit contributions
 tracks+0.022
cycles+0.022
 receiver+0.021
 stret+0.021
obiles+0.021
Neg logit contributions
 servant-0.032
ubb-0.030
 queens-0.030
oll-0.029
 diver-0.028
Token bunny
Token ID44915
Feature activation-5.46e-03

Pos logit contributions
obiles+0.100
cycles+0.096
agree+0.095
 receiver+0.094
 belongs+0.094
Neg logit contributions
 servant-0.108
 diver-0.093
ubb-0.089
 queens-0.089
 beetle-0.089
Token that
Token ID326
Feature activation+4.91e-03

Pos logit contributions
 tracks+0.007
 stret+0.007
 automobile+0.007
 radio+0.007
 package+0.007
Neg logit contributions
 servant-0.012
ubb-0.012
 queens-0.011
oll-0.011
ill-0.011
Token she
Token ID673
Feature activation+6.96e-03

Pos logit contributions
ubb+0.009
 servant+0.009
 queens+0.009
oll+0.009
ill+0.008
Neg logit contributions
 tracks-0.008
 radio-0.008
 automobile-0.008
 track-0.008
 package-0.008
Token liked
Token ID8288
Feature activation+1.52e-02

Pos logit contributions
ubb+0.016
 servant+0.016
 queens+0.016
oll+0.015
ill+0.015
Neg logit contributions
 tracks-0.008
 radio-0.008
 automobile-0.008
 track-0.008
 stret-0.008
Token to
Token ID284
Feature activation-8.28e-03

Pos logit contributions
ubb+0.030
 servant+0.028
 queens+0.028
oll+0.028
ill+0.027
Neg logit contributions
 tracks-0.025
 radio-0.024
 track-0.024
 automobile-0.023
 package-0.023
Token"
Token ID1
Feature activation+2.18e-02

Pos logit contributions
 stret+0.049
 package+0.047
 automobile+0.047
 tracks+0.046
 radio+0.045
Neg logit contributions
 queens-0.039
ill-0.039
acy-0.038
ubb-0.038
Col-0.037
TokenLet
Token ID5756
Feature activation-3.84e-02

Pos logit contributions
ubb+0.048
 servant+0.048
oll+0.045
 queens+0.044
ier+0.043
Neg logit contributions
 radio-0.029
 package-0.028
 automobile-0.028
 receiver-0.028
 track-0.027
Token's
Token ID338
Feature activation-1.23e-03

Pos logit contributions
 stret+0.048
 track+0.047
 controller+0.047
 tracks+0.046
cycles+0.046
Neg logit contributions
 servant-0.076
oll-0.076
ill-0.075
ax-0.075
olly-0.074
Token go
Token ID467
Feature activation-1.08e-02

Pos logit contributions
 tracks+0.002
 automobile+0.002
 radio+0.002
 stret+0.002
 package+0.002
Neg logit contributions
ubb-0.003
 servant-0.003
 queens-0.003
oll-0.002
ill-0.002
Token explore
Token ID7301
Feature activation-4.24e-02

Pos logit contributions
 radio+0.016
 tracks+0.016
 automobile+0.016
 stret+0.016
 package+0.016
Neg logit contributions
 servant-0.021
ubb-0.021
 queens-0.021
oll-0.020
ill-0.019
Token!"
Token ID2474
Feature activation-5.12e-02

Pos logit contributions
 receiver+0.058
 stret+0.057
 automobile+0.057
cycles+0.056
 controller+0.056
Neg logit contributions
 queens-0.086
ubb-0.079
 servant-0.079
ill-0.079
oll-0.077
Token said
Token ID531
Feature activation-1.28e-02

Pos logit contributions
 stret+0.088
 automobile+0.082
 package+0.081
obiles+0.081
cycles+0.080
Neg logit contributions
acy-0.088
ill-0.086
 servant-0.081
enny-0.080
 Mon-0.080
Token Jake
Token ID14757
Feature activation-2.49e-02

Pos logit contributions
 stret+0.020
 tracks+0.020
 automobile+0.020
 package+0.019
 radio+0.019
Neg logit contributions
 servant-0.024
ubb-0.024
 queens-0.023
ill-0.023
oll-0.023
Token's
Token ID338
Feature activation+3.62e-03

Pos logit contributions
 automobile+0.038
 radio+0.038
 package+0.038
 tracks+0.038
obiles+0.037
Neg logit contributions
 servant-0.048
ubb-0.047
oll-0.047
ill-0.046
 queens-0.044
Token mom
Token ID1995
Feature activation-1.78e-02

Pos logit contributions
ubb+0.007
 servant+0.007
 queens+0.007
oll+0.006
ill+0.006
Neg logit contributions
 tracks-0.006
 radio-0.006
 automobile-0.006
 package-0.006
 stret-0.006
Token.
Token ID13
Feature activation-3.60e-02

Pos logit contributions
 radio+0.026
 tracks+0.025
 automobile+0.025
 stret+0.025
 package+0.025
Neg logit contributions
 servant-0.036
ubb-0.035
oll-0.035
 queens-0.034
ill-0.034