TOP ACTIVATIONS
MAX = 3.810

It is the biggest and nic est button in the box
crown and playing in the freshly cut grass . It looked
Ben . It 's the best tower ever !" Lily said
the mess and the qu arre ling twins . She is
and run to the ost ric hes . "
my and Billy built the coolest tower ever . It was
wide . It is the best tower ever . Mom says
cried . " The ost ric hes are mean ! They
Wow , this is the best summer ever !" Anna says
Ben , this is the best toy ever !" Sara said
and he saw the t ini est pengu in standing there
ummy ! This is the best pear ever !" They shared
, but it was the coolest car he had ever seen
" The cave is the best shelter ." So , it
: ' I 'm the best crane around !' Then he
He thinks it is the best cube ever . He says
. " It is the best tower ever !"
to play and explore the freshly organized lawn . <|endoftext|> Once
The driver and the paramed ic jumped out . They helped
the church and noticed a freshly dug hole in the ground

MIN ACTIVATIONS
MIN = -3.411

. The oct opus had eight arms and could swim very
the roses . They have th orns . They can hurt
and had fun , but Fif i noticed something strange .
L ily liked to keep cater pill ars in a jar
to cook together in their big kitchen . One day ,
with her family in their big tent . One day ,
little fairy . She had tiny wings and a big smile
family of three in their big house . Mom was very
small , human boy with big , brown eyes . He
The oct opus had eight long arms and loved to swim
. The oct opus had eight arms and loved to swim
too dangerous to use . Fre ya was very disappointed and
strong rabbit . He had big , strong muscles . He
was always working in his big garden . One day ,
P ia . She had big , alert eyes and her
. The oct opus had eight long arms and loved to
twisted , and it had th orns all over it .
pa . Grand pa has many animals , but his favorite
pa . Grand pa has many animals , but their favorite
named Daisy . Daisy had big dreams of exploring the world

INTERVAL 2.005 - 3.810
CONTAINS 0.217%

Sam . She saw Lily 's tears and Sam 's cars
fence and goes to the next yard . Lily and Max
he saw a shiny planet up in the sky . Jack
and the cat were the best of friends . And every
" Wow , Lily ! That apple looks so creative .

INTERVAL 0.199 - 2.005
CONTAINS 36.725%

called out : " Where do you think you 're going
tape and she said " You can make a pretend movie
a hat . One day , he and
! Tom would watch and smile , at how adorable Lucky
fun at school ." L ily walked into

INTERVAL -1.606 - 0.199
CONTAINS 62.467%

, the lady drove to a store . At the store
. It had lots of colorful books that made the room
crab went to the park . The crab was very honest
't want to throw her picture away . She decided to
on , the cat would wave his paw at the hole

INTERVAL -3.411 - -1.606
CONTAINS 0.592%

to have a life of its own . He would watch
, but it looked a bit messy . He asked his
the rice in his small hole . Soon , the little
<|endoftext|> Tim and Mia are brother and sister . They live
the yard . He had sharp teeth and a loud grow
Token It
Token ID632
Feature activation-1.23e+00

Pos logit contributions
 scrub+0.009
aj+0.009
 swollen+0.009
 Olive+0.009
 rinse+0.009
Neg logit contributions
--0.004
 across-0.003
ork-0.003
 thing-0.003
ings-0.003
Token is
Token ID318
Feature activation-7.20e-01

Pos logit contributions
 Olive+0.039
 Honey+0.039
 scrub+0.038
 rinse+0.038
aj+0.038
Neg logit contributions
--0.029
by-0.027
 letter-0.027
 across-0.027
eg-0.026
Token the
Token ID262
Feature activation-5.40e-03

Pos logit contributions
 Olive+0.022
 scrub+0.021
 Honey+0.021
 rinse+0.021
aj+0.021
Neg logit contributions
--0.018
by-0.018
eg-0.017
ork-0.017
 letter-0.017
Token biggest
Token ID4094
Feature activation+1.55e+00

Pos logit contributions
 Olive+0.000
 Honey+0.000
 scrub+0.000
 rinse+0.000
 swollen+0.000
Neg logit contributions
--0.000
by-0.000
ables-0.000
ork-0.000
eg-0.000
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
ables+0.039
by+0.039
eg+0.038
rolled+0.037
-+0.036
Neg logit contributions
 scrub-0.046
 swollen-0.045
 pur-0.045
 sob-0.045
 rinse-0.044
Token nic
Token ID9200
Feature activation+3.81e+00

Pos logit contributions
-+0.003
by+0.003
 letter+0.003
eg+0.003
ork+0.003
Neg logit contributions
 Olive-0.003
 Honey-0.003
 scrub-0.003
 rinse-0.003
aj-0.003
Tokenest
Token ID395
Feature activation+1.77e+00

Pos logit contributions
ables+0.096
rolled+0.094
 across+0.094
 messenger+0.093
 Exhibit+0.093
Neg logit contributions
 brist-0.095
 pur-0.091
 rinse-0.088
 iv-0.087
 wipe-0.086
Token button
Token ID4936
Feature activation-1.80e-01

Pos logit contributions
ables+0.045
eg+0.044
by+0.041
rolled+0.040
spe+0.039
Neg logit contributions
 scrub-0.053
 pur-0.052
 swollen-0.051
 sob-0.051
 iv-0.050
Token in
Token ID287
Feature activation+4.28e-01

Pos logit contributions
 scrub+0.005
 rinse+0.005
 Olive+0.004
 pur+0.004
 Honey+0.004
Neg logit contributions
eg-0.005
ables-0.005
by-0.005
oc-0.005
rolled-0.005
Token the
Token ID262
Feature activation-5.59e-03

Pos logit contributions
ables+0.014
eg+0.014
-+0.013
by+0.013
ork+0.013
Neg logit contributions
 Olive-0.010
 Honey-0.010
 Sunny-0.010
 scrub-0.010
 rinse-0.009
Token box
Token ID3091
Feature activation+6.02e-01

Pos logit contributions
 Honey+0.000
 rinse+0.000
 Olive+0.000
 scrub+0.000
 pur+0.000
Neg logit contributions
--0.000
by-0.000
eg-0.000
ings-0.000
ables-0.000
Token crown
Token ID12389
Feature activation-6.32e-01

Pos logit contributions
 pur+0.041
 scrub+0.040
 rinse+0.039
 Honey+0.039
 Olive+0.039
Neg logit contributions
eg-0.048
--0.045
ay-0.045
by-0.045
ables-0.044
Token and
Token ID290
Feature activation+3.17e-01

Pos logit contributions
 pur+0.019
 rinse+0.018
 scrub+0.017
 Honey+0.017
 swe+0.017
Neg logit contributions
eg-0.017
 Exhibit-0.015
 wallet-0.015
 photo-0.015
spe-0.015
Token playing
Token ID2712
Feature activation+1.10e+00

Pos logit contributions
-+0.008
eg+0.008
by+0.008
 letter+0.008
ork+0.008
Neg logit contributions
 Olive-0.009
 scrub-0.009
 Honey-0.009
 rinse-0.009
 pur-0.009
Token in
Token ID287
Feature activation+3.14e-01

Pos logit contributions
-+0.021
 thing+0.019
 across+0.019
 very+0.018
 letter+0.018
Neg logit contributions
 Olive-0.040
aj-0.040
 Honey-0.040
 rinse-0.040
 scrub-0.039
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
-+0.007
 across+0.007
 thing+0.007
 letter+0.007
by+0.007
Neg logit contributions
aj-0.010
 Olive-0.010
 Honey-0.010
 scrub-0.010
 rinse-0.010
Token freshly
Token ID29026
Feature activation+3.79e+00

Pos logit contributions
 Olive+0.009
 Honey+0.009
aj+0.009
 scrub+0.009
 rinse+0.009
Neg logit contributions
--0.009
 across-0.009
 thing-0.009
 very-0.008
by-0.008
Token cut
Token ID2005
Feature activation+5.56e-01

Pos logit contributions
eg+0.094
 letter+0.091
by+0.091
ings+0.090
spe+0.090
Neg logit contributions
 scrub-0.110
 Olive-0.108
 Honey-0.108
 pur-0.106
 rinse-0.106
Token grass
Token ID8701
Feature activation-2.30e-02

Pos logit contributions
eg+0.015
by+0.014
 letter+0.014
 photo+0.014
-+0.014
Neg logit contributions
 Honey-0.016
 Olive-0.015
 scrub-0.015
 rinse-0.015
 pur-0.014
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
 Buzz+0.001
Neg logit contributions
--0.001
eg-0.001
 letter-0.001
by-0.001
ork-0.000
Token It
Token ID632
Feature activation-1.31e+00

Pos logit contributions
-+0.004
by+0.004
 letter+0.004
eg+0.004
 thing+0.004
Neg logit contributions
 Olive-0.005
 Honey-0.005
 scrub-0.005
 rinse-0.005
aj-0.005
Token looked
Token ID3114
Feature activation+9.37e-01

Pos logit contributions
 Olive+0.061
aj+0.061
 Honey+0.060
 rinse+0.059
irms+0.059
Neg logit contributions
--0.012
by-0.010
 very-0.010
 across-0.010
 thing-0.009
Token Ben
Token ID3932
Feature activation-4.24e-02

Pos logit contributions
aj+0.020
 scrub+0.020
 rinse+0.019
 Olive+0.019
Sleep+0.019
Neg logit contributions
--0.023
 thing-0.022
 letter-0.022
 very-0.021
ay-0.021
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
 Olive+0.002
 Honey+0.001
 rinse+0.001
 scrub+0.001
aj+0.001
Neg logit contributions
--0.001
by-0.001
 letter-0.001
 thing-0.001
 across-0.001
Token It
Token ID632
Feature activation-8.21e-01

Pos logit contributions
ay+0.012
-+0.012
 across+0.012
 thing+0.012
eg+0.012
Neg logit contributions
 scrub-0.021
 swollen-0.021
 Olive-0.020
aj-0.020
 rinse-0.020
Token's
Token ID338
Feature activation+2.60e-02

Pos logit contributions
 Olive+0.031
 Honey+0.031
 scrub+0.031
 rinse+0.031
aj+0.030
Neg logit contributions
--0.014
by-0.013
 letter-0.013
eg-0.013
ings-0.012
Token the
Token ID262
Feature activation+6.01e-01

Pos logit contributions
-+0.001
by+0.001
eg+0.001
 letter+0.001
ork+0.001
Neg logit contributions
 Olive-0.001
 Honey-0.001
 scrub-0.001
 rinse-0.001
aj-0.001
Token best
Token ID1266
Feature activation+3.76e+00

Pos logit contributions
-+0.016
ork+0.015
s+0.015
by+0.015
ables+0.015
Neg logit contributions
 scrub-0.017
 rinse-0.017
 Olive-0.017
 Honey-0.016
 pur-0.016
Token tower
Token ID10580
Feature activation-3.32e-01

Pos logit contributions
eg+0.105
ables+0.102
by+0.100
s+0.099
 letter+0.097
Neg logit contributions
 scrub-0.101
 Olive-0.091
 pur-0.091
 rinse-0.091
 Buzz-0.089
Token ever
Token ID1683
Feature activation+2.74e-01

Pos logit contributions
 Olive+0.010
 scrub+0.010
 Honey+0.009
 Buzz+0.009
 Sunny+0.009
Neg logit contributions
eg-0.008
by-0.008
ork-0.008
 letter-0.008
ings-0.008
Token!"
Token ID2474
Feature activation+2.19e-01

Pos logit contributions
-+0.007
eg+0.007
 letter+0.007
by+0.007
ork+0.007
Neg logit contributions
 Olive-0.008
 Honey-0.008
 scrub-0.008
 rinse-0.007
aj-0.007
Token Lily
Token ID20037
Feature activation-9.76e-02

Pos logit contributions
-+0.009
 thing+0.008
 very+0.008
by+0.008
 across+0.008
Neg logit contributions
 scrub-0.003
aj-0.003
 rinse-0.003
 Honey-0.003
 Olive-0.003
Token said
Token ID531
Feature activation+8.07e-01

Pos logit contributions
Sleep+0.003
 Honey+0.003
 rinse+0.003
 Olive+0.002
irms+0.002
Neg logit contributions
--0.003
s-0.003
 very-0.003
by-0.003
 letter-0.003
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
 Olive+0.003
 scrub+0.003
aj+0.003
 Honey+0.003
 rinse+0.003
Neg logit contributions
--0.003
 letter-0.003
 across-0.003
by-0.003
 thing-0.003
Token mess
Token ID2085
Feature activation+6.14e-02

Pos logit contributions
-+0.034
by+0.032
 letter+0.032
eg+0.032
 across+0.032
Neg logit contributions
 Olive-0.025
 scrub-0.025
 Honey-0.025
 rinse-0.025
aj-0.024
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
-+0.001
by+0.001
 letter+0.001
eg+0.001
 across+0.001
Neg logit contributions
 Olive-0.002
 Honey-0.002
 scrub-0.002
 rinse-0.002
aj-0.002
Token the
Token ID262
Feature activation+9.24e-01

Pos logit contributions
-+0.034
by+0.034
eg+0.034
ork+0.033
 letter+0.033
Neg logit contributions
 Olive-0.032
 scrub-0.032
 Honey-0.032
 rinse-0.031
 pur-0.030
Token qu
Token ID627
Feature activation+1.60e+00

Pos logit contributions
-+0.027
by+0.026
 letter+0.026
eg+0.026
 across+0.026
Neg logit contributions
 Olive-0.024
 Honey-0.023
 scrub-0.023
 rinse-0.023
aj-0.022
Tokenarre
Token ID9624
Feature activation+3.72e+00

Pos logit contributions
 letter+0.053
 wallet+0.053
 country+0.052
 across+0.050
 photo+0.050
Neg logit contributions
 rinse-0.037
 scrub-0.036
aj-0.035
irms-0.033
imi-0.033
Tokenling
Token ID1359
Feature activation+2.10e+00

Pos logit contributions
eg+0.126
 wallet+0.124
ork+0.120
rolled+0.116
by+0.116
Neg logit contributions
 scrub-0.089
 rinse-0.084
 sob-0.074
 swollen-0.074
 Honey-0.072
Token twins
Token ID20345
Feature activation+1.27e+00

Pos logit contributions
eg+0.063
ables+0.061
by+0.060
 Exhibit+0.059
rolled+0.058
Neg logit contributions
 scrub-0.055
 rinse-0.054
 Honey-0.052
 Olive-0.052
 iv-0.050
Token.
Token ID13
Feature activation+3.03e-02

Pos logit contributions
-+0.030
by+0.030
eg+0.029
 letter+0.028
ork+0.028
Neg logit contributions
 scrub-0.039
 rinse-0.039
 Olive-0.039
 Honey-0.038
 sob-0.037
Token She
Token ID1375
Feature activation+3.68e-01

Pos logit contributions
-+0.001
by+0.001
 letter+0.001
 across+0.001
eg+0.001
Neg logit contributions
 Olive-0.001
 Honey-0.001
 scrub-0.001
 rinse-0.001
aj-0.001
Token is
Token ID318
Feature activation+4.67e-02

Pos logit contributions
-+0.008
by+0.008
 letter+0.007
eg+0.007
 across+0.007
Neg logit contributions
 Olive-0.012
 scrub-0.012
 Honey-0.012
 rinse-0.012
aj-0.012
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
 Olive+0.007
 scrub+0.007
 Honey+0.007
 rinse+0.007
 pur+0.007
Neg logit contributions
--0.004
by-0.004
eg-0.004
 letter-0.004
ork-0.004
Token run
Token ID1057
Feature activation-8.62e-01

Pos logit contributions
-+0.031
 across+0.030
by+0.030
 letter+0.030
 thing+0.029
Neg logit contributions
 Olive-0.035
 Honey-0.034
aj-0.034
 scrub-0.034
 rinse-0.033
Token to
Token ID284
Feature activation+3.21e-01

Pos logit contributions
 Olive+0.028
 Fif+0.026
 scrub+0.026
 Buzz+0.026
 Sunny+0.025
Neg logit contributions
 letter-0.022
eg-0.021
 wallet-0.020
ay-0.020
 photo-0.020
Token the
Token ID262
Feature activation-4.24e-02

Pos logit contributions
-+0.009
ables+0.009
eg+0.009
ay+0.009
by+0.009
Neg logit contributions
 Olive-0.009
 scrub-0.009
 Honey-0.009
 rinse-0.008
 Sunny-0.008
Token ost
Token ID23619
Feature activation+1.19e+00

Pos logit contributions
 Olive+0.001
 Honey+0.001
 iv+0.001
 pur+0.001
 swollen+0.001
Neg logit contributions
--0.001
ings-0.001
by-0.001
ay-0.001
eg-0.001
Tokenric
Token ID1173
Feature activation+3.68e+00

Pos logit contributions
 thing+0.025
-+0.025
 letter+0.025
 photo+0.023
ork+0.023
Neg logit contributions
aj-0.039
 Olive-0.038
 Honey-0.037
 scrub-0.037
 til-0.037
Tokenhes
Token ID956
Feature activation+3.98e-01

Pos logit contributions
by+0.145
-+0.144
 letter+0.142
 photo+0.141
eg+0.141
Neg logit contributions
 Olive-0.052
 Honey-0.052
 scrub-0.051
 rinse-0.049
aj-0.048
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
by+0.011
eg+0.011
 wallet+0.011
 photo+0.011
 letter+0.011
Neg logit contributions
 scrub-0.011
 Olive-0.011
 rinse-0.010
 Honey-0.010
 Sunny-0.010
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
 Olive+0.001
 Honey+0.000
 scrub+0.000
 rinse+0.000
aj+0.000
Neg logit contributions
--0.000
 across-0.000
by-0.000
 letter-0.000
eg-0.000
Token
Token ID198
Feature activation-5.35e-01

Pos logit contributions
-+0.003
by+0.003
 across+0.003
ork+0.003
 letter+0.003
Neg logit contributions
 Olive-0.005
 Honey-0.005
 scrub-0.005
 rinse-0.005
aj-0.005
Token"
Token ID1
Feature activation-3.20e-01

Pos logit contributions
 Olive+0.017
 Honey+0.017
 scrub+0.016
 rinse+0.016
aj+0.016
Neg logit contributions
--0.013
 across-0.012
by-0.012
 letter-0.012
 thing-0.012
Tokenmy
Token ID1820
Feature activation+1.00e+00

Pos logit contributions
 scrub+0.014
aj+0.013
 Olive+0.013
 Honey+0.013
 pur+0.013
Neg logit contributions
ork-0.021
ings-0.021
 thing-0.021
 across-0.020
 letter-0.020
Token and
Token ID290
Feature activation-7.21e-01

Pos logit contributions
ings+0.023
 letter+0.023
 photo+0.023
 thing+0.023
ork+0.022
Neg logit contributions
 scrub-0.033
 pur-0.030
 Honey-0.030
 Olive-0.030
 rinse-0.029
Token Billy
Token ID15890
Feature activation+1.51e+00

Pos logit contributions
 Olive+0.019
 Honey+0.018
 scrub+0.018
 Midnight+0.018
 Buzz+0.017
Neg logit contributions
--0.022
ork-0.021
ings-0.021
ables-0.021
eg-0.021
Token built
Token ID3170
Feature activation+9.13e-01

Pos logit contributions
-+0.042
ings+0.042
ork+0.041
 letter+0.041
 thing+0.041
Neg logit contributions
 scrub-0.042
 rinse-0.040
 Olive-0.040
 Honey-0.040
 pur-0.039
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
eg+0.031
ables+0.030
ay+0.030
 photo+0.029
 letter+0.029
Neg logit contributions
 scrub-0.021
 pur-0.020
 Buzz-0.020
 til-0.020
 Sunny-0.019
Token coolest
Token ID38889
Feature activation+3.66e+00

Pos logit contributions
-+0.041
by+0.039
ings+0.039
eg+0.039
 letter+0.038
Neg logit contributions
 scrub-0.039
 rinse-0.037
 pur-0.037
 Olive-0.036
 swe-0.036
Token tower
Token ID10580
Feature activation+2.24e-01

Pos logit contributions
eg+0.129
ables+0.118
by+0.116
ifying+0.115
 document+0.114
Neg logit contributions
 scrub-0.078
 swe-0.067
 pur-0.066
 honey-0.065
 Buzz-0.064
Token ever
Token ID1683
Feature activation+3.44e-01

Pos logit contributions
eg+0.007
by+0.006
 letter+0.006
 photo+0.006
ables+0.006
Neg logit contributions
 scrub-0.005
 Buzz-0.005
 Olive-0.005
 pur-0.005
 Sunny-0.005
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
eg+0.010
 letter+0.010
rolled+0.009
ork+0.009
 wallet+0.009
Neg logit contributions
 Olive-0.008
 Honey-0.008
 Buzz-0.008
 scrub-0.008
 iv-0.008
Token It
Token ID632
Feature activation-8.00e-01

Pos logit contributions
-+0.009
by+0.009
 letter+0.009
ork+0.009
ings+0.009
Neg logit contributions
 Olive-0.008
 Honey-0.007
 scrub-0.007
 Buzz-0.007
 Midnight-0.007
Token was
Token ID373
Feature activation-9.08e-02

Pos logit contributions
 scrub+0.031
 pur+0.030
 Olive+0.029
 Honey+0.029
 rinse+0.029
Neg logit contributions
eg-0.013
ings-0.013
 letter-0.013
by-0.013
ork-0.013
Token wide
Token ID3094
Feature activation+1.06e-01

Pos logit contributions
 Olive+0.042
 Honey+0.041
 scrub+0.040
aj+0.040
 rinse+0.040
Neg logit contributions
--0.034
 letter-0.033
 across-0.033
 very-0.032
by-0.032
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
-+0.002
by+0.002
 letter+0.002
eg+0.002
ork+0.002
Neg logit contributions
 Olive-0.004
 Honey-0.004
 scrub-0.004
 rinse-0.003
aj-0.003
Token It
Token ID632
Feature activation-1.02e+00

Pos logit contributions
-+0.000
 across+0.000
 thing+0.000
ork+0.000
eg+0.000
Neg logit contributions
 scrub-0.000
 Olive-0.000
 swollen-0.000
 Honey-0.000
 rinse-0.000
Token is
Token ID318
Feature activation-8.21e-01

Pos logit contributions
 Olive+0.039
 Honey+0.039
 scrub+0.039
 rinse+0.039
aj+0.038
Neg logit contributions
--0.016
by-0.015
 letter-0.015
eg-0.015
 across-0.015
Token the
Token ID262
Feature activation+8.20e-02

Pos logit contributions
 Olive+0.025
 scrub+0.025
 Honey+0.025
 rinse+0.024
aj+0.024
Neg logit contributions
--0.020
by-0.020
eg-0.019
ork-0.019
 letter-0.019
Token best
Token ID1266
Feature activation+3.61e+00

Pos logit contributions
-+0.002
ork+0.002
by+0.002
eg+0.002
ings+0.002
Neg logit contributions
 scrub-0.002
 rinse-0.002
 Olive-0.002
 Honey-0.002
 pur-0.002
Token tower
Token ID10580
Feature activation-2.36e-01

Pos logit contributions
eg+0.103
by+0.097
ables+0.097
rolled+0.093
ings+0.092
Neg logit contributions
 scrub-0.096
 pur-0.088
 rinse-0.088
 Buzz-0.085
 Olive-0.083
Token ever
Token ID1683
Feature activation+5.45e-01

Pos logit contributions
 scrub+0.006
 Olive+0.006
 Sunny+0.006
 Buzz+0.006
 pur+0.006
Neg logit contributions
eg-0.007
rolled-0.006
ables-0.006
by-0.006
ings-0.006
Token.
Token ID13
Feature activation+3.71e-02

Pos logit contributions
eg+0.014
 letter+0.013
ork+0.013
-+0.013
ings+0.013
Neg logit contributions
 Olive-0.016
 scrub-0.016
 Honey-0.016
aj-0.015
 rinse-0.015
Token Mom
Token ID11254
Feature activation-5.92e-01

Pos logit contributions
-+0.001
 across+0.001
 thing+0.001
eg+0.001
ay+0.001
Neg logit contributions
 scrub-0.001
 Olive-0.001
 swollen-0.001
 rinse-0.001
 Honey-0.001
Token says
Token ID1139
Feature activation+5.59e-01

Pos logit contributions
 Olive+0.019
 Honey+0.019
 rinse+0.018
 swollen+0.018
aj+0.018
Neg logit contributions
--0.014
by-0.013
s-0.013
 letter-0.013
 very-0.013
Token cried
Token ID16896
Feature activation+8.85e-01

Pos logit contributions
-+0.017
 very+0.016
s+0.016
 across+0.016
by+0.016
Neg logit contributions
aj-0.013
 Honey-0.013
 Olive-0.013
Sleep-0.013
 scrub-0.013
Token.
Token ID13
Feature activation+4.50e-01

Pos logit contributions
-+0.018
 across+0.016
by+0.016
 very+0.016
 letter+0.016
Neg logit contributions
 Olive-0.032
 Honey-0.031
aj-0.031
 rinse-0.031
 scrub-0.031
Token "
Token ID366
Feature activation+8.57e-01

Pos logit contributions
by+0.012
-+0.012
 letter+0.012
eg+0.011
 photo+0.011
Neg logit contributions
 Olive-0.014
 sob-0.013
 Honey-0.013
 scrub-0.013
 Sunny-0.013
TokenThe
Token ID464
Feature activation-8.21e-02

Pos logit contributions
-+0.027
by+0.026
 letter+0.026
eg+0.025
ings+0.025
Neg logit contributions
 rinse-0.021
 Olive-0.021
 Honey-0.021
 scrub-0.020
 swollen-0.020
Token ost
Token ID23619
Feature activation+1.60e+00

Pos logit contributions
 Olive+0.002
 scrub+0.002
 swollen+0.002
 rinse+0.002
 Honey+0.002
Neg logit contributions
--0.002
by-0.002
ables-0.002
eg-0.002
s-0.002
Tokenric
Token ID1173
Feature activation+3.58e+00

Pos logit contributions
-+0.031
 letter+0.031
 thing+0.030
 across+0.030
by+0.030
Neg logit contributions
 Olive-0.055
aj-0.054
 scrub-0.054
 rinse-0.053
 Honey-0.053
Tokenhes
Token ID956
Feature activation+8.37e-01

Pos logit contributions
by+0.148
-+0.145
 letter+0.144
 photo+0.144
eg+0.143
Neg logit contributions
 scrub-0.045
 Olive-0.045
 rinse-0.044
 Honey-0.044
aj-0.041
Token are
Token ID389
Feature activation-3.83e-01

Pos logit contributions
by+0.022
 wallet+0.022
eg+0.022
 fortune+0.021
 letter+0.021
Neg logit contributions
 rinse-0.023
 Olive-0.023
 scrub-0.022
 pur-0.022
ens-0.022
Token mean
Token ID1612
Feature activation-4.48e-01

Pos logit contributions
 Olive+0.011
 Honey+0.011
 scrub+0.011
 rinse+0.011
aj+0.010
Neg logit contributions
--0.010
by-0.010
 letter-0.010
 across-0.009
eg-0.009
Token!
Token ID0
Feature activation+7.04e-01

Pos logit contributions
 Olive+0.011
 Honey+0.011
 scrub+0.011
 iv+0.011
 rinse+0.011
Neg logit contributions
eg-0.013
ork-0.013
rolled-0.013
spe-0.013
by-0.012
Token They
Token ID1119
Feature activation-1.33e-01

Pos logit contributions
by+0.015
-+0.015
 letter+0.014
 across+0.014
 photo+0.013
Neg logit contributions
 Honey-0.024
 Olive-0.024
 rinse-0.023
 scrub-0.023
aj-0.023
TokenWow
Token ID22017
Feature activation+2.51e-01

Pos logit contributions
aj+0.013
irms+0.011
 scrub+0.011
 Olive+0.011
 rinse+0.011
Neg logit contributions
 letter-0.009
 thing-0.009
 photo-0.009
 very-0.009
--0.009
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
-+0.005
 letter+0.005
by+0.005
eg+0.005
 across+0.005
Neg logit contributions
 Olive-0.008
 Honey-0.008
 scrub-0.008
 rinse-0.008
aj-0.008
Token this
Token ID428
Feature activation+1.10e+00

Pos logit contributions
 Olive+0.009
 Honey+0.008
 rinse+0.008
 Sunny+0.008
 Buzz+0.008
Neg logit contributions
--0.010
ings-0.010
ables-0.010
by-0.010
 across-0.010
Token is
Token ID318
Feature activation-1.50e-01

Pos logit contributions
by+0.026
spe+0.025
eg+0.025
rolled+0.025
ings+0.024
Neg logit contributions
 Olive-0.031
 pur-0.031
 swe-0.031
 iv-0.031
 Honey-0.030
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
 Olive+0.004
 Honey+0.004
 Sunny+0.004
 Buzz+0.004
 Ruby+0.004
Neg logit contributions
by-0.004
--0.004
ables-0.004
ings-0.004
ork-0.004
Token best
Token ID1266
Feature activation+3.57e+00

Pos logit contributions
-+0.025
by+0.024
ables+0.023
ork+0.023
ings+0.023
Neg logit contributions
 Olive-0.031
 Honey-0.030
 rinse-0.030
 scrub-0.029
 pur-0.029
Token summer
Token ID3931
Feature activation+1.45e+00

Pos logit contributions
eg+0.082
ables+0.082
by+0.081
spe+0.075
 across+0.075
Neg logit contributions
 Olive-0.107
 pur-0.103
 scrub-0.103
 Ruby-0.101
 Buzz-0.100
Token ever
Token ID1683
Feature activation+5.17e-01

Pos logit contributions
eg+0.036
by+0.034
rolled+0.033
ables+0.033
ork+0.032
Neg logit contributions
 Olive-0.043
 Honey-0.042
 Sunny-0.042
 iv-0.041
 scrub-0.041
Token!"
Token ID2474
Feature activation-8.50e-01

Pos logit contributions
eg+0.016
 letter+0.015
ables+0.015
ork+0.015
rolled+0.015
Neg logit contributions
 Olive-0.012
 Honey-0.012
 scrub-0.011
aj-0.011
 Buzz-0.011
Token Anna
Token ID11735
Feature activation-6.80e-01

Pos logit contributions
 scrub+0.018
 rinse+0.018
aj+0.018
 Honey+0.018
 Olive+0.018
Neg logit contributions
--0.028
 letter-0.027
by-0.027
 across-0.027
 thing-0.027
Token says
Token ID1139
Feature activation+2.42e-01

Pos logit contributions
 swollen+0.020
 Olive+0.020
 rinse+0.020
 Honey+0.020
 scrub+0.019
Neg logit contributions
--0.018
s-0.017
 very-0.017
 across-0.017
 letter-0.017
Token Ben
Token ID3932
Feature activation-4.99e-01

Pos logit contributions
 scrub+0.015
 Olive+0.015
 Honey+0.015
aj+0.015
 rinse+0.015
Neg logit contributions
--0.026
 letter-0.025
 thing-0.025
eg-0.025
by-0.024
Token,
Token ID11
Feature activation-2.03e-01

Pos logit contributions
 Olive+0.016
 Honey+0.016
 scrub+0.015
aj+0.015
 rinse+0.015
Neg logit contributions
--0.011
by-0.011
eg-0.011
 across-0.011
 letter-0.011
Token this
Token ID428
Feature activation+1.09e+00

Pos logit contributions
 Olive+0.006
 Honey+0.005
 scrub+0.005
 rinse+0.005
aj+0.005
Neg logit contributions
--0.006
 letter-0.005
by-0.005
 across-0.005
 thing-0.005
Token is
Token ID318
Feature activation-1.36e-01

Pos logit contributions
by+0.031
eg+0.031
ables+0.029
 fortune+0.028
ole+0.027
Neg logit contributions
 scrub-0.027
 wipes-0.026
 iv-0.026
 Lu-0.026
 rinse-0.026
Token the
Token ID262
Feature activation+8.33e-01

Pos logit contributions
 scrub+0.004
 Honey+0.003
 Olive+0.003
 rinse+0.003
 Sunny+0.003
Neg logit contributions
by-0.004
ables-0.004
--0.004
ings-0.004
ork-0.004
Token best
Token ID1266
Feature activation+3.52e+00

Pos logit contributions
ables+0.026
by+0.025
s+0.024
eg+0.024
oc+0.023
Neg logit contributions
 scrub-0.021
 rinse-0.021
 ti-0.019
 swe-0.019
 iv-0.019
Token toy
Token ID13373
Feature activation+6.58e-01

Pos logit contributions
ables+0.097
eg+0.096
by+0.090
ifying+0.083
s+0.080
Neg logit contributions
 scrub-0.101
 rinse-0.091
 Lu-0.091
 wipe-0.089
 pur-0.086
Token ever
Token ID1683
Feature activation+7.49e-01

Pos logit contributions
by+0.021
ables+0.021
eg+0.021
ings+0.020
ork+0.020
Neg logit contributions
 scrub-0.013
 iv-0.013
 pur-0.013
 Sunny-0.013
 Ruby-0.013
Token!"
Token ID2474
Feature activation+5.83e-03

Pos logit contributions
eg+0.024
ork+0.022
rolled+0.022
ables+0.022
ings+0.022
Neg logit contributions
 scrub-0.017
 Olive-0.016
aj-0.016
 Honey-0.016
 pur-0.016
Token Sara
Token ID24799
Feature activation-2.95e-01

Pos logit contributions
-+0.000
by+0.000
 letter+0.000
 across+0.000
 thing+0.000
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
 rinse-0.000
aj-0.000
Token said
Token ID531
Feature activation+8.37e-01

Pos logit contributions
 Olive+0.007
 Honey+0.007
 rinse+0.007
 scrub+0.006
aj+0.006
Neg logit contributions
--0.010
by-0.009
 letter-0.009
s-0.009
 across-0.009
Token and
Token ID290
Feature activation+3.68e-01

Pos logit contributions
 Olive+0.004
 Honey+0.004
 rinse+0.004
 scrub+0.004
aj+0.004
Neg logit contributions
--0.003
 across-0.003
 letter-0.003
by-0.003
 very-0.003
Token he
Token ID339
Feature activation+9.16e-01

Pos logit contributions
-+0.010
 across+0.009
by+0.009
 letter+0.009
 very+0.009
Neg logit contributions
 Olive-0.010
 rinse-0.010
aj-0.010
 Honey-0.010
 scrub-0.010
Token saw
Token ID2497
Feature activation+7.24e-02

Pos logit contributions
ings+0.023
-+0.023
 letter+0.023
ay+0.023
 thing+0.022
Neg logit contributions
 scrub-0.027
 Honey-0.026
 Olive-0.026
 pur-0.026
 sob-0.026
Token the
Token ID262
Feature activation+7.80e-01

Pos logit contributions
-+0.002
by+0.002
eg+0.002
ables+0.002
ork+0.002
Neg logit contributions
 Olive-0.002
 Honey-0.002
 Buzz-0.002
 scrub-0.002
 Sunny-0.002
Token t
Token ID256
Feature activation+8.37e-01

Pos logit contributions
-+0.020
by+0.019
 letter+0.019
ork+0.019
eg+0.019
Neg logit contributions
 scrub-0.023
 Honey-0.023
 Olive-0.023
 rinse-0.022
 swollen-0.022
Tokenini
Token ID5362
Feature activation+3.49e+00

Pos logit contributions
 letter+0.023
 language+0.022
 across+0.022
 photo+0.022
 very+0.022
Neg logit contributions
unn-0.022
imi-0.021
aj-0.021
 scrub-0.020
 swollen-0.020
Tokenest
Token ID395
Feature activation+1.66e+00

Pos logit contributions
ork+0.087
ables+0.083
eg+0.083
ings+0.082
spe+0.077
Neg logit contributions
 scrub-0.106
 Buzz-0.104
 Honey-0.104
 pur-0.103
 Olive-0.100
Token pengu
Token ID38373
Feature activation+9.64e-01

Pos logit contributions
ables+0.047
eg+0.046
ork+0.044
 letter+0.043
ay+0.043
Neg logit contributions
 scrub-0.045
 Honey-0.044
 Olive-0.043
 pur-0.043
 iv-0.042
Tokenin
Token ID259
Feature activation+9.24e-01

Pos logit contributions
 letter+0.026
 across+0.026
eg+0.026
by+0.025
 photo+0.025
Neg logit contributions
 scrub-0.026
 Honey-0.025
 Olive-0.025
 rinse-0.025
aj-0.024
Token standing
Token ID5055
Feature activation+2.36e+00

Pos logit contributions
eg+0.026
ay+0.026
ork+0.025
ables+0.024
ings+0.024
Neg logit contributions
 scrub-0.024
 Honey-0.024
 Buzz-0.024
 Olive-0.024
 pur-0.023
Token there
Token ID612
Feature activation+7.52e-02

Pos logit contributions
by+0.064
-+0.064
eg+0.064
 letter+0.063
ork+0.062
Neg logit contributions
 scrub-0.063
 Honey-0.063
 Olive-0.062
 rinse-0.061
aj-0.061
Tokenummy
Token ID13513
Feature activation+4.46e-01

Pos logit contributions
 letter+0.018
-+0.018
 across+0.018
spe+0.018
 photo+0.018
Neg logit contributions
 Olive-0.034
 Honey-0.034
 rinse-0.033
aj-0.033
 pur-0.032
Token!
Token ID0
Feature activation+1.13e+00

Pos logit contributions
eg+0.011
spe+0.011
ork+0.011
 letter+0.011
ables+0.011
Neg logit contributions
 Olive-0.013
 Honey-0.013
 Buzz-0.012
 pur-0.012
 Sunny-0.012
Token This
Token ID770
Feature activation+2.11e+00

Pos logit contributions
eg+0.025
-+0.025
ay+0.025
 thing+0.024
 letter+0.024
Neg logit contributions
 scrub-0.036
 swollen-0.035
aj-0.035
 rinse-0.035
irms-0.034
Token is
Token ID318
Feature activation-4.48e-01

Pos logit contributions
-+0.061
by+0.058
 letter+0.058
 thing+0.057
 across+0.057
Neg logit contributions
 Olive-0.055
 Honey-0.055
 scrub-0.055
 rinse-0.055
aj-0.054
Token the
Token ID262
Feature activation+8.92e-01

Pos logit contributions
 Olive+0.013
 Honey+0.012
 scrub+0.012
 rinse+0.012
 Buzz+0.012
Neg logit contributions
--0.013
by-0.012
eg-0.012
ork-0.012
ables-0.012
Token best
Token ID1266
Feature activation+3.48e+00

Pos logit contributions
-+0.020
by+0.019
 letter+0.019
 thing+0.019
 across+0.019
Neg logit contributions
 Olive-0.029
 scrub-0.029
 Honey-0.029
 rinse-0.028
aj-0.028
Token pear
Token ID25286
Feature activation-3.07e-01

Pos logit contributions
-+0.110
by+0.107
 letter+0.106
eg+0.105
 across+0.104
Neg logit contributions
 Olive-0.081
 Honey-0.080
 scrub-0.079
 rinse-0.078
aj-0.076
Token ever
Token ID1683
Feature activation+4.36e-01

Pos logit contributions
 Olive+0.009
 Honey+0.008
 Buzz+0.008
 Sunny+0.008
 iv+0.008
Neg logit contributions
eg-0.009
rolled-0.008
 letter-0.008
by-0.008
 document-0.008
Token!"
Token ID2474
Feature activation+1.18e-01

Pos logit contributions
eg+0.013
 letter+0.012
ork+0.012
rolled+0.012
ings+0.012
Neg logit contributions
 Olive-0.011
 Honey-0.011
 scrub-0.010
aj-0.010
 Buzz-0.010
Token They
Token ID1119
Feature activation-2.27e-01

Pos logit contributions
by+0.003
 letter+0.003
-+0.003
 photo+0.003
spe+0.003
Neg logit contributions
 Olive-0.004
 scrub-0.004
 sob-0.004
 Honey-0.004
 pur-0.003
Token shared
Token ID4888
Feature activation+1.10e+00

Pos logit contributions
 Olive+0.007
 Honey+0.007
aj+0.007
 scrub+0.007
 rinse+0.007
Neg logit contributions
--0.005
by-0.005
 letter-0.005
 across-0.005
eg-0.005
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
 Olive+0.028
 scrub+0.028
 Honey+0.028
 rinse+0.027
aj+0.027
Neg logit contributions
--0.019
by-0.019
 letter-0.018
eg-0.018
 across-0.018
Token but
Token ID475
Feature activation+1.25e+00

Pos logit contributions
 scrub+0.003
 Olive+0.003
 Honey+0.003
 rinse+0.003
 pur+0.003
Neg logit contributions
by-0.005
eg-0.004
--0.004
ables-0.004
ork-0.004
Token it
Token ID340
Feature activation+1.09e-01

Pos logit contributions
by+0.030
ables+0.030
-+0.029
eg+0.029
ings+0.029
Neg logit contributions
 Olive-0.039
 scrub-0.038
 Honey-0.038
 swollen-0.037
 Buzz-0.037
Token was
Token ID373
Feature activation+3.12e-01

Pos logit contributions
by+0.002
ings+0.002
-+0.002
ables+0.002
 letter+0.002
Neg logit contributions
 scrub-0.004
 Olive-0.004
 Honey-0.004
 pur-0.004
 rinse-0.004
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
-+0.006
by+0.005
 letter+0.005
 across+0.005
eg+0.005
Neg logit contributions
 Olive-0.012
 Honey-0.012
 scrub-0.012
 rinse-0.012
aj-0.011
Token coolest
Token ID38889
Feature activation+3.46e+00

Pos logit contributions
-+0.006
by+0.006
ork+0.006
ables+0.006
ings+0.006
Neg logit contributions
 scrub-0.005
 rinse-0.005
 Olive-0.005
 Honey-0.005
 swollen-0.005
Token car
Token ID1097
Feature activation+8.63e-01

Pos logit contributions
ables+0.082
by+0.069
eg+0.067
ifying+0.066
yn+0.063
Neg logit contributions
 scrub-0.117
 ti-0.104
 Fay-0.102
 swe-0.101
 rinse-0.101
Token he
Token ID339
Feature activation+1.45e+00

Pos logit contributions
by+0.027
ings+0.027
eg+0.026
ables+0.026
 lot+0.026
Neg logit contributions
 Olive-0.019
 scrub-0.018
 Buzz-0.018
 Sunny-0.018
 Ruby-0.018
Token had
Token ID550
Feature activation+1.58e+00

Pos logit contributions
ings+0.049
ay+0.044
eg+0.042
 letter+0.042
 photo+0.042
Neg logit contributions
 scrub-0.034
 Olive-0.033
 seek-0.032
 Honey-0.032
 Buzz-0.032
Token ever
Token ID1683
Feature activation-3.97e-01

Pos logit contributions
eg+0.045
 letter+0.043
by+0.042
ay+0.041
ables+0.040
Neg logit contributions
 scrub-0.040
 Olive-0.039
 seek-0.038
 Buzz-0.037
Sleep-0.037
Token seen
Token ID1775
Feature activation-4.98e-01

Pos logit contributions
 scrub+0.010
 pur+0.009
aj+0.009
Sleep+0.009
 sob+0.009
Neg logit contributions
ay-0.012
eg-0.012
ings-0.012
ork-0.012
 letter-0.012
Token "
Token ID366
Feature activation+6.99e-01

Pos logit contributions
aj+0.000
 rinse+0.000
 Olive+0.000
irms+0.000
 swollen+0.000
Neg logit contributions
--0.000
 very-0.000
 across-0.000
 letter-0.000
s-0.000
TokenThe
Token ID464
Feature activation+1.12e+00

Pos logit contributions
-+0.022
eg+0.022
by+0.022
ings+0.022
 letter+0.022
Neg logit contributions
 Honey-0.016
 Olive-0.016
 scrub-0.015
Sleep-0.015
 rinse-0.015
Token cave
Token ID11527
Feature activation-1.24e-01

Pos logit contributions
-+0.025
 thing+0.024
 letter+0.024
 very+0.023
 across+0.023
Neg logit contributions
 Olive-0.038
 Honey-0.038
 scrub-0.037
 rinse-0.037
aj-0.037
Token is
Token ID318
Feature activation+2.99e-01

Pos logit contributions
 Honey+0.004
 iv+0.004
 scrub+0.004
 pur+0.004
 Olive+0.004
Neg logit contributions
eg-0.003
by-0.003
 photo-0.003
 across-0.003
rolled-0.003
Token the
Token ID262
Feature activation+4.67e-01

Pos logit contributions
-+0.007
by+0.007
eg+0.007
 letter+0.006
ings+0.006
Neg logit contributions
 Olive-0.010
 Honey-0.009
 scrub-0.009
 rinse-0.009
aj-0.009
Token best
Token ID1266
Feature activation+3.45e+00

Pos logit contributions
-+0.010
 letter+0.010
 thing+0.010
 across+0.010
 very+0.009
Neg logit contributions
 Olive-0.016
 Honey-0.016
 scrub-0.016
 rinse-0.015
aj-0.015
Token shelter
Token ID11772
Feature activation+3.96e-01

Pos logit contributions
-+0.072
by+0.070
eg+0.069
 letter+0.068
 across+0.066
Neg logit contributions
 Olive-0.117
 Honey-0.116
 scrub-0.115
 rinse-0.113
aj-0.111
Token."
Token ID526
Feature activation-3.88e-02

Pos logit contributions
eg+0.012
by+0.011
rolled+0.011
ork+0.010
ings+0.010
Neg logit contributions
 Honey-0.010
 Olive-0.010
 Sunny-0.010
 Buzz-0.010
 iv-0.010
Token So
Token ID1406
Feature activation-2.60e-01

Pos logit contributions
 swollen+0.001
 rinse+0.001
aj+0.001
 Olive+0.001
 scrub+0.001
Neg logit contributions
--0.001
 across-0.001
 very-0.001
 thing-0.001
s-0.001
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
 Olive+0.008
 Honey+0.008
 scrub+0.008
 rinse+0.008
 Buzz+0.008
Neg logit contributions
--0.006
eg-0.006
by-0.006
 letter-0.006
ork-0.006
Token it
Token ID340
Feature activation-7.11e-01

Pos logit contributions
 Honey+0.008
 Olive+0.008
 scrub+0.008
 rinse+0.008
 Buzz+0.007
Neg logit contributions
--0.006
by-0.006
eg-0.006
ork-0.006
 letter-0.006
Token:
Token ID25
Feature activation+3.08e-01

Pos logit contributions
eg+0.018
ay+0.017
ables+0.017
by+0.017
 photo+0.017
Neg logit contributions
 Olive-0.017
 Honey-0.016
 scrub-0.016
 til-0.016
 Buzz-0.016
Token '
Token ID705
Feature activation+5.90e-01

Pos logit contributions
-+0.008
by+0.008
 letter+0.007
eg+0.007
ay+0.007
Neg logit contributions
 Olive-0.009
 Honey-0.009
 Buzz-0.009
 scrub-0.009
 pur-0.009
TokenI
Token ID40
Feature activation+6.25e-01

Pos logit contributions
-+0.018
 letter+0.018
eg+0.018
by+0.018
ork+0.018
Neg logit contributions
 scrub-0.014
 Honey-0.014
 Olive-0.014
 rinse-0.013
Sleep-0.013
Token'm
Token ID1101
Feature activation-4.13e-02

Pos logit contributions
 letter+0.015
-+0.015
eg+0.014
 across+0.014
ings+0.014
Neg logit contributions
 scrub-0.019
 rinse-0.019
 Honey-0.018
 Olive-0.018
aj-0.018
Token the
Token ID262
Feature activation+6.98e-01

Pos logit contributions
 Olive+0.001
aj+0.001
 Honey+0.001
Sleep+0.001
 rinse+0.001
Neg logit contributions
--0.001
 very-0.001
 across-0.001
 letter-0.001
 thing-0.001
Token best
Token ID1266
Feature activation+3.44e+00

Pos logit contributions
-+0.019
ings+0.018
ork+0.018
by+0.018
eg+0.017
Neg logit contributions
 Olive-0.020
 scrub-0.020
 Honey-0.019
 rinse-0.019
 swollen-0.019
Token crane
Token ID41175
Feature activation+3.74e-01

Pos logit contributions
by+0.078
eg+0.078
ings+0.074
ables+0.074
-+0.074
Neg logit contributions
 scrub-0.108
 Olive-0.105
 rinse-0.103
 Honey-0.103
 pur-0.102
Token around
Token ID1088
Feature activation+4.91e-01

Pos logit contributions
eg+0.010
ork+0.010
rolled+0.010
ings+0.010
 photo+0.009
Neg logit contributions
 Buzz-0.010
 Olive-0.010
 Honey-0.010
 Sunny-0.010
 Ruby-0.009
Token!'
Token ID13679
Feature activation+7.24e-01

Pos logit contributions
eg+0.014
by+0.013
ork+0.013
-+0.012
 wallet+0.012
Neg logit contributions
 Honey-0.014
 Olive-0.013
 Buzz-0.013
 Sunny-0.013
 scrub-0.013
Token Then
Token ID3244
Feature activation+1.25e-01

Pos logit contributions
 letter+0.019
eg+0.019
 photo+0.019
-+0.019
by+0.019
Neg logit contributions
 Olive-0.020
 Honey-0.020
 sob-0.019
 scrub-0.019
 Buzz-0.019
Token he
Token ID339
Feature activation+2.03e-01

Pos logit contributions
ork+0.004
eg+0.004
ay+0.004
-+0.004
 letter+0.004
Neg logit contributions
 Honey-0.003
 Olive-0.003
 Buzz-0.003
 scrub-0.003
 Holly-0.003
Token He
Token ID679
Feature activation+8.76e-01

Pos logit contributions
 letter+0.012
-+0.012
by+0.012
 photo+0.011
eg+0.011
Neg logit contributions
 Olive-0.014
 Honey-0.013
 scrub-0.013
 rinse-0.013
 Buzz-0.013
Token thinks
Token ID6834
Feature activation-4.10e-01

Pos logit contributions
-+0.025
by+0.024
s+0.023
 letter+0.023
 very+0.023
Neg logit contributions
 Honey-0.025
Sleep-0.024
 Olive-0.024
 rinse-0.023
 swollen-0.023
Token it
Token ID340
Feature activation+4.94e-01

Pos logit contributions
 scrub+0.014
 Olive+0.013
 rinse+0.013
 Honey+0.013
 Buzz+0.013
Neg logit contributions
by-0.009
ables-0.009
spe-0.009
--0.009
eg-0.009
Token is
Token ID318
Feature activation+5.26e-01

Pos logit contributions
 photo+0.010
spe+0.010
ork+0.010
-+0.010
 letter+0.010
Neg logit contributions
 Olive-0.016
 pur-0.016
 Honey-0.016
 scrub-0.016
 rinse-0.015
Token the
Token ID262
Feature activation+5.16e-01

Pos logit contributions
-+0.015
by+0.014
ork+0.014
ables+0.014
eg+0.014
Neg logit contributions
 Olive-0.015
 Honey-0.014
 scrub-0.014
 rinse-0.014
 swollen-0.014
Token best
Token ID1266
Feature activation+3.43e+00

Pos logit contributions
ork+0.016
-+0.016
ables+0.015
rolled+0.014
by+0.014
Neg logit contributions
 rinse-0.013
 scrub-0.013
 swe-0.012
 Olive-0.012
 thy-0.012
Token cube
Token ID23441
Feature activation+9.34e-01

Pos logit contributions
ables+0.097
ifying+0.092
ork+0.083
eg+0.082
rolled+0.082
Neg logit contributions
 scrub-0.089
 swe-0.078
 wipe-0.076
onna-0.076
 rinse-0.075
Token ever
Token ID1683
Feature activation+6.59e-01

Pos logit contributions
rolled+0.028
ables+0.025
eg+0.025
ings+0.025
ork+0.025
Neg logit contributions
 Olive-0.024
 Buzz-0.023
 scrub-0.023
ens-0.022
 rinse-0.022
Token.
Token ID13
Feature activation+3.72e-02

Pos logit contributions
eg+0.017
ork+0.016
-+0.016
 letter+0.016
by+0.016
Neg logit contributions
 Olive-0.019
 scrub-0.019
 Honey-0.019
aj-0.018
 rinse-0.018
Token He
Token ID679
Feature activation+6.07e-01

Pos logit contributions
-+0.001
by+0.001
 letter+0.001
eg+0.001
 across+0.001
Neg logit contributions
 Olive-0.001
 Honey-0.001
 scrub-0.001
 rinse-0.001
aj-0.001
Token says
Token ID1139
Feature activation+4.84e-01

Pos logit contributions
-+0.018
by+0.017
 letter+0.016
 very+0.016
s+0.016
Neg logit contributions
 Honey-0.017
 Olive-0.016
Sleep-0.016
 swollen-0.016
aj-0.015
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
ables+0.028
eg+0.028
 letter+0.028
by+0.028
ay+0.028
Neg logit contributions
 scrub-0.032
 Olive-0.031
 Honey-0.031
 sob-0.030
 pur-0.030
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
-+0.012
 across+0.011
 thing+0.011
 very+0.011
eg+0.011
Neg logit contributions
aj-0.019
 rinse-0.019
 scrub-0.019
 Honey-0.019
 Olive-0.019
TokenIt
Token ID1026
Feature activation+4.11e-01

Pos logit contributions
by+0.040
-+0.039
ings+0.037
eg+0.037
 letter+0.037
Neg logit contributions
 Honey-0.025
 scrub-0.024
Sleep-0.024
 rinse-0.024
 Olive-0.023
Token is
Token ID318
Feature activation-7.35e-02

Pos logit contributions
 photo+0.009
by+0.009
ork+0.009
ings+0.009
eg+0.009
Neg logit contributions
 Honey-0.013
 pur-0.013
 scrub-0.013
 rinse-0.013
 Olive-0.013
Token the
Token ID262
Feature activation+5.95e-01

Pos logit contributions
 scrub+0.002
 Honey+0.002
 Olive+0.002
 rinse+0.002
 pur+0.002
Neg logit contributions
by-0.002
--0.002
ork-0.002
eg-0.002
ables-0.002
Token best
Token ID1266
Feature activation+3.42e+00

Pos logit contributions
-+0.016
by+0.015
ork+0.015
s+0.015
ings+0.015
Neg logit contributions
 scrub-0.017
 rinse-0.017
 Olive-0.017
 Honey-0.016
 pur-0.016
Token tower
Token ID10580
Feature activation-1.46e-01

Pos logit contributions
eg+0.095
by+0.094
ables+0.091
s+0.088
yn+0.086
Neg logit contributions
 scrub-0.090
 Buzz-0.085
 pur-0.085
 Olive-0.082
 rinse-0.081
Token ever
Token ID1683
Feature activation+1.59e-01

Pos logit contributions
 scrub+0.004
 Sunny+0.004
 Olive+0.004
 Buzz+0.003
 iv+0.003
Neg logit contributions
eg-0.004
rolled-0.004
by-0.004
ings-0.004
ables-0.004
Token!"
Token ID2474
Feature activation+5.68e-01

Pos logit contributions
eg+0.004
-+0.004
by+0.004
ork+0.004
 letter+0.004
Neg logit contributions
 Olive-0.004
 scrub-0.004
 Honey-0.004
aj-0.004
 rinse-0.004
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
-+0.018
 across+0.017
 thing+0.017
 very+0.017
ork+0.017
Neg logit contributions
 swollen-0.014
 Honey-0.013
 rinse-0.013
 Olive-0.013
 scrub-0.013
Token
Token ID198
Feature activation-1.86e-01

Pos logit contributions
-+0.005
by+0.004
 across+0.004
ork+0.004
ings+0.004
Neg logit contributions
 Honey-0.008
 Olive-0.008
 swollen-0.008
 scrub-0.008
 rinse-0.008
Token to
Token ID284
Feature activation+4.32e-01

Pos logit contributions
eg+0.017
 letter+0.016
ork+0.016
ables+0.016
by+0.015
Neg logit contributions
 scrub-0.021
 Olive-0.021
 Honey-0.021
 rinse-0.020
 Holly-0.020
Token play
Token ID711
Feature activation+1.02e-01

Pos logit contributions
-+0.012
by+0.011
eg+0.011
 letter+0.011
ork+0.011
Neg logit contributions
 Olive-0.012
 Honey-0.012
 scrub-0.012
 rinse-0.012
 pur-0.012
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
-+0.002
 across+0.002
by+0.002
 letter+0.002
 thing+0.002
Neg logit contributions
 Olive-0.004
 Honey-0.003
 scrub-0.003
 rinse-0.003
aj-0.003
Token explore
Token ID7301
Feature activation-1.25e-02

Pos logit contributions
-+0.039
eg+0.038
by+0.038
 letter+0.037
ork+0.037
Neg logit contributions
 Olive-0.034
 scrub-0.034
 Honey-0.034
 rinse-0.033
 pur-0.032
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
 Olive+0.000
 Honey+0.000
 rinse+0.000
 scrub+0.000
aj+0.000
Neg logit contributions
--0.000
by-0.000
 across-0.000
 letter-0.000
 thing-0.000
Token freshly
Token ID29026
Feature activation+3.42e+00

Pos logit contributions
-+0.006
 across+0.005
 thing+0.005
 letter+0.005
 very+0.005
Neg logit contributions
 Olive-0.008
 Honey-0.008
aj-0.008
 rinse-0.008
 scrub-0.008
Token organized
Token ID8389
Feature activation+1.38e+00

Pos logit contributions
eg+0.086
by+0.081
spe+0.080
 letter+0.080
ings+0.079
Neg logit contributions
 Olive-0.102
 scrub-0.099
 Honey-0.098
 pur-0.097
 rinse-0.095
Token lawn
Token ID21133
Feature activation+1.38e+00

Pos logit contributions
eg+0.038
-+0.038
by+0.037
 letter+0.036
ork+0.036
Neg logit contributions
 Olive-0.038
 scrub-0.037
 Honey-0.037
 rinse-0.036
aj-0.035
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
eg+0.035
ork+0.034
-+0.034
by+0.034
 letter+0.034
Neg logit contributions
 Olive-0.040
 scrub-0.040
 Honey-0.040
 Buzz-0.038
 rinse-0.038
Token<|endoftext|>
Token ID50256
Feature activation-3.46e-01

Pos logit contributions
 Olive+0.008
 Honey+0.008
 Buzz+0.008
 Holly+0.008
 Ruby+0.008
Neg logit contributions
--0.012
 letter-0.012
 wallet-0.011
 thing-0.011
 photo-0.011
TokenOnce
Token ID7454
Feature activation-1.19e+00

Pos logit contributions
 Olive+0.010
 Honey+0.010
 scrub+0.009
 rinse+0.009
aj+0.009
Neg logit contributions
--0.009
 letter-0.009
by-0.009
eg-0.009
 thing-0.009
Token The
Token ID383
Feature activation-1.88e-01

Pos logit contributions
 scrub+0.010
 Olive+0.010
aj+0.010
 rinse+0.010
 Honey+0.010
Neg logit contributions
--0.011
 across-0.011
ork-0.011
 letter-0.010
eg-0.010
Token driver
Token ID4639
Feature activation+5.72e-01

Pos logit contributions
 scrub+0.006
 swollen+0.006
 sob+0.006
 rinse+0.006
 pur+0.006
Neg logit contributions
by-0.005
eg-0.004
ables-0.004
ings-0.004
--0.004
Token and
Token ID290
Feature activation-3.05e-01

Pos logit contributions
ings+0.016
ables+0.016
by+0.015
 photo+0.015
eg+0.015
Neg logit contributions
 scrub-0.016
 sob-0.016
 pur-0.015
 rinse-0.015
 kissed-0.014
Token the
Token ID262
Feature activation+3.04e-01

Pos logit contributions
 Olive+0.005
 Honey+0.005
 scrub+0.004
 rinse+0.004
aj+0.004
Neg logit contributions
--0.012
by-0.012
 letter-0.012
 across-0.012
 thing-0.012
Token paramed
Token ID34581
Feature activation+1.21e+00

Pos logit contributions
by+0.007
eg+0.007
-+0.006
ables+0.006
ings+0.006
Neg logit contributions
 scrub-0.010
 swollen-0.010
 sob-0.010
 pur-0.010
 rinse-0.010
Tokenic
Token ID291
Feature activation+3.41e+00

Pos logit contributions
 across+0.034
eg+0.031
 photo+0.030
 very+0.030
 letter+0.030
Neg logit contributions
Fine-0.033
aj-0.033
 sob-0.031
Sleep-0.030
ens-0.030
Token jumped
Token ID11687
Feature activation+3.22e-01

Pos logit contributions
 across+0.093
ables+0.091
 photo+0.085
eg+0.084
by+0.083
Neg logit contributions
 scrub-0.094
 sob-0.089
ens-0.087
 swollen-0.083
 kissed-0.082
Token out
Token ID503
Feature activation+1.20e+00

Pos logit contributions
by+0.008
eg+0.008
 letter+0.007
 wallet+0.007
 photo+0.007
Neg logit contributions
 scrub-0.010
 Olive-0.010
 Honey-0.009
 rinse-0.009
 sob-0.009
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
eg+0.027
by+0.027
ables+0.027
 wallet+0.026
 letter+0.026
Neg logit contributions
 scrub-0.039
 rinse-0.038
 Honey-0.037
 Olive-0.037
 pur-0.037
Token They
Token ID1119
Feature activation-5.25e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
aj+0.001
Neg logit contributions
--0.001
by-0.000
 letter-0.000
 across-0.000
eg-0.000
Token helped
Token ID4193
Feature activation-8.49e-02

Pos logit contributions
 Olive+0.014
 Honey+0.014
aj+0.014
 scrub+0.014
 rinse+0.014
Neg logit contributions
--0.015
by-0.014
 letter-0.014
 across-0.014
eg-0.014
Token the
Token ID262
Feature activation+9.43e-02

Pos logit contributions
eg+0.012
ables+0.011
ay+0.011
 letter+0.011
ork+0.011
Neg logit contributions
 scrub-0.011
 Honey-0.011
 Olive-0.011
 Buzz-0.011
 pur-0.010
Token church
Token ID4928
Feature activation+7.07e-01

Pos logit contributions
eg+0.003
ings+0.003
by+0.002
ay+0.002
rolled+0.002
Neg logit contributions
 Honey-0.003
 pur-0.003
 scrub-0.003
Sleep-0.003
 iv-0.003
Token and
Token ID290
Feature activation+4.30e-01

Pos logit contributions
eg+0.018
by+0.017
ay+0.017
 letter+0.016
-+0.016
Neg logit contributions
 scrub-0.021
 Honey-0.020
 Olive-0.020
 pur-0.020
aj-0.020
Token noticed
Token ID6810
Feature activation+3.98e-01

Pos logit contributions
-+0.011
ings+0.011
eg+0.011
ay+0.011
 letter+0.011
Neg logit contributions
 scrub-0.012
 Honey-0.012
 Olive-0.012
 rinse-0.011
 pur-0.011
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
eg+0.011
ables+0.011
by+0.011
ork+0.010
ay+0.010
Neg logit contributions
 Olive-0.011
 Honey-0.011
 scrub-0.011
 Buzz-0.010
 Sunny-0.010
Token freshly
Token ID29026
Feature activation+3.41e+00

Pos logit contributions
rolled+0.028
eg+0.027
ings+0.027
attered+0.027
ables+0.026
Neg logit contributions
 pur-0.029
 swollen-0.029
 scrub-0.028
 sob-0.027
 Honey-0.027
Token dug
Token ID18735
Feature activation+1.25e-01

Pos logit contributions
eg+0.079
spe+0.077
ables+0.076
ings+0.073
s+0.073
Neg logit contributions
 scrub-0.107
 Olive-0.104
 pur-0.104
Sleep-0.101
 Honey-0.101
Token hole
Token ID7604
Feature activation-1.24e+00

Pos logit contributions
eg+0.003
by+0.003
 photo+0.003
 letter+0.003
ables+0.003
Neg logit contributions
 scrub-0.004
 iv-0.004
 Honey-0.003
 rinse-0.003
 pur-0.003
Token in
Token ID287
Feature activation+1.34e+00

Pos logit contributions
 scrub+0.038
 rinse+0.035
 Honey+0.035
 Olive+0.035
 swollen+0.035
Neg logit contributions
eg-0.032
by-0.030
ables-0.029
 letter-0.029
ork-0.029
Token the
Token ID262
Feature activation+3.81e-01

Pos logit contributions
eg+0.041
ables+0.041
by+0.039
-+0.039
ay+0.038
Neg logit contributions
 scrub-0.036
 Olive-0.034
 Honey-0.034
 Buzz-0.033
 rinse-0.033
Token ground
Token ID2323
Feature activation-3.91e-01

Pos logit contributions
ings+0.012
eg+0.012
-+0.012
by+0.012
rolled+0.011
Neg logit contributions
 Honey-0.009
 iv-0.009
 Olive-0.009
 scrub-0.009
 rinse-0.009
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 Olive+0.006
 Honey+0.006
 scrub+0.006
 Buzz+0.006
 Sunny+0.006
Neg logit contributions
eg-0.007
by-0.007
 letter-0.007
ings-0.007
ork-0.007
Token The
Token ID383
Feature activation-1.11e-01

Pos logit contributions
 scrub+0.005
 rinse+0.005
 Olive+0.005
aj+0.005
 swollen+0.005
Neg logit contributions
--0.002
 across-0.002
 letter-0.002
eg-0.002
 thing-0.002
Token oct
Token ID19318
Feature activation-8.36e-03

Pos logit contributions
 Honey+0.002
 Olive+0.002
 scrub+0.002
 Buzz+0.002
 rinse+0.002
Neg logit contributions
ings-0.004
ork-0.004
ables-0.004
--0.004
by-0.003
Tokenopus
Token ID25790
Feature activation-3.73e-02

Pos logit contributions
 Buzz+0.000
 Olive+0.000
aj+0.000
 Honey+0.000
 Holly+0.000
Neg logit contributions
 letter-0.000
by-0.000
 across-0.000
 thing-0.000
ings-0.000
Token had
Token ID550
Feature activation-3.48e-01

Pos logit contributions
 scrub+0.001
 Olive+0.001
 pur+0.001
 Honey+0.001
 Buzz+0.001
Neg logit contributions
ings-0.001
ork-0.001
 letter-0.001
ay-0.001
eg-0.001
Token eight
Token ID3624
Feature activation-3.41e+00

Pos logit contributions
 pur+0.010
 Buzz+0.009
 Olive+0.009
 Sunny+0.009
 scrub+0.009
Neg logit contributions
ings-0.010
--0.010
ork-0.009
by-0.009
spe-0.009
Token arms
Token ID5101
Feature activation-6.83e-01

Pos logit contributions
 Olive+0.091
 Honey+0.090
 scrub+0.089
 rinse+0.087
aj+0.087
Neg logit contributions
--0.095
 letter-0.093
by-0.092
ork-0.090
eg-0.090
Token and
Token ID290
Feature activation-7.82e-01

Pos logit contributions
 scrub+0.020
 Olive+0.020
 Honey+0.020
 pur+0.019
 Buzz+0.019
Neg logit contributions
 letter-0.016
ork-0.016
eg-0.016
by-0.016
--0.015
Token could
Token ID714
Feature activation-1.44e-01

Pos logit contributions
 Olive+0.023
 Honey+0.023
 Buzz+0.023
 scrub+0.023
 pur+0.023
Neg logit contributions
--0.019
ork-0.019
ings-0.019
ay-0.019
 letter-0.019
Token swim
Token ID9422
Feature activation-4.69e-01

Pos logit contributions
 Olive+0.004
 Honey+0.004
aj+0.004
 scrub+0.004
 rinse+0.004
Neg logit contributions
--0.004
 across-0.004
 letter-0.004
by-0.004
 very-0.004
Token very
Token ID845
Feature activation-3.57e-01

Pos logit contributions
 Buzz+0.011
 Olive+0.011
 scrub+0.011
 Sunny+0.011
 Honey+0.011
Neg logit contributions
 letter-0.014
 wallet-0.014
eg-0.014
 photo-0.013
ork-0.013
Token the
Token ID262
Feature activation-3.24e-01

Pos logit contributions
 Olive+0.019
 Honey+0.019
 scrub+0.018
 rinse+0.018
aj+0.018
Neg logit contributions
--0.014
by-0.014
 letter-0.014
eg-0.014
 across-0.013
Token roses
Token ID42152
Feature activation+1.68e-01

Pos logit contributions
 Olive+0.007
 Honey+0.007
 scrub+0.007
 rinse+0.007
aj+0.007
Neg logit contributions
--0.011
by-0.010
 letter-0.010
eg-0.010
 across-0.010
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
-+0.004
 letter+0.003
 across+0.003
 very+0.003
 thing+0.003
Neg logit contributions
 Olive-0.006
 scrub-0.006
aj-0.006
 Honey-0.006
 rinse-0.006
Token They
Token ID1119
Feature activation-6.77e-01

Pos logit contributions
ay+0.005
-+0.004
 very+0.004
ings+0.004
 across+0.004
Neg logit contributions
 scrub-0.011
 swollen-0.010
aj-0.010
Fine-0.010
 til-0.010
Token have
Token ID423
Feature activation-8.33e-01

Pos logit contributions
 Olive+0.022
 Honey+0.022
 scrub+0.022
aj+0.022
 rinse+0.021
Neg logit contributions
--0.015
by-0.014
 letter-0.014
 across-0.014
 thing-0.014
Token th
Token ID294
Feature activation-3.41e+00

Pos logit contributions
 Honey+0.027
 Olive+0.027
 scrub+0.026
 rinse+0.026
aj+0.026
Neg logit contributions
--0.019
 across-0.018
 very-0.018
by-0.018
 letter-0.018
Tokenorns
Token ID19942
Feature activation-1.77e+00

Pos logit contributions
 rinse+0.075
 scrub+0.073
 sob+0.072
 Olive+0.072
Sleep+0.072
Neg logit contributions
--0.118
ings-0.112
by-0.108
ables-0.106
 across-0.103
Token.
Token ID13
Feature activation-3.80e-02

Pos logit contributions
 Olive+0.065
 Honey+0.065
 scrub+0.064
 rinse+0.064
aj+0.063
Neg logit contributions
--0.033
by-0.030
 letter-0.030
 across-0.030
 thing-0.029
Token They
Token ID1119
Feature activation-8.56e-01

Pos logit contributions
 scrub+0.002
 swollen+0.002
aj+0.002
 til+0.002
Fine+0.002
Neg logit contributions
ay-0.000
 very-0.000
 across-0.000
eg-0.000
 letter-0.000
Token can
Token ID460
Feature activation-1.22e+00

Pos logit contributions
aj+0.028
 Olive+0.028
 Honey+0.028
 scrub+0.028
Sleep+0.027
Neg logit contributions
--0.019
 letter-0.018
by-0.018
 very-0.017
 across-0.017
Token hurt
Token ID5938
Feature activation-6.28e-01

Pos logit contributions
aj+0.044
Sleep+0.041
irms+0.040
Acc+0.039
Fine+0.038
Neg logit contributions
 very-0.029
s-0.027
 lot-0.026
--0.025
 across-0.025
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
aj+0.003
 rinse+0.003
 Honey+0.003
 Olive+0.003
 swollen+0.003
Neg logit contributions
--0.001
 across-0.001
 very-0.001
by-0.001
 thing-0.001
Token had
Token ID550
Feature activation-1.50e-01

Pos logit contributions
 across+0.007
 very+0.006
-+0.006
by+0.006
s+0.006
Neg logit contributions
aj-0.008
 Deal-0.008
 swollen-0.008
Sleep-0.008
 rinse-0.008
Token fun
Token ID1257
Feature activation+3.66e-01

Pos logit contributions
 Olive+0.005
 Honey+0.005
 pur+0.005
 scrub+0.005
 Buzz+0.005
Neg logit contributions
--0.003
eg-0.003
ings-0.003
 letter-0.003
ork-0.003
Token,
Token ID11
Feature activation-1.33e-01

Pos logit contributions
-+0.008
 across+0.007
 thing+0.007
 very+0.007
by+0.006
Neg logit contributions
 Honey-0.014
 Olive-0.014
 rinse-0.013
 swollen-0.013
 Midnight-0.013
Token but
Token ID475
Feature activation+1.05e+00

Pos logit contributions
 Olive+0.003
 Honey+0.003
 scrub+0.003
 rinse+0.003
aj+0.003
Neg logit contributions
--0.004
by-0.004
 across-0.004
 letter-0.004
 thing-0.004
Token Fif
Token ID12830
Feature activation-3.38e+00

Pos logit contributions
-+0.027
 across+0.025
 thing+0.025
by+0.025
 very+0.025
Neg logit contributions
aj-0.031
 rinse-0.031
 scrub-0.031
 Honey-0.031
 Olive-0.030
Tokeni
Token ID72
Feature activation+7.40e-01

Pos logit contributions
 rinse+0.115
 Deal+0.112
Sleep+0.111
 Midnight+0.109
Aud+0.108
Neg logit contributions
by-0.087
--0.085
s-0.072
ings-0.070
y-0.064
Token noticed
Token ID6810
Feature activation+3.12e-02

Pos logit contributions
-+0.015
 letter+0.015
by+0.014
eg+0.014
 across+0.014
Neg logit contributions
 Olive-0.025
 Honey-0.025
 scrub-0.025
 rinse-0.024
aj-0.024
Token something
Token ID1223
Feature activation-3.41e-01

Pos logit contributions
-+0.001
 across+0.001
 thing+0.001
by+0.001
 very+0.001
Neg logit contributions
 rinse-0.001
aj-0.001
 scrub-0.001
 Honey-0.001
 Olive-0.001
Token strange
Token ID6283
Feature activation-3.07e-01

Pos logit contributions
 Olive+0.011
 scrub+0.011
 Honey+0.011
 rinse+0.011
 iv+0.011
Neg logit contributions
eg-0.008
by-0.008
ables-0.008
ay-0.007
--0.007
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 Olive+0.010
 Honey+0.010
 scrub+0.010
 rinse+0.010
 iv+0.009
Neg logit contributions
--0.007
eg-0.007
by-0.007
 letter-0.007
ables-0.006
TokenL
Token ID43
Feature activation+1.63e-02

Pos logit contributions
 scrub+0.009
 rinse+0.009
 swollen+0.009
aj+0.009
 Olive+0.008
Neg logit contributions
--0.006
 across-0.006
s-0.005
ay-0.005
 thing-0.005
Tokenily
Token ID813
Feature activation-1.36e+00

Pos logit contributions
spe+0.001
 across+0.001
ables+0.001
eg+0.001
 thing+0.001
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
aj-0.000
 rinse-0.000
Token liked
Token ID8288
Feature activation-9.26e-01

Pos logit contributions
 swollen+0.051
 Olive+0.051
 rinse+0.050
 Honey+0.050
 scrub+0.050
Neg logit contributions
--0.026
by-0.024
 across-0.022
 letter-0.022
 thing-0.022
Token to
Token ID284
Feature activation-2.48e-01

Pos logit contributions
 Olive+0.042
 Honey+0.042
 rinse+0.042
aj+0.041
 scrub+0.041
Neg logit contributions
--0.009
 letter-0.008
 across-0.008
by-0.008
 thing-0.007
Token keep
Token ID1394
Feature activation-9.15e-01

Pos logit contributions
 Olive+0.007
 Honey+0.006
 scrub+0.006
 rinse+0.006
aj+0.006
Neg logit contributions
--0.007
by-0.007
 letter-0.007
 across-0.007
eg-0.007
Token cater
Token ID20825
Feature activation-3.33e+00

Pos logit contributions
 Olive+0.026
 Honey+0.026
 rinse+0.026
 scrub+0.025
aj+0.025
Neg logit contributions
--0.025
 letter-0.023
 across-0.023
by-0.023
 very-0.023
Tokenpill
Token ID27215
Feature activation-1.90e+00

Pos logit contributions
 Deal+0.095
 scrub+0.091
 rinse+0.090
aj+0.089
 seventy+0.088
Neg logit contributions
--0.110
s-0.106
ay-0.092
ables-0.091
by-0.089
Tokenars
Token ID945
Feature activation-3.65e-01

Pos logit contributions
aj+0.074
 Olive+0.073
 scrub+0.073
 Honey+0.073
 swollen+0.072
Neg logit contributions
--0.035
 very-0.032
ables-0.031
s-0.031
ork-0.029
Token in
Token ID287
Feature activation-5.18e-01

Pos logit contributions
 Olive+0.012
 Honey+0.012
 scrub+0.012
 rinse+0.012
aj+0.012
Neg logit contributions
--0.007
by-0.007
eg-0.007
 letter-0.007
ork-0.007
Token a
Token ID257
Feature activation-1.91e+00

Pos logit contributions
 Olive+0.014
 Honey+0.014
 scrub+0.014
 rinse+0.013
aj+0.013
Neg logit contributions
--0.015
by-0.014
 letter-0.014
 across-0.014
eg-0.014
Token jar
Token ID17379
Feature activation-1.19e+00

Pos logit contributions
 Olive+0.060
 rinse+0.059
 Honey+0.058
 scrub+0.058
 pur+0.057
Neg logit contributions
ings-0.044
eg-0.043
--0.043
by-0.042
rolled-0.042
Token to
Token ID284
Feature activation+1.48e-01

Pos logit contributions
 scrub+0.042
 Olive+0.042
 pur+0.041
 Honey+0.041
 Sunny+0.040
Neg logit contributions
ork-0.023
ings-0.023
eg-0.022
ay-0.021
ables-0.021
Token cook
Token ID4255
Feature activation+8.95e-02

Pos logit contributions
-+0.004
 letter+0.004
by+0.004
eg+0.004
 thing+0.004
Neg logit contributions
 Olive-0.004
 Honey-0.004
aj-0.004
 scrub-0.004
 swollen-0.004
Token together
Token ID1978
Feature activation-1.17e-01

Pos logit contributions
by+0.002
spe+0.002
eg+0.002
ork+0.002
rolled+0.002
Neg logit contributions
 scrub-0.003
 Honey-0.003
 rinse-0.002
 Olive-0.002
 pur-0.002
Token in
Token ID287
Feature activation-6.34e-01

Pos logit contributions
 scrub+0.004
 Honey+0.003
 Olive+0.003
 rinse+0.003
 Sunny+0.003
Neg logit contributions
eg-0.003
by-0.003
ork-0.003
--0.003
 photo-0.003
Token their
Token ID511
Feature activation-1.30e+00

Pos logit contributions
 scrub+0.016
 rinse+0.015
 Honey+0.015
 Olive+0.014
 pur+0.013
Neg logit contributions
eg-0.021
by-0.020
ork-0.020
ables-0.020
--0.019
Token big
Token ID1263
Feature activation-3.31e+00

Pos logit contributions
 rinse+0.033
 scrub+0.032
 Honey+0.032
 swe+0.032
aj+0.032
Neg logit contributions
eg-0.039
ings-0.038
spe-0.038
ork-0.037
rolled-0.036
Token kitchen
Token ID9592
Feature activation-2.36e-02

Pos logit contributions
 scrub+0.089
 Honey+0.088
 rinse+0.088
 swe+0.088
 pur+0.087
Neg logit contributions
ork-0.089
eg-0.089
spe-0.088
rolled-0.085
ings-0.085
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 scrub+0.001
 Honey+0.001
aj+0.001
 rinse+0.001
 Olive+0.001
Neg logit contributions
eg-0.001
by-0.001
--0.001
 across-0.001
ork-0.001
Token One
Token ID1881
Feature activation+1.59e-01

Pos logit contributions
 swollen+0.003
 scrub+0.003
 Olive+0.003
 rinse+0.003
 Honey+0.003
Neg logit contributions
--0.003
 letter-0.003
 across-0.003
eg-0.003
 thing-0.003
Token day
Token ID1110
Feature activation+2.70e-01

Pos logit contributions
eg+0.004
ings+0.004
rolled+0.004
ork+0.004
ables+0.004
Neg logit contributions
 scrub-0.004
 Honey-0.004
 rinse-0.004
Sleep-0.004
 Olive-0.004
Token,
Token ID11
Feature activation+6.88e-02

Pos logit contributions
ables+0.007
spe+0.007
 letter+0.007
eg+0.007
ork+0.007
Neg logit contributions
 Olive-0.007
 Honey-0.007
 scrub-0.007
 Buzz-0.007
 Sunny-0.007
Token with
Token ID351
Feature activation-1.88e-01

Pos logit contributions
-+0.002
by+0.002
 letter+0.002
 across+0.002
 thing+0.002
Neg logit contributions
 Olive-0.004
 Honey-0.004
 scrub-0.004
 rinse-0.004
aj-0.004
Token her
Token ID607
Feature activation-1.29e+00

Pos logit contributions
 Olive+0.005
 Honey+0.005
 scrub+0.005
aj+0.005
 rinse+0.005
Neg logit contributions
--0.005
 letter-0.005
by-0.005
 thing-0.005
 across-0.005
Token family
Token ID1641
Feature activation+1.96e-01

Pos logit contributions
 Olive+0.047
 rinse+0.047
 Honey+0.047
 scrub+0.046
Fine+0.046
Neg logit contributions
--0.025
by-0.023
 very-0.023
 letter-0.023
 thing-0.022
Token in
Token ID287
Feature activation-5.94e-01

Pos logit contributions
-+0.004
ork+0.004
eg+0.004
by+0.004
 letter+0.004
Neg logit contributions
 Olive-0.006
 Honey-0.006
 scrub-0.006
 rinse-0.006
aj-0.006
Token their
Token ID511
Feature activation-1.11e+00

Pos logit contributions
 Olive+0.018
 Honey+0.018
 scrub+0.017
 rinse+0.017
aj+0.017
Neg logit contributions
--0.015
by-0.014
 letter-0.014
 across-0.014
 thing-0.014
Token big
Token ID1263
Feature activation-3.30e+00

Pos logit contributions
aj+0.033
 Honey+0.033
 Olive+0.033
 scrub+0.033
 rinse+0.032
Neg logit contributions
ings-0.026
--0.026
eg-0.026
ork-0.026
 letter-0.025
Token tent
Token ID11105
Feature activation-1.73e-01

Pos logit contributions
 Olive+0.100
 Honey+0.100
 scrub+0.099
 rinse+0.097
aj+0.097
Neg logit contributions
--0.080
by-0.077
 letter-0.076
eg-0.075
ork-0.075
Token.
Token ID13
Feature activation+3.84e-02

Pos logit contributions
 Olive+0.006
 Honey+0.006
 scrub+0.006
 rinse+0.006
aj+0.006
Neg logit contributions
--0.004
by-0.003
 letter-0.003
 across-0.003
 thing-0.003
Token One
Token ID1881
Feature activation-1.19e-01

Pos logit contributions
ay+0.001
s+0.001
 across+0.001
-+0.001
 lot+0.001
Neg logit contributions
 swollen-0.001
 scrub-0.001
 rinse-0.001
 iv-0.001
aj-0.001
Token day
Token ID1110
Feature activation+5.49e-01

Pos logit contributions
 Honey+0.004
 Olive+0.004
Sleep+0.004
 scrub+0.004
 rinse+0.004
Neg logit contributions
ings-0.003
eg-0.003
ork-0.003
spe-0.002
by-0.002
Token,
Token ID11
Feature activation+3.88e-01

Pos logit contributions
 letter+0.014
eg+0.014
spe+0.014
ables+0.013
by+0.013
Neg logit contributions
 Olive-0.017
 Honey-0.016
 scrub-0.015
 rinse-0.015
 Midnight-0.015
Token little
Token ID1310
Feature activation-7.61e-01

Pos logit contributions
 scrub+0.027
aj+0.027
 Olive+0.027
 rinse+0.027
 Honey+0.027
Neg logit contributions
--0.016
 letter-0.015
by-0.015
 very-0.015
 thing-0.015
Token fairy
Token ID25607
Feature activation-2.02e+00

Pos logit contributions
 Olive+0.022
 Honey+0.022
 scrub+0.022
 rinse+0.022
aj+0.021
Neg logit contributions
--0.020
by-0.019
 letter-0.019
 across-0.018
eg-0.018
Token.
Token ID13
Feature activation-8.73e-01

Pos logit contributions
 Olive+0.052
 rinse+0.052
 scrub+0.051
 Honey+0.051
aj+0.050
Neg logit contributions
--0.061
by-0.057
 letter-0.057
 across-0.056
 thing-0.056
Token She
Token ID1375
Feature activation-7.36e-01

Pos logit contributions
 scrub+0.029
aj+0.028
 rinse+0.028
 swollen+0.028
 Olive+0.026
Neg logit contributions
 across-0.020
--0.020
ay-0.019
 thing-0.019
 letter-0.019
Token had
Token ID550
Feature activation-5.28e-01

Pos logit contributions
 Olive+0.027
 Honey+0.027
 scrub+0.026
 rinse+0.026
aj+0.026
Neg logit contributions
--0.014
by-0.013
 letter-0.013
 across-0.013
eg-0.013
Token tiny
Token ID7009
Feature activation-3.28e+00

Pos logit contributions
 Olive+0.019
 Honey+0.019
 scrub+0.019
 rinse+0.019
aj+0.018
Neg logit contributions
--0.010
by-0.009
 letter-0.009
 across-0.009
eg-0.009
Token wings
Token ID12098
Feature activation-1.52e+00

Pos logit contributions
 Olive+0.087
 Honey+0.086
 scrub+0.084
aj+0.083
 pur+0.082
Neg logit contributions
--0.091
eg-0.089
by-0.089
 letter-0.088
 across-0.088
Token and
Token ID290
Feature activation-9.51e-01

Pos logit contributions
 Olive+0.049
 Honey+0.048
 scrub+0.048
 pur+0.047
 rinse+0.047
Neg logit contributions
eg-0.033
--0.032
by-0.032
 letter-0.032
ork-0.031
Token a
Token ID257
Feature activation-6.43e-01

Pos logit contributions
 Olive+0.031
 Honey+0.031
 scrub+0.031
 rinse+0.031
aj+0.030
Neg logit contributions
--0.021
by-0.020
 letter-0.020
 across-0.019
 thing-0.019
Token big
Token ID1263
Feature activation-1.54e+00

Pos logit contributions
 Olive+0.021
 Honey+0.021
 scrub+0.020
 rinse+0.020
aj+0.020
Neg logit contributions
--0.015
by-0.014
 letter-0.014
eg-0.014
 across-0.014
Token smile
Token ID8212
Feature activation+2.54e-01

Pos logit contributions
 Olive+0.049
 Honey+0.049
 scrub+0.048
 rinse+0.048
aj+0.047
Neg logit contributions
--0.036
by-0.034
 letter-0.034
 across-0.033
 thing-0.033
Token family
Token ID1641
Feature activation+5.79e-01

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.009
aj+0.009
Neg logit contributions
--0.005
by-0.005
 letter-0.004
 across-0.004
eg-0.004
Token of
Token ID286
Feature activation+1.33e-01

Pos logit contributions
eg+0.018
ork+0.017
ables+0.017
ings+0.017
by+0.017
Neg logit contributions
 scrub-0.014
 Honey-0.014
 Olive-0.014
Sleep-0.013
 rinse-0.013
Token three
Token ID1115
Feature activation-9.87e-01

Pos logit contributions
-+0.003
 very+0.003
 letter+0.003
 thing+0.003
by+0.003
Neg logit contributions
aj-0.004
 rinse-0.004
 Olive-0.004
 Honey-0.004
 swollen-0.004
Token in
Token ID287
Feature activation-9.24e-01

Pos logit contributions
 Olive+0.032
aj+0.031
 scrub+0.031
 Honey+0.031
 pur+0.031
Neg logit contributions
 across-0.022
eg-0.021
ings-0.021
by-0.021
 letter-0.020
Token their
Token ID511
Feature activation-9.13e-01

Pos logit contributions
 Honey+0.029
 Olive+0.029
 rinse+0.028
 scrub+0.028
 Sunny+0.027
Neg logit contributions
by-0.022
--0.022
ables-0.022
ings-0.022
eg-0.022
Token big
Token ID1263
Feature activation-3.27e+00

Pos logit contributions
 Honey+0.027
 Olive+0.027
 Sunny+0.025
 pur+0.025
 rinse+0.025
Neg logit contributions
ork-0.024
ings-0.024
eg-0.024
ables-0.024
--0.023
Token house
Token ID2156
Feature activation-7.90e-01

Pos logit contributions
 Honey+0.104
 Olive+0.103
 pur+0.101
 scrub+0.100
aj+0.099
Neg logit contributions
ork-0.076
ings-0.071
eg-0.071
--0.071
by-0.069
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
 Olive+0.026
 Honey+0.026
 scrub+0.025
 rinse+0.025
aj+0.025
Neg logit contributions
--0.017
by-0.017
 letter-0.016
eg-0.016
 across-0.016
Token Mom
Token ID11254
Feature activation-7.46e-01

Pos logit contributions
 Olive+0.015
 Honey+0.015
 scrub+0.015
 rinse+0.015
aj+0.014
Neg logit contributions
--0.012
by-0.011
 letter-0.011
eg-0.011
 across-0.011
Token was
Token ID373
Feature activation-4.60e-01

Pos logit contributions
 Olive+0.021
 Honey+0.021
 scrub+0.021
 rinse+0.021
aj+0.020
Neg logit contributions
--0.020
by-0.019
 letter-0.019
 across-0.019
 thing-0.019
Token very
Token ID845
Feature activation-6.26e-01

Pos logit contributions
 Olive+0.017
 scrub+0.017
 Honey+0.017
 rinse+0.016
 pur+0.016
Neg logit contributions
by-0.009
eg-0.008
ables-0.008
ork-0.008
ings-0.008
Token small
Token ID1402
Feature activation-4.62e-01

Pos logit contributions
ings+0.003
-+0.003
ork+0.003
eg+0.003
 letter+0.003
Neg logit contributions
 Honey-0.004
 Olive-0.004
 scrub-0.004
 rinse-0.004
 pur-0.004
Token,
Token ID11
Feature activation+5.25e-01

Pos logit contributions
 Olive+0.017
 Honey+0.017
 scrub+0.017
 rinse+0.017
aj+0.016
Neg logit contributions
--0.008
by-0.008
 letter-0.008
 across-0.008
eg-0.008
Token human
Token ID1692
Feature activation-1.93e-01

Pos logit contributions
-+0.012
 very+0.011
 across+0.011
 letter+0.011
 thing+0.010
Neg logit contributions
 Olive-0.018
 Honey-0.017
aj-0.017
 rinse-0.017
irms-0.017
Token boy
Token ID2933
Feature activation+4.99e-01

Pos logit contributions
 Olive+0.006
 Honey+0.006
 scrub+0.006
 rinse+0.006
aj+0.006
Neg logit contributions
--0.005
by-0.004
 letter-0.004
 across-0.004
 thing-0.004
Token with
Token ID351
Feature activation-8.50e-01

Pos logit contributions
-+0.012
by+0.011
 letter+0.011
eg+0.011
 across+0.011
Neg logit contributions
 Olive-0.016
 Honey-0.016
 scrub-0.016
 rinse-0.015
aj-0.015
Token big
Token ID1263
Feature activation-3.25e+00

Pos logit contributions
 Olive+0.029
 Honey+0.029
 scrub+0.029
 rinse+0.029
 swollen+0.028
Neg logit contributions
--0.018
by-0.017
 letter-0.016
ork-0.016
eg-0.016
Token,
Token ID11
Feature activation+2.42e-01

Pos logit contributions
 Olive+0.096
 Honey+0.095
 scrub+0.094
 rinse+0.093
aj+0.091
Neg logit contributions
--0.083
by-0.079
 letter-0.079
 across-0.078
eg-0.077
Token brown
Token ID7586
Feature activation-1.38e+00

Pos logit contributions
-+0.006
 very+0.006
 across+0.006
 letter+0.006
 thing+0.005
Neg logit contributions
 Olive-0.007
aj-0.007
 rinse-0.007
 Honey-0.007
 scrub-0.007
Token eyes
Token ID2951
Feature activation-6.81e-01

Pos logit contributions
 Olive+0.039
 Honey+0.038
 pur+0.037
 scrub+0.037
aj+0.036
Neg logit contributions
spe-0.037
ork-0.036
eg-0.036
 letter-0.035
by-0.035
Token.
Token ID13
Feature activation-7.95e-01

Pos logit contributions
 scrub+0.022
 Honey+0.022
 Olive+0.022
 pur+0.021
 rinse+0.021
Neg logit contributions
by-0.015
eg-0.014
--0.014
 letter-0.014
ork-0.014
Token He
Token ID679
Feature activation-2.15e-01

Pos logit contributions
 Olive+0.024
 Honey+0.024
 scrub+0.024
 rinse+0.023
aj+0.023
Neg logit contributions
--0.020
by-0.019
 letter-0.019
ork-0.019
eg-0.019
Token The
Token ID383
Feature activation-5.21e-03

Pos logit contributions
 Olive+0.005
 scrub+0.005
 Honey+0.005
 rinse+0.005
aj+0.005
Neg logit contributions
--0.003
 letter-0.003
by-0.003
 across-0.003
eg-0.003
Token oct
Token ID19318
Feature activation+1.36e-01

Pos logit contributions
 Honey+0.000
 Olive+0.000
 scrub+0.000
 Buzz+0.000
 rinse+0.000
Neg logit contributions
ings-0.000
ables-0.000
ork-0.000
--0.000
by-0.000
Tokenopus
Token ID25790
Feature activation+9.97e-02

Pos logit contributions
 letter+0.005
 across+0.005
by+0.004
 thing+0.004
ings+0.004
Neg logit contributions
aj-0.003
 Olive-0.003
 Buzz-0.003
 Honey-0.003
 Holly-0.002
Token had
Token ID550
Feature activation-2.10e-01

Pos logit contributions
ings+0.002
ay+0.002
 letter+0.002
ork+0.002
eg+0.002
Neg logit contributions
 scrub-0.003
 Olive-0.003
 pur-0.003
 Honey-0.003
 Buzz-0.003
Token eight
Token ID3624
Feature activation-3.17e+00

Pos logit contributions
 pur+0.006
 Olive+0.006
 Buzz+0.006
 Sunny+0.005
 Honey+0.005
Neg logit contributions
ings-0.006
--0.006
by-0.006
spe-0.006
ork-0.006
Token long
Token ID890
Feature activation-3.24e+00

Pos logit contributions
 Olive+0.082
 Honey+0.081
 scrub+0.081
aj+0.078
 rinse+0.078
Neg logit contributions
--0.090
 letter-0.089
by-0.088
 across-0.086
ork-0.086
Token arms
Token ID5101
Feature activation-5.38e-01

Pos logit contributions
 Olive+0.093
 Honey+0.092
 scrub+0.091
 rinse+0.090
aj+0.089
Neg logit contributions
--0.085
by-0.082
 letter-0.081
 across-0.080
eg-0.080
Token and
Token ID290
Feature activation-8.73e-01

Pos logit contributions
 scrub+0.016
 Olive+0.016
 Honey+0.015
 pur+0.015
 sob+0.015
Neg logit contributions
 letter-0.013
eg-0.013
by-0.013
ork-0.013
--0.012
Token loved
Token ID6151
Feature activation-6.54e-01

Pos logit contributions
 Olive+0.028
 Honey+0.028
 scrub+0.028
 Buzz+0.027
 pur+0.027
Neg logit contributions
--0.020
 letter-0.019
ork-0.019
ings-0.019
by-0.019
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 scrub+0.026
 Olive+0.025
 Honey+0.025
 Buzz+0.025
 pur+0.025
Neg logit contributions
eg-0.010
ay-0.010
ork-0.010
--0.010
 letter-0.010
Token swim
Token ID9422
Feature activation-4.90e-01

Pos logit contributions
-+0.011
by+0.010
 thing+0.010
 letter+0.010
 across+0.010
Neg logit contributions
 Olive-0.011
 Honey-0.011
aj-0.011
 swollen-0.011
Sleep-0.011
Token.
Token ID13
Feature activation-3.32e-02

Pos logit contributions
 scrub+0.014
 Olive+0.013
 Honey+0.013
 Buzz+0.013
 pur+0.013
Neg logit contributions
eg-0.016
 letter-0.015
by-0.015
ay-0.015
ork-0.015
Token The
Token ID383
Feature activation-1.67e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
aj+0.001
Neg logit contributions
--0.001
by-0.001
 letter-0.001
 across-0.001
eg-0.001
Token oct
Token ID19318
Feature activation-1.00e-01

Pos logit contributions
 Honey+0.003
 Buzz+0.003
 scrub+0.003
 Olive+0.003
 Holly+0.003
Neg logit contributions
ings-0.006
ables-0.006
ork-0.006
--0.006
by-0.006
Tokenopus
Token ID25790
Feature activation+4.09e-02

Pos logit contributions
 Buzz+0.002
aj+0.002
 Olive+0.002
 Honey+0.002
 Holly+0.002
Neg logit contributions
 letter-0.003
 across-0.003
by-0.003
 thing-0.003
ings-0.003
Token had
Token ID550
Feature activation-2.32e-01

Pos logit contributions
 letter+0.001
ings+0.001
-+0.001
ay+0.001
ork+0.001
Neg logit contributions
 scrub-0.001
 Olive-0.001
 Honey-0.001
 pur-0.001
 Buzz-0.001
Token eight
Token ID3624
Feature activation-3.23e+00

Pos logit contributions
 pur+0.007
 scrub+0.007
 Olive+0.006
 Buzz+0.006
 sob+0.006
Neg logit contributions
ings-0.006
by-0.006
--0.006
ay-0.006
eg-0.006
Token arms
Token ID5101
Feature activation-4.32e-01

Pos logit contributions
 Olive+0.084
 Honey+0.083
 scrub+0.083
 rinse+0.081
aj+0.080
Neg logit contributions
--0.093
by-0.089
 letter-0.089
 across-0.088
eg-0.087
Token and
Token ID290
Feature activation-6.86e-01

Pos logit contributions
 scrub+0.013
 Olive+0.012
 pur+0.012
 Honey+0.012
 sob+0.012
Neg logit contributions
 letter-0.011
by-0.010
eg-0.010
ork-0.010
spe-0.010
Token loved
Token ID6151
Feature activation-7.39e-01

Pos logit contributions
 Olive+0.020
 Honey+0.020
 scrub+0.020
 Buzz+0.020
 pur+0.020
Neg logit contributions
--0.017
ings-0.017
ork-0.017
by-0.017
 letter-0.016
Token to
Token ID284
Feature activation+3.57e-01

Pos logit contributions
 scrub+0.029
 Olive+0.029
 Honey+0.028
 Buzz+0.028
 pur+0.028
Neg logit contributions
eg-0.012
ork-0.011
ay-0.011
 letter-0.011
--0.011
Token swim
Token ID9422
Feature activation-4.88e-01

Pos logit contributions
-+0.009
 thing+0.009
 very+0.009
by+0.009
 across+0.009
Neg logit contributions
aj-0.011
 Olive-0.011
 Honey-0.011
 swollen-0.010
Sleep-0.010
Token too
Token ID1165
Feature activation-7.49e-02

Pos logit contributions
 Olive+0.012
 Honey+0.011
 scrub+0.011
 rinse+0.011
 swollen+0.011
Neg logit contributions
--0.009
by-0.009
eg-0.008
 letter-0.008
ings-0.008
Token dangerous
Token ID4923
Feature activation-6.72e-01

Pos logit contributions
 scrub+0.002
 Olive+0.002
 Honey+0.002
 swollen+0.002
 rinse+0.002
Neg logit contributions
eg-0.002
by-0.002
--0.002
 letter-0.002
ings-0.002
Token to
Token ID284
Feature activation-4.07e-01

Pos logit contributions
 Olive+0.022
 Honey+0.021
 scrub+0.021
 iv+0.020
 pur+0.020
Neg logit contributions
eg-0.015
 letter-0.014
by-0.014
 photo-0.014
ork-0.014
Token use
Token ID779
Feature activation-4.41e-01

Pos logit contributions
aj+0.014
irms+0.014
 Olive+0.014
 Honey+0.013
Sleep+0.013
Neg logit contributions
--0.009
 thing-0.008
by-0.008
 very-0.008
ork-0.008
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 scrub+0.014
 Olive+0.013
 Honey+0.013
 rinse+0.013
 swollen+0.012
Neg logit contributions
eg-0.011
by-0.011
 letter-0.010
--0.010
ables-0.010
Token Fre
Token ID4848
Feature activation-3.22e+00

Pos logit contributions
 Honey+0.028
 Olive+0.027
 Buzz+0.025
 Sunny+0.025
 Holly+0.024
Neg logit contributions
--0.038
 letter-0.037
by-0.036
eg-0.036
 thing-0.035
Tokenya
Token ID3972
Feature activation+3.84e-01

Pos logit contributions
 rinse+0.109
 Olive+0.108
 scrub+0.107
 Honey+0.105
 swollen+0.105
Neg logit contributions
--0.075
by-0.070
 letter-0.067
 thing-0.065
 very-0.064
Token was
Token ID373
Feature activation-6.92e-01

Pos logit contributions
eg+0.009
ay+0.009
ings+0.009
 photo+0.009
 across+0.008
Neg logit contributions
 scrub-0.012
 Honey-0.012
 sob-0.012
 pur-0.012
 Olive-0.011
Token very
Token ID845
Feature activation-4.33e-01

Pos logit contributions
 Olive+0.024
 scrub+0.024
 Honey+0.023
 rinse+0.023
 sob+0.023
Neg logit contributions
eg-0.014
by-0.014
--0.014
ings-0.014
ay-0.014
Token disappointed
Token ID11679
Feature activation-3.83e-01

Pos logit contributions
 Olive+0.012
 scrub+0.012
 Honey+0.012
 rinse+0.012
 swollen+0.012
Neg logit contributions
eg-0.011
ay-0.011
--0.011
ings-0.011
by-0.011
Token and
Token ID290
Feature activation-2.68e-01

Pos logit contributions
 Olive+0.012
 Honey+0.012
 scrub+0.012
 rinse+0.011
aj+0.011
Neg logit contributions
eg-0.009
 letter-0.009
--0.009
by-0.009
ork-0.008
Token strong
Token ID1913
Feature activation-5.86e-01

Pos logit contributions
 Olive+0.014
 Honey+0.014
 scrub+0.014
 rinse+0.014
aj+0.013
Neg logit contributions
--0.007
by-0.007
 letter-0.007
 across-0.007
 thing-0.007
Token rabbit
Token ID22746
Feature activation-6.08e-01

Pos logit contributions
 Olive+0.018
 Honey+0.018
 scrub+0.018
 rinse+0.017
 pur+0.017
Neg logit contributions
--0.014
by-0.013
ings-0.013
 across-0.013
ork-0.013
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
 rinse+0.017
aj+0.017
 scrub+0.017
 Olive+0.017
 Honey+0.016
Neg logit contributions
--0.017
 very-0.016
 letter-0.016
 thing-0.016
 across-0.016
Token He
Token ID679
Feature activation+8.09e-04

Pos logit contributions
 scrub+0.010
 Olive+0.010
 rinse+0.010
aj+0.010
 Honey+0.010
Neg logit contributions
--0.006
by-0.006
 across-0.006
 letter-0.006
eg-0.006
Token had
Token ID550
Feature activation-2.22e-01

Pos logit contributions
 letter+0.000
ings+0.000
-+0.000
ay+0.000
by+0.000
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
 pur-0.000
 Buzz-0.000
Token big
Token ID1263
Feature activation-3.20e+00

Pos logit contributions
 Olive+0.007
 Honey+0.007
 scrub+0.007
 rinse+0.007
 pur+0.007
Neg logit contributions
--0.005
by-0.005
ings-0.005
eg-0.005
 letter-0.005
Token,
Token ID11
Feature activation-2.04e-01

Pos logit contributions
 rinse+0.102
 Honey+0.102
 Olive+0.102
 scrub+0.102
Sleep+0.099
Neg logit contributions
--0.075
by-0.070
 letter-0.069
 very-0.068
 thing-0.068
Token strong
Token ID1913
Feature activation-2.10e+00

Pos logit contributions
 Olive+0.006
 scrub+0.006
 rinse+0.006
aj+0.006
 Honey+0.006
Neg logit contributions
--0.005
 letter-0.005
 very-0.005
 thing-0.005
 across-0.005
Token muscles
Token ID12749
Feature activation-6.20e-01

Pos logit contributions
 Olive+0.058
 Honey+0.058
 scrub+0.057
 rinse+0.057
aj+0.056
Neg logit contributions
--0.058
by-0.055
 letter-0.055
 across-0.054
eg-0.054
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 scrub+0.017
 Olive+0.017
 Honey+0.017
 pur+0.016
 rinse+0.016
Neg logit contributions
eg-0.016
by-0.016
 letter-0.016
ay-0.015
ork-0.015
Token He
Token ID679
Feature activation-1.56e-01

Pos logit contributions
ay+0.002
 across+0.002
eg+0.002
 thing+0.002
-+0.002
Neg logit contributions
 swollen-0.006
 rinse-0.006
 scrub-0.006
aj-0.006
Aud-0.006
Token was
Token ID373
Feature activation-4.92e-01

Pos logit contributions
-+0.001
by+0.001
 letter+0.001
 across+0.001
eg+0.001
Neg logit contributions
 Olive-0.002
 Honey-0.002
 scrub-0.002
 rinse-0.002
aj-0.002
Token always
Token ID1464
Feature activation+3.10e-03

Pos logit contributions
 Olive+0.017
 scrub+0.017
 Honey+0.017
 rinse+0.016
 pur+0.016
Neg logit contributions
by-0.010
--0.010
ork-0.010
eg-0.010
ings-0.010
Token working
Token ID1762
Feature activation-3.54e-01

Pos logit contributions
-+0.000
by+0.000
 letter+0.000
eg+0.000
 across+0.000
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
 rinse-0.000
aj-0.000
Token in
Token ID287
Feature activation-9.66e-02

Pos logit contributions
 scrub+0.011
 rinse+0.010
aj+0.010
 Honey+0.009
 Olive+0.009
Neg logit contributions
 photo-0.009
eg-0.009
 letter-0.009
by-0.009
ork-0.008
Token his
Token ID465
Feature activation-1.44e+00

Pos logit contributions
 scrub+0.003
 Honey+0.003
 Olive+0.003
 rinse+0.003
 swollen+0.002
Neg logit contributions
eg-0.003
by-0.003
--0.003
ables-0.003
s-0.003
Token big
Token ID1263
Feature activation-3.20e+00

Pos logit contributions
 Honey+0.036
 pur+0.036
 Olive+0.035
aj+0.035
 rinse+0.035
Neg logit contributions
spe-0.043
ay-0.041
eg-0.040
ork-0.040
ings-0.040
Token garden
Token ID11376
Feature activation-9.42e-01

Pos logit contributions
 pur+0.090
 scrub+0.089
 Honey+0.089
 swe+0.088
 Olive+0.087
Neg logit contributions
eg-0.081
ork-0.081
spe-0.080
ings-0.078
rolled-0.077
Token.
Token ID13
Feature activation+4.09e-01

Pos logit contributions
 scrub+0.026
 Honey+0.026
 Olive+0.026
aj+0.026
 rinse+0.025
Neg logit contributions
eg-0.025
by-0.024
 photo-0.024
ork-0.023
 letter-0.023
Token One
Token ID1881
Feature activation-1.66e-01

Pos logit contributions
-+0.008
 across+0.008
 thing+0.008
eg+0.008
ay+0.008
Neg logit contributions
 Olive-0.014
 scrub-0.014
 rinse-0.014
 Honey-0.014
 swollen-0.014
Token day
Token ID1110
Feature activation+8.74e-02

Pos logit contributions
 Honey+0.005
 Olive+0.005
ens+0.005
 Buzz+0.004
 scrub+0.004
Neg logit contributions
eg-0.005
ings-0.005
ork-0.004
rolled-0.004
ables-0.004
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
 letter+0.002
spe+0.002
ables+0.002
by+0.002
eg+0.002
Neg logit contributions
 Olive-0.002
 Honey-0.002
 scrub-0.002
 Buzz-0.002
 Sunny-0.002
Token P
Token ID350
Feature activation-6.08e-01

Pos logit contributions
 rinse+0.002
 scrub+0.002
aj+0.002
irms+0.002
Fine+0.002
Neg logit contributions
--0.002
 letter-0.002
eg-0.002
 across-0.002
 very-0.002
Tokenia
Token ID544
Feature activation-2.35e-01

Pos logit contributions
 Olive+0.014
 Honey+0.014
 scrub+0.014
 rinse+0.014
aj+0.013
Neg logit contributions
--0.019
by-0.019
 letter-0.019
 across-0.018
eg-0.018
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 Olive+0.008
 Honey+0.008
 scrub+0.008
 rinse+0.007
aj+0.007
Neg logit contributions
--0.005
by-0.005
 letter-0.005
eg-0.005
 across-0.005
Token She
Token ID1375
Feature activation-1.12e-01

Pos logit contributions
 Olive+0.007
 scrub+0.007
 Honey+0.007
 rinse+0.007
aj+0.007
Neg logit contributions
--0.012
by-0.011
 across-0.011
eg-0.011
 thing-0.011
Token had
Token ID550
Feature activation-9.80e-02

Pos logit contributions
 Olive+0.004
 Honey+0.004
 scrub+0.004
 rinse+0.004
aj+0.004
Neg logit contributions
--0.002
by-0.002
 letter-0.002
 across-0.002
eg-0.002
Token big
Token ID1263
Feature activation-3.19e+00

Pos logit contributions
 Olive+0.003
 Honey+0.003
 scrub+0.003
 pur+0.003
 rinse+0.003
Neg logit contributions
--0.002
eg-0.002
ings-0.002
by-0.002
 letter-0.002
Token,
Token ID11
Feature activation-1.35e-01

Pos logit contributions
 Olive+0.094
 Honey+0.093
 scrub+0.091
 pur+0.090
aj+0.090
Neg logit contributions
--0.080
by-0.077
ork-0.076
eg-0.076
 across-0.076
Token alert
Token ID7995
Feature activation-9.43e-01

Pos logit contributions
 scrub+0.004
 rinse+0.004
 Olive+0.004
aj+0.004
 Honey+0.004
Neg logit contributions
--0.004
 letter-0.003
 very-0.003
 thing-0.003
 across-0.003
Token eyes
Token ID2951
Feature activation-6.88e-01

Pos logit contributions
 pur+0.029
 Honey+0.029
 Olive+0.029
aj+0.029
 scrub+0.028
Neg logit contributions
eg-0.022
ork-0.021
 letter-0.021
 photo-0.021
spe-0.020
Token and
Token ID290
Feature activation-5.08e-01

Pos logit contributions
 Olive+0.022
 Honey+0.022
 scrub+0.022
 pur+0.021
 rinse+0.021
Neg logit contributions
by-0.015
eg-0.015
 letter-0.015
--0.014
 photo-0.014
Token her
Token ID607
Feature activation-5.52e-01

Pos logit contributions
 Olive+0.018
 Honey+0.017
 rinse+0.017
 scrub+0.017
aj+0.017
Neg logit contributions
--0.011
by-0.010
 very-0.010
 across-0.010
 letter-0.010
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 Olive+0.005
 Honey+0.005
 scrub+0.005
 rinse+0.005
aj+0.005
Neg logit contributions
--0.004
by-0.003
 letter-0.003
eg-0.003
 across-0.003
Token The
Token ID383
Feature activation-5.21e-03

Pos logit contributions
 Olive+0.005
 scrub+0.005
 Honey+0.005
 rinse+0.005
aj+0.005
Neg logit contributions
--0.003
 letter-0.003
by-0.003
 across-0.003
eg-0.003
Token oct
Token ID19318
Feature activation+1.36e-01

Pos logit contributions
 Honey+0.000
 Olive+0.000
 scrub+0.000
 Buzz+0.000
 rinse+0.000
Neg logit contributions
ings-0.000
ables-0.000
ork-0.000
--0.000
by-0.000
Tokenopus
Token ID25790
Feature activation+9.97e-02

Pos logit contributions
 letter+0.005
 across+0.005
by+0.004
 thing+0.004
ings+0.004
Neg logit contributions
aj-0.003
 Olive-0.003
 Buzz-0.003
 Honey-0.003
 Holly-0.002
Token had
Token ID550
Feature activation-2.10e-01

Pos logit contributions
ings+0.002
ay+0.002
 letter+0.002
ork+0.002
eg+0.002
Neg logit contributions
 scrub-0.003
 Olive-0.003
 pur-0.003
 Honey-0.003
 Buzz-0.003
Token eight
Token ID3624
Feature activation-3.17e+00

Pos logit contributions
 pur+0.006
 Olive+0.006
 Buzz+0.006
 Sunny+0.005
 Honey+0.005
Neg logit contributions
ings-0.006
--0.006
by-0.006
spe-0.006
ork-0.006
Token long
Token ID890
Feature activation-3.24e+00

Pos logit contributions
 Olive+0.082
 Honey+0.081
 scrub+0.081
aj+0.078
 rinse+0.078
Neg logit contributions
--0.090
 letter-0.089
by-0.088
 across-0.086
ork-0.086
Token arms
Token ID5101
Feature activation-5.38e-01

Pos logit contributions
 Olive+0.093
 Honey+0.092
 scrub+0.091
 rinse+0.090
aj+0.089
Neg logit contributions
--0.085
by-0.082
 letter-0.081
 across-0.080
eg-0.080
Token and
Token ID290
Feature activation-8.73e-01

Pos logit contributions
 scrub+0.016
 Olive+0.016
 Honey+0.015
 pur+0.015
 sob+0.015
Neg logit contributions
 letter-0.013
eg-0.013
by-0.013
ork-0.013
--0.012
Token loved
Token ID6151
Feature activation-6.54e-01

Pos logit contributions
 Olive+0.028
 Honey+0.028
 scrub+0.028
 Buzz+0.027
 pur+0.027
Neg logit contributions
--0.020
 letter-0.019
ork-0.019
ings-0.019
by-0.019
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 scrub+0.026
 Olive+0.025
 Honey+0.025
 Buzz+0.025
 pur+0.025
Neg logit contributions
eg-0.010
ay-0.010
ork-0.010
--0.010
 letter-0.010
Token twisted
Token ID19074
Feature activation-5.34e-01

Pos logit contributions
 Olive+0.037
 Honey+0.037
 scrub+0.037
 rinse+0.037
aj+0.036
Neg logit contributions
--0.028
by-0.026
 letter-0.026
 across-0.026
 thing-0.026
Token,
Token ID11
Feature activation-5.21e-01

Pos logit contributions
 scrub+0.017
 Olive+0.017
 Honey+0.017
 pur+0.016
 iv+0.016
Neg logit contributions
eg-0.013
 letter-0.013
 photo-0.012
by-0.012
ork-0.012
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 Olive+0.016
 Honey+0.016
 scrub+0.016
 rinse+0.015
 pur+0.015
Neg logit contributions
by-0.013
--0.013
eg-0.012
 letter-0.012
ork-0.012
Token it
Token ID340
Feature activation-6.96e-01

Pos logit contributions
 Olive+0.014
 Honey+0.014
 scrub+0.014
 rinse+0.013
 pur+0.013
Neg logit contributions
--0.008
by-0.008
eg-0.008
 letter-0.007
ork-0.007
Token had
Token ID550
Feature activation+2.42e-01

Pos logit contributions
aj+0.025
 Olive+0.025
 Honey+0.025
 Deal+0.025
Sleep+0.025
Neg logit contributions
--0.014
 very-0.014
 across-0.013
 thing-0.013
by-0.012
Token th
Token ID294
Feature activation-3.15e+00

Pos logit contributions
eg+0.006
-+0.006
by+0.006
ings+0.006
ay+0.006
Neg logit contributions
 pur-0.007
 Olive-0.007
 scrub-0.007
 Honey-0.007
 rinse-0.007
Tokenorns
Token ID19942
Feature activation-1.54e+00

Pos logit contributions
 rinse+0.083
 Olive+0.081
 Honey+0.079
Sleep+0.078
 Sunny+0.078
Neg logit contributions
--0.097
ings-0.091
 across-0.089
by-0.088
ay-0.087
Token all
Token ID477
Feature activation+3.40e-01

Pos logit contributions
 Olive+0.048
 scrub+0.048
 Honey+0.047
 rinse+0.046
 iv+0.046
Neg logit contributions
eg-0.036
 letter-0.035
by-0.035
--0.034
ay-0.033
Token over
Token ID625
Feature activation+2.72e-01

Pos logit contributions
-+0.008
eg+0.008
 letter+0.008
by+0.007
ork+0.007
Neg logit contributions
 Olive-0.011
 scrub-0.011
 Honey-0.010
 rinse-0.010
 pur-0.010
Token it
Token ID340
Feature activation-1.88e+00

Pos logit contributions
eg+0.007
 letter+0.007
ables+0.007
-+0.007
ork+0.007
Neg logit contributions
 scrub-0.008
 Honey-0.008
 Olive-0.008
 Buzz-0.008
 rinse-0.008
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 Olive+0.066
 Honey+0.065
 scrub+0.064
 rinse+0.064
aj+0.064
Neg logit contributions
--0.038
 across-0.036
 letter-0.035
by-0.035
 very-0.035
Tokenpa
Token ID8957
Feature activation-1.43e-01

Pos logit contributions
 pur+0.036
 Olive+0.036
aj+0.036
 scrub+0.035
 Honey+0.035
Neg logit contributions
 across-0.039
ork-0.038
spe-0.037
ings-0.037
--0.037
Token.
Token ID13
Feature activation-9.42e-01

Pos logit contributions
 Honey+0.005
 Olive+0.004
 scrub+0.004
 rinse+0.004
 Sunny+0.004
Neg logit contributions
ork-0.003
--0.003
by-0.003
eg-0.003
ings-0.003
Token Grand
Token ID5675
Feature activation-8.28e-01

Pos logit contributions
 Olive+0.024
 Honey+0.024
 scrub+0.024
 rinse+0.024
aj+0.023
Neg logit contributions
--0.028
by-0.027
 letter-0.027
 across-0.026
eg-0.026
Tokenpa
Token ID8957
Feature activation+1.45e-01

Pos logit contributions
 Olive+0.022
 scrub+0.022
 Honey+0.022
aj+0.022
 rinse+0.022
Neg logit contributions
--0.022
 across-0.022
by-0.022
ork-0.021
eg-0.021
Token has
Token ID468
Feature activation-1.10e+00

Pos logit contributions
-+0.003
by+0.003
 letter+0.003
 across+0.003
 very+0.003
Neg logit contributions
 Olive-0.005
 Honey-0.005
 rinse-0.005
 scrub-0.005
 swollen-0.005
Token many
Token ID867
Feature activation-3.15e+00

Pos logit contributions
 Olive+0.034
 Honey+0.033
 scrub+0.033
 Sunny+0.032
 pur+0.032
Neg logit contributions
--0.027
by-0.027
ings-0.027
eg-0.026
ay-0.026
Token animals
Token ID4695
Feature activation-4.58e-01

Pos logit contributions
 Olive+0.089
 scrub+0.088
 Honey+0.087
aj+0.086
 rinse+0.085
Neg logit contributions
eg-0.081
by-0.081
--0.080
 across-0.080
ork-0.079
Token,
Token ID11
Feature activation-3.02e-01

Pos logit contributions
 Olive+0.015
 Honey+0.014
 scrub+0.014
 rinse+0.014
 Sunny+0.013
Neg logit contributions
eg-0.010
by-0.010
ork-0.010
 letter-0.010
ay-0.010
Token but
Token ID475
Feature activation+8.11e-01

Pos logit contributions
 Olive+0.008
 Honey+0.008
 scrub+0.007
 rinse+0.007
aj+0.007
Neg logit contributions
--0.009
 very-0.008
 letter-0.008
 across-0.008
by-0.008
Token his
Token ID465
Feature activation-9.33e-01

Pos logit contributions
-+0.020
 letter+0.019
 thing+0.019
by+0.019
 across+0.019
Neg logit contributions
 Olive-0.024
 scrub-0.024
 Honey-0.024
 rinse-0.024
aj-0.024
Token favorite
Token ID4004
Feature activation+4.32e-01

Pos logit contributions
aj+0.026
 pur+0.026
 Honey+0.026
 Olive+0.025
 Sunny+0.025
Neg logit contributions
ork-0.025
ings-0.023
eg-0.023
ay-0.023
ole-0.023
Tokenpa
Token ID8957
Feature activation-7.19e-02

Pos logit contributions
 Olive+0.041
 scrub+0.041
aj+0.041
 Honey+0.041
 pur+0.041
Neg logit contributions
 across-0.040
ork-0.039
--0.039
eg-0.038
ings-0.038
Token.
Token ID13
Feature activation-8.99e-01

Pos logit contributions
 Honey+0.002
 Olive+0.002
 scrub+0.002
 Sunny+0.002
 rinse+0.002
Neg logit contributions
ork-0.002
--0.001
by-0.001
eg-0.001
ings-0.001
Token Grand
Token ID5675
Feature activation-1.16e+00

Pos logit contributions
 Olive+0.023
 Honey+0.022
 scrub+0.022
 rinse+0.022
aj+0.022
Neg logit contributions
--0.026
by-0.026
 across-0.025
 letter-0.025
eg-0.025
Tokenpa
Token ID8957
Feature activation+7.43e-02

Pos logit contributions
 scrub+0.030
 Olive+0.030
 Honey+0.030
aj+0.030
 rinse+0.029
Neg logit contributions
 across-0.032
--0.032
ork-0.032
eg-0.031
by-0.031
Token has
Token ID468
Feature activation-1.22e+00

Pos logit contributions
-+0.002
by+0.001
 letter+0.001
 across+0.001
eg+0.001
Neg logit contributions
 Olive-0.003
 Honey-0.003
 scrub-0.003
 rinse-0.003
aj-0.002
Token many
Token ID867
Feature activation-3.15e+00

Pos logit contributions
 Olive+0.036
 Honey+0.035
 Sunny+0.035
 scrub+0.035
 pur+0.035
Neg logit contributions
ings-0.031
--0.031
eg-0.030
by-0.030
ork-0.030
Token animals
Token ID4695
Feature activation-4.34e-01

Pos logit contributions
 scrub+0.088
 Olive+0.088
 Honey+0.086
aj+0.085
 pur+0.084
Neg logit contributions
eg-0.083
by-0.082
ork-0.081
 across-0.081
--0.080
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
 Olive+0.014
 scrub+0.013
 Honey+0.013
 rinse+0.013
 Sunny+0.012
Neg logit contributions
eg-0.010
by-0.010
ork-0.010
 letter-0.010
ay-0.010
Token but
Token ID475
Feature activation+8.06e-01

Pos logit contributions
 Olive+0.008
 Honey+0.008
 scrub+0.008
 rinse+0.008
aj+0.008
Neg logit contributions
--0.009
 letter-0.009
 across-0.009
by-0.009
 very-0.009
Token their
Token ID511
Feature activation-1.49e+00

Pos logit contributions
-+0.020
 thing+0.020
 letter+0.020
 very+0.020
by+0.019
Neg logit contributions
 scrub-0.023
 Olive-0.023
 rinse-0.023
aj-0.023
 Honey-0.023
Token favorite
Token ID4004
Feature activation+3.30e-01

Pos logit contributions
 pur+0.040
aj+0.039
 swe+0.039
 Olive+0.038
 scrub+0.038
Neg logit contributions
ork-0.042
ings-0.041
rolled-0.039
ole-0.039
eg-0.039
Token named
Token ID3706
Feature activation-9.80e-01

Pos logit contributions
 Olive+0.032
 Honey+0.032
 rinse+0.032
 scrub+0.032
aj+0.032
Neg logit contributions
--0.037
by-0.034
 letter-0.034
 across-0.034
 thing-0.034
Token Daisy
Token ID40355
Feature activation-2.36e-02

Pos logit contributions
aj+0.025
 rinse+0.025
Fine+0.025
 scrub+0.025
 til+0.024
Neg logit contributions
 very-0.028
 across-0.028
--0.027
by-0.027
 thing-0.027
Token.
Token ID13
Feature activation-9.08e-01

Pos logit contributions
aj+0.001
 Olive+0.001
 rinse+0.001
 scrub+0.001
irms+0.001
Neg logit contributions
--0.000
by-0.000
 very-0.000
 across-0.000
 thing-0.000
Token Daisy
Token ID40355
Feature activation+2.33e-01

Pos logit contributions
aj+0.032
 scrub+0.029
 swollen+0.028
 rinse+0.028
 elbow+0.028
Neg logit contributions
 across-0.025
 very-0.022
 thing-0.022
ay-0.021
s-0.021
Token had
Token ID550
Feature activation+3.32e-02

Pos logit contributions
-+0.004
 very+0.003
by+0.003
 thing+0.003
 across+0.003
Neg logit contributions
 Olive-0.009
aj-0.009
 rinse-0.009
 Honey-0.009
 scrub-0.009
Token big
Token ID1263
Feature activation-3.15e+00

Pos logit contributions
-+0.001
by+0.001
 letter+0.001
 across+0.001
 thing+0.001
Neg logit contributions
 Olive-0.001
 Honey-0.001
 scrub-0.001
 rinse-0.001
aj-0.001
Token dreams
Token ID10625
Feature activation-4.14e-01

Pos logit contributions
 Olive+0.103
 Honey+0.103
 rinse+0.103
 scrub+0.102
Sleep+0.100
Neg logit contributions
--0.071
by-0.066
 letter-0.065
 thing-0.065
 across-0.064
Token of
Token ID286
Feature activation+3.50e-02

Pos logit contributions
 Olive+0.013
 Honey+0.013
 scrub+0.013
 rinse+0.013
aj+0.013
Neg logit contributions
--0.009
by-0.009
eg-0.009
 letter-0.009
 across-0.009
Token exploring
Token ID13504
Feature activation-6.76e-03

Pos logit contributions
 very+0.001
-+0.001
 thing+0.001
 across+0.001
 letter+0.001
Neg logit contributions
aj-0.001
 rinse-0.001
Fine-0.001
 scrub-0.001
irms-0.001
Token the
Token ID262
Feature activation-7.42e-01

Pos logit contributions
aj+0.000
 rinse+0.000
 swollen+0.000
 scrub+0.000
 Honey+0.000
Neg logit contributions
--0.000
 across-0.000
 very-0.000
 thing-0.000
 lot-0.000
Token world
Token ID995
Feature activation+2.28e-01

Pos logit contributions
 Olive+0.029
 Honey+0.029
 scrub+0.029
 rinse+0.028
aj+0.028
Neg logit contributions
--0.011
eg-0.011
 letter-0.011
by-0.011
ings-0.010
Token Sam
Token ID3409
Feature activation+1.79e+00

Pos logit contributions
aj+0.002
Sleep+0.002
 Honey+0.002
 Olive+0.002
 rinse+0.002
Neg logit contributions
--0.002
 very-0.002
 letter-0.002
 across-0.002
by-0.002
Token.
Token ID13
Feature activation+7.93e-01

Pos logit contributions
-+0.045
 letter+0.042
 across+0.042
by+0.042
 thing+0.041
Neg logit contributions
 Olive-0.054
 Honey-0.054
 scrub-0.052
aj-0.052
 rinse-0.052
Token She
Token ID1375
Feature activation+6.66e-01

Pos logit contributions
by+0.027
-+0.027
eg+0.027
 letter+0.026
ings+0.026
Neg logit contributions
 Olive-0.017
 Honey-0.016
 scrub-0.016
 rinse-0.016
 sob-0.015
Token saw
Token ID2497
Feature activation+6.11e-01

Pos logit contributions
-+0.012
by+0.011
 letter+0.011
eg+0.011
 across+0.011
Neg logit contributions
 Olive-0.025
 scrub-0.025
 Honey-0.025
 rinse-0.024
aj-0.024
Token Lily
Token ID20037
Feature activation+4.85e-01

Pos logit contributions
-+0.016
 letter+0.015
 across+0.015
by+0.015
 thing+0.015
Neg logit contributions
aj-0.018
 Honey-0.018
 Olive-0.018
 scrub-0.018
 rinse-0.017
Token's
Token ID338
Feature activation+2.00e+00

Pos logit contributions
by+0.010
eg+0.010
-+0.010
ay+0.010
 letter+0.010
Neg logit contributions
 Olive-0.016
 scrub-0.016
 rinse-0.016
 Honey-0.016
 sob-0.016
Token tears
Token ID10953
Feature activation+2.00e-01

Pos logit contributions
by+0.057
ables+0.055
-+0.054
ay+0.052
eg+0.052
Neg logit contributions
 sob-0.055
 scrub-0.054
 swollen-0.053
 Olive-0.052
 rinse-0.052
Token and
Token ID290
Feature activation+8.99e-01

Pos logit contributions
-+0.004
 letter+0.003
by+0.003
 across+0.003
 thing+0.003
Neg logit contributions
 Olive-0.008
 Honey-0.007
 scrub-0.007
 rinse-0.007
aj-0.007
Token Sam
Token ID3409
Feature activation+1.24e+00

Pos logit contributions
-+0.026
 letter+0.024
by+0.024
 across+0.024
 very+0.024
Neg logit contributions
 Olive-0.024
 Honey-0.024
aj-0.024
 scrub-0.023
 rinse-0.023
Token's
Token ID338
Feature activation+9.58e-01

Pos logit contributions
-+0.022
by+0.021
 letter+0.020
eg+0.020
ork+0.020
Neg logit contributions
 Olive-0.046
 Honey-0.046
 scrub-0.046
 rinse-0.045
aj-0.044
Token cars
Token ID5006
Feature activation+7.07e-01

Pos logit contributions
by+0.026
ables+0.025
-+0.024
 across+0.024
 Exhibit+0.024
Neg logit contributions
 swollen-0.027
 sob-0.027
 scrub-0.027
 rinse-0.026
aj-0.026
Token fence
Token ID13990
Feature activation-3.48e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
 Sunny+0.001
Neg logit contributions
--0.001
by-0.001
eg-0.001
ings-0.001
ork-0.001
Token and
Token ID290
Feature activation+9.82e-01

Pos logit contributions
 Olive+0.012
 scrub+0.012
 Honey+0.012
 rinse+0.011
 pur+0.011
Neg logit contributions
by-0.007
eg-0.007
--0.007
 letter-0.007
ork-0.007
Token goes
Token ID2925
Feature activation+5.41e-01

Pos logit contributions
 across+0.027
-+0.027
 very+0.026
s+0.026
 thing+0.025
Neg logit contributions
aj-0.028
Sleep-0.027
 Honey-0.027
 scrub-0.026
 Olive-0.025
Token to
Token ID284
Feature activation+7.37e-01

Pos logit contributions
eg+0.012
 letter+0.012
by+0.012
ay+0.011
ables+0.011
Neg logit contributions
 Olive-0.018
 scrub-0.018
 Honey-0.017
 rinse-0.017
 Holly-0.017
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
-+0.017
 across+0.017
 letter+0.016
by+0.016
 very+0.016
Neg logit contributions
aj-0.024
 Honey-0.022
 scrub-0.022
 Olive-0.022
 rinse-0.022
Token next
Token ID1306
Feature activation+2.47e+00

Pos logit contributions
-+0.005
eg+0.005
ings+0.005
ay+0.005
by+0.005
Neg logit contributions
 Olive-0.005
 Honey-0.005
 Sunny-0.005
 scrub-0.004
 rinse-0.004
Token yard
Token ID12699
Feature activation+2.89e-01

Pos logit contributions
eg+0.055
-+0.054
ings+0.053
by+0.053
 letter+0.051
Neg logit contributions
 Olive-0.079
 scrub-0.077
 Honey-0.077
 rinse-0.075
 Sunny-0.074
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
eg+0.007
by+0.007
-+0.007
 letter+0.007
ay+0.007
Neg logit contributions
 Olive-0.009
 scrub-0.008
 Honey-0.008
 rinse-0.008
 Buzz-0.008
Token Lily
Token ID20037
Feature activation-7.32e-01

Pos logit contributions
 across+0.002
ork+0.002
 very+0.002
ay+0.002
s+0.002
Neg logit contributions
 scrub-0.004
aj-0.004
 rinse-0.004
 swollen-0.004
Fine-0.004
Token and
Token ID290
Feature activation-1.71e+00

Pos logit contributions
 Honey+0.030
 rinse+0.029
 Olive+0.029
 scrub+0.029
Sleep+0.029
Neg logit contributions
--0.011
s-0.010
 across-0.009
 very-0.009
ables-0.009
Token Max
Token ID5436
Feature activation+1.15e+00

Pos logit contributions
aj+0.041
 scrub+0.040
 rinse+0.040
Sleep+0.039
 Honey+0.039
Neg logit contributions
 letter-0.053
--0.053
 very-0.051
 across-0.051
 thing-0.050
Token he
Token ID339
Feature activation+7.01e-01

Pos logit contributions
 Honey+0.011
 Olive+0.011
 scrub+0.011
 Buzz+0.010
 rinse+0.010
Neg logit contributions
--0.012
eg-0.012
by-0.012
 letter-0.012
ings-0.012
Token saw
Token ID2497
Feature activation-1.65e-01

Pos logit contributions
-+0.015
 letter+0.015
eg+0.014
ings+0.014
ay+0.014
Neg logit contributions
 scrub-0.023
 Olive-0.023
 Honey-0.023
aj-0.022
 rinse-0.022
Token a
Token ID257
Feature activation+1.01e+00

Pos logit contributions
 Olive+0.006
 Honey+0.006
 scrub+0.005
 rinse+0.005
 swollen+0.005
Neg logit contributions
--0.003
by-0.003
eg-0.003
ings-0.003
 letter-0.003
Token shiny
Token ID22441
Feature activation+3.91e-01

Pos logit contributions
ings+0.026
ables+0.026
rolled+0.026
by+0.025
ed+0.024
Neg logit contributions
 pur-0.029
 Olive-0.028
 swollen-0.028
 Honey-0.028
 scrub-0.027
Token planet
Token ID5440
Feature activation-4.25e-01

Pos logit contributions
ables+0.011
spe+0.011
rolled+0.011
s+0.010
by+0.010
Neg logit contributions
 Olive-0.010
 scrub-0.010
 Honey-0.010
 swollen-0.010
 ti-0.010
Token up
Token ID510
Feature activation+2.18e+00

Pos logit contributions
 Olive+0.012
 scrub+0.012
 Honey+0.012
 pur+0.012
 iv+0.012
Neg logit contributions
eg-0.011
by-0.011
--0.010
ork-0.010
ables-0.010
Token in
Token ID287
Feature activation+1.23e+00

Pos logit contributions
-+0.051
eg+0.051
by+0.049
 letter+0.049
ork+0.048
Neg logit contributions
 Olive-0.068
 scrub-0.067
 Honey-0.067
 rinse-0.065
aj-0.064
Token the
Token ID262
Feature activation+6.33e-02

Pos logit contributions
-+0.034
by+0.033
eg+0.033
ables+0.033
ings+0.032
Neg logit contributions
 Honey-0.034
 Olive-0.034
 scrub-0.033
 rinse-0.032
 Sunny-0.032
Token sky
Token ID6766
Feature activation-6.66e-01

Pos logit contributions
-+0.002
ings+0.002
eg+0.002
by+0.002
ork+0.002
Neg logit contributions
 Honey-0.002
 Olive-0.002
 rinse-0.002
 pur-0.002
 scrub-0.002
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 scrub+0.020
 Honey+0.019
 Olive+0.019
 rinse+0.019
aj+0.019
Neg logit contributions
eg-0.016
by-0.016
--0.016
 letter-0.015
ork-0.015
Token Jack
Token ID3619
Feature activation+1.05e+00

Pos logit contributions
-+0.004
 across+0.003
 letter+0.003
by+0.003
eg+0.003
Neg logit contributions
 Olive-0.007
 Honey-0.007
 scrub-0.007
 rinse-0.007
aj-0.007
Token and
Token ID290
Feature activation-4.02e-01

Pos logit contributions
 scrub+0.008
 Honey+0.007
 Olive+0.007
 pur+0.007
 rinse+0.007
Neg logit contributions
 letter-0.005
--0.005
 photo-0.005
ings-0.005
ork-0.005
Token the
Token ID262
Feature activation+8.07e-01

Pos logit contributions
aj+0.014
 rinse+0.014
 Honey+0.013
 scrub+0.013
 Olive+0.013
Neg logit contributions
--0.009
 thing-0.008
 letter-0.008
 across-0.008
 very-0.008
Token cat
Token ID3797
Feature activation+3.37e-01

Pos logit contributions
-+0.018
 letter+0.017
by+0.017
 across+0.016
 thing+0.016
Neg logit contributions
 Olive-0.027
 Honey-0.027
 scrub-0.027
 rinse-0.026
aj-0.026
Token were
Token ID547
Feature activation-2.41e-01

Pos logit contributions
ings+0.009
 letter+0.008
ork+0.008
-+0.008
 photo+0.008
Neg logit contributions
 scrub-0.010
 pur-0.010
 Olive-0.009
 Honey-0.009
 rinse-0.009
Token the
Token ID262
Feature activation-5.26e-03

Pos logit contributions
 Olive+0.007
 Honey+0.007
 scrub+0.007
 rinse+0.007
 pur+0.007
Neg logit contributions
--0.006
ork-0.006
by-0.006
eg-0.006
 letter-0.006
Token best
Token ID1266
Feature activation+2.64e+00

Pos logit contributions
 Olive+0.000
 Honey+0.000
 scrub+0.000
 rinse+0.000
aj+0.000
Neg logit contributions
--0.000
by-0.000
ork-0.000
 across-0.000
ings-0.000
Token of
Token ID286
Feature activation+1.82e+00

Pos logit contributions
by+0.055
ables+0.052
eg+0.052
ings+0.052
 across+0.052
Neg logit contributions
 scrub-0.088
 Olive-0.088
 Honey-0.086
 rinse-0.083
 Buzz-0.083
Token friends
Token ID2460
Feature activation+2.37e-01

Pos logit contributions
-+0.042
 letter+0.039
by+0.039
 thing+0.038
 across+0.038
Neg logit contributions
 Honey-0.059
 Olive-0.059
 scrub-0.058
 rinse-0.058
aj-0.058
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
eg+0.008
ork+0.007
 letter+0.007
ings+0.007
 photo+0.007
Neg logit contributions
 Olive-0.005
 Honey-0.005
 Buzz-0.005
 Sunny-0.005
 pur-0.005
Token And
Token ID843
Feature activation-6.27e-01

Pos logit contributions
-+0.007
by+0.007
 letter+0.007
 across+0.007
ork+0.007
Neg logit contributions
 Olive-0.009
 Honey-0.009
 scrub-0.009
 rinse-0.009
aj-0.009
Token every
Token ID790
Feature activation+1.19e+00

Pos logit contributions
 Olive+0.015
 Buzz+0.014
 Honey+0.014
 Fif+0.013
 Sunny+0.013
Neg logit contributions
ables-0.021
ork-0.021
ings-0.021
--0.020
eg-0.020
Token "
Token ID366
Feature activation-2.95e-02

Pos logit contributions
by+0.018
eg+0.017
ables+0.017
ork+0.017
-+0.017
Neg logit contributions
 Olive-0.020
 Honey-0.019
 Sunny-0.018
 Buzz-0.018
 Holly-0.018
TokenWow
Token ID22017
Feature activation+8.05e-01

Pos logit contributions
aj+0.001
 til+0.001
 scrub+0.001
unn+0.001
 pur+0.001
Neg logit contributions
--0.001
 photo-0.001
 thing-0.001
 across-0.001
 letter-0.001
Token,
Token ID11
Feature activation-8.55e-02

Pos logit contributions
 letter+0.019
eg+0.019
-+0.019
by+0.019
spe+0.018
Neg logit contributions
 Olive-0.025
 Honey-0.024
 rinse-0.024
aj-0.024
 scrub-0.023
Token Lily
Token ID20037
Feature activation-5.93e-02

Pos logit contributions
 Olive+0.002
 Honey+0.002
 rinse+0.002
 scrub+0.002
 Sunny+0.002
Neg logit contributions
ables-0.003
--0.003
by-0.003
eg-0.003
ings-0.003
Token!
Token ID0
Feature activation+8.73e-01

Pos logit contributions
 Olive+0.002
 Honey+0.002
 scrub+0.002
 rinse+0.002
aj+0.002
Neg logit contributions
--0.001
by-0.001
 letter-0.001
 across-0.001
eg-0.001
Token That
Token ID1320
Feature activation+2.38e+00

Pos logit contributions
-+0.020
by+0.019
 letter+0.018
 across+0.018
eg+0.018
Neg logit contributions
 Olive-0.028
 Honey-0.028
 scrub-0.028
 rinse-0.028
aj-0.027
Token apple
Token ID17180
Feature activation-4.28e-02

Pos logit contributions
rolled+0.066
ings+0.063
ole+0.063
spe+0.061
by+0.060
Neg logit contributions
 pur-0.071
 Honey-0.069
 Midnight-0.067
 Olive-0.066
 Sunny-0.065
Token looks
Token ID3073
Feature activation+7.01e-01

Pos logit contributions
 Olive+0.001
 pur+0.001
 Honey+0.001
 iv+0.001
 Sunny+0.001
Neg logit contributions
eg-0.001
by-0.001
rolled-0.001
spe-0.001
ork-0.001
Token so
Token ID523
Feature activation+5.85e-01

Pos logit contributions
by+0.023
rolled+0.022
ork+0.022
eg+0.022
ables+0.022
Neg logit contributions
 Honey-0.016
 pur-0.016
 rinse-0.016
 swe-0.016
 Olive-0.016
Token creative
Token ID7325
Feature activation+4.80e-01

Pos logit contributions
-+0.016
by+0.016
 letter+0.016
 across+0.015
eg+0.015
Neg logit contributions
 Olive-0.016
 Honey-0.016
 scrub-0.016
 rinse-0.015
aj-0.015
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
-+0.010
 across+0.009
 letter+0.009
 thing+0.009
by+0.009
Neg logit contributions
 Olive-0.016
 Honey-0.016
 scrub-0.016
 rinse-0.016
aj-0.016
Token called
Token ID1444
Feature activation+4.41e-01

Pos logit contributions
 Olive+0.020
 scrub+0.020
 Honey+0.020
 pur+0.020
 sob+0.020
Neg logit contributions
eg-0.017
ings-0.017
ork-0.016
ables-0.016
 thing-0.016
Token out
Token ID503
Feature activation+6.36e-01

Pos logit contributions
-+0.008
ables+0.008
eg+0.007
s+0.007
by+0.007
Neg logit contributions
 Olive-0.017
 Honey-0.017
 Buzz-0.016
 Ruby-0.016
 Sunny-0.016
Token:
Token ID25
Feature activation+7.80e-02

Pos logit contributions
ables+0.017
eg+0.017
by+0.016
 wallet+0.016
 photo+0.016
Neg logit contributions
 Olive-0.019
 Buzz-0.018
 Ruby-0.018
 pur-0.018
 Honey-0.018
Token "
Token ID366
Feature activation+1.03e-02

Pos logit contributions
by+0.002
-+0.002
ables+0.002
eg+0.002
ay+0.002
Neg logit contributions
 Olive-0.002
 Honey-0.002
 Buzz-0.002
 Sunny-0.002
 Ruby-0.002
TokenWhere
Token ID8496
Feature activation+4.17e-01

Pos logit contributions
-+0.000
by+0.000
 letter+0.000
 across+0.000
eg+0.000
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
 rinse-0.000
aj-0.000
Token do
Token ID466
Feature activation+2.99e-01

Pos logit contributions
-+0.011
by+0.011
eg+0.011
 across+0.011
ings+0.011
Neg logit contributions
 Olive-0.012
 rinse-0.011
 scrub-0.011
 Honey-0.011
 Buzz-0.011
Token you
Token ID345
Feature activation+6.16e-01

Pos logit contributions
by+0.011
eg+0.010
 across+0.010
-+0.010
ings+0.010
Neg logit contributions
 Olive-0.006
 Honey-0.006
 Sunny-0.005
 Buzz-0.005
 Ruby-0.005
Token think
Token ID892
Feature activation-5.67e-01

Pos logit contributions
-+0.017
by+0.016
ork+0.016
eg+0.016
 letter+0.016
Neg logit contributions
 Olive-0.017
 Honey-0.017
 scrub-0.017
 rinse-0.017
 pur-0.016
Token you
Token ID345
Feature activation+3.75e-01

Pos logit contributions
 Honey+0.013
 Olive+0.013
 rinse+0.012
 scrub+0.012
 Sunny+0.012
Neg logit contributions
by-0.018
eg-0.018
--0.018
ork-0.017
ings-0.017
Token're
Token ID821
Feature activation+1.41e+00

Pos logit contributions
-+0.010
ork+0.010
by+0.010
eg+0.010
ings+0.010
Neg logit contributions
 scrub-0.010
 Honey-0.010
 rinse-0.010
 pur-0.010
 Olive-0.010
Token going
Token ID1016
Feature activation+1.51e+00

Pos logit contributions
-+0.041
by+0.041
eg+0.040
ork+0.040
ables+0.039
Neg logit contributions
 Honey-0.036
 Olive-0.036
 scrub-0.035
 rinse-0.035
 Buzz-0.034
Token tape
Token ID9154
Feature activation+5.08e-01

Pos logit contributions
by+0.006
eg+0.006
-+0.006
ables+0.006
ings+0.006
Neg logit contributions
 scrub-0.005
 rinse-0.005
 Olive-0.005
 Honey-0.005
 swollen-0.005
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
eg+0.014
by+0.014
ables+0.013
rolled+0.013
-+0.013
Neg logit contributions
 scrub-0.014
 Olive-0.014
 Honey-0.014
 rinse-0.013
 Midnight-0.013
Token she
Token ID673
Feature activation-6.64e-02

Pos logit contributions
eg+0.043
-+0.043
ables+0.042
ings+0.042
by+0.042
Neg logit contributions
 scrub-0.029
 Olive-0.029
 rinse-0.028
 Honey-0.028
 pur-0.027
Token said
Token ID531
Feature activation+7.25e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
aj+0.001
Neg logit contributions
--0.002
by-0.002
 letter-0.002
 across-0.002
eg-0.002
Token "
Token ID366
Feature activation-8.63e-02

Pos logit contributions
-+0.019
by+0.019
eg+0.018
 letter+0.018
 across+0.018
Neg logit contributions
 Olive-0.021
 Honey-0.021
 scrub-0.020
 rinse-0.020
aj-0.019
TokenYou
Token ID1639
Feature activation+5.89e-01

Pos logit contributions
 Honey+0.002
 scrub+0.002
 Olive+0.002
Sleep+0.002
Fine+0.002
Neg logit contributions
eg-0.003
--0.003
by-0.003
ings-0.003
 letter-0.003
Token can
Token ID460
Feature activation-7.12e-01

Pos logit contributions
-+0.017
ings+0.017
by+0.016
 across+0.016
eg+0.016
Neg logit contributions
 rinse-0.016
 Honey-0.015
 scrub-0.015
 Olive-0.015
 wipe-0.014
Token make
Token ID787
Feature activation-5.20e-02

Pos logit contributions
 Olive+0.019
 Honey+0.019
 scrub+0.019
 rinse+0.018
aj+0.018
Neg logit contributions
--0.020
by-0.019
 letter-0.019
 across-0.019
eg-0.019
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
 Honey+0.002
 Olive+0.002
 scrub+0.002
 rinse+0.002
 pur+0.002
Neg logit contributions
by-0.001
eg-0.001
spe-0.001
--0.001
 across-0.001
Token pretend
Token ID16614
Feature activation+9.80e-02

Pos logit contributions
rolled+0.004
spe+0.004
ed+0.004
attered+0.004
ables+0.004
Neg logit contributions
 pur-0.004
 scrub-0.004
 rinse-0.004
 swe-0.004
 wipe-0.004
Token movie
Token ID3807
Feature activation+9.61e-01

Pos logit contributions
by+0.002
rolled+0.002
attered+0.002
eg+0.002
spe+0.002
Neg logit contributions
 pur-0.003
 scrub-0.003
 Sunny-0.003
 swe-0.003
 Ruby-0.003
Token a
Token ID257
Feature activation-3.02e-01

Pos logit contributions
 Olive+0.004
 Honey+0.004
 Buzz+0.004
 Sunny+0.004
 pur+0.004
Neg logit contributions
by-0.004
 letter-0.004
spe-0.004
eg-0.004
ables-0.004
Token hat
Token ID6877
Feature activation+1.13e+00

Pos logit contributions
 Honey+0.008
 Olive+0.008
 Sunny+0.007
 pur+0.007
 Buzz+0.007
Neg logit contributions
rolled-0.009
spe-0.009
by-0.009
ings-0.009
--0.009
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
-+0.025
by+0.024
 letter+0.024
eg+0.023
 across+0.023
Neg logit contributions
 Olive-0.037
 Honey-0.036
 scrub-0.036
 rinse-0.035
aj-0.035
Token
Token ID220
Feature activation+1.21e+00

Pos logit contributions
-+0.008
 across+0.008
eg+0.008
by+0.008
 letter+0.008
Neg logit contributions
 Olive-0.013
 scrub-0.013
 rinse-0.013
 Honey-0.013
aj-0.013
Token
Token ID198
Feature activation+8.26e-01

Pos logit contributions
-+0.032
 letter+0.031
by+0.031
 thing+0.030
 photo+0.030
Neg logit contributions
 Olive-0.035
 Honey-0.034
 scrub-0.033
 rinse-0.033
aj-0.033
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
-+0.020
by+0.019
 letter+0.018
 across+0.018
eg+0.018
Neg logit contributions
 Olive-0.026
 Honey-0.026
 scrub-0.025
 rinse-0.025
aj-0.025
TokenOne
Token ID3198
Feature activation-2.70e-01

Pos logit contributions
-+0.009
 across+0.008
by+0.008
 letter+0.008
 thing+0.008
Neg logit contributions
 Olive-0.005
 Honey-0.005
 rinse-0.005
 scrub-0.005
aj-0.005
Token day
Token ID1110
Feature activation+9.29e-02

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.009
 Buzz+0.009
Neg logit contributions
ings-0.005
eg-0.005
--0.005
ork-0.005
 letter-0.005
Token,
Token ID11
Feature activation+9.81e-02

Pos logit contributions
 letter+0.002
by+0.002
ables+0.002
eg+0.002
ork+0.002
Neg logit contributions
 Olive-0.003
 Honey-0.003
 Buzz-0.003
 scrub-0.003
 Sunny-0.003
Token he
Token ID339
Feature activation+2.88e-01

Pos logit contributions
ables+0.003
ork+0.003
by+0.003
eg+0.003
ings+0.003
Neg logit contributions
 Olive-0.003
 Buzz-0.003
 Honey-0.002
 Sunny-0.002
 Fif-0.002
Token and
Token ID290
Feature activation+3.05e-01

Pos logit contributions
ay+0.007
ings+0.006
 letter+0.006
eg+0.006
-+0.006
Neg logit contributions
 Olive-0.009
 Honey-0.009
 scrub-0.009
 sob-0.009
 pur-0.009
Token!
Token ID0
Feature activation+3.54e-01

Pos logit contributions
 letter+0.010
by+0.010
eg+0.010
 across+0.010
-+0.010
Neg logit contributions
 Olive-0.013
 Honey-0.013
 scrub-0.013
 rinse-0.013
aj-0.013
Token Tom
Token ID4186
Feature activation+3.04e-01

Pos logit contributions
-+0.011
 letter+0.010
by+0.010
 thing+0.010
ings+0.010
Neg logit contributions
 Olive-0.009
 Honey-0.009
aj-0.009
 Midnight-0.009
 rinse-0.009
Token would
Token ID561
Feature activation+1.24e-01

Pos logit contributions
 letter+0.006
-+0.006
ings+0.006
ork+0.006
 photo+0.006
Neg logit contributions
 scrub-0.010
 Olive-0.010
 Honey-0.010
 pur-0.010
aj-0.010
Token watch
Token ID2342
Feature activation-4.29e-01

Pos logit contributions
-+0.004
by+0.004
 letter+0.004
eg+0.003
ork+0.003
Neg logit contributions
 Olive-0.003
 scrub-0.003
 Honey-0.003
 rinse-0.003
 pur-0.003
Token and
Token ID290
Feature activation+5.91e-01

Pos logit contributions
 Olive+0.011
 Honey+0.011
 Holly+0.011
 Ruby+0.011
 Midnight+0.010
Neg logit contributions
ork-0.013
 letter-0.013
spe-0.013
 wallet-0.013
 photo-0.012
Token smile
Token ID8212
Feature activation+4.80e-01

Pos logit contributions
-+0.018
by+0.018
 letter+0.018
eg+0.017
 across+0.017
Neg logit contributions
 Olive-0.014
 Honey-0.014
 scrub-0.014
 rinse-0.014
aj-0.013
Token,
Token ID11
Feature activation-6.10e-01

Pos logit contributions
ork+0.013
eg+0.013
 letter+0.013
 photo+0.013
 wallet+0.012
Neg logit contributions
 Honey-0.013
 Olive-0.013
 pur-0.013
 scrub-0.013
 Sunny-0.012
Token at
Token ID379
Feature activation+7.06e-01

Pos logit contributions
 Olive+0.018
 Honey+0.018
 scrub+0.018
 rinse+0.018
aj+0.018
Neg logit contributions
--0.015
by-0.014
 letter-0.014
eg-0.014
 across-0.014
Token how
Token ID703
Feature activation-2.17e-01

Pos logit contributions
-+0.023
 letter+0.023
by+0.022
ork+0.022
 across+0.022
Neg logit contributions
 Olive-0.016
 Honey-0.016
 scrub-0.015
 rinse-0.015
 Holly-0.015
Token adorable
Token ID29012
Feature activation+1.92e-01

Pos logit contributions
 Olive+0.007
 Honey+0.006
 scrub+0.006
 rinse+0.006
 Holly+0.006
Neg logit contributions
 letter-0.005
--0.005
ings-0.005
eg-0.005
 photo-0.005
Token Lucky
Token ID27963
Feature activation+2.37e-01

Pos logit contributions
 letter+0.006
ork+0.006
eg+0.006
-+0.006
by+0.006
Neg logit contributions
 Olive-0.005
 Honey-0.005
 Holly-0.005
 Midnight-0.005
 scrub-0.005
Token fun
Token ID1257
Feature activation+3.03e-02

Pos logit contributions
-+0.003
by+0.002
 across+0.002
 very+0.002
 letter+0.002
Neg logit contributions
 Olive-0.004
 Honey-0.004
 rinse-0.004
 scrub-0.004
aj-0.004
Token at
Token ID379
Feature activation+2.99e-01

Pos logit contributions
-+0.001
 very+0.001
 across+0.001
 thing+0.001
by+0.001
Neg logit contributions
 Olive-0.001
 Honey-0.001
 swollen-0.001
 rinse-0.001
 Midnight-0.001
Token school
Token ID1524
Feature activation-3.32e-01

Pos logit contributions
-+0.007
 across+0.006
 very+0.006
 letter+0.006
 thing+0.006
Neg logit contributions
 Olive-0.010
 Honey-0.010
aj-0.010
 rinse-0.010
 scrub-0.010
Token."
Token ID526
Feature activation-5.55e-03

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.008
aj+0.008
Neg logit contributions
--0.010
by-0.009
 letter-0.009
 across-0.009
eg-0.009
Token
Token ID220
Feature activation+6.49e-01

Pos logit contributions
 swollen+0.000
 rinse+0.000
 iv+0.000
 scrub+0.000
 Honey+0.000
Neg logit contributions
--0.000
s-0.000
 very-0.000
by-0.000
 across-0.000
Token
Token ID198
Feature activation+9.91e-01

Pos logit contributions
-+0.015
by+0.014
 letter+0.014
 across+0.014
eg+0.014
Neg logit contributions
 Olive-0.021
 Honey-0.021
 scrub-0.020
 rinse-0.020
aj-0.020
Token
Token ID198
Feature activation+5.03e-01

Pos logit contributions
-+0.019
s+0.018
by+0.018
 across+0.018
 very+0.018
Neg logit contributions
 Honey-0.035
 rinse-0.035
 Olive-0.035
 swollen-0.035
 scrub-0.035
TokenL
Token ID43
Feature activation+1.34e-01

Pos logit contributions
-+0.016
 very+0.015
 lot+0.015
 thing+0.015
ork+0.015
Neg logit contributions
 Honey-0.012
 Olive-0.012
 til-0.012
 Buzz-0.012
 swollen-0.012
Tokenily
Token ID813
Feature activation-5.31e-01

Pos logit contributions
-+0.006
ork+0.005
 thing+0.005
by+0.005
 letter+0.005
Neg logit contributions
 swollen-0.002
 Honey-0.002
 Olive-0.002
 rinse-0.002
 scrub-0.002
Token walked
Token ID6807
Feature activation-4.75e-01

Pos logit contributions
 Honey+0.020
 Olive+0.020
 swollen+0.020
 rinse+0.020
Sleep+0.019
Neg logit contributions
--0.010
by-0.009
 very-0.009
s-0.008
 across-0.008
Token into
Token ID656
Feature activation-2.26e-01

Pos logit contributions
 Olive+0.016
 scrub+0.016
 Holly+0.016
 Honey+0.015
 pur+0.015
Neg logit contributions
eg-0.010
ings-0.010
 letter-0.010
ork-0.010
ay-0.009
Token,
Token ID11
Feature activation+2.13e-03

Pos logit contributions
-+0.003
by+0.003
 letter+0.003
 across+0.003
 thing+0.003
Neg logit contributions
 Olive-0.005
 Honey-0.005
 scrub-0.005
 rinse-0.005
aj-0.005
Token the
Token ID262
Feature activation-3.39e-02

Pos logit contributions
-+0.000
by+0.000
eg+0.000
 letter+0.000
 across+0.000
Neg logit contributions
 Olive-0.000
 Honey-0.000
 scrub-0.000
 rinse-0.000
 Sunny-0.000
Token lady
Token ID10846
Feature activation+9.30e-01

Pos logit contributions
 Olive+0.001
 Honey+0.001
 scrub+0.001
 rinse+0.001
aj+0.001
Neg logit contributions
--0.001
by-0.001
eg-0.001
ings-0.001
ork-0.001
Token drove
Token ID10357
Feature activation+7.39e-01

Pos logit contributions
ings+0.028
ork+0.027
eg+0.026
 photo+0.026
ay+0.026
Neg logit contributions
 scrub-0.024
aj-0.021
 sob-0.021
 Honey-0.021
 pur-0.021
Token to
Token ID284
Feature activation-3.04e-01

Pos logit contributions
eg+0.021
ables+0.019
oc+0.019
ay+0.019
ork+0.018
Neg logit contributions
 Olive-0.022
 scrub-0.020
 pur-0.020
 Sunny-0.020
 Honey-0.020
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.008
 Sunny+0.008
Neg logit contributions
--0.008
eg-0.008
by-0.008
ables-0.008
ings-0.008
Token store
Token ID3650
Feature activation-7.77e-02

Pos logit contributions
 Olive+0.004
 scrub+0.004
 swe+0.004
 pur+0.004
 Sunny+0.003
Neg logit contributions
ings-0.004
ed-0.004
umbled-0.004
eg-0.004
rolled-0.004
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 Olive+0.002
 scrub+0.002
 sob+0.002
aj+0.002
 Sunny+0.002
Neg logit contributions
eg-0.002
by-0.002
ork-0.002
ay-0.002
ables-0.002
Token At
Token ID1629
Feature activation-3.97e-01

Pos logit contributions
-+0.004
 across+0.004
 letter+0.004
by+0.004
 thing+0.003
Neg logit contributions
 scrub-0.004
 Olive-0.004
 rinse-0.004
 swollen-0.004
aj-0.004
Token the
Token ID262
Feature activation+4.58e-01

Pos logit contributions
 Olive+0.011
 Honey+0.011
 scrub+0.011
 rinse+0.011
aj+0.011
Neg logit contributions
--0.011
by-0.010
eg-0.010
 letter-0.010
ork-0.010
Token store
Token ID3650
Feature activation-1.06e-02

Pos logit contributions
by+0.021
ings+0.020
-+0.020
ret+0.020
spe+0.020
Neg logit contributions
 swe-0.004
 rinse-0.003
 Olive-0.003
 Honey-0.003
Sleep-0.003
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
eg+0.020
by+0.020
ables+0.020
ork+0.019
ings+0.018
Neg logit contributions
 scrub-0.022
 Olive-0.021
 rinse-0.021
 Honey-0.020
 pur-0.020
Token It
Token ID632
Feature activation-3.64e-01

Pos logit contributions
-+0.009
by+0.009
 letter+0.008
eg+0.008
ings+0.008
Neg logit contributions
 Olive-0.011
 Honey-0.010
 Sunny-0.010
 Buzz-0.010
 scrub-0.010
Token had
Token ID550
Feature activation+8.13e-01

Pos logit contributions
 Olive+0.015
 Honey+0.015
 scrub+0.014
 pur+0.014
 rinse+0.014
Neg logit contributions
--0.005
by-0.005
eg-0.005
ings-0.005
 letter-0.005
Token lots
Token ID6041
Feature activation-1.19e+00

Pos logit contributions
ings+0.024
by+0.024
ole+0.024
eg+0.023
-+0.022
Neg logit contributions
 pur-0.022
 Sunny-0.020
 Olive-0.020
 honey-0.020
 iv-0.020
Token of
Token ID286
Feature activation+1.76e-01

Pos logit contributions
 Olive+0.041
 scrub+0.040
 Honey+0.040
aj+0.039
 pur+0.039
Neg logit contributions
by-0.025
eg-0.023
--0.022
 letter-0.022
ings-0.022
Token colorful
Token ID20239
Feature activation-6.55e-01

Pos logit contributions
by+0.004
eg+0.004
-+0.004
ings+0.004
spe+0.004
Neg logit contributions
 Olive-0.005
 scrub-0.005
 pur-0.005
 Honey-0.005
 sob-0.005
Token books
Token ID3835
Feature activation+9.61e-01

Pos logit contributions
 pur+0.022
aj+0.022
 scrub+0.022
 Olive+0.022
 Honey+0.021
Neg logit contributions
by-0.013
eg-0.013
spe-0.013
ole-0.012
 across-0.012
Token that
Token ID326
Feature activation-3.97e-01

Pos logit contributions
by+0.029
eg+0.027
ables+0.025
 fortune+0.025
 letter+0.024
Neg logit contributions
 scrub-0.025
 Olive-0.024
 rinse-0.024
 Honey-0.023
 iv-0.023
Token made
Token ID925
Feature activation+4.71e-01

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.009
 pur+0.009
Neg logit contributions
--0.013
by-0.012
ork-0.012
eg-0.012
 letter-0.012
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
eg+0.015
by+0.014
ables+0.014
oc+0.014
 wallet+0.014
Neg logit contributions
 scrub-0.011
 Honey-0.011
 Buzz-0.011
 rinse-0.011
 Olive-0.011
Token room
Token ID2119
Feature activation+3.12e-01

Pos logit contributions
 Olive+0.004
 Honey+0.004
 pur+0.004
 Ruby+0.004
 Sunny+0.004
Neg logit contributions
--0.003
eg-0.003
rolled-0.003
ings-0.003
by-0.003
Token crab
Token ID32202
Feature activation-6.29e-01

Pos logit contributions
eg+0.000
ork+0.000
ings+0.000
ables+0.000
spe+0.000
Neg logit contributions
 Olive-0.001
 Honey-0.001
 scrub-0.001
 rinse-0.001
 Sunny-0.001
Token went
Token ID1816
Feature activation+1.75e-01

Pos logit contributions
 Olive+0.016
 Honey+0.016
 scrub+0.016
 pur+0.015
 rinse+0.015
Neg logit contributions
eg-0.018
 letter-0.017
by-0.017
ings-0.017
ables-0.017
Token to
Token ID284
Feature activation-7.42e-01

Pos logit contributions
-+0.003
 across+0.003
 very+0.003
 letter+0.003
 thing+0.003
Neg logit contributions
 rinse-0.007
 Honey-0.007
aj-0.007
 scrub-0.007
 Olive-0.007
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
aj+0.026
 Olive+0.025
 Honey+0.024
 swollen+0.024
 scrub+0.024
Neg logit contributions
--0.015
 across-0.015
 very-0.015
 letter-0.015
 thing-0.015
Token park
Token ID3952
Feature activation-5.53e-01

Pos logit contributions
 Olive+0.009
 Honey+0.008
 rinse+0.008
 scrub+0.008
 Sunny+0.008
Neg logit contributions
--0.011
ings-0.010
by-0.010
eg-0.010
ay-0.010
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 Olive+0.017
 scrub+0.016
 Honey+0.016
 Buzz+0.016
 pur+0.016
Neg logit contributions
eg-0.013
 letter-0.013
by-0.013
ay-0.013
ork-0.013
Token The
Token ID383
Feature activation-1.19e-01

Pos logit contributions
 Olive+0.019
 Honey+0.019
 Buzz+0.018
 rinse+0.018
 Sunny+0.018
Neg logit contributions
--0.014
by-0.014
 letter-0.014
ork-0.013
eg-0.013
Token crab
Token ID32202
Feature activation-3.84e-01

Pos logit contributions
 Olive+0.004
 Honey+0.003
 rinse+0.003
 scrub+0.003
aj+0.003
Neg logit contributions
--0.003
ings-0.003
by-0.003
ables-0.003
eg-0.003
Token was
Token ID373
Feature activation-5.67e-01

Pos logit contributions
 Honey+0.015
 Olive+0.015
 rinse+0.015
 scrub+0.015
aj+0.015
Neg logit contributions
--0.006
by-0.005
 across-0.005
 letter-0.005
 very-0.005
Token very
Token ID845
Feature activation-4.34e-01

Pos logit contributions
 Honey+0.022
aj+0.022
 rinse+0.022
 Olive+0.022
 scrub+0.022
Neg logit contributions
 very-0.010
--0.009
 across-0.009
 letter-0.008
 thing-0.008
Token honest
Token ID5508
Feature activation-2.34e-01

Pos logit contributions
 Honey+0.015
 Olive+0.015
 scrub+0.015
 iv+0.015
Sleep+0.015
Neg logit contributions
--0.010
 very-0.009
by-0.009
 letter-0.009
 thing-0.008
Token't
Token ID470
Feature activation+3.65e-01

Pos logit contributions
 Olive+0.002
 Honey+0.002
 scrub+0.002
 rinse+0.002
aj+0.002
Neg logit contributions
--0.002
by-0.002
 letter-0.001
 thing-0.001
 across-0.001
Token want
Token ID765
Feature activation-6.62e-01

Pos logit contributions
-+0.010
by+0.009
 letter+0.009
 thing+0.009
 across+0.009
Neg logit contributions
 Olive-0.010
 Honey-0.010
aj-0.010
 scrub-0.010
 rinse-0.010
Token to
Token ID284
Feature activation-3.58e-01

Pos logit contributions
 Olive+0.026
 Honey+0.025
 scrub+0.025
 rinse+0.024
 pur+0.024
Neg logit contributions
eg-0.011
spe-0.011
--0.011
ables-0.011
by-0.011
Token throw
Token ID3714
Feature activation-3.28e-01

Pos logit contributions
 Honey+0.010
 Olive+0.009
aj+0.009
 scrub+0.009
Sleep+0.009
Neg logit contributions
--0.011
by-0.010
 letter-0.010
 thing-0.010
 very-0.010
Token her
Token ID607
Feature activation-6.15e-01

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.009
 rinse+0.008
irms+0.008
Neg logit contributions
spe-0.010
ables-0.010
eg-0.010
 Exhibit-0.010
rolled-0.009
Token picture
Token ID4286
Feature activation+7.95e-02

Pos logit contributions
 swollen+0.018
 sob+0.018
 Ruby+0.018
 Olive+0.017
 wipe+0.017
Neg logit contributions
spe-0.015
rolled-0.015
ables-0.015
rolley-0.015
 Exhibit-0.015
Token away
Token ID1497
Feature activation+1.28e-01

Pos logit contributions
eg+0.002
by+0.002
-+0.002
 letter+0.002
ables+0.002
Neg logit contributions
 Olive-0.002
 scrub-0.002
 rinse-0.002
 Honey-0.002
 pur-0.002
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
eg+0.004
by+0.003
ables+0.003
 letter+0.003
ay+0.003
Neg logit contributions
 Olive-0.004
 Honey-0.004
 scrub-0.003
 rinse-0.003
 Buzz-0.003
Token She
Token ID1375
Feature activation-4.24e-01

Pos logit contributions
-+0.004
by+0.003
 letter+0.003
eg+0.003
 across+0.003
Neg logit contributions
 Olive-0.003
 Honey-0.003
 scrub-0.003
 rinse-0.003
aj-0.003
Token decided
Token ID3066
Feature activation+4.22e-01

Pos logit contributions
 Honey+0.013
 Olive+0.013
 rinse+0.012
aj+0.012
Sleep+0.012
Neg logit contributions
--0.011
by-0.011
 letter-0.010
 very-0.010
 across-0.010
Token to
Token ID284
Feature activation-9.26e-01

Pos logit contributions
-+0.006
eg+0.006
by+0.006
ay+0.005
ings+0.005
Neg logit contributions
 Olive-0.017
 Honey-0.017
 scrub-0.017
 rinse-0.017
 pur-0.017
Token on
Token ID319
Feature activation-9.28e-01

Pos logit contributions
-+0.008
by+0.008
 letter+0.007
eg+0.007
 across+0.007
Neg logit contributions
 Olive-0.015
 Honey-0.015
 scrub-0.015
 rinse-0.015
aj-0.014
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
 Olive+0.029
 Honey+0.028
 scrub+0.028
 rinse+0.027
 Sunny+0.027
Neg logit contributions
 letter-0.022
eg-0.022
--0.022
by-0.021
 photo-0.021
Token the
Token ID262
Feature activation-5.69e-02

Pos logit contributions
 Olive+0.009
 Honey+0.009
 scrub+0.008
 rinse+0.008
 Buzz+0.008
Neg logit contributions
--0.009
 letter-0.009
by-0.009
eg-0.009
ork-0.009
Token cat
Token ID3797
Feature activation-3.16e-01

Pos logit contributions
 Olive+0.002
 Honey+0.002
 scrub+0.002
 rinse+0.002
aj+0.002
Neg logit contributions
--0.001
 letter-0.001
by-0.001
 thing-0.001
 across-0.001
Token would
Token ID561
Feature activation+2.01e-01

Pos logit contributions
 Olive+0.011
 Honey+0.011
 rinse+0.011
aj+0.011
 scrub+0.011
Neg logit contributions
--0.006
by-0.006
 across-0.006
 letter-0.006
 thing-0.006
Token wave
Token ID6769
Feature activation+2.75e-02

Pos logit contributions
-+0.005
by+0.005
 very+0.005
 letter+0.005
 across+0.005
Neg logit contributions
aj-0.006
irms-0.006
 Olive-0.006
 Honey-0.006
 swollen-0.006
Token his
Token ID465
Feature activation-7.99e-01

Pos logit contributions
eg+0.001
 photo+0.001
 letter+0.001
ork+0.001
 wallet+0.001
Neg logit contributions
 Olive-0.001
 pur-0.001
 scrub-0.001
 Honey-0.001
 Ruby-0.001
Token paw
Token ID41827
Feature activation-1.09e+00

Pos logit contributions
 scrub+0.022
 Olive+0.021
aj+0.021
 pur+0.021
 rinse+0.021
Neg logit contributions
eg-0.021
ork-0.021
--0.021
ings-0.020
spe-0.020
Token at
Token ID379
Feature activation+4.83e-01

Pos logit contributions
 Olive+0.036
 scrub+0.036
 Honey+0.035
 rinse+0.035
 pur+0.034
Neg logit contributions
eg-0.023
--0.023
by-0.022
ork-0.022
 letter-0.022
Token the
Token ID262
Feature activation-6.16e-02

Pos logit contributions
eg+0.013
 letter+0.013
ork+0.013
ables+0.013
spe+0.013
Neg logit contributions
 Olive-0.013
 Honey-0.013
 Buzz-0.013
 Sunny-0.013
 scrub-0.013
Token hole
Token ID7604
Feature activation-1.55e+00

Pos logit contributions
 Olive+0.002
 Honey+0.002
 pur+0.002
 scrub+0.002
 Midnight+0.002
Neg logit contributions
--0.002
eg-0.002
ings-0.002
by-0.002
ork-0.001
Token to
Token ID284
Feature activation-6.39e-01

Pos logit contributions
eg+0.021
ork+0.021
ay+0.020
-+0.020
 letter+0.020
Neg logit contributions
 scrub-0.036
 Olive-0.035
 Honey-0.035
 rinse-0.035
 pur-0.034
Token have
Token ID423
Feature activation-2.17e-01

Pos logit contributions
 scrub+0.019
 Honey+0.019
 Olive+0.019
 rinse+0.019
 pur+0.018
Neg logit contributions
--0.016
by-0.015
eg-0.015
 letter-0.015
ork-0.015
Token a
Token ID257
Feature activation-3.81e-01

Pos logit contributions
 scrub+0.006
 Olive+0.006
 pur+0.006
 Honey+0.006
 rinse+0.006
Neg logit contributions
eg-0.006
--0.006
by-0.006
ings-0.005
ay-0.005
Token life
Token ID1204
Feature activation-7.16e-01

Pos logit contributions
 scrub+0.011
 pur+0.011
 rinse+0.011
 Olive+0.010
 swe+0.010
Neg logit contributions
ings-0.010
ed-0.009
--0.009
attered-0.009
ay-0.009
Token of
Token ID286
Feature activation-3.41e-01

Pos logit contributions
 scrub+0.019
 rinse+0.018
 pur+0.018
 swe+0.017
 Honey+0.017
Neg logit contributions
eg-0.020
ork-0.019
by-0.019
rolled-0.018
ay-0.018
Token its
Token ID663
Feature activation-1.90e+00

Pos logit contributions
 Olive+0.011
 Honey+0.010
 scrub+0.010
 rinse+0.010
aj+0.010
Neg logit contributions
--0.008
by-0.008
eg-0.008
 letter-0.008
ork-0.008
Token own
Token ID898
Feature activation-2.42e-01

Pos logit contributions
aj+0.056
 pur+0.055
 scrub+0.054
 Olive+0.053
 Honey+0.052
Neg logit contributions
--0.045
ork-0.045
eg-0.044
ay-0.042
ables-0.042
Token.
Token ID13
Feature activation-5.16e-02

Pos logit contributions
aj+0.006
 scrub+0.006
 pur+0.006
 rinse+0.006
 iv+0.005
Neg logit contributions
eg-0.007
ork-0.007
 letter-0.006
ay-0.006
--0.006
Token He
Token ID679
Feature activation-2.20e-01

Pos logit contributions
 Olive+0.002
 scrub+0.002
 Honey+0.002
 rinse+0.002
 swollen+0.002
Neg logit contributions
--0.001
 across-0.001
by-0.001
 letter-0.001
eg-0.001
Token would
Token ID561
Feature activation-7.93e-02

Pos logit contributions
 scrub+0.007
 Olive+0.007
 Honey+0.007
 rinse+0.007
aj+0.007
Neg logit contributions
--0.005
 letter-0.005
ings-0.005
by-0.005
eg-0.005
Token watch
Token ID2342
Feature activation-6.00e-01

Pos logit contributions
 Olive+0.002
 Honey+0.002
 scrub+0.002
 rinse+0.002
aj+0.002
Neg logit contributions
--0.002
by-0.002
 letter-0.002
 across-0.002
eg-0.002
Token,
Token ID11
Feature activation+3.83e-01

Pos logit contributions
 Honey+0.052
 Olive+0.052
 scrub+0.051
 rinse+0.050
 pur+0.049
Neg logit contributions
--0.037
by-0.036
eg-0.036
 letter-0.036
ork-0.035
Token but
Token ID475
Feature activation+1.24e+00

Pos logit contributions
-+0.014
by+0.014
eg+0.013
ings+0.013
ork+0.013
Neg logit contributions
 Olive-0.008
 scrub-0.008
 Honey-0.007
 rinse-0.007
 pur-0.007
Token it
Token ID340
Feature activation-2.79e-01

Pos logit contributions
eg+0.037
ay+0.036
ings+0.036
ork+0.036
ables+0.036
Neg logit contributions
 scrub-0.032
 Olive-0.032
 Honey-0.031
 rinse-0.030
 Midnight-0.029
Token looked
Token ID3114
Feature activation+1.03e+00

Pos logit contributions
 Olive+0.011
 scrub+0.011
 Honey+0.011
 rinse+0.010
 pur+0.010
Neg logit contributions
--0.004
 letter-0.004
eg-0.004
by-0.004
 photo-0.004
Token a
Token ID257
Feature activation+6.06e-01

Pos logit contributions
eg+0.027
ork+0.025
by+0.025
ay+0.025
rolled+0.025
Neg logit contributions
 scrub-0.032
 Honey-0.031
 swe-0.031
 rinse-0.031
 pur-0.031
Token bit
Token ID1643
Feature activation-2.13e+00

Pos logit contributions
rolled+0.019
ables+0.018
by+0.018
eg+0.018
ings+0.018
Neg logit contributions
 scrub-0.015
 rinse-0.015
 swollen-0.015
 Honey-0.014
 pur-0.014
Token messy
Token ID24097
Feature activation-6.99e-01

Pos logit contributions
aj+0.066
 Olive+0.066
 Honey+0.065
Sleep+0.063
 scrub+0.063
Neg logit contributions
--0.053
 very-0.051
 across-0.050
 thing-0.049
by-0.048
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
 Olive+0.023
 Honey+0.023
 scrub+0.022
 rinse+0.022
aj+0.022
Neg logit contributions
--0.015
by-0.015
eg-0.015
 letter-0.015
ork-0.014
Token He
Token ID679
Feature activation+1.34e-01

Pos logit contributions
 Olive+0.005
 Honey+0.005
 scrub+0.005
 Buzz+0.004
 rinse+0.004
Neg logit contributions
--0.006
 letter-0.006
by-0.006
eg-0.006
 photo-0.006
Token asked
Token ID1965
Feature activation+2.89e-01

Pos logit contributions
ings+0.003
ay+0.003
-+0.003
 letter+0.003
eg+0.003
Neg logit contributions
 scrub-0.004
 Olive-0.004
 Honey-0.004
 rinse-0.004
 pur-0.004
Token his
Token ID465
Feature activation-5.48e-01

Pos logit contributions
ables+0.006
eg+0.006
by+0.006
ork+0.006
-+0.006
Neg logit contributions
 scrub-0.010
 Olive-0.010
 Honey-0.010
 Buzz-0.009
 rinse-0.009
Token the
Token ID262
Feature activation+1.89e-01

Pos logit contributions
 Olive+0.007
 scrub+0.007
 Honey+0.007
 rinse+0.007
aj+0.007
Neg logit contributions
--0.005
by-0.005
eg-0.005
 letter-0.005
 photo-0.005
Token rice
Token ID11464
Feature activation+4.08e-01

Pos logit contributions
rolled+0.005
eg+0.005
-+0.005
by+0.005
ings+0.004
Neg logit contributions
 scrub-0.006
 rinse-0.005
 Olive-0.005
aj-0.005
 swe-0.005
Token in
Token ID287
Feature activation-1.31e-01

Pos logit contributions
eg+0.010
ork+0.010
ay+0.010
by+0.009
rolled+0.009
Neg logit contributions
 scrub-0.012
 Honey-0.012
 Olive-0.012
 rinse-0.012
 Sunny-0.012
Token his
Token ID465
Feature activation-1.20e+00

Pos logit contributions
 scrub+0.004
 Honey+0.004
 Olive+0.004
 rinse+0.004
 Buzz+0.004
Neg logit contributions
eg-0.003
by-0.003
ables-0.003
--0.003
s-0.003
Token small
Token ID1402
Feature activation-2.31e+00

Pos logit contributions
aj+0.031
 rinse+0.028
 pur+0.028
 Honey+0.028
 scrub+0.028
Neg logit contributions
spe-0.037
ay-0.034
ings-0.034
eg-0.034
by-0.033
Token hole
Token ID7604
Feature activation-2.11e+00

Pos logit contributions
 swe+0.061
 Honey+0.060
 scrub+0.060
aj+0.060
 rinse+0.059
Neg logit contributions
eg-0.065
spe-0.061
rolled-0.059
ork-0.059
 Exhibit-0.058
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
 scrub+0.069
 Olive+0.068
 Honey+0.068
aj+0.067
 rinse+0.066
Neg logit contributions
eg-0.045
--0.044
by-0.044
 letter-0.044
 photo-0.043
Token Soon
Token ID15894
Feature activation-1.35e-01

Pos logit contributions
 rinse+0.009
 scrub+0.009
 swollen+0.009
 Olive+0.009
aj+0.009
Neg logit contributions
--0.005
 across-0.004
eg-0.004
ay-0.004
ables-0.004
Token,
Token ID11
Feature activation-2.36e-01

Pos logit contributions
 Honey+0.004
 Olive+0.004
 Buzz+0.004
 Sunny+0.004
 scrub+0.004
Neg logit contributions
--0.003
 letter-0.003
 photo-0.003
eg-0.003
ay-0.003
Token the
Token ID262
Feature activation+6.11e-02

Pos logit contributions
 Honey+0.007
 Olive+0.006
 scrub+0.006
 Buzz+0.006
 rinse+0.006
Neg logit contributions
--0.007
by-0.006
eg-0.006
ings-0.006
ork-0.006
Token little
Token ID1310
Feature activation+4.54e-01

Pos logit contributions
-+0.002
by+0.002
eg+0.002
ings+0.002
ay+0.002
Neg logit contributions
 Honey-0.002
 Olive-0.002
 scrub-0.002
 rinse-0.002
 swollen-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.38e-01

Pos logit contributions
 scrub+0.014
 swollen+0.014
aj+0.014
 rinse+0.013
 iv+0.013
Neg logit contributions
 across-0.005
 very-0.005
s-0.005
ay-0.004
 thing-0.004
TokenTim
Token ID14967
Feature activation-7.83e-01

Pos logit contributions
 scrub+0.004
 swollen+0.004
 rinse+0.004
aj+0.004
 Olive+0.004
Neg logit contributions
--0.003
 across-0.003
 thing-0.003
ay-0.003
ings-0.003
Token and
Token ID290
Feature activation-2.12e-01

Pos logit contributions
 Olive+0.030
 Honey+0.029
 rinse+0.029
 scrub+0.029
 swollen+0.029
Neg logit contributions
--0.014
by-0.013
 letter-0.012
 across-0.012
 thing-0.012
Token Mia
Token ID32189
Feature activation-4.56e-01

Pos logit contributions
 Olive+0.006
 Honey+0.006
 Sunny+0.006
 Buzz+0.006
 rinse+0.006
Neg logit contributions
ings-0.006
--0.006
ork-0.006
ables-0.006
eg-0.005
Token are
Token ID389
Feature activation+5.53e-01

Pos logit contributions
 Olive+0.016
 Honey+0.015
aj+0.015
 scrub+0.015
 rinse+0.015
Neg logit contributions
--0.010
by-0.009
 across-0.009
 letter-0.009
 thing-0.009
Token brother
Token ID3956
Feature activation-1.90e+00

Pos logit contributions
-+0.014
by+0.013
 letter+0.013
 across+0.013
eg+0.013
Neg logit contributions
 Olive-0.017
 Honey-0.016
 scrub-0.016
 rinse-0.016
aj-0.016
Token and
Token ID290
Feature activation-9.02e-01

Pos logit contributions
 rinse+0.083
 scrub+0.081
Sleep+0.080
 Honey+0.079
Fine+0.079
Neg logit contributions
--0.028
 very-0.025
s-0.023
by-0.021
 letter-0.019
Token sister
Token ID6621
Feature activation-2.36e+00

Pos logit contributions
 Olive+0.024
 Honey+0.024
 scrub+0.023
 rinse+0.023
 pur+0.022
Neg logit contributions
--0.026
ings-0.026
ork-0.026
by-0.025
 across-0.025
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 Olive+0.085
 Honey+0.085
 scrub+0.084
 rinse+0.084
 swollen+0.082
Neg logit contributions
--0.046
by-0.043
 very-0.042
 letter-0.041
 across-0.041
Token They
Token ID1119
Feature activation-6.22e-01

Pos logit contributions
 thing+0.002
 very+0.002
 across+0.002
-+0.002
ay+0.002
Neg logit contributions
 swollen-0.005
aj-0.005
 scrub-0.005
 Honey-0.005
Sleep-0.005
Token live
Token ID2107
Feature activation-3.46e-01

Pos logit contributions
 Olive+0.018
 Honey+0.018
aj+0.018
 swollen+0.018
 scrub+0.018
Neg logit contributions
--0.016
by-0.015
 very-0.015
 letter-0.015
 across-0.015
Token the
Token ID262
Feature activation+3.08e-01

Pos logit contributions
 Olive+0.004
 Honey+0.004
 scrub+0.004
 Buzz+0.004
 Sunny+0.004
Neg logit contributions
eg-0.003
--0.003
ables-0.003
ings-0.003
by-0.003
Token yard
Token ID12699
Feature activation-1.51e-01

Pos logit contributions
-+0.008
eg+0.007
ork+0.007
ings+0.007
by+0.007
Neg logit contributions
 Olive-0.009
 Honey-0.009
 scrub-0.009
 rinse-0.009
aj-0.009
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
 scrub+0.005
 Olive+0.005
 Honey+0.005
 pur+0.005
 rinse+0.005
Neg logit contributions
eg-0.003
by-0.003
--0.003
 letter-0.003
ork-0.003
Token He
Token ID679
Feature activation+3.16e-01

Pos logit contributions
 scrub+0.017
 rinse+0.016
 Olive+0.016
 Honey+0.016
aj+0.016
Neg logit contributions
--0.011
 across-0.011
 letter-0.011
by-0.011
eg-0.011
Token had
Token ID550
Feature activation+2.75e-02

Pos logit contributions
-+0.008
by+0.007
 letter+0.007
 across+0.007
eg+0.007
Neg logit contributions
 Olive-0.010
 Honey-0.010
 scrub-0.009
 rinse-0.009
aj-0.009
Token sharp
Token ID7786
Feature activation-1.73e+00

Pos logit contributions
-+0.001
eg+0.001
by+0.001
ings+0.001
ay+0.001
Neg logit contributions
 scrub-0.001
 Olive-0.001
 rinse-0.001
 Honey-0.001
 pur-0.001
Token teeth
Token ID9941
Feature activation-6.88e-01

Pos logit contributions
 scrub+0.052
irms+0.051
aj+0.050
 swollen+0.050
 til+0.049
Neg logit contributions
spe-0.043
 letter-0.043
eg-0.042
 photo-0.042
ay-0.040
Token and
Token ID290
Feature activation-3.38e-01

Pos logit contributions
 scrub+0.022
 Olive+0.022
 Honey+0.022
 rinse+0.022
 iv+0.021
Neg logit contributions
by-0.015
eg-0.014
 letter-0.014
 photo-0.014
--0.014
Token a
Token ID257
Feature activation+9.23e-02

Pos logit contributions
 Olive+0.010
 scrub+0.009
 Honey+0.009
 rinse+0.009
 swollen+0.009
Neg logit contributions
--0.009
by-0.009
eg-0.009
 letter-0.009
ings-0.009
Token loud
Token ID7812
Feature activation+4.14e-01

Pos logit contributions
rolled+0.003
by+0.003
-+0.003
ay+0.003
ables+0.003
Neg logit contributions
 swollen-0.003
 pur-0.002
 scrub-0.002
 sob-0.002
 rinse-0.002
Token grow
Token ID1663
Feature activation-7.48e-01

Pos logit contributions
by+0.012
ables+0.011
eg+0.011
spe+0.011
 letter+0.011
Neg logit contributions
 pur-0.011
 sob-0.011
 Olive-0.011
 scrub-0.011
 til-0.011