TOP ACTIVATIONS
MAX = 7.302

noticed something strange . A gol fer was hitting golf balls
wanted to be a good gol fer like his dad .
The town spe ople had to find a
The new light bul b was much bigger and
asked his friend , a se ag ull named Tim my
lamp . Suddenly , a gen ie appeared ! The gen
it away in a wheel bar row . Once
She blinked her eyes and imm ag ined she could see
Thank you , Mr . Mechan ic !" Tim my said
elephants , lions and ac rob ats . They are amazed
. " Of course ," the lady says .
Suddenly , a gen ie appeared in her bedroom
. You are a successful lear ner and a good daughter
own two hands . She se wed it with bright colors
with him , and he tre asured it always . <|endoftext|>
. I have a new er aser for you . It
it . Suddenly , a gen ie appeared .
hands . Suddenly , a gen ie appeared !
. " Of course , you can . But
always acted silly and mis chie vous . One

MIN ACTIVATIONS
MIN = -6.527

said , holding the magn ifying glass over a black ant
the watch with the magn ifying glass . He saw the
He grabbed the che wy snack he was eating and
cried , dropping the magn ifying glass . "
and scoop ed up the refreshing water into her hands .
The skeleton enjoyed drinking the magical drink all the
ever ! The bee kept drinking it every day , until
When Daisy put the magn ifying glass down , the cater
The kitten kept drinking and drinking until it was
skull and pretended it was drinking . They laughed and had
, handing him the magn ifying glass . She looked around
, and Mia enjoyed ro asting marsh m allows and telling
! It had touched the warm water and was sticking to
out from the shr ub bery . Jack followed
the pond . It was drinking so much that the pond
his trumpet and fed the hungry animals for days and weeks
I wish I picked the warm dress instead ." <|endoftext|> Once
gone . Sammy missed the warm , golden rays and he
While they were ro asting marsh m allows , they
drank the milk . After drinking the milk , Mitt ens

INTERVAL 3.845 - 7.302
CONTAINS 0.102%

there was no more noisy aer opl ane . They all
was not happy . She sc olded Sara and Ben for
day , he saw a poster on the wall of his
frogs and ducks . He hon ked his horn and waved
and said , " Ha ha !" But then

INTERVAL 0.388 - 3.845
CONTAINS 31.936%

dinner . Sam 's mom and dad say yes . They
She asked Mom what was wrong . Mom said Max was
" No , don 't do that !" Jen
us how far we have walked ". The girl got excited
mom and dad and ran to her k ite . She

INTERVAL -3.070 - 0.388
CONTAINS 67.452%

day , they are playing with blocks in Sara 's room
You just need to help each other and it isn 't
with things . This is not a success ." Lily felt
said they loved her too and said goodbye . They hung
rabbit hopped away . Sarah was feeling so excited

INTERVAL -6.527 - -3.070
CONTAINS 0.510%

The boun cy ball was so scared .
when she saw the water come out . The
in . She saw Jack eating the cel ery and asked
desert , Max was very thirsty . He spotted a giant
Maybe the sw an was hungry and in pain . Maybe
Token noticed
Token ID6810
Feature activation-4.53e-01

Pos logit contributions
iren+0.038
rob+0.037
loo+0.035
 robots+0.034
 officer+0.034
Neg logit contributions
opsy-0.042
 gra-0.036
 herbal-0.035
 harvest-0.035
ossom-0.035
Token something
Token ID1223
Feature activation-1.33e+00

Pos logit contributions
opsy+0.016
 mashed+0.014
 cozy+0.013
 nib+0.013
 Baby+0.013
Neg logit contributions
iren-0.020
rob-0.020
 inconvenience-0.019
 poster-0.018
loo-0.018
Token strange
Token ID6283
Feature activation+1.13e+00

Pos logit contributions
opsy+0.061
ossom+0.053
opp+0.051
cious+0.050
ench+0.050
Neg logit contributions
iren-0.049
rob-0.046
loo-0.045
 robots-0.044
 officer-0.043
Token.
Token ID13
Feature activation-9.94e-02

Pos logit contributions
iren+0.044
rob+0.044
loo+0.040
 inconvenience+0.039
 robots+0.038
Neg logit contributions
opsy-0.047
 nib-0.040
cious-0.040
 cozy-0.039
 harvest-0.039
Token A
Token ID317
Feature activation-2.68e-01

Pos logit contributions
opsy+0.004
cious+0.003
 cozy+0.003
 harvest+0.003
 nib+0.003
Neg logit contributions
iren-0.004
rob-0.004
loo-0.004
 robots-0.004
 inconvenience-0.004
Token gol
Token ID43791
Feature activation+6.74e+00

Pos logit contributions
opsy+0.011
cious+0.009
ossom+0.009
apa+0.009
 gra+0.009
Neg logit contributions
iren-0.011
loo-0.010
rob-0.010
 poster-0.010
 robots-0.010
Tokenfer
Token ID2232
Feature activation+2.87e+00

Pos logit contributions
iren+0.275
lights+0.266
loo+0.252
 inconvenience+0.248
irens+0.243
Neg logit contributions
opsy-0.275
cious-0.245
 reel-0.225
 nib-0.222
aults-0.220
Token was
Token ID373
Feature activation+7.22e-01

Pos logit contributions
iren+0.128
rob+0.126
loo+0.117
lights+0.116
 inconvenience+0.115
Neg logit contributions
opsy-0.106
 nib-0.090
cious-0.089
-0.083
 relaxed-0.083
Token hitting
Token ID9008
Feature activation-9.14e-01

Pos logit contributions
rob+0.029
iren+0.029
loo+0.028
 inconvenience+0.026
 robots+0.025
Neg logit contributions
opsy-0.029
 cozy-0.024
 nib-0.024
 mashed-0.024
cious-0.024
Token golf
Token ID13126
Feature activation+1.29e+00

Pos logit contributions
opsy+0.034
 nib+0.029
 cozy+0.029
 reel+0.029
cious+0.029
Neg logit contributions
rob-0.038
iren-0.038
loo-0.036
 inconvenience-0.035
 poster-0.033
Token balls
Token ID11333
Feature activation-9.92e-01

Pos logit contributions
iren+0.055
rob+0.053
loo+0.050
 inconvenience+0.049
irens+0.049
Neg logit contributions
opsy-0.048
 nib-0.043
-0.040
cious-0.040
 reel-0.040
Token wanted
Token ID2227
Feature activation-1.17e+00

Pos logit contributions
iren+0.031
rob+0.029
loo+0.028
irens+0.028
 inconvenience+0.028
Neg logit contributions
opsy-0.021
 nib-0.017
 relaxed-0.017
 mashed-0.016
 cozy-0.016
Token to
Token ID284
Feature activation-5.62e-01

Pos logit contributions
opsy+0.039
 mashed+0.037
 cozy+0.037
 nib+0.037
 pick+0.036
Neg logit contributions
rob-0.053
iren-0.050
loo-0.048
 inconvenience-0.046
mith-0.044
Token be
Token ID307
Feature activation+1.94e+00

Pos logit contributions
opsy+0.016
 harvest+0.013
 nib+0.012
 cozy+0.012
cious+0.012
Neg logit contributions
iren-0.030
rob-0.030
loo-0.028
irens-0.027
 robots-0.027
Token a
Token ID257
Feature activation+7.48e-01

Pos logit contributions
iren+0.086
rob+0.085
loo+0.080
 inconvenience+0.076
irens+0.076
Neg logit contributions
opsy-0.074
cious-0.060
 cozy-0.060
 harvest-0.059
 nib-0.058
Token good
Token ID922
Feature activation-1.22e+00

Pos logit contributions
iren+0.026
rob+0.025
loo+0.024
 robots+0.023
 poster+0.023
Neg logit contributions
opsy-0.035
cious-0.030
ossom-0.029
apa-0.029
ench-0.029
Token gol
Token ID43791
Feature activation+6.45e+00

Pos logit contributions
opsy+0.056
cious+0.046
 harvest+0.046
 cozy+0.046
ossom+0.045
Neg logit contributions
iren-0.045
rob-0.044
loo-0.040
irens-0.039
 robots-0.038
Tokenfer
Token ID2232
Feature activation+2.64e+00

Pos logit contributions
loo+0.252
iren+0.251
lights+0.244
 inconvenience+0.234
 doors+0.223
Neg logit contributions
opsy-0.257
cious-0.232
aults-0.226
 reel-0.213
 harvest-0.207
Token like
Token ID588
Feature activation-1.65e+00

Pos logit contributions
iren+0.123
loo+0.114
rob+0.114
irens+0.110
 inconvenience+0.110
Neg logit contributions
opsy-0.092
 nib-0.081
cious-0.078
 relaxed-0.076
 harvest-0.076
Token his
Token ID465
Feature activation+3.63e-01

Pos logit contributions
opsy+0.052
 nib+0.051
 mashed+0.049
 Baby+0.048
 pet+0.048
Neg logit contributions
rob-0.074
iren-0.071
loo-0.069
 inconvenience-0.065
lights-0.064
Token dad
Token ID9955
Feature activation+3.04e-01

Pos logit contributions
iren+0.017
rob+0.017
loo+0.016
irens+0.015
 inconvenience+0.015
Neg logit contributions
opsy-0.013
 cozy-0.010
 harvest-0.010
cious-0.010
 relaxed-0.010
Token.
Token ID13
Feature activation+7.54e-01

Pos logit contributions
iren+0.011
rob+0.011
 robots+0.010
loo+0.010
 officer+0.010
Neg logit contributions
opsy-0.014
cious-0.011
ossom-0.011
 herbal-0.011
 harvest-0.011
Token
Token ID220
Feature activation-4.65e-01

Pos logit contributions
iren+0.021
rob+0.018
loo+0.016
irens+0.015
 robots+0.014
Neg logit contributions
opsy-0.073
cious-0.065
ossom-0.064
 cozy-0.064
 harvest-0.063
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
opsy+0.018
cious+0.014
ossom+0.014
apa+0.014
 cozy+0.014
Neg logit contributions
rob-0.020
iren-0.020
loo-0.019
 robots-0.018
 inconvenience-0.018
Token
Token ID198
Feature activation-9.98e-02

Pos logit contributions
rob+0.006
iren+0.005
loo+0.005
 inconvenience+0.005
 robots+0.005
Neg logit contributions
opsy-0.004
cious-0.003
 cozy-0.003
apa-0.003
 nib-0.003
TokenThe
Token ID464
Feature activation+1.34e+00

Pos logit contributions
opsy+0.004
cious+0.003
ossom+0.003
 cozy+0.003
 harvest+0.003
Neg logit contributions
iren-0.004
rob-0.004
loo-0.004
 robots-0.004
 inconvenience-0.004
Token town
Token ID3240
Feature activation+2.39e+00

Pos logit contributions
iren+0.057
rob+0.057
loo+0.051
irens+0.051
 robots+0.049
Neg logit contributions
opsy-0.055
 cozy-0.045
 harvest-0.044
cious-0.044
 herbal-0.043
Tokenspe
Token ID4125
Feature activation+6.08e+00

Pos logit contributions
rob+0.117
iren+0.110
irens+0.098
loo+0.098
 inconvenience+0.098
Neg logit contributions
opsy-0.079
 relaxed-0.069
cious-0.066
 harvest-0.066
-0.064
Tokenople
Token ID643
Feature activation+1.49e+00

Pos logit contributions
iren+0.193
loo+0.178
 outage+0.161
rob+0.159
arl+0.156
Neg logit contributions
opsy-0.263
cious-0.256
-0.254
aults-0.250
ench-0.248
Token had
Token ID550
Feature activation+1.11e+00

Pos logit contributions
iren+0.076
rob+0.076
loo+0.072
irens+0.069
 robots+0.068
Neg logit contributions
opsy-0.044
 relaxed-0.036
cious-0.035
 harvest-0.035
 nib-0.034
Token to
Token ID284
Feature activation+2.36e-01

Pos logit contributions
rob+0.053
iren+0.050
loo+0.049
 poster+0.045
lights+0.043
Neg logit contributions
opsy-0.036
 cozy-0.034
 relaxed-0.033
 harvest-0.033
cious-0.032
Token find
Token ID1064
Feature activation+7.25e-01

Pos logit contributions
rob+0.013
iren+0.012
loo+0.012
irens+0.011
 robots+0.011
Neg logit contributions
opsy-0.007
 harvest-0.006
 gra-0.005
 cozy-0.005
 drink-0.005
Token a
Token ID257
Feature activation-3.19e-01

Pos logit contributions
rob+0.033
iren+0.032
loo+0.029
 inconvenience+0.028
 poster+0.028
Neg logit contributions
opsy-0.025
 cozy-0.023
 harvest-0.023
apa-0.022
 Baby-0.022
Token
Token ID198
Feature activation-5.93e-01

Pos logit contributions
iren+0.000
rob+0.000
loo+0.000
 robots+0.000
irens+0.000
Neg logit contributions
opsy-0.000
cious-0.000
 harvest-0.000
ossom-0.000
 cozy-0.000
Token
Token ID198
Feature activation-7.09e-01

Pos logit contributions
opsy+0.027
cious+0.022
 harvest+0.022
ossom+0.021
 herbal+0.021
Neg logit contributions
iren-0.023
rob-0.022
loo-0.020
irens-0.019
 robots-0.019
TokenThe
Token ID464
Feature activation+1.16e+00

Pos logit contributions
opsy+0.033
cious+0.028
 harvest+0.027
 cozy+0.027
ossom+0.027
Neg logit contributions
iren-0.026
rob-0.025
loo-0.023
 robots-0.022
irens-0.022
Token new
Token ID649
Feature activation+4.75e-01

Pos logit contributions
iren+0.043
rob+0.043
loo+0.039
irens+0.037
 robots+0.037
Neg logit contributions
opsy-0.053
 cozy-0.044
 harvest-0.044
cious-0.044
 herbal-0.043
Token light
Token ID1657
Feature activation+3.56e-01

Pos logit contributions
rob+0.021
iren+0.021
loo+0.019
irens+0.019
 inconvenience+0.018
Neg logit contributions
opsy-0.018
 cozy-0.015
 harvest-0.015
 cos-0.015
 herbal-0.015
Tokenbul
Token ID15065
Feature activation+6.03e+00

Pos logit contributions
rob+0.016
iren+0.014
 inconvenience+0.014
 yourselves+0.014
irens+0.014
Neg logit contributions
opsy-0.013
 cozy-0.012
 relaxed-0.011
 cos-0.011
 harvest-0.011
Tokenb
Token ID65
Feature activation+1.28e+00

Pos logit contributions
 inconvenience+0.303
 principal+0.282
 yourselves+0.280
 robots+0.276
rob+0.271
Neg logit contributions
-0.152
opsy-0.152
ossom-0.135
 relaxed-0.135
sy-0.128
Token was
Token ID373
Feature activation+8.12e-02

Pos logit contributions
rob+0.059
 inconvenience+0.054
iren+0.051
 officer+0.048
reating+0.047
Neg logit contributions
opsy-0.049
apa-0.043
ossom-0.042
opp-0.040
cious-0.040
Token much
Token ID881
Feature activation-1.04e+00

Pos logit contributions
iren+0.004
rob+0.003
loo+0.003
 robots+0.003
irens+0.003
Neg logit contributions
opsy-0.003
cious-0.003
 harvest-0.003
 cozy-0.003
ossom-0.003
Token bigger
Token ID5749
Feature activation+2.71e-01

Pos logit contributions
opsy+0.049
cious+0.042
ossom+0.039
 gra+0.039
opp+0.038
Neg logit contributions
iren-0.039
rob-0.036
loo-0.035
 robots-0.034
 officer-0.033
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
iren+0.011
rob+0.011
loo+0.010
 inconvenience+0.010
irens+0.009
Neg logit contributions
opsy-0.011
cious-0.010
 nib-0.010
 cozy-0.010
 harvest-0.010
Token asked
Token ID1965
Feature activation-4.54e-01

Pos logit contributions
iren+0.018
rob+0.017
loo+0.016
 robots+0.016
 officer+0.016
Neg logit contributions
opsy-0.020
cious-0.016
ossom-0.016
 harvest-0.016
 herbal-0.016
Token his
Token ID465
Feature activation+4.50e-01

Pos logit contributions
opsy+0.022
cious+0.018
 harvest+0.018
 cozy+0.018
 nib+0.018
Neg logit contributions
iren-0.016
rob-0.015
loo-0.014
 robots-0.013
 inconvenience-0.013
Token friend
Token ID1545
Feature activation+6.52e-01

Pos logit contributions
iren+0.014
rob+0.013
loo+0.013
 robots+0.012
 officer+0.012
Neg logit contributions
opsy-0.023
cious-0.020
 gra-0.020
ossom-0.019
 harvest-0.019
Token,
Token ID11
Feature activation+7.90e-01

Pos logit contributions
iren+0.022
rob+0.021
loo+0.020
 robots+0.020
 officer+0.019
Neg logit contributions
opsy-0.031
cious-0.027
ossom-0.026
 harvest-0.026
 cozy-0.026
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
iren+0.023
rob+0.022
 robots+0.021
 officer+0.021
?"+0.021
Neg logit contributions
opsy-0.040
ossom-0.036
cious-0.035
 harvest-0.035
ench-0.034
Token se
Token ID384
Feature activation+5.93e+00

Pos logit contributions
iren+0.042
rob+0.039
loo+0.039
 poster+0.038
 officer+0.038
Neg logit contributions
opsy-0.041
cious-0.034
ossom-0.034
ench-0.033
 harvest-0.033
Tokenag
Token ID363
Feature activation+1.71e+00

Pos logit contributions
iren+0.207
rob+0.206
 robots+0.204
 yourselves+0.196
irens+0.196
Neg logit contributions
opsy-0.266
ossom-0.226
ench-0.220
cious-0.218
 nib-0.217
Tokenull
Token ID724
Feature activation+6.04e-01

Pos logit contributions
rob+0.070
iren+0.069
 inconvenience+0.063
loo+0.062
 robots+0.062
Neg logit contributions
opsy-0.074
cious-0.058
 nib-0.058
ossom-0.057
 harvest-0.057
Token named
Token ID3706
Feature activation+8.10e-01

Pos logit contributions
iren+0.022
rob+0.021
loo+0.020
 robots+0.019
irens+0.019
Neg logit contributions
opsy-0.028
cious-0.023
 harvest-0.023
 cozy-0.023
ossom-0.023
Token Tim
Token ID5045
Feature activation+5.45e-01

Pos logit contributions
iren+0.039
rob+0.038
loo+0.037
irens+0.035
 inconvenience+0.034
Neg logit contributions
opsy-0.028
cious-0.021
 cozy-0.021
 nib-0.021
 gra-0.021
Tokenmy
Token ID1820
Feature activation+1.66e-01

Pos logit contributions
iren+0.018
rob+0.017
loo+0.016
 robots+0.015
irens+0.015
Neg logit contributions
opsy-0.027
cious-0.023
 harvest-0.023
 cozy-0.023
 nib-0.023
Token lamp
Token ID20450
Feature activation+5.08e-01

Pos logit contributions
opsy+0.020
 cozy+0.019
 cos+0.018
 herbal+0.017
 gra+0.017
Neg logit contributions
rob-0.017
iren-0.016
loo-0.014
irens-0.014
 inconvenience-0.014
Token.
Token ID13
Feature activation-4.72e-01

Pos logit contributions
iren+0.016
 robots+0.015
rob+0.015
lights+0.015
er+0.014
Neg logit contributions
opsy-0.026
 Ginny-0.021
cious-0.021
 gra-0.020
 Baby-0.020
Token Suddenly
Token ID24975
Feature activation-2.03e-01

Pos logit contributions
opsy+0.022
cious+0.017
ossom+0.017
opp+0.017
 herbal+0.017
Neg logit contributions
iren-0.018
 robots-0.017
rob-0.016
loo-0.016
 inconvenience-0.016
Token,
Token ID11
Feature activation-3.07e-02

Pos logit contributions
opsy+0.008
cious+0.007
 cozy+0.007
 harvest+0.007
 nib+0.006
Neg logit contributions
iren-0.008
rob-0.008
loo-0.008
 robots-0.007
irens-0.007
Token a
Token ID257
Feature activation+6.70e-01

Pos logit contributions
opsy+0.001
 cozy+0.001
 harvest+0.001
 Baby+0.001
cious+0.001
Neg logit contributions
iren-0.001
rob-0.001
loo-0.001
irens-0.001
 inconvenience-0.001
Token gen
Token ID2429
Feature activation+5.87e+00

Pos logit contributions
iren+0.028
rob+0.027
loo+0.026
 robots+0.025
 poster+0.025
Neg logit contributions
opsy-0.027
cious-0.022
ossom-0.022
 harvest-0.022
 herbal-0.021
Tokenie
Token ID494
Feature activation+4.94e-01

Pos logit contributions
iren+0.253
 robots+0.248
rob+0.240
 yourselves+0.237
 inconvenience+0.236
Neg logit contributions
opsy-0.208
cious-0.161
 nib-0.158
 relaxed-0.155
 harvest-0.153
Token appeared
Token ID4120
Feature activation+9.07e-01

Pos logit contributions
iren+0.018
rob+0.018
loo+0.017
 inconvenience+0.016
irens+0.016
Neg logit contributions
opsy-0.022
cious-0.019
 nib-0.019
 harvest-0.018
 gra-0.018
Token!
Token ID0
Feature activation-5.52e-01

Pos logit contributions
iren+0.034
rob+0.033
loo+0.030
 inconvenience+0.029
 robots+0.029
Neg logit contributions
opsy-0.041
cious-0.034
 harvest-0.034
 cozy-0.034
 nib-0.033
Token The
Token ID383
Feature activation+7.83e-01

Pos logit contributions
opsy+0.018
 cozy+0.015
 nib+0.015
 harvest+0.015
cious+0.015
Neg logit contributions
rob-0.026
iren-0.025
loo-0.024
 inconvenience-0.023
irens-0.022
Token gen
Token ID2429
Feature activation+4.06e+00

Pos logit contributions
rob+0.031
iren+0.029
irens+0.027
lights+0.025
reating+0.025
Neg logit contributions
 cozy-0.031
opsy-0.030
 warm-0.029
 cos-0.028
 nib-0.028
Token it
Token ID340
Feature activation-9.10e-01

Pos logit contributions
opsy+0.036
cious+0.030
ossom+0.030
 herbal+0.029
 harvest+0.029
Neg logit contributions
iren-0.024
rob-0.023
loo-0.021
 robots-0.021
irens-0.020
Token away
Token ID1497
Feature activation+1.80e+00

Pos logit contributions
opsy+0.043
cious+0.036
 herbal+0.035
 harvest+0.034
 gra+0.034
Neg logit contributions
iren-0.032
rob-0.030
loo-0.030
 robots-0.028
irens-0.027
Token in
Token ID287
Feature activation+8.20e-01

Pos logit contributions
iren+0.055
rob+0.052
loo+0.050
 robots+0.049
er+0.046
Neg logit contributions
opsy-0.093
cious-0.078
 gra-0.077
 herbal-0.077
ossom-0.076
Token a
Token ID257
Feature activation+9.00e-01

Pos logit contributions
iren+0.030
rob+0.029
loo+0.027
 robots+0.026
irens+0.025
Neg logit contributions
opsy-0.038
cious-0.032
 harvest-0.031
ossom-0.031
 cozy-0.031
Token wheel
Token ID7825
Feature activation+2.93e+00

Pos logit contributions
iren+0.033
loo+0.032
 poster+0.032
?"+0.029
 inconvenience+0.029
Neg logit contributions
opsy-0.039
cious-0.033
 Ginny-0.030
ossom-0.030
apa-0.030
Tokenbar
Token ID5657
Feature activation+5.86e+00

Pos logit contributions
rob+0.128
iren+0.125
 inconvenience+0.115
irens+0.114
loo+0.114
Neg logit contributions
opsy-0.106
 nib-0.090
 harvest-0.088
cious-0.086
ench-0.086
Tokenrow
Token ID808
Feature activation+2.71e-01

Pos logit contributions
rob+0.250
 poster+0.246
iren+0.243
 inconvenience+0.241
 officer+0.230
Neg logit contributions
opsy-0.229
ench-0.188
apa-0.182
 harvest-0.181
ossom-0.180
Token.
Token ID13
Feature activation-2.63e-03

Pos logit contributions
rob+0.011
iren+0.011
 poster+0.010
irens+0.010
loo+0.010
Neg logit contributions
opsy-0.011
cious-0.009
 harvest-0.009
 nib-0.009
 cozy-0.009
Token
Token ID198
Feature activation-4.87e-01

Pos logit contributions
opsy+0.000
cious+0.000
ossom+0.000
 harvest+0.000
 herbal+0.000
Neg logit contributions
iren-0.000
rob-0.000
loo-0.000
 robots-0.000
irens-0.000
Token
Token ID198
Feature activation-6.30e-01

Pos logit contributions
opsy+0.021
cious+0.017
 harvest+0.017
ossom+0.017
 herbal+0.017
Neg logit contributions
iren-0.019
rob-0.019
loo-0.018
 robots-0.017
irens-0.017
TokenOnce
Token ID7454
Feature activation-7.20e-01

Pos logit contributions
opsy+0.028
cious+0.023
ossom+0.022
 herbal+0.022
 harvest+0.022
Neg logit contributions
iren-0.025
rob-0.023
loo-0.022
 robots-0.021
irens-0.021
Token She
Token ID1375
Feature activation-4.29e-01

Pos logit contributions
iren+0.002
?"+0.002
 robots+0.002
loo+0.002
rob+0.002
Neg logit contributions
opsy-0.004
aults-0.004
cious-0.004
 harvest-0.004
 mental-0.004
Token blinked
Token ID40360
Feature activation+9.09e-01

Pos logit contributions
opsy+0.018
cious+0.014
 harvest+0.014
ossom+0.014
 herbal+0.014
Neg logit contributions
iren-0.017
rob-0.017
loo-0.016
 robots-0.016
 poster-0.015
Token her
Token ID607
Feature activation-8.30e-01

Pos logit contributions
iren+0.031
rob+0.030
 robots+0.028
loo+0.028
irens+0.027
Neg logit contributions
opsy-0.043
cious-0.036
 harvest-0.035
 gra-0.035
 herbal-0.035
Token eyes
Token ID2951
Feature activation-1.90e+00

Pos logit contributions
opsy+0.040
cious+0.034
 harvest+0.033
ossom+0.033
 cozy+0.033
Neg logit contributions
iren-0.028
rob-0.027
loo-0.025
 robots-0.025
irens-0.024
Token and
Token ID290
Feature activation+6.32e-01

Pos logit contributions
opsy+0.101
cious+0.082
 gra+0.081
apa+0.081
 herbal+0.081
Neg logit contributions
iren-0.058
er-0.051
rob-0.050
 robots-0.048
irens-0.047
Token imm
Token ID2296
Feature activation+5.82e+00

Pos logit contributions
iren+0.024
rob+0.024
 robots+0.022
 poster+0.022
loo+0.022
Neg logit contributions
opsy-0.027
 herbal-0.021
ench-0.021
cious-0.021
 harvest-0.021
Tokenag
Token ID363
Feature activation+1.04e+00

Pos logit contributions
rob+0.237
iren+0.226
loo+0.205
 inconvenience+0.204
 poster+0.202
Neg logit contributions
opsy-0.263
ench-0.217
ossom-0.206
 harvest-0.204
 relaxed-0.197
Tokenined
Token ID1389
Feature activation+2.45e-01

Pos logit contributions
iren+0.042
rob+0.042
loo+0.038
 inconvenience+0.038
 poster+0.037
Neg logit contributions
opsy-0.044
cious-0.036
ossom-0.035
 cozy-0.035
 harvest-0.035
Token she
Token ID673
Feature activation-6.00e-01

Pos logit contributions
iren+0.007
 robots+0.007
er+0.007
rob+0.006
lights+0.006
Neg logit contributions
opsy-0.013
cious-0.011
 harvest-0.010
apa-0.010
ench-0.010
Token could
Token ID714
Feature activation-6.32e-01

Pos logit contributions
opsy+0.024
cious+0.019
 harvest+0.019
 herbal+0.019
ossom+0.019
Neg logit contributions
iren-0.025
rob-0.025
loo-0.023
 robots-0.023
irens-0.022
Token see
Token ID766
Feature activation+2.85e-01

Pos logit contributions
opsy+0.020
cious+0.015
 harvest+0.015
 cozy+0.015
ossom+0.015
Neg logit contributions
iren-0.032
rob-0.032
loo-0.030
 robots-0.029
irens-0.029
TokenThank
Token ID10449
Feature activation-7.91e-01

Pos logit contributions
iren+0.016
rob+0.016
loo+0.015
 robots+0.014
irens+0.014
Neg logit contributions
opsy-0.017
cious-0.014
 cozy-0.014
 harvest-0.014
 nib-0.014
Token you
Token ID345
Feature activation-2.25e+00

Pos logit contributions
opsy+0.032
+0.029
 Baby+0.027
 Ginny+0.027
 nib+0.027
Neg logit contributions
lights-0.031
iren-0.030
loo-0.028
 outage-0.028
rob-0.027
Token,
Token ID11
Feature activation+4.13e-01

Pos logit contributions
opsy+0.084
 Baby+0.075
 cozy+0.073
apa+0.071
 cos+0.069
Neg logit contributions
rob-0.103
iren-0.100
irens-0.090
 poster-0.088
loo-0.087
Token Mr
Token ID1770
Feature activation-2.23e+00

Pos logit contributions
rob+0.021
iren+0.020
irens+0.018
loo+0.018
ers+0.017
Neg logit contributions
opsy-0.013
 cozy-0.013
 Baby-0.012
 cos-0.012
 ju-0.011
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
opsy+0.086
opp+0.070
 cozy+0.070
 nib+0.067
 cos+0.067
Neg logit contributions
iren-0.098
rob-0.092
irens-0.090
 inconvenience-0.085
lights-0.084
Token Mechan
Token ID13438
Feature activation+5.81e+00

Pos logit contributions
opsy+0.009
 Baby+0.008
opp+0.008
+0.008
 cozy+0.008
Neg logit contributions
rob-0.007
iren-0.007
lights-0.007
irens-0.006
 inconvenience-0.006
Tokenic
Token ID291
Feature activation+2.32e+00

Pos logit contributions
 inconvenience+0.201
iren+0.201
 robots+0.200
 hits+0.190
 poster+0.189
Neg logit contributions
opsy-0.236
ench-0.210
opp-0.202
ossom-0.202
 mental-0.200
Token!"
Token ID2474
Feature activation+3.05e-01

Pos logit contributions
lights+0.082
iren+0.080
irens+0.079
 robots+0.078
 English+0.076
Neg logit contributions
opsy-0.089
apa-0.081
cious-0.080
-0.079
sy-0.077
Token Tim
Token ID5045
Feature activation+4.95e-01

Pos logit contributions
iren+0.011
rob+0.011
loo+0.010
 robots+0.009
 inconvenience+0.009
Neg logit contributions
opsy-0.014
cious-0.012
 cozy-0.012
 nib-0.012
 harvest-0.012
Tokenmy
Token ID1820
Feature activation+1.57e-01

Pos logit contributions
iren+0.020
rob+0.020
loo+0.019
 inconvenience+0.019
irens+0.018
Neg logit contributions
opsy-0.020
 cozy-0.017
 nib-0.017
 relaxed-0.017
 cos-0.016
Token said
Token ID531
Feature activation+2.88e-01

Pos logit contributions
iren+0.006
rob+0.006
loo+0.006
 inconvenience+0.006
irens+0.005
Neg logit contributions
opsy-0.007
 cozy-0.006
cious-0.006
 nib-0.006
 relaxed-0.005
Token elephants
Token ID25368
Feature activation+1.02e+00

Pos logit contributions
iren+0.034
rob+0.033
loo+0.031
 inconvenience+0.029
 robots+0.029
Neg logit contributions
opsy-0.039
cious-0.033
 cozy-0.032
 harvest-0.032
 nib-0.032
Token,
Token ID11
Feature activation+4.99e-01

Pos logit contributions
iren+0.034
rob+0.033
loo+0.030
 robots+0.030
irens+0.029
Neg logit contributions
opsy-0.052
ossom-0.044
ench-0.043
 herbal-0.043
 cozy-0.042
Token lions
Token ID31420
Feature activation+7.05e-01

Pos logit contributions
iren+0.021
rob+0.020
loo+0.020
 inconvenience+0.019
irens+0.018
Neg logit contributions
opsy-0.019
cious-0.017
 nib-0.017
 cozy-0.017
 harvest-0.016
Token and
Token ID290
Feature activation-2.15e-02

Pos logit contributions
iren+0.023
rob+0.022
 robots+0.020
loo+0.020
irens+0.019
Neg logit contributions
opsy-0.036
ossom-0.031
ench-0.030
 herbal-0.030
 cozy-0.030
Token ac
Token ID936
Feature activation+1.97e+00

Pos logit contributions
opsy+0.001
cious+0.001
 cozy+0.001
 nib+0.001
 relaxed+0.001
Neg logit contributions
iren-0.001
loo-0.001
rob-0.001
mith-0.001
irens-0.001
Tokenrob
Token ID22609
Feature activation+5.81e+00

Pos logit contributions
iren+0.009
rob+0.005
loo+0.003
 inconvenience+0.001
 robots+0.001
Neg logit contributions
opsy-0.151
cious-0.137
 harvest-0.135
 nib-0.135
 cozy-0.134
Tokenats
Token ID1381
Feature activation-2.24e+00

Pos logit contributions
iren+0.239
rob+0.231
loo+0.221
 robots+0.213
 officer+0.210
Neg logit contributions
opsy-0.238
cious-0.197
ossom-0.189
 harvest-0.189
 nib-0.186
Token.
Token ID13
Feature activation+8.42e-02

Pos logit contributions
opsy+0.101
cious+0.086
 harvest+0.084
 nib+0.084
 gra+0.084
Neg logit contributions
iren-0.081
rob-0.080
loo-0.074
 robots-0.070
 inconvenience-0.070
Token They
Token ID1119
Feature activation-1.41e+00

Pos logit contributions
rob+0.004
iren+0.004
loo+0.004
 inconvenience+0.004
 poster+0.003
Neg logit contributions
opsy-0.003
cious-0.002
 cozy-0.002
 nib-0.002
 gra-0.002
Token are
Token ID389
Feature activation+1.58e-01

Pos logit contributions
opsy+0.052
cious+0.040
ossom+0.040
 herbal+0.040
ench+0.039
Neg logit contributions
iren-0.065
rob-0.063
 robots-0.059
loo-0.059
 officer-0.059
Token amazed
Token ID24562
Feature activation+5.16e-01

Pos logit contributions
iren+0.006
 robots+0.006
rob+0.006
loo+0.006
irens+0.006
Neg logit contributions
opsy-0.007
ossom-0.005
ench-0.005
cious-0.005
 harvest-0.005
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
iren+0.079
rob+0.077
loo+0.073
irens+0.069
 poster+0.068
Neg logit contributions
opsy-0.076
apa-0.062
cious-0.061
 Ginny-0.061
 harvest-0.061
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
rob+0.029
iren+0.027
loo+0.026
 inconvenience+0.025
 robots+0.025
Neg logit contributions
opsy-0.022
cious-0.018
 Baby-0.018
apa-0.018
 cozy-0.018
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
opsy+0.004
apa+0.003
cious+0.003
 cozy+0.003
opp+0.003
Neg logit contributions
rob-0.005
iren-0.005
loo-0.005
 inconvenience-0.005
 robots-0.005
Token"
Token ID1
Feature activation+9.21e-01

Pos logit contributions
opsy+0.010
cious+0.008
ossom+0.008
 cozy+0.008
apa+0.008
Neg logit contributions
rob-0.007
iren-0.007
loo-0.007
 robots-0.006
 inconvenience-0.006
TokenOf
Token ID5189
Feature activation+1.27e+00

Pos logit contributions
iren+0.037
rob+0.037
loo+0.037
 robots+0.035
 inconvenience+0.034
Neg logit contributions
opsy-0.039
cious-0.032
 herbal-0.030
apa-0.030
 Ginny-0.030
Token course
Token ID1781
Feature activation+5.79e+00

Pos logit contributions
 robots+0.037
iren+0.037
lights+0.035
loo+0.034
driver+0.032
Neg logit contributions
opsy-0.067
-0.061
 Baby-0.060
 Ginny-0.060
apa-0.059
Token,"
Token ID553
Feature activation+1.58e+00

Pos logit contributions
rob+0.291
iren+0.287
loo+0.261
irens+0.260
 robots+0.249
Neg logit contributions
opsy-0.192
-0.175
 Baby-0.167
cious-0.166
 nib-0.164
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
iren+0.068
rob+0.067
loo+0.067
 inconvenience+0.060
irens+0.059
Neg logit contributions
opsy-0.064
 Baby-0.054
 Ginny-0.053
 nib-0.053
apa-0.052
Token lady
Token ID10846
Feature activation+1.96e+00

Pos logit contributions
rob+0.088
iren+0.084
loo+0.077
irens+0.076
lights+0.073
Neg logit contributions
opsy-0.092
 cozy-0.074
cious-0.073
 herbal-0.072
 harvest-0.071
Token says
Token ID1139
Feature activation+1.43e+00

Pos logit contributions
rob+0.069
iren+0.068
loo+0.065
 inconvenience+0.062
 robots+0.061
Neg logit contributions
opsy-0.091
 nib-0.079
apa-0.076
cious-0.076
 relaxed-0.075
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
iren+0.052
rob+0.052
loo+0.048
 robots+0.045
irens+0.045
Neg logit contributions
opsy-0.066
cious-0.055
 cozy-0.054
 harvest-0.054
ossom-0.054
Token
Token ID220
Feature activation-3.31e-01

Pos logit contributions
opsy+0.025
cious+0.020
 cozy+0.020
ossom+0.019
apa+0.019
Neg logit contributions
rob-0.031
iren-0.030
loo-0.029
 robots-0.028
 inconvenience-0.027
Token
Token ID198
Feature activation-7.31e-01

Pos logit contributions
opsy+0.013
cious+0.010
 cozy+0.010
ossom+0.010
 gra+0.010
Neg logit contributions
rob-0.014
iren-0.014
loo-0.013
 robots-0.013
 inconvenience-0.013
TokenSuddenly
Token ID38582
Feature activation-1.76e-01

Pos logit contributions
opsy+0.033
 harvest+0.028
 gra+0.028
 nib+0.027
 ju+0.027
Neg logit contributions
iren-0.027
rob-0.024
irens-0.024
loo-0.023
 robots-0.023
Token,
Token ID11
Feature activation-3.74e-01

Pos logit contributions
opsy+0.008
cious+0.006
 cozy+0.006
 harvest+0.006
ossom+0.006
Neg logit contributions
iren-0.007
rob-0.007
loo-0.006
 robots-0.006
 inconvenience-0.006
Token a
Token ID257
Feature activation+5.08e-01

Pos logit contributions
opsy+0.015
 cozy+0.012
cious+0.012
 harvest+0.012
 nib+0.012
Neg logit contributions
iren-0.016
rob-0.016
loo-0.014
 robots-0.014
irens-0.014
Token gen
Token ID2429
Feature activation+5.78e+00

Pos logit contributions
iren+0.022
rob+0.022
loo+0.020
 robots+0.020
irens+0.019
Neg logit contributions
opsy-0.020
cious-0.016
 harvest-0.015
 cozy-0.015
ossom-0.015
Tokenie
Token ID494
Feature activation+3.60e-01

Pos logit contributions
iren+0.250
 robots+0.249
rob+0.240
 inconvenience+0.237
 yourselves+0.237
Neg logit contributions
opsy-0.207
 nib-0.159
cious-0.153
 relaxed-0.151
 gra-0.151
Token appeared
Token ID4120
Feature activation+8.79e-01

Pos logit contributions
rob+0.014
iren+0.014
loo+0.013
 inconvenience+0.013
 yourselves+0.013
Neg logit contributions
opsy-0.015
 nib-0.013
cious-0.013
 gra-0.012
 harvest-0.012
Token in
Token ID287
Feature activation+6.90e-01

Pos logit contributions
iren+0.033
rob+0.033
loo+0.031
 inconvenience+0.029
 robots+0.029
Neg logit contributions
opsy-0.039
cious-0.032
 cozy-0.032
 harvest-0.032
 nib-0.032
Token her
Token ID607
Feature activation+5.85e-01

Pos logit contributions
rob+0.031
iren+0.030
loo+0.029
 inconvenience+0.028
 robots+0.026
Neg logit contributions
opsy-0.025
 nib-0.022
 harvest-0.021
 cozy-0.021
 Baby-0.021
Token bedroom
Token ID14043
Feature activation+2.10e-01

Pos logit contributions
iren+0.026
rob+0.026
 inconvenience+0.025
irens+0.024
loo+0.023
Neg logit contributions
opsy-0.022
 cozy-0.019
cious-0.018
 cos-0.018
 gra-0.018
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
iren+0.033
rob+0.032
irens+0.030
 inconvenience+0.029
loo+0.029
Neg logit contributions
opsy-0.034
 cozy-0.031
 relaxed-0.029
 cos-0.028
 nib-0.028
Token You
Token ID921
Feature activation+7.07e-01

Pos logit contributions
iren+0.041
rob+0.040
loo+0.036
 robots+0.034
irens+0.034
Neg logit contributions
opsy-0.067
cious-0.057
 cozy-0.057
 harvest-0.056
 herbal-0.056
Token are
Token ID389
Feature activation+1.15e+00

Pos logit contributions
iren+0.027
rob+0.026
loo+0.025
 robots+0.024
 inconvenience+0.023
Neg logit contributions
opsy-0.031
cious-0.026
 harvest-0.025
ossom-0.025
 herbal-0.025
Token a
Token ID257
Feature activation+1.66e+00

Pos logit contributions
iren+0.037
rob+0.036
loo+0.033
 robots+0.031
irens+0.030
Neg logit contributions
opsy-0.058
cious-0.050
 harvest-0.049
 cozy-0.049
ossom-0.049
Token successful
Token ID4388
Feature activation+1.43e+00

Pos logit contributions
iren+0.063
 poster+0.061
 robots+0.061
?"+0.061
rob+0.061
Neg logit contributions
opsy-0.067
cious-0.058
 gra-0.056
apa-0.056
ossom-0.056
Token lear
Token ID22454
Feature activation+5.74e+00

Pos logit contributions
rob+0.069
iren+0.067
irens+0.062
loo+0.058
mith+0.056
Neg logit contributions
opsy-0.054
 harvest-0.044
 drink-0.043
 cozy-0.043
 pet-0.042
Tokenner
Token ID1008
Feature activation+2.70e+00

Pos logit contributions
lights+0.216
loo+0.216
iren+0.212
rob+0.206
 robots+0.202
Neg logit contributions
opsy-0.232
cious-0.205
opp-0.198
-0.197
 Honey-0.194
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
iren+0.141
rob+0.141
irens+0.133
loo+0.123
ulk+0.117
Neg logit contributions
opsy-0.080
cious-0.069
 relaxed-0.068
 cozy-0.068
 harvest-0.068
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
iren+0.048
rob+0.046
loo+0.043
irens+0.042
 inconvenience+0.039
Neg logit contributions
opsy-0.045
 cozy-0.040
 relaxed-0.039
 cos-0.038
cious-0.038
Token good
Token ID922
Feature activation-4.64e-02

Pos logit contributions
iren+0.075
rob+0.072
loo+0.070
 robots+0.069
 poster+0.069
Neg logit contributions
opsy-0.062
cious-0.051
 gra-0.048
ossom-0.048
apa-0.048
Token daughter
Token ID4957
Feature activation-4.42e-01

Pos logit contributions
opsy+0.002
 cozy+0.002
 harvest+0.002
 nib+0.002
 drink+0.002
Neg logit contributions
iren-0.002
rob-0.002
irens-0.002
loo-0.002
 robots-0.002
Token own
Token ID898
Feature activation+4.32e-01

Pos logit contributions
opsy+0.022
 cozy+0.018
cious+0.017
 cos+0.017
 nib+0.017
Neg logit contributions
iren-0.025
rob-0.024
loo-0.023
irens-0.022
 inconvenience-0.022
Token two
Token ID734
Feature activation+8.79e-01

Pos logit contributions
iren+0.020
rob+0.020
loo+0.019
irens+0.018
 inconvenience+0.018
Neg logit contributions
opsy-0.016
 cozy-0.013
cious-0.012
 harvest-0.012
 cos-0.012
Token hands
Token ID2832
Feature activation-7.98e-01

Pos logit contributions
iren+0.034
rob+0.032
 robots+0.031
 officer+0.030
loo+0.030
Neg logit contributions
opsy-0.039
ossom-0.032
ench-0.031
cious-0.031
 harvest-0.031
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
opsy+0.032
cious+0.028
 cozy+0.027
 nib+0.027
 harvest+0.026
Neg logit contributions
iren-0.032
rob-0.032
loo-0.030
 inconvenience-0.029
irens-0.028
Token She
Token ID1375
Feature activation+7.68e-02

Pos logit contributions
iren+0.017
rob+0.017
loo+0.016
 inconvenience+0.015
irens+0.015
Neg logit contributions
opsy-0.013
 Baby-0.010
 cozy-0.010
cious-0.010
 gra-0.010
Token se
Token ID384
Feature activation+5.74e+00

Pos logit contributions
iren+0.004
rob+0.004
loo+0.003
 inconvenience+0.003
irens+0.003
Neg logit contributions
opsy-0.003
 nib-0.002
cious-0.002
 cozy-0.002
 relaxed-0.002
Tokenwed
Token ID19103
Feature activation-4.61e-01

Pos logit contributions
 poster+0.277
iren+0.272
loo+0.267
 robots+0.262
irens+0.258
Neg logit contributions
opsy-0.160
ench-0.141
ossom-0.141
 mental-0.132
apa-0.129
Token it
Token ID340
Feature activation-1.01e+00

Pos logit contributions
opsy+0.022
cious+0.018
 cozy+0.018
 harvest+0.018
 herbal+0.018
Neg logit contributions
iren-0.016
rob-0.016
loo-0.015
 inconvenience-0.014
 robots-0.014
Token with
Token ID351
Feature activation-6.87e-01

Pos logit contributions
opsy+0.043
cious+0.036
 harvest+0.035
 cozy+0.035
ossom+0.035
Neg logit contributions
iren-0.040
rob-0.039
loo-0.036
 robots-0.035
irens-0.034
Token bright
Token ID6016
Feature activation-7.26e-01

Pos logit contributions
opsy+0.037
ossom+0.033
cious+0.032
ench+0.032
 harvest+0.030
Neg logit contributions
iren-0.019
 robots-0.018
rob-0.018
irens-0.016
 hits-0.016
Token colors
Token ID7577
Feature activation+1.10e-01

Pos logit contributions
opsy+0.035
cious+0.030
ossom+0.029
 harvest+0.029
 gra+0.029
Neg logit contributions
iren-0.025
rob-0.024
loo-0.022
 robots-0.022
irens-0.021
Token with
Token ID351
Feature activation-4.44e-01

Pos logit contributions
iren+0.054
rob+0.052
loo+0.048
 robots+0.047
irens+0.045
Neg logit contributions
opsy-0.081
cious-0.068
 gra-0.067
 harvest-0.067
 herbal-0.067
Token him
Token ID683
Feature activation-9.27e-01

Pos logit contributions
opsy+0.022
cious+0.019
ossom+0.019
 herbal+0.018
 gra+0.018
Neg logit contributions
iren-0.014
rob-0.014
 robots-0.013
loo-0.013
irens-0.012
Token,
Token ID11
Feature activation+3.91e-01

Pos logit contributions
opsy+0.047
cious+0.040
 herbal+0.040
 harvest+0.039
ossom+0.039
Neg logit contributions
iren-0.028
rob-0.026
 robots-0.025
loo-0.025
 officer-0.024
Token and
Token ID290
Feature activation+8.98e-01

Pos logit contributions
iren+0.015
rob+0.015
loo+0.014
 robots+0.014
 officer+0.013
Neg logit contributions
opsy-0.017
cious-0.014
ossom-0.014
 harvest-0.013
 herbal-0.013
Token he
Token ID339
Feature activation+1.56e+00

Pos logit contributions
iren+0.033
rob+0.032
loo+0.029
irens+0.028
 inconvenience+0.028
Neg logit contributions
opsy-0.042
cious-0.035
 cozy-0.035
 nib-0.035
 harvest-0.035
Token tre
Token ID2054
Feature activation+5.72e+00

Pos logit contributions
iren+0.067
rob+0.066
loo+0.062
 robots+0.060
irens+0.059
Neg logit contributions
opsy-0.061
cious-0.049
 harvest-0.048
ossom-0.047
 herbal-0.047
Tokenasured
Token ID34006
Feature activation+1.49e+00

Pos logit contributions
iren+0.254
rob+0.243
loo+0.232
 poster+0.230
 robots+0.227
Neg logit contributions
opsy-0.224
 nib-0.179
 harvest-0.173
 mashed-0.170
 cozy-0.169
Token it
Token ID340
Feature activation-6.90e-01

Pos logit contributions
iren+0.069
rob+0.067
loo+0.062
 inconvenience+0.061
irens+0.060
Neg logit contributions
opsy-0.056
 cozy-0.048
 cos-0.046
 relaxed-0.046
 nib-0.045
Token always
Token ID1464
Feature activation+1.88e+00

Pos logit contributions
opsy+0.029
cious+0.024
 cozy+0.024
 harvest+0.024
 nib+0.024
Neg logit contributions
iren-0.028
rob-0.027
loo-0.025
irens-0.025
 inconvenience-0.024
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
iren+0.081
rob+0.078
loo+0.074
irens+0.071
 robots+0.071
Neg logit contributions
opsy-0.074
cious-0.061
 cozy-0.060
 harvest-0.060
 nib-0.059
Token<|endoftext|>
Token ID50256
Feature activation+1.86e+00

Pos logit contributions
iren+0.007
rob+0.007
loo+0.007
 robots+0.007
 inconvenience+0.007
Neg logit contributions
opsy-0.007
cious-0.006
 cozy-0.006
 harvest-0.006
ossom-0.006
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
iren+0.092
rob+0.089
loo+0.082
irens+0.076
 officer+0.076
Neg logit contributions
opsy-0.143
 harvest-0.125
 cozy-0.124
 nib-0.122
cious-0.121
Token I
Token ID314
Feature activation-5.70e-01

Pos logit contributions
iren+0.048
rob+0.047
loo+0.043
 robots+0.041
irens+0.040
Neg logit contributions
opsy-0.078
cious-0.066
 harvest-0.065
 cozy-0.065
ossom-0.065
Token have
Token ID423
Feature activation+1.94e+00

Pos logit contributions
opsy+0.024
cious+0.020
 cozy+0.019
 herbal+0.019
 harvest+0.019
Neg logit contributions
iren-0.023
rob-0.022
loo-0.021
 inconvenience-0.020
 robots-0.020
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
rob+0.094
iren+0.094
loo+0.088
lights+0.082
irens+0.082
Neg logit contributions
opsy-0.067
 cozy-0.061
 nib-0.060
 mashed-0.060
 harvest-0.059
Token new
Token ID649
Feature activation+2.86e+00

Pos logit contributions
iren+0.065
rob+0.062
loo+0.060
 poster+0.059
 robots+0.059
Neg logit contributions
opsy-0.066
cious-0.056
ossom-0.053
 herbal-0.052
 gra-0.052
Token er
Token ID1931
Feature activation+5.71e+00

Pos logit contributions
iren+0.139
rob+0.136
irens+0.131
loo+0.119
lights+0.118
Neg logit contributions
opsy-0.102
 harvest-0.091
 nib-0.089
 drink-0.089
 pet-0.088
Tokenaser
Token ID6005
Feature activation+3.30e+00

Pos logit contributions
 inconvenience+0.224
 buses+0.203
iren+0.197
irens+0.196
rob+0.194
Neg logit contributions
opsy-0.212
ossom-0.184
-0.183
opp-0.176
ish-0.175
Token for
Token ID329
Feature activation+2.22e+00

Pos logit contributions
rob+0.156
iren+0.148
irens+0.146
loo+0.131
 inconvenience+0.131
Neg logit contributions
-0.109
 drink-0.096
 nib-0.095
 pet-0.095
opsy-0.092
Token you
Token ID345
Feature activation+4.71e-01

Pos logit contributions
rob+0.075
iren+0.073
loo+0.070
lights+0.060
irens+0.058
Neg logit contributions
opsy-0.110
 nib-0.097
 cozy-0.095
 Baby-0.095
 harvest-0.095
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
rob+0.018
iren+0.017
loo+0.015
irens+0.015
 robots+0.014
Neg logit contributions
opsy-0.022
 harvest-0.019
 cozy-0.019
 nib-0.019
cious-0.019
Token It
Token ID632
Feature activation+2.14e-01

Pos logit contributions
rob+0.050
iren+0.050
loo+0.047
irens+0.043
 inconvenience+0.043
Neg logit contributions
opsy-0.065
cious-0.056
 herbal-0.055
 nib-0.055
ossom-0.054
Token it
Token ID340
Feature activation-6.78e-01

Pos logit contributions
iren+0.000
rob+0.000
loo+0.000
 inconvenience+0.000
 robots+0.000
Neg logit contributions
opsy-0.000
cious-0.000
 cozy-0.000
 harvest-0.000
ossom-0.000
Token.
Token ID13
Feature activation-9.11e-02

Pos logit contributions
opsy+0.030
cious+0.025
 harvest+0.024
ossom+0.024
 cozy+0.024
Neg logit contributions
iren-0.026
rob-0.025
loo-0.023
 robots-0.023
irens-0.022
Token Suddenly
Token ID24975
Feature activation-2.60e-01

Pos logit contributions
opsy+0.004
cious+0.003
 harvest+0.003
 cozy+0.003
ossom+0.003
Neg logit contributions
iren-0.004
rob-0.004
loo-0.003
 robots-0.003
 inconvenience-0.003
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
opsy+0.011
cious+0.009
 cozy+0.009
 nib+0.009
 harvest+0.009
Neg logit contributions
iren-0.011
rob-0.011
loo-0.010
 inconvenience-0.009
 robots-0.009
Token a
Token ID257
Feature activation+4.68e-01

Pos logit contributions
opsy+0.011
 cozy+0.010
 harvest+0.010
 nib+0.010
cious+0.010
Neg logit contributions
iren-0.009
rob-0.009
loo-0.008
 inconvenience-0.008
irens-0.008
Token gen
Token ID2429
Feature activation+5.71e+00

Pos logit contributions
iren+0.020
rob+0.019
loo+0.018
 robots+0.018
irens+0.017
Neg logit contributions
opsy-0.019
cious-0.015
 harvest-0.015
ossom-0.015
 cozy-0.015
Tokenie
Token ID494
Feature activation-4.17e-02

Pos logit contributions
 robots+0.252
iren+0.248
rob+0.240
 inconvenience+0.235
loo+0.233
Neg logit contributions
opsy-0.200
cious-0.154
 nib-0.152
 gra-0.149
 relaxed-0.146
Token appeared
Token ID4120
Feature activation+3.10e-01

Pos logit contributions
opsy+0.002
cious+0.002
 nib+0.001
 harvest+0.001
 gra+0.001
Neg logit contributions
iren-0.002
rob-0.002
loo-0.001
 inconvenience-0.001
 robots-0.001
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
iren+0.011
rob+0.011
loo+0.010
 inconvenience+0.010
 robots+0.010
Neg logit contributions
opsy-0.014
cious-0.012
 harvest-0.012
 cozy-0.012
 nib-0.012
Token
Token ID220
Feature activation-4.28e-01

Pos logit contributions
opsy+0.019
cious+0.016
 harvest+0.015
 cozy+0.015
ossom+0.015
Neg logit contributions
iren-0.019
rob-0.018
loo-0.017
 robots-0.017
irens-0.016
Token
Token ID198
Feature activation-9.46e-01

Pos logit contributions
opsy+0.016
apa+0.012
 cozy+0.012
cious+0.012
ossom+0.012
Neg logit contributions
rob-0.019
iren-0.019
loo-0.018
 inconvenience-0.017
 robots-0.017
Token hands
Token ID2832
Feature activation-2.08e+00

Pos logit contributions
opsy+0.008
cious+0.007
 gra+0.007
ossom+0.007
 harvest+0.007
Neg logit contributions
iren-0.010
 robots-0.009
rob-0.009
loo-0.009
 officer-0.009
Token.
Token ID13
Feature activation-4.14e-01

Pos logit contributions
opsy+0.132
 gra+0.113
 herbal+0.112
 harvest+0.111
cious+0.110
Neg logit contributions
iren-0.042
er-0.039
driver-0.036
 robots-0.035
rob-0.032
Token Suddenly
Token ID24975
Feature activation-3.65e-01

Pos logit contributions
opsy+0.019
cious+0.017
 harvest+0.016
 herbal+0.016
ossom+0.016
Neg logit contributions
iren-0.015
rob-0.014
loo-0.014
 robots-0.013
irens-0.013
Token,
Token ID11
Feature activation-4.38e-01

Pos logit contributions
opsy+0.016
cious+0.013
 harvest+0.013
 cozy+0.013
ossom+0.013
Neg logit contributions
iren-0.014
rob-0.014
loo-0.013
 robots-0.012
irens-0.012
Token a
Token ID257
Feature activation+4.61e-01

Pos logit contributions
opsy+0.022
cious+0.019
 harvest+0.018
 cozy+0.018
 nib+0.018
Neg logit contributions
iren-0.014
rob-0.014
loo-0.013
 robots-0.012
irens-0.012
Token gen
Token ID2429
Feature activation+5.70e+00

Pos logit contributions
iren+0.019
rob+0.018
loo+0.017
 robots+0.017
 poster+0.017
Neg logit contributions
opsy-0.019
cious-0.016
ossom-0.016
 harvest-0.015
apa-0.015
Tokenie
Token ID494
Feature activation+1.38e-01

Pos logit contributions
iren+0.244
 robots+0.243
rob+0.228
 poster+0.228
driver+0.225
Neg logit contributions
opsy-0.206
 nib-0.162
 drink-0.153
 gra-0.153
 relaxed-0.152
Token appeared
Token ID4120
Feature activation+4.93e-01

Pos logit contributions
rob+0.005
iren+0.005
loo+0.005
 inconvenience+0.005
irens+0.005
Neg logit contributions
opsy-0.006
 nib-0.005
cious-0.005
 gra-0.005
 cozy-0.005
Token!
Token ID0
Feature activation-4.10e-01

Pos logit contributions
iren+0.018
rob+0.018
loo+0.017
 inconvenience+0.016
 robots+0.016
Neg logit contributions
opsy-0.022
cious-0.018
 harvest-0.018
 cozy-0.018
 nib-0.018
Token
Token ID220
Feature activation-3.45e-01

Pos logit contributions
opsy+0.016
cious+0.012
 cozy+0.012
 harvest+0.012
 nib+0.012
Neg logit contributions
iren-0.018
rob-0.018
loo-0.017
 robots-0.016
 inconvenience-0.016
Token
Token ID198
Feature activation-9.58e-01

Pos logit contributions
opsy+0.012
 cozy+0.009
ench+0.009
apa+0.009
ossom+0.009
Neg logit contributions
rob-0.017
iren-0.016
 inconvenience-0.015
loo-0.015
 poster-0.015
Token.
Token ID13
Feature activation+5.17e-01

Pos logit contributions
iren+0.004
rob+0.004
loo+0.004
 robots+0.004
 inconvenience+0.004
Neg logit contributions
opsy-0.005
cious-0.004
 cozy-0.004
 nib-0.004
ossom-0.004
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
rob+0.025
iren+0.024
loo+0.022
 inconvenience+0.021
 poster+0.021
Neg logit contributions
opsy-0.018
 Baby-0.015
 gra-0.015
cious-0.015
 cozy-0.015
Token
Token ID198
Feature activation-7.37e-02

Pos logit contributions
rob+0.008
iren+0.007
loo+0.007
 inconvenience+0.007
 robots+0.007
Neg logit contributions
opsy-0.005
 cozy-0.004
apa-0.004
 gra-0.004
 nib-0.004
Token"
Token ID1
Feature activation+9.66e-01

Pos logit contributions
opsy+0.003
cious+0.003
 cozy+0.003
ossom+0.003
 harvest+0.003
Neg logit contributions
rob-0.003
iren-0.003
loo-0.003
 robots-0.003
 inconvenience-0.002
TokenOf
Token ID5189
Feature activation+1.31e+00

Pos logit contributions
iren+0.041
rob+0.039
loo+0.039
 robots+0.037
irens+0.036
Neg logit contributions
opsy-0.039
cious-0.032
 relaxed-0.031
 ju-0.031
 herbal-0.030
Token course
Token ID1781
Feature activation+5.69e+00

Pos logit contributions
 robots+0.039
iren+0.038
lights+0.035
driver+0.035
loo+0.034
Neg logit contributions
opsy-0.069
 Baby-0.062
-0.060
 Ginny-0.060
apa-0.060
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
rob+0.287
iren+0.284
irens+0.256
loo+0.255
 robots+0.246
Neg logit contributions
opsy-0.192
-0.171
 Baby-0.170
cious-0.164
 nib-0.163
Token you
Token ID345
Feature activation+6.48e-01

Pos logit contributions
rob+0.069
iren+0.068
irens+0.060
loo+0.059
ulk+0.054
Neg logit contributions
opsy-0.075
 Baby-0.065
 ju-0.063
 cozy-0.063
 Ginny-0.063
Token can
Token ID460
Feature activation+1.12e+00

Pos logit contributions
iren+0.027
rob+0.027
loo+0.024
irens+0.024
 robots+0.024
Neg logit contributions
opsy-0.027
 harvest-0.022
cious-0.022
 cozy-0.022
 nib-0.022
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
iren+0.033
rob+0.032
loo+0.029
 inconvenience+0.028
 robots+0.028
Neg logit contributions
opsy-0.059
cious-0.050
 herbal-0.049
ossom-0.049
 cozy-0.049
Token But
Token ID887
Feature activation+9.20e-01

Pos logit contributions
iren+0.027
rob+0.027
loo+0.024
irens+0.023
 robots+0.023
Neg logit contributions
opsy-0.044
cious-0.038
 harvest-0.037
 cozy-0.037
 gra-0.037
Token always
Token ID1464
Feature activation+2.43e+00

Pos logit contributions
iren+0.058
rob+0.057
loo+0.054
irens+0.052
 inconvenience+0.051
Neg logit contributions
opsy-0.045
cious-0.036
 nib-0.035
 harvest-0.035
 cozy-0.035
Token acted
Token ID13134
Feature activation+1.07e+00

Pos logit contributions
iren+0.116
rob+0.113
loo+0.108
irens+0.105
 robots+0.101
Neg logit contributions
opsy-0.084
 relaxed-0.070
 cozy-0.069
cious-0.069
 nib-0.068
Token silly
Token ID14397
Feature activation-8.01e-01

Pos logit contributions
rob+0.052
iren+0.050
loo+0.048
 inconvenience+0.045
irens+0.045
Neg logit contributions
opsy-0.036
 cozy-0.030
cious-0.030
ossom-0.029
 relaxed-0.029
Token and
Token ID290
Feature activation+3.90e-02

Pos logit contributions
opsy+0.027
cious+0.024
 drink+0.024
 nib+0.024
 herbal+0.024
Neg logit contributions
rob-0.037
iren-0.035
loo-0.033
irens-0.031
 poster-0.030
Token mis
Token ID2984
Feature activation+2.11e+00

Pos logit contributions
rob+0.002
iren+0.002
loo+0.002
irens+0.002
 poster+0.002
Neg logit contributions
opsy-0.001
 cozy-0.001
 relaxed-0.001
cious-0.001
 nib-0.001
Tokenchie
Token ID3043
Feature activation+5.65e+00

Pos logit contributions
iren+0.063
rob+0.058
loo+0.052
 robots+0.052
irens+0.052
Neg logit contributions
opsy-0.113
 cozy-0.097
cious-0.096
 herbal-0.095
 harvest-0.095
Tokenvous
Token ID31222
Feature activation-5.36e-01

Pos logit contributions
iren+0.278
loo+0.269
rob+0.269
 poster+0.268
irens+0.260
Neg logit contributions
cious-0.150
opsy-0.149
 relaxed-0.145
-0.143
ossom-0.131
Token.
Token ID13
Feature activation+5.83e-02

Pos logit contributions
opsy+0.023
cious+0.020
 harvest+0.020
 cozy+0.020
 nib+0.019
Neg logit contributions
rob-0.021
iren-0.020
loo-0.018
irens-0.017
 poster-0.017
Token
Token ID198
Feature activation-6.99e-01

Pos logit contributions
iren+0.003
rob+0.003
loo+0.002
irens+0.002
 poster+0.002
Neg logit contributions
opsy-0.002
 cozy-0.002
 gra-0.002
cious-0.002
 Baby-0.002
Token
Token ID198
Feature activation-8.54e-01

Pos logit contributions
opsy+0.024
 cozy+0.019
cious+0.019
 gra+0.018
 nib+0.018
Neg logit contributions
iren-0.033
rob-0.032
loo-0.030
 robots-0.030
 inconvenience-0.030
TokenOne
Token ID3198
Feature activation+1.84e+00

Pos logit contributions
opsy+0.041
cious+0.034
 cozy+0.034
 harvest+0.034
 nib+0.033
Neg logit contributions
iren-0.030
rob-0.029
loo-0.027
 robots-0.026
 inconvenience-0.025
Token said
Token ID531
Feature activation+1.08e+00

Pos logit contributions
iren+0.027
rob+0.026
 robots+0.024
loo+0.023
 officer+0.023
Neg logit contributions
opsy-0.049
 harvest-0.042
cious-0.041
ossom-0.041
 gra-0.041
Token,
Token ID11
Feature activation+8.36e-01

Pos logit contributions
iren+0.027
er+0.023
 robots+0.023
driver+0.023
 officer+0.023
Neg logit contributions
opsy-0.062
 harvest-0.053
 ju-0.053
 herbal-0.052
 gra-0.052
Token holding
Token ID4769
Feature activation-1.87e+00

Pos logit contributions
iren+0.027
 poster+0.025
 robots+0.025
 officer+0.025
lf+0.024
Neg logit contributions
opsy-0.040
apa-0.035
ench-0.035
ossom-0.035
opp-0.034
Token the
Token ID262
Feature activation-8.08e-01

Pos logit contributions
opsy+0.101
cious+0.090
ossom+0.089
 harvest+0.088
ench+0.088
Neg logit contributions
iren-0.046
 robots-0.045
rob-0.044
 hits-0.042
er-0.042
Token magn
Token ID7842
Feature activation-2.42e+00

Pos logit contributions
opsy+0.031
ossom+0.026
cious+0.026
ench+0.025
 gra+0.025
Neg logit contributions
iren-0.034
rob-0.032
loo-0.032
 poster-0.031
 robots-0.031
Tokenifying
Token ID4035
Feature activation-6.53e+00

Pos logit contributions
opsy+0.074
cious+0.056
 harvest+0.054
 cozy+0.054
ossom+0.054
Neg logit contributions
iren-0.125
rob-0.123
loo-0.117
 robots-0.113
irens-0.112
Token glass
Token ID5405
Feature activation-1.56e+00

Pos logit contributions
opsy+0.165
 cozy+0.133
 nib+0.127
 harvest+0.126
 drink+0.123
Neg logit contributions
rob-0.372
iren-0.355
 inconvenience-0.336
 robots-0.336
 officer-0.336
Token over
Token ID625
Feature activation-2.38e+00

Pos logit contributions
opsy+0.071
cious+0.060
 harvest+0.059
 cozy+0.059
 nib+0.059
Neg logit contributions
rob-0.057
iren-0.057
loo-0.051
 robots-0.049
 inconvenience-0.049
Token a
Token ID257
Feature activation-5.73e-01

Pos logit contributions
opsy+0.112
cious+0.094
ossom+0.093
 harvest+0.091
ench+0.091
Neg logit contributions
iren-0.084
rob-0.081
loo-0.076
 robots-0.074
irens-0.073
Token black
Token ID2042
Feature activation-9.13e-01

Pos logit contributions
opsy+0.026
apa+0.021
ench+0.021
opp+0.020
ossom+0.020
Neg logit contributions
iren-0.021
 poster-0.020
 officer-0.019
loo-0.019
 robots-0.019
Token ant
Token ID1885
Feature activation-4.67e-01

Pos logit contributions
opsy+0.039
cious+0.032
 harvest+0.032
 nib+0.031
 herbal+0.031
Neg logit contributions
iren-0.036
rob-0.036
loo-0.033
 robots-0.032
irens-0.032
Token the
Token ID262
Feature activation-4.18e-01

Pos logit contributions
iren+0.025
rob+0.024
ider+0.024
 inconvenience+0.024
loo+0.022
Neg logit contributions
opsy-0.018
 Baby-0.015
 Ginny-0.014
 cozy-0.014
 drink-0.013
Token watch
Token ID2342
Feature activation-3.17e+00

Pos logit contributions
opsy+0.018
cious+0.014
 harvest+0.014
 cozy+0.014
ossom+0.014
Neg logit contributions
iren-0.017
rob-0.017
loo-0.015
 robots-0.015
irens-0.015
Token with
Token ID351
Feature activation-7.37e-01

Pos logit contributions
opsy+0.141
cious+0.121
 harvest+0.118
 nib+0.118
 gra+0.117
Neg logit contributions
rob-0.117
iren-0.115
loo-0.104
 inconvenience-0.099
 robots-0.099
Token the
Token ID262
Feature activation-5.25e-01

Pos logit contributions
opsy+0.034
cious+0.028
 harvest+0.027
ossom+0.027
 cozy+0.027
Neg logit contributions
iren-0.027
rob-0.027
loo-0.025
 robots-0.024
irens-0.024
Token magn
Token ID7842
Feature activation-2.49e+00

Pos logit contributions
opsy+0.021
cious+0.017
 cozy+0.017
 harvest+0.017
 nib+0.017
Neg logit contributions
iren-0.022
rob-0.022
loo-0.020
 robots-0.019
irens-0.019
Tokenifying
Token ID4035
Feature activation-6.47e+00

Pos logit contributions
opsy+0.069
cious+0.050
ossom+0.049
 harvest+0.047
 nib+0.047
Neg logit contributions
rob-0.137
iren-0.136
loo-0.128
 inconvenience-0.126
 robots-0.125
Token glass
Token ID5405
Feature activation-1.44e+00

Pos logit contributions
opsy+0.150
 cozy+0.125
 drink+0.121
 nib+0.121
 harvest+0.118
Neg logit contributions
rob-0.372
iren-0.355
 robots-0.345
 officer-0.341
 inconvenience-0.336
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
opsy+0.059
cious+0.050
 nib+0.049
 harvest+0.049
 drink+0.049
Neg logit contributions
rob-0.059
iren-0.057
 inconvenience-0.051
irens-0.051
 robots-0.050
Token He
Token ID679
Feature activation+5.35e-01

Pos logit contributions
opsy+0.005
cious+0.004
 cozy+0.004
 harvest+0.004
ossom+0.004
Neg logit contributions
iren-0.004
rob-0.004
loo-0.004
 robots-0.003
 inconvenience-0.003
Token saw
Token ID2497
Feature activation+8.99e-02

Pos logit contributions
iren+0.018
rob+0.018
 robots+0.017
 officer+0.016
loo+0.016
Neg logit contributions
opsy-0.024
ench-0.021
 harvest-0.021
 herbal-0.021
ossom-0.021
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
rob+0.004
iren+0.004
 inconvenience+0.004
loo+0.004
 poster+0.004
Neg logit contributions
opsy-0.003
 cozy-0.003
 nib-0.003
 mashed-0.003
 cos-0.003
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
opsy+0.034
cious+0.027
ossom+0.026
 cozy+0.026
 harvest+0.026
Neg logit contributions
iren-0.038
rob-0.037
loo-0.035
 robots-0.034
 inconvenience-0.034
TokenHe
Token ID1544
Feature activation+3.05e-01

Pos logit contributions
opsy+0.046
 gra+0.037
 cozy+0.037
 harvest+0.037
 herbal+0.037
Neg logit contributions
iren-0.038
rob-0.035
loo-0.033
irens-0.033
 robots-0.032
Token grabbed
Token ID13176
Feature activation+7.75e-02

Pos logit contributions
iren+0.014
rob+0.014
loo+0.013
 robots+0.012
irens+0.012
Neg logit contributions
opsy-0.011
cious-0.009
 nib-0.009
 cozy-0.009
 harvest-0.009
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
iren+0.003
rob+0.003
loo+0.003
 inconvenience+0.003
irens+0.002
Neg logit contributions
opsy-0.003
 cozy-0.003
 nib-0.003
 harvest-0.003
cious-0.003
Token che
Token ID1125
Feature activation-1.04e+00

Pos logit contributions
opsy+0.044
cious+0.036
 cozy+0.035
 harvest+0.035
 herbal+0.034
Neg logit contributions
iren-0.049
rob-0.048
loo-0.045
 robots-0.043
irens-0.043
Tokenwy
Token ID21768
Feature activation-6.45e+00

Pos logit contributions
opsy+0.040
 harvest+0.032
ench+0.032
 mashed+0.032
 nib+0.031
Neg logit contributions
iren-0.044
 officer-0.043
 robots-0.042
rob-0.041
 poster-0.041
Token snack
Token ID26906
Feature activation-3.27e+00

Pos logit contributions
opsy+0.316
ossom+0.269
apa+0.264
cious+0.262
 herbal+0.259
Neg logit contributions
rob-0.229
iren-0.219
 robots-0.207
loo-0.203
 poster-0.195
Token he
Token ID339
Feature activation-1.03e-01

Pos logit contributions
opsy+0.153
cious+0.128
 harvest+0.126
 cozy+0.125
ossom+0.125
Neg logit contributions
iren-0.117
rob-0.114
loo-0.105
 robots-0.101
irens-0.099
Token was
Token ID373
Feature activation+5.33e-01

Pos logit contributions
opsy+0.004
cious+0.003
 harvest+0.003
 cozy+0.003
 nib+0.003
Neg logit contributions
iren-0.005
rob-0.004
loo-0.004
 robots-0.004
irens-0.004
Token eating
Token ID6600
Feature activation-3.90e+00

Pos logit contributions
iren+0.021
rob+0.021
loo+0.019
 robots+0.019
irens+0.019
Neg logit contributions
opsy-0.023
cious-0.019
 harvest-0.018
ossom-0.018
 herbal-0.018
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
opsy+0.184
cious+0.155
 harvest+0.151
ossom+0.151
 cozy+0.151
Neg logit contributions
iren-0.137
rob-0.134
loo-0.124
 robots-0.119
irens-0.116
Token cried
Token ID16896
Feature activation+1.91e+00

Pos logit contributions
iren+0.024
rob+0.023
 robots+0.022
irens+0.021
?"+0.021
Neg logit contributions
opsy-0.032
ossom-0.028
 harvest-0.027
 gra-0.027
 herbal-0.027
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
iren+0.054
 robots+0.047
 officer+0.046
irens+0.044
er+0.044
Neg logit contributions
opsy-0.105
cious-0.095
ossom-0.093
 gra-0.092
 harvest-0.091
Token dropping
Token ID12047
Feature activation-1.85e+00

Pos logit contributions
iren+0.046
 officer+0.042
 robots+0.042
irens+0.040
er+0.039
Neg logit contributions
opsy-0.070
ossom-0.064
opp-0.061
apa-0.060
ench-0.060
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
opsy+0.089
cious+0.076
ossom+0.075
ench+0.073
 harvest+0.072
Neg logit contributions
iren-0.065
rob-0.060
 robots-0.057
loo-0.056
irens-0.056
Token magn
Token ID7842
Feature activation-2.33e+00

Pos logit contributions
opsy+0.016
cious+0.013
ossom+0.013
ench+0.013
 gra+0.013
Neg logit contributions
iren-0.016
loo-0.015
rob-0.014
 poster-0.014
 robots-0.014
Tokenifying
Token ID4035
Feature activation-6.37e+00

Pos logit contributions
opsy+0.062
cious+0.044
ossom+0.042
 harvest+0.042
apa+0.041
Neg logit contributions
rob-0.131
iren-0.129
loo-0.121
 inconvenience-0.121
 robots-0.120
Token glass
Token ID5405
Feature activation-1.56e+00

Pos logit contributions
opsy+0.149
 cozy+0.126
 drink+0.120
 harvest+0.117
 nib+0.116
Neg logit contributions
rob-0.370
iren-0.341
 robots-0.331
 inconvenience-0.330
 officer-0.329
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
opsy+0.058
 drink+0.051
 harvest+0.049
 nib+0.048
 vegetable+0.047
Neg logit contributions
rob-0.072
iren-0.063
 inconvenience-0.060
irens-0.059
 robots-0.058
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
iren+0.004
rob+0.004
loo+0.004
 robots+0.003
 inconvenience+0.003
Neg logit contributions
opsy-0.004
cious-0.004
 cozy-0.003
 harvest-0.003
ossom-0.003
Token
Token ID198
Feature activation-2.60e-01

Pos logit contributions
opsy+0.006
cious+0.005
 cozy+0.005
ossom+0.005
apa+0.005
Neg logit contributions
rob-0.007
iren-0.007
loo-0.006
 robots-0.006
 inconvenience-0.006
Token"
Token ID1
Feature activation+6.14e-01

Pos logit contributions
opsy+0.014
 harvest+0.012
 nib+0.012
 gra+0.012
cious+0.012
Neg logit contributions
iren-0.007
rob-0.007
loo-0.006
irens-0.006
 robots-0.006
Token and
Token ID290
Feature activation-3.40e-01

Pos logit contributions
opsy+0.068
cious+0.058
ossom+0.058
 harvest+0.056
ench+0.056
Neg logit contributions
iren-0.039
rob-0.038
loo-0.035
 robots-0.033
irens-0.032
Token scoop
Token ID23529
Feature activation-8.64e-01

Pos logit contributions
opsy+0.015
ench+0.013
 harvest+0.013
opp+0.013
ossom+0.013
Neg logit contributions
iren-0.011
rob-0.011
 officer-0.010
er-0.010
 robots-0.010
Tokened
Token ID276
Feature activation-2.50e+00

Pos logit contributions
opsy+0.047
cious+0.041
 harvest+0.041
ossom+0.040
 gra+0.040
Neg logit contributions
iren-0.023
rob-0.022
loo-0.020
er-0.020
irens-0.019
Token up
Token ID510
Feature activation-2.31e+00

Pos logit contributions
opsy+0.135
cious+0.116
ossom+0.114
ench+0.112
opp+0.111
Neg logit contributions
er-0.062
iren-0.060
 roles-0.058
 robots-0.056
 hits-0.056
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
opsy+0.121
cious+0.104
ossom+0.103
ench+0.103
 mental+0.101
Neg logit contributions
iren-0.063
rob-0.061
er-0.056
loo-0.055
 robots-0.055
Token refreshing
Token ID23056
Feature activation-6.31e+00

Pos logit contributions
opsy+0.053
cious+0.044
ossom+0.042
opp+0.042
apa+0.042
Neg logit contributions
iren-0.060
loo-0.059
rob-0.058
 inconvenience-0.056
 poster-0.055
Token water
Token ID1660
Feature activation-2.13e+00

Pos logit contributions
opsy+0.263
cious+0.237
apa+0.229
 mental+0.226
ossom+0.225
Neg logit contributions
iren-0.231
er-0.228
loo-0.228
 hits-0.217
rob-0.215
Token into
Token ID656
Feature activation+6.24e-01

Pos logit contributions
opsy+0.113
cious+0.096
 mental+0.095
ossom+0.094
 gra+0.093
Neg logit contributions
iren-0.060
er-0.055
loo-0.054
rob-0.053
lights-0.048
Token her
Token ID607
Feature activation-7.67e-01

Pos logit contributions
iren+0.018
rob+0.016
 yourselves+0.016
loo+0.016
 robots+0.015
Neg logit contributions
opsy-0.032
cious-0.029
ossom-0.029
ench-0.028
apa-0.027
Token hands
Token ID2832
Feature activation-1.57e+00

Pos logit contributions
opsy+0.026
 mental+0.022
 Ginny+0.022
opp+0.022
cious+0.022
Neg logit contributions
loo-0.030
iren-0.030
?"-0.029
lights-0.029
 yourselves-0.029
Token.
Token ID13
Feature activation+7.30e-02

Pos logit contributions
opsy+0.095
 gra+0.084
 mental+0.083
 herbal+0.082
 harvest+0.080
Neg logit contributions
er-0.031
iren-0.029
driver-0.023
?"-0.023
loo-0.023
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
iren+0.000
rob+0.000
loo+0.000
 robots+0.000
irens+0.000
Neg logit contributions
opsy-0.000
cious-0.000
 harvest-0.000
 cozy-0.000
ossom-0.000
Token
Token ID198
Feature activation-5.14e-01

Pos logit contributions
opsy+0.012
cious+0.010
 harvest+0.010
 cozy+0.010
ossom+0.010
Neg logit contributions
iren-0.012
rob-0.011
loo-0.011
 robots-0.010
irens-0.010
TokenThe
Token ID464
Feature activation+7.50e-01

Pos logit contributions
opsy+0.021
cious+0.017
 cozy+0.017
 harvest+0.017
 herbal+0.017
Neg logit contributions
iren-0.021
rob-0.021
loo-0.020
 robots-0.019
irens-0.019
Token skeleton
Token ID18328
Feature activation-2.86e-02

Pos logit contributions
iren+0.027
rob+0.027
loo+0.024
irens+0.024
 robots+0.022
Neg logit contributions
opsy-0.035
 cozy-0.030
 harvest-0.030
cious-0.029
 herbal-0.029
Token enjoyed
Token ID8359
Feature activation-2.39e+00

Pos logit contributions
opsy+0.001
cious+0.001
 nib+0.001
 cozy+0.001
 harvest+0.001
Neg logit contributions
iren-0.001
rob-0.001
loo-0.001
irens-0.001
 robots-0.001
Token drinking
Token ID7722
Feature activation-6.30e+00

Pos logit contributions
opsy+0.109
cious+0.091
 harvest+0.090
 nib+0.090
 cozy+0.090
Neg logit contributions
iren-0.089
rob-0.086
loo-0.080
 robots-0.077
irens-0.076
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
opsy+0.306
cious+0.260
ossom+0.252
opp+0.243
 harvest+0.242
Neg logit contributions
iren-0.216
rob-0.206
loo-0.200
 robots-0.188
er-0.186
Token magical
Token ID10883
Feature activation-1.40e+00

Pos logit contributions
opsy+0.054
apa+0.044
cious+0.044
opp+0.042
 mental+0.040
Neg logit contributions
iren-0.070
rob-0.070
loo-0.069
 poster-0.067
er-0.066
Token drink
Token ID4144
Feature activation-5.70e+00

Pos logit contributions
opsy+0.052
cious+0.042
apa+0.040
ossom+0.040
opp+0.039
Neg logit contributions
iren-0.062
rob-0.061
loo-0.058
 poster-0.056
 robots-0.056
Token all
Token ID477
Feature activation-1.07e-01

Pos logit contributions
opsy+0.276
cious+0.232
ossom+0.227
 harvest+0.226
 cozy+0.225
Neg logit contributions
iren-0.192
rob-0.186
loo-0.175
 robots-0.165
irens-0.160
Token the
Token ID262
Feature activation-9.00e-01

Pos logit contributions
opsy+0.004
 cozy+0.004
 harvest+0.004
 relaxed+0.004
 herbal+0.003
Neg logit contributions
iren-0.004
rob-0.004
 robots-0.004
irens-0.004
 inconvenience-0.004
Token ever
Token ID1683
Feature activation+6.56e-01

Pos logit contributions
opsy+0.269
cious+0.230
ossom+0.227
 harvest+0.226
 gra+0.225
Neg logit contributions
iren-0.140
rob-0.135
loo-0.127
 robots-0.117
 inconvenience-0.113
Token!
Token ID0
Feature activation-4.99e-01

Pos logit contributions
rob+0.021
iren+0.020
loo+0.019
er+0.019
 robots+0.019
Neg logit contributions
opsy-0.033
apa-0.027
 gra-0.027
opp-0.027
ossom-0.026
Token The
Token ID383
Feature activation+5.95e-01

Pos logit contributions
opsy+0.020
cious+0.016
 harvest+0.015
 cozy+0.015
ossom+0.015
Neg logit contributions
iren-0.022
rob-0.021
loo-0.020
 robots-0.019
irens-0.019
Token bee
Token ID20697
Feature activation-2.33e+00

Pos logit contributions
iren+0.024
 poster+0.024
loo+0.024
rob+0.023
?"+0.023
Neg logit contributions
opsy-0.021
cious-0.019
ossom-0.018
opp-0.018
apa-0.018
Token kept
Token ID4030
Feature activation-1.68e+00

Pos logit contributions
opsy+0.099
cious+0.084
opp+0.083
 mental+0.083
 gra+0.083
Neg logit contributions
iren-0.081
rob-0.080
er-0.077
loo-0.076
 robots-0.072
Token drinking
Token ID7722
Feature activation-6.29e+00

Pos logit contributions
opsy+0.075
ossom+0.069
ench+0.066
cious+0.066
apa+0.064
Neg logit contributions
iren-0.056
rob-0.050
 hits-0.049
irens-0.049
 robots-0.049
Token it
Token ID340
Feature activation-1.89e+00

Pos logit contributions
opsy+0.361
cious+0.324
ossom+0.306
 mental+0.298
 gra+0.296
Neg logit contributions
er-0.151
iren-0.135
?"-0.121
ers-0.121
loo-0.117
Token every
Token ID790
Feature activation+3.62e-02

Pos logit contributions
opsy+0.108
 gra+0.094
cious+0.093
 mental+0.091
apa+0.090
Neg logit contributions
er-0.042
?"-0.036
iren-0.035
ers-0.035
lights-0.034
Token day
Token ID1110
Feature activation+5.13e-01

Pos logit contributions
iren+0.001
rob+0.001
loo+0.001
 inconvenience+0.001
 poster+0.001
Neg logit contributions
opsy-0.002
cious-0.001
ossom-0.001
apa-0.001
opp-0.001
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
iren+0.011
er+0.011
 robots+0.010
rob+0.010
loo+0.009
Neg logit contributions
opsy-0.029
ench-0.025
ossom-0.025
 mental-0.024
 gra-0.024
Token until
Token ID1566
Feature activation-7.15e-01

Pos logit contributions
iren+0.005
rob+0.005
loo+0.005
 robots+0.005
 poster+0.005
Neg logit contributions
opsy-0.007
cious-0.006
ossom-0.006
apa-0.005
ench-0.005
Token When
Token ID1649
Feature activation-1.22e+00

Pos logit contributions
rob+0.021
iren+0.020
loo+0.019
 inconvenience+0.019
 poster+0.018
Neg logit contributions
opsy-0.016
 Baby-0.013
 gra-0.013
cious-0.013
 cozy-0.013
Token Daisy
Token ID40355
Feature activation-1.38e+00

Pos logit contributions
opsy+0.050
 cozy+0.043
 drink+0.043
 Baby+0.043
 harvest+0.043
Neg logit contributions
rob-0.051
iren-0.050
loo-0.044
irens-0.043
 robots-0.042
Token put
Token ID1234
Feature activation-2.55e-01

Pos logit contributions
opsy+0.062
cious+0.052
 cozy+0.051
 herbal+0.051
ossom+0.051
Neg logit contributions
iren-0.050
rob-0.047
?"-0.045
loo-0.045
 hits-0.044
Token the
Token ID262
Feature activation-8.16e-01

Pos logit contributions
opsy+0.010
 cozy+0.008
 Ginny+0.008
 nib+0.008
 Baby+0.008
Neg logit contributions
iren-0.011
rob-0.011
 inconvenience-0.010
loo-0.010
 officer-0.010
Token magn
Token ID7842
Feature activation-1.92e+00

Pos logit contributions
opsy+0.034
cious+0.027
 cozy+0.027
 harvest+0.027
 herbal+0.027
Neg logit contributions
iren-0.034
rob-0.033
loo-0.030
irens-0.030
 robots-0.029
Tokenifying
Token ID4035
Feature activation-6.28e+00

Pos logit contributions
opsy+0.049
ossom+0.037
cious+0.036
apa+0.034
 harvest+0.034
Neg logit contributions
rob-0.109
iren-0.107
 inconvenience-0.102
loo-0.101
 robots-0.098
Token glass
Token ID5405
Feature activation-1.10e+00

Pos logit contributions
opsy+0.185
 cozy+0.145
 harvest+0.144
 nib+0.142
ossom+0.141
Neg logit contributions
rob-0.329
iren-0.326
 robots-0.310
 officer-0.308
 inconvenience-0.304
Token down
Token ID866
Feature activation-2.28e+00

Pos logit contributions
opsy+0.040
 cozy+0.034
 drink+0.033
 cos+0.033
ossom+0.033
Neg logit contributions
rob-0.049
iren-0.047
 inconvenience-0.046
 robots-0.044
irens-0.043
Token,
Token ID11
Feature activation-8.41e-01

Pos logit contributions
opsy+0.076
 cozy+0.066
 Baby+0.064
 drink+0.063
 Ginny+0.063
Neg logit contributions
rob-0.117
iren-0.108
 inconvenience-0.100
 robots-0.098
irens-0.097
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
opsy+0.036
cious+0.030
 cozy+0.030
 harvest+0.029
 nib+0.029
Neg logit contributions
iren-0.034
rob-0.033
loo-0.031
irens-0.029
 robots-0.029
Token cater
Token ID20825
Feature activation-1.25e+00

Pos logit contributions
iren+0.018
rob+0.018
loo+0.017
irens+0.016
 robots+0.016
Neg logit contributions
opsy-0.019
cious-0.015
 cozy-0.015
 harvest-0.015
 herbal-0.015
Token
Token ID198
Feature activation-2.42e-01

Pos logit contributions
rob+0.003
iren+0.003
loo+0.003
 inconvenience+0.003
 robots+0.003
Neg logit contributions
opsy-0.003
cious-0.002
 cozy-0.002
 Baby-0.002
 nib-0.002
Token
Token ID198
Feature activation-3.51e-01

Pos logit contributions
opsy+0.009
apa+0.007
cious+0.007
 cozy+0.006
 Baby+0.006
Neg logit contributions
rob-0.012
iren-0.011
loo-0.011
 inconvenience-0.010
 robots-0.010
TokenThe
Token ID464
Feature activation+1.00e+00

Pos logit contributions
opsy+0.015
cious+0.013
ossom+0.012
 cozy+0.012
 harvest+0.012
Neg logit contributions
rob-0.013
iren-0.013
loo-0.013
 robots-0.012
 inconvenience-0.012
Token kitten
Token ID42143
Feature activation-7.95e-01

Pos logit contributions
iren+0.046
rob+0.045
loo+0.042
irens+0.040
 robots+0.040
Neg logit contributions
opsy-0.038
cious-0.030
 cozy-0.029
 harvest-0.029
 herbal-0.029
Token kept
Token ID4030
Feature activation-1.69e+00

Pos logit contributions
opsy+0.034
 harvest+0.028
cious+0.028
ossom+0.028
ench+0.028
Neg logit contributions
iren-0.029
rob-0.028
 officer-0.027
er-0.027
 robots-0.026
Token drinking
Token ID7722
Feature activation-6.28e+00

Pos logit contributions
opsy+0.081
ossom+0.072
ench+0.070
cious+0.070
 ju+0.067
Neg logit contributions
iren-0.058
rob-0.051
 robots-0.051
irens-0.050
 poster-0.048
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
opsy+0.339
cious+0.300
ossom+0.287
 harvest+0.278
ench+0.277
Neg logit contributions
iren-0.177
er-0.165
loo-0.146
 robots-0.146
rob-0.145
Token drinking
Token ID7722
Feature activation-5.31e+00

Pos logit contributions
opsy+0.084
ossom+0.070
cious+0.069
ench+0.068
opp+0.067
Neg logit contributions
iren-0.074
rob-0.073
 robots-0.068
loo-0.068
er-0.065
Token until
Token ID1566
Feature activation-9.50e-01

Pos logit contributions
opsy+0.276
cious+0.238
ossom+0.229
 harvest+0.224
opp+0.223
Neg logit contributions
iren-0.162
rob-0.149
er-0.144
loo-0.143
 robots-0.139
Token it
Token ID340
Feature activation-5.57e-02

Pos logit contributions
opsy+0.041
 harvest+0.034
 cozy+0.033
cious+0.033
 nib+0.033
Neg logit contributions
iren-0.038
rob-0.038
loo-0.035
 robots-0.033
 inconvenience-0.032
Token was
Token ID373
Feature activation-3.80e-01

Pos logit contributions
opsy+0.003
cious+0.002
 harvest+0.002
 mental+0.002
opp+0.002
Neg logit contributions
er-0.002
iren-0.001
ers-0.001
loo-0.001
rob-0.001
Token skull
Token ID14511
Feature activation-1.78e+00

Pos logit contributions
iren+0.003
loo+0.003
rob+0.003
 robots+0.003
?"+0.003
Neg logit contributions
opsy-0.003
opp-0.002
cious-0.002
ossom-0.002
apa-0.002
Token and
Token ID290
Feature activation+2.78e-01

Pos logit contributions
opsy+0.095
cious+0.080
 gra+0.080
 mental+0.079
 Ginny+0.079
Neg logit contributions
iren-0.051
loo-0.047
rob-0.046
er-0.044
 robots-0.044
Token pretended
Token ID36766
Feature activation-4.98e-01

Pos logit contributions
rob+0.010
iren+0.010
 robots+0.009
 hits+0.009
er+0.009
Neg logit contributions
opsy-0.012
ench-0.010
ossom-0.010
cious-0.010
opp-0.010
Token it
Token ID340
Feature activation-1.51e+00

Pos logit contributions
opsy+0.022
 cozy+0.018
cious+0.018
 nib+0.018
 harvest+0.018
Neg logit contributions
rob-0.019
iren-0.019
loo-0.017
irens-0.016
 robots-0.016
Token was
Token ID373
Feature activation-8.65e-01

Pos logit contributions
opsy+0.079
ossom+0.066
apa+0.066
cious+0.065
 harvest+0.065
Neg logit contributions
rob-0.046
iren-0.046
loo-0.042
 robots-0.041
er-0.041
Token drinking
Token ID7722
Feature activation-6.24e+00

Pos logit contributions
opsy+0.043
cious+0.039
ossom+0.038
apa+0.036
opp+0.036
Neg logit contributions
?"-0.025
loo-0.024
 robots-0.024
iren-0.024
 yourselves-0.022
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
opsy+0.300
cious+0.257
ossom+0.242
opp+0.236
apa+0.235
Neg logit contributions
iren-0.219
loo-0.207
rob-0.205
er-0.197
?"-0.195
Token They
Token ID1119
Feature activation-1.65e+00

Pos logit contributions
opsy+0.005
cious+0.004
 harvest+0.004
ossom+0.004
 cozy+0.004
Neg logit contributions
iren-0.005
rob-0.005
loo-0.004
 robots-0.004
irens-0.004
Token laughed
Token ID13818
Feature activation-1.22e+00

Pos logit contributions
opsy+0.083
ossom+0.076
ench+0.075
apa+0.074
cious+0.074
Neg logit contributions
?"-0.038
rob-0.037
 officer-0.037
 robots-0.036
 trucks-0.035
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
opsy+0.069
cious+0.059
 gra+0.059
apa+0.058
ossom+0.058
Neg logit contributions
iren-0.030
rob-0.030
 robots-0.029
er-0.027
?"-0.026
Token had
Token ID550
Feature activation+1.03e+00

Pos logit contributions
opsy+0.008
ossom+0.007
apa+0.007
ench+0.007
 gra+0.007
Neg logit contributions
rob-0.006
 robots-0.005
iren-0.005
 officer-0.005
 trucks-0.005
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
iren+0.013
rob+0.012
loo+0.012
er+0.011
 robots+0.011
Neg logit contributions
opsy-0.023
 harvest-0.020
 gra-0.019
 herbal-0.019
ossom-0.019
Token handing
Token ID22786
Feature activation-1.27e+00

Pos logit contributions
iren+0.031
rob+0.029
 robots+0.027
loo+0.026
 officer+0.026
Neg logit contributions
opsy-0.053
ossom-0.045
cious-0.044
 harvest-0.044
ench-0.044
Token him
Token ID683
Feature activation-1.76e+00

Pos logit contributions
opsy+0.058
cious+0.049
ossom+0.047
 harvest+0.047
 cozy+0.047
Neg logit contributions
iren-0.047
rob-0.046
loo-0.043
 robots-0.042
irens-0.041
Token the
Token ID262
Feature activation-3.63e-01

Pos logit contributions
opsy+0.086
cious+0.073
ossom+0.072
 harvest+0.072
 herbal+0.071
Neg logit contributions
iren-0.057
rob-0.056
loo-0.052
 robots-0.052
 officer-0.049
Token magn
Token ID7842
Feature activation-1.97e+00

Pos logit contributions
opsy+0.015
 cozy+0.012
 harvest+0.012
 gra+0.012
cious+0.012
Neg logit contributions
rob-0.015
iren-0.015
loo-0.013
irens-0.013
 robots-0.013
Tokenifying
Token ID4035
Feature activation-6.23e+00

Pos logit contributions
opsy+0.038
ossom+0.025
cious+0.024
apa+0.023
aults+0.022
Neg logit contributions
rob-0.126
iren-0.121
 inconvenience-0.119
irens-0.115
 robots-0.113
Token glass
Token ID5405
Feature activation-1.23e+00

Pos logit contributions
opsy+0.132
 cozy+0.112
 drink+0.109
 nib+0.105
 harvest+0.104
Neg logit contributions
rob-0.369
iren-0.353
 robots-0.343
 officer-0.342
 inconvenience-0.336
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
opsy+0.042
 drink+0.039
 nib+0.037
 pet+0.035
 vegetable+0.035
Neg logit contributions
rob-0.060
iren-0.053
irens-0.051
 inconvenience-0.051
 robots-0.049
Token She
Token ID1375
Feature activation-1.64e-01

Pos logit contributions
iren+0.009
rob+0.009
loo+0.009
 robots+0.008
irens+0.008
Neg logit contributions
opsy-0.010
cious-0.008
 harvest-0.008
ossom-0.008
 cozy-0.008
Token looked
Token ID3114
Feature activation-2.92e-01

Pos logit contributions
opsy+0.007
cious+0.005
 nib+0.005
 cozy+0.005
 relaxed+0.005
Neg logit contributions
iren-0.007
rob-0.007
loo-0.007
 inconvenience-0.006
irens-0.006
Token around
Token ID1088
Feature activation-5.48e-01

Pos logit contributions
opsy+0.013
cious+0.010
 cozy+0.010
 harvest+0.010
ossom+0.010
Neg logit contributions
iren-0.012
rob-0.011
loo-0.011
 robots-0.010
irens-0.010
Token,
Token ID11
Feature activation+7.72e-01

Pos logit contributions
opsy+0.007
cious+0.006
 harvest+0.006
ossom+0.006
 cozy+0.006
Neg logit contributions
iren-0.006
rob-0.006
loo-0.005
 robots-0.005
irens-0.005
Token and
Token ID290
Feature activation+8.17e-01

Pos logit contributions
rob+0.032
iren+0.032
loo+0.030
irens+0.028
 inconvenience+0.028
Neg logit contributions
opsy-0.031
cious-0.026
 nib-0.026
 relaxed-0.026
 harvest-0.026
Token Mia
Token ID32189
Feature activation-8.80e-01

Pos logit contributions
iren+0.031
rob+0.030
loo+0.028
 robots+0.027
irens+0.027
Neg logit contributions
opsy-0.036
cious-0.030
 harvest-0.030
 cozy-0.030
 nib-0.030
Token enjoyed
Token ID8359
Feature activation-1.54e+00

Pos logit contributions
opsy+0.035
cious+0.029
 harvest+0.028
 cozy+0.028
ossom+0.028
Neg logit contributions
iren-0.037
rob-0.036
loo-0.034
 robots-0.033
irens-0.032
Token ro
Token ID686
Feature activation+1.41e+00

Pos logit contributions
opsy+0.071
ossom+0.059
cious+0.059
ench+0.058
apa+0.057
Neg logit contributions
iren-0.056
rob-0.056
loo-0.051
 robots-0.050
irens-0.048
Tokenasting
Token ID9222
Feature activation-6.22e+00

Pos logit contributions
iren+0.063
rob+0.063
loo+0.059
 robots+0.058
 inconvenience+0.057
Neg logit contributions
opsy-0.052
cious-0.042
 harvest-0.040
ossom-0.040
ench-0.040
Token marsh
Token ID22397
Feature activation-3.42e+00

Pos logit contributions
opsy+0.245
ossom+0.189
cious+0.186
ench+0.179
opp+0.177
Neg logit contributions
iren-0.289
rob-0.278
loo-0.258
irens-0.257
 robots-0.256
Tokenm
Token ID76
Feature activation-3.46e+00

Pos logit contributions
opsy+0.162
 cozy+0.139
cious+0.138
ossom+0.136
 herbal+0.136
Neg logit contributions
iren-0.121
rob-0.119
loo-0.111
irens-0.103
 robots-0.101
Tokenallows
Token ID47205
Feature activation-4.51e+00

Pos logit contributions
opsy+0.127
 ju+0.100
 Ginny+0.099
ossom+0.098
cious+0.097
Neg logit contributions
iren-0.157
er-0.144
ers-0.144
loo-0.143
irens-0.143
Token and
Token ID290
Feature activation-7.65e-01

Pos logit contributions
opsy+0.212
cious+0.179
 harvest+0.176
 cozy+0.175
ossom+0.174
Neg logit contributions
iren-0.158
rob-0.154
loo-0.142
 robots-0.137
irens-0.134
Token telling
Token ID5149
Feature activation-8.23e-01

Pos logit contributions
opsy+0.034
ossom+0.027
cious+0.027
ench+0.027
apa+0.027
Neg logit contributions
iren-0.029
rob-0.029
 robots-0.027
loo-0.027
 inconvenience-0.025
Token!
Token ID0
Feature activation-1.42e-01

Pos logit contributions
opsy+0.050
cious+0.043
ossom+0.042
 gra+0.042
 harvest+0.041
Neg logit contributions
iren-0.029
rob-0.028
loo-0.026
 robots-0.025
er-0.025
Token It
Token ID632
Feature activation-4.42e-01

Pos logit contributions
opsy+0.006
 cozy+0.005
cious+0.005
 herbal+0.005
ossom+0.005
Neg logit contributions
rob-0.006
iren-0.006
loo-0.005
 inconvenience-0.005
irens-0.005
Token had
Token ID550
Feature activation+4.47e-01

Pos logit contributions
opsy+0.020
cious+0.017
 harvest+0.017
ossom+0.017
ench+0.017
Neg logit contributions
iren-0.016
rob-0.015
loo-0.014
 robots-0.014
 officer-0.013
Token touched
Token ID12615
Feature activation-1.41e+00

Pos logit contributions
rob+0.020
iren+0.019
loo+0.018
 poster+0.017
irens+0.017
Neg logit contributions
opsy-0.017
 cozy-0.015
cious-0.014
 harvest-0.014
 nib-0.014
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
opsy+0.062
cious+0.052
 cozy+0.051
 herbal+0.050
 harvest+0.050
Neg logit contributions
iren-0.054
rob-0.053
loo-0.049
 robots-0.047
 inconvenience-0.047
Token warm
Token ID5814
Feature activation-6.20e+00

Pos logit contributions
opsy+0.075
cious+0.063
 harvest+0.062
ossom+0.062
 gra+0.061
Neg logit contributions
iren-0.049
rob-0.048
loo-0.045
 robots-0.043
 inconvenience-0.042
Token water
Token ID1660
Feature activation-2.82e+00

Pos logit contributions
opsy+0.260
cious+0.213
 cozy+0.212
 harvest+0.211
 herbal+0.209
Neg logit contributions
iren-0.249
rob-0.247
loo-0.226
 robots-0.219
irens-0.218
Token and
Token ID290
Feature activation-7.07e-01

Pos logit contributions
opsy+0.144
cious+0.122
ossom+0.120
 harvest+0.116
 herbal+0.116
Neg logit contributions
iren-0.089
rob-0.080
loo-0.080
er-0.076
 robots-0.075
Token was
Token ID373
Feature activation+6.29e-01

Pos logit contributions
opsy+0.031
cious+0.026
 harvest+0.025
ossom+0.025
 gra+0.025
Neg logit contributions
iren-0.027
rob-0.026
loo-0.025
 robots-0.024
irens-0.023
Token sticking
Token ID17274
Feature activation-2.89e+00

Pos logit contributions
iren+0.024
 robots+0.022
loo+0.022
 inconvenience+0.021
rob+0.021
Neg logit contributions
opsy-0.027
cious-0.022
ossom-0.022
 harvest-0.021
opp-0.021
Token to
Token ID284
Feature activation-1.56e+00

Pos logit contributions
opsy+0.114
cious+0.091
 harvest+0.089
ossom+0.088
 herbal+0.087
Neg logit contributions
iren-0.125
rob-0.121
loo-0.115
 robots-0.112
irens-0.110
Token out
Token ID503
Feature activation-1.20e+00

Pos logit contributions
opsy+0.085
cious+0.070
 cozy+0.068
 harvest+0.068
 nib+0.068
Neg logit contributions
iren-0.088
rob-0.087
loo-0.081
 robots-0.077
 inconvenience-0.077
Token from
Token ID422
Feature activation+1.49e-01

Pos logit contributions
opsy+0.059
 cozy+0.052
 nib+0.052
cious+0.051
 relaxed+0.051
Neg logit contributions
rob-0.040
iren-0.037
loo-0.034
 inconvenience-0.033
 poster-0.031
Token the
Token ID262
Feature activation+3.30e-01

Pos logit contributions
iren+0.007
rob+0.007
loo+0.006
irens+0.006
 inconvenience+0.006
Neg logit contributions
opsy-0.005
 nib-0.005
 cozy-0.005
 harvest-0.005
 gra-0.004
Token shr
Token ID10157
Feature activation+1.08e+00

Pos logit contributions
iren+0.014
rob+0.014
loo+0.013
 robots+0.012
irens+0.012
Neg logit contributions
opsy-0.013
cious-0.011
 harvest-0.011
 cozy-0.011
ossom-0.010
Tokenub
Token ID549
Feature activation-5.10e-01

Pos logit contributions
iren+0.046
rob+0.043
loo+0.042
irens+0.041
 robots+0.040
Neg logit contributions
opsy-0.042
cious-0.035
 gra-0.034
 harvest-0.034
 cozy-0.034
Tokenbery
Token ID13001
Feature activation-6.20e+00

Pos logit contributions
opsy+0.021
cious+0.018
ossom+0.017
apa+0.017
 harvest+0.017
Neg logit contributions
iren-0.021
rob-0.021
loo-0.020
 robots-0.019
irens-0.019
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
opsy+0.269
cious+0.222
 harvest+0.218
 cozy+0.217
ossom+0.216
Neg logit contributions
iren-0.240
rob-0.235
loo-0.219
 robots-0.211
irens-0.208
Token
Token ID198
Feature activation-5.55e-01

Pos logit contributions
opsy+0.001
cious+0.001
 cozy+0.001
 harvest+0.001
ossom+0.001
Neg logit contributions
iren-0.001
rob-0.001
loo-0.001
 robots-0.001
 inconvenience-0.001
Token
Token ID198
Feature activation-6.82e-01

Pos logit contributions
opsy+0.023
cious+0.018
 cozy+0.018
 harvest+0.018
ossom+0.018
Neg logit contributions
iren-0.023
rob-0.023
loo-0.021
 robots-0.021
 inconvenience-0.020
TokenJack
Token ID14295
Feature activation+6.98e-01

Pos logit contributions
opsy+0.030
cious+0.024
 harvest+0.024
 cozy+0.024
ossom+0.024
Neg logit contributions
iren-0.027
rob-0.026
loo-0.024
 robots-0.023
irens-0.023
Token followed
Token ID3940
Feature activation-4.06e-01

Pos logit contributions
iren+0.026
rob+0.024
 robots+0.023
er+0.023
loo+0.023
Neg logit contributions
opsy-0.031
 harvest-0.026
ossom-0.025
 herbal-0.025
 cozy-0.025
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
opsy+0.007
 harvest+0.006
 nib+0.006
 cozy+0.006
 gra+0.006
Neg logit contributions
iren-0.008
rob-0.007
loo-0.007
 robots-0.007
irens-0.007
Token pond
Token ID16723
Feature activation-1.64e+00

Pos logit contributions
opsy+0.005
cious+0.004
ossom+0.004
apa+0.004
 harvest+0.004
Neg logit contributions
iren-0.005
rob-0.005
loo-0.005
 robots-0.005
 inconvenience-0.005
Token.
Token ID13
Feature activation-3.80e-01

Pos logit contributions
opsy+0.075
cious+0.063
 harvest+0.062
 cozy+0.062
ossom+0.061
Neg logit contributions
iren-0.060
rob-0.059
loo-0.054
 robots-0.052
irens-0.051
Token It
Token ID632
Feature activation-6.18e-01

Pos logit contributions
opsy+0.017
cious+0.014
 gra+0.014
 harvest+0.014
 nib+0.014
Neg logit contributions
iren-0.014
rob-0.013
 robots-0.013
loo-0.013
irens-0.013
Token was
Token ID373
Feature activation-7.62e-01

Pos logit contributions
opsy+0.033
ossom+0.029
 harvest+0.028
opp+0.028
ench+0.028
Neg logit contributions
iren-0.017
rob-0.016
 robots-0.015
loo-0.015
er-0.015
Token drinking
Token ID7722
Feature activation-6.19e+00

Pos logit contributions
opsy+0.039
ossom+0.035
cious+0.034
ench+0.034
opp+0.033
Neg logit contributions
iren-0.022
 robots-0.020
loo-0.020
rob-0.019
irens-0.019
Token so
Token ID523
Feature activation-2.61e+00

Pos logit contributions
opsy+0.280
cious+0.232
ossom+0.228
 harvest+0.221
opp+0.221
Neg logit contributions
iren-0.234
rob-0.223
loo-0.216
 robots-0.204
irens-0.200
Token much
Token ID881
Feature activation-1.88e+00

Pos logit contributions
opsy+0.122
 gra+0.104
cious+0.104
ossom+0.100
ench+0.100
Neg logit contributions
iren-0.084
loo-0.083
 robots-0.080
lights-0.080
rob-0.077
Token that
Token ID326
Feature activation-6.87e-01

Pos logit contributions
opsy+0.088
cious+0.074
ossom+0.072
 gra+0.071
opp+0.070
Neg logit contributions
iren-0.070
rob-0.066
loo-0.063
 robots-0.061
irens-0.059
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
opsy+0.030
cious+0.025
 cozy+0.025
 harvest+0.025
 herbal+0.025
Neg logit contributions
iren-0.026
rob-0.026
loo-0.024
 robots-0.023
irens-0.023
Token pond
Token ID16723
Feature activation-2.01e+00

Pos logit contributions
opsy+0.022
cious+0.017
ossom+0.017
apa+0.017
opp+0.017
Neg logit contributions
iren-0.025
rob-0.024
loo-0.024
 inconvenience-0.023
 robots-0.023
Token his
Token ID465
Feature activation-3.23e-01

Pos logit contributions
opsy+0.027
 nib+0.023
 cozy+0.023
 harvest+0.023
cious+0.023
Neg logit contributions
iren-0.024
rob-0.023
loo-0.022
irens-0.021
 inconvenience-0.020
Token trumpet
Token ID40334
Feature activation+1.51e+00

Pos logit contributions
opsy+0.016
 cozy+0.013
cious+0.013
 harvest+0.013
 nib+0.013
Neg logit contributions
iren-0.011
rob-0.010
loo-0.010
irens-0.010
 inconvenience-0.009
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
iren+0.060
rob+0.059
loo+0.055
 inconvenience+0.053
irens+0.052
Neg logit contributions
opsy-0.065
cious-0.053
 harvest-0.053
 cozy-0.052
 nib-0.052
Token fed
Token ID11672
Feature activation-2.70e+00

Pos logit contributions
iren+0.001
rob+0.001
loo+0.001
irens+0.001
 inconvenience+0.001
Neg logit contributions
opsy-0.001
cious-0.001
 nib-0.001
 cozy-0.001
 harvest-0.001
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
opsy+0.126
cious+0.105
 harvest+0.103
ossom+0.103
 cozy+0.102
Neg logit contributions
iren-0.097
rob-0.095
loo-0.088
 robots-0.085
irens-0.083
Token hungry
Token ID14720
Feature activation-6.16e+00

Pos logit contributions
opsy+0.060
cious+0.051
 cozy+0.050
 harvest+0.050
 herbal+0.050
Neg logit contributions
iren-0.043
rob-0.041
loo-0.039
irens-0.036
 robots-0.036
Token animals
Token ID4695
Feature activation-2.45e+00

Pos logit contributions
opsy+0.280
cious+0.235
 harvest+0.231
 cozy+0.230
 nib+0.229
Neg logit contributions
iren-0.223
rob-0.219
loo-0.201
irens-0.192
 inconvenience-0.192
Token for
Token ID329
Feature activation+1.45e+00

Pos logit contributions
opsy+0.133
cious+0.112
 herbal+0.110
ossom+0.110
 mental+0.110
Neg logit contributions
iren-0.065
rob-0.060
loo-0.058
 robots-0.058
er-0.056
Token days
Token ID1528
Feature activation+1.06e+00

Pos logit contributions
iren+0.050
rob+0.050
 robots+0.046
loo+0.045
 inconvenience+0.044
Neg logit contributions
opsy-0.069
ossom-0.059
cious-0.057
ench-0.056
 herbal-0.056
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
iren+0.035
rob+0.033
loo+0.031
 robots+0.030
irens+0.029
Neg logit contributions
opsy-0.053
cious-0.044
ossom-0.043
 herbal-0.043
 harvest-0.043
Token weeks
Token ID2745
Feature activation+1.78e+00

Pos logit contributions
iren+0.004
rob+0.004
loo+0.004
 robots+0.004
irens+0.004
Neg logit contributions
opsy-0.004
cious-0.004
 harvest-0.003
 cozy-0.003
ossom-0.003
TokenI
Token ID40
Feature activation-1.28e+00

Pos logit contributions
iren+0.009
rob+0.009
loo+0.009
 robots+0.008
 inconvenience+0.008
Neg logit contributions
opsy-0.010
cious-0.008
 cozy-0.008
ossom-0.008
 harvest-0.008
Token wish
Token ID4601
Feature activation+2.68e+00

Pos logit contributions
opsy+0.048
cious+0.041
 gra+0.038
 herbal+0.038
 harvest+0.037
Neg logit contributions
iren-0.054
?"-0.053
loo-0.051
rob-0.051
 inconvenience-0.050
Token I
Token ID314
Feature activation-1.89e+00

Pos logit contributions
rob+0.124
iren+0.116
irens+0.100
loo+0.096
 officer+0.095
Neg logit contributions
 cozy-0.108
opsy-0.108
 warm-0.101
 Baby-0.101
 Honey-0.099
Token picked
Token ID6497
Feature activation-1.00e+00

Pos logit contributions
opsy+0.070
cious+0.056
 cozy+0.056
 nib+0.054
ossom+0.054
Neg logit contributions
iren-0.087
rob-0.086
loo-0.079
irens-0.078
 robots-0.078
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
opsy+0.039
 cozy+0.035
 warm+0.034
 Baby+0.033
 Ginny+0.033
Neg logit contributions
iren-0.043
loo-0.040
rob-0.040
 inconvenience-0.038
irens-0.037
Token warm
Token ID5814
Feature activation-6.13e+00

Pos logit contributions
opsy+0.041
 cozy+0.036
 ju+0.034
 herbal+0.032
 cos+0.032
Neg logit contributions
iren-0.061
rob-0.059
loo-0.056
irens-0.055
driver-0.052
Token dress
Token ID6576
Feature activation-9.06e-01

Pos logit contributions
opsy+0.201
 drink+0.184
 pet+0.173
 pick+0.172
 cozy+0.169
Neg logit contributions
rob-0.293
iren-0.286
irens-0.271
 inconvenience-0.264
 robots-0.259
Token instead
Token ID2427
Feature activation+4.67e-01

Pos logit contributions
opsy+0.029
ossom+0.028
 cozy+0.028
 warm+0.027
 drink+0.027
Neg logit contributions
rob-0.044
iren-0.041
irens-0.040
 inconvenience-0.039
 robots-0.037
Token."
Token ID526
Feature activation+1.61e-01

Pos logit contributions
rob+0.020
iren+0.018
irens+0.016
 poster+0.016
 officer+0.015
Neg logit contributions
opsy-0.020
-0.018
 cozy-0.018
 relaxed-0.017
ench-0.017
Token<|endoftext|>
Token ID50256
Feature activation+1.93e+00

Pos logit contributions
iren+0.006
rob+0.006
loo+0.005
 robots+0.005
irens+0.005
Neg logit contributions
opsy-0.007
cious-0.006
 harvest-0.006
 herbal-0.006
ossom-0.006
TokenOnce
Token ID7454
Feature activation-5.26e-01

Pos logit contributions
iren+0.085
rob+0.083
loo+0.078
 inconvenience+0.076
 robots+0.076
Neg logit contributions
opsy-0.073
cious-0.058
 cozy-0.058
 nib-0.057
 gra-0.057
Token gone
Token ID3750
Feature activation-6.53e-01

Pos logit contributions
opsy+0.039
cious+0.032
ossom+0.031
 harvest+0.030
 herbal+0.030
Neg logit contributions
iren-0.041
rob-0.039
loo-0.037
 robots-0.036
irens-0.035
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
opsy+0.030
cious+0.025
 cozy+0.025
 harvest+0.025
ossom+0.024
Neg logit contributions
iren-0.024
rob-0.023
loo-0.022
 robots-0.021
irens-0.020
Token Sammy
Token ID44067
Feature activation-1.59e+00

Pos logit contributions
opsy+0.022
cious+0.019
ossom+0.019
 harvest+0.018
 herbal+0.018
Neg logit contributions
iren-0.018
rob-0.016
 robots-0.016
irens-0.016
loo-0.015
Token missed
Token ID6825
Feature activation-1.24e-01

Pos logit contributions
opsy+0.068
ossom+0.057
 harvest+0.057
 herbal+0.057
cious+0.056
Neg logit contributions
iren-0.059
rob-0.057
 officer-0.054
?"-0.053
 robots-0.053
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
opsy+0.006
cious+0.005
ossom+0.005
ench+0.005
 harvest+0.005
Neg logit contributions
iren-0.004
rob-0.004
 robots-0.004
loo-0.004
irens-0.003
Token warm
Token ID5814
Feature activation-6.08e+00

Pos logit contributions
opsy+0.047
cious+0.040
ossom+0.039
opp+0.038
apa+0.038
Neg logit contributions
iren-0.046
rob-0.044
loo-0.042
 robots-0.042
 inconvenience-0.041
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
opsy+0.256
cious+0.216
ossom+0.209
opp+0.206
apa+0.206
Neg logit contributions
iren-0.243
rob-0.231
loo-0.224
 robots-0.215
er-0.209
Token golden
Token ID10861
Feature activation-1.76e+00

Pos logit contributions
opsy+0.039
cious+0.030
ossom+0.030
opp+0.029
apa+0.029
Neg logit contributions
iren-0.050
rob-0.048
 robots-0.046
loo-0.046
 officer-0.044
Token rays
Token ID24823
Feature activation-2.66e+00

Pos logit contributions
opsy+0.066
cious+0.053
 harvest+0.051
 cozy+0.051
ossom+0.051
Neg logit contributions
iren-0.079
rob-0.078
loo-0.073
 robots-0.071
irens-0.070
Token and
Token ID290
Feature activation-8.23e-01

Pos logit contributions
opsy+0.146
cious+0.125
ossom+0.123
 herbal+0.122
 harvest+0.122
Neg logit contributions
iren-0.075
rob-0.068
loo-0.066
 officer-0.061
 robots-0.060
Token he
Token ID339
Feature activation+6.77e-01

Pos logit contributions
opsy+0.037
cious+0.030
ossom+0.030
 harvest+0.029
ench+0.029
Neg logit contributions
iren-0.031
rob-0.030
loo-0.028
 robots-0.028
irens-0.027
Token
Token ID198
Feature activation-9.77e-01

Pos logit contributions
opsy+0.032
 cozy+0.025
cious+0.025
ossom+0.025
 harvest+0.024
Neg logit contributions
iren-0.036
rob-0.036
loo-0.034
 robots-0.033
 inconvenience-0.033
TokenWhile
Token ID3633
Feature activation-1.12e-01

Pos logit contributions
opsy+0.048
 harvest+0.040
cious+0.039
apa+0.039
 gra+0.039
Neg logit contributions
iren-0.033
rob-0.030
loo-0.028
 robots-0.028
irens-0.028
Token they
Token ID484
Feature activation-1.18e+00

Pos logit contributions
opsy+0.005
 cozy+0.004
cious+0.004
 harvest+0.004
 herbal+0.004
Neg logit contributions
iren-0.004
rob-0.004
loo-0.004
 inconvenience-0.003
 officer-0.003
Token were
Token ID547
Feature activation-5.43e-01

Pos logit contributions
opsy+0.055
ossom+0.049
cious+0.047
 Ginny+0.046
 herbal+0.046
Neg logit contributions
loo-0.042
rob-0.039
iren-0.038
 robots-0.034
?"-0.034
Token ro
Token ID686
Feature activation+1.35e+00

Pos logit contributions
opsy+0.024
ossom+0.020
cious+0.019
 herbal+0.019
 mental+0.019
Neg logit contributions
iren-0.022
rob-0.020
loo-0.020
 robots-0.019
irens-0.019
Tokenasting
Token ID9222
Feature activation-6.05e+00

Pos logit contributions
iren+0.057
rob+0.054
loo+0.051
irens+0.049
 robots+0.048
Neg logit contributions
opsy-0.057
cious-0.047
 nib-0.045
 harvest-0.045
 herbal-0.045
Token marsh
Token ID22397
Feature activation-3.16e+00

Pos logit contributions
opsy+0.264
ossom+0.211
cious+0.209
 Ginny+0.191
opp+0.191
Neg logit contributions
iren-0.266
rob-0.246
loo-0.244
irens-0.235
 robots-0.234
Tokenm
Token ID76
Feature activation-3.60e+00

Pos logit contributions
opsy+0.145
cious+0.124
 cozy+0.123
ossom+0.122
 herbal+0.121
Neg logit contributions
iren-0.117
rob-0.114
loo-0.108
irens-0.100
 robots-0.099
Tokenallows
Token ID47205
Feature activation-4.41e+00

Pos logit contributions
opsy+0.140
 Ginny+0.121
 ju+0.119
 mental+0.119
cious+0.116
Neg logit contributions
iren-0.155
loo-0.148
irens-0.139
lights-0.139
?"-0.138
Token,
Token ID11
Feature activation-4.01e-01

Pos logit contributions
opsy+0.220
cious+0.186
ossom+0.183
 herbal+0.180
 cozy+0.177
Neg logit contributions
iren-0.153
rob-0.142
loo-0.138
 robots-0.129
irens-0.127
Token they
Token ID484
Feature activation-4.36e-01

Pos logit contributions
opsy+0.021
cious+0.018
ossom+0.018
 harvest+0.018
 herbal+0.017
Neg logit contributions
iren-0.012
rob-0.012
loo-0.011
 robots-0.010
irens-0.010
Token drank
Token ID24070
Feature activation-4.41e+00

Pos logit contributions
opsy+0.015
ench+0.014
opp+0.014
aults+0.013
apa+0.013
Neg logit contributions
rob-0.010
?"-0.010
iren-0.009
er-0.009
 my-0.009
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
opsy+0.263
cious+0.261
ossom+0.244
aults+0.235
 fingertips+0.230
Neg logit contributions
er-0.096
iren-0.091
rob-0.083
 hits-0.074
?"-0.073
Token milk
Token ID7545
Feature activation-3.96e+00

Pos logit contributions
opsy+0.045
cious+0.043
aults+0.042
opp+0.040
ossom+0.040
Neg logit contributions
iren-0.036
?"-0.035
loo-0.034
er-0.034
rob-0.032
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
opsy+0.229
cious+0.206
apa+0.202
alph+0.202
 mental+0.199
Neg logit contributions
er-0.087
loo-0.076
iren-0.074
rob-0.073
man-0.070
Token After
Token ID2293
Feature activation-2.29e-01

Pos logit contributions
opsy+0.017
aults+0.016
 harvest+0.016
cious+0.016
ossom+0.015
Neg logit contributions
iren-0.011
rob-0.010
 robots-0.010
loo-0.010
er-0.009
Token drinking
Token ID7722
Feature activation-6.04e+00

Pos logit contributions
opsy+0.012
ossom+0.011
cious+0.011
ench+0.011
 gra+0.011
Neg logit contributions
iren-0.006
rob-0.006
loo-0.006
 robots-0.006
irens-0.005
Token the
Token ID262
Feature activation-1.38e+00

Pos logit contributions
opsy+0.333
cious+0.319
ossom+0.294
aults+0.288
opp+0.280
Neg logit contributions
iren-0.167
loo-0.150
er-0.143
?"-0.138
rob-0.137
Token milk
Token ID7545
Feature activation-3.99e+00

Pos logit contributions
opsy+0.060
cious+0.059
aults+0.055
apa+0.054
opp+0.053
Neg logit contributions
?"-0.046
iren-0.045
er-0.045
loo-0.045
rob-0.042
Token,
Token ID11
Feature activation-7.93e-01

Pos logit contributions
opsy+0.218
cious+0.201
ossom+0.193
 gra+0.189
 herbal+0.187
Neg logit contributions
iren-0.101
loo-0.098
er-0.094
rob-0.093
?"-0.081
Token Mitt
Token ID16627
Feature activation-4.41e+00

Pos logit contributions
cious+0.032
ossom+0.032
opsy+0.031
ench+0.029
apa+0.028
Neg logit contributions
iren-0.028
rob-0.027
 robots-0.027
irens-0.025
 hits-0.025
Tokenens
Token ID641
Feature activation-2.19e+00

Pos logit contributions
opsy+0.203
ossom+0.180
cious+0.180
 harvest+0.178
 gra+0.174
Neg logit contributions
rob-0.156
iren-0.154
loo-0.135
er-0.131
 robots-0.130
Token there
Token ID612
Feature activation+7.10e-01

Pos logit contributions
opsy+0.037
 cozy+0.036
 relaxed+0.034
 cos+0.033
 harvest+0.033
Neg logit contributions
rob-0.055
iren-0.053
loo-0.047
irens-0.046
 inconvenience-0.046
Token was
Token ID373
Feature activation-6.71e-01

Pos logit contributions
iren+0.024
rob+0.023
loo+0.023
 robots+0.023
 officer+0.022
Neg logit contributions
opsy-0.034
ossom-0.028
cious-0.028
 herbal-0.027
 harvest-0.027
Token no
Token ID645
Feature activation+5.33e-01

Pos logit contributions
opsy+0.026
 cozy+0.023
 harvest+0.022
 nib+0.022
 cos+0.022
Neg logit contributions
rob-0.029
iren-0.029
loo-0.026
irens-0.025
 poster-0.025
Token more
Token ID517
Feature activation+1.35e+00

Pos logit contributions
rob+0.023
iren+0.022
irens+0.020
loo+0.020
driver+0.020
Neg logit contributions
opsy-0.019
 cozy-0.019
 pet-0.018
 wonder-0.018
 cos-0.018
Token noisy
Token ID31210
Feature activation+2.32e-01

Pos logit contributions
rob+0.060
iren+0.055
loo+0.053
irens+0.049
 poster+0.049
Neg logit contributions
opsy-0.048
 cozy-0.046
 relaxed-0.045
 cos-0.043
 harvest-0.042
Token aer
Token ID9551
Feature activation+4.28e+00

Pos logit contributions
rob+0.009
iren+0.009
loo+0.008
ider+0.008
driver+0.008
Neg logit contributions
 relaxed-0.008
 cozy-0.008
 sweat-0.008
 harvest-0.008
 skin-0.008
Tokenopl
Token ID20106
Feature activation+2.71e+00

Pos logit contributions
 roles+0.127
 yourselves+0.123
 poster+0.122
loo+0.122
 officer+0.121
Neg logit contributions
opsy-0.183
cious-0.164
umble-0.164
 relaxed-0.164
ossom-0.163
Tokenane
Token ID1531
Feature activation+3.45e+00

Pos logit contributions
rob+0.075
 inconvenience+0.068
 officer+0.063
 trucks+0.060
 robots+0.060
Neg logit contributions
opsy-0.130
opp-0.118
cious-0.112
ossom-0.111
umble-0.109
Token.
Token ID13
Feature activation-4.06e-01

Pos logit contributions
rob+0.147
iren+0.142
loo+0.132
irens+0.125
 yourselves+0.124
Neg logit contributions
opsy-0.124
 relaxed-0.113
 harvest-0.112
 cozy-0.112
cious-0.111
Token They
Token ID1119
Feature activation-1.19e+00

Pos logit contributions
opsy+0.010
 Baby+0.009
 cozy+0.009
apa+0.008
cious+0.008
Neg logit contributions
rob-0.023
iren-0.022
loo-0.021
irens-0.020
 poster-0.020
Token all
Token ID477
Feature activation+2.97e+00

Pos logit contributions
opsy+0.042
 nib+0.033
 relaxed+0.033
 harvest+0.033
 cozy+0.033
Neg logit contributions
iren-0.057
rob-0.056
loo-0.052
irens-0.051
 inconvenience-0.050
Token was
Token ID373
Feature activation+6.01e-01

Pos logit contributions
opsy+0.004
 sniff+0.003
 cozy+0.003
 relaxed+0.003
 nib+0.003
Neg logit contributions
rob-0.006
 inconvenience-0.006
iren-0.006
loo-0.006
irens-0.005
Token not
Token ID407
Feature activation+1.04e+00

Pos logit contributions
rob+0.026
iren+0.026
loo+0.024
 inconvenience+0.023
 robots+0.023
Neg logit contributions
opsy-0.024
 cozy-0.019
cious-0.019
 nib-0.018
 harvest-0.018
Token happy
Token ID3772
Feature activation-1.17e-01

Pos logit contributions
rob+0.062
iren+0.062
loo+0.059
irens+0.057
 inconvenience+0.057
Neg logit contributions
opsy-0.023
 cozy-0.017
cious-0.016
 relaxed-0.015
 nib-0.015
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
opsy+0.004
 drink+0.004
 nib+0.004
 harvest+0.004
 cozy+0.004
Neg logit contributions
rob-0.005
iren-0.005
loo-0.005
irens-0.004
mith-0.004
Token She
Token ID1375
Feature activation+4.78e-02

Pos logit contributions
iren+0.009
rob+0.008
loo+0.008
irens+0.008
 yourselves+0.008
Neg logit contributions
opsy-0.011
 nib-0.009
 harvest-0.009
cious-0.009
ossom-0.009
Token sc
Token ID629
Feature activation+4.63e+00

Pos logit contributions
rob+0.002
iren+0.002
loo+0.002
 inconvenience+0.002
irens+0.002
Neg logit contributions
opsy-0.002
 nib-0.001
 cozy-0.001
cious-0.001
 relaxed-0.001
Tokenolded
Token ID48959
Feature activation-6.42e-01

Pos logit contributions
rob+0.206
 inconvenience+0.174
 poster+0.174
loo+0.174
iren+0.172
Neg logit contributions
opsy-0.179
apa-0.164
ossom-0.154
opp-0.151
cious-0.150
Token Sara
Token ID24799
Feature activation-2.91e+00

Pos logit contributions
opsy+0.035
cious+0.031
ossom+0.031
 cozy+0.030
 harvest+0.030
Neg logit contributions
iren-0.018
rob-0.017
loo-0.015
irens-0.015
 robots-0.015
Token and
Token ID290
Feature activation+6.52e-01

Pos logit contributions
opsy+0.133
cious+0.111
 cozy+0.109
 harvest+0.108
ossom+0.108
Neg logit contributions
iren-0.107
rob-0.105
loo-0.098
 robots-0.093
 inconvenience-0.092
Token Ben
Token ID3932
Feature activation-1.08e+00

Pos logit contributions
iren+0.024
rob+0.023
loo+0.021
 robots+0.021
irens+0.021
Neg logit contributions
opsy-0.030
ossom-0.025
cious-0.025
 harvest-0.024
ench-0.024
Token for
Token ID329
Feature activation+2.72e+00

Pos logit contributions
opsy+0.051
cious+0.043
 harvest+0.042
 cozy+0.042
ossom+0.042
Neg logit contributions
iren-0.037
rob-0.036
loo-0.034
 robots-0.032
irens-0.032
Token day
Token ID1110
Feature activation+1.42e+00

Pos logit contributions
iren+0.078
rob+0.076
loo+0.071
 robots+0.068
irens+0.067
Neg logit contributions
opsy-0.092
cious-0.077
 harvest-0.075
 cozy-0.074
ossom-0.074
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
iren+0.042
rob+0.039
loo+0.038
?"+0.037
 robots+0.036
Neg logit contributions
opsy-0.074
 gra-0.064
cious-0.064
ossom-0.063
 herbal-0.063
Token he
Token ID339
Feature activation+1.08e+00

Pos logit contributions
iren+0.003
rob+0.003
loo+0.003
 robots+0.003
?"+0.003
Neg logit contributions
opsy-0.005
cious-0.004
ossom-0.004
 gra-0.004
 herbal-0.004
Token saw
Token ID2497
Feature activation+2.03e-01

Pos logit contributions
iren+0.056
rob+0.056
loo+0.052
irens+0.050
 inconvenience+0.050
Neg logit contributions
opsy-0.034
 relaxed-0.027
 nib-0.027
 cozy-0.026
cious-0.026
Token a
Token ID257
Feature activation-2.15e-01

Pos logit contributions
iren+0.009
rob+0.009
 inconvenience+0.009
loo+0.009
 poster+0.008
Neg logit contributions
opsy-0.007
 cozy-0.006
 mashed-0.006
 nib-0.006
 Baby-0.006
Token poster
Token ID11968
Feature activation+4.15e+00

Pos logit contributions
opsy+0.009
cious+0.007
 harvest+0.007
ossom+0.007
 cozy+0.007
Neg logit contributions
iren-0.009
rob-0.009
loo-0.008
 robots-0.008
irens-0.008
Token on
Token ID319
Feature activation-4.29e-02

Pos logit contributions
rob+0.202
iren+0.183
loo+0.182
 yourselves+0.180
 inconvenience+0.171
Neg logit contributions
-0.122
 pick-0.116
 pet-0.114
 drink-0.112
 nib-0.111
Token the
Token ID262
Feature activation+6.45e-01

Pos logit contributions
opsy+0.001
 cozy+0.001
 nib+0.001
 harvest+0.001
 Baby+0.001
Neg logit contributions
rob-0.002
iren-0.002
loo-0.002
 inconvenience-0.002
ider-0.002
Token wall
Token ID3355
Feature activation+1.32e+00

Pos logit contributions
iren+0.027
rob+0.026
loo+0.025
irens+0.024
driver+0.023
Neg logit contributions
opsy-0.025
 cozy-0.022
cious-0.022
 harvest-0.021
 gra-0.021
Token of
Token ID286
Feature activation-7.72e-01

Pos logit contributions
rob+0.075
iren+0.063
 inconvenience+0.063
 yourselves+0.062
 robots+0.060
Neg logit contributions
-0.030
 drink-0.028
 relaxed-0.027
 pick-0.027
 nib-0.027
Token his
Token ID465
Feature activation-3.85e-02

Pos logit contributions
opsy+0.027
 cozy+0.025
 nib+0.024
 harvest+0.023
 gra+0.023
Neg logit contributions
iren-0.035
rob-0.034
loo-0.032
irens-0.030
 inconvenience-0.030
Token frogs
Token ID37475
Feature activation+8.23e-01

Pos logit contributions
iren+0.000
loo+0.000
rob+0.000
 yourselves+0.000
irens+0.000
Neg logit contributions
opsy-0.001
ossom-0.001
ench-0.001
 gra-0.001
 harvest-0.001
Token and
Token ID290
Feature activation+1.42e+00

Pos logit contributions
iren+0.036
rob+0.035
loo+0.033
irens+0.033
 robots+0.032
Neg logit contributions
opsy-0.032
 nib-0.028
 relaxed-0.026
 drink-0.026
cious-0.026
Token ducks
Token ID39694
Feature activation+1.80e-01

Pos logit contributions
iren+0.067
rob+0.065
loo+0.062
irens+0.059
 inconvenience+0.058
Neg logit contributions
opsy-0.051
 nib-0.042
 cozy-0.041
cious-0.041
 gra-0.041
Token.
Token ID13
Feature activation+7.63e-01

Pos logit contributions
iren+0.007
rob+0.007
loo+0.007
irens+0.007
 robots+0.007
Neg logit contributions
opsy-0.008
 nib-0.006
 gra-0.006
cious-0.006
 harvest-0.006
Token He
Token ID679
Feature activation+1.11e+00

Pos logit contributions
iren+0.033
rob+0.033
loo+0.031
 robots+0.029
 inconvenience+0.029
Neg logit contributions
opsy-0.030
cious-0.024
 cozy-0.023
 gra-0.023
 nib-0.023
Token hon
Token ID3032
Feature activation+4.56e+00

Pos logit contributions
iren+0.047
rob+0.045
loo+0.043
irens+0.042
 robots+0.041
Neg logit contributions
opsy-0.045
 nib-0.039
 relaxed-0.038
 gra-0.037
cious-0.037
Tokenked
Token ID9091
Feature activation-3.85e-01

Pos logit contributions
rob+0.191
loo+0.190
iren+0.183
 inconvenience+0.173
 robots+0.165
Neg logit contributions
opsy-0.175
cious-0.159
 nib-0.154
 relaxed-0.148
ossom-0.147
Token his
Token ID465
Feature activation+4.67e-01

Pos logit contributions
opsy+0.015
 nib+0.013
cious+0.013
 cozy+0.012
 gra+0.012
Neg logit contributions
rob-0.017
iren-0.016
loo-0.016
 inconvenience-0.015
irens-0.014
Token horn
Token ID12718
Feature activation+2.27e+00

Pos logit contributions
iren+0.022
rob+0.022
loo+0.021
irens+0.020
 inconvenience+0.020
Neg logit contributions
opsy-0.017
cious-0.013
 cozy-0.013
 harvest-0.013
 nib-0.013
Token and
Token ID290
Feature activation+2.64e-04

Pos logit contributions
rob+0.105
iren+0.100
 inconvenience+0.098
loo+0.098
irens+0.092
Neg logit contributions
opsy-0.078
 nib-0.070
cious-0.067
 relaxed-0.065
ossom-0.065
Token waved
Token ID26834
Feature activation+3.03e-01

Pos logit contributions
iren+0.000
loo+0.000
rob+0.000
 inconvenience+0.000
irens+0.000
Neg logit contributions
 nib-0.000
opsy-0.000
 pet-0.000
 sniff-0.000
 relaxed-0.000
Token and
Token ID290
Feature activation+2.38e-01

Pos logit contributions
opsy+0.027
cious+0.023
 harvest+0.023
 cozy+0.022
ossom+0.022
Neg logit contributions
iren-0.021
rob-0.020
loo-0.019
 robots-0.018
irens-0.018
Token said
Token ID531
Feature activation+3.36e-03

Pos logit contributions
iren+0.009
rob+0.009
loo+0.008
 robots+0.008
irens+0.008
Neg logit contributions
opsy-0.010
cious-0.009
 harvest-0.008
ossom-0.008
 cozy-0.008
Token,
Token ID11
Feature activation+5.08e-01

Pos logit contributions
iren+0.000
rob+0.000
loo+0.000
 robots+0.000
 inconvenience+0.000
Neg logit contributions
opsy-0.000
cious-0.000
 cozy-0.000
 harvest-0.000
ossom-0.000
Token "
Token ID366
Feature activation+3.89e-01

Pos logit contributions
iren+0.016
rob+0.015
loo+0.014
 robots+0.014
irens+0.014
Neg logit contributions
opsy-0.026
cious-0.022
 harvest-0.021
ossom-0.021
ench-0.021
TokenHa
Token ID23303
Feature activation+4.15e+00

Pos logit contributions
iren+0.016
rob+0.015
loo+0.015
 robots+0.014
 inconvenience+0.014
Neg logit contributions
opsy-0.016
cious-0.014
 harvest-0.013
ossom-0.013
 cozy-0.013
Token ha
Token ID387
Feature activation+4.01e+00

Pos logit contributions
iren+0.160
rob+0.145
loo+0.140
 robots+0.139
lights+0.138
Neg logit contributions
opsy-0.187
cious-0.160
 harvest-0.155
 nib-0.155
 drink-0.149
Token!"
Token ID2474
Feature activation+7.31e-01

Pos logit contributions
iren+0.171
rob+0.168
 robots+0.159
loo+0.158
 officer+0.153
Neg logit contributions
opsy-0.160
cious-0.136
opp-0.131
ossom-0.127
 nib-0.126
Token
Token ID198
Feature activation+4.29e-01

Pos logit contributions
rob+0.039
iren+0.036
loo+0.036
 robots+0.035
 inconvenience+0.034
Neg logit contributions
opsy-0.021
 Baby-0.017
 relaxed-0.017
 cozy-0.017
 drink-0.016
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
rob+0.022
iren+0.021
loo+0.020
 inconvenience+0.020
 robots+0.020
Neg logit contributions
opsy-0.014
 cozy-0.011
apa-0.010
 mental-0.010
 Baby-0.010
TokenBut
Token ID1537
Feature activation+9.65e-01

Pos logit contributions
rob+0.010
iren+0.010
loo+0.009
 inconvenience+0.009
 robots+0.009
Neg logit contributions
opsy-0.008
cious-0.007
 cozy-0.007
 relaxed-0.007
ossom-0.006
Token then
Token ID788
Feature activation+4.52e-01

Pos logit contributions
iren+0.041
rob+0.041
loo+0.037
irens+0.036
 inconvenience+0.035
Neg logit contributions
opsy-0.039
 Baby-0.032
 cozy-0.032
 nib-0.032
 harvest-0.031
Token dinner
Token ID8073
Feature activation-1.44e-01

Pos logit contributions
iren+0.098
rob+0.096
loo+0.093
irens+0.085
mith+0.084
Neg logit contributions
opsy-0.077
 cozy-0.065
 nib-0.064
 harvest-0.064
cious-0.063
Token.
Token ID13
Feature activation+6.47e-01

Pos logit contributions
opsy+0.005
 harvest+0.004
cious+0.004
 cozy+0.004
 relaxed+0.004
Neg logit contributions
iren-0.007
rob-0.006
loo-0.006
irens-0.006
 robots-0.006
Token Sam
Token ID3409
Feature activation+1.67e-01

Pos logit contributions
rob+0.030
iren+0.030
loo+0.027
 inconvenience+0.026
irens+0.026
Neg logit contributions
opsy-0.023
 cozy-0.019
cious-0.019
 gra-0.018
 herbal-0.018
Token's
Token ID338
Feature activation+3.12e-01

Pos logit contributions
iren+0.005
rob+0.005
loo+0.004
 robots+0.004
irens+0.004
Neg logit contributions
opsy-0.009
cious-0.008
 harvest-0.007
 cozy-0.007
ossom-0.007
Token mom
Token ID1995
Feature activation+1.00e+00

Pos logit contributions
iren+0.013
rob+0.013
loo+0.012
irens+0.011
 robots+0.011
Neg logit contributions
opsy-0.013
cious-0.010
 cozy-0.010
 harvest-0.010
 nib-0.010
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
rob+0.043
iren+0.042
loo+0.040
 inconvenience+0.040
irens+0.038
Neg logit contributions
opsy-0.038
 nib-0.031
 cozy-0.031
 relaxed-0.030
 mashed-0.030
Token dad
Token ID9955
Feature activation+1.86e+00

Pos logit contributions
rob+0.070
iren+0.070
loo+0.066
irens+0.062
 inconvenience+0.062
Neg logit contributions
opsy-0.044
 cozy-0.036
 Baby-0.036
 drink-0.035
 pet-0.035
Token say
Token ID910
Feature activation+1.03e-01

Pos logit contributions
iren+0.077
rob+0.074
loo+0.070
 officer+0.068
 poster+0.068
Neg logit contributions
opsy-0.077
cious-0.063
ossom-0.061
 herbal-0.061
 cozy-0.060
Token yes
Token ID3763
Feature activation+6.79e-01

Pos logit contributions
iren+0.004
rob+0.004
loo+0.004
irens+0.004
 robots+0.004
Neg logit contributions
opsy-0.004
cious-0.003
 cozy-0.003
 Baby-0.003
 nib-0.003
Token.
Token ID13
Feature activation+7.75e-01

Pos logit contributions
rob+0.030
iren+0.030
loo+0.028
irens+0.026
 robots+0.026
Neg logit contributions
opsy-0.026
cious-0.022
 cozy-0.021
 harvest-0.021
ench-0.021
Token They
Token ID1119
Feature activation-1.09e+00

Pos logit contributions
rob+0.042
iren+0.040
 inconvenience+0.039
loo+0.039
 poster+0.038
Neg logit contributions
opsy-0.021
 cozy-0.018
 Baby-0.017
 cos-0.016
 relaxed-0.016
Token She
Token ID1375
Feature activation-2.58e-01

Pos logit contributions
opsy+0.000
 cozy+0.000
cious+0.000
 Baby+0.000
 Ginny+0.000
Neg logit contributions
rob-0.000
iren-0.000
loo-0.000
 inconvenience-0.000
 poster-0.000
Token asked
Token ID1965
Feature activation-5.40e-01

Pos logit contributions
opsy+0.010
cious+0.008
 cozy+0.008
 nib+0.008
 harvest+0.008
Neg logit contributions
iren-0.011
rob-0.011
loo-0.011
irens-0.010
 robots-0.010
Token Mom
Token ID11254
Feature activation-1.60e+00

Pos logit contributions
opsy+0.022
 cozy+0.019
 Baby+0.019
 Ginny+0.018
 nib+0.018
Neg logit contributions
rob-0.022
iren-0.021
loo-0.020
 inconvenience-0.019
 poster-0.018
Token what
Token ID644
Feature activation-1.48e+00

Pos logit contributions
opsy+0.073
 cozy+0.062
 harvest+0.061
 nib+0.061
cious+0.061
Neg logit contributions
rob-0.058
iren-0.057
loo-0.052
 inconvenience-0.049
irens-0.049
Token was
Token ID373
Feature activation+1.37e+00

Pos logit contributions
opsy+0.072
cious+0.061
ossom+0.060
 harvest+0.060
ench+0.059
Neg logit contributions
iren-0.051
rob-0.049
loo-0.046
 robots-0.045
irens-0.043
Token wrong
Token ID2642
Feature activation+1.23e+00

Pos logit contributions
iren+0.031
rob+0.029
loo+0.026
irens+0.023
 robots+0.023
Neg logit contributions
opsy-0.083
cious-0.073
 cozy-0.073
 harvest-0.071
 herbal-0.071
Token.
Token ID13
Feature activation+9.07e-02

Pos logit contributions
rob+0.055
iren+0.053
loo+0.048
 inconvenience+0.047
irens+0.046
Neg logit contributions
opsy-0.049
 cozy-0.040
cious-0.040
 harvest-0.039
 herbal-0.039
Token Mom
Token ID11254
Feature activation-6.79e-01

Pos logit contributions
rob+0.004
iren+0.004
loo+0.004
 inconvenience+0.004
 robots+0.004
Neg logit contributions
opsy-0.003
 cozy-0.003
cious-0.003
 herbal-0.002
 Baby-0.002
Token said
Token ID531
Feature activation+4.58e-01

Pos logit contributions
opsy+0.026
 nib+0.022
 mashed+0.022
 sniff+0.021
 relaxed+0.021
Neg logit contributions
rob-0.029
iren-0.029
loo-0.028
 inconvenience-0.027
 yourselves-0.027
Token Max
Token ID5436
Feature activation-8.63e-01

Pos logit contributions
iren+0.016
rob+0.016
loo+0.015
 robots+0.014
irens+0.014
Neg logit contributions
opsy-0.022
cious-0.018
 harvest-0.018
 cozy-0.018
ossom-0.018
Token was
Token ID373
Feature activation+9.93e-01

Pos logit contributions
opsy+0.037
cious+0.030
 harvest+0.030
ossom+0.030
 cozy+0.030
Neg logit contributions
iren-0.034
rob-0.033
loo-0.031
 robots-0.030
irens-0.029
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
iren+0.007
rob+0.007
loo+0.006
 robots+0.006
irens+0.006
Neg logit contributions
opsy-0.008
cious-0.007
 harvest-0.006
ossom-0.006
 cozy-0.006
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
opsy+0.006
cious+0.005
 harvest+0.004
 cozy+0.004
ossom+0.004
Neg logit contributions
iren-0.006
rob-0.006
loo-0.005
 robots-0.005
 inconvenience-0.005
Token"
Token ID1
Feature activation+8.24e-01

Pos logit contributions
opsy+0.015
cious+0.013
ossom+0.013
 gra+0.013
 herbal+0.013
Neg logit contributions
iren-0.009
rob-0.008
loo-0.008
irens-0.008
er-0.007
TokenNo
Token ID2949
Feature activation+1.80e+00

Pos logit contributions
iren+0.030
rob+0.030
loo+0.026
 robots+0.026
irens+0.026
Neg logit contributions
opsy-0.038
ench-0.031
 harvest-0.031
ossom-0.031
 cozy-0.031
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
iren+0.079
rob+0.077
loo+0.072
irens+0.069
 robots+0.068
Neg logit contributions
opsy-0.071
 nib-0.057
cious-0.057
 harvest-0.057
 cozy-0.057
Token don
Token ID836
Feature activation+9.71e-01

Pos logit contributions
iren+0.040
rob+0.039
loo+0.036
 robots+0.034
 inconvenience+0.034
Neg logit contributions
opsy-0.063
cious-0.053
 harvest-0.052
ossom-0.052
 cozy-0.052
Token't
Token ID470
Feature activation+2.76e+00

Pos logit contributions
rob+0.031
iren+0.029
loo+0.028
 inconvenience+0.026
 officer+0.026
Neg logit contributions
opsy-0.046
-0.043
 cozy-0.040
cious-0.040
 harvest-0.040
Token do
Token ID466
Feature activation+6.65e-01

Pos logit contributions
?"+0.083
iren+0.080
 robots+0.076
 poster+0.075
 hits+0.073
Neg logit contributions
opsy-0.138
 herbal-0.118
 gra-0.117
cious-0.116
ossom-0.115
Token that
Token ID326
Feature activation+8.25e-01

Pos logit contributions
rob+0.027
iren+0.026
loo+0.025
 robots+0.022
lights+0.022
Neg logit contributions
opsy-0.027
cious-0.024
 cozy-0.024
 nib-0.023
 harvest-0.023
Token!"
Token ID2474
Feature activation+4.22e-01

Pos logit contributions
rob+0.031
iren+0.031
loo+0.028
irens+0.027
 inconvenience+0.025
Neg logit contributions
opsy-0.035
 cozy-0.033
 nib-0.031
 harvest-0.031
 cos-0.031
Token Jen
Token ID13364
Feature activation-1.05e-01

Pos logit contributions
iren+0.013
rob+0.013
loo+0.011
 robots+0.011
 inconvenience+0.011
Neg logit contributions
opsy-0.022
cious-0.019
 harvest-0.019
 cozy-0.019
 nib-0.018
Token us
Token ID514
Feature activation-1.95e+00

Pos logit contributions
rob+0.019
iren+0.018
loo+0.016
 robots+0.015
irens+0.015
Neg logit contributions
opsy-0.030
cious-0.026
 harvest-0.026
 cozy-0.026
apa-0.026
Token how
Token ID703
Feature activation-4.73e-01

Pos logit contributions
opsy+0.095
cious+0.081
 harvest+0.080
 cozy+0.080
ossom+0.079
Neg logit contributions
iren-0.065
rob-0.065
loo-0.058
 robots-0.055
irens-0.055
Token far
Token ID1290
Feature activation-5.58e-01

Pos logit contributions
opsy+0.023
cious+0.019
 harvest+0.019
 cozy+0.019
ossom+0.019
Neg logit contributions
iren-0.016
rob-0.016
loo-0.015
 robots-0.014
irens-0.014
Token we
Token ID356
Feature activation-9.74e-01

Pos logit contributions
opsy+0.021
 drink+0.020
 harvest+0.019
 Baby+0.018
 cozy+0.018
Neg logit contributions
iren-0.023
rob-0.023
loo-0.022
irens-0.021
 robots-0.020
Token have
Token ID423
Feature activation+1.99e+00

Pos logit contributions
opsy+0.041
cious+0.033
 herbal+0.032
ossom+0.032
 Ginny+0.031
Neg logit contributions
iren-0.039
rob-0.038
 robots-0.036
loo-0.036
 officer-0.036
Token walked
Token ID6807
Feature activation+8.53e-01

Pos logit contributions
iren+0.083
rob+0.083
loo+0.076
irens+0.071
 officer+0.070
Neg logit contributions
opsy-0.077
 harvest-0.071
 cozy-0.070
cious-0.069
 nib-0.068
Token".
Token ID1911
Feature activation-2.50e-01

Pos logit contributions
rob+0.037
iren+0.035
loo+0.034
 inconvenience+0.031
 poster+0.031
Neg logit contributions
opsy-0.031
 cozy-0.029
 harvest-0.028
 gra-0.028
cious-0.027
Token The
Token ID383
Feature activation+7.47e-01

Pos logit contributions
opsy+0.009
cious+0.007
 harvest+0.007
ossom+0.007
 herbal+0.007
Neg logit contributions
iren-0.011
rob-0.011
loo-0.010
 robots-0.010
irens-0.010
Token girl
Token ID2576
Feature activation-6.13e-01

Pos logit contributions
iren+0.019
 poster+0.019
 robots+0.018
 officer+0.018
loo+0.018
Neg logit contributions
opsy-0.040
 gra-0.035
cious-0.035
ossom-0.034
apa-0.034
Token got
Token ID1392
Feature activation-9.90e-01

Pos logit contributions
opsy+0.030
 gra+0.025
 harvest+0.025
 Ginny+0.025
 herbal+0.025
Neg logit contributions
iren-0.019
rob-0.018
er-0.018
 robots-0.017
 officer-0.016
Token excited
Token ID6568
Feature activation-1.34e+00

Pos logit contributions
opsy+0.049
cious+0.042
ossom+0.040
uces+0.039
opp+0.038
Neg logit contributions
iren-0.036
 officer-0.031
irens-0.030
 doors-0.029
?"-0.029
Token mom
Token ID1995
Feature activation+1.82e-01

Pos logit contributions
opsy+0.004
 cozy+0.003
cious+0.003
 harvest+0.003
ossom+0.003
Neg logit contributions
iren-0.004
rob-0.004
loo-0.004
irens-0.004
 inconvenience-0.003
Token and
Token ID290
Feature activation+5.42e-01

Pos logit contributions
iren+0.006
rob+0.005
 robots+0.005
 officer+0.005
irens+0.005
Neg logit contributions
opsy-0.009
cious-0.008
ossom-0.008
 herbal-0.007
opp-0.007
Token dad
Token ID9955
Feature activation+1.47e+00

Pos logit contributions
iren+0.021
rob+0.020
loo+0.019
 robots+0.018
 officer+0.018
Neg logit contributions
opsy-0.024
cious-0.020
ossom-0.020
 harvest-0.019
 herbal-0.019
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
iren+0.040
 officer+0.037
 robots+0.034
er+0.032
 poster+0.031
Neg logit contributions
opsy-0.085
ossom-0.072
cious-0.071
 herbal-0.070
 ju-0.070
Token ran
Token ID4966
Feature activation+4.95e-01

Pos logit contributions
iren+0.062
rob+0.061
loo+0.057
 robots+0.055
irens+0.054
Neg logit contributions
opsy-0.064
cious-0.053
 cozy-0.052
 harvest-0.052
 nib-0.051
Token to
Token ID284
Feature activation+3.96e-01

Pos logit contributions
rob+0.020
iren+0.019
loo+0.018
 inconvenience+0.017
 robots+0.017
Neg logit contributions
opsy-0.020
 cozy-0.018
 nib-0.017
cious-0.017
 harvest-0.017
Token her
Token ID607
Feature activation+6.99e-01

Pos logit contributions
iren+0.014
rob+0.013
 robots+0.012
 officer+0.012
 inconvenience+0.012
Neg logit contributions
opsy-0.019
cious-0.016
ossom-0.016
ench-0.015
 herbal-0.015
Token k
Token ID479
Feature activation+2.71e+00

Pos logit contributions
iren+0.022
rob+0.021
loo+0.019
 robots+0.019
 poster+0.019
Neg logit contributions
opsy-0.036
cious-0.031
ossom-0.030
 harvest-0.030
 herbal-0.030
Tokenite
Token ID578
Feature activation+2.99e+00

Pos logit contributions
rob+0.081
iren+0.079
 inconvenience+0.074
loo+0.072
 robots+0.070
Neg logit contributions
opsy-0.144
ench-0.117
ossom-0.117
opp-0.117
 cozy-0.116
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
iren+0.137
rob+0.132
 inconvenience+0.125
loo+0.124
irens+0.123
Neg logit contributions
opsy-0.113
 nib-0.094
 cozy-0.094
 harvest-0.092
 gra-0.090
Token She
Token ID1375
Feature activation+4.96e-02

Pos logit contributions
rob+0.008
iren+0.008
loo+0.007
 inconvenience+0.007
 robots+0.007
Neg logit contributions
opsy-0.007
 cozy-0.005
cious-0.005
 harvest-0.005
 herbal-0.005
Token day
Token ID1110
Feature activation+1.46e+00

Pos logit contributions
rob+0.087
iren+0.086
irens+0.077
 inconvenience+0.075
loo+0.075
Neg logit contributions
opsy-0.082
 cozy-0.067
 harvest-0.066
 drink-0.065
 herbal-0.065
Token,
Token ID11
Feature activation+9.44e-02

Pos logit contributions
iren+0.045
rob+0.043
loo+0.041
?"+0.040
irens+0.038
Neg logit contributions
opsy-0.073
 gra-0.063
cious-0.062
ossom-0.062
 mental-0.062
Token they
Token ID484
Feature activation-3.11e-01

Pos logit contributions
rob+0.003
iren+0.003
loo+0.002
?"+0.002
 robots+0.002
Neg logit contributions
opsy-0.005
cious-0.004
ossom-0.004
 gra-0.004
 herbal-0.004
Token are
Token ID389
Feature activation+8.09e-02

Pos logit contributions
opsy+0.010
cious+0.008
ossom+0.007
 herbal+0.007
 cozy+0.007
Neg logit contributions
iren-0.016
rob-0.015
loo-0.015
 robots-0.015
 officer-0.014
Token playing
Token ID2712
Feature activation-1.07e+00

Pos logit contributions
iren+0.003
rob+0.003
loo+0.003
 robots+0.003
irens+0.003
Neg logit contributions
opsy-0.003
cious-0.003
 cozy-0.003
 harvest-0.003
ossom-0.003
Token with
Token ID351
Feature activation-6.74e-01

Pos logit contributions
opsy+0.030
 drink+0.029
 nib+0.029
 pick+0.029
 cozy+0.029
Neg logit contributions
iren-0.052
rob-0.051
 inconvenience-0.048
loo-0.047
irens-0.046
Token blocks
Token ID7021
Feature activation+1.28e+00

Pos logit contributions
opsy+0.033
 cozy+0.028
 harvest+0.028
cious+0.028
 nib+0.028
Neg logit contributions
iren-0.022
rob-0.022
loo-0.020
 inconvenience-0.019
irens-0.018
Token in
Token ID287
Feature activation+5.40e-01

Pos logit contributions
rob+0.069
iren+0.065
 inconvenience+0.064
irens+0.059
loo+0.059
Neg logit contributions
opsy-0.034
 drink-0.030
 harvest-0.029
 pick-0.029
 nib-0.028
Token Sara
Token ID24799
Feature activation-1.60e+00

Pos logit contributions
iren+0.023
rob+0.023
loo+0.021
 inconvenience+0.021
irens+0.020
Neg logit contributions
opsy-0.021
 cozy-0.018
cious-0.017
 harvest-0.017
 cos-0.017
Token's
Token ID338
Feature activation-1.42e-01

Pos logit contributions
opsy+0.052
ossom+0.038
cious+0.038
 herbal+0.038
 harvest+0.038
Neg logit contributions
iren-0.080
rob-0.077
 robots-0.076
loo-0.074
 poster-0.072
Token room
Token ID2119
Feature activation+1.91e+00

Pos logit contributions
opsy+0.006
 cozy+0.005
cious+0.005
 harvest+0.005
 cos+0.005
Neg logit contributions
iren-0.006
rob-0.006
 inconvenience-0.005
loo-0.005
irens-0.005
Token You
Token ID921
Feature activation+1.36e-01

Pos logit contributions
rob+0.010
iren+0.009
loo+0.009
 poster+0.008
irens+0.008
Neg logit contributions
opsy-0.013
 Baby-0.011
 gra-0.011
cious-0.011
 cozy-0.011
Token just
Token ID655
Feature activation+1.89e+00

Pos logit contributions
iren+0.006
rob+0.006
loo+0.006
irens+0.005
 robots+0.005
Neg logit contributions
opsy-0.005
cious-0.004
 harvest-0.004
 cozy-0.004
ossom-0.004
Token need
Token ID761
Feature activation+1.96e+00

Pos logit contributions
iren+0.088
rob+0.087
loo+0.082
irens+0.079
 robots+0.077
Neg logit contributions
opsy-0.068
cious-0.054
 harvest-0.054
 nib-0.054
 cozy-0.054
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
rob+0.073
iren+0.071
loo+0.066
 robots+0.062
irens+0.061
Neg logit contributions
opsy-0.089
 cozy-0.075
 harvest-0.075
 nib-0.074
 herbal-0.074
Token help
Token ID1037
Feature activation+1.08e+00

Pos logit contributions
iren+0.020
?"+0.018
rob+0.018
 inconvenience+0.018
er+0.017
Neg logit contributions
opsy-0.018
cious-0.016
ossom-0.015
 ju-0.015
 herbal-0.015
Token each
Token ID1123
Feature activation+8.32e-02

Pos logit contributions
iren+0.029
rob+0.027
 robots+0.025
loo+0.025
irens+0.024
Neg logit contributions
opsy-0.060
cious-0.052
ossom-0.051
 herbal-0.051
 gra-0.050
Token other
Token ID584
Feature activation+9.47e-01

Pos logit contributions
iren+0.004
rob+0.004
loo+0.003
 robots+0.003
irens+0.003
Neg logit contributions
opsy-0.003
 harvest-0.003
cious-0.003
 cozy-0.003
 herbal-0.003
Token and
Token ID290
Feature activation+8.04e-01

Pos logit contributions
iren+0.027
rob+0.026
loo+0.024
 robots+0.023
 inconvenience+0.022
Neg logit contributions
opsy-0.051
cious-0.044
 harvest-0.043
ossom-0.043
 herbal-0.042
Token it
Token ID340
Feature activation+3.89e-01

Pos logit contributions
iren+0.037
rob+0.037
loo+0.034
irens+0.033
 robots+0.033
Neg logit contributions
opsy-0.029
 cozy-0.023
 harvest-0.023
cious-0.023
 nib-0.023
Token isn
Token ID2125
Feature activation-6.44e-01

Pos logit contributions
iren+0.016
rob+0.016
loo+0.015
irens+0.014
 inconvenience+0.014
Neg logit contributions
opsy-0.016
 cozy-0.013
cious-0.013
ossom-0.013
 harvest-0.013
Token't
Token ID470
Feature activation+1.65e+00

Pos logit contributions
opsy+0.035
+0.030
 gra+0.029
 harvest+0.029
 cozy+0.029
Neg logit contributions
rob-0.018
iren-0.017
loo-0.016
 inconvenience-0.015
 robots-0.015
Token with
Token ID351
Feature activation+6.17e-01

Pos logit contributions
opsy+0.051
 relaxed+0.047
 cozy+0.047
 harvest+0.046
 cos+0.046
Neg logit contributions
rob-0.075
iren-0.071
loo-0.067
 poster-0.062
irens-0.062
Token things
Token ID1243
Feature activation+9.05e-01

Pos logit contributions
rob+0.023
iren+0.022
loo+0.020
 inconvenience+0.018
irens+0.018
Neg logit contributions
opsy-0.029
 cozy-0.025
 Baby-0.024
 harvest-0.024
cious-0.024
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
rob+0.043
iren+0.042
loo+0.038
irens+0.038
 inconvenience+0.035
Neg logit contributions
opsy-0.029
-0.028
 cozy-0.028
 drink-0.027
 relaxed-0.026
Token This
Token ID770
Feature activation-5.95e-01

Pos logit contributions
iren+0.008
rob+0.008
loo+0.007
 robots+0.007
irens+0.007
Neg logit contributions
opsy-0.013
cious-0.011
 cozy-0.011
 harvest-0.011
 herbal-0.011
Token is
Token ID318
Feature activation-8.46e-01

Pos logit contributions
opsy+0.023
 drink+0.022
 cozy+0.022
 pick+0.021
 herbal+0.021
Neg logit contributions
iren-0.023
rob-0.023
irens-0.021
lights-0.020
 robots-0.020
Token not
Token ID407
Feature activation+3.53e-02

Pos logit contributions
opsy+0.031
 cozy+0.030
 delicious+0.027
 ju+0.027
 warm+0.027
Neg logit contributions
rob-0.042
iren-0.040
loo-0.036
irens-0.034
 poster-0.033
Token a
Token ID257
Feature activation+9.28e-01

Pos logit contributions
rob+0.002
iren+0.002
loo+0.002
irens+0.002
 poster+0.002
Neg logit contributions
 cozy-0.001
opsy-0.001
 cos-0.001
 warm-0.001
 harvest-0.001
Token success
Token ID1943
Feature activation+2.48e+00

Pos logit contributions
iren+0.041
rob+0.041
loo+0.037
irens+0.037
 robots+0.036
Neg logit contributions
opsy-0.035
 cozy-0.030
 harvest-0.029
cious-0.028
 herbal-0.028
Token."
Token ID526
Feature activation-1.69e-01

Pos logit contributions
rob+0.111
iren+0.106
irens+0.096
loo+0.094
ulk+0.089
Neg logit contributions
opsy-0.084
 cozy-0.080
 drink-0.080
-0.078
 harvest-0.078
Token Lily
Token ID20037
Feature activation+3.30e-01

Pos logit contributions
opsy+0.007
 cozy+0.006
 Baby+0.006
ossom+0.006
 herbal+0.006
Neg logit contributions
rob-0.007
iren-0.007
loo-0.007
 inconvenience-0.006
 robots-0.006
Token felt
Token ID2936
Feature activation-5.13e-02

Pos logit contributions
iren+0.013
rob+0.013
 robots+0.012
 officer+0.012
loo+0.012
Neg logit contributions
opsy-0.014
 gra-0.011
 harvest-0.011
cious-0.011
ossom-0.011
Token said
Token ID531
Feature activation+1.53e-01

Pos logit contributions
 officer+0.009
iren+0.009
rob+0.009
?"+0.008
 poster+0.008
Neg logit contributions
opsy-0.009
 herbal-0.007
cious-0.007
ossom-0.007
 ju-0.007
Token they
Token ID484
Feature activation-9.43e-01

Pos logit contributions
rob+0.006
iren+0.006
loo+0.005
 robots+0.005
irens+0.005
Neg logit contributions
opsy-0.007
cious-0.006
 cozy-0.006
ossom-0.005
 harvest-0.005
Token loved
Token ID6151
Feature activation-9.20e-01

Pos logit contributions
opsy+0.043
 herbal+0.036
cious+0.035
 harvest+0.035
ossom+0.034
Neg logit contributions
iren-0.032
 officer-0.032
rob-0.031
loo-0.030
?"-0.030
Token her
Token ID607
Feature activation-8.00e-01

Pos logit contributions
opsy+0.041
cious+0.034
ossom+0.034
 harvest+0.034
 cozy+0.033
Neg logit contributions
iren-0.035
rob-0.034
loo-0.031
 robots-0.031
irens-0.030
Token too
Token ID1165
Feature activation+5.13e-01

Pos logit contributions
opsy+0.034
 cozy+0.029
 harvest+0.028
cious+0.028
 nib+0.028
Neg logit contributions
iren-0.032
rob-0.032
loo-0.029
irens-0.028
 inconvenience-0.027
Token and
Token ID290
Feature activation+1.70e-01

Pos logit contributions
iren+0.014
 robots+0.012
rob+0.012
loo+0.012
 officer+0.011
Neg logit contributions
opsy-0.029
 gra-0.025
cious-0.024
ossom-0.024
 herbal-0.024
Token said
Token ID531
Feature activation-1.70e-01

Pos logit contributions
iren+0.007
rob+0.007
loo+0.006
 robots+0.006
irens+0.006
Neg logit contributions
opsy-0.007
cious-0.006
 cozy-0.006
 harvest-0.006
ossom-0.006
Token goodbye
Token ID24829
Feature activation+7.90e-01

Pos logit contributions
opsy+0.007
 cozy+0.005
cious+0.005
 relaxed+0.005
apa+0.005
Neg logit contributions
rob-0.008
iren-0.007
loo-0.007
irens-0.007
 robots-0.007
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
rob+0.034
iren+0.033
loo+0.031
irens+0.030
 robots+0.029
Neg logit contributions
opsy-0.032
cious-0.028
 cozy-0.028
ossom-0.028
 relaxed-0.027
Token They
Token ID1119
Feature activation-1.88e+00

Pos logit contributions
opsy+0.008
cious+0.007
 cozy+0.007
 harvest+0.006
ossom+0.006
Neg logit contributions
iren-0.010
rob-0.010
loo-0.009
 inconvenience-0.009
 robots-0.009
Token hung
Token ID9174
Feature activation-1.01e+00

Pos logit contributions
opsy+0.078
cious+0.065
ossom+0.065
 herbal+0.064
 harvest+0.064
Neg logit contributions
rob-0.072
iren-0.071
 officer-0.066
loo-0.066
 robots-0.065
Token rabbit
Token ID22746
Feature activation-1.96e+00

Pos logit contributions
iren+0.012
rob+0.011
 robots+0.011
loo+0.011
 poster+0.011
Neg logit contributions
opsy-0.016
ossom-0.014
ench-0.013
apa-0.013
cious-0.013
Token hopped
Token ID45230
Feature activation-1.32e+00

Pos logit contributions
opsy+0.091
ossom+0.077
ench+0.077
 herbal+0.076
 harvest+0.076
Neg logit contributions
rob-0.063
iren-0.062
 officer-0.060
er-0.060
 robots-0.056
Token away
Token ID1497
Feature activation+8.18e-01

Pos logit contributions
opsy+0.059
cious+0.049
 harvest+0.047
ossom+0.047
 cozy+0.047
Neg logit contributions
iren-0.051
rob-0.049
loo-0.046
 robots-0.044
irens-0.043
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
iren+0.028
rob+0.027
loo+0.025
 robots+0.024
irens+0.024
Neg logit contributions
opsy-0.039
cious-0.033
 harvest-0.032
 gra-0.032
ossom-0.032
Token
Token ID198
Feature activation-8.25e-01

Pos logit contributions
opsy+0.011
 cozy+0.009
cious+0.009
ossom+0.009
 Baby+0.009
Neg logit contributions
rob-0.012
iren-0.012
loo-0.012
 inconvenience-0.011
 poster-0.011
Token
Token ID198
Feature activation-9.49e-01

Pos logit contributions
opsy+0.029
 cozy+0.023
cious+0.022
apa+0.022
ossom+0.022
Neg logit contributions
rob-0.039
iren-0.038
loo-0.036
 inconvenience-0.035
 robots-0.035
TokenSarah
Token ID29284
Feature activation+3.44e-01

Pos logit contributions
opsy+0.046
cious+0.040
 cozy+0.039
ossom+0.039
 harvest+0.038
Neg logit contributions
rob-0.032
iren-0.031
loo-0.030
 inconvenience-0.027
 robots-0.027
Token was
Token ID373
Feature activation+2.07e-01

Pos logit contributions
iren+0.012
rob+0.011
er+0.011
 officer+0.011
 robots+0.010
Neg logit contributions
opsy-0.016
 harvest-0.013
 herbal-0.013
ossom-0.013
ench-0.013
Token feeling
Token ID4203
Feature activation-2.18e-01

Pos logit contributions
iren+0.007
 robots+0.007
 officer+0.007
rob+0.006
irens+0.006
Neg logit contributions
opsy-0.009
 harvest-0.008
ossom-0.008
cious-0.008
apa-0.008
Token so
Token ID523
Feature activation-1.52e+00

Pos logit contributions
opsy+0.010
ossom+0.008
cious+0.008
 harvest+0.008
opp+0.008
Neg logit contributions
iren-0.008
 officer-0.007
 robots-0.007
?"-0.007
er-0.007
Token excited
Token ID6568
Feature activation-7.57e-01

Pos logit contributions
opsy+0.062
cious+0.051
uces+0.050
 harvest+0.049
opp+0.049
Neg logit contributions
iren-0.060
?"-0.057
 hits-0.053
loo-0.052
 yourselves-0.052
Token
Token ID220
Feature activation-3.30e-01

Pos logit contributions
opsy+0.001
cious+0.001
 cozy+0.001
 harvest+0.001
ossom+0.001
Neg logit contributions
iren-0.001
rob-0.001
loo-0.001
 robots-0.001
 inconvenience-0.001
Token
Token ID198
Feature activation-7.80e-01

Pos logit contributions
opsy+0.012
 cozy+0.010
cious+0.010
apa+0.010
ossom+0.010
Neg logit contributions
rob-0.015
iren-0.015
loo-0.014
 robots-0.013
 inconvenience-0.013
Token
Token ID198
Feature activation-8.64e-01

Pos logit contributions
opsy+0.035
 harvest+0.029
cious+0.029
ossom+0.029
 herbal+0.029
Neg logit contributions
iren-0.030
rob-0.028
loo-0.026
irens-0.025
 robots-0.025
TokenThe
Token ID464
Feature activation+7.73e-01

Pos logit contributions
opsy+0.038
 harvest+0.032
 cozy+0.031
ossom+0.031
 nib+0.031
Neg logit contributions
iren-0.034
rob-0.030
irens-0.029
loo-0.028
 robots-0.028
Token boun
Token ID31283
Feature activation-1.92e-01

Pos logit contributions
iren+0.030
rob+0.029
loo+0.027
irens+0.026
 robots+0.025
Neg logit contributions
opsy-0.034
cious-0.028
 cozy-0.028
 harvest-0.028
 herbal-0.028
Tokency
Token ID948
Feature activation-3.10e+00

Pos logit contributions
opsy+0.004
ossom+0.002
 harvest+0.002
ench+0.002
 drink+0.002
Neg logit contributions
iren-0.012
rob-0.012
 inconvenience-0.012
irens-0.011
 robots-0.011
Token ball
Token ID2613
Feature activation-3.91e-01

Pos logit contributions
opsy+0.128
cious+0.105
 cozy+0.103
 nib+0.103
 harvest+0.102
Neg logit contributions
iren-0.128
rob-0.125
loo-0.116
irens-0.113
 robots-0.111
Token was
Token ID373
Feature activation-5.76e-01

Pos logit contributions
opsy+0.014
 nib+0.012
cious+0.011
 cozy+0.011
 relaxed+0.011
Neg logit contributions
rob-0.018
iren-0.018
loo-0.017
irens-0.016
 inconvenience-0.016
Token so
Token ID523
Feature activation-2.10e+00

Pos logit contributions
opsy+0.025
cious+0.021
ossom+0.021
 harvest+0.020
apa+0.020
Neg logit contributions
iren-0.023
rob-0.021
 robots-0.020
loo-0.020
irens-0.020
Token scared
Token ID12008
Feature activation-1.22e+00

Pos logit contributions
opsy+0.084
 harvest+0.069
cious+0.069
ossom+0.066
ench+0.065
Neg logit contributions
iren-0.091
loo-0.081
 robots-0.081
?"-0.078
rob-0.078
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
opsy+0.053
 Baby+0.046
 cozy+0.044
 Ginny+0.044
 nib+0.043
Neg logit contributions
rob-0.049
iren-0.046
loo-0.043
 poster-0.042
 inconvenience-0.041
Token when
Token ID618
Feature activation-1.72e+00

Pos logit contributions
iren+0.036
rob+0.034
 robots+0.033
loo+0.032
er+0.032
Neg logit contributions
opsy-0.064
cious-0.055
 gra-0.055
 harvest-0.054
 herbal-0.054
Token she
Token ID673
Feature activation+3.30e-01

Pos logit contributions
opsy+0.089
cious+0.077
ossom+0.076
ench+0.075
 herbal+0.074
Neg logit contributions
iren-0.052
rob-0.048
loo-0.046
 robots-0.046
 poster-0.044
Token saw
Token ID2497
Feature activation+2.13e-01

Pos logit contributions
rob+0.012
iren+0.012
loo+0.011
?"+0.010
 poster+0.010
Neg logit contributions
opsy-0.014
cious-0.012
ossom-0.012
 herbal-0.012
 gra-0.012
Token the
Token ID262
Feature activation-6.05e-01

Pos logit contributions
iren+0.007
rob+0.007
loo+0.006
 robots+0.006
irens+0.006
Neg logit contributions
opsy-0.011
cious-0.009
ossom-0.009
 harvest-0.009
 herbal-0.009
Token water
Token ID1660
Feature activation-2.42e+00

Pos logit contributions
opsy+0.025
cious+0.021
apa+0.021
ossom+0.020
opp+0.020
Neg logit contributions
iren-0.024
loo-0.024
 poster-0.023
rob-0.022
 inconvenience-0.022
Token come
Token ID1282
Feature activation-3.19e+00

Pos logit contributions
opsy+0.116
cious+0.098
ossom+0.098
apa+0.095
opp+0.094
Neg logit contributions
iren-0.083
loo-0.078
rob-0.076
er-0.073
 poster-0.071
Token out
Token ID503
Feature activation-1.27e+00

Pos logit contributions
opsy+0.165
cious+0.142
apa+0.139
 herbal+0.137
 mental+0.136
Neg logit contributions
iren-0.093
er-0.083
?"-0.077
loo-0.077
 robots-0.077
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
opsy+0.075
cious+0.066
ossom+0.064
 gra+0.063
apa+0.062
Neg logit contributions
iren-0.032
loo-0.027
er-0.026
 robots-0.026
irens-0.025
Token
Token ID198
Feature activation-6.68e-01

Pos logit contributions
opsy+0.008
cious+0.007
 harvest+0.007
ossom+0.007
 cozy+0.007
Neg logit contributions
iren-0.008
rob-0.008
loo-0.007
 robots-0.007
irens-0.007
Token
Token ID198
Feature activation-8.06e-01

Pos logit contributions
opsy+0.029
cious+0.024
 harvest+0.024
ossom+0.024
 herbal+0.024
Neg logit contributions
iren-0.026
rob-0.025
loo-0.023
irens-0.022
 robots-0.022
TokenThe
Token ID464
Feature activation+7.84e-01

Pos logit contributions
opsy+0.036
 harvest+0.030
cious+0.030
apa+0.029
ossom+0.029
Neg logit contributions
iren-0.031
rob-0.028
irens-0.027
 robots-0.026
loo-0.026
Token in
Token ID287
Feature activation+2.93e-01

Pos logit contributions
iren+0.011
rob+0.010
loo+0.010
 robots+0.009
irens+0.009
Neg logit contributions
opsy-0.012
cious-0.010
 harvest-0.010
ossom-0.010
 cozy-0.010
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
iren+0.011
rob+0.010
loo+0.010
 robots+0.009
irens+0.009
Neg logit contributions
opsy-0.014
cious-0.011
 harvest-0.011
 cozy-0.011
ossom-0.011
Token She
Token ID1375
Feature activation-3.48e-01

Pos logit contributions
opsy+0.026
cious+0.022
 harvest+0.021
 cozy+0.021
ossom+0.021
Neg logit contributions
iren-0.023
rob-0.022
loo-0.021
 robots-0.020
irens-0.020
Token saw
Token ID2497
Feature activation+6.28e-02

Pos logit contributions
opsy+0.015
 harvest+0.012
cious+0.012
ossom+0.012
 herbal+0.012
Neg logit contributions
iren-0.014
rob-0.014
loo-0.013
 robots-0.012
 officer-0.012
Token Jack
Token ID3619
Feature activation-9.64e-01

Pos logit contributions
iren+0.002
rob+0.002
loo+0.002
 robots+0.002
 inconvenience+0.002
Neg logit contributions
opsy-0.003
cious-0.002
 harvest-0.002
 cozy-0.002
ossom-0.002
Token eating
Token ID6600
Feature activation-3.70e+00

Pos logit contributions
opsy+0.048
ossom+0.042
 harvest+0.041
 herbal+0.041
ench+0.041
Neg logit contributions
iren-0.029
er-0.026
 robots-0.026
 officer-0.026
rob-0.026
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
opsy+0.189
cious+0.163
ossom+0.161
apa+0.154
ench+0.153
Neg logit contributions
iren-0.112
rob-0.110
loo-0.100
er-0.097
 robots-0.096
Token cel
Token ID18725
Feature activation-9.89e-01

Pos logit contributions
opsy+0.008
cious+0.007
ossom+0.007
apa+0.007
 Ginny+0.006
Neg logit contributions
rob-0.011
iren-0.011
loo-0.010
 robots-0.010
?"-0.010
Tokenery
Token ID1924
Feature activation-2.88e+00

Pos logit contributions
opsy+0.038
cious+0.032
 cozy+0.032
 Baby+0.031
 harvest+0.031
Neg logit contributions
rob-0.042
iren-0.041
loo-0.039
ers-0.036
er-0.036
Token and
Token ID290
Feature activation-3.72e-01

Pos logit contributions
opsy+0.148
cious+0.124
 gra+0.123
 mental+0.123
opp+0.122
Neg logit contributions
iren-0.081
er-0.077
rob-0.077
loo-0.076
 robots-0.070
Token asked
Token ID1965
Feature activation-5.19e-01

Pos logit contributions
opsy+0.016
cious+0.013
 cozy+0.012
 harvest+0.012
 nib+0.012
Neg logit contributions
iren-0.015
rob-0.015
loo-0.014
 robots-0.013
 inconvenience-0.013
Token desert
Token ID10326
Feature activation-1.84e+00

Pos logit contributions
rob+0.018
iren+0.018
loo+0.016
 inconvenience+0.016
driver+0.016
Neg logit contributions
opsy-0.021
 cozy-0.019
 gra-0.018
 cos-0.018
 harvest-0.018
Token,
Token ID11
Feature activation-1.68e-01

Pos logit contributions
opsy+0.073
cious+0.063
 cozy+0.060
 harvest+0.060
 relaxed+0.060
Neg logit contributions
rob-0.076
iren-0.074
 robots-0.067
loo-0.067
 inconvenience-0.067
Token Max
Token ID5436
Feature activation-1.30e-01

Pos logit contributions
opsy+0.008
cious+0.007
 harvest+0.007
ossom+0.007
 herbal+0.007
Neg logit contributions
iren-0.005
rob-0.005
loo-0.005
 robots-0.005
irens-0.004
Token was
Token ID373
Feature activation+2.15e-01

Pos logit contributions
opsy+0.006
 herbal+0.005
 harvest+0.005
ossom+0.005
 mental+0.005
Neg logit contributions
iren-0.005
rob-0.005
 robots-0.004
 officer-0.004
loo-0.004
Token very
Token ID845
Feature activation-1.11e+00

Pos logit contributions
iren+0.009
rob+0.009
loo+0.008
 robots+0.008
irens+0.008
Neg logit contributions
opsy-0.009
cious-0.007
 harvest-0.007
 cozy-0.007
ossom-0.007
Token thirsty
Token ID47124
Feature activation-3.66e+00

Pos logit contributions
opsy+0.037
cious+0.029
 herbal+0.028
 harvest+0.028
ench+0.027
Neg logit contributions
iren-0.052
loo-0.048
?"-0.047
 inconvenience-0.046
 robots-0.045
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
opsy+0.207
ossom+0.183
cious+0.180
 gra+0.176
apa+0.175
Neg logit contributions
iren-0.097
rob-0.088
loo-0.087
er-0.084
?"-0.083
Token He
Token ID679
Feature activation+3.60e-01

Pos logit contributions
opsy+0.017
cious+0.014
 cozy+0.014
 harvest+0.014
 nib+0.014
Neg logit contributions
iren-0.013
rob-0.012
loo-0.011
 inconvenience-0.011
 robots-0.011
Token spotted
Token ID13489
Feature activation-8.00e-01

Pos logit contributions
iren+0.015
rob+0.014
loo+0.013
 robots+0.013
 officer+0.013
Neg logit contributions
opsy-0.014
cious-0.012
ossom-0.012
 harvest-0.011
 herbal-0.011
Token a
Token ID257
Feature activation-3.75e-01

Pos logit contributions
opsy+0.036
cious+0.030
 harvest+0.029
 cozy+0.029
ossom+0.029
Neg logit contributions
iren-0.030
rob-0.030
loo-0.028
 robots-0.027
irens-0.026
Token giant
Token ID6175
Feature activation+2.18e-01

Pos logit contributions
opsy+0.015
cious+0.013
apa+0.012
ossom+0.012
opp+0.012
Neg logit contributions
iren-0.015
 poster-0.015
loo-0.014
rob-0.014
 officer-0.013
TokenMaybe
Token ID13300
Feature activation+5.63e-01

Pos logit contributions
iren+0.033
rob+0.033
loo+0.031
 robots+0.029
irens+0.029
Neg logit contributions
opsy-0.038
cious-0.032
 harvest-0.031
 cozy-0.031
 herbal-0.031
Token the
Token ID262
Feature activation+9.52e-01

Pos logit contributions
iren+0.026
rob+0.025
irens+0.022
loo+0.022
 robots+0.021
Neg logit contributions
opsy-0.023
 cozy-0.020
 harvest-0.019
 drink-0.019
 Baby-0.019
Token sw
Token ID1509
Feature activation-1.18e+00

Pos logit contributions
iren+0.049
rob+0.048
loo+0.046
 robots+0.045
irens+0.044
Neg logit contributions
opsy-0.030
cious-0.023
 harvest-0.022
 cozy-0.022
 herbal-0.022
Tokenan
Token ID272
Feature activation-1.69e+00

Pos logit contributions
opsy+0.047
cious+0.038
ossom+0.037
 harvest+0.037
ench+0.036
Neg logit contributions
iren-0.051
rob-0.051
loo-0.047
 robots-0.047
irens-0.045
Token was
Token ID373
Feature activation+7.87e-01

Pos logit contributions
opsy+0.071
 gra+0.061
 drink+0.060
 nib+0.060
 relaxed+0.059
Neg logit contributions
iren-0.067
rob-0.067
irens-0.063
loo-0.060
 robots-0.059
Token hungry
Token ID14720
Feature activation-3.66e+00

Pos logit contributions
iren+0.033
rob+0.033
loo+0.031
irens+0.029
 robots+0.029
Neg logit contributions
opsy-0.032
 cozy-0.027
 nib-0.026
cious-0.026
 harvest-0.026
Token and
Token ID290
Feature activation-4.62e-01

Pos logit contributions
opsy+0.149
 drink+0.136
 harvest+0.136
 cozy+0.135
 nib+0.135
Neg logit contributions
rob-0.149
iren-0.141
irens-0.129
loo-0.126
 robots-0.124
Token in
Token ID287
Feature activation+1.53e+00

Pos logit contributions
opsy+0.016
 cozy+0.013
 nib+0.013
 gra+0.012
 relaxed+0.012
Neg logit contributions
rob-0.023
iren-0.022
loo-0.021
irens-0.020
 robots-0.020
Token pain
Token ID2356
Feature activation+1.15e+00

Pos logit contributions
rob+0.070
iren+0.068
loo+0.063
irens+0.059
 robots+0.058
Neg logit contributions
opsy-0.056
 cozy-0.051
 harvest-0.050
 drink-0.048
 Baby-0.047
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
rob+0.044
iren+0.043
loo+0.039
irens+0.038
 robots+0.038
Neg logit contributions
opsy-0.049
-0.043
 relaxed-0.043
 drink-0.043
 Baby-0.043
Token Maybe
Token ID6674
Feature activation+5.38e-02

Pos logit contributions
rob+0.028
iren+0.028
loo+0.025
irens+0.023
 robots+0.023
Neg logit contributions
opsy-0.043
cious-0.037
 cozy-0.036
ossom-0.036
 harvest-0.036