TOP ACTIVATIONS
MAX = 6.444

to hop inside , a gru ff voice shouted " NO
man stepped closer and said gru ff ly , " You
Tom and said in a gru ff voice , " Get
ks . It was a gru ff voice and it said
she heard a loud , gru ff voice calling her name
! It said in a gru ff voice , " This
the rice back in its rightful place . John was proud
Ab ig ail for her polit eness , then walked away
word . It is ' bat '. It means a stick
that the penny find its rightful owner .
the ancient knob found its rightful place in their house .
animal . It was a gu inea pig with a scale
her own special slide - bat he . She
aloud , with her favourite words being ˜
. It was a special gu inea pig .
returned the towel to its rightful owner . The owner was
put them back in their rightful place . After
The boot returned to its rightful owners and the conflict was
restore it back to its rightful place . Although Sammy was
slider after her slide - bat he and she was so

MIN ACTIVATIONS
MIN = -6.395

things â col ouring books , stickers and
a strange , raspberry - col oured object hiding under a
delicious vegetables and bright - col oured flowers .
, with all its different col ours , shapes and textures
date , a rainbow - col oured butterfly flew in through
day , a chocolate - col oured whip named C heddar
big , beautiful rainbow - col oured butterfly ! The little
suddenly , a brightly - col oured belt caught her eye
want to help . B amb i felt sad and scared
- a box of mult icol ored bubbles !
i . B amb i ran as fast as
was trying to shoot B amb i . B
in a trap . B amb i tried to help Th
that !" Charl otte thanked her mother for
. " L amb ? Like sheep ?" Mia
Finally , B amb i found a kind hunter
I - I - I p - played with the p
dolls and pretend they were mer maid s or pirates .
had saved him . <|endoftext|> Charl otte and her mom were
. B amb i knew he had to

INTERVAL 3.234 - 6.444
CONTAINS 0.029%

, " What are the rods for ?" Jake said that
not know what to say next . The man
the stolen squares to their rightful place . The
, the mean boy from next door . He was holding
. He had a bus hy tail and a small nose

INTERVAL 0.025 - 3.234
CONTAINS 49.095%

jealous because she wanted some cream too . K
was so small , it was hard for him to climb
it with lots of love and attention . Sur
time there was a rabbit . She was very sad because
was too slow . Something else was faster ! A frog

INTERVAL -3.185 - 0.025
CONTAINS 50.832%

" What a lovely plant !" Mom my said
go on a fast ride with his mom . <|endoftext|> Once
around the garden . He sees flowers , grass , and
relaxed . When she opened her eyes again ,
field . Tim my and his mum followed the

INTERVAL -6.395 - -3.185
CONTAINS 0.044%

found it . B enny saw a big fish
She tossed and turned all night and couldn 't get close
too far away for the little girl to see . She
stars . She couldn 't believe it was real ! The
can 't watch TV all day , Lily . You have
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 sidelines+0.000
 shower+0.000
 stereo+0.000
 curtain+0.000
 curtains+0.000
Neg logit contributions
over-0.000
 tal-0.000
id-0.000
 enough-0.000
est-0.000
Token hop
Token ID1725
Feature activation+8.78e-01

Pos logit contributions
over+0.035
 enough+0.035
id+0.034
 tal+0.033
 Bill+0.033
Neg logit contributions
 sidelines-0.029
 rinse-0.026
 stereo-0.026
 shower-0.025
irs-0.024
Token inside
Token ID2641
Feature activation-1.43e+00

Pos logit contributions
id+0.019
over+0.019
 Bill+0.018
 tal+0.018
 Stan+0.017
Neg logit contributions
 sidelines-0.031
 shower-0.030
 curtain-0.029
 stereo-0.029
 rinse-0.028
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 sidelines+0.050
 shower+0.048
 curtain+0.048
 curtains+0.048
 bathroom+0.047
Neg logit contributions
 tal-0.029
over-0.029
id-0.028
colm-0.025
 Bill-0.024
Token a
Token ID257
Feature activation+8.17e-01

Pos logit contributions
over+0.042
id+0.039
aces+0.039
ac+0.038
 tal+0.037
Neg logit contributions
 shower-0.029
 sidelines-0.029
 bathroom-0.029
 curtains-0.028
 Nina-0.026
Token gru
Token ID22848
Feature activation+6.44e+00

Pos logit contributions
 enough+0.016
over+0.016
 Bill+0.015
id+0.015
 tal+0.015
Neg logit contributions
 sidelines-0.031
 stereo-0.029
 rinse-0.028
 shower-0.028
irs-0.028
Tokenff
Token ID487
Feature activation+3.57e-01

Pos logit contributions
 enough+0.126
over+0.124
 Bill+0.119
 meant+0.114
id+0.113
Neg logit contributions
 sidelines-0.224
 stereo-0.214
 shower-0.210
irs-0.207
 rinse-0.207
Token voice
Token ID3809
Feature activation+9.98e-01

Pos logit contributions
 enough+0.008
over+0.008
 Bill+0.008
id+0.008
 tal+0.008
Neg logit contributions
 sidelines-0.012
 stereo-0.011
 shower-0.011
 curtain-0.011
 rinse-0.011
Token shouted
Token ID17293
Feature activation+3.25e-01

Pos logit contributions
over+0.026
 enough+0.025
id+0.025
 tal+0.025
 Bill+0.024
Neg logit contributions
 sidelines-0.030
 shower-0.028
 stereo-0.028
 curtain-0.028
 rinse-0.027
Token "
Token ID366
Feature activation+3.75e-01

Pos logit contributions
 tal+0.009
 enough+0.008
 meant+0.008
 Bill+0.008
over+0.008
Neg logit contributions
 shower-0.009
 sidelines-0.009
 curtain-0.009
 stereo-0.009
 curtains-0.008
TokenNO
Token ID15285
Feature activation+1.05e+00

Pos logit contributions
 enough+0.011
over+0.011
 tal+0.011
 Bill+0.010
 Stan+0.010
Neg logit contributions
 sidelines-0.010
 stereo-0.009
 shower-0.009
irs-0.009
 rinse-0.008
Token man
Token ID582
Feature activation+3.77e-01

Pos logit contributions
 enough+0.014
over+0.014
 Bill+0.014
id+0.013
 tal+0.013
Neg logit contributions
 sidelines-0.026
 stereo-0.024
 rinse-0.024
irs-0.024
 shower-0.024
Token stepped
Token ID10764
Feature activation+2.39e-01

Pos logit contributions
over+0.010
 tal+0.010
 enough+0.009
 Bill+0.009
id+0.009
Neg logit contributions
 sidelines-0.011
 sip-0.010
 rinse-0.010
 stereo-0.010
 shower-0.010
Token closer
Token ID5699
Feature activation+4.73e-01

Pos logit contributions
over+0.005
 Bill+0.005
 enough+0.005
 tal+0.005
id+0.005
Neg logit contributions
 sidelines-0.009
 stereo-0.008
 rinse-0.008
 shower-0.008
irs-0.008
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
over+0.012
id+0.011
 tal+0.011
 Bill+0.010
colm+0.010
Neg logit contributions
 sidelines-0.014
 sip-0.014
 shower-0.014
 curtain-0.013
 curtains-0.013
Token said
Token ID531
Feature activation+3.47e-02

Pos logit contributions
 enough+0.020
over+0.019
 Bill+0.019
id+0.019
 tal+0.019
Neg logit contributions
 sidelines-0.034
 stereo-0.031
 rinse-0.031
 shower-0.030
irs-0.030
Token gru
Token ID22848
Feature activation+6.31e+00

Pos logit contributions
over+0.001
 tal+0.001
id+0.001
 Bill+0.000
 meant+0.000
Neg logit contributions
 sidelines-0.001
 shower-0.001
 rinse-0.001
 curtain-0.001
 stereo-0.001
Tokenff
Token ID487
Feature activation-7.63e-01

Pos logit contributions
over+0.140
 Bill+0.131
 afloat+0.126
 enough+0.126
 Stan+0.125
Neg logit contributions
 sidelines-0.187
 Acc-0.186
irs-0.182
 stereo-0.181
 rinse-0.179
Tokenly
Token ID306
Feature activation-7.85e-01

Pos logit contributions
 sidelines+0.023
 stereo+0.021
 curtain+0.021
 shower+0.021
 rinse+0.021
Neg logit contributions
over-0.018
 tal-0.017
 Bill-0.017
 enough-0.017
 Stan-0.016
Token,
Token ID11
Feature activation+8.29e-01

Pos logit contributions
 sidelines+0.029
 curtain+0.028
 stereo+0.028
 shower+0.028
 curtains+0.027
Neg logit contributions
over-0.012
 tal-0.012
 afloat-0.011
id-0.011
 Bill-0.011
Token "
Token ID366
Feature activation+2.93e-02

Pos logit contributions
over+0.021
 enough+0.021
 tal+0.020
 Bill+0.020
id+0.020
Neg logit contributions
 sidelines-0.026
 stereo-0.024
 rinse-0.023
 shower-0.023
 curtain-0.023
TokenYou
Token ID1639
Feature activation-1.64e-01

Pos logit contributions
 enough+0.001
over+0.001
 Bill+0.001
 tal+0.001
id+0.001
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
irs-0.001
Token Tom
Token ID4186
Feature activation+1.06e+00

Pos logit contributions
over+0.031
 tal+0.031
id+0.030
 enough+0.027
 afloat+0.027
Neg logit contributions
 sidelines-0.038
 stereo-0.033
 shower-0.033
 curtain-0.033
irs-0.032
Token and
Token ID290
Feature activation+8.44e-01

Pos logit contributions
over+0.015
 tal+0.014
 Bill+0.014
 enough+0.014
id+0.013
Neg logit contributions
 sidelines-0.045
 stereo-0.043
 shower-0.042
 rinse-0.042
 curtain-0.041
Token said
Token ID531
Feature activation+2.81e-02

Pos logit contributions
over+0.022
id+0.021
 tal+0.020
 enough+0.020
est+0.019
Neg logit contributions
 sidelines-0.026
 stereo-0.024
 rinse-0.024
 shower-0.024
 sip-0.023
Token in
Token ID287
Feature activation+2.27e-01

Pos logit contributions
over+0.000
 tal+0.000
id+0.000
 afloat+0.000
 meant+0.000
Neg logit contributions
 sidelines-0.001
 shower-0.001
 curtain-0.001
 stereo-0.001
 rinse-0.001
Token a
Token ID257
Feature activation+2.41e-01

Pos logit contributions
id+0.005
 tal+0.004
over+0.004
 Bill+0.004
 enough+0.004
Neg logit contributions
 sidelines-0.008
 stereo-0.007
 shower-0.007
 rinse-0.007
 sip-0.007
Token gru
Token ID22848
Feature activation+6.27e+00

Pos logit contributions
over+0.005
 enough+0.005
id+0.005
 Bill+0.005
 tal+0.005
Neg logit contributions
 sidelines-0.009
 shower-0.008
 stereo-0.008
 rinse-0.008
 curtain-0.008
Tokenff
Token ID487
Feature activation+6.59e-02

Pos logit contributions
over+0.124
 enough+0.124
 Bill+0.117
 meant+0.116
id+0.114
Neg logit contributions
 sidelines-0.213
 stereo-0.204
 shower-0.200
irs-0.199
 rinse-0.197
Token voice
Token ID3809
Feature activation+7.46e-01

Pos logit contributions
 enough+0.002
 Bill+0.002
over+0.002
id+0.002
 tal+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 shower-0.002
 curtain-0.002
 rinse-0.002
Token,
Token ID11
Feature activation+7.20e-01

Pos logit contributions
 tal+0.014
over+0.014
id+0.012
 afloat+0.012
colm+0.012
Neg logit contributions
 shower-0.026
 curtain-0.026
 sidelines-0.026
 stereo-0.025
 curtains-0.025
Token "
Token ID366
Feature activation-8.38e-03

Pos logit contributions
 enough+0.017
over+0.017
 Bill+0.016
 tal+0.016
id+0.016
Neg logit contributions
 sidelines-0.025
 stereo-0.023
 rinse-0.022
 shower-0.022
irs-0.022
TokenGet
Token ID3855
Feature activation-4.92e-01

Pos logit contributions
 sidelines+0.000
 stereo+0.000
 rinse+0.000
 shower+0.000
irs+0.000
Neg logit contributions
 enough-0.000
over-0.000
 Bill-0.000
 tal-0.000
id-0.000
Tokenks
Token ID591
Feature activation+1.74e+00

Pos logit contributions
 tal+0.014
over+0.013
 enough+0.012
 Bill+0.012
id+0.012
Neg logit contributions
 sidelines-0.017
 curtain-0.017
 stereo-0.017
 shower-0.016
irs-0.016
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
 Bill+0.032
 tal+0.032
over+0.032
 enough+0.031
id+0.031
Neg logit contributions
 sidelines-0.065
 stereo-0.062
 shower-0.061
 rinse-0.060
 curtain-0.060
Token It
Token ID632
Feature activation-1.04e+00

Pos logit contributions
over+0.008
 enough+0.008
 tal+0.007
id+0.007
 afloat+0.007
Neg logit contributions
 sidelines-0.009
 stereo-0.008
 shower-0.008
 curtain-0.008
 rinse-0.008
Token was
Token ID373
Feature activation+7.97e-01

Pos logit contributions
 sidelines+0.032
 stereo+0.031
 shower+0.031
 curtain+0.031
 rinse+0.030
Neg logit contributions
over-0.027
 enough-0.025
id-0.024
 tal-0.024
 Bill-0.022
Token a
Token ID257
Feature activation+5.43e-01

Pos logit contributions
over+0.018
id+0.018
 enough+0.017
 tal+0.017
est+0.016
Neg logit contributions
 sidelines-0.026
 shower-0.025
 stereo-0.025
 curtain-0.025
 rinse-0.025
Token gru
Token ID22848
Feature activation+6.25e+00

Pos logit contributions
 Bill+0.010
 enough+0.010
over+0.010
 tal+0.010
id+0.009
Neg logit contributions
 sidelines-0.020
irs-0.019
 rinse-0.019
 stereo-0.019
 shower-0.018
Tokenff
Token ID487
Feature activation-5.01e-01

Pos logit contributions
 enough+0.123
over+0.121
 Bill+0.116
 meant+0.113
 tal+0.111
Neg logit contributions
 sidelines-0.216
 stereo-0.207
 shower-0.203
irs-0.201
 rinse-0.200
Token voice
Token ID3809
Feature activation+7.17e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 shower+0.016
 curtain+0.015
 rinse+0.015
Neg logit contributions
 enough-0.012
 Bill-0.011
over-0.011
id-0.011
 tal-0.011
Token and
Token ID290
Feature activation+5.60e-01

Pos logit contributions
over+0.018
 enough+0.017
 tal+0.017
id+0.017
 Bill+0.017
Neg logit contributions
 sidelines-0.023
 shower-0.021
 stereo-0.021
 curtain-0.021
 rinse-0.020
Token it
Token ID340
Feature activation-3.51e-01

Pos logit contributions
over+0.012
 enough+0.012
id+0.011
 tal+0.011
 Bill+0.011
Neg logit contributions
 sidelines-0.020
 stereo-0.019
 rinse-0.018
 shower-0.018
irs-0.018
Token said
Token ID531
Feature activation-1.57e-01

Pos logit contributions
 sidelines+0.011
 stereo+0.010
 shower+0.010
 curtain+0.010
 rinse+0.010
Neg logit contributions
over-0.009
 tal-0.008
 enough-0.008
id-0.008
 Bill-0.008
Token she
Token ID673
Feature activation-5.35e-01

Pos logit contributions
 enough+0.022
 Bill+0.021
over+0.021
 tal+0.020
id+0.020
Neg logit contributions
 sidelines-0.023
 stereo-0.021
 rinse-0.021
irs-0.020
 shower-0.020
Token heard
Token ID2982
Feature activation+1.01e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.019
 curtain+0.018
Neg logit contributions
over-0.010
 enough-0.010
 Bill-0.010
 tal-0.009
id-0.009
Token a
Token ID257
Feature activation-4.94e-01

Pos logit contributions
over+0.001
 enough+0.001
id+0.001
 Bill+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.004
 stereo-0.004
 shower-0.004
 rinse-0.004
 curtain-0.004
Token loud
Token ID7812
Feature activation+2.21e+00

Pos logit contributions
 sidelines+0.019
irs+0.019
 rinse+0.018
 stereo+0.018
 shower+0.018
Neg logit contributions
 Bill-0.008
over-0.008
 enough-0.008
 tal-0.008
id-0.007
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
 enough+0.057
over+0.056
 Bill+0.055
id+0.055
 tal+0.054
Neg logit contributions
 sidelines-0.070
 stereo-0.064
 shower-0.063
 rinse-0.062
 curtain-0.061
Token gru
Token ID22848
Feature activation+6.24e+00

Pos logit contributions
over+0.004
id+0.004
 enough+0.003
 tal+0.003
 Bill+0.003
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 shower-0.005
 curtain-0.005
 rinse-0.005
Tokenff
Token ID487
Feature activation-3.76e-01

Pos logit contributions
 enough+0.121
over+0.120
 Bill+0.112
 meant+0.109
id+0.108
Neg logit contributions
 sidelines-0.220
 stereo-0.210
irs-0.206
 shower-0.205
 rinse-0.203
Token voice
Token ID3809
Feature activation+1.76e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.012
 shower+0.012
 curtain+0.011
 rinse+0.011
Neg logit contributions
 enough-0.009
over-0.009
 Bill-0.008
 tal-0.008
id-0.008
Token calling
Token ID4585
Feature activation-1.50e+00

Pos logit contributions
over+0.005
 enough+0.004
 tal+0.004
id+0.004
 Bill+0.004
Neg logit contributions
 sidelines-0.005
 shower-0.005
 stereo-0.005
 curtain-0.005
 curtains-0.005
Token her
Token ID607
Feature activation-4.14e-01

Pos logit contributions
 sidelines+0.041
 shower+0.039
 curtain+0.038
 stereo+0.038
 rinse+0.037
Neg logit contributions
over-0.043
id-0.043
 tal-0.042
 enough-0.041
 Bill-0.041
Token name
Token ID1438
Feature activation+1.77e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.012
 rinse+0.012
 shower+0.012
 curtain+0.012
Neg logit contributions
over-0.010
 enough-0.010
 Bill-0.010
id-0.010
 tal-0.010
Token!
Token ID0
Feature activation-1.75e-01

Pos logit contributions
over+0.019
 Bill+0.018
 tal+0.018
 Stan+0.017
colm+0.017
Neg logit contributions
 curtain-0.024
 shower-0.023
 curtains-0.023
 stereo-0.023
 carpet-0.022
Token It
Token ID632
Feature activation-1.69e+00

Pos logit contributions
 sidelines+0.006
 stereo+0.005
 shower+0.005
 curtain+0.005
 curtains+0.005
Neg logit contributions
over-0.004
 enough-0.004
id-0.004
 tal-0.004
 Bill-0.004
Token said
Token ID531
Feature activation-4.95e-01

Pos logit contributions
 sidelines+0.060
 stereo+0.056
 rinse+0.056
 shower+0.055
 curtain+0.054
Neg logit contributions
over-0.036
 enough-0.035
 Bill-0.034
id-0.034
 tal-0.032
Token in
Token ID287
Feature activation+1.77e-01

Pos logit contributions
 curtain+0.018
 sidelines+0.017
 shower+0.017
 sip+0.017
 tea+0.017
Neg logit contributions
over-0.010
id-0.009
 tal-0.008
 afloat-0.008
colm-0.008
Token a
Token ID257
Feature activation-2.48e-01

Pos logit contributions
id+0.003
over+0.003
 enough+0.003
 Bill+0.003
 tal+0.003
Neg logit contributions
 sidelines-0.007
 stereo-0.006
 shower-0.006
 curtain-0.006
 rinse-0.006
Token gru
Token ID22848
Feature activation+5.72e+00

Pos logit contributions
 sidelines+0.009
 shower+0.008
 stereo+0.008
 rinse+0.008
 curtain+0.008
Neg logit contributions
over-0.005
 enough-0.005
id-0.005
 Bill-0.005
 tal-0.005
Tokenff
Token ID487
Feature activation-5.37e-01

Pos logit contributions
 enough+0.111
over+0.110
 Bill+0.105
id+0.102
 meant+0.102
Neg logit contributions
 sidelines-0.200
 stereo-0.191
 shower-0.189
irs-0.187
 rinse-0.186
Token voice
Token ID3809
Feature activation+4.15e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 shower+0.016
 curtain+0.016
 rinse+0.015
Neg logit contributions
 enough-0.013
over-0.013
 Bill-0.013
id-0.012
 Stan-0.012
Token,
Token ID11
Feature activation+6.20e-01

Pos logit contributions
 tal+0.007
over+0.007
id+0.006
 afloat+0.006
 enough+0.006
Neg logit contributions
 shower-0.015
 curtain-0.015
 sidelines-0.015
 stereo-0.015
 curtains-0.014
Token "
Token ID366
Feature activation-3.74e-01

Pos logit contributions
over+0.016
 enough+0.015
id+0.015
 tal+0.015
 Bill+0.015
Neg logit contributions
 sidelines-0.019
 stereo-0.018
 shower-0.017
 rinse-0.017
 curtain-0.017
TokenThis
Token ID1212
Feature activation+9.64e-01

Pos logit contributions
 sidelines+0.010
 stereo+0.009
 rinse+0.009
 shower+0.009
irs+0.009
Neg logit contributions
 enough-0.011
over-0.011
 Bill-0.011
 tal-0.011
id-0.011
Token the
Token ID262
Feature activation-5.05e-01

Pos logit contributions
id+0.006
over+0.006
 enough+0.005
 tal+0.005
 Bill+0.005
Neg logit contributions
 sidelines-0.008
 rinse-0.008
 shower-0.008
 sip-0.008
 curtain-0.008
Token rice
Token ID11464
Feature activation+7.03e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.015
 rinse+0.015
irs+0.015
 shower+0.015
Neg logit contributions
 enough-0.012
over-0.012
 Bill-0.012
 tal-0.012
id-0.011
Token back
Token ID736
Feature activation+5.55e-01

Pos logit contributions
 Bill+0.014
over+0.014
id+0.014
 tal+0.013
 enough+0.013
Neg logit contributions
 sidelines-0.025
 rinse-0.024
 shower-0.024
 curtain-0.024
 stereo-0.024
Token in
Token ID287
Feature activation-3.32e-01

Pos logit contributions
 enough+0.008
over+0.008
 Bill+0.008
id+0.007
 tal+0.007
Neg logit contributions
 sidelines-0.024
 stereo-0.022
 rinse-0.022
 shower-0.022
irs-0.021
Token its
Token ID663
Feature activation+8.66e-01

Pos logit contributions
 sidelines+0.012
 rinse+0.011
 shower+0.011
 stereo+0.011
 sip+0.011
Neg logit contributions
over-0.007
id-0.007
 enough-0.007
 Bill-0.006
 tal-0.006
Token rightful
Token ID43133
Feature activation+5.08e+00

Pos logit contributions
 enough+0.020
over+0.020
 tal+0.020
 Bill+0.020
id+0.018
Neg logit contributions
 sidelines-0.030
 stereo-0.027
irs-0.027
 rinse-0.027
 shower-0.026
Token place
Token ID1295
Feature activation+4.13e-01

Pos logit contributions
id+0.133
over+0.132
 Bill+0.130
 meant+0.126
 afloat+0.123
Neg logit contributions
 bathroom-0.128
 curtain-0.120
 sidelines-0.119
 curtains-0.118
 carpet-0.118
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
over+0.008
id+0.008
 Bill+0.008
 enough+0.008
 tal+0.008
Neg logit contributions
 sidelines-0.015
 stereo-0.014
 shower-0.014
 rinse-0.014
 curtain-0.013
Token John
Token ID1757
Feature activation+3.04e-01

Pos logit contributions
 Bill+0.009
 enough+0.008
over+0.008
 tal+0.007
 Stan+0.007
Neg logit contributions
 sidelines-0.016
 rinse-0.015
 stereo-0.015
 shower-0.015
 curtain-0.015
Token was
Token ID373
Feature activation+8.45e-01

Pos logit contributions
over+0.006
 Bill+0.006
 enough+0.006
 tal+0.006
id+0.006
Neg logit contributions
 sidelines-0.010
 rinse-0.010
 stereo-0.010
 shower-0.010
 curtain-0.009
Token proud
Token ID6613
Feature activation-6.33e-01

Pos logit contributions
id+0.017
 tal+0.016
over+0.016
 Bill+0.016
 enough+0.015
Neg logit contributions
 sidelines-0.031
 rinse-0.029
 shower-0.029
 stereo-0.029
 bathroom-0.027
Token Ab
Token ID2275
Feature activation-5.91e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.015
 shower+0.015
irs+0.015
 rinse+0.015
Neg logit contributions
over-0.013
id-0.013
 enough-0.012
 tal-0.012
 meant-0.012
Tokenig
Token ID328
Feature activation-6.07e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.018
irs+0.018
Neg logit contributions
 enough-0.014
 Bill-0.013
over-0.013
id-0.013
 tal-0.013
Tokenail
Token ID603
Feature activation-8.95e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.011
 rinse+0.011
 shower+0.011
irs+0.010
Neg logit contributions
 enough-0.022
over-0.022
 Bill-0.022
id-0.021
 tal-0.021
Token for
Token ID329
Feature activation+1.87e+00

Pos logit contributions
 sidelines+0.033
 stereo+0.030
 rinse+0.030
 shower+0.030
 curtain+0.030
Neg logit contributions
over-0.017
 Bill-0.017
id-0.016
 enough-0.016
 tal-0.016
Token her
Token ID607
Feature activation-3.37e-01

Pos logit contributions
 enough+0.042
over+0.042
 Bill+0.041
id+0.040
 tal+0.040
Neg logit contributions
 sidelines-0.064
 stereo-0.059
 rinse-0.058
 shower-0.058
irs-0.057
Token polit
Token ID1408
Feature activation+5.01e+00

Pos logit contributions
 sidelines+0.011
 stereo+0.010
 rinse+0.010
 shower+0.010
irs+0.010
Neg logit contributions
 enough-0.008
over-0.008
 Bill-0.008
id-0.008
 tal-0.007
Tokeneness
Token ID9449
Feature activation+2.14e+00

Pos logit contributions
 Bill+0.098
 Stan+0.090
 Sid+0.089
 Ac+0.087
 enough+0.085
Neg logit contributions
 sidelines-0.175
 stereo-0.165
 rinse-0.158
 shower-0.156
irs-0.156
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
 Bill+0.033
over+0.033
id+0.032
 tal+0.032
 enough+0.031
Neg logit contributions
 sidelines-0.085
 stereo-0.081
 shower-0.081
 rinse-0.080
 curtain-0.078
Token then
Token ID788
Feature activation+7.05e-01

Pos logit contributions
 enough+0.018
over+0.018
 Bill+0.018
id+0.017
 tal+0.017
Neg logit contributions
 sidelines-0.044
 stereo-0.041
 rinse-0.040
 shower-0.040
irs-0.039
Token walked
Token ID6807
Feature activation+1.90e+00

Pos logit contributions
over+0.018
 enough+0.017
 Bill+0.017
id+0.017
 tal+0.017
Neg logit contributions
 sidelines-0.022
 rinse-0.020
 stereo-0.020
 shower-0.020
irs-0.019
Token away
Token ID1497
Feature activation-4.48e-01

Pos logit contributions
over+0.043
 enough+0.043
 Bill+0.043
 tal+0.043
id+0.042
Neg logit contributions
 sidelines-0.064
 stereo-0.058
 shower-0.057
 rinse-0.057
 curtain-0.056
Token word
Token ID1573
Feature activation+1.65e+00

Pos logit contributions
 enough+0.021
over+0.021
 Bill+0.021
id+0.020
 tal+0.020
Neg logit contributions
 sidelines-0.039
 stereo-0.036
 rinse-0.036
 shower-0.036
irs-0.035
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
 enough+0.038
over+0.037
 Bill+0.037
id+0.035
 tal+0.035
Neg logit contributions
 sidelines-0.057
 stereo-0.052
 rinse-0.052
 shower-0.051
irs-0.050
Token It
Token ID632
Feature activation-1.11e+00

Pos logit contributions
 sidelines+0.004
 stereo+0.003
 shower+0.003
 rinse+0.003
 curtain+0.003
Neg logit contributions
 Bill-0.003
over-0.003
 enough-0.003
 tal-0.002
id-0.002
Token is
Token ID318
Feature activation+1.08e+00

Pos logit contributions
 sidelines+0.041
 stereo+0.037
 rinse+0.036
 shower+0.036
irs+0.036
Neg logit contributions
 enough-0.024
 Bill-0.023
 meant-0.023
over-0.023
 tal-0.022
Token '
Token ID705
Feature activation+3.34e-01

Pos logit contributions
over+0.018
 enough+0.018
 Bill+0.018
id+0.017
 tal+0.017
Neg logit contributions
 sidelines-0.043
 stereo-0.040
 rinse-0.040
 shower-0.039
irs-0.039
Tokenbat
Token ID8664
Feature activation+4.96e+00

Pos logit contributions
 enough+0.009
 tal+0.009
 Bill+0.009
over+0.009
id+0.008
Neg logit contributions
 sidelines-0.010
 stereo-0.009
 rinse-0.009
irs-0.009
 shower-0.008
Token'.
Token ID4458
Feature activation-5.07e-01

Pos logit contributions
 enough+0.108
 Bill+0.106
 tal+0.104
over+0.099
id+0.095
Neg logit contributions
 sidelines-0.177
 stereo-0.166
irs-0.160
 rinse-0.160
 shower-0.159
Token It
Token ID632
Feature activation-8.33e-01

Pos logit contributions
 sidelines+0.018
 stereo+0.016
 rinse+0.016
 shower+0.016
irs+0.015
Neg logit contributions
 enough-0.012
over-0.011
 Bill-0.011
id-0.011
 tal-0.011
Token means
Token ID1724
Feature activation+2.29e+00

Pos logit contributions
 sidelines+0.029
 stereo+0.026
 rinse+0.026
 shower+0.025
irs+0.025
Neg logit contributions
 enough-0.020
over-0.019
 Bill-0.019
 tal-0.018
id-0.018
Token a
Token ID257
Feature activation+5.46e-01

Pos logit contributions
 enough+0.053
over+0.052
 Bill+0.052
 tal+0.049
id+0.049
Neg logit contributions
 sidelines-0.079
 stereo-0.072
 rinse-0.071
 shower-0.070
irs-0.069
Token stick
Token ID4859
Feature activation+1.22e+00

Pos logit contributions
 enough+0.013
over+0.013
 Bill+0.012
 tal+0.012
id+0.012
Neg logit contributions
 sidelines-0.018
 stereo-0.017
 rinse-0.017
irs-0.017
 shower-0.016
Token that
Token ID326
Feature activation+2.02e-01

Pos logit contributions
 sidelines+0.002
 stereo+0.002
 rinse+0.002
 curtain+0.002
 curtains+0.002
Neg logit contributions
 tal-0.003
over-0.003
id-0.003
 enough-0.003
 afloat-0.003
Token the
Token ID262
Feature activation-8.48e-02

Pos logit contributions
over+0.006
id+0.005
 tal+0.005
 enough+0.005
 Bill+0.005
Neg logit contributions
 sidelines-0.006
 shower-0.006
 rinse-0.006
 stereo-0.005
 curtain-0.005
Token penny
Token ID26369
Feature activation+9.19e-01

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 rinse+0.003
irs+0.003
 shower+0.003
Neg logit contributions
 enough-0.002
over-0.002
 Bill-0.002
 tal-0.002
id-0.002
Token find
Token ID1064
Feature activation-1.27e-01

Pos logit contributions
over+0.020
 tal+0.019
id+0.019
 Bill+0.018
 enough+0.018
Neg logit contributions
 sidelines-0.030
 shower-0.030
 curtain-0.029
 stereo-0.029
 rinse-0.029
Token its
Token ID663
Feature activation+7.82e-02

Pos logit contributions
 sidelines+0.005
 stereo+0.004
 rinse+0.004
 shower+0.004
 curtain+0.004
Neg logit contributions
over-0.002
id-0.002
 Bill-0.002
 tal-0.002
 enough-0.002
Token rightful
Token ID43133
Feature activation+4.93e+00

Pos logit contributions
over+0.002
id+0.002
 Bill+0.002
 enough+0.002
 Stan+0.002
Neg logit contributions
 sidelines-0.002
 shower-0.002
 rinse-0.002
 stereo-0.002
 curtain-0.002
Token owner
Token ID4870
Feature activation+1.81e+00

Pos logit contributions
over+0.105
id+0.101
 Bill+0.097
 afloat+0.095
 Stan+0.094
Neg logit contributions
 bathroom-0.147
 curtain-0.145
 curtains-0.144
 mirror-0.143
 carpet-0.140
Token.
Token ID13
Feature activation+3.74e-01

Pos logit contributions
 tal+0.033
over+0.033
id+0.032
 Bill+0.031
 enough+0.028
Neg logit contributions
 sidelines-0.063
 rinse-0.061
 stereo-0.061
 curtain-0.061
 shower-0.061
Token
Token ID220
Feature activation-1.04e+00

Pos logit contributions
 Bill+0.008
over+0.007
 enough+0.007
id+0.007
 tal+0.007
Neg logit contributions
 sidelines-0.014
 rinse-0.013
 stereo-0.013
 shower-0.013
irs-0.013
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
 sidelines+0.042
 stereo+0.038
 shower+0.038
 rinse+0.037
 curtain+0.037
Neg logit contributions
 Bill-0.020
over-0.018
 enough-0.018
id-0.018
 tal-0.017
Token
Token ID198
Feature activation-2.70e-02

Pos logit contributions
 Bill+0.001
over+0.001
id+0.001
 enough+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
 curtain-0.001
Token the
Token ID262
Feature activation+6.28e-01

Pos logit contributions
 enough+0.010
 Bill+0.010
over+0.010
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.014
 stereo-0.012
 rinse-0.012
 shower-0.012
irs-0.012
Token ancient
Token ID6156
Feature activation+2.23e-01

Pos logit contributions
 enough+0.014
over+0.013
 Bill+0.013
 tal+0.012
id+0.012
Neg logit contributions
 sidelines-0.022
irs-0.021
 stereo-0.021
 rinse-0.021
 shower-0.020
Token knob
Token ID36842
Feature activation+1.17e-01

Pos logit contributions
 enough+0.005
 Bill+0.005
over+0.005
 tal+0.005
id+0.004
Neg logit contributions
 sidelines-0.008
 rinse-0.007
 stereo-0.007
irs-0.007
 shower-0.007
Token found
Token ID1043
Feature activation+1.76e-01

Pos logit contributions
over+0.002
 enough+0.002
 Bill+0.002
id+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.004
 stereo-0.004
 rinse-0.004
 shower-0.004
 curtain-0.004
Token its
Token ID663
Feature activation-4.31e-01

Pos logit contributions
over+0.003
id+0.003
 tal+0.003
 enough+0.003
 Bill+0.003
Neg logit contributions
 sidelines-0.006
 curtain-0.006
 curtains-0.006
 stereo-0.006
irs-0.006
Token rightful
Token ID43133
Feature activation+4.83e+00

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 shower+0.013
 rinse+0.013
 curtain+0.013
Neg logit contributions
over-0.010
 enough-0.010
id-0.010
 Bill-0.010
 tal-0.010
Token place
Token ID1295
Feature activation+3.19e-01

Pos logit contributions
 afloat+0.091
over+0.091
id+0.091
 Bill+0.086
 enough+0.084
Neg logit contributions
 curtain-0.150
 bathroom-0.148
 curtains-0.147
 stereo-0.145
 carpet-0.144
Token in
Token ID287
Feature activation+6.47e-02

Pos logit contributions
over+0.005
id+0.005
 Bill+0.005
 enough+0.005
 tal+0.005
Neg logit contributions
 sidelines-0.012
 stereo-0.012
 shower-0.012
 rinse-0.011
 curtain-0.011
Token their
Token ID511
Feature activation+2.64e-01

Pos logit contributions
over+0.001
id+0.001
 enough+0.001
 Bill+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 shower-0.002
 rinse-0.002
irs-0.002
Token house
Token ID2156
Feature activation-6.18e-01

Pos logit contributions
 enough+0.007
over+0.007
 Bill+0.007
id+0.007
 tal+0.007
Neg logit contributions
 sidelines-0.008
 stereo-0.007
 rinse-0.007
 shower-0.007
irs-0.007
Token.
Token ID13
Feature activation-8.30e-02

Pos logit contributions
 sidelines+0.022
 stereo+0.021
 shower+0.020
 rinse+0.020
 curtain+0.020
Neg logit contributions
 Bill-0.012
 tal-0.012
id-0.012
over-0.012
 enough-0.012
Token animal
Token ID5044
Feature activation+3.37e-01

Pos logit contributions
 Bill+0.019
id+0.019
 enough+0.019
over+0.018
 Stan+0.018
Neg logit contributions
 sidelines-0.032
 curtain-0.031
 shower-0.030
 stereo-0.030
 curtains-0.029
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
 enough+0.006
over+0.006
 Bill+0.006
id+0.005
 tal+0.005
Neg logit contributions
 sidelines-0.013
 stereo-0.012
 rinse-0.012
 shower-0.012
irs-0.012
Token It
Token ID632
Feature activation-9.08e-01

Pos logit contributions
 enough+0.018
over+0.018
id+0.018
 tal+0.017
 Bill+0.017
Neg logit contributions
 sidelines-0.019
 rinse-0.019
 stereo-0.019
 shower-0.018
 curtain-0.018
Token was
Token ID373
Feature activation+7.52e-01

Pos logit contributions
 sidelines+0.029
 sip+0.029
 curtain+0.029
 stereo+0.028
 rinse+0.028
Neg logit contributions
over-0.023
id-0.021
 enough-0.021
 tal-0.019
 Bill-0.019
Token a
Token ID257
Feature activation+7.40e-01

Pos logit contributions
id+0.018
over+0.017
 tal+0.016
 Bill+0.015
est+0.015
Neg logit contributions
 sidelines-0.025
 curtain-0.024
 rinse-0.024
 curtains-0.024
 shower-0.024
Token gu
Token ID915
Feature activation+4.82e+00

Pos logit contributions
 enough+0.014
over+0.014
 Bill+0.014
 tal+0.013
id+0.013
Neg logit contributions
 sidelines-0.028
irs-0.026
 stereo-0.026
 rinse-0.026
 shower-0.025
Tokeninea
Token ID18343
Feature activation-2.35e+00

Pos logit contributions
 enough+0.097
 Bill+0.097
over+0.095
 tal+0.091
id+0.091
Neg logit contributions
 sidelines-0.175
 stereo-0.163
 rinse-0.161
 shower-0.159
irs-0.159
Token pig
Token ID12967
Feature activation+9.01e-01

Pos logit contributions
 sidelines+0.077
 curtain+0.072
 shower+0.071
 stereo+0.071
 curtains+0.071
Neg logit contributions
over-0.053
 Bill-0.052
 enough-0.051
id-0.050
 meant-0.050
Token with
Token ID351
Feature activation-7.22e-01

Pos logit contributions
 enough+0.019
 tal+0.019
 Bill+0.019
over+0.018
 meant+0.018
Neg logit contributions
 sidelines-0.028
 curtain-0.028
 stereo-0.028
 curtains-0.027
 shower-0.027
Token a
Token ID257
Feature activation+1.39e-01

Pos logit contributions
 sidelines+0.028
 stereo+0.025
 rinse+0.025
irs+0.025
 shower+0.025
Neg logit contributions
over-0.014
id-0.013
 enough-0.012
 Bill-0.012
 tal-0.012
Token scale
Token ID5046
Feature activation+1.03e+00

Pos logit contributions
 enough+0.002
over+0.002
 Bill+0.002
 tal+0.002
id+0.002
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 rinse-0.005
 shower-0.005
irs-0.005
Token her
Token ID607
Feature activation+2.52e-01

Pos logit contributions
over+0.006
 tal+0.005
 enough+0.005
id+0.005
 Bill+0.005
Neg logit contributions
 sidelines-0.008
 stereo-0.008
 shower-0.008
 rinse-0.008
 curtain-0.007
Token own
Token ID898
Feature activation+9.46e-01

Pos logit contributions
 enough+0.006
over+0.006
 Bill+0.006
id+0.006
 tal+0.006
Neg logit contributions
 sidelines-0.008
 stereo-0.008
 rinse-0.008
 shower-0.008
irs-0.007
Token special
Token ID2041
Feature activation+8.61e-02

Pos logit contributions
 enough+0.023
over+0.023
 Bill+0.023
 tal+0.022
id+0.022
Neg logit contributions
 sidelines-0.031
 stereo-0.028
 rinse-0.028
irs-0.027
 shower-0.027
Token slide
Token ID10649
Feature activation+1.19e+00

Pos logit contributions
 enough+0.002
over+0.002
 Bill+0.002
 tal+0.002
id+0.002
Neg logit contributions
 sidelines-0.003
 stereo-0.002
irs-0.002
 rinse-0.002
 shower-0.002
Token-
Token ID12
Feature activation-6.19e-01

Pos logit contributions
 tal+0.025
over+0.024
id+0.022
 Bill+0.022
 Stan+0.021
Neg logit contributions
 sidelines-0.043
 shower-0.040
 stereo-0.040
 rinse-0.039
 curtain-0.039
Tokenbat
Token ID8664
Feature activation+4.72e+00

Pos logit contributions
 sidelines+0.019
 stereo+0.017
 rinse+0.017
 shower+0.017
irs+0.017
Neg logit contributions
 enough-0.016
over-0.016
 Bill-0.016
 tal-0.016
id-0.015
Tokenhe
Token ID258
Feature activation-1.11e+00

Pos logit contributions
 tal+0.142
 enough+0.140
 Bill+0.137
over+0.130
 Sid+0.129
Neg logit contributions
 sidelines-0.127
 stereo-0.119
irs-0.115
 shower-0.112
 curtain-0.111
Token.
Token ID13
Feature activation+8.05e-01

Pos logit contributions
 sidelines+0.042
 shower+0.040
 stereo+0.040
 rinse+0.039
 curtain+0.039
Neg logit contributions
 tal-0.020
 Bill-0.020
 enough-0.020
id-0.020
over-0.019
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
 Bill+0.013
over+0.010
 Stan+0.010
 tal+0.009
 Roll+0.009
Neg logit contributions
 sidelines-0.037
 rinse-0.033
 stereo-0.033
 spill-0.033
irs-0.032
Token
Token ID198
Feature activation+2.97e-02

Pos logit contributions
 Bill+0.002
over+0.002
id+0.002
 tal+0.002
 enough+0.002
Neg logit contributions
 sidelines-0.005
 stereo-0.005
 rinse-0.005
 shower-0.005
 curtain-0.005
TokenShe
Token ID3347
Feature activation-7.17e-01

Pos logit contributions
over+0.001
id+0.000
 Bill+0.000
 Stan+0.000
Bill+0.000
Neg logit contributions
 sidelines-0.001
 curtains-0.001
 curtain-0.001
irs-0.001
 shower-0.001
Token aloud
Token ID32227
Feature activation+4.63e-01

Pos logit contributions
id+0.008
over+0.008
 tal+0.008
 Bill+0.007
 enough+0.007
Neg logit contributions
 sidelines-0.009
 stereo-0.009
 curtain-0.009
 shower-0.009
 curtains-0.009
Token,
Token ID11
Feature activation+3.28e-01

Pos logit contributions
over+0.009
 enough+0.009
 Bill+0.008
 tal+0.008
id+0.008
Neg logit contributions
 sidelines-0.018
 stereo-0.016
 rinse-0.016
 shower-0.016
 curtain-0.016
Token with
Token ID351
Feature activation-5.06e-01

Pos logit contributions
 enough+0.006
 Bill+0.006
over+0.005
id+0.005
 meant+0.005
Neg logit contributions
 sidelines-0.013
irs-0.012
 stereo-0.012
 rinse-0.012
 shower-0.012
Token her
Token ID607
Feature activation-2.17e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.015
 rinse+0.015
 shower+0.015
irs+0.015
Neg logit contributions
 enough-0.012
over-0.012
 Bill-0.012
id-0.012
 tal-0.012
Token favourite
Token ID12507
Feature activation+1.58e-01

Pos logit contributions
 sidelines+0.007
 stereo+0.006
 shower+0.006
 rinse+0.006
 curtain+0.006
Neg logit contributions
over-0.006
 enough-0.006
id-0.006
 Bill-0.006
 tal-0.005
Token words
Token ID2456
Feature activation+4.71e+00

Pos logit contributions
 enough+0.003
over+0.003
 Bill+0.003
 tal+0.003
est+0.003
Neg logit contributions
 sidelines-0.006
irs-0.005
 rinse-0.005
 stereo-0.005
 shower-0.005
Token being
Token ID852
Feature activation+1.76e+00

Pos logit contributions
over+0.083
 tal+0.080
 Bill+0.079
 enough+0.079
id+0.078
Neg logit contributions
 sidelines-0.182
 stereo-0.173
 rinse-0.171
 shower-0.170
 curtain-0.168
Token
Token ID6184
Feature activation-1.01e+00

Pos logit contributions
over+0.034
 enough+0.033
 Bill+0.033
id+0.033
 tal+0.033
Neg logit contributions
 sidelines-0.064
 stereo-0.061
 rinse-0.060
 shower-0.059
 curtain-0.058
Token
Token ID95
Feature activation-5.79e-01

Pos logit contributions
 sidelines+0.029
 rinse+0.026
irs+0.026
 stereo+0.026
 curtain+0.025
Neg logit contributions
over-0.031
id-0.028
 Bill-0.027
 enough-0.026
 Ac-0.024
Token
Token ID26391
Feature activation-5.42e-01

Pos logit contributions
 sidelines+0.018
irs+0.016
 rinse+0.016
 shower+0.016
 stereo+0.015
Neg logit contributions
over-0.016
id-0.015
 Bill-0.015
 enough-0.014
 tal-0.014
Token˜
Token ID41185
Feature activation+4.90e-01

Pos logit contributions
 sidelines+0.018
 stereo+0.017
 rinse+0.016
 shower+0.016
irs+0.016
Neg logit contributions
 enough-0.013
over-0.012
 Bill-0.012
 tal-0.012
id-0.012
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
 enough+0.019
over+0.018
 Bill+0.018
 tal+0.018
id+0.017
Neg logit contributions
 sidelines-0.044
 stereo-0.041
 rinse-0.040
 shower-0.040
irs-0.040
Token It
Token ID632
Feature activation-1.22e+00

Pos logit contributions
 enough+0.007
over+0.006
id+0.006
 Bill+0.006
 tal+0.006
Neg logit contributions
 sidelines-0.009
 stereo-0.008
 rinse-0.008
 shower-0.008
irs-0.008
Token was
Token ID373
Feature activation+5.84e-01

Pos logit contributions
 sidelines+0.042
 rinse+0.041
 stereo+0.040
 sip+0.040
 shower+0.039
Neg logit contributions
over-0.027
id-0.025
 enough-0.025
 Bill-0.025
 tal-0.024
Token a
Token ID257
Feature activation+5.30e-01

Pos logit contributions
id+0.013
over+0.013
 Bill+0.012
 tal+0.012
est+0.011
Neg logit contributions
 sidelines-0.020
 rinse-0.019
 curtain-0.018
 stereo-0.018
 shower-0.018
Token special
Token ID2041
Feature activation-1.22e-01

Pos logit contributions
 enough+0.010
over+0.010
 tal+0.010
 Bill+0.010
id+0.009
Neg logit contributions
 sidelines-0.020
irs-0.019
 stereo-0.018
 rinse-0.018
 shower-0.018
Token gu
Token ID915
Feature activation+4.70e+00

Pos logit contributions
 sidelines+0.004
 stereo+0.004
 shower+0.004
 rinse+0.004
 curtain+0.004
Neg logit contributions
 enough-0.003
over-0.003
 Bill-0.003
id-0.003
 tal-0.003
Tokeninea
Token ID18343
Feature activation-2.47e+00

Pos logit contributions
 enough+0.093
 Bill+0.092
over+0.091
 tal+0.087
id+0.086
Neg logit contributions
 sidelines-0.172
 stereo-0.161
 rinse-0.159
 shower-0.157
irs-0.157
Token pig
Token ID12967
Feature activation+8.39e-01

Pos logit contributions
 sidelines+0.083
 stereo+0.077
 curtain+0.076
 shower+0.076
 curtains+0.075
Neg logit contributions
over-0.053
 Bill-0.053
 enough-0.052
id-0.051
 meant-0.050
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 enough+0.018
 tal+0.017
 Bill+0.017
over+0.017
 meant+0.016
Neg logit contributions
 sidelines-0.026
 stereo-0.026
 curtain-0.026
 curtains-0.025
 carpet-0.025
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 enough+0.001
over+0.000
id+0.000
 Bill+0.000
 tal+0.000
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
irs-0.001
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 sidelines+0.009
 stereo+0.008
 rinse+0.008
 shower+0.008
irs+0.008
Neg logit contributions
 Bill-0.004
 enough-0.004
over-0.004
 tal-0.004
id-0.004
Token returned
Token ID4504
Feature activation+9.69e-01

Pos logit contributions
 enough+0.024
 Bill+0.024
 meant+0.022
 tal+0.021
over+0.021
Neg logit contributions
 sidelines-0.033
 stereo-0.031
irs-0.031
 shower-0.030
 curtain-0.029
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
over+0.022
 enough+0.021
id+0.021
 tal+0.021
 Bill+0.021
Neg logit contributions
 sidelines-0.033
 stereo-0.030
 rinse-0.030
 shower-0.030
irs-0.029
Token towel
Token ID24808
Feature activation-1.14e+00

Pos logit contributions
 sidelines+0.004
irs+0.004
 stereo+0.004
 rinse+0.004
 curtain+0.003
Neg logit contributions
 enough-0.005
over-0.005
 Bill-0.005
 tal-0.005
 meant-0.005
Token to
Token ID284
Feature activation+6.24e-01

Pos logit contributions
 sidelines+0.040
 stereo+0.037
 shower+0.037
 rinse+0.036
 curtain+0.036
Neg logit contributions
over-0.024
 Bill-0.024
 tal-0.024
id-0.024
 enough-0.023
Token its
Token ID663
Feature activation+3.65e-01

Pos logit contributions
 Bill+0.015
 enough+0.014
over+0.013
 Stan+0.013
 tal+0.013
Neg logit contributions
 sidelines-0.021
 stereo-0.020
 rinse-0.019
irs-0.019
 shower-0.019
Token rightful
Token ID43133
Feature activation+4.65e+00

Pos logit contributions
 enough+0.008
over+0.007
 Bill+0.007
 tal+0.007
id+0.007
Neg logit contributions
 sidelines-0.013
irs-0.013
 stereo-0.013
 rinse-0.012
 shower-0.012
Token owner
Token ID4870
Feature activation+1.37e+00

Pos logit contributions
over+0.087
 Bill+0.086
id+0.085
 enough+0.079
 tal+0.077
Neg logit contributions
 bathroom-0.156
 curtain-0.152
 sidelines-0.152
 shower-0.150
 stereo-0.150
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
 tal+0.029
 Bill+0.029
over+0.028
id+0.028
 enough+0.027
Neg logit contributions
 sidelines-0.047
 stereo-0.043
 shower-0.043
 rinse-0.043
 curtain-0.042
Token The
Token ID383
Feature activation+4.54e-01

Pos logit contributions
 Bill+0.009
 enough+0.008
over+0.008
 Stan+0.007
id+0.007
Neg logit contributions
 sidelines-0.016
 rinse-0.014
 curtain-0.014
 shower-0.014
irs-0.014
Token owner
Token ID4870
Feature activation+1.67e-01

Pos logit contributions
 enough+0.011
over+0.010
 Bill+0.010
 meant+0.009
 tal+0.009
Neg logit contributions
 sidelines-0.016
irs-0.015
 stereo-0.015
 rinse-0.014
 shower-0.014
Token was
Token ID373
Feature activation+3.12e-01

Pos logit contributions
over+0.004
 enough+0.004
 Bill+0.004
 tal+0.003
id+0.003
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 rinse-0.005
 shower-0.005
irs-0.005
Token put
Token ID1234
Feature activation-4.71e-01

Pos logit contributions
 enough+0.041
 Bill+0.039
 meant+0.037
 ac+0.034
 on+0.034
Neg logit contributions
 sidelines-0.045
irs-0.044
 stereo-0.042
 curtain-0.040
 shower-0.039
Token them
Token ID606
Feature activation+5.06e-01

Pos logit contributions
 sidelines+0.013
 rinse+0.012
 stereo+0.012
 shower+0.012
irs+0.012
Neg logit contributions
over-0.013
id-0.013
 Bill-0.013
 enough-0.013
 tal-0.013
Token back
Token ID736
Feature activation+8.89e-01

Pos logit contributions
over+0.010
 Bill+0.010
id+0.010
 tal+0.010
 enough+0.010
Neg logit contributions
 sidelines-0.018
 rinse-0.018
 shower-0.017
 stereo-0.017
 curtain-0.017
Token in
Token ID287
Feature activation-9.42e-01

Pos logit contributions
over+0.015
 Bill+0.015
 enough+0.015
 tal+0.014
id+0.014
Neg logit contributions
 sidelines-0.035
 stereo-0.033
 rinse-0.033
 shower-0.032
irs-0.032
Token their
Token ID511
Feature activation+1.36e-01

Pos logit contributions
 sidelines+0.032
 rinse+0.032
 shower+0.032
 bathroom+0.031
 stereo+0.030
Neg logit contributions
over-0.021
id-0.020
 Bill-0.019
 tal-0.018
 enough-0.018
Token rightful
Token ID43133
Feature activation+4.63e+00

Pos logit contributions
 enough+0.004
over+0.004
 tal+0.004
 Bill+0.004
 best+0.003
Neg logit contributions
 sidelines-0.004
irs-0.004
 stereo-0.003
 rinse-0.003
 shower-0.003
Token place
Token ID1295
Feature activation+4.17e-02

Pos logit contributions
over+0.138
 Bill+0.138
id+0.136
 enough+0.131
 afloat+0.129
Neg logit contributions
 bathroom-0.105
 sidelines-0.104
 curtain-0.103
 stereo-0.102
 curtains-0.101
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
over+0.001
 Bill+0.001
 enough+0.001
id+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.001
 rinse-0.001
 shower-0.001
 curtain-0.001
Token
Token ID198
Feature activation+4.66e-01

Pos logit contributions
 Bill+0.014
 enough+0.012
over+0.012
 tal+0.012
 Stan+0.012
Neg logit contributions
 sidelines-0.029
 stereo-0.027
 rinse-0.026
 shower-0.026
 curtain-0.026
Token
Token ID198
Feature activation+3.74e-01

Pos logit contributions
 Bill+0.009
over+0.009
 tal+0.009
 enough+0.008
id+0.008
Neg logit contributions
 sidelines-0.018
 stereo-0.017
 rinse-0.016
 shower-0.016
 curtain-0.016
TokenAfter
Token ID3260
Feature activation+5.86e-01

Pos logit contributions
over+0.008
 Bill+0.008
 enough+0.008
id+0.008
 tal+0.008
Neg logit contributions
 sidelines-0.013
 stereo-0.012
 shower-0.012
 curtain-0.012
 rinse-0.012
Token The
Token ID383
Feature activation+3.90e-01

Pos logit contributions
 Bill+0.015
 enough+0.014
over+0.014
 tal+0.014
id+0.013
Neg logit contributions
 sidelines-0.022
 rinse-0.020
 stereo-0.020
 shower-0.020
irs-0.019
Token boot
Token ID6297
Feature activation+2.47e-01

Pos logit contributions
 enough+0.009
over+0.009
 tal+0.008
 Bill+0.008
id+0.008
Neg logit contributions
 sidelines-0.014
 shower-0.013
 stereo-0.013
 curtain-0.012
 rinse-0.012
Token returned
Token ID4504
Feature activation+1.26e+00

Pos logit contributions
over+0.006
 enough+0.006
id+0.005
 tal+0.005
 Bill+0.005
Neg logit contributions
 sidelines-0.008
 stereo-0.007
 curtain-0.007
 rinse-0.007
 shower-0.007
Token to
Token ID284
Feature activation+1.49e-01

Pos logit contributions
over+0.029
 tal+0.026
 meant+0.026
id+0.026
 enough+0.025
Neg logit contributions
 sidelines-0.040
 curtain-0.037
 curtains-0.036
 stereo-0.036
 shower-0.036
Token its
Token ID663
Feature activation+1.75e-01

Pos logit contributions
 enough+0.003
 Bill+0.003
over+0.003
 tal+0.002
id+0.002
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 rinse-0.005
 shower-0.005
irs-0.005
Token rightful
Token ID43133
Feature activation+4.61e+00

Pos logit contributions
over+0.004
 enough+0.004
 Bill+0.004
id+0.004
 tal+0.004
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 rinse-0.005
 shower-0.005
 curtain-0.005
Token owners
Token ID4393
Feature activation+6.74e-01

Pos logit contributions
over+0.106
id+0.096
 Bill+0.093
 Humph+0.092
 afloat+0.092
Neg logit contributions
 bathroom-0.139
 curtain-0.132
 curtains-0.130
 carpet-0.130
 mirror-0.126
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
over+0.009
 Bill+0.009
 enough+0.009
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.029
 stereo-0.027
 shower-0.027
 rinse-0.027
 curtain-0.026
Token the
Token ID262
Feature activation+1.80e-01

Pos logit contributions
over+0.038
 enough+0.036
id+0.036
 Bill+0.035
 tal+0.035
Neg logit contributions
 sidelines-0.054
 stereo-0.049
 rinse-0.049
 shower-0.049
 curtain-0.047
Token conflict
Token ID5358
Feature activation+5.00e-01

Pos logit contributions
over+0.004
 enough+0.004
 tal+0.004
id+0.004
 Bill+0.004
Neg logit contributions
 sidelines-0.006
 shower-0.006
 stereo-0.006
 rinse-0.006
 curtain-0.006
Token was
Token ID373
Feature activation+1.28e+00

Pos logit contributions
over+0.010
 enough+0.010
 Bill+0.010
 tal+0.010
id+0.009
Neg logit contributions
 sidelines-0.018
 stereo-0.017
 rinse-0.017
 shower-0.017
irs-0.017
Token restore
Token ID11169
Feature activation-2.59e-01

Pos logit contributions
 enough+0.024
 Bill+0.024
 tal+0.023
over+0.023
 meant+0.022
Neg logit contributions
 sidelines-0.034
 stereo-0.031
 rinse-0.030
 shower-0.030
irs-0.030
Token it
Token ID340
Feature activation-1.96e-01

Pos logit contributions
irs+0.007
 sidelines+0.007
 rinse+0.007
 curtains+0.007
 shower+0.006
Neg logit contributions
over-0.007
id-0.007
 Bill-0.006
 tal-0.006
 enough-0.006
Token back
Token ID736
Feature activation+1.24e+00

Pos logit contributions
 sidelines+0.007
 rinse+0.006
 stereo+0.006
 shower+0.006
irs+0.006
Neg logit contributions
over-0.004
 Bill-0.004
id-0.004
 tal-0.004
 enough-0.004
Token to
Token ID284
Feature activation+1.06e+00

Pos logit contributions
over+0.029
id+0.028
 Bill+0.028
 enough+0.027
 tal+0.027
Neg logit contributions
 sidelines-0.040
 stereo-0.037
 shower-0.037
 rinse-0.037
 curtain-0.037
Token its
Token ID663
Feature activation+1.89e-01

Pos logit contributions
over+0.020
id+0.019
 enough+0.019
 Bill+0.018
 tal+0.018
Neg logit contributions
 sidelines-0.041
 rinse-0.038
 stereo-0.038
irs-0.037
 shower-0.037
Token rightful
Token ID43133
Feature activation+4.60e+00

Pos logit contributions
over+0.005
 Bill+0.005
id+0.005
 enough+0.005
 tal+0.004
Neg logit contributions
 sidelines-0.006
 stereo-0.005
 rinse-0.005
 shower-0.005
 curtain-0.005
Token place
Token ID1295
Feature activation-1.69e-01

Pos logit contributions
over+0.119
 Bill+0.117
id+0.117
 Humph+0.114
 afloat+0.112
Neg logit contributions
 bathroom-0.116
 curtain-0.112
 curtains-0.111
 carpet-0.109
 studio-0.105
Token.
Token ID13
Feature activation+5.99e-01

Pos logit contributions
 sidelines+0.006
 stereo+0.006
 rinse+0.006
 shower+0.006
 curtain+0.005
Neg logit contributions
over-0.003
id-0.003
 enough-0.003
 Bill-0.003
 tal-0.003
Token Although
Token ID4900
Feature activation+5.20e-01

Pos logit contributions
 enough+0.014
over+0.014
id+0.013
 Bill+0.013
 tal+0.013
Neg logit contributions
 sidelines-0.020
 stereo-0.018
 rinse-0.018
 shower-0.018
irs-0.018
Token Sammy
Token ID44067
Feature activation-1.19e-02

Pos logit contributions
over+0.015
id+0.014
aces+0.013
 tal+0.013
colm+0.013
Neg logit contributions
 shower-0.013
 sidelines-0.012
irs-0.012
 carpet-0.011
 rinse-0.011
Token was
Token ID373
Feature activation+4.03e-01

Pos logit contributions
 sidelines+0.000
 stereo+0.000
 rinse+0.000
 shower+0.000
 sip+0.000
Neg logit contributions
over-0.000
 Bill-0.000
id-0.000
 enough-0.000
 tal-0.000
Token slider
Token ID28982
Feature activation+4.23e-01

Pos logit contributions
 enough+0.022
over+0.021
 Bill+0.021
id+0.020
 tal+0.020
Neg logit contributions
 sidelines-0.032
 stereo-0.029
 rinse-0.029
 shower-0.029
irs-0.028
Token after
Token ID706
Feature activation+5.82e-01

Pos logit contributions
 enough+0.010
over+0.010
 Bill+0.010
 tal+0.010
id+0.010
Neg logit contributions
 sidelines-0.014
 stereo-0.013
 shower-0.013
 rinse-0.013
 curtain-0.012
Token her
Token ID607
Feature activation-1.07e+00

Pos logit contributions
 enough+0.010
 Bill+0.009
over+0.008
 meant+0.007
 once+0.007
Neg logit contributions
 sidelines-0.024
 rinse-0.024
 stereo-0.023
 Acc-0.022
 sip-0.022
Token slide
Token ID10649
Feature activation+7.23e-01

Pos logit contributions
 sidelines+0.035
 stereo+0.032
 shower+0.032
 rinse+0.032
irs+0.031
Neg logit contributions
 enough-0.026
over-0.026
 Bill-0.025
id-0.025
 tal-0.025
Token-
Token ID12
Feature activation-1.55e+00

Pos logit contributions
 tal+0.015
over+0.015
id+0.014
 Bill+0.014
 Stan+0.014
Neg logit contributions
 sidelines-0.026
 shower-0.024
 stereo-0.023
 rinse-0.023
 curtain-0.023
Tokenbat
Token ID8664
Feature activation+4.60e+00

Pos logit contributions
 sidelines+0.048
 stereo+0.044
 rinse+0.043
 shower+0.042
irs+0.042
Neg logit contributions
 enough-0.041
over-0.040
 Bill-0.040
 tal-0.039
id-0.039
Tokenhe
Token ID258
Feature activation-1.62e+00

Pos logit contributions
 tal+0.147
 enough+0.143
 Bill+0.142
 Sid+0.134
 Stan+0.134
Neg logit contributions
 sidelines-0.118
 stereo-0.108
irs-0.105
 shower-0.102
 curtain-0.099
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
 sidelines+0.063
 shower+0.059
 stereo+0.059
 rinse+0.058
 curtain+0.057
Neg logit contributions
 Bill-0.029
 tal-0.028
 enough-0.028
id-0.027
over-0.027
Token she
Token ID673
Feature activation-5.56e-01

Pos logit contributions
 enough+0.027
 Bill+0.027
 meant+0.024
over+0.024
 tal+0.024
Neg logit contributions
 sidelines-0.037
irs-0.035
 stereo-0.035
 rinse-0.034
 shower-0.033
Token was
Token ID373
Feature activation+8.92e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.019
irs+0.018
Neg logit contributions
over-0.011
 enough-0.011
 Bill-0.011
 tal-0.011
id-0.010
Token so
Token ID523
Feature activation-3.06e-01

Pos logit contributions
 enough+0.017
over+0.017
 Bill+0.016
id+0.016
 tal+0.016
Neg logit contributions
 sidelines-0.034
 stereo-0.032
 rinse-0.031
 shower-0.031
irs-0.031
Token things
Token ID1243
Feature activation+7.64e-01

Pos logit contributions
 sidelines+0.015
 stereo+0.014
 rinse+0.013
 shower+0.013
irs+0.013
Neg logit contributions
 enough-0.008
over-0.008
 Bill-0.008
 tal-0.008
id-0.008
Tokenâ
Token ID22940
Feature activation+8.68e-01

Pos logit contributions
 enough+0.014
over+0.014
 Bill+0.014
id+0.014
 tal+0.014
Neg logit contributions
 sidelines-0.029
 stereo-0.027
 rinse-0.027
 shower-0.026
irs-0.026
Token
Token ID26391
Feature activation-4.00e-01

Pos logit contributions
 enough+0.022
over+0.022
 Bill+0.022
 tal+0.021
id+0.021
Neg logit contributions
 sidelines-0.027
 stereo-0.025
 rinse-0.025
 shower-0.024
irs-0.024
Token
Token ID447
Feature activation+2.29e+00

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 rinse+0.013
 shower+0.013
irs+0.012
Neg logit contributions
 enough-0.009
over-0.009
 Bill-0.009
 tal-0.008
id-0.008
Token
Token ID251
Feature activation+5.99e-01

Pos logit contributions
over+0.082
 enough+0.081
 Bill+0.080
id+0.080
 tal+0.079
Neg logit contributions
 sidelines-0.050
 stereo-0.044
 rinse-0.042
 shower-0.041
irs-0.040
Tokencol
Token ID4033
Feature activation-5.97e+00

Pos logit contributions
 enough+0.010
over+0.010
 Bill+0.010
 tal+0.009
id+0.008
Neg logit contributions
 sidelines-0.025
 stereo-0.023
irs-0.023
 curtain-0.022
 rinse-0.022
Tokenouring
Token ID21823
Feature activation-4.86e-01

Pos logit contributions
 sidelines+0.182
 bathroom+0.170
 carpet+0.170
 curtain+0.168
 shower+0.166
Neg logit contributions
id-0.167
 enough-0.157
ared-0.148
eb-0.146
 Bill-0.145
Token books
Token ID3835
Feature activation+7.42e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 rinse+0.016
 shower+0.016
irs+0.016
Neg logit contributions
over-0.010
 enough-0.010
 Bill-0.010
id-0.010
 tal-0.010
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 tal+0.015
over+0.014
 Bill+0.013
id+0.013
 Stan+0.013
Neg logit contributions
 sidelines-0.027
 shower-0.026
 rinse-0.026
 sip-0.025
 stereo-0.025
Token stickers
Token ID28568
Feature activation+1.92e+00

Pos logit contributions
 Bill+0.010
 enough+0.010
over+0.010
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.021
 stereo-0.019
irs-0.019
 rinse-0.019
 shower-0.019
Token and
Token ID290
Feature activation+8.53e-01

Pos logit contributions
 tal+0.032
over+0.030
 Bill+0.030
 Stan+0.029
id+0.027
Neg logit contributions
 sidelines-0.073
 shower-0.068
 curtain-0.068
 curtains-0.068
 stereo-0.068
Token a
Token ID257
Feature activation-2.39e-01

Pos logit contributions
over+0.014
 enough+0.013
 Bill+0.013
id+0.013
 tal+0.013
Neg logit contributions
 sidelines-0.033
 stereo-0.030
 rinse-0.030
 shower-0.030
irs-0.029
Token strange
Token ID6283
Feature activation+5.34e-02

Pos logit contributions
 sidelines+0.009
irs+0.009
 stereo+0.008
 rinse+0.008
 shower+0.008
Neg logit contributions
 enough-0.004
over-0.004
 Bill-0.004
 tal-0.004
id-0.004
Token,
Token ID11
Feature activation-5.17e-03

Pos logit contributions
 enough+0.001
over+0.001
 Bill+0.001
id+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.002
 shower-0.002
 stereo-0.002
 curtain-0.002
 rinse-0.002
Token raspberry
Token ID38973
Feature activation-6.11e-01

Pos logit contributions
 sidelines+0.000
 shower+0.000
 stereo+0.000
 curtain+0.000
 bathroom+0.000
Neg logit contributions
id-0.000
over-0.000
 enough-0.000
 Bill-0.000
 tal-0.000
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
 sidelines+0.020
 shower+0.019
 curtain+0.018
 stereo+0.018
 rinse+0.018
Neg logit contributions
 Bill-0.014
over-0.014
 enough-0.014
 tal-0.014
id-0.014
Tokencol
Token ID4033
Feature activation-5.88e+00

Pos logit contributions
 sidelines+0.045
 rinse+0.041
 shower+0.041
 stereo+0.040
irs+0.039
Neg logit contributions
 Bill-0.037
 enough-0.036
over-0.034
id-0.034
 tal-0.034
Tokenoured
Token ID8167
Feature activation-2.82e+00

Pos logit contributions
 sidelines+0.156
 curtain+0.154
 rinse+0.150
 shower+0.149
 carpet+0.149
Neg logit contributions
 enough-0.184
id-0.179
 Bill-0.166
 tal-0.157
over-0.157
Token object
Token ID2134
Feature activation-5.75e-01

Pos logit contributions
 sidelines+0.089
 stereo+0.081
 rinse+0.079
 shower+0.079
 curtain+0.077
Neg logit contributions
 enough-0.072
over-0.071
 Bill-0.071
id-0.069
 tal-0.068
Token hiding
Token ID11816
Feature activation+5.12e-01

Pos logit contributions
 sidelines+0.022
 stereo+0.021
 shower+0.021
 curtain+0.021
 rinse+0.021
Neg logit contributions
over-0.010
 Bill-0.010
 tal-0.010
 enough-0.009
id-0.009
Token under
Token ID739
Feature activation+2.98e-01

Pos logit contributions
 tal+0.014
 Bill+0.013
 Stan+0.013
id+0.013
over+0.013
Neg logit contributions
 curtain-0.014
 carpet-0.013
 curtains-0.013
 rinse-0.012
 shower-0.012
Token a
Token ID257
Feature activation+9.68e-02

Pos logit contributions
 enough+0.005
 Bill+0.005
over+0.005
 tal+0.005
id+0.004
Neg logit contributions
 sidelines-0.012
 stereo-0.011
 rinse-0.011
 shower-0.011
irs-0.011
Token delicious
Token ID12625
Feature activation+5.56e-01

Pos logit contributions
 sidelines+0.019
 rinse+0.017
 shower+0.017
 stereo+0.017
 curtain+0.017
Neg logit contributions
over-0.017
id-0.017
 enough-0.016
 tal-0.016
 Bill-0.016
Token vegetables
Token ID13701
Feature activation+2.23e+00

Pos logit contributions
 enough+0.014
over+0.014
 Bill+0.014
id+0.013
 tal+0.013
Neg logit contributions
 sidelines-0.018
 stereo-0.016
 rinse-0.016
 shower-0.016
irs-0.016
Token and
Token ID290
Feature activation+2.86e-02

Pos logit contributions
 Bill+0.046
 tal+0.046
id+0.046
over+0.046
 enough+0.041
Neg logit contributions
 sidelines-0.074
 shower-0.073
 curtain-0.072
 rinse-0.072
 carpet-0.071
Token bright
Token ID6016
Feature activation-1.56e+00

Pos logit contributions
 enough+0.001
 Bill+0.001
over+0.001
 tal+0.001
id+0.001
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
irs-0.001
Token-
Token ID12
Feature activation-9.97e-01

Pos logit contributions
 sidelines+0.055
 stereo+0.050
 rinse+0.049
 shower+0.048
irs+0.047
Neg logit contributions
 enough-0.038
over-0.036
 Bill-0.036
 tal-0.034
id-0.033
Tokencol
Token ID4033
Feature activation-5.88e+00

Pos logit contributions
 sidelines+0.033
 rinse+0.030
 shower+0.030
 stereo+0.029
 bathroom+0.028
Neg logit contributions
over-0.026
 Bill-0.026
 enough-0.024
id-0.024
 tal-0.023
Tokenoured
Token ID8167
Feature activation-2.68e+00

Pos logit contributions
 shower+0.167
 bathroom+0.165
 rinse+0.164
 sidelines+0.164
 curtain+0.162
Neg logit contributions
 enough-0.179
id-0.173
 Bill-0.168
over-0.155
 tal-0.154
Token flowers
Token ID12734
Feature activation+2.96e-01

Pos logit contributions
 sidelines+0.090
 stereo+0.083
 rinse+0.081
 shower+0.080
irs+0.079
Neg logit contributions
 enough-0.064
over-0.063
 Bill-0.062
 tal-0.060
id-0.060
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
over+0.005
 enough+0.005
 Bill+0.005
id+0.005
 tal+0.005
Neg logit contributions
 sidelines-0.011
 stereo-0.011
 rinse-0.010
 shower-0.010
irs-0.010
Token
Token ID220
Feature activation-1.07e+00

Pos logit contributions
 Bill+0.004
 enough+0.004
over+0.004
 tal+0.003
id+0.003
Neg logit contributions
 sidelines-0.007
 stereo-0.007
 rinse-0.007
 shower-0.007
irs-0.007
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
 sidelines+0.037
 stereo+0.034
 rinse+0.034
 shower+0.034
irs+0.033
Neg logit contributions
 enough-0.023
over-0.023
 Bill-0.023
id-0.022
 tal-0.022
Token,
Token ID11
Feature activation-2.77e-01

Pos logit contributions
 tal+0.002
id+0.002
 Bill+0.002
 Sid+0.002
over+0.002
Neg logit contributions
 curtain-0.002
 curtains-0.002
 rinse-0.002
 tea-0.002
 carpet-0.002
Token with
Token ID351
Feature activation-4.35e-01

Pos logit contributions
 sidelines+0.011
 stereo+0.010
 rinse+0.010
 shower+0.010
 curtains+0.009
Neg logit contributions
over-0.005
id-0.005
 tal-0.005
 enough-0.005
 Bill-0.004
Token all
Token ID477
Feature activation-7.62e-01

Pos logit contributions
 sidelines+0.016
 stereo+0.015
 rinse+0.015
 shower+0.015
irs+0.014
Neg logit contributions
 enough-0.009
 Bill-0.009
over-0.008
 tal-0.008
id-0.008
Token its
Token ID663
Feature activation-3.67e-02

Pos logit contributions
 sidelines+0.032
 stereo+0.029
 rinse+0.028
 shower+0.028
irs+0.027
Neg logit contributions
 enough-0.014
 Bill-0.013
 tal-0.012
 meant-0.012
over-0.012
Token different
Token ID1180
Feature activation-1.31e+00

Pos logit contributions
 sidelines+0.001
 stereo+0.001
 rinse+0.001
 shower+0.001
irs+0.001
Neg logit contributions
 enough-0.001
over-0.001
 Bill-0.001
 tal-0.001
id-0.001
Tokencol
Token ID4033
Feature activation-5.86e+00

Pos logit contributions
 sidelines+0.042
 stereo+0.038
 rinse+0.037
 shower+0.037
irs+0.036
Neg logit contributions
 enough-0.034
over-0.033
 Bill-0.033
id-0.032
 tal-0.032
Tokenours
Token ID4662
Feature activation+1.28e+00

Pos logit contributions
 sidelines+0.177
 curtain+0.172
 shower+0.171
 bathroom+0.170
 rinse+0.162
Neg logit contributions
id-0.167
 enough-0.164
over-0.152
 tal-0.151
 Bill-0.148
Token,
Token ID11
Feature activation-3.61e-01

Pos logit contributions
over+0.024
id+0.023
 Bill+0.023
 tal+0.022
 enough+0.022
Neg logit contributions
 sidelines-0.048
 rinse-0.045
 shower-0.045
 stereo-0.045
irs-0.044
Token shapes
Token ID15268
Feature activation+5.11e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.012
 rinse+0.012
 shower+0.011
irs+0.011
Neg logit contributions
over-0.008
 enough-0.008
 Bill-0.008
id-0.008
 tal-0.008
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
 enough+0.007
over+0.007
 Bill+0.007
id+0.006
 tal+0.006
Neg logit contributions
 sidelines-0.023
 stereo-0.021
 rinse-0.021
irs-0.020
 shower-0.020
Token textures
Token ID20028
Feature activation-4.38e-01

Pos logit contributions
 enough+0.008
 Bill+0.007
 tal+0.007
 meant+0.007
over+0.007
Neg logit contributions
 sidelines-0.009
 stereo-0.008
irs-0.008
 rinse-0.008
 shower-0.008
Token date
Token ID3128
Feature activation+6.98e-01

Pos logit contributions
 sidelines+0.020
 rinse+0.017
irs+0.017
 stereo+0.017
 shower+0.016
Neg logit contributions
 enough-0.035
over-0.034
 Bill-0.034
id-0.033
 tal-0.033
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
 tal+0.011
over+0.011
 Bill+0.010
 enough+0.010
id+0.010
Neg logit contributions
 sidelines-0.028
 curtain-0.027
 stereo-0.027
 shower-0.026
 rinse-0.026
Token a
Token ID257
Feature activation+2.70e-01

Pos logit contributions
id+0.014
over+0.014
 tal+0.013
 enough+0.013
 Bill+0.013
Neg logit contributions
 sidelines-0.010
 shower-0.009
 rinse-0.009
 curtain-0.009
 stereo-0.009
Token rainbow
Token ID27223
Feature activation-8.81e-01

Pos logit contributions
over+0.005
 enough+0.005
 Bill+0.005
 tal+0.005
id+0.005
Neg logit contributions
 sidelines-0.010
irs-0.010
 rinse-0.009
 stereo-0.009
 shower-0.009
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
 sidelines+0.034
 rinse+0.031
 stereo+0.031
irs+0.031
 Acc+0.029
Neg logit contributions
 enough-0.018
over-0.017
 Bill-0.016
 meant-0.016
id-0.016
Tokencol
Token ID4033
Feature activation-5.81e+00

Pos logit contributions
 sidelines+0.032
 rinse+0.028
 shower+0.027
 bathroom+0.027
 stereo+0.026
Neg logit contributions
 Bill-0.030
 enough-0.029
over-0.028
id-0.028
 meant-0.027
Tokenoured
Token ID8167
Feature activation-2.41e+00

Pos logit contributions
 sidelines+0.163
 carpet+0.157
 bathroom+0.157
 rinse+0.154
 curtain+0.152
Neg logit contributions
 enough-0.174
id-0.168
 Bill-0.157
over-0.149
ared-0.144
Token butterfly
Token ID35113
Feature activation-6.94e-01

Pos logit contributions
 sidelines+0.068
 rinse+0.063
irs+0.063
 stereo+0.062
 shower+0.057
Neg logit contributions
 enough-0.071
over-0.070
 Bill-0.068
 tal-0.065
id-0.064
Token flew
Token ID13112
Feature activation+1.05e+00

Pos logit contributions
 sidelines+0.029
 rinse+0.027
irs+0.027
 stereo+0.027
 Acc+0.026
Neg logit contributions
 enough-0.012
over-0.010
 meant-0.010
 Bill-0.010
 tal-0.010
Token in
Token ID287
Feature activation-7.47e-01

Pos logit contributions
over+0.026
 tal+0.025
id+0.025
 Bill+0.025
 enough+0.024
Neg logit contributions
 sidelines-0.033
 stereo-0.031
 shower-0.030
 curtain-0.030
 rinse-0.030
Token through
Token ID832
Feature activation-7.51e-01

Pos logit contributions
 sidelines+0.027
 shower+0.025
 stereo+0.025
 curtain+0.025
 rinse+0.025
Neg logit contributions
over-0.016
id-0.015
 tal-0.014
 Bill-0.014
 enough-0.014
Token day
Token ID1110
Feature activation-1.80e+00

Pos logit contributions
 sidelines+0.002
 stereo+0.002
 rinse+0.002
 shower+0.002
irs+0.002
Neg logit contributions
 enough-0.001
over-0.001
 Bill-0.001
 tal-0.001
id-0.001
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 sidelines+0.084
 shower+0.080
 stereo+0.079
 rinse+0.079
 curtain+0.078
Neg logit contributions
over-0.017
 enough-0.017
 Bill-0.016
 tal-0.016
id-0.016
Token a
Token ID257
Feature activation+5.79e-01

Pos logit contributions
 enough+0.030
 Bill+0.029
over+0.029
 tal+0.027
 meant+0.027
Neg logit contributions
 sidelines-0.032
 stereo-0.029
irs-0.028
 rinse-0.028
 shower-0.027
Token chocolate
Token ID11311
Feature activation+4.52e-01

Pos logit contributions
over+0.012
 enough+0.011
 Bill+0.011
 tal+0.011
id+0.011
Neg logit contributions
 sidelines-0.021
irs-0.020
 stereo-0.020
 rinse-0.020
 shower-0.019
Token-
Token ID12
Feature activation-1.29e+00

Pos logit contributions
over+0.009
 enough+0.009
 Bill+0.009
id+0.009
 meant+0.008
Neg logit contributions
 sidelines-0.017
irs-0.017
 rinse-0.015
 stereo-0.015
 Acc-0.014
Tokencol
Token ID4033
Feature activation-5.72e+00

Pos logit contributions
 sidelines+0.045
 rinse+0.041
 stereo+0.041
 shower+0.041
irs+0.040
Neg logit contributions
 Bill-0.030
over-0.029
 enough-0.029
id-0.028
 tal-0.027
Tokenoured
Token ID8167
Feature activation-2.42e+00

Pos logit contributions
 sidelines+0.150
 curtain+0.148
 carpet+0.147
 rinse+0.146
 bathroom+0.146
Neg logit contributions
 enough-0.177
id-0.168
 Bill-0.165
over-0.156
 tal-0.155
Token whip
Token ID22972
Feature activation+4.87e-01

Pos logit contributions
 sidelines+0.082
 stereo+0.075
 rinse+0.074
irs+0.073
 shower+0.073
Neg logit contributions
 enough-0.057
over-0.056
 Bill-0.055
 tal-0.053
id-0.053
Token named
Token ID3706
Feature activation-1.75e+00

Pos logit contributions
over+0.008
 Bill+0.007
 enough+0.007
 tal+0.007
id+0.007
Neg logit contributions
 sidelines-0.020
 stereo-0.019
 shower-0.019
 rinse-0.018
 curtain-0.018
Token C
Token ID327
Feature activation+8.59e-01

Pos logit contributions
 sidelines+0.055
 stereo+0.050
 rinse+0.049
 shower+0.048
irs+0.048
Neg logit contributions
 Bill-0.046
 enough-0.046
over-0.044
 tal-0.043
id-0.043
Tokenheddar
Token ID44937
Feature activation-1.03e+00

Pos logit contributions
id+0.025
over+0.025
 Bill+0.021
 enough+0.021
aw+0.021
Neg logit contributions
 sidelines-0.028
 rinse-0.025
 stereo-0.024
 shower-0.024
 sip-0.024
Token big
Token ID1263
Feature activation-5.00e-01

Pos logit contributions
irs+0.031
 sidelines+0.030
 stereo+0.029
 rinse+0.028
 Acc+0.026
Neg logit contributions
 enough-0.019
over-0.019
 Bill-0.018
 tal-0.017
 ac-0.017
Token,
Token ID11
Feature activation-2.20e-01

Pos logit contributions
 sidelines+0.016
 stereo+0.015
 rinse+0.015
irs+0.014
 shower+0.014
Neg logit contributions
 enough-0.013
over-0.012
 Bill-0.012
 tal-0.012
id-0.012
Token beautiful
Token ID4950
Feature activation-1.49e+00

Pos logit contributions
 sidelines+0.008
 shower+0.007
 rinse+0.007
 stereo+0.007
 curtain+0.007
Neg logit contributions
id-0.005
over-0.005
 enough-0.005
 Bill-0.005
 tal-0.005
Token rainbow
Token ID27223
Feature activation-1.12e+00

Pos logit contributions
 sidelines+0.045
irs+0.044
 rinse+0.041
 stereo+0.041
 Acc+0.036
Neg logit contributions
 enough-0.041
over-0.040
 Bill-0.039
 tal-0.038
 ac-0.036
Token-
Token ID12
Feature activation-1.09e+00

Pos logit contributions
 sidelines+0.048
 rinse+0.044
irs+0.044
 stereo+0.044
 Acc+0.042
Neg logit contributions
 enough-0.018
over-0.018
 Bill-0.016
id-0.015
 tal-0.015
Tokencol
Token ID4033
Feature activation-5.54e+00

Pos logit contributions
 sidelines+0.032
 rinse+0.030
 shower+0.028
 bathroom+0.027
 stereo+0.026
Neg logit contributions
 Bill-0.034
 enough-0.031
over-0.031
id-0.031
aw-0.030
Tokenoured
Token ID8167
Feature activation-2.53e+00

Pos logit contributions
 rinse+0.159
 sidelines+0.158
 bathroom+0.156
 carpet+0.155
 curtain+0.151
Neg logit contributions
 enough-0.169
id-0.152
 Bill-0.147
over-0.137
 tal-0.134
Token butterfly
Token ID35113
Feature activation-5.31e-01

Pos logit contributions
 sidelines+0.088
 rinse+0.082
 stereo+0.082
irs+0.081
 shower+0.078
Neg logit contributions
 enough-0.058
over-0.057
 Bill-0.056
 tal-0.053
id-0.052
Token!
Token ID0
Feature activation-3.99e-01

Pos logit contributions
 sidelines+0.023
 rinse+0.022
irs+0.022
 stereo+0.021
 shower+0.021
Neg logit contributions
 enough-0.008
over-0.008
 tal-0.007
 Bill-0.007
id-0.006
Token The
Token ID383
Feature activation-2.09e-01

Pos logit contributions
 sidelines+0.015
 rinse+0.014
 stereo+0.014
 shower+0.013
irs+0.013
Neg logit contributions
 Bill-0.009
over-0.008
 tal-0.007
 enough-0.007
 Stan-0.007
Token little
Token ID1310
Feature activation-2.36e+00

Pos logit contributions
irs+0.007
 sidelines+0.007
 rinse+0.007
 stereo+0.007
 Acc+0.006
Neg logit contributions
 enough-0.004
over-0.004
 best-0.004
 Bill-0.004
 tal-0.004
Token suddenly
Token ID6451
Feature activation-1.52e-01

Pos logit contributions
 enough+0.013
over+0.012
 Bill+0.012
id+0.012
 tal+0.012
Neg logit contributions
 sidelines-0.011
 stereo-0.010
 rinse-0.010
 shower-0.010
irs-0.009
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 sidelines+0.005
 stereo+0.005
 rinse+0.005
 shower+0.005
irs+0.005
Neg logit contributions
 enough-0.004
over-0.004
 Bill-0.003
id-0.003
 tal-0.003
Token a
Token ID257
Feature activation+9.39e-01

Pos logit contributions
 enough+0.030
over+0.030
 Bill+0.030
id+0.029
 tal+0.029
Neg logit contributions
 sidelines-0.028
 stereo-0.025
 rinse-0.025
 shower-0.025
irs-0.024
Token brightly
Token ID35254
Feature activation+3.91e-01

Pos logit contributions
 enough+0.016
 Bill+0.016
over+0.016
 tal+0.015
id+0.014
Neg logit contributions
 sidelines-0.036
irs-0.035
 stereo-0.034
 rinse-0.034
 shower-0.032
Token-
Token ID12
Feature activation+1.17e-01

Pos logit contributions
 enough+0.010
over+0.009
 Bill+0.009
id+0.009
 tal+0.009
Neg logit contributions
 sidelines-0.013
 stereo-0.012
 shower-0.012
 rinse-0.011
 curtain-0.011
Tokencol
Token ID4033
Feature activation-4.98e+00

Pos logit contributions
 Bill+0.003
over+0.003
 enough+0.002
id+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.004
 rinse-0.004
irs-0.004
 shower-0.004
 stereo-0.004
Tokenoured
Token ID8167
Feature activation-1.81e+00

Pos logit contributions
 sidelines+0.127
 rinse+0.123
 carpet+0.119
 bathroom+0.116
 shower+0.116
Neg logit contributions
 enough-0.157
id-0.153
 Bill-0.145
over-0.141
 tal-0.140
Token belt
Token ID10999
Feature activation+1.62e+00

Pos logit contributions
 sidelines+0.062
irs+0.060
 rinse+0.058
 stereo+0.058
 shower+0.054
Neg logit contributions
over-0.041
 enough-0.040
 Bill-0.040
 tal-0.038
id-0.038
Token caught
Token ID4978
Feature activation+1.21e+00

Pos logit contributions
over+0.034
 Bill+0.033
 enough+0.033
 tal+0.033
id+0.032
Neg logit contributions
 sidelines-0.058
 stereo-0.054
 shower-0.053
 rinse-0.052
 curtain-0.052
Token her
Token ID607
Feature activation-5.83e-01

Pos logit contributions
 tal+0.030
 Bill+0.029
over+0.028
id+0.028
 meant+0.026
Neg logit contributions
 sidelines-0.035
 curtain-0.035
 shower-0.034
irs-0.034
 curtains-0.034
Token eye
Token ID4151
Feature activation-1.17e-01

Pos logit contributions
 sidelines+0.020
 stereo+0.019
 shower+0.019
 rinse+0.018
 curtain+0.018
Neg logit contributions
 Bill-0.013
 enough-0.012
over-0.012
id-0.012
 tal-0.012
Token want
Token ID765
Feature activation-3.60e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.015
 rinse+0.015
 shower+0.015
irs+0.014
Neg logit contributions
 enough-0.016
over-0.016
 Bill-0.016
 tal-0.016
id-0.016
Token to
Token ID284
Feature activation-7.19e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.012
 rinse+0.012
 shower+0.012
 curtain+0.011
Neg logit contributions
 enough-0.008
 Bill-0.008
over-0.007
 tal-0.007
id-0.007
Token help
Token ID1037
Feature activation-2.04e+00

Pos logit contributions
 sidelines+0.018
 stereo+0.016
 rinse+0.016
 shower+0.016
irs+0.015
Neg logit contributions
 enough-0.023
over-0.023
 Bill-0.023
 tal-0.022
id-0.022
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
 sidelines+0.062
 rinse+0.059
irs+0.058
 shower+0.058
 stereo+0.058
Neg logit contributions
over-0.052
 tal-0.051
 enough-0.048
id-0.048
 Bill-0.047
Token B
Token ID347
Feature activation-1.81e+00

Pos logit contributions
over+0.007
 enough+0.007
id+0.007
 Bill+0.006
 tal+0.006
Neg logit contributions
 sidelines-0.008
 stereo-0.007
 shower-0.007
 rinse-0.007
irs-0.007
Tokenamb
Token ID4131
Feature activation-4.95e+00

Pos logit contributions
 sidelines+0.048
 stereo+0.045
irs+0.044
 rinse+0.044
 shower+0.043
Neg logit contributions
 enough-0.050
 Bill-0.049
over-0.048
 tal-0.047
 meant-0.046
Tokeni
Token ID72
Feature activation-9.62e-02

Pos logit contributions
 sidelines+0.201
 stereo+0.187
 rinse+0.185
 shower+0.184
irs+0.181
Neg logit contributions
 enough-0.084
over-0.082
 Bill-0.081
id-0.078
 tal-0.077
Token felt
Token ID2936
Feature activation-3.35e-01

Pos logit contributions
 sidelines+0.002
 vacuum+0.002
 sip+0.002
 stereo+0.002
 rinse+0.002
Neg logit contributions
over-0.003
 Bill-0.003
 tal-0.002
 enough-0.002
 afloat-0.002
Token sad
Token ID6507
Feature activation+1.16e-01

Pos logit contributions
 sidelines+0.012
 rinse+0.011
 shower+0.011
 stereo+0.010
 sip+0.010
Neg logit contributions
id-0.008
 Bill-0.008
over-0.007
 tal-0.007
 enough-0.007
Token and
Token ID290
Feature activation-8.52e-01

Pos logit contributions
 Bill+0.002
over+0.002
id+0.002
 tal+0.002
 Stan+0.002
Neg logit contributions
 sidelines-0.004
 shower-0.004
 rinse-0.004
 stereo-0.004
 curtain-0.004
Token scared
Token ID12008
Feature activation-8.65e-02

Pos logit contributions
 sidelines+0.031
 stereo+0.028
 rinse+0.028
 shower+0.028
 sip+0.027
Neg logit contributions
over-0.018
id-0.018
 enough-0.017
 Bill-0.017
 tal-0.017
Token -
Token ID532
Feature activation+2.06e+00

Pos logit contributions
over+0.010
 enough+0.010
 Bill+0.010
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.021
 stereo-0.020
 rinse-0.020
 shower-0.020
irs-0.019
Token a
Token ID257
Feature activation+9.74e-01

Pos logit contributions
 enough+0.040
 Bill+0.038
over+0.037
 tal+0.034
 meant+0.034
Neg logit contributions
 sidelines-0.080
 stereo-0.074
irs-0.073
 rinse-0.072
 shower-0.071
Token box
Token ID3091
Feature activation+8.11e-01

Pos logit contributions
 enough+0.021
 tal+0.020
 Bill+0.020
over+0.019
 best+0.018
Neg logit contributions
irs-0.034
 sidelines-0.034
 rinse-0.031
 stereo-0.031
 shower-0.029
Token of
Token ID286
Feature activation-6.99e-01

Pos logit contributions
 enough+0.014
over+0.013
 Bill+0.013
id+0.013
 tal+0.012
Neg logit contributions
 sidelines-0.033
 stereo-0.030
 rinse-0.030
irs-0.030
 shower-0.029
Token mult
Token ID1963
Feature activation-2.61e+00

Pos logit contributions
 sidelines+0.022
 stereo+0.020
irs+0.020
 rinse+0.020
 shower+0.019
Neg logit contributions
 enough-0.017
 Bill-0.017
 tal-0.016
over-0.016
 meant-0.016
Tokenicol
Token ID27045
Feature activation-4.94e+00

Pos logit contributions
 sidelines+0.063
 rinse+0.056
 stereo+0.055
irs+0.054
 shower+0.054
Neg logit contributions
 enough-0.088
over-0.086
 tal-0.085
 Bill-0.085
id-0.084
Tokenored
Token ID1850
Feature activation-2.29e+00

Pos logit contributions
 sidelines+0.120
 curtain+0.106
 stereo+0.106
 shower+0.105
 rinse+0.104
Neg logit contributions
 enough-0.171
over-0.164
id-0.160
 Bill-0.154
 tal-0.150
Token bubbles
Token ID25037
Feature activation-6.03e-02

Pos logit contributions
 sidelines+0.072
 stereo+0.066
 rinse+0.064
 shower+0.064
irs+0.063
Neg logit contributions
over-0.058
 enough-0.058
 Bill-0.058
id-0.056
 tal-0.056
Token!
Token ID0
Feature activation+7.27e-01

Pos logit contributions
 sidelines+0.003
 stereo+0.002
 rinse+0.002
irs+0.002
 shower+0.002
Neg logit contributions
 enough-0.001
over-0.001
 Bill-0.001
id-0.001
 tal-0.001
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
 Bill+0.013
 enough+0.011
over+0.011
 tal+0.010
 meant+0.009
Neg logit contributions
 sidelines-0.032
irs-0.029
 rinse-0.029
 stereo-0.028
 spill-0.028
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
 Bill+0.002
 tal+0.002
over+0.002
id+0.002
 enough+0.002
Neg logit contributions
 sidelines-0.007
 stereo-0.006
 rinse-0.006
 shower-0.006
 curtain-0.006
Tokeni
Token ID72
Feature activation-6.99e-01

Pos logit contributions
 sidelines+0.179
 stereo+0.166
irs+0.165
 rinse+0.163
 shower+0.161
Neg logit contributions
over-0.099
 enough-0.096
 Bill-0.094
 tal-0.092
id-0.090
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 sidelines+0.024
 stereo+0.023
 shower+0.023
 curtain+0.023
 rinse+0.022
Neg logit contributions
over-0.014
 Bill-0.014
id-0.014
 tal-0.014
 enough-0.013
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
over+0.003
 enough+0.003
id+0.003
 tal+0.003
 Bill+0.003
Neg logit contributions
 sidelines-0.004
 stereo-0.003
 shower-0.003
 rinse-0.003
 bathroom-0.003
Token
Token ID198
Feature activation-6.44e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.019
irs+0.019
Neg logit contributions
 enough-0.013
over-0.012
 Bill-0.012
id-0.012
 tal-0.012
TokenB
Token ID33
Feature activation-3.20e+00

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 rinse+0.016
 shower+0.015
irs+0.015
Neg logit contributions
 enough-0.019
over-0.019
 Bill-0.019
 tal-0.019
id-0.018
Tokenamb
Token ID4131
Feature activation-4.93e+00

Pos logit contributions
 sidelines+0.093
 stereo+0.087
irs+0.085
 rinse+0.085
 shower+0.084
Neg logit contributions
 enough-0.085
 Bill-0.083
over-0.082
 tal-0.080
id-0.078
Tokeni
Token ID72
Feature activation+4.55e-02

Pos logit contributions
 sidelines+0.197
 stereo+0.183
 rinse+0.181
 shower+0.179
irs+0.177
Neg logit contributions
 enough-0.087
over-0.085
 Bill-0.083
id-0.080
 tal-0.080
Token ran
Token ID4966
Feature activation+2.68e-01

Pos logit contributions
over+0.001
 enough+0.001
 tal+0.001
 Bill+0.001
 intact+0.001
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 sip-0.001
 carpet-0.001
 curtain-0.001
Token as
Token ID355
Feature activation-7.85e-01

Pos logit contributions
over+0.005
 enough+0.005
 Bill+0.005
 tal+0.005
id+0.005
Neg logit contributions
 sidelines-0.010
 stereo-0.009
 shower-0.009
 rinse-0.009
irs-0.009
Token fast
Token ID3049
Feature activation-2.01e-01

Pos logit contributions
 sidelines+0.024
 stereo+0.021
 rinse+0.021
 shower+0.020
 sip+0.020
Neg logit contributions
over-0.021
 enough-0.021
 tal-0.021
id-0.021
 Bill-0.019
Token as
Token ID355
Feature activation+2.41e-01

Pos logit contributions
 sidelines+0.007
 shower+0.006
 stereo+0.006
 rinse+0.006
 sip+0.006
Neg logit contributions
over-0.005
id-0.005
 Bill-0.005
 tal-0.004
 meant-0.004
Token was
Token ID373
Feature activation+9.30e-02

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 shower+0.003
 rinse+0.003
 curtain+0.003
Neg logit contributions
over-0.002
id-0.002
 enough-0.002
 tal-0.002
 Bill-0.002
Token trying
Token ID2111
Feature activation+9.83e-02

Pos logit contributions
over+0.002
id+0.002
 tal+0.002
 enough+0.002
 Bill+0.002
Neg logit contributions
 sidelines-0.003
 stereo-0.003
 shower-0.003
 rinse-0.003
 curtain-0.003
Token to
Token ID284
Feature activation+3.38e-02

Pos logit contributions
over+0.003
id+0.002
 Bill+0.002
 enough+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.003
 stereo-0.003
 shower-0.003
 rinse-0.003
irs-0.003
Token shoot
Token ID2686
Feature activation+1.65e-01

Pos logit contributions
over+0.001
 enough+0.001
 Bill+0.001
id+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
irs-0.001
Token B
Token ID347
Feature activation-2.86e+00

Pos logit contributions
over+0.005
id+0.004
 intact+0.004
colm+0.004
 Bill+0.004
Neg logit contributions
irs-0.004
 shower-0.004
 curtains-0.004
 stereo-0.004
 sidelines-0.004
Tokenamb
Token ID4131
Feature activation-4.92e+00

Pos logit contributions
 sidelines+0.085
 stereo+0.078
 shower+0.075
irs+0.075
 rinse+0.075
Neg logit contributions
 enough-0.076
 Bill-0.074
 tal-0.073
over-0.072
id-0.070
Tokeni
Token ID72
Feature activation-6.99e-01

Pos logit contributions
 sidelines+0.179
 stereo+0.166
irs+0.165
 rinse+0.163
 shower+0.161
Neg logit contributions
over-0.099
 enough-0.096
 Bill-0.094
 tal-0.092
id-0.090
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 sidelines+0.024
 stereo+0.023
 shower+0.023
 curtain+0.023
 rinse+0.022
Neg logit contributions
over-0.014
 Bill-0.014
id-0.014
 tal-0.014
 enough-0.013
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
over+0.003
 enough+0.003
id+0.003
 tal+0.003
 Bill+0.003
Neg logit contributions
 sidelines-0.004
 stereo-0.003
 shower-0.003
 rinse-0.003
 bathroom-0.003
Token
Token ID198
Feature activation-6.44e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.019
irs+0.019
Neg logit contributions
 enough-0.013
over-0.012
 Bill-0.012
id-0.012
 tal-0.012
TokenB
Token ID33
Feature activation-3.20e+00

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 rinse+0.016
 shower+0.015
irs+0.015
Neg logit contributions
 enough-0.019
over-0.019
 Bill-0.019
 tal-0.019
id-0.018
Token in
Token ID287
Feature activation-4.78e-01

Pos logit contributions
over+0.020
id+0.020
 Bill+0.019
 tal+0.019
oc+0.018
Neg logit contributions
 sidelines-0.028
 carpet-0.027
 shower-0.027
 stereo-0.027
 bathroom-0.026
Token a
Token ID257
Feature activation-4.01e-01

Pos logit contributions
 sidelines+0.017
 shower+0.017
 bathroom+0.016
 stereo+0.016
 rinse+0.016
Neg logit contributions
over-0.010
id-0.010
 Bill-0.009
 tal-0.009
 enough-0.008
Token trap
Token ID12840
Feature activation+1.30e+00

Pos logit contributions
 sidelines+0.015
irs+0.015
 stereo+0.014
 rinse+0.014
 shower+0.014
Neg logit contributions
 enough-0.007
over-0.007
 tal-0.007
 Bill-0.007
id-0.006
Token.
Token ID13
Feature activation+6.54e-02

Pos logit contributions
 Bill+0.028
over+0.027
id+0.026
 tal+0.026
 enough+0.026
Neg logit contributions
 sidelines-0.046
 stereo-0.043
 shower-0.043
 curtain-0.042
 rinse-0.042
Token B
Token ID347
Feature activation-2.07e+00

Pos logit contributions
 enough+0.002
over+0.002
 tal+0.001
 Bill+0.001
id+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 shower-0.002
 rinse-0.002
irs-0.002
Tokenamb
Token ID4131
Feature activation-4.78e+00

Pos logit contributions
 sidelines+0.058
 stereo+0.055
irs+0.053
 rinse+0.052
 shower+0.052
Neg logit contributions
 enough-0.058
 Bill-0.055
 tal-0.054
over-0.054
 meant-0.051
Tokeni
Token ID72
Feature activation-2.87e-02

Pos logit contributions
 sidelines+0.189
 stereo+0.176
 rinse+0.174
 shower+0.172
irs+0.170
Neg logit contributions
 enough-0.086
over-0.085
 Bill-0.083
id-0.080
 tal-0.080
Token tried
Token ID3088
Feature activation-1.05e+00

Pos logit contributions
 sidelines+0.001
 stereo+0.001
 carpet+0.001
 curtain+0.001
 sip+0.001
Neg logit contributions
over-0.001
 enough-0.001
 Bill-0.001
 tal-0.001
 Stan-0.001
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
 sidelines+0.032
 stereo+0.029
 shower+0.029
 rinse+0.029
 sip+0.028
Neg logit contributions
over-0.030
 tal-0.027
 enough-0.026
id-0.026
 Bill-0.026
Token help
Token ID1037
Feature activation-1.40e+00

Pos logit contributions
 sidelines+0.004
 stereo+0.004
 rinse+0.004
 shower+0.003
irs+0.003
Neg logit contributions
 enough-0.005
over-0.005
 Bill-0.005
id-0.005
 tal-0.005
Token Th
Token ID536
Feature activation+6.30e-01

Pos logit contributions
 sidelines+0.048
 rinse+0.045
 stereo+0.045
 shower+0.044
irs+0.044
Neg logit contributions
over-0.031
 tal-0.030
 enough-0.030
 Bill-0.029
id-0.028
Token that
Token ID326
Feature activation+1.13e-01

Pos logit contributions
 sidelines+0.001
 stereo+0.001
 rinse+0.001
 shower+0.001
irs+0.001
Neg logit contributions
 enough-0.001
 Bill-0.001
over-0.001
 tal-0.001
id-0.001
Token!"
Token ID2474
Feature activation+6.71e-02

Pos logit contributions
over+0.003
 enough+0.003
 Bill+0.003
id+0.003
 tal+0.003
Neg logit contributions
 sidelines-0.003
 stereo-0.003
 rinse-0.003
 shower-0.003
 curtain-0.003
Token
Token ID220
Feature activation-6.71e-01

Pos logit contributions
over+0.001
 enough+0.001
 Bill+0.001
 tal+0.001
id+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 rinse-0.002
irs-0.002
 shower-0.002
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
 sidelines+0.025
 stereo+0.023
 rinse+0.022
 shower+0.022
 curtain+0.022
Neg logit contributions
over-0.014
 enough-0.014
 Bill-0.014
id-0.013
 tal-0.013
Token
Token ID198
Feature activation+4.78e-02

Pos logit contributions
over+0.004
 enough+0.003
 tal+0.003
 Bill+0.003
id+0.003
Neg logit contributions
 sidelines-0.007
 stereo-0.006
 rinse-0.006
 shower-0.006
irs-0.006
TokenCharl
Token ID24453
Feature activation-4.73e+00

Pos logit contributions
over+0.002
 enough+0.002
 Bill+0.002
id+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 rinse-0.001
 shower-0.001
irs-0.001
Tokenotte
Token ID11404
Feature activation-1.32e+00

Pos logit contributions
 sidelines+0.084
 rinse+0.070
 shower+0.070
 stereo+0.069
 curtain+0.067
Neg logit contributions
over-0.191
 enough-0.186
id-0.185
 Bill-0.180
 tal-0.177
Token thanked
Token ID26280
Feature activation-2.82e-01

Pos logit contributions
 sidelines+0.047
 stereo+0.044
 rinse+0.044
 shower+0.043
 sip+0.042
Neg logit contributions
over-0.027
 enough-0.027
 Bill-0.026
 tal-0.025
id-0.025
Token her
Token ID607
Feature activation-5.67e-02

Pos logit contributions
 sidelines+0.010
 stereo+0.009
 rinse+0.009
irs+0.009
 shower+0.008
Neg logit contributions
 Bill-0.007
 enough-0.006
over-0.006
id-0.006
 tal-0.006
Token mother
Token ID2802
Feature activation+8.16e-01

Pos logit contributions
 sidelines+0.002
 stereo+0.001
 rinse+0.001
 shower+0.001
 curtain+0.001
Neg logit contributions
over-0.002
 enough-0.002
 Bill-0.002
id-0.002
 tal-0.002
Token for
Token ID329
Feature activation+1.83e+00

Pos logit contributions
over+0.014
 Bill+0.013
 tal+0.013
id+0.012
 enough+0.012
Neg logit contributions
 sidelines-0.032
 stereo-0.030
 shower-0.030
 rinse-0.030
 curtain-0.030
Token.
Token ID13
Feature activation+6.98e-01

Pos logit contributions
 sidelines+0.003
 rinse+0.003
 shower+0.003
 stereo+0.003
 sip+0.003
Neg logit contributions
 tal-0.002
 Bill-0.002
id-0.002
over-0.002
 enough-0.002
Token
Token ID198
Feature activation-5.50e-01

Pos logit contributions
 Bill+0.014
 enough+0.013
over+0.012
 tal+0.012
id+0.012
Neg logit contributions
 sidelines-0.028
 stereo-0.026
 rinse-0.025
irs-0.025
 shower-0.024
Token
Token ID198
Feature activation-6.12e-01

Pos logit contributions
 sidelines+0.026
 stereo+0.023
 rinse+0.022
 shower+0.022
 salon+0.022
Neg logit contributions
 Bill-0.008
over-0.007
id-0.007
 tal-0.006
 enough-0.006
Token"
Token ID1
Feature activation+5.23e-01

Pos logit contributions
 sidelines+0.024
 stereo+0.022
 shower+0.021
irs+0.021
 rinse+0.021
Neg logit contributions
over-0.012
 Bill-0.012
 enough-0.012
id-0.011
 tal-0.011
TokenL
Token ID43
Feature activation-1.12e+00

Pos logit contributions
over+0.011
 Bill+0.011
 meant+0.010
 Stan+0.010
id+0.010
Neg logit contributions
 sidelines-0.018
irs-0.018
 curtain-0.017
 carpet-0.017
 shower-0.016
Tokenamb
Token ID4131
Feature activation-4.72e+00

Pos logit contributions
 sidelines+0.038
 stereo+0.034
 rinse+0.033
 shower+0.033
irs+0.032
Neg logit contributions
 enough-0.028
over-0.028
 Bill-0.027
id-0.026
 tal-0.026
Token?
Token ID30
Feature activation-1.25e+00

Pos logit contributions
 sidelines+0.156
 stereo+0.143
 rinse+0.141
 shower+0.140
irs+0.138
Neg logit contributions
 enough-0.115
over-0.114
 Bill-0.113
id-0.109
 tal-0.109
Token Like
Token ID4525
Feature activation-7.46e-01

Pos logit contributions
 sidelines+0.041
irs+0.037
 stereo+0.035
 rinse+0.034
 shower+0.034
Neg logit contributions
 Bill-0.033
over-0.032
 enough-0.031
 meant-0.029
 Stan-0.029
Token sheep
Token ID15900
Feature activation-2.26e-01

Pos logit contributions
 sidelines+0.026
 stereo+0.023
irs+0.023
 rinse+0.022
 curtain+0.022
Neg logit contributions
 enough-0.018
over-0.017
 Bill-0.017
 tal-0.016
 meant-0.015
Token?"
Token ID1701
Feature activation-3.69e-01

Pos logit contributions
 sidelines+0.009
 stereo+0.009
 rinse+0.009
 shower+0.009
 curtain+0.008
Neg logit contributions
 enough-0.004
over-0.004
 Bill-0.003
id-0.003
 tal-0.003
Token Mia
Token ID32189
Feature activation+2.45e-01

Pos logit contributions
 sidelines+0.011
 stereo+0.010
irs+0.009
 rinse+0.009
 shower+0.009
Neg logit contributions
 Bill-0.011
 enough-0.011
over-0.010
 meant-0.010
 Stan-0.010
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
 sidelines+0.029
 stereo+0.026
 rinse+0.026
 shower+0.026
 curtain+0.025
Neg logit contributions
 Bill-0.015
 enough-0.015
over-0.015
id-0.014
 tal-0.014
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
over+0.003
 Bill+0.003
 enough+0.003
id+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.005
 stereo-0.004
 rinse-0.004
 shower-0.004
 curtain-0.004
TokenFinally
Token ID11158
Feature activation-4.27e-01

Pos logit contributions
over+0.003
 enough+0.003
id+0.003
 Bill+0.003
 tal+0.003
Neg logit contributions
 sidelines-0.004
 stereo-0.004
 curtain-0.004
 shower-0.004
 rinse-0.004
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 shower+0.015
 sidelines+0.015
 rinse+0.014
 tea+0.014
irs+0.014
Neg logit contributions
 tal-0.007
over-0.007
id-0.007
 meant-0.007
aces-0.007
Token B
Token ID347
Feature activation-1.52e+00

Pos logit contributions
over+0.033
 tal+0.032
id+0.032
aces+0.030
 enough+0.029
Neg logit contributions
 sidelines-0.030
 shower-0.030
 rinse-0.028
 stereo-0.027
 bathroom-0.026
Tokenamb
Token ID4131
Feature activation-4.72e+00

Pos logit contributions
 sidelines+0.041
 stereo+0.039
irs+0.037
 rinse+0.037
 shower+0.037
Neg logit contributions
 enough-0.042
 Bill-0.041
 tal-0.040
over-0.040
 meant-0.039
Tokeni
Token ID72
Feature activation-1.78e-01

Pos logit contributions
 sidelines+0.190
 stereo+0.177
 rinse+0.175
 shower+0.173
irs+0.171
Neg logit contributions
 enough-0.081
over-0.080
 Bill-0.078
id-0.075
 tal-0.075
Token found
Token ID1043
Feature activation-3.62e-01

Pos logit contributions
 sidelines+0.005
 rinse+0.005
 shower+0.005
 sip+0.005
 bathroom+0.005
Neg logit contributions
over-0.004
 Bill-0.004
 tal-0.004
 enough-0.004
 Stan-0.004
Token a
Token ID257
Feature activation-1.18e-01

Pos logit contributions
 sidelines+0.013
irs+0.012
 shower+0.012
 stereo+0.012
 rinse+0.012
Neg logit contributions
over-0.008
id-0.007
 Bill-0.007
 tal-0.007
 enough-0.006
Token kind
Token ID1611
Feature activation+2.10e+00

Pos logit contributions
 sidelines+0.004
 shower+0.004
 stereo+0.004
 rinse+0.004
 curtain+0.004
Neg logit contributions
over-0.003
 Bill-0.003
 enough-0.003
id-0.003
 tal-0.003
Token hunter
Token ID19177
Feature activation+9.43e-01

Pos logit contributions
id+0.046
over+0.046
 Bill+0.044
 enough+0.044
 tal+0.043
Neg logit contributions
 sidelines-0.071
 shower-0.066
 curtain-0.066
 carpet-0.064
 stereo-0.064
TokenI
Token ID40
Feature activation-2.14e+00

Pos logit contributions
over+0.014
 Bill+0.014
 meant+0.014
id+0.013
 Stan+0.013
Neg logit contributions
 sidelines-0.018
irs-0.017
 stereo-0.016
 shower-0.016
 curtain-0.015
Token-
Token ID12
Feature activation-2.12e+00

Pos logit contributions
 sidelines+0.083
 stereo+0.073
irs+0.072
 curtain+0.070
 rinse+0.069
Neg logit contributions
 enough-0.045
 Bill-0.044
 meant-0.043
over-0.042
 tal-0.041
TokenI
Token ID40
Feature activation-1.99e+00

Pos logit contributions
 sidelines+0.073
 stereo+0.066
 curtain+0.066
 shower+0.065
irs+0.065
Neg logit contributions
 enough-0.054
 Bill-0.050
id-0.049
over-0.047
 meant-0.043
Token-
Token ID12
Feature activation-2.23e+00

Pos logit contributions
 sidelines+0.081
irs+0.069
 stereo+0.069
 curtain+0.066
 studio+0.065
Neg logit contributions
 meant-0.042
 enough-0.041
 Bill-0.040
over-0.037
 tal-0.036
TokenI
Token ID40
Feature activation-1.88e+00

Pos logit contributions
 sidelines+0.077
 stereo+0.070
 curtain+0.070
 carpet+0.069
irs+0.068
Neg logit contributions
 enough-0.057
 Bill-0.051
id-0.051
over-0.048
 an-0.044
Token p
Token ID279
Feature activation-4.71e+00

Pos logit contributions
 sidelines+0.080
irs+0.069
 stereo+0.067
 studio+0.065
 curtain+0.065
Neg logit contributions
 meant-0.038
 enough-0.036
 Bill-0.034
over-0.031
 tal-0.031
Token-
Token ID12
Feature activation-2.25e+00

Pos logit contributions
 sidelines+0.152
 carpet+0.137
 stereo+0.133
 curtain+0.127
 tv+0.126
Neg logit contributions
 enough-0.126
over-0.120
id-0.119
 at-0.111
 Bill-0.108
Tokenplayed
Token ID21542
Feature activation+2.97e-01

Pos logit contributions
 sidelines+0.076
 curtain+0.071
 stereo+0.071
 carpet+0.070
irs+0.068
Neg logit contributions
 enough-0.057
 Bill-0.052
id-0.051
over-0.048
 an-0.046
Token with
Token ID351
Feature activation-2.08e+00

Pos logit contributions
 enough+0.005
over+0.005
 Bill+0.005
 tal+0.005
id+0.004
Neg logit contributions
 sidelines-0.012
 stereo-0.011
 rinse-0.011
irs-0.011
 shower-0.011
Token the
Token ID262
Feature activation-6.92e-01

Pos logit contributions
 sidelines+0.073
 stereo+0.067
irs+0.065
 rinse+0.064
 studio+0.063
Neg logit contributions
 enough-0.049
 Bill-0.045
over-0.042
 Stan-0.041
 tal-0.041
Token p
Token ID279
Feature activation-3.38e+00

Pos logit contributions
 sidelines+0.023
irs+0.022
 stereo+0.020
 rinse+0.019
 actresses+0.018
Neg logit contributions
 enough-0.017
 meant-0.016
 best-0.016
 Bill-0.015
 ac-0.015
Token dolls
Token ID36062
Feature activation-2.95e-01

Pos logit contributions
 sidelines+0.030
irs+0.028
 stereo+0.026
 rinse+0.024
Girls+0.024
Neg logit contributions
 enough-0.022
over-0.021
 tal-0.020
 best-0.020
 meant-0.019
Token and
Token ID290
Feature activation+3.16e-01

Pos logit contributions
 sidelines+0.013
 stereo+0.012
irs+0.012
 rinse+0.012
 shower+0.012
Neg logit contributions
 enough-0.004
over-0.004
id-0.003
 Bill-0.003
 tal-0.003
Token pretend
Token ID16614
Feature activation-1.64e+00

Pos logit contributions
 enough+0.008
 neck+0.008
 ac+0.008
 tal+0.007
 Bill+0.007
Neg logit contributions
irs-0.010
 sidelines-0.010
 stereo-0.009
Girls-0.009
ugg-0.008
Token they
Token ID484
Feature activation+6.20e-02

Pos logit contributions
 sidelines+0.069
irs+0.062
 stereo+0.061
 rinse+0.058
 Acc+0.057
Neg logit contributions
 enough-0.031
 on-0.027
 meant-0.027
 Bill-0.026
,-0.026
Token were
Token ID547
Feature activation+3.75e-01

Pos logit contributions
over+0.001
 enough+0.001
 Bill+0.001
 tal+0.001
id+0.001
Neg logit contributions
 sidelines-0.003
 stereo-0.002
 rinse-0.002
 shower-0.002
irs-0.002
Token mer
Token ID4017
Feature activation-4.69e+00

Pos logit contributions
 enough+0.009
over+0.009
 Bill+0.008
 meant+0.008
id+0.008
Neg logit contributions
 sidelines-0.013
 stereo-0.012
irs-0.011
 rinse-0.011
 sip-0.011
Tokenmaid
Token ID23151
Feature activation-2.18e+00

Pos logit contributions
 sidelines+0.120
 stereo+0.103
 shower+0.102
 rinse+0.101
irs+0.099
Neg logit contributions
over-0.154
 enough-0.154
id-0.147
 Bill-0.145
 tal-0.138
Tokens
Token ID82
Feature activation+5.64e-01

Pos logit contributions
 sidelines+0.077
 stereo+0.068
 rinse+0.067
irs+0.066
 shower+0.064
Neg logit contributions
over-0.053
 enough-0.053
id-0.048
 Bill-0.048
 tal-0.045
Token or
Token ID393
Feature activation+1.38e+00

Pos logit contributions
 enough+0.010
over+0.010
 Bill+0.010
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.022
 stereo-0.021
 rinse-0.021
 shower-0.020
irs-0.020
Token pirates
Token ID27516
Feature activation+1.59e+00

Pos logit contributions
 enough+0.030
 an+0.026
 Bill+0.026
 best+0.025
 hunters+0.025
Neg logit contributions
 sidelines-0.053
irs-0.049
 stereo-0.047
 rinse-0.044
ugg-0.044
Token.
Token ID13
Feature activation+3.43e-01

Pos logit contributions
 tal+0.035
 Bill+0.034
id+0.032
over+0.032
 enough+0.031
Neg logit contributions
 sidelines-0.054
 stereo-0.051
 shower-0.051
 rinse-0.051
 curtain-0.050
Token had
Token ID550
Feature activation+1.24e+00

Pos logit contributions
 sidelines+0.004
 stereo+0.004
 curtain+0.004
 rinse+0.003
 shower+0.003
Neg logit contributions
id-0.003
over-0.003
 tal-0.003
 Bill-0.003
 enough-0.002
Token saved
Token ID7448
Feature activation+1.03e+00

Pos logit contributions
over+0.032
 tal+0.029
id+0.029
 afloat+0.028
 Bill+0.028
Neg logit contributions
 sidelines-0.033
 rinse-0.032
irs-0.032
 stereo-0.032
 shower-0.031
Token him
Token ID683
Feature activation+8.62e-01

Pos logit contributions
over+0.028
id+0.027
 tal+0.026
aces+0.024
 meant+0.024
Neg logit contributions
 sidelines-0.027
irs-0.027
 shower-0.024
 rinse-0.024
 curtains-0.024
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
id+0.020
over+0.020
 Bill+0.020
 tal+0.019
 Stan+0.018
Neg logit contributions
 sidelines-0.027
 shower-0.026
 rinse-0.025
 stereo-0.025
 sip-0.024
Token<|endoftext|>
Token ID50256
Feature activation-5.90e-01

Pos logit contributions
 Bill+0.008
 enough+0.007
over+0.007
 tal+0.007
id+0.007
Neg logit contributions
 sidelines-0.014
 rinse-0.013
 stereo-0.013
 shower-0.013
irs-0.013
TokenCharl
Token ID24453
Feature activation-4.65e+00

Pos logit contributions
 sidelines+0.014
 shower+0.014
 stereo+0.014
 curtain+0.013
irs+0.013
Neg logit contributions
 enough-0.018
over-0.017
 tal-0.017
 Bill-0.017
id-0.017
Tokenotte
Token ID11404
Feature activation-9.23e-01

Pos logit contributions
 sidelines+0.106
 rinse+0.093
 stereo+0.093
 shower+0.091
 curtain+0.088
Neg logit contributions
over-0.159
 enough-0.157
 Bill-0.156
id-0.156
 tal-0.151
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
 sidelines+0.034
 stereo+0.032
 shower+0.032
 rinse+0.032
 curtain+0.031
Neg logit contributions
over-0.017
 enough-0.016
 tal-0.016
 Bill-0.016
id-0.016
Token her
Token ID607
Feature activation+5.28e-01

Pos logit contributions
over+0.023
 enough+0.023
id+0.022
 tal+0.022
 Bill+0.021
Neg logit contributions
 sidelines-0.038
 stereo-0.035
 shower-0.035
 rinse-0.035
irs-0.034
Token mom
Token ID1995
Feature activation+1.13e+00

Pos logit contributions
 enough+0.012
 Bill+0.012
over+0.012
 tal+0.011
id+0.011
Neg logit contributions
 sidelines-0.018
 stereo-0.017
 rinse-0.017
irs-0.016
 shower-0.016
Token were
Token ID547
Feature activation+1.99e-01

Pos logit contributions
 enough+0.026
 Bill+0.026
over+0.025
id+0.024
 tal+0.024
Neg logit contributions
 sidelines-0.040
 stereo-0.036
 rinse-0.035
 shower-0.035
irs-0.035
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
over+0.011
 tal+0.010
 Bill+0.010
 Sid+0.010
 Stan+0.010
Neg logit contributions
 stereo-0.011
 sidelines-0.011
 shower-0.010
 carpet-0.010
irs-0.010
Token
Token ID220
Feature activation-9.93e-01

Pos logit contributions
over+0.008
 enough+0.008
 tal+0.008
id+0.008
 Bill+0.008
Neg logit contributions
 sidelines-0.010
 stereo-0.009
 shower-0.009
 rinse-0.009
irs-0.009
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
 sidelines+0.033
 stereo+0.031
 rinse+0.030
 shower+0.030
irs+0.030
Neg logit contributions
 enough-0.023
over-0.023
 Bill-0.022
 tal-0.022
id-0.022
Token
Token ID198
Feature activation-3.08e-01

Pos logit contributions
 sidelines+0.008
 stereo+0.007
 rinse+0.007
 shower+0.007
irs+0.007
Neg logit contributions
 enough-0.006
over-0.006
 Bill-0.005
 tal-0.005
id-0.005
TokenB
Token ID33
Feature activation-3.17e+00

Pos logit contributions
 sidelines+0.009
 stereo+0.008
 rinse+0.008
 shower+0.008
irs+0.008
Neg logit contributions
 enough-0.009
over-0.009
 Bill-0.009
 tal-0.008
id-0.008
Tokenamb
Token ID4131
Feature activation-4.65e+00

Pos logit contributions
 sidelines+0.092
 stereo+0.085
 rinse+0.084
irs+0.083
 shower+0.083
Neg logit contributions
 enough-0.086
 Bill-0.084
over-0.083
 tal-0.082
id-0.079
Tokeni
Token ID72
Feature activation-4.71e-04

Pos logit contributions
 sidelines+0.175
 stereo+0.162
 rinse+0.160
 shower+0.158
irs+0.158
Neg logit contributions
over-0.090
 enough-0.090
 Bill-0.088
 tal-0.085
id-0.085
Token knew
Token ID2993
Feature activation-1.24e+00

Pos logit contributions
 sidelines+0.000
 sip+0.000
 stereo+0.000
 carpet+0.000
 vacuum+0.000
Neg logit contributions
over-0.000
 Bill-0.000
 tal-0.000
 enough-0.000
 afloat-0.000
Token he
Token ID339
Feature activation+4.25e-01

Pos logit contributions
 sidelines+0.036
 shower+0.034
 stereo+0.033
 rinse+0.033
irs+0.032
Neg logit contributions
over-0.033
 tal-0.031
id-0.031
est-0.029
 Bill-0.028
Token had
Token ID550
Feature activation+2.86e-01

Pos logit contributions
over+0.013
 tal+0.012
id+0.011
 Bill+0.011
 afloat+0.011
Neg logit contributions
 sip-0.010
 stereo-0.010
 sidelines-0.010
 shower-0.009
 rinse-0.009
Token to
Token ID284
Feature activation+4.61e-01

Pos logit contributions
over+0.010
est+0.008
id+0.008
 Bill+0.008
 tal+0.008
Neg logit contributions
 sidelines-0.007
 shower-0.007
 bathroom-0.007
 rinse-0.006
 stereo-0.006
Token,
Token ID11
Feature activation+4.59e-02

Pos logit contributions
 sidelines+0.005
 stereo+0.004
irs+0.004
 rinse+0.004
 shower+0.004
Neg logit contributions
 enough-0.002
over-0.002
 Bill-0.002
id-0.002
 tal-0.002
Token "
Token ID366
Feature activation-5.79e-01

Pos logit contributions
 enough+0.001
 Bill+0.001
over+0.001
 tal+0.001
id+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 rinse-0.002
irs-0.002
 shower-0.002
TokenWhat
Token ID2061
Feature activation+1.10e-01

Pos logit contributions
 sidelines+0.019
 curtain+0.018
 shower+0.017
 curtains+0.017
irs+0.017
Neg logit contributions
 Bill-0.015
id-0.014
over-0.014
 enough-0.013
 Stan-0.013
Token are
Token ID389
Feature activation+3.16e-01

Pos logit contributions
over+0.003
 tal+0.003
id+0.003
 enough+0.002
 Bill+0.002
Neg logit contributions
 shower-0.003
 stereo-0.003
 rinse-0.003
 sidelines-0.003
 curtain-0.003
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
 enough+0.011
 Bill+0.011
over+0.010
 best+0.010
 Stan+0.010
Neg logit contributions
 sidelines-0.008
 rinse-0.007
 stereo-0.007
 sip-0.006
 shower-0.006
Token rods
Token ID32858
Feature activation+3.24e+00

Pos logit contributions
 sidelines+0.001
 stereo+0.001
 rinse+0.001
 shower+0.001
irs+0.001
Neg logit contributions
 enough-0.000
over-0.000
 Bill-0.000
id-0.000
 tal-0.000
Token for
Token ID329
Feature activation+1.10e+00

Pos logit contributions
over+0.083
 enough+0.082
 Bill+0.082
id+0.081
 tal+0.081
Neg logit contributions
 sidelines-0.101
 stereo-0.091
 shower-0.091
 rinse-0.090
 curtain-0.088
Token?"
Token ID1701
Feature activation-5.78e-01

Pos logit contributions
 enough+0.029
 Bill+0.029
over+0.029
 tal+0.028
id+0.027
Neg logit contributions
 sidelines-0.034
 stereo-0.031
 rinse-0.030
 shower-0.030
irs-0.029
Token Jake
Token ID14757
Feature activation+4.40e-01

Pos logit contributions
 sidelines+0.021
 rinse+0.019
 stereo+0.019
 shower+0.019
irs+0.019
Neg logit contributions
 Bill-0.013
 enough-0.012
over-0.012
id-0.012
 tal-0.012
Token said
Token ID531
Feature activation-3.18e-02

Pos logit contributions
 enough+0.009
 Bill+0.009
over+0.009
id+0.009
 tal+0.009
Neg logit contributions
 sidelines-0.016
 stereo-0.015
 rinse-0.015
 shower-0.014
irs-0.014
Token that
Token ID326
Feature activation+3.00e-03

Pos logit contributions
 sidelines+0.001
 rinse+0.001
 stereo+0.001
 shower+0.001
 curtain+0.001
Neg logit contributions
 tal-0.001
over-0.001
id-0.000
 enough-0.000
 Bill-0.000
Token not
Token ID407
Feature activation+8.07e-01

Pos logit contributions
 enough+0.010
 Bill+0.009
over+0.008
 meant+0.008
 an+0.008
Neg logit contributions
 sidelines-0.030
 stereo-0.027
irs-0.027
 shower-0.027
 rinse-0.026
Token know
Token ID760
Feature activation-9.12e-01

Pos logit contributions
 enough+0.017
over+0.017
 Bill+0.017
 tal+0.016
 meant+0.016
Neg logit contributions
 sidelines-0.029
 stereo-0.027
irs-0.026
 rinse-0.026
 shower-0.026
Token what
Token ID644
Feature activation-1.13e+00

Pos logit contributions
 sidelines+0.032
 stereo+0.029
 rinse+0.028
 shower+0.028
irs+0.028
Neg logit contributions
 enough-0.021
 Bill-0.021
over-0.020
 tal-0.020
id-0.019
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
 sidelines+0.040
 stereo+0.038
 rinse+0.037
 shower+0.037
 curtain+0.036
Neg logit contributions
over-0.024
 enough-0.024
 Bill-0.023
id-0.023
 tal-0.023
Token say
Token ID910
Feature activation+1.14e+00

Pos logit contributions
 enough+0.037
over+0.037
id+0.037
 tal+0.036
 Bill+0.035
Neg logit contributions
 sidelines-0.026
 shower-0.024
 stereo-0.024
 rinse-0.024
 sip-0.023
Token next
Token ID1306
Feature activation+3.67e+00

Pos logit contributions
 tal+0.028
id+0.027
 Bill+0.026
over+0.026
 enough+0.024
Neg logit contributions
 sidelines-0.036
 stereo-0.034
 shower-0.034
 rinse-0.034
 curtain-0.033
Token.
Token ID13
Feature activation+8.84e-02

Pos logit contributions
 Bill+0.082
 tal+0.081
over+0.080
id+0.080
 enough+0.076
Neg logit contributions
 sidelines-0.118
 shower-0.117
 stereo-0.115
 curtain-0.115
 curtains-0.110
Token
Token ID198
Feature activation-3.72e-02

Pos logit contributions
 enough+0.002
over+0.002
 tal+0.002
id+0.002
 Bill+0.002
Neg logit contributions
 sidelines-0.003
 stereo-0.003
 rinse-0.003
 shower-0.003
irs-0.003
Token
Token ID198
Feature activation-7.57e-02

Pos logit contributions
 sidelines+0.001
 stereo+0.001
 rinse+0.001
 shower+0.001
 curtain+0.001
Neg logit contributions
 Bill-0.001
over-0.001
 enough-0.001
 tal-0.001
id-0.001
TokenThe
Token ID464
Feature activation+4.24e-01

Pos logit contributions
 sidelines+0.002
 stereo+0.002
 rinse+0.002
irs+0.002
 shower+0.002
Neg logit contributions
 enough-0.002
 tal-0.002
over-0.002
 Bill-0.002
 meant-0.002
Token man
Token ID582
Feature activation+2.50e-01

Pos logit contributions
 Bill+0.009
 tal+0.009
id+0.008
over+0.008
 enough+0.008
Neg logit contributions
 sidelines-0.016
 stereo-0.015
 curtain-0.015
 shower-0.015
 rinse-0.014
Token the
Token ID262
Feature activation-3.52e-01

Pos logit contributions
over+0.017
 enough+0.016
id+0.016
 tal+0.016
 meant+0.015
Neg logit contributions
 sidelines-0.027
 stereo-0.024
 rinse-0.024
 shower-0.024
irs-0.024
Token stolen
Token ID9909
Feature activation+6.25e-01

Pos logit contributions
 sidelines+0.012
 stereo+0.011
 rinse+0.011
 shower+0.011
irs+0.011
Neg logit contributions
 enough-0.008
over-0.008
 Bill-0.008
id-0.008
 tal-0.008
Token squares
Token ID24438
Feature activation+1.11e+00

Pos logit contributions
 enough+0.015
over+0.015
 Bill+0.015
id+0.014
 tal+0.014
Neg logit contributions
 sidelines-0.021
 stereo-0.019
 rinse-0.019
 shower-0.019
irs-0.018
Token to
Token ID284
Feature activation+3.84e-01

Pos logit contributions
over+0.021
 Bill+0.020
 enough+0.020
id+0.020
 tal+0.020
Neg logit contributions
 sidelines-0.042
 stereo-0.039
 rinse-0.039
 shower-0.038
irs-0.038
Token their
Token ID511
Feature activation-5.20e-02

Pos logit contributions
 enough+0.007
 Bill+0.007
over+0.007
 tal+0.007
id+0.006
Neg logit contributions
 sidelines-0.015
 stereo-0.014
 rinse-0.014
 shower-0.014
irs-0.013
Token rightful
Token ID43133
Feature activation+4.39e+00

Pos logit contributions
 sidelines+0.002
irs+0.002
 stereo+0.002
 rinse+0.002
 shower+0.002
Neg logit contributions
 enough-0.001
 Bill-0.001
over-0.001
 tal-0.001
id-0.001
Token place
Token ID1295
Feature activation-8.04e-02

Pos logit contributions
over+0.089
 Bill+0.082
id+0.082
 meant+0.078
 afloat+0.077
Neg logit contributions
 bathroom-0.140
 curtain-0.136
 sidelines-0.135
 carpet-0.133
 curtains-0.133
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
 sidelines+0.003
 shower+0.003
 stereo+0.003
 rinse+0.003
 curtain+0.003
Neg logit contributions
over-0.002
id-0.002
 tal-0.002
 enough-0.002
 Bill-0.002
Token
Token ID198
Feature activation+3.85e-01

Pos logit contributions
 Bill+0.009
 enough+0.009
over+0.009
 tal+0.009
id+0.008
Neg logit contributions
 sidelines-0.016
 stereo-0.015
 rinse-0.015
 shower-0.015
irs-0.015
Token
Token ID198
Feature activation+3.43e-01

Pos logit contributions
 enough+0.008
over+0.008
 Bill+0.008
 tal+0.008
id+0.008
Neg logit contributions
 sidelines-0.014
 stereo-0.012
 rinse-0.012
 shower-0.012
irs-0.012
TokenThe
Token ID464
Feature activation+6.75e-01

Pos logit contributions
 enough+0.007
over+0.006
 Bill+0.006
 tal+0.006
id+0.006
Neg logit contributions
 sidelines-0.013
 rinse-0.012
 stereo-0.012
 shower-0.012
irs-0.012
Token,
Token ID11
Feature activation+7.08e-02

Pos logit contributions
 tal+0.028
over+0.026
 Bill+0.026
id+0.025
 Stan+0.022
Neg logit contributions
 sidelines-0.035
 shower-0.035
 sip-0.033
 curtain-0.033
 stereo-0.033
Token the
Token ID262
Feature activation+4.51e-01

Pos logit contributions
over+0.002
id+0.002
 tal+0.002
 enough+0.001
 Bill+0.001
Neg logit contributions
 sidelines-0.002
 shower-0.002
 stereo-0.002
 rinse-0.002
 curtain-0.002
Token mean
Token ID1612
Feature activation+2.07e+00

Pos logit contributions
over+0.012
id+0.011
 enough+0.011
 Bill+0.011
 tal+0.011
Neg logit contributions
 sidelines-0.015
 shower-0.013
 stereo-0.013
 curtain-0.013
 rinse-0.013
Token boy
Token ID2933
Feature activation+1.09e-02

Pos logit contributions
over+0.051
 tal+0.051
id+0.050
 enough+0.046
vered+0.043
Neg logit contributions
 sidelines-0.065
 curtain-0.062
 carpet-0.062
 shower-0.061
 mirror-0.058
Token from
Token ID422
Feature activation+1.40e+00

Pos logit contributions
 tal+0.000
over+0.000
id+0.000
 Bill+0.000
 enough+0.000
Neg logit contributions
 sidelines-0.000
 shower-0.000
 sip-0.000
 stereo-0.000
 curtain-0.000
Token next
Token ID1306
Feature activation+3.38e+00

Pos logit contributions
 enough+0.024
 Bill+0.023
over+0.023
 tal+0.022
id+0.021
Neg logit contributions
 sidelines-0.057
 stereo-0.053
 rinse-0.053
irs-0.052
 shower-0.052
Token door
Token ID3420
Feature activation-6.78e-02

Pos logit contributions
 enough+0.104
 Bill+0.101
over+0.100
id+0.097
 tal+0.097
Neg logit contributions
 sidelines-0.092
 stereo-0.083
 rinse-0.083
irs-0.080
 shower-0.079
Token.
Token ID13
Feature activation-4.76e-03

Pos logit contributions
 sidelines+0.002
 curtain+0.002
 shower+0.002
 curtains+0.002
 stereo+0.002
Neg logit contributions
 tal-0.002
over-0.001
 Bill-0.001
id-0.001
 Stan-0.001
Token He
Token ID679
Feature activation+3.92e-01

Pos logit contributions
 sidelines+0.000
 stereo+0.000
 rinse+0.000
 shower+0.000
irs+0.000
Neg logit contributions
 enough-0.000
over-0.000
 Bill-0.000
id-0.000
 tal-0.000
Token was
Token ID373
Feature activation-8.18e-02

Pos logit contributions
over+0.011
 tal+0.009
 enough+0.009
id+0.009
 Bill+0.009
Neg logit contributions
 sidelines-0.012
 stereo-0.011
 sip-0.011
 rinse-0.011
 shower-0.011
Token holding
Token ID4769
Feature activation+1.14e-01

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 shower+0.003
 rinse+0.003
 curtain+0.002
Neg logit contributions
over-0.002
 tal-0.002
id-0.002
 Bill-0.002
 enough-0.002
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
 tal+0.051
id+0.046
over+0.045
 Bill+0.044
 Stan+0.042
Neg logit contributions
 sidelines-0.081
 stereo-0.079
 curtain-0.078
 shower-0.077
 curtains-0.077
Token He
Token ID679
Feature activation-8.75e-02

Pos logit contributions
 sidelines+0.026
 stereo+0.024
 rinse+0.024
 shower+0.023
irs+0.023
Neg logit contributions
 enough-0.010
over-0.010
 Bill-0.010
id-0.009
 tal-0.009
Token had
Token ID550
Feature activation-3.65e-01

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 rinse+0.003
 shower+0.003
irs+0.003
Neg logit contributions
 enough-0.002
over-0.002
 Bill-0.002
id-0.002
 tal-0.002
Token a
Token ID257
Feature activation-3.87e-01

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 rinse+0.013
 shower+0.012
irs+0.012
Neg logit contributions
over-0.007
id-0.007
 Bill-0.007
 enough-0.007
 tal-0.007
Token bus
Token ID1323
Feature activation+1.18e+00

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 rinse+0.013
irs+0.013
 shower+0.013
Neg logit contributions
 enough-0.008
over-0.008
 Bill-0.007
 tal-0.007
id-0.007
Tokenhy
Token ID12114
Feature activation+4.12e+00

Pos logit contributions
 tal+0.028
over+0.028
id+0.026
 Bill+0.026
 enough+0.026
Neg logit contributions
 sidelines-0.039
 stereo-0.036
 shower-0.035
 curtain-0.034
 rinse-0.034
Token tail
Token ID7894
Feature activation+1.23e+00

Pos logit contributions
 meant+0.088
 Bill+0.085
 afloat+0.083
 enough+0.082
over+0.080
Neg logit contributions
 sidelines-0.140
 curtain-0.129
 mirror-0.122
 stereo-0.121
 carpet-0.120
Token and
Token ID290
Feature activation-7.00e-01

Pos logit contributions
 tal+0.023
 Bill+0.023
over+0.022
id+0.022
 enough+0.020
Neg logit contributions
 sidelines-0.045
 curtain-0.043
 stereo-0.042
 shower-0.042
 carpet-0.041
Token a
Token ID257
Feature activation+7.89e-01

Pos logit contributions
 sidelines+0.026
 stereo+0.024
 shower+0.024
 rinse+0.023
 curtain+0.023
Neg logit contributions
over-0.014
id-0.014
 enough-0.014
 Bill-0.013
 tal-0.013
Token small
Token ID1402
Feature activation-5.69e-01

Pos logit contributions
 enough+0.016
over+0.015
 Bill+0.015
 tal+0.015
id+0.014
Neg logit contributions
 sidelines-0.029
irs-0.028
 stereo-0.027
 rinse-0.027
 shower-0.026
Token nose
Token ID9686
Feature activation+1.35e+00

Pos logit contributions
 sidelines+0.019
 shower+0.017
 stereo+0.017
 curtain+0.017
 rinse+0.017
Neg logit contributions
over-0.013
id-0.013
 Bill-0.013
 enough-0.013
 tal-0.012
Token jealous
Token ID19354
Feature activation+5.59e-01

Pos logit contributions
 sidelines+0.027
 rinse+0.025
 stereo+0.025
 shower+0.025
 sip+0.024
Neg logit contributions
id-0.017
over-0.016
 Bill-0.016
 enough-0.016
 tal-0.016
Token because
Token ID780
Feature activation-1.59e+00

Pos logit contributions
over+0.009
 Bill+0.009
 enough+0.009
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.022
 stereo-0.021
 shower-0.021
 rinse-0.021
 curtain-0.020
Token she
Token ID673
Feature activation-1.22e+00

Pos logit contributions
 sidelines+0.035
 shower+0.033
 stereo+0.032
 curtain+0.032
 curtains+0.031
Neg logit contributions
id-0.055
over-0.054
 tal-0.053
 enough-0.050
 afloat-0.048
Token wanted
Token ID2227
Feature activation-4.96e-03

Pos logit contributions
 sidelines+0.058
irs+0.053
 shower+0.053
 rinse+0.051
 stereo+0.051
Neg logit contributions
 meant-0.016
 Bill-0.014
 enough-0.013
id-0.013
 did-0.011
Token some
Token ID617
Feature activation+9.30e-01

Pos logit contributions
 sidelines+0.000
 stereo+0.000
irs+0.000
 rinse+0.000
 shower+0.000
Neg logit contributions
 enough-0.000
 Bill-0.000
over-0.000
 tal-0.000
 meant-0.000
Token cream
Token ID8566
Feature activation+3.32e-01

Pos logit contributions
 enough+0.022
over+0.021
 Bill+0.021
 tal+0.020
 meant+0.020
Neg logit contributions
 sidelines-0.032
 stereo-0.030
irs-0.029
 rinse-0.029
 shower-0.028
Token too
Token ID1165
Feature activation+6.14e-01

Pos logit contributions
 tal+0.008
over+0.007
id+0.007
 Bill+0.007
 enough+0.006
Neg logit contributions
 sidelines-0.011
 shower-0.010
 rinse-0.010
 stereo-0.010
 sip-0.010
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
 tal+0.013
over+0.013
id+0.013
 Bill+0.013
 enough+0.012
Neg logit contributions
 sidelines-0.021
 rinse-0.020
 shower-0.020
 stereo-0.019
 sip-0.019
Token
Token ID198
Feature activation-3.11e-01

Pos logit contributions
over+0.017
 enough+0.017
id+0.016
 tal+0.016
 Bill+0.015
Neg logit contributions
 sidelines-0.018
 stereo-0.017
 shower-0.017
 rinse-0.016
 curtain-0.016
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
 sidelines+0.009
 stereo+0.009
 shower+0.009
 rinse+0.009
irs+0.008
Neg logit contributions
 enough-0.008
over-0.008
 Bill-0.008
id-0.008
 tal-0.008
TokenK
Token ID42
Feature activation-1.66e+00

Pos logit contributions
 sidelines+0.010
 shower+0.009
 stereo+0.009
 rinse+0.009
 sip+0.009
Neg logit contributions
 enough-0.011
over-0.010
 tal-0.010
 Bill-0.010
id-0.010
Token was
Token ID373
Feature activation-3.35e-01

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 rinse+0.012
 shower+0.012
irs+0.012
Neg logit contributions
over-0.007
 enough-0.007
 Bill-0.007
id-0.007
 tal-0.007
Token so
Token ID523
Feature activation-1.08e+00

Pos logit contributions
 sidelines+0.014
 stereo+0.013
 rinse+0.013
 shower+0.013
irs+0.013
Neg logit contributions
 enough-0.005
over-0.005
 Bill-0.005
 tal-0.005
id-0.005
Token small
Token ID1402
Feature activation+3.72e-01

Pos logit contributions
 sidelines+0.046
 stereo+0.044
 shower+0.043
irs+0.043
 rinse+0.043
Neg logit contributions
 enough-0.016
over-0.015
 Bill-0.015
 tal-0.014
id-0.013
Token,
Token ID11
Feature activation-8.34e-02

Pos logit contributions
 tal+0.007
id+0.007
over+0.006
 Bill+0.006
 Sid+0.006
Neg logit contributions
 sidelines-0.013
 shower-0.013
 curtain-0.013
 curtains-0.013
 stereo-0.012
Token it
Token ID340
Feature activation-5.08e-01

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 shower+0.003
 rinse+0.003
 curtain+0.003
Neg logit contributions
over-0.002
id-0.002
 enough-0.002
 tal-0.002
 Bill-0.001
Token was
Token ID373
Feature activation+2.25e-01

Pos logit contributions
 sidelines+0.019
 stereo+0.018
 rinse+0.018
 shower+0.017
 curtain+0.017
Neg logit contributions
over-0.010
 enough-0.010
 Bill-0.009
id-0.009
 tal-0.009
Token hard
Token ID1327
Feature activation-4.83e-01

Pos logit contributions
over+0.005
id+0.004
 tal+0.004
 Bill+0.004
 enough+0.004
Neg logit contributions
 sidelines-0.008
 rinse-0.008
 stereo-0.008
 shower-0.008
 curtain-0.007
Token for
Token ID329
Feature activation+1.32e+00

Pos logit contributions
 sidelines+0.015
 shower+0.014
 stereo+0.014
 rinse+0.014
 curtain+0.013
Neg logit contributions
over-0.012
 Bill-0.012
id-0.012
 tal-0.012
 Stan-0.011
Token him
Token ID683
Feature activation+1.04e-01

Pos logit contributions
over+0.029
 tal+0.027
id+0.027
 enough+0.027
 meant+0.025
Neg logit contributions
 sidelines-0.047
irs-0.044
 shower-0.042
 stereo-0.041
 curtain-0.041
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
over+0.002
id+0.002
 Bill+0.002
 enough+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.004
 shower-0.003
 stereo-0.003
 rinse-0.003
 curtain-0.003
Token climb
Token ID12080
Feature activation+5.55e-01

Pos logit contributions
 enough+0.030
over+0.030
 Bill+0.030
id+0.029
 tal+0.029
Neg logit contributions
 sidelines-0.028
 stereo-0.025
 rinse-0.025
 shower-0.024
irs-0.024
Token it
Token ID340
Feature activation-1.48e-01

Pos logit contributions
over+0.006
id+0.006
 Bill+0.006
 tal+0.006
 Stan+0.006
Neg logit contributions
 curtains-0.006
 curtain-0.006
 rinse-0.006
irs-0.006
 tea-0.006
Token with
Token ID351
Feature activation-7.57e-01

Pos logit contributions
 carpet+0.004
 rinse+0.004
 curtain+0.004
 sip+0.004
 sidelines+0.004
Neg logit contributions
 tal-0.004
 Bill-0.004
id-0.004
over-0.004
 Stan-0.004
Token lots
Token ID6041
Feature activation-7.11e-01

Pos logit contributions
 sidelines+0.032
 stereo+0.030
 rinse+0.029
 shower+0.029
irs+0.029
Neg logit contributions
 enough-0.012
 Bill-0.012
over-0.011
 tal-0.011
id-0.011
Token of
Token ID286
Feature activation-7.70e-01

Pos logit contributions
 sidelines+0.027
 stereo+0.026
 rinse+0.025
 shower+0.024
irs+0.024
Neg logit contributions
 enough-0.015
over-0.013
 Bill-0.013
 tal-0.013
 meant-0.012
Token love
Token ID1842
Feature activation-1.28e+00

Pos logit contributions
 sidelines+0.028
 stereo+0.026
 shower+0.025
 rinse+0.024
irs+0.024
Neg logit contributions
 enough-0.018
 Bill-0.017
 tal-0.017
over-0.016
 meant-0.016
Token and
Token ID290
Feature activation+2.07e-01

Pos logit contributions
 sidelines+0.046
 rinse+0.043
 stereo+0.043
 shower+0.042
irs+0.042
Neg logit contributions
over-0.026
 enough-0.026
 Bill-0.025
 tal-0.025
id-0.025
Token attention
Token ID3241
Feature activation+3.70e-01

Pos logit contributions
 enough+0.005
 Bill+0.004
 meant+0.004
 tal+0.004
over+0.004
Neg logit contributions
 sidelines-0.008
 stereo-0.007
irs-0.007
 shower-0.007
 rinse-0.007
Token.
Token ID13
Feature activation+6.66e-01

Pos logit contributions
over+0.007
 enough+0.007
 Bill+0.007
 tal+0.006
id+0.006
Neg logit contributions
 sidelines-0.014
 stereo-0.013
 rinse-0.013
 shower-0.013
irs-0.013
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
 Bill+0.012
 enough+0.010
id+0.009
over+0.009
 Stan+0.009
Neg logit contributions
 sidelines-0.028
 rinse-0.027
 stereo-0.026
 shower-0.026
irs-0.026
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
 sidelines+0.007
 rinse+0.006
 stereo+0.006
 shower+0.006
 curtain+0.006
Neg logit contributions
 Bill-0.003
over-0.003
id-0.003
 enough-0.002
 Stan-0.002
TokenSur
Token ID14214
Feature activation-5.29e-01

Pos logit contributions
 sidelines+0.007
 shower+0.007
 rinse+0.007
 stereo+0.007
irs+0.007
Neg logit contributions
over-0.005
id-0.005
 Bill-0.005
 enough-0.005
 meant-0.004
Token time
Token ID640
Feature activation+2.42e+00

Pos logit contributions
 Bill+0.014
 enough+0.014
id+0.014
 tal+0.013
over+0.013
Neg logit contributions
 sidelines-0.028
 rinse-0.027
 shower-0.027
 stereo-0.027
 sip-0.026
Token there
Token ID612
Feature activation+1.95e+00

Pos logit contributions
 tal+0.044
over+0.044
id+0.042
 enough+0.041
 meant+0.039
Neg logit contributions
 shower-0.090
 curtain-0.088
irs-0.083
 sidelines-0.083
 sip-0.082
Token was
Token ID373
Feature activation+1.22e+00

Pos logit contributions
over+0.062
 tal+0.060
 enough+0.060
id+0.059
 hunters+0.058
Neg logit contributions
 shower-0.048
 bathroom-0.044
 sip-0.044
 curtain-0.044
 rinse-0.044
Token a
Token ID257
Feature activation+7.02e-01

Pos logit contributions
id+0.022
 tal+0.021
 enough+0.021
over+0.020
 Bill+0.020
Neg logit contributions
 sidelines-0.046
 shower-0.045
 stereo-0.044
 rinse-0.044
 curtain-0.043
Token rabbit
Token ID22746
Feature activation-6.29e-02

Pos logit contributions
over+0.013
 Bill+0.012
 enough+0.012
id+0.011
 tal+0.011
Neg logit contributions
 sidelines-0.027
irs-0.027
 stereo-0.025
 rinse-0.025
 shower-0.024
Token.
Token ID13
Feature activation+4.52e-01

Pos logit contributions
 sidelines+0.002
irs+0.002
 rinse+0.002
 stereo+0.002
 shower+0.002
Neg logit contributions
over-0.001
 enough-0.001
 Bill-0.001
 tal-0.001
id-0.001
Token She
Token ID1375
Feature activation-1.09e-01

Pos logit contributions
 enough+0.013
over+0.013
id+0.012
 tal+0.012
 best+0.011
Neg logit contributions
 sidelines-0.012
 sip-0.012
 stereo-0.012
 shower-0.012
 rinse-0.011
Token was
Token ID373
Feature activation-1.16e-01

Pos logit contributions
 sidelines+0.004
 stereo+0.004
 rinse+0.004
 shower+0.004
irs+0.004
Neg logit contributions
 enough-0.002
over-0.002
 Bill-0.002
id-0.002
 tal-0.002
Token very
Token ID845
Feature activation-1.57e+00

Pos logit contributions
 sidelines+0.004
 shower+0.004
 stereo+0.004
 rinse+0.004
 curtain+0.004
Neg logit contributions
id-0.003
 tal-0.002
over-0.002
 enough-0.002
 Bill-0.002
Token sad
Token ID6507
Feature activation-1.88e-01

Pos logit contributions
 sidelines+0.065
 stereo+0.060
irs+0.060
 rinse+0.058
 shower+0.058
Neg logit contributions
over-0.026
 Bill-0.025
 enough-0.025
 tal-0.022
 meant-0.022
Token because
Token ID780
Feature activation-1.38e+00

Pos logit contributions
 sidelines+0.008
 stereo+0.007
 rinse+0.007
 shower+0.007
irs+0.007
Neg logit contributions
 enough-0.003
over-0.003
 Bill-0.003
id-0.003
 tal-0.003
Token was
Token ID373
Feature activation+6.27e-01

Pos logit contributions
over+0.032
id+0.031
 tal+0.030
est+0.028
 Bill+0.027
Neg logit contributions
 sidelines-0.028
 rinse-0.025
 shower-0.024
 curtain-0.024
 bathroom-0.024
Token too
Token ID1165
Feature activation+1.37e+00

Pos logit contributions
over+0.012
id+0.012
 tal+0.012
 Bill+0.012
 enough+0.011
Neg logit contributions
 sidelines-0.024
 stereo-0.022
 rinse-0.022
 shower-0.021
 curtain-0.021
Token slow
Token ID3105
Feature activation+7.44e-01

Pos logit contributions
 tal+0.036
over+0.035
id+0.035
 Bill+0.035
 enough+0.035
Neg logit contributions
 rinse-0.037
 sidelines-0.035
 spill-0.035
 sip-0.034
 shower-0.034
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
 tal+0.019
over+0.018
 Bill+0.017
id+0.017
 Stan+0.017
Neg logit contributions
 sidelines-0.022
 stereo-0.021
 shower-0.021
 rinse-0.020
 curtain-0.020
Token Something
Token ID13742
Feature activation+4.32e-01

Pos logit contributions
over+0.004
 enough+0.004
id+0.004
 tal+0.004
 Bill+0.004
Neg logit contributions
 sidelines-0.005
 stereo-0.005
 shower-0.004
 rinse-0.004
 curtain-0.004
Token else
Token ID2073
Feature activation+6.35e-01

Pos logit contributions
over+0.012
id+0.011
 Bill+0.010
 tal+0.010
 enough+0.010
Neg logit contributions
 sidelines-0.013
 shower-0.013
 rinse-0.013
 sip-0.013
 stereo-0.012
Token was
Token ID373
Feature activation+7.66e-01

Pos logit contributions
over+0.016
id+0.014
 tal+0.014
 Bill+0.014
 enough+0.013
Neg logit contributions
 sidelines-0.020
 shower-0.020
 rinse-0.019
 stereo-0.019
 curtain-0.019
Token faster
Token ID5443
Feature activation+1.42e+00

Pos logit contributions
over+0.019
id+0.018
 tal+0.018
 Bill+0.018
 enough+0.017
Neg logit contributions
 sidelines-0.025
 shower-0.023
 stereo-0.023
 rinse-0.023
 curtain-0.022
Token!
Token ID0
Feature activation+4.44e-01

Pos logit contributions
over+0.026
 tal+0.026
id+0.025
 Bill+0.025
 Stan+0.023
Neg logit contributions
 sidelines-0.047
 shower-0.047
 stereo-0.046
 curtain-0.045
 spill-0.045
Token A
Token ID317
Feature activation+6.18e-01

Pos logit contributions
 enough+0.009
over+0.009
 Bill+0.009
id+0.009
 tal+0.009
Neg logit contributions
 sidelines-0.016
 stereo-0.015
 rinse-0.015
 shower-0.014
irs-0.014
Token frog
Token ID21264
Feature activation-1.71e-01

Pos logit contributions
 enough+0.013
over+0.013
 Bill+0.012
 tal+0.012
id+0.012
Neg logit contributions
 sidelines-0.023
 stereo-0.021
 rinse-0.021
irs-0.020
 shower-0.020
Token
Token ID198
Feature activation-3.40e-01

Pos logit contributions
 Bill+0.001
 Stan+0.001
 Sid+0.001
 enough+0.001
,+0.001
Neg logit contributions
 sidelines-0.003
 rinse-0.003
 stereo-0.003
irs-0.003
 sip-0.003
Token
Token ID198
Feature activation-4.10e-01

Pos logit contributions
 sidelines+0.016
 stereo+0.014
 rinse+0.014
 studio+0.014
 shower+0.014
Neg logit contributions
 Bill-0.004
id-0.004
 Stan-0.004
over-0.003
 tal-0.003
Token"
Token ID1
Feature activation+3.43e-02

Pos logit contributions
 sidelines+0.019
 stereo+0.018
 shower+0.018
 curtain+0.018
 curtains+0.018
Neg logit contributions
over-0.005
id-0.004
 Bill-0.004
 Stan-0.003
 Sid-0.003
TokenWhat
Token ID2061
Feature activation-6.13e-02

Pos logit contributions
 Bill+0.001
 meant+0.001
over+0.001
id+0.001
 enough+0.001
Neg logit contributions
 sidelines-0.001
irs-0.001
 stereo-0.001
 shower-0.001
 curtain-0.001
Token a
Token ID257
Feature activation-3.72e-01

Pos logit contributions
 shower+0.002
 rinse+0.002
 stereo+0.002
 curtain+0.002
 sidelines+0.002
Neg logit contributions
 tal-0.001
over-0.001
id-0.001
 enough-0.001
 Bill-0.001
Token lovely
Token ID14081
Feature activation-1.71e+00

Pos logit contributions
 sidelines+0.015
irs+0.014
 stereo+0.014
 rinse+0.013
 salon+0.013
Neg logit contributions
 enough-0.006
over-0.006
 Bill-0.006
 best-0.006
 tal-0.006
Token plant
Token ID4618
Feature activation-1.36e-02

Pos logit contributions
 sidelines+0.051
 stereo+0.045
irs+0.045
 rinse+0.044
 shower+0.041
Neg logit contributions
over-0.049
 enough-0.049
 Bill-0.048
 tal-0.047
id-0.046
Token!"
Token ID2474
Feature activation-2.95e-01

Pos logit contributions
 sidelines+0.001
 stereo+0.000
irs+0.000
 rinse+0.000
 shower+0.000
Neg logit contributions
 enough-0.000
over-0.000
 Bill-0.000
 meant-0.000
 tal-0.000
Token Mom
Token ID11254
Feature activation-5.95e-01

Pos logit contributions
 sidelines+0.010
 stereo+0.009
irs+0.009
 rinse+0.008
 shower+0.008
Neg logit contributions
 Bill-0.008
 enough-0.007
 Stan-0.007
 meant-0.007
over-0.007
Tokenmy
Token ID1820
Feature activation-8.38e-01

Pos logit contributions
 sidelines+0.024
 stereo+0.021
irs+0.021
 shower+0.021
 rinse+0.021
Neg logit contributions
 meant-0.012
 enough-0.012
 Bill-0.011
id-0.011
over-0.010
Token said
Token ID531
Feature activation+4.17e-02

Pos logit contributions
 sidelines+0.029
 stereo+0.028
 rinse+0.027
 shower+0.027
 curtain+0.026
Neg logit contributions
over-0.019
 tal-0.018
 enough-0.018
 Bill-0.017
id-0.016
Token go
Token ID467
Feature activation+9.59e-01

Pos logit contributions
 sidelines+0.025
 shower+0.023
 rinse+0.022
 stereo+0.022
 curtain+0.020
Neg logit contributions
over-0.030
 tal-0.030
id-0.029
 enough-0.029
 Bill-0.028
Token on
Token ID319
Feature activation+7.54e-01

Pos logit contributions
over+0.019
 enough+0.018
 Bill+0.018
 tal+0.018
id+0.018
Neg logit contributions
 sidelines-0.036
 stereo-0.033
 shower-0.033
 rinse-0.033
irs-0.032
Token a
Token ID257
Feature activation-4.64e-01

Pos logit contributions
id+0.018
over+0.018
 meant+0.016
 tal+0.016
 Bill+0.015
Neg logit contributions
 shower-0.025
 sidelines-0.025
 bathroom-0.024
 stereo-0.023
 tea-0.023
Token fast
Token ID3049
Feature activation-7.57e-01

Pos logit contributions
 sidelines+0.015
 stereo+0.013
 rinse+0.013
 shower+0.013
irs+0.013
Neg logit contributions
 enough-0.012
over-0.012
 Bill-0.012
id-0.011
 tal-0.011
Token ride
Token ID6594
Feature activation+5.56e-01

Pos logit contributions
 sidelines+0.025
 stereo+0.023
 rinse+0.022
 shower+0.022
irs+0.022
Neg logit contributions
 enough-0.019
over-0.018
 Bill-0.018
id-0.018
 tal-0.018
Token with
Token ID351
Feature activation-6.15e-01

Pos logit contributions
 tal+0.014
id+0.012
 Sid+0.012
 Stan+0.012
 Bill+0.012
Neg logit contributions
 sidelines-0.017
 stereo-0.016
 shower-0.016
 carpet-0.015
 curtain-0.015
Token his
Token ID465
Feature activation+6.63e-01

Pos logit contributions
 sidelines+0.025
 stereo+0.024
 shower+0.023
 rinse+0.023
irs+0.023
Neg logit contributions
over-0.010
id-0.009
 enough-0.009
 tal-0.009
 Bill-0.009
Token mom
Token ID1995
Feature activation+6.34e-01

Pos logit contributions
over+0.020
 enough+0.019
id+0.019
 Bill+0.018
 tal+0.018
Neg logit contributions
 sidelines-0.019
 shower-0.017
 stereo-0.017
 rinse-0.016
 bathroom-0.016
Token.
Token ID13
Feature activation+5.66e-01

Pos logit contributions
 tal+0.017
over+0.016
id+0.016
 Bill+0.016
 Sid+0.015
Neg logit contributions
 stereo-0.017
 shower-0.017
 sidelines-0.017
 rinse-0.017
 curtains-0.016
Token<|endoftext|>
Token ID50256
Feature activation-9.61e-01

Pos logit contributions
 Bill+0.011
 enough+0.010
over+0.009
 tal+0.009
id+0.009
Neg logit contributions
 sidelines-0.023
 rinse-0.021
 stereo-0.021
irs-0.020
 shower-0.020
TokenOnce
Token ID7454
Feature activation-6.14e-01

Pos logit contributions
 sidelines+0.025
 shower+0.024
 stereo+0.024
 curtain+0.023
 rinse+0.023
Neg logit contributions
 enough-0.028
over-0.027
 tal-0.027
 Bill-0.026
id-0.026
Token around
Token ID1088
Feature activation-1.83e+00

Pos logit contributions
 sidelines+0.016
 stereo+0.015
 rinse+0.014
 shower+0.014
irs+0.014
Neg logit contributions
 enough-0.006
over-0.006
 Bill-0.006
 tal-0.006
id-0.006
Token the
Token ID262
Feature activation+3.84e-01

Pos logit contributions
 sidelines+0.062
 shower+0.058
 curtain+0.057
 sip+0.056
 bathroom+0.055
Neg logit contributions
id-0.044
 tal-0.044
over-0.042
 Bill-0.041
 Stan-0.038
Token garden
Token ID11376
Feature activation-1.81e+00

Pos logit contributions
 enough+0.008
over+0.008
 ac+0.008
,+0.008
 best+0.008
Neg logit contributions
irs-0.014
 sidelines-0.013
 stereo-0.012
 rinse-0.012
 Acc-0.012
Token.
Token ID13
Feature activation+3.42e-01

Pos logit contributions
 curtain+0.056
 sidelines+0.056
 curtains+0.054
 carpet+0.053
 bathroom+0.053
Neg logit contributions
 tal-0.047
 Bill-0.045
id-0.044
over-0.042
 Stan-0.042
Token He
Token ID679
Feature activation+4.92e-01

Pos logit contributions
 enough+0.005
over+0.005
 Bill+0.005
 tal+0.004
id+0.004
Neg logit contributions
 sidelines-0.015
irs-0.014
 stereo-0.014
 rinse-0.013
 shower-0.013
Token sees
Token ID7224
Feature activation-2.58e-01

Pos logit contributions
 enough+0.012
 Bill+0.011
 meant+0.010
 tal+0.010
id+0.010
Neg logit contributions
 sidelines-0.017
irs-0.016
 stereo-0.015
 rinse-0.015
 shower-0.015
Token flowers
Token ID12734
Feature activation+2.36e-01

Pos logit contributions
 sidelines+0.011
 stereo+0.011
 rinse+0.010
 shower+0.010
irs+0.010
Neg logit contributions
 enough-0.004
 Bill-0.003
over-0.003
 tal-0.003
id-0.003
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
over+0.004
 tal+0.003
id+0.003
 Bill+0.003
 enough+0.003
Neg logit contributions
 sidelines-0.010
 curtain-0.010
 shower-0.010
 rinse-0.009
 stereo-0.009
Token grass
Token ID8701
Feature activation-3.88e-01

Pos logit contributions
 sidelines+0.050
 shower+0.045
 stereo+0.045
 rinse+0.045
 curtain+0.045
Neg logit contributions
over-0.050
id-0.048
 enough-0.048
 Bill-0.047
 tal-0.047
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 curtain+0.016
 sidelines+0.016
 shower+0.016
 curtains+0.015
 carpet+0.015
Neg logit contributions
over-0.006
 tal-0.006
id-0.005
 Bill-0.005
 enough-0.005
Token and
Token ID290
Feature activation+7.26e-02

Pos logit contributions
 sidelines+0.032
 curtain+0.030
 curtains+0.030
 shower+0.030
 rinse+0.029
Neg logit contributions
over-0.028
id-0.027
 tal-0.026
 Bill-0.025
 enough-0.025
Token relaxed
Token ID18397
Feature activation-6.92e-01

Pos logit contributions
 enough+0.010
over+0.010
 Bill+0.010
 tal+0.010
id+0.010
Neg logit contributions
 sidelines-0.016
 stereo-0.015
 rinse-0.015
 shower-0.015
irs-0.015
Token.
Token ID13
Feature activation+4.76e-01

Pos logit contributions
 sidelines+0.021
 shower+0.021
 rinse+0.020
 stereo+0.020
 curtain+0.020
Neg logit contributions
 tal-0.018
 Bill-0.017
id-0.016
 Stan-0.016
 Sid-0.015
Token
Token ID198
Feature activation+1.67e-01

Pos logit contributions
 Bill+0.009
over+0.008
 enough+0.008
 tal+0.007
 Stan+0.007
Neg logit contributions
 sidelines-0.020
 rinse-0.018
 stereo-0.018
 shower-0.018
 curtain-0.017
Token
Token ID198
Feature activation+9.80e-02

Pos logit contributions
 Bill+0.003
over+0.003
 tal+0.003
 enough+0.003
id+0.002
Neg logit contributions
 sidelines-0.007
 stereo-0.006
 rinse-0.006
 shower-0.006
 curtain-0.006
TokenWhen
Token ID2215
Feature activation-4.97e-02

Pos logit contributions
over+0.002
id+0.002
 Bill+0.002
 enough+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.004
 stereo-0.003
 curtain-0.003
 shower-0.003
 curtains-0.003
Token she
Token ID673
Feature activation-6.55e-01

Pos logit contributions
 sidelines+0.002
 stereo+0.002
 rinse+0.002
 shower+0.002
irs+0.001
Neg logit contributions
over-0.001
 enough-0.001
 Bill-0.001
id-0.001
 tal-0.001
Token opened
Token ID4721
Feature activation-4.71e-01

Pos logit contributions
 sidelines+0.026
 stereo+0.023
irs+0.023
 rinse+0.022
 curtain+0.022
Neg logit contributions
 enough-0.013
over-0.012
 Bill-0.012
 tal-0.012
 meant-0.012
Token her
Token ID607
Feature activation-4.08e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.016
 rinse+0.016
 shower+0.015
irs+0.015
Neg logit contributions
 enough-0.010
 Bill-0.010
over-0.009
 tal-0.009
id-0.009
Token eyes
Token ID2951
Feature activation+2.03e+00

Pos logit contributions
 sidelines+0.016
irs+0.014
 stereo+0.014
 rinse+0.014
Girls+0.013
Neg logit contributions
 enough-0.008
 tal-0.008
 Bill-0.007
over-0.007
 best-0.007
Token again
Token ID757
Feature activation+1.41e+00

Pos logit contributions
over+0.029
 tal+0.028
 Bill+0.028
id+0.027
 Stan+0.025
Neg logit contributions
 sidelines-0.085
 shower-0.083
 rinse-0.081
 stereo-0.081
 curtain-0.080
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 tal+0.017
over+0.017
id+0.016
 Bill+0.016
 enough+0.016
Neg logit contributions
 sidelines-0.061
 shower-0.060
 rinse-0.059
 curtain-0.057
 stereo-0.057
Token field
Token ID2214
Feature activation+8.45e-01

Pos logit contributions
 enough+0.011
over+0.011
 Bill+0.011
 tal+0.011
id+0.011
Neg logit contributions
 sidelines-0.016
 stereo-0.014
 rinse-0.014
 shower-0.014
irs-0.014
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
 tal+0.017
over+0.017
 Bill+0.017
id+0.017
 enough+0.016
Neg logit contributions
 sidelines-0.030
 shower-0.028
 stereo-0.028
 curtain-0.028
 rinse-0.027
Token
Token ID198
Feature activation+8.39e-02

Pos logit contributions
 Bill+0.009
 enough+0.009
over+0.008
 tal+0.008
id+0.008
Neg logit contributions
 sidelines-0.016
 stereo-0.014
 rinse-0.014
 shower-0.014
irs-0.014
Token
Token ID198
Feature activation+6.27e-02

Pos logit contributions
 Bill+0.002
 tal+0.001
over+0.001
 enough+0.001
id+0.001
Neg logit contributions
 sidelines-0.003
 stereo-0.003
 rinse-0.003
 shower-0.003
 curtain-0.003
TokenTim
Token ID14967
Feature activation-8.76e-02

Pos logit contributions
 Bill+0.001
over+0.001
id+0.001
 enough+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.002
 stereo-0.002
 curtain-0.002
 shower-0.002
 rinse-0.002
Tokenmy
Token ID1820
Feature activation-6.16e-01

Pos logit contributions
 sidelines+0.002
 stereo+0.002
 rinse+0.002
 shower+0.002
 curtain+0.002
Neg logit contributions
over-0.003
 enough-0.003
 Bill-0.003
 tal-0.003
 Stan-0.003
Token and
Token ID290
Feature activation+8.63e-01

Pos logit contributions
 sidelines+0.020
 stereo+0.019
 rinse+0.019
 shower+0.019
 curtain+0.018
Neg logit contributions
over-0.015
 enough-0.014
 Bill-0.014
id-0.013
 tal-0.013
Token his
Token ID465
Feature activation+1.42e+00

Pos logit contributions
over+0.016
 enough+0.015
id+0.015
 tal+0.014
 meant+0.012
Neg logit contributions
 sidelines-0.033
 shower-0.032
 rinse-0.032
 stereo-0.031
 bathroom-0.031
Token mum
Token ID25682
Feature activation+1.09e+00

Pos logit contributions
over+0.044
 enough+0.044
 Bill+0.043
id+0.043
 tal+0.042
Neg logit contributions
 sidelines-0.038
 shower-0.034
 stereo-0.033
 rinse-0.033
 curtain-0.032
Token followed
Token ID3940
Feature activation-4.87e-01

Pos logit contributions
over+0.031
 tal+0.028
 enough+0.027
 Bill+0.027
id+0.026
Neg logit contributions
 sidelines-0.033
 rinse-0.030
 shower-0.030
 stereo-0.030
 sip-0.029
Token the
Token ID262
Feature activation-4.21e-02

Pos logit contributions
 sidelines+0.016
 shower+0.015
 rinse+0.015
 stereo+0.015
 sip+0.015
Neg logit contributions
over-0.012
id-0.011
 tal-0.011
 enough-0.011
 meant-0.010
Token found
Token ID1043
Feature activation-6.36e-01

Pos logit contributions
 sidelines+0.007
 shower+0.007
 rinse+0.007
 sip+0.007
 stereo+0.007
Neg logit contributions
over-0.007
 tal-0.007
 Bill-0.007
 enough-0.007
id-0.006
Token it
Token ID340
Feature activation-4.91e-01

Pos logit contributions
 sidelines+0.023
 shower+0.022
 curtain+0.022
 curtains+0.021
 rinse+0.021
Neg logit contributions
over-0.013
id-0.013
 Bill-0.011
 enough-0.011
 tal-0.011
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 shower+0.013
 curtain+0.013
 sidelines+0.013
 bathroom+0.013
 carpet+0.013
Neg logit contributions
 tal-0.014
over-0.014
 Bill-0.013
id-0.013
 Stan-0.012
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
 Bill+0.005
 tal+0.004
 Stan+0.004
 enough+0.004
over+0.004
Neg logit contributions
 sidelines-0.014
irs-0.013
 rinse-0.013
 curtain-0.013
 stereo-0.013
Token
Token ID198
Feature activation-4.42e-01

Pos logit contributions
 sidelines+0.015
 stereo+0.013
 rinse+0.013
 shower+0.013
irs+0.013
Neg logit contributions
 Bill-0.008
over-0.008
 enough-0.008
 tal-0.007
id-0.007
TokenB
Token ID33
Feature activation-3.21e+00

Pos logit contributions
 sidelines+0.014
 shower+0.013
 curtain+0.013
 stereo+0.013
irs+0.013
Neg logit contributions
over-0.011
id-0.011
 enough-0.010
 Bill-0.010
 tal-0.010
Tokenenny
Token ID11870
Feature activation-1.49e+00

Pos logit contributions
 sidelines+0.060
 stereo+0.051
 rinse+0.050
 shower+0.049
irs+0.047
Neg logit contributions
 enough-0.124
over-0.124
 Bill-0.123
id-0.121
 tal-0.121
Token saw
Token ID2497
Feature activation+1.41e-02

Pos logit contributions
 sidelines+0.050
 stereo+0.048
 rinse+0.046
 shower+0.046
 curtain+0.045
Neg logit contributions
over-0.034
 enough-0.032
 Bill-0.032
 tal-0.031
id-0.030
Token a
Token ID257
Feature activation-8.77e-02

Pos logit contributions
over+0.000
 enough+0.000
id+0.000
 Bill+0.000
 tal+0.000
Neg logit contributions
 sidelines-0.001
 stereo-0.001
 shower-0.001
 rinse-0.001
irs-0.001
Token big
Token ID1263
Feature activation+8.73e-02

Pos logit contributions
 sidelines+0.003
 stereo+0.003
 rinse+0.003
irs+0.003
 shower+0.003
Neg logit contributions
 enough-0.002
over-0.002
 Bill-0.002
 tal-0.002
id-0.002
Token fish
Token ID5916
Feature activation+8.59e-01

Pos logit contributions
over+0.002
id+0.002
 enough+0.002
 Bill+0.002
 tal+0.002
Neg logit contributions
 sidelines-0.003
 shower-0.003
 curtain-0.003
 carpet-0.003
 bathroom-0.003
Token She
Token ID1375
Feature activation-8.15e-01

Pos logit contributions
 Bill+0.001
 enough+0.001
over+0.001
 meant+0.001
id+0.001
Neg logit contributions
 sidelines-0.002
irs-0.002
 rinse-0.002
 stereo-0.002
 shower-0.002
Token tossed
Token ID22591
Feature activation+5.21e-01

Pos logit contributions
 sidelines+0.030
 stereo+0.028
 rinse+0.028
 shower+0.028
 curtain+0.027
Neg logit contributions
over-0.016
 enough-0.016
 Bill-0.016
id-0.015
 tal-0.015
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
over+0.012
id+0.011
 Bill+0.011
 tal+0.011
 enough+0.010
Neg logit contributions
 sidelines-0.018
 rinse-0.017
 shower-0.017
 stereo-0.017
 curtain-0.017
Token turned
Token ID2900
Feature activation-1.36e+00

Pos logit contributions
 sidelines+0.005
irs+0.005
 stereo+0.005
 rinse+0.005
 shower+0.005
Neg logit contributions
 enough-0.003
 Bill-0.003
 meant-0.003
 tal-0.003
over-0.003
Token all
Token ID477
Feature activation-6.07e-01

Pos logit contributions
 sidelines+0.054
 stereo+0.050
 rinse+0.050
 shower+0.049
irs+0.048
Neg logit contributions
over-0.025
 enough-0.024
 Bill-0.024
id-0.023
 tal-0.023
Token night
Token ID1755
Feature activation-3.53e+00

Pos logit contributions
 sidelines+0.027
 stereo+0.024
 rinse+0.023
irs+0.023
 sip+0.022
Neg logit contributions
 enough-0.010
 Bill-0.009
 meant-0.009
over-0.008
est-0.008
Token and
Token ID290
Feature activation-8.70e-02

Pos logit contributions
 sidelines+0.142
 stereo+0.133
 shower+0.132
 rinse+0.131
 curtain+0.128
Neg logit contributions
 Bill-0.058
 enough-0.058
over-0.057
id-0.055
 tal-0.055
Token couldn
Token ID3521
Feature activation-1.27e+00

Pos logit contributions
 sidelines+0.003
 rinse+0.003
 stereo+0.003
 shower+0.003
 bathroom+0.003
Neg logit contributions
over-0.002
 tal-0.002
id-0.002
 Bill-0.002
 enough-0.002
Token't
Token ID470
Feature activation-6.67e-01

Pos logit contributions
 sidelines+0.033
 rinse+0.029
 stereo+0.028
irs+0.028
 curtain+0.026
Neg logit contributions
 enough-0.040
 Bill-0.040
over-0.039
 meant-0.038
id-0.037
Token get
Token ID651
Feature activation-4.05e-01

Pos logit contributions
 sidelines+0.019
irs+0.017
 stereo+0.017
 shower+0.016
 rinse+0.016
Neg logit contributions
over-0.019
 enough-0.019
 Bill-0.018
 tal-0.018
id-0.018
Token close
Token ID1969
Feature activation-3.68e-01

Pos logit contributions
 sidelines+0.017
 stereo+0.016
irs+0.015
 curtain+0.015
 shower+0.015
Neg logit contributions
 enough-0.008
 Bill-0.007
 meant-0.006
over-0.006
 ac-0.006
Token too
Token ID1165
Feature activation+6.76e-01

Pos logit contributions
 sidelines+0.001
 rinse+0.001
 shower+0.001
 stereo+0.001
irs+0.001
Neg logit contributions
over-0.001
id-0.000
 Bill-0.000
 tal-0.000
 enough-0.000
Token far
Token ID1290
Feature activation+1.30e+00

Pos logit contributions
over+0.017
 Bill+0.016
id+0.016
 tal+0.016
 enough+0.015
Neg logit contributions
 sidelines-0.021
 rinse-0.020
 shower-0.020
 stereo-0.019
 spill-0.019
Token away
Token ID1497
Feature activation-9.83e-01

Pos logit contributions
over+0.036
 Bill+0.034
 tal+0.034
id+0.034
 Stan+0.031
Neg logit contributions
 sidelines-0.038
 shower-0.035
 stereo-0.034
 rinse-0.034
 spill-0.032
Token for
Token ID329
Feature activation+1.21e+00

Pos logit contributions
 sidelines+0.030
 shower+0.029
 rinse+0.028
 curtain+0.028
 stereo+0.028
Neg logit contributions
over-0.024
 tal-0.023
id-0.023
 Bill-0.023
 Stan-0.022
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 enough+0.025
over+0.025
 Bill+0.025
id+0.024
 tal+0.024
Neg logit contributions
 sidelines-0.044
 stereo-0.041
 rinse-0.040
 shower-0.040
irs-0.039
Token little
Token ID1310
Feature activation-3.55e+00

Pos logit contributions
 sidelines+0.047
 stereo+0.043
 rinse+0.043
irs+0.042
 shower+0.042
Neg logit contributions
 enough-0.034
over-0.034
 Bill-0.033
id-0.032
 tal-0.032
Token girl
Token ID2576
Feature activation-2.93e+00

Pos logit contributions
 sidelines+0.116
 stereo+0.105
irs+0.103
 rinse+0.103
 shower+0.101
Neg logit contributions
 enough-0.090
over-0.088
 Bill-0.086
 tal-0.083
id-0.083
Token to
Token ID284
Feature activation+9.04e-01

Pos logit contributions
 sidelines+0.092
 shower+0.089
 stereo+0.089
 rinse+0.087
 curtain+0.085
Neg logit contributions
id-0.066
over-0.065
 Bill-0.065
 tal-0.063
 enough-0.063
Token see
Token ID766
Feature activation+3.16e-01

Pos logit contributions
id+0.031
over+0.031
 meant+0.029
 tal+0.028
 Bill+0.028
Neg logit contributions
 sidelines-0.021
 rinse-0.020
 shower-0.020
 stereo-0.018
 sip-0.017
Token.
Token ID13
Feature activation+5.12e-01

Pos logit contributions
 enough+0.006
over+0.006
 Bill+0.006
id+0.005
 tal+0.005
Neg logit contributions
 sidelines-0.012
 stereo-0.011
 rinse-0.011
 shower-0.011
irs-0.011
Token She
Token ID1375
Feature activation-7.33e-01

Pos logit contributions
 Bill+0.009
 once+0.006
 Frank+0.006
over+0.006
 Stan+0.006
Neg logit contributions
 sidelines-0.023
 rinse-0.022
irs-0.022
 curtain-0.021
 stereo-0.021
Token stars
Token ID5788
Feature activation+6.18e-01

Pos logit contributions
 sidelines+0.047
 shower+0.045
 stereo+0.044
 curtain+0.044
 rinse+0.044
Neg logit contributions
over-0.024
 Bill-0.024
id-0.024
 enough-0.024
 tal-0.023
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
 tal+0.013
over+0.013
 Bill+0.013
id+0.013
 Stan+0.013
Neg logit contributions
 sidelines-0.021
 shower-0.020
 curtain-0.020
 stereo-0.020
 rinse-0.020
Token She
Token ID1375
Feature activation-4.84e-01

Pos logit contributions
 Bill+0.011
 enough+0.010
over+0.010
 meant+0.009
id+0.009
Neg logit contributions
 sidelines-0.016
 rinse-0.015
irs-0.015
 stereo-0.015
 shower-0.015
Token couldn
Token ID3521
Feature activation-1.45e+00

Pos logit contributions
 sidelines+0.019
irs+0.017
 stereo+0.017
 rinse+0.017
 shower+0.016
Neg logit contributions
 enough-0.009
 Bill-0.009
over-0.009
id-0.009
 meant-0.009
Token't
Token ID470
Feature activation-7.87e-01

Pos logit contributions
 sidelines+0.037
 rinse+0.032
 stereo+0.032
irs+0.031
 curtain+0.029
Neg logit contributions
 enough-0.046
 Bill-0.046
over-0.044
 meant-0.044
id-0.043
Token believe
Token ID1975
Feature activation-3.25e+00

Pos logit contributions
 sidelines+0.026
irs+0.024
 stereo+0.023
 rinse+0.022
 shower+0.022
Neg logit contributions
 enough-0.018
 Bill-0.018
over-0.018
 meant-0.017
 tal-0.017
Token it
Token ID340
Feature activation+2.85e-01

Pos logit contributions
 sidelines+0.119
 stereo+0.106
 rinse+0.105
 shower+0.102
irs+0.101
Neg logit contributions
 enough-0.076
 Bill-0.073
over-0.066
 meant-0.064
id-0.062
Token was
Token ID373
Feature activation+1.35e+00

Pos logit contributions
 tal+0.007
over+0.007
 Bill+0.007
 Stan+0.007
id+0.007
Neg logit contributions
 shower-0.009
 curtain-0.009
 stereo-0.009
 rinse-0.008
 sidelines-0.008
Token real
Token ID1103
Feature activation+7.94e-02

Pos logit contributions
over+0.026
 enough+0.026
 Bill+0.025
id+0.025
 tal+0.025
Neg logit contributions
 sidelines-0.051
 stereo-0.047
 rinse-0.047
 shower-0.046
irs-0.046
Token!
Token ID0
Feature activation+6.61e-01

Pos logit contributions
 enough+0.001
 Bill+0.001
over+0.001
 meant+0.001
 tal+0.001
Neg logit contributions
 sidelines-0.004
 rinse-0.003
irs-0.003
 stereo-0.003
 sip-0.003
Token The
Token ID383
Feature activation+1.40e-01

Pos logit contributions
 Bill+0.016
 enough+0.015
over+0.014
id+0.013
 meant+0.013
Neg logit contributions
 sidelines-0.024
 rinse-0.022
irs-0.022
 shower-0.021
 stereo-0.021
Token can
Token ID460
Feature activation-1.70e+00

Pos logit contributions
 sidelines+0.022
 stereo+0.020
 rinse+0.019
 shower+0.019
irs+0.019
Neg logit contributions
over-0.021
 enough-0.021
 Bill-0.021
id-0.020
 tal-0.020
Token't
Token ID470
Feature activation-4.36e-02

Pos logit contributions
 sidelines+0.056
irs+0.048
 stereo+0.047
 curtain+0.044
 Acc+0.044
Neg logit contributions
 Bill-0.045
 enough-0.044
,-0.042
id-0.041
 meant-0.040
Token watch
Token ID2342
Feature activation-2.13e+00

Pos logit contributions
 sidelines+0.001
irs+0.001
 stereo+0.001
 rinse+0.001
 curtain+0.001
Neg logit contributions
 Bill-0.001
 enough-0.001
over-0.001
id-0.001
 meant-0.001
Token TV
Token ID3195
Feature activation-1.14e+00

Pos logit contributions
 sidelines+0.061
 rinse+0.053
 stereo+0.052
irs+0.052
 shower+0.051
Neg logit contributions
 enough-0.064
over-0.061
 Bill-0.061
id-0.058
 meant-0.057
Token all
Token ID477
Feature activation-1.12e+00

Pos logit contributions
 sidelines+0.047
 stereo+0.044
 rinse+0.044
 shower+0.044
irs+0.043
Neg logit contributions
over-0.017
 enough-0.017
 Bill-0.017
 tal-0.017
id-0.016
Token day
Token ID1110
Feature activation-3.27e+00

Pos logit contributions
 sidelines+0.044
 rinse+0.040
 stereo+0.040
irs+0.040
 sip+0.038
Neg logit contributions
 enough-0.022
 Bill-0.021
 meant-0.020
over-0.019
id-0.018
Token,
Token ID11
Feature activation-6.84e-01

Pos logit contributions
 sidelines+0.119
 stereo+0.112
 shower+0.110
 rinse+0.109
 curtain+0.107
Neg logit contributions
 tal-0.066
 Bill-0.066
id-0.065
over-0.064
 enough-0.063
Token Lily
Token ID20037
Feature activation-5.78e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
irs+0.019
 shower+0.017
 rinse+0.017
Neg logit contributions
 enough-0.020
 Bill-0.018
over-0.018
 meant-0.017
 okay-0.017
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
 sidelines+0.020
 rinse+0.019
 stereo+0.019
 shower+0.019
 curtain+0.019
Neg logit contributions
 Bill-0.012
over-0.012
 tal-0.012
 enough-0.011
id-0.011
Token You
Token ID921
Feature activation-7.00e-01

Pos logit contributions
 sidelines+0.026
irs+0.023
 rinse+0.023
 stereo+0.022
 shower+0.022
Neg logit contributions
 Bill-0.019
over-0.018
 enough-0.018
id-0.017
 meant-0.016
Token have
Token ID423
Feature activation-3.57e-01

Pos logit contributions
 sidelines+0.021
 stereo+0.019
 rinse+0.019
 shower+0.019
irs+0.018
Neg logit contributions
 enough-0.019
over-0.019
 Bill-0.019
id-0.018
 tal-0.018