TOP ACTIVATIONS
MAX = 0.088

and watched as the cater pill ars grew bigger and bigger
they paid him for his av oc ados .
Everyday he would take the av oc ados and scatter them
and try to grab the av oc ados . But the
continued to enjoy eating her av oc ados with a smile
and lady bugs and cater pill ars ." Tim
her mom and share her rad ishes . She liked to
made sure to eat her av oc ados . She learned
weather before he picked his av oc ados . <|endoftext|> Once
more of the hairy cater pill ars . <|endoftext|> Once there
recommendation and picked the right av oc ados .
tell the mom my cater pill ars ! ''
cut up one of his rad ishes . He laid the
songs and danced around the rad ish . But
started to chew on some seaw eed . Mary was so
, and all the other av oc ados were very happy
world where everyone shared their rad ishes . <|endoftext|> Once upon
continued to scatter the bread cr umbs . <|endoftext|> Once upon
and Sammy would often share av oc ados and other y
prepared , even when things seem mild .

MIN ACTIVATIONS
MIN = -0.119

It became the happiest books he lf in the world !
can 't reach the books he lf . Can you reach
a room . The books he lf was filled with books
, I finished the books he lf !" Andy said with
She had a big books he lf in her room with
he finally found the books he lf , and he stepped
. She saw a books he lf with many books on
book falls off the books he lf . It makes a
. She had a books he lf in her room that
should not climb the books he lf . It is dangerous
. She moved the books he lf a little and got
be careful . The books he lf is tall and we
my never saw the books he lf again . But they
He noticed that the books he lf was very smooth and
book . The long books he lf was still there ,
Can I touch the books he lf too ?" His mom
Lily has a big books he lf with many books .
she noticed that the books he lf was very dangerous .
he lf . The books he lf is high , so
his room . The books he lf was very popular because

INTERVAL 0.036 - 0.088
CONTAINS 0.366%

smell and sour fruits and vegetables . The cater pillar did
The mum bought the pie and they went home .
her new bottle and it tasted normal , just like her
, it does . It tastes like dirt and grass ,"
curious , and he asked why the gate was locked .

INTERVAL -0.015 - 0.036
CONTAINS 89.541%

. She found a little box to put the mouse in
We have to get out quickly !" John
there and knew just what to do . His big brother
the hill . The car rolled and rolled until it finally
his mum every single day . <|endoftext|> A little girl was

INTERVAL -0.067 - -0.015
CONTAINS 10.085%

and took her picture . Kate posed like she was a
olly and M andy were best friends . Every day they
She said , " There , there , it will be
old and dangerous . No one should have gone on it
around . She saw a window . She ran to the

INTERVAL -0.119 - -0.067
CONTAINS 0.007%

" It is a k ang aroo ," Ben says .
, Sarah found a books he lf in her room .
L - I - L - Y ," Lily spelled .
next time he could bat he in the warm light .
take one of the works he ets and start doing the
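
NOTE ON INTERVALS
The four INTERVAL ranges above look like equal-width bins between MIN and MAX. This is inferred from the printed numbers and is not stated by the dump itself. A minimal sketch of that reading follows; the function name interval_edges and the bin count of 4 are assumptions made for illustration, and MIN and MAX are the rounded values printed above, so the edges agree with the listed bounds only to about three decimals.

```python
# Sketch: reproduce the INTERVAL bounds as equal-width bins.
# Assumption: the dashboard splits [MIN, MAX] into 4 equal bins.

def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

MIN_ACT = -0.119  # from "MIN = -0.119"
MAX_ACT = 0.088   # from "MAX = 0.088"

edges = interval_edges(MIN_ACT, MAX_ACT)

# "CONTAINS" percentages as printed, ordered from lowest bin to highest.
shares = [0.007, 10.085, 89.541, 0.366]

for (a, b), pct in zip(zip(edges, edges[1:]), shares):
    print(f"{a:+.4f} to {b:+.4f}: {pct:.3f}%")
print(f"total: {sum(shares):.3f}%")
```

Under this reading, almost all token positions fall in the bin around zero, and the samples listed under TOP ACTIVATIONS and MIN ACTIVATIONS come from the two sparse outer bins.
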
Token and
Token ID290
Feature activation+1.15e-03

Pos logit contributions
ase+0.031
chair+0.030
belt+0.028
lf+0.028
loo+0.027
Neg logit contributions
 tast-0.020
 scr-0.018
 ang-0.018
 rust-0.017
 better-0.016
Token watched
Token ID7342
Feature activation-6.80e-03

Pos logit contributions
 tast+0.002
 ang+0.002
 scr+0.001
 cuc+0.001
 ju+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
overs-0.002
lf-0.002
Token as
Token ID355
Feature activation+4.10e-03

Pos logit contributions
chair+0.012
ase+0.011
belt+0.010
lf+0.010
overs+0.009
Neg logit contributions
 tast-0.013
 ang-0.011
 scr-0.011
 cuc-0.011
 ju-0.011
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
 tast+0.006
 scr+0.006
 ang+0.005
 cuc+0.005
 swe+0.005
Neg logit contributions
ase-0.009
chair-0.009
belt-0.008
lf-0.007
board-0.007
Token cater
Token ID20825
Feature activation+2.89e-02

Pos logit contributions
 tast+0.015
 scr+0.013
 ang+0.013
 swe+0.012
 ju+0.012
Neg logit contributions
chair-0.024
ase-0.023
belt-0.023
lf-0.021
overs-0.020
Tokenpill
Token ID27215
Feature activation+8.78e-02

Pos logit contributions
 tast+0.056
 scr+0.051
 cuc+0.051
 ang+0.050
 ju+0.050
Neg logit contributions
belt-0.044
chair-0.044
ase-0.042
lf-0.040
 doorway-0.036
Tokenars
Token ID945
Feature activation-1.33e-03

Pos logit contributions
 tast+0.193
 ang+0.185
 scr+0.184
 cuc+0.178
 Pars+0.176
Neg logit contributions
chair-0.117
ase-0.109
belt-0.100
lf-0.094
overs-0.082
Token grew
Token ID6348
Feature activation+3.96e-02

Pos logit contributions
chair+0.003
ase+0.003
belt+0.002
lf+0.002
overs+0.002
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.002
 ju-0.002
Token bigger
Token ID5749
Feature activation+1.27e-02

Pos logit contributions
 tast+0.072
 argue+0.072
 Pars+0.071
 cuc+0.070
heit+0.070
Neg logit contributions
chair-0.047
 corner-0.045
belt-0.043
overs-0.042
 of-0.038
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
 tast+0.023
 ang+0.021
 scr+0.021
 cuc+0.020
 Pars+0.020
Neg logit contributions
chair-0.022
ase-0.021
belt-0.020
lf-0.019
overs-0.017
Token bigger
Token ID5749
Feature activation+1.22e-02

Pos logit contributions
 Pars+0.016
 scr+0.015
 tast+0.015
 cuc+0.015
abb+0.015
Neg logit contributions
chair-0.033
belt-0.031
 corner-0.029
lf-0.029
ase-0.028
Token they
Token ID484
Feature activation-1.09e-03

Pos logit contributions
 tast+0.008
 ang+0.007
 scr+0.007
 cuc+0.007
 rad+0.006
Neg logit contributions
ase-0.010
chair-0.010
belt-0.009
lf-0.009
issa-0.008
Token paid
Token ID3432
Feature activation+1.97e-02

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
overs+0.002
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.001
 ju-0.001
Token him
Token ID683
Feature activation+2.82e-02

Pos logit contributions
 tast+0.034
 scr+0.031
 ang+0.031
 cuc+0.030
 ju+0.029
Neg logit contributions
chair-0.036
ase-0.034
belt-0.033
lf-0.031
overs-0.029
Token for
Token ID329
Feature activation-1.47e-02

Pos logit contributions
 tast+0.052
 cuc+0.048
 scr+0.048
 ang+0.047
 ju+0.045
Neg logit contributions
chair-0.048
belt-0.044
ase-0.043
lf-0.039
 corner-0.038
Token his
Token ID465
Feature activation+6.09e-03

Pos logit contributions
chair+0.027
ase+0.025
belt+0.024
lf+0.024
overs+0.022
Neg logit contributions
 tast-0.026
 scr-0.023
 ang-0.023
 cuc-0.023
 ju-0.022
Token av
Token ID1196
Feature activation+8.58e-02

Pos logit contributions
 tast+0.010
 scr+0.009
 ang+0.009
 cuc+0.009
 Pars+0.009
Neg logit contributions
chair-0.012
ase-0.011
belt-0.011
lf-0.010
overs-0.009
Tokenoc
Token ID420
Feature activation-3.33e-02

Pos logit contributions
 tast+0.151
 ang+0.149
 scr+0.148
 Pars+0.143
 cuc+0.140
Neg logit contributions
chair-0.144
ase-0.140
belt-0.129
lf-0.120
 corner-0.112
Tokenados
Token ID22484
Feature activation+5.93e-02

Pos logit contributions
chair+0.063
ase+0.062
belt+0.058
lf+0.055
overs+0.052
Neg logit contributions
 tast-0.055
 scr-0.050
 ang-0.049
 ju-0.047
 Pars-0.047
Token.
Token ID13
Feature activation+9.80e-04

Pos logit contributions
 tast+0.097
 ang+0.089
 scr+0.089
 cuc+0.085
 Pars+0.083
Neg logit contributions
chair-0.114
ase-0.107
belt-0.104
lf-0.098
overs-0.090
Token
Token ID198
Feature activation-3.40e-03

Pos logit contributions
 tast+0.002
 ang+0.001
 scr+0.001
 cuc+0.001
 Pars+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
 doorway-0.002
Token
Token ID198
Feature activation-6.78e-04

Pos logit contributions
chair+0.007
ase+0.007
belt+0.007
lf+0.006
overs+0.006
Neg logit contributions
 tast-0.004
 ang-0.004
 scr-0.004
So-0.004
 Pars-0.004
Token Everyday
Token ID48154
Feature activation+1.24e-02

Pos logit contributions
chair+0.025
ase+0.024
belt+0.023
lf+0.022
overs+0.020
Neg logit contributions
 tast-0.021
 ang-0.019
 scr-0.019
 cuc-0.018
 ju-0.017
Token he
Token ID339
Feature activation-9.64e-03

Pos logit contributions
 tast+0.021
 scr+0.019
 ang+0.019
 cuc+0.018
 rad+0.018
Neg logit contributions
ase-0.022
chair-0.022
belt-0.021
lf-0.019
overs-0.018
Token would
Token ID561
Feature activation-1.91e-02

Pos logit contributions
chair+0.019
ase+0.018
belt+0.017
lf+0.017
overs+0.015
Neg logit contributions
 tast-0.015
 scr-0.014
 ang-0.014
 cuc-0.013
 ju-0.013
Token take
Token ID1011
Feature activation+5.84e-03

Pos logit contributions
chair+0.041
ase+0.038
belt+0.038
lf+0.036
 corner+0.034
Neg logit contributions
 tast-0.027
 ang-0.024
 scr-0.024
 cuc-0.023
 ju-0.022
Token the
Token ID262
Feature activation-4.02e-03

Pos logit contributions
 tast+0.009
 ang+0.008
 scr+0.008
 cuc+0.008
 Pars+0.008
Neg logit contributions
chair-0.011
ase-0.011
belt-0.011
lf-0.010
 doorway-0.009
Token av
Token ID1196
Feature activation+8.48e-02

Pos logit contributions
chair+0.007
ase+0.007
belt+0.007
lf+0.006
 corner+0.006
Neg logit contributions
 tast-0.007
 scr-0.006
 ang-0.006
 cuc-0.006
 Pars-0.006
Tokenoc
Token ID420
Feature activation-4.37e-02

Pos logit contributions
 tast+0.147
 ang+0.146
 scr+0.144
 die+0.135
 Pars+0.134
Neg logit contributions
ase-0.146
chair-0.144
belt-0.127
lf-0.121
overs-0.114
Tokenados
Token ID22484
Feature activation+5.27e-02

Pos logit contributions
chair+0.092
ase+0.087
belt+0.084
lf+0.081
overs+0.075
Neg logit contributions
 tast-0.064
 scr-0.058
 ang-0.057
 cuc-0.055
 ju-0.053
Token and
Token ID290
Feature activation+1.25e-02

Pos logit contributions
 tast+0.098
 scr+0.091
 ang+0.090
 cuc+0.087
 Pars+0.085
Neg logit contributions
chair-0.089
ase-0.084
belt-0.080
lf-0.076
overs-0.069
Token scatter
Token ID41058
Feature activation+2.55e-02

Pos logit contributions
 tast+0.018
 ang+0.017
 scr+0.016
 Pars+0.016
 cuc+0.015
Neg logit contributions
chair-0.026
ase-0.024
belt-0.024
lf-0.023
 corner-0.021
Token them
Token ID606
Feature activation+8.52e-03

Pos logit contributions
 tast+0.042
 scr+0.041
 ang+0.040
 Pars+0.038
 cuc+0.036
Neg logit contributions
chair-0.045
ase-0.043
belt-0.040
lf-0.038
overs-0.038
Token and
Token ID290
Feature activation+2.10e-02

Pos logit contributions
 tast+0.005
 ang+0.005
 cuc+0.005
 scr+0.005
 ju+0.004
Neg logit contributions
chair-0.003
ase-0.003
belt-0.003
 corner-0.002
lf-0.002
Token try
Token ID1949
Feature activation+5.77e-03

Pos logit contributions
 tast+0.035
 ang+0.031
abb+0.031
 ju+0.030
 cuc+0.030
Neg logit contributions
chair-0.038
belt-0.033
 corner-0.031
lf-0.031
ase-0.030
Token to
Token ID284
Feature activation+2.31e-02

Pos logit contributions
 tast+0.015
 ang+0.014
 ju+0.014
 cuc+0.013
 scr+0.013
Neg logit contributions
chair-0.005
 corner-0.004
ase-0.004
belt-0.003
lf-0.003
Token grab
Token ID5552
Feature activation+4.79e-03

Pos logit contributions
 tast+0.040
 ang+0.038
 scr+0.037
 cuc+0.036
 Pars+0.036
Neg logit contributions
chair-0.036
lf-0.035
 corner-0.034
belt-0.033
ase-0.033
Token the
Token ID262
Feature activation-4.96e-03

Pos logit contributions
 tast+0.010
 scr+0.010
 ang+0.010
 cuc+0.009
 ju+0.009
Neg logit contributions
chair-0.005
ase-0.005
overs-0.005
 corner-0.005
belt-0.005
Token av
Token ID1196
Feature activation+8.45e-02

Pos logit contributions
chair+0.010
ase+0.010
belt+0.009
 corner+0.009
lf+0.009
Neg logit contributions
 tast-0.007
 scr-0.007
 Pars-0.006
 ang-0.006
Together-0.006
Tokenoc
Token ID420
Feature activation-4.16e-02

Pos logit contributions
 ang+0.158
 tast+0.157
 scr+0.154
 die+0.145
 Pars+0.144
Neg logit contributions
ase-0.133
chair-0.130
belt-0.110
lf-0.108
 corner-0.102
Tokenados
Token ID22484
Feature activation+4.73e-02

Pos logit contributions
ase+0.080
chair+0.077
belt+0.072
lf+0.068
overs+0.066
Neg logit contributions
 tast-0.068
 scr-0.063
 ang-0.062
 funn-0.058
 Pars-0.058
Token.
Token ID13
Feature activation+7.09e-04

Pos logit contributions
 tast+0.090
 ang+0.087
 scr+0.085
 cuc+0.081
 Pars+0.081
Neg logit contributions
chair-0.072
ase-0.066
belt-0.062
 corner-0.059
lf-0.058
Token But
Token ID887
Feature activation-2.86e-02

Pos logit contributions
 tast+0.001
 ang+0.001
 scr+0.001
 cuc+0.001
 funn+0.001
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
overs-0.001
lf-0.001
Token the
Token ID262
Feature activation-8.68e-03

Pos logit contributions
chair+0.058
ase+0.056
belt+0.052
lf+0.050
 doorway+0.047
Neg logit contributions
 tast-0.045
 ang-0.041
 scr-0.041
 cuc-0.039
 ju-0.038
Token continued
Token ID3767
Feature activation+2.64e-03

Pos logit contributions
chair+0.013
ase+0.013
belt+0.012
lf+0.011
overs+0.011
Neg logit contributions
 tast-0.011
 scr-0.010
 ang-0.010
 cuc-0.010
 ju-0.010
Token to
Token ID284
Feature activation+1.38e-02

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
lf-0.003
overs-0.003
Token enjoy
Token ID2883
Feature activation-9.10e-03

Pos logit contributions
 tast+0.027
 ju+0.024
 ang+0.024
 cuc+0.023
 scr+0.023
Neg logit contributions
chair-0.021
lf-0.020
belt-0.019
ase-0.018
 corner-0.018
Token eating
Token ID6600
Feature activation+7.16e-03

Pos logit contributions
chair+0.016
ase+0.015
belt+0.014
lf+0.014
overs+0.012
Neg logit contributions
 tast-0.017
 scr-0.015
 ang-0.015
 cuc-0.015
 ju-0.014
Token her
Token ID607
Feature activation+1.13e-03

Pos logit contributions
 tast+0.011
 ang+0.010
 scr+0.010
 cuc+0.010
 Pars+0.010
Neg logit contributions
chair-0.013
ase-0.012
belt-0.012
lf-0.012
overs-0.011
Token av
Token ID1196
Feature activation+8.37e-02

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.001
 cuc+0.001
 swe+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
overs-0.002
Tokenoc
Token ID420
Feature activation-3.70e-02

Pos logit contributions
 ang+0.158
 tast+0.155
 scr+0.152
 Pars+0.149
 die+0.146
Neg logit contributions
chair-0.127
ase-0.126
lf-0.109
belt-0.108
overs-0.094
Tokenados
Token ID22484
Feature activation+5.73e-02

Pos logit contributions
chair+0.073
ase+0.071
belt+0.067
lf+0.064
overs+0.059
Neg logit contributions
 tast-0.059
 scr-0.053
 ang-0.052
 Pars-0.050
 ju-0.050
Token with
Token ID351
Feature activation+1.36e-02

Pos logit contributions
 tast+0.101
 ang+0.098
 scr+0.095
 cuc+0.092
 Pars+0.092
Neg logit contributions
chair-0.099
ase-0.090
belt-0.088
lf-0.085
overs-0.076
Token a
Token ID257
Feature activation+1.80e-02

Pos logit contributions
 tast+0.024
 ang+0.022
 scr+0.022
 cuc+0.022
 ju+0.021
Neg logit contributions
chair-0.023
ase-0.022
belt-0.021
lf-0.020
overs-0.019
Token smile
Token ID8212
Feature activation+3.94e-03

Pos logit contributions
 tast+0.028
 Pars+0.026
 ang+0.026
 scr+0.025
 cuc+0.024
Neg logit contributions
chair-0.033
 corner-0.030
ase-0.030
belt-0.029
lf-0.029
Token and
Token ID290
Feature activation-5.17e-03

Pos logit contributions
 tast+0.013
 ang+0.012
 scr+0.012
 cuc+0.011
 ju+0.011
Neg logit contributions
chair-0.011
ase-0.011
belt-0.010
lf-0.010
overs-0.009
Token lady
Token ID10846
Feature activation-4.71e-04

Pos logit contributions
chair+0.010
ase+0.009
lf+0.009
belt+0.009
 corner+0.008
Neg logit contributions
 tast-0.008
 ang-0.007
 Pars-0.007
 scr-0.007
 ju-0.006
Tokenbugs
Token ID32965
Feature activation+4.97e-02

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 ju-0.001
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 tast+0.092
 ang+0.086
 scr+0.084
 cuc+0.081
 Pars+0.080
Neg logit contributions
chair-0.083
ase-0.080
belt-0.074
lf-0.069
overs-0.066
Token cater
Token ID20825
Feature activation+3.08e-02

Pos logit contributions
 tast+0.023
 ang+0.022
 Pars+0.020
 scr+0.020
 ju+0.019
Neg logit contributions
chair-0.030
ase-0.027
belt-0.025
lf-0.025
 corner-0.024
Tokenpill
Token ID27215
Feature activation+8.27e-02

Pos logit contributions
 ang+0.049
 tast+0.049
 scr+0.044
 ju+0.041
 funn+0.041
Neg logit contributions
chair-0.061
ase-0.059
overs-0.050
oline-0.045
belt-0.045
Tokenars
Token ID945
Feature activation+7.92e-03

Pos logit contributions
 ang+0.222
 scr+0.210
 Pars+0.205
 tast+0.205
 rad+0.202
Neg logit contributions
chair-0.065
ase-0.065
overs-0.039
ily-0.035
he-0.035
Token."
Token ID526
Feature activation+1.84e-03

Pos logit contributions
 tast+0.011
 scr+0.010
 ang+0.010
 cuc+0.010
 swe+0.009
Neg logit contributions
chair-0.017
ase-0.016
belt-0.016
lf-0.015
 doorway-0.014
Token
Token ID198
Feature activation+9.54e-04

Pos logit contributions
 tast+0.004
 scr+0.003
 ang+0.003
 cuc+0.003
 ju+0.003
Neg logit contributions
chair-0.003
ase-0.003
belt-0.002
lf-0.002
overs-0.002
Token
Token ID198
Feature activation+3.67e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 cuc+0.001
 Pars+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.001
overs-0.001
TokenTim
Token ID14967
Feature activation+7.88e-03

Pos logit contributions
 tast+0.008
 scr+0.008
 ang+0.008
 cuc+0.008
 ju+0.008
Neg logit contributions
chair-0.005
ase-0.004
belt-0.004
lf-0.004
 corner-0.003
Token her
Token ID607
Feature activation-4.41e-03

Pos logit contributions
chair+0.010
ase+0.009
belt+0.009
lf+0.008
overs+0.008
Neg logit contributions
 tast-0.012
 ang-0.011
 scr-0.011
 cuc-0.011
 ju-0.011
Token mom
Token ID1995
Feature activation-2.23e-03

Pos logit contributions
chair+0.007
ase+0.006
belt+0.006
lf+0.005
overs+0.005
Neg logit contributions
 tast-0.009
 scr-0.009
 ang-0.009
 cuc-0.008
 ju-0.008
Token and
Token ID290
Feature activation+7.78e-03

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
 corner+0.002
Neg logit contributions
 tast-0.005
 ang-0.004
 scr-0.004
 Pars-0.004
 cuc-0.004
Token share
Token ID2648
Feature activation+4.29e-03

Pos logit contributions
 tast+0.014
 scr+0.013
 ang+0.013
 cuc+0.012
 swe+0.012
Neg logit contributions
chair-0.014
ase-0.014
belt-0.013
lf-0.012
overs-0.011
Token her
Token ID607
Feature activation+1.39e-02

Pos logit contributions
 tast+0.009
 ang+0.009
 scr+0.008
 Pars+0.008
 funn+0.008
Neg logit contributions
chair-0.005
ase-0.005
lf-0.005
overs-0.004
 corner-0.004
Token rad
Token ID2511
Feature activation+8.12e-02

Pos logit contributions
 tast+0.013
 ang+0.011
 scr+0.011
 Pars+0.011
 cuc+0.009
Neg logit contributions
chair-0.035
ase-0.034
belt-0.033
lf-0.032
 corner-0.031
Tokenishes
Token ID5614
Feature activation+4.26e-02

Pos logit contributions
 tast+0.078
 ang+0.078
 scr+0.077
 cal+0.066
 funn+0.065
Neg logit contributions
ase-0.190
chair-0.183
belt-0.181
lf-0.168
overs-0.168
Token.
Token ID13
Feature activation+5.98e-03

Pos logit contributions
 tast+0.076
 scr+0.070
 ang+0.070
 cuc+0.067
 ju+0.066
Neg logit contributions
chair-0.075
ase-0.071
belt-0.068
lf-0.065
overs-0.059
Token She
Token ID1375
Feature activation+5.86e-03

Pos logit contributions
 tast+0.010
 scr+0.009
 ang+0.009
 cuc+0.009
 Pars+0.008
Neg logit contributions
chair-0.011
ase-0.011
belt-0.011
lf-0.010
overs-0.009
Token liked
Token ID8288
Feature activation-1.21e-02

Pos logit contributions
 tast+0.010
 scr+0.009
 ang+0.009
 cuc+0.009
 ju+0.009
Neg logit contributions
chair-0.011
ase-0.010
belt-0.010
lf-0.009
overs-0.009
Token to
Token ID284
Feature activation-3.14e-03

Pos logit contributions
chair+0.014
ase+0.014
lf+0.013
overs+0.012
belt+0.011
Neg logit contributions
 tast-0.026
 ang-0.025
 scr-0.024
 funn-0.023
 Pars-0.023
Token made
Token ID925
Feature activation-2.32e-02

Pos logit contributions
 tast+0.017
 scr+0.016
 ang+0.016
 cuc+0.015
 ju+0.015
Neg logit contributions
chair-0.021
ase-0.021
belt-0.020
lf-0.019
overs-0.017
Token sure
Token ID1654
Feature activation-1.35e-02

Pos logit contributions
chair+0.041
ase+0.039
lf+0.036
belt+0.035
overs+0.035
Neg logit contributions
 tast-0.038
 scr-0.035
 ang-0.035
 cuc-0.034
 Pars-0.034
Token to
Token ID284
Feature activation+1.23e-02

Pos logit contributions
chair+0.024
ase+0.023
belt+0.022
lf+0.021
overs+0.019
Neg logit contributions
 tast-0.024
 scr-0.022
 ang-0.022
 cuc-0.021
 ju-0.021
Token eat
Token ID4483
Feature activation+1.08e-02

Pos logit contributions
 tast+0.018
 ang+0.017
 scr+0.016
 cuc+0.016
 ju+0.016
Neg logit contributions
chair-0.025
ase-0.024
belt-0.023
lf-0.023
 corner-0.021
Token her
Token ID607
Feature activation+3.61e-04

Pos logit contributions
 tast+0.017
 ang+0.017
 Pars+0.016
 scr+0.016
abb+0.015
Neg logit contributions
chair-0.018
ase-0.017
overs-0.017
lf-0.016
belt-0.016
Token av
Token ID1196
Feature activation+8.06e-02

Pos logit contributions
 tast+0.000
 ang+0.000
 Pars+0.000
 scr+0.000
 cuc+0.000
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Tokenoc
Token ID420
Feature activation-3.51e-02

Pos logit contributions
 ang+0.156
 tast+0.148
 scr+0.148
 Pars+0.144
 die+0.144
Neg logit contributions
ase-0.119
chair-0.116
lf-0.100
belt-0.097
overs-0.089
Tokenados
Token ID22484
Feature activation+5.65e-02

Pos logit contributions
ase+0.067
chair+0.066
belt+0.060
lf+0.059
overs+0.055
Neg logit contributions
 tast-0.057
 scr-0.052
 ang-0.051
 Pars-0.049
 ju-0.049
Token.
Token ID13
Feature activation+4.61e-03

Pos logit contributions
 tast+0.097
 ang+0.094
 scr+0.091
 Pars+0.089
 cuc+0.088
Neg logit contributions
chair-0.099
ase-0.092
belt-0.089
lf-0.085
 corner-0.077
Token She
Token ID1375
Feature activation+5.78e-03

Pos logit contributions
 tast+0.008
 scr+0.007
 ang+0.007
 cuc+0.007
 ju+0.007
Neg logit contributions
chair-0.008
ase-0.008
belt-0.008
lf-0.007
overs-0.007
Token learned
Token ID4499
Feature activation+2.78e-03

Pos logit contributions
 tast+0.010
 scr+0.009
 ang+0.009
 cuc+0.009
 ju+0.008
Neg logit contributions
chair-0.011
ase-0.010
belt-0.010
lf-0.009
overs-0.009
Token weather
Token ID6193
Feature activation+1.06e-02

Pos logit contributions
chair+0.023
ase+0.022
belt+0.020
lf+0.020
 corner+0.019
Neg logit contributions
 tast-0.021
 ang-0.020
 scr-0.020
 Pars-0.019
 cuc-0.018
Token before
Token ID878
Feature activation+1.55e-02

Pos logit contributions
 tast+0.019
 scr+0.017
 ang+0.016
 ju+0.016
 cuc+0.016
Neg logit contributions
chair-0.019
ase-0.018
belt-0.017
lf-0.016
 doorway-0.015
Token he
Token ID339
Feature activation+4.91e-03

Pos logit contributions
 tast+0.029
 ang+0.026
 scr+0.026
 cuc+0.025
 ju+0.025
Neg logit contributions
chair-0.026
lf-0.024
belt-0.023
ase-0.023
 corner-0.020
Token picked
Token ID6497
Feature activation+5.97e-03

Pos logit contributions
 tast+0.008
 ang+0.008
 scr+0.008
 cuc+0.007
 Pars+0.007
Neg logit contributions
chair-0.009
ase-0.009
belt-0.008
lf-0.008
overs-0.007
Token his
Token ID465
Feature activation+1.83e-03

Pos logit contributions
 tast+0.010
 scr+0.010
 ang+0.010
 Pars+0.010
 cuc+0.010
Neg logit contributions
chair-0.010
ase-0.009
belt-0.008
overs-0.008
 corner-0.008
Token av
Token ID1196
Feature activation+7.97e-02

Pos logit contributions
 Pars+0.003
 tast+0.003
 ang+0.002
 scr+0.002
abb+0.002
Neg logit contributions
chair-0.003
ase-0.003
lf-0.003
belt-0.003
 corner-0.003
Tokenoc
Token ID420
Feature activation-3.43e-02

Pos logit contributions
 ang+0.148
 tast+0.143
 scr+0.140
 Pars+0.137
 cuc+0.133
Neg logit contributions
chair-0.124
ase-0.124
belt-0.105
lf-0.102
 corner-0.098
Tokenados
Token ID22484
Feature activation+6.23e-02

Pos logit contributions
ase+0.066
chair+0.064
belt+0.059
lf+0.058
overs+0.054
Neg logit contributions
 tast-0.055
 scr-0.051
 ang-0.050
 Pars-0.048
 ju-0.047
Token.
Token ID13
Feature activation+2.36e-03

Pos logit contributions
 tast+0.110
 ang+0.108
 scr+0.104
 cuc+0.103
 Pars+0.103
Neg logit contributions
chair-0.103
belt-0.094
ase-0.093
lf-0.088
 corner-0.083
Token<|endoftext|>
Token ID50256
Feature activation-4.40e-03

Pos logit contributions
 tast+0.004
 ang+0.004
 scr+0.004
 cuc+0.004
 ju+0.004
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
lf-0.003
overs-0.003
TokenOnce
Token ID7454
Feature activation+9.57e-03

Pos logit contributions
belt+0.010
chair+0.010
ase+0.009
lf+0.009
overs+0.008
Neg logit contributions
 tast-0.005
 Pars-0.004
 ang-0.004
 scr-0.004
So-0.004
Token more
Token ID517
Feature activation-7.98e-03

Pos logit contributions
chair+0.037
ase+0.035
belt+0.034
lf+0.032
overs+0.029
Neg logit contributions
 tast-0.033
 scr-0.030
 ang-0.030
 cuc-0.029
 ju-0.028
Token of
Token ID286
Feature activation+1.16e-02

Pos logit contributions
chair+0.016
ase+0.015
belt+0.014
lf+0.013
overs+0.012
Neg logit contributions
 tast-0.013
 ang-0.012
 scr-0.012
 cuc-0.011
 Pars-0.011
Token the
Token ID262
Feature activation-1.54e-03

Pos logit contributions
 tast+0.021
 scr+0.019
 ang+0.019
 cuc+0.019
 ju+0.018
Neg logit contributions
chair-0.020
ase-0.019
belt-0.018
lf-0.017
overs-0.016
Token hairy
Token ID43873
Feature activation-6.60e-03

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
overs+0.002
Neg logit contributions
 tast-0.003
 scr-0.002
 ang-0.002
 cuc-0.002
 ju-0.002
Token cater
Token ID20825
Feature activation+2.72e-02

Pos logit contributions
chair+0.016
ase+0.016
belt+0.015
lf+0.015
overs+0.014
Neg logit contributions
 tast-0.007
 scr-0.006
 ang-0.006
 cuc-0.006
 ju-0.006
Tokenpill
Token ID27215
Feature activation+7.88e-02

Pos logit contributions
 tast+0.053
 scr+0.048
 cuc+0.047
 ju+0.047
 Pars+0.045
Neg logit contributions
belt-0.043
chair-0.042
ase-0.040
lf-0.039
 doorway-0.034
Tokenars
Token ID945
Feature activation-2.69e-03

Pos logit contributions
 tast+0.159
 scr+0.146
 ang+0.144
 cuc+0.141
 ju+0.139
Neg logit contributions
chair-0.122
ase-0.114
belt-0.110
lf-0.103
overs-0.091
Token.
Token ID13
Feature activation-9.63e-04

Pos logit contributions
chair+0.006
ase+0.005
belt+0.005
lf+0.005
 doorway+0.005
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.003
 cuc-0.003
 ju-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.60e-03

Pos logit contributions
chair+0.002
belt+0.002
ase+0.002
lf+0.002
 corner+0.002
Neg logit contributions
 tast-0.001
 scr-0.001
 Pars-0.001
 cuc-0.001
 ang-0.001
TokenOnce
Token ID7454
Feature activation+9.90e-03

Pos logit contributions
belt+0.006
chair+0.006
lf+0.005
ase+0.005
overs+0.005
Neg logit contributions
 tast-0.003
 Pars-0.003
 cuc-0.003
 ang-0.003
 ju-0.003
Token there
Token ID612
Feature activation+1.08e-02

Pos logit contributions
 ang+0.013
 tast+0.013
 scr+0.012
 rad+0.012
 swe+0.012
Neg logit contributions
ase-0.019
chair-0.019
belt-0.019
person-0.017
lf-0.017
Token recommendation
Token ID15602
Feature activation-5.55e-03

Pos logit contributions
 tast+0.037
 cuc+0.034
 scr+0.034
 ang+0.033
 ju+0.033
Neg logit contributions
chair-0.047
ase-0.044
belt-0.043
lf-0.041
 corner-0.040
Token and
Token ID290
Feature activation+3.31e-03

Pos logit contributions
chair+0.012
ase+0.012
belt+0.011
lf+0.011
 doorway+0.010
Neg logit contributions
 tast-0.007
 scr-0.006
 ang-0.006
 swe-0.006
 better-0.006
Token picked
Token ID6497
Feature activation+3.79e-03

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.004
Neg logit contributions
chair-0.006
ase-0.006
belt-0.006
lf-0.006
overs-0.005
Token the
Token ID262
Feature activation-9.96e-03

Pos logit contributions
 tast+0.007
 scr+0.007
 ang+0.006
 cuc+0.006
 Pars+0.006
Neg logit contributions
chair-0.006
ase-0.006
belt-0.005
overs-0.005
 corner-0.005
Token right
Token ID826
Feature activation+3.56e-03

Pos logit contributions
chair+0.021
ase+0.021
belt+0.019
 corner+0.019
lf+0.018
Neg logit contributions
 scr-0.012
 Pars-0.012
 tast-0.012
 ang-0.011
abb-0.011
Token av
Token ID1196
Feature activation+7.79e-02

Pos logit contributions
 tast+0.006
 scr+0.006
 ang+0.006
 Pars+0.005
 cuc+0.005
Neg logit contributions
chair-0.006
ase-0.006
belt-0.006
lf-0.006
 corner-0.005
Tokenoc
Token ID420
Feature activation-4.25e-02

Pos logit contributions
 tast+0.143
 ang+0.141
 scr+0.141
 Pars+0.132
 rust+0.130
Neg logit contributions
ase-0.124
chair-0.123
belt-0.107
lf-0.103
overs-0.096
Tokenados
Token ID22484
Feature activation+5.30e-02

Pos logit contributions
chair+0.085
ase+0.084
belt+0.078
lf+0.075
overs+0.071
Neg logit contributions
 tast-0.065
 scr-0.060
 ang-0.058
 Pars-0.055
 ju-0.055
Token.
Token ID13
Feature activation-1.86e-03

Pos logit contributions
 tast+0.088
 scr+0.080
 ang+0.080
 cuc+0.077
 ju+0.075
Neg logit contributions
chair-0.101
ase-0.095
belt-0.092
lf-0.087
overs-0.080
Token
Token ID198
Feature activation+7.52e-04

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
overs+0.002
Neg logit contributions
 tast-0.004
 scr-0.003
 ang-0.003
 cuc-0.003
 ju-0.003
Token
Token ID198
Feature activation+5.57e-03

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 cuc+0.001
 Pars+0.001
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Token tell
Token ID1560
Feature activation-2.86e-02

Pos logit contributions
chair+0.042
ase+0.041
lf+0.040
belt+0.039
 corner+0.037
Neg logit contributions
 tast-0.039
 ang-0.038
 ju-0.035
 scr-0.035
 Pars-0.034
Token the
Token ID262
Feature activation+3.18e-03

Pos logit contributions
chair+0.046
ase+0.043
belt+0.041
lf+0.039
overs+0.035
Neg logit contributions
 tast-0.056
 scr-0.052
 ang-0.051
 cuc-0.050
 ju-0.049
Token mom
Token ID1995
Feature activation-2.01e-02

Pos logit contributions
 ang+0.005
 Pars+0.005
 scr+0.005
 tast+0.005
Together+0.004
Neg logit contributions
ase-0.005
chair-0.005
 corner-0.005
board-0.004
lf-0.004
Tokenmy
Token ID1820
Feature activation+3.53e-03

Pos logit contributions
chair+0.039
ase+0.036
belt+0.035
lf+0.034
overs+0.031
Neg logit contributions
 tast-0.033
 scr-0.030
 ang-0.029
 cuc-0.028
 ju-0.028
Token cater
Token ID20825
Feature activation+1.93e-02

Pos logit contributions
 tast+0.006
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.007
ase-0.006
belt-0.006
lf-0.006
overs-0.005
Tokenpill
Token ID27215
Feature activation+7.76e-02

Pos logit contributions
 tast+0.042
 scr+0.039
 ang+0.039
 cuc+0.038
 ju+0.037
Neg logit contributions
chair-0.027
ase-0.025
belt-0.024
lf-0.023
overs-0.020
Tokenars
Token ID945
Feature activation-2.28e-03

Pos logit contributions
 tast+0.160
 ang+0.157
 scr+0.156
 cuc+0.148
 Pars+0.147
Neg logit contributions
chair-0.110
ase-0.104
belt-0.094
lf-0.087
overs-0.083
Token!
Token ID0
Feature activation-3.01e-06

Pos logit contributions
chair+0.004
ase+0.004
belt+0.004
lf+0.004
overs+0.003
Neg logit contributions
 tast-0.004
 scr-0.003
 ang-0.003
 cuc-0.003
 ju-0.003
Token''
Token ID7061
Feature activation+1.37e-02

Feature not active on prev token;
no predictions were affected.
Token
Token ID198
Feature activation+9.91e-03

Pos logit contributions
 tast+0.023
 scr+0.022
 ang+0.021
 cuc+0.021
 swe+0.021
Neg logit contributions
chair-0.025
ase-0.023
belt-0.022
lf-0.020
overs-0.020
Token
Token ID198
Feature activation+1.43e-02

Pos logit contributions
 tast+0.017
 scr+0.016
 ang+0.016
 cuc+0.015
 ju+0.015
Neg logit contributions
chair-0.018
ase-0.017
belt-0.016
lf-0.016
overs-0.014
Token cut
Token ID2005
Feature activation-1.54e-02

Pos logit contributions
 tast+0.005
 ang+0.004
 scr+0.004
 funn+0.004
 Pars+0.004
Neg logit contributions
chair-0.004
ase-0.004
lf-0.004
belt-0.004
 corner-0.004
Token up
Token ID510
Feature activation-2.14e-02

Pos logit contributions
chair+0.022
ase+0.021
belt+0.019
overs+0.018
lf+0.018
Neg logit contributions
 tast-0.031
 scr-0.029
 ang-0.028
 cuc-0.027
 Pars-0.027
Token one
Token ID530
Feature activation-1.11e-02

Pos logit contributions
chair+0.028
ase+0.025
 corner+0.023
overs+0.023
lf+0.023
Neg logit contributions
 tast-0.042
itionally-0.040
abb-0.040
 scr-0.040
 Pars-0.039
Token of
Token ID286
Feature activation+2.76e-03

Pos logit contributions
chair+0.015
ase+0.013
belt+0.012
 corner+0.011
lf+0.011
Neg logit contributions
 tast-0.024
 ang-0.023
 scr-0.023
 Pars-0.023
 cuc-0.022
Token his
Token ID465
Feature activation+1.74e-03

Pos logit contributions
 tast+0.006
 scr+0.006
abb+0.005
 ang+0.005
 funn+0.005
Neg logit contributions
chair-0.003
ase-0.003
overs-0.003
lf-0.003
 corner-0.003
Token rad
Token ID2511
Feature activation+7.76e-02

Pos logit contributions
 tast+0.002
 ang+0.002
 scr+0.002
 Pars+0.002
 ju+0.002
Neg logit contributions
chair-0.004
ase-0.003
belt-0.003
lf-0.003
 corner-0.003
Tokenishes
Token ID5614
Feature activation+3.64e-02

Pos logit contributions
 tast+0.097
 scr+0.094
 ang+0.094
 funn+0.089
 cal+0.087
Neg logit contributions
ase-0.161
belt-0.149
chair-0.148
overs-0.139
eddy-0.131
Token.
Token ID13
Feature activation-2.95e-03

Pos logit contributions
 tast+0.066
 ang+0.061
 scr+0.061
 Pars+0.060
 swe+0.058
Neg logit contributions
chair-0.059
ase-0.056
belt-0.054
lf-0.051
overs-0.048
Token He
Token ID679
Feature activation+4.21e-03

Pos logit contributions
chair+0.005
ase+0.004
belt+0.004
lf+0.004
overs+0.003
Neg logit contributions
 tast-0.006
 scr-0.006
 ang-0.006
 cuc-0.005
 swe-0.005
Token laid
Token ID8104
Feature activation-3.49e-02

Pos logit contributions
 tast+0.007
 ang+0.006
 scr+0.006
 cuc+0.006
 Pars+0.006
Neg logit contributions
chair-0.008
ase-0.008
belt-0.007
lf-0.007
 corner-0.006
Token the
Token ID262
Feature activation-1.92e-02

Pos logit contributions
chair+0.063
ase+0.059
belt+0.057
lf+0.055
overs+0.049
Neg logit contributions
 tast-0.062
 ang-0.057
 scr-0.057
 cuc-0.055
 ju-0.054
Token songs
Token ID7259
Feature activation+9.87e-04

Pos logit contributions
chair+0.012
ase+0.012
belt+0.012
lf+0.011
 doorway+0.010
Neg logit contributions
 tast-0.009
 scr-0.008
 ang-0.008
 cuc-0.007
 Pars-0.007
Token and
Token ID290
Feature activation+4.46e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 cuc+0.002
 ju+0.002
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.001
overs-0.001
Token danced
Token ID39480
Feature activation+1.94e-02

Pos logit contributions
 tast+0.008
 ang+0.008
 scr+0.007
 Pars+0.007
 ju+0.007
Neg logit contributions
chair-0.008
lf-0.007
ase-0.007
belt-0.006
 corner-0.006
Token around
Token ID1088
Feature activation-1.38e-02

Pos logit contributions
 tast+0.035
 scr+0.032
 ang+0.032
 cuc+0.031
 ju+0.030
Neg logit contributions
chair-0.034
ase-0.033
belt-0.031
lf-0.029
overs-0.027
Token the
Token ID262
Feature activation+9.27e-03

Pos logit contributions
chair+0.024
ase+0.023
belt+0.021
lf+0.021
overs+0.019
Neg logit contributions
 tast-0.025
 scr-0.023
 ang-0.023
 cuc-0.022
 ju-0.022
Token rad
Token ID2511
Feature activation+7.75e-02

Pos logit contributions
 tast+0.018
 scr+0.017
 ang+0.016
 cuc+0.016
 ju+0.016
Neg logit contributions
chair-0.015
ase-0.014
belt-0.014
lf-0.013
overs-0.012
Tokenish
Token ID680
Feature activation+3.50e-03

Pos logit contributions
 scr+0.106
 tast+0.105
 ang+0.102
 funn+0.096
 cal+0.094
Neg logit contributions
ase-0.163
chair-0.142
belt-0.136
 corner-0.131
overs-0.130
Token.
Token ID13
Feature activation-1.03e-03

Pos logit contributions
 tast+0.006
 ang+0.006
 scr+0.005
 ju+0.005
 Pars+0.005
Neg logit contributions
chair-0.006
ase-0.006
lf-0.005
belt-0.005
 corner-0.005
Token
Token ID198
Feature activation-6.96e-04

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
overs+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.002
 Pars-0.002
Token
Token ID198
Feature activation+3.09e-03

Pos logit contributions
chair+0.001
belt+0.001
ase+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 Pars-0.001
TokenBut
Token ID1537
Feature activation-2.78e-02

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.006
ase-0.005
belt-0.005
lf-0.005
overs-0.004
Token started
Token ID2067
Feature activation+1.63e-03

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
overs+0.003
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.002
 ju-0.002
Token to
Token ID284
Feature activation+1.84e-02

Pos logit contributions
 tast+0.004
 ju+0.004
 scr+0.003
So+0.003
abb+0.003
Neg logit contributions
chair-0.001
ase-0.001
 corner-0.001
belt-0.001
overs-0.001
Token chew
Token ID34722
Feature activation+1.73e-02

Pos logit contributions
 tast+0.030
abb+0.029
So+0.028
 based+0.027
 unnecessary+0.026
Neg logit contributions
 corner-0.025
lf-0.025
chair-0.025
 hallway-0.023
 doorway-0.022
Token on
Token ID319
Feature activation+8.15e-03

Pos logit contributions
 tast+0.034
 ang+0.032
 scr+0.031
 Pars+0.030
 cuc+0.030
Neg logit contributions
chair-0.026
ase-0.024
belt-0.022
lf-0.022
overs-0.021
Token some
Token ID617
Feature activation+2.88e-02

Pos logit contributions
 tast+0.017
 ang+0.016
 scr+0.015
abb+0.015
 ju+0.015
Neg logit contributions
chair-0.011
ase-0.010
lf-0.009
overs-0.009
belt-0.009
Token seaw
Token ID32085
Feature activation+7.71e-02

Pos logit contributions
 ang+0.046
 tast+0.046
 Pars+0.045
 scr+0.042
 argue+0.041
Neg logit contributions
chair-0.050
ase-0.044
 corner-0.044
lf-0.044
overs-0.041
Tokeneed
Token ID2308
Feature activation+2.28e-02

Pos logit contributions
 ang+0.136
 tast+0.134
 scr+0.119
 Pars+0.117
 cal+0.116
Neg logit contributions
ase-0.134
chair-0.125
belt-0.121
board-0.115
lf-0.114
Token.
Token ID13
Feature activation-7.39e-04

Pos logit contributions
 tast+0.043
 ang+0.042
 scr+0.040
 Pars+0.040
 ju+0.038
Neg logit contributions
chair-0.035
ase-0.033
belt-0.030
lf-0.028
 corner-0.028
Token Mary
Token ID5335
Feature activation-3.29e-02

Pos logit contributions
belt+0.001
ase+0.001
chair+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 Pars-0.001
 cuc-0.001
Token was
Token ID373
Feature activation-1.15e-02

Pos logit contributions
chair+0.054
ase+0.050
belt+0.048
lf+0.046
overs+0.041
Neg logit contributions
 tast-0.063
 ang-0.058
 scr-0.058
 cuc-0.056
 ju-0.055
Token so
Token ID523
Feature activation-1.41e-02

Pos logit contributions
chair+0.025
ase+0.023
belt+0.023
lf+0.022
 corner+0.021
Neg logit contributions
 tast-0.015
 cuc-0.014
 scr-0.013
 ang-0.013
 Pars-0.013
Token,
Token ID11
Feature activation+5.83e-03

Pos logit contributions
 tast+0.024
 scr+0.022
 ang+0.021
 cuc+0.020
 ju+0.020
Neg logit contributions
chair-0.030
ase-0.029
belt-0.027
lf-0.026
overs-0.024
Token and
Token ID290
Feature activation-5.28e-03

Pos logit contributions
 tast+0.009
 scr+0.008
 ang+0.008
 cuc+0.008
 ju+0.008
Neg logit contributions
chair-0.012
ase-0.012
belt-0.011
lf-0.010
issa-0.009
Token all
Token ID477
Feature activation+1.53e-04

Pos logit contributions
chair+0.010
ase+0.010
belt+0.009
lf+0.009
overs+0.008
Neg logit contributions
 tast-0.009
 scr-0.008
 ang-0.008
 cuc-0.008
 ju-0.007
Token the
Token ID262
Feature activation-1.12e-03

Pos logit contributions
 tast+0.000
 scr+0.000
 ang+0.000
 cuc+0.000
 rad+0.000
Neg logit contributions
chair-0.000
ase-0.000
belt-0.000
lf-0.000
issa-0.000
Token other
Token ID584
Feature activation-1.32e-02

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
 corner+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 Pars-0.002
 cuc-0.002
Token av
Token ID1196
Feature activation+7.67e-02

Pos logit contributions
chair+0.024
belt+0.023
ase+0.023
lf+0.021
overs+0.019
Neg logit contributions
 tast-0.024
 scr-0.021
 cuc-0.021
 ang-0.021
 ju-0.020
Tokenoc
Token ID420
Feature activation-4.05e-02

Pos logit contributions
 tast+0.136
 ang+0.133
 scr+0.132
 Pars+0.124
 swe+0.122
Neg logit contributions
chair-0.130
ase-0.129
belt-0.114
lf-0.111
overs-0.102
Tokenados
Token ID22484
Feature activation+4.45e-02

Pos logit contributions
chair+0.087
ase+0.083
belt+0.080
lf+0.076
overs+0.071
Neg logit contributions
 tast-0.058
 scr-0.052
 ang-0.052
 cuc-0.049
 ju-0.048
Token were
Token ID547
Feature activation+1.96e-02

Pos logit contributions
 tast+0.067
 scr+0.060
 ang+0.059
 cuc+0.058
 ju+0.055
Neg logit contributions
chair-0.091
ase-0.087
belt-0.084
lf-0.079
overs-0.074
Token very
Token ID845
Feature activation+2.14e-03

Pos logit contributions
 tast+0.029
 ang+0.027
 scr+0.027
 Pars+0.026
 cuc+0.026
Neg logit contributions
chair-0.037
ase-0.036
belt-0.035
lf-0.035
overs-0.032
Token happy
Token ID3772
Feature activation+1.44e-02

Pos logit contributions
 tast+0.003
 ang+0.003
 scr+0.003
 cuc+0.003
 Pars+0.003
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
lf-0.004
overs-0.004
Token world
Token ID995
Feature activation+8.81e-04

Pos logit contributions
 tast+0.004
 ang+0.003
 scr+0.003
 cuc+0.003
 Pars+0.003
Neg logit contributions
chair-0.005
ase-0.005
belt-0.004
lf-0.004
 corner-0.004
Token where
Token ID810
Feature activation+3.67e-03

Pos logit contributions
 tast+0.002
 scr+0.001
 ang+0.001
 cuc+0.001
 ju+0.001
Neg logit contributions
chair-0.002
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Token everyone
Token ID2506
Feature activation-1.26e-02

Pos logit contributions
 tast+0.006
 ang+0.006
 scr+0.006
 cuc+0.005
 rad+0.005
Neg logit contributions
ase-0.007
chair-0.007
belt-0.006
lf-0.006
 doorway-0.006
Token shared
Token ID4888
Feature activation+4.44e-03

Pos logit contributions
chair+0.022
ase+0.022
belt+0.021
lf+0.020
overs+0.018
Neg logit contributions
 tast-0.022
 scr-0.020
 ang-0.020
 cuc-0.019
 ju-0.018
Token their
Token ID511
Feature activation+1.98e-02

Pos logit contributions
 tast+0.008
 scr+0.007
 ang+0.007
 cuc+0.007
 ju+0.007
Neg logit contributions
chair-0.008
ase-0.007
belt-0.007
lf-0.007
overs-0.006
Token rad
Token ID2511
Feature activation+7.59e-02

Pos logit contributions
 tast+0.030
 scr+0.027
 ang+0.027
 cuc+0.026
 ju+0.025
Neg logit contributions
chair-0.041
ase-0.039
belt-0.038
lf-0.035
overs-0.033
Tokenishes
Token ID5614
Feature activation+3.42e-02

Pos logit contributions
 tast+0.081
 ang+0.078
 scr+0.078
 Pars+0.072
 cal+0.068
Neg logit contributions
ase-0.169
chair-0.165
belt-0.163
overs-0.150
lf-0.148
Token.
Token ID13
Feature activation+1.09e-03

Pos logit contributions
 tast+0.062
 scr+0.057
 ang+0.057
 cuc+0.055
 ju+0.053
Neg logit contributions
chair-0.060
ase-0.057
belt-0.054
lf-0.051
overs-0.047
Token<|endoftext|>
Token ID50256
Feature activation-3.76e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 cuc+0.002
 Pars+0.002
Neg logit contributions
chair-0.002
belt-0.002
ase-0.002
lf-0.002
 doorway-0.002
TokenOnce
Token ID7454
Feature activation+5.06e-03

Pos logit contributions
belt+0.008
chair+0.008
lf+0.008
ase+0.008
overs+0.007
Neg logit contributions
 tast-0.004
 ang-0.004
 Pars-0.004
 cuc-0.004
 scr-0.004
Token upon
Token ID2402
Feature activation-6.22e-03

Pos logit contributions
 tast+0.008
 ang+0.007
 scr+0.007
 rad+0.007
 cuc+0.007
Neg logit contributions
chair-0.010
ase-0.009
belt-0.009
lf-0.008
person-0.008
Token continued
Token ID3767
Feature activation+4.07e-03

Pos logit contributions
 tast+0.029
 ang+0.028
 cuc+0.026
 scr+0.026
 Pars+0.026
Neg logit contributions
chair-0.021
ase-0.020
lf-0.018
belt-0.018
 corner-0.016
Token to
Token ID284
Feature activation+8.17e-03

Pos logit contributions
 tast+0.007
 scr+0.006
 ang+0.006
 cuc+0.006
 rad+0.006
Neg logit contributions
chair-0.007
ase-0.007
belt-0.007
lf-0.006
issa-0.006
Token scatter
Token ID41058
Feature activation+2.30e-02

Pos logit contributions
 tast+0.013
 ang+0.012
 scr+0.012
 cuc+0.011
 ju+0.011
Neg logit contributions
chair-0.016
ase-0.015
belt-0.014
lf-0.014
 corner-0.013
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
 tast+0.038
 ang+0.034
 scr+0.033
 cuc+0.032
 swe+0.032
Neg logit contributions
chair-0.045
ase-0.043
belt-0.042
lf-0.040
issa-0.036
Token bread
Token ID8509
Feature activation+2.07e-02

Pos logit contributions
chair+0.036
ase+0.035
belt+0.034
lf+0.032
overs+0.030
Neg logit contributions
 tast-0.028
 scr-0.025
 ang-0.025
 cuc-0.025
 ju-0.024
Tokencr
Token ID6098
Feature activation+7.59e-02

Pos logit contributions
 tast+0.039
 ang+0.036
 scr+0.036
 cuc+0.035
 Pars+0.034
Neg logit contributions
chair-0.034
ase-0.033
belt-0.031
lf-0.029
overs-0.027
Tokenumbs
Token ID18146
Feature activation+3.85e-02

Pos logit contributions
 ang+0.167
 tast+0.163
 Pars+0.155
 swe+0.152
 cal+0.151
Neg logit contributions
ase-0.104
chair-0.096
overs-0.075
belt-0.073
board-0.072
Token.
Token ID13
Feature activation-7.72e-03

Pos logit contributions
 tast+0.074
 scr+0.068
 ang+0.068
 cuc+0.066
 ju+0.064
Neg logit contributions
chair-0.063
ase-0.060
belt-0.057
lf-0.054
overs-0.049
Token<|endoftext|>
Token ID50256
Feature activation-3.70e-03

Pos logit contributions
chair+0.015
ase+0.014
belt+0.014
lf+0.014
 doorway+0.012
Neg logit contributions
 tast-0.012
 scr-0.011
 cuc-0.011
 ang-0.011
 Pars-0.010
TokenOnce
Token ID7454
Feature activation+2.85e-03

Pos logit contributions
belt+0.009
chair+0.008
lf+0.008
ase+0.008
overs+0.007
Neg logit contributions
 tast-0.004
 ang-0.003
 Pars-0.003
 cuc-0.003
 cal-0.003
Token upon
Token ID2402
Feature activation-1.11e-02

Pos logit contributions
 ang+0.004
 tast+0.004
 scr+0.004
 cuc+0.003
 rad+0.003
Neg logit contributions
chair-0.006
ase-0.006
belt-0.005
person-0.005
lf-0.005
Token and
Token ID290
Feature activation-2.35e-04

Pos logit contributions
 tast+0.041
 scr+0.037
 ang+0.037
 cuc+0.036
 ju+0.035
Neg logit contributions
chair-0.041
ase-0.039
belt-0.037
lf-0.035
overs-0.032
Token Sammy
Token ID44067
Feature activation+2.13e-02

Pos logit contributions
chair+0.000
lf+0.000
ase+0.000
belt+0.000
 corner+0.000
Neg logit contributions
 tast-0.000
 scr-0.000
 ang-0.000
 ju-0.000
 cuc-0.000
Token would
Token ID561
Feature activation-5.52e-03

Pos logit contributions
 tast+0.039
 ang+0.038
 scr+0.035
 ju+0.034
 swe+0.034
Neg logit contributions
ase-0.034
chair-0.034
lf-0.032
belt-0.028
board-0.026
Token often
Token ID1690
Feature activation+5.20e-03

Pos logit contributions
lf+0.008
chair+0.008
ase+0.008
 room+0.006
on+0.006
Neg logit contributions
 ang-0.011
 tast-0.010
 unnecessary-0.010
 ju-0.009
 swe-0.009
Token share
Token ID2648
Feature activation+2.05e-03

Pos logit contributions
 ang+0.009
 tast+0.009
 Pars+0.008
 ju+0.008
 scr+0.008
Neg logit contributions
chair-0.008
lf-0.008
ase-0.007
 corner-0.007
belt-0.007
Token av
Token ID1196
Feature activation+7.58e-02

Pos logit contributions
 tast+0.004
 ang+0.004
 Pars+0.004
 scr+0.004
 gag+0.004
Neg logit contributions
ase-0.002
lf-0.002
chair-0.002
overs-0.002
 corner-0.002
Tokenoc
Token ID420
Feature activation-4.07e-02

Pos logit contributions
 tast+0.139
 ang+0.137
 scr+0.135
 Pars+0.125
 die+0.123
Neg logit contributions
ase-0.124
chair-0.123
lf-0.107
belt-0.104
overs-0.097
Tokenados
Token ID22484
Feature activation+4.86e-02

Pos logit contributions
ase+0.073
chair+0.071
belt+0.065
lf+0.064
overs+0.060
Neg logit contributions
 tast-0.070
 scr-0.066
 ang-0.063
 ju-0.062
 cuc-0.059
Token and
Token ID290
Feature activation+7.19e-03

Pos logit contributions
 tast+0.084
 ang+0.079
 scr+0.079
 cuc+0.074
 Pars+0.074
Neg logit contributions
chair-0.087
ase-0.083
belt-0.077
lf-0.076
overs-0.069
Token other
Token ID584
Feature activation+1.07e-02

Pos logit contributions
 tast+0.011
 ang+0.011
 scr+0.010
 Pars+0.010
abb+0.009
Neg logit contributions
chair-0.014
lf-0.012
ase-0.012
belt-0.012
 corner-0.011
Token y
Token ID331
Feature activation+2.57e-02

Pos logit contributions
 Pars+0.020
 tast+0.020
 ang+0.019
 scr+0.019
 tul+0.019
Neg logit contributions
chair-0.015
ase-0.014
lf-0.013
overs-0.012
 corner-0.012
Token prepared
Token ID5597
Feature activation-9.41e-03

Pos logit contributions
 tast+0.014
 scr+0.012
 ang+0.012
 cuc+0.012
 swe+0.011
Neg logit contributions
chair-0.020
ase-0.019
belt-0.018
lf-0.017
board-0.016
Token,
Token ID11
Feature activation-2.69e-03

Pos logit contributions
chair+0.019
ase+0.019
belt+0.017
lf+0.017
 doorway+0.015
Neg logit contributions
 tast-0.015
 scr-0.013
 cuc-0.013
 swe-0.013
 ang-0.013
Token even
Token ID772
Feature activation-6.37e-03

Pos logit contributions
chair+0.006
ase+0.006
belt+0.005
lf+0.005
 doorway+0.005
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.003
 cuc-0.003
 ju-0.003
Token when
Token ID618
Feature activation+1.91e-02

Pos logit contributions
chair+0.012
belt+0.011
ase+0.011
lf+0.011
 corner+0.010
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 cuc-0.008
 Pars-0.008
Token things
Token ID1243
Feature activation+1.28e-02

Pos logit contributions
 tast+0.033
 scr+0.030
 ang+0.030
 cuc+0.029
 rad+0.029
Neg logit contributions
ase-0.035
chair-0.035
belt-0.032
lf-0.029
 doorway-0.027
Token seem
Token ID1283
Feature activation+7.58e-02

Pos logit contributions
 tast+0.018
 ang+0.016
 scr+0.016
 cuc+0.015
 ju+0.014
Neg logit contributions
chair-0.028
ase-0.026
belt-0.026
lf-0.025
overs-0.023
Token mild
Token ID11607
Feature activation+2.70e-02

Pos logit contributions
 Pars+0.113
 tast+0.112
 cuc+0.111
 scr+0.108
Together+0.107
Neg logit contributions
chair-0.125
 corner-0.116
belt-0.112
 room-0.104
 doorway-0.101
Token.
Token ID13
Feature activation+3.26e-03

Pos logit contributions
 tast+0.046
 ang+0.042
 scr+0.042
 cuc+0.041
 Pars+0.039
Neg logit contributions
chair-0.050
ase-0.047
belt-0.045
lf-0.042
overs-0.040
Tokenâ
Token ID22940
Feature activation+1.77e-02

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.006
ase-0.006
belt-0.006
lf-0.006
overs-0.005
Token
Token ID26391
Feature activation-2.77e-02

Pos logit contributions
 tast+0.031
 ang+0.029
 scr+0.029
 cuc+0.027
 ju+0.027
Neg logit contributions
chair-0.032
ase-0.030
belt-0.029
lf-0.028
overs-0.025
Token
Token ID198
Feature activation+6.28e-03

Pos logit contributions
chair+0.055
ase+0.054
belt+0.052
lf+0.050
 corner+0.047
Neg logit contributions
 tast-0.038
 scr-0.037
 ang-0.035
 ju-0.034
 Pars-0.033
Token It
Token ID632
Feature activation+1.05e-02

Pos logit contributions
chair+0.003
belt+0.003
ase+0.003
lf+0.003
 corner+0.002
Neg logit contributions
 tast-0.003
 scr-0.002
 ang-0.002
 cuc-0.002
 Pars-0.002
Token became
Token ID2627
Feature activation+2.52e-02

Pos logit contributions
 tast+0.019
 scr+0.018
 ang+0.017
 cuc+0.016
 ju+0.016
Neg logit contributions
chair-0.019
ase-0.018
belt-0.018
lf-0.016
overs-0.015
Token the
Token ID262
Feature activation-2.63e-02

Pos logit contributions
 tast+0.042
 scr+0.041
 ang+0.040
 cuc+0.040
 Pars+0.040
Neg logit contributions
chair-0.037
belt-0.036
 corner-0.035
ase-0.034
overs-0.033
Token happiest
Token ID49414
Feature activation-2.67e-02

Pos logit contributions
chair+0.060
ase+0.057
belt+0.055
lf+0.053
overs+0.050
Neg logit contributions
 tast-0.034
 scr-0.030
 ang-0.030
 cuc-0.029
 ju-0.028
Token books
Token ID3835
Feature activation-4.92e-02

Pos logit contributions
belt+0.050
ase+0.044
chair+0.041
ana+0.039
glass+0.039
Neg logit contributions
 tast-0.041
 rad-0.038
 cuc-0.036
 y-0.036
 swe-0.035
Tokenhe
Token ID258
Feature activation-1.19e-01

Pos logit contributions
belt+0.083
chair+0.083
ase+0.083
child+0.080
kid+0.077
Neg logit contributions
 tast-0.075
 better-0.067
 scr-0.065
 ju-0.063
 grow-0.062
Tokenlf
Token ID1652
Feature activation-4.78e-02

Pos logit contributions
chair+0.135
belt+0.118
ase+0.105
shirt+0.103
haired+0.098
Neg logit contributions
 rad-0.248
 cuc-0.238
 grow-0.226
 scr-0.218
 qu-0.208
Token in
Token ID287
Feature activation-1.36e-03

Pos logit contributions
chair+0.086
ase+0.081
belt+0.077
Jake+0.076
 doorway+0.076
Neg logit contributions
 tast-0.063
 scr-0.057
 rad-0.056
 cuc-0.055
 better-0.054
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
board+0.003
lf+0.002
Neg logit contributions
 tast-0.002
 cuc-0.002
 scr-0.002
 rad-0.002
 ang-0.002
Token world
Token ID995
Feature activation-3.58e-03

Pos logit contributions
belt+0.022
chair+0.020
ase+0.020
lf+0.018
person+0.018
Neg logit contributions
 tast-0.020
 ju-0.017
 scr-0.016
 y-0.016
 cal-0.016
Token!
Token ID0
Feature activation-1.01e-02

Pos logit contributions
chair+0.007
ase+0.007
belt+0.007
lf+0.007
issa+0.006
Neg logit contributions
 tast-0.005
 scr-0.004
 ju-0.004
 cal-0.004
 cuc-0.004
Token can
Token ID460
Feature activation+1.12e-02

Pos logit contributions
 tast+0.006
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.011
ase-0.011
belt-0.010
lf-0.010
overs-0.009
Token't
Token ID470
Feature activation-2.44e-02

Pos logit contributions
 tast+0.016
 ang+0.015
 scr+0.015
 cuc+0.014
 ju+0.013
Neg logit contributions
chair-0.024
ase-0.023
belt-0.022
lf-0.021
overs-0.019
Token reach
Token ID3151
Feature activation-7.85e-03

Pos logit contributions
chair+0.049
ase+0.045
lf+0.044
belt+0.043
 corner+0.039
Neg logit contributions
 tast-0.038
 ang-0.036
 scr-0.034
 cuc-0.033
 swe-0.033
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
chair+0.014
ase+0.013
belt+0.013
lf+0.012
overs+0.011
Neg logit contributions
 tast-0.014
 scr-0.013
 ang-0.013
 cuc-0.012
 ju-0.012
Token books
Token ID3835
Feature activation-3.59e-02

Pos logit contributions
ase+0.023
chair+0.022
belt+0.022
overs+0.020
lf+0.020
Neg logit contributions
 tast-0.028
 scr-0.025
 ju-0.024
 cuc-0.023
 ang-0.023
Tokenhe
Token ID258
Feature activation-1.01e-01

Pos logit contributions
chair+0.036
ase+0.034
belt+0.032
lf+0.029
 doorway+0.026
Neg logit contributions
 tast-0.088
 scr-0.083
 ang-0.080
 ju-0.079
 cuc-0.078
Tokenlf
Token ID1652
Feature activation-2.86e-02

Pos logit contributions
belt+0.101
ase+0.096
chair+0.096
overs+0.095
ugg+0.087
Neg logit contributions
 grow-0.190
 cuc-0.187
 scr-0.179
 rad-0.173
 y-0.173
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
chair+0.055
ase+0.053
belt+0.051
lf+0.049
 doorway+0.046
Neg logit contributions
 tast-0.044
 scr-0.040
 ang-0.039
 cuc-0.038
 ju-0.037
Token Can
Token ID1680
Feature activation+1.23e-02

Pos logit contributions
chair+0.010
ase+0.010
lf+0.009
belt+0.009
 doorway+0.008
Neg logit contributions
 tast-0.013
 scr-0.012
 ju-0.012
 Pars-0.012
 cuc-0.011
Token you
Token ID345
Feature activation-4.65e-04

Pos logit contributions
 tast+0.020
 scr+0.019
 ang+0.018
 Pars+0.018
 rad+0.018
Neg logit contributions
ase-0.023
chair-0.022
belt-0.020
 doorway-0.020
lf-0.019
Token reach
Token ID3151
Feature activation+3.77e-03

Pos logit contributions
chair+0.001
belt+0.001
ase+0.001
lf+0.001
 corner+0.001
Neg logit contributions
 tast-0.001
 ang-0.001
 cuc-0.001
 swe-0.001
 scr-0.001
Token a
Token ID257
Feature activation+3.08e-03

Pos logit contributions
 tast+0.003
 scr+0.002
 cuc+0.002
 ang+0.002
 Pars+0.002
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
anger-0.004
board-0.004
Token room
Token ID2119
Feature activation-2.60e-02

Pos logit contributions
 tast+0.005
 scr+0.004
 cuc+0.004
 ang+0.004
 swe+0.004
Neg logit contributions
chair-0.006
ase-0.006
belt-0.006
lf-0.005
overs-0.005
Token.
Token ID13
Feature activation+1.07e-03

Pos logit contributions
chair+0.059
belt+0.057
ase+0.055
issa+0.052
 doorway+0.050
Neg logit contributions
 tast-0.026
 cuc-0.025
 scr-0.025
 grow-0.022
 rad-0.022
Token The
Token ID383
Feature activation+2.31e-03

Pos logit contributions
 tast+0.002
 cuc+0.002
 scr+0.001
 ju+0.001
 rad+0.001
Neg logit contributions
belt-0.002
chair-0.002
ase-0.002
lf-0.002
issa-0.002
Token books
Token ID3835
Feature activation-3.60e-02

Pos logit contributions
 tast+0.004
 scr+0.004
 ju+0.004
 y+0.004
 swe+0.003
Neg logit contributions
ase-0.004
belt-0.004
chair-0.004
overs-0.004
person-0.003
Tokenhe
Token ID258
Feature activation-1.00e-01

Pos logit contributions
belt+0.065
overs+0.058
ana+0.058
kid+0.057
child+0.057
Neg logit contributions
 grow-0.048
 tast-0.048
 y-0.044
 rust-0.043
 better-0.041
Tokenlf
Token ID1652
Feature activation-3.16e-02

Pos logit contributions
overs+0.126
belt+0.117
 outage+0.111
person+0.106
chair+0.103
Neg logit contributions
 grow-0.169
 cuc-0.165
 scr-0.164
 pr-0.155
 b-0.152
Token was
Token ID373
Feature activation-6.86e-03

Pos logit contributions
belt+0.059
overs+0.057
chair+0.057
 doorway+0.057
ase+0.053
Neg logit contributions
 tast-0.039
 scr-0.037
 grow-0.035
 rad-0.034
 rust-0.033
Token filled
Token ID5901
Feature activation-1.94e-02

Pos logit contributions
chair+0.017
ase+0.017
belt+0.016
lf+0.015
board+0.015
Neg logit contributions
 tast-0.008
 scr-0.007
 ang-0.006
 ju-0.006
 cuc-0.006
Token with
Token ID351
Feature activation+1.28e-03

Pos logit contributions
chair+0.038
belt+0.033
ase+0.033
 doorway+0.030
issa+0.029
Neg logit contributions
 tast-0.033
 cuc-0.029
 scr-0.028
 ju-0.027
 ang-0.027
Token books
Token ID3835
Feature activation-2.15e-02

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 cuc+0.002
 ju+0.002
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
overs-0.002
Token,
Token ID11
Feature activation+1.34e-04

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 Pars+0.004
Neg logit contributions
chair-0.007
ase-0.006
belt-0.006
lf-0.006
overs-0.005
Token I
Token ID314
Feature activation-2.67e-03

Pos logit contributions
 tast+0.000
 scr+0.000
 ju+0.000
 cuc+0.000
 ang+0.000
Neg logit contributions
ase-0.000
chair-0.000
belt-0.000
lf-0.000
overs-0.000
Token finished
Token ID5201
Feature activation-7.23e-03

Pos logit contributions
chair+0.005
ase+0.005
belt+0.005
lf+0.005
overs+0.004
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.003
Token the
Token ID262
Feature activation-6.00e-03

Pos logit contributions
chair+0.013
ase+0.013
belt+0.012
lf+0.011
 corner+0.010
Neg logit contributions
 tast-0.013
 scr-0.012
 cuc-0.011
 ang-0.011
 ju-0.011
Token books
Token ID3835
Feature activation-3.00e-02

Pos logit contributions
ase+0.011
chair+0.010
belt+0.010
overs+0.009
lf+0.009
Neg logit contributions
 tast-0.012
 scr-0.010
 ju-0.010
 swe-0.010
 funn-0.010
Tokenhe
Token ID258
Feature activation-1.00e-01

Pos logit contributions
ase+0.035
chair+0.033
belt+0.031
lf+0.029
overs+0.027
Neg logit contributions
 tast-0.069
 scr-0.063
 ang-0.061
 ju-0.061
 better-0.059
Tokenlf
Token ID1652
Feature activation-2.11e-02

Pos logit contributions
ase+0.099
belt+0.098
chair+0.094
haired+0.092
overs+0.081
Neg logit contributions
 cuc-0.191
 rad-0.183
 scr-0.183
 grow-0.179
 y-0.175
Token!"
Token ID2474
Feature activation+1.13e-03

Pos logit contributions
ase+0.045
chair+0.045
belt+0.042
lf+0.040
 doorway+0.039
Neg logit contributions
 tast-0.028
 scr-0.025
 ang-0.024
 ju-0.023
 cuc-0.023
Token Andy
Token ID12382
Feature activation+3.62e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 cuc+0.002
 ju+0.002
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
overs-0.002
Token said
Token ID531
Feature activation-1.39e-02

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.004
 ju+0.004
Neg logit contributions
chair-0.008
ase-0.007
belt-0.007
lf-0.007
overs-0.006
Token with
Token ID351
Feature activation+9.62e-04

Pos logit contributions
chair+0.027
ase+0.026
belt+0.025
lf+0.024
overs+0.022
Neg logit contributions
 tast-0.022
 scr-0.020
 ang-0.020
 cuc-0.019
 Pars-0.019
Token She
Token ID1375
Feature activation+1.91e-03

Pos logit contributions
 tast+0.018
 scr+0.017
 ang+0.017
 cuc+0.016
 ju+0.016
Neg logit contributions
chair-0.020
ase-0.019
belt-0.019
lf-0.018
overs-0.016
Token had
Token ID550
Feature activation-2.20e-02

Pos logit contributions
 tast+0.003
 scr+0.003
 cuc+0.003
 rad+0.003
 grow+0.003
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
overs-0.003
 doorway-0.003
Token a
Token ID257
Feature activation-5.82e-03

Pos logit contributions
chair+0.052
ase+0.051
belt+0.051
lf+0.048
loo+0.046
Neg logit contributions
 tast-0.027
 scr-0.023
 ju-0.022
 funn-0.020
 cuc-0.020
Token big
Token ID1263
Feature activation-1.27e-02

Pos logit contributions
chair+0.010
ase+0.010
belt+0.009
lf+0.009
 corner+0.008
Neg logit contributions
 tast-0.010
 ang-0.010
 scr-0.009
 cuc-0.009
 Pars-0.009
Token books
Token ID3835
Feature activation-3.78e-02

Pos logit contributions
chair+0.022
ase+0.021
belt+0.020
lf+0.019
overs+0.017
Neg logit contributions
 tast-0.023
 scr-0.021
 ang-0.021
 cuc-0.021
 ju-0.020
Tokenhe
Token ID258
Feature activation-9.95e-02

Pos logit contributions
belt+0.045
chair+0.045
child+0.044
ase+0.043
kids+0.043
Neg logit contributions
 tast-0.075
 better-0.067
 scr-0.063
 grow-0.061
 ju-0.061
Tokenlf
Token ID1652
Feature activation-2.33e-02

Pos logit contributions
chair+0.093
belt+0.088
ase+0.082
 doorway+0.072
person+0.072
Neg logit contributions
 scr-0.211
 cuc-0.201
 rad-0.197
 grow-0.196
 pr-0.191
Token in
Token ID287
Feature activation+1.46e-03

Pos logit contributions
chair+0.040
ase+0.039
belt+0.037
lf+0.036
 doorway+0.035
Neg logit contributions
 tast-0.038
 scr-0.034
 ang-0.031
 ju-0.030
 cuc-0.030
Token her
Token ID607
Feature activation-1.23e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 swe+0.002
 ang+0.002
 tul+0.002
Neg logit contributions
chair-0.003
ase-0.003
belt-0.003
board-0.003
loo-0.003
Token room
Token ID2119
Feature activation-2.34e-02

Pos logit contributions
chair+0.002
ase+0.001
belt+0.001
lf+0.001
person+0.001
Neg logit contributions
 tast-0.003
 scr-0.003
 swe-0.002
 prett-0.002
 ang-0.002
Token with
Token ID351
Feature activation+7.31e-03

Pos logit contributions
chair+0.050
belt+0.047
ase+0.046
issa+0.042
lf+0.042
Neg logit contributions
 scr-0.028
 tast-0.028
 cuc-0.024
 grow-0.024
 spr-0.023
Token he
Token ID339
Feature activation-1.22e-02

Pos logit contributions
ase+0.009
chair+0.009
belt+0.008
lf+0.008
overs+0.007
Neg logit contributions
 tast-0.009
 ang-0.008
 scr-0.008
 cuc-0.008
 ju-0.007
Token finally
Token ID3443
Feature activation-8.57e-03

Pos logit contributions
chair+0.025
ase+0.024
belt+0.023
overs+0.022
lf+0.021
Neg logit contributions
 tast-0.018
 scr-0.017
 ang-0.017
 cuc-0.016
 grow-0.016
Token found
Token ID1043
Feature activation-1.61e-02

Pos logit contributions
chair+0.016
ase+0.016
belt+0.015
overs+0.014
lf+0.014
Neg logit contributions
 tast-0.014
 scr-0.013
 ang-0.013
 cuc-0.012
 ju-0.012
Token the
Token ID262
Feature activation-2.39e-02

Pos logit contributions
chair+0.038
ase+0.036
belt+0.035
lf+0.034
overs+0.030
Neg logit contributions
 tast-0.021
 ang-0.019
 cuc-0.018
 ju-0.017
 scr-0.017
Token books
Token ID3835
Feature activation-3.17e-02

Pos logit contributions
ase+0.044
chair+0.042
belt+0.041
lf+0.039
overs+0.038
Neg logit contributions
 tast-0.045
 ju-0.037
 scr-0.036
 funn-0.036
 swe-0.035
Tokenhe
Token ID258
Feature activation-9.94e-02

Pos logit contributions
ase+0.040
chair+0.036
belt+0.035
child+0.033
issa+0.032
Neg logit contributions
 tast-0.068
 scr-0.063
 better-0.062
 ang-0.060
 ju-0.059
Tokenlf
Token ID1652
Feature activation-2.53e-02

Pos logit contributions
belt+0.115
idd+0.112
ase+0.112
overs+0.111
haired+0.110
Neg logit contributions
 scr-0.163
 grow-0.163
 y-0.151
 cuc-0.151
 rust-0.147
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
ase+0.051
chair+0.050
belt+0.046
lf+0.044
issa+0.043
Neg logit contributions
 tast-0.036
 scr-0.033
 ang-0.031
 rust-0.029
 ju-0.029
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
ase+0.027
chair+0.026
belt+0.024
loo+0.022
lf+0.022
Neg logit contributions
 tast-0.019
 scr-0.017
 ang-0.016
 ju-0.016
 cuc-0.016
Token he
Token ID339
Feature activation-1.05e-02

Pos logit contributions
ase+0.031
chair+0.029
belt+0.027
overs+0.026
anny+0.025
Neg logit contributions
 tast-0.020
 scr-0.018
 ang-0.018
 cuc-0.018
 ju-0.017
Token stepped
Token ID10764
Feature activation+3.50e-03

Pos logit contributions
chair+0.020
ase+0.020
belt+0.019
overs+0.018
loo+0.017
Neg logit contributions
 tast-0.017
 scr-0.016
 cuc-0.015
 ang-0.015
 ju-0.014
Token.
Token ID13
Feature activation+4.03e-03

Pos logit contributions
chair+0.027
ase+0.026
belt+0.026
lf+0.024
loo+0.023
Neg logit contributions
 scr-0.015
 tast-0.015
 ang-0.013
 cuc-0.012
 swe-0.012
Token She
Token ID1375
Feature activation+5.68e-03

Pos logit contributions
 scr+0.006
 tast+0.006
 cuc+0.005
 Pars+0.005
 ang+0.005
Neg logit contributions
chair-0.008
ase-0.008
belt-0.008
lf-0.008
loo-0.007
Token saw
Token ID2497
Feature activation-2.10e-02

Pos logit contributions
 scr+0.008
 tast+0.008
 ang+0.007
 cuc+0.007
 swe+0.007
Neg logit contributions
ase-0.012
chair-0.012
belt-0.011
overs-0.010
loo-0.010
Token a
Token ID257
Feature activation-1.19e-02

Pos logit contributions
ase+0.046
belt+0.046
chair+0.045
lf+0.041
overs+0.039
Neg logit contributions
 tast-0.028
 scr-0.026
 ang-0.026
 cuc-0.024
 swe-0.023
Token books
Token ID3835
Feature activation-4.09e-02

Pos logit contributions
chair+0.023
ase+0.022
belt+0.021
lf+0.021
 corner+0.019
Neg logit contributions
 tast-0.019
 ang-0.017
 scr-0.017
 cuc-0.016
 Pars-0.016
Tokenhe
Token ID258
Feature activation-9.92e-02

Pos logit contributions
ase+0.043
belt+0.042
chair+0.040
overs+0.035
child+0.033
Neg logit contributions
 tast-0.093
 scr-0.090
 rust-0.086
 ang-0.085
 grow-0.084
Tokenlf
Token ID1652
Feature activation-2.70e-02

Pos logit contributions
ase+0.086
belt+0.082
chair+0.080
person+0.063
overs+0.060
Neg logit contributions
 scr-0.236
 grow-0.225
 cuc-0.217
 rad-0.214
 rust-0.213
Token with
Token ID351
Feature activation-6.66e-03

Pos logit contributions
chair+0.045
ase+0.044
belt+0.042
lf+0.038
overs+0.037
Neg logit contributions
 tast-0.047
 scr-0.045
 ang-0.043
 cuc-0.041
 rust-0.040
Token many
Token ID867
Feature activation-1.06e-03

Pos logit contributions
chair+0.013
ase+0.013
belt+0.012
lf+0.011
overs+0.011
Neg logit contributions
 tast-0.011
 scr-0.010
 cuc-0.009
 ang-0.009
 ju-0.009
Token books
Token ID3835
Feature activation-2.83e-02

Pos logit contributions
ase+0.002
belt+0.002
chair+0.002
overs+0.002
lf+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 swe-0.002
 cuc-0.002
 ju-0.002
Token on
Token ID319
Feature activation-1.42e-02

Pos logit contributions
ase+0.057
belt+0.055
chair+0.054
overs+0.050
lf+0.049
Neg logit contributions
 tast-0.037
 scr-0.035
 rust-0.031
 grow-0.031
 better-0.030
Token book
Token ID1492
Feature activation-1.58e-02

Pos logit contributions
 tast+0.007
 ang+0.006
 scr+0.006
 cuc+0.006
 swe+0.006
Neg logit contributions
ase-0.006
belt-0.005
chair-0.005
overs-0.005
lf-0.005
Token falls
Token ID8953
Feature activation-6.21e-03

Pos logit contributions
ase+0.031
chair+0.029
belt+0.029
overs+0.028
lf+0.028
Neg logit contributions
 tast-0.024
 ang-0.022
 scr-0.022
 cal-0.020
 swe-0.020
Token off
Token ID572
Feature activation-3.03e-02

Pos logit contributions
chair+0.011
ase+0.010
belt+0.010
lf+0.009
 corner+0.009
Neg logit contributions
 tast-0.011
 scr-0.010
 cuc-0.010
 ang-0.010
 ju-0.010
Token the
Token ID262
Feature activation-9.65e-04

Pos logit contributions
chair+0.063
belt+0.061
ase+0.061
lf+0.055
issa+0.053
Neg logit contributions
 tast-0.040
 scr-0.038
 ang-0.037
 cuc-0.036
 rad-0.034
Token books
Token ID3835
Feature activation-3.23e-02

Pos logit contributions
ase+0.002
belt+0.001
overs+0.001
chair+0.001
loo+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 swe-0.002
 ju-0.002
 cuc-0.002
Tokenhe
Token ID258
Feature activation-9.85e-02

Pos logit contributions
belt+0.045
ase+0.044
chair+0.043
lf+0.039
issa+0.036
Neg logit contributions
 tast-0.063
 scr-0.061
 ang-0.059
 better-0.056
 cal-0.055
Tokenlf
Token ID1652
Feature activation-2.48e-02

Pos logit contributions
belt+0.094
chair+0.082
new+0.071
ase+0.071
overs+0.067
Neg logit contributions
 grow-0.208
 cuc-0.203
 scr-0.201
 swall-0.195
 nib-0.189
Token.
Token ID13
Feature activation-4.98e-03

Pos logit contributions
chair+0.053
ase+0.052
belt+0.051
lf+0.048
issa+0.046
Neg logit contributions
 tast-0.031
 scr-0.029
 ang-0.028
 cuc-0.026
 swe-0.024
Token It
Token ID632
Feature activation+1.20e-02

Pos logit contributions
chair+0.008
ase+0.008
belt+0.007
lf+0.007
overs+0.006
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 cuc-0.009
 ju-0.008
Token makes
Token ID1838
Feature activation-2.57e-02

Pos logit contributions
 tast+0.021
 scr+0.019
 ang+0.019
 cuc+0.019
 ju+0.018
Neg logit contributions
chair-0.022
ase-0.020
belt-0.020
lf-0.019
overs-0.017
Token a
Token ID257
Feature activation+4.59e-04

Pos logit contributions
chair+0.052
ase+0.050
belt+0.049
lf+0.046
overs+0.043
Neg logit contributions
 tast-0.039
 scr-0.036
 ang-0.036
 cuc-0.034
 Pars-0.032
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
 tast+0.017
 scr+0.015
 ang+0.015
 cuc+0.014
 Pars+0.014
Neg logit contributions
chair-0.026
ase-0.024
belt-0.024
lf-0.023
overs-0.021
Token She
Token ID1375
Feature activation+6.69e-04

Pos logit contributions
 tast+0.015
 scr+0.014
 ang+0.014
 cuc+0.013
 ju+0.013
Neg logit contributions
chair-0.015
ase-0.014
belt-0.014
lf-0.013
overs-0.012
Token had
Token ID550
Feature activation-2.36e-02

Pos logit contributions
 tast+0.001
 scr+0.001
 grow+0.001
 cuc+0.001
 rad+0.001
Neg logit contributions
belt-0.001
chair-0.001
ase-0.001
overs-0.001
 doorway-0.001
Token a
Token ID257
Feature activation-7.28e-03

Pos logit contributions
chair+0.055
ase+0.054
belt+0.054
lf+0.051
loo+0.047
Neg logit contributions
 tast-0.029
 scr-0.026
 ju-0.024
 ang-0.023
 cuc-0.022
Token books
Token ID3835
Feature activation-3.81e-02

Pos logit contributions
chair+0.013
ase+0.012
belt+0.012
lf+0.011
 corner+0.010
Neg logit contributions
 tast-0.013
 ang-0.012
 scr-0.012
 cuc-0.011
 Pars-0.011
Tokenhe
Token ID258
Feature activation-9.83e-02

Pos logit contributions
chair+0.049
belt+0.049
ase+0.048
child+0.048
lf+0.046
Neg logit contributions
 tast-0.074
 better-0.065
 scr-0.063
 ju-0.061
 grow-0.061
Tokenlf
Token ID1652
Feature activation-2.16e-02

Pos logit contributions
chair+0.095
belt+0.091
ase+0.083
 doorway+0.079
ay+0.076
Neg logit contributions
 scr-0.199
 grow-0.186
 cuc-0.186
 rad-0.183
 pr-0.181
Token in
Token ID287
Feature activation+1.70e-03

Pos logit contributions
chair+0.038
ase+0.037
belt+0.035
lf+0.034
 doorway+0.034
Neg logit contributions
 tast-0.034
 scr-0.031
 ang-0.027
 ju-0.027
 rust-0.026
Token her
Token ID607
Feature activation-8.24e-04

Pos logit contributions
 tast+0.002
 scr+0.002
 swe+0.002
 ang+0.002
 cuc+0.002
Neg logit contributions
chair-0.004
ase-0.003
belt-0.003
lf-0.003
board-0.003
Token room
Token ID2119
Feature activation-2.22e-02

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
person+0.001
lf+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 swe-0.002
 prett-0.002
 grow-0.001
Token that
Token ID326
Feature activation-2.40e-03

Pos logit contributions
chair+0.049
belt+0.045
ase+0.045
issa+0.041
lf+0.041
Neg logit contributions
 scr-0.027
 tast-0.026
 cuc-0.023
 grow-0.022
 rad-0.021
Token should
Token ID815
Feature activation-1.08e-02

Pos logit contributions
chair+0.025
ase+0.025
belt+0.025
lf+0.023
 doorway+0.022
Neg logit contributions
 tast-0.015
 scr-0.013
 ang-0.013
 ju-0.012
 cuc-0.012
Token not
Token ID407
Feature activation-1.58e-02

Pos logit contributions
chair+0.021
ase+0.020
belt+0.019
lf+0.019
overs+0.017
Neg logit contributions
 tast-0.018
 scr-0.016
 ang-0.016
 cuc-0.015
 ju-0.015
Token climb
Token ID12080
Feature activation+9.29e-03

Pos logit contributions
chair+0.030
ase+0.027
belt+0.025
lf+0.025
 corner+0.023
Neg logit contributions
 tast-0.028
 cuc-0.026
 ang-0.025
 scr-0.025
 ju-0.025
Token the
Token ID262
Feature activation-3.45e-03

Pos logit contributions
 tast+0.014
 ang+0.014
 scr+0.013
 swe+0.013
 ju+0.012
Neg logit contributions
chair-0.018
ase-0.018
belt-0.017
lf-0.016
 doorway-0.015
Token books
Token ID3835
Feature activation-3.28e-02

Pos logit contributions
ase+0.005
belt+0.005
overs+0.004
chair+0.004
ana+0.004
Neg logit contributions
 tast-0.007
 swe-0.006
 ju-0.006
 scr-0.006
 ang-0.006
Tokenhe
Token ID258
Feature activation-9.79e-02

Pos logit contributions
belt+0.048
ase+0.047
chair+0.047
lf+0.044
 doorway+0.041
Neg logit contributions
 tast-0.060
 scr-0.057
 ang-0.054
 better-0.054
 cal-0.051
Tokenlf
Token ID1652
Feature activation-2.38e-02

Pos logit contributions
belt+0.095
chair+0.083
new+0.072
shirt+0.068
ase+0.067
Neg logit contributions
 grow-0.194
 cuc-0.192
 argue-0.189
 scr-0.186
 rad-0.182
Token.
Token ID13
Feature activation-3.38e-03

Pos logit contributions
chair+0.052
ase+0.051
belt+0.050
lf+0.047
 doorway+0.047
Neg logit contributions
 tast-0.028
 scr-0.025
 ang-0.023
 swe-0.021
 better-0.021
Token It
Token ID632
Feature activation+9.69e-03

Pos logit contributions
chair+0.006
lf+0.006
belt+0.006
ase+0.005
 doorway+0.005
Neg logit contributions
 tast-0.005
 scr-0.005
 Pars-0.005
 lav-0.005
 ju-0.005
Token is
Token ID318
Feature activation+3.36e-03

Pos logit contributions
 tast+0.019
 scr+0.017
 ang+0.017
 cuc+0.017
 ju+0.016
Neg logit contributions
chair-0.016
ase-0.015
belt-0.014
lf-0.013
overs-0.012
Token dangerous
Token ID4923
Feature activation-9.04e-03

Pos logit contributions
 tast+0.005
 scr+0.004
 ang+0.004
 swe+0.004
 ju+0.004
Neg logit contributions
chair-0.007
ase-0.007
belt-0.007
lf-0.007
overs-0.006
Token.
Token ID13
Feature activation+3.16e-03

Pos logit contributions
chair+0.002
ase+0.002
 corner+0.002
belt+0.002
lf+0.002
Neg logit contributions
 tast-0.003
 cuc-0.003
 ju-0.003
 tul-0.003
 scr-0.003
Token She
Token ID1375
Feature activation+2.67e-03

Pos logit contributions
 tast+0.005
 scr+0.004
 ang+0.004
 ju+0.004
 Pars+0.004
Neg logit contributions
ase-0.006
chair-0.006
belt-0.006
lf-0.006
overs-0.005
Token moved
Token ID3888
Feature activation+2.12e-03

Pos logit contributions
 tast+0.004
 ang+0.004
 scr+0.004
 cuc+0.004
 swe+0.004
Neg logit contributions
chair-0.005
ase-0.005
belt-0.005
overs-0.005
lf-0.004
Token the
Token ID262
Feature activation-1.13e-02

Pos logit contributions
 ang+0.003
 tast+0.003
 swe+0.002
 Pars+0.002
 cal+0.002
Neg logit contributions
chair-0.005
ase-0.005
belt-0.004
lf-0.004
loo-0.004
Token books
Token ID3835
Feature activation-2.74e-02

Pos logit contributions
belt+0.019
ase+0.019
overs+0.018
chair+0.018
ana+0.016
Neg logit contributions
 tast-0.020
 ang-0.018
 swe-0.018
 scr-0.017
 y-0.017
Tokenhe
Token ID258
Feature activation-9.79e-02

Pos logit contributions
ase+0.039
chair+0.039
belt+0.037
lf+0.034
issa+0.032
Neg logit contributions
 tast-0.054
 ang-0.052
 scr-0.050
 better-0.047
 ju-0.047
Tokenlf
Token ID1652
Feature activation-2.33e-02

Pos logit contributions
belt+0.114
ase+0.094
idd+0.092
chair+0.087
overs+0.084
Neg logit contributions
 grow-0.184
 scr-0.179
 cuc-0.175
 ang-0.172
Together-0.171
Token a
Token ID257
Feature activation+7.21e-03

Pos logit contributions
chair+0.043
ase+0.041
belt+0.039
lf+0.037
overs+0.034
Neg logit contributions
 tast-0.040
 ang-0.036
 scr-0.036
 cuc-0.034
 ju-0.033
Token little
Token ID1310
Feature activation+3.49e-02

Pos logit contributions
 tast+0.011
 scr+0.009
 ang+0.009
 cuc+0.009
 ju+0.009
Neg logit contributions
chair-0.015
ase-0.015
belt-0.014
lf-0.013
overs-0.012
Token and
Token ID290
Feature activation-5.29e-03

Pos logit contributions
 tast+0.057
 scr+0.054
 cuc+0.053
 Pars+0.053
 ang+0.052
Neg logit contributions
chair-0.068
ase-0.061
belt-0.058
 corner-0.057
lf-0.054
Token got
Token ID1392
Feature activation+5.75e-03

Pos logit contributions
ase+0.012
chair+0.012
belt+0.011
overs+0.011
lf+0.010
Neg logit contributions
 tast-0.007
 ang-0.007
 scr-0.006
 ju-0.006
 cuc-0.006
Token be
Token ID307
Feature activation+1.06e-02

Pos logit contributions
 tast+0.039
 cuc+0.037
 ang+0.036
 ju+0.035
 Pars+0.035
Neg logit contributions
chair-0.039
belt-0.037
lf-0.037
ase-0.035
 corner-0.034
Token careful
Token ID8161
Feature activation-2.39e-02

Pos logit contributions
 tast+0.016
 ang+0.015
 scr+0.014
 cuc+0.013
 ju+0.013
Neg logit contributions
chair-0.022
ase-0.021
belt-0.020
lf-0.019
overs-0.018
Token.
Token ID13
Feature activation-2.36e-03

Pos logit contributions
chair+0.052
ase+0.051
belt+0.047
 doorway+0.046
lf+0.045
Neg logit contributions
 tast-0.029
 scr-0.028
 ang-0.026
 swe-0.025
 cuc-0.024
Token The
Token ID383
Feature activation+9.30e-04

Pos logit contributions
chair+0.004
ase+0.004
belt+0.004
lf+0.004
 doorway+0.003
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 Pars-0.004
 cuc-0.004
Token books
Token ID3835
Feature activation-2.99e-02

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.002
 ju+0.002
 swe+0.002
Neg logit contributions
ase-0.002
chair-0.001
belt-0.001
overs-0.001
lf-0.001
Tokenhe
Token ID258
Feature activation-9.78e-02

Pos logit contributions
chair+0.070
ase+0.070
belt+0.067
lf+0.065
 doorway+0.063
Neg logit contributions
 tast-0.032
 scr-0.028
 ang-0.027
 die-0.025
 better-0.024
Tokenlf
Token ID1652
Feature activation-2.54e-02

Pos logit contributions
chair+0.104
belt+0.096
new+0.092
person+0.091
ase+0.091
Neg logit contributions
 grow-0.183
 scr-0.180
 cuc-0.176
 scar-0.168
 argue-0.167
Token is
Token ID318
Feature activation+6.44e-03

Pos logit contributions
chair+0.047
ase+0.046
belt+0.044
lf+0.043
 doorway+0.040
Neg logit contributions
 tast-0.041
 scr-0.038
 ang-0.036
 swe-0.034
 cuc-0.034
Token tall
Token ID7331
Feature activation-2.28e-02

Pos logit contributions
 tast+0.009
 scr+0.008
 ang+0.008
 cuc+0.008
 ju+0.007
Neg logit contributions
chair-0.014
ase-0.014
belt-0.013
lf-0.012
overs-0.012
Token and
Token ID290
Feature activation-1.79e-03

Pos logit contributions
ase+0.041
chair+0.041
belt+0.039
lf+0.036
 doorway+0.035
Neg logit contributions
 tast-0.038
 scr-0.034
 cuc-0.033
 swe-0.033
 ju-0.033
Token we
Token ID356
Feature activation-7.32e-03

Pos logit contributions
ase+0.004
chair+0.004
belt+0.003
lf+0.003
overs+0.003
Neg logit contributions
 tast-0.003
 scr-0.003
 ang-0.002
 ju-0.002
 swe-0.002
Tokenmy
Token ID1820
Feature activation+1.17e-02

Pos logit contributions
chair+0.019
ase+0.018
belt+0.017
lf+0.016
overs+0.015
Neg logit contributions
 tast-0.016
 scr-0.015
 ang-0.015
 cuc-0.014
 ju-0.013
Token never
Token ID1239
Feature activation-1.30e-03

Pos logit contributions
 tast+0.020
 ang+0.018
 scr+0.018
 cuc+0.018
 ju+0.017
Neg logit contributions
chair-0.022
ase-0.020
belt-0.019
lf-0.019
 corner-0.017
Token saw
Token ID2497
Feature activation-6.04e-03

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.001
 corner+0.001
Neg logit contributions
 tast-0.003
 scr-0.003
 cuc-0.003
 Pars-0.003
 ang-0.003
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
chair+0.011
ase+0.011
belt+0.010
lf+0.010
overs+0.009
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 cuc-0.009
 ju-0.009
Token books
Token ID3835
Feature activation-3.40e-02

Pos logit contributions
ase+0.032
belt+0.029
chair+0.029
overs+0.028
loo+0.026
Neg logit contributions
 tast-0.028
 scr-0.024
 ang-0.024
 ju-0.023
 y-0.023
Tokenhe
Token ID258
Feature activation-9.75e-02

Pos logit contributions
ase+0.044
chair+0.043
belt+0.040
lf+0.035
child+0.034
Neg logit contributions
 tast-0.073
 scr-0.070
 ang-0.067
 better-0.065
 ju-0.064
Tokenlf
Token ID1652
Feature activation-2.57e-02

Pos logit contributions
belt+0.105
ase+0.103
chair+0.100
eddy+0.096
overs+0.096
Neg logit contributions
 scr-0.178
 grow-0.177
 argue-0.164
 ang-0.163
 rum-0.162
Token again
Token ID757
Feature activation+6.88e-04

Pos logit contributions
chair+0.054
ase+0.053
belt+0.050
lf+0.047
overs+0.045
Neg logit contributions
 tast-0.035
 scr-0.033
 ang-0.031
 cuc-0.028
 ju-0.028
Token.
Token ID13
Feature activation+2.57e-03

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 cuc+0.001
 ju+0.001
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Token But
Token ID887
Feature activation-2.20e-02

Pos logit contributions
 tast+0.004
 ang+0.004
 scr+0.004
 cuc+0.004
 ju+0.003
Neg logit contributions
chair-0.005
ase-0.005
belt-0.005
lf-0.004
overs-0.004
Token they
Token ID484
Feature activation-9.05e-04

Pos logit contributions
chair+0.044
ase+0.042
belt+0.040
lf+0.038
overs+0.034
Neg logit contributions
 tast-0.036
 scr-0.033
 ang-0.032
 cuc-0.031
 ju-0.030
Token He
Token ID679
Feature activation+2.82e-03

Pos logit contributions
chair+0.008
ase+0.008
belt+0.007
lf+0.007
overs+0.006
Neg logit contributions
 tast-0.007
 scr-0.007
 ang-0.007
 cuc-0.007
 Pars-0.006
Token noticed
Token ID6810
Feature activation-1.98e-02

Pos logit contributions
 tast+0.004
 scr+0.004
 ang+0.004
 cuc+0.004
 ju+0.004
Neg logit contributions
ase-0.006
chair-0.006
belt-0.005
overs-0.005
lf-0.005
Token that
Token ID326
Feature activation-1.13e-02

Pos logit contributions
chair+0.039
ase+0.038
belt+0.036
lf+0.034
overs+0.032
Neg logit contributions
 tast-0.031
 scr-0.028
 ang-0.028
 cuc-0.027
 ju-0.026
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
ase+0.022
chair+0.021
belt+0.020
lf+0.019
overs+0.018
Neg logit contributions
 tast-0.019
 ang-0.017
 scr-0.017
 ju-0.016
 cuc-0.016
Token books
Token ID3835
Feature activation-3.74e-02

Pos logit contributions
ase+0.037
chair+0.035
belt+0.034
overs+0.032
lf+0.031
Neg logit contributions
 tast-0.045
 scr-0.040
 ju-0.039
 ang-0.039
 funn-0.038
Tokenhe
Token ID258
Feature activation-9.74e-02

Pos logit contributions
ase+0.062
chair+0.060
belt+0.058
overs+0.056
child+0.055
Neg logit contributions
 tast-0.062
 scr-0.061
 rust-0.057
 ang-0.055
 grow-0.054
Tokenlf
Token ID1652
Feature activation-2.93e-02

Pos logit contributions
ase+0.100
chair+0.098
overs+0.095
belt+0.089
eddy+0.088
Neg logit contributions
 scr-0.183
 grow-0.180
 cuc-0.173
 rust-0.166
 rum-0.163
Token was
Token ID373
Feature activation+3.54e-03

Pos logit contributions
chair+0.052
ase+0.051
belt+0.046
overs+0.045
lf+0.043
Neg logit contributions
 tast-0.048
 scr-0.046
 ang-0.043
 rust-0.042
 cuc-0.040
Token very
Token ID845
Feature activation-3.01e-03

Pos logit contributions
 tast+0.005
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.007
ase-0.007
belt-0.007
lf-0.006
overs-0.006
Token smooth
Token ID7209
Feature activation-8.21e-03

Pos logit contributions
chair+0.006
ase+0.006
belt+0.006
lf+0.005
overs+0.005
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 cuc-0.004
 Pars-0.004
Token and
Token ID290
Feature activation+9.51e-03

Pos logit contributions
chair+0.015
ase+0.015
belt+0.014
lf+0.013
overs+0.012
Neg logit contributions
 tast-0.014
 scr-0.013
 ang-0.013
 cuc-0.012
 ju-0.012
Token book
Token ID1492
Feature activation-1.32e-02

Pos logit contributions
ase+0.009
belt+0.009
chair+0.009
overs+0.008
ana+0.008
Neg logit contributions
 tast-0.007
 ju-0.006
 scr-0.006
 y-0.006
 ang-0.006
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
ase+0.027
chair+0.026
belt+0.026
overs+0.024
lf+0.024
Neg logit contributions
 tast-0.019
 scr-0.016
 ang-0.015
 ju-0.014
 better-0.014
Token The
Token ID383
Feature activation+2.05e-03

Pos logit contributions
belt+0.007
chair+0.007
lf+0.007
ase+0.007
web+0.006
Neg logit contributions
 tast-0.004
 Instead-0.004
 scr-0.004
 So-0.004
Together-0.004
Token long
Token ID890
Feature activation-2.50e-02

Pos logit contributions
 tast+0.003
 scr+0.003
 y+0.002
 ang+0.002
 swe+0.002
Neg logit contributions
belt-0.004
ase-0.004
chair-0.004
overs-0.004
ana-0.004
Token books
Token ID3835
Feature activation-3.29e-02

Pos logit contributions
overs+0.042
ase+0.041
belt+0.041
we+0.038
person+0.037
Neg logit contributions
 scar-0.034
 scr-0.033
 rum-0.032
 grow-0.032
 y-0.031
Tokenhe
Token ID258
Feature activation-9.71e-02

Pos logit contributions
belt+0.050
chair+0.046
overs+0.046
child+0.046
ase+0.046
Neg logit contributions
 tast-0.058
 scr-0.055
 rust-0.054
 grow-0.054
 better-0.053
Tokenlf
Token ID1652
Feature activation-2.93e-02

Pos logit contributions
belt+0.130
overs+0.125
chair+0.118
found+0.114
eddy+0.113
Neg logit contributions
 grow-0.150
 scr-0.141
 spit-0.133
 cuc-0.132
 quarrel-0.131
Token was
Token ID373
Feature activation-3.27e-03

Pos logit contributions
chair+0.057
belt+0.056
ase+0.055
overs+0.053
 doorway+0.051
Neg logit contributions
 tast-0.040
 scr-0.039
 ang-0.035
 rust-0.033
 grow-0.033
Token still
Token ID991
Feature activation-1.51e-02

Pos logit contributions
chair+0.007
ase+0.007
belt+0.006
lf+0.006
overs+0.006
Neg logit contributions
 tast-0.005
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.004
Token there
Token ID612
Feature activation+4.83e-03

Pos logit contributions
chair+0.027
ase+0.026
belt+0.025
lf+0.025
 corner+0.022
Neg logit contributions
 tast-0.026
 scr-0.024
 ang-0.023
 cuc-0.023
 Pars-0.023
Token,
Token ID11
Feature activation-7.20e-03

Pos logit contributions
 tast+0.007
 scr+0.006
 ang+0.006
 cuc+0.006
 ju+0.006
Neg logit contributions
chair-0.010
ase-0.010
belt-0.009
lf-0.009
overs-0.008
TokenCan
Token ID6090
Feature activation+2.01e-02

Pos logit contributions
 tast+0.034
 scr+0.031
 cuc+0.030
 ju+0.030
 ang+0.030
Neg logit contributions
chair-0.046
ase-0.043
belt-0.041
lf-0.039
overs-0.036
Token I
Token ID314
Feature activation+1.11e-02

Pos logit contributions
 tast+0.033
 ju+0.031
 scr+0.031
 rad+0.030
 Pars+0.030
Neg logit contributions
ase-0.037
chair-0.034
 doorway-0.032
belt-0.031
overs-0.031
Token touch
Token ID3638
Feature activation-1.23e-02

Pos logit contributions
 tast+0.014
 scr+0.013
 ang+0.011
 cuc+0.011
 Pars+0.011
Neg logit contributions
chair-0.025
ase-0.025
belt-0.024
lf-0.023
overs-0.022
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
chair+0.022
ase+0.021
belt+0.020
lf+0.019
 doorway+0.017
Neg logit contributions
 tast-0.022
 scr-0.020
 ang-0.020
 cuc-0.019
 ju-0.019
Token books
Token ID3835
Feature activation-2.91e-02

Pos logit contributions
ase+0.027
belt+0.025
chair+0.025
overs+0.022
lf+0.022
Neg logit contributions
 tast-0.030
 scr-0.027
 ju-0.026
 cuc-0.025
 swe-0.025
Tokenhe
Token ID258
Feature activation-9.68e-02

Pos logit contributions
chair+0.042
ase+0.042
belt+0.038
lf+0.037
 doorway+0.035
Neg logit contributions
 tast-0.058
 scr-0.054
 ang-0.052
 ju-0.051
 better-0.049
Tokenlf
Token ID1652
Feature activation-2.21e-02

Pos logit contributions
belt+0.091
ase+0.091
chair+0.088
overs+0.086
haired+0.078
Neg logit contributions
 grow-0.192
 cuc-0.188
 rad-0.185
 scr-0.184
 y-0.177
Token too
Token ID1165
Feature activation+1.43e-03

Pos logit contributions
chair+0.044
ase+0.043
belt+0.039
lf+0.038
 doorway+0.038
Neg logit contributions
 tast-0.033
 scr-0.030
 ang-0.029
 ju-0.028
 cuc-0.027
Token?"
Token ID1701
Feature activation+4.58e-04

Pos logit contributions
 tast+0.003
 scr+0.002
 ang+0.002
 cuc+0.002
 Pars+0.002
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
 corner-0.002
Token His
Token ID2399
Feature activation+2.57e-02

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 cuc+0.001
 Pars+0.001
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Token mom
Token ID1995
Feature activation-1.71e-02

Pos logit contributions
 tast+0.052
 scr+0.049
 ang+0.049
 cuc+0.046
 swe+0.046
Neg logit contributions
chair-0.040
ase-0.039
belt-0.035
lf-0.033
overs-0.031
Token Lily
Token ID20037
Feature activation-5.07e-03

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
overs+0.002
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.002
 ju-0.002
Token has
Token ID468
Feature activation-1.47e-02

Pos logit contributions
ase+0.010
chair+0.010
belt+0.010
lf+0.009
overs+0.009
Neg logit contributions
 tast-0.007
 scr-0.007
 ang-0.007
 swe-0.006
 cuc-0.006
Token a
Token ID257
Feature activation-2.51e-03

Pos logit contributions
ase+0.038
chair+0.037
belt+0.036
lf+0.036
loo+0.035
Neg logit contributions
 tast-0.015
 scr-0.013
 swe-0.012
 ju-0.012
 funn-0.011
Token big
Token ID1263
Feature activation-8.77e-03

Pos logit contributions
chair+0.005
ase+0.005
belt+0.004
lf+0.004
 corner+0.004
Neg logit contributions
 tast-0.004
 ang-0.004
 scr-0.004
 cuc-0.004
 Pars-0.004
Token books
Token ID3835
Feature activation-3.12e-02

Pos logit contributions
chair+0.015
ase+0.015
belt+0.014
lf+0.013
overs+0.012
Neg logit contributions
 tast-0.016
 scr-0.015
 ang-0.014
 cuc-0.014
 rad-0.014
Tokenhe
Token ID258
Feature activation-9.68e-02

Pos logit contributions
ase+0.043
chair+0.042
belt+0.041
child+0.040
lf+0.040
Neg logit contributions
 tast-0.056
 better-0.051
 scr-0.050
 grow-0.047
Together-0.046
Tokenlf
Token ID1652
Feature activation-1.72e-02

Pos logit contributions
chair+0.089
ase+0.078
belt+0.075
person+0.062
eddy+0.062
Neg logit contributions
 scr-0.218
 cuc-0.209
 rad-0.206
 grow-0.206
 pr-0.194
Token with
Token ID351
Feature activation-9.49e-03

Pos logit contributions
chair+0.034
ase+0.033
belt+0.031
lf+0.030
 doorway+0.029
Neg logit contributions
 tast-0.023
 scr-0.022
 ang-0.019
 cuc-0.019
 ju-0.018
Token many
Token ID867
Feature activation-9.30e-04

Pos logit contributions
chair+0.019
ase+0.019
belt+0.018
lf+0.017
overs+0.016
Neg logit contributions
 tast-0.014
 scr-0.013
 cuc-0.012
 swe-0.012
 ju-0.012
Token books
Token ID3835
Feature activation-2.55e-02

Pos logit contributions
ase+0.002
belt+0.002
chair+0.001
overs+0.001
loo+0.001
Neg logit contributions
 tast-0.002
 scr-0.002
 swe-0.002
 cuc-0.001
 ju-0.001
Token.
Token ID13
Feature activation-5.37e-03

Pos logit contributions
ase+0.058
chair+0.056
belt+0.056
child+0.053
 doorway+0.053
Neg logit contributions
 tast-0.023
 scr-0.021
 grow-0.019
 better-0.018
 rust-0.016
Token she
Token ID673
Feature activation-1.65e-02

Pos logit contributions
ase+0.014
chair+0.014
belt+0.013
lf+0.012
loo+0.011
Neg logit contributions
 tast-0.013
 ang-0.012
 scr-0.012
 ju-0.011
 cuc-0.011
Token noticed
Token ID6810
Feature activation-1.39e-02

Pos logit contributions
chair+0.033
ase+0.032
overs+0.030
belt+0.030
loo+0.028
Neg logit contributions
 tast-0.026
 scr-0.025
 ang-0.023
 cuc-0.022
 ju-0.022
Token that
Token ID326
Feature activation-7.19e-03

Pos logit contributions
chair+0.028
ase+0.027
belt+0.026
lf+0.024
overs+0.023
Neg logit contributions
 tast-0.022
 scr-0.020
 ang-0.020
 cuc-0.019
 ju-0.018
Token the
Token ID262
Feature activation-2.14e-02

Pos logit contributions
ase+0.015
chair+0.014
belt+0.014
lf+0.013
overs+0.012
Neg logit contributions
 tast-0.011
 ang-0.010
 scr-0.010
 ju-0.010
 cuc-0.009
Token books
Token ID3835
Feature activation-3.48e-02

Pos logit contributions
ase+0.037
chair+0.034
belt+0.033
overs+0.033
ana+0.029
Neg logit contributions
 tast-0.040
 scr-0.034
 ju-0.034
 swe-0.032
 ang-0.032
Tokenhe
Token ID258
Feature activation-9.66e-02

Pos logit contributions
ase+0.066
chair+0.064
belt+0.063
child+0.063
ana+0.062
Neg logit contributions
 tast-0.046
 scr-0.045
 rust-0.044
 grow-0.042
 ang-0.040
Tokenlf
Token ID1652
Feature activation-2.88e-02

Pos logit contributions
idd+0.108
eddy+0.107
overs+0.107
belt+0.101
ase+0.101
Neg logit contributions
 grow-0.173
 scr-0.168
 cuc-0.152
 rust-0.151
 spit-0.149
Token was
Token ID373
Feature activation+5.37e-03

Pos logit contributions
chair+0.051
ase+0.051
belt+0.047
overs+0.046
lf+0.044
Neg logit contributions
 tast-0.044
 scr-0.043
 ang-0.039
 rust-0.039
 grow-0.037
Token very
Token ID845
Feature activation-9.86e-04

Pos logit contributions
 tast+0.008
 scr+0.008
 ang+0.007
 cuc+0.007
 ju+0.007
Neg logit contributions
chair-0.011
ase-0.011
belt-0.010
lf-0.010
overs-0.009
Token dangerous
Token ID4923
Feature activation-5.16e-04

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
 corner+0.002
Neg logit contributions
 tast-0.002
 ang-0.001
 scr-0.001
 cuc-0.001
 Pars-0.001
Token.
Token ID13
Feature activation-6.33e-03

Pos logit contributions
ase+0.001
chair+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 ang-0.001
 scr-0.001
 swe-0.001
 Pars-0.001
Tokenhe
Token ID258
Feature activation-8.84e-02

Pos logit contributions
belt+0.059
chair+0.056
ase+0.056
person+0.054
child+0.052
Neg logit contributions
 grow-0.035
 scr-0.034
 tast-0.033
 better-0.030
 spr-0.029
Tokenlf
Token ID1652
Feature activation-1.60e-02

Pos logit contributions
chair+0.083
belt+0.075
ase+0.070
person+0.063
eddy+0.063
Neg logit contributions
 scr-0.199
 grow-0.194
 cuc-0.189
 rad-0.179
 spr-0.176
Token.
Token ID13
Feature activation-4.79e-04

Pos logit contributions
chair+0.034
ase+0.032
belt+0.031
 doorway+0.029
lf+0.029
Neg logit contributions
 tast-0.019
 scr-0.018
 cuc-0.017
 ang-0.016
 rust-0.015
Token The
Token ID383
Feature activation+1.21e-03

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 ju-0.001
Token books
Token ID3835
Feature activation-3.18e-02

Pos logit contributions
 tast+0.002
 scr+0.002
 cuc+0.002
 swe+0.002
 ang+0.002
Neg logit contributions
ase-0.002
chair-0.002
belt-0.002
overs-0.002
lf-0.002
Tokenhe
Token ID258
Feature activation-9.65e-02

Pos logit contributions
belt+0.062
ase+0.060
chair+0.058
person+0.057
child+0.056
Neg logit contributions
 tast-0.042
 grow-0.041
 better-0.038
 scr-0.037
 rust-0.036
Tokenlf
Token ID1652
Feature activation-2.51e-02

Pos logit contributions
belt+0.095
chair+0.094
person+0.087
overs+0.084
idd+0.081
Neg logit contributions
 grow-0.198
 scr-0.194
 cuc-0.186
 spit-0.172
 pr-0.172
Token is
Token ID318
Feature activation+3.57e-03

Pos logit contributions
chair+0.046
belt+0.045
ase+0.045
person+0.041
overs+0.041
Neg logit contributions
 tast-0.037
 scr-0.035
 grow-0.032
 swe-0.032
 ang-0.031
Token high
Token ID1029
Feature activation-2.37e-02

Pos logit contributions
 tast+0.005
 scr+0.004
 swe+0.004
 ju+0.004
 ang+0.004
Neg logit contributions
ase-0.008
chair-0.008
belt-0.007
lf-0.007
overs-0.007
Token,
Token ID11
Feature activation-6.05e-03

Pos logit contributions
chair+0.037
ase+0.034
belt+0.033
lf+0.031
 corner+0.027
Neg logit contributions
 tast-0.048
 ang-0.045
 scr-0.045
 cuc-0.043
 Pars-0.042
Token so
Token ID523
Feature activation-8.02e-03

Pos logit contributions
ase+0.013
chair+0.013
belt+0.012
lf+0.012
issa+0.011
Neg logit contributions
 tast-0.009
 scr-0.008
 ju-0.008
 swe-0.008
 ang-0.007
Token his
Token ID465
Feature activation+1.31e-02

Pos logit contributions
 tast+0.013
 scr+0.012
 ang+0.012
 swe+0.011
 cuc+0.011
Neg logit contributions
chair-0.021
ase-0.021
belt-0.019
board-0.017
lf-0.017
Token room
Token ID2119
Feature activation-1.56e-02

Pos logit contributions
 tast+0.029
 scr+0.028
 ang+0.026
 swe+0.026
 cuc+0.025
Neg logit contributions
chair-0.017
ase-0.016
belt-0.014
lf-0.014
overs-0.012
Token.
Token ID13
Feature activation+7.43e-04

Pos logit contributions
chair+0.035
ase+0.033
belt+0.032
lf+0.031
issa+0.029
Neg logit contributions
 tast-0.019
 scr-0.018
 cuc-0.016
 ang-0.016
 ju-0.015
Token The
Token ID383
Feature activation+2.80e-03

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 cuc+0.001
 ju+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
overs-0.002
Token books
Token ID3835
Feature activation-3.16e-02

Pos logit contributions
 tast+0.006
 scr+0.005
 ang+0.005
 ju+0.005
 swe+0.005
Neg logit contributions
ase-0.004
chair-0.004
belt-0.004
overs-0.004
lf-0.004
Tokenhe
Token ID258
Feature activation-9.63e-02

Pos logit contributions
overs+0.040
ana+0.039
child+0.038
belt+0.037
ase+0.036
Neg logit contributions
 tast-0.061
 better-0.055
 rust-0.054
 grow-0.054
 rum-0.053
Tokenlf
Token ID1652
Feature activation-2.71e-02

Pos logit contributions
overs+0.139
itty+0.121
belt+0.118
chair+0.118
found+0.117
Neg logit contributions
 grow-0.137
 cuc-0.129
 scr-0.120
 sp-0.118
 y-0.118
Token was
Token ID373
Feature activation-6.26e-03

Pos logit contributions
overs+0.048
ase+0.047
chair+0.046
 doorway+0.046
belt+0.045
Neg logit contributions
 tast-0.040
 scr-0.036
 ang-0.033
 rust-0.032
 ju-0.031
Token very
Token ID845
Feature activation-7.26e-03

Pos logit contributions
chair+0.015
ase+0.015
belt+0.014
lf+0.013
overs+0.013
Neg logit contributions
 tast-0.008
 scr-0.006
 ang-0.006
 ju-0.006
 swe-0.006
Token popular
Token ID2968
Feature activation-6.26e-03

Pos logit contributions
chair+0.016
ase+0.016
belt+0.015
lf+0.014
overs+0.013
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 ju-0.008
 cuc-0.008
Token because
Token ID780
Feature activation-2.52e-02

Pos logit contributions
ase+0.013
chair+0.013
belt+0.012
 doorway+0.012
lf+0.012
Neg logit contributions
 tast-0.008
 scr-0.007
 cuc-0.007
 ju-0.006
 Pars-0.006
Token smell
Token ID8508
Feature activation+3.73e-03

Pos logit contributions
 tast+0.010
 scr+0.009
 ang+0.009
 cuc+0.009
 ju+0.008
Neg logit contributions
chair-0.017
ase-0.017
belt-0.016
lf-0.015
overs-0.014
Token and
Token ID290
Feature activation-1.36e-03

Pos logit contributions
 tast+0.006
 scr+0.006
 cuc+0.005
 rad+0.005
 ang+0.005
Neg logit contributions
belt-0.007
ase-0.007
chair-0.007
lf-0.006
issa-0.006
Token sour
Token ID11348
Feature activation+2.17e-02

Pos logit contributions
chair+0.003
ase+0.002
belt+0.002
lf+0.002
overs+0.002
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.002
 cuc-0.002
 ju-0.002
Token fruits
Token ID15921
Feature activation+1.57e-02

Pos logit contributions
 tast+0.031
 ang+0.028
 Pars+0.027
 scr+0.026
 cuc+0.026
Neg logit contributions
chair-0.046
ase-0.043
belt-0.040
lf-0.040
overs-0.038
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 tast+0.028
 scr+0.025
 ang+0.025
 cuc+0.024
 ju+0.024
Neg logit contributions
chair-0.028
ase-0.027
belt-0.025
lf-0.024
overs-0.022
Token vegetables
Token ID13701
Feature activation+3.95e-02

Pos logit contributions
 tast+0.022
 ang+0.020
 scr+0.020
 cuc+0.019
 Pars+0.018
Neg logit contributions
chair-0.033
ase-0.031
belt-0.030
lf-0.029
overs-0.027
Token.
Token ID13
Feature activation+8.11e-03

Pos logit contributions
 tast+0.067
 ang+0.062
 scr+0.061
 cuc+0.059
 Pars+0.059
Neg logit contributions
chair-0.072
ase-0.068
belt-0.065
lf-0.063
overs-0.057
Token The
Token ID383
Feature activation+5.68e-03

Pos logit contributions
 tast+0.016
 scr+0.015
 ang+0.015
 cuc+0.014
 ju+0.014
Neg logit contributions
chair-0.013
ase-0.012
belt-0.012
lf-0.011
overs-0.010
Token cater
Token ID20825
Feature activation+3.41e-03

Pos logit contributions
 tast+0.009
 scr+0.008
 ang+0.008
 cuc+0.007
 Pars+0.007
Neg logit contributions
chair-0.012
ase-0.011
belt-0.011
lf-0.010
 corner-0.010
Tokenpillar
Token ID41643
Feature activation-7.60e-04

Pos logit contributions
 tast+0.006
 cuc+0.005
 scr+0.005
 ju+0.005
 Pars+0.005
Neg logit contributions
belt-0.006
chair-0.006
lf-0.005
ase-0.005
 doorway-0.005
Token did
Token ID750
Feature activation+1.35e-02

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 ju-0.001
Token\n
Token ID198
Feature activation+5.39e-03

Pos logit contributions
 tast+0.002
 scr+0.001
 ang+0.001
So+0.001
 Pars+0.001
Neg logit contributions
chair-0.003
belt-0.003
ase-0.003
lf-0.002
overs-0.002
TokenThe
Token ID464
Feature activation+9.30e-03

Pos logit contributions
 tast+0.009
 scr+0.008
 ang+0.008
 cuc+0.008
 ju+0.007
Neg logit contributions
chair-0.011
ase-0.010
belt-0.010
lf-0.009
overs-0.008
Token mum
Token ID25682
Feature activation-1.11e-02

Pos logit contributions
 tast+0.018
 scr+0.016
 ang+0.016
 cuc+0.016
 Pars+0.016
Neg logit contributions
chair-0.015
ase-0.014
belt-0.014
lf-0.013
overs-0.012
Token bought
Token ID5839
Feature activation-1.60e-02

Pos logit contributions
chair+0.023
belt+0.023
ase+0.022
overs+0.020
lf+0.020
Neg logit contributions
 tast-0.016
 scr-0.014
 cuc-0.014
 ang-0.014
 rad-0.013
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
chair+0.028
ase+0.026
belt+0.024
lf+0.024
overs+0.023
Neg logit contributions
 tast-0.028
 scr-0.026
 ang-0.026
 cuc-0.025
 Pars-0.024
Token pie
Token ID2508
Feature activation+3.66e-02

Pos logit contributions
chair+0.026
ase+0.025
belt+0.023
lf+0.023
 corner+0.022
Neg logit contributions
 tast-0.016
 scr-0.014
 ang-0.014
 Pars-0.014
 cuc-0.013
Token and
Token ID290
Feature activation-4.87e-03

Pos logit contributions
 tast+0.068
 scr+0.063
 ang+0.062
 cuc+0.061
 ju+0.059
Neg logit contributions
chair-0.063
ase-0.059
belt-0.056
lf-0.053
overs-0.048
Token they
Token ID484
Feature activation-6.98e-03

Pos logit contributions
chair+0.010
ase+0.010
belt+0.009
lf+0.009
overs+0.008
Neg logit contributions
 tast-0.008
 scr-0.007
 ang-0.007
 cuc-0.007
 ju-0.006
Token went
Token ID1816
Feature activation+1.24e-02

Pos logit contributions
chair+0.013
ase+0.012
belt+0.012
lf+0.011
overs+0.010
Neg logit contributions
 tast-0.012
 ang-0.011
 scr-0.011
 cuc-0.010
 Pars-0.010
Token home
Token ID1363
Feature activation-3.97e-02

Pos logit contributions
 tast+0.018
 ang+0.017
 swe+0.016
 scr+0.016
 rad+0.015
Neg logit contributions
chair-0.025
ase-0.024
lf-0.023
belt-0.023
issa-0.020
Token.
Token ID13
Feature activation+3.15e-03

Pos logit contributions
chair+0.093
ase+0.091
belt+0.085
lf+0.082
issa+0.076
Neg logit contributions
 tast-0.048
 scr-0.044
 ang-0.044
 cuc-0.038
 swe-0.037
Token her
Token ID607
Feature activation+7.51e-04

Pos logit contributions
 tast+0.008
 scr+0.007
 rad+0.007
 ang+0.007
 cuc+0.007
Neg logit contributions
ase-0.012
chair-0.012
belt-0.011
lf-0.010
loo-0.010
Token new
Token ID649
Feature activation+8.60e-03

Pos logit contributions
 tast+0.001
 scr+0.001
 swe+0.001
 ang+0.001
 cuc+0.001
Neg logit contributions
chair-0.001
ase-0.001
belt-0.001
lf-0.001
overs-0.001
Token bottle
Token ID9294
Feature activation+2.74e-02

Pos logit contributions
 tast+0.013
 scr+0.012
 ang+0.011
 swe+0.011
 cuc+0.011
Neg logit contributions
chair-0.018
ase-0.017
belt-0.017
lf-0.016
issa-0.014
Token and
Token ID290
Feature activation+4.86e-03

Pos logit contributions
 tast+0.048
 scr+0.044
 ang+0.042
 cuc+0.041
 ju+0.041
Neg logit contributions
chair-0.049
ase-0.046
belt-0.046
lf-0.043
issa-0.039
Token it
Token ID340
Feature activation+5.38e-03

Pos logit contributions
 tast+0.008
 scr+0.007
 ang+0.007
 cuc+0.007
 swe+0.006
Neg logit contributions
chair-0.010
ase-0.010
belt-0.009
lf-0.009
overs-0.008
Token tasted
Token ID29187
Feature activation+5.71e-02

Pos logit contributions
 tast+0.011
 ang+0.010
 cuc+0.010
 Pars+0.010
 scr+0.010
Neg logit contributions
chair-0.008
ase-0.007
lf-0.007
belt-0.006
 corner-0.006
Token normal
Token ID3487
Feature activation+1.79e-02

Pos logit contributions
 argue+0.086
 ang+0.085
 Pars+0.083
 cuc+0.077
itionally+0.076
Neg logit contributions
chair-0.097
ase-0.085
 of-0.083
 corner-0.083
overs-0.081
Token,
Token ID11
Feature activation+2.42e-03

Pos logit contributions
 tast+0.030
 ang+0.029
 scr+0.028
 Pars+0.027
 cuc+0.027
Neg logit contributions
chair-0.033
ase-0.031
belt-0.029
lf-0.027
overs-0.026
Token just
Token ID655
Feature activation-2.26e-02

Pos logit contributions
 tast+0.003
 scr+0.003
 ju+0.003
 ang+0.003
 cuc+0.003
Neg logit contributions
ase-0.006
chair-0.005
belt-0.005
lf-0.005
loo-0.005
Token like
Token ID588
Feature activation-2.38e-03

Pos logit contributions
chair+0.046
ase+0.043
belt+0.042
lf+0.040
 corner+0.038
Neg logit contributions
 tast-0.032
 ang-0.030
 scr-0.029
 Pars-0.028
 cuc-0.028
Token her
Token ID607
Feature activation-1.22e-02

Pos logit contributions
chair+0.004
ase+0.004
belt+0.004
lf+0.004
overs+0.003
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.004
Token,
Token ID11
Feature activation-3.40e-03

Pos logit contributions
 tast+0.014
 scr+0.013
 ang+0.013
 Pars+0.013
 ju+0.013
Neg logit contributions
chair-0.012
ase-0.011
lf-0.011
belt-0.010
overs-0.009
Token it
Token ID340
Feature activation-3.88e-03

Pos logit contributions
chair+0.006
ase+0.006
belt+0.005
lf+0.005
overs+0.005
Neg logit contributions
 tast-0.006
 scr-0.006
 ang-0.006
 cuc-0.006
 ju-0.005
Token does
Token ID857
Feature activation+5.23e-03

Pos logit contributions
chair+0.004
ase+0.004
board+0.003
on+0.003
lf+0.003
Neg logit contributions
 tast-0.009
 Pars-0.009
 Gig-0.008
 cuc-0.008
abb-0.008
Token.
Token ID13
Feature activation-5.84e-03

Pos logit contributions
 tast+0.011
 ang+0.010
 Pars+0.010
 ju+0.010
 cuc+0.010
Neg logit contributions
chair-0.006
lf-0.006
ase-0.006
belt-0.005
overs-0.005
Token It
Token ID632
Feature activation+1.32e-02

Pos logit contributions
chair+0.009
ase+0.008
belt+0.008
lf+0.008
 doorway+0.007
Neg logit contributions
 tast-0.012
 scr-0.011
 ang-0.011
 cuc-0.011
 Pars-0.011
Token tastes
Token ID18221
Feature activation+4.00e-02

Pos logit contributions
 tast+0.029
 Pars+0.028
 ang+0.028
 cuc+0.028
abb+0.027
Neg logit contributions
chair-0.015
ase-0.014
board-0.010
lf-0.010
issa-0.010
Token like
Token ID588
Feature activation+2.68e-03

Pos logit contributions
 Pars+0.060
Pear+0.057
 ang+0.056
 argue+0.056
arse+0.055
Neg logit contributions
chair-0.059
 of-0.055
overs-0.050
lf-0.049
 corner-0.049
Token dirt
Token ID13647
Feature activation+2.23e-03

Pos logit contributions
 tast+0.004
 ang+0.004
 scr+0.004
 Pars+0.004
 cuc+0.004
Neg logit contributions
chair-0.005
ase-0.005
belt-0.004
lf-0.004
overs-0.004
Token and
Token ID290
Feature activation+4.94e-03

Pos logit contributions
 tast+0.004
 ang+0.004
 scr+0.004
 Pars+0.003
 cuc+0.003
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
lf-0.003
overs-0.003
Token grass
Token ID8701
Feature activation+7.60e-03

Pos logit contributions
 tast+0.008
 ang+0.007
 scr+0.007
 Pars+0.007
 cuc+0.006
Neg logit contributions
chair-0.010
ase-0.009
belt-0.009
lf-0.009
 corner-0.008
Token,"
Token ID553
Feature activation+9.31e-03

Pos logit contributions
 tast+0.014
 ang+0.013
 scr+0.013
 Pars+0.013
 cuc+0.012
Neg logit contributions
chair-0.013
ase-0.012
belt-0.011
lf-0.010
overs-0.010
Token curious
Token ID11040
Feature activation+1.06e-02

Pos logit contributions
ase+0.040
chair+0.039
belt+0.036
lf+0.035
board+0.033
Neg logit contributions
 tast-0.022
 ang-0.020
 scr-0.020
 cal-0.017
 swe-0.017
Token,
Token ID11
Feature activation-2.27e-03

Pos logit contributions
 tast+0.017
 scr+0.016
 ang+0.016
 swe+0.015
 cuc+0.015
Neg logit contributions
ase-0.020
chair-0.019
belt-0.019
lf-0.018
overs-0.016
Token and
Token ID290
Feature activation-8.40e-03

Pos logit contributions
ase+0.006
chair+0.005
belt+0.005
lf+0.005
loo+0.005
Neg logit contributions
 tast-0.003
 scr-0.002
 ang-0.002
 ju-0.002
 cuc-0.002
Token he
Token ID339
Feature activation-4.50e-03

Pos logit contributions
ase+0.019
chair+0.016
overs+0.015
loo+0.015
belt+0.015
Neg logit contributions
 tast-0.013
 ang-0.012
 scr-0.012
 ju-0.011
 swe-0.011
Token asked
Token ID1965
Feature activation-7.66e-03

Pos logit contributions
ase+0.010
chair+0.010
belt+0.009
overs+0.009
lf+0.008
Neg logit contributions
 tast-0.006
 scr-0.006
 ang-0.006
 cal-0.005
 swe-0.005
Token why
Token ID1521
Feature activation+4.49e-02

Pos logit contributions
chair+0.017
ase+0.017
belt+0.015
lf+0.014
 doorway+0.014
Neg logit contributions
 tast-0.010
 ang-0.010
 scr-0.010
 cuc-0.009
 swe-0.009
Token the
Token ID262
Feature activation-7.79e-03

Pos logit contributions
 scr+0.058
 tast+0.056
 ang+0.055
 rust+0.051
 cal+0.051
Neg logit contributions
ase-0.097
chair-0.090
 doorway-0.082
belt-0.081
person-0.076
Token gate
Token ID8946
Feature activation-2.39e-02

Pos logit contributions
ase+0.017
chair+0.014
overs+0.014
belt+0.014
eddy+0.013
Neg logit contributions
 tast-0.011
 prett-0.010
 scr-0.010
 swe-0.010
 bra-0.010
Token was
Token ID373
Feature activation+3.28e-02

Pos logit contributions
ase+0.044
chair+0.043
belt+0.039
lf+0.038
overs+0.038
Neg logit contributions
 tast-0.040
 scr-0.037
 ang-0.037
 swe-0.034
 rust-0.034
Token locked
Token ID8970
Feature activation-2.73e-03

Pos logit contributions
 tast+0.053
 scr+0.048
 ang+0.048
 cuc+0.046
 ju+0.045
Neg logit contributions
chair-0.064
ase-0.062
belt-0.058
lf-0.056
overs-0.052
Token.
Token ID13
Feature activation+4.91e-03

Pos logit contributions
chair+0.006
ase+0.006
belt+0.005
lf+0.005
overs+0.005
Neg logit contributions
 tast-0.004
 ang-0.003
 scr-0.003
 swe-0.003
 ju-0.003
Token.
Token ID13
Feature activation+4.44e-03

Pos logit contributions
 tast+0.041
 ang+0.039
 scr+0.039
 cuc+0.039
 ju+0.037
Neg logit contributions
chair-0.032
ase-0.028
belt-0.027
 corner-0.024
overs-0.024
Token She
Token ID1375
Feature activation+7.22e-03

Pos logit contributions
 tast+0.007
 scr+0.007
 ang+0.007
 cuc+0.006
 ju+0.006
Neg logit contributions
chair-0.008
ase-0.008
belt-0.008
lf-0.008
overs-0.007
Token found
Token ID1043
Feature activation-2.33e-02

Pos logit contributions
 tast+0.011
 scr+0.010
 ang+0.010
 cuc+0.010
 ju+0.009
Neg logit contributions
chair-0.014
ase-0.014
belt-0.013
lf-0.013
overs-0.012
Token a
Token ID257
Feature activation-5.47e-03

Pos logit contributions
chair+0.050
ase+0.049
belt+0.048
lf+0.046
overs+0.041
Neg logit contributions
 tast-0.033
 ang-0.030
 scr-0.029
 cuc-0.028
 ju-0.028
Token little
Token ID1310
Feature activation+1.66e-02

Pos logit contributions
chair+0.010
ase+0.009
belt+0.009
lf+0.009
 corner+0.009
Neg logit contributions
 tast-0.009
 ang-0.008
 scr-0.008
 Pars-0.008
 cuc-0.008
Token box
Token ID3091
Feature activation-1.08e-02

Pos logit contributions
 tast+0.028
 scr+0.026
 ang+0.026
 cuc+0.025
 Pars+0.025
Neg logit contributions
chair-0.031
ase-0.028
belt-0.027
lf-0.026
 corner-0.025
Token to
Token ID284
Feature activation-9.19e-04

Pos logit contributions
chair+0.019
ase+0.018
belt+0.017
lf+0.016
issa+0.016
Neg logit contributions
 tast-0.018
 rust-0.016
 scr-0.016
 cuc-0.015
 ju-0.015
Token put
Token ID1234
Feature activation-1.26e-02

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
issa+0.002
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 grow-0.001
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
chair+0.026
ase+0.024
belt+0.023
lf+0.023
loo+0.020
Neg logit contributions
 tast-0.020
 ang-0.017
 ju-0.017
 swe-0.017
 rad-0.017
Token mouse
Token ID10211
Feature activation+3.23e-02

Pos logit contributions
chair+0.027
ase+0.026
belt+0.025
lf+0.024
overs+0.022
Neg logit contributions
 tast-0.024
 ju-0.021
 ang-0.021
 cuc-0.021
 scr-0.020
Token in
Token ID287
Feature activation+2.31e-02

Pos logit contributions
 tast+0.060
 scr+0.055
 ang+0.055
 cuc+0.053
 ju+0.052
Neg logit contributions
chair-0.055
ase-0.053
belt-0.050
lf-0.048
overs-0.042
Token We
Token ID775
Feature activation-6.32e-03

Pos logit contributions
lf+0.027
ase+0.026
chair+0.026
belt+0.026
 doorway+0.023
Neg logit contributions
 tast-0.023
 Pars-0.022
 scr-0.022
 ju-0.021
 Instead-0.021
Token have
Token ID423
Feature activation-1.13e-02

Pos logit contributions
chair+0.014
ase+0.014
belt+0.013
lf+0.013
overs+0.012
Neg logit contributions
 tast-0.008
 scr-0.007
 ang-0.007
 cuc-0.007
 ju-0.007
Token to
Token ID284
Feature activation+1.23e-02

Pos logit contributions
chair+0.024
ase+0.023
lf+0.022
belt+0.022
overs+0.019
Neg logit contributions
 tast-0.017
 scr-0.015
 ang-0.015
 swe-0.014
 ju-0.014
Token get
Token ID651
Feature activation+4.58e-03

Pos logit contributions
 tast+0.018
 ang+0.016
 scr+0.016
 cuc+0.016
 ju+0.015
Neg logit contributions
chair-0.026
ase-0.024
belt-0.024
lf-0.023
 corner-0.021
Token out
Token ID503
Feature activation-1.45e-02

Pos logit contributions
 tast+0.007
 ang+0.007
 scr+0.006
 swe+0.006
 better+0.006
Neg logit contributions
ase-0.010
chair-0.010
lf-0.009
belt-0.008
board-0.008
Token quickly
Token ID2952
Feature activation-8.19e-03

Pos logit contributions
ase+0.029
chair+0.027
belt+0.025
lf+0.025
person+0.022
Neg logit contributions
 tast-0.022
 better-0.021
 scr-0.020
 cal-0.020
 ang-0.020
Token!"
Token ID2474
Feature activation+9.59e-04

Pos logit contributions
chair+0.016
ase+0.016
belt+0.015
lf+0.015
overs+0.013
Neg logit contributions
 tast-0.013
 scr-0.011
 ang-0.011
 cuc-0.011
 swe-0.010
Token␣
Token ID220
Feature activation+9.70e-03

Pos logit contributions
 tast+0.002
 scr+0.002
 ang+0.001
 cuc+0.001
 Pars+0.001
Neg logit contributions
ase-0.002
chair-0.002
belt-0.002
lf-0.002
overs-0.001
Token\n
Token ID198
Feature activation+7.85e-03

Pos logit contributions
 tast+0.016
 ang+0.014
 scr+0.014
 cuc+0.013
 Pars+0.013
Neg logit contributions
chair-0.019
ase-0.018
belt-0.017
lf-0.016
overs-0.015
Token\n
Token ID198
Feature activation+1.17e-02

Pos logit contributions
 tast+0.010
So+0.010
 scr+0.010
 ang+0.010
Together+0.010
Neg logit contributions
chair-0.016
ase-0.016
belt-0.015
overs-0.014
lf-0.013
TokenJohn
Token ID7554
Feature activation-1.89e-02

Pos logit contributions
 tast+0.026
 scr+0.025
So+0.025
 ang+0.025
Together+0.024
Neg logit contributions
ase-0.014
chair-0.014
belt-0.013
overs-0.011
lf-0.010
Token there
Token ID612
Feature activation+1.46e-02

Pos logit contributions
 tast+0.022
 ang+0.020
 scr+0.020
 cuc+0.018
 ju+0.018
Neg logit contributions
chair-0.035
ase-0.034
belt-0.032
lf-0.031
overs-0.029
Token and
Token ID290
Feature activation-9.98e-03

Pos logit contributions
 tast+0.025
 scr+0.023
 ang+0.023
 cuc+0.022
 ju+0.021
Neg logit contributions
chair-0.027
ase-0.026
belt-0.025
lf-0.024
overs-0.022
Token knew
Token ID2993
Feature activation+2.98e-03

Pos logit contributions
chair+0.021
ase+0.020
belt+0.019
lf+0.018
overs+0.016
Neg logit contributions
 tast-0.015
 ang-0.014
 scr-0.014
 cuc-0.013
 rad-0.013
Token just
Token ID655
Feature activation-2.24e-02

Pos logit contributions
 tast+0.005
 ang+0.005
 scr+0.005
 cuc+0.005
 Pars+0.005
Neg logit contributions
chair-0.005
ase-0.005
belt-0.005
lf-0.005
overs-0.004
Token what
Token ID644
Feature activation+2.28e-02

Pos logit contributions
chair+0.047
ase+0.044
belt+0.043
lf+0.042
 corner+0.039
Neg logit contributions
 tast-0.032
 scr-0.028
 cuc-0.027
 ang-0.027
 ju-0.027
Token to
Token ID284
Feature activation-6.41e-03

Pos logit contributions
 tast+0.036
 scr+0.034
 ang+0.033
 cuc+0.033
 rad+0.033
Neg logit contributions
chair-0.043
ase-0.043
belt-0.041
lf-0.037
overs-0.037
Token do
Token ID466
Feature activation+1.00e-02

Pos logit contributions
chair+0.015
ase+0.014
belt+0.014
lf+0.013
overs+0.013
Neg logit contributions
 tast-0.008
 scr-0.007
 ang-0.007
 cuc-0.006
 Pars-0.006
Token.
Token ID13
Feature activation+1.56e-03

Pos logit contributions
 tast+0.008
 ang+0.007
 better+0.007
 scr+0.007
 swe+0.006
Neg logit contributions
chair-0.026
ase-0.026
belt-0.024
 doorway-0.024
lf-0.023
Token His
Token ID2399
Feature activation+1.82e-02

Pos logit contributions
 tast+0.003
 scr+0.003
 ang+0.003
 cuc+0.002
 swe+0.002
Neg logit contributions
chair-0.003
ase-0.003
belt-0.002
lf-0.002
overs-0.002
Token big
Token ID1263
Feature activation-1.80e-03

Pos logit contributions
 tast+0.030
 scr+0.028
 ang+0.028
 cuc+0.026
 rad+0.026
Neg logit contributions
chair-0.035
ase-0.034
belt-0.032
lf-0.030
overs-0.028
Token brother
Token ID3956
Feature activation-6.64e-04

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
overs+0.003
Neg logit contributions
 tast-0.003
 ang-0.003
 scr-0.003
 cuc-0.003
 Pars-0.003
Token the
Token ID262
Feature activation-3.53e-04

Pos logit contributions
chair+0.033
ase+0.031
belt+0.030
lf+0.029
overs+0.026
Neg logit contributions
 tast-0.028
 scr-0.025
 ang-0.025
 cuc-0.024
 ju-0.023
Token hill
Token ID12788
Feature activation+8.26e-03

Pos logit contributions
ase+0.001
chair+0.001
belt+0.001
overs+0.001
lf+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 rad-0.001
 ju-0.001
Token.
Token ID13
Feature activation-4.86e-03

Pos logit contributions
 tast+0.013
 scr+0.012
 ang+0.012
 cuc+0.012
 Pars+0.011
Neg logit contributions
chair-0.016
ase-0.015
belt-0.015
lf-0.014
 corner-0.013
Token The
Token ID383
Feature activation+4.00e-03

Pos logit contributions
chair+0.008
ase+0.007
belt+0.007
lf+0.006
overs+0.006
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 cuc-0.009
 ju-0.009
Token car
Token ID1097
Feature activation-2.64e-02

Pos logit contributions
 tast+0.006
 ang+0.006
 better+0.005
 scr+0.005
 rust+0.005
Neg logit contributions
ase-0.008
chair-0.007
overs-0.007
book-0.007
belt-0.007
Token rolled
Token ID11686
Feature activation+1.47e-02

Pos logit contributions
ase+0.053
overs+0.053
chair+0.050
belt+0.049
book+0.047
Neg logit contributions
 tast-0.036
 scr-0.034
 ang-0.034
 rust-0.031
 grow-0.030
Token and
Token ID290
Feature activation-6.87e-03

Pos logit contributions
 tast+0.025
 ang+0.023
 scr+0.023
 cuc+0.022
 ju+0.022
Neg logit contributions
chair-0.027
ase-0.026
belt-0.025
lf-0.024
overs-0.022
Token rolled
Token ID11686
Feature activation+2.41e-02

Pos logit contributions
chair+0.013
ase+0.013
belt+0.012
lf+0.012
overs+0.011
Neg logit contributions
 tast-0.011
 ang-0.010
 scr-0.010
 cuc-0.010
 ju-0.009
Token until
Token ID1566
Feature activation-7.96e-04

Pos logit contributions
 tast+0.044
 scr+0.041
 ang+0.039
 cuc+0.039
 ju+0.038
Neg logit contributions
chair-0.042
ase-0.038
belt-0.038
lf-0.036
 corner-0.034
Token it
Token ID340
Feature activation+2.42e-03

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.002
 ang-0.001
 scr-0.001
 cuc-0.001
 Pars-0.001
Token finally
Token ID3443
Feature activation+1.27e-02

Pos logit contributions
 tast+0.004
 scr+0.004
 ang+0.004
 ju+0.003
 swe+0.003
Neg logit contributions
chair-0.005
ase-0.004
belt-0.004
lf-0.004
overs-0.004
Token his
Token ID465
Feature activation+1.46e-02

Pos logit contributions
chair+0.004
ase+0.003
belt+0.003
lf+0.003
overs+0.003
Neg logit contributions
 tast-0.004
 scr-0.003
 ang-0.003
 cuc-0.003
 ju-0.003
Token mum
Token ID25682
Feature activation+9.77e-03

Pos logit contributions
 tast+0.027
 ang+0.025
 scr+0.025
 Pars+0.024
 ju+0.023
Neg logit contributions
chair-0.024
ase-0.022
belt-0.022
lf-0.021
 corner-0.020
Token every
Token ID790
Feature activation+2.46e-03

Pos logit contributions
 tast+0.017
 scr+0.016
 ang+0.016
 cuc+0.015
 ju+0.015
Neg logit contributions
chair-0.017
ase-0.017
belt-0.016
lf-0.015
 doorway-0.013
Token single
Token ID2060
Feature activation-2.16e-02

Pos logit contributions
 tast+0.005
 ang+0.004
 scr+0.004
 cal+0.004
 rad+0.004
Neg logit contributions
ase-0.004
chair-0.004
belt-0.004
lf-0.003
overs-0.003
Token day
Token ID1110
Feature activation+4.14e-03

Pos logit contributions
chair+0.034
ase+0.034
belt+0.033
lf+0.031
overs+0.028
Neg logit contributions
 tast-0.040
 scr-0.037
 ang-0.036
 cuc-0.036
 rad-0.035
Token.
Token ID13
Feature activation+9.50e-03

Pos logit contributions
 tast+0.006
 scr+0.006
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.008
ase-0.008
belt-0.008
lf-0.007
loo-0.007
Token<|endoftext|>
Token ID50256
Feature activation-4.66e-04

Pos logit contributions
 tast+0.016
 scr+0.015
 ang+0.015
 cuc+0.015
 ju+0.014
Neg logit contributions
chair-0.017
ase-0.016
belt-0.016
lf-0.015
overs-0.014
TokenA
Token ID32
Feature activation+1.53e-02

Pos logit contributions
belt+0.001
chair+0.001
lf+0.001
ase+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 cuc-0.000
 Pars-0.000
 ang-0.000
 ju-0.000
Token little
Token ID1310
Feature activation+2.29e-02

Pos logit contributions
 tast+0.032
 ang+0.030
 scr+0.030
 cuc+0.029
 ju+0.028
Neg logit contributions
chair-0.023
ase-0.021
belt-0.020
lf-0.019
overs-0.017
Token girl
Token ID2576
Feature activation-9.81e-03

Pos logit contributions
 tast+0.037
 rad+0.034
 ang+0.034
 cuc+0.033
 ju+0.032
Neg logit contributions
chair-0.042
ase-0.042
belt-0.041
lf-0.039
 doorway-0.034
Token was
Token ID373
Feature activation-2.38e-03

Pos logit contributions
belt+0.019
chair+0.018
ase+0.018
lf+0.017
overs+0.016
Neg logit contributions
 tast-0.014
 scr-0.014
 rad-0.013
 cuc-0.013
 tul-0.013
Token and
Token ID290
Feature activation-2.90e-03

Pos logit contributions
ase+0.007
belt+0.007
chair+0.006
lf+0.006
issa+0.005
Neg logit contributions
 tast-0.011
 scr-0.011
 ang-0.010
 swe-0.010
 rust-0.010
Token took
Token ID1718
Feature activation+1.22e-02

Pos logit contributions
ase+0.006
chair+0.006
belt+0.006
overs+0.005
lf+0.005
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.004
Token her
Token ID607
Feature activation-1.02e-03

Pos logit contributions
 tast+0.018
 scr+0.016
 ang+0.016
 swe+0.015
 cuc+0.015
Neg logit contributions
chair-0.025
ase-0.025
belt-0.024
lf-0.023
loo-0.021
Token picture
Token ID4286
Feature activation+6.36e-03

Pos logit contributions
ase+0.002
chair+0.002
lf+0.002
belt+0.002
loo+0.002
Neg logit contributions
 tast-0.002
 scr-0.001
 swe-0.001
 ang-0.001
 ju-0.001
Token.
Token ID13
Feature activation+1.03e-03

Pos logit contributions
 tast+0.010
 scr+0.009
 swe+0.008
 ang+0.008
 ju+0.008
Neg logit contributions
chair-0.012
ase-0.012
belt-0.012
lf-0.011
issa-0.010
Token Kate
Token ID16693
Feature activation-1.75e-02

Pos logit contributions
 scr+0.001
 tast+0.001
 cuc+0.001
 Pars+0.001
 So+0.001
Neg logit contributions
belt-0.002
ase-0.002
chair-0.002
lf-0.002
loo-0.002
Token posed
Token ID15459
Feature activation-2.02e-02

Pos logit contributions
chair+0.032
ase+0.031
belt+0.030
lf+0.027
overs+0.027
Neg logit contributions
 tast-0.029
 scr-0.028
 ang-0.026
 cuc-0.026
 ju-0.024
Token like
Token ID588
Feature activation-1.00e-02

Pos logit contributions
chair+0.042
ase+0.040
lf+0.035
belt+0.035
loo+0.032
Neg logit contributions
 tast-0.032
 scr-0.031
 ang-0.028
 cuc-0.028
 ju-0.027
Token she
Token ID673
Feature activation-2.07e-02

Pos logit contributions
chair+0.020
ase+0.019
belt+0.019
lf+0.017
 corner+0.016
Neg logit contributions
 tast-0.016
 scr-0.014
 ang-0.014
 cuc-0.014
 ju-0.013
Token was
Token ID373
Feature activation-9.86e-04

Pos logit contributions
ase+0.037
chair+0.036
belt+0.034
lf+0.031
loo+0.031
Neg logit contributions
 tast-0.036
 scr-0.034
 ang-0.031
 funn-0.030
 swe-0.029
Token a
Token ID257
Feature activation-8.30e-03

Pos logit contributions
chair+0.002
ase+0.002
belt+0.002
lf+0.002
loo+0.002
Neg logit contributions
 tast-0.002
 scr-0.002
 ang-0.001
 cuc-0.001
 ju-0.001
Tokenolly
Token ID5098
Feature activation+4.99e-03

Pos logit contributions
 tast+0.060
 ang+0.055
 scr+0.055
 cuc+0.054
 ju+0.052
Neg logit contributions
chair-0.057
ase-0.054
belt-0.052
lf-0.049
overs-0.044
Token and
Token ID290
Feature activation-9.50e-03

Pos logit contributions
 tast+0.007
 cuc+0.007
 scr+0.007
 rad+0.007
 spr+0.007
Neg logit contributions
belt-0.009
chair-0.009
ase-0.009
lf-0.008
overs-0.008
Token M
Token ID337
Feature activation+1.70e-02

Pos logit contributions
ase+0.019
chair+0.018
belt+0.018
lf+0.016
loo+0.015
Neg logit contributions
 tast-0.014
 scr-0.014
 rad-0.013
 cuc-0.013
 Pars-0.013
Tokenandy
Token ID10757
Feature activation+5.02e-03

Pos logit contributions
 tast+0.032
 ang+0.030
 scr+0.029
 cuc+0.028
 ju+0.028
Neg logit contributions
chair-0.029
ase-0.028
belt-0.026
lf-0.025
overs-0.023
Token were
Token ID547
Feature activation-5.38e-03

Pos logit contributions
 tast+0.009
 scr+0.008
 cuc+0.008
 ang+0.008
 rad+0.007
Neg logit contributions
chair-0.009
ase-0.009
belt-0.009
lf-0.008
overs-0.007
Token best
Token ID1266
Feature activation-1.85e-02

Pos logit contributions
chair+0.010
ase+0.009
belt+0.009
lf+0.008
board+0.007
Neg logit contributions
 tast-0.010
 scr-0.009
 ang-0.009
 cuc-0.009
 ju-0.008
Token friends
Token ID2460
Feature activation-9.00e-03

Pos logit contributions
chair+0.032
ase+0.030
belt+0.028
lf+0.027
overs+0.025
Neg logit contributions
 tast-0.034
 ang-0.032
 scr-0.031
 cuc-0.030
 ju-0.030
Token.
Token ID13
Feature activation+6.28e-03

Pos logit contributions
chair+0.018
ase+0.017
belt+0.016
lf+0.015
overs+0.014
Neg logit contributions
 tast-0.015
 ang-0.013
 scr-0.013
 cuc-0.013
 ju-0.013
Token Every
Token ID3887
Feature activation+6.00e-03

Pos logit contributions
 tast+0.010
 ang+0.010
 scr+0.009
 cuc+0.009
 ju+0.008
Neg logit contributions
chair-0.012
ase-0.012
belt-0.010
overs-0.010
 corner-0.010
Token day
Token ID1110
Feature activation+1.80e-03

Pos logit contributions
 tast+0.012
 scr+0.012
 ang+0.011
 cuc+0.011
 swe+0.010
Neg logit contributions
belt-0.008
chair-0.008
ase-0.008
lf-0.008
overs-0.007
Token they
Token ID484
Feature activation-1.35e-02

Pos logit contributions
 tast+0.003
 ang+0.003
 scr+0.003
 cuc+0.003
 ju+0.003
Neg logit contributions
chair-0.003
ase-0.003
belt-0.003
lf-0.003
overs-0.003
Token She
Token ID1375
Feature activation+1.04e-02

Pos logit contributions
 tast+0.006
 scr+0.005
 ang+0.005
 cuc+0.005
 ju+0.005
Neg logit contributions
chair-0.005
ase-0.005
belt-0.005
lf-0.005
overs-0.004
Token said
Token ID531
Feature activation-2.09e-02

Pos logit contributions
 tast+0.016
 scr+0.015
 ang+0.014
 cuc+0.014
 ju+0.013
Neg logit contributions
chair-0.021
ase-0.020
belt-0.020
lf-0.019
overs-0.018
Token,
Token ID11
Feature activation-2.25e-03

Pos logit contributions
chair+0.035
ase+0.031
belt+0.029
lf+0.028
overs+0.027
Neg logit contributions
 tast-0.041
 scr-0.036
 ang-0.035
 ju-0.034
 funn-0.034
Token "
Token ID366
Feature activation+1.08e-02

Pos logit contributions
chair+0.004
ase+0.004
belt+0.004
lf+0.004
overs+0.003
Neg logit contributions
 tast-0.004
 scr-0.004
 ang-0.004
 cuc-0.003
 Pars-0.003
TokenThere
Token ID1858
Feature activation+2.02e-02

Pos logit contributions
 tast+0.020
 scr+0.019
 ang+0.019
 cuc+0.018
 ju+0.018
Neg logit contributions
chair-0.019
ase-0.017
belt-0.017
lf-0.016
overs-0.014
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
 tast+0.033
 scr+0.030
 ang+0.030
 cuc+0.029
 ju+0.028
Neg logit contributions
chair-0.039
ase-0.037
belt-0.035
lf-0.034
overs-0.031
Token there
Token ID612
Feature activation+2.71e-03

Pos logit contributions
ase+0.038
chair+0.038
belt+0.034
lf+0.034
overs+0.032
Neg logit contributions
 tast-0.030
 ang-0.027
 scr-0.026
 ju-0.026
 rad-0.025
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
 tast+0.005
 ang+0.005
 scr+0.005
 cuc+0.005
 ju+0.004
Neg logit contributions
chair-0.005
ase-0.004
belt-0.004
lf-0.004
 corner-0.004
Token it
Token ID340
Feature activation-2.87e-03

Pos logit contributions
ase+0.026
chair+0.026
belt+0.023
lf+0.023
overs+0.022
Neg logit contributions
 tast-0.019
 ang-0.017
 scr-0.016
 ju-0.016
 cuc-0.015
Token will
Token ID481
Feature activation-1.82e-02

Pos logit contributions
chair+0.004
ase+0.004
belt+0.003
lf+0.003
board+0.003
Neg logit contributions
 tast-0.006
 cuc-0.006
 ang-0.005
 scr-0.005
 Pars-0.005
Token be
Token ID307
Feature activation+8.19e-03

Pos logit contributions
chair+0.031
belt+0.029
ase+0.028
lf+0.027
 corner+0.026
Neg logit contributions
 tast-0.030
 cuc-0.029
 ang-0.029
 scr-0.028
 Pars-0.028
Token old
Token ID1468
Feature activation+2.08e-03

Pos logit contributions
 tast+0.022
 scr+0.020
 ang+0.019
 cuc+0.019
 Pars+0.018
Neg logit contributions
chair-0.031
ase-0.029
belt-0.029
lf-0.027
overs-0.025
Token and
Token ID290
Feature activation-2.21e-03

Pos logit contributions
 tast+0.004
 ang+0.003
 scr+0.003
 cuc+0.003
 swe+0.003
Neg logit contributions
chair-0.004
ase-0.003
belt-0.003
lf-0.003
 doorway-0.003
Token dangerous
Token ID4923
Feature activation-3.10e-03

Pos logit contributions
ase+0.005
chair+0.005
belt+0.005
lf+0.005
overs+0.004
Neg logit contributions
 tast-0.003
 ang-0.003
 scr-0.003
 ju-0.002
 cuc-0.002
Token.
Token ID13
Feature activation+2.36e-03

Pos logit contributions
chair+0.006
ase+0.006
belt+0.006
lf+0.006
overs+0.005
Neg logit contributions
 tast-0.005
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.004
Token No
Token ID1400
Feature activation+9.67e-04

Pos logit contributions
 tast+0.004
 Pars+0.004
 scr+0.003
 ju+0.003
 cuc+0.003
Neg logit contributions
lf-0.004
ase-0.004
chair-0.004
belt-0.004
 doorway-0.004
Token one
Token ID530
Feature activation-3.71e-02

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 Pars+0.001
 cuc+0.001
Neg logit contributions
chair-0.002
ase-0.002
belt-0.002
lf-0.002
 corner-0.002
Token should
Token ID815
Feature activation-5.72e-03

Pos logit contributions
belt+0.074
ase+0.072
chair+0.071
lf+0.067
overs+0.062
Neg logit contributions
 tast-0.057
 scr-0.050
 ang-0.050
 cal-0.047
 cuc-0.047
Token have
Token ID423
Feature activation-6.30e-03

Pos logit contributions
chair+0.012
ase+0.012
belt+0.011
lf+0.011
overs+0.010
Neg logit contributions
 tast-0.008
 scr-0.007
 ang-0.007
 cuc-0.007
 Pars-0.007
Token gone
Token ID3750
Feature activation+2.72e-02

Pos logit contributions
chair+0.012
ase+0.011
belt+0.011
lf+0.010
 corner+0.009
Neg logit contributions
 tast-0.011
 scr-0.010
 ang-0.010
 cuc-0.009
 Pars-0.009
Token on
Token ID319
Feature activation-2.25e-03

Pos logit contributions
 tast+0.047
 ang+0.044
 scr+0.043
 cuc+0.041
 ju+0.040
Neg logit contributions
chair-0.050
ase-0.048
belt-0.045
lf-0.043
overs-0.039
Token it
Token ID340
Feature activation-1.00e-03

Pos logit contributions
chair+0.005
ase+0.005
belt+0.004
lf+0.004
board+0.004
Neg logit contributions
 tast-0.003
 scr-0.003
 ang-0.003
 rad-0.003
 cuc-0.003
Token around
Token ID1088
Feature activation-2.14e-02

Pos logit contributions
 tast+0.022
 ang+0.021
 scr+0.021
 cuc+0.020
 ju+0.019
Neg logit contributions
chair-0.019
ase-0.018
belt-0.017
lf-0.016
overs-0.014
Token.
Token ID13
Feature activation-7.97e-04

Pos logit contributions
chair+0.045
ase+0.044
belt+0.043
lf+0.040
loo+0.036
Neg logit contributions
 tast-0.032
 ang-0.030
 scr-0.030
 cuc-0.028
 swe-0.027
Token She
Token ID1375
Feature activation+7.48e-03

Pos logit contributions
chair+0.002
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 scr-0.001
 ang-0.001
 cuc-0.001
 Pars-0.001
Token saw
Token ID2497
Feature activation-2.42e-02

Pos logit contributions
 tast+0.011
 scr+0.010
 ang+0.010
 grow+0.009
 ju+0.009
Neg logit contributions
ase-0.016
chair-0.016
overs-0.014
belt-0.014
lf-0.014
Token a
Token ID257
Feature activation-9.85e-03

Pos logit contributions
chair+0.050
ase+0.049
belt+0.049
lf+0.045
overs+0.042
Neg logit contributions
 tast-0.035
 ang-0.033
 scr-0.032
 cuc-0.030
 swe-0.029
Token window
Token ID4324
Feature activation-4.38e-02

Pos logit contributions
chair+0.018
ase+0.018
belt+0.017
lf+0.015
overs+0.015
Neg logit contributions
 tast-0.017
 scr-0.015
 ang-0.015
 cuc-0.015
 ju-0.014
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
chair+0.080
ase+0.079
belt+0.078
lf+0.070
child+0.070
Neg logit contributions
 scr-0.063
 rust-0.062
 grow-0.062
 ang-0.060
 tast-0.060
Token She
Token ID1375
Feature activation+3.96e-03

Pos logit contributions
chair+0.012
belt+0.011
ase+0.011
lf+0.011
overs+0.010
Neg logit contributions
 tast-0.012
 scr-0.011
 ang-0.011
 Pars-0.011
 cuc-0.010
Token ran
Token ID4966
Feature activation-6.94e-03

Pos logit contributions
 scr+0.005
 tast+0.005
 ang+0.005
 rust+0.005
 grow+0.005
Neg logit contributions
ase-0.009
chair-0.009
belt-0.008
overs-0.008
loo-0.007
Token to
Token ID284
Feature activation+3.60e-03

Pos logit contributions
ase+0.015
chair+0.014
lf+0.014
belt+0.014
child+0.012
Neg logit contributions
 tast-0.009
 ang-0.009
 cal-0.009
 swe-0.008
 scr-0.008
Token the
Token ID262
Feature activation-7.65e-03

Pos logit contributions
 tast+0.005
 ang+0.005
 scr+0.005
 Pars+0.004
 cuc+0.004
Neg logit contributions
ase-0.008
chair-0.008
belt-0.008
lf-0.007
loo-0.006
Token"
Token ID1
Feature activation-7.82e-03

Pos logit contributions
chair+0.003
ase+0.003
belt+0.003
lf+0.003
overs+0.002
Neg logit contributions
 tast-0.004
 scr-0.003
 ang-0.003
 cuc-0.003
 Pars-0.003
TokenIt
Token ID1026
Feature activation+1.79e-02

Pos logit contributions
chair+0.012
ase+0.012
belt+0.011
lf+0.010
 corner+0.009
Neg logit contributions
 tast-0.016
 ang-0.015
 scr-0.014
 cuc-0.014
 swe-0.014
Token is
Token ID318
Feature activation+1.40e-02

Pos logit contributions
 tast+0.041
 ang+0.039
 scr+0.039
 cuc+0.038
 ju+0.037
Neg logit contributions
chair-0.023
ase-0.020
belt-0.017
lf-0.017
 corner-0.015
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
 tast+0.022
 ang+0.020
 scr+0.020
 cuc+0.019
 Pars+0.018
Neg logit contributions
chair-0.028
ase-0.026
belt-0.025
lf-0.024
 corner-0.022
Token k
Token ID479
Feature activation-2.08e-02

Pos logit contributions
chair+0.030
ase+0.028
belt+0.027
lf+0.026
 corner+0.024
Neg logit contributions
 tast-0.027
 ang-0.025
 scr-0.025
 cuc-0.024
 ju-0.023
Tokenang
Token ID648
Feature activation-8.19e-02

Pos logit contributions
chair+0.030
belt+0.027
ase+0.027
lf+0.027
 doorway+0.024
Neg logit contributions
 tast-0.042
 scr-0.038
 ang-0.038
 Pars-0.038
 cuc-0.037
Tokenaroo
Token ID38049
Feature activation-4.81e-04

Pos logit contributions
chair+0.123
ase+0.112
belt+0.108
lf+0.103
 doorway+0.103
Neg logit contributions
 rad-0.150
 tast-0.149
 cuc-0.145
 scr-0.143
 ang-0.140
Token,"
Token ID553
Feature activation+4.73e-03

Pos logit contributions
chair+0.001
ase+0.001
belt+0.001
lf+0.001
overs+0.001
Neg logit contributions
 tast-0.001
 ang-0.001
 scr-0.001
 cuc-0.001
 ju-0.001
Token Ben
Token ID3932
Feature activation+3.48e-03

Pos logit contributions
 tast+0.010
 ang+0.009
 scr+0.009
 cuc+0.009
 ju+0.009
Neg logit contributions
chair-0.007
ase-0.006
belt-0.006
lf-0.006
 corner-0.005
Token says
Token ID1139
Feature activation-2.04e-02

Pos logit contributions
 tast+0.007
 ju+0.006
 cuc+0.006
 ang+0.006
 healthier+0.006
Neg logit contributions
chair-0.005
ase-0.004
ena-0.004
ana-0.004
board-0.004
Token.
Token ID13
Feature activation-4.07e-03

Pos logit contributions
chair+0.036
ase+0.032
belt+0.031
lf+0.029
 corner+0.028
Neg logit contributions
 tast-0.038
 cuc-0.034
 ju-0.034
 scr-0.034
 ang-0.033
Token,
Token ID11
Feature activation-9.06e-04

Pos logit contributions
 tast+0.012
 ang+0.011
 scr+0.011
 cuc+0.010
 swe+0.010
Neg logit contributions
chair-0.024
ase-0.024
belt-0.023
lf-0.021
 doorway-0.020
Token Sarah
Token ID10490
Feature activation-1.78e-02

Pos logit contributions
ase+0.002
chair+0.002
belt+0.001
loo+0.001
lf+0.001
Neg logit contributions
 tast-0.002
 ang-0.002
 scr-0.002
 cuc-0.001
 swe-0.001
Token found
Token ID1043
Feature activation-1.69e-02

Pos logit contributions
chair+0.035
belt+0.035
ase+0.034
overs+0.031
lf+0.030
Neg logit contributions
 tast-0.026
 scr-0.024
 ang-0.023
 cuc-0.023
 rad-0.022
Token a
Token ID257
Feature activation-9.00e-03

Pos logit contributions
chair+0.033
ase+0.032
belt+0.031
lf+0.029
overs+0.027
Neg logit contributions
 tast-0.027
 scr-0.024
 ang-0.024
 cuc-0.023
 ju-0.022
Token books
Token ID3835
Feature activation-3.49e-02

Pos logit contributions
chair+0.016
ase+0.015
belt+0.014
lf+0.014
 corner+0.013
Neg logit contributions
 tast-0.015
 scr-0.014
 ang-0.014
 Pars-0.014
 cuc-0.013
Tokenhe
Token ID258
Feature activation-9.53e-02

Pos logit contributions
chair+0.040
ase+0.039
belt+0.039
child+0.034
lf+0.033
Neg logit contributions
 tast-0.077
 scr-0.070
 ang-0.069
 better-0.067
 rust-0.067
Tokenlf
Token ID1652
Feature activation-2.40e-02

Pos logit contributions
chair+0.099
ase+0.091
belt+0.088
ane+0.081
overs+0.080
Neg logit contributions
 scr-0.187
 cuc-0.186
 grow-0.179
 rad-0.175
 pr-0.175
Token in
Token ID287
Feature activation+4.15e-03

Pos logit contributions
chair+0.041
ase+0.039
belt+0.037
lf+0.034
overs+0.033
Neg logit contributions
 tast-0.041
 scr-0.038
 ang-0.037
 cuc-0.035
 rust-0.034
Token her
Token ID607
Feature activation-3.17e-03

Pos logit contributions
 tast+0.007
 scr+0.006
 ang+0.006
 cuc+0.006
 ju+0.006
Neg logit contributions
chair-0.008
ase-0.007
belt-0.007
lf-0.007
overs-0.006
Token room
Token ID2119
Feature activation-1.82e-02

Pos logit contributions
chair+0.005
ase+0.004
belt+0.004
t+0.004
lf+0.004
Neg logit contributions
 tast-0.006
 scr-0.006
 swe-0.005
 cuc-0.005
 prett-0.005
Token.
Token ID13
Feature activation+2.27e-03

Pos logit contributions
chair+0.044
belt+0.041
ase+0.041
issa+0.038
lf+0.038
Neg logit contributions
 scr-0.017
 tast-0.017
 cuc-0.014
 ang-0.013
 grow-0.013
TokenL
Token ID43
Feature activation-4.83e-03

Pos logit contributions
chair+0.015
ase+0.014
belt+0.013
lf+0.012
 corner+0.012
Neg logit contributions
 tast-0.014
 ang-0.014
 scr-0.013
 cuc-0.013
 Pars-0.013
Token-
Token ID12
Feature activation-6.91e-02

Pos logit contributions
chair+0.006
belt+0.006
lf+0.005
 doorway+0.005
ase+0.004
Neg logit contributions
 tast-0.010
 scr-0.010
 cuc-0.010
 ang-0.010
abb-0.010
TokenI
Token ID40
Feature activation+8.10e-03

Pos logit contributions
chair+0.122
belt+0.114
ase+0.109
 doorway+0.106
lf+0.104
Neg logit contributions
 tast-0.104
 cuc-0.102
 ang-0.102
 scr-0.099
 ju-0.097
Token-
Token ID12
Feature activation-6.71e-02

Pos logit contributions
 tast+0.012
 ang+0.011
 scr+0.011
 cuc+0.011
 Pars+0.010
Neg logit contributions
chair-0.016
ase-0.016
belt-0.015
lf-0.014
 corner-0.013
TokenL
Token ID43
Feature activation-2.24e-03

Pos logit contributions
chair+0.129
ase+0.120
belt+0.119
lf+0.116
 doorway+0.108
Neg logit contributions
 tast-0.103
 ang-0.098
 cuc-0.094
 scr-0.093
 ju-0.092
Token-
Token ID12
Feature activation-6.70e-02

Pos logit contributions
chair+0.003
belt+0.002
lf+0.002
ase+0.002
 doorway+0.002
Neg logit contributions
 tast-0.005
 ang-0.005
 scr-0.005
 cuc-0.005
abb-0.005
TokenY
Token ID56
Feature activation+1.87e-02

Pos logit contributions
chair+0.129
ase+0.121
belt+0.119
lf+0.117
 doorway+0.109
Neg logit contributions
 tast-0.102
 ang-0.097
 cuc-0.093
 scr-0.092
 ju-0.091
Token,"
Token ID553
Feature activation+4.80e-03

Pos logit contributions
 tast+0.054
 ang+0.052
 scr+0.051
 cuc+0.051
 Pars+0.050
Neg logit contributions
chair-0.013
ase-0.012
belt-0.009
lf-0.006
board-0.006
Token Lily
Token ID20037
Feature activation+2.01e-03

Pos logit contributions
 tast+0.008
 scr+0.007
 ang+0.007
 Pars+0.007
 cuc+0.007
Neg logit contributions
chair-0.009
ase-0.008
lf-0.008
belt-0.008
 corner-0.007
Token spelled
Token ID32213
Feature activation-1.91e-02

Pos logit contributions
 tast+0.004
 ang+0.004
 unnecessary+0.003
 ju+0.003
 Pars+0.003
Neg logit contributions
chair-0.003
ase-0.003
lf-0.003
belt-0.003
ana-0.002
Token.
Token ID13
Feature activation-3.66e-04

Pos logit contributions
chair+0.034
ase+0.032
belt+0.030
lf+0.029
overs+0.026
Neg logit contributions
 tast-0.035
 scr-0.032
 ang-0.032
 cuc-0.031
 ju-0.030
Token next
Token ID1306
Feature activation+1.70e-02

Pos logit contributions
 tast+0.012
 scr+0.011
 ang+0.011
 cuc+0.011
 ju+0.010
Neg logit contributions
chair-0.013
ase-0.013
belt-0.012
lf-0.012
overs-0.011
Token time
Token ID640
Feature activation+7.82e-03

Pos logit contributions
 tast+0.039
 scr+0.037
 ang+0.037
 cuc+0.036
 Pars+0.035
Neg logit contributions
chair-0.022
ase-0.020
belt-0.019
lf-0.018
 corner-0.015
Token he
Token ID339
Feature activation-3.77e-03

Pos logit contributions
 tast+0.012
 ang+0.011
 scr+0.011
 cuc+0.011
 ju+0.010
Neg logit contributions
ase-0.016
chair-0.015
belt-0.014
lf-0.013
 doorway-0.012
Token could
Token ID714
Feature activation+1.19e-02

Pos logit contributions
chair+0.007
ase+0.007
belt+0.007
lf+0.006
overs+0.006
Neg logit contributions
 tast-0.006
 scr-0.006
 ang-0.006
 cuc-0.006
 ju-0.005
Token bat
Token ID7365
Feature activation+1.67e-02

Pos logit contributions
 tast+0.015
 scr+0.014
 ang+0.013
 cuc+0.013
 Pars+0.013
Neg logit contributions
chair-0.027
ase-0.026
belt-0.025
lf-0.024
overs-0.022
Tokenhe
Token ID258
Feature activation-6.87e-02

Pos logit contributions
 tast+0.040
 scr+0.038
 ang+0.037
 cuc+0.036
 ju+0.036
Neg logit contributions
chair-0.016
belt-0.016
ase-0.014
lf-0.014
 doorway-0.012
Token in
Token ID287
Feature activation+2.15e-02

Pos logit contributions
chair+0.137
ase+0.136
belt+0.125
 doorway+0.116
issa+0.112
Neg logit contributions
 tast-0.097
 scr-0.096
 cuc-0.089
 ang-0.087
 rad-0.084
Token the
Token ID262
Feature activation+7.27e-03

Pos logit contributions
 tast+0.031
 scr+0.029
 ang+0.028
 cuc+0.026
 swe+0.026
Neg logit contributions
chair-0.047
ase-0.046
belt-0.043
lf-0.040
person-0.038
Token warm
Token ID5814
Feature activation+1.55e-02

Pos logit contributions
 tast+0.012
 scr+0.011
 ang+0.011
 cuc+0.011
 ju+0.010
Neg logit contributions
chair-0.014
ase-0.013
belt-0.013
lf-0.012
overs-0.011
Token light
Token ID1657
Feature activation-3.62e-03

Pos logit contributions
 tast+0.023
 swe+0.021
 scr+0.021
 rad+0.020
 cuc+0.020
Neg logit contributions
chair-0.031
ase-0.030
belt-0.030
lf-0.029
issa-0.027
Token.
Token ID13
Feature activation-3.51e-03

Pos logit contributions
chair+0.007
ase+0.007
belt+0.007
lf+0.007
issa+0.006
Neg logit contributions
 tast-0.005
 scr-0.005
 ang-0.005
 cuc-0.005
 swe-0.004
Token take
Token ID1011
Feature activation+1.14e-02

Pos logit contributions
 tast+0.007
 scr+0.006
 ang+0.006
 cuc+0.006
 ju+0.006
Neg logit contributions
chair-0.011
ase-0.010
belt-0.010
lf-0.009
overs-0.009
Token one
Token ID530
Feature activation-1.48e-02

Pos logit contributions
 tast+0.017
 ang+0.015
 scr+0.015
 cuc+0.014
 Pars+0.014
Neg logit contributions
chair-0.024
ase-0.023
belt-0.022
lf-0.021
overs-0.019
Token of
Token ID286
Feature activation+1.97e-03

Pos logit contributions
chair+0.023
ase+0.023
belt+0.022
lf+0.021
overs+0.019
Neg logit contributions
 tast-0.029
 scr-0.026
 ang-0.025
 cuc-0.025
 ju-0.024
Token the
Token ID262
Feature activation+8.25e-04

Pos logit contributions
 tast+0.003
 ang+0.003
 scr+0.003
 Pars+0.003
 cuc+0.003
Neg logit contributions
chair-0.004
ase-0.004
belt-0.004
lf-0.003
board-0.003
Token works
Token ID2499
Feature activation+8.21e-03

Pos logit contributions
 tast+0.001
 scr+0.001
 ang+0.001
 swe+0.001
 funn+0.001
Neg logit contributions
ase-0.002
chair-0.001
belt-0.001
lf-0.001
loo-0.001
Tokenhe
Token ID258
Feature activation-7.74e-02

Pos logit contributions
 tast+0.015
 scr+0.013
 ang+0.013
 swe+0.012
 cuc+0.012
Neg logit contributions
chair-0.014
ase-0.014
belt-0.013
lf-0.012
 doorway-0.011
Tokenets
Token ID1039
Feature activation-1.08e-02

Pos logit contributions
ase+0.114
chair+0.111
belt+0.109
 doorway+0.094
loo+0.093
Neg logit contributions
 scr-0.134
 tast-0.130
 cuc-0.126
 ang-0.123
 rad-0.121
Token and
Token ID290
Feature activation-3.23e-03

Pos logit contributions
chair+0.021
ase+0.021
belt+0.020
lf+0.019
issa+0.017
Neg logit contributions
 tast-0.017
 scr-0.015
 ang-0.015
 cuc-0.014
 swe-0.014
Token start
Token ID923
Feature activation-6.02e-03

Pos logit contributions
ase+0.007
chair+0.007
belt+0.006
lf+0.006
overs+0.006
Neg logit contributions
 tast-0.005
 scr-0.004
 ang-0.004
 cuc-0.004
 ju-0.004
Token doing
Token ID1804
Feature activation+2.95e-02

Pos logit contributions
chair+0.010
ase+0.009
belt+0.009
lf+0.008
overs+0.007
Neg logit contributions
 tast-0.012
 scr-0.011
 ang-0.011
 cuc-0.010
 ju-0.010
Token the
Token ID262
Feature activation+7.29e-03

Pos logit contributions
 tast+0.050
 ang+0.044
 cuc+0.044
 scr+0.044
 ju+0.043
Neg logit contributions
chair-0.054
belt-0.049
ase-0.049
lf-0.048
overs-0.043