TOP ACTIVATIONS
MAX = 0.072

" W oof , wo of , I 'm a
tail . " W oof wo of ! Bird !" Max
's mom gave them a brand new basketball and they played
, " W oof , wo of . You are welcome
, " W oof , wo of , my name is
said , " W oof wo of !" The dog bark
speak ! " W oof wo of !" it said .
" W oof , wo of !" Spot bark ed
on her mom my 's brand new table !
that goes " wo of wo of wo of ".
. " W oof , wo of !" "
" wo of wo of wo of ". "
, " W oof , wo of , Tom ! Can
and she gave Teddy a brand new sc ooter . He
, " W oof , wo of , hello , Tom
enormous pet cat named C udd les . She loved C
mom my gave her a brand new pan to paint with
wanted to touch the ost ric hes . They reached out
" W oof , wo of !" Spot bark ed
should not tease the ost ric hes . They are not

MIN ACTIVATIONS
MIN = -0.094

and found the coin 's rightful place . Lily was happy
that the penny find its rightful owner .
be happy and well - beh aved . The end .
, but it was all fore shadow ed by the kind
became friendly and well - beh aved , just like Buddy
ie . They both had birth days coming up , and
the rice back in its rightful place . John was proud
put them back in their rightful place . After
he got it so the rightful owner could find it .
could be back with its rightful owner . In
saw some other kids mis beh aving and running around .
listen . He always mis beh aved and caused trouble wherever
restore it back to its rightful place . Although Sammy was
started walking very slowly as tort o ises do , but
to return it to its rightful owner if you can find
the ancient knob found its rightful place in their house .
The boot returned to its rightful owners and the conflict was
returned the towel to its rightful owner . The owner was
for being so well - beh aved and Jack was delighted
It helps us remember when birth days and special days are

INTERVAL 0.031 - 0.072
CONTAINS 1.901%

. She was very nervous because she had a w igg
Lily were playing in the snow . They wanted to make
to find him . He hopes she will not see his
on . <|endoftext|> L ily liked to play with her toy
back and smiled . He couldn 't help but feel happy

INTERVAL -0.011 - 0.031
CONTAINS 76.869%

water . They make the shark swim and splash . They
pocket . As she walked through the park ,
He grabbed it and threw it on the ground . The
crabs : Max and Sam . Max was a very gifted
each other . They looked at their cars . They looked

INTERVAL -0.053 - -0.011
CONTAINS 21.195%

thanked her . " Thank you so much for
." L ily said , " Wow ,
playing . One day , a little girl ran
and learn all about them . He spent the whole day
y ucky ! Candy is better ." But Lily remembered that

INTERVAL -0.094 - -0.053
CONTAINS 0.036%

my 's dad started to un pack the fishing gear .
. They wished they were haw ks and could eat mice
to be careful . Maybe someone is looking for this money
take something that doesn 't belong to you ! You should
bright lights and lots of people . He was so amazed
Token\n
Token ID198
Feature activation-1.09e-02

Pos logit contributions
P+0.001
over+0.001
Sus+0.001
hered+0.001
amed+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 oven-0.001
 microwave-0.001
Token"
Token ID1
Feature activation-2.35e-02

Pos logit contributions
over+0.016
P+0.016
liest+0.014
Sus+0.014
hered+0.014
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 chair-0.017
 microwave-0.017
TokenW
Token ID54
Feature activation-6.24e-03

Pos logit contributions
over+0.039
P+0.036
liest+0.035
 such+0.034
ed+0.034
Neg logit contributions
 stool-0.036
 tray-0.035
 log-0.035
 chair-0.033
 scale-0.032
Tokenoof
Token ID37711
Feature activation+3.05e-02

Pos logit contributions
over+0.010
P+0.009
liest+0.009
hered+0.009
beh+0.009
Neg logit contributions
 stool-0.011
 tray-0.010
 log-0.010
 chair-0.010
 scale-0.010
Token,
Token ID11
Feature activation+4.47e-02

Pos logit contributions
 stool+0.051
 tray+0.049
 log+0.048
 chair+0.046
 microwave+0.046
Neg logit contributions
over-0.046
P-0.046
liest-0.041
hered-0.041
Sus-0.040
Token wo
Token ID24486
Feature activation+7.21e-02

Pos logit contributions
 stool+0.040
 microwave+0.037
 tray+0.036
 log+0.035
 oven+0.033
Neg logit contributions
over-0.100
P-0.097
hered-0.094
 such-0.092
lyss-0.091
Tokenof
Token ID1659
Feature activation+2.08e-02

Pos logit contributions
 stool+0.128
 tray+0.123
 log+0.122
 scale+0.119
 microwave+0.118
Neg logit contributions
P-0.106
over-0.102
hered-0.093
liest-0.090
date-0.090
Token,
Token ID11
Feature activation+3.60e-02

Pos logit contributions
 stool+0.032
 tray+0.031
 log+0.030
 chair+0.029
 microwave+0.029
Neg logit contributions
over-0.034
P-0.034
Sus-0.031
liest-0.030
amed-0.030
Token I
Token ID314
Feature activation+1.35e-02

Pos logit contributions
 stool+0.036
 tray+0.034
 microwave+0.034
 log+0.032
 oven+0.032
Neg logit contributions
over-0.074
P-0.073
hered-0.071
 such-0.071
lyss-0.070
Token'm
Token ID1101
Feature activation+3.52e-03

Pos logit contributions
 stool+0.021
 tray+0.020
 log+0.020
 chair+0.019
 microwave+0.019
Neg logit contributions
over-0.022
P-0.022
hered-0.020
liest-0.020
date-0.020
Token a
Token ID257
Feature activation-2.55e-03

Pos logit contributions
 stool+0.006
 log+0.006
 tray+0.005
 chair+0.005
 scale+0.005
Neg logit contributions
P-0.005
over-0.005
liest-0.005
date-0.005
Sus-0.005
Token tail
Token ID7894
Feature activation+1.05e-02

Pos logit contributions
over+0.018
P+0.017
Sus+0.016
liest+0.016
date+0.016
Neg logit contributions
 stool-0.010
 tray-0.010
 log-0.010
 chair-0.009
 scale-0.009
Token.
Token ID13
Feature activation-8.74e-03

Pos logit contributions
 stool+0.018
 tray+0.017
 log+0.017
 chair+0.016
 scale+0.016
Neg logit contributions
P-0.015
over-0.015
Sus-0.014
lyss-0.014
liest-0.014
Token "
Token ID366
Feature activation-7.48e-03

Pos logit contributions
over+0.014
 Emmy+0.013
P+0.013
ed+0.013
 such+0.013
Neg logit contributions
 stool-0.013
 tray-0.012
 log-0.012
 fork-0.012
 oven-0.012
TokenW
Token ID54
Feature activation-1.03e-02

Pos logit contributions
over+0.010
P+0.010
ed+0.009
hered+0.009
liest+0.009
Neg logit contributions
 stool-0.013
 log-0.013
 tray-0.012
 chair-0.012
 scale-0.012
Tokenoof
Token ID37711
Feature activation+2.15e-02

Pos logit contributions
over+0.017
P+0.017
liest+0.016
hered+0.015
Sus+0.015
Neg logit contributions
 stool-0.017
 tray-0.016
 log-0.015
 chair-0.015
 microwave-0.015
Token wo
Token ID24486
Feature activation+7.20e-02

Pos logit contributions
 stool+0.031
 tray+0.030
 log+0.030
 chair+0.028
 microwave+0.028
Neg logit contributions
over-0.037
P-0.037
liest-0.033
Sus-0.033
amed-0.033
Tokenof
Token ID1659
Feature activation+2.24e-02

Pos logit contributions
 stool+0.129
 tray+0.123
 log+0.122
 scale+0.118
 chair+0.118
Neg logit contributions
P-0.105
over-0.102
hered-0.093
liest-0.091
Sus-0.090
Token!
Token ID0
Feature activation+1.79e-02

Pos logit contributions
 stool+0.032
 tray+0.031
 log+0.030
 chair+0.029
 microwave+0.029
Neg logit contributions
over-0.039
P-0.039
Sus-0.035
liest-0.035
hered-0.035
Token Bird
Token ID14506
Feature activation-2.41e-02

Pos logit contributions
 stool+0.030
 tray+0.028
 log+0.028
 chair+0.027
 microwave+0.027
Neg logit contributions
over-0.027
P-0.027
liest-0.024
hered-0.024
Sus-0.024
Token!"
Token ID2474
Feature activation-7.89e-03

Pos logit contributions
over+0.026
P+0.026
liest+0.022
hered+0.022
Sus+0.022
Neg logit contributions
 stool-0.051
 tray-0.049
 log-0.048
 chair-0.047
 microwave-0.047
Token Max
Token ID5436
Feature activation-8.72e-03

Pos logit contributions
over+0.013
P+0.013
liest+0.012
amed+0.012
Sus+0.012
Neg logit contributions
 stool-0.012
 log-0.011
 tray-0.011
 chair-0.011
 scale-0.011
Token's
Token ID338
Feature activation-4.57e-03

Pos logit contributions
over+0.008
P+0.008
liest+0.007
hered+0.007
amed+0.007
Neg logit contributions
 stool-0.013
 tray-0.012
 log-0.012
 microwave-0.012
 chair-0.012
Token mom
Token ID1995
Feature activation-3.03e-02

Pos logit contributions
over+0.009
date+0.008
 Emmy+0.008
P+0.008
 such+0.008
Neg logit contributions
 stool-0.005
 log-0.005
 microwave-0.005
 tray-0.005
 oven-0.005
Token gave
Token ID2921
Feature activation+1.22e-03

Pos logit contributions
over+0.038
P+0.037
liest+0.033
amed+0.033
Sus+0.033
Neg logit contributions
 stool-0.058
 tray-0.056
 log-0.056
 chair-0.054
 microwave-0.054
Token them
Token ID606
Feature activation+4.71e-03

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 oven+0.002
 microwave+0.002
Neg logit contributions
over-0.002
P-0.002
lyss-0.002
hered-0.002
lish-0.002
Token a
Token ID257
Feature activation+1.78e-02

Pos logit contributions
 tray+0.008
 stool+0.007
 log+0.007
 fork+0.007
 oven+0.007
Neg logit contributions
over-0.008
P-0.007
Sus-0.007
date-0.007
Ph-0.006
Token brand
Token ID4508
Feature activation+7.11e-02

Pos logit contributions
 stool+0.024
 log+0.023
 Deal+0.023
ses+0.023
 thinks+0.023
Neg logit contributions
P-0.030
lyss-0.028
over-0.027
hered-0.027
 Emmy-0.027
Token new
Token ID649
Feature activation-2.02e-02

Pos logit contributions
 stool+0.106
 tray+0.101
 log+0.099
 chair+0.095
 microwave+0.094
Neg logit contributions
P-0.118
over-0.118
liest-0.109
Sus-0.107
lyss-0.106
Token basketball
Token ID9669
Feature activation+3.00e-02

Pos logit contributions
over+0.034
P+0.034
liest+0.031
Sus+0.030
hered+0.030
Neg logit contributions
 stool-0.030
 tray-0.029
 log-0.028
 chair-0.027
 microwave-0.027
Token and
Token ID290
Feature activation+1.55e-03

Pos logit contributions
 stool+0.048
 tray+0.047
 log+0.046
 chair+0.044
 scale+0.044
Neg logit contributions
over-0.048
P-0.047
Sus-0.043
amed-0.042
beh-0.042
Token they
Token ID484
Feature activation-5.35e-03

Pos logit contributions
 stool+0.002
 chair+0.002
 log+0.002
 microwave+0.002
 tray+0.002
Neg logit contributions
over-0.003
P-0.003
hered-0.003
lish-0.003
date-0.003
Token played
Token ID2826
Feature activation-4.28e-03

Pos logit contributions
over+0.008
P+0.007
liest+0.007
beh+0.007
Sus+0.007
Neg logit contributions
 stool-0.009
 tray-0.009
 log-0.009
 chair-0.009
 fork-0.009
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
P+0.011
over+0.011
liest+0.010
Sus+0.010
amed+0.010
Neg logit contributions
 stool-0.014
 tray-0.013
 log-0.013
 chair-0.013
 scale-0.013
Token "
Token ID366
Feature activation+9.39e-03

Pos logit contributions
 tray+0.030
 stool+0.030
 log+0.030
 oven+0.029
 fork+0.028
Neg logit contributions
hered-0.031
P-0.031
over-0.031
ed-0.030
lyss-0.029
TokenW
Token ID54
Feature activation-5.39e-03

Pos logit contributions
 stool+0.017
 log+0.017
 tray+0.017
 thinks+0.016
 snowball+0.016
Neg logit contributions
over-0.011
 Emmy-0.011
P-0.011
date-0.010
Sus-0.010
Tokenoof
Token ID37711
Feature activation+3.65e-02

Pos logit contributions
over+0.009
P+0.009
liest+0.008
hered+0.008
beh+0.008
Neg logit contributions
 stool-0.009
 tray-0.008
 log-0.008
 chair-0.008
 scale-0.008
Token,
Token ID11
Feature activation+4.54e-02

Pos logit contributions
 stool+0.069
 tray+0.067
 log+0.067
 chair+0.064
 microwave+0.064
Neg logit contributions
over-0.046
P-0.046
liest-0.040
hered-0.040
Sus-0.040
Token wo
Token ID24486
Feature activation+7.11e-02

Pos logit contributions
 stool+0.042
 tray+0.038
 microwave+0.037
 oven+0.036
 log+0.035
Neg logit contributions
over-0.099
P-0.099
hered-0.094
 such-0.094
lyss-0.093
Tokenof
Token ID1659
Feature activation+2.78e-02

Pos logit contributions
 stool+0.126
 tray+0.120
 log+0.120
 scale+0.116
 chair+0.115
Neg logit contributions
P-0.105
over-0.102
hered-0.092
liest-0.090
date-0.089
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 stool+0.052
 tray+0.049
 log+0.049
 microwave+0.048
 chair+0.047
Neg logit contributions
over-0.038
P-0.037
liest-0.033
ed-0.033
hered-0.033
Token You
Token ID921
Feature activation+1.50e-02

Pos logit contributions
 stool+0.022
 tray+0.022
 log+0.021
 chair+0.020
 microwave+0.020
Neg logit contributions
over-0.023
P-0.023
liest-0.021
Sus-0.021
amed-0.021
Token are
Token ID389
Feature activation-1.16e-03

Pos logit contributions
 stool+0.020
 tray+0.019
 log+0.018
 chair+0.017
 microwave+0.017
Neg logit contributions
over-0.028
P-0.028
liest-0.026
hered-0.025
Sus-0.025
Token welcome
Token ID7062
Feature activation+9.93e-05

Pos logit contributions
over+0.002
P+0.002
Sus+0.002
liest+0.002
date+0.002
Neg logit contributions
 stool-0.002
 tray-0.002
 log-0.002
 scale-0.002
 chair-0.002
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 stool+0.001
 log+0.001
 tray+0.001
 chair+0.001
 scale+0.001
Neg logit contributions
over-0.001
P-0.001
amed-0.001
beh-0.001
date-0.001
Token "
Token ID366
Feature activation+5.44e-03

Pos logit contributions
 tray+0.019
 stool+0.018
 log+0.018
 fort+0.018
 fork+0.018
Neg logit contributions
ed-0.015
P-0.015
hered-0.015
over-0.015
 such-0.014
TokenW
Token ID54
Feature activation-7.65e-03

Pos logit contributions
 stool+0.010
 log+0.010
 tray+0.010
 thinks+0.009
 microwave+0.009
Neg logit contributions
over-0.007
P-0.006
 Emmy-0.006
date-0.006
amed-0.006
Tokenoof
Token ID37711
Feature activation+3.66e-02

Pos logit contributions
over+0.013
P+0.013
hered+0.012
liest+0.012
beh+0.011
Neg logit contributions
 stool-0.012
 tray-0.011
 log-0.011
 chair-0.011
 microwave-0.010
Token,
Token ID11
Feature activation+4.55e-02

Pos logit contributions
 stool+0.069
 tray+0.066
 log+0.066
 microwave+0.064
 chair+0.064
Neg logit contributions
over-0.047
P-0.047
hered-0.041
liest-0.041
ed-0.040
Token wo
Token ID24486
Feature activation+7.09e-02

Pos logit contributions
 stool+0.041
 tray+0.037
 microwave+0.036
 log+0.035
 oven+0.034
Neg logit contributions
over-0.100
P-0.100
hered-0.096
 such-0.095
lyss-0.093
Tokenof
Token ID1659
Feature activation+2.41e-02

Pos logit contributions
 stool+0.130
 tray+0.125
 log+0.123
 scale+0.120
 chair+0.120
Neg logit contributions
P-0.102
over-0.097
hered-0.088
liest-0.085
date-0.084
Token,
Token ID11
Feature activation+3.52e-02

Pos logit contributions
 stool+0.048
 tray+0.045
 microwave+0.044
 log+0.044
 chair+0.043
Neg logit contributions
over-0.031
P-0.030
ed-0.027
liest-0.027
hered-0.027
Token my
Token ID616
Feature activation-3.13e-02

Pos logit contributions
 stool+0.034
 tray+0.031
 microwave+0.029
 log+0.029
 oven+0.028
Neg logit contributions
over-0.075
P-0.075
hered-0.072
 such-0.072
lyss-0.070
Token name
Token ID1438
Feature activation+2.32e-02

Pos logit contributions
over+0.067
P+0.062
ed+0.061
hered+0.060
 such+0.059
Neg logit contributions
 stool-0.035
 tray-0.031
 microwave-0.030
 log-0.030
 oven-0.030
Token is
Token ID318
Feature activation+1.44e-02

Pos logit contributions
 stool+0.035
 log+0.032
 tray+0.032
 microwave+0.031
 chair+0.030
Neg logit contributions
over-0.040
P-0.039
ed-0.037
liest-0.036
hered-0.036
Token said
Token ID531
Feature activation-2.81e-03

Pos logit contributions
 log+0.021
 stool+0.020
 thinks+0.020
 chair+0.020
 scale+0.020
Neg logit contributions
over-0.030
P-0.027
date-0.027
liest-0.027
amed-0.025
Token,
Token ID11
Feature activation+1.04e-02

Pos logit contributions
over+0.005
P+0.005
amed+0.005
aming+0.005
date+0.005
Neg logit contributions
 stool-0.004
 log-0.004
 scale-0.004
 chair-0.004
 tray-0.004
Token "
Token ID366
Feature activation-2.39e-03

Pos logit contributions
 tray+0.017
 stool+0.016
 oven+0.016
 log+0.016
 fork+0.016
Neg logit contributions
over-0.014
 Emmy-0.014
ed-0.014
Sus-0.014
hered-0.014
TokenW
Token ID54
Feature activation-8.61e-03

Pos logit contributions
over+0.003
P+0.002
 such+0.002
amed+0.002
liest+0.002
Neg logit contributions
 stool-0.005
 log-0.005
 tray-0.005
 microwave-0.005
 snowball-0.005
Tokenoof
Token ID37711
Feature activation+3.12e-02

Pos logit contributions
over+0.015
P+0.014
liest+0.013
hered+0.013
beh+0.013
Neg logit contributions
 stool-0.013
 tray-0.012
 log-0.012
 chair-0.012
 microwave-0.012
Token wo
Token ID24486
Feature activation+7.02e-02

Pos logit contributions
 stool+0.050
 tray+0.049
 log+0.049
 chair+0.046
 fork+0.046
Neg logit contributions
P-0.048
over-0.048
amed-0.043
Sus-0.043
liest-0.043
Tokenof
Token ID1659
Feature activation+2.70e-02

Pos logit contributions
 stool+0.126
 tray+0.119
 log+0.118
 chair+0.115
 scale+0.114
Neg logit contributions
P-0.103
over-0.100
hered-0.092
liest-0.089
date-0.089
Token!"
Token ID2474
Feature activation+5.64e-03

Pos logit contributions
 stool+0.048
 tray+0.046
 log+0.045
 microwave+0.044
 chair+0.044
Neg logit contributions
over-0.039
P-0.038
liest-0.034
hered-0.034
ed-0.034
Token The
Token ID383
Feature activation+2.21e-02

Pos logit contributions
 stool+0.007
 log+0.007
 chair+0.007
 microwave+0.007
 tray+0.007
Neg logit contributions
P-0.011
over-0.010
beh-0.010
amed-0.010
liest-0.010
Token dog
Token ID3290
Feature activation+7.87e-05

Pos logit contributions
 stool+0.031
 tray+0.029
 oven+0.029
 Deal+0.029
 microwave+0.028
Neg logit contributions
over-0.038
P-0.038
hered-0.034
ed-0.033
date-0.033
Token bark
Token ID21405
Feature activation+8.50e-03

Pos logit contributions
 stool+0.000
 tray+0.000
 log+0.000
 chair+0.000
 microwave+0.000
Neg logit contributions
over-0.000
P-0.000
liest-0.000
hered-0.000
Sus-0.000
Token speak
Token ID2740
Feature activation-1.79e-02

Pos logit contributions
 stool+0.006
 tray+0.006
 log+0.006
 chair+0.006
 microwave+0.006
Neg logit contributions
over-0.009
P-0.009
liest-0.008
hered-0.008
Sus-0.008
Token!
Token ID0
Feature activation+6.94e-03

Pos logit contributions
P+0.030
over+0.028
hered+0.026
date+0.025
Sus+0.025
Neg logit contributions
 stool-0.028
 tray-0.027
 log-0.027
 chair-0.026
 thinks-0.026
Token "
Token ID366
Feature activation-8.94e-03

Pos logit contributions
 stool+0.010
 tray+0.010
 log+0.010
 chair+0.010
 microwave+0.009
Neg logit contributions
P-0.012
over-0.012
hered-0.010
amed-0.010
liest-0.010
TokenW
Token ID54
Feature activation-7.15e-03

Pos logit contributions
over+0.013
ed+0.011
 such+0.011
liest+0.011
 Emmy+0.011
Neg logit contributions
 thinks-0.014
 Deal-0.014
 stool-0.013
 snowball-0.013
 log-0.013
Tokenoof
Token ID37711
Feature activation+2.33e-02

Pos logit contributions
over+0.012
P+0.012
liest+0.011
hered+0.011
Sus+0.011
Neg logit contributions
 stool-0.010
 tray-0.010
 log-0.010
 chair-0.009
 microwave-0.009
Token wo
Token ID24486
Feature activation+7.01e-02

Pos logit contributions
 stool+0.034
 tray+0.033
 log+0.032
 chair+0.031
 oven+0.030
Neg logit contributions
P-0.040
over-0.039
amed-0.036
date-0.036
Sus-0.036
Tokenof
Token ID1659
Feature activation+2.26e-02

Pos logit contributions
 stool+0.123
 tray+0.117
 log+0.116
 chair+0.113
 scale+0.112
Neg logit contributions
P-0.103
over-0.102
hered-0.092
liest-0.091
Sus-0.090
Token!"
Token ID2474
Feature activation+4.86e-04

Pos logit contributions
 stool+0.035
 tray+0.034
 log+0.033
 chair+0.032
 scale+0.032
Neg logit contributions
P-0.036
over-0.036
Sus-0.033
date-0.033
amed-0.033
Token it
Token ID340
Feature activation-2.91e-04

Pos logit contributions
 stool+0.001
 chair+0.001
 log+0.001
 tray+0.001
 microwave+0.001
Neg logit contributions
P-0.001
over-0.001
amed-0.001
date-0.001
aming-0.001
Token said
Token ID531
Feature activation-1.95e-03

Pos logit contributions
over+0.001
P+0.000
date+0.000
Sus+0.000
liest+0.000
Neg logit contributions
 log-0.000
 stool-0.000
 thinks-0.000
 chair-0.000
 scale-0.000
Token.
Token ID13
Feature activation+5.81e-03

Pos logit contributions
P+0.004
amed+0.003
beh+0.003
aming+0.003
Em+0.003
Neg logit contributions
 stool-0.003
 tray-0.002
 chair-0.002
 scale-0.002
 ladder-0.002
Token\n
Token ID198
Feature activation-9.59e-03

Pos logit contributions
 stool+0.003
 log+0.003
 tray+0.003
 chair+0.003
 microwave+0.003
Neg logit contributions
over-0.004
P-0.004
liest-0.004
date-0.004
hered-0.004
Token"
Token ID1
Feature activation-2.57e-02

Pos logit contributions
 Emmy+0.019
over+0.019
liest+0.019
beh+0.019
Sus+0.019
Neg logit contributions
 stool-0.010
 log-0.010
 tray-0.009
 chair-0.009
 scale-0.009
TokenW
Token ID54
Feature activation-5.02e-03

Pos logit contributions
over+0.037
P+0.034
liest+0.033
 such+0.033
 Emmy+0.032
Neg logit contributions
 stool-0.045
 tray-0.043
 log-0.043
 chair-0.042
 microwave-0.041
Tokenoof
Token ID37711
Feature activation+2.42e-02

Pos logit contributions
over+0.008
P+0.008
liest+0.007
hered+0.007
beh+0.007
Neg logit contributions
 stool-0.008
 tray-0.008
 chair-0.007
 microwave-0.007
 log-0.007
Token,
Token ID11
Feature activation+4.06e-02

Pos logit contributions
 stool+0.043
 tray+0.042
 log+0.042
 chair+0.040
 microwave+0.040
Neg logit contributions
over-0.033
P-0.033
liest-0.029
Sus-0.029
hered-0.029
Token wo
Token ID24486
Feature activation+6.95e-02

Pos logit contributions
 stool+0.039
 microwave+0.035
 tray+0.035
 Deal+0.033
 log+0.032
Neg logit contributions
over-0.088
P-0.087
hered-0.083
 such-0.083
ed-0.081
Tokenof
Token ID1659
Feature activation+2.08e-02

Pos logit contributions
 stool+0.130
 tray+0.124
 log+0.121
 scale+0.120
 microwave+0.119
Neg logit contributions
P-0.097
over-0.093
hered-0.085
liest-0.081
date-0.080
Token!"
Token ID2474
Feature activation-2.11e-03

Pos logit contributions
 stool+0.038
 tray+0.037
 log+0.037
 chair+0.035
 microwave+0.035
Neg logit contributions
over-0.028
P-0.027
liest-0.024
hered-0.024
ed-0.024
Token Spot
Token ID15899
Feature activation-1.88e-03

Pos logit contributions
amed+0.005
beh+0.005
over+0.005
P+0.004
liest+0.004
Neg logit contributions
 stool-0.002
 tray-0.002
 snowball-0.002
 chair-0.002
 microwave-0.002
Token bark
Token ID21405
Feature activation-1.20e-03

Pos logit contributions
over+0.003
P+0.003
date+0.003
liest+0.003
Sus+0.003
Neg logit contributions
 stool-0.003
 log-0.003
 tray-0.002
 thinks-0.002
 scale-0.002
Tokened
Token ID276
Feature activation+1.50e-02

Pos logit contributions
date+0.001
lyss+0.001
Em+0.001
 Emmy+0.001
P+0.001
Neg logit contributions
 thinks-0.002
 snowball-0.002
 bar-0.002
 log-0.002
ses-0.002
Token on
Token ID319
Feature activation+4.46e-03

Pos logit contributions
P+0.023
over+0.023
beh+0.021
lyss+0.020
hered+0.020
Neg logit contributions
 stool-0.028
 tray-0.027
 log-0.026
 chair-0.026
 fork-0.025
Token her
Token ID607
Feature activation-2.85e-03

Pos logit contributions
 stool+0.009
 log+0.008
 tray+0.008
 Deal+0.008
 microwave+0.008
Neg logit contributions
over-0.005
P-0.005
 such-0.005
liest-0.004
 Emmy-0.004
Token mom
Token ID1995
Feature activation-2.34e-03

Pos logit contributions
P+0.005
over+0.005
amed+0.004
 such+0.004
 Emmy+0.004
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.004
 microwave-0.004
 snowball-0.004
Tokenmy
Token ID1820
Feature activation+1.77e-02

Pos logit contributions
over+0.003
P+0.003
Sus+0.003
liest+0.003
amed+0.003
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 scale-0.004
Token's
Token ID338
Feature activation+8.24e-03

Pos logit contributions
 stool+0.037
 tray+0.036
 log+0.036
 microwave+0.034
 chair+0.034
Neg logit contributions
over-0.019
P-0.019
liest-0.016
hered-0.016
date-0.016
Token brand
Token ID4508
Feature activation+6.93e-02

Pos logit contributions
 stool+0.011
 log+0.011
 tray+0.011
 microwave+0.011
 snowball+0.011
Neg logit contributions
over-0.014
P-0.014
amed-0.013
 such-0.012
date-0.012
Token new
Token ID649
Feature activation-1.80e-02

Pos logit contributions
 stool+0.115
 tray+0.109
 log+0.108
ses+0.107
 Deal+0.107
Neg logit contributions
P-0.105
over-0.098
liest-0.095
Sus-0.094
 such-0.093
Token table
Token ID3084
Feature activation+7.56e-03

Pos logit contributions
over+0.033
P+0.033
liest+0.030
hered+0.030
Sus+0.030
Neg logit contributions
 stool-0.024
 tray-0.023
 log-0.023
 chair-0.022
 microwave-0.021
Token!
Token ID0
Feature activation-8.03e-03

Pos logit contributions
 stool+0.013
 tray+0.012
 log+0.012
 chair+0.012
 oven+0.011
Neg logit contributions
P-0.011
over-0.011
Sus-0.010
hered-0.010
liest-0.010
Token\n
Token ID198
Feature activation-1.19e-02

Pos logit contributions
over+0.010
P+0.009
Sus+0.009
ed+0.009
 Lilly+0.009
Neg logit contributions
 stool-0.016
 tray-0.015
 scale-0.015
 log-0.015
 tractor-0.015
Token\n
Token ID198
Feature activation-1.92e-02

Pos logit contributions
P+0.014
over+0.013
Sus+0.012
Em+0.012
B+0.011
Neg logit contributions
 stool-0.026
ses-0.025
 tray-0.025
 oven-0.024
 scale-0.024
Token that
Token ID326
Feature activation+1.72e-02

Pos logit contributions
 stool+0.005
 tray+0.005
 log+0.005
 chair+0.005
 microwave+0.005
Neg logit contributions
over-0.004
P-0.004
liest-0.004
hered-0.004
Sus-0.004
Token goes
Token ID2925
Feature activation-2.09e-02

Pos logit contributions
 stool+0.023
 tray+0.022
 log+0.022
 chair+0.021
 microwave+0.020
Neg logit contributions
over-0.032
P-0.031
Sus-0.029
amed-0.029
liest-0.029
Token "
Token ID366
Feature activation+8.61e-03

Pos logit contributions
P+0.033
over+0.032
liest+0.030
Sus+0.030
amed+0.029
Neg logit contributions
 stool-0.033
 tray-0.031
 log-0.030
 chair-0.030
 scale-0.030
Tokenwo
Token ID21638
Feature activation+1.02e-02

Pos logit contributions
 stool+0.015
 tray+0.014
 chair+0.014
 microwave+0.014
 thinks+0.014
Neg logit contributions
P-0.011
over-0.011
 Emmy-0.010
lyss-0.010
amed-0.010
Tokenof
Token ID1659
Feature activation+2.45e-02

Pos logit contributions
 stool+0.013
 tray+0.013
 log+0.013
 thinks+0.012
 oven+0.012
Neg logit contributions
P-0.017
over-0.017
Sus-0.017
amed-0.017
 Emmy-0.016
Token wo
Token ID24486
Feature activation+6.91e-02

Pos logit contributions
 stool+0.037
 tray+0.035
 log+0.035
 chair+0.033
 oven+0.033
Neg logit contributions
over-0.040
P-0.040
Sus-0.037
amed-0.036
date-0.036
Tokenof
Token ID1659
Feature activation+2.45e-02

Pos logit contributions
 stool+0.118
 log+0.114
 tray+0.113
 scale+0.109
 chair+0.109
Neg logit contributions
P-0.104
over-0.103
hered-0.091
liest-0.091
Sus-0.090
Token wo
Token ID24486
Feature activation+6.89e-02

Pos logit contributions
 stool+0.037
 tray+0.036
 log+0.035
 chair+0.033
 oven+0.033
Neg logit contributions
P-0.040
over-0.040
Sus-0.037
amed-0.036
date-0.036
Tokenof
Token ID1659
Feature activation+2.48e-02

Pos logit contributions
 stool+0.119
 log+0.115
 tray+0.114
 scale+0.110
 chair+0.109
Neg logit contributions
P-0.103
over-0.102
hered-0.090
liest-0.089
date-0.089
Token".
Token ID1911
Feature activation-1.15e-02

Pos logit contributions
 stool+0.037
 tray+0.036
 log+0.035
 chair+0.034
 oven+0.034
Neg logit contributions
P-0.040
over-0.040
Sus-0.037
amed-0.037
date-0.036
Token\n
Token ID198
Feature activation-5.61e-04

Pos logit contributions
over+0.019
P+0.019
hered+0.017
Sus+0.017
 such+0.017
Neg logit contributions
 stool-0.017
 log-0.017
 tray-0.017
 scale-0.016
 fork-0.016
Token.
Token ID13
Feature activation+2.73e-04

Pos logit contributions
 stool+0.029
 tray+0.028
 log+0.028
 Deal+0.027
 microwave+0.027
Neg logit contributions
over-0.019
P-0.018
hered-0.017
ed-0.016
liest-0.016
Token "
Token ID366
Feature activation-5.07e-04

Pos logit contributions
 stool+0.000
 log+0.000
 tray+0.000
 scale+0.000
 oven+0.000
Neg logit contributions
P-0.000
over-0.000
ed-0.000
hered-0.000
lyss-0.000
TokenW
Token ID54
Feature activation-9.15e-03

Pos logit contributions
P+0.001
over+0.001
Sus+0.000
amed+0.000
hered+0.000
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 oven-0.001
 chair-0.001
Tokenoof
Token ID37711
Feature activation+3.11e-02

Pos logit contributions
over+0.015
P+0.015
liest+0.013
hered+0.013
amed+0.013
Neg logit contributions
 stool-0.015
 tray-0.014
 log-0.014
 chair-0.013
 scale-0.013
Token,
Token ID11
Feature activation+3.98e-02

Pos logit contributions
 stool+0.048
 tray+0.047
 log+0.046
 chair+0.044
 microwave+0.044
Neg logit contributions
over-0.050
P-0.050
liest-0.045
Sus-0.045
amed-0.044
Token wo
Token ID24486
Feature activation+6.89e-02

Pos logit contributions
 stool+0.036
 tray+0.033
 log+0.031
 microwave+0.031
 oven+0.029
Neg logit contributions
P-0.087
over-0.086
hered-0.083
 such-0.082
lyss-0.081
Tokenof
Token ID1659
Feature activation+2.30e-02

Pos logit contributions
 stool+0.125
 tray+0.120
 log+0.120
 scale+0.116
 chair+0.115
Neg logit contributions
P-0.099
over-0.095
hered-0.086
liest-0.084
date-0.083
Token!"
Token ID2474
Feature activation+2.21e-03

Pos logit contributions
 stool+0.036
 tray+0.034
 log+0.034
 microwave+0.033
 chair+0.032
Neg logit contributions
over-0.038
P-0.037
liest-0.034
hered-0.034
ed-0.033
Token\n
Token ID198
Feature activation+5.39e-03

Pos logit contributions
 stool+0.003
 tray+0.003
 log+0.003
 scale+0.003
 oven+0.003
Neg logit contributions
over-0.004
P-0.004
hered-0.003
ed-0.003
 such-0.003
Token\n
Token ID198
Feature activation-6.73e-03

Pos logit contributions
 stool+0.010
 tray+0.010
 log+0.010
 oven+0.009
 scale+0.009
Neg logit contributions
P-0.007
over-0.007
Sus-0.006
ed-0.006
Ph-0.006
Token"
Token ID1
Feature activation-2.44e-02

Pos logit contributions
over+0.011
P+0.011
liest+0.010
hered+0.010
Sus+0.010
Neg logit contributions
 stool-0.011
 tray-0.010
 log-0.010
 chair-0.010
 microwave-0.009
Token "
Token ID366
Feature activation+8.61e-03

Pos logit contributions
P+0.033
over+0.032
liest+0.030
Sus+0.030
amed+0.029
Neg logit contributions
 stool-0.033
 tray-0.031
 log-0.030
 chair-0.030
 scale-0.030
Tokenwo
Token ID21638
Feature activation+1.02e-02

Pos logit contributions
 stool+0.015
 tray+0.014
 chair+0.014
 microwave+0.014
 thinks+0.014
Neg logit contributions
P-0.011
over-0.011
 Emmy-0.010
lyss-0.010
amed-0.010
Tokenof
Token ID1659
Feature activation+2.45e-02

Pos logit contributions
 stool+0.013
 tray+0.013
 log+0.013
 thinks+0.012
 oven+0.012
Neg logit contributions
P-0.017
over-0.017
Sus-0.017
amed-0.017
 Emmy-0.016
Token wo
Token ID24486
Feature activation+6.91e-02

Pos logit contributions
 stool+0.037
 tray+0.035
 log+0.035
 chair+0.033
 oven+0.033
Neg logit contributions
over-0.040
P-0.040
Sus-0.037
amed-0.036
date-0.036
Tokenof
Token ID1659
Feature activation+2.45e-02

Pos logit contributions
 stool+0.118
 log+0.114
 tray+0.113
 scale+0.109
 chair+0.109
Neg logit contributions
P-0.104
over-0.103
hered-0.091
liest-0.091
Sus-0.090
Token wo
Token ID24486
Feature activation+6.89e-02

Pos logit contributions
 stool+0.037
 tray+0.036
 log+0.035
 chair+0.033
 oven+0.033
Neg logit contributions
P-0.040
over-0.040
Sus-0.037
amed-0.036
date-0.036
Tokenof
Token ID1659
Feature activation+2.48e-02

Pos logit contributions
 stool+0.119
 log+0.115
 tray+0.114
 scale+0.110
 chair+0.109
Neg logit contributions
P-0.103
over-0.102
hered-0.090
liest-0.089
date-0.089
Token".
Token ID1911
Feature activation-1.15e-02

Pos logit contributions
 stool+0.037
 tray+0.036
 log+0.035
 chair+0.034
 oven+0.034
Neg logit contributions
P-0.040
over-0.040
Sus-0.037
amed-0.037
date-0.036
Token\n
Token ID198
Feature activation-5.61e-04

Pos logit contributions
over+0.019
P+0.019
hered+0.017
Sus+0.017
 such+0.017
Neg logit contributions
 stool-0.017
 log-0.017
 tray-0.017
 scale-0.016
 fork-0.016
Token\n
Token ID198
Feature activation-1.21e-02

Pos logit contributions
P+0.001
over+0.001
Sus+0.001
Ph+0.001
Em+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 fork-0.001
 scale-0.001
Token"
Token ID1
Feature activation-2.52e-02

Pos logit contributions
over+0.020
P+0.020
liest+0.018
hered+0.018
Sus+0.018
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 chair-0.017
 microwave-0.017
Token,
Token ID11
Feature activation+7.91e-03

Pos logit contributions
over+0.009
P+0.008
amed+0.008
aming+0.008
date+0.008
Neg logit contributions
 stool-0.008
 log-0.008
 chair-0.007
 scale-0.007
 tray-0.007
Token "
Token ID366
Feature activation-3.29e-03

Pos logit contributions
 tray+0.012
 stool+0.011
 log+0.011
 oven+0.011
 fort+0.011
Neg logit contributions
ed-0.012
P-0.012
over-0.012
hered-0.011
lyss-0.011
TokenW
Token ID54
Feature activation-6.95e-03

Pos logit contributions
over+0.004
 Emmy+0.004
date+0.004
amed+0.004
 such+0.004
Neg logit contributions
 thinks-0.006
 log-0.006
 stool-0.006
 snowball-0.006
 microwave-0.006
Tokenoof
Token ID37711
Feature activation+3.43e-02

Pos logit contributions
over+0.011
P+0.010
liest+0.009
hered+0.009
beh+0.009
Neg logit contributions
 stool-0.012
 tray-0.012
 chair-0.011
 log-0.011
 microwave-0.011
Token,
Token ID11
Feature activation+4.53e-02

Pos logit contributions
 stool+0.065
 tray+0.062
 log+0.062
 microwave+0.060
 chair+0.060
Neg logit contributions
over-0.044
P-0.044
hered-0.039
liest-0.038
ed-0.038
Token wo
Token ID24486
Feature activation+6.87e-02

Pos logit contributions
 stool+0.046
 microwave+0.042
 tray+0.040
 Deal+0.040
 fork+0.039
Neg logit contributions
over-0.092
P-0.092
hered-0.089
 such-0.088
lyss-0.088
Tokenof
Token ID1659
Feature activation+2.31e-02

Pos logit contributions
 stool+0.127
 tray+0.121
 log+0.120
 scale+0.117
 fork+0.116
Neg logit contributions
P-0.097
over-0.093
hered-0.085
date-0.082
liest-0.081
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
 stool+0.047
 tray+0.044
 log+0.044
 microwave+0.043
 chair+0.042
Neg logit contributions
over-0.029
P-0.027
ed-0.025
liest-0.024
hered-0.024
Token Tom
Token ID4186
Feature activation+3.94e-03

Pos logit contributions
 stool+0.036
 tray+0.033
 microwave+0.032
 Deal+0.032
 fork+0.031
Neg logit contributions
P-0.074
over-0.073
 such-0.072
lyss-0.072
hered-0.071
Token!
Token ID0
Feature activation+2.66e-02

Pos logit contributions
 stool+0.007
 tray+0.007
 log+0.007
 microwave+0.007
 chair+0.007
Neg logit contributions
over-0.005
P-0.005
liest-0.004
hered-0.004
 such-0.004
Token Can
Token ID1680
Feature activation+9.73e-03

Pos logit contributions
 stool+0.043
 tray+0.042
 log+0.041
 chair+0.039
 scale+0.039
Neg logit contributions
over-0.042
P-0.041
liest-0.037
hered-0.037
ed-0.036
Token and
Token ID290
Feature activation-7.47e-03

Pos logit contributions
over+0.038
P+0.037
ed+0.033
amed+0.033
Sus+0.033
Neg logit contributions
 stool-0.054
 tray-0.053
 log-0.053
 microwave-0.050
 scale-0.050
Token she
Token ID673
Feature activation-1.04e-02

Pos logit contributions
over+0.014
P+0.014
amed+0.012
liest+0.012
lish+0.012
Neg logit contributions
 stool-0.011
 chair-0.010
 log-0.010
 tray-0.010
 microwave-0.010
Token gave
Token ID2921
Feature activation-6.58e-03

Pos logit contributions
over+0.014
P+0.014
liest+0.012
hered+0.012
Sus+0.012
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 chair-0.017
 microwave-0.017
Token Teddy
Token ID29345
Feature activation+2.51e-02

Pos logit contributions
over+0.011
P+0.011
amed+0.010
lyss+0.010
Sus+0.010
Neg logit contributions
 chair-0.009
 stool-0.009
 fork-0.009
 microwave-0.009
 tray-0.008
Token a
Token ID257
Feature activation+1.41e-02

Pos logit contributions
 stool+0.043
 tray+0.042
 log+0.041
 chair+0.040
 fork+0.039
Neg logit contributions
over-0.036
P-0.036
date-0.032
hered-0.032
Sus-0.032
Token brand
Token ID4508
Feature activation+6.82e-02

Pos logit contributions
 stool+0.020
 log+0.019
 tray+0.019
ses+0.018
 thinks+0.018
Neg logit contributions
P-0.024
over-0.022
lyss-0.022
hered-0.021
 Emmy-0.021
Token new
Token ID649
Feature activation-1.85e-02

Pos logit contributions
 stool+0.097
 tray+0.092
 log+0.091
 chair+0.087
 microwave+0.086
Neg logit contributions
over-0.119
P-0.118
liest-0.107
hered-0.107
Sus-0.107
Token sc
Token ID629
Feature activation+3.07e-02

Pos logit contributions
over+0.035
P+0.034
liest+0.031
hered+0.031
amed+0.031
Neg logit contributions
 stool-0.024
 tray-0.023
 log-0.023
 chair-0.022
 scale-0.022
Tokenooter
Token ID25141
Feature activation+1.04e-02

Pos logit contributions
 stool+0.053
 tray+0.051
 log+0.051
 chair+0.048
 microwave+0.048
Neg logit contributions
over-0.045
P-0.044
liest-0.039
Sus-0.039
amed-0.039
Token.
Token ID13
Feature activation-9.25e-03

Pos logit contributions
 stool+0.018
 tray+0.017
 chair+0.017
 log+0.016
 scale+0.016
Neg logit contributions
P-0.016
over-0.015
Sus-0.014
lyss-0.014
amed-0.014
Token He
Token ID679
Feature activation+6.94e-03

Pos logit contributions
over+0.012
P+0.011
 Emmy+0.010
liest+0.010
Sus+0.010
Neg logit contributions
 stool-0.018
 tray-0.017
 log-0.017
 oven-0.016
 scale-0.016
Token,
Token ID11
Feature activation+1.60e-02

Pos logit contributions
over+0.001
P+0.001
amed+0.001
date+0.001
beh+0.001
Neg logit contributions
 stool-0.001
 log-0.001
 tray-0.001
 chair-0.001
 scale-0.001
Token "
Token ID366
Feature activation+7.34e-03

Pos logit contributions
 tray+0.023
 stool+0.022
 log+0.021
 oven+0.021
 fork+0.021
Neg logit contributions
P-0.025
over-0.025
ed-0.025
hered-0.024
lyss-0.024
TokenW
Token ID54
Feature activation-9.08e-03

Pos logit contributions
 stool+0.013
 log+0.013
 thinks+0.013
 microwave+0.012
 snowball+0.012
Neg logit contributions
over-0.010
amed-0.009
 Emmy-0.009
 such-0.009
date-0.009
Tokenoof
Token ID37711
Feature activation+3.38e-02

Pos logit contributions
over+0.016
P+0.015
liest+0.014
hered+0.014
Sus+0.014
Neg logit contributions
 stool-0.014
 tray-0.013
 log-0.013
 chair-0.012
 scale-0.012
Token,
Token ID11
Feature activation+4.33e-02

Pos logit contributions
 stool+0.066
 tray+0.064
 log+0.064
 chair+0.061
 microwave+0.061
Neg logit contributions
over-0.041
P-0.041
liest-0.035
hered-0.035
Sus-0.035
Token wo
Token ID24486
Feature activation+6.81e-02

Pos logit contributions
 stool+0.034
 tray+0.031
 log+0.029
 microwave+0.028
 oven+0.027
Neg logit contributions
over-0.101
P-0.101
lyss-0.095
hered-0.095
 such-0.095
Tokenof
Token ID1659
Feature activation+2.37e-02

Pos logit contributions
 stool+0.120
 tray+0.115
 log+0.114
 chair+0.110
 scale+0.110
Neg logit contributions
P-0.101
over-0.098
hered-0.088
liest-0.086
date-0.086
Token,
Token ID11
Feature activation+3.55e-02

Pos logit contributions
 stool+0.047
 tray+0.045
 log+0.044
 microwave+0.043
 chair+0.043
Neg logit contributions
over-0.030
P-0.029
liest-0.026
ed-0.025
hered-0.025
Token hello
Token ID23748
Feature activation+3.51e-04

Pos logit contributions
 stool+0.034
 tray+0.031
 log+0.030
 microwave+0.029
 chair+0.028
Neg logit contributions
P-0.077
over-0.077
hered-0.073
lyss-0.072
 such-0.072
Token,
Token ID11
Feature activation+2.72e-02

Pos logit contributions
 stool+0.001
 log+0.001
 tray+0.001
 chair+0.001
 scale+0.001
Neg logit contributions
P-0.001
over-0.001
amed-0.000
Sus-0.000
Em-0.000
Token Tom
Token ID4186
Feature activation+1.19e-02

Pos logit contributions
 stool+0.041
 tray+0.039
 log+0.039
 microwave+0.037
 chair+0.037
Neg logit contributions
over-0.044
P-0.044
hered-0.040
liest-0.040
 such-0.040
Token enormous
Token ID9812
Feature activation+1.80e-02

Pos logit contributions
 stool+0.019
 Deal+0.017
 log+0.017
 microwave+0.017
ccoli+0.017
Neg logit contributions
over-0.015
P-0.015
liest-0.014
 such-0.014
 Emmy-0.013
Token pet
Token ID4273
Feature activation+1.41e-02

Pos logit contributions
 Deal+0.022
 stool+0.021
ws+0.020
 thinks+0.020
 microwave+0.020
Neg logit contributions
over-0.033
P-0.032
date-0.031
lyss-0.029
 Emmy-0.029
Token cat
Token ID3797
Feature activation+2.24e-03

Pos logit contributions
 stool+0.018
 tray+0.018
 log+0.017
 chair+0.016
 microwave+0.016
Neg logit contributions
over-0.026
P-0.026
Sus-0.024
liest-0.024
amed-0.024
Token named
Token ID3706
Feature activation+1.35e-02

Pos logit contributions
 stool+0.004
 tray+0.004
 log+0.004
 chair+0.004
 rug+0.004
Neg logit contributions
over-0.003
P-0.003
amed-0.003
beh-0.003
Sus-0.003
Token C
Token ID327
Feature activation+4.99e-03

Pos logit contributions
 stool+0.022
 tray+0.021
 log+0.021
 chair+0.020
 microwave+0.020
Neg logit contributions
over-0.020
P-0.020
liest-0.018
hered-0.018
Sus-0.018
Tokenudd
Token ID4185
Feature activation+6.80e-02

Pos logit contributions
 stool+0.009
 log+0.009
 tray+0.009
 scale+0.008
 chair+0.008
Neg logit contributions
P-0.007
over-0.007
Sus-0.006
liest-0.006
amed-0.006
Tokenles
Token ID829
Feature activation+1.35e-04

Pos logit contributions
 stool+0.112
 tray+0.108
 log+0.107
 chair+0.103
 scale+0.102
Neg logit contributions
over-0.101
P-0.099
date-0.092
Sus-0.091
amed-0.091
Token.
Token ID13
Feature activation-6.09e-04

Pos logit contributions
 stool+0.000
 tray+0.000
 log+0.000
 chair+0.000
 oven+0.000
Neg logit contributions
over-0.000
P-0.000
Sus-0.000
hered-0.000
amed-0.000
Token She
Token ID1375
Feature activation+5.36e-03

Pos logit contributions
over+0.001
P+0.001
liest+0.001
Sus+0.001
hered+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 chair-0.001
 microwave-0.001
Token loved
Token ID6151
Feature activation+3.13e-02

Pos logit contributions
 stool+0.010
 tray+0.009
 log+0.009
 chair+0.009
 microwave+0.009
Neg logit contributions
over-0.007
P-0.007
liest-0.007
hered-0.006
Sus-0.006
Token C
Token ID327
Feature activation+2.84e-03

Pos logit contributions
 stool+0.066
 tray+0.064
 log+0.062
 Deal+0.062
 microwave+0.062
Neg logit contributions
P-0.033
over-0.032
 such-0.029
Sus-0.027
 Emmy-0.027
Token mom
Token ID1995
Feature activation-3.15e-02

Pos logit contributions
P+0.040
over+0.040
date+0.037
amed+0.037
Sus+0.037
Neg logit contributions
 stool-0.021
 log-0.020
 tray-0.019
 microwave-0.019
 scale-0.018
Tokenmy
Token ID1820
Feature activation-9.32e-03

Pos logit contributions
over+0.048
P+0.045
hered+0.040
liest+0.040
beh+0.040
Neg logit contributions
 stool-0.054
 tray-0.052
 log-0.051
 oven-0.050
 chair-0.050
Token gave
Token ID2921
Feature activation-3.86e-03

Pos logit contributions
over+0.012
P+0.012
liest+0.010
hered+0.010
Sus+0.010
Neg logit contributions
 stool-0.018
 tray-0.017
 log-0.017
 chair-0.016
 microwave-0.016
Token her
Token ID607
Feature activation+5.59e-03

Pos logit contributions
over+0.005
P+0.005
liest+0.004
hered+0.004
Sus+0.004
Neg logit contributions
 stool-0.007
 tray-0.007
 log-0.007
 chair-0.007
 microwave-0.007
Token a
Token ID257
Feature activation+4.28e-03

Pos logit contributions
 stool+0.011
 tray+0.010
 log+0.010
 chair+0.010
 microwave+0.010
Neg logit contributions
over-0.007
P-0.007
liest-0.006
hered-0.006
Sus-0.006
Token brand
Token ID4508
Feature activation+6.78e-02

Pos logit contributions
 log+0.006
 Deal+0.006
 stool+0.006
ses+0.006
 thinks+0.006
Neg logit contributions
P-0.007
lyss-0.006
date-0.006
 Emmy-0.006
over-0.006
Token new
Token ID649
Feature activation-2.26e-02

Pos logit contributions
 stool+0.109
 tray+0.103
 log+0.103
 Deal+0.102
ses+0.100
Neg logit contributions
P-0.103
over-0.098
lyss-0.094
liest-0.094
Sus-0.093
Token pan
Token ID3425
Feature activation-1.97e-03

Pos logit contributions
P+0.037
over+0.037
Sus+0.033
liest+0.033
date+0.033
Neg logit contributions
 stool-0.034
 log-0.033
 tray-0.032
 microwave-0.031
 scale-0.030
Token to
Token ID284
Feature activation+3.04e-05

Pos logit contributions
P+0.003
over+0.003
Sus+0.003
lyss+0.003
beh+0.003
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 chair-0.003
 oven-0.003
Token paint
Token ID7521
Feature activation-6.33e-03

Pos logit contributions
 stool+0.000
 tray+0.000
 log+0.000
 chair+0.000
 oven+0.000
Neg logit contributions
over-0.000
P-0.000
liest-0.000
hered-0.000
amed-0.000
Token with
Token ID351
Feature activation-7.57e-03

Pos logit contributions
P+0.010
over+0.010
hered+0.009
beh+0.009
lyss+0.009
Neg logit contributions
 stool-0.011
 tray-0.010
 oven-0.010
 chair-0.010
 log-0.010
Token wanted
Token ID2227
Feature activation+4.11e-02

Pos logit contributions
 stool+0.037
 tray+0.035
 log+0.035
 chair+0.034
 microwave+0.033
Neg logit contributions
over-0.032
P-0.032
liest-0.028
hered-0.028
Sus-0.028
Token to
Token ID284
Feature activation+1.27e-02

Pos logit contributions
 stool+0.065
 tray+0.063
 log+0.063
 chair+0.061
 oven+0.061
Neg logit contributions
over-0.071
P-0.069
amed-0.064
liest-0.063
date-0.061
Token touch
Token ID3638
Feature activation+2.13e-03

Pos logit contributions
 stool+0.018
 tray+0.017
 log+0.017
 chair+0.016
 scale+0.016
Neg logit contributions
over-0.023
P-0.022
liest-0.021
amed-0.020
Sus-0.020
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
 log+0.003
 stool+0.003
 tray+0.003
 snowball+0.003
 oven+0.003
Neg logit contributions
P-0.003
over-0.003
Em-0.003
amed-0.003
Sus-0.003
Token ost
Token ID23619
Feature activation-2.91e-02

Pos logit contributions
 Deal+0.027
 stool+0.027
 fork+0.026
ccoli+0.025
 microwave+0.025
Neg logit contributions
P-0.021
over-0.020
 met-0.018
 such-0.018
ed-0.018
Tokenric
Token ID1173
Feature activation+6.73e-02

Pos logit contributions
over+0.058
P+0.056
amed+0.053
hered+0.053
beh+0.052
Neg logit contributions
 stool-0.036
 tray-0.033
 chair-0.033
 fork-0.032
 log-0.032
Tokenhes
Token ID956
Feature activation+1.97e-02

Pos logit contributions
 stool+0.119
 log+0.111
 tray+0.111
 Deal+0.109
ses+0.109
Neg logit contributions
P-0.103
over-0.098
ed-0.093
amed-0.092
 such-0.083
Token.
Token ID13
Feature activation-1.78e-03

Pos logit contributions
 stool+0.033
 tray+0.032
 log+0.032
 chair+0.031
 microwave+0.030
Neg logit contributions
P-0.029
over-0.029
Sus-0.026
liest-0.026
amed-0.026
Token They
Token ID1119
Feature activation+1.90e-02

Pos logit contributions
over+0.003
P+0.003
liest+0.003
hered+0.003
beh+0.003
Neg logit contributions
 stool-0.002
 tray-0.002
 log-0.002
 chair-0.002
 microwave-0.002
Token reached
Token ID4251
Feature activation+4.76e-03

Pos logit contributions
 stool+0.034
 tray+0.032
 log+0.032
 chair+0.031
 microwave+0.031
Neg logit contributions
P-0.026
over-0.026
liest-0.023
ed-0.023
hered-0.023
Token out
Token ID503
Feature activation-4.21e-03

Pos logit contributions
 stool+0.011
 Deal+0.010
 microwave+0.010
 scale+0.010
 fork+0.010
Neg logit contributions
over-0.003
 such-0.003
P-0.003
ed-0.003
hered-0.003
Token
Token ID198
Feature activation+1.15e-03

Pos logit contributions
 stool+0.021
 tray+0.021
 log+0.020
 oven+0.019
 scale+0.019
Neg logit contributions
P-0.016
over-0.016
Sus-0.015
Em-0.014
Ph-0.014
Token"
Token ID1
Feature activation-1.90e-02

Pos logit contributions
 stool+0.001
 log+0.001
 tray+0.001
 chair+0.001
 snowball+0.001
Neg logit contributions
over-0.002
liest-0.002
 such-0.002
 Emmy-0.002
ed-0.002
TokenW
Token ID54
Feature activation-5.38e-03

Pos logit contributions
over+0.024
P+0.023
liest+0.021
 such+0.021
 Emmy+0.021
Neg logit contributions
 stool-0.036
 tray-0.035
 log-0.035
 chair-0.033
 microwave-0.033
Tokenoof
Token ID37711
Feature activation+2.89e-02

Pos logit contributions
over+0.009
P+0.008
liest+0.008
hered+0.008
Sus+0.008
Neg logit contributions
 stool-0.009
 tray-0.008
 log-0.008
 chair-0.008
 microwave-0.008
Token,
Token ID11
Feature activation+4.55e-02

Pos logit contributions
 stool+0.052
 tray+0.050
 log+0.050
 chair+0.048
 microwave+0.048
Neg logit contributions
over-0.040
P-0.040
liest-0.035
hered-0.035
Sus-0.035
Token wo
Token ID24486
Feature activation+6.72e-02

Pos logit contributions
 stool+0.039
 tray+0.035
 microwave+0.035
 Deal+0.033
 log+0.033
Neg logit contributions
P-0.102
over-0.101
hered-0.096
 such-0.096
lyss-0.095
Tokenof
Token ID1659
Feature activation+1.95e-02

Pos logit contributions
 stool+0.122
 tray+0.116
 log+0.114
 chair+0.112
 scale+0.111
Neg logit contributions
P-0.097
over-0.092
hered-0.085
date-0.082
liest-0.082
Token!"
Token ID2474
Feature activation+5.12e-03

Pos logit contributions
 stool+0.036
 tray+0.034
 log+0.034
 chair+0.033
 microwave+0.033
Neg logit contributions
over-0.026
P-0.026
liest-0.023
hered-0.023
Sus-0.023
Token Spot
Token ID15899
Feature activation-2.12e-03

Pos logit contributions
 stool+0.006
 chair+0.006
 tray+0.006
 microwave+0.006
 log+0.006
Neg logit contributions
beh-0.010
liest-0.010
over-0.010
amed-0.010
aming-0.010
Token bark
Token ID21405
Feature activation+1.92e-03

Pos logit contributions
over+0.004
P+0.004
liest+0.003
Sus+0.003
date+0.003
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 chair-0.003
 scale-0.003
Tokened
Token ID276
Feature activation+1.61e-02

Pos logit contributions
 thinks+0.004
 snowball+0.004
 tray+0.004
 bar+0.004
 log+0.004
Neg logit contributions
lyss-0.002
P-0.002
date-0.002
 Emmy-0.002
over-0.002
Token should
Token ID815
Feature activation+1.46e-03

Pos logit contributions
 stool+0.023
 log+0.023
 tray+0.023
 thinks+0.022
 fork+0.022
Neg logit contributions
over-0.030
P-0.028
date-0.026
Sus-0.026
hered-0.025
Token not
Token ID407
Feature activation-2.99e-03

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 chair+0.002
 microwave+0.002
Neg logit contributions
over-0.003
P-0.003
Sus-0.002
liest-0.002
hered-0.002
Token tease
Token ID34691
Feature activation-4.53e-03

Pos logit contributions
over+0.006
P+0.006
amed+0.006
ed+0.006
liest+0.006
Neg logit contributions
 log-0.003
 stool-0.003
 tray-0.003
 scale-0.003
 thinks-0.003
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
over+0.008
amed+0.008
Sus+0.007
P+0.007
date+0.007
Neg logit contributions
 thinks-0.007
 log-0.007
 tray-0.006
 stool-0.006
 scale-0.006
Token ost
Token ID23619
Feature activation-1.66e-02

Pos logit contributions
 stool+0.033
 fork+0.033
 Deal+0.032
 tray+0.031
ccoli+0.031
Neg logit contributions
over-0.024
P-0.024
 such-0.021
Ph-0.021
Sus-0.021
Tokenric
Token ID1173
Feature activation+6.71e-02

Pos logit contributions
over+0.034
P+0.032
hered+0.031
beh+0.031
amed+0.031
Neg logit contributions
 stool-0.020
 tray-0.018
 chair-0.018
 fork-0.018
 log-0.018
Tokenhes
Token ID956
Feature activation+1.37e-02

Pos logit contributions
 stool+0.107
 tray+0.100
 log+0.099
 chair+0.096
 fork+0.095
Neg logit contributions
P-0.112
over-0.109
ed-0.101
amed-0.100
 such-0.097
Token.
Token ID13
Feature activation+9.52e-03

Pos logit contributions
 stool+0.023
 tray+0.023
 log+0.022
 chair+0.021
 microwave+0.021
Neg logit contributions
over-0.020
P-0.020
liest-0.018
Sus-0.017
hered-0.017
Token They
Token ID1119
Feature activation+1.60e-02

Pos logit contributions
 stool+0.015
 tray+0.014
 log+0.014
 chair+0.013
 fork+0.013
Neg logit contributions
over-0.016
P-0.015
Sus-0.014
liest-0.014
 such-0.014
Token are
Token ID389
Feature activation-4.47e-03

Pos logit contributions
 stool+0.022
 tray+0.021
 log+0.021
 chair+0.020
 microwave+0.020
Neg logit contributions
over-0.028
P-0.028
liest-0.026
hered-0.026
 such-0.026
Token not
Token ID407
Feature activation-1.34e-02

Pos logit contributions
over+0.008
P+0.008
 such+0.007
liest+0.007
amed+0.007
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 chair-0.006
 microwave-0.006
Token and
Token ID290
Feature activation-4.91e-03

Pos logit contributions
 stool+0.003
 tray+0.003
 log+0.003
 Deal+0.003
 fork+0.003
Neg logit contributions
over-0.003
P-0.003
 such-0.003
ed-0.003
amed-0.003
Token found
Token ID1043
Feature activation-1.49e-03

Pos logit contributions
over+0.009
P+0.009
liest+0.008
beh+0.008
hered+0.007
Neg logit contributions
 stool-0.008
 chair-0.007
 log-0.007
 rug-0.007
 tray-0.007
Token the
Token ID262
Feature activation-3.20e-03

Pos logit contributions
P+0.002
over+0.002
date+0.002
lyss+0.002
beh+0.002
Neg logit contributions
 stool-0.002
 log-0.002
 tray-0.002
 oven-0.002
 chair-0.002
Token coin
Token ID10752
Feature activation+1.59e-03

Pos logit contributions
over+0.006
P+0.006
liest+0.005
Sus+0.005
hered+0.005
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 scale-0.004
Token's
Token ID338
Feature activation-1.15e-02

Pos logit contributions
 stool+0.003
 tray+0.003
 log+0.003
 microwave+0.003
 chair+0.003
Neg logit contributions
over-0.002
P-0.002
liest-0.002
hered-0.002
ed-0.002
Token rightful
Token ID43133
Feature activation-9.41e-02

Pos logit contributions
P+0.021
over+0.021
Sus+0.019
liest+0.019
hered+0.019
Neg logit contributions
 stool-0.015
 tray-0.014
 log-0.014
 chair-0.014
 scale-0.014
Token place
Token ID1295
Feature activation-2.43e-02

Pos logit contributions
 Emmy+0.176
over+0.174
P+0.173
hered+0.172
Em+0.169
Neg logit contributions
 stool-0.120
 chair-0.116
 log-0.113
 tray-0.111
 thinks-0.106
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
P+0.040
over+0.038
lyss+0.034
hered+0.033
date+0.033
Neg logit contributions
 stool-0.041
 tray-0.039
 log-0.038
 chair-0.038
 oven-0.037
Token Lily
Token ID20037
Feature activation-1.68e-02

Pos logit contributions
over+0.017
P+0.017
liest+0.015
hered+0.015
Sus+0.015
Neg logit contributions
 stool-0.016
 tray-0.016
 log-0.015
 chair-0.015
 microwave-0.015
Token was
Token ID373
Feature activation+6.57e-03

Pos logit contributions
over+0.020
P+0.020
liest+0.018
hered+0.018
amed+0.018
Neg logit contributions
 stool-0.033
 tray-0.032
 log-0.032
 chair-0.030
 microwave-0.030
Token happy
Token ID3772
Feature activation+3.07e-04

Pos logit contributions
 stool+0.008
 chair+0.008
 tray+0.008
 log+0.007
 scale+0.007
Neg logit contributions
P-0.013
over-0.012
liest-0.011
date-0.011
lish-0.011
Token that
Token ID326
Feature activation+1.76e-02

Pos logit contributions
P+0.013
over+0.013
amed+0.011
hered+0.011
beh+0.011
Neg logit contributions
 stool-0.014
 tray-0.014
 log-0.014
 chair-0.013
 oven-0.013
Token the
Token ID262
Feature activation+7.79e-03

Pos logit contributions
 stool+0.028
 log+0.027
 tray+0.027
 chair+0.026
 fork+0.025
Neg logit contributions
over-0.029
P-0.028
amed-0.026
hered-0.026
liest-0.026
Token penny
Token ID26369
Feature activation-7.04e-03

Pos logit contributions
 stool+0.013
 tray+0.013
 log+0.013
 chair+0.012
 scale+0.012
Neg logit contributions
over-0.012
P-0.011
hered-0.011
amed-0.011
liest-0.010
Token find
Token ID1064
Feature activation-4.51e-03

Pos logit contributions
P+0.011
over+0.011
Sus+0.010
amed+0.009
hered+0.009
Neg logit contributions
 stool-0.012
 log-0.011
 tray-0.011
 chair-0.011
 scale-0.011
Token its
Token ID663
Feature activation-2.65e-02

Pos logit contributions
P+0.008
over+0.007
fer+0.007
date+0.007
lyss+0.007
Neg logit contributions
 oven-0.006
 tray-0.006
 chair-0.006
 stool-0.006
 log-0.006
Token rightful
Token ID43133
Feature activation-9.27e-02

Pos logit contributions
over+0.046
P+0.045
liest+0.041
hered+0.041
Sus+0.041
Neg logit contributions
 stool-0.038
 tray-0.037
 log-0.036
 chair-0.035
 microwave-0.034
Token owner
Token ID4870
Feature activation-1.37e-02

Pos logit contributions
over+0.164
hered+0.162
P+0.162
date+0.156
lish+0.155
Neg logit contributions
 stool-0.124
 chair-0.121
 log-0.120
 tray-0.117
 thinks-0.117
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
P+0.022
over+0.021
Sus+0.019
amed+0.019
date+0.019
Neg logit contributions
 stool-0.023
 tray-0.022
 log-0.022
 chair-0.021
 oven-0.021
Token
Token ID220
Feature activation+3.09e-02

Pos logit contributions
over+0.012
P+0.012
amed+0.011
liest+0.011
hered+0.011
Neg logit contributions
 stool-0.010
 tray-0.010
 log-0.010
 chair-0.009
 microwave-0.009
Token
Token ID198
Feature activation-9.00e-04

Pos logit contributions
 stool+0.055
 log+0.054
 tray+0.054
 scale+0.051
 fork+0.051
Neg logit contributions
over-0.043
P-0.043
Sus-0.039
 Emmy-0.038
amed-0.038
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
over+0.002
P+0.001
liest+0.001
hered+0.001
beh+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 chair-0.001
 microwave-0.001
Token be
Token ID307
Feature activation-3.69e-02

Pos logit contributions
over+0.025
P+0.025
liest+0.023
hered+0.023
Sus+0.023
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 chair-0.017
 microwave-0.017
Token happy
Token ID3772
Feature activation-1.98e-02

Pos logit contributions
hered+0.065
lyss+0.064
P+0.064
liest+0.063
over+0.062
Neg logit contributions
 stool-0.044
 tray-0.043
 log-0.042
 oven-0.042
 scale-0.041
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
P+0.029
over+0.028
amed+0.025
hered+0.025
Sus+0.025
Neg logit contributions
 stool-0.034
 tray-0.033
 log-0.033
 chair-0.032
 scale-0.032
Token well
Token ID880
Feature activation-3.21e-02

Pos logit contributions
over+0.019
P+0.019
liest+0.018
hered+0.017
Sus+0.017
Neg logit contributions
 stool-0.017
 log-0.016
 tray-0.016
 scale-0.016
 chair-0.016
Token-
Token ID12
Feature activation-3.98e-02

Pos logit contributions
P+0.056
over+0.051
Sus+0.051
lyss+0.049
amed+0.049
Neg logit contributions
 stool-0.046
 tray-0.045
 log-0.045
 scale-0.044
 chair-0.043
Tokenbeh
Token ID20709
Feature activation-9.22e-02

Pos logit contributions
P+0.048
over+0.046
lyss+0.043
 Emmy+0.042
hered+0.041
Neg logit contributions
 stool-0.070
 scale-0.068
 oven-0.066
 ladder-0.065
 log-0.065
Tokenaved
Token ID9586
Feature activation+1.53e-02

Pos logit contributions
P+0.137
over+0.133
Sus+0.119
lyss+0.116
 such+0.115
Neg logit contributions
 tray-0.145
 stool-0.144
ses-0.143
 log-0.141
 thinks-0.140
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
 stool+0.023
 tray+0.022
 chair+0.022
 scale+0.022
 log+0.021
Neg logit contributions
P-0.027
lyss-0.024
over-0.023
Ph-0.023
Sus-0.023
Token The
Token ID383
Feature activation+6.70e-03

Pos logit contributions
over+0.019
P+0.019
 such+0.017
Sus+0.017
liest+0.017
Neg logit contributions
 stool-0.025
 tray-0.024
 log-0.023
 scale-0.023
 chair-0.022
Token end
Token ID886
Feature activation-9.79e-03

Pos logit contributions
 stool+0.011
 tray+0.010
 Deal+0.010
ses+0.010
 log+0.010
Neg logit contributions
over-0.010
P-0.010
lyss-0.009
date-0.009
lish-0.009
Token.
Token ID13
Feature activation-9.04e-03

Pos logit contributions
over+0.012
P+0.011
 such+0.010
liest+0.010
Sus+0.010
Neg logit contributions
 stool-0.019
 tray-0.019
 log-0.018
 microwave-0.018
 scale-0.018
Token,
Token ID11
Feature activation+1.91e-03

Pos logit contributions
P+0.029
over+0.026
amed+0.025
Sus+0.025
beh+0.025
Neg logit contributions
 stool-0.024
 log-0.024
 tray-0.024
 chair-0.023
 scale-0.023
Token but
Token ID475
Feature activation+4.87e-03

Pos logit contributions
 stool+0.003
 log+0.003
 tray+0.003
 chair+0.003
 microwave+0.003
Neg logit contributions
P-0.003
over-0.003
liest-0.003
hered-0.003
amed-0.003
Token it
Token ID340
Feature activation-2.26e-02

Pos logit contributions
 log+0.006
 stool+0.006
 chair+0.006
 scale+0.006
 fork+0.006
Neg logit contributions
P-0.010
over-0.009
hered-0.009
date-0.009
amed-0.009
Token was
Token ID373
Feature activation-3.81e-03

Pos logit contributions
over+0.030
P+0.029
Sus+0.026
hered+0.026
date+0.026
Neg logit contributions
 stool-0.042
 log-0.041
 tray-0.041
 chair-0.039
 scale-0.039
Token all
Token ID477
Feature activation-9.62e-03

Pos logit contributions
over+0.006
P+0.006
hered+0.005
liest+0.005
Sus+0.005
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 chair-0.006
 oven-0.006
Token fore
Token ID1674
Feature activation-9.14e-02

Pos logit contributions
over+0.016
P+0.016
hered+0.013
lish+0.013
date+0.013
Neg logit contributions
 log-0.015
 stool-0.015
 oven-0.015
 tray-0.015
 chair-0.015
Tokenshadow
Token ID19106
Feature activation-4.34e-02

Pos logit contributions
P+0.134
over+0.134
lyss+0.130
 such+0.128
date+0.125
Neg logit contributions
 stool-0.139
 tractor-0.131
 log-0.128
 trailer-0.126
 tray-0.125
Tokened
Token ID276
Feature activation-8.19e-03

Pos logit contributions
P+0.057
date+0.053
lyss+0.049
 Emmy+0.046
Ag+0.046
Neg logit contributions
ia-0.072
 thinks-0.067
 loves-0.064
 tight-0.064
ira-0.063
Token by
Token ID416
Feature activation-2.54e-03

Pos logit contributions
P+0.014
over+0.012
Sus+0.012
lyss+0.012
Em+0.012
Neg logit contributions
 log-0.013
 thinks-0.012
 stool-0.012
 scale-0.012
 tray-0.012
Token the
Token ID262
Feature activation+8.11e-03

Pos logit contributions
P+0.004
over+0.004
date+0.003
amed+0.003
liest+0.003
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.004
 scale-0.004
 chair-0.004
Token kind
Token ID1611
Feature activation-5.27e-03

Pos logit contributions
 stool+0.012
 tray+0.011
 log+0.011
 chair+0.010
 microwave+0.010
Neg logit contributions
over-0.014
P-0.014
liest-0.013
hered-0.013
Sus-0.013
Token became
Token ID2627
Feature activation-1.51e-02

Pos logit contributions
 thinks+0.007
 log+0.007
 tray+0.007
 stool+0.007
 chair+0.007
Neg logit contributions
over-0.006
P-0.006
date-0.006
Sus-0.005
lish-0.005
Token friendly
Token ID8030
Feature activation-8.46e-03

Pos logit contributions
over+0.028
P+0.027
hered+0.027
Sus+0.026
Em+0.026
Neg logit contributions
 log-0.018
 tray-0.018
 snowball-0.018
 stool-0.017
 scale-0.017
Token and
Token ID290
Feature activation+2.28e-03

Pos logit contributions
P+0.019
Sus+0.018
over+0.018
amed+0.017
beh+0.017
Neg logit contributions
 tray-0.008
 log-0.008
 stool-0.008
 thinks-0.008
 fork-0.007
Token well
Token ID880
Feature activation-2.79e-02

Pos logit contributions
 stool+0.003
 log+0.003
 tray+0.003
 thinks+0.003
 chair+0.003
Neg logit contributions
P-0.004
over-0.004
liest-0.003
Sus-0.003
date-0.003
Token-
Token ID12
Feature activation-4.00e-02

Pos logit contributions
P+0.049
Sus+0.045
over+0.043
amed+0.042
date+0.042
Neg logit contributions
 stool-0.040
 tray-0.040
 log-0.039
 scale-0.038
 chair-0.038
Tokenbeh
Token ID20709
Feature activation-9.14e-02

Pos logit contributions
P+0.054
 Emmy+0.049
over+0.048
Em+0.048
date+0.047
Neg logit contributions
 stool-0.062
 scale-0.061
 oven-0.060
 microwave-0.060
 log-0.060
Tokenaved
Token ID9586
Feature activation+1.79e-02

Pos logit contributions
P+0.134
over+0.129
Sus+0.117
 such+0.114
lish+0.113
Neg logit contributions
 tray-0.144
 stool-0.144
ses-0.143
 log-0.140
 thinks-0.140
Token,
Token ID11
Feature activation+7.88e-03

Pos logit contributions
 tray+0.024
 stool+0.024
 log+0.024
 chair+0.023
 scale+0.023
Neg logit contributions
P-0.034
lyss-0.030
Sus-0.029
beh-0.028
Em-0.028
Token just
Token ID655
Feature activation-2.30e-02

Pos logit contributions
 stool+0.012
 tray+0.011
 log+0.011
 chair+0.011
 microwave+0.011
Neg logit contributions
over-0.013
P-0.013
liest-0.012
hered-0.012
Sus-0.012
Token like
Token ID588
Feature activation+1.78e-02

Pos logit contributions
P+0.043
over+0.038
Sus+0.037
Em+0.036
amed+0.036
Neg logit contributions
 stool-0.032
 log-0.032
 tray-0.032
 thinks-0.031
ses-0.031
Token Buddy
Token ID36896
Feature activation+1.48e-02

Pos logit contributions
 stool+0.030
 log+0.029
 tray+0.029
 scale+0.028
 chair+0.028
Neg logit contributions
P-0.027
over-0.026
amed-0.024
Sus-0.023
liest-0.023
Tokenie
Token ID494
Feature activation+3.69e-03

Pos logit contributions
over+0.027
P+0.027
liest+0.024
Sus+0.023
hered+0.023
Neg logit contributions
 stool-0.045
 tray-0.043
 log-0.043
 chair-0.041
 microwave-0.041
Token.
Token ID13
Feature activation-6.34e-03

Pos logit contributions
 stool+0.007
 tray+0.007
 log+0.006
 microwave+0.006
 Deal+0.006
Neg logit contributions
over-0.005
P-0.005
liest-0.004
ed-0.004
hered-0.004
Token They
Token ID1119
Feature activation+5.83e-03

Pos logit contributions
over+0.010
P+0.010
 Emmy+0.009
hered+0.009
Sus+0.009
Neg logit contributions
 stool-0.010
 log-0.010
 tray-0.010
 fork-0.009
 chair-0.009
Token both
Token ID1111
Feature activation+1.24e-03

Pos logit contributions
 stool+0.010
 tray+0.010
 log+0.010
 fork+0.010
 scale+0.010
Neg logit contributions
over-0.008
P-0.008
hered-0.007
Sus-0.007
liest-0.007
Token had
Token ID550
Feature activation-1.46e-02

Pos logit contributions
 thinks+0.002
 tray+0.001
 likes+0.001
 log+0.001
 rug+0.001
Neg logit contributions
over-0.002
P-0.002
date-0.002
hered-0.002
lish-0.002
Token birth
Token ID4082
Feature activation-9.08e-02

Pos logit contributions
over+0.029
lish+0.029
hered+0.029
fer+0.028
beh+0.028
Neg logit contributions
 tray-0.017
 rug-0.015
 hay-0.015
 log-0.015
 snowball-0.015
Tokendays
Token ID12545
Feature activation-1.62e-03

Pos logit contributions
P+0.133
over+0.132
liest+0.116
lyss+0.114
hered+0.113
Neg logit contributions
 stool-0.154
 tray-0.149
 log-0.145
 chair-0.143
 fork-0.142
Token coming
Token ID2406
Feature activation-2.21e-02

Pos logit contributions
P+0.003
over+0.003
hered+0.002
lyss+0.002
beh+0.002
Neg logit contributions
 stool-0.002
 tray-0.002
 chair-0.002
 oven-0.002
 rug-0.002
Token up
Token ID510
Feature activation-2.88e-02

Pos logit contributions
P+0.031
over+0.031
liest+0.028
lyss+0.027
hered+0.026
Neg logit contributions
 stool-0.040
 tray-0.038
 log-0.038
 chair-0.037
 scale-0.036
Token,
Token ID11
Feature activation+6.84e-03

Pos logit contributions
over+0.042
P+0.042
liest+0.039
hered+0.038
amed+0.037
Neg logit contributions
 stool-0.050
 tray-0.048
 log-0.047
 chair-0.046
 oven-0.045
Token and
Token ID290
Feature activation-5.90e-03

Pos logit contributions
 stool+0.011
 log+0.011
 tray+0.011
 microwave+0.011
 Deal+0.011
Neg logit contributions
over-0.010
P-0.009
 such-0.009
 Emmy-0.009
hered-0.009
Token the
Token ID262
Feature activation-1.25e-02

Pos logit contributions
over+0.023
P+0.023
Sus+0.022
beh+0.022
lyss+0.022
Neg logit contributions
 tray-0.026
 stool-0.026
 log-0.025
 chair-0.025
 oven-0.024
Token rice
Token ID11464
Feature activation-1.02e-02

Pos logit contributions
over+0.023
P+0.023
hered+0.021
liest+0.021
date+0.021
Neg logit contributions
 stool-0.016
 log-0.015
 tray-0.015
 microwave-0.014
 chair-0.014
Token back
Token ID736
Feature activation-1.87e-03

Pos logit contributions
over+0.011
P+0.011
Sus+0.010
lyss+0.010
beh+0.010
Neg logit contributions
 stool-0.021
 tray-0.021
 log-0.021
 chair-0.020
 fork-0.020
Token in
Token ID287
Feature activation+1.63e-02

Pos logit contributions
over+0.002
P+0.002
lyss+0.002
Sus+0.002
hered+0.002
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 oven-0.004
Token its
Token ID663
Feature activation-1.08e-02

Pos logit contributions
 stool+0.029
 tray+0.029
 log+0.028
 chair+0.027
 fork+0.027
Neg logit contributions
over-0.023
P-0.023
liest-0.020
beh-0.020
ed-0.020
Token rightful
Token ID43133
Feature activation-9.01e-02

Pos logit contributions
P+0.022
over+0.021
Sus+0.020
liest+0.020
beh+0.020
Neg logit contributions
 stool-0.013
 tray-0.013
 log-0.013
 chair-0.012
 oven-0.012
Token place
Token ID1295
Feature activation-1.65e-03

Pos logit contributions
P+0.151
hered+0.148
beh+0.146
over+0.145
date+0.143
Neg logit contributions
 stool-0.132
 log-0.128
 tray-0.128
 chair-0.127
 thinks-0.122
Token.
Token ID13
Feature activation-9.31e-03

Pos logit contributions
P+0.003
over+0.003
hered+0.002
amed+0.002
lyss+0.002
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.002
 chair-0.002
 oven-0.002
Token John
Token ID1757
Feature activation-3.78e-03

Pos logit contributions
over+0.013
P+0.012
 Emmy+0.012
Sus+0.012
 such+0.012
Neg logit contributions
 stool-0.017
 log-0.016
 tray-0.016
 scale-0.015
 fork-0.015
Token was
Token ID373
Feature activation+7.13e-03

Pos logit contributions
over+0.005
P+0.005
Sus+0.004
date+0.004
hered+0.004
Neg logit contributions
 stool-0.007
 tray-0.007
 log-0.007
 chair-0.006
 microwave-0.006
Token proud
Token ID6613
Feature activation+1.25e-02

Pos logit contributions
 stool+0.009
 chair+0.009
 oven+0.009
 tray+0.009
 log+0.008
Neg logit contributions
P-0.014
over-0.012
hered-0.012
liest-0.012
lish-0.012
Token put
Token ID1234
Feature activation-2.35e-02

Pos logit contributions
over+0.006
P+0.006
hered+0.005
liest+0.005
amed+0.005
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 scale-0.004
Token them
Token ID606
Feature activation-2.14e-02

Pos logit contributions
over+0.042
P+0.039
beh+0.038
lyss+0.038
hered+0.037
Neg logit contributions
 stool-0.034
 tray-0.034
 scale-0.033
 log-0.033
 microwave-0.033
Token back
Token ID736
Feature activation-8.01e-03

Pos logit contributions
over+0.028
P+0.026
lyss+0.025
hered+0.023
liest+0.023
Neg logit contributions
 stool-0.041
 tray-0.041
 chair-0.039
 log-0.039
 scale-0.038
Token in
Token ID287
Feature activation+1.52e-02

Pos logit contributions
P+0.009
over+0.009
lyss+0.008
hered+0.008
date+0.008
Neg logit contributions
 stool-0.017
 tray-0.016
 log-0.016
 chair-0.016
 scale-0.015
Token their
Token ID511
Feature activation-1.71e-02

Pos logit contributions
 stool+0.026
 tray+0.025
 log+0.025
 chair+0.024
 fork+0.024
Neg logit contributions
over-0.023
P-0.023
beh-0.021
liest-0.020
amed-0.020
Token rightful
Token ID43133
Feature activation-8.95e-02

Pos logit contributions
over+0.032
P+0.032
liest+0.029
hered+0.029
Sus+0.029
Neg logit contributions
 stool-0.022
 tray-0.021
 log-0.021
 microwave-0.020
 chair-0.019
Token place
Token ID1295
Feature activation-1.24e-03

Pos logit contributions
P+0.165
over+0.163
hered+0.161
beh+0.156
date+0.155
Neg logit contributions
 stool-0.120
 tray-0.113
 chair-0.113
 log-0.112
 thinks-0.106
Token.
Token ID13
Feature activation-8.79e-03

Pos logit contributions
P+0.002
over+0.002
hered+0.002
amed+0.002
lyss+0.002
Neg logit contributions
 stool-0.002
 tray-0.002
 log-0.002
 chair-0.002
 scale-0.002
Token\n
Token ID198
Feature activation-6.83e-03

Pos logit contributions
over+0.013
P+0.013
Sus+0.012
 Emmy+0.012
 such+0.012
Neg logit contributions
 stool-0.015
 tray-0.014
 log-0.014
 chair-0.013
 scale-0.013
Token\n
Token ID198
Feature activation-1.44e-02

Pos logit contributions
P+0.009
over+0.009
Sus+0.008
Em+0.008
amed+0.008
Neg logit contributions
 stool-0.013
 tray-0.013
ses-0.012
 log-0.012
 oven-0.012
TokenAfter
Token ID3260
Feature activation-1.19e-02

Pos logit contributions
P+0.016
over+0.016
Sus+0.013
liest+0.013
amed+0.013
Neg logit contributions
 stool-0.030
 tray-0.030
 log-0.029
 chair-0.028
 microwave-0.028
Token he
Token ID339
Feature activation-2.02e-03

Pos logit contributions
 stool+0.011
 log+0.010
 chair+0.010
 fork+0.010
 scale+0.010
Neg logit contributions
P-0.013
over-0.013
hered-0.012
amed-0.012
liest-0.012
Token got
Token ID1392
Feature activation-9.97e-03

Pos logit contributions
over+0.003
P+0.003
Sus+0.002
liest+0.002
hered+0.002
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.003
 chair-0.003
 microwave-0.003
Token it
Token ID340
Feature activation-8.63e-03

Pos logit contributions
over+0.018
Em+0.018
P+0.017
date+0.016
lyss+0.016
Neg logit contributions
 log-0.014
 scale-0.013
 oven-0.013
 stool-0.013
 ladder-0.013
Token so
Token ID523
Feature activation+1.93e-02

Pos logit contributions
P+0.013
over+0.013
lyss+0.012
Sus+0.012
date+0.011
Neg logit contributions
 stool-0.015
 log-0.014
 tray-0.014
 chair-0.013
 scale-0.013
Token the
Token ID262
Feature activation-6.42e-03

Pos logit contributions
 log+0.025
 stool+0.025
 ladder+0.024
 tray+0.024
 fork+0.023
Neg logit contributions
P-0.036
over-0.035
date-0.034
Em-0.033
hered-0.032
Token rightful
Token ID43133
Feature activation-8.93e-02

Pos logit contributions
over+0.011
P+0.011
hered+0.010
liest+0.010
amed+0.010
Neg logit contributions
 stool-0.010
 tray-0.009
 log-0.009
 chair-0.009
 scale-0.009
Token owner
Token ID4870
Feature activation-1.67e-02

Pos logit contributions
 Emmy+0.150
P+0.149
hered+0.149
over+0.149
Em+0.145
Neg logit contributions
 stool-0.126
 log-0.124
 chair-0.122
 tray-0.120
 thinks-0.119
Token could
Token ID714
Feature activation+1.06e-03

Pos logit contributions
P+0.024
over+0.023
Sus+0.020
lyss+0.020
hered+0.020
Neg logit contributions
 stool-0.030
 tray-0.029
 log-0.029
 chair-0.028
 scale-0.028
Token find
Token ID1064
Feature activation+1.22e-03

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 microwave+0.001
 chair+0.001
Neg logit contributions
over-0.002
P-0.002
hered-0.002
 such-0.002
date-0.002
Token it
Token ID340
Feature activation-2.53e-03

Pos logit contributions
 log+0.002
 oven+0.002
 scale+0.002
 fork+0.002
 tray+0.002
Neg logit contributions
P-0.002
over-0.002
amed-0.002
lyss-0.002
Em-0.002
Token.
Token ID13
Feature activation-5.68e-03

Pos logit contributions
P+0.004
over+0.004
Sus+0.004
lyss+0.004
amed+0.004
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.004
 chair-0.004
 scale-0.004
Token could
Token ID714
Feature activation-2.79e-03

Pos logit contributions
over+0.027
P+0.026
Sus+0.024
date+0.023
amed+0.023
Neg logit contributions
 log-0.025
 stool-0.025
 tray-0.024
 thinks-0.023
 scale-0.023
Token be
Token ID307
Feature activation-2.15e-02

Pos logit contributions
over+0.005
P+0.005
liest+0.004
amed+0.004
hered+0.004
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 scale-0.004
Token back
Token ID736
Feature activation-2.09e-02

Pos logit contributions
P+0.039
over+0.038
date+0.036
Em+0.036
lyss+0.035
Neg logit contributions
 log-0.029
 tray-0.028
 stool-0.028
 scale-0.027
 thinks-0.027
Token with
Token ID351
Feature activation+3.90e-03

Pos logit contributions
P+0.034
over+0.032
lyss+0.030
Em+0.030
hered+0.029
Neg logit contributions
 stool-0.033
 log-0.032
 tray-0.031
 chair-0.031
 scale-0.030
Token its
Token ID663
Feature activation-1.17e-02

Pos logit contributions
 stool+0.006
 tray+0.006
 log+0.006
 fork+0.006
 chair+0.006
Neg logit contributions
over-0.006
P-0.006
hered-0.005
lyss-0.005
liest-0.005
Token rightful
Token ID43133
Feature activation-8.88e-02

Pos logit contributions
P+0.022
over+0.021
liest+0.020
date+0.020
Sus+0.019
Neg logit contributions
 tray-0.016
 stool-0.016
 log-0.015
 oven-0.014
ses-0.014
Token owner
Token ID4870
Feature activation-1.84e-02

Pos logit contributions
P+0.160
hered+0.158
over+0.157
Em+0.156
date+0.155
Neg logit contributions
 stool-0.112
 thinks-0.110
 chair-0.110
 tray-0.109
 log-0.109
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
P+0.030
over+0.028
Sus+0.027
amed+0.026
lyss+0.026
Neg logit contributions
 stool-0.029
 tray-0.029
 log-0.028
 chair-0.027
 oven-0.027
Token\n
Token ID198
Feature activation-4.99e-03

Pos logit contributions
P+0.020
over+0.020
Sus+0.018
liest+0.018
ed+0.018
Neg logit contributions
 stool-0.022
 tray-0.021
 log-0.021
 scale-0.020
 chair-0.020
Token\n
Token ID198
Feature activation-1.34e-02

Pos logit contributions
P+0.007
over+0.006
Sus+0.006
amed+0.006
Em+0.006
Neg logit contributions
 stool-0.010
 tray-0.009
ses-0.009
 log-0.009
 oven-0.009
TokenIn
Token ID818
Feature activation-2.28e-03

Pos logit contributions
P+0.015
over+0.014
Sus+0.013
amed+0.012
liest+0.012
Neg logit contributions
 stool-0.028
 tray-0.028
 log-0.027
 chair-0.026
 microwave-0.026
Token saw
Token ID2497
Feature activation+1.75e-02

Pos logit contributions
 stool+0.002
 log+0.002
 chair+0.002
 tray+0.002
 thinks+0.002
Neg logit contributions
over-0.002
P-0.002
Sus-0.001
lyss-0.001
date-0.001
Token some
Token ID617
Feature activation-1.20e-02

Pos logit contributions
 log+0.029
 stool+0.028
 tray+0.028
 chair+0.027
 microwave+0.027
Neg logit contributions
over-0.028
P-0.027
hered-0.026
beh-0.025
date-0.024
Token other
Token ID584
Feature activation-2.26e-02

Pos logit contributions
P+0.020
over+0.020
hered+0.019
beh+0.019
amed+0.019
Neg logit contributions
 stool-0.018
 tray-0.018
 log-0.018
 chair-0.017
 fork-0.017
Token kids
Token ID3988
Feature activation-2.86e-02

Pos logit contributions
over+0.033
P+0.033
Sus+0.029
hered+0.029
amed+0.029
Neg logit contributions
 stool-0.039
 tray-0.037
 log-0.037
 chair-0.036
 microwave-0.035
Token mis
Token ID2984
Feature activation-6.26e-02

Pos logit contributions
P+0.042
over+0.040
amed+0.037
liest+0.036
Sus+0.036
Neg logit contributions
 stool-0.050
 tray-0.048
 log-0.048
 chair-0.046
 scale-0.046
Tokenbeh
Token ID20709
Feature activation-8.87e-02

Pos logit contributions
 Emmy+0.072
 such+0.069
 Lilly+0.069
over+0.065
lyss+0.062
Neg logit contributions
 stool-0.105
 log-0.099
 stack-0.099
 tray-0.098
 tractor-0.096
Tokenaving
Token ID2703
Feature activation+1.09e-02

Pos logit contributions
P+0.128
over+0.126
lish+0.113
 such+0.112
Sus+0.109
Neg logit contributions
 stool-0.141
 tray-0.140
 log-0.138
ses-0.135
 thinks-0.133
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
 stool+0.016
 tray+0.015
 log+0.015
 scale+0.015
 chair+0.015
Neg logit contributions
P-0.019
over-0.017
Sus-0.017
amed-0.016
lyss-0.016
Token running
Token ID2491
Feature activation-4.86e-03

Pos logit contributions
P+0.022
over+0.022
liest+0.019
amed+0.019
aming+0.018
Neg logit contributions
 log-0.018
 stool-0.018
 thinks-0.018
 scale-0.018
 chair-0.017
Token around
Token ID1088
Feature activation+5.72e-03

Pos logit contributions
P+0.008
over+0.007
Sus+0.007
Em+0.007
aming+0.007
Neg logit contributions
 stool-0.007
 log-0.007
 tray-0.007
 thinks-0.007
 scale-0.007
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
 stool+0.010
 log+0.009
 tray+0.009
 chair+0.009
 scale+0.009
Neg logit contributions
P-0.009
over-0.009
amed-0.008
liest-0.008
Sus-0.008
Token listen
Token ID6004
Feature activation-1.24e-02

Pos logit contributions
over+0.021
P+0.020
amed+0.020
Sus+0.019
 Emmy+0.019
Neg logit contributions
 log-0.008
 thinks-0.008
 scale-0.008
 peel-0.008
 stool-0.007
Token.
Token ID13
Feature activation-1.63e-03

Pos logit contributions
P+0.018
over+0.017
amed+0.015
hered+0.015
Sus+0.015
Neg logit contributions
 stool-0.022
 log-0.021
 tray-0.021
 chair-0.020
 scale-0.020
Token He
Token ID679
Feature activation+4.97e-03

Pos logit contributions
over+0.002
P+0.002
 such+0.002
Ph+0.002
Sus+0.002
Neg logit contributions
 stool-0.003
 tray-0.003
 fork-0.002
 microwave-0.002
 tractor-0.002
Token always
Token ID1464
Feature activation-1.40e-02

Pos logit contributions
 stool+0.009
 tray+0.009
 log+0.009
 chair+0.008
 microwave+0.008
Neg logit contributions
P-0.007
over-0.007
liest-0.006
ed-0.006
hered-0.006
Token mis
Token ID2984
Feature activation-7.46e-02

Pos logit contributions
over+0.023
P+0.022
date+0.020
liest+0.020
Sus+0.020
Neg logit contributions
 thinks-0.022
 log-0.022
 stool-0.022
 scale-0.021
 tray-0.020
Tokenbeh
Token ID20709
Feature activation-8.87e-02

Pos logit contributions
 Emmy+0.075
 Lilly+0.074
lyss+0.073
over+0.069
Ph+0.064
Neg logit contributions
 log-0.136
 stool-0.132
 tractor-0.132
 reins-0.130
ws-0.130
Tokenaved
Token ID9586
Feature activation+2.05e-02

Pos logit contributions
P+0.128
over+0.128
Sus+0.115
lyss+0.111
lish+0.110
Neg logit contributions
 stool-0.143
 tray-0.142
 log-0.141
 thinks-0.141
ses-0.139
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 stool+0.033
 log+0.033
 tray+0.032
 scale+0.032
 chair+0.032
Neg logit contributions
P-0.032
over-0.031
Sus-0.030
lyss-0.029
Em-0.028
Token caused
Token ID4073
Feature activation-2.17e-02

Pos logit contributions
over+0.020
P+0.020
liest+0.018
Sus+0.018
amed+0.018
Neg logit contributions
 stool-0.021
 log-0.021
 tray-0.020
 scale-0.020
 chair-0.020
Token trouble
Token ID5876
Feature activation-4.68e-02

Pos logit contributions
P+0.037
over+0.036
date+0.034
amed+0.034
lyss+0.033
Neg logit contributions
 log-0.032
 thinks-0.031
 snowball-0.030
 stool-0.030
 scale-0.030
Token wherever
Token ID14530
Feature activation+1.09e-02

Pos logit contributions
P+0.087
over+0.075
Sus+0.072
amed+0.072
lyss+0.072
Neg logit contributions
 stool-0.068
 log-0.066
 scale-0.064
 tray-0.064
 chair-0.062
Token restore
Token ID11169
Feature activation-2.83e-02

Pos logit contributions
over+0.009
P+0.009
hered+0.008
liest+0.008
Sus+0.008
Neg logit contributions
 stool-0.009
 log-0.008
 tray-0.008
 chair-0.008
 scale-0.008
Token it
Token ID340
Feature activation-8.50e-03

Pos logit contributions
P+0.053
over+0.051
lyss+0.050
Em+0.050
amed+0.049
Neg logit contributions
 log-0.038
 scale-0.038
 fork-0.037
 tray-0.036
 oven-0.035
Token back
Token ID736
Feature activation-1.85e-02

Pos logit contributions
P+0.014
over+0.013
lyss+0.012
Sus+0.012
Em+0.012
Neg logit contributions
 stool-0.013
 log-0.013
 tray-0.013
 scale-0.013
 chair-0.013
Token to
Token ID284
Feature activation+1.07e-02

Pos logit contributions
P+0.027
over+0.026
lyss+0.024
Sus+0.023
Em+0.023
Neg logit contributions
 stool-0.033
 log-0.032
 tray-0.031
 chair-0.031
 scale-0.030
Token its
Token ID663
Feature activation-1.55e-02

Pos logit contributions
 log+0.016
 scale+0.016
 stool+0.016
 oven+0.016
 tray+0.016
Neg logit contributions
P-0.018
over-0.017
amed-0.016
lyss-0.016
hered-0.016
Token rightful
Token ID43133
Feature activation-8.86e-02

Pos logit contributions
over+0.024
P+0.023
 such+0.021
liest+0.021
date+0.021
Neg logit contributions
 stool-0.025
 tray-0.024
 log-0.023
 microwave-0.023
 chair-0.022
Token place
Token ID1295
Feature activation-1.53e-02

Pos logit contributions
P+0.148
over+0.147
hered+0.146
 Emmy+0.140
date+0.140
Neg logit contributions
 stool-0.129
 log-0.128
 tray-0.124
 chair-0.124
 thinks-0.119
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
P+0.026
over+0.024
hered+0.022
lyss+0.022
amed+0.022
Neg logit contributions
 stool-0.024
 tray-0.023
 log-0.023
 chair-0.023
 oven-0.022
Token Although
Token ID4900
Feature activation-8.36e-03

Pos logit contributions
over+0.021
P+0.021
liest+0.019
 such+0.018
hered+0.018
Neg logit contributions
 stool-0.024
 tray-0.024
 log-0.023
 chair-0.022
 scale-0.022
Token Sammy
Token ID44067
Feature activation+1.10e-02

Pos logit contributions
P+0.015
amed+0.014
over+0.014
Em+0.014
date+0.014
Neg logit contributions
 log-0.013
 carrot-0.011
 stool-0.011
 tractor-0.011
 sled-0.011
Token was
Token ID373
Feature activation+9.56e-03

Pos logit contributions
 stool+0.021
 tray+0.020
 log+0.020
 microwave+0.019
 chair+0.019
Neg logit contributions
over-0.014
P-0.014
liest-0.012
ed-0.012
hered-0.012
Token started
Token ID2067
Feature activation-5.17e-03

Pos logit contributions
 stool+0.003
 tray+0.003
 log+0.003
 chair+0.003
 scale+0.003
Neg logit contributions
over-0.002
P-0.002
liest-0.002
Sus-0.002
hered-0.002
Token walking
Token ID6155
Feature activation-8.03e-04

Pos logit contributions
over+0.007
P+0.007
liest+0.006
Sus+0.006
hered+0.006
Neg logit contributions
 stool-0.010
 tray-0.009
 log-0.009
 chair-0.009
 scale-0.009
Token very
Token ID845
Feature activation+1.85e-02

Pos logit contributions
P+0.001
over+0.001
liest+0.001
hered+0.001
Sus+0.001
Neg logit contributions
 stool-0.001
 log-0.001
 tray-0.001
 chair-0.001
 scale-0.001
Token slowly
Token ID6364
Feature activation+1.32e-02

Pos logit contributions
 stool+0.027
 tray+0.027
 microwave+0.026
 log+0.026
 oven+0.026
Neg logit contributions
over-0.031
P-0.031
Sus-0.028
amed-0.027
ed-0.027
Token as
Token ID355
Feature activation+1.07e-02

Pos logit contributions
 stool+0.020
 log+0.019
 chair+0.018
 tray+0.018
 scale+0.018
Neg logit contributions
P-0.024
over-0.022
Sus-0.022
beh-0.021
Em-0.021
Token tort
Token ID7619
Feature activation-8.84e-02

Pos logit contributions
 tray+0.018
 stool+0.018
 log+0.017
 microwave+0.017
 Deal+0.017
Neg logit contributions
over-0.016
P-0.015
Sus-0.014
ed-0.014
 such-0.014
Tokeno
Token ID78
Feature activation+1.06e-02

Pos logit contributions
over+0.141
lyss+0.129
 Emmy+0.127
Ph+0.127
 such+0.125
Neg logit contributions
 log-0.122
 stool-0.121
 sled-0.116
ws-0.115
 tractor-0.113
Tokenises
Token ID2696
Feature activation-1.85e-02

Pos logit contributions
 stool+0.018
 log+0.017
 chair+0.016
 tray+0.016
 scale+0.016
Neg logit contributions
P-0.016
Sus-0.015
over-0.015
amed-0.015
lyss-0.014
Token do
Token ID466
Feature activation-1.34e-02

Pos logit contributions
P+0.028
over+0.027
Sus+0.025
amed+0.024
liest+0.024
Neg logit contributions
 stool-0.031
 tray-0.030
 log-0.030
 chair-0.029
 scale-0.029
Token,
Token ID11
Feature activation+6.01e-03

Pos logit contributions
P+0.022
over+0.021
amed+0.019
hered+0.019
Sus+0.019
Neg logit contributions
 stool-0.021
 log-0.021
 tray-0.020
 chair-0.020
 scale-0.020
Token but
Token ID475
Feature activation+4.00e-03

Pos logit contributions
 stool+0.010
 tray+0.010
 log+0.009
 microwave+0.009
 chair+0.009
Neg logit contributions
over-0.009
P-0.009
 such-0.008
liest-0.008
Sus-0.008
Token to
Token ID284
Feature activation-3.01e-03

Pos logit contributions
over+0.013
P+0.013
hered+0.012
liest+0.012
Sus+0.012
Neg logit contributions
 stool-0.016
 tray-0.016
 log-0.016
 chair-0.015
 microwave-0.015
Token return
Token ID1441
Feature activation-2.43e-02

Pos logit contributions
over+0.006
P+0.006
ed+0.006
amed+0.006
liest+0.006
Neg logit contributions
 stool-0.004
 log-0.004
 scale-0.003
 tray-0.003
 chair-0.003
Token it
Token ID340
Feature activation-5.75e-03

Pos logit contributions
P+0.042
over+0.042
date+0.039
beh+0.039
hered+0.039
Neg logit contributions
 stool-0.035
 log-0.034
 tray-0.033
 chair-0.033
 scale-0.032
Token to
Token ID284
Feature activation+1.56e-02

Pos logit contributions
P+0.010
over+0.010
Sus+0.009
lyss+0.009
hered+0.009
Neg logit contributions
 stool-0.008
 tray-0.008
 log-0.008
 chair-0.008
 scale-0.007
Token its
Token ID663
Feature activation-1.16e-02

Pos logit contributions
 stool+0.026
 log+0.026
 tray+0.025
 oven+0.025
 scale+0.024
Neg logit contributions
P-0.023
over-0.023
amed-0.022
ed-0.021
hered-0.021
Token rightful
Token ID43133
Feature activation-8.75e-02

Pos logit contributions
over+0.017
P+0.017
liest+0.015
 such+0.015
date+0.015
Neg logit contributions
 stool-0.020
 tray-0.019
 log-0.019
 microwave-0.018
 chair-0.018
Token owner
Token ID4870
Feature activation-9.40e-03

Pos logit contributions
P+0.148
hered+0.146
over+0.146
date+0.140
 Emmy+0.140
Neg logit contributions
 stool-0.125
 log-0.123
 chair-0.121
 tray-0.118
 thinks-0.117
Token if
Token ID611
Feature activation+5.49e-03

Pos logit contributions
P+0.015
over+0.015
Sus+0.014
amed+0.014
hered+0.014
Neg logit contributions
 stool-0.014
 tray-0.014
 log-0.014
 chair-0.013
 oven-0.013
Token you
Token ID345
Feature activation+3.95e-03

Pos logit contributions
 stool+0.009
 log+0.008
 tray+0.008
 chair+0.008
 oven+0.008
Neg logit contributions
P-0.009
over-0.009
amed-0.008
hered-0.008
Sus-0.008
Token can
Token ID460
Feature activation-4.67e-03

Pos logit contributions
 stool+0.005
 tray+0.005
 log+0.005
 chair+0.005
 microwave+0.005
Neg logit contributions
over-0.007
P-0.007
liest-0.007
hered-0.007
date-0.007
Token find
Token ID1064
Feature activation-3.10e-03

Pos logit contributions
over+0.008
P+0.008
amed+0.008
liest+0.007
Sus+0.007
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 scale-0.006
 chair-0.006
Token the
Token ID262
Feature activation-7.05e-04

Pos logit contributions
 stool+0.006
 log+0.005
 tray+0.005
 chair+0.005
 scale+0.005
Neg logit contributions
P-0.005
over-0.005
amed-0.005
liest-0.005
hered-0.005
Token ancient
Token ID6156
Feature activation+1.18e-03

Pos logit contributions
over+0.001
P+0.001
lyss+0.001
date+0.001
liest+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 microwave-0.001
 chair-0.001
Token knob
Token ID36842
Feature activation-1.08e-02

Pos logit contributions
 stool+0.002
 tray+0.001
 log+0.001
 microwave+0.001
 chair+0.001
Neg logit contributions
over-0.002
P-0.002
liest-0.002
hered-0.002
Sus-0.002
Token found
Token ID1043
Feature activation+3.61e-03

Pos logit contributions
over+0.015
P+0.015
liest+0.014
hered+0.014
ed+0.014
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 microwave-0.017
 chair-0.017
Token its
Token ID663
Feature activation-2.03e-02

Pos logit contributions
 log+0.006
 tray+0.006
 stool+0.006
 oven+0.006
 chair+0.006
Neg logit contributions
P-0.006
over-0.005
beh-0.005
lyss-0.005
date-0.005
Token rightful
Token ID43133
Feature activation-8.72e-02

Pos logit contributions
over+0.031
P+0.029
 such+0.028
liest+0.027
date+0.027
Neg logit contributions
 stool-0.033
 tray-0.031
 microwave-0.031
 log-0.030
 Deal-0.030
Token place
Token ID1295
Feature activation-9.14e-03

Pos logit contributions
P+0.148
over+0.146
hered+0.144
beh+0.142
lish+0.141
Neg logit contributions
 stool-0.123
 log-0.119
 chair-0.119
 tray-0.117
 thinks-0.112
Token in
Token ID287
Feature activation+1.10e-02

Pos logit contributions
P+0.016
over+0.015
date+0.014
Sus+0.014
lyss+0.014
Neg logit contributions
 stool-0.014
 tray-0.013
 log-0.013
 chair-0.013
 oven-0.012
Token their
Token ID511
Feature activation-2.00e-02

Pos logit contributions
 stool+0.018
 tray+0.018
 log+0.018
 chair+0.017
 oven+0.017
Neg logit contributions
P-0.017
over-0.017
liest-0.015
beh-0.015
ed-0.015
Token house
Token ID2156
Feature activation+9.73e-03

Pos logit contributions
over+0.034
P+0.034
liest+0.031
Sus+0.031
hered+0.031
Neg logit contributions
 stool-0.029
 tray-0.028
 log-0.028
 chair-0.027
 oven-0.027
Token.
Token ID13
Feature activation-8.20e-03

Pos logit contributions
 stool+0.015
 tray+0.015
 log+0.014
 chair+0.014
 oven+0.014
Neg logit contributions
P-0.016
over-0.015
lyss-0.014
hered-0.014
Sus-0.014
Token The
Token ID383
Feature activation+7.90e-03

Pos logit contributions
over+0.026
P+0.026
liest+0.024
hered+0.024
Sus+0.024
Neg logit contributions
 stool-0.022
 tray-0.021
 log-0.020
 chair-0.019
 microwave-0.019
Token boot
Token ID6297
Feature activation+8.42e-03

Pos logit contributions
 stool+0.013
 tray+0.012
 Deal+0.012
 microwave+0.012
 log+0.012
Neg logit contributions
P-0.012
over-0.012
lyss-0.011
Sus-0.010
amed-0.010
Token returned
Token ID4504
Feature activation-1.60e-02

Pos logit contributions
 stool+0.015
 log+0.014
 tray+0.014
 chair+0.014
 scale+0.014
Neg logit contributions
P-0.012
over-0.012
beh-0.011
date-0.011
Sus-0.010
Token to
Token ID284
Feature activation+1.59e-02

Pos logit contributions
P+0.029
over+0.027
beh+0.026
liest+0.026
Em+0.025
Neg logit contributions
 stool-0.024
 log-0.023
 tray-0.022
 chair-0.022
 scale-0.022
Token its
Token ID663
Feature activation-8.13e-03

Pos logit contributions
 stool+0.029
 tray+0.028
 log+0.028
 chair+0.027
 microwave+0.027
Neg logit contributions
over-0.021
P-0.021
liest-0.019
hered-0.019
Sus-0.019
Token rightful
Token ID43133
Feature activation-8.66e-02

Pos logit contributions
over+0.014
P+0.013
liest+0.012
Sus+0.012
hered+0.012
Neg logit contributions
 stool-0.012
 tray-0.012
 log-0.012
 chair-0.011
 microwave-0.011
Token owners
Token ID4393
Feature activation-1.26e-02

Pos logit contributions
over+0.148
P+0.145
hered+0.144
date+0.138
beh+0.138
Neg logit contributions
 log-0.125
 stool-0.125
 chair-0.120
 tray-0.118
 thinks-0.115
Token and
Token ID290
Feature activation-3.46e-04

Pos logit contributions
P+0.024
over+0.023
amed+0.021
Sus+0.021
beh+0.021
Neg logit contributions
 stool-0.017
 tray-0.016
 log-0.016
 chair-0.016
 oven-0.015
Token the
Token ID262
Feature activation-2.42e-03

Pos logit contributions
P+0.001
over+0.001
lish+0.001
liest+0.001
Sus+0.001
Neg logit contributions
 log-0.001
 stool-0.001
 tray-0.000
 chair-0.000
 scale-0.000
Token conflict
Token ID5358
Feature activation-3.25e-02

Pos logit contributions
over+0.004
P+0.004
Sus+0.003
liest+0.003
hered+0.003
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 microwave-0.003
 chair-0.003
Token was
Token ID373
Feature activation+4.61e-03

Pos logit contributions
P+0.053
over+0.048
Sus+0.043
beh+0.043
lish+0.042
Neg logit contributions
 stool-0.054
 tray-0.054
 log-0.051
 chair-0.051
 oven-0.051
Token returned
Token ID4504
Feature activation-3.33e-02

Pos logit contributions
over+0.010
P+0.010
liest+0.009
hered+0.009
lish+0.009
Neg logit contributions
 stool-0.008
 log-0.007
 tray-0.007
 scale-0.007
 chair-0.007
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
P+0.058
over+0.057
liest+0.055
lyss+0.053
lish+0.052
Neg logit contributions
 stool-0.050
 log-0.048
 chair-0.047
 tray-0.046
 scale-0.045
Token towel
Token ID24808
Feature activation+5.63e-03

Pos logit contributions
P+0.030
over+0.030
Sus+0.027
hered+0.027
liest+0.027
Neg logit contributions
 stool-0.019
 tray-0.018
 log-0.018
 microwave-0.017
 oven-0.017
Token to
Token ID284
Feature activation+1.47e-02

Pos logit contributions
 stool+0.010
 tray+0.010
 chair+0.009
 log+0.009
 oven+0.009
Neg logit contributions
P-0.008
over-0.008
lyss-0.007
Sus-0.007
Em-0.007
Token its
Token ID663
Feature activation-8.72e-03

Pos logit contributions
 stool+0.024
 tray+0.024
 log+0.023
 chair+0.023
 scale+0.022
Neg logit contributions
P-0.023
over-0.023
liest-0.021
amed-0.021
Sus-0.020
Token rightful
Token ID43133
Feature activation-8.62e-02

Pos logit contributions
over+0.014
P+0.014
liest+0.012
Sus+0.012
hered+0.012
Neg logit contributions
 stool-0.014
 tray-0.013
 log-0.013
 microwave-0.013
 chair-0.013
Token owner
Token ID4870
Feature activation-4.87e-03

Pos logit contributions
P+0.150
over+0.149
hered+0.148
beh+0.143
date+0.142
Neg logit contributions
 stool-0.120
 log-0.117
 chair-0.116
 tray-0.115
 thinks-0.110
Token.
Token ID13
Feature activation-7.01e-03

Pos logit contributions
P+0.007
over+0.007
Sus+0.007
amed+0.006
liest+0.006
Neg logit contributions
 stool-0.008
 tray-0.008
 log-0.008
 chair-0.008
 oven-0.007
Token The
Token ID383
Feature activation+1.25e-02

Pos logit contributions
over+0.011
P+0.011
 Emmy+0.010
Sus+0.010
 such+0.010
Neg logit contributions
 stool-0.011
 log-0.011
 tray-0.011
 scale-0.010
 fork-0.010
Token owner
Token ID4870
Feature activation-1.09e-02

Pos logit contributions
 stool+0.017
 log+0.016
 tray+0.015
ccoli+0.015
 Deal+0.015
Neg logit contributions
over-0.022
P-0.022
date-0.020
lyss-0.020
hered-0.020
Token was
Token ID373
Feature activation+1.25e-02

Pos logit contributions
over+0.015
P+0.014
liest+0.013
hered+0.013
Sus+0.012
Neg logit contributions
 stool-0.020
 tray-0.020
 log-0.019
 chair-0.019
 oven-0.019
Token for
Token ID329
Feature activation-4.73e-03

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 chair+0.002
 scale+0.002
Neg logit contributions
P-0.002
over-0.002
beh-0.002
Sus-0.002
amed-0.002
Token being
Token ID852
Feature activation-2.83e-02

Pos logit contributions
P+0.008
amed+0.008
liest+0.008
ed+0.007
over+0.007
Neg logit contributions
 log-0.007
 stool-0.007
 scale-0.007
 chair-0.006
 tray-0.006
Token so
Token ID523
Feature activation+1.78e-02

Pos logit contributions
P+0.045
hered+0.045
Ell+0.044
Ph+0.043
Sus+0.043
Neg logit contributions
 fork-0.034
 scale-0.033
 log-0.033
 microwave-0.032
 tray-0.032
Token well
Token ID880
Feature activation-3.45e-02

Pos logit contributions
 stool+0.023
 log+0.023
 scale+0.022
 fork+0.022
 tray+0.022
Neg logit contributions
P-0.031
over-0.031
hered-0.030
date-0.029
amed-0.029
Token-
Token ID12
Feature activation-3.49e-02

Pos logit contributions
P+0.060
Sus+0.056
amed+0.055
over+0.054
beh+0.054
Neg logit contributions
 stool-0.048
 log-0.046
 tray-0.046
 scale-0.045
 chair-0.044
Tokenbeh
Token ID20709
Feature activation-8.53e-02

Pos logit contributions
P+0.037
over+0.036
 Emmy+0.035
Em+0.034
lyss+0.033
Neg logit contributions
 stool-0.061
 scale-0.058
 microwave-0.058
 oven-0.058
 log-0.057
Tokenaved
Token ID9586
Feature activation+1.87e-02

Pos logit contributions
P+0.125
over+0.120
Sus+0.107
 such+0.106
lish+0.105
Neg logit contributions
 stool-0.136
 tray-0.133
 log-0.131
ses-0.130
 microwave-0.130
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
 stool+0.028
 tray+0.027
 log+0.027
 chair+0.026
 scale+0.026
Neg logit contributions
P-0.033
beh-0.029
lyss-0.028
Sus-0.028
over-0.028
Token Jack
Token ID3619
Feature activation-1.49e-02

Pos logit contributions
P+0.017
over+0.017
hered+0.015
liest+0.015
date+0.015
Neg logit contributions
 stool-0.015
 log-0.015
 chair-0.015
 scale-0.015
 fork-0.014
Token was
Token ID373
Feature activation-4.34e-04

Pos logit contributions
P+0.020
over+0.020
Sus+0.018
date+0.018
hered+0.017
Neg logit contributions
 stool-0.027
 log-0.026
 tray-0.026
 chair-0.025
 scale-0.025
Token delighted
Token ID20707
Feature activation+3.49e-03

Pos logit contributions
P+0.001
over+0.001
hered+0.001
liest+0.001
lish+0.001
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.001
 chair-0.001
 oven-0.001
TokenIt
Token ID1026
Feature activation-9.73e-03

Pos logit contributions
 stool+0.002
 log+0.002
 tray+0.002
 chair+0.002
 microwave+0.002
Neg logit contributions
over-0.001
P-0.001
hered-0.001
date-0.001
liest-0.001
Token helps
Token ID5419
Feature activation+5.14e-03

Pos logit contributions
P+0.011
over+0.010
ed+0.010
liest+0.010
 such+0.010
Neg logit contributions
 stool-0.020
 tray-0.019
 microwave-0.018
 Deal-0.018
 log-0.018
Token us
Token ID514
Feature activation+8.73e-04

Pos logit contributions
 stool+0.008
 tray+0.008
 log+0.008
 oven+0.008
 chair+0.008
Neg logit contributions
P-0.008
over-0.008
lyss-0.007
hered-0.007
amed-0.007
Token remember
Token ID3505
Feature activation-1.00e-02

Pos logit contributions
 stool+0.001
 tray+0.001
 log+0.001
 chair+0.001
 microwave+0.001
Neg logit contributions
over-0.001
P-0.001
liest-0.001
hered-0.001
Sus-0.001
Token when
Token ID618
Feature activation-7.89e-03

Pos logit contributions
over+0.014
P+0.014
hered+0.013
amed+0.012
date+0.012
Neg logit contributions
 stool-0.018
 log-0.017
 tray-0.017
 chair-0.016
 oven-0.016
Token birth
Token ID4082
Feature activation-8.51e-02

Pos logit contributions
P+0.012
over+0.012
amed+0.011
hered+0.011
liest+0.011
Neg logit contributions
 stool-0.013
 log-0.013
 tray-0.013
 chair-0.012
 oven-0.012
Tokendays
Token ID12545
Feature activation-1.59e-02

Pos logit contributions
over+0.121
P+0.119
hered+0.108
date+0.108
liest+0.108
Neg logit contributions
 stool-0.148
 tray-0.143
 log-0.142
 chair-0.136
 microwave-0.135
Token and
Token ID290
Feature activation-7.47e-03

Pos logit contributions
P+0.030
over+0.029
Sus+0.027
amed+0.027
Ag+0.026
Neg logit contributions
 stool-0.021
 tray-0.021
 oven-0.020
 log-0.020
 scale-0.020
Token special
Token ID2041
Feature activation-3.60e-02

Pos logit contributions
P+0.012
over+0.012
liest+0.011
hered+0.011
Sus+0.011
Neg logit contributions
 stool-0.012
 log-0.011
 tray-0.011
 chair-0.011
 scale-0.011
Token days
Token ID1528
Feature activation-2.24e-02

Pos logit contributions
P+0.060
over+0.058
Sus+0.054
amed+0.054
hered+0.053
Neg logit contributions
 stool-0.055
 tray-0.054
 log-0.053
 oven-0.051
 chair-0.051
Token are
Token ID389
Feature activation-1.66e-02

Pos logit contributions
P+0.038
over+0.036
Sus+0.034
amed+0.034
lyss+0.033
Neg logit contributions
 stool-0.034
 tray-0.033
 log-0.032
 oven-0.031
 chair-0.031
Token.
Token ID13
Feature activation+3.91e-03

Pos logit contributions
 stool+0.020
 tray+0.020
 log+0.019
 microwave+0.019
 scale+0.019
Neg logit contributions
P-0.014
over-0.014
Sus-0.013
liest-0.013
hered-0.012
Token She
Token ID1375
Feature activation+1.12e-02

Pos logit contributions
 stool+0.006
 log+0.006
 tray+0.006
 tractor+0.006
 scale+0.006
Neg logit contributions
over-0.006
P-0.006
 Emmy-0.005
date-0.005
Sus-0.005
Token was
Token ID373
Feature activation-1.69e-03

Pos logit contributions
 stool+0.019
 log+0.019
 tray+0.019
 chair+0.018
 fork+0.018
Neg logit contributions
over-0.016
P-0.015
hered-0.015
liest-0.014
date-0.014
Token very
Token ID845
Feature activation+4.28e-03

Pos logit contributions
over+0.003
P+0.003
liest+0.002
hered+0.002
Sus+0.002
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 chair-0.003
 oven-0.003
Token nervous
Token ID10927
Feature activation-1.43e-03

Pos logit contributions
 stool+0.006
 tray+0.006
 log+0.006
 chair+0.006
 microwave+0.006
Neg logit contributions
over-0.007
P-0.007
liest-0.007
hered-0.006
Sus-0.006
Token because
Token ID780
Feature activation+4.36e-02

Pos logit contributions
P+0.002
over+0.002
amed+0.002
hered+0.002
date+0.002
Neg logit contributions
 stool-0.002
 log-0.002
 tray-0.002
 chair-0.002
 fork-0.002
Token she
Token ID673
Feature activation-5.27e-03

Pos logit contributions
 stool+0.082
 tray+0.080
 log+0.080
 scale+0.077
 microwave+0.077
Neg logit contributions
over-0.054
P-0.054
Sus-0.049
 such-0.049
 Emmy-0.048
Token had
Token ID550
Feature activation-1.45e-02

Pos logit contributions
over+0.007
P+0.007
liest+0.006
hered+0.006
Sus+0.006
Neg logit contributions
 stool-0.009
 tray-0.009
 log-0.009
 chair-0.009
 microwave-0.009
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
over+0.027
P+0.026
liest+0.025
hered+0.025
lish+0.025
Neg logit contributions
 stool-0.020
 chair-0.020
 tray-0.018
 thinks-0.018
 log-0.018
Token w
Token ID266
Feature activation+3.42e-02

Pos logit contributions
P+0.017
over+0.017
date+0.015
lyss+0.015
 Emmy+0.015
Neg logit contributions
 stool-0.015
 log-0.015
 Deal-0.015
 tray-0.015
 microwave-0.014
Tokenigg
Token ID6950
Feature activation+4.38e-02

Pos logit contributions
 stool+0.044
 tray+0.041
 log+0.041
 tractor+0.038
 chair+0.038
Neg logit contributions
over-0.066
P-0.064
liest-0.058
date-0.058
ed-0.058
Token Lily
Token ID20037
Feature activation-4.72e-03

Pos logit contributions
 carrot+0.014
 fork+0.013
 Leo+0.013
 thinks+0.013
 log+0.012
Neg logit contributions
lish-0.029
over-0.028
liest-0.027
amed-0.027
ating-0.027
Token were
Token ID547
Feature activation-3.71e-03

Pos logit contributions
over+0.005
P+0.005
Sus+0.005
liest+0.005
hered+0.004
Neg logit contributions
 stool-0.010
 tray-0.009
 log-0.009
 chair-0.009
 microwave-0.009
Token playing
Token ID2712
Feature activation+8.12e-03

Pos logit contributions
P+0.004
over+0.004
amed+0.003
 such+0.003
date+0.003
Neg logit contributions
 stool-0.008
 log-0.008
 scale-0.008
 tray-0.008
 fork-0.007
Token in
Token ID287
Feature activation+2.06e-03

Pos logit contributions
 tray+0.018
 log+0.018
 stool+0.018
 chair+0.017
 fork+0.017
Neg logit contributions
over-0.008
beh-0.007
Sus-0.007
lyss-0.007
P-0.007
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
 stool+0.004
 tray+0.004
 log+0.004
 microwave+0.004
 oven+0.004
Neg logit contributions
P-0.002
over-0.002
Sus-0.002
 such-0.002
 Emmy-0.002
Token snow
Token ID6729
Feature activation+4.14e-02

Pos logit contributions
over+0.045
P+0.043
hered+0.042
liest+0.042
ed+0.041
Neg logit contributions
 stool-0.024
 log-0.024
 tray-0.023
 chair-0.022
 oven-0.022
Token.
Token ID13
Feature activation+4.53e-03

Pos logit contributions
 stool+0.073
 tray+0.070
 log+0.070
 chair+0.067
 microwave+0.067
Neg logit contributions
over-0.058
P-0.058
liest-0.052
Sus-0.051
hered-0.051
Token They
Token ID1119
Feature activation+1.37e-02

Pos logit contributions
 stool+0.007
 log+0.007
 tray+0.006
 scale+0.006
 tractor+0.006
Neg logit contributions
over-0.007
P-0.007
date-0.007
lyss-0.007
 Emmy-0.007
Token wanted
Token ID2227
Feature activation+3.57e-02

Pos logit contributions
 stool+0.027
 tray+0.026
 log+0.026
 microwave+0.025
 oven+0.025
Neg logit contributions
P-0.016
over-0.016
Sus-0.014
amed-0.014
ed-0.014
Token to
Token ID284
Feature activation+5.59e-03

Pos logit contributions
 stool+0.058
 log+0.058
 tray+0.056
 chair+0.055
 fork+0.054
Neg logit contributions
over-0.062
P-0.059
hered-0.055
liest-0.054
beh-0.054
Token make
Token ID787
Feature activation+2.43e-03

Pos logit contributions
 stool+0.009
 tray+0.008
 Deal+0.008
 microwave+0.008
 oven+0.008
Neg logit contributions
P-0.009
over-0.008
 such-0.008
date-0.008
Sus-0.008
Token to
Token ID284
Feature activation+7.35e-03

Pos logit contributions
 stool+0.024
 tray+0.023
 log+0.023
 microwave+0.022
 oven+0.022
Neg logit contributions
over-0.015
P-0.014
hered-0.013
ed-0.013
liest-0.012
Token find
Token ID1064
Feature activation+7.28e-03

Pos logit contributions
 stool+0.010
 tray+0.009
 log+0.009
 chair+0.009
 scale+0.009
Neg logit contributions
over-0.014
P-0.014
liest-0.013
amed-0.012
ed-0.012
Token him
Token ID683
Feature activation+5.64e-03

Pos logit contributions
 stool+0.013
 tray+0.013
 log+0.012
 chair+0.012
 microwave+0.012
Neg logit contributions
over-0.010
P-0.010
liest-0.009
Sus-0.009
hered-0.009
Token.
Token ID13
Feature activation+2.49e-03

Pos logit contributions
 stool+0.011
 tray+0.010
 log+0.010
 microwave+0.010
 Deal+0.010
Neg logit contributions
over-0.007
P-0.007
 such-0.006
ed-0.006
liest-0.006
Token He
Token ID679
Feature activation+1.71e-02

Pos logit contributions
 stool+0.005
 tray+0.004
 log+0.004
 oven+0.004
 tractor+0.004
Neg logit contributions
over-0.003
P-0.003
ed-0.003
 such-0.003
Em-0.003
Token hopes
Token ID7176
Feature activation+3.38e-02

Pos logit contributions
 stool+0.026
 tray+0.024
 tractor+0.023
arl+0.022
 poker+0.022
Neg logit contributions
 met-0.028
ed-0.026
P-0.024
 such-0.024
 became-0.024
Token she
Token ID673
Feature activation+7.34e-03

Pos logit contributions
 stool+0.052
 tray+0.050
 log+0.050
 chair+0.049
 fork+0.047
Neg logit contributions
P-0.057
over-0.056
amed-0.051
liest-0.051
beh-0.051
Token will
Token ID481
Feature activation+7.16e-03

Pos logit contributions
 stool+0.012
 tray+0.011
 Deal+0.011
 microwave+0.010
 oven+0.010
Neg logit contributions
ed-0.011
 met-0.011
 such-0.010
over-0.010
P-0.010
Token not
Token ID407
Feature activation-3.17e-03

Pos logit contributions
 stool+0.012
 tray+0.012
 microwave+0.012
 log+0.011
 oven+0.011
Neg logit contributions
over-0.010
 such-0.010
P-0.009
date-0.009
Sus-0.009
Token see
Token ID766
Feature activation+2.47e-03

Pos logit contributions
over+0.006
P+0.006
amed+0.006
liest+0.006
ed+0.005
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.004
 scale-0.004
 chair-0.004
Token his
Token ID465
Feature activation-1.13e-02

Pos logit contributions
 stool+0.005
 tray+0.005
 log+0.005
 microwave+0.004
 chair+0.004
Neg logit contributions
over-0.003
P-0.003
 such-0.003
liest-0.003
ed-0.003
Token on
Token ID319
Feature activation+1.49e-03

Pos logit contributions
 stool+0.034
 tray+0.033
 oven+0.033
ses+0.031
 log+0.031
Neg logit contributions
Sus-0.031
lyss-0.029
P-0.029
Em-0.029
liest-0.027
Token.
Token ID13
Feature activation-2.34e-03

Pos logit contributions
 stool+0.003
 tray+0.002
 log+0.002
 chair+0.002
 oven+0.002
Neg logit contributions
P-0.002
over-0.002
amed-0.002
Sus-0.002
liest-0.002
Token<|endoftext|>
Token ID50256
Feature activation+2.02e-02

Pos logit contributions
over+0.004
P+0.004
Sus+0.003
 Emmy+0.003
 such+0.003
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.003
 scale-0.003
 chair-0.003
TokenL
Token ID43
Feature activation+1.69e-02

Pos logit contributions
 stool+0.025
 tray+0.025
 log+0.025
 chair+0.024
 microwave+0.023
Neg logit contributions
over-0.037
beh-0.036
liest-0.035
amed-0.035
P-0.035
Tokenily
Token ID813
Feature activation+1.45e-03

Pos logit contributions
 stool+0.032
 log+0.031
 tray+0.031
 scale+0.030
ses+0.030
Neg logit contributions
P-0.022
over-0.021
Sus-0.019
 such-0.019
amed-0.018
Token liked
Token ID8288
Feature activation+3.12e-02

Pos logit contributions
 stool+0.003
 tray+0.002
 log+0.002
 chair+0.002
 microwave+0.002
Neg logit contributions
over-0.002
P-0.002
liest-0.002
Sus-0.002
hered-0.002
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
 stool+0.064
 tray+0.062
 log+0.062
 Deal+0.060
 microwave+0.060
Neg logit contributions
P-0.034
over-0.033
 such-0.029
Sus-0.029
liest-0.028
Token play
Token ID711
Feature activation+6.91e-03

Pos logit contributions
P+0.022
over+0.021
date+0.020
 Emmy+0.019
Sus+0.019
Neg logit contributions
 stool-0.021
 log-0.020
 tray-0.020
 microwave-0.019
 oven-0.019
Token with
Token ID351
Feature activation-2.46e-03

Pos logit contributions
 tray+0.015
 stool+0.014
 log+0.014
 scale+0.014
 chair+0.014
Neg logit contributions
over-0.007
beh-0.007
amed-0.007
Sus-0.007
lyss-0.007
Token her
Token ID607
Feature activation-7.53e-03

Pos logit contributions
over+0.004
P+0.003
liest+0.003
hered+0.003
amed+0.003
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 chair-0.004
 fork-0.004
Token toy
Token ID13373
Feature activation+2.19e-02

Pos logit contributions
P+0.013
over+0.013
 Emmy+0.011
date+0.011
hered+0.011
Neg logit contributions
 stool-0.011
 log-0.010
 Deal-0.010
 microwave-0.010
 tray-0.010
Token back
Token ID736
Feature activation+1.13e-02

Pos logit contributions
 stool+0.017
 tray+0.017
 log+0.017
 chair+0.017
 thinks+0.017
Neg logit contributions
P-0.026
over-0.024
beh-0.023
Em-0.023
date-0.023
Token and
Token ID290
Feature activation+6.69e-03

Pos logit contributions
 stool+0.020
 tray+0.019
 log+0.019
 microwave+0.018
 chair+0.018
Neg logit contributions
over-0.017
P-0.016
ed-0.015
liest-0.014
hered-0.014
Token smiled
Token ID13541
Feature activation+3.15e-03

Pos logit contributions
 log+0.008
 thinks+0.008
 stool+0.008
 chair+0.008
 scale+0.007
Neg logit contributions
over-0.015
P-0.013
date-0.013
liest-0.013
aming-0.012
Token.
Token ID13
Feature activation+3.77e-04

Pos logit contributions
 stool+0.004
 log+0.004
 tray+0.004
 chair+0.004
 fork+0.004
Neg logit contributions
P-0.006
lyss-0.005
over-0.005
Em-0.005
beh-0.005
Token He
Token ID679
Feature activation+1.42e-02

Pos logit contributions
 stool+0.001
 tray+0.000
 log+0.000
 chair+0.000
 scale+0.000
Neg logit contributions
over-0.001
P-0.001
Sus-0.001
liest-0.001
lyss-0.001
Token couldn
Token ID3521
Feature activation+3.13e-02

Pos logit contributions
 stool+0.023
 log+0.023
 tray+0.022
 thinks+0.022
 scale+0.022
Neg logit contributions
over-0.022
P-0.021
date-0.019
beh-0.019
liest-0.019
Token't
Token ID470
Feature activation+8.26e-03

Pos logit contributions
 stool+0.037
 tray+0.035
 log+0.035
 chair+0.033
 scale+0.033
Neg logit contributions
over-0.062
P-0.061
amed-0.056
liest-0.056
Sus-0.056
Token help
Token ID1037
Feature activation-6.19e-03

Pos logit contributions
 stool+0.011
 log+0.011
 scale+0.011
 tray+0.011
 fork+0.011
Neg logit contributions
P-0.015
over-0.014
amed-0.013
beh-0.013
ed-0.013
Token but
Token ID475
Feature activation+1.87e-02

Pos logit contributions
P+0.014
amed+0.014
over+0.013
beh+0.013
ed+0.013
Neg logit contributions
 stool-0.007
 log-0.006
 chair-0.006
 scale-0.006
 tray-0.006
Token feel
Token ID1254
Feature activation+3.60e-03

Pos logit contributions
 stool+0.023
 tray+0.022
 log+0.022
 chair+0.020
 scale+0.020
Neg logit contributions
P-0.037
over-0.037
liest-0.034
amed-0.034
Sus-0.033
Token happy
Token ID3772
Feature activation-8.65e-03

Pos logit contributions
 log+0.005
 stool+0.005
 thinks+0.004
 tray+0.004
 scale+0.004
Neg logit contributions
P-0.007
over-0.006
date-0.006
Sus-0.006
hered-0.006
Token water
Token ID1660
Feature activation+1.16e-02

Pos logit contributions
 stool+0.009
 tray+0.008
 log+0.008
 chair+0.008
 microwave+0.008
Neg logit contributions
over-0.010
P-0.010
liest-0.009
 such-0.009
hered-0.009
Token.
Token ID13
Feature activation+1.98e-03

Pos logit contributions
 stool+0.023
 Deal+0.023
 log+0.022
 tray+0.022
 microwave+0.021
Neg logit contributions
over-0.013
P-0.012
ed-0.012
 such-0.011
liest-0.011
Token They
Token ID1119
Feature activation+2.13e-02

Pos logit contributions
 stool+0.003
 log+0.003
 tray+0.003
 scale+0.003
 fork+0.003
Neg logit contributions
over-0.003
P-0.003
 such-0.003
Sus-0.003
ed-0.003
Token make
Token ID787
Feature activation+2.21e-04

Pos logit contributions
 stool+0.036
 microwave+0.034
 oven+0.033
 tractor+0.033
ses+0.033
Neg logit contributions
 such-0.024
 met-0.024
P-0.022
 involved-0.022
 created-0.022
Token the
Token ID262
Feature activation+5.49e-03

Pos logit contributions
 stool+0.000
 Deal+0.000
 tray+0.000
 log+0.000
 scale+0.000
Neg logit contributions
over-0.000
 such-0.000
P-0.000
ed-0.000
amed-0.000
Token shark
Token ID21027
Feature activation+4.63e-03

Pos logit contributions
 stool+0.007
 tray+0.007
 log+0.007
 Deal+0.007
 oven+0.006
Neg logit contributions
over-0.010
P-0.010
date-0.009
lyss-0.009
 such-0.009
Token swim
Token ID9422
Feature activation+1.15e-02

Pos logit contributions
 stool+0.007
 Deal+0.007
 microwave+0.006
 tray+0.006
ses+0.006
Neg logit contributions
over-0.008
P-0.007
 such-0.007
ed-0.007
hered-0.007
Token and
Token ID290
Feature activation+9.07e-03

Pos logit contributions
 stool+0.019
 tray+0.018
 log+0.018
 chair+0.017
 microwave+0.017
Neg logit contributions
over-0.018
P-0.018
liest-0.016
hered-0.016
amed-0.016
Token splash
Token ID22870
Feature activation+1.30e-02

Pos logit contributions
 stool+0.012
 tray+0.012
 Deal+0.011
 microwave+0.011
 oven+0.011
Neg logit contributions
P-0.015
over-0.015
 such-0.015
hered-0.014
ed-0.014
Token.
Token ID13
Feature activation+2.91e-03

Pos logit contributions
 stool+0.022
 tray+0.022
 log+0.022
 chair+0.021
 fork+0.021
Neg logit contributions
over-0.019
P-0.019
lyss-0.017
Sus-0.017
hered-0.017
Token They
Token ID1119
Feature activation+2.19e-02

Pos logit contributions
 stool+0.005
 tray+0.004
 log+0.004
 scale+0.004
 fork+0.004
Neg logit contributions
over-0.005
P-0.005
 such-0.004
Sus-0.004
ed-0.004
Token pocket
Token ID10000
Feature activation+8.08e-03

Pos logit contributions
over+0.038
P+0.037
Sus+0.035
hered+0.035
liest+0.035
Neg logit contributions
 stool-0.023
 tray-0.022
 log-0.022
 chair-0.021
 scale-0.020
Token.
Token ID13
Feature activation-2.77e-03

Pos logit contributions
 stool+0.012
 tray+0.012
 log+0.011
 chair+0.011
 scale+0.011
Neg logit contributions
P-0.014
over-0.014
date-0.013
lyss-0.013
hered-0.013
Token
Token ID198
Feature activation-3.22e-04

Pos logit contributions
over+0.004
P+0.004
Sus+0.004
liest+0.004
ed+0.004
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 scale-0.004
 microwave-0.004
Token
Token ID198
Feature activation-7.58e-03

Pos logit contributions
over+0.000
P+0.000
Sus+0.000
liest+0.000
hered+0.000
Neg logit contributions
 stool-0.001
 tray-0.001
 log-0.000
 microwave-0.000
 chair-0.000
TokenAs
Token ID1722
Feature activation+1.68e-02

Pos logit contributions
over+0.010
P+0.009
hered+0.009
liest+0.009
 such+0.009
Neg logit contributions
 stool-0.014
 tray-0.014
 log-0.014
 chair-0.013
 scale-0.013
Token she
Token ID673
Feature activation-7.33e-03

Pos logit contributions
 stool+0.025
 chair+0.023
 log+0.023
 tray+0.022
 mat+0.021
Neg logit contributions
over-0.030
P-0.029
liest-0.029
beh-0.029
amed-0.028
Token walked
Token ID6807
Feature activation-1.85e-02

Pos logit contributions
over+0.010
P+0.009
liest+0.009
Sus+0.008
hered+0.008
Neg logit contributions
 stool-0.013
 log-0.013
 tray-0.013
 chair-0.013
 scale-0.012
Token through
Token ID832
Feature activation-7.43e-03

Pos logit contributions
over+0.027
P+0.027
beh+0.025
liest+0.024
Em+0.023
Neg logit contributions
 log-0.033
 stool-0.033
 chair-0.031
 tray-0.030
 scale-0.030
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
over+0.012
P+0.012
beh+0.011
amed+0.011
liest+0.011
Neg logit contributions
 stool-0.013
 chair-0.012
 log-0.012
 tray-0.012
 rug-0.011
Token park
Token ID3952
Feature activation+5.92e-03

Pos logit contributions
over+0.037
P+0.035
hered+0.034
beh+0.034
amed+0.033
Neg logit contributions
 log-0.022
 stool-0.022
 chair-0.021
 scale-0.020
 tray-0.020
Token,
Token ID11
Feature activation-8.55e-03

Pos logit contributions
 chair+0.007
 stool+0.007
 ladder+0.007
 log+0.006
 rug+0.006
Neg logit contributions
P-0.014
beh-0.013
Em-0.012
over-0.012
aming-0.011
Token He
Token ID679
Feature activation+2.71e-03

Pos logit contributions
over+0.022
P+0.021
liest+0.019
Sus+0.019
hered+0.019
Neg logit contributions
 stool-0.021
 tray-0.021
 log-0.020
 chair-0.019
 microwave-0.019
Token grabbed
Token ID13176
Feature activation-4.32e-03

Pos logit contributions
 stool+0.005
 tray+0.005
 log+0.005
 chair+0.005
 microwave+0.005
Neg logit contributions
P-0.003
over-0.003
liest-0.003
amed-0.003
ed-0.003
Token it
Token ID340
Feature activation+6.67e-03

Pos logit contributions
P+0.004
 such+0.004
over+0.004
ed+0.004
hered+0.004
Neg logit contributions
 stool-0.009
 tray-0.009
 log-0.009
 Deal-0.008
ses-0.008
Token and
Token ID290
Feature activation+4.01e-03

Pos logit contributions
 stool+0.011
 Deal+0.011
 microwave+0.011
 tray+0.011
ses+0.011
Neg logit contributions
over-0.009
P-0.009
ed-0.008
hered-0.008
liest-0.008
Token threw
Token ID9617
Feature activation-3.11e-05

Pos logit contributions
 stool+0.007
 log+0.006
 tray+0.006
 chair+0.006
 scale+0.006
Neg logit contributions
over-0.007
P-0.006
date-0.006
liest-0.006
lyss-0.006
Token it
Token ID340
Feature activation-3.05e-03

Pos logit contributions
P+0.000
over+0.000
 such+0.000
ed+0.000
liest+0.000
Neg logit contributions
 stool-0.000
 tray-0.000
 log-0.000
 microwave-0.000
 oven-0.000
Token on
Token ID319
Feature activation+2.59e-03

Pos logit contributions
P+0.003
over+0.003
ed+0.002
hered+0.002
amed+0.002
Neg logit contributions
 stool-0.007
 tray-0.007
 microwave-0.007
 log-0.007
 Deal-0.006
Token the
Token ID262
Feature activation+6.15e-04

Pos logit contributions
 stool+0.005
 tray+0.005
ses+0.005
 log+0.005
 Deal+0.005
Neg logit contributions
over-0.003
P-0.003
 such-0.003
liest-0.002
hered-0.002
Token ground
Token ID2323
Feature activation+1.77e-02

Pos logit contributions
 stool+0.001
 log+0.001
 tray+0.001
 Deal+0.001
ses+0.001
Neg logit contributions
over-0.001
P-0.001
liest-0.001
 such-0.001
amed-0.001
Token.
Token ID13
Feature activation-8.64e-03

Pos logit contributions
 stool+0.033
 tray+0.032
 log+0.031
 microwave+0.030
 chair+0.030
Neg logit contributions
over-0.023
P-0.023
liest-0.020
hered-0.020
ed-0.020
Token The
Token ID383
Feature activation+1.55e-02

Pos logit contributions
over+0.012
P+0.012
ed+0.011
 such+0.011
Sus+0.011
Neg logit contributions
 stool-0.015
 tray-0.015
 log-0.015
 fork-0.014
 tractor-0.014
Token crabs
Token ID49172
Feature activation-2.42e-02

Pos logit contributions
over+0.001
P+0.001
hered+0.001
beh+0.001
amed+0.001
Neg logit contributions
 stool-0.001
 log-0.001
 tray-0.001
 chair-0.001
 fork-0.001
Token:
Token ID25
Feature activation+5.36e-03

Pos logit contributions
over+0.033
P+0.032
lyss+0.030
liest+0.030
Sus+0.030
Neg logit contributions
 log-0.041
 stool-0.040
 tray-0.040
 chair-0.039
 oven-0.039
Token Max
Token ID5436
Feature activation+9.13e-03

Pos logit contributions
 stool+0.009
 tray+0.009
 log+0.009
 chair+0.008
 microwave+0.008
Neg logit contributions
over-0.008
P-0.008
liest-0.007
Sus-0.007
hered-0.007
Token and
Token ID290
Feature activation+8.11e-03

Pos logit contributions
 stool+0.015
 tray+0.014
 log+0.013
 microwave+0.013
 Deal+0.013
Neg logit contributions
over-0.015
P-0.015
hered-0.013
liest-0.013
 such-0.013
Token Sam
Token ID3409
Feature activation-4.65e-03

Pos logit contributions
 log+0.013
 stool+0.013
 thinks+0.013
 fork+0.012
 chair+0.012
Neg logit contributions
over-0.014
amed-0.012
P-0.012
aming-0.012
date-0.012
Token.
Token ID13
Feature activation-4.42e-03

Pos logit contributions
over+0.006
P+0.006
liest+0.005
hered+0.005
Sus+0.005
Neg logit contributions
 stool-0.009
 tray-0.008
 log-0.008
 microwave-0.008
 chair-0.008
Token Max
Token ID5436
Feature activation-1.53e-03

Pos logit contributions
over+0.005
hered+0.005
 Emmy+0.005
P+0.005
 such+0.005
Neg logit contributions
 stool-0.008
 log-0.008
 tray-0.008
 chair-0.007
 oven-0.007
Token was
Token ID373
Feature activation+6.51e-03

Pos logit contributions
over+0.002
P+0.002
date+0.002
Sus+0.002
hered+0.002
Neg logit contributions
 log-0.003
 stool-0.003
 tray-0.002
 thinks-0.002
 scale-0.002
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
 log+0.009
 stool+0.009
 tray+0.009
 scale+0.009
 oven+0.008
Neg logit contributions
over-0.011
P-0.011
liest-0.011
Sus-0.010
hered-0.010
Token very
Token ID845
Feature activation+2.49e-02

Pos logit contributions
P+0.018
over+0.017
lyss+0.016
date+0.015
Sus+0.015
Neg logit contributions
 stool-0.018
 tray-0.017
 microwave-0.017
 Deal-0.017
 log-0.017
Token gifted
Token ID22527
Feature activation-5.05e-03

Pos logit contributions
 stool+0.041
 tray+0.039
 log+0.038
 microwave+0.038
 chair+0.037
Neg logit contributions
P-0.039
over-0.038
lyss-0.034
Sus-0.033
date-0.033
Token each
Token ID1123
Feature activation+3.36e-02

Pos logit contributions
 stool+0.039
 log+0.038
 tray+0.037
 scale+0.037
 microwave+0.036
Neg logit contributions
over-0.019
 such-0.018
P-0.018
liest-0.016
 Emmy-0.016
Token other
Token ID584
Feature activation+1.47e-03

Pos logit contributions
 stool+0.066
ses+0.065
 log+0.064
 tray+0.063
ccoli+0.063
Neg logit contributions
P-0.039
over-0.037
 such-0.034
hered-0.033
liest-0.033
Token.
Token ID13
Feature activation-8.27e-03

Pos logit contributions
 stool+0.003
 log+0.003
 tray+0.003
 microwave+0.003
 chair+0.002
Neg logit contributions
over-0.002
P-0.002
liest-0.002
hered-0.002
ed-0.002
Token They
Token ID1119
Feature activation+1.68e-02

Pos logit contributions
P+0.016
over+0.016
amed+0.015
hered+0.015
liest+0.015
Neg logit contributions
 stool-0.010
 tray-0.010
 log-0.010
 chair-0.009
 microwave-0.009
Token looked
Token ID3114
Feature activation+1.63e-02

Pos logit contributions
 stool+0.027
 thinks+0.027
 tray+0.026
 fork+0.026
 log+0.026
Neg logit contributions
over-0.027
P-0.025
date-0.023
beh-0.023
hered-0.023
Token at
Token ID379
Feature activation+1.52e-02

Pos logit contributions
 stool+0.034
 tray+0.033
 log+0.033
 chair+0.032
 microwave+0.031
Neg logit contributions
over-0.018
P-0.018
liest-0.015
ed-0.015
hered-0.015
Token their
Token ID511
Feature activation+9.57e-04

Pos logit contributions
 stool+0.031
 log+0.030
 tray+0.030
 microwave+0.029
 scale+0.029
Neg logit contributions
over-0.017
P-0.016
 such-0.015
liest-0.015
 Emmy-0.014
Token cars
Token ID5006
Feature activation+7.74e-03

Pos logit contributions
 stool+0.001
 Deal+0.001
 log+0.001
 microwave+0.001
 tractor+0.001
Neg logit contributions
over-0.001
 such-0.001
ed-0.001
date-0.001
hered-0.001
Token.
Token ID13
Feature activation-9.58e-03

Pos logit contributions
 stool+0.013
 log+0.013
 tray+0.013
 Deal+0.012
 microwave+0.012
Neg logit contributions
over-0.011
P-0.011
hered-0.010
liest-0.010
ed-0.010
Token They
Token ID1119
Feature activation+1.79e-02

Pos logit contributions
P+0.020
over+0.020
amed+0.019
beh+0.019
hered+0.018
Neg logit contributions
 stool-0.011
 tray-0.010
 microwave-0.009
 log-0.009
 chair-0.009
Token looked
Token ID3114
Feature activation+1.45e-02

Pos logit contributions
 stool+0.029
 tray+0.028
 log+0.028
 thinks+0.028
 fork+0.028
Neg logit contributions
over-0.029
P-0.027
date-0.025
beh-0.025
hered-0.024
Token thanked
Token ID26280
Feature activation+9.60e-03

Pos logit contributions
over+0.018
P+0.017
liest+0.015
date+0.015
beh+0.015
Neg logit contributions
 stool-0.019
 log-0.019
 tray-0.018
 chair-0.018
 thinks-0.018
Token her
Token ID607
Feature activation+4.51e-03

Pos logit contributions
 stool+0.018
 tray+0.017
 log+0.017
 chair+0.016
 scale+0.016
Neg logit contributions
P-0.013
over-0.013
amed-0.011
liest-0.011
hered-0.011
Token.
Token ID13
Feature activation-9.51e-03

Pos logit contributions
 stool+0.008
 microwave+0.008
 Deal+0.008
 tray+0.008
ses+0.008
Neg logit contributions
over-0.006
 such-0.005
 Emmy-0.005
ed-0.005
liest-0.005
Token
Token ID198
Feature activation-1.53e-03

Pos logit contributions
over+0.016
P+0.016
Sus+0.014
liest+0.014
hered+0.014
Neg logit contributions
 stool-0.014
 tray-0.014
 log-0.014
 microwave-0.013
 chair-0.013
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
P+0.002
over+0.002
Sus+0.002
Em+0.002
amed+0.002
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 microwave-0.003
 oven-0.003
Token"
Token ID1
Feature activation-2.80e-02

Pos logit contributions
over+0.014
P+0.014
liest+0.013
hered+0.013
Sus+0.013
Neg logit contributions
 stool-0.019
 tray-0.019
 log-0.018
 chair-0.018
 microwave-0.018
TokenThank
Token ID10449
Feature activation+9.59e-03

Pos logit contributions
over+0.038
P+0.033
liest+0.033
beh+0.031
date+0.031
Neg logit contributions
 stool-0.050
 tray-0.048
 log-0.048
 microwave-0.047
 chair-0.047
Token you
Token ID345
Feature activation+1.69e-02

Pos logit contributions
 log+0.012
 stool+0.012
 Deal+0.012
 tray+0.012
 thinks+0.012
Neg logit contributions
amed-0.016
liest-0.015
date-0.015
over-0.015
 met-0.015
Token so
Token ID523
Feature activation+2.76e-02

Pos logit contributions
 stool+0.028
 tray+0.028
 log+0.027
 chair+0.026
 oven+0.026
Neg logit contributions
P-0.026
over-0.025
hered-0.023
beh-0.023
liest-0.023
Token much
Token ID881
Feature activation+2.21e-03

Pos logit contributions
 stool+0.047
 log+0.046
 tray+0.046
 chair+0.044
 oven+0.043
Neg logit contributions
over-0.040
P-0.040
hered-0.036
date-0.035
liest-0.035
Token for
Token ID329
Feature activation+1.30e-03

Pos logit contributions
 stool+0.004
 tray+0.004
 log+0.004
 microwave+0.004
 chair+0.004
Neg logit contributions
over-0.003
P-0.003
Sus-0.003
liest-0.003
 such-0.003
Token."
Token ID526
Feature activation-1.02e-02

Pos logit contributions
 stool+0.014
 log+0.014
 tray+0.014
ses+0.013
 microwave+0.013
Neg logit contributions
over-0.013
P-0.013
liest-0.012
 such-0.012
ed-0.012
Token
Token ID220
Feature activation+3.45e-02

Pos logit contributions
over+0.015
P+0.015
 Emmy+0.014
Sus+0.014
ed+0.014
Neg logit contributions
 stool-0.017
 log-0.017
 tray-0.016
 scale-0.016
 fork-0.016
Token
Token ID198
Feature activation-3.95e-04

Pos logit contributions
 stool+0.061
 log+0.059
 tray+0.058
 scale+0.056
 fork+0.056
Neg logit contributions
over-0.049
P-0.048
Sus-0.044
amed-0.043
ed-0.043
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
P+0.001
over+0.001
Sus+0.001
Em+0.001
amed+0.000
Neg logit contributions
 stool-0.001
 log-0.001
 tray-0.001
 scale-0.001
 oven-0.001
TokenL
Token ID43
Feature activation+7.33e-03

Pos logit contributions
P+0.009
over+0.009
Sus+0.007
Em+0.007
amed+0.007
Neg logit contributions
 stool-0.027
 log-0.027
 tray-0.026
 microwave-0.026
 scale-0.026
Tokenily
Token ID813
Feature activation-1.33e-02

Pos logit contributions
 stool+0.015
 tray+0.014
 log+0.014
 chair+0.014
 microwave+0.014
Neg logit contributions
over-0.009
P-0.008
liest-0.007
Sus-0.007
hered-0.007
Token said
Token ID531
Feature activation-8.56e-03

Pos logit contributions
over+0.018
P+0.017
liest+0.015
hered+0.015
Sus+0.015
Neg logit contributions
 stool-0.025
 tray-0.024
 log-0.024
 chair-0.023
 microwave-0.023
Token,
Token ID11
Feature activation+9.57e-03

Pos logit contributions
P+0.014
over+0.014
beh+0.013
liest+0.012
Sus+0.012
Neg logit contributions
 stool-0.014
 tray-0.013
 chair-0.013
 scale-0.012
 fork-0.012
Token "
Token ID366
Feature activation-3.40e-03

Pos logit contributions
 log+0.014
 tray+0.013
 snowball+0.013
 stool+0.013
 microwave+0.012
Neg logit contributions
P-0.016
over-0.016
ed-0.015
 Emmy-0.015
hered-0.015
TokenWow
Token ID22017
Feature activation-6.33e-03

Pos logit contributions
over+0.005
P+0.005
liest+0.004
hered+0.004
ed+0.004
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 chair-0.006
 microwave-0.006
Token,
Token ID11
Feature activation+3.64e-02

Pos logit contributions
over+0.007
P+0.007
liest+0.006
hered+0.006
Sus+0.006
Neg logit contributions
 stool-0.013
 tray-0.012
 log-0.012
 microwave-0.012
 chair-0.012
Token playing
Token ID2712
Feature activation-7.04e-03

Pos logit contributions
P+0.035
over+0.034
amed+0.032
hered+0.031
liest+0.031
Neg logit contributions
 log-0.037
 stool-0.037
 tray-0.035
 chair-0.034
 oven-0.034
Token.
Token ID13
Feature activation-3.72e-03

Pos logit contributions
P+0.008
lyss+0.008
over+0.008
beh+0.008
hered+0.008
Neg logit contributions
 log-0.014
 tray-0.014
 stool-0.013
 chair-0.013
 rug-0.013
Token
Token ID198
Feature activation-4.88e-03

Pos logit contributions
over+0.006
P+0.005
 Emmy+0.005
Sus+0.005
Em+0.005
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 scale-0.006
 fork-0.006
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
P+0.006
over+0.006
Sus+0.006
Em+0.005
B+0.005
Neg logit contributions
 stool-0.010
ses-0.010
 tray-0.010
 scale-0.009
 oven-0.009
TokenOne
Token ID3198
Feature activation+1.45e-02

Pos logit contributions
P+0.018
over+0.017
Sus+0.016
Em+0.015
date+0.015
Neg logit contributions
 stool-0.021
 tray-0.021
ses-0.020
 log-0.020
 oven-0.020
Token day
Token ID1110
Feature activation-3.87e-02

Pos logit contributions
 stool+0.028
 tray+0.027
 log+0.026
 microwave+0.026
 scale+0.025
Neg logit contributions
over-0.019
P-0.019
liest-0.016
Sus-0.016
date-0.016
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
beh+0.078
amed+0.073
organized+0.071
liest+0.071
P+0.070
Neg logit contributions
 stool-0.048
 chair-0.047
 log-0.047
 tray-0.045
 fork-0.043
Token a
Token ID257
Feature activation-1.64e-02

Pos logit contributions
P+0.020
over+0.020
beh+0.020
amed+0.020
liest+0.020
Neg logit contributions
 stool-0.020
 log-0.019
 tray-0.019
 chair-0.019
 fork-0.018
Token little
Token ID1310
Feature activation-1.74e-03

Pos logit contributions
over+0.030
P+0.030
liest+0.029
beh+0.029
ed+0.029
Neg logit contributions
 stool-0.022
 log-0.022
 tray-0.021
 chair-0.020
 scale-0.020
Token girl
Token ID2576
Feature activation-2.64e-02

Pos logit contributions
P+0.003
over+0.003
amed+0.003
beh+0.003
liest+0.003
Neg logit contributions
 log-0.002
 stool-0.002
 tray-0.002
 chair-0.002
 tractor-0.002
Token ran
Token ID4966
Feature activation-2.77e-03

Pos logit contributions
over+0.042
P+0.040
beh+0.038
lish+0.038
liest+0.037
Neg logit contributions
 log-0.040
 stool-0.040
 chair-0.039
 thinks-0.039
 oven-0.038
Token and
Token ID290
Feature activation-4.30e-03

Pos logit contributions
 stool+0.011
 tray+0.010
 log+0.010
 chair+0.010
 scale+0.010
Neg logit contributions
P-0.010
over-0.010
Sus-0.009
amed-0.009
lyss-0.009
Token learn
Token ID2193
Feature activation-2.18e-02

Pos logit contributions
over+0.008
P+0.008
liest+0.007
amed+0.007
Sus+0.007
Neg logit contributions
 stool-0.006
 log-0.006
 tray-0.006
 chair-0.006
 scale-0.005
Token all
Token ID477
Feature activation+5.43e-03

Pos logit contributions
over+0.038
P+0.037
hered+0.034
amed+0.034
Sus+0.034
Neg logit contributions
 stool-0.032
 tray-0.031
 log-0.030
 chair-0.029
 scale-0.029
Token about
Token ID546
Feature activation-1.26e-02

Pos logit contributions
 stool+0.010
 tray+0.009
 log+0.009
 microwave+0.009
 chair+0.009
Neg logit contributions
over-0.008
P-0.008
liest-0.007
Sus-0.007
 such-0.007
Token them
Token ID606
Feature activation-1.85e-02

Pos logit contributions
over+0.017
P+0.017
liest+0.015
Sus+0.015
 such+0.015
Neg logit contributions
 stool-0.022
 tray-0.021
 log-0.021
 microwave-0.020
 chair-0.020
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
over+0.026
P+0.026
liest+0.023
hered+0.023
Sus+0.023
Neg logit contributions
 stool-0.033
 tray-0.032
 log-0.031
 chair-0.030
 microwave-0.030
Token He
Token ID679
Feature activation+3.04e-03

Pos logit contributions
P+0.023
over+0.023
ed+0.021
 Emmy+0.021
 such+0.021
Neg logit contributions
 stool-0.028
 tray-0.028
 oven-0.026
 fork-0.025
 chair-0.025
Token spent
Token ID3377
Feature activation-1.25e-02

Pos logit contributions
 stool+0.006
 tray+0.006
 log+0.006
 Deal+0.006
 oven+0.006
Neg logit contributions
P-0.004
over-0.004
ed-0.003
amed-0.003
liest-0.003
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
beh+0.021
P+0.021
over+0.020
Sus+0.020
liest+0.019
Neg logit contributions
 log-0.019
 tray-0.018
 stool-0.018
 chair-0.018
 scale-0.017
Token whole
Token ID2187
Feature activation-1.77e-02

Pos logit contributions
over+0.056
P+0.055
liest+0.049
hered+0.049
Sus+0.049
Neg logit contributions
 stool-0.069
 tray-0.066
 log-0.066
 chair-0.064
 scale-0.063
Token day
Token ID1110
Feature activation-2.06e-02

Pos logit contributions
 Emmy+0.033
beh+0.032
over+0.031
amed+0.031
hered+0.031
Neg logit contributions
 log-0.024
 stool-0.022
 chair-0.022
 tray-0.021
 scale-0.021
Token y
Token ID331
Feature activation+3.15e-02

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 chair+0.002
 microwave+0.002
Neg logit contributions
over-0.002
P-0.002
liest-0.002
hered-0.002
Sus-0.002
Tokenucky
Token ID5309
Feature activation+2.85e-02

Pos logit contributions
 stool+0.051
 tray+0.050
 log+0.050
 oven+0.047
 fork+0.047
Neg logit contributions
over-0.046
P-0.046
amed-0.043
Sus-0.042
lyss-0.042
Token!
Token ID0
Feature activation+1.65e-02

Pos logit contributions
 stool+0.050
 tray+0.047
 log+0.047
 microwave+0.046
 Deal+0.046
Neg logit contributions
over-0.041
P-0.039
liest-0.036
ed-0.035
 such-0.035
Token Candy
Token ID24680
Feature activation-1.93e-02

Pos logit contributions
 stool+0.030
 tray+0.029
 log+0.028
 scale+0.027
 chair+0.027
Neg logit contributions
over-0.023
P-0.022
liest-0.021
 such-0.020
lyss-0.020
Token is
Token ID318
Feature activation-1.36e-03

Pos logit contributions
over+0.034
P+0.034
liest+0.032
ed+0.031
 such+0.031
Neg logit contributions
 stool-0.026
 tray-0.025
 log-0.024
 microwave-0.023
 Deal-0.023
Token better
Token ID1365
Feature activation-4.31e-02

Pos logit contributions
P+0.002
over+0.002
 such+0.002
liest+0.002
amed+0.002
Neg logit contributions
 stool-0.002
 tray-0.002
 log-0.002
 chair-0.002
 microwave-0.002
Token."
Token ID526
Feature activation-7.47e-03

Pos logit contributions
P+0.060
over+0.059
liest+0.056
amed+0.056
Sus+0.055
Neg logit contributions
 stool-0.077
 log-0.074
 tray-0.073
 chair-0.071
 scale-0.070
Token But
Token ID887
Feature activation-3.08e-04

Pos logit contributions
over+0.012
P+0.012
hered+0.011
liest+0.011
amed+0.011
Neg logit contributions
 stool-0.012
 tray-0.011
 log-0.011
 chair-0.011
 microwave-0.011
Token Lily
Token ID20037
Feature activation-1.38e-02

Pos logit contributions
P+0.001
over+0.001
amed+0.000
hered+0.000
liest+0.000
Neg logit contributions
 stool-0.000
 log-0.000
 chair-0.000
 tray-0.000
 scale-0.000
Token remembered
Token ID12086
Feature activation+9.03e-03

Pos logit contributions
over+0.020
P+0.019
Sus+0.017
liest+0.017
date+0.017
Neg logit contributions
 stool-0.024
 tray-0.023
 log-0.023
 chair-0.022
 scale-0.022
Token that
Token ID326
Feature activation+1.38e-02

Pos logit contributions
 stool+0.014
 log+0.013
 tray+0.013
 chair+0.012
 fork+0.012
Neg logit contributions
over-0.016
P-0.015
amed-0.014
liest-0.014
date-0.014
Tokenmy
Token ID1820
Feature activation-8.88e-03

Pos logit contributions
P+0.006
over+0.006
liest+0.005
amed+0.005
ed+0.005
Neg logit contributions
 stool-0.006
 tray-0.006
 log-0.006
 oven-0.006
 microwave-0.006
Token's
Token ID338
Feature activation-1.42e-02

Pos logit contributions
over+0.013
P+0.012
date+0.011
beh+0.011
liest+0.011
Neg logit contributions
 stool-0.016
 log-0.016
 tray-0.015
 chair-0.015
 scale-0.015
Token dad
Token ID9955
Feature activation-3.47e-02

Pos logit contributions
over+0.029
P+0.028
date+0.026
liest+0.026
hered+0.026
Neg logit contributions
 stool-0.016
 tray-0.015
 log-0.015
 microwave-0.014
 chair-0.014
Token started
Token ID2067
Feature activation-6.41e-03

Pos logit contributions
over+0.048
P+0.047
liest+0.041
Sus+0.041
amed+0.041
Neg logit contributions
 stool-0.063
 tray-0.061
 log-0.061
 scale-0.059
 chair-0.059
Token to
Token ID284
Feature activation-8.98e-03

Pos logit contributions
over+0.010
P+0.010
liest+0.009
amed+0.009
Em+0.008
Neg logit contributions
 stool-0.011
 log-0.011
 tray-0.010
 chair-0.010
 scale-0.010
Token un
Token ID555
Feature activation-6.01e-02

Pos logit contributions
over+0.019
P+0.018
ed+0.018
liest+0.017
amed+0.017
Neg logit contributions
 stool-0.010
 log-0.010
 tray-0.010
 scale-0.009
 chair-0.009
Tokenpack
Token ID8002
Feature activation-1.30e-02

Pos logit contributions
over+0.104
P+0.099
liest+0.094
hered+0.092
lyss+0.091
Neg logit contributions
 stool-0.088
 tray-0.083
 log-0.082
 scale-0.080
 microwave-0.079
Token the
Token ID262
Feature activation-1.45e-02

Pos logit contributions
P+0.020
over+0.019
hered+0.017
beh+0.017
lyss+0.017
Neg logit contributions
 stool-0.023
 tray-0.022
 log-0.021
 chair-0.021
 microwave-0.021
Token fishing
Token ID12478
Feature activation+1.61e-03

Pos logit contributions
over+0.029
P+0.028
Sus+0.027
hered+0.026
beh+0.026
Neg logit contributions
 stool-0.018
 log-0.017
 tray-0.017
 scale-0.016
 microwave-0.016
Token gear
Token ID7733
Feature activation+2.44e-02

Pos logit contributions
 stool+0.002
 tray+0.002
 log+0.002
 scale+0.001
 chair+0.001
Neg logit contributions
over-0.003
P-0.003
Sus-0.003
liest-0.003
hered-0.003
Token.
Token ID13
Feature activation-1.39e-03

Pos logit contributions
 stool+0.043
 tray+0.041
 log+0.041
 chair+0.039
 microwave+0.039
Neg logit contributions
over-0.035
P-0.034
liest-0.030
hered-0.030
Sus-0.030
Token.
Token ID13
Feature activation-1.85e-03

Pos logit contributions
 stool+0.042
 Deal+0.041
ses+0.041
 log+0.040
 microwave+0.040
Neg logit contributions
 such-0.018
over-0.018
liest-0.017
P-0.016
date-0.014
Token They
Token ID1119
Feature activation+1.90e-02

Pos logit contributions
over+0.003
P+0.003
 such+0.003
Sus+0.003
date+0.003
Neg logit contributions
 stool-0.003
 log-0.003
 tray-0.003
 scale-0.002
 chair-0.002
Token wished
Token ID16555
Feature activation+2.88e-02

Pos logit contributions
 stool+0.032
 tray+0.031
 log+0.031
 chair+0.029
 microwave+0.029
Neg logit contributions
P-0.028
over-0.028
liest-0.025
ed-0.025
hered-0.025
Token they
Token ID484
Feature activation+1.11e-02

Pos logit contributions
 stool+0.043
 tray+0.041
 log+0.040
 chair+0.039
 oven+0.039
Neg logit contributions
P-0.051
over-0.050
amed-0.046
liest-0.046
date-0.045
Token were
Token ID547
Feature activation+4.00e-03

Pos logit contributions
 stool+0.018
 tray+0.018
 fork+0.018
 chair+0.017
 log+0.017
Neg logit contributions
over-0.017
P-0.017
date-0.015
Sus-0.015
liest-0.015
Token haw
Token ID23185
Feature activation-6.54e-02

Pos logit contributions
 stool+0.007
 log+0.006
 tray+0.006
 chair+0.006
 microwave+0.006
Neg logit contributions
over-0.006
P-0.006
 such-0.006
amed-0.005
liest-0.005
Tokenks
Token ID591
Feature activation+9.95e-03

Pos logit contributions
P+0.125
lyss+0.124
 Emmy+0.123
Sus+0.122
Em+0.121
Neg logit contributions
ses-0.074
 tray-0.071
 stool-0.071
 thinks-0.069
 log-0.068
Token and
Token ID290
Feature activation+7.35e-03

Pos logit contributions
 stool+0.016
 tray+0.016
 log+0.015
 chair+0.015
 fork+0.015
Neg logit contributions
P-0.015
over-0.015
Sus-0.014
date-0.013
lyss-0.013
Token could
Token ID714
Feature activation+6.17e-03

Pos logit contributions
 stool+0.011
 tray+0.011
 log+0.011
 chair+0.010
 scale+0.010
Neg logit contributions
over-0.012
P-0.012
liest-0.011
Sus-0.011
hered-0.011
Token eat
Token ID4483
Feature activation+7.47e-03

Pos logit contributions
 stool+0.009
 microwave+0.009
 tray+0.009
 log+0.009
 oven+0.008
Neg logit contributions
P-0.009
 such-0.009
over-0.009
hered-0.009
date-0.009
Token mice
Token ID10693
Feature activation-2.01e-02

Pos logit contributions
 stool+0.014
 log+0.013
 tray+0.013
 microwave+0.012
 chair+0.012
Neg logit contributions
over-0.010
P-0.010
 such-0.009
liest-0.009
hered-0.009
Token to
Token ID284
Feature activation+5.27e-03

Pos logit contributions
over+0.029
P+0.028
hered+0.026
Sus+0.026
liest+0.026
Neg logit contributions
 stool-0.032
 tray-0.031
 log-0.030
 chair-0.030
 oven-0.030
Token be
Token ID307
Feature activation-2.33e-02

Pos logit contributions
 stool+0.007
 tray+0.007
 log+0.007
 scale+0.006
 chair+0.006
Neg logit contributions
over-0.010
P-0.009
ed-0.009
liest-0.009
amed-0.009
Token careful
Token ID8161
Feature activation-2.35e-02

Pos logit contributions
over+0.042
liest+0.042
hered+0.042
P+0.041
Sus+0.041
Neg logit contributions
 stool-0.031
 tray-0.030
 scale-0.028
 oven-0.028
 log-0.028
Token.
Token ID13
Feature activation+8.24e-03

Pos logit contributions
over+0.035
P+0.032
liest+0.030
ed+0.030
amed+0.030
Neg logit contributions
 stool-0.040
 tray-0.039
 log-0.039
 microwave-0.038
 fork-0.037
Token Maybe
Token ID6674
Feature activation+3.40e-02

Pos logit contributions
 stool+0.012
 tray+0.012
 log+0.012
 chair+0.011
 scale+0.011
Neg logit contributions
over-0.014
P-0.013
liest-0.012
Sus-0.012
hered-0.012
Token someone
Token ID2130
Feature activation-5.32e-02

Pos logit contributions
 stool+0.045
 chair+0.040
 tray+0.040
 log+0.039
 tractor+0.039
Neg logit contributions
amed-0.068
over-0.068
P-0.067
beh-0.064
hered-0.064
Token is
Token ID318
Feature activation-7.49e-03

Pos logit contributions
over+0.108
P+0.107
Sus+0.099
amed+0.098
liest+0.098
Neg logit contributions
 stool-0.061
 tray-0.058
 log-0.057
 chair-0.054
 scale-0.054
Token looking
Token ID2045
Feature activation+2.09e-02

Pos logit contributions
P+0.013
over+0.013
liest+0.012
Sus+0.012
date+0.011
Neg logit contributions
 stool-0.011
 tray-0.011
 scale-0.011
 log-0.011
 oven-0.011
Token for
Token ID329
Feature activation-1.46e-02

Pos logit contributions
 stool+0.032
 tray+0.031
 log+0.031
 chair+0.030
 microwave+0.029
Neg logit contributions
P-0.034
over-0.034
Sus-0.030
hered-0.030
amed-0.030
Token this
Token ID428
Feature activation-2.92e-03

Pos logit contributions
P+0.024
over+0.024
amed+0.024
date+0.023
hered+0.022
Neg logit contributions
 stool-0.022
 tray-0.021
 log-0.021
 scale-0.021
 fork-0.021
Token money
Token ID1637
Feature activation-1.76e-02

Pos logit contributions
over+0.005
P+0.005
liest+0.004
date+0.004
 such+0.004
Neg logit contributions
 stool-0.004
 tray-0.004
 log-0.004
 microwave-0.004
 chair-0.004
Token take
Token ID1011
Feature activation-1.30e-02

Pos logit contributions
 stool+0.008
 tray+0.007
 log+0.007
 chair+0.007
 scale+0.007
Neg logit contributions
over-0.014
P-0.014
amed-0.013
liest-0.013
Sus-0.013
Token something
Token ID1223
Feature activation-2.18e-03

Pos logit contributions
over+0.022
Sus+0.022
P+0.022
Em+0.021
amed+0.021
Neg logit contributions
 log-0.018
 scale-0.017
 tray-0.017
 thinks-0.017
 stool-0.017
Token that
Token ID326
Feature activation+1.41e-02

Pos logit contributions
over+0.004
P+0.003
amed+0.003
Sus+0.003
date+0.003
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 scale-0.003
 oven-0.003
Token doesn
Token ID1595
Feature activation+1.77e-02

Pos logit contributions
 stool+0.014
 tray+0.013
 log+0.013
 chair+0.012
 microwave+0.012
Neg logit contributions
P-0.030
over-0.030
liest-0.028
hered-0.028
ed-0.028
Token't
Token ID470
Feature activation+5.96e-03

Pos logit contributions
 stool+0.025
 log+0.024
 tray+0.024
 microwave+0.023
 chair+0.023
Neg logit contributions
over-0.031
P-0.031
hered-0.028
liest-0.028
 such-0.028
Token belong
Token ID5594
Feature activation-5.57e-02

Pos logit contributions
 log+0.010
 tray+0.010
 stool+0.010
 scale+0.010
 fork+0.010
Neg logit contributions
over-0.008
P-0.008
amed-0.007
ed-0.007
lyss-0.007
Token to
Token ID284
Feature activation+1.44e-02

Pos logit contributions
over+0.093
P+0.092
amed+0.086
Sus+0.085
lyss+0.084
Neg logit contributions
 stool-0.079
 tray-0.079
 log-0.076
 fork-0.073
 thinks-0.073
Token you
Token ID345
Feature activation-1.16e-03

Pos logit contributions
 tray+0.024
 stool+0.024
 log+0.023
 oven+0.023
 scale+0.023
Neg logit contributions
over-0.020
P-0.020
amed-0.020
Sus-0.019
ed-0.019
Token!
Token ID0
Feature activation-3.49e-03

Pos logit contributions
P+0.002
over+0.002
Sus+0.001
amed+0.001
hered+0.001
Neg logit contributions
 stool-0.002
 tray-0.002
 log-0.002
 oven-0.002
 chair-0.002
Token You
Token ID921
Feature activation+3.65e-03

Pos logit contributions
over+0.006
P+0.006
amed+0.006
Sus+0.005
liest+0.005
Neg logit contributions
 stool-0.005
 log-0.005
 tray-0.005
 Deal-0.005
 microwave-0.004
Token should
Token ID815
Feature activation-2.89e-03

Pos logit contributions
 stool+0.006
 tray+0.005
 log+0.005
 chair+0.005
 microwave+0.005
Neg logit contributions
P-0.006
over-0.006
 such-0.006
liest-0.005
hered-0.005
Token bright
Token ID6016
Feature activation+1.08e-02

Pos logit contributions
 stool+0.022
 log+0.021
 tray+0.021
 chair+0.020
 scale+0.020
Neg logit contributions
P-0.028
over-0.028
date-0.025
hered-0.025
liest-0.025
Token lights
Token ID7588
Feature activation+6.50e-04

Pos logit contributions
 stool+0.013
 log+0.013
 tray+0.013
 chair+0.013
 scale+0.012
Neg logit contributions
P-0.021
over-0.021
beh-0.019
date-0.019
hered-0.019
Token and
Token ID290
Feature activation-4.27e-05

Pos logit contributions
 stool+0.001
 tray+0.001
 log+0.001
 chair+0.001
 scale+0.001
Neg logit contributions
P-0.001
over-0.001
beh-0.001
lyss-0.001
date-0.001
Token lots
Token ID6041
Feature activation+2.93e-02

Pos logit contributions
P+0.000
over+0.000
date+0.000
hered+0.000
beh+0.000
Neg logit contributions
 stool-0.000
 log-0.000
 thinks-0.000
 chair-0.000
 scale-0.000
Token of
Token ID286
Feature activation-2.53e-02

Pos logit contributions
 stool+0.046
 log+0.046
 scale+0.045
 chair+0.045
 tray+0.045
Neg logit contributions
P-0.050
over-0.044
date-0.041
amed-0.041
beh-0.041
Token people
Token ID661
Feature activation-6.34e-02

Pos logit contributions
P+0.049
over+0.047
hered+0.044
date+0.043
beh+0.043
Neg logit contributions
 log-0.033
 thinks-0.032
 stool-0.032
 chair-0.032
 tray-0.032
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
P+0.110
over+0.102
date+0.093
amed+0.093
lyss+0.091
Neg logit contributions
 stool-0.096
 log-0.092
 scale-0.091
 chair-0.091
 tray-0.090
Token He
Token ID679
Feature activation-2.11e-03

Pos logit contributions
over+0.021
P+0.020
 such+0.018
ed+0.018
liest+0.018
Neg logit contributions
 stool-0.023
 tray-0.022
 log-0.021
 oven-0.021
 tractor-0.020
Token was
Token ID373
Feature activation-6.54e-03

Pos logit contributions
over+0.003
P+0.003
hered+0.003
date+0.003
liest+0.003
Neg logit contributions
 stool-0.004
 log-0.004
 tray-0.004
 chair-0.004
 scale-0.003
Token so
Token ID523
Feature activation-2.23e-03

Pos logit contributions
P+0.009
over+0.008
hered+0.008
liest+0.007
lish+0.007
Neg logit contributions
 stool-0.012
 log-0.012
 tray-0.012
 chair-0.011
 scale-0.011
Token amazed
Token ID24562
Feature activation-2.07e-04

Pos logit contributions
over+0.004
P+0.004
hered+0.004
liest+0.004
date+0.004
Neg logit contributions
 stool-0.003
 tray-0.003
 log-0.003
 chair-0.003
 scale-0.003