TOP ACTIVATIONS
MAX = 4.423

can I wear the purple bl ouse ?". M ummy said
you want to see my sc ooter ?" The
, Sara knew why the bl ouse was so enormous -
she tried to put the bl ouse on , she struggled
they started to pick the blo oms .
was going to steal the bl ouse and she ran as
So , they passed the p ears to each other and
" Where are the all igators , mom ?" Anna asked
But Tommy wanted the bl ouse more than anything .
borrowed her mom 's magn ifying glass to take a closer
you help me fill the bucket with water ?" Liz asked
garden and pick the juicy p ears from the trees .
He pretended to throw his sc ooter , and the dog
" How much is that basket ?" The lady
said , " Hi O ll ie ". <|endoftext|> Once ,
liked to play with their sc ooters in the park .
" No Sarah ," said her mum , " it 's
She pulled out a beautiful bl ouse and held it up
you want to try my sc ooter ? It is very
off , but she knew her mum was right and started

MIN ACTIVATIONS
MIN = -4.366

and went to the to aster . She saw that the
of bread in the to aster and pushed the lever down
put it in the to aster . She took a seat
the toast in the to aster and waited for it to
. She grabbed the to aster and threw it in the
more bread in the to aster and turned it on .
a bad injury and needed surgery . He said they were
put it in the to aster . Then , she went
some bread in the to aster . He goes to his
see Ben and the to aster . The toast
put it in the to aster and waited . When it
the stove and the to aster . Sam wakes
on a scary roller co aster ride . Tim my was
the bread in the to aster and pushed the button down
, so she went to Farmer Brown 's house . When
toast out of the to aster and put it on a
put bread in the to aster . Tom waited
put it in the to aster . One day
puts bread in the to aster and waits for it to
the stove and the to aster ," Tom says .

INTERVAL 2.226 - 4.423
CONTAINS 0.186%

, the wind , the qu ack and the laugh .
healthy and cheap . The both felt better knowing they were
! He sw am the whole day , until he was
we come back to the reef another day ? I want
So , Annie promised the jelly fish that she 'd take

INTERVAL 0.028 - 2.226
CONTAINS 47.706%

request and her mom said yes ! The little girl was
how it looked , how it smelled , and how it
day , while she was playing , she found a shiny
, Lily . Acc idents happen . I 'm glad you
the corner of the room . It was the lip !

INTERVAL -2.169 - 0.028
CONTAINS 51.868%

silly dog . Tom and Mia sigh . They
other as much as we can ." T eddy
much as he could . But then he heard a strange
all day long . One day , she saw a parade
She was very fearful . Everywhere she went , she heard

INTERVAL -4.366 - -2.169
CONTAINS 0.240%

, but it was too late . Lily was gone .
Once there was a turtle who loved to laugh . He
careful and to always think before he acted . He never
make them even better than before . And we can put
higher than she ever had before . She felt so high
Token can
Token ID460
Feature activation-1.73e-01

Pos logit contributions
aces+0.002
 hind+0.002
ags+0.002
sed+0.002
endant+0.002
Neg logit contributions
burn-0.002
 Woo-0.002
 nurses-0.002
T-0.002
Pe-0.002
Token I
Token ID314
Feature activation+6.39e-01

Pos logit contributions
burn+0.006
 Woo+0.006
 nurses+0.006
Pe+0.006
 lam+0.006
Neg logit contributions
aces-0.003
 hind-0.003
ags-0.003
endant-0.003
 hips-0.003
Token wear
Token ID5806
Feature activation+2.56e-01

Pos logit contributions
aces+0.020
 hind+0.020
ags+0.020
endant+0.019
sed+0.018
Neg logit contributions
burn-0.015
 Woo-0.014
 nurses-0.014
T-0.013
Pe-0.013
Token the
Token ID262
Feature activation+8.05e-01

Pos logit contributions
aces+0.009
 hind+0.009
ags+0.008
sed+0.008
ushed+0.008
Neg logit contributions
 Woo-0.006
burn-0.005
 nurses-0.005
T-0.005
Pe-0.004
Token purple
Token ID14032
Feature activation+7.45e-01

Pos logit contributions
aces+0.022
 hind+0.022
ags+0.021
sed+0.020
endant+0.019
Neg logit contributions
 Woo-0.023
burn-0.022
 nurses-0.021
T-0.020
Pe-0.020
Token bl
Token ID698
Feature activation+4.42e+00

Pos logit contributions
aces+0.023
 hind+0.023
ags+0.022
sed+0.021
endant+0.021
Neg logit contributions
burn-0.018
 Woo-0.018
 nurses-0.016
T-0.016
Pe-0.015
Tokenouse
Token ID1076
Feature activation+3.16e+00

Pos logit contributions
 hind+0.103
ags+0.096
ses+0.093
aces+0.093
sed+0.092
Neg logit contributions
burn-0.141
 Woo-0.140
T-0.134
or-0.128
eb-0.121
Token?".
Token ID43634
Feature activation+6.33e-01

Pos logit contributions
ses+0.111
 hind+0.111
sed+0.110
 ey+0.105
ags+0.101
Neg logit contributions
burn-0.072
 Woo-0.070
 one-0.064
 any-0.062
where-0.058
Token M
Token ID337
Feature activation+2.13e+00

Pos logit contributions
aces+0.024
endant+0.024
 hind+0.023
ags+0.022
amed+0.022
Neg logit contributions
 Woo-0.011
 nurses-0.011
T-0.011
where-0.011
burn-0.010
Tokenummy
Token ID13513
Feature activation+9.56e-01

Pos logit contributions
 hind+0.083
aces+0.079
amed+0.077
aks+0.075
ags+0.075
Neg logit contributions
burn-0.033
T-0.028
ie-0.027
where-0.027
 nurses-0.026
Token said
Token ID531
Feature activation+9.72e-01

Pos logit contributions
 hind+0.022
ags+0.022
aces+0.022
endant+0.019
sed+0.019
Neg logit contributions
burn-0.031
 Woo-0.031
 nurses-0.030
T-0.029
 lam-0.028
Token you
Token ID345
Feature activation+1.58e+00

Pos logit contributions
burn+0.028
 Woo+0.027
 nurses+0.025
T+0.024
 lam+0.024
Neg logit contributions
aces-0.029
 hind-0.028
ags-0.027
sed-0.026
endant-0.026
Token want
Token ID765
Feature activation+1.07e+00

Pos logit contributions
 hind+0.058
aces+0.056
endant+0.055
sed+0.055
ags+0.054
Neg logit contributions
 Woo-0.029
burn-0.027
 nurses-0.027
 any-0.026
 one-0.024
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
aces+0.044
ushed+0.043
 hind+0.042
ags+0.042
amed+0.041
Neg logit contributions
 Woo-0.018
 nurses-0.016
burn-0.016
 any-0.015
 one-0.014
Token see
Token ID766
Feature activation+1.48e+00

Pos logit contributions
aces+0.038
 hind+0.038
ags+0.037
sed+0.035
endant+0.035
Neg logit contributions
burn-0.031
 Woo-0.030
 nurses-0.029
T-0.027
Pe-0.027
Token my
Token ID616
Feature activation+4.26e-01

Pos logit contributions
aces+0.051
 hind+0.050
ags+0.048
sed+0.048
ushed+0.047
Neg logit contributions
 Woo-0.032
burn-0.031
 nurses-0.029
T-0.027
 lam-0.026
Token sc
Token ID629
Feature activation+4.06e+00

Pos logit contributions
aces+0.015
agged+0.015
amed+0.014
ags+0.014
ushed+0.014
Neg logit contributions
burn-0.009
 Woo-0.008
 nurses-0.008
 lamb-0.008
 lam-0.008
Tokenooter
Token ID25141
Feature activation+2.32e+00

Pos logit contributions
aces+0.097
 hind+0.095
ags+0.092
sed+0.087
amed+0.086
Neg logit contributions
burn-0.122
 Woo-0.121
 nurses-0.112
Pe-0.109
 lam-0.108
Token?"
Token ID1701
Feature activation-3.41e-01

Pos logit contributions
 hind+0.067
sed+0.064
ses+0.063
ushed+0.063
aces+0.062
Neg logit contributions
burn-0.066
 Woo-0.060
 any-0.057
 nurses-0.056
 one-0.055
Token
Token ID198
Feature activation-2.51e-01

Pos logit contributions
burn+0.009
 Woo+0.009
 nurses+0.008
T+0.008
Pe+0.008
Neg logit contributions
aces-0.010
 hind-0.010
ags-0.009
sed-0.009
endant-0.009
Token
Token ID198
Feature activation-3.96e-01

Pos logit contributions
burn+0.007
 Woo+0.007
 nurses+0.006
 lam+0.006
Pe+0.006
Neg logit contributions
aces-0.007
 hind-0.007
ags-0.007
endant-0.006
sed-0.006
TokenThe
Token ID464
Feature activation-4.07e-01

Pos logit contributions
burn+0.011
 Woo+0.010
 nurses+0.010
 lam+0.009
 any+0.009
Neg logit contributions
 hind-0.010
aces-0.010
ags-0.010
amed-0.010
efully-0.010
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
aces+0.014
 hind+0.014
ags+0.014
sed+0.013
endant+0.013
Neg logit contributions
burn-0.009
 Woo-0.008
 nurses-0.008
T-0.007
Pe-0.007
Token Sara
Token ID24799
Feature activation-2.10e-02

Pos logit contributions
aces+0.035
 hind+0.034
ags+0.032
sed+0.032
endant+0.032
Neg logit contributions
burn-0.023
 nurses-0.020
 Woo-0.020
T-0.020
 paramed-0.019
Token knew
Token ID2993
Feature activation+3.99e-01

Pos logit contributions
T+0.000
burn+0.000
 Woo+0.000
 nurses+0.000
 lam+0.000
Neg logit contributions
ags-0.001
 hind-0.001
aces-0.001
agged-0.001
ses-0.001
Token why
Token ID1521
Feature activation-1.56e+00

Pos logit contributions
aces+0.015
 hind+0.014
ags+0.014
endant+0.014
sed+0.013
Neg logit contributions
burn-0.007
 Woo-0.007
 nurses-0.006
Pe-0.006
T-0.006
Token the
Token ID262
Feature activation+7.79e-01

Pos logit contributions
 Woo+0.029
burn+0.029
 nurses+0.026
T+0.025
Pe+0.024
Neg logit contributions
aces-0.057
 hind-0.057
ags-0.056
sed-0.053
endant-0.053
Token bl
Token ID698
Feature activation+4.02e+00

Pos logit contributions
aces+0.019
 hind+0.018
ags+0.017
endant+0.017
amed+0.016
Neg logit contributions
burn-0.023
 Woo-0.022
 nurses-0.022
Pe-0.021
T-0.020
Tokenouse
Token ID1076
Feature activation+2.83e+00

Pos logit contributions
aces+0.055
 hind+0.048
ags+0.047
endant+0.046
ushed+0.045
Neg logit contributions
 Woo-0.167
burn-0.167
 nurses-0.165
 lam-0.157
Pe-0.156
Token was
Token ID373
Feature activation+8.65e-01

Pos logit contributions
 hind+0.110
ags+0.109
asp+0.098
agged+0.096
aces+0.096
Neg logit contributions
burn-0.055
 Woo-0.048
 nurses-0.047
T-0.043
 any-0.041
Token so
Token ID523
Feature activation+1.48e+00

Pos logit contributions
 hind+0.027
aces+0.027
ags+0.026
ushed+0.025
agged+0.025
Neg logit contributions
 Woo-0.022
burn-0.022
 any-0.020
 nurses-0.019
 one-0.018
Token enormous
Token ID9812
Feature activation+1.09e+00

Pos logit contributions
aces+0.048
 hind+0.046
ags+0.045
ses+0.043
sed+0.043
Neg logit contributions
 Woo-0.039
T-0.034
burn-0.034
Pe-0.032
 nurses-0.031
Token -
Token ID532
Feature activation+1.18e+00

Pos logit contributions
aces+0.038
 hind+0.038
ags+0.037
sed+0.035
endant+0.035
Neg logit contributions
 Woo-0.022
burn-0.022
 nurses-0.020
T-0.019
 lam-0.018
Token she
Token ID673
Feature activation+1.26e+00

Pos logit contributions
burn+0.019
 Woo+0.018
 nurses+0.016
T+0.016
Pe+0.015
Neg logit contributions
aces-0.038
 hind-0.038
ags-0.037
sed-0.036
endant-0.035
Token tried
Token ID3088
Feature activation+6.69e-01

Pos logit contributions
ags+0.039
 hind+0.039
aces+0.038
ses+0.036
sed+0.036
Neg logit contributions
 Woo-0.032
burn-0.031
T-0.030
 nurses-0.030
Pe-0.028
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
aces+0.019
 hind+0.018
ags+0.017
endant+0.017
sed+0.016
Neg logit contributions
burn-0.018
 nurses-0.017
 Woo-0.016
Pe-0.016
 paramed-0.016
Token put
Token ID1234
Feature activation-1.99e-01

Pos logit contributions
aces+0.037
 hind+0.037
ags+0.036
endant+0.033
sed+0.033
Neg logit contributions
burn-0.030
 Woo-0.030
 nurses-0.028
 lam-0.026
Pe-0.026
Token the
Token ID262
Feature activation+8.31e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.004
T+0.003
Pe+0.003
Neg logit contributions
aces-0.007
ags-0.007
 hind-0.007
endant-0.006
sed-0.006
Token bl
Token ID698
Feature activation+4.01e+00

Pos logit contributions
aces+0.024
 hind+0.023
ags+0.023
sed+0.021
endant+0.021
Neg logit contributions
burn-0.022
 Woo-0.022
 nurses-0.021
T-0.020
Pe-0.019
Tokenouse
Token ID1076
Feature activation+2.88e+00

Pos logit contributions
 hind+0.079
ags+0.075
aces+0.074
sed+0.068
ses+0.066
Neg logit contributions
burn-0.142
 Woo-0.140
T-0.132
 nurses-0.128
Pe-0.126
Token on
Token ID319
Feature activation+6.76e-01

Pos logit contributions
 hind+0.101
ags+0.096
sed+0.095
ses+0.095
aces+0.095
Neg logit contributions
burn-0.060
 Woo-0.059
 nurses-0.052
 any-0.051
T-0.050
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
aces+0.024
 hind+0.023
ags+0.023
sed+0.022
endant+0.022
Neg logit contributions
burn-0.014
 Woo-0.014
 nurses-0.013
T-0.012
Pe-0.012
Token she
Token ID673
Feature activation+1.41e-01

Pos logit contributions
aces+0.027
 hind+0.024
sed+0.024
endant+0.023
ags+0.023
Neg logit contributions
burn-0.019
 Woo-0.017
 nurses-0.016
Pe-0.016
 lam-0.015
Token struggled
Token ID11615
Feature activation+6.85e-01

Pos logit contributions
ags+0.004
 hind+0.004
aces+0.004
agged+0.004
endant+0.004
Neg logit contributions
 Woo-0.004
burn-0.004
 nurses-0.004
T-0.003
 lam-0.003
Token they
Token ID484
Feature activation+5.80e-02

Pos logit contributions
aces+0.028
 hind+0.028
ags+0.027
sed+0.026
ushed+0.025
Neg logit contributions
burn-0.021
 Woo-0.020
 nurses-0.019
T-0.018
Pe-0.018
Token started
Token ID2067
Feature activation+3.89e-01

Pos logit contributions
 hind+0.002
ags+0.002
aces+0.002
endant+0.002
ses+0.002
Neg logit contributions
 Woo-0.001
burn-0.001
 nurses-0.001
T-0.001
 lam-0.001
Token to
Token ID284
Feature activation-5.20e-01

Pos logit contributions
 hind+0.011
aces+0.011
ags+0.010
sed+0.010
endant+0.010
Neg logit contributions
 Woo-0.010
burn-0.010
 nurses-0.010
T-0.009
Pe-0.009
Token pick
Token ID2298
Feature activation-1.25e-01

Pos logit contributions
burn+0.012
 Woo+0.011
 nurses+0.011
T+0.010
 lam+0.010
Neg logit contributions
aces-0.018
ags-0.017
 hind-0.017
endant-0.017
agged-0.017
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
burn+0.004
Pe+0.004
 nurses+0.003
 Woo+0.003
T+0.003
Neg logit contributions
aces-0.003
ags-0.003
 hind-0.003
efully-0.003
amed-0.003
Token blo
Token ID24924
Feature activation+4.01e+00

Pos logit contributions
aces+0.004
ags+0.004
agged+0.004
 hind+0.004
ushed+0.004
Neg logit contributions
burn-0.004
 nurses-0.004
 Woo-0.004
 lamb-0.004
 lam-0.004
Tokenoms
Token ID3150
Feature activation+6.91e-01

Pos logit contributions
 hind+0.130
aces+0.127
sed+0.125
ses+0.122
ags+0.118
Neg logit contributions
burn-0.089
 Woo-0.087
 lam-0.082
 nurses-0.081
 lamb-0.080
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 hind+0.024
aces+0.023
ags+0.023
sed+0.022
ses+0.022
Neg logit contributions
burn-0.014
 Woo-0.014
 nurses-0.013
 lam-0.012
 lamb-0.012
Token
Token ID220
Feature activation+8.85e-01

Pos logit contributions
aces+0.004
 hind+0.004
ags+0.004
endant+0.004
amed+0.004
Neg logit contributions
 Woo-0.003
burn-0.003
 nurses-0.003
T-0.002
Pe-0.002
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
 hind+0.032
aces+0.031
ags+0.029
sed+0.029
endant+0.029
Neg logit contributions
 Woo-0.018
burn-0.016
T-0.016
Pe-0.015
 nurses-0.014
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
 Woo+0.002
T+0.002
burn+0.002
Pe+0.002
 nurses+0.002
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.003
sed-0.003
endant-0.003
Token was
Token ID373
Feature activation-1.07e-02

Pos logit contributions
ags+0.004
 hind+0.003
aces+0.003
agged+0.003
 ey+0.003
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
 lam-0.002
T-0.002
Token going
Token ID1016
Feature activation+3.31e-01

Pos logit contributions
 Woo+0.000
burn+0.000
 nurses+0.000
 lam+0.000
Pe+0.000
Neg logit contributions
aces-0.000
 hind-0.000
endant-0.000
ushed-0.000
ags-0.000
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
aces+0.011
 hind+0.011
ags+0.011
sed+0.010
endant+0.010
Neg logit contributions
burn-0.007
 Woo-0.007
 nurses-0.007
T-0.006
 lam-0.006
Token steal
Token ID8711
Feature activation+1.32e+00

Pos logit contributions
aces+0.039
 hind+0.038
ags+0.037
sed+0.036
ushed+0.036
Neg logit contributions
 Woo-0.024
burn-0.023
 nurses-0.021
T-0.020
Pe-0.020
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
aces+0.047
 hind+0.046
ags+0.045
sed+0.043
endant+0.043
Neg logit contributions
 Woo-0.027
burn-0.026
 nurses-0.024
T-0.023
 lam-0.022
Token bl
Token ID698
Feature activation+4.00e+00

Pos logit contributions
aces+0.027
 hind+0.026
ags+0.025
endant+0.025
sed+0.023
Neg logit contributions
burn-0.031
 Woo-0.030
 nurses-0.030
Pe-0.028
T-0.028
Tokenouse
Token ID1076
Feature activation+2.18e+00

Pos logit contributions
aces+0.051
endant+0.041
 hind+0.039
ags+0.038
ushed+0.037
Neg logit contributions
 nurses-0.177
burn-0.173
 Woo-0.173
 lam-0.165
Pe-0.164
Token and
Token ID290
Feature activation+1.64e-01

Pos logit contributions
 hind+0.078
aces+0.076
ags+0.076
sed+0.074
endant+0.072
Neg logit contributions
burn-0.043
 Woo-0.042
 nurses-0.038
T-0.035
 lam-0.034
Token she
Token ID673
Feature activation-6.11e-01

Pos logit contributions
ags+0.006
aces+0.006
agged+0.006
 hind+0.006
asp+0.006
Neg logit contributions
 Woo-0.003
 nurses-0.003
burn-0.003
 lam-0.002
 any-0.002
Token ran
Token ID4966
Feature activation-2.35e-01

Pos logit contributions
 Woo+0.016
 nurses+0.016
burn+0.015
 lam+0.014
T+0.014
Neg logit contributions
ags-0.019
aces-0.018
 hind-0.018
endant-0.016
agged-0.016
Token as
Token ID355
Feature activation-1.39e+00

Pos logit contributions
burn+0.007
 Woo+0.006
 nurses+0.006
T+0.006
Pe+0.006
Neg logit contributions
aces-0.006
 hind-0.006
ags-0.006
sed-0.006
amed-0.006
Token So
Token ID1406
Feature activation+1.29e+00

Pos logit contributions
 hind+0.011
aces+0.011
endant+0.010
ags+0.010
sed+0.010
Neg logit contributions
 Woo-0.007
burn-0.006
T-0.006
Pe-0.005
 nurses-0.005
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
aces+0.046
 hind+0.045
ags+0.044
sed+0.043
endant+0.041
Neg logit contributions
burn-0.025
 Woo-0.025
 nurses-0.023
T-0.022
Pe-0.021
Token they
Token ID484
Feature activation+2.02e-01

Pos logit contributions
aces+0.066
 hind+0.064
ags+0.064
endant+0.061
sed+0.061
Neg logit contributions
 Woo-0.020
burn-0.020
 nurses-0.016
Pe-0.014
 lam-0.014
Token passed
Token ID3804
Feature activation-3.05e-01

Pos logit contributions
 hind+0.007
ags+0.007
aces+0.006
ses+0.006
 ey+0.006
Neg logit contributions
 Woo-0.004
burn-0.004
 nurses-0.004
T-0.004
 lam-0.004
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
 Woo+0.006
burn+0.006
 nurses+0.005
 any+0.005
 lam+0.005
Neg logit contributions
aces-0.011
 hind-0.011
sed-0.011
ses-0.011
endant-0.011
Token p
Token ID279
Feature activation+3.93e+00

Pos logit contributions
aces+0.039
ags+0.038
 hind+0.038
ushed+0.035
agged+0.035
Neg logit contributions
burn-0.041
 Woo-0.041
 nurses-0.039
 lam-0.037
Pe-0.036
Tokenears
Token ID4127
Feature activation+2.23e+00

Pos logit contributions
 hind+0.079
aces+0.078
ags+0.075
ses+0.073
sed+0.070
Neg logit contributions
burn-0.131
 Woo-0.128
 paramed-0.121
 lam-0.121
 nurses-0.120
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
 hind+0.073
aces+0.073
ags+0.072
sed+0.071
ses+0.069
Neg logit contributions
burn-0.050
 Woo-0.048
 nurses-0.041
 lam-0.041
 any-0.040
Token each
Token ID1123
Feature activation+1.87e+00

Pos logit contributions
aces+0.005
 hind+0.004
endant+0.004
ags+0.004
ushed+0.004
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
 any-0.002
Pe-0.002
Token other
Token ID584
Feature activation+1.52e+00

Pos logit contributions
aces+0.058
 hind+0.056
ags+0.055
sed+0.053
endant+0.053
Neg logit contributions
burn-0.047
 Woo-0.047
 nurses-0.044
T-0.041
Pe-0.041
Token and
Token ID290
Feature activation-2.70e-01

Pos logit contributions
ags+0.057
 hind+0.056
agged+0.055
aces+0.055
endant+0.054
Neg logit contributions
burn-0.027
 Woo-0.026
 nurses-0.024
 lam-0.022
 any-0.022
Token"
Token ID1
Feature activation+1.52e-01

Pos logit contributions
burn+0.013
 Woo+0.013
 nurses+0.012
 lam+0.011
T+0.011
Neg logit contributions
aces-0.018
 hind-0.018
ags-0.017
amed-0.016
endant-0.016
TokenWhere
Token ID8496
Feature activation-1.47e+00

Pos logit contributions
aces+0.005
 hind+0.005
ags+0.005
amed+0.004
sed+0.004
Neg logit contributions
burn-0.004
 Woo-0.003
 nurses-0.003
 lam-0.003
T-0.003
Token are
Token ID389
Feature activation+2.08e+00

Pos logit contributions
 Woo+0.037
burn+0.036
 nurses+0.035
T+0.034
 any+0.034
Neg logit contributions
 hind-0.046
ags-0.045
aces-0.042
amed-0.040
agged-0.040
Token the
Token ID262
Feature activation+1.38e+00

Pos logit contributions
aces+0.063
 hind+0.063
ags+0.061
igators+0.058
efully+0.058
Neg logit contributions
burn-0.050
 Woo-0.045
T-0.045
Pe-0.042
 nurses-0.041
Token all
Token ID477
Feature activation+3.65e+00

Pos logit contributions
aces+0.047
endant+0.044
ushed+0.044
 hind+0.043
amed+0.042
Neg logit contributions
 Woo-0.032
 nurses-0.031
burn-0.031
 lamb-0.028
 lam-0.028
Tokenigators
Token ID25975
Feature activation+3.93e+00

Pos logit contributions
aces+0.062
ushed+0.057
endant+0.056
 hind+0.056
sed+0.055
Neg logit contributions
 nurses-0.145
 Woo-0.144
burn-0.143
 any-0.137
 lam-0.137
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
aces+0.130
 hind+0.126
ags+0.121
ushed+0.119
agged+0.118
Neg logit contributions
burn-0.089
where-0.088
 Woo-0.088
 any-0.086
 nurses-0.084
Token mom
Token ID1995
Feature activation+1.82e+00

Pos logit contributions
aces+0.045
ushed+0.043
ags+0.042
agged+0.042
 hind+0.042
Neg logit contributions
 Woo-0.028
 nurses-0.028
burn-0.025
 lamb-0.024
 lam-0.024
Token?"
Token ID1701
Feature activation-2.98e-01

Pos logit contributions
 hind+0.045
aces+0.044
ags+0.042
sed+0.040
ses+0.039
Neg logit contributions
burn-0.060
 Woo-0.057
 nurses-0.055
T-0.051
 lam-0.051
Token Anna
Token ID11735
Feature activation+1.20e+00

Pos logit contributions
burn+0.009
 Woo+0.008
 nurses+0.008
T+0.007
 lam+0.007
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.007
sed-0.007
ses-0.007
Token asked
Token ID1965
Feature activation+3.07e-01

Pos logit contributions
aces+0.035
 hind+0.035
ags+0.034
endant+0.032
sed+0.032
Neg logit contributions
 Woo-0.031
burn-0.030
 nurses-0.029
T-0.029
 lam-0.027
Token
Token ID198
Feature activation-6.27e-01

Pos logit contributions
 nurses+0.016
burn+0.016
 Woo+0.015
 lam+0.015
 lamb+0.014
Neg logit contributions
aces-0.013
 hind-0.013
ags-0.012
ushed-0.011
amed-0.011
TokenBut
Token ID1537
Feature activation-6.80e-01

Pos logit contributions
burn+0.016
 nurses+0.016
 Woo+0.015
 lam+0.015
 lamb+0.014
Neg logit contributions
aces-0.016
 hind-0.016
ags-0.016
amed-0.015
ushed-0.015
Token Tommy
Token ID19919
Feature activation-5.25e-01

Pos logit contributions
burn+0.015
 lam+0.014
 nurses+0.014
 Woo+0.014
Pe+0.013
Neg logit contributions
aces-0.019
 hind-0.019
 done-0.019
ags-0.018
sed-0.018
Token wanted
Token ID2227
Feature activation+8.91e-01

Pos logit contributions
 Woo+0.013
burn+0.013
T+0.012
 nurses+0.012
 paramed+0.011
Neg logit contributions
 hind-0.017
ags-0.016
aces-0.015
endant-0.014
agged-0.014
Token the
Token ID262
Feature activation+6.22e-01

Pos logit contributions
aces+0.028
 hind+0.027
ags+0.026
endant+0.025
sed+0.025
Neg logit contributions
burn-0.022
 Woo-0.021
 nurses-0.020
T-0.019
Pe-0.019
Token bl
Token ID698
Feature activation+3.90e+00

Pos logit contributions
aces+0.012
 hind+0.012
ags+0.011
endant+0.011
 lips+0.011
Neg logit contributions
burn-0.020
 nurses-0.020
 Woo-0.019
 paramed-0.019
Sim-0.018
Tokenouse
Token ID1076
Feature activation+2.09e+00

Pos logit contributions
aces+0.047
endant+0.038
ushed+0.038
 hind+0.037
ags+0.036
Neg logit contributions
 nurses-0.172
 Woo-0.169
burn-0.169
 lam-0.162
Pe-0.160
Token more
Token ID517
Feature activation+5.59e-01

Pos logit contributions
 hind+0.075
ags+0.073
aces+0.072
sed+0.071
endant+0.069
Neg logit contributions
burn-0.041
 Woo-0.041
 any-0.037
 nurses-0.036
T-0.033
Token than
Token ID621
Feature activation-9.60e-01

Pos logit contributions
ags+0.019
 hind+0.019
aces+0.019
sed+0.019
ses+0.018
Neg logit contributions
 Woo-0.011
burn-0.011
 nurses-0.010
T-0.009
 lam-0.009
Token anything
Token ID1997
Feature activation-1.87e+00

Pos logit contributions
burn+0.020
 Woo+0.018
 nurses+0.017
 lam+0.017
 paramed+0.017
Neg logit contributions
aces-0.034
ags-0.031
 hind-0.031
endant-0.030
 done-0.029
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
 Woo+0.041
burn+0.037
 nurses+0.036
 any+0.035
T+0.035
Neg logit contributions
ags-0.063
 hind-0.063
aces-0.060
sed-0.058
ses-0.058
Token borrowed
Token ID22546
Feature activation+1.05e+00

Pos logit contributions
ags+0.030
aces+0.026
agged+0.025
 ours+0.024
 hind+0.024
Neg logit contributions
T-0.014
 lam-0.011
 Woo-0.011
Pe-0.011
 B-0.010
Token her
Token ID607
Feature activation+1.51e+00

Pos logit contributions
aces+0.036
 hind+0.035
ags+0.034
sed+0.032
endant+0.032
Neg logit contributions
 Woo-0.022
burn-0.022
 nurses-0.021
T-0.019
Pe-0.019
Token mom
Token ID1995
Feature activation+4.60e-01

Pos logit contributions
agged+0.056
ags+0.054
aces+0.051
amed+0.049
ushed+0.047
Neg logit contributions
 one-0.033
 any-0.032
 lamb-0.031
 Woo-0.031
burn-0.030
Token's
Token ID338
Feature activation+4.79e-03

Pos logit contributions
aces+0.013
 hind+0.012
ags+0.012
endant+0.011
amed+0.011
Neg logit contributions
 Woo-0.012
burn-0.012
 nurses-0.012
Pe-0.011
T-0.011
Token magn
Token ID7842
Feature activation+3.67e-01

Pos logit contributions
agged+0.000
ags+0.000
aces+0.000
amed+0.000
ieved+0.000
Neg logit contributions
 lamb-0.000
 Woo-0.000
 lam-0.000
 nurses-0.000
burn-0.000
Tokenifying
Token ID4035
Feature activation+3.88e+00

Pos logit contributions
 hind+0.009
ags+0.009
aces+0.009
sed+0.008
ses+0.008
Neg logit contributions
burn-0.011
 Woo-0.011
T-0.010
 lam-0.009
 nurses-0.009
Token glass
Token ID5405
Feature activation+8.43e-01

Pos logit contributions
aces+0.115
 hind+0.105
ags+0.103
amed+0.101
ushed+0.098
Neg logit contributions
 Woo-0.113
burn-0.104
 any-0.096
 St-0.095
Pe-0.094
Token to
Token ID284
Feature activation+2.54e-01

Pos logit contributions
ags+0.032
 hind+0.032
agged+0.030
ieved+0.030
amed+0.029
Neg logit contributions
 Woo-0.014
burn-0.013
 any-0.012
 lam-0.012
 one-0.011
Token take
Token ID1011
Feature activation-3.32e-01

Pos logit contributions
aces+0.010
 hind+0.009
ags+0.009
amed+0.009
agged+0.009
Neg logit contributions
 Woo-0.005
 nurses-0.004
burn-0.004
T-0.004
 any-0.004
Token a
Token ID257
Feature activation+2.54e-01

Pos logit contributions
burn+0.008
 Woo+0.007
 nurses+0.007
Pe+0.007
T+0.007
Neg logit contributions
aces-0.010
ags-0.010
 hind-0.010
amed-0.009
endant-0.009
Token closer
Token ID5699
Feature activation+2.87e-01

Pos logit contributions
 hind+0.007
aces+0.007
ags+0.006
endant+0.006
sed+0.006
Neg logit contributions
burn-0.007
 Woo-0.007
 nurses-0.007
Pe-0.006
T-0.006
Token you
Token ID345
Feature activation+2.43e+00

Pos logit contributions
burn+0.023
 Woo+0.021
 nurses+0.021
 lam+0.020
Pe+0.020
Neg logit contributions
aces-0.014
 hind-0.013
ags-0.013
sed-0.012
endant-0.012
Token help
Token ID1037
Feature activation+1.05e+00

Pos logit contributions
aces+0.089
 hind+0.085
ags+0.084
endant+0.081
sed+0.081
Neg logit contributions
burn-0.047
 Woo-0.047
 nurses-0.043
T-0.040
Pe-0.039
Token me
Token ID502
Feature activation+3.03e+00

Pos logit contributions
 hind+0.021
aces+0.020
ags+0.020
endant+0.017
sed+0.017
Neg logit contributions
burn-0.037
 Woo-0.036
 nurses-0.034
Pe-0.034
T-0.033
Token fill
Token ID6070
Feature activation+7.15e-01

Pos logit contributions
aces+0.096
 hind+0.089
sed+0.088
ags+0.086
ses+0.085
Neg logit contributions
burn-0.078
 Woo-0.075
 nurses-0.070
T-0.065
 lamb-0.064
Token the
Token ID262
Feature activation+2.68e+00

Pos logit contributions
aces+0.027
sed+0.026
 hind+0.026
ses+0.025
agged+0.025
Neg logit contributions
 Woo-0.015
burn-0.013
 nurses-0.012
T-0.012
 lam-0.011
Token bucket
Token ID19236
Feature activation+3.87e+00

Pos logit contributions
aces+0.075
 hind+0.071
agged+0.063
ags+0.062
sed+0.062
Neg logit contributions
 Woo-0.075
burn-0.071
 one-0.069
 lamb-0.069
 lam-0.068
Token with
Token ID351
Feature activation+2.15e+00

Pos logit contributions
sed+0.153
ses+0.149
 hind+0.145
aces+0.143
agged+0.136
Neg logit contributions
burn-0.069
 one-0.063
 Woo-0.061
 any-0.059
where-0.054
Token water
Token ID1660
Feature activation+1.24e+00

Pos logit contributions
aces+0.083
sed+0.078
 hind+0.077
ses+0.077
ushed+0.076
Neg logit contributions
 Woo-0.046
 nurses-0.043
 any-0.042
 them-0.041
 one-0.039
Token?"
Token ID1701
Feature activation-6.17e-01

Pos logit contributions
aces+0.036
 hind+0.035
sed+0.033
ses+0.033
ushed+0.032
Neg logit contributions
burn-0.036
 Woo-0.035
 nurses-0.032
 any-0.030
 paramed-0.029
Token Liz
Token ID28313
Feature activation-4.90e-02

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.012
 lam+0.012
Pe+0.012
Neg logit contributions
aces-0.020
 hind-0.019
ags-0.018
endant-0.018
ushed-0.018
Token asked
Token ID1965
Feature activation-7.65e-01

Pos logit contributions
T+0.001
 Woo+0.001
burn+0.001
 nurses+0.001
 lam+0.001
Neg logit contributions
ags-0.002
 hind-0.001
aces-0.001
ses-0.001
sed-0.001
Token garden
Token ID11376
Feature activation-1.44e-01

Pos logit contributions
burn+0.042
 Woo+0.041
 nurses+0.039
T+0.038
 lam+0.037
Neg logit contributions
aces-0.025
 hind-0.023
ags-0.022
sed-0.021
endant-0.021
Token and
Token ID290
Feature activation-1.43e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.004
T+0.003
Pe+0.003
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.004
endant-0.004
sed-0.004
Token pick
Token ID2298
Feature activation+6.29e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
 lam+0.002
T+0.002
Neg logit contributions
aces-0.005
ags-0.005
 hind-0.005
endant-0.005
agged-0.005
Token the
Token ID262
Feature activation+8.28e-01

Pos logit contributions
aces+0.014
ags+0.014
 hind+0.013
endant+0.012
sed+0.012
Neg logit contributions
burn-0.021
 Woo-0.020
 nurses-0.019
T-0.019
Pe-0.019
Token juicy
Token ID35075
Feature activation+2.38e+00

Pos logit contributions
aces+0.026
 hind+0.025
ags+0.024
agged+0.024
ushed+0.024
Neg logit contributions
 Woo-0.021
 lam-0.020
burn-0.020
 nurses-0.019
 lamb-0.019
Token p
Token ID279
Feature activation+3.87e+00

Pos logit contributions
aces+0.058
ags+0.057
 hind+0.056
amed+0.056
sed+0.054
Neg logit contributions
 Woo-0.071
burn-0.071
 lam-0.067
 lamb-0.066
 one-0.065
Tokenears
Token ID4127
Feature activation+2.50e+00

Pos logit contributions
 hind+0.082
aces+0.078
ags+0.071
sed+0.069
ses+0.068
Neg logit contributions
burn-0.128
 Woo-0.128
 lam-0.119
or-0.117
Pe-0.116
Token from
Token ID422
Feature activation+7.59e-02

Pos logit contributions
 hind+0.085
aces+0.084
ags+0.082
sed+0.080
ses+0.078
Neg logit contributions
burn-0.053
 Woo-0.052
 nurses-0.047
 lam-0.044
T-0.044
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
aces+0.003
 hind+0.003
ushed+0.003
ags+0.003
amed+0.003
Neg logit contributions
 Woo-0.001
 nurses-0.001
burn-0.001
 any-0.001
 lam-0.001
Token trees
Token ID7150
Feature activation-5.82e-01

Pos logit contributions
aces+0.004
ags+0.004
agged+0.004
ushed+0.004
 hind+0.004
Neg logit contributions
 Woo-0.005
burn-0.004
 nurses-0.004
 lam-0.004
Pe-0.004
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
burn+0.013
 Woo+0.013
 nurses+0.012
T+0.011
Pe+0.011
Neg logit contributions
aces-0.019
 hind-0.019
ags-0.018
sed-0.017
endant-0.017
Token He
Token ID679
Feature activation+5.45e-02

Pos logit contributions
ags+0.016
 hind+0.016
aces+0.016
endant+0.015
sed+0.015
Neg logit contributions
 Woo-0.010
T-0.009
Pe-0.008
 Th-0.008
burn-0.008
Token pretended
Token ID36766
Feature activation-3.53e-01

Pos logit contributions
ags+0.002
aces+0.002
 hind+0.002
agged+0.002
endant+0.002
Neg logit contributions
 nurses-0.001
T-0.001
 Woo-0.001
 lam-0.001
burn-0.001
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
 Woo+0.006
burn+0.006
 nurses+0.006
T+0.005
 any+0.005
Neg logit contributions
ags-0.013
aces-0.013
 hind-0.013
endant-0.012
ushed-0.012
Token throw
Token ID3714
Feature activation+1.94e-01

Pos logit contributions
aces+0.035
ags+0.034
 hind+0.034
agged+0.033
endant+0.033
Neg logit contributions
 Woo-0.027
burn-0.027
 nurses-0.026
T-0.024
Pe-0.024
Token his
Token ID465
Feature activation+4.53e-01

Pos logit contributions
 hind+0.006
aces+0.006
endant+0.006
ushed+0.006
ags+0.006
Neg logit contributions
 Woo-0.004
 nurses-0.004
burn-0.004
 any-0.004
 lam-0.004
Token sc
Token ID629
Feature activation+3.86e+00

Pos logit contributions
amed+0.014
ieved+0.014
efully+0.014
uct+0.013
asp+0.013
Neg logit contributions
burn-0.009
 any-0.009
 one-0.009
 lamb-0.009
 Woo-0.009
Tokenooter
Token ID25141
Feature activation+1.25e+00

Pos logit contributions
 hind+0.093
aces+0.090
ags+0.090
efully+0.086
sed+0.085
Neg logit contributions
 Woo-0.106
burn-0.103
or-0.098
 nurses-0.095
T-0.095
Token,
Token ID11
Feature activation-7.13e-02

Pos logit contributions
 hind+0.038
aces+0.038
ags+0.038
sed+0.036
endant+0.036
Neg logit contributions
burn-0.030
 Woo-0.029
 nurses-0.027
T-0.026
 lam-0.025
Token and
Token ID290
Feature activation-1.53e-01

Pos logit contributions
 Woo+0.002
burn+0.002
 nurses+0.002
 lam+0.001
T+0.001
Neg logit contributions
aces-0.002
ags-0.002
 hind-0.002
endant-0.002
ushed-0.002
Token the
Token ID262
Feature activation-2.05e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
T+0.002
 lam+0.002
Neg logit contributions
aces-0.006
ags-0.006
 hind-0.005
endant-0.005
agged-0.005
Token dog
Token ID3290
Feature activation+9.93e-02

Pos logit contributions
burn+0.005
 Woo+0.005
 nurses+0.005
 lam+0.005
T+0.005
Neg logit contributions
aces-0.006
ags-0.006
 hind-0.006
agged-0.006
ushed-0.006
Token "
Token ID366
Feature activation+1.02e+00

Pos logit contributions
aces+0.030
 hind+0.029
ags+0.029
endant+0.028
ses+0.026
Neg logit contributions
 Woo-0.024
burn-0.023
 nurses-0.022
T-0.022
Pe-0.021
TokenHow
Token ID2437
Feature activation-1.35e+00

Pos logit contributions
aces+0.030
 hind+0.030
ags+0.029
amed+0.028
endant+0.028
Neg logit contributions
burn-0.025
 Woo-0.024
 nurses-0.022
 lam-0.021
Pe-0.020
Token much
Token ID881
Feature activation+9.34e-01

Pos logit contributions
 Woo+0.027
 any+0.027
T+0.026
 nurses+0.026
 they+0.025
Neg logit contributions
 hind-0.049
ags-0.046
amed-0.045
aces-0.044
efully-0.043
Token is
Token ID318
Feature activation+1.58e+00

Pos logit contributions
 hind+0.031
aces+0.031
ags+0.031
sed+0.030
amed+0.029
Neg logit contributions
 Woo-0.021
burn-0.020
 nurses-0.020
 lam-0.018
 any-0.018
Token that
Token ID326
Feature activation+1.18e+00

Pos logit contributions
aces+0.051
 hind+0.050
ags+0.049
sed+0.046
endant+0.046
Neg logit contributions
 Woo-0.036
burn-0.036
 nurses-0.034
T-0.032
Pe-0.031
Token basket
Token ID7988
Feature activation+3.85e+00

Pos logit contributions
 hind+0.036
aces+0.035
ushed+0.034
sed+0.034
ags+0.034
Neg logit contributions
 Woo-0.032
burn-0.031
 nurses-0.031
 one-0.030
 any-0.030
Token?"
Token ID1701
Feature activation+9.22e-01

Pos logit contributions
 hind+0.136
ses+0.129
sed+0.128
aces+0.127
ags+0.127
Neg logit contributions
burn-0.083
 any-0.080
 Woo-0.077
where-0.075
 nurses-0.073
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
aces+0.032
 hind+0.031
endant+0.030
sed+0.030
ses+0.030
Neg logit contributions
 Woo-0.020
T-0.019
burn-0.018
 nurses-0.018
where-0.018
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
aces+0.036
 hind+0.034
ags+0.033
sed+0.033
ses+0.032
Neg logit contributions
 Woo-0.022
burn-0.021
T-0.020
Pe-0.019
 nurses-0.019
TokenThe
Token ID464
Feature activation+5.27e-01

Pos logit contributions
aces+0.049
ags+0.046
ses+0.046
 hind+0.045
sed+0.045
Neg logit contributions
T-0.019
 Woo-0.018
burn-0.017
Pe-0.017
where-0.015
Token lady
Token ID10846
Feature activation+1.30e+00

Pos logit contributions
aces+0.020
 hind+0.018
ags+0.018
endant+0.018
ushed+0.018
Neg logit contributions
 nurses-0.010
 paramed-0.010
burn-0.009
 Woo-0.009
 lamb-0.009
Token said
Token ID531
Feature activation+1.99e-01

Pos logit contributions
aces+0.008
ags+0.007
 hind+0.007
agged+0.007
endant+0.006
Neg logit contributions
 Woo-0.009
burn-0.009
 nurses-0.008
 lam-0.008
T-0.008
Token,
Token ID11
Feature activation+6.05e-01

Pos logit contributions
aces+0.007
 hind+0.007
ags+0.007
sed+0.006
endant+0.006
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
T-0.004
Pe-0.004
Token "
Token ID366
Feature activation-9.47e-03

Pos logit contributions
aces+0.016
 hind+0.016
ags+0.016
endant+0.015
sed+0.014
Neg logit contributions
 Woo-0.017
burn-0.017
 nurses-0.016
T-0.016
Pe-0.015
TokenHi
Token ID17250
Feature activation+1.66e+00

Pos logit contributions
 Woo+0.000
 nurses+0.000
burn+0.000
 lam+0.000
 paramed+0.000
Neg logit contributions
aces-0.000
 hind-0.000
ags-0.000
amed-0.000
ushed-0.000
Token O
Token ID440
Feature activation+2.40e+00

Pos logit contributions
aces+0.042
 hind+0.042
endant+0.041
asp+0.039
sed+0.038
Neg logit contributions
 nurses-0.046
burn-0.046
 Woo-0.045
T-0.040
Pe-0.040
Tokenll
Token ID297
Feature activation+3.85e+00

Pos logit contributions
aces+0.051
 hind+0.049
ags+0.047
endant+0.046
sed+0.044
Neg logit contributions
 Woo-0.082
burn-0.081
 nurses-0.079
Pe-0.073
T-0.073
Tokenie
Token ID494
Feature activation+1.53e+00

Pos logit contributions
aces+0.147
 hind+0.146
ags+0.144
endant+0.137
sed+0.136
Neg logit contributions
 Woo-0.063
burn-0.061
 nurses-0.058
T-0.052
Pe-0.051
Token".
Token ID1911
Feature activation+5.89e-01

Pos logit contributions
aces+0.045
 hind+0.045
ags+0.044
endant+0.041
sed+0.041
Neg logit contributions
burn-0.039
 Woo-0.039
 nurses-0.036
T-0.035
 lam-0.034
Token<|endoftext|>
Token ID50256
Feature activation+5.49e-01

Pos logit contributions
aces+0.017
 hind+0.016
ags+0.016
endant+0.015
sed+0.015
Neg logit contributions
burn-0.016
 Woo-0.016
 nurses-0.015
T-0.014
Pe-0.014
TokenOnce
Token ID7454
Feature activation+1.31e+00

Pos logit contributions
 hind+0.019
aces+0.018
ags+0.018
sed+0.018
endant+0.017
Neg logit contributions
 Woo-0.012
burn-0.011
T-0.011
Pe-0.010
 nurses-0.010
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
aces+0.027
 hind+0.027
ags+0.026
endant+0.024
sed+0.024
Neg logit contributions
burn-0.045
 Woo-0.044
 nurses-0.042
T-0.041
Pe-0.040
Token liked
Token ID8288
Feature activation-4.05e-01

Pos logit contributions
 hind+0.007
ags+0.007
aces+0.007
endant+0.007
agged+0.007
Neg logit contributions
 Woo-0.006
 nurses-0.006
burn-0.006
T-0.005
 lam-0.005
Token to
Token ID284
Feature activation-3.94e-01

Pos logit contributions
 Woo+0.008
burn+0.008
 nurses+0.007
 lam+0.006
T+0.006
Neg logit contributions
aces-0.015
ags-0.015
 hind-0.014
amed-0.014
ushed-0.014
Token play
Token ID711
Feature activation+9.74e-01

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.007
T+0.007
Pe+0.006
Neg logit contributions
aces-0.014
 hind-0.013
amed-0.013
ags-0.013
endant-0.013
Token with
Token ID351
Feature activation+4.55e-01

Pos logit contributions
endant+0.033
 hind+0.033
aces+0.033
sed+0.031
ushed+0.030
Neg logit contributions
 Woo-0.022
burn-0.019
T-0.019
 nurses-0.019
 them-0.017
Token their
Token ID511
Feature activation-7.00e-01

Pos logit contributions
aces+0.017
 hind+0.016
ags+0.016
amed+0.016
endant+0.016
Neg logit contributions
 Woo-0.009
burn-0.009
 nurses-0.008
T-0.007
 lam-0.007
Token sc
Token ID629
Feature activation+3.83e+00

Pos logit contributions
 Woo+0.019
burn+0.018
 nurses+0.018
 lamb+0.017
 lam+0.017
Neg logit contributions
aces-0.020
amed-0.019
ags-0.018
agged-0.018
ushed-0.018
Tokenooters
Token ID48316
Feature activation+1.71e+00

Pos logit contributions
aces+0.087
 hind+0.085
ags+0.082
sed+0.077
endant+0.075
Neg logit contributions
burn-0.122
 Woo-0.121
 nurses-0.115
Pe-0.110
T-0.110
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
 hind+0.058
sed+0.055
endant+0.054
aces+0.054
ags+0.054
Neg logit contributions
burn-0.038
 Woo-0.036
 nurses-0.033
 any-0.030
T-0.030
Token the
Token ID262
Feature activation-3.52e-01

Pos logit contributions
 hind+0.034
aces+0.034
ags+0.033
endant+0.030
amed+0.030
Neg logit contributions
burn-0.029
 Woo-0.028
 nurses-0.026
 paramed-0.025
Pe-0.025
Token park
Token ID3952
Feature activation+3.03e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.010
Pe+0.010
Neg logit contributions
aces-0.009
 hind-0.009
ags-0.008
sed-0.008
endant-0.008
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
 hind+0.012
ags+0.011
aces+0.011
endant+0.011
agged+0.011
Neg logit contributions
 Woo-0.005
burn-0.005
 nurses-0.004
 lam-0.004
T-0.004
Token "
Token ID366
Feature activation+1.05e+00

Pos logit contributions
aces+0.025
 hind+0.024
endant+0.023
ags+0.023
agged+0.022
Neg logit contributions
 Woo-0.010
 nurses-0.009
burn-0.009
T-0.009
 paramed-0.009
TokenNo
Token ID2949
Feature activation+1.29e-02

Pos logit contributions
efully+0.024
aces+0.024
amed+0.024
 hind+0.024
ags+0.024
Neg logit contributions
burn-0.029
 Woo-0.029
 nurses-0.028
 lam-0.025
 paramed-0.024
Token Sarah
Token ID10490
Feature activation-1.63e-01

Pos logit contributions
 hind+0.000
aces+0.000
sed+0.000
ags+0.000
endant+0.000
Neg logit contributions
burn-0.000
 Woo-0.000
Pe-0.000
T-0.000
 nurses-0.000
Token,"
Token ID553
Feature activation+1.27e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
 lam+0.003
T+0.003
Neg logit contributions
 hind-0.006
ags-0.006
aces-0.006
agged-0.005
amed-0.005
Token said
Token ID531
Feature activation+9.88e-01

Pos logit contributions
aces+0.005
 hind+0.005
ags+0.005
sed+0.005
endant+0.005
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
T-0.001
Pe-0.001
Token her
Token ID607
Feature activation+3.81e+00

Pos logit contributions
aces+0.030
ags+0.028
 hind+0.028
sed+0.026
ushed+0.026
Neg logit contributions
burn-0.024
 Woo-0.024
 nurses-0.023
 lam-0.022
Pe-0.022
Token mum
Token ID25682
Feature activation+1.37e+00

Pos logit contributions
aces+0.132
ags+0.128
 hind+0.124
amed+0.122
agged+0.122
Neg logit contributions
burn-0.081
 Woo-0.079
 lam-0.073
 nurses-0.071
 lamb-0.069
Token,
Token ID11
Feature activation+2.96e-01

Pos logit contributions
aces+0.042
 hind+0.040
ags+0.039
sed+0.038
endant+0.038
Neg logit contributions
 Woo-0.034
burn-0.034
 nurses-0.031
Pe-0.030
T-0.030
Token "
Token ID366
Feature activation+1.80e-01

Pos logit contributions
 hind+0.010
aces+0.010
ags+0.010
endant+0.009
amed+0.009
Neg logit contributions
 Woo-0.007
 nurses-0.006
burn-0.006
T-0.006
 lam-0.006
Tokenit
Token ID270
Feature activation+6.35e-01

Pos logit contributions
aces+0.006
 hind+0.006
ags+0.005
amed+0.005
endant+0.005
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
 lam-0.004
T-0.003
Token's
Token ID338
Feature activation+8.23e-01

Pos logit contributions
 hind+0.023
ags+0.023
aces+0.022
endant+0.021
agged+0.021
Neg logit contributions
burn-0.013
 Woo-0.012
 nurses-0.011
 lam-0.011
 lamb-0.011
Token She
Token ID1375
Feature activation+7.82e-01

Pos logit contributions
aces+0.038
 hind+0.038
ags+0.037
sed+0.037
ses+0.036
Neg logit contributions
 Woo-0.028
burn-0.024
T-0.024
 nurses-0.024
Pe-0.023
Token pulled
Token ID5954
Feature activation+1.64e-01

Pos logit contributions
ags+0.030
aces+0.026
 hind+0.026
agged+0.026
ses+0.025
Neg logit contributions
T-0.016
 nurses-0.015
 lam-0.015
 Woo-0.014
 they-0.014
Token out
Token ID503
Feature activation+6.98e-01

Pos logit contributions
aces+0.005
ags+0.005
 hind+0.005
amed+0.004
endant+0.004
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
Pe-0.004
 lam-0.004
Token a
Token ID257
Feature activation+9.83e-01

Pos logit contributions
aces+0.019
 hind+0.018
ags+0.017
endant+0.017
amed+0.017
Neg logit contributions
burn-0.020
 Woo-0.020
Pe-0.018
 nurses-0.018
 paramed-0.017
Token beautiful
Token ID4950
Feature activation+1.19e+00

Pos logit contributions
aces+0.031
ags+0.030
agged+0.029
 hind+0.027
ses+0.026
Neg logit contributions
burn-0.025
 Woo-0.025
T-0.024
 lamb-0.022
 nurses-0.022
Token bl
Token ID698
Feature activation+3.80e+00

Pos logit contributions
aces+0.036
ags+0.034
 hind+0.034
agged+0.032
sed+0.031
Neg logit contributions
burn-0.031
 Woo-0.031
T-0.028
 lamb-0.028
 nurses-0.028
Tokenouse
Token ID1076
Feature activation+2.15e+00

Pos logit contributions
aces+0.062
 hind+0.060
ags+0.057
sed+0.051
endant+0.051
Neg logit contributions
burn-0.148
 Woo-0.147
 nurses-0.141
T-0.137
Pe-0.136
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
 hind+0.070
ags+0.069
aces+0.069
sed+0.064
endant+0.064
Neg logit contributions
burn-0.049
 Woo-0.048
 nurses-0.043
T-0.042
 lam-0.040
Token held
Token ID2714
Feature activation-5.47e-01

Pos logit contributions
ags+0.011
aces+0.011
 hind+0.011
agged+0.010
asp+0.010
Neg logit contributions
 Woo-0.008
burn-0.008
T-0.007
 nurses-0.007
 lam-0.007
Token it
Token ID340
Feature activation-1.98e-01

Pos logit contributions
burn+0.010
 Woo+0.010
 nurses+0.009
Pe+0.009
T+0.009
Neg logit contributions
aces-0.020
 hind-0.020
ags-0.019
endant-0.018
sed-0.018
Token up
Token ID510
Feature activation-1.84e+00

Pos logit contributions
burn+0.005
 Woo+0.005
T+0.004
 nurses+0.004
 lam+0.004
Neg logit contributions
ags-0.006
 hind-0.006
aces-0.006
ses-0.006
sed-0.006
Token you
Token ID345
Feature activation+1.44e+00

Pos logit contributions
burn+0.034
 Woo+0.032
 nurses+0.032
 paramed+0.030
 lamb+0.029
Neg logit contributions
aces-0.032
 hind-0.030
ags-0.029
endant-0.028
sed-0.028
Token want
Token ID765
Feature activation+1.14e+00

Pos logit contributions
 hind+0.050
sed+0.047
aces+0.046
ses+0.045
ags+0.045
Neg logit contributions
 Woo-0.030
burn-0.029
 nurses-0.027
T-0.026
 any-0.025
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
aces+0.042
 hind+0.041
ags+0.040
ushed+0.040
sed+0.039
Neg logit contributions
 Woo-0.023
burn-0.022
 nurses-0.020
T-0.019
Pe-0.018
Token try
Token ID1949
Feature activation+1.77e+00

Pos logit contributions
 hind+0.041
aces+0.041
ags+0.040
endant+0.038
sed+0.036
Neg logit contributions
 Woo-0.035
 nurses-0.034
burn-0.034
 lam-0.032
T-0.032
Token my
Token ID616
Feature activation+8.76e-01

Pos logit contributions
aces+0.052
 hind+0.051
ags+0.050
endant+0.047
sed+0.046
Neg logit contributions
burn-0.046
 Woo-0.045
 nurses-0.043
T-0.040
Pe-0.040
Token sc
Token ID629
Feature activation+3.80e+00

Pos logit contributions
aces+0.020
 hind+0.020
ags+0.019
endant+0.018
sed+0.018
Neg logit contributions
 Woo-0.028
burn-0.028
 nurses-0.027
T-0.026
Pe-0.025
Tokenooter
Token ID25141
Feature activation+2.48e+00

Pos logit contributions
aces+0.080
 hind+0.079
ags+0.075
sed+0.071
endant+0.069
Neg logit contributions
burn-0.128
 Woo-0.126
 nurses-0.120
T-0.116
Pe-0.115
Token?
Token ID30
Feature activation-1.47e+00

Pos logit contributions
 hind+0.066
sed+0.065
aces+0.064
ushed+0.063
ses+0.063
Neg logit contributions
burn-0.073
 Woo-0.070
 any-0.064
 nurses-0.063
T-0.061
Token It
Token ID632
Feature activation+5.84e-01

Pos logit contributions
 Woo+0.043
burn+0.042
 nurses+0.040
T+0.039
Pe+0.038
Neg logit contributions
 hind-0.038
aces-0.038
ags-0.037
sed-0.035
endant-0.035
Token is
Token ID318
Feature activation+7.63e-01

Pos logit contributions
 hind+0.019
ags+0.018
aces+0.018
agged+0.017
ushed+0.017
Neg logit contributions
burn-0.014
 Woo-0.014
 nurses-0.013
 lam-0.012
T-0.012
Token very
Token ID845
Feature activation+2.05e+00

Pos logit contributions
ushed+0.023
agged+0.022
aces+0.022
ags+0.021
 hind+0.021
Neg logit contributions
 Woo-0.021
burn-0.019
 any-0.018
 one-0.017
 Snow-0.017
Token off
Token ID572
Feature activation-1.50e-01

Pos logit contributions
aces+0.011
ags+0.011
 hind+0.011
 done+0.010
 hips+0.010
Neg logit contributions
burn-0.011
 Woo-0.010
Pe-0.009
 nurses-0.009
T-0.009
Token,
Token ID11
Feature activation+3.91e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
T+0.003
Pe+0.002
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
endant-0.005
sed-0.005
Token but
Token ID475
Feature activation+6.95e-01

Pos logit contributions
aces+0.012
 hind+0.011
ags+0.011
sed+0.010
endant+0.010
Neg logit contributions
burn-0.010
 Woo-0.010
 nurses-0.009
T-0.009
Pe-0.009
Token she
Token ID673
Feature activation+1.60e-01

Pos logit contributions
aces+0.031
ags+0.030
 hind+0.030
endant+0.028
sed+0.028
Neg logit contributions
 Woo-0.009
 nurses-0.008
burn-0.007
where-0.007
 lam-0.006
Token knew
Token ID2993
Feature activation+1.25e+00

Pos logit contributions
ags+0.006
 hind+0.006
aces+0.006
agged+0.005
endant+0.005
Neg logit contributions
T-0.003
 nurses-0.003
 Woo-0.003
where-0.003
burn-0.003
Token her
Token ID607
Feature activation+3.80e+00

Pos logit contributions
aces+0.047
 hind+0.046
ags+0.045
endant+0.044
sed+0.043
Neg logit contributions
burn-0.022
 Woo-0.021
 nurses-0.019
Pe-0.018
T-0.018
Token mum
Token ID25682
Feature activation+1.46e+00

Pos logit contributions
aces+0.135
ags+0.129
 hind+0.126
agged+0.121
amed+0.119
Neg logit contributions
 Woo-0.077
burn-0.074
 nurses-0.072
 lam-0.070
T-0.069
Token was
Token ID373
Feature activation-2.77e-02

Pos logit contributions
 hind+0.043
ags+0.041
aces+0.041
sed+0.037
endant+0.037
Neg logit contributions
burn-0.039
 Woo-0.039
 nurses-0.038
T-0.035
 lam-0.034
Token right
Token ID826
Feature activation-2.60e-01

Pos logit contributions
 Woo+0.001
burn+0.001
 nurses+0.001
T+0.001
Pe+0.001
Neg logit contributions
 hind-0.001
aces-0.001
sed-0.001
endant-0.001
ags-0.001
Token and
Token ID290
Feature activation-5.63e-01

Pos logit contributions
burn+0.006
 Woo+0.006
 nurses+0.006
T+0.005
Pe+0.005
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.008
sed-0.008
endant-0.008
Token started
Token ID2067
Feature activation-3.51e-01

Pos logit contributions
 nurses+0.011
 Woo+0.010
burn+0.010
 lam+0.009
 paramed+0.009
Neg logit contributions
aces-0.021
ags-0.021
 hind-0.021
agged-0.019
asp-0.019
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
burn+0.023
 Woo+0.023
 nurses+0.021
T+0.020
 lam+0.019
Neg logit contributions
 hind-0.029
aces-0.029
ags-0.029
endant-0.027
sed-0.027
Token went
Token ID1816
Feature activation-1.11e+00

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
 lam+0.003
T+0.003
Neg logit contributions
ags-0.004
aces-0.004
 hind-0.004
agged-0.004
endant-0.004
Token to
Token ID284
Feature activation-1.38e-01

Pos logit contributions
burn+0.026
 Woo+0.026
 nurses+0.024
T+0.022
 lam+0.022
Neg logit contributions
aces-0.036
 hind-0.035
ags-0.034
sed-0.033
endant-0.033
Token the
Token ID262
Feature activation-1.87e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.002
T+0.002
Pe+0.002
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
endant-0.005
ushed-0.005
Token to
Token ID284
Feature activation-3.16e-01

Pos logit contributions
 nurses+0.005
burn+0.005
 Woo+0.005
 lamb+0.004
 lam+0.004
Neg logit contributions
aces-0.006
agged-0.006
ags-0.005
ushed-0.005
 hind-0.005
Tokenaster
Token ID1603
Feature activation-4.16e+00

Pos logit contributions
 Woo+0.006
burn+0.006
 nurses+0.006
T+0.005
Pe+0.005
Neg logit contributions
aces-0.012
 hind-0.011
ags-0.011
ushed-0.011
endant-0.011
Token.
Token ID13
Feature activation-6.85e-01

Pos logit contributions
burn+0.074
 Woo+0.073
 nurses+0.065
T+0.060
 any+0.058
Neg logit contributions
 hind-0.155
aces-0.149
endant-0.148
ags-0.147
sed-0.143
Token She
Token ID1375
Feature activation+4.32e-01

Pos logit contributions
 Woo+0.015
burn+0.014
 nurses+0.013
T+0.013
Pe+0.013
Neg logit contributions
aces-0.024
 hind-0.024
ags-0.023
amed-0.022
endant-0.022
Token saw
Token ID2497
Feature activation-4.89e-02

Pos logit contributions
ags+0.016
 hind+0.015
aces+0.014
agged+0.014
endant+0.013
Neg logit contributions
T-0.009
 nurses-0.009
 Woo-0.009
burn-0.009
 lam-0.008
Token that
Token ID326
Feature activation-5.88e-01

Pos logit contributions
burn+0.001
 Woo+0.001
 nurses+0.001
T+0.001
Pe+0.001
Neg logit contributions
aces-0.002
 hind-0.002
ags-0.001
sed-0.001
endant-0.001
Token the
Token ID262
Feature activation-3.99e-02

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.010
Pe+0.009
Neg logit contributions
aces-0.021
 hind-0.021
ags-0.021
endant-0.020
sed-0.020
Token of
Token ID286
Feature activation+6.51e-01

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
T+0.002
 lam+0.002
Neg logit contributions
 hind-0.003
aces-0.003
ags-0.003
sed-0.003
endant-0.003
Token bread
Token ID8509
Feature activation-1.09e+00

Pos logit contributions
aces+0.017
 hind+0.016
ushed+0.016
amed+0.015
ags+0.015
Neg logit contributions
 nurses-0.019
 Woo-0.019
burn-0.018
 lam-0.018
 any-0.017
Token in
Token ID287
Feature activation+9.86e-01

Pos logit contributions
burn+0.021
 Woo+0.020
 nurses+0.018
 lam+0.018
 any+0.017
Neg logit contributions
 hind-0.040
ags-0.037
endant-0.035
ieved-0.035
aces-0.035
Token the
Token ID262
Feature activation-3.28e-01

Pos logit contributions
aces+0.033
 hind+0.033
ags+0.032
sed+0.030
endant+0.030
Neg logit contributions
burn-0.021
 Woo-0.021
 nurses-0.020
T-0.018
Pe-0.018
Token to
Token ID284
Feature activation-5.65e-01

Pos logit contributions
 nurses+0.009
burn+0.009
 Woo+0.009
 lam+0.009
 lamb+0.009
Neg logit contributions
aces-0.009
 hind-0.008
ushed-0.008
endant-0.008
agged-0.008
Tokenaster
Token ID1603
Feature activation-4.08e+00

Pos logit contributions
 Woo+0.006
burn+0.006
 nurses+0.005
T+0.005
Pe+0.005
Neg logit contributions
aces-0.026
ushed-0.024
amed-0.024
endant-0.024
 hind-0.024
Token and
Token ID290
Feature activation-9.03e-01

Pos logit contributions
burn+0.074
 Woo+0.071
 nurses+0.066
 lam+0.061
 any+0.059
Neg logit contributions
 hind-0.151
endant-0.144
ags-0.143
aces-0.140
sed-0.138
Token pushed
Token ID7121
Feature activation-1.41e+00

Pos logit contributions
burn+0.022
 nurses+0.022
 Woo+0.021
 lam+0.019
T+0.018
Neg logit contributions
ags-0.029
aces-0.029
 hind-0.029
agged-0.028
endant-0.026
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
burn+0.032
 Woo+0.032
 nurses+0.030
Pe+0.028
T+0.027
Neg logit contributions
aces-0.046
ags-0.044
 hind-0.044
sed-0.040
amed-0.039
Token lever
Token ID17124
Feature activation-2.40e+00

Pos logit contributions
aces+0.008
ags+0.008
agged+0.007
 hind+0.007
endant+0.007
Neg logit contributions
burn-0.010
 nurses-0.009
 Woo-0.009
 lam-0.009
 lamb-0.009
Token down
Token ID866
Feature activation-1.53e-01

Pos logit contributions
 Woo+0.061
burn+0.060
 nurses+0.057
Pe+0.054
T+0.054
Neg logit contributions
aces-0.073
 hind-0.070
ags-0.069
sed-0.065
endant-0.064
Token put
Token ID1234
Feature activation-1.12e+00

Pos logit contributions
ags+0.000
agged+0.000
aces+0.000
 hind+0.000
endant+0.000
Neg logit contributions
burn-0.000
 nurses-0.000
 Woo-0.000
 lam-0.000
 one-0.000
Token it
Token ID340
Feature activation-1.03e-01

Pos logit contributions
 Woo+0.019
burn+0.019
 nurses+0.016
T+0.015
 lam+0.015
Neg logit contributions
 hind-0.044
aces-0.043
ags-0.042
sed-0.041
endant-0.041
Token in
Token ID287
Feature activation+8.44e-01

Pos logit contributions
burn+0.002
 any+0.002
 Woo+0.001
 one+0.001
 nurses+0.001
Neg logit contributions
ses-0.004
 hind-0.004
sed-0.004
 ey-0.004
ushed-0.004
Token the
Token ID262
Feature activation-4.03e-01

Pos logit contributions
aces+0.031
 hind+0.030
ags+0.029
sed+0.029
endant+0.028
Neg logit contributions
 Woo-0.017
burn-0.016
 nurses-0.015
T-0.014
Pe-0.014
Token to
Token ID284
Feature activation-3.35e-01

Pos logit contributions
 nurses+0.011
 Woo+0.011
burn+0.010
 lam+0.010
 lamb+0.010
Neg logit contributions
aces-0.012
agged-0.011
ushed-0.011
 hind-0.011
endant-0.011
Tokenaster
Token ID1603
Feature activation-3.93e+00

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
 any+0.003
T+0.002
Neg logit contributions
aces-0.016
sed-0.015
endant-0.015
ushed-0.015
amed-0.015
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
burn+0.072
 Woo+0.068
 nurses+0.062
 lam+0.058
T+0.057
Neg logit contributions
 hind-0.146
ags-0.141
endant-0.140
aces-0.139
sed-0.138
Token She
Token ID1375
Feature activation+7.65e-01

Pos logit contributions
aces+0.053
 hind+0.051
endant+0.050
ags+0.048
sed+0.047
Neg logit contributions
 Woo-0.033
T-0.028
burn-0.027
where-0.026
 St-0.025
Token took
Token ID1718
Feature activation-1.58e-01

Pos logit contributions
ags+0.031
agged+0.027
 hind+0.026
aces+0.025
ses+0.025
Neg logit contributions
T-0.016
 nurses-0.015
burn-0.015
 lam-0.015
 one-0.015
Token a
Token ID257
Feature activation-1.08e-02

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.004
T+0.003
Pe+0.003
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
sed-0.004
endant-0.004
Token seat
Token ID5852
Feature activation-3.50e-01

Pos logit contributions
burn+0.000
 Woo+0.000
 nurses+0.000
T+0.000
 lam+0.000
Neg logit contributions
aces-0.000
ags-0.000
 hind-0.000
agged-0.000
sed-0.000
Token the
Token ID262
Feature activation+3.75e-01

Pos logit contributions
 Woo+0.022
burn+0.022
 nurses+0.020
T+0.019
Pe+0.017
Neg logit contributions
 hind-0.046
aces-0.045
sed-0.042
endant-0.042
ags-0.042
Token toast
Token ID27805
Feature activation+1.34e-01

Pos logit contributions
aces+0.009
 hind+0.009
ags+0.009
endant+0.008
agged+0.008
Neg logit contributions
burn-0.012
 nurses-0.011
 Woo-0.011
T-0.010
 any-0.010
Token in
Token ID287
Feature activation+6.98e-01

Pos logit contributions
 hind+0.005
ags+0.004
aces+0.004
endant+0.004
sed+0.004
Neg logit contributions
burn-0.003
 Woo-0.003
 nurses-0.002
T-0.002
 any-0.002
Token the
Token ID262
Feature activation-2.82e-01

Pos logit contributions
aces+0.027
 hind+0.026
endant+0.025
sed+0.025
ags+0.025
Neg logit contributions
 Woo-0.013
burn-0.012
 nurses-0.011
T-0.011
Pe-0.010
Token to
Token ID284
Feature activation-5.81e-01

Pos logit contributions
 nurses+0.008
 Woo+0.008
burn+0.008
 paramed+0.007
 lam+0.007
Neg logit contributions
aces-0.008
 hind-0.007
ags-0.007
endant-0.007
agged-0.007
Tokenaster
Token ID1603
Feature activation-3.92e+00

Pos logit contributions
 Woo+0.007
burn+0.007
 nurses+0.007
T+0.006
Pe+0.006
Neg logit contributions
aces-0.026
 hind-0.024
sed-0.024
endant-0.024
ags-0.024
Token and
Token ID290
Feature activation-7.72e-01

Pos logit contributions
burn+0.076
 Woo+0.075
 nurses+0.068
 lam+0.063
T+0.063
Neg logit contributions
 hind-0.138
ags-0.137
aces-0.134
endant-0.132
agged-0.129
Token waited
Token ID13488
Feature activation-2.80e-01

Pos logit contributions
 Woo+0.020
 nurses+0.019
burn+0.019
T+0.018
 lam+0.016
Neg logit contributions
ags-0.025
aces-0.025
 hind-0.024
agged-0.023
endant-0.022
Token for
Token ID329
Feature activation+3.11e-01

Pos logit contributions
 Woo+0.006
burn+0.005
 nurses+0.005
T+0.005
 lam+0.005
Neg logit contributions
aces-0.010
 hind-0.009
ags-0.009
endant-0.009
sed-0.009
Token it
Token ID340
Feature activation-6.42e-01

Pos logit contributions
aces+0.012
 hind+0.011
ags+0.011
sed+0.011
endant+0.011
Neg logit contributions
 Woo-0.006
burn-0.006
 nurses-0.005
T-0.005
Pe-0.005
Token to
Token ID284
Feature activation-4.27e-01

Pos logit contributions
burn+0.013
 Woo+0.012
 nurses+0.011
T+0.011
 any+0.010
Neg logit contributions
 hind-0.023
ags-0.023
aces-0.022
endant-0.021
 ey-0.021
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
burn+0.035
 Woo+0.034
 nurses+0.032
T+0.030
Pe+0.030
Neg logit contributions
aces-0.044
 hind-0.043
ags-0.042
endant-0.039
sed-0.039
Token She
Token ID1375
Feature activation-9.94e-02

Pos logit contributions
 Woo+0.003
 nurses+0.002
T+0.002
burn+0.002
Pe+0.002
Neg logit contributions
 hind-0.005
aces-0.005
endant-0.004
ags-0.004
amed-0.004
Token grabbed
Token ID13176
Feature activation-7.40e-02

Pos logit contributions
 nurses+0.003
 Woo+0.002
burn+0.002
T+0.002
 lam+0.002
Neg logit contributions
aces-0.003
ags-0.003
 hind-0.003
endant-0.003
agged-0.003
Token the
Token ID262
Feature activation-3.94e-01

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
Pe+0.002
 lam+0.002
Neg logit contributions
aces-0.002
ags-0.002
 hind-0.002
sed-0.002
ses-0.002
Token to
Token ID284
Feature activation-1.46e-01

Pos logit contributions
burn+0.013
 Woo+0.012
 nurses+0.012
 lam+0.011
 lamb+0.011
Neg logit contributions
aces-0.010
 hind-0.009
ags-0.009
ushed-0.008
endant-0.008
Tokenaster
Token ID1603
Feature activation-3.92e+00

Pos logit contributions
burn+0.004
 Woo+0.003
 nurses+0.003
 lam+0.003
Pe+0.003
Neg logit contributions
 hind-0.004
ags-0.004
aces-0.004
sed-0.004
ses-0.004
Token and
Token ID290
Feature activation-3.85e-01

Pos logit contributions
burn+0.093
 Woo+0.090
 nurses+0.085
 lam+0.078
T+0.078
Neg logit contributions
 hind-0.124
aces-0.124
ags-0.120
endant-0.116
sed-0.114
Token threw
Token ID9617
Feature activation-1.25e+00

Pos logit contributions
 Woo+0.010
 nurses+0.010
burn+0.010
 lam+0.009
T+0.009
Neg logit contributions
aces-0.012
ags-0.012
 hind-0.011
endant-0.011
agged-0.011
Token it
Token ID340
Feature activation-1.02e+00

Pos logit contributions
 Woo+0.024
burn+0.023
 nurses+0.021
T+0.020
 lam+0.019
Neg logit contributions
aces-0.046
 hind-0.046
ags-0.044
endant-0.043
sed-0.042
Token in
Token ID287
Feature activation+6.79e-01

Pos logit contributions
burn+0.023
 Woo+0.022
 any+0.020
 nurses+0.020
 lam+0.019
Neg logit contributions
 hind-0.033
aces-0.032
endant-0.031
ags-0.031
ushed-0.031
Token the
Token ID262
Feature activation-7.03e-01

Pos logit contributions
aces+0.023
 hind+0.023
ags+0.022
sed+0.021
endant+0.021
Neg logit contributions
burn-0.014
 Woo-0.014
 nurses-0.013
T-0.012
Pe-0.012
Token more
Token ID517
Feature activation+4.58e-01

Pos logit contributions
burn+0.028
 Woo+0.028
 nurses+0.026
T+0.024
 lam+0.023
Neg logit contributions
 hind-0.046
aces-0.046
ags-0.043
endant-0.042
sed-0.042
Token bread
Token ID8509
Feature activation-7.12e-01

Pos logit contributions
 hind+0.012
sed+0.012
ses+0.012
acle+0.011
ushed+0.011
Neg logit contributions
burn-0.013
 nurses-0.012
 Woo-0.012
 lam-0.012
 any-0.011
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
burn+0.015
 Woo+0.014
 nurses+0.012
T+0.012
 lam+0.012
Neg logit contributions
 hind-0.025
ags-0.023
aces-0.023
sed-0.023
endant-0.022
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
aces+0.039
 hind+0.038
ags+0.036
endant+0.036
sed+0.036
Neg logit contributions
 Woo-0.018
burn-0.018
 nurses-0.016
T-0.015
Pe-0.015
Token to
Token ID284
Feature activation-3.54e-01

Pos logit contributions
burn+0.010
 Woo+0.010
 nurses+0.010
 lam+0.009
 lamb+0.009
Neg logit contributions
aces-0.008
 hind-0.008
ushed-0.008
endant-0.007
amed-0.007
Tokenaster
Token ID1603
Feature activation-3.90e+00

Pos logit contributions
 Woo+0.005
burn+0.005
 nurses+0.004
Pe+0.004
T+0.004
Neg logit contributions
aces-0.015
 hind-0.014
endant-0.014
ushed-0.014
sed-0.014
Token and
Token ID290
Feature activation-6.55e-01

Pos logit contributions
burn+0.069
 Woo+0.065
 nurses+0.060
 any+0.055
 lam+0.054
Neg logit contributions
 hind-0.150
endant-0.139
ags-0.138
efully-0.135
aces-0.134
Token turned
Token ID2900
Feature activation-6.54e-01

Pos logit contributions
burn+0.016
 nurses+0.015
 Woo+0.015
 they+0.013
 one+0.013
Neg logit contributions
aces-0.022
ags-0.022
 hind-0.022
agged-0.021
endant-0.020
Token it
Token ID340
Feature activation-2.30e-01

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.013
T+0.012
 lam+0.011
Neg logit contributions
aces-0.023
 hind-0.022
ags-0.022
sed-0.021
endant-0.021
Token on
Token ID319
Feature activation-1.33e-01

Pos logit contributions
burn+0.004
 any+0.003
 one+0.003
 Woo+0.003
 them+0.003
Neg logit contributions
 hind-0.010
 ey-0.009
endant-0.009
sed-0.009
ses-0.009
Token.
Token ID13
Feature activation-5.22e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
T+0.003
Pe+0.003
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.004
sed-0.004
endant-0.004
Token a
Token ID257
Feature activation-1.52e+00

Pos logit contributions
 Woo+0.011
burn+0.011
 nurses+0.011
T+0.010
 any+0.009
Neg logit contributions
 hind-0.018
aces-0.017
ags-0.017
ushed-0.016
endant-0.016
Token bad
Token ID2089
Feature activation-1.34e+00

Pos logit contributions
burn+0.041
 Woo+0.040
 nurses+0.038
T+0.036
Pe+0.035
Neg logit contributions
aces-0.044
 hind-0.042
ags-0.042
endant-0.039
sed-0.039
Token injury
Token ID5095
Feature activation-2.04e+00

Pos logit contributions
 Woo+0.036
burn+0.035
T+0.033
 nurses+0.033
Pe+0.033
Neg logit contributions
aces-0.037
 hind-0.036
ags-0.036
sed-0.036
ses-0.035
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 Woo+0.040
burn+0.040
 nurses+0.037
T+0.033
 lam+0.031
Neg logit contributions
 hind-0.076
ags-0.071
endant-0.070
aces-0.069
efully-0.067
Token needed
Token ID2622
Feature activation-6.14e-01

Pos logit contributions
 nurses+0.020
 Woo+0.020
burn+0.019
 lam+0.015
T+0.015
Neg logit contributions
ags-0.039
 hind-0.039
aces-0.039
asp-0.036
agged-0.036
Token surgery
Token ID8185
Feature activation-3.89e+00

Pos logit contributions
burn+0.015
 Woo+0.014
 nurses+0.013
T+0.013
Pe+0.013
Neg logit contributions
aces-0.019
 hind-0.019
ags-0.018
sed-0.017
endant-0.017
Token.
Token ID13
Feature activation-8.57e-01

Pos logit contributions
burn+0.070
 Woo+0.068
 nurses+0.062
 any+0.054
 lam+0.053
Neg logit contributions
 hind-0.148
ags-0.142
endant-0.142
aces-0.140
 ey-0.139
Token He
Token ID679
Feature activation-4.79e-01

Pos logit contributions
 Woo+0.021
 nurses+0.019
Pe+0.019
burn+0.019
T+0.018
Neg logit contributions
aces-0.028
 hind-0.027
ags-0.026
amed-0.025
sed-0.025
Token said
Token ID531
Feature activation+2.52e-01

Pos logit contributions
 nurses+0.014
 Woo+0.013
burn+0.013
T+0.012
 lam+0.012
Neg logit contributions
ags-0.014
 hind-0.013
aces-0.013
asp-0.012
endant-0.012
Token they
Token ID484
Feature activation-4.40e-01

Pos logit contributions
aces+0.008
 hind+0.008
sed+0.008
ags+0.007
endant+0.007
Neg logit contributions
burn-0.006
 Woo-0.005
T-0.005
 lam-0.005
 nurses-0.005
Token were
Token ID547
Feature activation-5.73e-01

Pos logit contributions
 Woo+0.015
burn+0.015
 nurses+0.015
T+0.014
 lam+0.013
Neg logit contributions
 hind-0.010
aces-0.009
ags-0.009
endant-0.008
sed-0.008
Token put
Token ID1234
Feature activation-1.14e+00

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.008
 lam+0.007
T+0.007
Neg logit contributions
ags-0.009
aces-0.009
 hind-0.008
agged-0.008
endant-0.008
Token it
Token ID340
Feature activation-8.56e-02

Pos logit contributions
 Woo+0.017
burn+0.017
 nurses+0.014
T+0.014
 lam+0.013
Neg logit contributions
 hind-0.047
aces-0.046
ags-0.044
sed-0.044
endant-0.043
Token in
Token ID287
Feature activation+1.04e+00

Pos logit contributions
burn+0.002
 any+0.001
 Woo+0.001
 nurses+0.001
 one+0.001
Neg logit contributions
 hind-0.003
ses-0.003
sed-0.003
 ey-0.003
ags-0.003
Token the
Token ID262
Feature activation-3.72e-01

Pos logit contributions
aces+0.039
 hind+0.038
ags+0.037
endant+0.036
sed+0.036
Neg logit contributions
 Woo-0.020
burn-0.019
 nurses-0.017
T-0.017
Pe-0.016
Token to
Token ID284
Feature activation-8.75e-02

Pos logit contributions
 nurses+0.011
burn+0.011
 Woo+0.010
 lam+0.010
 lamb+0.010
Neg logit contributions
aces-0.010
ushed-0.009
 hind-0.009
agged-0.009
endant-0.009
Tokenaster
Token ID1603
Feature activation-3.86e+00

Pos logit contributions
 Woo+0.001
burn+0.001
 nurses+0.001
T+0.001
Pe+0.001
Neg logit contributions
aces-0.004
 hind-0.003
ags-0.003
endant-0.003
sed-0.003
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
burn+0.066
 Woo+0.061
 nurses+0.057
 lam+0.054
 any+0.054
Neg logit contributions
 hind-0.146
ags-0.141
endant-0.140
sed-0.136
agged-0.136
Token Then
Token ID3244
Feature activation+5.77e-01

Pos logit contributions
 hind+0.026
aces+0.026
ags+0.026
endant+0.025
sed+0.024
Neg logit contributions
 Woo-0.018
T-0.016
burn-0.015
Pe-0.015
 nurses-0.014
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
aces+0.021
 hind+0.020
ags+0.019
sed+0.019
endant+0.018
Neg logit contributions
burn-0.011
 Woo-0.010
 nurses-0.009
Pe-0.009
T-0.009
Token she
Token ID673
Feature activation+3.56e-01

Pos logit contributions
aces+0.040
 hind+0.039
ags+0.038
sed+0.037
endant+0.037
Neg logit contributions
burn-0.016
 Woo-0.016
 nurses-0.014
T-0.013
Pe-0.013
Token went
Token ID1816
Feature activation+3.06e-01

Pos logit contributions
ags+0.014
 hind+0.012
agged+0.012
aces+0.012
ses+0.011
Neg logit contributions
 nurses-0.008
T-0.008
burn-0.008
 lam-0.007
 Woo-0.007
Token some
Token ID617
Feature activation+2.93e-01

Pos logit contributions
 Woo+0.023
burn+0.023
 nurses+0.021
T+0.020
 lam+0.018
Neg logit contributions
 hind-0.044
aces-0.044
sed-0.042
endant-0.042
ags-0.041
Token bread
Token ID8509
Feature activation-5.80e-01

Pos logit contributions
 hind+0.008
sed+0.008
aces+0.008
ses+0.008
endant+0.008
Neg logit contributions
 nurses-0.008
burn-0.008
 Woo-0.007
 lam-0.007
 lamb-0.007
Token in
Token ID287
Feature activation+9.50e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.009
T+0.009
 lam+0.009
Neg logit contributions
 hind-0.021
ags-0.020
aces-0.019
endant-0.019
ses-0.019
Token the
Token ID262
Feature activation-3.86e-01

Pos logit contributions
aces+0.035
 hind+0.034
ags+0.033
sed+0.033
endant+0.033
Neg logit contributions
 Woo-0.018
burn-0.018
 nurses-0.016
T-0.016
Pe-0.015
Token to
Token ID284
Feature activation-3.19e-01

Pos logit contributions
burn+0.012
 Woo+0.011
 nurses+0.011
 lam+0.011
 lamb+0.011
Neg logit contributions
aces-0.010
 hind-0.010
endant-0.009
ushed-0.009
ags-0.009
Tokenaster
Token ID1603
Feature activation-3.84e+00

Pos logit contributions
 Woo+0.005
burn+0.005
 nurses+0.005
T+0.004
Pe+0.004
Neg logit contributions
aces-0.013
 hind-0.012
endant-0.012
ags-0.012
sed-0.012
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
burn+0.069
 Woo+0.066
 nurses+0.061
 lam+0.055
T+0.054
Neg logit contributions
 hind-0.143
endant-0.137
ags-0.137
aces-0.134
agged-0.132
Token He
Token ID679
Feature activation+1.34e-01

Pos logit contributions
 hind+0.006
aces+0.006
endant+0.006
ags+0.006
amed+0.006
Neg logit contributions
 Woo-0.005
T-0.004
burn-0.004
 nurses-0.004
Pe-0.004
Token goes
Token ID2925
Feature activation-2.06e-01

Pos logit contributions
 hind+0.004
ags+0.004
aces+0.004
agged+0.003
amed+0.003
Neg logit contributions
 nurses-0.004
 Woo-0.004
burn-0.004
 lam-0.003
T-0.003
Token to
Token ID284
Feature activation+8.11e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.004
T+0.003
 lam+0.003
Neg logit contributions
aces-0.007
 hind-0.007
ags-0.007
sed-0.007
endant-0.007
Token his
Token ID465
Feature activation-8.33e-01

Pos logit contributions
aces+0.032
 hind+0.029
endant+0.029
ushed+0.029
ags+0.029
Neg logit contributions
 Woo-0.015
burn-0.014
 nurses-0.014
T-0.013
Pe-0.012
Token see
Token ID766
Feature activation-1.01e+00

Pos logit contributions
 Woo+0.025
burn+0.024
Pe+0.022
 nurses+0.022
 lam+0.022
Neg logit contributions
aces-0.022
ags-0.021
 hind-0.020
 done-0.020
amed-0.019
Token Ben
Token ID3932
Feature activation-1.41e+00

Pos logit contributions
burn+0.024
 Woo+0.023
Pe+0.021
T+0.021
 nurses+0.020
Neg logit contributions
aces-0.032
 hind-0.031
ags-0.031
sed-0.029
ses-0.029
Token and
Token ID290
Feature activation+8.09e-01

Pos logit contributions
 Woo+0.039
burn+0.037
 nurses+0.036
Pe+0.036
 lam+0.036
Neg logit contributions
aces-0.039
 hind-0.034
ags-0.034
 done-0.034
sed-0.032
Token the
Token ID262
Feature activation-7.90e-01

Pos logit contributions
ags+0.030
aces+0.030
 hind+0.029
amed+0.029
agged+0.028
Neg logit contributions
 Woo-0.015
 nurses-0.015
burn-0.014
 any-0.012
 lam-0.012
Token to
Token ID284
Feature activation-6.36e-01

Pos logit contributions
burn+0.023
 Woo+0.022
 nurses+0.021
 lam+0.020
Pe+0.019
Neg logit contributions
aces-0.021
 hind-0.021
ags-0.020
endant-0.019
ushed-0.019
Tokenaster
Token ID1603
Feature activation-3.81e+00

Pos logit contributions
burn+0.016
 Woo+0.015
T+0.014
 nurses+0.014
 lam+0.014
Neg logit contributions
 hind-0.019
ags-0.019
aces-0.019
sed-0.017
ses-0.016
Token.
Token ID13
Feature activation+9.91e-02

Pos logit contributions
burn+0.093
 Woo+0.092
 nurses+0.086
T+0.082
Pe+0.081
Neg logit contributions
aces-0.118
 hind-0.115
ags-0.112
sed-0.107
endant-0.106
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
aces+0.003
 hind+0.003
ags+0.003
endant+0.003
sed+0.003
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
T-0.002
Pe-0.002
Token
Token ID198
Feature activation-7.11e-02

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.013
T+0.012
Pe+0.012
Neg logit contributions
aces-0.016
 hind-0.016
ags-0.015
sed-0.014
endant-0.014
TokenThe
Token ID464
Feature activation-7.73e-01

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
 lam+0.002
 lamb+0.002
Neg logit contributions
ags-0.002
aces-0.002
 hind-0.002
amed-0.002
efully-0.002
Token toast
Token ID27805
Feature activation-3.04e-01

Pos logit contributions
burn+0.020
 nurses+0.019
 Woo+0.019
Pe+0.017
 lam+0.017
Neg logit contributions
aces-0.024
 hind-0.023
ags-0.022
endant-0.022
ushed-0.021
Token put
Token ID1234
Feature activation-8.54e-01

Pos logit contributions
ags+0.007
 hind+0.007
agged+0.006
 Deal+0.006
aces+0.006
Neg logit contributions
 nurses-0.003
 one-0.003
T-0.003
 they-0.003
burn-0.003
Token it
Token ID340
Feature activation-1.66e-02

Pos logit contributions
 Woo+0.014
burn+0.014
 nurses+0.013
 any+0.012
T+0.012
Neg logit contributions
 hind-0.034
aces-0.032
sed-0.032
endant-0.031
ses-0.031
Token in
Token ID287
Feature activation+6.71e-01

Pos logit contributions
burn+0.000
 any+0.000
 Woo+0.000
 one+0.000
 nurses+0.000
Neg logit contributions
 hind-0.001
ses-0.001
sed-0.001
endant-0.001
 ey-0.001
Token the
Token ID262
Feature activation-2.39e-01

Pos logit contributions
aces+0.028
sed+0.027
 hind+0.027
ushed+0.026
endant+0.026
Neg logit contributions
 Woo-0.011
burn-0.010
T-0.010
 nurses-0.009
Pe-0.009
Token to
Token ID284
Feature activation-8.37e-01

Pos logit contributions
burn+0.006
 nurses+0.006
 paramed+0.006
 Woo+0.006
 one+0.006
Neg logit contributions
 hind-0.007
aces-0.007
ushed-0.006
endant-0.006
ieved-0.006
Tokenaster
Token ID1603
Feature activation-3.79e+00

Pos logit contributions
 nurses+0.011
 Woo+0.011
burn+0.010
 any+0.010
T+0.009
Neg logit contributions
aces-0.036
sed-0.034
 hind-0.034
endant-0.034
ushed-0.033
Token and
Token ID290
Feature activation-3.81e-01

Pos logit contributions
burn+0.059
 Woo+0.058
 one+0.050
 nurses+0.049
 any+0.048
Neg logit contributions
 hind-0.151
endant-0.146
ags-0.142
efully-0.141
agged-0.138
Token waited
Token ID13488
Feature activation-5.31e-01

Pos logit contributions
 nurses+0.008
 they+0.008
 one+0.008
burn+0.007
 Woo+0.007
Neg logit contributions
ags-0.015
 hind-0.014
agged-0.014
aces-0.014
endant-0.012
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
 Woo+0.009
burn+0.008
 any+0.008
 nurses+0.007
T+0.007
Neg logit contributions
aces-0.020
endant-0.020
 hind-0.019
ags-0.019
amed-0.019
Token When
Token ID1649
Feature activation-8.75e-01

Pos logit contributions
 Woo+0.000
 nurses+0.000
 paramed+0.000
 Cand+0.000
T+0.000
Neg logit contributions
 hind-0.000
endant-0.000
igator-0.000
ak-0.000
ags-0.000
Token it
Token ID340
Feature activation+7.13e-01

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.012
T+0.012
 lam+0.011
Neg logit contributions
aces-0.033
 hind-0.033
ags-0.032
sed-0.031
endant-0.031
Token the
Token ID262
Feature activation-5.69e-01

Pos logit contributions
 Woo+0.004
burn+0.004
 nurses+0.004
T+0.004
 any+0.004
Neg logit contributions
aces-0.010
amed-0.010
ags-0.010
 hind-0.009
agged-0.009
Token stove
Token ID27635
Feature activation-6.54e-01

Pos logit contributions
burn+0.015
 Woo+0.015
 nurses+0.014
T+0.014
 lam+0.013
Neg logit contributions
aces-0.016
 hind-0.016
ags-0.016
endant-0.015
sed-0.015
Token and
Token ID290
Feature activation-3.76e-01

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.013
T+0.012
 lam+0.012
Neg logit contributions
aces-0.021
 hind-0.021
ags-0.021
sed-0.020
endant-0.020
Token the
Token ID262
Feature activation-6.96e-01

Pos logit contributions
 nurses+0.008
 Woo+0.007
burn+0.007
 any+0.007
 they+0.007
Neg logit contributions
ags-0.013
agged-0.013
aces-0.013
amed-0.013
 hind-0.012
Token to
Token ID284
Feature activation-7.38e-01

Pos logit contributions
burn+0.018
 Woo+0.018
 nurses+0.018
 lam+0.016
T+0.016
Neg logit contributions
aces-0.021
 hind-0.020
ags-0.020
endant-0.019
ushed-0.019
Tokenaster
Token ID1603
Feature activation-3.76e+00

Pos logit contributions
 Woo+0.012
burn+0.012
 nurses+0.011
T+0.010
Pe+0.010
Neg logit contributions
aces-0.029
 hind-0.028
ags-0.027
endant-0.027
sed-0.027
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
burn+0.074
 Woo+0.071
 nurses+0.066
 lam+0.060
T+0.059
Neg logit contributions
 hind-0.135
aces-0.130
ags-0.129
endant-0.124
sed-0.123
Token
Token ID198
Feature activation-4.01e-01

Pos logit contributions
aces+0.001
 hind+0.001
ags+0.001
endant+0.001
amed+0.001
Neg logit contributions
 Woo-0.001
T-0.001
Pe-0.001
burn-0.001
 nurses-0.001
Token
Token ID198
Feature activation-6.14e-01

Pos logit contributions
 Woo+0.009
burn+0.009
T+0.008
 nurses+0.008
Pe+0.008
Neg logit contributions
aces-0.013
 hind-0.013
ags-0.012
sed-0.012
endant-0.012
TokenSam
Token ID16305
Feature activation-1.27e+00

Pos logit contributions
burn+0.015
 Woo+0.014
 nurses+0.014
 lam+0.013
 lamb+0.013
Neg logit contributions
aces-0.019
 hind-0.018
ags-0.018
amed-0.017
ushed-0.017
Token wakes
Token ID34267
Feature activation-1.72e+00

Pos logit contributions
 nurses+0.032
burn+0.031
 Woo+0.031
 lam+0.028
T+0.028
Neg logit contributions
 hind-0.040
ags-0.037
aces-0.036
endant-0.036
sed-0.036
Token on
Token ID319
Feature activation-7.27e-02

Pos logit contributions
burn+0.021
T+0.020
 nurses+0.020
 Woo+0.020
Pe+0.020
Neg logit contributions
ags-0.018
 hind-0.018
aces-0.018
amed-0.016
agged-0.016
Token a
Token ID257
Feature activation-6.51e-01

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
Pe+0.002
T+0.002
Neg logit contributions
 hind-0.002
ags-0.002
aces-0.002
sed-0.002
 hips-0.002
Token scary
Token ID14343
Feature activation-4.65e-01

Pos logit contributions
burn+0.019
 Woo+0.019
 nurses+0.018
T+0.017
Pe+0.017
Neg logit contributions
 hind-0.017
aces-0.016
ags-0.015
endant-0.014
sed-0.014
Token roller
Token ID24471
Feature activation-1.56e+00

Pos logit contributions
burn+0.014
 Woo+0.014
T+0.014
 nurses+0.014
Pe+0.013
Neg logit contributions
 hind-0.012
aces-0.010
sed-0.010
ags-0.010
ses-0.010
Tokenco
Token ID1073
Feature activation-1.52e-01

Pos logit contributions
 Woo+0.043
burn+0.043
 nurses+0.042
T+0.040
 lam+0.039
Neg logit contributions
aces-0.043
 hind-0.041
ags-0.039
sed-0.037
amed-0.035
Tokenaster
Token ID1603
Feature activation-3.75e+00

Pos logit contributions
 Woo+0.004
burn+0.004
 nurses+0.004
T+0.004
Pe+0.004
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.004
endant-0.003
ushed-0.003
Token ride
Token ID6594
Feature activation-1.58e+00

Pos logit contributions
burn+0.092
 Woo+0.092
 nurses+0.086
Pe+0.082
T+0.082
Neg logit contributions
aces-0.116
 hind-0.111
ags-0.108
sed-0.103
endant-0.102
Token.
Token ID13
Feature activation-9.90e-02

Pos logit contributions
burn+0.039
 Woo+0.039
 nurses+0.037
T+0.034
Pe+0.034
Neg logit contributions
aces-0.048
 hind-0.046
ags-0.046
sed-0.043
endant-0.042
Token Tim
Token ID5045
Feature activation-1.42e-01

Pos logit contributions
 Woo+0.002
burn+0.002
Pe+0.002
T+0.002
 nurses+0.002
Neg logit contributions
aces-0.003
 hind-0.003
ags-0.003
endant-0.003
sed-0.003
Tokenmy
Token ID1820
Feature activation-2.96e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
T+0.003
 lam+0.003
Neg logit contributions
aces-0.005
ags-0.005
 hind-0.005
endant-0.005
sed-0.005
Token was
Token ID373
Feature activation-1.41e-01

Pos logit contributions
burn+0.008
 Woo+0.007
 nurses+0.007
T+0.007
 lam+0.007
Neg logit contributions
ags-0.009
aces-0.009
 hind-0.009
endant-0.008
ses-0.008
Token the
Token ID262
Feature activation+4.18e-01

Pos logit contributions
 Woo+0.020
burn+0.019
 nurses+0.018
T+0.016
Pe+0.015
Neg logit contributions
 hind-0.037
aces-0.036
sed-0.034
ags-0.034
endant-0.034
Token bread
Token ID8509
Feature activation+1.50e-01

Pos logit contributions
aces+0.010
 hind+0.010
ags+0.010
endant+0.009
ushed+0.009
Neg logit contributions
burn-0.013
 nurses-0.012
 Woo-0.012
T-0.011
 lam-0.011
Token in
Token ID287
Feature activation+7.92e-01

Pos logit contributions
 hind+0.006
ses+0.005
ags+0.005
sed+0.005
endant+0.005
Neg logit contributions
burn-0.003
 Woo-0.002
 any-0.002
 nurses-0.002
 lamb-0.002
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
aces+0.031
 hind+0.029
endant+0.029
sed+0.029
ags+0.028
Neg logit contributions
 Woo-0.015
burn-0.014
 nurses-0.013
T-0.013
Pe-0.012
Token to
Token ID284
Feature activation-4.42e-01

Pos logit contributions
burn+0.009
 nurses+0.009
 Woo+0.009
 paramed+0.008
 lam+0.008
Neg logit contributions
aces-0.008
 hind-0.007
endant-0.007
ushed-0.007
sed-0.007
Tokenaster
Token ID1603
Feature activation-3.75e+00

Pos logit contributions
 Woo+0.007
burn+0.007
 nurses+0.006
T+0.006
Pe+0.006
Neg logit contributions
aces-0.018
 hind-0.017
sed-0.017
endant-0.017
ushed-0.017
Token and
Token ID290
Feature activation-6.79e-01

Pos logit contributions
burn+0.071
 Woo+0.070
 nurses+0.062
 lam+0.056
T+0.056
Neg logit contributions
 hind-0.137
endant-0.134
ags-0.134
efully-0.130
aces-0.128
Token pushed
Token ID7121
Feature activation-1.27e+00

Pos logit contributions
 nurses+0.017
burn+0.017
 Woo+0.017
T+0.014
 lam+0.014
Neg logit contributions
ags-0.023
aces-0.023
 hind-0.022
agged-0.022
endant-0.020
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
burn+0.026
 Woo+0.025
 nurses+0.023
Pe+0.021
T+0.021
Neg logit contributions
aces-0.045
 hind-0.044
ags-0.043
sed-0.040
endant-0.040
Token button
Token ID4936
Feature activation-1.30e+00

Pos logit contributions
aces+0.001
 hind+0.001
ags+0.001
endant+0.001
ushed+0.001
Neg logit contributions
burn-0.002
 nurses-0.002
 Woo-0.002
 lam-0.002
T-0.002
Token down
Token ID866
Feature activation-2.15e-01

Pos logit contributions
burn+0.031
 Woo+0.030
 nurses+0.028
T+0.027
Pe+0.026
Neg logit contributions
aces-0.041
 hind-0.040
ags-0.039
sed-0.037
endant-0.037
Token,
Token ID11
Feature activation+5.00e-01

Pos logit contributions
aces+0.043
 hind+0.041
ags+0.040
sed+0.038
endant+0.038
Neg logit contributions
burn-0.036
 Woo-0.036
 nurses-0.033
Pe-0.032
T-0.032
Token so
Token ID523
Feature activation-4.58e-03

Pos logit contributions
aces+0.012
ags+0.012
 hind+0.012
endant+0.011
sed+0.011
Neg logit contributions
 Woo-0.016
burn-0.015
 nurses-0.015
T-0.014
 lam-0.014
Token she
Token ID673
Feature activation+6.01e-01

Pos logit contributions
burn+0.000
 Woo+0.000
Pe+0.000
 nurses+0.000
 paramed+0.000
Neg logit contributions
 hind-0.000
aces-0.000
efully-0.000
ushed-0.000
sed-0.000
Token went
Token ID1816
Feature activation-3.15e-01

Pos logit contributions
ags+0.021
 hind+0.019
aces+0.019
ses+0.018
 ey+0.018
Neg logit contributions
 Woo-0.014
burn-0.013
T-0.013
 nurses-0.013
 lam-0.012
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
burn+0.009
 Woo+0.009
 nurses+0.008
Pe+0.008
T+0.008
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.008
amed-0.007
sed-0.007
Token Farmer
Token ID37755
Feature activation-3.71e+00

Pos logit contributions
aces+0.007
 hind+0.007
ags+0.007
endant+0.006
sed+0.006
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
T-0.003
Pe-0.003
Token Brown
Token ID4373
Feature activation-2.55e+00

Pos logit contributions
 Woo+0.107
burn+0.107
 nurses+0.099
Pe+0.098
 paramed+0.094
Neg logit contributions
aces-0.092
 hind-0.090
 hips-0.086
sed-0.086
ses-0.079
Token's
Token ID338
Feature activation-1.79e+00

Pos logit contributions
 Woo+0.071
burn+0.066
Pe+0.065
 paramed+0.060
 nurses+0.060
Neg logit contributions
 hind-0.067
aces-0.067
ags-0.062
 hips-0.059
efully-0.058
Token house
Token ID2156
Feature activation-9.10e-01

Pos logit contributions
burn+0.040
 nurses+0.040
 Woo+0.038
 paramed+0.035
 lam+0.035
Neg logit contributions
aces-0.060
agged-0.056
ags-0.055
ushed-0.055
endant-0.055
Token.
Token ID13
Feature activation-5.59e-01

Pos logit contributions
burn+0.018
 Woo+0.018
 nurses+0.017
T+0.015
 lam+0.015
Neg logit contributions
aces-0.031
 hind-0.031
ags-0.031
endant-0.030
sed-0.030
Token When
Token ID1649
Feature activation-1.31e+00

Pos logit contributions
 Woo+0.015
burn+0.015
 nurses+0.014
T+0.014
Pe+0.013
Neg logit contributions
aces-0.016
ags-0.016
 hind-0.016
endant-0.015
sed-0.015
Token toast
Token ID27805
Feature activation-6.40e-01

Pos logit contributions
burn+0.026
 nurses+0.024
 Woo+0.024
 lam+0.023
 lamb+0.022
Neg logit contributions
aces-0.021
 hind-0.020
ags-0.020
ushed-0.019
endant-0.019
Token out
Token ID503
Feature activation-6.05e-02

Pos logit contributions
burn+0.013
 Woo+0.012
 nurses+0.010
 lam+0.010
T+0.010
Neg logit contributions
 hind-0.023
aces-0.023
ags-0.022
endant-0.022
sed-0.022
Token of
Token ID286
Feature activation-1.68e-01

Pos logit contributions
burn+0.001
 Woo+0.001
 nurses+0.001
 lam+0.001
 any+0.001
Neg logit contributions
sed-0.002
aces-0.002
 hind-0.002
ses-0.002
ags-0.002
Token the
Token ID262
Feature activation-7.54e-01

Pos logit contributions
 Woo+0.002
burn+0.002
 them+0.002
 nurses+0.002
 one+0.002
Neg logit contributions
aces-0.008
ushed-0.007
endant-0.007
sed-0.007
 hind-0.007
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
burn+0.022
 nurses+0.021
 lam+0.020
 lamb+0.020
 Woo+0.019
Neg logit contributions
aces-0.021
ushed-0.020
 hind-0.019
agged-0.018
endant-0.018
Tokenaster
Token ID1603
Feature activation-3.69e+00

Pos logit contributions
 Woo+0.011
burn+0.011
 nurses+0.009
Pe+0.008
T+0.008
Neg logit contributions
aces-0.047
 hind-0.044
sed-0.044
endant-0.044
ushed-0.044
Token and
Token ID290
Feature activation-7.97e-01

Pos logit contributions
burn+0.059
 Woo+0.051
 nurses+0.046
 lam+0.046
 any+0.043
Neg logit contributions
 hind-0.145
ags-0.142
endant-0.141
agged-0.138
aces-0.137
Token put
Token ID1234
Feature activation-1.74e+00

Pos logit contributions
 nurses+0.018
 Woo+0.017
burn+0.017
 they+0.015
 lam+0.015
Neg logit contributions
aces-0.030
ags-0.030
agged-0.028
 hind-0.028
asp-0.026
Token it
Token ID340
Feature activation-6.76e-01

Pos logit contributions
 Woo+0.026
burn+0.026
 nurses+0.022
T+0.020
 lam+0.019
Neg logit contributions
 hind-0.072
aces-0.071
ags-0.067
sed-0.067
endant-0.067
Token on
Token ID319
Feature activation-2.96e-01

Pos logit contributions
burn+0.012
 any+0.009
 Woo+0.009
 one+0.008
 nurses+0.007
Neg logit contributions
 hind-0.027
ses-0.026
sed-0.026
 ey-0.026
endant-0.025
Token a
Token ID257
Feature activation-7.23e-01

Pos logit contributions
 Woo+0.006
burn+0.006
 nurses+0.005
T+0.005
Pe+0.005
Neg logit contributions
aces-0.011
 hind-0.010
ags-0.010
endant-0.010
sed-0.010
Token put
Token ID1234
Feature activation-1.06e+00

Pos logit contributions
ags+0.015
 hind+0.015
aces+0.015
endant+0.014
agged+0.013
Neg logit contributions
 Woo-0.010
 nurses-0.010
burn-0.009
T-0.008
 they-0.008
Token bread
Token ID8509
Feature activation-5.32e-01

Pos logit contributions
 Woo+0.021
burn+0.021
 nurses+0.019
T+0.018
 lam+0.017
Neg logit contributions
 hind-0.038
aces-0.038
sed-0.036
endant-0.036
ags-0.035
Token in
Token ID287
Feature activation+1.51e+00

Pos logit contributions
burn+0.010
 Woo+0.010
T+0.009
 nurses+0.009
 them+0.009
Neg logit contributions
 hind-0.019
sed-0.018
ses-0.017
endant-0.017
ags-0.017
Token the
Token ID262
Feature activation-9.45e-02

Pos logit contributions
aces+0.054
 hind+0.053
ags+0.052
sed+0.051
endant+0.050
Neg logit contributions
 Woo-0.030
burn-0.030
 nurses-0.027
T-0.026
Pe-0.025
Token to
Token ID284
Feature activation+2.26e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
 lam+0.003
 lamb+0.003
Neg logit contributions
aces-0.002
 hind-0.002
ushed-0.002
endant-0.002
ags-0.002
Tokenaster
Token ID1603
Feature activation-3.68e+00

Pos logit contributions
aces+0.010
 hind+0.009
sed+0.009
endant+0.009
ushed+0.009
Neg logit contributions
 Woo-0.003
burn-0.003
 nurses-0.003
T-0.003
Pe-0.003
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
burn+0.048
 Woo+0.045
 nurses+0.040
 one+0.040
 any+0.039
Neg logit contributions
 hind-0.157
endant-0.149
 ey-0.147
ags-0.146
sed-0.144
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
 hind+0.011
aces+0.011
sed+0.010
endant+0.010
ags+0.010
Neg logit contributions
 Woo-0.007
burn-0.006
Pe-0.006
T-0.006
 nurses-0.005
Token
Token ID198
Feature activation-4.89e-01

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.007
T+0.007
Pe+0.007
Neg logit contributions
aces-0.009
 hind-0.009
ags-0.008
sed-0.008
endant-0.008
TokenTom
Token ID13787
Feature activation-5.43e-01

Pos logit contributions
burn+0.012
 Woo+0.012
 nurses+0.012
 lamb+0.011
 lam+0.011
Neg logit contributions
aces-0.014
 hind-0.014
ags-0.014
amed-0.013
agged-0.013
Token waited
Token ID13488
Feature activation+2.05e-01

Pos logit contributions
T+0.011
burn+0.010
 nurses+0.010
eb+0.009
 one+0.009
Neg logit contributions
ags-0.022
 hind-0.021
ses-0.019
endant-0.019
agged-0.019
Token put
Token ID1234
Feature activation-8.14e-01

Pos logit contributions
burn+0.008
 nurses+0.008
T+0.008
where+0.007
 Woo+0.007
Neg logit contributions
 hind-0.013
ags-0.013
agged-0.012
aces-0.011
amed-0.011
Token it
Token ID340
Feature activation+1.89e-01

Pos logit contributions
 Woo+0.011
burn+0.011
 nurses+0.009
T+0.009
 any+0.009
Neg logit contributions
 hind-0.035
sed-0.033
aces-0.033
ses-0.033
endant-0.032
Token in
Token ID287
Feature activation+8.74e-01

Pos logit contributions
ses+0.007
 hind+0.007
sed+0.007
endant+0.006
 ey+0.006
Neg logit contributions
burn-0.004
 any-0.003
 Woo-0.003
 nurses-0.003
 one-0.003
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
aces+0.033
 hind+0.031
sed+0.031
endant+0.030
ses+0.030
Neg logit contributions
 Woo-0.017
burn-0.016
T-0.015
 nurses-0.015
where-0.014
Token to
Token ID284
Feature activation-1.56e-02

Pos logit contributions
burn+0.008
 nurses+0.008
 Woo+0.008
 lam+0.008
 lamb+0.008
Neg logit contributions
aces-0.008
 hind-0.008
ushed-0.007
endant-0.007
agged-0.007
Tokenaster
Token ID1603
Feature activation-3.67e+00

Pos logit contributions
 Woo+0.000
burn+0.000
 nurses+0.000
T+0.000
 any+0.000
Neg logit contributions
aces-0.001
sed-0.001
 hind-0.001
endant-0.001
ushed-0.001
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
burn+0.072
 Woo+0.070
 nurses+0.065
 any+0.061
 lam+0.061
Neg logit contributions
 hind-0.132
endant-0.126
ags-0.124
sed-0.124
ses-0.123
Token
Token ID198
Feature activation-4.29e-01

Pos logit contributions
 hind+0.015
aces+0.015
sed+0.014
ags+0.014
amed+0.014
Neg logit contributions
 Woo-0.008
burn-0.008
 nurses-0.007
T-0.007
Sim-0.007
Token
Token ID198
Feature activation-7.34e-01

Pos logit contributions
 Woo+0.009
burn+0.009
T+0.009
Pe+0.008
 nurses+0.008
Neg logit contributions
aces-0.015
 hind-0.014
sed-0.014
ags-0.014
ses-0.014
TokenOne
Token ID3198
Feature activation+6.37e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.009
Pe+0.009
Neg logit contributions
aces-0.030
 hind-0.029
ags-0.029
sed-0.028
endant-0.028
Token day
Token ID1110
Feature activation-9.59e-01

Pos logit contributions
aces+0.020
 hind+0.020
ags+0.020
endant+0.019
sed+0.019
Neg logit contributions
burn-0.015
 Woo-0.015
 nurses-0.014
T-0.013
 lam-0.013
Token puts
Token ID7584
Feature activation-1.05e+00

Pos logit contributions
 hind+0.012
ags+0.011
aces+0.011
amed+0.010
agged+0.010
Neg logit contributions
 Woo-0.016
 nurses-0.015
burn-0.015
T-0.015
 lam-0.014
Token bread
Token ID8509
Feature activation-2.43e-01

Pos logit contributions
burn+0.023
 Woo+0.023
 nurses+0.021
T+0.020
Pe+0.019
Neg logit contributions
 hind-0.035
aces-0.035
ags-0.033
sed-0.033
endant-0.033
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
burn+0.005
 Woo+0.004
T+0.004
 lam+0.004
 nurses+0.004
Neg logit contributions
 hind-0.009
ags-0.008
ses-0.008
ieved-0.008
aces-0.008
Token the
Token ID262
Feature activation-2.61e-01

Pos logit contributions
aces+0.057
 hind+0.054
sed+0.054
ags+0.053
endant+0.052
Neg logit contributions
 Woo-0.023
burn-0.022
T-0.020
 nurses-0.020
Pe-0.019
Token to
Token ID284
Feature activation-1.41e-01

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.008
 lam+0.007
 lamb+0.007
Neg logit contributions
aces-0.006
 hind-0.006
ushed-0.006
endant-0.006
ags-0.006
Tokenaster
Token ID1603
Feature activation-3.65e+00

Pos logit contributions
 Woo+0.002
burn+0.002
 nurses+0.002
T+0.002
Pe+0.002
Neg logit contributions
aces-0.006
 hind-0.005
sed-0.005
endant-0.005
ags-0.005
Token and
Token ID290
Feature activation-1.75e-01

Pos logit contributions
burn+0.067
 Woo+0.065
 nurses+0.059
 lam+0.055
T+0.052
Neg logit contributions
 hind-0.133
endant-0.129
ags-0.127
efully-0.125
sed-0.124
Token waits
Token ID28364
Feature activation-1.44e-01

Pos logit contributions
 nurses+0.004
 Woo+0.004
burn+0.004
 lam+0.003
 any+0.003
Neg logit contributions
aces-0.006
 hind-0.006
ags-0.006
amed-0.006
agged-0.006
Token for
Token ID329
Feature activation+1.73e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
T+0.003
 lam+0.003
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
endant-0.005
sed-0.005
Token it
Token ID340
Feature activation-8.55e-01

Pos logit contributions
aces+0.006
 hind+0.006
ags+0.006
sed+0.005
endant+0.005
Neg logit contributions
 Woo-0.004
burn-0.004
 nurses-0.003
T-0.003
Pe-0.003
Token to
Token ID284
Feature activation-5.08e-01

Pos logit contributions
burn+0.013
 Woo+0.011
 any+0.010
 nurses+0.009
T+0.009
Neg logit contributions
 hind-0.037
 ey-0.036
ags-0.035
endant-0.034
efully-0.033
Token the
Token ID262
Feature activation-7.35e-01

Pos logit contributions
 Woo+0.001
 any+0.001
burn+0.001
 nurses+0.001
 them+0.001
Neg logit contributions
aces-0.003
ushed-0.003
amed-0.003
agged-0.003
 hind-0.003
Token stove
Token ID27635
Feature activation-4.67e-01

Pos logit contributions
burn+0.017
 nurses+0.017
 Woo+0.017
 lamb+0.015
 lam+0.015
Neg logit contributions
aces-0.024
 hind-0.023
ags-0.022
ushed-0.022
agged-0.022
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
burn+0.009
 Woo+0.008
 nurses+0.008
 any+0.007
 lam+0.007
Neg logit contributions
 hind-0.018
ags-0.017
aces-0.017
sed-0.016
amed-0.016
Token the
Token ID262
Feature activation-6.97e-01

Pos logit contributions
 nurses+0.005
 Woo+0.004
 any+0.003
 they+0.003
burn+0.003
Neg logit contributions
aces-0.015
ags-0.014
agged-0.014
 hind-0.014
ushed-0.014
Token to
Token ID284
Feature activation-5.91e-01

Pos logit contributions
 nurses+0.018
burn+0.018
 Woo+0.017
 lam+0.016
 lamb+0.016
Neg logit contributions
aces-0.022
 hind-0.020
ushed-0.020
agged-0.020
ags-0.020
Tokenaster
Token ID1603
Feature activation-3.63e+00

Pos logit contributions
 Woo+0.013
burn+0.013
 nurses+0.012
T+0.011
Pe+0.011
Neg logit contributions
aces-0.021
 hind-0.020
ags-0.019
endant-0.019
ushed-0.019
Token,"
Token ID553
Feature activation+6.09e-02

Pos logit contributions
burn+0.066
 Woo+0.064
 nurses+0.060
 any+0.056
 lam+0.055
Neg logit contributions
 hind-0.137
ags-0.129
aces-0.127
ushed-0.127
endant-0.127
Token Tom
Token ID4186
Feature activation-2.25e-02

Pos logit contributions
aces+0.002
 hind+0.002
ags+0.002
endant+0.002
sed+0.002
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
T-0.001
Pe-0.001
Token says
Token ID1139
Feature activation-4.10e-01

Pos logit contributions
 Woo+0.001
burn+0.001
 nurses+0.001
 lam+0.001
T+0.001
Neg logit contributions
 hind-0.000
aces-0.000
ags-0.000
endant-0.000
sed-0.000
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
Pe+0.010
T+0.010
Neg logit contributions
aces-0.012
 hind-0.011
ags-0.011
sed-0.011
ushed-0.010
Token
Token ID198
Feature activation-3.78e-01

Pos logit contributions
 Woo+0.000
burn+0.000
T+0.000
 nurses+0.000
Pe+0.000
Neg logit contributions
aces-0.001
 hind-0.001
ags-0.001
endant-0.001
sed-0.001
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
burn+0.006
 Woo+0.006
 nurses+0.006
T+0.006
Pe+0.005
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.008
sed-0.007
endant-0.007
Token the
Token ID262
Feature activation+6.82e-01

Pos logit contributions
 Woo+0.004
burn+0.004
 nurses+0.004
 lam+0.003
T+0.003
Neg logit contributions
aces-0.006
 hind-0.006
ags-0.006
endant-0.006
sed-0.006
Token wind
Token ID2344
Feature activation+3.76e-01

Pos logit contributions
aces+0.023
 hind+0.022
ags+0.021
ushed+0.021
agged+0.021
Neg logit contributions
burn-0.015
 lam-0.015
 Woo-0.015
 lamb-0.015
 nurses-0.014
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
 hind+0.013
aces+0.013
ags+0.012
endant+0.011
sed+0.011
Neg logit contributions
burn-0.008
 Woo-0.008
 nurses-0.007
 lam-0.007
T-0.007
Token the
Token ID262
Feature activation+5.29e-01

Pos logit contributions
burn+0.006
 Woo+0.006
 nurses+0.005
 lam+0.005
T+0.005
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.008
endant-0.008
sed-0.008
Token qu
Token ID627
Feature activation+2.36e+00

Pos logit contributions
aces+0.018
 hind+0.017
ushed+0.017
agged+0.016
ags+0.016
Neg logit contributions
 lamb-0.012
burn-0.012
 lam-0.012
 nurses-0.011
 Woo-0.011
Tokenack
Token ID441
Feature activation+2.43e-01

Pos logit contributions
 hind+0.086
aces+0.082
sed+0.079
ses+0.077
endant+0.075
Neg logit contributions
burn-0.049
 Woo-0.044
T-0.039
 lam-0.039
Pe-0.039
Token and
Token ID290
Feature activation+4.65e-01

Pos logit contributions
 hind+0.009
aces+0.008
ags+0.008
endant+0.008
sed+0.008
Neg logit contributions
burn-0.005
 Woo-0.005
 nurses-0.004
 lam-0.004
T-0.004
Token the
Token ID262
Feature activation+3.49e-01

Pos logit contributions
aces+0.019
ags+0.019
 hind+0.018
agged+0.018
ushed+0.018
Neg logit contributions
 Woo-0.007
burn-0.007
 nurses-0.007
 lam-0.006
 lamb-0.006
Token laugh
Token ID6487
Feature activation+1.39e+00

Pos logit contributions
aces+0.011
 hind+0.011
ushed+0.011
ags+0.011
agged+0.010
Neg logit contributions
burn-0.008
 lam-0.008
 Woo-0.008
 lamb-0.008
 nurses-0.008
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
 hind+0.049
aces+0.047
ags+0.046
endant+0.045
sed+0.045
Neg logit contributions
burn-0.029
 Woo-0.029
 nurses-0.026
T-0.025
 lam-0.025
Token healthy
Token ID5448
Feature activation-5.12e-01

Pos logit contributions
 Woo+0.006
burn+0.005
 nurses+0.005
 any+0.005
 paramed+0.005
Neg logit contributions
aces-0.006
 hind-0.006
ags-0.006
ses-0.006
sed-0.006
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.011
T+0.010
 lam+0.010
Neg logit contributions
 hind-0.017
aces-0.017
ags-0.016
endant-0.016
sed-0.016
Token cheap
Token ID7026
Feature activation+1.24e-03

Pos logit contributions
ags+0.009
aces+0.009
 hind+0.008
agged+0.008
sed+0.008
Neg logit contributions
 Woo-0.010
 nurses-0.010
burn-0.010
 lam-0.009
T-0.009
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
 hind+0.000
ags+0.000
aces+0.000
amed+0.000
agged+0.000
Neg logit contributions
 Woo-0.000
burn-0.000
 nurses-0.000
 lam-0.000
 any-0.000
Token The
Token ID383
Feature activation-7.16e-02

Pos logit contributions
 Woo+0.005
burn+0.005
 nurses+0.005
T+0.004
Pe+0.004
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
sed-0.005
endant-0.005
Token both
Token ID1111
Feature activation+2.42e+00

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
T+0.002
 lam+0.001
Neg logit contributions
aces-0.002
 hind-0.002
ags-0.002
endant-0.002
sed-0.002
Token felt
Token ID2936
Feature activation+2.83e-01

Pos logit contributions
ags+0.082
aces+0.081
 hind+0.079
agged+0.076
endant+0.075
Neg logit contributions
 nurses-0.055
burn-0.055
 Woo-0.052
 lamb-0.047
 lam-0.047
Token better
Token ID1365
Feature activation-1.43e-01

Pos logit contributions
aces+0.008
 hind+0.008
ags+0.008
sed+0.008
ses+0.008
Neg logit contributions
 Woo-0.008
 nurses-0.007
burn-0.007
 any-0.007
T-0.007
Token knowing
Token ID6970
Feature activation+3.72e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.003
T+0.003
Pe+0.003
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.004
sed-0.004
endant-0.004
Token they
Token ID484
Feature activation-9.33e-02

Pos logit contributions
aces+0.014
 hind+0.013
ags+0.013
endant+0.013
sed+0.012
Neg logit contributions
burn-0.007
 Woo-0.006
 nurses-0.006
Pe-0.006
T-0.006
Token were
Token ID547
Feature activation-1.12e-01

Pos logit contributions
 nurses+0.002
 Woo+0.002
burn+0.002
T+0.002
where+0.002
Neg logit contributions
 hind-0.003
ags-0.003
aces-0.003
ses-0.003
sed-0.003
Token!
Token ID0
Feature activation+6.93e-01

Pos logit contributions
aces+0.026
 hind+0.025
ags+0.024
sed+0.023
endant+0.023
Neg logit contributions
burn-0.022
 Woo-0.022
 nurses-0.021
Pe-0.020
T-0.019
Token He
Token ID679
Feature activation+5.28e-01

Pos logit contributions
aces+0.020
 hind+0.020
ags+0.019
endant+0.019
sed+0.018
Neg logit contributions
 Woo-0.018
burn-0.017
 nurses-0.016
T-0.016
Pe-0.016
Token sw
Token ID1509
Feature activation+7.07e-01

Pos logit contributions
ags+0.018
aces+0.018
 hind+0.016
endant+0.016
agged+0.015
Neg logit contributions
 Woo-0.013
burn-0.012
 nurses-0.012
T-0.012
 lam-0.011
Tokenam
Token ID321
Feature activation+7.96e-01

Pos logit contributions
 hind+0.017
aces+0.017
igator+0.015
endant+0.015
ses+0.015
Neg logit contributions
burn-0.021
 Woo-0.020
eb-0.018
T-0.018
Pe-0.018
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
aces+0.023
 hind+0.022
ags+0.022
endant+0.021
sed+0.020
Neg logit contributions
 Woo-0.021
burn-0.021
 nurses-0.020
T-0.019
Pe-0.019
Token whole
Token ID2187
Feature activation+2.29e+00

Pos logit contributions
 hind+0.037
aces+0.037
ags+0.035
sed+0.033
endant+0.032
Neg logit contributions
burn-0.041
 Woo-0.041
 nurses-0.039
T-0.038
Pe-0.037
Token day
Token ID1110
Feature activation+1.66e-01

Pos logit contributions
aces+0.060
 hind+0.059
ags+0.057
sed+0.053
endant+0.053
Neg logit contributions
burn-0.066
 Woo-0.066
 nurses-0.062
T-0.060
Pe-0.059
Token,
Token ID11
Feature activation+4.17e-01

Pos logit contributions
aces+0.005
 hind+0.005
ags+0.005
sed+0.004
endant+0.004
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
T-0.004
Pe-0.004
Token until
Token ID1566
Feature activation-3.06e-02

Pos logit contributions
aces+0.012
 hind+0.012
ags+0.011
sed+0.011
endant+0.011
Neg logit contributions
burn-0.011
 Woo-0.011
 nurses-0.010
T-0.010
Pe-0.010
Token he
Token ID339
Feature activation+1.96e-01

Pos logit contributions
burn+0.001
 nurses+0.001
 lam+0.001
 paramed+0.001
T+0.001
Neg logit contributions
 hind-0.001
 done-0.001
aces-0.001
sed-0.001
ags-0.001
Token was
Token ID373
Feature activation-9.76e-02

Pos logit contributions
aces+0.005
 hind+0.005
ags+0.005
sed+0.005
endant+0.005
Neg logit contributions
burn-0.006
 Woo-0.005
 nurses-0.005
T-0.005
Pe-0.005
Token we
Token ID356
Feature activation+1.19e+00

Pos logit contributions
burn+0.022
 Woo+0.021
 nurses+0.021
 paramed+0.020
T+0.020
Neg logit contributions
aces-0.019
 hind-0.019
sed-0.018
ags-0.017
endant-0.017
Token come
Token ID1282
Feature activation+7.28e-01

Pos logit contributions
aces+0.038
 hind+0.038
ags+0.037
sed+0.035
endant+0.035
Neg logit contributions
burn-0.027
 Woo-0.026
 nurses-0.025
T-0.023
Pe-0.023
Token back
Token ID736
Feature activation+1.74e+00

Pos logit contributions
aces+0.018
 hind+0.018
ags+0.017
endant+0.016
sed+0.016
Neg logit contributions
 Woo-0.022
burn-0.022
 nurses-0.021
T-0.020
Pe-0.020
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
aces+0.067
 hind+0.062
sed+0.061
ushed+0.060
ags+0.060
Neg logit contributions
 Woo-0.036
burn-0.035
 nurses-0.031
T-0.029
where-0.029
Token the
Token ID262
Feature activation+3.49e-01

Pos logit contributions
aces+0.049
 hind+0.049
ags+0.048
sed+0.045
endant+0.045
Neg logit contributions
burn-0.033
 Woo-0.033
 nurses-0.031
T-0.029
Pe-0.029
Token reef
Token ID25088
Feature activation+2.55e+00

Pos logit contributions
aces+0.010
 hind+0.010
ags+0.010
sed+0.010
endant+0.010
Neg logit contributions
burn-0.009
 Woo-0.009
 nurses-0.008
T-0.008
Pe-0.008
Token another
Token ID1194
Feature activation+1.03e+00

Pos logit contributions
aces+0.094
 hind+0.086
ags+0.086
agged+0.085
sed+0.085
Neg logit contributions
 Woo-0.051
burn-0.050
where-0.050
 nurses-0.050
 any-0.048
Token day
Token ID1110
Feature activation+9.04e-01

Pos logit contributions
aces+0.036
ags+0.035
 hind+0.034
amed+0.032
agged+0.032
Neg logit contributions
burn-0.022
 Woo-0.022
 nurses-0.021
 lamb-0.019
 lam-0.019
Token?
Token ID30
Feature activation-1.39e+00

Pos logit contributions
 hind+0.028
aces+0.028
ushed+0.026
ags+0.026
sed+0.026
Neg logit contributions
 Woo-0.023
burn-0.022
 nurses-0.021
 any-0.020
where-0.020
Token I
Token ID314
Feature activation-3.94e-01

Pos logit contributions
 Woo+0.038
burn+0.037
 nurses+0.035
T+0.034
Pe+0.033
Neg logit contributions
aces-0.040
 hind-0.039
ags-0.038
endant-0.035
sed-0.035
Token want
Token ID765
Feature activation+4.16e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.010
Pe+0.010
Neg logit contributions
aces-0.011
 hind-0.011
ags-0.011
endant-0.010
sed-0.010
Token So
Token ID1406
Feature activation+7.65e-01

Pos logit contributions
aces+0.012
 hind+0.012
ags+0.012
sed+0.011
endant+0.011
Neg logit contributions
 Woo-0.007
T-0.007
 nurses-0.006
burn-0.006
where-0.006
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
aces+0.026
 hind+0.026
ags+0.025
sed+0.024
endant+0.024
Neg logit contributions
burn-0.016
 Woo-0.016
 nurses-0.014
Pe-0.013
T-0.013
Token Annie
Token ID25860
Feature activation+3.98e-01

Pos logit contributions
aces+0.053
 hind+0.052
ags+0.050
endant+0.048
sed+0.048
Neg logit contributions
burn-0.036
 Woo-0.035
 nurses-0.032
Pe-0.030
T-0.030
Token promised
Token ID8072
Feature activation+6.07e-01

Pos logit contributions
ags+0.012
 hind+0.012
aces+0.011
endant+0.011
ses+0.011
Neg logit contributions
burn-0.010
 Woo-0.010
 nurses-0.010
T-0.010
 lam-0.009
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
aces+0.014
 hind+0.014
ags+0.014
 done+0.013
 myself+0.013
Neg logit contributions
burn-0.019
 Woo-0.017
 nurses-0.017
 paramed-0.017
 lam-0.016
Token jelly
Token ID27644
Feature activation+2.25e+00

Pos logit contributions
aces+0.049
 hind+0.048
ags+0.046
sed+0.044
endant+0.043
Neg logit contributions
burn-0.062
 Woo-0.060
 nurses-0.058
T-0.054
Pe-0.054
Tokenfish
Token ID11084
Feature activation+1.98e+00

Pos logit contributions
ags+0.066
 hind+0.066
aces+0.064
sed+0.060
ses+0.060
Neg logit contributions
 Woo-0.057
burn-0.054
T-0.053
 nurses-0.052
where-0.052
Token that
Token ID326
Feature activation-1.04e-02

Pos logit contributions
 hind+0.070
ags+0.069
aces+0.066
sed+0.064
ses+0.064
Neg logit contributions
 Woo-0.039
 nurses-0.038
where-0.036
burn-0.036
 any-0.035
Token she
Token ID673
Feature activation+4.40e-01

Pos logit contributions
burn+0.000
 Woo+0.000
 nurses+0.000
T+0.000
Pe+0.000
Neg logit contributions
aces-0.000
 hind-0.000
ags-0.000
sed-0.000
endant-0.000
Token'd
Token ID1549
Feature activation+1.72e+00

Pos logit contributions
ags+0.018
 hind+0.017
agged+0.016
aces+0.016
 ey+0.015
Neg logit contributions
 nurses-0.007
 Woo-0.007
burn-0.007
T-0.007
 lamb-0.006
Token take
Token ID1011
Feature activation-5.80e-01

Pos logit contributions
aces+0.060
 hind+0.060
ags+0.057
endant+0.055
ushed+0.055
Neg logit contributions
 Woo-0.036
burn-0.034
 nurses-0.033
T-0.031
 any-0.030
Token request
Token ID2581
Feature activation-1.22e-01

Pos logit contributions
 Woo+0.028
burn+0.028
 nurses+0.026
T+0.025
Pe+0.025
Neg logit contributions
 hind-0.043
aces-0.039
sed-0.039
ags-0.038
endant-0.038
Token and
Token ID290
Feature activation-6.51e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.002
T+0.002
Pe+0.002
Neg logit contributions
 hind-0.004
aces-0.004
ags-0.004
endant-0.004
sed-0.004
Token her
Token ID607
Feature activation+3.16e-01

Pos logit contributions
 Woo+0.012
burn+0.012
 nurses+0.011
T+0.010
 lam+0.010
Neg logit contributions
aces-0.025
 hind-0.024
ags-0.024
endant-0.023
sed-0.022
Token mom
Token ID1995
Feature activation-1.03e-01

Pos logit contributions
aces+0.012
ags+0.012
 hind+0.012
agged+0.011
endant+0.011
Neg logit contributions
burn-0.006
 nurses-0.006
 Woo-0.006
 paramed-0.005
T-0.005
Token said
Token ID531
Feature activation-6.33e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.003
T+0.003
 lam+0.003
Neg logit contributions
 hind-0.002
aces-0.002
ags-0.002
sed-0.002
endant-0.002
Token yes
Token ID3763
Feature activation+1.84e+00

Pos logit contributions
burn+0.017
 Woo+0.016
 nurses+0.015
Pe+0.014
T+0.014
Neg logit contributions
aces-0.019
sed-0.017
 hind-0.017
endant-0.016
ags-0.016
Token!
Token ID0
Feature activation+9.74e-02

Pos logit contributions
aces+0.050
 hind+0.049
ags+0.046
sed+0.044
endant+0.044
Neg logit contributions
burn-0.052
 Woo-0.051
 nurses-0.049
 lam-0.046
Pe-0.045
Token The
Token ID383
Feature activation-7.15e-01

Pos logit contributions
aces+0.003
 hind+0.003
ags+0.003
endant+0.003
sed+0.003
Neg logit contributions
 Woo-0.002
burn-0.002
T-0.002
 nurses-0.002
Pe-0.002
Token little
Token ID1310
Feature activation+2.28e-01

Pos logit contributions
 Woo+0.020
burn+0.020
Pe+0.018
eb+0.017
T+0.017
Neg logit contributions
 hind-0.019
aces-0.018
ags-0.018
sed-0.017
amed-0.017
Token girl
Token ID2576
Feature activation+8.14e-01

Pos logit contributions
aces+0.006
ags+0.006
 hind+0.006
amed+0.006
sed+0.005
Neg logit contributions
 Woo-0.006
burn-0.006
Pe-0.006
T-0.006
eb-0.005
Token was
Token ID373
Feature activation-1.93e-01

Pos logit contributions
ags+0.025
 hind+0.024
aces+0.024
endant+0.022
agged+0.022
Neg logit contributions
burn-0.020
 Woo-0.020
 nurses-0.020
T-0.019
 lam-0.018
Token how
Token ID703
Feature activation+3.08e-01

Pos logit contributions
aces+0.021
 hind+0.021
ags+0.020
endant+0.019
sed+0.019
Neg logit contributions
burn-0.018
 Woo-0.018
T-0.017
 nurses-0.017
Pe-0.016
Token it
Token ID340
Feature activation+8.60e-01

Pos logit contributions
aces+0.010
 hind+0.010
ags+0.010
sed+0.009
ses+0.009
Neg logit contributions
 Woo-0.007
burn-0.007
 nurses-0.007
T-0.006
 lam-0.006
Token looked
Token ID3114
Feature activation+1.35e+00

Pos logit contributions
 hind+0.028
ags+0.027
aces+0.025
agged+0.024
amed+0.024
Neg logit contributions
burn-0.022
 Woo-0.021
 nurses-0.021
T-0.018
 lam-0.018
Token,
Token ID11
Feature activation+2.02e-02

Pos logit contributions
 hind+0.046
aces+0.046
ags+0.045
sed+0.043
ses+0.042
Neg logit contributions
 Woo-0.028
burn-0.028
 nurses-0.026
 lam-0.024
T-0.024
Token how
Token ID703
Feature activation+1.90e-01

Pos logit contributions
aces+0.001
 hind+0.001
ags+0.001
sed+0.001
endant+0.001
Neg logit contributions
 Woo-0.000
burn-0.000
 nurses-0.000
T-0.000
Pe-0.000
Token it
Token ID340
Feature activation+1.27e+00

Pos logit contributions
 hind+0.007
aces+0.007
ags+0.006
sed+0.006
ses+0.006
Neg logit contributions
 Woo-0.004
burn-0.004
 nurses-0.004
T-0.003
Pe-0.003
Token smelled
Token ID37609
Feature activation+1.84e+00

Pos logit contributions
 hind+0.035
ags+0.034
aces+0.033
sed+0.031
amed+0.030
Neg logit contributions
burn-0.037
 Woo-0.035
 nurses-0.035
 lam-0.032
T-0.032
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
aces+0.063
 hind+0.063
ags+0.061
sed+0.060
ses+0.059
Neg logit contributions
 Woo-0.039
burn-0.038
 nurses-0.035
T-0.033
 lam-0.033
Token and
Token ID290
Feature activation-4.27e-03

Pos logit contributions
 hind+0.004
aces+0.004
ags+0.003
sed+0.003
endant+0.003
Neg logit contributions
 Woo-0.002
burn-0.002
 nurses-0.002
T-0.002
 lam-0.002
Token how
Token ID703
Feature activation-3.64e-01

Pos logit contributions
 Woo+0.000
 nurses+0.000
burn+0.000
 any+0.000
 lam+0.000
Neg logit contributions
ags-0.000
 hind-0.000
aces-0.000
amed-0.000
agged-0.000
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
 Woo+0.007
burn+0.007
 nurses+0.007
T+0.007
 any+0.006
Neg logit contributions
 hind-0.013
aces-0.013
ags-0.013
sed-0.012
ses-0.012
Token day
Token ID1110
Feature activation-1.28e-02

Pos logit contributions
aces+0.036
 hind+0.036
ags+0.035
sed+0.033
ushed+0.032
Neg logit contributions
burn-0.024
 Woo-0.023
 nurses-0.022
 lam-0.021
 lamb-0.020
Token,
Token ID11
Feature activation+9.83e-01

Pos logit contributions
burn+0.000
 Woo+0.000
 nurses+0.000
T+0.000
Pe+0.000
Neg logit contributions
aces-0.000
 hind-0.000
ags-0.000
sed-0.000
endant-0.000
Token while
Token ID981
Feature activation-1.54e+00

Pos logit contributions
 hind+0.028
aces+0.027
sed+0.026
igator+0.024
ags+0.024
Neg logit contributions
burn-0.025
 nurses-0.021
 Woo-0.021
 paramed-0.020
 lam-0.020
Token she
Token ID673
Feature activation+1.17e+00

Pos logit contributions
burn+0.033
 Woo+0.032
 nurses+0.030
T+0.028
Pe+0.028
Neg logit contributions
aces-0.051
 hind-0.050
ags-0.049
sed-0.048
endant-0.046
Token was
Token ID373
Feature activation-2.97e-01

Pos logit contributions
ags+0.042
 hind+0.041
aces+0.040
ses+0.037
agged+0.037
Neg logit contributions
burn-0.025
 Woo-0.024
T-0.023
where-0.022
 lam-0.022
Token playing
Token ID2712
Feature activation+8.45e-01

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.007
 lam+0.007
T+0.007
Neg logit contributions
aces-0.009
 hind-0.009
ags-0.008
endant-0.008
sed-0.008
Token,
Token ID11
Feature activation+7.03e-01

Pos logit contributions
aces+0.021
ags+0.021
 hind+0.020
amed+0.018
sed+0.018
Neg logit contributions
burn-0.026
 Woo-0.025
T-0.024
 nurses-0.024
 lam-0.023
Token she
Token ID673
Feature activation+9.98e-02

Pos logit contributions
aces+0.019
 hind+0.018
 Amy+0.017
sed+0.017
 mushroom+0.017
Neg logit contributions
burn-0.019
or-0.016
 paramed-0.015
 nurses-0.015
eb-0.015
Token found
Token ID1043
Feature activation+5.44e-01

Pos logit contributions
aces+0.003
 hind+0.003
ags+0.003
endant+0.003
sed+0.003
Neg logit contributions
 Woo-0.003
burn-0.003
 nurses-0.002
T-0.002
Pe-0.002
Token a
Token ID257
Feature activation+6.58e-01

Pos logit contributions
aces+0.015
 hind+0.015
ags+0.014
endant+0.013
sed+0.013
Neg logit contributions
burn-0.015
 Woo-0.015
 nurses-0.014
Pe-0.013
T-0.013
Token shiny
Token ID22441
Feature activation+1.29e+00

Pos logit contributions
aces+0.017
ags+0.016
 hind+0.016
sed+0.015
endant+0.015
Neg logit contributions
burn-0.019
 Woo-0.019
 nurses-0.018
T-0.018
Pe-0.017
Token,
Token ID11
Feature activation+9.35e-01

Pos logit contributions
burn+0.016
 Woo+0.016
Pe+0.015
 nurses+0.015
 paramed+0.014
Neg logit contributions
aces-0.016
 hind-0.016
ags-0.016
sed-0.015
ushed-0.015
Token Lily
Token ID20037
Feature activation-2.99e-01

Pos logit contributions
aces+0.028
 hind+0.027
ags+0.026
sed+0.025
endant+0.025
Neg logit contributions
burn-0.024
 Woo-0.023
 nurses-0.022
T-0.021
Pe-0.021
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
burn+0.009
 Woo+0.008
 nurses+0.008
Pe+0.008
 lam+0.008
Neg logit contributions
aces-0.008
ags-0.007
 hind-0.007
sed-0.007
ushed-0.007
Token Acc
Token ID6366
Feature activation+5.35e-01

Pos logit contributions
burn+0.018
 Woo+0.018
 nurses+0.017
T+0.016
Pe+0.016
Neg logit contributions
aces-0.014
 hind-0.014
ags-0.013
sed-0.012
endant-0.012
Tokenidents
Token ID3231
Feature activation+5.65e-01

Pos logit contributions
aces+0.009
 hind+0.008
sed+0.008
ags+0.008
ushed+0.008
Neg logit contributions
 nurses-0.020
burn-0.020
 Woo-0.020
Pe-0.019
T-0.019
Token happen
Token ID1645
Feature activation+2.61e-01

Pos logit contributions
 hind+0.018
aces+0.018
ags+0.017
sed+0.017
endant+0.017
Neg logit contributions
 Woo-0.013
burn-0.013
 nurses-0.012
T-0.012
Pe-0.011
Token.
Token ID13
Feature activation-6.31e-01

Pos logit contributions
 hind+0.010
aces+0.009
ags+0.009
endant+0.009
sed+0.009
Neg logit contributions
 Woo-0.005
burn-0.005
T-0.004
 nurses-0.004
 any-0.004
Token I
Token ID314
Feature activation-4.54e-01

Pos logit contributions
burn+0.020
 Woo+0.020
 nurses+0.019
T+0.018
Pe+0.018
Neg logit contributions
aces-0.015
 hind-0.015
ags-0.014
endant-0.013
sed-0.013
Token'm
Token ID1101
Feature activation+8.26e-01

Pos logit contributions
burn+0.011
 Woo+0.011
T+0.010
 nurses+0.010
 paramed+0.009
Neg logit contributions
 hind-0.015
aces-0.015
ags-0.014
agged-0.013
endant-0.013
Token glad
Token ID9675
Feature activation+2.07e-01

Pos logit contributions
aces+0.021
 hind+0.020
ags+0.018
endant+0.018
ushed+0.018
Neg logit contributions
 Woo-0.027
burn-0.026
 nurses-0.024
Pe-0.023
T-0.023
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
aces+0.006
 hind+0.006
ags+0.006
sed+0.005
endant+0.005
Neg logit contributions
burn-0.006
 Woo-0.005
 nurses-0.005
Pe-0.005
T-0.005
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
 hind+0.011
ags+0.011
aces+0.011
endant+0.010
 ey+0.010
Neg logit contributions
burn-0.012
 Woo-0.011
 nurses-0.010
 lam-0.010
T-0.010
Token corner
Token ID5228
Feature activation+7.51e-01

Pos logit contributions
 Woo+0.036
burn+0.036
 nurses+0.035
T+0.033
 lam+0.032
Neg logit contributions
aces-0.027
 hind-0.026
ags-0.025
sed-0.024
ushed-0.023
Token of
Token ID286
Feature activation+3.01e-01

Pos logit contributions
 hind+0.028
ags+0.028
aces+0.027
sed+0.026
ses+0.026
Neg logit contributions
burn-0.013
 Woo-0.013
 nurses-0.012
T-0.011
where-0.011
Token the
Token ID262
Feature activation-8.71e-01

Pos logit contributions
aces+0.011
 hind+0.011
ags+0.011
sed+0.010
endant+0.010
Neg logit contributions
 Woo-0.006
burn-0.006
 nurses-0.005
T-0.005
Pe-0.005
Token room
Token ID2119
Feature activation+5.81e-01

Pos logit contributions
 Woo+0.029
 nurses+0.029
burn+0.029
T+0.027
 lam+0.027
Neg logit contributions
aces-0.019
 hind-0.018
ags-0.018
agged-0.017
sed-0.017
Token.
Token ID13
Feature activation+7.91e-02

Pos logit contributions
 hind+0.020
aces+0.020
ags+0.019
sed+0.018
endant+0.018
Neg logit contributions
burn-0.012
 Woo-0.012
 nurses-0.011
T-0.011
Pe-0.010
Token It
Token ID632
Feature activation-1.73e-01

Pos logit contributions
 hind+0.003
aces+0.003
ags+0.003
amed+0.003
sed+0.003
Neg logit contributions
 Woo-0.001
T-0.001
 Th-0.001
 paramed-0.001
Pe-0.001
Token was
Token ID373
Feature activation+3.56e-01

Pos logit contributions
burn+0.004
 Woo+0.004
 nurses+0.004
T+0.004
 lam+0.003
Neg logit contributions
ags-0.006
aces-0.006
 hind-0.006
agged-0.005
asp-0.005
Token the
Token ID262
Feature activation-7.62e-01

Pos logit contributions
aces+0.011
 hind+0.011
ags+0.011
endant+0.010
ushed+0.010
Neg logit contributions
 Woo-0.009
burn-0.009
 nurses-0.008
T-0.008
Pe-0.008
Token lip
Token ID10645
Feature activation+6.45e-01

Pos logit contributions
burn+0.022
 Woo+0.022
 nurses+0.021
 lam+0.020
T+0.020
Neg logit contributions
aces-0.020
 hind-0.020
ags-0.019
endant-0.018
agged-0.018
Token!
Token ID0
Feature activation+1.36e-02

Pos logit contributions
aces+0.018
 hind+0.018
ags+0.017
endant+0.016
sed+0.016
Neg logit contributions
burn-0.017
 Woo-0.017
 nurses-0.016
T-0.015
Pe-0.015
Token silly
Token ID14397
Feature activation-7.99e-01

Pos logit contributions
burn+0.009
 Woo+0.009
 nurses+0.008
T+0.008
 lam+0.008
Neg logit contributions
aces-0.011
ags-0.011
 hind-0.011
endant-0.010
agged-0.010
Token dog
Token ID3290
Feature activation-2.94e-01

Pos logit contributions
burn+0.020
 Woo+0.020
 nurses+0.019
Pe+0.019
T+0.018
Neg logit contributions
aces-0.023
sed-0.023
 hind-0.023
ses-0.022
ags-0.022
Token.
Token ID13
Feature activation-6.49e-01

Pos logit contributions
burn+0.008
 Woo+0.007
 nurses+0.007
Pe+0.007
T+0.007
Neg logit contributions
aces-0.009
 hind-0.008
ags-0.008
sed-0.008
ses-0.008
Token
Token ID198
Feature activation-4.38e-01

Pos logit contributions
 Woo+0.018
burn+0.017
 nurses+0.016
T+0.016
Pe+0.016
Neg logit contributions
aces-0.018
 hind-0.018
ags-0.017
endant-0.017
sed-0.016
Token
Token ID198
Feature activation-5.65e-01

Pos logit contributions
 Woo+0.011
burn+0.010
 nurses+0.010
T+0.010
Pe+0.009
Neg logit contributions
aces-0.014
 hind-0.013
ags-0.013
sed-0.012
endant-0.012
TokenTom
Token ID13787
Feature activation-7.96e-01

Pos logit contributions
burn+0.018
 nurses+0.016
 Woo+0.016
 lam+0.015
 lamb+0.015
Neg logit contributions
aces-0.013
 hind-0.013
ags-0.013
amed-0.013
ak-0.012
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
 Woo+0.014
burn+0.013
 lam+0.012
 nurses+0.011
T+0.010
Neg logit contributions
 hind-0.030
ags-0.029
aces-0.029
endant-0.029
ushed-0.028
Token Mia
Token ID32189
Feature activation+5.57e-02

Pos logit contributions
aces+0.034
 hind+0.032
ags+0.031
endant+0.030
ushed+0.028
Neg logit contributions
 Woo-0.057
burn-0.057
 nurses-0.055
T-0.052
 lam-0.052
Token sigh
Token ID19680
Feature activation-8.82e-02

Pos logit contributions
aces+0.002
 hind+0.002
ags+0.001
endant+0.001
sed+0.001
Neg logit contributions
burn-0.002
 Woo-0.001
 nurses-0.001
T-0.001
Pe-0.001
Token.
Token ID13
Feature activation-8.40e-01

Pos logit contributions
 Woo+0.002
burn+0.002
 nurses+0.001
T+0.001
 lam+0.001
Neg logit contributions
 hind-0.003
aces-0.003
sed-0.003
endant-0.003
ags-0.003
Token They
Token ID1119
Feature activation-2.10e-01

Pos logit contributions
burn+0.024
 Woo+0.023
 nurses+0.023
T+0.021
 lam+0.021
Neg logit contributions
aces-0.023
 hind-0.022
ags-0.022
sed-0.021
ses-0.020
Token other
Token ID584
Feature activation+9.54e-01

Pos logit contributions
aces+0.013
 hind+0.012
ags+0.012
sed+0.011
amed+0.011
Neg logit contributions
burn-0.015
 Woo-0.015
 nurses-0.014
Pe-0.014
T-0.013
Token as
Token ID355
Feature activation-1.52e+00

Pos logit contributions
 hind+0.029
endant+0.029
aces+0.028
sed+0.028
ags+0.028
Neg logit contributions
burn-0.024
 Woo-0.023
 nurses-0.023
T-0.021
 lam-0.020
Token much
Token ID881
Feature activation-1.32e+00

Pos logit contributions
 Woo+0.035
burn+0.033
 nurses+0.033
T+0.030
 lam+0.030
Neg logit contributions
aces-0.051
 hind-0.050
ags-0.049
sed-0.046
amed-0.046
Token as
Token ID355
Feature activation-1.55e+00

Pos logit contributions
 Woo+0.030
 nurses+0.028
burn+0.028
 lam+0.025
T+0.024
Neg logit contributions
 hind-0.044
aces-0.043
amed-0.043
ags-0.042
ushed-0.042
Token we
Token ID356
Feature activation-5.93e-01

Pos logit contributions
 Woo+0.035
burn+0.034
 nurses+0.033
T+0.030
 lam+0.030
Neg logit contributions
aces-0.052
 hind-0.051
ags-0.050
sed-0.047
endant-0.047
Token can
Token ID460
Feature activation-8.64e-01

Pos logit contributions
 nurses+0.012
burn+0.012
 Woo+0.012
T+0.011
 any+0.010
Neg logit contributions
 hind-0.022
ags-0.020
aces-0.020
agged-0.019
amed-0.019
Token."
Token ID526
Feature activation+7.49e-01

Pos logit contributions
burn+0.020
 Woo+0.020
 nurses+0.018
T+0.017
Pe+0.017
Neg logit contributions
aces-0.028
 hind-0.027
ags-0.027
sed-0.025
endant-0.025
Token
Token ID198
Feature activation-2.72e-01

Pos logit contributions
aces+0.016
 hind+0.016
ags+0.016
endant+0.014
ushed+0.014
Neg logit contributions
burn-0.025
 Woo-0.024
 nurses-0.024
Pe-0.022
 lam-0.022
Token
Token ID198
Feature activation-4.45e-01

Pos logit contributions
 Woo+0.007
burn+0.007
 nurses+0.006
T+0.006
Pe+0.006
Neg logit contributions
aces-0.008
 hind-0.008
ags-0.008
sed-0.008
endant-0.008
TokenT
Token ID51
Feature activation-6.60e-01

Pos logit contributions
burn+0.008
 Woo+0.007
 nurses+0.007
 any+0.007
 lam+0.006
Neg logit contributions
 hind-0.015
amed-0.015
ushed-0.015
efully-0.014
aces-0.014
Tokeneddy
Token ID21874
Feature activation+3.44e-01

Pos logit contributions
burn+0.010
 Woo+0.009
T+0.009
 nurses+0.008
 paramed+0.007
Neg logit contributions
 hind-0.028
aces-0.027
ags-0.026
sed-0.025
endant-0.025
Token much
Token ID881
Feature activation-2.74e-01

Pos logit contributions
 Woo+0.028
burn+0.026
T+0.025
 nurses+0.025
Pe+0.024
Neg logit contributions
aces-0.035
ags-0.034
 hind-0.034
sed-0.032
ses-0.032
Token as
Token ID355
Feature activation-1.09e+00

Pos logit contributions
 Woo+0.006
 nurses+0.005
burn+0.005
 lam+0.005
 any+0.005
Neg logit contributions
aces-0.009
 hind-0.009
ags-0.009
sed-0.009
amed-0.009
Token he
Token ID339
Feature activation+1.41e-01

Pos logit contributions
 Woo+0.018
burn+0.017
 nurses+0.016
T+0.015
Pe+0.014
Neg logit contributions
aces-0.043
 hind-0.042
ags-0.042
sed-0.040
endant-0.040
Token could
Token ID714
Feature activation+5.83e-02

Pos logit contributions
ags+0.005
 hind+0.005
aces+0.005
 ours+0.004
ses+0.004
Neg logit contributions
T-0.003
eb-0.003
 Woo-0.003
Pe-0.003
where-0.003
Token.
Token ID13
Feature activation-7.40e-02

Pos logit contributions
aces+0.002
ags+0.002
 hind+0.002
endant+0.002
amed+0.002
Neg logit contributions
burn-0.001
 Woo-0.001
 any-0.001
T-0.001
where-0.001
Token But
Token ID887
Feature activation-7.71e-01

Pos logit contributions
 Woo+0.001
 B+0.001
 Th+0.001
Pe+0.001
T+0.001
Neg logit contributions
ags-0.003
aces-0.003
endant-0.003
 hind-0.003
amed-0.003
Token then
Token ID788
Feature activation-5.08e-02

Pos logit contributions
burn+0.016
 Woo+0.016
 nurses+0.015
T+0.014
Pe+0.014
Neg logit contributions
aces-0.026
 hind-0.026
ags-0.025
endant-0.024
sed-0.024
Token he
Token ID339
Feature activation+2.04e-01

Pos logit contributions
burn+0.001
 Woo+0.001
 lam+0.001
Pe+0.001
 nurses+0.001
Neg logit contributions
aces-0.001
 done-0.001
 hind-0.001
 claws-0.001
sed-0.001
Token heard
Token ID2982
Feature activation+9.97e-01

Pos logit contributions
ags+0.007
aces+0.007
 hind+0.007
endant+0.006
ses+0.006
Neg logit contributions
burn-0.005
 Woo-0.005
T-0.004
 nurses-0.004
Pe-0.004
Token a
Token ID257
Feature activation+3.32e-01

Pos logit contributions
aces+0.028
 hind+0.028
ags+0.027
endant+0.025
sed+0.025
Neg logit contributions
burn-0.027
 Woo-0.026
 nurses-0.025
Pe-0.023
T-0.023
Token strange
Token ID6283
Feature activation-1.69e-01

Pos logit contributions
aces+0.009
 hind+0.009
ags+0.009
endant+0.008
sed+0.008
Neg logit contributions
burn-0.009
 Woo-0.009
 nurses-0.009
T-0.008
Pe-0.008
Token all
Token ID477
Feature activation+7.21e-01

Pos logit contributions
aces+0.007
 hind+0.006
ags+0.006
amed+0.006
ushed+0.006
Neg logit contributions
burn-0.007
 Woo-0.006
 nurses-0.006
Pe-0.006
 lam-0.006
Token day
Token ID1110
Feature activation-1.87e+00

Pos logit contributions
aces+0.019
ags+0.019
 hind+0.018
amed+0.017
asp+0.016
Neg logit contributions
burn-0.021
 Woo-0.020
 nurses-0.019
Pe-0.019
T-0.019
Token long
Token ID890
Feature activation-2.01e+00

Pos logit contributions
burn+0.052
 Woo+0.051
 nurses+0.050
Pe+0.047
 lam+0.045
Neg logit contributions
aces-0.052
 hind-0.050
ags-0.048
sed-0.045
endant-0.044
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
burn+0.045
 Woo+0.045
 nurses+0.042
T+0.040
Pe+0.039
Neg logit contributions
aces-0.065
 hind-0.064
ags-0.063
sed-0.060
endant-0.060
Token One
Token ID1881
Feature activation+8.11e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
T+0.003
Pe+0.003
Neg logit contributions
aces-0.005
 hind-0.005
ags-0.005
endant-0.005
sed-0.005
Token day
Token ID1110
Feature activation-1.01e+00

Pos logit contributions
aces+0.023
 hind+0.023
ags+0.022
sed+0.021
endant+0.021
Neg logit contributions
burn-0.022
 Woo-0.021
 nurses-0.020
T-0.019
Pe-0.019
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
burn+0.033
 nurses+0.031
 lam+0.029
 Woo+0.029
 lamb+0.029
Neg logit contributions
aces-0.021
sed-0.020
 hind-0.019
ushed-0.019
 done-0.019
Token she
Token ID673
Feature activation-4.87e-01

Pos logit contributions
 hind+0.004
aces+0.003
ags+0.003
sed+0.003
ses+0.003
Neg logit contributions
burn-0.003
 lam-0.003
 nurses-0.003
 Woo-0.003
Sim-0.003
Token saw
Token ID2497
Feature activation+1.49e-01

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.013
Pe+0.013
T+0.013
Neg logit contributions
aces-0.012
 hind-0.012
ags-0.012
sed-0.011
endant-0.011
Token a
Token ID257
Feature activation+5.85e-01

Pos logit contributions
aces+0.004
 hind+0.004
ags+0.004
endant+0.003
amed+0.003
Neg logit contributions
burn-0.004
 Woo-0.004
 nurses-0.004
Pe-0.004
 lam-0.004
Token parade
Token ID16134
Feature activation+1.16e+00

Pos logit contributions
 hind+0.014
aces+0.014
ags+0.013
endant+0.013
sed+0.013
Neg logit contributions
burn-0.018
 Woo-0.018
 nurses-0.017
T-0.016
Pe-0.016
Token She
Token ID1375
Feature activation-3.94e-01

Pos logit contributions
 Woo+0.011
burn+0.010
 nurses+0.010
T+0.010
Pe+0.009
Neg logit contributions
 hind-0.023
aces-0.021
ags-0.021
sed-0.021
endant-0.021
Token was
Token ID373
Feature activation-4.68e-01

Pos logit contributions
 Woo+0.012
T+0.011
burn+0.011
 nurses+0.011
Pe+0.011
Neg logit contributions
 hind-0.011
ags-0.010
aces-0.010
endant-0.009
sed-0.009
Token very
Token ID845
Feature activation+7.90e-01

Pos logit contributions
burn+0.017
 Woo+0.017
T+0.017
 nurses+0.016
eb+0.016
Neg logit contributions
 hind-0.008
aces-0.008
ags-0.007
sed-0.007
amed-0.007
Token fearful
Token ID26844
Feature activation+1.63e-01

Pos logit contributions
aces+0.019
 hind+0.018
ags+0.018
endant+0.017
sed+0.016
Neg logit contributions
 Woo-0.025
burn-0.024
 nurses-0.024
Pe-0.023
T-0.022
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
aces+0.005
 hind+0.004
ags+0.004
sed+0.004
endant+0.004
Neg logit contributions
burn-0.005
 Woo-0.004
Pe-0.004
 nurses-0.004
T-0.004
Token Everywhere
Token ID48201
Feature activation-6.62e-01

Pos logit contributions
 Woo+0.006
burn+0.006
T+0.005
 nurses+0.005
Pe+0.005
Neg logit contributions
 hind-0.015
aces-0.014
endant-0.014
amed-0.013
ags-0.013
Token she
Token ID673
Feature activation+8.04e-01

Pos logit contributions
burn+0.009
 Woo+0.008
 nurses+0.007
T+0.007
Pe+0.006
Neg logit contributions
aces-0.028
 hind-0.028
ags-0.027
sed-0.026
endant-0.026
Token went
Token ID1816
Feature activation+3.51e-01

Pos logit contributions
aces+0.022
 hind+0.021
ags+0.021
endant+0.020
sed+0.020
Neg logit contributions
burn-0.022
 Woo-0.022
 nurses-0.020
Pe-0.020
T-0.020
Token,
Token ID11
Feature activation+5.85e-02

Pos logit contributions
aces+0.010
ags+0.009
 done+0.009
 hind+0.008
sed+0.008
Neg logit contributions
burn-0.010
 Woo-0.009
T-0.009
Pe-0.008
 nurses-0.008
Token she
Token ID673
Feature activation+1.96e-01

Pos logit contributions
aces+0.002
 hind+0.002
ags+0.002
endant+0.002
sed+0.002
Neg logit contributions
burn-0.001
 Woo-0.001
 nurses-0.001
T-0.001
Pe-0.001
Token heard
Token ID2982
Feature activation+7.17e-01

Pos logit contributions
aces+0.005
 hind+0.005
ags+0.005
sed+0.004
endant+0.004
Neg logit contributions
burn-0.006
 Woo-0.006
 nurses-0.005
T-0.005
Pe-0.005
Token,
Token ID11
Feature activation-7.95e-01

Pos logit contributions
burn+0.014
 Woo+0.014
 nurses+0.013
Pe+0.013
T+0.013
Neg logit contributions
aces-0.012
ags-0.012
 hind-0.011
 hips-0.010
sed-0.010
Token but
Token ID475
Feature activation-8.54e-01

Pos logit contributions
burn+0.017
 Woo+0.016
 nurses+0.015
T+0.014
Pe+0.014
Neg logit contributions
aces-0.027
 hind-0.027
ags-0.026
sed-0.025
endant-0.025
Token it
Token ID340
Feature activation-1.64e-01

Pos logit contributions
burn+0.017
 Woo+0.016
 nurses+0.015
 lam+0.014
T+0.014
Neg logit contributions
aces-0.031
 hind-0.030
ags-0.030
sed-0.028
endant-0.028
Token was
Token ID373
Feature activation-1.89e-01

Pos logit contributions
burn+0.005
 Woo+0.005
 nurses+0.004
T+0.004
Pe+0.004
Neg logit contributions
aces-0.005
 hind-0.004
ags-0.004
endant-0.004
sed-0.004
Token too
Token ID1165
Feature activation+7.52e-01

Pos logit contributions
burn+0.005
 Woo+0.005
 any+0.005
 nurses+0.005
 lam+0.004
Neg logit contributions
aces-0.006
 hind-0.005
ushed-0.005
endant-0.005
ags-0.005
Token late
Token ID2739
Feature activation-2.22e+00

Pos logit contributions
aces+0.024
 hind+0.023
ags+0.023
sed+0.022
endant+0.021
Neg logit contributions
burn-0.019
 Woo-0.018
 nurses-0.017
T-0.017
Pe-0.016
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
burn+0.051
 Woo+0.051
 nurses+0.048
T+0.045
Pe+0.044
Neg logit contributions
aces-0.071
 hind-0.070
ags-0.068
sed-0.065
endant-0.064
Token Lily
Token ID20037
Feature activation-3.62e-01

Pos logit contributions
 Woo+0.001
burn+0.001
 nurses+0.001
T+0.001
Pe+0.001
Neg logit contributions
 hind-0.001
aces-0.001
ags-0.001
amed-0.001
endant-0.001
Token was
Token ID373
Feature activation-7.28e-01

Pos logit contributions
 Woo+0.010
burn+0.009
 nurses+0.009
T+0.009
 lam+0.008
Neg logit contributions
ags-0.011
 hind-0.010
aces-0.010
endant-0.010
agged-0.009
Token gone
Token ID3750
Feature activation-4.75e-01

Pos logit contributions
burn+0.022
 Woo+0.021
 nurses+0.020
 lam+0.019
Pe+0.019
Neg logit contributions
aces-0.019
 hind-0.018
ags-0.017
endant-0.017
ushed-0.017
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.009
 lam+0.009
Neg logit contributions
aces-0.016
 hind-0.015
ags-0.015
sed-0.014
endant-0.014
TokenOnce
Token ID7454
Feature activation+6.16e-01

Pos logit contributions
 hind+0.017
aces+0.016
sed+0.015
ags+0.015
endant+0.015
Neg logit contributions
 Woo-0.010
burn-0.010
T-0.009
Pe-0.009
where-0.009
Token there
Token ID612
Feature activation-2.60e-01

Pos logit contributions
aces+0.014
 hind+0.013
ags+0.013
sed+0.012
endant+0.012
Neg logit contributions
burn-0.020
 Woo-0.020
 nurses-0.019
T-0.018
Pe-0.018
Token was
Token ID373
Feature activation-1.31e-01

Pos logit contributions
burn+0.008
 Woo+0.008
 nurses+0.008
T+0.007
 lam+0.007
Neg logit contributions
aces-0.006
 hind-0.006
ags-0.006
sed-0.006
endant-0.006
Token a
Token ID257
Feature activation-8.14e-01

Pos logit contributions
burn+0.003
 Woo+0.003
 nurses+0.003
T+0.003
Pe+0.003
Neg logit contributions
aces-0.004
 hind-0.004
ags-0.004
sed-0.003
endant-0.003
Token turtle
Token ID28699
Feature activation-7.63e-02

Pos logit contributions
burn+0.027
 Woo+0.026
 nurses+0.026
or+0.024
 any+0.024
Neg logit contributions
 hind-0.016
sed-0.014
asp-0.013
 juicy-0.013
 mushroom-0.013
Token who
Token ID508
Feature activation-2.24e+00

Pos logit contributions
burn+0.002
 Woo+0.002
 nurses+0.002
Pe+0.002
T+0.002
Neg logit contributions
aces-0.002
 hind-0.002
sed-0.002
endant-0.002
ags-0.002
Token loved
Token ID6151
Feature activation-8.86e-01

Pos logit contributions
burn+0.087
 Woo+0.085
 nurses+0.083
 lam+0.079
Pe+0.078
Neg logit contributions
aces-0.038
 hind-0.034
ags-0.033
sed-0.031
ushed-0.031
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
 Woo+0.020
burn+0.020
 nurses+0.018
 lam+0.017
T+0.017
Neg logit contributions
aces-0.030
 hind-0.029
ags-0.029
endant-0.027
sed-0.027
Token laugh
Token ID6487
Feature activation+1.21e+00

Pos logit contributions
burn+0.026
 Woo+0.026
 nurses+0.024
T+0.022
 lam+0.022
Neg logit contributions
aces-0.041
 hind-0.040
ags-0.039
endant-0.038
sed-0.037
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
aces+0.040
 hind+0.040
ags+0.039
sed+0.037
endant+0.037
Neg logit contributions
 Woo-0.027
burn-0.026
 nurses-0.024
T-0.023
 lam-0.022
Token He
Token ID679
Feature activation-4.15e-01

Pos logit contributions
burn+0.034
 Woo+0.032
 nurses+0.031
T+0.030
 lam+0.030
Neg logit contributions
aces-0.027
 hind-0.026
ags-0.025
sed-0.023
endant-0.023
Token careful
Token ID8161
Feature activation-1.78e+00

Pos logit contributions
 Woo+0.017
burn+0.017
 nurses+0.016
T+0.015
Pe+0.015
Neg logit contributions
aces-0.010
 hind-0.010
ags-0.010
sed-0.009
endant-0.009
Token and
Token ID290
Feature activation-6.80e-01

Pos logit contributions
 Woo+0.049
burn+0.049
Pe+0.046
 nurses+0.045
 paramed+0.043
Neg logit contributions
aces-0.054
ags-0.048
 hind-0.048
amed-0.045
ushed-0.044
Token to
Token ID284
Feature activation-6.18e-01

Pos logit contributions
 nurses+0.014
 Woo+0.014
burn+0.014
T+0.012
Pe+0.012
Neg logit contributions
aces-0.024
ags-0.024
 hind-0.023
agged-0.022
endant-0.022
Token always
Token ID1464
Feature activation-7.06e-01

Pos logit contributions
burn+0.016
 Woo+0.016
 nurses+0.015
 lam+0.014
T+0.014
Neg logit contributions
 hind-0.017
aces-0.017
ags-0.017
endant-0.016
sed-0.016
Token think
Token ID892
Feature activation-7.68e-01

Pos logit contributions
burn+0.016
 Woo+0.016
 nurses+0.015
T+0.014
Pe+0.014
Neg logit contributions
aces-0.023
 hind-0.023
ags-0.022
endant-0.021
sed-0.021
Token before
Token ID878
Feature activation-2.28e+00

Pos logit contributions
 Woo+0.019
burn+0.018
 nurses+0.018
T+0.017
 lam+0.016
Neg logit contributions
 hind-0.023
aces-0.023
ags-0.023
sed-0.022
endant-0.021
Token he
Token ID339
Feature activation-7.37e-01

Pos logit contributions
burn+0.055
 Woo+0.052
 nurses+0.050
Pe+0.047
 paramed+0.047
Neg logit contributions
aces-0.069
 hind-0.068
ags-0.064
sed-0.063
endant-0.062
Token acted
Token ID13134
Feature activation-4.45e-01

Pos logit contributions
 nurses+0.018
burn+0.017
 Woo+0.017
T+0.017
 any+0.016
Neg logit contributions
 hind-0.024
ags-0.022
aces-0.021
amed-0.020
asp-0.020
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
burn+0.011
 Woo+0.011
 nurses+0.010
T+0.010
Pe+0.010
Neg logit contributions
aces-0.013
 hind-0.013
ags-0.013
sed-0.012
endant-0.012
Token He
Token ID679
Feature activation-1.04e-01

Pos logit contributions
burn+0.016
 Woo+0.016
 nurses+0.015
 lam+0.015
Pe+0.015
Neg logit contributions
aces-0.012
 hind-0.012
ags-0.011
sed-0.011
endant-0.010
Token never
Token ID1239
Feature activation+5.85e-01

Pos logit contributions
 Woo+0.003
burn+0.003
 nurses+0.003
T+0.003
 lam+0.003
Neg logit contributions
 hind-0.003
aces-0.003
ags-0.003
endant-0.002
sed-0.002
Token make
Token ID787
Feature activation-1.43e+00

Pos logit contributions
 nurses+0.026
 Woo+0.024
burn+0.022
 any+0.022
T+0.020
Neg logit contributions
aces-0.053
ags-0.052
agged-0.052
 hind-0.051
ushed-0.050
Token them
Token ID606
Feature activation-1.27e+00

Pos logit contributions
burn+0.035
 Woo+0.035
 nurses+0.033
T+0.031
Pe+0.031
Neg logit contributions
aces-0.044
 hind-0.043
ags-0.042
sed-0.039
endant-0.039
Token even
Token ID772
Feature activation-1.29e+00

Pos logit contributions
burn+0.033
 Woo+0.031
 nurses+0.030
T+0.027
 any+0.027
Neg logit contributions
 hind-0.038
aces-0.037
sed-0.036
ags-0.036
ses-0.034
Token better
Token ID1365
Feature activation-9.29e-01

Pos logit contributions
 nurses+0.027
burn+0.026
 Woo+0.025
 any+0.023
 lam+0.022
Neg logit contributions
aces-0.045
 hind-0.044
sed-0.044
ags-0.043
ushed-0.042
Token than
Token ID621
Feature activation-1.76e+00

Pos logit contributions
burn+0.024
 Woo+0.024
 nurses+0.023
 any+0.022
T+0.021
Neg logit contributions
 hind-0.027
aces-0.027
endant-0.026
ags-0.026
sed-0.026
Token before
Token ID878
Feature activation-2.64e+00

Pos logit contributions
 any+0.034
 nurses+0.032
 Woo+0.032
burn+0.030
 them+0.029
Neg logit contributions
ushed-0.067
aces-0.067
 hind-0.065
amed-0.065
agged-0.063
Token.
Token ID13
Feature activation-8.55e-01

Pos logit contributions
 Woo+0.060
burn+0.058
 nurses+0.055
 lam+0.052
 any+0.051
Neg logit contributions
 hind-0.090
aces-0.088
ushed-0.085
ags-0.085
amed-0.084
Token And
Token ID843
Feature activation-6.80e-01

Pos logit contributions
 Woo+0.024
burn+0.024
 nurses+0.023
T+0.022
Pe+0.022
Neg logit contributions
aces-0.023
 hind-0.023
ags-0.022
endant-0.021
sed-0.021
Token we
Token ID356
Feature activation-1.07e+00

Pos logit contributions
burn+0.016
 Woo+0.016
 nurses+0.015
T+0.014
Pe+0.014
Neg logit contributions
aces-0.021
 hind-0.021
ags-0.020
sed-0.019
endant-0.019
Token can
Token ID460
Feature activation-8.86e-01

Pos logit contributions
 Woo+0.026
 nurses+0.025
burn+0.025
T+0.024
 paramed+0.022
Neg logit contributions
 hind-0.036
aces-0.032
ags-0.032
sed-0.031
agged-0.030
Token put
Token ID1234
Feature activation-1.47e+00

Pos logit contributions
 Woo+0.025
burn+0.025
 nurses+0.024
 lam+0.023
Pe+0.023
Neg logit contributions
aces-0.023
ags-0.023
 hind-0.022
endant-0.021
sed-0.020
Token higher
Token ID2440
Feature activation-9.43e-01

Pos logit contributions
 Woo+0.015
burn+0.015
 nurses+0.014
Pe+0.013
 lam+0.013
Neg logit contributions
aces-0.020
 hind-0.019
ags-0.019
endant-0.018
sed-0.018
Token than
Token ID621
Feature activation-9.48e-01

Pos logit contributions
burn+0.024
 Woo+0.024
 nurses+0.022
T+0.021
Pe+0.021
Neg logit contributions
aces-0.028
 hind-0.027
ags-0.027
endant-0.026
sed-0.026
Token she
Token ID673
Feature activation-3.64e-01

Pos logit contributions
 Woo+0.021
burn+0.020
 nurses+0.019
T+0.018
Pe+0.017
Neg logit contributions
aces-0.032
 hind-0.032
ags-0.031
sed-0.030
endant-0.030
Token ever
Token ID1683
Feature activation+1.18e+00

Pos logit contributions
T+0.008
burn+0.008
Pe+0.008
where+0.008
 nurses+0.007
Neg logit contributions
ags-0.013
 hind-0.012
agged-0.011
endant-0.011
ses-0.011
Token had
Token ID550
Feature activation+2.49e-01

Pos logit contributions
ags+0.039
 hind+0.038
aces+0.035
agged+0.033
sed+0.033
Neg logit contributions
 Woo-0.028
burn-0.028
T-0.027
 nurses-0.025
 any-0.025
Token before
Token ID878
Feature activation-2.61e+00

Pos logit contributions
 hind+0.009
ags+0.009
ushed+0.009
ses+0.009
sed+0.009
Neg logit contributions
 any-0.004
 Woo-0.004
burn-0.004
 one-0.004
T-0.004
Token.
Token ID13
Feature activation-8.73e-04

Pos logit contributions
 Woo+0.049
burn+0.048
 lam+0.041
T+0.040
 nurses+0.038
Neg logit contributions
aces-0.098
 hind-0.096
ags-0.095
ushed-0.093
sed-0.092
Token She
Token ID1375
Feature activation+3.27e-01

Pos logit contributions
 Woo+0.000
burn+0.000
T+0.000
 nurses+0.000
Pe+0.000
Neg logit contributions
aces-0.000
 hind-0.000
ags-0.000
endant-0.000
sed-0.000
Token felt
Token ID2936
Feature activation-6.17e-02

Pos logit contributions
ags+0.013
 hind+0.011
aces+0.011
agged+0.011
ses+0.011
Neg logit contributions
 Woo-0.006
T-0.006
 nurses-0.006
burn-0.006
 lam-0.006
Token so
Token ID523
Feature activation+1.08e+00

Pos logit contributions
 Woo+0.002
burn+0.002
 nurses+0.001
Pe+0.001
T+0.001
Neg logit contributions
aces-0.002
 hind-0.002
ags-0.002
sed-0.002
endant-0.002
Token high
Token ID1029
Feature activation+2.45e-01

Pos logit contributions
aces+0.032
sed+0.031
 hind+0.030
ses+0.030
ags+0.029
Neg logit contributions
 Woo-0.027
Pe-0.025
 lam-0.023
T-0.023
burn-0.023