TOP ACTIVATIONS
MAX = 0.077

his puzzle and thanked Wh isk ers for helping him .
Ch ir py thanked Wh isk ers for helping him .
. She grabbed the fire extingu isher and put out the
. The vet showed Wh isk ers the x - ray
to the playground and showed all of his friends the penny
box and pulled out a brand new pair of shoes .
family sh oo ed Wh isk ers away and told Max
bag and pulled out a brand new doll for Sally .
to unlock ," said Wh isk ers . "
In fact , the special bal m made her ankle look
happy that she hugged Wh isk ers . " Thank you
wise old cat named Wh isk ers . Wh isk ers
the garden , showing it all the pretty flowers and y
. The owl told Wh isk ers , " To be
loved the feeling of making someone happy . He was glad
the kitten , showing it all of the new places he
she could and pushed Wh isk ers out of the way
a cloth and wiped it all over . Then , she
flew down and scared Wh isk ers away .
that her toys would make someone else smile .

MIN ACTIVATIONS
MIN = -0.072

" What is under the fence ?" Tom asks
says " what is going on here ? Why are you
and says , " Stop repeating me , Sue ! You
and shouts , " Whe e ! That was awesome !"
they lived on either side of a broken
and shout , " Whe e !" They roll
Sam , what 's going on ?" Mom says . She
. " Whe e !" Lily shouts . "
. " Whe e !" Lily says . "
He has a big man e of hair and a loud
" Time to bat he , L ila and Tom
and has a big man e . He can roar very
and says , " Stop repeating me , Sue ! You
Hey , what 's going on here ?" a voice said
and said , " Whe e , Brown ie , you
. " Why are you on my nose ?"
. " Why are you under the sheet ?"
. " Whe e !" Anna and Ben say
bird . Why are you on my ugly mailbox ?" The
of the perfect field to gra ze . He wanted to

INTERVAL 0.040 - 0.077
CONTAINS 0.574%

, a 3 - year - old boy , was curious
The globe was so happy to be whole again and it
so strong and it blew all the trees and destroyed the
" We have something for you , L ila
, it must belong to someone else . We must give

INTERVAL 0.003 - 0.040
CONTAINS 41.507%

it hurt a lot ! Fortunately , Jane had
and Sam are brothers . They like to play with their
" It 's okay , but you need to ask before
was playing with a big , furry bear . The bear
really badly and he couldn 't walk anymore .

INTERVAL -0.035 - 0.003
CONTAINS 56.960%

They follow Dad to the car . They go
herself . From that day on , Lily
had had enough . He yelled and punched the wall ,
they still looked sad . They hoped the baby k ang
cannot go to school . She misses Ben a lot .

INTERVAL -0.072 - -0.035
CONTAINS 0.960%

excited . She was going on her very first trip !
From that day on , Tim and his friends
my replied . Suddenly , they saw a big
, " Rex , stop barking . It 's just a
was Velvet . Velvet had soft hair and a pretty dress
Token his
Token ID465
Feature activation-1.64e-02

Pos logit contributions
 ears+0.015
 wheels+0.015
 tail+0.015
ps+0.014
 scales+0.014
Neg logit contributions
des-0.030
and-0.029
to-0.029
ido-0.028
 inspection-0.027
Token puzzle
Token ID15027
Feature activation-9.02e-03

Pos logit contributions
 ears+0.026
uders+0.026
ps+0.026
ities+0.026
 wheels+0.026
Neg logit contributions
des-0.027
and-0.027
to-0.025
 proper-0.024
old-0.024
Token and
Token ID290
Feature activation+2.39e-03

Pos logit contributions
 ears+0.008
 wheels+0.007
 tail+0.007
ps+0.007
uders+0.007
Neg logit contributions
des-0.022
and-0.022
to-0.022
ido-0.021
old-0.021
Token thanked
Token ID26280
Feature activation+2.20e-02

Pos logit contributions
des+0.004
and+0.004
to+0.004
 inspection+0.004
ido+0.004
Neg logit contributions
 ears-0.004
 wheels-0.003
 tail-0.003
ities-0.003
ps-0.003
Token Wh
Token ID854
Feature activation+3.70e-02

Pos logit contributions
des+0.042
and+0.041
to+0.040
ido+0.039
 inspection+0.038
Neg logit contributions
 ears-0.032
 wheels-0.030
 tail-0.030
ps-0.030
uders-0.029
Tokenisk
Token ID1984
Feature activation+7.72e-02

Pos logit contributions
des+0.061
and+0.056
to+0.055
 inspection+0.054
ido+0.053
Neg logit contributions
 ears-0.064
 tail-0.063
ps-0.063
e-0.061
 wheels-0.061
Tokeners
Token ID364
Feature activation+1.59e-02

Pos logit contributions
des+0.251
bie+0.246
ido+0.239
 inspection+0.238
yes+0.235
Neg logit contributions
ps-0.016
e-0.012
ates-0.007
ers-0.001
ters+0.001
Token for
Token ID329
Feature activation+4.04e-03

Pos logit contributions
des+0.034
to+0.032
 inspection+0.032
and+0.031
ido+0.031
Neg logit contributions
 ears-0.020
 tail-0.019
e-0.018
ps-0.018
 wheels-0.018
Token helping
Token ID5742
Feature activation+4.05e-02

Pos logit contributions
des+0.009
and+0.009
to+0.009
ido+0.008
 inspection+0.008
Neg logit contributions
 ears-0.005
 tail-0.004
 wheels-0.004
ps-0.004
 scales-0.004
Token him
Token ID683
Feature activation+1.26e-02

Pos logit contributions
des+0.080
and+0.073
to+0.072
ido+0.071
bie+0.070
Neg logit contributions
 ears-0.060
 tail-0.059
 wheels-0.057
e-0.056
ps-0.056
Token.
Token ID13
Feature activation+8.86e-03

Pos logit contributions
des+0.028
and+0.028
to+0.027
old+0.026
ido+0.026
Neg logit contributions
 ears-0.014
 wheels-0.014
 tail-0.014
uders-0.013
ps-0.013
Token Ch
Token ID609
Feature activation+5.70e-03

Pos logit contributions
des+0.012
and+0.011
to+0.011
 inspection+0.010
ido+0.010
Neg logit contributions
 ears-0.007
 wheels-0.007
 tail-0.007
 wings-0.006
 scales-0.006
Tokenir
Token ID343
Feature activation-6.88e-03

Pos logit contributions
ido+0.011
des+0.011
and+0.010
to+0.010
old+0.010
Neg logit contributions
 ears-0.009
 scales-0.009
 wheels-0.008
 tail-0.008
 wings-0.008
Tokenpy
Token ID9078
Feature activation+7.42e-03

Pos logit contributions
ities+0.007
 ears+0.007
 wheels+0.007
 middle+0.007
e+0.007
Neg logit contributions
and-0.015
to-0.015
old-0.014
ido-0.014
des-0.014
Token thanked
Token ID26280
Feature activation+2.25e-02

Pos logit contributions
des+0.014
and+0.014
to+0.014
old+0.013
 inspection+0.013
Neg logit contributions
 ears-0.010
ities-0.010
 wheels-0.010
 tail-0.010
uders-0.010
Token Wh
Token ID854
Feature activation+4.06e-02

Pos logit contributions
des+0.046
and+0.044
to+0.042
ido+0.041
bie+0.041
Neg logit contributions
 ears-0.032
 wheels-0.030
 tail-0.030
ps-0.029
e-0.029
Tokenisk
Token ID1984
Feature activation+7.46e-02

Pos logit contributions
des+0.067
and+0.062
to+0.061
 inspection+0.060
old+0.058
Neg logit contributions
 ears-0.069
ps-0.068
 tail-0.068
ities-0.067
ates-0.066
Tokeners
Token ID364
Feature activation+1.51e-02

Pos logit contributions
des+0.235
bie+0.230
ido+0.223
 inspection+0.222
to+0.221
Neg logit contributions
ps-0.021
e-0.017
ates-0.012
per-0.005
 scales-0.004
Token for
Token ID329
Feature activation+4.17e-03

Pos logit contributions
des+0.038
to+0.036
and+0.035
 inspection+0.035
ido+0.034
Neg logit contributions
 ears-0.014
 tail-0.013
 wings-0.012
e-0.012
 wheels-0.012
Token helping
Token ID5742
Feature activation+4.21e-02

Pos logit contributions
des+0.010
and+0.010
to+0.010
ido+0.009
bie+0.009
Neg logit contributions
 ears-0.004
 tail-0.004
ps-0.004
 wheels-0.004
e-0.004
Token him
Token ID683
Feature activation+1.32e-02

Pos logit contributions
des+0.084
and+0.077
to+0.076
bie+0.074
ido+0.074
Neg logit contributions
 ears-0.062
 tail-0.059
 wheels-0.059
e-0.058
ps-0.057
Token.
Token ID13
Feature activation+5.76e-03

Pos logit contributions
des+0.031
and+0.030
to+0.030
ido+0.029
 inspection+0.029
Neg logit contributions
 ears-0.013
 tail-0.012
 wheels-0.012
ps-0.012
 wings-0.012
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 ears+0.004
 wheels+0.004
 tail+0.004
uders+0.004
ps+0.004
Neg logit contributions
des-0.009
and-0.009
to-0.009
ido-0.009
 inspection-0.009
Token She
Token ID1375
Feature activation-5.23e-03

Pos logit contributions
des+0.039
ido+0.035
and+0.035
to+0.033
old+0.033
Neg logit contributions
 ears-0.023
 wheels-0.020
 tail-0.020
 skies-0.020
 eyes-0.019
Token grabbed
Token ID13176
Feature activation+2.67e-03

Pos logit contributions
 ears+0.008
 wheels+0.007
 tail+0.007
ps+0.007
 scales+0.007
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.009
 inspection-0.009
Token the
Token ID262
Feature activation-9.62e-03

Pos logit contributions
des+0.007
and+0.007
to+0.006
ido+0.006
old+0.006
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
ps-0.002
 wings-0.002
Token fire
Token ID2046
Feature activation+1.93e-02

Pos logit contributions
 ears+0.010
uders+0.010
ities+0.010
 tail+0.010
e+0.010
Neg logit contributions
des-0.021
and-0.020
to-0.020
ido-0.020
bie-0.020
Token extingu
Token ID24995
Feature activation+7.38e-02

Pos logit contributions
des+0.046
and+0.045
to+0.044
ido+0.043
 inspection+0.042
Neg logit contributions
 ears-0.019
 wheels-0.018
 tail-0.018
ps-0.017
 scales-0.017
Tokenisher
Token ID4828
Feature activation+4.12e-03

Pos logit contributions
and+0.137
to+0.133
des+0.130
ido+0.124
 inspection+0.122
Neg logit contributions
ps-0.110
 wheels-0.110
 ears-0.110
 scales-0.103
e-0.100
Token and
Token ID290
Feature activation-8.39e-03

Pos logit contributions
des+0.011
and+0.011
to+0.010
ido+0.010
 inspection+0.010
Neg logit contributions
 ears-0.003
 wheels-0.003
 tail-0.003
ps-0.003
 wings-0.003
Token put
Token ID1234
Feature activation+1.16e-02

Pos logit contributions
 ears+0.011
 wheels+0.011
 tail+0.010
 wings+0.010
ps+0.010
Neg logit contributions
des-0.018
and-0.017
to-0.017
ido-0.017
 inspection-0.016
Token out
Token ID503
Feature activation+1.40e-02

Pos logit contributions
des+0.026
and+0.026
ido+0.024
to+0.024
old+0.024
Neg logit contributions
 wheels-0.014
 ears-0.014
ps-0.014
e-0.013
 wings-0.013
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
des+0.033
and+0.032
to+0.031
ido+0.031
old+0.030
Neg logit contributions
 ears-0.015
 wheels-0.014
 tail-0.014
ps-0.014
 wings-0.014
Token.
Token ID13
Feature activation-4.93e-03

Pos logit contributions
 ears+0.008
 wheels+0.007
 tail+0.007
ps+0.007
 scales+0.007
Neg logit contributions
des-0.019
and-0.019
to-0.019
ido-0.018
 inspection-0.018
Token The
Token ID383
Feature activation-1.78e-02

Pos logit contributions
 ears+0.006
 wheels+0.006
 tail+0.006
e+0.006
 scales+0.006
Neg logit contributions
des-0.010
and-0.010
to-0.010
 inspection-0.009
ido-0.009
Token vet
Token ID20202
Feature activation+1.89e-02

Pos logit contributions
uders+0.023
ps+0.023
ates+0.022
bed+0.022
 ears+0.022
Neg logit contributions
des-0.035
ido-0.033
old-0.033
and-0.032
to-0.032
Token showed
Token ID3751
Feature activation+2.26e-02

Pos logit contributions
des+0.035
and+0.033
to+0.033
ido+0.032
old+0.031
Neg logit contributions
 ears-0.029
 tail-0.028
 wheels-0.028
ps-0.027
 scales-0.027
Token Wh
Token ID854
Feature activation+2.82e-02

Pos logit contributions
des+0.059
and+0.054
ido+0.054
yes+0.053
to+0.053
Neg logit contributions
 ears-0.021
 wings-0.019
 wheels-0.019
ps-0.019
e-0.019
Tokenisk
Token ID1984
Feature activation+7.07e-02

Pos logit contributions
des+0.044
and+0.041
to+0.040
 inspection+0.039
ido+0.038
Neg logit contributions
 ears-0.051
 wheels-0.050
 tail-0.049
ps-0.049
e-0.049
Tokeners
Token ID364
Feature activation+1.15e-02

Pos logit contributions
des+0.216
bie+0.208
ido+0.206
 inspection+0.203
to+0.202
Neg logit contributions
ps-0.023
e-0.020
ates-0.018
 wheels-0.014
 scales-0.013
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
des+0.025
and+0.024
to+0.024
ido+0.023
 inspection+0.023
Neg logit contributions
 ears-0.013
 tail-0.013
 wheels-0.013
ps-0.012
 scales-0.012
Token x
Token ID2124
Feature activation+1.10e-02

Pos logit contributions
 ears+0.018
 tail+0.018
uders+0.018
 wheels+0.017
ps+0.017
Neg logit contributions
des-0.033
and-0.032
to-0.032
ido-0.031
old-0.030
Token-
Token ID12
Feature activation+3.34e-02

Pos logit contributions
des+0.020
and+0.019
to+0.019
ido+0.018
 inspection+0.018
Neg logit contributions
 ears-0.017
 tail-0.016
 wheels-0.016
uders-0.016
ps-0.016
Tokenray
Token ID2433
Feature activation+2.09e-02

Pos logit contributions
des+0.037
and+0.034
 inspection+0.034
ido+0.033
bie+0.032
Neg logit contributions
 wheels-0.073
e-0.072
 ears-0.072
 scales-0.071
ps-0.071
Token to
Token ID284
Feature activation+2.24e-02

Pos logit contributions
des+0.053
ido+0.049
to+0.047
bie+0.047
and+0.046
Neg logit contributions
 wheels-0.028
 ears-0.027
 front-0.027
 tail-0.026
 wings-0.026
Token the
Token ID262
Feature activation+6.67e-03

Pos logit contributions
des+0.059
and+0.056
to+0.054
ll+0.053
ido+0.053
Neg logit contributions
 front-0.019
 ears-0.019
 wheels-0.018
e-0.018
 tail-0.018
Token playground
Token ID24817
Feature activation+2.17e-03

Pos logit contributions
des+0.013
and+0.012
to+0.012
ido+0.011
old+0.011
Neg logit contributions
 ears-0.010
ps-0.009
ities-0.009
uders-0.009
 tail-0.009
Token and
Token ID290
Feature activation+1.10e-03

Pos logit contributions
des+0.006
and+0.005
to+0.005
ido+0.005
 inspection+0.005
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
ps-0.001
 scales-0.001
Token showed
Token ID3751
Feature activation+2.77e-02

Pos logit contributions
des+0.002
and+0.002
ido+0.002
to+0.002
 inspection+0.002
Neg logit contributions
 ears-0.001
 wheels-0.001
 tail-0.001
 scales-0.001
 wings-0.001
Token all
Token ID477
Feature activation+6.98e-02

Pos logit contributions
des+0.067
ll+0.062
bie+0.061
ido+0.061
and+0.061
Neg logit contributions
 wheels-0.031
 ears-0.030
 wings-0.030
 scales-0.029
ps-0.028
Token of
Token ID286
Feature activation+5.22e-02

Pos logit contributions
des+0.183
to+0.175
yes+0.169
ll+0.167
and+0.167
Neg logit contributions
 ears-0.057
 wheels-0.055
 front-0.054
 tail-0.051
 scales-0.051
Token his
Token ID465
Feature activation+1.10e-03

Pos logit contributions
des+0.142
to+0.127
ido+0.127
yes+0.124
and+0.123
Neg logit contributions
 ears-0.046
 wheels-0.044
 scales-0.044
 walls-0.041
 animals-0.040
Token friends
Token ID2460
Feature activation+2.94e-02

Pos logit contributions
des+0.003
and+0.003
to+0.003
ido+0.003
 inspection+0.003
Neg logit contributions
 ears-0.001
 wheels-0.001
 tail-0.001
 scales-0.001
 wings-0.001
Token the
Token ID262
Feature activation+6.25e-03

Pos logit contributions
des+0.077
to+0.072
and+0.072
ido+0.070
 inspection+0.070
Neg logit contributions
 ears-0.024
 wheels-0.022
 tail-0.022
e-0.020
ps-0.020
Token penny
Token ID26369
Feature activation+1.99e-02

Pos logit contributions
des+0.012
and+0.012
to+0.012
ido+0.012
 inspection+0.011
Neg logit contributions
 ears-0.009
 wheels-0.009
 tail-0.008
 scales-0.008
 wings-0.008
Token box
Token ID3091
Feature activation+1.18e-02

Pos logit contributions
uders+0.001
 ears+0.001
e+0.001
 tail+0.001
ities+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
 inspection-0.001
ido-0.001
Token and
Token ID290
Feature activation-3.85e-03

Pos logit contributions
des+0.032
and+0.031
to+0.031
ido+0.031
bie+0.030
Neg logit contributions
 ears-0.008
 wheels-0.007
 tail-0.007
 wings-0.007
 scales-0.006
Token pulled
Token ID5954
Feature activation+1.94e-02

Pos logit contributions
 ears+0.004
bed+0.004
 wings+0.004
 wheels+0.004
 walls+0.004
Neg logit contributions
des-0.009
and-0.008
ido-0.008
rig-0.008
to-0.008
Token out
Token ID503
Feature activation+8.46e-03

Pos logit contributions
des+0.043
and+0.041
ido+0.039
to+0.039
bie+0.038
Neg logit contributions
 ears-0.024
 wheels-0.024
 tail-0.023
ps-0.023
 wings-0.022
Token a
Token ID257
Feature activation+2.48e-02

Pos logit contributions
des+0.023
and+0.022
to+0.022
ido+0.022
old+0.021
Neg logit contributions
 ears-0.006
 wheels-0.006
 tail-0.006
 wings-0.005
ps-0.005
Token brand
Token ID4508
Feature activation+6.87e-02

Pos logit contributions
des+0.053
and+0.051
to+0.050
ido+0.049
old+0.048
Neg logit contributions
 ears-0.032
 tail-0.032
 wheels-0.031
 scales-0.030
 wings-0.030
Token new
Token ID649
Feature activation+1.73e-02

Pos logit contributions
des+0.142
bie+0.134
ido+0.132
and+0.129
to+0.127
Neg logit contributions
 scales-0.087
 end-0.087
 eyes-0.086
 wheels-0.085
 ears-0.082
Token pair
Token ID5166
Feature activation-8.17e-03

Pos logit contributions
des+0.036
and+0.035
to+0.035
ido+0.034
 inspection+0.033
Neg logit contributions
 ears-0.021
uders-0.021
ps-0.020
 wheels-0.020
ities-0.020
Token of
Token ID286
Feature activation+1.22e-02

Pos logit contributions
e+0.009
 wheels+0.009
 ears+0.009
 tail+0.009
ps+0.009
Neg logit contributions
des-0.018
to-0.017
bie-0.017
and-0.017
ido-0.017
Token shoes
Token ID10012
Feature activation-4.22e-03

Pos logit contributions
des+0.027
and+0.026
to+0.026
ido+0.025
 inspection+0.025
Neg logit contributions
 ears-0.014
 wheels-0.014
 tail-0.014
 scales-0.013
 wings-0.013
Token.
Token ID13
Feature activation+6.90e-03

Pos logit contributions
 ears+0.004
 wheels+0.004
 tail+0.003
ps+0.003
 wings+0.003
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.010
bie-0.010
Token family
Token ID1641
Feature activation+7.08e-03

Pos logit contributions
e+0.018
ps+0.018
uders+0.017
ates+0.017
ities+0.017
Neg logit contributions
and-0.032
des-0.031
bie-0.031
ido-0.031
to-0.030
Token sh
Token ID427
Feature activation+1.98e-02

Pos logit contributions
des+0.012
and+0.012
to+0.012
old+0.012
bie+0.011
Neg logit contributions
 ears-0.011
uders-0.011
 tail-0.011
ates-0.011
ities-0.011
Tokenoo
Token ID2238
Feature activation-1.25e-02

Pos logit contributions
des+0.045
and+0.044
to+0.043
ido+0.042
 inspection+0.042
Neg logit contributions
 ears-0.022
 wheels-0.020
 tail-0.020
ps-0.020
e-0.019
Tokened
Token ID276
Feature activation+1.24e-02

Pos logit contributions
 ears+0.018
 wheels+0.018
 tail+0.018
uders+0.018
 scales+0.018
Neg logit contributions
des-0.023
and-0.022
to-0.022
ido-0.021
 inspection-0.021
Token Wh
Token ID854
Feature activation+2.55e-02

Pos logit contributions
des+0.030
and+0.030
to+0.029
ido+0.028
 inspection+0.028
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.011
ps-0.010
uders-0.010
Tokenisk
Token ID1984
Feature activation+6.85e-02

Pos logit contributions
des+0.036
and+0.034
to+0.034
ido+0.032
old+0.031
Neg logit contributions
 ears-0.050
 wheels-0.049
 tail-0.049
 scales-0.047
ps-0.047
Tokeners
Token ID364
Feature activation-1.28e-02

Pos logit contributions
des+0.218
bie+0.214
 inspection+0.207
ido+0.207
to+0.207
Neg logit contributions
ps-0.020
e-0.015
ates-0.009
ters-0.004
per-0.002
Token away
Token ID1497
Feature activation-1.21e-03

Pos logit contributions
 ears+0.011
 wheels+0.011
uders+0.011
 scales+0.011
ates+0.011
Neg logit contributions
des-0.030
and-0.030
to-0.029
ido-0.029
old-0.028
Token and
Token ID290
Feature activation+5.48e-03

Pos logit contributions
 wheels+0.001
 ears+0.001
ates+0.001
 wings+0.001
uders+0.001
Neg logit contributions
and-0.003
 inspection-0.002
to-0.002
ido-0.002
des-0.002
Token told
Token ID1297
Feature activation+1.99e-02

Pos logit contributions
des+0.011
and+0.010
to+0.010
ido+0.010
 inspection+0.010
Neg logit contributions
 ears-0.008
 wheels-0.007
 tail-0.007
ps-0.007
 scales-0.007
Token Max
Token ID5436
Feature activation+2.77e-02

Pos logit contributions
des+0.039
and+0.037
to+0.036
ido+0.035
bie+0.035
Neg logit contributions
 ears-0.030
 wheels-0.029
 tail-0.028
ps-0.028
 scales-0.027
Token bag
Token ID6131
Feature activation+6.41e-03

Pos logit contributions
ities+0.011
uders+0.011
ps+0.011
ates+0.011
 ears+0.011
Neg logit contributions
des-0.017
and-0.016
to-0.016
bie-0.016
old-0.016
Token and
Token ID290
Feature activation-2.58e-03

Pos logit contributions
des+0.018
to+0.017
and+0.017
ido+0.017
bie+0.017
Neg logit contributions
 ears-0.004
 tail-0.003
 wheels-0.003
 wings-0.003
e-0.003
Token pulled
Token ID5954
Feature activation+2.08e-02

Pos logit contributions
 ears+0.003
 wheels+0.003
 tail+0.003
 wings+0.003
e+0.003
Neg logit contributions
des-0.005
and-0.005
ido-0.005
to-0.005
bie-0.005
Token out
Token ID503
Feature activation+1.34e-02

Pos logit contributions
des+0.051
and+0.048
ido+0.046
to+0.046
old+0.045
Neg logit contributions
 ears-0.022
 wheels-0.021
 tail-0.020
ps-0.020
e-0.020
Token a
Token ID257
Feature activation+2.76e-02

Pos logit contributions
des+0.035
and+0.034
to+0.034
ido+0.033
old+0.032
Neg logit contributions
 ears-0.011
 wheels-0.010
 tail-0.010
 wings-0.009
ps-0.009
Token brand
Token ID4508
Feature activation+6.80e-02

Pos logit contributions
des+0.061
and+0.060
to+0.058
ido+0.057
old+0.056
Neg logit contributions
 tail-0.034
 ears-0.034
 wheels-0.033
 scales-0.031
 front-0.031
Token new
Token ID649
Feature activation+2.02e-02

Pos logit contributions
des+0.143
bie+0.137
and+0.131
ido+0.129
to+0.129
Neg logit contributions
 end-0.085
 eyes-0.080
 wheels-0.080
 scales-0.080
 ears-0.076
Token doll
Token ID3654
Feature activation-1.11e-02

Pos logit contributions
des+0.044
and+0.042
to+0.041
ido+0.040
old+0.040
Neg logit contributions
 ears-0.025
 tail-0.024
 wheels-0.024
 scales-0.023
ps-0.023
Token for
Token ID329
Feature activation-3.68e-03

Pos logit contributions
 ears+0.013
uders+0.012
 wheels+0.012
 scales+0.012
ps+0.012
Neg logit contributions
des-0.024
and-0.023
to-0.023
ido-0.022
old-0.022
Token Sally
Token ID25737
Feature activation+2.56e-02

Pos logit contributions
 ears+0.005
 tail+0.005
 wheels+0.005
uders+0.005
 scales+0.005
Neg logit contributions
des-0.007
and-0.007
to-0.007
 inspection-0.007
ido-0.007
Token.
Token ID13
Feature activation+7.87e-03

Pos logit contributions
des+0.064
to+0.060
and+0.060
 inspection+0.059
ido+0.059
Neg logit contributions
 ears-0.024
 tail-0.022
e-0.021
 wheels-0.020
 wings-0.020
Token to
Token ID284
Feature activation+1.88e-02

Pos logit contributions
 ears+0.026
 wheels+0.025
 tail+0.025
ps+0.024
 scales+0.024
Neg logit contributions
des-0.043
and-0.041
to-0.040
ido-0.039
 inspection-0.039
Token unlock
Token ID12116
Feature activation-1.90e-02

Pos logit contributions
des+0.038
and+0.036
to+0.036
ido+0.035
 inspection+0.034
Neg logit contributions
 ears-0.026
 wheels-0.025
 tail-0.024
 scales-0.024
ps-0.024
Token,"
Token ID553
Feature activation+8.39e-03

Pos logit contributions
 ears+0.026
 tail+0.026
 wheels+0.025
ities+0.025
uders+0.025
Neg logit contributions
des-0.036
and-0.035
to-0.034
ido-0.033
 inspection-0.033
Token said
Token ID531
Feature activation+9.99e-03

Pos logit contributions
des+0.013
to+0.012
and+0.012
ido+0.012
 inspection+0.011
Neg logit contributions
 ears-0.015
 wheels-0.015
ps-0.015
 tail-0.015
uders-0.015
Token Wh
Token ID854
Feature activation+3.50e-02

Pos logit contributions
des+0.018
to+0.018
ido+0.017
and+0.017
old+0.017
Neg logit contributions
 ears-0.015
 wheels-0.014
ps-0.014
uders-0.014
ities-0.014
Tokenisk
Token ID1984
Feature activation+6.79e-02

Pos logit contributions
des+0.059
and+0.056
to+0.055
ido+0.053
 inspection+0.052
Neg logit contributions
 ears-0.060
 tail-0.058
 wheels-0.058
ps-0.056
uders-0.056
Tokeners
Token ID364
Feature activation+2.67e-03

Pos logit contributions
des+0.214
bie+0.207
ido+0.204
and+0.201
 inspection+0.201
Neg logit contributions
ps-0.022
e-0.016
ates-0.012
 scales-0.008
 tail-0.007
Token.
Token ID13
Feature activation+6.04e-03

Pos logit contributions
des+0.006
and+0.006
to+0.006
ido+0.006
 inspection+0.006
Neg logit contributions
 ears-0.003
 wheels-0.003
 tail-0.002
ps-0.002
uders-0.002
Token\n
Token ID198
Feature activation+8.16e-03

Pos logit contributions
des+0.011
and+0.011
to+0.010
ido+0.010
 inspection+0.010
Neg logit contributions
 ears-0.009
 tail-0.009
 wheels-0.009
ps-0.009
 scales-0.009
Token\n
Token ID198
Feature activation-9.67e-03

Pos logit contributions
des+0.017
bie+0.016
and+0.016
 inspection+0.015
ido+0.015
Neg logit contributions
 ears-0.010
 tail-0.010
 wheels-0.010
e-0.010
ates-0.010
Token"
Token ID1
Feature activation+1.58e-02

Pos logit contributions
 ears+0.015
ps+0.014
uders+0.014
ities+0.014
 wheels+0.014
Neg logit contributions
des-0.018
and-0.017
to-0.017
ido-0.016
old-0.016
Token In
Token ID554
Feature activation-2.69e-02

Pos logit contributions
des+0.029
ido+0.027
and+0.027
 inspection+0.026
old+0.025
Neg logit contributions
 ears-0.018
 tail-0.017
 wheels-0.017
bed-0.016
 eyes-0.016
Token fact
Token ID1109
Feature activation-2.23e-02

Pos logit contributions
uders+0.034
ities+0.034
 ears+0.034
ider+0.033
 scales+0.032
Neg logit contributions
des-0.051
bie-0.050
 inspection-0.050
ido-0.050
and-0.049
Token,
Token ID11
Feature activation+4.77e-03

Pos logit contributions
 ears+0.016
 wheels+0.015
 tail+0.015
ps+0.014
 wings+0.014
Neg logit contributions
des-0.059
and-0.058
to-0.057
ido-0.056
old-0.055
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
des+0.009
and+0.008
to+0.008
ido+0.008
old+0.008
Neg logit contributions
 ears-0.008
 tail-0.008
 wheels-0.007
 wings-0.007
 scales-0.007
Token special
Token ID2041
Feature activation+5.19e-03

Pos logit contributions
uders+0.018
ates+0.017
ps+0.017
e+0.017
 ears+0.016
Neg logit contributions
des-0.029
and-0.027
to-0.026
ido-0.026
yes-0.026
Token bal
Token ID3652
Feature activation+6.76e-02

Pos logit contributions
des+0.011
and+0.010
to+0.010
ido+0.010
old+0.010
Neg logit contributions
 ears-0.007
 wheels-0.006
 tail-0.006
ps-0.006
uders-0.006
Tokenm
Token ID76
Feature activation-4.05e-03

Pos logit contributions
des+0.114
 inspection+0.100
rig+0.100
 proper+0.098
 sorts+0.097
Neg logit contributions
ider-0.112
ates-0.109
ters-0.108
bed-0.107
ities-0.104
Token made
Token ID925
Feature activation+2.59e-02

Pos logit contributions
 ears+0.006
 wheels+0.006
 tail+0.005
 scales+0.005
uders+0.005
Neg logit contributions
des-0.008
and-0.008
to-0.007
ido-0.007
bie-0.007
Token her
Token ID607
Feature activation+6.70e-03

Pos logit contributions
des+0.055
ido+0.051
and+0.051
to+0.049
bie+0.048
Neg logit contributions
 ears-0.034
 wings-0.033
 tail-0.033
 wheels-0.032
e-0.032
Token ankle
Token ID19847
Feature activation+6.81e-03

Pos logit contributions
des+0.014
and+0.014
to+0.014
ido+0.013
 inspection+0.013
Neg logit contributions
 ears-0.009
 tail-0.008
 wheels-0.008
 wings-0.008
e-0.008
Token look
Token ID804
Feature activation+2.71e-02

Pos logit contributions
des+0.014
and+0.013
to+0.013
ido+0.013
 inspection+0.013
Neg logit contributions
 ears-0.009
 wheels-0.009
 tail-0.009
uders-0.008
ps-0.008
Token happy
Token ID3772
Feature activation-1.47e-02

Pos logit contributions
des+0.015
to+0.014
 inspection+0.014
and+0.014
ido+0.014
Neg logit contributions
 ears-0.010
 tail-0.010
e-0.010
 wheels-0.010
 voice-0.010
Token that
Token ID326
Feature activation+1.94e-02

Pos logit contributions
 ears+0.018
 wheels+0.018
 tail+0.017
ps+0.017
uders+0.017
Neg logit contributions
des-0.031
and-0.031
to-0.030
 inspection-0.029
old-0.029
Token she
Token ID673
Feature activation+8.27e-03

Pos logit contributions
des+0.024
and+0.023
to+0.022
ido+0.021
 inspection+0.021
Neg logit contributions
 ears-0.040
ps-0.040
 scales-0.040
uders-0.040
ates-0.040
Token hugged
Token ID41130
Feature activation+9.53e-03

Pos logit contributions
des+0.013
to+0.013
and+0.012
yes+0.012
old+0.012
Neg logit contributions
ities-0.014
 ears-0.014
ates-0.014
 wheels-0.014
uders-0.014
Token Wh
Token ID854
Feature activation+2.72e-02

Pos logit contributions
des+0.019
and+0.019
to+0.018
ido+0.018
 inspection+0.018
Neg logit contributions
 ears-0.013
 wheels-0.012
 tail-0.012
ps-0.012
 scales-0.012
Tokenisk
Token ID1984
Feature activation+6.75e-02

Pos logit contributions
des+0.039
and+0.037
to+0.036
ido+0.034
 inspection+0.034
Neg logit contributions
 ears-0.054
 wheels-0.052
 tail-0.052
ps-0.051
 scales-0.051
Tokeners
Token ID364
Feature activation+5.60e-03

Pos logit contributions
des+0.213
bie+0.208
ido+0.204
 inspection+0.203
to+0.201
Neg logit contributions
ps-0.018
e-0.015
ates-0.010
ters-0.003
 scales-0.003
Token.
Token ID13
Feature activation-2.30e-04

Pos logit contributions
des+0.013
and+0.013
to+0.012
ido+0.012
old+0.012
Neg logit contributions
uders-0.006
 ears-0.006
 wheels-0.006
ps-0.006
 tail-0.005
Token "
Token ID366
Feature activation+3.40e-02

Pos logit contributions
 ears+0.000
 wheels+0.000
 tail+0.000
 wings+0.000
ps+0.000
Neg logit contributions
des-0.000
and-0.000
to-0.000
ido-0.000
old-0.000
TokenThank
Token ID10449
Feature activation-1.43e-02

Pos logit contributions
des+0.067
and+0.063
to+0.062
ido+0.059
yes+0.059
Neg logit contributions
 ears-0.051
 wheels-0.048
uders-0.047
ps-0.047
 tail-0.047
Token you
Token ID345
Feature activation-1.21e-02

Pos logit contributions
 ears+0.022
 wheels+0.021
 tail+0.021
uders+0.021
ps+0.020
Neg logit contributions
des-0.027
and-0.026
to-0.026
ido-0.024
old-0.024
Token wise
Token ID10787
Feature activation+1.67e-02

Pos logit contributions
 ears+0.012
ps+0.011
uders+0.011
e+0.011
 wheels+0.011
Neg logit contributions
des-0.026
and-0.025
to-0.025
ido-0.024
old-0.024
Token old
Token ID1468
Feature activation+1.06e-02

Pos logit contributions
des+0.037
and+0.035
to+0.035
 inspection+0.033
ido+0.033
Neg logit contributions
 ears-0.021
 tail-0.020
 wheels-0.020
 scales-0.019
 wings-0.019
Token cat
Token ID3797
Feature activation+8.25e-03

Pos logit contributions
des+0.024
and+0.023
to+0.023
ido+0.022
old+0.022
Neg logit contributions
 ears-0.012
 wheels-0.011
 tail-0.011
ps-0.011
uders-0.011
Token named
Token ID3706
Feature activation+9.41e-03

Pos logit contributions
des+0.020
and+0.020
to+0.020
ido+0.019
old+0.019
Neg logit contributions
 ears-0.007
 wheels-0.007
 tail-0.007
ps-0.007
uders-0.007
Token Wh
Token ID854
Feature activation+3.46e-02

Pos logit contributions
des+0.020
and+0.019
to+0.018
yes+0.018
ido+0.018
Neg logit contributions
 ears-0.013
 tail-0.013
 wheels-0.012
 wings-0.012
 scales-0.012
Tokenisk
Token ID1984
Feature activation+6.73e-02

Pos logit contributions
des+0.055
and+0.052
to+0.050
 inspection+0.048
ido+0.048
Neg logit contributions
 ears-0.062
 tail-0.061
ps-0.061
e-0.060
 wheels-0.060
Tokeners
Token ID364
Feature activation-3.88e-03

Pos logit contributions
des+0.215
bie+0.212
ido+0.205
and+0.204
yes+0.204
Neg logit contributions
ps-0.019
e-0.011
ates-0.009
ters-0.002
 scales-0.001
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
 ears+0.003
 tail+0.003
 wheels+0.003
ps+0.003
 wings+0.003
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.009
 inspection-0.009
Token Wh
Token ID854
Feature activation+3.24e-02

Pos logit contributions
ps+0.009
 ears+0.009
uders+0.009
e+0.009
ities+0.009
Neg logit contributions
des-0.012
to-0.012
and-0.012
ido-0.012
 inspection-0.011
Tokenisk
Token ID1984
Feature activation+6.24e-02

Pos logit contributions
des+0.045
and+0.043
to+0.042
ido+0.040
old+0.039
Neg logit contributions
 ears-0.065
 wheels-0.063
 tail-0.063
 scales-0.061
ps-0.061
Tokeners
Token ID364
Feature activation-8.40e-03

Pos logit contributions
des+0.191
bie+0.182
to+0.182
and+0.182
ido+0.181
Neg logit contributions
ps-0.022
e-0.018
ates-0.017
 tail-0.014
 wheels-0.013
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
des+0.024
and+0.023
to+0.023
ido+0.022
 inspection+0.022
Neg logit contributions
 ears-0.010
 wheels-0.010
 tail-0.010
ps-0.009
uders-0.009
Token garden
Token ID11376
Feature activation+1.48e-02

Pos logit contributions
des+0.021
and+0.021
to+0.021
bie+0.020
old+0.020
Neg logit contributions
ps-0.015
ities-0.015
ates-0.015
 ears-0.015
 wheels-0.015
Token,
Token ID11
Feature activation+2.31e-03

Pos logit contributions
des+0.036
and+0.034
to+0.034
ido+0.033
 inspection+0.033
Neg logit contributions
 ears-0.014
 tail-0.013
 wheels-0.013
ps-0.013
 scales-0.013
Token showing
Token ID4478
Feature activation+1.14e-02

Pos logit contributions
des+0.005
and+0.005
to+0.005
ido+0.005
old+0.005
Neg logit contributions
 ears-0.003
 tail-0.002
 wheels-0.002
ps-0.002
 wings-0.002
Token it
Token ID340
Feature activation+1.82e-02

Pos logit contributions
des+0.026
ido+0.025
bie+0.024
and+0.024
ll+0.024
Neg logit contributions
 wings-0.014
 ears-0.014
 scales-0.013
ps-0.013
 tail-0.013
Token all
Token ID477
Feature activation+6.72e-02

Pos logit contributions
des+0.032
ido+0.031
and+0.029
bie+0.029
to+0.029
Neg logit contributions
 ears-0.030
 wings-0.029
 tail-0.029
 feathers-0.028
ps-0.028
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
des+0.165
ido+0.160
to+0.160
yes+0.158
and+0.157
Neg logit contributions
 tail-0.055
 ears-0.054
ps-0.053
 wings-0.053
 scales-0.051
Token pretty
Token ID2495
Feature activation+6.59e-03

Pos logit contributions
des+0.026
to+0.025
ido+0.025
and+0.024
old+0.023
Neg logit contributions
 ears-0.015
 tail-0.015
 wheels-0.014
 wings-0.014
 scales-0.014
Token flowers
Token ID12734
Feature activation+1.87e-02

Pos logit contributions
des+0.015
and+0.015
to+0.015
 inspection+0.015
old+0.014
Neg logit contributions
 ears-0.006
 wheels-0.006
uders-0.006
ps-0.006
e-0.006
Token and
Token ID290
Feature activation+5.78e-03

Pos logit contributions
des+0.045
and+0.044
to+0.043
ido+0.043
old+0.042
Neg logit contributions
 ears-0.018
 tail-0.017
 wheels-0.016
 wings-0.016
ps-0.016
Token y
Token ID331
Feature activation+1.52e-02

Pos logit contributions
des+0.014
and+0.013
to+0.013
ido+0.013
old+0.012
Neg logit contributions
 ears-0.006
 tail-0.006
 wheels-0.006
 wings-0.006
 scales-0.006
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
des+0.016
and+0.015
to+0.015
ido+0.015
 inspection+0.015
Neg logit contributions
 ears-0.006
 wheels-0.005
 tail-0.005
ps-0.005
 scales-0.005
Token The
Token ID383
Feature activation-1.35e-02

Pos logit contributions
ps+0.022
ates+0.022
ities+0.022
uders+0.022
e+0.022
Neg logit contributions
des-0.024
to-0.024
and-0.024
 inspection-0.023
ido-0.023
Token owl
Token ID39610
Feature activation+1.85e-02

Pos logit contributions
uders+0.016
ates+0.016
ps+0.016
 ears+0.016
e+0.016
Neg logit contributions
des-0.028
and-0.027
to-0.026
old-0.026
 inspection-0.025
Token told
Token ID1297
Feature activation+1.15e-03

Pos logit contributions
des+0.028
to+0.027
old+0.026
and+0.026
 inspection+0.025
Neg logit contributions
 ears-0.033
ities-0.033
ps-0.033
 wheels-0.033
 tail-0.032
Token Wh
Token ID854
Feature activation+3.46e-02

Pos logit contributions
des+0.002
to+0.002
and+0.002
ido+0.002
 inspection+0.002
Neg logit contributions
 ears-0.002
 tail-0.002
ates-0.002
ities-0.002
uders-0.002
Tokenisk
Token ID1984
Feature activation+6.70e-02

Pos logit contributions
des+0.053
and+0.049
to+0.048
 inspection+0.047
ido+0.046
Neg logit contributions
 ears-0.063
 tail-0.062
 wheels-0.061
ps-0.061
uders-0.060
Tokeners
Token ID364
Feature activation+2.06e-02

Pos logit contributions
des+0.206
bie+0.199
ido+0.197
to+0.193
yes+0.193
Neg logit contributions
ps-0.021
e-0.018
ates-0.016
 scales-0.013
 tail-0.011
Token,
Token ID11
Feature activation-1.22e-02

Pos logit contributions
des+0.051
and+0.048
to+0.048
ido+0.047
bie+0.046
Neg logit contributions
 ears-0.020
 wheels-0.018
 tail-0.018
 wings-0.018
 scales-0.017
Token "
Token ID366
Feature activation+2.50e-02

Pos logit contributions
 ears+0.017
 tail+0.017
 wheels+0.016
 scales+0.016
 wings+0.016
Neg logit contributions
des-0.025
and-0.023
to-0.023
ido-0.022
old-0.022
TokenTo
Token ID2514
Feature activation-3.10e-02

Pos logit contributions
des+0.044
and+0.043
to+0.042
ido+0.040
old+0.039
Neg logit contributions
 ears-0.041
 tail-0.040
 wheels-0.039
ps-0.039
 scales-0.039
Token be
Token ID307
Feature activation-9.95e-03

Pos logit contributions
 ears+0.045
 wheels+0.045
 tail+0.043
ities+0.042
 scales+0.042
Neg logit contributions
des-0.058
and-0.055
to-0.055
ido-0.054
old-0.053
Token loved
Token ID6151
Feature activation+2.43e-02

Pos logit contributions
des+0.029
and+0.028
to+0.027
ido+0.026
 inspection+0.026
Neg logit contributions
 ears-0.021
 wheels-0.020
 tail-0.020
ps-0.020
 scales-0.020
Token the
Token ID262
Feature activation-4.86e-03

Pos logit contributions
des+0.054
ido+0.050
and+0.050
to+0.049
old+0.048
Neg logit contributions
 ears-0.031
 wheels-0.029
 tail-0.028
 scales-0.028
 wings-0.027
Token feeling
Token ID4203
Feature activation+1.18e-02

Pos logit contributions
 ears+0.007
 wheels+0.007
 tail+0.007
ps+0.006
 scales+0.006
Neg logit contributions
des-0.010
and-0.009
to-0.009
ido-0.009
 inspection-0.009
Token of
Token ID286
Feature activation+1.22e-02

Pos logit contributions
des+0.024
and+0.023
to+0.023
ido+0.022
 inspection+0.022
Neg logit contributions
 ears-0.016
 wheels-0.015
 tail-0.015
e-0.015
ps-0.015
Token making
Token ID1642
Feature activation+7.37e-03

Pos logit contributions
des+0.026
and+0.025
to+0.025
ido+0.024
old+0.024
Neg logit contributions
 ears-0.015
 tail-0.014
 wheels-0.014
uders-0.014
ities-0.013
Token someone
Token ID2130
Feature activation+6.69e-02

Pos logit contributions
des+0.017
ido+0.016
and+0.016
to+0.016
 inspection+0.016
Neg logit contributions
 ears-0.009
 wheels-0.008
 tail-0.008
 wings-0.008
 scales-0.008
Token happy
Token ID3772
Feature activation-1.11e-02

Pos logit contributions
des+0.124
ido+0.120
to+0.118
and+0.117
 inspection+0.115
Neg logit contributions
 ears-0.101
 wheels-0.095
 tail-0.095
 wings-0.094
 feathers-0.093
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
 ears+0.013
uders+0.013
 wheels+0.013
 tail+0.013
ps+0.013
Neg logit contributions
and-0.023
des-0.023
to-0.022
yes-0.021
 inspection-0.021
Token He
Token ID679
Feature activation+3.08e-04

Pos logit contributions
des+0.004
and+0.003
to+0.003
 inspection+0.003
ido+0.003
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
 scales-0.002
 skies-0.002
Token was
Token ID373
Feature activation-1.64e-03

Pos logit contributions
des+0.001
to+0.001
and+0.001
old+0.000
bie+0.000
Neg logit contributions
 ears-0.000
 wheels-0.000
 tail-0.000
ities-0.000
uders-0.000
Token glad
Token ID9675
Feature activation-3.13e-03

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
uders+0.002
ities+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
old-0.003
Token the
Token ID262
Feature activation+9.96e-04

Pos logit contributions
des+0.014
to+0.013
and+0.013
ido+0.013
bie+0.012
Neg logit contributions
 ears-0.007
 wheels-0.006
 tail-0.006
 wings-0.006
 scales-0.006
Token kitten
Token ID42143
Feature activation+1.79e-02

Pos logit contributions
des+0.002
and+0.002
to+0.002
ido+0.002
old+0.002
Neg logit contributions
 ears-0.001
ps-0.001
uders-0.001
 wheels-0.001
 tail-0.001
Token,
Token ID11
Feature activation+5.56e-04

Pos logit contributions
des+0.041
and+0.041
to+0.040
bie+0.039
old+0.039
Neg logit contributions
ps-0.017
 wheels-0.017
 ears-0.017
uders-0.017
ities-0.017
Token showing
Token ID4478
Feature activation+1.08e-02

Pos logit contributions
des+0.001
and+0.001
to+0.001
ido+0.001
old+0.001
Neg logit contributions
 ears-0.001
 tail-0.001
 wheels-0.001
 wings-0.001
 scales-0.001
Token it
Token ID340
Feature activation+1.84e-02

Pos logit contributions
des+0.025
and+0.023
ido+0.023
bie+0.023
ll+0.023
Neg logit contributions
 ears-0.014
 wings-0.013
 wheels-0.013
 tail-0.012
 scales-0.012
Token all
Token ID477
Feature activation+6.68e-02

Pos logit contributions
des+0.031
ido+0.029
and+0.029
to+0.028
bie+0.028
Neg logit contributions
 ears-0.031
 tail-0.030
 wings-0.030
 wheels-0.030
 scales-0.029
Token of
Token ID286
Feature activation+3.96e-02

Pos logit contributions
des+0.165
to+0.158
ido+0.157
yes+0.155
and+0.154
Neg logit contributions
 ears-0.060
 tail-0.059
 wings-0.057
ps-0.056
e-0.055
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
des+0.104
and+0.093
to+0.092
ido+0.092
yes+0.090
Neg logit contributions
 ears-0.038
 tail-0.036
 scales-0.035
e-0.034
 wheels-0.034
Token new
Token ID649
Feature activation+1.29e-02

Pos logit contributions
des+0.028
to+0.027
and+0.027
ido+0.027
 inspection+0.026
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.011
 wings-0.010
 scales-0.010
Token places
Token ID4113
Feature activation+3.13e-02

Pos logit contributions
des+0.031
and+0.030
to+0.029
ido+0.029
 inspection+0.028
Neg logit contributions
 ears-0.013
 wheels-0.012
 tail-0.012
ps-0.012
 scales-0.011
Token he
Token ID339
Feature activation+9.62e-03

Pos logit contributions
des+0.066
and+0.064
to+0.063
ido+0.062
bie+0.061
Neg logit contributions
 ears-0.039
 wheels-0.037
 tail-0.037
e-0.036
 wings-0.036
Token she
Token ID673
Feature activation-4.70e-03

Pos logit contributions
des+0.075
to+0.069
and+0.069
bie+0.067
ido+0.065
Neg logit contributions
 wheels-0.061
 tail-0.060
 ears-0.060
 wings-0.058
 clouds-0.057
Token could
Token ID714
Feature activation+1.92e-02

Pos logit contributions
 ears+0.007
 wheels+0.007
 tail+0.007
 wings+0.007
 scales+0.007
Neg logit contributions
des-0.009
and-0.008
ido-0.008
to-0.008
old-0.008
Token and
Token ID290
Feature activation-3.30e-03

Pos logit contributions
des+0.052
and+0.049
to+0.049
bie+0.048
ido+0.048
Neg logit contributions
 ears-0.014
 tail-0.013
 wheels-0.012
ps-0.012
 wings-0.012
Token pushed
Token ID7121
Feature activation+1.97e-02

Pos logit contributions
 ears+0.004
 wheels+0.004
 tail+0.004
e+0.003
 wings+0.003
Neg logit contributions
des-0.007
and-0.007
to-0.007
ido-0.007
bie-0.007
Token Wh
Token ID854
Feature activation+3.20e-02

Pos logit contributions
des+0.043
and+0.041
to+0.040
ido+0.039
old+0.039
Neg logit contributions
 ears-0.024
 wheels-0.023
 tail-0.023
ps-0.022
 scales-0.022
Tokenisk
Token ID1984
Feature activation+6.65e-02

Pos logit contributions
des+0.050
and+0.047
to+0.046
 inspection+0.045
ido+0.044
Neg logit contributions
 ears-0.058
 tail-0.056
 wheels-0.056
ps-0.056
e-0.055
Tokeners
Token ID364
Feature activation-2.57e-03

Pos logit contributions
des+0.206
bie+0.203
ido+0.196
yes+0.192
 inspection+0.192
Neg logit contributions
ps-0.022
e-0.019
ates-0.014
 tail-0.009
 wheels-0.009
Token out
Token ID503
Feature activation+7.91e-03

Pos logit contributions
 ears+0.003
 tail+0.003
 wheels+0.003
 wings+0.003
ps+0.003
Neg logit contributions
des-0.006
to-0.005
and-0.005
bie-0.005
ido-0.005
Token of
Token ID286
Feature activation+8.24e-03

Pos logit contributions
des+0.016
and+0.015
to+0.015
ido+0.014
 inspection+0.014
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.011
ps-0.010
 scales-0.010
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
des+0.021
and+0.020
to+0.020
ido+0.019
old+0.019
Neg logit contributions
 ears-0.007
 wheels-0.007
 tail-0.007
ps-0.007
 scales-0.006
Token way
Token ID835
Feature activation+2.08e-02

Pos logit contributions
des+0.022
and+0.022
to+0.021
ido+0.020
old+0.020
Neg logit contributions
 ears-0.017
 wheels-0.017
 tail-0.016
 scales-0.016
 wings-0.016
Token a
Token ID257
Feature activation+2.04e-02

Pos logit contributions
des+0.025
and+0.024
ido+0.024
ll+0.023
bie+0.023
Neg logit contributions
 wheels-0.013
 ears-0.012
e-0.012
 wings-0.012
 tail-0.012
Token cloth
Token ID16270
Feature activation+5.74e-03

Pos logit contributions
des+0.047
and+0.046
ll+0.044
old+0.043
ido+0.042
Neg logit contributions
 front-0.023
 tail-0.022
 voice-0.021
 wings-0.020
 ears-0.019
Token and
Token ID290
Feature activation-5.06e-03

Pos logit contributions
des+0.015
and+0.015
to+0.014
 inspection+0.014
ido+0.014
Neg logit contributions
 ears-0.004
 wheels-0.004
 tail-0.004
 wings-0.004
ps-0.004
Token wiped
Token ID21122
Feature activation+2.69e-02

Pos logit contributions
 ears+0.006
 wheels+0.006
 tail+0.006
 wings+0.006
 scales+0.005
Neg logit contributions
des-0.011
and-0.011
ido-0.011
to-0.011
 inspection-0.010
Token it
Token ID340
Feature activation+1.04e-02

Pos logit contributions
des+0.065
and+0.063
ido+0.060
to+0.059
ll+0.059
Neg logit contributions
 ears-0.030
 wheels-0.030
 tail-0.027
e-0.027
ps-0.027
Token all
Token ID477
Feature activation+6.61e-02

Pos logit contributions
des+0.022
and+0.021
to+0.020
ido+0.020
old+0.020
Neg logit contributions
 ears-0.014
 wheels-0.013
 tail-0.013
ps-0.013
 wings-0.012
Token over
Token ID625
Feature activation+3.89e-02

Pos logit contributions
des+0.165
to+0.164
and+0.163
ido+0.162
 inspection+0.160
Neg logit contributions
e-0.053
 end-0.049
 middle-0.049
 with-0.048
ps-0.048
Token.
Token ID13
Feature activation+5.54e-03

Pos logit contributions
des+0.100
ido+0.095
to+0.094
rig+0.094
and+0.091
Neg logit contributions
 wheels-0.034
 ears-0.033
e-0.031
 front-0.031
 tail-0.029
Token Then
Token ID3244
Feature activation-1.98e-02

Pos logit contributions
des+0.013
and+0.012
ido+0.012
 inspection+0.011
ll+0.011
Neg logit contributions
 wheels-0.007
 ears-0.007
 tail-0.006
 skies-0.006
bed-0.006
Token,
Token ID11
Feature activation-3.45e-03

Pos logit contributions
 ears+0.031
 wheels+0.030
 tail+0.030
 wings+0.029
 scales+0.029
Neg logit contributions
des-0.036
and-0.035
to-0.034
ido-0.033
old-0.032
Token she
Token ID673
Feature activation-1.31e-02

Pos logit contributions
 ears+0.005
 wheels+0.005
 tail+0.005
 wings+0.005
 feathers+0.005
Neg logit contributions
des-0.007
and-0.006
to-0.006
ido-0.006
old-0.006
Token flew
Token ID13112
Feature activation-4.95e-04

Pos logit contributions
 ears+0.004
 tail+0.004
e+0.004
 wheels+0.004
 wings+0.004
Neg logit contributions
des-0.013
and-0.013
to-0.012
ido-0.012
 inspection-0.012
Token down
Token ID866
Feature activation-1.73e-03

Pos logit contributions
 ears+0.001
 wheels+0.001
 tail+0.001
ps+0.001
 scales+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
 inspection-0.001
Token and
Token ID290
Feature activation-1.76e-03

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
ps+0.002
 scales+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
 inspection-0.003
Token scared
Token ID12008
Feature activation+6.52e-03

Pos logit contributions
 ears+0.002
 tail+0.002
 wings+0.002
 wheels+0.002
 feathers+0.002
Neg logit contributions
des-0.004
and-0.004
ido-0.004
to-0.004
old-0.004
Token Wh
Token ID854
Feature activation+3.64e-02

Pos logit contributions
des+0.014
and+0.013
to+0.013
ido+0.013
 inspection+0.013
Neg logit contributions
 ears-0.008
 wheels-0.008
 tail-0.008
ps-0.008
 scales-0.008
Tokenisk
Token ID1984
Feature activation+6.61e-02

Pos logit contributions
des+0.055
and+0.052
to+0.051
 inspection+0.049
ido+0.048
Neg logit contributions
 ears-0.067
 tail-0.066
ps-0.065
 wheels-0.065
e-0.064
Tokeners
Token ID364
Feature activation-6.00e-03

Pos logit contributions
des+0.201
bie+0.197
to+0.191
ido+0.190
and+0.190
Neg logit contributions
ps-0.021
e-0.020
ates-0.015
 tail-0.013
uders-0.011
Token away
Token ID1497
Feature activation-3.21e-03

Pos logit contributions
 ears+0.006
 tail+0.006
 wheels+0.005
ps+0.005
 wings+0.005
Neg logit contributions
des-0.014
and-0.014
to-0.014
ido-0.013
bie-0.013
Token.
Token ID13
Feature activation-4.20e-03

Pos logit contributions
 ears+0.004
 wheels+0.003
ates+0.003
 tail+0.003
 scales+0.003
Neg logit contributions
des-0.007
and-0.007
to-0.007
 inspection-0.007
ido-0.007
Token
Token ID198
Feature activation-2.80e-03

Pos logit contributions
 ears+0.006
 tail+0.006
 wheels+0.006
 wings+0.005
uders+0.005
Neg logit contributions
des-0.008
and-0.008
to-0.008
ido-0.008
old-0.007
Token
Token ID198
Feature activation-2.30e-02

Pos logit contributions
 tail+0.003
e+0.003
 middle+0.002
 front+0.002
 wheels+0.002
Neg logit contributions
des-0.007
bie-0.006
 inspection-0.006
ido-0.006
ability-0.006
Token that
Token ID326
Feature activation+3.16e-02

Pos logit contributions
des+0.009
and+0.008
to+0.008
ido+0.008
bie+0.008
Neg logit contributions
 ears-0.006
 tail-0.005
 wheels-0.005
 wings-0.005
 scales-0.005
Token her
Token ID607
Feature activation-6.93e-03

Pos logit contributions
des+0.058
and+0.055
to+0.054
ido+0.053
 inspection+0.052
Neg logit contributions
 ears-0.049
 wheels-0.048
 tail-0.047
 wings-0.046
ps-0.046
Token toys
Token ID14958
Feature activation+1.79e-02

Pos logit contributions
e+0.009
uders+0.009
ps+0.008
ates+0.008
ities+0.008
Neg logit contributions
des-0.013
and-0.013
old-0.012
yes-0.012
bie-0.012
Token would
Token ID561
Feature activation+1.45e-02

Pos logit contributions
des+0.032
and+0.031
to+0.030
ido+0.029
old+0.029
Neg logit contributions
 ears-0.028
 tail-0.027
 wheels-0.027
uders-0.026
 scales-0.026
Token make
Token ID787
Feature activation+1.54e-02

Pos logit contributions
des+0.030
and+0.028
 inspection+0.027
to+0.027
yes+0.027
Neg logit contributions
 ears-0.020
ates-0.019
uders-0.019
ities-0.018
 wheels-0.018
Token someone
Token ID2130
Feature activation+6.60e-02

Pos logit contributions
des+0.034
ido+0.032
and+0.032
to+0.032
bie+0.031
Neg logit contributions
 ears-0.019
 wheels-0.018
 tail-0.017
 wings-0.017
ps-0.017
Token else
Token ID2073
Feature activation+4.47e-02

Pos logit contributions
des+0.122
to+0.118
ido+0.117
and+0.116
 inspection+0.113
Neg logit contributions
 ears-0.098
 wheels-0.094
 tail-0.094
 wings-0.092
 feathers-0.091
Token smile
Token ID8212
Feature activation-7.15e-03

Pos logit contributions
des+0.087
and+0.083
to+0.082
ido+0.081
 inspection+0.079
Neg logit contributions
 ears-0.064
 wheels-0.061
 tail-0.061
ps-0.059
 scales-0.059
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
 ears+0.008
 wheels+0.007
uders+0.007
 scales+0.007
 tail+0.007
Neg logit contributions
des-0.016
and-0.016
to-0.016
ido-0.015
bie-0.015
Token
Token ID198
Feature activation+1.20e-02

Pos logit contributions
des+0.045
ido+0.041
 inspection+0.041
and+0.040
bie+0.040
Neg logit contributions
 ears-0.019
 tail-0.018
bed-0.017
 middle-0.016
 wheels-0.016
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
des+0.030
 inspection+0.028
bie+0.028
ido+0.027
ait+0.027
Neg logit contributions
e-0.010
 middle-0.009
 end-0.009
 tail-0.009
 ears-0.008
Token
Token ID198
Feature activation-3.94e-03

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
uders+0.002
e+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.002
 inspection-0.002
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
 front+0.004
 ears+0.004
 wheels+0.004
 middle+0.004
e+0.004
Neg logit contributions
des-0.010
yes-0.008
ido-0.008
 inspection-0.008
to-0.008
Token"
Token ID1
Feature activation+7.55e-03

Pos logit contributions
 ears+0.023
 wheels+0.023
 tail+0.023
e+0.022
uders+0.021
Neg logit contributions
des-0.047
and-0.044
to-0.044
 inspection-0.043
ido-0.043
TokenWhat
Token ID2061
Feature activation+1.68e-02

Pos logit contributions
des+0.011
to+0.011
and+0.011
yes+0.010
 inspection+0.010
Neg logit contributions
 ears-0.014
 tail-0.014
 wings-0.014
ates-0.013
 scales-0.013
Token is
Token ID318
Feature activation-7.33e-03

Pos logit contributions
des+0.039
and+0.038
to+0.037
ido+0.037
 inspection+0.036
Neg logit contributions
 ears-0.018
 wheels-0.017
 tail-0.017
e-0.016
uders-0.016
Token under
Token ID739
Feature activation-7.18e-02

Pos logit contributions
 ears+0.011
ities+0.011
 tail+0.011
ates+0.010
ps+0.010
Neg logit contributions
des-0.012
to-0.012
 inspection-0.012
ido-0.012
 Su-0.012
Token the
Token ID262
Feature activation+4.02e-03

Pos logit contributions
ider+0.117
ities+0.115
ates+0.106
 ears+0.103
bed+0.101
Neg logit contributions
 inspection-0.117
 Su-0.116
ido-0.110
old-0.103
des-0.103
Token fence
Token ID13990
Feature activation-3.26e-02

Pos logit contributions
des+0.008
and+0.008
to+0.008
ido+0.008
 inspection+0.007
Neg logit contributions
 ears-0.005
 wheels-0.005
ities-0.005
 tail-0.005
ates-0.005
Token?"
Token ID1701
Feature activation-2.35e-02

Pos logit contributions
 ears+0.036
 tail+0.034
ities+0.034
ps+0.033
ates+0.033
Neg logit contributions
des-0.072
and-0.071
to-0.069
ido-0.069
 inspection-0.068
Token Tom
Token ID4186
Feature activation+3.92e-03

Pos logit contributions
ates+0.041
ities+0.041
ps+0.040
 tail+0.039
 scales+0.039
Neg logit contributions
 inspection-0.035
old-0.035
to-0.034
ido-0.034
 Su-0.034
Token asks
Token ID7893
Feature activation+2.19e-02

Pos logit contributions
to+0.006
and+0.006
old+0.006
des+0.006
bie+0.006
Neg logit contributions
ates-0.006
 tail-0.006
 wheels-0.006
uders-0.006
e-0.006
Token says
Token ID1139
Feature activation+1.21e-02

Pos logit contributions
des+0.037
and+0.036
to+0.035
ido+0.034
 inspection+0.034
Neg logit contributions
 ears-0.025
 wheels-0.024
ps-0.024
 tail-0.023
 wings-0.023
Token "
Token ID366
Feature activation+1.85e-02

Pos logit contributions
des+0.026
and+0.025
to+0.025
ido+0.024
 inspection+0.024
Neg logit contributions
 ears-0.015
 wheels-0.014
 tail-0.014
ps-0.014
 scales-0.013
Tokenwhat
Token ID10919
Feature activation+1.43e-02

Pos logit contributions
des+0.032
and+0.031
to+0.031
ido+0.029
 inspection+0.029
Neg logit contributions
 ears-0.030
 tail-0.029
 wheels-0.029
ps-0.028
 scales-0.028
Token is
Token ID318
Feature activation-1.33e-02

Pos logit contributions
des+0.034
and+0.032
 inspection+0.032
ido+0.032
to+0.032
Neg logit contributions
 ears-0.015
e-0.015
 tail-0.014
 wheels-0.014
ps-0.013
Token going
Token ID1016
Feature activation-2.18e-02

Pos logit contributions
 ears+0.019
 tail+0.018
ities+0.017
 wheels+0.017
 wings+0.017
Neg logit contributions
des-0.025
to-0.025
ido-0.025
and-0.024
old-0.024
Token on
Token ID319
Feature activation-7.11e-02

Pos logit contributions
 ears+0.035
 scales+0.033
ities+0.033
 tail+0.032
 tails+0.032
Neg logit contributions
to-0.040
and-0.038
ido-0.038
des-0.037
yes-0.036
Token here
Token ID994
Feature activation-6.01e-02

Pos logit contributions
 ears+0.108
 feathers+0.104
 tail+0.103
 wheels+0.103
 scales+0.103
Neg logit contributions
to-0.122
ido-0.119
des-0.113
 Su-0.110
and-0.109
Token?
Token ID30
Feature activation-1.18e-02

Pos logit contributions
 ears+0.082
 scales+0.078
 wheels+0.077
 wings+0.076
 feathers+0.076
Neg logit contributions
and-0.118
to-0.116
des-0.115
ido-0.112
yes-0.110
Token Why
Token ID4162
Feature activation-1.33e-02

Pos logit contributions
 ears+0.018
 tail+0.018
 wings+0.017
 scales+0.017
ps+0.017
Neg logit contributions
des-0.021
and-0.021
to-0.021
ido-0.020
old-0.020
Token are
Token ID389
Feature activation-2.84e-02

Pos logit contributions
 ears+0.019
 tail+0.018
 wheels+0.018
 scales+0.018
 wings+0.018
Neg logit contributions
des-0.025
and-0.024
to-0.024
ido-0.024
old-0.023
Token you
Token ID345
Feature activation-3.86e-02

Pos logit contributions
 ears+0.036
 tail+0.035
 wheels+0.034
ps+0.034
 scales+0.033
Neg logit contributions
des-0.059
and-0.057
to-0.056
ido-0.055
old-0.054
Token and
Token ID290
Feature activation+1.75e-03

Pos logit contributions
uders+0.029
ates+0.029
 tail+0.028
athed+0.028
 scales+0.027
Neg logit contributions
and-0.048
to-0.042
old-0.042
des-0.042
 inspection-0.041
Token says
Token ID1139
Feature activation+2.38e-04

Pos logit contributions
des+0.004
and+0.003
to+0.003
ido+0.003
 inspection+0.003
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
ps-0.002
 scales-0.002
Token,
Token ID11
Feature activation-6.14e-03

Pos logit contributions
des+0.001
and+0.001
to+0.001
ido+0.001
 inspection+0.001
Neg logit contributions
 ears-0.000
 wheels-0.000
 tail-0.000
ps-0.000
 scales-0.000
Token "
Token ID366
Feature activation+9.40e-03

Pos logit contributions
 ears+0.010
uders+0.010
e+0.010
ates+0.010
ities+0.010
Neg logit contributions
des-0.009
and-0.009
to-0.009
 inspection-0.009
ido-0.009
TokenStop
Token ID19485
Feature activation-2.72e-02

Pos logit contributions
des+0.017
and+0.016
to+0.016
ido+0.015
old+0.015
Neg logit contributions
 ears-0.015
 tail-0.014
 wheels-0.014
ps-0.014
 scales-0.014
Token repeating
Token ID20394
Feature activation-7.10e-02

Pos logit contributions
ates+0.041
 middle+0.039
 scales+0.038
 wheels+0.038
ities+0.037
Neg logit contributions
 inspection-0.047
to-0.047
des-0.046
and-0.045
bie-0.045
Token me
Token ID502
Feature activation-2.36e-02

Pos logit contributions
ities+0.081
ates+0.080
uders+0.079
 ears+0.079
ider+0.078
Neg logit contributions
and-0.139
to-0.139
des-0.137
ido-0.136
 inspection-0.135
Token,
Token ID11
Feature activation-5.17e-03

Pos logit contributions
 ears+0.025
 wheels+0.024
 tail+0.024
 scales+0.023
 wings+0.023
Neg logit contributions
and-0.053
des-0.053
to-0.052
ido-0.051
old-0.050
Token Sue
Token ID26089
Feature activation-1.16e-02

Pos logit contributions
 ears+0.009
 wheels+0.009
 tail+0.009
ps+0.009
 scales+0.009
Neg logit contributions
des-0.008
and-0.008
to-0.008
ido-0.007
 inspection-0.007
Token!
Token ID0
Feature activation-5.86e-03

Pos logit contributions
 ears+0.011
 wheels+0.011
 tail+0.011
ps+0.010
 scales+0.010
Neg logit contributions
des-0.028
and-0.027
to-0.027
ido-0.026
 inspection-0.026
Token You
Token ID921
Feature activation-3.77e-02

Pos logit contributions
 ears+0.008
 tail+0.008
 wheels+0.008
ps+0.008
 scales+0.008
Neg logit contributions
des-0.012
and-0.011
to-0.011
ido-0.011
old-0.011
Token and
Token ID290
Feature activation+1.39e-03

Pos logit contributions
uders+0.051
ates+0.048
e+0.045
ities+0.045
ps+0.042
Neg logit contributions
and-0.069
to-0.066
des-0.064
 inspection-0.060
ido-0.059
Token shouts
Token ID34757
Feature activation-7.19e-03

Pos logit contributions
des+0.003
and+0.002
to+0.002
ido+0.002
 inspection+0.002
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
ps-0.002
 scales-0.002
Token,
Token ID11
Feature activation-1.66e-03

Pos logit contributions
 ears+0.007
uders+0.007
 wheels+0.007
ates+0.007
 tail+0.007
Neg logit contributions
des-0.017
and-0.017
to-0.016
 inspection-0.016
ido-0.016
Token "
Token ID366
Feature activation+2.03e-02

Pos logit contributions
 ears+0.003
 wheels+0.002
 tail+0.002
uders+0.002
ps+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
 inspection-0.003
TokenWhe
Token ID10842
Feature activation-2.26e-02

Pos logit contributions
des+0.034
and+0.032
to+0.032
ido+0.031
 inspection+0.030
Neg logit contributions
 ears-0.035
 wheels-0.034
 tail-0.034
ps-0.033
 scales-0.033
Tokene
Token ID68
Feature activation-7.09e-02

Pos logit contributions
 ears+0.014
 wheels+0.013
 tail+0.012
 scales+0.012
ps+0.012
Neg logit contributions
des-0.062
and-0.061
to-0.060
bie-0.059
ido-0.059
Token!
Token ID0
Feature activation-7.26e-03

Pos logit contributions
ities+0.115
ates+0.107
 ears+0.106
 wheels+0.106
 scales+0.105
Neg logit contributions
and-0.115
des-0.115
to-0.115
ido-0.107
yes-0.102
Token That
Token ID1320
Feature activation-1.78e-02

Pos logit contributions
 ears+0.010
ps+0.009
 tail+0.009
 wheels+0.009
 scales+0.009
Neg logit contributions
des-0.015
and-0.014
to-0.014
ido-0.013
old-0.013
Token was
Token ID373
Feature activation-1.66e-02

Pos logit contributions
 ears+0.021
 wheels+0.020
 tail+0.020
ps+0.019
 scales+0.019
Neg logit contributions
des-0.039
and-0.038
to-0.038
ido-0.037
 inspection-0.036
Token awesome
Token ID7427
Feature activation-3.24e-02

Pos logit contributions
 ears+0.026
 wheels+0.025
 tail+0.025
 scales+0.024
 wings+0.024
Neg logit contributions
des-0.031
and-0.030
to-0.029
ido-0.028
old-0.028
Token!"
Token ID2474
Feature activation+5.68e-03

Pos logit contributions
 ears+0.043
 wheels+0.042
 tail+0.041
ps+0.040
ities+0.040
Neg logit contributions
des-0.065
and-0.063
to-0.062
ido-0.060
 inspection-0.060
Token
Token ID26391
Feature activation-3.00e-02

Pos logit contributions
 ears+0.010
 wings+0.010
 wheels+0.010
 feathers+0.009
 eyes+0.009
Neg logit contributions
des-0.011
to-0.011
and-0.010
old-0.010
ido-0.010
Token
Token ID447
Feature activation-1.00e-02

Pos logit contributions
 ears+0.048
 wheels+0.046
 tail+0.046
ps+0.045
 scales+0.045
Neg logit contributions
des-0.053
and-0.051
to-0.050
ido-0.049
bie-0.048
Token
Token ID250
Feature activation+1.92e-02

Pos logit contributions
 ears+0.015
ps+0.015
 tail+0.014
 wheels+0.014
ities+0.014
Neg logit contributions
des-0.019
to-0.018
and-0.018
bie-0.017
yes-0.017
Token they
Token ID484
Feature activation-8.94e-03

Pos logit contributions
des+0.033
and+0.033
to+0.032
ido+0.030
bie+0.030
Neg logit contributions
 ears-0.032
 tail-0.031
 wheels-0.030
ps-0.030
 scales-0.030
Token lived
Token ID5615
Feature activation-2.44e-02

Pos logit contributions
 ears+0.015
 tail+0.015
 wheels+0.015
ates+0.014
ities+0.014
Neg logit contributions
des-0.015
and-0.014
to-0.014
old-0.014
ido-0.013
Token on
Token ID319
Feature activation-7.05e-02

Pos logit contributions
 ears+0.031
 wheels+0.030
ities+0.029
ps+0.028
uders+0.028
Neg logit contributions
and-0.050
des-0.050
to-0.050
bie-0.046
yes-0.046
Token either
Token ID2035
Feature activation-2.84e-02

Pos logit contributions
ities+0.083
ider+0.082
ps+0.078
 ears+0.077
bed+0.076
Neg logit contributions
des-0.138
to-0.136
and-0.135
ido-0.129
old-0.128
Token side
Token ID1735
Feature activation-1.53e-02

Pos logit contributions
 ears+0.038
ps+0.037
 wheels+0.036
 tail+0.036
ities+0.035
Neg logit contributions
and-0.057
des-0.055
to-0.054
old-0.054
ido-0.053
Token of
Token ID286
Feature activation-2.42e-02

Pos logit contributions
 ears+0.020
 wheels+0.019
 tail+0.019
ps+0.019
ities+0.019
Neg logit contributions
and-0.031
des-0.031
to-0.030
old-0.029
ido-0.029
Token a
Token ID257
Feature activation+2.49e-02

Pos logit contributions
 ears+0.025
 wheels+0.024
ities+0.024
ider+0.023
 tail+0.023
Neg logit contributions
and-0.053
des-0.052
to-0.051
 inspection-0.050
ido-0.050
Token broken
Token ID5445
Feature activation+2.74e-03

Pos logit contributions
des+0.055
and+0.053
to+0.052
ido+0.051
old+0.051
Neg logit contributions
 tail-0.029
 ears-0.029
 wheels-0.028
 scales-0.028
 wings-0.027
Token and
Token ID290
Feature activation+9.32e-03

Pos logit contributions
uders+0.032
ates+0.031
e+0.030
ities+0.030
 wheels+0.029
Neg logit contributions
and-0.053
to-0.051
des-0.051
 inspection-0.048
ido-0.047
Token shout
Token ID9191
Feature activation-7.92e-03

Pos logit contributions
des+0.016
and+0.016
to+0.015
ido+0.015
 inspection+0.015
Neg logit contributions
 ears-0.015
 wheels-0.014
 tail-0.014
ps-0.014
 scales-0.014
Token,
Token ID11
Feature activation-4.83e-04

Pos logit contributions
uders+0.008
ates+0.008
 wheels+0.008
 ears+0.008
 scales+0.008
Neg logit contributions
and-0.018
des-0.017
to-0.017
 inspection-0.017
ido-0.017
Token "
Token ID366
Feature activation+2.16e-02

Pos logit contributions
 ears+0.001
 tail+0.001
 wheels+0.001
ps+0.001
 scales+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
old-0.001
TokenWhe
Token ID10842
Feature activation-2.25e-02

Pos logit contributions
des+0.036
and+0.034
to+0.033
ido+0.031
 inspection+0.031
Neg logit contributions
 ears-0.038
 tail-0.036
 wheels-0.036
ps-0.035
 scales-0.035
Tokene
Token ID68
Feature activation-7.01e-02

Pos logit contributions
 ears+0.019
 wheels+0.019
 tail+0.018
 scales+0.018
ps+0.017
Neg logit contributions
des-0.056
and-0.055
to-0.054
bie-0.054
ido-0.053
Token!"
Token ID2474
Feature activation+1.37e-04

Pos logit contributions
ities+0.114
 wheels+0.108
 sincere+0.106
ates+0.106
 scales+0.106
Neg logit contributions
and-0.110
to-0.109
des-0.108
ido-0.100
 inspection-0.098
Token
Token ID198
Feature activation+5.02e-03

Pos logit contributions
des+0.000
and+0.000
to+0.000
ido+0.000
 inspection+0.000
Neg logit contributions
 ears-0.000
 wheels-0.000
 tail-0.000
ps-0.000
 scales-0.000
Token
Token ID198
Feature activation-2.27e-02

Pos logit contributions
des+0.012
yes+0.011
bie+0.010
 inspection+0.010
ability+0.010
Neg logit contributions
 ears-0.005
 middle-0.005
 tail-0.005
 wheels-0.005
e-0.005
TokenThey
Token ID2990
Feature activation-1.29e-02

Pos logit contributions
 ears+0.038
ps+0.037
 wheels+0.037
 wings+0.036
 tail+0.036
Neg logit contributions
des-0.039
and-0.038
to-0.038
ido-0.037
old-0.034
Token roll
Token ID4836
Feature activation+1.60e-03

Pos logit contributions
 ears+0.019
 wheels+0.018
 tail+0.018
ps+0.017
uders+0.017
Neg logit contributions
des-0.025
and-0.024
to-0.023
ido-0.023
 inspection-0.022
Token Sam
Token ID3409
Feature activation-3.96e-02

Pos logit contributions
 ears+0.032
 wheels+0.031
 tail+0.031
 scales+0.030
ps+0.030
Neg logit contributions
des-0.045
and-0.043
to-0.043
ido-0.042
 inspection-0.041
Token,
Token ID11
Feature activation-2.25e-02

Pos logit contributions
ities+0.043
 ears+0.042
ates+0.042
 wheels+0.041
 scales+0.041
Neg logit contributions
des-0.085
to-0.081
old-0.081
bie-0.081
 inspection-0.081
Token what
Token ID644
Feature activation+4.29e-03

Pos logit contributions
 ears+0.031
 wheels+0.030
 tail+0.030
ps+0.029
 scales+0.029
Neg logit contributions
des-0.045
and-0.043
to-0.043
ido-0.041
old-0.041
Token's
Token ID338
Feature activation-1.04e-02

Pos logit contributions
des+0.009
and+0.008
to+0.008
ido+0.008
old+0.008
Neg logit contributions
 ears-0.006
 wheels-0.005
 tail-0.005
ps-0.005
 scales-0.005
Token going
Token ID1016
Feature activation-1.58e-02

Pos logit contributions
 ears+0.016
ities+0.016
ates+0.015
 tail+0.015
bed+0.015
Neg logit contributions
des-0.017
to-0.016
ido-0.016
old-0.016
 Su-0.015
Token on
Token ID319
Feature activation-6.92e-02

Pos logit contributions
 ears+0.020
ities+0.020
 scales+0.019
ates+0.018
 tails+0.018
Neg logit contributions
des-0.033
to-0.033
and-0.031
yes-0.031
ido-0.030
Token?"
Token ID1701
Feature activation-1.31e-02

Pos logit contributions
 ears+0.110
ities+0.108
 tails+0.107
 tail+0.103
 necks+0.100
Neg logit contributions
to-0.111
des-0.108
ido-0.108
 Su-0.107
ll-0.103
Token Mom
Token ID11254
Feature activation+1.65e-02

Pos logit contributions
ates+0.030
ities+0.028
ps+0.028
ider+0.026
bed+0.026
Neg logit contributions
 Su-0.016
 inspection-0.015
old-0.015
des-0.014
ido-0.013
Token says
Token ID1139
Feature activation+2.00e-02

Pos logit contributions
des+0.025
to+0.025
and+0.025
old+0.024
bie+0.023
Neg logit contributions
 tail-0.028
 ears-0.028
ates-0.028
ities-0.028
 wheels-0.027
Token.
Token ID13
Feature activation+2.56e-03

Pos logit contributions
des+0.050
and+0.048
to+0.047
ido+0.047
 inspection+0.046
Neg logit contributions
 ears-0.018
 tail-0.017
 wheels-0.017
ps-0.016
 wings-0.016
Token She
Token ID1375
Feature activation-6.61e-03

Pos logit contributions
des+0.004
and+0.004
to+0.004
 inspection+0.004
ido+0.004
Neg logit contributions
 ears-0.004
ates-0.004
 scales-0.004
e-0.004
 tail-0.004
Token.
Token ID13
Feature activation+7.98e-03

Pos logit contributions
 ears+0.012
 tail+0.011
uders+0.011
ities+0.011
ates+0.011
Neg logit contributions
and-0.023
des-0.022
to-0.022
 inspection-0.021
ido-0.021
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
des+0.019
and+0.017
ido+0.016
bie+0.016
old+0.016
Neg logit contributions
 ears-0.009
 wheels-0.009
 tail-0.009
 skies-0.008
 front-0.008
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
des+0.031
yes+0.027
ability+0.027
ido+0.027
bie+0.027
Neg logit contributions
 front-0.009
 middle-0.009
 ears-0.009
 wheels-0.009
e-0.008
Token"
Token ID1
Feature activation+1.76e-02

Pos logit contributions
 ears+0.018
 wheels+0.017
 tail+0.017
e+0.016
ps+0.016
Neg logit contributions
des-0.036
and-0.035
to-0.034
ido-0.034
 inspection-0.033
TokenWhe
Token ID10842
Feature activation-1.63e-02

Pos logit contributions
des+0.031
and+0.030
to+0.029
ido+0.028
old+0.028
Neg logit contributions
 ears-0.028
 wheels-0.028
 tail-0.028
ps-0.027
 scales-0.027
Tokene
Token ID68
Feature activation-6.90e-02

Pos logit contributions
 ears+0.007
 wheels+0.007
 tail+0.006
 scales+0.006
uders+0.006
Neg logit contributions
des-0.048
and-0.048
to-0.046
bie-0.046
old-0.046
Token!"
Token ID2474
Feature activation-1.74e-02

Pos logit contributions
ities+0.103
 ears+0.093
 wheels+0.093
 sincere+0.092
ates+0.092
Neg logit contributions
and-0.121
to-0.119
des-0.117
 inspection-0.111
ido-0.109
Token Lily
Token ID20037
Feature activation-9.36e-03

Pos logit contributions
ates+0.035
ities+0.033
uders+0.033
ps+0.033
 scales+0.032
Neg logit contributions
des-0.024
 inspection-0.023
old-0.023
and-0.022
 Su-0.022
Token shouts
Token ID34757
Feature activation-3.59e-03

Pos logit contributions
ities+0.021
ates+0.021
uders+0.020
 wing+0.020
ider+0.019
Neg logit contributions
old-0.010
des-0.009
to-0.009
and-0.009
rig-0.008
Token.
Token ID13
Feature activation+3.59e-03

Pos logit contributions
ates+0.005
uders+0.005
ities+0.004
 scales+0.004
 wheels+0.004
Neg logit contributions
and-0.007
des-0.007
to-0.007
old-0.007
ido-0.007
Token "
Token ID366
Feature activation+1.39e-02

Pos logit contributions
des+0.006
and+0.006
to+0.006
ido+0.006
 inspection+0.006
Neg logit contributions
 ears-0.006
 wheels-0.005
 tail-0.005
 scales-0.005
ps-0.005
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
des+0.003
and+0.003
to+0.003
ido+0.003
 inspection+0.003
Neg logit contributions
 ears-0.001
 wheels-0.001
 tail-0.001
ps-0.001
 scales-0.001
Token
Token ID198
Feature activation+1.99e-03

Pos logit contributions
 ears+0.004
 wheels+0.004
 tail+0.004
 skies+0.004
 front+0.004
Neg logit contributions
des-0.007
and-0.006
bie-0.006
ido-0.006
to-0.006
Token
Token ID198
Feature activation-2.40e-02

Pos logit contributions
des+0.005
yes+0.004
bie+0.004
ability+0.004
iden+0.004
Neg logit contributions
 ears-0.002
 front-0.002
 middle-0.002
e-0.002
 wheels-0.002
Token"
Token ID1
Feature activation+1.05e-02

Pos logit contributions
 ears+0.031
 wheels+0.030
 tail+0.030
ps+0.029
 scales+0.029
Neg logit contributions
des-0.050
and-0.048
to-0.047
ido-0.046
 inspection-0.045
TokenWhe
Token ID10842
Feature activation-1.99e-02

Pos logit contributions
des+0.017
and+0.016
to+0.016
ido+0.015
 inspection+0.015
Neg logit contributions
 ears-0.019
 tail-0.018
 wheels-0.018
 scales-0.018
 wings-0.018
Tokene
Token ID68
Feature activation-6.89e-02

Pos logit contributions
 ears+0.009
 wheels+0.009
 tail+0.008
 scales+0.008
uders+0.007
Neg logit contributions
des-0.058
and-0.057
to-0.056
bie-0.055
ido-0.055
Token!"
Token ID2474
Feature activation-2.26e-02

Pos logit contributions
ities+0.116
ates+0.108
 ears+0.106
 skies+0.106
 scales+0.105
Neg logit contributions
des-0.111
and-0.109
to-0.106
ido-0.100
 inspection-0.098
Token Lily
Token ID20037
Feature activation-9.65e-03

Pos logit contributions
ates+0.045
ps+0.043
uders+0.043
ities+0.043
 scales+0.043
Neg logit contributions
des-0.032
and-0.030
 inspection-0.030
old-0.029
to-0.029
Token says
Token ID1139
Feature activation+8.64e-03

Pos logit contributions
ates+0.022
ities+0.022
uders+0.021
 tail+0.020
 wing+0.020
Neg logit contributions
old-0.010
des-0.010
and-0.009
to-0.009
 inspection-0.008
Token.
Token ID13
Feature activation-4.92e-04

Pos logit contributions
des+0.021
and+0.021
to+0.020
ido+0.020
 inspection+0.020
Neg logit contributions
 ears-0.008
 tail-0.007
 wheels-0.007
ps-0.007
 wings-0.007
Token "
Token ID366
Feature activation+6.47e-03

Pos logit contributions
 ears+0.001
 wheels+0.001
 scales+0.001
 tail+0.001
e+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
 inspection-0.001
Token He
Token ID679
Feature activation-1.52e-02

Pos logit contributions
des+0.006
and+0.006
to+0.006
ido+0.005
old+0.005
Neg logit contributions
 ears-0.005
 tail-0.005
 wheels-0.005
ps-0.005
 wings-0.005
Token has
Token ID468
Feature activation-2.56e-02

Pos logit contributions
ities+0.022
 wheels+0.022
 tail+0.022
 ears+0.022
bed+0.022
Neg logit contributions
des-0.026
old-0.026
to-0.025
ll-0.024
and-0.024
Token a
Token ID257
Feature activation+8.87e-03

Pos logit contributions
bed+0.027
ider+0.027
uders+0.027
ities+0.027
 ears+0.026
Neg logit contributions
des-0.054
and-0.054
to-0.053
old-0.051
 inspection-0.051
Token big
Token ID1263
Feature activation+4.97e-03

Pos logit contributions
des+0.019
and+0.018
to+0.018
ido+0.017
 inspection+0.017
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.010
ps-0.010
uders-0.010
Token man
Token ID582
Feature activation-1.16e-02

Pos logit contributions
bie+0.010
des+0.010
 inspection+0.010
ido+0.009
to+0.009
Neg logit contributions
ps-0.006
ities-0.006
uders-0.006
ters-0.006
ider-0.006
Tokene
Token ID68
Feature activation-6.86e-02

Pos logit contributions
 skies+0.012
 wheels+0.012
uders+0.012
 ears+0.012
ities+0.011
Neg logit contributions
and-0.026
des-0.026
old-0.025
to-0.025
ido-0.025
Token of
Token ID286
Feature activation-1.48e-02

Pos logit contributions
 middle+0.084
 elbows+0.083
ities+0.080
 End+0.078
 skies+0.077
Neg logit contributions
to-0.127
and-0.127
ido-0.120
des-0.116
old-0.115
Token hair
Token ID4190
Feature activation-4.46e-02

Pos logit contributions
 ears+0.016
uders+0.016
ities+0.016
ps+0.016
bed+0.015
Neg logit contributions
des-0.031
and-0.031
to-0.030
ido-0.030
old-0.029
Token and
Token ID290
Feature activation-3.03e-02

Pos logit contributions
uders+0.059
 elbows+0.054
athed+0.052
ities+0.052
 wings+0.052
Neg logit contributions
and-0.081
ido-0.078
des-0.077
to-0.077
bie-0.075
Token a
Token ID257
Feature activation+2.17e-03

Pos logit contributions
uders+0.031
ities+0.030
ates+0.030
ider+0.029
bed+0.029
Neg logit contributions
des-0.065
to-0.064
old-0.062
and-0.062
 inspection-0.061
Token loud
Token ID7812
Feature activation-3.57e-03

Pos logit contributions
des+0.005
and+0.004
to+0.004
ido+0.004
bie+0.004
Neg logit contributions
ps-0.003
 ears-0.002
uders-0.002
ities-0.002
 wheels-0.002
Token
Token ID198
Feature activation-2.80e-02

Pos logit contributions
 front+0.004
 middle+0.004
e+0.004
 with+0.003
 wheels+0.003
Neg logit contributions
des-0.011
 inspection-0.010
bie-0.009
yes-0.009
ido-0.009
Token"
Token ID1
Feature activation+1.74e-02

Pos logit contributions
 ears+0.038
 tail+0.036
 wheels+0.036
ps+0.035
 scales+0.035
Neg logit contributions
des-0.057
and-0.055
to-0.054
ido-0.053
old-0.052
TokenTime
Token ID7575
Feature activation-6.06e-03

Pos logit contributions
des+0.027
and+0.026
to+0.026
ido+0.025
 inspection+0.024
Neg logit contributions
 ears-0.032
 tail-0.031
 wheels-0.031
ps-0.030
 scales-0.030
Token to
Token ID284
Feature activation-7.80e-03

Pos logit contributions
 ears+0.011
 tail+0.011
 wheels+0.011
ps+0.011
 wings+0.010
Neg logit contributions
des-0.009
and-0.009
to-0.009
ido-0.008
old-0.008
Token bat
Token ID7365
Feature activation-2.13e-02

Pos logit contributions
 ears+0.014
 tail+0.013
ities+0.013
uders+0.013
ates+0.013
Neg logit contributions
des-0.013
and-0.011
to-0.011
ido-0.011
bie-0.011
Tokenhe
Token ID258
Feature activation-6.85e-02

Pos logit contributions
 ears+0.043
 wheels+0.042
 tail+0.041
ps+0.041
 scales+0.040
Neg logit contributions
des-0.029
and-0.027
to-0.027
ido-0.026
 inspection-0.025
Token,
Token ID11
Feature activation-7.57e-04

Pos logit contributions
 wheels+0.096
ities+0.095
 ears+0.095
 skies+0.094
uders+0.091
Neg logit contributions
and-0.116
ido-0.110
bie-0.108
des-0.108
to-0.108
Token L
Token ID406
Feature activation+7.19e-03

Pos logit contributions
 ears+0.001
 tail+0.001
 wheels+0.001
ps+0.001
 scales+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
 inspection-0.001
Tokenila
Token ID10102
Feature activation-3.05e-02

Pos logit contributions
des+0.013
and+0.013
to+0.013
 inspection+0.012
bie+0.012
Neg logit contributions
e-0.010
 ears-0.010
uders-0.010
ps-0.010
 wheels-0.010
Token and
Token ID290
Feature activation-3.30e-02

Pos logit contributions
 ears+0.036
 wheels+0.035
ities+0.035
ps+0.034
 scales+0.034
Neg logit contributions
des-0.065
and-0.062
ido-0.061
to-0.061
old-0.061
Token Tom
Token ID4186
Feature activation-1.80e-02

Pos logit contributions
 ears+0.040
 wheels+0.038
 tail+0.037
 wings+0.034
 feathers+0.034
Neg logit contributions
des-0.076
and-0.075
to-0.073
 inspection-0.070
rig-0.068
Token and
Token ID290
Feature activation-5.14e-03

Pos logit contributions
uders+0.026
bed+0.025
ates+0.024
ps+0.024
 middle+0.024
Neg logit contributions
and-0.043
des-0.042
old-0.040
to-0.039
ido-0.039
Token has
Token ID468
Feature activation-3.30e-02

Pos logit contributions
 ears+0.007
 tail+0.007
 wheels+0.007
 scales+0.007
ps+0.006
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.010
 inspection-0.009
Token a
Token ID257
Feature activation-7.50e-04

Pos logit contributions
bed+0.037
ider+0.036
ities+0.035
uders+0.035
ates+0.035
Neg logit contributions
and-0.068
des-0.067
to-0.065
 inspection-0.063
old-0.063
Token big
Token ID1263
Feature activation+1.69e-03

Pos logit contributions
ities+0.001
ps+0.001
ates+0.001
uders+0.001
 wheels+0.001
Neg logit contributions
to-0.001
des-0.001
bie-0.001
and-0.001
ido-0.001
Token man
Token ID582
Feature activation-1.47e-02

Pos logit contributions
bie+0.003
 inspection+0.003
ido+0.003
des+0.003
iden+0.003
Neg logit contributions
ities-0.002
ates-0.002
ps-0.002
ters-0.002
uders-0.002
Tokene
Token ID68
Feature activation-6.83e-02

Pos logit contributions
 skies+0.012
 ears+0.011
 wheels+0.011
ities+0.011
uders+0.011
Neg logit contributions
des-0.037
and-0.036
to-0.036
old-0.035
ido-0.035
Token.
Token ID13
Feature activation-5.35e-03

Pos logit contributions
 End+0.098
 end+0.095
ities+0.094
 elbows+0.094
 middle+0.094
Neg logit contributions
to-0.115
and-0.108
ido-0.102
des-0.101
bie-0.099
Token He
Token ID679
Feature activation-2.57e-02

Pos logit contributions
 ears+0.008
ps+0.008
 tail+0.008
 wheels+0.008
uders+0.008
Neg logit contributions
des-0.010
and-0.009
to-0.009
ido-0.009
old-0.009
Token can
Token ID460
Feature activation-1.70e-04

Pos logit contributions
ities+0.039
 tail+0.037
bed+0.037
 wheels+0.037
uders+0.037
Neg logit contributions
des-0.043
to-0.042
old-0.042
bie-0.041
ll-0.040
Token roar
Token ID34532
Feature activation-1.65e-02

Pos logit contributions
ities+0.000
ates+0.000
uders+0.000
 ears+0.000
 wheels+0.000
Neg logit contributions
to-0.000
des-0.000
of-0.000
ido-0.000
yes-0.000
Token very
Token ID845
Feature activation-1.52e-03

Pos logit contributions
 ears+0.017
uders+0.017
 wheels+0.017
ities+0.016
 tail+0.016
Neg logit contributions
and-0.037
to-0.036
des-0.035
 inspection-0.034
ido-0.034
Token and
Token ID290
Feature activation+4.63e-03

Pos logit contributions
des+0.028
and+0.027
to+0.027
ido+0.026
old+0.026
Neg logit contributions
 ears-0.012
 wheels-0.011
 tail-0.011
uders-0.011
 scales-0.011
Token says
Token ID1139
Feature activation+6.47e-03

Pos logit contributions
des+0.008
and+0.008
to+0.008
ido+0.008
 inspection+0.007
Neg logit contributions
 ears-0.007
 tail-0.007
 wheels-0.007
ps-0.007
 scales-0.007
Token,
Token ID11
Feature activation-2.71e-03

Pos logit contributions
des+0.017
and+0.017
to+0.016
ido+0.016
 inspection+0.016
Neg logit contributions
 ears-0.005
 wheels-0.004
 tail-0.004
ps-0.004
 scales-0.004
Token "
Token ID366
Feature activation+7.45e-03

Pos logit contributions
 ears+0.004
ates+0.004
uders+0.004
e+0.004
 wheels+0.004
Neg logit contributions
to-0.004
des-0.004
and-0.004
 inspection-0.004
ido-0.004
TokenStop
Token ID19485
Feature activation-2.66e-02

Pos logit contributions
des+0.014
and+0.014
to+0.013
ido+0.013
bie+0.013
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.011
e-0.010
 scales-0.010
Token repeating
Token ID20394
Feature activation-6.81e-02

Pos logit contributions
ates+0.038
 middle+0.036
 scales+0.036
 wheels+0.036
ities+0.035
Neg logit contributions
to-0.047
 inspection-0.047
and-0.047
des-0.046
ido-0.045
Token me
Token ID502
Feature activation-1.86e-02

Pos logit contributions
ates+0.076
ities+0.076
 ears+0.075
uders+0.075
ider+0.074
Neg logit contributions
and-0.136
to-0.136
des-0.134
ido-0.132
 inspection-0.131
Token,
Token ID11
Feature activation-5.14e-03

Pos logit contributions
 ears+0.020
 wheels+0.019
 tail+0.019
 scales+0.018
 wings+0.018
Neg logit contributions
des-0.042
and-0.041
to-0.041
ido-0.039
old-0.039
Token Sue
Token ID26089
Feature activation-1.35e-02

Pos logit contributions
 ears+0.008
 wheels+0.008
 tail+0.008
ps+0.008
 scales+0.007
Neg logit contributions
des-0.009
and-0.009
to-0.009
ido-0.009
old-0.008
Token!
Token ID0
Feature activation-1.29e-03

Pos logit contributions
 ears+0.014
 wheels+0.014
 tail+0.013
 scales+0.013
ps+0.013
Neg logit contributions
des-0.031
and-0.030
to-0.030
ido-0.029
old-0.029
Token You
Token ID921
Feature activation-3.96e-02

Pos logit contributions
 ears+0.002
 tail+0.002
 wheels+0.002
 scales+0.002
ps+0.002
Neg logit contributions
des-0.002
and-0.002
to-0.002
ido-0.002
old-0.002
TokenHey
Token ID10814
Feature activation-2.74e-02

Pos logit contributions
des+0.001
and+0.001
to+0.001
yes+0.001
Of+0.001
Neg logit contributions
 ears-0.002
 wings-0.002
 scales-0.001
 tail-0.001
ates-0.001
Token,
Token ID11
Feature activation-9.04e-03

Pos logit contributions
ates+0.029
 ears+0.028
ps+0.028
 wheels+0.027
 scales+0.027
Neg logit contributions
des-0.060
 inspection-0.059
ido-0.058
and-0.057
bie-0.057
Token what
Token ID644
Feature activation+2.61e-03

Pos logit contributions
 ears+0.013
 tail+0.012
 wheels+0.012
 scales+0.012
ps+0.012
Neg logit contributions
des-0.018
and-0.017
to-0.017
ido-0.017
 inspection-0.016
Token's
Token ID338
Feature activation-6.90e-03

Pos logit contributions
des+0.005
and+0.005
to+0.005
old+0.005
ido+0.005
Neg logit contributions
 ears-0.004
 tail-0.004
 wheels-0.004
 scales-0.003
ps-0.003
Token going
Token ID1016
Feature activation-1.41e-02

Pos logit contributions
 ears+0.010
 tail+0.009
ities+0.009
ates+0.009
 wheels+0.009
Neg logit contributions
des-0.013
to-0.013
ido-0.012
 inspection-0.012
and-0.012
Token on
Token ID319
Feature activation-6.80e-02

Pos logit contributions
 ears+0.020
ities+0.019
 scales+0.018
ps+0.018
 tail+0.018
Neg logit contributions
to-0.028
des-0.028
and-0.027
ido-0.026
yes-0.026
Token here
Token ID994
Feature activation-5.50e-02

Pos logit contributions
 ears+0.120
ities+0.116
 tails+0.112
 tail+0.111
 necks+0.108
Neg logit contributions
ido-0.099
to-0.099
des-0.095
 Su-0.093
 awaited-0.091
Token?"
Token ID1701
Feature activation-2.13e-02

Pos logit contributions
 ears+0.074
 wheels+0.067
 tail+0.067
ps+0.067
ities+0.067
Neg logit contributions
to-0.111
des-0.110
and-0.109
ido-0.107
old-0.104
Token a
Token ID257
Feature activation+1.83e-02

Pos logit contributions
ates+0.047
ities+0.045
ps+0.045
 wing+0.044
bed+0.043
Neg logit contributions
old-0.025
 inspection-0.025
ido-0.023
 Su-0.023
and-0.022
Token voice
Token ID3809
Feature activation+2.50e-02

Pos logit contributions
des+0.045
and+0.044
to+0.043
ido+0.043
old+0.043
Neg logit contributions
 ears-0.014
ates-0.014
ities-0.014
ps-0.014
uders-0.014
Token said
Token ID531
Feature activation+1.39e-02

Pos logit contributions
des+0.040
and+0.038
to+0.038
old+0.038
ido+0.037
Neg logit contributions
 ears-0.044
 tail-0.043
 wheels-0.041
ities-0.041
ates-0.041
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
uders+0.021
 ears+0.020
ities+0.020
ates+0.020
 wheels+0.019
Neg logit contributions
and-0.049
des-0.049
to-0.049
 inspection-0.047
ido-0.046
Token said
Token ID531
Feature activation+3.75e-03

Pos logit contributions
 ears+0.014
 wheels+0.013
 tail+0.013
ps+0.013
 scales+0.013
Neg logit contributions
des-0.022
and-0.021
to-0.020
ido-0.020
old-0.020
Token,
Token ID11
Feature activation-6.57e-03

Pos logit contributions
des+0.010
and+0.010
to+0.010
ido+0.010
bie+0.009
Neg logit contributions
 ears-0.003
 wheels-0.002
 tail-0.002
 wings-0.002
 scales-0.002
Token "
Token ID366
Feature activation+1.79e-02

Pos logit contributions
 ears+0.010
 tail+0.009
 wheels+0.009
ps+0.009
 scales+0.009
Neg logit contributions
des-0.013
and-0.012
to-0.012
ido-0.012
old-0.011
TokenWhe
Token ID10842
Feature activation-2.20e-02

Pos logit contributions
des+0.032
and+0.031
to+0.031
ido+0.029
old+0.028
Neg logit contributions
 ears-0.029
 tail-0.028
uders-0.027
ps-0.027
 wheels-0.027
Tokene
Token ID68
Feature activation-6.79e-02

Pos logit contributions
 ears+0.010
 wheels+0.009
 scales+0.009
 tail+0.008
ps+0.008
Neg logit contributions
des-0.065
and-0.064
to-0.063
bie-0.062
old-0.062
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
ities+0.105
 wheels+0.097
 skies+0.096
 ears+0.096
 sincere+0.096
Neg logit contributions
to-0.115
and-0.115
des-0.114
ido-0.106
of-0.103
Token Brown
Token ID4373
Feature activation+1.46e-02

Pos logit contributions
 ears+0.020
 wheels+0.019
 tail+0.019
ps+0.019
 scales+0.019
Neg logit contributions
des-0.034
and-0.032
to-0.032
ido-0.031
 inspection-0.031
Tokenie
Token ID494
Feature activation-2.62e-02

Pos logit contributions
des+0.023
and+0.022
to+0.021
ido+0.020
 inspection+0.020
Neg logit contributions
 ears-0.027
 tail-0.026
 wheels-0.025
e-0.025
ps-0.025
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
 ears+0.036
 wheels+0.035
ities+0.035
ps+0.034
ates+0.034
Neg logit contributions
des-0.049
and-0.047
to-0.047
bie-0.046
ido-0.046
Token you
Token ID345
Feature activation-2.70e-02

Pos logit contributions
 ears+0.020
 wheels+0.019
 tail+0.019
 wings+0.018
e+0.018
Neg logit contributions
des-0.038
and-0.037
to-0.037
old-0.035
ido-0.035
Token.
Token ID13
Feature activation-1.55e-03

Pos logit contributions
des+0.046
and+0.045
to+0.044
ido+0.043
old+0.043
Neg logit contributions
 ears-0.020
 tail-0.019
 wheels-0.019
uders-0.018
ps-0.018
Token "
Token ID366
Feature activation+1.20e-02

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
ps+0.002
 scales+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
bie-0.003
TokenWhy
Token ID5195
Feature activation-1.04e-03

Pos logit contributions
des+0.023
and+0.023
to+0.022
ido+0.021
old+0.020
Neg logit contributions
 ears-0.018
 tail-0.017
 wheels-0.017
 wings-0.017
 scales-0.017
Token are
Token ID389
Feature activation-1.56e-02

Pos logit contributions
 ears+0.001
 wheels+0.001
 tail+0.001
 scales+0.001
ps+0.001
Neg logit contributions
des-0.002
and-0.002
to-0.002
ido-0.002
old-0.002
Token you
Token ID345
Feature activation-3.36e-02

Pos logit contributions
 ears+0.020
 tail+0.019
 wheels+0.019
ps+0.018
ities+0.018
Neg logit contributions
des-0.032
and-0.031
to-0.031
ido-0.030
 inspection-0.030
Token on
Token ID319
Feature activation-6.79e-02

Pos logit contributions
ities+0.041
 ears+0.041
 wheels+0.040
ates+0.040
 scales+0.040
Neg logit contributions
to-0.066
ido-0.064
and-0.064
bie-0.064
des-0.063
Token my
Token ID616
Feature activation-8.68e-03

Pos logit contributions
ities+0.094
ider+0.093
bed+0.088
ates+0.087
 ears+0.087
Neg logit contributions
 inspection-0.121
to-0.121
ido-0.121
des-0.120
 Su-0.117
Token nose
Token ID9686
Feature activation-2.89e-02

Pos logit contributions
 ears+0.013
 wheels+0.012
 tail+0.012
ps+0.012
 scales+0.012
Neg logit contributions
des-0.016
and-0.016
to-0.016
ido-0.015
 inspection-0.015
Token?"
Token ID1701
Feature activation-1.35e-03

Pos logit contributions
 ears+0.028
 tail+0.027
ps+0.027
 wheels+0.027
 scales+0.027
Neg logit contributions
des-0.066
and-0.066
to-0.064
ido-0.063
bie-0.062
Token
Token ID198
Feature activation+3.96e-03

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
e+0.002
 scales+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.002
ido-0.002
bie-0.002
Token
Token ID198
Feature activation-9.19e-03

Pos logit contributions
des+0.009
bie+0.008
 inspection+0.008
and+0.008
ido+0.008
Neg logit contributions
e-0.004
 wheels-0.004
 front-0.004
 ears-0.004
 middle-0.004
Token.
Token ID13
Feature activation+3.01e-03

Pos logit contributions
des+0.063
and+0.061
to+0.060
ido+0.059
 inspection+0.058
Neg logit contributions
 ears-0.023
 wheels-0.021
 tail-0.021
ps-0.020
 wings-0.020
Token "
Token ID366
Feature activation+7.30e-03

Pos logit contributions
des+0.006
and+0.005
to+0.005
ido+0.005
old+0.005
Neg logit contributions
 ears-0.005
 tail-0.005
 wheels-0.005
ps-0.004
 scales-0.004
TokenWhy
Token ID5195
Feature activation+3.15e-04

Pos logit contributions
and+0.013
des+0.013
to+0.013
ido+0.012
old+0.012
Neg logit contributions
 ears-0.012
 tail-0.011
 wings-0.011
 wheels-0.011
 scales-0.011
Token are
Token ID389
Feature activation-1.39e-02

Pos logit contributions
des+0.001
and+0.001
to+0.001
ido+0.001
bie+0.001
Neg logit contributions
 ears-0.000
 wheels-0.000
 tail-0.000
 scales-0.000
 wings-0.000
Token you
Token ID345
Feature activation-2.99e-02

Pos logit contributions
 ears+0.018
 tail+0.018
 wheels+0.017
ps+0.017
 scales+0.017
Neg logit contributions
des-0.029
and-0.028
to-0.027
ido-0.027
 inspection-0.026
Token under
Token ID739
Feature activation-6.78e-02

Pos logit contributions
 scales+0.038
 wheels+0.038
 middle+0.037
ities+0.037
 tail+0.037
Neg logit contributions
bie-0.058
to-0.057
and-0.056
ido-0.056
 Su-0.054
Token the
Token ID262
Feature activation+9.39e-03

Pos logit contributions
ities+0.104
ider+0.102
ates+0.096
ps+0.092
 ears+0.091
Neg logit contributions
des-0.111
 inspection-0.111
and-0.109
 Su-0.107
to-0.107
Token sheet
Token ID9629
Feature activation-3.88e-02

Pos logit contributions
des+0.020
and+0.020
to+0.019
ido+0.019
 inspection+0.018
Neg logit contributions
 ears-0.012
 wheels-0.011
 tail-0.011
ps-0.011
 scales-0.010
Token?"
Token ID1701
Feature activation-3.86e-03

Pos logit contributions
 ears+0.039
 wheels+0.037
 tail+0.036
ities+0.035
 scales+0.035
Neg logit contributions
des-0.091
and-0.089
to-0.087
ido-0.086
old-0.084
Token
Token ID198
Feature activation+2.40e-03

Pos logit contributions
 ears+0.006
 wheels+0.006
 tail+0.006
 wings+0.005
ps+0.005
Neg logit contributions
des-0.007
and-0.007
to-0.007
ido-0.007
bie-0.007
Token
Token ID198
Feature activation-8.46e-03

Pos logit contributions
des+0.005
ido+0.005
and+0.005
bie+0.005
to+0.005
Neg logit contributions
 ears-0.003
 tail-0.003
 wheels-0.003
 front-0.003
e-0.003
Token.
Token ID13
Feature activation-5.59e-03

Pos logit contributions
and+0.018
des+0.017
to+0.017
old+0.016
ido+0.016
Neg logit contributions
 ears-0.010
uders-0.010
ates-0.010
ities-0.010
 tail-0.010
Token
Token ID198
Feature activation-4.00e-03

Pos logit contributions
 ears+0.008
 wheels+0.008
 tail+0.008
 scales+0.007
uders+0.007
Neg logit contributions
des-0.011
and-0.011
to-0.010
ido-0.010
 inspection-0.010
Token
Token ID198
Feature activation-2.29e-02

Pos logit contributions
 front+0.003
 middle+0.003
 ears+0.003
e+0.003
 tail+0.003
Neg logit contributions
des-0.010
 inspection-0.009
yes-0.009
ido-0.009
 awaited-0.009
Token"
Token ID1
Feature activation+1.06e-02

Pos logit contributions
 ears+0.026
 tail+0.026
e+0.026
 wheels+0.025
 middle+0.024
Neg logit contributions
des-0.050
and-0.047
to-0.047
 inspection-0.046
ido-0.046
TokenWhe
Token ID10842
Feature activation-1.79e-02

Pos logit contributions
des+0.018
and+0.017
to+0.017
ido+0.016
yes+0.016
Neg logit contributions
 ears-0.018
 tail-0.017
 wheels-0.017
 wings-0.017
ates-0.017
Tokene
Token ID68
Feature activation-6.76e-02

Pos logit contributions
 ears+0.010
 wheels+0.010
 scales+0.009
 tail+0.009
ities+0.008
Neg logit contributions
des-0.049
and-0.049
bie-0.048
old-0.048
to-0.048
Token!"
Token ID2474
Feature activation-2.29e-02

Pos logit contributions
ities+0.105
ates+0.096
 skies+0.094
 wheels+0.093
 middle+0.092
Neg logit contributions
to-0.112
des-0.108
and-0.107
ido-0.105
of-0.103
Token Anna
Token ID11735
Feature activation-8.58e-03

Pos logit contributions
ities+0.037
 ears+0.036
ates+0.036
uders+0.036
 tail+0.036
Neg logit contributions
des-0.037
and-0.036
to-0.036
ido-0.036
 inspection-0.036
Token and
Token ID290
Feature activation-6.60e-03

Pos logit contributions
ities+0.016
ates+0.016
uders+0.016
 wheels+0.015
ider+0.015
Neg logit contributions
to-0.012
old-0.011
and-0.011
des-0.011
bie-0.011
Token Ben
Token ID3932
Feature activation+7.57e-03

Pos logit contributions
 ears+0.008
 tail+0.008
 wheels+0.008
ps+0.007
 scales+0.007
Neg logit contributions
des-0.014
and-0.014
to-0.014
ido-0.013
 inspection-0.013
Token say
Token ID910
Feature activation-2.94e-03

Pos logit contributions
des+0.011
to+0.010
and+0.010
old+0.010
ido+0.009
Neg logit contributions
ates-0.014
uders-0.014
ities-0.014
 ears-0.014
 wheels-0.014
Token bird
Token ID6512
Feature activation+1.08e-02

Pos logit contributions
ates+0.021
ps+0.020
ities+0.020
uders+0.019
e+0.019
Neg logit contributions
des-0.032
 inspection-0.030
yes-0.029
and-0.028
bie-0.028
Token.
Token ID13
Feature activation-6.91e-04

Pos logit contributions
des+0.022
and+0.022
to+0.021
ido+0.021
bie+0.021
Neg logit contributions
 ears-0.014
 wheels-0.014
ps-0.013
ities-0.013
 tail-0.013
Token Why
Token ID4162
Feature activation+1.95e-03

Pos logit contributions
 ears+0.001
ps+0.001
 tail+0.001
 wheels+0.001
ities+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
old-0.001
Token are
Token ID389
Feature activation-1.44e-02

Pos logit contributions
des+0.003
to+0.003
ido+0.003
bie+0.003
old+0.003
Neg logit contributions
 wheels-0.003
 ears-0.003
 scales-0.003
ider-0.003
bed-0.003
Token you
Token ID345
Feature activation-2.69e-02

Pos logit contributions
 ears+0.021
 tail+0.020
ps+0.020
ities+0.020
ates+0.020
Neg logit contributions
des-0.027
and-0.026
to-0.026
 inspection-0.025
ido-0.024
Token on
Token ID319
Feature activation-6.75e-02

Pos logit contributions
ities+0.042
ates+0.041
ider+0.040
 scales+0.039
 wheels+0.039
Neg logit contributions
bie-0.045
and-0.044
 Su-0.044
to-0.043
ido-0.043
Token my
Token ID616
Feature activation-1.45e-02

Pos logit contributions
ider+0.110
ities+0.107
ates+0.101
bed+0.098
ps+0.096
Neg logit contributions
des-0.112
to-0.105
and-0.105
 Su-0.103
old-0.102
Token ugly
Token ID13400
Feature activation-2.09e-02

Pos logit contributions
 ears+0.019
 wheels+0.018
ps+0.018
 tail+0.018
uders+0.018
Neg logit contributions
des-0.029
and-0.028
to-0.028
ido-0.027
 inspection-0.026
Token mailbox
Token ID37282
Feature activation-2.44e-02

Pos logit contributions
ps+0.029
e+0.029
ates+0.028
 ears+0.028
ities+0.028
Neg logit contributions
des-0.039
to-0.037
and-0.036
ido-0.036
 inspection-0.035
Token?"
Token ID1701
Feature activation-3.70e-03

Pos logit contributions
ps+0.028
 ears+0.027
ities+0.026
uders+0.026
 scales+0.026
Neg logit contributions
des-0.052
and-0.052
to-0.051
ido-0.049
bie-0.049
Token The
Token ID383
Feature activation-1.18e-02

Pos logit contributions
ities+0.005
ps+0.005
 ears+0.005
uders+0.005
ates+0.004
Neg logit contributions
des-0.008
to-0.008
and-0.007
old-0.007
 inspection-0.007
Token of
Token ID286
Feature activation+7.60e-03

Pos logit contributions
des+0.044
to+0.042
and+0.040
ido+0.040
bie+0.040
Neg logit contributions
 ears-0.028
 wheels-0.027
 tail-0.026
 scales-0.026
 wings-0.026
Token the
Token ID262
Feature activation-1.45e-02

Pos logit contributions
des+0.018
to+0.017
and+0.017
ido+0.016
old+0.016
Neg logit contributions
 ears-0.009
 wheels-0.008
 tail-0.008
 scales-0.008
 wings-0.008
Token perfect
Token ID2818
Feature activation-9.76e-03

Pos logit contributions
 ears+0.017
 wheels+0.016
 tail+0.016
ps+0.015
 scales+0.015
Neg logit contributions
des-0.032
and-0.031
to-0.031
ido-0.030
 inspection-0.030
Token field
Token ID2214
Feature activation-1.30e-02

Pos logit contributions
uders+0.013
ities+0.013
ps+0.013
 ears+0.012
ates+0.012
Neg logit contributions
des-0.018
and-0.017
to-0.017
ido-0.017
yes-0.017
Token to
Token ID284
Feature activation+1.44e-02

Pos logit contributions
 ears+0.014
 tail+0.013
 wheels+0.013
 scales+0.012
ps+0.012
Neg logit contributions
des-0.030
and-0.029
to-0.029
ido-0.028
 inspection-0.028
Token gra
Token ID7933
Feature activation-6.74e-02

Pos logit contributions
des+0.030
and+0.029
to+0.029
ido+0.028
bie+0.028
Neg logit contributions
 ears-0.019
 tail-0.018
 wheels-0.018
e-0.017
 scales-0.017
Tokenze
Token ID2736
Feature activation-2.41e-02

Pos logit contributions
 wheels+0.064
ities+0.064
ider+0.057
 loser+0.056
uders+0.055
Neg logit contributions
to-0.156
des-0.141
war-0.129
old-0.129
and-0.128
Token.
Token ID13
Feature activation-5.87e-03

Pos logit contributions
 ears+0.026
 wheels+0.025
 tail+0.025
uders+0.025
ps+0.025
Neg logit contributions
and-0.053
des-0.053
to-0.052
ido-0.049
 inspection-0.049
Token He
Token ID679
Feature activation-1.73e-02

Pos logit contributions
 ears+0.007
 wheels+0.007
 tail+0.007
 middle+0.007
 scales+0.007
Neg logit contributions
des-0.013
to-0.012
and-0.012
 inspection-0.012
ido-0.012
Token wanted
Token ID2227
Feature activation+9.24e-03

Pos logit contributions
ities+0.028
uders+0.028
 ears+0.028
 wheels+0.028
ates+0.028
Neg logit contributions
des-0.027
to-0.026
bie-0.026
and-0.026
old-0.025
Token to
Token ID284
Feature activation+4.31e-04

Pos logit contributions
des+0.014
and+0.013
to+0.013
ido+0.013
 inspection+0.012
Neg logit contributions
 ears-0.017
 wheels-0.017
 tail-0.017
 scales-0.016
 wings-0.016
Token,
Token ID11
Feature activation-9.73e-03

Pos logit contributions
 ears+0.005
 wheels+0.005
 tail+0.005
 scales+0.005
ps+0.005
Neg logit contributions
des-0.008
and-0.008
to-0.008
ido-0.007
 inspection-0.007
Token a
Token ID257
Feature activation+4.92e-03

Pos logit contributions
 ears+0.011
 tail+0.011
 wheels+0.010
 scales+0.010
e+0.009
Neg logit contributions
des-0.023
and-0.022
to-0.021
rig-0.020
old-0.020
Token 3
Token ID513
Feature activation+2.27e-03

Pos logit contributions
des+0.011
and+0.010
to+0.010
ido+0.010
 inspection+0.010
Neg logit contributions
 ears-0.006
 tail-0.006
 wheels-0.006
 scales-0.005
 wings-0.005
Token-
Token ID12
Feature activation+4.40e-02

Pos logit contributions
des+0.004
to+0.004
and+0.004
ido+0.004
 inspection+0.004
Neg logit contributions
 ears-0.003
 wheels-0.003
 tail-0.003
e-0.003
 scales-0.003
Tokenyear
Token ID1941
Feature activation+9.85e-03

Pos logit contributions
 inspection+0.056
rig+0.053
 awaited+0.051
aman+0.050
des+0.048
Neg logit contributions
e-0.094
ps-0.082
 middle-0.082
ates-0.081
 wheels-0.079
Token-
Token ID12
Feature activation+4.23e-02

Pos logit contributions
des+0.021
to+0.020
and+0.020
 inspection+0.019
bie+0.019
Neg logit contributions
 ears-0.013
 wheels-0.012
 scales-0.012
 tail-0.012
 wings-0.012
Tokenold
Token ID727
Feature activation+3.04e-03

Pos logit contributions
 inspection+0.044
rig+0.041
 awaited+0.038
aman+0.037
 proper+0.037
Neg logit contributions
e-0.103
ates-0.088
 middle-0.087
pie-0.084
 knights-0.084
Token boy
Token ID2933
Feature activation-1.38e-02

Pos logit contributions
des+0.006
and+0.006
to+0.006
ido+0.005
 inspection+0.005
Neg logit contributions
 ears-0.004
 wheels-0.004
 tail-0.004
e-0.004
 wings-0.004
Token,
Token ID11
Feature activation-1.21e-02

Pos logit contributions
 ears+0.013
 tail+0.013
 wheels+0.013
ps+0.012
 scales+0.012
Neg logit contributions
des-0.033
and-0.032
to-0.032
ido-0.031
old-0.031
Token was
Token ID373
Feature activation-1.33e-02

Pos logit contributions
 ears+0.017
 wheels+0.016
 tail+0.016
e+0.016
 wings+0.016
Neg logit contributions
des-0.024
and-0.023
to-0.022
ido-0.022
 inspection-0.021
Token curious
Token ID11040
Feature activation+1.66e-03

Pos logit contributions
 ears+0.017
ities+0.016
uders+0.016
 wheels+0.016
 tail+0.016
Neg logit contributions
to-0.027
and-0.027
des-0.026
old-0.026
bie-0.026
TokenThe
Token ID464
Feature activation+3.14e-03

Pos logit contributions
 ears+0.012
 tail+0.012
 wheels+0.012
e+0.012
 scales+0.011
Neg logit contributions
des-0.020
and-0.019
to-0.019
ido-0.019
 inspection-0.019
Token globe
Token ID13342
Feature activation+2.66e-02

Pos logit contributions
des+0.006
and+0.006
bie+0.006
 inspection+0.006
old+0.006
Neg logit contributions
uders-0.004
ities-0.004
 ears-0.004
ates-0.004
e-0.004
Token was
Token ID373
Feature activation+8.01e-03

Pos logit contributions
des+0.049
to+0.047
and+0.047
old+0.046
 inspection+0.045
Neg logit contributions
ities-0.039
 ears-0.038
ates-0.037
uders-0.037
 wheels-0.037
Token so
Token ID523
Feature activation+2.16e-02

Pos logit contributions
des+0.015
and+0.015
to+0.015
bie+0.014
ido+0.014
Neg logit contributions
 ears-0.011
ities-0.011
uders-0.011
 wheels-0.010
 tail-0.010
Token happy
Token ID3772
Feature activation-8.76e-03

Pos logit contributions
des+0.042
to+0.040
and+0.040
ido+0.039
 inspection+0.038
Neg logit contributions
 wheels-0.032
 ears-0.031
 tail-0.031
ps-0.030
 scales-0.030
Token to
Token ID284
Feature activation+4.65e-02

Pos logit contributions
 ears+0.011
 wheels+0.011
 tail+0.010
ps+0.010
uders+0.010
Neg logit contributions
des-0.018
and-0.018
to-0.018
 inspection-0.017
ido-0.017
Token be
Token ID307
Feature activation-1.16e-03

Pos logit contributions
des+0.096
to+0.090
and+0.090
 inspection+0.089
ido+0.089
Neg logit contributions
 ears-0.062
ities-0.058
 wheels-0.057
 tail-0.056
ates-0.056
Token whole
Token ID2187
Feature activation+7.96e-03

Pos logit contributions
 ears+0.002
ities+0.002
uders+0.002
 wheels+0.002
 scales+0.002
Neg logit contributions
des-0.002
and-0.002
to-0.002
old-0.002
bie-0.002
Token again
Token ID757
Feature activation-2.11e-03

Pos logit contributions
des+0.017
and+0.017
to+0.016
ido+0.016
old+0.016
Neg logit contributions
 ears-0.009
 wheels-0.009
 tail-0.009
ps-0.009
uders-0.009
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
 ears+0.002
 wheels+0.002
ps+0.002
 tail+0.002
uders+0.002
Neg logit contributions
des-0.005
and-0.004
to-0.004
 inspection-0.004
bie-0.004
Token it
Token ID340
Feature activation+2.68e-02

Pos logit contributions
des+0.035
ido+0.034
and+0.034
to+0.033
 inspection+0.032
Neg logit contributions
 ears-0.019
 tail-0.018
 wheels-0.018
 wings-0.018
 scales-0.017
Token so
Token ID523
Feature activation+8.35e-03

Pos logit contributions
 ears+0.003
ities+0.003
ates+0.003
uders+0.003
e+0.003
Neg logit contributions
des-0.005
and-0.005
to-0.005
ido-0.005
 inspection-0.005
Token strong
Token ID1913
Feature activation-6.04e-03

Pos logit contributions
des+0.019
and+0.019
to+0.018
ido+0.018
bie+0.017
Neg logit contributions
 ears-0.010
 wheels-0.010
 tail-0.009
ps-0.009
 wings-0.009
Token and
Token ID290
Feature activation+9.63e-03

Pos logit contributions
 ears+0.006
 tail+0.006
 wheels+0.005
uders+0.005
ps+0.005
Neg logit contributions
des-0.014
and-0.014
to-0.014
ido-0.013
 inspection-0.013
Token it
Token ID340
Feature activation+1.77e-02

Pos logit contributions
des+0.023
and+0.022
to+0.022
ido+0.021
old+0.021
Neg logit contributions
 ears-0.011
 wheels-0.011
 tail-0.011
 wings-0.010
 clouds-0.010
Token blew
Token ID17948
Feature activation-1.35e-03

Pos logit contributions
des+0.035
to+0.033
and+0.033
old+0.032
 inspection+0.032
Neg logit contributions
 ears-0.023
ities-0.023
ates-0.023
 tail-0.022
uders-0.021
Token all
Token ID477
Feature activation+4.12e-02

Pos logit contributions
 ears+0.002
 wheels+0.002
 tail+0.002
ps+0.001
 wings+0.001
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
old-0.003
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
des+0.106
to+0.103
and+0.100
ido+0.099
 inspection+0.097
Neg logit contributions
 wheels-0.035
ps-0.034
e-0.033
 ears-0.033
 clouds-0.033
Token trees
Token ID7150
Feature activation+1.34e-02

Pos logit contributions
 ears+0.012
e+0.011
 tail+0.011
ates+0.011
ities+0.011
Neg logit contributions
des-0.023
and-0.022
to-0.021
old-0.021
ido-0.021
Token and
Token ID290
Feature activation-8.22e-03

Pos logit contributions
des+0.032
to+0.031
and+0.031
ido+0.031
old+0.030
Neg logit contributions
 ears-0.013
 wheels-0.013
 tail-0.012
ps-0.012
 wings-0.012
Token destroyed
Token ID6572
Feature activation+2.89e-03

Pos logit contributions
 ears+0.009
 tail+0.008
 wheels+0.008
ates+0.008
ps+0.008
Neg logit contributions
des-0.019
and-0.018
to-0.018
 inspection-0.017
ido-0.017
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
des+0.007
and+0.006
to+0.006
ido+0.006
old+0.006
Neg logit contributions
 ears-0.003
 wheels-0.003
 tail-0.003
ps-0.003
 scales-0.003
Token
Token ID198
Feature activation+1.43e-03

Pos logit contributions
 ears+0.001
 wheels+0.001
 tail+0.001
ps+0.001
 scales+0.001
Neg logit contributions
des-0.001
and-0.001
to-0.001
ido-0.001
old-0.001
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
des+0.003
 inspection+0.003
and+0.003
ido+0.003
ability+0.003
Neg logit contributions
 ears-0.002
 wheels-0.001
 front-0.001
e-0.001
 tail-0.001
Token"
Token ID1
Feature activation+2.06e-02

Pos logit contributions
 ears+0.029
 tail+0.028
 scales+0.027
 wheels+0.027
ps+0.027
Neg logit contributions
des-0.038
and-0.037
to-0.036
ido-0.035
old-0.035
TokenWe
Token ID1135
Feature activation+3.06e-04

Pos logit contributions
des+0.037
and+0.035
to+0.034
ido+0.033
 inspection+0.032
Neg logit contributions
 ears-0.034
 tail-0.032
 wheels-0.032
ps-0.032
 scales-0.031
Token have
Token ID423
Feature activation-3.01e-02

Pos logit contributions
des+0.000
bie+0.000
ido+0.000
to+0.000
old+0.000
Neg logit contributions
 ears-0.001
uders-0.001
 wheels-0.001
 tail-0.001
 scales-0.001
Token something
Token ID1223
Feature activation+4.02e-02

Pos logit contributions
 ears+0.041
uders+0.041
ps+0.041
 tail+0.040
 wheels+0.040
Neg logit contributions
des-0.059
and-0.055
to-0.055
bie-0.053
old-0.053
Token for
Token ID329
Feature activation-2.18e-02

Pos logit contributions
des+0.068
and+0.065
to+0.064
ido+0.062
 inspection+0.061
Neg logit contributions
 ears-0.067
 wheels-0.065
 tail-0.065
ps-0.063
 scales-0.063
Token you
Token ID345
Feature activation-2.18e-02

Pos logit contributions
 wheels+0.033
 tail+0.033
 ears+0.033
 scales+0.032
ider+0.032
Neg logit contributions
and-0.035
to-0.034
des-0.034
ido-0.033
old-0.033
Token,
Token ID11
Feature activation+3.13e-03

Pos logit contributions
 ears+0.026
 wheels+0.025
 tail+0.025
ps+0.025
uders+0.025
Neg logit contributions
des-0.046
and-0.046
to-0.044
bie-0.043
ido-0.043
Token L
Token ID406
Feature activation+6.21e-03

Pos logit contributions
des+0.005
and+0.005
ido+0.005
to+0.005
bie+0.005
Neg logit contributions
 ears-0.005
 tail-0.005
ps-0.005
 wheels-0.005
ates-0.005
Tokenila
Token ID10102
Feature activation-2.41e-02

Pos logit contributions
des+0.012
and+0.011
to+0.011
 inspection+0.010
bie+0.010
Neg logit contributions
e-0.009
 ears-0.009
ps-0.008
uders-0.008
 wheels-0.008
Token,
Token ID11
Feature activation+6.50e-03

Pos logit contributions
des+0.009
and+0.009
to+0.009
ido+0.009
 inspection+0.009
Neg logit contributions
 ears-0.004
 wheels-0.004
 tail-0.004
ps-0.004
 scales-0.004
Token it
Token ID340
Feature activation+1.56e-02

Pos logit contributions
des+0.010
to+0.010
and+0.010
ido+0.009
bie+0.009
Neg logit contributions
 ears-0.012
 wheels-0.012
 scales-0.011
 tail-0.011
 walls-0.011
Token must
Token ID1276
Feature activation+1.14e-03

Pos logit contributions
des+0.033
and+0.031
to+0.031
ido+0.030
bie+0.030
Neg logit contributions
 ears-0.019
 wheels-0.019
 tail-0.019
uders-0.018
ities-0.018
Token belong
Token ID5594
Feature activation+2.21e-02

Pos logit contributions
des+0.002
ido+0.002
to+0.002
and+0.002
ll+0.002
Neg logit contributions
 ears-0.002
 wheels-0.001
 tail-0.001
ates-0.001
ities-0.001
Token to
Token ID284
Feature activation-7.40e-03

Pos logit contributions
des+0.034
and+0.033
to+0.032
ido+0.031
 inspection+0.030
Neg logit contributions
 ears-0.040
 wheels-0.039
 tail-0.039
ps-0.038
 scales-0.038
Token someone
Token ID2130
Feature activation+4.05e-02

Pos logit contributions
 ears+0.009
 wheels+0.008
 tail+0.008
ps+0.008
 wings+0.008
Neg logit contributions
des-0.016
and-0.016
to-0.016
ido-0.015
old-0.015
Token else
Token ID2073
Feature activation+1.10e-02

Pos logit contributions
des+0.076
to+0.072
and+0.072
ido+0.070
 inspection+0.069
Neg logit contributions
 ears-0.060
 tail-0.057
 wheels-0.057
e-0.057
 wings-0.055
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
des+0.026
and+0.025
to+0.024
ido+0.024
old+0.023
Neg logit contributions
 ears-0.012
 wheels-0.011
 tail-0.011
ps-0.011
 scales-0.010
Token We
Token ID775
Feature activation+8.92e-03

Pos logit contributions
 ears+0.002
 tail+0.002
 wheels+0.002
ps+0.002
 scales+0.002
Neg logit contributions
des-0.002
and-0.002
to-0.002
ido-0.002
old-0.002
Token must
Token ID1276
Feature activation-4.76e-03

Pos logit contributions
des+0.016
and+0.015
to+0.015
ido+0.014
 inspection+0.014
Neg logit contributions
 ears-0.014
 tail-0.014
 wheels-0.014
ps-0.014
 scales-0.014
Token give
Token ID1577
Feature activation+2.05e-02

Pos logit contributions
 ears+0.007
 wheels+0.007
 tail+0.006
ates+0.006
 scales+0.006
Neg logit contributions
des-0.009
ido-0.009
to-0.009
and-0.008
old-0.008
Token it
Token ID340
Feature activation+4.69e-04

Pos logit contributions
ates+0.035
uders+0.034
 wheels+0.034
 ears+0.034
ities+0.034
Neg logit contributions
des-0.041
to-0.040
and-0.040
 inspection-0.039
old-0.038
Token hurt
Token ID5938
Feature activation-6.71e-03

Pos logit contributions
des+0.001
to+0.001
and+0.001
 inspection+0.001
yes+0.001
Neg logit contributions
ates-0.001
ities-0.001
uders-0.001
 ears-0.001
 tail-0.001
Token a
Token ID257
Feature activation+3.49e-03

Pos logit contributions
 ears+0.007
 wheels+0.007
uders+0.007
 tail+0.007
 scales+0.007
Neg logit contributions
des-0.015
and-0.015
to-0.014
ido-0.014
 inspection-0.014
Token lot
Token ID1256
Feature activation+2.24e-02

Pos logit contributions
des+0.007
and+0.007
to+0.007
old+0.007
ido+0.007
Neg logit contributions
 ears-0.005
 tail-0.005
 wheels-0.004
 scales-0.004
 wings-0.004
Token!
Token ID0
Feature activation+1.74e-03

Pos logit contributions
des+0.056
to+0.054
and+0.054
ido+0.052
bie+0.052
Neg logit contributions
 ears-0.020
 wheels-0.019
 tail-0.018
ps-0.018
 scales-0.017
Token
Token ID198
Feature activation+3.01e-03

Pos logit contributions
des+0.003
ido+0.003
and+0.003
to+0.003
bie+0.003
Neg logit contributions
 ears-0.003
 wheels-0.002
 walls-0.002
 tail-0.002
 skies-0.002
Token
Token ID198
Feature activation-2.69e-02

Pos logit contributions
des+0.008
ido+0.007
ability+0.007
bie+0.007
 proper+0.007
Neg logit contributions
 front-0.002
 middle-0.002
e-0.002
 voice-0.002
 with-0.002
TokenFortunately
Token ID31276
Feature activation-2.11e-02

Pos logit contributions
 ears+0.049
 wheels+0.048
 tail+0.048
ps+0.046
 scales+0.046
Neg logit contributions
des-0.041
and-0.039
to-0.039
ido-0.037
 inspection-0.036
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
 ears+0.011
 tail+0.010
 wheels+0.009
 wings+0.009
e+0.008
Neg logit contributions
des-0.061
and-0.059
to-0.058
ido-0.056
yes-0.055
Token Jane
Token ID12091
Feature activation-1.30e-02

Pos logit contributions
 ears+0.028
 wheels+0.027
 tail+0.027
ps+0.026
 scales+0.026
Neg logit contributions
des-0.018
and-0.017
to-0.017
ido-0.016
old-0.015
Token had
Token ID550
Feature activation-3.24e-02

Pos logit contributions
 ears+0.018
 wheels+0.018
 tail+0.018
uders+0.018
ates+0.018
Neg logit contributions
des-0.025
to-0.024
and-0.024
old-0.023
bie-0.023
Token and
Token ID290
Feature activation-9.28e-03

Pos logit contributions
des+0.002
and+0.002
to+0.002
ido+0.002
 inspection+0.002
Neg logit contributions
 ears-0.001
 wheels-0.001
 tail-0.001
ps-0.001
 scales-0.001
Token Sam
Token ID3409
Feature activation-1.08e-02

Pos logit contributions
 ears+0.011
 tail+0.010
e+0.010
 wheels+0.010
 wings+0.010
Neg logit contributions
des-0.021
and-0.020
to-0.020
 inspection-0.019
rig-0.019
Token are
Token ID389
Feature activation-5.29e-03

Pos logit contributions
 ears+0.017
 tail+0.016
 wheels+0.016
 scales+0.016
uders+0.016
Neg logit contributions
des-0.020
and-0.018
to-0.018
ido-0.018
old-0.018
Token brothers
Token ID9397
Feature activation-6.13e-03

Pos logit contributions
 ears+0.005
 wheels+0.005
 tail+0.005
ps+0.004
 scales+0.004
Neg logit contributions
des-0.013
and-0.013
to-0.013
ido-0.012
 inspection-0.012
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
uders+0.009
 wheels+0.008
ates+0.008
ities+0.008
 scales+0.008
Neg logit contributions
des-0.011
and-0.011
old-0.011
bie-0.010
to-0.010
Token They
Token ID1119
Feature activation+3.05e-03

Pos logit contributions
des+0.003
to+0.003
and+0.003
ido+0.003
old+0.003
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
e-0.002
 scales-0.002
Token like
Token ID588
Feature activation+1.36e-02

Pos logit contributions
des+0.006
ido+0.006
and+0.005
old+0.005
yes+0.005
Neg logit contributions
 ears-0.004
uders-0.004
 tail-0.004
 wheels-0.004
ities-0.004
Token to
Token ID284
Feature activation+1.09e-02

Pos logit contributions
des+0.023
and+0.021
ido+0.021
 inspection+0.021
to+0.021
Neg logit contributions
 ears-0.023
 tail-0.023
 wheels-0.022
 feathers-0.022
 scales-0.022
Token play
Token ID711
Feature activation-2.25e-02

Pos logit contributions
des+0.017
to+0.015
and+0.015
ido+0.015
yes+0.014
Neg logit contributions
uders-0.020
 ears-0.020
ps-0.020
ities-0.020
ates-0.020
Token with
Token ID351
Feature activation-5.11e-03

Pos logit contributions
uders+0.034
 ears+0.033
ps+0.032
ities+0.031
 wheels+0.031
Neg logit contributions
and-0.040
to-0.040
des-0.038
yes-0.036
ability-0.035
Token their
Token ID511
Feature activation-9.91e-03

Pos logit contributions
ider+0.008
uders+0.008
ities+0.008
ps+0.008
ates+0.008
Neg logit contributions
and-0.008
des-0.008
to-0.008
old-0.008
ido-0.008
Token"
Token ID1
Feature activation+2.32e-02

Pos logit contributions
 wheels+0.007
 tail+0.007
 ears+0.007
e+0.007
bed+0.007
Neg logit contributions
des-0.011
and-0.010
 inspection-0.010
ido-0.010
bie-0.010
TokenIt
Token ID1026
Feature activation+2.16e-02

Pos logit contributions
des+0.044
and+0.042
bie+0.042
to+0.042
ido+0.041
Neg logit contributions
 wheels-0.032
 tail-0.032
 ears-0.032
 scales-0.030
 feathers-0.030
Token's
Token ID338
Feature activation-1.36e-02

Pos logit contributions
des+0.053
and+0.052
ido+0.050
to+0.050
 inspection+0.049
Neg logit contributions
 ears-0.021
 tail-0.019
 wheels-0.019
ps-0.018
 wings-0.018
Token okay
Token ID8788
Feature activation+8.57e-05

Pos logit contributions
uders+0.024
 ears+0.024
 scales+0.023
ates+0.023
ps+0.023
Neg logit contributions
des-0.021
ido-0.019
and-0.019
to-0.019
bie-0.018
Token,
Token ID11
Feature activation+7.10e-03

Pos logit contributions
des+0.000
and+0.000
to+0.000
ido+0.000
old+0.000
Neg logit contributions
 ears-0.000
 tail-0.000
 wheels-0.000
 wings-0.000
 scales-0.000
Token but
Token ID475
Feature activation+7.89e-03

Pos logit contributions
des+0.013
and+0.013
to+0.013
ido+0.012
old+0.011
Neg logit contributions
 ears-0.012
 wheels-0.012
 tail-0.011
 wings-0.011
 feathers-0.011
Token you
Token ID345
Feature activation-6.37e-03

Pos logit contributions
des+0.017
and+0.017
to+0.017
ido+0.016
old+0.016
Neg logit contributions
 ears-0.010
 wheels-0.009
 tail-0.009
 wings-0.009
 scales-0.009
Token need
Token ID761
Feature activation+1.41e-02

Pos logit contributions
 ears+0.011
 tail+0.010
 wheels+0.010
 scales+0.010
ates+0.010
Neg logit contributions
des-0.011
and-0.010
to-0.010
bie-0.010
ido-0.010
Token to
Token ID284
Feature activation+1.66e-05

Pos logit contributions
des+0.020
and+0.019
to+0.019
ido+0.018
old+0.018
Neg logit contributions
 ears-0.027
 tail-0.027
 wheels-0.027
uders-0.026
ps-0.026
Token ask
Token ID1265
Feature activation-1.25e-02

Pos logit contributions
des+0.000
and+0.000
ido+0.000
to+0.000
old+0.000
Neg logit contributions
 ears-0.000
uders-0.000
 scales-0.000
 wheels-0.000
ates-0.000
Token before
Token ID878
Feature activation-2.54e-02

Pos logit contributions
 ears+0.015
 wheels+0.015
 tail+0.015
uders+0.014
 scales+0.014
Neg logit contributions
des-0.026
and-0.026
to-0.025
ido-0.024
old-0.024
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
 ears+0.018
 wheels+0.017
 tail+0.017
ities+0.017
ps+0.017
Neg logit contributions
des-0.024
to-0.023
and-0.023
ido-0.022
 inspection-0.022
Token playing
Token ID2712
Feature activation-1.71e-02

Pos logit contributions
ities+0.018
uders+0.017
 ears+0.017
 wheels+0.016
ates+0.016
Neg logit contributions
and-0.024
to-0.023
ido-0.023
des-0.022
bie-0.022
Token with
Token ID351
Feature activation+5.40e-03

Pos logit contributions
uders+0.017
 ears+0.017
ities+0.017
ates+0.016
 wheels+0.016
Neg logit contributions
and-0.040
des-0.038
to-0.037
 inspection-0.036
old-0.036
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
des+0.012
and+0.012
to+0.011
ido+0.011
 inspection+0.011
Neg logit contributions
 ears-0.006
 wheels-0.006
 tail-0.006
ps-0.006
 scales-0.006
Token big
Token ID1263
Feature activation+1.56e-02

Pos logit contributions
des+0.024
and+0.023
to+0.023
ido+0.022
old+0.022
Neg logit contributions
 ears-0.014
 tail-0.014
 wheels-0.013
 scales-0.013
 wings-0.013
Token,
Token ID11
Feature activation+7.41e-03

Pos logit contributions
des+0.030
 inspection+0.029
ido+0.029
bie+0.029
and+0.028
Neg logit contributions
ates-0.020
ities-0.020
ps-0.020
uders-0.019
ider-0.019
Token furry
Token ID40857
Feature activation-2.74e-03

Pos logit contributions
des+0.014
and+0.013
to+0.013
ido+0.013
 inspection+0.013
Neg logit contributions
uders-0.010
 ears-0.010
ates-0.010
ities-0.010
ps-0.010
Token bear
Token ID6842
Feature activation-6.25e-03

Pos logit contributions
ps+0.003
ates+0.003
ities+0.003
uders+0.003
ider+0.003
Neg logit contributions
ido-0.005
bie-0.005
 inspection-0.005
and-0.005
des-0.005
Token.
Token ID13
Feature activation+2.22e-03

Pos logit contributions
 wheels+0.007
 ears+0.007
ps+0.007
ates+0.007
uders+0.007
Neg logit contributions
and-0.013
des-0.013
to-0.012
ido-0.012
old-0.012
Token The
Token ID383
Feature activation-4.85e-03

Pos logit contributions
des+0.004
and+0.004
to+0.004
ido+0.004
 inspection+0.004
Neg logit contributions
 ears-0.003
ps-0.003
 wheels-0.003
 tail-0.003
 scales-0.003
Token bear
Token ID6842
Feature activation+1.01e-02

Pos logit contributions
uders+0.007
ates+0.007
e+0.006
ities+0.006
bed+0.006
Neg logit contributions
des-0.009
bie-0.009
and-0.008
yes-0.008
ido-0.008
Token really
Token ID1107
Feature activation-1.44e-02

Pos logit contributions
uders+0.040
ates+0.038
ps+0.036
ities+0.036
 ears+0.034
Neg logit contributions
and-0.071
des-0.070
to-0.069
ido-0.065
old-0.064
Token badly
Token ID11234
Feature activation-2.63e-02

Pos logit contributions
uders+0.024
 ears+0.024
 wheels+0.024
ates+0.023
 tail+0.023
Neg logit contributions
des-0.023
and-0.023
to-0.022
ido-0.021
 inspection-0.021
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
 ears+0.026
 wheels+0.024
 tail+0.024
uders+0.024
 scales+0.023
Neg logit contributions
des-0.062
and-0.061
to-0.060
ido-0.059
 inspection-0.058
Token he
Token ID339
Feature activation-2.86e-02

Pos logit contributions
ates+0.050
ps+0.048
uders+0.047
ider+0.047
ities+0.047
Neg logit contributions
des-0.041
to-0.039
and-0.038
 inspection-0.038
bie-0.037
Token couldn
Token ID3521
Feature activation-4.24e-03

Pos logit contributions
ates+0.051
uders+0.050
ps+0.050
ities+0.050
 wheels+0.048
Neg logit contributions
to-0.044
des-0.043
bie-0.042
and-0.041
old-0.038
Token't
Token ID470
Feature activation+1.37e-02

Pos logit contributions
 ears+0.004
ities+0.004
 tail+0.004
 scales+0.004
 wheels+0.004
Neg logit contributions
des-0.010
to-0.010
and-0.010
ido-0.009
old-0.009
Token walk
Token ID2513
Feature activation-3.34e-03

Pos logit contributions
des+0.025
and+0.024
to+0.024
ido+0.023
old+0.022
Neg logit contributions
 ears-0.021
uders-0.020
 wheels-0.020
ates-0.020
 tail-0.020
Token anymore
Token ID7471
Feature activation-1.63e-02

Pos logit contributions
 ears+0.004
uders+0.004
ities+0.004
ates+0.004
 wheels+0.004
Neg logit contributions
des-0.007
and-0.007
to-0.007
ido-0.007
 inspection-0.007
Token.
Token ID13
Feature activation-1.50e-03

Pos logit contributions
 ears+0.019
ps+0.019
 wheels+0.018
 scales+0.018
uders+0.018
Neg logit contributions
and-0.036
des-0.035
to-0.034
 inspection-0.033
old-0.033
Token
Token ID220
Feature activation+2.06e-02

Pos logit contributions
 ears+0.002
 tail+0.002
 wheels+0.002
 scales+0.002
 wings+0.002
Neg logit contributions
des-0.003
and-0.003
ido-0.002
to-0.002
old-0.002
Token
Token ID198
Feature activation+1.32e-03

Pos logit contributions
des+0.053
 inspection+0.046
iden+0.046
yes+0.046
bek+0.046
Neg logit contributions
 middle-0.016
ates-0.016
 wheels-0.016
 ears-0.016
 tail-0.016
Token They
Token ID1119
Feature activation-1.57e-03

Pos logit contributions
 ears+0.011
 wheels+0.011
 tail+0.011
e+0.010
 scales+0.010
Neg logit contributions
des-0.016
and-0.015
to-0.015
ido-0.015
 inspection-0.015
Token follow
Token ID1061
Feature activation-1.05e-03

Pos logit contributions
 ears+0.002
 tail+0.002
 wheels+0.002
uders+0.002
ps+0.002
Neg logit contributions
des-0.003
and-0.003
to-0.003
ido-0.003
old-0.003
Token Dad
Token ID17415
Feature activation+2.39e-02

Pos logit contributions
 ears+0.001
 wheels+0.001
 tail+0.001
ps+0.001
 scales+0.001
Neg logit contributions
des-0.002
and-0.002
to-0.002
ido-0.002
 inspection-0.002
Token to
Token ID284
Feature activation+8.42e-03

Pos logit contributions
des+0.049
 inspection+0.045
yes+0.044
ability+0.044
ido+0.043
Neg logit contributions
 ears-0.033
e-0.032
ps-0.031
 wheels-0.030
 tail-0.030
Token the
Token ID262
Feature activation-4.86e-03

Pos logit contributions
des+0.023
and+0.022
to+0.022
 inspection+0.021
old+0.021
Neg logit contributions
 ears-0.006
 wheels-0.006
 tail-0.006
 front-0.005
ps-0.005
Token car
Token ID1097
Feature activation-4.16e-03

Pos logit contributions
 ears+0.006
 wheels+0.006
 tail+0.006
ps+0.006
 scales+0.006
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.009
 inspection-0.009
Token.
Token ID13
Feature activation-2.49e-03

Pos logit contributions
 ears+0.004
 tail+0.004
 wheels+0.004
ps+0.004
 scales+0.004
Neg logit contributions
des-0.010
and-0.010
to-0.010
ido-0.009
old-0.009
Token
Token ID198
Feature activation+4.75e-04

Pos logit contributions
 ears+0.003
 wheels+0.003
 tail+0.003
 front+0.003
uders+0.003
Neg logit contributions
des-0.005
and-0.005
to-0.005
 inspection-0.005
old-0.005
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
des+0.001
 inspection+0.001
yes+0.001
bie+0.001
to+0.001
Neg logit contributions
 front-0.000
 middle-0.000
 wheels-0.000
 tail-0.000
e-0.000
TokenThey
Token ID2990
Feature activation-5.56e-03

Pos logit contributions
 ears+0.019
 wheels+0.019
 tail+0.019
e+0.018
uders+0.017
Neg logit contributions
des-0.037
and-0.035
to-0.035
 inspection-0.034
bie-0.034
Token go
Token ID467
Feature activation+2.36e-02

Pos logit contributions
 ears+0.008
 wheels+0.008
 tail+0.008
ps+0.008
 scales+0.007
Neg logit contributions
des-0.011
and-0.010
to-0.010
ido-0.010
 inspection-0.010
Token herself
Token ID5223
Feature activation+8.54e-03

Pos logit contributions
des+0.067
and+0.064
to+0.064
ido+0.062
bie+0.061
Neg logit contributions
 ears-0.024
 wheels-0.022
 tail-0.022
ps-0.022
 scales-0.021
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
des+0.020
and+0.020
to+0.019
ido+0.019
 inspection+0.019
Neg logit contributions
 ears-0.009
 tail-0.008
 wheels-0.008
ps-0.008
 wings-0.008
Token
Token ID220
Feature activation+1.86e-02

Pos logit contributions
des+0.037
ido+0.034
and+0.033
 inspection+0.032
to+0.032
Neg logit contributions
 ears-0.018
 tail-0.016
bed-0.015
 wheels-0.015
 middle-0.015
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
des+0.046
 inspection+0.041
ido+0.040
yes+0.039
iden+0.038
Neg logit contributions
ates-0.019
 ears-0.018
bed-0.017
 wheels-0.017
ps-0.017
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
des+0.032
ido+0.029
bie+0.029
 inspection+0.029
ait+0.028
Neg logit contributions
bed-0.013
e-0.013
 middle-0.013
 tail-0.012
 ears-0.012
TokenFrom
Token ID4863
Feature activation-1.77e-02

Pos logit contributions
 ears+0.017
 wheels+0.016
 tail+0.016
ps+0.016
 scales+0.016
Neg logit contributions
des-0.026
and-0.025
to-0.024
ido-0.024
 inspection-0.023
Token that
Token ID326
Feature activation+9.88e-03

Pos logit contributions
 ears+0.024
ities+0.023
 tail+0.023
uders+0.023
 wheels+0.023
Neg logit contributions
des-0.035
ido-0.034
old-0.034
to-0.033
and-0.033
Token day
Token ID1110
Feature activation-2.53e-02

Pos logit contributions
des+0.018
and+0.017
to+0.017
ido+0.017
 inspection+0.016
Neg logit contributions
 ears-0.016
 tail-0.015
 end-0.015
e-0.015
 wheels-0.015
Token on
Token ID319
Feature activation-3.74e-02

Pos logit contributions
 ears+0.026
 scales+0.025
uders+0.025
 wheels+0.025
 tail+0.024
Neg logit contributions
des-0.059
and-0.057
to-0.057
ido-0.055
old-0.054
Token,
Token ID11
Feature activation+7.28e-03

Pos logit contributions
 scales+0.049
ps+0.048
ities+0.048
 wheels+0.047
uders+0.047
Neg logit contributions
 inspection-0.073
des-0.070
 Su-0.069
ido-0.069
bie-0.069
Token Lily
Token ID20037
Feature activation-6.87e-04

Pos logit contributions
des+0.013
and+0.012
to+0.012
ido+0.011
 inspection+0.011
Neg logit contributions
 ears-0.012
ps-0.012
 wheels-0.012
uders-0.012
 tail-0.012
Token had
Token ID550
Feature activation-3.45e-02

Pos logit contributions
des+0.016
to+0.016
and+0.016
old+0.015
ido+0.015
Neg logit contributions
 ears-0.015
 wheels-0.014
 tail-0.014
ates-0.014
ities-0.014
Token had
Token ID550
Feature activation-2.90e-02

Pos logit contributions
 ears+0.042
 tail+0.040
uders+0.040
ates+0.040
ps+0.040
Neg logit contributions
des-0.074
to-0.071
and-0.068
 inspection-0.066
old-0.066
Token enough
Token ID1576
Feature activation-4.63e-03

Pos logit contributions
 ears+0.035
 tail+0.034
uders+0.034
ps+0.033
 wheels+0.033
Neg logit contributions
des-0.062
to-0.059
and-0.058
old-0.056
 inspection-0.056
Token.
Token ID13
Feature activation-1.88e-02

Pos logit contributions
 ears+0.007
uders+0.006
 tail+0.006
e+0.006
 wheels+0.006
Neg logit contributions
des-0.009
and-0.009
to-0.008
old-0.008
ido-0.008
Token He
Token ID679
Feature activation-8.21e-03

Pos logit contributions
 ears+0.030
ps+0.029
 tail+0.029
e+0.029
uders+0.029
Neg logit contributions
des-0.032
and-0.032
to-0.031
old-0.030
ido-0.029
Token yelled
Token ID22760
Feature activation-2.67e-02

Pos logit contributions
 ears+0.013
ities+0.013
uders+0.013
ates+0.013
 tail+0.013
Neg logit contributions
des-0.014
to-0.014
and-0.013
old-0.013
bie-0.013
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
uders+0.037
ates+0.036
ities+0.036
 wheels+0.035
 ears+0.035
Neg logit contributions
and-0.052
to-0.051
des-0.048
old-0.046
 inspection-0.046
Token punched
Token ID25436
Feature activation-1.17e-02

Pos logit contributions
 ears+0.027
 wheels+0.026
 tail+0.026
uders+0.026
ities+0.026
Neg logit contributions
des-0.036
and-0.034
to-0.034
ido-0.032
 inspection-0.032
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
 ears+0.013
uders+0.012
 tail+0.012
 wheels+0.012
ates+0.012
Neg logit contributions
and-0.025
des-0.025
to-0.025
 inspection-0.024
ido-0.024
Token wall
Token ID3355
Feature activation-2.38e-03

Pos logit contributions
 ears+0.022
uders+0.022
e+0.022
ates+0.021
ps+0.021
Neg logit contributions
des-0.024
and-0.023
to-0.022
ido-0.021
old-0.021
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 ears+0.003
uders+0.003
ities+0.003
ates+0.003
 tail+0.003
Neg logit contributions
and-0.005
des-0.005
to-0.005
 inspection-0.005
old-0.005
Token they
Token ID484
Feature activation+3.20e-03

Pos logit contributions
des+0.010
and+0.009
to+0.009
ido+0.009
old+0.008
Neg logit contributions
 ears-0.010
 wheels-0.010
 tail-0.010
ps-0.009
 scales-0.009
Token still
Token ID991
Feature activation+7.33e-03

Pos logit contributions
des+0.006
and+0.006
to+0.006
ido+0.006
 inspection+0.006
Neg logit contributions
 ears-0.004
 wheels-0.004
 tail-0.004
ps-0.004
 scales-0.004
Token looked
Token ID3114
Feature activation+2.03e-02

Pos logit contributions
des+0.015
and+0.014
to+0.014
ido+0.013
 inspection+0.013
Neg logit contributions
 ears-0.010
 wheels-0.010
 tail-0.010
ps-0.010
 scales-0.009
Token sad
Token ID6507
Feature activation-8.39e-03

Pos logit contributions
des+0.045
and+0.043
to+0.043
ido+0.041
 inspection+0.041
Neg logit contributions
 ears-0.024
 wheels-0.023
 tail-0.023
ps-0.022
 scales-0.022
Token.
Token ID13
Feature activation+9.38e-04

Pos logit contributions
 ears+0.009
uders+0.009
e+0.009
 tail+0.009
 wheels+0.009
Neg logit contributions
and-0.018
des-0.018
to-0.018
 inspection-0.017
ido-0.017
Token They
Token ID1119
Feature activation-8.69e-03

Pos logit contributions
des+0.002
and+0.002
to+0.002
ido+0.002
old+0.002
Neg logit contributions
 ears-0.001
 tail-0.001
 wheels-0.001
 scales-0.001
ps-0.001
Token hoped
Token ID10719
Feature activation+1.70e-02

Pos logit contributions
 ears+0.013
 wheels+0.012
 tail+0.012
ps+0.012
 scales+0.012
Neg logit contributions
des-0.017
and-0.016
to-0.016
ido-0.015
 inspection-0.015
Token the
Token ID262
Feature activation+3.07e-04

Pos logit contributions
des+0.039
and+0.038
to+0.037
ido+0.036
 inspection+0.035
Neg logit contributions
 ears-0.019
 wheels-0.018
 tail-0.018
 scales-0.017
ps-0.017
Token baby
Token ID5156
Feature activation+1.43e-02

Pos logit contributions
des+0.001
and+0.001
ido+0.001
to+0.001
 inspection+0.001
Neg logit contributions
 ears-0.000
ities-0.000
e-0.000
uders-0.000
 tail-0.000
Token k
Token ID479
Feature activation+9.02e-03

Pos logit contributions
des+0.029
and+0.028
ido+0.028
bie+0.028
to+0.028
Neg logit contributions
 ears-0.017
ities-0.017
uders-0.016
 tail-0.016
ates-0.016
Tokenang
Token ID648
Feature activation-3.00e-03

Pos logit contributions
des+0.012
and+0.012
to+0.012
ido+0.012
old+0.011
Neg logit contributions
 ears-0.017
 tail-0.017
ities-0.017
 wheels-0.017
 scales-0.017
Token cannot
Token ID2314
Feature activation+9.75e-03

Pos logit contributions
 tail+0.029
ates+0.029
uders+0.029
ities+0.028
 wheels+0.028
Neg logit contributions
des-0.036
old-0.034
bie-0.034
to-0.034
ido-0.033
Token go
Token ID467
Feature activation-6.28e-03

Pos logit contributions
des+0.016
ido+0.015
to+0.015
and+0.015
bie+0.014
Neg logit contributions
ates-0.016
 ears-0.016
uders-0.016
 wheels-0.016
ps-0.016
Token to
Token ID284
Feature activation-3.48e-03

Pos logit contributions
 scales+0.014
uders+0.014
 ears+0.014
ities+0.014
 tail+0.014
Neg logit contributions
and-0.007
des-0.007
to-0.007
ido-0.006
old-0.006
Token school
Token ID1524
Feature activation-2.20e-02

Pos logit contributions
 ears+0.004
ates+0.004
 wheels+0.004
 tail+0.004
ities+0.004
Neg logit contributions
des-0.007
and-0.007
to-0.007
ido-0.007
 inspection-0.007
Token.
Token ID13
Feature activation-9.72e-05

Pos logit contributions
 ears+0.025
 tail+0.024
 scales+0.024
ps+0.024
 wheels+0.024
Neg logit contributions
and-0.049
des-0.048
to-0.046
ido-0.045
old-0.045
Token She
Token ID1375
Feature activation-2.19e-02

Pos logit contributions
 ears+0.000
 wheels+0.000
 middle+0.000
 tail+0.000
 hair+0.000
Neg logit contributions
des-0.000
and-0.000
to-0.000
 inspection-0.000
ido-0.000
Token misses
Token ID18297
Feature activation+1.97e-04

Pos logit contributions
ates+0.033
 tail+0.033
uders+0.032
ities+0.031
bed+0.031
Neg logit contributions
des-0.039
old-0.037
bie-0.037
to-0.037
ido-0.035
Token Ben
Token ID3932
Feature activation-2.43e-02

Pos logit contributions
des+0.000
and+0.000
to+0.000
ido+0.000
bie+0.000
Neg logit contributions
 ears-0.000
 tail-0.000
 wheels-0.000
uders-0.000
 scales-0.000
Token a
Token ID257
Feature activation-3.44e-03

Pos logit contributions
uders+0.026
 scales+0.026
 wheels+0.026
 ears+0.025
ates+0.025
Neg logit contributions
and-0.053
des-0.052
to-0.050
ido-0.050
old-0.049
Token lot
Token ID1256
Feature activation+1.10e-02

Pos logit contributions
 ears+0.005
 tail+0.004
 wheels+0.004
 scales+0.004
 wings+0.004
Neg logit contributions
des-0.007
and-0.007
to-0.007
ido-0.007
old-0.007
Token.
Token ID13
Feature activation-5.23e-03

Pos logit contributions
des+0.026
and+0.026
to+0.025
ido+0.024
old+0.024
Neg logit contributions
 ears-0.011
 tail-0.011
 wheels-0.010
ps-0.010
 scales-0.010
Token excited
Token ID6568
Feature activation-2.04e-02

Pos logit contributions
 ears+0.001
 wheels+0.001
 tail+0.001
 middle+0.001
e+0.001
Neg logit contributions
des-0.001
and-0.001
 inspection-0.001
to-0.001
ido-0.001
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
 ears+0.029
 wheels+0.027
 tail+0.027
ps+0.026
 scales+0.026
Neg logit contributions
des-0.040
and-0.039
to-0.038
ido-0.037
 inspection-0.036
Token She
Token ID1375
Feature activation-1.34e-02

Pos logit contributions
 ears+0.023
 wheels+0.022
 tail+0.021
 walls+0.020
 clouds+0.020
Neg logit contributions
des-0.036
to-0.034
and-0.033
ll-0.033
ido-0.033
Token was
Token ID373
Feature activation-1.36e-02

Pos logit contributions
 ears+0.020
 tail+0.020
 wheels+0.019
ps+0.019
uders+0.019
Neg logit contributions
des-0.025
and-0.024
to-0.023
ido-0.023
old-0.022
Token going
Token ID1016
Feature activation+1.27e-03

Pos logit contributions
 ears+0.023
ities+0.022
 tail+0.022
 wheels+0.022
uders+0.022
Neg logit contributions
des-0.023
and-0.022
to-0.022
ido-0.021
bie-0.020
Token on
Token ID319
Feature activation-3.61e-02

Pos logit contributions
des+0.002
and+0.002
ido+0.002
to+0.002
old+0.002
Neg logit contributions
 ears-0.002
 wheels-0.002
 tail-0.002
 scales-0.002
 wings-0.002
Token her
Token ID607
Feature activation-9.92e-03

Pos logit contributions
 ears+0.034
 wheels+0.033
 tail+0.033
ps+0.033
uders+0.032
Neg logit contributions
des-0.087
to-0.085
and-0.084
 inspection-0.081
ido-0.080
Token very
Token ID845
Feature activation+1.86e-03

Pos logit contributions
 ears+0.016
ps+0.015
 wheels+0.015
 tail+0.015
ities+0.015
Neg logit contributions
des-0.017
and-0.016
to-0.016
ido-0.016
bie-0.015
Token first
Token ID717
Feature activation-2.80e-02

Pos logit contributions
des+0.003
and+0.003
to+0.003
ido+0.003
 inspection+0.003
Neg logit contributions
 ears-0.003
 wheels-0.003
 tail-0.003
 scales-0.003
 wings-0.003
Token trip
Token ID5296
Feature activation-5.27e-03

Pos logit contributions
ps+0.041
 ears+0.041
 wheels+0.040
 scales+0.039
 tail+0.039
Neg logit contributions
des-0.051
and-0.051
to-0.049
ido-0.049
bie-0.047
Token!
Token ID0
Feature activation-1.43e-02

Pos logit contributions
 ears+0.006
 wheels+0.005
 tail+0.005
ps+0.005
e+0.005
Neg logit contributions
des-0.012
and-0.012
to-0.011
ido-0.011
 inspection-0.011
Token \n
Token ID198
Feature activation+1.52e-02

Pos logit contributions
des+0.023
to+0.021
 inspection+0.021
bie+0.021
and+0.021
Neg logit contributions
 ears-0.010
 tail-0.010
 wheels-0.010
e-0.009
 middle-0.009
Token \n
Token ID198
Feature activation-9.15e-03

Pos logit contributions
des+0.037
 inspection+0.034
ido+0.033
bie+0.033
to+0.033
Neg logit contributions
e-0.014
 middle-0.013
 tail-0.013
 ears-0.013
 end-0.013
TokenFrom
Token ID4863
Feature activation-2.14e-02

Pos logit contributions
 ears+0.010
 wheels+0.010
 tail+0.010
e+0.010
 middle+0.009
Neg logit contributions
des-0.020
to-0.019
and-0.019
bie-0.019
ido-0.019
Token that
Token ID326
Feature activation+5.66e-04

Pos logit contributions
ities+0.032
 ears+0.031
uders+0.030
ates+0.029
 wheels+0.029
Neg logit contributions
des-0.040
ido-0.039
old-0.038
to-0.036
 inspection-0.036
Token day
Token ID1110
Feature activation-3.36e-02

Pos logit contributions
des+0.001
and+0.001
to+0.001
ido+0.001
 inspection+0.001
Neg logit contributions
 ears-0.001
 wheels-0.001
 tail-0.001
ps-0.001
uders-0.001
Token on
Token ID319
Feature activation-4.67e-02

Pos logit contributions
 scales+0.039
 ears+0.038
uders+0.038
 wheels+0.037
ities+0.036
Neg logit contributions
des-0.070
to-0.069
and-0.068
ido-0.068
urry-0.066
Token,
Token ID11
Feature activation+2.38e-03

Pos logit contributions
ities+0.066
ps+0.063
 ears+0.061
 scales+0.061
 wheels+0.061
Neg logit contributions
 inspection-0.088
ido-0.085
 Su-0.084
des-0.083
old-0.081
Token Tim
Token ID5045
Feature activation-2.96e-03

Pos logit contributions
des+0.004
and+0.004
to+0.004
ido+0.004
 inspection+0.004
Neg logit contributions
 ears-0.004
 wheels-0.004
 tail-0.004
 scales-0.003
ps-0.003
Token and
Token ID290
Feature activation+2.79e-03

Pos logit contributions
 ears+0.003
 wheels+0.003
ities+0.003
uders+0.003
ider+0.003
Neg logit contributions
des-0.006
and-0.006
to-0.006
old-0.006
bie-0.006
Token his
Token ID465
Feature activation+1.67e-03

Pos logit contributions
des+0.007
and+0.007
to+0.007
ido+0.007
old+0.006
Neg logit contributions
 ears-0.003
 tail-0.002
 wheels-0.002
 scales-0.002
ps-0.002
Token friends
Token ID2460
Feature activation+4.80e-03

Pos logit contributions
des+0.004
and+0.004
to+0.004
ido+0.004
old+0.004
Neg logit contributions
 ears-0.002
 wheels-0.002
uders-0.002
ities-0.002
 tail-0.002
Tokenmy
Token ID1820
Feature activation-1.78e-03

Pos logit contributions
des+0.052
and+0.047
 inspection+0.046
to+0.045
ido+0.044
Neg logit contributions
 ears-0.054
 tail-0.052
 head-0.050
e-0.050
 wheels-0.050
Token replied
Token ID8712
Feature activation-3.73e-03

Pos logit contributions
 ears+0.003
 wheels+0.003
ps+0.003
 tail+0.003
uders+0.003
Neg logit contributions
des-0.002
to-0.002
and-0.002
old-0.002
ido-0.002
Token.
Token ID13
Feature activation+8.88e-03

Pos logit contributions
 ears+0.004
 wheels+0.004
uders+0.004
 tail+0.004
ps+0.004
Neg logit contributions
des-0.009
and-0.008
to-0.008
ido-0.008
old-0.008
Token \n
Token ID198
Feature activation+9.52e-03

Pos logit contributions
des+0.023
and+0.020
ability+0.020
ido+0.020
old+0.020
Neg logit contributions
 wheels-0.009
 ears-0.009
 tail-0.009
bed-0.008
 front-0.008
Token \n
Token ID198
Feature activation-9.02e-03

Pos logit contributions
des+0.024
 inspection+0.021
bie+0.021
ido+0.021
ability+0.021
Neg logit contributions
 wheels-0.008
bed-0.007
 ears-0.007
 tail-0.007
 front-0.007
TokenSuddenly
Token ID38582
Feature activation-3.58e-02

Pos logit contributions
 wheels+0.010
 ears+0.010
 tail+0.010
 scales+0.009
 feathers+0.009
Neg logit contributions
des-0.020
and-0.019
to-0.019
ido-0.019
bie-0.019
Token,
Token ID11
Feature activation-2.28e-02

Pos logit contributions
ates+0.044
ities+0.044
ps+0.044
uders+0.043
 ears+0.042
Neg logit contributions
des-0.073
 inspection-0.073
to-0.071
 Su-0.070
old-0.069
Token they
Token ID484
Feature activation-3.79e-02

Pos logit contributions
 ears+0.042
ates+0.042
uders+0.041
ps+0.041
ities+0.041
Neg logit contributions
des-0.033
to-0.031
 inspection-0.030
ido-0.030
and-0.030
Token saw
Token ID2497
Feature activation-1.42e-02

Pos logit contributions
ates+0.060
 ears+0.059
ps+0.058
uders+0.058
 tail+0.057
Neg logit contributions
des-0.062
to-0.061
and-0.061
bie-0.060
 inspection-0.059
Token a
Token ID257
Feature activation+1.05e-02

Pos logit contributions
 ears+0.012
 tail+0.011
ates+0.011
ities+0.011
uders+0.011
Neg logit contributions
des-0.035
and-0.034
to-0.034
old-0.034
ido-0.033
Token big
Token ID1263
Feature activation+1.01e-02

Pos logit contributions
des+0.023
and+0.022
to+0.022
ido+0.021
old+0.021
Neg logit contributions
 ears-0.013
 tail-0.013
 wheels-0.013
 scales-0.012
 wings-0.012
Token,
Token ID11
Feature activation-8.38e-03

Pos logit contributions
des+0.014
and+0.014
to+0.014
ido+0.013
old+0.013
Neg logit contributions
 ears-0.010
 tail-0.010
 wheels-0.010
ps-0.009
 scales-0.009
Token "
Token ID366
Feature activation+2.16e-02

Pos logit contributions
 ears+0.013
 wheels+0.012
 tail+0.012
uders+0.012
ps+0.012
Neg logit contributions
des-0.015
and-0.015
to-0.014
ido-0.014
 inspection-0.014
TokenRex
Token ID47389
Feature activation-1.51e-02

Pos logit contributions
des+0.038
and+0.037
to+0.036
yes+0.034
ido+0.034
Neg logit contributions
 ears-0.035
uders-0.034
 wheels-0.033
 tail-0.033
ps-0.033
Token,
Token ID11
Feature activation-8.89e-03

Pos logit contributions
 ears+0.015
 wheels+0.014
ities+0.014
 tail+0.013
 scales+0.013
Neg logit contributions
des-0.035
to-0.034
and-0.034
ido-0.033
bie-0.033
Token stop
Token ID2245
Feature activation-6.12e-03

Pos logit contributions
 ears+0.011
 tail+0.011
 wheels+0.011
ps+0.011
 scales+0.010
Neg logit contributions
des-0.019
and-0.018
to-0.018
ido-0.017
old-0.017
Token barking
Token ID48961
Feature activation-3.51e-02

Pos logit contributions
ates+0.008
 wheels+0.008
ities+0.008
 ears+0.008
 tail+0.008
Neg logit contributions
des-0.012
and-0.012
to-0.012
bie-0.011
 inspection-0.011
Token.
Token ID13
Feature activation-8.65e-03

Pos logit contributions
 ears+0.041
 wheels+0.040
ates+0.040
ities+0.039
 scales+0.039
Neg logit contributions
and-0.073
des-0.073
to-0.071
ido-0.068
old-0.067
Token It
Token ID632
Feature activation-8.22e-03

Pos logit contributions
 wings+0.013
 ears+0.013
 wheels+0.013
ates+0.013
ps+0.012
Neg logit contributions
des-0.015
and-0.015
to-0.015
ido-0.014
old-0.014
Token's
Token ID338
Feature activation-2.56e-02

Pos logit contributions
ities+0.012
ates+0.012
 wheels+0.011
 ears+0.011
 tail+0.010
Neg logit contributions
des-0.016
to-0.015
and-0.015
ll-0.014
bie-0.014
Token just
Token ID655
Feature activation-1.50e-02

Pos logit contributions
 skies+0.033
ities+0.032
bed+0.032
ates+0.032
ider+0.032
Neg logit contributions
des-0.047
ido-0.047
and-0.046
to-0.046
bie-0.046
Token a
Token ID257
Feature activation+4.81e-03

Pos logit contributions
 wheels+0.014
 ears+0.014
ities+0.014
ates+0.014
uders+0.014
Neg logit contributions
des-0.034
to-0.033
and-0.033
ido-0.032
bie-0.032
Token was
Token ID373
Feature activation-4.59e-03

Pos logit contributions
and+0.001
des+0.001
ido+0.001
 inspection+0.001
to+0.001
Neg logit contributions
uders-0.001
ities-0.001
ps-0.001
ider-0.001
 skies-0.001
Token Velvet
Token ID36271
Feature activation-1.87e-02

Pos logit contributions
 ears+0.006
uders+0.006
ities+0.006
ps+0.006
 wheels+0.006
Neg logit contributions
and-0.009
des-0.009
ido-0.009
to-0.009
old-0.009
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
uders+0.026
 wheels+0.026
ps+0.025
 skies+0.025
 scales+0.024
Neg logit contributions
and-0.035
des-0.034
to-0.034
ido-0.033
old-0.033
Token Velvet
Token ID36271
Feature activation-1.15e-02

Pos logit contributions
uders+0.024
ps+0.024
ates+0.023
 scales+0.023
ities+0.022
Neg logit contributions
to-0.025
des-0.025
and-0.025
ido-0.024
 inspection-0.023
Token had
Token ID550
Feature activation-2.18e-02

Pos logit contributions
ps+0.020
ates+0.020
 wheels+0.020
ities+0.019
 ears+0.019
Neg logit contributions
des-0.017
to-0.017
and-0.017
bie-0.016
ll-0.016
Token soft
Token ID2705
Feature activation-3.57e-02

Pos logit contributions
uders+0.023
ities+0.022
ps+0.022
 ears+0.021
ates+0.021
Neg logit contributions
des-0.049
and-0.048
to-0.047
ido-0.045
old-0.045
Token hair
Token ID4190
Feature activation-3.63e-02

Pos logit contributions
uders+0.056
ider+0.054
pie+0.049
 American+0.048
eper+0.047
Neg logit contributions
to-0.058
and-0.056
 proper-0.056
 inspection-0.055
bie-0.055
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
uders+0.061
AR+0.051
athed+0.051
ates+0.049
ities+0.049
Neg logit contributions
and-0.057
to-0.057
of-0.055
ido-0.054
ll-0.052
Token a
Token ID257
Feature activation-3.39e-03

Pos logit contributions
uders+0.015
ps+0.014
ates+0.014
ities+0.014
 wheels+0.014
Neg logit contributions
des-0.020
to-0.020
and-0.020
bie-0.019
 inspection-0.019
Token pretty
Token ID2495
Feature activation-1.08e-02

Pos logit contributions
uders+0.006
ps+0.005
ates+0.005
ities+0.005
e+0.005
Neg logit contributions
des-0.005
to-0.005
bie-0.005
 inspection-0.005
ido-0.005
Token dress
Token ID6576
Feature activation-2.45e-02

Pos logit contributions
uders+0.018
ps+0.016
ities+0.016
ates+0.016
 yourselves+0.015
Neg logit contributions
des-0.017
to-0.017
 inspection-0.017
yes-0.017
 proper-0.017