TOP ACTIVATIONS
MAX = 4.480

named B oun cy . B oun cy loved to jump
" Hello friend !" The k ang aroo sque aked happily
the wall ?" Mrs . Wilson laughed . She
k ang aroo !" The k ang aroo sque aked in
she would be out riding her sc ooter . She was
, y um ," the k ang aroo says .
the trail . Dad pointed at the trees and the birds
k ang aroo . The k ang aroo hops faster .
us ?" The k ang aroo nods . It
many wonderful things . The k ang aroo saw the beautiful
and started to pat the k ang aroo . The k
who wanted to decor ate his house . He searched everywhere
in the park . The k ang aroo is big and
k ang aroo !". The k ang aroo stopped jumping and
, little boys ," the k ang aroo says . "
" Ro ar !" the k ang aroo says .
, long stick . The k ang aroo snapped the stick
He looked up at the k ang aroo and thanked it
Lily says . " The k ang aroo is not from
the other side . The k ang aroo is in a

MIN ACTIVATIONS
MIN = -4.099

eager little girl named Rachel . Rachel loved to organize things
a little girl named Rachel . She was three years old
that said " O st ric hes ". Sara and Tom
a little girl named Rachel . She was very excited to
in the jungle . Every night , she heard the mosquit
. They were called ost ric hes . Lily and Ben
small bird needed a tw ig for her nest . She
was a girl named Rachel . Rachel was very hungry ,
sweet little girl called Rachel . She liked to spin around
named Rachel and her uncle . Rachel wanted to walk somewhere
. He kept throwing a tant rum . His mom was
was a girl called Rachel . Rachel liked to write .
a little girl called Rachel . Rachel was very restless everyday
stage . They use cr ay ons and stickers to decor
was a girl named Rachel . Rachel liked pink things .
pasta . She wrote " past a " on a page
did not know what ost ric hes were . They looked
Rachel who wanted a peach . She went to the river
a little girl named Rachel . Rachel loved to draw circles
a little girl called Rachel . Rachel loved to ride her

INTERVAL 2.335 - 4.480
CONTAINS 0.125%

of the tree nearest to her and began to sing .
to get the cookies . Can you please give me the
The boy p atted F uzz 's head and said
When the k ite was close by ,
the log and then jumped onto a nearby tree . The

INTERVAL 0.191 - 2.335
CONTAINS 38.410%

. He says , " You 're welcome , Mia .
. " Go away , Tim ! You are
called Emily . Emily had a very special dream - to
" Tim ," said his mom , " you need to
fun !" L ila said . She spun around and hugged

INTERVAL -1.954 - 0.191
CONTAINS 61.134%

he asked the fox to help him get back to the
to stare out the window , watching as the world passed
. Sam was happy and nodded his head . They put
would be fine soon . Tom and Lily learned
as smart as Jack ". J ill was so

INTERVAL -4.099 - -1.954
CONTAINS 0.331%

bang . They see smoke and sparks . They feel a
and go oe y when the timer be ep ed .
was amazed ! She had never seen an avocado man before
time , there was a long snake , who lived in
. It is not spicy , it is sweet . It
Token named
Token ID3706
Feature activation+4.35e-01

Pos logit contributions
 wra+0.006
 Bub+0.006
 mar+0.005
 Nib+0.005
endant+0.005
Neg logit contributions
zed-0.017
gh-0.016
ze-0.016
aster-0.016
 aids-0.016
Token B
Token ID347
Feature activation+2.04e+00

Pos logit contributions
endant+0.009
 wra+0.009
ocular+0.007
 mar+0.007
rou+0.007
Neg logit contributions
gh-0.015
zed-0.015
aster-0.015
ainer-0.014
ze-0.014
Tokenoun
Token ID977
Feature activation+9.13e-01

Pos logit contributions
 wra+0.053
 mar+0.051
 Nib+0.049
endant+0.048
 jewels+0.047
Neg logit contributions
gh-0.061
zed-0.060
over-0.059
ainer-0.059
ze-0.056
Tokency
Token ID948
Feature activation+7.20e-01

Pos logit contributions
endant+0.019
 wra+0.019
 Nib+0.018
 Bub+0.018
 mar+0.017
Neg logit contributions
zed-0.033
gh-0.032
ze-0.032
ainer-0.031
aster-0.030
Token.
Token ID13
Feature activation+2.70e+00

Pos logit contributions
 wra+0.016
 Bub+0.015
 Nib+0.015
endant+0.015
 mar+0.015
Neg logit contributions
zed-0.025
ze-0.025
gh-0.024
ainer-0.024
aster-0.024
Token B
Token ID347
Feature activation+4.11e+00

Pos logit contributions
endant+0.050
 wra+0.048
 buried+0.041
ll+0.038
 mar+0.038
Neg logit contributions
zed-0.094
ainer-0.094
gh-0.092
 paragraph-0.089
ap-0.087
Tokenoun
Token ID977
Feature activation+9.16e-01

Pos logit contributions
 wra+0.120
endant+0.118
 slammed+0.114
 Nib+0.114
 mar+0.114
Neg logit contributions
zed-0.106
over-0.103
gh-0.100
ainer-0.098
ze-0.097
Tokency
Token ID948
Feature activation+1.21e+00

Pos logit contributions
 wra+0.019
endant+0.019
 Nib+0.018
 Bub+0.018
 mar+0.017
Neg logit contributions
zed-0.033
gh-0.032
ze-0.031
ainer-0.031
ap-0.030
Token loved
Token ID6151
Feature activation+6.64e-01

Pos logit contributions
 wra+0.021
 Bub+0.021
 Nib+0.021
 mar+0.020
endant+0.020
Neg logit contributions
zed-0.049
gh-0.047
ze-0.046
 aids-0.045
aster-0.043
Token to
Token ID284
Feature activation-6.30e-01

Pos logit contributions
 mar+0.016
 Bub+0.016
 Nib+0.015
 wra+0.014
 H+0.014
Neg logit contributions
zed-0.024
ainer-0.024
gh-0.023
 aids-0.023
ze-0.022
Token jump
Token ID4391
Feature activation+2.86e-01

Pos logit contributions
zed+0.024
gh+0.024
 aids+0.024
ainer+0.024
 sentence+0.022
Neg logit contributions
 Bub-0.013
 mar-0.012
 Nib-0.012
 ot-0.011
 buried-0.010
Token "
Token ID366
Feature activation+6.28e-01

Pos logit contributions
aster+0.030
gh+0.030
ainer+0.030
 aids+0.030
zed+0.029
Neg logit contributions
 Bub-0.014
 Nib-0.013
 Ell-0.012
 could-0.012
 mar-0.011
TokenHello
Token ID15496
Feature activation+2.71e+00

Pos logit contributions
 Bub+0.015
 mar+0.014
 Nib+0.014
 wra+0.014
ocular+0.013
Neg logit contributions
zed-0.022
gh-0.021
 aids-0.021
ainer-0.020
aster-0.020
Token friend
Token ID1545
Feature activation+7.33e-01

Pos logit contributions
 Bub+0.056
 mar+0.054
 Nib+0.054
 wra+0.051
endant+0.050
Neg logit contributions
zed-0.101
gh-0.098
 aids-0.095
ainer-0.095
aster-0.094
Token!"
Token ID2474
Feature activation+9.48e-01

Pos logit contributions
 Bub+0.016
 mar+0.016
 Nib+0.015
 wra+0.015
endant+0.014
Neg logit contributions
zed-0.026
gh-0.025
 aids-0.025
ainer-0.025
aster-0.024
Token The
Token ID383
Feature activation+2.53e+00

Pos logit contributions
 Bub+0.016
 Nib+0.015
 Ell+0.015
 mar+0.014
 H+0.013
Neg logit contributions
gh-0.041
zed-0.039
ainer-0.038
ze-0.038
aster-0.038
Token k
Token ID479
Feature activation+4.05e+00

Pos logit contributions
 Bub+0.040
 Nib+0.038
 mar+0.038
endant+0.037
 wra+0.037
Neg logit contributions
zed-0.105
gh-0.102
ze-0.100
ainer-0.100
 aids-0.100
Tokenang
Token ID648
Feature activation+1.16e+00

Pos logit contributions
 Bub+0.099
 wra+0.095
 Nib+0.095
 mar+0.094
endant+0.091
Neg logit contributions
zed-0.126
ze-0.122
gh-0.122
ainer-0.119
aster-0.118
Tokenaroo
Token ID38049
Feature activation+1.43e+00

Pos logit contributions
 Bub+0.029
 mar+0.028
 Nib+0.028
 wra+0.026
endant+0.026
Neg logit contributions
zed-0.038
gh-0.037
ainer-0.036
 aids-0.036
aster-0.035
Token sque
Token ID35776
Feature activation-3.61e-01

Pos logit contributions
 Bub+0.024
 mar+0.023
 wra+0.023
 Nib+0.023
endant+0.023
Neg logit contributions
zed-0.058
gh-0.055
ze-0.054
 aids-0.054
ainer-0.052
Tokenaked
Token ID4335
Feature activation+1.16e-01

Pos logit contributions
zed+0.009
gh+0.008
 aids+0.008
aster+0.008
ainer+0.008
Neg logit contributions
 Bub-0.012
 mar-0.012
 Nib-0.012
endant-0.011
 wra-0.011
Token happily
Token ID18177
Feature activation+6.95e-01

Pos logit contributions
 Bub+0.002
 mar+0.002
 Nib+0.002
 wra+0.002
 Ell+0.002
Neg logit contributions
zed-0.005
aster-0.005
gh-0.005
ainer-0.004
 aids-0.004
Token the
Token ID262
Feature activation-2.73e-02

Pos logit contributions
 Bub+0.006
 mar+0.006
 Nib+0.006
 wra+0.005
endant+0.005
Neg logit contributions
zed-0.008
gh-0.007
ainer-0.007
 aids-0.007
ze-0.007
Token wall
Token ID3355
Feature activation-1.60e+00

Pos logit contributions
zed+0.001
 aids+0.001
gh+0.001
aster+0.001
ze+0.001
Neg logit contributions
 Bub-0.001
 Nib-0.001
endant-0.001
 mar-0.001
 wra-0.001
Token?"
Token ID1701
Feature activation+7.57e-01

Pos logit contributions
zed+0.055
gh+0.053
ainer+0.051
ze+0.050
 aids+0.049
Neg logit contributions
 Bub-0.038
 mar-0.037
 Nib-0.035
 could-0.035
 haven-0.035
Token
Token ID198
Feature activation+1.64e+00

Pos logit contributions
 Bub+0.016
 mar+0.016
 Nib+0.015
 wra+0.015
endant+0.014
Neg logit contributions
zed-0.028
gh-0.027
 aids-0.027
aster-0.026
ainer-0.026
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
 Bub+0.036
 mar+0.035
 Nib+0.034
 wra+0.033
endant+0.033
Neg logit contributions
zed-0.060
gh-0.057
aster-0.056
 aids-0.056
ainer-0.055
TokenMrs
Token ID27034
Feature activation+4.04e+00

Pos logit contributions
 Bub+0.037
 Nib+0.037
 mar+0.036
 wra+0.036
 H+0.034
Neg logit contributions
zed-0.043
gh-0.041
ainer-0.040
 aids-0.040
ze-0.039
Token.
Token ID13
Feature activation+2.31e+00

Pos logit contributions
 Bub+0.112
 mar+0.107
 buried+0.103
 ot+0.099
 haven+0.098
Neg logit contributions
gh-0.114
 sentence-0.106
zed-0.106
over-0.105
aster-0.105
Token Wilson
Token ID8127
Feature activation+4.59e-01

Pos logit contributions
 Bub+0.060
 mar+0.059
 Nib+0.057
endant+0.056
 wra+0.056
Neg logit contributions
zed-0.074
gh-0.071
ainer-0.068
 aids-0.067
aster-0.067
Token laughed
Token ID13818
Feature activation+5.58e-01

Pos logit contributions
 Bub+0.008
 mar+0.008
 Nib+0.008
 wra+0.008
endant+0.007
Neg logit contributions
zed-0.018
gh-0.018
 aids-0.017
ze-0.017
aster-0.017
Token.
Token ID13
Feature activation+2.03e+00

Pos logit contributions
 Bub+0.008
 mar+0.008
 Nib+0.008
 wra+0.007
endant+0.007
Neg logit contributions
zed-0.024
gh-0.023
aster-0.023
 aids-0.023
ainer-0.023
Token She
Token ID1375
Feature activation+1.48e+00

Pos logit contributions
 mar+0.059
 Bub+0.058
endant+0.057
 buried+0.057
 wra+0.056
Neg logit contributions
zed-0.057
 aids-0.053
gh-0.052
ainer-0.052
aster-0.050
Token k
Token ID479
Feature activation+2.74e+00

Pos logit contributions
zed+0.018
aster+0.017
 aids+0.016
ainer+0.016
gh+0.016
Neg logit contributions
 Bub-0.007
 mar-0.006
 Nib-0.006
 H-0.006
 Ell-0.006
Tokenang
Token ID648
Feature activation-4.62e-01

Pos logit contributions
 Bub+0.064
 Nib+0.062
 wra+0.060
 mar+0.059
endant+0.058
Neg logit contributions
gh-0.087
zed-0.086
ze-0.084
ainer-0.082
 aids-0.082
Tokenaroo
Token ID38049
Feature activation+5.52e-01

Pos logit contributions
ainer+0.017
zed+0.017
 aids+0.017
aster+0.017
gh+0.016
Neg logit contributions
 Bub-0.010
 mar-0.010
 Nib-0.009
 ot-0.009
 Ell-0.009
Token!"
Token ID2474
Feature activation+1.35e+00

Pos logit contributions
 Bub+0.012
 mar+0.011
 Nib+0.011
 wra+0.010
endant+0.010
Neg logit contributions
zed-0.021
gh-0.020
ainer-0.020
 aids-0.019
aster-0.019
Token The
Token ID383
Feature activation+2.23e+00

Pos logit contributions
 Bub+0.028
 mar+0.026
 Nib+0.026
 wra+0.024
endant+0.024
Neg logit contributions
zed-0.051
gh-0.050
 aids-0.049
ainer-0.049
aster-0.048
Token k
Token ID479
Feature activation+4.04e+00

Pos logit contributions
 Bub+0.037
 mar+0.035
endant+0.035
 Nib+0.035
 wra+0.034
Neg logit contributions
zed-0.091
gh-0.087
ze-0.087
ainer-0.087
 aids-0.086
Tokenang
Token ID648
Feature activation+1.14e+00

Pos logit contributions
 Bub+0.102
 wra+0.098
 Nib+0.097
 mar+0.097
endant+0.094
Neg logit contributions
zed-0.119
ze-0.117
gh-0.114
ainer-0.113
ap-0.112
Tokenaroo
Token ID38049
Feature activation+1.65e+00

Pos logit contributions
 Bub+0.031
 mar+0.030
 Nib+0.029
 wra+0.028
endant+0.028
Neg logit contributions
zed-0.036
gh-0.035
 aids-0.033
ainer-0.033
aster-0.033
Token sque
Token ID35776
Feature activation-3.88e-01

Pos logit contributions
 Bub+0.030
 Nib+0.029
 wra+0.029
 mar+0.029
endant+0.028
Neg logit contributions
zed-0.064
gh-0.060
ze-0.060
 aids-0.060
ainer-0.058
Tokenaked
Token ID4335
Feature activation+1.02e-01

Pos logit contributions
zed+0.009
gh+0.009
 aids+0.008
ainer+0.008
ze+0.008
Neg logit contributions
 Bub-0.014
 mar-0.013
 Nib-0.013
endant-0.013
 wra-0.013
Token in
Token ID287
Feature activation+5.21e-01

Pos logit contributions
 Bub+0.002
 Nib+0.002
 mar+0.002
 wra+0.001
 Ell+0.001
Neg logit contributions
zed-0.004
gh-0.004
aster-0.004
ainer-0.004
 aids-0.004
Token she
Token ID673
Feature activation+4.19e-01

Pos logit contributions
 Bub+0.004
 mar+0.003
 Nib+0.003
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.008
gh-0.008
 aids-0.008
ainer-0.008
ze-0.007
Token would
Token ID561
Feature activation+6.13e-01

Pos logit contributions
 Bub+0.006
 mar+0.006
 Nib+0.006
 wra+0.006
endant+0.005
Neg logit contributions
zed-0.018
gh-0.017
 aids-0.017
ainer-0.017
aster-0.017
Token be
Token ID307
Feature activation+2.26e-01

Pos logit contributions
 Bub+0.015
 mar+0.015
 Nib+0.014
 wra+0.014
endant+0.013
Neg logit contributions
zed-0.020
gh-0.020
ainer-0.019
 aids-0.019
aster-0.019
Token out
Token ID503
Feature activation-4.88e-02

Pos logit contributions
 mar+0.004
 Bub+0.004
 buried+0.004
 called+0.004
 lost+0.004
Neg logit contributions
zed-0.009
 aids-0.009
ainer-0.009
 sentence-0.009
gh-0.009
Token riding
Token ID10311
Feature activation+2.17e-01

Pos logit contributions
zed+0.002
gh+0.002
ainer+0.002
 aids+0.002
aster+0.002
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.001
endant-0.001
Token her
Token ID607
Feature activation+4.04e+00

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.005
 H+0.004
Neg logit contributions
zed-0.008
ainer-0.007
gh-0.007
 aids-0.007
aster-0.007
Token sc
Token ID629
Feature activation+2.39e+00

Pos logit contributions
 Nib+0.076
 Bub+0.071
 wra+0.063
 buried+0.061
unn+0.061
Neg logit contributions
zed-0.156
gh-0.154
ze-0.148
rac-0.143
 paragraph-0.136
Tokenooter
Token ID25141
Feature activation+7.28e-01

Pos logit contributions
 Bub+0.072
 wra+0.070
 Nib+0.070
 mar+0.069
 buried+0.065
Neg logit contributions
zed-0.067
gh-0.064
 aids-0.061
aster-0.059
ze-0.058
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
 Bub+0.016
 Nib+0.016
 mar+0.015
 wra+0.015
endant+0.015
Neg logit contributions
zed-0.026
gh-0.025
ze-0.024
 aids-0.024
aster-0.024
Token She
Token ID1375
Feature activation+1.49e+00

Pos logit contributions
 Bub+0.007
 mar+0.007
 Nib+0.007
 wra+0.007
endant+0.007
Neg logit contributions
zed-0.011
gh-0.010
 aids-0.010
ainer-0.010
aster-0.010
Token was
Token ID373
Feature activation+6.18e-01

Pos logit contributions
 Bub+0.026
 Nib+0.025
 mar+0.024
 wra+0.024
endant+0.023
Neg logit contributions
zed-0.061
gh-0.058
ze-0.057
 aids-0.056
aster-0.056
Token,
Token ID11
Feature activation+2.33e-01

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.024
endant+0.024
Neg logit contributions
zed-0.036
gh-0.035
aster-0.034
 aids-0.034
ainer-0.034
Token y
Token ID331
Feature activation-3.71e-01

Pos logit contributions
 Bub+0.005
 mar+0.005
endant+0.005
 Nib+0.005
 wra+0.005
Neg logit contributions
zed-0.008
aster-0.007
 aids-0.007
ze-0.007
gh-0.007
Tokenum
Token ID388
Feature activation+1.14e+00

Pos logit contributions
zed+0.012
 aids+0.011
ze+0.010
gh+0.010
aster+0.010
Neg logit contributions
 mar-0.010
 wra-0.010
unn-0.010
 Nib-0.010
ocular-0.010
Token,"
Token ID553
Feature activation-8.55e-01

Pos logit contributions
 Bub+0.027
 mar+0.026
 Nib+0.026
 wra+0.025
endant+0.025
Neg logit contributions
zed-0.038
gh-0.037
aster-0.036
 aids-0.036
ainer-0.035
Token the
Token ID262
Feature activation+2.82e+00

Pos logit contributions
aster+0.024
zed+0.024
 aids+0.023
ainer+0.022
ze+0.022
Neg logit contributions
endant-0.023
 Bub-0.023
 haven-0.022
ocular-0.022
 mar-0.022
Token k
Token ID479
Feature activation+4.01e+00

Pos logit contributions
 Bub+0.043
 mar+0.041
 Nib+0.040
 wra+0.038
endant+0.037
Neg logit contributions
zed-0.121
gh-0.117
ainer-0.114
 aids-0.114
ze-0.114
Tokenang
Token ID648
Feature activation+1.40e+00

Pos logit contributions
 Bub+0.097
 wra+0.094
 mar+0.093
 Nib+0.090
endant+0.090
Neg logit contributions
zed-0.125
ze-0.121
gh-0.121
aster-0.121
ainer-0.120
Tokenaroo
Token ID38049
Feature activation+1.05e+00

Pos logit contributions
 Bub+0.035
 Nib+0.033
 mar+0.033
 wra+0.031
 ot+0.030
Neg logit contributions
zed-0.047
gh-0.046
 aids-0.044
ainer-0.044
aster-0.043
Token says
Token ID1139
Feature activation-6.74e-01

Pos logit contributions
 Bub+0.017
 mar+0.016
 wra+0.016
 Nib+0.015
endant+0.015
Neg logit contributions
zed-0.043
 aids-0.041
gh-0.041
ze-0.041
aster-0.040
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
zed+0.029
gh+0.028
 aids+0.027
ainer+0.027
aster+0.027
Neg logit contributions
 Bub-0.010
 Nib-0.010
 mar-0.010
 wra-0.009
endant-0.009
Token
Token ID198
Feature activation-7.70e-03

Pos logit contributions
gh+0.033
zed+0.032
 aids+0.030
aster+0.030
ze+0.030
Neg logit contributions
 Bub-0.009
 Nib-0.009
 could-0.008
 L-0.007
 Tim-0.007
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
 Bub+0.058
endant+0.057
 Nib+0.057
 wra+0.055
 mar+0.054
Neg logit contributions
zed-0.069
gh-0.065
 aids-0.063
aster-0.063
ze-0.062
Token trail
Token ID8025
Feature activation+1.95e-01

Pos logit contributions
 Nib+0.044
 Bub+0.043
endant+0.042
 wra+0.040
 mar+0.040
Neg logit contributions
zed-0.065
gh-0.061
ze-0.060
 aids-0.059
 sentence-0.059
Token.
Token ID13
Feature activation+2.82e+00

Pos logit contributions
 Bub+0.004
 mar+0.003
 Nib+0.003
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.008
gh-0.007
 aids-0.007
ainer-0.007
aster-0.007
Token Dad
Token ID17415
Feature activation+1.41e+00

Pos logit contributions
endant+0.071
 Bub+0.069
 buried+0.068
 mar+0.068
 wra+0.068
Neg logit contributions
zed-0.086
ainer-0.082
aster-0.080
gh-0.079
 aids-0.078
Token pointed
Token ID6235
Feature activation+7.02e-01

Pos logit contributions
 Nib+0.028
 Bub+0.027
endant+0.027
 wra+0.025
 mar+0.023
Neg logit contributions
zed-0.052
gh-0.047
ze-0.045
aster-0.045
rac-0.044
Token at
Token ID379
Feature activation+4.01e+00

Pos logit contributions
 Bub+0.016
 mar+0.016
 Nib+0.015
 wra+0.014
ocular+0.014
Neg logit contributions
zed-0.026
gh-0.025
ainer-0.024
aster-0.024
ze-0.023
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
ll+0.132
endant+0.129
unn+0.124
 haven+0.121
 wra+0.113
Neg logit contributions
zed-0.091
aster-0.082
ze-0.080
 aids-0.077
gh-0.076
Token trees
Token ID7150
Feature activation+1.49e+00

Pos logit contributions
 Bub+0.054
 Nib+0.052
endant+0.052
unn+0.050
ll+0.050
Neg logit contributions
ze-0.040
 sentence-0.040
over-0.038
 aids-0.037
 paragraph-0.036
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 Bub+0.032
 Nib+0.030
 mar+0.029
endant+0.028
 wra+0.028
Neg logit contributions
zed-0.053
gh-0.051
 aids-0.050
ze-0.050
aster-0.049
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
endant+0.055
 haven+0.049
unn+0.047
ll+0.047
 wra+0.043
Neg logit contributions
aster-0.067
zed-0.067
 aids-0.063
Nik-0.061
ze-0.059
Token birds
Token ID10087
Feature activation+1.02e+00

Pos logit contributions
 Bub+0.045
endant+0.045
 Nib+0.044
 haven+0.042
ll+0.042
Neg logit contributions
ze-0.041
 aids-0.039
 sentence-0.038
 paragraph-0.036
over-0.036
Token k
Token ID479
Feature activation+3.74e+00

Pos logit contributions
endant+0.048
 Bub+0.046
 Nib+0.044
 mar+0.044
 wra+0.043
Neg logit contributions
zed-0.089
ze-0.087
gh-0.086
ainer-0.086
 sentence-0.085
Tokenang
Token ID648
Feature activation+8.63e-01

Pos logit contributions
 Bub+0.091
 wra+0.086
 Nib+0.086
 mar+0.084
endant+0.082
Neg logit contributions
zed-0.118
gh-0.116
aster-0.113
ze-0.113
ainer-0.112
Tokenaroo
Token ID38049
Feature activation+1.13e+00

Pos logit contributions
 Bub+0.021
 Nib+0.020
 mar+0.020
 wra+0.019
 ot+0.018
Neg logit contributions
zed-0.029
gh-0.028
 aids-0.028
ainer-0.028
ze-0.027
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
 Bub+0.030
endant+0.030
 Hatt+0.030
 Em+0.029
 wra+0.029
Neg logit contributions
zed-0.033
ze-0.032
gh-0.031
ap-0.031
aster-0.029
Token The
Token ID383
Feature activation+2.45e+00

Pos logit contributions
zed+0.000
ainer+0.000
 aids+0.000
gh+0.000
aster+0.000
Neg logit contributions
 mar-0.000
 wra-0.000
endant-0.000
 Bub-0.000
 Nib-0.000
Token k
Token ID479
Feature activation+4.00e+00

Pos logit contributions
endant+0.053
 Bub+0.050
 wra+0.048
 mar+0.047
 Nib+0.046
Neg logit contributions
ze-0.082
zed-0.081
ainer-0.080
 sentence-0.079
gh-0.078
Tokenang
Token ID648
Feature activation+1.02e+00

Pos logit contributions
 Bub+0.104
 wra+0.100
 Nib+0.098
 mar+0.097
 buried+0.096
Neg logit contributions
zed-0.113
ze-0.111
gh-0.109
aster-0.109
ap-0.108
Tokenaroo
Token ID38049
Feature activation+1.35e+00

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.024
endant+0.023
Neg logit contributions
zed-0.033
gh-0.032
ainer-0.032
 aids-0.031
aster-0.031
Token hops
Token ID29438
Feature activation+7.35e-01

Pos logit contributions
 Bub+0.036
 mar+0.034
 Nib+0.033
endant+0.033
 wra+0.033
Neg logit contributions
zed-0.043
gh-0.040
 aids-0.039
ze-0.039
aster-0.038
Token faster
Token ID5443
Feature activation-3.62e-01

Pos logit contributions
 Bub+0.022
 mar+0.021
 Nib+0.021
endant+0.020
 wra+0.020
Neg logit contributions
zed-0.021
gh-0.020
 aids-0.019
ainer-0.019
ze-0.019
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
zed+0.015
gh+0.014
 aids+0.014
ainer+0.014
aster+0.014
Neg logit contributions
 Bub-0.006
 mar-0.006
 Nib-0.006
 wra-0.006
endant-0.005
Token us
Token ID514
Feature activation+3.59e-01

Pos logit contributions
zed+0.000
gh+0.000
ainer+0.000
 aids+0.000
aster+0.000
Neg logit contributions
 Bub-0.000
 mar-0.000
 Nib-0.000
 wra-0.000
endant-0.000
Token?"
Token ID1701
Feature activation-6.13e-01

Pos logit contributions
 Bub+0.010
 mar+0.010
 Nib+0.009
 wra+0.009
endant+0.009
Neg logit contributions
zed-0.011
gh-0.011
ainer-0.010
 aids-0.010
aster-0.010
Token
Token ID198
Feature activation+3.04e-01

Pos logit contributions
zed+0.023
gh+0.022
 aids+0.022
ainer+0.021
aster+0.021
Neg logit contributions
 Bub-0.012
 mar-0.012
 Nib-0.012
 wra-0.012
endant-0.011
Token
Token ID198
Feature activation+3.91e-01

Pos logit contributions
 Bub+0.006
 mar+0.005
 Nib+0.005
 wra+0.005
endant+0.005
Neg logit contributions
zed-0.012
gh-0.012
 aids-0.011
ainer-0.011
ze-0.011
TokenThe
Token ID464
Feature activation+2.41e+00

Pos logit contributions
 Bub+0.010
 mar+0.010
 Nib+0.010
 wra+0.009
 H+0.009
Neg logit contributions
zed-0.014
gh-0.013
 aids-0.012
ainer-0.012
ze-0.011
Token k
Token ID479
Feature activation+3.98e+00

Pos logit contributions
 Bub+0.042
endant+0.040
 Nib+0.039
 mar+0.039
 wra+0.039
Neg logit contributions
zed-0.095
gh-0.092
ze-0.092
ainer-0.091
 aids-0.090
Tokenang
Token ID648
Feature activation+1.22e+00

Pos logit contributions
 Bub+0.099
 wra+0.094
 Nib+0.093
 mar+0.093
endant+0.091
Neg logit contributions
zed-0.121
ze-0.118
gh-0.117
ainer-0.116
aster-0.116
Tokenaroo
Token ID38049
Feature activation+1.31e+00

Pos logit contributions
 Bub+0.030
 mar+0.029
 Nib+0.029
 wra+0.027
endant+0.027
Neg logit contributions
zed-0.041
gh-0.040
ainer-0.039
 aids-0.038
aster-0.038
Token nods
Token ID34833
Feature activation-8.82e-01

Pos logit contributions
 Bub+0.035
 mar+0.034
 Nib+0.034
 wra+0.032
endant+0.032
Neg logit contributions
zed-0.042
gh-0.040
ainer-0.039
 aids-0.038
aster-0.038
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
zed+0.044
aster+0.044
 aids+0.043
gh+0.042
ainer+0.042
Neg logit contributions
 Bub-0.006
 could-0.006
 said-0.006
 Nib-0.005
 c-0.005
Token It
Token ID632
Feature activation+7.09e-01

Pos logit contributions
 Bub+0.000
 Nib+0.000
 mar+0.000
 Ell+0.000
 could+0.000
Neg logit contributions
gh-0.001
zed-0.001
ze-0.000
aster-0.000
ainer-0.000
Token many
Token ID867
Feature activation+2.24e+00

Pos logit contributions
zed+0.006
gh+0.006
 aids+0.006
aster+0.006
ainer+0.006
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.006
 wra-0.006
endant-0.006
Token wonderful
Token ID7932
Feature activation+5.34e-01

Pos logit contributions
endant+0.067
 Bub+0.066
 Nib+0.065
 haven+0.061
ocular+0.060
Neg logit contributions
zed-0.058
 aids-0.053
gh-0.053
ze-0.052
 sentence-0.050
Token things
Token ID1243
Feature activation+1.85e+00

Pos logit contributions
 Bub+0.017
 Nib+0.017
endant+0.016
 wra+0.016
 mar+0.016
Neg logit contributions
zed-0.013
gh-0.012
 aids-0.012
ze-0.012
aster-0.011
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 Bub+0.047
 Nib+0.046
endant+0.044
 mar+0.043
unn+0.043
Neg logit contributions
zed-0.057
gh-0.055
ze-0.051
 aids-0.050
aster-0.048
Token The
Token ID383
Feature activation+2.42e+00

Pos logit contributions
zed+0.004
gh+0.004
 aids+0.004
ainer+0.004
aster+0.004
Neg logit contributions
 Bub-0.003
 mar-0.003
 Nib-0.003
 wra-0.003
endant-0.003
Token k
Token ID479
Feature activation+3.97e+00

Pos logit contributions
 Bub+0.051
endant+0.050
 Nib+0.048
 mar+0.046
 wra+0.046
Neg logit contributions
zed-0.085
ze-0.084
ainer-0.082
gh-0.078
 aids-0.078
Tokenang
Token ID648
Feature activation+8.03e-01

Pos logit contributions
 Bub+0.104
 Nib+0.101
 wra+0.100
 mar+0.097
endant+0.095
Neg logit contributions
zed-0.113
ze-0.110
over-0.109
gh-0.108
ap-0.107
Tokenaroo
Token ID38049
Feature activation+8.20e-01

Pos logit contributions
 Bub+0.022
 mar+0.021
 Nib+0.021
 wra+0.020
endant+0.020
Neg logit contributions
zed-0.025
gh-0.024
 aids-0.023
ainer-0.023
aster-0.023
Token saw
Token ID2497
Feature activation+7.23e-01

Pos logit contributions
 Nib+0.017
endant+0.017
 wra+0.017
 Bub+0.017
 mar+0.016
Neg logit contributions
zed-0.030
ze-0.029
gh-0.028
 aids-0.027
aster-0.026
Token the
Token ID262
Feature activation+2.52e+00

Pos logit contributions
endant+0.021
 haven+0.020
 Bub+0.020
 wra+0.020
 Nib+0.020
Neg logit contributions
zed-0.020
gh-0.019
 aids-0.019
ze-0.019
aster-0.018
Token beautiful
Token ID4950
Feature activation+2.65e+00

Pos logit contributions
 Bub+0.060
endant+0.059
 Nib+0.056
 mar+0.054
 wra+0.054
Neg logit contributions
ze-0.078
zed-0.076
ainer-0.074
gh-0.073
 sentence-0.073
Token and
Token ID290
Feature activation-1.40e-01

Pos logit contributions
 Bub+0.015
 mar+0.014
 Nib+0.014
 wra+0.013
endant+0.013
Neg logit contributions
zed-0.037
gh-0.036
 aids-0.035
ainer-0.035
ze-0.035
Token started
Token ID2067
Feature activation+8.09e-02

Pos logit contributions
zed+0.005
ze+0.005
aster+0.004
 aids+0.004
gh+0.004
Neg logit contributions
endant-0.003
 wra-0.003
 Bub-0.003
 Nib-0.003
 mar-0.003
Token to
Token ID284
Feature activation+8.88e-01

Pos logit contributions
 mar+0.002
 Bub+0.002
 Nib+0.002
 le+0.001
 lost+0.001
Neg logit contributions
ainer-0.003
zed-0.003
 aids-0.003
gh-0.003
aster-0.003
Token pat
Token ID1458
Feature activation+4.37e-01

Pos logit contributions
 Bub+0.023
 mar+0.022
 Nib+0.022
 wra+0.021
endant+0.020
Neg logit contributions
zed-0.029
gh-0.028
 aids-0.027
ainer-0.027
aster-0.026
Token the
Token ID262
Feature activation+2.74e+00

Pos logit contributions
 Bub+0.011
 mar+0.010
 Nib+0.010
 wra+0.010
endant+0.009
Neg logit contributions
zed-0.015
gh-0.014
 aids-0.014
ainer-0.014
aster-0.014
Token k
Token ID479
Feature activation+3.96e+00

Pos logit contributions
 Bub+0.054
endant+0.053
 Nib+0.051
 mar+0.051
 wra+0.050
Neg logit contributions
ze-0.099
zed-0.099
ainer-0.095
 sentence-0.094
 aids-0.093
Tokenang
Token ID648
Feature activation+1.07e+00

Pos logit contributions
 Bub+0.092
 mar+0.088
 wra+0.088
 Nib+0.087
endant+0.084
Neg logit contributions
zed-0.128
gh-0.125
ze-0.125
ainer-0.124
aster-0.123
Tokenaroo
Token ID38049
Feature activation+1.35e+00

Pos logit contributions
 Bub+0.024
 Nib+0.023
 mar+0.023
 ot+0.022
 Ell+0.021
Neg logit contributions
zed-0.038
gh-0.037
 aids-0.037
ainer-0.036
aster-0.035
Token.
Token ID13
Feature activation+6.72e-01

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.023
endant+0.022
Neg logit contributions
zed-0.052
gh-0.051
ainer-0.049
 aids-0.049
aster-0.049
Token The
Token ID383
Feature activation+2.17e+00

Pos logit contributions
 Bub+0.016
 mar+0.015
 Nib+0.015
 wra+0.015
endant+0.014
Neg logit contributions
zed-0.023
gh-0.022
 aids-0.022
ainer-0.022
ze-0.021
Token k
Token ID479
Feature activation+3.58e+00

Pos logit contributions
endant+0.046
 Bub+0.045
 wra+0.043
 Nib+0.043
 mar+0.042
Neg logit contributions
ze-0.076
zed-0.075
ainer-0.073
 sentence-0.072
 aids-0.072
Token who
Token ID508
Feature activation+5.00e-01

Pos logit contributions
 wra+0.017
 Bub+0.017
 Nib+0.016
 mar+0.016
endant+0.015
Neg logit contributions
zed-0.053
gh-0.051
ze-0.050
aster-0.050
 aids-0.050
Token wanted
Token ID2227
Feature activation-5.36e-02

Pos logit contributions
 Bub+0.008
 mar+0.008
 Nib+0.008
 wra+0.008
endant+0.008
Neg logit contributions
zed-0.021
gh-0.020
ze-0.019
 aids-0.019
aster-0.019
Token to
Token ID284
Feature activation-2.67e-01

Pos logit contributions
zed+0.002
ainer+0.002
gh+0.002
 aids+0.002
aster+0.002
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.001
 buried-0.001
Token decor
Token ID11705
Feature activation+5.28e-01

Pos logit contributions
zed+0.010
gh+0.010
 aids+0.010
ainer+0.010
ze+0.009
Neg logit contributions
 Bub-0.006
 mar-0.005
 Nib-0.005
 buried-0.005
 ot-0.005
Tokenate
Token ID378
Feature activation+5.57e-01

Pos logit contributions
 mar+0.016
 Bub+0.016
 Nib+0.016
endant+0.015
 wra+0.015
Neg logit contributions
zed-0.014
 aids-0.014
gh-0.013
aster-0.013
ainer-0.013
Token his
Token ID465
Feature activation+3.95e+00

Pos logit contributions
 Bub+0.013
 mar+0.013
 Nib+0.013
 wra+0.012
endant+0.012
Neg logit contributions
zed-0.019
gh-0.018
ainer-0.018
 aids-0.018
aster-0.018
Token house
Token ID2156
Feature activation+2.73e-01

Pos logit contributions
 Bub+0.113
 Nib+0.107
 wra+0.107
 haven+0.100
 mar+0.099
Neg logit contributions
zed-0.107
ze-0.105
gh-0.102
 aids-0.101
 sentence-0.101
Token.
Token ID13
Feature activation-5.50e-01

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.005
endant+0.005
Neg logit contributions
zed-0.011
gh-0.010
 aids-0.010
ainer-0.010
aster-0.010
Token He
Token ID679
Feature activation+1.65e-01

Pos logit contributions
zed+0.023
gh+0.023
ze+0.022
ainer+0.021
aster+0.021
Neg logit contributions
 Bub-0.010
 Tim-0.009
 Nib-0.009
 mar-0.009
 H-0.009
Token searched
Token ID16499
Feature activation-8.99e-01

Pos logit contributions
 Nib+0.003
 Bub+0.003
endant+0.003
 wra+0.003
 mar+0.003
Neg logit contributions
zed-0.006
gh-0.006
ze-0.006
aster-0.006
ainer-0.005
Token everywhere
Token ID8347
Feature activation-5.79e-01

Pos logit contributions
zed+0.033
gh+0.032
 aids+0.031
ainer+0.031
ze+0.031
Neg logit contributions
 Bub-0.020
 mar-0.019
 Nib-0.019
 wra-0.018
endant-0.017
Token in
Token ID287
Feature activation-5.74e-02

Pos logit contributions
 Bub+0.018
 Nib+0.017
 wra+0.016
 mar+0.016
endant+0.016
Neg logit contributions
zed-0.026
gh-0.025
ze-0.025
 aids-0.024
aster-0.024
Token the
Token ID262
Feature activation+1.75e+00

Pos logit contributions
zed+0.002
gh+0.002
 aids+0.002
ainer+0.002
ze+0.002
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.001
 H-0.001
Token park
Token ID3952
Feature activation-2.72e-01

Pos logit contributions
 Bub+0.037
 Nib+0.034
 mar+0.034
endant+0.033
 wra+0.033
Neg logit contributions
zed-0.063
 aids-0.060
aster-0.060
gh-0.060
ze-0.060
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
zed+0.010
gh+0.010
 aids+0.009
aster+0.009
ainer+0.009
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.006
 wra-0.005
endant-0.005
Token The
Token ID383
Feature activation+2.48e+00

Pos logit contributions
 Bub+0.015
 mar+0.015
 Nib+0.015
 wra+0.015
endant+0.014
Neg logit contributions
zed-0.017
gh-0.017
 aids-0.016
ainer-0.016
aster-0.016
Token k
Token ID479
Feature activation+3.95e+00

Pos logit contributions
 Bub+0.043
endant+0.041
 Nib+0.041
 mar+0.041
 wra+0.040
Neg logit contributions
zed-0.097
ze-0.094
gh-0.093
ainer-0.093
 aids-0.092
Tokenang
Token ID648
Feature activation+1.01e+00

Pos logit contributions
 Bub+0.098
 Nib+0.093
 wra+0.093
 mar+0.091
 buried+0.089
Neg logit contributions
zed-0.118
ze-0.115
gh-0.115
ainer-0.113
 aids-0.113
Tokenaroo
Token ID38049
Feature activation+1.30e+00

Pos logit contributions
 Bub+0.025
 mar+0.024
 Nib+0.024
 wra+0.023
endant+0.022
Neg logit contributions
zed-0.033
gh-0.032
ainer-0.031
 aids-0.031
aster-0.031
Token is
Token ID318
Feature activation+3.45e-02

Pos logit contributions
 Bub+0.029
 Nib+0.028
 mar+0.028
 wra+0.028
endant+0.027
Neg logit contributions
zed-0.045
 aids-0.042
gh-0.042
ze-0.041
 sentence-0.040
Token big
Token ID1263
Feature activation+3.48e-01

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.001
gh-0.001
 aids-0.001
ainer-0.001
ze-0.001
Token and
Token ID290
Feature activation+8.31e-01

Pos logit contributions
 Bub+0.010
 Nib+0.009
endant+0.009
 Hatt+0.009
 mar+0.009
Neg logit contributions
ze-0.009
over-0.009
zed-0.009
 sentence-0.009
gh-0.009
Token k
Token ID479
Feature activation+3.37e+00

Pos logit contributions
 Bub+0.060
 mar+0.058
 wra+0.058
 Nib+0.057
endant+0.056
Neg logit contributions
zed-0.086
gh-0.086
ze-0.083
 aids-0.081
ainer-0.080
Tokenang
Token ID648
Feature activation+2.52e-02

Pos logit contributions
 Bub+0.084
 wra+0.080
 Nib+0.080
 mar+0.078
 buried+0.078
Neg logit contributions
gh-0.098
ze-0.097
zed-0.097
over-0.096
aster-0.095
Tokenaroo
Token ID38049
Feature activation+1.20e+00

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.000
 Ell+0.000
 ot+0.000
Neg logit contributions
zed-0.001
ainer-0.001
 aids-0.001
gh-0.001
aster-0.001
Token!".
Token ID48220
Feature activation+2.51e+00

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.024
endant+0.024
Neg logit contributions
zed-0.043
gh-0.042
 aids-0.041
ainer-0.040
ze-0.040
Token The
Token ID383
Feature activation+1.80e+00

Pos logit contributions
 mar+0.058
 Bub+0.057
 wra+0.057
 Nib+0.056
endant+0.055
Neg logit contributions
zed-0.086
gh-0.080
 aids-0.078
ze-0.078
ainer-0.076
Token k
Token ID479
Feature activation+3.94e+00

Pos logit contributions
 Bub+0.033
endant+0.032
 Nib+0.032
 mar+0.031
 wra+0.031
Neg logit contributions
zed-0.069
ze-0.067
ainer-0.067
gh-0.066
 aids-0.066
Tokenang
Token ID648
Feature activation+1.04e+00

Pos logit contributions
 Bub+0.099
 Nib+0.095
 wra+0.095
 mar+0.093
endant+0.091
Neg logit contributions
zed-0.116
ze-0.115
gh-0.112
ap-0.111
ainer-0.111
Tokenaroo
Token ID38049
Feature activation+1.50e+00

Pos logit contributions
 Bub+0.028
 mar+0.027
 Nib+0.027
 wra+0.026
endant+0.026
Neg logit contributions
zed-0.032
gh-0.031
 aids-0.030
ainer-0.030
aster-0.029
Token stopped
Token ID5025
Feature activation+4.59e-01

Pos logit contributions
 wra+0.025
 Bub+0.025
 Nib+0.025
 mar+0.025
endant+0.024
Neg logit contributions
zed-0.061
gh-0.057
ze-0.057
 aids-0.056
 sentence-0.053
Token jumping
Token ID14284
Feature activation+7.19e-01

Pos logit contributions
 Bub+0.010
 mar+0.009
 Nib+0.009
 wra+0.009
endant+0.009
Neg logit contributions
zed-0.017
gh-0.016
 aids-0.016
ainer-0.016
ze-0.016
Token and
Token ID290
Feature activation+1.95e-02

Pos logit contributions
 Bub+0.012
 mar+0.011
 Nib+0.011
 wra+0.011
endant+0.010
Neg logit contributions
zed-0.030
gh-0.029
 aids-0.028
ainer-0.028
ze-0.028
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 Bub+0.038
 mar+0.037
 Nib+0.036
 wra+0.035
endant+0.035
Neg logit contributions
zed-0.063
gh-0.062
 aids-0.059
ze-0.059
ainer-0.059
Token little
Token ID1310
Feature activation+2.63e+00

Pos logit contributions
endant+0.039
 mar+0.036
 Bub+0.036
 wra+0.034
 buried+0.034
Neg logit contributions
ze-0.044
zed-0.044
 aids-0.043
 sentence-0.042
aster-0.042
Token boys
Token ID6510
Feature activation+3.16e-01

Pos logit contributions
 Bub+0.055
 mar+0.053
 Nib+0.052
endant+0.051
 wra+0.050
Neg logit contributions
zed-0.093
gh-0.091
ainer-0.090
ze-0.089
 aids-0.088
Token,"
Token ID553
Feature activation-8.01e-01

Pos logit contributions
 Bub+0.007
 mar+0.007
 Nib+0.007
 wra+0.007
endant+0.007
Neg logit contributions
zed-0.010
gh-0.010
aster-0.010
ze-0.010
 aids-0.010
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
aster+0.021
 aids+0.018
zed+0.018
 sentence+0.017
ainer+0.017
Neg logit contributions
endant-0.025
 wra-0.024
 haven-0.023
unn-0.023
 mar-0.022
Token k
Token ID479
Feature activation+3.93e+00

Pos logit contributions
 Bub+0.038
 mar+0.036
 Nib+0.036
 wra+0.034
endant+0.033
Neg logit contributions
zed-0.104
gh-0.101
 aids-0.098
ainer-0.098
ze-0.098
Tokenang
Token ID648
Feature activation+8.81e-01

Pos logit contributions
 Bub+0.097
 wra+0.094
 mar+0.093
 Nib+0.091
endant+0.091
Neg logit contributions
zed-0.119
ze-0.115
gh-0.115
aster-0.114
ainer-0.113
Tokenaroo
Token ID38049
Feature activation+1.07e+00

Pos logit contributions
 Bub+0.022
 Nib+0.021
 mar+0.021
 wra+0.019
 ot+0.019
Neg logit contributions
zed-0.030
gh-0.029
ainer-0.028
 aids-0.028
aster-0.027
Token says
Token ID1139
Feature activation-5.68e-01

Pos logit contributions
 Bub+0.016
 mar+0.015
 Nib+0.014
 wra+0.014
endant+0.013
Neg logit contributions
zed-0.046
gh-0.045
 aids-0.044
ainer-0.044
ze-0.043
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
zed+0.024
gh+0.023
 aids+0.023
ainer+0.023
aster+0.022
Neg logit contributions
 Bub-0.009
 Nib-0.009
 mar-0.008
 wra-0.008
endant-0.008
Token "
Token ID366
Feature activation-2.69e-01

Pos logit contributions
zed+0.014
gh+0.014
 aids+0.013
ze+0.013
ainer+0.013
Neg logit contributions
 Bub-0.008
 Nib-0.007
 mar-0.007
 H-0.007
 wra-0.006
Token"
Token ID1
Feature activation+1.76e-01

Pos logit contributions
zed+0.002
 aids+0.002
gh+0.002
ainer+0.001
ze+0.001
Neg logit contributions
 Bub-0.002
 mar-0.002
 Nib-0.002
 wra-0.002
 could-0.002
TokenRo
Token ID15450
Feature activation+1.39e-01

Pos logit contributions
 Bub+0.005
 mar+0.004
 Nib+0.004
 wra+0.004
 ot+0.004
Neg logit contributions
zed-0.006
gh-0.006
 aids-0.005
ainer-0.005
aster-0.005
Tokenar
Token ID283
Feature activation-6.40e-01

Pos logit contributions
 Bub+0.005
 mar+0.004
 Nib+0.004
 wra+0.004
endant+0.004
Neg logit contributions
zed-0.004
 aids-0.003
gh-0.003
ainer-0.003
ze-0.003
Token!"
Token ID2474
Feature activation-7.73e-01

Pos logit contributions
zed+0.027
 aids+0.026
ainer+0.026
gh+0.025
aster+0.025
Neg logit contributions
 mar-0.010
 Bub-0.010
 Nib-0.009
 said-0.009
 wra-0.009
Token the
Token ID262
Feature activation+2.29e+00

Pos logit contributions
zed+0.027
gh+0.026
 aids+0.025
ainer+0.025
aster+0.025
Neg logit contributions
 Bub-0.018
 mar-0.017
 Nib-0.017
 wra-0.017
endant-0.016
Token k
Token ID479
Feature activation+3.89e+00

Pos logit contributions
 Bub+0.033
 mar+0.031
 Nib+0.031
 wra+0.028
endant+0.026
Neg logit contributions
zed-0.101
gh-0.098
 aids-0.096
ainer-0.096
aster-0.095
Tokenang
Token ID648
Feature activation+8.35e-01

Pos logit contributions
 Bub+0.092
 mar+0.087
 wra+0.087
 Nib+0.086
endant+0.085
Neg logit contributions
zed-0.127
gh-0.123
ze-0.122
aster-0.120
ainer-0.120
Tokenaroo
Token ID38049
Feature activation+8.38e-01

Pos logit contributions
 Bub+0.019
 Nib+0.018
 mar+0.017
 ot+0.017
 Ell+0.016
Neg logit contributions
zed-0.030
gh-0.029
 aids-0.029
ainer-0.028
aster-0.027
Token says
Token ID1139
Feature activation-5.39e-01

Pos logit contributions
 Bub+0.014
 mar+0.013
 Nib+0.013
 wra+0.012
endant+0.012
Neg logit contributions
zed-0.035
gh-0.034
ainer-0.033
 aids-0.033
aster-0.033
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
zed+0.024
gh+0.023
aster+0.023
ainer+0.023
 aids+0.022
Neg logit contributions
 Bub-0.007
 Nib-0.007
 mar-0.007
 wra-0.006
ocular-0.006
Token
Token ID198
Feature activation-2.80e-02

Pos logit contributions
gh+0.019
zed+0.019
ze+0.018
 aids+0.018
ainer+0.018
Neg logit contributions
 Bub-0.008
 Nib-0.008
 Tim-0.007
 H-0.007
 could-0.007
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
 Bub+0.026
 Nib+0.025
 mar+0.024
 wra+0.024
endant+0.024
Neg logit contributions
gh-0.038
zed-0.037
aster-0.036
 aids-0.036
ze-0.036
Token long
Token ID890
Feature activation-5.39e-02

Pos logit contributions
 Bub+0.019
unn+0.019
 Nib+0.019
ll+0.018
 mar+0.018
Neg logit contributions
gh-0.019
 aids-0.018
zed-0.018
ze-0.018
 sentence-0.018
Token stick
Token ID4859
Feature activation-4.36e-01

Pos logit contributions
zed+0.002
gh+0.002
ainer+0.002
 aids+0.002
aster+0.002
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.001
endant-0.001
Token.
Token ID13
Feature activation+5.15e-02

Pos logit contributions
zed+0.015
gh+0.015
ze+0.014
 aids+0.014
ainer+0.014
Neg logit contributions
 Bub-0.010
 Nib-0.010
 mar-0.009
 wra-0.009
endant-0.009
Token The
Token ID383
Feature activation+1.94e+00

Pos logit contributions
 mar+0.002
 wra+0.002
 Bub+0.001
endant+0.001
 Nib+0.001
Neg logit contributions
zed-0.001
 aids-0.001
ainer-0.001
aster-0.001
gh-0.001
Token k
Token ID479
Feature activation+3.89e+00

Pos logit contributions
 Bub+0.037
 Nib+0.035
endant+0.034
 mar+0.034
 wra+0.034
Neg logit contributions
zed-0.073
gh-0.071
ze-0.070
ainer-0.070
 aids-0.068
Tokenang
Token ID648
Feature activation+8.37e-01

Pos logit contributions
 Bub+0.099
 Nib+0.094
 wra+0.093
 mar+0.091
endant+0.090
Neg logit contributions
zed-0.113
ze-0.111
gh-0.110
aster-0.107
over-0.107
Tokenaroo
Token ID38049
Feature activation+6.48e-01

Pos logit contributions
 Bub+0.023
 mar+0.023
 Nib+0.022
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.026
gh-0.024
 aids-0.024
ainer-0.023
ze-0.023
Token snapped
Token ID20821
Feature activation-7.71e-02

Pos logit contributions
 Nib+0.014
 wra+0.013
 Bub+0.013
endant+0.012
 mar+0.012
Neg logit contributions
zed-0.024
ze-0.022
gh-0.022
 aids-0.021
 sentence-0.020
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
zed+0.003
gh+0.003
aster+0.003
 aids+0.003
ainer+0.003
Neg logit contributions
 Bub-0.002
 mar-0.002
 Nib-0.002
 wra-0.002
 buried-0.002
Token stick
Token ID4859
Feature activation-1.01e+00

Pos logit contributions
 Bub+0.032
 Nib+0.030
endant+0.029
 mar+0.029
 Ell+0.029
Neg logit contributions
gh-0.031
zed-0.031
ze-0.031
ainer-0.030
 sentence-0.030
Token He
Token ID679
Feature activation+9.37e-01

Pos logit contributions
 Bub+0.011
 mar+0.011
 Nib+0.011
 wra+0.011
endant+0.010
Neg logit contributions
zed-0.015
gh-0.015
 aids-0.014
ainer-0.014
aster-0.014
Token looked
Token ID3114
Feature activation+9.36e-01

Pos logit contributions
 Nib+0.018
 Bub+0.017
 wra+0.017
endant+0.017
 mar+0.017
Neg logit contributions
zed-0.038
ze-0.034
gh-0.034
 aids-0.033
 sentence-0.032
Token up
Token ID510
Feature activation+2.08e-01

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.025
endant+0.024
Neg logit contributions
zed-0.028
gh-0.027
 aids-0.026
ainer-0.026
ze-0.025
Token at
Token ID379
Feature activation+1.62e+00

Pos logit contributions
 Bub+0.004
 mar+0.004
 Nib+0.004
 wra+0.003
 buried+0.003
Neg logit contributions
zed-0.008
gh-0.008
 aids-0.008
ainer-0.008
ze-0.008
Token the
Token ID262
Feature activation+2.74e+00

Pos logit contributions
endant+0.047
 wra+0.047
 mar+0.045
 Bub+0.045
 Nib+0.044
Neg logit contributions
zed-0.046
aster-0.041
gh-0.041
 aids-0.041
ainer-0.041
Token k
Token ID479
Feature activation+3.88e+00

Pos logit contributions
 Bub+0.056
endant+0.052
 Nib+0.052
 mar+0.051
 wra+0.049
Neg logit contributions
ze-0.097
zed-0.097
ainer-0.094
 aids-0.093
 sentence-0.093
Tokenang
Token ID648
Feature activation+1.27e+00

Pos logit contributions
 Bub+0.090
 Nib+0.085
 wra+0.084
 mar+0.084
endant+0.080
Neg logit contributions
zed-0.129
gh-0.126
ze-0.123
ainer-0.122
aster-0.122
Tokenaroo
Token ID38049
Feature activation+1.76e+00

Pos logit contributions
 Bub+0.031
 mar+0.029
 Nib+0.029
 wra+0.027
 ot+0.027
Neg logit contributions
zed-0.043
gh-0.042
 aids-0.041
ainer-0.041
aster-0.040
Token and
Token ID290
Feature activation+7.43e-01

Pos logit contributions
 Bub+0.033
 mar+0.032
 Nib+0.031
 wra+0.031
endant+0.030
Neg logit contributions
zed-0.068
gh-0.065
ze-0.064
 aids-0.064
ainer-0.063
Token thanked
Token ID26280
Feature activation+1.16e+00

Pos logit contributions
 mar+0.010
 wra+0.010
 Bub+0.010
 Nib+0.010
endant+0.009
Neg logit contributions
zed-0.032
 aids-0.030
ze-0.030
gh-0.030
aster-0.029
Token it
Token ID340
Feature activation+7.25e-01

Pos logit contributions
 Bub+0.028
 mar+0.027
 Nib+0.027
 wra+0.026
endant+0.025
Neg logit contributions
zed-0.039
gh-0.038
 aids-0.036
ainer-0.036
ze-0.036
Token Lily
Token ID20037
Feature activation+2.71e-01

Pos logit contributions
aster+0.005
 aids+0.005
zed+0.004
 paragraph+0.004
 sentence+0.004
Neg logit contributions
ll-0.005
endant-0.005
unn-0.004
 haven-0.004
rou-0.004
Token says
Token ID1139
Feature activation-8.69e-02

Pos logit contributions
 Bub+0.006
 mar+0.006
endant+0.006
 haven+0.006
 Nib+0.006
Neg logit contributions
 aids-0.009
 sentence-0.009
ze-0.009
zed-0.009
gh-0.008
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
zed+0.003
gh+0.003
 aids+0.003
ainer+0.003
ze+0.003
Neg logit contributions
 Bub-0.002
 mar-0.002
 Nib-0.002
 wra-0.002
endant-0.002
Token "
Token ID366
Feature activation-5.62e-01

Pos logit contributions
zed+0.010
gh+0.010
ainer+0.010
 aids+0.009
aster+0.009
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.005
 wra-0.005
 H-0.005
TokenThe
Token ID464
Feature activation+1.61e+00

Pos logit contributions
zed+0.020
gh+0.020
 aids+0.019
ainer+0.019
 sentence+0.018
Neg logit contributions
 Bub-0.013
 wra-0.012
 Nib-0.012
 mar-0.012
 could-0.011
Token k
Token ID479
Feature activation+3.87e+00

Pos logit contributions
 Bub+0.034
endant+0.034
 mar+0.033
 wra+0.032
 Nib+0.032
Neg logit contributions
zed-0.055
ze-0.054
 aids-0.053
gh-0.053
ainer-0.053
Tokenang
Token ID648
Feature activation+7.40e-01

Pos logit contributions
 Bub+0.099
 wra+0.095
 Nib+0.094
 mar+0.093
endant+0.092
Neg logit contributions
zed-0.112
ze-0.111
gh-0.109
ainer-0.108
aster-0.107
Tokenaroo
Token ID38049
Feature activation+6.39e-01

Pos logit contributions
 Bub+0.018
 mar+0.017
 Nib+0.017
 wra+0.016
 ot+0.016
Neg logit contributions
zed-0.025
gh-0.024
ainer-0.024
 aids-0.023
aster-0.023
Token is
Token ID318
Feature activation-6.16e-01

Pos logit contributions
 Bub+0.014
 mar+0.013
 Nib+0.013
 wra+0.012
endant+0.012
Neg logit contributions
zed-0.024
gh-0.023
 aids-0.022
ze-0.022
ainer-0.022
Token not
Token ID407
Feature activation-7.54e-01

Pos logit contributions
zed+0.022
gh+0.021
ainer+0.021
 aids+0.020
aster+0.020
Neg logit contributions
 Bub-0.014
 mar-0.014
 Nib-0.014
 wra-0.013
 buried-0.013
Token from
Token ID422
Feature activation+2.43e-01

Pos logit contributions
zed+0.027
gh+0.026
ainer+0.026
 aids+0.025
aster+0.025
Neg logit contributions
 Bub-0.017
 mar-0.016
 Nib-0.016
 wra-0.015
endant-0.015
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
 Bub+0.026
 mar+0.025
 Nib+0.025
 wra+0.024
endant+0.024
Neg logit contributions
zed-0.034
gh-0.033
 aids-0.032
ainer-0.032
aster-0.031
Token other
Token ID584
Feature activation+2.39e+00

Pos logit contributions
 Bub+0.043
 mar+0.041
 Nib+0.041
endant+0.041
 wra+0.039
Neg logit contributions
zed-0.055
ze-0.052
gh-0.052
ainer-0.051
 aids-0.050
Token side
Token ID1735
Feature activation-4.68e-01

Pos logit contributions
 Bub+0.080
 mar+0.078
 Nib+0.077
endant+0.077
 wra+0.076
Neg logit contributions
zed-0.058
gh-0.054
ainer-0.054
 aids-0.053
ze-0.053
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
zed+0.017
gh+0.017
 aids+0.017
aster+0.017
ainer+0.017
Neg logit contributions
 Bub-0.010
 Nib-0.009
 mar-0.009
 wra-0.009
endant-0.009
Token The
Token ID383
Feature activation+2.03e+00

Pos logit contributions
 mar+0.004
 wra+0.004
 Bub+0.004
endant+0.004
 Nib+0.004
Neg logit contributions
zed-0.004
 aids-0.003
ainer-0.003
gh-0.003
aster-0.003
Token k
Token ID479
Feature activation+3.87e+00

Pos logit contributions
 Bub+0.038
endant+0.036
 mar+0.035
 Nib+0.035
 wra+0.034
Neg logit contributions
zed-0.076
ze-0.075
ainer-0.074
 aids-0.073
gh-0.073
Tokenang
Token ID648
Feature activation+1.01e+00

Pos logit contributions
 Bub+0.097
 Nib+0.092
 wra+0.092
 mar+0.091
endant+0.088
Neg logit contributions
zed-0.116
ze-0.114
gh-0.113
ainer-0.112
aster-0.111
Tokenaroo
Token ID38049
Feature activation+1.16e+00

Pos logit contributions
 Bub+0.025
 Nib+0.024
 mar+0.024
 wra+0.022
endant+0.022
Neg logit contributions
zed-0.034
gh-0.033
ainer-0.032
 aids-0.032
aster-0.031
Token is
Token ID318
Feature activation+3.56e-02

Pos logit contributions
 Bub+0.028
 mar+0.026
 Nib+0.026
 wra+0.025
endant+0.025
Neg logit contributions
zed-0.040
gh-0.038
 aids-0.037
ze-0.036
ainer-0.036
Token in
Token ID287
Feature activation+4.66e-01

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.001
gh-0.001
ainer-0.001
 aids-0.001
aster-0.001
Token a
Token ID257
Feature activation+6.13e-01

Pos logit contributions
 Bub+0.012
 mar+0.012
 Nib+0.012
 wra+0.011
endant+0.011
Neg logit contributions
zed-0.015
gh-0.015
 aids-0.014
ainer-0.014
aster-0.014
Token eager
Token ID11069
Feature activation-8.44e-01

Pos logit contributions
aster+0.024
 aids+0.022
zed+0.022
ze+0.021
ainer+0.021
Neg logit contributions
 wra-0.019
 Nib-0.018
ll-0.018
 Bub-0.018
 haven-0.018
Token little
Token ID1310
Feature activation-4.20e-01

Pos logit contributions
zed+0.027
aster+0.027
ainer+0.027
gh+0.026
 aids+0.026
Neg logit contributions
 Bub-0.019
 wra-0.019
endant-0.019
 mar-0.019
 Nib-0.019
Token girl
Token ID2576
Feature activation-2.26e-02

Pos logit contributions
zed+0.015
gh+0.015
aster+0.015
ainer+0.014
 aids+0.014
Neg logit contributions
 Bub-0.008
 Nib-0.008
 mar-0.008
endant-0.008
 wra-0.008
Token named
Token ID3706
Feature activation-2.75e-01

Pos logit contributions
zed+0.001
aster+0.001
gh+0.001
 aids+0.001
ze+0.001
Neg logit contributions
 Bub-0.000
 wra-0.000
 mar-0.000
 Nib-0.000
endant-0.000
Token Rachel
Token ID15984
Feature activation-5.20e-01

Pos logit contributions
zed+0.010
gh+0.009
aster+0.009
 aids+0.009
ainer+0.009
Neg logit contributions
 mar-0.006
endant-0.006
 Nib-0.006
 wra-0.006
 ot-0.005
Token.
Token ID13
Feature activation-4.10e+00

Pos logit contributions
zed+0.019
gh+0.018
ze+0.017
aster+0.017
 aids+0.017
Neg logit contributions
 Bub-0.012
 mar-0.011
 Nib-0.011
 wra-0.011
endant-0.011
Token Rachel
Token ID15984
Feature activation+6.15e-01

Pos logit contributions
gh+0.166
ze+0.165
zed+0.164
ainer+0.150
aster+0.149
Neg logit contributions
 could-0.073
 Tim-0.069
 were-0.067
 Bub-0.066
 haven-0.065
Token loved
Token ID6151
Feature activation+4.96e-01

Pos logit contributions
 Bub+0.010
 Nib+0.010
 mar+0.010
 wra+0.009
endant+0.009
Neg logit contributions
zed-0.026
gh-0.025
ze-0.024
 aids-0.024
aster-0.024
Token to
Token ID284
Feature activation-1.28e+00

Pos logit contributions
 mar+0.012
 Bub+0.012
 Nib+0.011
 wra+0.011
 H+0.011
Neg logit contributions
zed-0.017
ainer-0.017
gh-0.017
 aids-0.016
ze-0.016
Token organize
Token ID16481
Feature activation+1.23e+00

Pos logit contributions
zed+0.051
gh+0.050
ainer+0.049
 aids+0.049
ze+0.047
Neg logit contributions
 Bub-0.023
 mar-0.023
 Nib-0.021
 buried-0.020
 c-0.020
Token things
Token ID1243
Feature activation+2.33e+00

Pos logit contributions
 Bub+0.038
 Nib+0.038
endant+0.037
 haven+0.036
 mar+0.035
Neg logit contributions
zed-0.033
gh-0.031
 aids-0.030
ainer-0.029
ze-0.029
Token a
Token ID257
Feature activation-2.40e-01

Pos logit contributions
endant+0.018
ocular+0.018
 Bub+0.018
 wra+0.018
 Nib+0.017
Neg logit contributions
zed-0.017
aster-0.016
gh-0.016
ainer-0.015
ze-0.015
Token little
Token ID1310
Feature activation+5.20e-01

Pos logit contributions
aster+0.007
zed+0.007
gh+0.007
ze+0.007
ainer+0.007
Neg logit contributions
endant-0.006
ocular-0.006
 Bub-0.006
 Nib-0.006
unn-0.005
Token girl
Token ID2576
Feature activation+2.34e-01

Pos logit contributions
 Bub+0.011
endant+0.011
 Nib+0.011
ocular+0.011
 haven+0.011
Neg logit contributions
aster-0.016
ze-0.016
ainer-0.016
ap-0.016
over-0.015
Token named
Token ID3706
Feature activation-4.20e-02

Pos logit contributions
 Bub+0.004
 wra+0.004
 Nib+0.004
 mar+0.004
endant+0.004
Neg logit contributions
zed-0.009
aster-0.009
gh-0.009
ze-0.009
 aids-0.009
Token Rachel
Token ID15984
Feature activation-6.81e-01

Pos logit contributions
zed+0.001
aster+0.001
gh+0.001
 aids+0.001
ainer+0.001
Neg logit contributions
 mar-0.001
endant-0.001
 wra-0.001
 Nib-0.001
 ot-0.001
Token.
Token ID13
Feature activation-4.04e+00

Pos logit contributions
zed+0.024
gh+0.023
ze+0.023
aster+0.023
 aids+0.023
Neg logit contributions
 Bub-0.015
 mar-0.015
 Nib-0.015
 wra-0.014
endant-0.014
Token She
Token ID1375
Feature activation+9.14e-01

Pos logit contributions
gh+0.144
ze+0.144
zed+0.144
 aids+0.133
aster+0.130
Neg logit contributions
 could-0.092
 were-0.086
 Bub-0.086
 Tim-0.086
 said-0.084
Token was
Token ID373
Feature activation-1.63e-01

Pos logit contributions
 Bub+0.016
 Nib+0.016
ocular+0.015
endant+0.015
 wra+0.015
Neg logit contributions
zed-0.038
gh-0.037
ze-0.035
 aids-0.034
aster-0.034
Token three
Token ID1115
Feature activation+5.55e-01

Pos logit contributions
zed+0.005
ze+0.005
gh+0.005
aster+0.005
 aids+0.004
Neg logit contributions
endant-0.004
 wra-0.004
 Bub-0.004
 mar-0.004
unn-0.004
Token years
Token ID812
Feature activation-2.97e-01

Pos logit contributions
 Bub+0.015
 Nib+0.015
endant+0.014
 mar+0.013
 haven+0.013
Neg logit contributions
zed-0.018
 aids-0.017
gh-0.017
ze-0.016
aster-0.016
Token old
Token ID1468
Feature activation+2.70e+00

Pos logit contributions
zed+0.010
aster+0.010
 aids+0.010
ainer+0.010
gh+0.010
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.006
 wra-0.006
 haven-0.006
Token that
Token ID326
Feature activation-5.36e-01

Pos logit contributions
 Bub+0.015
 Nib+0.014
 mar+0.014
 haven+0.014
 wra+0.013
Neg logit contributions
 aids-0.019
gh-0.019
zed-0.019
 sentence-0.018
aster-0.018
Token said
Token ID531
Feature activation-1.02e+00

Pos logit contributions
zed+0.022
 aids+0.021
gh+0.021
rac+0.020
ze+0.020
Neg logit contributions
 Bub-0.008
 Nib-0.007
 mar-0.007
unn-0.007
endant-0.007
Token "
Token ID366
Feature activation+3.48e-01

Pos logit contributions
zed+0.036
ainer+0.035
gh+0.034
aster+0.034
 aids+0.034
Neg logit contributions
 Bub-0.024
 Nib-0.023
 mar-0.022
 Ell-0.020
 wra-0.020
TokenO
Token ID46
Feature activation+1.83e-01

Pos logit contributions
 Bub+0.010
 mar+0.010
 Nib+0.010
 wra+0.009
endant+0.009
Neg logit contributions
zed-0.011
gh-0.010
 aids-0.010
ainer-0.010
ze-0.009
Tokenst
Token ID301
Feature activation-1.12e+00

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.005
endant+0.005
Neg logit contributions
zed-0.005
 aids-0.005
gh-0.005
ainer-0.005
aster-0.005
Tokenric
Token ID1173
Feature activation-4.02e+00

Pos logit contributions
zed+0.032
gh+0.030
ainer+0.030
aster+0.029
 aids+0.029
Neg logit contributions
 Bub-0.034
 Nib-0.032
 mar-0.031
endant-0.030
ocular-0.030
Tokenhes
Token ID956
Feature activation-1.96e+00

Pos logit contributions
ainer+0.100
zed+0.096
gh+0.095
 paragraph+0.091
 avail+0.086
Neg logit contributions
 could-0.124
 Bub-0.116
 Nib-0.116
 mar-0.114
 called-0.113
Token".
Token ID1911
Feature activation+1.18e+00

Pos logit contributions
zed+0.079
gh+0.075
ainer+0.074
aster+0.072
 aids+0.070
Neg logit contributions
 could-0.032
 Bub-0.032
 mar-0.031
 were-0.030
 said-0.030
Token Sara
Token ID24799
Feature activation+3.15e-01

Pos logit contributions
 Bub+0.020
 Nib+0.019
 mar+0.019
 wra+0.018
 Ell+0.017
Neg logit contributions
zed-0.049
gh-0.048
ze-0.046
ainer-0.046
 aids-0.046
Token and
Token ID290
Feature activation-5.28e-01

Pos logit contributions
 Bub+0.007
 Nib+0.006
 mar+0.006
 wra+0.006
endant+0.006
Neg logit contributions
zed-0.011
ze-0.011
gh-0.011
 aids-0.011
 sentence-0.011
Token Tom
Token ID4186
Feature activation+1.97e-01

Pos logit contributions
zed+0.024
gh+0.024
ainer+0.023
 aids+0.023
ze+0.023
Neg logit contributions
 mar-0.006
 Bub-0.006
 Nib-0.006
 wra-0.006
 Tim-0.006
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
endant+0.024
 Bub+0.024
ocular+0.024
 wra+0.023
 Nib+0.023
Neg logit contributions
zed-0.023
gh-0.022
aster-0.022
ze-0.021
ainer-0.021
Token little
Token ID1310
Feature activation+6.80e-01

Pos logit contributions
endant+0.000
ocular+0.000
 Bub+0.000
 wra+0.000
 Nib+0.000
Neg logit contributions
aster-0.000
ze-0.000
gh-0.000
zed-0.000
ainer-0.000
Token girl
Token ID2576
Feature activation+4.69e-01

Pos logit contributions
 Bub+0.014
endant+0.014
 Nib+0.014
ocular+0.014
 wra+0.014
Neg logit contributions
aster-0.022
ze-0.022
ainer-0.021
gh-0.021
over-0.021
Token named
Token ID3706
Feature activation+3.35e-02

Pos logit contributions
 Bub+0.008
 wra+0.008
 Nib+0.008
endant+0.008
 mar+0.008
Neg logit contributions
zed-0.019
aster-0.018
gh-0.018
ze-0.018
 aids-0.018
Token Rachel
Token ID15984
Feature activation-3.44e-01

Pos logit contributions
 mar+0.001
 wra+0.001
endant+0.001
 Nib+0.001
 Bub+0.001
Neg logit contributions
zed-0.001
gh-0.001
 aids-0.001
aster-0.001
ainer-0.001
Token.
Token ID13
Feature activation-3.97e+00

Pos logit contributions
zed+0.012
gh+0.012
ze+0.012
aster+0.012
 aids+0.012
Neg logit contributions
 Bub-0.008
 mar-0.007
 Nib-0.007
 wra-0.007
endant-0.007
Token She
Token ID1375
Feature activation+1.03e+00

Pos logit contributions
gh+0.145
ze+0.144
zed+0.143
 aids+0.133
aster+0.131
Neg logit contributions
 could-0.088
 were-0.081
 Tim-0.081
 Bub-0.079
 haven-0.079
Token was
Token ID373
Feature activation+1.16e-02

Pos logit contributions
 Bub+0.018
 Nib+0.018
endant+0.017
 wra+0.017
ocular+0.017
Neg logit contributions
zed-0.043
gh-0.041
ze-0.040
 aids-0.039
aster-0.038
Token very
Token ID845
Feature activation+7.87e-01

Pos logit contributions
endant+0.000
 wra+0.000
 Bub+0.000
 mar+0.000
ocular+0.000
Neg logit contributions
zed-0.000
ze-0.000
gh-0.000
 aids-0.000
aster-0.000
Token excited
Token ID6568
Feature activation-5.69e-01

Pos logit contributions
 Bub+0.021
 wra+0.021
 Nib+0.020
endant+0.020
 mar+0.020
Neg logit contributions
zed-0.024
gh-0.023
ze-0.022
aster-0.022
 aids-0.022
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
zed+0.024
ainer+0.023
gh+0.023
 aids+0.023
aster+0.023
Neg logit contributions
 mar-0.009
 Bub-0.009
 Nib-0.008
 buried-0.008
 wra-0.008
Token in
Token ID287
Feature activation-7.20e-01

Pos logit contributions
 Bub+0.014
 Nib+0.013
 mar+0.013
 wra+0.013
endant+0.012
Neg logit contributions
zed-0.024
gh-0.023
 aids-0.022
aster-0.022
ainer-0.022
Token the
Token ID262
Feature activation-7.79e-02

Pos logit contributions
zed+0.025
gh+0.025
ainer+0.024
 aids+0.024
ze+0.023
Neg logit contributions
 Bub-0.016
 mar-0.016
 Nib-0.016
 wra-0.015
endant-0.014
Token jungle
Token ID20712
Feature activation+3.75e-02

Pos logit contributions
zed+0.003
gh+0.003
ainer+0.002
 aids+0.002
aster+0.002
Neg logit contributions
 Bub-0.002
 mar-0.002
 Nib-0.002
 wra-0.002
endant-0.002
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.002
gh-0.001
ainer-0.001
 aids-0.001
ze-0.001
Token Every
Token ID3887
Feature activation-9.70e-01

Pos logit contributions
gh+0.132
ze+0.130
zed+0.127
ainer+0.118
 aids+0.116
Neg logit contributions
 could-0.058
 Tim-0.056
 Bub-0.055
 haven-0.055
 said-0.055
Token night
Token ID1755
Feature activation-3.97e+00

Pos logit contributions
ainer+0.039
gh+0.037
ags+0.037
zed+0.037
ze+0.036
Neg logit contributions
 Bub-0.015
 buried-0.014
 was-0.014
 called-0.014
 could-0.014
Token,
Token ID11
Feature activation-3.48e+00

Pos logit contributions
gh+0.190
zed+0.188
ze+0.186
ags+0.186
ainer+0.182
Neg logit contributions
 could-0.039
 said-0.031
 called-0.030
 since-0.029
 couldn-0.029
Token she
Token ID673
Feature activation-8.20e-01

Pos logit contributions
gh+0.139
zed+0.134
ze+0.132
 aids+0.130
ainer+0.130
Neg logit contributions
 could-0.064
 Bub-0.059
 couldn-0.059
 said-0.058
 Tim-0.057
Token heard
Token ID2982
Feature activation-5.45e-01

Pos logit contributions
zed+0.035
gh+0.035
ainer+0.035
 aids+0.034
aster+0.033
Neg logit contributions
 Bub-0.012
 mar-0.011
 Nib-0.011
 wra-0.010
 buried-0.010
Token the
Token ID262
Feature activation-8.82e-01

Pos logit contributions
zed+0.019
gh+0.018
ainer+0.018
aster+0.017
 aids+0.017
Neg logit contributions
 Bub-0.013
 mar-0.012
 Nib-0.012
 wra-0.012
endant-0.011
Token mosquit
Token ID33373
Feature activation-9.89e-01

Pos logit contributions
zed+0.024
gh+0.023
 aids+0.022
ainer+0.022
ze+0.022
Neg logit contributions
 Bub-0.027
 mar-0.027
 Nib-0.026
 wra-0.026
endant-0.025
Token.
Token ID13
Feature activation-4.86e-01

Pos logit contributions
zed+0.006
gh+0.006
 aids+0.006
ze+0.006
ainer+0.006
Neg logit contributions
 Bub-0.003
 mar-0.003
 Nib-0.003
 wra-0.003
endant-0.003
Token They
Token ID1119
Feature activation+4.41e-01

Pos logit contributions
 aids+0.014
zed+0.014
ainer+0.013
aster+0.013
gh+0.013
Neg logit contributions
 mar-0.013
 wra-0.013
endant-0.013
 Bub-0.012
 buried-0.012
Token were
Token ID547
Feature activation+2.28e-01

Pos logit contributions
endant+0.008
 wra+0.008
 Bub+0.008
 Nib+0.008
 mar+0.008
Neg logit contributions
zed-0.016
ze-0.015
 aids-0.015
aster-0.015
gh-0.015
Token called
Token ID1444
Feature activation-3.78e-01

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.004
endant+0.004
Neg logit contributions
zed-0.009
gh-0.008
ainer-0.008
aster-0.008
 aids-0.008
Token ost
Token ID23619
Feature activation-3.59e-01

Pos logit contributions
zed+0.013
gh+0.012
ainer+0.012
 aids+0.011
aster+0.011
Neg logit contributions
 Bub-0.010
 Nib-0.009
 mar-0.009
 wra-0.009
endant-0.009
Tokenric
Token ID1173
Feature activation-3.95e+00

Pos logit contributions
zed+0.011
gh+0.010
ainer+0.010
 aids+0.010
aster+0.010
Neg logit contributions
 Bub-0.010
 mar-0.010
 Nib-0.010
endant-0.009
 wra-0.009
Tokenhes
Token ID956
Feature activation-1.29e+00

Pos logit contributions
ainer+0.100
gh+0.092
zed+0.090
 paragraph+0.089
 sentence+0.085
Neg logit contributions
 could-0.123
 Bub-0.118
 Nib-0.118
 mar-0.117
 were-0.115
Token.
Token ID13
Feature activation-6.91e-02

Pos logit contributions
zed+0.054
gh+0.053
ainer+0.052
aster+0.051
 aids+0.051
Neg logit contributions
 Bub-0.020
 mar-0.019
 Nib-0.018
 wra-0.017
 could-0.016
Token Lily
Token ID20037
Feature activation+3.72e-01

Pos logit contributions
zed+0.002
 aids+0.002
ainer+0.002
aster+0.002
gh+0.002
Neg logit contributions
 mar-0.002
 wra-0.002
endant-0.002
 Bub-0.002
 buried-0.002
Token and
Token ID290
Feature activation+3.25e-01

Pos logit contributions
 Bub+0.008
 Nib+0.007
endant+0.007
 mar+0.007
 wra+0.007
Neg logit contributions
zed-0.014
ze-0.014
gh-0.013
 aids-0.013
 sentence-0.012
Token Ben
Token ID3932
Feature activation-2.25e-01

Pos logit contributions
 Bub+0.007
 mar+0.007
endant+0.006
 Nib+0.006
 wra+0.006
Neg logit contributions
zed-0.012
gh-0.011
 aids-0.011
aster-0.011
ze-0.011
Token small
Token ID1402
Feature activation-5.92e-01

Pos logit contributions
zed+0.043
gh+0.041
 aids+0.040
ainer+0.040
ze+0.040
Neg logit contributions
 Bub-0.032
 mar-0.031
 Nib-0.031
 wra-0.030
endant-0.029
Token bird
Token ID6512
Feature activation-1.51e+00

Pos logit contributions
zed+0.023
gh+0.022
 aids+0.021
ainer+0.021
aster+0.021
Neg logit contributions
 Bub-0.012
 mar-0.012
 Nib-0.011
 wra-0.011
endant-0.011
Token needed
Token ID2622
Feature activation-4.63e-01

Pos logit contributions
zed+0.054
ze+0.051
gh+0.049
aster+0.048
 aids+0.048
Neg logit contributions
 wra-0.033
 Nib-0.033
 Bub-0.033
 mar-0.033
endant-0.031
Token a
Token ID257
Feature activation-5.97e-01

Pos logit contributions
zed+0.014
gh+0.014
 aids+0.013
ainer+0.013
aster+0.013
Neg logit contributions
 Bub-0.012
 mar-0.012
 Nib-0.012
 wra-0.012
endant-0.011
Token tw
Token ID665
Feature activation-1.62e+00

Pos logit contributions
zed+0.020
gh+0.019
 aids+0.019
ainer+0.018
aster+0.018
Neg logit contributions
 Bub-0.015
 mar-0.014
 Nib-0.014
 wra-0.014
endant-0.013
Tokenig
Token ID328
Feature activation-3.92e+00

Pos logit contributions
 aids+0.060
zed+0.056
aster+0.055
 sentence+0.054
ainer+0.054
Neg logit contributions
endant-0.036
 mar-0.034
 buried-0.032
 Bub-0.031
unn-0.030
Token for
Token ID329
Feature activation-6.74e-01

Pos logit contributions
zed+0.132
gh+0.126
 aids+0.124
ainer+0.123
aster+0.122
Neg logit contributions
 Bub-0.095
 mar-0.092
 Nib-0.091
 wra-0.087
endant-0.087
Token her
Token ID607
Feature activation+5.99e-01

Pos logit contributions
zed+0.023
gh+0.022
 aids+0.022
ainer+0.022
ze+0.021
Neg logit contributions
 Bub-0.017
 mar-0.016
 Nib-0.016
 wra-0.015
endant-0.015
Token nest
Token ID16343
Feature activation-8.20e-01

Pos logit contributions
 Bub+0.017
 Nib+0.016
 wra+0.016
 mar+0.016
endant+0.015
Neg logit contributions
gh-0.017
zed-0.017
ze-0.017
 sentence-0.016
 aids-0.015
Token.
Token ID13
Feature activation-6.40e-01

Pos logit contributions
zed+0.030
gh+0.029
ainer+0.028
 aids+0.028
ze+0.027
Neg logit contributions
 Bub-0.018
 mar-0.017
 Nib-0.017
 wra-0.016
endant-0.016
Token She
Token ID1375
Feature activation-9.83e-01

Pos logit contributions
zed+0.023
gh+0.023
ainer+0.022
aster+0.022
 aids+0.022
Neg logit contributions
 Bub-0.014
 Nib-0.014
 mar-0.013
 wra-0.013
 Ell-0.012
Token was
Token ID373
Feature activation+7.07e-01

Pos logit contributions
zed+0.031
gh+0.029
ze+0.027
ainer+0.026
aster+0.026
Neg logit contributions
endant-0.016
ocular-0.014
 wra-0.014
unn-0.014
 Nib-0.014
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
endant+0.021
ocular+0.021
 Bub+0.021
 Nib+0.020
 wra+0.020
Neg logit contributions
zed-0.019
aster-0.018
gh-0.018
ze-0.018
ainer-0.018
Token girl
Token ID2576
Feature activation+5.85e-01

Pos logit contributions
endant+0.000
ocular+0.000
 Bub+0.000
 Nib+0.000
 wra+0.000
Neg logit contributions
ze-0.000
zed-0.000
gh-0.000
aster-0.000
ainer-0.000
Token named
Token ID3706
Feature activation+4.12e-01

Pos logit contributions
 Bub+0.010
 wra+0.010
 Nib+0.010
 mar+0.009
endant+0.009
Neg logit contributions
zed-0.023
aster-0.023
ze-0.023
gh-0.023
 aids-0.022
Token Rachel
Token ID15984
Feature activation-3.51e-01

Pos logit contributions
 mar+0.008
 wra+0.007
endant+0.007
 Nib+0.007
 Bub+0.007
Neg logit contributions
zed-0.016
aster-0.015
gh-0.015
 aids-0.015
ainer-0.015
Token.
Token ID13
Feature activation-3.91e+00

Pos logit contributions
zed+0.012
gh+0.012
aster+0.012
ze+0.011
 aids+0.011
Neg logit contributions
 Bub-0.008
 mar-0.008
 Nib-0.008
 wra-0.008
endant-0.008
Token Rachel
Token ID15984
Feature activation+6.87e-01

Pos logit contributions
zed+0.135
gh+0.134
ze+0.132
 aids+0.128
aster+0.124
Neg logit contributions
 could-0.090
 Bub-0.090
 Tim-0.086
 haven-0.085
 were-0.083
Token was
Token ID373
Feature activation+1.94e-01

Pos logit contributions
 Bub+0.012
 Nib+0.012
 mar+0.011
 wra+0.011
endant+0.011
Neg logit contributions
zed-0.029
gh-0.027
ze-0.027
 aids-0.026
aster-0.026
Token very
Token ID845
Feature activation+8.08e-01

Pos logit contributions
endant+0.005
 wra+0.005
 Bub+0.005
 mar+0.005
ocular+0.005
Neg logit contributions
zed-0.006
ze-0.005
gh-0.005
 aids-0.005
aster-0.005
Token hungry
Token ID14720
Feature activation+5.35e-01

Pos logit contributions
 Bub+0.022
 wra+0.021
 Nib+0.021
 mar+0.021
endant+0.021
Neg logit contributions
zed-0.024
gh-0.023
aster-0.022
ze-0.022
 aids-0.022
Token,
Token ID11
Feature activation-5.52e-01

Pos logit contributions
 Bub+0.014
endant+0.013
 Nib+0.013
 mar+0.012
unn+0.011
Neg logit contributions
zed-0.017
gh-0.017
aster-0.017
ze-0.017
ainer-0.017
Token sweet
Token ID6029
Feature activation-1.10e+00

Pos logit contributions
aster+0.005
ze+0.005
gh+0.005
zed+0.005
ainer+0.004
Neg logit contributions
endant-0.004
ocular-0.004
 Bub-0.004
 Nib-0.004
unn-0.004
Token little
Token ID1310
Feature activation+7.55e-01

Pos logit contributions
zed+0.036
aster+0.035
ze+0.035
 aids+0.035
ainer+0.035
Neg logit contributions
 Bub-0.025
 wra-0.024
endant-0.024
 Nib-0.024
ocular-0.024
Token girl
Token ID2576
Feature activation+3.92e-01

Pos logit contributions
 Bub+0.016
endant+0.016
 Nib+0.016
 wra+0.016
 haven+0.015
Neg logit contributions
aster-0.024
ze-0.023
ainer-0.023
 aids-0.023
gh-0.023
Token called
Token ID1444
Feature activation-3.04e-01

Pos logit contributions
 Bub+0.007
 wra+0.007
 Nib+0.007
endant+0.006
 mar+0.006
Neg logit contributions
zed-0.016
aster-0.015
gh-0.015
ze-0.015
 aids-0.015
Token Rachel
Token ID15984
Feature activation-7.84e-01

Pos logit contributions
zed+0.011
gh+0.011
aster+0.010
 aids+0.010
ainer+0.010
Neg logit contributions
 mar-0.006
endant-0.006
 wra-0.006
 Bub-0.006
 buried-0.006
Token.
Token ID13
Feature activation-3.91e+00

Pos logit contributions
zed+0.028
gh+0.027
 aids+0.026
ze+0.026
aster+0.026
Neg logit contributions
 Bub-0.017
 mar-0.016
 Nib-0.016
 wra-0.016
endant-0.015
Token She
Token ID1375
Feature activation+1.02e+00

Pos logit contributions
gh+0.140
ze+0.139
zed+0.139
aster+0.127
ainer+0.127
Neg logit contributions
 could-0.088
 Tim-0.081
 Bub-0.081
 were-0.081
 said-0.080
Token liked
Token ID8288
Feature activation-8.20e-03

Pos logit contributions
 Bub+0.018
 Nib+0.018
 wra+0.017
endant+0.017
 mar+0.017
Neg logit contributions
zed-0.042
gh-0.040
ze-0.039
 aids-0.038
aster-0.038
Token to
Token ID284
Feature activation-9.71e-01

Pos logit contributions
zed+0.000
ainer+0.000
gh+0.000
 aids+0.000
ze+0.000
Neg logit contributions
 mar-0.000
 Bub-0.000
 wra-0.000
 Nib-0.000
 H-0.000
Token spin
Token ID7906
Feature activation+5.36e-01

Pos logit contributions
zed+0.040
gh+0.039
 aids+0.038
ainer+0.038
ze+0.037
Neg logit contributions
 mar-0.016
 Bub-0.016
 Nib-0.014
 buried-0.013
 c-0.013
Token around
Token ID1088
Feature activation+4.69e-01

Pos logit contributions
 Bub+0.013
 mar+0.013
 Nib+0.012
 wra+0.012
 H+0.011
Neg logit contributions
zed-0.019
gh-0.018
ainer-0.018
 aids-0.018
aster-0.017
Token named
Token ID3706
Feature activation+2.71e-01

Pos logit contributions
 Bub+0.010
 wra+0.010
 Nib+0.010
endant+0.010
 mar+0.009
Neg logit contributions
zed-0.019
aster-0.018
ze-0.018
gh-0.018
ap-0.017
Token Rachel
Token ID15984
Feature activation-3.66e-01

Pos logit contributions
 mar+0.005
 wra+0.005
 Nib+0.005
endant+0.005
 ot+0.005
Neg logit contributions
zed-0.010
gh-0.009
aster-0.009
 aids-0.009
ainer-0.009
Token and
Token ID290
Feature activation+7.43e-01

Pos logit contributions
zed+0.012
gh+0.012
ze+0.011
aster+0.011
 aids+0.011
Neg logit contributions
 Bub-0.009
 Nib-0.009
 mar-0.009
 wra-0.009
endant-0.008
Token her
Token ID607
Feature activation+1.24e+00

Pos logit contributions
 wra+0.018
endant+0.018
 mar+0.018
 Bub+0.018
 Nib+0.017
Neg logit contributions
zed-0.024
gh-0.023
aster-0.023
ze-0.022
 aids-0.022
Token uncle
Token ID7711
Feature activation+9.41e-01

Pos logit contributions
 Bub+0.036
endant+0.036
 wra+0.036
 Nib+0.035
 since+0.035
Neg logit contributions
ze-0.031
zed-0.030
gh-0.029
ap-0.029
over-0.027
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
endant+0.026
 wra+0.025
lves+0.024
unn+0.023
rou+0.023
Neg logit contributions
gh-0.025
zed-0.024
ap-0.024
aster-0.023
ze-0.023
Token Rachel
Token ID15984
Feature activation+6.41e-01

Pos logit contributions
gh+0.158
ze+0.158
zed+0.155
 aids+0.145
aster+0.143
Neg logit contributions
 could-0.072
 Tim-0.071
 Bub-0.069
 were-0.068
 said-0.064
Token wanted
Token ID2227
Feature activation-1.38e-01

Pos logit contributions
 Bub+0.012
 Nib+0.012
 mar+0.011
 wra+0.011
endant+0.011
Neg logit contributions
zed-0.026
gh-0.024
ze-0.024
 aids-0.023
aster-0.023
Token to
Token ID284
Feature activation-5.31e-01

Pos logit contributions
zed+0.005
ainer+0.005
gh+0.005
 aids+0.005
ze+0.005
Neg logit contributions
 Bub-0.003
 mar-0.003
 Nib-0.003
 wra-0.003
 buried-0.003
Token walk
Token ID2513
Feature activation-3.32e-01

Pos logit contributions
zed+0.020
gh+0.019
 aids+0.019
ainer+0.018
ze+0.018
Neg logit contributions
 Bub-0.011
 mar-0.011
 Nib-0.010
 buried-0.010
 ot-0.010
Token somewhere
Token ID7382
Feature activation+1.05e+00

Pos logit contributions
zed+0.013
ainer+0.013
 aids+0.013
gh+0.013
ze+0.012
Neg logit contributions
 Bub-0.007
 mar-0.006
 Nib-0.006
 buried-0.006
 H-0.006
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
zed+0.037
ainer+0.035
 aids+0.034
aster+0.034
gh+0.034
Neg logit contributions
 wra-0.009
 Bub-0.009
 Nib-0.008
 mar-0.008
 H-0.007
Token He
Token ID679
Feature activation+5.99e-01

Pos logit contributions
zed+0.018
gh+0.018
ze+0.017
ainer+0.017
 aids+0.017
Neg logit contributions
 Bub-0.011
 Nib-0.010
 mar-0.010
 wra-0.010
endant-0.010
Token kept
Token ID4030
Feature activation-1.21e+00

Pos logit contributions
 Bub+0.015
unn+0.015
 Nib+0.015
endant+0.015
ocular+0.014
Neg logit contributions
zed-0.021
ze-0.019
gh-0.018
 aids-0.017
aster-0.017
Token throwing
Token ID9644
Feature activation+2.83e-02

Pos logit contributions
ainer+0.046
zed+0.046
gh+0.045
 aids+0.044
 paragraph+0.043
Neg logit contributions
 mar-0.025
 Nib-0.023
 wra-0.023
 Bub-0.023
 Tim-0.022
Token a
Token ID257
Feature activation+7.06e-01

Pos logit contributions
 Bub+0.001
 Nib+0.001
endant+0.001
 haven+0.001
 mar+0.001
Neg logit contributions
zed-0.001
gh-0.001
 aids-0.001
aster-0.001
ainer-0.001
Token tant
Token ID24246
Feature activation-3.90e+00

Pos logit contributions
 Bub+0.019
 Nib+0.019
 haven+0.018
endant+0.017
 H+0.017
Neg logit contributions
gh-0.020
 sentence-0.020
aster-0.020
 aids-0.020
ainer-0.020
Tokenrum
Token ID6582
Feature activation+5.41e-01

Pos logit contributions
zed+0.088
 paragraph+0.079
ainer+0.078
 aids+0.074
aster+0.074
Neg logit contributions
 Bub-0.128
unn-0.124
 mar-0.124
 Nib-0.120
 wra-0.118
Token.
Token ID13
Feature activation-5.20e-01

Pos logit contributions
 Bub+0.011
 mar+0.011
 Nib+0.011
endant+0.010
 wra+0.010
Neg logit contributions
zed-0.020
gh-0.020
ze-0.019
ainer-0.019
 aids-0.019
Token His
Token ID2399
Feature activation-2.08e-01

Pos logit contributions
zed+0.018
gh+0.017
 aids+0.017
ainer+0.017
ze+0.017
Neg logit contributions
 Bub-0.012
 mar-0.012
 Nib-0.012
 wra-0.011
endant-0.011
Token mom
Token ID1995
Feature activation+7.81e-01

Pos logit contributions
zed+0.006
gh+0.006
 aids+0.006
ainer+0.006
ze+0.006
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.005
endant-0.005
 wra-0.005
Token was
Token ID373
Feature activation-6.17e-03

Pos logit contributions
 Bub+0.020
endant+0.020
 Nib+0.020
unn+0.019
 microsc+0.018
Neg logit contributions
zed-0.027
ze-0.026
gh-0.025
aster-0.024
 aids-0.023
Token was
Token ID373
Feature activation+5.41e-01

Pos logit contributions
endant+0.003
 wra+0.003
unn+0.003
ocular+0.003
 mar+0.003
Neg logit contributions
zed-0.006
gh-0.006
aster-0.005
ze-0.005
 aids-0.005
Token a
Token ID257
Feature activation+2.22e-02

Pos logit contributions
endant+0.016
ocular+0.016
 Bub+0.016
 wra+0.016
 Nib+0.016
Neg logit contributions
zed-0.014
aster-0.014
gh-0.014
ze-0.013
ainer-0.013
Token girl
Token ID2576
Feature activation+7.72e-01

Pos logit contributions
endant+0.001
 Bub+0.000
ocular+0.000
 Nib+0.000
 wra+0.000
Neg logit contributions
aster-0.001
zed-0.001
gh-0.001
ze-0.001
ainer-0.001
Token called
Token ID1444
Feature activation-1.54e-01

Pos logit contributions
 Bub+0.013
 wra+0.013
 Nib+0.012
 mar+0.012
endant+0.012
Neg logit contributions
zed-0.031
aster-0.031
gh-0.030
ze-0.030
 aids-0.029
Token Rachel
Token ID15984
Feature activation-5.88e-01

Pos logit contributions
zed+0.005
aster+0.005
gh+0.005
 aids+0.005
ainer+0.005
Neg logit contributions
 mar-0.003
endant-0.003
 wra-0.003
 Bub-0.003
 Nib-0.003
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
zed+0.021
gh+0.020
ze+0.019
 aids+0.019
aster+0.019
Neg logit contributions
 Bub-0.013
 mar-0.013
 Nib-0.013
 wra-0.013
endant-0.012
Token Rachel
Token ID15984
Feature activation+1.02e+00

Pos logit contributions
gh+0.151
zed+0.151
ze+0.149
 aids+0.140
aster+0.139
Neg logit contributions
 could-0.077
 Bub-0.072
 haven-0.071
 Tim-0.071
 were-0.070
Token liked
Token ID8288
Feature activation+4.34e-01

Pos logit contributions
 Bub+0.018
 Nib+0.017
 mar+0.016
 wra+0.015
endant+0.015
Neg logit contributions
zed-0.043
gh-0.041
ze-0.040
 aids-0.039
aster-0.039
Token to
Token ID284
Feature activation-9.97e-01

Pos logit contributions
 mar+0.011
 Bub+0.010
 Nib+0.010
 wra+0.010
 H+0.009
Neg logit contributions
zed-0.015
ainer-0.015
gh-0.015
 aids-0.014
ze-0.014
Token write
Token ID3551
Feature activation-1.22e+00

Pos logit contributions
zed+0.041
gh+0.039
ainer+0.039
 aids+0.038
ze+0.037
Neg logit contributions
 mar-0.017
 Bub-0.017
 Nib-0.015
 buried-0.014
 c-0.014
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
zed+0.044
ainer+0.042
aster+0.042
gh+0.041
 aids+0.041
Neg logit contributions
 mar-0.026
 Bub-0.026
 Nib-0.025
 wra-0.025
 H-0.023
Token a
Token ID257
Feature activation+8.16e-02

Pos logit contributions
endant+0.019
ocular+0.019
 wra+0.019
 Bub+0.019
 Nib+0.018
Neg logit contributions
zed-0.016
aster-0.016
gh-0.016
ainer-0.015
ze-0.015
Token little
Token ID1310
Feature activation+8.39e-01

Pos logit contributions
endant+0.002
ocular+0.002
 Bub+0.002
 Nib+0.002
 wra+0.002
Neg logit contributions
aster-0.003
ze-0.002
zed-0.002
gh-0.002
ainer-0.002
Token girl
Token ID2576
Feature activation+5.84e-01

Pos logit contributions
endant+0.018
 Bub+0.018
 Nib+0.018
ocular+0.017
 haven+0.017
Neg logit contributions
aster-0.027
over-0.025
ap-0.025
ze-0.025
ainer-0.025
Token called
Token ID1444
Feature activation-2.91e-01

Pos logit contributions
 Bub+0.010
 wra+0.010
 Nib+0.009
endant+0.009
 mar+0.009
Neg logit contributions
zed-0.024
aster-0.023
gh-0.023
ze-0.023
 aids-0.022
Token Rachel
Token ID15984
Feature activation-6.06e-01

Pos logit contributions
zed+0.010
gh+0.010
aster+0.009
 aids+0.009
ainer+0.009
Neg logit contributions
endant-0.007
 mar-0.007
 wra-0.006
 Bub-0.006
 buried-0.006
Token.
Token ID13
Feature activation-3.89e+00

Pos logit contributions
zed+0.022
gh+0.021
ze+0.020
 aids+0.020
aster+0.020
Neg logit contributions
 Bub-0.013
 mar-0.013
 Nib-0.013
 wra-0.012
endant-0.012
Token Rachel
Token ID15984
Feature activation+9.22e-01

Pos logit contributions
gh+0.140
zed+0.139
ze+0.139
 aids+0.127
aster+0.126
Neg logit contributions
 could-0.087
 Tim-0.082
 Bub-0.081
 were-0.081
 haven-0.080
Token was
Token ID373
Feature activation+1.40e-02

Pos logit contributions
 Bub+0.017
 Nib+0.016
 mar+0.015
 wra+0.015
endant+0.014
Neg logit contributions
zed-0.038
gh-0.036
ze-0.035
 aids-0.035
aster-0.034
Token very
Token ID845
Feature activation+9.28e-01

Pos logit contributions
endant+0.000
 wra+0.000
 Bub+0.000
ocular+0.000
 mar+0.000
Neg logit contributions
zed-0.000
ze-0.000
gh-0.000
aster-0.000
 aids-0.000
Token restless
Token ID42537
Feature activation-3.58e-01

Pos logit contributions
 Bub+0.025
 wra+0.025
endant+0.024
 Nib+0.024
 mar+0.024
Neg logit contributions
zed-0.027
gh-0.026
aster-0.025
 aids-0.025
ze-0.025
Token everyday
Token ID10908
Feature activation-1.55e-01

Pos logit contributions
zed+0.013
gh+0.013
ze+0.012
 aids+0.012
ainer+0.012
Neg logit contributions
 Bub-0.007
 Nib-0.007
 mar-0.007
endant-0.007
 wra-0.007
Token stage
Token ID3800
Feature activation-1.02e+00

Pos logit contributions
gh+0.018
zed+0.018
 aids+0.017
aster+0.017
ze+0.017
Neg logit contributions
 Bub-0.013
 Nib-0.013
 mar-0.012
unn-0.012
endant-0.012
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
zed+0.043
gh+0.041
ainer+0.040
 aids+0.040
aster+0.040
Neg logit contributions
 mar-0.016
 Bub-0.016
 Nib-0.015
 wra-0.015
endant-0.014
Token They
Token ID1119
Feature activation-5.67e-01

Pos logit contributions
zed+0.006
ainer+0.006
 aids+0.006
gh+0.006
aster+0.006
Neg logit contributions
 Bub-0.004
 mar-0.004
 wra-0.004
 Nib-0.004
endant-0.004
Token use
Token ID779
Feature activation-1.77e+00

Pos logit contributions
zed+0.017
aster+0.016
 aids+0.016
ze+0.016
gh+0.016
Neg logit contributions
 Bub-0.015
 Nib-0.015
 mar-0.014
 wra-0.014
endant-0.014
Token cr
Token ID1067
Feature activation-2.30e+00

Pos logit contributions
zed+0.062
gh+0.060
ainer+0.058
 aids+0.057
ze+0.057
Neg logit contributions
 Bub-0.041
 mar-0.041
 Nib-0.039
 wra-0.039
endant-0.037
Tokenay
Token ID323
Feature activation-3.88e+00

Pos logit contributions
zed+0.072
gh+0.065
ze+0.064
aster+0.064
 aids+0.063
Neg logit contributions
endant-0.062
 Bub-0.062
 mar-0.062
ocular-0.059
 wra-0.059
Tokenons
Token ID684
Feature activation-1.66e+00

Pos logit contributions
zed+0.132
ainer+0.128
ze+0.122
 aids+0.122
 paragraph+0.121
Neg logit contributions
 mar-0.095
 Bub-0.087
 wra-0.085
endant-0.084
 said-0.081
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
zed+0.057
gh+0.055
 aids+0.053
ainer+0.053
aster+0.053
Neg logit contributions
 Bub-0.039
 Nib-0.037
 mar-0.037
 wra-0.036
endant-0.035
Token stickers
Token ID28568
Feature activation-8.47e-01

Pos logit contributions
zed+0.054
gh+0.052
aster+0.051
 aids+0.051
ainer+0.051
Neg logit contributions
 Bub-0.031
 Nib-0.029
 mar-0.028
 wra-0.027
endant-0.027
Token to
Token ID284
Feature activation-2.74e-01

Pos logit contributions
zed+0.031
gh+0.030
ainer+0.029
 aids+0.029
aster+0.029
Neg logit contributions
 Bub-0.018
 Nib-0.017
 mar-0.017
 wra-0.016
endant-0.016
Token decor
Token ID11705
Feature activation-5.13e-01

Pos logit contributions
zed+0.008
gh+0.008
aster+0.008
 aids+0.008
ainer+0.008
Neg logit contributions
 Bub-0.007
 Nib-0.007
 mar-0.007
 wra-0.007
endant-0.007
Token was
Token ID373
Feature activation+7.63e-01

Pos logit contributions
zed+0.023
gh+0.022
ze+0.019
 aids+0.019
aster+0.019
Neg logit contributions
endant-0.016
unn-0.016
ocular-0.015
 wra-0.015
 Nib-0.014
Token a
Token ID257
Feature activation+7.81e-02

Pos logit contributions
endant+0.023
ocular+0.023
 Bub+0.023
 wra+0.022
 Nib+0.022
Neg logit contributions
zed-0.020
aster-0.020
gh-0.019
ze-0.019
ainer-0.019
Token girl
Token ID2576
Feature activation+7.74e-01

Pos logit contributions
endant+0.002
ocular+0.002
 Bub+0.002
 Nib+0.002
 wra+0.002
Neg logit contributions
aster-0.002
ze-0.002
gh-0.002
zed-0.002
 sentence-0.002
Token named
Token ID3706
Feature activation+3.15e-01

Pos logit contributions
 Bub+0.015
 wra+0.014
 Nib+0.013
endant+0.013
 mar+0.013
Neg logit contributions
zed-0.030
aster-0.030
gh-0.029
ze-0.029
ap-0.028
Token Rachel
Token ID15984
Feature activation-4.15e-01

Pos logit contributions
 mar+0.007
endant+0.006
 wra+0.006
 Nib+0.006
 Bub+0.006
Neg logit contributions
zed-0.011
aster-0.011
gh-0.011
 aids-0.011
ainer-0.010
Token.
Token ID13
Feature activation-3.86e+00

Pos logit contributions
zed+0.014
gh+0.014
aster+0.013
ze+0.013
 aids+0.013
Neg logit contributions
 Bub-0.010
 mar-0.010
 Nib-0.010
 wra-0.009
endant-0.009
Token Rachel
Token ID15984
Feature activation+6.25e-01

Pos logit contributions
gh+0.143
ze+0.142
zed+0.142
 aids+0.131
aster+0.128
Neg logit contributions
 could-0.084
 were-0.078
 said-0.077
 Tim-0.076
 Bub-0.076
Token liked
Token ID8288
Feature activation+2.52e-01

Pos logit contributions
 Bub+0.011
 Nib+0.010
 mar+0.010
 wra+0.009
ocular+0.009
Neg logit contributions
zed-0.026
gh-0.025
ze-0.024
 aids-0.024
aster-0.024
Token pink
Token ID11398
Feature activation+8.22e-01

Pos logit contributions
 mar+0.006
 Bub+0.006
 Nib+0.006
 wra+0.006
 H+0.005
Neg logit contributions
zed-0.009
ainer-0.009
gh-0.009
 aids-0.008
ze-0.008
Token things
Token ID1243
Feature activation+1.96e+00

Pos logit contributions
 Bub+0.023
 Nib+0.022
 haven+0.022
endant+0.021
 wra+0.021
Neg logit contributions
zed-0.023
gh-0.023
ze-0.023
 aids-0.023
ainer-0.022
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
 Bub+0.043
 Nib+0.041
 mar+0.040
endant+0.039
 wra+0.039
Neg logit contributions
zed-0.069
gh-0.068
ze-0.064
 aids-0.064
aster-0.063
Token pasta
Token ID26296
Feature activation-1.30e+00

Pos logit contributions
zed+0.035
gh+0.034
 aids+0.033
ainer+0.033
aster+0.033
Neg logit contributions
 Bub-0.033
 Nib-0.032
 mar-0.032
 wra-0.031
endant-0.030
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
zed+0.052
gh+0.050
 aids+0.050
ainer+0.049
aster+0.049
Neg logit contributions
 Bub-0.023
 mar-0.023
 Nib-0.022
 wra-0.021
endant-0.020
Token She
Token ID1375
Feature activation-2.06e-01

Pos logit contributions
endant+0.010
 wra+0.010
 buried+0.010
 mar+0.010
 Nib+0.010
Neg logit contributions
zed-0.009
ainer-0.009
 aids-0.009
gh-0.008
ze-0.008
Token wrote
Token ID2630
Feature activation-1.73e+00

Pos logit contributions
zed+0.007
gh+0.006
ze+0.006
aster+0.006
 aids+0.006
Neg logit contributions
 Nib-0.005
 Bub-0.005
ocular-0.005
endant-0.005
unn-0.005
Token "
Token ID366
Feature activation-1.62e+00

Pos logit contributions
zed+0.066
ainer+0.064
aster+0.064
 paragraph+0.063
gh+0.062
Neg logit contributions
 Bub-0.033
 mar-0.032
 Nib-0.031
 wra-0.030
 could-0.030
Tokenpast
Token ID30119
Feature activation-3.82e+00

Pos logit contributions
zed+0.048
gh+0.045
 aids+0.044
ainer+0.044
ze+0.043
Neg logit contributions
 Bub-0.047
 mar-0.046
 Nib-0.045
 wra-0.043
endant-0.042
Tokena
Token ID64
Feature activation-3.07e+00

Pos logit contributions
zed+0.121
ainer+0.118
 aids+0.115
 paragraph+0.111
ze+0.110
Neg logit contributions
 mar-0.095
 Bub-0.093
 Nib-0.091
 could-0.090
 wra-0.087
Token"
Token ID1
Feature activation+5.59e-02

Pos logit contributions
zed+0.099
ainer+0.095
gh+0.094
ags+0.094
 aids+0.092
Neg logit contributions
 mar-0.076
 Bub-0.074
 Nib-0.071
 could-0.068
 le-0.067
Token on
Token ID319
Feature activation-1.81e+00

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.002
gh-0.002
 aids-0.002
ainer-0.002
ze-0.002
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
zed+0.071
gh+0.068
ainer+0.067
 aids+0.066
ze+0.065
Neg logit contributions
 mar-0.034
 Bub-0.031
 wra-0.031
 Nib-0.030
 le-0.028
Token page
Token ID2443
Feature activation-1.64e+00

Pos logit contributions
zed+0.037
ainer+0.036
ags+0.035
 aids+0.035
ap+0.033
Neg logit contributions
 mar-0.026
 wra-0.025
 Bub-0.024
 buried-0.023
 c-0.023
Token did
Token ID750
Feature activation-9.85e-01

Pos logit contributions
 Nib+0.004
 Bub+0.004
 wra+0.004
endant+0.004
 mar+0.004
Neg logit contributions
zed-0.007
ze-0.007
rac-0.007
ap-0.007
gh-0.007
Token not
Token ID407
Feature activation+9.10e-02

Pos logit contributions
zed+0.030
ainer+0.030
gh+0.030
 aids+0.029
aster+0.028
Neg logit contributions
 Bub-0.026
 mar-0.025
 Nib-0.025
 wra-0.024
 buried-0.023
Token know
Token ID760
Feature activation+1.27e-01

Pos logit contributions
 Bub+0.002
 Nib+0.002
 mar+0.002
 H+0.001
 ot+0.001
Neg logit contributions
zed-0.004
gh-0.004
ainer-0.004
 aids-0.003
ze-0.003
Token what
Token ID644
Feature activation-1.23e+00

Pos logit contributions
 Bub+0.003
 mar+0.003
endant+0.003
 Nib+0.003
 wra+0.003
Neg logit contributions
zed-0.004
gh-0.004
 aids-0.004
aster-0.004
ze-0.004
Token ost
Token ID23619
Feature activation-6.40e-01

Pos logit contributions
ainer+0.046
zed+0.045
gh+0.043
ze+0.042
 aids+0.042
Neg logit contributions
 Bub-0.026
 mar-0.025
 Nib-0.025
 could-0.025
 Ell-0.024
Tokenric
Token ID1173
Feature activation-3.79e+00

Pos logit contributions
zed+0.018
gh+0.017
 aids+0.016
ainer+0.016
aster+0.016
Neg logit contributions
 Bub-0.019
 mar-0.019
 Nib-0.019
 wra-0.018
endant-0.018
Tokenhes
Token ID956
Feature activation-1.43e+00

Pos logit contributions
ainer+0.090
gh+0.088
zed+0.086
 paragraph+0.080
 sentence+0.076
Neg logit contributions
 could-0.121
 Bub-0.115
 mar-0.115
 Nib-0.113
 said-0.112
Token were
Token ID547
Feature activation+7.56e-02

Pos logit contributions
zed+0.056
gh+0.054
ainer+0.053
 aids+0.051
ze+0.051
Neg logit contributions
 Bub-0.025
 mar-0.025
 Nib-0.025
 could-0.023
 wra-0.023
Token.
Token ID13
Feature activation+4.50e-01

Pos logit contributions
 Bub+0.002
 Nib+0.002
 mar+0.002
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.003
gh-0.003
 aids-0.003
ze-0.003
aster-0.003
Token They
Token ID1119
Feature activation-1.50e-01

Pos logit contributions
 Bub+0.009
 Nib+0.009
 mar+0.009
 wra+0.009
 Ell+0.008
Neg logit contributions
zed-0.017
gh-0.017
ze-0.016
ainer-0.016
aster-0.016
Token looked
Token ID3114
Feature activation-1.61e-01

Pos logit contributions
zed+0.005
ze+0.005
 aids+0.005
rac+0.005
gh+0.005
Neg logit contributions
 Nib-0.003
 Bub-0.003
endant-0.003
 mar-0.003
 wra-0.003
Token Rachel
Token ID15984
Feature activation-2.45e-01

Pos logit contributions
 mar+0.007
 wra+0.007
endant+0.007
 Nib+0.006
 Bub+0.006
Neg logit contributions
zed-0.013
aster-0.013
gh-0.012
 aids-0.012
ainer-0.012
Token who
Token ID508
Feature activation+8.19e-01

Pos logit contributions
zed+0.008
gh+0.008
aster+0.008
ze+0.008
 aids+0.008
Neg logit contributions
 Bub-0.006
 mar-0.006
 Nib-0.006
 wra-0.006
endant-0.006
Token wanted
Token ID2227
Feature activation+1.55e-01

Pos logit contributions
 Bub+0.017
endant+0.017
 Nib+0.016
ocular+0.016
 mar+0.016
Neg logit contributions
zed-0.031
gh-0.030
ze-0.029
aster-0.028
 aids-0.028
Token a
Token ID257
Feature activation+5.91e-01

Pos logit contributions
 mar+0.004
 Bub+0.004
 Nib+0.004
 wra+0.003
 H+0.003
Neg logit contributions
zed-0.005
ainer-0.005
gh-0.005
 aids-0.005
ze-0.005
Token peach
Token ID47565
Feature activation+7.61e-01

Pos logit contributions
 Bub+0.015
 Nib+0.015
 mar+0.014
 wra+0.014
endant+0.014
Neg logit contributions
zed-0.019
gh-0.018
 aids-0.018
aster-0.017
ze-0.017
Token.
Token ID13
Feature activation-3.76e+00

Pos logit contributions
 Bub+0.020
 Nib+0.018
 mar+0.017
endant+0.017
 haven+0.017
Neg logit contributions
gh-0.024
aster-0.023
zed-0.023
 sentence-0.023
ze-0.023
Token She
Token ID1375
Feature activation+7.52e-01

Pos logit contributions
zed+0.170
gh+0.170
ze+0.167
aster+0.159
 aids+0.157
Neg logit contributions
 could-0.056
 haven-0.046
 said-0.044
 were-0.044
 Tim-0.044
Token went
Token ID1816
Feature activation-1.84e-01

Pos logit contributions
 Bub+0.015
 Nib+0.014
endant+0.014
 mar+0.014
ocular+0.013
Neg logit contributions
zed-0.030
gh-0.028
ze-0.027
aster-0.027
 aids-0.026
Token to
Token ID284
Feature activation+9.06e-02

Pos logit contributions
zed+0.006
gh+0.006
 aids+0.005
ainer+0.005
ze+0.005
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
 wra-0.005
endant-0.004
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
 Bub+0.002
 wra+0.002
endant+0.002
 Nib+0.002
 mar+0.002
Neg logit contributions
zed-0.003
aster-0.003
gh-0.002
 aids-0.002
ainer-0.002
Token river
Token ID7850
Feature activation-1.42e-01

Pos logit contributions
 Bub+0.008
 mar+0.007
 Nib+0.007
endant+0.007
 wra+0.007
Neg logit contributions
zed-0.013
gh-0.013
aster-0.012
 aids-0.012
ze-0.012
Token a
Token ID257
Feature activation+4.37e-01

Pos logit contributions
ocular+0.031
endant+0.031
 wra+0.031
 Bub+0.031
 Nib+0.030
Neg logit contributions
aster-0.025
zed-0.025
gh-0.024
ze-0.024
ainer-0.024
Token little
Token ID1310
Feature activation+1.16e+00

Pos logit contributions
endant+0.010
ocular+0.010
 Bub+0.010
 wra+0.010
 Nib+0.010
Neg logit contributions
aster-0.014
zed-0.013
gh-0.013
ze-0.013
ainer-0.013
Token girl
Token ID2576
Feature activation+1.07e+00

Pos logit contributions
endant+0.026
 Bub+0.025
 wra+0.025
 Nib+0.025
ocular+0.025
Neg logit contributions
aster-0.036
ap-0.035
ze-0.034
over-0.034
ainer-0.034
Token named
Token ID3706
Feature activation+3.74e-01

Pos logit contributions
 Bub+0.020
 wra+0.020
endant+0.019
 Nib+0.019
 mar+0.018
Neg logit contributions
zed-0.041
aster-0.041
gh-0.040
ze-0.040
ap-0.039
Token Rachel
Token ID15984
Feature activation-1.17e-02

Pos logit contributions
 mar+0.008
 wra+0.007
endant+0.007
 Nib+0.007
 Bub+0.007
Neg logit contributions
zed-0.014
aster-0.013
gh-0.013
ainer-0.013
 aids-0.013
Token.
Token ID13
Feature activation-3.76e+00

Pos logit contributions
zed+0.000
gh+0.000
aster+0.000
ze+0.000
 aids+0.000
Neg logit contributions
 Bub-0.000
 mar-0.000
 Nib-0.000
 wra-0.000
endant-0.000
Token Rachel
Token ID15984
Feature activation+1.16e+00

Pos logit contributions
gh+0.152
ze+0.151
zed+0.151
 aids+0.142
aster+0.139
Neg logit contributions
 could-0.068
 Tim-0.063
 Bub-0.063
 were-0.062
 haven-0.061
Token loved
Token ID6151
Feature activation+9.39e-01

Pos logit contributions
 Bub+0.022
 Nib+0.021
 mar+0.020
 wra+0.019
endant+0.019
Neg logit contributions
zed-0.047
gh-0.045
ze-0.044
 aids-0.043
aster-0.043
Token to
Token ID284
Feature activation-7.38e-01

Pos logit contributions
 mar+0.023
 Bub+0.023
 Nib+0.023
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.031
gh-0.030
ainer-0.030
 aids-0.029
ze-0.028
Token draw
Token ID3197
Feature activation+1.78e-01

Pos logit contributions
zed+0.030
gh+0.029
ainer+0.029
 aids+0.028
ze+0.027
Neg logit contributions
 mar-0.013
 Bub-0.013
 Nib-0.012
 buried-0.011
 c-0.011
Token circles
Token ID13332
Feature activation-1.58e-01

Pos logit contributions
 Bub+0.004
 Nib+0.004
 mar+0.004
 wra+0.004
 haven+0.004
Neg logit contributions
zed-0.006
gh-0.006
 aids-0.006
ze-0.006
ainer-0.005
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
endant+0.036
ocular+0.035
ll+0.035
 haven+0.035
 wra+0.035
Neg logit contributions
aster-0.027
zed-0.026
gh-0.026
ze-0.025
ainer-0.025
Token little
Token ID1310
Feature activation+8.32e-01

Pos logit contributions
ze+0.000
aster+0.000
gh+0.000
zed+0.000
 sentence+0.000
Neg logit contributions
endant-0.000
ocular-0.000
 Bub-0.000
 Nib-0.000
 wra-0.000
Token girl
Token ID2576
Feature activation+5.52e-01

Pos logit contributions
endant+0.018
 Bub+0.018
 Nib+0.018
 haven+0.018
 wra+0.018
Neg logit contributions
aster-0.025
ze-0.025
over-0.024
ap-0.024
ainer-0.024
Token called
Token ID1444
Feature activation-1.75e-01

Pos logit contributions
 Bub+0.010
 wra+0.010
 Nib+0.010
 mar+0.009
endant+0.009
Neg logit contributions
zed-0.022
aster-0.021
ze-0.021
gh-0.021
 aids-0.020
Token Rachel
Token ID15984
Feature activation-6.12e-01

Pos logit contributions
zed+0.006
gh+0.006
aster+0.006
 aids+0.006
ainer+0.006
Neg logit contributions
endant-0.004
 mar-0.004
 wra-0.004
 Bub-0.003
 buried-0.003
Token.
Token ID13
Feature activation-3.75e+00

Pos logit contributions
zed+0.021
gh+0.020
ze+0.020
 aids+0.020
aster+0.020
Neg logit contributions
 Bub-0.014
 mar-0.014
 Nib-0.013
 wra-0.013
endant-0.013
Token Rachel
Token ID15984
Feature activation+6.69e-01

Pos logit contributions
gh+0.137
zed+0.136
ze+0.136
 aids+0.127
aster+0.126
Neg logit contributions
 could-0.082
 Bub-0.076
 Tim-0.075
 haven-0.074
 were-0.074
Token loved
Token ID6151
Feature activation+8.08e-01

Pos logit contributions
 Bub+0.012
 Nib+0.012
 mar+0.011
 wra+0.011
endant+0.010
Neg logit contributions
zed-0.028
gh-0.026
ze-0.026
 aids-0.025
aster-0.025
Token to
Token ID284
Feature activation-8.13e-01

Pos logit contributions
 mar+0.020
 Bub+0.020
 Nib+0.019
 wra+0.019
endant+0.018
Neg logit contributions
zed-0.027
gh-0.026
ainer-0.026
 aids-0.025
ze-0.025
Token ride
Token ID6594
Feature activation+3.82e-01

Pos logit contributions
zed+0.033
gh+0.032
ainer+0.032
 aids+0.032
ze+0.031
Neg logit contributions
 mar-0.014
 Bub-0.014
 Nib-0.013
 buried-0.012
 ot-0.011
Token her
Token ID607
Feature activation+2.47e+00

Pos logit contributions
 mar+0.009
 Bub+0.009
 Nib+0.009
 wra+0.008
 H+0.008
Neg logit contributions
zed-0.013
ainer-0.013
 aids-0.013
gh-0.013
aster-0.013
Token of
Token ID286
Feature activation+1.10e-01

Pos logit contributions
 Bub+0.003
 mar+0.003
 Nib+0.003
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.006
gh-0.006
 aids-0.006
ainer-0.006
aster-0.006
Token the
Token ID262
Feature activation+7.64e-01

Pos logit contributions
 Bub+0.003
 mar+0.003
 Nib+0.003
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.003
gh-0.003
 aids-0.003
aster-0.003
ainer-0.003
Token tree
Token ID5509
Feature activation+1.02e+00

Pos logit contributions
 Bub+0.020
 mar+0.019
 Nib+0.019
endant+0.019
 wra+0.018
Neg logit contributions
zed-0.023
ze-0.022
aster-0.022
gh-0.022
 aids-0.022
Token nearest
Token ID16936
Feature activation+1.13e+00

Pos logit contributions
 Bub+0.024
 Nib+0.023
 mar+0.023
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.035
gh-0.034
ze-0.033
 aids-0.033
aster-0.032
Token to
Token ID284
Feature activation+4.60e-01

Pos logit contributions
 Bub+0.027
 mar+0.026
 Nib+0.026
 wra+0.025
endant+0.024
Neg logit contributions
zed-0.039
gh-0.037
ainer-0.036
 aids-0.036
aster-0.035
Token her
Token ID607
Feature activation+2.52e+00

Pos logit contributions
 mar+0.012
 Bub+0.012
 wra+0.012
endant+0.012
 Nib+0.012
Neg logit contributions
zed-0.014
aster-0.014
gh-0.013
 aids-0.013
ainer-0.013
Token and
Token ID290
Feature activation-3.05e-01

Pos logit contributions
 Bub+0.074
 wra+0.072
 Nib+0.069
unn+0.067
 mar+0.067
Neg logit contributions
ze-0.070
zed-0.070
gh-0.066
ap-0.064
 aids-0.064
Token began
Token ID2540
Feature activation-9.59e-01

Pos logit contributions
zed+0.012
gh+0.012
 aids+0.011
ze+0.011
aster+0.011
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
 wra-0.005
endant-0.005
Token to
Token ID284
Feature activation+2.06e-02

Pos logit contributions
ainer+0.040
gh+0.038
zed+0.037
aster+0.037
 aids+0.036
Neg logit contributions
 mar-0.017
 Bub-0.016
 lost-0.016
 sc-0.015
 Nib-0.015
Token sing
Token ID1702
Feature activation-3.80e-01

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.000
 buried+0.000
Neg logit contributions
zed-0.001
gh-0.001
ainer-0.001
 aids-0.001
aster-0.001
Token.
Token ID13
Feature activation+6.64e-01

Pos logit contributions
zed+0.014
aster+0.014
gh+0.014
ainer+0.014
 aids+0.013
Neg logit contributions
 Bub-0.008
 mar-0.007
 Nib-0.007
 H-0.007
 Ell-0.007
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
 Bub+0.000
 mar+0.000
 Nib+0.000
 wra+0.000
endant+0.000
Neg logit contributions
zed-0.000
gh-0.000
 aids-0.000
ainer-0.000
ze-0.000
Token get
Token ID651
Feature activation-6.35e-01

Pos logit contributions
zed+0.040
gh+0.039
 aids+0.038
ainer+0.038
ze+0.036
Neg logit contributions
 Bub-0.026
 mar-0.026
 Nib-0.025
 buried-0.024
 wra-0.024
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
zed+0.023
gh+0.023
ainer+0.023
 aids+0.022
 paragraph+0.021
Neg logit contributions
 Bub-0.014
 mar-0.014
 Nib-0.014
 wra-0.013
 Ell-0.013
Token cookies
Token ID14746
Feature activation-4.14e-01

Pos logit contributions
zed+0.051
gh+0.049
 aids+0.047
ainer+0.046
ze+0.046
Neg logit contributions
 wra-0.030
 mar-0.029
 Nib-0.026
 le-0.026
 Bub-0.026
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
zed+0.016
gh+0.016
 aids+0.016
ainer+0.015
ze+0.015
Neg logit contributions
 Bub-0.008
 mar-0.008
 Nib-0.007
 wra-0.007
 buried-0.007
Token Can
Token ID1680
Feature activation+2.35e+00

Pos logit contributions
 mar+0.041
 wra+0.041
 buried+0.040
 Nib+0.039
endant+0.038
Neg logit contributions
 aids-0.031
ainer-0.031
zed-0.030
rac-0.029
ap-0.029
Token you
Token ID345
Feature activation+1.11e+00

Pos logit contributions
 mar+0.056
 Bub+0.055
 wra+0.054
endant+0.054
 Nib+0.054
Neg logit contributions
zed-0.079
gh-0.075
aster-0.074
 aids-0.071
ainer-0.070
Token please
Token ID3387
Feature activation+1.23e+00

Pos logit contributions
 Bub+0.029
 mar+0.028
 Nib+0.028
 wra+0.027
endant+0.027
Neg logit contributions
zed-0.035
gh-0.034
ainer-0.033
 aids-0.033
aster-0.033
Token give
Token ID1577
Feature activation-1.83e-01

Pos logit contributions
 Bub+0.032
 mar+0.031
 Nib+0.031
 wra+0.030
endant+0.030
Neg logit contributions
zed-0.039
gh-0.038
aster-0.036
ainer-0.036
 aids-0.036
Token me
Token ID502
Feature activation-1.54e+00

Pos logit contributions
zed+0.006
gh+0.005
 aids+0.005
ainer+0.005
ze+0.005
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
 wra-0.005
endant-0.005
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
zed+0.062
gh+0.061
 aids+0.059
 paragraph+0.058
ainer+0.057
Neg logit contributions
 Bub-0.029
 mar-0.029
 Nib-0.028
 buried-0.026
 le-0.025
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
 mar+0.043
endant+0.043
 wra+0.043
 Bub+0.041
 Nib+0.040
Neg logit contributions
zed-0.063
gh-0.059
ze-0.055
ainer-0.055
 aids-0.054
TokenThe
Token ID464
Feature activation+3.39e-01

Pos logit contributions
 Bub+0.068
 mar+0.066
 Nib+0.065
 wra+0.064
endant+0.063
Neg logit contributions
zed-0.066
gh-0.063
 aids-0.060
ainer-0.060
ze-0.059
Token boy
Token ID2933
Feature activation-7.55e-02

Pos logit contributions
iw+0.010
 Nib+0.009
endant+0.009
unn+0.009
 haven+0.009
Neg logit contributions
ze-0.009
 aids-0.009
 sentence-0.009
zed-0.009
gh-0.009
Token p
Token ID279
Feature activation+1.22e+00

Pos logit contributions
zed+0.003
ze+0.003
 aids+0.003
gh+0.003
aster+0.003
Neg logit contributions
 Bub-0.001
 wra-0.001
 mar-0.001
 Nib-0.001
endant-0.001
Tokenatted
Token ID16898
Feature activation+2.43e+00

Pos logit contributions
 Bub+0.036
 Nib+0.035
 mar+0.034
 wra+0.034
 haven+0.032
Neg logit contributions
zed-0.034
gh-0.034
 aids-0.032
ze-0.031
ainer-0.030
Token F
Token ID376
Feature activation+3.11e+00

Pos logit contributions
endant+0.041
 mar+0.041
 Bub+0.040
 wra+0.039
 Nib+0.038
Neg logit contributions
zed-0.098
gh-0.094
ze-0.091
ainer-0.090
 aids-0.090
Tokenuzz
Token ID4715
Feature activation-7.64e-01

Pos logit contributions
 wra+0.101
 mar+0.096
 Bub+0.091
 jewels+0.091
 buried+0.090
Neg logit contributions
gh-0.078
zed-0.078
over-0.068
 aids-0.065
ze-0.065
Token's
Token ID338
Feature activation+3.71e-01

Pos logit contributions
zed+0.025
gh+0.025
ainer+0.025
 aids+0.024
aster+0.024
Neg logit contributions
 Bub-0.018
 mar-0.018
 wra-0.018
 Nib-0.018
endant-0.017
Token head
Token ID1182
Feature activation-2.73e-02

Pos logit contributions
 Bub+0.011
 mar+0.011
 Nib+0.011
 wra+0.011
endant+0.010
Neg logit contributions
zed-0.010
gh-0.010
aster-0.009
 aids-0.009
ainer-0.009
Token and
Token ID290
Feature activation-6.15e-03

Pos logit contributions
zed+0.001
gh+0.001
 aids+0.001
ainer+0.001
aster+0.001
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.000
endant-0.000
Token said
Token ID531
Feature activation-3.17e-01

Pos logit contributions
zed+0.000
ze+0.000
gh+0.000
 aids+0.000
aster+0.000
Neg logit contributions
endant-0.000
 Bub-0.000
unn-0.000
 mar-0.000
 wra-0.000
Token
Token ID220
Feature activation+1.37e+00

Pos logit contributions
 mar+0.018
 Bub+0.017
 wra+0.017
 Nib+0.017
 buried+0.017
Neg logit contributions
zed-0.023
 aids-0.021
ainer-0.021
gh-0.021
aster-0.020
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
 Bub+0.025
 Nib+0.024
 mar+0.024
 wra+0.022
endant+0.022
Neg logit contributions
zed-0.054
gh-0.053
 aids-0.051
ainer-0.051
aster-0.051
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
 Bub+0.008
 mar+0.008
 Nib+0.008
 wra+0.007
endant+0.007
Neg logit contributions
zed-0.016
gh-0.015
 aids-0.015
ainer-0.015
aster-0.014
TokenWhen
Token ID2215
Feature activation+5.96e-01

Pos logit contributions
 Bub+0.009
 mar+0.008
 Nib+0.008
 wra+0.008
 Ell+0.008
Neg logit contributions
zed-0.010
gh-0.010
 aids-0.009
ainer-0.009
aster-0.009
Token the
Token ID262
Feature activation+8.85e-01

Pos logit contributions
 Bub+0.013
 mar+0.013
 Nib+0.013
endant+0.013
 wra+0.012
Neg logit contributions
zed-0.022
gh-0.020
aster-0.020
 aids-0.020
ainer-0.020
Token k
Token ID479
Feature activation+2.63e+00

Pos logit contributions
 Bub+0.026
 Nib+0.026
 mar+0.024
endant+0.024
 wra+0.024
Neg logit contributions
zed-0.025
gh-0.024
ze-0.024
 aids-0.023
ainer-0.023
Tokenite
Token ID578
Feature activation-5.61e-01

Pos logit contributions
 Bub+0.068
 Nib+0.065
 wra+0.063
 mar+0.062
 buried+0.058
Neg logit contributions
zed-0.079
aster-0.077
 aids-0.077
gh-0.077
ainer-0.077
Token was
Token ID373
Feature activation+2.79e-01

Pos logit contributions
zed+0.021
gh+0.020
ze+0.019
aster+0.019
 aids+0.019
Neg logit contributions
 Bub-0.012
 Nib-0.011
 mar-0.011
 wra-0.011
endant-0.011
Token close
Token ID1969
Feature activation+1.70e-03

Pos logit contributions
 Bub+0.006
 buried+0.006
 Nib+0.006
 mar+0.006
 locked+0.005
Neg logit contributions
zed-0.010
 aids-0.010
ainer-0.010
gh-0.010
ze-0.010
Token by
Token ID416
Feature activation+9.00e-01

Pos logit contributions
 Bub+0.000
 mar+0.000
 Nib+0.000
 wra+0.000
endant+0.000
Neg logit contributions
zed-0.000
gh-0.000
 aids-0.000
ainer-0.000
aster-0.000
Token,
Token ID11
Feature activation+5.17e-01

Pos logit contributions
 Bub+0.018
 mar+0.017
 Nib+0.017
 wra+0.016
endant+0.016
Neg logit contributions
zed-0.035
gh-0.033
 aids-0.033
ainer-0.033
ze-0.032
Token the
Token ID262
Feature activation-4.00e-01

Pos logit contributions
endant+0.059
 haven+0.055
unn+0.055
 wra+0.054
ll+0.052
Neg logit contributions
zed-0.062
gh-0.059
ze-0.055
aster-0.054
ainer-0.053
Token log
Token ID2604
Feature activation-2.49e-01

Pos logit contributions
zed+0.015
 aids+0.014
ainer+0.014
gh+0.014
aster+0.014
Neg logit contributions
 Bub-0.009
 mar-0.008
 Nib-0.008
 wra-0.008
 buried-0.008
Token and
Token ID290
Feature activation+1.57e-01

Pos logit contributions
zed+0.010
gh+0.010
 aids+0.010
ainer+0.010
aster+0.010
Neg logit contributions
 Bub-0.004
 mar-0.004
 Nib-0.004
 wra-0.004
endant-0.003
Token then
Token ID788
Feature activation-2.06e-02

Pos logit contributions
endant+0.004
unn+0.004
 haven+0.004
ocular+0.004
 wra+0.004
Neg logit contributions
zed-0.005
aster-0.005
ze-0.004
gh-0.004
ainer-0.004
Token jumped
Token ID11687
Feature activation+1.45e-01

Pos logit contributions
zed+0.001
gh+0.001
ze+0.001
aster+0.001
 aids+0.001
Neg logit contributions
endant-0.000
 Bub-0.000
 Nib-0.000
 wra-0.000
 mar-0.000
Token onto
Token ID4291
Feature activation+2.91e+00

Pos logit contributions
 Bub+0.004
 mar+0.004
 Nib+0.004
 wra+0.003
 buried+0.003
Neg logit contributions
zed-0.005
 aids-0.005
gh-0.005
aster-0.005
ainer-0.004
Token a
Token ID257
Feature activation+7.30e-01

Pos logit contributions
endant+0.076
 wra+0.068
unn+0.068
ll+0.068
ocular+0.065
Neg logit contributions
zed-0.098
aster-0.090
gh-0.089
ze-0.089
ainer-0.087
Token nearby
Token ID6716
Feature activation+1.24e+00

Pos logit contributions
 Bub+0.017
 Nib+0.017
 mar+0.017
 wra+0.016
endant+0.016
Neg logit contributions
zed-0.025
gh-0.024
ainer-0.023
 aids-0.023
ze-0.023
Token tree
Token ID5509
Feature activation+1.09e+00

Pos logit contributions
 Bub+0.024
 mar+0.023
 Nib+0.023
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.048
gh-0.046
 aids-0.045
ainer-0.045
ze-0.044
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 Bub+0.022
 mar+0.021
 Nib+0.021
 wra+0.020
endant+0.020
Neg logit contributions
zed-0.041
gh-0.039
 aids-0.038
ainer-0.038
ze-0.038
Token The
Token ID383
Feature activation-4.60e-01

Pos logit contributions
 wra+0.056
endant+0.055
 mar+0.052
 buried+0.050
 Nib+0.049
Neg logit contributions
zed-0.081
ainer-0.074
gh-0.073
ze-0.072
 aids-0.069
Token.
Token ID13
Feature activation-8.96e-01

Pos logit contributions
zed+0.012
gh+0.012
 aids+0.012
ainer+0.012
aster+0.011
Neg logit contributions
 Bub-0.007
 mar-0.007
 Nib-0.007
endant-0.007
 wra-0.007
Token He
Token ID679
Feature activation+1.10e+00

Pos logit contributions
zed+0.031
gh+0.029
 aids+0.029
ainer+0.028
aster+0.028
Neg logit contributions
 Bub-0.021
 mar-0.021
 Nib-0.021
 wra-0.020
endant-0.020
Token says
Token ID1139
Feature activation+2.22e-01

Pos logit contributions
 Bub+0.028
 haven+0.027
ocular+0.026
 mar+0.025
 wra+0.025
Neg logit contributions
zed-0.034
ze-0.033
 aids-0.031
 sentence-0.030
aster-0.029
Token,
Token ID11
Feature activation-9.57e-01

Pos logit contributions
 Bub+0.003
 Nib+0.003
 mar+0.003
 Ell+0.003
 wra+0.002
Neg logit contributions
zed-0.010
ainer-0.010
aster-0.010
 aids-0.010
 sentence-0.009
Token "
Token ID366
Feature activation+6.77e-01

Pos logit contributions
gh+0.034
zed+0.034
ainer+0.033
 aids+0.033
aster+0.032
Neg logit contributions
 Bub-0.022
 Nib-0.021
 mar-0.020
 wra-0.019
 Ell-0.018
TokenYou
Token ID1639
Feature activation+1.00e+00

Pos logit contributions
 Bub+0.015
 mar+0.015
 Nib+0.015
 wra+0.014
endant+0.013
Neg logit contributions
zed-0.024
gh-0.023
 aids-0.023
ainer-0.022
aster-0.022
Token're
Token ID821
Feature activation+5.43e-01

Pos logit contributions
 Bub+0.019
 Nib+0.018
 wra+0.018
endant+0.018
 mar+0.018
Neg logit contributions
zed-0.038
gh-0.037
ze-0.036
 aids-0.036
ainer-0.035
Token welcome
Token ID7062
Feature activation-6.31e-02

Pos logit contributions
 Bub+0.015
 Nib+0.015
 mar+0.014
 wra+0.014
endant+0.013
Neg logit contributions
zed-0.017
gh-0.016
 aids-0.016
ainer-0.016
aster-0.015
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
zed+0.003
 aids+0.003
aster+0.003
ainer+0.003
gh+0.003
Neg logit contributions
 Bub-0.001
 Nib-0.001
 mar-0.001
 Ell-0.001
 H-0.001
Token Mia
Token ID32189
Feature activation-1.89e-01

Pos logit contributions
 aids+0.009
 sentence+0.009
zed+0.009
aster+0.009
ainer+0.008
Neg logit contributions
endant-0.009
 wra-0.009
 mar-0.009
 Bub-0.009
ocular-0.009
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
zed+0.007
gh+0.007
ze+0.006
ainer+0.006
 aids+0.006
Neg logit contributions
 Bub-0.004
 mar-0.004
 Nib-0.004
endant-0.004
 wra-0.004
Token.
Token ID13
Feature activation-4.62e-02

Pos logit contributions
zed+0.032
gh+0.031
 aids+0.030
ainer+0.030
aster+0.030
Neg logit contributions
 Bub-0.012
 Nib-0.011
 mar-0.011
 wra-0.010
endant-0.010
Token
Token ID198
Feature activation+3.18e-01

Pos logit contributions
zed+0.002
gh+0.002
 aids+0.002
ainer+0.002
ze+0.002
Neg logit contributions
 Bub-0.001
 Nib-0.001
 mar-0.001
 wra-0.001
 H-0.001
Token
Token ID198
Feature activation+5.71e-01

Pos logit contributions
 mar+0.008
 wra+0.008
 Bub+0.008
endant+0.008
 Nib+0.007
Neg logit contributions
zed-0.010
gh-0.010
aster-0.010
ainer-0.010
 aids-0.010
Token"
Token ID1
Feature activation+8.83e-01

Pos logit contributions
 Bub+0.018
 mar+0.018
 Nib+0.018
 wra+0.017
endant+0.017
Neg logit contributions
zed-0.015
gh-0.014
 aids-0.014
ainer-0.013
aster-0.013
TokenGo
Token ID5247
Feature activation+3.78e-01

Pos logit contributions
 Bub+0.023
 mar+0.023
 Nib+0.022
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.028
gh-0.027
 aids-0.027
ainer-0.027
aster-0.026
Token away
Token ID1497
Feature activation+8.59e-01

Pos logit contributions
 Bub+0.012
 mar+0.012
 Nib+0.012
 wra+0.011
 buried+0.011
Neg logit contributions
zed-0.010
 aids-0.010
gh-0.009
ainer-0.009
ze-0.009
Token,
Token ID11
Feature activation-1.82e-01

Pos logit contributions
 Bub+0.019
 Nib+0.018
 mar+0.018
endant+0.018
 wra+0.018
Neg logit contributions
zed-0.030
gh-0.029
ainer-0.029
ze-0.028
aster-0.028
Token Tim
Token ID5045
Feature activation+1.33e-01

Pos logit contributions
aster+0.006
zed+0.006
ze+0.006
gh+0.005
 aids+0.005
Neg logit contributions
endant-0.004
 mar-0.004
unn-0.004
 wra-0.004
 Nib-0.004
Token!
Token ID0
Feature activation+9.73e-01

Pos logit contributions
 Bub+0.003
 mar+0.003
 Nib+0.003
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.005
gh-0.005
ainer-0.005
 aids-0.005
ze-0.005
Token You
Token ID921
Feature activation+1.36e+00

Pos logit contributions
 Bub+0.024
 Nib+0.023
 mar+0.023
 wra+0.022
endant+0.021
Neg logit contributions
zed-0.033
gh-0.032
 aids-0.031
ainer-0.031
ze-0.030
Token are
Token ID389
Feature activation+5.94e-01

Pos logit contributions
 Nib+0.034
 Bub+0.033
endant+0.032
 wra+0.031
unn+0.030
Neg logit contributions
ze-0.041
gh-0.040
zed-0.040
aster-0.038
 aids-0.038
Token called
Token ID1444
Feature activation-3.91e-01

Pos logit contributions
 Bub+0.008
 wra+0.007
 Nib+0.007
 mar+0.007
endant+0.007
Neg logit contributions
zed-0.022
aster-0.021
gh-0.021
ze-0.021
 aids-0.020
Token Emily
Token ID17608
Feature activation-1.84e-01

Pos logit contributions
zed+0.013
gh+0.013
aster+0.013
 aids+0.013
ainer+0.012
Neg logit contributions
 mar-0.009
endant-0.009
 wra-0.009
 Bub-0.008
 Nib-0.008
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
zed+0.006
gh+0.006
ze+0.006
aster+0.005
 aids+0.005
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
endant-0.004
 wra-0.004
Token Emily
Token ID17608
Feature activation+8.97e-01

Pos logit contributions
zed+0.069
gh+0.067
 aids+0.064
ze+0.064
aster+0.064
Neg logit contributions
 Bub-0.040
 mar-0.038
 Nib-0.037
 wra-0.035
 H-0.035
Token had
Token ID550
Feature activation-6.71e-01

Pos logit contributions
 Bub+0.015
 Nib+0.015
 mar+0.014
 wra+0.014
endant+0.014
Neg logit contributions
zed-0.037
gh-0.036
ze-0.035
 aids-0.035
aster-0.034
Token a
Token ID257
Feature activation+6.38e-01

Pos logit contributions
zed+0.022
gh+0.021
 aids+0.021
ze+0.020
ainer+0.020
Neg logit contributions
 Bub-0.017
 Nib-0.016
 mar-0.016
 wra-0.016
endant-0.015
Token very
Token ID845
Feature activation+3.23e-01

Pos logit contributions
 Bub+0.016
 Nib+0.015
 mar+0.015
 wra+0.015
endant+0.015
Neg logit contributions
gh-0.020
zed-0.020
ze-0.019
 aids-0.019
aster-0.019
Token special
Token ID2041
Feature activation+1.72e-01

Pos logit contributions
 Bub+0.008
 mar+0.007
 Nib+0.007
 wra+0.007
endant+0.007
Neg logit contributions
zed-0.011
gh-0.011
 aids-0.010
ainer-0.010
aster-0.010
Token dream
Token ID4320
Feature activation-1.19e+00

Pos logit contributions
 Bub+0.004
 Nib+0.004
 mar+0.004
 wra+0.004
endant+0.004
Neg logit contributions
zed-0.005
gh-0.005
 aids-0.005
ze-0.005
ainer-0.005
Token -
Token ID532
Feature activation+2.87e-01

Pos logit contributions
zed+0.048
ainer+0.048
gh+0.047
 aids+0.047
aster+0.046
Neg logit contributions
 Bub-0.018
 mar-0.018
 Nib-0.017
 wra-0.017
 could-0.016
Token to
Token ID284
Feature activation-1.87e-01

Pos logit contributions
endant+0.007
unn+0.007
 wra+0.007
 mar+0.007
 Bub+0.007
Neg logit contributions
zed-0.009
gh-0.008
aster-0.008
 aids-0.008
ze-0.008
Token"
Token ID1
Feature activation+9.85e-02

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.003
gh-0.002
 aids-0.002
ainer-0.002
aster-0.002
TokenTim
Token ID14967
Feature activation-6.05e-01

Pos logit contributions
 Bub+0.002
 mar+0.002
 wra+0.002
 Nib+0.002
 H+0.002
Neg logit contributions
zed-0.004
gh-0.003
 aids-0.003
ainer-0.003
ze-0.003
Token,"
Token ID553
Feature activation-2.74e-01

Pos logit contributions
zed+0.024
gh+0.023
 aids+0.023
ainer+0.023
aster+0.022
Neg logit contributions
 Bub-0.011
 mar-0.011
 Nib-0.010
 wra-0.010
endant-0.010
Token said
Token ID531
Feature activation+4.62e-01

Pos logit contributions
zed+0.011
gh+0.011
aster+0.010
 aids+0.010
ainer+0.010
Neg logit contributions
 Bub-0.005
 Nib-0.005
 mar-0.005
endant-0.004
 wra-0.004
Token his
Token ID465
Feature activation+5.96e-01

Pos logit contributions
 Bub+0.007
 Tim+0.006
 Nib+0.006
 mar+0.006
 Ell+0.005
Neg logit contributions
zed-0.021
gh-0.021
 aids-0.020
ainer-0.020
ze-0.020
Token mom
Token ID1995
Feature activation+5.72e-01

Pos logit contributions
 Bub+0.015
 Nib+0.015
 mar+0.014
 wra+0.014
endant+0.014
Neg logit contributions
zed-0.020
gh-0.019
 aids-0.018
ainer-0.018
ze-0.018
Token,
Token ID11
Feature activation-7.44e-01

Pos logit contributions
 mar+0.014
 Bub+0.014
 Nib+0.013
 wra+0.013
endant+0.013
Neg logit contributions
zed-0.019
gh-0.019
ze-0.019
 aids-0.018
aster-0.017
Token "
Token ID366
Feature activation-1.12e+00

Pos logit contributions
zed+0.026
gh+0.025
ainer+0.025
 aids+0.025
aster+0.024
Neg logit contributions
 Bub-0.017
 mar-0.016
 Nib-0.016
 wra-0.016
endant-0.015
Tokenyou
Token ID5832
Feature activation+5.23e-01

Pos logit contributions
zed+0.039
gh+0.038
 aids+0.036
ainer+0.036
 sentence+0.035
Neg logit contributions
 mar-0.025
 Bub-0.024
 wra-0.024
 could-0.023
 Nib-0.023
Token need
Token ID761
Feature activation-6.29e-01

Pos logit contributions
 Bub+0.011
 Nib+0.011
 mar+0.010
 wra+0.010
endant+0.010
Neg logit contributions
zed-0.019
ze-0.018
gh-0.018
 aids-0.018
aster-0.018
Token to
Token ID284
Feature activation-3.41e-01

Pos logit contributions
zed+0.022
ainer+0.021
gh+0.021
 aids+0.020
ze+0.020
Neg logit contributions
 Bub-0.014
 mar-0.014
 Nib-0.014
 wra-0.013
 buried-0.013
Token fun
Token ID1257
Feature activation-8.93e-01

Pos logit contributions
 Bub+0.002
 Nib+0.002
 mar+0.002
 wra+0.002
endant+0.001
Neg logit contributions
zed-0.002
gh-0.002
 aids-0.002
ainer-0.002
aster-0.002
Token!"
Token ID2474
Feature activation+1.03e-01

Pos logit contributions
zed+0.033
gh+0.032
 aids+0.032
ainer+0.031
aster+0.031
Neg logit contributions
 Bub-0.019
 mar-0.018
 Nib-0.018
 wra-0.017
endant-0.017
Token L
Token ID406
Feature activation+7.87e-01

Pos logit contributions
endant+0.001
 wra+0.001
ll+0.001
 buried+0.001
 haven+0.001
Neg logit contributions
aster-0.004
zed-0.004
 aids-0.004
ainer-0.004
gh-0.004
Tokenila
Token ID10102
Feature activation-3.05e-01

Pos logit contributions
 Bub+0.021
 mar+0.020
 Nib+0.020
 wra+0.020
endant+0.019
Neg logit contributions
zed-0.025
gh-0.024
aster-0.023
ainer-0.023
 aids-0.023
Token said
Token ID531
Feature activation-1.17e-01

Pos logit contributions
zed+0.013
ze+0.013
 aids+0.013
gh+0.013
aster+0.013
Neg logit contributions
 Bub-0.004
 Nib-0.004
endant-0.004
 mar-0.003
 wra-0.003
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
zed+0.005
gh+0.005
 aids+0.005
aster+0.005
ainer+0.005
Neg logit contributions
 Bub-0.002
 Nib-0.002
 mar-0.001
 wra-0.001
 Ell-0.001
Token She
Token ID1375
Feature activation+4.76e-01

Pos logit contributions
 Bub+0.005
 Nib+0.005
 mar+0.004
 wra+0.004
endant+0.004
Neg logit contributions
zed-0.008
gh-0.008
 aids-0.008
ainer-0.008
aster-0.007
Token spun
Token ID26843
Feature activation-2.59e-01

Pos logit contributions
 Bub+0.010
 Nib+0.010
 mar+0.009
endant+0.009
 wra+0.009
Neg logit contributions
zed-0.018
ze-0.017
gh-0.017
 aids-0.016
ainer-0.016
Token around
Token ID1088
Feature activation+7.34e-01

Pos logit contributions
zed+0.010
gh+0.010
 aids+0.010
ainer+0.010
aster+0.010
Neg logit contributions
 Bub-0.006
 mar-0.005
 Nib-0.005
 Ell-0.005
 H-0.005
Token and
Token ID290
Feature activation-6.46e-01

Pos logit contributions
 Bub+0.016
endant+0.016
 Nib+0.016
 mar+0.016
 wra+0.015
Neg logit contributions
zed-0.026
gh-0.025
ze-0.024
aster-0.024
 aids-0.024
Token hugged
Token ID41130
Feature activation+8.50e-01

Pos logit contributions
zed+0.025
gh+0.024
ainer+0.024
 aids+0.024
aster+0.023
Neg logit contributions
 Bub-0.013
 mar-0.013
 Nib-0.012
 wra-0.011
 buried-0.011
Token he
Token ID339
Feature activation-3.71e-01

Pos logit contributions
zed+0.028
gh+0.028
 aids+0.027
ainer+0.027
ze+0.026
Neg logit contributions
 Bub-0.014
 mar-0.013
 Nib-0.013
 wra-0.013
endant-0.012
Token asked
Token ID1965
Feature activation-7.72e-01

Pos logit contributions
zed+0.013
ze+0.012
gh+0.012
 aids+0.011
aster+0.011
Neg logit contributions
endant-0.009
 mar-0.008
 wra-0.008
 Nib-0.008
 haven-0.008
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
zed+0.026
gh+0.025
 aids+0.024
ainer+0.024
aster+0.024
Neg logit contributions
 Bub-0.019
 mar-0.019
 Nib-0.019
 wra-0.018
endant-0.018
Token fox
Token ID21831
Feature activation-3.93e-01

Pos logit contributions
zed+0.004
gh+0.004
ze+0.004
ainer+0.004
 aids+0.004
Neg logit contributions
endant-0.004
 mar-0.004
 Bub-0.004
 Nib-0.004
 wra-0.004
Token to
Token ID284
Feature activation+7.23e-02

Pos logit contributions
zed+0.014
gh+0.014
 aids+0.013
ze+0.013
aster+0.013
Neg logit contributions
 mar-0.008
 wra-0.008
 Bub-0.008
endant-0.008
 Nib-0.008
Token help
Token ID1037
Feature activation-6.80e-01

Pos logit contributions
 Bub+0.002
 mar+0.002
 Nib+0.002
 wra+0.002
endant+0.002
Neg logit contributions
zed-0.002
gh-0.002
 aids-0.002
ainer-0.002
aster-0.002
Token him
Token ID683
Feature activation+5.61e-01

Pos logit contributions
zed+0.022
gh+0.021
 aids+0.021
ainer+0.021
ze+0.020
Neg logit contributions
 Bub-0.017
 mar-0.017
 Nib-0.016
 wra-0.016
 buried-0.015
Token get
Token ID651
Feature activation-2.14e-01

Pos logit contributions
 Bub+0.011
 mar+0.011
 Nib+0.011
 wra+0.010
 buried+0.010
Neg logit contributions
zed-0.021
gh-0.021
 aids-0.020
ainer-0.020
aster-0.020
Token back
Token ID736
Feature activation+5.35e-01

Pos logit contributions
zed+0.007
gh+0.006
 aids+0.006
ainer+0.006
aster+0.006
Neg logit contributions
 Bub-0.006
 mar-0.006
 Nib-0.006
 wra-0.006
endant-0.005
Token to
Token ID284
Feature activation+2.01e-01

Pos logit contributions
 Bub+0.014
 mar+0.013
 Nib+0.013
 wra+0.013
endant+0.013
Neg logit contributions
zed-0.017
gh-0.016
aster-0.015
ainer-0.015
 aids-0.015
Token the
Token ID262
Feature activation-1.02e-01

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.005
endant+0.005
Neg logit contributions
zed-0.006
gh-0.006
aster-0.006
 aids-0.006
ainer-0.006
Token to
Token ID284
Feature activation+2.04e-01

Pos logit contributions
ainer+0.031
gh+0.029
ags+0.029
 aids+0.029
zed+0.029
Neg logit contributions
 mar-0.013
 sc-0.012
 H-0.011
 locked-0.011
 lost-0.011
Token stare
Token ID24170
Feature activation-6.83e-02

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.005
 H+0.004
Neg logit contributions
zed-0.007
gh-0.007
ainer-0.006
 aids-0.006
ze-0.006
Token out
Token ID503
Feature activation-1.67e-01

Pos logit contributions
zed+0.003
gh+0.003
aster+0.003
 aids+0.003
ainer+0.003
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 buried-0.001
 Ell-0.001
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
zed+0.007
gh+0.007
 aids+0.006
ainer+0.006
ze+0.006
Neg logit contributions
 Bub-0.003
 mar-0.003
 Nib-0.003
 wra-0.003
endant-0.003
Token window
Token ID4324
Feature activation-6.54e-01

Pos logit contributions
zed+0.010
gh+0.009
ainer+0.009
 aids+0.009
ze+0.009
Neg logit contributions
 mar-0.005
 wra-0.005
 Bub-0.005
 Nib-0.004
endant-0.004
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
zed+0.028
gh+0.027
 aids+0.027
ainer+0.026
aster+0.026
Neg logit contributions
 Bub-0.010
 mar-0.009
 Nib-0.009
 wra-0.009
endant-0.008
Token watching
Token ID4964
Feature activation-9.97e-02

Pos logit contributions
zed+0.014
gh+0.014
ainer+0.013
 aids+0.013
aster+0.013
Neg logit contributions
 Bub-0.007
 mar-0.007
 Nib-0.007
 wra-0.007
endant-0.006
Token as
Token ID355
Feature activation-1.86e-01

Pos logit contributions
zed+0.004
gh+0.004
ainer+0.004
aster+0.003
 aids+0.003
Neg logit contributions
 Bub-0.002
 Nib-0.002
 mar-0.002
 Ell-0.002
 Tim-0.002
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
zed+0.005
aster+0.005
gh+0.005
 aids+0.005
ze+0.005
Neg logit contributions
 Bub-0.005
endant-0.005
 mar-0.005
 Nib-0.005
unn-0.005
Token world
Token ID995
Feature activation+1.21e-01

Pos logit contributions
zed+0.008
gh+0.008
ainer+0.008
 aids+0.007
ze+0.007
Neg logit contributions
 mar-0.004
 wra-0.004
 Bub-0.004
 Ell-0.004
 je-0.004
Token passed
Token ID3804
Feature activation+1.81e+00

Pos logit contributions
 Bub+0.002
 Nib+0.002
 mar+0.002
 wra+0.002
endant+0.002
Neg logit contributions
zed-0.005
gh-0.005
 aids-0.004
ze-0.004
ainer-0.004
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
zed+0.020
gh+0.019
ainer+0.019
 aids+0.019
aster+0.019
Neg logit contributions
 Bub-0.008
 Nib-0.008
 mar-0.008
 wra-0.007
 Ell-0.006
Token Sam
Token ID3409
Feature activation-2.07e-01

Pos logit contributions
 Bub+0.002
 mar+0.002
 Nib+0.002
 wra+0.002
endant+0.002
Neg logit contributions
zed-0.004
gh-0.004
 aids-0.003
ainer-0.003
ze-0.003
Token was
Token ID373
Feature activation-5.14e-01

Pos logit contributions
zed+0.008
gh+0.008
 aids+0.008
ze+0.007
ainer+0.007
Neg logit contributions
 Bub-0.004
 Nib-0.004
 mar-0.004
endant-0.004
 wra-0.004
Token happy
Token ID3772
Feature activation+5.16e-02

Pos logit contributions
zed+0.022
gh+0.021
ainer+0.021
 paragraph+0.021
aster+0.020
Neg logit contributions
 called-0.008
 mar-0.008
 Bub-0.008
 Nib-0.008
 buried-0.008
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.002
gh-0.002
 aids-0.002
ainer-0.002
aster-0.002
Token nodded
Token ID14464
Feature activation-2.38e-01

Pos logit contributions
zed+0.012
gh+0.012
 aids+0.012
ainer+0.012
aster+0.011
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
 wra-0.004
endant-0.004
Token his
Token ID465
Feature activation+7.75e-01

Pos logit contributions
zed+0.012
aster+0.011
 aids+0.011
ainer+0.011
gh+0.011
Neg logit contributions
 Bub-0.002
 said-0.002
 could-0.002
 Nib-0.002
 Tim-0.002
Token head
Token ID1182
Feature activation+2.88e-01

Pos logit contributions
 mar+0.021
 Bub+0.021
 wra+0.020
 Nib+0.020
endant+0.020
Neg logit contributions
zed-0.024
aster-0.023
gh-0.022
 aids-0.022
ainer-0.021
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
 Bub+0.005
 mar+0.005
 Nib+0.005
 wra+0.004
endant+0.004
Neg logit contributions
zed-0.012
gh-0.011
 aids-0.011
ainer-0.011
aster-0.011
Token They
Token ID1119
Feature activation+6.52e-02

Pos logit contributions
zed+0.012
ainer+0.011
gh+0.011
 aids+0.011
aster+0.011
Neg logit contributions
 mar-0.009
 wra-0.009
 Nib-0.009
endant-0.009
 Bub-0.009
Token put
Token ID1234
Feature activation-6.65e-01

Pos logit contributions
 Nib+0.002
endant+0.002
 Bub+0.001
 wra+0.001
unn+0.001
Neg logit contributions
zed-0.002
aster-0.002
ze-0.002
gh-0.002
rac-0.002
Token would
Token ID561
Feature activation+4.75e-02

Pos logit contributions
zed+0.040
gh+0.038
ze+0.037
 aids+0.037
aster+0.036
Neg logit contributions
 Bub-0.013
 mar-0.012
 Nib-0.011
endant-0.011
 wra-0.011
Token be
Token ID307
Feature activation-1.02e+00

Pos logit contributions
 Bub+0.001
 mar+0.001
 Nib+0.001
 wra+0.001
endant+0.001
Neg logit contributions
zed-0.002
gh-0.001
 aids-0.001
ainer-0.001
aster-0.001
Token fine
Token ID3734
Feature activation-2.91e-01

Pos logit contributions
zed+0.041
gh+0.039
ainer+0.038
 aids+0.038
 paragraph+0.038
Neg logit contributions
 Bub-0.019
 mar-0.019
 buried-0.019
 Nib-0.019
 called-0.018
Token soon
Token ID2582
Feature activation-4.97e-01

Pos logit contributions
zed+0.010
gh+0.010
ze+0.010
ainer+0.009
 aids+0.009
Neg logit contributions
 Bub-0.006
endant-0.006
 mar-0.006
 Nib-0.006
 wra-0.006
Token.
Token ID13
Feature activation-7.78e-01

Pos logit contributions
zed+0.019
gh+0.019
 aids+0.018
ainer+0.018
ze+0.018
Neg logit contributions
 Bub-0.010
 mar-0.009
 Nib-0.009
 wra-0.009
endant-0.009
Token
Token ID198
Feature activation-5.87e-01

Pos logit contributions
zed+0.025
 aids+0.024
gh+0.024
ainer+0.024
aster+0.024
Neg logit contributions
 mar-0.019
 Bub-0.019
 wra-0.018
endant-0.018
 Nib-0.018
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
zed+0.020
gh+0.019
aster+0.019
 aids+0.018
ainer+0.018
Neg logit contributions
 mar-0.014
 Bub-0.014
 wra-0.014
endant-0.013
 Nib-0.013
TokenTom
Token ID13787
Feature activation-6.00e-01

Pos logit contributions
zed+0.005
gh+0.005
 aids+0.004
ainer+0.004
ze+0.004
Neg logit contributions
 Bub-0.004
 mar-0.004
 Nib-0.004
 wra-0.003
endant-0.003
Token and
Token ID290
Feature activation-6.99e-01

Pos logit contributions
zed+0.023
ze+0.022
gh+0.021
 aids+0.021
aster+0.021
Neg logit contributions
 Bub-0.012
 Nib-0.012
endant-0.011
 mar-0.011
 haven-0.011
Token Lily
Token ID20037
Feature activation-3.60e-01

Pos logit contributions
zed+0.026
aster+0.026
 aids+0.025
gh+0.024
 sentence+0.024
Neg logit contributions
endant-0.015
 Bub-0.014
 mar-0.013
 Nib-0.013
 wra-0.013
Token learned
Token ID4499
Feature activation+7.51e-01

Pos logit contributions
zed+0.012
ze+0.012
aster+0.011
gh+0.011
 aids+0.011
Neg logit contributions
 Bub-0.009
 Nib-0.008
endant-0.008
unn-0.008
 junk-0.008
Token as
Token ID355
Feature activation-3.84e-01

Pos logit contributions
zed+0.023
ainer+0.022
gh+0.021
 aids+0.021
aster+0.020
Neg logit contributions
 mar-0.012
 Bub-0.012
 Nib-0.011
 H-0.011
 buried-0.011
Token smart
Token ID4451
Feature activation-1.18e+00

Pos logit contributions
zed+0.011
gh+0.011
 aids+0.011
aster+0.010
ainer+0.010
Neg logit contributions
 Bub-0.011
 Nib-0.010
endant-0.010
 mar-0.010
 wra-0.010
Token as
Token ID355
Feature activation-5.36e-01

Pos logit contributions
zed+0.047
gh+0.046
 aids+0.045
ainer+0.044
aster+0.044
Neg logit contributions
 Bub-0.022
 mar-0.021
 Nib-0.020
 wra-0.020
 ot-0.018
Token Jack
Token ID3619
Feature activation-8.91e-01

Pos logit contributions
zed+0.018
gh+0.018
 aids+0.017
aster+0.017
ainer+0.017
Neg logit contributions
 Bub-0.013
 Nib-0.012
 mar-0.012
 wra-0.012
endant-0.012
Token".
Token ID1911
Feature activation-8.47e-01

Pos logit contributions
zed+0.034
gh+0.032
ainer+0.032
 aids+0.032
aster+0.031
Neg logit contributions
 Bub-0.018
 mar-0.017
 Nib-0.017
 wra-0.016
endant-0.016
Token
Token ID198
Feature activation-9.38e-01

Pos logit contributions
zed+0.028
gh+0.026
 aids+0.026
ainer+0.026
aster+0.026
Neg logit contributions
 Bub-0.021
 Nib-0.021
 mar-0.020
 wra-0.020
endant-0.020
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
zed+0.034
gh+0.033
 aids+0.032
ainer+0.032
aster+0.031
Neg logit contributions
 Bub-0.020
 mar-0.020
 Nib-0.020
 wra-0.020
endant-0.019
TokenJ
Token ID41
Feature activation-1.61e+00

Pos logit contributions
zed+0.040
gh+0.038
 aids+0.037
ze+0.037
ainer+0.036
Neg logit contributions
 Bub-0.031
 mar-0.031
 wra-0.029
 Nib-0.028
 could-0.027
Tokenill
Token ID359
Feature activation-6.27e-01

Pos logit contributions
zed+0.055
gh+0.053
 aids+0.052
ainer+0.051
aster+0.051
Neg logit contributions
 Bub-0.038
 mar-0.037
 Nib-0.037
 wra-0.035
endant-0.034
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
zed+0.028
gh+0.027
 aids+0.026
ainer+0.026
ze+0.026
Neg logit contributions
 Bub-0.008
 mar-0.008
 Nib-0.008
 wra-0.007
endant-0.007
Token so
Token ID523
Feature activation+8.90e-02

Pos logit contributions
 Bub+0.003
 Nib+0.003
 mar+0.003
 buried+0.003
 lost+0.003
Neg logit contributions
zed-0.008
gh-0.007
ainer-0.007
aster-0.007
 paragraph-0.007
Token bang
Token ID20188
Feature activation-1.25e+00

Pos logit contributions
zed+0.059
ainer+0.058
gh+0.057
 paragraph+0.052
rac+0.052
Neg logit contributions
 Bub-0.044
 c-0.042
 Nib-0.041
 Tim-0.040
 ot-0.039
Token.
Token ID13
Feature activation-5.54e-01

Pos logit contributions
zed+0.064
gh+0.060
ainer+0.058
aster+0.058
 aids+0.058
Neg logit contributions
 called-0.010
 wra-0.010
 said-0.009
 Bub-0.009
 mar-0.009
Token They
Token ID1119
Feature activation-9.70e-01

Pos logit contributions
zed+0.016
gh+0.016
 aids+0.015
ainer+0.015
aster+0.015
Neg logit contributions
 Bub-0.016
 mar-0.015
 Nib-0.015
 wra-0.015
endant-0.015
Token see
Token ID766
Feature activation+1.61e-01

Pos logit contributions
zed+0.026
gh+0.025
 aids+0.024
ainer+0.024
aster+0.024
Neg logit contributions
 Bub-0.030
 mar-0.029
 Nib-0.029
 wra-0.028
endant-0.028
Token smoke
Token ID7523
Feature activation-1.14e+00

Pos logit contributions
 Bub+0.005
 haven+0.004
 mar+0.004
endant+0.004
ll+0.004
Neg logit contributions
 aids-0.004
zed-0.004
aster-0.004
ze-0.004
gh-0.004
Token and
Token ID290
Feature activation-2.37e+00

Pos logit contributions
zed+0.047
gh+0.045
ainer+0.043
 aids+0.043
aster+0.043
Neg logit contributions
 wra-0.019
 mar-0.018
 Bub-0.018
 Nib-0.018
 called-0.017
Token sparks
Token ID38306
Feature activation-1.42e+00

Pos logit contributions
zed+0.072
gh+0.069
ainer+0.066
 aids+0.066
ze+0.064
Neg logit contributions
 Bub-0.066
 mar-0.064
 Nib-0.064
 wra-0.063
endant-0.060
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
zed+0.057
gh+0.056
ainer+0.054
 aids+0.054
aster+0.053
Neg logit contributions
 mar-0.024
 Bub-0.024
 wra-0.023
 Nib-0.023
 buried-0.022
Token They
Token ID1119
Feature activation-1.17e+00

Pos logit contributions
zed+0.046
gh+0.044
ainer+0.042
 aids+0.042
ze+0.042
Neg logit contributions
 Bub-0.031
 Nib-0.030
 mar-0.030
 wra-0.028
endant-0.028
Token feel
Token ID1254
Feature activation+1.47e-01

Pos logit contributions
zed+0.033
gh+0.031
 aids+0.031
aster+0.030
ze+0.030
Neg logit contributions
 Bub-0.035
 mar-0.034
 Nib-0.033
 wra-0.032
endant-0.032
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
 Bub+0.004
 mar+0.004
 Nib+0.004
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.005
gh-0.005
 aids-0.005
ainer-0.005
aster-0.005
Token and
Token ID290
Feature activation-1.37e+00

Pos logit contributions
zed+0.021
gh+0.021
ainer+0.021
ze+0.020
 aids+0.020
Neg logit contributions
 Bub-0.011
 Nib-0.011
 mar-0.010
endant-0.010
 haven-0.009
Token go
Token ID467
Feature activation-4.25e-01

Pos logit contributions
zed+0.048
gh+0.046
 aids+0.045
ainer+0.044
aster+0.044
Neg logit contributions
 Bub-0.032
 mar-0.030
 Nib-0.030
 wra-0.029
endant-0.028
Tokenoe
Token ID2577
Feature activation-1.57e+00

Pos logit contributions
zed+0.014
gh+0.013
ainer+0.013
 aids+0.013
aster+0.013
Neg logit contributions
 Bub-0.011
 mar-0.010
 Nib-0.010
 wra-0.010
endant-0.010
Tokeny
Token ID88
Feature activation-5.24e-01

Pos logit contributions
zed+0.053
gh+0.051
ainer+0.050
 aids+0.050
aster+0.048
Neg logit contributions
 mar-0.037
 wra-0.036
 Bub-0.036
 Nib-0.035
endant-0.034
Token when
Token ID618
Feature activation-1.31e+00

Pos logit contributions
zed+0.021
gh+0.021
ainer+0.020
 aids+0.020
aster+0.020
Neg logit contributions
 Bub-0.009
 mar-0.009
 Nib-0.008
 wra-0.008
endant-0.008
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
zed+0.046
gh+0.044
ainer+0.043
 aids+0.043
ze+0.043
Neg logit contributions
 Bub-0.030
 mar-0.029
 Nib-0.029
 wra-0.028
endant-0.026
Token timer
Token ID19781
Feature activation-1.53e+00

Pos logit contributions
zed+0.073
gh+0.070
ainer+0.067
 aids+0.065
ze+0.064
Neg logit contributions
 wra-0.046
 mar-0.044
 Nib-0.041
 buried-0.040
 Bub-0.039
Token be
Token ID307
Feature activation-8.16e-01

Pos logit contributions
zed+0.057
gh+0.055
 aids+0.053
ainer+0.053
aster+0.052
Neg logit contributions
 mar-0.030
 Bub-0.030
 wra-0.030
 Nib-0.029
endant-0.028
Tokenep
Token ID538
Feature activation-1.80e+00

Pos logit contributions
zed+0.021
ainer+0.020
 aids+0.020
 sentence+0.019
gh+0.019
Neg logit contributions
 wra-0.026
 called-0.026
 mar-0.026
 buried-0.025
 said-0.025
Tokened
Token ID276
Feature activation-2.51e-01

Pos logit contributions
zed+0.064
gh+0.061
 aids+0.060
ainer+0.060
aster+0.059
Neg logit contributions
 Bub-0.040
 mar-0.038
 Nib-0.038
 wra-0.036
endant-0.035
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
zed+0.009
gh+0.009
 aids+0.009
ainer+0.009
aster+0.009
Neg logit contributions
 Bub-0.005
 mar-0.005
 Nib-0.005
 wra-0.005
endant-0.005
Token was
Token ID373
Feature activation+2.02e-01

Pos logit contributions
 Bub+0.007
 Nib+0.007
 wra+0.007
endant+0.007
 mar+0.007
Neg logit contributions
zed-0.016
ze-0.016
 aids-0.015
gh-0.015
 sentence-0.015
Token amazed
Token ID24562
Feature activation+4.78e-01

Pos logit contributions
 Bub+0.004
 Nib+0.004
 mar+0.004
 wra+0.003
endant+0.003
Neg logit contributions
zed-0.008
gh-0.008
ainer-0.008
aster-0.008
 aids-0.008
Token!
Token ID0
Feature activation+1.56e-01

Pos logit contributions
 Bub+0.011
 Nib+0.010
 mar+0.010
 wra+0.010
endant+0.009
Neg logit contributions
zed-0.016
gh-0.016
 aids-0.016
ainer-0.016
ze-0.016
Token She
Token ID1375
Feature activation+9.21e-01

Pos logit contributions
 mar+0.004
 Bub+0.004
 wra+0.004
 Nib+0.004
endant+0.004
Neg logit contributions
zed-0.005
 aids-0.005
ainer-0.004
gh-0.004
aster-0.004
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 Nib+0.019
 wra+0.019
unn+0.017
 mar+0.017
endant+0.017
Neg logit contributions
zed-0.034
ze-0.033
 aids-0.031
gh-0.031
rac-0.030
Token never
Token ID1239
Feature activation-2.34e+00

Pos logit contributions
zed+0.035
ainer+0.035
gh+0.034
aster+0.033
 aids+0.033
Neg logit contributions
 Bub-0.023
 mar-0.022
 Nib-0.021
 wra-0.020
 buried-0.020
Token seen
Token ID1775
Feature activation-2.56e-01

Pos logit contributions
zed+0.078
gh+0.075
 aids+0.073
aster+0.072
ze+0.072
Neg logit contributions
 Bub-0.058
 Nib-0.056
 mar-0.055
 wra-0.054
endant-0.052
Token an
Token ID281
Feature activation-2.58e-01

Pos logit contributions
zed+0.006
 aids+0.006
gh+0.006
aster+0.006
ze+0.006
Neg logit contributions
 Bub-0.008
 wra-0.008
 mar-0.008
 Nib-0.008
endant-0.008
Token avocado
Token ID40377
Feature activation+4.43e-01

Pos logit contributions
zed+0.007
aster+0.007
gh+0.007
ainer+0.007
 aids+0.007
Neg logit contributions
 Bub-0.008
 Nib-0.008
 mar-0.007
 wra-0.007
endant-0.007
Token man
Token ID582
Feature activation+5.59e-02

Pos logit contributions
 Bub+0.011
 Nib+0.010
endant+0.010
 mar+0.010
unn+0.009
Neg logit contributions
zed-0.015
gh-0.015
aster-0.015
ze-0.015
ainer-0.014
Token before
Token ID878
Feature activation+8.40e-02

Pos logit contributions
 Bub+0.001
 mar+0.001
endant+0.001
 wra+0.001
 Nib+0.001
Neg logit contributions
zed-0.002
gh-0.002
ze-0.002
aster-0.002
 aids-0.002
Token time
Token ID640
Feature activation-1.26e+00

Pos logit contributions
zed+0.032
ainer+0.031
 aids+0.031
ze+0.030
rac+0.029
Neg logit contributions
 mar-0.039
 Bub-0.039
 wra-0.036
 le-0.036
 Nib-0.036
Token,
Token ID11
Feature activation+4.26e-01

Pos logit contributions
zed+0.045
gh+0.044
 aids+0.042
aster+0.042
ze+0.042
Neg logit contributions
 Bub-0.028
 Nib-0.027
 mar-0.026
 wra-0.025
endant-0.025
Token there
Token ID612
Feature activation-7.15e-01

Pos logit contributions
endant+0.012
 wra+0.012
unn+0.012
 Nib+0.011
 mar+0.011
Neg logit contributions
zed-0.012
aster-0.012
gh-0.011
ainer-0.011
 aids-0.011
Token was
Token ID373
Feature activation+5.03e-01

Pos logit contributions
zed+0.025
gh+0.024
 aids+0.024
ze+0.023
aster+0.023
Neg logit contributions
 Bub-0.016
 Nib-0.015
 mar-0.015
endant-0.015
 wra-0.015
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
 Bub+0.013
 mar+0.012
 Nib+0.012
endant+0.012
 wra+0.012
Neg logit contributions
zed-0.016
gh-0.016
 aids-0.015
aster-0.015
ainer-0.015
Token long
Token ID890
Feature activation-2.84e+00

Pos logit contributions
gh+0.031
zed+0.030
aster+0.030
 aids+0.030
ze+0.030
Neg logit contributions
 Bub-0.028
 Nib-0.027
endant-0.027
unn-0.026
 mar-0.026
Token snake
Token ID17522
Feature activation-3.63e-01

Pos logit contributions
zed+0.096
gh+0.089
ainer+0.087
 aids+0.086
ze+0.084
Neg logit contributions
 mar-0.069
 Bub-0.068
 wra-0.066
 Nib-0.066
 buried-0.063
Token,
Token ID11
Feature activation+3.06e-01

Pos logit contributions
zed+0.012
gh+0.012
ze+0.012
 aids+0.012
aster+0.012
Neg logit contributions
 Bub-0.009
 mar-0.008
 Nib-0.008
 wra-0.008
endant-0.008
Token who
Token ID508
Feature activation-3.50e-01

Pos logit contributions
unn+0.005
endant+0.005
 wra+0.004
 mar+0.004
 Nib+0.004
Neg logit contributions
aster-0.012
 aids-0.012
zed-0.011
gh-0.011
ze-0.011
Token lived
Token ID5615
Feature activation+5.41e-01

Pos logit contributions
zed+0.014
 aids+0.013
gh+0.013
ze+0.013
aster+0.013
Neg logit contributions
 Bub-0.006
 Nib-0.006
 mar-0.006
 wra-0.005
endant-0.005
Token in
Token ID287
Feature activation-6.28e-01

Pos logit contributions
 Bub+0.012
 mar+0.012
 Nib+0.012
 wra+0.011
endant+0.011
Neg logit contributions
zed-0.019
gh-0.019
 aids-0.018
ainer-0.018
ze-0.018
Token.
Token ID13
Feature activation-7.33e-01

Pos logit contributions
zed+0.019
gh+0.019
 aids+0.018
ainer+0.018
aster+0.018
Neg logit contributions
 Bub-0.010
 mar-0.010
 Nib-0.010
 wra-0.009
endant-0.009
Token It
Token ID632
Feature activation-3.92e-01

Pos logit contributions
zed+0.023
gh+0.023
 aids+0.022
ainer+0.021
ze+0.021
Neg logit contributions
 Bub-0.019
 Nib-0.019
 mar-0.019
 wra-0.018
endant-0.017
Token is
Token ID318
Feature activation-1.35e+00

Pos logit contributions
zed+0.015
gh+0.015
ainer+0.015
 aids+0.015
aster+0.015
Neg logit contributions
 Bub-0.007
 Nib-0.007
 mar-0.007
 wra-0.007
endant-0.006
Token not
Token ID407
Feature activation-2.21e+00

Pos logit contributions
zed+0.049
gh+0.047
 aids+0.046
ainer+0.046
ze+0.045
Neg logit contributions
 Bub-0.030
 Nib-0.030
 mar-0.029
 wra-0.028
 buried-0.027
Token spicy
Token ID26880
Feature activation-1.67e+00

Pos logit contributions
zed+0.083
gh+0.079
 aids+0.077
ainer+0.077
ze+0.075
Neg logit contributions
 Bub-0.048
 mar-0.047
 Nib-0.046
 wra-0.044
 buried-0.043
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
zed+0.069
gh+0.065
 aids+0.065
aster+0.063
ainer+0.063
Neg logit contributions
 Bub-0.028
 mar-0.028
 wra-0.027
 Nib-0.027
 ot-0.024
Token it
Token ID340
Feature activation-7.92e-02

Pos logit contributions
zed+0.073
gh+0.070
 aids+0.068
ainer+0.068
aster+0.067
Neg logit contributions
 Bub-0.046
 mar-0.044
 Nib-0.044
 wra-0.042
endant-0.041
Token is
Token ID318
Feature activation-1.34e+00

Pos logit contributions
zed+0.003
gh+0.003
ainer+0.003
 aids+0.003
aster+0.003
Neg logit contributions
 Bub-0.001
 mar-0.001
 Nib-0.001
 wra-0.001
endant-0.001
Token sweet
Token ID6029
Feature activation-1.88e+00

Pos logit contributions
zed+0.047
gh+0.045
ainer+0.044
 aids+0.044
aster+0.043
Neg logit contributions
 Bub-0.031
 Nib-0.030
 mar-0.030
 wra-0.029
 buried-0.028
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
zed+0.086
gh+0.082
 aids+0.080
aster+0.079
ainer+0.079
Neg logit contributions
 Bub-0.024
 mar-0.024
 wra-0.023
 Nib-0.022
 ot-0.020
Token It
Token ID632
Feature activation-4.71e-01

Pos logit contributions
zed+0.035
gh+0.034
 aids+0.033
ainer+0.032
ze+0.032
Neg logit contributions
 Bub-0.033
 Nib-0.032
 mar-0.032
 wra-0.030
endant-0.030