TOP ACTIVATIONS
MAX = 0.045

. That is not how friends act ." Tom
" What are pictures ?" Lily asked . She
not how you treat your friends ." their parents say ,
" What are peanuts ?" Lily asked .
, " What does ' ups et ' mean ?". His
She said , " No cars ! Only marriage !" Tom
One of her favourite things to do was to
fair . You have four cars and I have three .
you want and not what others want or need . Being
air with a loud " who osh " sound . Everyone
didn 't Sarah answer my questions ?" she asked .
ila . Do you like trucks ?" L ila
the park playing with her favourite t eddy . Suddenly ,
Why did you race your cars on the stairs ? That
. " Do you like trucks ?" " Yes
. Max had a w agging tail and a shiny collar
always acted silly and mis chie vous . One
patient and not cry when things don 't go as planned
Jack was playing with his favourite car in his bedroom .
sorry . And no more cars for today ."

MIN ACTIVATIONS
MIN = -0.046

ask his neighbour , Mrs Clar ise , for help .
ready to play and asked Fl oppy to join . Fl
to her best friend , Deb by . When
His peace . They asked Him to calm the storm and
was a little girl called Lex i . She lived in
. She hoped they would grow into pretty flowers .
cute pink rabbit called Mrs Fl uffy . She was small
and said , " S oph ie , no matter how
and Lily calls her rabbit Fl uffy . Mom
my said , " S oph ie , let 's go
" When are we going on holiday again ?"
. " Mrs Clar ise , please help me
to invite her friend R uf us to play . She
Later , Tim saw Mr . Bear on the swing
After a while , they reached a big , dark cave
button . They hear a wh ir ring sound and see
grandma . Grand ma said Fl uffy was not used to
to the kitchen and look for food . They see a
" Hello , Fl oppy !" Tweet ie ch
way . Finally , they reached the end of their journey

INTERVAL 0.022 - 0.045
CONTAINS 0.422%

there was a little boy , who was very brave and
. She never knew what was going to happen each time
was very frustrated at not being able to reach it .
, " Well done Brad ! Now you know what was
the tree . " What 's wrong , Lily ?" he

INTERVAL -0.001 - 0.022
CONTAINS 54.189%

my way because I am very competitive and I want to
carefully . However , suddenly Winn ie forgot that
on the street stop and listen . When the
together and watch their favourite shows . They had so much
to the table . She sees the car and the toys

INTERVAL -0.023 - -0.001
CONTAINS 45.119%

She didn 't know what to do , so she asked
far away land , the little girl saw beautiful castles and
to go home . You will not be welcome here anymore
The dragon saw a toy boat and he wanted
time , there was a little girl named Lily . She

INTERVAL -0.046 - -0.023
CONTAINS 0.269%

was so proud of his caul iflower plants and kept them
His dad said , " Oh no , we have to
were busy and took the oy ster to a hidden spot
. She says , " Thank you , but no thank
noise . It was a dog barking . The dog saw
Token.
Token ID13
Feature activation-1.26e-03

Pos logit contributions
 Math+0.020
 sidewalk+0.018
 piano+0.017
 dictionary+0.017
 automobiles+0.017
Neg logit contributions
 nearer-0.019
 closer-0.019
pt-0.017
 afloat-0.017
 hes-0.017
Token That
Token ID1320
Feature activation+2.20e-03

Pos logit contributions
 nearer+0.002
apa+0.002
ung+0.002
 closer+0.002
 stro+0.002
Neg logit contributions
 sidewalk-0.001
 games-0.001
 books-0.001
 piano-0.001
 game-0.001
Token is
Token ID318
Feature activation+8.47e-03

Pos logit contributions
 Math+0.003
 sidewalk+0.003
 automobiles+0.003
 dictionary+0.003
issors+0.003
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 hes-0.003
eled-0.003
Token not
Token ID407
Feature activation+1.19e-02

Pos logit contributions
 Math+0.015
 sidewalk+0.015
board+0.015
 pavement+0.014
 automobiles+0.014
Neg logit contributions
 closer-0.011
 nearer-0.011
 deeper-0.009
 hes-0.009
 afloat-0.009
Token how
Token ID703
Feature activation+1.44e-02

Pos logit contributions
 Math+0.021
board+0.020
 sidewalk+0.019
 pavement+0.018
 exam+0.018
Neg logit contributions
 closer-0.016
 nearer-0.016
 afloat-0.014
 deeper-0.014
 hes-0.014
Token friends
Token ID2460
Feature activation+4.45e-02

Pos logit contributions
 Math+0.019
 sidewalk+0.017
 dictionary+0.016
 piano+0.016
 automobiles+0.016
Neg logit contributions
 nearer-0.026
 closer-0.025
apa-0.023
 afloat-0.023
 submerged-0.023
Token act
Token ID719
Feature activation+1.02e-02

Pos logit contributions
 Math+0.067
 automobiles+0.058
 sidewalk+0.058
 dictionary+0.057
 computer+0.056
Neg logit contributions
 closer-0.071
 nearer-0.070
 afloat-0.064
pt-0.064
 hes-0.061
Token."
Token ID526
Feature activation-4.90e-03

Pos logit contributions
 Math+0.014
 dictionary+0.012
 sidewalk+0.012
 automobiles+0.011
 books+0.011
Neg logit contributions
 nearer-0.019
 afloat-0.018
 closer-0.018
 submerged-0.017
eled-0.017
Token
Token ID198
Feature activation-1.87e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
 closer+0.006
eled+0.006
 submerged+0.006
Neg logit contributions
 Math-0.009
 dictionary-0.008
 automobiles-0.008
 books-0.008
 sidewalk-0.008
Token
Token ID198
Feature activation+2.24e-03

Pos logit contributions
 nearer+0.003
 afloat+0.003
eled+0.003
 closer+0.003
 submerged+0.003
Neg logit contributions
 Math-0.003
 sidewalk-0.003
 books-0.003
 automobiles-0.003
 dictionary-0.003
TokenTom
Token ID13787
Feature activation+1.02e-02

Pos logit contributions
 Math+0.002
 guys+0.002
 dictionary+0.002
 automobiles+0.002
 exam+0.002
Neg logit contributions
 nearer-0.005
 closer-0.004
eled-0.004
 afloat-0.004
 submerged-0.004
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
eled+0.028
 nearer+0.027
 afloat+0.027
 submerged+0.026
igi+0.025
Neg logit contributions
 Math-0.024
 kids-0.020
 books-0.020
 sidewalk-0.019
 dictionary-0.019
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
eled+0.021
 nearer+0.020
 afloat+0.020
 submerged+0.020
 fascination+0.019
Neg logit contributions
 Math-0.015
 kids-0.013
 books-0.012
 sidewalk-0.012
 automobiles-0.012
Token"
Token ID1
Feature activation+1.97e-03

Pos logit contributions
 nearer+0.018
 afloat+0.017
eled+0.017
 closer+0.017
 submerged+0.016
Neg logit contributions
 Math-0.014
 dictionary-0.012
 books-0.012
 automobiles-0.012
 sidewalk-0.012
TokenWhat
Token ID2061
Feature activation+6.41e-03

Pos logit contributions
 Math+0.003
 books+0.002
 automobiles+0.002
 dictionary+0.002
 earthquake+0.002
Neg logit contributions
 nearer-0.003
 closer-0.003
eled-0.003
 submerged-0.003
 afloat-0.003
Token are
Token ID389
Feature activation+2.55e-02

Pos logit contributions
 pavement+0.010
 sidewalk+0.010
 automobiles+0.009
arrass+0.009
 Math+0.009
Neg logit contributions
 closer-0.010
 nearer-0.010
 hes-0.009
ull-0.008
apa-0.008
Token pictures
Token ID5986
Feature activation+4.37e-02

Pos logit contributions
 pavement+0.029
 Math+0.029
 sidewalk+0.028
board+0.028
 earthquake+0.027
Neg logit contributions
 nearer-0.044
 closer-0.044
 deeper-0.040
apa-0.039
 hes-0.039
Token?"
Token ID1701
Feature activation-6.38e-03

Pos logit contributions
 Math+0.077
arrass+0.074
 automobiles+0.071
 sidewalk+0.071
 pavement+0.068
Neg logit contributions
 nearer-0.057
 closer-0.056
 afloat-0.046
inkle-0.045
 submerged-0.044
Token Lily
Token ID20037
Feature activation+2.48e-02

Pos logit contributions
 nearer+0.013
 afloat+0.013
eled+0.013
 closer+0.013
 submerged+0.012
Neg logit contributions
 Math-0.007
 dictionary-0.006
 automobiles-0.006
 books-0.006
 sidewalk-0.006
Token asked
Token ID1965
Feature activation+1.49e-02

Pos logit contributions
 Math+0.046
 automobiles+0.041
 books+0.041
 dictionary+0.041
 sidewalk+0.041
Neg logit contributions
 nearer-0.032
eled-0.030
 closer-0.030
 hes-0.029
 afloat-0.029
Token.
Token ID13
Feature activation-3.13e-03

Pos logit contributions
 Math+0.021
 dictionary+0.018
 sidewalk+0.018
 automobiles+0.018
 piano+0.017
Neg logit contributions
 nearer-0.027
 closer-0.026
 afloat-0.025
 hes-0.025
eled-0.024
Token She
Token ID1375
Feature activation+7.28e-03

Pos logit contributions
 nearer+0.005
 afloat+0.005
eled+0.005
 submerged+0.005
 closer+0.005
Neg logit contributions
 Math-0.005
 dictionary-0.004
 automobiles-0.004
 sidewalk-0.004
 books-0.004
Token not
Token ID407
Feature activation+1.34e-02

Pos logit contributions
board+0.025
 Math+0.024
 sidewalk+0.024
aces+0.024
 pavement+0.023
Neg logit contributions
 closer-0.012
 nearer-0.012
 afloat-0.011
 soon-0.011
 deeper-0.010
Token how
Token ID703
Feature activation+1.18e-02

Pos logit contributions
 Math+0.021
 sidewalk+0.019
board+0.019
 dictionary+0.019
 pavement+0.018
Neg logit contributions
 closer-0.020
 nearer-0.020
 deeper-0.019
 submerged-0.019
 afloat-0.019
Token you
Token ID345
Feature activation+2.26e-02

Pos logit contributions
 Math+0.015
 dictionary+0.014
 sidewalk+0.014
 automobiles+0.014
 piano+0.013
Neg logit contributions
 nearer-0.020
 closer-0.020
apa-0.020
 submerged-0.019
 afloat-0.019
Token treat
Token ID2190
Feature activation+2.13e-02

Pos logit contributions
 Math+0.033
 automobiles+0.028
 dictionary+0.028
 sidewalk+0.027
 books+0.026
Neg logit contributions
 nearer-0.040
 afloat-0.039
 closer-0.038
 submerged-0.038
eled-0.036
Token your
Token ID534
Feature activation-7.88e-04

Pos logit contributions
 Math+0.029
 dictionary+0.025
 sidewalk+0.025
 pavement+0.024
 automobiles+0.024
Neg logit contributions
 closer-0.035
 nearer-0.034
 afloat-0.033
apa-0.032
 deeper-0.031
Token friends
Token ID2460
Feature activation+4.27e-02

Pos logit contributions
 nearer+0.002
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 automobiles-0.001
 sidewalk-0.001
 books-0.001
Token."
Token ID526
Feature activation+2.90e-03

Pos logit contributions
 Math+0.068
 automobiles+0.058
 dictionary+0.058
 sidewalk+0.058
 pavement+0.055
Neg logit contributions
 closer-0.065
 nearer-0.064
 afloat-0.063
pt-0.060
 submerged-0.060
Token their
Token ID511
Feature activation-5.94e-03

Pos logit contributions
 Math+0.003
 automobiles+0.003
 books+0.003
 kids+0.003
 dictionary+0.003
Neg logit contributions
 nearer-0.006
 afloat-0.006
eled-0.006
 submerged-0.006
igi-0.006
Token parents
Token ID3397
Feature activation+4.39e-03

Pos logit contributions
 nearer+0.011
 afloat+0.011
 closer+0.010
 submerged+0.010
igi+0.010
Neg logit contributions
 Math-0.008
 kids-0.008
 piano-0.007
 books-0.007
 computer-0.007
Token say
Token ID910
Feature activation-4.71e-03

Pos logit contributions
 Math+0.007
 dictionary+0.006
 automobiles+0.006
 sidewalk+0.006
 books+0.006
Neg logit contributions
 nearer-0.007
 closer-0.006
 hes-0.006
eled-0.006
 afloat-0.006
Token,
Token ID11
Feature activation+5.36e-03

Pos logit contributions
 nearer+0.008
 afloat+0.007
eled+0.007
 closer+0.007
 submerged+0.007
Neg logit contributions
 Math-0.007
 automobiles-0.006
 dictionary-0.006
 books-0.006
 sidewalk-0.006
Token
Token ID198
Feature activation+5.73e-04

Pos logit contributions
 nearer+0.010
 afloat+0.009
eled+0.009
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.011
 automobiles-0.010
 dictionary-0.010
 books-0.010
 sidewalk-0.010
Token
Token ID198
Feature activation+7.79e-03

Pos logit contributions
 Math+0.001
 sidewalk+0.001
 books+0.001
 automobiles+0.001
 kids+0.001
Neg logit contributions
 nearer-0.001
 afloat-0.001
eled-0.001
 submerged-0.001
 closer-0.001
Token"
Token ID1
Feature activation-1.32e-04

Pos logit contributions
 Math+0.013
 dictionary+0.012
 automobiles+0.012
 books+0.012
 exam+0.011
Neg logit contributions
 nearer-0.011
 closer-0.011
eled-0.011
 submerged-0.010
 afloat-0.010
TokenWhat
Token ID2061
Feature activation+5.36e-03

Pos logit contributions
 nearer+0.000
 closer+0.000
eled+0.000
 afloat+0.000
 submerged+0.000
Neg logit contributions
 Math-0.000
 dictionary-0.000
 books-0.000
 automobiles-0.000
 sidewalk-0.000
Token are
Token ID389
Feature activation+2.44e-02

Pos logit contributions
 pavement+0.008
 sidewalk+0.008
 Math+0.007
 automobiles+0.007
 earthquake+0.007
Neg logit contributions
 closer-0.008
 nearer-0.008
 hes-0.007
apa-0.007
ull-0.007
Token peanuts
Token ID45612
Feature activation+4.22e-02

Pos logit contributions
 Math+0.040
 dictionary+0.037
 pavement+0.035
aces+0.035
 sidewalk+0.035
Neg logit contributions
 nearer-0.035
 closer-0.035
apa-0.032
igi-0.031
 afloat-0.031
Token?"
Token ID1701
Feature activation+5.34e-04

Pos logit contributions
 Math+0.077
 automobiles+0.068
 dictionary+0.066
 sidewalk+0.065
arrass+0.064
Neg logit contributions
 nearer-0.055
 afloat-0.054
 closer-0.053
 submerged-0.049
 surfaced-0.045
Token Lily
Token ID20037
Feature activation+2.26e-02

Pos logit contributions
 Math+0.001
 dictionary+0.001
 automobiles+0.001
 sidewalk+0.001
 books+0.001
Neg logit contributions
 nearer-0.001
 closer-0.001
eled-0.001
 afloat-0.001
 submerged-0.001
Token asked
Token ID1965
Feature activation+2.31e-02

Pos logit contributions
 Math+0.042
 automobiles+0.038
 dictionary+0.037
 books+0.037
 sidewalk+0.037
Neg logit contributions
 nearer-0.029
eled-0.027
 closer-0.027
 afloat-0.027
 hes-0.026
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
 Math+0.033
 dictionary+0.029
 sidewalk+0.028
 automobiles+0.028
 mine+0.027
Neg logit contributions
 nearer-0.040
 closer-0.039
 afloat-0.037
 hes-0.037
eled-0.037
Token
Token ID198
Feature activation+6.95e-03

Pos logit contributions
 Math+0.018
 dictionary+0.015
 automobiles+0.015
 books+0.015
 sidewalk+0.015
Neg logit contributions
 nearer-0.017
 afloat-0.016
eled-0.015
 closer-0.015
 submerged-0.015
Token,
Token ID11
Feature activation-1.17e-04

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
eled+0.003
apa+0.003
Neg logit contributions
 Math-0.003
 sidewalk-0.003
 automobiles-0.003
 dictionary-0.003
 piano-0.003
Token "
Token ID366
Feature activation-7.14e-03

Pos logit contributions
 nearer+0.000
 closer+0.000
 afloat+0.000
eled+0.000
 hes+0.000
Neg logit contributions
 Math-0.000
 automobiles-0.000
 sidewalk-0.000
 dictionary-0.000
 books-0.000
TokenWhat
Token ID2061
Feature activation+5.30e-03

Pos logit contributions
 nearer+0.011
 closer+0.011
eled+0.010
 afloat+0.010
 submerged+0.010
Neg logit contributions
 Math-0.012
 automobiles-0.010
 sidewalk-0.010
 books-0.010
 dictionary-0.010
Token does
Token ID857
Feature activation+5.44e-03

Pos logit contributions
 sidewalk+0.008
 pavement+0.008
 Math+0.007
 automobiles+0.007
issors+0.007
Neg logit contributions
 nearer-0.008
 closer-0.008
 hes-0.008
apa-0.007
pt-0.007
Token '
Token ID705
Feature activation+2.54e-02

Pos logit contributions
 Math+0.008
 automobiles+0.007
 sidewalk+0.007
 dictionary+0.007
 books+0.007
Neg logit contributions
 nearer-0.009
 afloat-0.009
 closer-0.009
 submerged-0.008
eled-0.008
Tokenups
Token ID4739
Feature activation+4.14e-02

Pos logit contributions
 Math+0.043
 sidewalk+0.040
 books+0.040
 computer+0.039
 automobiles+0.039
Neg logit contributions
 nearer-0.036
 closer-0.035
eled-0.033
 afloat-0.033
 submerged-0.033
Tokenet
Token ID316
Feature activation+1.59e-02

Pos logit contributions
 Math+0.060
 automobiles+0.054
 dictionary+0.052
 books+0.051
 sidewalk+0.050
Neg logit contributions
 nearer-0.070
 closer-0.069
 afloat-0.065
 submerged-0.063
pt-0.063
Token'
Token ID6
Feature activation+1.99e-02

Pos logit contributions
 Math+0.025
 sidewalk+0.023
 automobiles+0.022
 dictionary+0.022
 books+0.022
Neg logit contributions
 nearer-0.025
apa-0.024
 closer-0.024
 afloat-0.023
eled-0.022
Token mean
Token ID1612
Feature activation+9.62e-03

Pos logit contributions
 Math+0.030
 automobiles+0.026
 dictionary+0.026
 sidewalk+0.026
 books+0.026
Neg logit contributions
 nearer-0.032
 closer-0.031
 afloat-0.031
apa-0.029
 submerged-0.029
Token?".
Token ID43634
Feature activation+6.43e-04

Pos logit contributions
 Math+0.015
 sidewalk+0.014
 automobiles+0.013
 dictionary+0.013
aces+0.012
Neg logit contributions
 nearer-0.015
 closer-0.015
apa-0.015
 afloat-0.014
 hes-0.014
Token His
Token ID2399
Feature activation+4.15e-03

Pos logit contributions
 Math+0.001
 automobiles+0.001
 dictionary+0.001
 books+0.001
 sidewalk+0.001
Neg logit contributions
 nearer-0.001
eled-0.001
 afloat-0.001
 closer-0.001
 submerged-0.001
Token She
Token ID1375
Feature activation-2.01e-03

Pos logit contributions
 Math+0.003
 piano+0.003
 books+0.003
 dictionary+0.002
 games+0.002
Neg logit contributions
 nearer-0.002
apa-0.002
eled-0.002
 closer-0.002
 afloat-0.002
Token said
Token ID531
Feature activation-1.56e-03

Pos logit contributions
 nearer+0.003
eled+0.003
 closer+0.003
 hes+0.003
 submerged+0.003
Neg logit contributions
 Math-0.003
 automobiles-0.003
 sidewalk-0.003
 dictionary-0.003
 piano-0.003
Token,
Token ID11
Feature activation+8.02e-03

Pos logit contributions
 nearer+0.002
 closer+0.002
 afloat+0.002
eled+0.002
 submerged+0.002
Neg logit contributions
 Math-0.003
 sidewalk-0.002
 automobiles-0.002
 dictionary-0.002
 books-0.002
Token "
Token ID366
Feature activation-1.21e-02

Pos logit contributions
 Math+0.011
 automobiles+0.010
 sidewalk+0.010
 books+0.010
 dictionary+0.010
Neg logit contributions
 closer-0.013
 nearer-0.013
 hes-0.012
 stro-0.012
apa-0.012
TokenNo
Token ID2949
Feature activation-1.46e-02

Pos logit contributions
 nearer+0.020
eled+0.019
 closer+0.019
Wal+0.018
 submerged+0.018
Neg logit contributions
 Math-0.018
 books-0.017
 dictionary-0.017
 automobiles-0.016
 computer-0.016
Token cars
Token ID5006
Feature activation+4.13e-02

Pos logit contributions
 hes+0.010
 closer+0.010
 nearer+0.010
 deeper+0.008
 fever+0.007
Neg logit contributions
aces-0.032
 sidewalk-0.031
Ju-0.031
nam-0.030
 Math-0.030
Token!
Token ID0
Feature activation+1.11e-02

Pos logit contributions
 Math+0.069
 sidewalk+0.066
 pavement+0.064
 artist+0.063
 automobiles+0.062
Neg logit contributions
 closer-0.051
pt-0.048
 nearer-0.045
 exhausted-0.043
ung-0.042
Token Only
Token ID5514
Feature activation+3.82e-03

Pos logit contributions
 books+0.014
 dictionary+0.014
 games+0.014
 sidewalk+0.013
 exam+0.013
Neg logit contributions
ung-0.018
apa-0.018
ucks-0.017
eled-0.016
 nearer-0.016
Token marriage
Token ID4845
Feature activation+2.02e-02

Pos logit contributions
 Math+0.005
 dictionary+0.005
Deal+0.004
 sidewalk+0.004
scope+0.004
Neg logit contributions
 closer-0.006
 nearer-0.006
 deeper-0.006
 soon-0.006
 afloat-0.006
Token!"
Token ID2474
Feature activation-1.18e-04

Pos logit contributions
 Math+0.032
 sidewalk+0.030
 automobiles+0.028
 pavement+0.028
 dictionary+0.028
Neg logit contributions
 closer-0.028
 nearer-0.027
 soon-0.025
ung-0.024
 afloat-0.024
Token Tom
Token ID4186
Feature activation-2.92e-04

Pos logit contributions
 nearer+0.000
 closer+0.000
apa+0.000
 afloat+0.000
eled+0.000
Neg logit contributions
 Math-0.000
 dictionary-0.000
 books-0.000
 piano-0.000
 automobiles-0.000
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 Math+0.012
 dictionary+0.011
 automobiles+0.011
 books+0.011
 sidewalk+0.011
Neg logit contributions
 nearer-0.009
 afloat-0.008
 closer-0.008
eled-0.008
 submerged-0.008
Token
Token ID198
Feature activation+4.19e-03

Pos logit contributions
 nearer+0.019
 afloat+0.019
eled+0.019
 submerged+0.019
 numb+0.018
Neg logit contributions
 Math-0.015
 books-0.012
 dictionary-0.012
 sidewalk-0.012
 kids-0.012
TokenOne
Token ID3198
Feature activation+5.41e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
 dictionary+0.007
 sidewalk+0.007
 books+0.007
Neg logit contributions
 nearer-0.006
 closer-0.005
eled-0.005
 afloat-0.005
 submerged-0.005
Token of
Token ID286
Feature activation-6.36e-03

Pos logit contributions
 Math+0.007
 automobiles+0.006
 sidewalk+0.006
aces+0.006
 dictionary+0.006
Neg logit contributions
 closer-0.010
 nearer-0.010
eled-0.009
 afloat-0.009
 submerged-0.009
Token her
Token ID607
Feature activation-2.15e-04

Pos logit contributions
 nearer+0.012
 afloat+0.012
eled+0.012
 closer+0.012
 submerged+0.011
Neg logit contributions
 Math-0.008
 dictionary-0.007
 automobiles-0.007
 books-0.007
 sidewalk-0.007
Token favourite
Token ID12507
Feature activation+4.12e-02

Pos logit contributions
 nearer+0.000
 afloat+0.000
eled+0.000
 closer+0.000
 submerged+0.000
Neg logit contributions
 Math-0.000
 dictionary-0.000
 books-0.000
 automobiles-0.000
 sidewalk-0.000
Token things
Token ID1243
Feature activation+1.29e-02

Pos logit contributions
 Math+0.053
 automobiles+0.044
 sidewalk+0.044
issors+0.043
aces+0.040
Neg logit contributions
 nearer-0.075
 closer-0.074
eled-0.073
 hes-0.070
 submerged-0.067
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
 Math+0.021
 automobiles+0.018
 sidewalk+0.017
 dictionary+0.017
 books+0.017
Neg logit contributions
 nearer-0.020
 closer-0.019
 submerged-0.019
 afloat-0.018
eled-0.018
Token do
Token ID466
Feature activation-1.97e-02

Pos logit contributions
 nearer+0.023
 closer+0.022
 hes+0.020
 afloat+0.020
 submerged+0.020
Neg logit contributions
 Math-0.015
 automobiles-0.014
 dictionary-0.014
 sidewalk-0.013
aces-0.013
Token was
Token ID373
Feature activation+4.58e-03

Pos logit contributions
 nearer+0.029
 closer+0.028
 submerged+0.027
 afloat+0.027
 hes+0.027
Neg logit contributions
 Math-0.032
 dictionary-0.029
 automobiles-0.028
 sidewalk-0.027
 piano-0.027
Token to
Token ID284
Feature activation-1.09e-03

Pos logit contributions
 Math+0.007
 automobiles+0.006
 sidewalk+0.006
 dictionary+0.006
 books+0.006
Neg logit contributions
 nearer-0.007
 closer-0.007
 afloat-0.007
 submerged-0.007
 hes-0.006
Token fair
Token ID3148
Feature activation-4.36e-03

Pos logit contributions
 Math+0.012
board+0.010
 sidewalk+0.010
 dictionary+0.010
 automobiles+0.010
Neg logit contributions
 closer-0.011
 nearer-0.010
 afloat-0.010
 deeper-0.009
 submerged-0.009
Token.
Token ID13
Feature activation+7.04e-03

Pos logit contributions
 nearer+0.007
 closer+0.007
 afloat+0.007
 deeper+0.007
 submerged+0.007
Neg logit contributions
 Math-0.006
 dictionary-0.006
 sidewalk-0.005
 automobiles-0.005
 piano-0.005
Token You
Token ID921
Feature activation+1.09e-02

Pos logit contributions
 sidewalk+0.009
 piano+0.009
 games+0.009
 books+0.009
 exam+0.008
Neg logit contributions
apa-0.011
ung-0.011
 nearer-0.011
 closer-0.011
 stro-0.010
Token have
Token ID423
Feature activation+9.18e-03

Pos logit contributions
 Math+0.017
 sidewalk+0.015
 automobiles+0.015
 pavement+0.015
 dictionary+0.014
Neg logit contributions
 closer-0.017
pt-0.017
 nearer-0.017
 submerged-0.016
 afloat-0.015
Token four
Token ID1440
Feature activation+2.99e-03

Pos logit contributions
 Math+0.015
 sidewalk+0.013
 automobiles+0.013
 dictionary+0.013
 piano+0.012
Neg logit contributions
 nearer-0.014
 closer-0.014
eled-0.013
 submerged-0.013
 afloat-0.013
Token cars
Token ID5006
Feature activation+4.09e-02

Pos logit contributions
 Math+0.004
 dictionary+0.003
 sidewalk+0.003
 piano+0.003
scope+0.003
Neg logit contributions
 nearer-0.005
 closer-0.005
eled-0.005
 afloat-0.005
 hes-0.005
Token and
Token ID290
Feature activation-3.56e-03

Pos logit contributions
 Math+0.071
 sidewalk+0.064
 dictionary+0.063
 pavement+0.060
 automobiles+0.060
Neg logit contributions
 closer-0.054
pt-0.052
 nearer-0.050
 afloat-0.045
 deeper-0.045
Token I
Token ID314
Feature activation+1.12e-02

Pos logit contributions
 closer+0.005
 nearer+0.005
 hes+0.005
 stro+0.005
 deeper+0.005
Neg logit contributions
 Math-0.005
 sidewalk-0.005
 earthquake-0.005
 dictionary-0.005
 exam-0.004
Token have
Token ID423
Feature activation+6.87e-03

Pos logit contributions
 Math+0.015
 automobiles+0.013
 dictionary+0.013
 sidewalk+0.012
 books+0.012
Neg logit contributions
 nearer-0.021
 afloat-0.020
 closer-0.020
eled-0.019
 submerged-0.019
Token three
Token ID1115
Feature activation+6.38e-03

Pos logit contributions
 Math+0.011
 dictionary+0.010
 sidewalk+0.010
 piano+0.010
 automobiles+0.010
Neg logit contributions
 nearer-0.010
 closer-0.010
eled-0.009
 submerged-0.009
 hes-0.009
Token.
Token ID13
Feature activation+6.94e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 sidewalk+0.006
 piano+0.006
 exam+0.006
Neg logit contributions
eled-0.012
 nearer-0.011
 closer-0.011
 hes-0.011
 afloat-0.011
Token you
Token ID345
Feature activation+2.08e-02

Pos logit contributions
 Math+0.020
 sidewalk+0.019
 pavement+0.019
 automobiles+0.017
Ju+0.017
Neg logit contributions
 nearer-0.031
 closer-0.031
 hes-0.028
apa-0.028
pt-0.027
Token want
Token ID765
Feature activation+9.54e-03

Pos logit contributions
 Math+0.031
 automobiles+0.031
 piano+0.029
 computer+0.029
 books+0.028
Neg logit contributions
 nearer-0.032
 closer-0.031
 hes-0.029
pt-0.029
 afloat-0.028
Token and
Token ID290
Feature activation+8.94e-03

Pos logit contributions
 Math+0.015
 sidewalk+0.013
 automobiles+0.013
 piano+0.013
 dictionary+0.013
Neg logit contributions
 nearer-0.015
 closer-0.014
 hes-0.013
apa-0.013
 afloat-0.013
Token not
Token ID407
Feature activation+7.12e-03

Pos logit contributions
 Math+0.013
 automobiles+0.011
board+0.011
 sidewalk+0.011
whe+0.011
Neg logit contributions
 nearer-0.014
 closer-0.014
 hes-0.013
 afloat-0.013
 stro-0.013
Token what
Token ID644
Feature activation+1.74e-02

Pos logit contributions
 Math+0.010
 sidewalk+0.009
 automobiles+0.009
 books+0.009
 piano+0.009
Neg logit contributions
 closer-0.011
 nearer-0.011
 submerged-0.010
 hes-0.010
 afloat-0.010
Token others
Token ID1854
Feature activation+4.09e-02

Pos logit contributions
 Math+0.024
 sidewalk+0.023
 pavement+0.023
 earthquake+0.021
 automobiles+0.021
Neg logit contributions
 nearer-0.027
 closer-0.027
apa-0.025
 hes-0.025
pt-0.023
Token want
Token ID765
Feature activation+6.58e-03

Pos logit contributions
 Math+0.058
 automobiles+0.051
 sidewalk+0.051
 dictionary+0.050
 computer+0.049
Neg logit contributions
 nearer-0.067
 closer-0.066
 afloat-0.062
pt-0.062
 submerged-0.062
Token or
Token ID393
Feature activation-2.28e-03

Pos logit contributions
 Math+0.010
 sidewalk+0.008
 dictionary+0.008
 piano+0.008
 automobiles+0.008
Neg logit contributions
 nearer-0.011
 closer-0.011
apa-0.010
 hes-0.010
 afloat-0.010
Token need
Token ID761
Feature activation+5.30e-03

Pos logit contributions
 nearer+0.004
 closer+0.004
 afloat+0.004
 hes+0.004
 submerged+0.003
Neg logit contributions
 Math-0.003
board-0.003
Ju-0.002
aces-0.002
 sidewalk-0.002
Token.
Token ID13
Feature activation-2.91e-03

Pos logit contributions
 Math+0.008
 sidewalk+0.007
 dictionary+0.007
 piano+0.006
 automobiles+0.006
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.008
 hes-0.008
apa-0.008
Token Being
Token ID11204
Feature activation+1.96e-02

Pos logit contributions
 nearer+0.005
apa+0.005
 closer+0.005
eled+0.005
 afloat+0.005
Neg logit contributions
 Math-0.003
 piano-0.003
 books-0.003
 sidewalk-0.003
 games-0.003
Token air
Token ID1633
Feature activation-5.13e-03

Pos logit contributions
 Math+0.018
 dictionary+0.016
 piano+0.016
 exam+0.015
 books+0.015
Neg logit contributions
 afloat-0.023
 nearer-0.022
 closer-0.021
 submerged-0.021
Wal-0.020
Token with
Token ID351
Feature activation+1.39e-03

Pos logit contributions
 nearer+0.009
eled+0.009
 afloat+0.009
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.007
 books-0.006
 dictionary-0.006
 sidewalk-0.006
 automobiles-0.006
Token a
Token ID257
Feature activation-8.20e-04

Pos logit contributions
 Math+0.002
 dictionary+0.002
 automobiles+0.002
 sidewalk+0.002
 books+0.002
Neg logit contributions
 nearer-0.002
 afloat-0.002
 closer-0.002
eled-0.002
 submerged-0.002
Token loud
Token ID7812
Feature activation+1.39e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
 sidewalk-0.001
Token "
Token ID366
Feature activation+9.31e-03

Pos logit contributions
 Math+0.002
 automobiles+0.002
 dictionary+0.002
 books+0.002
Deal+0.002
Neg logit contributions
 nearer-0.002
 closer-0.002
 hes-0.002
 afloat-0.002
eled-0.002
Tokenwho
Token ID8727
Feature activation+4.09e-02

Pos logit contributions
 Math+0.016
 dictionary+0.015
 books+0.015
 computer+0.015
 sidewalk+0.015
Neg logit contributions
 nearer-0.014
eled-0.013
 closer-0.013
 afloat-0.012
 submerged-0.012
Tokenosh
Token ID3768
Feature activation+3.42e-03

Pos logit contributions
 Math+0.073
 sidewalk+0.068
 automobiles+0.065
 pavement+0.062
 dictionary+0.061
Neg logit contributions
 nearer-0.057
apa-0.056
 closer-0.055
eled-0.052
ung-0.051
Token"
Token ID1
Feature activation+1.89e-02

Pos logit contributions
 Math+0.006
 sidewalk+0.005
 automobiles+0.005
 books+0.005
 dictionary+0.005
Neg logit contributions
 nearer-0.005
 closer-0.005
 afloat-0.005
apa-0.004
fly-0.004
Token sound
Token ID2128
Feature activation-2.34e-03

Pos logit contributions
 Math+0.031
 dictionary+0.028
 piano+0.027
 sidewalk+0.027
 automobiles+0.027
Neg logit contributions
 nearer-0.029
 closer-0.027
 afloat-0.026
 submerged-0.025
eled-0.024
Token.
Token ID13
Feature activation+2.96e-03

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
apa+0.003
 submerged+0.003
Neg logit contributions
 Math-0.004
 dictionary-0.003
 sidewalk-0.003
 automobiles-0.003
 books-0.003
Token Everyone
Token ID11075
Feature activation+2.04e-02

Pos logit contributions
 Math+0.005
 piano+0.004
 dictionary+0.004
 automobiles+0.004
 books+0.004
Neg logit contributions
 nearer-0.005
 closer-0.004
 afloat-0.004
apa-0.004
 hes-0.004
Token didn
Token ID1422
Feature activation+2.22e-02

Pos logit contributions
 pavement+0.006
 sidewalk+0.006
 Math+0.006
 automobiles+0.006
issors+0.005
Neg logit contributions
 closer-0.005
 nearer-0.005
apa-0.005
pt-0.005
 hes-0.005
Token't
Token ID470
Feature activation+9.26e-03

Pos logit contributions
 Math+0.040
Deal+0.037
 sidewalk+0.037
 books+0.037
 automobiles+0.037
Neg logit contributions
 nearer-0.027
 closer-0.026
 submerged-0.023
 hes-0.023
 afloat-0.022
Token Sarah
Token ID10490
Feature activation+2.37e-02

Pos logit contributions
 Math+0.013
aces+0.012
Deal+0.012
 books+0.011
 automobiles+0.011
Neg logit contributions
 closer-0.015
 nearer-0.015
 afloat-0.014
 hes-0.014
 submerged-0.014
Token answer
Token ID3280
Feature activation+6.52e-03

Pos logit contributions
 Math+0.039
 automobiles+0.034
 sidewalk+0.034
 games+0.033
 books+0.033
Neg logit contributions
 closer-0.034
 nearer-0.033
pt-0.030
 hes-0.030
 fever-0.030
Token my
Token ID616
Feature activation-5.42e-03

Pos logit contributions
 Math+0.011
 sidewalk+0.010
 automobiles+0.009
Deal+0.009
 pavement+0.009
Neg logit contributions
 nearer-0.010
 closer-0.010
 afloat-0.008
 hes-0.008
apa-0.008
Token questions
Token ID2683
Feature activation+4.09e-02

Pos logit contributions
 nearer+0.009
 closer+0.009
 hes+0.009
apa+0.009
 fever+0.008
Neg logit contributions
 Math-0.008
 automobiles-0.007
whe-0.006
 sidewalk-0.006
Ju-0.006
Token?"
Token ID1701
Feature activation+2.46e-03

Pos logit contributions
 Math+0.070
 sidewalk+0.063
 automobiles+0.063
 dictionary+0.060
 pavement+0.059
Neg logit contributions
 nearer-0.056
 closer-0.056
 hes-0.049
 afloat-0.048
pt-0.046
Token she
Token ID673
Feature activation+1.38e-02

Pos logit contributions
 Math+0.004
 automobiles+0.004
 books+0.004
 dictionary+0.004
 sidewalk+0.004
Neg logit contributions
 nearer-0.003
 closer-0.003
eled-0.003
 hes-0.003
 stro-0.003
Token asked
Token ID1965
Feature activation+1.91e-02

Pos logit contributions
 Math+0.025
 automobiles+0.023
 dictionary+0.023
 books+0.023
 sidewalk+0.023
Neg logit contributions
 nearer-0.018
eled-0.017
 closer-0.017
 afloat-0.017
 hes-0.016
Token.
Token ID13
Feature activation+4.87e-03

Pos logit contributions
 Math+0.030
 sidewalk+0.026
 dictionary+0.026
 automobiles+0.025
 mine+0.025
Neg logit contributions
 closer-0.030
 nearer-0.030
 hes-0.029
eled-0.028
pt-0.027
Token
Token ID220
Feature activation+5.91e-03

Pos logit contributions
 Math+0.007
 dictionary+0.007
 books+0.007
 automobiles+0.007
 piano+0.006
Neg logit contributions
 nearer-0.007
 closer-0.007
eled-0.007
apa-0.007
 hes-0.007
Tokenila
Token ID10102
Feature activation-8.73e-03

Pos logit contributions
 nearer+0.015
igi+0.015
iri+0.014
 closer+0.013
apa+0.013
Neg logit contributions
 Math-0.030
 automobiles-0.028
 sidewalk-0.028
 books-0.028
 dictionary-0.028
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
 closer+0.008
fly+0.008
 soon+0.008
 nearer+0.007
pt+0.007
Neg logit contributions
 Math-0.019
issors-0.017
 sidewalk-0.017
 exam-0.016
 automobiles-0.016
Token Do
Token ID2141
Feature activation-4.55e-04

Pos logit contributions
 sidewalk+0.001
 automobiles+0.001
 piano+0.001
 Math+0.001
 earthquake+0.001
Neg logit contributions
 nearer-0.002
 closer-0.002
 stro-0.002
fly-0.002
 hes-0.002
Token you
Token ID345
Feature activation+1.02e-02

Pos logit contributions
 closer+0.001
 nearer+0.001
 hes+0.001
iri+0.001
apa+0.001
Neg logit contributions
 pavement-0.001
 piano-0.001
 Math-0.001
 sidewalk-0.000
 automobiles-0.000
Token like
Token ID588
Feature activation+9.66e-03

Pos logit contributions
 Math+0.014
 automobiles+0.012
 dictionary+0.012
 sidewalk+0.011
 books+0.011
Neg logit contributions
 nearer-0.019
 afloat-0.018
 closer-0.018
eled-0.018
 submerged-0.017
Token trucks
Token ID13960
Feature activation+4.08e-02

Pos logit contributions
 Math+0.013
Deal+0.012
 sidewalk+0.012
arrass+0.012
 earthquake+0.012
Neg logit contributions
 closer-0.015
 nearer-0.015
 hes-0.014
 asleep-0.013
 numb-0.013
Token?"
Token ID1701
Feature activation-2.59e-03

Pos logit contributions
 Math+0.073
arrass+0.069
 pavement+0.068
 dictionary+0.068
 sidewalk+0.068
Neg logit contributions
 closer-0.047
 nearer-0.042
 afloat-0.039
 soon-0.039
 numb-0.037
Token
Token ID198
Feature activation-2.36e-03

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
 hes+0.003
eled+0.003
Neg logit contributions
 Math-0.005
 dictionary-0.005
 automobiles-0.005
 piano-0.005
 mine-0.004
Token
Token ID198
Feature activation+3.39e-03

Pos logit contributions
 nearer+0.004
 afloat+0.004
eled+0.004
 submerged+0.003
 closer+0.003
Neg logit contributions
 Math-0.004
 books-0.003
 sidewalk-0.003
 automobiles-0.003
 kids-0.003
TokenL
Token ID43
Feature activation-6.44e-03

Pos logit contributions
 Math+0.004
 dictionary+0.004
 automobiles+0.004
 earthquake+0.004
 books+0.004
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
 submerged-0.006
eled-0.006
Tokenila
Token ID10102
Feature activation+9.77e-03

Pos logit contributions
 nearer+0.008
 hes+0.007
eled+0.007
 closer+0.007
igi+0.007
Neg logit contributions
 Math-0.012
 automobiles-0.012
 dictionary-0.012
 piano-0.011
 books-0.011
Token the
Token ID262
Feature activation+7.92e-03

Pos logit contributions
 nearer+0.010
 afloat+0.010
eled+0.009
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.007
 dictionary-0.006
 books-0.006
 automobiles-0.006
 sidewalk-0.006
Token park
Token ID3952
Feature activation-9.73e-05

Pos logit contributions
 Math+0.011
 automobiles+0.010
 dictionary+0.009
 books+0.009
 sidewalk+0.009
Neg logit contributions
 nearer-0.014
 closer-0.013
 afloat-0.013
eled-0.013
 submerged-0.013
Token playing
Token ID2712
Feature activation+9.55e-03

Pos logit contributions
 nearer+0.000
 afloat+0.000
eled+0.000
 closer+0.000
apa+0.000
Neg logit contributions
 Math-0.000
 sidewalk-0.000
 books-0.000
 dictionary-0.000
 automobiles-0.000
Token with
Token ID351
Feature activation+8.96e-03

Pos logit contributions
 Math+0.017
 automobiles+0.015
 dictionary+0.015
 sidewalk+0.015
issors+0.014
Neg logit contributions
 nearer-0.013
 closer-0.013
 deeper-0.012
 afloat-0.012
 hes-0.011
Token her
Token ID607
Feature activation+6.87e-03

Pos logit contributions
 Math+0.015
 dictionary+0.013
 sidewalk+0.013
 automobiles+0.013
 piano+0.013
Neg logit contributions
 nearer-0.013
 closer-0.012
eled-0.012
 afloat-0.012
 submerged-0.012
Token favourite
Token ID12507
Feature activation+4.07e-02

Pos logit contributions
 Math+0.010
 sidewalk+0.008
issors+0.008
 pavement+0.008
 guys+0.008
Neg logit contributions
 closer-0.011
 nearer-0.011
 afloat-0.011
eled-0.010
 bur-0.010
Token t
Token ID256
Feature activation-1.80e-03

Pos logit contributions
 Math+0.053
issors+0.044
Deal+0.040
aces+0.040
whe+0.039
Neg logit contributions
 closer-0.074
eled-0.072
 nearer-0.072
 bur-0.067
 hes-0.065
Tokeneddy
Token ID21874
Feature activation-1.31e-02

Pos logit contributions
 nearer+0.002
 afloat+0.002
 closer+0.002
 hes+0.002
 submerged+0.002
Neg logit contributions
 Math-0.003
 automobiles-0.003
 books-0.003
 dictionary-0.003
 sidewalk-0.003
Token.
Token ID13
Feature activation+8.82e-03

Pos logit contributions
eled+0.018
 closer+0.018
 nearer+0.018
pt+0.017
 afloat+0.017
Neg logit contributions
 Math-0.023
 sidewalk-0.020
issors-0.019
 pavement-0.019
 law-0.019
Token Suddenly
Token ID24975
Feature activation-1.03e-02

Pos logit contributions
 Math+0.014
 dictionary+0.013
 piano+0.013
 automobiles+0.013
 books+0.012
Neg logit contributions
 nearer-0.012
 closer-0.012
eled-0.012
 submerged-0.012
uttered-0.011
Token,
Token ID11
Feature activation-1.90e-03

Pos logit contributions
 afloat+0.021
 nearer+0.020
eled+0.020
igi+0.020
 submerged+0.019
Neg logit contributions
 Math-0.012
 books-0.011
 kids-0.010
 piano-0.009
 dictionary-0.009
TokenWhy
Token ID5195
Feature activation+6.75e-03

Pos logit contributions
 Math+0.017
 dictionary+0.016
 books+0.016
 mine+0.015
 automobiles+0.015
Neg logit contributions
 nearer-0.020
eled-0.019
 afloat-0.019
 closer-0.019
 submerged-0.019
Token did
Token ID750
Feature activation+2.40e-02

Pos logit contributions
 sidewalk+0.010
 pavement+0.010
 Math+0.010
 automobiles+0.009
 dictionary+0.009
Neg logit contributions
pt-0.010
 closer-0.009
 nearer-0.009
apa-0.009
ung-0.009
Token you
Token ID345
Feature activation+1.19e-02

Pos logit contributions
 pavement+0.033
board+0.033
aces+0.032
 sidewalk+0.032
Deal+0.031
Neg logit contributions
 closer-0.035
 nearer-0.033
 pap-0.031
 distracted-0.031
 hes-0.031
Token race
Token ID3234
Feature activation+9.36e-03

Pos logit contributions
 Math+0.021
 automobiles+0.019
 dictionary+0.019
aces+0.019
 books+0.018
Neg logit contributions
 closer-0.017
 nearer-0.016
 afloat-0.016
 submerged-0.016
pt-0.015
Token your
Token ID534
Feature activation+4.96e-03

Pos logit contributions
 Math+0.015
 dictionary+0.014
Deal+0.013
 sidewalk+0.013
aces+0.013
Neg logit contributions
 nearer-0.014
 closer-0.014
 afloat-0.013
 deeper-0.012
 submerged-0.012
Token cars
Token ID5006
Feature activation+4.06e-02

Pos logit contributions
 Math+0.006
Deal+0.005
 dictionary+0.005
Ju+0.005
scope+0.004
Neg logit contributions
 afloat-0.009
 closer-0.009
 numb-0.009
 nearer-0.009
 drown-0.009
Token on
Token ID319
Feature activation+6.87e-03

Pos logit contributions
 Math+0.070
 dictionary+0.063
 sidewalk+0.059
 meanings+0.059
arrass+0.058
Neg logit contributions
 closer-0.056
 nearer-0.055
 afloat-0.052
 submerged-0.047
pt-0.046
Token the
Token ID262
Feature activation-2.66e-03

Pos logit contributions
 Math+0.010
Deal+0.010
 dictionary+0.009
 sidewalk+0.009
arrass+0.009
Neg logit contributions
 nearer-0.010
 afloat-0.010
 closer-0.010
eled-0.009
 submerged-0.009
Token stairs
Token ID16046
Feature activation+1.22e-02

Pos logit contributions
 numb+0.004
 sinking+0.004
 deeper+0.004
 afloat+0.004
 asleep+0.004
Neg logit contributions
 Math-0.004
Deal-0.004
arrass-0.003
 meanings-0.003
ems-0.003
Token?
Token ID30
Feature activation+9.66e-03

Pos logit contributions
 Math+0.021
 dictionary+0.018
 sidewalk+0.018
 automobiles+0.018
 books+0.017
Neg logit contributions
 nearer-0.017
 closer-0.017
 afloat-0.016
 submerged-0.015
eled-0.015
Token That
Token ID1320
Feature activation+3.02e-03

Pos logit contributions
 games+0.011
 game+0.011
 books+0.011
 sidewalk+0.011
 dictionary+0.011
Neg logit contributions
 nearer-0.016
ung-0.015
 closer-0.015
apa-0.015
 stro-0.015
Token.
Token ID13
Feature activation+6.69e-05

Pos logit contributions
 Math+0.007
 books+0.006
 automobiles+0.006
 dictionary+0.006
 sidewalk+0.005
Neg logit contributions
 nearer-0.016
eled-0.015
 afloat-0.015
 submerged-0.015
igi-0.015
Token "
Token ID366
Feature activation+6.26e-03

Pos logit contributions
 Math+0.000
 automobiles+0.000
 dictionary+0.000
 sidewalk+0.000
 piano+0.000
Neg logit contributions
 nearer-0.000
 closer-0.000
 afloat-0.000
eled-0.000
 submerged-0.000
TokenDo
Token ID5211
Feature activation-2.19e-03

Pos logit contributions
 Math+0.010
 dictionary+0.009
 books+0.009
 automobiles+0.009
 sidewalk+0.009
Neg logit contributions
 nearer-0.009
 closer-0.009
eled-0.009
 afloat-0.009
 submerged-0.008
Token you
Token ID345
Feature activation+9.33e-03

Pos logit contributions
 closer+0.004
 nearer+0.004
 hes+0.004
iri+0.004
 deeper+0.003
Neg logit contributions
 Math-0.003
 piano-0.002
 pavement-0.002
 dictionary-0.002
 computer-0.002
Token like
Token ID588
Feature activation+8.43e-03

Pos logit contributions
 Math+0.013
 automobiles+0.012
 dictionary+0.012
 sidewalk+0.011
 piano+0.011
Neg logit contributions
 nearer-0.016
 closer-0.015
 afloat-0.015
eled-0.015
 submerged-0.015
Token trucks
Token ID13960
Feature activation+4.04e-02

Pos logit contributions
 Math+0.012
Deal+0.010
aces+0.010
 dictionary+0.010
 sidewalk+0.010
Neg logit contributions
 nearer-0.013
 closer-0.013
 hes-0.012
eled-0.012
 submerged-0.012
Token?"
Token ID1701
Feature activation-5.69e-03

Pos logit contributions
 Math+0.072
 dictionary+0.065
 sidewalk+0.064
arrass+0.064
 pavement+0.063
Neg logit contributions
 closer-0.053
 nearer-0.050
 afloat-0.048
 submerged-0.042
 soon-0.041
Token
Token ID198
Feature activation-7.81e-03

Pos logit contributions
 nearer+0.008
 afloat+0.008
eled+0.008
 submerged+0.008
 closer+0.008
Neg logit contributions
 Math-0.010
 dictionary-0.009
 automobiles-0.009
 books-0.008
 sidewalk-0.008
Token
Token ID198
Feature activation-3.76e-03

Pos logit contributions
 nearer+0.013
 afloat+0.013
eled+0.013
 submerged+0.012
 hes+0.012
Neg logit contributions
 Math-0.012
 automobiles-0.010
 sidewalk-0.010
 kids-0.010
 books-0.010
Token"
Token ID1
Feature activation+1.42e-03

Pos logit contributions
 nearer+0.007
 closer+0.006
 afloat+0.006
eled+0.006
 submerged+0.006
Neg logit contributions
 Math-0.005
 dictionary-0.005
 piano-0.005
 exam-0.005
 books-0.005
TokenYes
Token ID5297
Feature activation-1.07e-02

Pos logit contributions
 Math+0.002
 dictionary+0.002
 earthquake+0.002
 books+0.002
 automobiles+0.002
Neg logit contributions
 nearer-0.002
 closer-0.002
 afloat-0.002
 submerged-0.002
eled-0.002
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
 nearer+0.039
 afloat+0.037
eled+0.037
 submerged+0.036
 closer+0.036
Neg logit contributions
 Math-0.020
 dictionary-0.016
 automobiles-0.016
 books-0.016
 sidewalk-0.015
Token Max
Token ID5436
Feature activation-4.93e-03

Pos logit contributions
 Math+0.008
aces+0.007
 test+0.007
Ju+0.007
 sidewalk+0.006
Neg logit contributions
 nearer-0.014
 closer-0.014
 afloat-0.013
 numb-0.013
 submerged-0.013
Token had
Token ID550
Feature activation+9.64e-03

Pos logit contributions
 nearer+0.010
 afloat+0.009
eled+0.009
 submerged+0.009
apa+0.009
Neg logit contributions
 Math-0.006
 dictionary-0.005
 books-0.005
 automobiles-0.005
 kids-0.005
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
 Math+0.015
 books+0.013
 dictionary+0.012
 automobiles+0.012
 sidewalk+0.012
Neg logit contributions
 nearer-0.016
 afloat-0.016
eled-0.016
 closer-0.015
apa-0.015
Token w
Token ID266
Feature activation+3.52e-03

Pos logit contributions
 Math+0.011
 dictionary+0.011
 books+0.010
 piano+0.010
 computer+0.010
Neg logit contributions
eled-0.023
 nearer-0.023
apa-0.022
 afloat-0.022
 hes-0.021
Tokenagging
Token ID16406
Feature activation+4.02e-02

Pos logit contributions
 Math+0.005
 automobiles+0.004
 dictionary+0.004
 sidewalk+0.004
 books+0.004
Neg logit contributions
 nearer-0.006
 afloat-0.006
eled-0.006
 closer-0.006
 submerged-0.006
Token tail
Token ID7894
Feature activation-1.04e-03

Pos logit contributions
 Math+0.050
 dictionary+0.042
 books+0.042
 automobiles+0.042
 sidewalk+0.042
Neg logit contributions
 nearer-0.077
 afloat-0.074
eled-0.072
 closer-0.072
 submerged-0.071
Token and
Token ID290
Feature activation+1.26e-02

Pos logit contributions
 nearer+0.002
 afloat+0.002
eled+0.002
apa+0.002
 closer+0.002
Neg logit contributions
 Math-0.001
 dictionary-0.001
 books-0.001
 automobiles-0.001
 piano-0.001
Token a
Token ID257
Feature activation+1.23e-02

Pos logit contributions
 Math+0.018
 automobiles+0.016
 dictionary+0.016
 sidewalk+0.015
 piano+0.015
Neg logit contributions
 nearer-0.021
 closer-0.020
 afloat-0.020
eled-0.020
 submerged-0.020
Token shiny
Token ID22441
Feature activation+2.21e-02

Pos logit contributions
 Math+0.016
 dictionary+0.013
 books+0.013
 sidewalk+0.013
 automobiles+0.013
Neg logit contributions
 nearer-0.024
eled-0.023
 afloat-0.022
 closer-0.022
apa-0.022
Token collar
Token ID19908
Feature activation+1.55e-02

Pos logit contributions
 Math+0.030
 automobiles+0.025
 sidewalk+0.024
 dictionary+0.024
 books+0.024
Neg logit contributions
 nearer-0.040
 closer-0.039
 afloat-0.038
eled-0.038
 submerged-0.037
Token always
Token ID1464
Feature activation-6.02e-03

Pos logit contributions
 Math+0.000
 automobiles+0.000
 dictionary+0.000
 sidewalk+0.000
 books+0.000
Neg logit contributions
 nearer-0.000
 closer-0.000
 afloat-0.000
eled-0.000
 submerged-0.000
Token acted
Token ID13134
Feature activation-4.42e-03

Pos logit contributions
 nearer+0.009
 closer+0.008
eled+0.008
 afloat+0.008
 submerged+0.008
Neg logit contributions
 Math-0.010
 automobiles-0.009
 sidewalk-0.009
 dictionary-0.009
 books-0.009
Token silly
Token ID14397
Feature activation-2.52e-04

Pos logit contributions
 nearer+0.008
 afloat+0.007
 closer+0.007
eled+0.007
 submerged+0.007
Neg logit contributions
 Math-0.006
 dictionary-0.006
 automobiles-0.006
 sidewalk-0.005
 books-0.005
Token and
Token ID290
Feature activation+1.24e-03

Pos logit contributions
 nearer+0.000
 closer+0.000
 hes+0.000
 afloat+0.000
pt+0.000
Neg logit contributions
 Math-0.000
 sidewalk-0.000
 dictionary-0.000
Ju-0.000
 automobiles-0.000
Token mis
Token ID2984
Feature activation+8.13e-03

Pos logit contributions
 Math+0.002
 automobiles+0.002
 dictionary+0.002
 sidewalk+0.002
board+0.002
Neg logit contributions
 closer-0.002
 nearer-0.002
 submerged-0.002
 afloat-0.002
 exhausted-0.002
Tokenchie
Token ID3043
Feature activation+4.02e-02

Pos logit contributions
 Math+0.013
 books+0.012
 dictionary+0.012
 sidewalk+0.012
 automobiles+0.012
Neg logit contributions
 nearer-0.013
 afloat-0.012
eled-0.012
 closer-0.012
 submerged-0.012
Tokenvous
Token ID31222
Feature activation-7.31e-03

Pos logit contributions
 Math+0.070
 sidewalk+0.066
 piano+0.065
 automobiles+0.064
 dictionary+0.064
Neg logit contributions
 nearer-0.051
ull-0.050
eled-0.050
 closer-0.049
 soon-0.047
Token.
Token ID13
Feature activation+3.71e-03

Pos logit contributions
 nearer+0.012
 closer+0.012
 hes+0.012
 afloat+0.011
 submerged+0.011
Neg logit contributions
 Math-0.010
 sidewalk-0.009
 dictionary-0.009
 automobiles-0.009
issors-0.009
Token
Token ID198
Feature activation-3.64e-03

Pos logit contributions
 Math+0.007
 piano+0.006
 dictionary+0.006
 books+0.006
 automobiles+0.006
Neg logit contributions
 nearer-0.005
eled-0.004
 closer-0.004
 afloat-0.004
apa-0.004
Token
Token ID198
Feature activation+6.56e-03

Pos logit contributions
 nearer+0.007
 afloat+0.006
eled+0.006
 submerged+0.006
 numb+0.006
Neg logit contributions
 Math-0.005
 books-0.004
 dictionary-0.004
 sidewalk-0.004
 automobiles-0.004
TokenOne
Token ID3198
Feature activation+3.38e-03

Pos logit contributions
 Math+0.012
 automobiles+0.011
 dictionary+0.011
 books+0.011
 sidewalk+0.011
Neg logit contributions
 nearer-0.009
 closer-0.008
 afloat-0.008
eled-0.008
 submerged-0.008
Token patient
Token ID5827
Feature activation-1.14e-02

Pos logit contributions
 Math+0.009
 dictionary+0.008
 automobiles+0.008
 sidewalk+0.008
 books+0.008
Neg logit contributions
 nearer-0.011
 afloat-0.010
 closer-0.010
eled-0.010
 submerged-0.010
Token and
Token ID290
Feature activation+3.91e-03

Pos logit contributions
 nearer+0.021
eled+0.020
 afloat+0.020
 closer+0.019
 submerged+0.019
Neg logit contributions
 Math-0.015
 books-0.013
 automobiles-0.013
 dictionary-0.013
 kids-0.013
Token not
Token ID407
Feature activation-5.87e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 dictionary+0.005
 sidewalk+0.005
 books+0.005
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
 submerged-0.006
 hes-0.006
Token cry
Token ID3960
Feature activation-1.98e-03

Pos logit contributions
 nearer+0.009
 closer+0.009
 afloat+0.009
 submerged+0.009
 numb+0.008
Neg logit contributions
 Math-0.009
 automobiles-0.008
 dictionary-0.008
 books-0.008
 sidewalk-0.008
Token when
Token ID618
Feature activation+8.12e-03

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
 submerged+0.003
apa+0.003
Neg logit contributions
 Math-0.003
 dictionary-0.003
 sidewalk-0.003
 books-0.003
 automobiles-0.003
Token things
Token ID1243
Feature activation+4.01e-02

Pos logit contributions
 Math+0.010
 dictionary+0.008
 automobiles+0.008
 sidewalk+0.008
 books+0.008
Neg logit contributions
 nearer-0.016
 closer-0.015
 afloat-0.015
eled-0.015
 submerged-0.015
Token don
Token ID836
Feature activation+1.01e-02

Pos logit contributions
 Math+0.059
 dictionary+0.052
 automobiles+0.051
 books+0.049
 sidewalk+0.048
Neg logit contributions
 nearer-0.066
 closer-0.064
 afloat-0.063
 submerged-0.062
pt-0.061
Token't
Token ID470
Feature activation+3.18e-03

Pos logit contributions
 Math+0.016
 dictionary+0.014
 books+0.014
 automobiles+0.014
 sidewalk+0.014
Neg logit contributions
 nearer-0.016
 afloat-0.015
 closer-0.015
eled-0.015
 submerged-0.014
Token go
Token ID467
Feature activation-6.06e-04

Pos logit contributions
 Math+0.005
 dictionary+0.004
 automobiles+0.004
 books+0.004
 sidewalk+0.004
Neg logit contributions
 nearer-0.005
 closer-0.005
 afloat-0.005
 submerged-0.005
 numb-0.005
Token as
Token ID355
Feature activation-4.32e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
eled+0.001
 closer+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
 sidewalk-0.001
Token planned
Token ID6027
Feature activation-6.58e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
eled+0.006
 closer+0.006
 submerged+0.006
Neg logit contributions
 Math-0.007
 dictionary-0.007
 automobiles-0.007
 books-0.006
 sidewalk-0.006
Token Jack
Token ID3619
Feature activation-5.22e-03

Pos logit contributions
 Math+0.017
 books+0.014
 dictionary+0.014
 sidewalk+0.014
 kids+0.014
Neg logit contributions
 nearer-0.025
 afloat-0.024
eled-0.024
apa-0.023
 submerged-0.023
Token was
Token ID373
Feature activation+7.71e-03

Pos logit contributions
 nearer+0.011
 afloat+0.010
 numb+0.010
 closer+0.010
igi+0.010
Neg logit contributions
 Math-0.006
 kids-0.005
 books-0.005
 sidewalk-0.005
 dictionary-0.005
Token playing
Token ID2712
Feature activation+1.21e-02

Pos logit contributions
 Math+0.008
 kids+0.007
 piano+0.007
 computer+0.006
 books+0.006
Neg logit contributions
 nearer-0.017
igi-0.017
apa-0.016
 afloat-0.016
iri-0.016
Token with
Token ID351
Feature activation+1.48e-02

Pos logit contributions
 Math+0.019
 automobiles+0.017
 dictionary+0.017
 sidewalk+0.016
 books+0.016
Neg logit contributions
 nearer-0.019
 closer-0.018
 afloat-0.018
 submerged-0.017
eled-0.017
Token his
Token ID465
Feature activation+1.26e-02

Pos logit contributions
 Math+0.025
 dictionary+0.022
 automobiles+0.021
 sidewalk+0.021
 books+0.021
Neg logit contributions
 nearer-0.022
 afloat-0.021
eled-0.021
 closer-0.021
 submerged-0.020
Token favourite
Token ID12507
Feature activation+4.00e-02

Pos logit contributions
 Math+0.014
 dictionary+0.011
 sidewalk+0.011
 automobiles+0.011
 books+0.011
Neg logit contributions
 nearer-0.025
 closer-0.025
 afloat-0.024
eled-0.024
 submerged-0.023
Token car
Token ID1097
Feature activation+1.56e-02

Pos logit contributions
 Math+0.051
issors+0.039
Deal+0.039
aces+0.038
 automobiles+0.037
Neg logit contributions
 closer-0.074
eled-0.072
 nearer-0.072
 afloat-0.069
 submerged-0.068
Token in
Token ID287
Feature activation+3.45e-03

Pos logit contributions
 Math+0.029
 dictionary+0.024
 meanings+0.024
Ju+0.024
Deal+0.024
Neg logit contributions
 closer-0.019
 afloat-0.018
eled-0.018
 submerged-0.018
 nearer-0.018
Token his
Token ID465
Feature activation+2.32e-02

Pos logit contributions
 Math+0.006
 dictionary+0.005
Deal+0.005
 automobiles+0.005
 sidewalk+0.005
Neg logit contributions
 nearer-0.005
eled-0.005
 closer-0.005
 afloat-0.004
 submerged-0.004
Token bedroom
Token ID14043
Feature activation-2.48e-03

Pos logit contributions
 Math+0.026
 automobiles+0.020
 dictionary+0.020
 sidewalk+0.020
 books+0.020
Neg logit contributions
 nearer-0.048
 afloat-0.046
eled-0.046
 closer-0.046
 submerged-0.045
Token.
Token ID13
Feature activation-1.54e-03

Pos logit contributions
 submerged+0.004
eled+0.004
 nearer+0.004
 afloat+0.004
 closer+0.003
Neg logit contributions
 Math-0.004
 dictionary-0.003
 automobiles-0.003
issors-0.003
 mine-0.003
Token sorry
Token ID7926
Feature activation-2.53e-03

Pos logit contributions
 closer+0.005
 nearer+0.005
apa+0.004
pt+0.004
 hes+0.004
Neg logit contributions
 Math-0.011
 sidewalk-0.011
 dictionary-0.010
 automobiles-0.010
 pavement-0.010
Token.
Token ID13
Feature activation+3.64e-03

Pos logit contributions
 nearer+0.004
 closer+0.004
apa+0.003
pt+0.003
 afloat+0.003
Neg logit contributions
 Math-0.004
 dictionary-0.004
 automobiles-0.004
 sidewalk-0.004
 piano-0.004
Token And
Token ID843
Feature activation+5.16e-03

Pos logit contributions
 game+0.004
 games+0.004
 books+0.004
 sidewalk+0.004
 exam+0.004
Neg logit contributions
ung-0.007
apa-0.006
ucks-0.006
 stro-0.006
ectar-0.006
Token no
Token ID645
Feature activation+4.92e-03

Pos logit contributions
 automobiles+0.006
 Math+0.006
 sidewalk+0.006
 dictionary+0.006
scope+0.006
Neg logit contributions
 closer-0.009
 nearer-0.009
 hes-0.008
 soon-0.008
apa-0.008
Token more
Token ID517
Feature activation+1.01e-02

Pos logit contributions
 Math+0.007
aces+0.006
scope+0.006
 sidewalk+0.006
 dictionary+0.006
Neg logit contributions
 closer-0.008
 nearer-0.008
eled-0.007
 deeper-0.007
 afloat-0.007
Token cars
Token ID5006
Feature activation+4.00e-02

Pos logit contributions
 Math+0.015
scope+0.012
Deal+0.012
 dictionary+0.012
Ju+0.012
Neg logit contributions
 nearer-0.016
 closer-0.015
 afloat-0.015
 submerged-0.015
 numb-0.015
Token for
Token ID329
Feature activation+6.54e-03

Pos logit contributions
 Math+0.066
 dictionary+0.057
 sidewalk+0.053
 automobiles+0.053
Ju+0.051
Neg logit contributions
 closer-0.059
 nearer-0.058
pt-0.057
 afloat-0.055
 submerged-0.055
Token today
Token ID1909
Feature activation+9.59e-03

Pos logit contributions
 Math+0.009
 dictionary+0.008
board+0.007
 automobiles+0.007
 piano+0.007
Neg logit contributions
 nearer-0.011
 closer-0.011
 hes-0.011
 submerged-0.010
 afloat-0.010
Token."
Token ID526
Feature activation-6.61e-03

Pos logit contributions
 Math+0.014
 dictionary+0.012
 automobiles+0.012
 sidewalk+0.012
 piano+0.012
Neg logit contributions
 nearer-0.015
 closer-0.015
 afloat-0.015
 submerged-0.015
apa-0.015
Token
Token ID198
Feature activation-4.66e-04

Pos logit contributions
 nearer+0.010
 afloat+0.010
eled+0.010
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.011
 dictionary-0.010
 sidewalk-0.009
 automobiles-0.009
 books-0.009
Token
Token ID198
Feature activation+4.56e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
eled+0.001
 submerged+0.001
 closer+0.001
Neg logit contributions
 Math-0.001
 sidewalk-0.001
 kids-0.001
 books-0.001
 dictionary-0.001
Token ask
Token ID1265
Feature activation-2.10e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
 dictionary+0.007
 sidewalk+0.007
 books+0.007
Neg logit contributions
 nearer-0.009
 closer-0.008
 afloat-0.008
eled-0.008
 hes-0.008
Token his
Token ID465
Feature activation-8.92e-03

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
eled+0.003
 hes+0.003
Neg logit contributions
 Math-0.004
 sidewalk-0.003
 automobiles-0.003
 dictionary-0.003
 piano-0.003
Token neighbour
Token ID12250
Feature activation-5.62e-03

Pos logit contributions
 nearer+0.014
 closer+0.013
 afloat+0.013
eled+0.013
 submerged+0.013
Neg logit contributions
 Math-0.014
 automobiles-0.012
 sidewalk-0.012
 dictionary-0.012
 books-0.012
Token,
Token ID11
Feature activation+3.13e-03

Pos logit contributions
 nearer+0.008
 closer+0.008
 afloat+0.007
 hes+0.007
apa+0.007
Neg logit contributions
 Math-0.009
 sidewalk-0.008
 automobiles-0.008
 dictionary-0.008
 piano-0.008
Token Mrs
Token ID9074
Feature activation-3.89e-02

Pos logit contributions
 Math+0.004
 automobiles+0.003
 dictionary+0.003
 books+0.003
 sidewalk+0.003
Neg logit contributions
 nearer-0.006
 afloat-0.006
 closer-0.006
eled-0.006
 submerged-0.006
Token Clar
Token ID15420
Feature activation-4.61e-02

Pos logit contributions
 nearer+0.068
 closer+0.066
 afloat+0.065
apa+0.063
eled+0.063
Neg logit contributions
 Math-0.054
 sidewalk-0.046
 piano-0.046
 automobiles-0.045
 dictionary-0.045
Tokenise
Token ID786
Feature activation-6.66e-03

Pos logit contributions
 nearer+0.067
eled+0.065
 hes+0.063
 closer+0.063
apa+0.060
Neg logit contributions
 Math-0.082
 sidewalk-0.072
 dictionary-0.071
 automobiles-0.070
 books-0.069
Token,
Token ID11
Feature activation-9.74e-03

Pos logit contributions
 nearer+0.010
apa+0.009
 closer+0.009
eled+0.009
 hes+0.009
Neg logit contributions
 Math-0.011
 sidewalk-0.010
 games-0.009
 mine-0.009
 automobiles-0.009
Token for
Token ID329
Feature activation-1.04e-02

Pos logit contributions
 nearer+0.015
 closer+0.015
 afloat+0.015
 hes+0.014
eled+0.014
Neg logit contributions
 Math-0.015
 sidewalk-0.013
 automobiles-0.013
 games-0.013
aces-0.013
Token help
Token ID1037
Feature activation+5.55e-03

Pos logit contributions
 nearer+0.017
 afloat+0.016
 closer+0.016
eled+0.016
 submerged+0.016
Neg logit contributions
 Math-0.016
 automobiles-0.014
 dictionary-0.014
 sidewalk-0.014
 books-0.013
Token.
Token ID13
Feature activation+4.50e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 automobiles+0.007
 sidewalk+0.007
 piano+0.007
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.009
 hes-0.008
apa-0.008
Token ready
Token ID3492
Feature activation-6.82e-03

Pos logit contributions
 Math+0.002
 kids+0.002
 mine+0.002
 piano+0.002
 number+0.002
Neg logit contributions
 nearer-0.003
apa-0.003
eled-0.003
 afloat-0.003
pt-0.003
Token to
Token ID284
Feature activation-5.05e-03

Pos logit contributions
eled+0.012
 nearer+0.012
 afloat+0.012
 submerged+0.011
apa+0.011
Neg logit contributions
 Math-0.009
 books-0.008
 dictionary-0.008
 sidewalk-0.008
 computer-0.008
Token play
Token ID711
Feature activation-6.61e-03

Pos logit contributions
 nearer+0.009
 afloat+0.009
 closer+0.009
eled+0.008
 submerged+0.008
Neg logit contributions
 Math-0.007
 automobiles-0.006
 dictionary-0.006
 books-0.006
 sidewalk-0.006
Token and
Token ID290
Feature activation+3.07e-03

Pos logit contributions
 nearer+0.010
 closer+0.010
 afloat+0.009
 hes+0.009
 submerged+0.009
Neg logit contributions
 Math-0.011
 sidewalk-0.010
 dictionary-0.010
 automobiles-0.010
issors-0.009
Token asked
Token ID1965
Feature activation-5.32e-05

Pos logit contributions
 Math+0.005
 automobiles+0.004
 dictionary+0.004
issors+0.004
 piano+0.004
Neg logit contributions
 nearer-0.005
 closer-0.005
 afloat-0.005
 submerged-0.005
 stro-0.005
Token Fl
Token ID1610
Feature activation-4.49e-02

Pos logit contributions
 nearer+0.000
 closer+0.000
 afloat+0.000
eled+0.000
 hes+0.000
Neg logit contributions
 Math-0.000
 sidewalk-0.000
 dictionary-0.000
 automobiles-0.000
 piano-0.000
Tokenoppy
Token ID26696
Feature activation-2.30e-02

Pos logit contributions
 nearer+0.066
 afloat+0.059
 hes+0.058
 fascination+0.058
 closer+0.057
Neg logit contributions
 Math-0.076
 dictionary-0.067
Ju-0.066
 kids-0.066
 piano-0.065
Token to
Token ID284
Feature activation+2.15e-03

Pos logit contributions
eled+0.037
 afloat+0.037
 nearer+0.037
 submerged+0.036
apa+0.034
Neg logit contributions
 Math-0.035
 books-0.031
 dictionary-0.030
 automobiles-0.029
 kids-0.029
Token join
Token ID4654
Feature activation+2.01e-03

Pos logit contributions
 Math+0.004
 automobiles+0.003
 dictionary+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 submerged-0.003
 hes-0.003
Token.
Token ID13
Feature activation+3.99e-03

Pos logit contributions
 Math+0.004
 sidewalk+0.003
 dictionary+0.003
 automobiles+0.003
 piano+0.003
Neg logit contributions
 nearer-0.003
 closer-0.003
eled-0.002
 afloat-0.002
 hes-0.002
Token Fl
Token ID1610
Feature activation-3.02e-02

Pos logit contributions
 Math+0.007
 dictionary+0.006
 automobiles+0.006
 piano+0.006
 books+0.006
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
eled-0.005
 submerged-0.005
Token to
Token ID284
Feature activation+4.41e-03

Pos logit contributions
 Math+0.023
 sidewalk+0.020
 automobiles+0.020
 earthquake+0.020
 Kids+0.020
Neg logit contributions
 closer-0.016
 nearer-0.016
pt-0.014
 submerged-0.014
 asleep-0.013
Token her
Token ID607
Feature activation+1.14e-03

Pos logit contributions
 automobiles+0.009
 Math+0.009
aces+0.009
issors+0.008
 sidewalk+0.008
Neg logit contributions
 closer-0.004
 nearer-0.004
 hes-0.004
 fascination-0.003
 stro-0.003
Token best
Token ID1266
Feature activation+1.75e-02

Pos logit contributions
 Math+0.002
 sidewalk+0.002
 pavement+0.002
 automobiles+0.002
aces+0.002
Neg logit contributions
 closer-0.002
 nearer-0.002
eled-0.001
apa-0.001
 submerged-0.001
Token friend
Token ID1545
Feature activation+1.22e-02

Pos logit contributions
 Math+0.031
 sidewalk+0.028
 automobiles+0.027
 dictionary+0.026
 pavement+0.026
Neg logit contributions
 nearer-0.024
 closer-0.023
eled-0.022
 afloat-0.020
 submerged-0.020
Token,
Token ID11
Feature activation+1.08e-03

Pos logit contributions
 Math+0.019
 sidewalk+0.018
 law+0.017
 pavement+0.017
 earthquake+0.017
Neg logit contributions
 closer-0.018
 nearer-0.018
 hes-0.017
pt-0.017
apa-0.017
Token Deb
Token ID8965
Feature activation-4.36e-02

Pos logit contributions
 Math+0.001
 sidewalk+0.001
 automobiles+0.001
 dictionary+0.001
 earthquake+0.001
Neg logit contributions
 nearer-0.002
 closer-0.002
 hes-0.002
 afloat-0.002
 submerged-0.002
Tokenby
Token ID1525
Feature activation-9.55e-03

Pos logit contributions
 nearer+0.032
 afloat+0.030
 submerged+0.026
 closer+0.025
eled+0.024
Neg logit contributions
 Math-0.106
 dictionary-0.098
 books-0.098
 automobiles-0.097
 piano-0.096
Token.
Token ID13
Feature activation-3.56e-03

Pos logit contributions
 nearer+0.015
 closer+0.015
eled+0.015
apa+0.015
 hes+0.014
Neg logit contributions
 Math-0.015
 sidewalk-0.013
 automobiles-0.012
 dictionary-0.012
 pavement-0.012
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
 nearer+0.005
 closer+0.004
eled+0.004
 afloat+0.004
 hes+0.004
Neg logit contributions
 Math-0.006
 automobiles-0.006
 books-0.006
 dictionary-0.006
 sidewalk-0.006
Token
Token ID198
Feature activation-3.35e-03

Pos logit contributions
 nearer+0.018
 afloat+0.017
eled+0.017
 submerged+0.016
 closer+0.016
Neg logit contributions
 Math-0.019
 books-0.017
 dictionary-0.017
 sidewalk-0.016
 automobiles-0.016
TokenWhen
Token ID2215
Feature activation+6.86e-03

Pos logit contributions
 nearer+0.004
 closer+0.004
 hes+0.003
eled+0.003
 submerged+0.003
Neg logit contributions
 Math-0.006
 automobiles-0.006
 earthquake-0.006
 sidewalk-0.006
 computer-0.006
Token His
Token ID2399
Feature activation-5.10e-03

Pos logit contributions
 nearer+0.013
 afloat+0.012
eled+0.012
 submerged+0.012
apa+0.012
Neg logit contributions
 Math-0.014
 books-0.013
 dictionary-0.012
 automobiles-0.012
 sidewalk-0.012
Token peace
Token ID4167
Feature activation-2.33e-04

Pos logit contributions
 nearer+0.008
eled+0.007
 afloat+0.007
 submerged+0.007
igi+0.007
Neg logit contributions
 Math-0.008
 books-0.008
 dictionary-0.008
 piano-0.007
 sidewalk-0.007
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
 nearer+0.000
 afloat+0.000
eled+0.000
 closer+0.000
apa+0.000
Neg logit contributions
 Math-0.000
 books-0.000
 dictionary-0.000
 piano-0.000
 automobiles-0.000
Token They
Token ID1119
Feature activation-1.32e-02

Pos logit contributions
 nearer+0.029
eled+0.028
 afloat+0.028
 closer+0.027
 submerged+0.027
Neg logit contributions
 Math-0.023
 automobiles-0.018
 books-0.018
 sidewalk-0.018
 dictionary-0.018
Token asked
Token ID1965
Feature activation-9.17e-03

Pos logit contributions
 nearer+0.022
 afloat+0.021
eled+0.020
 closer+0.020
 submerged+0.020
Neg logit contributions
 Math-0.019
 books-0.018
 kids-0.017
 dictionary-0.017
 piano-0.016
Token Him
Token ID10978
Feature activation-4.34e-02

Pos logit contributions
 nearer+0.014
 afloat+0.014
 submerged+0.014
eled+0.014
ounced+0.013
Neg logit contributions
 Math-0.015
 books-0.014
 kids-0.013
 automobiles-0.013
 dictionary-0.013
Token to
Token ID284
Feature activation-6.76e-03

Pos logit contributions
eled+0.085
 afloat+0.081
 nearer+0.079
 submerged+0.078
igi+0.074
Neg logit contributions
 Math-0.056
 books-0.053
 dictionary-0.048
 kids-0.047
 games-0.045
Token calm
Token ID9480
Feature activation-1.16e-02

Pos logit contributions
 nearer+0.012
 afloat+0.011
eled+0.011
 closer+0.011
 submerged+0.011
Neg logit contributions
 Math-0.010
 dictionary-0.008
 automobiles-0.008
 books-0.008
 sidewalk-0.008
Token the
Token ID262
Feature activation+2.88e-04

Pos logit contributions
 afloat+0.018
 submerged+0.018
 nearer+0.018
eled+0.016
ounced+0.016
Neg logit contributions
 Math-0.018
 kids-0.017
 books-0.017
 automobiles-0.016
 sidewalk-0.016
Token storm
Token ID6388
Feature activation-1.38e-02

Pos logit contributions
 Math+0.000
 dictionary+0.000
 automobiles+0.000
 books+0.000
 sidewalk+0.000
Neg logit contributions
 nearer-0.000
 closer-0.000
 afloat-0.000
eled-0.000
 submerged-0.000
Token and
Token ID290
Feature activation-5.71e-03

Pos logit contributions
eled+0.024
 nearer+0.023
 afloat+0.023
apa+0.022
 submerged+0.022
Neg logit contributions
 Math-0.019
 books-0.017
 piano-0.017
 automobiles-0.017
 sidewalk-0.016
Token was
Token ID373
Feature activation+3.61e-03

Pos logit contributions
 nearer+0.000
 afloat+0.000
 numb+0.000
 closer+0.000
 submerged+0.000
Neg logit contributions
 Math-0.000
 kids-0.000
 books-0.000
 sidewalk-0.000
 dictionary-0.000
Token a
Token ID257
Feature activation+7.17e-03

Pos logit contributions
 Math+0.004
 books+0.004
 kids+0.003
 games+0.003
 automobiles+0.003
Neg logit contributions
 nearer-0.008
apa-0.007
 afloat-0.007
eled-0.007
igi-0.007
Token little
Token ID1310
Feature activation-4.88e-03

Pos logit contributions
 Math+0.009
aces+0.009
 automobiles+0.008
issors+0.008
 sidewalk+0.007
Neg logit contributions
 closer-0.013
 nearer-0.013
 afloat-0.012
 submerged-0.012
 polar-0.012
Token girl
Token ID2576
Feature activation-1.77e-02

Pos logit contributions
 nearer+0.009
 closer+0.009
 afloat+0.009
eled+0.009
 submerged+0.009
Neg logit contributions
 Math-0.006
 sidewalk-0.005
aces-0.005
 automobiles-0.005
 dictionary-0.004
Token called
Token ID1444
Feature activation-8.83e-03

Pos logit contributions
 nearer+0.026
eled+0.024
 afloat+0.024
 closer+0.024
 submerged+0.023
Neg logit contributions
 Math-0.031
 automobiles-0.027
 dictionary-0.027
 sidewalk-0.026
 books-0.026
Token Lex
Token ID17210
Feature activation-4.27e-02

Pos logit contributions
 Madd+0.014
 nearer+0.014
 closer+0.014
apa+0.013
 Maggie+0.013
Neg logit contributions
 automobiles-0.012
 Math-0.012
 sidewalk-0.012
Deal-0.011
cles-0.011
Tokeni
Token ID72
Feature activation-9.46e-03

Pos logit contributions
 nearer+0.046
 afloat+0.043
 submerged+0.041
 closer+0.040
eled+0.040
Neg logit contributions
 Math-0.087
 dictionary-0.080
 books-0.080
 automobiles-0.079
 kids-0.078
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
 closer+0.015
 nearer+0.015
eled+0.015
apa+0.015
pt+0.014
Neg logit contributions
 Math-0.015
 sidewalk-0.012
 automobiles-0.012
 mine-0.012
 dictionary-0.011
Token She
Token ID1375
Feature activation+4.05e-03

Pos logit contributions
 afloat+0.015
 nearer+0.015
 submerged+0.014
eled+0.014
 closer+0.014
Neg logit contributions
 Math-0.008
 books-0.007
 dictionary-0.007
 sidewalk-0.006
 automobiles-0.006
Token lived
Token ID5615
Feature activation+9.48e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 dictionary+0.005
 books+0.005
 sidewalk+0.005
Neg logit contributions
 nearer-0.007
eled-0.006
 closer-0.006
 submerged-0.006
 afloat-0.006
Token in
Token ID287
Feature activation-1.03e-02

Pos logit contributions
 Math+0.008
 books+0.007
 kids+0.006
 dictionary+0.006
 piano+0.006
Neg logit contributions
 nearer-0.021
apa-0.021
 afloat-0.021
eled-0.021
 hes-0.020
Token.
Token ID13
Feature activation+1.31e-03

Pos logit contributions
eled+0.070
 afloat+0.068
 nearer+0.067
 fascination+0.067
 hes+0.066
Neg logit contributions
 Math-0.022
 books-0.021
 games-0.020
 kids-0.019
 automobiles-0.018
Token She
Token ID1375
Feature activation-6.19e-03

Pos logit contributions
 Math+0.002
 automobiles+0.002
 piano+0.002
 dictionary+0.002
 books+0.002
Neg logit contributions
 nearer-0.002
 closer-0.002
eled-0.002
apa-0.002
 submerged-0.002
Token hoped
Token ID10719
Feature activation-2.55e-02

Pos logit contributions
 nearer+0.013
 afloat+0.013
 closer+0.011
 numb+0.011
eled+0.011
Neg logit contributions
 Math-0.007
 books-0.007
 dictionary-0.006
 kids-0.006
 games-0.006
Token they
Token ID484
Feature activation-1.41e-02

Pos logit contributions
 nearer+0.048
 afloat+0.048
eled+0.047
 submerged+0.045
 closer+0.044
Neg logit contributions
 Math-0.032
 books-0.028
 dictionary-0.027
 sidewalk-0.027
 automobiles-0.026
Token would
Token ID561
Feature activation-2.33e-02

Pos logit contributions
 nearer+0.026
 afloat+0.025
eled+0.025
 submerged+0.024
 closer+0.024
Neg logit contributions
 Math-0.018
 books-0.016
 dictionary-0.015
 automobiles-0.015
 sidewalk-0.015
Token grow
Token ID1663
Feature activation-4.21e-02

Pos logit contributions
eled+0.051
ounced+0.046
apa+0.045
 nearer+0.045
oped+0.045
Neg logit contributions
 Math-0.024
 do-0.024
 test-0.024
 books-0.023
 number-0.023
Token into
Token ID656
Feature activation-2.00e-02

Pos logit contributions
eled+0.090
oped+0.085
 afloat+0.084
ounced+0.083
apa+0.082
Neg logit contributions
 books-0.042
 games-0.039
 Math-0.037
 kids-0.037
 other-0.035
Token pretty
Token ID2495
Feature activation-9.35e-03

Pos logit contributions
 nearer+0.038
eled+0.037
 afloat+0.037
uttered+0.037
ounced+0.036
Neg logit contributions
 Math-0.024
 books-0.023
 kids-0.021
 games-0.021
 dictionary-0.020
Token flowers
Token ID12734
Feature activation-7.22e-03

Pos logit contributions
 nearer+0.019
 afloat+0.019
eled+0.019
apa+0.018
 submerged+0.018
Neg logit contributions
 Math-0.009
 books-0.009
 games-0.008
 kids-0.008
 sidewalk-0.008
Token.
Token ID13
Feature activation-2.83e-03

Pos logit contributions
 nearer+0.013
 afloat+0.013
eled+0.013
 closer+0.013
 submerged+0.012
Neg logit contributions
 Math-0.009
 dictionary-0.008
 books-0.008
 automobiles-0.008
 sidewalk-0.008
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
 nearer+0.003
 closer+0.003
eled+0.003
apa+0.003
 afloat+0.003
Neg logit contributions
 Math-0.005
 piano-0.005
 automobiles-0.005
 dictionary-0.005
 books-0.005
Token cute
Token ID13779
Feature activation+8.83e-03

Pos logit contributions
 Math+0.008
aces+0.007
 automobiles+0.007
issors+0.006
 sidewalk+0.006
Neg logit contributions
 closer-0.012
 nearer-0.011
 afloat-0.011
 submerged-0.011
Wal-0.010
Token pink
Token ID11398
Feature activation+5.41e-03

Pos logit contributions
 Math+0.011
 automobiles+0.009
aces+0.009
issors+0.009
 sidewalk+0.009
Neg logit contributions
 closer-0.016
 nearer-0.016
eled-0.015
 afloat-0.015
iri-0.015
Token rabbit
Token ID22746
Feature activation-3.83e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
aces+0.006
issors+0.006
 sidewalk+0.006
Neg logit contributions
 closer-0.009
 nearer-0.008
eled-0.008
iri-0.008
 bur-0.008
Token called
Token ID1444
Feature activation-6.40e-03

Pos logit contributions
 nearer+0.005
 closer+0.005
eled+0.005
 afloat+0.005
 submerged+0.005
Neg logit contributions
 Math-0.007
 sidewalk-0.006
 automobiles-0.006
 books-0.006
 dictionary-0.006
Token Mrs
Token ID9074
Feature activation-3.10e-02

Pos logit contributions
 nearer+0.009
apa+0.009
 closer+0.009
 Ralph+0.008
 hes+0.008
Neg logit contributions
 Math-0.010
 sidewalk-0.010
 automobiles-0.010
 piano-0.009
 exam-0.009
Token Fl
Token ID1610
Feature activation-4.21e-02

Pos logit contributions
 nearer+0.050
 closer+0.048
apa+0.047
 afloat+0.047
pt+0.046
Neg logit contributions
 Math-0.045
 sidewalk-0.039
 piano-0.039
 test-0.038
 games-0.038
Tokenuffy
Token ID15352
Feature activation-5.89e-03

Pos logit contributions
 nearer+0.047
 afloat+0.042
eled+0.042
 closer+0.041
 submerged+0.040
Neg logit contributions
 Math-0.087
 dictionary-0.079
 automobiles-0.077
 piano-0.077
 books-0.077
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
 closer+0.008
pt+0.008
 nearer+0.008
fly+0.008
 stro+0.008
Neg logit contributions
 Math-0.010
 sidewalk-0.009
 pavement-0.008
 piano-0.008
 test-0.008
Token She
Token ID1375
Feature activation+4.67e-03

Pos logit contributions
 nearer+0.022
eled+0.021
 afloat+0.021
 closer+0.021
 submerged+0.020
Neg logit contributions
 Math-0.023
 books-0.020
 automobiles-0.020
 dictionary-0.020
 sidewalk-0.020
Token was
Token ID373
Feature activation+7.06e-04

Pos logit contributions
 Math+0.005
 books+0.004
 dictionary+0.004
 sidewalk+0.004
 automobiles+0.004
Neg logit contributions
 nearer-0.010
 afloat-0.009
 closer-0.009
eled-0.009
apa-0.009
Token small
Token ID1402
Feature activation-1.42e-02

Pos logit contributions
 Math+0.001
 books+0.001
 dictionary+0.001
 kids+0.001
 piano+0.001
Neg logit contributions
 nearer-0.001
 afloat-0.001
eled-0.001
apa-0.001
 closer-0.001
Token and
Token ID290
Feature activation-4.50e-03

Pos logit contributions
 closer+0.022
 nearer+0.021
 deeper+0.018
 cottage+0.018
 afloat+0.018
Neg logit contributions
 Math-0.027
arrass-0.024
 Kids-0.023
 mine-0.023
aces-0.023
Token said
Token ID531
Feature activation-6.38e-03

Pos logit contributions
 nearer+0.007
 closer+0.007
eled+0.006
 afloat+0.006
 submerged+0.006
Neg logit contributions
 Math-0.007
 automobiles-0.006
 dictionary-0.006
 sidewalk-0.006
 books-0.006
Token,
Token ID11
Feature activation+1.48e-03

Pos logit contributions
 nearer+0.009
 closer+0.009
 afloat+0.008
apa+0.008
eled+0.008
Neg logit contributions
 Math-0.011
 automobiles-0.010
 sidewalk-0.010
 dictionary-0.009
 books-0.009
Token "
Token ID366
Feature activation-8.31e-03

Pos logit contributions
 Math+0.002
 automobiles+0.002
 books+0.002
 dictionary+0.002
 sidewalk+0.002
Neg logit contributions
 nearer-0.003
 closer-0.003
 hes-0.003
apa-0.002
 afloat-0.002
TokenS
Token ID50
Feature activation-1.41e-02

Pos logit contributions
 nearer+0.013
eled+0.013
 closer+0.012
 afloat+0.012
 submerged+0.012
Neg logit contributions
 Math-0.013
 automobiles-0.012
 dictionary-0.012
 books-0.012
 sidewalk-0.012
Tokenoph
Token ID2522
Feature activation-4.13e-02

Pos logit contributions
 nearer+0.017
eled+0.016
 closer+0.016
apa+0.015
igi+0.015
Neg logit contributions
 Math-0.027
 automobiles-0.025
 sidewalk-0.024
 piano-0.024
 dictionary-0.024
Tokenie
Token ID494
Feature activation-1.06e-02

Pos logit contributions
 nearer+0.055
 afloat+0.052
eled+0.051
 closer+0.050
 submerged+0.049
Neg logit contributions
 Math-0.076
 dictionary-0.067
 automobiles-0.067
 books-0.067
 sidewalk-0.067
Token,
Token ID11
Feature activation-2.35e-04

Pos logit contributions
 nearer+0.011
apa+0.011
 closer+0.010
 afloat+0.010
ung+0.009
Neg logit contributions
 Math-0.021
 sidewalk-0.020
 automobiles-0.019
 pavement-0.019
 piano-0.019
Token no
Token ID645
Feature activation+4.53e-04

Pos logit contributions
 closer+0.000
 nearer+0.000
 hes+0.000
 deeper+0.000
apa+0.000
Neg logit contributions
 Math-0.000
 automobiles-0.000
 sidewalk-0.000
aces-0.000
 robots-0.000
Token matter
Token ID2300
Feature activation+3.36e-03

Pos logit contributions
aces+0.001
Deal+0.001
issors+0.001
scope+0.001
Ju+0.001
Neg logit contributions
 closer-0.001
 nearer-0.001
 deeper-0.000
 soon-0.000
 hes-0.000
Token how
Token ID703
Feature activation+6.36e-03

Pos logit contributions
 sidewalk+0.007
 pavement+0.007
 Math+0.007
board+0.007
Ju+0.007
Neg logit contributions
 nearer-0.003
 closer-0.003
 hes-0.003
 deeper-0.003
pt-0.003
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
 nearer+0.019
 afloat+0.018
eled+0.018
 submerged+0.017
 closer+0.017
Neg logit contributions
 Math-0.013
 books-0.011
 dictionary-0.011
 automobiles-0.010
 sidewalk-0.010
Token Lily
Token ID20037
Feature activation+6.70e-03

Pos logit contributions
 Math+0.027
 sidewalk+0.024
 automobiles+0.023
 piano+0.023
 dictionary+0.022
Neg logit contributions
 nearer-0.028
 closer-0.027
 stro-0.025
 hes-0.025
 afloat-0.025
Token calls
Token ID3848
Feature activation-2.73e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 automobiles+0.007
 books+0.007
 sidewalk+0.007
Neg logit contributions
 nearer-0.013
 afloat-0.013
eled-0.012
 closer-0.012
 submerged-0.012
Token her
Token ID607
Feature activation-1.68e-02

Pos logit contributions
 nearer+0.004
 afloat+0.004
 closer+0.004
eled+0.003
 submerged+0.003
Neg logit contributions
 Math-0.005
 dictionary-0.004
 automobiles-0.004
 books-0.004
 sidewalk-0.004
Token rabbit
Token ID22746
Feature activation-1.39e-02

Pos logit contributions
 nearer+0.030
 closer+0.028
 afloat+0.028
eled+0.027
 submerged+0.027
Neg logit contributions
 Math-0.024
 sidewalk-0.020
 automobiles-0.020
 dictionary-0.020
 books-0.019
Token Fl
Token ID1610
Feature activation-4.11e-02

Pos logit contributions
 nearer+0.026
 closer+0.025
 afloat+0.025
eled+0.024
 submerged+0.024
Neg logit contributions
 Math-0.018
 sidewalk-0.015
 automobiles-0.015
 dictionary-0.015
 piano-0.014
Tokenuffy
Token ID15352
Feature activation-1.44e-02

Pos logit contributions
 nearer+0.060
 hes+0.055
 afloat+0.055
eled+0.053
 closer+0.053
Neg logit contributions
 Math-0.067
 automobiles-0.062
 kids-0.062
 dictionary-0.061
 piano-0.061
Token.
Token ID13
Feature activation-6.92e-03

Pos logit contributions
 nearer+0.026
 closer+0.025
 afloat+0.025
eled+0.024
apa+0.024
Neg logit contributions
 Math-0.020
 sidewalk-0.017
 dictionary-0.016
 automobiles-0.016
 piano-0.016
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
 nearer+0.014
eled+0.014
 afloat+0.013
 submerged+0.013
 closer+0.013
Neg logit contributions
 Math-0.009
 kids-0.007
 books-0.007
 automobiles-0.007
 sidewalk-0.007
Token
Token ID198
Feature activation-8.05e-04

Pos logit contributions
 nearer+0.022
 afloat+0.022
eled+0.022
maid+0.021
 submerged+0.021
Neg logit contributions
 Math-0.010
 kids-0.008
 sidewalk-0.008
 books-0.008
 Kids-0.008
TokenMom
Token ID29252
Feature activation+1.79e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
eled+0.001
 closer+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
 sidewalk-0.001
Tokenmy
Token ID1820
Feature activation-3.09e-03

Pos logit contributions
 Math+0.005
 books+0.004
 dictionary+0.004
 automobiles+0.004
 kids+0.004
Neg logit contributions
 nearer-0.006
 afloat-0.006
 closer-0.006
eled-0.006
 submerged-0.006
Token said
Token ID531
Feature activation-8.72e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
 numb+0.006
 closer+0.005
eled+0.005
Neg logit contributions
 Math-0.004
 kids-0.004
 books-0.003
 automobiles-0.003
 dictionary-0.003
Token,
Token ID11
Feature activation+1.81e-03

Pos logit contributions
 nearer+0.015
 afloat+0.015
eled+0.014
 submerged+0.014
 closer+0.014
Neg logit contributions
 Math-0.013
 books-0.011
 dictionary-0.011
 automobiles-0.011
 piano-0.011
Token "
Token ID366
Feature activation-1.10e-02

Pos logit contributions
 Math+0.002
 automobiles+0.002
 sidewalk+0.002
aces+0.002
 pavement+0.002
Neg logit contributions
 closer-0.003
 nearer-0.003
 hes-0.003
apa-0.003
 stro-0.003
TokenS
Token ID50
Feature activation-1.61e-02

Pos logit contributions
 nearer+0.017
eled+0.016
 closer+0.016
 afloat+0.016
 submerged+0.016
Neg logit contributions
 Math-0.018
 automobiles-0.016
 dictionary-0.016
 books-0.015
 sidewalk-0.015
Tokenoph
Token ID2522
Feature activation-4.09e-02

Pos logit contributions
 nearer+0.021
eled+0.020
 closer+0.019
apa+0.019
 hes+0.018
Neg logit contributions
 Math-0.030
 automobiles-0.027
 sidewalk-0.027
 dictionary-0.027
 piano-0.026
Tokenie
Token ID494
Feature activation-7.42e-03

Pos logit contributions
 nearer+0.057
 afloat+0.055
 closer+0.052
eled+0.051
 submerged+0.051
Neg logit contributions
 Math-0.072
 books-0.065
 dictionary-0.065
 automobiles-0.064
 sidewalk-0.063
Token,
Token ID11
Feature activation-6.33e-03

Pos logit contributions
apa+0.008
 nearer+0.008
 afloat+0.007
 closer+0.007
ung+0.007
Neg logit contributions
 Math-0.015
 automobiles-0.014
 sidewalk-0.013
 dictionary-0.013
 books-0.013
Token let
Token ID1309
Feature activation-3.35e-03

Pos logit contributions
 closer+0.010
 nearer+0.010
 hes+0.009
 deeper+0.009
apa+0.008
Neg logit contributions
 Math-0.010
 automobiles-0.009
aces-0.009
 sidewalk-0.009
 robots-0.008
Token's
Token ID338
Feature activation+6.93e-03

Pos logit contributions
 nearer+0.006
apa+0.006
 closer+0.006
eled+0.006
 hes+0.005
Neg logit contributions
 Math-0.005
 dictionary-0.004
 automobiles-0.004
 sidewalk-0.004
 books-0.004
Token go
Token ID467
Feature activation-1.68e-02

Pos logit contributions
 Math+0.011
 dictionary+0.010
 automobiles+0.010
aces+0.010
 sidewalk+0.009
Neg logit contributions
 nearer-0.011
 afloat-0.011
 closer-0.010
 submerged-0.010
eled-0.009
Token "
Token ID366
Feature activation-1.00e-02

Pos logit contributions
 Math+0.002
 sidewalk+0.002
 automobiles+0.002
aces+0.002
 books+0.002
Neg logit contributions
 closer-0.003
 nearer-0.003
 hes-0.003
apa-0.003
 afloat-0.003
TokenWhen
Token ID2215
Feature activation-1.31e-03

Pos logit contributions
 nearer+0.016
 afloat+0.015
eled+0.015
 closer+0.015
 submerged+0.015
Neg logit contributions
 Math-0.016
 dictionary-0.014
 automobiles-0.014
 books-0.014
 sidewalk-0.014
Token are
Token ID389
Feature activation+3.29e-03

Pos logit contributions
 nearer+0.002
 closer+0.002
 submerged+0.002
 hes+0.002
 deeper+0.002
Neg logit contributions
 Math-0.002
 automobiles-0.002
 sidewalk-0.002
 dictionary-0.002
aces-0.001
Token we
Token ID356
Feature activation-1.41e-02

Pos logit contributions
 Math+0.004
 dictionary+0.003
 sidewalk+0.003
 automobiles+0.003
 piano+0.003
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
eled-0.006
 hes-0.006
Token going
Token ID1016
Feature activation-1.67e-02

Pos logit contributions
 nearer+0.022
 afloat+0.020
eled+0.020
 submerged+0.020
apa+0.020
Neg logit contributions
 Math-0.023
 kids-0.020
 books-0.020
 automobiles-0.020
 dictionary-0.019
Token on
Token ID319
Feature activation-4.07e-02

Pos logit contributions
 nearer+0.028
eled+0.028
apa+0.027
 afloat+0.027
 submerged+0.027
Neg logit contributions
 Math-0.024
 books-0.022
 automobiles-0.021
 sidewalk-0.021
 kids-0.020
Token holiday
Token ID9912
Feature activation-6.72e-03

Pos logit contributions
 nearer+0.073
eled+0.071
 afloat+0.070
apa+0.069
 submerged+0.067
Neg logit contributions
 Math-0.058
 books-0.053
 automobiles-0.050
 games-0.049
 mine-0.048
Token again
Token ID757
Feature activation-8.31e-03

Pos logit contributions
 nearer+0.009
 closer+0.009
 afloat+0.009
 submerged+0.008
 deeper+0.008
Neg logit contributions
 Math-0.012
 dictionary-0.011
 automobiles-0.010
 sidewalk-0.010
aces-0.010
Token?"
Token ID1701
Feature activation-9.65e-04

Pos logit contributions
 nearer+0.014
 afloat+0.014
 closer+0.014
eled+0.013
 submerged+0.013
Neg logit contributions
 Math-0.012
 dictionary-0.010
 automobiles-0.010
 sidewalk-0.010
 books-0.010
Token
Token ID198
Feature activation-6.18e-03

Pos logit contributions
 nearer+0.001
eled+0.001
 closer+0.001
apa+0.001
 afloat+0.001
Neg logit contributions
 Math-0.002
 dictionary-0.002
 books-0.002
 automobiles-0.002
 sidewalk-0.001
Token
Token ID198
Feature activation+1.01e-03

Pos logit contributions
 afloat+0.012
 nearer+0.011
 submerged+0.011
eled+0.011
 hes+0.010
Neg logit contributions
 Math-0.008
 books-0.007
 automobiles-0.007
 kids-0.007
 dictionary-0.007
Token.
Token ID13
Feature activation+4.50e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 automobiles+0.007
 sidewalk+0.007
 piano+0.007
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.009
 hes-0.008
apa-0.008
Token
Token ID198
Feature activation-7.81e-04

Pos logit contributions
aces+0.006
 guys+0.006
 Math+0.006
 games+0.006
 mine+0.006
Neg logit contributions
 nearer-0.007
 closer-0.006
 hes-0.006
apa-0.006
 submerged-0.006
Token
Token ID198
Feature activation+3.21e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
 sidewalk-0.001
Token"
Token ID1
Feature activation-4.79e-04

Pos logit contributions
 Math+0.006
aces+0.006
 earthquake+0.006
 automobiles+0.006
 guys+0.005
Neg logit contributions
 nearer-0.004
 closer-0.004
 submerged-0.003
apa-0.003
eled-0.003
TokenMrs
Token ID27034
Feature activation-3.34e-02

Pos logit contributions
 nearer+0.001
 closer+0.001
eled+0.001
Wal+0.001
 hes+0.001
Neg logit contributions
 Math-0.001
aces-0.001
 sidewalk-0.001
 earthquake-0.001
 mine-0.001
Token Clar
Token ID15420
Feature activation-4.06e-02

Pos logit contributions
 nearer+0.052
apa+0.048
 closer+0.048
igi+0.047
 hes+0.047
Neg logit contributions
 Math-0.048
 mine-0.045
 games-0.045
 piano-0.044
 sidewalk-0.044
Tokenise
Token ID786
Feature activation+3.13e-03

Pos logit contributions
 nearer+0.060
 hes+0.059
eled+0.059
 closer+0.056
apa+0.056
Neg logit contributions
 Math-0.069
 sidewalk-0.063
 dictionary-0.060
 games-0.060
 books-0.059
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
 Math+0.005
 sidewalk+0.005
 games+0.004
 automobiles+0.004
 books+0.004
Neg logit contributions
apa-0.004
 nearer-0.004
 closer-0.004
pt-0.004
fly-0.004
Token please
Token ID3387
Feature activation+5.35e-03

Pos logit contributions
 pap+0.022
 closer+0.021
 nearer+0.020
apa+0.020
 hes+0.020
Neg logit contributions
Deal-0.024
whe-0.022
 sidewalk-0.021
 games-0.021
aces-0.021
Token help
Token ID1037
Feature activation+3.62e-03

Pos logit contributions
 Math+0.008
 automobiles+0.008
 sidewalk+0.007
 dictionary+0.007
 books+0.007
Neg logit contributions
 nearer-0.008
 closer-0.008
apa-0.008
 afloat-0.008
 submerged-0.007
Token me
Token ID502
Feature activation+8.97e-04

Pos logit contributions
 Math+0.006
aces+0.005
 dictionary+0.005
 automobiles+0.005
 sidewalk+0.005
Neg logit contributions
 nearer-0.005
 closer-0.005
apa-0.005
 afloat-0.005
 hes-0.005
Token to
Token ID284
Feature activation+7.57e-03

Pos logit contributions
 Math+0.013
 automobiles+0.012
 dictionary+0.012
 books+0.012
 sidewalk+0.012
Neg logit contributions
 nearer-0.010
 closer-0.009
 afloat-0.008
 hes-0.008
 submerged-0.008
Token invite
Token ID14037
Feature activation-5.59e-03

Pos logit contributions
 Math+0.012
 automobiles+0.012
 dictionary+0.011
aces+0.011
Deal+0.011
Neg logit contributions
 nearer-0.011
 closer-0.011
 afloat-0.010
 submerged-0.010
apa-0.010
Token her
Token ID607
Feature activation-9.47e-03

Pos logit contributions
 nearer+0.008
 closer+0.008
 afloat+0.008
 hes+0.007
eled+0.007
Neg logit contributions
 Math-0.009
 automobiles-0.008
 dictionary-0.008
 sidewalk-0.008
 piano-0.008
Token friend
Token ID1545
Feature activation+5.04e-03

Pos logit contributions
 nearer+0.016
 closer+0.015
 afloat+0.014
 submerged+0.014
apa+0.014
Neg logit contributions
 Math-0.014
 sidewalk-0.012
 automobiles-0.012
 dictionary-0.012
 pavement-0.012
Token R
Token ID371
Feature activation-2.47e-02

Pos logit contributions
 Math+0.006
 sidewalk+0.006
 automobiles+0.005
 dictionary+0.005
 piano+0.005
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.009
eled-0.009
 submerged-0.009
Tokenuf
Token ID3046
Feature activation-4.01e-02

Pos logit contributions
 nearer+0.034
 afloat+0.032
eled+0.031
 closer+0.031
 submerged+0.030
Neg logit contributions
 Math-0.045
 dictionary-0.040
 automobiles-0.039
 sidewalk-0.039
 books-0.039
Tokenus
Token ID385
Feature activation-2.28e-02

Pos logit contributions
 nearer+0.027
 afloat+0.025
eled+0.024
 submerged+0.021
 closer+0.021
Neg logit contributions
 Math-0.099
 books-0.091
 dictionary-0.090
 kids-0.089
 automobiles-0.089
Token to
Token ID284
Feature activation+6.83e-03

Pos logit contributions
 nearer+0.041
 afloat+0.039
 closer+0.039
eled+0.039
 submerged+0.038
Neg logit contributions
 Math-0.031
 dictionary-0.026
 sidewalk-0.026
 automobiles-0.026
 books-0.026
Token play
Token ID711
Feature activation-2.48e-03

Pos logit contributions
 Math+0.010
 automobiles+0.009
 sidewalk+0.009
 dictionary+0.009
 books+0.009
Neg logit contributions
 nearer-0.011
 closer-0.011
 afloat-0.010
 submerged-0.010
 hes-0.010
Token.
Token ID13
Feature activation+2.56e-03

Pos logit contributions
 nearer+0.004
 closer+0.004
 afloat+0.004
 submerged+0.004
eled+0.004
Neg logit contributions
 Math-0.004
 sidewalk-0.003
 automobiles-0.003
 dictionary-0.003
 earthquake-0.003
Token She
Token ID1375
Feature activation-3.18e-04

Pos logit contributions
 Math+0.004
 sidewalk+0.003
 books+0.003
 dictionary+0.003
 automobiles+0.003
Neg logit contributions
 nearer-0.004
 afloat-0.004
eled-0.004
 closer-0.004
 submerged-0.004
Token
Token ID198
Feature activation+7.46e-03

Pos logit contributions
 nearer+0.013
eled+0.013
 afloat+0.012
 submerged+0.012
 closer+0.012
Neg logit contributions
 Math-0.009
 sidewalk-0.008
 books-0.008
 dictionary-0.008
 automobiles-0.008
TokenLater
Token ID18602
Feature activation+1.74e-02

Pos logit contributions
 Math+0.009
 dictionary+0.008
 piano+0.008
 automobiles+0.008
 books+0.008
Neg logit contributions
 nearer-0.014
 submerged-0.013
 afloat-0.013
eled-0.013
 closer-0.013
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 Math+0.023
 dictionary+0.019
 automobiles+0.019
 books+0.019
 sidewalk+0.019
Neg logit contributions
 nearer-0.033
 afloat-0.031
eled-0.031
 closer-0.031
 submerged-0.030
Token Tim
Token ID5045
Feature activation-4.99e-04

Pos logit contributions
 Math+0.020
 dictionary+0.016
 sidewalk+0.016
 automobiles+0.016
 piano+0.015
Neg logit contributions
 nearer-0.039
 closer-0.038
 afloat-0.037
eled-0.036
 submerged-0.036
Token saw
Token ID2497
Feature activation+1.17e-02

Pos logit contributions
 nearer+0.001
 afloat+0.001
eled+0.001
 closer+0.001
apa+0.001
Neg logit contributions
 Math-0.001
 kids-0.001
 books-0.001
 sidewalk-0.001
 dictionary-0.001
Token Mr
Token ID1770
Feature activation-4.01e-02

Pos logit contributions
 Math+0.015
 books+0.013
 kids+0.013
 automobiles+0.012
 dictionary+0.012
Neg logit contributions
 nearer-0.023
 afloat-0.022
eled-0.022
 closer-0.021
 submerged-0.021
Token.
Token ID13
Feature activation-2.48e-02

Pos logit contributions
 nearer+0.085
eled+0.082
 afloat+0.079
 submerged+0.078
 closer+0.078
Neg logit contributions
 Math-0.045
 kids-0.038
 books-0.037
 automobiles-0.036
 guys-0.035
Token Bear
Token ID14732
Feature activation-1.06e-02

Pos logit contributions
 nearer+0.042
eled+0.041
ounced+0.040
apa+0.038
igi+0.038
Neg logit contributions
 Math-0.039
 robot-0.034
 robots-0.032
 kids-0.031
 shelf-0.031
Token on
Token ID319
Feature activation-1.11e-03

Pos logit contributions
 nearer+0.018
 afloat+0.017
 closer+0.017
eled+0.017
 submerged+0.017
Neg logit contributions
 Math-0.016
 automobiles-0.013
 dictionary-0.013
 sidewalk-0.013
 books-0.013
Token the
Token ID262
Feature activation+1.32e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
 submerged+0.001
 hes+0.001
Neg logit contributions
 Math-0.002
Deal-0.002
issors-0.002
aces-0.002
 sidewalk-0.002
Token swing
Token ID9628
Feature activation+1.46e-02

Pos logit contributions
 Math+0.002
 automobiles+0.001
 robots+0.001
 You+0.001
issors+0.001
Neg logit contributions
 closer-0.002
 nearer-0.002
 deeper-0.002
 sinking-0.002
 refreshed-0.002
Token After
Token ID2293
Feature activation-1.09e-02

Pos logit contributions
 nearer+0.000
apa+0.000
 closer+0.000
uttered+0.000
 submerged+0.000
Neg logit contributions
 exam-0.000
 dictionary-0.000
 piano-0.000
 Math-0.000
 test-0.000
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
eled+0.020
 afloat+0.020
 nearer+0.019
igi+0.019
apa+0.019
Neg logit contributions
 Math-0.015
 books-0.014
 games-0.013
 kids-0.013
 Kids-0.013
Token while
Token ID981
Feature activation-7.41e-03

Pos logit contributions
 Math+0.018
 automobiles+0.016
 dictionary+0.016
 piano+0.015
 sidewalk+0.015
Neg logit contributions
 nearer-0.017
 closer-0.016
 afloat-0.016
 submerged-0.015
eled-0.015
Token,
Token ID11
Feature activation-5.40e-04

Pos logit contributions
eled+0.014
 nearer+0.014
 afloat+0.014
 submerged+0.013
igi+0.013
Neg logit contributions
 Math-0.009
 books-0.008
 kids-0.008
 automobiles-0.007
 sidewalk-0.007
Token they
Token ID484
Feature activation-1.12e-02

Pos logit contributions
 nearer+0.001
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 automobiles-0.001
 sidewalk-0.001
 books-0.001
Token reached
Token ID4251
Feature activation-3.99e-02

Pos logit contributions
 numb+0.017
 afloat+0.017
 nearer+0.017
 fever+0.017
apa+0.016
Neg logit contributions
 Math-0.015
 kids-0.015
 books-0.014
 guys-0.014
 sidewalk-0.014
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
eled+0.099
 numb+0.094
igi+0.094
oped+0.093
ounced+0.093
Neg logit contributions
 books-0.028
 Math-0.027
 kids-0.023
 games-0.020
 other-0.020
Token big
Token ID1263
Feature activation-7.98e-03

Pos logit contributions
 nearer+0.020
eled+0.019
 afloat+0.019
 closer+0.018
 submerged+0.018
Neg logit contributions
 Math-0.016
 dictionary-0.013
 books-0.013
 sidewalk-0.013
 automobiles-0.013
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
 nearer+0.013
 closer+0.012
 afloat+0.012
eled+0.012
 submerged+0.011
Neg logit contributions
 Math-0.013
 automobiles-0.011
 dictionary-0.011
 sidewalk-0.010
 books-0.010
Token dark
Token ID3223
Feature activation-1.86e-02

Pos logit contributions
 nearer+0.026
 closer+0.025
 afloat+0.024
 deeper+0.023
 submerged+0.022
Neg logit contributions
 Math-0.025
 automobiles-0.021
aces-0.021
 dictionary-0.020
issors-0.020
Token cave
Token ID11527
Feature activation-1.53e-02

Pos logit contributions
 nearer+0.028
 afloat+0.026
eled+0.026
 closer+0.026
 submerged+0.025
Neg logit contributions
 Math-0.031
 dictionary-0.027
 automobiles-0.027
 books-0.027
 sidewalk-0.027
Token button
Token ID4936
Feature activation+2.71e-03

Pos logit contributions
 closer+0.016
 nearer+0.015
 deeper+0.013
 afloat+0.013
 numb+0.012
Neg logit contributions
 Math-0.022
 automobiles-0.020
 artist-0.020
Deal-0.019
 rider-0.019
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
 Math+0.004
 dictionary+0.003
 sidewalk+0.003
 automobiles+0.003
 piano+0.003
Neg logit contributions
 nearer-0.005
 closer-0.005
 afloat-0.004
eled-0.004
 submerged-0.004
Token They
Token ID1119
Feature activation-4.70e-03

Pos logit contributions
 nearer+0.026
 afloat+0.025
eled+0.025
 closer+0.024
 submerged+0.024
Neg logit contributions
 Math-0.018
 books-0.014
 sidewalk-0.014
 automobiles-0.014
 dictionary-0.014
Token hear
Token ID3285
Feature activation-4.30e-03

Pos logit contributions
 nearer+0.008
 closer+0.008
 afloat+0.007
 submerged+0.007
 hes+0.007
Neg logit contributions
 Math-0.007
 automobiles-0.006
 dictionary-0.006
 sidewalk-0.006
 computer-0.006
Token a
Token ID257
Feature activation-6.33e-03

Pos logit contributions
 nearer+0.007
 afloat+0.007
eled+0.007
 closer+0.007
 submerged+0.007
Neg logit contributions
 Math-0.006
 dictionary-0.006
 automobiles-0.006
 books-0.005
 sidewalk-0.005
Token wh
Token ID348
Feature activation-3.99e-02

Pos logit contributions
 nearer+0.011
 closer+0.011
 afloat+0.010
 submerged+0.010
eled+0.010
Neg logit contributions
 Math-0.009
 automobiles-0.008
 dictionary-0.008
 books-0.007
aces-0.007
Tokenir
Token ID343
Feature activation-7.74e-04

Pos logit contributions
 nearer+0.055
 stro+0.051
 closer+0.051
 afloat+0.051
 submerged+0.050
Neg logit contributions
 Math-0.069
 piano-0.061
 exam-0.060
 test-0.060
 kids-0.060
Tokenring
Token ID1806
Feature activation+1.07e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
 hes+0.001
eled+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 books-0.001
 automobiles-0.001
 piano-0.001
Token sound
Token ID2128
Feature activation-1.18e-02

Pos logit contributions
 Math+0.002
 dictionary+0.001
issors+0.001
 automobiles+0.001
 mine+0.001
Neg logit contributions
 nearer-0.002
 closer-0.002
 hes-0.002
eled-0.002
 submerged-0.002
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 nearer+0.023
 afloat+0.022
eled+0.022
apa+0.022
 submerged+0.022
Neg logit contributions
 Math-0.014
 books-0.012
 automobiles-0.012
 sidewalk-0.012
 dictionary-0.012
Token see
Token ID766
Feature activation-1.40e-03

Pos logit contributions
 nearer+0.031
eled+0.031
 afloat+0.031
igi+0.030
apa+0.029
Neg logit contributions
 Math-0.029
 books-0.025
 sidewalk-0.025
 kids-0.024
 dictionary-0.024
Token grandma
Token ID49890
Feature activation-1.74e-02

Pos logit contributions
 nearer+0.022
 afloat+0.021
eled+0.021
 submerged+0.020
 closer+0.020
Neg logit contributions
 Math-0.015
 books-0.013
 dictionary-0.013
 automobiles-0.013
 kids-0.013
Token.
Token ID13
Feature activation-7.63e-03

Pos logit contributions
 nearer+0.029
 afloat+0.028
eled+0.028
 submerged+0.027
 closer+0.027
Neg logit contributions
 Math-0.026
 books-0.023
 dictionary-0.022
 automobiles-0.022
 sidewalk-0.022
Token Grand
Token ID5675
Feature activation-2.43e-02

Pos logit contributions
 afloat+0.017
 nearer+0.016
eled+0.016
 submerged+0.015
 surfaced+0.015
Neg logit contributions
 Math-0.009
 books-0.007
 Miss-0.007
 dictionary-0.007
 kids-0.007
Tokenma
Token ID2611
Feature activation-4.97e-03

Pos logit contributions
 nearer+0.043
 afloat+0.042
eled+0.040
 closer+0.040
 submerged+0.040
Neg logit contributions
 Math-0.035
 books-0.029
 automobiles-0.029
 dictionary-0.029
 sidewalk-0.029
Token said
Token ID531
Feature activation-1.44e-02

Pos logit contributions
 afloat+0.009
 nearer+0.009
eled+0.008
 submerged+0.008
 closer+0.008
Neg logit contributions
 Math-0.007
 books-0.006
 dictionary-0.006
 automobiles-0.006
 piano-0.006
Token Fl
Token ID1610
Feature activation-3.99e-02

Pos logit contributions
 nearer+0.025
 afloat+0.024
eled+0.024
 submerged+0.023
 closer+0.022
Neg logit contributions
 Math-0.022
 books-0.019
 dictionary-0.018
 automobiles-0.018
 kids-0.018
Tokenuffy
Token ID15352
Feature activation-1.08e-02

Pos logit contributions
 nearer+0.054
 afloat+0.051
eled+0.049
 closer+0.049
 submerged+0.047
Neg logit contributions
 Math-0.074
 books-0.066
 dictionary-0.066
 automobiles-0.065
 piano-0.064
Token was
Token ID373
Feature activation+9.87e-04

Pos logit contributions
 nearer+0.017
 closer+0.016
eled+0.016
 afloat+0.016
apa+0.016
Neg logit contributions
 Math-0.017
 sidewalk-0.015
 automobiles-0.014
 dictionary-0.014
 piano-0.014
Token not
Token ID407
Feature activation+1.28e-03

Pos logit contributions
 Math+0.002
 automobiles+0.001
 sidewalk+0.001
 dictionary+0.001
 books+0.001
Neg logit contributions
 nearer-0.001
 closer-0.001
 afloat-0.001
 submerged-0.001
eled-0.001
Token used
Token ID973
Feature activation+2.09e-02

Pos logit contributions
 Math+0.002
 sidewalk+0.002
board+0.002
 automobiles+0.002
 pavement+0.002
Neg logit contributions
 closer-0.002
 nearer-0.002
 afloat-0.002
 hes-0.002
pt-0.002
Token to
Token ID284
Feature activation-3.19e-03

Pos logit contributions
 Math+0.036
 sidewalk+0.033
 dictionary+0.032
 automobiles+0.032
 piano+0.031
Neg logit contributions
 nearer-0.029
 closer-0.028
 afloat-0.027
 submerged-0.026
eled-0.026
Token to
Token ID284
Feature activation-7.36e-03

Pos logit contributions
eled+0.001
 afloat+0.001
apa+0.001
 hes+0.001
 nearer+0.001
Neg logit contributions
 Math-0.001
 books-0.001
 test-0.001
 sidewalk-0.001
 kids-0.001
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
 nearer+0.015
eled+0.015
 afloat+0.015
 submerged+0.014
 closer+0.014
Neg logit contributions
 Math-0.008
 books-0.007
 sidewalk-0.006
 dictionary-0.006
 automobiles-0.006
Token kitchen
Token ID9592
Feature activation-1.95e-02

Pos logit contributions
 nearer+0.019
 afloat+0.019
 closer+0.018
eled+0.018
 submerged+0.018
Neg logit contributions
 Math-0.016
 automobiles-0.013
 dictionary-0.013
 books-0.013
 sidewalk-0.013
Token and
Token ID290
Feature activation-6.36e-04

Pos logit contributions
 nearer+0.034
 afloat+0.033
eled+0.032
 closer+0.032
 submerged+0.032
Neg logit contributions
 Math-0.028
 dictionary-0.023
 automobiles-0.023
 books-0.023
 sidewalk-0.023
Token look
Token ID804
Feature activation-1.77e-02

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 sidewalk-0.001
 piano-0.001
Token for
Token ID329
Feature activation-3.99e-02

Pos logit contributions
 nearer+0.033
 afloat+0.032
eled+0.031
 closer+0.031
 submerged+0.031
Neg logit contributions
 Math-0.023
 dictionary-0.019
 automobiles-0.019
 books-0.019
 sidewalk-0.019
Token food
Token ID2057
Feature activation-9.39e-03

Pos logit contributions
 nearer+0.064
 afloat+0.063
eled+0.061
 submerged+0.059
igi+0.058
Neg logit contributions
 Math-0.062
 books-0.057
 kids-0.054
 dictionary-0.053
 automobiles-0.052
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 nearer+0.018
 afloat+0.017
 closer+0.017
eled+0.017
 submerged+0.017
Neg logit contributions
 Math-0.012
 automobiles-0.010
 dictionary-0.010
 sidewalk-0.010
 piano-0.009
Token They
Token ID1119
Feature activation-1.14e-02

Pos logit contributions
 nearer+0.025
 afloat+0.024
eled+0.024
igi+0.024
 submerged+0.023
Neg logit contributions
 Math-0.018
 kids-0.014
 books-0.014
 sidewalk-0.013
 dictionary-0.013
Token see
Token ID766
Feature activation-2.04e-02

Pos logit contributions
 nearer+0.019
 afloat+0.018
eled+0.018
igi+0.017
 submerged+0.017
Neg logit contributions
 Math-0.017
 books-0.015
 dictionary-0.015
 kids-0.015
 test-0.015
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 nearer+0.038
 afloat+0.037
eled+0.036
uttered+0.035
igi+0.035
Neg logit contributions
 books-0.027
 Math-0.026
 kids-0.024
 automobiles-0.023
 dictionary-0.023
Token
Token ID198
Feature activation-1.99e-03

Pos logit contributions
 Math+0.005
 automobiles+0.004
 kids+0.004
 books+0.004
 dictionary+0.004
Neg logit contributions
 nearer-0.005
 afloat-0.005
eled-0.005
 submerged-0.005
 closer-0.005
Token
Token ID198
Feature activation+5.11e-03

Pos logit contributions
 afloat+0.004
 nearer+0.004
eled+0.004
 submerged+0.004
apa+0.003
Neg logit contributions
 Math-0.003
 books-0.002
 kids-0.002
 dictionary-0.002
 sidewalk-0.002
Token"
Token ID1
Feature activation+3.23e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 automobiles+0.007
 books+0.007
 sidewalk+0.007
Neg logit contributions
 nearer-0.008
 afloat-0.008
 closer-0.008
eled-0.008
 submerged-0.008
TokenHello
Token ID15496
Feature activation-1.05e-02

Pos logit contributions
 Math+0.005
 dictionary+0.004
 automobiles+0.004
 sidewalk+0.004
 books+0.004
Neg logit contributions
 nearer-0.006
 afloat-0.006
apa-0.005
eled-0.005
 closer-0.005
Token,
Token ID11
Feature activation+4.86e-04

Pos logit contributions
 nearer+0.017
 afloat+0.017
eled+0.017
 closer+0.016
 submerged+0.016
Neg logit contributions
 Math-0.016
 dictionary-0.014
 books-0.014
 automobiles-0.014
 sidewalk-0.013
Token Fl
Token ID1610
Feature activation-3.99e-02

Pos logit contributions
 Math+0.001
 sidewalk+0.001
 automobiles+0.001
 books+0.001
 dictionary+0.001
Neg logit contributions
 nearer-0.001
 closer-0.001
 afloat-0.001
 submerged-0.001
eled-0.001
Tokenoppy
Token ID26696
Feature activation-1.89e-02

Pos logit contributions
 nearer+0.053
 hes+0.047
eled+0.046
 closer+0.046
 afloat+0.045
Neg logit contributions
 Math-0.076
 dictionary-0.069
 piano-0.066
 robot-0.066
 kids-0.066
Token!"
Token ID2474
Feature activation+2.68e-03

Pos logit contributions
 nearer+0.032
 closer+0.031
 afloat+0.030
 submerged+0.029
eled+0.029
Neg logit contributions
 Math-0.029
 automobiles-0.024
 sidewalk-0.024
 dictionary-0.023
 books-0.023
Token Tweet
Token ID18752
Feature activation+2.44e-03

Pos logit contributions
 Math+0.004
 automobiles+0.004
 dictionary+0.004
 sidewalk+0.004
 books+0.004
Neg logit contributions
 nearer-0.004
 afloat-0.004
 closer-0.004
eled-0.004
 submerged-0.004
Tokenie
Token ID494
Feature activation+6.81e-03

Pos logit contributions
 Math+0.004
 dictionary+0.003
 automobiles+0.003
 books+0.003
 sidewalk+0.003
Neg logit contributions
 nearer-0.004
 afloat-0.004
eled-0.004
 submerged-0.003
 closer-0.003
Token ch
Token ID442
Feature activation-4.25e-03

Pos logit contributions
 Math+0.009
 dictionary+0.008
 books+0.008
 automobiles+0.008
 kids+0.007
Neg logit contributions
 nearer-0.012
 afloat-0.012
eled-0.012
 submerged-0.011
 closer-0.011
Token way
Token ID835
Feature activation-9.60e-03

Pos logit contributions
 closer+0.005
 nearer+0.005
 afloat+0.005
 deeper+0.005
 underwater+0.005
Neg logit contributions
 Math-0.006
issors-0.005
 automobiles-0.005
aces-0.005
 dictionary-0.005
Token.
Token ID13
Feature activation-7.66e-03

Pos logit contributions
 nearer+0.019
 afloat+0.018
eled+0.018
 closer+0.018
 submerged+0.017
Neg logit contributions
 Math-0.012
 automobiles-0.010
 sidewalk-0.009
 dictionary-0.009
 books-0.009
Token Finally
Token ID9461
Feature activation-1.09e-02

Pos logit contributions
 nearer+0.012
 afloat+0.011
eled+0.011
 closer+0.011
 submerged+0.011
Neg logit contributions
 Math-0.012
 dictionary-0.011
 automobiles-0.011
 books-0.011
 sidewalk-0.011
Token,
Token ID11
Feature activation+1.66e-03

Pos logit contributions
 nearer+0.020
eled+0.020
 afloat+0.019
 submerged+0.019
 closer+0.019
Neg logit contributions
 Math-0.015
 books-0.012
 kids-0.012
 games-0.011
 mine-0.011
Token they
Token ID484
Feature activation-9.11e-03

Pos logit contributions
 Math+0.003
 dictionary+0.003
 automobiles+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.002
 afloat-0.002
 closer-0.002
eled-0.002
 submerged-0.002
Token reached
Token ID4251
Feature activation-3.98e-02

Pos logit contributions
 nearer+0.014
 afloat+0.014
eled+0.013
 numb+0.013
 submerged+0.013
Neg logit contributions
 Math-0.014
 books-0.013
 kids-0.012
 dictionary-0.012
 sidewalk-0.012
Token the
Token ID262
Feature activation-7.85e-03

Pos logit contributions
eled+0.093
igi+0.088
 nearer+0.088
ounced+0.088
apa+0.087
Neg logit contributions
 Math-0.034
 books-0.032
 kids-0.027
 games-0.026
 sidewalk-0.025
Token end
Token ID886
Feature activation-1.40e-02

Pos logit contributions
 nearer+0.013
 closer+0.013
 afloat+0.012
 deeper+0.012
 submerged+0.011
Neg logit contributions
 Math-0.012
 automobiles-0.011
issors-0.010
aces-0.010
 dictionary-0.010
Token of
Token ID286
Feature activation-1.33e-02

Pos logit contributions
 nearer+0.026
eled+0.025
 afloat+0.025
 submerged+0.025
 closer+0.024
Neg logit contributions
 Math-0.018
 books-0.015
 automobiles-0.015
 dictionary-0.015
 sidewalk-0.015
Token their
Token ID511
Feature activation+4.00e-03

Pos logit contributions
 nearer+0.030
apa+0.030
eled+0.030
ounced+0.029
 submerged+0.029
Neg logit contributions
 Math-0.012
 books-0.010
 kids-0.009
 automobiles-0.009
 games-0.009
Token journey
Token ID7002
Feature activation+2.96e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 dictionary+0.005
 sidewalk+0.005
 books+0.005
Neg logit contributions
 nearer-0.007
 afloat-0.006
 closer-0.006
eled-0.006
 submerged-0.006
Token there
Token ID612
Feature activation+1.32e-03

Pos logit contributions
 Math+0.010
 books+0.009
 kids+0.009
 dictionary+0.009
 automobiles+0.009
Neg logit contributions
 nearer-0.006
 afloat-0.006
 submerged-0.005
eled-0.005
 stro-0.005
Token was
Token ID373
Feature activation+2.52e-03

Pos logit contributions
 Math+0.002
 books+0.001
 kids+0.001
 dictionary+0.001
 sidewalk+0.001
Neg logit contributions
 nearer-0.003
 afloat-0.002
 submerged-0.002
 numb-0.002
 closer-0.002
Token a
Token ID257
Feature activation+9.75e-03

Pos logit contributions
 Math+0.003
 kids+0.002
 books+0.002
 games+0.002
 automobiles+0.002
Neg logit contributions
 nearer-0.005
 afloat-0.005
apa-0.005
ounced-0.005
igi-0.005
Token little
Token ID1310
Feature activation-5.08e-04

Pos logit contributions
 Math+0.012
aces+0.011
 automobiles+0.011
issors+0.010
 sidewalk+0.010
Neg logit contributions
 closer-0.019
 nearer-0.018
 afloat-0.018
 submerged-0.017
Wal-0.017
Token boy
Token ID2933
Feature activation-1.56e-02

Pos logit contributions
 closer+0.001
 nearer+0.001
 afloat+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
aces-0.001
 sidewalk-0.001
 automobiles-0.001
 books-0.000
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
 nearer+0.023
 afloat+0.021
eled+0.021
 closer+0.021
 submerged+0.021
Neg logit contributions
 Math-0.027
 automobiles-0.023
 dictionary-0.023
 sidewalk-0.023
 books-0.023
Token who
Token ID508
Feature activation+1.63e-02

Pos logit contributions
 Math+0.030
 dictionary+0.025
 books+0.025
 automobiles+0.025
 piano+0.024
Neg logit contributions
 nearer-0.047
 afloat-0.045
eled-0.044
 submerged-0.044
apa-0.044
Token was
Token ID373
Feature activation+2.98e-03

Pos logit contributions
 Math+0.021
 dictionary+0.017
 books+0.017
 automobiles+0.017
 sidewalk+0.017
Neg logit contributions
 nearer-0.032
 afloat-0.030
eled-0.030
 closer-0.030
 submerged-0.029
Token very
Token ID845
Feature activation-7.45e-03

Pos logit contributions
 Math+0.005
 books+0.004
 dictionary+0.004
 piano+0.004
 automobiles+0.004
Neg logit contributions
 nearer-0.005
 afloat-0.005
eled-0.005
apa-0.005
 closer-0.005
Token brave
Token ID14802
Feature activation-9.50e-04

Pos logit contributions
 nearer+0.011
 afloat+0.010
 closer+0.010
eled+0.010
 submerged+0.010
Neg logit contributions
 Math-0.013
 automobiles-0.012
 dictionary-0.012
 books-0.011
 sidewalk-0.011
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
 asleep+0.001
 closer+0.001
 hes+0.001
 nearer+0.001
eled+0.001
Neg logit contributions
 Math-0.002
 sidewalk-0.002
issors-0.002
Ju-0.002
 dictionary-0.001
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 dictionary+0.005
 sidewalk+0.005
 books+0.005
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
 submerged-0.006
 hes-0.005
Token She
Token ID1375
Feature activation-4.37e-04

Pos logit contributions
 Math+0.006
 automobiles+0.005
 books+0.005
 sidewalk+0.005
 dictionary+0.005
Neg logit contributions
 nearer-0.006
apa-0.006
 closer-0.006
eled-0.006
 hes-0.006
Token never
Token ID1239
Feature activation-1.68e-02

Pos logit contributions
 nearer+0.001
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 books-0.001
 dictionary-0.001
 sidewalk-0.001
 automobiles-0.001
Token knew
Token ID2993
Feature activation-8.86e-03

Pos logit contributions
 nearer+0.023
 closer+0.022
eled+0.021
 afloat+0.021
 submerged+0.021
Neg logit contributions
 Math-0.030
 automobiles-0.028
 dictionary-0.027
 piano-0.027
 books-0.027
Token what
Token ID644
Feature activation+9.85e-03

Pos logit contributions
 nearer+0.013
 closer+0.013
 hes+0.012
apa+0.011
eled+0.011
Neg logit contributions
 Math-0.015
 sidewalk-0.014
 automobiles-0.014
 dictionary-0.013
 pavement-0.013
Token was
Token ID373
Feature activation+2.35e-02

Pos logit contributions
 Math+0.016
 sidewalk+0.016
 pavement+0.016
 automobiles+0.015
issors+0.015
Neg logit contributions
 nearer-0.013
 closer-0.013
 hes-0.012
apa-0.011
 stro-0.011
Token going
Token ID1016
Feature activation+4.29e-03

Pos logit contributions
 Math+0.041
 sidewalk+0.036
 automobiles+0.036
 dictionary+0.035
 books+0.035
Neg logit contributions
 nearer-0.036
 closer-0.035
 hes-0.031
 afloat-0.030
 submerged-0.029
Token to
Token ID284
Feature activation-1.18e-03

Pos logit contributions
 Math+0.007
 dictionary+0.006
 automobiles+0.006
 books+0.006
 sidewalk+0.006
Neg logit contributions
 nearer-0.007
 closer-0.007
 afloat-0.007
eled-0.006
 submerged-0.006
Token happen
Token ID1645
Feature activation-1.29e-02

Pos logit contributions
 nearer+0.003
 closer+0.002
 afloat+0.002
 hes+0.002
eled+0.002
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
 sidewalk-0.001
Token each
Token ID1123
Feature activation+1.03e-02

Pos logit contributions
 nearer+0.021
 afloat+0.020
eled+0.020
 closer+0.019
 submerged+0.019
Neg logit contributions
 Math-0.020
 books-0.017
 dictionary-0.017
 automobiles-0.017
 sidewalk-0.017
Token time
Token ID640
Feature activation-2.37e-03

Pos logit contributions
 Math+0.014
aces+0.013
 automobiles+0.013
 sidewalk+0.012
Deal+0.012
Neg logit contributions
 closer-0.018
 nearer-0.018
eled-0.016
 deeper-0.016
 hes-0.016
Token was
Token ID373
Feature activation+2.96e-03

Pos logit contributions
 nearer+0.010
 afloat+0.010
 closer+0.009
 submerged+0.009
igi+0.009
Neg logit contributions
 Math-0.008
 books-0.007
 dictionary-0.007
 piano-0.007
 automobiles-0.007
Token very
Token ID845
Feature activation+6.40e-04

Pos logit contributions
 Math+0.004
 kids+0.003
 piano+0.003
 books+0.003
 mine+0.003
Neg logit contributions
apa-0.005
igi-0.005
 nearer-0.005
 afloat-0.005
iri-0.005
Token frustrated
Token ID14718
Feature activation+4.32e-03

Pos logit contributions
 Math+0.001
 kids+0.001
 test+0.001
 law+0.001
 mine+0.001
Neg logit contributions
igi-0.001
eled-0.001
 nearer-0.001
 afloat-0.001
iri-0.001
Token at
Token ID379
Feature activation+1.82e-02

Pos logit contributions
 Math+0.008
 automobiles+0.007
 sidewalk+0.007
 dictionary+0.007
 books+0.007
Neg logit contributions
 nearer-0.006
 closer-0.005
 hes-0.005
 afloat-0.005
 submerged-0.005
Token not
Token ID407
Feature activation-1.65e-03

Pos logit contributions
 Math+0.025
 automobiles+0.022
 sidewalk+0.021
 dictionary+0.021
 books+0.020
Neg logit contributions
 nearer-0.033
 closer-0.032
 hes-0.030
 afloat-0.030
eled-0.030
Token being
Token ID852
Feature activation+2.53e-02

Pos logit contributions
 nearer+0.002
 closer+0.002
 afloat+0.002
 hes+0.002
 submerged+0.002
Neg logit contributions
 Math-0.003
 automobiles-0.003
 dictionary-0.002
 books-0.002
 sidewalk-0.002
Token able
Token ID1498
Feature activation+2.18e-03

Pos logit contributions
 Math+0.046
 sidewalk+0.044
 automobiles+0.043
 dictionary+0.041
board+0.041
Neg logit contributions
 nearer-0.035
 closer-0.035
 submerged-0.031
 afloat-0.031
 distracted-0.030
Token to
Token ID284
Feature activation-3.16e-03

Pos logit contributions
 Math+0.004
 automobiles+0.003
 dictionary+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.003
 afloat-0.003
 closer-0.003
eled-0.003
 submerged-0.003
Token reach
Token ID3151
Feature activation-1.89e-02

Pos logit contributions
 nearer+0.005
 closer+0.005
 afloat+0.005
 submerged+0.005
 hes+0.005
Neg logit contributions
 Math-0.005
 automobiles-0.005
 dictionary-0.004
 computer-0.004
 books-0.004
Token it
Token ID340
Feature activation-5.91e-04

Pos logit contributions
 nearer+0.034
 afloat+0.033
 closer+0.032
eled+0.032
 submerged+0.031
Neg logit contributions
 Math-0.026
 dictionary-0.022
 automobiles-0.022
 sidewalk-0.021
 books-0.021
Token.
Token ID13
Feature activation+8.28e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
 submerged+0.001
eled+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 sidewalk-0.001
 books-0.001
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
 Math+0.009
 automobiles+0.008
 dictionary+0.008
 sidewalk+0.008
 books+0.008
Neg logit contributions
 nearer-0.009
 afloat-0.008
 closer-0.008
eled-0.008
 submerged-0.008
Token "
Token ID366
Feature activation-7.03e-03

Pos logit contributions
 Math+0.018
 automobiles+0.015
 books+0.015
 dictionary+0.015
 sidewalk+0.015
Neg logit contributions
 nearer-0.028
 closer-0.027
 afloat-0.026
 hes-0.026
eled-0.026
TokenWell
Token ID5779
Feature activation-7.15e-03

Pos logit contributions
 nearer+0.013
eled+0.012
 closer+0.012
 afloat+0.012
 submerged+0.011
Neg logit contributions
 Math-0.010
 books-0.009
 automobiles-0.008
 sidewalk-0.008
 dictionary-0.008
Token done
Token ID1760
Feature activation+1.77e-03

Pos logit contributions
 nearer+0.012
 closer+0.011
 afloat+0.011
eled+0.011
 hes+0.011
Neg logit contributions
 Math-0.011
 automobiles-0.010
 sidewalk-0.010
 dictionary-0.010
 piano-0.009
Token Brad
Token ID8114
Feature activation-1.85e-03

Pos logit contributions
 Math+0.003
 sidewalk+0.003
 automobiles+0.003
 dictionary+0.003
issors+0.003
Neg logit contributions
 nearer-0.002
 closer-0.002
 afloat-0.002
 hes-0.002
apa-0.002
Token!
Token ID0
Feature activation+2.51e-02

Pos logit contributions
 nearer+0.003
 closer+0.003
apa+0.003
 afloat+0.003
 hes+0.002
Neg logit contributions
 Math-0.003
 automobiles-0.003
 sidewalk-0.003
 dictionary-0.003
 piano-0.003
Token Now
Token ID2735
Feature activation+1.04e-02

Pos logit contributions
 books+0.028
 sidewalk+0.027
 Math+0.027
 games+0.026
 piano+0.026
Neg logit contributions
 nearer-0.047
apa-0.046
 hes-0.045
eled-0.044
 closer-0.044
Token you
Token ID345
Feature activation-7.22e-04

Pos logit contributions
 Math+0.013
 automobiles+0.012
 sidewalk+0.012
 dictionary+0.012
 computer+0.011
Neg logit contributions
 nearer-0.020
 closer-0.019
 afloat-0.018
 hes-0.018
apa-0.018
Token know
Token ID760
Feature activation-6.20e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 sidewalk-0.001
 books-0.001
Token what
Token ID644
Feature activation+8.24e-03

Pos logit contributions
 nearer+0.009
 closer+0.009
apa+0.008
 hes+0.008
 afloat+0.008
Neg logit contributions
 Math-0.010
 sidewalk-0.009
 automobiles-0.009
 dictionary-0.009
 piano-0.009
Token was
Token ID373
Feature activation+2.13e-02

Pos logit contributions
 Math+0.012
 automobiles+0.011
 sidewalk+0.011
 pavement+0.011
issors+0.011
Neg logit contributions
 nearer-0.013
 closer-0.013
 hes-0.012
apa-0.012
 pap-0.012
Token the
Token ID262
Feature activation+1.71e-03

Pos logit contributions
 Math+0.015
 books+0.013
 automobiles+0.013
 dictionary+0.013
 sidewalk+0.012
Neg logit contributions
 nearer-0.024
 afloat-0.024
 submerged-0.023
eled-0.023
apa-0.023
Token tree
Token ID5509
Feature activation-9.47e-03

Pos logit contributions
 Math+0.003
 books+0.002
 automobiles+0.002
issors+0.002
aces+0.002
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 submerged-0.003
eled-0.002
Token.
Token ID13
Feature activation-5.99e-04

Pos logit contributions
 nearer+0.015
 closer+0.015
 afloat+0.015
 submerged+0.014
eled+0.014
Neg logit contributions
 Math-0.015
 automobiles-0.012
 dictionary-0.012
 sidewalk-0.012
 guys-0.012
Token "
Token ID366
Feature activation+1.54e-04

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
 submerged+0.001
eled+0.001
Neg logit contributions
 Math-0.001
 piano-0.001
 automobiles-0.001
 dictionary-0.001
 books-0.001
TokenWhat
Token ID2061
Feature activation+6.42e-03

Pos logit contributions
 Math+0.000
 automobiles+0.000
 dictionary+0.000
 books+0.000
 sidewalk+0.000
Neg logit contributions
 nearer-0.000
eled-0.000
 closer-0.000
 afloat-0.000
 submerged-0.000
Token's
Token ID338
Feature activation+2.71e-02

Pos logit contributions
 sidewalk+0.009
 pavement+0.009
 Math+0.009
 automobiles+0.008
issors+0.008
Neg logit contributions
 nearer-0.010
 closer-0.010
pt-0.009
 hes-0.009
 submerged-0.009
Token wrong
Token ID2642
Feature activation-2.53e-04

Pos logit contributions
 Math+0.041
board+0.037
 sidewalk+0.035
 pavement+0.034
 dictionary+0.034
Neg logit contributions
 closer-0.044
 nearer-0.044
 afloat-0.040
 perched-0.039
 distracted-0.038
Token,
Token ID11
Feature activation+1.42e-02

Pos logit contributions
 nearer+0.000
 closer+0.000
 afloat+0.000
eled+0.000
 submerged+0.000
Neg logit contributions
 Math-0.000
 automobiles-0.000
 sidewalk-0.000
 dictionary-0.000
 books-0.000
Token Lily
Token ID20037
Feature activation+1.06e-02

Pos logit contributions
aces+0.027
 Math+0.026
cles+0.026
 sidewalk+0.025
 pavement+0.025
Neg logit contributions
 closer-0.017
 hes-0.014
 nearer-0.014
iri-0.014
Wal-0.014
Token?"
Token ID1701
Feature activation+1.84e-03

Pos logit contributions
 Math+0.020
 sidewalk+0.017
 automobiles+0.017
 books+0.017
 Kids+0.016
Neg logit contributions
 nearer-0.014
 closer-0.013
 afloat-0.013
eled-0.013
igi-0.012
Token he
Token ID339
Feature activation+1.39e-02

Pos logit contributions
 Math+0.003
 automobiles+0.003
 piano+0.003
 sidewalk+0.003
 dictionary+0.003
Neg logit contributions
 nearer-0.003
 closer-0.003
 hes-0.003
eled-0.003
 stro-0.002
Token my
Token ID616
Feature activation+8.42e-03

Pos logit contributions
 Math+0.006
 sidewalk+0.005
 automobiles+0.005
 dictionary+0.005
 pavement+0.005
Neg logit contributions
 closer-0.006
 nearer-0.006
 afloat-0.005
 deeper-0.005
 submerged-0.005
Token way
Token ID835
Feature activation-8.74e-03

Pos logit contributions
 Math+0.011
Ju+0.009
 sidewalk+0.009
whe+0.008
 automobiles+0.008
Neg logit contributions
 closer-0.015
 nearer-0.014
 afloat-0.014
 bur-0.013
 hes-0.013
Token because
Token ID780
Feature activation+1.22e-02

Pos logit contributions
 nearer+0.016
 closer+0.016
 afloat+0.015
 submerged+0.014
 deeper+0.014
Neg logit contributions
 Math-0.012
 dictionary-0.010
 books-0.010
 automobiles-0.010
 sidewalk-0.010
Token I
Token ID314
Feature activation+7.02e-03

Pos logit contributions
 Math+0.021
Ju+0.020
 sidewalk+0.020
arrass+0.020
 dictionary+0.019
Neg logit contributions
 closer-0.016
 nearer-0.016
 deeper-0.013
eled-0.013
 afloat-0.013
Token am
Token ID716
Feature activation+3.96e-03

Pos logit contributions
 Math+0.009
 dictionary+0.008
 automobiles+0.008
 books+0.008
 sidewalk+0.008
Neg logit contributions
 nearer-0.013
 afloat-0.013
eled-0.012
 closer-0.012
 submerged-0.012
Token very
Token ID845
Feature activation+2.19e-03

Pos logit contributions
 Math+0.006
 dictionary+0.006
 automobiles+0.006
 sidewalk+0.006
 books+0.006
Neg logit contributions
 nearer-0.006
 closer-0.006
 afloat-0.006
 submerged-0.006
eled-0.006
Token competitive
Token ID7606
Feature activation+1.01e-02

Pos logit contributions
 Math+0.003
 automobiles+0.003
 dictionary+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.004
 closer-0.004
 afloat-0.003
 submerged-0.003
eled-0.003
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
 Math+0.015
 sidewalk+0.014
 dictionary+0.014
 automobiles+0.013
Ju+0.013
Neg logit contributions
 closer-0.015
 nearer-0.015
 afloat-0.015
 hes-0.014
 asleep-0.014
Token I
Token ID314
Feature activation+4.13e-03

Pos logit contributions
 Math+0.016
board+0.016
 automobiles+0.016
 dictionary+0.015
 computer+0.015
Neg logit contributions
 closer-0.021
 nearer-0.020
 distracted-0.018
 exhausted-0.018
 soon-0.018
Token want
Token ID765
Feature activation-8.83e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 sidewalk+0.005
 dictionary+0.005
 books+0.005
Neg logit contributions
 nearer-0.007
 closer-0.007
 afloat-0.006
 submerged-0.006
eled-0.006
Token to
Token ID284
Feature activation+9.16e-04

Pos logit contributions
 nearer+0.011
 closer+0.011
 hes+0.010
 submerged+0.009
 afloat+0.009
Neg logit contributions
 Math-0.016
 sidewalk-0.015
 dictionary-0.015
 piano-0.014
 automobiles-0.014
Token carefully
Token ID7773
Feature activation-5.45e-03

Pos logit contributions
 nearer+0.020
eled+0.020
 afloat+0.020
 submerged+0.019
 hes+0.019
Neg logit contributions
 Math-0.017
 books-0.015
 dictionary-0.015
 sidewalk-0.014
 automobiles-0.014
Token.
Token ID13
Feature activation+5.50e-03

Pos logit contributions
 nearer+0.011
 afloat+0.011
eled+0.011
 closer+0.010
 submerged+0.010
Neg logit contributions
 Math-0.006
 books-0.005
 sidewalk-0.005
 dictionary-0.005
 automobiles-0.005
Token
Token ID198
Feature activation-3.60e-03

Pos logit contributions
 Math+0.009
 books+0.008
 dictionary+0.008
 automobiles+0.008
 sidewalk+0.008
Neg logit contributions
 nearer-0.008
eled-0.007
 closer-0.007
 afloat-0.007
apa-0.007
Token
Token ID198
Feature activation+5.09e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
eled+0.006
 submerged+0.006
 closer+0.006
Neg logit contributions
 Math-0.005
 sidewalk-0.004
 dictionary-0.004
 automobiles-0.004
 piano-0.004
TokenHowever
Token ID4864
Feature activation-5.98e-03

Pos logit contributions
 Math+0.009
 dictionary+0.008
 books+0.008
 automobiles+0.008
 sidewalk+0.008
Neg logit contributions
 nearer-0.007
 afloat-0.007
eled-0.007
 closer-0.007
 submerged-0.007
Token,
Token ID11
Feature activation+9.31e-03

Pos logit contributions
 nearer+0.010
 afloat+0.009
eled+0.009
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.009
 dictionary-0.008
 books-0.008
 automobiles-0.008
 sidewalk-0.008
Token suddenly
Token ID6451
Feature activation-6.84e-03

Pos logit contributions
 Math+0.015
 sidewalk+0.014
aces+0.014
 automobiles+0.014
 dictionary+0.013
Neg logit contributions
 nearer-0.014
 closer-0.013
eled-0.013
 afloat-0.012
apa-0.012
Token Winn
Token ID19454
Feature activation-2.42e-02

Pos logit contributions
 nearer+0.011
 afloat+0.011
eled+0.011
 closer+0.011
 submerged+0.010
Neg logit contributions
 Math-0.010
 books-0.009
 dictionary-0.009
 automobiles-0.009
 sidewalk-0.009
Tokenie
Token ID494
Feature activation-7.15e-03

Pos logit contributions
 nearer+0.029
eled+0.029
 closer+0.027
 afloat+0.027
apa+0.026
Neg logit contributions
 Math-0.048
 sidewalk-0.042
 automobiles-0.042
 dictionary-0.042
 books-0.041
Token forgot
Token ID16453
Feature activation-1.12e-03

Pos logit contributions
 nearer+0.012
 afloat+0.012
 closer+0.011
eled+0.011
igi+0.011
Neg logit contributions
 Math-0.010
 dictionary-0.009
 books-0.009
 kids-0.009
 sidewalk-0.009
Token that
Token ID326
Feature activation+4.38e-03

Pos logit contributions
 nearer+0.002
 closer+0.002
 submerged+0.002
eled+0.002
 afloat+0.002
Neg logit contributions
 Math-0.002
 automobiles-0.002
 sidewalk-0.002
 dictionary-0.002
 piano-0.002
Token on
Token ID319
Feature activation+2.27e-05

Pos logit contributions
 Math+0.013
 books+0.011
 dictionary+0.011
 piano+0.011
 sidewalk+0.011
Neg logit contributions
 nearer-0.013
eled-0.013
 afloat-0.012
 submerged-0.012
apa-0.012
Token the
Token ID262
Feature activation+4.76e-03

Pos logit contributions
 Math+0.000
 dictionary+0.000
 automobiles+0.000
 sidewalk+0.000
 books+0.000
Neg logit contributions
 nearer-0.000
 afloat-0.000
 closer-0.000
eled-0.000
 submerged-0.000
Token street
Token ID4675
Feature activation+7.75e-03

Pos logit contributions
 Math+0.007
aces+0.006
issors+0.005
 automobiles+0.005
 dictionary+0.005
Neg logit contributions
 closer-0.008
 afloat-0.008
 nearer-0.008
 hes-0.007
 deeper-0.007
Token stop
Token ID2245
Feature activation-5.47e-03

Pos logit contributions
 Math+0.012
 dictionary+0.010
 automobiles+0.010
 books+0.009
 sidewalk+0.009
Neg logit contributions
 nearer-0.013
 closer-0.012
 afloat-0.012
eled-0.012
 submerged-0.012
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
 nearer+0.010
 afloat+0.009
eled+0.009
 closer+0.009
 submerged+0.009
Neg logit contributions
 Math-0.008
 automobiles-0.007
 dictionary-0.007
 sidewalk-0.007
 books-0.007
Token listen
Token ID6004
Feature activation+5.37e-03

Pos logit contributions
 nearer+0.017
 closer+0.016
 afloat+0.016
eled+0.016
 submerged+0.016
Neg logit contributions
 Math-0.018
 dictionary-0.016
 automobiles-0.016
 sidewalk-0.016
 books-0.016
Token.
Token ID13
Feature activation+1.11e-03

Pos logit contributions
 Math+0.008
 dictionary+0.007
 automobiles+0.007
 sidewalk+0.007
 books+0.007
Neg logit contributions
 nearer-0.009
 closer-0.008
 afloat-0.008
eled-0.008
 submerged-0.008
Token
Token ID198
Feature activation-3.59e-03

Pos logit contributions
 Math+0.002
 books+0.002
 dictionary+0.002
 piano+0.002
 automobiles+0.002
Neg logit contributions
 nearer-0.002
apa-0.002
eled-0.002
 afloat-0.001
 closer-0.001
Token
Token ID198
Feature activation+2.09e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
eled+0.006
 numb+0.006
 closer+0.005
Neg logit contributions
 Math-0.006
 sidewalk-0.005
 books-0.004
 dictionary-0.004
 kids-0.004
TokenWhen
Token ID2215
Feature activation+7.32e-03

Pos logit contributions
 Math+0.003
 dictionary+0.003
 automobiles+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.004
 afloat-0.004
eled-0.003
 closer-0.003
 submerged-0.003
Token the
Token ID262
Feature activation+5.58e-03

Pos logit contributions
 Math+0.012
 dictionary+0.011
 automobiles+0.011
 sidewalk+0.011
 piano+0.011
Neg logit contributions
 nearer-0.011
 closer-0.010
 afloat-0.010
eled-0.010
 submerged-0.010
Token together
Token ID1978
Feature activation+5.24e-03

Pos logit contributions
 nearer+0.010
 closer+0.010
 afloat+0.009
 submerged+0.009
eled+0.009
Neg logit contributions
 Math-0.010
 automobiles-0.009
 dictionary-0.009
 sidewalk-0.009
 mine-0.008
Token and
Token ID290
Feature activation-3.53e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
 dictionary+0.007
 sidewalk+0.006
 books+0.006
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.008
eled-0.008
 submerged-0.008
Token watch
Token ID2342
Feature activation-3.83e-03

Pos logit contributions
 nearer+0.006
 closer+0.006
 afloat+0.005
 submerged+0.005
 hes+0.005
Neg logit contributions
 Math-0.005
 automobiles-0.005
aces-0.005
issors-0.005
 dictionary-0.005
Token their
Token ID511
Feature activation-2.95e-03

Pos logit contributions
 closer+0.006
 nearer+0.006
 afloat+0.005
 hes+0.005
 deeper+0.005
Neg logit contributions
 Math-0.006
 dictionary-0.006
Deal-0.006
issors-0.005
 sidewalk-0.005
Token favourite
Token ID12507
Feature activation+2.13e-02

Pos logit contributions
 nearer+0.005
 closer+0.005
 afloat+0.005
 submerged+0.005
 hes+0.005
Neg logit contributions
 Math-0.004
 dictionary-0.003
 automobiles-0.003
 sidewalk-0.003
aces-0.003
Token shows
Token ID2523
Feature activation+1.52e-03

Pos logit contributions
 Math+0.027
issors+0.024
aces+0.022
 sidewalk+0.021
Ju+0.021
Neg logit contributions
 closer-0.039
 nearer-0.038
 fascination-0.036
 hes-0.036
eled-0.035
Token.
Token ID13
Feature activation+9.77e-04

Pos logit contributions
 Math+0.002
 dictionary+0.002
 sidewalk+0.002
 automobiles+0.002
 piano+0.002
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.002
 hes-0.002
 submerged-0.002
Token They
Token ID1119
Feature activation-8.97e-03

Pos logit contributions
 piano+0.001
 exam+0.001
 Math+0.001
 dictionary+0.001
 test+0.001
Neg logit contributions
 nearer-0.001
eled-0.001
 afloat-0.001
 closer-0.001
apa-0.001
Token had
Token ID550
Feature activation-3.97e-04

Pos logit contributions
 nearer+0.014
 closer+0.014
eled+0.014
 afloat+0.014
 submerged+0.013
Neg logit contributions
 Math-0.014
 automobiles-0.012
 sidewalk-0.012
 dictionary-0.012
 books-0.011
Token so
Token ID523
Feature activation-9.85e-03

Pos logit contributions
 nearer+0.001
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 automobiles-0.001
 sidewalk-0.001
 piano-0.001
Token much
Token ID881
Feature activation-3.11e-03

Pos logit contributions
 nearer+0.011
 afloat+0.010
eled+0.010
 closer+0.010
 submerged+0.010
Neg logit contributions
 Math-0.020
 dictionary-0.018
 automobiles-0.018
 books-0.018
 sidewalk-0.018
Token to
Token ID284
Feature activation+8.97e-03

Pos logit contributions
 Math+0.003
 books+0.003
 kids+0.003
 automobiles+0.003
 sidewalk+0.003
Neg logit contributions
eled-0.004
 afloat-0.004
 nearer-0.004
apa-0.004
 submerged-0.004
Token the
Token ID262
Feature activation+8.65e-04

Pos logit contributions
 Math+0.015
 automobiles+0.013
 dictionary+0.013
 sidewalk+0.013
 books+0.013
Neg logit contributions
 nearer-0.014
 closer-0.013
 afloat-0.013
eled-0.012
 submerged-0.012
Token table
Token ID3084
Feature activation-3.27e-03

Pos logit contributions
 Math+0.001
 automobiles+0.001
 dictionary+0.001
aces+0.001
 sidewalk+0.001
Neg logit contributions
 nearer-0.001
 closer-0.001
 afloat-0.001
 submerged-0.001
eled-0.001
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
 nearer+0.007
 afloat+0.006
eled+0.006
 closer+0.006
 submerged+0.006
Neg logit contributions
 Math-0.004
 dictionary-0.003
 books-0.003
 automobiles-0.003
 sidewalk-0.003
Token She
Token ID1375
Feature activation+7.12e-04

Pos logit contributions
 nearer+0.015
 afloat+0.015
eled+0.015
 submerged+0.014
 closer+0.014
Neg logit contributions
 Math-0.009
 dictionary-0.007
 automobiles-0.007
 sidewalk-0.007
 books-0.007
Token sees
Token ID7224
Feature activation+3.91e-03

Pos logit contributions
 Math+0.001
 books+0.001
 dictionary+0.001
 kids+0.001
 automobiles+0.001
Neg logit contributions
 afloat-0.001
 nearer-0.001
eled-0.001
 numb-0.001
igi-0.001
Token the
Token ID262
Feature activation-2.03e-03

Pos logit contributions
 Math+0.005
 dictionary+0.004
 automobiles+0.004
 books+0.004
 sidewalk+0.004
Neg logit contributions
 nearer-0.008
 afloat-0.007
eled-0.007
 closer-0.007
 submerged-0.007
Token car
Token ID1097
Feature activation+5.89e-03

Pos logit contributions
 nearer+0.003
 closer+0.003
 afloat+0.003
 submerged+0.003
eled+0.003
Neg logit contributions
 Math-0.003
 automobiles-0.002
 dictionary-0.002
 sidewalk-0.002
aces-0.002
Token and
Token ID290
Feature activation+1.31e-03

Pos logit contributions
 Math+0.010
 dictionary+0.009
 automobiles+0.008
 sidewalk+0.008
 books+0.008
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.008
 submerged-0.008
eled-0.008
Token the
Token ID262
Feature activation-4.93e-03

Pos logit contributions
 Math+0.002
 dictionary+0.002
 sidewalk+0.001
 automobiles+0.001
 piano+0.001
Neg logit contributions
 nearer-0.002
 closer-0.002
 hes-0.002
eled-0.002
 afloat-0.002
Token toys
Token ID14958
Feature activation+1.60e-02

Pos logit contributions
 afloat+0.009
 nearer+0.009
 closer+0.009
 submerged+0.008
eled+0.008
Neg logit contributions
 Math-0.007
 automobiles-0.005
 dictionary-0.005
 sidewalk-0.005
Deal-0.005
Token She
Token ID1375
Feature activation+1.34e-03

Pos logit contributions
Deal+0.015
 Math+0.014
 automobiles+0.014
 mine+0.014
 earthquake+0.014
Neg logit contributions
 nearer-0.013
uttered-0.012
 closer-0.012
 hes-0.012
 stro-0.011
Token didn
Token ID1422
Feature activation+2.47e-03

Pos logit contributions
 Math+0.002
 books+0.002
 dictionary+0.002
 kids+0.002
 sidewalk+0.002
Neg logit contributions
 afloat-0.002
 nearer-0.002
 numb-0.002
igi-0.002
apa-0.002
Token't
Token ID470
Feature activation+2.74e-03

Pos logit contributions
 Math+0.003
 piano+0.002
 test+0.002
 dictionary+0.002
 sidewalk+0.002
Neg logit contributions
eled-0.005
 nearer-0.005
 afloat-0.005
igi-0.005
apa-0.005
Token know
Token ID760
Feature activation-1.20e-02

Pos logit contributions
 Math+0.006
 automobiles+0.005
 dictionary+0.005
 sidewalk+0.005
 books+0.005
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 submerged-0.003
 hes-0.003
Token what
Token ID644
Feature activation+5.91e-03

Pos logit contributions
 nearer+0.016
 closer+0.015
 hes+0.014
 afloat+0.014
eled+0.014
Neg logit contributions
 Math-0.022
 sidewalk-0.020
 dictionary-0.019
 automobiles-0.019
 piano-0.019
Token to
Token ID284
Feature activation-5.15e-03

Pos logit contributions
 Math+0.011
 sidewalk+0.010
 automobiles+0.010
 pavement+0.010
issors+0.010
Neg logit contributions
 nearer-0.007
 closer-0.007
 hes-0.006
apa-0.006
pt-0.006
Token do
Token ID466
Feature activation+4.81e-03

Pos logit contributions
 nearer+0.011
 closer+0.011
 afloat+0.011
 hes+0.010
pt+0.010
Neg logit contributions
 Math-0.005
 automobiles-0.004
 robots-0.004
 dictionary-0.004
 sidewalk-0.003
Token,
Token ID11
Feature activation-5.65e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
 dictionary+0.007
 sidewalk+0.007
 piano+0.007
Neg logit contributions
 nearer-0.007
 closer-0.007
 hes-0.006
 afloat-0.006
pt-0.006
Token so
Token ID523
Feature activation+8.41e-03

Pos logit contributions
 nearer+0.010
 closer+0.010
 afloat+0.009
 hes+0.009
eled+0.009
Neg logit contributions
 Math-0.008
 automobiles-0.007
 dictionary-0.007
 sidewalk-0.007
 piano-0.007
Token she
Token ID673
Feature activation-2.51e-03

Pos logit contributions
 Math+0.013
 dictionary+0.011
 automobiles+0.011
 books+0.011
 sidewalk+0.011
Neg logit contributions
 nearer-0.013
 afloat-0.013
eled-0.013
 closer-0.013
 submerged-0.012
Token asked
Token ID1965
Feature activation+1.30e-02

Pos logit contributions
 nearer+0.004
 afloat+0.004
eled+0.004
 closer+0.003
 submerged+0.003
Neg logit contributions
 Math-0.004
 dictionary-0.004
 books-0.004
 sidewalk-0.003
 automobiles-0.003
Token far
Token ID1290
Feature activation+4.93e-03

Pos logit contributions
 test+0.003
 Math+0.003
 books+0.003
 number+0.003
 dictionary+0.003
Neg logit contributions
eled-0.005
 afloat-0.005
ounced-0.005
 nearer-0.005
igi-0.004
Tokenaway
Token ID8272
Feature activation-4.17e-03

Pos logit contributions
 Math+0.009
 dictionary+0.008
 automobiles+0.008
 books+0.008
 sidewalk+0.007
Neg logit contributions
 nearer-0.007
 afloat-0.007
eled-0.007
 closer-0.007
 submerged-0.006
Token land
Token ID1956
Feature activation+2.66e-03

Pos logit contributions
 nearer+0.006
 afloat+0.006
eled+0.006
 submerged+0.006
apa+0.006
Neg logit contributions
 Math-0.007
 books-0.006
 sidewalk-0.006
 dictionary-0.006
 kids-0.006
Token,
Token ID11
Feature activation+2.33e-03

Pos logit contributions
 Math+0.004
 dictionary+0.004
 automobiles+0.004
 books+0.004
 sidewalk+0.004
Neg logit contributions
 nearer-0.004
 afloat-0.004
eled-0.004
 closer-0.004
 submerged-0.004
Token the
Token ID262
Feature activation-3.41e-04

Pos logit contributions
 Math+0.004
 automobiles+0.004
 dictionary+0.004
 sidewalk+0.004
issors+0.004
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 hes-0.003
 submerged-0.003
Token little
Token ID1310
Feature activation-6.90e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
 submerged+0.001
 hes+0.001
Neg logit contributions
 Math-0.000
 automobiles-0.000
issors-0.000
 dictionary-0.000
 sidewalk-0.000
Token girl
Token ID2576
Feature activation-1.24e-02

Pos logit contributions
 nearer+0.014
 closer+0.014
 afloat+0.013
eled+0.013
 hes+0.013
Neg logit contributions
 Math-0.008
 automobiles-0.007
 sidewalk-0.007
 dictionary-0.006
issors-0.006
Token saw
Token ID2497
Feature activation+1.07e-02

Pos logit contributions
 afloat+0.022
 nearer+0.022
 numb+0.021
igi+0.021
iri+0.020
Neg logit contributions
 Math-0.015
 kids-0.014
 books-0.014
 dictionary-0.014
 sidewalk-0.014
Token beautiful
Token ID4950
Feature activation-3.09e-03

Pos logit contributions
 Math+0.016
 books+0.014
 dictionary+0.014
 automobiles+0.014
 sidewalk+0.014
Neg logit contributions
 nearer-0.018
 afloat-0.017
eled-0.017
 closer-0.016
 submerged-0.016
Token castles
Token ID49203
Feature activation+6.21e-03

Pos logit contributions
 nearer+0.006
 closer+0.005
 afloat+0.005
eled+0.005
 submerged+0.005
Neg logit contributions
 Math-0.004
 automobiles-0.003
 sidewalk-0.003
 dictionary-0.003
issors-0.003
Token and
Token ID290
Feature activation-4.11e-03

Pos logit contributions
 Math+0.008
 books+0.007
 sidewalk+0.007
 automobiles+0.007
 dictionary+0.007
Neg logit contributions
 nearer-0.011
eled-0.011
 afloat-0.011
 closer-0.010
apa-0.010
Token to
Token ID284
Feature activation-2.62e-03

Pos logit contributions
 nearer+0.000
 afloat+0.000
eled+0.000
 closer+0.000
 submerged+0.000
Neg logit contributions
 Math-0.000
 dictionary-0.000
 automobiles-0.000
 books-0.000
 sidewalk-0.000
Token go
Token ID467
Feature activation-8.10e-03

Pos logit contributions
 nearer+0.004
 closer+0.004
 afloat+0.004
 submerged+0.004
 hes+0.004
Neg logit contributions
 Math-0.004
 automobiles-0.003
 dictionary-0.003
 books-0.003
 computer-0.003
Token home
Token ID1363
Feature activation+2.34e-03

Pos logit contributions
 nearer+0.012
eled+0.012
apa+0.012
 afloat+0.012
 hes+0.012
Neg logit contributions
 Math-0.013
 books-0.011
 automobiles-0.011
 sidewalk-0.011
 kids-0.011
Token.
Token ID13
Feature activation+2.98e-03

Pos logit contributions
 Math+0.004
 dictionary+0.003
 automobiles+0.003
 sidewalk+0.003
 books+0.003
Neg logit contributions
 nearer-0.004
 afloat-0.004
 closer-0.004
eled-0.004
 submerged-0.004
Token You
Token ID921
Feature activation+6.44e-03

Pos logit contributions
 sidewalk+0.003
 games+0.003
 books+0.003
 piano+0.003
 game+0.003
Neg logit contributions
 nearer-0.005
apa-0.005
 stro-0.005
 closer-0.005
 hes-0.005
Token will
Token ID481
Feature activation-3.14e-03

Pos logit contributions
 Math+0.010
 sidewalk+0.009
 dictionary+0.009
 automobiles+0.009
 pavement+0.009
Neg logit contributions
 nearer-0.010
 closer-0.010
 afloat-0.010
pt-0.010
 submerged-0.009
Token not
Token ID407
Feature activation+5.62e-03

Pos logit contributions
 nearer+0.006
 afloat+0.005
 closer+0.005
eled+0.005
 submerged+0.005
Neg logit contributions
 Math-0.004
 automobiles-0.004
 dictionary-0.004
 books-0.004
 sidewalk-0.004
Token be
Token ID307
Feature activation+9.30e-03

Pos logit contributions
 Math+0.008
 automobiles+0.007
 books+0.007
 dictionary+0.007
aces+0.007
Neg logit contributions
 nearer-0.009
 closer-0.009
 afloat-0.009
 submerged-0.009
pt-0.008
Token welcome
Token ID7062
Feature activation-1.01e-02

Pos logit contributions
 Math+0.016
 sidewalk+0.015
board+0.015
 dictionary+0.015
 automobiles+0.014
Neg logit contributions
 nearer-0.013
 closer-0.013
 afloat-0.012
 submerged-0.012
eled-0.012
Token here
Token ID994
Feature activation+3.00e-03

Pos logit contributions
 nearer+0.016
 closer+0.016
 afloat+0.015
apa+0.014
 hes+0.014
Neg logit contributions
 Math-0.016
 sidewalk-0.015
 dictionary-0.014
 automobiles-0.014
 pavement-0.013
Token anymore
Token ID7471
Feature activation+1.12e-02

Pos logit contributions
 Math+0.004
 sidewalk+0.004
 dictionary+0.004
 automobiles+0.004
 piano+0.003
Neg logit contributions
 nearer-0.005
 closer-0.005
 afloat-0.005
 hes-0.005
apa-0.005
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 nearer+0.002
 afloat+0.002
 closer+0.002
eled+0.002
apa+0.002
Neg logit contributions
 Math-0.002
 piano-0.002
 dictionary-0.002
 automobiles-0.002
 books-0.002
Token
Token ID198
Feature activation+1.92e-03

Pos logit contributions
 nearer+0.020
eled+0.020
 afloat+0.020
 submerged+0.019
 numb+0.019
Neg logit contributions
 Math-0.013
 books-0.011
 kids-0.011
 Kids-0.011
 sidewalk-0.010
TokenThe
Token ID464
Feature activation-1.12e-03

Pos logit contributions
 Math+0.004
 automobiles+0.003
 dictionary+0.003
 books+0.003
 piano+0.003
Neg logit contributions
 nearer-0.002
 afloat-0.002
 closer-0.002
eled-0.002
 submerged-0.002
Token dragon
Token ID10441
Feature activation-7.23e-03

Pos logit contributions
 nearer+0.002
 closer+0.002
 afloat+0.002
 deeper+0.001
 submerged+0.001
Neg logit contributions
 Math-0.002
 automobiles-0.002
issors-0.002
 sidewalk-0.002
 piano-0.002
Token saw
Token ID2497
Feature activation+1.35e-02

Pos logit contributions
 nearer+0.012
 afloat+0.012
 numb+0.012
eled+0.011
 submerged+0.011
Neg logit contributions
 Math-0.010
 books-0.009
 dictionary-0.009
 kids-0.009
 sidewalk-0.009
Token a
Token ID257
Feature activation-4.26e-03

Pos logit contributions
 Math+0.020
 books+0.017
 automobiles+0.017
 dictionary+0.017
 kids+0.017
Neg logit contributions
 nearer-0.023
 afloat-0.022
eled-0.022
 submerged-0.021
 closer-0.021
Token toy
Token ID13373
Feature activation-2.00e-02

Pos logit contributions
 nearer+0.008
 closer+0.007
 afloat+0.007
eled+0.007
 submerged+0.007
Neg logit contributions
 Math-0.006
 automobiles-0.005
 sidewalk-0.005
 dictionary-0.005
 books-0.005
Token boat
Token ID8848
Feature activation-9.07e-04

Pos logit contributions
 nearer+0.033
 closer+0.032
 afloat+0.031
eled+0.031
 submerged+0.030
Neg logit contributions
 Math-0.030
 automobiles-0.024
 dictionary-0.024
 sidewalk-0.024
 books-0.023
Token and
Token ID290
Feature activation-7.55e-04

Pos logit contributions
 nearer+0.002
 afloat+0.001
 closer+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 automobiles-0.001
 books-0.001
 sidewalk-0.001
Token he
Token ID339
Feature activation+5.41e-03

Pos logit contributions
 nearer+0.001
 closer+0.001
 afloat+0.001
eled+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 automobiles-0.001
 dictionary-0.001
 sidewalk-0.001
 piano-0.001
Token wanted
Token ID2227
Feature activation-7.99e-03

Pos logit contributions
 Math+0.007
 books+0.006
 dictionary+0.006
 sidewalk+0.006
 automobiles+0.005
Neg logit contributions
 nearer-0.010
 afloat-0.010
eled-0.010
 closer-0.010
apa-0.009
Token time
Token ID640
Feature activation-8.36e-03

Pos logit contributions
 Math+0.005
 books+0.004
 dictionary+0.004
 piano+0.004
 kids+0.004
Neg logit contributions
 nearer-0.013
 afloat-0.013
eled-0.012
 closer-0.012
 submerged-0.012
Token,
Token ID11
Feature activation+5.48e-03

Pos logit contributions
 nearer+0.017
eled+0.017
 afloat+0.017
 submerged+0.016
 numb+0.016
Neg logit contributions
 Math-0.010
 kids-0.008
 books-0.008
 dictionary-0.008
 mine-0.008
Token there
Token ID612
Feature activation-1.17e-03

Pos logit contributions
 Math+0.008
 books+0.007
 kids+0.007
 dictionary+0.007
 mine+0.007
Neg logit contributions
 nearer-0.009
 afloat-0.009
eled-0.009
apa-0.009
 submerged-0.009
Token was
Token ID373
Feature activation+1.86e-03

Pos logit contributions
 nearer+0.002
 afloat+0.002
 numb+0.002
 closer+0.002
 submerged+0.002
Neg logit contributions
 Math-0.001
 kids-0.001
 books-0.001
 dictionary-0.001
 sidewalk-0.001
Token a
Token ID257
Feature activation+5.69e-03

Pos logit contributions
 Math+0.002
 books+0.002
 kids+0.002
 games+0.002
 automobiles+0.002
Neg logit contributions
 nearer-0.004
 afloat-0.003
eled-0.003
apa-0.003
 submerged-0.003
Token little
Token ID1310
Feature activation-6.45e-03

Pos logit contributions
 Math+0.008
aces+0.007
 automobiles+0.007
 sidewalk+0.006
issors+0.006
Neg logit contributions
 closer-0.010
 nearer-0.010
 afloat-0.010
 submerged-0.009
Wal-0.009
Token girl
Token ID2576
Feature activation-1.73e-02

Pos logit contributions
 nearer+0.012
 closer+0.012
 afloat+0.012
eled+0.011
 submerged+0.011
Neg logit contributions
 Math-0.008
 sidewalk-0.007
 automobiles-0.007
aces-0.006
 dictionary-0.006
Token named
Token ID3706
Feature activation-2.55e-03

Pos logit contributions
 nearer+0.023
eled+0.022
 closer+0.022
 afloat+0.022
 submerged+0.021
Neg logit contributions
 Math-0.031
 automobiles-0.027
 sidewalk-0.027
 dictionary-0.027
 books-0.027
Token Lily
Token ID20037
Feature activation-2.46e-03

Pos logit contributions
 closer+0.005
 nearer+0.004
 Madd+0.004
apa+0.004
 deeper+0.004
Neg logit contributions
 Math-0.003
 automobiles-0.003
 sidewalk-0.003
 piano-0.003
cles-0.003
Token.
Token ID13
Feature activation-3.07e-03

Pos logit contributions
 nearer+0.004
eled+0.004
 closer+0.004
 afloat+0.004
 submerged+0.004
Neg logit contributions
 Math-0.003
 automobiles-0.003
 sidewalk-0.003
 dictionary-0.003
 books-0.003
Token She
Token ID1375
Feature activation+8.29e-03

Pos logit contributions
 closer+0.004
 nearer+0.004
 numb+0.004
ectar+0.004
 submerged+0.004
Neg logit contributions
 Math-0.005
 piano-0.005
 automobiles-0.005
 law-0.005
 test-0.005
Token was
Token ID373
Feature activation+5.39e-03

Pos logit contributions
 afloat+0.006
 nearer+0.006
 numb+0.006
igi+0.006
 closer+0.006
Neg logit contributions
 Math-0.004
 books-0.004
 kids-0.003
 dictionary-0.003
 games-0.003
Token so
Token ID523
Feature activation+2.53e-03

Pos logit contributions
 Math+0.005
 kids+0.004
 mine+0.004
 number+0.004
 books+0.004
Neg logit contributions
 nearer-0.011
 afloat-0.011
apa-0.011
igi-0.011
eled-0.011
Token proud
Token ID6613
Feature activation-4.67e-03

Pos logit contributions
 Math+0.003
 kids+0.003
 do+0.003
 number+0.003
 games+0.003
Neg logit contributions
 nearer-0.005
 afloat-0.005
eled-0.004
igi-0.004
apa-0.004
Token of
Token ID286
Feature activation+6.00e-03

Pos logit contributions
 nearer+0.008
 closer+0.007
eled+0.007
 afloat+0.007
 submerged+0.007
Neg logit contributions
 Math-0.007
 automobiles-0.006
 sidewalk-0.006
 dictionary-0.006
 piano-0.006
Token his
Token ID465
Feature activation-3.11e-03

Pos logit contributions
 Math+0.011
 piano+0.010
 automobiles+0.010
 sidewalk+0.010
 dictionary+0.010
Neg logit contributions
 nearer-0.008
 closer-0.007
 hes-0.007
eled-0.007
 afloat-0.007
Token caul
Token ID46087
Feature activation-3.00e-02

Pos logit contributions
 nearer+0.005
 afloat+0.005
eled+0.005
 closer+0.005
 submerged+0.005
Neg logit contributions
 Math-0.004
 dictionary-0.004
 books-0.004
 automobiles-0.004
 sidewalk-0.004
Tokeniflower
Token ID42642
Feature activation-8.79e-03

Pos logit contributions
 nearer+0.050
 afloat+0.047
 closer+0.047
eled+0.045
 submerged+0.045
Neg logit contributions
 Math-0.045
 dictionary-0.040
 books-0.040
 sidewalk-0.038
 automobiles-0.038
Token plants
Token ID6134
Feature activation+2.81e-03

Pos logit contributions
 nearer+0.014
 closer+0.014
 afloat+0.014
eled+0.013
apa+0.013
Neg logit contributions
 Math-0.014
 automobiles-0.012
 sidewalk-0.012
 piano-0.011
 dictionary-0.011
Token and
Token ID290
Feature activation+5.58e-03

Pos logit contributions
 Math+0.004
 dictionary+0.003
 automobiles+0.003
 books+0.003
 sidewalk+0.003
Neg logit contributions
 nearer-0.005
 afloat-0.005
eled-0.005
 closer-0.005
 submerged-0.005
Token kept
Token ID4030
Feature activation+3.80e-03

Pos logit contributions
 Math+0.008
 books+0.007
 dictionary+0.007
 automobiles+0.007
 sidewalk+0.007
Neg logit contributions
 nearer-0.010
 afloat-0.009
eled-0.009
 closer-0.009
 submerged-0.009
Token them
Token ID606
Feature activation-8.27e-03

Pos logit contributions
 Math+0.005
 mine+0.004
 books+0.004
 kids+0.004
 dictionary+0.004
Neg logit contributions
 nearer-0.007
 afloat-0.007
apa-0.006
igi-0.006
 submerged-0.006
Token His
Token ID2399
Feature activation+6.49e-04

Pos logit contributions
 nearer+0.008
 afloat+0.008
eled+0.008
 closer+0.008
 submerged+0.008
Neg logit contributions
 Math-0.006
 sidewalk-0.005
 books-0.005
 dictionary-0.005
 automobiles-0.005
Token dad
Token ID9955
Feature activation-8.39e-03

Pos logit contributions
 Math+0.001
 books+0.001
 dictionary+0.001
 sidewalk+0.001
 automobiles+0.001
Neg logit contributions
 nearer-0.001
 afloat-0.001
eled-0.001
 closer-0.001
 submerged-0.001
Token said
Token ID531
Feature activation-9.44e-03

Pos logit contributions
 nearer+0.013
 afloat+0.013
eled+0.012
 closer+0.012
 submerged+0.012
Neg logit contributions
 Math-0.013
 books-0.012
 dictionary-0.012
 automobiles-0.012
 sidewalk-0.011
Token,
Token ID11
Feature activation+1.67e-03

Pos logit contributions
eled+0.018
 nearer+0.018
 afloat+0.017
 submerged+0.017
igi+0.016
Neg logit contributions
 Math-0.012
 books-0.011
 kids-0.010
 dictionary-0.010
 automobiles-0.010
Token "
Token ID366
Feature activation-8.22e-03

Pos logit contributions
 Math+0.002
 automobiles+0.002
 dictionary+0.002
 sidewalk+0.002
 books+0.002
Neg logit contributions
 nearer-0.003
 closer-0.003
 afloat-0.003
 hes-0.003
eled-0.003
TokenOh
Token ID5812
Feature activation-2.45e-02

Pos logit contributions
 nearer+0.011
 closer+0.010
eled+0.010
 afloat+0.010
 submerged+0.010
Neg logit contributions
 Math-0.015
 dictionary-0.014
 books-0.013
 automobiles-0.013
 sidewalk-0.013
Token no
Token ID645
Feature activation-8.58e-03

Pos logit contributions
 nearer+0.038
 afloat+0.036
 closer+0.035
eled+0.035
 submerged+0.034
Neg logit contributions
 Math-0.040
 dictionary-0.035
 automobiles-0.035
 sidewalk-0.035
 books-0.034
Token,
Token ID11
Feature activation-1.92e-03

Pos logit contributions
 nearer+0.011
 closer+0.011
 afloat+0.011
 hes+0.010
 submerged+0.010
Neg logit contributions
 Math-0.015
 dictionary-0.014
 automobiles-0.014
 sidewalk-0.014
 piano-0.014
Token we
Token ID356
Feature activation-1.11e-02

Pos logit contributions
 closer+0.003
 nearer+0.003
 hes+0.003
 deeper+0.003
 numb+0.003
Neg logit contributions
 Math-0.003
 sidewalk-0.003
 automobiles-0.002
 pavement-0.002
 alphabet-0.002
Token have
Token ID423
Feature activation-2.18e-03

Pos logit contributions
 nearer+0.021
 afloat+0.020
eled+0.020
 closer+0.020
 submerged+0.019
Neg logit contributions
 Math-0.014
 books-0.012
 dictionary-0.012
 automobiles-0.012
 kids-0.012
Token to
Token ID284
Feature activation-4.44e-04

Pos logit contributions
 nearer+0.004
 afloat+0.004
eled+0.003
 closer+0.003
 submerged+0.003
Neg logit contributions
 Math-0.003
 books-0.003
 kids-0.003
 automobiles-0.003
 mine-0.003
Token were
Token ID547
Feature activation-1.11e-02

Pos logit contributions
 nearer+0.007
 afloat+0.007
eled+0.006
 closer+0.006
 submerged+0.006
Neg logit contributions
 Math-0.008
 books-0.007
 dictionary-0.007
 automobiles-0.007
 sidewalk-0.007
Token busy
Token ID8179
Feature activation-1.09e-02

Pos logit contributions
 nearer+0.019
eled+0.019
apa+0.019
 afloat+0.018
fly+0.018
Neg logit contributions
 Math-0.015
 kids-0.014
 books-0.014
 piano-0.013
 test-0.013
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
 nearer+0.022
 afloat+0.020
eled+0.020
apa+0.020
igi+0.020
Neg logit contributions
 Math-0.013
 books-0.011
 sidewalk-0.011
 automobiles-0.011
 kids-0.011
Token took
Token ID1718
Feature activation-3.17e-03

Pos logit contributions
 nearer+0.020
 afloat+0.019
eled+0.019
 submerged+0.018
 closer+0.018
Neg logit contributions
 Math-0.023
 books-0.021
 dictionary-0.021
 automobiles-0.021
 sidewalk-0.021
Token the
Token ID262
Feature activation-6.13e-03

Pos logit contributions
 nearer+0.004
 afloat+0.004
eled+0.004
 closer+0.004
 submerged+0.004
Neg logit contributions
 Math-0.006
 books-0.005
 automobiles-0.005
 dictionary-0.005
 sidewalk-0.005
Token oy
Token ID35104
Feature activation-3.56e-02

Pos logit contributions
 nearer+0.008
 afloat+0.008
eled+0.007
 submerged+0.007
apa+0.007
Neg logit contributions
 Math-0.011
 books-0.010
 dictionary-0.010
 kids-0.010
 mine-0.010
Tokenster
Token ID1706
Feature activation-1.16e-02

Pos logit contributions
 nearer+0.054
 fascination+0.052
 afloat+0.051
 submerged+0.051
eled+0.051
Neg logit contributions
 Math-0.050
 books-0.048
 kids-0.047
 dictionary-0.046
 test-0.045
Token to
Token ID284
Feature activation+6.51e-04

Pos logit contributions
 nearer+0.022
eled+0.021
 afloat+0.021
 submerged+0.021
 hes+0.021
Neg logit contributions
 Math-0.013
 books-0.012
 dictionary-0.012
 automobiles-0.011
 sidewalk-0.011
Token a
Token ID257
Feature activation-1.79e-03

Pos logit contributions
 Math+0.001
 sidewalk+0.001
 books+0.001
 kids+0.001
 dictionary+0.001
Neg logit contributions
 nearer-0.001
eled-0.001
 afloat-0.001
 submerged-0.001
apa-0.001
Token hidden
Token ID7104
Feature activation-2.64e-03

Pos logit contributions
eled+0.003
 nearer+0.003
 afloat+0.003
apa+0.003
uttered+0.003
Neg logit contributions
 Math-0.002
 books-0.002
 dictionary-0.002
 sidewalk-0.002
 piano-0.002
Token spot
Token ID4136
Feature activation-2.03e-03

Pos logit contributions
 nearer+0.004
 afloat+0.004
eled+0.004
 closer+0.004
 submerged+0.004
Neg logit contributions
 Math-0.004
 books-0.003
 dictionary-0.003
 sidewalk-0.003
 automobiles-0.003
Token.
Token ID13
Feature activation-3.41e-03

Pos logit contributions
 nearer+0.009
igi+0.009
apa+0.009
 afloat+0.009
eled+0.009
Neg logit contributions
 Math-0.003
 books-0.002
 kids-0.002
 dictionary-0.002
 automobiles-0.002
Token She
Token ID1375
Feature activation-2.61e-03

Pos logit contributions
 nearer+0.006
 closer+0.006
 afloat+0.006
eled+0.006
 submerged+0.005
Neg logit contributions
 Math-0.005
 automobiles-0.004
 books-0.004
 dictionary-0.004
 sidewalk-0.004
Token says
Token ID1139
Feature activation-5.73e-03

Pos logit contributions
 afloat+0.005
 nearer+0.005
eled+0.005
igi+0.005
 submerged+0.005
Neg logit contributions
 Math-0.003
 books-0.003
 dictionary-0.003
 kids-0.003
 automobiles-0.003
Token,
Token ID11
Feature activation+2.65e-03

Pos logit contributions
 afloat+0.010
 nearer+0.010
eled+0.010
 submerged+0.010
igi+0.009
Neg logit contributions
 Math-0.008
 books-0.007
 dictionary-0.007
 automobiles-0.007
 kids-0.007
Token "
Token ID366
Feature activation-1.27e-02

Pos logit contributions
 Math+0.003
 automobiles+0.003
 books+0.003
 sidewalk+0.003
 dictionary+0.003
Neg logit contributions
 nearer-0.005
 closer-0.005
 hes-0.005
 afloat-0.005
eled-0.005
TokenThank
Token ID10449
Feature activation-2.35e-02

Pos logit contributions
 nearer+0.021
 closer+0.020
 stro+0.019
eled+0.019
 hes+0.018
Neg logit contributions
 Math-0.018
 earthquake-0.018
 automobiles-0.018
 books-0.018
 computer-0.017
Token you
Token ID345
Feature activation-9.24e-03

Pos logit contributions
 nearer+0.046
 closer+0.045
 hes+0.042
apa+0.042
iri+0.040
Neg logit contributions
 Math-0.025
 sidewalk-0.025
 automobiles-0.024
 piano-0.024
 computer-0.023
Token,
Token ID11
Feature activation-1.54e-03

Pos logit contributions
 closer+0.011
 nearer+0.011
apa+0.010
 deeper+0.009
 afloat+0.009
Neg logit contributions
 Math-0.018
 sidewalk-0.017
 piano-0.016
 automobiles-0.016
 pavement-0.016
Token but
Token ID475
Feature activation-1.94e-03

Pos logit contributions
 closer+0.002
 hes+0.002
 nearer+0.002
 bur+0.002
iri+0.002
Neg logit contributions
 Math-0.002
 automobiles-0.002
 sidewalk-0.002
 pavement-0.002
cles-0.002
Token no
Token ID645
Feature activation+8.37e-04

Pos logit contributions
 nearer+0.004
 closer+0.004
 hes+0.004
 deeper+0.004
 pap+0.004
Neg logit contributions
 Math-0.002
 automobiles-0.002
 sidewalk-0.002
 pavement-0.002
 piano-0.002
Token thank
Token ID5875
Feature activation-1.03e-02

Pos logit contributions
 Math+0.001
issors+0.001
 sidewalk+0.001
aces+0.001
 automobiles+0.001
Neg logit contributions
 closer-0.001
 nearer-0.001
 deeper-0.001
 hes-0.001
 afloat-0.001
Token noise
Token ID7838
Feature activation-1.31e-02

Pos logit contributions
 nearer+0.006
 closer+0.006
 afloat+0.006
eled+0.005
 submerged+0.005
Neg logit contributions
 Math-0.005
 automobiles-0.004
 dictionary-0.004
 books-0.004
 sidewalk-0.004
Token.
Token ID13
Feature activation+3.93e-03

Pos logit contributions
 nearer+0.026
eled+0.026
apa+0.026
 afloat+0.026
igi+0.026
Neg logit contributions
 Math-0.014
 books-0.011
 sidewalk-0.011
 automobiles-0.011
 piano-0.011
Token It
Token ID632
Feature activation-3.57e-03

Pos logit contributions
 Math+0.006
 automobiles+0.005
 piano+0.005
 dictionary+0.005
 books+0.005
Neg logit contributions
 nearer-0.007
 closer-0.006
 hes-0.006
 submerged-0.006
 afloat-0.006
Token was
Token ID373
Feature activation+6.78e-03

Pos logit contributions
 afloat+0.006
apa+0.006
 nearer+0.006
eled+0.006
igi+0.006
Neg logit contributions
 Math-0.004
 dictionary-0.004
 books-0.004
 kids-0.004
 sidewalk-0.004
Token a
Token ID257
Feature activation-4.14e-03

Pos logit contributions
 kids+0.006
 Math+0.006
 earthquake+0.006
 mine+0.005
 books+0.005
Neg logit contributions
apa-0.015
igi-0.015
eled-0.015
ub-0.015
ossom-0.014
Token dog
Token ID3290
Feature activation-2.47e-02

Pos logit contributions
 nearer+0.008
 closer+0.007
 afloat+0.007
eled+0.007
 submerged+0.007
Neg logit contributions
 Math-0.005
 automobiles-0.005
 books-0.004
 sidewalk-0.004
 dictionary-0.004
Token barking
Token ID48961
Feature activation-9.07e-04

Pos logit contributions
 nearer+0.045
 afloat+0.043
eled+0.043
 closer+0.042
 submerged+0.042
Neg logit contributions
 Math-0.033
 dictionary-0.028
 automobiles-0.028
 books-0.028
 sidewalk-0.027
Token.
Token ID13
Feature activation+8.20e-03

Pos logit contributions
 nearer+0.002
 afloat+0.001
eled+0.001
 closer+0.001
 submerged+0.001
Neg logit contributions
 Math-0.001
 dictionary-0.001
 automobiles-0.001
 books-0.001
 sidewalk-0.001
Token The
Token ID383
Feature activation+3.19e-03

Pos logit contributions
 Math+0.012
 piano+0.011
 exam+0.011
 automobiles+0.011
 dictionary+0.011
Neg logit contributions
 nearer-0.013
 closer-0.012
apa-0.012
 submerged-0.012
 hes-0.012
Token dog
Token ID3290
Feature activation-6.79e-03

Pos logit contributions
 Math+0.003
 automobiles+0.003
 sidewalk+0.002
 books+0.002
aces+0.002
Neg logit contributions
 nearer-0.007
 closer-0.007
 afloat-0.006
eled-0.006
 submerged-0.006
Token saw
Token ID2497
Feature activation+1.68e-02

Pos logit contributions
 nearer+0.011
 afloat+0.010
 closer+0.010
eled+0.010
 submerged+0.010
Neg logit contributions
 Math-0.011
 automobiles-0.009
 dictionary-0.009
 sidewalk-0.009
 books-0.009