TOP ACTIVATIONS
MAX = 0.093

said , " Hello , ham ster . I will call
had a quarrel about what topp ings to put on the
'm sorry for the bad ham . I hope you feel
ham ster cage . Their ham ster , Fl uffy ,
. She remembered that her ham ster liked to eat seeds
they can add their own topp ings too . Sara chooses
put the cheese and the ham on the bread ?" Mom
my told him that the ham ster would always be with
stick the mustache on her upper lip with some tape .
The chip m unk was excited and realized
" Oh no ! My cr ay on is broken ."
? That is not a cr ay on . That is
? That is not a cr ay on . That is
my hurts from the bad ham ," Lily said , rubbing
source . It was their ham ster cage . Their ham
cheeks . That is a ham ster . A rabbit has
No , this is my cr ay on . I am
the cheese and add the topp ings . Lily and Ben
it was hard . The cr ay ons were sticky and
or get stuck . The cr ay ons and the clay

MIN ACTIVATIONS
MIN = -0.106

elf and waved goodbye as <|endoftext|> Tom my was walking in
prince thanked the wizard and <|endoftext|> Once upon a time ,
ia smiled and went back <|endoftext|> Once upon a time ,
and the rabbit were the <|endoftext|> Once there was a man
expected â <|endoftext|> Once upon a time there
she couldn 't get it <|endoftext|> Once upon a time there
that she hugged the little <|endoftext|> Once upon a time there
smiled to herself , remembering <|endoftext|> John was preparing to go
was just in the other <|endoftext|> Once upon a time ,
, but she knew she <|endoftext|> M andy opened the tiny
, and she was right <|endoftext|> Once upon a time ,
buying it . â <|endoftext|> Once upon a time there
That day , she learnt <|endoftext|> Once upon a time ,
's suggestion had paid off <|endoftext|> Once upon a time ,
opened the box , and <|endoftext|> Once upon a time there
nice rest ! â <|endoftext|> One day , John was
was happy to have something <|endoftext|> Jane went to the store
David nodded and watched <|endoftext|> Once there was a little
a smile on his face <|endoftext|> Once upon a time ,
and baby walk away and <|endoftext|> Once there was a famous

INTERVAL 0.044 - 0.093
CONTAINS 0.896%

He pressed it on the cardboard and made a big print
Andy said , " Let 's look and see if we
and stronger each day . One day , Lily 's friend
taking my lumber ! That 's for my new house !"
The baby 's face lit up and she

INTERVAL -0.006 - 0.044
CONTAINS 65.437%

could not pass the test that his teacher had given him
Anna felt surprised and scared . She did not expect the
made a bigger and better tower . They had a nice
She said , " That 's not fair ! I want
and Mia has a doll . Tom likes to make loud

INTERVAL -0.056 - -0.006
CONTAINS 33.082%

the door . She sees them crying . "
So , she went to the art supply store and picked
new friends and was always laughing . His days were filled
no ! I 'm feeling so dizz y !" Then she
." Jane was so happy . She couldn 't believe it

INTERVAL -0.106 - -0.056
CONTAINS 0.586%

fun time drinking it . <|endoftext|> Once upon a time ,
The end ! <|endoftext|> One day , Larry went
even when having fun . <|endoftext|> Once upon a time ,
would never fight again . <|endoftext|> L ily and Sam are
the wild morning again . <|endoftext|> One sunny day , Ben
Token said
Token ID531
Feature activation-3.53e-02

Pos logit contributions
leaf+0.052
 rubble+0.050
 stret+0.046
Hop+0.044
sm+0.044
Neg logit contributions
omo-0.072
 size-0.064
omm-0.063
ums-0.061
ama-0.059
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
 rubble+0.065
leaf+0.064
 stret+0.061
sm+0.059
Hop+0.059
Neg logit contributions
omo-0.052
 size-0.045
 head-0.040
omm-0.038
ums-0.037
Token "
Token ID366
Feature activation-1.23e-02

Pos logit contributions
 rubble+0.036
leaf+0.034
 stret+0.033
lves+0.031
sm+0.031
Neg logit contributions
omo-0.022
 size-0.018
 head-0.016
omm-0.016
ums-0.016
TokenHello
Token ID15496
Feature activation+3.90e-03

Pos logit contributions
leaf+0.016
Hop+0.014
 rubble+0.014
 stret+0.014
sm+0.013
Neg logit contributions
omo-0.025
 size-0.023
omm-0.022
 head-0.022
ums-0.021
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
omo+0.006
 size+0.006
ums+0.005
omm+0.005
 head+0.005
Neg logit contributions
sm-0.005
ts-0.005
Hop-0.005
 rubble-0.005
leaf-0.005
Token ham
Token ID8891
Feature activation+9.33e-02

Pos logit contributions
omo+0.019
 size+0.016
ums+0.015
omm+0.014
 head+0.014
Neg logit contributions
 rubble-0.025
leaf-0.024
 stret-0.023
Hop-0.022
sm-0.022
Tokenster
Token ID1706
Feature activation+4.26e-02

Pos logit contributions
omo+0.170
 Sorry+0.143
 size+0.139
 neither+0.138
omm+0.137
Neg logit contributions
sm-0.107
Hop-0.100
 nearby-0.099
ers-0.097
 stret-0.094
Token.
Token ID13
Feature activation+3.40e-02

Pos logit contributions
omo+0.074
 size+0.065
omm+0.064
ums+0.060
 Sorry+0.060
Neg logit contributions
 rubble-0.061
sm-0.060
leaf-0.059
 stret-0.057
Hop-0.056
Token I
Token ID314
Feature activation-2.20e-02

Pos logit contributions
omo+0.061
 size+0.055
 head+0.049
ums+0.046
omm+0.046
Neg logit contributions
leaf-0.054
 rubble-0.049
Hop-0.048
sm-0.048
free-0.046
Token will
Token ID481
Feature activation+2.86e-02

Pos logit contributions
leaf+0.037
 rubble+0.037
sm+0.035
 stret+0.034
Hop+0.034
Neg logit contributions
omo-0.035
omm-0.030
 size-0.030
ums-0.029
 head-0.028
Token call
Token ID869
Feature activation+1.15e-02

Pos logit contributions
omo+0.050
omm+0.044
ums+0.041
 size+0.040
ama+0.040
Neg logit contributions
 rubble-0.043
leaf-0.042
 stret-0.041
Hop-0.037
aid-0.037
Token had
Token ID550
Feature activation-4.02e-03

Pos logit contributions
 rubble+0.010
leaf+0.010
 stret+0.009
ts+0.009
lves+0.009
Neg logit contributions
omo-0.010
 size-0.009
omm-0.008
ums-0.008
 head-0.008
Token a
Token ID257
Feature activation+9.01e-03

Pos logit contributions
 rubble+0.006
leaf+0.006
 stret+0.006
lves+0.005
sm+0.005
Neg logit contributions
omo-0.007
 size-0.006
 head-0.006
omm-0.006
ums-0.006
Token quarrel
Token ID41998
Feature activation-1.40e-03

Pos logit contributions
omo+0.015
 size+0.013
 head+0.012
omm+0.012
ums+0.012
Neg logit contributions
 rubble-0.015
leaf-0.015
 stret-0.014
sm-0.013
lves-0.013
Token about
Token ID546
Feature activation-1.48e-03

Pos logit contributions
 rubble+0.002
leaf+0.002
 everywhere+0.002
sm+0.002
 outside+0.002
Neg logit contributions
omo-0.003
omm-0.002
ums-0.002
 size-0.002
mm-0.002
Token what
Token ID644
Feature activation-5.13e-03

Pos logit contributions
 rubble+0.002
leaf+0.002
 stret+0.002
lves+0.002
sm+0.002
Neg logit contributions
omo-0.003
 size-0.003
 head-0.002
omm-0.002
ums-0.002
Token topp
Token ID23126
Feature activation+8.91e-02

Pos logit contributions
leaf+0.007
 rubble+0.007
sm+0.007
looking+0.006
 groceries+0.006
Neg logit contributions
omo-0.009
ums-0.009
 size-0.008
omm-0.008
 head-0.008
Tokenings
Token ID654
Feature activation+2.91e-03

Pos logit contributions
omo+0.133
 seeker+0.131
omm+0.121
 throat+0.119
 Ro+0.118
Neg logit contributions
ts-0.127
led-0.121
ve-0.118
ers-0.118
hell-0.116
Token to
Token ID284
Feature activation-3.21e-03

Pos logit contributions
omo+0.005
 size+0.004
omm+0.004
ums+0.004
 head+0.004
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.005
sm-0.004
ts-0.004
Token put
Token ID1234
Feature activation-1.21e-02

Pos logit contributions
 rubble+0.006
leaf+0.005
 stret+0.005
ts+0.005
 groceries+0.005
Neg logit contributions
omo-0.005
 size-0.004
omm-0.004
ums-0.004
ama-0.004
Token on
Token ID319
Feature activation-6.73e-03

Pos logit contributions
 rubble+0.019
leaf+0.019
 stret+0.018
Hop+0.016
lves+0.016
Neg logit contributions
omo-0.021
 size-0.019
 head-0.018
 Mom-0.017
ums-0.017
Token the
Token ID262
Feature activation-2.37e-02

Pos logit contributions
 rubble+0.010
leaf+0.009
 stret+0.009
sm+0.008
ts+0.008
Neg logit contributions
omo-0.013
 size-0.011
omm-0.011
ums-0.011
 head-0.011
Token'm
Token ID1101
Feature activation-2.70e-03

Pos logit contributions
 rubble+0.030
leaf+0.030
 stret+0.028
sm+0.028
Hop+0.027
Neg logit contributions
omo-0.029
 size-0.025
omm-0.024
ums-0.024
 head-0.022
Token sorry
Token ID7926
Feature activation+5.38e-02

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
sm+0.003
lves+0.003
Neg logit contributions
omo-0.005
 size-0.004
 head-0.004
omm-0.004
ums-0.004
Token for
Token ID329
Feature activation+8.34e-04

Pos logit contributions
omo+0.098
 size+0.085
omm+0.078
ums+0.078
master+0.078
Neg logit contributions
 rubble-0.074
 everywhere-0.071
 outside-0.068
 outdoors-0.068
free-0.066
Token the
Token ID262
Feature activation+2.74e-03

Pos logit contributions
omo+0.001
 size+0.001
 head+0.001
 Mom+0.001
omm+0.001
Neg logit contributions
 rubble-0.002
leaf-0.001
 stret-0.001
lves-0.001
sm-0.001
Token bad
Token ID2089
Feature activation+1.90e-02

Pos logit contributions
omo+0.004
 size+0.004
 head+0.004
omm+0.003
 Mom+0.003
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.005
lves-0.004
ts-0.004
Token ham
Token ID8891
Feature activation+8.83e-02

Pos logit contributions
omo+0.032
ums+0.027
 size+0.027
 Sorry+0.026
omm+0.026
Neg logit contributions
 rubble-0.030
leaf-0.028
 stret-0.027
sm-0.026
 groceries-0.026
Token.
Token ID13
Feature activation+2.67e-02

Pos logit contributions
omo+0.154
 size+0.123
 Sorry+0.122
omm+0.119
ums+0.119
Neg logit contributions
 rubble-0.133
 stret-0.130
sm-0.119
Hop-0.118
leaf-0.118
Token I
Token ID314
Feature activation-2.22e-02

Pos logit contributions
omo+0.047
 size+0.046
castle+0.038
ums+0.037
zag+0.036
Neg logit contributions
leaf-0.041
Here-0.039
free-0.036
aid-0.035
Hop-0.035
Token hope
Token ID2911
Feature activation-1.92e-03

Pos logit contributions
 rubble+0.036
leaf+0.036
sm+0.034
 stret+0.034
Hop+0.033
Neg logit contributions
omo-0.035
 size-0.032
omm-0.031
ums-0.031
 head-0.028
Token you
Token ID345
Feature activation-4.67e-02

Pos logit contributions
 rubble+0.003
leaf+0.003
 stret+0.003
sm+0.003
lves+0.003
Neg logit contributions
omo-0.003
 size-0.003
omm-0.003
 head-0.003
ums-0.003
Token feel
Token ID1254
Feature activation-1.16e-02

Pos logit contributions
 rubble+0.088
leaf+0.085
 stret+0.083
lves+0.076
sm+0.075
Neg logit contributions
omo-0.067
 size-0.059
 head-0.055
ums-0.050
 Sorry-0.050
Token ham
Token ID8891
Feature activation+8.24e-02

Pos logit contributions
omo+0.025
 size+0.020
omm+0.019
ums+0.019
 Sorry+0.018
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.032
sm-0.029
ts-0.029
Tokenster
Token ID1706
Feature activation+2.85e-02

Pos logit contributions
omo+0.118
 Sorry+0.111
omm+0.090
 Chess+0.088
mm+0.087
Neg logit contributions
 stret-0.138
 rubble-0.135
ts-0.132
 everywhere-0.132
sm-0.121
Token cage
Token ID17186
Feature activation+8.54e-03

Pos logit contributions
omo+0.047
 Sorry+0.040
omm+0.040
 size+0.037
ums+0.037
Neg logit contributions
 rubble-0.047
 stret-0.044
ts-0.041
sm-0.040
leaf-0.040
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
omo+0.015
 size+0.012
omm+0.012
 Sorry+0.012
ums+0.012
Neg logit contributions
 rubble-0.014
 stret-0.013
leaf-0.013
sm-0.012
ts-0.012
Token Their
Token ID5334
Feature activation+5.17e-02

Pos logit contributions
omo+0.033
 size+0.033
 head+0.029
master+0.027
ums+0.027
Neg logit contributions
 rubble-0.027
leaf-0.026
 stret-0.025
sm-0.024
Hop-0.024
Token ham
Token ID8891
Feature activation+8.70e-02

Pos logit contributions
omo+0.063
 size+0.051
omm+0.046
ums+0.045
 head+0.044
Neg logit contributions
 rubble-0.111
leaf-0.105
 stret-0.104
ts-0.097
sm-0.096
Tokenster
Token ID1706
Feature activation+2.70e-02

Pos logit contributions
omo+0.134
 Sorry+0.123
omm+0.106
 Chess+0.103
mm+0.101
Neg logit contributions
ts-0.133
 rubble-0.131
 stret-0.127
 everywhere-0.125
sm-0.120
Token,
Token ID11
Feature activation-5.68e-03

Pos logit contributions
omo+0.041
 Sorry+0.036
omm+0.035
 size+0.034
ums+0.034
Neg logit contributions
 rubble-0.047
ts-0.043
 stret-0.043
leaf-0.041
sm-0.040
Token Fl
Token ID1610
Feature activation+4.99e-02

Pos logit contributions
 rubble+0.010
leaf+0.010
 stret+0.009
lves+0.009
ival+0.009
Neg logit contributions
omo-0.009
 size-0.008
 head-0.007
 Mom-0.007
 throat-0.007
Tokenuffy
Token ID15352
Feature activation+1.54e-02

Pos logit contributions
omo+0.069
 size+0.056
 Sorry+0.055
omm+0.054
master+0.053
Neg logit contributions
 rubble-0.084
ts-0.081
sm-0.081
 stret-0.080
leaf-0.078
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
omo+0.023
 size+0.019
omm+0.018
ums+0.018
 Sorry+0.018
Neg logit contributions
 rubble-0.029
 stret-0.027
leaf-0.027
ts-0.026
sm-0.025
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
omo+0.043
 size+0.037
omm+0.034
 head+0.034
ums+0.034
Neg logit contributions
 rubble-0.047
leaf-0.045
 stret-0.043
sm-0.040
lves-0.040
Token She
Token ID1375
Feature activation+1.56e-02

Pos logit contributions
omo+0.018
 size+0.015
 head+0.014
omm+0.014
ums+0.013
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.033
sm-0.030
ts-0.030
Token remembered
Token ID12086
Feature activation+2.31e-02

Pos logit contributions
omo+0.028
ums+0.025
 size+0.025
 Sorry+0.024
omm+0.024
Neg logit contributions
 rubble-0.023
leaf-0.021
 stret-0.021
ts-0.020
Hop-0.020
Token that
Token ID326
Feature activation+2.16e-02

Pos logit contributions
omo+0.038
 size+0.036
 Mom+0.035
 only+0.032
 head+0.032
Neg logit contributions
 rubble-0.038
leaf-0.037
 stret-0.036
lves-0.034
Hop-0.033
Token her
Token ID607
Feature activation+4.27e-02

Pos logit contributions
omo+0.042
 size+0.036
omm+0.035
ums+0.034
 head+0.034
Neg logit contributions
 rubble-0.031
leaf-0.029
 stret-0.028
sm-0.026
ts-0.025
Token ham
Token ID8891
Feature activation+8.67e-02

Pos logit contributions
omo+0.046
 Sorry+0.036
 size+0.035
omm+0.034
ums+0.032
Neg logit contributions
 rubble-0.092
 stret-0.088
leaf-0.087
sm-0.083
 outdoors-0.081
Tokenster
Token ID1706
Feature activation+3.35e-02

Pos logit contributions
omo+0.138
 Sorry+0.126
omm+0.110
 size+0.107
 Me+0.104
Neg logit contributions
 rubble-0.127
 stret-0.125
sm-0.122
ts-0.122
Hop-0.121
Token liked
Token ID8288
Feature activation+2.87e-03

Pos logit contributions
omo+0.045
 size+0.037
omm+0.035
 Sorry+0.034
ums+0.034
Neg logit contributions
 rubble-0.066
leaf-0.063
 stret-0.061
sm-0.059
ts-0.058
Token to
Token ID284
Feature activation+7.91e-03

Pos logit contributions
omo+0.005
 size+0.005
 head+0.004
 Mom+0.004
ums+0.004
Neg logit contributions
 rubble-0.005
leaf-0.004
 stret-0.004
lves-0.004
Hop-0.004
Token eat
Token ID4483
Feature activation+1.72e-02

Pos logit contributions
omo+0.014
omm+0.012
 size+0.011
ums+0.011
ama+0.011
Neg logit contributions
 rubble-0.013
 stret-0.012
leaf-0.012
ts-0.011
sm-0.011
Token seeds
Token ID11904
Feature activation+7.68e-03

Pos logit contributions
omo+0.028
 size+0.025
 head+0.023
ums+0.022
 Mom+0.022
Neg logit contributions
 rubble-0.030
leaf-0.029
 stret-0.028
lves-0.026
Hop-0.025
Token they
Token ID484
Feature activation-2.17e-02

Pos logit contributions
 rubble+0.059
leaf+0.058
 stret+0.054
sm+0.052
ts+0.052
Neg logit contributions
omo-0.056
 size-0.050
 head-0.047
ums-0.045
omm-0.043
Token can
Token ID460
Feature activation+1.56e-02

Pos logit contributions
 rubble+0.045
 stret+0.043
leaf+0.043
lves+0.040
sm+0.040
Neg logit contributions
omo-0.027
 size-0.024
 head-0.022
 only-0.020
 Mom-0.020
Token add
Token ID751
Feature activation+1.25e-02

Pos logit contributions
omo+0.028
omm+0.024
urse+0.024
ia+0.024
 ending+0.023
Neg logit contributions
 rubble-0.019
 outdoors-0.019
 outside-0.019
leaf-0.019
ts-0.018
Token their
Token ID511
Feature activation-6.94e-04

Pos logit contributions
omo+0.023
 size+0.021
 Mom+0.019
 head+0.019
ums+0.019
Neg logit contributions
 rubble-0.019
leaf-0.018
 stret-0.017
lves-0.016
Hop-0.015
Token own
Token ID898
Feature activation-8.67e-03

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
lves+0.001
Hop+0.001
Neg logit contributions
omo-0.001
 size-0.001
 head-0.001
 Mom-0.001
ums-0.001
Token topp
Token ID23126
Feature activation+8.66e-02

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.011
lves+0.010
Hop+0.010
Neg logit contributions
omo-0.017
 size-0.015
 head-0.014
ums-0.014
omm-0.014
Tokenings
Token ID654
Feature activation-1.22e-02

Pos logit contributions
 seeker+0.132
omo+0.128
 Sorry+0.123
 Ro+0.122
 Is+0.119
Neg logit contributions
ts-0.109
ers-0.108
 goodies-0.106
hell-0.101
 items-0.099
Token too
Token ID1165
Feature activation+2.06e-03

Pos logit contributions
leaf+0.018
 rubble+0.018
 stret+0.017
sm+0.017
 groceries+0.016
Neg logit contributions
omo-0.022
 size-0.019
 head-0.018
omm-0.018
ums-0.017
Token.
Token ID13
Feature activation+1.92e-03

Pos logit contributions
omo+0.004
 size+0.003
omm+0.003
 head+0.003
ums+0.003
Neg logit contributions
 rubble-0.003
leaf-0.003
 stret-0.003
sm-0.003
 groceries-0.003
Token Sara
Token ID24799
Feature activation-3.03e-03

Pos logit contributions
 size+0.004
omo+0.003
master+0.003
 head+0.003
 loser+0.003
Neg logit contributions
leaf-0.002
 stret-0.002
 freshly-0.002
 groceries-0.002
 rubble-0.002
Token chooses
Token ID19769
Feature activation+2.93e-03

Pos logit contributions
leaf+0.005
 rubble+0.005
ts+0.004
Hop+0.004
 aback+0.004
Neg logit contributions
omo-0.005
 Sorry-0.004
 size-0.004
omm-0.004
ums-0.004
Token put
Token ID1234
Feature activation+7.79e-03

Pos logit contributions
omo+0.036
 size+0.032
omm+0.032
ums+0.032
master+0.029
Neg logit contributions
 rubble-0.024
leaf-0.023
 stret-0.021
aid-0.021
ts-0.020
Token the
Token ID262
Feature activation-4.72e-03

Pos logit contributions
omo+0.012
 size+0.011
 Mom+0.011
 head+0.011
ums+0.010
Neg logit contributions
 rubble-0.014
leaf-0.013
 stret-0.013
lves-0.012
Hop-0.011
Token cheese
Token ID9891
Feature activation+4.31e-02

Pos logit contributions
 rubble+0.009
leaf+0.009
 stret+0.008
lves+0.008
Hop+0.007
Neg logit contributions
omo-0.007
 size-0.007
 head-0.006
 Mom-0.006
 throat-0.006
Token and
Token ID290
Feature activation+9.00e-03

Pos logit contributions
omo+0.077
 size+0.068
omm+0.064
ums+0.063
 Sorry+0.062
Neg logit contributions
 rubble-0.066
 stret-0.062
leaf-0.061
sm-0.056
Hop-0.055
Token the
Token ID262
Feature activation-8.00e-03

Pos logit contributions
omo+0.017
 size+0.015
omm+0.014
ums+0.014
 head+0.014
Neg logit contributions
 rubble-0.013
leaf-0.012
 stret-0.012
sm-0.011
lves-0.011
Token ham
Token ID8891
Feature activation+8.56e-02

Pos logit contributions
 rubble+0.013
leaf+0.013
 stret+0.012
lves+0.012
sm+0.012
Neg logit contributions
omo-0.013
 size-0.012
 head-0.012
 Mom-0.011
 throat-0.011
Token on
Token ID319
Feature activation+2.72e-02

Pos logit contributions
omo+0.141
 size+0.121
 Sorry+0.118
omm+0.117
 Me+0.114
Neg logit contributions
 stret-0.126
 rubble-0.122
 somewhere-0.119
ts-0.115
 everywhere-0.115
Token the
Token ID262
Feature activation-6.44e-03

Pos logit contributions
omo+0.056
 size+0.050
ums+0.048
omm+0.047
 Sorry+0.046
Neg logit contributions
 rubble-0.035
 stret-0.032
leaf-0.030
ts-0.029
 groceries-0.028
Token bread
Token ID8509
Feature activation+4.49e-02

Pos logit contributions
 rubble+0.012
leaf+0.012
 stret+0.011
sm+0.011
lves+0.011
Neg logit contributions
omo-0.009
 size-0.008
 head-0.007
omm-0.007
ums-0.007
Token?"
Token ID1701
Feature activation+3.06e-02

Pos logit contributions
omo+0.059
 size+0.050
omm+0.050
ums+0.047
 Sorry+0.044
Neg logit contributions
 rubble-0.086
 stret-0.083
 nearby-0.079
leaf-0.078
 somewhere-0.078
Token Mom
Token ID11254
Feature activation-1.15e-02

Pos logit contributions
 size+0.034
omo+0.033
omm+0.028
 head+0.028
ums+0.027
Neg logit contributions
 rubble-0.062
leaf-0.061
 stret-0.058
 outdoors-0.058
Here-0.057
Tokenmy
Token ID1820
Feature activation+5.17e-03

Pos logit contributions
omo+0.026
 size+0.024
 Sorry+0.022
ums+0.022
omm+0.022
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
sm-0.018
lves-0.017
Token told
Token ID1297
Feature activation-3.40e-02

Pos logit contributions
omo+0.009
 size+0.008
 Sorry+0.007
ums+0.007
omm+0.007
Neg logit contributions
 rubble-0.008
leaf-0.008
 stret-0.008
sm-0.007
Hop-0.007
Token him
Token ID683
Feature activation-1.78e-02

Pos logit contributions
leaf+0.077
Hop+0.067
ival+0.067
lves+0.067
 rubble+0.066
Neg logit contributions
 Mom-0.041
omo-0.040
 only-0.037
 be-0.035
 mom-0.035
Token that
Token ID326
Feature activation+6.59e-03

Pos logit contributions
 rubble+0.030
leaf+0.029
 stret+0.029
lves+0.026
Hop+0.026
Neg logit contributions
omo-0.029
 size-0.026
 head-0.024
omm-0.023
ums-0.023
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
omo+0.012
 size+0.010
omm+0.010
ums+0.010
 head+0.010
Neg logit contributions
 rubble-0.010
leaf-0.009
 stret-0.009
ts-0.009
sm-0.008
Token ham
Token ID8891
Feature activation+8.44e-02

Pos logit contributions
omo+0.020
 size+0.017
ums+0.017
omm+0.017
 Sorry+0.017
Neg logit contributions
 rubble-0.017
leaf-0.016
 stret-0.015
Hop-0.014
 risks-0.014
Tokenster
Token ID1706
Feature activation+3.20e-02

Pos logit contributions
omo+0.140
 Sorry+0.117
omm+0.107
 size+0.103
ums+0.101
Neg logit contributions
 rubble-0.126
Hop-0.122
 everywhere-0.120
ts-0.120
sm-0.119
Token would
Token ID561
Feature activation+8.60e-03

Pos logit contributions
omo+0.042
 size+0.034
omm+0.032
ums+0.032
 Sorry+0.031
Neg logit contributions
 rubble-0.065
leaf-0.062
 stret-0.059
sm-0.058
ts-0.057
Token always
Token ID1464
Feature activation+1.21e-02

Pos logit contributions
omo+0.012
ums+0.009
omm+0.009
 Sorry+0.009
 size+0.009
Neg logit contributions
 rubble-0.018
 stret-0.016
leaf-0.015
Hop-0.015
ts-0.015
Token be
Token ID307
Feature activation+8.01e-05

Pos logit contributions
omo+0.015
 size+0.012
omm+0.012
ums+0.012
 Sorry+0.011
Neg logit contributions
 rubble-0.025
leaf-0.024
 stret-0.024
ts-0.022
sm-0.022
Token with
Token ID351
Feature activation-6.72e-03

Pos logit contributions
omo+0.000
 size+0.000
 head+0.000
 only+0.000
 Mom+0.000
Neg logit contributions
leaf-0.000
 rubble-0.000
ival-0.000
 stret-0.000
looking-0.000
Token stick
Token ID4859
Feature activation+2.69e-02

Pos logit contributions
omo+0.007
 size+0.006
 head+0.006
omm+0.006
ums+0.006
Neg logit contributions
 rubble-0.008
leaf-0.007
 stret-0.007
lves-0.006
sm-0.006
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
omo+0.046
 size+0.042
 head+0.039
 Mom+0.038
omm+0.037
Neg logit contributions
 rubble-0.043
leaf-0.042
 stret-0.040
lves-0.037
sm-0.036
Token mustache
Token ID49303
Feature activation+1.10e-02

Pos logit contributions
leaf+0.040
 rubble+0.039
sm+0.036
 stret+0.036
lves+0.035
Neg logit contributions
omo-0.034
 size-0.033
 head-0.031
 Mom-0.027
 throat-0.027
Token on
Token ID319
Feature activation+2.46e-02

Pos logit contributions
omo+0.022
omm+0.018
 size+0.018
ums+0.018
 Sorry+0.017
Neg logit contributions
 rubble-0.015
 stret-0.014
leaf-0.014
 everywhere-0.013
 nearby-0.013
Token her
Token ID607
Feature activation+3.66e-03

Pos logit contributions
omo+0.050
 size+0.042
ums+0.041
 Sorry+0.041
omm+0.040
Neg logit contributions
 rubble-0.032
 stret-0.029
leaf-0.029
 outdoors-0.026
sm-0.026
Token upper
Token ID6727
Feature activation+8.39e-02

Pos logit contributions
omo+0.005
 size+0.004
omm+0.004
ums+0.003
 Sorry+0.003
Neg logit contributions
 rubble-0.008
 stret-0.007
leaf-0.007
ts-0.007
Hop-0.006
Token lip
Token ID10645
Feature activation+2.08e-02

Pos logit contributions
 Sorry+0.106
omo+0.106
 size+0.097
 Me+0.091
 seeker+0.089
Neg logit contributions
 stret-0.145
 rubble-0.129
keeping-0.126
and-0.125
aid-0.125
Token with
Token ID351
Feature activation-4.91e-03

Pos logit contributions
omo+0.038
 Sorry+0.033
 size+0.033
omm+0.033
 Me+0.031
Neg logit contributions
 stret-0.028
 rubble-0.027
aid-0.025
 outdoors-0.024
 outside-0.024
Token some
Token ID617
Feature activation+4.61e-03

Pos logit contributions
 rubble+0.007
leaf+0.007
 stret+0.006
sm+0.006
ts+0.006
Neg logit contributions
omo-0.010
 size-0.008
omm-0.008
ums-0.008
 head-0.008
Token tape
Token ID9154
Feature activation+4.08e-02

Pos logit contributions
omo+0.008
 size+0.008
 head+0.007
 Mom+0.007
ums+0.007
Neg logit contributions
 rubble-0.007
leaf-0.007
 stret-0.006
lves-0.006
Hop-0.006
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
omo+0.070
omm+0.059
 Sorry+0.058
 size+0.058
ums+0.056
Neg logit contributions
 rubble-0.064
 stret-0.061
leaf-0.059
ts-0.054
sm-0.054
Token
Token ID220
Feature activation+5.77e-03

Pos logit contributions
 rubble+0.006
leaf+0.006
 stret+0.006
ts+0.006
 unnecessary+0.006
Neg logit contributions
omo-0.005
 size-0.004
ums-0.004
 Mom-0.004
omm-0.004
Token
Token ID198
Feature activation-9.35e-03

Pos logit contributions
omo+0.008
 size+0.007
ums+0.007
omm+0.007
 Mom+0.006
Neg logit contributions
 rubble-0.011
leaf-0.011
 stret-0.010
ts-0.010
lves-0.010
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
 rubble+0.021
 unnecessary+0.020
 risks+0.020
 stret+0.019
 groceries+0.019
Neg logit contributions
omo-0.011
 Mom-0.008
ums-0.007
ia-0.007
 Sorry-0.007
TokenThe
Token ID464
Feature activation+2.24e-02

Pos logit contributions
 rubble+0.023
leaf+0.021
 stret+0.021
 unnecessary+0.019
ts+0.019
Neg logit contributions
omo-0.021
 size-0.018
ums-0.016
omm-0.016
 Sorry-0.016
Token chip
Token ID11594
Feature activation+4.45e-02

Pos logit contributions
omo+0.028
 size+0.023
 Sorry+0.021
ums+0.021
omm+0.021
Neg logit contributions
 rubble-0.046
leaf-0.044
 stret-0.044
lves-0.041
sm-0.041
Tokenm
Token ID76
Feature activation+8.37e-02

Pos logit contributions
omo+0.074
 size+0.065
 head+0.060
ums+0.060
omm+0.059
Neg logit contributions
 rubble-0.076
leaf-0.072
 stret-0.070
 unnecessary-0.064
lves-0.063
Tokenunk
Token ID2954
Feature activation-4.73e-03

Pos logit contributions
omo+0.127
 size+0.113
 Sorry+0.110
 head+0.100
omm+0.098
Neg logit contributions
 rubble-0.143
leaf-0.139
 stret-0.136
sm-0.128
lves-0.124
Token was
Token ID373
Feature activation+1.30e-02

Pos logit contributions
 rubble+0.009
leaf+0.008
 stret+0.008
sm+0.008
Hop+0.007
Neg logit contributions
omo-0.007
 size-0.006
omm-0.006
ums-0.006
 Sorry-0.006
Token excited
Token ID6568
Feature activation-9.45e-03

Pos logit contributions
omo+0.028
 size+0.025
omm+0.024
 Sorry+0.024
ums+0.023
Neg logit contributions
 rubble-0.015
leaf-0.015
 stret-0.014
sm-0.012
 outdoors-0.012
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
 rubble+0.019
leaf+0.018
 stret+0.018
lves+0.017
ts+0.016
Neg logit contributions
omo-0.013
 size-0.012
 head-0.010
 Mom-0.010
ums-0.010
Token realized
Token ID6939
Feature activation+5.05e-03

Pos logit contributions
 rubble+0.045
leaf+0.043
 stret+0.039
ts+0.037
sm+0.037
Neg logit contributions
omo-0.053
 size-0.046
omm-0.045
ums-0.044
 head-0.041
Token "
Token ID366
Feature activation-1.19e-02

Pos logit contributions
 rubble+0.045
leaf+0.043
 stret+0.041
lves+0.039
sm+0.039
Neg logit contributions
omo-0.025
 size-0.022
 head-0.020
omm-0.019
ums-0.019
TokenOh
Token ID5812
Feature activation-8.83e-04

Pos logit contributions
 rubble+0.020
leaf+0.019
 stret+0.018
lves+0.017
Hop+0.017
Neg logit contributions
omo-0.019
 size-0.017
 head-0.016
ums-0.016
omm-0.015
Token no
Token ID645
Feature activation+2.87e-02

Pos logit contributions
 rubble+0.002
sm+0.002
 stret+0.002
leaf+0.002
lves+0.002
Neg logit contributions
omo-0.001
 size-0.001
ums-0.001
 head-0.001
omm-0.001
Token!
Token ID0
Feature activation+6.86e-04

Pos logit contributions
omo+0.048
master+0.046
ums+0.044
omm+0.043
 head+0.042
Neg logit contributions
 rubble-0.040
sm-0.035
leaf-0.034
 risks-0.033
 outdoors-0.033
Token My
Token ID2011
Feature activation+6.00e-02

Pos logit contributions
 size+0.001
omo+0.001
 head+0.001
rey+0.001
master+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
free-0.001
sm-0.001
lves-0.001
Token cr
Token ID1067
Feature activation+8.32e-02

Pos logit contributions
omo+0.094
 size+0.079
omm+0.073
ums+0.073
 Sorry+0.071
Neg logit contributions
 rubble-0.106
leaf-0.101
 stret-0.098
sm-0.091
ts-0.090
Tokenay
Token ID323
Feature activation+2.05e-02

Pos logit contributions
omo+0.126
 size+0.120
 head+0.110
 Sorry+0.107
 throat+0.105
Neg logit contributions
 rubble-0.135
leaf-0.133
sm-0.122
aid-0.120
ds-0.119
Tokenon
Token ID261
Feature activation+4.91e-02

Pos logit contributions
omo+0.052
 size+0.048
 head+0.046
omm+0.045
ums+0.045
Neg logit contributions
 rubble-0.015
leaf-0.015
sm-0.012
 stret-0.011
ts-0.010
Token is
Token ID318
Feature activation+3.03e-02

Pos logit contributions
omo+0.070
ums+0.059
omm+0.055
 size+0.055
 Brady+0.053
Neg logit contributions
 rubble-0.090
 everywhere-0.081
sm-0.079
ts-0.077
ve-0.075
Token broken
Token ID5445
Feature activation+2.73e-02

Pos logit contributions
omo+0.055
 size+0.048
ums+0.046
omm+0.045
 Sorry+0.045
Neg logit contributions
 rubble-0.047
leaf-0.042
 stret-0.041
sm-0.038
lves-0.038
Token."
Token ID526
Feature activation+1.01e-02

Pos logit contributions
omo+0.051
 Sorry+0.044
ums+0.043
omm+0.041
mm+0.041
Neg logit contributions
 rubble-0.039
 everywhere-0.036
 outdoors-0.034
 outside-0.034
free-0.033
Token?
Token ID30
Feature activation+2.15e-03

Pos logit contributions
omo+0.063
 size+0.048
omm+0.047
ums+0.046
 throat+0.046
Neg logit contributions
 rubble-0.075
leaf-0.075
 outdoors-0.074
ts-0.072
 outside-0.071
Token That
Token ID1320
Feature activation+3.03e-02

Pos logit contributions
omo+0.004
 size+0.003
 head+0.003
master+0.003
omm+0.003
Neg logit contributions
 rubble-0.004
leaf-0.004
free-0.003
 stret-0.003
sm-0.003
Token is
Token ID318
Feature activation+4.31e-02

Pos logit contributions
omo+0.050
ums+0.047
omm+0.042
eter+0.042
ama+0.039
Neg logit contributions
 rubble-0.051
leaf-0.043
 stret-0.043
 newspapers-0.043
sm-0.043
Token not
Token ID407
Feature activation+2.37e-02

Pos logit contributions
omo+0.054
 size+0.045
ums+0.041
omm+0.039
 head+0.039
Neg logit contributions
 rubble-0.092
leaf-0.087
 stret-0.085
ts-0.081
sm-0.080
Token a
Token ID257
Feature activation+4.11e-02

Pos logit contributions
omo+0.042
 size+0.038
 head+0.035
omm+0.034
 Mom+0.034
Neg logit contributions
 rubble-0.037
leaf-0.036
 stret-0.034
lves-0.032
Hop-0.032
Token cr
Token ID1067
Feature activation+8.30e-02

Pos logit contributions
omo+0.071
 size+0.061
ums+0.059
omm+0.059
 head+0.056
Neg logit contributions
 rubble-0.067
leaf-0.062
 stret-0.062
sm-0.056
lves-0.056
Tokenay
Token ID323
Feature activation+2.02e-02

Pos logit contributions
omo+0.142
 Sorry+0.137
 size+0.134
 head+0.126
 throat+0.122
Neg logit contributions
 rubble-0.115
leaf-0.106
aid-0.104
sm-0.102
ds-0.100
Tokenon
Token ID261
Feature activation+6.39e-02

Pos logit contributions
omo+0.045
 size+0.041
 head+0.038
 Sorry+0.038
omm+0.038
Neg logit contributions
 rubble-0.022
leaf-0.021
 stret-0.020
sm-0.020
ts-0.018
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
omo+0.144
 Sorry+0.124
omm+0.123
ums+0.122
rey+0.118
Neg logit contributions
 rubble-0.057
 outside-0.056
free-0.053
 everywhere-0.052
 stret-0.051
Token That
Token ID1320
Feature activation+2.87e-02

Pos logit contributions
omo+0.043
 size+0.042
master+0.036
 head+0.035
zag+0.035
Neg logit contributions
leaf-0.029
 rubble-0.027
free-0.026
Here-0.024
sm-0.024
Token is
Token ID318
Feature activation+4.10e-02

Pos logit contributions
omo+0.047
ums+0.042
omm+0.039
eter+0.036
ama+0.036
Neg logit contributions
 rubble-0.048
 newspapers-0.042
leaf-0.042
 rubbish-0.041
sm-0.041
Token?
Token ID30
Feature activation+7.26e-03

Pos logit contributions
omo+0.043
 size+0.034
omm+0.032
 throat+0.032
ums+0.031
Neg logit contributions
 outdoors-0.052
 rubble-0.051
 outside-0.050
leaf-0.050
 stret-0.049
Token That
Token ID1320
Feature activation+3.25e-02

Pos logit contributions
omo+0.013
 size+0.012
 head+0.010
master+0.010
ia+0.010
Neg logit contributions
leaf-0.011
 rubble-0.011
sm-0.011
free-0.010
 stret-0.010
Token is
Token ID318
Feature activation+4.59e-02

Pos logit contributions
omo+0.052
ums+0.045
eter+0.041
 Sorry+0.040
omm+0.039
Neg logit contributions
 rubble-0.055
leaf-0.049
sm-0.049
 everywhere-0.047
 nearby-0.046
Token not
Token ID407
Feature activation+2.66e-02

Pos logit contributions
omo+0.063
 size+0.053
ums+0.049
 head+0.047
omm+0.047
Neg logit contributions
 rubble-0.092
leaf-0.087
 stret-0.084
sm-0.080
ts-0.079
Token a
Token ID257
Feature activation+4.32e-02

Pos logit contributions
omo+0.046
 size+0.042
 head+0.038
omm+0.037
 Mom+0.037
Neg logit contributions
 rubble-0.043
leaf-0.042
 stret-0.040
Hop-0.037
lves-0.037
Token cr
Token ID1067
Feature activation+8.30e-02

Pos logit contributions
omo+0.079
ums+0.068
 size+0.066
omm+0.066
 Sorry+0.063
Neg logit contributions
 rubble-0.067
 stret-0.061
leaf-0.059
sm-0.055
lves-0.053
Tokenay
Token ID323
Feature activation+2.01e-02

Pos logit contributions
omo+0.135
 Sorry+0.132
 size+0.121
 head+0.117
 seeker+0.115
Neg logit contributions
 rubble-0.126
leaf-0.116
sm-0.114
aid-0.111
ds-0.107
Tokenon
Token ID261
Feature activation+6.33e-02

Pos logit contributions
omo+0.044
 size+0.039
 head+0.037
 Sorry+0.036
omm+0.036
Neg logit contributions
 rubble-0.023
leaf-0.022
sm-0.021
 stret-0.021
ts-0.019
Token.
Token ID13
Feature activation+2.69e-02

Pos logit contributions
omo+0.133
 Sorry+0.113
omm+0.113
ums+0.108
 size+0.106
Neg logit contributions
 rubble-0.073
 outside-0.064
 everywhere-0.063
sm-0.063
free-0.062
Token That
Token ID1320
Feature activation+3.05e-02

Pos logit contributions
omo+0.053
 size+0.053
 head+0.045
master+0.045
rey+0.044
Neg logit contributions
leaf-0.034
 rubble-0.031
free-0.031
sm-0.030
Here-0.029
Token is
Token ID318
Feature activation+4.29e-02

Pos logit contributions
omo+0.050
ums+0.042
omm+0.038
eter+0.037
 Sorry+0.037
Neg logit contributions
 rubble-0.051
leaf-0.045
sm-0.044
 everywhere-0.043
 newspapers-0.043
Tokenmy
Token ID1820
Feature activation+1.40e-02

Pos logit contributions
omo+0.047
 size+0.043
omm+0.039
 head+0.039
ums+0.039
Neg logit contributions
 rubble-0.039
leaf-0.037
 stret-0.034
sm-0.033
ts-0.032
Token hurts
Token ID20406
Feature activation+3.75e-02

Pos logit contributions
omo+0.021
ums+0.018
 size+0.017
omm+0.017
 Sorry+0.016
Neg logit contributions
 rubble-0.025
 stret-0.024
leaf-0.024
ts-0.022
Hop-0.022
Token from
Token ID422
Feature activation+5.83e-03

Pos logit contributions
omo+0.070
 size+0.063
ums+0.061
omm+0.061
 head+0.057
Neg logit contributions
 rubble-0.054
 stret-0.049
leaf-0.047
sm-0.046
 nearby-0.044
Token the
Token ID262
Feature activation-2.59e-03

Pos logit contributions
omo+0.009
 size+0.008
 head+0.008
omm+0.007
ums+0.007
Neg logit contributions
 rubble-0.010
leaf-0.010
 stret-0.009
sm-0.009
lves-0.009
Token bad
Token ID2089
Feature activation+2.55e-02

Pos logit contributions
 rubble+0.005
leaf+0.004
 stret+0.004
lves+0.004
sm+0.004
Neg logit contributions
omo-0.004
 size-0.004
 head-0.003
omm-0.003
ums-0.003
Token ham
Token ID8891
Feature activation+8.27e-02

Pos logit contributions
omo+0.043
ums+0.037
 Sorry+0.036
omm+0.035
 size+0.034
Neg logit contributions
 rubble-0.041
 stret-0.036
 outdoors-0.036
 groceries-0.035
leaf-0.035
Token,"
Token ID553
Feature activation+1.93e-02

Pos logit contributions
omo+0.147
 Sorry+0.121
omm+0.118
 size+0.117
ums+0.114
Neg logit contributions
 rubble-0.124
 stret-0.121
ts-0.107
 groceries-0.105
 nearby-0.105
Token Lily
Token ID20037
Feature activation-5.51e-03

Pos logit contributions
omo+0.026
 size+0.023
omm+0.021
ums+0.021
 head+0.020
Neg logit contributions
 rubble-0.038
leaf-0.036
 stret-0.035
sm-0.033
ts-0.033
Token said
Token ID531
Feature activation-4.04e-02

Pos logit contributions
 stret+0.006
leaf+0.006
 rubble+0.006
 aback+0.005
 nearby+0.005
Neg logit contributions
omo-0.011
 size-0.011
 Sorry-0.010
ums-0.010
 loser-0.010
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
 rubble+0.076
leaf+0.073
 stret+0.071
lves+0.066
sm+0.066
Neg logit contributions
omo-0.059
 size-0.051
 head-0.046
omm-0.046
ums-0.046
Token rubbing
Token ID27714
Feature activation+3.31e-02

Pos logit contributions
 rubble+0.038
leaf+0.036
 stret+0.035
sm+0.032
lves+0.032
Neg logit contributions
omo-0.040
 size-0.035
omm-0.032
 head-0.032
ums-0.032
Token source
Token ID2723
Feature activation+2.05e-02

Pos logit contributions
 rubble+0.018
leaf+0.018
 stret+0.017
lves+0.017
Hop+0.016
Neg logit contributions
omo-0.016
 size-0.016
 head-0.014
 Mom-0.013
omm-0.013
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
omo+0.035
 size+0.031
omm+0.029
 head+0.028
 Sorry+0.028
Neg logit contributions
 rubble-0.034
leaf-0.031
 stret-0.030
sm-0.029
ts-0.028
Token It
Token ID632
Feature activation+1.86e-02

Pos logit contributions
omo+0.029
 size+0.027
 head+0.024
ums+0.023
omm+0.022
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.033
sm-0.031
Hop-0.031
Token was
Token ID373
Feature activation+2.41e-02

Pos logit contributions
ums+0.032
omo+0.032
 Sorry+0.031
omm+0.030
mm+0.030
Neg logit contributions
 rubble-0.027
ts-0.027
 nearby-0.027
sm-0.026
leaf-0.024
Token their
Token ID511
Feature activation+1.76e-02

Pos logit contributions
omo+0.046
 size+0.041
 Sorry+0.039
ums+0.039
omm+0.039
Neg logit contributions
 rubble-0.034
leaf-0.031
 stret-0.031
ts-0.028
sm-0.027
Token ham
Token ID8891
Feature activation+8.24e-02

Pos logit contributions
omo+0.025
 size+0.020
omm+0.019
ums+0.019
 Sorry+0.018
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.032
sm-0.029
ts-0.029
Tokenster
Token ID1706
Feature activation+2.85e-02

Pos logit contributions
omo+0.118
 Sorry+0.111
omm+0.090
 Chess+0.088
mm+0.087
Neg logit contributions
 stret-0.138
 rubble-0.135
ts-0.132
 everywhere-0.132
sm-0.121
Token cage
Token ID17186
Feature activation+8.54e-03

Pos logit contributions
omo+0.047
 Sorry+0.040
omm+0.040
 size+0.037
ums+0.037
Neg logit contributions
 rubble-0.047
 stret-0.044
ts-0.041
sm-0.040
leaf-0.040
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
omo+0.015
 size+0.012
omm+0.012
 Sorry+0.012
ums+0.012
Neg logit contributions
 rubble-0.014
 stret-0.013
leaf-0.013
sm-0.012
ts-0.012
Token Their
Token ID5334
Feature activation+5.17e-02

Pos logit contributions
omo+0.033
 size+0.033
 head+0.029
master+0.027
ums+0.027
Neg logit contributions
 rubble-0.027
leaf-0.026
 stret-0.025
sm-0.024
Hop-0.024
Token ham
Token ID8891
Feature activation+8.70e-02

Pos logit contributions
omo+0.063
 size+0.051
omm+0.046
ums+0.045
 head+0.044
Neg logit contributions
 rubble-0.111
leaf-0.105
 stret-0.104
ts-0.097
sm-0.096
Token cheeks
Token ID25839
Feature activation-1.01e-03

Pos logit contributions
 rubble+0.010
 stret+0.010
leaf+0.010
 risks+0.009
ts+0.009
Neg logit contributions
omo-0.007
omm-0.006
 size-0.005
ums-0.005
ama-0.005
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
 rubble+0.001
 stret+0.001
leaf+0.001
sm+0.001
 everywhere+0.001
Neg logit contributions
omo-0.002
omm-0.002
ums-0.002
 size-0.002
mm-0.002
Token That
Token ID1320
Feature activation+2.85e-02

Pos logit contributions
omo+0.049
 size+0.046
omm+0.040
 head+0.038
ums+0.038
Neg logit contributions
leaf-0.028
Here-0.023
 rubble-0.023
free-0.023
Hop-0.022
Token is
Token ID318
Feature activation+4.24e-02

Pos logit contributions
omo+0.051
omm+0.043
ums+0.041
ama+0.036
mm+0.035
Neg logit contributions
 rubble-0.042
 nearby-0.041
leaf-0.041
 everywhere-0.041
 risks-0.040
Token a
Token ID257
Feature activation+3.47e-02

Pos logit contributions
omo+0.072
 size+0.061
ums+0.060
omm+0.059
 head+0.056
Neg logit contributions
 rubble-0.070
leaf-0.066
 stret-0.063
ts-0.060
sm-0.059
Token ham
Token ID8891
Feature activation+8.21e-02

Pos logit contributions
omo+0.064
ums+0.056
omm+0.054
 size+0.052
mm+0.051
Neg logit contributions
 rubble-0.051
 stret-0.050
leaf-0.047
Hop-0.042
 nearby-0.042
Tokenster
Token ID1706
Feature activation+3.45e-02

Pos logit contributions
omo+0.154
 Sorry+0.126
omm+0.122
eter+0.116
 size+0.116
Neg logit contributions
 stret-0.108
 somewhere-0.102
 everywhere-0.100
sm-0.096
 nearby-0.096
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
omo+0.063
omm+0.054
 size+0.052
 Sorry+0.051
ums+0.051
Neg logit contributions
 rubble-0.051
leaf-0.048
 stret-0.048
sm-0.047
 nearby-0.044
Token A
Token ID317
Feature activation+3.95e-02

Pos logit contributions
omo+0.053
 size+0.052
 head+0.043
ums+0.041
zag+0.040
Neg logit contributions
leaf-0.037
Here-0.033
 rubble-0.031
 aback-0.030
aid-0.030
Token rabbit
Token ID22746
Feature activation+2.62e-03

Pos logit contributions
omo+0.070
omm+0.060
 size+0.059
ums+0.059
mm+0.054
Neg logit contributions
 rubble-0.058
 stret-0.056
leaf-0.055
Hop-0.050
sm-0.049
Token has
Token ID468
Feature activation+5.11e-03

Pos logit contributions
omo+0.004
 size+0.003
omm+0.003
ums+0.003
 head+0.003
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.005
sm-0.005
Hop-0.004
TokenNo
Token ID2949
Feature activation-1.07e-03

Pos logit contributions
leaf+0.021
 rubble+0.021
 stret+0.020
Hop+0.020
lves+0.019
Neg logit contributions
omo-0.015
 size-0.013
omm-0.012
ums-0.011
 head-0.011
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 rubble+0.002
 stret+0.002
 risks+0.001
 groceries+0.001
leaf+0.001
Neg logit contributions
omo-0.002
 size-0.002
master-0.002
ums-0.001
omm-0.001
Token this
Token ID428
Feature activation+4.35e-02

Pos logit contributions
 rubble+0.018
leaf+0.017
 stret+0.016
ts+0.015
sm+0.015
Neg logit contributions
omo-0.016
 size-0.014
ums-0.013
omm-0.013
ama-0.012
Token is
Token ID318
Feature activation+5.16e-02

Pos logit contributions
omo+0.058
 Sorry+0.046
ums+0.046
omm+0.044
 size+0.043
Neg logit contributions
 rubble-0.073
sm-0.072
leaf-0.072
 newspapers-0.070
ts-0.069
Token my
Token ID616
Feature activation+2.42e-02

Pos logit contributions
omo+0.088
ums+0.083
 size+0.076
omm+0.074
ama+0.073
Neg logit contributions
 rubble-0.074
leaf-0.070
ts-0.068
 stret-0.067
 everywhere-0.066
Token cr
Token ID1067
Feature activation+8.06e-02

Pos logit contributions
omo+0.041
 Sorry+0.033
ums+0.033
 size+0.032
omm+0.032
Neg logit contributions
 rubble-0.038
leaf-0.036
 stret-0.034
 groceries-0.032
sm-0.032
Tokenay
Token ID323
Feature activation+1.82e-02

Pos logit contributions
omo+0.138
 size+0.137
 Sorry+0.135
 neither+0.126
 seeker+0.125
Neg logit contributions
aid-0.095
 rubble-0.095
leaf-0.093
sm-0.092
hell-0.090
Tokenon
Token ID261
Feature activation+5.40e-02

Pos logit contributions
omo+0.044
 size+0.040
 seeker+0.037
 Sorry+0.037
 head+0.037
Neg logit contributions
sm-0.014
leaf-0.014
 rubble-0.013
ons-0.012
aid-0.011
Token.
Token ID13
Feature activation+3.42e-02

Pos logit contributions
omo+0.120
 Sorry+0.107
ums+0.104
omm+0.104
 Brady+0.103
Neg logit contributions
 everywhere-0.040
 outside-0.040
hell-0.039
 rubble-0.039
sm-0.038
Token I
Token ID314
Feature activation-2.55e-02

Pos logit contributions
omo+0.068
 size+0.068
rey+0.055
 ending+0.055
 head+0.055
Neg logit contributions
leaf-0.041
 rubble-0.036
Here-0.036
free-0.035
aid-0.033
Token am
Token ID716
Feature activation+1.21e-02

Pos logit contributions
leaf+0.037
sm+0.035
 rubble+0.035
 stret+0.033
lves+0.033
Neg logit contributions
omo-0.045
omm-0.043
ums-0.040
 size-0.039
ama-0.038
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
omo+0.049
 size+0.044
 head+0.040
omm+0.040
ums+0.040
Neg logit contributions
 rubble-0.038
leaf-0.036
 stret-0.035
sm-0.031
lves-0.031
Token cheese
Token ID9891
Feature activation+3.02e-02

Pos logit contributions
 rubble+0.042
leaf+0.040
 stret+0.038
lves+0.036
Hop+0.035
Neg logit contributions
omo-0.044
 size-0.040
 head-0.037
ums-0.036
omm-0.035
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
omo+0.059
omm+0.049
 size+0.047
 Sorry+0.047
ums+0.047
Neg logit contributions
 rubble-0.043
 stret-0.040
leaf-0.038
ts-0.037
 everywhere-0.036
Token add
Token ID751
Feature activation+1.34e-02

Pos logit contributions
 rubble+0.025
leaf+0.023
 stret+0.022
ts+0.021
sm+0.020
Neg logit contributions
omo-0.033
 size-0.029
omm-0.028
ums-0.028
 head-0.027
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
omo+0.024
 size+0.022
 Mom+0.020
 head+0.020
ums+0.020
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
lves-0.018
Hop-0.017
Token topp
Token ID23126
Feature activation+8.05e-02

Pos logit contributions
 rubble+0.034
leaf+0.032
 stret+0.031
lves+0.029
Hop+0.028
Neg logit contributions
omo-0.036
 size-0.034
 head-0.031
ums-0.029
 Mom-0.029
Tokenings
Token ID654
Feature activation-2.01e-02

Pos logit contributions
omo+0.123
 seeker+0.119
 Sorry+0.106
omm+0.100
 throat+0.099
Neg logit contributions
ts-0.112
ers-0.105
hell-0.105
ve-0.105
 goodies-0.103
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 rubble+0.033
leaf+0.032
 stret+0.030
sm+0.029
Hop+0.027
Neg logit contributions
omo-0.035
 size-0.030
omm-0.028
 head-0.028
ums-0.027
Token Lily
Token ID20037
Feature activation+1.04e-02

Pos logit contributions
omo+0.027
 size+0.026
 head+0.023
master+0.022
ums+0.021
Neg logit contributions
 rubble-0.028
leaf-0.027
 stret-0.026
 groceries-0.024
 outdoors-0.024
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
omo+0.019
 Sorry+0.019
 size+0.018
omm+0.018
ums+0.017
Neg logit contributions
leaf-0.013
 rubble-0.012
Hop-0.011
ts-0.011
 stret-0.011
Token Ben
Token ID3932
Feature activation-3.61e-03

Pos logit contributions
omo+0.023
 size+0.021
master+0.020
omm+0.019
eter+0.019
Neg logit contributions
 groceries-0.020
 everywhere-0.019
 stret-0.019
 newspapers-0.019
leaf-0.019
Token it
Token ID340
Feature activation+2.05e-02

Pos logit contributions
omo+0.033
omm+0.029
ums+0.029
 size+0.028
 Sorry+0.027
Neg logit contributions
 rubble-0.018
ts-0.015
 everywhere-0.014
leaf-0.014
 rubbish-0.013
Token was
Token ID373
Feature activation+3.21e-02

Pos logit contributions
omo+0.034
ums+0.034
 Sorry+0.034
omm+0.032
mm+0.031
Neg logit contributions
 rubble-0.031
ts-0.030
 everywhere-0.027
 nearby-0.026
sm-0.026
Token hard
Token ID1327
Feature activation+7.38e-03

Pos logit contributions
omo+0.054
ums+0.047
 Sorry+0.046
 size+0.046
omm+0.045
Neg logit contributions
 rubble-0.053
leaf-0.047
 stret-0.047
ts-0.045
sm-0.044
Token.
Token ID13
Feature activation+2.94e-02

Pos logit contributions
omo+0.011
 size+0.010
 head+0.009
omm+0.009
ums+0.009
Neg logit contributions
 rubble-0.013
leaf-0.013
 stret-0.012
sm-0.011
lves-0.011
Token The
Token ID383
Feature activation+2.14e-02

Pos logit contributions
omo+0.043
 size+0.040
 head+0.035
ums+0.034
omm+0.033
Neg logit contributions
 rubble-0.053
leaf-0.050
 stret-0.050
Hop-0.046
sm-0.046
Token cr
Token ID1067
Feature activation+8.02e-02

Pos logit contributions
omo+0.035
 size+0.031
 head+0.029
omm+0.028
ums+0.027
Neg logit contributions
 rubble-0.037
leaf-0.036
 stret-0.035
lves-0.032
sm-0.032
Tokenay
Token ID323
Feature activation+4.89e-03

Pos logit contributions
omo+0.119
 size+0.109
 Sorry+0.107
 head+0.103
 seeker+0.099
Neg logit contributions
 rubble-0.140
leaf-0.126
aid-0.120
ds-0.119
sm-0.119
Tokenons
Token ID684
Feature activation+2.84e-02

Pos logit contributions
omo+0.010
 size+0.009
 head+0.008
omm+0.008
ums+0.008
Neg logit contributions
 rubble-0.007
leaf-0.006
 stret-0.006
lves-0.005
sm-0.005
Token were
Token ID547
Feature activation+2.30e-02

Pos logit contributions
omo+0.044
 size+0.038
omm+0.035
ums+0.034
 head+0.034
Neg logit contributions
 rubble-0.052
leaf-0.049
 stret-0.048
ts-0.044
sm-0.044
Token sticky
Token ID23408
Feature activation+1.30e-02

Pos logit contributions
omo+0.040
 size+0.035
ums+0.033
omm+0.033
 Sorry+0.032
Neg logit contributions
 rubble-0.037
leaf-0.035
 stret-0.034
ts-0.031
sm-0.031
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
omo+0.021
 size+0.018
omm+0.016
 head+0.016
ums+0.016
Neg logit contributions
 rubble-0.023
leaf-0.022
 stret-0.021
sm-0.020
ts-0.020
Token or
Token ID393
Feature activation+9.60e-03

Pos logit contributions
omo+0.029
 size+0.025
omm+0.023
ums+0.023
 head+0.023
Neg logit contributions
 rubble-0.032
leaf-0.030
 stret-0.030
sm-0.027
lves-0.027
Token get
Token ID651
Feature activation+2.37e-02

Pos logit contributions
omo+0.013
 size+0.012
 head+0.011
omm+0.010
ums+0.010
Neg logit contributions
 rubble-0.018
leaf-0.018
 stret-0.017
sm-0.016
lves-0.016
Token stuck
Token ID7819
Feature activation+2.67e-02

Pos logit contributions
 size+0.035
omo+0.035
 Mom+0.033
 head+0.031
 only+0.031
Neg logit contributions
leaf-0.040
 stret-0.040
 rubble-0.039
Hop-0.036
ival-0.035
Token.
Token ID13
Feature activation+3.70e-02

Pos logit contributions
omo+0.039
 size+0.035
 head+0.032
ums+0.031
omm+0.030
Neg logit contributions
 rubble-0.050
leaf-0.048
 stret-0.046
lves-0.043
sm-0.042
Token The
Token ID383
Feature activation+1.94e-02

Pos logit contributions
omo+0.061
 size+0.061
 head+0.051
master+0.049
ums+0.047
Neg logit contributions
leaf-0.058
 rubble-0.057
 stret-0.052
Hop-0.052
Here-0.051
Token cr
Token ID1067
Feature activation+7.96e-02

Pos logit contributions
omo+0.033
 size+0.031
 head+0.028
 Mom+0.026
ums+0.025
Neg logit contributions
 rubble-0.032
leaf-0.032
 stret-0.030
sm-0.028
lves-0.028
Tokenay
Token ID323
Feature activation+5.70e-03

Pos logit contributions
omo+0.132
 size+0.117
 Sorry+0.107
 head+0.106
 Mom+0.103
Neg logit contributions
 rubble-0.136
leaf-0.126
sm-0.114
ds-0.114
 stret-0.112
Tokenons
Token ID684
Feature activation+2.71e-02

Pos logit contributions
omo+0.009
 size+0.008
 head+0.008
ums+0.008
 Sorry+0.007
Neg logit contributions
 rubble-0.010
 stret-0.009
leaf-0.009
lves-0.008
Hop-0.008
Token and
Token ID290
Feature activation+2.02e-03

Pos logit contributions
omo+0.038
 size+0.034
 head+0.031
ums+0.029
 Sorry+0.029
Neg logit contributions
 rubble-0.051
leaf-0.050
 stret-0.048
lves-0.045
sm-0.045
Token the
Token ID262
Feature activation-4.47e-03

Pos logit contributions
omo+0.004
 size+0.003
omm+0.003
ums+0.003
 head+0.003
Neg logit contributions
 rubble-0.003
leaf-0.003
 stret-0.003
ts-0.002
sm-0.002
Token clay
Token ID21558
Feature activation+4.13e-02

Pos logit contributions
leaf+0.007
 rubble+0.007
 stret+0.007
sm+0.007
lves+0.007
Neg logit contributions
omo-0.007
 size-0.007
 head-0.007
 Mom-0.006
 throat-0.006
Token elf
Token ID23878
Feature activation-3.46e-02

Pos logit contributions
 rubble+0.012
 stret+0.011
leaf+0.011
ds+0.011
 unnecessary+0.010
Neg logit contributions
omo-0.012
 size-0.011
 head-0.011
omm-0.011
 Mom-0.010
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
 rubble+0.064
leaf+0.061
 stret+0.059
lves+0.055
ts+0.055
Neg logit contributions
omo-0.051
 size-0.046
 head-0.043
ums-0.041
omm-0.041
Token waved
Token ID26834
Feature activation-1.93e-02

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.026
Hop+0.024
sm+0.024
Neg logit contributions
omo-0.030
 size-0.026
omm-0.025
ums-0.025
 Sorry-0.023
Token goodbye
Token ID24829
Feature activation-1.10e-02

Pos logit contributions
 rubble+0.029
leaf+0.027
 stret+0.026
lves+0.023
ts+0.023
Neg logit contributions
omo-0.036
 size-0.033
 head-0.031
 Mom-0.030
ums-0.030
Token as
Token ID355
Feature activation-3.56e-02

Pos logit contributions
 rubble+0.022
leaf+0.021
 stret+0.020
lves+0.019
ts+0.019
Neg logit contributions
omo-0.014
 size-0.014
 head-0.013
 Mom-0.012
ums-0.011
Token<|endoftext|>
Token ID50256
Feature activation-1.04e-01

Pos logit contributions
 rubble+0.051
leaf+0.049
sm+0.044
 stret+0.044
Hop+0.042
Neg logit contributions
omo-0.068
 size-0.059
omm-0.057
 head-0.057
ums-0.056
TokenTom
Token ID13787
Feature activation-3.81e-03

Pos logit contributions
 rubble+0.198
 stret+0.181
leaf+0.181
 unnecessary+0.170
lves+0.169
Neg logit contributions
omo-0.156
 Mom-0.132
 Me-0.131
 size-0.128
ums-0.124
Tokenmy
Token ID1820
Feature activation-4.80e-03

Pos logit contributions
 rubble+0.005
leaf+0.005
 stret+0.004
sm+0.004
 outdoors+0.004
Neg logit contributions
omo-0.007
 size-0.007
 Sorry-0.006
omm-0.006
ums-0.006
Token was
Token ID373
Feature activation-9.29e-03

Pos logit contributions
 rubble+0.008
leaf+0.008
 stret+0.008
sm+0.007
Hop+0.007
Neg logit contributions
omo-0.008
 size-0.007
omm-0.006
ums-0.006
 Sorry-0.006
Token walking
Token ID6155
Feature activation-5.92e-02

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.011
lves+0.010
ts+0.010
Neg logit contributions
omo-0.018
 size-0.017
 head-0.015
omm-0.015
ums-0.015
Token in
Token ID287
Feature activation-3.27e-03

Pos logit contributions
 rubble+0.103
leaf+0.097
 stret+0.094
ts+0.089
lves+0.088
Neg logit contributions
omo-0.091
 size-0.090
 head-0.088
 Mom-0.085
 Mum-0.077
Token prince
Token ID19716
Feature activation+8.09e-03

Pos logit contributions
omo+0.016
 size+0.016
 head+0.015
omm+0.013
 Mom+0.013
Neg logit contributions
 rubble-0.020
 stret-0.019
leaf-0.019
lves-0.018
sm-0.017
Token thanked
Token ID26280
Feature activation-2.02e-02

Pos logit contributions
omo+0.012
 size+0.010
omm+0.010
ums+0.010
 Sorry+0.010
Neg logit contributions
 rubble-0.015
leaf-0.014
 stret-0.013
Hop-0.013
sm-0.013
Token the
Token ID262
Feature activation-3.16e-03

Pos logit contributions
 rubble+0.030
leaf+0.029
 stret+0.028
sm+0.025
lves+0.025
Neg logit contributions
omo-0.037
 size-0.033
 head-0.031
omm-0.031
ums-0.030
Token wizard
Token ID18731
Feature activation-2.09e-02

Pos logit contributions
 rubble+0.006
 stret+0.006
leaf+0.005
lves+0.005
sm+0.005
Neg logit contributions
 size-0.005
omo-0.004
 head-0.004
omm-0.004
ums-0.004
Token and
Token ID290
Feature activation-6.99e-03

Pos logit contributions
 rubble+0.040
leaf+0.038
 stret+0.037
lves+0.034
ts+0.034
Neg logit contributions
omo-0.030
 size-0.027
 head-0.024
ums-0.024
omm-0.023
Token<|endoftext|>
Token ID50256
Feature activation-1.04e-01

Pos logit contributions
 rubble+0.011
leaf+0.011
 stret+0.010
sm+0.010
Hop+0.010
Neg logit contributions
omo-0.012
 size-0.011
omm-0.010
ums-0.010
 Sorry-0.009
TokenOnce
Token ID7454
Feature activation-5.86e-03

Pos logit contributions
 rubble+0.181
 stret+0.167
leaf+0.164
lves+0.152
 unnecessary+0.149
Neg logit contributions
omo-0.173
 size-0.147
 Mom-0.143
ums-0.141
 Me-0.140
Token upon
Token ID2402
Feature activation-4.80e-03

Pos logit contributions
 rubble+0.008
leaf+0.008
 stret+0.008
lves+0.007
Hop+0.007
Neg logit contributions
omo-0.012
 size-0.010
 Mom-0.010
ums-0.009
 head-0.009
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
 rubble+0.009
 stret+0.009
lves+0.009
leaf+0.009
 unnecessary+0.008
Neg logit contributions
omo-0.006
 Mom-0.006
 size-0.005
 only-0.005
 mom-0.005
Token time
Token ID640
Feature activation-1.07e-02

Pos logit contributions
omo+0.019
 size+0.016
 head+0.015
omm+0.015
ums+0.014
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
lves-0.017
ts-0.017
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
 rubble+0.025
leaf+0.024
 stret+0.024
lves+0.022
ts+0.022
Neg logit contributions
omo-0.011
 size-0.009
omm-0.008
 Mom-0.008
ums-0.008
Tokenia
Token ID544
Feature activation-2.00e-03

Pos logit contributions
omo+0.014
 size+0.013
 Sorry+0.013
 seeker+0.011
 head+0.011
Neg logit contributions
 rubble-0.022
leaf-0.021
sm-0.020
ts-0.020
 stret-0.019
Token smiled
Token ID13541
Feature activation+1.09e-02

Pos logit contributions
leaf+0.003
 rubble+0.003
 stret+0.003
sm+0.003
Hop+0.003
Neg logit contributions
omo-0.004
 size-0.003
 Sorry-0.003
omm-0.003
ums-0.003
Token and
Token ID290
Feature activation-4.84e-02

Pos logit contributions
omo+0.023
 Sorry+0.020
mm+0.019
omm+0.019
 size+0.019
Neg logit contributions
 outdoors-0.011
 rubble-0.011
 aback-0.011
 everywhere-0.011
 nearby-0.011
Token went
Token ID1816
Feature activation-4.09e-02

Pos logit contributions
leaf+0.061
 rubble+0.053
aid+0.048
Hop+0.047
sm+0.047
Neg logit contributions
omo-0.103
 size-0.094
omm-0.093
ums-0.088
 loser-0.088
Token back
Token ID736
Feature activation+4.19e-03

Pos logit contributions
 rubble+0.073
leaf+0.066
lves+0.064
 stret+0.063
ts+0.062
Neg logit contributions
omo-0.063
 size-0.056
 head-0.056
 Mom-0.054
ums-0.052
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
omo+0.006
 size+0.006
 head+0.005
 Mom+0.005
ums+0.005
Neg logit contributions
 rubble-0.008
leaf-0.007
 stret-0.007
lves-0.007
Hop-0.006
TokenOnce
Token ID7454
Feature activation-4.09e-03

Pos logit contributions
 rubble+0.181
 stret+0.167
leaf+0.165
lves+0.153
 unnecessary+0.150
Neg logit contributions
omo-0.170
 Mom-0.147
 size-0.146
ums-0.141
 Me-0.137
Token upon
Token ID2402
Feature activation-5.76e-03

Pos logit contributions
 rubble+0.006
leaf+0.005
 stret+0.005
Hop+0.004
lves+0.004
Neg logit contributions
omo-0.008
 size-0.007
 Mom-0.007
 head-0.007
ums-0.007
Token a
Token ID257
Feature activation+1.14e-02

Pos logit contributions
 rubble+0.011
 stret+0.011
leaf+0.011
lves+0.011
ival+0.010
Neg logit contributions
omo-0.008
 Mom-0.007
 size-0.007
 only-0.007
 head-0.006
Token time
Token ID640
Feature activation-1.08e-02

Pos logit contributions
omo+0.019
 size+0.017
 head+0.015
omm+0.015
ums+0.015
Neg logit contributions
 rubble-0.019
leaf-0.018
 stret-0.018
lves-0.016
sm-0.016
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
 rubble+0.025
leaf+0.024
 stret+0.023
lves+0.022
ts+0.021
Neg logit contributions
omo-0.012
 size-0.009
omm-0.008
ums-0.008
 head-0.008
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
 rubble+0.001
sm+0.001
leaf+0.001
Hop+0.001
 nearby+0.001
Neg logit contributions
omo-0.002
 Sorry-0.001
omm-0.001
 size-0.001
ums-0.001
Token the
Token ID262
Feature activation+1.09e-03

Pos logit contributions
omo+0.018
 size+0.017
 head+0.016
ums+0.015
 Mom+0.015
Neg logit contributions
 rubble-0.017
leaf-0.016
 stret-0.016
lves-0.014
sm-0.014
Token rabbit
Token ID22746
Feature activation-1.72e-02

Pos logit contributions
omo+0.001
 size+0.001
 head+0.001
 mother+0.001
 Mom+0.001
Neg logit contributions
leaf-0.002
 rubble-0.002
 stret-0.002
ts-0.002
and-0.002
Token were
Token ID547
Feature activation-4.63e-04

Pos logit contributions
 rubble+0.031
leaf+0.029
 stret+0.029
sm+0.027
lves+0.026
Neg logit contributions
omo-0.027
 size-0.023
omm-0.021
ums-0.021
 head-0.021
Token the
Token ID262
Feature activation-3.68e-02

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
sm+0.001
lves+0.001
Neg logit contributions
omo-0.001
 size-0.001
omm-0.001
 Sorry-0.001
 head-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
 rubble+0.089
leaf+0.086
 stret+0.084
sm+0.080
lves+0.080
Neg logit contributions
omo-0.035
 size-0.027
 head-0.022
omm-0.022
ums-0.022
TokenOnce
Token ID7454
Feature activation-1.74e-03

Pos logit contributions
 rubble+0.196
 stret+0.182
leaf+0.182
lves+0.170
 unnecessary+0.168
Neg logit contributions
omo-0.156
 Mom-0.135
 size-0.129
 Me-0.126
ums-0.123
Token there
Token ID612
Feature activation+7.38e-03

Pos logit contributions
leaf+0.003
 rubble+0.003
 stret+0.002
Hop+0.002
 risks+0.002
Neg logit contributions
omo-0.003
 size-0.003
 Mom-0.003
 head-0.003
ums-0.003
Token was
Token ID373
Feature activation+2.06e-02

Pos logit contributions
omo+0.009
 size+0.008
omm+0.007
ums+0.007
 head+0.007
Neg logit contributions
 rubble-0.015
leaf-0.014
 stret-0.014
sm-0.013
lves-0.013
Token a
Token ID257
Feature activation+2.20e-03

Pos logit contributions
omo+0.029
 Mom+0.027
 only+0.024
 size+0.024
 Mum+0.022
Neg logit contributions
 rubble-0.041
 stret-0.038
leaf-0.037
lves-0.036
ts-0.036
Token man
Token ID582
Feature activation+7.11e-03

Pos logit contributions
omo+0.004
 size+0.004
omm+0.003
ums+0.003
 head+0.003
Neg logit contributions
 rubble-0.003
leaf-0.003
 stret-0.003
sm-0.003
lves-0.003
Token expected
Token ID2938
Feature activation-9.69e-03

Pos logit contributions
 rubble+0.016
leaf+0.016
 stret+0.016
Hop+0.014
lves+0.014
Neg logit contributions
omo-0.028
 size-0.028
 head-0.027
 only-0.025
 be-0.025
Tokenâ
Token ID22940
Feature activation-1.66e-02

Pos logit contributions
 rubble+0.017
leaf+0.016
 stret+0.015
sm+0.014
 outdoors+0.014
Neg logit contributions
omo-0.016
 size-0.013
omm-0.012
ums-0.012
 head-0.012
Token
Token ID26391
Feature activation-1.75e-02

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.026
sm+0.024
lves+0.024
Neg logit contributions
omo-0.028
 size-0.024
omm-0.022
ums-0.022
 head-0.022
Token
Token ID126
Feature activation-2.56e-02

Pos logit contributions
 rubble+0.029
 stret+0.029
leaf+0.028
sm+0.026
lves+0.026
Neg logit contributions
omo-0.028
 size-0.025
 head-0.022
omm-0.022
ums-0.021
Token
Token ID99
Feature activation+2.00e-02

Pos logit contributions
 rubble+0.040
leaf+0.038
 stret+0.036
lves+0.033
sm+0.033
Neg logit contributions
omo-0.046
 size-0.040
 head-0.038
omm-0.038
ums-0.037
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
omo+0.035
 size+0.031
 head+0.029
omm+0.028
ums+0.028
Neg logit contributions
 rubble-0.032
leaf-0.031
 stret-0.030
sm-0.027
lves-0.027
TokenOnce
Token ID7454
Feature activation-8.29e-03

Pos logit contributions
 rubble+0.177
leaf+0.160
 stret+0.160
lves+0.148
 unnecessary+0.146
Neg logit contributions
omo-0.174
 size-0.148
 Mom-0.147
ums-0.144
 Me-0.144
Token upon
Token ID2402
Feature activation-9.65e-03

Pos logit contributions
 rubble+0.012
leaf+0.012
 stret+0.011
Hop+0.010
lves+0.010
Neg logit contributions
omo-0.016
 size-0.014
 Mom-0.014
 head-0.013
ums-0.013
Token a
Token ID257
Feature activation+1.30e-02

Pos logit contributions
 rubble+0.019
 stret+0.018
lves+0.018
leaf+0.018
 unnecessary+0.016
Neg logit contributions
omo-0.012
 Mom-0.012
 only-0.011
 size-0.010
 mom-0.010
Token time
Token ID640
Feature activation-1.82e-02

Pos logit contributions
omo+0.021
 size+0.018
 head+0.016
omm+0.016
ums+0.016
Neg logit contributions
 rubble-0.024
leaf-0.022
 stret-0.021
lves-0.020
ts-0.020
Token there
Token ID612
Feature activation+2.47e-03

Pos logit contributions
 rubble+0.042
leaf+0.041
 stret+0.040
lves+0.037
ts+0.037
Neg logit contributions
omo-0.019
 size-0.015
omm-0.013
ums-0.013
 Mom-0.013
Token she
Token ID673
Feature activation-8.00e-04

Pos logit contributions
 rubble+0.028
leaf+0.026
 stret+0.025
sm+0.023
lves+0.023
Neg logit contributions
omo-0.026
 size-0.023
 head-0.021
omm-0.021
ums-0.021
Token couldn
Token ID3521
Feature activation-1.47e-02

Pos logit contributions
 rubble+0.001
 stret+0.001
leaf+0.001
lves+0.001
ts+0.001
Neg logit contributions
omo-0.001
 size-0.001
ums-0.001
omm-0.001
mm-0.001
Token't
Token ID470
Feature activation-3.29e-03

Pos logit contributions
 rubble+0.022
 stret+0.020
leaf+0.020
Roger+0.018
ival+0.018
Neg logit contributions
omo-0.026
 size-0.026
 throat-0.023
 head-0.022
ums-0.022
Token get
Token ID651
Feature activation+1.88e-02

Pos logit contributions
 rubble+0.006
leaf+0.006
 stret+0.005
lves+0.005
sm+0.005
Neg logit contributions
omo-0.005
 size-0.004
 head-0.004
omm-0.004
ums-0.004
Token it
Token ID340
Feature activation-1.71e-02

Pos logit contributions
omo+0.026
 Mom+0.026
 size+0.025
 only+0.025
 head+0.024
Neg logit contributions
 rubble-0.034
leaf-0.033
 stret-0.032
Hop-0.029
ival-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
leaf+0.026
 rubble+0.026
 stret+0.024
Hop+0.023
ts+0.023
Neg logit contributions
omo-0.030
 size-0.028
 head-0.025
 Mom-0.024
omm-0.024
TokenOnce
Token ID7454
Feature activation-2.50e-03

Pos logit contributions
 rubble+0.192
leaf+0.179
 stret+0.178
lves+0.166
 unnecessary+0.166
Neg logit contributions
omo-0.159
 Mom-0.136
 size-0.133
 Me-0.128
ums-0.127
Token upon
Token ID2402
Feature activation-3.28e-03

Pos logit contributions
leaf+0.004
 rubble+0.004
 stret+0.004
Hop+0.003
 risks+0.003
Neg logit contributions
omo-0.005
 size-0.004
 Mom-0.004
omm-0.004
ums-0.004
Token a
Token ID257
Feature activation+8.15e-03

Pos logit contributions
 rubble+0.006
leaf+0.006
lves+0.006
 stret+0.006
 unnecessary+0.005
Neg logit contributions
omo-0.004
 Mom-0.004
 only-0.004
 size-0.004
 head-0.004
Token time
Token ID640
Feature activation-1.61e-02

Pos logit contributions
omo+0.013
 size+0.011
 head+0.010
omm+0.010
 Mom+0.010
Neg logit contributions
 rubble-0.015
leaf-0.015
 stret-0.014
lves-0.013
ts-0.013
Token there
Token ID612
Feature activation-4.20e-03

Pos logit contributions
 rubble+0.038
leaf+0.037
 stret+0.035
lves+0.033
ts+0.033
Neg logit contributions
omo-0.017
 size-0.014
omm-0.012
 Mom-0.012
ums-0.012
Token that
Token ID326
Feature activation+3.11e-03

Pos logit contributions
omo+0.032
 size+0.027
omm+0.026
ums+0.025
 Sorry+0.025
Neg logit contributions
 rubble-0.032
leaf-0.031
 stret-0.030
sm-0.028
lves-0.027
Token she
Token ID673
Feature activation-1.36e-02

Pos logit contributions
omo+0.006
 size+0.005
omm+0.005
ums+0.005
 head+0.005
Neg logit contributions
 rubble-0.005
leaf-0.004
 stret-0.004
sm-0.004
 outdoors-0.004
Token hugged
Token ID41130
Feature activation-7.09e-03

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.022
Hop+0.021
sm+0.021
Neg logit contributions
omo-0.022
 size-0.019
ums-0.018
omm-0.018
 Sorry-0.017
Token the
Token ID262
Feature activation-1.93e-02

Pos logit contributions
 rubble+0.012
leaf+0.011
 stret+0.011
lves+0.010
ts+0.010
Neg logit contributions
omo-0.012
 size-0.011
 head-0.010
omm-0.010
ums-0.010
Token little
Token ID1310
Feature activation-2.37e-02

Pos logit contributions
 rubble+0.039
leaf+0.038
ds+0.036
irs+0.036
 stret+0.036
Neg logit contributions
 head-0.026
 size-0.025
omo-0.023
 only-0.023
 mom-0.022
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
 rubble+0.039
leaf+0.037
 stret+0.035
lves+0.032
sm+0.032
Neg logit contributions
omo-0.041
 size-0.037
 head-0.034
omm-0.033
ums-0.033
TokenOnce
Token ID7454
Feature activation-4.52e-04

Pos logit contributions
 rubble+0.183
 stret+0.168
leaf+0.167
lves+0.155
 unnecessary+0.153
Neg logit contributions
omo-0.167
 Mom-0.144
 size-0.143
 Me-0.138
ums-0.136
Token upon
Token ID2402
Feature activation-6.02e-03

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
Hop+0.001
ts+0.001
Neg logit contributions
omo-0.001
 size-0.001
 Mom-0.001
 head-0.001
ums-0.001
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
 rubble+0.012
 stret+0.012
lves+0.012
leaf+0.012
ds+0.010
Neg logit contributions
omo-0.007
 Mom-0.007
 only-0.007
 size-0.006
 no-0.006
Token time
Token ID640
Feature activation-9.03e-03

Pos logit contributions
omo+0.025
 size+0.022
 head+0.020
omm+0.019
ums+0.019
Neg logit contributions
 rubble-0.029
leaf-0.028
 stret-0.026
ts-0.025
lves-0.025
Token there
Token ID612
Feature activation+5.96e-03

Pos logit contributions
 rubble+0.021
leaf+0.021
 stret+0.020
lves+0.019
ts+0.018
Neg logit contributions
omo-0.010
 size-0.008
omm-0.007
 Mom-0.006
ums-0.006
Token smiled
Token ID13541
Feature activation+1.14e-02

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
sm+0.003
Hop+0.003
Neg logit contributions
omo-0.003
 size-0.003
ums-0.002
omm-0.002
 Sorry-0.002
Token to
Token ID284
Feature activation-1.53e-02

Pos logit contributions
omo+0.022
mm+0.018
 Sorry+0.018
 size+0.018
omm+0.017
Neg logit contributions
 rubble-0.016
leaf-0.014
 stret-0.014
sm-0.014
 outdoors-0.014
Token herself
Token ID5223
Feature activation-1.61e-02

Pos logit contributions
 rubble+0.020
leaf+0.019
 stret+0.018
sm+0.016
ts+0.016
Neg logit contributions
omo-0.032
 size-0.028
omm-0.026
ums-0.026
 Sorry-0.026
Token,
Token ID11
Feature activation-2.10e-02

Pos logit contributions
 rubble+0.033
leaf+0.033
 stret+0.031
lves+0.029
ts+0.029
Neg logit contributions
omo-0.021
 size-0.019
 head-0.017
ums-0.016
 Mom-0.016
Token remembering
Token ID24865
Feature activation+1.48e-02

Pos logit contributions
 rubble+0.039
leaf+0.038
 stret+0.036
ts+0.034
lves+0.034
Neg logit contributions
omo-0.031
 size-0.029
 head-0.027
 Mom-0.026
 only-0.025
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
omo+0.024
 size+0.023
 Mom+0.022
 only+0.021
 head+0.021
Neg logit contributions
 rubble-0.024
 stret-0.023
leaf-0.023
lves-0.022
Hop-0.021
TokenJohn
Token ID7554
Feature activation-1.25e-02

Pos logit contributions
 rubble+0.199
 stret+0.185
leaf+0.182
lves+0.170
 unnecessary+0.169
Neg logit contributions
omo-0.151
 Mom-0.133
 size-0.125
 Me-0.124
ums-0.121
Token was
Token ID373
Feature activation-1.28e-02

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.022
lves+0.021
ts+0.020
Neg logit contributions
omo-0.018
 size-0.016
 head-0.014
ums-0.014
omm-0.014
Token preparing
Token ID10629
Feature activation-2.53e-02

Pos logit contributions
 rubble+0.017
leaf+0.016
 stret+0.015
lves+0.013
ts+0.013
Neg logit contributions
omo-0.026
 size-0.024
 head-0.022
omm-0.022
ums-0.022
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
 rubble+0.043
leaf+0.041
 stret+0.040
lves+0.040
Hop+0.038
Neg logit contributions
omo-0.040
 size-0.038
 Mom-0.036
 head-0.034
ums-0.033
Token go
Token ID467
Feature activation-2.98e-02

Pos logit contributions
 rubble+0.025
leaf+0.024
 stret+0.023
 outdoors+0.021
ts+0.021
Neg logit contributions
omo-0.026
 size-0.022
omm-0.021
ums-0.021
 head-0.019
Token was
Token ID373
Feature activation+4.61e-04

Pos logit contributions
 rubble+0.050
leaf+0.049
 stret+0.046
sm+0.044
Hop+0.042
Neg logit contributions
omo-0.048
 size-0.043
ums-0.040
omm-0.040
 head-0.039
Token just
Token ID655
Feature activation-1.01e-02

Pos logit contributions
omo+0.001
 size+0.001
 head+0.001
omm+0.001
ums+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
lves-0.001
sm-0.001
Token in
Token ID287
Feature activation-5.54e-03

Pos logit contributions
 rubble+0.016
 stret+0.016
leaf+0.015
Hop+0.014
lves+0.014
Neg logit contributions
omo-0.016
 size-0.016
 head-0.015
 Mom-0.015
 only-0.014
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
 rubble+0.008
leaf+0.007
 stret+0.007
sm+0.007
 outdoors+0.006
Neg logit contributions
omo-0.011
 size-0.009
omm-0.009
ums-0.009
 head-0.009
Token other
Token ID584
Feature activation+1.25e-02

Pos logit contributions
 rubble+0.022
leaf+0.021
 stret+0.020
sm+0.018
lves+0.018
Neg logit contributions
omo-0.025
 size-0.022
ums-0.020
omm-0.020
 head-0.020
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
omo+0.021
omm+0.017
 size+0.017
ums+0.017
 Sorry+0.017
Neg logit contributions
 rubble-0.021
 stret-0.019
leaf-0.019
 outdoors-0.018
sm-0.017
TokenOnce
Token ID7454
Feature activation-6.24e-03

Pos logit contributions
 rubble+0.179
leaf+0.166
 stret+0.166
lves+0.155
 unnecessary+0.149
Neg logit contributions
omo-0.169
 size-0.143
 Mom-0.143
ums-0.137
omm-0.134
Token upon
Token ID2402
Feature activation-9.78e-03

Pos logit contributions
 rubble+0.008
leaf+0.008
 stret+0.008
Hop+0.007
lves+0.007
Neg logit contributions
omo-0.013
 size-0.011
 head-0.011
ums-0.011
omm-0.011
Token a
Token ID257
Feature activation+7.41e-03

Pos logit contributions
 rubble+0.018
 stret+0.017
leaf+0.017
lves+0.017
 unnecessary+0.015
Neg logit contributions
omo-0.014
 Mom-0.013
 size-0.012
 only-0.012
 head-0.011
Token time
Token ID640
Feature activation-1.38e-02

Pos logit contributions
omo+0.012
 size+0.010
 head+0.009
omm+0.009
ums+0.009
Neg logit contributions
 rubble-0.013
leaf-0.013
 stret-0.012
lves-0.011
ts-0.011
Token,
Token ID11
Feature activation+9.63e-03

Pos logit contributions
 rubble+0.030
leaf+0.029
 stret+0.028
sm+0.026
lves+0.026
Neg logit contributions
omo-0.017
 size-0.014
 head-0.012
omm-0.012
ums-0.012
Token,
Token ID11
Feature activation-5.45e-03

Pos logit contributions
 rubble+0.017
leaf+0.017
 stret+0.015
lves+0.015
ts+0.015
Neg logit contributions
omo-0.019
 size-0.018
 head-0.017
 Mom-0.016
ums-0.016
Token but
Token ID475
Feature activation+1.18e-03

Pos logit contributions
 rubble+0.009
leaf+0.009
 stret+0.008
lves+0.008
sm+0.008
Neg logit contributions
omo-0.009
 size-0.008
 head-0.007
ums-0.007
omm-0.007
Token she
Token ID673
Feature activation+1.68e-02

Pos logit contributions
omo+0.002
 size+0.002
omm+0.002
ums+0.002
 head+0.002
Neg logit contributions
 rubble-0.002
leaf-0.002
 stret-0.002
sm-0.002
lves-0.002
Token knew
Token ID2993
Feature activation-1.33e-02

Pos logit contributions
omo+0.026
 size+0.022
ums+0.022
omm+0.022
 Sorry+0.021
Neg logit contributions
 rubble-0.029
leaf-0.027
 stret-0.026
sm-0.026
Hop-0.025
Token she
Token ID673
Feature activation-1.28e-02

Pos logit contributions
 rubble+0.020
leaf+0.018
 stret+0.018
sm+0.016
lves+0.016
Neg logit contributions
omo-0.025
 size-0.022
 head-0.021
omm-0.021
ums-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
 rubble+0.022
leaf+0.021
sm+0.019
 stret+0.019
ts+0.018
Neg logit contributions
omo-0.021
 size-0.018
ums-0.017
omm-0.017
 Sorry-0.017
TokenM
Token ID44
Feature activation-1.81e-02

Pos logit contributions
 rubble+0.174
 stret+0.158
leaf+0.158
lves+0.148
 unnecessary+0.143
Neg logit contributions
omo-0.172
 size-0.148
 Mom-0.148
ums-0.142
 Me-0.142
Tokenandy
Token ID10757
Feature activation-3.05e-02

Pos logit contributions
 rubble+0.037
leaf+0.036
 stret+0.035
sm+0.034
 outdoors+0.032
Neg logit contributions
omo-0.021
 size-0.019
 head-0.017
 Sorry-0.016
ums-0.015
Token opened
Token ID4721
Feature activation-1.10e-02

Pos logit contributions
 rubble+0.055
leaf+0.053
 stret+0.052
sm+0.048
Hop+0.048
Neg logit contributions
omo-0.046
 size-0.040
omm-0.037
ums-0.036
 head-0.035
Token the
Token ID262
Feature activation-3.14e-02

Pos logit contributions
 rubble+0.022
leaf+0.021
 stret+0.021
 unnecessary+0.020
lves+0.019
Neg logit contributions
omo-0.014
 Mom-0.014
 size-0.013
 Mum-0.013
 mom-0.012
Token tiny
Token ID7009
Feature activation-2.33e-02

Pos logit contributions
 rubble+0.055
leaf+0.053
 stret+0.052
ts+0.049
lves+0.048
Neg logit contributions
omo-0.049
 size-0.044
 head-0.041
omm-0.040
 Mom-0.040
Token,
Token ID11
Feature activation-7.13e-03

Pos logit contributions
omo+0.017
omm+0.014
 size+0.014
 Sorry+0.014
ums+0.013
Neg logit contributions
 rubble-0.016
leaf-0.015
 stret-0.015
 outdoors-0.014
sm-0.014
Token and
Token ID290
Feature activation+2.96e-03

Pos logit contributions
 rubble+0.012
leaf+0.011
 stret+0.011
lves+0.010
ts+0.010
Neg logit contributions
omo-0.012
 size-0.011
 head-0.010
ums-0.009
omm-0.009
Token she
Token ID673
Feature activation+3.76e-03

Pos logit contributions
omo+0.005
 size+0.004
omm+0.004
ums+0.004
 head+0.004
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.004
sm-0.004
lves-0.004
Token was
Token ID373
Feature activation+6.55e-03

Pos logit contributions
omo+0.006
omm+0.005
ums+0.005
mm+0.005
 size+0.005
Neg logit contributions
 rubble-0.006
leaf-0.006
Hop-0.006
sm-0.006
 stret-0.006
Token right
Token ID826
Feature activation+7.86e-04

Pos logit contributions
omo+0.013
 size+0.012
omm+0.011
ums+0.011
 Sorry+0.011
Neg logit contributions
 rubble-0.009
leaf-0.008
 stret-0.008
sm-0.007
lves-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
omo+0.001
 size+0.001
omm+0.001
 Sorry+0.001
ums+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
sm-0.001
Hop-0.001
TokenOnce
Token ID7454
Feature activation+7.95e-04

Pos logit contributions
 rubble+0.193
leaf+0.177
 stret+0.177
lves+0.163
 unnecessary+0.162
Neg logit contributions
omo-0.155
 Mom-0.135
 size-0.129
 Me-0.126
ums-0.124
Token upon
Token ID2402
Feature activation-2.03e-03

Pos logit contributions
omo+0.002
 size+0.001
 Mom+0.001
 head+0.001
ums+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
lves-0.001
Hop-0.001
Token a
Token ID257
Feature activation+1.14e-02

Pos logit contributions
 rubble+0.004
 stret+0.004
leaf+0.004
lves+0.004
ival+0.003
Neg logit contributions
omo-0.003
 Mom-0.002
 size-0.002
 only-0.002
 head-0.002
Token time
Token ID640
Feature activation-1.46e-02

Pos logit contributions
omo+0.018
 size+0.016
 head+0.014
omm+0.014
ums+0.014
Neg logit contributions
 rubble-0.020
leaf-0.019
 stret-0.018
lves-0.017
ts-0.017
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
 rubble+0.032
leaf+0.031
 stret+0.030
lves+0.028
sm+0.028
Neg logit contributions
omo-0.017
 size-0.014
 head-0.012
omm-0.012
ums-0.012
Token buying
Token ID7067
Feature activation+3.51e-03

Pos logit contributions
 rubble+0.021
leaf+0.020
 stret+0.019
Hop+0.018
lves+0.018
Neg logit contributions
omo-0.022
 size-0.020
 head-0.019
ums-0.018
 Mom-0.018
Token it
Token ID340
Feature activation-1.34e-02

Pos logit contributions
omo+0.006
 size+0.006
 head+0.005
 Mom+0.005
ums+0.005
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.005
lves-0.004
Hop-0.004
Token.
Token ID13
Feature activation-3.75e-03

Pos logit contributions
 rubble+0.022
leaf+0.021
 stret+0.020
sm+0.019
 outdoors+0.018
Neg logit contributions
omo-0.023
 size-0.020
omm-0.019
ums-0.019
 head-0.018
Tokenâ
Token ID22940
Feature activation-1.01e-02

Pos logit contributions
leaf+0.006
 rubble+0.006
 stret+0.006
Hop+0.006
sm+0.006
Neg logit contributions
omo-0.006
 size-0.005
 head-0.005
ums-0.005
omm-0.005
Token
Token ID26391
Feature activation-1.59e-02

Pos logit contributions
 rubble+0.019
leaf+0.018
 stret+0.018
lves+0.016
sm+0.016
Neg logit contributions
omo-0.015
 size-0.013
 head-0.012
omm-0.012
ums-0.011
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-01

Pos logit contributions
 rubble+0.026
leaf+0.025
 stret+0.025
sm+0.022
Hop+0.022
Neg logit contributions
omo-0.026
 size-0.024
omm-0.021
 head-0.021
ums-0.021
TokenOnce
Token ID7454
Feature activation-4.58e-03

Pos logit contributions
 rubble+0.169
 stret+0.153
leaf+0.152
lves+0.139
 unnecessary+0.135
Neg logit contributions
omo-0.179
 size-0.153
 Mom-0.150
ums-0.146
 Me-0.144
Token upon
Token ID2402
Feature activation-1.08e-02

Pos logit contributions
 rubble+0.006
leaf+0.005
 stret+0.005
lves+0.004
sm+0.004
Neg logit contributions
omo-0.010
 size-0.009
 head-0.008
omm-0.008
ums-0.008
Token a
Token ID257
Feature activation+5.07e-03

Pos logit contributions
 rubble+0.021
 stret+0.020
lves+0.019
leaf+0.019
 unnecessary+0.017
Neg logit contributions
omo-0.015
 Mom-0.014
 size-0.013
 only-0.013
 head-0.012
Token time
Token ID640
Feature activation-2.02e-02

Pos logit contributions
omo+0.008
 size+0.007
 head+0.007
omm+0.007
ums+0.007
Neg logit contributions
 rubble-0.009
leaf-0.008
 stret-0.008
lves-0.007
sm-0.007
Token there
Token ID612
Feature activation+1.59e-03

Pos logit contributions
 rubble+0.044
leaf+0.042
 stret+0.041
sm+0.038
lves+0.038
Neg logit contributions
omo-0.024
 size-0.020
 head-0.018
omm-0.017
ums-0.017
Token That
Token ID1320
Feature activation+2.44e-02

Pos logit contributions
 rubble+0.012
leaf+0.012
 stret+0.012
sm+0.011
Hop+0.011
Neg logit contributions
omo-0.015
 size-0.015
 head-0.013
master-0.012
omm-0.012
Token day
Token ID1110
Feature activation-4.49e-02

Pos logit contributions
omo+0.039
omm+0.034
ums+0.033
 Sorry+0.032
 size+0.032
Neg logit contributions
leaf-0.038
 rubble-0.037
 outdoors-0.035
sm-0.035
 photographs-0.034
Token,
Token ID11
Feature activation+1.44e-03

Pos logit contributions
 rubble+0.062
leaf+0.062
sm+0.058
 everywhere+0.057
 stret+0.057
Neg logit contributions
omo-0.080
 size-0.073
omm-0.065
 head-0.065
master-0.064
Token she
Token ID673
Feature activation+2.54e-03

Pos logit contributions
omo+0.003
 size+0.002
omm+0.002
 head+0.002
ums+0.002
Neg logit contributions
 rubble-0.002
leaf-0.002
 stret-0.002
sm-0.002
 outdoors-0.002
Token learnt
Token ID26338
Feature activation-1.69e-02

Pos logit contributions
omo+0.004
omm+0.004
ums+0.004
 size+0.004
 Sorry+0.004
Neg logit contributions
leaf-0.004
 rubble-0.004
Hop-0.003
sm-0.003
 stret-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.026
leaf+0.024
 stret+0.024
lves+0.022
ts+0.021
Neg logit contributions
omo-0.031
 size-0.028
omm-0.026
 head-0.026
ums-0.025
TokenOnce
Token ID7454
Feature activation-2.77e-03

Pos logit contributions
 rubble+0.182
leaf+0.167
 stret+0.167
lves+0.154
 unnecessary+0.152
Neg logit contributions
omo-0.164
 Mom-0.142
 size-0.136
 Me-0.134
ums-0.132
Token upon
Token ID2402
Feature activation-6.95e-03

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
lves+0.003
Hop+0.003
Neg logit contributions
omo-0.006
 size-0.005
 Mom-0.005
ums-0.005
omm-0.005
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
 rubble+0.014
 stret+0.013
lves+0.013
leaf+0.013
 unnecessary+0.011
Neg logit contributions
omo-0.009
 Mom-0.009
 size-0.008
 only-0.008
 no-0.007
Token time
Token ID640
Feature activation-1.04e-02

Pos logit contributions
omo+0.021
 size+0.018
 head+0.016
omm+0.016
ums+0.016
Neg logit contributions
 rubble-0.024
leaf-0.023
 stret-0.022
lves-0.021
ts-0.021
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.023
lves+0.021
ts+0.021
Neg logit contributions
omo-0.011
 size-0.009
omm-0.008
ums-0.008
 Mom-0.008
Token's
Token ID338
Feature activation+4.26e-02

Pos logit contributions
 rubble+0.005
leaf+0.005
 stret+0.005
sm+0.005
Hop+0.005
Neg logit contributions
omo-0.005
 size-0.004
omm-0.004
 Sorry-0.004
ums-0.004
Token suggestion
Token ID13052
Feature activation-2.30e-02

Pos logit contributions
omo+0.062
 size+0.057
 head+0.053
 Mom+0.050
omm+0.049
Neg logit contributions
 rubble-0.081
leaf-0.080
 stret-0.076
lves-0.071
Hop-0.070
Token had
Token ID550
Feature activation-1.13e-02

Pos logit contributions
 rubble+0.046
leaf+0.044
 stret+0.043
lves+0.040
sm+0.040
Neg logit contributions
omo-0.031
 size-0.026
 head-0.024
omm-0.023
ums-0.023
Token paid
Token ID3432
Feature activation+3.18e-03

Pos logit contributions
 rubble+0.018
leaf+0.017
 stret+0.017
lves+0.015
sm+0.015
Neg logit contributions
omo-0.020
 size-0.017
 head-0.016
omm-0.016
ums-0.016
Token off
Token ID572
Feature activation+6.76e-03

Pos logit contributions
omo+0.004
 size+0.004
 head+0.004
 Mom+0.004
ums+0.004
Neg logit contributions
 rubble-0.006
leaf-0.006
 stret-0.005
lves-0.005
Hop-0.005
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
omo+0.011
 size+0.009
 head+0.008
ums+0.008
omm+0.008
Neg logit contributions
 rubble-0.012
leaf-0.011
 stret-0.011
lves-0.010
Hop-0.010
TokenOnce
Token ID7454
Feature activation-3.97e-03

Pos logit contributions
 rubble+0.183
 stret+0.170
leaf+0.169
lves+0.156
 unnecessary+0.154
Neg logit contributions
omo-0.160
 Mom-0.139
 size-0.138
 Me-0.133
ums-0.132
Token upon
Token ID2402
Feature activation-4.98e-03

Pos logit contributions
leaf+0.006
 rubble+0.006
 stret+0.006
Hop+0.005
 risks+0.005
Neg logit contributions
omo-0.007
 Mom-0.006
 size-0.006
 head-0.006
ama-0.006
Token a
Token ID257
Feature activation+1.42e-02

Pos logit contributions
 rubble+0.010
leaf+0.009
 stret+0.009
lves+0.009
assed+0.008
Neg logit contributions
omo-0.007
 Mom-0.006
 size-0.006
 only-0.006
 no-0.005
Token time
Token ID640
Feature activation-1.08e-02

Pos logit contributions
omo+0.021
 size+0.018
 head+0.016
omm+0.015
 Mom+0.015
Neg logit contributions
 rubble-0.028
leaf-0.027
 stret-0.025
ts-0.024
lves-0.024
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
 rubble+0.025
leaf+0.025
 stret+0.024
lves+0.022
ts+0.022
Neg logit contributions
omo-0.011
 size-0.009
omm-0.008
 Mom-0.008
ums-0.008
Token opened
Token ID4721
Feature activation+2.06e-03

Pos logit contributions
 rubble+0.010
leaf+0.009
 stret+0.009
sm+0.008
Hop+0.008
Neg logit contributions
omo-0.009
 size-0.008
ums-0.008
omm-0.008
 Sorry-0.008
Token the
Token ID262
Feature activation-5.70e-03

Pos logit contributions
omo+0.003
 size+0.003
 Mom+0.003
 head+0.003
ums+0.003
Neg logit contributions
 rubble-0.004
leaf-0.003
 stret-0.003
Hop-0.003
 unnecessary-0.003
Token box
Token ID3091
Feature activation-1.25e-02

Pos logit contributions
leaf+0.011
 rubble+0.011
 stret+0.011
Hop+0.010
ts+0.010
Neg logit contributions
omo-0.008
 size-0.008
 head-0.007
 Mom-0.007
omm-0.006
Token,
Token ID11
Feature activation-7.13e-03

Pos logit contributions
 rubble+0.021
leaf+0.020
 stret+0.019
sm+0.017
 outdoors+0.017
Neg logit contributions
omo-0.021
 size-0.018
omm-0.017
 Sorry-0.017
ums-0.017
Token and
Token ID290
Feature activation-2.03e-03

Pos logit contributions
 rubble+0.014
leaf+0.013
 stret+0.012
 unnecessary+0.012
lves+0.012
Neg logit contributions
omo-0.010
 size-0.009
 head-0.009
 Mom-0.009
 only-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.003
leaf+0.003
 stret+0.003
sm+0.003
lves+0.003
Neg logit contributions
omo-0.003
 size-0.003
omm-0.003
 head-0.003
ums-0.003
TokenOnce
Token ID7454
Feature activation-3.13e-03

Pos logit contributions
 rubble+0.181
 stret+0.165
leaf+0.164
lves+0.150
 unnecessary+0.149
Neg logit contributions
omo-0.166
 Mom-0.140
 size-0.139
 Me-0.136
ums-0.135
Token upon
Token ID2402
Feature activation-4.39e-03

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
lves+0.003
Hop+0.003
Neg logit contributions
omo-0.006
 size-0.006
 Mom-0.005
 head-0.005
omm-0.005
Token a
Token ID257
Feature activation+8.64e-03

Pos logit contributions
 rubble+0.009
 stret+0.008
lves+0.008
leaf+0.008
 unnecessary+0.007
Neg logit contributions
omo-0.006
 Mom-0.005
 size-0.005
 only-0.005
 head-0.005
Token time
Token ID640
Feature activation-1.68e-02

Pos logit contributions
omo+0.014
 size+0.012
 head+0.011
omm+0.011
ums+0.011
Neg logit contributions
 rubble-0.015
leaf-0.015
 stret-0.014
lves-0.013
ts-0.013
Token there
Token ID612
Feature activation+4.68e-03

Pos logit contributions
 rubble+0.038
leaf+0.037
 stret+0.036
lves+0.033
ts+0.033
Neg logit contributions
omo-0.019
 size-0.015
omm-0.013
 head-0.013
ums-0.013
Token nice
Token ID3621
Feature activation+1.17e-02

Pos logit contributions
omo+0.037
 size+0.033
ums+0.032
omm+0.031
 head+0.030
Neg logit contributions
 rubble-0.029
leaf-0.028
 stret-0.027
Hop-0.024
sm-0.024
Token rest
Token ID1334
Feature activation-6.29e-03

Pos logit contributions
omo+0.020
 size+0.018
ums+0.017
omm+0.017
 Sorry+0.017
Neg logit contributions
 rubble-0.018
leaf-0.017
 stret-0.016
 outdoors-0.015
Hop-0.015
Token!
Token ID0
Feature activation-1.23e-02

Pos logit contributions
 rubble+0.009
leaf+0.009
 outdoors+0.008
sm+0.008
 stret+0.008
Neg logit contributions
omo-0.011
omm-0.009
 size-0.009
 Sorry-0.009
ums-0.009
Tokenâ
Token ID22940
Feature activation-9.39e-03

Pos logit contributions
leaf+0.019
 rubble+0.017
 stret+0.016
Hop+0.016
sm+0.016
Neg logit contributions
omo-0.022
 size-0.021
 head-0.019
master-0.018
ums-0.018
Token
Token ID26391
Feature activation-1.98e-02

Pos logit contributions
 rubble+0.018
leaf+0.017
 stret+0.017
lves+0.016
ts+0.016
Neg logit contributions
omo-0.014
 size-0.011
 head-0.011
omm-0.010
 Sorry-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.034
leaf+0.032
 stret+0.031
sm+0.029
lves+0.029
Neg logit contributions
omo-0.032
 size-0.029
omm-0.026
 head-0.026
ums-0.025
TokenOne
Token ID3198
Feature activation+1.90e-02

Pos logit contributions
 rubble+0.168
leaf+0.152
 stret+0.151
lves+0.139
 unnecessary+0.135
Neg logit contributions
omo-0.177
 size-0.151
 Mom-0.148
ums-0.144
 Me-0.143
Token day
Token ID1110
Feature activation-3.09e-02

Pos logit contributions
omo+0.032
 size+0.029
omm+0.027
ums+0.026
 head+0.026
Neg logit contributions
 rubble-0.030
leaf-0.029
 stret-0.027
sm-0.027
 outdoors-0.026
Token,
Token ID11
Feature activation+6.71e-03

Pos logit contributions
 rubble+0.060
leaf+0.058
 stret+0.057
sm+0.054
Hop+0.053
Neg logit contributions
omo-0.041
 size-0.036
 head-0.032
omm-0.032
ums-0.031
Token John
Token ID1757
Feature activation-6.52e-03

Pos logit contributions
omo+0.011
 Mom+0.009
 size+0.009
 head+0.009
 only+0.009
Neg logit contributions
 rubble-0.012
leaf-0.012
 stret-0.011
ts-0.010
 unnecessary-0.010
Token was
Token ID373
Feature activation+7.26e-05

Pos logit contributions
 rubble+0.011
leaf+0.011
 stret+0.011
sm+0.010
Hop+0.010
Neg logit contributions
omo-0.010
 size-0.009
omm-0.008
ums-0.008
 Sorry-0.008
Token was
Token ID373
Feature activation+1.43e-02

Pos logit contributions
omo+0.022
ums+0.020
 Sorry+0.020
omm+0.020
 size+0.019
Neg logit contributions
 rubble-0.020
leaf-0.019
 stret-0.018
Hop-0.017
sm-0.017
Token happy
Token ID3772
Feature activation+1.16e-02

Pos logit contributions
omo+0.029
 size+0.026
omm+0.024
 head+0.024
ums+0.024
Neg logit contributions
 rubble-0.019
leaf-0.018
 stret-0.017
sm-0.016
lves-0.015
Token to
Token ID284
Feature activation-6.32e-03

Pos logit contributions
omo+0.022
 size+0.020
omm+0.019
ums+0.018
 head+0.018
Neg logit contributions
 rubble-0.017
leaf-0.016
 stret-0.015
sm-0.014
Hop-0.013
Token have
Token ID423
Feature activation-1.18e-02

Pos logit contributions
 rubble+0.008
leaf+0.007
 stret+0.007
 care+0.007
 outdoors+0.007
Neg logit contributions
omo-0.013
omm-0.011
ums-0.011
master-0.011
eter-0.010
Token something
Token ID1223
Feature activation-1.67e-03

Pos logit contributions
 rubble+0.020
 stret+0.019
leaf+0.019
lves+0.017
Hop+0.017
Neg logit contributions
omo-0.020
 size-0.018
 Mom-0.017
 head-0.016
 only-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.002
leaf+0.002
 stret+0.002
lves+0.002
ts+0.002
Neg logit contributions
omo-0.003
 size-0.003
 head-0.003
omm-0.003
ums-0.003
TokenJane
Token ID41083
Feature activation-2.03e-02

Pos logit contributions
 rubble+0.185
 stret+0.171
leaf+0.170
lves+0.159
 unnecessary+0.156
Neg logit contributions
omo-0.159
 Mom-0.139
 size-0.135
 Me-0.130
ums-0.128
Token went
Token ID1816
Feature activation-5.41e-02

Pos logit contributions
 rubble+0.033
leaf+0.032
 stret+0.030
sm+0.029
Hop+0.028
Neg logit contributions
omo-0.034
 size-0.030
omm-0.029
ums-0.028
 Sorry-0.028
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
 rubble+0.102
ts+0.093
leaf+0.093
 stret+0.092
lves+0.088
Neg logit contributions
omo-0.077
 size-0.069
 head-0.068
 Mom-0.067
ums-0.063
Token the
Token ID262
Feature activation-2.76e-02

Pos logit contributions
 rubble+0.034
leaf+0.031
 stret+0.031
aid+0.029
lves+0.029
Neg logit contributions
omo-0.025
 Mom-0.024
 size-0.022
 head-0.022
 Mum-0.021
Token store
Token ID3650
Feature activation-3.73e-03

Pos logit contributions
 rubble+0.043
leaf+0.041
 stret+0.039
ts+0.036
sm+0.036
Neg logit contributions
omo-0.049
 size-0.044
 head-0.041
 Mom-0.040
omm-0.040
Token
Token ID198
Feature activation-2.71e-02

Pos logit contributions
 rubble+0.043
leaf+0.038
lves+0.038
 stret+0.037
ts+0.036
Neg logit contributions
omo-0.040
 Mom-0.033
ums-0.031
ia-0.030
ama-0.030
TokenDavid
Token ID11006
Feature activation-1.26e-02

Pos logit contributions
 rubble+0.051
leaf+0.046
 stret+0.044
ts+0.042
lves+0.041
Neg logit contributions
omo-0.045
 size-0.035
ums-0.033
omm-0.032
 Mom-0.031
Token nodded
Token ID14464
Feature activation+2.08e-02

Pos logit contributions
 rubble+0.020
leaf+0.020
 stret+0.019
ts+0.018
Hop+0.018
Neg logit contributions
omo-0.020
 size-0.018
 Sorry-0.017
ums-0.017
omm-0.017
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
omo+0.038
 size+0.032
omm+0.031
 Sorry+0.031
ums+0.031
Neg logit contributions
 rubble-0.030
 stret-0.030
leaf-0.029
sm-0.027
 outdoors-0.026
Token watched
Token ID7342
Feature activation+8.14e-03

Pos logit contributions
 rubble+0.041
leaf+0.040
 stret+0.038
sm+0.035
ts+0.035
Neg logit contributions
omo-0.049
 size-0.044
omm-0.043
ums-0.041
 loser-0.039
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
omo+0.015
 size+0.013
omm+0.012
 head+0.012
ums+0.012
Neg logit contributions
 rubble-0.013
leaf-0.012
 stret-0.012
sm-0.011
ts-0.011
TokenOnce
Token ID7454
Feature activation-2.94e-03

Pos logit contributions
 rubble+0.183
 stret+0.167
leaf+0.165
lves+0.153
 unnecessary+0.150
Neg logit contributions
omo-0.162
 Mom-0.137
 size-0.135
 Me-0.133
ums-0.131
Token there
Token ID612
Feature activation+1.05e-02

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
lves+0.003
Hop+0.003
Neg logit contributions
omo-0.006
 size-0.005
 head-0.005
 Mom-0.005
ums-0.005
Token was
Token ID373
Feature activation+2.19e-02

Pos logit contributions
omo+0.013
 size+0.011
ums+0.010
omm+0.010
 head+0.010
Neg logit contributions
 rubble-0.021
leaf-0.019
 stret-0.019
sm-0.019
 outdoors-0.018
Token a
Token ID257
Feature activation+2.77e-03

Pos logit contributions
omo+0.031
 Mom+0.029
 size+0.026
 only+0.025
 Mum+0.024
Neg logit contributions
 rubble-0.042
 stret-0.041
leaf-0.039
lves-0.038
ts-0.038
Token little
Token ID1310
Feature activation-4.41e-02

Pos logit contributions
omo+0.005
 size+0.005
omm+0.004
ums+0.004
 head+0.004
Neg logit contributions
 rubble-0.004
leaf-0.004
 stret-0.004
sm-0.003
lves-0.003
Token a
Token ID257
Feature activation+1.13e-02

Pos logit contributions
 rubble+0.027
leaf+0.025
 stret+0.025
lves+0.022
sm+0.022
Neg logit contributions
omo-0.033
 size-0.029
 head-0.027
omm-0.027
ums-0.027
Token smile
Token ID8212
Feature activation-9.53e-03

Pos logit contributions
omo+0.020
 size+0.017
 head+0.016
 Mom+0.016
omm+0.015
Neg logit contributions
 rubble-0.018
leaf-0.018
 stret-0.017
ts-0.016
lves-0.016
Token on
Token ID319
Feature activation-5.75e-03

Pos logit contributions
 stret+0.012
 rubble+0.012
leaf+0.012
sm+0.011
free+0.011
Neg logit contributions
omo-0.019
omm-0.016
 size-0.016
 Sorry-0.015
ums-0.015
Token his
Token ID465
Feature activation-1.04e-02

Pos logit contributions
 rubble+0.010
leaf+0.010
 stret+0.010
sm+0.009
ts+0.009
Neg logit contributions
omo-0.009
 size-0.008
omm-0.007
ums-0.007
 head-0.007
Token face
Token ID1986
Feature activation-6.86e-03

Pos logit contributions
 rubble+0.024
leaf+0.023
lves+0.022
 stret+0.022
Hop+0.022
Neg logit contributions
omo-0.011
 size-0.009
 head-0.009
omm-0.008
 Mom-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.010
leaf+0.010
 stret+0.010
sm+0.009
 outdoors+0.009
Neg logit contributions
omo-0.012
 size-0.010
 Sorry-0.010
omm-0.010
ums-0.009
TokenOnce
Token ID7454
Feature activation+4.41e-03

Pos logit contributions
 rubble+0.189
 stret+0.177
leaf+0.175
lves+0.161
 unnecessary+0.161
Neg logit contributions
omo-0.154
 Mom-0.133
 size-0.128
 Me-0.127
ums-0.123
Token upon
Token ID2402
Feature activation-1.47e-03

Pos logit contributions
omo+0.009
 size+0.008
 Mom+0.007
 head+0.007
ums+0.007
Neg logit contributions
 rubble-0.006
leaf-0.006
 stret-0.006
lves-0.005
Hop-0.005
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
 rubble+0.003
 stret+0.003
leaf+0.003
lves+0.003
ival+0.002
Neg logit contributions
omo-0.002
 Mom-0.002
 size-0.002
 only-0.002
 head-0.002
Token time
Token ID640
Feature activation-1.26e-02

Pos logit contributions
omo+0.021
 size+0.018
 head+0.017
omm+0.016
 Mom+0.016
Neg logit contributions
 rubble-0.026
leaf-0.025
 stret-0.023
ts-0.022
lves-0.022
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.027
lves+0.025
ts+0.025
Neg logit contributions
omo-0.014
 size-0.012
omm-0.010
 head-0.010
ums-0.010
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
 rubble+0.017
leaf+0.016
 stret+0.016
sm+0.014
lves+0.014
Neg logit contributions
omo-0.014
 size-0.013
 head-0.011
omm-0.011
ums-0.011
Token baby
Token ID5156
Feature activation-2.31e-02

Pos logit contributions
 rubble+0.031
leaf+0.030
 stret+0.029
lves+0.027
sm+0.027
Neg logit contributions
omo-0.021
 size-0.018
 head-0.016
omm-0.016
ums-0.015
Token walk
Token ID2513
Feature activation-4.61e-02

Pos logit contributions
 rubble+0.039
leaf+0.039
 stret+0.037
sm+0.035
Hop+0.034
Neg logit contributions
omo-0.037
 size-0.032
omm-0.030
ums-0.029
 head-0.029
Token away
Token ID1497
Feature activation+1.11e-02

Pos logit contributions
 rubble+0.068
leaf+0.061
 stret+0.058
ts+0.055
lves+0.054
Neg logit contributions
omo-0.086
 size-0.079
 head-0.075
 Mom-0.074
ums-0.072
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
omo+0.018
 size+0.015
omm+0.015
ums+0.014
 Sorry+0.014
Neg logit contributions
 rubble-0.019
leaf-0.018
 stret-0.018
sm-0.017
 outdoors-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.026
sm+0.024
lves+0.024
Neg logit contributions
omo-0.025
 size-0.022
omm-0.020
ums-0.020
 head-0.019
TokenOnce
Token ID7454
Feature activation-1.42e-02

Pos logit contributions
 rubble+0.177
leaf+0.161
 stret+0.158
lves+0.144
ts+0.142
Neg logit contributions
omo-0.166
 Mom-0.140
 size-0.139
ums-0.138
 Me-0.136
Token there
Token ID612
Feature activation+7.50e-03

Pos logit contributions
 rubble+0.022
leaf+0.022
 stret+0.020
ts+0.018
 risks+0.018
Neg logit contributions
omo-0.028
 Mom-0.023
 size-0.023
omm-0.022
ama-0.022
Token was
Token ID373
Feature activation+1.81e-02

Pos logit contributions
omo+0.010
 size+0.008
omm+0.007
ums+0.007
 head+0.007
Neg logit contributions
 rubble-0.015
leaf-0.014
 stret-0.014
sm-0.013
 outdoors-0.013
Token a
Token ID257
Feature activation+7.11e-03

Pos logit contributions
omo+0.028
 Mom+0.025
 size+0.024
 only+0.023
ums+0.021
Neg logit contributions
 rubble-0.033
 stret-0.032
leaf-0.031
ts-0.030
lves-0.030
Token famous
Token ID5863
Feature activation-9.02e-03

Pos logit contributions
omo+0.014
 size+0.012
ums+0.012
omm+0.012
 head+0.011
Neg logit contributions
 rubble-0.010
leaf-0.009
 stret-0.009
sm-0.008
lves-0.008
Token He
Token ID679
Feature activation+1.32e-02

Pos logit contributions
omo+0.035
 size+0.033
 head+0.029
ums+0.028
omm+0.027
Neg logit contributions
 rubble-0.042
leaf-0.039
 stret-0.038
sm-0.036
Hop-0.036
Token pressed
Token ID12070
Feature activation+2.55e-02

Pos logit contributions
omo+0.024
omm+0.023
 Sorry+0.022
ums+0.022
ama+0.022
Neg logit contributions
leaf-0.018
 rubble-0.018
ts-0.017
sm-0.016
aid-0.015
Token it
Token ID340
Feature activation-7.34e-03

Pos logit contributions
omo+0.051
 size+0.045
omm+0.043
 head+0.042
ums+0.042
Neg logit contributions
 rubble-0.035
leaf-0.032
 stret-0.031
sm-0.028
ts-0.028
Token on
Token ID319
Feature activation+1.31e-02

Pos logit contributions
 rubble+0.010
 everywhere+0.009
leaf+0.009
 stret+0.008
 aside+0.008
Neg logit contributions
omo-0.016
 size-0.012
omm-0.012
ums-0.012
 Sorry-0.012
Token the
Token ID262
Feature activation-4.31e-03

Pos logit contributions
omo+0.026
 size+0.022
 Sorry+0.022
ums+0.022
omm+0.022
Neg logit contributions
 rubble-0.018
 stret-0.016
leaf-0.015
ts-0.015
sm-0.014
Token cardboard
Token ID26819
Feature activation+4.61e-02

Pos logit contributions
leaf+0.009
 rubble+0.008
 stret+0.008
sm+0.008
lves+0.008
Neg logit contributions
 size-0.005
omo-0.005
 head-0.005
 Mom-0.004
omm-0.004
Token and
Token ID290
Feature activation-4.91e-03

Pos logit contributions
omo+0.089
omm+0.071
 Sorry+0.070
ums+0.069
 size+0.069
Neg logit contributions
 rubble-0.072
 stret-0.063
leaf-0.059
 everywhere-0.058
ts-0.057
Token made
Token ID925
Feature activation+2.51e-02

Pos logit contributions
 rubble+0.009
leaf+0.008
ts+0.008
 stret+0.008
sm+0.007
Neg logit contributions
omo-0.008
omm-0.007
ums-0.007
 size-0.007
ama-0.006
Token a
Token ID257
Feature activation+2.25e-02

Pos logit contributions
omo+0.039
 size+0.036
 Mom+0.035
 head+0.033
ums+0.031
Neg logit contributions
 rubble-0.043
leaf-0.042
 stret-0.041
lves-0.039
Hop-0.038
Token big
Token ID1263
Feature activation+1.59e-02

Pos logit contributions
omo+0.038
 size+0.032
omm+0.031
ums+0.030
 head+0.029
Neg logit contributions
 rubble-0.038
leaf-0.035
 stret-0.035
lves-0.031
sm-0.031
Token print
Token ID3601
Feature activation+2.20e-02

Pos logit contributions
omo+0.024
 size+0.020
omm+0.020
ums+0.019
 Sorry+0.019
Neg logit contributions
 rubble-0.029
leaf-0.026
 stret-0.026
sm-0.024
lves-0.024
TokenAndy
Token ID35314
Feature activation+3.75e-03

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.011
ts+0.011
lves+0.011
Neg logit contributions
omo-0.011
 size-0.008
omm-0.008
 Mom-0.008
ums-0.008
Token said
Token ID531
Feature activation-2.55e-02

Pos logit contributions
omo+0.006
 size+0.005
 Sorry+0.005
ums+0.005
omm+0.005
Neg logit contributions
 rubble-0.006
leaf-0.006
 stret-0.006
sm-0.005
Hop-0.005
Token,
Token ID11
Feature activation-2.20e-02

Pos logit contributions
 rubble+0.052
leaf+0.050
 stret+0.049
sm+0.046
Hop+0.045
Neg logit contributions
omo-0.033
 size-0.028
ums-0.025
omm-0.024
 head-0.024
Token "
Token ID366
Feature activation-9.57e-03

Pos logit contributions
 rubble+0.048
leaf+0.046
 stret+0.044
lves+0.042
sm+0.042
Neg logit contributions
omo-0.026
 size-0.022
 head-0.020
omm-0.020
 Mom-0.019
TokenLet
Token ID5756
Feature activation-2.00e-02

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.012
sm+0.010
lves+0.010
Neg logit contributions
omo-0.019
 size-0.017
 head-0.016
omm-0.016
ums-0.016
Token's
Token ID338
Feature activation+4.65e-02

Pos logit contributions
 rubble+0.040
leaf+0.038
 stret+0.038
lves+0.035
sm+0.035
Neg logit contributions
omo-0.027
 size-0.023
ums-0.021
omm-0.021
 head-0.020
Token look
Token ID804
Feature activation-1.06e-02

Pos logit contributions
omo+0.087
ums+0.073
omm+0.073
 size+0.069
mm+0.069
Neg logit contributions
 rubble-0.073
 stret-0.063
 outdoors-0.060
leaf-0.060
 everywhere-0.058
Token and
Token ID290
Feature activation-5.04e-03

Pos logit contributions
leaf+0.019
 rubble+0.018
 stret+0.018
lves+0.017
handed+0.017
Neg logit contributions
omo-0.015
 Mom-0.015
 size-0.014
 mom-0.013
 head-0.013
Token see
Token ID766
Feature activation+1.80e-02

Pos logit contributions
 rubble+0.009
leaf+0.009
 stret+0.009
sm+0.008
lves+0.008
Neg logit contributions
omo-0.007
 size-0.006
 head-0.006
 Mom-0.006
 Sorry-0.006
Token if
Token ID611
Feature activation+2.86e-02

Pos logit contributions
omo+0.035
 size+0.033
 Mom+0.032
 head+0.031
 Sorry+0.029
Neg logit contributions
 rubble-0.024
leaf-0.024
 stret-0.022
Hop-0.020
lves-0.020
Token we
Token ID356
Feature activation-3.49e-02

Pos logit contributions
omo+0.045
 size+0.038
ums+0.037
omm+0.035
 head+0.034
Neg logit contributions
 rubble-0.052
leaf-0.048
 stret-0.046
ts-0.044
sm-0.044
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
omo+0.038
 Sorry+0.032
 size+0.032
omm+0.032
ums+0.030
Neg logit contributions
 rubble-0.035
leaf-0.032
 stret-0.032
sm-0.030
ts-0.029
Token stronger
Token ID7387
Feature activation+1.59e-02

Pos logit contributions
 rubble+0.026
leaf+0.025
 stret+0.023
Hop+0.023
ts+0.022
Neg logit contributions
omo-0.037
ums-0.033
 size-0.032
omm-0.032
 Sorry-0.030
Token each
Token ID1123
Feature activation-2.69e-03

Pos logit contributions
omo+0.025
 size+0.021
omm+0.020
ums+0.020
 Sorry+0.019
Neg logit contributions
 rubble-0.028
leaf-0.027
 stret-0.026
sm-0.024
Hop-0.024
Token day
Token ID1110
Feature activation-3.74e-02

Pos logit contributions
 rubble+0.005
leaf+0.004
 stret+0.004
sm+0.004
lves+0.004
Neg logit contributions
omo-0.004
 size-0.004
omm-0.004
ums-0.004
 Sorry-0.003
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
 rubble+0.069
leaf+0.067
 stret+0.064
lves+0.060
Hop+0.059
Neg logit contributions
omo-0.056
 size-0.049
 head-0.045
ums-0.044
omm-0.044
Token One
Token ID1881
Feature activation+4.57e-02

Pos logit contributions
omo+0.026
 size+0.025
 head+0.022
ums+0.021
 Sorry+0.020
Neg logit contributions
 rubble-0.025
leaf-0.023
 stret-0.023
sm-0.021
lves-0.021
Token day
Token ID1110
Feature activation-8.41e-03

Pos logit contributions
omo+0.088
 Sorry+0.080
omm+0.079
ums+0.079
 size+0.078
Neg logit contributions
sm-0.056
 rubble-0.055
leaf-0.055
 outdoors-0.052
 nearby-0.050
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
 rubble+0.018
leaf+0.017
 stret+0.017
sm+0.016
lves+0.016
Neg logit contributions
omo-0.010
 size-0.009
 head-0.008
omm-0.008
ums-0.008
Token Lily
Token ID20037
Feature activation+2.85e-02

Pos logit contributions
omo+0.043
 size+0.037
 head+0.034
omm+0.034
ums+0.034
Neg logit contributions
 rubble-0.050
leaf-0.048
 stret-0.047
sm-0.043
lves-0.043
Token's
Token ID338
Feature activation+6.53e-02

Pos logit contributions
omo+0.050
 Sorry+0.047
ums+0.046
mm+0.046
omm+0.046
Neg logit contributions
 rubble-0.038
 stret-0.036
Hop-0.035
lves-0.034
 nearby-0.034
Token friend
Token ID1545
Feature activation-6.09e-03

Pos logit contributions
omo+0.068
 size+0.055
 Sorry+0.051
ums+0.049
omm+0.048
Neg logit contributions
 rubble-0.144
 stret-0.141
leaf-0.138
 groceries-0.131
 outdoors-0.130
Token taking
Token ID2263
Feature activation-7.84e-03

Pos logit contributions
omo+0.020
 size+0.018
 head+0.016
ums+0.015
omm+0.015
Neg logit contributions
 rubble-0.024
leaf-0.023
 stret-0.022
lves-0.021
sm-0.021
Token my
Token ID616
Feature activation-1.16e-02

Pos logit contributions
 rubble+0.015
leaf+0.014
 stret+0.013
lves+0.013
Hop+0.012
Neg logit contributions
omo-0.010
 size-0.010
 Mom-0.010
 head-0.009
 mom-0.008
Token lumber
Token ID34840
Feature activation+2.13e-02

Pos logit contributions
 rubble+0.020
leaf+0.019
lves+0.018
 stret+0.018
sm+0.017
Neg logit contributions
omo-0.018
 size-0.017
 head-0.016
 Mom-0.015
 throat-0.015
Token!
Token ID0
Feature activation+7.21e-03

Pos logit contributions
omo+0.034
 size+0.029
omm+0.027
ums+0.027
 Sorry+0.026
Neg logit contributions
 rubble-0.038
 stret-0.035
leaf-0.035
sm-0.032
lves-0.032
Token That
Token ID1320
Feature activation+3.07e-02

Pos logit contributions
 size+0.013
omo+0.012
 head+0.011
master+0.010
ums+0.010
Neg logit contributions
leaf-0.011
 rubble-0.010
 stret-0.010
sm-0.010
free-0.009
Token's
Token ID338
Feature activation+4.59e-02

Pos logit contributions
omo+0.047
ums+0.043
omm+0.038
mm+0.038
 size+0.037
Neg logit contributions
 rubble-0.057
 stret-0.050
 nearby-0.047
leaf-0.047
lves-0.047
Token for
Token ID329
Feature activation-1.12e-02

Pos logit contributions
omo+0.067
 size+0.057
ums+0.054
omm+0.052
 head+0.051
Neg logit contributions
 rubble-0.088
leaf-0.082
 stret-0.082
ts-0.076
lves-0.075
Token my
Token ID616
Feature activation+1.23e-02

Pos logit contributions
leaf+0.020
 rubble+0.020
 stret+0.019
lves+0.018
Hop+0.018
Neg logit contributions
omo-0.017
 size-0.015
 Mom-0.015
 head-0.014
 Sorry-0.013
Token new
Token ID649
Feature activation+1.43e-02

Pos logit contributions
omo+0.018
 size+0.016
 head+0.015
omm+0.014
ums+0.014
Neg logit contributions
 rubble-0.023
leaf-0.022
 stret-0.021
lves-0.019
sm-0.019
Token house
Token ID2156
Feature activation+8.91e-03

Pos logit contributions
omo+0.025
 size+0.021
omm+0.020
ums+0.020
 head+0.020
Neg logit contributions
 rubble-0.023
leaf-0.022
 stret-0.021
sm-0.019
lves-0.019
Token!"
Token ID2474
Feature activation+7.51e-03

Pos logit contributions
omo+0.014
 size+0.012
omm+0.011
ums+0.011
 head+0.011
Neg logit contributions
 rubble-0.016
leaf-0.015
 stret-0.015
sm-0.014
ts-0.014
Token
Token ID220
Feature activation+7.65e-03

Pos logit contributions
omo+0.024
 size+0.021
ums+0.019
 head+0.019
omm+0.019
Neg logit contributions
 rubble-0.024
leaf-0.023
 stret-0.023
sm-0.021
lves-0.020
Token
Token ID198
Feature activation+7.16e-03

Pos logit contributions
omo+0.014
 size+0.013
 head+0.012
 Sorry+0.011
omm+0.011
Neg logit contributions
 rubble-0.011
leaf-0.010
 stret-0.010
sm-0.010
lves-0.009
Token
Token ID198
Feature activation-1.66e-03

Pos logit contributions
omo+0.012
 size+0.011
 head+0.010
omm+0.010
ums+0.010
Neg logit contributions
 rubble-0.012
leaf-0.011
 stret-0.011
sm-0.010
lves-0.010
TokenThe
Token ID464
Feature activation+1.78e-02

Pos logit contributions
 rubble+0.003
leaf+0.003
 stret+0.003
Hop+0.002
sm+0.002
Neg logit contributions
omo-0.003
 size-0.003
 head-0.002
ums-0.002
omm-0.002
Token baby
Token ID5156
Feature activation+1.42e-02

Pos logit contributions
omo+0.027
 size+0.023
ums+0.022
omm+0.021
 head+0.021
Neg logit contributions
 rubble-0.033
leaf-0.031
 stret-0.030
sm-0.028
 groceries-0.027
Token's
Token ID338
Feature activation+5.43e-02

Pos logit contributions
omo+0.023
 size+0.021
ums+0.020
 Sorry+0.020
omm+0.019
Neg logit contributions
 rubble-0.022
leaf-0.021
 stret-0.021
Hop-0.020
sm-0.020
Token face
Token ID1986
Feature activation+4.51e-03

Pos logit contributions
omo+0.053
 size+0.043
mm+0.042
 Sorry+0.040
ums+0.039
Neg logit contributions
 rubble-0.119
 stret-0.114
leaf-0.114
 groceries-0.105
 goodies-0.104
Token lit
Token ID6578
Feature activation-7.91e-03

Pos logit contributions
omo+0.008
 seeker+0.008
omm+0.007
ums+0.007
 loser+0.007
Neg logit contributions
 rubble-0.006
 stret-0.006
sm-0.005
leaf-0.005
 nearby-0.005
Token up
Token ID510
Feature activation+1.13e-02

Pos logit contributions
 rubble+0.011
leaf+0.010
 stret+0.010
sm+0.009
lves+0.009
Neg logit contributions
omo-0.015
 size-0.014
 head-0.013
omm-0.013
ums-0.013
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
omo+0.019
 size+0.016
 Sorry+0.016
omm+0.016
ums+0.015
Neg logit contributions
 rubble-0.018
leaf-0.017
 stret-0.017
sm-0.016
 outdoors-0.016
Token she
Token ID673
Feature activation-2.79e-03

Pos logit contributions
 rubble+0.025
leaf+0.023
 stret+0.023
sm+0.021
lves+0.021
Neg logit contributions
omo-0.032
 size-0.028
ums-0.028
omm-0.027
 loser-0.025
Token could
Token ID714
Feature activation+2.55e-02

Pos logit contributions
 rubble+0.008
leaf+0.007
Hop+0.007
 stret+0.006
ts+0.006
Neg logit contributions
omo-0.008
 size-0.007
ums-0.007
omm-0.006
 Sorry-0.006
Token not
Token ID407
Feature activation+1.41e-02

Pos logit contributions
omo+0.038
 size+0.032
ums+0.030
omm+0.030
eter+0.030
Neg logit contributions
 rubble-0.049
 stret-0.044
leaf-0.044
Hop-0.042
 outdoors-0.041
Token pass
Token ID1208
Feature activation+1.49e-02

Pos logit contributions
omo+0.022
 size+0.019
omm+0.017
ums+0.017
 head+0.017
Neg logit contributions
 rubble-0.026
leaf-0.025
 stret-0.024
sm-0.022
Hop-0.022
Token the
Token ID262
Feature activation-1.90e-02

Pos logit contributions
omo+0.025
 size+0.022
 head+0.020
omm+0.020
ums+0.020
Neg logit contributions
 rubble-0.025
leaf-0.024
 stret-0.023
lves-0.021
sm-0.021
Token test
Token ID1332
Feature activation+1.95e-03

Pos logit contributions
 rubble+0.032
leaf+0.030
 stret+0.029
sm+0.027
ts+0.027
Neg logit contributions
omo-0.032
 size-0.029
 head-0.027
omm-0.026
ums-0.025
Token that
Token ID326
Feature activation+3.35e-03

Pos logit contributions
omo+0.003
 size+0.003
omm+0.003
ums+0.003
 head+0.003
Neg logit contributions
 rubble-0.003
leaf-0.003
 stret-0.003
sm-0.003
ts-0.003
Token his
Token ID465
Feature activation+3.03e-02

Pos logit contributions
omo+0.006
omm+0.005
ums+0.005
 size+0.005
ama+0.005
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.004
 outdoors-0.004
sm-0.004
Token teacher
Token ID4701
Feature activation+4.61e-04

Pos logit contributions
omo+0.036
 size+0.026
omm+0.026
mm+0.025
 Sorry+0.025
Neg logit contributions
 rubble-0.065
leaf-0.061
 stret-0.060
 outdoors-0.058
 groceries-0.057
Token had
Token ID550
Feature activation-7.91e-03

Pos logit contributions
omo+0.001
 size+0.001
omm+0.001
ums+0.001
 Sorry+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
sm-0.001
ts-0.001
Token given
Token ID1813
Feature activation-2.74e-02

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.012
lves+0.011
Hop+0.010
Neg logit contributions
omo-0.014
 size-0.012
 head-0.011
omm-0.011
ums-0.011
Token him
Token ID683
Feature activation-2.03e-02

Pos logit contributions
 rubble+0.048
leaf+0.046
 stret+0.045
lves+0.042
Hop+0.040
Neg logit contributions
 size-0.041
omo-0.040
 Mom-0.039
 head-0.037
 only-0.036
TokenAnna
Token ID31160
Feature activation+1.48e-02

Pos logit contributions
 rubble+0.000
leaf+0.000
 stret+0.000
sm+0.000
lves+0.000
Neg logit contributions
omo-0.000
 size-0.000
 head-0.000
omm-0.000
ums-0.000
Token felt
Token ID2936
Feature activation-1.14e-02

Pos logit contributions
omo+0.027
 size+0.024
 Sorry+0.024
ums+0.024
omm+0.023
Neg logit contributions
 rubble-0.020
leaf-0.020
 stret-0.019
Hop-0.018
ts-0.018
Token surprised
Token ID6655
Feature activation+1.88e-02

Pos logit contributions
 rubble+0.020
leaf+0.019
 stret+0.018
lves+0.017
ival+0.017
Neg logit contributions
omo-0.017
 Mom-0.016
 size-0.015
 head-0.015
 only-0.015
Token and
Token ID290
Feature activation-3.03e-02

Pos logit contributions
omo+0.028
 size+0.025
 head+0.022
omm+0.022
ums+0.022
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.033
lves-0.030
sm-0.030
Token scared
Token ID12008
Feature activation+4.25e-03

Pos logit contributions
 rubble+0.048
 stret+0.046
leaf+0.046
lves+0.041
sm+0.040
Neg logit contributions
omo-0.049
 head-0.048
 size-0.046
 Mom-0.044
 Sorry-0.043
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
omo+0.006
 size+0.005
 head+0.005
ums+0.005
omm+0.005
Neg logit contributions
 rubble-0.008
leaf-0.008
 stret-0.008
lves-0.007
Hop-0.007
Token She
Token ID1375
Feature activation+9.27e-03

Pos logit contributions
omo+0.017
 size+0.014
ums+0.012
omm+0.012
 head+0.012
Neg logit contributions
 rubble-0.020
leaf-0.019
 stret-0.018
Hop-0.018
 outdoors-0.017
Token did
Token ID750
Feature activation+4.08e-02

Pos logit contributions
omo+0.015
 size+0.013
ums+0.013
omm+0.013
 Sorry+0.012
Neg logit contributions
 rubble-0.016
leaf-0.015
 stret-0.015
Hop-0.014
ts-0.014
Token not
Token ID407
Feature activation+1.30e-03

Pos logit contributions
omo+0.023
 size+0.017
 head+0.013
 Mom+0.011
omm+0.009
Neg logit contributions
 rubble-0.113
leaf-0.110
 stret-0.108
lves-0.103
Hop-0.102
Token expect
Token ID1607
Feature activation-1.39e-02

Pos logit contributions
omo+0.002
 size+0.002
 head+0.001
 Mom+0.001
 only+0.001
Neg logit contributions
 rubble-0.003
leaf-0.002
 stret-0.002
lves-0.002
ival-0.002
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.026
lves+0.025
sm+0.024
Neg logit contributions
omo-0.018
 size-0.016
 head-0.014
omm-0.014
 Mom-0.014
Token made
Token ID925
Feature activation+1.73e-02

Pos logit contributions
 rubble+0.013
leaf+0.012
 stret+0.012
lves+0.011
Hop+0.011
Neg logit contributions
omo-0.012
 size-0.011
 head-0.010
 Mom-0.010
ums-0.009
Token a
Token ID257
Feature activation+2.02e-02

Pos logit contributions
 size+0.023
omo+0.023
 Mom+0.023
 mom+0.020
 only+0.020
Neg logit contributions
 rubble-0.032
leaf-0.032
 stret-0.031
lves-0.031
Roger-0.030
Token bigger
Token ID5749
Feature activation+4.55e-02

Pos logit contributions
omo+0.034
 size+0.030
 head+0.027
omm+0.027
ums+0.027
Neg logit contributions
 rubble-0.034
leaf-0.032
 stret-0.031
sm-0.028
lves-0.028
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
omo+0.064
omm+0.051
 size+0.050
 Sorry+0.049
ums+0.048
Neg logit contributions
 rubble-0.089
 stret-0.081
leaf-0.079
ts-0.074
 outdoors-0.073
Token better
Token ID1365
Feature activation+3.40e-02

Pos logit contributions
 rubble+0.023
leaf+0.022
 stret+0.021
sm+0.019
ts+0.019
Neg logit contributions
omo-0.026
 size-0.023
omm-0.021
ums-0.021
 head-0.021
Token tower
Token ID10580
Feature activation+1.90e-02

Pos logit contributions
omo+0.054
 size+0.047
 head+0.043
ums+0.042
omm+0.042
Neg logit contributions
 rubble-0.060
leaf-0.058
 stret-0.056
lves-0.052
sm-0.052
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
omo+0.032
 size+0.027
omm+0.026
ums+0.025
 Sorry+0.025
Neg logit contributions
 rubble-0.032
leaf-0.029
 stret-0.029
ts-0.027
sm-0.027
Token They
Token ID1119
Feature activation+7.08e-03

Pos logit contributions
omo+0.033
 size+0.030
 head+0.027
omm+0.027
ums+0.026
Neg logit contributions
 rubble-0.041
leaf-0.038
 stret-0.036
sm-0.035
Hop-0.034
Token had
Token ID550
Feature activation+4.68e-03

Pos logit contributions
omo+0.013
omm+0.011
 size+0.011
ums+0.011
 Sorry+0.010
Neg logit contributions
 rubble-0.011
leaf-0.010
 stret-0.010
ts-0.010
sm-0.009
Token a
Token ID257
Feature activation+2.27e-02

Pos logit contributions
omo+0.008
 size+0.008
 only+0.007
 Mom+0.007
 head+0.007
Neg logit contributions
leaf-0.007
 rubble-0.007
 stret-0.007
lves-0.007
Hop-0.007
Token nice
Token ID3621
Feature activation+8.01e-03

Pos logit contributions
omo+0.037
 size+0.033
 head+0.029
 Mom+0.029
 only+0.028
Neg logit contributions
leaf-0.041
 rubble-0.040
 stret-0.036
ts-0.036
lves-0.035
Token She
Token ID1375
Feature activation-1.21e-02

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.022
 outdoors+0.021
Hop+0.021
Neg logit contributions
omo-0.021
 size-0.018
 head-0.016
ums-0.015
 throat-0.015
Token said
Token ID531
Feature activation-3.76e-02

Pos logit contributions
 rubble+0.018
leaf+0.018
 stret+0.017
Hop+0.016
lves+0.016
Neg logit contributions
omo-0.021
 size-0.019
ums-0.019
omm-0.018
 Sorry-0.017
Token,
Token ID11
Feature activation-2.70e-02

Pos logit contributions
 rubble+0.072
leaf+0.070
 stret+0.069
sm+0.064
Hop+0.064
Neg logit contributions
omo-0.053
 size-0.045
 head-0.039
omm-0.039
ums-0.039
Token "
Token ID366
Feature activation-1.63e-02

Pos logit contributions
 rubble+0.052
leaf+0.050
 stret+0.049
sm+0.045
Hop+0.045
Neg logit contributions
omo-0.038
 size-0.033
ums-0.029
 head-0.029
omm-0.029
TokenThat
Token ID2504
Feature activation+2.53e-02

Pos logit contributions
leaf+0.025
 rubble+0.025
 stret+0.023
Hop+0.023
lves+0.022
Neg logit contributions
omo-0.028
 size-0.026
 head-0.024
omm-0.024
ums-0.023
Token's
Token ID338
Feature activation+4.21e-02

Pos logit contributions
omo+0.040
ums+0.034
omm+0.034
 Sorry+0.033
 size+0.033
Neg logit contributions
leaf-0.041
 rubble-0.040
 stret-0.038
 outdoors-0.037
 risks-0.036
Token not
Token ID407
Feature activation+1.28e-02

Pos logit contributions
omo+0.057
 size+0.049
ums+0.045
 head+0.044
omm+0.043
Neg logit contributions
 rubble-0.083
leaf-0.078
 stret-0.077
ts-0.072
lves-0.072
Token fair
Token ID3148
Feature activation+1.16e-02

Pos logit contributions
omo+0.022
 size+0.020
 head+0.018
 Mom+0.018
 Sorry+0.017
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
Hop-0.018
lves-0.018
Token!
Token ID0
Feature activation-3.60e-03

Pos logit contributions
omo+0.019
omm+0.016
 size+0.016
 Sorry+0.015
 head+0.015
Neg logit contributions
 rubble-0.018
leaf-0.018
 stret-0.017
sm-0.016
 outdoors-0.016
Token I
Token ID314
Feature activation-3.21e-02

Pos logit contributions
leaf+0.005
free+0.004
 aback+0.004
Hop+0.004
sm+0.004
Neg logit contributions
 size-0.007
omo-0.006
 head-0.006
master-0.005
 ending-0.005
Token want
Token ID765
Feature activation-5.61e-03

Pos logit contributions
 rubble+0.054
leaf+0.054
 stret+0.051
sm+0.049
lves+0.048
Neg logit contributions
omo-0.050
 size-0.045
omm-0.043
ums-0.041
 head-0.040
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
omo+0.033
 Sorry+0.028
omm+0.028
 size+0.028
mm+0.028
Neg logit contributions
 rubble-0.019
 outside-0.018
leaf-0.017
 stret-0.017
 everywhere-0.017
Token Mia
Token ID32189
Feature activation-1.73e-03

Pos logit contributions
leaf+0.029
 rubble+0.028
ts+0.025
sm+0.024
 newspapers+0.024
Neg logit contributions
omo-0.025
 size-0.023
ums-0.022
omm-0.022
 head-0.021
Token has
Token ID468
Feature activation-8.53e-04

Pos logit contributions
 rubble+0.003
leaf+0.003
 stret+0.003
ts+0.003
sm+0.003
Neg logit contributions
omo-0.002
 size-0.002
omm-0.002
ums-0.002
 Sorry-0.002
Token a
Token ID257
Feature activation+8.16e-03

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
lves+0.001
Hop+0.001
Neg logit contributions
omo-0.002
 size-0.001
 head-0.001
 Mom-0.001
ums-0.001
Token doll
Token ID3654
Feature activation+1.91e-02

Pos logit contributions
omo+0.013
 size+0.012
ums+0.011
omm+0.011
 head+0.011
Neg logit contributions
 rubble-0.014
 stret-0.013
leaf-0.013
sm-0.011
lves-0.011
Token.
Token ID13
Feature activation+3.83e-04

Pos logit contributions
omo+0.035
 size+0.030
omm+0.030
 Sorry+0.029
ums+0.028
Neg logit contributions
 stret-0.028
 rubble-0.027
leaf-0.026
sm-0.024
 outdoors-0.024
Token Tom
Token ID4186
Feature activation+1.04e-02

Pos logit contributions
 size+0.001
omo+0.001
 head+0.001
ums+0.000
 throat+0.000
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
 benches-0.001
 bushes-0.001
Token likes
Token ID7832
Feature activation+1.33e-02

Pos logit contributions
omo+0.014
 size+0.012
ums+0.012
omm+0.012
 Sorry+0.011
Neg logit contributions
 rubble-0.020
leaf-0.019
 stret-0.019
ts-0.018
Hop-0.018
Token to
Token ID284
Feature activation+1.63e-02

Pos logit contributions
omo+0.023
 size+0.021
 head+0.019
omm+0.019
ums+0.019
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
lves-0.018
sm-0.018
Token make
Token ID787
Feature activation+3.31e-02

Pos logit contributions
omo+0.028
omm+0.026
ums+0.025
 size+0.025
ama+0.024
Neg logit contributions
 rubble-0.026
 stret-0.026
leaf-0.024
ts-0.023
aid-0.021
Token loud
Token ID7812
Feature activation+1.85e-02

Pos logit contributions
omo+0.053
 size+0.047
ums+0.044
 head+0.043
 Mom+0.043
Neg logit contributions
 rubble-0.058
leaf-0.055
 stret-0.052
Hop-0.050
lves-0.049
Token the
Token ID262
Feature activation-9.77e-03

Pos logit contributions
 rubble+0.008
leaf+0.008
 stret+0.007
lves+0.007
Hop+0.007
Neg logit contributions
omo-0.008
 size-0.007
 Mom-0.007
 head-0.006
omm-0.006
Token door
Token ID3420
Feature activation-9.44e-03

Pos logit contributions
 rubble+0.019
leaf+0.018
 stret+0.018
lves+0.017
Hop+0.017
Neg logit contributions
omo-0.014
 size-0.012
 head-0.011
omm-0.011
ums-0.010
Token.
Token ID13
Feature activation+5.16e-03

Pos logit contributions
 rubble+0.014
leaf+0.013
 stret+0.012
sm+0.012
 nearby+0.012
Neg logit contributions
omo-0.018
 size-0.015
 Sorry-0.015
omm-0.015
ums-0.014
Token She
Token ID1375
Feature activation-6.51e-03

Pos logit contributions
omo+0.007
 size+0.006
 head+0.005
ums+0.005
omm+0.005
Neg logit contributions
 rubble-0.010
leaf-0.010
 stret-0.009
sm-0.009
lves-0.009
Token sees
Token ID7224
Feature activation-1.48e-02

Pos logit contributions
 rubble+0.011
leaf+0.010
 stret+0.010
sm+0.009
ts+0.009
Neg logit contributions
omo-0.011
 size-0.010
ums-0.009
omm-0.009
 Sorry-0.009
Token them
Token ID606
Feature activation-9.83e-03

Pos logit contributions
leaf+0.031
 stret+0.029
 rubble+0.029
Hop+0.029
wn+0.028
Neg logit contributions
 Mom-0.019
omo-0.017
 mom-0.016
 size-0.015
 Mum-0.015
Token crying
Token ID13774
Feature activation+7.60e-03

Pos logit contributions
 rubble+0.019
leaf+0.018
 stret+0.017
lves+0.016
Hop+0.016
Neg logit contributions
omo-0.014
 size-0.013
 head-0.012
ums-0.011
 Mom-0.011
Token.
Token ID13
Feature activation+4.16e-03

Pos logit contributions
omo+0.015
omm+0.012
 size+0.012
ums+0.012
 Sorry+0.012
Neg logit contributions
 rubble-0.011
 stret-0.010
sm-0.010
leaf-0.010
ts-0.010
Token
Token ID198
Feature activation-3.83e-03

Pos logit contributions
omo+0.006
 size+0.006
 head+0.005
omm+0.005
ums+0.005
Neg logit contributions
 rubble-0.008
leaf-0.007
 stret-0.007
sm-0.007
Hop-0.007
Token
Token ID198
Feature activation-5.11e-03

Pos logit contributions
 rubble+0.008
leaf+0.008
 stret+0.008
lves+0.007
 unnecessary+0.007
Neg logit contributions
omo-0.005
 Mom-0.004
 size-0.004
ums-0.003
omm-0.003
Token"
Token ID1
Feature activation-2.13e-02

Pos logit contributions
 rubble+0.012
leaf+0.012
 stret+0.011
 unnecessary+0.011
lves+0.011
Neg logit contributions
omo-0.005
 Mom-0.004
 Sorry-0.004
 size-0.004
 head-0.004
TokenSo
Token ID2396
Feature activation-2.88e-03

Pos logit contributions
omo+0.000
 size+0.000
 head+0.000
omm+0.000
ums+0.000
Neg logit contributions
 rubble-0.000
leaf-0.000
 stret-0.000
sm-0.000
Hop-0.000
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
 everywhere+0.003
 shopping+0.003
free+0.003
 rubble+0.003
 aback+0.003
Neg logit contributions
omo-0.006
 size-0.005
 goalie-0.004
omm-0.004
arter-0.004
Token she
Token ID673
Feature activation+1.52e-03

Pos logit contributions
omo+0.023
 size+0.020
omm+0.019
ums+0.018
 head+0.018
Neg logit contributions
 rubble-0.016
 stret-0.015
leaf-0.015
sm-0.014
 everywhere-0.014
Token went
Token ID1816
Feature activation-2.81e-02

Pos logit contributions
omo+0.003
ums+0.003
omm+0.002
 size+0.002
mm+0.002
Neg logit contributions
leaf-0.002
 rubble-0.002
 stret-0.002
Hop-0.002
 nearby-0.002
Token to
Token ID284
Feature activation+6.69e-03

Pos logit contributions
 rubble+0.054
lves+0.048
leaf+0.048
ts+0.047
 stret+0.046
Neg logit contributions
omo-0.037
 size-0.035
 head-0.035
 Mom-0.033
 only-0.031
Token the
Token ID262
Feature activation-7.53e-03

Pos logit contributions
omo+0.010
 size+0.010
 head+0.009
 Mom+0.009
 Sorry+0.008
Neg logit contributions
 rubble-0.012
leaf-0.011
 stret-0.011
lves-0.010
Hop-0.010
Token art
Token ID1242
Feature activation+8.87e-03

Pos logit contributions
 rubble+0.012
leaf+0.012
 stret+0.011
lves+0.010
sm+0.010
Neg logit contributions
omo-0.013
 size-0.011
omm-0.010
 head-0.010
ums-0.010
Token supply
Token ID5127
Feature activation+1.44e-02

Pos logit contributions
omo+0.015
 size+0.013
 head+0.012
omm+0.012
ums+0.012
Neg logit contributions
 rubble-0.015
leaf-0.014
 stret-0.014
lves-0.013
sm-0.013
Token store
Token ID3650
Feature activation-9.38e-03

Pos logit contributions
omo+0.023
 size+0.021
 head+0.019
ums+0.019
omm+0.018
Neg logit contributions
 rubble-0.025
leaf-0.024
 stret-0.023
lves-0.021
Hop-0.021
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 rubble+0.019
leaf+0.018
 stret+0.018
 unnecessary+0.017
Hop+0.017
Neg logit contributions
omo-0.012
 size-0.011
 head-0.010
 Mom-0.009
ums-0.009
Token picked
Token ID6497
Feature activation-2.13e-02

Pos logit contributions
 rubble+0.022
leaf+0.021
 stret+0.020
sm+0.019
lves+0.019
Neg logit contributions
omo-0.021
 size-0.019
omm-0.017
 head-0.017
ums-0.017
Token new
Token ID649
Feature activation-8.83e-03

Pos logit contributions
omo+0.036
 size+0.032
 Mom+0.031
ums+0.030
 head+0.030
Neg logit contributions
 rubble-0.030
leaf-0.028
 stret-0.027
lves-0.026
sm-0.025
Token friends
Token ID2460
Feature activation-2.57e-02

Pos logit contributions
 rubble+0.015
leaf+0.015
 stret+0.014
 outdoors+0.013
ts+0.013
Neg logit contributions
omo-0.014
 size-0.012
omm-0.012
 Sorry-0.011
ums-0.011
Token and
Token ID290
Feature activation-2.19e-02

Pos logit contributions
 rubble+0.051
leaf+0.048
 stret+0.047
lves+0.044
 unnecessary+0.043
Neg logit contributions
omo-0.035
 size-0.031
 head-0.029
ums-0.027
 Mom-0.026
Token was
Token ID373
Feature activation+9.84e-03

Pos logit contributions
 rubble+0.036
leaf+0.034
 stret+0.033
ts+0.031
Hop+0.030
Neg logit contributions
omo-0.036
omm-0.032
 size-0.032
ums-0.030
 Sorry-0.029
Token always
Token ID1464
Feature activation-7.50e-03

Pos logit contributions
omo+0.019
 size+0.017
omm+0.016
 Sorry+0.016
ums+0.016
Neg logit contributions
 rubble-0.013
leaf-0.013
 stret-0.012
sm-0.011
 outdoors-0.011
Token laughing
Token ID14376
Feature activation-1.43e-02

Pos logit contributions
 rubble+0.011
leaf+0.010
 stret+0.010
lves+0.009
sm+0.009
Neg logit contributions
omo-0.015
 size-0.013
 head-0.012
ums-0.012
omm-0.012
Token.
Token ID13
Feature activation-3.56e-03

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.022
sm+0.021
Hop+0.020
Neg logit contributions
omo-0.024
 size-0.021
omm-0.020
ums-0.019
 head-0.019
Token His
Token ID2399
Feature activation+4.30e-02

Pos logit contributions
 rubble+0.005
leaf+0.004
 stret+0.004
Hop+0.004
sm+0.004
Neg logit contributions
 size-0.006
omo-0.006
 head-0.006
ums-0.005
master-0.005
Token days
Token ID1528
Feature activation-3.16e-02

Pos logit contributions
omo+0.040
 size+0.032
omm+0.029
ums+0.029
 Sorry+0.028
Neg logit contributions
 rubble-0.099
leaf-0.096
 stret-0.095
 outdoors-0.088
 groceries-0.087
Token were
Token ID547
Feature activation+1.48e-02

Pos logit contributions
 rubble+0.056
leaf+0.054
 stret+0.052
sm+0.048
lves+0.048
Neg logit contributions
omo-0.050
 size-0.043
 head-0.039
omm-0.039
ums-0.039
Token filled
Token ID5901
Feature activation-1.15e-02

Pos logit contributions
omo+0.024
 Sorry+0.022
omm+0.022
ums+0.022
 size+0.021
Neg logit contributions
 rubble-0.023
leaf-0.021
 stret-0.021
 outdoors-0.020
sm-0.020
Token no
Token ID645
Feature activation+2.66e-02

Pos logit contributions
 rubble+0.007
leaf+0.007
 stret+0.007
sm+0.007
lves+0.006
Neg logit contributions
omo-0.004
 size-0.003
ums-0.003
omm-0.002
 head-0.002
Token!
Token ID0
Feature activation+3.87e-03

Pos logit contributions
omo+0.045
master+0.041
omm+0.040
ums+0.040
 head+0.039
Neg logit contributions
 rubble-0.040
 stret-0.036
leaf-0.035
sm-0.035
 risks-0.035
Token I
Token ID314
Feature activation-3.12e-02

Pos logit contributions
 size+0.006
omo+0.005
master+0.004
 head+0.004
ums+0.004
Neg logit contributions
 rubble-0.007
leaf-0.007
free-0.006
sm-0.006
Hop-0.006
Token'm
Token ID1101
Feature activation-4.43e-03

Pos logit contributions
 rubble+0.051
leaf+0.048
sm+0.046
 stret+0.045
lves+0.044
Neg logit contributions
omo-0.052
 size-0.046
omm-0.046
ums-0.044
mm-0.043
Token feeling
Token ID4203
Feature activation-2.87e-02

Pos logit contributions
 rubble+0.007
leaf+0.007
 stret+0.007
lves+0.006
sm+0.006
Neg logit contributions
omo-0.008
 size-0.007
ums-0.006
omm-0.006
 Sorry-0.006
Token so
Token ID523
Feature activation-1.78e-02

Pos logit contributions
 rubble+0.046
leaf+0.044
 stret+0.043
lves+0.039
Hop+0.039
Neg logit contributions
omo-0.049
 size-0.045
 head-0.041
omm-0.040
ums-0.040
Token dizz
Token ID29527
Feature activation+9.37e-03

Pos logit contributions
 rubble+0.027
leaf+0.026
 stret+0.024
sm+0.023
lves+0.022
Neg logit contributions
omo-0.033
 size-0.029
omm-0.027
ums-0.027
 head-0.027
Tokeny
Token ID88
Feature activation-1.70e-03

Pos logit contributions
omo+0.014
 size+0.012
omm+0.011
 Sorry+0.011
 head+0.011
Neg logit contributions
 rubble-0.017
leaf-0.016
 stret-0.015
sm-0.015
free-0.015
Token!"
Token ID2474
Feature activation+1.67e-03

Pos logit contributions
 rubble+0.003
leaf+0.003
 stret+0.003
sm+0.003
 outdoors+0.003
Neg logit contributions
omo-0.003
 size-0.002
omm-0.002
ums-0.002
 Sorry-0.002
Token Then
Token ID3244
Feature activation+3.24e-04

Pos logit contributions
omo+0.002
 size+0.002
 head+0.002
omm+0.002
ums+0.001
Neg logit contributions
 rubble-0.004
leaf-0.003
 stret-0.003
lves-0.003
ts-0.003
Token she
Token ID673
Feature activation-1.17e-02

Pos logit contributions
omo+0.001
 size+0.001
ums+0.000
omm+0.000
 loser+0.000
Neg logit contributions
 rubble-0.000
leaf-0.000
 outdoors-0.000
 stret-0.000
sm-0.000
Token."
Token ID526
Feature activation-8.86e-04

Pos logit contributions
 rubble+0.088
leaf+0.084
 stret+0.082
sm+0.076
Hop+0.075
Neg logit contributions
omo-0.078
 size-0.067
omm-0.063
ums-0.061
 head-0.060
Token Jane
Token ID12091
Feature activation-2.84e-02

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
sm+0.001
Hop+0.001
Neg logit contributions
omo-0.001
 size-0.001
 head-0.001
omm-0.001
ums-0.001
Token was
Token ID373
Feature activation+1.64e-03

Pos logit contributions
leaf+0.045
 rubble+0.045
 stret+0.044
Hop+0.040
sm+0.040
Neg logit contributions
omo-0.048
 size-0.042
ums-0.039
omm-0.039
mm-0.038
Token so
Token ID523
Feature activation-2.00e-02

Pos logit contributions
omo+0.003
 size+0.003
 head+0.003
omm+0.003
ums+0.003
Neg logit contributions
 rubble-0.002
leaf-0.002
 stret-0.002
sm-0.002
lves-0.002
Token happy
Token ID3772
Feature activation+6.03e-03

Pos logit contributions
 rubble+0.024
leaf+0.022
 stret+0.021
sm+0.019
lves+0.018
Neg logit contributions
omo-0.043
 size-0.040
omm-0.037
 head-0.036
ums-0.036
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
omo+0.009
 size+0.008
 head+0.007
ums+0.007
omm+0.007
Neg logit contributions
 rubble-0.011
leaf-0.010
 stret-0.010
lves-0.009
ts-0.009
Token She
Token ID1375
Feature activation-3.40e-03

Pos logit contributions
 rubble+0.030
leaf+0.029
 stret+0.028
 outdoors+0.026
Hop+0.026
Neg logit contributions
omo-0.028
 size-0.026
 head-0.023
ums-0.023
omm-0.022
Token couldn
Token ID3521
Feature activation-5.51e-03

Pos logit contributions
 rubble+0.005
leaf+0.005
 stret+0.005
Hop+0.004
sm+0.004
Neg logit contributions
omo-0.006
 size-0.005
ums-0.005
omm-0.005
mm-0.005
Token't
Token ID470
Feature activation+9.08e-04

Pos logit contributions
leaf+0.008
 rubble+0.008
 stret+0.008
ival+0.008
looking+0.007
Neg logit contributions
omo-0.010
 size-0.009
 head-0.008
 throat-0.008
ums-0.008
Token believe
Token ID1975
Feature activation-9.46e-03

Pos logit contributions
omo+0.001
 size+0.001
omm+0.001
ums+0.001
 head+0.001
Neg logit contributions
 rubble-0.002
leaf-0.002
 stret-0.002
sm-0.001
Hop-0.001
Token it
Token ID340
Feature activation-1.58e-02

Pos logit contributions
 rubble+0.015
leaf+0.015
 stret+0.014
lves+0.013
Hop+0.013
Neg logit contributions
omo-0.016
 size-0.015
 head-0.014
 Mom-0.014
omm-0.013
Token fun
Token ID1257
Feature activation-2.39e-02

Pos logit contributions
omo+0.006
 size+0.006
 Mom+0.005
 head+0.005
 Sorry+0.005
Neg logit contributions
 rubble-0.006
leaf-0.006
 stret-0.005
ts-0.005
lves-0.005
Token time
Token ID640
Feature activation-1.63e-02

Pos logit contributions
 rubble+0.048
leaf+0.046
 stret+0.044
lves+0.043
ts+0.042
Neg logit contributions
omo-0.034
 size-0.030
 head-0.025
 Mom-0.025
ums-0.024
Token drinking
Token ID7722
Feature activation+1.87e-02

Pos logit contributions
 rubble+0.029
leaf+0.028
 stret+0.028
lves+0.025
Hop+0.025
Neg logit contributions
omo-0.025
 size-0.023
ums-0.020
 head-0.020
 Mom-0.020
Token it
Token ID340
Feature activation-3.52e-02

Pos logit contributions
omo+0.033
 size+0.030
 head+0.027
omm+0.027
ums+0.027
Neg logit contributions
 rubble-0.029
leaf-0.028
 stret-0.027
lves-0.025
sm-0.024
Token.
Token ID13
Feature activation+6.90e-04

Pos logit contributions
 rubble+0.055
leaf+0.050
 stret+0.049
sm+0.047
ts+0.046
Neg logit contributions
omo-0.064
 size-0.054
omm-0.053
ums-0.052
 Sorry-0.050
Token<|endoftext|>
Token ID50256
Feature activation-7.41e-02

Pos logit contributions
omo+0.001
 size+0.001
 head+0.001
omm+0.001
ums+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
sm-0.001
lves-0.001
TokenOnce
Token ID7454
Feature activation+7.03e-03

Pos logit contributions
 rubble+0.149
leaf+0.142
 stret+0.139
lves+0.130
 unnecessary+0.128
Neg logit contributions
omo-0.102
 size-0.084
 Mom-0.081
ums-0.076
 head-0.076
Token upon
Token ID2402
Feature activation+4.09e-03

Pos logit contributions
omo+0.014
 size+0.012
 Mom+0.012
 head+0.012
ums+0.012
Neg logit contributions
 rubble-0.010
leaf-0.010
 stret-0.009
ts-0.008
lves-0.008
Token a
Token ID257
Feature activation+1.91e-02

Pos logit contributions
omo+0.005
 Mom+0.005
 only+0.005
 size+0.005
 be+0.004
Neg logit contributions
 rubble-0.008
leaf-0.008
 stret-0.008
lves-0.008
 unnecessary-0.007
Token time
Token ID640
Feature activation-1.13e-02

Pos logit contributions
omo+0.030
 size+0.025
 head+0.023
omm+0.022
ums+0.022
Neg logit contributions
 rubble-0.036
leaf-0.034
 stret-0.032
lves-0.030
ts-0.030
Token,
Token ID11
Feature activation+9.44e-03

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.022
sm+0.021
lves+0.021
Neg logit contributions
omo-0.014
 size-0.012
 head-0.011
omm-0.010
ums-0.010
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
lves+0.001
ts+0.001
Neg logit contributions
omo-0.001
 size-0.000
omm-0.000
ums-0.000
 head-0.000
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
 rubble+0.031
leaf+0.029
 stret+0.029
lves+0.028
 unnecessary+0.027
Neg logit contributions
omo-0.020
 Mom-0.016
ums-0.015
 size-0.015
omm-0.015
TokenThe
Token ID464
Feature activation+6.77e-03

Pos logit contributions
 rubble+0.029
leaf+0.027
 stret+0.026
lves+0.024
ts+0.024
Neg logit contributions
omo-0.031
 size-0.026
omm-0.024
ums-0.024
 head-0.024
Token end
Token ID886
Feature activation-7.79e-03

Pos logit contributions
omo+0.009
 size+0.008
 head+0.007
 Mom+0.007
 mom+0.006
Neg logit contributions
 rubble-0.014
leaf-0.014
 stret-0.014
lves-0.013
ts-0.013
Token!
Token ID0
Feature activation-7.51e-03

Pos logit contributions
 rubble+0.014
leaf+0.014
 stret+0.013
sm+0.012
Hop+0.012
Neg logit contributions
omo-0.012
 size-0.010
omm-0.010
ums-0.010
 head-0.010
Token<|endoftext|>
Token ID50256
Feature activation-8.69e-02

Pos logit contributions
 rubble+0.010
leaf+0.010
 stret+0.009
sm+0.009
Hop+0.009
Neg logit contributions
omo-0.014
 size-0.014
 head-0.012
omm-0.012
ums-0.012
TokenOne
Token ID3198
Feature activation+2.00e-02

Pos logit contributions
 rubble+0.147
leaf+0.136
 stret+0.135
lves+0.124
ts+0.122
Neg logit contributions
omo-0.149
 size-0.128
 Mom-0.122
ums-0.119
omm-0.119
Token day
Token ID1110
Feature activation-2.76e-02

Pos logit contributions
omo+0.033
 size+0.030
omm+0.028
ums+0.028
 head+0.027
Neg logit contributions
 rubble-0.033
leaf-0.032
 stret-0.030
sm-0.029
 outdoors-0.028
Token,
Token ID11
Feature activation+4.74e-03

Pos logit contributions
 rubble+0.057
leaf+0.055
 stret+0.054
sm+0.051
Hop+0.050
Neg logit contributions
omo-0.035
 size-0.029
 head-0.026
omm-0.026
ums-0.026
Token Larry
Token ID13633
Feature activation-2.81e-03

Pos logit contributions
omo+0.007
 Mom+0.006
 size+0.006
 only+0.006
 Sorry+0.006
Neg logit contributions
 rubble-0.009
leaf-0.009
 stret-0.009
aid-0.008
lves-0.008
Token went
Token ID1816
Feature activation-4.32e-02

Pos logit contributions
 rubble+0.005
leaf+0.005
 stret+0.005
sm+0.004
Hop+0.004
Neg logit contributions
omo-0.004
 size-0.004
omm-0.004
ums-0.003
 Sorry-0.003
Token even
Token ID772
Feature activation-7.32e-03

Pos logit contributions
 rubble+0.035
leaf+0.033
 stret+0.032
sm+0.030
Hop+0.029
Neg logit contributions
omo-0.030
 size-0.025
omm-0.024
ums-0.023
 head-0.023
Token when
Token ID618
Feature activation+2.00e-02

Pos logit contributions
 rubble+0.012
leaf+0.011
 stret+0.011
sm+0.010
lves+0.010
Neg logit contributions
omo-0.013
 size-0.011
omm-0.010
ums-0.010
 head-0.010
Token having
Token ID1719
Feature activation-1.13e-02

Pos logit contributions
omo+0.039
 size+0.033
 head+0.033
omm+0.032
ums+0.032
Neg logit contributions
 rubble-0.028
leaf-0.026
 stret-0.024
sm-0.023
 outdoors-0.023
Token fun
Token ID1257
Feature activation-7.52e-03

Pos logit contributions
 rubble+0.017
leaf+0.016
 stret+0.016
lves+0.015
sm+0.014
Neg logit contributions
omo-0.020
 size-0.019
 Mom-0.018
 head-0.017
 only-0.017
Token.
Token ID13
Feature activation+5.83e-04

Pos logit contributions
 rubble+0.014
leaf+0.014
 stret+0.013
lves+0.012
sm+0.012
Neg logit contributions
omo-0.011
 size-0.010
 head-0.009
ums-0.009
omm-0.009
Token<|endoftext|>
Token ID50256
Feature activation-8.55e-02

Pos logit contributions
omo+0.001
 size+0.001
 head+0.001
omm+0.001
ums+0.001
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
lves-0.001
sm-0.001
TokenOnce
Token ID7454
Feature activation+4.06e-03

Pos logit contributions
 rubble+0.152
leaf+0.141
 stret+0.141
lves+0.129
 unnecessary+0.127
Neg logit contributions
omo-0.140
 size-0.119
 Mom-0.117
 Me-0.111
omm-0.110
Token upon
Token ID2402
Feature activation-2.99e-03

Pos logit contributions
omo+0.008
 size+0.007
omm+0.007
 head+0.007
ums+0.007
Neg logit contributions
 rubble-0.005
leaf-0.005
 stret-0.005
lves-0.004
Hop-0.004
Token a
Token ID257
Feature activation+1.13e-02

Pos logit contributions
 rubble+0.006
lves+0.006
 stret+0.006
leaf+0.006
 unnecessary+0.005
Neg logit contributions
omo-0.003
 Mom-0.003
 only-0.003
 size-0.003
 no-0.003
Token time
Token ID640
Feature activation-1.29e-02

Pos logit contributions
omo+0.018
 size+0.015
 head+0.014
omm+0.014
ums+0.013
Neg logit contributions
 rubble-0.021
leaf-0.020
 stret-0.019
lves-0.018
ts-0.018
Token,
Token ID11
Feature activation+8.88e-03

Pos logit contributions
 rubble+0.028
leaf+0.027
 stret+0.026
sm+0.024
lves+0.024
Neg logit contributions
omo-0.016
 size-0.013
 head-0.011
omm-0.011
ums-0.011
Token would
Token ID561
Feature activation+1.52e-02

Pos logit contributions
 rubble+0.024
leaf+0.023
 stret+0.023
lves+0.020
sm+0.020
Neg logit contributions
omo-0.016
 size-0.014
 head-0.012
omm-0.012
 Mom-0.012
Token never
Token ID1239
Feature activation+3.68e-04

Pos logit contributions
omo+0.021
 size+0.017
omm+0.017
ums+0.016
 Sorry+0.015
Neg logit contributions
 rubble-0.030
leaf-0.028
 stret-0.028
Hop-0.026
ts-0.026
Token fight
Token ID1907
Feature activation+3.12e-02

Pos logit contributions
omo+0.001
omm+0.000
 size+0.000
ums+0.000
ama+0.000
Neg logit contributions
 rubble-0.001
leaf-0.001
 stret-0.001
Hop-0.001
sm-0.001
Token again
Token ID757
Feature activation+2.61e-02

Pos logit contributions
omo+0.063
omm+0.053
ums+0.052
 size+0.051
ama+0.051
Neg logit contributions
 rubble-0.042
leaf-0.037
 everywhere-0.037
 stret-0.037
ts-0.036
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
omo+0.054
omm+0.050
ums+0.046
 Sorry+0.045
 size+0.044
Neg logit contributions
 rubble-0.032
sm-0.030
 nearby-0.029
 stret-0.029
ts-0.029
Token<|endoftext|>
Token ID50256
Feature activation-7.96e-02

Pos logit contributions
omo+0.035
 size+0.031
 head+0.029
ums+0.028
omm+0.028
Neg logit contributions
 rubble-0.035
leaf-0.033
 stret-0.031
sm-0.031
lves-0.030
TokenL
Token ID43
Feature activation-2.18e-02

Pos logit contributions
 rubble+0.123
leaf+0.114
 stret+0.112
lves+0.103
 unnecessary+0.098
Neg logit contributions
omo-0.147
 size-0.129
ums-0.121
 Mom-0.120
omm-0.120
Tokenily
Token ID813
Feature activation-1.33e-03

Pos logit contributions
 rubble+0.038
leaf+0.036
 stret+0.035
lves+0.032
Hop+0.032
Neg logit contributions
omo-0.036
 size-0.032
omm-0.029
ums-0.029
 head-0.029
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
 rubble+0.002
leaf+0.002
 stret+0.002
sm+0.002
Hop+0.002
Neg logit contributions
omo-0.002
ums-0.002
omm-0.002
 size-0.002
 Sorry-0.002
Token Sam
Token ID3409
Feature activation-1.03e-02

Pos logit contributions
 rubble+0.021
leaf+0.021
 stret+0.020
sm+0.019
Hop+0.018
Neg logit contributions
omo-0.019
 size-0.016
omm-0.015
ums-0.015
 head-0.014
Token are
Token ID389
Feature activation+7.81e-03

Pos logit contributions
 rubble+0.018
leaf+0.017
 stret+0.017
sm+0.016
lves+0.016
Neg logit contributions
omo-0.016
 size-0.014
omm-0.013
ums-0.013
 head-0.013
Token the
Token ID262
Feature activation-3.50e-02

Pos logit contributions
 rubble+0.002
leaf+0.002
 stret+0.002
lves+0.002
sm+0.002
Neg logit contributions
omo-0.002
 size-0.002
 head-0.002
omm-0.002
ums-0.002
Token wild
Token ID4295
Feature activation-2.55e-03

Pos logit contributions
leaf+0.059
 rubble+0.057
ts+0.053
lves+0.053
 stret+0.052
Neg logit contributions
omo-0.057
 size-0.052
 head-0.050
 Mom-0.047
omm-0.044
Token morning
Token ID3329
Feature activation-2.70e-02

Pos logit contributions
 rubble+0.004
leaf+0.004
 stret+0.004
lves+0.004
sm+0.004
Neg logit contributions
omo-0.004
 size-0.004
 head-0.004
omm-0.003
ums-0.003
Token again
Token ID757
Feature activation+2.69e-02

Pos logit contributions
 rubble+0.050
leaf+0.048
 stret+0.046
lves+0.043
sm+0.043
Neg logit contributions
omo-0.040
 size-0.035
 head-0.032
omm-0.031
ums-0.031
Token.
Token ID13
Feature activation+3.86e-03

Pos logit contributions
omo+0.048
omm+0.041
 size+0.039
ums+0.039
 Sorry+0.038
Neg logit contributions
 rubble-0.042
 stret-0.039
leaf-0.039
 outdoors-0.038
sm-0.037
Token<|endoftext|>
Token ID50256
Feature activation-7.27e-02

Pos logit contributions
omo+0.007
 size+0.006
 head+0.006
ums+0.006
omm+0.005
Neg logit contributions
 rubble-0.006
leaf-0.006
 stret-0.005
Hop-0.005
sm-0.005
TokenOne
Token ID3198
Feature activation+1.89e-02

Pos logit contributions
 rubble+0.138
leaf+0.130
 stret+0.128
lves+0.118
 unnecessary+0.116
Neg logit contributions
omo-0.110
 size-0.090
 Mom-0.090
omm-0.084
ums-0.084
Token sunny
Token ID27737
Feature activation-8.68e-04

Pos logit contributions
omo+0.029
 size+0.026
 head+0.023
omm+0.023
ums+0.023
Neg logit contributions
 rubble-0.034
leaf-0.033
 stret-0.032
sm-0.029
lves-0.029
Token day
Token ID1110
Feature activation-2.38e-02

Pos logit contributions
 rubble+0.001
leaf+0.001
 stret+0.001
sm+0.001
Hop+0.001
Neg logit contributions
omo-0.001
 size-0.001
omm-0.001
ums-0.001
 head-0.001
Token,
Token ID11
Feature activation+8.23e-03

Pos logit contributions
 rubble+0.051
leaf+0.049
 stret+0.048
sm+0.045
lves+0.044
Neg logit contributions
omo-0.029
 size-0.024
 head-0.021
omm-0.021
ums-0.021
Token Ben
Token ID3932
Feature activation+1.53e-02

Pos logit contributions
omo+0.012
 Mom+0.011
 only+0.010
 size+0.010
 Me+0.010
Neg logit contributions
 rubble-0.017
leaf-0.016
 stret-0.015
 unnecessary-0.015
 risks-0.014