TOP ACTIVATIONS
MAX = 0.120

went to knock on his neighbor 's door .
little Emma ran to her neighbor 's house . She was
They decided to ask their neighbor to help them repair the
So they knocked on their neighbor 's door . The neighbor
She decided to ask her neighbor , Mr . Johnson ,
mom went to visit their neighbor , Mrs . Lee .
decided to ask her wealthy neighbor for help . The neighbor
basket of cookies to their neighbor , Mrs . Lee .
to introduce themselves to their neighbor . John
He asked his neighbor for help . His neighbor
They decided to ask their neighbor who knew a lot about
were going to visit their neighbor who was sick . Tim
Jack went to ask his neighbour , Mrs Clar ise ,
, so he asked his neighbor , " Would you lend
, the mom asked her daughter , " Would you like
shop . She asked her son if he wanted it .
mom to take her to Uncle David 's house . Mia
shoes and went to his neighbor 's home . He knocked
, Joe went to his neighbour 's house and asked if
ople decided to ask their neighbour , Old Tom , for

MIN ACTIVATIONS
MIN = -0.098

ka dot dress . The dots on her dress looked like
each day . Very soon , Anna was whist
the log . Their little hearts were full of joy as
of her garden . The shapes in the clouds moved and
out . The view from the top was beautiful
fun ?" The shapes thought for a moment and
ummy smelling biscuits . The shapes were all pretty and covered
better . The view that he wanted to reach
the water . The funny shapes they made made him laugh
the forest . The two shapes were no longer weak and
. The two shapes became friends and they played
's courage . The two de ers became good friends after
from the bird above . Start led , Bobby screamed and
the trees . All these sounds brought him peace and happiness
end , the garden was even more beautiful than before !
makes a loud noise and flashes its lights . Tom is
, but it felt so com fy and warm . She
beautiful rainbow ! All the colours were so bright and beautiful
. Suddenly , the paper shapes the girl had folded came
horse was actually alive ! Start led , Lucy jumped back

INTERVAL 0.066 - 0.120
CONTAINS 0.707%

and cool . Tom wanted to play with it , but
there was a useful dog named Spot . Spot liked to
was very hungry and wanted to eat an egg . But
Can you teach me how to fly ?" The squash did
uddle . " I want to play !", said Mary .

INTERVAL 0.011 - 0.066
CONTAINS 30.989%

Mom my . I didn 't mean to break the v
was Fred and he liked to design things with his friends
't find the perfect one . One day ,
they walked away , Lily 's dad said , " Did
was not sure if they should buy the shelf , so

INTERVAL -0.044 - 0.011
CONTAINS 66.232%

mom said yes . The little girl was so
the truck driver . He thanks them back . He gives
. " Look , Sara , my block fits
Ben felt bad . He realized he had done a
top . It was called the attic . The attic was

INTERVAL -0.098 - -0.044
CONTAINS 2.072%

If you open it , something good will happen ."
but then she saw something very unusual . There was a
. Even with a few sprink les inside ."
He pushed himself up and wob bled on his feet for
zoom ed around his town feeling proud . <|endoftext|> Once there
Token went
Token ID1816
Feature activation+2.66e-02

Pos logit contributions
lessly+0.019
 much+0.015
ww+0.014
 precious+0.013
 nice+0.013
Neg logit contributions
keeping-0.020
birds-0.017
girls-0.016
ASH-0.016
rones-0.016
Token to
Token ID284
Feature activation+6.23e-02

Pos logit contributions
keeping+0.057
rones+0.047
birds+0.047
group+0.046
growth+0.046
Neg logit contributions
lessly-0.048
 much-0.039
 nice-0.036
 precious-0.036
 shiny-0.035
Token knock
Token ID10643
Feature activation+2.64e-02

Pos logit contributions
keeping+0.130
birds+0.113
rones+0.113
girls+0.110
group+0.107
Neg logit contributions
lessly-0.104
 much-0.101
 nice-0.097
 shiny-0.093
 proud-0.092
Token on
Token ID319
Feature activation+3.31e-02

Pos logit contributions
keeping+0.061
ASH+0.050
birds+0.050
rones+0.050
group+0.049
Neg logit contributions
lessly-0.041
 much-0.032
ww-0.031
 precious-0.031
 valuable-0.030
Token his
Token ID465
Feature activation+4.17e-02

Pos logit contributions
keeping+0.076
rones+0.065
birds+0.063
group+0.063
girls+0.063
Neg logit contributions
lessly-0.055
 much-0.047
 nice-0.044
 shiny-0.043
 good-0.042
Token neighbor
Token ID4780
Feature activation+1.20e-01

Pos logit contributions
keeping+0.079
rones+0.063
birds+0.062
growth+0.061
ASH+0.061
Neg logit contributions
lessly-0.087
 much-0.071
 nice-0.066
 precious-0.066
ww-0.066
Token's
Token ID338
Feature activation+4.99e-02

Pos logit contributions
keeping+0.305
rones+0.274
growth+0.270
IT+0.269
group+0.265
Neg logit contributions
 nice-0.111
 very-0.109
lessly-0.104
 much-0.102
 shiny-0.100
Token door
Token ID3420
Feature activation+1.56e-02

Pos logit contributions
keeping+0.108
rones+0.096
birds+0.094
girls+0.094
group+0.093
Neg logit contributions
lessly-0.078
 nice-0.067
 much-0.066
 shiny-0.064
 precious-0.062
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
keeping+0.034
rones+0.029
birds+0.029
growth+0.028
group+0.028
Neg logit contributions
lessly-0.027
 much-0.022
 nice-0.020
 precious-0.020
 shiny-0.019
Token
Token ID198
Feature activation-2.58e-02

Pos logit contributions
keeping+0.050
rones+0.042
growth+0.041
birds+0.041
ASH+0.040
Neg logit contributions
lessly-0.041
 much-0.036
 proud-0.033
 nice-0.032
 shiny-0.029
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
lessly+0.049
 much+0.043
 nice+0.039
 proud+0.038
 shiny+0.036
Neg logit contributions
keeping-0.052
ASH-0.043
birds-0.043
rones-0.042
IT-0.042
Token little
Token ID1310
Feature activation+1.86e-03

Pos logit contributions
lessly+0.005
 much+0.004
 nice+0.004
 precious+0.004
 shiny+0.004
Neg logit contributions
keeping-0.006
rones-0.005
birds-0.005
group-0.005
growth-0.005
Token Emma
Token ID18966
Feature activation+1.56e-02

Pos logit contributions
keeping+0.003
rones+0.002
group+0.002
growth+0.002
birds+0.002
Neg logit contributions
lessly-0.004
 much-0.004
 nice-0.004
ww-0.004
 precious-0.004
Token ran
Token ID4966
Feature activation+1.95e-02

Pos logit contributions
keeping+0.038
rones+0.032
birds+0.032
group+0.032
girls+0.032
Neg logit contributions
lessly-0.022
 much-0.018
 nice-0.016
 precious-0.016
 shiny-0.016
Token to
Token ID284
Feature activation+5.71e-02

Pos logit contributions
keeping+0.033
group+0.025
birds+0.025
rones+0.025
date+0.024
Neg logit contributions
lessly-0.043
 precious-0.036
ww-0.035
 much-0.034
arest-0.034
Token her
Token ID607
Feature activation+4.68e-02

Pos logit contributions
keeping+0.161
girls+0.137
rones+0.134
ASH+0.133
birds+0.131
Neg logit contributions
 nice-0.065
 much-0.062
 shiny-0.058
 beautiful-0.055
 big-0.055
Token neighbor
Token ID4780
Feature activation+1.11e-01

Pos logit contributions
keeping+0.100
growth+0.079
birds+0.078
rones+0.078
ASH+0.076
Neg logit contributions
 nice-0.075
 much-0.072
lessly-0.072
 precious-0.072
 shiny-0.071
Token's
Token ID338
Feature activation+4.73e-02

Pos logit contributions
keeping+0.330
growth+0.282
IT+0.275
ASH+0.274
rones+0.267
Neg logit contributions
 much-0.087
 very-0.084
 nice-0.083
 for-0.077
 of-0.074
Token house
Token ID2156
Feature activation+2.38e-02

Pos logit contributions
keeping+0.097
girls+0.084
growth+0.081
birds+0.080
osc+0.079
Neg logit contributions
 nice-0.078
 much-0.074
 big-0.072
 very-0.071
 proud-0.070
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
keeping+0.058
rones+0.051
growth+0.050
girls+0.050
ASH+0.049
Neg logit contributions
lessly-0.034
 much-0.028
 nice-0.026
 proud-0.024
 shiny-0.024
Token She
Token ID1375
Feature activation-1.62e-02

Pos logit contributions
keeping+0.029
girls+0.024
birds+0.023
rones+0.022
ASH+0.022
Neg logit contributions
lessly-0.032
 much-0.030
 nice-0.029
 good-0.026
 shiny-0.026
Token was
Token ID373
Feature activation-1.99e-02

Pos logit contributions
lessly+0.024
 much+0.019
 nice+0.018
 precious+0.018
 shiny+0.017
Neg logit contributions
keeping-0.040
rones-0.034
birds-0.033
growth-0.033
group-0.033
Token They
Token ID1119
Feature activation-1.70e-02

Pos logit contributions
keeping+0.029
rones+0.023
growth+0.022
date+0.022
IT+0.022
Neg logit contributions
lessly-0.026
 much-0.022
 nice-0.020
 valuable-0.019
 shiny-0.019
Token decided
Token ID3066
Feature activation+1.30e-02

Pos logit contributions
lessly+0.031
 much+0.025
ww+0.023
 precious+0.023
 nice+0.023
Neg logit contributions
keeping-0.035
rones-0.029
birds-0.029
group-0.028
ASH-0.028
Token to
Token ID284
Feature activation+5.49e-02

Pos logit contributions
keeping+0.026
rones+0.022
group+0.022
birds+0.022
growth+0.021
Neg logit contributions
lessly-0.025
 much-0.021
 nice-0.020
 shiny-0.020
 precious-0.019
Token ask
Token ID1265
Feature activation+5.34e-02

Pos logit contributions
keeping+0.113
rones+0.101
birds+0.092
 Lee+0.090
ASH+0.089
Neg logit contributions
lessly-0.091
 nice-0.083
 much-0.081
 good-0.078
 shiny-0.077
Token their
Token ID511
Feature activation+3.85e-02

Pos logit contributions
keeping+0.138
rones+0.120
ASH+0.116
birds+0.115
growth+0.114
Neg logit contributions
lessly-0.076
 much-0.066
 nice-0.063
 good-0.061
 hard-0.057
Token neighbor
Token ID4780
Feature activation+1.10e-01

Pos logit contributions
keeping+0.086
rones+0.072
growth+0.071
birds+0.068
group+0.068
Neg logit contributions
lessly-0.064
 nice-0.059
 good-0.057
 precious-0.057
 much-0.057
Token to
Token ID284
Feature activation+7.54e-02

Pos logit contributions
keeping+0.354
growth+0.314
ASH+0.301
rones+0.300
IT+0.297
Neg logit contributions
lessly-0.079
 nice-0.077
 much-0.074
 good-0.069
 for-0.066
Token help
Token ID1037
Feature activation+5.80e-02

Pos logit contributions
keeping+0.139
rones+0.117
birds+0.112
ASH+0.109
growth+0.108
Neg logit contributions
lessly-0.151
 much-0.133
 nice-0.133
 shiny-0.126
 proud-0.125
Token them
Token ID606
Feature activation+5.97e-02

Pos logit contributions
keeping+0.128
rones+0.117
growth+0.113
ASH+0.110
seek+0.109
Neg logit contributions
lessly-0.086
 nice-0.085
 much-0.081
 good-0.076
 shiny-0.075
Token repair
Token ID9185
Feature activation+4.47e-02

Pos logit contributions
keeping+0.135
rones+0.117
growth+0.112
ASH+0.111
birds+0.110
Neg logit contributions
lessly-0.099
 much-0.084
 nice-0.080
 shiny-0.076
 proud-0.075
Token the
Token ID262
Feature activation+9.52e-03

Pos logit contributions
keeping+0.098
rones+0.086
birds+0.084
growth+0.083
group+0.082
Neg logit contributions
lessly-0.072
 much-0.063
 nice-0.060
 good-0.057
 shiny-0.057
TokenSo
Token ID2396
Feature activation-2.44e-02

Pos logit contributions
lessly+0.018
 much+0.014
 precious+0.013
 nice+0.013
 shiny+0.013
Neg logit contributions
keeping-0.023
rones-0.019
birds-0.019
group-0.019
ASH-0.018
Token they
Token ID484
Feature activation-1.06e-02

Pos logit contributions
lessly+0.042
 much+0.033
 precious+0.031
 nice+0.031
ww+0.030
Neg logit contributions
keeping-0.054
rones-0.045
birds-0.045
group-0.044
growth-0.044
Token knocked
Token ID13642
Feature activation+9.22e-03

Pos logit contributions
lessly+0.017
 much+0.013
 nice+0.013
 shiny+0.012
 proud+0.012
Neg logit contributions
keeping-0.025
rones-0.021
birds-0.021
ASH-0.021
growth-0.020
Token on
Token ID319
Feature activation+3.17e-02

Pos logit contributions
keeping+0.024
rones+0.021
growth+0.021
group+0.021
birds+0.021
Neg logit contributions
lessly-0.012
 much-0.009
 nice-0.008
 hard-0.008
 good-0.008
Token their
Token ID511
Feature activation+4.16e-02

Pos logit contributions
keeping+0.082
rones+0.071
girls+0.070
seek+0.070
date+0.067
Neg logit contributions
 much-0.038
lessly-0.037
 nice-0.036
 shiny-0.035
 good-0.034
Token neighbor
Token ID4780
Feature activation+1.10e-01

Pos logit contributions
keeping+0.092
rones+0.080
group+0.078
birds+0.077
growth+0.076
Neg logit contributions
lessly-0.060
 nice-0.058
 shiny-0.057
 precious-0.055
 much-0.053
Token's
Token ID338
Feature activation+4.94e-02

Pos logit contributions
keeping+0.299
rones+0.260
growth+0.256
seek+0.245
IT+0.244
Neg logit contributions
 very-0.101
 much-0.095
 nice-0.094
lessly-0.092
 for-0.088
Token door
Token ID3420
Feature activation+1.13e-02

Pos logit contributions
keeping+0.106
rones+0.093
girls+0.090
birds+0.090
growth+0.089
Neg logit contributions
lessly-0.072
 nice-0.069
 shiny-0.066
 much-0.065
 precious-0.063
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
keeping+0.025
rones+0.021
birds+0.021
growth+0.020
group+0.020
Neg logit contributions
lessly-0.019
 much-0.016
 nice-0.014
 precious-0.014
 shiny-0.014
Token The
Token ID383
Feature activation-5.31e-03

Pos logit contributions
keeping+0.023
rones+0.018
girls+0.017
ASH+0.016
seek+0.016
Neg logit contributions
lessly-0.028
 much-0.025
 proud-0.023
 nice-0.023
 heavy-0.022
Token neighbor
Token ID4780
Feature activation+3.41e-02

Pos logit contributions
lessly+0.011
 much+0.010
 nice+0.009
 precious+0.009
 shiny+0.009
Neg logit contributions
keeping-0.010
rones-0.008
growth-0.008
birds-0.008
group-0.007
Token She
Token ID1375
Feature activation-1.97e-02

Pos logit contributions
keeping+0.048
girls+0.040
growth+0.040
rones+0.038
birds+0.038
Neg logit contributions
lessly-0.047
 much-0.041
 nice-0.040
 hard-0.037
 good-0.037
Token decided
Token ID3066
Feature activation+1.38e-02

Pos logit contributions
lessly+0.040
 much+0.031
 precious+0.029
ww+0.029
 nice+0.028
Neg logit contributions
keeping-0.037
rones-0.031
birds-0.030
ASH-0.030
IT-0.029
Token to
Token ID284
Feature activation+5.92e-02

Pos logit contributions
keeping+0.027
rones+0.024
birds+0.023
growth+0.023
group+0.022
Neg logit contributions
lessly-0.027
 much-0.023
 nice-0.021
 precious-0.021
 proud-0.021
Token ask
Token ID1265
Feature activation+5.51e-02

Pos logit contributions
keeping+0.128
rones+0.113
birds+0.110
growth+0.104
ASH+0.104
Neg logit contributions
lessly-0.096
 much-0.083
 nice-0.082
 good-0.078
 shiny-0.075
Token her
Token ID607
Feature activation+4.62e-02

Pos logit contributions
keeping+0.121
rones+0.098
girls+0.097
birds+0.097
ASH+0.097
Neg logit contributions
lessly-0.101
 much-0.095
 nice-0.095
 good-0.090
 hard-0.087
Token neighbor
Token ID4780
Feature activation+1.08e-01

Pos logit contributions
keeping+0.109
rones+0.092
growth+0.091
ASH+0.090
birds+0.090
Neg logit contributions
lessly-0.074
 much-0.063
 nice-0.062
 precious-0.059
 good-0.059
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
keeping+0.356
growth+0.323
rones+0.299
IT+0.299
ASH+0.299
Neg logit contributions
 nice-0.076
 for-0.075
 much-0.067
 good-0.066
lessly-0.062
Token Mr
Token ID1770
Feature activation+4.71e-02

Pos logit contributions
keeping+0.040
rones+0.034
growth+0.033
ASH+0.032
birds+0.031
Neg logit contributions
lessly-0.033
 nice-0.031
 much-0.031
 proud-0.029
 good-0.029
Token.
Token ID13
Feature activation+3.11e-02

Pos logit contributions
keeping+0.119
growth+0.112
IT+0.110
rones+0.107
group+0.107
Neg logit contributions
lessly-0.055
 much-0.047
 nice-0.045
 good-0.042
 precious-0.041
Token Johnson
Token ID5030
Feature activation+4.33e-02

Pos logit contributions
keeping+0.044
birds+0.039
 occasions+0.038
growth+0.037
rones+0.035
Neg logit contributions
 nice-0.061
 good-0.060
 much-0.059
 fine-0.058
lessly-0.057
Token,
Token ID11
Feature activation-6.03e-03

Pos logit contributions
keeping+0.119
ASH+0.106
growth+0.106
rones+0.104
IT+0.103
Neg logit contributions
lessly-0.053
 much-0.045
 nice-0.042
 good-0.039
 proud-0.038
Token mom
Token ID1995
Feature activation+4.02e-02

Pos logit contributions
keeping+0.053
rones+0.044
birds+0.044
group+0.043
growth+0.043
Neg logit contributions
lessly-0.050
 much-0.039
ww-0.036
 precious-0.036
 nice-0.035
Token went
Token ID1816
Feature activation+4.09e-02

Pos logit contributions
keeping+0.092
rones+0.077
birds+0.077
growth+0.075
group+0.075
Neg logit contributions
lessly-0.065
 much-0.053
 nice-0.050
 precious-0.048
 shiny-0.047
Token to
Token ID284
Feature activation+6.73e-02

Pos logit contributions
keeping+0.083
rones+0.068
birds+0.067
growth+0.067
ASH+0.066
Neg logit contributions
lessly-0.078
 much-0.065
 nice-0.061
 precious-0.059
 shiny-0.059
Token visit
Token ID3187
Feature activation+6.26e-02

Pos logit contributions
keeping+0.180
birds+0.149
rones+0.148
ASH+0.147
girls+0.146
Neg logit contributions
 nice-0.079
 much-0.078
 shiny-0.074
 proud-0.071
 good-0.070
Token their
Token ID511
Feature activation+4.63e-02

Pos logit contributions
keeping+0.155
birds+0.129
rones+0.127
ASH+0.125
growth+0.125
Neg logit contributions
lessly-0.087
 much-0.083
 nice-0.081
 good-0.074
 shiny-0.072
Token neighbor
Token ID4780
Feature activation+1.08e-01

Pos logit contributions
keeping+0.120
growth+0.102
birds+0.100
rones+0.096
stick+0.095
Neg logit contributions
 nice-0.060
 precious-0.054
 shiny-0.054
 beautiful-0.052
 much-0.052
Token,
Token ID11
Feature activation+2.39e-02

Pos logit contributions
keeping+0.337
growth+0.312
IT+0.291
ASH+0.284
rones+0.276
Neg logit contributions
 for-0.068
 much-0.061
 nice-0.060
 very-0.059
 at-0.053
Token Mrs
Token ID9074
Feature activation+5.77e-02

Pos logit contributions
keeping+0.043
rones+0.035
birds+0.033
growth+0.033
ASH+0.033
Neg logit contributions
lessly-0.051
 much-0.045
 nice-0.043
 shiny-0.042
 precious-0.042
Token.
Token ID13
Feature activation+3.09e-02

Pos logit contributions
keeping+0.145
IT+0.136
ASH+0.134
date+0.133
growth+0.133
Neg logit contributions
lessly-0.067
 much-0.056
 nice-0.051
 good-0.047
 precious-0.045
Token Lee
Token ID5741
Feature activation+3.80e-02

Pos logit contributions
keeping+0.047
 occasions+0.039
birds+0.037
nob+0.036
stick+0.035
Neg logit contributions
 much-0.064
 good-0.062
 nice-0.062
 fine-0.059
 sweet-0.059
Token.
Token ID13
Feature activation+2.78e-02

Pos logit contributions
keeping+0.106
growth+0.093
ASH+0.092
IT+0.091
rones+0.090
Neg logit contributions
lessly-0.040
 much-0.038
 nice-0.034
 proud-0.031
 shiny-0.030
Token decided
Token ID3066
Feature activation+1.59e-02

Pos logit contributions
lessly+0.024
 much+0.019
 nice+0.017
 precious+0.017
 proud+0.017
Neg logit contributions
keeping-0.034
rones-0.029
birds-0.029
group-0.028
growth-0.028
Token to
Token ID284
Feature activation+5.43e-02

Pos logit contributions
keeping+0.033
rones+0.029
birds+0.028
growth+0.028
group+0.027
Neg logit contributions
lessly-0.030
 much-0.026
 nice-0.024
 precious-0.023
 shiny-0.023
Token ask
Token ID1265
Feature activation+5.60e-02

Pos logit contributions
keeping+0.115
rones+0.100
birds+0.097
group+0.094
ASH+0.092
Neg logit contributions
lessly-0.095
 much-0.081
 nice-0.078
 good-0.075
 shiny-0.074
Token her
Token ID607
Feature activation+4.72e-02

Pos logit contributions
keeping+0.131
rones+0.113
birds+0.111
growth+0.110
ASH+0.108
Neg logit contributions
lessly-0.094
 much-0.085
 nice-0.083
 good-0.079
 hard-0.076
Token wealthy
Token ID11574
Feature activation+4.09e-02

Pos logit contributions
keeping+0.102
rones+0.084
growth+0.083
ASH+0.082
birds+0.082
Neg logit contributions
lessly-0.087
 much-0.075
 nice-0.072
 precious-0.070
 good-0.069
Token neighbor
Token ID4780
Feature activation+1.07e-01

Pos logit contributions
keeping+0.094
growth+0.084
rones+0.084
ASH+0.082
IT+0.082
Neg logit contributions
lessly-0.060
 nice-0.054
 precious-0.052
 good-0.051
 much-0.051
Token for
Token ID329
Feature activation+4.73e-02

Pos logit contributions
keeping+0.354
growth+0.327
ASH+0.310
rones+0.310
IT+0.308
Neg logit contributions
 nice-0.061
 for-0.060
lessly-0.058
 much-0.057
 good-0.053
Token help
Token ID1037
Feature activation+5.12e-02

Pos logit contributions
keeping+0.090
birds+0.077
rones+0.077
group+0.073
ASH+0.073
Neg logit contributions
lessly-0.093
 much-0.085
 nice-0.082
 good-0.078
 shiny-0.078
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
rones+0.136
keeping+0.134
growth+0.132
ASH+0.129
birds+0.127
Neg logit contributions
lessly-0.052
 much-0.049
 nice-0.046
 very-0.043
 good-0.040
Token The
Token ID383
Feature activation-6.61e-03

Pos logit contributions
keeping+0.057
growth+0.049
birds+0.048
girls+0.048
rones+0.046
Neg logit contributions
lessly-0.048
 much-0.046
 proud-0.041
 nice-0.040
 tight-0.038
Token neighbor
Token ID4780
Feature activation+2.54e-02

Pos logit contributions
lessly+0.012
 much+0.011
 nice+0.011
 precious+0.010
 good+0.010
Neg logit contributions
keeping-0.014
rones-0.012
birds-0.011
growth-0.011
girls-0.011
Token basket
Token ID7988
Feature activation+1.72e-02

Pos logit contributions
keeping+0.024
birds+0.020
rones+0.019
group+0.019
growth+0.019
Neg logit contributions
lessly-0.029
 much-0.023
 precious-0.022
 proud-0.021
ww-0.021
Token of
Token ID286
Feature activation+1.48e-02

Pos logit contributions
keeping+0.042
rones+0.035
birds+0.035
group+0.035
growth+0.034
Neg logit contributions
lessly-0.026
 much-0.019
 precious-0.018
 nice-0.018
ww-0.018
Token cookies
Token ID14746
Feature activation+1.74e-02

Pos logit contributions
keeping+0.026
rones+0.021
birds+0.021
group+0.021
growth+0.020
Neg logit contributions
lessly-0.032
 much-0.025
ww-0.024
 precious-0.023
 proud-0.023
Token to
Token ID284
Feature activation+7.64e-02

Pos logit contributions
keeping+0.041
rones+0.035
birds+0.035
growth+0.035
ASH+0.034
Neg logit contributions
lessly-0.027
 much-0.021
 nice-0.020
 precious-0.019
 proud-0.019
Token their
Token ID511
Feature activation+4.34e-02

Pos logit contributions
keeping+0.146
rones+0.117
birds+0.116
ASH+0.114
growth+0.111
Neg logit contributions
lessly-0.153
 much-0.138
 nice-0.133
 proud-0.126
 shiny-0.126
Token neighbor
Token ID4780
Feature activation+1.07e-01

Pos logit contributions
keeping+0.087
growth+0.077
birds+0.073
rones+0.070
nob+0.069
Neg logit contributions
 nice-0.073
 good-0.068
 precious-0.067
 much-0.066
 very-0.066
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
keeping+0.298
growth+0.270
rones+0.251
IT+0.246
ASH+0.244
Neg logit contributions
 very-0.093
 much-0.088
 nice-0.087
 for-0.084
 really-0.071
Token Mrs
Token ID9074
Feature activation+5.45e-02

Pos logit contributions
keeping+0.058
rones+0.048
ASH+0.047
growth+0.046
birds+0.046
Neg logit contributions
lessly-0.050
 much-0.045
 nice-0.043
 precious-0.041
 shiny-0.040
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
keeping+0.144
growth+0.135
IT+0.134
ASH+0.133
 arrest+0.131
Neg logit contributions
lessly-0.056
 much-0.045
 nice-0.044
 good-0.040
 very-0.040
Token Lee
Token ID5741
Feature activation+3.94e-02

Pos logit contributions
keeping+0.046
 occasions+0.038
birds+0.037
growth+0.035
oos+0.035
Neg logit contributions
 much-0.049
 nice-0.047
lessly-0.046
 sweet-0.045
 good-0.045
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
keeping+0.100
growth+0.085
ASH+0.085
rones+0.084
IT+0.082
Neg logit contributions
lessly-0.058
 much-0.048
 nice-0.044
 proud-0.041
 good-0.041
Token to
Token ID284
Feature activation+4.18e-02

Pos logit contributions
keeping+0.011
rones+0.009
group+0.009
birds+0.009
 preschool+0.009
Neg logit contributions
lessly-0.007
ww-0.005
 much-0.005
 precious-0.005
 nice-0.005
Token introduce
Token ID10400
Feature activation+3.80e-02

Pos logit contributions
keeping+0.080
rones+0.064
group+0.064
birds+0.063
growth+0.062
Neg logit contributions
lessly-0.084
 much-0.068
 precious-0.065
ww-0.064
 nice-0.063
Token themselves
Token ID2405
Feature activation+3.01e-02

Pos logit contributions
keeping+0.077
rones+0.068
birds+0.066
growth+0.066
ASH+0.065
Neg logit contributions
lessly-0.069
 much-0.058
 nice-0.054
 good-0.051
 precious-0.051
Token to
Token ID284
Feature activation+6.07e-02

Pos logit contributions
keeping+0.068
rones+0.057
birds+0.056
growth+0.056
ASH+0.055
Neg logit contributions
lessly-0.050
 much-0.040
 nice-0.037
 precious-0.037
 shiny-0.036
Token their
Token ID511
Feature activation+3.23e-02

Pos logit contributions
keeping+0.134
rones+0.114
birds+0.112
ASH+0.111
growth+0.109
Neg logit contributions
lessly-0.103
 much-0.089
 nice-0.083
 good-0.079
 shiny-0.078
Token neighbor
Token ID4780
Feature activation+1.06e-01

Pos logit contributions
keeping+0.067
rones+0.057
growth+0.055
birds+0.055
ASH+0.054
Neg logit contributions
lessly-0.057
 much-0.050
 nice-0.050
 precious-0.049
 good-0.048
Token.
Token ID13
Feature activation+2.67e-02

Pos logit contributions
keeping+0.271
growth+0.242
rones+0.234
ASH+0.230
IT+0.228
Neg logit contributions
lessly-0.133
 much-0.117
 nice-0.111
 very-0.103
 good-0.102
Token
Token ID220
Feature activation-8.29e-03

Pos logit contributions
keeping+0.053
rones+0.043
girls+0.043
growth+0.042
birds+0.041
Neg logit contributions
lessly-0.051
 much-0.047
 nice-0.042
 proud-0.041
 good-0.039
Token
Token ID198
Feature activation-1.36e-02

Pos logit contributions
lessly+0.017
 much+0.014
 nice+0.013
 proud+0.012
 good+0.012
Neg logit contributions
keeping-0.016
growth-0.013
rones-0.013
ASH-0.013
birds-0.013
Token
Token ID198
Feature activation-5.04e-03

Pos logit contributions
lessly+0.028
 much+0.023
 nice+0.022
 proud+0.021
ww+0.021
Neg logit contributions
keeping-0.026
rones-0.021
birds-0.021
growth-0.020
ASH-0.020
TokenJohn
Token ID7554
Feature activation+8.05e-03

Pos logit contributions
lessly+0.010
 precious+0.008
 much+0.008
 shiny+0.007
ww+0.007
Neg logit contributions
keeping-0.010
group-0.008
rones-0.008
birds-0.008
date-0.008
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
keeping+0.042
girls+0.034
birds+0.034
rones+0.033
stick+0.033
Neg logit contributions
lessly-0.035
 much-0.032
 nice-0.027
 proud-0.026
 hard-0.025
Token
Token ID198
Feature activation-9.65e-03

Pos logit contributions
lessly+0.037
 much+0.033
 nice+0.030
 proud+0.028
 good+0.028
Neg logit contributions
keeping-0.037
rones-0.029
growth-0.029
birds-0.028
girls-0.028
TokenHe
Token ID1544
Feature activation-1.30e-02

Pos logit contributions
lessly+0.017
 precious+0.013
ww+0.013
 shiny+0.013
 much+0.013
Neg logit contributions
keeping-0.019
group-0.017
rones-0.017
birds-0.017
date-0.016
Token asked
Token ID1965
Feature activation+5.68e-02

Pos logit contributions
lessly+0.021
 much+0.017
 nice+0.016
 precious+0.015
 shiny+0.015
Neg logit contributions
keeping-0.030
rones-0.025
birds-0.025
growth-0.024
group-0.024
Token his
Token ID465
Feature activation+4.45e-02

Pos logit contributions
keeping+0.138
rones+0.116
birds+0.115
girls+0.114
ASH+0.114
Neg logit contributions
lessly-0.089
 much-0.077
 nice-0.073
 good-0.068
 shiny-0.066
Token neighbor
Token ID4780
Feature activation+1.06e-01

Pos logit contributions
keeping+0.102
rones+0.084
IT+0.082
birds+0.081
growth+0.081
Neg logit contributions
lessly-0.071
 nice-0.064
 much-0.064
 precious-0.062
 shiny-0.062
Token for
Token ID329
Feature activation+4.68e-02

Pos logit contributions
keeping+0.310
growth+0.266
rones+0.261
ASH+0.259
IT+0.257
Neg logit contributions
lessly-0.111
 much-0.105
 nice-0.098
 very-0.089
 good-0.088
Token help
Token ID1037
Feature activation+5.57e-02

Pos logit contributions
keeping+0.105
rones+0.092
birds+0.090
IT+0.089
ASH+0.088
Neg logit contributions
lessly-0.074
 much-0.071
 nice-0.068
 shiny-0.063
 good-0.063
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
keeping+0.137
rones+0.132
growth+0.129
ASH+0.124
birds+0.124
Neg logit contributions
lessly-0.073
 much-0.061
 nice-0.058
 very-0.054
 hard-0.052
Token His
Token ID2399
Feature activation+3.17e-02

Pos logit contributions
keeping+0.052
rones+0.044
girls+0.043
growth+0.041
birds+0.039
Neg logit contributions
lessly-0.042
 much-0.037
 nice-0.035
 proud-0.033
 very-0.032
Token neighbor
Token ID4780
Feature activation+4.57e-02

Pos logit contributions
keeping+0.067
rones+0.056
growth+0.054
birds+0.052
ASH+0.052
Neg logit contributions
lessly-0.058
 nice-0.054
 much-0.052
 good-0.050
 proud-0.049
Token They
Token ID1119
Feature activation-2.10e-02

Pos logit contributions
keeping+0.062
growth+0.052
girls+0.050
rones+0.048
 arrest+0.046
Neg logit contributions
lessly-0.043
 much-0.043
 nice-0.041
 very-0.039
 sweet-0.038
Token decided
Token ID3066
Feature activation+1.42e-02

Pos logit contributions
lessly+0.036
 much+0.029
 nice+0.027
 precious+0.026
 proud+0.026
Neg logit contributions
keeping-0.047
rones-0.040
birds-0.039
group-0.038
growth-0.038
Token to
Token ID284
Feature activation+5.74e-02

Pos logit contributions
keeping+0.030
rones+0.027
birds+0.026
growth+0.026
group+0.025
Neg logit contributions
lessly-0.025
 much-0.022
 nice-0.021
 shiny-0.020
 proud-0.020
Token ask
Token ID1265
Feature activation+5.41e-02

Pos logit contributions
keeping+0.124
rones+0.109
birds+0.103
ASH+0.101
group+0.100
Neg logit contributions
lessly-0.094
 much-0.080
 nice-0.079
 good-0.074
 shiny-0.074
Token their
Token ID511
Feature activation+3.34e-02

Pos logit contributions
keeping+0.126
rones+0.111
birds+0.106
growth+0.105
ASH+0.104
Neg logit contributions
lessly-0.089
 much-0.081
 nice-0.079
 good-0.075
 hard-0.073
Token neighbor
Token ID4780
Feature activation+1.06e-01

Pos logit contributions
keeping+0.082
rones+0.071
growth+0.070
birds+0.067
group+0.066
Neg logit contributions
 nice-0.047
 good-0.045
 precious-0.045
lessly-0.045
 much-0.042
Token who
Token ID508
Feature activation+2.70e-02

Pos logit contributions
keeping+0.345
growth+0.310
rones+0.292
ASH+0.292
IT+0.288
Neg logit contributions
 nice-0.076
 much-0.072
 for-0.071
lessly-0.069
 good-0.065
Token knew
Token ID2993
Feature activation-3.69e-03

Pos logit contributions
keeping+0.070
rones+0.061
ASH+0.060
growth+0.058
group+0.057
Neg logit contributions
lessly-0.036
 much-0.031
 nice-0.028
 shiny-0.028
 precious-0.028
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
lessly+0.006
 much+0.004
ww+0.004
 nice+0.004
 precious+0.004
Neg logit contributions
keeping-0.009
birds-0.007
rones-0.007
group-0.007
growth-0.007
Token lot
Token ID1256
Feature activation-1.52e-02

Pos logit contributions
keeping+0.021
 exhibit+0.018
ASH+0.018
birds+0.017
IT+0.017
Neg logit contributions
lessly-0.019
ww-0.013
bling-0.012
 much-0.012
fy-0.012
Token about
Token ID546
Feature activation+3.87e-02

Pos logit contributions
lessly+0.019
 much+0.014
ww+0.013
 nice+0.012
 precious+0.012
Neg logit contributions
keeping-0.040
birds-0.035
rones-0.034
group-0.033
growth-0.033
Token were
Token ID547
Feature activation-1.86e-02

Pos logit contributions
keeping+0.018
birds+0.015
rones+0.014
group+0.014
girls+0.014
Neg logit contributions
lessly-0.015
ww-0.012
 much-0.012
 precious-0.012
 nice-0.011
Token going
Token ID1016
Feature activation+4.18e-02

Pos logit contributions
lessly+0.034
ww+0.026
 much+0.026
 precious+0.025
 valuable+0.024
Neg logit contributions
keeping-0.038
rones-0.032
birds-0.031
group-0.031
IT-0.030
Token to
Token ID284
Feature activation+6.51e-02

Pos logit contributions
keeping+0.084
rones+0.069
birds+0.068
growth+0.067
ASH+0.066
Neg logit contributions
lessly-0.080
 much-0.068
 nice-0.064
 shiny-0.062
 precious-0.062
Token visit
Token ID3187
Feature activation+4.89e-02

Pos logit contributions
keeping+0.147
rones+0.128
birds+0.126
ASH+0.120
group+0.119
Neg logit contributions
 much-0.100
 nice-0.096
 good-0.094
lessly-0.093
 shiny-0.087
Token their
Token ID511
Feature activation+2.42e-02

Pos logit contributions
keeping+0.114
birds+0.097
rones+0.097
growth+0.096
ASH+0.095
Neg logit contributions
 much-0.074
lessly-0.070
 nice-0.069
 good-0.064
 shiny-0.060
Token neighbor
Token ID4780
Feature activation+1.06e-01

Pos logit contributions
keeping+0.053
growth+0.044
rones+0.043
birds+0.042
ASH+0.041
Neg logit contributions
 nice-0.039
lessly-0.039
 much-0.037
 precious-0.037
 good-0.036
Token who
Token ID508
Feature activation+1.71e-02

Pos logit contributions
keeping+0.293
growth+0.265
IT+0.241
rones+0.239
ASH+0.238
Neg logit contributions
 much-0.108
 nice-0.104
 very-0.101
 for-0.099
 fine-0.092
Token was
Token ID373
Feature activation-1.58e-03

Pos logit contributions
keeping+0.043
rones+0.037
ASH+0.037
growth+0.037
birds+0.036
Neg logit contributions
lessly-0.024
 much-0.020
 nice-0.018
 proud-0.017
 precious-0.017
Token sick
Token ID6639
Feature activation-5.35e-04

Pos logit contributions
lessly+0.002
 much+0.002
 nice+0.002
 precious+0.002
 proud+0.002
Neg logit contributions
keeping-0.004
rones-0.003
birds-0.003
growth-0.003
group-0.003
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
lessly+0.001
 much+0.001
 nice+0.001
 good+0.001
 precious+0.001
Neg logit contributions
keeping-0.001
rones-0.001
growth-0.001
IT-0.001
birds-0.001
Token Tim
Token ID5045
Feature activation+3.11e-02

Pos logit contributions
keeping+0.038
growth+0.032
rones+0.031
birds+0.030
girls+0.029
Neg logit contributions
lessly-0.038
 much-0.035
 nice-0.032
 proud-0.031
 shiny-0.030
TokenJack
Token ID14295
Feature activation-1.38e-04

Pos logit contributions
lessly+0.011
 much+0.008
 precious+0.008
 nice+0.008
ww+0.008
Neg logit contributions
keeping-0.014
rones-0.012
birds-0.011
group-0.011
growth-0.011
Token went
Token ID1816
Feature activation+1.62e-02

Pos logit contributions
lessly+0.000
 much+0.000
 nice+0.000
 precious+0.000
 shiny+0.000
Neg logit contributions
keeping-0.000
rones-0.000
birds-0.000
group-0.000
growth-0.000
Token to
Token ID284
Feature activation+5.61e-02

Pos logit contributions
keeping+0.031
rones+0.026
birds+0.025
growth+0.025
ASH+0.025
Neg logit contributions
lessly-0.032
 much-0.027
 nice-0.026
 precious-0.025
 shiny-0.025
Token ask
Token ID1265
Feature activation+5.48e-02

Pos logit contributions
keeping+0.141
birds+0.128
rones+0.123
girls+0.122
ASH+0.117
Neg logit contributions
 nice-0.066
lessly-0.066
 much-0.064
 shiny-0.061
 good-0.060
Token his
Token ID465
Feature activation+4.29e-02

Pos logit contributions
keeping+0.133
rones+0.113
birds+0.113
ASH+0.111
growth+0.109
Neg logit contributions
lessly-0.086
 much-0.071
 nice-0.068
 good-0.064
 shiny-0.062
Token neighbour
Token ID12250
Feature activation+1.05e-01

Pos logit contributions
keeping+0.101
rones+0.083
growth+0.082
ASH+0.080
birds+0.080
Neg logit contributions
lessly-0.068
 nice-0.061
 much-0.060
 precious-0.059
 good-0.058
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
keeping+0.319
growth+0.277
rones+0.270
ASH+0.268
IT+0.263
Neg logit contributions
lessly-0.093
 nice-0.089
 much-0.086
 for-0.082
 very-0.079
Token Mrs
Token ID9074
Feature activation+5.31e-02

Pos logit contributions
keeping+0.037
rones+0.032
growth+0.031
ASH+0.030
birds+0.030
Neg logit contributions
lessly-0.026
 much-0.023
 nice-0.023
 proud-0.022
 good-0.021
Token Clar
Token ID15420
Feature activation+1.41e-02

Pos logit contributions
keeping+0.130
ASH+0.122
IT+0.122
growth+0.119
date+0.119
Neg logit contributions
lessly-0.068
 much-0.055
 nice-0.052
 precious-0.049
 good-0.048
Tokenise
Token ID786
Feature activation+2.37e-02

Pos logit contributions
keeping+0.023
rones+0.018
growth+0.017
ASH+0.017
birds+0.017
Neg logit contributions
lessly-0.032
 much-0.027
 precious-0.027
 nice-0.026
 proud-0.026
Token,
Token ID11
Feature activation+6.08e-04

Pos logit contributions
keeping+0.057
rones+0.049
ASH+0.048
growth+0.047
IT+0.046
Neg logit contributions
lessly-0.037
 much-0.029
 nice-0.028
 shiny-0.026
 precious-0.026
Token,
Token ID11
Feature activation-1.38e-02

Pos logit contributions
lessly+0.038
ww+0.032
 precious+0.031
arest+0.030
 proud+0.029
Neg logit contributions
keeping-0.041
birds-0.034
rones-0.032
group-0.032
 Lee-0.032
Token so
Token ID523
Feature activation-3.60e-02

Pos logit contributions
lessly+0.021
 much+0.016
 precious+0.016
ww+0.015
 nice+0.015
Neg logit contributions
keeping-0.032
birds-0.027
rones-0.027
group-0.026
ASH-0.026
Token he
Token ID339
Feature activation-1.45e-02

Pos logit contributions
lessly+0.054
 precious+0.037
 much+0.037
ww+0.037
 nice+0.035
Neg logit contributions
keeping-0.086
birds-0.073
rones-0.073
growth-0.072
group-0.071
Token asked
Token ID1965
Feature activation+5.88e-02

Pos logit contributions
lessly+0.029
 much+0.023
arest+0.022
 precious+0.022
ww+0.021
Neg logit contributions
keeping-0.026
IT-0.021
girls-0.021
rones-0.021
ASH-0.021
Token his
Token ID465
Feature activation+4.16e-02

Pos logit contributions
keeping+0.154
rones+0.130
ASH+0.127
birds+0.127
IT+0.127
Neg logit contributions
lessly-0.083
 much-0.075
 nice-0.071
 good-0.068
 hard-0.065
Token neighbor
Token ID4780
Feature activation+1.05e-01

Pos logit contributions
keeping+0.098
rones+0.082
growth+0.080
group+0.080
birds+0.079
Neg logit contributions
lessly-0.066
 nice-0.056
 much-0.056
 precious-0.054
 good-0.054
Token,
Token ID11
Feature activation+8.80e-03

Pos logit contributions
keeping+0.331
growth+0.290
rones+0.278
IT+0.277
ASH+0.277
Neg logit contributions
lessly-0.085
 nice-0.083
 much-0.083
 for-0.077
 very-0.072
Token "
Token ID366
Feature activation+3.13e-02

Pos logit contributions
keeping+0.022
rones+0.019
growth+0.018
ASH+0.018
birds+0.018
Neg logit contributions
lessly-0.013
 much-0.011
 nice-0.011
 proud-0.010
 good-0.010
TokenWould
Token ID17353
Feature activation+2.85e-02

Pos logit contributions
keeping+0.061
rones+0.051
girls+0.048
birds+0.048
growth+0.047
Neg logit contributions
lessly-0.060
 nice-0.048
 much-0.048
 precious-0.046
 shiny-0.046
Token you
Token ID345
Feature activation+4.69e-02

Pos logit contributions
keeping+0.071
rones+0.061
ASH+0.058
birds+0.058
growth+0.058
Neg logit contributions
lessly-0.041
 much-0.036
 nice-0.033
ww-0.032
 proud-0.031
Token lend
Token ID22096
Feature activation+5.24e-02

Pos logit contributions
keeping+0.105
birds+0.091
rones+0.088
group+0.087
growth+0.087
Neg logit contributions
lessly-0.075
ww-0.052
 much-0.051
 precious-0.050
 proud-0.048
Token,
Token ID11
Feature activation-2.64e-02

Pos logit contributions
lessly+0.030
 much+0.023
 precious+0.023
 valuable+0.022
 nice+0.022
Neg logit contributions
keeping-0.029
birds-0.024
rones-0.024
group-0.024
 Lee-0.023
Token the
Token ID262
Feature activation-8.42e-03

Pos logit contributions
lessly+0.046
 much+0.040
 nice+0.038
 shiny+0.037
 good+0.037
Neg logit contributions
keeping-0.058
rones-0.048
birds-0.046
growth-0.046
ASH-0.046
Token mom
Token ID1995
Feature activation+1.80e-02

Pos logit contributions
lessly+0.017
 much+0.014
 precious+0.013
 nice+0.013
ww+0.013
Neg logit contributions
keeping-0.016
rones-0.013
birds-0.013
group-0.013
ASH-0.013
Token asked
Token ID1965
Feature activation+5.96e-02

Pos logit contributions
keeping+0.040
growth+0.033
rones+0.033
birds+0.032
group+0.032
Neg logit contributions
lessly-0.031
 much-0.025
 nice-0.024
 precious-0.023
 proud-0.023
Token her
Token ID607
Feature activation+5.26e-02

Pos logit contributions
keeping+0.151
growth+0.129
girls+0.126
rones+0.124
seek+0.122
Neg logit contributions
 nice-0.087
 much-0.086
 good-0.083
lessly-0.078
 sweet-0.075
Token daughter
Token ID4957
Feature activation+1.04e-01

Pos logit contributions
keeping+0.125
rones+0.106
ASH+0.104
growth+0.104
IT+0.102
Neg logit contributions
lessly-0.085
 much-0.074
 nice-0.071
 good-0.068
 precious-0.067
Token,
Token ID11
Feature activation+8.50e-03

Pos logit contributions
keeping+0.306
growth+0.264
ASH+0.261
rones+0.259
IT+0.256
Neg logit contributions
 much-0.122
 nice-0.112
lessly-0.106
 good-0.103
 very-0.099
Token "
Token ID366
Feature activation+2.98e-02

Pos logit contributions
keeping+0.022
rones+0.019
growth+0.019
ASH+0.018
birds+0.018
Neg logit contributions
lessly-0.012
 much-0.010
 nice-0.010
 proud-0.010
 good-0.009
TokenWould
Token ID17353
Feature activation+3.53e-02

Pos logit contributions
keeping+0.063
rones+0.052
girls+0.051
growth+0.051
anks+0.050
Neg logit contributions
lessly-0.050
 nice-0.042
 much-0.042
a-0.039
ww-0.039
Token you
Token ID345
Feature activation+5.30e-02

Pos logit contributions
keeping+0.101
rones+0.085
growth+0.083
ASH+0.083
stick+0.080
Neg logit contributions
 much-0.039
lessly-0.038
ww-0.037
 nice-0.035
 good-0.032
Token like
Token ID588
Feature activation+5.33e-02

Pos logit contributions
keeping+0.133
birds+0.117
rones+0.115
group+0.114
growth+0.112
Neg logit contributions
lessly-0.072
 much-0.047
 precious-0.045
ww-0.045
 shiny-0.043
Token shop
Token ID6128
Feature activation+3.00e-02

Pos logit contributions
keeping+0.039
rones+0.031
birds+0.030
girls+0.029
growth+0.029
Neg logit contributions
lessly-0.045
 much-0.043
 nice-0.043
 shiny-0.041
 good-0.040
Token.
Token ID13
Feature activation+6.71e-03

Pos logit contributions
keeping+0.063
rones+0.052
birds+0.051
group+0.050
growth+0.050
Neg logit contributions
lessly-0.055
 much-0.045
 nice-0.042
 precious-0.042
ww-0.040
Token She
Token ID1375
Feature activation-1.32e-02

Pos logit contributions
keeping+0.012
girls+0.010
rones+0.009
seek+0.009
birds+0.009
Neg logit contributions
lessly-0.014
 much-0.013
 nice-0.012
 shiny-0.012
 proud-0.012
Token asked
Token ID1965
Feature activation+5.57e-02

Pos logit contributions
lessly+0.030
arest+0.022
 much+0.022
 steep+0.022
ier+0.022
Neg logit contributions
keeping-0.021
 Her-0.019
rones-0.018
birds-0.018
bird-0.017
Token her
Token ID607
Feature activation+4.70e-02

Pos logit contributions
keeping+0.119
rones+0.094
girls+0.094
growth+0.093
birds+0.092
Neg logit contributions
lessly-0.105
 much-0.096
 nice-0.092
 good-0.088
 shiny-0.084
Token son
Token ID3367
Feature activation+1.03e-01

Pos logit contributions
keeping+0.122
rones+0.105
birds+0.103
growth+0.102
group+0.102
Neg logit contributions
lessly-0.064
 much-0.050
 nice-0.046
 precious-0.045
 shiny-0.044
Token if
Token ID611
Feature activation+1.61e-02

Pos logit contributions
keeping+0.314
growth+0.276
IT+0.268
ASH+0.266
rones+0.266
Neg logit contributions
 much-0.102
lessly-0.094
 nice-0.091
 good-0.082
 very-0.079
Token he
Token ID339
Feature activation+3.41e-02

Pos logit contributions
keeping+0.049
rones+0.044
growth+0.043
ASH+0.042
IT+0.042
Neg logit contributions
lessly-0.014
 much-0.011
 nice-0.010
 good-0.009
 shiny-0.009
Token wanted
Token ID2227
Feature activation+4.44e-02

Pos logit contributions
keeping+0.087
rones+0.075
birds+0.073
group+0.073
ASH+0.072
Neg logit contributions
lessly-0.047
 much-0.036
 nice-0.033
 precious-0.032
 shiny-0.031
Token it
Token ID340
Feature activation+3.20e-02

Pos logit contributions
keeping+0.109
rones+0.091
girls+0.089
group+0.089
IT+0.088
Neg logit contributions
 much-0.066
lessly-0.064
 nice-0.059
 good-0.057
 shiny-0.056
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
keeping+0.075
rones+0.063
growth+0.063
birds+0.062
group+0.061
Neg logit contributions
lessly-0.052
 much-0.044
 nice-0.041
 precious-0.039
 good-0.039
Token mom
Token ID1995
Feature activation+7.61e-02

Pos logit contributions
keeping+0.114
ASH+0.095
IT+0.094
growth+0.093
rones+0.092
Neg logit contributions
 nice-0.054
 good-0.052
 much-0.051
 shiny-0.051
 precious-0.050
Token to
Token ID284
Feature activation+7.17e-02

Pos logit contributions
keeping+0.181
growth+0.151
rones+0.146
ASH+0.145
IT+0.143
Neg logit contributions
lessly-0.124
 much-0.112
 nice-0.108
 good-0.099
 shiny-0.098
Token take
Token ID1011
Feature activation+3.49e-02

Pos logit contributions
keeping+0.159
rones+0.136
birds+0.129
ASH+0.128
growth+0.128
Neg logit contributions
lessly-0.118
 nice-0.108
 much-0.105
 shiny-0.098
 good-0.098
Token her
Token ID607
Feature activation+3.62e-02

Pos logit contributions
keeping+0.079
girls+0.071
birds+0.069
smith+0.067
rones+0.067
Neg logit contributions
 much-0.051
 good-0.051
 nice-0.050
 hard-0.045
 very-0.045
Token to
Token ID284
Feature activation+6.61e-02

Pos logit contributions
keeping+0.081
rones+0.071
ASH+0.067
birds+0.067
growth+0.067
Neg logit contributions
lessly-0.056
 much-0.052
 nice-0.051
 shiny-0.050
 precious-0.050
Token Uncle
Token ID23169
Feature activation+1.03e-01

Pos logit contributions
keeping+0.182
birds+0.161
girls+0.159
rones+0.154
ASH+0.152
Neg logit contributions
 nice-0.075
 much-0.073
 shiny-0.068
 good-0.068
 beautiful-0.063
Token David
Token ID3271
Feature activation+9.17e-02

Pos logit contributions
keeping+0.313
growth+0.288
rones+0.267
ASH+0.257
alk+0.256
Neg logit contributions
 for-0.081
 nice-0.080
 very-0.079
 sweet-0.069
 really-0.060
Token's
Token ID338
Feature activation+3.78e-02

Pos logit contributions
keeping+0.250
growth+0.217
ASH+0.213
rones+0.212
IT+0.205
Neg logit contributions
 much-0.097
lessly-0.090
 nice-0.090
 very-0.090
 for-0.088
Token house
Token ID2156
Feature activation+2.97e-02

Pos logit contributions
keeping+0.074
growth+0.064
birds+0.060
girls+0.058
rones+0.058
Neg logit contributions
 nice-0.071
 good-0.064
 very-0.064
 big-0.064
 shiny-0.063
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
keeping+0.074
growth+0.065
rones+0.065
birds+0.063
ASH+0.062
Neg logit contributions
lessly-0.043
 much-0.036
 nice-0.033
 proud-0.030
 hard-0.030
Token Mia
Token ID32189
Feature activation-1.74e-02

Pos logit contributions
keeping+0.057
birds+0.044
girls+0.044
growth+0.042
rones+0.042
Neg logit contributions
lessly-0.048
 much-0.044
 nice-0.041
 proud-0.041
 good-0.038
Token shoes
Token ID10012
Feature activation+5.06e-03

Pos logit contributions
keeping+0.081
rones+0.068
birds+0.067
growth+0.066
ASH+0.064
Neg logit contributions
lessly-0.048
 nice-0.043
 much-0.041
 shiny-0.041
 precious-0.040
Token and
Token ID290
Feature activation-1.53e-02

Pos logit contributions
keeping+0.012
rones+0.010
birds+0.010
group+0.009
ASH+0.009
Neg logit contributions
lessly-0.008
 much-0.006
 precious-0.006
ww-0.006
 nice-0.006
Token went
Token ID1816
Feature activation+2.11e-02

Pos logit contributions
lessly+0.025
 much+0.020
 precious+0.018
 nice+0.018
ww+0.018
Neg logit contributions
keeping-0.035
rones-0.029
birds-0.029
group-0.028
growth-0.028
Token to
Token ID284
Feature activation+5.74e-02

Pos logit contributions
keeping+0.045
rones+0.038
birds+0.037
growth+0.037
ASH+0.037
Neg logit contributions
lessly-0.038
 much-0.031
 nice-0.030
 shiny-0.028
 proud-0.028
Token his
Token ID465
Feature activation+3.53e-02

Pos logit contributions
keeping+0.143
birds+0.130
rones+0.129
ASH+0.125
girls+0.123
Neg logit contributions
 nice-0.068
 much-0.067
 shiny-0.063
 good-0.063
lessly-0.062
Token neighbor
Token ID4780
Feature activation+1.03e-01

Pos logit contributions
keeping+0.080
growth+0.066
birds+0.065
rones+0.064
stick+0.062
Neg logit contributions
 nice-0.056
 good-0.052
 precious-0.051
 shiny-0.051
 much-0.050
Token's
Token ID338
Feature activation+3.92e-02

Pos logit contributions
keeping+0.272
growth+0.256
IT+0.244
rones+0.240
ASH+0.240
Neg logit contributions
 very-0.092
 nice-0.087
 much-0.086
 for-0.082
lessly-0.077
Token home
Token ID1363
Feature activation+7.18e-03

Pos logit contributions
keeping+0.075
growth+0.069
rones+0.066
birds+0.064
icans+0.063
Neg logit contributions
 nice-0.068
 much-0.062
 good-0.061
 very-0.061
 big-0.060
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
keeping+0.016
rones+0.015
growth+0.014
birds+0.014
ASH+0.014
Neg logit contributions
lessly-0.012
 much-0.009
 nice-0.009
 proud-0.008
 shiny-0.008
Token He
Token ID679
Feature activation-2.16e-02

Pos logit contributions
keeping+0.021
birds+0.016
rones+0.016
girls+0.016
ASH+0.016
Neg logit contributions
lessly-0.027
 much-0.024
 nice-0.022
 proud-0.021
 precious-0.021
Token knocked
Token ID13642
Feature activation+3.86e-03

Pos logit contributions
lessly+0.038
 much+0.030
 nice+0.028
 precious+0.028
ww+0.028
Neg logit contributions
keeping-0.046
rones-0.039
birds-0.038
group-0.038
growth-0.037
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
lessly+0.052
 much+0.041
 precious+0.040
 nice+0.038
ww+0.038
Neg logit contributions
keeping-0.052
birds-0.043
rones-0.043
growth-0.042
ASH-0.042
Token Joe
Token ID5689
Feature activation-5.05e-04

Pos logit contributions
lessly+0.036
 much+0.026
 precious+0.025
 nice+0.024
ww+0.023
Neg logit contributions
keeping-0.060
birds-0.052
rones-0.052
growth-0.051
group-0.051
Token went
Token ID1816
Feature activation+1.69e-02

Pos logit contributions
lessly+0.001
 much+0.001
 nice+0.001
 precious+0.001
 shiny+0.001
Neg logit contributions
keeping-0.001
rones-0.001
birds-0.001
group-0.001
growth-0.001
Token to
Token ID284
Feature activation+5.44e-02

Pos logit contributions
keeping+0.030
rones+0.023
birds+0.023
group+0.023
growth+0.022
Neg logit contributions
lessly-0.036
 much-0.030
 precious-0.029
ww-0.029
 nice-0.028
Token his
Token ID465
Feature activation+3.23e-02

Pos logit contributions
keeping+0.138
birds+0.125
rones+0.121
girls+0.119
group+0.115
Neg logit contributions
 nice-0.064
 shiny-0.063
 much-0.061
lessly-0.061
 good-0.058
Token neighbour
Token ID12250
Feature activation+1.03e-01

Pos logit contributions
keeping+0.075
birds+0.062
rones+0.062
growth+0.061
group+0.060
Neg logit contributions
lessly-0.047
 nice-0.044
 much-0.044
 precious-0.043
 shiny-0.043
Token's
Token ID338
Feature activation+4.08e-02

Pos logit contributions
keeping+0.279
growth+0.246
rones+0.244
IT+0.242
ASH+0.240
Neg logit contributions
 very-0.088
 nice-0.084
 for-0.084
 much-0.083
lessly-0.070
Token house
Token ID2156
Feature activation+2.12e-02

Pos logit contributions
keeping+0.085
growth+0.073
girls+0.072
birds+0.071
rones+0.071
Neg logit contributions
 nice-0.068
 big-0.065
 shiny-0.063
 very-0.061
 heavy-0.061
Token and
Token ID290
Feature activation-1.19e-02

Pos logit contributions
keeping+0.049
rones+0.042
birds+0.042
growth+0.042
ASH+0.041
Neg logit contributions
lessly-0.034
 much-0.027
 nice-0.025
 proud-0.024
 shiny-0.024
Token asked
Token ID1965
Feature activation+6.68e-02

Pos logit contributions
lessly+0.019
 much+0.016
 nice+0.016
 proud+0.016
 shiny+0.015
Neg logit contributions
keeping-0.027
rones-0.023
birds-0.023
growth-0.022
ASH-0.022
Token if
Token ID611
Feature activation+2.47e-02

Pos logit contributions
keeping+0.187
growth+0.164
rones+0.163
ASH+0.163
IT+0.162
Neg logit contributions
lessly-0.081
 much-0.070
 nice-0.066
 good-0.063
 hard-0.059
Tokenople
Token ID643
Feature activation-1.91e-02

Pos logit contributions
lessly+0.048
 much+0.042
 nice+0.040
 precious+0.040
 proud+0.040
Neg logit contributions
keeping-0.020
rones-0.014
birds-0.013
growth-0.013
group-0.013
Token decided
Token ID3066
Feature activation+1.05e-02

Pos logit contributions
lessly+0.033
 much+0.027
 nice+0.025
 precious+0.025
ww+0.024
Neg logit contributions
keeping-0.042
rones-0.035
birds-0.034
group-0.034
growth-0.033
Token to
Token ID284
Feature activation+5.38e-02

Pos logit contributions
keeping+0.020
rones+0.016
birds+0.016
group+0.015
growth+0.015
Neg logit contributions
lessly-0.022
 much-0.018
 nice-0.017
 precious-0.017
 shiny-0.017
Token ask
Token ID1265
Feature activation+4.73e-02

Pos logit contributions
keeping+0.113
rones+0.107
birds+0.099
girls+0.096
ASH+0.096
Neg logit contributions
lessly-0.085
 nice-0.077
 much-0.076
 good-0.072
 shiny-0.071
Token their
Token ID511
Feature activation+3.31e-02

Pos logit contributions
keeping+0.123
rones+0.109
ASH+0.104
birds+0.103
growth+0.103
Neg logit contributions
lessly-0.068
 much-0.056
 nice-0.053
 good-0.050
 hard-0.048
Token neighbour
Token ID12250
Feature activation+1.02e-01

Pos logit contributions
keeping+0.071
rones+0.061
growth+0.059
ASH+0.058
IT+0.057
Neg logit contributions
lessly-0.056
 nice-0.051
 precious-0.050
 much-0.050
 good-0.049
Token,
Token ID11
Feature activation+6.13e-03

Pos logit contributions
keeping+0.293
growth+0.260
rones+0.257
ASH+0.252
IT+0.247
Neg logit contributions
lessly-0.108
 much-0.091
 nice-0.089
 good-0.079
 hard-0.079
Token Old
Token ID5706
Feature activation+4.79e-02

Pos logit contributions
keeping+0.014
rones+0.012
birds+0.011
growth+0.011
group+0.011
Neg logit contributions
lessly-0.010
 much-0.008
 nice-0.008
 precious-0.008
 proud-0.007
Token Tom
Token ID4186
Feature activation+4.42e-02

Pos logit contributions
keeping+0.086
growth+0.077
rones+0.074
date+0.069
ASH+0.067
Neg logit contributions
lessly-0.094
 nice-0.083
 sweet-0.081
 shiny-0.080
 precious-0.078
Token,
Token ID11
Feature activation-9.79e-03

Pos logit contributions
keeping+0.112
rones+0.100
growth+0.100
ASH+0.099
IT+0.097
Neg logit contributions
lessly-0.061
 much-0.050
 nice-0.046
 good-0.043
 hard-0.042
Token for
Token ID329
Feature activation+2.54e-02

Pos logit contributions
lessly+0.017
 much+0.013
 precious+0.012
ww+0.012
 nice+0.012
Neg logit contributions
keeping-0.021
birds-0.018
rones-0.018
group-0.017
growth-0.017
Tokenka
Token ID4914
Feature activation-3.54e-02

Pos logit contributions
lessly+0.053
 much+0.044
 steep+0.039
 valuable+0.039
ww+0.038
Neg logit contributions
keeping-0.057
rones-0.049
birds-0.048
ASH-0.047
group-0.046
Token dot
Token ID16605
Feature activation-2.84e-02

Pos logit contributions
lessly+0.070
 much+0.056
 steep+0.056
 valuable+0.055
 important+0.055
Neg logit contributions
keeping-0.064
birds-0.060
bird-0.057
rones-0.055
growth-0.055
Token dress
Token ID6576
Feature activation-1.52e-02

Pos logit contributions
lessly+0.043
 much+0.034
 precious+0.031
ww+0.031
 nice+0.031
Neg logit contributions
keeping-0.068
birds-0.058
rones-0.057
growth-0.055
group-0.055
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
lessly+0.027
 much+0.022
ww+0.020
 precious+0.020
 nice+0.020
Neg logit contributions
keeping-0.033
birds-0.027
rones-0.027
group-0.026
growth-0.026
Token The
Token ID383
Feature activation-1.65e-02

Pos logit contributions
keeping+0.026
girls+0.022
growth+0.021
rones+0.020
date+0.019
Neg logit contributions
lessly-0.023
 much-0.022
 nice-0.020
 de-0.019
 sweet-0.019
Token dots
Token ID22969
Feature activation-9.83e-02

Pos logit contributions
lessly+0.033
 much+0.027
 nice+0.025
 precious+0.025
ww+0.025
Neg logit contributions
keeping-0.032
rones-0.027
birds-0.026
group-0.026
growth-0.025
Token on
Token ID319
Feature activation+2.45e-04

Pos logit contributions
lessly+0.180
ww+0.174
 valuable+0.158
 precious+0.157
 much+0.152
Neg logit contributions
birds-0.148
keeping-0.144
bird-0.136
stick-0.130
group-0.127
Token her
Token ID607
Feature activation+3.52e-02

Pos logit contributions
keeping+0.001
rones+0.000
girls+0.000
birds+0.000
seek+0.000
Neg logit contributions
lessly-0.000
 much-0.000
 nice-0.000
 shiny-0.000
 good-0.000
Token dress
Token ID6576
Feature activation-1.28e-02

Pos logit contributions
keeping+0.069
rones+0.057
birds+0.056
group+0.055
growth+0.054
Neg logit contributions
lessly-0.069
 much-0.057
 nice-0.054
 precious-0.054
 shiny-0.053
Token looked
Token ID3114
Feature activation-4.38e-02

Pos logit contributions
lessly+0.021
 much+0.017
 precious+0.015
 nice+0.015
ww+0.015
Neg logit contributions
keeping-0.029
rones-0.025
birds-0.024
group-0.024
ASH-0.024
Token like
Token ID588
Feature activation-2.63e-02

Pos logit contributions
lessly+0.069
ww+0.062
arest+0.055
ang+0.054
a+0.047
Neg logit contributions
birds-0.072
 Her-0.071
group-0.071
keeping-0.069
stick-0.066
Token each
Token ID1123
Feature activation-6.41e-03

Pos logit contributions
lessly+0.034
 much+0.027
 nice+0.025
 precious+0.025
 shiny+0.024
Neg logit contributions
keeping-0.042
rones-0.035
birds-0.035
group-0.034
growth-0.034
Token day
Token ID1110
Feature activation+4.71e-03

Pos logit contributions
lessly+0.011
 much+0.009
 nice+0.008
 precious+0.008
ww+0.008
Neg logit contributions
keeping-0.014
rones-0.012
birds-0.012
group-0.012
growth-0.012
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
keeping+0.010
birds+0.008
rones+0.008
group+0.008
growth+0.008
Neg logit contributions
lessly-0.009
 much-0.007
 precious-0.007
ww-0.007
 nice-0.007
Token
Token ID198
Feature activation-3.62e-02

Pos logit contributions
keeping+0.034
date+0.027
birds+0.027
ASH+0.027
IT+0.026
Neg logit contributions
lessly-0.024
 much-0.022
 proud-0.018
 nice-0.018
 good-0.017
Token
Token ID198
Feature activation-2.75e-02

Pos logit contributions
lessly+0.067
 much+0.059
 nice+0.052
 proud+0.052
 good+0.050
Neg logit contributions
keeping-0.078
ASH-0.063
birds-0.062
IT-0.062
rones-0.061
TokenVery
Token ID16371
Feature activation-9.40e-02

Pos logit contributions
lessly+0.046
 much+0.036
 precious+0.034
 nice+0.034
 shiny+0.033
Neg logit contributions
keeping-0.061
rones-0.052
birds-0.051
group-0.051
growth-0.050
Token soon
Token ID2582
Feature activation-3.56e-02

Pos logit contributions
lessly+0.113
 steep+0.084
 shallow+0.082
ented+0.080
 refreshing+0.078
Neg logit contributions
group-0.162
date-0.162
keeping-0.161
 meetings-0.159
 Wisdom-0.156
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
lessly+0.073
 precious+0.060
 valuable+0.058
 steep+0.057
 much+0.057
Neg logit contributions
keeping-0.063
 Lee-0.050
group-0.048
birds-0.047
rones-0.047
Token Anna
Token ID11735
Feature activation-1.97e-02

Pos logit contributions
lessly+0.065
 much+0.056
 nice+0.053
 precious+0.052
 proud+0.051
Neg logit contributions
keeping-0.047
rones-0.036
birds-0.036
group-0.035
ASH-0.034
Token was
Token ID373
Feature activation-3.30e-02

Pos logit contributions
lessly+0.031
 much+0.024
 nice+0.022
 precious+0.022
 proud+0.021
Neg logit contributions
keeping-0.046
rones-0.039
birds-0.039
group-0.038
growth-0.038
Token whist
Token ID32262
Feature activation-5.56e-02

Pos logit contributions
lessly+0.047
ww+0.034
 precious+0.031
 valuable+0.029
 much+0.029
Neg logit contributions
keeping-0.081
rones-0.069
birds-0.068
group-0.068
 preschool-0.067
Token the
Token ID262
Feature activation-8.16e-03

Pos logit contributions
keeping+0.053
rones+0.046
birds+0.044
ASH+0.044
group+0.043
Neg logit contributions
lessly-0.043
 much-0.037
 nice-0.035
 shiny-0.034
 precious-0.033
Token log
Token ID2604
Feature activation+4.03e-03

Pos logit contributions
lessly+0.017
 much+0.015
 nice+0.014
 precious+0.014
ww+0.013
Neg logit contributions
keeping-0.015
rones-0.012
birds-0.012
group-0.011
ASH-0.011
Token.
Token ID13
Feature activation+6.13e-03

Pos logit contributions
keeping+0.009
rones+0.007
birds+0.007
group+0.007
growth+0.007
Neg logit contributions
lessly-0.007
 much-0.005
 nice-0.005
 precious-0.005
 shiny-0.005
Token Their
Token ID5334
Feature activation+5.46e-03

Pos logit contributions
keeping+0.011
rones+0.009
ASH+0.009
group+0.009
girls+0.009
Neg logit contributions
lessly-0.013
 much-0.011
 proud-0.010
 nice-0.010
 precious-0.010
Token little
Token ID1310
Feature activation-4.35e-03

Pos logit contributions
keeping+0.011
rones+0.009
group+0.009
birds+0.009
growth+0.009
Neg logit contributions
lessly-0.010
 much-0.008
 nice-0.008
 precious-0.008
 shiny-0.008
Token hearts
Token ID11954
Feature activation-9.35e-02

Pos logit contributions
lessly+0.009
 much+0.007
ww+0.007
 precious+0.007
 nice+0.006
Neg logit contributions
keeping-0.008
birds-0.007
rones-0.007
ASH-0.007
group-0.007
Token were
Token ID547
Feature activation-4.16e-02

Pos logit contributions
lessly+0.162
ww+0.150
 much+0.147
 precious+0.143
 steep+0.143
Neg logit contributions
keeping-0.162
birds-0.143
bird-0.136
stick-0.132
growth-0.119
Token full
Token ID1336
Feature activation-2.55e-02

Pos logit contributions
lessly+0.061
ww+0.048
arest+0.044
ang+0.040
 steep+0.037
Neg logit contributions
keeping-0.091
birds-0.081
stick-0.076
rones-0.075
bird-0.075
Token of
Token ID286
Feature activation-1.77e-02

Pos logit contributions
lessly+0.037
 much+0.027
ww+0.027
 precious+0.026
 nice+0.025
Neg logit contributions
keeping-0.061
birds-0.053
group-0.050
rones-0.050
growth-0.050
Token joy
Token ID8716
Feature activation-3.81e-02

Pos logit contributions
lessly+0.023
ww+0.017
 much+0.015
arest+0.015
a+0.014
Neg logit contributions
keeping-0.045
birds-0.039
rones-0.038
growth-0.038
group-0.037
Token as
Token ID355
Feature activation-4.29e-02

Pos logit contributions
lessly+0.068
 much+0.056
ww+0.055
 precious+0.054
 steep+0.053
Neg logit contributions
keeping-0.077
birds-0.064
rones-0.058
stick-0.058
bird-0.057
Token of
Token ID286
Feature activation+4.01e-03

Pos logit contributions
lessly+0.023
 much+0.019
 precious+0.017
 nice+0.017
ww+0.017
Neg logit contributions
keeping-0.026
rones-0.021
birds-0.021
group-0.021
growth-0.020
Token her
Token ID607
Feature activation+1.03e-02

Pos logit contributions
keeping+0.008
rones+0.007
birds+0.007
group+0.006
ASH+0.006
Neg logit contributions
lessly-0.008
 much-0.006
 nice-0.006
 precious-0.006
 shiny-0.006
Token garden
Token ID11376
Feature activation-1.41e-03

Pos logit contributions
keeping+0.023
rones+0.019
birds+0.019
group+0.018
ASH+0.018
Neg logit contributions
lessly-0.018
 much-0.015
 nice-0.014
 precious-0.014
 shiny-0.013
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
lessly+0.002
 much+0.002
 nice+0.002
 shiny+0.002
 precious+0.002
Neg logit contributions
keeping-0.003
rones-0.003
ASH-0.003
growth-0.003
IT-0.003
Token The
Token ID383
Feature activation-1.90e-02

Pos logit contributions
keeping+0.019
girls+0.016
ASH+0.015
date+0.015
rones+0.015
Neg logit contributions
lessly-0.021
 much-0.019
 proud-0.016
 nice-0.016
 valuable-0.016
Token shapes
Token ID15268
Feature activation-9.27e-02

Pos logit contributions
lessly+0.035
 much+0.027
ww+0.026
 precious+0.024
 valuable+0.024
Neg logit contributions
keeping-0.039
birds-0.033
group-0.032
rones-0.032
 preschool-0.031
Token in
Token ID287
Feature activation-1.86e-03

Pos logit contributions
lessly+0.158
ww+0.153
 precious+0.136
 valuable+0.135
 much+0.134
Neg logit contributions
birds-0.151
keeping-0.146
bird-0.143
stick-0.131
group-0.129
Token the
Token ID262
Feature activation-6.87e-03

Pos logit contributions
lessly+0.003
 much+0.003
 nice+0.003
 shiny+0.003
 good+0.003
Neg logit contributions
keeping-0.004
girls-0.003
rones-0.003
seek-0.003
birds-0.003
Token clouds
Token ID15114
Feature activation-5.18e-02

Pos logit contributions
lessly+0.012
 much+0.010
 nice+0.009
 precious+0.009
 shiny+0.009
Neg logit contributions
keeping-0.015
rones-0.012
birds-0.012
group-0.012
growth-0.012
Token moved
Token ID3888
Feature activation-2.06e-02

Pos logit contributions
lessly+0.093
 much+0.075
ww+0.074
 precious+0.073
 valuable+0.072
Neg logit contributions
keeping-0.101
birds-0.092
bird-0.084
group-0.084
rones-0.084
Token and
Token ID290
Feature activation-3.80e-02

Pos logit contributions
lessly+0.032
 much+0.024
ww+0.024
 precious+0.023
 proud+0.023
Neg logit contributions
keeping-0.047
birds-0.041
group-0.039
rones-0.039
ASH-0.038
Token out
Token ID503
Feature activation-4.69e-02

Pos logit contributions
ww+0.072
lessly+0.065
arest+0.058
br+0.054
 pounds+0.054
Neg logit contributions
keeping-0.053
birds-0.045
group-0.044
 Her-0.041
 exhibit-0.041
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
lessly+0.089
ww+0.089
arest+0.077
 precious+0.075
a+0.073
Neg logit contributions
keeping-0.076
birds-0.069
growth-0.063
 Her-0.062
group-0.061
Token
Token ID198
Feature activation-3.86e-02

Pos logit contributions
keeping+0.020
girls+0.016
rones+0.016
birds+0.016
ASH+0.015
Neg logit contributions
lessly-0.023
 much-0.020
 proud-0.018
 nice-0.017
 good-0.017
Token
Token ID198
Feature activation-2.99e-02

Pos logit contributions
lessly+0.084
 much+0.072
 nice+0.067
 precious+0.066
 proud+0.066
Neg logit contributions
keeping-0.068
rones-0.054
birds-0.054
ASH-0.052
group-0.052
TokenThe
Token ID464
Feature activation-2.31e-02

Pos logit contributions
lessly+0.051
 precious+0.040
 shiny+0.039
 much+0.039
ww+0.038
Neg logit contributions
keeping-0.062
group-0.053
rones-0.052
birds-0.052
date-0.052
Token view
Token ID1570
Feature activation-8.99e-02

Pos logit contributions
lessly+0.041
 much+0.032
ww+0.030
 precious+0.028
 proud+0.028
Neg logit contributions
keeping-0.047
birds-0.040
growth-0.039
group-0.039
rones-0.038
Token from
Token ID422
Feature activation+2.08e-03

Pos logit contributions
lessly+0.140
ww+0.120
 much+0.117
 precious+0.116
 valuable+0.110
Neg logit contributions
birds-0.166
keeping-0.165
bird-0.159
date-0.150
group-0.146
Token the
Token ID262
Feature activation-6.29e-03

Pos logit contributions
keeping+0.004
rones+0.003
birds+0.003
group+0.003
growth+0.003
Neg logit contributions
lessly-0.004
 much-0.003
 nice-0.003
 precious-0.003
 shiny-0.003
Token top
Token ID1353
Feature activation-4.33e-02

Pos logit contributions
lessly+0.013
 much+0.010
ww+0.010
 precious+0.010
 proud+0.009
Neg logit contributions
keeping-0.012
birds-0.010
growth-0.009
rones-0.009
group-0.009
Token was
Token ID373
Feature activation-3.47e-02

Pos logit contributions
lessly+0.060
 much+0.044
 precious+0.042
 nice+0.041
ww+0.041
Neg logit contributions
keeping-0.106
birds-0.095
rones-0.092
growth-0.091
group-0.091
Token beautiful
Token ID4950
Feature activation-3.63e-02

Pos logit contributions
lessly+0.058
ww+0.049
arest+0.044
 precious+0.043
 much+0.041
Neg logit contributions
keeping-0.068
birds-0.059
IT-0.058
ASH-0.057
stick-0.057
Token fun
Token ID1257
Feature activation+8.67e-03

Pos logit contributions
lessly+0.040
ww+0.025
arest+0.024
 much+0.022
 proud+0.021
Neg logit contributions
keeping-0.080
rones-0.069
birds-0.067
ASH-0.067
IT-0.066
Token?"
Token ID1701
Feature activation+3.22e-02

Pos logit contributions
keeping+0.018
birds+0.014
rones+0.014
group+0.014
ASH+0.014
Neg logit contributions
lessly-0.016
 much-0.013
 precious-0.012
ww-0.012
 nice-0.012
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
keeping+0.077
rones+0.066
ASH+0.064
girls+0.063
growth+0.063
Neg logit contributions
lessly-0.046
 much-0.042
 proud-0.041
 very-0.034
 happy-0.033
Token
Token ID198
Feature activation-1.78e-02

Pos logit contributions
lessly+0.029
 much+0.028
 proud+0.024
 nice+0.023
 good+0.021
Neg logit contributions
keeping-0.052
ASH-0.044
IT-0.043
rones-0.042
stick-0.042
TokenThe
Token ID464
Feature activation-1.65e-02

Pos logit contributions
lessly+0.025
 much+0.020
 nice+0.018
 proud+0.018
ww+0.018
Neg logit contributions
keeping-0.045
rones-0.039
growth-0.038
ASH-0.038
birds-0.037
Token shapes
Token ID15268
Feature activation-8.93e-02

Pos logit contributions
lessly+0.024
 much+0.019
 nice+0.017
 precious+0.017
 shiny+0.017
Neg logit contributions
keeping-0.041
rones-0.035
birds-0.034
group-0.034
growth-0.034
Token thought
Token ID1807
Feature activation-2.25e-02

Pos logit contributions
lessly+0.149
ww+0.139
 much+0.128
 precious+0.127
 valuable+0.122
Neg logit contributions
keeping-0.154
birds-0.150
bird-0.146
group-0.142
date-0.137
Token for
Token ID329
Feature activation+1.91e-02

Pos logit contributions
lessly+0.032
ww+0.025
 much+0.024
 precious+0.023
 proud+0.022
Neg logit contributions
keeping-0.051
birds-0.045
group-0.043
growth-0.043
rones-0.042
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
keeping+0.051
rones+0.045
birds+0.044
group+0.044
girls+0.043
Neg logit contributions
lessly-0.022
 much-0.020
 nice-0.019
 good-0.017
 hard-0.017
Token moment
Token ID2589
Feature activation-1.77e-02

Pos logit contributions
lessly+0.027
 precious+0.021
bling+0.021
 proud+0.020
ww+0.020
Neg logit contributions
keeping-0.022
group-0.018
date-0.018
 Monkey-0.018
growth-0.018
Token and
Token ID290
Feature activation-4.01e-02

Pos logit contributions
lessly+0.029
 much+0.023
 precious+0.022
ww+0.022
 nice+0.021
Neg logit contributions
keeping-0.039
birds-0.033
rones-0.032
group-0.032
growth-0.032
Tokenummy
Token ID13513
Feature activation+4.56e-03

Pos logit contributions
lessly+0.027
 much+0.022
 precious+0.021
 nice+0.021
 proud+0.021
Neg logit contributions
keeping-0.024
rones-0.020
birds-0.020
group-0.019
growth-0.019
Token smelling
Token ID48090
Feature activation-2.25e-02

Pos logit contributions
keeping+0.007
birds+0.006
rones+0.006
group+0.005
ASH+0.005
Neg logit contributions
lessly-0.011
 much-0.009
 precious-0.008
ww-0.008
 proud-0.008
Token biscuits
Token ID50128
Feature activation-1.38e-02

Pos logit contributions
lessly+0.043
 much+0.033
 precious+0.032
ww+0.032
arest+0.031
Neg logit contributions
keeping-0.044
birds-0.037
rones-0.035
group-0.035
growth-0.034
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
lessly+0.024
 much+0.019
 precious+0.018
ww+0.018
 nice+0.018
Neg logit contributions
keeping-0.030
birds-0.024
rones-0.023
group-0.023
girls-0.023
Token The
Token ID383
Feature activation-1.88e-02

Pos logit contributions
keeping+0.020
rones+0.016
ASH+0.016
IT+0.015
birds+0.015
Neg logit contributions
lessly-0.021
 much-0.018
 proud-0.015
 nice-0.015
 valuable-0.015
Token shapes
Token ID15268
Feature activation-8.92e-02

Pos logit contributions
lessly+0.038
 much+0.031
ww+0.029
 precious+0.029
 nice+0.029
Neg logit contributions
keeping-0.035
birds-0.029
rones-0.029
group-0.028
ASH-0.027
Token were
Token ID547
Feature activation-4.16e-02

Pos logit contributions
lessly+0.169
ww+0.155
 precious+0.143
 steep+0.142
 valuable+0.141
Neg logit contributions
keeping-0.141
birds-0.133
bird-0.124
group-0.118
date-0.115
Token all
Token ID477
Feature activation-4.90e-02

Pos logit contributions
lessly+0.061
ww+0.046
arest+0.043
 much+0.036
 precious+0.035
Neg logit contributions
keeping-0.096
birds-0.084
rones-0.082
girls-0.080
 preschool-0.078
Token pretty
Token ID2495
Feature activation-2.77e-02

Pos logit contributions
lessly+0.080
ww+0.061
arest+0.059
 much+0.055
 precious+0.054
Neg logit contributions
keeping-0.107
birds-0.090
rones-0.087
stick-0.084
group-0.083
Token and
Token ID290
Feature activation-4.94e-02

Pos logit contributions
lessly+0.047
 much+0.036
ww+0.033
 precious+0.033
 nice+0.033
Neg logit contributions
keeping-0.063
birds-0.053
rones-0.052
growth-0.050
ASH-0.050
Token covered
Token ID5017
Feature activation-1.78e-02

Pos logit contributions
lessly+0.072
arest+0.048
ww+0.047
 much+0.043
ust+0.041
Neg logit contributions
keeping-0.120
birds-0.100
rones-0.098
girls-0.096
ASH-0.093
Token better
Token ID1365
Feature activation-1.41e-02

Pos logit contributions
lessly+0.083
ww+0.079
arest+0.061
ang+0.060
a+0.057
Neg logit contributions
keeping-0.123
group-0.104
 lectures-0.104
 heaven-0.103
growth-0.103
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
lessly+0.028
ww+0.024
 much+0.024
 precious+0.023
 nice+0.022
Neg logit contributions
keeping-0.028
rones-0.019
birds-0.019
group-0.019
ASH-0.019
Token
Token ID220
Feature activation-1.66e-02

Pos logit contributions
keeping+0.023
birds+0.019
rones+0.019
group+0.019
ASH+0.018
Neg logit contributions
lessly-0.026
 much-0.022
 proud-0.020
 nice-0.019
 valuable-0.019
Token
Token ID198
Feature activation-2.94e-02

Pos logit contributions
lessly+0.033
 much+0.029
 proud+0.025
 nice+0.025
 precious+0.024
Neg logit contributions
keeping-0.032
rones-0.026
birds-0.026
group-0.026
growth-0.026
TokenThe
Token ID464
Feature activation-2.40e-02

Pos logit contributions
lessly+0.063
 much+0.054
 nice+0.050
 precious+0.049
 proud+0.049
Neg logit contributions
keeping-0.053
rones-0.042
birds-0.042
group-0.041
ASH-0.041
Token view
Token ID1570
Feature activation-8.88e-02

Pos logit contributions
lessly+0.053
 much+0.044
ww+0.041
 precious+0.041
 nice+0.041
Neg logit contributions
keeping-0.041
rones-0.032
birds-0.032
group-0.032
growth-0.031
Token that
Token ID326
Feature activation-1.53e-02

Pos logit contributions
lessly+0.145
ww+0.122
 much+0.120
 precious+0.118
arest+0.115
Neg logit contributions
keeping-0.165
birds-0.154
bird-0.148
date-0.145
group-0.138
Token he
Token ID339
Feature activation-2.46e-02

Pos logit contributions
lessly+0.036
 much+0.030
 precious+0.029
 nice+0.029
ww+0.029
Neg logit contributions
keeping-0.024
rones-0.018
birds-0.018
group-0.017
ASH-0.017
Token wanted
Token ID2227
Feature activation+1.56e-02

Pos logit contributions
lessly+0.043
 much+0.033
ww+0.033
 nice+0.032
 precious+0.030
Neg logit contributions
keeping-0.052
birds-0.043
rones-0.043
growth-0.043
girls-0.042
Token to
Token ID284
Feature activation+6.17e-02

Pos logit contributions
keeping+0.032
rones+0.028
group+0.027
birds+0.027
ASH+0.026
Neg logit contributions
lessly-0.028
 much-0.025
 nice-0.023
 good-0.022
 precious-0.022
Token reach
Token ID3151
Feature activation+1.27e-02

Pos logit contributions
keeping+0.134
rones+0.117
birds+0.113
ASH+0.111
group+0.110
Neg logit contributions
lessly-0.105
 much-0.088
 nice-0.082
 precious-0.079
 proud-0.078
Token the
Token ID262
Feature activation-5.34e-03

Pos logit contributions
keeping+0.026
rones+0.022
birds+0.022
group+0.021
IT+0.021
Neg logit contributions
lessly-0.019
 much-0.017
 nice-0.015
 shiny-0.015
 good-0.015
Token water
Token ID1660
Feature activation-4.48e-03

Pos logit contributions
lessly+0.012
 much+0.009
 impressed+0.009
arest+0.009
 pleased+0.009
Neg logit contributions
keeping-0.009
growth-0.007
ASH-0.007
rones-0.007
group-0.007
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
lessly+0.008
 much+0.007
 precious+0.006
 nice+0.006
ww+0.006
Neg logit contributions
keeping-0.009
rones-0.008
birds-0.008
group-0.007
ASH-0.007
Token The
Token ID383
Feature activation-1.43e-02

Pos logit contributions
keeping+0.027
rones+0.021
birds+0.020
group+0.020
ASH+0.020
Neg logit contributions
lessly-0.036
 much-0.031
 nice-0.029
 precious-0.028
 shiny-0.028
Token funny
Token ID8258
Feature activation-5.30e-03

Pos logit contributions
lessly+0.028
 much+0.022
ww+0.021
 precious+0.020
arest+0.019
Neg logit contributions
keeping-0.027
birds-0.024
rones-0.023
girls-0.023
group-0.023
Token shapes
Token ID15268
Feature activation-8.70e-02

Pos logit contributions
lessly+0.010
 much+0.008
arest+0.008
 precious+0.008
ww+0.008
Neg logit contributions
keeping-0.010
birds-0.009
rones-0.008
bird-0.008
group-0.008
Token they
Token ID484
Feature activation-2.84e-02

Pos logit contributions
ww+0.174
lessly+0.172
 precious+0.158
 valuable+0.156
 steep+0.148
Neg logit contributions
birds-0.116
bird-0.110
keeping-0.103
growth-0.097
stick-0.094
Token made
Token ID925
Feature activation-4.46e-03

Pos logit contributions
lessly+0.051
ww+0.044
 much+0.042
 precious+0.040
 valuable+0.040
Neg logit contributions
keeping-0.053
birds-0.046
growth-0.046
girls-0.045
rones-0.043
Token made
Token ID925
Feature activation-2.55e-03

Pos logit contributions
lessly+0.009
 much+0.008
 precious+0.007
ww+0.007
 valuable+0.007
Neg logit contributions
keeping-0.008
birds-0.006
rones-0.006
group-0.006
growth-0.006
Token him
Token ID683
Feature activation+2.33e-02

Pos logit contributions
lessly+0.005
ww+0.004
 precious+0.004
 much+0.004
 valuable+0.004
Neg logit contributions
keeping-0.005
birds-0.004
rones-0.004
group-0.004
growth-0.004
Token laugh
Token ID6487
Feature activation-1.89e-02

Pos logit contributions
keeping+0.048
birds+0.041
rones+0.038
girls+0.038
group+0.038
Neg logit contributions
lessly-0.042
ww-0.033
 precious-0.030
arest-0.029
 much-0.028
Token the
Token ID262
Feature activation-9.82e-04

Pos logit contributions
keeping+0.009
rones+0.008
birds+0.008
growth+0.008
IT+0.008
Neg logit contributions
lessly-0.006
 much-0.005
 nice-0.005
 shiny-0.004
 proud-0.004
Token forest
Token ID8222
Feature activation+2.86e-03

Pos logit contributions
lessly+0.002
 much+0.002
ww+0.002
 precious+0.002
 nice+0.001
Neg logit contributions
keeping-0.002
rones-0.001
group-0.001
ASH-0.001
birds-0.001
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
keeping+0.006
rones+0.005
birds+0.005
group+0.005
growth+0.005
Neg logit contributions
lessly-0.005
 much-0.004
 nice-0.004
 precious-0.004
 shiny-0.003
Token The
Token ID383
Feature activation-1.77e-02

Pos logit contributions
keeping+0.027
rones+0.021
date+0.020
group+0.020
birds+0.020
Neg logit contributions
lessly-0.029
 much-0.026
 proud-0.022
 nice-0.022
 hard-0.022
Token two
Token ID734
Feature activation-1.72e-02

Pos logit contributions
lessly+0.037
ww+0.030
 much+0.029
 precious+0.027
arest+0.026
Neg logit contributions
keeping-0.031
birds-0.026
ASH-0.026
group-0.026
rones-0.025
Token shapes
Token ID15268
Feature activation-8.70e-02

Pos logit contributions
lessly+0.034
ww+0.028
 much+0.026
 precious+0.026
 valuable+0.026
Neg logit contributions
keeping-0.029
birds-0.029
group-0.026
rones-0.026
bird-0.025
Token were
Token ID547
Feature activation-3.14e-02

Pos logit contributions
lessly+0.169
ww+0.157
 precious+0.145
 valuable+0.140
 much+0.138
Neg logit contributions
birds-0.131
keeping-0.129
bird-0.124
group-0.118
stick-0.116
Token no
Token ID645
Feature activation-2.71e-02

Pos logit contributions
lessly+0.051
ww+0.041
arest+0.038
 precious+0.034
 much+0.034
Neg logit contributions
keeping-0.069
birds-0.059
girls-0.058
rones-0.058
stick-0.056
Token longer
Token ID2392
Feature activation-5.30e-02

Pos logit contributions
lessly+0.047
ww+0.039
bling+0.033
arest+0.032
 much+0.031
Neg logit contributions
keeping-0.051
birds-0.047
rones-0.047
 meetings-0.045
group-0.045
Token weak
Token ID4939
Feature activation-1.96e-02

Pos logit contributions
lessly+0.079
ww+0.063
arest+0.060
 much+0.056
 precious+0.054
Neg logit contributions
keeping-0.127
birds-0.104
rones-0.104
group-0.099
ASH-0.098
Token and
Token ID290
Feature activation-4.16e-02

Pos logit contributions
lessly+0.038
 much+0.031
ww+0.030
 precious+0.030
 proud+0.029
Neg logit contributions
keeping-0.039
birds-0.031
rones-0.030
group-0.030
growth-0.030
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
keeping+0.000
rones+0.000
birds+0.000
group+0.000
growth+0.000
Neg logit contributions
lessly-0.000
 much-0.000
 nice-0.000
 precious-0.000
 shiny-0.000
Token
Token ID198
Feature activation-4.06e-02

Pos logit contributions
keeping+0.026
group+0.020
date+0.020
rones+0.020
birds+0.020
Neg logit contributions
lessly-0.026
 much-0.024
 proud-0.020
 nice-0.020
 valuable-0.020
Token
Token ID198
Feature activation-3.13e-02

Pos logit contributions
lessly+0.071
 much+0.059
 nice+0.054
 proud+0.053
 precious+0.052
Neg logit contributions
keeping-0.090
rones-0.074
birds-0.074
ASH-0.073
group-0.073
TokenThe
Token ID464
Feature activation-1.98e-02

Pos logit contributions
lessly+0.049
 much+0.036
 precious+0.035
 nice+0.034
 shiny+0.034
Neg logit contributions
keeping-0.071
birds-0.062
rones-0.062
group-0.062
growth-0.061
Token two
Token ID734
Feature activation-1.74e-02

Pos logit contributions
lessly+0.039
 much+0.031
ww+0.031
 precious+0.029
 nice+0.028
Neg logit contributions
keeping-0.037
birds-0.031
group-0.031
ASH-0.031
rones-0.031
Token shapes
Token ID15268
Feature activation-8.66e-02

Pos logit contributions
lessly+0.035
ww+0.029
 much+0.027
 precious+0.026
 valuable+0.026
Neg logit contributions
keeping-0.029
birds-0.028
group-0.026
rones-0.026
bird-0.025
Token became
Token ID2627
Feature activation-4.60e-02

Pos logit contributions
lessly+0.159
ww+0.138
 much+0.130
 precious+0.129
 steep+0.125
Neg logit contributions
keeping-0.151
birds-0.144
bird-0.136
group-0.133
date-0.131
Token friends
Token ID2460
Feature activation-6.47e-04

Pos logit contributions
lessly+0.063
 much+0.045
 precious+0.041
ww+0.040
 nice+0.040
Neg logit contributions
keeping-0.118
rones-0.102
birds-0.101
group-0.099
ASH-0.098
Token and
Token ID290
Feature activation-2.03e-02

Pos logit contributions
lessly+0.001
 much+0.001
 nice+0.001
 precious+0.001
 proud+0.001
Neg logit contributions
keeping-0.002
rones-0.001
ASH-0.001
growth-0.001
birds-0.001
Token they
Token ID484
Feature activation-2.31e-02

Pos logit contributions
lessly+0.034
 much+0.027
 nice+0.025
 precious+0.025
 shiny+0.024
Neg logit contributions
keeping-0.046
rones-0.039
birds-0.038
group-0.038
growth-0.037
Token played
Token ID2826
Feature activation+1.22e-02

Pos logit contributions
lessly+0.043
 much+0.034
 precious+0.033
ww+0.033
 nice+0.032
Neg logit contributions
keeping-0.046
birds-0.039
rones-0.038
group-0.037
girls-0.037
Token's
Token ID338
Feature activation+1.28e-02

Pos logit contributions
keeping+0.051
rones+0.042
birds+0.042
group+0.041
growth+0.041
Neg logit contributions
lessly-0.042
 much-0.033
 nice-0.031
 precious-0.031
ww-0.030
Token courage
Token ID11917
Feature activation-2.39e-02

Pos logit contributions
keeping+0.027
rones+0.023
group+0.022
birds+0.022
ASH+0.022
Neg logit contributions
lessly-0.024
 much-0.020
 nice-0.019
 precious-0.019
 valuable-0.018
Token.
Token ID13
Feature activation+1.95e-02

Pos logit contributions
lessly+0.045
ww+0.040
 much+0.036
 precious+0.036
 nice+0.036
Neg logit contributions
keeping-0.046
birds-0.035
group-0.033
growth-0.033
stick-0.033
Token The
Token ID383
Feature activation-9.12e-03

Pos logit contributions
keeping+0.034
rones+0.027
date+0.027
group+0.026
birds+0.026
Neg logit contributions
lessly-0.044
 much-0.039
 proud-0.034
 valuable-0.034
 nice-0.034
Token two
Token ID734
Feature activation-1.56e-02

Pos logit contributions
lessly+0.016
 much+0.012
ww+0.012
arest+0.011
 proud+0.011
Neg logit contributions
keeping-0.019
birds-0.016
rones-0.016
growth-0.016
group-0.016
Token de
Token ID390
Feature activation-8.60e-02

Pos logit contributions
lessly+0.031
ww+0.025
 much+0.024
arest+0.023
 valuable+0.022
Neg logit contributions
keeping-0.025
birds-0.025
girls-0.023
rones-0.022
 Monkey-0.022
Tokeners
Token ID364
Feature activation-2.83e-02

Pos logit contributions
lessly+0.148
 shiny+0.118
 much+0.118
 steep+0.117
 precious+0.115
Neg logit contributions
keeping-0.152
group-0.132
birds-0.131
date-0.131
girls-0.130
Token became
Token ID2627
Feature activation-4.50e-02

Pos logit contributions
lessly+0.047
 much+0.037
 nice+0.034
 precious+0.034
 proud+0.033
Neg logit contributions
keeping-0.064
rones-0.054
birds-0.053
group-0.052
ASH-0.052
Token good
Token ID922
Feature activation-8.84e-03

Pos logit contributions
lessly+0.058
 much+0.043
 nice+0.039
 precious+0.038
 shiny+0.037
Neg logit contributions
keeping-0.119
rones-0.102
birds-0.101
group-0.100
growth-0.100
Token friends
Token ID2460
Feature activation+5.72e-03

Pos logit contributions
lessly+0.014
 much+0.011
 nice+0.010
 precious+0.010
 proud+0.010
Neg logit contributions
keeping-0.021
rones-0.018
growth-0.018
ASH-0.018
group-0.017
Token after
Token ID706
Feature activation+1.29e-02

Pos logit contributions
keeping+0.013
birds+0.011
rones+0.011
group+0.011
growth+0.011
Neg logit contributions
lessly-0.009
 much-0.007
ww-0.007
 nice-0.007
 precious-0.007
Token from
Token ID422
Feature activation+4.93e-03

Pos logit contributions
lessly+0.014
 much+0.011
 precious+0.010
 nice+0.010
ww+0.010
Neg logit contributions
keeping-0.018
rones-0.015
birds-0.015
group-0.015
growth-0.015
Token the
Token ID262
Feature activation-4.31e-03

Pos logit contributions
keeping+0.010
growth+0.008
rones+0.008
birds+0.008
ASH+0.008
Neg logit contributions
lessly-0.010
 much-0.008
ww-0.007
 precious-0.007
 nice-0.007
Token bird
Token ID6512
Feature activation+2.49e-02

Pos logit contributions
lessly+0.009
 much+0.007
ww+0.007
 precious+0.007
 nice+0.007
Neg logit contributions
keeping-0.008
birds-0.006
ASH-0.006
rones-0.006
IT-0.006
Token above
Token ID2029
Feature activation-1.62e-02

Pos logit contributions
keeping+0.053
rones+0.044
birds+0.043
group+0.042
growth+0.042
Neg logit contributions
lessly-0.045
 much-0.036
 nice-0.034
 precious-0.034
ww-0.033
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
lessly+0.032
 much+0.027
 nice+0.026
 shiny+0.025
 good+0.025
Neg logit contributions
keeping-0.032
rones-0.027
ASH-0.025
girls-0.025
birds-0.025
Token Start
Token ID7253
Feature activation-8.56e-02

Pos logit contributions
keeping+0.022
rones+0.018
birds+0.017
group+0.017
ASH+0.017
Neg logit contributions
lessly-0.028
 much-0.024
 nice-0.022
 proud-0.022
 precious-0.022
Tokenled
Token ID992
Feature activation-3.29e-02

Pos logit contributions
lessly+0.122
 much+0.113
ww+0.113
 precious+0.113
 steep+0.112
Neg logit contributions
keeping-0.154
growth-0.136
birds-0.135
gy-0.134
stick-0.133
Token,
Token ID11
Feature activation-3.65e-02

Pos logit contributions
lessly+0.063
 much+0.051
 precious+0.049
ww+0.049
 valuable+0.048
Neg logit contributions
keeping-0.063
birds-0.052
growth-0.051
group-0.051
ASH-0.050
Token Bobby
Token ID17155
Feature activation+7.64e-04

Pos logit contributions
lessly+0.075
ww+0.064
 precious+0.062
 much+0.061
arest+0.061
Neg logit contributions
keeping-0.063
growth-0.050
birds-0.048
group-0.047
 Monkey-0.047
Token screamed
Token ID25421
Feature activation-2.04e-03

Pos logit contributions
keeping+0.002
birds+0.001
rones+0.001
ASH+0.001
growth+0.001
Neg logit contributions
lessly-0.001
 much-0.001
 precious-0.001
 nice-0.001
ww-0.001
Token and
Token ID290
Feature activation-2.33e-02

Pos logit contributions
lessly+0.004
ww+0.003
 precious+0.003
arest+0.003
 much+0.003
Neg logit contributions
keeping-0.004
birds-0.003
group-0.003
ASH-0.003
stick-0.003
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
lessly+0.008
 much+0.006
 nice+0.006
 shiny+0.006
 precious+0.006
Neg logit contributions
keeping-0.009
rones-0.008
group-0.007
birds-0.007
ASH-0.007
Token trees
Token ID7150
Feature activation-1.08e-02

Pos logit contributions
lessly+0.020
 much+0.017
ww+0.016
 precious+0.015
 proud+0.015
Neg logit contributions
keeping-0.019
birds-0.016
rones-0.015
IT-0.015
ASH-0.015
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
lessly+0.018
 much+0.015
 nice+0.013
 precious+0.013
 shiny+0.013
Neg logit contributions
keeping-0.024
rones-0.020
birds-0.020
group-0.020
growth-0.020
Token All
Token ID1439
Feature activation-5.31e-02

Pos logit contributions
keeping+0.031
birds+0.025
rones+0.024
group+0.024
ASH+0.024
Neg logit contributions
lessly-0.035
 much-0.031
 nice-0.027
 proud-0.027
 valuable-0.026
Token these
Token ID777
Feature activation-2.69e-02

Pos logit contributions
lessly+0.110
ww+0.092
 precious+0.092
arest+0.091
 much+0.089
Neg logit contributions
keeping-0.085
birds-0.075
growth-0.069
bird-0.068
rones-0.068
Token sounds
Token ID5238
Feature activation-8.53e-02

Pos logit contributions
lessly+0.052
 much+0.042
ww+0.042
 proud+0.039
 nice+0.039
Neg logit contributions
keeping-0.050
birds-0.044
rones-0.041
growth-0.040
girls-0.040
Token brought
Token ID3181
Feature activation+4.20e-03

Pos logit contributions
ww+0.132
lessly+0.126
arest+0.123
a+0.121
ented+0.118
Neg logit contributions
birds-0.135
keeping-0.118
 Maria-0.116
bird-0.115
 meetings-0.113
Token him
Token ID683
Feature activation+2.72e-02

Pos logit contributions
keeping+0.008
rones+0.006
birds+0.006
group+0.006
growth+0.006
Neg logit contributions
lessly-0.009
 much-0.007
 nice-0.007
 precious-0.007
ww-0.006
Token peace
Token ID4167
Feature activation-1.74e-02

Pos logit contributions
keeping+0.057
birds+0.047
rones+0.046
growth+0.045
group+0.045
Neg logit contributions
lessly-0.049
ww-0.037
 much-0.037
 precious-0.035
 proud-0.034
Token and
Token ID290
Feature activation-3.47e-02

Pos logit contributions
lessly+0.031
 much+0.025
ww+0.024
 precious+0.023
 proud+0.023
Neg logit contributions
keeping-0.038
birds-0.031
rones-0.030
growth-0.029
group-0.029
Token happiness
Token ID12157
Feature activation-3.35e-02

Pos logit contributions
lessly+0.060
ww+0.044
 much+0.044
arest+0.043
 precious+0.041
Neg logit contributions
keeping-0.075
birds-0.060
girls-0.060
growth-0.059
rones-0.059
Token end
Token ID886
Feature activation-1.09e-03

Pos logit contributions
lessly+0.036
 much+0.031
 precious+0.029
 nice+0.029
ww+0.029
Neg logit contributions
keeping-0.021
rones-0.016
birds-0.016
group-0.015
growth-0.015
Token,
Token ID11
Feature activation-2.82e-02

Pos logit contributions
lessly+0.001
 much+0.001
 nice+0.001
 shiny+0.001
 precious+0.001
Neg logit contributions
keeping-0.003
rones-0.002
ASH-0.002
IT-0.002
birds-0.002
Token the
Token ID262
Feature activation-2.80e-02

Pos logit contributions
lessly+0.048
ww+0.038
 much+0.037
 precious+0.036
arest+0.035
Neg logit contributions
keeping-0.060
birds-0.050
group-0.048
rones-0.048
ASH-0.047
Token garden
Token ID11376
Feature activation-2.86e-02

Pos logit contributions
lessly+0.063
 much+0.053
ww+0.050
 precious+0.049
 nice+0.048
Neg logit contributions
keeping-0.047
group-0.037
rones-0.036
birds-0.036
date-0.034
Token was
Token ID373
Feature activation-4.00e-02

Pos logit contributions
lessly+0.047
 much+0.036
 nice+0.034
 precious+0.034
ww+0.033
Neg logit contributions
keeping-0.066
rones-0.055
birds-0.055
group-0.054
growth-0.053
Token even
Token ID772
Feature activation-8.52e-02

Pos logit contributions
lessly+0.061
ww+0.050
arest+0.042
 precious+0.039
ang+0.037
Neg logit contributions
keeping-0.090
date-0.075
birds-0.075
rones-0.075
group-0.074
Token more
Token ID517
Feature activation-6.83e-02

Pos logit contributions
lessly+0.146
ww+0.133
arest+0.129
oun+0.112
ve+0.112
Neg logit contributions
keeping-0.143
birds-0.120
rones-0.119
ASH-0.118
 moderation-0.118
Token beautiful
Token ID4950
Feature activation-3.36e-02

Pos logit contributions
lessly+0.084
ww+0.071
arest+0.061
bles+0.055
br+0.055
Neg logit contributions
keeping-0.158
birds-0.126
 moderation-0.123
rones-0.122
 meetings-0.121
Token than
Token ID621
Feature activation-2.12e-02

Pos logit contributions
lessly+0.059
 much+0.047
ww+0.045
arest+0.043
 precious+0.043
Neg logit contributions
keeping-0.075
birds-0.060
rones-0.057
growth-0.056
group-0.056
Token before
Token ID878
Feature activation-1.27e-02

Pos logit contributions
lessly+0.040
 much+0.032
 nice+0.030
 precious+0.030
 shiny+0.029
Neg logit contributions
keeping-0.044
rones-0.036
birds-0.036
group-0.035
growth-0.035
Token!
Token ID0
Feature activation+6.16e-03

Pos logit contributions
lessly+0.020
 much+0.016
 nice+0.015
 precious+0.014
 shiny+0.014
Neg logit contributions
keeping-0.030
rones-0.026
birds-0.025
group-0.025
ASH-0.025
Token makes
Token ID1838
Feature activation-7.58e-03

Pos logit contributions
lessly+0.037
 much+0.030
ww+0.028
 precious+0.028
 valuable+0.028
Neg logit contributions
keeping-0.033
ASH-0.029
birds-0.029
rones-0.028
group-0.027
Token a
Token ID257
Feature activation-1.53e-02

Pos logit contributions
lessly+0.014
 much+0.011
 precious+0.011
ww+0.011
 nice+0.011
Neg logit contributions
keeping-0.015
rones-0.013
birds-0.012
ASH-0.012
growth-0.012
Token loud
Token ID7812
Feature activation-4.54e-03

Pos logit contributions
lessly+0.026
dd+0.020
led+0.018
ier+0.018
arest+0.016
Neg logit contributions
ASH-0.029
 exhibit-0.025
 visit-0.025
keeping-0.025
IT-0.025
Token noise
Token ID7838
Feature activation-3.94e-02

Pos logit contributions
lessly+0.009
 precious+0.007
 much+0.007
 valuable+0.007
arest+0.006
Neg logit contributions
keeping-0.008
ASH-0.007
stick-0.007
birds-0.007
IT-0.007
Token and
Token ID290
Feature activation-3.78e-02

Pos logit contributions
lessly+0.085
 valuable+0.072
 precious+0.072
ww+0.071
arest+0.067
Neg logit contributions
keeping-0.059
birds-0.051
ASH-0.048
group-0.047
 Lee-0.047
Token flashes
Token ID29227
Feature activation-8.51e-02

Pos logit contributions
lessly+0.077
ww+0.062
arest+0.060
ust+0.060
 much+0.058
Neg logit contributions
keeping-0.062
ASH-0.055
girls-0.054
 Lee-0.054
growth-0.053
Token its
Token ID663
Feature activation-1.65e-02

Pos logit contributions
lessly+0.159
 valuable+0.148
ww+0.148
 precious+0.147
arest+0.135
Neg logit contributions
birds-0.127
keeping-0.125
bird-0.115
stick-0.112
group-0.109
Token lights
Token ID7588
Feature activation-4.99e-02

Pos logit contributions
lessly+0.033
ww+0.027
 much+0.026
arest+0.025
 precious+0.025
Neg logit contributions
keeping-0.029
ASH-0.025
stick-0.025
birds-0.024
rones-0.024
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
lessly+0.098
ww+0.086
 valuable+0.084
 precious+0.084
 important+0.080
Neg logit contributions
keeping-0.083
birds-0.073
ASH-0.072
stick-0.070
growth-0.068
Token Tom
Token ID4186
Feature activation+2.57e-03

Pos logit contributions
keeping+0.032
birds+0.026
rones+0.026
group+0.025
date+0.025
Neg logit contributions
lessly-0.036
 much-0.031
 nice-0.028
 shiny-0.028
 good-0.027
Token is
Token ID318
Feature activation-3.56e-02

Pos logit contributions
keeping+0.005
birds+0.004
rones+0.004
ASH+0.004
girls+0.004
Neg logit contributions
lessly-0.005
 much-0.004
ww-0.004
 precious-0.004
 valuable-0.004
Token,
Token ID11
Feature activation-2.97e-02

Pos logit contributions
lessly+0.029
ww+0.023
arest+0.021
 much+0.021
 precious+0.019
Neg logit contributions
keeping-0.036
birds-0.032
bird-0.032
 Lee-0.030
 Smith-0.030
Token but
Token ID475
Feature activation-3.72e-02

Pos logit contributions
lessly+0.053
ww+0.049
arest+0.043
br+0.041
 much+0.040
Neg logit contributions
keeping-0.057
birds-0.046
 Her-0.045
ASH-0.044
rones-0.044
Token it
Token ID340
Feature activation-4.28e-02

Pos logit contributions
lessly+0.064
ww+0.049
 much+0.048
 nice+0.046
 precious+0.046
Neg logit contributions
keeping-0.077
birds-0.065
growth-0.064
group-0.063
rones-0.063
Token felt
Token ID2936
Feature activation-6.07e-02

Pos logit contributions
lessly+0.089
ww+0.073
arest+0.070
 much+0.069
 precious+0.067
Neg logit contributions
keeping-0.068
birds-0.059
nob-0.057
group-0.057
ASH-0.057
Token so
Token ID523
Feature activation-5.57e-02

Pos logit contributions
lessly+0.083
ww+0.078
ang+0.065
arest+0.065
a+0.059
Neg logit contributions
keeping-0.124
 Her-0.112
group-0.112
 moderation-0.110
 lectures-0.110
Token com
Token ID401
Feature activation-8.49e-02

Pos logit contributions
lessly+0.068
ww+0.052
b+0.047
ang+0.047
br+0.046
Neg logit contributions
keeping-0.119
 Her-0.118
 moderation-0.105
growth-0.105
 The-0.104
Tokenfy
Token ID24928
Feature activation-2.27e-02

Pos logit contributions
lessly+0.102
 shiny+0.087
 much+0.081
 impressed+0.081
 nice+0.078
Neg logit contributions
keeping-0.179
seek-0.163
osc-0.155
tail-0.150
girls-0.148
Token and
Token ID290
Feature activation-4.30e-02

Pos logit contributions
lessly+0.041
 much+0.033
ww+0.033
 precious+0.032
 nice+0.031
Neg logit contributions
keeping-0.048
birds-0.037
group-0.036
growth-0.036
rones-0.036
Token warm
Token ID5814
Feature activation-2.00e-02

Pos logit contributions
lessly+0.062
ww+0.050
arest+0.047
ang+0.045
a+0.044
Neg logit contributions
keeping-0.097
 Her-0.076
nob-0.073
ASH-0.072
 lectures-0.072
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
lessly+0.039
 much+0.032
 precious+0.031
ww+0.031
 valuable+0.030
Neg logit contributions
keeping-0.040
birds-0.031
rones-0.030
ASH-0.030
group-0.030
Token She
Token ID1375
Feature activation-2.09e-02

Pos logit contributions
keeping+0.023
rones+0.018
birds+0.018
girls+0.018
group+0.018
Neg logit contributions
lessly-0.030
 much-0.026
 nice-0.023
 proud-0.023
 precious-0.023
Token beautiful
Token ID4950
Feature activation-2.17e-02

Pos logit contributions
lessly+0.037
 much+0.023
 precious+0.022
ust+0.022
ww+0.022
Neg logit contributions
keeping-0.052
birds-0.049
bird-0.047
rones-0.045
 Monkey-0.045
Token rainbow
Token ID27223
Feature activation-4.71e-02

Pos logit contributions
lessly+0.045
 much+0.035
 precious+0.032
ww+0.032
arest+0.031
Neg logit contributions
keeping-0.040
birds-0.035
rones-0.033
bird-0.033
group-0.033
Token!
Token ID0
Feature activation+6.19e-03

Pos logit contributions
lessly+0.087
 much+0.070
 valuable+0.067
 precious+0.066
 nice+0.064
Neg logit contributions
keeping-0.093
birds-0.086
bird-0.080
group-0.076
date-0.075
Token All
Token ID1439
Feature activation-5.55e-02

Pos logit contributions
keeping+0.013
group+0.010
date+0.010
rones+0.010
ASH+0.010
Neg logit contributions
lessly-0.012
 much-0.011
 nice-0.009
 proud-0.009
 valuable-0.009
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
lessly+0.112
ww+0.090
arest+0.089
 much+0.089
 precious+0.088
Neg logit contributions
keeping-0.096
birds-0.086
rones-0.079
bird-0.077
growth-0.076
Token colours
Token ID18915
Feature activation-8.49e-02

Pos logit contributions
lessly+0.033
 much+0.026
ww+0.025
 precious+0.024
 nice+0.024
Neg logit contributions
keeping-0.040
birds-0.034
rones-0.033
group-0.032
growth-0.032
Token were
Token ID547
Feature activation-2.82e-02

Pos logit contributions
ww+0.146
lessly+0.146
 precious+0.124
 valuable+0.123
 steep+0.122
Neg logit contributions
birds-0.143
keeping-0.135
bird-0.130
stick-0.117
growth-0.116
Token so
Token ID523
Feature activation-5.45e-02

Pos logit contributions
lessly+0.045
ww+0.036
 much+0.032
arest+0.032
 precious+0.030
Neg logit contributions
keeping-0.062
birds-0.055
rones-0.052
stick-0.050
bird-0.050
Token bright
Token ID6016
Feature activation-5.24e-02

Pos logit contributions
lessly+0.068
ww+0.055
 pounds+0.043
arest+0.043
hold+0.042
Neg logit contributions
birds-0.118
keeping-0.116
 Her-0.110
rones-0.108
 meetings-0.104
Token and
Token ID290
Feature activation-5.21e-02

Pos logit contributions
lessly+0.100
ww+0.081
 much+0.080
arest+0.079
 valuable+0.076
Neg logit contributions
keeping-0.099
birds-0.087
rones-0.080
growth-0.079
ASH-0.078
Token beautiful
Token ID4950
Feature activation-2.43e-02

Pos logit contributions
lessly+0.070
ww+0.050
arest+0.046
a+0.043
ust+0.041
Neg logit contributions
keeping-0.124
birds-0.110
rones-0.103
 preschool-0.103
growth-0.102
Token.
Token ID13
Feature activation+8.92e-03

Pos logit contributions
lessly+0.044
 much+0.036
ww+0.035
 precious+0.035
 nice+0.033
Neg logit contributions
keeping-0.044
birds-0.037
group-0.035
rones-0.035
date-0.034
Token Suddenly
Token ID24975
Feature activation-2.88e-02

Pos logit contributions
keeping+0.016
rones+0.013
ASH+0.012
girls+0.012
group+0.012
Neg logit contributions
lessly-0.020
 much-0.017
 proud-0.015
 nice-0.015
 good-0.015
Token,
Token ID11
Feature activation-4.12e-02

Pos logit contributions
lessly+0.050
 much+0.040
 nice+0.037
 precious+0.037
ww+0.036
Neg logit contributions
keeping-0.063
rones-0.052
birds-0.052
group-0.051
growth-0.050
Token the
Token ID262
Feature activation-2.02e-02

Pos logit contributions
lessly+0.080
 much+0.065
 precious+0.061
 nice+0.061
ww+0.060
Neg logit contributions
keeping-0.082
rones-0.067
birds-0.066
group-0.065
growth-0.064
Token paper
Token ID3348
Feature activation-4.58e-02

Pos logit contributions
lessly+0.030
 much+0.023
ww+0.020
 precious+0.019
 valuable+0.019
Neg logit contributions
keeping-0.049
birds-0.042
group-0.041
rones-0.041
growth-0.041
Token shapes
Token ID15268
Feature activation-8.46e-02

Pos logit contributions
lessly+0.082
 much+0.066
ww+0.062
 precious+0.060
 nice+0.060
Neg logit contributions
keeping-0.096
birds-0.083
growth-0.079
rones-0.078
stick-0.078
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
lessly+0.150
ww+0.146
 precious+0.133
 valuable+0.130
 much+0.128
Neg logit contributions
keeping-0.139
birds-0.136
bird-0.126
growth-0.123
stick-0.123
Token girl
Token ID2576
Feature activation+7.76e-03

Pos logit contributions
lessly+0.023
 much+0.018
 nice+0.017
 precious+0.017
 shiny+0.016
Neg logit contributions
keeping-0.027
rones-0.023
birds-0.022
group-0.022
growth-0.022
Token had
Token ID550
Feature activation+1.02e-03

Pos logit contributions
keeping+0.018
birds+0.015
rones+0.015
growth+0.014
group+0.014
Neg logit contributions
lessly-0.013
 much-0.010
ww-0.009
 nice-0.009
 precious-0.009
Token folded
Token ID24650
Feature activation-3.02e-03

Pos logit contributions
keeping+0.002
rones+0.002
birds+0.002
growth+0.002
ASH+0.002
Neg logit contributions
lessly-0.002
 much-0.001
ww-0.001
 precious-0.001
 nice-0.001
Token came
Token ID1625
Feature activation+9.62e-03

Pos logit contributions
lessly+0.005
 much+0.004
 nice+0.004
 precious+0.003
ww+0.003
Neg logit contributions
keeping-0.007
rones-0.006
birds-0.006
group-0.006
growth-0.006
Token horse
Token ID8223
Feature activation+6.68e-03

Pos logit contributions
lessly+0.038
 much+0.031
 impressed+0.029
ww+0.029
 pleased+0.029
Neg logit contributions
keeping-0.025
smith-0.021
group-0.019
 Lee-0.019
rones-0.019
Token was
Token ID373
Feature activation-2.72e-02

Pos logit contributions
keeping+0.016
rones+0.013
birds+0.013
group+0.013
growth+0.013
Neg logit contributions
lessly-0.010
 much-0.008
 nice-0.007
 precious-0.007
 shiny-0.007
Token actually
Token ID1682
Feature activation-2.98e-02

Pos logit contributions
lessly+0.047
ww+0.038
 much+0.033
arest+0.033
 precious+0.033
Neg logit contributions
keeping-0.057
rones-0.048
birds-0.048
group-0.046
 preschool-0.046
Token alive
Token ID6776
Feature activation-1.77e-02

Pos logit contributions
lessly+0.053
 much+0.040
ww+0.040
 precious+0.038
arest+0.037
Neg logit contributions
keeping-0.064
rones-0.052
birds-0.051
group-0.051
ASH-0.050
Token!
Token ID0
Feature activation+1.39e-02

Pos logit contributions
lessly+0.029
 much+0.023
 nice+0.021
 precious+0.021
 shiny+0.021
Neg logit contributions
keeping-0.040
rones-0.034
birds-0.033
group-0.033
growth-0.033
Token Start
Token ID7253
Feature activation-8.45e-02

Pos logit contributions
keeping+0.026
girls+0.022
rones+0.021
group+0.020
parents+0.020
Neg logit contributions
lessly-0.029
 much-0.027
 nice-0.023
 de-0.022
 proud-0.022
Tokenled
Token ID992
Feature activation-3.02e-02

Pos logit contributions
lessly+0.118
ww+0.106
 much+0.103
 steep+0.103
 precious+0.099
Neg logit contributions
keeping-0.158
group-0.141
birds-0.140
stick-0.138
bird-0.136
Token,
Token ID11
Feature activation-3.24e-02

Pos logit contributions
lessly+0.052
 much+0.042
 nice+0.039
 precious+0.038
ww+0.037
Neg logit contributions
keeping-0.066
rones-0.056
birds-0.055
group-0.054
growth-0.054
Token Lucy
Token ID22162
Feature activation-4.68e-03

Pos logit contributions
lessly+0.066
 much+0.054
 precious+0.051
 nice+0.051
ww+0.051
Neg logit contributions
keeping-0.061
birds-0.049
rones-0.049
group-0.047
growth-0.047
Token jumped
Token ID11687
Feature activation-1.86e-02

Pos logit contributions
lessly+0.007
 much+0.006
 nice+0.005
 precious+0.005
 proud+0.005
Neg logit contributions
keeping-0.011
rones-0.009
birds-0.009
group-0.009
growth-0.009
Token back
Token ID736
Feature activation+2.57e-02

Pos logit contributions
lessly+0.033
ww+0.029
 precious+0.028
 much+0.027
arest+0.027
Neg logit contributions
keeping-0.036
birds-0.029
group-0.029
ASH-0.029
stick-0.028
Token and
Token ID290
Feature activation-4.66e-02

Pos logit contributions
lessly+0.016
 much+0.012
 precious+0.011
ww+0.011
 nice+0.011
Neg logit contributions
keeping-0.022
rones-0.018
birds-0.018
ASH-0.017
group-0.017
Token cool
Token ID3608
Feature activation-1.69e-02

Pos logit contributions
lessly+0.068
ww+0.056
arest+0.053
a+0.044
ier+0.044
Neg logit contributions
keeping-0.098
girls-0.085
 lectures-0.084
 Chess-0.084
 preschool-0.083
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
lessly+0.032
 much+0.027
 precious+0.025
ww+0.025
 nice+0.024
Neg logit contributions
keeping-0.034
rones-0.027
birds-0.027
ASH-0.027
group-0.026
Token Tom
Token ID4186
Feature activation+5.11e-03

Pos logit contributions
keeping+0.036
seek+0.029
IT+0.029
girls+0.028
rones+0.028
Neg logit contributions
lessly-0.029
 much-0.027
 nice-0.023
 good-0.022
 shiny-0.022
Token wanted
Token ID2227
Feature activation+2.11e-02

Pos logit contributions
keeping+0.012
rones+0.010
birds+0.010
ASH+0.010
group+0.009
Neg logit contributions
lessly-0.009
 much-0.007
 precious-0.006
ww-0.006
 nice-0.006
Token to
Token ID284
Feature activation+6.77e-02

Pos logit contributions
keeping+0.053
group+0.047
IT+0.047
rones+0.047
date+0.047
Neg logit contributions
 much-0.028
 nice-0.027
 shiny-0.026
 good-0.025
lessly-0.025
Token play
Token ID711
Feature activation+2.48e-02

Pos logit contributions
keeping+0.144
rones+0.123
ASH+0.115
birds+0.114
group+0.113
Neg logit contributions
lessly-0.117
 much-0.106
 nice-0.105
 shiny-0.103
 proud-0.098
Token with
Token ID351
Feature activation+2.60e-02

Pos logit contributions
keeping+0.068
rones+0.059
growth+0.057
birds+0.056
girls+0.055
Neg logit contributions
lessly-0.031
 much-0.027
 nice-0.026
 proud-0.024
 shiny-0.024
Token it
Token ID340
Feature activation+1.95e-03

Pos logit contributions
keeping+0.068
rones+0.058
birds+0.056
date+0.056
group+0.056
Neg logit contributions
lessly-0.035
 much-0.030
 nice-0.029
 shiny-0.028
 good-0.027
Token,
Token ID11
Feature activation-3.73e-03

Pos logit contributions
keeping+0.005
rones+0.004
growth+0.004
birds+0.004
IT+0.004
Neg logit contributions
lessly-0.003
 much-0.002
 nice-0.002
 shiny-0.002
 good-0.002
Token but
Token ID475
Feature activation-2.34e-02

Pos logit contributions
lessly+0.006
 much+0.005
ww+0.005
 precious+0.004
 nice+0.004
Neg logit contributions
keeping-0.008
birds-0.007
rones-0.007
group-0.007
ASH-0.007
Token there
Token ID612
Feature activation-2.14e-03

Pos logit contributions
keeping+0.010
group+0.009
gy+0.009
birds+0.009
rones+0.009
Neg logit contributions
lessly-0.014
ww-0.010
 valuable-0.010
 precious-0.010
arest-0.010
Token was
Token ID373
Feature activation-1.90e-03

Pos logit contributions
lessly+0.003
 much+0.003
 nice+0.002
 shiny+0.002
 good+0.002
Neg logit contributions
keeping-0.005
rones-0.004
birds-0.004
growth-0.004
girls-0.004
Token a
Token ID257
Feature activation-1.63e-02

Pos logit contributions
lessly+0.003
 much+0.002
 nice+0.002
 shiny+0.002
 good+0.002
Neg logit contributions
keeping-0.005
birds-0.004
rones-0.004
girls-0.004
IT-0.004
Token useful
Token ID4465
Feature activation-1.49e-02

Pos logit contributions
lessly+0.027
 much+0.016
arest+0.015
ww+0.014
b+0.014
Neg logit contributions
keeping-0.034
group-0.033
birds-0.032
 preschool-0.032
 Lee-0.032
Token dog
Token ID3290
Feature activation+3.36e-02

Pos logit contributions
lessly+0.031
 much+0.025
 precious+0.024
 nice+0.023
ww+0.023
Neg logit contributions
keeping-0.027
rones-0.022
birds-0.022
group-0.022
ASH-0.021
Token named
Token ID3706
Feature activation+6.85e-02

Pos logit contributions
keeping+0.068
ASH+0.055
rones+0.055
growth+0.053
IT+0.053
Neg logit contributions
lessly-0.063
 much-0.058
 nice-0.054
 proud-0.052
 shiny-0.051
Token Spot
Token ID15899
Feature activation+5.76e-02

Pos logit contributions
keeping+0.141
rones+0.108
birds+0.107
ASH+0.106
group+0.105
Neg logit contributions
 much-0.126
 nice-0.125
lessly-0.124
 shiny-0.116
 sweet-0.116
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
keeping+0.146
ASH+0.124
rones+0.123
growth+0.123
birds+0.123
Neg logit contributions
lessly-0.079
 much-0.073
 nice-0.065
 good-0.060
 very-0.060
Token Spot
Token ID15899
Feature activation+1.71e-02

Pos logit contributions
keeping+0.031
birds+0.024
rones+0.024
ASH+0.023
group+0.023
Neg logit contributions
lessly-0.028
 much-0.026
 nice-0.025
 good-0.024
 shiny-0.024
Token liked
Token ID8288
Feature activation+2.24e-02

Pos logit contributions
keeping+0.046
rones+0.039
birds+0.039
group+0.039
growth+0.039
Neg logit contributions
lessly-0.021
 much-0.016
 nice-0.014
 precious-0.014
 shiny-0.013
Token to
Token ID284
Feature activation+5.38e-02

Pos logit contributions
keeping+0.050
group+0.043
rones+0.043
ASH+0.042
growth+0.042
Neg logit contributions
lessly-0.035
 much-0.033
 nice-0.032
 shiny-0.031
 good-0.030
Token was
Token ID373
Feature activation-1.61e-02

Pos logit contributions
lessly+0.008
 much+0.006
 nice+0.006
 precious+0.006
ww+0.005
Neg logit contributions
keeping-0.013
rones-0.011
birds-0.011
group-0.011
growth-0.011
Token very
Token ID845
Feature activation-3.94e-02

Pos logit contributions
lessly+0.016
 much+0.009
ww+0.008
 precious+0.008
 nice+0.007
Neg logit contributions
keeping-0.047
rones-0.042
birds-0.041
group-0.041
growth-0.040
Token hungry
Token ID14720
Feature activation+2.72e-03

Pos logit contributions
lessly+0.058
 much+0.036
ww+0.035
 steep+0.033
 precious+0.033
Neg logit contributions
keeping-0.092
rones-0.081
birds-0.080
group-0.080
 Her-0.079
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
keeping+0.006
birds+0.005
rones+0.005
group+0.005
growth+0.004
Neg logit contributions
lessly-0.005
ww-0.004
 much-0.004
 precious-0.004
 proud-0.004
Token wanted
Token ID2227
Feature activation+2.13e-02

Pos logit contributions
lessly+0.040
arest+0.029
 much+0.029
ww+0.029
 precious+0.029
Neg logit contributions
keeping-0.044
rones-0.036
 Her-0.036
birds-0.035
group-0.035
Token to
Token ID284
Feature activation+6.72e-02

Pos logit contributions
keeping+0.052
IT+0.046
group+0.045
ASH+0.045
date+0.044
Neg logit contributions
 much-0.032
 nice-0.028
lessly-0.027
 good-0.027
 shiny-0.027
Token eat
Token ID4483
Feature activation+2.85e-02

Pos logit contributions
keeping+0.161
rones+0.141
ASH+0.140
birds+0.140
 Lee+0.136
Neg logit contributions
 nice-0.089
 much-0.087
 good-0.082
 shiny-0.076
 big-0.076
Token an
Token ID281
Feature activation-1.12e-02

Pos logit contributions
keeping+0.059
rones+0.049
birds+0.048
ASH+0.047
group+0.047
Neg logit contributions
lessly-0.053
 much-0.045
 nice-0.042
 precious-0.041
 shiny-0.041
Token egg
Token ID5935
Feature activation+1.51e-02

Pos logit contributions
lessly+0.020
 much+0.017
 nice+0.016
 precious+0.016
 shiny+0.016
Neg logit contributions
keeping-0.023
rones-0.019
birds-0.019
ASH-0.019
growth-0.018
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
keeping+0.035
rones+0.031
ASH+0.031
growth+0.031
IT+0.030
Neg logit contributions
lessly-0.023
 much-0.019
 nice-0.018
 shiny-0.017
 precious-0.017
Token But
Token ID887
Feature activation-2.42e-02

Pos logit contributions
keeping+0.033
rones+0.025
birds+0.025
group+0.025
growth+0.024
Neg logit contributions
lessly-0.042
 much-0.037
 nice-0.035
 shiny-0.034
 precious-0.033
Token Can
Token ID1680
Feature activation+3.79e-02

Pos logit contributions
keeping+0.042
rones+0.035
ASH+0.031
growth+0.031
IT+0.031
Neg logit contributions
lessly-0.035
 much-0.030
ww-0.027
 nice-0.027
 proud-0.027
Token you
Token ID345
Feature activation+5.38e-02

Pos logit contributions
keeping+0.102
rones+0.089
group+0.083
growth+0.083
ASH+0.082
Neg logit contributions
lessly-0.048
 much-0.040
 nice-0.036
 proud-0.035
 precious-0.035
Token teach
Token ID4545
Feature activation+3.70e-02

Pos logit contributions
keeping+0.109
rones+0.087
ASH+0.082
birds+0.081
growth+0.080
Neg logit contributions
lessly-0.105
 much-0.094
 nice-0.093
 proud-0.088
 shiny-0.088
Token me
Token ID502
Feature activation+5.06e-02

Pos logit contributions
keeping+0.101
rones+0.085
group+0.084
growth+0.083
ASH+0.082
Neg logit contributions
 nice-0.040
lessly-0.039
 much-0.039
 good-0.034
 shiny-0.034
Token how
Token ID703
Feature activation+1.55e-03

Pos logit contributions
keeping+0.104
birds+0.091
rones+0.089
group+0.088
growth+0.086
Neg logit contributions
lessly-0.091
 much-0.069
ww-0.067
 precious-0.065
 nice-0.063
Token to
Token ID284
Feature activation+8.00e-02

Pos logit contributions
keeping+0.002
birds+0.002
rones+0.002
group+0.002
growth+0.002
Neg logit contributions
lessly-0.004
 much-0.003
ww-0.003
 precious-0.003
 nice-0.003
Token fly
Token ID6129
Feature activation+1.25e-02

Pos logit contributions
keeping+0.162
rones+0.133
ASH+0.125
birds+0.122
 Lee+0.122
Neg logit contributions
lessly-0.144
 nice-0.136
 much-0.132
 shiny-0.129
 proud-0.125
Token?"
Token ID1701
Feature activation+4.47e-02

Pos logit contributions
keeping+0.025
birds+0.022
rones+0.021
group+0.021
ASH+0.021
Neg logit contributions
lessly-0.023
 much-0.017
ww-0.017
 precious-0.016
 proud-0.016
Token The
Token ID383
Feature activation+6.65e-03

Pos logit contributions
keeping+0.104
 preschool+0.078
rones+0.077
parents+0.077
girls+0.075
Neg logit contributions
 much-0.070
 proud-0.069
 nice-0.067
 very-0.067
 wind-0.063
Token squash
Token ID34613
Feature activation-1.54e-02

Pos logit contributions
keeping+0.015
rones+0.013
growth+0.012
group+0.012
girls+0.012
Neg logit contributions
lessly-0.011
 nice-0.010
 much-0.009
 shiny-0.009
 precious-0.009
Token did
Token ID750
Feature activation+4.85e-03

Pos logit contributions
lessly+0.024
 much+0.019
 nice+0.018
 precious+0.017
 proud+0.017
Neg logit contributions
keeping-0.037
rones-0.031
growth-0.030
birds-0.030
group-0.030
Tokenuddle
Token ID24500
Feature activation-2.06e-02

Pos logit contributions
lessly+0.053
 much+0.044
 good+0.042
 valuable+0.042
 precious+0.041
Neg logit contributions
keeping-0.047
rones-0.042
 exhibit-0.041
group-0.040
birds-0.040
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
lessly+0.036
 much+0.030
 precious+0.030
ww+0.028
 proud+0.028
Neg logit contributions
keeping-0.044
birds-0.035
group-0.035
ASH-0.034
rones-0.034
Token "
Token ID366
Feature activation-1.05e-03

Pos logit contributions
keeping+0.028
date+0.021
girls+0.021
rones+0.021
group+0.021
Neg logit contributions
lessly-0.036
 much-0.031
 proud-0.028
 nice-0.028
 steep-0.027
TokenI
Token ID40
Feature activation+1.17e-02

Pos logit contributions
lessly+0.002
 much+0.002
 precious+0.002
 nice+0.002
 shiny+0.002
Neg logit contributions
keeping-0.002
rones-0.001
birds-0.001
growth-0.001
group-0.001
Token want
Token ID765
Feature activation+2.10e-02

Pos logit contributions
keeping+0.020
group+0.016
date+0.016
rones+0.016
birds+0.016
Neg logit contributions
lessly-0.025
ww-0.019
 much-0.019
 precious-0.018
 shiny-0.018
Token to
Token ID284
Feature activation+7.51e-02

Pos logit contributions
keeping+0.039
rones+0.031
birds+0.031
group+0.030
growth+0.030
Neg logit contributions
lessly-0.044
 much-0.036
 precious-0.034
 nice-0.034
 proud-0.033
Token play
Token ID711
Feature activation+3.34e-02

Pos logit contributions
keeping+0.153
rones+0.129
birds+0.125
ASH+0.122
group+0.121
Neg logit contributions
lessly-0.138
 much-0.118
 nice-0.112
 shiny-0.108
 good-0.106
Token!",
Token ID40754
Feature activation+2.69e-02

Pos logit contributions
keeping+0.077
rones+0.065
birds+0.064
growth+0.063
ASH+0.062
Neg logit contributions
lessly-0.055
 much-0.044
 nice-0.041
 precious-0.040
 shiny-0.039
Token said
Token ID531
Feature activation+1.42e-02

Pos logit contributions
keeping+0.058
birds+0.048
rones+0.048
growth+0.047
group+0.046
Neg logit contributions
lessly-0.047
 much-0.036
 precious-0.035
 nice-0.034
ww-0.034
Token Mary
Token ID5335
Feature activation+1.95e-02

Pos logit contributions
keeping+0.021
rones+0.016
ASH+0.015
girls+0.015
group+0.014
Neg logit contributions
lessly-0.035
 much-0.032
 nice-0.030
 proud-0.030
 good-0.029
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
keeping+0.042
rones+0.035
birds+0.034
group+0.034
growth+0.034
Neg logit contributions
lessly-0.035
 much-0.028
 nice-0.026
 precious-0.026
 proud-0.025
Token Mom
Token ID11254
Feature activation+1.89e-02

Pos logit contributions
lessly+0.009
 much+0.008
arest+0.008
ww+0.008
b+0.008
Neg logit contributions
keeping-0.004
 Lee-0.004
birds-0.003
ASH-0.003
group-0.003
Tokenmy
Token ID1820
Feature activation-2.57e-03

Pos logit contributions
keeping+0.036
rones+0.030
growth+0.029
birds+0.029
group+0.029
Neg logit contributions
lessly-0.038
 much-0.032
 nice-0.030
 precious-0.029
 proud-0.029
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
lessly+0.005
 much+0.004
 precious+0.004
 nice+0.004
ww+0.003
Neg logit contributions
keeping-0.005
rones-0.004
birds-0.004
ASH-0.004
group-0.004
Token I
Token ID314
Feature activation+4.59e-03

Pos logit contributions
keeping+0.046
rones+0.040
growth+0.038
birds+0.038
group+0.038
Neg logit contributions
lessly-0.023
 much-0.019
 nice-0.017
 precious-0.017
 proud-0.017
Token didn
Token ID1422
Feature activation+1.91e-02

Pos logit contributions
keeping+0.010
rones+0.008
birds+0.008
ASH+0.008
group+0.008
Neg logit contributions
lessly-0.008
 much-0.006
 nice-0.006
 precious-0.006
ww-0.006
Token't
Token ID470
Feature activation+1.12e-02

Pos logit contributions
keeping+0.034
rones+0.028
birds+0.025
group+0.025
ASH+0.025
Neg logit contributions
lessly-0.042
 much-0.039
 nice-0.035
 shiny-0.035
 hard-0.034
Token mean
Token ID1612
Feature activation+2.92e-02

Pos logit contributions
keeping+0.027
rones+0.022
birds+0.022
group+0.022
ASH+0.022
Neg logit contributions
lessly-0.018
 much-0.013
 precious-0.012
 nice-0.012
ww-0.012
Token to
Token ID284
Feature activation+6.59e-02

Pos logit contributions
keeping+0.055
rones+0.045
birds+0.044
group+0.043
growth+0.043
Neg logit contributions
lessly-0.060
 much-0.051
 nice-0.048
 precious-0.047
 shiny-0.046
Token break
Token ID2270
Feature activation+1.59e-02

Pos logit contributions
keeping+0.154
rones+0.133
birds+0.129
group+0.128
growth+0.127
Neg logit contributions
lessly-0.103
 much-0.087
 nice-0.082
 shiny-0.079
 precious-0.077
Token the
Token ID262
Feature activation+8.11e-03

Pos logit contributions
keeping+0.033
rones+0.027
birds+0.027
group+0.027
growth+0.026
Neg logit contributions
lessly-0.030
 much-0.025
 nice-0.023
 precious-0.023
 shiny-0.022
Token v
Token ID410
Feature activation+1.31e-02

Pos logit contributions
keeping+0.020
rones+0.017
birds+0.017
growth+0.017
group+0.016
Neg logit contributions
lessly-0.012
 much-0.009
 nice-0.009
 precious-0.009
 shiny-0.009
Token was
Token ID373
Feature activation+1.26e-02

Pos logit contributions
keeping+0.019
rones+0.016
girls+0.016
birds+0.016
group+0.016
Neg logit contributions
lessly-0.009
 much-0.008
 nice-0.007
 shiny-0.007
 precious-0.007
Token Fred
Token ID8559
Feature activation+3.79e-02

Pos logit contributions
keeping+0.024
ASH+0.018
birds+0.018
rones+0.018
group+0.018
Neg logit contributions
lessly-0.026
 much-0.025
 nice-0.024
 good-0.023
 shiny-0.023
Token and
Token ID290
Feature activation-2.10e-03

Pos logit contributions
keeping+0.080
rones+0.066
birds+0.065
group+0.064
growth+0.063
Neg logit contributions
lessly-0.069
 much-0.056
 nice-0.052
 precious-0.052
ww-0.051
Token he
Token ID339
Feature activation-8.38e-03

Pos logit contributions
lessly+0.003
 much+0.002
ww+0.002
 precious+0.002
arest+0.002
Neg logit contributions
keeping-0.005
rones-0.004
birds-0.004
growth-0.004
group-0.004
Token liked
Token ID8288
Feature activation+2.41e-02

Pos logit contributions
lessly+0.011
 much+0.008
 nice+0.008
 precious+0.007
 shiny+0.007
Neg logit contributions
keeping-0.022
rones-0.019
birds-0.019
group-0.018
ASH-0.018
Token to
Token ID284
Feature activation+5.57e-02

Pos logit contributions
keeping+0.055
group+0.050
ASH+0.049
rones+0.049
growth+0.049
Neg logit contributions
lessly-0.034
 much-0.033
 nice-0.032
 shiny-0.032
 good-0.030
Token design
Token ID1486
Feature activation+1.77e-02

Pos logit contributions
keeping+0.116
ASH+0.097
rones+0.096
birds+0.093
girls+0.091
Neg logit contributions
 nice-0.090
lessly-0.089
 much-0.088
 shiny-0.088
 good-0.083
Token things
Token ID1243
Feature activation-1.22e-02

Pos logit contributions
keeping+0.038
rones+0.032
birds+0.032
group+0.031
growth+0.031
Neg logit contributions
lessly-0.031
 much-0.025
 precious-0.023
 nice-0.023
ww-0.023
Token with
Token ID351
Feature activation+1.12e-02

Pos logit contributions
lessly+0.024
ww+0.020
 much+0.019
 precious+0.018
 steep+0.018
Neg logit contributions
keeping-0.022
birds-0.018
group-0.018
rones-0.017
girls-0.017
Token his
Token ID465
Feature activation+2.00e-02

Pos logit contributions
keeping+0.025
rones+0.020
birds+0.020
ASH+0.020
group+0.020
Neg logit contributions
lessly-0.020
 much-0.016
 nice-0.015
 shiny-0.015
 precious-0.015
Token friends
Token ID2460
Feature activation+4.78e-02

Pos logit contributions
keeping+0.042
rones+0.035
birds+0.034
group+0.033
ASH+0.033
Neg logit contributions
lessly-0.036
 much-0.031
 nice-0.029
 precious-0.029
 shiny-0.029
Token't
Token ID470
Feature activation+1.31e-02

Pos logit contributions
keeping+0.045
rones+0.035
girls+0.033
birds+0.032
ASH+0.032
Neg logit contributions
lessly-0.056
 much-0.052
 nice-0.046
 hard-0.046
 shiny-0.046
Token find
Token ID1064
Feature activation+2.90e-02

Pos logit contributions
keeping+0.022
rones+0.018
birds+0.018
ASH+0.017
girls+0.017
Neg logit contributions
lessly-0.029
 much-0.026
 nice-0.024
 proud-0.024
 shiny-0.024
Token the
Token ID262
Feature activation-1.19e-03

Pos logit contributions
keeping+0.062
rones+0.051
growth+0.050
birds+0.050
group+0.050
Neg logit contributions
lessly-0.052
 much-0.040
ww-0.039
 precious-0.038
 proud-0.037
Token perfect
Token ID2818
Feature activation+2.54e-03

Pos logit contributions
lessly+0.002
ww+0.002
 much+0.002
 precious+0.001
arest+0.001
Neg logit contributions
keeping-0.002
group-0.002
rones-0.002
ASH-0.002
 preschool-0.002
Token one
Token ID530
Feature activation-3.81e-03

Pos logit contributions
keeping+0.004
group+0.003
ASH+0.003
rones+0.003
 preschool+0.003
Neg logit contributions
lessly-0.006
 much-0.005
ww-0.005
 precious-0.005
 valuable-0.005
Token.
Token ID13
Feature activation+2.48e-02

Pos logit contributions
lessly+0.007
 much+0.006
 nice+0.005
 precious+0.005
ww+0.005
Neg logit contributions
keeping-0.008
rones-0.007
birds-0.006
group-0.006
growth-0.006
Token
Token ID198
Feature activation-2.37e-02

Pos logit contributions
keeping+0.055
group+0.043
birds+0.042
rones+0.042
IT+0.041
Neg logit contributions
lessly-0.044
 much-0.041
 nice-0.036
 good-0.034
 hard-0.034
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
lessly+0.044
 much+0.041
 nice+0.036
 proud+0.034
 shiny+0.034
Neg logit contributions
keeping-0.050
ASH-0.040
IT-0.039
birds-0.039
rones-0.038
TokenOne
Token ID3198
Feature activation-2.37e-02

Pos logit contributions
lessly+0.032
 precious+0.025
 much+0.025
 proud+0.025
ww+0.025
Neg logit contributions
keeping-0.022
rones-0.019
birds-0.018
group-0.018
growth-0.018
Token day
Token ID1110
Feature activation-1.35e-02

Pos logit contributions
lessly+0.045
 much+0.033
 precious+0.032
ww+0.032
 proud+0.031
Neg logit contributions
keeping-0.046
birds-0.040
rones-0.039
group-0.039
growth-0.038
Token,
Token ID11
Feature activation-2.97e-02

Pos logit contributions
lessly+0.026
 much+0.020
 precious+0.020
 valuable+0.019
 proud+0.019
Neg logit contributions
keeping-0.026
rones-0.022
birds-0.022
growth-0.021
group-0.021
Token they
Token ID484
Feature activation-1.20e-02

Pos logit contributions
lessly+0.060
 much+0.049
 nice+0.043
 precious+0.042
 shiny+0.041
Neg logit contributions
keeping-0.108
rones-0.094
birds-0.092
ASH-0.091
group-0.091
Token walked
Token ID6807
Feature activation+7.49e-03

Pos logit contributions
lessly+0.024
 much+0.019
 precious+0.019
 valuable+0.019
 important+0.018
Neg logit contributions
keeping-0.021
birds-0.020
growth-0.019
rones-0.019
bird-0.018
Token away
Token ID1497
Feature activation-2.71e-02

Pos logit contributions
keeping+0.015
birds+0.012
group+0.012
rones+0.012
girls+0.012
Neg logit contributions
lessly-0.014
 much-0.011
 precious-0.011
ww-0.011
 valuable-0.011
Token,
Token ID11
Feature activation-2.88e-02

Pos logit contributions
lessly+0.052
 much+0.042
ww+0.041
 precious+0.041
 valuable+0.039
Neg logit contributions
keeping-0.053
birds-0.043
group-0.042
rones-0.041
 Lee-0.041
Token Lily
Token ID20037
Feature activation-9.24e-03

Pos logit contributions
lessly+0.050
 much+0.044
 nice+0.040
 good+0.040
 proud+0.040
Neg logit contributions
keeping-0.064
rones-0.054
birds-0.052
ASH-0.052
growth-0.051
Token's
Token ID338
Feature activation+1.61e-02

Pos logit contributions
lessly+0.017
 much+0.014
 precious+0.013
 nice+0.013
ww+0.012
Neg logit contributions
keeping-0.019
rones-0.016
birds-0.016
group-0.016
ASH-0.016
Token dad
Token ID9955
Feature activation+1.25e-02

Pos logit contributions
keeping+0.029
birds+0.023
rones+0.023
 preschool+0.023
ASH+0.022
Neg logit contributions
lessly-0.035
 much-0.028
ww-0.027
 valuable-0.025
 precious-0.025
Token said
Token ID531
Feature activation-3.14e-04

Pos logit contributions
keeping+0.028
rones+0.024
birds+0.023
group+0.023
growth+0.023
Neg logit contributions
lessly-0.021
 much-0.017
 nice-0.015
 precious-0.015
 proud-0.015
Token,
Token ID11
Feature activation-6.86e-03

Pos logit contributions
lessly+0.001
ww+0.000
 precious+0.000
 steep+0.000
 much+0.000
Neg logit contributions
keeping-0.001
birds-0.000
 Lee-0.000
 Her-0.000
group-0.000
Token "
Token ID366
Feature activation+1.01e-02

Pos logit contributions
lessly+0.013
 much+0.011
 precious+0.010
ww+0.010
 nice+0.010
Neg logit contributions
keeping-0.013
birds-0.011
rones-0.011
group-0.011
ASH-0.010
TokenDid
Token ID11633
Feature activation+1.49e-02

Pos logit contributions
keeping+0.020
birds+0.016
rones+0.016
group+0.016
ASH+0.016
Neg logit contributions
lessly-0.019
 much-0.016
 precious-0.015
 nice-0.015
 shiny-0.015
Token was
Token ID373
Feature activation-2.34e-02

Pos logit contributions
lessly+0.005
 much+0.004
 precious+0.004
 nice+0.004
ww+0.004
Neg logit contributions
keeping-0.006
rones-0.005
birds-0.005
group-0.005
ASH-0.005
Token not
Token ID407
Feature activation-3.54e-02

Pos logit contributions
lessly+0.028
ww+0.017
 much+0.017
 precious+0.015
 nice+0.014
Neg logit contributions
keeping-0.064
rones-0.056
group-0.056
birds-0.055
ASH-0.054
Token sure
Token ID1654
Feature activation+9.87e-03

Pos logit contributions
lessly+0.042
ww+0.032
 steep+0.026
arest+0.026
 much+0.024
Neg logit contributions
keeping-0.096
rones-0.078
group-0.078
 Lee-0.077
 preschool-0.077
Token if
Token ID611
Feature activation+7.54e-03

Pos logit contributions
keeping+0.022
birds+0.018
rones+0.018
 Lee+0.018
group+0.018
Neg logit contributions
lessly-0.016
ww-0.013
 much-0.012
 precious-0.011
 proud-0.011
Token they
Token ID484
Feature activation+2.32e-02

Pos logit contributions
keeping+0.021
rones+0.018
ASH+0.018
birds+0.018
IT+0.018
Neg logit contributions
lessly-0.009
 much-0.007
 nice-0.006
 precious-0.006
 shiny-0.006
Token should
Token ID815
Feature activation+5.96e-02

Pos logit contributions
keeping+0.058
rones+0.051
birds+0.050
ASH+0.049
group+0.049
Neg logit contributions
lessly-0.033
 much-0.026
 nice-0.023
 precious-0.023
 shiny-0.022
Token buy
Token ID2822
Feature activation+2.72e-02

Pos logit contributions
keeping+0.096
rones+0.075
birds+0.073
group+0.072
growth+0.071
Neg logit contributions
lessly-0.138
 much-0.117
 nice-0.111
 precious-0.111
ww-0.109
Token the
Token ID262
Feature activation+6.30e-03

Pos logit contributions
keeping+0.060
rones+0.050
birds+0.049
group+0.048
growth+0.048
Neg logit contributions
lessly-0.047
 much-0.038
 nice-0.035
 precious-0.035
ww-0.034
Token shelf
Token ID18316
Feature activation+8.98e-03

Pos logit contributions
keeping+0.014
rones+0.011
birds+0.011
group+0.011
growth+0.011
Neg logit contributions
lessly-0.011
 much-0.009
 nice-0.008
 precious-0.008
ww-0.008
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
keeping+0.019
group+0.016
birds+0.016
rones+0.016
ASH+0.015
Neg logit contributions
lessly-0.016
 much-0.012
ww-0.012
 precious-0.011
 nice-0.011
Token so
Token ID523
Feature activation-3.82e-02

Pos logit contributions
lessly+0.022
 much+0.016
ww+0.016
 precious+0.016
 nice+0.015
Neg logit contributions
keeping-0.035
birds-0.029
rones-0.029
group-0.029
ASH-0.028
Token mom
Token ID1995
Feature activation+1.72e-02

Pos logit contributions
keeping+0.036
rones+0.030
growth+0.029
birds+0.029
group+0.028
Neg logit contributions
lessly-0.032
 much-0.027
 nice-0.026
 precious-0.026
 shiny-0.026
Token said
Token ID531
Feature activation+5.96e-03

Pos logit contributions
keeping+0.035
birds+0.030
rones+0.029
girls+0.029
 Lee+0.028
Neg logit contributions
lessly-0.030
 much-0.023
ww-0.023
 precious-0.022
 shiny-0.021
Token yes
Token ID3763
Feature activation+1.41e-02

Pos logit contributions
keeping+0.012
birds+0.010
group+0.010
rones+0.010
ASH+0.010
Neg logit contributions
lessly-0.011
 much-0.008
ww-0.008
 precious-0.008
 nice-0.008
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
keeping+0.028
birds+0.022
group+0.022
rones+0.022
ASH+0.022
Neg logit contributions
lessly-0.027
 much-0.022
 precious-0.021
ww-0.020
 shiny-0.020
Token
Token ID198
Feature activation-2.66e-02

Pos logit contributions
keeping+0.035
growth+0.030
nob+0.029
smith+0.029
seek+0.028
Neg logit contributions
lessly-0.019
 much-0.019
 de-0.015
 tight-0.014
 proud-0.014
Token
Token ID198
Feature activation-2.10e-02

Pos logit contributions
lessly+0.044
 much+0.040
 nice+0.034
a+0.033
 proud+0.033
Neg logit contributions
keeping-0.061
ASH-0.051
IT-0.050
growth-0.049
rones-0.048
TokenThe
Token ID464
Feature activation-1.72e-02

Pos logit contributions
lessly+0.034
 much+0.027
 nice+0.025
ww+0.024
 precious+0.024
Neg logit contributions
keeping-0.050
rones-0.042
birds-0.041
growth-0.041
ASH-0.040
Token little
Token ID1310
Feature activation-1.31e-02

Pos logit contributions
lessly+0.030
 much+0.024
 precious+0.021
 nice+0.021
ww+0.021
Neg logit contributions
keeping-0.037
rones-0.031
birds-0.031
group-0.030
ASH-0.030
Token girl
Token ID2576
Feature activation-3.00e-03

Pos logit contributions
lessly+0.019
 much+0.015
 nice+0.014
 precious+0.014
 shiny+0.013
Neg logit contributions
keeping-0.032
rones-0.028
birds-0.027
growth-0.027
group-0.027
Token was
Token ID373
Feature activation-2.78e-02

Pos logit contributions
lessly+0.005
 much+0.004
 nice+0.003
 precious+0.003
 proud+0.003
Neg logit contributions
keeping-0.007
rones-0.006
birds-0.006
group-0.006
growth-0.006
Token so
Token ID523
Feature activation-4.96e-02

Pos logit contributions
lessly+0.036
ww+0.023
 much+0.022
 precious+0.022
 valuable+0.020
Neg logit contributions
keeping-0.073
rones-0.064
birds-0.063
group-0.062
ASH-0.062
Token the
Token ID262
Feature activation+3.58e-03

Pos logit contributions
keeping+0.042
rones+0.036
growth+0.034
group+0.033
ASH+0.033
Neg logit contributions
lessly-0.025
 much-0.023
 nice-0.022
 good-0.022
 proud-0.021
Token truck
Token ID7779
Feature activation+2.84e-02

Pos logit contributions
keeping+0.008
rones+0.007
birds+0.007
group+0.006
growth+0.006
Neg logit contributions
lessly-0.006
 much-0.005
 nice-0.005
 precious-0.005
ww-0.004
Token driver
Token ID4639
Feature activation+3.21e-02

Pos logit contributions
keeping+0.069
rones+0.059
growth+0.059
birds+0.057
group+0.057
Neg logit contributions
lessly-0.043
 much-0.035
 nice-0.032
 proud-0.030
 shiny-0.029
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
keeping+0.080
rones+0.069
growth+0.068
IT+0.067
ASH+0.067
Neg logit contributions
lessly-0.047
 much-0.038
 nice-0.035
 proud-0.033
 good-0.032
Token He
Token ID679
Feature activation-1.73e-02

Pos logit contributions
keeping+0.023
rones+0.018
birds+0.018
IT+0.018
ASH+0.018
Neg logit contributions
lessly-0.022
 much-0.020
 nice-0.018
 proud-0.018
 good-0.018
Token thanks
Token ID5176
Feature activation+5.98e-03

Pos logit contributions
lessly+0.037
 much+0.030
ww+0.029
 nice+0.028
 precious+0.028
Neg logit contributions
keeping-0.029
rones-0.025
birds-0.025
girls-0.024
group-0.024
Token them
Token ID606
Feature activation+2.24e-02

Pos logit contributions
keeping+0.013
rones+0.011
IT+0.011
ASH+0.011
group+0.010
Neg logit contributions
lessly-0.011
 much-0.009
 nice-0.009
 good-0.008
 shiny-0.008
Token back
Token ID736
Feature activation+2.70e-02

Pos logit contributions
keeping+0.057
rones+0.049
growth+0.048
ASH+0.048
IT+0.048
Neg logit contributions
lessly-0.033
 much-0.027
 nice-0.024
 good-0.023
 proud-0.022
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
keeping+0.061
rones+0.053
growth+0.051
ASH+0.051
group+0.051
Neg logit contributions
lessly-0.045
 much-0.037
 nice-0.034
 precious-0.033
 good-0.032
Token He
Token ID679
Feature activation-1.77e-02

Pos logit contributions
keeping+0.022
birds+0.018
IT+0.018
ASH+0.017
rones+0.017
Neg logit contributions
lessly-0.021
 much-0.019
 nice-0.017
 good-0.017
 proud-0.017
Token gives
Token ID3607
Feature activation+1.25e-02

Pos logit contributions
lessly+0.038
 much+0.030
ww+0.029
 nice+0.028
 precious+0.028
Neg logit contributions
keeping-0.030
rones-0.026
birds-0.026
girls-0.025
group-0.025
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
lessly+0.084
ww+0.076
arest+0.072
 precious+0.066
 much+0.065
Neg logit contributions
keeping-0.053
birds-0.044
group-0.042
 Lee-0.042
 Her-0.041
Token
Token ID198
Feature activation-3.81e-02

Pos logit contributions
keeping+0.039
birds+0.031
rones+0.031
ASH+0.030
group+0.030
Neg logit contributions
lessly-0.044
 much-0.038
 nice-0.035
 proud-0.034
 precious-0.034
Token
Token ID198
Feature activation-3.08e-02

Pos logit contributions
lessly+0.078
 much+0.068
 nice+0.062
 proud+0.061
 good+0.060
Neg logit contributions
keeping-0.072
birds-0.058
rones-0.057
group-0.056
ASH-0.056
Token"
Token ID1
Feature activation-2.09e-03

Pos logit contributions
lessly+0.060
 precious+0.051
 shiny+0.047
 valuable+0.047
 much+0.046
Neg logit contributions
keeping-0.053
group-0.047
rones-0.044
growth-0.042
ASH-0.042
TokenLook
Token ID8567
Feature activation+1.02e-02

Pos logit contributions
lessly+0.005
 much+0.004
 precious+0.004
 nice+0.004
 proud+0.003
Neg logit contributions
keeping-0.004
rones-0.003
group-0.003
birds-0.003
ASH-0.003
Token,
Token ID11
Feature activation-3.52e-03

Pos logit contributions
keeping+0.019
birds+0.017
rones+0.016
 Lee+0.016
group+0.016
Neg logit contributions
lessly-0.020
 much-0.015
 valuable-0.015
 precious-0.014
 steep-0.014
Token Sara
Token ID24799
Feature activation-7.16e-03

Pos logit contributions
lessly+0.007
 much+0.006
 precious+0.006
 proud+0.005
 nice+0.005
Neg logit contributions
keeping-0.006
rones-0.005
group-0.005
birds-0.005
growth-0.005
Token,
Token ID11
Feature activation-7.05e-03

Pos logit contributions
lessly+0.013
 much+0.010
 precious+0.010
 nice+0.009
 valuable+0.009
Neg logit contributions
keeping-0.015
rones-0.012
birds-0.012
ASH-0.012
group-0.012
Token my
Token ID616
Feature activation-1.73e-04

Pos logit contributions
lessly+0.014
 much+0.010
 precious+0.010
 valuable+0.010
 proud+0.010
Neg logit contributions
keeping-0.014
 Lee-0.011
birds-0.011
ASH-0.011
growth-0.011
Token block
Token ID2512
Feature activation-2.60e-02

Pos logit contributions
lessly+0.000
ww+0.000
 much+0.000
 valuable+0.000
 precious+0.000
Neg logit contributions
ASH-0.000
keeping-0.000
 exhibit-0.000
birds-0.000
rones-0.000
Token fits
Token ID11414
Feature activation+6.14e-03

Pos logit contributions
lessly+0.059
 much+0.044
 valuable+0.044
 precious+0.043
 pleased+0.042
Neg logit contributions
keeping-0.041
ASH-0.037
birds-0.034
stick-0.033
gy-0.033
Token
Token ID198
Feature activation-2.67e-02

Pos logit contributions
lessly+0.054
 much+0.048
 nice+0.043
 proud+0.042
 shiny+0.040
Neg logit contributions
keeping-0.061
ASH-0.050
IT-0.049
group-0.049
rones-0.049
TokenBen
Token ID11696
Feature activation-3.86e-03

Pos logit contributions
lessly+0.044
 much+0.035
 nice+0.033
 precious+0.032
ww+0.031
Neg logit contributions
keeping-0.061
rones-0.051
birds-0.051
group-0.050
growth-0.050
Token felt
Token ID2936
Feature activation-5.99e-02

Pos logit contributions
lessly+0.006
 much+0.005
 precious+0.005
 nice+0.005
ww+0.005
Neg logit contributions
keeping-0.009
rones-0.007
birds-0.007
ASH-0.007
group-0.007
Token bad
Token ID2089
Feature activation-5.45e-03

Pos logit contributions
lessly+0.100
ww+0.090
arest+0.075
 steep+0.074
ust+0.071
Neg logit contributions
keeping-0.115
 Her-0.101
group-0.098
 lectures-0.098
growth-0.097
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
lessly+0.010
 much+0.008
ww+0.007
 precious+0.007
 nice+0.007
Neg logit contributions
keeping-0.012
birds-0.009
rones-0.009
group-0.009
ASH-0.009
Token He
Token ID679
Feature activation-2.02e-02

Pos logit contributions
keeping+0.032
rones+0.026
birds+0.026
group+0.025
growth+0.025
Neg logit contributions
lessly-0.029
 much-0.024
 nice-0.022
 precious-0.021
 proud-0.021
Token realized
Token ID6939
Feature activation-2.13e-02

Pos logit contributions
lessly+0.040
arest+0.031
ww+0.031
 much+0.031
 steep+0.029
Neg logit contributions
keeping-0.034
rones-0.030
 Her-0.029
IT-0.029
ASH-0.029
Token he
Token ID339
Feature activation+6.57e-03

Pos logit contributions
lessly+0.031
ww+0.023
 much+0.023
 precious+0.022
 nice+0.021
Neg logit contributions
keeping-0.050
birds-0.043
rones-0.042
 Lee-0.042
growth-0.041
Token had
Token ID550
Feature activation-1.06e-03

Pos logit contributions
keeping+0.015
rones+0.012
birds+0.012
group+0.012
ASH+0.012
Neg logit contributions
lessly-0.011
 much-0.008
ww-0.008
 nice-0.008
 precious-0.008
Token done
Token ID1760
Feature activation+1.00e-02

Pos logit contributions
lessly+0.002
 much+0.001
ww+0.001
 precious+0.001
 nice+0.001
Neg logit contributions
keeping-0.002
rones-0.002
birds-0.002
ASH-0.002
group-0.002
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
keeping+0.025
rones+0.022
birds+0.022
growth+0.021
girls+0.021
Neg logit contributions
lessly-0.014
 much-0.013
 nice-0.012
 good-0.011
 proud-0.011
Token top
Token ID1353
Feature activation-2.77e-02

Pos logit contributions
keeping+0.015
rones+0.012
birds+0.012
girls+0.012
growth+0.012
Neg logit contributions
lessly-0.018
 much-0.016
 nice-0.016
 shiny-0.015
 precious-0.015
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
lessly+0.033
 much+0.024
 nice+0.021
 precious+0.021
 shiny+0.020
Neg logit contributions
keeping-0.075
rones-0.065
birds-0.065
group-0.064
growth-0.064
Token It
Token ID632
Feature activation-3.81e-02

Pos logit contributions
keeping+0.018
girls+0.014
ASH+0.014
stick+0.013
IT+0.013
Neg logit contributions
lessly-0.019
 much-0.018
 nice-0.015
 proud-0.014
 fine-0.014
Token was
Token ID373
Feature activation-2.83e-02

Pos logit contributions
lessly+0.074
 much+0.057
ww+0.055
 precious+0.054
 nice+0.053
Neg logit contributions
keeping-0.073
birds-0.063
rones-0.062
ASH-0.061
group-0.060
Token called
Token ID1444
Feature activation+1.26e-02

Pos logit contributions
lessly+0.044
ww+0.033
arest+0.031
 much+0.029
 precious+0.029
Neg logit contributions
keeping-0.065
rones-0.056
birds-0.055
ASH-0.054
group-0.054
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
keeping+0.028
rones+0.023
birds+0.022
girls+0.022
ASH+0.022
Neg logit contributions
lessly-0.021
 much-0.019
 nice-0.018
 shiny-0.017
 good-0.017
Token attic
Token ID46113
Feature activation-2.27e-02

Pos logit contributions
lessly+0.019
 much+0.015
 nice+0.015
 precious+0.014
 shiny+0.014
Neg logit contributions
keeping-0.021
rones-0.018
birds-0.017
group-0.017
growth-0.017
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
lessly+0.042
 much+0.032
ww+0.032
 precious+0.031
 nice+0.030
Neg logit contributions
keeping-0.047
birds-0.039
group-0.038
rones-0.037
ASH-0.037
Token The
Token ID383
Feature activation-1.50e-02

Pos logit contributions
keeping+0.032
ASH+0.025
IT+0.025
seek+0.024
stick+0.024
Neg logit contributions
lessly-0.028
 much-0.026
 nice-0.021
 proud-0.020
 fine-0.020
Token attic
Token ID46113
Feature activation-3.97e-02

Pos logit contributions
lessly+0.033
 much+0.027
ww+0.026
 precious+0.025
 valuable+0.025
Neg logit contributions
keeping-0.025
group-0.020
birds-0.019
rones-0.019
ASH-0.019
Token was
Token ID373
Feature activation-3.49e-02

Pos logit contributions
lessly+0.066
 much+0.052
 nice+0.048
 precious+0.048
ww+0.047
Neg logit contributions
keeping-0.089
rones-0.075
birds-0.074
group-0.073
ASH-0.072
Token If
Token ID1002
Feature activation-6.10e-03

Pos logit contributions
keeping+0.024
rones+0.020
seek+0.019
group+0.019
IT+0.018
Neg logit contributions
lessly-0.022
 much-0.019
 nice-0.017
 precious-0.017
 valuable-0.016
Token you
Token ID345
Feature activation+2.32e-02

Pos logit contributions
lessly+0.010
 much+0.008
 nice+0.007
 precious+0.007
 shiny+0.007
Neg logit contributions
keeping-0.014
rones-0.012
birds-0.012
group-0.012
ASH-0.011
Token open
Token ID1280
Feature activation+2.46e-02

Pos logit contributions
keeping+0.051
rones+0.043
birds+0.042
ASH+0.042
group+0.042
Neg logit contributions
lessly-0.040
 much-0.032
 nice-0.030
 precious-0.030
 shiny-0.029
Token it
Token ID340
Feature activation+2.45e-02

Pos logit contributions
keeping+0.062
rones+0.053
birds+0.052
group+0.052
girls+0.052
Neg logit contributions
lessly-0.035
 much-0.027
 nice-0.025
 precious-0.024
 shiny-0.024
Token,
Token ID11
Feature activation-2.02e-02

Pos logit contributions
keeping+0.063
rones+0.053
growth+0.053
birds+0.052
girls+0.051
Neg logit contributions
lessly-0.035
 much-0.029
 nice-0.027
 good-0.026
 hard-0.025
Token something
Token ID1223
Feature activation-4.82e-02

Pos logit contributions
lessly+0.037
 much+0.028
ww+0.027
 precious+0.026
 proud+0.026
Neg logit contributions
keeping-0.041
birds-0.034
growth-0.033
group-0.033
rones-0.032
Token good
Token ID922
Feature activation-2.06e-02

Pos logit contributions
lessly+0.095
ww+0.066
 much+0.064
bling+0.064
 steep+0.062
Neg logit contributions
keeping-0.087
birds-0.076
 exhibit-0.072
 Her-0.071
 meetings-0.070
Token will
Token ID481
Feature activation+5.00e-03

Pos logit contributions
lessly+0.035
 much+0.028
 precious+0.025
 nice+0.025
 proud+0.025
Neg logit contributions
keeping-0.045
birds-0.037
rones-0.037
group-0.037
ASH-0.036
Token happen
Token ID1645
Feature activation+1.80e-02

Pos logit contributions
keeping+0.009
birds+0.007
growth+0.007
group+0.007
IT+0.007
Neg logit contributions
lessly-0.010
 much-0.008
ww-0.008
 steep-0.008
 proud-0.008
Token."
Token ID526
Feature activation+2.96e-02

Pos logit contributions
keeping+0.036
rones+0.030
birds+0.029
growth+0.029
group+0.029
Neg logit contributions
lessly-0.034
 much-0.028
 nice-0.026
 precious-0.026
 shiny-0.025
Token
Token ID198
Feature activation-2.04e-02

Pos logit contributions
keeping+0.065
girls+0.057
growth+0.055
rones+0.053
smith+0.052
Neg logit contributions
lessly-0.050
 much-0.047
 proud-0.042
 nice-0.041
 sweet-0.040
Token but
Token ID475
Feature activation-2.84e-02

Pos logit contributions
lessly+0.028
ww+0.022
 much+0.021
 precious+0.019
 proud+0.019
Neg logit contributions
keeping-0.035
birds-0.030
group-0.029
growth-0.029
rones-0.029
Token then
Token ID788
Feature activation-7.77e-03

Pos logit contributions
lessly+0.045
 much+0.037
 nice+0.033
 precious+0.033
 shiny+0.033
Neg logit contributions
keeping-0.068
rones-0.058
birds-0.056
group-0.055
growth-0.055
Token she
Token ID673
Feature activation-2.41e-02

Pos logit contributions
lessly+0.010
 much+0.009
 nice+0.008
 shiny+0.008
 good+0.007
Neg logit contributions
keeping-0.021
rones-0.018
girls-0.018
IT-0.017
group-0.017
Token saw
Token ID2497
Feature activation+4.76e-03

Pos logit contributions
lessly+0.044
 much+0.036
 nice+0.033
 precious+0.033
ww+0.032
Neg logit contributions
keeping-0.051
rones-0.042
birds-0.041
group-0.041
growth-0.040
Token something
Token ID1223
Feature activation-4.84e-02

Pos logit contributions
keeping+0.012
rones+0.011
girls+0.011
group+0.010
date+0.010
Neg logit contributions
lessly-0.006
 much-0.006
 shiny-0.005
 nice-0.005
 hard-0.005
Token very
Token ID845
Feature activation-4.56e-02

Pos logit contributions
lessly+0.067
ww+0.049
arest+0.040
 much+0.037
 thrill+0.037
Neg logit contributions
keeping-0.105
birds-0.101
group-0.095
 Her-0.093
bird-0.093
Token unusual
Token ID8468
Feature activation-1.36e-02

Pos logit contributions
lessly+0.062
ww+0.045
br+0.038
arest+0.037
plets+0.035
Neg logit contributions
keeping-0.093
 lectures-0.086
group-0.085
birds-0.084
 Her-0.084
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
lessly+0.023
 much+0.018
 nice+0.017
 precious+0.017
 shiny+0.017
Neg logit contributions
keeping-0.030
rones-0.026
birds-0.025
growth-0.025
ASH-0.025
Token There
Token ID1318
Feature activation-1.85e-02

Pos logit contributions
keeping+0.046
girls+0.038
group+0.036
date+0.036
rones+0.036
Neg logit contributions
lessly-0.037
 much-0.034
 shiny-0.029
 proud-0.029
 very-0.028
Token was
Token ID373
Feature activation-8.89e-03

Pos logit contributions
lessly+0.023
 much+0.020
 shiny+0.018
 nice+0.017
 hard+0.016
Neg logit contributions
keeping-0.052
rones-0.044
IT-0.042
ASH-0.042
girls-0.042
Token a
Token ID257
Feature activation-1.88e-02

Pos logit contributions
lessly+0.015
 much+0.012
 nice+0.011
 precious+0.011
 shiny+0.011
Neg logit contributions
keeping-0.020
rones-0.017
birds-0.016
group-0.016
ASH-0.016
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
lessly+0.046
 much+0.036
 precious+0.034
 nice+0.033
ww+0.033
Neg logit contributions
keeping-0.055
birds-0.045
rones-0.045
group-0.045
ASH-0.043
Token Even
Token ID3412
Feature activation-5.97e-02

Pos logit contributions
keeping+0.023
rones+0.019
group+0.018
growth+0.017
birds+0.017
Neg logit contributions
lessly-0.022
 much-0.019
 proud-0.017
 nice-0.017
 precious-0.017
Token with
Token ID351
Feature activation-2.08e-02

Pos logit contributions
lessly+0.117
arest+0.094
ww+0.088
 much+0.084
 proud+0.082
Neg logit contributions
keeping-0.107
birds-0.089
 lectures-0.086
 meetings-0.085
 preschool-0.084
Token a
Token ID257
Feature activation-2.08e-02

Pos logit contributions
lessly+0.038
 much+0.029
ww+0.027
 precious+0.027
 proud+0.027
Neg logit contributions
keeping-0.044
birds-0.036
rones-0.036
growth-0.035
group-0.034
Token few
Token ID1178
Feature activation-2.77e-02

Pos logit contributions
lessly+0.034
 much+0.022
ww+0.022
arest+0.021
bling+0.021
Neg logit contributions
keeping-0.045
birds-0.038
group-0.037
bird-0.037
date-0.037
Token sprink
Token ID27562
Feature activation-7.20e-02

Pos logit contributions
lessly+0.049
 much+0.036
ww+0.036
 steep+0.034
 proud+0.034
Neg logit contributions
keeping-0.056
birds-0.049
rones-0.048
growth-0.047
group-0.047
Tokenles
Token ID829
Feature activation-3.48e-02

Pos logit contributions
lessly+0.115
 much+0.095
 precious+0.090
 valuable+0.088
 nice+0.085
Neg logit contributions
keeping-0.148
birds-0.137
group-0.130
rones-0.129
bird-0.124
Token inside
Token ID2641
Feature activation-2.19e-03

Pos logit contributions
lessly+0.058
 much+0.046
 precious+0.043
 nice+0.042
ww+0.042
Neg logit contributions
keeping-0.076
birds-0.064
group-0.062
rones-0.062
growth-0.060
Token."
Token ID526
Feature activation+3.60e-02

Pos logit contributions
lessly+0.004
 much+0.003
 precious+0.003
 nice+0.003
ww+0.003
Neg logit contributions
keeping-0.005
rones-0.004
birds-0.004
group-0.004
ASH-0.004
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
keeping+0.079
rones+0.069
girls+0.069
stick+0.067
group+0.067
Neg logit contributions
lessly-0.054
 much-0.051
 proud-0.045
 de-0.040
 nice-0.040
Token
Token ID198
Feature activation-1.60e-02

Pos logit contributions
lessly+0.036
 much+0.032
 proud+0.029
 nice+0.029
 valuable+0.027
Neg logit contributions
keeping-0.043
ASH-0.036
group-0.035
rones-0.035
birds-0.035
Token He
Token ID679
Feature activation-1.89e-02

Pos logit contributions
keeping+0.031
rones+0.025
birds+0.025
group+0.024
growth+0.024
Neg logit contributions
lessly-0.035
 much-0.029
 nice-0.027
 precious-0.027
 proud-0.027
Token pushed
Token ID7121
Feature activation+4.66e-03

Pos logit contributions
lessly+0.036
 much+0.028
ww+0.027
arest+0.027
 precious+0.026
Neg logit contributions
keeping-0.035
rones-0.030
birds-0.030
growth-0.029
ASH-0.029
Token himself
Token ID2241
Feature activation+1.04e-02

Pos logit contributions
keeping+0.009
ASH+0.007
growth+0.007
birds+0.007
rones+0.007
Neg logit contributions
lessly-0.009
 precious-0.007
 much-0.007
ww-0.007
 valuable-0.007
Token up
Token ID510
Feature activation-1.87e-02

Pos logit contributions
keeping+0.019
stick+0.015
birds+0.015
ASH+0.015
gy+0.015
Neg logit contributions
lessly-0.020
ww-0.017
 precious-0.016
 much-0.015
arest-0.015
Token and
Token ID290
Feature activation-2.63e-02

Pos logit contributions
lessly+0.030
 much+0.024
 precious+0.022
 nice+0.022
ww+0.021
Neg logit contributions
keeping-0.043
rones-0.036
birds-0.036
group-0.035
growth-0.035
Token wob
Token ID46867
Feature activation-5.47e-02

Pos logit contributions
lessly+0.046
 much+0.037
ww+0.036
 precious+0.036
arest+0.035
Neg logit contributions
keeping-0.054
ASH-0.045
stick-0.044
IT-0.044
 Lee-0.043
Tokenbled
Token ID9342
Feature activation-6.66e-03

Pos logit contributions
ww+0.091
 valuable+0.090
 much+0.090
lessly+0.090
 precious+0.088
Neg logit contributions
keeping-0.093
ASH-0.087
gy-0.083
IT-0.081
group-0.079
Token on
Token ID319
Feature activation+2.28e-02

Pos logit contributions
lessly+0.011
ww+0.010
 precious+0.009
arest+0.009
 much+0.009
Neg logit contributions
keeping-0.014
ASH-0.011
birds-0.011
growth-0.011
group-0.011
Token his
Token ID465
Feature activation+3.09e-02

Pos logit contributions
keeping+0.050
rones+0.044
birds+0.044
girls+0.042
group+0.042
Neg logit contributions
lessly-0.038
 much-0.032
 nice-0.030
 shiny-0.029
 good-0.029
Token feet
Token ID3625
Feature activation-1.22e-02

Pos logit contributions
keeping+0.059
ASH+0.046
growth+0.046
rones+0.046
birds+0.045
Neg logit contributions
lessly-0.063
 much-0.051
ww-0.048
 precious-0.047
 proud-0.047
Token for
Token ID329
Feature activation+3.11e-02

Pos logit contributions
lessly+0.023
ww+0.021
 precious+0.020
 much+0.019
 valuable+0.019
Neg logit contributions
keeping-0.021
stick-0.017
birds-0.017
group-0.016
ASH-0.016
Token zoom
Token ID19792
Feature activation-2.60e-02

Pos logit contributions
lessly+0.049
ww+0.038
 much+0.037
arest+0.036
 precious+0.036
Neg logit contributions
keeping-0.065
rones-0.051
birds-0.050
girls-0.049
growth-0.049
Tokened
Token ID276
Feature activation+2.20e-03

Pos logit contributions
lessly+0.050
 much+0.043
 precious+0.042
ww+0.041
 nice+0.040
Neg logit contributions
keeping-0.050
birds-0.039
rones-0.039
group-0.038
ASH-0.037
Token around
Token ID1088
Feature activation+5.86e-03

Pos logit contributions
keeping+0.005
rones+0.005
birds+0.005
group+0.005
ASH+0.004
Neg logit contributions
lessly-0.003
 much-0.003
 nice-0.002
 precious-0.002
 shiny-0.002
Token his
Token ID465
Feature activation+2.85e-02

Pos logit contributions
keeping+0.014
rones+0.013
ASH+0.012
birds+0.012
growth+0.012
Neg logit contributions
lessly-0.009
 much-0.007
 nice-0.007
 good-0.006
 hard-0.006
Token town
Token ID3240
Feature activation+3.58e-03

Pos logit contributions
keeping+0.051
rones+0.040
growth+0.040
birds+0.040
ASH+0.039
Neg logit contributions
lessly-0.058
 much-0.052
 nice-0.051
 precious-0.051
 shiny-0.050
Token feeling
Token ID4203
Feature activation-6.11e-02

Pos logit contributions
keeping+0.008
rones+0.006
birds+0.006
group+0.006
growth+0.006
Neg logit contributions
lessly-0.006
 much-0.005
 nice-0.005
 precious-0.005
 shiny-0.005
Token proud
Token ID6613
Feature activation-2.92e-02

Pos logit contributions
lessly+0.070
ww+0.062
 steep+0.051
arest+0.048
 precious+0.047
Neg logit contributions
keeping-0.151
group-0.133
date-0.128
birds-0.127
rones-0.124
Token.
Token ID13
Feature activation+6.94e-03

Pos logit contributions
lessly+0.046
ww+0.036
 much+0.034
 precious+0.034
 steep+0.033
Neg logit contributions
keeping-0.066
birds-0.057
group-0.054
rones-0.054
growth-0.052
Token<|endoftext|>
Token ID50256
Feature activation+3.35e-02

Pos logit contributions
keeping+0.014
rones+0.011
ASH+0.011
group+0.011
birds+0.011
Neg logit contributions
lessly-0.014
 much-0.012
 nice-0.010
 proud-0.010
 valuable-0.010
TokenOnce
Token ID7454
Feature activation+3.93e-03

Pos logit contributions
keeping+0.072
 clients+0.064
stick+0.062
 arrest+0.061
IT+0.060
Neg logit contributions
lessly-0.054
 much-0.048
 nice-0.040
 good-0.039
 very-0.039
Token there
Token ID612
Feature activation+9.30e-03

Pos logit contributions
keeping+0.007
rones+0.006
birds+0.006
group+0.006
 Her+0.006
Neg logit contributions
lessly-0.007
ww-0.006
 precious-0.005
 much-0.005
 valuable-0.005