TOP ACTIVATIONS
MAX = 0.087

and you can take the ones on the bottom ." His
. I can take the ones on top and you can
important to always protect the ones we love . Tim my
, they can also bring something beautiful . <|endoftext|> Once upon
made sure to add new ones to his collection slowly and
things than to buy new ones . <|endoftext|> Once upon a
him and only grabbed the ones he found in the forest
feathers to replace the rough ones ." So , the squirrel
sure only to pick the ones that were sweet and delicious
giant ! We should send someone to check it out ."
some money to buy new ones . The end . <|endoftext|>
nice . Can I show Jen ?" Tom says .
You can take the small ones ," the truck driver said
fix them or get new ones . Lily felt a little
's not okay to take someone else 's things without asking
's important to cherish the ones you have now . From
. She realized that protecting someone is more important than having
man who gave away juicy ones . The End . <|endoftext|>
we visit the park to find it ?" Her mom said
You should pick the sweet ones . They taste much better

MIN ACTIVATIONS
MIN = -0.128

man walking on a tight ro pe . He had to
it here a long time ago ." " Maybe
walk around on a tight ro pe , he could even
famous artist a long time ago ." The
painted them a long time ago . She thought they were
same dress a long time ago . <|endoftext|> Once upon a
important person a long time ago . She was excited and
he had seen a year ago . He went home ,
it here a long time ago ." They decided
by a famous artist long ago . One
the scale a long time ago . As he
Jim from a long time ago , and he was very
her birthday a few days ago . She liked to practice
a ship that sank long ago . She said they could
big farm a long time ago . Lily listened as the
someone a very long time ago . The little girl thought
best friend a long time ago . He was my dog
had lived a long time ago . L
children put on their bath ro bes and ran around in
queen from long , long ago . Mia listened and her

INTERVAL 0.033 - 0.087
CONTAINS 5.210%

farm . They liked to see the animals and hear their
She brought it home and put the microphone
wall and started to pour himself a cup of tea .
but her mom promised to buy her some candy that was
comfortable . He remembered his friend the fox had a belt

INTERVAL -0.021 - 0.033
CONTAINS 81.157%

smiled and said , " I know cleaning isn 't fun
wandered around with the kitten , showing it all of the
O w , ow , ow !" Sara cries . "
: never give up and encourage yourself to do great things
the sun . <|endoftext|> Once upon a time , there was

INTERVAL -0.074 - -0.021
CONTAINS 13.589%

She started to draw a sky . But then , she
not know that the stick would break soon .
corn ?" Tom nods . He takes a corn
happy , and she liked to wear colorful clothes . Her
it was very cold and the ground was icy . Lily

INTERVAL -0.128 - -0.074
CONTAINS 0.043%

mess . And they have to keep away from her makeup
" You don 't have to get a stitch if you
a joke . I forgot to throw it away . It
something . And you have to take turns and play nicely
was okay . They decided to play with their own toys
Token and
Token ID290
Feature activation+4.57e-03

Pos logit contributions
pe+0.008
ro+0.008
bench+0.007
 swoop+0.007
ider+0.007
Neg logit contributions
illy-0.004
 favourite-0.004
 biggest-0.004
 groceries-0.004
 favorite-0.004
Token you
Token ID345
Feature activation-1.09e-02

Pos logit contributions
illy+0.008
 biggest+0.007
 favourite+0.007
 groceries+0.007
 favorite+0.007
Neg logit contributions
pe-0.008
ro-0.008
 swoop-0.007
bench-0.007
 dent-0.006
Token can
Token ID460
Feature activation-2.69e-02

Pos logit contributions
pe+0.022
ro+0.021
bench+0.019
ider+0.019
 dent+0.019
Neg logit contributions
illy-0.014
 favourite-0.014
 biggest-0.014
 favorite-0.014
 groceries-0.014
Token take
Token ID1011
Feature activation+3.66e-02

Pos logit contributions
pe+0.046
ro+0.045
bench+0.039
 swoop+0.038
ider+0.038
Neg logit contributions
illy-0.044
 favourite-0.043
 biggest-0.043
 groceries-0.043
 favorite-0.042
Token the
Token ID262
Feature activation+7.40e-03

Pos logit contributions
illy+0.050
 favourite+0.044
idy+0.044
 biggest+0.043
 favorite+0.042
Neg logit contributions
pe-0.073
ro-0.070
 swoop-0.064
bench-0.061
 charged-0.061
Token ones
Token ID3392
Feature activation+8.70e-02

Pos logit contributions
illy+0.009
 favourite+0.008
pieces+0.008
 favorite+0.008
 biggest+0.008
Neg logit contributions
pe-0.016
ro-0.015
bench-0.014
 dent-0.013
 swoop-0.013
Token on
Token ID319
Feature activation-2.63e-02

Pos logit contributions
illy+0.141
 favourite+0.134
 biggest+0.129
 favorite+0.126
pieces+0.125
Neg logit contributions
pe-0.154
ro-0.144
bench-0.125
ider-0.119
h-0.118
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
pe+0.046
ro+0.045
bench+0.039
 swoop+0.038
 ago+0.037
Neg logit contributions
illy-0.042
 favourite-0.041
 biggest-0.041
 groceries-0.040
 favorite-0.040
Token bottom
Token ID4220
Feature activation-6.35e-03

Pos logit contributions
illy+0.028
idy+0.025
elling+0.024
pieces+0.024
 favourite+0.023
Neg logit contributions
pe-0.020
ro-0.016
bench-0.015
 dodging-0.013
 dent-0.013
Token."
Token ID526
Feature activation+4.64e-03

Pos logit contributions
pe+0.011
ro+0.011
bench+0.009
ider+0.009
 ago+0.009
Neg logit contributions
illy-0.010
 favourite-0.010
 groceries-0.010
 biggest-0.010
 favorite-0.010
Token His
Token ID2399
Feature activation+3.96e-03

Pos logit contributions
illy+0.006
 favourite+0.006
 biggest+0.006
 groceries+0.006
 favorite+0.006
Neg logit contributions
pe-0.009
ro-0.009
bench-0.008
 swoop-0.008
ider-0.008
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
illy+0.057
 biggest+0.052
 favourite+0.051
ippi+0.050
pieces+0.049
Neg logit contributions
pe-0.075
ro-0.074
bench-0.066
 ago-0.064
ider-0.064
Token I
Token ID314
Feature activation-1.45e-02

Pos logit contributions
illy+0.034
 groceries+0.032
 favourite+0.031
 biggest+0.031
 favorite+0.030
Neg logit contributions
ro-0.030
pe-0.030
ider-0.025
 ago-0.024
bench-0.024
Token can
Token ID460
Feature activation-2.29e-02

Pos logit contributions
pe+0.028
ro+0.027
bench+0.025
 dodging+0.024
ider+0.024
Neg logit contributions
 favourite-0.021
 favorite-0.021
illy-0.020
 biggest-0.020
 groceries-0.019
Token take
Token ID1011
Feature activation+4.13e-02

Pos logit contributions
pe+0.038
ro+0.036
bench+0.031
 swoop+0.031
ider+0.030
Neg logit contributions
illy-0.040
 groceries-0.038
 biggest-0.038
 favourite-0.038
 favorite-0.038
Token the
Token ID262
Feature activation+7.58e-03

Pos logit contributions
illy+0.058
idy+0.051
 favourite+0.051
 biggest+0.050
 favorite+0.050
Neg logit contributions
pe-0.079
ro-0.075
 swoop-0.069
 hon-0.067
 charged-0.067
Token ones
Token ID3392
Feature activation+8.66e-02

Pos logit contributions
illy+0.010
pieces+0.009
 favourite+0.009
idy+0.009
ippi+0.008
Neg logit contributions
pe-0.016
ro-0.015
bench-0.014
 dent-0.013
 swoop-0.013
Token on
Token ID319
Feature activation-2.31e-02

Pos logit contributions
illy+0.136
 favourite+0.127
 biggest+0.125
pieces+0.122
 favorite+0.120
Neg logit contributions
pe-0.157
ro-0.149
bench-0.126
ider-0.122
h-0.120
Token top
Token ID1353
Feature activation-3.53e-03

Pos logit contributions
pe+0.042
ro+0.041
bench+0.035
 swoop+0.034
 ago+0.034
Neg logit contributions
illy-0.036
 favourite-0.035
 biggest-0.035
 groceries-0.034
 favorite-0.034
Token and
Token ID290
Feature activation+4.57e-03

Pos logit contributions
pe+0.008
ro+0.008
bench+0.007
 swoop+0.007
ider+0.007
Neg logit contributions
illy-0.004
 favourite-0.004
 biggest-0.004
 groceries-0.004
 favorite-0.004
Token you
Token ID345
Feature activation-1.09e-02

Pos logit contributions
illy+0.008
 biggest+0.007
 favourite+0.007
 groceries+0.007
 favorite+0.007
Neg logit contributions
pe-0.008
ro-0.008
 swoop-0.007
bench-0.007
 dent-0.006
Token can
Token ID460
Feature activation-2.69e-02

Pos logit contributions
pe+0.022
ro+0.021
bench+0.019
ider+0.019
 dent+0.019
Neg logit contributions
illy-0.014
 favourite-0.014
 biggest-0.014
 favorite-0.014
 groceries-0.014
Token important
Token ID1593
Feature activation-9.79e-03

Pos logit contributions
pe+0.036
ro+0.035
bench+0.032
 swoop+0.031
ider+0.031
Neg logit contributions
illy-0.013
 favourite-0.013
 biggest-0.013
 groceries-0.013
 favorite-0.012
Token to
Token ID284
Feature activation-5.72e-02

Pos logit contributions
pe+0.025
ro+0.024
 swoop+0.022
bench+0.022
ider+0.022
Neg logit contributions
illy-0.008
 favorite-0.008
 groceries-0.008
 favourite-0.008
 biggest-0.008
Token always
Token ID1464
Feature activation-2.60e-02

Pos logit contributions
ro+0.096
pe+0.096
bench+0.084
ider+0.083
 swoop+0.080
Neg logit contributions
illy-0.095
 groceries-0.091
 biggest-0.091
 favourite-0.091
 favorite-0.090
Token protect
Token ID1805
Feature activation+3.20e-02

Pos logit contributions
ro+0.051
pe+0.048
ider+0.045
bench+0.045
 ago+0.041
Neg logit contributions
 new-0.036
 favorite-0.035
 biggest-0.035
 favourite-0.035
 groceries-0.034
Token the
Token ID262
Feature activation+1.02e-02

Pos logit contributions
illy+0.041
 favourite+0.040
 biggest+0.039
 groceries+0.037
 favorite+0.037
Neg logit contributions
pe-0.067
ro-0.064
 hon-0.055
 dent-0.055
 swoop-0.055
Token ones
Token ID3392
Feature activation+8.65e-02

Pos logit contributions
illy+0.013
 favourite+0.012
 groceries+0.012
 biggest+0.012
 favorite+0.012
Neg logit contributions
pe-0.021
ro-0.021
bench-0.019
 swoop-0.018
 dent-0.018
Token we
Token ID356
Feature activation+8.94e-03

Pos logit contributions
illy+0.140
 favourite+0.134
 biggest+0.129
 favorite+0.127
 groceries+0.126
Neg logit contributions
pe-0.157
ro-0.140
bench-0.126
 ago-0.121
ider-0.119
Token love
Token ID1842
Feature activation+1.03e-02

Pos logit contributions
illy+0.016
 biggest+0.015
 favourite+0.015
 groceries+0.015
 favorite+0.015
Neg logit contributions
pe-0.014
ro-0.014
bench-0.012
 swoop-0.012
ider-0.012
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
 biggest+0.018
illy+0.018
 favourite+0.018
 groceries+0.017
 favorite+0.017
Neg logit contributions
pe-0.017
ro-0.016
bench-0.014
 ago-0.013
 dent-0.012
Token Tim
Token ID5045
Feature activation+4.26e-02

Pos logit contributions
illy+0.047
 biggest+0.047
 groceries+0.046
 favourite+0.046
 favorite+0.045
Neg logit contributions
pe-0.043
ro-0.042
bench-0.036
 dent-0.035
 swoop-0.034
Tokenmy
Token ID1820
Feature activation+1.92e-02

Pos logit contributions
illy+0.053
 favourite+0.052
 groceries+0.051
 biggest+0.051
 favorite+0.050
Neg logit contributions
pe-0.089
ro-0.088
bench-0.079
 swoop-0.077
ider-0.076
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
illy+0.020
 favourite+0.020
 biggest+0.020
 groceries+0.019
 favorite+0.019
Neg logit contributions
pe-0.023
ro-0.022
bench-0.020
 swoop-0.019
ider-0.019
Token they
Token ID484
Feature activation+1.59e-02

Pos logit contributions
illy+0.030
 biggest+0.029
 groceries+0.028
 favourite+0.028
ingu+0.027
Neg logit contributions
ro-0.036
pe-0.036
 ago-0.030
 dent-0.030
 swoop-0.030
Token can
Token ID460
Feature activation-1.50e-02

Pos logit contributions
illy+0.023
 groceries+0.022
 favourite+0.022
 biggest+0.022
 favorite+0.021
Neg logit contributions
pe-0.031
ro-0.030
bench-0.026
 swoop-0.026
ider-0.026
Token also
Token ID635
Feature activation+4.15e-03

Pos logit contributions
pe+0.024
ro+0.024
bench+0.021
ider+0.020
 swoop+0.020
Neg logit contributions
illy-0.026
 favourite-0.025
 biggest-0.025
 groceries-0.025
 favorite-0.025
Token bring
Token ID2222
Feature activation+6.45e-02

Pos logit contributions
 favourite+0.006
 favorite+0.006
 biggest+0.006
illy+0.006
 groceries+0.006
Neg logit contributions
pe-0.007
ro-0.007
bench-0.007
ider-0.006
 dent-0.006
Token something
Token ID1223
Feature activation+8.59e-02

Pos logit contributions
illy+0.096
ingu+0.087
ippi+0.079
oll+0.078
 biggest+0.077
Neg logit contributions
ro-0.117
pe-0.116
 breath-0.100
 hon-0.100
 swoop-0.100
Token beautiful
Token ID4950
Feature activation+1.54e-02

Pos logit contributions
illy+0.119
ingu+0.098
pieces+0.096
ippi+0.096
 groceries+0.095
Neg logit contributions
ro-0.166
pe-0.165
 hon-0.147
ider-0.146
 swoop-0.145
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
illy+0.023
 favourite+0.022
 biggest+0.022
 groceries+0.022
 favorite+0.022
Neg logit contributions
pe-0.029
ro-0.028
bench-0.025
 swoop-0.024
ider-0.024
Token<|endoftext|>
Token ID50256
Feature activation-4.87e-02

Pos logit contributions
 groceries+0.039
 biggest+0.039
illy+0.038
 favourite+0.037
 favorite+0.037
Neg logit contributions
ro-0.037
pe-0.037
bench-0.031
 ago-0.030
 dent-0.029
TokenOnce
Token ID7454
Feature activation+3.22e-02

Pos logit contributions
pe+0.097
ro+0.095
ider+0.086
 swoop+0.084
bench+0.084
Neg logit contributions
illy-0.065
 biggest-0.063
 favourite-0.062
 groceries-0.062
 favorite-0.061
Token upon
Token ID2402
Feature activation+7.52e-03

Pos logit contributions
 groceries+0.058
illy+0.057
 biggest+0.057
 favourite+0.056
 favorite+0.054
Neg logit contributions
pe-0.047
ro-0.046
bench-0.039
 ago-0.039
 swoop-0.038
Token made
Token ID925
Feature activation+4.98e-02

Pos logit contributions
 favorite+0.021
 favourite+0.021
 groceries+0.021
 biggest+0.021
 new+0.020
Neg logit contributions
pe-0.030
ro-0.030
ider-0.028
bench-0.026
 dent-0.025
Token sure
Token ID1654
Feature activation+4.25e-02

Pos logit contributions
illy+0.058
 groceries+0.051
elling+0.048
 favourite+0.047
ummy+0.047
Neg logit contributions
pe-0.106
ro-0.105
 hon-0.099
 swoop-0.098
 dent-0.095
Token to
Token ID284
Feature activation-5.65e-02

Pos logit contributions
illy+0.064
 favourite+0.056
 biggest+0.055
pieces+0.053
 groceries+0.052
Neg logit contributions
pe-0.082
ro-0.080
 hon-0.071
bench-0.070
 dent-0.070
Token add
Token ID751
Feature activation+6.28e-02

Pos logit contributions
ider+0.116
pe+0.113
bench+0.110
ro+0.107
 ago+0.104
Neg logit contributions
 new-0.066
 favorite-0.066
 amazing-0.065
 favourite-0.063
 biggest-0.062
Token new
Token ID649
Feature activation+1.58e-02

Pos logit contributions
illy+0.077
 favourite+0.066
 biggest+0.065
 favorite+0.062
idy+0.061
Neg logit contributions
ro-0.137
pe-0.134
bench-0.119
 dent-0.119
 swoop-0.117
Token ones
Token ID3392
Feature activation+8.58e-02

Pos logit contributions
illy+0.017
 biggest+0.015
 favourite+0.015
 groceries+0.014
 favorite+0.014
Neg logit contributions
pe-0.037
ro-0.036
bench-0.033
 dent-0.032
 swoop-0.032
Token to
Token ID284
Feature activation-2.82e-02

Pos logit contributions
illy+0.148
 favourite+0.138
 biggest+0.137
 groceries+0.137
pieces+0.131
Neg logit contributions
pe-0.143
ro-0.139
bench-0.116
 wob-0.108
ider-0.108
Token his
Token ID465
Feature activation+2.65e-02

Pos logit contributions
pe+0.053
ro+0.052
bench+0.045
 dent+0.045
 swoop+0.044
Neg logit contributions
illy-0.042
 favourite-0.040
 biggest-0.040
 groceries-0.039
 favorite-0.038
Token collection
Token ID4947
Feature activation+2.39e-02

Pos logit contributions
 favourite+0.031
 favorite+0.031
 biggest+0.030
 groceries+0.030
illy+0.029
Neg logit contributions
pe-0.059
ro-0.058
bench-0.052
 swoop-0.052
ider-0.051
Token slowly
Token ID6364
Feature activation-2.04e-02

Pos logit contributions
illy+0.037
 favourite+0.036
 biggest+0.036
 groceries+0.036
 favorite+0.035
Neg logit contributions
pe-0.043
ro-0.042
bench-0.037
 swoop-0.036
ider-0.036
Token and
Token ID290
Feature activation+9.34e-03

Pos logit contributions
pe+0.044
ider+0.041
ro+0.040
bench+0.039
 swoop+0.038
Neg logit contributions
 groceries-0.022
 favorite-0.022
 biggest-0.022
 new-0.022
 favourite-0.021
Token things
Token ID1243
Feature activation+4.73e-02

Pos logit contributions
illy+0.002
 favourite+0.002
 biggest+0.002
 favorite+0.002
 groceries+0.002
Neg logit contributions
pe-0.005
ro-0.005
bench-0.004
 dent-0.004
 swoop-0.004
Token than
Token ID621
Feature activation+2.15e-02

Pos logit contributions
illy+0.070
 favourite+0.067
 biggest+0.067
 groceries+0.067
 favorite+0.065
Neg logit contributions
pe-0.089
ro-0.087
bench-0.077
 dent-0.075
 swoop-0.074
Token to
Token ID284
Feature activation-5.98e-02

Pos logit contributions
illy+0.026
 favourite+0.025
 biggest+0.025
 groceries+0.024
 favorite+0.024
Neg logit contributions
pe-0.046
ro-0.045
bench-0.041
 swoop-0.040
 dent-0.040
Token buy
Token ID2822
Feature activation+3.82e-02

Pos logit contributions
ro+0.095
pe+0.095
ider+0.082
bench+0.082
 swoop+0.078
Neg logit contributions
illy-0.102
 groceries-0.100
 biggest-0.100
 favourite-0.100
 favorite-0.100
Token new
Token ID649
Feature activation+1.92e-02

Pos logit contributions
illy+0.058
 favourite+0.050
ippi+0.048
pieces+0.048
ingu+0.048
Neg logit contributions
pe-0.069
ro-0.064
 dent-0.061
 ago-0.058
bench-0.058
Token ones
Token ID3392
Feature activation+8.53e-02

Pos logit contributions
illy+0.021
 favourite+0.017
 biggest+0.017
 groceries+0.017
 favorite+0.016
Neg logit contributions
pe-0.045
ro-0.044
 dent-0.040
bench-0.040
 swoop-0.039
Token.
Token ID13
Feature activation+3.20e-02

Pos logit contributions
illy+0.148
 favourite+0.137
 biggest+0.133
 groceries+0.131
 favorite+0.129
Neg logit contributions
pe-0.141
ro-0.132
bench-0.118
 ago-0.113
 dent-0.112
Token<|endoftext|>
Token ID50256
Feature activation-5.62e-02

Pos logit contributions
illy+0.055
 groceries+0.054
 favourite+0.053
 biggest+0.053
 favorite+0.052
Neg logit contributions
pe-0.051
ro-0.050
bench-0.043
 dent-0.042
 swoop-0.041
TokenOnce
Token ID7454
Feature activation+2.86e-02

Pos logit contributions
pe+0.108
ro+0.106
ider+0.093
bench+0.093
 swoop+0.093
Neg logit contributions
illy-0.080
 favourite-0.077
 biggest-0.077
 groceries-0.076
 favorite-0.075
Token upon
Token ID2402
Feature activation+8.76e-03

Pos logit contributions
 groceries+0.053
 biggest+0.052
illy+0.052
 favourite+0.051
 tomatoes+0.050
Neg logit contributions
pe-0.039
ro-0.038
bench-0.034
 ago-0.033
 dent-0.032
Token a
Token ID257
Feature activation+6.08e-03

Pos logit contributions
illy+0.010
 groceries+0.009
 biggest+0.009
 favourite+0.009
 favorite+0.009
Neg logit contributions
pe-0.019
ro-0.019
bench-0.017
 swoop-0.017
ider-0.017
Token him
Token ID683
Feature activation+1.85e-02

Pos logit contributions
pe+0.037
ro+0.036
 swoop+0.031
bench+0.031
 ago+0.031
Neg logit contributions
 favourite-0.024
illy-0.024
 biggest-0.023
 favorite-0.023
 groceries-0.023
Token and
Token ID290
Feature activation+4.24e-03

Pos logit contributions
illy+0.026
 favourite+0.026
 biggest+0.025
 favorite+0.025
 groceries+0.025
Neg logit contributions
pe-0.035
ro-0.035
bench-0.032
ider-0.031
 swoop-0.030
Token only
Token ID691
Feature activation-1.36e-02

Pos logit contributions
 favourite+0.006
illy+0.006
 favorite+0.006
 biggest+0.006
 groceries+0.006
Neg logit contributions
pe-0.008
ro-0.008
ider-0.007
bench-0.007
 dent-0.007
Token grabbed
Token ID13176
Feature activation+2.07e-02

Pos logit contributions
pe+0.024
ro+0.023
bench+0.020
ider+0.020
 dent+0.019
Neg logit contributions
illy-0.022
 favourite-0.022
 biggest-0.022
 favorite-0.022
 groceries-0.021
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
illy+0.033
idy+0.032
cakes+0.031
affles+0.031
ummy+0.030
Neg logit contributions
pe-0.033
ro-0.032
 swoop-0.028
 charged-0.028
 breath-0.028
Token ones
Token ID3392
Feature activation+8.49e-02

Pos logit contributions
illy+0.009
 favourite+0.008
pieces+0.008
 groceries+0.008
idy+0.008
Neg logit contributions
pe-0.026
ro-0.025
 swoop-0.023
 dent-0.023
 ago-0.022
Token he
Token ID339
Feature activation+1.31e-02

Pos logit contributions
illy+0.135
 biggest+0.127
 favourite+0.126
 favorite+0.123
affles+0.122
Neg logit contributions
pe-0.152
ro-0.139
bench-0.120
 swoop-0.114
 ago-0.113
Token found
Token ID1043
Feature activation+3.09e-02

Pos logit contributions
illy+0.022
 groceries+0.021
 biggest+0.021
 favourite+0.021
 favorite+0.020
Neg logit contributions
pe-0.023
ro-0.022
 swoop-0.019
bench-0.019
ider-0.018
Token in
Token ID287
Feature activation-3.22e-02

Pos logit contributions
illy+0.048
 favourite+0.047
 biggest+0.047
 groceries+0.046
 favorite+0.046
Neg logit contributions
pe-0.056
ro-0.054
bench-0.047
 swoop-0.046
ider-0.046
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
ro+0.075
pe+0.075
ider+0.075
 swoop+0.068
bench+0.066
Neg logit contributions
 new-0.032
 groceries-0.032
 favourite-0.030
 biggest-0.030
 favorite-0.030
Token forest
Token ID8222
Feature activation-3.58e-02

Pos logit contributions
illy+0.025
 groceries+0.023
idy+0.023
 favourite+0.023
pieces+0.022
Neg logit contributions
pe-0.028
ro-0.027
 swoop-0.023
 dent-0.023
bench-0.023
Token feathers
Token ID27400
Feature activation+2.77e-02

Pos logit contributions
pe+0.010
ro+0.010
 swoop+0.009
bench+0.009
 dent+0.008
Neg logit contributions
illy-0.005
 favourite-0.005
 biggest-0.005
 groceries-0.005
 favorite-0.005
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
illy+0.041
 favourite+0.040
 groceries+0.039
 biggest+0.039
 favorite+0.038
Neg logit contributions
pe-0.053
ro-0.052
bench-0.043
 swoop-0.043
ider-0.043
Token replace
Token ID6330
Feature activation+3.33e-02

Pos logit contributions
pe+0.028
ro+0.026
 swoop+0.024
 dent+0.022
bench+0.021
Neg logit contributions
illy-0.031
 favourite-0.029
 groceries-0.029
 biggest-0.029
 favorite-0.028
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
illy+0.050
 favourite+0.046
 groceries+0.045
 biggest+0.044
oll+0.043
Neg logit contributions
pe-0.065
ro-0.060
 swoop-0.055
 hon-0.050
fly-0.050
Token rough
Token ID5210
Feature activation-1.66e-02

Pos logit contributions
illy+0.024
 biggest+0.024
 favourite+0.024
 groceries+0.023
 favorite+0.023
Neg logit contributions
pe-0.040
ro-0.039
bench-0.035
ider-0.034
 swoop-0.034
Token ones
Token ID3392
Feature activation+8.46e-02

Pos logit contributions
pe+0.031
ro+0.031
bench+0.029
ider+0.029
 dent+0.027
Neg logit contributions
 biggest-0.023
 favorite-0.023
 favourite-0.022
illy-0.022
 groceries-0.022
Token."
Token ID526
Feature activation+7.89e-03

Pos logit contributions
illy+0.161
 favourite+0.152
 groceries+0.150
 biggest+0.146
ippi+0.146
Neg logit contributions
pe-0.129
ro-0.119
ider-0.099
 ago-0.096
bench-0.095
Token So
Token ID1406
Feature activation-1.13e-02

Pos logit contributions
illy+0.011
 biggest+0.011
 favourite+0.011
 groceries+0.011
 favorite+0.010
Neg logit contributions
pe-0.016
ro-0.015
bench-0.014
 swoop-0.013
ider-0.013
Token,
Token ID11
Feature activation+1.54e-02

Pos logit contributions
pe+0.021
ro+0.021
bench+0.018
 swoop+0.018
ider+0.018
Neg logit contributions
illy-0.017
 favourite-0.016
 biggest-0.016
 groceries-0.016
 favorite-0.016
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
illy+0.021
 biggest+0.020
 favourite+0.020
 groceries+0.020
 favorite+0.020
Neg logit contributions
pe-0.031
ro-0.030
bench-0.027
 swoop-0.027
 dent-0.026
Token squirrel
Token ID33039
Feature activation+2.81e-02

Pos logit contributions
ro+0.030
pe+0.030
bench+0.027
ider+0.026
 dent+0.026
Neg logit contributions
 biggest-0.020
illy-0.019
 favourite-0.019
 favorite-0.019
 groceries-0.019
Token sure
Token ID1654
Feature activation+4.15e-02

Pos logit contributions
illy+0.053
 groceries+0.049
elling+0.048
 favourite+0.047
 biggest+0.046
Neg logit contributions
pe-0.072
 swoop-0.066
ro-0.066
 wob-0.063
 dent-0.063
Token only
Token ID691
Feature activation-4.86e-03

Pos logit contributions
illy+0.069
 favourite+0.062
 biggest+0.059
 groceries+0.058
affles+0.058
Neg logit contributions
pe-0.073
ro-0.066
bench-0.061
 dent-0.060
 swoop-0.060
Token to
Token ID284
Feature activation-6.47e-02

Pos logit contributions
pe+0.010
ro+0.010
 swoop+0.008
bench+0.008
 dent+0.008
Neg logit contributions
illy-0.007
 groceries-0.006
 favourite-0.006
 biggest-0.006
 favorite-0.006
Token pick
Token ID2298
Feature activation+3.87e-02

Pos logit contributions
pe+0.123
ro+0.121
ider+0.115
bench+0.114
 dent+0.104
Neg logit contributions
illy-0.091
 biggest-0.085
 favorite-0.085
 favourite-0.085
 groceries-0.084
Token the
Token ID262
Feature activation+2.31e-02

Pos logit contributions
illy+0.059
ingu+0.055
affles+0.055
 favourite+0.054
oll+0.054
Neg logit contributions
pe-0.063
ro-0.059
 swoop-0.056
 dent-0.052
 wob-0.051
Token ones
Token ID3392
Feature activation+8.35e-02

Pos logit contributions
illy+0.024
 favourite+0.021
 groceries+0.020
ingu+0.020
oll+0.020
Neg logit contributions
pe-0.053
ro-0.050
bench-0.046
 swoop-0.046
 dent-0.045
Token that
Token ID326
Feature activation-1.29e-03

Pos logit contributions
illy+0.142
 favourite+0.137
 biggest+0.133
 favorite+0.132
 groceries+0.130
Neg logit contributions
pe-0.139
ro-0.124
bench-0.109
 wob-0.105
ider-0.103
Token were
Token ID547
Feature activation-5.37e-03

Pos logit contributions
pe+0.002
ro+0.002
bench+0.002
 swoop+0.002
ider+0.002
Neg logit contributions
illy-0.002
 favourite-0.002
 biggest-0.002
 groceries-0.002
 favorite-0.002
Token sweet
Token ID6029
Feature activation+1.56e-02

Pos logit contributions
pe+0.011
ro+0.011
bench+0.010
ider+0.010
 dent+0.010
Neg logit contributions
illy-0.007
 biggest-0.007
 favourite-0.007
 groceries-0.007
 favorite-0.007
Token and
Token ID290
Feature activation-2.53e-03

Pos logit contributions
 favourite+0.026
illy+0.026
 biggest+0.025
 favorite+0.025
 groceries+0.025
Neg logit contributions
ro-0.025
pe-0.025
bench-0.020
 ago-0.020
 swoop-0.020
Token delicious
Token ID12625
Feature activation-5.83e-04

Pos logit contributions
pe+0.006
ro+0.006
bench+0.005
 swoop+0.005
 dent+0.005
Neg logit contributions
illy-0.003
 groceries-0.003
 favourite-0.003
 biggest-0.003
 favorite-0.002
Token giant
Token ID6175
Feature activation+9.86e-03

Pos logit contributions
pe+0.022
ro+0.022
 swoop+0.019
 dent+0.019
bench+0.019
Neg logit contributions
illy-0.014
 groceries-0.013
 favourite-0.012
 biggest-0.012
pieces-0.012
Token!
Token ID0
Feature activation+2.08e-02

Pos logit contributions
illy+0.014
 biggest+0.014
 favourite+0.014
 favorite+0.014
 groceries+0.014
Neg logit contributions
pe-0.018
ro-0.018
bench-0.016
ider-0.015
 swoop-0.015
Token We
Token ID775
Feature activation+2.04e-03

Pos logit contributions
illy+0.033
 groceries+0.032
 favourite+0.031
 biggest+0.031
 favorite+0.030
Neg logit contributions
ro-0.037
pe-0.037
ider-0.032
 swoop-0.031
bench-0.031
Token should
Token ID815
Feature activation-3.72e-02

Pos logit contributions
illy+0.003
 groceries+0.003
 biggest+0.003
 favourite+0.003
 favorite+0.003
Neg logit contributions
pe-0.004
ro-0.004
 swoop-0.003
bench-0.003
 dent-0.003
Token send
Token ID3758
Feature activation+5.05e-02

Pos logit contributions
ro+0.067
pe+0.066
bench+0.060
ider+0.058
 dent+0.056
Neg logit contributions
illy-0.056
 favourite-0.055
 biggest-0.055
 favorite-0.055
 groceries-0.054
Token someone
Token ID2130
Feature activation+8.34e-02

Pos logit contributions
illy+0.087
ingu+0.077
affles+0.075
oll+0.072
 favourite+0.072
Neg logit contributions
pe-0.085
ro-0.079
 swoop-0.077
 hon-0.077
 dent-0.073
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
illy+0.153
ingu+0.126
add+0.125
ummy+0.125
affles+0.124
Neg logit contributions
pe-0.132
ro-0.125
 swoop-0.123
 hon-0.117
 ago-0.114
Token check
Token ID2198
Feature activation+1.01e-02

Pos logit contributions
pe+0.040
ro+0.037
 hon+0.035
 swoop+0.035
 ago+0.035
Neg logit contributions
illy-0.033
 favourite-0.031
 groceries-0.030
 biggest-0.030
idy-0.029
Token it
Token ID340
Feature activation+4.39e-02

Pos logit contributions
 favourite+0.017
illy+0.016
 favorite+0.016
ingu+0.015
 biggest+0.015
Neg logit contributions
pe-0.017
ro-0.016
 ago-0.015
h-0.014
 hon-0.014
Token out
Token ID503
Feature activation+1.99e-02

Pos logit contributions
illy+0.072
 groceries+0.064
 favourite+0.064
 biggest+0.063
pieces+0.060
Neg logit contributions
pe-0.079
ro-0.078
 swoop-0.068
bench-0.067
ider-0.064
Token."
Token ID526
Feature activation+3.62e-03

Pos logit contributions
illy+0.035
 favourite+0.034
 groceries+0.034
 biggest+0.032
 favorite+0.032
Neg logit contributions
ro-0.031
pe-0.031
 ago-0.027
 swoop-0.026
ider-0.025
Token some
Token ID617
Feature activation+3.16e-02

Pos logit contributions
illy+0.043
 biggest+0.038
 favourite+0.037
idy+0.037
elling+0.036
Neg logit contributions
ro-0.058
pe-0.057
bench-0.054
 swoop-0.052
 hon-0.050
Token money
Token ID1637
Feature activation+8.71e-03

Pos logit contributions
illy+0.035
 biggest+0.032
 favourite+0.031
 favorite+0.030
ingu+0.029
Neg logit contributions
ro-0.068
pe-0.066
bench-0.060
 dent-0.058
ider-0.058
Token to
Token ID284
Feature activation-3.98e-02

Pos logit contributions
illy+0.011
 groceries+0.011
 favourite+0.011
 biggest+0.011
 favorite+0.010
Neg logit contributions
pe-0.018
ro-0.018
bench-0.016
 swoop-0.016
 dent-0.016
Token buy
Token ID2822
Feature activation+5.26e-02

Pos logit contributions
pe+0.073
ro+0.072
bench+0.064
ider+0.062
 swoop+0.062
Neg logit contributions
illy-0.060
 favourite-0.058
 groceries-0.058
 biggest-0.058
 favorite-0.057
Token new
Token ID649
Feature activation+1.58e-02

Pos logit contributions
illy+0.073
ippi+0.068
 biggest+0.068
elling+0.067
idy+0.067
Neg logit contributions
pe-0.089
ro-0.086
bench-0.080
 hon-0.072
 ago-0.071
Token ones
Token ID3392
Feature activation+8.34e-02

Pos logit contributions
illy+0.019
 biggest+0.017
ingu+0.016
idy+0.016
 favourite+0.016
Neg logit contributions
ro-0.032
pe-0.032
bench-0.029
 ago-0.028
 dent-0.028
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
illy+0.151
 biggest+0.143
 favourite+0.142
 groceries+0.139
pieces+0.137
Neg logit contributions
pe-0.129
ro-0.120
bench-0.105
ider-0.098
 ago-0.096
Token The
Token ID383
Feature activation-1.72e-02

Pos logit contributions
illy+0.033
 biggest+0.033
 groceries+0.033
 favourite+0.032
 favorite+0.032
Neg logit contributions
pe-0.035
ro-0.035
bench-0.030
 swoop-0.029
 ago-0.028
Token end
Token ID886
Feature activation-6.56e-03

Pos logit contributions
pe+0.035
ro+0.035
bench+0.031
 swoop+0.030
ider+0.030
Neg logit contributions
illy-0.023
 favourite-0.022
 biggest-0.022
 groceries-0.022
 favorite-0.022
Token.
Token ID13
Feature activation+2.55e-02

Pos logit contributions
pe+0.012
ro+0.012
bench+0.010
 swoop+0.010
ider+0.010
Neg logit contributions
illy-0.010
 favourite-0.010
 biggest-0.010
 groceries-0.010
 favorite-0.010
Token<|endoftext|>
Token ID50256
Feature activation-5.44e-02

Pos logit contributions
illy+0.044
 biggest+0.044
 groceries+0.043
 favourite+0.043
 favorite+0.042
Neg logit contributions
pe-0.041
ro-0.040
bench-0.034
 swoop-0.033
 dent-0.033
Token nice
Token ID3621
Feature activation-7.70e-03

Pos logit contributions
pe+0.025
ro+0.025
bench+0.023
 swoop+0.022
ider+0.022
Neg logit contributions
illy-0.010
 favourite-0.009
 biggest-0.009
 groceries-0.009
 favorite-0.009
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
 swoop+0.012
ider+0.012
Neg logit contributions
illy-0.012
 biggest-0.012
 favourite-0.012
 groceries-0.012
 favorite-0.011
Token Can
Token ID1680
Feature activation+5.29e-04

Pos logit contributions
pe+0.002
ro+0.002
bench+0.002
ider+0.002
 swoop+0.002
Neg logit contributions
illy-0.002
 groceries-0.002
 favourite-0.002
 biggest-0.002
 favorite-0.002
Token I
Token ID314
Feature activation-8.75e-03

Pos logit contributions
illy+0.001
 biggest+0.001
 groceries+0.001
 favourite+0.001
 favorite+0.001
Neg logit contributions
pe-0.001
ro-0.001
ider-0.001
 swoop-0.001
 dent-0.001
Token show
Token ID905
Feature activation+4.77e-02

Pos logit contributions
ro+0.014
pe+0.013
 swoop+0.011
 dent+0.011
ider+0.011
Neg logit contributions
illy-0.016
 groceries-0.016
 biggest-0.015
 favourite-0.015
 tomatoes-0.015
Token Jen
Token ID13364
Feature activation+8.33e-02

Pos logit contributions
illy+0.077
 biggest+0.077
 favourite+0.075
 crops+0.075
 groceries+0.074
Neg logit contributions
pe-0.075
ro-0.068
h-0.067
 hon-0.066
 swoop-0.066
Token?"
Token ID1701
Feature activation-1.30e-02

Pos logit contributions
illy+0.143
 biggest+0.139
 groceries+0.136
 favourite+0.134
idy+0.126
Neg logit contributions
ro-0.124
pe-0.123
h-0.106
 swoop-0.105
 hon-0.103
Token Tom
Token ID4186
Feature activation-2.35e-03

Pos logit contributions
pe+0.027
ro+0.026
bench+0.024
ider+0.023
 dent+0.023
Neg logit contributions
 groceries-0.016
illy-0.016
 favorite-0.016
 favourite-0.016
 biggest-0.016
Token says
Token ID1139
Feature activation-1.08e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.004
 swoop+0.004
ider+0.004
Neg logit contributions
illy-0.003
 favourite-0.003
 biggest-0.003
 groceries-0.003
 favorite-0.003
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
pe+0.021
ro+0.021
 swoop+0.018
bench+0.018
 dent+0.018
Neg logit contributions
illy-0.015
 favorite-0.015
 favourite-0.015
 biggest-0.014
 groceries-0.014
Token
Token ID198
Feature activation-1.97e-02

Pos logit contributions
pe+0.044
ro+0.043
ider+0.041
 dent+0.040
 swoop+0.040
Neg logit contributions
 new-0.019
 favorite-0.017
 biggest-0.017
 favourite-0.016
 groceries-0.016
Token You
Token ID921
Feature activation+1.18e-02

Pos logit contributions
illy+0.032
 favourite+0.031
 biggest+0.030
 favorite+0.030
 groceries+0.030
Neg logit contributions
pe-0.034
ro-0.034
ider-0.028
 ago-0.028
bench-0.028
Token can
Token ID460
Feature activation-2.75e-02

Pos logit contributions
illy+0.017
 biggest+0.017
 favourite+0.017
 groceries+0.017
 favorite+0.017
Neg logit contributions
pe-0.022
ro-0.022
bench-0.019
ider-0.019
 swoop-0.018
Token take
Token ID1011
Feature activation+3.59e-02

Pos logit contributions
ro+0.046
pe+0.046
bench+0.041
ider+0.039
 swoop+0.039
Neg logit contributions
illy-0.044
 favourite-0.044
 biggest-0.044
 groceries-0.044
 favorite-0.043
Token the
Token ID262
Feature activation+9.42e-03

Pos logit contributions
illy+0.054
 favourite+0.048
 biggest+0.047
 favorite+0.046
idy+0.044
Neg logit contributions
pe-0.069
ro-0.068
 swoop-0.060
 dent-0.057
bench-0.057
Token small
Token ID1402
Feature activation-1.82e-02

Pos logit contributions
illy+0.016
 favourite+0.015
 biggest+0.014
 groceries+0.014
 favorite+0.014
Neg logit contributions
pe-0.017
ro-0.016
bench-0.014
 swoop-0.014
 dent-0.014
Token ones
Token ID3392
Feature activation+8.33e-02

Pos logit contributions
pe+0.033
ro+0.033
bench+0.029
 swoop+0.028
ider+0.028
Neg logit contributions
illy-0.027
 groceries-0.027
 biggest-0.027
 favourite-0.027
 favorite-0.026
Token,"
Token ID553
Feature activation+4.55e-03

Pos logit contributions
illy+0.140
 favourite+0.131
 biggest+0.127
pieces+0.123
ippi+0.123
Neg logit contributions
pe-0.149
ro-0.136
bench-0.120
ider-0.115
h-0.113
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
illy+0.006
 biggest+0.006
 favourite+0.006
 groceries+0.006
 favorite+0.006
Neg logit contributions
pe-0.009
ro-0.009
bench-0.008
 swoop-0.008
 dent-0.008
Token truck
Token ID7779
Feature activation-1.03e-02

Pos logit contributions
ro+0.039
pe+0.036
bench+0.033
ider+0.032
 swoop+0.031
Neg logit contributions
 biggest-0.036
 new-0.035
 groceries-0.035
 favorite-0.034
 favourite-0.034
Token driver
Token ID4639
Feature activation+1.09e-03

Pos logit contributions
ro+0.017
pe+0.017
 dent+0.015
 ago+0.015
bench+0.015
Neg logit contributions
 favorite-0.017
 favourite-0.016
 biggest-0.016
 groceries-0.015
 new-0.015
Token said
Token ID531
Feature activation-1.04e-02

Pos logit contributions
illy+0.002
 favourite+0.002
 favorite+0.002
 groceries+0.002
 biggest+0.002
Neg logit contributions
pe-0.002
ro-0.002
bench-0.002
 dent-0.002
ider-0.002
Token fix
Token ID4259
Feature activation+2.22e-02

Pos logit contributions
ro+0.041
pe+0.041
bench+0.036
ider+0.035
 swoop+0.034
Neg logit contributions
 favourite-0.037
 biggest-0.037
 favorite-0.037
illy-0.037
 groceries-0.036
Token them
Token ID606
Feature activation+5.42e-02

Pos logit contributions
illy+0.036
 favourite+0.033
ete+0.032
 biggest+0.032
ingu+0.032
Neg logit contributions
pe-0.036
ro-0.033
 dent-0.033
 hon-0.029
bench-0.028
Token or
Token ID393
Feature activation+1.54e-02

Pos logit contributions
illy+0.097
 biggest+0.089
 favourite+0.088
 groceries+0.087
 favorite+0.084
Neg logit contributions
pe-0.087
ro-0.086
bench-0.074
ider-0.072
 dent-0.071
Token get
Token ID651
Feature activation+3.61e-02

Pos logit contributions
illy+0.029
 biggest+0.026
 favourite+0.026
 groceries+0.026
 favorite+0.025
Neg logit contributions
pe-0.024
ro-0.024
 dent-0.020
 swoop-0.020
bench-0.020
Token new
Token ID649
Feature activation+1.43e-02

Pos logit contributions
illy+0.055
 favourite+0.049
 biggest+0.048
 favorite+0.045
 groceries+0.045
Neg logit contributions
pe-0.064
ro-0.063
 dent-0.060
 swoop-0.059
 charged-0.057
Token ones
Token ID3392
Feature activation+8.22e-02

Pos logit contributions
illy+0.018
 biggest+0.017
 favourite+0.017
 groceries+0.016
 favorite+0.016
Neg logit contributions
pe-0.030
ro-0.030
bench-0.027
 swoop-0.026
 dent-0.026
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
illy+0.142
 favourite+0.133
 biggest+0.132
 groceries+0.126
 favorite+0.125
Neg logit contributions
pe-0.137
ro-0.130
bench-0.118
 ago-0.109
 dent-0.108
Token Lily
Token ID20037
Feature activation+1.69e-02

Pos logit contributions
illy+0.037
 biggest+0.037
 groceries+0.036
 favourite+0.035
 amazing+0.034
Neg logit contributions
ro-0.042
pe-0.037
bench-0.034
 ago-0.034
ider-0.033
Token felt
Token ID2936
Feature activation+1.47e-02

Pos logit contributions
illy+0.025
 favourite+0.025
 biggest+0.024
 favorite+0.024
 groceries+0.024
Neg logit contributions
pe-0.032
ro-0.031
bench-0.027
ider-0.027
 swoop-0.027
Token a
Token ID257
Feature activation+8.32e-03

Pos logit contributions
illy+0.024
 groceries+0.022
 tomatoes+0.021
 favourite+0.021
 biggest+0.021
Neg logit contributions
pe-0.024
ro-0.023
bench-0.022
 dent-0.022
 swoop-0.021
Token little
Token ID1310
Feature activation-7.16e-03

Pos logit contributions
 biggest+0.007
 favourite+0.007
illy+0.007
 favorite+0.007
 groceries+0.007
Neg logit contributions
pe-0.021
ro-0.020
bench-0.019
ider-0.018
 ago-0.018
Token's
Token ID338
Feature activation-1.21e-02

Pos logit contributions
illy+0.012
 groceries+0.011
 favourite+0.011
 biggest+0.011
 favorite+0.010
Neg logit contributions
pe-0.034
ro-0.033
bench-0.030
 swoop-0.030
 dent-0.030
Token not
Token ID407
Feature activation-5.05e-03

Pos logit contributions
pe+0.025
ro+0.024
bench+0.022
 swoop+0.021
ider+0.021
Neg logit contributions
illy-0.016
 favourite-0.015
 biggest-0.015
 groceries-0.015
 favorite-0.015
Token okay
Token ID8788
Feature activation+1.90e-02

Pos logit contributions
pe+0.012
ro+0.012
bench+0.011
ider+0.010
 swoop+0.010
Neg logit contributions
illy-0.005
 biggest-0.005
 favourite-0.005
 groceries-0.005
 favorite-0.005
Token to
Token ID284
Feature activation-6.63e-02

Pos logit contributions
illy+0.026
 biggest+0.024
 groceries+0.024
 favourite+0.024
 favorite+0.023
Neg logit contributions
pe-0.038
ro-0.038
bench-0.033
ider-0.033
 swoop-0.033
Token take
Token ID1011
Feature activation+3.36e-02

Pos logit contributions
pe+0.114
ro+0.109
bench+0.101
ider+0.099
 ago+0.090
Neg logit contributions
illy-0.107
 favorite-0.104
 biggest-0.103
 favourite-0.102
 new-0.101
Token someone
Token ID2130
Feature activation+8.22e-02

Pos logit contributions
illy+0.051
 favourite+0.046
 biggest+0.045
 favorite+0.042
 groceries+0.042
Neg logit contributions
ro-0.061
pe-0.060
 dent-0.054
 swoop-0.054
 crash-0.052
Token else
Token ID2073
Feature activation+5.12e-02

Pos logit contributions
illy+0.135
 favourite+0.121
ingu+0.113
affles+0.112
 biggest+0.112
Neg logit contributions
ro-0.139
pe-0.133
 swoop-0.121
 fought-0.120
 dent-0.120
Token's
Token ID338
Feature activation+8.88e-03

Pos logit contributions
illy+0.094
 favourite+0.085
 biggest+0.085
add+0.079
affles+0.078
Neg logit contributions
ro-0.081
pe-0.075
bench-0.064
 dent-0.063
 ago-0.062
Token things
Token ID1243
Feature activation+4.71e-02

Pos logit contributions
illy+0.010
 favourite+0.009
 biggest+0.009
 groceries+0.009
 favorite+0.009
Neg logit contributions
pe-0.020
ro-0.020
bench-0.018
 swoop-0.018
ider-0.018
Token without
Token ID1231
Feature activation-7.74e-03

Pos logit contributions
illy+0.073
 favourite+0.071
 biggest+0.071
 groceries+0.071
 favorite+0.069
Neg logit contributions
pe-0.084
ro-0.083
bench-0.073
 swoop-0.071
ider-0.070
Token asking
Token ID4737
Feature activation+3.96e-02

Pos logit contributions
pe+0.011
ro+0.011
bench+0.009
ider+0.009
 swoop+0.009
Neg logit contributions
illy-0.015
 groceries-0.014
 biggest-0.014
 favourite-0.014
 favorite-0.014
Token's
Token ID338
Feature activation-6.39e-03

Pos logit contributions
illy+0.008
 groceries+0.008
 favourite+0.007
 biggest+0.007
 favorite+0.007
Neg logit contributions
pe-0.021
ro-0.020
 swoop-0.019
bench-0.019
 dent-0.018
Token important
Token ID1593
Feature activation+1.42e-03

Pos logit contributions
pe+0.016
ro+0.016
bench+0.014
 swoop+0.014
ider+0.014
Neg logit contributions
illy-0.006
 groceries-0.005
 favourite-0.005
 biggest-0.005
 favorite-0.005
Token to
Token ID284
Feature activation-5.03e-02

Pos logit contributions
illy+0.001
 favourite+0.001
 groceries+0.001
 favorite+0.001
 biggest+0.001
Neg logit contributions
pe-0.004
ro-0.004
bench-0.003
 swoop-0.003
ider-0.003
Token cherish
Token ID48303
Feature activation+2.45e-02

Pos logit contributions
ro+0.080
pe+0.079
bench+0.071
ider+0.071
 dent+0.067
Neg logit contributions
illy-0.088
 biggest-0.084
 favourite-0.083
 favorite-0.083
 groceries-0.082
Token the
Token ID262
Feature activation+2.57e-03

Pos logit contributions
illy+0.032
 biggest+0.031
 favourite+0.031
 groceries+0.031
 favorite+0.030
Neg logit contributions
pe-0.050
ro-0.048
bench-0.043
 dent-0.042
 swoop-0.041
Token ones
Token ID3392
Feature activation+8.21e-02

Pos logit contributions
illy+0.003
 favourite+0.003
 biggest+0.003
 groceries+0.003
 favorite+0.003
Neg logit contributions
pe-0.005
ro-0.005
bench-0.005
 swoop-0.005
 dent-0.005
Token you
Token ID345
Feature activation+2.92e-03

Pos logit contributions
illy+0.125
 biggest+0.120
 favourite+0.119
 groceries+0.118
ippi+0.115
Neg logit contributions
pe-0.149
ro-0.131
bench-0.116
 ago-0.113
ider-0.112
Token have
Token ID423
Feature activation+1.97e-02

Pos logit contributions
illy+0.005
 biggest+0.005
 groceries+0.005
 favourite+0.005
 favorite+0.005
Neg logit contributions
pe-0.005
ro-0.005
bench-0.004
 swoop-0.004
ider-0.004
Token now
Token ID783
Feature activation-7.70e-03

Pos logit contributions
illy+0.028
 favourite+0.028
 groceries+0.027
 biggest+0.027
 favorite+0.027
Neg logit contributions
pe-0.038
ro-0.037
bench-0.033
 swoop-0.032
 dent-0.032
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
pe+0.016
ro+0.015
bench+0.014
 swoop+0.014
 dent+0.013
Neg logit contributions
 favourite-0.010
illy-0.010
 favorite-0.010
 groceries-0.010
 biggest-0.010
Token From
Token ID3574
Feature activation-1.18e-02

Pos logit contributions
illy+0.033
 favourite+0.032
 biggest+0.032
 groceries+0.031
 favorite+0.031
Neg logit contributions
pe-0.040
ro-0.039
bench-0.034
 swoop-0.034
ider-0.033
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
illy+0.047
 biggest+0.046
 favourite+0.045
 groceries+0.045
 favorite+0.043
Neg logit contributions
pe-0.046
ro-0.046
bench-0.039
 swoop-0.038
ider-0.038
Token She
Token ID1375
Feature activation-1.24e-02

Pos logit contributions
 biggest+0.032
illy+0.032
 groceries+0.032
 favourite+0.032
 favorite+0.030
Neg logit contributions
ro-0.035
pe-0.033
bench-0.029
 ago-0.028
ider-0.027
Token realized
Token ID6939
Feature activation+2.54e-02

Pos logit contributions
pe+0.020
ro+0.019
 swoop+0.017
bench+0.016
 charged+0.016
Neg logit contributions
 groceries-0.022
 biggest-0.022
illy-0.022
 favourite-0.021
 favorite-0.020
Token that
Token ID326
Feature activation-7.96e-04

Pos logit contributions
 biggest+0.038
 favourite+0.037
 favorite+0.035
 groceries+0.034
illy+0.033
Neg logit contributions
pe-0.048
ro-0.043
 swoop-0.040
 hon-0.039
h-0.039
Token protecting
Token ID10192
Feature activation+7.58e-03

Pos logit contributions
pe+0.001
ro+0.001
bench+0.001
 swoop+0.001
 dent+0.001
Neg logit contributions
illy-0.001
 biggest-0.001
 favourite-0.001
 groceries-0.001
 favorite-0.001
Token someone
Token ID2130
Feature activation+8.19e-02

Pos logit contributions
 favourite+0.011
illy+0.011
 biggest+0.011
 groceries+0.010
 favorite+0.010
Neg logit contributions
ro-0.014
pe-0.014
 swoop-0.012
 dent-0.011
bench-0.011
Token is
Token ID318
Feature activation-1.52e-04

Pos logit contributions
illy+0.123
 favourite+0.112
 groceries+0.111
 biggest+0.110
ingu+0.107
Neg logit contributions
pe-0.149
ro-0.144
 swoop-0.125
 dent-0.122
bench-0.122
Token more
Token ID517
Feature activation+1.00e-02

Pos logit contributions
pe+0.000
ro+0.000
bench+0.000
 swoop+0.000
 dent+0.000
Neg logit contributions
illy-0.000
 groceries-0.000
 favourite-0.000
 biggest-0.000
 favorite-0.000
Token important
Token ID1593
Feature activation+1.47e-02

Pos logit contributions
illy+0.007
 groceries+0.007
 favourite+0.007
 biggest+0.007
 favorite+0.006
Neg logit contributions
pe-0.027
ro-0.026
bench-0.024
 swoop-0.024
ider-0.024
Token than
Token ID621
Feature activation+1.08e-02

Pos logit contributions
 favorite+0.027
 favourite+0.027
 groceries+0.027
illy+0.027
 biggest+0.026
Neg logit contributions
pe-0.022
ro-0.021
 swoop-0.020
 dent-0.019
bench-0.018
Token having
Token ID1719
Feature activation+1.64e-02

Pos logit contributions
illy+0.016
 favourite+0.016
 biggest+0.016
 groceries+0.016
 favorite+0.015
Neg logit contributions
pe-0.020
ro-0.020
bench-0.017
 swoop-0.017
ider-0.017
Token man
Token ID582
Feature activation+3.33e-02

Pos logit contributions
illy+0.036
idy+0.033
pieces+0.032
 favourite+0.031
elling+0.031
Neg logit contributions
pe-0.033
ro-0.032
 swoop-0.026
 dent-0.026
bench-0.026
Token who
Token ID508
Feature activation+5.09e-03

Pos logit contributions
illy+0.044
 biggest+0.042
 groceries+0.042
 favourite+0.041
 favorite+0.040
Neg logit contributions
pe-0.068
ro-0.067
bench-0.058
ider-0.057
 swoop-0.057
Token gave
Token ID2921
Feature activation+4.76e-02

Pos logit contributions
illy+0.008
 favourite+0.008
 biggest+0.008
 groceries+0.008
 favorite+0.008
Neg logit contributions
pe-0.009
ro-0.009
bench-0.008
 swoop-0.008
ider-0.007
Token away
Token ID1497
Feature activation+2.10e-02

Pos logit contributions
illy+0.061
 biggest+0.056
 favourite+0.056
oll+0.055
ippi+0.055
Neg logit contributions
pe-0.094
 swoop-0.080
ro-0.080
 hon-0.069
 charged-0.069
Token juicy
Token ID35075
Feature activation+4.52e-03

Pos logit contributions
illy+0.030
 favourite+0.028
 biggest+0.027
 favorite+0.027
 groceries+0.026
Neg logit contributions
pe-0.040
ro-0.038
 swoop-0.030
 hon-0.030
 ago-0.030
Token ones
Token ID3392
Feature activation+8.16e-02

Pos logit contributions
illy+0.004
 favourite+0.003
 biggest+0.003
pieces+0.003
 favorite+0.003
Neg logit contributions
ro-0.011
pe-0.011
 swoop-0.010
h-0.009
 ago-0.009
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
illy+0.139
 biggest+0.134
 favourite+0.134
 groceries+0.131
 favorite+0.129
Neg logit contributions
pe-0.134
ro-0.123
bench-0.103
 ago-0.101
ider-0.098
Token The
Token ID383
Feature activation-2.38e-02

Pos logit contributions
illy+0.030
 groceries+0.029
 biggest+0.029
 favourite+0.029
 favorite+0.028
Neg logit contributions
pe-0.033
ro-0.032
bench-0.028
 swoop-0.027
 dent-0.027
Token End
Token ID5268
Feature activation-2.93e-02

Pos logit contributions
pe+0.039
ro+0.038
bench+0.033
 swoop+0.032
ider+0.032
Neg logit contributions
illy-0.041
 biggest-0.040
 favourite-0.040
 groceries-0.040
 favorite-0.040
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
pe+0.050
ro+0.048
bench+0.042
ider+0.041
 swoop+0.040
Neg logit contributions
illy-0.049
 biggest-0.048
 groceries-0.047
 favourite-0.047
 favorite-0.046
Token<|endoftext|>
Token ID50256
Feature activation-6.03e-02

Pos logit contributions
illy+0.025
 biggest+0.025
 groceries+0.025
 favourite+0.025
 favorite+0.024
Neg logit contributions
pe-0.027
ro-0.027
bench-0.023
 swoop-0.023
 dent-0.022
Token we
Token ID356
Feature activation+1.50e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.004
 swoop+0.004
ider+0.004
Neg logit contributions
illy-0.003
 biggest-0.003
 favourite-0.003
 groceries-0.003
 favorite-0.003
Token visit
Token ID3187
Feature activation+2.76e-02

Pos logit contributions
illy+0.026
 groceries+0.025
 biggest+0.024
 favourite+0.024
 favorite+0.023
Neg logit contributions
pe-0.024
ro-0.024
 swoop-0.021
 dent-0.021
bench-0.020
Token the
Token ID262
Feature activation+1.51e-02

Pos logit contributions
 biggest+0.054
illy+0.053
oll+0.053
idy+0.051
ovable+0.050
Neg logit contributions
ro-0.032
pe-0.030
h-0.030
 breath-0.030
 seek-0.029
Token park
Token ID3952
Feature activation-1.30e-02

Pos logit contributions
illy+0.025
pieces+0.022
 favourite+0.021
 groceries+0.021
idy+0.021
Neg logit contributions
ro-0.026
pe-0.026
 dent-0.023
bench-0.023
ider-0.022
Token to
Token ID284
Feature activation-7.84e-03

Pos logit contributions
ro+0.024
pe+0.023
ider+0.021
 swoop+0.020
bench+0.020
Neg logit contributions
illy-0.021
 groceries-0.019
 biggest-0.019
idy-0.018
 favourite-0.018
Token find
Token ID1064
Feature activation+8.15e-02

Pos logit contributions
ro+0.010
pe+0.010
 swoop+0.009
 hon+0.009
 crash+0.008
Neg logit contributions
illy-0.016
idy-0.015
 favourite-0.015
 biggest-0.015
 groceries-0.015
Token it
Token ID340
Feature activation+4.92e-02

Pos logit contributions
illy+0.152
pieces+0.143
ingu+0.142
ete+0.141
affles+0.139
Neg logit contributions
 hit-0.085
 hon-0.085
 swoop-0.083
 dent-0.082
 flash-0.080
Token?"
Token ID1701
Feature activation-3.48e-04

Pos logit contributions
illy+0.092
pieces+0.085
idy+0.082
 groceries+0.081
 favourite+0.080
Neg logit contributions
pe-0.072
ro-0.072
 swoop-0.062
ider-0.062
 seek-0.061
Token Her
Token ID2332
Feature activation-7.68e-03

Pos logit contributions
pe+0.001
ro+0.001
bench+0.001
 swoop+0.001
 dent+0.001
Neg logit contributions
illy-0.000
 biggest-0.000
 groceries-0.000
 favourite-0.000
 favorite-0.000
Token mom
Token ID1995
Feature activation-3.23e-03

Pos logit contributions
pe+0.018
ro+0.018
bench+0.016
 dent+0.016
ider+0.016
Neg logit contributions
illy-0.008
 groceries-0.007
 biggest-0.007
 favourite-0.007
 favorite-0.006
Token said
Token ID531
Feature activation-6.81e-03

Pos logit contributions
ro+0.006
pe+0.006
bench+0.006
 ago+0.006
ider+0.006
Neg logit contributions
illy-0.005
 favorite-0.004
 favourite-0.004
 new-0.004
 biggest-0.004
TokenYou
Token ID1639
Feature activation+3.33e-03

Pos logit contributions
pe+0.035
ro+0.035
bench+0.032
 dent+0.032
ider+0.031
Neg logit contributions
illy-0.032
 biggest-0.030
 favourite-0.030
 favorite-0.028
 groceries-0.028
Token should
Token ID815
Feature activation-4.53e-02

Pos logit contributions
 groceries+0.005
illy+0.005
 favourite+0.005
 biggest+0.005
 favorite+0.005
Neg logit contributions
pe-0.006
ro-0.006
 swoop-0.005
bench-0.005
 charged-0.005
Token pick
Token ID2298
Feature activation+3.52e-02

Pos logit contributions
ro+0.071
pe+0.071
bench+0.062
ider+0.060
 swoop+0.059
Neg logit contributions
illy-0.080
 biggest-0.077
 favourite-0.077
 groceries-0.076
 favorite-0.076
Token the
Token ID262
Feature activation+1.62e-02

Pos logit contributions
ingu+0.048
 favourite+0.046
 journals+0.045
illy+0.044
pieces+0.044
Neg logit contributions
pe-0.060
ro-0.057
 swoop-0.052
h-0.049
 charges-0.045
Token sweet
Token ID6029
Feature activation+1.33e-02

Pos logit contributions
illy+0.021
pieces+0.020
ingu+0.018
 favourite+0.017
idy+0.017
Neg logit contributions
pe-0.033
ro-0.033
 swoop-0.029
 dent-0.028
 wob-0.027
Token ones
Token ID3392
Feature activation+8.13e-02

Pos logit contributions
 favourite+0.014
illy+0.014
pieces+0.013
ingu+0.013
 biggest+0.013
Neg logit contributions
ro-0.030
pe-0.028
 swoop-0.026
h-0.024
 dent-0.024
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
illy+0.135
 favourite+0.133
 favorite+0.128
 biggest+0.127
 groceries+0.126
Neg logit contributions
pe-0.140
ro-0.125
 ago-0.105
h-0.105
bench-0.104
Token They
Token ID1119
Feature activation+5.40e-04

Pos logit contributions
illy+0.018
 groceries+0.017
 favourite+0.017
 biggest+0.017
 favorite+0.017
Neg logit contributions
pe-0.021
ro-0.021
bench-0.018
ider-0.018
 swoop-0.018
Token taste
Token ID6938
Feature activation+2.83e-02

Pos logit contributions
illy+0.001
 groceries+0.001
 favourite+0.001
 biggest+0.001
 favorite+0.001
Neg logit contributions
pe-0.001
ro-0.001
 swoop-0.001
bench-0.001
 dent-0.001
Token much
Token ID881
Feature activation+3.62e-02

Pos logit contributions
illy+0.037
ippi+0.035
 favourite+0.035
 groceries+0.035
 biggest+0.033
Neg logit contributions
pe-0.054
ro-0.049
 swoop-0.047
 hon-0.045
h-0.044
Token better
Token ID1365
Feature activation+1.96e-03

Pos logit contributions
illy+0.043
 favourite+0.041
 biggest+0.039
 favorite+0.039
 groceries+0.038
Neg logit contributions
pe-0.077
ro-0.077
 swoop-0.069
 dent-0.068
bench-0.068
Token man
Token ID582
Feature activation+6.26e-04

Pos logit contributions
 biggest+0.012
 new+0.012
 favourite+0.011
 favorite+0.011
illy+0.011
Neg logit contributions
pe-0.030
ro-0.029
bench-0.027
ider-0.027
 ago-0.026
Token walking
Token ID6155
Feature activation-2.62e-02

Pos logit contributions
illy+0.001
 groceries+0.001
 favourite+0.001
 new+0.001
 favorite+0.001
Neg logit contributions
ro-0.001
pe-0.001
bench-0.001
ider-0.001
 dent-0.001
Token on
Token ID319
Feature activation-1.45e-02

Pos logit contributions
pe+0.057
ro+0.055
ider+0.052
bench+0.051
 dent+0.051
Neg logit contributions
 groceries-0.029
illy-0.029
 favorite-0.029
 favourite-0.028
 biggest-0.028
Token a
Token ID257
Feature activation+3.49e-02

Pos logit contributions
pe+0.034
ro+0.033
ider+0.031
bench+0.030
 swoop+0.029
Neg logit contributions
 groceries-0.015
illy-0.015
 favourite-0.014
 biggest-0.014
 favorite-0.014
Token tight
Token ID5381
Feature activation-3.63e-02

Pos logit contributions
illy+0.073
 favourite+0.069
 groceries+0.069
 biggest+0.069
 favorite+0.068
Neg logit contributions
pe-0.047
ro-0.045
 swoop-0.037
bench-0.036
 dent-0.035
Tokenro
Token ID305
Feature activation-1.12e-01

Pos logit contributions
ider+0.064
pe+0.062
 fought+0.057
 charged+0.057
bench+0.055
Neg logit contributions
 new-0.051
illy-0.050
 groceries-0.049
 favorite-0.048
 tomato-0.047
Tokenpe
Token ID431
Feature activation-3.65e-02

Pos logit contributions
Uh+0.117
 dent+0.101
ider+0.100
Copy+0.098
 collapse+0.097
Neg logit contributions
's-0.217
 favorite-0.208
 favourite-0.204
illy-0.204
ly-0.200
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
ro+0.068
bench+0.067
 dent+0.064
pe+0.062
ider+0.062
Neg logit contributions
 favorite-0.046
's-0.045
 favourite-0.045
illy-0.045
 new-0.042
Token He
Token ID679
Feature activation-1.33e-02

Pos logit contributions
illy+0.023
 biggest+0.022
 favourite+0.022
 groceries+0.022
 favorite+0.022
Neg logit contributions
ro-0.024
pe-0.024
bench-0.021
 swoop-0.020
 dent-0.020
Token had
Token ID550
Feature activation+5.73e-03

Pos logit contributions
pe+0.022
ro+0.021
bench+0.018
 swoop+0.018
ider+0.018
Neg logit contributions
illy-0.023
 biggest-0.022
 groceries-0.022
 favourite-0.022
 favorite-0.022
Token to
Token ID284
Feature activation-6.83e-02

Pos logit contributions
illy+0.007
 favourite+0.007
 groceries+0.007
 biggest+0.007
 favorite+0.007
Neg logit contributions
pe-0.012
ro-0.012
bench-0.010
 swoop-0.010
 dent-0.010
Token it
Token ID340
Feature activation+1.76e-02

Pos logit contributions
illy+0.003
 favourite+0.003
 biggest+0.003
 favorite+0.003
 groceries+0.003
Neg logit contributions
pe-0.005
ro-0.005
bench-0.005
 swoop-0.005
 dent-0.005
Token here
Token ID994
Feature activation-2.09e-02

Pos logit contributions
illy+0.024
 biggest+0.023
 favourite+0.023
 groceries+0.023
 favorite+0.023
Neg logit contributions
pe-0.035
ro-0.034
bench-0.031
 dent-0.030
 swoop-0.029
Token a
Token ID257
Feature activation+7.32e-03

Pos logit contributions
pe+0.037
ro+0.036
bench+0.032
 swoop+0.031
 dent+0.031
Neg logit contributions
illy-0.033
 favourite-0.032
 biggest-0.032
 groceries-0.032
 favorite-0.031
Token long
Token ID890
Feature activation-4.47e-02

Pos logit contributions
illy+0.010
 biggest+0.010
 favourite+0.010
 groceries+0.010
 favorite+0.010
Neg logit contributions
pe-0.014
ro-0.014
bench-0.012
ider-0.012
 swoop-0.012
Token time
Token ID640
Feature activation-4.64e-02

Pos logit contributions
ider+0.090
pe+0.085
 bleed+0.083
bench+0.078
 dodging+0.078
Neg logit contributions
 other-0.049
 tomato-0.048
 new-0.048
 tomatoes-0.048
 favorite-0.047
Token ago
Token ID2084
Feature activation-1.12e-01

Pos logit contributions
pe+0.050
ro+0.048
bench+0.041
 dent+0.040
 swoop+0.039
Neg logit contributions
 favorite-0.102
 biggest-0.100
 favourite-0.100
 groceries-0.100
 new-0.098
Token."
Token ID526
Feature activation-9.30e-03

Pos logit contributions
 dent+0.238
ro+0.238
pe+0.237
Uh+0.233
 swoop+0.231
Neg logit contributions
 groceries-0.110
 tomatoes-0.104
 sandwiches-0.102
 favorite-0.102
 biggest-0.098
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
pe+0.019
ro+0.018
bench+0.017
 swoop+0.016
ider+0.016
Neg logit contributions
 biggest-0.012
 favorite-0.012
 groceries-0.012
 favourite-0.012
illy-0.012
Token
Token ID198
Feature activation-2.12e-02

Pos logit contributions
pe+0.027
ro+0.027
bench+0.023
 swoop+0.023
 dent+0.022
Neg logit contributions
illy-0.027
 groceries-0.026
 favourite-0.026
 biggest-0.026
 favorite-0.025
Token"
Token ID1
Feature activation-1.97e-02

Pos logit contributions
ro+0.041
pe+0.040
bench+0.038
 ago+0.037
 dent+0.036
Neg logit contributions
 biggest-0.027
 new-0.027
 groceries-0.026
illy-0.026
 favorite-0.026
TokenMaybe
Token ID13300
Feature activation+1.70e-02

Pos logit contributions
ro+0.035
pe+0.034
bench+0.034
 ago+0.034
 dent+0.033
Neg logit contributions
 biggest-0.028
 favorite-0.027
 favourite-0.027
 groceries-0.026
 new-0.025
Token walk
Token ID2513
Feature activation-5.49e-03

Pos logit contributions
ro+0.052
pe+0.051
bench+0.048
ider+0.046
 dent+0.045
Neg logit contributions
 favourite-0.043
 favorite-0.042
 biggest-0.041
 amazing-0.039
 new-0.038
Token around
Token ID1088
Feature activation-1.21e-03

Pos logit contributions
pe+0.011
ro+0.011
bench+0.010
 swoop+0.009
ider+0.009
Neg logit contributions
illy-0.008
 groceries-0.007
 biggest-0.007
 favourite-0.007
 favorite-0.007
Token on
Token ID319
Feature activation-8.35e-03

Pos logit contributions
pe+0.003
ro+0.003
ider+0.003
bench+0.003
 dent+0.003
Neg logit contributions
 favourite-0.001
 new-0.001
 favorite-0.001
 biggest-0.001
 amazing-0.001
Token a
Token ID257
Feature activation+3.64e-02

Pos logit contributions
pe+0.017
ro+0.017
bench+0.015
 swoop+0.014
 dent+0.014
Neg logit contributions
illy-0.011
 favourite-0.011
 biggest-0.011
 groceries-0.010
 favorite-0.010
Token tight
Token ID5381
Feature activation-4.24e-02

Pos logit contributions
illy+0.060
 groceries+0.055
elling+0.053
 favourite+0.051
 biggest+0.051
Neg logit contributions
pe-0.069
ro-0.064
 swoop-0.059
bench-0.057
 hon-0.055
Tokenro
Token ID305
Feature activation-1.10e-01

Pos logit contributions
ider+0.073
pe+0.069
 dent+0.065
 charged+0.064
bench+0.064
Neg logit contributions
 new-0.061
 biggest-0.059
 favorite-0.059
 favourite-0.058
 tomatoes-0.057
Tokenpe
Token ID431
Feature activation-3.79e-02

Pos logit contributions
Uh+0.117
 dent+0.106
 collapse+0.099
 bleed+0.095
 whine+0.091
Neg logit contributions
 favorite-0.233
 favourite-0.228
's-0.226
 new-0.211
illy-0.208
Token,
Token ID11
Feature activation-5.41e-03

Pos logit contributions
ro+0.065
bench+0.065
 dent+0.065
pe+0.062
ider+0.060
Neg logit contributions
 favorite-0.055
 favourite-0.054
 biggest-0.051
's-0.050
illy-0.050
Token he
Token ID339
Feature activation+8.44e-03

Pos logit contributions
pe+0.011
ro+0.010
bench+0.009
ider+0.009
 dent+0.009
Neg logit contributions
 favourite-0.008
 favorite-0.008
 biggest-0.007
 groceries-0.007
 new-0.007
Token could
Token ID714
Feature activation-2.89e-02

Pos logit contributions
 favourite+0.011
 favorite+0.011
 biggest+0.011
illy+0.010
 amazing+0.010
Neg logit contributions
pe-0.017
ro-0.017
bench-0.015
ider-0.015
 dent-0.015
Token even
Token ID772
Feature activation+3.02e-02

Pos logit contributions
ro+0.055
pe+0.055
bench+0.053
ider+0.050
 dent+0.049
Neg logit contributions
 favourite-0.041
 favorite-0.041
 biggest-0.039
 amazing-0.037
 new-0.037
Token famous
Token ID5863
Feature activation-1.78e-02

Pos logit contributions
illy+0.027
 groceries+0.026
 favourite+0.025
 biggest+0.025
 favorite+0.024
Neg logit contributions
pe-0.053
ro-0.052
 swoop-0.047
bench-0.047
 dent-0.046
Token artist
Token ID6802
Feature activation-2.43e-03

Pos logit contributions
pe+0.032
ro+0.032
bench+0.028
ider+0.027
 swoop+0.027
Neg logit contributions
illy-0.027
 biggest-0.027
 favourite-0.027
 groceries-0.026
 favorite-0.026
Token a
Token ID257
Feature activation+1.50e-02

Pos logit contributions
ro+0.005
pe+0.005
 dent+0.004
bench+0.004
 swoop+0.004
Neg logit contributions
 biggest-0.003
 favorite-0.003
 favourite-0.003
illy-0.003
 new-0.003
Token long
Token ID890
Feature activation-3.55e-02

Pos logit contributions
illy+0.016
 groceries+0.016
 favourite+0.015
 biggest+0.015
 favorite+0.015
Neg logit contributions
pe-0.034
ro-0.033
bench-0.030
 swoop-0.030
 dent-0.029
Token time
Token ID640
Feature activation-4.02e-02

Pos logit contributions
ider+0.075
 bleed+0.066
pe+0.066
ro+0.065
 dodging+0.064
Neg logit contributions
 new-0.040
 favorite-0.039
 tomato-0.039
 favourite-0.039
 amazing-0.038
Token ago
Token ID2084
Feature activation-1.10e-01

Pos logit contributions
pe+0.073
ro+0.072
bench+0.067
 dent+0.066
 swoop+0.062
Neg logit contributions
 favorite-0.059
 biggest-0.059
 favourite-0.058
 new-0.055
 amazing-0.055
Token."
Token ID526
Feature activation+6.76e-03

Pos logit contributions
 dent+0.240
pe+0.236
Uh+0.233
ro+0.233
 swoop+0.232
Neg logit contributions
 favorite-0.109
 groceries-0.101
 favourite-0.100
 biggest-0.100
illy-0.099
Token
Token ID220
Feature activation-1.30e-02

Pos logit contributions
illy+0.012
 biggest+0.011
 favourite+0.011
 groceries+0.011
 favorite+0.011
Neg logit contributions
pe-0.011
ro-0.011
bench-0.009
 swoop-0.009
ider-0.009
Token
Token ID198
Feature activation-1.14e-02

Pos logit contributions
pe+0.021
ro+0.021
bench+0.018
 swoop+0.017
ider+0.017
Neg logit contributions
illy-0.022
 groceries-0.022
 biggest-0.022
 favourite-0.022
 favorite-0.021
Token
Token ID198
Feature activation-7.88e-03

Pos logit contributions
pe+0.021
ro+0.020
bench+0.018
 swoop+0.017
ider+0.017
Neg logit contributions
illy-0.018
 favourite-0.017
 biggest-0.017
 groceries-0.017
 favorite-0.017
TokenThe
Token ID464
Feature activation-2.00e-02

Pos logit contributions
pe+0.015
ro+0.015
bench+0.013
 dent+0.013
ider+0.013
Neg logit contributions
illy-0.011
 favourite-0.011
 biggest-0.011
 groceries-0.011
 favorite-0.011
Token painted
Token ID13055
Feature activation+9.86e-03

Pos logit contributions
 biggest+0.034
 favourite+0.034
 favorite+0.034
illy+0.034
 groceries+0.034
Neg logit contributions
pe-0.035
ro-0.035
bench-0.031
ider-0.030
 dent-0.029
Token them
Token ID606
Feature activation+3.26e-02

Pos logit contributions
illy+0.012
 groceries+0.011
 favourite+0.011
 biggest+0.010
 favorite+0.010
Neg logit contributions
pe-0.021
ro-0.021
 swoop-0.019
 ago-0.018
bench-0.018
Token a
Token ID257
Feature activation+2.00e-02

Pos logit contributions
illy+0.039
 favourite+0.038
 biggest+0.038
 groceries+0.038
 favorite+0.038
Neg logit contributions
pe-0.069
ro-0.068
bench-0.062
 swoop-0.060
ider-0.060
Token long
Token ID890
Feature activation-3.66e-02

Pos logit contributions
illy+0.022
 biggest+0.021
 favourite+0.021
 groceries+0.021
 favorite+0.021
Neg logit contributions
pe-0.045
ro-0.044
bench-0.040
ider-0.039
 swoop-0.039
Token time
Token ID640
Feature activation-4.63e-02

Pos logit contributions
pe+0.064
ider+0.063
bench+0.060
 bleed+0.058
ro+0.057
Neg logit contributions
 favourite-0.052
 favorite-0.051
 new-0.051
 biggest-0.048
 tomato-0.048
Token ago
Token ID2084
Feature activation-1.10e-01

Pos logit contributions
pe+0.075
ro+0.070
 swoop+0.065
bench+0.065
Ar+0.064
Neg logit contributions
 favorite-0.077
 favourite-0.077
 new-0.075
 biggest-0.075
 amazing-0.074
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
Uh+0.245
Ar+0.245
Neigh+0.244
pe+0.244
 dent+0.239
Neg logit contributions
's-0.095
 favorite-0.094
illy-0.092
 favourite-0.089
 groceries-0.089
Token She
Token ID1375
Feature activation-1.36e-02

Pos logit contributions
illy+0.031
idy+0.029
 biggest+0.029
 groceries+0.028
 amazing+0.027
Neg logit contributions
ro-0.030
pe-0.026
ider-0.024
bench-0.023
 ago-0.022
Token thought
Token ID1807
Feature activation+3.22e-02

Pos logit contributions
pe+0.022
ro+0.021
 swoop+0.018
bench+0.018
ider+0.017
Neg logit contributions
illy-0.024
 groceries-0.024
 biggest-0.024
 favourite-0.023
 favorite-0.023
Token they
Token ID484
Feature activation-8.77e-05

Pos logit contributions
 biggest+0.056
illy+0.054
 groceries+0.053
elling+0.053
ingu+0.051
Neg logit contributions
ro-0.047
pe-0.047
 ago-0.045
 crash-0.041
 spe-0.039
Token were
Token ID547
Feature activation-1.49e-02

Pos logit contributions
pe+0.000
ro+0.000
 swoop+0.000
 fought+0.000
 wob+0.000
Neg logit contributions
illy-0.000
 groceries-0.000
 biggest-0.000
 tomatoes-0.000
ummy-0.000
Token same
Token ID976
Feature activation+1.01e-02

Pos logit contributions
illy+0.013
 groceries+0.012
 favourite+0.012
 tomatoes+0.012
 biggest+0.012
Neg logit contributions
ro-0.021
pe-0.021
bench-0.018
 swoop-0.018
 dent-0.018
Token dress
Token ID6576
Feature activation-8.19e-03

Pos logit contributions
illy+0.013
 favourite+0.013
 biggest+0.013
 groceries+0.013
 favorite+0.013
Neg logit contributions
pe-0.020
ro-0.020
bench-0.018
 swoop-0.017
ider-0.017
Token a
Token ID257
Feature activation+2.17e-02

Pos logit contributions
ro+0.015
pe+0.015
bench+0.013
ider+0.012
 swoop+0.012
Neg logit contributions
illy-0.013
 groceries-0.012
 biggest-0.012
 favourite-0.012
 favorite-0.012
Token long
Token ID890
Feature activation-3.52e-02

Pos logit contributions
illy+0.026
 favourite+0.025
 biggest+0.025
 groceries+0.025
 favorite+0.025
Neg logit contributions
pe-0.047
ro-0.046
bench-0.042
 swoop-0.041
ider-0.040
Token time
Token ID640
Feature activation-4.02e-02

Pos logit contributions
pe+0.077
ider+0.072
 collapse+0.072
 dodging+0.071
stack+0.071
Neg logit contributions
 favourite-0.040
 favorite-0.039
 new-0.036
illy-0.034
 biggest-0.032
Token ago
Token ID2084
Feature activation-1.10e-01

Pos logit contributions
pe+0.074
ro+0.070
bench+0.065
 dent+0.064
 swoop+0.064
Neg logit contributions
 favourite-0.061
illy-0.060
 favorite-0.060
 biggest-0.058
 groceries-0.057
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
 dent+0.240
pe+0.237
Uh+0.234
 swoop+0.234
 bleed+0.231
Neg logit contributions
 favorite-0.106
 favourite-0.106
illy-0.104
 groceries-0.101
's-0.095
Token<|endoftext|>
Token ID50256
Feature activation-5.85e-02

Pos logit contributions
 groceries+0.035
illy+0.034
 biggest+0.033
 favourite+0.033
 tomatoes+0.032
Neg logit contributions
ro-0.028
pe-0.027
bench-0.023
 ago-0.022
ider-0.021
TokenOnce
Token ID7454
Feature activation+2.59e-02

Pos logit contributions
pe+0.117
ro+0.114
ider+0.102
 swoop+0.102
bench+0.101
Neg logit contributions
illy-0.078
 biggest-0.075
 favourite-0.075
 groceries-0.075
 favorite-0.074
Token upon
Token ID2402
Feature activation+5.55e-03

Pos logit contributions
 groceries+0.047
illy+0.046
 biggest+0.046
 favourite+0.045
 tomatoes+0.044
Neg logit contributions
pe-0.037
ro-0.037
bench-0.032
 ago-0.031
 swoop-0.030
Token a
Token ID257
Feature activation+5.94e-03

Pos logit contributions
illy+0.006
 favourite+0.005
 biggest+0.005
 groceries+0.005
 favorite+0.005
Neg logit contributions
pe-0.013
ro-0.013
bench-0.012
 swoop-0.011
ider-0.011
Token important
Token ID1593
Feature activation-2.14e-02

Pos logit contributions
illy+0.032
 groceries+0.032
 favourite+0.031
pieces+0.030
cakes+0.030
Neg logit contributions
ro-0.044
pe-0.043
 ago-0.042
 swoop-0.038
 dent-0.037
Token person
Token ID1048
Feature activation+3.49e-02

Pos logit contributions
pe+0.040
ro+0.039
bench+0.034
 swoop+0.033
ider+0.033
Neg logit contributions
illy-0.032
 groceries-0.031
 biggest-0.031
 favourite-0.031
 favorite-0.030
Token a
Token ID257
Feature activation-2.76e-03

Pos logit contributions
illy+0.052
 biggest+0.051
 groceries+0.050
 favourite+0.050
 favorite+0.049
Neg logit contributions
pe-0.065
ro-0.063
bench-0.055
ider-0.054
 ago-0.054
Token long
Token ID890
Feature activation-4.46e-02

Pos logit contributions
pe+0.006
ro+0.006
bench+0.005
 swoop+0.005
ider+0.005
Neg logit contributions
illy-0.003
 groceries-0.003
 favourite-0.003
 biggest-0.003
 favorite-0.003
Token time
Token ID640
Feature activation-3.93e-02

Pos logit contributions
pe+0.087
ider+0.080
ro+0.079
bench+0.076
 dent+0.075
Neg logit contributions
 favorite-0.060
 favourite-0.060
illy-0.058
 new-0.055
 biggest-0.055
Token ago
Token ID2084
Feature activation-1.09e-01

Pos logit contributions
pe+0.044
ro+0.044
 dent+0.037
bench+0.037
 swoop+0.035
Neg logit contributions
 favourite-0.085
 favorite-0.084
illy-0.083
 biggest-0.082
 groceries-0.082
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
ro+0.233
 dent+0.232
pe+0.230
 swoop+0.225
Uh+0.222
Neg logit contributions
illy-0.110
 favorite-0.109
 favourite-0.109
 groceries-0.107
 tomatoes-0.097
Token She
Token ID1375
Feature activation-1.86e-02

Pos logit contributions
illy+0.022
 biggest+0.021
 groceries+0.021
 favourite+0.021
 favorite+0.020
Neg logit contributions
ro-0.024
pe-0.023
bench-0.020
 ago-0.019
ider-0.019
Token was
Token ID373
Feature activation-1.05e-02

Pos logit contributions
pe+0.030
ro+0.029
 swoop+0.026
bench+0.025
 charged+0.024
Neg logit contributions
 groceries-0.032
 biggest-0.032
illy-0.032
 favourite-0.031
 favorite-0.030
Token excited
Token ID6568
Feature activation-1.53e-04

Pos logit contributions
ro+0.025
pe+0.024
bench+0.021
 dent+0.021
ider+0.021
Neg logit contributions
 favourite-0.011
 favorite-0.011
 biggest-0.011
 amazing-0.011
 new-0.011
Token and
Token ID290
Feature activation-9.44e-03

Pos logit contributions
pe+0.000
ro+0.000
bench+0.000
 swoop+0.000
ider+0.000
Neg logit contributions
illy-0.000
 biggest-0.000
 favourite-0.000
 groceries-0.000
 favorite-0.000
Token he
Token ID339
Feature activation+1.18e-02

Pos logit contributions
illy+0.064
 groceries+0.061
 biggest+0.059
 favourite+0.058
 favorite+0.057
Neg logit contributions
pe-0.097
ro-0.093
bench-0.083
 swoop-0.080
ider-0.080
Token had
Token ID550
Feature activation+1.29e-02

Pos logit contributions
illy+0.020
 favourite+0.020
 biggest+0.020
 groceries+0.019
 favorite+0.019
Neg logit contributions
pe-0.019
ro-0.019
bench-0.017
 swoop-0.016
ider-0.016
Token seen
Token ID1775
Feature activation+1.87e-02

Pos logit contributions
illy+0.020
 favourite+0.020
 favorite+0.019
 biggest+0.019
 groceries+0.019
Neg logit contributions
pe-0.023
ro-0.023
bench-0.020
 dent-0.020
ider-0.019
Token a
Token ID257
Feature activation+1.94e-02

Pos logit contributions
illy+0.030
 biggest+0.029
 favourite+0.029
 groceries+0.029
 favorite+0.029
Neg logit contributions
pe-0.032
ro-0.031
bench-0.027
 swoop-0.026
ider-0.026
Token year
Token ID614
Feature activation-6.81e-03

Pos logit contributions
illy+0.024
 biggest+0.024
 favourite+0.024
 groceries+0.023
 favorite+0.023
Neg logit contributions
pe-0.041
ro-0.040
bench-0.036
ider-0.035
 swoop-0.035
Token ago
Token ID2084
Feature activation-1.08e-01

Pos logit contributions
pe+0.012
ro+0.012
bench+0.011
 ago+0.011
 swoop+0.010
Neg logit contributions
illy-0.010
 favourite-0.010
 groceries-0.010
 biggest-0.010
 favorite-0.010
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
ro+0.230
 dent+0.225
pe+0.222
Uh+0.221
 swoop+0.221
Neg logit contributions
 groceries-0.108
 favourite-0.107
illy-0.106
 favorite-0.104
's-0.096
Token He
Token ID679
Feature activation-8.30e-03

Pos logit contributions
illy+0.022
 groceries+0.022
 biggest+0.021
 favourite+0.021
 favorite+0.021
Neg logit contributions
pe-0.021
ro-0.021
bench-0.018
ider-0.017
 swoop-0.017
Token went
Token ID1816
Feature activation-5.79e-03

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
ider+0.012
 dent+0.012
Neg logit contributions
illy-0.014
 favourite-0.013
 favorite-0.013
 biggest-0.013
 groceries-0.013
Token home
Token ID1363
Feature activation-2.14e-02

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
 swoop+0.012
ider+0.012
Neg logit contributions
illy-0.006
 biggest-0.006
 favourite-0.005
 groceries-0.005
 favorite-0.005
Token,
Token ID11
Feature activation-9.55e-05

Pos logit contributions
pe+0.050
ro+0.048
 swoop+0.047
 dent+0.045
ider+0.044
Neg logit contributions
 new-0.023
 favorite-0.020
 favourite-0.020
 groceries-0.019
 lots-0.018
Token it
Token ID340
Feature activation+2.56e-02

Pos logit contributions
illy+0.012
 favourite+0.011
 biggest+0.011
 favorite+0.011
 groceries+0.011
Neg logit contributions
pe-0.018
ro-0.018
bench-0.016
 swoop-0.016
 ago-0.015
Token here
Token ID994
Feature activation-1.52e-02

Pos logit contributions
illy+0.036
 favourite+0.036
 biggest+0.036
 groceries+0.035
 favorite+0.035
Neg logit contributions
pe-0.049
ro-0.048
bench-0.043
 swoop-0.042
 dent-0.041
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
pe+0.027
ro+0.026
bench+0.023
 swoop+0.022
ider+0.022
Neg logit contributions
illy-0.024
 favourite-0.024
 biggest-0.024
 groceries-0.024
 favorite-0.023
Token long
Token ID890
Feature activation-3.58e-02

Pos logit contributions
illy+0.022
 favourite+0.021
 groceries+0.021
 biggest+0.021
 favorite+0.020
Neg logit contributions
pe-0.030
ro-0.029
bench-0.025
 swoop-0.025
 dent-0.025
Token time
Token ID640
Feature activation-3.82e-02

Pos logit contributions
ider+0.072
pe+0.069
 bleed+0.066
 dodging+0.064
 collapse+0.063
Neg logit contributions
 tomato-0.041
 tomatoes-0.040
 favorite-0.039
 favourite-0.038
 spaghetti-0.038
Token ago
Token ID2084
Feature activation-1.08e-01

Pos logit contributions
pe+0.045
ro+0.042
bench+0.037
 dent+0.036
 swoop+0.036
Neg logit contributions
 favorite-0.081
 biggest-0.081
 favourite-0.080
illy-0.079
 groceries-0.079
Token."
Token ID526
Feature activation+1.85e-03

Pos logit contributions
pe+0.228
 dent+0.227
ro+0.225
 swoop+0.225
Uh+0.220
Neg logit contributions
 groceries-0.106
 favorite-0.105
illy-0.103
 tomatoes-0.103
 biggest-0.102
Token
Token ID198
Feature activation-7.77e-03

Pos logit contributions
illy+0.003
 groceries+0.003
 biggest+0.003
 favourite+0.003
 favorite+0.003
Neg logit contributions
pe-0.003
ro-0.003
bench-0.003
 swoop-0.003
ider-0.003
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
pe+0.013
ro+0.013
bench+0.011
 swoop+0.011
 dent+0.011
Neg logit contributions
illy-0.013
 groceries-0.013
 biggest-0.013
 favourite-0.013
 favorite-0.013
TokenThey
Token ID2990
Feature activation-1.53e-02

Pos logit contributions
ro+0.025
pe+0.024
bench+0.023
ider+0.022
 dent+0.022
Neg logit contributions
illy-0.020
 favourite-0.018
 biggest-0.018
 groceries-0.018
 favorite-0.018
Token decided
Token ID3066
Feature activation+1.14e-02

Pos logit contributions
pe+0.023
ro+0.023
bench+0.020
 swoop+0.019
ider+0.019
Neg logit contributions
illy-0.028
 biggest-0.027
 groceries-0.027
 favourite-0.027
 favorite-0.027
Token by
Token ID416
Feature activation-1.82e-02

Pos logit contributions
illy+0.056
 groceries+0.055
 favourite+0.052
 biggest+0.050
 favorite+0.049
Neg logit contributions
pe-0.104
ro-0.104
 swoop-0.095
 hon-0.091
bench-0.090
Token a
Token ID257
Feature activation+2.57e-02

Pos logit contributions
pe+0.038
ro+0.038
ider+0.036
bench+0.036
 dent+0.033
Neg logit contributions
 biggest-0.022
illy-0.021
 favourite-0.021
 favorite-0.021
 new-0.021
Token famous
Token ID5863
Feature activation-2.12e-02

Pos logit contributions
illy+0.030
 biggest+0.030
 favourite+0.029
 favorite+0.029
 groceries+0.029
Neg logit contributions
pe-0.056
ro-0.055
bench-0.050
ider-0.049
 swoop-0.048
Token artist
Token ID6802
Feature activation-5.45e-03

Pos logit contributions
pe+0.038
ro+0.037
bench+0.033
 swoop+0.032
ider+0.032
Neg logit contributions
illy-0.033
 biggest-0.032
 favourite-0.032
 groceries-0.032
 favorite-0.032
Token long
Token ID890
Feature activation-2.51e-02

Pos logit contributions
ro+0.011
pe+0.011
 dent+0.010
bench+0.010
 swoop+0.010
Neg logit contributions
 favorite-0.007
 favourite-0.007
 biggest-0.007
illy-0.007
 groceries-0.006
Token ago
Token ID2084
Feature activation-1.08e-01

Pos logit contributions
ider+0.048
pe+0.046
ro+0.045
Uh+0.043
 dent+0.043
Neg logit contributions
 favorite-0.030
 other-0.030
 favourite-0.030
 new-0.029
 biggest-0.029
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
Uh+0.244
 dent+0.242
 swoop+0.236
 bleed+0.234
ro+0.233
Neg logit contributions
 favorite-0.103
 favourite-0.099
illy-0.091
 groceries-0.091
's-0.089
Token
Token ID220
Feature activation-2.79e-03

Pos logit contributions
illy+0.034
 groceries+0.033
 biggest+0.033
 favourite+0.033
 favorite+0.032
Neg logit contributions
pe-0.039
ro-0.039
bench-0.034
 swoop-0.033
ider-0.032
Token
Token ID198
Feature activation-2.96e-03

Pos logit contributions
pe+0.004
ro+0.004
bench+0.004
 dent+0.003
 swoop+0.003
Neg logit contributions
illy-0.005
 groceries-0.005
 favourite-0.005
 biggest-0.005
 favorite-0.005
Token
Token ID198
Feature activation-3.96e-03

Pos logit contributions
pe+0.005
ro+0.005
bench+0.004
 swoop+0.004
ider+0.004
Neg logit contributions
illy-0.005
 groceries-0.005
 biggest-0.005
 favourite-0.005
 favorite-0.005
TokenOne
Token ID3198
Feature activation+5.49e-03

Pos logit contributions
ro+0.008
pe+0.008
 dent+0.007
bench+0.007
ider+0.007
Neg logit contributions
illy-0.005
 favourite-0.005
 favorite-0.005
 biggest-0.005
 groceries-0.005
Token the
Token ID262
Feature activation-3.77e-03

Pos logit contributions
illy+0.024
 favourite+0.022
 biggest+0.021
 favorite+0.021
 groceries+0.021
Neg logit contributions
pe-0.031
ro-0.030
 dent-0.026
bench-0.026
 swoop-0.026
Token scale
Token ID5046
Feature activation+7.00e-03

Pos logit contributions
pe+0.006
ro+0.006
bench+0.006
 dent+0.005
 swoop+0.005
Neg logit contributions
illy-0.006
 favourite-0.006
 groceries-0.006
 biggest-0.006
 favorite-0.006
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
illy+0.011
 biggest+0.010
 groceries+0.010
 favourite+0.010
 favorite+0.010
Neg logit contributions
pe-0.013
ro-0.013
bench-0.011
ider-0.011
 swoop-0.011
Token long
Token ID890
Feature activation-3.77e-02

Pos logit contributions
 favourite+0.017
 biggest+0.016
 favorite+0.016
 groceries+0.016
 new+0.016
Neg logit contributions
pe-0.036
ro-0.036
bench-0.032
ider-0.032
 swoop-0.031
Token time
Token ID640
Feature activation-3.90e-02

Pos logit contributions
pe+0.079
ider+0.079
 dodging+0.072
 bleed+0.072
cks+0.072
Neg logit contributions
 favorite-0.041
 favourite-0.041
 new-0.037
 groceries-0.037
 spaghetti-0.036
Token ago
Token ID2084
Feature activation-1.08e-01

Pos logit contributions
pe+0.073
ro+0.069
 swoop+0.064
bench+0.063
 dent+0.063
Neg logit contributions
 favourite-0.057
 favorite-0.057
illy-0.055
 groceries-0.055
 biggest-0.053
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
pe+0.232
 dent+0.232
Uh+0.232
ro+0.231
 swoop+0.230
Neg logit contributions
 favorite-0.098
 groceries-0.097
 favourite-0.096
illy-0.094
's-0.091
Token
Token ID198
Feature activation+2.55e-03

Pos logit contributions
illy+0.032
 groceries+0.031
 biggest+0.031
 favourite+0.031
 favorite+0.030
Neg logit contributions
pe-0.037
ro-0.036
bench-0.031
 swoop-0.031
ider-0.030
Token
Token ID198
Feature activation-2.43e-03

Pos logit contributions
illy+0.004
 groceries+0.004
 biggest+0.004
 favourite+0.004
 favorite+0.004
Neg logit contributions
pe-0.004
ro-0.004
bench-0.004
 swoop-0.004
 dent-0.004
TokenAs
Token ID1722
Feature activation+2.16e-02

Pos logit contributions
ro+0.004
pe+0.004
bench+0.004
 dent+0.004
ider+0.003
Neg logit contributions
illy-0.004
 favourite-0.004
 biggest-0.004
 groceries-0.004
 favorite-0.004
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
illy+0.030
 biggest+0.029
 favourite+0.029
 groceries+0.028
 favorite+0.028
Neg logit contributions
pe-0.043
ro-0.042
 swoop-0.037
bench-0.037
 dent-0.036
Token Jim
Token ID5395
Feature activation+1.80e-02

Pos logit contributions
pe+0.049
ro+0.047
bench+0.042
 swoop+0.042
 dent+0.041
Neg logit contributions
illy-0.030
 favourite-0.029
 biggest-0.029
 groceries-0.029
 favorite-0.029
Token from
Token ID422
Feature activation-1.55e-02

Pos logit contributions
illy+0.025
 favourite+0.024
 groceries+0.024
 favorite+0.024
 biggest+0.024
Neg logit contributions
pe-0.035
ro-0.034
bench-0.031
ider-0.030
 dent-0.030
Token a
Token ID257
Feature activation+1.00e-02

Pos logit contributions
pe+0.035
ro+0.034
ider+0.031
bench+0.031
 dent+0.030
Neg logit contributions
 groceries-0.017
illy-0.017
 favourite-0.016
 favorite-0.016
 biggest-0.016
Token long
Token ID890
Feature activation-3.67e-02

Pos logit contributions
 biggest+0.011
 favourite+0.011
illy+0.011
 groceries+0.011
 favorite+0.011
Neg logit contributions
pe-0.022
ro-0.022
bench-0.020
ider-0.020
 swoop-0.019
Token time
Token ID640
Feature activation-3.91e-02

Pos logit contributions
pe+0.072
ider+0.071
ro+0.067
 dent+0.064
 bleed+0.063
Neg logit contributions
illy-0.046
 favourite-0.045
 favorite-0.045
 new-0.045
 groceries-0.044
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.071
ro+0.070
 swoop+0.064
bench+0.063
Ar+0.061
Neg logit contributions
 favourite-0.057
 favorite-0.055
 new-0.054
illy-0.054
 groceries-0.053
Token,
Token ID11
Feature activation-5.32e-03

Pos logit contributions
ro+0.235
 dent+0.231
pe+0.228
Uh+0.227
Ar+0.225
Neg logit contributions
illy-0.106
 favourite-0.101
 favorite-0.100
's-0.098
 tomatoes-0.097
Token and
Token ID290
Feature activation+8.26e-03

Pos logit contributions
pe+0.010
ro+0.009
bench+0.008
ider+0.008
 swoop+0.008
Neg logit contributions
 favourite-0.008
 favorite-0.008
 biggest-0.008
 groceries-0.008
illy-0.008
Token he
Token ID339
Feature activation+2.42e-03

Pos logit contributions
illy+0.013
 groceries+0.012
 favourite+0.012
 biggest+0.012
 favorite+0.012
Neg logit contributions
pe-0.015
ro-0.015
 swoop-0.013
bench-0.013
ider-0.012
Token was
Token ID373
Feature activation-1.41e-02

Pos logit contributions
illy+0.004
 favourite+0.004
 biggest+0.004
 groceries+0.004
 favorite+0.004
Neg logit contributions
pe-0.004
ro-0.004
bench-0.004
ider-0.003
 swoop-0.003
Token very
Token ID845
Feature activation-6.11e-03

Pos logit contributions
ro+0.028
pe+0.028
bench+0.024
ider+0.024
 swoop+0.023
Neg logit contributions
 favourite-0.019
 biggest-0.019
 favorite-0.019
illy-0.019
 new-0.018
Token her
Token ID607
Feature activation+2.97e-03

Pos logit contributions
pe+0.010
ro+0.010
bench+0.009
 swoop+0.009
 ago+0.009
Neg logit contributions
 biggest-0.005
illy-0.005
 favourite-0.005
 groceries-0.005
 favorite-0.005
Token birthday
Token ID10955
Feature activation-1.07e-02

Pos logit contributions
 favourite+0.004
illy+0.004
 favorite+0.004
 biggest+0.004
 groceries+0.004
Neg logit contributions
pe-0.006
ro-0.006
bench-0.005
 swoop-0.005
 dent-0.005
Token a
Token ID257
Feature activation+9.30e-03

Pos logit contributions
pe+0.019
ro+0.019
bench+0.016
 swoop+0.016
ider+0.016
Neg logit contributions
illy-0.017
 biggest-0.016
 favourite-0.016
 groceries-0.016
 favorite-0.016
Token few
Token ID1178
Feature activation+9.86e-03

Pos logit contributions
 favourite+0.010
 biggest+0.010
 favorite+0.010
illy+0.010
 groceries+0.009
Neg logit contributions
pe-0.021
ro-0.021
bench-0.019
ider-0.019
 swoop-0.019
Token days
Token ID1528
Feature activation-3.90e-02

Pos logit contributions
illy+0.013
 biggest+0.013
 groceries+0.013
 favourite+0.013
 favorite+0.012
Neg logit contributions
pe-0.020
ro-0.019
bench-0.017
ider-0.017
 swoop-0.017
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.081
ro+0.077
 dent+0.075
 swoop+0.072
bench+0.070
Neg logit contributions
illy-0.050
 favourite-0.047
 groceries-0.047
 favorite-0.046
 biggest-0.044
Token.
Token ID13
Feature activation+5.30e-03

Pos logit contributions
 dent+0.242
Uh+0.238
ro+0.232
 swoop+0.229
pe+0.229
Neg logit contributions
illy-0.101
 favourite-0.099
 groceries-0.098
 favorite-0.098
's-0.087
Token She
Token ID1375
Feature activation-2.01e-02

Pos logit contributions
 groceries+0.009
illy+0.009
 biggest+0.009
 favorite+0.008
 favourite+0.008
Neg logit contributions
pe-0.009
ro-0.008
bench-0.007
 ago-0.007
 swoop-0.007
Token liked
Token ID8288
Feature activation+3.68e-03

Pos logit contributions
pe+0.033
ro+0.033
bench+0.028
 swoop+0.028
ider+0.027
Neg logit contributions
illy-0.034
 groceries-0.033
 biggest-0.033
 favourite-0.033
 favorite-0.032
Token to
Token ID284
Feature activation-5.31e-02

Pos logit contributions
illy+0.005
 biggest+0.004
 groceries+0.004
 favourite+0.004
 favorite+0.004
Neg logit contributions
pe-0.008
ro-0.008
bench-0.007
 swoop-0.007
 dent-0.006
Token practice
Token ID3357
Feature activation-2.71e-02

Pos logit contributions
pe+0.100
ro+0.098
bench+0.088
ider+0.088
 dent+0.085
Neg logit contributions
illy-0.076
 favourite-0.076
 favorite-0.074
 biggest-0.074
 groceries-0.073
Token a
Token ID257
Feature activation+1.88e-02

Pos logit contributions
pe+0.004
ro+0.003
ider+0.003
bench+0.003
 dent+0.003
Neg logit contributions
 groceries-0.002
 new-0.002
illy-0.002
 favorite-0.002
 favourite-0.002
Token ship
Token ID4074
Feature activation-9.92e-03

Pos logit contributions
 new+0.023
 groceries+0.023
 biggest+0.023
 favourite+0.023
 favorite+0.022
Neg logit contributions
pe-0.039
ro-0.038
ider-0.036
bench-0.034
 swoop-0.033
Token that
Token ID326
Feature activation+1.14e-02

Pos logit contributions
pe+0.020
ro+0.020
 dent+0.019
bench+0.018
 swoop+0.018
Neg logit contributions
 favorite-0.011
 new-0.011
 groceries-0.011
illy-0.011
 favourite-0.011
Token sank
Token ID30895
Feature activation-1.53e-02

Pos logit contributions
illy+0.019
 groceries+0.018
 biggest+0.018
 favourite+0.018
 favorite+0.017
Neg logit contributions
pe-0.020
ro-0.019
bench-0.017
 swoop-0.016
 dent-0.016
Token long
Token ID890
Feature activation-2.73e-02

Pos logit contributions
pe+0.035
ro+0.034
bench+0.032
ider+0.032
 dent+0.031
Neg logit contributions
 favorite-0.015
 new-0.015
 groceries-0.015
 biggest-0.015
 favourite-0.014
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.054
ro+0.051
ider+0.051
 swoop+0.051
 dent+0.048
Neg logit contributions
 groceries-0.033
 other-0.031
 favorite-0.030
 new-0.030
 tomatoes-0.030
Token.
Token ID13
Feature activation-1.68e-03

Pos logit contributions
Uh+0.265
 swoop+0.254
Neigh+0.251
 dent+0.248
Ar+0.247
Neg logit contributions
 favorite-0.084
 groceries-0.082
's-0.079
 new-0.076
 other-0.075
Token She
Token ID1375
Feature activation-2.27e-02

Pos logit contributions
ro+0.003
pe+0.003
bench+0.002
ider+0.002
 ago+0.002
Neg logit contributions
illy-0.003
 groceries-0.003
 biggest-0.003
 favourite-0.003
 favorite-0.003
Token said
Token ID531
Feature activation+9.49e-03

Pos logit contributions
pe+0.039
ro+0.038
bench+0.034
ider+0.033
 swoop+0.032
Neg logit contributions
illy-0.037
 favourite-0.036
 biggest-0.036
 groceries-0.036
 favorite-0.035
Token they
Token ID484
Feature activation+6.85e-04

Pos logit contributions
 biggest+0.017
illy+0.017
 favourite+0.017
 groceries+0.017
 favorite+0.016
Neg logit contributions
pe-0.014
ro-0.013
 ago-0.012
 hon-0.011
 swoop-0.011
Token could
Token ID714
Feature activation-3.54e-02

Pos logit contributions
illy+0.001
 groceries+0.001
 biggest+0.001
 favourite+0.001
 tomatoes+0.001
Neg logit contributions
pe-0.001
ro-0.001
 fought-0.001
 swoop-0.001
bench-0.001
Token big
Token ID1263
Feature activation-1.52e-02

Pos logit contributions
 biggest+0.014
 favourite+0.014
illy+0.013
 favorite+0.013
 groceries+0.013
Neg logit contributions
pe-0.033
ro-0.032
bench-0.029
ider-0.029
 swoop-0.028
Token farm
Token ID5318
Feature activation-2.40e-02

Pos logit contributions
pe+0.030
ro+0.029
bench+0.027
ider+0.026
 swoop+0.025
Neg logit contributions
 biggest-0.020
 favourite-0.020
 favorite-0.020
 groceries-0.020
illy-0.020
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
pe+0.045
ro+0.045
bench+0.039
 swoop+0.039
ider+0.038
Neg logit contributions
illy-0.035
 favourite-0.034
 biggest-0.034
 groceries-0.033
 favorite-0.033
Token long
Token ID890
Feature activation-3.28e-02

Pos logit contributions
illy+0.017
 biggest+0.017
 favourite+0.017
 groceries+0.016
 favorite+0.016
Neg logit contributions
pe-0.034
ro-0.033
bench-0.030
ider-0.029
 swoop-0.029
Token time
Token ID640
Feature activation-3.57e-02

Pos logit contributions
ider+0.063
pe+0.063
ro+0.061
 bleed+0.060
 dent+0.059
Neg logit contributions
 favourite-0.039
 favorite-0.038
 new-0.037
 groceries-0.036
 spaghetti-0.036
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.053
ro+0.050
bench+0.047
 dent+0.046
 swoop+0.045
Neg logit contributions
 favourite-0.064
 favorite-0.063
illy-0.062
 biggest-0.062
 new-0.061
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
 dent+0.233
Uh+0.231
ro+0.227
pe+0.227
 swoop+0.226
Neg logit contributions
 favorite-0.100
 favourite-0.097
 groceries-0.097
illy-0.096
's-0.091
Token Lily
Token ID20037
Feature activation+1.60e-02

Pos logit contributions
illy+0.024
 biggest+0.023
 groceries+0.023
 favourite+0.023
 favorite+0.022
Neg logit contributions
ro-0.037
pe-0.036
bench-0.032
ider-0.031
 dent-0.031
Token listened
Token ID16399
Feature activation+1.73e-02

Pos logit contributions
illy+0.023
 favourite+0.023
 favorite+0.022
 biggest+0.021
 groceries+0.021
Neg logit contributions
pe-0.030
ro-0.029
bench-0.027
 dent-0.026
ider-0.026
Token as
Token ID355
Feature activation+2.29e-02

Pos logit contributions
illy+0.021
 groceries+0.020
 biggest+0.020
 favourite+0.020
 favorite+0.019
Neg logit contributions
pe-0.037
ro-0.036
bench-0.032
ider-0.032
 swoop-0.032
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
illy+0.031
 biggest+0.030
 groceries+0.030
 favourite+0.030
 favorite+0.029
Neg logit contributions
pe-0.045
ro-0.045
bench-0.040
 swoop-0.039
ider-0.038
Token someone
Token ID2130
Feature activation+6.82e-02

Pos logit contributions
pe+0.068
ro+0.067
bench+0.063
ider+0.061
 dent+0.060
Neg logit contributions
illy-0.038
 biggest-0.036
 favourite-0.035
 groceries-0.035
 favorite-0.035
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
illy+0.081
 favourite+0.075
 biggest+0.074
 groceries+0.073
 favorite+0.071
Neg logit contributions
pe-0.145
ro-0.143
 ago-0.132
 swoop-0.128
bench-0.125
Token very
Token ID845
Feature activation+5.60e-03

Pos logit contributions
 biggest+0.014
illy+0.014
 favourite+0.014
 groceries+0.013
 favorite+0.013
Neg logit contributions
pe-0.022
ro-0.022
bench-0.020
ider-0.019
 swoop-0.019
Token long
Token ID890
Feature activation-4.91e-02

Pos logit contributions
illy+0.006
 groceries+0.005
 biggest+0.005
 favourite+0.005
 spaghetti+0.005
Neg logit contributions
pe-0.013
ro-0.013
 swoop-0.012
ider-0.011
bench-0.011
Token time
Token ID640
Feature activation-3.77e-02

Pos logit contributions
ider+0.098
pe+0.096
 dodging+0.090
ro+0.089
 bleed+0.088
Neg logit contributions
 favourite-0.053
 favorite-0.052
illy-0.052
 new-0.052
 groceries-0.050
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.043
ro+0.041
bench+0.035
 dent+0.034
 swoop+0.033
Neg logit contributions
illy-0.082
 favourite-0.082
 favorite-0.081
 groceries-0.080
 biggest-0.080
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
ro+0.227
pe+0.222
 dent+0.219
Ar+0.217
 swoop+0.214
Neg logit contributions
 groceries-0.112
illy-0.109
 favorite-0.106
 favourite-0.103
 tomatoes-0.100
Token The
Token ID383
Feature activation-1.30e-02

Pos logit contributions
illy+0.016
 groceries+0.016
 biggest+0.016
 favourite+0.016
 favorite+0.015
Neg logit contributions
pe-0.019
ro-0.018
bench-0.016
ider-0.015
 swoop-0.015
Token little
Token ID1310
Feature activation-1.19e-02

Pos logit contributions
ro+0.030
pe+0.030
ider+0.026
bench+0.026
 swoop+0.026
Neg logit contributions
illy-0.014
 groceries-0.013
 favourite-0.013
 biggest-0.012
 favorite-0.012
Token girl
Token ID2576
Feature activation-2.11e-03

Pos logit contributions
pe+0.025
ro+0.024
bench+0.022
 swoop+0.021
ider+0.021
Neg logit contributions
illy-0.015
 biggest-0.015
 favourite-0.015
 groceries-0.015
 favorite-0.015
Token thought
Token ID1807
Feature activation+2.95e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.003
 dent+0.003
ider+0.003
Neg logit contributions
 favourite-0.003
 favorite-0.003
illy-0.003
 biggest-0.003
 groceries-0.003
Token best
Token ID1266
Feature activation+1.25e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.003
ider+0.003
 swoop+0.003
Neg logit contributions
illy-0.003
 groceries-0.003
 biggest-0.003
 favourite-0.003
 favorite-0.002
Token friend
Token ID1545
Feature activation+2.30e-02

Pos logit contributions
 favourite+0.013
illy+0.013
 favorite+0.013
 biggest+0.013
 groceries+0.012
Neg logit contributions
pe-0.029
ro-0.028
bench-0.026
 swoop-0.025
ider-0.025
Token a
Token ID257
Feature activation-4.20e-03

Pos logit contributions
illy+0.035
 favourite+0.035
 biggest+0.034
 groceries+0.034
 favorite+0.034
Neg logit contributions
pe-0.041
ro-0.041
bench-0.036
 swoop-0.035
 dent-0.034
Token long
Token ID890
Feature activation-4.86e-02

Pos logit contributions
pe+0.009
ro+0.009
bench+0.008
ider+0.008
 swoop+0.008
Neg logit contributions
 favourite-0.005
 biggest-0.005
 favorite-0.005
illy-0.005
 groceries-0.004
Token time
Token ID640
Feature activation-3.92e-02

Pos logit contributions
pe+0.093
ider+0.093
 collapse+0.089
 dodging+0.086
bench+0.086
Neg logit contributions
 favorite-0.060
 favourite-0.058
 new-0.054
illy-0.054
 groceries-0.051
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.077
ro+0.075
 swoop+0.069
bench+0.068
 dent+0.068
Neg logit contributions
 favorite-0.053
 favourite-0.053
 biggest-0.050
 new-0.050
 groceries-0.049
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 swoop+0.242
Uh+0.242
 dent+0.239
pe+0.236
 scree+0.233
Neg logit contributions
 favorite-0.096
 groceries-0.089
 favourite-0.087
's-0.084
 biggest-0.081
Token He
Token ID679
Feature activation-5.62e-03

Pos logit contributions
illy+0.017
 groceries+0.016
 biggest+0.016
 favourite+0.016
 favorite+0.016
Neg logit contributions
pe-0.018
ro-0.018
ider-0.015
bench-0.015
 swoop-0.015
Token was
Token ID373
Feature activation-8.06e-03

Pos logit contributions
pe+0.011
ro+0.011
bench+0.009
ider+0.009
 swoop+0.009
Neg logit contributions
illy-0.008
 favourite-0.008
 biggest-0.008
 favorite-0.008
 groceries-0.008
Token my
Token ID616
Feature activation-1.05e-02

Pos logit contributions
pe+0.017
ro+0.016
bench+0.015
 swoop+0.014
ider+0.014
Neg logit contributions
illy-0.010
 groceries-0.010
 favourite-0.010
 biggest-0.010
 favorite-0.010
Token dog
Token ID3290
Feature activation+7.36e-03

Pos logit contributions
ro+0.024
pe+0.024
ider+0.022
 dent+0.021
 ago+0.021
Neg logit contributions
illy-0.011
 groceries-0.010
 tomatoes-0.009
idy-0.009
 biggest-0.009
Token had
Token ID550
Feature activation+1.70e-02

Pos logit contributions
illy+0.002
 favourite+0.002
 biggest+0.002
 groceries+0.002
 favorite+0.002
Neg logit contributions
pe-0.002
ro-0.002
bench-0.002
ider-0.002
 swoop-0.002
Token lived
Token ID5615
Feature activation+6.60e-03

Pos logit contributions
illy+0.028
 biggest+0.028
 favourite+0.028
 favorite+0.027
 groceries+0.027
Neg logit contributions
pe-0.028
ro-0.028
bench-0.024
ider-0.024
 dent-0.023
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
illy+0.009
 favourite+0.009
 biggest+0.009
 favorite+0.009
 groceries+0.009
Neg logit contributions
pe-0.013
ro-0.013
bench-0.012
 dent-0.011
ider-0.011
Token long
Token ID890
Feature activation-3.98e-02

Pos logit contributions
 biggest+0.013
 favourite+0.013
illy+0.013
 favorite+0.013
 groceries+0.012
Neg logit contributions
pe-0.021
ro-0.021
bench-0.019
ider-0.018
 swoop-0.018
Token time
Token ID640
Feature activation-3.71e-02

Pos logit contributions
ider+0.078
 bleed+0.073
bench+0.071
pe+0.071
 collapse+0.071
Neg logit contributions
 favorite-0.051
 favourite-0.049
 new-0.046
 other-0.044
 tomato-0.044
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
ro+0.050
pe+0.049
bench+0.044
 dent+0.044
 swoop+0.041
Neg logit contributions
 favourite-0.070
 favorite-0.069
 new-0.067
 biggest-0.067
 amazing-0.067
Token.
Token ID13
Feature activation+2.20e-02

Pos logit contributions
ro+0.229
 dent+0.229
pe+0.227
Uh+0.225
 swoop+0.222
Neg logit contributions
 favorite-0.104
 favourite-0.103
illy-0.100
 groceries-0.100
's-0.093
Token
Token ID220
Feature activation-1.06e-02

Pos logit contributions
illy+0.032
 biggest+0.031
 groceries+0.031
 favourite+0.031
 favorite+0.030
Neg logit contributions
ro-0.042
pe-0.041
bench-0.037
ider-0.035
 dent-0.035
Token
Token ID198
Feature activation-5.87e-03

Pos logit contributions
pe+0.017
ro+0.017
bench+0.014
 dent+0.014
 swoop+0.014
Neg logit contributions
illy-0.018
 groceries-0.018
 favourite-0.018
 biggest-0.018
 favorite-0.018
Token
Token ID198
Feature activation-5.89e-03

Pos logit contributions
pe+0.010
ro+0.010
bench+0.008
 swoop+0.008
ider+0.008
Neg logit contributions
illy-0.010
 groceries-0.010
 biggest-0.010
 favourite-0.010
 favorite-0.009
TokenL
Token ID43
Feature activation+2.85e-02

Pos logit contributions
pe+0.010
ro+0.010
bench+0.009
 swoop+0.008
 dent+0.008
Neg logit contributions
illy-0.010
 favourite-0.009
 biggest-0.009
 favorite-0.009
 groceries-0.009
Token children
Token ID1751
Feature activation-2.08e-02

Pos logit contributions
pe+0.026
ro+0.025
 swoop+0.022
bench+0.022
 dent+0.021
Neg logit contributions
illy-0.030
 groceries-0.030
 biggest-0.029
 favourite-0.029
 favorite-0.028
Token put
Token ID1234
Feature activation+3.02e-02

Pos logit contributions
pe+0.038
ro+0.038
bench+0.034
 dent+0.032
 swoop+0.032
Neg logit contributions
illy-0.031
 favourite-0.030
 biggest-0.030
 groceries-0.030
 favorite-0.029
Token on
Token ID319
Feature activation-3.67e-03

Pos logit contributions
illy+0.040
 favourite+0.036
 biggest+0.036
 favorite+0.035
 groceries+0.035
Neg logit contributions
pe-0.061
ro-0.060
bench-0.054
 swoop-0.052
 dent-0.050
Token their
Token ID511
Feature activation+2.43e-02

Pos logit contributions
ro+0.007
pe+0.007
bench+0.006
 swoop+0.006
 ago+0.006
Neg logit contributions
illy-0.005
 biggest-0.005
 favourite-0.005
 groceries-0.005
 favorite-0.005
Token bath
Token ID7837
Feature activation-2.31e-02

Pos logit contributions
illy+0.035
 biggest+0.030
idy+0.030
 groceries+0.030
 favourite+0.029
Neg logit contributions
pe-0.049
ro-0.048
 swoop-0.040
bench-0.040
 dent-0.040
Tokenro
Token ID305
Feature activation-1.07e-01

Pos logit contributions
pe+0.041
ro+0.038
 swoop+0.036
bench+0.035
 dent+0.035
Neg logit contributions
 favourite-0.036
illy-0.036
 favorite-0.036
 groceries-0.036
 biggest-0.036
Tokenbes
Token ID12636
Feature activation+1.11e-02

Pos logit contributions
Uh+0.136
 dent+0.128
 collapse+0.123
jar+0.115
 whine+0.111
Neg logit contributions
 favourite-0.201
 favorite-0.200
's-0.195
illy-0.189
 new-0.182
Token and
Token ID290
Feature activation+8.17e-03

Pos logit contributions
illy+0.017
 biggest+0.016
 favourite+0.016
 groceries+0.016
 favorite+0.016
Neg logit contributions
pe-0.020
ro-0.020
bench-0.017
ider-0.017
 swoop-0.017
Token ran
Token ID4966
Feature activation-1.51e-03

Pos logit contributions
illy+0.014
 biggest+0.014
 favourite+0.013
 groceries+0.013
 favorite+0.013
Neg logit contributions
ro-0.013
pe-0.013
 swoop-0.011
bench-0.011
 hon-0.011
Token around
Token ID1088
Feature activation-6.66e-03

Pos logit contributions
ro+0.004
pe+0.003
bench+0.003
 swoop+0.003
ider+0.003
Neg logit contributions
illy-0.002
 biggest-0.002
 groceries-0.001
 favourite-0.001
 favorite-0.001
Token in
Token ID287
Feature activation-1.25e-02

Pos logit contributions
pe+0.015
ro+0.014
ider+0.013
bench+0.013
 swoop+0.013
Neg logit contributions
 favourite-0.007
 favorite-0.007
illy-0.007
 groceries-0.007
 new-0.007
Token queen
Token ID16599
Feature activation-9.22e-03

Pos logit contributions
pe+0.061
ro+0.058
bench+0.053
ider+0.052
 swoop+0.052
Neg logit contributions
illy-0.038
 favourite-0.038
 biggest-0.038
 favorite-0.038
 groceries-0.037
Token from
Token ID422
Feature activation-1.74e-02

Pos logit contributions
ro+0.018
pe+0.018
 dent+0.017
bench+0.017
 swoop+0.016
Neg logit contributions
 favorite-0.011
 favourite-0.011
illy-0.011
 biggest-0.011
 groceries-0.011
Token long
Token ID890
Feature activation-2.07e-02

Pos logit contributions
pe+0.037
ro+0.037
bench+0.033
ider+0.032
 dent+0.032
Neg logit contributions
illy-0.021
 favourite-0.021
 groceries-0.021
 favorite-0.020
 biggest-0.020
Token,
Token ID11
Feature activation-6.52e-03

Pos logit contributions
pe+0.040
ro+0.039
ider+0.038
 dent+0.035
 dodging+0.034
Neg logit contributions
 favourite-0.027
 favorite-0.027
illy-0.027
 groceries-0.027
 new-0.026
Token long
Token ID890
Feature activation-2.87e-02

Pos logit contributions
pe+0.013
ro+0.013
bench+0.012
 swoop+0.011
ider+0.011
Neg logit contributions
illy-0.009
 favourite-0.008
 biggest-0.008
 groceries-0.008
 favorite-0.008
Token ago
Token ID2084
Feature activation-1.07e-01

Pos logit contributions
pe+0.054
ro+0.051
ider+0.050
 dent+0.047
bench+0.046
Neg logit contributions
 favourite-0.038
 new-0.038
 favorite-0.038
 groceries-0.038
illy-0.038
Token.
Token ID13
Feature activation+4.05e-03

Pos logit contributions
Uh+0.250
 dent+0.240
ro+0.239
Ar+0.238
 swoop+0.236
Neg logit contributions
 favorite-0.094
 favourite-0.092
 groceries-0.088
illy-0.088
's-0.085
Token Mia
Token ID32189
Feature activation+1.58e-02

Pos logit contributions
illy+0.007
 groceries+0.006
 favourite+0.006
 biggest+0.006
 favorite+0.006
Neg logit contributions
pe-0.007
ro-0.007
bench-0.006
 swoop-0.006
ider-0.006
Token listened
Token ID16399
Feature activation+1.32e-02

Pos logit contributions
 favourite+0.023
 favorite+0.023
illy+0.022
 biggest+0.021
 amazing+0.021
Neg logit contributions
ro-0.029
pe-0.029
bench-0.027
 dent-0.026
ider-0.026
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
illy+0.015
 groceries+0.014
 biggest+0.014
 favourite+0.014
 favorite+0.013
Neg logit contributions
pe-0.030
ro-0.029
bench-0.026
ider-0.026
 swoop-0.025
Token her
Token ID607
Feature activation-1.02e-02

Pos logit contributions
pe+0.041
ro+0.040
ider+0.035
bench+0.035
 dent+0.034
Neg logit contributions
illy-0.035
 favourite-0.035
 favorite-0.034
 biggest-0.034
 new-0.034
Token farm
Token ID5318
Feature activation-3.84e-02

Pos logit contributions
illy+0.052
 groceries+0.048
 favourite+0.048
 favorite+0.046
 biggest+0.046
Neg logit contributions
pe-0.056
ro-0.056
bench-0.047
 swoop-0.047
 dent-0.047
Token.
Token ID13
Feature activation+2.41e-03

Pos logit contributions
ro+0.078
pe+0.078
 dent+0.074
bench+0.071
 swoop+0.071
Neg logit contributions
 favorite-0.048
 favourite-0.048
 biggest-0.046
 groceries-0.046
 new-0.046
Token They
Token ID1119
Feature activation-6.29e-03

Pos logit contributions
illy+0.004
 favourite+0.003
 biggest+0.003
 groceries+0.003
 favorite+0.003
Neg logit contributions
pe-0.005
ro-0.004
bench-0.004
 swoop-0.004
ider-0.004
Token liked
Token ID8288
Feature activation+5.79e-03

Pos logit contributions
pe+0.012
ro+0.012
ider+0.011
bench+0.011
 dent+0.011
Neg logit contributions
 favourite-0.009
 favorite-0.009
illy-0.008
 biggest-0.008
 groceries-0.008
Token to
Token ID284
Feature activation-3.95e-02

Pos logit contributions
illy+0.008
 biggest+0.007
 favourite+0.007
 groceries+0.007
idy+0.007
Neg logit contributions
pe-0.012
ro-0.012
 swoop-0.011
 hon-0.010
bench-0.010
Token see
Token ID766
Feature activation+5.91e-02

Pos logit contributions
pe+0.069
ro+0.068
bench+0.060
ider+0.059
 dent+0.058
Neg logit contributions
 favourite-0.061
illy-0.061
 biggest-0.061
 groceries-0.061
 favorite-0.061
Token the
Token ID262
Feature activation+1.67e-02

Pos logit contributions
illy+0.126
ete+0.121
olly+0.120
enny+0.112
oll+0.110
Neg logit contributions
 hon-0.069
 spe-0.063
pe-0.062
 swoop-0.061
 landing-0.056
Token animals
Token ID4695
Feature activation+1.92e-02

Pos logit contributions
illy+0.024
ete+0.021
pieces+0.020
oll+0.020
ingu+0.020
Neg logit contributions
pe-0.032
ro-0.031
 swoop-0.029
 hon-0.028
 spe-0.027
Token and
Token ID290
Feature activation+1.89e-02

Pos logit contributions
illy+0.029
 favourite+0.028
 biggest+0.028
 groceries+0.028
 favorite+0.027
Neg logit contributions
pe-0.035
ro-0.035
bench-0.030
 swoop-0.030
ider-0.030
Token hear
Token ID3285
Feature activation+5.26e-02

Pos logit contributions
illy+0.032
 favourite+0.028
 biggest+0.028
 groceries+0.028
idy+0.028
Neg logit contributions
pe-0.033
ro-0.032
 swoop-0.029
 hon-0.028
 dodging-0.027
Token their
Token ID511
Feature activation+4.63e-03

Pos logit contributions
illy+0.096
ippi+0.084
 groceries+0.083
 favourite+0.083
 biggest+0.082
Neg logit contributions
 hon-0.077
pe-0.077
ro-0.074
 crash-0.069
 swoop-0.069
Token
Token ID220
Feature activation-1.25e-02

Pos logit contributions
illy+0.020
 groceries+0.020
 biggest+0.020
 favourite+0.019
 favorite+0.019
Neg logit contributions
ro-0.021
pe-0.021
bench-0.018
 ago-0.017
ider-0.017
Token
Token ID198
Feature activation-1.38e-02

Pos logit contributions
pe+0.019
ro+0.019
bench+0.016
ider+0.015
 dent+0.015
Neg logit contributions
illy-0.023
 groceries-0.022
 biggest-0.022
 favourite-0.022
 favorite-0.021
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
pe+0.023
ro+0.022
bench+0.019
 swoop+0.018
ider+0.017
Neg logit contributions
illy-0.024
 groceries-0.023
 biggest-0.023
 favourite-0.023
 favorite-0.023
TokenShe
Token ID3347
Feature activation-1.47e-02

Pos logit contributions
pe+0.020
ro+0.020
bench+0.018
 dent+0.018
ider+0.017
Neg logit contributions
illy-0.016
 favourite-0.016
 biggest-0.016
 groceries-0.015
 favorite-0.015
Token brought
Token ID3181
Feature activation+5.20e-02

Pos logit contributions
pe+0.024
ro+0.023
bench+0.020
 swoop+0.020
ider+0.019
Neg logit contributions
illy-0.025
 biggest-0.025
 groceries-0.025
 favourite-0.024
 favorite-0.024
Token it
Token ID340
Feature activation+3.72e-02

Pos logit contributions
affles+0.111
illy+0.109
idy+0.102
ingu+0.100
oll+0.099
Neg logit contributions
 breath-0.054
ro-0.048
 swoop-0.046
 crash-0.040
pe-0.039
Token home
Token ID1363
Feature activation+3.26e-03

Pos logit contributions
illy+0.053
pieces+0.048
affles+0.048
idy+0.047
 biggest+0.046
Neg logit contributions
ro-0.071
pe-0.067
 swoop-0.061
 seek-0.059
 breath-0.058
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
illy+0.004
 biggest+0.004
 groceries+0.004
 favourite+0.004
 favorite+0.004
Neg logit contributions
pe-0.006
ro-0.006
bench-0.006
ider-0.005
 swoop-0.005
Token put
Token ID1234
Feature activation+2.05e-02

Pos logit contributions
illy+0.016
 biggest+0.016
 favourite+0.015
 groceries+0.015
 favorite+0.015
Neg logit contributions
ro-0.014
pe-0.013
 swoop-0.012
bench-0.012
 charged-0.011
Token the
Token ID262
Feature activation+4.47e-03

Pos logit contributions
illy+0.032
 biggest+0.030
 favourite+0.030
 favorite+0.029
idy+0.029
Neg logit contributions
pe-0.036
ro-0.035
bench-0.031
 swoop-0.030
 dent-0.029
Token microphone
Token ID21822
Feature activation+1.70e-02

Pos logit contributions
illy+0.006
 groceries+0.006
 favourite+0.006
 biggest+0.006
 favorite+0.006
Neg logit contributions
pe-0.009
ro-0.008
bench-0.008
ider-0.007
 dent-0.007
Token wall
Token ID3355
Feature activation-3.18e-02

Pos logit contributions
illy+0.006
 favourite+0.006
 groceries+0.006
 favorite+0.006
 biggest+0.006
Neg logit contributions
ro-0.006
pe-0.006
 dent-0.005
bench-0.005
 swoop-0.005
Token and
Token ID290
Feature activation-2.29e-03

Pos logit contributions
pe+0.066
ro+0.064
bench+0.058
 dent+0.058
 swoop+0.058
Neg logit contributions
 favourite-0.039
 favorite-0.038
illy-0.038
 biggest-0.038
 new-0.036
Token started
Token ID2067
Feature activation-2.99e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.003
ider+0.003
 dent+0.003
Neg logit contributions
 favourite-0.004
illy-0.004
 favorite-0.004
 biggest-0.004
 groceries-0.004
Token to
Token ID284
Feature activation-5.84e-02

Pos logit contributions
pe+0.077
ro+0.076
ider+0.072
bench+0.069
 dent+0.068
Neg logit contributions
 favourite-0.024
 new-0.024
 groceries-0.023
 favorite-0.023
 biggest-0.022
Token pour
Token ID12797
Feature activation+2.56e-02

Pos logit contributions
pe+0.115
ider+0.115
ro+0.113
bench+0.107
 ago+0.100
Neg logit contributions
 favourite-0.077
 vegetables-0.073
 favorite-0.073
 biggest-0.073
 delicious-0.072
Token himself
Token ID2241
Feature activation+5.73e-02

Pos logit contributions
illy+0.034
 favourite+0.033
 biggest+0.032
 favorite+0.032
 groceries+0.031
Neg logit contributions
ro-0.051
pe-0.051
bench-0.045
 swoop-0.044
 dent-0.044
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
illy+0.068
 groceries+0.062
 biggest+0.062
 favourite+0.061
 favorite+0.059
Neg logit contributions
pe-0.126
ro-0.124
bench-0.110
 swoop-0.110
ider-0.108
Token cup
Token ID6508
Feature activation+1.59e-02

Pos logit contributions
illy+0.042
 groceries+0.040
 favourite+0.040
 biggest+0.040
 favorite+0.039
Neg logit contributions
pe-0.066
ro-0.065
bench-0.058
 swoop-0.057
 dent-0.057
Token of
Token ID286
Feature activation-2.78e-03

Pos logit contributions
illy+0.017
 favourite+0.016
 biggest+0.016
 groceries+0.016
 favorite+0.016
Neg logit contributions
pe-0.036
ro-0.036
bench-0.032
ider-0.032
 swoop-0.031
Token tea
Token ID8887
Feature activation-7.21e-03

Pos logit contributions
ro+0.006
pe+0.006
bench+0.005
 swoop+0.005
 dent+0.005
Neg logit contributions
illy-0.004
 biggest-0.003
 favourite-0.003
 favorite-0.003
 groceries-0.003
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
pe+0.013
ro+0.013
bench+0.012
 swoop+0.011
 dent+0.011
Neg logit contributions
illy-0.011
 favourite-0.010
 groceries-0.010
 biggest-0.010
 favorite-0.010
Token but
Token ID475
Feature activation+4.00e-03

Pos logit contributions
pe+0.003
ro+0.003
bench+0.003
ider+0.003
 dent+0.003
Neg logit contributions
 favorite-0.002
illy-0.002
 favourite-0.002
 biggest-0.002
 groceries-0.002
Token her
Token ID607
Feature activation-1.21e-02

Pos logit contributions
 biggest+0.007
 groceries+0.006
illy+0.006
 favourite+0.006
pieces+0.006
Neg logit contributions
pe-0.006
ro-0.006
 swoop-0.006
 wob-0.005
bench-0.005
Token mom
Token ID1995
Feature activation-3.35e-03

Pos logit contributions
ro+0.027
pe+0.026
 dent+0.024
bench+0.024
ider+0.023
Neg logit contributions
illy-0.014
pieces-0.012
 groceries-0.012
 biggest-0.012
idy-0.012
Token promised
Token ID8072
Feature activation+1.63e-02

Pos logit contributions
pe+0.006
ro+0.006
bench+0.005
 swoop+0.005
ider+0.005
Neg logit contributions
illy-0.005
 biggest-0.005
 groceries-0.005
 favourite-0.005
 favorite-0.005
Token to
Token ID284
Feature activation-7.00e-02

Pos logit contributions
illy+0.017
 biggest+0.017
 favourite+0.017
 groceries+0.016
 favorite+0.016
Neg logit contributions
pe-0.038
ro-0.037
bench-0.033
 swoop-0.033
ider-0.032
Token buy
Token ID2822
Feature activation+3.66e-02

Pos logit contributions
ro+0.138
pe+0.137
ider+0.133
bench+0.131
 dodging+0.130
Neg logit contributions
illy-0.092
 groceries-0.083
 new-0.080
 favourite-0.080
 favorite-0.079
Token her
Token ID607
Feature activation+2.25e-02

Pos logit contributions
illy+0.060
ovable+0.060
idy+0.059
ete+0.059
ippi+0.058
Neg logit contributions
pe-0.041
ro-0.039
 hon-0.038
 spe-0.035
 swoop-0.032
Token some
Token ID617
Feature activation+4.31e-02

Pos logit contributions
illy+0.035
idy+0.030
pieces+0.030
elling+0.029
ete+0.028
Neg logit contributions
ro-0.040
pe-0.039
bench-0.033
 hon-0.032
 breath-0.032
Token candy
Token ID18550
Feature activation+9.70e-03

Pos logit contributions
illy+0.056
pieces+0.053
ippi+0.052
ete+0.047
 biggest+0.047
Neg logit contributions
ro-0.087
pe-0.080
bench-0.071
 dent-0.070
ider-0.070
Token that
Token ID326
Feature activation+2.16e-03

Pos logit contributions
illy+0.016
 biggest+0.015
 favourite+0.015
 groceries+0.015
 favorite+0.015
Neg logit contributions
pe-0.017
ro-0.017
bench-0.014
ider-0.014
 swoop-0.013
Token was
Token ID373
Feature activation-1.64e-02

Pos logit contributions
illy+0.003
 biggest+0.003
 groceries+0.003
 favourite+0.003
 favorite+0.003
Neg logit contributions
pe-0.004
ro-0.004
bench-0.003
 swoop-0.003
 dent-0.003
Token comfortable
Token ID6792
Feature activation-2.23e-02

Pos logit contributions
illy+0.009
 biggest+0.009
 favourite+0.009
 favorite+0.008
 groceries+0.008
Neg logit contributions
pe-0.013
ro-0.013
bench-0.012
 swoop-0.011
ider-0.011
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
pe+0.045
ro+0.044
 swoop+0.040
bench+0.040
 dent+0.039
Neg logit contributions
 favourite-0.029
 groceries-0.029
 favorite-0.029
 biggest-0.029
illy-0.029
Token He
Token ID679
Feature activation-7.60e-03

Pos logit contributions
illy+0.030
 groceries+0.029
 biggest+0.029
 favourite+0.029
 favorite+0.028
Neg logit contributions
ro-0.028
pe-0.028
bench-0.024
 swoop-0.023
 dent-0.022
Token remembered
Token ID12086
Feature activation+1.34e-02

Pos logit contributions
pe+0.012
ro+0.011
 swoop+0.010
bench+0.009
 wob+0.009
Neg logit contributions
illy-0.014
 groceries-0.014
 biggest-0.014
 favourite-0.013
 favorite-0.013
Token his
Token ID465
Feature activation-6.51e-03

Pos logit contributions
 biggest+0.018
illy+0.018
 favourite+0.017
 groceries+0.017
pieces+0.017
Neg logit contributions
pe-0.026
ro-0.025
 swoop-0.023
 ago-0.022
 hon-0.022
Token friend
Token ID1545
Feature activation+3.51e-02

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
 swoop+0.012
ider+0.012
Neg logit contributions
illy-0.008
 favourite-0.008
 biggest-0.008
 groceries-0.007
 favorite-0.007
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
 biggest+0.060
 groceries+0.059
 favourite+0.057
illy+0.056
elling+0.056
Neg logit contributions
pe-0.053
ro-0.053
 swoop-0.046
 ago-0.044
 charged-0.043
Token fox
Token ID21831
Feature activation+1.61e-02

Pos logit contributions
pe+0.031
ro+0.030
 swoop+0.027
bench+0.026
 dent+0.026
Neg logit contributions
illy-0.021
 groceries-0.020
 favourite-0.019
 favorite-0.019
 biggest-0.019
Token had
Token ID550
Feature activation+8.47e-03

Pos logit contributions
 biggest+0.025
 groceries+0.024
illy+0.024
 favourite+0.023
 favorite+0.023
Neg logit contributions
pe-0.029
ro-0.028
 swoop-0.025
 ago-0.024
bench-0.024
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
illy+0.013
 groceries+0.013
 biggest+0.013
 favourite+0.013
 favorite+0.012
Neg logit contributions
pe-0.015
ro-0.015
bench-0.013
 swoop-0.013
ider-0.013
Token belt
Token ID10999
Feature activation-4.01e-03

Pos logit contributions
illy+0.015
 groceries+0.014
 biggest+0.014
 favourite+0.014
 favorite+0.013
Neg logit contributions
pe-0.030
ro-0.029
bench-0.027
 swoop-0.026
 dent-0.026
Token smiled
Token ID13541
Feature activation-2.23e-02

Pos logit contributions
pe+0.002
ro+0.002
bench+0.002
ider+0.002
 swoop+0.002
Neg logit contributions
illy-0.002
 favourite-0.002
 biggest-0.002
 groceries-0.002
 favorite-0.002
Token and
Token ID290
Feature activation-1.77e-02

Pos logit contributions
pe+0.044
ro+0.043
bench+0.038
 swoop+0.037
ider+0.037
Neg logit contributions
illy-0.031
 favourite-0.030
 biggest-0.030
 groceries-0.030
 favorite-0.030
Token said
Token ID531
Feature activation-8.61e-03

Pos logit contributions
pe+0.024
ro+0.023
 swoop+0.022
bench+0.020
 hon+0.020
Neg logit contributions
 biggest-0.034
 favourite-0.033
 groceries-0.033
illy-0.032
 favorite-0.032
Token,
Token ID11
Feature activation-2.27e-02

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
ider+0.011
 swoop+0.011
Neg logit contributions
illy-0.015
 biggest-0.015
 favourite-0.015
 groceries-0.015
 favorite-0.014
Token "
Token ID366
Feature activation-2.03e-02

Pos logit contributions
ro+0.054
pe+0.051
 dent+0.049
bench+0.049
 swoop+0.047
Neg logit contributions
 new-0.022
 favorite-0.021
 favourite-0.020
illy-0.020
 biggest-0.020
TokenI
Token ID40
Feature activation-1.96e-02

Pos logit contributions
pe+0.032
ro+0.031
bench+0.027
 swoop+0.026
ider+0.026
Neg logit contributions
illy-0.036
 biggest-0.035
 favourite-0.035
 groceries-0.035
 favorite-0.035
Token know
Token ID760
Feature activation+1.31e-02

Pos logit contributions
pe+0.034
ro+0.033
bench+0.029
ider+0.028
 swoop+0.028
Neg logit contributions
illy-0.032
 favourite-0.031
 favorite-0.031
 biggest-0.031
 groceries-0.031
Token cleaning
Token ID12724
Feature activation-1.69e-02

Pos logit contributions
illy+0.021
 favourite+0.021
 biggest+0.021
 favorite+0.020
 groceries+0.020
Neg logit contributions
pe-0.021
ro-0.020
 ago-0.018
 swoop-0.018
ider-0.018
Token isn
Token ID2125
Feature activation-1.04e-03

Pos logit contributions
ro+0.032
pe+0.031
 dent+0.027
bench+0.027
 wob+0.025
Neg logit contributions
illy-0.023
pieces-0.022
 favourite-0.021
 biggest-0.021
 favorite-0.020
Token't
Token ID470
Feature activation+5.71e-03

Pos logit contributions
pe+0.001
ro+0.001
bench+0.001
 swoop+0.001
ider+0.001
Neg logit contributions
illy-0.002
 biggest-0.002
 favourite-0.002
 groceries-0.002
 favorite-0.002
Token fun
Token ID1257
Feature activation-2.54e-03

Pos logit contributions
 groceries+0.008
illy+0.008
 biggest+0.008
 favorite+0.008
 new+0.008
Neg logit contributions
ro-0.011
pe-0.011
ider-0.010
 swoop-0.009
bench-0.009
Token wandered
Token ID36036
Feature activation-3.00e-03

Pos logit contributions
pe+0.005
ro+0.005
bench+0.004
 swoop+0.004
ider+0.004
Neg logit contributions
illy-0.005
 favourite-0.005
 biggest-0.005
 groceries-0.005
 favorite-0.005
Token around
Token ID1088
Feature activation+2.76e-03

Pos logit contributions
ro+0.007
pe+0.007
 swoop+0.006
bench+0.006
ider+0.006
Neg logit contributions
illy-0.003
 groceries-0.003
 biggest-0.003
 favourite-0.003
 favorite-0.003
Token with
Token ID351
Feature activation+3.82e-02

Pos logit contributions
 favourite+0.003
 favorite+0.003
 new+0.003
illy+0.003
 biggest+0.003
Neg logit contributions
pe-0.006
ro-0.006
ider-0.006
bench-0.006
 dent-0.006
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
illy+0.060
 groceries+0.055
idy+0.054
elling+0.053
 biggest+0.052
Neg logit contributions
pe-0.064
ro-0.064
 swoop-0.059
 breath-0.057
bench-0.057
Token kitten
Token ID42143
Feature activation+1.33e-02

Pos logit contributions
illy+0.017
 groceries+0.015
 favourite+0.014
 favorite+0.014
idy+0.014
Neg logit contributions
pe-0.035
ro-0.034
 swoop-0.030
bench-0.030
 dent-0.030
Token,
Token ID11
Feature activation+1.20e-03

Pos logit contributions
illy+0.021
 groceries+0.021
 biggest+0.020
 favourite+0.019
 favorite+0.019
Neg logit contributions
pe-0.023
ro-0.023
 swoop-0.020
 ago-0.019
bench-0.019
Token showing
Token ID4478
Feature activation+4.76e-02

Pos logit contributions
illy+0.002
 favourite+0.002
 favorite+0.002
 biggest+0.002
 groceries+0.002
Neg logit contributions
pe-0.002
ro-0.002
bench-0.002
ider-0.002
 dent-0.002
Token it
Token ID340
Feature activation+4.32e-02

Pos logit contributions
 groceries+0.077
illy+0.077
affles+0.075
 tomatoes+0.070
 spaghetti+0.070
Neg logit contributions
pe-0.077
ro-0.074
 swoop-0.069
 hon-0.067
 crash-0.064
Token all
Token ID477
Feature activation+3.34e-02

Pos logit contributions
illy+0.065
 groceries+0.058
ingu+0.053
ummy+0.053
affles+0.052
Neg logit contributions
pe-0.082
ro-0.080
 swoop-0.075
 crash-0.069
 hon-0.069
Token of
Token ID286
Feature activation+1.25e-02

Pos logit contributions
illy+0.039
 favourite+0.037
 biggest+0.037
 groceries+0.037
 favorite+0.036
Neg logit contributions
pe-0.073
ro-0.073
ider-0.066
bench-0.065
 swoop-0.063
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
illy+0.018
 biggest+0.018
 favourite+0.018
 groceries+0.018
 favorite+0.018
Neg logit contributions
pe-0.023
ro-0.023
bench-0.020
 swoop-0.019
 dent-0.019
TokenO
Token ID46
Feature activation-3.60e-03

Pos logit contributions
pe+0.016
ro+0.016
bench+0.015
ider+0.014
 ago+0.014
Neg logit contributions
illy-0.014
 groceries-0.014
 biggest-0.013
 favourite-0.013
 favorite-0.013
Tokenw
Token ID86
Feature activation+7.41e-03

Pos logit contributions
ro+0.007
pe+0.007
bench+0.006
 dodging+0.006
 dent+0.006
Neg logit contributions
illy-0.005
idy-0.004
ummy-0.004
ip-0.004
erry-0.004
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
illy+0.012
 favourite+0.012
 groceries+0.012
 biggest+0.012
 favorite+0.012
Neg logit contributions
pe-0.012
ro-0.012
bench-0.010
 swoop-0.010
ider-0.010
Token ow
Token ID12334
Feature activation+1.07e-02

Pos logit contributions
pe+0.024
ro+0.023
 swoop+0.021
 dent+0.020
bench+0.020
Neg logit contributions
illy-0.017
 favourite-0.017
 groceries-0.017
 biggest-0.017
 favorite-0.016
Token,
Token ID11
Feature activation-9.64e-03

Pos logit contributions
illy+0.015
 biggest+0.014
 groceries+0.014
 favorite+0.014
 favourite+0.014
Neg logit contributions
pe-0.021
ro-0.021
bench-0.019
 dent-0.018
ider-0.017
Token ow
Token ID12334
Feature activation+1.10e-02

Pos logit contributions
pe+0.019
ro+0.018
 swoop+0.016
 dent+0.016
bench+0.016
Neg logit contributions
illy-0.013
 favourite-0.013
 groceries-0.013
 biggest-0.013
 favorite-0.013
Token!"
Token ID2474
Feature activation-6.25e-03

Pos logit contributions
illy+0.017
 biggest+0.016
 groceries+0.016
 favorite+0.015
 favourite+0.015
Neg logit contributions
pe-0.020
ro-0.020
bench-0.018
 dent-0.017
ider-0.017
Token Sara
Token ID24799
Feature activation-1.44e-02

Pos logit contributions
pe+0.012
ro+0.012
bench+0.011
 swoop+0.011
ider+0.010
Neg logit contributions
illy-0.009
 favourite-0.008
 favorite-0.008
 groceries-0.008
 biggest-0.008
Token cries
Token ID24691
Feature activation-4.13e-02

Pos logit contributions
pe+0.030
ro+0.029
bench+0.027
ider+0.026
 ago+0.025
Neg logit contributions
illy-0.018
 biggest-0.017
 favorite-0.017
 favourite-0.017
 groceries-0.016
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
pe+0.066
ro+0.065
 swoop+0.055
bench+0.054
ider+0.054
Neg logit contributions
illy-0.072
 favourite-0.071
 biggest-0.071
 groceries-0.071
 favorite-0.069
Token "
Token ID366
Feature activation-2.86e-02

Pos logit contributions
pe+0.033
ro+0.032
bench+0.030
 swoop+0.029
 dent+0.029
Neg logit contributions
illy-0.018
 biggest-0.018
 favorite-0.018
 favourite-0.018
 new-0.017
Token:
Token ID25
Feature activation-1.09e-02

Pos logit contributions
pe+0.002
ro+0.002
bench+0.002
ider+0.002
 swoop+0.002
Neg logit contributions
illy-0.001
 biggest-0.001
 favourite-0.001
 groceries-0.001
 favorite-0.001
Token never
Token ID1239
Feature activation-1.00e-02

Pos logit contributions
pe+0.021
ro+0.021
bench+0.018
 swoop+0.018
ider+0.018
Neg logit contributions
illy-0.015
 favourite-0.015
 biggest-0.015
 groceries-0.015
 favorite-0.015
Token give
Token ID1577
Feature activation+3.41e-02

Pos logit contributions
pe+0.019
ro+0.018
bench+0.016
ider+0.016
 dent+0.015
Neg logit contributions
 biggest-0.015
illy-0.015
 favorite-0.015
 new-0.015
 favourite-0.015
Token up
Token ID510
Feature activation-2.18e-03

Pos logit contributions
illy+0.047
ingu+0.043
affles+0.042
 favourite+0.041
pieces+0.041
Neg logit contributions
pe-0.060
ro-0.058
 swoop-0.055
 breath-0.049
 wob-0.049
Token and
Token ID290
Feature activation-9.92e-03

Pos logit contributions
pe+0.004
ro+0.004
bench+0.003
ider+0.003
 swoop+0.003
Neg logit contributions
illy-0.003
 biggest-0.003
 favourite-0.003
 groceries-0.003
 favorite-0.003
Token encourage
Token ID7898
Feature activation+1.50e-02

Pos logit contributions
pe+0.020
ro+0.019
bench+0.017
ider+0.017
 dent+0.017
Neg logit contributions
illy-0.014
 biggest-0.013
 favourite-0.013
 groceries-0.013
 favorite-0.013
Token yourself
Token ID3511
Feature activation+3.08e-02

Pos logit contributions
illy+0.018
 biggest+0.018
 favourite+0.018
 groceries+0.018
 favorite+0.018
Neg logit contributions
pe-0.032
ro-0.031
bench-0.027
 swoop-0.026
 ago-0.026
Token to
Token ID284
Feature activation-5.98e-02

Pos logit contributions
illy+0.043
 groceries+0.042
 biggest+0.042
 favourite+0.042
 favorite+0.041
Neg logit contributions
pe-0.060
ro-0.059
bench-0.052
 swoop-0.050
ider-0.050
Token do
Token ID466
Feature activation-2.85e-03

Pos logit contributions
ro+0.109
pe+0.107
ider+0.100
bench+0.099
 dent+0.093
Neg logit contributions
illy-0.089
 biggest-0.085
 favourite-0.085
 new-0.083
 amazing-0.083
Token great
Token ID1049
Feature activation+1.03e-02

Pos logit contributions
pe+0.006
ro+0.006
bench+0.005
 swoop+0.005
 dent+0.005
Neg logit contributions
illy-0.003
 groceries-0.003
 favourite-0.003
 biggest-0.003
 favorite-0.003
Token things
Token ID1243
Feature activation+3.36e-02

Pos logit contributions
illy+0.010
 favourite+0.009
 biggest+0.009
 groceries+0.009
 favorite+0.009
Neg logit contributions
pe-0.024
ro-0.024
bench-0.022
 swoop-0.021
 dent-0.021
Token the
Token ID262
Feature activation+8.14e-03

Pos logit contributions
ro+0.072
pe+0.071
bench+0.065
ider+0.064
 swoop+0.062
Neg logit contributions
 favourite-0.033
illy-0.033
 biggest-0.033
 favorite-0.032
 groceries-0.032
Token sun
Token ID4252
Feature activation-2.76e-02

Pos logit contributions
illy+0.011
 groceries+0.010
 favourite+0.010
 biggest+0.010
 favorite+0.010
Neg logit contributions
pe-0.016
ro-0.015
 swoop-0.014
bench-0.014
 dent-0.014
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
pe+0.052
ro+0.052
bench+0.047
ider+0.046
 dent+0.046
Neg logit contributions
illy-0.039
 favourite-0.038
 biggest-0.038
 favorite-0.038
 groceries-0.037
Token<|endoftext|>
Token ID50256
Feature activation-5.84e-02

Pos logit contributions
illy+0.028
 groceries+0.028
 biggest+0.027
 favourite+0.027
 favorite+0.026
Neg logit contributions
ro-0.030
pe-0.030
bench-0.026
 ago-0.025
 swoop-0.025
TokenOnce
Token ID7454
Feature activation+2.31e-02

Pos logit contributions
pe+0.111
ro+0.108
bench+0.096
 swoop+0.096
ider+0.096
Neg logit contributions
illy-0.084
 biggest-0.081
 favourite-0.080
 groceries-0.080
 favorite-0.078
Token upon
Token ID2402
Feature activation+1.05e-02

Pos logit contributions
 groceries+0.041
illy+0.041
 biggest+0.040
 favourite+0.039
 favorite+0.038
Neg logit contributions
pe-0.036
ro-0.034
bench-0.030
 ago-0.029
 swoop-0.029
Token a
Token ID257
Feature activation+7.94e-03

Pos logit contributions
illy+0.012
 groceries+0.012
 biggest+0.011
 favourite+0.011
 favorite+0.011
Neg logit contributions
pe-0.023
ro-0.022
bench-0.020
 swoop-0.020
ider-0.020
Token time
Token ID640
Feature activation-1.85e-02

Pos logit contributions
 favourite+0.012
illy+0.012
 favorite+0.012
 biggest+0.011
 new+0.011
Neg logit contributions
pe-0.015
ro-0.015
bench-0.013
 dent-0.013
ider-0.013
Token,
Token ID11
Feature activation+1.82e-02

Pos logit contributions
ro+0.037
pe+0.037
bench+0.034
 dent+0.032
 swoop+0.032
Neg logit contributions
illy-0.025
 favourite-0.025
 favorite-0.024
 new-0.023
 biggest-0.023
Token there
Token ID612
Feature activation-3.60e-02

Pos logit contributions
 favourite+0.025
illy+0.024
 favorite+0.024
 biggest+0.024
 groceries+0.023
Neg logit contributions
pe-0.036
ro-0.036
bench-0.032
 dent-0.031
ider-0.031
Token was
Token ID373
Feature activation+4.13e-02

Pos logit contributions
pe+0.067
ro+0.066
bench+0.059
ider+0.058
 swoop+0.057
Neg logit contributions
illy-0.053
 favourite-0.052
 biggest-0.052
 favorite-0.051
 groceries-0.051
Token She
Token ID1375
Feature activation-1.43e-02

Pos logit contributions
illy+0.013
 biggest+0.012
 groceries+0.012
 favourite+0.012
 favorite+0.012
Neg logit contributions
ro-0.014
pe-0.014
bench-0.012
ider-0.012
 dent-0.011
Token started
Token ID2067
Feature activation-2.70e-02

Pos logit contributions
pe+0.023
ro+0.022
bench+0.019
 swoop+0.019
ider+0.019
Neg logit contributions
illy-0.025
 groceries-0.025
 biggest-0.024
 favourite-0.024
 favorite-0.024
Token to
Token ID284
Feature activation-6.04e-02

Pos logit contributions
pe+0.073
ro+0.069
ider+0.066
bench+0.064
 dent+0.063
Neg logit contributions
 favorite-0.020
 favourite-0.019
 new-0.018
illy-0.018
 biggest-0.018
Token draw
Token ID3197
Feature activation+2.84e-02

Pos logit contributions
pe+0.149
ider+0.138
ro+0.137
 dodging+0.127
bench+0.125
Neg logit contributions
 favorite-0.059
 biggest-0.057
 favourite-0.056
 tomatoes-0.054
 new-0.054
Token a
Token ID257
Feature activation+2.78e-02

Pos logit contributions
illy+0.044
 groceries+0.041
 favourite+0.039
 biggest+0.038
 favorite+0.037
Neg logit contributions
pe-0.052
ro-0.052
bench-0.045
 swoop-0.045
 dent-0.044
Token sky
Token ID6766
Feature activation-3.89e-02

Pos logit contributions
illy+0.039
 groceries+0.037
 favourite+0.036
 biggest+0.035
 favorite+0.034
Neg logit contributions
pe-0.054
ro-0.054
 swoop-0.048
 dent-0.047
bench-0.047
Token.
Token ID13
Feature activation+6.53e-03

Pos logit contributions
pe+0.083
ro+0.078
bench+0.076
 dent+0.073
 charged+0.072
Neg logit contributions
 favorite-0.048
 favourite-0.047
 biggest-0.047
 new-0.046
 amazing-0.044
Token But
Token ID887
Feature activation-7.87e-03

Pos logit contributions
illy+0.011
 groceries+0.010
 favourite+0.010
 biggest+0.010
 favorite+0.010
Neg logit contributions
ro-0.012
pe-0.011
bench-0.010
ider-0.009
 ago-0.009
Token then
Token ID788
Feature activation-2.42e-02

Pos logit contributions
pe+0.016
ro+0.016
bench+0.014
 swoop+0.014
ider+0.014
Neg logit contributions
illy-0.010
 biggest-0.010
 groceries-0.010
 favourite-0.010
 favorite-0.009
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
pe+0.037
ro+0.036
 swoop+0.032
 dent+0.030
bench+0.030
Neg logit contributions
illy-0.043
 biggest-0.042
 groceries-0.042
 favourite-0.042
 favorite-0.041
Token she
Token ID673
Feature activation-7.17e-04

Pos logit contributions
illy+0.018
 biggest+0.017
 favourite+0.017
 groceries+0.017
 favorite+0.016
Neg logit contributions
ro-0.022
pe-0.022
 swoop-0.020
bench-0.019
ider-0.019
Token not
Token ID407
Feature activation-1.45e-02

Pos logit contributions
ro+0.031
pe+0.031
bench+0.029
fly+0.027
ider+0.027
Neg logit contributions
 new-0.026
 favorite-0.025
 amazing-0.025
 favourite-0.025
 lots-0.024
Token know
Token ID760
Feature activation+2.96e-02

Pos logit contributions
pe+0.031
ider+0.029
ro+0.029
stack+0.028
bench+0.027
Neg logit contributions
illy-0.016
 amazing-0.016
 favorite-0.016
 even-0.015
 guys-0.015
Token that
Token ID326
Feature activation-8.28e-03

Pos logit contributions
illy+0.040
 favourite+0.038
 groceries+0.037
 biggest+0.037
 favorite+0.036
Neg logit contributions
pe-0.060
ro-0.056
 swoop-0.051
 dent-0.050
 hon-0.050
Token the
Token ID262
Feature activation-3.25e-02

Pos logit contributions
pe+0.017
ro+0.016
bench+0.015
 swoop+0.014
 dent+0.014
Neg logit contributions
illy-0.011
 favourite-0.011
 biggest-0.011
 groceries-0.010
 favorite-0.010
Token stick
Token ID4859
Feature activation-1.02e-03

Pos logit contributions
ro+0.047
pe+0.046
ider+0.041
bench+0.040
h+0.038
Neg logit contributions
 biggest-0.064
 favorite-0.061
 new-0.061
 favourite-0.060
 groceries-0.060
Token would
Token ID561
Feature activation-4.43e-02

Pos logit contributions
pe+0.002
ro+0.002
bench+0.002
ider+0.002
 dent+0.002
Neg logit contributions
 favorite-0.001
 favourite-0.001
 biggest-0.001
illy-0.001
 groceries-0.001
Token break
Token ID2270
Feature activation-3.54e-03

Pos logit contributions
ider+0.082
ro+0.078
pe+0.077
bench+0.074
ighed+0.072
Neg logit contributions
 amazing-0.064
 new-0.063
 other-0.061
 special-0.060
 delicious-0.059
Token soon
Token ID2582
Feature activation+2.06e-03

Pos logit contributions
ro+0.006
pe+0.006
 dent+0.005
bench+0.005
 swoop+0.005
Neg logit contributions
illy-0.006
 favourite-0.005
 biggest-0.005
 groceries-0.005
 favorite-0.005
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 groceries+0.002
 new+0.002
 favorite+0.002
illy+0.002
 biggest+0.002
Neg logit contributions
pe-0.004
ro-0.004
ider-0.004
Ar-0.004
bench-0.004
Token
Token ID198
Feature activation-2.17e-04

Pos logit contributions
illy+0.021
 biggest+0.021
 favourite+0.021
 groceries+0.020
 favorite+0.020
Neg logit contributions
pe-0.028
ro-0.027
bench-0.025
 swoop-0.024
 dent-0.024
Token
Token ID198
Feature activation-9.17e-03

Pos logit contributions
pe+0.000
ro+0.000
bench+0.000
 swoop+0.000
ider+0.000
Neg logit contributions
illy-0.000
 groceries-0.000
 biggest-0.000
 favourite-0.000
 favorite-0.000
Token corn
Token ID11676
Feature activation+5.41e-03

Pos logit contributions
illy+0.063
ippi+0.060
pieces+0.060
 favourite+0.057
 biggest+0.055
Neg logit contributions
pe-0.095
ro-0.090
 swoop-0.085
h-0.080
 spe-0.079
Token?"
Token ID1701
Feature activation-2.44e-02

Pos logit contributions
illy+0.007
 biggest+0.007
 favourite+0.007
 groceries+0.007
 favorite+0.007
Neg logit contributions
pe-0.011
ro-0.011
bench-0.009
 swoop-0.009
ider-0.009
Token
Token ID198
Feature activation-2.36e-02

Pos logit contributions
pe+0.042
ro+0.041
bench+0.036
 swoop+0.035
ider+0.034
Neg logit contributions
illy-0.040
 biggest-0.039
 favourite-0.039
 groceries-0.039
 favorite-0.038
Token
Token ID198
Feature activation-2.44e-02

Pos logit contributions
pe+0.041
ro+0.040
bench+0.035
 swoop+0.034
ider+0.034
Neg logit contributions
illy-0.038
 favourite-0.037
 biggest-0.037
 groceries-0.037
 favorite-0.036
TokenTom
Token ID13787
Feature activation+5.83e-03

Pos logit contributions
pe+0.042
ro+0.041
 dent+0.038
bench+0.038
fly+0.037
Neg logit contributions
illy-0.036
 groceries-0.036
 new-0.035
 favourite-0.035
 favorite-0.034
Token nods
Token ID34833
Feature activation-2.48e-02

Pos logit contributions
illy+0.009
 favorite+0.009
 favourite+0.009
 biggest+0.009
 groceries+0.009
Neg logit contributions
pe-0.010
ro-0.010
bench-0.009
ider-0.009
 dent-0.008
Token.
Token ID13
Feature activation-4.69e-03

Pos logit contributions
ro+0.053
pe+0.052
bench+0.049
 dent+0.049
 swoop+0.047
Neg logit contributions
 favorite-0.029
 new-0.029
illy-0.028
 favourite-0.027
 biggest-0.026
Token He
Token ID679
Feature activation-2.36e-02

Pos logit contributions
pe+0.010
ro+0.009
 swoop+0.009
 dent+0.008
bench+0.008
Neg logit contributions
 favorite-0.006
 biggest-0.006
 new-0.006
 favourite-0.006
 groceries-0.006
Token takes
Token ID2753
Feature activation+3.68e-02

Pos logit contributions
pe+0.038
ro+0.037
bench+0.032
 swoop+0.031
ider+0.031
Neg logit contributions
illy-0.041
 groceries-0.040
 biggest-0.040
 favourite-0.039
 favorite-0.039
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
illy+0.056
 favourite+0.050
idy+0.049
 biggest+0.047
 groceries+0.047
Neg logit contributions
ro-0.068
pe-0.067
 swoop-0.062
bench-0.058
 charged-0.058
Token corn
Token ID11676
Feature activation-2.12e-02

Pos logit contributions
illy+0.028
 favourite+0.027
 biggest+0.027
 groceries+0.027
 favorite+0.026
Neg logit contributions
pe-0.044
ro-0.043
bench-0.039
 swoop-0.038
ider-0.038
Token happy
Token ID3772
Feature activation-1.73e-02

Pos logit contributions
illy+0.007
 groceries+0.007
 biggest+0.006
 favourite+0.006
 favorite+0.006
Neg logit contributions
pe-0.011
ro-0.010
bench-0.009
ider-0.009
 swoop-0.009
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
pe+0.039
ro+0.038
bench+0.035
 swoop+0.034
 dent+0.034
Neg logit contributions
 favorite-0.020
illy-0.020
 favourite-0.020
 groceries-0.019
 biggest-0.019
Token and
Token ID290
Feature activation+1.25e-03

Pos logit contributions
illy+0.020
 favourite+0.019
 biggest+0.019
 groceries+0.019
 favorite+0.019
Neg logit contributions
pe-0.026
ro-0.026
bench-0.023
 swoop-0.022
ider-0.022
Token she
Token ID673
Feature activation-3.92e-03

Pos logit contributions
illy+0.002
 groceries+0.002
 biggest+0.002
 favourite+0.002
 favorite+0.002
Neg logit contributions
pe-0.002
ro-0.002
bench-0.002
 swoop-0.002
ider-0.002
Token liked
Token ID8288
Feature activation+3.36e-03

Pos logit contributions
pe+0.009
ro+0.009
bench+0.009
 dent+0.008
ider+0.008
Neg logit contributions
 favorite-0.004
 favourite-0.004
 biggest-0.004
illy-0.004
 amazing-0.003
Token to
Token ID284
Feature activation-4.16e-02

Pos logit contributions
illy+0.004
 biggest+0.004
 favourite+0.004
 groceries+0.003
 favorite+0.003
Neg logit contributions
pe-0.007
ro-0.007
bench-0.007
 swoop-0.007
 ago-0.006
Token wear
Token ID5806
Feature activation+1.63e-02

Pos logit contributions
pe+0.085
ro+0.083
ider+0.077
bench+0.076
 dent+0.075
Neg logit contributions
 favourite-0.053
illy-0.053
 favorite-0.052
 biggest-0.052
 groceries-0.050
Token colorful
Token ID20239
Feature activation+1.33e-02

Pos logit contributions
illy+0.024
ete+0.021
 biggest+0.021
idy+0.021
elling+0.021
Neg logit contributions
ro-0.030
pe-0.028
 swoop-0.026
bench-0.025
bes-0.025
Token clothes
Token ID8242
Feature activation+1.80e-02

Pos logit contributions
illy+0.019
 biggest+0.018
ete+0.017
 groceries+0.016
 favourite+0.016
Neg logit contributions
ro-0.025
pe-0.024
 swoop-0.021
bench-0.021
 dent-0.021
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
illy+0.031
 biggest+0.030
 groceries+0.030
 favourite+0.030
 tomatoes+0.029
Neg logit contributions
pe-0.028
ro-0.028
bench-0.025
ider-0.024
 ago-0.022
Token Her
Token ID2332
Feature activation-1.88e-03

Pos logit contributions
illy+0.030
 groceries+0.030
 biggest+0.029
idy+0.029
ils+0.028
Neg logit contributions
ro-0.028
pe-0.026
 ago-0.022
bench-0.021
h-0.020
Token it
Token ID340
Feature activation+4.43e-03

Pos logit contributions
ro+0.011
pe+0.010
h+0.009
 swoop+0.009
 ago+0.009
Neg logit contributions
 biggest-0.015
illy-0.014
 groceries-0.014
 favourite-0.014
 favorite-0.014
Token was
Token ID373
Feature activation+7.94e-03

Pos logit contributions
illy+0.007
 biggest+0.007
 favourite+0.007
 groceries+0.007
 favorite+0.007
Neg logit contributions
pe-0.008
ro-0.008
bench-0.007
ider-0.006
 swoop-0.006
Token very
Token ID845
Feature activation+7.25e-03

Pos logit contributions
 favorite+0.009
 favourite+0.009
 biggest+0.009
 new+0.009
 groceries+0.009
Neg logit contributions
pe-0.018
ro-0.018
fer-0.016
ider-0.015
bench-0.015
Token cold
Token ID4692
Feature activation-2.14e-02

Pos logit contributions
illy+0.010
 biggest+0.009
 groceries+0.009
 favourite+0.009
 favorite+0.009
Neg logit contributions
pe-0.015
ro-0.015
bench-0.013
 swoop-0.013
ider-0.013
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
pe+0.044
ro+0.044
bench+0.039
 swoop+0.038
ider+0.038
Neg logit contributions
illy-0.027
 favourite-0.027
 biggest-0.026
 groceries-0.026
 favorite-0.026
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
illy+0.002
 biggest+0.002
 favourite+0.002
 groceries+0.002
 favorite+0.002
Neg logit contributions
pe-0.003
ro-0.003
bench-0.003
ider-0.003
 swoop-0.003
Token ground
Token ID2323
Feature activation-3.44e-02

Pos logit contributions
pe+0.042
ro+0.041
bench+0.036
ider+0.035
 swoop+0.035
Neg logit contributions
illy-0.033
 biggest-0.032
 groceries-0.032
 favourite-0.032
 favorite-0.031
Token was
Token ID373
Feature activation+1.44e-03

Pos logit contributions
pe+0.066
ro+0.064
bench+0.057
ider+0.056
 dent+0.055
Neg logit contributions
illy-0.049
 biggest-0.048
 favorite-0.048
 favourite-0.048
 groceries-0.047
Token icy
Token ID30284
Feature activation-2.77e-02

Pos logit contributions
 favorite+0.002
 new+0.002
 biggest+0.002
 favourite+0.002
 amazing+0.001
Neg logit contributions
pe-0.003
ro-0.003
fer-0.003
ider-0.003
bench-0.003
Token.
Token ID13
Feature activation+2.35e-02

Pos logit contributions
pe+0.057
ro+0.055
bench+0.049
ider+0.049
 swoop+0.049
Neg logit contributions
 biggest-0.036
 groceries-0.036
illy-0.035
 favorite-0.035
 favourite-0.034
Token Lily
Token ID20037
Feature activation+1.47e-02

Pos logit contributions
illy+0.044
 groceries+0.043
 biggest+0.042
ils+0.042
 amazing+0.042
Neg logit contributions
ro-0.036
pe-0.031
bench-0.027
 dent-0.025
 ago-0.025
Token mess
Token ID2085
Feature activation-3.40e-02

Pos logit contributions
pe+0.002
ro+0.001
bench+0.001
 swoop+0.001
ider+0.001
Neg logit contributions
illy-0.002
 biggest-0.002
 groceries-0.002
 favourite-0.002
 favorite-0.002
Token.
Token ID13
Feature activation-2.05e-03

Pos logit contributions
pe+0.065
ro+0.063
 swoop+0.058
bench+0.056
 dent+0.055
Neg logit contributions
illy-0.048
 groceries-0.047
 favorite-0.046
 biggest-0.046
 favourite-0.046
Token And
Token ID843
Feature activation-1.13e-02

Pos logit contributions
pe+0.004
ro+0.004
bench+0.003
ider+0.003
 dent+0.003
Neg logit contributions
illy-0.003
 favourite-0.003
 biggest-0.003
 groceries-0.003
 favorite-0.003
Token they
Token ID484
Feature activation-7.59e-03

Pos logit contributions
ro+0.022
pe+0.021
ider+0.018
bench+0.018
 ago+0.018
Neg logit contributions
illy-0.016
 biggest-0.015
 groceries-0.015
 favourite-0.015
 favorite-0.014
Token have
Token ID423
Feature activation+1.77e-02

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
ider+0.012
 swoop+0.012
Neg logit contributions
illy-0.011
 favourite-0.011
 groceries-0.011
 favorite-0.011
 biggest-0.011
Token to
Token ID284
Feature activation-7.72e-02

Pos logit contributions
illy+0.022
 biggest+0.020
 favourite+0.019
idy+0.019
pieces+0.018
Neg logit contributions
pe-0.037
ro-0.037
bench-0.034
 ago-0.033
 dent-0.033
Token keep
Token ID1394
Feature activation+2.00e-02

Pos logit contributions
pe+0.161
ro+0.157
ider+0.150
bench+0.145
 swoop+0.140
Neg logit contributions
illy-0.096
 favorite-0.088
 groceries-0.088
 favourite-0.087
 amazing-0.087
Token away
Token ID1497
Feature activation+9.39e-03

Pos logit contributions
illy+0.033
idy+0.031
 biggest+0.031
erry+0.030
oll+0.030
Neg logit contributions
pe-0.027
 dent-0.026
ro-0.026
 breath-0.026
bench-0.026
Token from
Token ID422
Feature activation+7.87e-04

Pos logit contributions
 favourite+0.018
 biggest+0.017
illy+0.017
 groceries+0.016
pieces+0.016
Neg logit contributions
ro-0.013
pe-0.013
 dent-0.010
 hon-0.010
 ago-0.010
Token her
Token ID607
Feature activation+1.14e-02

Pos logit contributions
illy+0.001
 favourite+0.001
 biggest+0.001
 groceries+0.001
 favorite+0.001
Neg logit contributions
pe-0.001
ro-0.001
 ago-0.001
 dent-0.001
 dodging-0.001
Token makeup
Token ID16029
Feature activation-7.25e-03

Pos logit contributions
illy+0.016
 biggest+0.015
 favourite+0.015
 groceries+0.015
 favorite+0.015
Neg logit contributions
pe-0.023
ro-0.022
bench-0.020
 swoop-0.019
ider-0.019
Token"
Token ID1
Feature activation-2.27e-02

Pos logit contributions
pe+0.061
ro+0.060
bench+0.057
 dent+0.054
 swoop+0.052
Neg logit contributions
illy-0.050
 new-0.045
 groceries-0.045
 biggest-0.045
 favorite-0.044
TokenYou
Token ID1639
Feature activation-6.82e-03

Pos logit contributions
bench+0.041
 ago+0.041
pe+0.040
ro+0.040
 dent+0.037
Neg logit contributions
 favourite-0.032
 favorite-0.032
 biggest-0.030
 other-0.028
 groceries-0.028
Token don
Token ID836
Feature activation-8.46e-03

Pos logit contributions
pe+0.013
ro+0.012
 swoop+0.011
 dent+0.011
 wob+0.011
Neg logit contributions
illy-0.010
 groceries-0.010
 favourite-0.010
 biggest-0.009
 favorite-0.009
Token't
Token ID470
Feature activation-2.66e-02

Pos logit contributions
ro+0.011
pe+0.011
bench+0.009
ider+0.009
 swoop+0.008
Neg logit contributions
illy-0.017
 biggest-0.017
 groceries-0.017
 favourite-0.017
 favorite-0.016
Token have
Token ID423
Feature activation+1.34e-02

Pos logit contributions
pe+0.052
ro+0.049
ider+0.046
bench+0.045
 dodging+0.042
Neg logit contributions
illy-0.038
 biggest-0.036
 favorite-0.035
 groceries-0.035
 favourite-0.035
Token to
Token ID284
Feature activation-7.90e-02

Pos logit contributions
illy+0.018
 biggest+0.016
 favourite+0.016
pieces+0.016
 groceries+0.015
Neg logit contributions
ro-0.025
pe-0.024
 dent-0.023
 bru-0.023
 swoop-0.023
Token get
Token ID651
Feature activation+1.65e-02

Pos logit contributions
pe+0.154
bench+0.147
ro+0.146
ider+0.145
 ago+0.139
Neg logit contributions
illy-0.106
 amazing-0.097
 new-0.096
 favorite-0.095
 delicious-0.095
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
illy+0.033
 favourite+0.032
 biggest+0.031
 favorite+0.030
 groceries+0.030
Neg logit contributions
pe-0.022
ro-0.022
 swoop-0.019
 dent-0.019
bench-0.018
Token stitch
Token ID24695
Feature activation-2.50e-02

Pos logit contributions
 biggest+0.023
 new+0.022
 favorite+0.022
 favourite+0.022
 groceries+0.022
Neg logit contributions
pe-0.027
ro-0.026
bench-0.024
ider-0.023
 ago-0.023
Token if
Token ID611
Feature activation+2.44e-02

Pos logit contributions
pe+0.046
ro+0.045
bench+0.041
 swoop+0.039
ider+0.039
Neg logit contributions
 biggest-0.036
 favorite-0.035
illy-0.035
 groceries-0.035
 favourite-0.035
Token you
Token ID345
Feature activation-2.43e-02

Pos logit contributions
illy+0.035
 favourite+0.032
 biggest+0.032
pieces+0.031
 favorite+0.031
Neg logit contributions
pe-0.046
ro-0.045
 swoop-0.040
ider-0.039
 ago-0.039
Token a
Token ID257
Feature activation-2.68e-04

Pos logit contributions
illy+0.018
 groceries+0.017
 biggest+0.017
 favourite+0.017
 favorite+0.016
Neg logit contributions
pe-0.033
ro-0.033
ider-0.029
 swoop-0.029
bench-0.029
Token joke
Token ID9707
Feature activation+5.27e-03

Pos logit contributions
pe+0.001
ro+0.001
bench+0.000
ider+0.000
 swoop+0.000
Neg logit contributions
illy-0.000
 biggest-0.000
 favourite-0.000
 groceries-0.000
 favorite-0.000
Token.
Token ID13
Feature activation+9.69e-03

Pos logit contributions
illy+0.007
 groceries+0.007
 favourite+0.007
 biggest+0.007
 favorite+0.007
Neg logit contributions
pe-0.010
ro-0.010
bench-0.009
 swoop-0.009
 dent-0.008
Token I
Token ID314
Feature activation-2.20e-02

Pos logit contributions
illy+0.014
 biggest+0.013
 favourite+0.013
 groceries+0.013
 favorite+0.013
Neg logit contributions
pe-0.019
ro-0.018
bench-0.017
 swoop-0.016
ider-0.016
Token forgot
Token ID16453
Feature activation+2.58e-02

Pos logit contributions
pe+0.042
ro+0.040
bench+0.040
ider+0.038
 dodging+0.038
Neg logit contributions
 favorite-0.031
 favourite-0.030
illy-0.028
 amazing-0.028
 delicious-0.027
Token to
Token ID284
Feature activation-7.49e-02

Pos logit contributions
illy+0.038
 favourite+0.035
 biggest+0.035
 groceries+0.033
ippi+0.032
Neg logit contributions
pe-0.048
ro-0.046
 hon-0.042
 dent-0.041
 ago-0.040
Token throw
Token ID3714
Feature activation+2.04e-02

Pos logit contributions
pe+0.144
ro+0.140
ider+0.138
bench+0.131
 ago+0.120
Neg logit contributions
illy-0.107
 amazing-0.097
 delicious-0.096
 favorite-0.096
 new-0.095
Token it
Token ID340
Feature activation+2.28e-02

Pos logit contributions
illy+0.030
 favourite+0.028
 biggest+0.028
 groceries+0.027
 favorite+0.027
Neg logit contributions
pe-0.039
ro-0.038
 dent-0.034
 swoop-0.034
bench-0.032
Token away
Token ID1497
Feature activation-1.25e-02

Pos logit contributions
 favorite+0.029
 groceries+0.029
 biggest+0.029
 favourite+0.029
illy+0.029
Neg logit contributions
pe-0.046
ro-0.045
bench-0.041
 swoop-0.039
ider-0.039
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
pe+0.023
ro+0.022
bench+0.020
 swoop+0.019
ider+0.019
Neg logit contributions
illy-0.019
 biggest-0.018
 groceries-0.018
 favourite-0.018
 favorite-0.018
Token It
Token ID632
Feature activation-1.06e-02

Pos logit contributions
illy+0.021
 groceries+0.020
 favourite+0.020
 biggest+0.020
 favorite+0.019
Neg logit contributions
pe-0.025
ro-0.025
bench-0.021
ider-0.021
 swoop-0.021
Token something
Token ID1223
Feature activation+7.01e-02

Pos logit contributions
illy+0.044
 favourite+0.040
 biggest+0.039
idy+0.039
 favorite+0.037
Neg logit contributions
ro-0.050
pe-0.047
 swoop-0.042
 breath-0.042
 dent-0.042
Token.
Token ID13
Feature activation+8.59e-03

Pos logit contributions
illy+0.104
pieces+0.099
 favourite+0.094
 biggest+0.093
ippi+0.091
Neg logit contributions
ro-0.124
pe-0.116
ider-0.113
 dent-0.106
 wob-0.102
Token And
Token ID843
Feature activation-1.52e-02

Pos logit contributions
illy+0.013
 favourite+0.012
 biggest+0.012
 groceries+0.012
 favorite+0.012
Neg logit contributions
pe-0.016
ro-0.016
bench-0.014
ider-0.013
 swoop-0.013
Token you
Token ID345
Feature activation-9.13e-03

Pos logit contributions
pe+0.030
ro+0.030
bench+0.026
 swoop+0.026
ider+0.026
Neg logit contributions
illy-0.021
 favourite-0.020
 biggest-0.020
 groceries-0.020
 favorite-0.020
Token have
Token ID423
Feature activation+1.97e-02

Pos logit contributions
pe+0.014
ro+0.014
bench+0.012
 swoop+0.012
 dent+0.011
Neg logit contributions
illy-0.016
 favourite-0.016
 biggest-0.016
 groceries-0.016
 favorite-0.016
Token to
Token ID284
Feature activation-7.47e-02

Pos logit contributions
illy+0.027
 biggest+0.024
 favourite+0.024
pieces+0.023
idy+0.023
Neg logit contributions
ro-0.039
pe-0.038
 ago-0.034
 swoop-0.034
bench-0.034
Token take
Token ID1011
Feature activation+3.28e-02

Pos logit contributions
pe+0.146
ro+0.144
ider+0.140
bench+0.131
 dodging+0.127
Neg logit contributions
illy-0.102
 groceries-0.094
 new-0.093
 amazing-0.092
 biggest-0.092
Token turns
Token ID4962
Feature activation-8.43e-03

Pos logit contributions
illy+0.053
 favourite+0.048
idy+0.046
 biggest+0.046
 favorite+0.044
Neg logit contributions
ro-0.058
pe-0.054
 swoop-0.048
 dent-0.047
 hon-0.047
Token and
Token ID290
Feature activation-8.85e-03

Pos logit contributions
pe+0.018
ro+0.016
 swoop+0.016
bench+0.016
ider+0.015
Neg logit contributions
 groceries-0.010
 new-0.010
illy-0.010
 favorite-0.009
 favourite-0.009
Token play
Token ID711
Feature activation-4.72e-04

Pos logit contributions
pe+0.017
ro+0.016
bench+0.015
ider+0.015
 swoop+0.014
Neg logit contributions
 groceries-0.013
 favourite-0.012
 favorite-0.012
 biggest-0.012
 new-0.012
Token nicely
Token ID16576
Feature activation-4.15e-03

Pos logit contributions
pe+0.001
ro+0.001
ider+0.001
bench+0.001
 dent+0.001
Neg logit contributions
illy-0.001
 groceries-0.001
 favourite-0.001
 biggest-0.001
 favorite-0.001
Token was
Token ID373
Feature activation-1.07e-02

Pos logit contributions
illy+0.053
 favourite+0.052
 favorite+0.051
 groceries+0.051
 biggest+0.051
Neg logit contributions
pe-0.061
ro-0.060
bench-0.053
ider-0.051
 swoop-0.051
Token okay
Token ID8788
Feature activation+1.15e-02

Pos logit contributions
pe+0.019
ro+0.018
bench+0.016
 swoop+0.015
ider+0.015
Neg logit contributions
illy-0.018
 favourite-0.017
 groceries-0.017
 biggest-0.017
 favorite-0.017
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
illy+0.015
 favorite+0.014
 favourite+0.014
 groceries+0.014
 new+0.014
Neg logit contributions
pe-0.023
ro-0.023
bench-0.021
 swoop-0.021
 dent-0.020
Token They
Token ID1119
Feature activation-1.88e-02

Pos logit contributions
illy+0.003
 favourite+0.003
 biggest+0.003
 groceries+0.003
 favorite+0.003
Neg logit contributions
pe-0.004
ro-0.004
bench-0.003
 swoop-0.003
ider-0.003
Token decided
Token ID3066
Feature activation+1.57e-02

Pos logit contributions
pe+0.033
ro+0.032
bench+0.028
 swoop+0.027
ider+0.027
Neg logit contributions
illy-0.031
 biggest-0.030
 groceries-0.030
 favourite-0.030
 favorite-0.029
Token to
Token ID284
Feature activation-7.81e-02

Pos logit contributions
illy+0.018
 biggest+0.017
 favourite+0.017
 groceries+0.016
 favorite+0.015
Neg logit contributions
pe-0.035
ro-0.035
bench-0.031
 swoop-0.031
 dent-0.030
Token play
Token ID711
Feature activation-5.34e-03

Pos logit contributions
pe+0.145
ro+0.145
bench+0.136
ider+0.135
 ago+0.129
Neg logit contributions
illy-0.107
 favourite-0.106
 favorite-0.105
 biggest-0.105
 groceries-0.102
Token with
Token ID351
Feature activation+3.62e-02

Pos logit contributions
pe+0.009
ro+0.009
bench+0.007
ider+0.007
 dent+0.007
Neg logit contributions
illy-0.009
 biggest-0.009
 groceries-0.009
 favourite-0.009
 favorite-0.008
Token their
Token ID511
Feature activation+1.04e-02

Pos logit contributions
 biggest+0.057
illy+0.056
elling+0.055
 favourite+0.054
ingu+0.054
Neg logit contributions
ro-0.056
pe-0.054
 dent-0.046
 fought-0.045
 hon-0.045
Token own
Token ID898
Feature activation+9.87e-03

Pos logit contributions
illy+0.012
 biggest+0.012
 favourite+0.011
 groceries+0.011
 favorite+0.011
Neg logit contributions
pe-0.023
ro-0.023
bench-0.020
 dent-0.020
 swoop-0.020
Token toys
Token ID14958
Feature activation+1.13e-02

Pos logit contributions
illy+0.010
 favourite+0.010
 groceries+0.010
 biggest+0.010
 favorite+0.010
Neg logit contributions
pe-0.023
ro-0.022
bench-0.021
 swoop-0.020
ider-0.020