TOP ACTIVATIONS
MAX = 0.175

Some of them whispered . Some of them shook their heads
are small and shiny . Some are big and rough .
on the black board . Some drew flowers , some drew
figure out the answer . Some people looked up in the
was different and unique . Some of the kids swapped dolls
know where it landed . Some of them did not see
smiled and waved back . Some of them told him stories
it broke into pieces . Some of the egg fell on
and made a mess . Some of the cheese sticks and
and shells and coral . Some of the coral was broken
and some were rough . Some were soft and some were
box has a hole . Some toys fall out of the
room was a mess . Some of their toys were broken
rocks and the flowers . Some were smooth and some were
shook and tremb led . Some of the books and toys
, something bad happened . Some of the plants started to
and breaks into pieces . Some of the coins roll out
is high and fast . Some people and cars are stuck
was torn and ripped . Some pages were on the floor
colors , and hardness . Some minerals are very rare and

MIN ACTIVATIONS
MIN = -0.184

She loved to go on pic n ics with her family
Tim my loved going on pic n ics with his family
family were ready to go pic n icking again . But
and it was going on pic n ics !
. She loved to have pic n ics with her family
a family who really enjoyed pic n ics . They loved
girl named Lily who loved pic n ics . She would
liked to play and have pic n ics . One day
favourite place to go for pic n ics . <|endoftext|> Once
near the cliff and have pic n ics . One day
saw lots of people having pic n ics and walking their
even see other people having pic n ics .
in the garden and have pic n ics together .
that said " Welcome to Kinder g arten !"
the warm sunshine and having pic n ics with her family
their salad and have happy pic n ics together . <|endoftext|>
so excited ! She loved pic n ics and spending time
to be careful when having pic n ics and not to
roof . They would have pic n ics and play games
see the bird and have pic n ics . <|endoftext|> Once

INTERVAL 0.085 - 0.175
CONTAINS 0.911%

mom to help her . But her mom was very selfish
smile , but the best part was when she found something
big , red ball . But her mom said , "
what to do . She remembered that her mom told her
healthy and safe . Even though it was hard , he

INTERVAL -0.004 - 0.085
CONTAINS 56.031%

They went to get ice cream from the shop and Mark
see who could make the tallest tower . They used all
! How can you make tea so quickly ?"
her normal size . From that day on , Lily never
you are very kind . Of course you can touch it

INTERVAL -0.094 - -0.004
CONTAINS 41.332%

time , there was a dolphin named Flip . Flip was
porch . <|endoftext|> Ben and Mia were playing in the yard
ice cream and fruit . They did not want to lose
to help Max , but he said it was too late
go . She felt sad and alone . Twe

INTERVAL -0.184 - -0.094
CONTAINS 1.726%

stay best friends forever . <|endoftext|> M olly and her dad
was worried that something might go wrong and that the machine
. <|endoftext|> Once upon a time , there was a big
and had a lot of fun . The
. <|endoftext|> Once upon a time , there was a little
Token Some
Token ID2773
Feature activation+1.68e-01

Pos logit contributions
 grandmother+0.020
 gen+0.019
 parent+0.019
 lamp+0.018
 grandfather+0.018
Neg logit contributions
|-0.013
des-0.009
 jog-0.009
 stroll-0.009
 walks-0.009
Token of
Token ID286
Feature activation+7.41e-02

Pos logit contributions
 grandmother+0.304
 gen+0.270
 parent+0.267
 grandfather+0.260
 lamp+0.259
Neg logit contributions
|-0.307
des-0.250
 jog-0.242
go-0.237
 stroll-0.236
Token them
Token ID606
Feature activation-1.34e-02

Pos logit contributions
 grandmother+0.183
 gen+0.169
 parent+0.166
 grandfather+0.164
 lamp+0.163
Neg logit contributions
|-0.086
 jog-0.062
 walks-0.058
 stroll-0.056
des-0.056
Token whispered
Token ID25029
Feature activation+6.60e-02

Pos logit contributions
|+0.021
des+0.016
 jog+0.015
go+0.015
 stroll+0.014
Neg logit contributions
 grandmother-0.028
 gen-0.026
 parent-0.025
 lamp-0.025
 grandfather-0.025
Token.
Token ID13
Feature activation+9.01e-03

Pos logit contributions
 grandmother+0.151
 gen+0.139
 parent+0.136
 grandfather+0.134
 lamp+0.133
Neg logit contributions
|-0.081
 jog-0.063
 walks-0.059
des-0.059
 stroll-0.059
Token Some
Token ID2773
Feature activation+1.70e-01

Pos logit contributions
 grandmother+0.020
 gen+0.019
 parent+0.018
 lamp+0.018
aby+0.018
Neg logit contributions
|-0.012
 jog-0.009
des-0.009
 stroll-0.009
 walks-0.009
Token of
Token ID286
Feature activation+7.68e-02

Pos logit contributions
 grandmother+0.311
 gen+0.276
 parent+0.273
 grandfather+0.266
 lamp+0.263
Neg logit contributions
|-0.309
des-0.251
 jog-0.248
go-0.241
 stroll-0.240
Token them
Token ID606
Feature activation-1.17e-02

Pos logit contributions
 grandmother+0.189
 gen+0.175
 parent+0.172
 grandfather+0.169
omm+0.169
Neg logit contributions
|-0.091
 jog-0.065
 walks-0.060
des-0.059
 stroll-0.058
Token shook
Token ID14682
Feature activation+2.58e-02

Pos logit contributions
|+0.018
des+0.014
 jog+0.013
go+0.013
 stroll+0.012
Neg logit contributions
 grandmother-0.025
 gen-0.023
 parent-0.023
 grandfather-0.022
 lamp-0.022
Token their
Token ID511
Feature activation+2.59e-02

Pos logit contributions
 grandmother+0.058
 gen+0.053
 parent+0.053
 grandfather+0.052
 lamp+0.051
Neg logit contributions
|-0.035
des-0.026
 jog-0.026
 stroll-0.024
go-0.024
Token heads
Token ID6665
Feature activation+1.25e-02

Pos logit contributions
 grandmother+0.041
 gen+0.036
 parent+0.035
 grandfather+0.034
 lamp+0.034
Neg logit contributions
|-0.053
des-0.044
 jog-0.044
go-0.043
 stroll-0.043
Token are
Token ID389
Feature activation+3.05e-02

Pos logit contributions
 grandmother+0.315
 gen+0.279
 parent+0.277
 grandfather+0.270
 lamp+0.267
Neg logit contributions
|-0.296
des-0.240
 jog-0.235
 stroll-0.228
go-0.228
Token small
Token ID1402
Feature activation+7.85e-03

Pos logit contributions
 grandmother+0.068
 gen+0.061
omm+0.061
 grandfather+0.059
aby+0.059
Neg logit contributions
|-0.037
 jog-0.034
 walks-0.032
 walk-0.031
 stroll-0.030
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
 grandmother+0.019
 gen+0.017
 parent+0.017
 grandfather+0.016
omm+0.016
Neg logit contributions
|-0.010
 jog-0.007
 walks-0.007
des-0.007
 stroll-0.007
Token shiny
Token ID22441
Feature activation+3.20e-04

Pos logit contributions
|+0.043
 walks+0.037
 jog+0.036
 walk+0.034
+0.034
Neg logit contributions
 grandmother-0.062
aby-0.055
omm-0.055
 grandfather-0.054
 parent-0.053
Token.
Token ID13
Feature activation+3.35e-03

Pos logit contributions
 grandmother+0.001
 parent+0.001
 gen+0.001
 grandfather+0.001
omm+0.001
Neg logit contributions
|-0.000
 walks-0.000
 jog-0.000
des-0.000
-0.000
Token Some
Token ID2773
Feature activation+1.70e-01

Pos logit contributions
 grandmother+0.008
 gen+0.007
 parent+0.007
 grandfather+0.007
aby+0.007
Neg logit contributions
|-0.005
 jog-0.003
des-0.003
-0.003
-0.003
Token are
Token ID389
Feature activation+3.10e-02

Pos logit contributions
 grandmother+0.352
 gen+0.315
 parent+0.313
 grandfather+0.307
 lamp+0.303
Neg logit contributions
|-0.266
des-0.209
 jog-0.205
 stroll-0.197
go-0.197
Token big
Token ID1263
Feature activation-3.73e-02

Pos logit contributions
 grandmother+0.069
omm+0.062
 gen+0.062
aby+0.060
 grandfather+0.060
Neg logit contributions
|-0.037
 jog-0.033
 walks-0.033
 walk-0.031
 high-0.031
Token and
Token ID290
Feature activation-3.24e-02

Pos logit contributions
|+0.042
 walks+0.033
 jog+0.032
des+0.030
 stroll+0.030
Neg logit contributions
 grandmother-0.090
 gen-0.081
aby-0.080
 parent-0.080
omm-0.080
Token rough
Token ID5210
Feature activation-2.29e-02

Pos logit contributions
|+0.045
 walks+0.039
 jog+0.037
+0.036
 walk+0.035
Neg logit contributions
 grandmother-0.071
aby-0.065
omm-0.063
 grandfather-0.062
 parent-0.062
Token.
Token ID13
Feature activation+3.21e-03

Pos logit contributions
|+0.031
des+0.024
 jog+0.023
 stroll+0.022
go+0.022
Neg logit contributions
 grandmother-0.052
 gen-0.047
 parent-0.047
 grandfather-0.046
 lamp-0.045
Token on
Token ID319
Feature activation-2.69e-02

Pos logit contributions
 grandmother+0.124
 gen+0.113
 parent+0.112
 grandfather+0.110
omm+0.108
Neg logit contributions
|-0.073
des-0.054
 jog-0.053
 stroll-0.051
 walks-0.050
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
|+0.031
 jog+0.023
 walks+0.022
 time+0.021
 stroll+0.021
Neg logit contributions
 grandmother-0.063
 gen-0.059
 parent-0.058
 grandfather-0.057
omm-0.057
Token black
Token ID2042
Feature activation-3.30e-02

Pos logit contributions
|+0.035
 jog+0.029
 stroll+0.029
 walks+0.028
des+0.028
Neg logit contributions
 grandmother-0.024
omm-0.021
 gen-0.021
 parent-0.021
illian-0.021
Tokenboard
Token ID3526
Feature activation+3.63e-02

Pos logit contributions
|+0.058
 jog+0.049
 stroll+0.048
hed+0.047
+0.047
Neg logit contributions
 grandmother-0.058
 parent-0.050
 gen-0.050
 grandfather-0.050
 Ev-0.049
Token.
Token ID13
Feature activation+9.88e-03

Pos logit contributions
 grandmother+0.080
 gen+0.073
 parent+0.072
 grandfather+0.071
 lamp+0.070
Neg logit contributions
|-0.051
des-0.039
 jog-0.038
go-0.037
 stroll-0.036
Token Some
Token ID2773
Feature activation+1.70e-01

Pos logit contributions
 grandmother+0.022
 gen+0.020
 parent+0.020
 grandfather+0.019
 lamp+0.019
Neg logit contributions
|-0.014
des-0.010
 jog-0.010
 stroll-0.010
go-0.010
Token drew
Token ID9859
Feature activation+1.01e-01

Pos logit contributions
 grandmother+0.326
aby+0.289
 gen+0.288
omm+0.287
 grandfather+0.284
Neg logit contributions
|-0.277
 jog-0.234
 walks-0.229
 stroll-0.224
des-0.223
Token flowers
Token ID12734
Feature activation+5.09e-02

Pos logit contributions
 grandmother+0.218
 parent+0.196
 grandfather+0.194
omm+0.193
 gen+0.193
Neg logit contributions
|-0.136
 walks-0.110
 stroll-0.104
hed-0.104
 jog-0.102
Token,
Token ID11
Feature activation+2.96e-02

Pos logit contributions
 grandmother+0.120
 parent+0.108
 gen+0.108
 grandfather+0.107
omm+0.105
Neg logit contributions
|-0.064
 jog-0.048
des-0.046
 stroll-0.045
 walks-0.045
Token some
Token ID617
Feature activation+4.15e-02

Pos logit contributions
 grandmother+0.060
aby+0.056
omm+0.056
 grandfather+0.053
 parent+0.052
Neg logit contributions
|-0.042
 walks-0.040
 jog-0.036
-0.035
 stroll-0.035
Token drew
Token ID9859
Feature activation+9.41e-02

Pos logit contributions
 grandmother+0.078
omm+0.072
aby+0.069
 grandfather+0.068
 gen+0.067
Neg logit contributions
|-0.066
 walks-0.059
 jog-0.058
 stroll-0.057
 walk-0.055
Token figure
Token ID3785
Feature activation+8.20e-02

Pos logit contributions
|+0.016
 jog+0.013
des+0.012
 stroll+0.012
 walk+0.012
Neg logit contributions
 grandmother-0.025
 gen-0.023
 parent-0.022
 grandfather-0.022
ware-0.022
Token out
Token ID503
Feature activation+4.19e-02

Pos logit contributions
 grandmother+0.185
 gen+0.172
 parent+0.167
omm+0.167
 grandfather+0.164
Neg logit contributions
|-0.106
 jog-0.081
des-0.081
 walks-0.079
 stroll-0.078
Token the
Token ID262
Feature activation-5.29e-03

Pos logit contributions
 grandmother+0.076
 gen+0.067
 parent+0.067
 grandfather+0.065
 lamp+0.064
Neg logit contributions
|-0.077
des-0.062
 jog-0.061
go-0.059
 stroll-0.059
Token answer
Token ID3280
Feature activation+7.67e-02

Pos logit contributions
|+0.009
des+0.008
go+0.007
 jog+0.007
 stroll+0.007
Neg logit contributions
 grandmother-0.010
 gen-0.009
 parent-0.009
 lamp-0.009
 grandfather-0.009
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
 grandmother+0.169
 gen+0.154
 parent+0.152
 grandfather+0.149
 lamp+0.148
Neg logit contributions
|-0.108
des-0.082
 jog-0.080
go-0.077
 stroll-0.077
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.015
 gen+0.014
 parent+0.014
 lamp+0.014
ware+0.014
Neg logit contributions
|-0.008
 jog-0.006
 stroll-0.006
des-0.006
 walks-0.006
Token people
Token ID661
Feature activation+4.16e-02

Pos logit contributions
 grandmother+0.284
 gen+0.252
 parent+0.251
 lamp+0.241
 grandfather+0.240
Neg logit contributions
|-0.333
des-0.274
 jog-0.260
go-0.259
 stroll-0.254
Token looked
Token ID3114
Feature activation-7.71e-03

Pos logit contributions
 grandmother+0.091
 gen+0.083
 parent+0.082
 grandfather+0.080
 lamp+0.080
Neg logit contributions
|-0.060
des-0.046
 jog-0.044
go-0.043
 stroll-0.042
Token up
Token ID510
Feature activation+1.13e-02

Pos logit contributions
|+0.012
des+0.009
go+0.008
 jog+0.008
war+0.008
Neg logit contributions
 grandmother-0.017
 parent-0.015
 gen-0.015
 grandfather-0.015
 lamp-0.015
Token in
Token ID287
Feature activation+2.08e-02

Pos logit contributions
 grandmother+0.025
 parent+0.023
 gen+0.023
 grandfather+0.023
 lamp+0.022
Neg logit contributions
|-0.017
des-0.012
go-0.012
 jog-0.011
 stroll-0.011
Token the
Token ID262
Feature activation+7.59e-03

Pos logit contributions
 grandmother+0.048
 gen+0.044
 parent+0.043
 grandfather+0.043
omm+0.042
Neg logit contributions
|-0.027
 jog-0.020
des-0.020
 stroll-0.019
 walks-0.019
Token was
Token ID373
Feature activation+1.18e-02

Pos logit contributions
 grandmother+0.171
 gen+0.156
aby+0.151
 parent+0.151
 grandfather+0.150
Neg logit contributions
|-0.129
 jog-0.104
 walks-0.097
des-0.097
go-0.097
Token different
Token ID1180
Feature activation+6.27e-02

Pos logit contributions
 grandmother+0.027
 gen+0.025
omm+0.025
 parent+0.025
ware+0.024
Neg logit contributions
|-0.014
 jog-0.011
 walks-0.010
 play-0.010
 time-0.010
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
 grandmother+0.150
 gen+0.142
 parent+0.137
aby+0.137
 grandfather+0.136
Neg logit contributions
|-0.070
 jog-0.052
 walks-0.048
 stroll-0.047
 around-0.046
Token unique
Token ID3748
Feature activation+2.42e-02

Pos logit contributions
|+0.031
 jog+0.024
 walks+0.022
 play+0.022
+0.022
Neg logit contributions
 grandmother-0.056
 gen-0.052
omm-0.051
aby-0.051
 parent-0.050
Token.
Token ID13
Feature activation+8.91e-03

Pos logit contributions
 grandmother+0.056
 gen+0.052
 parent+0.051
 grandfather+0.050
illian+0.050
Neg logit contributions
|-0.030
 jog-0.023
 walks-0.022
 stroll-0.021
des-0.021
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.020
 gen+0.019
 parent+0.019
aby+0.018
 grandfather+0.018
Neg logit contributions
|-0.012
 jog-0.008
des-0.008
 stroll-0.008
go-0.008
Token of
Token ID286
Feature activation+7.06e-02

Pos logit contributions
 grandmother+0.318
 gen+0.282
 parent+0.281
 grandfather+0.273
 lamp+0.272
Neg logit contributions
|-0.299
des-0.241
 jog-0.234
go-0.228
 stroll-0.227
Token the
Token ID262
Feature activation+8.47e-03

Pos logit contributions
 grandmother+0.188
 gen+0.180
omm+0.177
 grandfather+0.172
 parent+0.169
Neg logit contributions
|-0.063
 jog-0.044
 walks-0.040
 play-0.038
 time-0.034
Token kids
Token ID3988
Feature activation-5.83e-03

Pos logit contributions
 gen+0.015
 grandmother+0.014
 Ang+0.014
bul+0.014
aby+0.014
Neg logit contributions
|-0.014
 jog-0.014
 play-0.012
 walks-0.012
 walk-0.012
Token swapped
Token ID37245
Feature activation+2.99e-02

Pos logit contributions
|+0.008
des+0.006
 jog+0.006
go+0.006
 stroll+0.006
Neg logit contributions
 grandmother-0.013
 gen-0.012
 parent-0.012
 grandfather-0.011
 lamp-0.011
Token dolls
Token ID36062
Feature activation+5.34e-02

Pos logit contributions
 grandmother+0.056
 gen+0.050
 parent+0.050
 grandfather+0.048
 lamp+0.048
Neg logit contributions
|-0.052
des-0.041
 jog-0.041
 stroll-0.040
 walks-0.040
Token know
Token ID760
Feature activation+7.91e-02

Pos logit contributions
 grandmother+0.047
 gen+0.043
 parent+0.042
 grandfather+0.041
 lamp+0.041
Neg logit contributions
|-0.028
des-0.020
 jog-0.019
 stroll-0.019
go-0.019
Token where
Token ID810
Feature activation+3.46e-02

Pos logit contributions
 grandmother+0.176
omm+0.166
 gen+0.163
 parent+0.160
 grandfather+0.157
Neg logit contributions
|-0.085
 walks-0.077
 jog-0.072
 trips-0.069
 time-0.069
Token it
Token ID340
Feature activation-2.08e-02

Pos logit contributions
 grandmother+0.071
 gen+0.064
 parent+0.064
 grandfather+0.062
 lamp+0.062
Neg logit contributions
|-0.055
des-0.043
 jog-0.042
go-0.040
 stroll-0.040
Token landed
Token ID11406
Feature activation-3.33e-02

Pos logit contributions
|+0.036
des+0.031
ashing+0.028
 stroll+0.027
ited+0.027
Neg logit contributions
 grandmother-0.038
 gen-0.037
 lamp-0.036
 parent-0.035
 fairy-0.034
Token.
Token ID13
Feature activation+3.65e-03

Pos logit contributions
|+0.055
des+0.043
 jog+0.042
 stroll+0.041
go+0.040
Neg logit contributions
 grandmother-0.067
 gen-0.061
 parent-0.060
 lamp-0.059
ware-0.058
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.008
 gen+0.008
 parent+0.007
ware+0.007
 lamp+0.007
Neg logit contributions
|-0.005
des-0.004
 jog-0.004
 stroll-0.003
go-0.003
Token of
Token ID286
Feature activation+7.43e-02

Pos logit contributions
 grandmother+0.284
 gen+0.249
 parent+0.246
 grandfather+0.240
 lamp+0.237
Neg logit contributions
|-0.328
des-0.272
 jog-0.268
go-0.262
 stroll-0.260
Token them
Token ID606
Feature activation-1.12e-02

Pos logit contributions
 grandmother+0.204
omm+0.197
 gen+0.190
 grandfather+0.186
 parent+0.183
Neg logit contributions
|-0.056
 walks-0.039
 jog-0.034
 time-0.031
 play-0.030
Token did
Token ID750
Feature activation+1.04e-02

Pos logit contributions
|+0.017
des+0.014
go+0.013
 jog+0.012
 stroll+0.012
Neg logit contributions
 grandmother-0.023
 parent-0.021
 gen-0.021
 lamp-0.021
 grandfather-0.020
Token not
Token ID407
Feature activation+2.53e-02

Pos logit contributions
 grandmother+0.019
 gen+0.017
 parent+0.016
omm+0.016
 grandfather+0.016
Neg logit contributions
|-0.018
 jog-0.016
 walks-0.015
 walk-0.015
des-0.015
Token see
Token ID766
Feature activation+3.05e-02

Pos logit contributions
 grandmother+0.055
 gen+0.050
 parent+0.049
 grandfather+0.049
omm+0.048
Neg logit contributions
|-0.037
des-0.028
 jog-0.028
 stroll-0.027
go-0.027
Token smiled
Token ID13541
Feature activation+4.46e-02

Pos logit contributions
 grandmother+0.160
omm+0.150
 parent+0.145
aby+0.145
 gen+0.143
Neg logit contributions
|-0.084
 jog-0.059
hed-0.057
 walks-0.056
 walk-0.055
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
 grandmother+0.108
 gen+0.101
 parent+0.097
ware+0.097
 lamp+0.097
Neg logit contributions
|-0.048
 jog-0.037
des-0.035
 stroll-0.034
go-0.033
Token waved
Token ID26834
Feature activation+1.36e-03

Pos logit contributions
|+0.018
 jog+0.015
 walks+0.015
 played+0.015
 chat+0.014
Neg logit contributions
 grandmother-0.036
omm-0.035
ware-0.034
 lamp-0.033
illian-0.033
Token back
Token ID736
Feature activation-3.62e-02

Pos logit contributions
 grandmother+0.003
 parent+0.003
 gen+0.003
 grandfather+0.003
 lamp+0.003
Neg logit contributions
|-0.002
des-0.001
 jog-0.001
go-0.001
 stroll-0.001
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
|+0.040
 jog+0.029
 walks+0.028
 stroll+0.027
des+0.027
Neg logit contributions
 grandmother-0.088
 gen-0.085
 parent-0.080
omm-0.080
 lamp-0.079
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.027
 gen+0.026
 parent+0.025
ware+0.025
 lamp+0.025
Neg logit contributions
|-0.013
 jog-0.008
 walks-0.008
des-0.008
-0.008
Token of
Token ID286
Feature activation+7.28e-02

Pos logit contributions
 grandmother+0.325
 gen+0.287
aby+0.287
 grandfather+0.284
 parent+0.282
Neg logit contributions
|-0.283
 jog-0.230
des-0.227
 walks-0.226
 stroll-0.221
Token them
Token ID606
Feature activation-1.26e-02

Pos logit contributions
 grandmother+0.199
 gen+0.190
omm+0.188
 grandfather+0.182
 parent+0.179
Neg logit contributions
|-0.057
 walks-0.037
 jog-0.035
 time-0.030
 play-0.030
Token told
Token ID1297
Feature activation+6.16e-02

Pos logit contributions
|+0.019
des+0.015
 jog+0.014
go+0.014
 stroll+0.013
Neg logit contributions
 grandmother-0.027
 gen-0.024
 parent-0.024
 grandfather-0.024
 lamp-0.024
Token him
Token ID683
Feature activation-3.36e-02

Pos logit contributions
 grandmother+0.145
 gen+0.135
 parent+0.132
ware+0.131
 lamp+0.130
Neg logit contributions
|-0.075
 jog-0.055
 walks-0.055
des-0.053
 stroll-0.053
Token stories
Token ID3923
Feature activation+2.08e-02

Pos logit contributions
|+0.059
des+0.047
 jog+0.045
go+0.045
 stroll+0.044
Neg logit contributions
 grandmother-0.064
 gen-0.057
 parent-0.057
 grandfather-0.055
 lamp-0.055
Token it
Token ID340
Feature activation-1.77e-02

Pos logit contributions
|+0.040
 jog+0.031
 walks+0.031
des+0.030
go+0.029
Neg logit contributions
 grandmother-0.069
 parent-0.062
 gen-0.062
 grandfather-0.062
aby-0.060
Token broke
Token ID6265
Feature activation+6.90e-02

Pos logit contributions
|+0.024
go+0.018
 jog+0.018
des+0.017
 walks+0.017
Neg logit contributions
 grandmother-0.040
 grandfather-0.036
 parent-0.035
 gen-0.035
omm-0.034
Token into
Token ID656
Feature activation+3.15e-02

Pos logit contributions
 grandmother+0.180
 gen+0.169
 grandfather+0.168
 parent+0.165
 wizard+0.164
Neg logit contributions
|-0.062
 down-0.045
 around-0.039
-0.038
 away-0.038
Token pieces
Token ID5207
Feature activation+3.26e-02

Pos logit contributions
 grandmother+0.066
 grandfather+0.063
rings+0.062
 wizard+0.061
 gen+0.060
Neg logit contributions
|-0.038
 time-0.034
 walks-0.033
 turns-0.033
 trips-0.031
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
 grandmother+0.075
 gen+0.068
 parent+0.067
 grandfather+0.066
 lamp+0.065
Neg logit contributions
|-0.044
des-0.033
 jog-0.032
go-0.031
 stroll-0.030
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.016
 gen+0.014
 parent+0.014
 grandfather+0.014
 lamp+0.014
Neg logit contributions
|-0.017
des-0.014
 jog-0.013
go-0.013
 stroll-0.013
Token of
Token ID286
Feature activation+7.13e-02

Pos logit contributions
 grandmother+0.336
 grandfather+0.298
aby+0.293
 parent+0.290
omm+0.289
Neg logit contributions
|-0.258
 walks-0.215
des-0.213
go-0.211
 jog-0.210
Token the
Token ID262
Feature activation+8.09e-03

Pos logit contributions
 grandmother+0.201
 grandfather+0.190
 gen+0.186
omm+0.182
 parent+0.182
Neg logit contributions
|-0.045
 time-0.028
 walks-0.026
 us-0.024
 jog-0.022
Token egg
Token ID5935
Feature activation+6.72e-02

Pos logit contributions
 grandmother+0.012
aby+0.011
omm+0.011
illian+0.011
 grandfather+0.010
Neg logit contributions
|-0.014
 picnic-0.014
 walk-0.013
 time-0.013
-0.013
Token fell
Token ID3214
Feature activation-1.90e-02

Pos logit contributions
 grandmother+0.156
 parent+0.139
 grandfather+0.139
 gen+0.138
omm+0.135
Neg logit contributions
|-0.086
 jog-0.063
des-0.062
go-0.062
 walks-0.061
Token on
Token ID319
Feature activation-1.37e-02

Pos logit contributions
|+0.022
 jog+0.015
des+0.015
go+0.014
 stroll+0.014
Neg logit contributions
 grandmother-0.048
 gen-0.043
 grandfather-0.043
 parent-0.043
omm-0.041
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
 grandmother+0.020
 gen+0.018
 parent+0.018
 grandfather+0.018
 lamp+0.018
Neg logit contributions
|-0.013
des-0.010
 jog-0.009
 stroll-0.009
go-0.009
Token made
Token ID925
Feature activation+3.30e-02

Pos logit contributions
|+0.024
 walks+0.020
 jog+0.019
 walk+0.018
 picnic+0.018
Neg logit contributions
 grandmother-0.042
omm-0.039
 grandfather-0.038
 parent-0.038
aby-0.038
Token a
Token ID257
Feature activation+2.81e-02

Pos logit contributions
 grandmother+0.070
 gen+0.063
 parent+0.063
 grandfather+0.062
 lamp+0.061
Neg logit contributions
|-0.049
des-0.038
 jog-0.037
 walks-0.036
go-0.036
Token mess
Token ID2085
Feature activation+1.12e-02

Pos logit contributions
 grandmother+0.065
ware+0.059
illian+0.059
rings+0.059
 grandfather+0.059
Neg logit contributions
|-0.033
 jog-0.026
 picnic-0.025
 walk-0.025
Yo-0.024
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 grandmother+0.024
 gen+0.022
 parent+0.021
 lamp+0.021
 grandfather+0.021
Neg logit contributions
|-0.017
des-0.013
 jog-0.013
 stroll-0.012
go-0.012
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.024
 gen+0.023
 parent+0.023
ware+0.022
 grandfather+0.022
Neg logit contributions
|-0.012
-0.007
des-0.007
 walks-0.007
 stroll-0.007
Token of
Token ID286
Feature activation+7.51e-02

Pos logit contributions
 grandmother+0.321
 gen+0.286
 parent+0.282
 grandfather+0.277
 lamp+0.273
Neg logit contributions
|-0.293
des-0.236
 jog-0.231
go-0.224
 stroll-0.224
Token the
Token ID262
Feature activation+7.44e-03

Pos logit contributions
 grandmother+0.207
 gen+0.193
 grandfather+0.192
omm+0.190
 parent+0.186
Neg logit contributions
|-0.060
 walks-0.036
 jog-0.032
 time-0.032
 play-0.031
Token cheese
Token ID9891
Feature activation+6.01e-02

Pos logit contributions
 grandmother+0.012
 gen+0.011
aby+0.011
 grandfather+0.011
 parent+0.011
Neg logit contributions
|-0.014
 jog-0.012
des-0.011
 walks-0.011
 stroll-0.011
Token sticks
Token ID16461
Feature activation+4.56e-02

Pos logit contributions
 grandmother+0.121
 gen+0.113
 lamp+0.109
 parent+0.109
ware+0.108
Neg logit contributions
|-0.097
des-0.079
 jog-0.073
go-0.072
 stroll-0.072
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
 grandmother+0.089
 gen+0.084
 lamp+0.082
 parent+0.081
ware+0.081
Neg logit contributions
|-0.075
des-0.059
go-0.055
 jog-0.054
 stroll-0.053
Token and
Token ID290
Feature activation-3.08e-02

Pos logit contributions
|+0.033
des+0.027
 jog+0.024
+0.024
ited+0.024
Neg logit contributions
 grandmother-0.034
 parent-0.032
 gen-0.032
 lamp-0.031
 grandfather-0.031
Token shells
Token ID19679
Feature activation+5.97e-03

Pos logit contributions
|+0.043
 walks+0.037
 jog+0.036
 dives+0.034
 picnic+0.034
Neg logit contributions
 grandmother-0.065
omm-0.060
illian-0.057
aby-0.057
 gen-0.057
Token and
Token ID290
Feature activation-2.57e-02

Pos logit contributions
 grandmother+0.012
 gen+0.011
 parent+0.011
 lamp+0.010
ware+0.010
Neg logit contributions
|-0.010
des-0.008
 jog-0.007
go-0.007
 stroll-0.007
Token coral
Token ID29537
Feature activation+2.44e-02

Pos logit contributions
|+0.038
 walks+0.032
 jog+0.032
 walk+0.029
des+0.029
Neg logit contributions
 grandmother-0.054
omm-0.050
aby-0.048
 gen-0.047
illian-0.047
Token.
Token ID13
Feature activation+3.78e-03

Pos logit contributions
 grandmother+0.046
 gen+0.044
ware+0.043
 parent+0.042
 lamp+0.042
Neg logit contributions
|-0.042
des-0.034
go-0.031
 jog-0.031
 stroll-0.030
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.009
 gen+0.008
 parent+0.008
 grandfather+0.008
 lamp+0.008
Neg logit contributions
|-0.005
des-0.003
 jog-0.003
go-0.003
-0.003
Token of
Token ID286
Feature activation+7.51e-02

Pos logit contributions
 grandmother+0.296
 gen+0.268
 parent+0.265
 grandfather+0.254
 lamp+0.254
Neg logit contributions
|-0.318
des-0.259
 jog-0.244
go-0.242
 stroll-0.239
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 grandmother+0.197
omm+0.181
 gen+0.180
 grandfather+0.177
 parent+0.175
Neg logit contributions
|-0.073
 jog-0.052
 walks-0.050
 stroll-0.046
 walk-0.044
Token coral
Token ID29537
Feature activation+3.37e-02

Pos logit contributions
 grandmother+0.022
omm+0.020
 gen+0.020
 parent+0.019
 Ev+0.019
Neg logit contributions
|-0.016
 jog-0.013
des-0.013
 stroll-0.013
 walks-0.013
Token was
Token ID373
Feature activation+1.73e-02

Pos logit contributions
 grandmother+0.066
 gen+0.062
 parent+0.060
ware+0.060
 lamp+0.059
Neg logit contributions
|-0.056
des-0.046
go-0.042
 jog-0.041
 stroll-0.041
Token broken
Token ID5445
Feature activation+3.44e-02

Pos logit contributions
 grandmother+0.036
 gen+0.033
 parent+0.032
 grandfather+0.032
 lamp+0.032
Neg logit contributions
|-0.026
des-0.020
 jog-0.020
 stroll-0.019
go-0.019
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
|+0.022
des+0.017
 jog+0.016
go+0.015
 stroll+0.015
Neg logit contributions
 grandmother-0.035
 gen-0.032
 parent-0.032
 grandfather-0.031
 lamp-0.031
Token some
Token ID617
Feature activation+4.49e-02

Pos logit contributions
|+0.037
 jog+0.032
 walks+0.032
 walk+0.030
+0.029
Neg logit contributions
 grandmother-0.059
aby-0.052
omm-0.052
 parent-0.051
 grandfather-0.051
Token were
Token ID547
Feature activation+1.65e-02

Pos logit contributions
 grandmother+0.112
 parent+0.101
 grandfather+0.101
 gen+0.101
omm+0.100
Neg logit contributions
|-0.051
 jog-0.037
des-0.036
 walks-0.035
 stroll-0.035
Token rough
Token ID5210
Feature activation-2.95e-02

Pos logit contributions
 grandmother+0.037
 gen+0.033
omm+0.033
 parent+0.032
 grandfather+0.032
Neg logit contributions
|-0.020
 jog-0.018
 walks-0.017
 stroll-0.016
 walk-0.016
Token.
Token ID13
Feature activation+1.68e-03

Pos logit contributions
|+0.043
des+0.032
 jog+0.031
go+0.030
 stroll+0.030
Neg logit contributions
 grandmother-0.065
 gen-0.059
 parent-0.058
 lamp-0.057
 grandfather-0.057
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.004
 gen+0.004
 parent+0.004
 grandfather+0.004
 wizard+0.004
Neg logit contributions
|-0.002
 jog-0.001
 stroll-0.001
des-0.001
 walks-0.001
Token were
Token ID547
Feature activation+2.34e-02

Pos logit contributions
 grandmother+0.383
 gen+0.347
 parent+0.345
 grandfather+0.338
 lamp+0.336
Neg logit contributions
|-0.232
des-0.175
 jog-0.169
go-0.163
 stroll-0.162
Token soft
Token ID2705
Feature activation-2.32e-02

Pos logit contributions
 grandmother+0.052
 gen+0.048
omm+0.046
 parent+0.046
 grandfather+0.045
Neg logit contributions
|-0.028
 jog-0.026
 walks-0.024
 walk-0.024
 stroll-0.023
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
|+0.034
des+0.026
 jog+0.024
go+0.024
 stroll+0.023
Neg logit contributions
 grandmother-0.051
 gen-0.047
 parent-0.046
 lamp-0.045
 grandfather-0.045
Token some
Token ID617
Feature activation+4.41e-02

Pos logit contributions
|+0.039
 jog+0.033
 walks+0.033
 walk+0.031
 stroll+0.031
Neg logit contributions
 grandmother-0.059
aby-0.052
 grandfather-0.052
omm-0.051
 gen-0.051
Token were
Token ID547
Feature activation+1.71e-02

Pos logit contributions
 grandmother+0.109
 gen+0.098
 parent+0.098
 grandfather+0.098
omm+0.097
Neg logit contributions
|-0.052
 jog-0.037
des-0.037
 stroll-0.035
 walks-0.035
Token box
Token ID3091
Feature activation+5.33e-02

Pos logit contributions
 grandmother+0.004
 Ang+0.004
aby+0.004
 parent+0.004
 grandfather+0.004
Neg logit contributions
|-0.004
 jog-0.004
 walks-0.004
 picnic-0.004
 walk-0.004
Token has
Token ID468
Feature activation-4.02e-03

Pos logit contributions
 grandmother+0.109
 gen+0.101
 lamp+0.100
 parent+0.098
ware+0.096
Neg logit contributions
|-0.085
des-0.072
 stroll-0.063
war-0.061
ashing-0.061
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
|+0.005
 walks+0.005
 time+0.004
 jog+0.004
 trips+0.004
Neg logit contributions
 grandmother-0.009
omm-0.008
aby-0.008
 gen-0.008
 parent-0.008
Token hole
Token ID7604
Feature activation+2.01e-02

Pos logit contributions
 grandmother+0.021
illian+0.020
omm+0.020
 Ev+0.020
rays+0.019
Neg logit contributions
|-0.015
 picnic-0.015
 jog-0.015
 race-0.014
 time-0.014
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
 grandmother+0.046
 gen+0.042
 lamp+0.042
 parent+0.041
ware+0.041
Neg logit contributions
|-0.028
des-0.021
 jog-0.019
 stroll-0.019
go-0.018
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.011
 gen+0.010
 parent+0.010
 grandfather+0.010
aby+0.010
Neg logit contributions
|-0.006
des-0.004
 jog-0.004
 walks-0.004
go-0.004
Token toys
Token ID14958
Feature activation+5.75e-02

Pos logit contributions
 grandmother+0.304
 gen+0.267
 parent+0.263
 grandfather+0.260
omm+0.256
Neg logit contributions
|-0.309
 jog-0.251
des-0.251
 walks-0.244
 stroll-0.243
Token fall
Token ID2121
Feature activation-2.15e-02

Pos logit contributions
 grandmother+0.122
 parent+0.111
 lamp+0.111
 gen+0.111
 grandfather+0.108
Neg logit contributions
|-0.086
des-0.070
go-0.062
ited-0.060
 stroll-0.059
Token out
Token ID503
Feature activation+4.58e-02

Pos logit contributions
|+0.029
des+0.023
 jog+0.020
go+0.020
 stroll+0.020
Neg logit contributions
 grandmother-0.049
 parent-0.045
 lamp-0.045
 gen-0.045
 grandfather-0.044
Token of
Token ID286
Feature activation+6.33e-02

Pos logit contributions
 grandmother+0.088
 gen+0.078
 parent+0.077
 grandfather+0.076
omm+0.074
Neg logit contributions
|-0.079
des-0.063
 jog-0.062
go-0.061
 stroll-0.060
Token the
Token ID262
Feature activation-5.39e-03

Pos logit contributions
 grandmother+0.162
 gen+0.151
omm+0.151
 parent+0.149
 grandfather+0.148
Neg logit contributions
|-0.059
 walks-0.043
 time-0.040
 jog-0.037
 walk-0.037
Token room
Token ID2119
Feature activation+3.95e-02

Pos logit contributions
 grandmother+0.021
 gen+0.018
 grandfather+0.018
 parent+0.018
omm+0.018
Neg logit contributions
|-0.024
 jog-0.019
des-0.018
 walks-0.018
go-0.018
Token was
Token ID373
Feature activation+7.92e-03

Pos logit contributions
 grandmother+0.083
 gen+0.075
 parent+0.074
 lamp+0.073
 grandfather+0.073
Neg logit contributions
|-0.061
des-0.048
 jog-0.046
go-0.044
 stroll-0.044
Token a
Token ID257
Feature activation+1.35e-02

Pos logit contributions
 grandmother+0.018
 parent+0.016
 grandfather+0.016
 gen+0.016
ware+0.016
Neg logit contributions
|-0.010
 jog-0.008
 walks-0.007
des-0.007
 stroll-0.007
Token mess
Token ID2085
Feature activation-1.30e-03

Pos logit contributions
 grandmother+0.025
rings+0.025
 Ev+0.024
illian+0.024
omm+0.024
Neg logit contributions
|-0.020
 jog-0.018
 picnic-0.017
 walk-0.016
-0.016
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
|+0.002
des+0.002
 jog+0.002
 stroll+0.001
go+0.001
Neg logit contributions
 grandmother-0.003
 gen-0.002
 parent-0.002
 lamp-0.002
 grandfather-0.002
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.015
 gen+0.014
 parent+0.014
 grandfather+0.013
 lamp+0.013
Neg logit contributions
|-0.008
 jog-0.005
des-0.005
 walks-0.005
 stroll-0.005
Token of
Token ID286
Feature activation+7.27e-02

Pos logit contributions
 grandmother+0.312
 gen+0.279
 grandfather+0.275
aby+0.271
 parent+0.271
Neg logit contributions
|-0.290
 jog-0.242
des-0.236
 walks-0.233
go-0.230
Token their
Token ID511
Feature activation+1.30e-02

Pos logit contributions
 grandmother+0.208
omm+0.194
 grandfather+0.194
 gen+0.194
 wizard+0.188
Neg logit contributions
|-0.047
 jog-0.026
 walks-0.025
 play-0.024
 swim-0.024
Token toys
Token ID14958
Feature activation+4.97e-02

Pos logit contributions
 grandmother+0.020
 gen+0.019
omm+0.017
aby+0.017
illian+0.017
Neg logit contributions
|-0.026
 jog-0.022
 walks-0.021
des-0.021
 stroll-0.021
Token were
Token ID547
Feature activation+6.72e-03

Pos logit contributions
 grandmother+0.103
 gen+0.093
 lamp+0.093
 parent+0.092
ware+0.090
Neg logit contributions
|-0.078
des-0.063
go-0.057
war-0.055
 stroll-0.055
Token broken
Token ID5445
Feature activation+2.81e-02

Pos logit contributions
 grandmother+0.014
 gen+0.012
 parent+0.012
 grandfather+0.012
 lamp+0.012
Neg logit contributions
|-0.011
des-0.008
 jog-0.008
go-0.008
 stroll-0.008
Token rocks
Token ID12586
Feature activation+7.81e-04

Pos logit contributions
|+0.004
des+0.003
 jog+0.003
 stroll+0.003
go+0.003
Neg logit contributions
 grandmother-0.004
 gen-0.004
 parent-0.004
 grandfather-0.004
 lamp-0.004
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 grandmother+0.001
 gen+0.001
 lamp+0.001
 parent+0.001
ware+0.001
Neg logit contributions
|-0.001
des-0.001
-0.001
 jog-0.001
war-0.001
Token the
Token ID262
Feature activation+6.25e-03

Pos logit contributions
|+0.030
 jog+0.024
 walks+0.024
 walk+0.023
 stroll+0.022
Neg logit contributions
 grandmother-0.060
omm-0.056
 gen-0.054
illian-0.053
 parent-0.053
Token flowers
Token ID12734
Feature activation+4.65e-02

Pos logit contributions
 grandmother+0.011
 gen+0.010
 parent+0.010
omm+0.010
 grandfather+0.010
Neg logit contributions
|-0.011
 jog-0.009
des-0.009
 stroll-0.009
 walks-0.009
Token.
Token ID13
Feature activation+2.64e-03

Pos logit contributions
 grandmother+0.091
 gen+0.083
 lamp+0.082
 parent+0.082
ware+0.080
Neg logit contributions
|-0.078
des-0.061
go-0.056
 jog-0.055
-0.054
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.007
 gen+0.006
 parent+0.006
 lamp+0.006
 grandfather+0.006
Neg logit contributions
|-0.003
 jog-0.002
des-0.002
 stroll-0.002
 walks-0.002
Token were
Token ID547
Feature activation+2.21e-02

Pos logit contributions
 grandmother+0.355
 gen+0.319
 parent+0.317
 grandfather+0.310
 lamp+0.307
Neg logit contributions
|-0.260
des-0.203
 jog-0.198
go-0.192
 stroll-0.191
Token smooth
Token ID7209
Feature activation-1.57e-02

Pos logit contributions
 grandmother+0.049
 gen+0.045
omm+0.044
 parent+0.044
 grandfather+0.043
Neg logit contributions
|-0.027
 jog-0.024
 walks-0.023
 stroll-0.022
 walk-0.022
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
|+0.022
des+0.017
 jog+0.016
go+0.015
 stroll+0.015
Neg logit contributions
 grandmother-0.035
 gen-0.032
 parent-0.032
 grandfather-0.031
 lamp-0.031
Token some
Token ID617
Feature activation+4.49e-02

Pos logit contributions
|+0.037
 jog+0.032
 walks+0.032
 walk+0.030
+0.029
Neg logit contributions
 grandmother-0.059
aby-0.052
omm-0.052
 parent-0.051
 grandfather-0.051
Token were
Token ID547
Feature activation+1.65e-02

Pos logit contributions
 grandmother+0.112
 parent+0.101
 grandfather+0.101
 gen+0.101
omm+0.100
Neg logit contributions
|-0.051
 jog-0.037
des-0.036
 walks-0.035
 stroll-0.035
Token shook
Token ID14682
Feature activation+2.19e-02

Pos logit contributions
 grandmother+0.030
 gen+0.027
 parent+0.026
 lamp+0.026
 grandfather+0.026
Neg logit contributions
|-0.023
des-0.018
 jog-0.018
 stroll-0.017
go-0.017
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
 grandmother+0.049
 gen+0.044
 parent+0.044
 grandfather+0.043
 lamp+0.043
Neg logit contributions
|-0.031
des-0.024
 jog-0.023
go-0.022
 stroll-0.022
Token tremb
Token ID29868
Feature activation-2.55e-03

Pos logit contributions
|+0.014
 jog+0.011
 walks+0.010
des+0.010
 stroll+0.010
Neg logit contributions
 grandmother-0.022
 parent-0.020
 gen-0.020
omm-0.020
 grandfather-0.020
Tokenled
Token ID992
Feature activation-1.31e-02

Pos logit contributions
|+0.002
des+0.002
 jog+0.002
 stroll+0.001
go+0.001
Neg logit contributions
 grandmother-0.007
 gen-0.006
 parent-0.006
 grandfather-0.006
 lamp-0.006
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
|+0.021
des+0.016
go+0.015
 jog+0.015
 stroll+0.014
Neg logit contributions
 grandmother-0.027
 gen-0.025
 parent-0.025
 lamp-0.024
 grandfather-0.024
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.029
 gen+0.027
 parent+0.027
ware+0.026
 lamp+0.026
Neg logit contributions
|-0.017
 walks-0.012
 jog-0.012
 stroll-0.012
des-0.012
Token of
Token ID286
Feature activation+7.67e-02

Pos logit contributions
 grandmother+0.319
omm+0.284
aby+0.283
 gen+0.280
 grandfather+0.280
Neg logit contributions
|-0.281
 walks-0.240
 jog-0.232
go-0.231
des-0.225
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 grandmother+0.224
omm+0.213
 gen+0.210
 grandfather+0.206
 parent+0.203
Neg logit contributions
|-0.049
 walks-0.028
 time-0.022
 jog-0.021
 us-0.019
Token books
Token ID3835
Feature activation+4.39e-02

Pos logit contributions
 grandmother+0.018
omm+0.018
aby+0.018
 Ang+0.017
illian+0.017
Neg logit contributions
|-0.020
 walks-0.019
 walk-0.018
 jog-0.018
 time-0.017
Token and
Token ID290
Feature activation-2.21e-02

Pos logit contributions
 grandmother+0.098
 gen+0.091
 parent+0.088
 lamp+0.088
ware+0.086
Neg logit contributions
|-0.061
des-0.047
 jog-0.044
 stroll-0.042
go-0.042
Token toys
Token ID14958
Feature activation+4.80e-02

Pos logit contributions
|+0.027
 walks+0.025
 jog+0.023
 walk+0.023
 turns+0.022
Neg logit contributions
 grandmother-0.049
omm-0.048
aby-0.047
 gen-0.044
illian-0.044
Token,
Token ID11
Feature activation+3.78e-02

Pos logit contributions
|+0.042
 jog+0.032
des+0.032
 stroll+0.031
 walks+0.030
Neg logit contributions
 grandmother-0.082
 parent-0.075
 gen-0.075
 lamp-0.073
 grandfather-0.073
Token something
Token ID1223
Feature activation+7.42e-02

Pos logit contributions
 grandmother+0.084
omm+0.077
 gen+0.077
 parent+0.076
aby+0.075
Neg logit contributions
|-0.050
 jog-0.039
des-0.037
 stroll-0.037
 walks-0.036
Token bad
Token ID2089
Feature activation-1.42e-02

Pos logit contributions
 grandmother+0.129
 gen+0.123
 mother+0.116
 parent+0.116
 wizard+0.116
Neg logit contributions
|-0.144
des-0.110
-0.106
go-0.102
 Away-0.101
Token happened
Token ID3022
Feature activation-4.81e-02

Pos logit contributions
|+0.022
des+0.017
go+0.016
 jog+0.015
 stroll+0.015
Neg logit contributions
 grandmother-0.031
 gen-0.028
 parent-0.028
 grandfather-0.027
 lamp-0.027
Token.
Token ID13
Feature activation+6.89e-03

Pos logit contributions
|+0.089
des+0.070
go+0.068
 jog+0.065
war+0.065
Neg logit contributions
 grandmother-0.091
 grandfather-0.079
 parent-0.079
 gen-0.079
 wizard-0.077
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.015
 gen+0.014
 parent+0.013
 lamp+0.013
 grandfather+0.013
Neg logit contributions
|-0.010
des-0.008
 jog-0.007
 stroll-0.007
go-0.007
Token of
Token ID286
Feature activation+7.16e-02

Pos logit contributions
 grandmother+0.298
 gen+0.263
 parent+0.260
 grandfather+0.254
 lamp+0.252
Neg logit contributions
|-0.316
des-0.258
 jog-0.252
go-0.246
 stroll-0.246
Token the
Token ID262
Feature activation+8.94e-03

Pos logit contributions
 grandmother+0.192
omm+0.178
 gen+0.176
 grandfather+0.174
 parent+0.172
Neg logit contributions
|-0.065
 jog-0.042
 walks-0.042
 stroll-0.040
 time-0.038
Token plants
Token ID6134
Feature activation+6.36e-02

Pos logit contributions
 grandmother+0.015
 gen+0.014
omm+0.013
 parent+0.013
 grandfather+0.013
Neg logit contributions
|-0.017
 jog-0.014
 stroll-0.014
des-0.014
 walks-0.014
Token started
Token ID2067
Feature activation+4.26e-02

Pos logit contributions
 grandmother+0.131
 gen+0.119
 parent+0.119
 lamp+0.116
 grandfather+0.114
Neg logit contributions
|-0.100
des-0.080
go-0.074
 jog-0.070
Yo-0.070
Token to
Token ID284
Feature activation-2.20e-03

Pos logit contributions
 grandmother+0.089
 gen+0.080
 parent+0.080
 grandfather+0.078
 lamp+0.078
Neg logit contributions
|-0.066
des-0.052
 jog-0.049
go-0.048
 stroll-0.048
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
|+0.060
des+0.047
 jog+0.045
go+0.044
 stroll+0.043
Neg logit contributions
 grandmother-0.084
 gen-0.077
 parent-0.075
 lamp-0.074
 grandfather-0.073
Token breaks
Token ID9457
Feature activation+6.06e-02

Pos logit contributions
|+0.034
des+0.027
go+0.024
 jog+0.024
 stroll+0.024
Neg logit contributions
 grandmother-0.053
 gen-0.049
 parent-0.048
 lamp-0.048
 grandfather-0.047
Token into
Token ID656
Feature activation+3.09e-02

Pos logit contributions
 grandmother+0.147
 parent+0.133
 grandfather+0.131
 gen+0.131
omm+0.126
Neg logit contributions
|-0.075
 walks-0.051
des-0.051
 jog-0.050
 stroll-0.049
Token pieces
Token ID5207
Feature activation+2.86e-02

Pos logit contributions
 grandmother+0.066
omm+0.060
 grandfather+0.059
rings+0.058
 gen+0.057
Neg logit contributions
|-0.045
 walks-0.037
 trips-0.034
 turns-0.033
 time-0.033
Token.
Token ID13
Feature activation+5.54e-03

Pos logit contributions
 grandmother+0.062
 gen+0.056
 parent+0.056
 lamp+0.054
 grandfather+0.054
Neg logit contributions
|-0.042
des-0.033
 jog-0.031
go-0.030
 stroll-0.030
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.013
 parent+0.012
 gen+0.012
 grandfather+0.012
ware+0.012
Neg logit contributions
|-0.007
-0.004
 walks-0.004
des-0.004
 jog-0.004
Token of
Token ID286
Feature activation+7.13e-02

Pos logit contributions
 grandmother+0.315
 parent+0.273
 gen+0.270
 grandfather+0.269
aby+0.268
Neg logit contributions
|-0.299
des-0.244
 jog-0.243
 walks-0.239
go-0.237
Token the
Token ID262
Feature activation+6.25e-03

Pos logit contributions
 grandmother+0.207
omm+0.191
 grandfather+0.188
 gen+0.186
 parent+0.184
Neg logit contributions
|-0.049
 walks-0.031
 jog-0.026
 play-0.025
 time-0.025
Token coins
Token ID10796
Feature activation+5.04e-02

Pos logit contributions
 grandmother+0.011
aby+0.010
omm+0.009
illian+0.009
 Ang+0.009
Neg logit contributions
|-0.011
 jog-0.010
 walks-0.010
 picnic-0.009
 walk-0.009
Token roll
Token ID4836
Feature activation-3.43e-02

Pos logit contributions
 grandmother+0.109
 gen+0.103
 parent+0.100
 lamp+0.100
ware+0.098
Neg logit contributions
|-0.073
des-0.056
go-0.050
 stroll-0.050
 jog-0.050
Token out
Token ID503
Feature activation+5.05e-02

Pos logit contributions
|+0.054
des+0.046
 stroll+0.041
 jog+0.041
go+0.039
Neg logit contributions
 grandmother-0.067
 gen-0.063
 lamp-0.063
ware-0.062
 parent-0.062
Token is
Token ID318
Feature activation+1.51e-02

Pos logit contributions
 grandmother+0.051
 gen+0.047
 parent+0.046
 lamp+0.046
ware+0.045
Neg logit contributions
|-0.034
des-0.027
 stroll-0.024
 jog-0.024
go-0.024
Token high
Token ID1029
Feature activation-7.23e-02

Pos logit contributions
 grandmother+0.032
 gen+0.029
 parent+0.028
 grandfather+0.028
ware+0.028
Neg logit contributions
|-0.022
 jog-0.017
 walks-0.016
des-0.016
go-0.016
Token and
Token ID290
Feature activation-2.74e-02

Pos logit contributions
|+0.108
des+0.084
 jog+0.079
 stroll+0.077
go+0.077
Neg logit contributions
 grandmother-0.157
 gen-0.142
 parent-0.141
 lamp-0.139
 grandfather-0.138
Token fast
Token ID3049
Feature activation-3.73e-02

Pos logit contributions
|+0.043
des+0.034
 jog+0.032
go+0.032
 stroll+0.031
Neg logit contributions
 grandmother-0.057
 gen-0.052
 parent-0.051
 lamp-0.050
 grandfather-0.050
Token.
Token ID13
Feature activation+8.95e-04

Pos logit contributions
|+0.062
des+0.049
go+0.045
 jog+0.044
 stroll+0.044
Neg logit contributions
 grandmother-0.076
 parent-0.068
 gen-0.068
 lamp-0.068
 grandfather-0.066
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.002
 gen+0.002
 parent+0.002
 grandfather+0.002
 lamp+0.002
Neg logit contributions
|-0.001
 jog-0.001
des-0.001
 stroll-0.001
 walks-0.001
Token people
Token ID661
Feature activation+4.08e-02

Pos logit contributions
 grandmother+0.324
 gen+0.289
 parent+0.286
 grandfather+0.280
 lamp+0.277
Neg logit contributions
|-0.290
des-0.232
 jog-0.228
go-0.222
 stroll-0.220
Token and
Token ID290
Feature activation-2.98e-02

Pos logit contributions
 grandmother+0.089
 gen+0.081
 parent+0.081
 lamp+0.080
 grandfather+0.079
Neg logit contributions
|-0.058
des-0.046
go-0.040
 stroll-0.040
 jog-0.040
Token cars
Token ID5006
Feature activation+1.28e-02

Pos logit contributions
|+0.047
 jog+0.043
 walks+0.042
 walk+0.041
 stroll+0.039
Neg logit contributions
 grandmother-0.057
omm-0.054
aby-0.053
 gen-0.051
illian-0.050
Token are
Token ID389
Feature activation+1.40e-02

Pos logit contributions
 grandmother+0.025
 lamp+0.024
 parent+0.023
rings+0.023
 gen+0.023
Neg logit contributions
|-0.020
des-0.017
ited-0.015
 stroll-0.014
ashing-0.014
Token stuck
Token ID7819
Feature activation-2.02e-02

Pos logit contributions
 grandmother+0.030
 gen+0.027
 parent+0.027
 grandfather+0.026
 lamp+0.026
Neg logit contributions
|-0.021
des-0.016
 jog-0.016
go-0.015
 stroll-0.015
Token was
Token ID373
Feature activation+2.19e-02

Pos logit contributions
 grandmother+0.052
 parent+0.047
 gen+0.047
 grandfather+0.046
 lamp+0.045
Neg logit contributions
|-0.033
des-0.024
 jog-0.024
go-0.023
 walks-0.023
Token torn
Token ID12445
Feature activation+1.89e-02

Pos logit contributions
 grandmother+0.049
ware+0.045
 gen+0.045
 parent+0.044
illian+0.044
Neg logit contributions
|-0.026
 walks-0.021
 time-0.020
 jog-0.019
 trips-0.017
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 grandmother+0.047
 gen+0.043
 parent+0.042
 grandfather+0.042
omm+0.041
Neg logit contributions
|-0.021
des-0.015
 jog-0.014
go-0.014
 walks-0.013
Token ripped
Token ID19551
Feature activation+5.45e-02

Pos logit contributions
|+0.034
 jog+0.026
des+0.026
 walks+0.025
go+0.024
Neg logit contributions
 grandmother-0.047
 grandfather-0.041
 parent-0.041
omm-0.041
 gen-0.041
Token.
Token ID13
Feature activation+6.45e-03

Pos logit contributions
 grandmother+0.135
 gen+0.125
 grandfather+0.123
 parent+0.123
 wizard+0.121
Neg logit contributions
|-0.057
 jog-0.038
 walks-0.037
des-0.036
 around-0.036
Token Some
Token ID2773
Feature activation+1.69e-01

Pos logit contributions
 grandmother+0.015
 gen+0.014
ware+0.014
 wizard+0.014
 grandfather+0.014
Neg logit contributions
|-0.008
-0.005
 jog-0.005
 stroll-0.005
 walks-0.005
Token pages
Token ID5468
Feature activation+9.83e-02

Pos logit contributions
 grandmother+0.329
 grandfather+0.295
omm+0.293
aby+0.291
 gen+0.289
Neg logit contributions
|-0.271
 walks-0.226
 jog-0.220
go-0.214
des-0.213
Token were
Token ID547
Feature activation+1.70e-02

Pos logit contributions
 grandmother+0.224
 gen+0.204
 parent+0.202
 grandfather+0.198
 lamp+0.197
Neg logit contributions
|-0.133
des-0.100
 jog-0.095
go-0.092
 stroll-0.092
Token on
Token ID319
Feature activation-2.79e-02

Pos logit contributions
 grandmother+0.036
 gen+0.033
omm+0.032
 grandfather+0.032
 wizard+0.032
Neg logit contributions
|-0.020
 walks-0.018
 around-0.017
 time-0.017
 jog-0.016
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
|+0.030
 walks+0.023
 time+0.021
 jog+0.020
 walk+0.020
Neg logit contributions
 grandmother-0.068
 grandfather-0.063
 gen-0.062
omm-0.062
 parent-0.062
Token floor
Token ID4314
Feature activation+3.51e-02

Pos logit contributions
|+0.034
 walks+0.029
 jog+0.028
 walk+0.028
+0.028
Neg logit contributions
 grandmother-0.023
illian-0.021
omm-0.021
 gen-0.020
 grandfather-0.020
Token colors
Token ID7577
Feature activation+6.34e-02

Pos logit contributions
 grandmother+0.037
omm+0.034
illian+0.033
aby+0.033
 gen+0.032
Neg logit contributions
|-0.021
 walks-0.019
 jog-0.018
 time-0.017
 swim-0.017
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
 grandmother+0.170
 gen+0.154
 grandfather+0.153
 parent+0.152
omm+0.150
Neg logit contributions
|-0.054
 jog-0.039
 walks-0.038
hed-0.037
 stroll-0.036
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
 grandmother+0.048
omm+0.043
illian+0.042
 gen+0.042
 grandfather+0.042
Neg logit contributions
|-0.028
 walks-0.025
 jog-0.024
 stroll-0.022
 swim-0.022
Token hardness
Token ID49604
Feature activation+5.72e-02

Pos logit contributions
 walks+0.033
|+0.031
 jog+0.030
 play+0.029
 turns+0.029
Neg logit contributions
 grandmother-0.040
omm-0.038
aby-0.037
illian-0.036
 gen-0.034
Token.
Token ID13
Feature activation+9.40e-03

Pos logit contributions
 grandmother+0.134
 gen+0.122
 parent+0.121
 grandfather+0.119
omm+0.117
Neg logit contributions
|-0.072
des-0.053
 jog-0.053
go-0.050
 walks-0.050
Token Some
Token ID2773
Feature activation+1.68e-01

Pos logit contributions
 grandmother+0.022
 gen+0.022
 parent+0.021
ware+0.021
 lamp+0.021
Neg logit contributions
|-0.011
 jog-0.007
-0.007
-0.007
des-0.007
Token minerals
Token ID21622
Feature activation+2.33e-02

Pos logit contributions
 grandmother+0.356
 gen+0.321
 parent+0.319
 grandfather+0.312
 lamp+0.310
Neg logit contributions
|-0.257
des-0.200
 jog-0.193
go-0.187
 stroll-0.186
Token are
Token ID389
Feature activation+2.45e-02

Pos logit contributions
 grandmother+0.048
 gen+0.045
ware+0.044
 lamp+0.044
 parent+0.044
Neg logit contributions
|-0.036
des-0.029
go-0.026
 stroll-0.025
ited-0.024
Token very
Token ID845
Feature activation-2.50e-02

Pos logit contributions
 grandmother+0.054
 gen+0.049
omm+0.049
aby+0.047
 grandfather+0.047
Neg logit contributions
|-0.030
 jog-0.026
 walks-0.025
 time-0.024
 walk-0.024
Token rare
Token ID4071
Feature activation+1.93e-02

Pos logit contributions
|+0.037
 jog+0.029
des+0.029
 stroll+0.027
 walks+0.027
Neg logit contributions
 grandmother-0.054
 gen-0.048
 parent-0.047
omm-0.047
 grandfather-0.047
Token and
Token ID290
Feature activation-3.63e-02

Pos logit contributions
 grandmother+0.049
 gen+0.045
 parent+0.044
omm+0.044
illian+0.044
Neg logit contributions
|-0.019
 jog-0.014
 walks-0.014
 stroll-0.013
des-0.013
Token She
Token ID1375
Feature activation+2.40e-02

Pos logit contributions
|+0.062
des+0.054
 jog+0.051
go+0.051
 stroll+0.050
Neg logit contributions
 grandmother-0.050
 parent-0.044
 lamp-0.044
 gen-0.043
 grandfather-0.041
Token loved
Token ID6151
Feature activation+2.90e-03

Pos logit contributions
 grandmother+0.041
 lamp+0.039
 parent+0.038
 gen+0.037
 fairy+0.034
Neg logit contributions
|-0.043
war-0.039
des-0.038
ited-0.037
ashes-0.037
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
 grandmother+0.006
 parent+0.005
 lamp+0.005
 grandfather+0.005
 gen+0.005
Neg logit contributions
|-0.005
des-0.004
ited-0.004
-0.004
war-0.004
Token go
Token ID467
Feature activation-1.35e-01

Pos logit contributions
|+0.030
des+0.021
ashing+0.020
ited+0.020
hed+0.020
Neg logit contributions
 grandmother-0.054
 lamp-0.050
 parent-0.050
 gen-0.048
 mother-0.047
Token on
Token ID319
Feature activation-5.06e-02

Pos logit contributions
|+0.216
ited+0.180
war+0.164
des+0.163
hed+0.161
Neg logit contributions
 grandmother-0.270
 parent-0.244
 lamp-0.242
 grandfather-0.232
 Ang-0.226
Token pic
Token ID8301
Feature activation-1.84e-01

Pos logit contributions
|+0.074
ashing+0.066
go+0.063
ashes+0.062
ited+0.061
Neg logit contributions
 grandmother-0.111
 lamp-0.094
 parent-0.094
 grandfather-0.093
 mother-0.091
Tokenn
Token ID77
Feature activation-4.81e-02

Pos logit contributions
|+0.268
des+0.214
ited+0.212
ashing+0.208
hed+0.205
Neg logit contributions
 grandmother-0.381
aby-0.350
 parent-0.339
 gen-0.328
 lamp-0.324
Tokenics
Token ID873
Feature activation-1.02e-01

Pos logit contributions
|+0.063
des+0.049
 jog+0.045
ited+0.045
 stroll+0.044
Neg logit contributions
 grandmother-0.109
 parent-0.099
aby-0.097
 lamp-0.097
 gen-0.096
Token with
Token ID351
Feature activation+1.68e-02

Pos logit contributions
|+0.222
war+0.198
des+0.194
ited+0.194
venture+0.190
Neg logit contributions
 grandmother-0.137
 lamp-0.110
 parent-0.106
ware-0.099
 mother-0.099
Token her
Token ID607
Feature activation+7.61e-03

Pos logit contributions
 grandmother+0.030
 gen+0.027
 parent+0.027
 grandfather+0.026
 lamp+0.026
Neg logit contributions
|-0.031
des-0.025
 jog-0.025
go-0.024
 stroll-0.024
Token family
Token ID1641
Feature activation-1.43e-02

Pos logit contributions
 grandmother+0.008
 parent+0.007
 gen+0.006
 grandfather+0.006
 lamp+0.006
Neg logit contributions
|-0.020
des-0.018
go-0.017
 jog-0.017
 stroll-0.017
Token Tim
Token ID5045
Feature activation-3.65e-02

Pos logit contributions
|+0.060
des+0.052
 jog+0.050
ited+0.049
 walks+0.049
Neg logit contributions
 grandmother-0.055
 lamp-0.053
omm-0.049
 gen-0.049
 parent-0.049
Tokenmy
Token ID1820
Feature activation+4.64e-02

Pos logit contributions
|+0.084
 stroll+0.079
 jog+0.077
 dives+0.077
cha+0.076
Neg logit contributions
 lamp-0.041
 grandmother-0.040
 parent-0.036
 gen-0.032
 grandfather-0.032
Token loved
Token ID6151
Feature activation+3.86e-03

Pos logit contributions
 grandmother+0.091
 lamp+0.084
 parent+0.084
 gen+0.080
 grandfather+0.078
Neg logit contributions
|-0.075
des-0.063
war-0.063
ited-0.062
 stroll-0.061
Token going
Token ID1016
Feature activation-1.46e-01

Pos logit contributions
 grandmother+0.008
 parent+0.007
 lamp+0.007
 gen+0.007
 grandfather+0.007
Neg logit contributions
|-0.007
des-0.005
ited-0.005
go-0.005
-0.005
Token on
Token ID319
Feature activation-5.50e-02

Pos logit contributions
|+0.258
ited+0.202
des+0.198
war+0.186
acks+0.185
Neg logit contributions
 grandmother-0.285
 parent-0.250
 grandfather-0.248
 lamp-0.247
 gen-0.236
Token pic
Token ID8301
Feature activation-1.83e-01

Pos logit contributions
|+0.083
ashing+0.076
ashes+0.072
ited+0.071
rated+0.070
Neg logit contributions
 grandmother-0.116
 lamp-0.099
 grandfather-0.099
 dentist-0.097
 parent-0.097
Tokenn
Token ID77
Feature activation-4.87e-02

Pos logit contributions
|+0.279
ashing+0.229
des+0.228
ited+0.228
eal+0.221
Neg logit contributions
 grandmother-0.347
aby-0.317
 gen-0.303
 parent-0.301
 lamp-0.295
Tokenics
Token ID873
Feature activation-9.94e-02

Pos logit contributions
|+0.066
des+0.051
 stroll+0.047
 jog+0.047
war+0.046
Neg logit contributions
 grandmother-0.108
 parent-0.097
 gen-0.096
 lamp-0.096
aby-0.094
Token with
Token ID351
Feature activation+1.85e-02

Pos logit contributions
|+0.222
war+0.193
ited+0.192
des+0.191
 AW+0.189
Neg logit contributions
 grandmother-0.120
 lamp-0.099
ware-0.097
 gen-0.094
 grandfather-0.091
Token his
Token ID465
Feature activation-7.23e-04

Pos logit contributions
 grandmother+0.037
 gen+0.033
 parent+0.033
 grandfather+0.032
 lamp+0.032
Neg logit contributions
|-0.031
des-0.024
 jog-0.024
go-0.023
 stroll-0.023
Token family
Token ID1641
Feature activation-1.81e-02

Pos logit contributions
|+0.002
des+0.002
war+0.002
go+0.002
ited+0.002
Neg logit contributions
 grandmother-0.001
 parent-0.001
 mother-0.001
 grandfather-0.001
 lamp-0.001
Token family
Token ID1641
Feature activation+8.73e-03

Pos logit contributions
 grandmother+0.055
 gen+0.046
 parent+0.044
 grandfather+0.043
 lamp+0.042
Neg logit contributions
|-0.115
des-0.099
 jog-0.098
 stroll-0.096
go-0.096
Token were
Token ID547
Feature activation-4.64e-03

Pos logit contributions
 grandmother+0.020
 parent+0.018
 gen+0.018
 grandfather+0.018
 mother+0.017
Neg logit contributions
|-0.012
des-0.009
war-0.009
acks-0.008
go-0.008
Token ready
Token ID3492
Feature activation-4.60e-02

Pos logit contributions
|+0.008
des+0.006
war+0.006
 jog+0.006
+0.006
Neg logit contributions
 grandmother-0.010
 parent-0.009
 gen-0.008
 mother-0.008
 grandfather-0.008
Token to
Token ID284
Feature activation-2.64e-02

Pos logit contributions
|+0.090
des+0.067
go+0.066
Yo+0.064
war+0.064
Neg logit contributions
 grandmother-0.085
 parent-0.073
 grandfather-0.072
 mother-0.071
 gen-0.070
Token go
Token ID467
Feature activation-1.34e-01

Pos logit contributions
|+0.032
ashing+0.024
des+0.024
go+0.023
ited+0.021
Neg logit contributions
 grandmother-0.064
 parent-0.059
 gen-0.059
 lamp-0.058
 mother-0.057
Token pic
Token ID8301
Feature activation-1.83e-01

Pos logit contributions
|+0.222
ited+0.196
war+0.180
rated+0.178
des+0.173
Neg logit contributions
 grandmother-0.254
 parent-0.224
 lamp-0.216
 grandfather-0.212
 wizard-0.208
Tokenn
Token ID77
Feature activation-4.81e-02

Pos logit contributions
|+0.273
des+0.225
ashing+0.223
ited+0.218
go+0.215
Neg logit contributions
 grandmother-0.360
aby-0.330
 parent-0.313
 gen-0.311
 mother-0.297
Tokenicking
Token ID7958
Feature activation+2.12e-02

Pos logit contributions
|+0.067
des+0.056
ited+0.053
war+0.051
 jog+0.050
Neg logit contributions
 grandmother-0.103
 parent-0.091
aby-0.090
 grandfather-0.087
omm-0.087
Token again
Token ID757
Feature activation-2.18e-02

Pos logit contributions
 grandmother+0.039
 lamp+0.034
 gen+0.033
 parent+0.033
ware+0.033
Neg logit contributions
|-0.037
des-0.031
go-0.031
ited-0.030
ashing-0.028
Token.
Token ID13
Feature activation-1.36e-03

Pos logit contributions
|+0.036
des+0.029
go+0.028
Yo+0.026
war+0.026
Neg logit contributions
 grandmother-0.044
 parent-0.039
 grandfather-0.038
 mother-0.037
 gen-0.037
Token But
Token ID887
Feature activation+9.73e-02

Pos logit contributions
|+0.002
des+0.002
 jog+0.002
go+0.002
 walks+0.002
Neg logit contributions
 grandmother-0.003
 parent-0.003
 gen-0.003
 grandfather-0.002
 lamp-0.002
Token and
Token ID290
Feature activation-4.13e-02

Pos logit contributions
|+0.041
des+0.031
 jog+0.030
go+0.029
 stroll+0.029
Neg logit contributions
 grandmother-0.061
 gen-0.056
 parent-0.055
 grandfather-0.054
 lamp-0.054
Token it
Token ID340
Feature activation-2.84e-02

Pos logit contributions
|+0.053
 jog+0.038
des+0.038
 stroll+0.036
 walks+0.035
Neg logit contributions
 grandmother-0.097
 gen-0.089
 parent-0.088
 grandfather-0.086
 lamp-0.086
Token was
Token ID373
Feature activation-4.59e-03

Pos logit contributions
|+0.047
des+0.041
 stroll+0.037
go+0.036
 jog+0.036
Neg logit contributions
 grandmother-0.055
 parent-0.050
 gen-0.050
 lamp-0.049
 grandfather-0.047
Token going
Token ID1016
Feature activation-1.51e-01

Pos logit contributions
|+0.007
des+0.005
 jog+0.005
go+0.005
 stroll+0.005
Neg logit contributions
 grandmother-0.010
 gen-0.009
 parent-0.009
 grandfather-0.009
 lamp-0.009
Token on
Token ID319
Feature activation-5.52e-02

Pos logit contributions
|+0.282
des+0.257
ited+0.243
text+0.228
eal+0.228
Neg logit contributions
 grandmother-0.251
 parent-0.224
 grandfather-0.218
rings-0.217
 lamp-0.213
Token pic
Token ID8301
Feature activation-1.83e-01

Pos logit contributions
|+0.103
des+0.096
go+0.092
eal+0.089
ited+0.088
Neg logit contributions
 grandmother-0.094
 parent-0.078
 lamp-0.078
 grandfather-0.078
 gen-0.077
Tokenn
Token ID77
Feature activation-4.83e-02

Pos logit contributions
|+0.272
des+0.254
ited+0.246
go+0.237
ashing+0.236
Neg logit contributions
 grandmother-0.313
aby-0.312
 gen-0.273
 parent-0.272
omm-0.266
Tokenics
Token ID873
Feature activation-1.01e-01

Pos logit contributions
|+0.080
des+0.068
 jog+0.063
ited+0.062
 stroll+0.062
Neg logit contributions
 grandmother-0.093
 parent-0.082
aby-0.081
 grandfather-0.079
 gen-0.078
Token!
Token ID0
Feature activation-3.02e-02

Pos logit contributions
|+0.192
des+0.170
ited+0.163
go+0.158
eal+0.155
Neg logit contributions
 grandmother-0.162
 lamp-0.137
 parent-0.136
 grandfather-0.135
 gen-0.133
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
|+0.048
des+0.040
go+0.037
 jog+0.036
 stroll+0.036
Neg logit contributions
 grandmother-0.061
 parent-0.055
 grandfather-0.054
 gen-0.053
 lamp-0.052
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
|+0.024
des+0.019
 jog+0.018
 stroll+0.017
go+0.017
Neg logit contributions
 grandmother-0.036
 gen-0.032
 parent-0.032
 grandfather-0.032
 lamp-0.031
Token.
Token ID13
Feature activation-2.89e-02

Pos logit contributions
|+0.043
des+0.033
 jog+0.032
go+0.031
 stroll+0.030
Neg logit contributions
 grandmother-0.067
 gen-0.060
 parent-0.060
 grandfather-0.059
 lamp-0.058
Token She
Token ID1375
Feature activation+2.41e-02

Pos logit contributions
|+0.055
des+0.045
 jog+0.044
go+0.043
 stroll+0.043
Neg logit contributions
 grandmother-0.051
 gen-0.045
 parent-0.044
 grandfather-0.043
 lamp-0.043
Token loved
Token ID6151
Feature activation+1.00e-02

Pos logit contributions
 grandmother+0.044
 parent+0.041
 lamp+0.041
 gen+0.039
 mother+0.036
Neg logit contributions
|-0.041
war-0.037
des-0.036
ashes-0.035
ited-0.035
Token to
Token ID284
Feature activation-2.06e-02

Pos logit contributions
 grandmother+0.020
 parent+0.018
 gen+0.018
 lamp+0.018
 grandfather+0.018
Neg logit contributions
|-0.017
des-0.013
go-0.012
 jog-0.012
ited-0.012
Token have
Token ID423
Feature activation-1.58e-02

Pos logit contributions
|+0.025
des+0.017
go+0.015
ashing+0.015
hed+0.015
Neg logit contributions
 grandmother-0.051
 parent-0.047
 lamp-0.047
 gen-0.046
 grandfather-0.045
Token pic
Token ID8301
Feature activation-1.81e-01

Pos logit contributions
|+0.026
rated+0.019
des+0.018
ashing+0.018
go+0.018
Neg logit contributions
 grandmother-0.033
 parent-0.029
 lamp-0.029
 scissors-0.028
 grandfather-0.028
Tokenn
Token ID77
Feature activation-4.33e-02

Pos logit contributions
|+0.262
des+0.205
ashing+0.202
go+0.199
 jog+0.190
Neg logit contributions
 grandmother-0.373
aby-0.339
 gen-0.329
 parent-0.325
omm-0.324
Tokenics
Token ID873
Feature activation-9.36e-02

Pos logit contributions
|+0.056
des+0.042
 jog+0.039
 stroll+0.038
+0.037
Neg logit contributions
 grandmother-0.101
 parent-0.091
 gen-0.090
 lamp-0.089
aby-0.089
Token with
Token ID351
Feature activation+2.05e-02

Pos logit contributions
|+0.175
des+0.142
war+0.140
ited+0.135
venture+0.134
Neg logit contributions
 grandmother-0.165
 lamp-0.141
 gen-0.140
 parent-0.140
 grandfather-0.135
Token her
Token ID607
Feature activation+1.13e-02

Pos logit contributions
 grandmother+0.035
 parent+0.031
 gen+0.031
 lamp+0.030
 grandfather+0.030
Neg logit contributions
|-0.040
des-0.033
go-0.032
 jog-0.031
 stroll-0.031
Token family
Token ID1641
Feature activation-1.72e-02

Pos logit contributions
 grandmother+0.014
 parent+0.012
 gen+0.011
 grandfather+0.011
 lamp+0.011
Neg logit contributions
|-0.028
des-0.025
go-0.024
 jog-0.023
war-0.023
Token a
Token ID257
Feature activation-8.58e-03

Pos logit contributions
|+0.054
des+0.047
war+0.046
ited+0.046
ashing+0.045
Neg logit contributions
 grandmother-0.042
 lamp-0.036
 parent-0.035
 gen-0.034
 Ang-0.033
Token family
Token ID1641
Feature activation-1.88e-02

Pos logit contributions
|+0.016
des+0.013
 jog+0.013
go+0.013
 stroll+0.012
Neg logit contributions
 grandmother-0.015
 gen-0.013
 parent-0.013
 grandfather-0.013
 lamp-0.013
Token who
Token ID508
Feature activation+3.57e-02

Pos logit contributions
|+0.031
des+0.025
 jog+0.025
go+0.024
 stroll+0.023
Neg logit contributions
 grandmother-0.037
 gen-0.033
 parent-0.032
 grandfather-0.032
 lamp-0.032
Token really
Token ID1107
Feature activation-4.99e-02

Pos logit contributions
 grandmother+0.058
 parent+0.055
 lamp+0.053
 gen+0.052
 Ang+0.049
Neg logit contributions
|-0.060
war-0.059
ogging-0.058
des-0.054
ited-0.054
Token enjoyed
Token ID8359
Feature activation+1.33e-02

Pos logit contributions
|+0.075
des+0.062
 stroll+0.058
war+0.057
 walks+0.057
Neg logit contributions
 grandmother-0.099
 parent-0.091
 gen-0.088
 lamp-0.087
 Ev-0.086
Token pic
Token ID8301
Feature activation-1.80e-01

Pos logit contributions
 grandmother+0.027
 parent+0.024
 lamp+0.023
 gen+0.023
 grandfather+0.023
Neg logit contributions
|-0.023
ited-0.019
war-0.019
des-0.018
go-0.018
Tokenn
Token ID77
Feature activation-4.44e-02

Pos logit contributions
|+0.269
ashing+0.226
des+0.221
go+0.218
 jog+0.217
Neg logit contributions
 grandmother-0.339
aby-0.322
 gen-0.306
omm-0.292
 Ang-0.288
Tokenics
Token ID873
Feature activation-9.98e-02

Pos logit contributions
|+0.057
des+0.044
 jog+0.042
 stroll+0.040
+0.039
Neg logit contributions
 grandmother-0.103
 parent-0.094
aby-0.093
 gen-0.091
 lamp-0.091
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
|+0.193
des+0.164
ited+0.158
war+0.157
go+0.151
Neg logit contributions
 grandmother-0.169
 parent-0.146
 lamp-0.144
 gen-0.143
 grandfather-0.139
Token They
Token ID1119
Feature activation-4.18e-02

Pos logit contributions
|+0.039
des+0.030
 jog+0.028
go+0.028
 stroll+0.027
Neg logit contributions
 grandmother-0.067
 gen-0.061
 parent-0.061
 grandfather-0.059
 lamp-0.059
Token loved
Token ID6151
Feature activation+6.42e-03

Pos logit contributions
|+0.080
war+0.078
des+0.077
acks+0.077
ogging+0.071
Neg logit contributions
 parent-0.061
 grandmother-0.061
 queen-0.060
 mother-0.056
 Queen-0.055
Token girl
Token ID2576
Feature activation+6.40e-03

Pos logit contributions
|+0.024
 jog+0.020
des+0.020
go+0.019
 stroll+0.019
Neg logit contributions
 grandmother-0.022
 gen-0.019
 parent-0.019
 grandfather-0.018
aby-0.018
Token named
Token ID3706
Feature activation+1.02e-02

Pos logit contributions
 grandmother+0.014
ware+0.013
 gen+0.013
omm+0.013
 grandfather+0.012
Neg logit contributions
|-0.009
 jog-0.007
des-0.007
go-0.006
 walk-0.006
Token Lily
Token ID20037
Feature activation-3.10e-02

Pos logit contributions
 grandmother+0.019
 gen+0.017
 parent+0.017
 grandfather+0.017
ware+0.016
Neg logit contributions
|-0.019
des-0.015
 jog-0.014
go-0.014
 stroll-0.014
Token who
Token ID508
Feature activation+3.80e-02

Pos logit contributions
|+0.045
des+0.035
 jog+0.033
go+0.032
 stroll+0.032
Neg logit contributions
 grandmother-0.068
 gen-0.062
 parent-0.061
 grandfather-0.060
 lamp-0.060
Token loved
Token ID6151
Feature activation-1.72e-04

Pos logit contributions
 grandmother+0.072
 lamp+0.067
 parent+0.067
 gen+0.064
 mother+0.060
Neg logit contributions
|-0.063
ashes-0.052
war-0.051
 dives-0.051
 stroll-0.051
Token pic
Token ID8301
Feature activation-1.79e-01

Pos logit contributions
|+0.000
des+0.000
ited+0.000
go+0.000
+0.000
Neg logit contributions
 grandmother-0.000
 parent-0.000
 lamp-0.000
 gen-0.000
 grandfather-0.000
Tokenn
Token ID77
Feature activation-4.72e-02

Pos logit contributions
|+0.263
ashing+0.227
 dives+0.219
ashes+0.218
 jog+0.215
Neg logit contributions
 grandmother-0.345
aby-0.326
 gen-0.313
omm-0.302
 lamp-0.294
Tokenics
Token ID873
Feature activation-1.02e-01

Pos logit contributions
|+0.059
des+0.044
 jog+0.041
 stroll+0.040
 walks+0.039
Neg logit contributions
 grandmother-0.112
 parent-0.103
 gen-0.101
 lamp-0.100
aby-0.100
Token.
Token ID13
Feature activation-2.74e-02

Pos logit contributions
|+0.181
des+0.146
war+0.141
 stroll+0.135
venture+0.135
Neg logit contributions
 grandmother-0.193
 parent-0.168
 gen-0.168
 lamp-0.166
 grandfather-0.161
Token She
Token ID1375
Feature activation+2.73e-02

Pos logit contributions
|+0.046
des+0.037
 jog+0.037
 stroll+0.035
go+0.035
Neg logit contributions
 grandmother-0.054
 gen-0.049
 parent-0.048
 grandfather-0.047
 lamp-0.046
Token would
Token ID561
Feature activation-8.34e-03

Pos logit contributions
 grandmother+0.042
 parent+0.037
 mother+0.035
 grandfather+0.033
ware+0.033
Neg logit contributions
|-0.052
war-0.051
des-0.047
ogging-0.045
iling-0.045
Token liked
Token ID8288
Feature activation+2.59e-02

Pos logit contributions
|+0.067
des+0.060
acks+0.054
war+0.053
go+0.052
Neg logit contributions
 grandmother-0.076
 parent-0.076
 mother-0.069
 lamp-0.069
 queen-0.068
Token to
Token ID284
Feature activation-1.73e-02

Pos logit contributions
 grandmother+0.041
 parent+0.037
 lamp+0.036
 grandfather+0.035
 mother+0.035
Neg logit contributions
|-0.055
ited-0.047
des-0.045
-0.044
hed-0.043
Token play
Token ID711
Feature activation-9.08e-02

Pos logit contributions
|+0.016
des+0.011
go+0.010
ited+0.009
hed+0.009
Neg logit contributions
 grandmother-0.046
 parent-0.043
 lamp-0.043
 gen-0.042
 grandfather-0.041
Token and
Token ID290
Feature activation-5.37e-02

Pos logit contributions
|+0.134
des+0.113
ited+0.111
 wanna+0.096
go+0.095
Neg logit contributions
 grandmother-0.190
 lamp-0.182
ware-0.175
 parent-0.175
 gen-0.174
Token have
Token ID423
Feature activation-7.64e-03

Pos logit contributions
|+0.053
 jog+0.034
des+0.033
 stroll+0.031
go+0.030
Neg logit contributions
 grandmother-0.144
 gen-0.134
 parent-0.131
 grandfather-0.130
omm-0.130
Token pic
Token ID8301
Feature activation-1.79e-01

Pos logit contributions
|+0.013
ited+0.011
go+0.010
+0.010
rated+0.010
Neg logit contributions
 grandmother-0.013
 scissors-0.013
 lamp-0.012
 parent-0.012
 queen-0.012
Tokenn
Token ID77
Feature activation-3.56e-02

Pos logit contributions
|+0.269
des+0.217
ashing+0.216
go+0.215
+0.213
Neg logit contributions
 grandmother-0.331
aby-0.313
 gen-0.291
 parent-0.288
omm-0.284
Tokenics
Token ID873
Feature activation-9.41e-02

Pos logit contributions
|+0.048
des+0.037
 jog+0.034
 stroll+0.034
ited+0.033
Neg logit contributions
 grandmother-0.081
 parent-0.074
aby-0.072
 lamp-0.071
 gen-0.071
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
|+0.148
des+0.119
ited+0.109
 stroll+0.108
go+0.107
Neg logit contributions
 grandmother-0.192
 parent-0.170
 gen-0.168
 grandfather-0.167
 lamp-0.166
Token One
Token ID1881
Feature activation+3.59e-02

Pos logit contributions
|+0.025
des+0.021
go+0.019
 jog+0.019
 stroll+0.018
Neg logit contributions
 grandmother-0.033
 parent-0.029
 grandfather-0.029
 lamp-0.028
 gen-0.028
Token day
Token ID1110
Feature activation-3.99e-02

Pos logit contributions
 mother+0.066
 grandmother+0.064
 parent+0.063
 Mother+0.059
 lamp+0.059
Neg logit contributions
|-0.060
cha-0.055
des-0.052
 walks-0.052
Yo-0.051
Token favourite
Token ID12507
Feature activation+1.71e-02

Pos logit contributions
|+0.010
des+0.008
go+0.008
war+0.008
hed+0.008
Neg logit contributions
 grandmother-0.011
 parent-0.010
 grandfather-0.010
 gen-0.010
 mother-0.010
Token place
Token ID1295
Feature activation+4.04e-02

Pos logit contributions
 grandmother+0.034
 gen+0.030
 parent+0.030
 lamp+0.030
 grandfather+0.029
Neg logit contributions
|-0.029
des-0.023
go-0.022
 jog-0.022
war-0.021
Token to
Token ID284
Feature activation-1.44e-02

Pos logit contributions
 grandmother+0.083
 gen+0.074
 parent+0.073
 lamp+0.072
 grandfather+0.072
Neg logit contributions
|-0.065
des-0.051
 jog-0.049
go-0.048
 walks-0.047
Token go
Token ID467
Feature activation-1.28e-01

Pos logit contributions
|+0.018
des+0.013
go+0.013
ashing+0.012
war+0.012
Neg logit contributions
 grandmother-0.034
 gen-0.031
 parent-0.031
 lamp-0.031
 grandfather-0.030
Token for
Token ID329
Feature activation+1.37e-02

Pos logit contributions
|+0.220
war+0.190
go+0.188
ited+0.184
des+0.183
Neg logit contributions
 grandmother-0.238
 parent-0.203
 lamp-0.197
 grandfather-0.196
 wizard-0.188
Token pic
Token ID8301
Feature activation-1.79e-01

Pos logit contributions
 grandmother+0.028
 gen+0.024
 parent+0.024
 grandfather+0.023
 lamp+0.023
Neg logit contributions
|-0.022
go-0.018
des-0.018
ashing-0.017
 jog-0.017
Tokenn
Token ID77
Feature activation-4.26e-02

Pos logit contributions
|+0.271
ashing+0.242
Run+0.237
go+0.236
ashes+0.233
Neg logit contributions
 grandmother-0.314
aby-0.300
 gen-0.260
 parent-0.258
omm-0.247
Tokenics
Token ID873
Feature activation-9.65e-02

Pos logit contributions
|+0.068
des+0.058
ited+0.054
 jog+0.054
war+0.053
Neg logit contributions
 grandmother-0.084
 parent-0.073
aby-0.073
 grandfather-0.069
 lamp-0.069
Token.
Token ID13
Feature activation+1.58e-03

Pos logit contributions
|+0.175
des+0.147
go+0.145
ited+0.141
war+0.141
Neg logit contributions
 grandmother-0.176
 parent-0.149
 grandfather-0.145
 gen-0.142
 mother-0.140
Token<|endoftext|>
Token ID50256
Feature activation-9.15e-02

Pos logit contributions
 grandmother+0.003
 parent+0.003
 gen+0.003
 grandfather+0.003
 lamp+0.003
Neg logit contributions
|-0.002
des-0.002
 jog-0.002
go-0.002
 stroll-0.002
TokenOnce
Token ID7454
Feature activation-3.08e-02

Pos logit contributions
des+0.151
ited+0.147
|+0.146
ashing+0.141
 jog+0.134
Neg logit contributions
 grandmother-0.153
 lamp-0.138
 gen-0.137
 Genie-0.131
aby-0.130
Token near
Token ID1474
Feature activation-2.78e-02

Pos logit contributions
|+0.210
des+0.204
ited+0.199
 wanna+0.177
text+0.177
Neg logit contributions
 grandmother-0.216
 parent-0.187
 grandfather-0.184
 lamp-0.180
aby-0.177
Token the
Token ID262
Feature activation-2.35e-02

Pos logit contributions
|+0.056
des+0.046
ited+0.045
ashing+0.044
go+0.044
Neg logit contributions
 grandmother-0.050
 parent-0.045
 mother-0.044
 lamp-0.043
 grandfather-0.042
Token cliff
Token ID19516
Feature activation-6.36e-02

Pos logit contributions
|+0.047
des+0.040
ited+0.036
war+0.036
hed+0.035
Neg logit contributions
 grandmother-0.040
 parent-0.037
 mother-0.036
 gen-0.036
 lamp-0.035
Token and
Token ID290
Feature activation-3.83e-02

Pos logit contributions
|+0.107
des+0.092
 jog+0.082
 stroll+0.081
Yo+0.081
Neg logit contributions
 grandmother-0.118
 parent-0.105
 lamp-0.105
 gen-0.101
 grandfather-0.101
Token have
Token ID423
Feature activation-5.23e-05

Pos logit contributions
|+0.047
des+0.037
go+0.034
ited+0.034
text+0.033
Neg logit contributions
 grandmother-0.088
 parent-0.081
 lamp-0.079
 grandfather-0.079
 gen-0.078
Token pic
Token ID8301
Feature activation-1.73e-01

Pos logit contributions
|+0.000
ited+0.000
rated+0.000
iling+0.000
+0.000
Neg logit contributions
 grandmother-0.000
 scissors-0.000
 mother-0.000
 parent-0.000
 queen-0.000
Tokenn
Token ID77
Feature activation-2.87e-02

Pos logit contributions
|+0.265
ashing+0.225
+0.217
go+0.214
des+0.213
Neg logit contributions
 grandmother-0.304
aby-0.299
 gen-0.267
omm-0.261
 parent-0.256
Tokenics
Token ID873
Feature activation-8.82e-02

Pos logit contributions
|+0.041
des+0.032
ited+0.031
 jog+0.030
 stroll+0.029
Neg logit contributions
 grandmother-0.061
aby-0.056
 parent-0.056
rings-0.054
 lamp-0.053
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
|+0.152
ited+0.123
des+0.123
acks+0.117
venture+0.116
Neg logit contributions
 grandmother-0.165
 parent-0.143
 grandfather-0.140
 mother-0.137
 gen-0.136
Token One
Token ID1881
Feature activation+3.90e-02

Pos logit contributions
|+0.014
des+0.012
ited+0.011
go+0.011
 jog+0.011
Neg logit contributions
 grandmother-0.019
 parent-0.016
 grandfather-0.016
 lamp-0.016
 gen-0.016
Token day
Token ID1110
Feature activation-3.58e-02

Pos logit contributions
 grandmother+0.076
 mother+0.075
 parent+0.073
 Mother+0.068
 lamp+0.068
Neg logit contributions
|-0.062
cha-0.055
hed-0.053
 AW-0.051
Yo-0.050
Token saw
Token ID2497
Feature activation+6.22e-02

Pos logit contributions
 grandmother+0.027
 parent+0.024
 grandfather+0.023
 mother+0.023
 gen+0.023
Neg logit contributions
|-0.026
des-0.022
war-0.020
go-0.019
cluded-0.019
Token lots
Token ID6041
Feature activation-3.58e-03

Pos logit contributions
 grandmother+0.145
omm+0.133
 gen+0.133
 parent+0.130
ware+0.130
Neg logit contributions
|-0.071
 jog-0.058
 walks-0.058
 stroll-0.055
des-0.055
Token of
Token ID286
Feature activation+5.76e-02

Pos logit contributions
|+0.007
des+0.006
 jog+0.006
 stroll+0.006
go+0.006
Neg logit contributions
 grandmother-0.006
 gen-0.005
 parent-0.005
 lamp-0.005
 grandfather-0.005
Token people
Token ID661
Feature activation+1.41e-02

Pos logit contributions
 grandmother+0.099
 gen+0.088
 parent+0.088
 lamp+0.086
 grandfather+0.084
Neg logit contributions
|-0.111
des-0.090
 jog-0.085
go-0.085
 stroll-0.084
Token having
Token ID1719
Feature activation-2.51e-02

Pos logit contributions
 grandmother+0.026
 parent+0.023
 lamp+0.023
 grandfather+0.023
 gen+0.023
Neg logit contributions
|-0.025
des-0.020
war-0.019
Yo-0.018
-0.018
Token pic
Token ID8301
Feature activation-1.72e-01

Pos logit contributions
|+0.041
ited+0.034
rated+0.034
go+0.033
+0.032
Neg logit contributions
 grandmother-0.047
 scissors-0.043
 curtains-0.041
 parent-0.040
 lamp-0.040
Tokenn
Token ID77
Feature activation-4.00e-02

Pos logit contributions
|+0.265
+0.233
 down+0.221
ashing+0.216
go+0.215
Neg logit contributions
 grandmother-0.286
aby-0.279
omm-0.253
 gen-0.248
ware-0.244
Tokenics
Token ID873
Feature activation-9.39e-02

Pos logit contributions
|+0.056
des+0.046
 jog+0.044
+0.042
 stroll+0.040
Neg logit contributions
 grandmother-0.085
aby-0.079
 parent-0.077
illian-0.075
rings-0.074
Token and
Token ID290
Feature activation-4.45e-02

Pos logit contributions
|+0.181
des+0.150
go+0.144
war+0.141
ited+0.139
Neg logit contributions
 grandmother-0.164
 grandfather-0.139
 parent-0.138
 lamp-0.135
 mother-0.133
Token walking
Token ID6155
Feature activation-1.29e-01

Pos logit contributions
|+0.060
 jog+0.046
des+0.044
 stroll+0.044
 walks+0.043
Neg logit contributions
 grandmother-0.101
 gen-0.093
 parent-0.090
omm-0.089
 grandfather-0.089
Token their
Token ID511
Feature activation-8.24e-04

Pos logit contributions
|+0.199
des+0.178
ited+0.178
text+0.170
iling+0.163
Neg logit contributions
 grandmother-0.226
 parent-0.203
 mother-0.200
 lamp-0.195
rings-0.194
Token even
Token ID772
Feature activation+1.38e-03

Pos logit contributions
 grandmother+0.076
 parent+0.069
 gen+0.068
 mother+0.068
 lamp+0.067
Neg logit contributions
|-0.061
des-0.050
war-0.045
ashing-0.044
venture-0.044
Token see
Token ID766
Feature activation+3.49e-02

Pos logit contributions
 grandmother+0.003
 gen+0.003
 parent+0.003
 grandfather+0.003
omm+0.003
Neg logit contributions
|-0.002
des-0.001
 jog-0.001
go-0.001
 stroll-0.001
Token other
Token ID584
Feature activation+8.74e-02

Pos logit contributions
 grandmother+0.071
 gen+0.064
 parent+0.064
 lamp+0.062
 grandfather+0.062
Neg logit contributions
|-0.057
des-0.045
 jog-0.042
go-0.041
 stroll-0.041
Token people
Token ID661
Feature activation+1.82e-02

Pos logit contributions
 grandmother+0.110
 lamp+0.099
 parent+0.094
 gen+0.091
 curtains+0.090
Neg logit contributions
|-0.204
des-0.170
war-0.163
ashing-0.160
ited-0.159
Token having
Token ID1719
Feature activation-2.58e-02

Pos logit contributions
 grandmother+0.034
 lamp+0.030
 gen+0.030
rings+0.030
 parent+0.030
Neg logit contributions
|-0.032
des-0.025
war-0.025
text-0.024
-0.023
Token pic
Token ID8301
Feature activation-1.71e-01

Pos logit contributions
|+0.042
ashing+0.034
rated+0.033
war+0.032
ited+0.032
Neg logit contributions
 grandmother-0.048
 scissors-0.044
 lamp-0.043
 gen-0.043
 curtains-0.042
Tokenn
Token ID77
Feature activation-3.84e-02

Pos logit contributions
|+0.269
+0.232
ashing+0.228
 jog+0.224
 down+0.219
Neg logit contributions
 grandmother-0.275
aby-0.269
 gen-0.245
ware-0.237
ais-0.236
Tokenics
Token ID873
Feature activation-9.17e-02

Pos logit contributions
|+0.057
des+0.046
 jog+0.045
 stroll+0.042
+0.041
Neg logit contributions
 grandmother-0.078
aby-0.072
 parent-0.069
rings-0.069
illian-0.068
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
|+0.167
des+0.134
go+0.126
war+0.124
ited+0.124
Neg logit contributions
 grandmother-0.169
 grandfather-0.145
 parent-0.144
 gen-0.144
 lamp-0.142
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
|+0.010
des+0.009
 jog+0.008
go+0.008
ited+0.008
Neg logit contributions
 grandmother-0.012
 parent-0.010
 grandfather-0.010
illian-0.010
omm-0.010
Token
Token ID198
Feature activation-8.52e-03

Pos logit contributions
|+0.017
des+0.014
 jog+0.014
go+0.013
ited+0.013
Neg logit contributions
 grandmother-0.023
 gen-0.020
 parent-0.020
aby-0.020
illian-0.020
Token in
Token ID287
Feature activation-2.45e-02

Pos logit contributions
|+0.152
ited+0.134
des+0.127
 wanna+0.117
war+0.116
Neg logit contributions
 grandmother-0.167
 lamp-0.166
 gen-0.155
ware-0.155
 fairy-0.154
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
|+0.044
des+0.038
ited+0.037
ashing+0.036
go+0.036
Neg logit contributions
 grandmother-0.045
 lamp-0.041
 parent-0.040
 gen-0.040
 mother-0.038
Token garden
Token ID11376
Feature activation+1.63e-04

Pos logit contributions
|+0.053
des+0.045
go+0.043
ited+0.042
war+0.042
Neg logit contributions
 grandmother-0.032
 parent-0.027
 lamp-0.026
 mother-0.025
 gen-0.025
Token and
Token ID290
Feature activation-4.58e-02

Pos logit contributions
 grandmother+0.000
 gen+0.000
 parent+0.000
 lamp+0.000
 grandfather+0.000
Neg logit contributions
|-0.000
des-0.000
 jog-0.000
go-0.000
 stroll-0.000
Token have
Token ID423
Feature activation-2.97e-03

Pos logit contributions
|+0.050
des+0.034
 jog+0.033
 stroll+0.031
go+0.030
Neg logit contributions
 grandmother-0.118
 gen-0.108
 parent-0.107
 grandfather-0.105
 lamp-0.104
Token pic
Token ID8301
Feature activation-1.71e-01

Pos logit contributions
|+0.005
ited+0.004
rated+0.004
ashing+0.004
aling+0.004
Neg logit contributions
 grandmother-0.005
 scissors-0.005
 lamp-0.005
 curtains-0.005
 gen-0.005
Tokenn
Token ID77
Feature activation-2.82e-02

Pos logit contributions
|+0.262
ashing+0.219
go+0.213
+0.211
des+0.210
Neg logit contributions
 grandmother-0.306
aby-0.296
 gen-0.266
omm-0.264
ware-0.260
Tokenics
Token ID873
Feature activation-8.71e-02

Pos logit contributions
|+0.040
des+0.031
 jog+0.029
 stroll+0.029
ited+0.029
Neg logit contributions
 grandmother-0.062
aby-0.057
 parent-0.056
 lamp-0.054
illian-0.053
Token together
Token ID1978
Feature activation-2.25e-02

Pos logit contributions
|+0.157
des+0.127
ited+0.124
acks+0.120
venture+0.120
Neg logit contributions
 grandmother-0.161
 parent-0.139
 grandfather-0.136
 gen-0.134
 lamp-0.133
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
|+0.045
des+0.038
ited+0.035
 jog+0.034
go+0.034
Neg logit contributions
 grandmother-0.037
ware-0.032
 lamp-0.032
 gen-0.031
 parent-0.031
Token
Token ID220
Feature activation+2.13e-02

Pos logit contributions
|+0.016
des+0.013
 jog+0.012
go+0.012
ited+0.011
Neg logit contributions
 grandmother-0.022
 parent-0.020
 gen-0.020
 grandfather-0.019
 lamp-0.019
Token that
Token ID326
Feature activation+1.67e-02

Pos logit contributions
 grandmother+0.122
 parent+0.111
omm+0.109
 gen+0.109
aby+0.108
Neg logit contributions
|-0.077
 jog-0.063
go-0.063
 walks-0.060
des-0.060
Token said
Token ID531
Feature activation+2.99e-02

Pos logit contributions
 grandmother+0.039
omm+0.037
 parent+0.035
 gen+0.035
aby+0.035
Neg logit contributions
|-0.020
 jog-0.015
 walks-0.014
go-0.014
 walk-0.014
Token "
Token ID366
Feature activation-5.46e-03

Pos logit contributions
 grandmother+0.051
 parent+0.045
 grandfather+0.044
 gen+0.044
 lamp+0.043
Neg logit contributions
|-0.051
 jog-0.045
 walks-0.043
 stroll-0.043
go-0.042
TokenWelcome
Token ID14618
Feature activation-7.72e-02

Pos logit contributions
|+0.008
des+0.007
 walks+0.006
 jog+0.006
 stroll+0.006
Neg logit contributions
 grandmother-0.011
 gen-0.010
 parent-0.010
 grandfather-0.010
 Ev-0.009
Token to
Token ID284
Feature activation-1.68e-02

Pos logit contributions
|+0.135
des+0.113
ited+0.104
go+0.101
Yo+0.097
Neg logit contributions
 grandmother-0.155
 parent-0.137
 grandfather-0.133
aby-0.129
 mother-0.128
Token Kinder
Token ID46832
Feature activation-1.69e-01

Pos logit contributions
|+0.027
des+0.022
go+0.020
 jog+0.020
 stroll+0.020
Neg logit contributions
 grandmother-0.034
 parent-0.030
 gen-0.030
 grandfather-0.030
 lamp-0.029
Tokeng
Token ID70
Feature activation-7.29e-02

Pos logit contributions
|+0.182
des+0.174
ashing+0.168
irm+0.166
ited+0.148
Neg logit contributions
 grandmother-0.371
 Genie-0.342
aby-0.334
 gen-0.333
 wizard-0.324
Tokenarten
Token ID23996
Feature activation+4.68e-02

Pos logit contributions
|+0.133
des+0.119
ited+0.106
 stroll+0.102
go+0.101
Neg logit contributions
 grandmother-0.133
 parent-0.115
aby-0.114
omm-0.111
 grandfather-0.110
Token!"
Token ID2474
Feature activation-2.21e-02

Pos logit contributions
 grandmother+0.104
 parent+0.094
 gen+0.093
 grandfather+0.091
 lamp+0.091
Neg logit contributions
|-0.067
des-0.052
 jog-0.048
go-0.047
 stroll-0.047
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
|+0.033
des+0.026
 jog+0.024
go+0.024
 stroll+0.024
Neg logit contributions
 grandmother-0.048
 gen-0.043
 parent-0.043
 grandfather-0.042
 lamp-0.041
Token
Token ID198
Feature activation-9.45e-03

Pos logit contributions
|+0.016
des+0.011
 jog+0.011
go+0.011
 stroll+0.010
Neg logit contributions
 grandmother-0.027
 gen-0.025
 parent-0.025
 grandfather-0.024
 lamp-0.024
Token the
Token ID262
Feature activation-1.94e-02

Pos logit contributions
|+0.013
 jog+0.009
 walks+0.009
 stroll+0.009
des+0.009
Neg logit contributions
 grandmother-0.025
 gen-0.023
 parent-0.023
 grandfather-0.022
omm-0.022
Token warm
Token ID5814
Feature activation-4.37e-02

Pos logit contributions
|+0.041
des+0.033
war+0.032
ited+0.032
go+0.031
Neg logit contributions
 grandmother-0.032
 lamp-0.029
 parent-0.028
 gen-0.027
 wizard-0.026
Token sunshine
Token ID34488
Feature activation-1.93e-02

Pos logit contributions
|+0.080
des+0.065
 jog+0.062
go+0.062
 stroll+0.061
Neg logit contributions
 grandmother-0.079
 gen-0.070
 parent-0.070
 lamp-0.069
 grandfather-0.066
Token and
Token ID290
Feature activation-3.42e-02

Pos logit contributions
|+0.026
des+0.019
 jog+0.019
go+0.018
 stroll+0.018
Neg logit contributions
 grandmother-0.044
 gen-0.040
 parent-0.040
 grandfather-0.039
 lamp-0.039
Token having
Token ID1719
Feature activation-1.65e-02

Pos logit contributions
|+0.048
 jog+0.038
 walks+0.036
 stroll+0.036
des+0.035
Neg logit contributions
 grandmother-0.077
 gen-0.071
omm-0.070
 parent-0.069
 grandfather-0.068
Token pic
Token ID8301
Feature activation-1.69e-01

Pos logit contributions
|+0.023
ashing+0.019
go+0.019
rated+0.018
ited+0.018
Neg logit contributions
 grandmother-0.035
 lamp-0.030
 parent-0.030
 gen-0.030
 scissors-0.030
Tokenn
Token ID77
Feature activation-2.62e-02

Pos logit contributions
|+0.251
ashing+0.215
+0.212
go+0.211
des+0.208
Neg logit contributions
 grandmother-0.291
aby-0.290
 gen-0.259
ware-0.258
omm-0.256
Tokenics
Token ID873
Feature activation-8.49e-02

Pos logit contributions
|+0.036
des+0.029
 jog+0.027
 stroll+0.026
ited+0.025
Neg logit contributions
 grandmother-0.058
aby-0.053
 parent-0.052
illian-0.051
 lamp-0.050
Token with
Token ID351
Feature activation+2.94e-02

Pos logit contributions
|+0.137
des+0.110
go+0.104
 jog+0.099
 stroll+0.099
Neg logit contributions
 grandmother-0.173
 parent-0.152
 gen-0.150
 lamp-0.149
 grandfather-0.148
Token her
Token ID607
Feature activation+1.77e-02

Pos logit contributions
 grandmother+0.063
 gen+0.061
 grandfather+0.059
 parent+0.057
omm+0.057
Neg logit contributions
|-0.041
 walks-0.032
 stroll-0.031
 jog-0.030
 play-0.029
Token family
Token ID1641
Feature activation-1.27e-02

Pos logit contributions
 grandmother+0.027
 gen+0.025
omm+0.024
illian+0.024
 grandfather+0.023
Neg logit contributions
|-0.035
 jog-0.029
 stroll-0.028
 walks-0.028
des-0.028
Token their
Token ID511
Feature activation-1.40e-02

Pos logit contributions
|+0.023
des+0.018
 jog+0.018
go+0.017
 stroll+0.017
Neg logit contributions
 grandmother-0.033
 gen-0.029
 parent-0.029
 lamp-0.029
 grandfather-0.029
Token salad
Token ID20698
Feature activation-2.58e-02

Pos logit contributions
|+0.032
 jog+0.026
go+0.026
des+0.026
hed+0.026
Neg logit contributions
 grandmother-0.019
 lamp-0.017
 gen-0.017
 parent-0.016
 mother-0.016
Token and
Token ID290
Feature activation-4.42e-02

Pos logit contributions
|+0.032
des+0.023
 jog+0.022
 stroll+0.021
go+0.021
Neg logit contributions
 grandmother-0.062
 gen-0.056
 parent-0.056
 lamp-0.055
 grandfather-0.055
Token have
Token ID423
Feature activation-6.96e-03

Pos logit contributions
|+0.046
 jog+0.031
des+0.031
 walk+0.029
 walks+0.029
Neg logit contributions
 grandmother-0.115
omm-0.106
 gen-0.106
illian-0.104
 grandfather-0.103
Token happy
Token ID3772
Feature activation-3.36e-02

Pos logit contributions
|+0.011
ited+0.009
go+0.009
des+0.009
ashing+0.008
Neg logit contributions
 grandmother-0.014
 parent-0.013
 gen-0.012
 mother-0.012
 lamp-0.012
Token pic
Token ID8301
Feature activation-1.68e-01

Pos logit contributions
|+0.070
go+0.056
war+0.056
des+0.056
 jog+0.055
Neg logit contributions
 grandmother-0.054
 parent-0.047
 gen-0.046
 lamp-0.045
 mother-0.044
Tokenn
Token ID77
Feature activation-2.78e-02

Pos logit contributions
|+0.241
 jog+0.209
ashing+0.205
 down+0.199
 dives+0.199
Neg logit contributions
 grandmother-0.314
aby-0.300
 gen-0.290
omm-0.276
 Genie-0.267
Tokenics
Token ID873
Feature activation-8.93e-02

Pos logit contributions
|+0.040
 jog+0.031
des+0.031
 stroll+0.030
+0.029
Neg logit contributions
 grandmother-0.060
 parent-0.055
aby-0.054
 gen-0.052
illian-0.052
Token together
Token ID1978
Feature activation-2.51e-02

Pos logit contributions
|+0.129
ited+0.102
des+0.102
 stroll+0.098
 jog+0.096
Neg logit contributions
 grandmother-0.188
 parent-0.170
 gen-0.168
 grandfather-0.163
 lamp-0.161
Token.
Token ID13
Feature activation-4.34e-03

Pos logit contributions
|+0.045
des+0.038
 jog+0.035
go+0.035
ited+0.035
Neg logit contributions
 grandmother-0.046
 parent-0.040
 gen-0.040
 lamp-0.040
 grandfather-0.039
Token<|endoftext|>
Token ID50256
Feature activation-1.01e-01

Pos logit contributions
|+0.006
des+0.005
 jog+0.004
go+0.004
 stroll+0.004
Neg logit contributions
 grandmother-0.010
 gen-0.009
 parent-0.009
 grandfather-0.009
 lamp-0.009
Token so
Token ID523
Feature activation-4.25e-02

Pos logit contributions
 grandmother+0.024
 parent+0.021
 gen+0.021
 lamp+0.021
 grandfather+0.020
Neg logit contributions
|-0.017
des-0.014
war-0.013
 jog-0.013
-0.012
Token excited
Token ID6568
Feature activation-3.42e-02

Pos logit contributions
|+0.079
war+0.065
 stroll+0.062
atch+0.062
 walks+0.062
Neg logit contributions
 grandmother-0.073
 gen-0.065
 parent-0.065
 mother-0.064
 Ev-0.062
Token!
Token ID0
Feature activation-2.08e-02

Pos logit contributions
|+0.064
des+0.049
 jog+0.047
Yo+0.047
war+0.046
Neg logit contributions
 grandmother-0.064
 parent-0.055
 grandfather-0.054
 mother-0.053
 lamp-0.053
Token She
Token ID1375
Feature activation+4.46e-02

Pos logit contributions
|+0.033
des+0.026
 jog+0.025
 stroll+0.025
go+0.024
Neg logit contributions
 grandmother-0.042
 gen-0.039
 parent-0.038
 grandfather-0.037
 lamp-0.037
Token loved
Token ID6151
Feature activation+1.60e-02

Pos logit contributions
 grandmother+0.094
 parent+0.084
 mother+0.081
 grandfather+0.081
 gen+0.081
Neg logit contributions
|-0.072
des-0.058
war-0.054
go-0.050
 jog-0.050
Token pic
Token ID8301
Feature activation-1.67e-01

Pos logit contributions
 grandmother+0.031
 parent+0.027
 lamp+0.027
 gen+0.026
 mother+0.026
Neg logit contributions
|-0.029
des-0.023
ited-0.022
-0.022
ashing-0.021
Tokenn
Token ID77
Feature activation-2.75e-02

Pos logit contributions
|+0.244
ashing+0.196
 jog+0.193
des+0.192
ashes+0.189
Neg logit contributions
 grandmother-0.333
aby-0.312
 gen-0.298
omm-0.290
 lamp-0.283
Tokenics
Token ID873
Feature activation-8.22e-02

Pos logit contributions
|+0.039
des+0.029
 jog+0.028
 stroll+0.027
+0.027
Neg logit contributions
 grandmother-0.061
 parent-0.055
aby-0.055
 lamp-0.054
 gen-0.053
Token and
Token ID290
Feature activation-4.28e-02

Pos logit contributions
|+0.124
des+0.096
 jog+0.089
go+0.089
 stroll+0.087
Neg logit contributions
 grandmother-0.178
 gen-0.159
 parent-0.158
 grandfather-0.155
 lamp-0.154
Token spending
Token ID4581
Feature activation-7.83e-02

Pos logit contributions
|+0.057
des+0.043
go+0.040
 jog+0.039
 stroll+0.038
Neg logit contributions
 grandmother-0.098
 parent-0.089
 gen-0.089
 lamp-0.087
 grandfather-0.087
Token time
Token ID640
Feature activation-8.29e-02

Pos logit contributions
des+0.095
|+0.091
go+0.087
cha+0.081
ashing+0.080
Neg logit contributions
 grandmother-0.152
aby-0.143
 scissors-0.142
 lamp-0.139
ware-0.138
Token to
Token ID284
Feature activation-8.07e-03

Pos logit contributions
 grandmother+0.151
 gen+0.135
 parent+0.133
 grandfather+0.130
 lamp+0.129
Neg logit contributions
|-0.137
des-0.110
 jog-0.107
go-0.104
 stroll-0.104
Token be
Token ID307
Feature activation-3.31e-02

Pos logit contributions
|+0.012
 jog+0.009
des+0.009
 walk+0.008
 stroll+0.008
Neg logit contributions
 grandmother-0.018
 gen-0.016
 parent-0.016
 grandfather-0.016
ware-0.016
Token careful
Token ID8161
Feature activation+3.11e-02

Pos logit contributions
|+0.065
des+0.051
 stroll+0.051
 jog+0.050
go+0.049
Neg logit contributions
 grandmother-0.057
 gen-0.052
 parent-0.051
 lamp-0.048
 grandfather-0.048
Token when
Token ID618
Feature activation+5.61e-02

Pos logit contributions
 grandmother+0.069
 gen+0.062
 parent+0.062
 grandfather+0.060
 lamp+0.060
Neg logit contributions
|-0.045
des-0.034
 jog-0.033
go-0.032
 stroll-0.032
Token having
Token ID1719
Feature activation-2.92e-02

Pos logit contributions
 grandmother+0.111
 parent+0.099
 gen+0.098
 grandfather+0.096
 lamp+0.095
Neg logit contributions
|-0.096
des-0.076
go-0.073
 jog-0.072
 stroll-0.070
Token pic
Token ID8301
Feature activation-1.66e-01

Pos logit contributions
|+0.055
go+0.044
ashing+0.042
war+0.040
des+0.039
Neg logit contributions
 grandmother-0.051
 gen-0.046
 scissors-0.046
 lamp-0.046
 parent-0.046
Tokenn
Token ID77
Feature activation-3.36e-02

Pos logit contributions
|+0.243
go+0.196
 jog+0.195
+0.194
des+0.191
Neg logit contributions
 grandmother-0.314
aby-0.293
 gen-0.287
omm-0.284
 parent-0.273
Tokenics
Token ID873
Feature activation-8.74e-02

Pos logit contributions
|+0.049
des+0.041
 jog+0.037
 stroll+0.036
ited+0.036
Neg logit contributions
 grandmother-0.071
 parent-0.065
aby-0.063
omm-0.063
 grandfather-0.062
Token and
Token ID290
Feature activation-4.09e-02

Pos logit contributions
|+0.163
des+0.136
go+0.130
ited+0.123
 stroll+0.120
Neg logit contributions
 grandmother-0.157
 parent-0.140
 grandfather-0.135
 gen-0.133
 mother-0.132
Token not
Token ID407
Feature activation+2.48e-02

Pos logit contributions
|+0.064
des+0.051
 jog+0.050
 stroll+0.048
go+0.047
Neg logit contributions
 grandmother-0.084
 gen-0.076
 parent-0.075
 grandfather-0.074
omm-0.073
Token to
Token ID284
Feature activation-6.74e-03

Pos logit contributions
 grandmother+0.048
 parent+0.045
 gen+0.044
 lamp+0.044
 mother+0.042
Neg logit contributions
|-0.040
des-0.031
go-0.029
war-0.029
Yo-0.029
Token roof
Token ID9753
Feature activation+4.50e-02

Pos logit contributions
|+0.047
acks+0.037
hed+0.037
des+0.037
war+0.037
Neg logit contributions
 grandmother-0.043
 gen-0.038
 lamp-0.038
 parent-0.037
 grandfather-0.036
Token.
Token ID13
Feature activation-2.91e-03

Pos logit contributions
 grandmother+0.104
 gen+0.096
 parent+0.094
 grandfather+0.093
omm+0.092
Neg logit contributions
|-0.059
des-0.043
go-0.042
 jog-0.042
 stroll-0.039
Token They
Token ID1119
Feature activation-2.74e-02

Pos logit contributions
|+0.004
des+0.003
 jog+0.003
go+0.003
 stroll+0.003
Neg logit contributions
 grandmother-0.007
 parent-0.006
 gen-0.006
 grandfather-0.006
 lamp-0.006
Token would
Token ID561
Feature activation+1.17e-03

Pos logit contributions
|+0.037
des+0.028
 jog+0.026
go+0.026
 stroll+0.026
Neg logit contributions
 grandmother-0.063
 gen-0.057
 parent-0.057
 grandfather-0.056
 lamp-0.056
Token have
Token ID423
Feature activation-4.65e-03

Pos logit contributions
 grandmother+0.003
 gen+0.002
 parent+0.002
 grandfather+0.002
 lamp+0.002
Neg logit contributions
|-0.002
des-0.001
 jog-0.001
 stroll-0.001
go-0.001
Token pic
Token ID8301
Feature activation-1.66e-01

Pos logit contributions
|+0.009
des+0.007
 jog+0.007
go+0.006
ashing+0.006
Neg logit contributions
 grandmother-0.008
 gen-0.007
 parent-0.007
 grandfather-0.007
 lamp-0.007
Tokenn
Token ID77
Feature activation-2.66e-02

Pos logit contributions
|+0.252
ashing+0.214
 jog+0.205
+0.201
des+0.200
Neg logit contributions
 grandmother-0.291
aby-0.279
 gen-0.263
omm-0.258
ware-0.248
Tokenics
Token ID873
Feature activation-8.09e-02

Pos logit contributions
|+0.040
des+0.031
 jog+0.029
 stroll+0.029
ited+0.029
Neg logit contributions
 grandmother-0.055
 parent-0.050
aby-0.049
omm-0.047
 lamp-0.047
Token and
Token ID290
Feature activation-3.89e-02

Pos logit contributions
|+0.123
des+0.096
ited+0.088
 stroll+0.087
go+0.087
Neg logit contributions
 grandmother-0.172
 parent-0.153
 gen-0.151
 grandfather-0.150
 lamp-0.147
Token play
Token ID711
Feature activation-7.50e-02

Pos logit contributions
|+0.045
 jog+0.033
 walk+0.031
 play+0.031
 walks+0.031
Neg logit contributions
 grandmother-0.096
omm-0.090
 gen-0.088
aby-0.088
 parent-0.086
Token games
Token ID1830
Feature activation-5.86e-02

Pos logit contributions
|+0.112
ited+0.092
des+0.091
 wanna+0.083
 stroll+0.082
Neg logit contributions
 grandmother-0.152
 lamp-0.142
 gen-0.141
ware-0.136
 parent-0.132
Token see
Token ID766
Feature activation+2.25e-02

Pos logit contributions
|+0.006
go+0.005
des+0.005
war+0.005
ashing+0.004
Neg logit contributions
 grandmother-0.012
 gen-0.011
 parent-0.011
 lamp-0.011
 wizard-0.011
Token the
Token ID262
Feature activation-6.00e-03

Pos logit contributions
 grandmother+0.049
 gen+0.044
 parent+0.044
 grandfather+0.043
 lamp+0.043
Neg logit contributions
|-0.034
des-0.026
 jog-0.025
go-0.024
 stroll-0.024
Token bird
Token ID6512
Feature activation+2.32e-02

Pos logit contributions
|+0.012
des+0.010
 jog+0.009
go+0.009
 stroll+0.009
Neg logit contributions
 grandmother-0.010
 gen-0.009
 parent-0.009
 lamp-0.009
 grandfather-0.009
Token and
Token ID290
Feature activation-2.55e-02

Pos logit contributions
 grandmother+0.043
 gen+0.040
 lamp+0.040
 parent+0.040
 fairy+0.039
Neg logit contributions
|-0.039
des-0.031
war-0.031
Yo-0.029
 jog-0.029
Token have
Token ID423
Feature activation+7.50e-03

Pos logit contributions
|+0.030
 jog+0.022
des+0.021
 walk+0.020
 play+0.020
Neg logit contributions
 grandmother-0.063
omm-0.060
 gen-0.058
ware-0.057
illian-0.057
Token pic
Token ID8301
Feature activation-1.65e-01

Pos logit contributions
 grandmother+0.014
 parent+0.013
 gen+0.013
 mother+0.012
 grandfather+0.012
Neg logit contributions
|-0.013
go-0.010
des-0.010
 jog-0.010
ited-0.010
Tokenn
Token ID77
Feature activation-2.20e-02

Pos logit contributions
|+0.248
ashing+0.204
+0.197
 jog+0.193
go+0.192
Neg logit contributions
 grandmother-0.309
aby-0.292
 gen-0.278
omm-0.265
 parent-0.260
Tokenics
Token ID873
Feature activation-8.01e-02

Pos logit contributions
|+0.031
des+0.023
 jog+0.023
ited+0.022
+0.022
Neg logit contributions
 grandmother-0.048
aby-0.043
 parent-0.043
 gen-0.041
 lamp-0.041
Token.
Token ID13
Feature activation+1.50e-03

Pos logit contributions
|+0.130
des+0.101
ited+0.099
go+0.098
war+0.096
Neg logit contributions
 grandmother-0.161
 parent-0.140
 gen-0.137
 grandfather-0.137
 mother-0.134
Token<|endoftext|>
Token ID50256
Feature activation-9.69e-02

Pos logit contributions
 grandmother+0.003
 parent+0.003
 gen+0.003
 grandfather+0.003
 lamp+0.003
Neg logit contributions
|-0.002
des-0.002
 jog-0.002
go-0.002
 stroll-0.002
TokenOnce
Token ID7454
Feature activation-3.54e-02

Pos logit contributions
ited+0.170
|+0.168
des+0.168
ashing+0.160
 jog+0.155
Neg logit contributions
 grandmother-0.156
 gen-0.136
 lamp-0.134
aby-0.131
 parent-0.126
Token mom
Token ID1995
Feature activation+3.14e-02

Pos logit contributions
omm+0.034
ware+0.033
 grandmother+0.033
aby+0.033
 gen+0.033
Neg logit contributions
 jog-0.051
 play-0.050
|-0.048
 walks-0.048
 walk-0.047
Token to
Token ID284
Feature activation-8.13e-03

Pos logit contributions
 grandmother+0.069
 gen+0.068
aby+0.063
omm+0.063
 parent+0.063
Neg logit contributions
|-0.038
 jog-0.031
des-0.029
-0.029
 play-0.029
Token help
Token ID1037
Feature activation+3.04e-02

Pos logit contributions
|+0.012
 jog+0.010
des+0.010
 walk+0.010
 stroll+0.009
Neg logit contributions
 grandmother-0.017
 gen-0.016
 parent-0.015
 grandfather-0.015
ware-0.015
Token her
Token ID607
Feature activation+2.48e-02

Pos logit contributions
 grandmother+0.058
 gen+0.051
 parent+0.051
 grandfather+0.050
 lamp+0.049
Neg logit contributions
|-0.053
des-0.043
 jog-0.042
go-0.041
 stroll-0.041
Token.
Token ID13
Feature activation-4.10e-03

Pos logit contributions
 grandmother+0.054
 gen+0.050
 parent+0.049
 grandfather+0.048
omm+0.047
Neg logit contributions
|-0.035
 jog-0.027
des-0.027
 stroll-0.025
go-0.025
Token But
Token ID887
Feature activation+9.70e-02

Pos logit contributions
|+0.006
 jog+0.004
des+0.004
go+0.004
+0.004
Neg logit contributions
 grandmother-0.009
 gen-0.009
 parent-0.008
 grandfather-0.008
ware-0.008
Token her
Token ID607
Feature activation+4.12e-02

Pos logit contributions
 gen+0.223
 grandmother+0.221
ware+0.216
illian+0.215
aby+0.210
Neg logit contributions
|-0.078
 jog-0.074
 play-0.074
 walk-0.072
 down-0.070
Token mom
Token ID1995
Feature activation+5.39e-02

Pos logit contributions
 grandmother+0.041
 gen+0.038
ware+0.036
 Ang+0.035
 Ev+0.034
Neg logit contributions
|-0.099
 jog-0.091
 walks-0.088
 walk-0.088
go-0.087
Token was
Token ID373
Feature activation-1.67e-03

Pos logit contributions
 grandmother+0.129
ware+0.124
 gen+0.122
aby+0.122
 lamp+0.117
Neg logit contributions
|-0.051
go-0.044
-0.041
hed-0.039
 played-0.039
Token very
Token ID845
Feature activation-2.96e-02

Pos logit contributions
|+0.003
des+0.002
 jog+0.002
go+0.002
 stroll+0.002
Neg logit contributions
 grandmother-0.004
 gen-0.003
 parent-0.003
 grandfather-0.003
 lamp-0.003
Token selfish
Token ID20363
Feature activation-3.31e-02

Pos logit contributions
|+0.048
des+0.037
 stroll+0.035
 walks+0.034
 jog+0.034
Neg logit contributions
 grandmother-0.062
 parent-0.056
 gen-0.055
 mother-0.054
 grandfather-0.054
Token smile
Token ID8212
Feature activation-1.27e-02

Pos logit contributions
 grandmother+0.074
 gen+0.068
 parent+0.067
 grandfather+0.066
 lamp+0.065
Neg logit contributions
|-0.047
des-0.035
 jog-0.035
 stroll-0.033
go-0.033
Token,
Token ID11
Feature activation+2.56e-02

Pos logit contributions
|+0.016
 jog+0.012
des+0.011
go+0.011
 stroll+0.011
Neg logit contributions
 grandmother-0.030
 gen-0.027
 parent-0.027
 grandfather-0.027
omm-0.026
Token but
Token ID475
Feature activation+5.15e-02

Pos logit contributions
 grandmother+0.062
 gen+0.057
 parent+0.056
 grandfather+0.056
illian+0.056
Neg logit contributions
|-0.029
 jog-0.023
 walks-0.022
 stroll-0.021
des-0.021
Token the
Token ID262
Feature activation+9.07e-03

Pos logit contributions
 grandmother+0.104
 gen+0.097
aby+0.094
omm+0.094
 parent+0.094
Neg logit contributions
|-0.078
 jog-0.064
 walks-0.060
 walk-0.059
-0.059
Token best
Token ID1266
Feature activation-2.61e-02

Pos logit contributions
 grandmother+0.017
omm+0.016
 gen+0.016
 Ang+0.016
 grandfather+0.015
Neg logit contributions
|-0.014
 jog-0.013
 play-0.012
 walk-0.012
 time-0.012
Token part
Token ID636
Feature activation+1.02e-01

Pos logit contributions
|+0.042
 jog+0.037
des+0.035
go+0.034
 stroll+0.034
Neg logit contributions
 grandmother-0.050
 gen-0.046
 grandfather-0.044
omm-0.044
 parent-0.044
Token was
Token ID373
Feature activation-7.59e-03

Pos logit contributions
 grandmother+0.220
 gen+0.199
 parent+0.197
 grandfather+0.195
 lamp+0.190
Neg logit contributions
|-0.150
des-0.114
 jog-0.112
go-0.110
 stroll-0.106
Token when
Token ID618
Feature activation+5.95e-02

Pos logit contributions
|+0.011
 jog+0.008
des+0.008
 walks+0.008
 stroll+0.008
Neg logit contributions
 grandmother-0.016
 gen-0.015
 parent-0.015
 grandfather-0.015
omm-0.014
Token she
Token ID673
Feature activation-2.83e-02

Pos logit contributions
 grandmother+0.107
 gen+0.100
 parent+0.096
 grandfather+0.094
omm+0.093
Neg logit contributions
|-0.101
 jog-0.086
 walks-0.084
 stroll-0.082
des-0.081
Token found
Token ID1043
Feature activation+4.32e-02

Pos logit contributions
|+0.046
des+0.037
 stroll+0.034
go+0.034
 jog+0.034
Neg logit contributions
 grandmother-0.057
 gen-0.052
 parent-0.051
 lamp-0.050
ware-0.050
Token something
Token ID1223
Feature activation+5.52e-02

Pos logit contributions
 grandmother+0.102
 parent+0.097
 gen+0.096
 grandfather+0.096
omm+0.096
Neg logit contributions
 walks-0.036
|-0.036
 time-0.034
 around-0.033
 trips-0.030
Token big
Token ID1263
Feature activation-4.87e-02

Pos logit contributions
 grandmother+0.040
rings+0.040
rays+0.039
illian+0.039
 Ev+0.039
Neg logit contributions
|-0.021
 jog-0.020
 picnic-0.019
 walk-0.018
 play-0.018
Token,
Token ID11
Feature activation+1.85e-02

Pos logit contributions
|+0.083
des+0.066
 jog+0.065
 stroll+0.063
go+0.062
Neg logit contributions
 grandmother-0.093
 gen-0.083
 parent-0.083
 grandfather-0.081
 lamp-0.079
Token red
Token ID2266
Feature activation-1.84e-02

Pos logit contributions
 grandmother+0.034
 gen+0.031
 parent+0.030
 lamp+0.030
 grandfather+0.029
Neg logit contributions
|-0.033
des-0.027
 jog-0.026
 stroll-0.025
go-0.025
Token ball
Token ID2613
Feature activation-1.19e-02

Pos logit contributions
|+0.029
 jog+0.025
 walk+0.023
 picnic+0.023
 stroll+0.023
Neg logit contributions
 grandmother-0.036
illian-0.032
 gen-0.032
 grandfather-0.032
 parent-0.031
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
|+0.017
des+0.013
 jog+0.013
 stroll+0.012
go+0.012
Neg logit contributions
 grandmother-0.026
 gen-0.023
 parent-0.023
 lamp-0.023
 grandfather-0.023
Token But
Token ID887
Feature activation+1.02e-01

Pos logit contributions
 grandmother+0.007
 gen+0.007
 parent+0.007
 grandfather+0.007
 lamp+0.006
Neg logit contributions
|-0.007
des-0.006
 jog-0.006
 stroll-0.005
go-0.005
Token her
Token ID607
Feature activation+4.64e-02

Pos logit contributions
 grandmother+0.230
 gen+0.219
ware+0.210
illian+0.210
aby+0.210
Neg logit contributions
|-0.117
 jog-0.100
 walks-0.094
 play-0.094
 walk-0.092
Token mom
Token ID1995
Feature activation+5.64e-02

Pos logit contributions
 grandmother+0.047
 gen+0.039
 parent+0.036
ware+0.035
 grandfather+0.035
Neg logit contributions
|-0.119
 jog-0.105
des-0.103
 stroll-0.101
go-0.101
Token said
Token ID531
Feature activation+2.61e-02

Pos logit contributions
 grandmother+0.128
 gen+0.119
ware+0.118
aby+0.115
 parent+0.115
Neg logit contributions
|-0.069
go-0.056
 jog-0.052
des-0.050
 walk-0.050
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
 grandmother+0.056
 gen+0.051
 parent+0.051
 grandfather+0.050
 lamp+0.049
Neg logit contributions
|-0.038
des-0.029
 jog-0.029
go-0.028
 stroll-0.028
Token "
Token ID366
Feature activation-6.24e-04

Pos logit contributions
 grandmother+0.016
 gen+0.014
 parent+0.014
 grandfather+0.013
aby+0.013
Neg logit contributions
|-0.019
 jog-0.017
go-0.016
 stroll-0.016
 walks-0.016
Token what
Token ID644
Feature activation+4.71e-02

Pos logit contributions
 grandmother+0.080
 gen+0.073
 parent+0.072
omm+0.072
 grandfather+0.070
Neg logit contributions
|-0.053
 walks-0.042
 jog-0.041
 stroll-0.039
des-0.038
Token to
Token ID284
Feature activation-1.13e-02

Pos logit contributions
 grandmother+0.093
 gen+0.083
 parent+0.082
 grandfather+0.080
 lamp+0.080
Neg logit contributions
|-0.080
des-0.064
 jog-0.062
go-0.060
 stroll-0.060
Token do
Token ID466
Feature activation-2.35e-02

Pos logit contributions
|+0.015
des+0.012
ashing+0.011
 stroll+0.010
go+0.010
Neg logit contributions
 grandmother-0.026
 gen-0.024
 parent-0.024
 lamp-0.024
 grandfather-0.023
Token.
Token ID13
Feature activation-4.47e-04

Pos logit contributions
|+0.047
 stroll+0.038
des+0.038
ited+0.038
 jog+0.037
Neg logit contributions
 grandmother-0.039
 lamp-0.033
 parent-0.033
 gen-0.033
 grandfather-0.032
Token She
Token ID1375
Feature activation+4.21e-02

Pos logit contributions
|+0.001
des+0.001
 jog+0.001
 stroll+0.001
go+0.001
Neg logit contributions
 grandmother-0.001
 gen-0.001
 parent-0.001
 grandfather-0.001
 lamp-0.001
Token remembered
Token ID12086
Feature activation+1.04e-01

Pos logit contributions
 grandmother+0.091
 gen+0.082
 parent+0.082
 grandfather+0.080
 lamp+0.079
Neg logit contributions
|-0.063
des-0.048
 jog-0.047
go-0.045
 stroll-0.045
Token that
Token ID326
Feature activation+2.94e-03

Pos logit contributions
 grandmother+0.212
 gen+0.201
omm+0.191
 parent+0.191
ware+0.190
Neg logit contributions
|-0.153
 jog-0.127
 walks-0.126
des-0.122
 stroll-0.121
Token her
Token ID607
Feature activation+3.39e-02

Pos logit contributions
 grandmother+0.006
omm+0.006
 gen+0.006
ware+0.006
aby+0.006
Neg logit contributions
|-0.004
 jog-0.003
 walks-0.003
 walk-0.003
 play-0.003
Token mom
Token ID1995
Feature activation+4.35e-02

Pos logit contributions
 grandmother+0.048
 gen+0.045
ware+0.044
illian+0.044
omm+0.044
Neg logit contributions
|-0.069
 jog-0.062
 walks-0.059
 walk-0.059
 stroll-0.058
Token told
Token ID1297
Feature activation+4.75e-02

Pos logit contributions
 grandmother+0.091
 gen+0.083
 parent+0.082
 grandfather+0.080
 lamp+0.079
Neg logit contributions
|-0.066
des-0.051
 jog-0.050
go-0.049
 stroll-0.048
Token her
Token ID607
Feature activation+1.02e-02

Pos logit contributions
 grandmother+0.078
 parent+0.067
 lamp+0.064
 gen+0.063
 grandfather+0.063
Neg logit contributions
|-0.099
des-0.084
go-0.080
ashing-0.078
 jog-0.077
Token healthy
Token ID5448
Feature activation-4.30e-02

Pos logit contributions
|+0.107
go+0.081
des+0.078
war+0.076
ited+0.071
Neg logit contributions
 grandmother-0.147
 parent-0.132
 grandfather-0.129
 mother-0.125
 gen-0.124
Token and
Token ID290
Feature activation-3.29e-02

Pos logit contributions
|+0.074
des+0.058
go+0.056
 jog+0.053
 stroll+0.053
Neg logit contributions
 grandmother-0.085
 parent-0.075
 gen-0.074
 grandfather-0.073
 lamp-0.073
Token safe
Token ID3338
Feature activation-1.71e-02

Pos logit contributions
|+0.046
des+0.032
go+0.031
 stroll+0.030
 jog+0.030
Neg logit contributions
 grandmother-0.075
 parent-0.067
 gen-0.067
 grandfather-0.066
 lamp-0.065
Token.
Token ID13
Feature activation+6.43e-03

Pos logit contributions
|+0.027
des+0.021
go+0.020
 jog+0.020
 stroll+0.019
Neg logit contributions
 grandmother-0.036
 gen-0.032
 parent-0.032
 grandfather-0.031
 lamp-0.031
Token Even
Token ID3412
Feature activation+1.15e-01

Pos logit contributions
 grandmother+0.014
 gen+0.014
ware+0.013
 parent+0.013
 lamp+0.013
Neg logit contributions
|-0.009
des-0.007
 jog-0.006
 walks-0.006
 stroll-0.006
Token though
Token ID996
Feature activation+1.10e-01

Pos logit contributions
 grandmother+0.221
 gen+0.199
 parent+0.194
omm+0.194
 lamp+0.189
Neg logit contributions
|-0.195
 jog-0.159
des-0.156
 walks-0.151
 stroll-0.150
Token it
Token ID340
Feature activation-4.22e-03

Pos logit contributions
 grandmother+0.224
 gen+0.203
 parent+0.200
 grandfather+0.195
 lamp+0.195
Neg logit contributions
|-0.174
 jog-0.135
des-0.134
 stroll-0.130
 walks-0.130
Token was
Token ID373
Feature activation+1.18e-02

Pos logit contributions
|+0.007
des+0.006
 jog+0.005
Yo+0.005
go+0.005
Neg logit contributions
 grandmother-0.008
 gen-0.007
 parent-0.007
 lamp-0.007
 grandfather-0.007
Token hard
Token ID1327
Feature activation-2.86e-02

Pos logit contributions
 grandmother+0.025
 parent+0.022
 gen+0.022
 grandfather+0.021
 lamp+0.021
Neg logit contributions
|-0.019
des-0.015
go-0.014
 jog-0.014
 stroll-0.013
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
|+0.049
des+0.039
go+0.038
Yo+0.036
war+0.035
Neg logit contributions
 grandmother-0.058
 parent-0.052
 grandfather-0.050
 lamp-0.049
 gen-0.049
Token he
Token ID339
Feature activation-2.50e-02

Pos logit contributions
 grandmother+0.049
 gen+0.045
omm+0.044
 parent+0.044
aby+0.044
Neg logit contributions
|-0.032
 jog-0.026
 walks-0.024
des-0.024
 stroll-0.024
Token They
Token ID1119
Feature activation-2.09e-02

Pos logit contributions
 grandmother+0.005
 gen+0.005
 parent+0.005
 lamp+0.005
ware+0.005
Neg logit contributions
|-0.003
 jog-0.002
des-0.002
 stroll-0.002
 walks-0.002
Token went
Token ID1816
Feature activation-9.33e-02

Pos logit contributions
|+0.032
des+0.026
go+0.023
 jog+0.023
 stroll+0.023
Neg logit contributions
 grandmother-0.044
 parent-0.040
 gen-0.039
 grandfather-0.039
 mother-0.038
Token to
Token ID284
Feature activation+2.31e-03

Pos logit contributions
|+0.131
des+0.100
go+0.093
 jog+0.092
 stroll+0.089
Neg logit contributions
 grandmother-0.210
 parent-0.188
 gen-0.187
 grandfather-0.184
 lamp-0.183
Token get
Token ID651
Feature activation-3.42e-03

Pos logit contributions
 grandmother+0.005
 parent+0.004
 gen+0.004
 grandfather+0.004
 lamp+0.004
Neg logit contributions
|-0.004
des-0.003
go-0.003
 jog-0.003
 stroll-0.003
Token ice
Token ID4771
Feature activation-4.62e-02

Pos logit contributions
|+0.006
des+0.005
go+0.004
 jog+0.004
 stroll+0.004
Neg logit contributions
 grandmother-0.007
 gen-0.006
 parent-0.006
 lamp-0.006
 grandfather-0.006
Token cream
Token ID8566
Feature activation+2.06e-03

Pos logit contributions
|+0.091
 stroll+0.071
des+0.070
 jog+0.068
 walks+0.067
Neg logit contributions
 grandmother-0.076
ware-0.070
 gen-0.069
 lamp-0.067
rings-0.064
Token from
Token ID422
Feature activation+1.48e-02

Pos logit contributions
 grandmother+0.004
 gen+0.004
 parent+0.004
 grandfather+0.004
 lamp+0.004
Neg logit contributions
|-0.003
des-0.003
 jog-0.002
 stroll-0.002
go-0.002
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
 grandmother+0.034
 gen+0.032
 parent+0.031
omm+0.030
 grandfather+0.030
Neg logit contributions
|-0.020
 jog-0.014
des-0.014
 stroll-0.014
 walks-0.013
Token shop
Token ID6128
Feature activation+2.72e-02

Pos logit contributions
|+0.022
 jog+0.020
 picnic+0.018
 walk+0.018
 walks+0.018
Neg logit contributions
 grandmother-0.021
aby-0.019
illian-0.018
 Ang-0.018
 gen-0.018
Token and
Token ID290
Feature activation-2.74e-02

Pos logit contributions
 grandmother+0.062
 gen+0.060
 parent+0.058
omm+0.057
 lamp+0.057
Neg logit contributions
|-0.031
 jog-0.024
des-0.023
 walk-0.023
 stroll-0.022
Token Mark
Token ID2940
Feature activation+2.83e-03

Pos logit contributions
|+0.038
des+0.028
 jog+0.028
 stroll+0.027
go+0.026
Neg logit contributions
 grandmother-0.061
 gen-0.056
 parent-0.056
 grandfather-0.054
 lamp-0.054
Token see
Token ID766
Feature activation+1.40e-02

Pos logit contributions
|+0.036
 jog+0.028
des+0.026
go+0.026
 walk+0.025
Neg logit contributions
 grandmother-0.057
 gen-0.052
 parent-0.051
 grandfather-0.051
omm-0.050
Token who
Token ID508
Feature activation+4.70e-02

Pos logit contributions
 grandmother+0.031
omm+0.031
 gen+0.029
aby+0.029
 parent+0.028
Neg logit contributions
|-0.015
 walks-0.014
 jog-0.014
 play-0.013
 walk-0.012
Token could
Token ID714
Feature activation+2.50e-02

Pos logit contributions
 grandmother+0.108
 parent+0.098
 gen+0.098
 grandfather+0.096
omm+0.095
Neg logit contributions
|-0.064
des-0.046
 jog-0.046
go-0.045
 stroll-0.043
Token make
Token ID787
Feature activation+4.58e-02

Pos logit contributions
 grandmother+0.059
omm+0.054
 grandfather+0.053
 gen+0.053
 parent+0.053
Neg logit contributions
|-0.031
 jog-0.025
 walk-0.024
 play-0.023
Yo-0.022
Token the
Token ID262
Feature activation-4.21e-03

Pos logit contributions
 grandmother+0.101
 parent+0.090
 grandfather+0.090
 gen+0.090
omm+0.089
Neg logit contributions
|-0.061
 walks-0.049
 jog-0.047
des-0.046
Yo-0.046
Token tallest
Token ID38760
Feature activation+7.52e-04

Pos logit contributions
|+0.007
 jog+0.006
Yo+0.006
go+0.006
 walks+0.006
Neg logit contributions
 grandmother-0.008
aby-0.007
illian-0.007
 Ang-0.007
omm-0.007
Token tower
Token ID10580
Feature activation-9.59e-03

Pos logit contributions
 grandmother+0.001
aby+0.001
 parent+0.001
omm+0.001
 grandfather+0.001
Neg logit contributions
|-0.001
go-0.001
 jog-0.001
 walks-0.001
-0.001
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
|+0.015
des+0.013
 jog+0.012
 stroll+0.012
ited+0.011
Neg logit contributions
 grandmother-0.019
 gen-0.018
 lamp-0.018
 parent-0.017
ware-0.017
Token They
Token ID1119
Feature activation-3.76e-02

Pos logit contributions
|+0.024
des+0.020
go+0.018
 jog+0.018
ited+0.018
Neg logit contributions
 grandmother-0.031
 parent-0.027
 grandfather-0.026
 mother-0.026
 gen-0.026
Token used
Token ID973
Feature activation+7.77e-02

Pos logit contributions
|+0.047
des+0.033
 jog+0.033
go+0.032
 stroll+0.031
Neg logit contributions
 grandmother-0.090
 gen-0.083
 parent-0.081
 grandfather-0.080
 lamp-0.080
Token all
Token ID477
Feature activation-1.56e-02

Pos logit contributions
 grandmother+0.195
omm+0.178
 gen+0.178
 grandfather+0.176
 parent+0.175
Neg logit contributions
|-0.082
 walks-0.065
 jog-0.062
 stroll-0.060
 walk-0.059
Token!
Token ID0
Feature activation-1.30e-02

Pos logit contributions
|+0.059
des+0.046
go+0.044
 jog+0.043
 stroll+0.042
Neg logit contributions
 grandmother-0.067
 gen-0.061
 lamp-0.060
 parent-0.060
ware-0.058
Token How
Token ID1374
Feature activation-4.19e-03

Pos logit contributions
|+0.018
+0.013
des+0.012
+0.012
 jog+0.012
Neg logit contributions
 grandmother-0.030
 gen-0.027
ware-0.027
 parent-0.027
 grandfather-0.027
Token can
Token ID460
Feature activation+3.39e-02

Pos logit contributions
|+0.007
des+0.005
 stroll+0.005
 jog+0.005
 walks+0.005
Neg logit contributions
 grandmother-0.008
 lamp-0.008
 gen-0.007
 parent-0.007
illian-0.007
Token you
Token ID345
Feature activation-7.76e-02

Pos logit contributions
 grandmother+0.072
 gen+0.065
 parent+0.064
 grandfather+0.064
 lamp+0.063
Neg logit contributions
|-0.050
 jog-0.039
des-0.039
 stroll-0.037
go-0.037
Token make
Token ID787
Feature activation+4.49e-02

Pos logit contributions
|+0.123
des+0.096
go+0.090
 stroll+0.087
 jog+0.087
Neg logit contributions
 grandmother-0.159
 gen-0.144
 parent-0.143
 lamp-0.139
 grandfather-0.138
Token tea
Token ID8887
Feature activation+1.17e-02

Pos logit contributions
 grandmother+0.099
 gen+0.091
 parent+0.090
 grandfather+0.089
 lamp+0.087
Neg logit contributions
|-0.062
 jog-0.047
des-0.047
 walks-0.045
 stroll-0.045
Token so
Token ID523
Feature activation-3.89e-02

Pos logit contributions
 grandmother+0.025
 gen+0.022
 parent+0.022
 lamp+0.022
ware+0.021
Neg logit contributions
|-0.018
des-0.015
 stroll-0.013
go-0.013
 walks-0.013
Token quickly
Token ID2952
Feature activation-6.52e-03

Pos logit contributions
|+0.057
des+0.043
 jog+0.041
go+0.041
 stroll+0.041
Neg logit contributions
 grandmother-0.085
 gen-0.077
 parent-0.077
 lamp-0.075
 grandfather-0.074
Token?"
Token ID1701
Feature activation+1.41e-02

Pos logit contributions
|+0.010
des+0.008
go+0.007
 jog+0.007
 stroll+0.007
Neg logit contributions
 grandmother-0.014
 gen-0.012
 parent-0.012
 lamp-0.012
 grandfather-0.012
Token
Token ID220
Feature activation+2.91e-02

Pos logit contributions
 grandmother+0.034
 gen+0.031
 parent+0.031
 grandfather+0.030
 lamp+0.030
Neg logit contributions
|-0.018
des-0.012
 jog-0.012
 stroll-0.012
-0.012
Token
Token ID198
Feature activation-3.10e-04

Pos logit contributions
 grandmother+0.062
 parent+0.056
 gen+0.055
 grandfather+0.054
 lamp+0.054
Neg logit contributions
|-0.044
des-0.035
 jog-0.033
go-0.032
 stroll-0.032
Token her
Token ID607
Feature activation+2.96e-02

Pos logit contributions
 grandmother+0.013
omm+0.013
 gen+0.012
illian+0.012
ware+0.012
Neg logit contributions
|-0.007
 play-0.006
 walks-0.006
 jog-0.005
 walk-0.005
Token normal
Token ID3487
Feature activation+1.15e-02

Pos logit contributions
 grandmother+0.052
 gen+0.047
 parent+0.046
omm+0.045
 grandfather+0.045
Neg logit contributions
|-0.053
 jog-0.043
des-0.043
 stroll-0.042
 walks-0.042
Token size
Token ID2546
Feature activation+3.83e-02

Pos logit contributions
 grandmother+0.017
 gen+0.015
illian+0.015
 grandfather+0.015
omm+0.015
Neg logit contributions
|-0.023
 jog-0.020
des-0.019
 stroll-0.019
 walks-0.019
Token.
Token ID13
Feature activation+2.49e-03

Pos logit contributions
 grandmother+0.083
 gen+0.075
 parent+0.075
 grandfather+0.073
 lamp+0.073
Neg logit contributions
|-0.056
des-0.043
 jog-0.041
go-0.040
 stroll-0.040
Token From
Token ID3574
Feature activation+3.06e-02

Pos logit contributions
 grandmother+0.006
 gen+0.005
 parent+0.005
ware+0.005
 grandfather+0.005
Neg logit contributions
|-0.004
des-0.002
-0.002
go-0.002
 jog-0.002
Token that
Token ID326
Feature activation+9.07e-03

Pos logit contributions
 grandmother+0.055
 gen+0.048
 parent+0.048
 grandfather+0.047
 lamp+0.046
Neg logit contributions
|-0.058
des-0.048
 jog-0.046
go-0.045
 stroll-0.045
Token day
Token ID1110
Feature activation-4.14e-02

Pos logit contributions
 grandmother+0.022
 parent+0.020
 lamp+0.020
 gen+0.020
 mother+0.019
Neg logit contributions
|-0.011
des-0.009
 walks-0.008
Yo-0.008
ashes-0.008
Token on
Token ID319
Feature activation-2.95e-02

Pos logit contributions
|+0.068
des+0.055
 stroll+0.051
 pic+0.049
Yo+0.049
Neg logit contributions
 grandmother-0.082
 parent-0.076
 lamp-0.072
 gen-0.072
illian-0.071
Token,
Token ID11
Feature activation+2.66e-02

Pos logit contributions
|+0.045
des+0.036
go+0.034
 jog+0.033
 stroll+0.032
Neg logit contributions
 grandmother-0.064
 gen-0.057
 parent-0.057
 lamp-0.056
 grandfather-0.055
Token Lily
Token ID20037
Feature activation+1.08e-02

Pos logit contributions
 grandmother+0.047
 gen+0.042
 parent+0.042
 grandfather+0.040
 lamp+0.040
Neg logit contributions
|-0.050
des-0.041
 jog-0.040
go-0.039
 stroll-0.038
Token never
Token ID1239
Feature activation+2.56e-02

Pos logit contributions
 grandmother+0.022
 parent+0.020
 gen+0.020
 grandfather+0.020
 lamp+0.020
Neg logit contributions
|-0.017
des-0.013
 jog-0.013
 stroll-0.013
go-0.012
Token you
Token ID345
Feature activation-7.16e-02

Pos logit contributions
 grandmother+0.012
 gen+0.011
 parent+0.011
 grandfather+0.011
 lamp+0.010
Neg logit contributions
|-0.011
 jog-0.010
go-0.009
des-0.009
 walks-0.009
Token are
Token ID389
Feature activation-2.35e-03

Pos logit contributions
|+0.071
 jog+0.045
des+0.043
 stroll+0.040
go+0.039
Neg logit contributions
 grandmother-0.191
 gen-0.177
ware-0.176
 grandfather-0.174
 parent-0.172
Token very
Token ID845
Feature activation-3.85e-02

Pos logit contributions
|+0.003
des+0.002
 jog+0.002
go+0.002
 stroll+0.002
Neg logit contributions
 grandmother-0.005
 gen-0.005
 parent-0.005
 grandfather-0.005
 lamp-0.005
Token kind
Token ID1611
Feature activation-2.73e-03

Pos logit contributions
|+0.057
des+0.045
 jog+0.043
go+0.042
 stroll+0.041
Neg logit contributions
 grandmother-0.083
 gen-0.075
 parent-0.075
 grandfather-0.074
 lamp-0.073
Token.
Token ID13
Feature activation-4.94e-03

Pos logit contributions
|+0.003
des+0.002
 jog+0.002
go+0.002
 walks+0.002
Neg logit contributions
 grandmother-0.007
 gen-0.006
 parent-0.006
 grandfather-0.006
 lamp-0.006
Token Of
Token ID3226
Feature activation+4.29e-02

Pos logit contributions
|+0.006
des+0.004
 jog+0.004
+0.004
go+0.004
Neg logit contributions
 grandmother-0.012
ware-0.011
 gen-0.011
 parent-0.011
 lamp-0.011
Token course
Token ID1781
Feature activation+1.84e-02

Pos logit contributions
 grandmother+0.081
 gen+0.074
 parent+0.072
 fairy+0.069
 mother+0.069
Neg logit contributions
|-0.078
ashing-0.069
des-0.062
ashes-0.059
go-0.059
Token you
Token ID345
Feature activation-7.36e-02

Pos logit contributions
 grandmother+0.040
 parent+0.035
 gen+0.035
 mother+0.035
 grandfather+0.034
Neg logit contributions
|-0.029
des-0.022
war-0.020
go-0.020
ashing-0.020
Token can
Token ID460
Feature activation+2.56e-02

Pos logit contributions
|+0.090
 jog+0.063
des+0.063
go+0.059
 stroll+0.058
Neg logit contributions
 grandmother-0.180
 gen-0.164
 parent-0.162
 grandfather-0.160
ware-0.159
Token touch
Token ID3638
Feature activation+2.21e-02

Pos logit contributions
 grandmother+0.059
 gen+0.054
 parent+0.053
 grandfather+0.052
 lamp+0.052
Neg logit contributions
|-0.035
des-0.026
 jog-0.025
go-0.024
 stroll-0.024
Token it
Token ID340
Feature activation-4.27e-02

Pos logit contributions
 grandmother+0.052
 gen+0.048
omm+0.048
 parent+0.048
 grandfather+0.047
Neg logit contributions
|-0.026
 walks-0.020
 jog-0.019
des-0.019
-0.018
Token time
Token ID640
Feature activation-9.93e-02

Pos logit contributions
|+0.023
beh+0.020
cha+0.019
hed+0.019
war+0.018
Neg logit contributions
 grandmother-0.033
 mother-0.030
 parent-0.029
omm-0.029
 lamp-0.029
Token,
Token ID11
Feature activation-2.14e-02

Pos logit contributions
|+0.184
des+0.166
ited+0.159
war+0.158
go+0.154
Neg logit contributions
 grandmother-0.162
 lamp-0.143
 gen-0.142
omm-0.142
 parent-0.140
Token there
Token ID612
Feature activation-5.51e-02

Pos logit contributions
|+0.032
des+0.027
 jog+0.025
 stroll+0.024
 walks+0.024
Neg logit contributions
 grandmother-0.045
 lamp-0.041
 gen-0.041
 parent-0.041
 mother-0.039
Token was
Token ID373
Feature activation-2.81e-02

Pos logit contributions
|+0.098
des+0.085
 jog+0.078
 stroll+0.078
go+0.078
Neg logit contributions
 grandmother-0.099
 gen-0.089
 lamp-0.088
 parent-0.087
 grandfather-0.083
Token a
Token ID257
Feature activation-8.10e-03

Pos logit contributions
|+0.057
des+0.048
ited+0.048
war+0.046
rated+0.046
Neg logit contributions
 grandmother-0.044
 lamp-0.038
 parent-0.037
 gen-0.037
 scissors-0.035
Token dolphin
Token ID44136
Feature activation-5.21e-02

Pos logit contributions
|+0.015
des+0.012
 jog+0.012
go+0.012
 stroll+0.011
Neg logit contributions
 grandmother-0.015
 gen-0.013
 parent-0.013
 grandfather-0.012
 lamp-0.012
Token named
Token ID3706
Feature activation+2.50e-04

Pos logit contributions
|+0.095
des+0.078
 jog+0.074
 stroll+0.074
go+0.073
Neg logit contributions
 grandmother-0.094
 parent-0.084
 gen-0.084
 lamp-0.081
 grandfather-0.080
Token Flip
Token ID29583
Feature activation-3.27e-02

Pos logit contributions
 grandmother+0.000
 parent+0.000
 gen+0.000
 grandfather+0.000
 lamp+0.000
Neg logit contributions
|-0.000
des-0.000
go-0.000
 jog-0.000
ited-0.000
Token.
Token ID13
Feature activation-3.55e-02

Pos logit contributions
|+0.049
des+0.038
 jog+0.036
 stroll+0.035
go+0.035
Neg logit contributions
 grandmother-0.071
 parent-0.064
 gen-0.064
 grandfather-0.062
 lamp-0.062
Token Flip
Token ID29583
Feature activation-7.99e-03

Pos logit contributions
|+0.070
des+0.062
go+0.058
venture+0.056
 stroll+0.056
Neg logit contributions
 grandmother-0.054
 parent-0.049
 gen-0.048
 lamp-0.047
omm-0.047
Token was
Token ID373
Feature activation-3.17e-03

Pos logit contributions
|+0.015
des+0.014
war+0.013
ited+0.013
 stroll+0.013
Neg logit contributions
 grandmother-0.013
 parent-0.012
 gen-0.012
 lamp-0.011
 grandfather-0.011
Token porch
Token ID33179
Feature activation-1.22e-02

Pos logit contributions
|+0.062
 jog+0.053
 walk+0.050
 walks+0.050
des+0.050
Neg logit contributions
 grandmother-0.051
 gen-0.047
omm-0.044
illian-0.044
 parent-0.044
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
|+0.014
des+0.010
 jog+0.010
go+0.010
 walks+0.009
Neg logit contributions
 grandmother-0.030
 gen-0.027
 parent-0.027
 grandfather-0.027
 Ang-0.026
Token<|endoftext|>
Token ID50256
Feature activation-1.00e-01

Pos logit contributions
 grandmother+0.004
 parent+0.004
 gen+0.004
 grandfather+0.004
 lamp+0.004
Neg logit contributions
|-0.003
des-0.002
 jog-0.002
go-0.002
ited-0.002
TokenBen
Token ID11696
Feature activation-5.25e-02

Pos logit contributions
|+0.165
des+0.164
ited+0.164
rated+0.153
ashing+0.148
Neg logit contributions
 grandmother-0.166
 lamp-0.158
 gen-0.156
ware-0.150
 Genie-0.147
Token and
Token ID290
Feature activation-6.15e-02

Pos logit contributions
|+0.096
des+0.083
 stroll+0.080
war+0.078
ued+0.078
Neg logit contributions
 grandmother-0.085
 lamp-0.079
 parent-0.078
 wizard-0.077
 grandfather-0.077
Token Mia
Token ID32189
Feature activation-4.06e-02

Pos logit contributions
|+0.099
des+0.077
ited+0.076
ashing+0.075
go+0.073
Neg logit contributions
 grandmother-0.126
 parent-0.116
 gen-0.111
 grandfather-0.111
 mother-0.110
Token were
Token ID547
Feature activation-3.12e-02

Pos logit contributions
|+0.071
war+0.061
ashing+0.060
 unnecessary+0.060
ogging+0.059
Neg logit contributions
 grandmother-0.059
 queen-0.058
 lamp-0.056
 parent-0.056
 scissors-0.056
Token playing
Token ID2712
Feature activation-1.06e-01

Pos logit contributions
|+0.071
war+0.063
des+0.061
rated+0.058
acks+0.058
Neg logit contributions
 grandmother-0.049
 parent-0.043
 mother-0.039
 grandfather-0.039
 lamp-0.037
Token in
Token ID287
Feature activation-2.76e-02

Pos logit contributions
|+0.171
des+0.140
ited+0.128
go+0.125
 wanna+0.121
Neg logit contributions
 grandmother-0.216
 lamp-0.204
ware-0.198
 parent-0.196
 gen-0.196
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
|+0.041
des+0.033
go+0.031
 jog+0.030
 stroll+0.029
Neg logit contributions
 grandmother-0.059
 parent-0.053
 gen-0.053
 lamp-0.053
 grandfather-0.052
Token yard
Token ID12699
Feature activation-3.32e-02

Pos logit contributions
|+0.070
des+0.059
go+0.056
ited+0.054
war+0.054
Neg logit contributions
 grandmother-0.060
 parent-0.053
 lamp-0.053
 grandfather-0.050
 gen-0.050
Token ice
Token ID4771
Feature activation-4.23e-02

Pos logit contributions
|+0.064
des+0.053
 jog+0.052
 walks+0.051
 walk+0.049
Neg logit contributions
 grandmother-0.073
 gen-0.064
 parent-0.064
 grandfather-0.064
aby-0.063
Token cream
Token ID8566
Feature activation+7.54e-03

Pos logit contributions
|+0.082
des+0.066
 stroll+0.066
 jog+0.066
go+0.061
Neg logit contributions
 grandmother-0.073
 gen-0.066
ware-0.065
 lamp-0.065
 parent-0.062
Token and
Token ID290
Feature activation-3.52e-02

Pos logit contributions
 grandmother+0.016
 gen+0.014
 parent+0.014
 lamp+0.014
 grandfather+0.014
Neg logit contributions
|-0.012
des-0.009
 jog-0.009
 stroll-0.009
go-0.008
Token fruit
Token ID8234
Feature activation+3.26e-02

Pos logit contributions
|+0.057
des+0.046
 jog+0.045
 walks+0.043
go+0.043
Neg logit contributions
 grandmother-0.070
 gen-0.062
 parent-0.062
 grandfather-0.061
omm-0.060
Token.
Token ID13
Feature activation+4.97e-04

Pos logit contributions
 grandmother+0.063
 gen+0.058
ware+0.057
 lamp+0.057
 parent+0.056
Neg logit contributions
|-0.057
des-0.044
 stroll-0.042
 jog-0.042
go-0.040
Token They
Token ID1119
Feature activation-2.81e-02

Pos logit contributions
 grandmother+0.001
 gen+0.001
 parent+0.001
 grandfather+0.001
 lamp+0.001
Neg logit contributions
|-0.001
 jog-0.001
des-0.001
-0.000
 stroll-0.000
Token did
Token ID750
Feature activation+4.99e-03

Pos logit contributions
|+0.036
des+0.027
 jog+0.027
go+0.025
 walks+0.025
Neg logit contributions
 grandmother-0.065
 gen-0.060
 parent-0.059
 grandfather-0.058
 lamp-0.058
Token not
Token ID407
Feature activation+1.90e-02

Pos logit contributions
 grandmother+0.006
 lamp+0.005
 parent+0.005
 gen+0.005
 grandfather+0.005
Neg logit contributions
|-0.012
des-0.011
go-0.010
 stroll-0.010
hed-0.010
Token want
Token ID765
Feature activation-5.72e-02

Pos logit contributions
 grandmother+0.046
 parent+0.043
 lamp+0.042
 gen+0.042
 grandfather+0.041
Neg logit contributions
|-0.023
des-0.016
 stroll-0.015
go-0.015
 walks-0.015
Token to
Token ID284
Feature activation-1.72e-02

Pos logit contributions
|+0.072
 jog+0.054
 walks+0.054
des+0.052
 stroll+0.050
Neg logit contributions
 grandmother-0.130
 gen-0.121
 parent-0.119
aby-0.118
 grandfather-0.118
Token lose
Token ID4425
Feature activation+2.34e-02

Pos logit contributions
|+0.023
des+0.017
go+0.016
 jog+0.016
 stroll+0.016
Neg logit contributions
 grandmother-0.040
 parent-0.036
 gen-0.036
 lamp-0.036
 grandfather-0.035
Token to
Token ID284
Feature activation+4.76e-03

Pos logit contributions
 grandmother+0.086
 gen+0.084
ware+0.082
illian+0.081
aby+0.080
Neg logit contributions
|-0.032
 walks-0.024
 jog-0.024
 walk-0.022
 play-0.022
Token help
Token ID1037
Feature activation+3.77e-02

Pos logit contributions
 grandmother+0.010
 gen+0.009
omm+0.009
ware+0.009
 parent+0.009
Neg logit contributions
|-0.007
 jog-0.006
des-0.006
 walk-0.006
 walks-0.006
Token Max
Token ID5436
Feature activation-2.09e-02

Pos logit contributions
 grandmother+0.091
 gen+0.085
ware+0.082
 grandfather+0.082
 parent+0.082
Neg logit contributions
|-0.045
 jog-0.031
des-0.030
go-0.029
 walks-0.029
Token,
Token ID11
Feature activation+1.63e-02

Pos logit contributions
|+0.036
des+0.029
 pic+0.028
Yo+0.028
ited+0.028
Neg logit contributions
 grandmother-0.040
 parent-0.036
 lamp-0.036
 gen-0.035
 grandfather-0.034
Token but
Token ID475
Feature activation+4.91e-02

Pos logit contributions
aby+0.040
ware+0.040
 grandmother+0.039
omm+0.039
 gen+0.037
Neg logit contributions
|-0.014
 walks-0.012
 walk-0.012
 time-0.012
 jog-0.012
Token he
Token ID339
Feature activation-2.52e-02

Pos logit contributions
 grandmother+0.112
ware+0.106
omm+0.105
aby+0.103
 grandfather+0.102
Neg logit contributions
|-0.059
 jog-0.045
 play-0.041
 walk-0.041
 down-0.040
Token said
Token ID531
Feature activation+3.19e-02

Pos logit contributions
|+0.029
go+0.021
 jog+0.019
 played+0.019
 walk+0.019
Neg logit contributions
 grandmother-0.061
ware-0.055
 gen-0.055
aby-0.054
omm-0.054
Token it
Token ID340
Feature activation-3.41e-02

Pos logit contributions
 grandmother+0.057
 parent+0.050
 gen+0.050
 grandfather+0.049
aby+0.049
Neg logit contributions
|-0.054
des-0.046
 jog-0.046
 walks-0.045
-0.043
Token was
Token ID373
Feature activation-2.61e-03

Pos logit contributions
|+0.048
des+0.036
 jog+0.035
go+0.034
 walks+0.033
Neg logit contributions
 grandmother-0.076
 gen-0.069
 parent-0.068
 grandfather-0.067
 lamp-0.066
Token too
Token ID1165
Feature activation-4.72e-02

Pos logit contributions
|+0.004
des+0.004
 jog+0.003
go+0.003
 stroll+0.003
Neg logit contributions
 grandmother-0.005
 parent-0.005
 gen-0.005
 grandfather-0.005
 lamp-0.005
Token late
Token ID2739
Feature activation-5.07e-02

Pos logit contributions
|+0.077
 stroll+0.054
des+0.054
hed+0.052
cha+0.052
Neg logit contributions
 grandmother-0.100
 parent-0.091
 gen-0.089
 grandfather-0.089
 mother-0.088
Token go
Token ID467
Feature activation-1.27e-01

Pos logit contributions
|+0.019
ashing+0.016
go+0.015
des+0.015
war+0.015
Neg logit contributions
 grandmother-0.029
 gen-0.027
 parent-0.027
 lamp-0.026
 mother-0.025
Token.
Token ID13
Feature activation+7.19e-04

Pos logit contributions
|+0.206
war+0.173
ited+0.172
go+0.168
des+0.168
Neg logit contributions
 grandmother-0.237
 parent-0.216
 lamp-0.215
 gen-0.211
 fairy-0.205
Token She
Token ID1375
Feature activation+4.37e-02

Pos logit contributions
 grandmother+0.001
 gen+0.001
 parent+0.001
 grandfather+0.001
 lamp+0.001
Neg logit contributions
|-0.001
des-0.001
 jog-0.001
go-0.001
 stroll-0.001
Token felt
Token ID2936
Feature activation+3.67e-02

Pos logit contributions
 grandmother+0.094
 gen+0.085
 parent+0.085
 grandfather+0.083
 lamp+0.082
Neg logit contributions
|-0.066
des-0.051
 jog-0.049
go-0.048
 stroll-0.047
Token sad
Token ID6507
Feature activation+1.41e-02

Pos logit contributions
 grandmother+0.072
 gen+0.067
 parent+0.066
 lamp+0.063
 mother+0.062
Neg logit contributions
|-0.062
go-0.047
des-0.047
war-0.046
 stroll-0.045
Token and
Token ID290
Feature activation-2.05e-02

Pos logit contributions
 grandmother+0.034
 gen+0.032
 parent+0.031
 grandfather+0.030
omm+0.030
Neg logit contributions
|-0.016
des-0.012
 jog-0.012
 walks-0.011
ashing-0.011
Token alone
Token ID3436
Feature activation-2.82e-02

Pos logit contributions
|+0.035
war+0.028
des+0.027
acks+0.027
go+0.027
Neg logit contributions
 grandmother-0.038
 parent-0.036
 gen-0.035
 mother-0.034
 lamp-0.034
Token.
Token ID13
Feature activation+7.53e-03

Pos logit contributions
|+0.046
des+0.037
 jog+0.035
go+0.034
 stroll+0.034
Neg logit contributions
 grandmother-0.057
 gen-0.051
 parent-0.051
 lamp-0.051
 grandfather-0.049
Token
Token ID198
Feature activation-6.56e-03

Pos logit contributions
 grandmother+0.016
 gen+0.015
 parent+0.015
 grandfather+0.014
 lamp+0.014
Neg logit contributions
|-0.011
des-0.009
 jog-0.008
go-0.008
 stroll-0.008
Token
Token ID198
Feature activation-5.69e-03

Pos logit contributions
|+0.010
des+0.008
 jog+0.008
ited+0.007
go+0.007
Neg logit contributions
 grandmother-0.014
 parent-0.012
 gen-0.012
 grandfather-0.012
 lamp-0.012
TokenTwe
Token ID32665
Feature activation-1.86e-03

Pos logit contributions
|+0.011
 dives+0.009
acks+0.009
ashes+0.009
war+0.009
Neg logit contributions
 grandmother-0.009
 gen-0.008
 parent-0.008
 lamp-0.007
 grandfather-0.007
Token stay
Token ID2652
Feature activation-6.10e-02

Pos logit contributions
|+0.010
des+0.007
 jog+0.007
go+0.007
 stroll+0.007
Neg logit contributions
 grandmother-0.013
 parent-0.012
 gen-0.012
 grandfather-0.011
 lamp-0.011
Token best
Token ID1266
Feature activation-3.39e-02

Pos logit contributions
|+0.105
go+0.080
des+0.077
war+0.077
venture+0.071
Neg logit contributions
 grandmother-0.131
 parent-0.118
 grandfather-0.114
 mother-0.113
aby-0.107
Token friends
Token ID2460
Feature activation-1.41e-02

Pos logit contributions
|+0.060
des+0.048
go+0.046
 jog+0.046
 stroll+0.045
Neg logit contributions
 grandmother-0.065
 parent-0.058
 gen-0.057
 grandfather-0.056
 lamp-0.055
Token forever
Token ID8097
Feature activation-2.16e-02

Pos logit contributions
|+0.023
des+0.018
go+0.017
 jog+0.017
 stroll+0.017
Neg logit contributions
 grandmother-0.029
 parent-0.026
 gen-0.026
 grandfather-0.025
 lamp-0.025
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
|+0.037
des+0.029
go+0.028
 jog+0.027
Yo+0.027
Neg logit contributions
 grandmother-0.043
 parent-0.038
 grandfather-0.037
 gen-0.037
 mother-0.036
Token<|endoftext|>
Token ID50256
Feature activation-9.45e-02

Pos logit contributions
|+0.007
des+0.006
 jog+0.006
ited+0.006
go+0.006
Neg logit contributions
 grandmother-0.009
 parent-0.008
 grandfather-0.008
 gen-0.008
 lamp-0.008
TokenM
Token ID44
Feature activation-3.75e-03

Pos logit contributions
|+0.162
des+0.159
ited+0.155
ashing+0.146
 jog+0.146
Neg logit contributions
 grandmother-0.155
 gen-0.136
 lamp-0.135
aby-0.131
 parent-0.128
Tokenolly
Token ID5098
Feature activation-3.50e-02

Pos logit contributions
|+0.005
des+0.004
hed+0.004
war+0.004
 jog+0.004
Neg logit contributions
 grandmother-0.009
 parent-0.008
omm-0.008
 mother-0.008
 gen-0.008
Token and
Token ID290
Feature activation-5.42e-02

Pos logit contributions
|+0.062
des+0.054
war+0.050
go+0.047
cluded+0.047
Neg logit contributions
 grandmother-0.066
 gen-0.057
 grandfather-0.057
 parent-0.056
 lamp-0.056
Token her
Token ID607
Feature activation+1.47e-02

Pos logit contributions
|+0.113
des+0.095
go+0.090
ashing+0.090
war+0.089
Neg logit contributions
 grandmother-0.086
 parent-0.076
 mother-0.074
 grandfather-0.071
 Ang-0.070
Token dad
Token ID9955
Feature activation-1.55e-02

Pos logit contributions
 grandmother+0.018
 parent+0.015
 mother+0.014
 grandfather+0.014
 lamp+0.013
Neg logit contributions
|-0.039
des-0.034
ited-0.032
war-0.031
go-0.031
Token was
Token ID373
Feature activation+1.87e-02

Pos logit contributions
 grandmother+0.049
 gen+0.044
 parent+0.044
 grandfather+0.043
 lamp+0.043
Neg logit contributions
|-0.031
des-0.024
 jog-0.023
go-0.022
 stroll-0.022
Token worried
Token ID7960
Feature activation+2.76e-02

Pos logit contributions
 grandmother+0.043
 gen+0.039
 parent+0.038
 grandfather+0.038
 lamp+0.037
Neg logit contributions
|-0.025
des-0.019
 jog-0.019
go-0.018
 stroll-0.018
Token that
Token ID326
Feature activation+8.61e-03

Pos logit contributions
 grandmother+0.060
 gen+0.055
 parent+0.054
 grandfather+0.053
 lamp+0.052
Neg logit contributions
|-0.040
des-0.031
 jog-0.030
 stroll-0.029
 walks-0.029
Token something
Token ID1223
Feature activation+6.38e-02

Pos logit contributions
 grandmother+0.019
omm+0.019
aby+0.018
ware+0.018
 gen+0.018
Neg logit contributions
|-0.010
 jog-0.009
 walks-0.008
 walk-0.008
 stroll-0.008
Token might
Token ID1244
Feature activation+4.79e-02

Pos logit contributions
 grandmother+0.099
 gen+0.097
 lamp+0.095
 parent+0.088
 wizard+0.087
Neg logit contributions
|-0.130
des-0.105
ashing-0.100
-0.100
ited-0.099
Token go
Token ID467
Feature activation-1.16e-01

Pos logit contributions
 grandmother+0.096
 gen+0.089
 parent+0.088
 lamp+0.086
 grandfather+0.084
Neg logit contributions
|-0.077
des-0.062
go-0.058
ashing-0.057
-0.056
Token wrong
Token ID2642
Feature activation-3.03e-03

Pos logit contributions
|+0.173
des+0.141
text+0.131
acks+0.131
eal+0.131
Neg logit contributions
 grandmother-0.245
 parent-0.223
 lamp-0.219
 grandfather-0.218
 gen-0.214
Token and
Token ID290
Feature activation-3.30e-02

Pos logit contributions
|+0.005
des+0.004
go+0.004
 jog+0.004
 stroll+0.003
Neg logit contributions
 grandmother-0.006
 parent-0.006
 gen-0.006
 grandfather-0.005
 lamp-0.005
Token that
Token ID326
Feature activation+1.24e-02

Pos logit contributions
|+0.058
des+0.046
 jog+0.046
go+0.044
 stroll+0.044
Neg logit contributions
 grandmother-0.062
 gen-0.056
 parent-0.055
 grandfather-0.054
 lamp-0.053
Token the
Token ID262
Feature activation+6.53e-03

Pos logit contributions
 grandmother+0.024
 gen+0.021
 parent+0.021
omm+0.021
 grandfather+0.020
Neg logit contributions
|-0.021
 jog-0.017
des-0.017
 walks-0.016
 stroll-0.016
Token machine
Token ID4572
Feature activation+3.80e-02

Pos logit contributions
 grandmother+0.011
aby+0.011
rings+0.011
illian+0.011
omm+0.011
Neg logit contributions
|-0.010
 jog-0.010
 walk-0.009
 walks-0.009
 time-0.009
Token.
Token ID13
Feature activation-9.76e-04

Pos logit contributions
|+0.017
des+0.014
Yo+0.013
 jog+0.013
war+0.013
Neg logit contributions
 grandmother-0.018
 parent-0.016
 grandfather-0.015
 gen-0.015
 lamp-0.015
Token<|endoftext|>
Token ID50256
Feature activation-9.68e-02

Pos logit contributions
|+0.001
des+0.001
 jog+0.001
ited+0.001
 stroll+0.001
Neg logit contributions
 grandmother-0.002
 parent-0.002
 gen-0.002
 grandfather-0.002
 lamp-0.002
TokenOnce
Token ID7454
Feature activation-3.70e-02

Pos logit contributions
ited+0.176
des+0.175
|+0.172
ashing+0.164
 jog+0.160
Neg logit contributions
 grandmother-0.143
 gen-0.127
 lamp-0.124
aby-0.122
 parent-0.116
Token upon
Token ID2402
Feature activation-3.70e-03

Pos logit contributions
|+0.069
des+0.059
 jog+0.056
go+0.055
 stroll+0.055
Neg logit contributions
 grandmother-0.065
 parent-0.057
 gen-0.056
 grandfather-0.055
 lamp-0.055
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
|+0.008
ashing+0.007
ited+0.007
 AW+0.007
ashes+0.007
Neg logit contributions
 grandmother-0.005
 lamp-0.004
 gen-0.004
 parent-0.004
 dentist-0.004
Token time
Token ID640
Feature activation-9.61e-02

Pos logit contributions
|+0.023
beh+0.023
cha+0.022
aling+0.020
hed+0.020
Neg logit contributions
 grandmother-0.026
 mother-0.023
 parent-0.022
 lamp-0.022
ena-0.022
Token,
Token ID11
Feature activation-2.02e-02

Pos logit contributions
|+0.190
des+0.180
ited+0.167
war+0.166
go+0.160
Neg logit contributions
 grandmother-0.140
 lamp-0.124
omm-0.123
 gen-0.122
 parent-0.121
Token there
Token ID612
Feature activation-5.46e-02

Pos logit contributions
|+0.037
des+0.036
acks+0.033
 walks+0.032
 pic+0.032
Neg logit contributions
 lamp-0.032
 grandmother-0.030
 gen-0.028
 parent-0.028
 mother-0.028
Token was
Token ID373
Feature activation-2.54e-02

Pos logit contributions
|+0.114
des+0.110
war+0.102
ashing+0.101
ited+0.100
Neg logit contributions
 grandmother-0.072
 lamp-0.068
 gen-0.066
 parent-0.062
 mother-0.060
Token a
Token ID257
Feature activation-8.82e-03

Pos logit contributions
|+0.055
des+0.048
ited+0.047
war+0.046
acks+0.046
Neg logit contributions
 grandmother-0.035
 lamp-0.031
 parent-0.029
 gen-0.028
 mother-0.027
Token big
Token ID1263
Feature activation-6.69e-02

Pos logit contributions
|+0.016
des+0.013
 jog+0.013
go+0.012
 stroll+0.012
Neg logit contributions
 grandmother-0.016
 gen-0.014
 parent-0.014
 grandfather-0.014
 lamp-0.014
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
 grandmother+0.006
 gen+0.005
 parent+0.005
 grandfather+0.005
 lamp+0.005
Neg logit contributions
|-0.004
des-0.003
 jog-0.003
go-0.003
 stroll-0.002
Token had
Token ID550
Feature activation-9.71e-03

Pos logit contributions
|+0.029
 jog+0.020
des+0.020
 stroll+0.019
 walks+0.019
Neg logit contributions
 grandmother-0.052
 gen-0.048
omm-0.048
 parent-0.047
 lamp-0.047
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
|+0.018
des+0.015
go+0.015
+0.015
ashing+0.014
Neg logit contributions
 grandmother-0.017
 parent-0.014
 gen-0.014
 grandfather-0.014
 lamp-0.014
Token lot
Token ID1256
Feature activation+3.21e-02

Pos logit contributions
 grandmother+0.027
 gen+0.024
 parent+0.024
 grandfather+0.023
 lamp+0.023
Neg logit contributions
|-0.014
des-0.010
go-0.010
 jog-0.009
hed-0.009
Token of
Token ID286
Feature activation+5.13e-02

Pos logit contributions
 grandmother+0.057
 parent+0.050
 gen+0.050
 grandfather+0.049
 lamp+0.048
Neg logit contributions
|-0.061
des-0.051
 jog-0.049
go-0.048
 stroll-0.048
Token fun
Token ID1257
Feature activation-1.14e-01

Pos logit contributions
 grandmother+0.110
 gen+0.098
 lamp+0.098
 parent+0.097
 grandfather+0.095
Neg logit contributions
|-0.073
go-0.054
ashing-0.054
war-0.052
-0.050
Token.
Token ID13
Feature activation+3.98e-03

Pos logit contributions
|+0.190
des+0.147
war+0.145
go+0.145
ited+0.141
Neg logit contributions
 grandmother-0.222
 lamp-0.197
 parent-0.196
 gen-0.195
ware-0.193
Token
Token ID220
Feature activation+2.87e-02

Pos logit contributions
 grandmother+0.009
 parent+0.008
 gen+0.008
 grandfather+0.008
 lamp+0.008
Neg logit contributions
|-0.006
des-0.004
 jog-0.004
go-0.004
 stroll-0.004
Token
Token ID198
Feature activation-5.20e-03

Pos logit contributions
 grandmother+0.060
 parent+0.054
 gen+0.053
 grandfather+0.052
omm+0.052
Neg logit contributions
|-0.043
des-0.035
 jog-0.033
ited-0.032
go-0.032
Token
Token ID198
Feature activation-2.74e-03

Pos logit contributions
|+0.008
des+0.007
ited+0.006
 stroll+0.006
 jog+0.006
Neg logit contributions
 grandmother-0.010
 gen-0.009
 parent-0.009
aby-0.009
 grandfather-0.009
TokenThe
Token ID464
Feature activation+4.29e-02

Pos logit contributions
|+0.005
des+0.004
 stroll+0.004
 walks+0.004
go+0.004
Neg logit contributions
 grandmother-0.005
 parent-0.004
 gen-0.004
illian-0.004
omm-0.004
Token.
Token ID13
Feature activation-1.24e-03

Pos logit contributions
|+0.109
des+0.090
 jog+0.083
ited+0.081
go+0.081
Neg logit contributions
 grandmother-0.117
 parent-0.105
 grandfather-0.101
 lamp-0.099
 mother-0.097
Token<|endoftext|>
Token ID50256
Feature activation-9.40e-02

Pos logit contributions
|+0.002
des+0.002
ited+0.002
go+0.002
 jog+0.002
Neg logit contributions
 grandmother-0.003
 parent-0.002
 gen-0.002
 lamp-0.002
 grandfather-0.002
TokenOnce
Token ID7454
Feature activation-3.48e-02

Pos logit contributions
|+0.155
des+0.151
ited+0.148
ashing+0.140
 jog+0.140
Neg logit contributions
 grandmother-0.162
 lamp-0.142
 gen-0.142
aby-0.138
 parent-0.137
Token upon
Token ID2402
Feature activation-4.19e-03

Pos logit contributions
|+0.060
des+0.048
 jog+0.047
go+0.046
 stroll+0.046
Neg logit contributions
 grandmother-0.066
 gen-0.059
 parent-0.058
 grandfather-0.057
 lamp-0.057
Token a
Token ID257
Feature activation-1.38e-02

Pos logit contributions
|+0.009
ited+0.008
ashing+0.008
war+0.008
eal+0.008
Neg logit contributions
 grandmother-0.005
 lamp-0.005
 gen-0.005
 parent-0.005
 dentist-0.004
Token time
Token ID640
Feature activation-9.81e-02

Pos logit contributions
beh+0.019
|+0.019
cha+0.018
 down+0.017
hed+0.017
Neg logit contributions
 grandmother-0.024
 mother-0.021
 parent-0.021
 lamp-0.021
aby-0.021
Token,
Token ID11
Feature activation-2.13e-02

Pos logit contributions
|+0.183
des+0.164
ited+0.157
war+0.156
go+0.151
Neg logit contributions
 grandmother-0.161
 lamp-0.141
 parent-0.141
 gen-0.140
omm-0.138
Token there
Token ID612
Feature activation-5.69e-02

Pos logit contributions
|+0.039
des+0.036
acks+0.035
eal+0.033
war+0.033
Neg logit contributions
 grandmother-0.035
 lamp-0.034
 parent-0.032
 gen-0.032
 mother-0.031
Token was
Token ID373
Feature activation-2.62e-02

Pos logit contributions
|+0.110
des+0.100
war+0.093
ashing+0.092
ited+0.091
Neg logit contributions
 grandmother-0.091
 lamp-0.082
 gen-0.081
 parent-0.079
 mother-0.075
Token a
Token ID257
Feature activation-8.08e-03

Pos logit contributions
|+0.055
des+0.048
ited+0.047
war+0.047
ashing+0.046
Neg logit contributions
 grandmother-0.038
 lamp-0.032
 parent-0.032
 gen-0.031
 grandfather-0.029
Token little
Token ID1310
Feature activation-1.34e-02

Pos logit contributions
|+0.015
des+0.012
 jog+0.012
go+0.012
 stroll+0.012
Neg logit contributions
 grandmother-0.014
 gen-0.013
 parent-0.013
 grandfather-0.012
 lamp-0.012