TOP ACTIVATIONS
MAX = 0.148

them and never threw a tant rum again . <|endoftext|> Once
temper and often threw temper tant rums . One day ,
his feet and threw a tant rum . "
not a monster , my de ars . That is a
so glad the rh in oc eros went away . He
you like the rh in oc eros . But remember ,
Sammy would often share av oc ados and other y ummy
the rain , and the de w . You can smell
learn more about rh in oc er oses and respect them
am not mad , my de ars . I am glad
that it produced more av oc ados than ever before .
stubborn . She throws a tant rum . She kicks and
" Thank you , my de ars . I appreciate your
're the best rh in oc eros ever !" Tim my
that boss y rh in oc eros . <|endoftext|> D ee
you like the rh in oc eros , but it is
thanked the elderly rh in oc eros for his wonderful surprise
happy for the rh in oc eros . She said ,
had discovered the rh in oc eros , but they had
" Thank you , my de ars ," she said .

MIN ACTIVATIONS
MIN = -0.144

ily and Sam went on a saf ari with their mom
things together . <|endoftext|> On a sunny day , Joe decided
so happy ! <|endoftext|> On a sunny day , a little
bed . <|endoftext|> Once Upon A Time there was a little
M andy who lived near a peaceful mountain . One day
friends again . <|endoftext|> On a bright and sunny day ,
upon a time , in a small town , there lived
upon a time , in a big green forest , there
Lily . She lived near a big river . One day
Lily . She lived near a small creek . One day
named Lily who lived near a big , scary volcano .
went fishing . <|endoftext|> On a bright and sunny day ,
Max . Max lived near a big river . One day
little girl who lived near a tunnel . She had a
upon a time , in a thick forest , there lived
upon a time , in a big forest , there was
, Jack was walking near a lake . Jack saw a
upon a time , in a small town , there was
upon a time , in a peaceful town , there was
upon a time , in a small house , Tim and

INTERVAL 0.075 - 0.148
CONTAINS 0.262%

. They felt very sad and sorry . They had lost
to never take my things without asking again . And you
kept falling . L ily said , " Tim
doing this ." L ily felt sad and scared
so Tim could catch it without breaking it . Tim was

INTERVAL 0.002 - 0.075
CONTAINS 51.001%

relieved and thanked the squirrel for his kindness . From then
. She felt so peaceful . She lay down in the
!" she says . She puts the cards back .
L ily smiled . She said , " OK
the tall grass , feeling the warm sun on his back

INTERVAL -0.071 - 0.002
CONTAINS 45.508%

'll try to rescue it , but I need
strange . The flowers she had picked are all gone !
play with toys and books . One day , they find
she loved to take pictures with . One day , she
center . It was a big building with many boxes and

INTERVAL -0.144 - -0.071
CONTAINS 3.229%

her new friend . <|endoftext|> Once upon a time , there
<|endoftext|> Once upon a time , there was a boy named
L ily and Ben were playing in the park . They
you . <|endoftext|> Once upon a time , a little girl
<|endoftext|> Once upon a time , there was a little girl
Token them
Token ID606
Feature activation+1.84e-02

Pos logit contributions
asing+0.117
ums+0.116
atted+0.116
 supervision+0.113
olds+0.109
Neg logit contributions
 park-0.062
 picnic-0.053
 forest-0.053
 store-0.052
 museum-0.050
Token and
Token ID290
Feature activation+5.24e-02

Pos logit contributions
 supervision+0.048
asing+0.048
ums+0.047
atted+0.047
aged+0.045
Neg logit contributions
 park-0.026
 forest-0.023
 picnic-0.022
 museum-0.022
 library-0.022
Token never
Token ID1239
Feature activation+5.63e-02

Pos logit contributions
ums+0.134
asing+0.132
atted+0.129
ging+0.127
 supervision+0.125
Neg logit contributions
 park-0.087
 store-0.076
 picnic-0.076
 forest-0.074
 ride-0.072
Token threw
Token ID9617
Feature activation+1.73e-02

Pos logit contributions
 supervision+0.132
asing+0.127
atted+0.126
olds+0.121
aged+0.120
Neg logit contributions
 park-0.096
 forest-0.086
 museum-0.085
 theater-0.084
 hike-0.084
Token a
Token ID257
Feature activation-2.67e-02

Pos logit contributions
ums+0.040
asing+0.040
 supervision+0.039
atted+0.039
olds+0.037
Neg logit contributions
 park-0.031
 picnic-0.028
 forest-0.028
 store-0.027
 ride-0.027
Token tant
Token ID24246
Feature activation+1.48e-01

Pos logit contributions
 park+0.043
 forest+0.039
 hike+0.038
 library+0.037
 museum+0.037
Neg logit contributions
 supervision-0.063
atted-0.063
asing-0.062
ums-0.062
ging-0.060
Tokenrum
Token ID6582
Feature activation+4.61e-02

Pos logit contributions
 supervision+0.152
ucker+0.135
ented+0.134
ums+0.129
 wounds+0.129
Neg logit contributions
?"-0.329
 park-0.329
 forest-0.320
 ride-0.320
 museum-0.314
Token again
Token ID757
Feature activation+3.65e-02

Pos logit contributions
ums+0.133
asing+0.132
 supervision+0.131
atted+0.131
olds+0.126
Neg logit contributions
 park-0.057
 picnic-0.048
 forest-0.046
 store-0.046
 hike-0.045
Token.
Token ID13
Feature activation+8.67e-03

Pos logit contributions
ums+0.103
 supervision+0.102
asing+0.102
atted+0.101
ging+0.098
Neg logit contributions
 park-0.046
 picnic-0.040
 forest-0.038
 hike-0.038
 museum-0.038
Token<|endoftext|>
Token ID50256
Feature activation-7.24e-02

Pos logit contributions
asing+0.025
ums+0.025
 supervision+0.025
atted+0.025
ging+0.024
Neg logit contributions
 park-0.011
 forest-0.009
 store-0.009
 picnic-0.009
 library-0.009
TokenOnce
Token ID7454
Feature activation-6.28e-02

Pos logit contributions
 picnic+0.117
 hike+0.113
 park+0.108
 walk+0.107
 party+0.106
Neg logit contributions
ums-0.161
olds-0.155
atted-0.148
cks-0.148
asing-0.145
Token temper
Token ID4124
Feature activation-1.11e-02

Pos logit contributions
ums+0.051
asing+0.050
 supervision+0.050
atted+0.050
aged+0.047
Neg logit contributions
 park-0.039
 picnic-0.035
 forest-0.034
 store-0.034
 hike-0.033
Token and
Token ID290
Feature activation+2.40e-02

Pos logit contributions
 park+0.015
 picnic+0.013
 forest+0.013
 store+0.013
 hike+0.013
Neg logit contributions
ums-0.030
asing-0.030
 supervision-0.029
atted-0.029
aged-0.028
Token often
Token ID1690
Feature activation-2.80e-03

Pos logit contributions
ums+0.065
asing+0.064
 supervision+0.064
atted+0.064
aged+0.062
Neg logit contributions
 park-0.033
 picnic-0.029
 forest-0.028
 hike-0.027
 store-0.027
Token threw
Token ID9617
Feature activation-1.62e-02

Pos logit contributions
 park+0.004
 picnic+0.004
 forest+0.004
 hike+0.004
 museum+0.004
Neg logit contributions
atted-0.007
 supervision-0.007
asing-0.007
ums-0.007
aged-0.007
Token temper
Token ID4124
Feature activation+3.55e-03

Pos logit contributions
 park+0.024
 picnic+0.021
 forest+0.020
 store+0.020
 hike+0.019
Neg logit contributions
ums-0.043
asing-0.043
 supervision-0.042
atted-0.042
olds-0.040
Token tant
Token ID24246
Feature activation+1.46e-01

Pos logit contributions
ums+0.008
asing+0.008
 supervision+0.008
atted+0.008
olds+0.007
Neg logit contributions
 park-0.007
 picnic-0.006
 forest-0.006
 store-0.006
 ride-0.006
Tokenrums
Token ID45241
Feature activation+1.51e-02

Pos logit contributions
 supervision+0.148
ucker+0.143
ums+0.140
othe+0.125
 wasteful+0.124
Neg logit contributions
 forest-0.339
 park-0.336
 museum-0.334
 ride-0.323
 arrived-0.321
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
ums+0.040
asing+0.040
 supervision+0.040
atted+0.040
olds+0.038
Neg logit contributions
 park-0.022
 picnic-0.019
 forest-0.018
 store-0.018
 hike-0.018
Token One
Token ID1881
Feature activation-5.77e-02

Pos logit contributions
 park+0.017
 picnic+0.017
 party+0.015
 hike+0.015
 nap+0.015
Neg logit contributions
ums-0.037
olds-0.036
 supervision-0.036
atted-0.035
asing-0.034
Token day
Token ID1110
Feature activation-4.59e-02

Pos logit contributions
 park+0.075
 picnic+0.069
 hike+0.064
 library+0.062
 museum+0.061
Neg logit contributions
ums-0.153
asing-0.151
atted-0.151
 supervision-0.146
aged-0.144
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
 park+0.066
 forest+0.059
 picnic+0.056
 library+0.055
 hike+0.053
Neg logit contributions
aged-0.117
atted-0.115
ums-0.114
ging-0.113
 supervision-0.112
Token his
Token ID465
Feature activation+1.50e-02

Pos logit contributions
 supervision+0.108
asing+0.107
ums+0.107
atted+0.104
olds+0.101
Neg logit contributions
 park-0.038
 store-0.030
 big-0.029
 forest-0.029
 peek-0.028
Token feet
Token ID3625
Feature activation+9.39e-03

Pos logit contributions
asing+0.031
ums+0.030
atted+0.030
 supervision+0.029
olds+0.028
Neg logit contributions
 park-0.032
 picnic-0.029
 forest-0.028
 store-0.028
 ride-0.028
Token and
Token ID290
Feature activation+5.56e-02

Pos logit contributions
ums+0.027
asing+0.027
 supervision+0.027
atted+0.027
olds+0.025
Neg logit contributions
 park-0.012
 picnic-0.010
 store-0.010
 forest-0.009
 ride-0.009
Token threw
Token ID9617
Feature activation+5.73e-03

Pos logit contributions
ums+0.146
asing+0.145
 supervision+0.139
atted+0.138
olds+0.135
Neg logit contributions
 park-0.089
 store-0.075
 picnic-0.075
 forest-0.075
 peek-0.073
Token a
Token ID257
Feature activation-3.80e-02

Pos logit contributions
ums+0.016
asing+0.016
 supervision+0.015
atted+0.015
olds+0.015
Neg logit contributions
 park-0.008
 picnic-0.007
 forest-0.007
 store-0.007
 hike-0.006
Token tant
Token ID24246
Feature activation+1.42e-01

Pos logit contributions
 park+0.059
 forest+0.053
 museum+0.052
 hike+0.052
 library+0.051
Neg logit contributions
 supervision-0.092
atted-0.092
ums-0.091
asing-0.090
ging-0.090
Tokenrum
Token ID6582
Feature activation+4.15e-02

Pos logit contributions
 supervision+0.163
ucker+0.145
atted+0.141
ums+0.139
ushes+0.139
Neg logit contributions
?"-0.306
 park-0.303
 ride-0.301
 rock-0.296
 forest-0.294
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
ums+0.123
asing+0.123
 supervision+0.122
atted+0.122
olds+0.117
Neg logit contributions
 park-0.047
 picnic-0.038
 forest-0.038
 store-0.037
 hike-0.035
Token\n
Token ID198
Feature activation+1.25e-03

Pos logit contributions
ums+0.049
atted+0.049
asing+0.048
 supervision+0.048
olds+0.047
Neg logit contributions
 park-0.022
 picnic-0.019
 forest-0.019
 store-0.018
 hike-0.018
Token\n
Token ID198
Feature activation+8.36e-03

Pos logit contributions
asing+0.004
ums+0.004
 supervision+0.004
atted+0.004
ging+0.004
Neg logit contributions
 park-0.001
 picnic-0.001
 forest-0.001
 store-0.001
 ride-0.001
Token"
Token ID1
Feature activation+2.84e-02

Pos logit contributions
asing+0.021
ums+0.021
 supervision+0.021
atted+0.021
aged+0.019
Neg logit contributions
 park-0.014
 picnic-0.012
 store-0.012
 forest-0.012
 ride-0.011
Token not
Token ID407
Feature activation+3.63e-02

Pos logit contributions
ums+0.056
asing+0.055
 supervision+0.055
atted+0.055
olds+0.053
Neg logit contributions
 park-0.030
 picnic-0.026
 forest-0.025
 store-0.025
 hike-0.024
Token a
Token ID257
Feature activation-9.99e-03

Pos logit contributions
asing+0.111
 supervision+0.111
ums+0.110
atted+0.108
aged+0.105
Neg logit contributions
 park-0.037
 picnic-0.030
 hike-0.030
 store-0.029
 forest-0.029
Token monster
Token ID9234
Feature activation+2.33e-02

Pos logit contributions
 park+0.012
 hike+0.012
 picnic+0.011
 library+0.011
 ride+0.011
Neg logit contributions
ums-0.025
 supervision-0.025
ging-0.025
asing-0.024
atted-0.024
Token,
Token ID11
Feature activation+3.55e-02

Pos logit contributions
 supervision+0.064
asing+0.063
ums+0.062
atted+0.062
olds+0.060
Neg logit contributions
 park-0.031
 forest-0.026
 hike-0.026
 picnic-0.026
 store-0.025
Token my
Token ID616
Feature activation+1.27e-02

Pos logit contributions
 supervision+0.090
ums+0.090
asing+0.090
atted+0.088
olds+0.085
Neg logit contributions
 park-0.053
 forest-0.046
 picnic-0.046
 hike-0.045
 ride-0.044
Token de
Token ID390
Feature activation+1.38e-01

Pos logit contributions
ums+0.033
 supervision+0.032
atted+0.032
asing+0.032
olds+0.031
Neg logit contributions
 park-0.019
 hike-0.017
 picnic-0.017
 store-0.016
 forest-0.016
Tokenars
Token ID945
Feature activation+2.20e-02

Pos logit contributions
ums+0.229
atted+0.225
 supervision+0.209
aged+0.203
olds+0.188
Neg logit contributions
 park-0.353
 ride-0.323
 store-0.322
 picnic-0.319
 museum-0.318
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
ums+0.059
asing+0.058
 supervision+0.058
atted+0.057
ging+0.056
Neg logit contributions
 park-0.030
 picnic-0.027
 forest-0.026
 hike-0.026
 library-0.026
Token That
Token ID1320
Feature activation-5.90e-03

Pos logit contributions
asing+0.083
ums+0.083
 supervision+0.082
atted+0.082
ging+0.078
Neg logit contributions
 park-0.064
 picnic-0.056
 forest-0.056
 store-0.055
 museum-0.054
Token is
Token ID318
Feature activation+2.74e-02

Pos logit contributions
 park+0.010
 picnic+0.010
 hike+0.010
 festival+0.009
 library+0.009
Neg logit contributions
asing-0.012
ums-0.012
 supervision-0.012
olds-0.011
atted-0.011
Token a
Token ID257
Feature activation-1.99e-02

Pos logit contributions
ums+0.078
atted+0.077
asing+0.076
 supervision+0.075
ging+0.073
Neg logit contributions
 park-0.036
 picnic-0.030
 forest-0.030
 store-0.030
 museum-0.029
Token so
Token ID523
Feature activation+4.34e-02

Pos logit contributions
asing+0.038
ums+0.038
 supervision+0.037
atted+0.037
olds+0.036
Neg logit contributions
 park-0.027
 picnic-0.023
 forest-0.023
 store-0.023
 ride-0.023
Token glad
Token ID9675
Feature activation+3.10e-02

Pos logit contributions
 supervision+0.087
asing+0.087
ums+0.086
atted+0.083
ging+0.079
Neg logit contributions
 park-0.093
 big-0.083
 store-0.083
 picnic-0.082
 forest-0.082
Token the
Token ID262
Feature activation+4.27e-02

Pos logit contributions
atted+0.082
ums+0.081
 supervision+0.080
asing+0.080
olds+0.078
Neg logit contributions
 park-0.043
 picnic-0.038
 library-0.037
 store-0.037
 museum-0.037
Token rh
Token ID9529
Feature activation+8.71e-02

Pos logit contributions
ums+0.102
 supervision+0.101
edient+0.097
atted+0.096
 choices+0.096
Neg logit contributions
 park-0.055
 hike-0.053
 picnic-0.049
 gym-0.049
 Under-0.049
Tokenin
Token ID259
Feature activation+4.71e-02

Pos logit contributions
 supervision+0.182
ums+0.180
atted+0.177
asing+0.176
olds+0.168
Neg logit contributions
 park-0.178
 forest-0.160
 store-0.158
 picnic-0.156
 museum-0.155
Tokenoc
Token ID420
Feature activation+1.38e-01

Pos logit contributions
ums+0.082
asing+0.081
 supervision+0.080
atted+0.080
olds+0.075
Neg logit contributions
 park-0.113
 picnic-0.103
 forest-0.103
 store-0.102
 ride-0.101
Tokeneros
Token ID27498
Feature activation+6.34e-02

Pos logit contributions
 supervision+0.281
asing+0.281
atted+0.260
ums+0.257
othe+0.254
Neg logit contributions
 park-0.267
 forest-0.255
 museum-0.244
 store-0.241
 ride-0.235
Token went
Token ID1816
Feature activation+5.80e-03

Pos logit contributions
asing+0.170
atted+0.170
 supervision+0.166
ums+0.166
aged+0.163
Neg logit contributions
 park-0.085
 picnic-0.074
 museum-0.071
 library-0.071
 hike-0.070
Token away
Token ID1497
Feature activation+1.20e-02

Pos logit contributions
ging+0.012
ums+0.011
atted+0.011
aged+0.011
 supervision+0.011
Neg logit contributions
 park-0.011
 picnic-0.010
 museum-0.010
 library-0.010
 festival-0.010
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
ums+0.033
atted+0.032
 supervision+0.032
asing+0.032
aged+0.031
Neg logit contributions
 park-0.016
 picnic-0.014
 forest-0.013
 store-0.013
 museum-0.013
Token He
Token ID679
Feature activation+3.05e-04

Pos logit contributions
asing+0.051
atted+0.050
ums+0.050
 supervision+0.050
ging+0.049
Neg logit contributions
 park-0.017
 forest-0.013
 store-0.013
 ride-0.012
 museum-0.012
Token you
Token ID345
Feature activation+4.52e-02

Pos logit contributions
ums+0.087
asing+0.086
atted+0.085
 supervision+0.085
olds+0.081
Neg logit contributions
 park-0.073
 picnic-0.065
 forest-0.064
 store-0.064
 hike-0.063
Token like
Token ID588
Feature activation+4.93e-03

Pos logit contributions
asing+0.103
 supervision+0.100
atted+0.100
ums+0.099
olds+0.095
Neg logit contributions
 park-0.081
 picnic-0.073
 forest-0.072
 museum-0.071
 store-0.070
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
 supervision+0.012
ums+0.012
atted+0.012
asing+0.012
olds+0.011
Neg logit contributions
 park-0.008
 picnic-0.007
 hike-0.007
 forest-0.007
 store-0.007
Token rh
Token ID9529
Feature activation+8.66e-02

Pos logit contributions
ums+0.047
 supervision+0.046
asing+0.044
atted+0.044
olds+0.044
Neg logit contributions
 park-0.029
 hike-0.027
 picnic-0.027
 peek-0.026
 walk-0.025
Tokenin
Token ID259
Feature activation+2.74e-02

Pos logit contributions
ums+0.172
 supervision+0.171
asing+0.170
atted+0.169
olds+0.159
Neg logit contributions
 park-0.184
 forest-0.166
 picnic-0.165
 store-0.165
 museum-0.161
Tokenoc
Token ID420
Feature activation+1.38e-01

Pos logit contributions
ums+0.044
asing+0.044
 supervision+0.043
atted+0.043
olds+0.040
Neg logit contributions
 park-0.068
 picnic-0.063
 forest-0.062
 store-0.062
 hike-0.061
Tokeneros
Token ID27498
Feature activation+4.05e-02

Pos logit contributions
asing+0.271
 supervision+0.263
atted+0.255
ums+0.251
othe+0.246
Neg logit contributions
 park-0.286
 forest-0.270
 museum-0.264
 store-0.261
 ride-0.254
Token.
Token ID13
Feature activation+3.05e-02

Pos logit contributions
asing+0.106
 supervision+0.105
ums+0.103
atted+0.102
olds+0.099
Neg logit contributions
 park-0.057
 picnic-0.050
 hike-0.049
 museum-0.048
 forest-0.048
Token But
Token ID887
Feature activation+3.04e-02

Pos logit contributions
asing+0.074
atted+0.074
ums+0.074
 supervision+0.074
ging+0.072
Neg logit contributions
 park-0.051
 forest-0.044
 store-0.044
 picnic-0.043
 library-0.043
Token remember
Token ID3505
Feature activation+5.14e-03

Pos logit contributions
ums+0.074
asing+0.073
atted+0.073
 supervision+0.072
olds+0.069
Neg logit contributions
 park-0.052
 picnic-0.046
 store-0.045
 forest-0.045
 museum-0.043
Token,
Token ID11
Feature activation+1.71e-02

Pos logit contributions
atted+0.014
 supervision+0.014
asing+0.014
ums+0.014
olds+0.014
Neg logit contributions
 park-0.006
 picnic-0.006
 hike-0.005
 ride-0.005
 store-0.005
Token Sammy
Token ID44067
Feature activation+3.10e-02

Pos logit contributions
ums+0.115
asing+0.115
 supervision+0.114
atted+0.113
olds+0.109
Neg logit contributions
 park-0.062
 picnic-0.053
 forest-0.053
 store-0.052
 hike-0.051
Token would
Token ID561
Feature activation-1.70e-02

Pos logit contributions
ums+0.076
asing+0.076
 supervision+0.075
atted+0.075
olds+0.072
Neg logit contributions
 park-0.051
 picnic-0.045
 forest-0.044
 store-0.043
 hike-0.042
Token often
Token ID1690
Feature activation+1.03e-03

Pos logit contributions
 park+0.027
 picnic+0.023
 forest+0.023
 library+0.023
 hike+0.023
Neg logit contributions
ums-0.041
asing-0.039
 supervision-0.039
atted-0.039
olds-0.038
Token share
Token ID2648
Feature activation+8.36e-03

Pos logit contributions
 supervision+0.003
asing+0.002
ums+0.002
atted+0.002
olds+0.002
Neg logit contributions
 park-0.002
 forest-0.001
 picnic-0.001
 festival-0.001
 library-0.001
Token av
Token ID1196
Feature activation+4.83e-02

Pos logit contributions
asing+0.022
ums+0.022
 supervision+0.022
atted+0.022
olds+0.021
Neg logit contributions
 park-0.012
 picnic-0.011
 store-0.010
 forest-0.010
 ride-0.010
Tokenoc
Token ID420
Feature activation+1.38e-01

Pos logit contributions
 supervision+0.112
atted+0.107
asing+0.105
ums+0.104
olds+0.101
Neg logit contributions
 park-0.084
 forest-0.079
 ride-0.078
 picnic-0.078
 store-0.075
Tokenados
Token ID22484
Feature activation+8.81e-03

Pos logit contributions
 supervision+0.231
othe+0.225
edient+0.225
lessness+0.222
inge+0.220
Neg logit contributions
 park-0.262
 forest-0.250
 museum-0.245
 store-0.245
 ride-0.234
Token and
Token ID290
Feature activation+5.09e-02

Pos logit contributions
atted+0.024
 supervision+0.024
ums+0.023
asing+0.023
ging+0.023
Neg logit contributions
 park-0.013
 picnic-0.012
 ride-0.011
 store-0.011
 forest-0.011
Token other
Token ID584
Feature activation+3.13e-02

Pos logit contributions
ums+0.136
asing+0.135
atted+0.133
 supervision+0.133
ging+0.128
Neg logit contributions
 park-0.076
 picnic-0.066
 store-0.064
 forest-0.064
 ride-0.063
Token y
Token ID331
Feature activation+6.90e-02

Pos logit contributions
ums+0.075
asing+0.075
 supervision+0.074
atted+0.074
olds+0.070
Neg logit contributions
 park-0.054
 picnic-0.048
 forest-0.047
 store-0.047
 museum-0.045
Tokenummy
Token ID13513
Feature activation+3.54e-02

Pos logit contributions
 supervision+0.154
asing+0.147
ging+0.141
edient+0.139
 wounds+0.138
Neg logit contributions
 park-0.129
 picnic-0.122
 forest-0.115
 store-0.113
 hike-0.110
Token the
Token ID262
Feature activation+2.21e-02

Pos logit contributions
ums+0.138
asing+0.137
atted+0.136
 supervision+0.135
olds+0.131
Neg logit contributions
 park-0.070
 picnic-0.059
 store-0.058
 forest-0.058
 ride-0.056
Token rain
Token ID6290
Feature activation+3.74e-02

Pos logit contributions
 supervision+0.056
ums+0.054
asing+0.053
atted+0.053
ging+0.052
Neg logit contributions
 park-0.028
 hike-0.027
 picnic-0.026
 museum-0.025
 store-0.025
Token,
Token ID11
Feature activation+3.75e-02

Pos logit contributions
ums+0.112
asing+0.112
 supervision+0.110
atted+0.110
olds+0.107
Neg logit contributions
 park-0.043
 picnic-0.035
 store-0.033
 forest-0.033
 ride-0.033
Token and
Token ID290
Feature activation+4.67e-02

Pos logit contributions
ums+0.108
asing+0.107
 supervision+0.106
atted+0.106
olds+0.102
Neg logit contributions
 park-0.048
 picnic-0.040
 forest-0.039
 store-0.039
 ride-0.037
Token the
Token ID262
Feature activation+2.03e-02

Pos logit contributions
asing+0.120
 supervision+0.120
ums+0.118
atted+0.117
olds+0.113
Neg logit contributions
 park-0.069
 picnic-0.061
 forest-0.059
 hike-0.058
 store-0.058
Token de
Token ID390
Feature activation+1.38e-01

Pos logit contributions
 supervision+0.052
ums+0.050
atted+0.050
asing+0.050
 wounds+0.048
Neg logit contributions
 park-0.025
 hike-0.024
 picnic-0.023
 museum-0.023
 store-0.023
Tokenw
Token ID86
Feature activation+5.56e-02

Pos logit contributions
ums+0.233
atted+0.222
 supervision+0.216
aged+0.201
olds+0.198
Neg logit contributions
 park-0.349
 ride-0.322
 store-0.320
 picnic-0.319
 museum-0.310
Token.
Token ID13
Feature activation+2.65e-02

Pos logit contributions
ums+0.157
asing+0.157
 supervision+0.155
atted+0.153
olds+0.149
Neg logit contributions
 park-0.069
 picnic-0.058
 forest-0.057
 store-0.056
 ride-0.054
Token You
Token ID921
Feature activation+2.32e-02

Pos logit contributions
atted+0.059
asing+0.058
ums+0.058
 supervision+0.058
ging+0.056
Neg logit contributions
 park-0.050
 store-0.044
 forest-0.044
 picnic-0.044
 museum-0.042
Token can
Token ID460
Feature activation-4.23e-03

Pos logit contributions
asing+0.060
ums+0.058
 supervision+0.058
atted+0.058
olds+0.056
Neg logit contributions
 park-0.035
 picnic-0.030
 forest-0.030
 museum-0.029
 store-0.029
Token smell
Token ID8508
Feature activation+9.58e-03

Pos logit contributions
 park+0.006
 picnic+0.005
 museum+0.005
 forest+0.005
 library+0.005
Neg logit contributions
ums-0.011
asing-0.011
atted-0.010
 supervision-0.010
olds-0.010
Token learn
Token ID2193
Feature activation+4.12e-03

Pos logit contributions
 supervision+0.050
ums+0.050
asing+0.049
atted+0.048
olds+0.047
Neg logit contributions
 park-0.034
 picnic-0.031
 forest-0.031
 museum-0.030
 library-0.030
Token more
Token ID517
Feature activation+4.25e-02

Pos logit contributions
asing+0.010
 supervision+0.010
ums+0.010
atted+0.010
olds+0.010
Neg logit contributions
 park-0.006
 picnic-0.006
 forest-0.005
 hike-0.005
 store-0.005
Token about
Token ID546
Feature activation+4.09e-02

Pos logit contributions
asing+0.097
ums+0.096
 supervision+0.095
atted+0.095
olds+0.094
Neg logit contributions
 park-0.075
 picnic-0.070
 store-0.067
 forest-0.067
 hike-0.065
Token rh
Token ID9529
Feature activation+7.50e-02

Pos logit contributions
ums+0.104
 supervision+0.103
asing+0.103
atted+0.102
olds+0.098
Neg logit contributions
 park-0.062
 picnic-0.055
 forest-0.053
 store-0.053
 hike-0.052
Tokenin
Token ID259
Feature activation+3.48e-02

Pos logit contributions
 supervision+0.127
ums+0.123
atted+0.121
asing+0.116
aged+0.113
Neg logit contributions
 park-0.183
 forest-0.168
 museum-0.166
 store-0.165
 picnic-0.161
Tokenoc
Token ID420
Feature activation+1.38e-01

Pos logit contributions
ums+0.077
asing+0.076
atted+0.076
 supervision+0.076
ging+0.071
Neg logit contributions
 park-0.067
 forest-0.059
 picnic-0.059
 store-0.059
 museum-0.058
Tokener
Token ID263
Feature activation+3.67e-02

Pos logit contributions
othe+0.211
 supervision+0.209
asing+0.209
ging+0.204
inge+0.202
Neg logit contributions
 park-0.305
 museum-0.291
 forest-0.289
 store-0.279
 ride-0.271
Tokenoses
Token ID4629
Feature activation+4.27e-02

Pos logit contributions
asing+0.105
ums+0.104
atted+0.104
 supervision+0.103
ging+0.099
Neg logit contributions
 park-0.045
 store-0.037
 forest-0.036
 ride-0.035
 picnic-0.034
Token and
Token ID290
Feature activation+6.08e-02

Pos logit contributions
ums+0.120
asing+0.120
 supervision+0.119
atted+0.118
olds+0.114
Neg logit contributions
 park-0.055
 picnic-0.046
 forest-0.045
 store-0.045
 hike-0.043
Token respect
Token ID2461
Feature activation+4.09e-02

Pos logit contributions
ums+0.145
asing+0.144
atted+0.142
 supervision+0.142
ging+0.136
Neg logit contributions
 park-0.106
 picnic-0.093
 forest-0.092
 store-0.091
 museum-0.089
Token them
Token ID606
Feature activation+3.94e-02

Pos logit contributions
ums+0.113
asing+0.112
atted+0.112
 supervision+0.109
aged+0.107
Neg logit contributions
 park-0.058
 forest-0.049
 store-0.048
 picnic-0.047
 museum-0.046
Token am
Token ID716
Feature activation+1.98e-02

Pos logit contributions
ums+0.037
asing+0.036
 supervision+0.036
atted+0.036
ging+0.034
Neg logit contributions
 park-0.030
 picnic-0.027
 store-0.027
 forest-0.026
 ride-0.026
Token not
Token ID407
Feature activation+3.81e-02

Pos logit contributions
asing+0.031
ums+0.030
atted+0.030
 supervision+0.030
ging+0.028
Neg logit contributions
 park-0.052
 store-0.048
 big-0.047
 forest-0.046
 picnic-0.046
Token mad
Token ID8805
Feature activation+5.47e-02

Pos logit contributions
ums+0.065
asing+0.064
atted+0.063
 supervision+0.063
olds+0.059
Neg logit contributions
 park-0.093
 picnic-0.085
 forest-0.084
 store-0.084
 ride-0.082
Token,
Token ID11
Feature activation+4.03e-02

Pos logit contributions
ums+0.153
asing+0.151
atted+0.148
 supervision+0.148
olds+0.145
Neg logit contributions
 park-0.075
 picnic-0.066
 forest-0.064
 hike-0.063
 store-0.061
Token my
Token ID616
Feature activation+2.66e-02

Pos logit contributions
asing+0.109
ums+0.108
atted+0.107
 supervision+0.105
aged+0.105
Neg logit contributions
 park-0.061
 picnic-0.052
 store-0.051
 forest-0.048
 big-0.047
Token de
Token ID390
Feature activation+1.37e-01

Pos logit contributions
ums+0.053
atted+0.053
asing+0.052
 supervision+0.052
olds+0.050
Neg logit contributions
 park-0.055
 forest-0.050
 picnic-0.050
 store-0.050
 hike-0.050
Tokenars
Token ID945
Feature activation+1.87e-02

Pos logit contributions
ums+0.231
atted+0.225
 supervision+0.213
aged+0.206
ieties+0.191
Neg logit contributions
 park-0.344
 ride-0.320
 store-0.319
 museum-0.312
 picnic-0.309
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
asing+0.053
ums+0.053
 supervision+0.052
atted+0.052
olds+0.051
Neg logit contributions
 park-0.023
 picnic-0.021
 forest-0.020
 hike-0.020
 store-0.019
Token I
Token ID314
Feature activation+1.89e-02

Pos logit contributions
ums+0.083
asing+0.083
atted+0.082
 supervision+0.082
ging+0.079
Neg logit contributions
 park-0.062
 forest-0.054
 picnic-0.054
 store-0.054
 library-0.053
Token am
Token ID716
Feature activation+2.25e-02

Pos logit contributions
ums+0.047
asing+0.046
 supervision+0.046
atted+0.046
ging+0.044
Neg logit contributions
 park-0.032
 picnic-0.028
 store-0.027
 forest-0.027
 ride-0.026
Token glad
Token ID9675
Feature activation+4.17e-02

Pos logit contributions
ums+0.042
asing+0.042
atted+0.041
 supervision+0.041
ging+0.039
Neg logit contributions
 park-0.052
 store-0.046
 picnic-0.046
 forest-0.045
 ride-0.045
Token that
Token ID326
Feature activation+1.38e-02

Pos logit contributions
ums+0.044
asing+0.044
atted+0.044
 supervision+0.043
olds+0.042
Neg logit contributions
 park-0.018
 picnic-0.015
 store-0.015
 forest-0.015
 hike-0.014
Token it
Token ID340
Feature activation+3.52e-02

Pos logit contributions
ums+0.040
asing+0.040
atted+0.040
 supervision+0.039
olds+0.038
Neg logit contributions
 park-0.018
 forest-0.015
 store-0.015
 picnic-0.015
 ride-0.014
Token produced
Token ID4635
Feature activation+7.69e-03

Pos logit contributions
ums+0.098
asing+0.097
 supervision+0.096
atted+0.096
olds+0.092
Neg logit contributions
 park-0.047
 picnic-0.040
 forest-0.039
 store-0.039
 hike-0.037
Token more
Token ID517
Feature activation+3.78e-02

Pos logit contributions
asing+0.024
ums+0.024
 supervision+0.024
atted+0.024
ging+0.023
Neg logit contributions
 park-0.008
 picnic-0.006
 forest-0.006
 store-0.006
 big-0.006
Token av
Token ID1196
Feature activation+4.74e-02

Pos logit contributions
ums+0.099
asing+0.099
 supervision+0.098
atted+0.097
ging+0.093
Neg logit contributions
 park-0.057
 picnic-0.049
 forest-0.049
 store-0.048
 museum-0.046
Tokenoc
Token ID420
Feature activation+1.37e-01

Pos logit contributions
 supervision+0.102
atted+0.101
asing+0.099
ums+0.098
olds+0.097
Neg logit contributions
 park-0.089
 forest-0.087
 store-0.083
 ride-0.083
 picnic-0.082
Tokenados
Token ID22484
Feature activation+1.44e-02

Pos logit contributions
 supervision+0.229
asing+0.224
cks+0.216
 elbows+0.212
ging+0.211
Neg logit contributions
 park-0.278
 forest-0.272
 store-0.267
 museum-0.262
 tree-0.254
Token than
Token ID621
Feature activation+7.42e-02

Pos logit contributions
ums+0.041
asing+0.041
atted+0.040
 supervision+0.040
olds+0.039
Neg logit contributions
 park-0.018
 picnic-0.016
 forest-0.015
 store-0.015
 ride-0.014
Token ever
Token ID1683
Feature activation+5.96e-02

Pos logit contributions
ums+0.200
asing+0.196
atted+0.195
 supervision+0.192
olds+0.186
Neg logit contributions
 park-0.109
 picnic-0.094
 store-0.093
 forest-0.093
 ride-0.090
Token before
Token ID878
Feature activation+5.30e-02

Pos logit contributions
 supervision+0.138
asing+0.137
atted+0.135
ums+0.135
ging+0.130
Neg logit contributions
 park-0.105
 picnic-0.094
 forest-0.092
 store-0.091
 hike-0.091
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
ums+0.151
 supervision+0.149
asing+0.149
atted+0.148
ging+0.143
Neg logit contributions
 park-0.065
 picnic-0.055
 forest-0.053
 store-0.052
 museum-0.052
Token stubborn
Token ID24444
Feature activation+2.39e-02

Pos logit contributions
ums+0.028
asing+0.028
 supervision+0.028
atted+0.027
ging+0.026
Neg logit contributions
 park-0.025
 store-0.022
 picnic-0.022
 forest-0.022
 big-0.022
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
ums+0.066
asing+0.066
 supervision+0.066
atted+0.065
olds+0.064
Neg logit contributions
 park-0.031
 picnic-0.028
 forest-0.027
 hike-0.027
 museum-0.026
Token She
Token ID1375
Feature activation+1.04e-02

Pos logit contributions
ums+0.068
asing+0.067
 supervision+0.066
atted+0.065
olds+0.065
Neg logit contributions
 park-0.040
 picnic-0.037
 hike-0.036
 forest-0.035
 ride-0.035
Token throws
Token ID12542
Feature activation-3.75e-03

Pos logit contributions
asing+0.021
ums+0.021
atted+0.021
 supervision+0.021
olds+0.020
Neg logit contributions
 park-0.021
 picnic-0.019
 forest-0.019
 store-0.018
 hike-0.018
Token a
Token ID257
Feature activation-4.15e-02

Pos logit contributions
 park+0.005
 store+0.004
 picnic+0.004
 forest+0.004
 ride+0.003
Neg logit contributions
ums-0.011
asing-0.011
atted-0.011
 supervision-0.011
aged-0.010
Token tant
Token ID24246
Feature activation+1.36e-01

Pos logit contributions
 park+0.087
 forest+0.080
 picnic+0.079
 hike+0.078
 museum+0.078
Neg logit contributions
ums-0.079
 supervision-0.079
atted-0.079
asing-0.079
ging-0.075
Tokenrum
Token ID6582
Feature activation+4.22e-02

Pos logit contributions
 supervision+0.164
ucker+0.133
atted+0.132
ums+0.130
 wounds+0.130
Neg logit contributions
?"-0.308
 park-0.303
 peek-0.299
 ride-0.297
 museum-0.296
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
ums+0.123
asing+0.122
atted+0.121
 supervision+0.121
olds+0.117
Neg logit contributions
 park-0.050
 picnic-0.043
 forest-0.041
 store-0.041
 hike-0.040
Token She
Token ID1375
Feature activation-2.06e-03

Pos logit contributions
ums+0.061
asing+0.060
 supervision+0.059
atted+0.059
olds+0.058
Neg logit contributions
 park-0.036
 picnic-0.034
 hike-0.032
 forest-0.032
 ride-0.031
Token kicks
Token ID17364
Feature activation+5.53e-03

Pos logit contributions
 park+0.004
 picnic+0.004
 forest+0.004
 store+0.004
 hike+0.004
Neg logit contributions
ums-0.004
asing-0.004
 supervision-0.004
atted-0.004
ging-0.004
Token and
Token ID290
Feature activation+5.17e-02

Pos logit contributions
ums+0.017
asing+0.016
atted+0.016
 supervision+0.016
aged+0.016
Neg logit contributions
 park-0.006
 picnic-0.005
 store-0.005
 forest-0.005
 ride-0.005
Token "
Token ID366
Feature activation+5.12e-02

Pos logit contributions
asing+0.062
ums+0.062
 supervision+0.061
atted+0.061
olds+0.058
Neg logit contributions
 park-0.026
 store-0.022
 picnic-0.021
 forest-0.021
 ride-0.021
TokenThank
Token ID10449
Feature activation+5.46e-02

Pos logit contributions
 supervision+0.138
ums+0.138
atted+0.138
asing+0.134
aged+0.131
Neg logit contributions
 park-0.078
 store-0.065
 forest-0.063
 picnic-0.063
 library-0.062
Token you
Token ID345
Feature activation+2.86e-02

Pos logit contributions
 supervision+0.145
atted+0.144
ums+0.144
asing+0.142
aged+0.141
Neg logit contributions
 park-0.079
 store-0.067
 picnic-0.066
 forest-0.065
 museum-0.063
Token,
Token ID11
Feature activation+2.65e-02

Pos logit contributions
asing+0.091
ums+0.091
 supervision+0.090
atted+0.090
aged+0.087
Neg logit contributions
 park-0.027
 picnic-0.021
 store-0.020
 forest-0.020
 ride-0.019
Token my
Token ID616
Feature activation+2.86e-02

Pos logit contributions
asing+0.068
ums+0.067
atted+0.067
 supervision+0.067
aged+0.065
Neg logit contributions
 park-0.043
 picnic-0.037
 store-0.037
 library-0.035
 forest-0.035
Token de
Token ID390
Feature activation+1.35e-01

Pos logit contributions
asing+0.056
ums+0.056
 supervision+0.056
atted+0.055
ging+0.053
Neg logit contributions
 park-0.061
 picnic-0.056
 store-0.055
 forest-0.055
 museum-0.054
Tokenars
Token ID945
Feature activation+2.27e-02

Pos logit contributions
ums+0.238
atted+0.236
 supervision+0.226
aged+0.213
ieties+0.203
Neg logit contributions
 park-0.331
 ride-0.309
 store-0.307
 museum-0.300
 picnic-0.293
Token.
Token ID13
Feature activation+3.43e-02

Pos logit contributions
ums+0.064
asing+0.063
 supervision+0.063
atted+0.062
olds+0.061
Neg logit contributions
 park-0.029
 picnic-0.026
 forest-0.025
 hike-0.025
 store-0.023
Token I
Token ID314
Feature activation+2.08e-02

Pos logit contributions
asing+0.078
ums+0.078
atted+0.078
 supervision+0.077
ging+0.075
Neg logit contributions
 park-0.062
 store-0.054
 forest-0.054
 picnic-0.054
 library-0.053
Token appreciate
Token ID9144
Feature activation+3.42e-02

Pos logit contributions
ums+0.051
 supervision+0.050
asing+0.050
atted+0.050
ging+0.047
Neg logit contributions
 park-0.036
 picnic-0.031
 store-0.031
 forest-0.030
 ride-0.030
Token your
Token ID534
Feature activation+4.94e-02

Pos logit contributions
ums+0.086
asing+0.085
 supervision+0.084
atted+0.084
olds+0.080
Neg logit contributions
 park-0.055
 picnic-0.048
 forest-0.047
 store-0.047
 museum-0.046
Token're
Token ID821
Feature activation+1.68e-02

Pos logit contributions
asing+0.017
ums+0.016
atted+0.015
 supervision+0.015
cks+0.015
Neg logit contributions
 park-0.015
 picnic-0.014
 library-0.014
 store-0.014
 forest-0.014
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
ums+0.044
asing+0.044
 supervision+0.043
atted+0.042
aged+0.041
Neg logit contributions
 park-0.024
 picnic-0.022
 hike-0.021
 store-0.021
 forest-0.021
Token best
Token ID1266
Feature activation+3.43e-02

Pos logit contributions
ums+0.023
asing+0.021
 supervision+0.020
ubb+0.019
ging+0.019
Neg logit contributions
 park-0.021
 hike-0.020
 picnic-0.020
 Under-0.019
 walk-0.019
Token rh
Token ID9529
Feature activation+8.11e-02

Pos logit contributions
ums+0.082
asing+0.073
aged+0.072
atted+0.072
othe+0.070
Neg logit contributions
 park-0.056
 library-0.051
 hike-0.050
 store-0.049
 picnic-0.047
Tokenin
Token ID259
Feature activation+3.95e-02

Pos logit contributions
ums+0.156
asing+0.155
 supervision+0.154
atted+0.153
olds+0.144
Neg logit contributions
 park-0.177
 picnic-0.161
 forest-0.159
 store-0.158
 museum-0.155
Tokenoc
Token ID420
Feature activation+1.35e-01

Pos logit contributions
asing+0.088
ums+0.086
 supervision+0.084
atted+0.084
aged+0.082
Neg logit contributions
 park-0.074
 picnic-0.067
 hike-0.065
 forest-0.064
 store-0.064
Tokeneros
Token ID27498
Feature activation+5.65e-02

Pos logit contributions
 supervision+0.251
asing+0.243
atted+0.239
othe+0.232
leness+0.231
Neg logit contributions
 park-0.280
 forest-0.260
 museum-0.258
 ride-0.256
 store-0.256
Token ever
Token ID1683
Feature activation+4.10e-02

Pos logit contributions
asing+0.119
atted+0.114
 supervision+0.114
ums+0.114
aged+0.109
Neg logit contributions
 park-0.114
 picnic-0.102
 hike-0.101
 forest-0.101
 store-0.100
Token!"
Token ID2474
Feature activation+1.80e-02

Pos logit contributions
asing+0.100
atted+0.098
 supervision+0.096
ums+0.096
ging+0.095
Neg logit contributions
 park-0.066
 picnic-0.058
 store-0.057
 hike-0.057
 library-0.056
Token Tim
Token ID5045
Feature activation+3.57e-02

Pos logit contributions
ums+0.046
atted+0.046
asing+0.045
 supervision+0.044
aged+0.044
Neg logit contributions
 park-0.027
 picnic-0.024
 forest-0.023
 library-0.023
 museum-0.023
Tokenmy
Token ID1820
Feature activation+1.94e-02

Pos logit contributions
ums+0.087
asing+0.087
atted+0.087
 supervision+0.086
aged+0.082
Neg logit contributions
 park-0.059
 picnic-0.053
 forest-0.051
 museum-0.051
 store-0.051
Token that
Token ID326
Feature activation+1.35e-02

Pos logit contributions
 supervision+0.043
ums+0.043
asing+0.043
atted+0.043
olds+0.041
Neg logit contributions
 park-0.022
 picnic-0.019
 forest-0.018
 store-0.018
 hike-0.018
Token boss
Token ID6478
Feature activation+7.39e-02

Pos logit contributions
ums+0.037
asing+0.037
 supervision+0.037
atted+0.037
olds+0.035
Neg logit contributions
 park-0.018
 picnic-0.015
 forest-0.015
 store-0.015
 hike-0.014
Tokeny
Token ID88
Feature activation+4.06e-02

Pos logit contributions
ums+0.168
asing+0.165
 supervision+0.164
atted+0.163
olds+0.155
Neg logit contributions
 park-0.136
 forest-0.119
 picnic-0.119
 store-0.119
 ride-0.116
Token rh
Token ID9529
Feature activation+7.77e-02

Pos logit contributions
ums+0.109
asing+0.108
 supervision+0.107
atted+0.107
olds+0.102
Neg logit contributions
 park-0.058
 picnic-0.050
 forest-0.049
 store-0.049
 ride-0.047
Tokenin
Token ID259
Feature activation+4.31e-02

Pos logit contributions
 supervision+0.159
ums+0.158
atted+0.155
asing+0.154
olds+0.147
Neg logit contributions
 park-0.161
 forest-0.146
 store-0.143
 picnic-0.142
 museum-0.141
Tokenoc
Token ID420
Feature activation+1.34e-01

Pos logit contributions
ums+0.104
atted+0.101
 supervision+0.101
asing+0.101
olds+0.096
Neg logit contributions
 park-0.075
 forest-0.066
 store-0.065
 ride-0.064
 picnic-0.064
Tokeneros
Token ID27498
Feature activation+5.03e-02

Pos logit contributions
 supervision+0.249
asing+0.245
atted+0.237
ums+0.235
othe+0.231
Neg logit contributions
 park-0.271
 forest-0.259
 museum-0.250
 store-0.243
 ride-0.238
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
ums+0.138
asing+0.138
 supervision+0.136
atted+0.136
olds+0.131
Neg logit contributions
 park-0.068
 picnic-0.058
 forest-0.057
 store-0.056
 hike-0.054
Token<|endoftext|>
Token ID50256
Feature activation-7.39e-02

Pos logit contributions
asing+0.054
atted+0.053
 supervision+0.053
ums+0.053
ging+0.053
Neg logit contributions
 park-0.014
 forest-0.011
 store-0.010
 ride-0.010
 museum-0.010
TokenD
Token ID35
Feature activation+2.71e-02

Pos logit contributions
 picnic+0.130
 hike+0.125
 party+0.122
 race+0.120
 walk+0.119
Neg logit contributions
ums-0.149
atted-0.136
olds-0.136
ging-0.133
ins-0.131
Tokenee
Token ID1453
Feature activation+5.28e-02

Pos logit contributions
ums+0.047
atted+0.045
asing+0.044
 supervision+0.044
olds+0.042
Neg logit contributions
 park-0.064
 picnic-0.060
 forest-0.058
 store-0.058
 library-0.057
Token you
Token ID345
Feature activation+3.12e-02

Pos logit contributions
 supervision+0.011
asing+0.011
atted+0.011
ums+0.011
olds+0.011
Neg logit contributions
 park-0.011
 picnic-0.010
 forest-0.010
 hike-0.010
 store-0.009
Token like
Token ID588
Feature activation+3.05e-03

Pos logit contributions
asing+0.075
ums+0.074
 supervision+0.073
atted+0.073
olds+0.070
Neg logit contributions
 park-0.054
 picnic-0.047
 forest-0.047
 store-0.046
 hike-0.046
Token the
Token ID262
Feature activation+1.62e-02

Pos logit contributions
 supervision+0.008
atted+0.008
ums+0.008
asing+0.008
olds+0.007
Neg logit contributions
 park-0.004
 picnic-0.004
 hike-0.004
 forest-0.004
 store-0.004
Token rh
Token ID9529
Feature activation+8.24e-02

Pos logit contributions
ums+0.041
 supervision+0.041
asing+0.040
atted+0.039
olds+0.039
Neg logit contributions
 park-0.023
 picnic-0.021
 hike-0.021
 peek-0.019
 store-0.019
Tokenin
Token ID259
Feature activation+2.20e-02

Pos logit contributions
 supervision+0.172
ums+0.170
atted+0.169
asing+0.167
aged+0.159
Neg logit contributions
 park-0.168
 forest-0.151
 store-0.150
 picnic-0.147
 museum-0.147
Tokenoc
Token ID420
Feature activation+1.34e-01

Pos logit contributions
ums+0.038
asing+0.038
atted+0.038
 supervision+0.038
ging+0.035
Neg logit contributions
 park-0.052
 forest-0.048
 store-0.047
 picnic-0.047
 ride-0.047
Tokeneros
Token ID27498
Feature activation+3.57e-02

Pos logit contributions
asing+0.268
 supervision+0.260
atted+0.251
othe+0.246
ums+0.246
Neg logit contributions
 park-0.268
 forest-0.252
 museum-0.248
 store-0.244
 ride-0.238
Token,
Token ID11
Feature activation+3.15e-02

Pos logit contributions
asing+0.098
 supervision+0.098
ums+0.097
atted+0.096
olds+0.093
Neg logit contributions
 park-0.047
 picnic-0.040
 hike-0.039
 forest-0.039
 museum-0.038
Token but
Token ID475
Feature activation+4.66e-02

Pos logit contributions
asing+0.091
ums+0.091
atted+0.091
 supervision+0.089
aged+0.088
Neg logit contributions
 park-0.041
 store-0.034
 picnic-0.033
 forest-0.033
 ride-0.031
Token it
Token ID340
Feature activation+1.30e-02

Pos logit contributions
ums+0.121
asing+0.119
atted+0.119
 supervision+0.118
ging+0.114
Neg logit contributions
 park-0.073
 store-0.063
 picnic-0.063
 forest-0.062
 museum-0.060
Token is
Token ID318
Feature activation+2.77e-02

Pos logit contributions
ums+0.035
asing+0.035
 supervision+0.035
atted+0.035
ging+0.033
Neg logit contributions
 park-0.018
 picnic-0.016
 forest-0.015
 store-0.015
 hike-0.015
Token thanked
Token ID26280
Feature activation+3.48e-02

Pos logit contributions
atted+0.051
asing+0.051
ums+0.050
 supervision+0.049
aged+0.048
Neg logit contributions
 park-0.036
 picnic-0.032
 forest-0.032
 store-0.032
 library-0.031
Token the
Token ID262
Feature activation+1.71e-02

Pos logit contributions
asing+0.085
ums+0.085
 supervision+0.084
atted+0.084
aged+0.080
Neg logit contributions
 park-0.059
 picnic-0.052
 forest-0.051
 store-0.051
 ride-0.050
Token elderly
Token ID13830
Feature activation+5.29e-02

Pos logit contributions
ums+0.045
 supervision+0.044
asing+0.043
atted+0.043
ging+0.042
Neg logit contributions
 park-0.024
 picnic-0.021
 hike-0.020
 store-0.020
 library-0.020
Token rh
Token ID9529
Feature activation+7.60e-02

Pos logit contributions
asing+0.151
ums+0.150
atted+0.149
 supervision+0.149
olds+0.143
Neg logit contributions
 park-0.069
 forest-0.058
 picnic-0.057
 store-0.057
 ride-0.055
Tokenin
Token ID259
Feature activation+1.67e-02

Pos logit contributions
 supervision+0.171
ums+0.164
atted+0.162
asing+0.159
olds+0.155
Neg logit contributions
 park-0.146
 forest-0.133
 store-0.130
 museum-0.128
 picnic-0.126
Tokenoc
Token ID420
Feature activation+1.34e-01

Pos logit contributions
 supervision+0.032
asing+0.032
ums+0.032
atted+0.032
aged+0.030
Neg logit contributions
 park-0.037
 forest-0.034
 picnic-0.034
 store-0.034
 ride-0.034
Tokeneros
Token ID27498
Feature activation+2.20e-02

Pos logit contributions
 supervision+0.269
asing+0.265
othe+0.250
atted+0.243
cks+0.242
Neg logit contributions
 park-0.262
 forest-0.247
 museum-0.241
 store-0.236
 picnic-0.227
Token for
Token ID329
Feature activation+4.60e-02

Pos logit contributions
ums+0.050
asing+0.050
 supervision+0.049
atted+0.049
olds+0.047
Neg logit contributions
 park-0.040
 picnic-0.036
 forest-0.035
 store-0.035
 hike-0.034
Token his
Token ID465
Feature activation+1.69e-02

Pos logit contributions
ums+0.122
asing+0.121
atted+0.120
 supervision+0.117
olds+0.114
Neg logit contributions
 park-0.073
 store-0.063
 picnic-0.062
 forest-0.061
 ride-0.059
Token wonderful
Token ID7932
Feature activation+2.97e-02

Pos logit contributions
 supervision+0.041
ums+0.039
asing+0.039
atted+0.039
olds+0.037
Neg logit contributions
 park-0.028
 forest-0.024
 store-0.024
 library-0.023
 museum-0.023
Token surprise
Token ID5975
Feature activation+2.73e-03

Pos logit contributions
atted+0.081
ums+0.080
 supervision+0.078
aged+0.075
asing+0.075
Neg logit contributions
 park-0.032
 store-0.028
 forest-0.027
Hi-0.026
 library-0.026
Token happy
Token ID3772
Feature activation+4.93e-02

Pos logit contributions
asing+0.139
ums+0.139
 supervision+0.136
atted+0.135
ging+0.132
Neg logit contributions
 park-0.095
 picnic-0.082
 store-0.082
 forest-0.081
 ride-0.079
Token for
Token ID329
Feature activation+5.32e-02

Pos logit contributions
ums+0.131
atted+0.129
asing+0.128
 supervision+0.127
olds+0.123
Neg logit contributions
 park-0.072
 picnic-0.063
 store-0.062
 hike-0.061
 forest-0.061
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
ums+0.128
atted+0.128
asing+0.128
 supervision+0.124
aged+0.121
Neg logit contributions
 park-0.095
 store-0.082
 forest-0.082
 picnic-0.080
 museum-0.079
Token rh
Token ID9529
Feature activation+8.06e-02

Pos logit contributions
 supervision+0.051
ums+0.051
asing+0.049
olds+0.049
atted+0.048
Neg logit contributions
 park-0.024
 hike-0.021
 picnic-0.021
 forest-0.021
 store-0.021
Tokenin
Token ID259
Feature activation+2.05e-02

Pos logit contributions
 supervision+0.173
ums+0.169
atted+0.168
asing+0.166
aged+0.159
Neg logit contributions
 park-0.161
 forest-0.144
 store-0.143
 museum-0.141
 picnic-0.140
Tokenoc
Token ID420
Feature activation+1.34e-01

Pos logit contributions
atted+0.041
 supervision+0.041
asing+0.041
ums+0.041
ging+0.038
Neg logit contributions
 park-0.045
 forest-0.041
 ride-0.040
 store-0.040
 picnic-0.039
Tokeneros
Token ID27498
Feature activation+3.48e-02

Pos logit contributions
asing+0.259
 supervision+0.253
atted+0.245
othe+0.244
ums+0.238
Neg logit contributions
 park-0.280
 forest-0.259
 museum-0.257
 store-0.252
 ride-0.246
Token.
Token ID13
Feature activation+2.83e-02

Pos logit contributions
ums+0.099
asing+0.099
 supervision+0.099
atted+0.098
olds+0.095
Neg logit contributions
 park-0.043
 picnic-0.036
 forest-0.035
 store-0.034
 hike-0.034
Token She
Token ID1375
Feature activation+1.30e-02

Pos logit contributions
ums+0.076
asing+0.075
 supervision+0.075
atted+0.075
olds+0.072
Neg logit contributions
 park-0.040
 picnic-0.035
 forest-0.034
 hike-0.034
 store-0.034
Token said
Token ID531
Feature activation+1.68e-02

Pos logit contributions
atted+0.029
asing+0.029
ums+0.029
 supervision+0.028
olds+0.028
Neg logit contributions
 park-0.023
 picnic-0.021
 forest-0.021
 hike-0.021
 museum-0.020
Token,
Token ID11
Feature activation+1.58e-02

Pos logit contributions
ums+0.046
atted+0.045
asing+0.045
 supervision+0.045
olds+0.043
Neg logit contributions
 park-0.024
 picnic-0.020
 forest-0.020
 store-0.020
 hike-0.019
Token had
Token ID550
Feature activation+6.32e-03

Pos logit contributions
asing+0.024
ums+0.024
 supervision+0.024
atted+0.024
olds+0.023
Neg logit contributions
 park-0.025
 picnic-0.023
 forest-0.022
 museum-0.022
 hike-0.022
Token discovered
Token ID5071
Feature activation-1.45e-02

Pos logit contributions
 supervision+0.015
asing+0.015
ums+0.015
atted+0.015
olds+0.015
Neg logit contributions
 park-0.010
 forest-0.009
 picnic-0.009
 store-0.009
 museum-0.009
Token the
Token ID262
Feature activation+1.82e-02

Pos logit contributions
 park+0.016
 picnic+0.012
 forest+0.012
 store+0.012
 museum+0.012
Neg logit contributions
ums-0.044
asing-0.044
 supervision-0.044
atted-0.044
ging-0.042
Token rh
Token ID9529
Feature activation+7.39e-02

Pos logit contributions
ums+0.048
 supervision+0.047
asing+0.046
atted+0.046
olds+0.045
Neg logit contributions
 park-0.023
 picnic-0.021
 hike-0.020
 peek-0.020
 store-0.019
Tokenin
Token ID259
Feature activation+1.72e-02

Pos logit contributions
 supervision+0.154
ums+0.150
atted+0.149
asing+0.146
aged+0.140
Neg logit contributions
 park-0.154
 forest-0.140
 store-0.137
 museum-0.136
 picnic-0.133
Tokenoc
Token ID420
Feature activation+1.33e-01

Pos logit contributions
asing+0.033
 supervision+0.033
ums+0.032
atted+0.032
ging+0.030
Neg logit contributions
 park-0.039
 forest-0.036
 store-0.035
 museum-0.035
 ride-0.035
Tokeneros
Token ID27498
Feature activation+3.29e-02

Pos logit contributions
asing+0.268
 supervision+0.264
othe+0.253
atted+0.247
cks+0.246
Neg logit contributions
 park-0.259
 forest-0.248
 museum-0.244
 store-0.236
 ride-0.224
Token,
Token ID11
Feature activation+2.62e-02

Pos logit contributions
asing+0.089
 supervision+0.088
ums+0.088
atted+0.088
olds+0.084
Neg logit contributions
 park-0.044
 picnic-0.038
 forest-0.037
 store-0.036
 hike-0.036
Token but
Token ID475
Feature activation+4.69e-02

Pos logit contributions
ums+0.077
asing+0.077
atted+0.076
 supervision+0.074
ging+0.074
Neg logit contributions
 park-0.034
 store-0.028
 picnic-0.027
 forest-0.027
 ride-0.026
Token they
Token ID484
Feature activation+1.78e-02

Pos logit contributions
ums+0.129
asing+0.128
atted+0.126
 supervision+0.124
ging+0.121
Neg logit contributions
 park-0.070
 store-0.060
 picnic-0.059
 big-0.058
 museum-0.057
Token had
Token ID550
Feature activation+7.43e-03

Pos logit contributions
ums+0.049
asing+0.049
 supervision+0.048
atted+0.048
ging+0.047
Neg logit contributions
 park-0.025
 picnic-0.021
 store-0.021
 forest-0.021
 ride-0.020
Token"
Token ID1
Feature activation+3.93e-02

Pos logit contributions
ums+0.047
 supervision+0.047
atted+0.047
asing+0.047
aged+0.044
Neg logit contributions
 park-0.027
 store-0.023
 forest-0.022
 picnic-0.021
 library-0.021
TokenThank
Token ID10449
Feature activation+4.59e-02

Pos logit contributions
asing+0.086
ums+0.085
 supervision+0.084
atted+0.084
olds+0.080
Neg logit contributions
 park-0.076
 picnic-0.069
 forest-0.068
 hike-0.067
 store-0.066
Token you
Token ID345
Feature activation+2.21e-02

Pos logit contributions
 supervision+0.122
ums+0.121
atted+0.121
aged+0.119
asing+0.118
Neg logit contributions
 park-0.067
 store-0.056
 picnic-0.056
 forest-0.055
 museum-0.053
Token,
Token ID11
Feature activation+2.68e-02

Pos logit contributions
ums+0.068
asing+0.068
 supervision+0.068
atted+0.067
olds+0.065
Neg logit contributions
 park-0.023
 picnic-0.018
 forest-0.018
 store-0.017
 hike-0.016
Token my
Token ID616
Feature activation+2.15e-02

Pos logit contributions
ums+0.074
asing+0.073
 supervision+0.073
atted+0.073
olds+0.070
Neg logit contributions
 park-0.037
 picnic-0.031
 forest-0.031
 store-0.030
 hike-0.030
Token de
Token ID390
Feature activation+1.33e-01

Pos logit contributions
ums+0.046
atted+0.045
asing+0.045
 supervision+0.044
olds+0.043
Neg logit contributions
 park-0.042
 forest-0.038
 hike-0.038
 picnic-0.038
 store-0.038
Tokenars
Token ID945
Feature activation+1.75e-02

Pos logit contributions
ums+0.222
atted+0.216
 supervision+0.211
aged+0.197
asing+0.188
Neg logit contributions
 park-0.341
 store-0.313
 ride-0.312
 museum-0.309
 picnic-0.303
Token,"
Token ID553
Feature activation+5.01e-02

Pos logit contributions
ums+0.048
asing+0.047
 supervision+0.046
atted+0.046
olds+0.045
Neg logit contributions
 park-0.024
 picnic-0.021
 hike-0.021
 forest-0.021
 museum-0.020
Token she
Token ID673
Feature activation+1.18e-02

Pos logit contributions
ums+0.127
asing+0.126
 supervision+0.125
atted+0.125
olds+0.121
Neg logit contributions
 park-0.078
 picnic-0.069
 forest-0.068
 hike-0.067
 store-0.065
Token said
Token ID531
Feature activation+2.74e-02

Pos logit contributions
ums+0.028
asing+0.028
atted+0.028
 supervision+0.028
olds+0.026
Neg logit contributions
 park-0.020
 picnic-0.018
 forest-0.018
 store-0.017
 hike-0.017
Token.
Token ID13
Feature activation+3.02e-02

Pos logit contributions
asing+0.092
ums+0.091
 supervision+0.090
atted+0.088
cks+0.088
Neg logit contributions
 park-0.017
 store-0.014
 ride-0.013
 big-0.012
 peek-0.012
Tokenily
Token ID813
Feature activation-5.39e-02

Pos logit contributions
ums+0.018
asing+0.017
 supervision+0.017
atted+0.017
olds+0.016
Neg logit contributions
 park-0.018
 picnic-0.016
 forest-0.016
 store-0.016
 museum-0.015
Token and
Token ID290
Feature activation-5.00e-02

Pos logit contributions
 park+0.077
 picnic+0.071
 museum+0.068
 hike+0.067
 forest+0.066
Neg logit contributions
ums-0.138
atted-0.137
asing-0.136
 supervision-0.135
aged-0.130
Token Sam
Token ID3409
Feature activation-4.74e-02

Pos logit contributions
 park+0.068
 picnic+0.059
 forest+0.058
 store+0.057
 museum+0.057
Neg logit contributions
ums-0.135
asing-0.134
 supervision-0.134
atted-0.133
aged-0.128
Token went
Token ID1816
Feature activation-9.15e-02

Pos logit contributions
 park+0.063
 picnic+0.054
 forest+0.053
 store+0.052
 hike+0.051
Neg logit contributions
ums-0.130
asing-0.129
atted-0.129
 supervision-0.128
olds-0.123
Token on
Token ID319
Feature activation-5.70e-02

Pos logit contributions
 park+0.127
 museum+0.116
 forest+0.115
 library+0.115
 store+0.113
Neg logit contributions
atted-0.235
ums-0.225
olds-0.223
asing-0.222
aged-0.219
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
 museum+0.068
 store+0.067
 park+0.062
 arrived+0.061
 knight+0.061
Neg logit contributions
cks-0.133
 themselves-0.130
ums-0.130
atted-0.128
aged-0.125
Token saf
Token ID1932
Feature activation-7.69e-02

Pos logit contributions
Hi+0.104
 museum+0.100
 theater+0.099
 princess+0.097
 park+0.094
Neg logit contributions
cking-0.344
cks-0.330
ck-0.329
edient-0.325
 supervision-0.325
Tokenari
Token ID2743
Feature activation-8.37e-02

Pos logit contributions
 park+0.151
 picnic+0.148
 nap+0.139
 hike+0.136
 store+0.134
Neg logit contributions
ums-0.158
atted-0.141
olds-0.141
asing-0.138
ams-0.129
Token with
Token ID351
Feature activation-3.19e-02

Pos logit contributions
 park+0.113
 picnic+0.107
 festival+0.100
 library+0.100
 hike+0.098
Neg logit contributions
ums-0.217
atted-0.203
asing-0.201
aged-0.199
olds-0.197
Token their
Token ID511
Feature activation-6.43e-02

Pos logit contributions
 park+0.054
 picnic+0.048
 forest+0.047
 store+0.047
 hike+0.046
Neg logit contributions
ums-0.077
asing-0.077
 supervision-0.076
atted-0.076
olds-0.072
Token mom
Token ID1995
Feature activation-1.84e-02

Pos logit contributions
 park+0.112
 hike+0.108
 picnic+0.106
 store+0.103
 festival+0.102
Neg logit contributions
ums-0.137
 supervision-0.130
atted-0.127
olds-0.123
aged-0.121
Token things
Token ID1243
Feature activation+2.15e-02

Pos logit contributions
ums+0.066
asing+0.066
 supervision+0.065
atted+0.065
olds+0.062
Neg logit contributions
 park-0.042
 picnic-0.037
 forest-0.036
 store-0.036
 hike-0.035
Token together
Token ID1978
Feature activation+1.74e-02

Pos logit contributions
asing+0.058
 supervision+0.057
ums+0.057
atted+0.057
ging+0.055
Neg logit contributions
 park-0.029
 picnic-0.025
 forest-0.025
 store-0.024
 hike-0.024
Token.
Token ID13
Feature activation+7.31e-03

Pos logit contributions
asing+0.048
ums+0.047
 supervision+0.047
atted+0.046
ging+0.045
Neg logit contributions
 park-0.023
 picnic-0.019
 forest-0.019
 store-0.019
 hike-0.019
Token<|endoftext|>
Token ID50256
Feature activation-9.74e-02

Pos logit contributions
asing+0.021
ums+0.021
 supervision+0.021
atted+0.021
ging+0.020
Neg logit contributions
 park-0.009
 picnic-0.007
 forest-0.007
 store-0.007
 museum-0.007
TokenOn
Token ID2202
Feature activation-8.60e-02

Pos logit contributions
 picnic+0.174
 party+0.163
 hike+0.162
 race+0.157
 festival+0.155
Neg logit contributions
ums-0.193
olds-0.179
cks-0.177
atted-0.174
ging-0.172
Token a
Token ID257
Feature activation-1.42e-01

Pos logit contributions
 park+0.103
 museum+0.096
 peek+0.095
 forest+0.093
 store+0.092
Neg logit contributions
ums-0.214
 supervision-0.213
atted-0.210
cks-0.202
othe-0.200
Token sunny
Token ID27737
Feature activation-8.43e-02

Pos logit contributions
 museum+0.160
 park+0.160
Hi+0.152
 store+0.152
 theater+0.151
Neg logit contributions
cking-0.306
othe-0.306
leness-0.304
atted-0.299
ging-0.299
Token day
Token ID1110
Feature activation-9.77e-02

Pos logit contributions
 museum+0.114
 library+0.112
Hi+0.110
 Under+0.105
 peek+0.105
Neg logit contributions
atted-0.173
ums-0.171
aged-0.159
ams-0.157
ieties-0.156
Token,
Token ID11
Feature activation-8.55e-02

Pos logit contributions
 park+0.138
 library+0.128
 museum+0.119
 forest+0.118
 hike+0.117
Neg logit contributions
atted-0.249
ums-0.243
othe-0.240
ging-0.235
aged-0.234
Token Joe
Token ID5689
Feature activation-5.51e-02

Pos logit contributions
 park+0.118
 forest+0.107
 library+0.105
 store+0.105
 hike+0.102
Neg logit contributions
othe-0.213
atted-0.212
aged-0.211
ums-0.207
ging-0.206
Token decided
Token ID3066
Feature activation-8.68e-02

Pos logit contributions
 park+0.079
 picnic+0.076
 museum+0.075
 library+0.073
 festival+0.073
Neg logit contributions
atted-0.134
aged-0.124
asing-0.123
ums-0.121
 supervision-0.121
Token so
Token ID523
Feature activation+3.82e-02

Pos logit contributions
asing+0.028
 supervision+0.027
ums+0.027
atted+0.027
ging+0.026
Neg logit contributions
 park-0.012
 picnic-0.010
 store-0.010
 ride-0.010
 forest-0.010
Token happy
Token ID3772
Feature activation+8.13e-03

Pos logit contributions
 supervision+0.095
asing+0.094
ums+0.090
atted+0.089
ging+0.088
Neg logit contributions
 park-0.066
 big-0.061
 picnic-0.059
 store-0.058
 ride-0.057
Token!
Token ID0
Feature activation+1.48e-02

Pos logit contributions
ums+0.022
asing+0.022
atted+0.022
 supervision+0.021
olds+0.021
Neg logit contributions
 park-0.011
 picnic-0.010
 forest-0.010
 hike-0.009
 store-0.009
Token<|endoftext|>
Token ID50256
Feature activation-8.12e-02

Pos logit contributions
asing+0.048
 supervision+0.047
atted+0.046
ums+0.046
ging+0.046
Neg logit contributions
 park-0.014
 forest-0.010
 picnic-0.010
 store-0.010
 library-0.009
TokenOn
Token ID2202
Feature activation-8.50e-02

Pos logit contributions
 picnic+0.139
 hike+0.136
 race+0.130
 party+0.129
 ride+0.129
Neg logit contributions
ums-0.167
ins-0.155
atted-0.154
ging-0.152
olds-0.150
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
 park+0.111
 museum+0.110
 forest+0.103
 library+0.103
 peek+0.102
Neg logit contributions
ums-0.200
 supervision-0.197
atted-0.196
cks-0.186
othe-0.185
Token sunny
Token ID27737
Feature activation-8.31e-02

Pos logit contributions
 museum+0.170
 park+0.164
 chamber+0.159
 library+0.158
 theater+0.158
Neg logit contributions
othe-0.282
leness-0.277
 supervision-0.277
ging-0.275
cking-0.275
Token day
Token ID1110
Feature activation-9.83e-02

Pos logit contributions
 museum+0.111
 library+0.110
Hi+0.104
 university+0.102
 park+0.102
Neg logit contributions
atted-0.173
ums-0.167
aged-0.157
ieties-0.155
ams-0.155
Token,
Token ID11
Feature activation-8.37e-02

Pos logit contributions
 park+0.140
 library+0.133
 museum+0.128
 forest+0.123
 hike+0.122
Neg logit contributions
atted-0.242
ums-0.238
othe-0.236
ging-0.229
aged-0.228
Token a
Token ID257
Feature activation-1.28e-01

Pos logit contributions
 park+0.105
 store+0.092
 forest+0.091
 hike+0.091
 library+0.091
Neg logit contributions
othe-0.219
aged-0.217
ums-0.217
ging-0.216
atted-0.216
Token little
Token ID1310
Feature activation-6.06e-02

Pos logit contributions
 park+0.115
 hike+0.114
 picnic+0.103
 Under+0.098
 forest+0.097
Neg logit contributions
ging-0.340
ums-0.337
othe-0.332
 supervision-0.326
atted-0.319
Token bed
Token ID3996
Feature activation-3.87e-03

Pos logit contributions
 supervision+0.013
ums+0.013
atted+0.013
olds+0.013
asing+0.013
Neg logit contributions
 park-0.007
 picnic-0.007
 hike-0.006
 forest-0.006
 library-0.006
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
 park+0.005
 picnic+0.004
 forest+0.004
 museum+0.004
 hike+0.004
Neg logit contributions
ums-0.011
atted-0.011
 supervision-0.010
asing-0.010
aged-0.010
Token<|endoftext|>
Token ID50256
Feature activation-8.21e-02

Pos logit contributions
ums+0.044
asing+0.043
 supervision+0.043
atted+0.043
olds+0.042
Neg logit contributions
 park-0.017
 picnic-0.014
 forest-0.013
 store-0.013
 hike-0.013
TokenOnce
Token ID7454
Feature activation-8.02e-02

Pos logit contributions
 picnic+0.148
 hike+0.144
 party+0.141
 race+0.139
 ride+0.138
Neg logit contributions
ums-0.158
atted-0.146
ging-0.144
ins-0.144
olds-0.143
Token Upon
Token ID14438
Feature activation-5.63e-02

Pos logit contributions
 park+0.114
 picnic+0.101
 forest+0.097
 store+0.096
 library+0.094
Neg logit contributions
ums-0.212
atted-0.212
asing-0.208
 supervision-0.207
olds-0.202
Token A
Token ID317
Feature activation-1.41e-01

Pos logit contributions
 park+0.061
 store+0.058
 picnic+0.058
 party+0.052
 library+0.051
Neg logit contributions
atted-0.148
cks-0.145
asing-0.144
cking-0.143
olds-0.142
Token Time
Token ID3862
Feature activation-6.46e-02

Pos logit contributions
 park+0.153
 forest+0.143
 store+0.139
 museum+0.128
 picnic+0.126
Neg logit contributions
ums-0.338
ging-0.331
 supervision-0.329
atted-0.325
asing-0.324
Token there
Token ID612
Feature activation-7.80e-02

Pos logit contributions
 park+0.094
 ride+0.083
 picnic+0.082
 store+0.079
 library+0.077
Neg logit contributions
atted-0.161
ging-0.157
ums-0.155
aged-0.153
anged-0.151
Token was
Token ID373
Feature activation-6.20e-02

Pos logit contributions
 park+0.109
 picnic+0.104
 hike+0.098
 festival+0.097
 library+0.097
Neg logit contributions
ums-0.196
atted-0.188
aged-0.185
asing-0.184
 supervision-0.181
Token a
Token ID257
Feature activation-1.11e-01

Pos logit contributions
 hike+0.076
 ride+0.075
 park+0.071
 picnic+0.070
 hill+0.069
Neg logit contributions
ums-0.147
asing-0.142
atted-0.142
olds-0.141
aged-0.141
Token little
Token ID1310
Feature activation-3.89e-02

Pos logit contributions
 hike+0.113
 park+0.102
 arrived+0.098
 picnic+0.094
 Under+0.092
Neg logit contributions
ums-0.278
ging-0.273
edient-0.264
othe-0.263
akers-0.256
Token M
Token ID337
Feature activation+8.92e-03

Pos logit contributions
 park+0.073
 forest+0.066
 picnic+0.063
 hike+0.062
 ride+0.062
Neg logit contributions
ums-0.091
 supervision-0.090
atted-0.086
asing-0.085
olds-0.085
Tokenandy
Token ID10757
Feature activation-3.27e-02

Pos logit contributions
ums+0.013
asing+0.012
atted+0.012
 supervision+0.011
olds+0.011
Neg logit contributions
 park-0.024
 picnic-0.023
 ride-0.023
 forest-0.023
 hike-0.022
Token who
Token ID508
Feature activation-2.51e-02

Pos logit contributions
 park+0.046
 picnic+0.043
 hike+0.040
 forest+0.039
 library+0.038
Neg logit contributions
ums-0.087
atted-0.086
asing-0.085
 supervision-0.084
aged-0.082
Token lived
Token ID5615
Feature activation-7.39e-02

Pos logit contributions
 park+0.042
 picnic+0.038
 hike+0.036
 forest+0.036
 store+0.036
Neg logit contributions
ums-0.059
asing-0.058
atted-0.057
 supervision-0.056
aged-0.056
Token near
Token ID1474
Feature activation-5.27e-02

Pos logit contributions
 park+0.103
 picnic+0.100
 hike+0.098
 library+0.094
 museum+0.092
Neg logit contributions
ums-0.195
asing-0.190
aged-0.188
atted-0.186
ging-0.186
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
 park+0.063
 picnic+0.061
 hike+0.061
 arrived+0.060
 ride+0.057
Neg logit contributions
ums-0.136
 supervision-0.130
asing-0.127
olds-0.127
atted-0.124
Token peaceful
Token ID12309
Feature activation-6.36e-02

Pos logit contributions
 arrived+0.125
 hike+0.121
 Under+0.118
 ride+0.113
 picnic+0.110
Neg logit contributions
ums-0.314
ucker-0.311
edient-0.307
othe-0.304
inging-0.298
Token mountain
Token ID8598
Feature activation-6.02e-02

Pos logit contributions
 arrived+0.058
 hike+0.058
 Under+0.057
One+0.054
Three+0.052
Neg logit contributions
ums-0.147
atted-0.140
edient-0.133
ucker-0.132
ieties-0.132
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
 park+0.085
 picnic+0.083
 hike+0.077
 library+0.073
Hi+0.072
Neg logit contributions
ums-0.148
asing-0.146
atted-0.146
cks-0.141
olds-0.141
Token One
Token ID1881
Feature activation-7.60e-02

Pos logit contributions
 park+0.053
 picnic+0.053
 ride+0.050
 festival+0.050
 hike+0.049
Neg logit contributions
ums-0.083
atted-0.077
olds-0.073
asing-0.073
ectar-0.072
Token day
Token ID1110
Feature activation-6.45e-02

Pos logit contributions
 park+0.103
 picnic+0.094
 store+0.092
 library+0.092
 museum+0.089
Neg logit contributions
ums-0.182
atted-0.176
asing-0.176
othe-0.171
edient-0.170
Token friends
Token ID2460
Feature activation-1.15e-03

Pos logit contributions
 park+0.003
 picnic+0.002
 forest+0.002
 store+0.002
 hike+0.002
Neg logit contributions
ums-0.003
 supervision-0.003
asing-0.003
atted-0.003
aged-0.003
Token again
Token ID757
Feature activation+2.55e-02

Pos logit contributions
 park+0.002
 picnic+0.002
 museum+0.002
 hike+0.002
 forest+0.002
Neg logit contributions
 supervision-0.003
asing-0.003
ums-0.003
ging-0.003
atted-0.003
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
ums+0.065
 supervision+0.065
asing+0.064
ging+0.063
atted+0.062
Neg logit contributions
 park-0.037
 picnic-0.033
 hike-0.032
 museum-0.032
 library-0.031
Token<|endoftext|>
Token ID50256
Feature activation-8.41e-02

Pos logit contributions
ums+0.032
asing+0.031
atted+0.031
 supervision+0.031
olds+0.030
Neg logit contributions
 park-0.016
 picnic-0.013
 forest-0.013
 store-0.013
 hike-0.013
TokenOn
Token ID2202
Feature activation-8.34e-02

Pos logit contributions
 picnic+0.155
 party+0.145
 hike+0.145
 race+0.140
 walk+0.140
Neg logit contributions
ums-0.160
atted-0.149
olds-0.148
ins-0.145
ging-0.145
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 park+0.104
 museum+0.100
 peek+0.099
 library+0.097
 forest+0.095
Neg logit contributions
ums-0.197
 supervision-0.196
atted-0.192
othe-0.184
cks-0.184
Token bright
Token ID6016
Feature activation-8.41e-02

Pos logit contributions
 museum+0.172
 park+0.168
 theater+0.163
 library+0.163
 chamber+0.161
Neg logit contributions
othe-0.277
leness-0.267
cking-0.265
 supervision-0.265
ging-0.263
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 park+0.100
 library+0.100
 museum+0.099
 store+0.094
 picnic+0.093
Neg logit contributions
atted-0.210
ums-0.191
ams-0.185
ging-0.183
othe-0.180
Token sunny
Token ID27737
Feature activation-6.04e-02

Pos logit contributions
 park+0.052
 picnic+0.048
 library+0.048
 hike+0.047
 museum+0.046
Neg logit contributions
asing-0.084
ams-0.084
ging-0.083
aged-0.081
 supervision-0.081
Token day
Token ID1110
Feature activation-7.98e-02

Pos logit contributions
 museum+0.084
 library+0.083
 park+0.077
 store+0.076
 Under+0.076
Neg logit contributions
atted-0.129
ums-0.124
aged-0.123
ging-0.120
ams-0.118
Token,
Token ID11
Feature activation-6.85e-02

Pos logit contributions
 park+0.125
 library+0.119
 store+0.108
 museum+0.108
 forest+0.107
Neg logit contributions
atted-0.189
othe-0.188
aged-0.184
ums-0.184
ging-0.183
Token upon
Token ID2402
Feature activation-1.01e-01

Pos logit contributions
 park+0.116
 picnic+0.101
 forest+0.098
 store+0.097
 library+0.094
Neg logit contributions
ums-0.238
atted-0.236
asing-0.234
 supervision-0.232
olds-0.225
Token a
Token ID257
Feature activation-1.28e-01

Pos logit contributions
 park+0.115
 forest+0.112
 museum+0.110
 hike+0.110
 festival+0.108
Neg logit contributions
ums-0.248
asing-0.248
cks-0.247
aged-0.246
olds-0.245
Token time
Token ID640
Feature activation-8.05e-02

Pos logit contributions
 park+0.128
 store+0.114
 forest+0.112
 museum+0.110
 library+0.105
Neg logit contributions
ums-0.373
atted-0.366
asing-0.365
 supervision-0.357
cks-0.350
Token,
Token ID11
Feature activation-9.91e-02

Pos logit contributions
 park+0.092
 picnic+0.077
 store+0.076
 forest+0.075
 library+0.073
Neg logit contributions
ums-0.238
atted-0.234
asing-0.231
 supervision-0.227
olds-0.224
Token in
Token ID287
Feature activation-7.97e-02

Pos logit contributions
 park+0.116
 picnic+0.103
 library+0.103
 hike+0.101
 forest+0.099
Neg logit contributions
atted-0.276
asing-0.276
ums-0.273
aged-0.273
ging-0.269
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 park+0.067
 forest+0.055
 museum+0.054
 store+0.052
 picnic+0.050
Neg logit contributions
ums-0.246
atted-0.240
 supervision-0.239
asing-0.239
aged-0.236
Token small
Token ID1402
Feature activation-8.54e-02

Pos logit contributions
 tr+0.132
 hike+0.126
 museum+0.125
 arrived+0.124
 park+0.124
Neg logit contributions
cking-0.348
ums-0.347
cks-0.346
aged-0.337
 supervision-0.337
Token town
Token ID3240
Feature activation-8.43e-02

Pos logit contributions
 hike+0.093
 arrived+0.091
 ride+0.090
Hi+0.088
 picnic+0.086
Neg logit contributions
ums-0.208
atted-0.187
edient-0.182
aged-0.180
ang-0.177
Token,
Token ID11
Feature activation-8.11e-02

Pos logit contributions
 park+0.101
 picnic+0.097
 hike+0.092
 forest+0.082
 ride+0.082
Neg logit contributions
ums-0.238
atted-0.233
aged-0.224
asing-0.223
olds-0.223
Token there
Token ID612
Feature activation-9.15e-02

Pos logit contributions
 park+0.097
 picnic+0.092
 party+0.090
 hike+0.088
 beach+0.085
Neg logit contributions
aged-0.215
atted-0.208
asing-0.206
othe-0.206
cks-0.203
Token lived
Token ID5615
Feature activation-9.77e-02

Pos logit contributions
 park+0.135
 picnic+0.132
 hike+0.127
 library+0.123
 store+0.122
Neg logit contributions
ums-0.216
atted-0.211
aged-0.206
asing-0.204
 supervision-0.196
Token upon
Token ID2402
Feature activation-1.01e-01

Pos logit contributions
 park+0.113
 picnic+0.097
 forest+0.095
 store+0.093
 museum+0.090
Neg logit contributions
ums-0.242
atted-0.240
asing-0.239
 supervision-0.237
olds-0.230
Token a
Token ID257
Feature activation-1.29e-01

Pos logit contributions
 hike+0.116
 forest+0.115
 park+0.114
 museum+0.114
 festival+0.111
Neg logit contributions
cks-0.243
ums-0.243
olds-0.241
asing-0.241
aged-0.239
Token time
Token ID640
Feature activation-7.62e-02

Pos logit contributions
 park+0.127
 store+0.112
 forest+0.110
 museum+0.108
 library+0.102
Neg logit contributions
ums-0.378
atted-0.371
asing-0.370
 supervision-0.362
cks-0.357
Token,
Token ID11
Feature activation-9.76e-02

Pos logit contributions
 park+0.085
 picnic+0.071
 store+0.070
 forest+0.070
 library+0.068
Neg logit contributions
ums-0.227
atted-0.224
asing-0.221
 supervision-0.217
olds-0.214
Token in
Token ID287
Feature activation-7.98e-02

Pos logit contributions
 park+0.110
 library+0.098
 hike+0.096
 picnic+0.096
 forest+0.094
Neg logit contributions
atted-0.277
asing-0.276
aged-0.272
ums-0.268
cks-0.267
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.072
 museum+0.063
 forest+0.062
 store+0.059
 hike+0.058
Neg logit contributions
ums-0.234
atted-0.228
 supervision-0.227
asing-0.227
cks-0.226
Token big
Token ID1263
Feature activation-9.60e-02

Pos logit contributions
 tr+0.131
 ride+0.125
 hike+0.124
 arrived+0.120
 museum+0.118
Neg logit contributions
cks-0.346
cking-0.345
ums-0.345
atted-0.331
aged-0.330
Token green
Token ID4077
Feature activation-9.02e-02

Pos logit contributions
 ride+0.082
 hike+0.082
 arrived+0.080
Hi+0.080
 picnic+0.079
Neg logit contributions
ums-0.240
atted-0.235
ams-0.216
cks-0.216
 supervision-0.216
Token forest
Token ID8222
Feature activation-9.74e-02

Pos logit contributions
 hike+0.086
Hi+0.086
 Under+0.085
 arrived+0.085
 ride+0.082
Neg logit contributions
ums-0.211
atted-0.182
 throats-0.179
alks-0.176
ucking-0.175
Token,
Token ID11
Feature activation-7.45e-02

Pos logit contributions
 park+0.117
 picnic+0.108
 hike+0.098
 library+0.097
 ride+0.096
Neg logit contributions
ums-0.277
atted-0.274
aged-0.269
asing-0.264
othe-0.261
Token there
Token ID612
Feature activation-8.69e-02

Pos logit contributions
 park+0.084
 picnic+0.078
 party+0.076
 ride+0.074
 hike+0.073
Neg logit contributions
aged-0.200
atted-0.198
othe-0.197
cks-0.196
asing-0.194
Token Lily
Token ID20037
Feature activation-3.37e-02

Pos logit contributions
 park+0.054
 forest+0.052
 museum+0.048
 hike+0.048
 stadium+0.048
Neg logit contributions
asing-0.068
 supervision-0.068
ums-0.068
atted-0.068
olds-0.067
Token.
Token ID13
Feature activation-3.57e-02

Pos logit contributions
 park+0.042
 picnic+0.038
 hike+0.035
 forest+0.035
 store+0.034
Neg logit contributions
asing-0.094
ums-0.094
 supervision-0.093
atted-0.093
olds-0.090
Token She
Token ID1375
Feature activation-4.32e-02

Pos logit contributions
 park+0.064
 picnic+0.061
 hike+0.060
 forest+0.058
 ride+0.058
Neg logit contributions
ums-0.079
asing-0.075
olds-0.074
atted-0.073
othe-0.071
Token lived
Token ID5615
Feature activation-7.55e-02

Pos logit contributions
 park+0.073
 picnic+0.068
 hike+0.066
 museum+0.065
 store+0.064
Neg logit contributions
aged-0.096
atted-0.095
asing-0.095
ums-0.094
ages-0.090
Token near
Token ID1474
Feature activation-5.21e-02

Pos logit contributions
 park+0.102
 picnic+0.097
 hike+0.096
 museum+0.090
 library+0.090
Neg logit contributions
ums-0.198
asing-0.197
aged-0.194
atted-0.192
ging-0.188
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.050
 hike+0.048
 picnic+0.047
 arrived+0.044
 forest+0.044
Neg logit contributions
ums-0.149
asing-0.143
 supervision-0.142
olds-0.141
aged-0.138
Token big
Token ID1263
Feature activation-8.90e-02

Pos logit contributions
 arrived+0.113
 hike+0.113
Hi+0.109
 Under+0.106
 One+0.105
Neg logit contributions
ucker-0.333
ums-0.331
othe-0.312
inging-0.308
edient-0.308
Token river
Token ID7850
Feature activation-7.72e-02

Pos logit contributions
Hi+0.086
 peek+0.084
One+0.083
 hike+0.082
 Under+0.082
Neg logit contributions
ums-0.212
atted-0.201
alks-0.198
 supervision-0.195
ging-0.194
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
 park+0.112
 picnic+0.104
 hike+0.097
 forest+0.095
 library+0.095
Neg logit contributions
ums-0.198
asing-0.194
atted-0.194
 supervision-0.187
olds-0.187
Token One
Token ID1881
Feature activation-7.82e-02

Pos logit contributions
 hike+0.053
 picnic+0.052
 park+0.051
 forest+0.050
 festival+0.050
Neg logit contributions
ums-0.066
olds-0.063
atted-0.061
othe-0.059
asing-0.059
Token day
Token ID1110
Feature activation-6.37e-02

Pos logit contributions
 park+0.092
 picnic+0.080
 library+0.078
 store+0.077
 forest+0.074
Neg logit contributions
ums-0.218
asing-0.215
atted-0.215
 supervision-0.206
olds-0.205
Token Lily
Token ID20037
Feature activation-3.16e-02

Pos logit contributions
 park+0.052
 forest+0.051
 hike+0.048
 stadium+0.047
 cabin+0.046
Neg logit contributions
ums-0.062
asing-0.061
atted-0.061
olds-0.061
 supervision-0.059
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
 park+0.040
 picnic+0.037
 hike+0.034
 forest+0.033
 store+0.033
Neg logit contributions
asing-0.087
ums-0.087
 supervision-0.086
atted-0.086
olds-0.083
Token She
Token ID1375
Feature activation-3.95e-02

Pos logit contributions
 park+0.058
 hike+0.057
 picnic+0.056
 forest+0.054
 festival+0.053
Neg logit contributions
ums-0.069
olds-0.065
asing-0.064
atted-0.064
othe-0.064
Token lived
Token ID5615
Feature activation-7.47e-02

Pos logit contributions
 park+0.067
 picnic+0.063
 hike+0.061
 forest+0.059
 store+0.058
Neg logit contributions
aged-0.088
ums-0.088
asing-0.088
atted-0.087
olds-0.083
Token near
Token ID1474
Feature activation-5.06e-02

Pos logit contributions
 park+0.098
 picnic+0.092
 hike+0.091
 museum+0.085
 library+0.084
Neg logit contributions
ums-0.200
asing-0.199
aged-0.195
atted-0.194
ging-0.189
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 hike+0.052
 park+0.051
 picnic+0.050
 arrived+0.048
 forest+0.047
Neg logit contributions
ums-0.139
olds-0.132
asing-0.132
 supervision-0.131
aged-0.128
Token small
Token ID1402
Feature activation-7.47e-02

Pos logit contributions
 arrived+0.119
 hike+0.117
 Under+0.105
 One+0.105
 picnic+0.104
Neg logit contributions
ucker-0.332
ums-0.331
othe-0.313
inging-0.308
ectar-0.305
Token creek
Token ID39459
Feature activation-7.39e-02

Pos logit contributions
 hike+0.093
 arrived+0.088
 Under+0.080
 ride+0.080
 picnic+0.078
Neg logit contributions
ums-0.172
edient-0.151
atted-0.151
ucker-0.150
alks-0.149
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
 park+0.108
 picnic+0.106
 hike+0.101
 library+0.093
 forest+0.093
Neg logit contributions
asing-0.180
ums-0.179
atted-0.179
olds-0.173
aged-0.169
Token One
Token ID1881
Feature activation-7.77e-02

Pos logit contributions
 hike+0.048
 picnic+0.047
 park+0.046
 festival+0.045
 arrived+0.044
Neg logit contributions
ums-0.065
olds-0.062
atted-0.060
othe-0.060
asing-0.059
Token day
Token ID1110
Feature activation-6.38e-02

Pos logit contributions
 park+0.094
 picnic+0.084
 library+0.081
 store+0.081
 museum+0.079
Neg logit contributions
ums-0.208
asing-0.205
atted-0.204
olds-0.195
aged-0.195
Token named
Token ID3706
Feature activation-2.89e-02

Pos logit contributions
 park+0.056
 picnic+0.050
 hike+0.048
 forest+0.048
 store+0.047
Neg logit contributions
ums-0.095
asing-0.094
atted-0.093
 supervision-0.093
olds-0.089
Token Lily
Token ID20037
Feature activation-2.93e-02

Pos logit contributions
 park+0.049
 forest+0.047
 hike+0.046
 stadium+0.045
 museum+0.045
Neg logit contributions
 supervision-0.063
ums-0.061
atted-0.060
asing-0.060
olds-0.060
Token who
Token ID508
Feature activation-2.58e-02

Pos logit contributions
 park+0.035
 picnic+0.030
 forest+0.028
 store+0.028
 hike+0.028
Neg logit contributions
ums-0.085
asing-0.085
 supervision-0.084
atted-0.084
olds-0.081
Token lived
Token ID5615
Feature activation-7.53e-02

Pos logit contributions
 park+0.044
 picnic+0.040
 hike+0.038
 forest+0.038
 store+0.037
Neg logit contributions
asing-0.060
ums-0.060
atted-0.059
 supervision-0.058
aged-0.058
Token near
Token ID1474
Feature activation-5.09e-02

Pos logit contributions
 park+0.101
 picnic+0.095
 hike+0.094
 library+0.091
 museum+0.090
Neg logit contributions
ums-0.204
asing-0.199
aged-0.197
atted-0.196
ging-0.193
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.060
 hike+0.057
 picnic+0.056
 arrived+0.054
 forest+0.053
Neg logit contributions
ums-0.136
 supervision-0.131
asing-0.128
olds-0.128
atted-0.126
Token big
Token ID1263
Feature activation-8.53e-02

Pos logit contributions
 arrived+0.117
 hike+0.116
 Under+0.115
 picnic+0.106
 One+0.105
Neg logit contributions
ucker-0.309
ums-0.308
edient-0.299
othe-0.297
 wasteful-0.292
Token,
Token ID11
Feature activation-3.97e-02

Pos logit contributions
 peek+0.084
One+0.081
 hike+0.081
 arrived+0.081
 Under+0.079
Neg logit contributions
ums-0.201
atted-0.190
 supervision-0.189
alks-0.187
ging-0.184
Token scary
Token ID14343
Feature activation-5.26e-02

Pos logit contributions
 park+0.065
 hike+0.062
 picnic+0.060
 arrived+0.059
 theater+0.058
Neg logit contributions
ging-0.078
 supervision-0.074
asing-0.074
edient-0.074
 wasteful-0.073
Token volcano
Token ID32410
Feature activation-3.70e-02

Pos logit contributions
One+0.068
On+0.063
Inside+0.062
 Under+0.061
 picnic+0.060
Neg logit contributions
ums-0.114
ging-0.108
atted-0.107
aged-0.102
 throats-0.102
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
 park+0.055
 picnic+0.054
 hike+0.050
 library+0.049
 nap+0.048
Neg logit contributions
atted-0.087
ums-0.085
asing-0.085
cks-0.084
olds-0.083
Token went
Token ID1816
Feature activation-1.03e-02

Pos logit contributions
 park+0.009
 picnic+0.008
 store+0.008
 forest+0.008
 museum+0.008
Neg logit contributions
asing-0.017
atted-0.017
ums-0.017
 supervision-0.017
aged-0.016
Token fishing
Token ID12478
Feature activation-3.13e-02

Pos logit contributions
 park+0.013
 picnic+0.012
 library+0.012
 museum+0.012
 hike+0.011
Neg logit contributions
ums-0.028
atted-0.027
asing-0.027
ging-0.027
 supervision-0.027
Token.
Token ID13
Feature activation+7.91e-03

Pos logit contributions
 park+0.043
 picnic+0.038
 library+0.037
 store+0.036
 forest+0.036
Neg logit contributions
asing-0.082
ums-0.082
 supervision-0.081
atted-0.081
ging-0.078
Token<|endoftext|>
Token ID50256
Feature activation-8.59e-02

Pos logit contributions
asing+0.027
atted+0.026
 supervision+0.026
ums+0.026
ging+0.026
Neg logit contributions
 park-0.007
 forest-0.005
 store-0.004
 library-0.004
 picnic-0.004
TokenOn
Token ID2202
Feature activation-8.38e-02

Pos logit contributions
 picnic+0.152
 hike+0.145
 party+0.143
 race+0.139
 walk+0.139
Neg logit contributions
ums-0.173
atted-0.159
ging-0.157
ins-0.156
olds-0.155
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.106
 museum+0.105
 peek+0.099
 forest+0.098
 library+0.096
Neg logit contributions
 supervision-0.199
ums-0.198
atted-0.194
othe-0.186
cks-0.183
Token bright
Token ID6016
Feature activation-8.20e-02

Pos logit contributions
 museum+0.164
 park+0.161
 theater+0.157
 library+0.154
 chamber+0.152
Neg logit contributions
othe-0.290
leness-0.285
 supervision-0.285
ging-0.284
cking-0.283
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
 park+0.096
 library+0.095
 museum+0.093
 store+0.088
 picnic+0.085
Neg logit contributions
atted-0.210
ums-0.196
ging-0.190
ams-0.189
leness-0.186
Token sunny
Token ID27737
Feature activation-5.82e-02

Pos logit contributions
 park+0.049
 museum+0.045
 picnic+0.045
 hike+0.045
 library+0.044
Neg logit contributions
asing-0.085
ging-0.085
aged-0.084
 supervision-0.083
ams-0.083
Token day
Token ID1110
Feature activation-7.99e-02

Pos logit contributions
 library+0.079
 museum+0.076
 park+0.074
 peek+0.071
 university+0.071
Neg logit contributions
atted-0.131
ums-0.127
aged-0.124
ging-0.124
ams-0.122
Token,
Token ID11
Feature activation-6.90e-02

Pos logit contributions
 park+0.116
 library+0.106
 museum+0.105
 store+0.100
 forest+0.099
Neg logit contributions
atted-0.199
ums-0.197
aged-0.195
othe-0.194
ging-0.192
Token Max
Token ID5436
Feature activation-2.76e-02

Pos logit contributions
 park+0.059
 forest+0.057
 museum+0.056
 valley+0.055
 store+0.055
Neg logit contributions
 supervision-0.052
othe-0.049
ums-0.048
asing-0.046
cks-0.045
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
 park+0.041
 picnic+0.040
 hike+0.037
 festival+0.036
 forest+0.035
Neg logit contributions
asing-0.067
ums-0.066
atted-0.066
 supervision-0.066
cks-0.065
Token Max
Token ID5436
Feature activation-1.10e-02

Pos logit contributions
 park+0.068
 picnic+0.065
 beach+0.064
 store+0.062
 festival+0.061
Neg logit contributions
ums-0.073
asing-0.072
othe-0.071
 supervision-0.069
olds-0.068
Token lived
Token ID5615
Feature activation-7.54e-02

Pos logit contributions
 park+0.019
 picnic+0.018
 hike+0.017
 festival+0.017
 store+0.017
Neg logit contributions
atted-0.022
aged-0.022
ums-0.022
asing-0.022
edient-0.021
Token near
Token ID1474
Feature activation-5.13e-02

Pos logit contributions
 park+0.107
 picnic+0.105
 hike+0.101
 library+0.098
 museum+0.097
Neg logit contributions
asing-0.191
aged-0.189
ums-0.189
atted-0.185
ging-0.183
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 picnic+0.049
 park+0.047
 hike+0.044
 arrived+0.044
 party+0.043
Neg logit contributions
ums-0.145
 supervision-0.142
asing-0.142
olds-0.140
cks-0.136
Token big
Token ID1263
Feature activation-8.77e-02

Pos logit contributions
 arrived+0.126
 Under+0.109
 picnic+0.108
 hike+0.107
Hi+0.104
Neg logit contributions
ucker-0.316
inging-0.315
ums-0.314
edient-0.306
othe-0.304
Token river
Token ID7850
Feature activation-7.67e-02

Pos logit contributions
 arrived+0.081
 peek+0.075
 picnic+0.072
 hike+0.072
Hi+0.072
Neg logit contributions
ums-0.216
 supervision-0.205
atted-0.201
ging-0.201
alks-0.195
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
 park+0.108
 picnic+0.103
 hike+0.095
 library+0.093
 forest+0.091
Neg logit contributions
ums-0.196
asing-0.193
atted-0.192
olds-0.187
aged-0.185
Token One
Token ID1881
Feature activation-7.76e-02

Pos logit contributions
 picnic+0.057
 park+0.055
 party+0.054
 festival+0.053
 beach+0.052
Neg logit contributions
ums-0.070
olds-0.068
othe-0.067
asing-0.065
atted-0.064
Token day
Token ID1110
Feature activation-6.65e-02

Pos logit contributions
 park+0.093
 picnic+0.085
 library+0.082
 store+0.081
 museum+0.078
Neg logit contributions
ums-0.207
asing-0.205
atted-0.204
aged-0.196
 supervision-0.196
Token little
Token ID1310
Feature activation-4.87e-02

Pos logit contributions
 hike+0.121
 park+0.118
 arrived+0.116
 picnic+0.107
 forest+0.100
Neg logit contributions
ums-0.301
ging-0.287
ucker-0.277
edient-0.277
othe-0.276
Token girl
Token ID2576
Feature activation-3.74e-02

Pos logit contributions
 park+0.067
 hike+0.064
 arrived+0.060
 picnic+0.060
 Under+0.057
Neg logit contributions
ums-0.119
atted-0.115
ging-0.114
asing-0.113
 supervision-0.111
Token who
Token ID508
Feature activation-2.65e-02

Pos logit contributions
 park+0.060
 picnic+0.055
 hike+0.053
 forest+0.052
 museum+0.051
Neg logit contributions
ums-0.092
asing-0.091
atted-0.091
 supervision-0.090
olds-0.087
Token lived
Token ID5615
Feature activation-7.53e-02

Pos logit contributions
 park+0.047
 picnic+0.044
 hike+0.043
 forest+0.041
 store+0.041
Neg logit contributions
ums-0.056
atted-0.055
asing-0.055
aged-0.055
olds-0.053
Token near
Token ID1474
Feature activation-5.21e-02

Pos logit contributions
 park+0.099
 picnic+0.096
 hike+0.094
 library+0.089
 museum+0.088
Neg logit contributions
ums-0.204
asing-0.201
atted-0.198
aged-0.198
ging-0.195
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.052
 picnic+0.048
 hike+0.048
 forest+0.044
 arrived+0.043
Neg logit contributions
ums-0.151
 supervision-0.146
asing-0.144
olds-0.142
atted-0.142
Token tunnel
Token ID13275
Feature activation-7.48e-02

Pos logit contributions
 arrived+0.128
 hike+0.125
 Under+0.115
 picnic+0.112
 ride+0.111
Neg logit contributions
ums-0.322
edient-0.311
ucker-0.311
othe-0.306
 wasteful-0.299
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
 picnic+0.109
 park+0.109
 hike+0.102
 camp+0.098
 library+0.096
Neg logit contributions
atted-0.178
ums-0.175
asing-0.173
aged-0.169
othe-0.166
Token She
Token ID1375
Feature activation-3.49e-02

Pos logit contributions
 picnic+0.055
 park+0.052
 hike+0.051
 camp+0.050
 party+0.049
Neg logit contributions
ums-0.058
olds-0.055
atted-0.053
 supervision-0.052
asing-0.052
Token had
Token ID550
Feature activation-5.64e-02

Pos logit contributions
 picnic+0.057
 park+0.056
 hike+0.051
 nap+0.050
 festival+0.050
Neg logit contributions
atted-0.077
aged-0.075
asing-0.074
ums-0.073
olds-0.070
Token a
Token ID257
Feature activation-8.71e-02

Pos logit contributions
 hike+0.060
 park+0.060
 festival+0.059
 picnic+0.059
 museum+0.057
Neg logit contributions
aged-0.155
asing-0.145
ums-0.145
atted-0.144
olds-0.142
Token upon
Token ID2402
Feature activation-1.00e-01

Pos logit contributions
 park+0.107
 picnic+0.092
 forest+0.090
 store+0.088
 hike+0.085
Neg logit contributions
ums-0.232
atted-0.230
asing-0.229
 supervision-0.228
olds-0.220
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
 forest+0.117
 park+0.115
 hike+0.114
 museum+0.112
 festival+0.110
Neg logit contributions
cks-0.238
ums-0.236
asing-0.236
aged-0.236
olds-0.234
Token time
Token ID640
Feature activation-7.61e-02

Pos logit contributions
 park+0.122
 store+0.108
 forest+0.106
 museum+0.102
 library+0.096
Neg logit contributions
ums-0.371
atted-0.364
asing-0.363
 supervision-0.356
olds-0.349
Token,
Token ID11
Feature activation-9.72e-02

Pos logit contributions
 park+0.086
 picnic+0.071
 store+0.070
 forest+0.070
 library+0.068
Neg logit contributions
ums-0.226
atted-0.223
asing-0.221
 supervision-0.217
olds-0.213
Token in
Token ID287
Feature activation-7.83e-02

Pos logit contributions
 park+0.118
 library+0.109
 hike+0.105
 picnic+0.105
 forest+0.103
Neg logit contributions
atted-0.267
aged-0.262
asing-0.261
ging-0.261
cks-0.258
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
 park+0.072
 forest+0.062
 museum+0.061
 store+0.059
 library+0.057
Neg logit contributions
ums-0.228
atted-0.222
 supervision-0.222
cks-0.221
asing-0.219
Token thick
Token ID6546
Feature activation-8.87e-02

Pos logit contributions
 tr+0.126
 hike+0.124
 museum+0.121
 ride+0.121
 park+0.120
Neg logit contributions
ums-0.351
cking-0.350
cks-0.349
atted-0.339
 supervision-0.339
Token forest
Token ID8222
Feature activation-9.83e-02

Pos logit contributions
Hi+0.066
 Under+0.064
 hike+0.061
 tr+0.057
 picnic+0.057
Neg logit contributions
ums-0.235
atted-0.228
aged-0.214
ieties-0.213
ams-0.208
Token,
Token ID11
Feature activation-8.06e-02

Pos logit contributions
 park+0.116
 picnic+0.106
 library+0.096
 hike+0.096
 store+0.094
Neg logit contributions
ums-0.282
atted-0.279
aged-0.271
asing-0.268
olds-0.265
Token there
Token ID612
Feature activation-8.84e-02

Pos logit contributions
 park+0.093
 picnic+0.086
 party+0.084
 library+0.081
 hike+0.079
Neg logit contributions
aged-0.216
atted-0.216
cks-0.215
othe-0.214
asing-0.211
Token lived
Token ID5615
Feature activation-9.40e-02

Pos logit contributions
 park+0.121
 picnic+0.115
 hike+0.110
 library+0.110
 store+0.107
Neg logit contributions
ums-0.225
atted-0.219
asing-0.215
aged-0.214
cks-0.206
Token upon
Token ID2402
Feature activation-1.00e-01

Pos logit contributions
 park+0.111
 picnic+0.095
 forest+0.092
 store+0.091
 museum+0.088
Neg logit contributions
ums-0.241
asing-0.239
atted-0.238
 supervision-0.237
olds-0.228
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
 park+0.114
 forest+0.114
 museum+0.110
 hike+0.110
 festival+0.107
Neg logit contributions
cks-0.244
asing-0.244
aged-0.242
ums-0.239
olds-0.236
Token time
Token ID640
Feature activation-7.53e-02

Pos logit contributions
 park+0.128
 store+0.116
 forest+0.113
 museum+0.109
 library+0.103
Neg logit contributions
ums-0.361
atted-0.355
asing-0.353
 supervision-0.344
olds-0.339
Token,
Token ID11
Feature activation-9.57e-02

Pos logit contributions
 park+0.082
 picnic+0.069
 store+0.067
 forest+0.067
 library+0.064
Neg logit contributions
ums-0.226
atted-0.223
asing-0.222
 supervision-0.218
olds-0.213
Token in
Token ID287
Feature activation-7.69e-02

Pos logit contributions
 park+0.119
 library+0.107
 picnic+0.107
 hike+0.103
 forest+0.103
Neg logit contributions
asing-0.262
atted-0.262
aged-0.258
ging-0.254
cks-0.253
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 park+0.070
 forest+0.059
 museum+0.058
 store+0.058
 picnic+0.055
Neg logit contributions
ums-0.225
 supervision-0.221
asing-0.220
atted-0.219
cks-0.219
Token big
Token ID1263
Feature activation-9.65e-02

Pos logit contributions
 tr+0.136
 hike+0.135
 arrived+0.133
 picnic+0.130
 museum+0.129
Neg logit contributions
cking-0.336
ums-0.334
cks-0.333
aged-0.321
 supervision-0.319
Token forest
Token ID8222
Feature activation-9.99e-02

Pos logit contributions
 arrived+0.084
 hike+0.084
 peek+0.080
 ride+0.080
Hi+0.079
Neg logit contributions
ums-0.243
atted-0.233
 supervision-0.219
cks-0.217
alks-0.214
Token,
Token ID11
Feature activation-7.90e-02

Pos logit contributions
 park+0.119
 picnic+0.111
 hike+0.100
 store+0.099
 library+0.098
Neg logit contributions
ums-0.285
atted-0.281
aged-0.274
asing-0.272
othe-0.267
Token there
Token ID612
Feature activation-8.62e-02

Pos logit contributions
 park+0.088
 picnic+0.082
 party+0.078
 library+0.075
 hike+0.074
Neg logit contributions
aged-0.218
asing-0.217
atted-0.217
cks-0.214
othe-0.212
Token was
Token ID373
Feature activation-7.33e-02

Pos logit contributions
 park+0.118
 picnic+0.112
 hike+0.106
 store+0.104
 library+0.104
Neg logit contributions
ums-0.221
atted-0.215
asing-0.214
aged-0.210
 supervision-0.203
Token,
Token ID11
Feature activation-7.82e-02

Pos logit contributions
 park+0.111
 library+0.100
 forest+0.099
 museum+0.093
 store+0.093
Neg logit contributions
atted-0.202
ging-0.199
ums-0.199
aged-0.197
othe-0.189
Token Jack
Token ID3619
Feature activation-4.17e-02

Pos logit contributions
 park+0.094
 library+0.089
 party+0.084
 forest+0.084
 hike+0.083
Neg logit contributions
ging-0.216
aged-0.209
atted-0.208
ums-0.204
othe-0.202
Token was
Token ID373
Feature activation-6.55e-02

Pos logit contributions
 park+0.052
 picnic+0.048
 museum+0.047
 festival+0.046
 hike+0.046
Neg logit contributions
atted-0.110
ums-0.107
asing-0.105
 supervision-0.105
aged-0.105
Token walking
Token ID6155
Feature activation-8.70e-02

Pos logit contributions
 park+0.088
 picnic+0.075
 forest+0.074
 store+0.073
 hike+0.071
Neg logit contributions
ums-0.182
asing-0.181
 supervision-0.179
atted-0.179
olds-0.172
Token near
Token ID1474
Feature activation-5.72e-02

Pos logit contributions
 library+0.142
 park+0.136
 picnic+0.134
 museum+0.133
 festival+0.132
Neg logit contributions
ging-0.189
atted-0.183
ums-0.183
aged-0.180
asing-0.176
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 picnic+0.081
 park+0.080
 festival+0.079
 library+0.079
 hike+0.079
Neg logit contributions
 supervision-0.129
ums-0.129
atted-0.126
cks-0.125
asing-0.123
Token lake
Token ID13546
Feature activation-6.83e-02

Pos logit contributions
 arrived+0.120
 Under+0.117
 set+0.105
 Near+0.102
 hike+0.102
Neg logit contributions
ucker-0.285
inging-0.285
ums-0.280
edient-0.280
cking-0.279
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
 park+0.095
 picnic+0.092
 library+0.089
 hike+0.086
 festival+0.084
Neg logit contributions
ums-0.172
atted-0.167
asing-0.162
olds-0.158
ging-0.158
Token Jack
Token ID3619
Feature activation-2.34e-02

Pos logit contributions
 picnic+0.055
 park+0.055
 festival+0.054
 party+0.053
 library+0.051
Neg logit contributions
ums-0.059
atted-0.055
alks-0.052
aged-0.052
anked-0.052
Token saw
Token ID2497
Feature activation-6.61e-02

Pos logit contributions
 park+0.031
 festival+0.029
 picnic+0.028
 library+0.028
 museum+0.028
Neg logit contributions
atted-0.061
aged-0.058
asing-0.057
ums-0.057
ging-0.056
Token a
Token ID257
Feature activation-1.00e-01

Pos logit contributions
 hike+0.082
 picnic+0.079
 library+0.079
 festival+0.078
 walk+0.078
Neg logit contributions
atted-0.163
ums-0.158
asing-0.156
aged-0.155
olds-0.153
Token upon
Token ID2402
Feature activation-9.83e-02

Pos logit contributions
 park+0.104
 picnic+0.088
 forest+0.086
 store+0.085
 hike+0.082
Neg logit contributions
ums-0.232
asing-0.231
atted-0.229
 supervision-0.229
olds-0.220
Token a
Token ID257
Feature activation-1.27e-01

Pos logit contributions
 hike+0.114
 forest+0.114
 park+0.113
 festival+0.112
 museum+0.111
Neg logit contributions
cks-0.236
asing-0.235
ums-0.234
aged-0.234
olds-0.232
Token time
Token ID640
Feature activation-7.61e-02

Pos logit contributions
 park+0.124
 store+0.110
 forest+0.107
 museum+0.105
 library+0.099
Neg logit contributions
ums-0.374
atted-0.367
asing-0.367
 supervision-0.359
cks-0.353
Token,
Token ID11
Feature activation-9.72e-02

Pos logit contributions
 park+0.084
 picnic+0.070
 store+0.068
 forest+0.068
 library+0.066
Neg logit contributions
ums-0.228
atted-0.225
asing-0.224
 supervision-0.220
aged-0.216
Token in
Token ID287
Feature activation-7.64e-02

Pos logit contributions
 park+0.115
 library+0.103
 picnic+0.102
 hike+0.100
 forest+0.098
Neg logit contributions
asing-0.273
atted-0.272
aged-0.268
ums-0.265
ging-0.265
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 park+0.069
 forest+0.058
 museum+0.057
 store+0.056
 hike+0.054
Neg logit contributions
ums-0.227
 supervision-0.221
atted-0.221
asing-0.220
cks-0.219
Token small
Token ID1402
Feature activation-8.59e-02

Pos logit contributions
 tr+0.132
 hike+0.128
 park+0.126
 ride+0.126
 picnic+0.125
Neg logit contributions
cks-0.345
cking-0.345
ums-0.342
 supervision-0.333
aged-0.333
Token town
Token ID3240
Feature activation-8.09e-02

Pos logit contributions
 hike+0.090
 ride+0.088
 arrived+0.085
 tr+0.083
 picnic+0.083
Neg logit contributions
ums-0.211
atted-0.192
edient-0.188
aged-0.184
ucking-0.182
Token,
Token ID11
Feature activation-7.94e-02

Pos logit contributions
 park+0.095
 picnic+0.091
 hike+0.085
 nap+0.077
 ride+0.077
Neg logit contributions
ums-0.231
atted-0.225
aged-0.219
asing-0.218
olds-0.217
Token there
Token ID612
Feature activation-8.88e-02

Pos logit contributions
 park+0.098
 picnic+0.092
 party+0.090
 hike+0.088
 library+0.087
Neg logit contributions
aged-0.208
atted-0.204
asing-0.201
othe-0.200
cks-0.199
Token was
Token ID373
Feature activation-7.27e-02

Pos logit contributions
 park+0.126
 picnic+0.122
 hike+0.117
 library+0.116
 festival+0.113
Neg logit contributions
ums-0.216
atted-0.212
aged-0.207
asing-0.205
cks-0.198
Token upon
Token ID2402
Feature activation-1.00e-01

Pos logit contributions
 park+0.114
 picnic+0.099
 forest+0.095
 store+0.095
 library+0.092
Neg logit contributions
ums-0.238
atted-0.236
asing-0.235
 supervision-0.233
olds-0.225
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
 park+0.111
 museum+0.108
 hike+0.106
 festival+0.105
 forest+0.105
Neg logit contributions
cks-0.250
asing-0.248
ums-0.247
aged-0.246
olds-0.240
Token time
Token ID640
Feature activation-7.80e-02

Pos logit contributions
 park+0.120
 store+0.105
 forest+0.101
 museum+0.099
 library+0.094
Neg logit contributions
ums-0.376
asing-0.369
atted-0.368
 supervision-0.360
cks-0.355
Token,
Token ID11
Feature activation-9.84e-02

Pos logit contributions
 park+0.083
 picnic+0.068
 store+0.067
 forest+0.066
 library+0.063
Neg logit contributions
ums-0.237
atted-0.234
asing-0.234
 supervision-0.229
olds-0.224
Token in
Token ID287
Feature activation-7.81e-02

Pos logit contributions
 park+0.120
 library+0.107
 picnic+0.104
 hike+0.103
 museum+0.103
Neg logit contributions
asing-0.274
atted-0.272
aged-0.269
ging-0.266
cks-0.265
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 park+0.065
 museum+0.053
 forest+0.052
 store+0.050
 picnic+0.049
Neg logit contributions
ums-0.241
atted-0.235
asing-0.235
 supervision-0.235
aged-0.231
Token peaceful
Token ID12309
Feature activation-7.87e-02

Pos logit contributions
 tr+0.126
 museum+0.123
 hike+0.121
 ride+0.121
 arrived+0.120
Neg logit contributions
cking-0.345
cks-0.345
ums-0.342
 supervision-0.331
aged-0.330
Token town
Token ID3240
Feature activation-7.66e-02

Pos logit contributions
 Under+0.082
 museum+0.079
 vest+0.079
 arrived+0.074
 hike+0.074
Neg logit contributions
ums-0.160
atted-0.158
rim-0.151
ieties-0.149
aged-0.148
Token,
Token ID11
Feature activation-8.07e-02

Pos logit contributions
 park+0.095
 picnic+0.092
 hike+0.087
 nap+0.078
 festival+0.078
Neg logit contributions
ums-0.213
atted-0.205
aged-0.200
asing-0.199
olds-0.197
Token there
Token ID612
Feature activation-8.84e-02

Pos logit contributions
 park+0.107
 party+0.105
 picnic+0.100
 beach+0.100
 library+0.096
Neg logit contributions
aged-0.195
othe-0.189
atted-0.188
asing-0.186
cks-0.186
Token was
Token ID373
Feature activation-7.13e-02

Pos logit contributions
 park+0.129
 picnic+0.122
 hike+0.118
 library+0.116
 store+0.115
Neg logit contributions
ums-0.217
atted-0.210
asing-0.207
aged-0.206
cks-0.198
Token upon
Token ID2402
Feature activation-9.79e-02

Pos logit contributions
 park+0.109
 picnic+0.093
 forest+0.090
 store+0.089
 museum+0.086
Neg logit contributions
ums-0.234
asing-0.232
atted-0.232
 supervision-0.230
olds-0.222
Token a
Token ID257
Feature activation-1.24e-01

Pos logit contributions
 park+0.108
 museum+0.104
 hike+0.101
 forest+0.101
 store+0.100
Neg logit contributions
cks-0.248
ums-0.247
asing-0.246
aged-0.244
olds-0.240
Token time
Token ID640
Feature activation-7.68e-02

Pos logit contributions
 park+0.120
 store+0.107
 museum+0.101
 forest+0.101
 library+0.096
Neg logit contributions
ums-0.366
atted-0.359
asing-0.357
 supervision-0.350
cks-0.345
Token,
Token ID11
Feature activation-9.79e-02

Pos logit contributions
 park+0.086
 store+0.071
 picnic+0.071
 forest+0.069
 library+0.067
Neg logit contributions
ums-0.229
atted-0.225
asing-0.223
 supervision-0.219
olds-0.214
Token in
Token ID287
Feature activation-7.92e-02

Pos logit contributions
 park+0.111
 library+0.099
 picnic+0.098
 store+0.096
 hike+0.096
Neg logit contributions
atted-0.277
asing-0.276
ging-0.271
aged-0.271
ums-0.270
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 park+0.067
 museum+0.054
 forest+0.053
 store+0.053
 picnic+0.051
Neg logit contributions
ums-0.243
asing-0.237
atted-0.237
 supervision-0.236
cks-0.233
Token small
Token ID1402
Feature activation-8.42e-02

Pos logit contributions
 tr+0.133
 hike+0.130
 ride+0.130
 arrived+0.128
 museum+0.127
Neg logit contributions
cking-0.342
cks-0.338
ums-0.338
atted-0.324
aged-0.323
Token house
Token ID2156
Feature activation-9.76e-02

Pos logit contributions
 hike+0.092
 ride+0.091
 arrived+0.088
 Under+0.087
 picnic+0.084
Neg logit contributions
ums-0.201
atted-0.182
edient-0.176
ucking-0.173
ang-0.172
Token,
Token ID11
Feature activation-7.97e-02

Pos logit contributions
 hike+0.136
 picnic+0.136
 park+0.133
 tr+0.121
 camp+0.119
Neg logit contributions
ums-0.245
atted-0.238
aged-0.233
olds-0.224
asing-0.222
Token Tim
Token ID5045
Feature activation-4.00e-02

Pos logit contributions
 park+0.094
 picnic+0.089
 party+0.085
 hike+0.085
 festival+0.082
Neg logit contributions
aged-0.213
atted-0.208
asing-0.206
othe-0.206
cks-0.205
Token and
Token ID290
Feature activation-3.22e-02

Pos logit contributions
 picnic+0.072
 festival+0.070
 park+0.069
 hike+0.069
 museum+0.066
Neg logit contributions
atted-0.080
asing-0.079
ums-0.078
 supervision-0.078
cks-0.077
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
ums+0.024
atted+0.023
asing+0.023
 supervision+0.023
ging+0.023
Neg logit contributions
 park-0.011
 picnic-0.010
 store-0.010
 forest-0.009
 museum-0.009
Token They
Token ID1119
Feature activation+7.81e-03

Pos logit contributions
ums+0.060
asing+0.059
atted+0.059
 supervision+0.058
olds+0.056
Neg logit contributions
 park-0.038
 picnic-0.033
 forest-0.033
 store-0.032
 hike-0.032
Token felt
Token ID2936
Feature activation+3.19e-02

Pos logit contributions
atted+0.019
asing+0.018
ums+0.018
 supervision+0.018
aged+0.018
Neg logit contributions
 park-0.013
 picnic-0.012
 museum-0.011
 forest-0.011
 library-0.011
Token very
Token ID845
Feature activation+5.41e-02

Pos logit contributions
asing+0.085
ums+0.083
 supervision+0.081
atted+0.079
aged+0.076
Neg logit contributions
 park-0.051
 big-0.046
 forest-0.044
 picnic-0.043
 store-0.043
Token sad
Token ID6507
Feature activation+3.26e-02

Pos logit contributions
asing+0.137
 supervision+0.136
ums+0.132
atted+0.126
ging+0.121
Neg logit contributions
 park-0.094
 big-0.087
 forest-0.081
 picnic-0.079
 little-0.078
Token and
Token ID290
Feature activation+7.61e-02

Pos logit contributions
ums+0.091
asing+0.091
atted+0.090
 supervision+0.090
olds+0.087
Neg logit contributions
 park-0.042
 picnic-0.036
 forest-0.035
 store-0.035
 hike-0.034
Token sorry
Token ID7926
Feature activation+3.99e-02

Pos logit contributions
ums+0.163
asing+0.161
 supervision+0.155
atted+0.152
olds+0.146
Neg logit contributions
 park-0.157
 picnic-0.140
 forest-0.139
 store-0.138
 ride-0.134
Token.
Token ID13
Feature activation+2.84e-02

Pos logit contributions
ums+0.118
asing+0.118
 supervision+0.117
atted+0.116
ging+0.112
Neg logit contributions
 park-0.046
 picnic-0.038
 forest-0.037
 store-0.037
 ride-0.035
Token They
Token ID1119
Feature activation+1.41e-02

Pos logit contributions
asing+0.083
ums+0.082
 supervision+0.082
atted+0.081
ging+0.078
Neg logit contributions
 park-0.035
 picnic-0.029
 forest-0.028
 store-0.028
 museum-0.027
Token had
Token ID550
Feature activation+6.52e-03

Pos logit contributions
atted+0.029
asing+0.028
 supervision+0.028
ums+0.028
aged+0.027
Neg logit contributions
 park-0.027
 museum-0.025
 picnic-0.025
 forest-0.025
 library-0.024
Token lost
Token ID2626
Feature activation+7.43e-03

Pos logit contributions
 supervision+0.015
asing+0.015
atted+0.015
ums+0.015
aged+0.014
Neg logit contributions
 park-0.011
 museum-0.010
 forest-0.010
 library-0.010
 hike-0.010
Token to
Token ID284
Feature activation+2.88e-02

Pos logit contributions
asing+0.058
ums+0.058
 supervision+0.058
atted+0.057
olds+0.055
Neg logit contributions
 park-0.032
 picnic-0.028
 forest-0.028
 hike-0.027
 store-0.027
Token never
Token ID1239
Feature activation+5.28e-02

Pos logit contributions
ums+0.070
 supervision+0.070
asing+0.069
atted+0.069
olds+0.066
Neg logit contributions
 park-0.046
 forest-0.041
 picnic-0.041
 hike-0.040
 museum-0.040
Token take
Token ID1011
Feature activation+4.45e-02

Pos logit contributions
 supervision+0.130
olds+0.128
asing+0.128
atted+0.124
beh+0.121
Neg logit contributions
 park-0.073
 forest-0.073
 library-0.070
 festival-0.069
 stadium-0.069
Token my
Token ID616
Feature activation+3.73e-02

Pos logit contributions
ums+0.108
asing+0.108
 supervision+0.107
atted+0.106
olds+0.102
Neg logit contributions
 park-0.075
 picnic-0.066
 forest-0.065
 store-0.064
 hike-0.063
Token things
Token ID1243
Feature activation+2.21e-02

Pos logit contributions
atted+0.098
 supervision+0.097
asing+0.096
ums+0.096
olds+0.093
Neg logit contributions
 park-0.048
 store-0.042
 picnic-0.042
 hike-0.041
 festival-0.041
Token without
Token ID1231
Feature activation+1.24e-01

Pos logit contributions
 supervision+0.058
asing+0.058
ums+0.057
atted+0.056
ging+0.055
Neg logit contributions
 park-0.032
 forest-0.028
 picnic-0.028
 hike-0.027
 museum-0.026
Token asking
Token ID4737
Feature activation+4.29e-02

Pos logit contributions
ums+0.260
asing+0.255
atted+0.253
 supervision+0.239
olds+0.237
Neg logit contributions
 park-0.257
 picnic-0.230
 store-0.230
 forest-0.227
 ride-0.222
Token again
Token ID757
Feature activation+4.30e-02

Pos logit contributions
ums+0.125
asing+0.124
 supervision+0.123
atted+0.123
olds+0.118
Neg logit contributions
 park-0.051
 picnic-0.043
 forest-0.042
 store-0.041
 hike-0.040
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
 supervision+0.120
ums+0.119
asing+0.119
atted+0.117
olds+0.115
Neg logit contributions
 park-0.056
 picnic-0.049
 forest-0.048
 hike-0.046
 museum-0.045
Token And
Token ID843
Feature activation+4.61e-02

Pos logit contributions
ums+0.046
asing+0.046
 supervision+0.045
atted+0.045
ging+0.043
Neg logit contributions
 park-0.045
 picnic-0.041
 forest-0.040
 store-0.040
 museum-0.039
Token you
Token ID345
Feature activation+2.56e-02

Pos logit contributions
ums+0.112
asing+0.111
atted+0.111
 supervision+0.110
ging+0.105
Neg logit contributions
 park-0.079
 picnic-0.069
 store-0.068
 forest-0.067
 ride-0.066
Token kept
Token ID4030
Feature activation+3.77e-02

Pos logit contributions
ums+0.018
asing+0.018
 supervision+0.018
atted+0.017
olds+0.017
Neg logit contributions
 park-0.009
 store-0.008
 picnic-0.008
 forest-0.008
 ride-0.007
Token falling
Token ID7463
Feature activation+1.90e-02

Pos logit contributions
ums+0.099
asing+0.099
 supervision+0.095
olds+0.094
atted+0.093
Neg logit contributions
 park-0.061
 forest-0.053
 big-0.053
 store-0.051
 ride-0.050
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
ums+0.048
atted+0.047
asing+0.047
 supervision+0.047
ging+0.046
Neg logit contributions
 park-0.029
 picnic-0.026
 museum-0.025
 library-0.025
 forest-0.025
Token
Token ID198
Feature activation+2.33e-03

Pos logit contributions
ums+0.041
asing+0.041
atted+0.040
 supervision+0.040
olds+0.039
Neg logit contributions
 park-0.022
 picnic-0.019
 forest-0.019
 store-0.018
 hike-0.018
Token
Token ID198
Feature activation+8.92e-03

Pos logit contributions
ums+0.007
asing+0.007
 supervision+0.007
atted+0.007
olds+0.007
Neg logit contributions
 park-0.002
 picnic-0.002
 forest-0.002
 store-0.002
 hike-0.002
TokenL
Token ID43
Feature activation+7.50e-02

Pos logit contributions
ums+0.024
asing+0.024
 supervision+0.024
atted+0.023
olds+0.022
Neg logit contributions
 park-0.013
 picnic-0.011
 forest-0.011
 store-0.011
 museum-0.011
Tokenily
Token ID813
Feature activation+2.21e-02

Pos logit contributions
asing+0.141
ums+0.139
atted+0.137
 supervision+0.135
olds+0.129
Neg logit contributions
 park-0.168
 picnic-0.155
 forest-0.154
 store-0.152
 hike-0.151
Token said
Token ID531
Feature activation+4.95e-03

Pos logit contributions
asing+0.058
atted+0.058
ums+0.058
 supervision+0.057
aged+0.056
Neg logit contributions
 park-0.031
 picnic-0.028
 museum-0.027
 forest-0.027
 hike-0.026
Token,
Token ID11
Feature activation+1.40e-02

Pos logit contributions
atted+0.015
ums+0.015
asing+0.015
 supervision+0.015
olds+0.014
Neg logit contributions
 park-0.005
 forest-0.004
 picnic-0.004
 store-0.004
 library-0.004
Token "
Token ID366
Feature activation+4.15e-02

Pos logit contributions
 supervision+0.030
atted+0.030
asing+0.029
aged+0.029
ging+0.029
Neg logit contributions
 park-0.026
 picnic-0.025
 hike-0.023
 forest-0.023
 museum-0.023
TokenTim
Token ID14967
Feature activation+3.81e-02

Pos logit contributions
atted+0.118
ums+0.114
 supervision+0.113
asing+0.112
olds+0.109
Neg logit contributions
 park-0.063
 forest-0.053
 store-0.052
Hi-0.050
 picnic-0.050
Token doing
Token ID1804
Feature activation+5.85e-03

Pos logit contributions
ums+0.076
asing+0.076
atted+0.075
 supervision+0.074
olds+0.072
Neg logit contributions
 park-0.039
 store-0.033
 forest-0.032
 picnic-0.032
 library-0.031
Token this
Token ID428
Feature activation-5.13e-03

Pos logit contributions
asing+0.015
ums+0.015
atted+0.015
 supervision+0.015
ging+0.014
Neg logit contributions
 park-0.009
 picnic-0.007
 store-0.007
 forest-0.007
 big-0.007
Token."
Token ID526
Feature activation+3.15e-02

Pos logit contributions
 park+0.009
 forest+0.008
 picnic+0.008
 hike+0.007
 museum+0.007
Neg logit contributions
ums-0.012
asing-0.012
 supervision-0.012
atted-0.012
aged-0.012
Token
Token ID198
Feature activation+1.57e-02

Pos logit contributions
asing+0.106
 supervision+0.104
atted+0.103
ums+0.103
ging+0.101
Neg logit contributions
 park-0.027
 store-0.020
 forest-0.019
 picnic-0.018
 museum-0.017
Token
Token ID198
Feature activation+2.27e-02

Pos logit contributions
ums+0.049
asing+0.049
 supervision+0.048
atted+0.048
olds+0.046
Neg logit contributions
 park-0.016
 picnic-0.012
 forest-0.012
 store-0.012
 hike-0.011
TokenL
Token ID43
Feature activation+7.86e-02

Pos logit contributions
ums+0.061
atted+0.060
asing+0.060
 supervision+0.060
aged+0.057
Neg logit contributions
 park-0.034
 store-0.029
 forest-0.028
 picnic-0.028
 library-0.027
Tokenily
Token ID813
Feature activation+2.60e-02

Pos logit contributions
ums+0.160
 supervision+0.158
asing+0.158
atted+0.157
ging+0.148
Neg logit contributions
 park-0.165
 picnic-0.147
 store-0.146
 forest-0.146
 museum-0.143
Token felt
Token ID2936
Feature activation+2.05e-02

Pos logit contributions
ums+0.066
asing+0.065
 supervision+0.065
atted+0.064
ging+0.062
Neg logit contributions
 park-0.041
 picnic-0.036
 store-0.035
 forest-0.035
 ride-0.034
Token sad
Token ID6507
Feature activation+2.28e-02

Pos logit contributions
asing+0.045
ums+0.044
atted+0.043
 supervision+0.043
ams+0.041
Neg logit contributions
 park-0.041
 big-0.039
 store-0.037
 a-0.036
 forest-0.035
Token and
Token ID290
Feature activation+7.09e-02

Pos logit contributions
ums+0.067
asing+0.067
 supervision+0.066
atted+0.066
ging+0.063
Neg logit contributions
 park-0.027
 store-0.022
 forest-0.022
 picnic-0.022
 ride-0.021
Token scared
Token ID12008
Feature activation+2.86e-02

Pos logit contributions
ums+0.129
asing+0.126
 supervision+0.124
atted+0.122
ging+0.116
Neg logit contributions
 park-0.167
 picnic-0.151
 forest-0.150
 store-0.150
 ride-0.146
Token so
Token ID523
Feature activation+3.31e-02

Pos logit contributions
ums+0.104
atted+0.102
asing+0.101
 supervision+0.100
olds+0.098
Neg logit contributions
 park-0.036
 store-0.028
 forest-0.028
 ride-0.027
 picnic-0.027
Token Tim
Token ID5045
Feature activation+2.93e-02

Pos logit contributions
ums+0.094
asing+0.093
 supervision+0.093
atted+0.092
olds+0.089
Neg logit contributions
 park-0.043
 picnic-0.036
 forest-0.035
 store-0.035
 hike-0.034
Token could
Token ID714
Feature activation-1.24e-02

Pos logit contributions
ums+0.078
asing+0.077
 supervision+0.077
atted+0.076
olds+0.073
Neg logit contributions
 park-0.043
 picnic-0.037
 forest-0.037
 store-0.036
 ride-0.035
Token catch
Token ID4929
Feature activation+2.95e-04

Pos logit contributions
 park+0.019
 forest+0.017
 museum+0.016
 picnic+0.016
 library+0.016
Neg logit contributions
ums-0.031
asing-0.031
 supervision-0.031
atted-0.031
othe-0.030
Token it
Token ID340
Feature activation-9.97e-03

Pos logit contributions
ums+0.001
asing+0.001
 supervision+0.001
atted+0.001
olds+0.001
Neg logit contributions
 park-0.000
 forest-0.000
 store-0.000
 picnic-0.000
 ride-0.000
Token without
Token ID1231
Feature activation+1.01e-01

Pos logit contributions
 park+0.015
 picnic+0.014
 forest+0.013
 museum+0.013
 hike+0.013
Neg logit contributions
asing-0.025
ums-0.025
 supervision-0.024
atted-0.024
othe-0.023
Token breaking
Token ID7163
Feature activation+3.29e-02

Pos logit contributions
ums+0.264
asing+0.258
atted+0.254
olds+0.243
 supervision+0.238
Neg logit contributions
 park-0.160
 store-0.140
 ride-0.137
 forest-0.136
 picnic-0.135
Token it
Token ID340
Feature activation+8.09e-03

Pos logit contributions
ums+0.094
asing+0.094
atted+0.092
 supervision+0.091
olds+0.089
Neg logit contributions
 park-0.042
 store-0.035
 forest-0.035
 picnic-0.033
 big-0.032
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
asing+0.022
 supervision+0.022
ums+0.022
atted+0.021
aged+0.021
Neg logit contributions
 park-0.011
 picnic-0.010
 forest-0.010
 museum-0.009
 library-0.009
Token Tim
Token ID5045
Feature activation+2.58e-02

Pos logit contributions
ums+0.032
asing+0.032
atted+0.032
 supervision+0.032
olds+0.031
Neg logit contributions
 park-0.020
 picnic-0.017
 forest-0.017
 store-0.017
 hike-0.016
Token was
Token ID373
Feature activation+7.21e-03

Pos logit contributions
ums+0.072
 supervision+0.071
asing+0.071
atted+0.070
ging+0.068
Neg logit contributions
 park-0.035
 forest-0.029
 picnic-0.029
 store-0.029
 ride-0.028
Token relieved
Token ID22598
Feature activation+2.94e-02

Pos logit contributions
asing+0.030
ums+0.030
 supervision+0.030
atted+0.030
olds+0.029
Neg logit contributions
 park-0.022
 picnic-0.019
 forest-0.019
 store-0.018
 ride-0.018
Token and
Token ID290
Feature activation+5.34e-02

Pos logit contributions
asing+0.090
ums+0.090
 supervision+0.089
atted+0.088
olds+0.085
Neg logit contributions
 park-0.032
 picnic-0.026
 forest-0.026
 store-0.025
 ride-0.024
Token thanked
Token ID26280
Feature activation+4.01e-02

Pos logit contributions
ums+0.136
asing+0.135
 supervision+0.131
atted+0.127
olds+0.126
Neg logit contributions
 park-0.089
 picnic-0.078
 forest-0.077
 store-0.077
 ride-0.075
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
ums+0.109
asing+0.107
 supervision+0.107
atted+0.107
olds+0.103
Neg logit contributions
 park-0.055
 picnic-0.048
 forest-0.046
 store-0.046
 hike-0.045
Token squirrel
Token ID33039
Feature activation+2.18e-02

Pos logit contributions
 supervision+0.030
ums+0.029
atted+0.029
 wounds+0.029
olds+0.028
Neg logit contributions
 hike-0.010
 park-0.010
 picnic-0.009
 theater-0.009
 stadium-0.008
Token for
Token ID329
Feature activation+5.08e-02

Pos logit contributions
asing+0.056
ums+0.056
 supervision+0.056
atted+0.055
ging+0.053
Neg logit contributions
 park-0.033
 picnic-0.029
 store-0.028
 forest-0.028
 ride-0.028
Token his
Token ID465
Feature activation+2.16e-02

Pos logit contributions
ums+0.131
asing+0.130
atted+0.128
 supervision+0.124
olds+0.122
Neg logit contributions
 park-0.084
 store-0.073
 forest-0.072
 picnic-0.070
 ride-0.070
Token kindness
Token ID23887
Feature activation+3.28e-02

Pos logit contributions
 supervision+0.047
ums+0.045
asing+0.045
atted+0.045
olds+0.043
Neg logit contributions
 park-0.041
 forest-0.037
 store-0.036
 museum-0.036
 library-0.036
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
ums+0.096
asing+0.095
 supervision+0.095
atted+0.094
olds+0.091
Neg logit contributions
 park-0.039
 picnic-0.032
 forest-0.032
 store-0.031
 hike-0.030
Token From
Token ID3574
Feature activation+3.58e-02

Pos logit contributions
ums+0.056
asing+0.055
 supervision+0.055
atted+0.055
olds+0.053
Neg logit contributions
 park-0.038
 picnic-0.033
 forest-0.032
 store-0.032
 hike-0.032
Token then
Token ID788
Feature activation+5.27e-03

Pos logit contributions
atted+0.078
asing+0.077
ums+0.077
 supervision+0.076
ging+0.074
Neg logit contributions
 park-0.066
 picnic-0.061
 hike-0.059
 museum-0.059
 store-0.059
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
 park+0.001
 picnic+0.001
 hike+0.001
 forest+0.001
 store+0.001
Neg logit contributions
ums-0.002
atted-0.002
asing-0.002
ging-0.002
 supervision-0.002
Token She
Token ID1375
Feature activation-1.78e-02

Pos logit contributions
ums+0.010
atted+0.010
asing+0.010
 supervision+0.009
olds+0.009
Neg logit contributions
 park-0.007
 picnic-0.006
 hike-0.006
 ride-0.006
 forest-0.006
Token felt
Token ID2936
Feature activation+1.25e-02

Pos logit contributions
 park+0.027
 picnic+0.024
 forest+0.024
 store+0.023
 museum+0.023
Neg logit contributions
asing-0.045
atted-0.045
ums-0.045
 supervision-0.044
aged-0.043
Token so
Token ID523
Feature activation+3.71e-02

Pos logit contributions
asing+0.037
ums+0.037
 supervision+0.037
atted+0.036
olds+0.035
Neg logit contributions
 park-0.016
 big-0.013
 store-0.013
 forest-0.012
 picnic-0.012
Token peaceful
Token ID12309
Feature activation+1.83e-02

Pos logit contributions
 supervision+0.093
asing+0.091
ums+0.091
atted+0.088
olds+0.084
Neg logit contributions
 park-0.064
 big-0.059
 ride-0.054
 peek-0.054
 store-0.053
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
ums+0.045
asing+0.045
ging+0.045
 supervision+0.045
atted+0.045
Neg logit contributions
 park-0.027
 picnic-0.025
 museum-0.024
 hike-0.024
 forest-0.024
Token She
Token ID1375
Feature activation-9.54e-03

Pos logit contributions
ums+0.009
asing+0.009
atted+0.009
 supervision+0.009
olds+0.009
Neg logit contributions
 park-0.006
 picnic-0.006
 forest-0.005
 store-0.005
 ride-0.005
Token lay
Token ID3830
Feature activation+1.65e-02

Pos logit contributions
 park+0.016
 picnic+0.014
 museum+0.014
 forest+0.014
 hike+0.014
Neg logit contributions
atted-0.023
asing-0.023
ums-0.022
 supervision-0.022
aged-0.022
Token down
Token ID866
Feature activation-1.68e-02

Pos logit contributions
ums+0.045
asing+0.044
 supervision+0.044
atted+0.044
olds+0.042
Neg logit contributions
 park-0.024
 picnic-0.020
 forest-0.020
 store-0.020
 ride-0.019
Token in
Token ID287
Feature activation+1.41e-03

Pos logit contributions
 park+0.024
 picnic+0.023
 hike+0.022
 museum+0.022
 forest+0.021
Neg logit contributions
atted-0.043
ums-0.042
asing-0.041
ging-0.041
aged-0.040
Token the
Token ID262
Feature activation-2.83e-02

Pos logit contributions
 supervision+0.003
ums+0.003
atted+0.003
asing+0.003
ging+0.003
Neg logit contributions
 park-0.002
 hike-0.002
 picnic-0.002
 museum-0.002
 forest-0.002
Token!"
Token ID2474
Feature activation+3.34e-02

Pos logit contributions
ums+0.054
asing+0.054
 supervision+0.054
atted+0.054
olds+0.052
Neg logit contributions
 park-0.019
 picnic-0.015
 forest-0.015
 store-0.015
 hike-0.014
Token she
Token ID673
Feature activation+1.36e-02

Pos logit contributions
asing+0.092
ums+0.091
 supervision+0.090
atted+0.089
ging+0.086
Neg logit contributions
 park-0.048
 picnic-0.041
 store-0.040
 forest-0.040
 ride-0.038
Token says
Token ID1139
Feature activation+2.08e-02

Pos logit contributions
ums+0.025
asing+0.025
 supervision+0.025
atted+0.025
ging+0.023
Neg logit contributions
 park-0.031
 picnic-0.028
 store-0.028
 forest-0.028
 hike-0.027
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
asing+0.069
ums+0.069
 supervision+0.068
atted+0.068
leness+0.067
Neg logit contributions
 park-0.014
 ride-0.011
 store-0.010
 big-0.010
 peek-0.010
Token She
Token ID1375
Feature activation+3.47e-03

Pos logit contributions
asing+0.058
 supervision+0.058
ums+0.057
atted+0.057
ging+0.055
Neg logit contributions
 park-0.027
 picnic-0.022
 store-0.022
 forest-0.022
 ride-0.021
Token puts
Token ID7584
Feature activation+1.03e-02

Pos logit contributions
ums+0.007
asing+0.007
atted+0.007
 supervision+0.007
olds+0.007
Neg logit contributions
 park-0.007
 picnic-0.007
 forest-0.007
 store-0.006
 hike-0.006
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
asing+0.033
ums+0.033
 supervision+0.032
atted+0.032
olds+0.031
Neg logit contributions
 park-0.010
 store-0.008
 big-0.008
 picnic-0.008
 forest-0.007
Token cards
Token ID4116
Feature activation+3.03e-03

Pos logit contributions
 supervision+0.027
ums+0.027
asing+0.027
olds+0.025
cks+0.025
Neg logit contributions
 park-0.019
 hike-0.019
 forest-0.018
 festival-0.017
 picnic-0.017
Token back
Token ID736
Feature activation+9.82e-03

Pos logit contributions
ums+0.009
asing+0.009
 supervision+0.008
atted+0.008
olds+0.008
Neg logit contributions
 park-0.004
 picnic-0.003
 store-0.003
 forest-0.003
 ride-0.003
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
asing+0.026
atted+0.026
ums+0.025
 supervision+0.025
ging+0.025
Neg logit contributions
 park-0.014
 picnic-0.013
 hike-0.012
 forest-0.012
 museum-0.012
Token
Token ID198
Feature activation+4.42e-03

Pos logit contributions
asing+0.053
ums+0.053
 supervision+0.052
atted+0.052
ging+0.050
Neg logit contributions
 park-0.023
 picnic-0.019
 forest-0.018
 store-0.018
 ride-0.017
Token
Token ID198
Feature activation+2.92e-02

Pos logit contributions
ums+0.125
asing+0.124
atted+0.123
 supervision+0.123
olds+0.119
Neg logit contributions
 park-0.043
 picnic-0.035
 forest-0.034
 store-0.034
 hike-0.033
Token
Token ID198
Feature activation+3.34e-02

Pos logit contributions
asing+0.099
 supervision+0.098
ums+0.097
atted+0.096
ging+0.094
Neg logit contributions
 park-0.025
 store-0.017
 forest-0.017
 picnic-0.016
 big-0.016
TokenL
Token ID43
Feature activation+8.49e-02

Pos logit contributions
 supervision+0.089
atted+0.088
ums+0.088
asing+0.087
aged+0.083
Neg logit contributions
 park-0.053
 store-0.045
 forest-0.043
 picnic-0.042
 ride-0.041
Tokenily
Token ID813
Feature activation+3.33e-02

Pos logit contributions
ums+0.169
asing+0.167
 supervision+0.166
atted+0.165
ging+0.156
Neg logit contributions
 park-0.182
 picnic-0.164
 forest-0.162
 store-0.162
 museum-0.158
Token smiled
Token ID13541
Feature activation+4.42e-02

Pos logit contributions
ums+0.077
asing+0.076
atted+0.076
 supervision+0.075
olds+0.072
Neg logit contributions
 park-0.060
 picnic-0.053
 forest-0.053
 store-0.052
 hike-0.051
Token.
Token ID13
Feature activation+3.71e-02

Pos logit contributions
ums+0.126
asing+0.125
atted+0.124
 supervision+0.124
olds+0.119
Neg logit contributions
 park-0.056
 picnic-0.047
 forest-0.046
 store-0.045
 hike-0.044
Token She
Token ID1375
Feature activation+1.31e-02

Pos logit contributions
ums+0.093
atted+0.091
asing+0.091
 supervision+0.090
olds+0.089
Neg logit contributions
 park-0.058
 picnic-0.055
 hike-0.052
 forest-0.051
 ride-0.051
Token said
Token ID531
Feature activation+2.30e-02

Pos logit contributions
atted+0.029
asing+0.029
ums+0.029
 supervision+0.028
aged+0.027
Neg logit contributions
 park-0.023
 picnic-0.021
 forest-0.021
 museum-0.021
 hike-0.021
Token,
Token ID11
Feature activation+1.96e-02

Pos logit contributions
ums+0.073
asing+0.072
atted+0.072
 supervision+0.072
olds+0.069
Neg logit contributions
 park-0.022
 picnic-0.018
 forest-0.017
 store-0.017
 hike-0.016
Token "
Token ID366
Feature activation+4.60e-02

Pos logit contributions
ums+0.053
asing+0.052
 supervision+0.052
atted+0.052
olds+0.050
Neg logit contributions
 park-0.028
 picnic-0.024
 forest-0.024
 store-0.023
 hike-0.023
TokenOK
Token ID11380
Feature activation+2.93e-02

Pos logit contributions
atted+0.124
ums+0.122
 supervision+0.121
asing+0.119
aged+0.116
Neg logit contributions
 park-0.073
 store-0.062
 ride-0.060
 forest-0.059
Hi-0.059
Token the
Token ID262
Feature activation-3.45e-02

Pos logit contributions
 picnic+0.045
 park+0.044
 party+0.043
 hike+0.041
 festival+0.041
Neg logit contributions
atted-0.068
ging-0.066
 supervision-0.066
ums-0.066
aged-0.065
Token tall
Token ID7331
Feature activation-3.74e-02

Pos logit contributions
 park+0.042
 picnic+0.038
 hike+0.037
 ride+0.037
 museum+0.036
Neg logit contributions
ums-0.095
asing-0.093
atted-0.093
 supervision-0.092
ging-0.090
Token grass
Token ID8701
Feature activation-2.91e-02

Pos logit contributions
 park+0.044
 picnic+0.041
 ride+0.040
 hike+0.039
 nap+0.038
Neg logit contributions
ums-0.099
atted-0.099
 supervision-0.097
asing-0.096
aged-0.094
Token,
Token ID11
Feature activation-1.84e-04

Pos logit contributions
 park+0.038
 picnic+0.036
 hike+0.034
 library+0.032
 ride+0.032
Neg logit contributions
ums-0.079
atted-0.077
asing-0.077
 supervision-0.074
aged-0.073
Token feeling
Token ID4203
Feature activation+2.86e-02

Pos logit contributions
 park+0.000
 picnic+0.000
 forest+0.000
 store+0.000
 museum+0.000
Neg logit contributions
asing-0.000
ums-0.000
atted-0.000
 supervision-0.000
ging-0.000
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
asing+0.094
ums+0.093
 supervision+0.092
atted+0.090
aned+0.089
Neg logit contributions
 park-0.027
 big-0.025
 store-0.022
 a-0.021
 there-0.020
Token warm
Token ID5814
Feature activation+3.77e-02

Pos logit contributions
ums+0.028
asing+0.028
 supervision+0.028
atted+0.028
ging+0.027
Neg logit contributions
 park-0.016
 picnic-0.014
 forest-0.013
 store-0.013
 hike-0.013
Token sun
Token ID4252
Feature activation+3.65e-02

Pos logit contributions
asing+0.117
 supervision+0.114
ums+0.111
olds+0.110
atted+0.109
Neg logit contributions
 park-0.047
 forest-0.038
 picnic-0.038
 store-0.037
 ride-0.036
Token on
Token ID319
Feature activation+3.47e-02

Pos logit contributions
asing+0.109
ums+0.108
 supervision+0.108
atted+0.107
olds+0.104
Neg logit contributions
 park-0.042
 picnic-0.035
 forest-0.034
 store-0.034
 ride-0.034
Token his
Token ID465
Feature activation-2.31e-02

Pos logit contributions
ums+0.089
atted+0.085
 supervision+0.085
asing+0.084
ging+0.083
Neg logit contributions
 park-0.051
 picnic-0.046
 library-0.044
 hike-0.044
 forest-0.044
Token back
Token ID736
Feature activation-5.29e-03

Pos logit contributions
 park+0.049
 picnic+0.045
 forest+0.043
 big+0.043
 ride+0.042
Neg logit contributions
asing-0.051
ums-0.048
atted-0.048
 supervision-0.048
olds-0.047
Token€
Token ID26391
Feature activation+3.44e-02

Pos logit contributions
ums+0.136
atted+0.134
 supervision+0.133
asing+0.133
aged+0.126
Neg logit contributions
 park-0.136
 store-0.121
 picnic-0.120
 ride-0.120
 forest-0.120
Token™
Token ID8151
Feature activation+2.89e-02

Pos logit contributions
ums+0.077
asing+0.074
 supervision+0.074
atted+0.073
olds+0.070
Neg logit contributions
 park-0.065
 picnic-0.058
 forest-0.058
 store-0.057
 museum-0.056
Tokenll
Token ID297
Feature activation+3.31e-02

Pos logit contributions
ums+0.059
asing+0.059
 supervision+0.058
atted+0.058
olds+0.054
Neg logit contributions
 park-0.059
 picnic-0.053
 store-0.053
 forest-0.053
 ride-0.051
Token try
Token ID1949
Feature activation-3.50e-02

Pos logit contributions
asing+0.087
ums+0.087
atted+0.086
 supervision+0.086
olds+0.082
Neg logit contributions
 park-0.050
 picnic-0.042
 store-0.042
 ride-0.042
 forest-0.041
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
 park+0.046
 picnic+0.041
 forest+0.040
 hike+0.039
 museum+0.038
Neg logit contributions
asing-0.096
 supervision-0.096
ums-0.095
atted-0.095
ging-0.092
Token rescue
Token ID9992
Feature activation-1.40e-02

Pos logit contributions
 park+0.019
 library+0.018
 museum+0.018
 forest+0.018
 festival+0.017
Neg logit contributions
 supervision-0.029
ums-0.029
asing-0.028
ck-0.027
aged-0.027
Token it
Token ID340
Feature activation-1.51e-02

Pos logit contributions
 park+0.020
 picnic+0.017
 store+0.017
 forest+0.017
 ride+0.017
Neg logit contributions
ums-0.037
asing-0.037
atted-0.037
 supervision-0.037
aged-0.035
Token,
Token ID11
Feature activation+3.90e-03

Pos logit contributions
 park+0.023
 picnic+0.021
 museum+0.020
 forest+0.020
 library+0.020
Neg logit contributions
asing-0.037
 supervision-0.037
ums-0.035
atted-0.035
ging-0.034
Token but
Token ID475
Feature activation+2.66e-02

Pos logit contributions
 supervision+0.010
ums+0.010
asing+0.010
atted+0.010
olds+0.009
Neg logit contributions
 park-0.006
 forest-0.005
 picnic-0.005
 hike-0.005
 museum-0.005
Token I
Token ID314
Feature activation-2.34e-03

Pos logit contributions
 supervision+0.065
asing+0.064
ums+0.064
atted+0.063
olds+0.061
Neg logit contributions
 park-0.042
 picnic-0.038
 forest-0.037
 hike-0.036
 museum-0.035
Token need
Token ID761
Feature activation-1.68e-02

Pos logit contributions
 park+0.004
 picnic+0.004
 forest+0.004
 store+0.004
 museum+0.004
Neg logit contributions
asing-0.005
ums-0.005
 supervision-0.005
atted-0.005
olds-0.005
Token strange
Token ID6283
Feature activation+4.48e-03

Pos logit contributions
 park+0.029
 forest+0.025
 picnic+0.025
 hike+0.025
 library+0.024
Neg logit contributions
ums-0.053
asing-0.053
atted-0.053
 supervision-0.053
ging-0.052
Token.
Token ID13
Feature activation+9.62e-03

Pos logit contributions
atted+0.011
 supervision+0.011
asing+0.011
ging+0.011
ums+0.011
Neg logit contributions
 park-0.006
 picnic-0.006
 forest-0.005
 hike-0.005
 museum-0.005
Token The
Token ID383
Feature activation-7.52e-03

Pos logit contributions
ums+0.024
atted+0.023
 supervision+0.023
asing+0.023
olds+0.023
Neg logit contributions
 park-0.015
 picnic-0.014
 hike-0.014
 ride-0.013
 forest-0.013
Token flowers
Token ID12734
Feature activation+2.46e-03

Pos logit contributions
 park+0.013
 picnic+0.012
 hike+0.012
 library+0.011
 ride+0.011
Neg logit contributions
ums-0.017
asing-0.017
 supervision-0.017
atted-0.017
aged-0.017
Token she
Token ID673
Feature activation+8.49e-03

Pos logit contributions
ums+0.007
asing+0.007
 supervision+0.007
atted+0.007
olds+0.006
Neg logit contributions
 park-0.003
 picnic-0.003
 forest-0.003
 store-0.003
 hike-0.003
Token had
Token ID550
Feature activation-1.10e-02

Pos logit contributions
asing+0.024
ums+0.024
atted+0.024
 supervision+0.024
aged+0.023
Neg logit contributions
 park-0.010
 picnic-0.009
 forest-0.009
 store-0.009
 hike-0.008
Token picked
Token ID6497
Feature activation+3.10e-04

Pos logit contributions
 park+0.020
 picnic+0.018
 forest+0.018
 museum+0.018
 hike+0.018
Neg logit contributions
atted-0.024
asing-0.024
 supervision-0.024
ums-0.024
aged-0.023
Token are
Token ID389
Feature activation+2.17e-02

Pos logit contributions
ums+0.001
asing+0.001
 supervision+0.001
atted+0.001
olds+0.001
Neg logit contributions
 park-0.000
 picnic-0.000
 forest-0.000
 store-0.000
 hike-0.000
Token all
Token ID477
Feature activation+2.77e-02

Pos logit contributions
ums+0.057
 supervision+0.057
asing+0.057
atted+0.057
olds+0.054
Neg logit contributions
 park-0.030
 picnic-0.026
 forest-0.026
 store-0.025
 hike-0.025
Token gone
Token ID3750
Feature activation+3.13e-03

Pos logit contributions
ums+0.073
asing+0.072
 supervision+0.072
atted+0.072
olds+0.069
Neg logit contributions
 park-0.041
 picnic-0.036
 forest-0.035
 store-0.035
 hike-0.034
Token!
Token ID0
Feature activation+1.05e-02

Pos logit contributions
 supervision+0.008
ums+0.008
atted+0.008
asing+0.008
ging+0.008
Neg logit contributions
 park-0.004
 picnic-0.004
 hike-0.004
 museum-0.004
 forest-0.004
Token play
Token ID711
Feature activation-5.45e-02

Pos logit contributions
 forest+0.026
 woods+0.024
 park+0.024
 cabin+0.023
 museum+0.023
Neg logit contributions
ums-0.056
olds-0.052
 supervision-0.052
asing-0.052
ases-0.050
Token with
Token ID351
Feature activation-6.85e-03

Pos logit contributions
 arrived+0.092
 library+0.088
 hike+0.084
 museum+0.083
 giant+0.083
Neg logit contributions
asing-0.113
ums-0.104
ins-0.102
 themselves-0.100
aged-0.100
Token toys
Token ID14958
Feature activation-3.85e-02

Pos logit contributions
 park+0.013
 hike+0.012
 arrived+0.012
 ride+0.012
 forest+0.012
Neg logit contributions
atted-0.013
ums-0.013
 supervision-0.013
asing-0.013
olds-0.013
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 park+0.056
 hike+0.054
 picnic+0.054
 forest+0.051
 arrived+0.051
Neg logit contributions
asing-0.093
 supervision-0.088
ums-0.088
olds-0.086
atted-0.085
Token books
Token ID3835
Feature activation-1.55e-02

Pos logit contributions
 supervision+0.034
asing+0.032
olds+0.032
 wounds+0.031
aged+0.031
Neg logit contributions
 park-0.017
 forest-0.016
 hike-0.014
 store-0.014
 festival-0.014
Token.
Token ID13
Feature activation-8.84e-03

Pos logit contributions
 park+0.024
 forest+0.023
 picnic+0.023
 hike+0.022
 museum+0.021
Neg logit contributions
asing-0.035
ums-0.034
cks-0.033
olds-0.033
 supervision-0.033
Token One
Token ID1881
Feature activation-6.12e-02

Pos logit contributions
 park+0.008
 forest+0.008
 picnic+0.007
 hike+0.007
 store+0.007
Neg logit contributions
ums-0.026
olds-0.026
atted-0.026
 supervision-0.025
asing-0.025
Token day
Token ID1110
Feature activation-5.01e-02

Pos logit contributions
 park+0.078
 picnic+0.073
 forest+0.071
 hike+0.070
 store+0.066
Neg logit contributions
ums-0.157
asing-0.152
atted-0.152
olds-0.151
 supervision-0.147
Token,
Token ID11
Feature activation-2.79e-02

Pos logit contributions
 park+0.064
 forest+0.059
 store+0.054
 library+0.052
 hike+0.052
Neg logit contributions
atted-0.141
ums-0.140
olds-0.136
asing-0.135
aged-0.135
Token they
Token ID484
Feature activation-3.16e-02

Pos logit contributions
 park+0.060
 forest+0.058
 store+0.054
 hike+0.053
 museum+0.053
Neg logit contributions
ums-0.053
atted-0.052
asing-0.051
 supervision-0.050
olds-0.049
Token find
Token ID1064
Feature activation-5.38e-02

Pos logit contributions
 park+0.050
 forest+0.047
 picnic+0.047
 library+0.045
 tr+0.045
Neg logit contributions
asing-0.067
 supervision-0.067
ums-0.066
olds-0.065
atted-0.062
Token she
Token ID673
Feature activation-1.29e-02

Pos logit contributions
 park+0.044
 picnic+0.039
 forest+0.038
 hike+0.037
 store+0.037
Neg logit contributions
ums-0.074
asing-0.073
atted-0.072
 supervision-0.072
olds-0.069
Token loved
Token ID6151
Feature activation-2.82e-02

Pos logit contributions
 park+0.020
 picnic+0.017
 forest+0.017
 store+0.016
 hike+0.016
Neg logit contributions
ums-0.033
asing-0.033
atted-0.033
 supervision-0.033
olds-0.031
Token to
Token ID284
Feature activation-1.75e-02

Pos logit contributions
 park+0.028
 picnic+0.022
 forest+0.021
 store+0.021
 hike+0.020
Neg logit contributions
ums-0.088
asing-0.087
 supervision-0.087
atted-0.087
olds-0.084
Token take
Token ID1011
Feature activation-1.88e-02

Pos logit contributions
 park+0.015
 forest+0.012
 picnic+0.012
 hike+0.011
 museum+0.011
Neg logit contributions
ums-0.055
asing-0.054
atted-0.054
 supervision-0.053
olds-0.052
Token pictures
Token ID5986
Feature activation-3.36e-02

Pos logit contributions
 park+0.027
 picnic+0.023
 forest+0.023
 store+0.022
 ride+0.022
Neg logit contributions
ums-0.050
asing-0.050
 supervision-0.050
atted-0.050
olds-0.048
Token with
Token ID351
Feature activation+1.56e-03

Pos logit contributions
 park+0.052
 picnic+0.047
 hike+0.046
 forest+0.046
 museum+0.044
Neg logit contributions
ums-0.083
asing-0.083
 supervision-0.078
olds-0.078
atted-0.077
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
ums+0.004
asing+0.004
 supervision+0.004
atted+0.004
olds+0.004
Neg logit contributions
 park-0.003
 picnic-0.002
 hike-0.002
 forest-0.002
 ride-0.002
Token One
Token ID1881
Feature activation-6.98e-02

Pos logit contributions
 picnic+0.023
 hike+0.023
 arrived+0.022
 park+0.022
 camp+0.021
Neg logit contributions
ums-0.051
olds-0.047
atted-0.047
 supervision-0.047
asing-0.046
Token day
Token ID1110
Feature activation-5.79e-02

Pos logit contributions
 park+0.084
 picnic+0.072
 forest+0.072
 hike+0.070
 store+0.069
Neg logit contributions
ums-0.192
asing-0.189
atted-0.186
 supervision-0.184
aged-0.180
Token,
Token ID11
Feature activation-3.91e-02

Pos logit contributions
 park+0.069
 forest+0.058
 picnic+0.056
 hike+0.055
 store+0.055
Neg logit contributions
atted-0.168
ums-0.168
asing-0.164
aged-0.162
 supervision-0.162
Token she
Token ID673
Feature activation-4.59e-02

Pos logit contributions
 park+0.068
 forest+0.065
 hike+0.064
 museum+0.062
 picnic+0.061
Neg logit contributions
othe-0.083
ums-0.083
atted-0.083
aged-0.081
ging-0.081
Token center
Token ID3641
Feature activation-2.55e-02

Pos logit contributions
 park+0.014
 picnic+0.014
 hike+0.014
 ride+0.013
 forest+0.013
Neg logit contributions
 supervision-0.041
ums-0.041
asing-0.040
atted-0.040
olds-0.039
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
 park+0.036
 picnic+0.032
 forest+0.031
 hike+0.030
 store+0.029
Neg logit contributions
ums-0.068
asing-0.068
 supervision-0.067
atted-0.067
olds-0.065
Token It
Token ID632
Feature activation-1.32e-03

Pos logit contributions
ums+0.059
asing+0.058
 supervision+0.058
atted+0.058
olds+0.056
Neg logit contributions
 park-0.041
 picnic-0.037
 forest-0.036
 store-0.035
 ride-0.035
Token was
Token ID373
Feature activation+8.51e-03

Pos logit contributions
 park+0.002
 picnic+0.002
 museum+0.001
 forest+0.001
 library+0.001
Neg logit contributions
atted-0.004
asing-0.003
ums-0.003
 supervision-0.003
aged-0.003
Token a
Token ID257
Feature activation-3.81e-02

Pos logit contributions
ums+0.025
 supervision+0.025
atted+0.025
asing+0.025
aged+0.024
Neg logit contributions
 park-0.009
 picnic-0.008
 forest-0.008
 hike-0.007
 museum-0.007
Token big
Token ID1263
Feature activation-3.90e-03

Pos logit contributions
 hike+0.044
 park+0.042
 Under+0.041
 One+0.039
 forest+0.038
Neg logit contributions
 supervision-0.093
ging-0.089
ums-0.089
edient-0.086
othe-0.086
Token building
Token ID2615
Feature activation-2.44e-02

Pos logit contributions
 hike+0.004
 park+0.004
 Under+0.004
 picnic+0.003
 peek+0.003
Neg logit contributions
 supervision-0.010
ums-0.010
ging-0.010
atted-0.010
asing-0.010
Token with
Token ID351
Feature activation+2.43e-02

Pos logit contributions
 park+0.033
 picnic+0.030
 forest+0.029
 hike+0.028
 store+0.027
Neg logit contributions
ums-0.065
asing-0.065
 supervision-0.064
atted-0.064
olds-0.062
Token many
Token ID867
Feature activation+4.18e-03

Pos logit contributions
ums+0.067
 supervision+0.067
asing+0.066
atted+0.066
olds+0.065
Neg logit contributions
 park-0.029
 hike-0.026
 picnic-0.026
 forest-0.025
 ride-0.024
Token boxes
Token ID10559
Feature activation-1.21e-02

Pos logit contributions
ums+0.010
asing+0.010
atted+0.010
 supervision+0.010
olds+0.009
Neg logit contributions
 park-0.007
 picnic-0.006
 hike-0.006
 forest-0.006
 ride-0.006
Token and
Token ID290
Feature activation+3.49e-02

Pos logit contributions
 park+0.016
 picnic+0.014
 forest+0.014
 hike+0.013
 museum+0.013
Neg logit contributions
ums-0.033
asing-0.033
 supervision-0.033
atted-0.033
ging-0.032
Token her
Token ID607
Feature activation+8.79e-03

Pos logit contributions
ums+0.129
atted+0.127
 supervision+0.127
asing+0.127
olds+0.122
Neg logit contributions
 park-0.083
 picnic-0.074
 store-0.072
 forest-0.071
 hike-0.071
Token new
Token ID649
Feature activation+2.39e-02

Pos logit contributions
ums+0.024
 supervision+0.024
asing+0.024
atted+0.024
aged+0.023
Neg logit contributions
 park-0.012
 picnic-0.010
 store-0.010
 forest-0.010
 hike-0.010
Token friend
Token ID1545
Feature activation+2.95e-03

Pos logit contributions
ums+0.059
atted+0.059
 supervision+0.058
asing+0.057
aged+0.055
Neg logit contributions
 park-0.036
 store-0.032
 museum-0.032
 hike-0.032
 picnic-0.032
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
 supervision+0.008
asing+0.008
ums+0.008
atted+0.008
ging+0.007
Neg logit contributions
 park-0.004
 museum-0.004
 picnic-0.004
 hike-0.004
 forest-0.003
Token<|endoftext|>
Token ID50256
Feature activation-7.68e-02

Pos logit contributions
asing+0.033
 supervision+0.033
atted+0.032
ums+0.032
ging+0.032
Neg logit contributions
 park-0.010
 forest-0.007
 picnic-0.007
 store-0.007
 library-0.007
TokenOnce
Token ID7454
Feature activation-7.20e-02

Pos logit contributions
 picnic+0.128
 hike+0.122
 party+0.118
 park+0.117
 walk+0.116
Neg logit contributions
ums-0.165
olds-0.154
cks-0.152
atted-0.150
asing-0.145
Token upon
Token ID2402
Feature activation-7.84e-02

Pos logit contributions
 park+0.096
 picnic+0.083
 forest+0.081
 store+0.080
 hike+0.077
Neg logit contributions
ums-0.198
atted-0.196
asing-0.196
 supervision-0.195
olds-0.188
Token a
Token ID257
Feature activation-1.04e-01

Pos logit contributions
 park+0.097
 hike+0.096
 forest+0.095
 museum+0.094
 festival+0.093
Neg logit contributions
ums-0.180
asing-0.179
cks-0.179
aged-0.177
olds-0.176
Token time
Token ID640
Feature activation-5.98e-02

Pos logit contributions
 park+0.097
 store+0.083
 forest+0.080
 museum+0.079
 library+0.074
Neg logit contributions
ums-0.317
atted-0.312
asing-0.311
 supervision-0.307
olds-0.296
Token,
Token ID11
Feature activation-8.14e-02

Pos logit contributions
 park+0.073
 store+0.061
 picnic+0.061
 forest+0.061
 library+0.059
Neg logit contributions
ums-0.170
atted-0.168
asing-0.165
 supervision-0.163
olds-0.160
Token there
Token ID612
Feature activation-7.78e-02

Pos logit contributions
 park+0.099
 library+0.091
 hike+0.091
 picnic+0.090
 museum+0.086
Neg logit contributions
ging-0.222
aged-0.219
atted-0.219
asing-0.216
cks-0.216
Token<|endoftext|>
Token ID50256
Feature activation-8.57e-02

Pos logit contributions
asing+0.043
 supervision+0.043
ums+0.042
atted+0.042
ging+0.041
Neg logit contributions
 park-0.015
 picnic-0.012
 forest-0.011
 store-0.011
 library-0.011
TokenOnce
Token ID7454
Feature activation-7.20e-02

Pos logit contributions
 picnic+0.156
 hike+0.148
 party+0.147
 race+0.147
 festival+0.145
Neg logit contributions
ums-0.163
olds-0.150
cks-0.150
atted-0.148
ging-0.143
Token upon
Token ID2402
Feature activation-8.91e-02

Pos logit contributions
 park+0.096
 picnic+0.083
 forest+0.081
 store+0.080
 hike+0.077
Neg logit contributions
ums-0.198
atted-0.196
asing-0.196
 supervision-0.195
olds-0.188
Token a
Token ID257
Feature activation-1.14e-01

Pos logit contributions
 park+0.108
 forest+0.107
 hike+0.104
 museum+0.104
 festival+0.103
Neg logit contributions
cks-0.207
ums-0.205
aged-0.204
asing-0.203
olds-0.203
Token time
Token ID640
Feature activation-6.53e-02

Pos logit contributions
 park+0.112
 store+0.099
 forest+0.096
 museum+0.093
 library+0.089
Neg logit contributions
ums-0.336
atted-0.331
asing-0.328
 supervision-0.323
olds-0.315
Token,
Token ID11
Feature activation-8.79e-02

Pos logit contributions
 park+0.082
 picnic+0.070
 store+0.069
 forest+0.069
 library+0.066
Neg logit contributions
ums-0.183
atted-0.180
asing-0.176
 supervision-0.175
olds-0.172
Token there
Token ID612
Feature activation-8.00e-02

Pos logit contributions
 park+0.112
 library+0.103
 hike+0.100
 picnic+0.099
 ride+0.096
Neg logit contributions
ging-0.230
aged-0.227
atted-0.227
asing-0.222
cks-0.222
Token was
Token ID373
Feature activation-8.06e-02

Pos logit contributions
 park+0.102
 picnic+0.094
 hike+0.090
 library+0.088
 ride+0.088
Neg logit contributions
ums-0.213
atted-0.208
asing-0.200
aged-0.199
 supervision-0.198
Token a
Token ID257
Feature activation-1.19e-01

Pos logit contributions
 park+0.084
 hike+0.082
 museum+0.078
 forest+0.077
 store+0.076
Neg logit contributions
ums-0.215
asing-0.207
aged-0.204
atted-0.201
ging-0.199
Token boy
Token ID2933
Feature activation-3.58e-02

Pos logit contributions
 hike+0.118
 park+0.116
 arrived+0.111
 picnic+0.105
 forest+0.099
Neg logit contributions
ums-0.308
ging-0.291
akers-0.284
ucker-0.283
atted-0.282
Token named
Token ID3706
Feature activation-3.09e-02

Pos logit contributions
 park+0.052
 picnic+0.046
 forest+0.044
 store+0.044
 hike+0.043
Neg logit contributions
ums-0.095
asing-0.094
atted-0.093
 supervision-0.093
olds-0.090
TokenL
Token ID43
Feature activation+2.82e-02

Pos logit contributions
 picnic+0.132
 hike+0.131
 walk+0.125
 ride+0.124
 bus+0.122
Neg logit contributions
ums-0.159
ins-0.153
olds-0.147
asing-0.142
atted-0.140
Tokenily
Token ID813
Feature activation-2.43e-02

Pos logit contributions
ums+0.055
asing+0.055
 supervision+0.054
atted+0.054
olds+0.051
Neg logit contributions
 park-0.061
 picnic-0.056
 forest-0.055
 store-0.055
 hike-0.054
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 park+0.040
 picnic+0.037
 forest+0.036
 hike+0.036
 museum+0.035
Neg logit contributions
asing-0.057
atted-0.057
ums-0.056
 supervision-0.056
olds-0.054
Token Ben
Token ID3932
Feature activation-1.92e-02

Pos logit contributions
 park+0.036
 forest+0.034
 museum+0.033
 hike+0.032
 picnic+0.032
Neg logit contributions
 supervision-0.055
ums-0.054
asing-0.053
atted-0.053
aged-0.052
Token were
Token ID547
Feature activation-7.87e-02

Pos logit contributions
 park+0.028
 forest+0.025
 picnic+0.025
 hike+0.024
 museum+0.024
Neg logit contributions
asing-0.047
atted-0.047
 supervision-0.047
ums-0.047
olds-0.045
Token playing
Token ID2712
Feature activation-7.91e-02

Pos logit contributions
 park+0.140
 hike+0.126
 picnic+0.125
 forest+0.125
 museum+0.122
Neg logit contributions
ums-0.184
atted-0.180
asing-0.179
 supervision-0.174
olds-0.169
Token in
Token ID287
Feature activation-5.77e-02

Pos logit contributions
 park+0.110
 library+0.110
 hike+0.107
 picnic+0.104
 arrived+0.102
Neg logit contributions
asing-0.194
ums-0.192
ging-0.184
aged-0.183
atted-0.181
Token the
Token ID262
Feature activation-7.07e-02

Pos logit contributions
 picnic+0.084
 park+0.081
 hike+0.081
 arrived+0.078
 festival+0.076
Neg logit contributions
 supervision-0.128
cks-0.127
ums-0.127
atted-0.125
ging-0.123
Token park
Token ID3952
Feature activation-6.77e-02

Pos logit contributions
 hike+0.056
 arrived+0.055
 ride+0.049
 woke+0.048
 Under+0.046
Neg logit contributions
ums-0.181
 supervision-0.175
atted-0.169
cking-0.167
olds-0.167
Token.
Token ID13
Feature activation-3.20e-02

Pos logit contributions
 park+0.087
 picnic+0.077
 forest+0.074
 hike+0.073
 library+0.071
Neg logit contributions
ums-0.189
asing-0.184
atted-0.183
 supervision-0.181
olds-0.177
Token They
Token ID1119
Feature activation-4.30e-02

Pos logit contributions
 park+0.061
 forest+0.056
 festival+0.056
 ride+0.056
 picnic+0.055
Neg logit contributions
ums-0.066
atted-0.063
 supervision-0.060
ging-0.059
olds-0.058
Token you
Token ID345
Feature activation+3.09e-02

Pos logit contributions
ums+0.119
asing+0.119
atted+0.118
 supervision+0.118
olds+0.112
Neg logit contributions
 park-0.077
 picnic-0.067
 forest-0.066
 store-0.066
 museum-0.064
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
asing+0.081
 supervision+0.081
ums+0.079
atted+0.078
olds+0.078
Neg logit contributions
 park-0.043
 picnic-0.038
 forest-0.038
 hike-0.036
 store-0.036
Token<|endoftext|>
Token ID50256
Feature activation-7.04e-02

Pos logit contributions
ums+0.005
asing+0.005
 supervision+0.005
atted+0.004
olds+0.004
Neg logit contributions
 park-0.003
 picnic-0.003
 hike-0.003
 forest-0.003
 store-0.003
TokenOnce
Token ID7454
Feature activation-6.74e-02

Pos logit contributions
 picnic+0.123
 hike+0.118
 party+0.113
 walk+0.111
 bus+0.110
Neg logit contributions
ums-0.144
olds-0.137
cks-0.133
atted-0.130
ins-0.127
Token upon
Token ID2402
Feature activation-7.45e-02

Pos logit contributions
 park+0.096
 picnic+0.085
 forest+0.081
 store+0.081
 hike+0.079
Neg logit contributions
ums-0.179
atted-0.178
asing-0.176
 supervision-0.174
olds-0.169
Token a
Token ID257
Feature activation-1.00e-01

Pos logit contributions
 hike+0.101
 park+0.096
 forest+0.096
 hill+0.096
 festival+0.096
Neg logit contributions
ums-0.161
cks-0.160
asing-0.160
olds-0.159
aged-0.159
Token time
Token ID640
Feature activation-5.88e-02

Pos logit contributions
 park+0.094
 store+0.079
 forest+0.077
 museum+0.075
 library+0.071
Neg logit contributions
ums-0.307
asing-0.300
atted-0.300
 supervision-0.295
olds-0.286
Token,
Token ID11
Feature activation-7.62e-02

Pos logit contributions
 park+0.066
 picnic+0.055
 forest+0.054
 store+0.054
 library+0.052
Neg logit contributions
ums-0.174
atted-0.171
asing-0.170
 supervision-0.168
olds-0.164
Token a
Token ID257
Feature activation-1.11e-01

Pos logit contributions
 park+0.087
 hike+0.083
 picnic+0.081
 library+0.078
 museum+0.077
Neg logit contributions
ging-0.214
aged-0.212
atted-0.210
asing-0.210
cks-0.207
Token little
Token ID1310
Feature activation-4.39e-02

Pos logit contributions
 park+0.095
 hike+0.088
 picnic+0.084
 store+0.081
 forest+0.077
Neg logit contributions
ums-0.330
ging-0.311
asing-0.310
atted-0.310
ins-0.302
Token girl
Token ID2576
Feature activation-4.06e-02

Pos logit contributions
 park+0.056
 hike+0.053
 picnic+0.051
 store+0.048
 library+0.047
Neg logit contributions
ums-0.115
atted-0.112
asing-0.111
ging-0.111
 supervision-0.107
Token<|endoftext|>
Token ID50256
Feature activation-7.81e-02

Pos logit contributions
 park+0.003
 picnic+0.003
 forest+0.003
 store+0.003
 hike+0.002
Neg logit contributions
ums-0.007
asing-0.007
 supervision-0.007
atted-0.007
olds-0.007
TokenOnce
Token ID7454
Feature activation-7.07e-02

Pos logit contributions
 picnic+0.136
 hike+0.129
 party+0.128
 walk+0.124
 race+0.124
Neg logit contributions
ums-0.158
atted-0.145
olds-0.145
cks-0.143
ging-0.140
Token upon
Token ID2402
Feature activation-7.82e-02

Pos logit contributions
 park+0.092
 picnic+0.078
 forest+0.076
 store+0.075
 hike+0.073
Neg logit contributions
ums-0.198
asing-0.197
 supervision-0.195
atted-0.195
olds-0.187
Token a
Token ID257
Feature activation-1.03e-01

Pos logit contributions
 hike+0.101
 park+0.101
 forest+0.100
 museum+0.099
 festival+0.099
Neg logit contributions
asing-0.172
cks-0.172
ums-0.171
 themselves-0.169
aged-0.169
Token time
Token ID640
Feature activation-5.91e-02

Pos logit contributions
 park+0.096
 store+0.082
 forest+0.078
 museum+0.076
 library+0.072
Neg logit contributions
ums-0.317
asing-0.311
atted-0.310
 supervision-0.305
olds-0.295
Token,
Token ID11
Feature activation-8.30e-02

Pos logit contributions
 park+0.073
 picnic+0.062
 store+0.061
 forest+0.060
 library+0.058
Neg logit contributions
ums-0.168
atted-0.165
asing-0.162
 supervision-0.160
olds-0.157
Token there
Token ID612
Feature activation-7.84e-02

Pos logit contributions
 park+0.102
 picnic+0.094
 hike+0.093
 library+0.092
 museum+0.088
Neg logit contributions
ging-0.230
aged-0.226
atted-0.224
asing-0.223
cks-0.220
Token was
Token ID373
Feature activation-7.11e-02

Pos logit contributions
 park+0.095
 picnic+0.087
 hike+0.083
 library+0.081
 ride+0.080
Neg logit contributions
ums-0.216
atted-0.211
asing-0.207
 supervision-0.204
aged-0.202
Token a
Token ID257
Feature activation-1.09e-01

Pos logit contributions
 park+0.070
 hike+0.067
 museum+0.065
 library+0.062
 store+0.062
Neg logit contributions
ums-0.202
asing-0.195
aged-0.192
atted-0.192
ging-0.188
Token little
Token ID1310
Feature activation-3.90e-02

Pos logit contributions
 hike+0.107
 park+0.107
 arrived+0.099
 picnic+0.093
 forest+0.090
Neg logit contributions
ums-0.289
ging-0.275
atted-0.269
ucker-0.266
akers-0.265
Token girl
Token ID2576
Feature activation-3.05e-02

Pos logit contributions
 park+0.053
 hike+0.050
 picnic+0.047
 arrived+0.046
 library+0.044
Neg logit contributions
ums-0.098
atted-0.095
ging-0.094
asing-0.094
 supervision-0.091