TOP ACTIVATIONS
MAX = 5.383

Fox y . Fox y liked to dance in the forest
She loved to explore and try new things . Daisy wanted
She loved to explore and try new things . One day
She loved to explore and try new things .
Lily . She loved to try new things and have adventures
to study nature . She liked to look at the trees
loved animals . She especially liked bees ! One
mom and dad . He liked to laugh and cl ap
was very curious . He liked to explore the world around
He liked to explore and try new things . He would
Paw and Tree . They liked to play together in the
Jane . She liked to try new things , but sometimes
curious little boy . He liked to explore his neighborhood .
She loved to explore and try new things . Today ,
play in nature . They liked to run on the grass
was very curious . He liked to explore his surroundings .
on the farm . They liked to run , jump ,
very brave and liked to try new things .
who loved nature . She liked to play in the trees
in the classroom . She liked to touch the books ,

MIN ACTIVATIONS
MIN = -3.989

toys . She wanted her bag , her book , her
went home and put her box on her dress er .
it kept her warm and cos y all night long .
her brother and get the blanket back . Lily nodded her
if she could have her blanket back and her brother gave
she forgot to put the bowl away . As
the box and put the lid on . They ran to
He forgot to put his books and lunch in the bag
and wanted to get her ball back . Amy
it kept her warm and cos y all night long .
, and how light and boun cy it was .
yawn ed and rubbed her eyes . She wanted to go
They we pt until they felt better . Then they buried
. It was an ash tr ay . Fl uffy did
and made him drop his basket . She felt sorry and
, the bird put her blanket back inside . She was
dress , and a spark ly cape . Tom thought Elsa
he sp ied an ash tr ay . He picked up
Ben said , holding his bowl tightly . L
washed the dress until it was perfect , just like new

INTERVAL 3.040 - 5.383
CONTAINS 0.395%

girl named Lily . She loved to help her mom my
on a hill . They wanted to go inside and look
bird . The blue bird loved to sing and fly in
a big spoon . She loved to watch the vegetables move
had many teeth . They wanted to print crocod iles too

INTERVAL 0.697 - 3.040
CONTAINS 22.302%

yelled . " Stop it , both of you
her own warning . From then on , she made sure
saw the most amazing things and felt how great the world
! He went over hills , through fields , and even
the university was coming true . <|endoftext|> Once there was a

INTERVAL -1.646 - 0.697
CONTAINS 74.058%

owl saw Benny and flew down to help him . The
trees . " Look , I found the cow
that was almost as tall as her . She wanted to
watched it spin round and round and felt the gentle breeze
and ran to the water . He held out the stick

INTERVAL -3.989 - -1.646
CONTAINS 3.245%

insects away . The flower felt sad and soon its pet
and Lily struggling with the box . She smiles and walks
the treasure and give the rest to the museum . They
His dad put a hand on his shoulder
finally get to the hotel . It is very big and
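
The four INTERVAL ranges above appear to be equal-width quarters of the span from MIN = -3.989 to MAX = 5.383. The dump does not state this; it is inferred from the reported numbers. A minimal Python sketch that rebuilds the edges under that assumption and checks that the CONTAINS shares cover all tokens (the names bin_edges, N_BINS and shares are illustrative, not part of the original tool):

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into equal-width bins.
MIN_ACT = -3.989   # reported under MIN ACTIVATIONS
MAX_ACT = 5.383    # reported under TOP ACTIVATIONS
N_BINS = 4         # assumption: one bin per INTERVAL block


def bin_edges(lo, hi, n):
    """Return the n + 1 edges of n equal-width bins, rounded to 3 decimals."""
    width = (hi - lo) / n
    return [round(lo + i * width, 3) for i in range(n + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)

# CONTAINS percentages copied from the dump, lowest interval first.
shares = [3.245, 74.058, 22.302, 0.395]

for lo, hi, share in zip(edges, edges[1:], shares):
    print(f"{lo:+.3f} to {hi:+.3f}: {share:.3f}% of tokens")

total = round(sum(shares), 3)
print("total share:", total)
```

Under this reading each bin is 2.343 wide, and about three quarters of all tokens fall in the bin that straddles zero, while only 0.395% reach the top bin.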
Token Fox
Token ID5426
Feature activation+5.94e-01

Pos logit contributions
 guessing+0.006
 facts+0.006
 tours+0.006
 topics+0.006
 motorcycles+0.006
Neg logit contributions
kin-0.005
ivery-0.005
ifying-0.004
 bruised-0.004
ishy-0.004
Tokeny
Token ID88
Feature activation+1.34e-01

Pos logit contributions
 guessing+0.032
 tours+0.031
 facts+0.031
 topics+0.031
 motorcycles+0.031
Neg logit contributions
kin-0.019
ivery-0.016
ifying-0.016
 bruised-0.016
ishy-0.015
Token.
Token ID13
Feature activation+8.64e-01

Pos logit contributions
 guessing+0.007
 facts+0.006
 tours+0.006
 topics+0.006
 motorcycles+0.006
Neg logit contributions
kin-0.005
ivery-0.005
ifying-0.004
 bruised-0.004
ishy-0.004
Token Fox
Token ID5426
Feature activation-2.34e-02

Pos logit contributions
 guessing+0.045
 facts+0.042
 tours+0.042
 topics+0.042
 discovering+0.041
Neg logit contributions
kin-0.029
ivery-0.027
ifying-0.025
 bruised-0.025
ishy-0.023
Tokeny
Token ID88
Feature activation+8.19e-01

Pos logit contributions
kin+0.001
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.001
 facts-0.001
 tours-0.001
 topics-0.001
 motorcycles-0.001
Token liked
Token ID8288
Feature activation+5.38e+00

Pos logit contributions
 guessing+0.038
 facts+0.037
 tours+0.037
 topics+0.036
 motorcycles+0.036
Neg logit contributions
kin-0.033
ivery-0.029
 bruised-0.027
ifying-0.027
robe-0.027
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
ves+0.309
icians+0.302
bors+0.298
ises+0.295
 occasions+0.292
Neg logit contributions
 Lily-0.105
 a-0.092
 cold-0.092
 hungry-0.091
 full-0.090
Token dance
Token ID9280
Feature activation+6.31e-01

Pos logit contributions
 guessing+0.048
 topics+0.047
 facts+0.047
 discovering+0.046
 motorcycles+0.046
Neg logit contributions
kin-0.049
ivery-0.044
 bruised-0.041
 limp-0.040
ishy-0.039
Token in
Token ID287
Feature activation+1.95e+00

Pos logit contributions
 guessing+0.026
 facts+0.025
 motorcycles+0.025
 mills+0.025
 topics+0.025
Neg logit contributions
kin-0.029
ivery-0.025
ifying-0.025
robe-0.024
 bruised-0.024
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
 facts+0.104
arm+0.102
ups+0.101
 mills+0.097
 topics+0.097
Neg logit contributions
kin-0.051
 sill-0.048
ifying-0.046
 bruised-0.045
 empty-0.043
Token forest
Token ID8222
Feature activation+1.38e+00

Pos logit contributions
 facts+0.046
 guessing+0.045
 motorcycles+0.044
 topics+0.043
 mills+0.043
Neg logit contributions
kin-0.047
ivery-0.044
ifying-0.042
ishy-0.041
 bruised-0.040
Token She
Token ID1375
Feature activation+9.01e-01

Pos logit contributions
kin+0.005
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.003
Neg logit contributions
 guessing-0.008
 facts-0.007
 tours-0.007
 topics-0.007
 discovering-0.007
Token loved
Token ID6151
Feature activation+4.79e+00

Pos logit contributions
 facts+0.043
 guessing+0.042
 tours+0.041
 motorcycles+0.041
 topics+0.041
Neg logit contributions
kin-0.035
ivery-0.031
robe-0.028
 bruised-0.028
ishy-0.028
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
ves+0.256
icians+0.239
bors+0.233
isting+0.224
 Street+0.223
Neg logit contributions
 Lily-0.135
 cold-0.123
 dry-0.121
 wet-0.121
 itch-0.119
Token explore
Token ID7301
Feature activation+2.97e+00

Pos logit contributions
ups+0.067
 guessing+0.066
icians+0.066
 facts+0.066
ths+0.065
Neg logit contributions
kin-0.057
ivery-0.050
 limp-0.047
robe-0.045
ishy-0.045
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
icians+0.179
 facts+0.167
 Exhibit+0.165
cles+0.164
bors+0.163
Neg logit contributions
he-0.066
 empty-0.061
 alone-0.057
 dry-0.053
 like-0.051
Token try
Token ID1949
Feature activation+5.38e+00

Pos logit contributions
 motorcycles+0.085
icians+0.082
cles+0.081
 facts+0.080
arm+0.079
Neg logit contributions
kin-0.065
 sill-0.063
ivery-0.063
ifying-0.061
 bruised-0.061
Token new
Token ID649
Feature activation+3.40e+00

Pos logit contributions
 occasions+0.285
cles+0.283
icians+0.273
bors+0.270
ths+0.253
Neg logit contributions
 out-0.134
 low-0.133
 down-0.132
 okay-0.131
 sad-0.122
Token things
Token ID1243
Feature activation+1.16e+00

Pos logit contributions
icians+0.141
 Exhibit+0.134
arm+0.132
ending+0.131
bors+0.129
Neg logit contributions
 eye-0.109
ishy-0.106
 away-0.106
 sill-0.104
 like-0.100
Token.
Token ID13
Feature activation+6.85e-01

Pos logit contributions
 guessing+0.061
 topics+0.060
 motorcycles+0.060
 facts+0.060
 mills+0.059
Neg logit contributions
kin-0.036
ivery-0.030
 bruised-0.030
ifying-0.028
robe-0.028
Token Daisy
Token ID40355
Feature activation+2.53e+00

Pos logit contributions
 guessing+0.036
 facts+0.036
 motorcycles+0.035
 tours+0.035
 mills+0.034
Neg logit contributions
kin-0.021
ivery-0.020
ifying-0.018
 bruised-0.018
ishy-0.018
Token wanted
Token ID2227
Feature activation+3.44e+00

Pos logit contributions
 facts+0.135
 motorcycles+0.128
 tours+0.127
 guessing+0.126
 topics+0.125
Neg logit contributions
kin-0.078
cy-0.066
ishy-0.066
 still-0.064
ivery-0.064
Token She
Token ID1375
Feature activation-2.88e-02

Pos logit contributions
kin+0.050
ivery+0.045
ifying+0.043
 bruised+0.042
ishy+0.041
Neg logit contributions
 topics-0.054
 facts-0.054
 discovering-0.053
 tours-0.053
 guessing-0.052
Token loved
Token ID6151
Feature activation+4.14e+00

Pos logit contributions
kin+0.001
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.001
 facts-0.001
 tours-0.001
 topics-0.001
 discovering-0.001
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
icians+0.197
ves+0.196
bors+0.184
arm+0.184
isting+0.175
Neg logit contributions
 Lily-0.134
 cold-0.129
 empty-0.126
 wet-0.126
 bruised-0.125
Token explore
Token ID7301
Feature activation+2.73e+00

Pos logit contributions
icians+0.069
 guessing+0.068
 facts+0.067
 motorcycles+0.067
ups+0.067
Neg logit contributions
kin-0.056
ivery-0.049
 limp-0.046
 bruised-0.044
 sill-0.043
Token and
Token ID290
Feature activation+1.98e+00

Pos logit contributions
icians+0.163
 guessing+0.149
 facts+0.148
arm+0.148
cles+0.148
Neg logit contributions
 empty-0.059
he-0.057
 alone-0.053
 dry-0.050
 Lily-0.049
Token try
Token ID1949
Feature activation+5.36e+00

Pos logit contributions
 motorcycles+0.088
icians+0.088
cles+0.086
arm+0.085
 facts+0.083
Neg logit contributions
 sill-0.066
kin-0.065
 bruised-0.064
ivery-0.064
 limp-0.064
Token new
Token ID649
Feature activation+3.17e+00

Pos logit contributions
 occasions+0.284
cles+0.283
icians+0.276
bors+0.275
ths+0.258
Neg logit contributions
 low-0.133
 down-0.132
 out-0.131
 okay-0.128
 sad-0.124
Token things
Token ID1243
Feature activation+8.10e-01

Pos logit contributions
icians+0.133
arm+0.126
 Exhibit+0.123
ending+0.122
ths+0.122
Neg logit contributions
 eye-0.102
 sill-0.100
ishy-0.097
 away-0.095
 idea-0.094
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
 guessing+0.043
 facts+0.042
 topics+0.042
 motorcycles+0.041
 tours+0.041
Neg logit contributions
kin-0.026
ivery-0.022
 bruised-0.022
ifying-0.021
robe-0.020
Token One
Token ID1881
Feature activation-3.04e-01

Pos logit contributions
 guessing+0.025
 facts+0.025
 tours+0.024
 motorcycles+0.024
 discovering+0.024
Neg logit contributions
kin-0.015
ivery-0.014
ifying-0.013
 bruised-0.013
ishy-0.012
Token day
Token ID1110
Feature activation+6.93e-01

Pos logit contributions
kin+0.015
ivery+0.015
ifying+0.014
 bruised+0.014
ishy+0.013
Neg logit contributions
 guessing-0.011
 tours-0.010
 facts-0.010
 discovering-0.010
 topics-0.010
Token She
Token ID1375
Feature activation+2.33e-01

Pos logit contributions
kin+0.064
ivery+0.059
ifying+0.056
 bruised+0.054
ishy+0.053
Neg logit contributions
 topics-0.062
 facts-0.060
 discovering-0.060
 tours-0.059
 museums-0.058
Token loved
Token ID6151
Feature activation+4.07e+00

Pos logit contributions
 guessing+0.010
 facts+0.009
 tours+0.009
 motorcycles+0.009
 topics+0.009
Neg logit contributions
kin-0.011
ivery-0.010
ifying-0.009
 bruised-0.009
ishy-0.009
Token to
Token ID284
Feature activation+9.96e-01

Pos logit contributions
icians+0.191
ves+0.184
bors+0.183
arm+0.181
 Street+0.171
Neg logit contributions
 Lily-0.128
 empty-0.126
 bruised-0.125
 cold-0.124
 wet-0.123
Token explore
Token ID7301
Feature activation+2.83e+00

Pos logit contributions
 guessing+0.043
 facts+0.042
 topics+0.041
 motorcycles+0.041
 tours+0.041
Neg logit contributions
kin-0.043
ivery-0.038
 bruised-0.035
ifying-0.034
ishy-0.034
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
icians+0.173
 facts+0.162
arm+0.160
 Exhibit+0.160
cles+0.158
Neg logit contributions
he-0.056
 empty-0.055
 alone-0.049
 dry-0.048
kin-0.048
Token try
Token ID1949
Feature activation+5.33e+00

Pos logit contributions
 motorcycles+0.082
icians+0.080
cles+0.079
arm+0.079
 facts+0.078
Neg logit contributions
kin-0.063
 sill-0.061
ivery-0.060
ifying-0.059
 bruised-0.059
Token new
Token ID649
Feature activation+3.20e+00

Pos logit contributions
cles+0.280
 occasions+0.279
bors+0.275
icians+0.274
ths+0.263
Neg logit contributions
 low-0.130
 down-0.125
 out-0.123
 sad-0.116
 okay-0.113
Token things
Token ID1243
Feature activation+8.90e-01

Pos logit contributions
icians+0.132
arm+0.128
 Exhibit+0.127
ending+0.126
bors+0.124
Neg logit contributions
 eye-0.101
 sill-0.098
 away-0.095
ishy-0.094
 like-0.092
Token.
Token ID13
Feature activation+3.46e-01

Pos logit contributions
 guessing+0.048
 facts+0.047
 topics+0.047
 motorcycles+0.046
 tours+0.046
Neg logit contributions
kin-0.028
ivery-0.023
 bruised-0.023
ifying-0.022
robe-0.021
Token\n
Token ID198
Feature activation+1.81e+00

Pos logit contributions
 guessing+0.018
 facts+0.017
 tours+0.017
 motorcycles+0.017
 discovering+0.017
Neg logit contributions
kin-0.012
ivery-0.011
ifying-0.010
 bruised-0.010
ishy-0.010
Token\n
Token ID198
Feature activation+1.71e+00

Pos logit contributions
 facts+0.091
 guessing+0.087
 motorcycles+0.087
 tours+0.086
 topics+0.085
Neg logit contributions
kin-0.062
he-0.058
ivery-0.055
robe-0.048
ifying-0.048
Token Lily
Token ID20037
Feature activation-5.95e-01

Pos logit contributions
kin+0.021
ivery+0.018
ifying+0.017
 bruised+0.017
ishy+0.016
Neg logit contributions
 guessing-0.039
 discovering-0.037
 tours-0.037
 mills-0.035
 topics-0.035
Token.
Token ID13
Feature activation-1.52e+00

Pos logit contributions
kin+0.027
ivery+0.025
ifying+0.025
 bruised+0.023
ishy+0.023
Neg logit contributions
 guessing-0.024
 discovering-0.023
 motorcycles-0.023
 facts-0.023
 topics-0.022
Token She
Token ID1375
Feature activation+5.28e-03

Pos logit contributions
kin+0.067
ivery+0.059
ifying+0.059
rier+0.056
 bruised+0.055
Neg logit contributions
 guessing-0.063
 discovering-0.062
 topics-0.061
 museums-0.060
 tours-0.060
Token loved
Token ID6151
Feature activation+4.43e+00

Pos logit contributions
 guessing+0.000
 facts+0.000
 tours+0.000
 discovering+0.000
 topics+0.000
Neg logit contributions
kin-0.000
ivery-0.000
ifying-0.000
 bruised-0.000
ishy-0.000
Token to
Token ID284
Feature activation+9.22e-01

Pos logit contributions
icians+0.217
ves+0.211
bors+0.208
arm+0.199
 Street+0.195
Neg logit contributions
 Lily-0.134
 wet-0.129
 dry-0.129
 cold-0.128
 bruised-0.128
Token try
Token ID1949
Feature activation+5.31e+00

Pos logit contributions
 guessing+0.040
 facts+0.040
 topics+0.039
 discovering+0.038
 tours+0.038
Neg logit contributions
kin-0.039
ivery-0.035
 bruised-0.032
ifying-0.032
robe-0.031
Token new
Token ID649
Feature activation+3.57e+00

Pos logit contributions
bors+0.292
cles+0.286
 occasions+0.280
icians+0.279
ager+0.276
Neg logit contributions
 warm-0.109
 out-0.105
 cold-0.104
 wet-0.103
 dark-0.103
Token things
Token ID1243
Feature activation+7.50e-01

Pos logit contributions
icians+0.165
ending+0.164
ll+0.163
bors+0.160
 Exhibit+0.156
Neg logit contributions
 eye-0.096
 idea-0.089
 Lily-0.084
 bruised-0.082
ishy-0.082
Token and
Token ID290
Feature activation+1.79e+00

Pos logit contributions
 guessing+0.037
 facts+0.036
 topics+0.036
 motorcycles+0.036
 tours+0.035
Neg logit contributions
kin-0.027
ivery-0.024
 bruised-0.023
ifying-0.022
ishy-0.022
Token have
Token ID423
Feature activation+2.18e+00

Pos logit contributions
 motorcycles+0.088
arm+0.087
icians+0.087
cles+0.085
 tours+0.085
Neg logit contributions
kin-0.058
 sill-0.050
 bruised-0.050
 limp-0.048
ivery-0.047
Token adventures
Token ID17545
Feature activation+5.88e-01

Pos logit contributions
arm+0.101
cles+0.097
 mills+0.097
ule+0.096
 topics+0.094
Neg logit contributions
kin-0.067
 empty-0.063
 OK-0.061
 sill-0.060
 bruised-0.060
Token to
Token ID284
Feature activation+8.50e-01

Pos logit contributions
icians+0.176
bors+0.171
ves+0.162
arm+0.158
 Street+0.155
Neg logit contributions
 bruised-0.135
 Lily-0.130
 empty-0.125
 itch-0.123
 Mom-0.123
Token study
Token ID2050
Feature activation+1.80e+00

Pos logit contributions
 guessing+0.036
 facts+0.036
 topics+0.035
 discovering+0.035
 motorcycles+0.035
Neg logit contributions
kin-0.037
ivery-0.032
 bruised-0.030
ifying-0.030
robe-0.030
Token nature
Token ID3450
Feature activation+1.63e+00

Pos logit contributions
 guessing+0.065
bors+0.062
icians+0.062
 whales+0.062
 discovering+0.061
Neg logit contributions
kin-0.084
he-0.073
robe-0.072
ifying-0.070
 bruised-0.070
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 guessing+0.086
 discovering+0.083
icians+0.083
 motorcycles+0.083
 mills+0.082
Neg logit contributions
kin-0.050
robe-0.042
he-0.041
ivery-0.039
ifying-0.039
Token She
Token ID1375
Feature activation+1.47e+00

Pos logit contributions
kin+0.011
ivery+0.010
ifying+0.010
 bruised+0.009
ishy+0.009
Neg logit contributions
 guessing-0.016
 facts-0.015
 tours-0.015
 topics-0.015
 motorcycles-0.015
Token liked
Token ID8288
Feature activation+5.30e+00

Pos logit contributions
 facts+0.074
 tours+0.073
 guessing+0.073
 topics+0.072
 heights+0.070
Neg logit contributions
kin-0.050
robe-0.040
 bruised-0.040
ivery-0.040
ishy-0.038
Token to
Token ID284
Feature activation+1.42e+00

Pos logit contributions
bors+0.323
ises+0.313
icians+0.305
isting+0.302
ves+0.300
Neg logit contributions
 Lily-0.108
 her-0.097
 Mom-0.093
 full-0.088
 like-0.087
Token look
Token ID804
Feature activation-3.81e-02

Pos logit contributions
 guessing+0.055
 motorcycles+0.055
 facts+0.053
icians+0.053
bors+0.053
Neg logit contributions
kin-0.063
ivery-0.056
 sill-0.055
he-0.052
 limp-0.051
Token at
Token ID379
Feature activation+1.93e+00

Pos logit contributions
kin+0.002
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.002
 facts-0.002
 tours-0.002
 motorcycles-0.002
 topics-0.002
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 guessing+0.090
 facts+0.084
icians+0.083
bors+0.082
 topics+0.082
Neg logit contributions
 bruised-0.064
kin-0.064
ifying-0.062
 sill-0.057
 limp-0.056
Token trees
Token ID7150
Feature activation+1.11e+00

Pos logit contributions
 guessing+0.090
icians+0.086
bors+0.085
 motorcycles+0.085
 discovering+0.084
Neg logit contributions
ifying-0.082
kin-0.077
ishy-0.072
he-0.071
ivery-0.070
Token loved
Token ID6151
Feature activation+4.47e+00

Pos logit contributions
 guessing+0.041
 motorcycles+0.040
 facts+0.040
 tours+0.039
 topics+0.039
Neg logit contributions
kin-0.035
ivery-0.031
 bruised-0.030
ifying-0.030
ishy-0.029
Token animals
Token ID4695
Feature activation+1.69e+00

Pos logit contributions
ves+0.224
icians+0.222
bors+0.213
is+0.211
arm+0.203
Neg logit contributions
 dark-0.122
 cold-0.117
 Lily-0.114
 full-0.113
 gloomy-0.112
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
 motorcycles+0.105
 tours+0.104
ths+0.103
 facts+0.103
 mills+0.102
Neg logit contributions
kin-0.034
 bruised-0.026
ivery-0.025
he-0.024
ifying-0.024
Token She
Token ID1375
Feature activation+1.09e+00

Pos logit contributions
 guessing+0.032
 facts+0.031
 motorcycles+0.031
 tours+0.030
 topics+0.030
Neg logit contributions
kin-0.018
ivery-0.016
ifying-0.015
 bruised-0.015
ishy-0.014
Token especially
Token ID2592
Feature activation+8.03e-01

Pos logit contributions
 facts+0.056
 guessing+0.055
 motorcycles+0.054
 topics+0.053
 tours+0.053
Neg logit contributions
kin-0.038
ivery-0.032
 bruised-0.030
ishy-0.030
robe-0.029
Token liked
Token ID8288
Feature activation+5.29e+00

Pos logit contributions
 guessing+0.040
 tours+0.039
 motorcycles+0.039
 facts+0.038
 mills+0.037
Neg logit contributions
kin-0.030
ivery-0.025
 bruised-0.025
ishy-0.024
ifying-0.023
Token bees
Token ID17002
Feature activation+2.04e+00

Pos logit contributions
icians+0.257
cles+0.255
ves+0.255
is+0.250
iting+0.245
Neg logit contributions
 Lily-0.134
 like-0.122
 dark-0.118
 cold-0.117
 full-0.117
Token!
Token ID0
Feature activation+1.42e-01

Pos logit contributions
 motorcycles+0.103
 topics+0.098
 tours+0.098
 guessing+0.096
 mills+0.096
Neg logit contributions
kin-0.071
 bruised-0.061
ivery-0.061
ifying-0.057
ishy-0.055
Token\n
Token ID198
Feature activation+1.20e+00

Pos logit contributions
 guessing+0.007
 facts+0.006
 tours+0.006
 topics+0.006
 motorcycles+0.006
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token\n
Token ID198
Feature activation+1.37e+00

Pos logit contributions
 facts+0.055
 guessing+0.054
 motorcycles+0.053
 tours+0.053
 topics+0.052
Neg logit contributions
kin-0.046
ivery-0.041
he-0.040
ifying-0.038
 bruised-0.037
TokenOne
Token ID3198
Feature activation-3.56e-01

Pos logit contributions
 facts+0.055
 guessing+0.054
 motorcycles+0.054
icians+0.053
 tours+0.050
Neg logit contributions
kin-0.059
ivery-0.057
he-0.053
ifying-0.050
ishy-0.050
Token mom
Token ID1995
Feature activation+5.15e-01

Pos logit contributions
 discovering+0.037
 guessing+0.037
 tours+0.036
 facts+0.036
 topics+0.036
Neg logit contributions
kin-0.037
ivery-0.031
ifying-0.031
 bruised-0.031
robe-0.031
Token and
Token ID290
Feature activation+1.79e+00

Pos logit contributions
 guessing+0.026
 facts+0.026
 tours+0.025
 topics+0.025
 discovering+0.025
Neg logit contributions
kin-0.019
ivery-0.016
ifying-0.016
 bruised-0.015
ishy-0.014
Token dad
Token ID9955
Feature activation+1.02e+00

Pos logit contributions
 topics+0.094
 facts+0.093
 guessing+0.092
 tours+0.092
arm+0.092
Neg logit contributions
kin-0.055
 bruised-0.044
 sorry-0.042
cloth-0.040
robe-0.039
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
 tours+0.057
 facts+0.057
 guessing+0.056
 motorcycles+0.055
 mills+0.055
Neg logit contributions
kin-0.031
ivery-0.025
ifying-0.024
 bruised-0.023
cloth-0.022
Token He
Token ID679
Feature activation+1.57e+00

Pos logit contributions
kin+0.008
ivery+0.007
ifying+0.007
 bruised+0.007
ishy+0.007
Neg logit contributions
 guessing-0.009
 facts-0.009
 tours-0.009
 motorcycles-0.009
 topics-0.009
Token liked
Token ID8288
Feature activation+5.26e+00

Pos logit contributions
 tours+0.080
 facts+0.079
 topics+0.078
 heights+0.078
 guessing+0.077
Neg logit contributions
kin-0.057
robe-0.044
ivery-0.043
 bruised-0.043
ishy-0.042
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
bors+0.273
ises+0.268
ths+0.267
icians+0.262
ves+0.258
Neg logit contributions
 Lily-0.123
 Mom-0.104
 a-0.104
 sleepy-0.103
 like-0.102
Token laugh
Token ID6487
Feature activation+6.19e-01

Pos logit contributions
 guessing+0.051
 motorcycles+0.050
 facts+0.050
 discovering+0.050
 topics+0.050
Neg logit contributions
kin-0.058
ivery-0.051
ifying-0.047
cloth-0.046
ishy-0.046
Token and
Token ID290
Feature activation+9.44e-01

Pos logit contributions
 guessing+0.024
 facts+0.023
 discovering+0.023
 tours+0.023
 topics+0.023
Neg logit contributions
kin-0.029
ivery-0.027
ifying-0.025
robe-0.025
 bruised-0.025
Token cl
Token ID537
Feature activation-6.27e-01

Pos logit contributions
 tours+0.038
 guessing+0.037
 facts+0.037
 mills+0.037
 topics+0.037
Neg logit contributions
kin-0.042
ivery-0.037
ifying-0.036
 bruised-0.036
robe-0.034
Tokenap
Token ID499
Feature activation+1.25e-01

Pos logit contributions
kin+0.028
 bruised+0.026
ivery+0.026
ifying+0.025
 limp+0.024
Neg logit contributions
 guessing-0.025
 tours-0.024
 topics-0.024
 facts-0.024
 discovering-0.024
Token was
Token ID373
Feature activation-1.44e+00

Pos logit contributions
kin+0.004
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.004
 facts-0.003
 tours-0.003
 motorcycles-0.003
 discovering-0.003
Token very
Token ID845
Feature activation-1.20e+00

Pos logit contributions
kin+0.075
ivery+0.070
ifying+0.069
ishy+0.064
cloth+0.062
Neg logit contributions
 guessing-0.058
 discovering-0.052
 learning-0.051
 exploring-0.049
 weighing-0.049
Token curious
Token ID11040
Feature activation+7.86e-01

Pos logit contributions
kin+0.053
ifying+0.052
ivery+0.050
robe+0.046
cloth+0.046
Neg logit contributions
 guessing-0.051
 discovering-0.045
 tours-0.044
 weighing-0.043
 learning-0.043
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
 guessing+0.035
 tours+0.034
 facts+0.033
 topics+0.033
icians+0.033
Neg logit contributions
kin-0.032
 bruised-0.028
ivery-0.028
ifying-0.027
ishy-0.026
Token He
Token ID679
Feature activation+9.06e-01

Pos logit contributions
 guessing+0.013
 facts+0.012
 tours+0.012
 discovering+0.012
 topics+0.012
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token liked
Token ID8288
Feature activation+5.24e+00

Pos logit contributions
 guessing+0.045
 facts+0.045
 tours+0.044
 topics+0.044
 motorcycles+0.043
Neg logit contributions
kin-0.031
ivery-0.027
 bruised-0.026
ishy-0.026
ifying-0.025
Token to
Token ID284
Feature activation+1.76e+00

Pos logit contributions
ves+0.299
bors+0.288
is+0.284
arm+0.280
ises+0.279
Neg logit contributions
 alone-0.109
 dark-0.109
 empty-0.105
 Lily-0.104
 cold-0.102
Token explore
Token ID7301
Feature activation+3.05e+00

Pos logit contributions
ups+0.075
icians+0.074
ths+0.073
 facts+0.072
 guessing+0.072
Neg logit contributions
kin-0.069
ivery-0.063
 limp-0.059
 sill-0.058
 eye-0.057
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
icians+0.179
bors+0.171
ending+0.170
arm+0.168
 occasions+0.168
Neg logit contributions
 empty-0.062
he-0.058
 alone-0.058
 like-0.053
 dark-0.052
Token world
Token ID995
Feature activation+1.88e+00

Pos logit contributions
icians+0.096
bors+0.095
 facts+0.092
ves+0.092
 occasions+0.088
Neg logit contributions
he-0.072
 seeker-0.071
 empty-0.071
 sill-0.069
kin-0.069
Token around
Token ID1088
Feature activation+8.89e-01

Pos logit contributions
icians+0.101
 guessing+0.100
bors+0.100
 facts+0.100
 mills+0.099
Neg logit contributions
kin-0.047
he-0.047
 sill-0.041
robe-0.039
 alone-0.038
Token He
Token ID679
Feature activation+3.57e-01

Pos logit contributions
kin+0.019
ivery+0.018
ifying+0.017
 bruised+0.016
ishy+0.016
Neg logit contributions
 facts-0.021
 guessing-0.021
 tours-0.021
 motorcycles-0.021
 topics-0.020
Token liked
Token ID8288
Feature activation+4.36e+00

Pos logit contributions
 guessing+0.017
 facts+0.016
 tours+0.016
 topics+0.016
 motorcycles+0.016
Neg logit contributions
kin-0.014
ivery-0.013
 bruised-0.012
ifying-0.012
robe-0.011
Token to
Token ID284
Feature activation+6.98e-01

Pos logit contributions
arm+0.231
icians+0.223
bors+0.221
ves+0.220
 topics+0.220
Neg logit contributions
 bruised-0.115
 alone-0.114
 full-0.112
 empty-0.105
 wet-0.104
Token explore
Token ID7301
Feature activation+2.90e+00

Pos logit contributions
 guessing+0.030
 facts+0.029
 topics+0.029
 tours+0.029
 motorcycles+0.029
Neg logit contributions
kin-0.031
ivery-0.027
 bruised-0.025
ifying-0.024
robe-0.024
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
icians+0.169
 occasions+0.162
 facts+0.162
arm+0.161
bors+0.160
Neg logit contributions
he-0.057
 alone-0.055
 empty-0.055
 low-0.050
 like-0.049
Token try
Token ID1949
Feature activation+5.22e+00

Pos logit contributions
 motorcycles+0.088
arm+0.087
 facts+0.086
 tours+0.086
cles+0.085
Neg logit contributions
kin-0.065
 sill-0.059
 bruised-0.059
ifying-0.058
 limp-0.058
Token new
Token ID649
Feature activation+3.35e+00

Pos logit contributions
 occasions+0.277
bors+0.264
cles+0.263
ths+0.260
pill+0.251
Neg logit contributions
 low-0.141
 down-0.136
 out-0.135
 okay-0.130
 sad-0.122
Token things
Token ID1243
Feature activation+1.12e+00

Pos logit contributions
arm+0.137
 Exhibit+0.130
ending+0.129
icians+0.129
ths+0.127
Neg logit contributions
 eye-0.106
 sill-0.106
 away-0.104
 like-0.097
 OK-0.096
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
 topics+0.061
 guessing+0.060
 facts+0.060
 mills+0.059
 tours+0.059
Neg logit contributions
kin-0.034
 bruised-0.027
ivery-0.027
ifying-0.027
robe-0.026
Token He
Token ID679
Feature activation+1.21e+00

Pos logit contributions
 guessing+0.015
 facts+0.014
 tours+0.014
 discovering+0.014
 topics+0.014
Neg logit contributions
kin-0.010
ivery-0.009
ifying-0.009
 bruised-0.009
ishy-0.008
Token would
Token ID561
Feature activation+1.22e+00

Pos logit contributions
 facts+0.057
 tours+0.056
 guessing+0.056
 topics+0.056
 motorcycles+0.055
Neg logit contributions
kin-0.049
ivery-0.039
robe-0.039
 bruised-0.038
ishy-0.037
Token Paw
Token ID35553
Feature activation+1.04e+00

Pos logit contributions
 facts+0.029
 guessing+0.029
 tours+0.029
 topics+0.028
 motorcycles+0.028
Neg logit contributions
kin-0.020
ivery-0.017
ifying-0.017
 bruised-0.017
ishy-0.015
Token and
Token ID290
Feature activation-5.93e-01

Pos logit contributions
 guessing+0.042
 tours+0.041
 topics+0.041
 facts+0.041
 mills+0.039
Neg logit contributions
kin-0.048
ivery-0.041
 bruised-0.040
ifying-0.040
ishy-0.039
Token Tree
Token ID12200
Feature activation+1.01e+00

Pos logit contributions
kin+0.024
ivery+0.022
ifying+0.022
 bruised+0.020
ishy+0.020
Neg logit contributions
 guessing-0.028
 facts-0.027
 tours-0.027
 discovering-0.027
 motorcycles-0.027
Token.
Token ID13
Feature activation+7.09e-01

Pos logit contributions
 guessing+0.052
 tours+0.052
 facts+0.051
 topics+0.050
 discovering+0.050
Neg logit contributions
kin-0.034
ivery-0.029
ifying-0.029
 bruised-0.028
ishy-0.026
Token They
Token ID1119
Feature activation-1.28e-01

Pos logit contributions
 guessing+0.036
 facts+0.034
 topics+0.034
 discovering+0.034
 tours+0.034
Neg logit contributions
kin-0.024
ivery-0.022
ifying-0.021
 bruised-0.020
ishy-0.020
Token liked
Token ID8288
Feature activation+5.17e+00

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.006
 facts-0.005
 tours-0.005
 discovering-0.005
 topics-0.005
Token to
Token ID284
Feature activation+1.33e+00

Pos logit contributions
is+0.295
ves+0.293
icians+0.291
 occasions+0.283
bors+0.282
Neg logit contributions
 Lily-0.105
 like-0.105
 a-0.097
 Tim-0.094
 full-0.094
Token play
Token ID711
Feature activation+1.87e+00

Pos logit contributions
 guessing+0.056
 facts+0.055
 motorcycles+0.054
 discovering+0.054
 tours+0.053
Neg logit contributions
kin-0.055
ivery-0.051
robe-0.045
 bruised-0.045
 limp-0.045
Token together
Token ID1978
Feature activation+1.30e+00

Pos logit contributions
 facts+0.094
 cultures+0.089
ups+0.089
 motorcycles+0.088
 topics+0.087
Neg logit contributions
kin-0.067
robe-0.056
he-0.054
ivery-0.054
 seeker-0.046
Token in
Token ID287
Feature activation+1.67e+00

Pos logit contributions
 facts+0.058
 tours+0.057
 motorcycles+0.057
 guessing+0.056
 mills+0.056
Neg logit contributions
kin-0.055
ivery-0.043
ifying-0.043
he-0.041
robe-0.041
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 facts+0.088
ups+0.085
arm+0.085
ending+0.082
cles+0.081
Neg logit contributions
kin-0.047
 sill-0.039
 bruised-0.039
he-0.039
ifying-0.038
Token Jane
Token ID12091
Feature activation-4.26e-01

Pos logit contributions
kin+0.024
ivery+0.023
ifying+0.021
 bruised+0.020
ishy+0.020
Neg logit contributions
 guessing-0.032
 tours-0.032
 facts-0.031
 discovering-0.031
 topics-0.030
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
kin+0.019
ivery+0.017
ifying+0.016
 bruised+0.016
ishy+0.016
Neg logit contributions
 guessing-0.018
 facts-0.017
 discovering-0.017
 motorcycles-0.017
 tours-0.017
Token She
Token ID1375
Feature activation+1.28e-01

Pos logit contributions
kin+0.031
ivery+0.028
ifying+0.028
 bruised+0.027
robe+0.024
Neg logit contributions
 guessing-0.035
 facts-0.035
 tours-0.034
 discovering-0.034
 topics-0.034
Token liked
Token ID8288
Feature activation+4.49e+00

Pos logit contributions
 guessing+0.005
 facts+0.005
 tours+0.005
 topics+0.005
 motorcycles+0.005
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token to
Token ID284
Feature activation+3.44e-01

Pos logit contributions
icians+0.221
bors+0.218
ves+0.214
arm+0.213
ises+0.207
Neg logit contributions
 Lily-0.134
 Mom-0.120
 wet-0.119
 empty-0.118
 cold-0.117
Token try
Token ID1949
Feature activation+5.17e+00

Pos logit contributions
 guessing+0.014
 facts+0.013
 tours+0.013
 motorcycles+0.013
 topics+0.013
Neg logit contributions
kin-0.016
ivery-0.015
 bruised-0.014
ifying-0.014
ishy-0.014
Token new
Token ID649
Feature activation+3.38e+00

Pos logit contributions
cles+0.278
bors+0.276
 occasions+0.272
icians+0.263
aro+0.258
Neg logit contributions
 out-0.113
 dark-0.109
 sad-0.109
 cold-0.106
 wet-0.103
Token things
Token ID1243
Feature activation+8.42e-01

Pos logit contributions
 Exhibit+0.150
icians+0.149
ending+0.147
bors+0.145
arm+0.145
Neg logit contributions
 eye-0.097
 idea-0.090
ishy-0.088
 away-0.086
 like-0.085
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
 guessing+0.041
 facts+0.040
 topics+0.040
 motorcycles+0.040
 tours+0.040
Neg logit contributions
kin-0.031
ivery-0.026
 bruised-0.025
ifying-0.025
robe-0.024
Token but
Token ID475
Feature activation+1.33e-01

Pos logit contributions
 guessing+0.049
 topics+0.048
 facts+0.048
arm+0.048
 tours+0.048
Neg logit contributions
kin-0.042
ivery-0.036
 bruised-0.035
ifying-0.035
ishy-0.033
Token sometimes
Token ID3360
Feature activation+1.72e+00

Pos logit contributions
 guessing+0.005
 facts+0.005
 tours+0.005
 topics+0.005
 motorcycles+0.005
Neg logit contributions
kin-0.006
ivery-0.006
ifying-0.005
 bruised-0.005
ishy-0.005
Token curious
Token ID11040
Feature activation-3.81e-02

Pos logit contributions
kin+0.046
ivery+0.044
ifying+0.040
 bruised+0.040
robe+0.040
Neg logit contributions
 guessing-0.041
 motorcycles-0.038
 discovering-0.037
 tours-0.037
 topics-0.037
Token little
Token ID1310
Feature activation-1.58e-01

Pos logit contributions
kin+0.002
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.002
 facts-0.002
 tours-0.002
 motorcycles-0.002
 topics-0.002
Token boy
Token ID2933
Feature activation+3.71e-01

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.008
 facts-0.008
 motorcycles-0.008
 tours-0.008
 topics-0.008
Token.
Token ID13
Feature activation-4.22e-01

Pos logit contributions
 guessing+0.018
 facts+0.017
 tours+0.017
 topics+0.017
 motorcycles+0.017
Neg logit contributions
kin-0.014
ivery-0.013
ifying-0.012
 bruised-0.012
ishy-0.011
Token He
Token ID679
Feature activation+7.80e-01

Pos logit contributions
kin+0.016
ivery+0.014
 bruised+0.013
ifying+0.013
ishy+0.012
Neg logit contributions
 guessing-0.021
 motorcycles-0.021
 topics-0.021
 facts-0.021
 tours-0.021
Token liked
Token ID8288
Feature activation+5.16e+00

Pos logit contributions
 guessing+0.036
 facts+0.035
 tours+0.035
 topics+0.034
 motorcycles+0.034
Neg logit contributions
kin-0.031
ivery-0.027
 bruised-0.026
ifying-0.026
ishy-0.025
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
ves+0.269
arm+0.257
bors+0.249
icians+0.242
is+0.239
Neg logit contributions
 empty-0.127
 full-0.126
 alone-0.122
 Lily-0.121
 a-0.112
Token explore
Token ID7301
Feature activation+3.08e+00

Pos logit contributions
ups+0.075
 guessing+0.074
 facts+0.072
icians+0.072
ths+0.070
Neg logit contributions
kin-0.082
ivery-0.074
 limp-0.070
 sill-0.068
 bruised-0.067
Token his
Token ID465
Feature activation+1.31e+00

Pos logit contributions
icians+0.167
arm+0.161
ending+0.161
bors+0.159
 facts+0.159
Neg logit contributions
 empty-0.067
he-0.064
 alone-0.062
 like-0.058
kin-0.056
Token neighborhood
Token ID6232
Feature activation+2.20e+00

Pos logit contributions
icians+0.059
 facts+0.059
bors+0.056
 guessing+0.056
 mills+0.055
Neg logit contributions
kin-0.050
robe-0.043
 eye-0.043
ishy-0.043
ifying-0.042
Token.
Token ID13
Feature activation-8.27e-02

Pos logit contributions
 facts+0.109
 guessing+0.109
bors+0.108
icians+0.106
arm+0.105
Neg logit contributions
kin-0.063
ifying-0.057
 sill-0.056
robe-0.055
ivery-0.054
Token She
Token ID1375
Feature activation+6.70e-02

Pos logit contributions
kin+0.023
ivery+0.019
ifying+0.019
 bruised+0.018
ishy+0.017
Neg logit contributions
 guessing-0.030
 tours-0.030
 facts-0.030
 topics-0.029
 discovering-0.029
Token loved
Token ID6151
Feature activation+4.37e+00

Pos logit contributions
 guessing+0.003
 facts+0.003
 tours+0.003
 topics+0.003
 motorcycles+0.003
Neg logit contributions
kin-0.003
ivery-0.003
ifying-0.003
 bruised-0.003
ishy-0.003
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
icians+0.214
ves+0.213
bors+0.210
arm+0.196
 Street+0.194
Neg logit contributions
 Lily-0.135
 cold-0.129
 dry-0.128
 bruised-0.128
 wet-0.127
Token explore
Token ID7301
Feature activation+2.82e+00

Pos logit contributions
 guessing+0.045
 facts+0.044
 topics+0.043
 discovering+0.043
icians+0.043
Neg logit contributions
kin-0.042
ivery-0.037
 bruised-0.034
ifying-0.034
ishy-0.034
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
icians+0.169
 facts+0.160
bors+0.155
 Exhibit+0.155
 guessing+0.155
Neg logit contributions
he-0.060
 empty-0.059
 dry-0.053
kin-0.050
 alone-0.050
Token try
Token ID1949
Feature activation+5.15e+00

Pos logit contributions
 motorcycles+0.081
icians+0.080
cles+0.078
 facts+0.078
arm+0.077
Neg logit contributions
kin-0.059
 sill-0.056
ivery-0.055
 bruised-0.054
 limp-0.054
Token new
Token ID649
Feature activation+3.24e+00

Pos logit contributions
bors+0.272
 occasions+0.270
cles+0.269
icians+0.267
ths+0.250
Neg logit contributions
 down-0.120
 low-0.120
 out-0.117
 okay-0.114
 sad-0.109
Token things
Token ID1243
Feature activation+8.80e-01

Pos logit contributions
icians+0.132
 Exhibit+0.127
arm+0.127
bors+0.125
ending+0.124
Neg logit contributions
 eye-0.104
 sill-0.101
ishy-0.100
 away-0.098
 OK-0.095
Token.
Token ID13
Feature activation+6.60e-01

Pos logit contributions
 guessing+0.047
 facts+0.046
 topics+0.046
 motorcycles+0.045
 mills+0.045
Neg logit contributions
kin-0.028
ivery-0.023
 bruised-0.023
ifying-0.022
robe-0.021
Token Today
Token ID6288
Feature activation+2.18e-01

Pos logit contributions
 guessing+0.034
 facts+0.033
 tours+0.032
 discovering+0.032
 topics+0.032
Neg logit contributions
kin-0.022
ivery-0.020
 bruised-0.019
ifying-0.019
ishy-0.018
Token,
Token ID11
Feature activation+7.23e-01

Pos logit contributions
 guessing+0.008
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.011
ivery-0.010
ifying-0.009
 bruised-0.009
ishy-0.009
Token play
Token ID711
Feature activation+6.11e-01

Pos logit contributions
 guessing+0.029
 facts+0.028
 discovering+0.028
 tours+0.028
 topics+0.027
Neg logit contributions
kin-0.033
ivery-0.029
ifying-0.028
 bruised-0.027
ishy-0.027
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
 facts+0.027
 guessing+0.026
 tours+0.026
 topics+0.026
 discovering+0.025
Neg logit contributions
kin-0.026
ivery-0.023
ifying-0.022
robe-0.021
 bruised-0.021
Token nature
Token ID3450
Feature activation+1.05e+00

Pos logit contributions
 facts+0.067
arm+0.064
ups+0.063
 guessing+0.063
icians+0.063
Neg logit contributions
kin-0.049
ifying-0.042
 bruised-0.041
he-0.041
ivery-0.041
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 guessing+0.057
 facts+0.056
 tours+0.055
 topics+0.055
 discovering+0.054
Neg logit contributions
kin-0.032
ivery-0.027
ifying-0.026
 bruised-0.025
robe-0.025
Token They
Token ID1119
Feature activation-1.15e-01

Pos logit contributions
kin+0.061
ivery+0.058
 bruised+0.055
ifying+0.052
 bleed+0.051
Neg logit contributions
 tours-0.043
 motorcycles-0.042
 topics-0.042
 facts-0.041
 guessing-0.041
Token liked
Token ID8288
Feature activation+5.12e+00

Pos logit contributions
kin+0.005
ivery+0.005
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.005
 facts-0.005
 tours-0.005
 discovering-0.005
 motorcycles-0.005
Token to
Token ID284
Feature activation+1.70e+00

Pos logit contributions
icians+0.285
 occasions+0.283
ves+0.278
ises+0.271
bors+0.270
Neg logit contributions
 like-0.120
 full-0.113
 wet-0.110
 Lily-0.106
 dry-0.104
Token run
Token ID1057
Feature activation+9.93e-01

Pos logit contributions
 motorcycles+0.073
 facts+0.072
 guessing+0.071
ups+0.070
 topics+0.070
Neg logit contributions
kin-0.068
ivery-0.061
ishy-0.058
 limp-0.058
 sill-0.054
Token on
Token ID319
Feature activation+2.94e+00

Pos logit contributions
 guessing+0.044
 facts+0.044
 motorcycles+0.042
 topics+0.042
icians+0.042
Neg logit contributions
kin-0.040
ivery-0.035
robe-0.034
ishy-0.034
ifying-0.034
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
icians+0.140
cles+0.135
arm+0.134
ending+0.129
 guessing+0.127
Neg logit contributions
 empty-0.094
 limp-0.092
kin-0.084
 wet-0.084
 bruised-0.083
Token grass
Token ID8701
Feature activation-3.31e-01

Pos logit contributions
icians+0.078
 topics+0.076
 facts+0.074
 occasions+0.071
 guessing+0.071
Neg logit contributions
kin-0.068
ishy-0.063
robe-0.062
ivery-0.062
 limp-0.061
Token was
Token ID373
Feature activation-4.74e-01

Pos logit contributions
kin+0.002
ivery+0.002
ifying+0.002
 bruised+0.002
ishy+0.002
Neg logit contributions
 guessing-0.001
 facts-0.001
 tours-0.001
 discovering-0.001
 topics-0.001
Token very
Token ID845
Feature activation-7.94e-01

Pos logit contributions
kin+0.023
ivery+0.020
ifying+0.020
ishy+0.019
 bruised+0.018
Neg logit contributions
 guessing-0.020
 discovering-0.019
 tours-0.018
 motorcycles-0.018
 seeing-0.018
Token curious
Token ID11040
Feature activation+8.97e-01

Pos logit contributions
kin+0.035
ifying+0.032
ivery+0.032
robe+0.030
 bruised+0.030
Neg logit contributions
 guessing-0.035
 discovering-0.031
 tours-0.031
 facts-0.031
 topics-0.030
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 guessing+0.048
 tours+0.046
 facts+0.045
icians+0.045
 mills+0.045
Neg logit contributions
kin-0.028
 bruised-0.024
ifying-0.024
ivery-0.024
ishy-0.023
Token He
Token ID679
Feature activation+6.72e-01

Pos logit contributions
 guessing+0.084
icians+0.082
 facts+0.081
 tours+0.079
bors+0.079
Neg logit contributions
kin-0.029
 sill-0.028
ivery-0.026
ishy-0.024
ifying-0.023
Token liked
Token ID8288
Feature activation+5.12e+00

Pos logit contributions
 guessing+0.031
 facts+0.031
 tours+0.031
 topics+0.030
 motorcycles+0.030
Neg logit contributions
kin-0.026
ivery-0.023
 bruised-0.022
ifying-0.022
ishy-0.021
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
ves+0.298
bors+0.282
ises+0.279
is+0.275
icians+0.273
Neg logit contributions
 Lily-0.108
 cold-0.106
 a-0.101
 dark-0.100
 alone-0.099
Token explore
Token ID7301
Feature activation+3.06e+00

Pos logit contributions
 guessing+0.055
 facts+0.054
 motorcycles+0.053
icians+0.053
ups+0.053
Neg logit contributions
kin-0.060
ivery-0.055
 limp-0.051
 sill-0.050
 eye-0.049
Token his
Token ID465
Feature activation+1.63e+00

Pos logit contributions
icians+0.178
bors+0.170
ending+0.167
 occasions+0.167
ths+0.165
Neg logit contributions
 empty-0.063
he-0.062
 alone-0.059
 like-0.057
 dark-0.055
Token surroundings
Token ID21334
Feature activation+1.88e+00

Pos logit contributions
icians+0.080
bors+0.078
 facts+0.077
ves+0.073
 mills+0.072
Neg logit contributions
kin-0.057
 eye-0.052
robe-0.051
 empty-0.050
ishy-0.049
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
icians+0.091
bors+0.090
 guessing+0.090
 tours+0.089
 facts+0.089
Neg logit contributions
kin-0.063
he-0.052
 sill-0.052
 bruised-0.051
ifying-0.049
Token on
Token ID319
Feature activation+1.85e+00

Pos logit contributions
 guessing+0.013
 facts+0.013
 tours+0.013
 topics+0.013
 discovering+0.013
Neg logit contributions
kin-0.014
ivery-0.013
ifying-0.012
 bruised-0.012
robe-0.012
Token the
Token ID262
Feature activation+1.90e+00

Pos logit contributions
 facts+0.088
arm+0.088
cles+0.087
icians+0.087
 guessing+0.086
Neg logit contributions
kin-0.063
ifying-0.050
he-0.048
 bruised-0.048
ivery-0.048
Token farm
Token ID5318
Feature activation+1.21e+00

Pos logit contributions
 facts+0.066
 topics+0.063
icians+0.063
arm+0.060
 discovering+0.060
Neg logit contributions
kin-0.092
ivery-0.084
robe-0.082
 empty-0.079
ifying-0.077
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 guessing+0.062
 facts+0.062
 motorcycles+0.061
 topics+0.061
 tours+0.060
Neg logit contributions
kin-0.040
ivery-0.033
robe-0.031
 bruised-0.031
ifying-0.031
Token They
Token ID1119
Feature activation-1.50e-01

Pos logit contributions
 guessing+0.008
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token liked
Token ID8288
Feature activation+5.10e+00

Pos logit contributions
kin+0.007
ivery+0.006
ifying+0.006
 bruised+0.006
ishy+0.005
Neg logit contributions
 guessing-0.006
 facts-0.006
 tours-0.006
 discovering-0.006
 motorcycles-0.006
Token to
Token ID284
Feature activation+8.92e-01

Pos logit contributions
icians+0.286
 occasions+0.286
ises+0.282
ves+0.279
is+0.275
Neg logit contributions
 Lily-0.104
 full-0.101
 like-0.097
 a-0.097
 wet-0.096
Token run
Token ID1057
Feature activation+1.18e+00

Pos logit contributions
 guessing+0.036
 facts+0.035
 topics+0.035
 motorcycles+0.034
 discovering+0.034
Neg logit contributions
kin-0.040
ivery-0.036
 bruised-0.034
ifying-0.034
ishy-0.033
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
 guessing+0.052
 facts+0.052
 discovering+0.050
 topics+0.050
 motorcycles+0.050
Neg logit contributions
kin-0.047
robe-0.041
ivery-0.041
suit-0.041
ishy-0.041
Token jump
Token ID4391
Feature activation+9.11e-01

Pos logit contributions
 topics+0.081
 facts+0.081
cles+0.078
 occasions+0.078
 feats+0.077
Neg logit contributions
kin-0.052
 bruised-0.048
 limp-0.047
 sill-0.045
cloth-0.044
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 facts+0.038
 guessing+0.037
 topics+0.037
 discovering+0.036
 motorcycles+0.036
Neg logit contributions
kin-0.042
ivery-0.036
ifying-0.035
robe-0.035
 bruised-0.034
Token very
Token ID845
Feature activation-1.12e+00

Pos logit contributions
kin+0.050
ivery+0.046
ifying+0.043
ishy+0.041
 bruised+0.039
Neg logit contributions
 guessing-0.043
 discovering-0.040
 tours-0.037
 learning-0.037
 seeing-0.037
Token brave
Token ID14802
Feature activation+1.37e-01

Pos logit contributions
kin+0.050
ifying+0.048
ivery+0.046
 bruised+0.043
robe+0.042
Neg logit contributions
 guessing-0.049
 discovering-0.044
 tours-0.043
 seeing-0.041
 learning-0.041
Token and
Token ID290
Feature activation+3.33e-01

Pos logit contributions
 guessing+0.005
 facts+0.004
 tours+0.004
 topics+0.004
 motorcycles+0.004
Neg logit contributions
kin-0.007
ivery-0.007
ifying-0.006
 bruised-0.006
ishy-0.006
Token liked
Token ID8288
Feature activation+4.34e+00

Pos logit contributions
 guessing+0.014
 facts+0.014
 tours+0.014
 motorcycles+0.014
 topics+0.013
Neg logit contributions
kin-0.015
ivery-0.013
ifying-0.013
 bruised-0.013
ishy-0.012
Token to
Token ID284
Feature activation+5.92e-01

Pos logit contributions
bors+0.226
icians+0.224
arm+0.223
ves+0.219
ises+0.218
Neg logit contributions
 Lily-0.111
 alone-0.102
 full-0.100
 asleep-0.097
 hers-0.092
Token try
Token ID1949
Feature activation+5.10e+00

Pos logit contributions
 guessing+0.022
 facts+0.022
 motorcycles+0.021
 topics+0.021
 tours+0.021
Neg logit contributions
kin-0.028
ivery-0.026
 bruised-0.024
ifying-0.024
robe-0.023
Token new
Token ID649
Feature activation+3.43e+00

Pos logit contributions
bors+0.266
cles+0.264
 occasions+0.257
icians+0.247
ths+0.246
Neg logit contributions
 out-0.118
 low-0.117
 dark-0.117
 down-0.115
 okay-0.112
Token things
Token ID1243
Feature activation+9.23e-01

Pos logit contributions
arm+0.140
icians+0.139
ending+0.139
bors+0.139
 Exhibit+0.139
Neg logit contributions
 idea-0.100
 eye-0.098
 sill-0.097
 away-0.097
 like-0.095
Token.
Token ID13
Feature activation-5.48e-01

Pos logit contributions
 guessing+0.049
 facts+0.048
 topics+0.048
 motorcycles+0.047
 tours+0.047
Neg logit contributions
kin-0.030
ivery-0.025
 bruised-0.024
ifying-0.024
robe-0.023
Token\n
Token ID198
Feature activation+9.14e-01

Pos logit contributions
kin+0.025
ivery+0.022
ifying+0.021
 bruised+0.021
ishy+0.020
Neg logit contributions
 guessing-0.023
 tours-0.023
 topics-0.022
 discovering-0.022
 facts-0.022
Token\n
Token ID198
Feature activation+6.00e-01

Pos logit contributions
 facts+0.041
 guessing+0.040
 motorcycles+0.039
 tours+0.039
 topics+0.039
Neg logit contributions
kin-0.037
ivery-0.034
ifying-0.031
he-0.031
 bruised-0.031
Token who
Token ID508
Feature activation-1.21e+00

Pos logit contributions
kin+0.011
ivery+0.010
ifying+0.010
 bruised+0.009
ishy+0.009
Neg logit contributions
 guessing-0.014
 discovering-0.013
 facts-0.013
 tours-0.013
 topics-0.013
Token loved
Token ID6151
Feature activation+3.46e+00

Pos logit contributions
kin+0.070
ifying+0.068
ivery+0.067
rier+0.065
 bruised+0.063
Neg logit contributions
 guessing-0.032
 adventures-0.031
 topics-0.031
 discovering-0.031
 facts-0.030
Token nature
Token ID3450
Feature activation+4.69e-02

Pos logit contributions
icians+0.152
bors+0.144
ves+0.142
arm+0.140
 Street+0.128
Neg logit contributions
 bruised-0.130
 empty-0.121
 Lily-0.119
 itch-0.118
 full-0.118
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
 guessing+0.002
 facts+0.002
 tours+0.002
 topics+0.002
 motorcycles+0.002
Neg logit contributions
kin-0.002
ivery-0.002
ifying-0.002
 bruised-0.002
ishy-0.001
Token She
Token ID1375
Feature activation+1.05e+00

Pos logit contributions
kin+0.009
ivery+0.008
ifying+0.007
 bruised+0.007
ishy+0.007
Neg logit contributions
 guessing-0.014
 facts-0.014
 tours-0.014
 topics-0.014
 discovering-0.014
Token liked
Token ID8288
Feature activation+5.09e+00

Pos logit contributions
 facts+0.052
 guessing+0.051
 motorcycles+0.051
 topics+0.050
 tours+0.050
Neg logit contributions
kin-0.040
ivery-0.033
ishy-0.031
 bruised-0.031
robe-0.030
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
icians+0.274
cles+0.255
 Street+0.253
bors+0.250
 site+0.248
Neg logit contributions
 full-0.126
 wind-0.124
 dry-0.120
 wet-0.119
 cold-0.119
Token play
Token ID711
Feature activation+1.81e+00

Pos logit contributions
 motorcycles+0.056
 guessing+0.056
 facts+0.055
 topics+0.054
icians+0.053
Neg logit contributions
kin-0.055
ivery-0.048
ishy-0.044
 limp-0.044
 bruised-0.043
Token in
Token ID287
Feature activation+2.03e+00

Pos logit contributions
 facts+0.101
 motorcycles+0.098
 topics+0.095
 cultures+0.094
ups+0.092
Neg logit contributions
kin-0.059
ivery-0.046
robe-0.046
he-0.045
ishy-0.038
Token the
Token ID262
Feature activation+1.27e+00

Pos logit contributions
 facts+0.118
arm+0.114
cles+0.112
icians+0.112
ups+0.111
Neg logit contributions
kin-0.048
 empty-0.043
he-0.042
 Lily-0.041
 sill-0.041
Token trees
Token ID7150
Feature activation+1.88e-01

Pos logit contributions
 facts+0.054
 guessing+0.051
icians+0.050
 topics+0.050
 motorcycles+0.049
Neg logit contributions
kin-0.053
ivery-0.051
ifying-0.048
ishy-0.048
 bruised-0.046
Token in
Token ID287
Feature activation-5.78e-01

Pos logit contributions
 guessing+0.009
 facts+0.009
 tours+0.009
 motorcycles+0.009
 topics+0.009
Neg logit contributions
kin-0.007
ivery-0.007
ifying-0.006
 bruised-0.006
ishy-0.006
Token the
Token ID262
Feature activation-1.66e-01

Pos logit contributions
kin+0.028
ivery+0.026
ifying+0.025
ishy+0.024
 bruised+0.024
Neg logit contributions
 guessing-0.023
 tours-0.023
 topics-0.022
 discovering-0.022
 facts-0.022
Token classroom
Token ID15806
Feature activation+6.26e-01

Pos logit contributions
kin+0.010
ivery+0.009
 bruised+0.008
ifying+0.008
ishy+0.008
Neg logit contributions
 guessing-0.005
 tours-0.005
 facts-0.005
 topics-0.005
 motorcycles-0.005
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
 guessing+0.032
 facts+0.031
 tours+0.031
 motorcycles+0.031
 topics+0.030
Neg logit contributions
kin-0.022
ivery-0.019
ifying-0.018
 bruised-0.017
robe-0.017
Token She
Token ID1375
Feature activation+7.68e-01

Pos logit contributions
kin+0.041
ivery+0.036
 bruised+0.035
ifying+0.034
ishy+0.032
Neg logit contributions
 guessing-0.033
 topics-0.033
 tours-0.032
 discovering-0.032
 facts-0.031
Token liked
Token ID8288
Feature activation+5.09e+00

Pos logit contributions
 guessing+0.037
 facts+0.036
 tours+0.036
 motorcycles+0.035
 topics+0.035
Neg logit contributions
kin-0.029
ivery-0.025
 bruised-0.024
ifying-0.023
ishy-0.023
Token to
Token ID284
Feature activation+1.56e+00

Pos logit contributions
ves+0.262
arm+0.246
bors+0.245
ises+0.242
icy+0.241
Neg logit contributions
 Lily-0.140
 alone-0.117
 like-0.115
 her-0.113
 empty-0.111
Token touch
Token ID3638
Feature activation+2.15e+00

Pos logit contributions
 motorcycles+0.069
 facts+0.067
 guessing+0.067
icians+0.066
ups+0.066
Neg logit contributions
kin-0.059
ivery-0.052
 limp-0.051
 sill-0.051
cloth-0.048
Token the
Token ID262
Feature activation+4.19e-01

Pos logit contributions
 guessing+0.094
 discovering+0.082
icians+0.082
ths+0.081
 facts+0.081
Neg logit contributions
kin-0.081
 bruised-0.079
ifying-0.077
cloth-0.073
ishy-0.073
Token books
Token ID3835
Feature activation-2.34e-01

Pos logit contributions
 guessing+0.013
 facts+0.013
 tours+0.012
 discovering+0.012
 motorcycles+0.012
Neg logit contributions
kin-0.023
ivery-0.021
ifying-0.021
 bruised-0.020
ishy-0.020
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
kin+0.012
ivery+0.011
ifying+0.011
 bruised+0.011
ishy+0.011
Neg logit contributions
 guessing-0.008
 facts-0.008
 tours-0.007
 discovering-0.007
 motorcycles-0.007
Token toys
Token ID14958
Feature activation-1.84e+00

Pos logit contributions
ivery+0.043
kin+0.043
ifying+0.041
ishy+0.038
 bruised+0.038
Neg logit contributions
 guessing-0.036
 facts-0.035
 tours-0.034
 motorcycles-0.033
 adventures-0.033
Token.
Token ID13
Feature activation-9.05e-01

Pos logit contributions
kin+0.091
ifying+0.090
ivery+0.089
ishy+0.086
 bruised+0.079
Neg logit contributions
 guessing-0.063
 adventures-0.059
 bikes-0.059
 tours-0.059
 facts-0.059
Token She
Token ID1375
Feature activation-1.68e+00

Pos logit contributions
kin+0.039
ivery+0.035
ifying+0.033
ishy+0.033
 bruised+0.032
Neg logit contributions
 topics-0.040
 guessing-0.040
 facts-0.040
 tours-0.039
 discovering-0.038
Token wanted
Token ID2227
Feature activation+3.91e-01

Pos logit contributions
kin+0.094
ivery+0.093
ifying+0.092
ishy+0.085
rier+0.083
Neg logit contributions
 guessing-0.051
 facts-0.048
 rides-0.047
 adventures-0.047
 bikes-0.047
Token her
Token ID607
Feature activation-5.42e-02

Pos logit contributions
 guessing+0.014
 facts+0.014
 tours+0.014
 topics+0.013
 discovering+0.013
Neg logit contributions
kin-0.019
ivery-0.018
 bruised-0.017
ifying-0.017
ishy-0.016
Token bag
Token ID6131
Feature activation-3.99e+00

Pos logit contributions
kin+0.003
ivery+0.002
ifying+0.002
 bruised+0.002
ishy+0.002
Neg logit contributions
 guessing-0.002
 facts-0.002
 tours-0.002
 motorcycles-0.002
 topics-0.002
Token,
Token ID11
Feature activation-4.94e-01

Pos logit contributions
ifying+0.231
ishy+0.204
ivery+0.202
rified+0.193
kin+0.192
Neg logit contributions
 to-0.131
ups-0.107
 adventures-0.105
 bikes-0.096
 rules-0.095
Token her
Token ID607
Feature activation-1.01e+00

Pos logit contributions
kin+0.024
ivery+0.023
ifying+0.023
ishy+0.022
 bruised+0.021
Neg logit contributions
 guessing-0.018
 facts-0.018
 tours-0.017
 discovering-0.017
 motorcycles-0.017
Token book
Token ID1492
Feature activation-2.33e+00

Pos logit contributions
ivery+0.047
kin+0.045
ifying+0.044
ishy+0.041
 bruised+0.041
Neg logit contributions
 guessing-0.039
 facts-0.039
 tours-0.038
 motorcycles-0.037
 adventures-0.037
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
ifying+0.126
ivery+0.120
ishy+0.118
kin+0.116
 bruised+0.107
Neg logit contributions
 guessing-0.073
 adventures-0.073
 facts-0.072
 bikes-0.071
 tours-0.069
Token her
Token ID607
Feature activation-8.59e-01

Pos logit contributions
kin+0.011
ivery+0.011
ifying+0.010
 bruised+0.010
ishy+0.010
Neg logit contributions
 guessing-0.009
 facts-0.009
 tours-0.008
 discovering-0.008
 motorcycles-0.008
Token went
Token ID1816
Feature activation+1.11e+00

Pos logit contributions
 guessing+0.009
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.012
ivery-0.011
ifying-0.010
 bruised-0.010
ishy-0.010
Token home
Token ID1363
Feature activation-1.13e-01

Pos logit contributions
 facts+0.041
 topics+0.040
 motorcycles+0.038
bors+0.038
 mills+0.038
Neg logit contributions
kin-0.051
 bruised-0.048
robe-0.047
ivery-0.047
ifying-0.046
Token and
Token ID290
Feature activation-7.93e-01

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.004
 facts-0.004
 tours-0.004
 discovering-0.004
 topics-0.004
Token put
Token ID1234
Feature activation-1.89e+00

Pos logit contributions
kin+0.040
ivery+0.038
ifying+0.036
ishy+0.035
 bruised+0.034
Neg logit contributions
 guessing-0.031
 discovering-0.030
 seeing-0.028
 tours-0.028
 motorcycles-0.027
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
ivery+0.115
kin+0.107
ifying+0.106
istered+0.102
ishy+0.098
Neg logit contributions
 motorcycles-0.049
 bikes-0.049
 adventures-0.049
 guessing-0.049
ons-0.049
Token box
Token ID3091
Feature activation-3.95e+00

Pos logit contributions
ivery+0.057
kin+0.052
ifying+0.050
ishy+0.049
 bruised+0.048
Neg logit contributions
 guessing-0.035
 tours-0.031
 motorcycles-0.031
 adventures-0.031
 bikes-0.030
Token on
Token ID319
Feature activation-7.58e-01

Pos logit contributions
ivery+0.193
kin+0.190
ifying+0.184
ishy+0.181
 Deal+0.181
Neg logit contributions
 to-0.131
 airplanes-0.112
 guessing-0.111
 adventures-0.110
 exploring-0.108
Token her
Token ID607
Feature activation-1.82e-01

Pos logit contributions
kin+0.039
ivery+0.037
ishy+0.036
 bruised+0.035
ifying+0.034
Neg logit contributions
 guessing-0.028
 tours-0.027
 motorcycles-0.026
 discovering-0.026
 topics-0.026
Token dress
Token ID6576
Feature activation-2.27e+00

Pos logit contributions
kin+0.008
ivery+0.008
ifying+0.007
 bruised+0.007
ishy+0.007
Neg logit contributions
 guessing-0.008
 facts-0.007
 tours-0.007
 motorcycles-0.007
 discovering-0.007
Tokener
Token ID263
Feature activation-1.51e+00

Pos logit contributions
kin+0.114
ivery+0.110
ishy+0.108
ifying+0.107
 bruised+0.107
Neg logit contributions
 adventures-0.071
 guessing-0.071
 bikes-0.070
 tours-0.070
 motorcycles-0.069
Token.
Token ID13
Feature activation-9.36e-01

Pos logit contributions
kin+0.071
ivery+0.068
ishy+0.065
 bruised+0.063
ifying+0.063
Neg logit contributions
 guessing-0.057
 discovering-0.056
 tours-0.053
 facts-0.053
 motorcycles-0.053
Token it
Token ID340
Feature activation-8.63e-02

Pos logit contributions
kin+0.014
ivery+0.013
ifying+0.012
 bruised+0.012
ishy+0.012
Neg logit contributions
 guessing-0.011
 facts-0.010
 discovering-0.010
 tours-0.010
 topics-0.010
Token kept
Token ID4030
Feature activation-7.48e-01

Pos logit contributions
kin+0.004
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 discovering-0.003
 topics-0.003
Token her
Token ID607
Feature activation-1.38e+00

Pos logit contributions
kin+0.036
ivery+0.033
ifying+0.031
ishy+0.031
robe+0.029
Neg logit contributions
 guessing-0.031
 discovering-0.029
 tours-0.028
 facts-0.027
 adventures-0.027
Token warm
Token ID5814
Feature activation-2.48e+00

Pos logit contributions
kin+0.057
ivery+0.056
ifying+0.052
ishy+0.048
 bruised+0.046
Neg logit contributions
 adventures-0.060
 guessing-0.060
 discovering-0.056
 tours-0.056
 journeys-0.056
Token and
Token ID290
Feature activation-8.15e-01

Pos logit contributions
kin+0.124
ivery+0.121
ifying+0.116
istered+0.111
ishy+0.109
Neg logit contributions
 adventures-0.085
 journeys-0.076
 guessing-0.076
 exploring-0.075
 discovering-0.074
Token cos
Token ID8615
Feature activation-3.92e+00

Pos logit contributions
kin+0.030
ivery+0.028
ifying+0.027
ishy+0.025
robe+0.024
Neg logit contributions
 guessing-0.040
 discovering-0.039
 facts-0.038
 tours-0.037
 adventures-0.037
Tokeny
Token ID88
Feature activation-1.51e+00

Pos logit contributions
 Deal+0.218
 limp+0.199
 trigger+0.197
 sorry+0.197
 spoiler+0.195
Neg logit contributions
 to-0.087
we-0.087
 discovering-0.081
and-0.076
 seeing-0.075
Token all
Token ID477
Feature activation-2.05e+00

Pos logit contributions
kin+0.065
ivery+0.064
ifying+0.060
ishy+0.057
 bruised+0.056
Neg logit contributions
 guessing-0.062
 adventures-0.060
 discovering-0.058
 facts-0.057
 topics-0.057
Token night
Token ID1755
Feature activation-4.77e-01

Pos logit contributions
kin+0.104
ivery+0.099
ifying+0.093
istered+0.091
ishy+0.088
Neg logit contributions
 discovering-0.073
 adventures-0.072
 guessing-0.072
 exploring-0.069
 tours-0.066
Token long
Token ID890
Feature activation-6.55e-01

Pos logit contributions
kin+0.020
ivery+0.019
 bruised+0.018
ifying+0.017
ishy+0.017
Neg logit contributions
 guessing-0.021
 tours-0.020
 discovering-0.020
 facts-0.020
 topics-0.019
Token.
Token ID13
Feature activation-7.43e-01

Pos logit contributions
kin+0.028
ivery+0.026
ifying+0.024
 bruised+0.024
ishy+0.023
Neg logit contributions
 guessing-0.029
 discovering-0.027
 facts-0.027
 tours-0.027
 topics-0.027
Token her
Token ID607
Feature activation-9.07e-01

Pos logit contributions
kin+0.023
ivery+0.022
ifying+0.020
 bruised+0.019
ishy+0.019
Neg logit contributions
 guessing-0.016
 tours-0.016
 motorcycles-0.016
 facts-0.016
 discovering-0.015
Token brother
Token ID3956
Feature activation-7.63e-01

Pos logit contributions
kin+0.038
ivery+0.037
ifying+0.034
 bruised+0.033
ishy+0.033
Neg logit contributions
 guessing-0.040
 adventures-0.038
 tours-0.038
 facts-0.037
 discovering-0.036
Token and
Token ID290
Feature activation+1.98e-01

Pos logit contributions
kin+0.041
ivery+0.039
ifying+0.038
 bruised+0.036
ishy+0.036
Neg logit contributions
 guessing-0.025
 facts-0.024
 tours-0.024
 discovering-0.023
 topics-0.023
Token get
Token ID651
Feature activation-1.34e+00

Pos logit contributions
 guessing+0.008
 facts+0.007
 tours+0.007
 topics+0.007
 motorcycles+0.007
Neg logit contributions
kin-0.009
ivery-0.009
 bruised-0.008
ifying-0.008
ishy-0.008
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
kin+0.081
ivery+0.077
ifying+0.071
ishy+0.069
robe+0.066
Neg logit contributions
 guessing-0.041
 motorcycles-0.037
 discovering-0.036
 tours-0.035
 exploring-0.034
Token blanket
Token ID18447
Feature activation-3.89e+00

Pos logit contributions
kin+0.064
ivery+0.062
ifying+0.057
 bruised+0.056
ishy+0.056
Neg logit contributions
 adventures-0.059
 guessing-0.058
 tours-0.057
 bikes-0.056
 motorcycles-0.055
Token back
Token ID736
Feature activation-1.94e+00

Pos logit contributions
kin+0.213
ivery+0.210
ishy+0.208
 disgust+0.204
ifying+0.202
Neg logit contributions
 to-0.119
 airplanes-0.089
 guessing-0.086
 adventures-0.081
 exploring-0.075
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
kin+0.102
ivery+0.098
ishy+0.090
ifying+0.089
istered+0.086
Neg logit contributions
 guessing-0.065
 discovering-0.062
 to-0.058
 exploring-0.058
 motorcycles-0.057
Token Lily
Token ID20037
Feature activation+1.01e+00

Pos logit contributions
kin+0.049
ivery+0.042
ifying+0.040
 bruised+0.037
ishy+0.035
Neg logit contributions
 guessing-0.061
 topics-0.057
 discovering-0.056
 tours-0.056
 motorcycles-0.056
Token nodded
Token ID14464
Feature activation-1.20e+00

Pos logit contributions
 facts+0.048
 tours+0.047
 guessing+0.047
 topics+0.046
 discovering+0.046
Neg logit contributions
kin-0.040
ivery-0.035
 bruised-0.033
cloth-0.032
ishy-0.031
Token her
Token ID607
Feature activation-1.18e+00

Pos logit contributions
kin+0.069
ifying+0.068
ivery+0.066
ishy+0.064
 bruised+0.062
Neg logit contributions
 guessing-0.033
 discovering-0.030
 seeing-0.030
 motorcycles-0.028
 adventures-0.027
Token if
Token ID611
Feature activation+6.25e-01

Pos logit contributions
 tours+0.103
 motorcycles+0.097
 guessing+0.097
 mills+0.096
 journeys+0.095
Neg logit contributions
kin-0.082
cloth-0.068
robe-0.066
 bruised-0.064
ifying-0.062
Token she
Token ID673
Feature activation-8.85e-01

Pos logit contributions
 guessing+0.026
 facts+0.025
 tours+0.025
 topics+0.025
 motorcycles+0.024
Neg logit contributions
kin-0.028
 bruised-0.024
ivery-0.024
ifying-0.023
robe-0.023
Token could
Token ID714
Feature activation-1.30e+00

Pos logit contributions
kin+0.043
ivery+0.043
ifying+0.040
 bruised+0.038
robe+0.037
Neg logit contributions
 guessing-0.032
 facts-0.031
arm-0.030
 motorcycles-0.030
 discovering-0.030
Token have
Token ID423
Feature activation+8.63e-02

Pos logit contributions
kin+0.066
ivery+0.064
ifying+0.060
ishy+0.059
 bruised+0.059
Neg logit contributions
 guessing-0.044
 discovering-0.040
 seeing-0.039
 tours-0.038
 adventures-0.038
Token her
Token ID607
Feature activation-1.14e+00

Pos logit contributions
 guessing+0.004
 facts+0.003
 tours+0.003
 motorcycles+0.003
 topics+0.003
Neg logit contributions
kin-0.004
ivery-0.004
ifying-0.003
 bruised-0.003
ishy-0.003
Token blanket
Token ID18447
Feature activation-3.85e+00

Pos logit contributions
kin+0.048
ivery+0.048
ifying+0.043
 bruised+0.042
ishy+0.042
Neg logit contributions
 guessing-0.049
 adventures-0.048
 tours-0.046
 facts-0.045
 journeys-0.044
Token back
Token ID736
Feature activation-1.47e+00

Pos logit contributions
ivery+0.232
kin+0.218
ishy+0.216
ifying+0.215
 disgust+0.210
Neg logit contributions
 to-0.104
 guessing-0.082
 adventures-0.076
 discovering-0.069
 exploring-0.068
Token and
Token ID290
Feature activation-6.06e-01

Pos logit contributions
kin+0.075
ivery+0.074
ifying+0.066
ishy+0.066
 bruised+0.066
Neg logit contributions
 guessing-0.052
 discovering-0.050
 adventures-0.047
 tours-0.047
 seeing-0.046
Token her
Token ID607
Feature activation-1.51e+00

Pos logit contributions
kin+0.030
ivery+0.028
ifying+0.027
ishy+0.026
 bruised+0.025
Neg logit contributions
 guessing-0.024
 facts-0.022
 discovering-0.022
 motorcycles-0.022
 tours-0.021
Token brother
Token ID3956
Feature activation-1.93e+00

Pos logit contributions
ivery+0.074
ifying+0.071
kin+0.070
ishy+0.066
he+0.065
Neg logit contributions
 adventures-0.056
 guessing-0.056
 motorcycles-0.052
 facts-0.050
 computers-0.050
Token gave
Token ID2921
Feature activation-1.63e+00

Pos logit contributions
ivery+0.100
kin+0.098
ifying+0.097
ishy+0.088
 bruised+0.084
Neg logit contributions
 guessing-0.066
 facts-0.064
 adventures-0.060
 motorcycles-0.060
 discovering-0.059
Token she
Token ID673
Feature activation-9.82e-01

Pos logit contributions
ivery+0.073
ifying+0.070
kin+0.070
ishy+0.068
robe+0.064
Neg logit contributions
 guessing-0.042
 discovering-0.039
 seeing-0.038
 learning-0.035
 motorcycles-0.035
Token forgot
Token ID16453
Feature activation+9.33e-01

Pos logit contributions
ifying+0.048
ivery+0.047
kin+0.046
 bruised+0.043
ishy+0.042
Neg logit contributions
 guessing-0.035
arm-0.031
 discovering-0.031
 motorcycles-0.031
 adventures-0.030
Token to
Token ID284
Feature activation-1.79e+00

Pos logit contributions
 guessing+0.030
 tours+0.029
 topics+0.029
 facts+0.028
icians+0.028
Neg logit contributions
kin-0.049
 bruised-0.046
ifying-0.045
ivery-0.045
cloth-0.043
Token put
Token ID1234
Feature activation-2.02e+00

Pos logit contributions
ifying+0.099
robe+0.094
kin+0.094
ivery+0.090
ishy+0.085
Neg logit contributions
 guessing-0.057
 seeing-0.054
 discovering-0.053
 facts-0.052
 adventures-0.051
Token the
Token ID262
Feature activation-8.70e-01

Pos logit contributions
ivery+0.133
kin+0.127
ifying+0.122
robe+0.117
istered+0.117
Neg logit contributions
 guessing-0.041
ons-0.040
 tours-0.038
 motorcycles-0.038
 adventures-0.036
Token bowl
Token ID9396
Feature activation-3.85e+00

Pos logit contributions
kin+0.044
ivery+0.043
ifying+0.040
 bruised+0.038
robe+0.037
Neg logit contributions
 guessing-0.032
 facts-0.029
 motorcycles-0.029
 tours-0.029
 topics-0.028
Token away
Token ID1497
Feature activation-2.39e+00

Pos logit contributions
ivery+0.198
 Deal+0.194
 spoiler+0.187
kin+0.187
 relief+0.186
Neg logit contributions
 to-0.130
ups-0.100
 guessing-0.093
ons-0.090
 discovering-0.081
Token.
Token ID13
Feature activation-6.74e-01

Pos logit contributions
ivery+0.131
kin+0.127
ishy+0.122
ifying+0.118
 bruised+0.116
Neg logit contributions
 to-0.069
 guessing-0.068
 discovering-0.065
 seeing-0.062
 facts-0.059
Token
Token ID198
Feature activation+6.24e-01

Pos logit contributions
kin+0.037
ivery+0.033
ifying+0.033
 bruised+0.032
ishy+0.031
Neg logit contributions
 guessing-0.023
 tours-0.022
 topics-0.021
 discovering-0.021
 motorcycles-0.021
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 guessing+0.026
 facts+0.026
 tours+0.025
 topics+0.025
 motorcycles+0.025
Neg logit contributions
kin-0.027
ivery-0.025
ifying-0.023
 bruised-0.023
he-0.022
TokenAs
Token ID1722
Feature activation-6.23e-01

Pos logit contributions
 guessing+0.011
 facts+0.011
 tours+0.011
 motorcycles+0.011
 discovering+0.011
Neg logit contributions
kin-0.015
ivery-0.014
ifying-0.014
 bruised-0.013
ishy-0.013
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
ivery+0.067
kin+0.065
ishy+0.064
ifying+0.060
 bruised+0.059
Neg logit contributions
 guessing-0.032
 tours-0.030
 topics-0.029
 discovering-0.029
 adventures-0.028
Token box
Token ID3091
Feature activation-1.35e+00

Pos logit contributions
kin+0.062
ivery+0.058
ishy+0.053
 bruised+0.051
ifying+0.051
Neg logit contributions
 guessing-0.038
 tours-0.037
 motorcycles-0.036
 bikes-0.035
 topics-0.035
Token and
Token ID290
Feature activation-7.84e-01

Pos logit contributions
kin+0.063
ivery+0.062
ifying+0.058
ishy+0.058
 bruised+0.057
Neg logit contributions
 guessing-0.053
 discovering-0.049
 facts-0.048
 tours-0.047
 seeing-0.046
Token put
Token ID1234
Feature activation-1.99e+00

Pos logit contributions
ivery+0.037
kin+0.037
ifying+0.034
ishy+0.033
 bruised+0.031
Neg logit contributions
 guessing-0.032
 facts-0.030
 discovering-0.030
 tours-0.029
 seeing-0.029
Token the
Token ID262
Feature activation-2.30e+00

Pos logit contributions
ivery+0.154
kin+0.143
ishy+0.133
istered+0.131
ifying+0.130
Neg logit contributions
 bikes-0.021
 guessing-0.020
 motorcycles-0.019
 tours-0.018
ons-0.017
Token lid
Token ID19789
Feature activation-3.82e+00

Pos logit contributions
kin+0.107
ivery+0.104
ishy+0.095
gan+0.094
ifying+0.092
Neg logit contributions
 bikes-0.087
 guessing-0.085
 instructions-0.084
 adventures-0.084
 motorcycles-0.083
Token on
Token ID319
Feature activation-1.61e+00

Pos logit contributions
ivery+0.206
kin+0.191
 seeker+0.181
ishy+0.175
 Deal+0.175
Neg logit contributions
 to-0.127
ups-0.102
 guessing-0.099
 airplanes-0.092
 instructions-0.091
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
ivery+0.105
kin+0.101
ishy+0.097
 bruised+0.094
ifying+0.092
Neg logit contributions
 guessing-0.036
 tours-0.033
 motorcycles-0.032
 bikes-0.032
 instructions-0.030
Token They
Token ID1119
Feature activation-1.51e+00

Pos logit contributions
kin+0.013
ivery+0.012
 bruised+0.011
ifying+0.011
ishy+0.010
Neg logit contributions
 guessing-0.014
 tours-0.013
 facts-0.013
 topics-0.013
 discovering-0.013
Token ran
Token ID4966
Feature activation+3.61e-01

Pos logit contributions
ivery+0.074
kin+0.074
ifying+0.069
ishy+0.066
robe+0.065
Neg logit contributions
 guessing-0.053
 mills-0.050
arm-0.049
 seeing-0.049
 motorcycles-0.049
Token to
Token ID284
Feature activation-3.81e-01

Pos logit contributions
 guessing+0.011
 facts+0.011
 topics+0.011
 tours+0.011
 motorcycles+0.011
Neg logit contributions
kin-0.019
ivery-0.018
ifying-0.018
 bruised-0.017
ishy-0.017
Token He
Token ID679
Feature activation-9.28e-01

Pos logit contributions
kin+0.026
ivery+0.023
ifying+0.022
 bruised+0.021
ishy+0.021
Neg logit contributions
 guessing-0.026
 motorcycles-0.025
 tours-0.025
 facts-0.024
 mills-0.024
Token forgot
Token ID16453
Feature activation+1.74e+00

Pos logit contributions
kin+0.046
ivery+0.043
ifying+0.043
 bruised+0.040
ishy+0.039
Neg logit contributions
 guessing-0.036
 motorcycles-0.033
 topics-0.032
 adventures-0.032
 facts-0.032
Token to
Token ID284
Feature activation-2.31e+00

Pos logit contributions
 topics+0.064
icians+0.063
 facts+0.062
 guessing+0.062
 tours+0.061
Neg logit contributions
kin-0.077
 bruised-0.076
cloth-0.071
ifying-0.071
 OK-0.070
Token put
Token ID1234
Feature activation-1.28e+00

Pos logit contributions
ifying+0.130
kin+0.118
ishy+0.117
ivery+0.116
 bruised+0.116
Neg logit contributions
 guessing-0.073
 tours-0.070
 adventures-0.068
 events-0.067
 discovering-0.067
Token his
Token ID465
Feature activation-7.99e-01

Pos logit contributions
ivery+0.074
kin+0.070
ifying+0.066
ishy+0.063
 bruised+0.062
Neg logit contributions
 motorcycles-0.040
 guessing-0.039
 tours-0.038
 bikes-0.036
 adventures-0.035
Token books
Token ID3835
Feature activation-3.78e+00

Pos logit contributions
ivery+0.037
kin+0.036
ifying+0.034
 bruised+0.034
ishy+0.032
Neg logit contributions
 guessing-0.031
 tours-0.030
 motorcycles-0.030
 facts-0.029
 bikes-0.029
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
kin+0.166
ivery+0.164
 thirst+0.160
 scalp+0.158
ishy+0.157
Neg logit contributions
 to-0.148
 guessing-0.126
 bikes-0.117
 tours-0.117
 motorcycles-0.117
Token lunch
Token ID9965
Feature activation-2.04e+00

Pos logit contributions
kin+0.052
ivery+0.052
ifying+0.049
ishy+0.048
 bruised+0.047
Neg logit contributions
 guessing-0.031
 tours-0.030
 motorcycles-0.030
 facts-0.030
 bikes-0.029
Token in
Token ID287
Feature activation-1.42e+00

Pos logit contributions
ivery+0.106
ifying+0.101
kin+0.098
 bruised+0.097
ishy+0.097
Neg logit contributions
 guessing-0.069
 tours-0.062
 adventures-0.061
 bikes-0.060
 motorcycles-0.060
Token the
Token ID262
Feature activation-9.42e-01

Pos logit contributions
ivery+0.085
kin+0.082
ishy+0.080
ifying+0.076
 bruised+0.074
Neg logit contributions
 guessing-0.040
 motorcycles-0.040
 adventures-0.039
 tours-0.039
 bikes-0.036
Token bag
Token ID6131
Feature activation-2.59e+00

Pos logit contributions
kin+0.053
ivery+0.050
 bruised+0.047
ishy+0.046
ifying+0.046
Neg logit contributions
 guessing-0.030
 tours-0.029
 motorcycles-0.028
 topics-0.027
 discovering-0.027
Token and
Token ID290
Feature activation-1.52e+00

Pos logit contributions
kin+0.007
ivery+0.006
ifying+0.006
 bruised+0.006
ishy+0.006
Neg logit contributions
 guessing-0.005
 facts-0.005
 tours-0.005
 discovering-0.005
 motorcycles-0.005
Token wanted
Token ID2227
Feature activation+1.24e+00

Pos logit contributions
kin+0.075
ivery+0.068
ifying+0.067
ishy+0.062
robe+0.060
Neg logit contributions
 guessing-0.062
 seeing-0.059
 discovering-0.058
 facts-0.057
 learning-0.056
Token to
Token ID284
Feature activation-8.28e-01

Pos logit contributions
 tours+0.034
 guessing+0.034
 topics+0.034
icians+0.033
arm+0.033
Neg logit contributions
kin-0.067
 bruised-0.064
 sill-0.059
ivery-0.059
ifying-0.059
Token get
Token ID651
Feature activation-9.80e-01

Pos logit contributions
kin+0.045
ifying+0.043
ivery+0.042
ishy+0.040
 bruised+0.040
Neg logit contributions
 guessing-0.028
 tours-0.026
 facts-0.025
 adventures-0.024
 discovering-0.024
Token her
Token ID607
Feature activation-1.28e+00

Pos logit contributions
kin+0.052
ivery+0.048
ifying+0.044
ishy+0.042
robe+0.042
Neg logit contributions
 guessing-0.037
 discovering-0.034
 motorcycles-0.033
 tours-0.033
 facts-0.033
Token ball
Token ID2613
Feature activation-3.77e+00

Pos logit contributions
ivery+0.062
kin+0.061
ifying+0.056
ishy+0.054
 bruised+0.052
Neg logit contributions
 guessing-0.050
 tours-0.046
 motorcycles-0.046
 adventures-0.045
 facts-0.045
Token back
Token ID736
Feature activation-1.86e+00

Pos logit contributions
ifying+0.207
kin+0.198
ivery+0.191
 disgust+0.191
ishy+0.185
Neg logit contributions
 to-0.127
 guessing-0.096
 adventures-0.096
 bikes-0.095
 motorcycles-0.085
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
kin+0.102
ivery+0.097
ifying+0.092
ishy+0.089
istered+0.088
Neg logit contributions
 guessing-0.061
 discovering-0.059
 adventures-0.057
 to-0.055
 tours-0.055
Token
Token ID198
Feature activation+8.49e-01

Pos logit contributions
 guessing+0.026
 facts+0.026
 tours+0.025
 motorcycles+0.025
 discovering+0.025
Neg logit contributions
kin-0.017
ivery-0.015
 bruised-0.014
ifying-0.014
ishy-0.013
Token
Token ID198
Feature activation+8.43e-01

Pos logit contributions
 guessing+0.038
 facts+0.038
 tours+0.037
 motorcycles+0.036
 topics+0.036
Neg logit contributions
kin-0.034
ivery-0.031
he-0.029
 bruised-0.029
ifying-0.028
TokenAmy
Token ID40797
Feature activation+1.44e-01

Pos logit contributions
 facts+0.034
 guessing+0.034
 tours+0.033
 motorcycles+0.033
 discovering+0.032
Neg logit contributions
kin-0.039
ivery-0.036
ifying-0.033
he-0.032
 bruised-0.032
Token it
Token ID340
Feature activation-5.41e-03

Pos logit contributions
kin+0.022
ivery+0.021
ifying+0.019
 bruised+0.019
ishy+0.019
Neg logit contributions
 guessing-0.016
 discovering-0.015
 facts-0.015
 tours-0.015
 topics-0.015
Token kept
Token ID4030
Feature activation-4.94e-01

Pos logit contributions
kin+0.000
ivery+0.000
ifying+0.000
 bruised+0.000
ishy+0.000
Neg logit contributions
 guessing-0.000
 facts-0.000
 tours-0.000
 topics-0.000
 discovering-0.000
Token her
Token ID607
Feature activation-1.37e+00

Pos logit contributions
kin+0.023
ivery+0.021
ifying+0.020
ishy+0.019
 bruised+0.019
Neg logit contributions
 guessing-0.021
 discovering-0.020
 tours-0.019
 facts-0.019
 motorcycles-0.019
Token warm
Token ID5814
Feature activation-2.54e+00

Pos logit contributions
kin+0.058
ivery+0.057
ifying+0.051
ishy+0.049
 bruised+0.047
Neg logit contributions
 guessing-0.059
 adventures-0.059
 facts-0.056
 discovering-0.056
 tours-0.055
Token and
Token ID290
Feature activation-7.07e-01

Pos logit contributions
kin+0.126
ivery+0.125
ifying+0.117
istered+0.113
ishy+0.111
Neg logit contributions
 adventures-0.087
 guessing-0.080
 discovering-0.079
 exploring-0.079
 journeys-0.078
Token cos
Token ID8615
Feature activation-3.77e+00

Pos logit contributions
kin+0.027
ivery+0.025
ifying+0.023
ishy+0.022
robe+0.021
Neg logit contributions
 guessing-0.035
 discovering-0.033
 facts-0.033
 tours-0.032
 adventures-0.032
Tokeny
Token ID88
Feature activation-1.65e+00

Pos logit contributions
 Deal+0.209
 sorry+0.195
 limp+0.194
 trigger+0.193
ivery+0.190
Neg logit contributions
we-0.083
 to-0.081
 discovering-0.078
 seeing-0.069
and-0.069
Token all
Token ID477
Feature activation-1.81e+00

Pos logit contributions
kin+0.072
ivery+0.071
ifying+0.065
ishy+0.063
 bruised+0.062
Neg logit contributions
 guessing-0.067
 adventures-0.064
 discovering-0.063
 facts-0.063
 topics-0.062
Token night
Token ID1755
Feature activation-4.79e-01

Pos logit contributions
kin+0.090
ivery+0.085
ifying+0.078
istered+0.077
ishy+0.076
Neg logit contributions
 discovering-0.067
 guessing-0.067
 adventures-0.064
 tours-0.062
 exploring-0.062
Token long
Token ID890
Feature activation-6.12e-01

Pos logit contributions
kin+0.020
ivery+0.019
 bruised+0.018
ifying+0.017
ishy+0.017
Neg logit contributions
 guessing-0.021
 tours-0.020
 discovering-0.020
 facts-0.020
 topics-0.020
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
kin+0.026
ivery+0.024
ifying+0.022
 bruised+0.022
ishy+0.022
Neg logit contributions
 guessing-0.027
 facts-0.026
 discovering-0.026
 tours-0.026
 topics-0.025
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
 guessing+0.008
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.008
ivery-0.007
ifying-0.007
 bruised-0.007
ishy-0.006
Token and
Token ID290
Feature activation+8.25e-02

Pos logit contributions
 guessing+0.008
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.008
ivery-0.008
ifying-0.007
 bruised-0.007
ishy-0.007
Token how
Token ID703
Feature activation-9.80e-01

Pos logit contributions
 guessing+0.003
 facts+0.003
 tours+0.003
 topics+0.003
 discovering+0.003
Neg logit contributions
kin-0.004
ivery-0.004
ifying-0.004
 bruised-0.003
ishy-0.003
Token light
Token ID1657
Feature activation-8.06e-01

Pos logit contributions
kin+0.049
ivery+0.047
ifying+0.046
 bruised+0.044
robe+0.042
Neg logit contributions
 guessing-0.035
 tours-0.034
 facts-0.034
 discovering-0.033
 adventures-0.032
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
kin+0.042
ivery+0.040
ifying+0.038
 bruised+0.037
ishy+0.037
Neg logit contributions
 guessing-0.027
 facts-0.026
 tours-0.026
 motorcycles-0.025
 topics-0.025
Token boun
Token ID31283
Feature activation-3.76e+00

Pos logit contributions
 guessing+0.016
 tours+0.015
 facts+0.015
 motorcycles+0.015
 topics+0.015
Neg logit contributions
kin-0.020
ivery-0.018
 bruised-0.018
ifying-0.018
ishy-0.017
Tokency
Token ID948
Feature activation+4.40e-01

Pos logit contributions
 OK+0.132
ivery+0.130
 sorry+0.130
 bruised+0.127
istered+0.125
Neg logit contributions
ons-0.155
ites-0.145
st-0.144
cles-0.143
 topics-0.143
Token it
Token ID340
Feature activation-9.99e-01

Pos logit contributions
 guessing+0.018
 facts+0.017
 tours+0.017
 topics+0.017
 motorcycles+0.016
Neg logit contributions
kin-0.021
ivery-0.018
ifying-0.018
 bruised-0.018
ishy-0.017
Token was
Token ID373
Feature activation-2.18e-01

Pos logit contributions
kin+0.049
ivery+0.047
ifying+0.047
 bruised+0.043
ishy+0.043
Neg logit contributions
 guessing-0.037
 facts-0.034
 adventures-0.034
 tours-0.034
 motorcycles-0.033
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.009
 bruised+0.009
ishy+0.008
Neg logit contributions
 guessing-0.009
 facts-0.008
 tours-0.008
 discovering-0.008
 motorcycles-0.008
Token
Token ID220
Feature activation-2.51e-01

Pos logit contributions
 guessing+0.005
 facts+0.005
 tours+0.005
 topics+0.005
 motorcycles+0.005
Neg logit contributions
kin-0.004
ivery-0.004
ifying-0.003
 bruised-0.003
ishy-0.003
Token yawn
Token ID40908
Feature activation-1.60e+00

Pos logit contributions
 guessing+0.018
 facts+0.017
 tours+0.017
 topics+0.017
 motorcycles+0.017
Neg logit contributions
kin-0.020
ivery-0.018
 bruised-0.017
ifying-0.017
ishy-0.016
Tokened
Token ID276
Feature activation-1.94e+00

Pos logit contributions
kin+0.070
ivery+0.061
ifying+0.061
 bruised+0.060
ishy+0.057
Neg logit contributions
 guessing-0.064
 facts-0.061
 motorcycles-0.061
ups-0.059
arm-0.059
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
kin+0.132
ivery+0.126
ifying+0.121
ishy+0.117
 bruised+0.112
Neg logit contributions
 guessing-0.036
 motorcycles-0.033
 discovering-0.032
 adventures-0.032
 seeing-0.032
Token rubbed
Token ID31862
Feature activation-1.81e+00

Pos logit contributions
kin+0.098
ivery+0.096
ifying+0.087
ishy+0.084
he+0.081
Neg logit contributions
 guessing-0.075
 seeing-0.070
 facts-0.069
 discovering-0.067
 walks-0.066
Token her
Token ID607
Feature activation-1.55e+00

Pos logit contributions
ivery+0.096
kin+0.095
ishy+0.087
ifying+0.087
 bruised+0.083
Neg logit contributions
 facts-0.050
 bikes-0.049
 guessing-0.049
 motorcycles-0.048
 topics-0.048
Token eyes
Token ID2951
Feature activation-3.76e+00

Pos logit contributions
ivery+0.068
kin+0.062
he+0.057
ishy+0.056
ifying+0.055
Neg logit contributions
 guessing-0.064
 motorcycles-0.062
 facts-0.062
 tricks-0.062
 bikes-0.061
Token.
Token ID13
Feature activation-6.74e-01

Pos logit contributions
kin+0.221
ivery+0.205
 seeker+0.200
robe+0.193
idget+0.193
Neg logit contributions
 to-0.098
 adventures-0.077
 guessing-0.074
 airplanes-0.071
 instructions-0.070
Token She
Token ID1375
Feature activation-1.23e+00

Pos logit contributions
kin+0.030
ivery+0.027
ifying+0.025
 bruised+0.024
ishy+0.024
Neg logit contributions
 guessing-0.029
 tours-0.028
 discovering-0.028
 facts-0.028
 motorcycles-0.028
Token wanted
Token ID2227
Feature activation+8.63e-01

Pos logit contributions
kin+0.054
ivery+0.054
ifying+0.050
ishy+0.047
 bruised+0.047
Neg logit contributions
 guessing-0.050
 facts-0.048
 adventures-0.046
 motorcycles-0.046
 discovering-0.045
Token to
Token ID284
Feature activation-1.36e+00

Pos logit contributions
 guessing+0.023
 tours+0.022
 topics+0.022
 facts+0.021
 discovering+0.021
Neg logit contributions
kin-0.050
 bruised-0.047
ifying-0.045
ivery-0.045
 sill-0.044
Token go
Token ID467
Feature activation-4.11e-01

Pos logit contributions
kin+0.069
ivery+0.066
ifying+0.066
ishy+0.065
 bruised+0.063
Neg logit contributions
 guessing-0.048
 tours-0.043
 facts-0.043
 adventures-0.043
 discovering-0.042
Token They
Token ID1119
Feature activation-2.04e+00

Pos logit contributions
kin+0.075
ivery+0.068
ishy+0.062
 bruised+0.061
he+0.058
Neg logit contributions
 mills-0.048
 facts-0.047
 topics-0.047
 museums-0.047
 guessing-0.047
Token we
Token ID356
Feature activation-2.16e+00

Pos logit contributions
kin+0.105
ifying+0.102
ivery+0.100
ishy+0.096
rier+0.095
Neg logit contributions
 adventures-0.067
 guessing-0.066
 facts-0.065
 seeing-0.061
 airports-0.060
Tokenpt
Token ID457
Feature activation-1.07e+00

Pos logit contributions
kin+0.093
ivery+0.079
rier+0.079
ifying+0.077
robe+0.075
Neg logit contributions
 adventures-0.089
 guessing-0.086
 facts-0.083
 motorcycles-0.083
 tours-0.082
Token until
Token ID1566
Feature activation-1.89e+00

Pos logit contributions
kin+0.055
ifying+0.050
ivery+0.050
ishy+0.048
 bruised+0.047
Neg logit contributions
 guessing-0.037
 discovering-0.034
 facts-0.034
 mills-0.033
 tours-0.033
Token they
Token ID484
Feature activation-2.80e+00

Pos logit contributions
ivery+0.123
ishy+0.118
kin+0.114
suit+0.112
umb+0.110
Neg logit contributions
 guessing-0.044
 seeing-0.040
 tours-0.039
 discovering-0.039
 adventures-0.039
Token felt
Token ID2936
Feature activation-3.74e+00

Pos logit contributions
rier+0.153
ishy+0.149
ivery+0.147
ifying+0.147
kin+0.144
Neg logit contributions
 adventures-0.078
 seeing-0.077
 guessing-0.071
 whales-0.068
 museums-0.064
Token better
Token ID1365
Feature activation-1.42e+00

Pos logit contributions
ifying+0.175
kin+0.172
ivery+0.170
cy+0.169
suit+0.159
Neg logit contributions
 adventures-0.128
 guessing-0.127
 to-0.123
 seeing-0.122
 tricks-0.119
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
kin+0.071
ivery+0.068
ifying+0.066
ishy+0.065
robe+0.061
Neg logit contributions
 guessing-0.050
 discovering-0.047
 adventures-0.046
 seeing-0.046
 facts-0.045
Token Then
Token ID3244
Feature activation-8.02e-01

Pos logit contributions
kin+0.077
ivery+0.071
ishy+0.065
 bruised+0.063
he+0.062
Neg logit contributions
 mills-0.042
 topics-0.042
 guessing-0.041
 facts-0.041
 museums-0.041
Token they
Token ID484
Feature activation-2.40e+00

Pos logit contributions
kin+0.050
ivery+0.049
ifying+0.046
ishy+0.045
 bruised+0.044
Neg logit contributions
 guessing-0.020
 tours-0.019
 facts-0.018
 discovering-0.018
 adventures-0.017
Token buried
Token ID11694
Feature activation-2.06e+00

Pos logit contributions
ivery+0.123
kin+0.121
ishy+0.115
ifying+0.114
rier+0.113
Neg logit contributions
 airplanes-0.074
 planes-0.073
 whales-0.073
 adventures-0.072
 walks-0.072
Token.
Token ID13
Feature activation-6.26e-01

Pos logit contributions
ivery+0.073
kin+0.071
ishy+0.068
 bruised+0.068
ifying+0.068
Neg logit contributions
 guessing-0.061
 facts-0.057
 discovering-0.057
 tours-0.055
 mills-0.054
Token It
Token ID632
Feature activation-2.79e-01

Pos logit contributions
kin+0.024
ivery+0.022
ifying+0.020
 bruised+0.019
ishy+0.019
Neg logit contributions
 guessing-0.030
 facts-0.029
 tours-0.029
 topics-0.029
 motorcycles-0.028
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
kin+0.015
ivery+0.014
ifying+0.013
 bruised+0.013
ishy+0.013
Neg logit contributions
 guessing-0.010
 facts-0.009
 discovering-0.009
 motorcycles-0.009
 tours-0.009
Token an
Token ID281
Feature activation-5.24e-01

Pos logit contributions
ivery+0.052
kin+0.050
ifying+0.046
ishy+0.046
he+0.044
Neg logit contributions
 guessing-0.047
 discovering-0.042
 facts-0.042
 seeing-0.040
 tours-0.039
Token ash
Token ID12530
Feature activation-6.73e-01

Pos logit contributions
kin+0.024
ivery+0.022
ifying+0.022
 bruised+0.021
robe+0.020
Neg logit contributions
 guessing-0.022
 facts-0.020
 discovering-0.020
 tours-0.020
 motorcycles-0.020
Tokentr
Token ID2213
Feature activation-3.72e+00

Pos logit contributions
kin+0.020
ivery+0.019
 bruised+0.016
ifying+0.016
ishy+0.016
Neg logit contributions
 guessing-0.038
 facts-0.036
 tours-0.035
 motorcycles-0.035
 discovering-0.035
Tokenay
Token ID323
Feature activation-7.60e-01

Pos logit contributions
kin+0.207
 thirst+0.205
 limp+0.199
 relief+0.191
ivery+0.190
Neg logit contributions
ons-0.101
 motorcycles-0.075
ades-0.074
ths-0.073
cles-0.069
Token.
Token ID13
Feature activation-7.60e-01

Pos logit contributions
kin+0.037
ivery+0.035
ishy+0.032
ifying+0.032
 bruised+0.031
Neg logit contributions
 guessing-0.030
 motorcycles-0.027
 discovering-0.027
 facts-0.026
 tours-0.026
Token Fl
Token ID1610
Feature activation-6.42e-01

Pos logit contributions
kin+0.030
ivery+0.028
ifying+0.025
ishy+0.024
 bruised+0.024
Neg logit contributions
 guessing-0.035
 facts-0.034
 tours-0.033
 motorcycles-0.033
 topics-0.033
Tokenuffy
Token ID15352
Feature activation+1.79e+00

Pos logit contributions
kin+0.023
ifying+0.021
ivery+0.021
 bruised+0.020
cloth+0.019
Neg logit contributions
 guessing-0.031
arm-0.029
 tours-0.029
 topics-0.028
 facts-0.028
Token did
Token ID750
Feature activation+1.19e+00

Pos logit contributions
 tours+0.098
 facts+0.095
 topics+0.091
 motorcycles+0.091
 events+0.090
Neg logit contributions
kin-0.058
 bruised-0.049
ivery-0.046
ishy-0.045
cloth-0.045
Token and
Token ID290
Feature activation-4.84e-01

Pos logit contributions
 facts+0.059
 tours+0.058
 topics+0.057
 mills+0.055
 motorcycles+0.054
Neg logit contributions
kin-0.053
 bruised-0.052
ivery-0.050
ifying-0.050
 limp-0.047
Token made
Token ID925
Feature activation-5.01e-01

Pos logit contributions
kin+0.023
ivery+0.021
ifying+0.020
ishy+0.019
 bruised+0.019
Neg logit contributions
 guessing-0.020
 motorcycles-0.019
 facts-0.019
 tours-0.018
 discovering-0.018
Token him
Token ID683
Feature activation-1.32e+00

Pos logit contributions
kin+0.022
ivery+0.020
ifying+0.019
 bruised+0.019
ishy+0.018
Neg logit contributions
 guessing-0.022
 tours-0.021
 facts-0.021
 motorcycles-0.021
 discovering-0.020
Token drop
Token ID4268
Feature activation-2.01e+00

Pos logit contributions
kin+0.060
ivery+0.053
ifying+0.051
robe+0.047
ishy+0.046
Neg logit contributions
 guessing-0.058
 tours-0.054
 facts-0.054
 motorcycles-0.054
 discovering-0.052
Token his
Token ID465
Feature activation-7.49e-01

Pos logit contributions
kin+0.122
ivery+0.118
robe+0.102
 seeker+0.097
ifying+0.096
Neg logit contributions
 motorcycles-0.059
 instructions-0.051
 tours-0.051
 facts-0.050
 bikes-0.050
Token basket
Token ID7988
Feature activation-3.69e+00

Pos logit contributions
kin+0.035
ivery+0.034
ifying+0.031
 bruised+0.030
robe+0.029
Neg logit contributions
 guessing-0.029
 tours-0.028
 motorcycles-0.028
 facts-0.028
 discovering-0.026
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
kin+0.201
ivery+0.197
 relief+0.196
 Deal+0.187
ishy+0.185
Neg logit contributions
 to-0.104
 guessing-0.088
 climbing-0.072
 exploring-0.070
ups-0.069
Token She
Token ID1375
Feature activation-5.02e-02

Pos logit contributions
kin+0.015
ivery+0.013
ifying+0.012
 bruised+0.012
ishy+0.011
Neg logit contributions
 guessing-0.017
 facts-0.017
 motorcycles-0.016
 tours-0.016
 topics-0.016
Token felt
Token ID2936
Feature activation-2.13e+00

Pos logit contributions
kin+0.002
ivery+0.002
ifying+0.002
 bruised+0.002
ishy+0.002
Neg logit contributions
 guessing-0.002
 facts-0.002
 motorcycles-0.002
 tours-0.002
 discovering-0.002
Token sorry
Token ID7926
Feature activation-5.46e-01

Pos logit contributions
kin+0.097
ivery+0.084
suit+0.083
cy+0.082
ifying+0.081
Neg logit contributions
 guessing-0.090
 learning-0.086
 discovering-0.085
 seeing-0.084
 facts-0.082
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
kin+0.031
ivery+0.029
ifying+0.028
 bruised+0.026
ishy+0.026
Neg logit contributions
 guessing-0.017
 facts-0.016
 discovering-0.015
 motorcycles-0.015
 tours-0.015
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
kin+0.049
ifying+0.046
ivery+0.046
ishy+0.043
 bruised+0.043
Neg logit contributions
 guessing-0.027
 tours-0.025
 discovering-0.025
 adventures-0.025
 facts-0.025
Token the
Token ID262
Feature activation-3.59e-01

Pos logit contributions
ivery+0.089
kin+0.087
ifying+0.084
ishy+0.083
istered+0.080
Neg logit contributions
 guessing-0.032
 tours-0.032
 adventures-0.031
 museums-0.030
 discovering-0.030
Token bird
Token ID6512
Feature activation-2.39e-02

Pos logit contributions
kin+0.018
ifying+0.017
ivery+0.017
ishy+0.016
 bruised+0.016
Neg logit contributions
 guessing-0.013
 tours-0.012
 facts-0.012
 motorcycles-0.012
 discovering-0.012
Token put
Token ID1234
Feature activation-2.87e+00

Pos logit contributions
kin+0.001
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.001
 facts-0.001
 tours-0.001
 motorcycles-0.001
 discovering-0.001
Token her
Token ID607
Feature activation-1.79e+00

Pos logit contributions
ivery+0.198
ifying+0.188
istered+0.181
ishy+0.168
 Deal+0.167
Neg logit contributions
ons-0.058
 things-0.055
 adventures-0.050
 to-0.047
 bikes-0.046
Token blanket
Token ID18447
Feature activation-3.68e+00

Pos logit contributions
ivery+0.080
kin+0.076
ifying+0.072
ishy+0.071
idget+0.068
Neg logit contributions
 adventures-0.068
 guessing-0.066
 bikes-0.063
 tours-0.060
 journeys-0.060
Token back
Token ID736
Feature activation-1.48e+00

Pos logit contributions
ivery+0.218
ishy+0.203
 Deal+0.202
ifying+0.200
kin+0.199
Neg logit contributions
 to-0.101
 guessing-0.074
 airplanes-0.068
 exploring-0.067
 adventures-0.065
Token inside
Token ID2641
Feature activation-7.83e-01

Pos logit contributions
kin+0.072
ivery+0.072
ifying+0.065
ishy+0.065
istered+0.063
Neg logit contributions
 guessing-0.053
 discovering-0.050
 tours-0.047
 motorcycles-0.047
 exploring-0.047
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
kin+0.037
ivery+0.036
ifying+0.033
ishy+0.033
 bruised+0.032
Neg logit contributions
 guessing-0.032
 discovering-0.030
 facts-0.029
 seeing-0.028
 tours-0.028
Token She
Token ID1375
Feature activation+3.31e-01

Pos logit contributions
kin+0.011
ivery+0.010
ifying+0.009
 bruised+0.009
ishy+0.009
Neg logit contributions
 facts-0.015
 guessing-0.015
 tours-0.015
 topics-0.015
 discovering-0.014
Token was
Token ID373
Feature activation-1.93e+00

Pos logit contributions
 guessing+0.015
 facts+0.015
 tours+0.015
 topics+0.014
 discovering+0.014
Neg logit contributions
kin-0.014
ivery-0.012
 bruised-0.012
ifying-0.011
ishy-0.011
Token dress
Token ID6576
Feature activation-1.67e+00

Pos logit contributions
ivery+0.128
kin+0.115
ishy+0.113
istered+0.110
ifying+0.105
Neg logit contributions
 guessing-0.079
 bikes-0.078
 motorcycles-0.074
 occasions-0.071
 counting-0.070
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
ivery+0.101
kin+0.097
ishy+0.092
ifying+0.092
 bruised+0.091
Neg logit contributions
 guessing-0.042
 facts-0.037
 discovering-0.035
 seeing-0.035
 bikes-0.035
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
ivery+0.049
kin+0.049
ifying+0.044
ishy+0.043
 bruised+0.043
Neg logit contributions
 guessing-0.027
 facts-0.025
 discovering-0.025
 motorcycles-0.025
 tours-0.024
Token a
Token ID257
Feature activation-2.18e+00

Pos logit contributions
ivery+0.052
kin+0.052
ishy+0.045
ifying+0.044
 bruised+0.044
Neg logit contributions
 guessing-0.048
 facts-0.047
 motorcycles-0.046
 discovering-0.045
 tours-0.044
Token spark
Token ID9009
Feature activation-2.27e+00

Pos logit contributions
ivery+0.106
kin+0.098
ishy+0.093
ifying+0.091
he+0.088
Neg logit contributions
 guessing-0.087
 facts-0.076
 motorcycles-0.074
 tricks-0.073
 discovering-0.071
Tokenly
Token ID306
Feature activation-3.68e+00

Pos logit contributions
ivery+0.127
ifying+0.110
 bruised+0.109
 Deal+0.108
kin+0.107
Neg logit contributions
 facts-0.070
 tricks-0.070
 guessing-0.067
is-0.061
 occasions-0.060
Token cape
Token ID36023
Feature activation-2.10e+00

Pos logit contributions
ivery+0.199
ifying+0.169
istered+0.168
idget+0.163
ishy+0.160
Neg logit contributions
 guessing-0.120
 tricks-0.120
 facts-0.113
 bikes-0.113
 occasions-0.110
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
ivery+0.109
kin+0.104
ifying+0.099
ishy+0.098
 bruised+0.095
Neg logit contributions
 guessing-0.073
 facts-0.067
 discovering-0.065
 seeing-0.064
 tricks-0.061
Token Tom
Token ID4186
Feature activation+1.03e+00

Pos logit contributions
kin+0.093
ivery+0.091
ishy+0.082
 bruised+0.078
ifying+0.077
Neg logit contributions
 motorcycles-0.072
 guessing-0.069
 museums-0.069
 tours-0.068
 topics-0.068
Token thought
Token ID1807
Feature activation+2.86e-01

Pos logit contributions
 tours+0.050
 facts+0.049
 guessing+0.049
 topics+0.049
 mills+0.047
Neg logit contributions
kin-0.038
 bruised-0.033
ivery-0.031
ifying-0.031
robe-0.030
Token Elsa
Token ID19226
Feature activation+1.19e-01

Pos logit contributions
 guessing+0.011
 facts+0.011
 tours+0.011
 topics+0.010
 motorcycles+0.010
Neg logit contributions
kin-0.014
ivery-0.012
 bruised-0.012
ifying-0.012
ishy-0.011
Token he
Token ID339
Feature activation-6.26e-01

Pos logit contributions
kin+0.017
ivery+0.017
ifying+0.016
 bruised+0.015
ishy+0.015
Neg logit contributions
 guessing-0.010
 tours-0.010
 motorcycles-0.010
 facts-0.010
 discovering-0.010
Token sp
Token ID599
Feature activation-1.10e+00

Pos logit contributions
kin+0.031
ivery+0.030
ifying+0.028
 bruised+0.028
ishy+0.027
Neg logit contributions
 guessing-0.023
 tours-0.021
 motorcycles-0.021
 discovering-0.021
 facts-0.020
Tokenied
Token ID798
Feature activation-7.39e-01

Pos logit contributions
kin+0.056
 bruised+0.055
ifying+0.053
ivery+0.052
 limp+0.049
Neg logit contributions
 guessing-0.035
 tours-0.034
 motorcycles-0.033
 facts-0.033
 topics-0.032
Token an
Token ID281
Feature activation-2.52e-01

Pos logit contributions
kin+0.034
ivery+0.034
ifying+0.032
ishy+0.030
 bruised+0.030
Neg logit contributions
 guessing-0.030
 tours-0.028
 motorcycles-0.028
 facts-0.028
 discovering-0.027
Token ash
Token ID12530
Feature activation-8.81e-01

Pos logit contributions
kin+0.012
ivery+0.011
ifying+0.010
 bruised+0.010
ishy+0.010
Neg logit contributions
 guessing-0.010
 facts-0.010
 tours-0.010
 motorcycles-0.010
 discovering-0.010
Tokentr
Token ID2213
Feature activation-3.67e+00

Pos logit contributions
kin+0.027
ivery+0.026
 bruised+0.023
ishy+0.023
ifying+0.022
Neg logit contributions
 guessing-0.048
 facts-0.046
 tours-0.045
 motorcycles-0.045
 mills-0.044
Tokenay
Token ID323
Feature activation-1.24e+00

Pos logit contributions
kin+0.198
 thirst+0.191
 limp+0.184
ivery+0.183
 relief+0.178
Neg logit contributions
ons-0.108
 motorcycles-0.091
ades-0.084
ths-0.079
ups-0.078
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
kin+0.058
ivery+0.056
ishy+0.052
 bruised+0.051
ifying+0.051
Neg logit contributions
 guessing-0.049
 motorcycles-0.047
 discovering-0.045
 facts-0.045
 tours-0.044
Token He
Token ID679
Feature activation+3.72e-01

Pos logit contributions
 guessing+0.013
 facts+0.012
 tours+0.012
 topics+0.012
 discovering+0.012
Neg logit contributions
kin-0.009
ivery-0.008
ifying-0.007
 bruised-0.007
ishy-0.007
Token picked
Token ID6497
Feature activation+7.28e-01

Pos logit contributions
 guessing+0.017
 facts+0.016
 tours+0.016
 topics+0.016
 discovering+0.016
Neg logit contributions
kin-0.016
ivery-0.014
 bruised-0.013
ifying-0.013
ishy-0.013
Token up
Token ID510
Feature activation+4.89e-01

Pos logit contributions
 facts+0.027
 guessing+0.027
 topics+0.027
 tours+0.027
 mills+0.026
Neg logit contributions
kin-0.034
 bruised-0.031
ivery-0.030
ifying-0.030
ishy-0.030
Token Ben
Token ID3932
Feature activation-7.57e-01

Pos logit contributions
ivery+0.038
kin+0.038
ifying+0.037
ishy+0.033
robe+0.033
Neg logit contributions
 guessing-0.036
 motorcycles-0.035
 facts-0.033
 tours-0.033
 discovering-0.032
Token said
Token ID531
Feature activation-1.76e+00

Pos logit contributions
kin+0.038
ifying+0.037
ivery+0.035
 bruised+0.033
ishy+0.032
Neg logit contributions
 guessing-0.028
 facts-0.027
 motorcycles-0.026
arm-0.026
 mills-0.025
Token,
Token ID11
Feature activation-3.15e-01

Pos logit contributions
ifying+0.109
kin+0.105
ivery+0.105
ishy+0.097
robe+0.093
Neg logit contributions
 guessing-0.044
 to-0.041
 counting-0.039
 instructions-0.037
 seeing-0.035
Token holding
Token ID4769
Feature activation-1.88e+00

Pos logit contributions
kin+0.018
ivery+0.017
ifying+0.016
ishy+0.016
 bruised+0.016
Neg logit contributions
 guessing-0.010
 discovering-0.009
 facts-0.009
 tours-0.008
 motorcycles-0.008
Token his
Token ID465
Feature activation+7.18e-01

Pos logit contributions
ivery+0.119
kin+0.105
ifying+0.103
 bruised+0.100
istered+0.100
Neg logit contributions
 quizz-0.046
 guessing-0.043
 motorcycles-0.041
 tricks-0.041
 walks-0.041
Token bowl
Token ID9396
Feature activation-3.65e+00

Pos logit contributions
 guessing+0.031
 facts+0.031
 tours+0.031
 topics+0.031
 discovering+0.030
Neg logit contributions
kin-0.031
 bruised-0.026
ivery-0.026
ifying-0.025
ishy-0.025
Token tightly
Token ID17707
Feature activation-2.40e+00

Pos logit contributions
ivery+0.199
ifying+0.195
kin+0.173
 Deal+0.168
Uh+0.167
Neg logit contributions
 to-0.112
ups-0.096
 instructions-0.092
 guessing-0.089
 facts-0.084
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
ivery+0.137
kin+0.134
ishy+0.115
ifying+0.114
robe+0.114
Neg logit contributions
 guessing-0.073
 to-0.070
 counting-0.064
 discovering-0.063
 motorcycles-0.063
Token\n
Token ID198
Feature activation+1.63e-01

Pos logit contributions
kin+0.073
ivery+0.073
ifying+0.069
ishy+0.068
 bruised+0.067
Neg logit contributions
 guessing-0.030
 motorcycles-0.027
 facts-0.026
 topics-0.026
 tours-0.025
Token\n
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 guessing+0.006
 facts+0.006
 tours+0.006
 topics+0.006
 discovering+0.006
Neg logit contributions
kin-0.008
ivery-0.007
ifying-0.007
 bruised-0.007
ishy-0.007
TokenL
Token ID43
Feature activation-1.41e+00

Pos logit contributions
kin+0.001
ivery+0.000
ifying+0.000
 bruised+0.000
ishy+0.000
Neg logit contributions
 guessing-0.000
 facts-0.000
 tours-0.000
 topics-0.000
 motorcycles-0.000
Token washed
Token ID18989
Feature activation-1.39e+00

Pos logit contributions
kin+0.061
ivery+0.059
ifying+0.056
rier+0.053
ishy+0.052
Neg logit contributions
 guessing-0.055
 discovering-0.051
 seeing-0.047
 computers-0.046
 exploring-0.046
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
kin+0.080
ivery+0.079
ifying+0.075
 bruised+0.071
istered+0.069
Neg logit contributions
 topics-0.035
 guessing-0.034
 instructions-0.033
 facts-0.033
 bikes-0.033
Token dress
Token ID6576
Feature activation-1.62e+00

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.009
 bruised+0.009
ishy+0.008
Neg logit contributions
 guessing-0.009
 facts-0.008
 tours-0.008
 discovering-0.008
 topics-0.008
Token until
Token ID1566
Feature activation-1.81e+00

Pos logit contributions
kin+0.079
ivery+0.078
ifying+0.078
 bruised+0.074
ishy+0.071
Neg logit contributions
 guessing-0.054
ups-0.050
 tours-0.049
 facts-0.048
 instructions-0.048
Token it
Token ID340
Feature activation-2.57e+00

Pos logit contributions
ivery+0.122
ishy+0.115
ifying+0.111
kin+0.110
 bruised+0.108
Neg logit contributions
 guessing-0.034
 seeing-0.028
 discovering-0.028
 tours-0.027
 adventures-0.026
Token was
Token ID373
Feature activation-3.65e+00

Pos logit contributions
ifying+0.172
ivery+0.158
ishy+0.157
arl+0.156
 bruised+0.155
Neg logit contributions
ups-0.041
 guessing-0.040
ves-0.038
 to-0.034
 adventures-0.031
Token perfect
Token ID2818
Feature activation-1.11e+00

Pos logit contributions
rified+0.187
ifying+0.173
suit+0.166
cy+0.166
hy+0.164
Neg logit contributions
 guessing-0.121
 adventures-0.106
 to-0.105
 exploring-0.104
 seeing-0.104
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
kin+0.053
ivery+0.053
ifying+0.051
 bruised+0.047
ishy+0.047
Neg logit contributions
 guessing-0.043
 discovering-0.039
 seeing-0.036
 tours-0.036
 adventures-0.036
Token just
Token ID655
Feature activation-2.29e+00

Pos logit contributions
kin+0.061
ivery+0.059
ifying+0.056
ishy+0.054
robe+0.052
Neg logit contributions
 guessing-0.034
 discovering-0.031
 seeing-0.029
 facts-0.028
 motorcycles-0.027
Token like
Token ID588
Feature activation-3.49e-01

Pos logit contributions
ivery+0.115
kin+0.111
he+0.104
ifying+0.099
robe+0.097
Neg logit contributions
 guessing-0.090
 discovering-0.089
 seeing-0.087
 exploring-0.081
 learning-0.079
Token new
Token ID649
Feature activation+2.83e-01

Pos logit contributions
kin+0.016
ivery+0.015
ifying+0.014
ishy+0.013
 bruised+0.013
Neg logit contributions
 guessing-0.015
 facts-0.014
 tours-0.014
 discovering-0.014
 motorcycles-0.014
Token girl
Token ID2576
Feature activation-1.55e-01

Pos logit contributions
kin+0.039
ivery+0.038
ifying+0.036
 bruised+0.033
robe+0.032
Neg logit contributions
 guessing-0.045
 motorcycles-0.044
 mills-0.042
 whales-0.042
 tours-0.042
Token named
Token ID3706
Feature activation-1.30e+00

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.008
 facts-0.007
 discovering-0.007
 tours-0.007
 topics-0.007
Token Lily
Token ID20037
Feature activation-3.64e-01

Pos logit contributions
kin+0.049
ivery+0.043
ifying+0.042
 bruised+0.040
ishy+0.040
Neg logit contributions
 guessing-0.068
 discovering-0.063
 tours-0.062
 seeing-0.060
arm-0.059
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
kin+0.016
ivery+0.014
ifying+0.014
 bruised+0.013
ishy+0.013
Neg logit contributions
 guessing-0.016
 discovering-0.015
 facts-0.015
 motorcycles-0.015
 tours-0.015
Token She
Token ID1375
Feature activation+1.66e-02

Pos logit contributions
kin+0.087
ifying+0.079
rier+0.076
ivery+0.076
 bruised+0.074
Neg logit contributions
 topics-0.068
 guessing-0.065
 museums-0.064
 discovering-0.064
 adventures-0.063
Token loved
Token ID6151
Feature activation+4.15e+00

Pos logit contributions
 guessing+0.001
 facts+0.001
 tours+0.001
 discovering+0.001
 topics+0.001
Neg logit contributions
kin-0.001
ivery-0.001
ifying-0.001
 bruised-0.001
ishy-0.001
Token to
Token ID284
Feature activation+9.41e-01

Pos logit contributions
icians+0.196
ves+0.194
bors+0.192
arm+0.182
ises+0.178
Neg logit contributions
 Lily-0.135
 bruised-0.130
 wet-0.126
 dry-0.125
 cold-0.124
Token help
Token ID1037
Feature activation+1.37e+00

Pos logit contributions
 guessing+0.041
 facts+0.041
 topics+0.040
 discovering+0.039
 motorcycles+0.039
Neg logit contributions
kin-0.039
ivery-0.035
 bruised-0.032
ifying-0.032
ishy-0.032
Token her
Token ID607
Feature activation+1.09e+00

Pos logit contributions
 guessing+0.074
 facts+0.073
 discovering+0.071
icians+0.070
 tours+0.069
Neg logit contributions
kin-0.040
ivery-0.039
 bruised-0.035
cloth-0.033
ishy-0.033
Token mom
Token ID1995
Feature activation+3.91e-01

Pos logit contributions
 facts+0.065
 discovering+0.064
icians+0.064
 guessing+0.064
 tours+0.062
Neg logit contributions
kin-0.030
ivery-0.025
 bruised-0.022
robe-0.021
 limp-0.020
Tokenmy
Token ID1820
Feature activation+9.23e-01

Pos logit contributions
 guessing+0.020
 facts+0.019
 tours+0.019
 topics+0.019
 motorcycles+0.019
Neg logit contributions
kin-0.014
ivery-0.013
 bruised-0.012
ifying-0.012
ishy-0.011
Token on
Token ID319
Feature activation-3.31e-01

Pos logit contributions
 guessing+0.014
 facts+0.014
 tours+0.014
 topics+0.013
 motorcycles+0.013
Neg logit contributions
kin-0.013
ivery-0.012
ifying-0.011
 bruised-0.011
ishy-0.011
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
kin+0.017
ivery+0.016
 bruised+0.015
ifying+0.015
ishy+0.015
Neg logit contributions
 guessing-0.012
 tours-0.011
 facts-0.011
 motorcycles-0.011
 topics-0.011
Token hill
Token ID12788
Feature activation-4.32e-01

Pos logit contributions
 facts+0.056
 topics+0.054
 discovering+0.053
icians+0.053
 mills+0.053
Neg logit contributions
kin-0.052
ifying-0.045
ivery-0.044
he-0.042
 limp-0.042
Token.
Token ID13
Feature activation-8.90e-01

Pos logit contributions
kin+0.018
ivery+0.017
 bruised+0.016
ifying+0.016
ishy+0.016
Neg logit contributions
 guessing-0.019
 tours-0.018
 facts-0.018
 discovering-0.018
 motorcycles-0.018
Token They
Token ID1119
Feature activation+9.14e-01

Pos logit contributions
kin+0.042
ivery+0.040
 bruised+0.037
ifying+0.034
ishy+0.034
Neg logit contributions
 tours-0.036
 guessing-0.036
 motorcycles-0.036
 facts-0.035
 topics-0.035
Token wanted
Token ID2227
Feature activation+3.11e+00

Pos logit contributions
 guessing+0.048
 facts+0.047
 tours+0.047
 topics+0.046
 discovering+0.045
Neg logit contributions
kin-0.029
ivery-0.026
ifying-0.025
 bruised-0.025
cloth-0.024
Token to
Token ID284
Feature activation+6.49e-01

Pos logit contributions
pill+0.152
icians+0.149
 topics+0.148
bors+0.145
 feats+0.144
Neg logit contributions
 empty-0.098
 full-0.093
 away-0.088
 wet-0.087
 no-0.087
Token go
Token ID467
Feature activation+1.08e+00

Pos logit contributions
 facts+0.029
 guessing+0.029
 topics+0.028
 motorcycles+0.028
 tours+0.028
Neg logit contributions
kin-0.025
ivery-0.023
ifying-0.022
 bruised-0.022
 limp-0.021
Token inside
Token ID2641
Feature activation+1.21e+00

Pos logit contributions
 facts+0.058
 topics+0.056
 motorcycles+0.054
 guessing+0.053
 tours+0.052
Neg logit contributions
kin-0.034
ivery-0.031
 bruised-0.030
ifying-0.029
ishy-0.028
Token and
Token ID290
Feature activation-1.60e-01

Pos logit contributions
 facts+0.057
 topics+0.054
 tours+0.054
 guessing+0.053
 motorcycles+0.053
Neg logit contributions
kin-0.047
 bruised-0.040
cloth-0.040
ivery-0.039
ifying-0.039
Token look
Token ID804
Feature activation-9.03e-01

Pos logit contributions
kin+0.008
ivery+0.007
ifying+0.007
 bruised+0.007
ishy+0.006
Neg logit contributions
 guessing-0.006
 facts-0.006
 tours-0.006
 discovering-0.006
 motorcycles-0.006
Token bird
Token ID6512
Feature activation-1.49e-01

Pos logit contributions
kin+0.049
ivery+0.048
 bruised+0.045
ifying+0.043
ishy+0.043
Neg logit contributions
 guessing-0.033
 motorcycles-0.032
 facts-0.031
 tours-0.031
 mills-0.030
Token.
Token ID13
Feature activation-8.93e-01

Pos logit contributions
kin+0.006
ivery+0.006
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.007
 facts-0.006
 tours-0.006
 discovering-0.006
 motorcycles-0.006
Token The
Token ID383
Feature activation-7.30e-01

Pos logit contributions
kin+0.042
ivery+0.039
ifying+0.036
ishy+0.035
 bruised+0.035
Neg logit contributions
 motorcycles-0.035
 facts-0.035
 tours-0.035
 guessing-0.035
 discovering-0.034
Token blue
Token ID4171
Feature activation-5.26e-01

Pos logit contributions
kin+0.039
ivery+0.037
ifying+0.037
 bruised+0.034
cloth+0.034
Neg logit contributions
 guessing-0.025
 tours-0.023
 motorcycles-0.023
 mills-0.022
 facts-0.022
Token bird
Token ID6512
Feature activation+1.10e-01

Pos logit contributions
kin+0.027
ivery+0.026
ifying+0.025
 bruised+0.024
ishy+0.024
Neg logit contributions
 guessing-0.018
 tours-0.017
 motorcycles-0.017
 facts-0.017
 mills-0.016
Token loved
Token ID6151
Feature activation+3.06e+00

Pos logit contributions
 guessing+0.005
 facts+0.005
 tours+0.005
 topics+0.004
 motorcycles+0.004
Neg logit contributions
kin-0.005
ivery-0.004
ifying-0.004
 bruised-0.004
ishy-0.004
Token to
Token ID284
Feature activation+3.74e-01

Pos logit contributions
 topics+0.132
icians+0.132
arm+0.130
cles+0.123
 Street+0.120
Neg logit contributions
 bruised-0.118
 full-0.108
 empty-0.106
 alone-0.106
 wet-0.104
Token sing
Token ID1702
Feature activation+7.19e-01

Pos logit contributions
 guessing+0.016
 facts+0.015
 topics+0.015
 tours+0.015
 motorcycles+0.015
Neg logit contributions
kin-0.017
ivery-0.015
 bruised-0.014
ifying-0.014
ishy-0.014
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
 guessing+0.032
 facts+0.031
 topics+0.030
 mills+0.030
 tours+0.030
Neg logit contributions
kin-0.031
ivery-0.027
ifying-0.026
robe-0.025
 bruised-0.025
Token fly
Token ID6129
Feature activation-3.13e-01

Pos logit contributions
 tours+0.062
arm+0.061
 mills+0.061
 topics+0.061
 motorcycles+0.061
Neg logit contributions
kin-0.061
 bruised-0.052
ivery-0.052
ifying-0.051
 limp-0.050
Token in
Token ID287
Feature activation+6.33e-01

Pos logit contributions
kin+0.014
ivery+0.013
 bruised+0.012
ifying+0.012
ishy+0.012
Neg logit contributions
 guessing-0.013
 facts-0.013
 tours-0.013
 topics-0.012
 discovering-0.012
Token a
Token ID257
Feature activation-2.74e-01

Pos logit contributions
kin+0.039
ivery+0.038
ifying+0.035
robe+0.035
 bruised+0.034
Neg logit contributions
 guessing-0.030
 facts-0.029
 tours-0.028
 discovering-0.027
 motorcycles-0.027
Token big
Token ID1263
Feature activation-2.91e-01

Pos logit contributions
kin+0.012
ivery+0.011
ifying+0.010
 bruised+0.010
ishy+0.010
Neg logit contributions
 guessing-0.012
 facts-0.011
 tours-0.011
 motorcycles-0.011
 topics-0.011
Token spoon
Token ID24556
Feature activation-1.20e+00

Pos logit contributions
kin+0.012
ivery+0.011
ifying+0.011
 bruised+0.010
robe+0.010
Neg logit contributions
 guessing-0.013
 facts-0.012
 motorcycles-0.012
 tours-0.012
 discovering-0.012
Token.
Token ID13
Feature activation-6.72e-02

Pos logit contributions
kin+0.056
ivery+0.055
ifying+0.051
 bruised+0.050
ishy+0.049
Neg logit contributions
 guessing-0.045
 discovering-0.041
 facts-0.040
 mills-0.039
 seeing-0.039
Token She
Token ID1375
Feature activation+5.24e-01

Pos logit contributions
kin+0.003
ivery+0.002
ifying+0.002
 bruised+0.002
ishy+0.002
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 topics-0.003
 motorcycles-0.003
Token loved
Token ID6151
Feature activation+3.19e+00

Pos logit contributions
 guessing+0.025
 tours+0.024
 facts+0.024
 topics+0.024
 discovering+0.024
Neg logit contributions
kin-0.021
ivery-0.018
 bruised-0.018
ifying-0.017
ishy-0.017
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
arm+0.151
 cultures+0.150
icians+0.149
 tours+0.149
cles+0.147
Neg logit contributions
kin-0.104
 bruised-0.102
 Lily-0.098
 Mom-0.093
 sorry-0.092
Token watch
Token ID2342
Feature activation+1.84e+00

Pos logit contributions
 tours+0.065
 motorcycles+0.065
 topics+0.065
 facts+0.064
 guessing+0.062
Neg logit contributions
kin-0.057
ivery-0.049
 bruised-0.047
ishy-0.045
ifying-0.044
Token the
Token ID262
Feature activation+7.85e-01

Pos logit contributions
 facts+0.083
 tours+0.082
 topics+0.082
 cultures+0.081
 mills+0.080
Neg logit contributions
kin-0.071
 bruised-0.061
ifying-0.059
ivery-0.057
cloth-0.057
Token vegetables
Token ID13701
Feature activation+9.94e-01

Pos logit contributions
 guessing+0.032
 discovering+0.032
 tours+0.032
 facts+0.031
 topics+0.031
Neg logit contributions
kin-0.035
ivery-0.031
 bruised-0.030
ifying-0.030
ishy-0.029
Token move
Token ID1445
Feature activation-3.54e-01

Pos logit contributions
 facts+0.044
 motorcycles+0.043
 tours+0.043
 topics+0.043
 guessing+0.043
Neg logit contributions
kin-0.042
ivery-0.037
 bruised-0.035
ifying-0.035
ishy-0.033
Token had
Token ID550
Feature activation+1.47e+00

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.008
 bruised+0.008
ishy+0.008
Neg logit contributions
 guessing-0.011
 facts-0.010
 tours-0.010
 motorcycles-0.010
 discovering-0.010
Token many
Token ID867
Feature activation+1.93e+00

Pos logit contributions
 guessing+0.061
 topics+0.059
arm+0.058
 discovering+0.057
 mills+0.057
Neg logit contributions
kin-0.058
ifying-0.055
 bruised-0.054
 sill-0.053
cloth-0.049
Token teeth
Token ID9941
Feature activation+4.81e-01

Pos logit contributions
 mills+0.068
 guessing+0.067
ths+0.067
 discovering+0.066
 tours+0.066
Neg logit contributions
kin-0.085
 bruised-0.078
ishy-0.076
 sill-0.075
ifying-0.074
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 guessing+0.023
 facts+0.023
 tours+0.023
 topics+0.023
 motorcycles+0.022
Neg logit contributions
kin-0.018
ivery-0.016
ifying-0.015
 bruised-0.015
ishy-0.014
Token They
Token ID1119
Feature activation-5.58e-01

Pos logit contributions
 guessing+0.009
 facts+0.008
 tours+0.008
 topics+0.008
 motorcycles+0.008
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token wanted
Token ID2227
Feature activation+3.07e+00

Pos logit contributions
kin+0.025
ivery+0.024
ifying+0.022
 bruised+0.022
ishy+0.021
Neg logit contributions
 guessing-0.023
 facts-0.023
 motorcycles-0.022
 tours-0.022
 discovering-0.022
Token to
Token ID284
Feature activation+7.24e-01

Pos logit contributions
ths+0.118
bors+0.116
pill+0.116
 occasions+0.113
 topics+0.112
Neg logit contributions
 sill-0.117
 Lily-0.108
 away-0.101
 full-0.101
 empty-0.100
Token print
Token ID3601
Feature activation+6.26e-01

Pos logit contributions
 guessing+0.030
 facts+0.029
 topics+0.029
 tours+0.029
 motorcycles+0.029
Neg logit contributions
kin-0.031
ivery-0.028
ifying-0.027
 bruised-0.026
 limp-0.025
Token crocod
Token ID37565
Feature activation+1.50e+00

Pos logit contributions
 guessing+0.023
 discovering+0.022
 tours+0.022
 topics+0.022
 facts+0.022
Neg logit contributions
kin-0.030
ivery-0.026
ifying-0.026
 bruised-0.026
ishy-0.025
Tokeniles
Token ID2915
Feature activation+1.05e+00

Pos logit contributions
 guessing+0.047
 discovering+0.047
 motorcycles+0.043
 topics+0.043
 airports+0.043
Neg logit contributions
kin-0.080
ivery-0.072
ishy-0.072
ifying-0.072
robe-0.069
Token too
Token ID1165
Feature activation-2.27e-01

Pos logit contributions
 facts+0.049
 topics+0.049
 tours+0.049
 guessing+0.049
 motorcycles+0.048
Neg logit contributions
kin-0.040
ifying-0.034
ivery-0.033
 bruised-0.033
ishy-0.031
Token yelled
Token ID22760
Feature activation-1.90e-01

Pos logit contributions
kin+0.034
ivery+0.032
ifying+0.030
robe+0.029
ishy+0.028
Neg logit contributions
 guessing-0.027
 motorcycles-0.024
 facts-0.024
 discovering-0.024
 mills-0.023
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
kin+0.009
ivery+0.008
ifying+0.008
 bruised+0.008
ishy+0.007
Neg logit contributions
 guessing-0.008
 facts-0.007
 tours-0.007
 motorcycles-0.007
 discovering-0.007
Token\n
Token ID198
Feature activation+8.51e-01

Pos logit contributions
kin+0.009
ivery+0.008
ifying+0.008
 bruised+0.008
ishy+0.007
Neg logit contributions
 guessing-0.008
 facts-0.007
 tours-0.007
 topics-0.007
 motorcycles-0.007
Token\n
Token ID198
Feature activation+6.80e-01

Pos logit contributions
 facts+0.038
 guessing+0.038
 tours+0.036
 topics+0.036
 discovering+0.036
Neg logit contributions
kin-0.035
ivery-0.032
 bruised-0.030
ifying-0.030
he-0.028
Token"
Token ID1
Feature activation-1.71e-01

Pos logit contributions
 facts+0.026
 guessing+0.025
 tours+0.025
 discovering+0.024
 motorcycles+0.024
Neg logit contributions
kin-0.034
ivery-0.032
ifying-0.030
 bruised-0.029
cloth-0.028
TokenStop
Token ID19485
Feature activation+1.40e+00

Pos logit contributions
kin+0.008
ivery+0.008
ifying+0.007
 bruised+0.007
ishy+0.007
Neg logit contributions
 guessing-0.006
 facts-0.006
 tours-0.006
 motorcycles-0.006
 topics-0.006
Token it
Token ID340
Feature activation-6.72e-01

Pos logit contributions
 facts+0.064
 topics+0.062
 tours+0.061
 cultures+0.059
 mills+0.058
Neg logit contributions
kin-0.057
 bruised-0.054
ivery-0.050
 sorry-0.049
 OK-0.049
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
kin+0.038
ifying+0.036
ivery+0.034
ishy+0.034
 bruised+0.034
Neg logit contributions
 guessing-0.021
 tours-0.019
 motorcycles-0.018
 discovering-0.017
 facts-0.017
Token both
Token ID1111
Feature activation-2.65e-01

Pos logit contributions
ivery+0.081
kin+0.079
ifying+0.074
robe+0.073
ishy+0.073
Neg logit contributions
 guessing-0.051
 motorcycles-0.043
 bikes-0.041
 tours-0.040
 facts-0.040
Token of
Token ID286
Feature activation+6.01e-01

Pos logit contributions
kin+0.013
ivery+0.012
ifying+0.012
 bruised+0.012
ishy+0.011
Neg logit contributions
 guessing-0.010
 tours-0.009
 facts-0.009
 motorcycles-0.009
 topics-0.009
Token you
Token ID345
Feature activation-1.04e-01

Pos logit contributions
 guessing+0.020
 facts+0.020
 topics+0.019
 discovering+0.019
 tours+0.019
Neg logit contributions
kin-0.032
ivery-0.029
 bruised-0.028
ifying-0.028
cloth-0.027
Token her
Token ID607
Feature activation-1.61e-01

Pos logit contributions
kin+0.028
ifying+0.025
ivery+0.025
ishy+0.024
 bruised+0.022
Neg logit contributions
 guessing-0.025
 discovering-0.024
 facts-0.024
 tours-0.024
 topics-0.023
Token own
Token ID898
Feature activation+2.63e-01

Pos logit contributions
kin+0.007
ivery+0.007
ifying+0.006
 bruised+0.006
ishy+0.006
Neg logit contributions
 guessing-0.007
 facts-0.007
 tours-0.006
 topics-0.006
 discovering-0.006
Token warning
Token ID6509
Feature activation-2.21e-01

Pos logit contributions
 guessing+0.011
 facts+0.011
 tours+0.010
 motorcycles+0.010
 topics+0.010
Neg logit contributions
kin-0.012
ivery-0.011
 bruised-0.010
ifying-0.010
ishy-0.010
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.009
 bruised+0.009
ishy+0.008
Neg logit contributions
 guessing-0.009
 facts-0.009
 tours-0.009
 topics-0.009
 discovering-0.009
Token From
Token ID3574
Feature activation-3.63e-01

Pos logit contributions
kin+0.078
ivery+0.067
ifying+0.065
ishy+0.064
 bruised+0.060
Neg logit contributions
 discovering-0.042
 guessing-0.041
 facts-0.039
 topics-0.039
 mills-0.038
Token then
Token ID788
Feature activation+1.19e+00

Pos logit contributions
kin+0.022
ivery+0.020
ifying+0.020
 bruised+0.019
ishy+0.019
Neg logit contributions
 guessing-0.010
 discovering-0.009
 tours-0.009
 facts-0.009
 motorcycles-0.009
Token on
Token ID319
Feature activation+2.81e-01

Pos logit contributions
 facts+0.066
 guessing+0.064
 topics+0.063
 motorcycles+0.063
 tours+0.063
Neg logit contributions
kin-0.037
ivery-0.033
 bruised-0.032
ifying-0.031
he-0.028
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 guessing+0.011
 facts+0.011
 tours+0.010
 motorcycles+0.010
 topics+0.010
Neg logit contributions
kin-0.014
ivery-0.012
 bruised-0.012
ifying-0.012
ishy-0.011
Token she
Token ID673
Feature activation+1.31e+00

Pos logit contributions
kin+0.041
ivery+0.040
ifying+0.040
ishy+0.037
suit+0.035
Neg logit contributions
 guessing-0.052
 discovering-0.050
 topics-0.048
 adventures-0.048
 learning-0.048
Token made
Token ID925
Feature activation+1.12e+00

Pos logit contributions
 tours+0.061
 guessing+0.061
 facts+0.060
 topics+0.059
 motorcycles+0.058
Neg logit contributions
kin-0.053
ivery-0.047
 bruised-0.047
robe-0.043
ishy-0.042
Token sure
Token ID1654
Feature activation+1.08e+00

Pos logit contributions
 facts+0.048
 guessing+0.047
 topics+0.046
 motorcycles+0.046
 tours+0.046
Neg logit contributions
kin-0.045
ivery-0.042
 bruised-0.041
ishy-0.038
 sill-0.038
Token saw
Token ID2497
Feature activation+8.01e-01

Pos logit contributions
 facts+0.064
 tours+0.064
 topics+0.064
 guessing+0.063
 motorcycles+0.061
Neg logit contributions
kin-0.050
 bruised-0.041
ifying-0.040
ivery-0.040
ishy-0.038
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
 guessing+0.032
 tours+0.031
 facts+0.031
 topics+0.031
 mills+0.030
Neg logit contributions
kin-0.035
 bruised-0.032
ifying-0.031
ivery-0.031
 sill-0.030
Token most
Token ID749
Feature activation+1.34e+00

Pos logit contributions
 facts+0.032
 tours+0.032
 guessing+0.032
 topics+0.032
 discovering+0.031
Neg logit contributions
kin-0.036
ivery-0.032
ifying-0.031
 bruised-0.031
ishy-0.030
Token amazing
Token ID4998
Feature activation+2.14e+00

Pos logit contributions
 tours+0.061
 facts+0.060
 topics+0.060
 guessing+0.059
arm+0.059
Neg logit contributions
kin-0.052
ishy-0.045
 bruised-0.045
ifying-0.043
ivery-0.042
Token things
Token ID1243
Feature activation+8.49e-01

Pos logit contributions
 guessing+0.080
arm+0.080
 facts+0.079
icians+0.078
 tours+0.077
Neg logit contributions
kin-0.101
 eye-0.082
ishy-0.077
 sill-0.075
 bruised-0.074
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 guessing+0.035
 facts+0.034
 tours+0.033
 topics+0.033
 motorcycles+0.033
Neg logit contributions
kin-0.038
ifying-0.034
ivery-0.034
 bruised-0.034
ishy-0.032
Token felt
Token ID2936
Feature activation-1.11e+00

Pos logit contributions
 guessing+0.043
 tours+0.042
 topics+0.041
 facts+0.041
 motorcycles+0.041
Neg logit contributions
kin-0.044
 bruised-0.040
ifying-0.038
ivery-0.038
 sill-0.037
Token how
Token ID703
Feature activation+6.62e-01

Pos logit contributions
kin+0.047
ivery+0.046
robe+0.039
ifying+0.039
 bruised+0.039
Neg logit contributions
 discovering-0.051
 guessing-0.051
 tours-0.047
 facts-0.047
 seeing-0.047
Token great
Token ID1049
Feature activation+9.63e-01

Pos logit contributions
 guessing+0.030
 facts+0.029
 topics+0.028
 tours+0.028
 motorcycles+0.028
Neg logit contributions
kin-0.027
ivery-0.025
ifying-0.024
 bruised-0.024
ishy-0.023
Token the
Token ID262
Feature activation+5.75e-01

Pos logit contributions
 guessing+0.037
 facts+0.034
 topics+0.034
 tours+0.034
 mills+0.033
Neg logit contributions
kin-0.046
 bruised-0.040
ifying-0.040
ivery-0.039
ishy-0.038
Token world
Token ID995
Feature activation+3.37e-03

Pos logit contributions
 guessing+0.023
 facts+0.023
 tours+0.022
 discovering+0.022
 topics+0.022
Neg logit contributions
kin-0.026
ivery-0.024
 bruised-0.023
ifying-0.023
ishy-0.022
Token!
Token ID0
Feature activation-1.41e-01

Pos logit contributions
kin+0.012
ivery+0.011
ifying+0.011
 bruised+0.011
ishy+0.010
Neg logit contributions
 guessing-0.012
 tours-0.011
 discovering-0.011
 facts-0.011
 motorcycles-0.011
Token He
Token ID679
Feature activation+8.70e-01

Pos logit contributions
kin+0.005
ivery+0.005
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.007
 facts-0.007
 tours-0.007
 motorcycles-0.007
 topics-0.007
Token went
Token ID1816
Feature activation+1.14e+00

Pos logit contributions
 guessing+0.041
 facts+0.041
 topics+0.041
 tours+0.040
 mills+0.039
Neg logit contributions
kin-0.034
ivery-0.029
 bruised-0.028
ifying-0.027
ishy-0.027
Token over
Token ID625
Feature activation+1.72e+00

Pos logit contributions
 facts+0.059
 topics+0.058
 motorcycles+0.055
 mills+0.054
arm+0.054
Neg logit contributions
kin-0.038
 bruised-0.034
robe-0.033
ivery-0.033
 limp-0.033
Token hills
Token ID18639
Feature activation+1.20e+00

Pos logit contributions
 facts+0.056
bors+0.055
 guessing+0.053
icians+0.053
arm+0.053
Neg logit contributions
kin-0.081
 sill-0.075
 bruised-0.075
he-0.074
robe-0.072
Token,
Token ID11
Feature activation+1.89e+00

Pos logit contributions
 facts+0.049
 topics+0.048
 guessing+0.047
 tours+0.046
 motorcycles+0.046
Neg logit contributions
kin-0.053
ivery-0.047
 bruised-0.046
he-0.046
 sill-0.045
Token through
Token ID832
Feature activation+1.45e+00

Pos logit contributions
 facts+0.089
 occasions+0.088
 topics+0.087
 tours+0.080
 recipes+0.080
Neg logit contributions
 bruised-0.069
kin-0.068
 sill-0.065
 limp-0.061
ishy-0.060
Token fields
Token ID7032
Feature activation+1.28e+00

Pos logit contributions
 guessing+0.061
 facts+0.060
arm+0.059
 topics+0.056
 mills+0.056
Neg logit contributions
kin-0.057
 bruised-0.054
ifying-0.054
 sill-0.052
cloth-0.049
Token,
Token ID11
Feature activation+2.39e+00

Pos logit contributions
 facts+0.054
 topics+0.053
 motorcycles+0.052
 guessing+0.052
 tours+0.052
Neg logit contributions
kin-0.055
 bruised-0.048
ivery-0.048
 sill-0.047
he-0.047
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
 occasions+0.119
 facts+0.117
 topics+0.114
arm+0.106
 tours+0.104
Neg logit contributions
 sill-0.080
kin-0.080
 bruised-0.079
he-0.070
 full-0.069
Token even
Token ID772
Feature activation+5.21e-02

Pos logit contributions
 facts+0.066
 topics+0.064
 tours+0.064
arm+0.063
 occasions+0.063
Neg logit contributions
kin-0.066
 bruised-0.063
 sill-0.062
ifying-0.057
robe-0.056
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 facts+0.045
 guessing+0.045
 topics+0.044
arm+0.044
 motorcycles+0.043
Neg logit contributions
kin-0.055
ivery-0.050
 bruised-0.049
ifying-0.048
 sill-0.047
Token university
Token ID6403
Feature activation+9.11e-01

Pos logit contributions
 facts+0.027
 guessing+0.026
 motorcycles+0.025
 mills+0.025
 whales+0.024
Neg logit contributions
kin-0.058
ivery-0.055
ifying-0.053
 bruised-0.052
he-0.051
Token was
Token ID373
Feature activation-3.93e-01

Pos logit contributions
 guessing+0.036
 facts+0.035
 tours+0.034
 motorcycles+0.034
 topics+0.034
Neg logit contributions
kin-0.042
ivery-0.038
ifying-0.036
 bruised-0.036
robe-0.035
Token coming
Token ID2406
Feature activation-1.44e+00

Pos logit contributions
kin+0.018
ivery+0.017
ifying+0.016
 bruised+0.016
ishy+0.015
Neg logit contributions
 guessing-0.016
 facts-0.015
 tours-0.015
 discovering-0.015
 motorcycles-0.015
Token true
Token ID2081
Feature activation-7.27e-02

Pos logit contributions
kin+0.081
ivery+0.071
ishy+0.069
ifying+0.066
 bruised+0.066
Neg logit contributions
 guessing-0.048
 tours-0.047
 discovering-0.046
 motorcycles-0.046
 seeing-0.043
Token.
Token ID13
Feature activation+7.53e-01

Pos logit contributions
kin+0.003
ivery+0.003
ifying+0.003
 bruised+0.003
ishy+0.003
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 topics-0.003
 motorcycles-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.11e+00

Pos logit contributions
 guessing+0.038
 facts+0.036
 tours+0.036
 motorcycles+0.035
 topics+0.035
Neg logit contributions
kin-0.027
ifying-0.024
ivery-0.024
 bruised-0.022
 sill-0.021
TokenOnce
Token ID7454
Feature activation-9.40e-01

Pos logit contributions
kin+0.063
ivery+0.061
 bruised+0.059
ishy+0.058
ifying+0.056
Neg logit contributions
 guessing-0.032
 tours-0.031
 discovering-0.030
 topics-0.030
 motorcycles-0.030
Token there
Token ID612
Feature activation-1.13e+00

Pos logit contributions
kin+0.042
ivery+0.041
ifying+0.038
 bruised+0.038
ishy+0.036
Neg logit contributions
 guessing-0.039
 tours-0.037
 discovering-0.036
 motorcycles-0.036
 facts-0.036
Token was
Token ID373
Feature activation-1.41e+00

Pos logit contributions
kin+0.067
ivery+0.066
ifying+0.064
ishy+0.061
 bruised+0.060
Neg logit contributions
 guessing-0.030
 discovering-0.028
 topics-0.027
 tours-0.027
 adventures-0.026
Token a
Token ID257
Feature activation-2.67e-01

Pos logit contributions
kin+0.070
ivery+0.066
ifying+0.062
 bruised+0.062
robe+0.058
Neg logit contributions
 guessing-0.055
 adventures-0.053
 tours-0.053
 motorcycles-0.053
 discovering-0.052
Token owl
Token ID39610
Feature activation+3.03e-01

Pos logit contributions
 guessing+0.000
 facts+0.000
 tours+0.000
 motorcycles+0.000
 topics+0.000
Neg logit contributions
kin-0.001
ivery-0.001
ifying-0.001
 bruised-0.001
ishy-0.001
Token saw
Token ID2497
Feature activation+3.97e-01

Pos logit contributions
 guessing+0.013
 facts+0.013
 tours+0.013
 topics+0.012
 motorcycles+0.012
Neg logit contributions
kin-0.013
ivery-0.012
 bruised-0.011
ifying-0.011
ishy-0.011
Token Benny
Token ID44275
Feature activation+4.91e-01

Pos logit contributions
 guessing+0.017
 facts+0.017
 tours+0.017
 topics+0.017
 discovering+0.016
Neg logit contributions
kin-0.017
ivery-0.015
 bruised-0.015
ifying-0.015
ishy-0.014
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
 facts+0.019
 guessing+0.018
 tours+0.018
 topics+0.018
 motorcycles+0.017
Neg logit contributions
kin-0.024
ivery-0.022
 bruised-0.021
ifying-0.021
ishy-0.020
Token flew
Token ID13112
Feature activation-1.49e+00

Pos logit contributions
 guessing+0.003
 facts+0.003
 tours+0.003
 motorcycles+0.003
 discovering+0.003
Neg logit contributions
kin-0.003
ivery-0.003
ifying-0.003
 bruised-0.002
ishy-0.002
Token down
Token ID866
Feature activation+1.66e-01

Pos logit contributions
kin+0.088
ivery+0.077
ifying+0.070
ishy+0.068
robe+0.066
Neg logit contributions
 to-0.050
 guessing-0.050
 discovering-0.047
 tours-0.046
 exploring-0.044
Token to
Token ID284
Feature activation-1.47e-01

Pos logit contributions
 guessing+0.004
 facts+0.004
 tours+0.004
 topics+0.004
 motorcycles+0.004
Neg logit contributions
kin-0.010
ivery-0.010
ifying-0.009
 bruised-0.009
ishy-0.009
Token help
Token ID1037
Feature activation-6.88e-02

Pos logit contributions
kin+0.008
ivery+0.007
ifying+0.007
 bruised+0.007
ishy+0.007
Neg logit contributions
 guessing-0.005
 tours-0.005
 facts-0.005
 discovering-0.005
 topics-0.004
Token him
Token ID683
Feature activation-3.52e-01

Pos logit contributions
kin+0.003
ivery+0.003
ifying+0.003
 bruised+0.003
ishy+0.003
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 topics-0.003
 motorcycles-0.003
Token.
Token ID13
Feature activation+8.74e-02

Pos logit contributions
kin+0.015
ivery+0.013
ifying+0.013
 bruised+0.012
ishy+0.012
Neg logit contributions
 guessing-0.016
 tours-0.015
 discovering-0.015
 facts-0.015
 motorcycles-0.015
Token The
Token ID383
Feature activation+2.39e-01

Pos logit contributions
 guessing+0.004
 facts+0.004
 tours+0.004
 topics+0.004
 motorcycles+0.004
Neg logit contributions
kin-0.003
ivery-0.003
ifying-0.003
 bruised-0.003
ishy-0.002
Token trees
Token ID7150
Feature activation+6.05e-01

Pos logit contributions
 guessing+0.045
 tours+0.045
arm+0.044
 topics+0.044
 facts+0.044
Neg logit contributions
kin-0.056
ifying-0.051
 bruised-0.051
ivery-0.048
cloth-0.047
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
 guessing+0.030
 facts+0.030
 tours+0.030
 topics+0.029
 motorcycles+0.029
Neg logit contributions
kin-0.022
ivery-0.019
ifying-0.018
 bruised-0.018
ishy-0.017
Token\n
Token ID198
Feature activation+6.62e-01

Pos logit contributions
kin+0.014
ivery+0.014
 bruised+0.013
ifying+0.013
ishy+0.012
Neg logit contributions
 guessing-0.010
 tours-0.010
 facts-0.010
 topics-0.010
 motorcycles-0.010
Token\n
Token ID198
Feature activation+4.44e-01

Pos logit contributions
 facts+0.029
 guessing+0.029
 tours+0.028
 motorcycles+0.028
 topics+0.028
Neg logit contributions
kin-0.028
ivery-0.025
ifying-0.024
 bruised-0.023
he-0.023
Token"
Token ID1
Feature activation-3.02e-01

Pos logit contributions
 facts+0.014
 guessing+0.014
 tours+0.014
 motorcycles+0.014
 discovering+0.014
Neg logit contributions
kin-0.024
ivery-0.023
ifying-0.022
ishy-0.021
 bruised-0.021
TokenLook
Token ID8567
Feature activation+2.69e-01

Pos logit contributions
kin+0.014
ivery+0.013
 bruised+0.012
ifying+0.012
ishy+0.012
Neg logit contributions
 guessing-0.012
 topics-0.012
 facts-0.012
 tours-0.012
 motorcycles-0.012
Token,
Token ID11
Feature activation-2.74e-01

Pos logit contributions
 guessing+0.011
 facts+0.011
 tours+0.010
 topics+0.010
 discovering+0.010
Neg logit contributions
kin-0.013
ivery-0.011
ifying-0.011
 bruised-0.011
ishy-0.011
Token I
Token ID314
Feature activation-2.37e-01

Pos logit contributions
kin+0.009
ivery+0.009
ifying+0.008
 bruised+0.008
ishy+0.008
Neg logit contributions
 guessing-0.014
 facts-0.014
 tours-0.014
 motorcycles-0.014
 discovering-0.014
Token found
Token ID1043
Feature activation+9.84e-01

Pos logit contributions
kin+0.010
ivery+0.009
 bruised+0.009
ifying+0.009
ishy+0.008
Neg logit contributions
 guessing-0.011
 facts-0.010
 tours-0.010
 discovering-0.010
 topics-0.010
Token the
Token ID262
Feature activation+5.10e-01

Pos logit contributions
 guessing+0.052
 topics+0.052
 facts+0.051
 discovering+0.050
 tours+0.050
Neg logit contributions
kin-0.031
ifying-0.028
 bruised-0.026
ivery-0.025
ishy-0.024
Token cow
Token ID9875
Feature activation+4.95e-02

Pos logit contributions
 guessing+0.022
 facts+0.021
 tours+0.021
 topics+0.021
 discovering+0.021
Neg logit contributions
kin-0.022
ivery-0.020
ifying-0.019
 bruised-0.019
ishy-0.018
Token that
Token ID326
Feature activation-1.38e+00

Pos logit contributions
kin+0.030
ivery+0.028
 bruised+0.026
ifying+0.026
ishy+0.026
Neg logit contributions
 guessing-0.028
 discovering-0.026
 tours-0.026
 facts-0.026
 motorcycles-0.025
Token was
Token ID373
Feature activation-1.20e+00

Pos logit contributions
kin+0.067
ivery+0.066
ifying+0.063
ishy+0.060
 bruised+0.058
Neg logit contributions
 guessing-0.049
 discovering-0.047
 adventures-0.047
 mills-0.047
 facts-0.046
Token almost
Token ID2048
Feature activation-2.14e+00

Pos logit contributions
kin+0.063
ivery+0.059
ishy+0.056
robe+0.055
ifying+0.054
Neg logit contributions
 guessing-0.044
 discovering-0.040
 seeing-0.037
 tours-0.036
 adventures-0.035
Token as
Token ID355
Feature activation-2.55e-01

Pos logit contributions
ivery+0.110
kin+0.110
robe+0.105
istered+0.098
ishy+0.098
Neg logit contributions
 guessing-0.073
 seeing-0.068
 discovering-0.067
 exploring-0.062
 visiting-0.062
Token tall
Token ID7331
Feature activation+6.82e-01

Pos logit contributions
kin+0.011
ivery+0.011
ifying+0.010
 bruised+0.010
ishy+0.010
Neg logit contributions
 guessing-0.011
 facts-0.010
 tours-0.010
 discovering-0.010
 motorcycles-0.010
Token as
Token ID355
Feature activation-1.19e-01

Pos logit contributions
 guessing+0.037
 facts+0.036
 topics+0.035
 tours+0.035
 motorcycles+0.035
Neg logit contributions
kin-0.022
ivery-0.018
 bruised-0.018
ifying-0.018
ishy-0.017
Token her
Token ID607
Feature activation-1.38e-01

Pos logit contributions
kin+0.006
ivery+0.006
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.004
 facts-0.004
 tours-0.004
 discovering-0.004
 motorcycles-0.004
Token.
Token ID13
Feature activation-4.23e-02

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.006
 facts-0.006
 tours-0.006
 discovering-0.006
 topics-0.006
Token She
Token ID1375
Feature activation+1.88e+00

Pos logit contributions
kin+0.001
ivery+0.001
ifying+0.001
 bruised+0.001
ishy+0.001
Neg logit contributions
 guessing-0.002
 facts-0.002
 tours-0.002
 discovering-0.002
 topics-0.002
Token wanted
Token ID2227
Feature activation+3.03e+00

Pos logit contributions
 facts+0.103
 tours+0.102
 topics+0.098
 guessing+0.095
 events+0.094
Neg logit contributions
kin-0.060
ivery-0.049
 bruised-0.049
 limp-0.046
ishy-0.045
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 topics+0.130
icians+0.126
cles+0.126
 cultures+0.124
 feats+0.122
Neg logit contributions
 sill-0.106
 bruised-0.104
 okay-0.101
 full-0.101
 away-0.098
Token watched
Token ID7342
Feature activation+4.40e-01

Pos logit contributions
 guessing+0.027
 facts+0.027
 tours+0.027
 topics+0.026
 motorcycles+0.026
Neg logit contributions
kin-0.024
ivery-0.021
 bruised-0.020
ifying-0.020
ishy-0.019
Token it
Token ID340
Feature activation+8.78e-01

Pos logit contributions
 guessing+0.016
 facts+0.016
 tours+0.016
 topics+0.015
 motorcycles+0.015
Neg logit contributions
kin-0.022
ivery-0.020
 bruised-0.019
ifying-0.019
ishy-0.018
Token spin
Token ID7906
Feature activation-4.54e-01

Pos logit contributions
 facts+0.038
 guessing+0.038
 motorcycles+0.037
 tours+0.037
 topics+0.037
Neg logit contributions
kin-0.036
ivery-0.033
 bruised-0.031
ifying-0.031
ishy-0.030
Token round
Token ID2835
Feature activation+7.35e-01

Pos logit contributions
kin+0.023
ivery+0.022
 bruised+0.020
ifying+0.020
ishy+0.019
Neg logit contributions
 guessing-0.017
 tours-0.016
 facts-0.015
 discovering-0.015
 motorcycles-0.015
Token and
Token ID290
Feature activation+3.80e-01

Pos logit contributions
 guessing+0.029
 facts+0.029
 tours+0.029
 motorcycles+0.028
 topics+0.028
Neg logit contributions
kin-0.034
ifying-0.030
ivery-0.030
 bruised-0.029
ishy-0.028
Token round
Token ID2835
Feature activation+2.49e-01

Pos logit contributions
 guessing+0.017
 facts+0.017
 tours+0.017
 topics+0.016
 motorcycles+0.016
Neg logit contributions
kin-0.016
ivery-0.014
 bruised-0.013
ifying-0.013
ishy-0.013
Token and
Token ID290
Feature activation+1.27e-01

Pos logit contributions
 guessing+0.012
 facts+0.011
 tours+0.011
 motorcycles+0.011
 topics+0.011
Neg logit contributions
kin-0.010
ivery-0.009
ifying-0.009
 bruised-0.008
ishy-0.008
Token felt
Token ID2936
Feature activation-1.26e+00

Pos logit contributions
 guessing+0.005
 facts+0.005
 tours+0.005
 topics+0.005
 motorcycles+0.005
Neg logit contributions
kin-0.006
ivery-0.005
ifying-0.005
 bruised-0.005
ishy-0.005
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
kin+0.063
ivery+0.061
robe+0.054
ifying+0.053
ishy+0.051
Neg logit contributions
 guessing-0.049
 discovering-0.047
 seeing-0.044
 facts-0.043
 tours-0.043
Token gentle
Token ID10296
Feature activation-4.14e-01

Pos logit contributions
kin+0.065
ivery+0.060
ifying+0.055
istered+0.052
robe+0.052
Neg logit contributions
 guessing-0.067
 adventures-0.063
 tours-0.061
 discovering-0.061
 instructions-0.061
Token breeze
Token ID28633
Feature activation-1.84e+00

Pos logit contributions
kin+0.015
ivery+0.013
ifying+0.012
 bruised+0.012
ishy+0.012
Neg logit contributions
 guessing-0.021
 facts-0.020
 topics-0.020
 tours-0.020
 motorcycles-0.020
Token and
Token ID290
Feature activation-2.52e-01

Pos logit contributions
kin+0.009
ivery+0.008
ifying+0.008
 bruised+0.008
ishy+0.008
Neg logit contributions
 guessing-0.005
 facts-0.005
 tours-0.005
 motorcycles-0.005
 discovering-0.005
Token ran
Token ID4966
Feature activation+4.48e-02

Pos logit contributions
kin+0.011
ivery+0.011
ifying+0.010
 bruised+0.009
ishy+0.009
Neg logit contributions
 guessing-0.011
 facts-0.010
 motorcycles-0.010
 tours-0.010
 discovering-0.010
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 guessing+0.002
 facts+0.001
 tours+0.001
 discovering+0.001
 topics+0.001
Neg logit contributions
kin-0.002
ivery-0.002
ifying-0.002
 bruised-0.002
ishy-0.002
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
kin+0.060
ivery+0.059
ifying+0.057
ishy+0.055
robe+0.052
Neg logit contributions
 guessing-0.031
 tours-0.027
 discovering-0.027
 seeing-0.027
 museums-0.024
Token water
Token ID1660
Feature activation-3.71e-01

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.009
 bruised+0.009
ishy+0.009
Neg logit contributions
 guessing-0.008
 facts-0.008
 tours-0.007
 motorcycles-0.007
 discovering-0.007
Token.
Token ID13
Feature activation-9.66e-01

Pos logit contributions
kin+0.015
ivery+0.015
ifying+0.014
 bruised+0.013
ishy+0.013
Neg logit contributions
 guessing-0.017
 facts-0.016
 tours-0.016
 discovering-0.015
 mills-0.015
Token He
Token ID679
Feature activation+2.85e-01

Pos logit contributions
kin+0.043
ivery+0.042
ifying+0.037
ishy+0.035
 bruised+0.034
Neg logit contributions
 guessing-0.040
 tours-0.039
 facts-0.039
 motorcycles-0.038
 discovering-0.038
Token held
Token ID2714
Feature activation-1.78e+00

Pos logit contributions
 guessing+0.013
 facts+0.013
 tours+0.013
 topics+0.013
 discovering+0.012
Neg logit contributions
kin-0.012
ivery-0.010
 bruised-0.010
ifying-0.010
ishy-0.009
Token out
Token ID503
Feature activation-7.90e-01

Pos logit contributions
ivery+0.136
kin+0.122
ifying+0.116
istered+0.113
robe+0.113
Neg logit contributions
 guessing-0.024
 discovering-0.019
 counting-0.019
 motorcycles-0.019
arm-0.019
Token the
Token ID262
Feature activation-7.78e-02

Pos logit contributions
ivery+0.043
kin+0.043
ifying+0.038
robe+0.038
ishy+0.037
Neg logit contributions
 guessing-0.025
 tours-0.024
 motorcycles-0.023
 facts-0.023
 mills-0.022
Token stick
Token ID4859
Feature activation-4.93e-01

Pos logit contributions
kin+0.004
ivery+0.004
ifying+0.003
 bruised+0.003
ishy+0.003
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 motorcycles-0.003
 discovering-0.003
Token insects
Token ID19435
Feature activation+3.09e-02

Pos logit contributions
 guessing+0.002
 facts+0.002
 tours+0.002
 topics+0.002
 discovering+0.002
Neg logit contributions
kin-0.003
ivery-0.003
ifying-0.003
 bruised-0.003
ishy-0.002
Token away
Token ID1497
Feature activation-1.34e+00

Pos logit contributions
 guessing+0.001
 facts+0.001
 tours+0.001
 discovering+0.001
 topics+0.001
Neg logit contributions
kin-0.001
ivery-0.001
ifying-0.001
 bruised-0.001
ishy-0.001
Token.
Token ID13
Feature activation-9.13e-01

Pos logit contributions
ivery+0.064
kin+0.062
ishy+0.057
 bruised+0.055
ifying+0.054
Neg logit contributions
 guessing-0.051
 discovering-0.049
 facts-0.048
 tours-0.047
 adventures-0.046
Token The
Token ID383
Feature activation+2.38e-01

Pos logit contributions
kin+0.041
ivery+0.038
 bruised+0.034
ishy+0.033
ifying+0.033
Neg logit contributions
 guessing-0.038
 discovering-0.037
 tours-0.037
 motorcycles-0.037
 facts-0.037
Token flower
Token ID15061
Feature activation+5.91e-01

Pos logit contributions
 guessing+0.009
 facts+0.008
 tours+0.008
 motorcycles+0.008
 topics+0.008
Neg logit contributions
kin-0.012
ivery-0.011
ifying-0.011
 bruised-0.010
ishy-0.010
Token felt
Token ID2936
Feature activation-1.92e+00

Pos logit contributions
 guessing+0.026
 facts+0.026
 tours+0.025
 motorcycles+0.025
 topics+0.025
Neg logit contributions
kin-0.025
ivery-0.022
 bruised-0.021
ifying-0.021
ishy-0.020
Token sad
Token ID6507
Feature activation-8.76e-02

Pos logit contributions
kin+0.092
ivery+0.086
robe+0.079
ifying+0.077
ishy+0.076
Neg logit contributions
 guessing-0.075
 discovering-0.074
 seeing-0.069
 facts-0.068
 adventures-0.067
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
kin+0.005
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.003
 facts-0.003
 tours-0.003
 discovering-0.003
 topics-0.003
Token soon
Token ID2582
Feature activation-3.43e-01

Pos logit contributions
kin+0.053
ivery+0.049
ifying+0.045
robe+0.043
ishy+0.042
Neg logit contributions
 guessing-0.056
 discovering-0.054
 facts-0.053
 seeing-0.052
 tours-0.051
Token its
Token ID663
Feature activation-4.28e-01

Pos logit contributions
kin+0.017
ivery+0.016
ifying+0.015
 bruised+0.014
ishy+0.014
Neg logit contributions
 guessing-0.013
 facts-0.012
 discovering-0.012
 tours-0.012
 motorcycles-0.012
Token pet
Token ID4273
Feature activation-1.13e+00

Pos logit contributions
kin+0.019
ivery+0.018
ifying+0.017
 bruised+0.017
ishy+0.016
Neg logit contributions
 guessing-0.018
 facts-0.017
 tours-0.017
 discovering-0.017
 topics-0.016
Token and
Token ID290
Feature activation+1.12e-02

Pos logit contributions
 facts+0.027
 guessing+0.027
 tours+0.027
 topics+0.026
 motorcycles+0.026
Neg logit contributions
kin-0.038
ivery-0.034
 bruised-0.034
ifying-0.033
cloth-0.031
Token Lily
Token ID20037
Feature activation+6.48e-01

Pos logit contributions
 guessing+0.001
 facts+0.001
 tours+0.001
 topics+0.001
 motorcycles+0.001
Neg logit contributions
kin-0.000
ivery-0.000
ifying-0.000
 bruised-0.000
ishy-0.000
Token struggling
Token ID9648
Feature activation+2.97e-01

Pos logit contributions
 facts+0.025
 guessing+0.025
 tours+0.025
 topics+0.024
 motorcycles+0.024
Neg logit contributions
kin-0.030
ivery-0.028
 bruised-0.027
ifying-0.027
cloth-0.025
Token with
Token ID351
Feature activation+3.51e-02

Pos logit contributions
 guessing+0.011
 facts+0.010
 tours+0.010
 motorcycles+0.010
 topics+0.010
Neg logit contributions
kin-0.015
ivery-0.014
 bruised-0.013
ifying-0.013
ishy-0.013
Token the
Token ID262
Feature activation-4.35e-01

Pos logit contributions
 guessing+0.001
 facts+0.001
 tours+0.001
 motorcycles+0.001
 topics+0.001
Neg logit contributions
kin-0.002
ivery-0.002
ifying-0.002
 bruised-0.002
ishy-0.002
Token box
Token ID3091
Feature activation-1.67e+00

Pos logit contributions
kin+0.020
ivery+0.019
ifying+0.019
 bruised+0.018
ishy+0.017
Neg logit contributions
 guessing-0.018
 facts-0.016
 tours-0.016
 motorcycles-0.016
 topics-0.016
Token.
Token ID13
Feature activation-9.56e-02

Pos logit contributions
ivery+0.083
kin+0.082
ifying+0.076
ishy+0.073
 bruised+0.072
Neg logit contributions
 guessing-0.060
 facts-0.055
 discovering-0.054
 tours-0.053
 adventures-0.052
Token She
Token ID1375
Feature activation+1.54e+00

Pos logit contributions
kin+0.003
ivery+0.003
ifying+0.002
 bruised+0.002
ishy+0.002
Neg logit contributions
 guessing-0.005
 facts-0.005
 tours-0.005
 topics-0.005
 motorcycles-0.005
Token smiles
Token ID21845
Feature activation-1.73e-01

Pos logit contributions
 topics+0.060
 guessing+0.060
 discovering+0.060
 tours+0.059
 facts+0.058
Neg logit contributions
kin-0.063
 bruised-0.063
ifying-0.059
 limp-0.059
cloth-0.057
Token and
Token ID290
Feature activation+7.64e-01

Pos logit contributions
kin+0.010
ivery+0.009
ifying+0.009
 bruised+0.009
ishy+0.008
Neg logit contributions
 guessing-0.005
 facts-0.005
 tours-0.005
 discovering-0.005
 motorcycles-0.005
Token walks
Token ID11114
Feature activation+3.52e-01

Pos logit contributions
 guessing+0.030
 topics+0.029
 tours+0.029
 facts+0.028
 mills+0.028
Neg logit contributions
kin-0.034
 bruised-0.032
ivery-0.031
ifying-0.031
ishy-0.030
Token the
Token ID262
Feature activation-3.96e-01

Pos logit contributions
kin+0.017
ivery+0.016
ifying+0.015
 bruised+0.015
ishy+0.015
Neg logit contributions
 guessing-0.010
 tours-0.010
 facts-0.010
 motorcycles-0.010
 discovering-0.009
Token treasure
Token ID14068
Feature activation-3.62e-01

Pos logit contributions
kin+0.022
ivery+0.021
ifying+0.020
 bruised+0.020
ishy+0.019
Neg logit contributions
 guessing-0.013
 facts-0.012
 tours-0.012
 motorcycles-0.012
 topics-0.011
Token and
Token ID290
Feature activation-9.41e-01

Pos logit contributions
kin+0.017
ivery+0.017
ifying+0.016
 bruised+0.015
ishy+0.015
Neg logit contributions
 guessing-0.014
 facts-0.013
 tours-0.013
 motorcycles-0.013
 discovering-0.013
Token give
Token ID1577
Feature activation-5.64e-02

Pos logit contributions
ivery+0.046
kin+0.046
ifying+0.041
ishy+0.041
 bruised+0.040
Neg logit contributions
 guessing-0.036
 facts-0.035
 tours-0.034
 motorcycles-0.034
 discovering-0.033
Token the
Token ID262
Feature activation-4.43e-01

Pos logit contributions
kin+0.003
ivery+0.003
ifying+0.003
 bruised+0.003
ishy+0.003
Neg logit contributions
 guessing-0.002
 facts-0.001
 tours-0.001
 motorcycles-0.001
 discovering-0.001
Token rest
Token ID1334
Feature activation-1.75e+00

Pos logit contributions
kin+0.024
ivery+0.024
ifying+0.022
 bruised+0.022
ishy+0.021
Neg logit contributions
 guessing-0.014
 facts-0.014
 tours-0.013
 motorcycles-0.013
 discovering-0.013
Token to
Token ID284
Feature activation-5.05e-01

Pos logit contributions
ivery+0.090
kin+0.079
 bruised+0.079
ifying+0.077
 limp+0.076
Neg logit contributions
 facts-0.061
 guessing-0.059
 motorcycles-0.056
 tours-0.056
 discovering-0.056
Token the
Token ID262
Feature activation-4.21e-01

Pos logit contributions
kin+0.026
ivery+0.025
ifying+0.023
ishy+0.023
 bruised+0.023
Neg logit contributions
 guessing-0.019
 tours-0.018
 facts-0.017
 discovering-0.017
 motorcycles-0.017
Token museum
Token ID13257
Feature activation+4.20e-01

Pos logit contributions
kin+0.022
ivery+0.021
ifying+0.020
 bruised+0.020
ishy+0.020
Neg logit contributions
 guessing-0.014
 facts-0.014
 tours-0.014
 motorcycles-0.013
 topics-0.013
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 guessing+0.021
 facts+0.020
 tours+0.020
 topics+0.020
 discovering+0.020
Neg logit contributions
kin-0.016
ivery-0.014
ifying-0.013
 bruised-0.013
ishy-0.012
Token They
Token ID1119
Feature activation-8.61e-01

Pos logit contributions
kin+0.012
ivery+0.011
 bruised+0.010
ifying+0.010
ishy+0.010
Neg logit contributions
 guessing-0.011
 facts-0.011
 tours-0.011
 motorcycles-0.010
 topics-0.010
Token
Token ID26391
Feature activation+7.66e-01

Pos logit contributions
kin+0.041
ifying+0.039
 bruised+0.037
ivery+0.035
istered+0.034
Neg logit contributions
 guessing-0.051
arm-0.050
 tours-0.050
 discovering-0.049
 topics-0.049
Token\n
Token ID198
Feature activation+4.17e-01

Pos logit contributions
 guessing+0.031
 facts+0.030
 topics+0.029
 motorcycles+0.028
 tours+0.028
Neg logit contributions
kin-0.036
ivery-0.033
ifying-0.032
 bruised-0.031
ishy-0.030
Token\n
Token ID198
Feature activation+5.00e-01

Pos logit contributions
 guessing+0.017
 facts+0.017
 tours+0.016
 topics+0.016
 motorcycles+0.016
Neg logit contributions
kin-0.019
ivery-0.018
ifying-0.017
 bruised-0.016
ishy-0.016
TokenHis
Token ID6653
Feature activation+1.52e-02

Pos logit contributions
 guessing+0.019
 facts+0.019
 tours+0.018
 motorcycles+0.018
 topics+0.018
Neg logit contributions
kin-0.024
ivery-0.023
ifying-0.021
 bruised-0.021
ishy-0.020
Token dad
Token ID9955
Feature activation+4.59e-01

Pos logit contributions
 guessing+0.001
 facts+0.001
 tours+0.001
 topics+0.001
 motorcycles+0.001
Neg logit contributions
kin-0.001
ivery-0.001
ifying-0.001
 bruised-0.001
ishy-0.001
Token put
Token ID1234
Feature activation-1.69e+00

Pos logit contributions
 guessing+0.021
 facts+0.020
 tours+0.020
 topics+0.020
 discovering+0.019
Neg logit contributions
kin-0.019
ivery-0.017
 bruised-0.016
ifying-0.016
ishy-0.015
Token a
Token ID257
Feature activation-7.39e-01

Pos logit contributions
ivery+0.099
kin+0.095
istered+0.085
ifying+0.084
ishy+0.084
Neg logit contributions
 motorcycles-0.049
 guessing-0.046
 bikes-0.045
 tours-0.044
ons-0.043
Token hand
Token ID1021
Feature activation-1.65e+00

Pos logit contributions
kin+0.033
ivery+0.033
ifying+0.030
 bruised+0.029
ishy+0.029
Neg logit contributions
 guessing-0.030
 facts-0.028
 motorcycles-0.027
 tours-0.027
 topics-0.027
Token on
Token ID319
Feature activation-9.81e-01

Pos logit contributions
kin+0.092
ivery+0.089
ishy+0.081
ifying+0.080
 bruised+0.078
Neg logit contributions
 guessing-0.048
 discovering-0.044
 motorcycles-0.043
 mills-0.041
 to-0.040
Token his
Token ID465
Feature activation+1.23e-01

Pos logit contributions
kin+0.053
ivery+0.053
ishy+0.048
 bruised+0.048
ifying+0.047
Neg logit contributions
 guessing-0.031
 motorcycles-0.030
 tours-0.030
 discovering-0.028
 facts-0.028
Token shoulder
Token ID8163
Feature activation-1.09e+00

Pos logit contributions
 guessing+0.006
 facts+0.006
 tours+0.006
 topics+0.006
 motorcycles+0.006
Neg logit contributions
kin-0.004
ivery-0.004
ifying-0.004
 bruised-0.003
ishy-0.003
Token finally
Token ID3443
Feature activation-1.05e-01

Pos logit contributions
kin+0.022
ivery+0.021
 bruised+0.019
ifying+0.019
ishy+0.018
Neg logit contributions
 guessing-0.019
 tours-0.019
 facts-0.019
 motorcycles-0.018
 discovering-0.018
Token get
Token ID651
Feature activation-1.22e-01

Pos logit contributions
kin+0.005
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.004
Neg logit contributions
 guessing-0.004
 tours-0.004
 facts-0.004
 motorcycles-0.004
 discovering-0.004
Token to
Token ID284
Feature activation+3.23e-01

Pos logit contributions
kin+0.006
ivery+0.005
ifying+0.005
 bruised+0.005
ishy+0.005
Neg logit contributions
 guessing-0.005
 tours-0.005
 facts-0.005
 motorcycles-0.005
 discovering-0.005
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
 guessing+0.013
 facts+0.012
 topics+0.012
 tours+0.012
 discovering+0.012
Neg logit contributions
kin-0.015
ivery-0.014
ifying-0.013
 bruised-0.013
ishy-0.012
Token hotel
Token ID7541
Feature activation+3.21e-01

Pos logit contributions
kin+0.008
ivery+0.007
 bruised+0.007
ifying+0.007
ishy+0.007
Neg logit contributions
 guessing-0.006
 tours-0.005
 facts-0.005
 motorcycles-0.005
 topics-0.005
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
 guessing+0.015
 facts+0.015
 tours+0.014
 topics+0.014
 motorcycles+0.014
Neg logit contributions
kin-0.013
ivery-0.011
ifying-0.011
 bruised-0.011
ishy-0.010
Token It
Token ID632
Feature activation-7.11e-02

Pos logit contributions
kin+0.087
ivery+0.086
 bruised+0.078
 bleed+0.073
ishy+0.070
Neg logit contributions
 motorcycles-0.073
 tours-0.073
 guessing-0.072
 topics-0.071
 museums-0.071
Token is
Token ID318
Feature activation-1.71e+00

Pos logit contributions
kin+0.004
ivery+0.004
ifying+0.004
 bruised+0.004
ishy+0.003
Neg logit contributions
 guessing-0.002
 facts-0.002
 tours-0.002
 topics-0.002
 motorcycles-0.002
Token very
Token ID845
Feature activation-1.42e+00

Pos logit contributions
ivery+0.086
kin+0.078
ishy+0.077
ifying+0.075
rified+0.073
Neg logit contributions
 guessing-0.068
 tours-0.062
 adventures-0.061
 seeing-0.060
 discovering-0.060
Token big
Token ID1263
Feature activation+8.61e-01

Pos logit contributions
ivery+0.065
kin+0.064
ifying+0.062
 bruised+0.061
rified+0.060
Neg logit contributions
 guessing-0.056
 tours-0.055
 facts-0.051
 adventures-0.050
 experiments-0.049
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
 guessing+0.033
 facts+0.032
 tours+0.032
 topics+0.032
 mills+0.031
Neg logit contributions
kin-0.040
 bruised-0.036
ivery-0.035
ifying-0.035
ishy-0.034