TOP ACTIVATIONS
MAX = 0.236

, eleven , twelve , thirteen , fourteen , fifteen ,
, eleven , twelve , thirteen , fourteen , fifteen ,
soldiers . " Left , right , left ,
persistent . Left , right , left again
. Eleven , twelve , thirteen , fourteen , fifteen .
a loud noise . " Pop ! Pop ! Pop !"
slap the bubble . " Pop !" went the bubble ,
noise . " Pop !" Tom and Mia say
can kick the ball the furthe st ?" Alf ie was
popped . " Pop !" They heard a loud
stove . " Pop , pop , pop !"
made a sound . " Pop !" Sara laughed . She
, fifteen , sixteen , seventeen , eighteen , nineteen ,
, fifteen , sixteen , seventeen , eighteen , nineteen ,
. Can you say five kil os ?" uncle Tom said
" They weigh five kil os . That means five
. " Five kil os ," Anna repeated .
a face . Lu cy decided to carefully stand
counted 1 , 2 , 3 , 4 , 5 ,
the bald monkey weighed 4 kil os . The monkey was

MIN ACTIVATIONS
MIN = -0.183

too dangerous , honey " said Mom my . Just
" Not yet ," Lily said . " We have to
't worry ," her friend said . " We 'll help
" Don 't worry ," said the squirrel . " I
's not true ," Sam says . " You can play
" Don 't worry ," said the boy . " I
Don 't worry ," he said . " I will help
okay , Max ," Lily said . " I 'm sorry
OK , kids ," she says . " But next time
share the car ?" Tom says . " OK
I don 't ," Tom says . Their moms
can ," the oct opus said . " We can share
too , Mom ," Tom says . Mom sigh
they are hiding ," Tom says . He opens
Okay , Ben ," Sara said . " We can be
okay , Sara ," Ben said . " I forgive you
I 'm sorry ," Ben says . " I just wanted
of the beach " he said . Jack felt
ily , wait !" he said , running after her .
't know . The sign says no . Maybe it is

INTERVAL 0.131 - 0.236
CONTAINS 0.192%

, four , five , six , seven , eight ,
It had ger ms and bugs in it . Lily and
cried and said , " O w , ow , ow
the best game Ruby had ever played . At
Mia and said , " Hey Mia , do you want

INTERVAL 0.026 - 0.131
CONTAINS 26.786%

saw that the skeleton was very patient , just standing there
a big hug . <|endoftext|> Once upon a time there was
liked to explore new places , but he had never been
and violet is a light purple . Do you want to
were friends forever . <|endoftext|> One day , a boy named

INTERVAL -0.078 - 0.026
CONTAINS 69.101%

beautiful butterfly flying around in the garden . She pointed to
gift from his dad . She says that Sam 's car
. The little girl even noticed new details about the ornament
" Can I help them ?" His mom said ,
are you ? We are looking for you . We are

INTERVAL -0.183 - -0.078
CONTAINS 3.922%

catch it . His friend said , " No , Tim
little girl walking by the farm . She stopped and said
great book . Let 's print out some pictures from it
and fries , and she said , " I want that
They had a big rocket that they pretended to fly to
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
right+0.340
ancy+0.324
 furthe+0.316
rog+0.315
rated+0.314
Neg logit contributions
 he-0.181
 that-0.172
 grandma-0.170
 dentist-0.166
 mail-0.161
Token eleven
Token ID22216
Feature activation+1.48e-01

Pos logit contributions
mer+0.054
illed+0.050
rated+0.049
right+0.049
 perspective+0.047
Neg logit contributions
 mail-0.057
 that-0.053
 he-0.052
 grandma-0.052
 dentist-0.051
Token,
Token ID11
Feature activation+3.29e-02

Pos logit contributions
right+0.317
Joy+0.315
mer+0.314
ancy+0.296
rated+0.296
Neg logit contributions
 he-0.158
 grandma-0.157
 dentist-0.151
 that-0.150
 car-0.147
Token twelve
Token ID14104
Feature activation+1.64e-01

Pos logit contributions
mer+0.044
illed+0.044
 perspective+0.044
right+0.042
ancy+0.040
Neg logit contributions
 mail-0.061
 he-0.059
 that-0.057
 dentist-0.057
 car-0.056
Token,
Token ID11
Feature activation+3.29e-02

Pos logit contributions
Joy+0.353
right+0.352
mer+0.343
ancy+0.335
zzy+0.334
Neg logit contributions
 he-0.172
 that-0.162
 car-0.157
 dentist-0.156
 grandma-0.156
Token thirteen
Token ID28306
Feature activation+2.36e-01

Pos logit contributions
mer+0.043
 perspective+0.042
illed+0.042
right+0.040
ancy+0.038
Neg logit contributions
 mail-0.062
 he-0.060
 that-0.059
 dentist-0.058
 grandma-0.057
Token,
Token ID11
Feature activation+3.49e-02

Pos logit contributions
right+0.541
iona+0.541
Joy+0.531
chie+0.526
 perspective+0.525
Neg logit contributions
 he-0.208
 that-0.203
 mail-0.180
 dreaming-0.178
 dentist-0.171
Token fourteen
Token ID29167
Feature activation+1.11e-01

Pos logit contributions
mer+0.046
right+0.046
illed+0.044
 perspective+0.044
ancy+0.041
Neg logit contributions
 mail-0.062
 he-0.062
 dentist-0.061
 that-0.061
 grandma-0.059
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
right+0.246
mer+0.241
illed+0.226
ancy+0.226
rog+0.224
Neg logit contributions
 he-0.132
 mail-0.122
 grandma-0.121
 dentist-0.116
 that-0.111
Token fifteen
Token ID17280
Feature activation+1.92e-01

Pos logit contributions
mer+0.039
right+0.038
illed+0.037
 perspective+0.035
ancy+0.034
Neg logit contributions
 mail-0.067
 dentist-0.065
 he-0.064
 grandma-0.063
 dreaming-0.062
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
right+0.412
mer+0.395
Joy+0.390
iona+0.389
rog+0.382
Neg logit contributions
 he-0.216
 mail-0.197
 dentist-0.197
 grandma-0.195
 that-0.189
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
right+0.327
ancy+0.300
illed+0.299
rated+0.299
mer+0.298
Neg logit contributions
 that-0.193
 he-0.186
 grandma-0.172
 mail-0.168
 tell-0.164
Token eleven
Token ID22216
Feature activation+1.48e-01

Pos logit contributions
mer+0.052
illed+0.048
right+0.047
rated+0.047
ancy+0.046
Neg logit contributions
 mail-0.055
 that-0.053
 he-0.053
 grandma-0.051
 tell-0.050
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
right+0.354
mer+0.345
rated+0.331
 disgust+0.324
vy+0.323
Neg logit contributions
 that-0.125
 grandma-0.105
 friends-0.103
 mail-0.098
 tells-0.089
Token twelve
Token ID14104
Feature activation+1.62e-01

Pos logit contributions
illed+0.049
mer+0.049
 perspective+0.047
right+0.046
rated+0.046
Neg logit contributions
 mail-0.055
 that-0.055
 he-0.054
 car-0.051
 airplane-0.049
Token,
Token ID11
Feature activation+3.27e-02

Pos logit contributions
right+0.385
mer+0.371
ust+0.368
zzy+0.365
rated+0.364
Neg logit contributions
 that-0.136
 hugs-0.105
 friends-0.105
 he-0.099
 mail-0.099
Token thirteen
Token ID28306
Feature activation+2.35e-01

Pos logit contributions
mer+0.047
illed+0.046
 perspective+0.044
right+0.044
rated+0.042
Neg logit contributions
 mail-0.058
 that-0.057
 he-0.055
 car-0.053
 hugs-0.052
Token,
Token ID11
Feature activation+3.48e-02

Pos logit contributions
right+0.568
iona+0.550
chie+0.549
mer+0.542
Joy+0.541
Neg logit contributions
 that-0.181
 he-0.163
 for-0.138
 friends-0.131
 dreaming-0.131
Token fourteen
Token ID29167
Feature activation+1.09e-01

Pos logit contributions
right+0.047
mer+0.046
illed+0.045
 perspective+0.044
rated+0.041
Neg logit contributions
 mail-0.060
 that-0.060
 he-0.060
 dentist-0.058
 grandma-0.058
Token,
Token ID11
Feature activation+3.30e-02

Pos logit contributions
right+0.252
mer+0.243
illed+0.233
ancy+0.226
chie+0.224
Neg logit contributions
 he-0.123
 mail-0.111
 grandma-0.106
 that-0.105
 say-0.101
Token fifteen
Token ID17280
Feature activation+1.90e-01

Pos logit contributions
mer+0.041
right+0.039
illed+0.039
 perspective+0.036
ancy+0.035
Neg logit contributions
 mail-0.065
 he-0.063
 dentist-0.061
 that-0.060
 grandma-0.060
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
right+0.467
mer+0.437
illed+0.434
ust+0.427
rated+0.425
Neg logit contributions
 that-0.152
 he-0.152
 friends-0.140
 mail-0.129
 hugs-0.129
Token soldiers
Token ID5795
Feature activation-3.13e-02

Pos logit contributions
right+0.032
 thirteen+0.031
 steep+0.030
mer+0.029
 length+0.029
Neg logit contributions
grand-0.023
 dentist-0.022
 mail-0.021
 granddaughter-0.021
 grandma-0.020
Token.
Token ID13
Feature activation+7.66e-03

Pos logit contributions
grand+0.050
 dentist+0.050
 mail+0.048
 granddaughter+0.048
 grandma+0.048
Neg logit contributions
right-0.043
mer-0.043
 thirteen-0.042
 steep-0.041
 length-0.040
Token
Token ID198
Feature activation-1.91e-02

Pos logit contributions
right+0.014
 thirteen+0.013
mer+0.013
 steep+0.013
ancy+0.013
Neg logit contributions
 dentist-0.010
 mail-0.009
grand-0.009
 dreaming-0.009
 granddaughter-0.009
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
 mail+0.018
 dentist+0.017
grand+0.017
 granddaughter+0.016
 dreaming+0.016
Neg logit contributions
right-0.040
 thirteen-0.039
mer-0.038
 steep-0.038
ancy-0.038
Token"
Token ID1
Feature activation+2.31e-02

Pos logit contributions
grand+0.025
 mail+0.025
 dentist+0.023
 grandma+0.023
 granddaughter+0.023
Neg logit contributions
 thirteen-0.029
right-0.029
mer-0.028
 steep-0.027
 length-0.027
TokenLeft
Token ID18819
Feature activation+2.34e-01

Pos logit contributions
right+0.037
 thirteen+0.035
 steep+0.035
 centimeters+0.033
illed+0.033
Neg logit contributions
 mail-0.035
grand-0.034
 dentist-0.033
 granddaughter-0.032
 grandma-0.032
Token,
Token ID11
Feature activation+3.29e-02

Pos logit contributions
 thirteen+0.426
right+0.420
 steep+0.404
 seventeen+0.394
Joy+0.382
Neg logit contributions
 dreaming-0.296
 tell-0.292
 grandma-0.291
 daddy-0.274
 mail-0.273
Token right
Token ID826
Feature activation+1.19e-01

Pos logit contributions
right+0.057
 thirteen+0.055
mer+0.054
ancy+0.053
 length+0.053
Neg logit contributions
 mail-0.043
 dentist-0.042
 grandma-0.041
grand-0.041
 granddaughter-0.039
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
right+0.224
 thirteen+0.215
illed+0.202
ancy+0.201
chie+0.199
Neg logit contributions
 mail-0.147
 grandma-0.147
 dentist-0.144
 dreaming-0.143
 that-0.136
Token left
Token ID1364
Feature activation-3.47e-02

Pos logit contributions
right+0.058
 thirteen+0.057
mer+0.056
 steep+0.055
 length+0.054
Neg logit contributions
 mail-0.043
grand-0.043
 dentist-0.042
 granddaughter-0.040
 grandma-0.040
Token,
Token ID11
Feature activation+3.39e-02

Pos logit contributions
 mail+0.051
 grandma+0.050
 dentist+0.050
grand+0.049
 granddaughter+0.047
Neg logit contributions
right-0.056
 thirteen-0.054
 length-0.051
illed-0.051
 steep-0.050
Token persistent
Token ID16218
Feature activation+5.47e-02

Pos logit contributions
 thirteen+0.038
right+0.037
aney+0.036
ancy+0.036
Joy+0.036
Neg logit contributions
 mail-0.025
 dreaming-0.024
 grandma-0.021
 he-0.021
 that-0.020
Token.
Token ID13
Feature activation+8.95e-03

Pos logit contributions
 thirteen+0.112
 seventeen+0.111
right+0.109
ancy+0.109
Joy+0.107
Neg logit contributions
 that-0.061
 he-0.058
 mail-0.052
 dreaming-0.051
 Teddy-0.050
Token
Token ID220
Feature activation+4.04e-02

Pos logit contributions
right+0.019
 thirteen+0.018
mer+0.018
 steep+0.018
 length+0.018
Neg logit contributions
 mail-0.009
grand-0.009
 dentist-0.008
 grandma-0.008
 granddaughter-0.008
Token
Token ID198
Feature activation-2.13e-02

Pos logit contributions
right+0.087
 thirteen+0.085
mer+0.083
 steep+0.082
 length+0.082
Neg logit contributions
 mail-0.036
grand-0.035
 dentist-0.035
 granddaughter-0.033
 grandma-0.032
Token
Token ID198
Feature activation-2.00e-02

Pos logit contributions
 mail+0.017
grand+0.017
 dentist+0.017
 grandma+0.016
 granddaughter+0.016
Neg logit contributions
right-0.048
 thirteen-0.047
mer-0.046
 steep-0.045
 length-0.045
TokenLeft
Token ID18819
Feature activation+2.33e-01

Pos logit contributions
 mail+0.038
grand+0.037
 dentist+0.037
 doctor+0.036
 grandma+0.035
Neg logit contributions
mer-0.023
 thirteen-0.022
right-0.022
 steep-0.021
ancy-0.020
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
 thirteen+0.383
 steep+0.372
right+0.368
 seventeen+0.362
Joy+0.350
Neg logit contributions
 dreaming-0.326
 tell-0.318
 daddy-0.313
 grandma-0.302
 he-0.302
Token right
Token ID826
Feature activation+1.22e-01

Pos logit contributions
right+0.064
 thirteen+0.063
 length+0.061
ancy+0.061
mer+0.060
Neg logit contributions
 mail-0.037
 grandma-0.036
 dreaming-0.036
 dentist-0.034
 he-0.033
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
right+0.239
 thirteen+0.236
ancy+0.225
aney+0.223
 seventeen+0.222
Neg logit contributions
 that-0.138
 mail-0.138
 he-0.135
 dreaming-0.129
 grandma-0.128
Token left
Token ID1364
Feature activation-3.47e-02

Pos logit contributions
right+0.066
 thirteen+0.065
 length+0.063
ancy+0.063
mer+0.062
Neg logit contributions
 mail-0.035
 grandma-0.034
 dreaming-0.033
 dentist-0.032
 he-0.031
Token again
Token ID757
Feature activation+1.11e-01

Pos logit contributions
 mail+0.038
 grandma+0.037
 dentist+0.036
 doctor+0.035
 dreaming+0.035
Neg logit contributions
right-0.069
 thirteen-0.068
ancy-0.065
 seventeen-0.065
 steep-0.064
Token.
Token ID13
Feature activation+7.73e-03

Pos logit contributions
 mail+0.011
 grandma+0.010
 granddaughter+0.010
 dreaming+0.010
 dentist+0.010
Neg logit contributions
right-0.014
 thirteen-0.013
mer-0.012
 steep-0.012
 length-0.012
Token Eleven
Token ID37339
Feature activation+1.47e-01

Pos logit contributions
right+0.017
 thirteen+0.017
mer+0.017
 length+0.016
 steep+0.016
Neg logit contributions
 mail-0.007
 grandma-0.007
grand-0.006
 dentist-0.006
 granddaughter-0.006
Token,
Token ID11
Feature activation+3.57e-02

Pos logit contributions
right+0.296
 steep+0.291
 furthe+0.282
 perspective+0.281
mer+0.280
Neg logit contributions
 grandma-0.165
 mail-0.158
 that-0.149
 he-0.143
 tell-0.137
Token twelve
Token ID14104
Feature activation+1.59e-01

Pos logit contributions
right+0.046
illed+0.044
 perspective+0.044
mer+0.044
ancy+0.044
Neg logit contributions
 mail-0.067
 dentist-0.064
 grandma-0.063
 doctor-0.060
 dreaming-0.060
Token,
Token ID11
Feature activation+3.51e-02

Pos logit contributions
right+0.334
Joy+0.329
mer+0.320
ancy+0.315
ust+0.313
Neg logit contributions
 grandma-0.161
 mail-0.159
 he-0.156
 that-0.156
 tell-0.145
Token thirteen
Token ID28306
Feature activation+2.30e-01

Pos logit contributions
right+0.037
 perspective+0.037
illed+0.036
mer+0.036
ancy+0.034
Neg logit contributions
 mail-0.073
 dentist-0.069
 grandma-0.069
 dreaming-0.067
 he-0.067
Token,
Token ID11
Feature activation+3.44e-02

Pos logit contributions
right+0.526
chie+0.505
Joy+0.503
 perspective+0.497
ust+0.493
Neg logit contributions
 that-0.207
 he-0.206
 mail-0.188
 say-0.185
 dreaming-0.183
Token fourteen
Token ID29167
Feature activation+1.08e-01

Pos logit contributions
right+0.036
mer+0.035
 perspective+0.034
illed+0.033
ancy+0.032
Neg logit contributions
 mail-0.070
 dentist-0.068
 grandma-0.067
 dreaming-0.066
 he-0.066
Token,
Token ID11
Feature activation+3.16e-02

Pos logit contributions
right+0.234
mer+0.224
ancy+0.213
illed+0.212
chie+0.211
Neg logit contributions
 he-0.130
 mail-0.126
 grandma-0.122
 dentist-0.115
 dreaming-0.109
Token fifteen
Token ID17280
Feature activation+1.88e-01

Pos logit contributions
right+0.033
mer+0.032
illed+0.030
 perspective+0.029
ancy+0.029
Neg logit contributions
 mail-0.067
 dentist-0.064
 grandma-0.063
 he-0.062
 dreaming-0.062
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
right+0.418
Joy+0.404
iona+0.402
mer+0.394
rog+0.388
Neg logit contributions
 he-0.195
 mail-0.178
 grandma-0.172
 that-0.171
 dentist-0.165
Token a
Token ID257
Feature activation-1.00e-02

Pos logit contributions
right+0.017
 thirteen+0.016
mer+0.016
 steep+0.016
 length+0.016
Neg logit contributions
 mail-0.009
grand-0.009
 dentist-0.009
 grandma-0.009
 granddaughter-0.009
Token loud
Token ID7812
Feature activation+4.85e-02

Pos logit contributions
grand+0.014
 mail+0.014
 dentist+0.014
 granddaughter+0.013
 grandma+0.013
Neg logit contributions
right-0.016
 thirteen-0.016
mer-0.016
 steep-0.015
 length-0.015
Token noise
Token ID7838
Feature activation-1.54e-02

Pos logit contributions
right+0.092
 thirteen+0.091
mer+0.088
 steep+0.086
 length+0.086
Neg logit contributions
grand-0.055
 dentist-0.053
 mail-0.053
 granddaughter-0.052
 grandma-0.051
Token.
Token ID13
Feature activation+7.19e-03

Pos logit contributions
 mail+0.023
 dentist+0.022
grand+0.022
 grandma+0.022
 dreaming+0.021
Neg logit contributions
right-0.024
 thirteen-0.024
mer-0.023
 steep-0.023
 length-0.023
Token "
Token ID366
Feature activation+6.23e-02

Pos logit contributions
right+0.013
 thirteen+0.013
 steep+0.012
mer+0.012
ancy+0.012
Neg logit contributions
grand-0.009
 dentist-0.009
 mail-0.009
 granddaughter-0.008
 dreaming-0.008
TokenPop
Token ID16979
Feature activation+2.15e-01

Pos logit contributions
right+0.091
 thirteen+0.088
 steep+0.087
mer+0.087
 length+0.084
Neg logit contributions
 mail-0.100
grand-0.098
 dentist-0.097
 grandma-0.095
 granddaughter-0.094
Token!
Token ID0
Feature activation+2.54e-02

Pos logit contributions
right+0.390
 perspective+0.378
chie+0.364
 thirteen+0.362
 length+0.358
Neg logit contributions
 dentist-0.273
 mail-0.270
 grandma-0.269
 dreaming-0.265
 that-0.262
Token Pop
Token ID8099
Feature activation+1.62e-01

Pos logit contributions
right+0.047
 thirteen+0.046
mer+0.045
 steep+0.044
 length+0.044
Neg logit contributions
 mail-0.030
grand-0.030
 dentist-0.030
 grandma-0.028
 granddaughter-0.028
Token!
Token ID0
Feature activation+2.62e-02

Pos logit contributions
right+0.282
 perspective+0.282
 thirteen+0.276
 length+0.267
illed+0.265
Neg logit contributions
 mail-0.215
 that-0.212
 grandma-0.211
 dentist-0.211
 Teddy-0.208
Token Pop
Token ID8099
Feature activation+1.63e-01

Pos logit contributions
right+0.048
 thirteen+0.048
mer+0.046
 steep+0.045
 length+0.045
Neg logit contributions
 mail-0.031
 dentist-0.031
grand-0.031
 grandma-0.029
 granddaughter-0.029
Token!"
Token ID2474
Feature activation+1.43e-02

Pos logit contributions
right+0.281
 perspective+0.280
 thirteen+0.273
 length+0.267
illed+0.265
Neg logit contributions
 mail-0.215
 that-0.213
 dentist-0.211
 grandma-0.209
 he-0.208
Token slap
Token ID23905
Feature activation+1.41e-02

Pos logit contributions
 mail+0.006
 tell+0.006
 say+0.005
 send+0.005
 write+0.005
Neg logit contributions
right-0.009
 thirteen-0.009
mer-0.009
illed-0.008
 steep-0.008
Token the
Token ID262
Feature activation-6.60e-03

Pos logit contributions
right+0.033
illed+0.032
 perspective+0.032
ancy+0.032
mer+0.032
Neg logit contributions
 mail-0.010
 grandma-0.010
 Brown-0.009
 he-0.009
 Teddy-0.009
Token bubble
Token ID14310
Feature activation+1.86e-03

Pos logit contributions
 mail+0.009
 dentist+0.008
 grandma+0.008
 granddaughter+0.008
 doctor+0.008
Neg logit contributions
right-0.012
ancy-0.012
 thirteen-0.012
illed-0.011
aney-0.011
Token.
Token ID13
Feature activation+8.15e-03

Pos logit contributions
right+0.003
 thirteen+0.003
ancy+0.003
 perspective+0.003
illed+0.003
Neg logit contributions
 mail-0.002
 he-0.002
 doctor-0.002
 that-0.002
 tell-0.002
Token "
Token ID366
Feature activation+6.11e-02

Pos logit contributions
right+0.017
 thirteen+0.016
mer+0.016
 length+0.016
 steep+0.016
Neg logit contributions
 mail-0.008
 dentist-0.008
grand-0.008
 grandma-0.008
 granddaughter-0.007
TokenPop
Token ID16979
Feature activation+2.12e-01

Pos logit contributions
right+0.080
 steep+0.078
ancy+0.078
mer+0.076
illed+0.076
Neg logit contributions
 mail-0.103
grand-0.100
 dentist-0.099
 doctor-0.094
 grandma-0.093
Token!"
Token ID2474
Feature activation+1.42e-02

Pos logit contributions
right+0.428
ust+0.401
 perspective+0.401
aney+0.398
chie+0.396
Neg logit contributions
 that-0.248
 Teddy-0.220
 bear-0.216
 he-0.204
 dreaming-0.202
Token went
Token ID1816
Feature activation+1.70e-02

Pos logit contributions
right+0.025
 thirteen+0.024
 steep+0.024
mer+0.023
 disgust+0.022
Neg logit contributions
grand-0.017
 dentist-0.017
 grandparents-0.017
 granddaughter-0.016
 mail-0.016
Token the
Token ID262
Feature activation-5.46e-03

Pos logit contributions
right+0.027
mer+0.026
 steep+0.025
 thirteen+0.025
oe+0.024
Neg logit contributions
grand-0.026
 mail-0.025
 dentist-0.025
 granddaughter-0.024
 grandma-0.024
Token bubble
Token ID14310
Feature activation+1.99e-03

Pos logit contributions
 mail+0.007
 dentist+0.006
 grandma+0.006
 granddaughter+0.006
 doctor+0.006
Neg logit contributions
right-0.010
 thirteen-0.010
ancy-0.010
mer-0.010
illed-0.010
Token,
Token ID11
Feature activation+2.87e-02

Pos logit contributions
right+0.003
 thirteen+0.003
mer+0.003
 steep+0.003
ancy+0.003
Neg logit contributions
 mail-0.003
 dentist-0.003
grand-0.003
 grandma-0.003
 dreaming-0.003
Token noise
Token ID7838
Feature activation-1.90e-02

Pos logit contributions
right+0.099
 thirteen+0.095
mer+0.095
 steep+0.093
ancy+0.092
Neg logit contributions
 mail-0.041
 dentist-0.039
grand-0.038
 grandma-0.038
 dreaming-0.036
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 mail+0.026
 dentist+0.024
 he+0.024
 that+0.024
 grandma+0.023
Neg logit contributions
right-0.034
ancy-0.032
 thirteen-0.032
illed-0.032
 perspective-0.032
Token
Token ID198
Feature activation-1.68e-02

Pos logit contributions
mer+0.029
right+0.029
 thirteen+0.028
 length+0.028
 perspective+0.027
Neg logit contributions
 mail-0.009
 grandma-0.009
 dentist-0.009
grand-0.008
 granddaughter-0.007
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
 mail+0.013
grand+0.013
 dentist+0.013
 grandma+0.012
 granddaughter+0.012
Neg logit contributions
right-0.038
 thirteen-0.038
mer-0.037
 steep-0.036
 length-0.036
Token"
Token ID1
Feature activation+2.68e-02

Pos logit contributions
 mail+0.020
 dentist+0.020
 he+0.020
 grandma+0.020
grand+0.019
Neg logit contributions
right-0.029
 perspective-0.029
mer-0.029
 length-0.028
iful-0.028
TokenPop
Token ID16979
Feature activation+2.12e-01

Pos logit contributions
illed+0.039
right+0.039
ancy+0.038
 steep+0.037
 perspective+0.037
Neg logit contributions
 mail-0.043
 dentist-0.041
 doctor-0.038
 grandma-0.038
 cash-0.037
Token!"
Token ID2474
Feature activation+4.92e-03

Pos logit contributions
right+0.435
ust+0.420
chie+0.419
illed+0.402
aney+0.397
Neg logit contributions
 that-0.243
 mail-0.214
 grandma-0.204
 dentist-0.196
 say-0.196
Token Tom
Token ID4186
Feature activation+4.91e-02

Pos logit contributions
right+0.009
 thirteen+0.009
mer+0.009
 length+0.009
 steep+0.009
Neg logit contributions
 mail-0.006
 dentist-0.006
grand-0.006
 grandma-0.005
 granddaughter-0.005
Token and
Token ID290
Feature activation-5.44e-02

Pos logit contributions
right+0.122
mer+0.119
 thirteen+0.118
 length+0.117
ancy+0.117
Neg logit contributions
 mail-0.028
 dentist-0.026
 grandma-0.025
grand-0.025
 granddaughter-0.023
Token Mia
Token ID32189
Feature activation+1.02e-01

Pos logit contributions
 mail+0.094
 dentist+0.093
 grandma+0.092
grand+0.090
 granddaughter+0.090
Neg logit contributions
right-0.072
 thirteen-0.071
mer-0.070
 perspective-0.068
 length-0.068
Token say
Token ID910
Feature activation-1.59e-01

Pos logit contributions
right+0.272
mer+0.268
 thirteen+0.266
 length+0.264
ancy+0.263
Neg logit contributions
 mail-0.043
 tell-0.035
 say-0.032
 grandma-0.031
grand-0.031
Token can
Token ID460
Feature activation+2.59e-02

Pos logit contributions
right+0.069
 thirteen+0.069
mer+0.068
ancy+0.066
 steep+0.065
Neg logit contributions
 mail-0.058
grand-0.055
 grandma-0.054
 dentist-0.054
 granddaughter-0.052
Token kick
Token ID4829
Feature activation+4.13e-02

Pos logit contributions
right+0.050
 thirteen+0.049
mer+0.048
 steep+0.048
 length+0.047
Neg logit contributions
 mail-0.029
grand-0.028
 dentist-0.028
 grandma-0.027
 granddaughter-0.026
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
right+0.095
illed+0.091
ancy+0.091
 thirteen+0.090
aney+0.089
Neg logit contributions
 mail-0.038
 grandma-0.034
 that-0.033
 Teddy-0.031
 he-0.030
Token ball
Token ID2613
Feature activation+2.12e-03

Pos logit contributions
 mail+0.012
 dentist+0.011
 car+0.011
 soldier+0.011
 doctor+0.011
Neg logit contributions
right-0.020
illed-0.019
ancy-0.019
mer-0.019
 thirteen-0.019
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
right+0.004
 thirteen+0.004
 length+0.004
ancy+0.004
mer+0.004
Neg logit contributions
 mail-0.003
 dentist-0.003
grand-0.003
 grandma-0.003
 doctor-0.002
Token furthe
Token ID46186
Feature activation+2.11e-01

Pos logit contributions
 mail+0.011
 car+0.010
 grandma+0.010
 soldier+0.010
 dentist+0.010
Neg logit contributions
right-0.022
illed-0.022
ancy-0.022
 thirteen-0.021
mer-0.021
Tokenst
Token ID301
Feature activation+9.60e-02

Pos logit contributions
 thirteen+0.357
 seventeen+0.348
 steep+0.341
right+0.314
anne+0.308
Neg logit contributions
 mail-0.316
 dentist-0.279
 dreaming-0.266
 that-0.266
 grandma-0.265
Token?"
Token ID1701
Feature activation+9.66e-03

Pos logit contributions
right+0.171
 thirteen+0.170
 centimeters+0.164
 length+0.161
Joy+0.161
Neg logit contributions
 that-0.110
 mail-0.108
 doctor-0.103
 grandma-0.102
 he-0.100
Token Alf
Token ID17136
Feature activation+5.54e-04

Pos logit contributions
right+0.015
 perspective+0.015
mer+0.015
 thirteen+0.014
rog+0.014
Neg logit contributions
 dentist-0.012
grand-0.012
 mail-0.011
 dreaming-0.011
 thanking-0.011
Tokenie
Token ID494
Feature activation+1.75e-02

Pos logit contributions
 thirteen+0.001
right+0.001
 steep+0.001
 length+0.001
 furthe+0.001
Neg logit contributions
 dentist-0.001
grand-0.001
 mail-0.001
 grandma-0.001
 granddaughter-0.001
Token was
Token ID373
Feature activation+2.56e-02

Pos logit contributions
right+0.040
 thirteen+0.039
 steep+0.039
 perspective+0.039
 length+0.038
Neg logit contributions
 mail-0.014
 grandma-0.014
 granddaughter-0.013
 dentist-0.013
 tell-0.012
Token popped
Token ID22928
Feature activation+6.17e-02

Pos logit contributions
 mail+0.074
 dentist+0.074
 grandma+0.072
grand+0.070
 dreaming+0.069
Neg logit contributions
right-0.101
mer-0.097
 thirteen-0.096
ancy-0.094
 length-0.094
Token.
Token ID13
Feature activation+7.14e-03

Pos logit contributions
 perspective+0.140
right+0.139
mer+0.138
rog+0.135
Joy+0.134
Neg logit contributions
 Brown-0.053
 mail-0.048
 that-0.046
 Teddy-0.045
 grandma-0.045
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
right+0.015
 thirteen+0.014
mer+0.014
 steep+0.014
 length+0.014
Neg logit contributions
 mail-0.007
 dentist-0.007
grand-0.007
 grandma-0.007
 granddaughter-0.006
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
 mail+0.018
 dentist+0.018
grand+0.018
 granddaughter+0.017
 dreaming+0.017
Neg logit contributions
right-0.038
 thirteen-0.037
 steep-0.036
mer-0.036
ancy-0.036
Token"
Token ID1
Feature activation+2.61e-02

Pos logit contributions
 mail+0.025
grand+0.025
 grandma+0.025
 dentist+0.025
 granddaughter+0.023
Neg logit contributions
right-0.028
mer-0.028
 thirteen-0.027
 length-0.027
 perspective-0.027
TokenPop
Token ID16979
Feature activation+2.11e-01

Pos logit contributions
 steep+0.035
right+0.035
illed+0.034
 length+0.033
 perspective+0.033
Neg logit contributions
 mail-0.044
 dentist-0.043
grand-0.042
 grandma-0.042
 doctor-0.040
Token!"
Token ID2474
Feature activation+7.40e-03

Pos logit contributions
right+0.435
chie+0.420
ust+0.417
 perspective+0.413
illed+0.405
Neg logit contributions
 that-0.235
 he-0.201
 bear-0.198
 Brown-0.196
 dentist-0.192
Token They
Token ID1119
Feature activation-3.93e-03

Pos logit contributions
right+0.011
 thirteen+0.011
mer+0.010
 steep+0.010
 length+0.010
Neg logit contributions
 mail-0.011
grand-0.011
 dentist-0.011
 granddaughter-0.011
 grandma-0.011
Token heard
Token ID2982
Feature activation+6.57e-03

Pos logit contributions
 mail+0.005
 grandma+0.005
 dentist+0.005
grand+0.005
 granddaughter+0.005
Neg logit contributions
right-0.007
 thirteen-0.007
mer-0.007
 steep-0.007
 length-0.007
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
 thirteen+0.010
right+0.010
 steep+0.009
ancy+0.009
 length+0.009
Neg logit contributions
grand-0.010
 dreaming-0.009
 dentist-0.009
 mail-0.008
 granddaughter-0.008
Token loud
Token ID7812
Feature activation+4.47e-02

Pos logit contributions
 mail+0.016
 dentist+0.015
 grandma+0.014
 doctor+0.014
grand+0.014
Neg logit contributions
right-0.023
 thirteen-0.022
mer-0.022
ancy-0.021
illed-0.021
Token stove
Token ID27635
Feature activation-1.36e-02

Pos logit contributions
 mail+0.004
 dentist+0.004
 grandma+0.004
grand+0.004
 granddaughter+0.004
Neg logit contributions
right-0.006
 thirteen-0.006
mer-0.006
illed-0.006
 steep-0.006
Token.
Token ID13
Feature activation+8.74e-03

Pos logit contributions
 mail+0.014
 dentist+0.013
 grandma+0.013
grand+0.012
 dreaming+0.012
Neg logit contributions
right-0.028
 thirteen-0.027
mer-0.027
 steep-0.026
ancy-0.026
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
right+0.015
 thirteen+0.015
 steep+0.015
mer+0.014
ancy+0.014
Neg logit contributions
grand-0.011
 dentist-0.011
 mail-0.011
 granddaughter-0.011
 dreaming-0.010
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 dentist+0.020
grand+0.020
 mail+0.020
 granddaughter+0.020
 dreaming+0.019
Neg logit contributions
right-0.034
 steep-0.032
 thirteen-0.032
ancy-0.032
mer-0.031
Token"
Token ID1
Feature activation+2.69e-02

Pos logit contributions
 mail+0.027
grand+0.026
 dentist+0.026
 grandma+0.025
 granddaughter+0.025
Neg logit contributions
right-0.025
 thirteen-0.024
mer-0.023
 steep-0.023
 length-0.022
TokenPop
Token ID16979
Feature activation+2.11e-01

Pos logit contributions
right+0.044
 steep+0.043
mer+0.042
illed+0.041
 perspective+0.041
Neg logit contributions
 mail-0.041
 dentist-0.039
grand-0.037
 grandma-0.036
 granddaughter-0.036
Token,
Token ID11
Feature activation+3.48e-02

Pos logit contributions
right+0.399
chie+0.386
 perspective+0.385
 thirteen+0.368
ust+0.367
Neg logit contributions
 dentist-0.249
 mail-0.248
 that-0.247
 grandma-0.246
 dreaming-0.244
Token pop
Token ID1461
Feature activation+5.80e-02

Pos logit contributions
 thirteen+0.055
right+0.054
 steep+0.051
mer+0.051
 length+0.050
Neg logit contributions
grand-0.053
 mail-0.050
 dentist-0.050
 granddaughter-0.050
 grandma-0.048
Token,
Token ID11
Feature activation+3.46e-02

Pos logit contributions
 perspective+0.123
right+0.122
illed+0.117
mer+0.114
ust+0.112
Neg logit contributions
 that-0.064
 grandma-0.055
 say-0.053
 him-0.053
 Teddy-0.053
Token pop
Token ID1461
Feature activation+5.83e-02

Pos logit contributions
right+0.053
 thirteen+0.052
mer+0.050
 steep+0.049
 length+0.048
Neg logit contributions
grand-0.053
 mail-0.052
 dentist-0.052
 granddaughter-0.051
 grandma-0.050
Token!"
Token ID2474
Feature activation+7.92e-03

Pos logit contributions
 perspective+0.121
right+0.120
illed+0.117
mer+0.113
ust+0.110
Neg logit contributions
 that-0.066
 say-0.055
 grandma-0.055
 him-0.055
 mail-0.055
Token made
Token ID925
Feature activation+1.00e-03

Pos logit contributions
 he+0.022
 mail+0.021
 tell+0.020
 doctor+0.020
 dentist+0.020
Neg logit contributions
 perspective-0.045
right-0.044
ancy-0.043
illed-0.042
 thirteen-0.042
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
 thirteen+0.002
right+0.002
 length+0.002
 steep+0.002
ancy+0.002
Neg logit contributions
grand-0.001
 dentist-0.001
 granddaughter-0.001
 mail-0.001
 dreaming-0.001
Token sound
Token ID2128
Feature activation-7.95e-03

Pos logit contributions
 mail+0.016
 dentist+0.015
 grandma+0.015
grand+0.014
 granddaughter+0.014
Neg logit contributions
right-0.020
 thirteen-0.020
mer-0.019
ancy-0.019
illed-0.019
Token.
Token ID13
Feature activation+3.87e-03

Pos logit contributions
 mail+0.011
 dentist+0.011
 he+0.010
 dreaming+0.010
 grandma+0.010
Neg logit contributions
right-0.014
 thirteen-0.013
ancy-0.013
mer-0.013
 perspective-0.013
Token "
Token ID366
Feature activation+5.99e-02

Pos logit contributions
right+0.009
 thirteen+0.009
mer+0.009
 length+0.008
 steep+0.008
Neg logit contributions
 mail-0.003
 dentist-0.003
 grandma-0.003
grand-0.003
 granddaughter-0.003
TokenPop
Token ID16979
Feature activation+2.08e-01

Pos logit contributions
right+0.095
 steep+0.093
 thirteen+0.092
 length+0.092
mer+0.091
Neg logit contributions
 mail-0.087
grand-0.084
 dentist-0.081
 grandma-0.077
 granddaughter-0.076
Token!"
Token ID2474
Feature activation+7.93e-03

Pos logit contributions
right+0.418
ust+0.399
 perspective+0.390
chie+0.389
illed+0.385
Neg logit contributions
 that-0.248
 Teddy-0.226
 mail-0.222
 bear-0.219
 dreaming-0.219
Token Sara
Token ID24799
Feature activation+7.45e-02

Pos logit contributions
right+0.016
 thirteen+0.015
mer+0.015
 steep+0.015
ancy+0.015
Neg logit contributions
grand-0.008
 mail-0.008
 dentist-0.008
 granddaughter-0.008
 grandma-0.008
Token laughed
Token ID13818
Feature activation+8.39e-03

Pos logit contributions
right+0.192
 perspective+0.191
 steep+0.187
 thirteen+0.186
ancy+0.185
Neg logit contributions
 mail-0.035
 he-0.032
 tell-0.031
 say-0.030
 grandma-0.029
Token.
Token ID13
Feature activation+3.83e-03

Pos logit contributions
right+0.014
 thirteen+0.014
mer+0.013
 length+0.013
 steep+0.013
Neg logit contributions
 mail-0.012
 grandma-0.011
 dentist-0.011
 dreaming-0.011
 he-0.011
Token She
Token ID1375
Feature activation-3.21e-02

Pos logit contributions
right+0.009
mer+0.009
rated+0.009
 thirteen+0.009
 centimeters+0.009
Neg logit contributions
 mail-0.002
 grandma-0.002
 doctor-0.002
 he-0.002
 dentist-0.002
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
right+0.246
mer+0.241
illed+0.226
ancy+0.226
rog+0.224
Neg logit contributions
 he-0.132
 mail-0.122
 grandma-0.121
 dentist-0.116
 that-0.111
Token fifteen
Token ID17280
Feature activation+1.92e-01

Pos logit contributions
mer+0.039
right+0.038
illed+0.037
 perspective+0.035
ancy+0.034
Neg logit contributions
 mail-0.067
 dentist-0.065
 he-0.064
 grandma-0.063
 dreaming-0.062
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
right+0.412
mer+0.395
Joy+0.390
iona+0.389
rog+0.382
Neg logit contributions
 he-0.216
 mail-0.197
 dentist-0.197
 grandma-0.195
 that-0.189
Token sixteen
Token ID27913
Feature activation+1.40e-01

Pos logit contributions
right+0.032
mer+0.032
 perspective+0.031
illed+0.031
ancy+0.028
Neg logit contributions
 mail-0.072
 dentist-0.069
 he-0.068
 grandma-0.068
 dreaming-0.067
Token,
Token ID11
Feature activation+3.37e-02

Pos logit contributions
right+0.316
mer+0.312
Joy+0.299
iona+0.297
ancy+0.296
Neg logit contributions
 he-0.142
 mail-0.136
 dentist-0.132
 grandma-0.129
 that-0.122
Token seventeen
Token ID38741
Feature activation+2.06e-01

Pos logit contributions
right+0.029
 perspective+0.027
mer+0.027
illed+0.026
ancy+0.024
Neg logit contributions
 mail-0.077
 dentist-0.074
 grandma-0.073
 he-0.072
 dreaming-0.071
Token,
Token ID11
Feature activation+3.38e-02

Pos logit contributions
right+0.467
mer+0.455
chie+0.437
iona+0.434
Joy+0.434
Neg logit contributions
 dentist-0.198
 grandma-0.198
 he-0.192
 mail-0.192
 that-0.180
Token eighteen
Token ID29095
Feature activation+1.68e-01

Pos logit contributions
right+0.030
mer+0.028
 perspective+0.027
illed+0.027
ancy+0.025
Neg logit contributions
 mail-0.076
 dentist-0.073
 grandma-0.072
 he-0.071
 dreaming-0.071
Token,
Token ID11
Feature activation+3.36e-02

Pos logit contributions
right+0.361
mer+0.356
ancy+0.332
rog+0.328
Joy+0.326
Neg logit contributions
 he-0.187
 grandma-0.183
 dentist-0.176
 mail-0.175
 dreaming-0.169
Token nineteen
Token ID43063
Feature activation+1.27e-01

Pos logit contributions
right+0.031
mer+0.029
illed+0.027
 perspective+0.027
ancy+0.026
Neg logit contributions
 mail-0.074
 dentist-0.072
 grandma-0.071
 dreaming-0.069
 he-0.069
Token,
Token ID11
Feature activation+3.37e-02

Pos logit contributions
mer+0.292
right+0.276
Joy+0.269
 slope+0.268
chie+0.264
Neg logit contributions
 that-0.125
 grandma-0.120
 he-0.118
 mail-0.117
 tell-0.110
Token,
Token ID11
Feature activation+3.30e-02

Pos logit contributions
right+0.252
mer+0.243
illed+0.233
ancy+0.226
chie+0.224
Neg logit contributions
 he-0.123
 mail-0.111
 grandma-0.106
 that-0.105
 say-0.101
Token fifteen
Token ID17280
Feature activation+1.90e-01

Pos logit contributions
mer+0.041
right+0.039
illed+0.039
 perspective+0.036
ancy+0.035
Neg logit contributions
 mail-0.065
 he-0.063
 dentist-0.061
 that-0.060
 grandma-0.060
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
right+0.467
mer+0.437
illed+0.434
ust+0.427
rated+0.425
Neg logit contributions
 that-0.152
 he-0.152
 friends-0.140
 mail-0.129
 hugs-0.129
Token sixteen
Token ID27913
Feature activation+1.39e-01

Pos logit contributions
right+0.032
mer+0.031
illed+0.031
 perspective+0.030
ancy+0.028
Neg logit contributions
 mail-0.072
 he-0.068
 dentist-0.067
 grandma-0.067
 that-0.067
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
right+0.365
mer+0.349
ust+0.331
chie+0.329
zzy+0.329
Neg logit contributions
 hugs-0.097
 friends-0.090
 that-0.088
 he-0.086
 say-0.083
Token seventeen
Token ID38741
Feature activation+2.06e-01

Pos logit contributions
right+0.030
 perspective+0.028
illed+0.028
mer+0.027
ancy+0.025
Neg logit contributions
 mail-0.077
 he-0.072
 dentist-0.071
 grandma-0.071
 say-0.070
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
right+0.519
mer+0.506
chie+0.494
iona+0.483
 perspective+0.476
Neg logit contributions
 he-0.150
 that-0.146
 grandma-0.144
 mail-0.138
 hugs-0.137
Token eighteen
Token ID29095
Feature activation+1.67e-01

Pos logit contributions
right+0.029
mer+0.028
illed+0.027
 perspective+0.027
ancy+0.025
Neg logit contributions
 mail-0.075
 he-0.071
 dentist-0.071
 grandma-0.071
 dreaming-0.069
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
right+0.401
mer+0.398
chie+0.364
rated+0.363
ancy+0.363
Neg logit contributions
 he-0.154
 friends-0.140
 grandma-0.140
 that-0.139
 tell-0.135
Token nineteen
Token ID43063
Feature activation+1.26e-01

Pos logit contributions
right+0.031
mer+0.029
illed+0.028
 perspective+0.027
ancy+0.026
Neg logit contributions
 mail-0.074
 dentist-0.070
 grandma-0.070
 he-0.069
 dreaming-0.068
Token,
Token ID11
Feature activation+3.33e-02

Pos logit contributions
mer+0.299
right+0.282
 slope+0.276
amy+0.273
chie+0.270
Neg logit contributions
 that-0.116
 he-0.105
 grandma-0.104
 say-0.103
 tell-0.102
Token.
Token ID13
Feature activation+3.31e-03

Pos logit contributions
right+0.207
 perspective+0.203
 thirteen+0.196
Joy+0.194
illed+0.193
Neg logit contributions
 mail-0.139
grand-0.130
 grandma-0.130
 he-0.129
 that-0.128
Token Can
Token ID1680
Feature activation+6.91e-02

Pos logit contributions
right+0.006
 thirteen+0.006
mer+0.006
 steep+0.006
 length+0.006
Neg logit contributions
 mail-0.004
grand-0.004
 dentist-0.004
 grandma-0.004
 granddaughter-0.004
Token you
Token ID345
Feature activation+5.90e-02

Pos logit contributions
right+0.127
 thirteen+0.126
mer+0.120
 steep+0.120
 length+0.118
Neg logit contributions
grand-0.080
 mail-0.079
 dentist-0.079
 granddaughter-0.077
 dreaming-0.075
Token say
Token ID910
Feature activation-1.58e-01

Pos logit contributions
right+0.109
 thirteen+0.108
 steep+0.103
mer+0.103
 length+0.103
Neg logit contributions
grand-0.068
 dentist-0.066
 granddaughter-0.066
 grandma-0.064
 dreaming-0.063
Token five
Token ID1936
Feature activation+1.36e-01

Pos logit contributions
grand+0.181
 mail+0.176
 dentist+0.173
 granddaughter+0.166
 grandma+0.165
Neg logit contributions
right-0.296
 thirteen-0.295
 steep-0.280
mer-0.280
 length-0.277
Token kil
Token ID8769
Feature activation+2.03e-01

Pos logit contributions
right+0.260
illed+0.254
ancy+0.248
amy+0.248
 perspective+0.247
Neg logit contributions
 dentist-0.160
 grandma-0.158
 mail-0.157
 that-0.156
 doctor-0.149
Tokenos
Token ID418
Feature activation+3.36e-02

Pos logit contributions
 steep+0.347
 length+0.345
 thirteen+0.335
 furthe+0.334
 centimeters+0.328
Neg logit contributions
grand-0.263
 grandma-0.252
 mail-0.242
 dentist-0.236
 granddaughter-0.233
Token?"
Token ID1701
Feature activation+1.11e-02

Pos logit contributions
right+0.060
 thirteen+0.058
 length+0.058
 perspective+0.057
 steep+0.057
Neg logit contributions
 mail-0.044
grand-0.043
 dentist-0.042
 grandma-0.041
 granddaughter-0.039
Token uncle
Token ID7711
Feature activation-1.04e-01

Pos logit contributions
right+0.020
 thirteen+0.019
 steep+0.019
mer+0.018
 length+0.018
Neg logit contributions
grand-0.013
 mail-0.013
 dentist-0.013
 dreaming-0.012
 granddaughter-0.012
Token Tom
Token ID4186
Feature activation+4.74e-02

Pos logit contributions
 mail+0.124
 dentist+0.122
grand+0.122
 grandma+0.118
 granddaughter+0.117
Neg logit contributions
right-0.191
 thirteen-0.187
mer-0.183
 steep-0.181
 length-0.179
Token said
Token ID531
Feature activation-1.65e-01

Pos logit contributions
right+0.102
 thirteen+0.100
mer+0.098
 steep+0.096
 length+0.096
Neg logit contributions
 mail-0.042
grand-0.042
 dentist-0.041
 grandma-0.039
 granddaughter-0.038
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
 mail+0.016
grand+0.016
 dentist+0.016
 granddaughter+0.015
 dreaming+0.015
Neg logit contributions
right-0.038
 thirteen-0.037
 steep-0.036
mer-0.036
 length-0.036
Token"
Token ID1
Feature activation+2.73e-02

Pos logit contributions
 mail+0.025
 grandma+0.025
grand+0.025
 dentist+0.024
 granddaughter+0.023
Neg logit contributions
 thirteen-0.028
right-0.028
mer-0.028
 furthe-0.027
 length-0.026
TokenThey
Token ID2990
Feature activation-2.05e-03

Pos logit contributions
 thirteen+0.044
right+0.043
illed+0.042
rated+0.041
 steep+0.041
Neg logit contributions
 mail-0.041
grand-0.040
 grandma-0.040
 dentist-0.039
 doctor-0.039
Token weigh
Token ID10164
Feature activation+1.57e-02

Pos logit contributions
 dentist+0.003
grand+0.003
 granddaughter+0.003
 mail+0.003
 grandma+0.003
Neg logit contributions
right-0.003
 thirteen-0.003
mer-0.003
 steep-0.003
 length-0.003
Token five
Token ID1936
Feature activation+1.35e-01

Pos logit contributions
 perspective+0.033
right+0.032
illed+0.031
mer+0.030
 furthe+0.030
Neg logit contributions
 grandma-0.016
 mail-0.016
 that-0.015
 he-0.015
 him-0.014
Token kil
Token ID8769
Feature activation+2.03e-01

Pos logit contributions
right+0.262
illed+0.249
 perspective+0.249
ancy+0.247
Joy+0.245
Neg logit contributions
 mail-0.160
 he-0.160
 that-0.155
 dentist-0.145
 car-0.139
Tokenos
Token ID418
Feature activation+3.31e-02

Pos logit contributions
 steep+0.366
 length+0.364
 furthe+0.351
 thirteen+0.349
 perspective+0.345
Neg logit contributions
grand-0.235
 grandma-0.222
 mail-0.218
 he-0.206
 dentist-0.205
Token.
Token ID13
Feature activation+2.80e-03

Pos logit contributions
right+0.058
 perspective+0.058
ancy+0.056
 length+0.056
illed+0.055
Neg logit contributions
 mail-0.044
 he-0.040
grand-0.040
 that-0.040
 grandma-0.038
Token That
Token ID1320
Feature activation+5.31e-02

Pos logit contributions
right+0.005
mer+0.005
 thirteen+0.005
 steep+0.005
 length+0.005
Neg logit contributions
 mail-0.003
grand-0.003
 dentist-0.003
 grandma-0.003
 granddaughter-0.003
Token means
Token ID1724
Feature activation-8.68e-02

Pos logit contributions
illed+0.110
right+0.109
mer+0.106
ancy+0.105
chie+0.102
Neg logit contributions
 mail-0.063
 dentist-0.058
grand-0.054
 doctor-0.054
 grandma-0.053
Token five
Token ID1936
Feature activation+1.36e-01

Pos logit contributions
 mail+0.078
 grandma+0.077
 he+0.073
 that+0.069
 dentist+0.068
Neg logit contributions
mer-0.198
right-0.194
illed-0.194
rated-0.185
 perspective-0.185
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
 mail+0.236
 grandma+0.225
 dentist+0.224
 granddaughter+0.223
 dreaming+0.222
Neg logit contributions
 thirteen-0.267
right-0.265
 steep-0.260
mer-0.256
 length-0.252
Token
Token ID198
Feature activation-1.86e-02

Pos logit contributions
right+0.015
 thirteen+0.015
mer+0.015
 steep+0.015
 length+0.014
Neg logit contributions
grand-0.007
 mail-0.007
 dentist-0.007
 granddaughter-0.006
 dreaming-0.006
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
 mail+0.015
grand+0.015
 dentist+0.015
 grandma+0.014
 granddaughter+0.014
Neg logit contributions
right-0.041
 thirteen-0.040
mer-0.040
 steep-0.039
 length-0.039
Token"
Token ID1
Feature activation+2.71e-02

Pos logit contributions
 grandma+0.028
 dentist+0.027
 doctor+0.027
 mail+0.027
 granddaughter+0.026
Neg logit contributions
right-0.028
 thirteen-0.027
mer-0.027
 furthe-0.027
 perspective-0.027
TokenFive
Token ID20029
Feature activation+1.24e-01

Pos logit contributions
right+0.044
illed+0.043
 thirteen+0.042
 steep+0.042
ancy+0.041
Neg logit contributions
 mail-0.038
grand-0.036
 dentist-0.036
 doctor-0.036
 grandma-0.035
Token kil
Token ID8769
Feature activation+2.02e-01

Pos logit contributions
right+0.228
ancy+0.222
 perspective+0.222
Joy+0.219
eways+0.215
Neg logit contributions
 mail-0.154
 that-0.144
 tell-0.144
 grandma-0.143
 he-0.142
Tokenos
Token ID418
Feature activation+3.31e-02

Pos logit contributions
 steep+0.368
 length+0.357
 furthe+0.357
 thirteen+0.345
 centimeters+0.341
Neg logit contributions
grand-0.234
 grandma-0.228
 mail-0.220
 dentist-0.211
 grandson-0.208
Token,"
Token ID553
Feature activation-4.54e-02

Pos logit contributions
right+0.056
 thirteen+0.053
 steep+0.053
 length+0.053
 perspective+0.053
Neg logit contributions
 mail-0.046
grand-0.044
 dentist-0.042
 grandma-0.042
 he-0.041
Token Anna
Token ID11735
Feature activation+8.47e-02

Pos logit contributions
grand+0.078
 mail+0.077
 dreaming+0.073
 dentist+0.073
 thanking+0.072
Neg logit contributions
 thirteen-0.058
right-0.058
 steep-0.055
mer-0.053
 length-0.052
Token repeated
Token ID5100
Feature activation-3.59e-02

Pos logit contributions
right+0.213
 thirteen+0.209
mer+0.205
 steep+0.204
 length+0.203
Neg logit contributions
 mail-0.046
grand-0.044
 dentist-0.044
 grandma-0.041
 granddaughter-0.040
Token.
Token ID13
Feature activation+4.07e-03

Pos logit contributions
 grandma+0.047
 mail+0.044
 dentist+0.043
 granddaughter+0.042
 he+0.042
Neg logit contributions
right-0.067
mer-0.065
 thirteen-0.064
 steep-0.064
 perspective-0.064
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
grand+0.001
 dentist+0.000
 officer+0.000
 dreaming+0.000
 mail+0.000
Neg logit contributions
right-0.001
ancy-0.001
 thirteen-0.001
 length-0.001
illed-0.001
Token face
Token ID1986
Feature activation-1.38e-02

Pos logit contributions
grand+0.017
 mail+0.017
 dentist+0.017
 grandma+0.016
 granddaughter+0.016
Neg logit contributions
right-0.020
 thirteen-0.020
mer-0.019
 steep-0.019
 length-0.019
Token.
Token ID13
Feature activation+9.53e-03

Pos logit contributions
 mail+0.020
 dentist+0.019
grand+0.019
 grandma+0.019
 dreaming+0.018
Neg logit contributions
right-0.023
 thirteen-0.022
mer-0.022
 steep-0.021
ancy-0.021
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
 thirteen+0.021
right+0.021
mer+0.021
 length+0.020
 steep+0.020
Neg logit contributions
 mail-0.008
 grandma-0.008
grand-0.007
 dentist-0.007
 granddaughter-0.007
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 mail+0.016
grand+0.016
 dentist+0.016
 grandma+0.015
 granddaughter+0.015
Neg logit contributions
right-0.043
 thirteen-0.042
mer-0.042
 steep-0.041
 length-0.041
TokenLu
Token ID25596
Feature activation+1.98e-01

Pos logit contributions
 mail+0.034
 grandma+0.032
grand+0.032
 that+0.031
 granddaughter+0.030
Neg logit contributions
 thirteen-0.025
mer-0.023
ill-0.023
iful-0.023
teen-0.022
Tokency
Token ID948
Feature activation+6.36e-02

Pos logit contributions
 thirteen+0.236
 steep+0.222
 seventeen+0.212
 length+0.200
 centimeters+0.200
Neg logit contributions
 granddaughter-0.397
 grandma-0.397
 mail-0.386
grand-0.383
 grandson-0.375
Token decided
Token ID3066
Feature activation-3.32e-02

Pos logit contributions
right+0.129
 thirteen+0.126
 steep+0.124
mer+0.123
 length+0.120
Neg logit contributions
 mail-0.068
 granddaughter-0.064
 grandma-0.064
 dentist-0.064
grand-0.064
Token to
Token ID284
Feature activation-6.47e-03

Pos logit contributions
grand+0.040
 dentist+0.040
 mail+0.040
 granddaughter+0.038
 grandma+0.037
Neg logit contributions
right-0.060
 steep-0.057
 thirteen-0.057
 length-0.054
mer-0.054
Token carefully
Token ID7773
Feature activation+3.99e-03

Pos logit contributions
 mail+0.007
grand+0.007
 dentist+0.007
 grandma+0.007
 granddaughter+0.006
Neg logit contributions
right-0.013
 thirteen-0.013
mer-0.013
 steep-0.012
 length-0.012
Token stand
Token ID1302
Feature activation+3.35e-02

Pos logit contributions
mer+0.007
 thirteen+0.007
right+0.007
 steep+0.007
 perspective+0.007
Neg logit contributions
 mail-0.005
 tell-0.005
 dentist-0.005
 send-0.005
grand-0.005
Token counted
Token ID14789
Feature activation+9.10e-02

Pos logit contributions
 mail+0.022
 dentist+0.022
 grandma+0.020
 granddaughter+0.020
grand+0.020
Neg logit contributions
right-0.028
 thirteen-0.027
 perspective-0.027
ancy-0.026
mer-0.026
Token 1
Token ID352
Feature activation+1.37e-01

Pos logit contributions
mer+0.197
right+0.192
 thirteen+0.182
 furthe+0.182
rated+0.179
Neg logit contributions
 grandma-0.087
 he-0.086
 mail-0.086
 dentist-0.080
 that-0.080
Token,
Token ID11
Feature activation+3.24e-02

Pos logit contributions
right+0.259
mer+0.257
rated+0.257
 perspective+0.256
 furthe+0.255
Neg logit contributions
 he-0.158
 that-0.152
 mail-0.143
 grandma-0.140
 dentist-0.138
Token2
Token ID17
Feature activation+1.65e-01

Pos logit contributions
mer+0.047
right+0.047
 perspective+0.046
ide+0.045
ud+0.044
Neg logit contributions
 mail-0.056
 granddaughter-0.052
 dentist-0.052
 grandma-0.052
 he-0.051
Token,
Token ID11
Feature activation+3.20e-02

Pos logit contributions
 disgust+0.324
right+0.318
Joy+0.317
 perspective+0.315
 steep+0.314
Neg logit contributions
 he-0.191
 that-0.176
 mail-0.147
 grandma-0.143
 granddaughter-0.138
Token3
Token ID18
Feature activation+1.98e-01

Pos logit contributions
mer+0.046
right+0.046
 perspective+0.044
illed+0.043
ancy+0.043
Neg logit contributions
 mail-0.055
 dentist-0.051
 he-0.051
 grandma-0.050
 granddaughter-0.050
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
rated+0.447
 perspective+0.443
Joy+0.438
Uh+0.431
 steep+0.431
Neg logit contributions
 he-0.186
 that-0.176
 him-0.132
 grandma-0.119
 his-0.117
Token4
Token ID19
Feature activation+1.51e-01

Pos logit contributions
mer+0.042
 perspective+0.042
right+0.042
illed+0.039
ancy+0.039
Neg logit contributions
 mail-0.057
 dentist-0.054
 he-0.053
 granddaughter-0.052
 grandma-0.052
Token,
Token ID11
Feature activation+3.18e-02

Pos logit contributions
right+0.302
 perspective+0.298
 steep+0.296
Joy+0.294
 furthe+0.294
Neg logit contributions
 he-0.170
 that-0.156
 mail-0.132
 grandma-0.127
 car-0.121
Token5
Token ID20
Feature activation+1.38e-01

Pos logit contributions
mer+0.044
right+0.043
 perspective+0.042
illed+0.040
ancy+0.040
Neg logit contributions
 mail-0.055
 dentist-0.053
 he-0.052
 grandma-0.051
 doctor-0.050
Token,
Token ID11
Feature activation+3.14e-02

Pos logit contributions
rated+0.268
 perspective+0.266
mer+0.266
Joy+0.260
 disgust+0.259
Neg logit contributions
 he-0.156
 that-0.151
 mail-0.139
 grandma-0.131
 dentist-0.129
Token the
Token ID262
Feature activation-9.51e-03

Pos logit contributions
 mail+0.113
 dentist+0.110
 grandma+0.106
grand+0.102
 he+0.097
Neg logit contributions
right-0.263
mer-0.260
 thirteen-0.256
ancy-0.251
illed-0.248
Token bald
Token ID28273
Feature activation+1.93e-02

Pos logit contributions
 mail+0.012
 dentist+0.012
 grandma+0.011
grand+0.011
 granddaughter+0.011
Neg logit contributions
right-0.017
 thirteen-0.016
mer-0.016
ancy-0.016
 steep-0.016
Token monkey
Token ID21657
Feature activation+1.13e-02

Pos logit contributions
right+0.043
 thirteen+0.042
mer+0.041
 steep+0.041
ancy+0.040
Neg logit contributions
 mail-0.016
 dentist-0.016
grand-0.016
 grandma-0.015
 granddaughter-0.015
Token weighed
Token ID20261
Feature activation+2.20e-03

Pos logit contributions
right+0.021
 thirteen+0.020
mer+0.020
ancy+0.020
 steep+0.019
Neg logit contributions
 mail-0.014
 dentist-0.013
grand-0.013
 grandma-0.013
 granddaughter-0.013
Token 4
Token ID604
Feature activation+1.25e-01

Pos logit contributions
mer+0.005
Joy+0.005
ancy+0.005
 perspective+0.005
aney+0.004
Neg logit contributions
 he-0.002
 mail-0.002
 that-0.002
 grandma-0.002
 dentist-0.002
Token kil
Token ID8769
Feature activation+1.96e-01

Pos logit contributions
Joy+0.167
right+0.163
 perspective+0.162
ancy+0.161
 steep+0.159
Neg logit contributions
 he-0.212
 mail-0.208
 that-0.205
 dentist-0.200
 doctor-0.192
Tokenos
Token ID418
Feature activation+2.49e-02

Pos logit contributions
 steep+0.358
 length+0.344
 seasons+0.341
 spe+0.340
 thirteen+0.333
Neg logit contributions
grand-0.220
 mail-0.201
 he-0.198
on-0.197
 dentist-0.193
Token.
Token ID13
Feature activation+7.01e-03

Pos logit contributions
 perspective+0.045
right+0.045
ancy+0.044
 steep+0.043
Joy+0.042
Neg logit contributions
 mail-0.032
 he-0.030
 that-0.029
grand-0.028
 dentist-0.027
Token The
Token ID383
Feature activation-3.15e-02

Pos logit contributions
right+0.016
 thirteen+0.016
mer+0.015
 steep+0.015
 length+0.015
Neg logit contributions
 mail-0.006
 dentist-0.005
grand-0.005
 grandma-0.005
 granddaughter-0.005
Token monkey
Token ID21657
Feature activation+1.09e-02

Pos logit contributions
 mail+0.035
 dentist+0.034
grand+0.034
 grandma+0.033
 granddaughter+0.033
Neg logit contributions
right-0.062
 thirteen-0.060
mer-0.059
 steep-0.058
 length-0.057
Token was
Token ID373
Feature activation+2.55e-02

Pos logit contributions
right+0.019
 thirteen+0.019
mer+0.018
 steep+0.018
ancy+0.018
Neg logit contributions
 mail-0.014
 dentist-0.014
grand-0.014
 grandma-0.013
 granddaughter-0.013
Token too
Token ID1165
Feature activation+4.57e-02

Pos logit contributions
 mail+0.082
grand+0.081
 dentist+0.080
 grandma+0.077
 granddaughter+0.076
Neg logit contributions
right-0.143
 thirteen-0.140
mer-0.137
 steep-0.135
 length-0.134
Token dangerous
Token ID4923
Feature activation-4.49e-02

Pos logit contributions
 thirteen+0.088
rog+0.087
mer+0.085
about+0.085
abouts+0.084
Neg logit contributions
 mail-0.056
 grandma-0.053
 he-0.051
 dreaming-0.050
 that-0.048
Token,
Token ID11
Feature activation+6.72e-03

Pos logit contributions
 mail+0.057
 grandma+0.053
 that+0.053
grand+0.053
 dentist+0.052
Neg logit contributions
right-0.082
 thirteen-0.082
mer-0.079
ancy-0.078
Joy-0.077
Token honey
Token ID12498
Feature activation+5.10e-02

Pos logit contributions
right+0.012
 thirteen+0.012
mer+0.012
 steep+0.012
 length+0.012
Neg logit contributions
 mail-0.008
grand-0.008
 dentist-0.008
 grandma-0.008
 granddaughter-0.007
Token"
Token ID1
Feature activation+7.41e-04

Pos logit contributions
right+0.084
 thirteen+0.081
mer+0.078
 steep+0.077
 length+0.076
Neg logit contributions
 mail-0.072
 dentist-0.072
grand-0.071
 grandma-0.070
 granddaughter-0.068
Token said
Token ID531
Feature activation-1.80e-01

Pos logit contributions
 thirteen+0.001
right+0.001
 steep+0.001
mer+0.001
 perspective+0.001
Neg logit contributions
 grandma-0.001
 dentist-0.001
 mail-0.001
 granddaughter-0.001
 doctor-0.001
Token Mom
Token ID11254
Feature activation+4.41e-02

Pos logit contributions
 grandma+0.205
 mail+0.199
 dentist+0.198
 doctor+0.193
 he+0.190
Neg logit contributions
right-0.350
mer-0.341
 length-0.336
 steep-0.333
 thirteen-0.333
Tokenmy
Token ID1820
Feature activation-1.83e-02

Pos logit contributions
right+0.084
mer+0.081
 thirteen+0.080
 steep+0.079
 length+0.078
Neg logit contributions
 mail-0.053
 dentist-0.052
 grandma-0.051
grand-0.051
 granddaughter-0.050
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 mail+0.027
grand+0.026
 dentist+0.026
 grandma+0.026
 granddaughter+0.025
Neg logit contributions
right-0.030
 thirteen-0.029
mer-0.028
 steep-0.028
 length-0.027
Token
Token ID198
Feature activation-3.13e-02

Pos logit contributions
 mail+0.011
 grandma+0.011
grand+0.011
 dentist+0.011
 granddaughter+0.011
Neg logit contributions
right-0.022
 thirteen-0.022
mer-0.022
 steep-0.021
 length-0.021
TokenJust
Token ID5703
Feature activation+9.60e-02

Pos logit contributions
 mail+0.027
 dentist+0.027
grand+0.027
 granddaughter+0.025
 grandma+0.025
Neg logit contributions
right-0.068
 thirteen-0.066
mer-0.065
 steep-0.064
 length-0.064
Token"
Token ID1
Feature activation-2.50e-02

Pos logit contributions
right+0.008
ancy+0.008
 steep+0.008
mer+0.007
 thirteen+0.007
Neg logit contributions
grand-0.012
 dentist-0.012
 airplane-0.012
 dreaming-0.012
 mail-0.011
TokenNot
Token ID3673
Feature activation+3.71e-02

Pos logit contributions
 mail+0.042
grand+0.042
 dentist+0.041
 grandma+0.040
 granddaughter+0.040
Neg logit contributions
right-0.034
 thirteen-0.033
mer-0.032
 steep-0.031
 length-0.031
Token yet
Token ID1865
Feature activation+4.73e-03

Pos logit contributions
mer+0.078
right+0.077
 thirteen+0.074
illed+0.072
ud+0.072
Neg logit contributions
 grandma-0.043
 he-0.037
 dreaming-0.037
 mail-0.037
 dentist-0.037
Token,"
Token ID553
Feature activation-5.46e-02

Pos logit contributions
mer+0.010
right+0.010
 thirteen+0.010
illed+0.009
asp+0.009
Neg logit contributions
 grandma-0.006
 he-0.005
 mail-0.005
 Brown-0.005
 that-0.005
Token Lily
Token ID20037
Feature activation+2.76e-02

Pos logit contributions
 mail+0.053
grand+0.052
 dentist+0.052
 grandma+0.049
 granddaughter+0.049
Neg logit contributions
right-0.112
 thirteen-0.110
mer-0.108
 steep-0.106
 length-0.105
Token said
Token ID531
Feature activation-1.77e-01

Pos logit contributions
right+0.073
mer+0.072
 thirteen+0.072
 perspective+0.070
 length+0.070
Neg logit contributions
 mail-0.012
 dentist-0.012
 grandma-0.011
 granddaughter-0.011
grand-0.010
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 mail+0.234
 grandma+0.234
 dreaming+0.228
 granddaughter+0.227
 he+0.218
Neg logit contributions
 thirteen-0.320
 steep-0.308
right-0.307
mer-0.300
 perspective-0.299
Token "
Token ID366
Feature activation+4.84e-02

Pos logit contributions
 grandma+0.006
 mail+0.006
 Brown+0.005
 doctor+0.005
 granddaughter+0.005
Neg logit contributions
right-0.026
mer-0.026
 thirteen-0.026
rated-0.026
 centimeters-0.025
TokenWe
Token ID1135
Feature activation+5.74e-02

Pos logit contributions
right+0.079
 thirteen+0.078
 steep+0.076
 length+0.075
mer+0.075
Neg logit contributions
 mail-0.067
grand-0.066
 dentist-0.065
 grandma-0.063
 granddaughter-0.063
Token have
Token ID423
Feature activation-2.54e-02

Pos logit contributions
right+0.094
 thirteen+0.091
mer+0.089
 steep+0.087
 length+0.087
Neg logit contributions
 mail-0.083
grand-0.080
 dentist-0.080
 grandma-0.078
 granddaughter-0.077
Token to
Token ID284
Feature activation-1.05e-02

Pos logit contributions
 mail+0.022
 grandma+0.021
 dentist+0.021
grand+0.020
 granddaughter+0.020
Neg logit contributions
right-0.057
mer-0.057
 thirteen-0.055
illed-0.055
ancy-0.055
Token't
Token ID470
Feature activation+4.97e-02

Pos logit contributions
right+0.041
oe+0.038
 thirteen+0.038
h+0.037
mer+0.037
Neg logit contributions
 granddaughter-0.034
 dentist-0.033
grand-0.033
 dreaming-0.032
 grandson-0.032
Token worry
Token ID5490
Feature activation-1.92e-03

Pos logit contributions
right+0.076
 thirteen+0.075
mer+0.074
 steep+0.071
 length+0.070
Neg logit contributions
 mail-0.077
grand-0.075
 grandma-0.074
 dentist-0.073
 granddaughter-0.071
Token,"
Token ID553
Feature activation-4.98e-02

Pos logit contributions
 grandma+0.002
 that+0.002
 Teddy+0.002
 Brown+0.002
 daddy+0.002
Neg logit contributions
 thirteen-0.004
mer-0.004
right-0.004
 length-0.004
 seventeen-0.004
Token her
Token ID607
Feature activation-4.94e-02

Pos logit contributions
grand+0.071
 dentist+0.071
 mail+0.064
 dreaming+0.063
 thanking+0.061
Neg logit contributions
right-0.074
Joy-0.067
y-0.065
mer-0.064
illed-0.064
Token friend
Token ID1545
Feature activation-7.22e-02

Pos logit contributions
grand+0.036
 mail+0.034
 dentist+0.033
 dreaming+0.030
 granddaughter+0.029
Neg logit contributions
right-0.114
 thirteen-0.110
mer-0.110
 steep-0.109
 length-0.109
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
 dentist+0.055
grand+0.054
 mail+0.053
 dreaming+0.047
 grandma+0.047
Neg logit contributions
right-0.164
mer-0.157
 thirteen-0.154
illed-0.153
 steep-0.153
Token.
Token ID13
Feature activation-4.21e-03

Pos logit contributions
 mail+0.289
grand+0.289
 dentist+0.288
 grandma+0.277
 granddaughter+0.276
Neg logit contributions
right-0.246
 thirteen-0.236
mer-0.231
 steep-0.225
 length-0.223
Token "
Token ID366
Feature activation+5.23e-02

Pos logit contributions
 dentist+0.006
grand+0.006
 mail+0.005
 granddaughter+0.005
 grandma+0.005
Neg logit contributions
right-0.007
 thirteen-0.007
ancy-0.007
mer-0.007
 steep-0.007
TokenWe
Token ID1135
Feature activation+5.54e-02

Pos logit contributions
right+0.071
 thirteen+0.067
mer+0.067
 length+0.063
 steep+0.062
Neg logit contributions
 mail-0.090
 dentist-0.089
grand-0.087
 grandma-0.087
 granddaughter-0.087
Token'll
Token ID1183
Feature activation-3.28e-03

Pos logit contributions
right+0.067
mer+0.064
 thirteen+0.063
 perspective+0.060
 furthe+0.060
Neg logit contributions
 dentist-0.104
grand-0.101
 granddaughter-0.097
 grandma-0.095
 grandson-0.093
Token help
Token ID1037
Feature activation-1.75e-02

Pos logit contributions
 dentist+0.005
 granddaughter+0.005
grand+0.005
 grandson+0.005
 waitress+0.004
Neg logit contributions
right-0.005
 length-0.005
 perspective-0.005
ancy-0.005
chie-0.005
Token "
Token ID366
Feature activation+3.16e-02

Pos logit contributions
 mail+0.016
grand+0.016
 grandma+0.016
 dentist+0.015
 granddaughter+0.015
Neg logit contributions
right-0.041
 thirteen-0.040
mer-0.039
 length-0.039
 steep-0.039
TokenDon
Token ID3987
Feature activation+7.88e-03

Pos logit contributions
 steep+0.055
 thirteen+0.051
 centimeters+0.050
mer+0.050
 length+0.050
Neg logit contributions
 mail-0.043
grand-0.042
 granddaughter-0.040
 dentist-0.039
 grandson-0.038
Token't
Token ID470
Feature activation+3.66e-02

Pos logit contributions
right+0.014
 thirteen+0.013
mer+0.013
 steep+0.013
oe+0.013
Neg logit contributions
 dentist-0.010
 granddaughter-0.010
 dreaming-0.010
 mail-0.010
grand-0.010
Token worry
Token ID5490
Feature activation-5.96e-03

Pos logit contributions
right+0.063
 thirteen+0.063
mer+0.060
 steep+0.059
ancy+0.059
Neg logit contributions
 mail-0.051
grand-0.048
 grandma-0.047
 tell-0.046
 dentist-0.046
Token,"
Token ID553
Feature activation-4.80e-02

Pos logit contributions
 grandma+0.008
 that+0.007
 he+0.007
 mail+0.007
 Brown+0.007
Neg logit contributions
right-0.011
mer-0.011
 thirteen-0.011
 length-0.011
illed-0.011
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
grand+0.042
 dentist+0.042
 mail+0.040
 dreaming+0.037
 granddaughter+0.037
Neg logit contributions
right-0.099
 thirteen-0.095
mer-0.094
 steep-0.093
ancy-0.091
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
 mail+0.183
 dentist+0.178
grand+0.177
 grandma+0.173
 granddaughter+0.170
Neg logit contributions
right-0.355
 thirteen-0.350
mer-0.341
 steep-0.339
 length-0.335
Token squirrel
Token ID33039
Feature activation+2.36e-02

Pos logit contributions
 mail+0.016
 dentist+0.015
 doctor+0.015
 grandma+0.015
 granddaughter+0.014
Neg logit contributions
right-0.036
 thirteen-0.035
illed-0.034
mer-0.034
 steep-0.034
Token.
Token ID13
Feature activation-7.12e-03

Pos logit contributions
 thirteen+0.038
right+0.037
 steep+0.036
 length+0.036
mer+0.036
Neg logit contributions
 mail-0.034
grand-0.033
 granddaughter-0.032
 dentist-0.032
 grandma-0.032
Token "
Token ID366
Feature activation+4.72e-02

Pos logit contributions
 mail+0.007
 dentist+0.007
grand+0.007
 grandma+0.006
 granddaughter+0.006
Neg logit contributions
right-0.015
 thirteen-0.015
mer-0.014
 steep-0.014
 length-0.014
TokenI
Token ID40
Feature activation+8.93e-02

Pos logit contributions
 thirteen+0.070
right+0.070
 steep+0.068
 length+0.067
mer+0.066
Neg logit contributions
 mail-0.072
grand-0.072
 dentist-0.070
 granddaughter-0.068
 grandma-0.068
Token's
Token ID338
Feature activation-3.55e-02

Pos logit contributions
right+0.096
 thirteen+0.094
mer+0.092
ancy+0.090
illed+0.090
Neg logit contributions
 mail-0.047
 dentist-0.044
grand-0.044
 grandma-0.042
 granddaughter-0.041
Token not
Token ID407
Feature activation+4.73e-02

Pos logit contributions
 mail+0.039
 grandma+0.038
 dentist+0.034
 Brown+0.033
 Teddy+0.033
Neg logit contributions
mer-0.080
 perspective-0.076
ud-0.075
 thirteen-0.074
about-0.073
Token true
Token ID2081
Feature activation-8.21e-03

Pos logit contributions
mer+0.098
 thirteen+0.095
right+0.093
ancy+0.090
 seventeen+0.089
Neg logit contributions
 mail-0.058
 grandma-0.054
 dentist-0.050
 granddaughter-0.049
grand-0.048
Token,"
Token ID553
Feature activation-4.95e-02

Pos logit contributions
 mail+0.012
 grandma+0.011
 dentist+0.011
 granddaughter+0.011
grand+0.011
Neg logit contributions
right-0.014
 thirteen-0.014
mer-0.013
ancy-0.013
 steep-0.013
Token Sam
Token ID3409
Feature activation+3.87e-02

Pos logit contributions
 mail+0.056
grand+0.056
 dentist+0.055
 grandma+0.053
 granddaughter+0.053
Neg logit contributions
right-0.094
 thirteen-0.093
mer-0.090
 steep-0.089
 length-0.088
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
right+0.092
 thirteen+0.091
 steep+0.088
mer+0.088
 length+0.086
Neg logit contributions
grand-0.026
 mail-0.025
 dentist-0.025
 grandma-0.023
 granddaughter-0.023
Token.
Token ID13
Feature activation+7.94e-05

Pos logit contributions
 mail+0.282
 dentist+0.274
grand+0.271
 grandma+0.271
 granddaughter+0.267
Neg logit contributions
right-0.258
 thirteen-0.252
mer-0.245
 steep-0.242
 length-0.240
Token "
Token ID366
Feature activation+5.53e-02

Pos logit contributions
right+0.000
 thirteen+0.000
mer+0.000
 steep+0.000
 length+0.000
Neg logit contributions
grand-0.000
 mail-0.000
 dentist-0.000
 granddaughter-0.000
 grandma-0.000
TokenYou
Token ID1639
Feature activation+7.59e-02

Pos logit contributions
right+0.083
 thirteen+0.081
 steep+0.079
mer+0.078
 length+0.077
Neg logit contributions
 mail-0.085
grand-0.084
 dentist-0.083
 grandma-0.080
 granddaughter-0.079
Token can
Token ID460
Feature activation+2.65e-02

Pos logit contributions
 thirteen+0.101
right+0.100
mer+0.099
 steep+0.096
 furthe+0.093
Neg logit contributions
 dentist-0.124
grand-0.122
 granddaughter-0.120
 dreaming-0.118
 thanking-0.116
Token play
Token ID711
Feature activation-6.12e-02

Pos logit contributions
right+0.053
 thirteen+0.052
mer+0.051
 steep+0.051
 length+0.050
Neg logit contributions
grand-0.027
 dentist-0.026
 mail-0.026
 granddaughter-0.025
 grandma-0.025
Token "
Token ID366
Feature activation+3.53e-02

Pos logit contributions
 mail+0.011
 grandma+0.011
grand+0.010
 dentist+0.010
 granddaughter+0.010
Neg logit contributions
right-0.027
 thirteen-0.026
mer-0.026
 length-0.026
 steep-0.026
TokenDon
Token ID3987
Feature activation+9.54e-03

Pos logit contributions
 steep+0.062
mer+0.057
right+0.056
 length+0.055
 centimeters+0.055
Neg logit contributions
 mail-0.047
grand-0.043
 granddaughter-0.042
 dentist-0.042
 write-0.040
Token't
Token ID470
Feature activation+3.79e-02

Pos logit contributions
right+0.016
oe+0.015
 thirteen+0.015
mer+0.015
chie+0.015
Neg logit contributions
 dentist-0.013
 granddaughter-0.013
 dreaming-0.013
 thanking-0.012
 mail-0.012
Token worry
Token ID5490
Feature activation-4.73e-03

Pos logit contributions
right+0.066
 thirteen+0.065
mer+0.063
 steep+0.063
ancy+0.062
Neg logit contributions
 mail-0.052
grand-0.049
 grandma-0.048
 tell-0.047
 dentist-0.047
Token,"
Token ID553
Feature activation-4.69e-02

Pos logit contributions
 grandma+0.006
 Brown+0.005
 dentist+0.005
 mail+0.005
 he+0.005
Neg logit contributions
right-0.009
mer-0.009
 thirteen-0.009
 length-0.008
illed-0.008
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
grand+0.044
 dentist+0.044
 mail+0.042
 granddaughter+0.038
 dreaming+0.038
Neg logit contributions
right-0.093
 thirteen-0.090
mer-0.088
 steep-0.088
ancy-0.085
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
 mail+0.190
 dentist+0.184
grand+0.183
 grandma+0.180
 granddaughter+0.177
Neg logit contributions
right-0.348
 thirteen-0.343
mer-0.333
 steep-0.331
 length-0.328
Token boy
Token ID2933
Feature activation-1.94e-02

Pos logit contributions
 mail+0.016
 dentist+0.014
 doctor+0.014
 grandma+0.014
 granddaughter+0.013
Neg logit contributions
right-0.032
illed-0.030
 thirteen-0.030
mer-0.030
ancy-0.029
Token.
Token ID13
Feature activation-7.10e-03

Pos logit contributions
 mail+0.031
grand+0.031
 dentist+0.031
 grandma+0.030
 granddaughter+0.030
Neg logit contributions
right-0.028
 thirteen-0.027
mer-0.026
 steep-0.026
 length-0.025
Token "
Token ID366
Feature activation+4.81e-02

Pos logit contributions
 mail+0.007
grand+0.007
 dentist+0.007
 grandma+0.007
 granddaughter+0.006
Neg logit contributions
right-0.015
 thirteen-0.014
mer-0.014
 steep-0.014
 length-0.014
TokenI
Token ID40
Feature activation+8.92e-02

Pos logit contributions
right+0.071
 thirteen+0.070
 steep+0.068
mer+0.067
 length+0.067
Neg logit contributions
 mail-0.074
grand-0.074
 dentist-0.073
 granddaughter-0.070
 grandma-0.070
TokenDon
Token ID3987
Feature activation+5.76e-03

Pos logit contributions
 steep+0.040
 thirteen+0.038
right+0.038
mer+0.038
 length+0.038
Neg logit contributions
 mail-0.034
grand-0.033
 granddaughter-0.032
 dentist-0.032
 grandson-0.030
Token't
Token ID470
Feature activation+3.34e-02

Pos logit contributions
right+0.009
oe+0.009
 thirteen+0.009
h+0.009
mer+0.009
Neg logit contributions
 dentist-0.008
 granddaughter-0.008
 dreaming-0.008
 thanking-0.008
grand-0.007
Token worry
Token ID5490
Feature activation-8.15e-03

Pos logit contributions
right+0.055
 thirteen+0.054
mer+0.052
 steep+0.051
 length+0.051
Neg logit contributions
 mail-0.049
grand-0.047
 grandma-0.046
 dentist-0.046
 granddaughter-0.045
Token,"
Token ID553
Feature activation-4.86e-02

Pos logit contributions
 grandma+0.011
 mail+0.009
 Brown+0.009
 that+0.009
 he+0.009
Neg logit contributions
right-0.016
mer-0.015
 thirteen-0.015
 length-0.015
illed-0.015
Token he
Token ID339
Feature activation-3.14e-02

Pos logit contributions
grand+0.045
 dentist+0.044
 mail+0.043
 dreaming+0.039
 granddaughter+0.039
Neg logit contributions
right-0.098
 thirteen-0.095
mer-0.093
 steep-0.092
ancy-0.090
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
grand+0.025
 dentist+0.024
 mail+0.023
 grandma+0.022
 dreaming+0.021
Neg logit contributions
right-0.071
 thirteen-0.069
mer-0.067
 steep-0.067
 length-0.066
Token.
Token ID13
Feature activation-9.45e-03

Pos logit contributions
 mail+0.267
 grandma+0.255
 dentist+0.255
 granddaughter+0.253
grand+0.253
Neg logit contributions
 thirteen-0.274
right-0.272
 steep-0.260
mer-0.259
 length-0.255
Token "
Token ID366
Feature activation+4.51e-02

Pos logit contributions
 mail+0.009
grand+0.009
 dentist+0.009
 grandma+0.009
 granddaughter+0.009
Neg logit contributions
right-0.019
 thirteen-0.019
mer-0.019
 steep-0.018
 length-0.018
TokenI
Token ID40
Feature activation+8.80e-02

Pos logit contributions
right+0.065
 thirteen+0.065
 steep+0.063
mer+0.062
 length+0.061
Neg logit contributions
 mail-0.071
grand-0.070
 dentist-0.070
 granddaughter-0.068
 grandma-0.068
Token will
Token ID481
Feature activation-3.88e-04

Pos logit contributions
right+0.130
mer+0.124
 thirteen+0.124
 steep+0.121
 length+0.118
Neg logit contributions
 dentist-0.137
grand-0.136
 mail-0.135
 grandma-0.133
 granddaughter-0.132
Token help
Token ID1037
Feature activation-1.95e-02

Pos logit contributions
grand+0.000
 dentist+0.000
 mail+0.000
 granddaughter+0.000
 grandma+0.000
Neg logit contributions
right-0.001
 thirteen-0.001
mer-0.001
 steep-0.001
 length-0.001
Token okay
Token ID8788
Feature activation+9.35e-02

Pos logit contributions
 mail+0.038
 grandma+0.036
 dreaming+0.031
 tell+0.031
 dentist+0.030
Neg logit contributions
mer-0.099
ancy-0.090
about-0.088
 perspective-0.088
 slope-0.088
Token,
Token ID11
Feature activation+2.94e-02

Pos logit contributions
 thirteen+0.194
right+0.186
mer+0.186
 furthe+0.185
 length+0.179
Neg logit contributions
 grandma-0.119
 mail-0.109
 dentist-0.102
 tell-0.100
 that-0.099
Token Max
Token ID5436
Feature activation+7.68e-03

Pos logit contributions
 thirteen+0.059
 steep+0.058
right+0.057
 furthe+0.055
 length+0.055
Neg logit contributions
grand-0.033
 granddaughter-0.026
 mail-0.026
 dentist-0.026
 dreaming-0.024
Token,"
Token ID553
Feature activation-5.12e-02

Pos logit contributions
right+0.011
 thirteen+0.011
mer+0.011
 steep+0.010
 length+0.010
Neg logit contributions
 mail-0.012
grand-0.012
 dentist-0.012
 grandma-0.012
 granddaughter-0.012
Token Lily
Token ID20037
Feature activation+3.86e-02

Pos logit contributions
grand+0.085
 mail+0.076
card+0.075
check+0.075
 dentist+0.073
Neg logit contributions
 steep-0.068
right-0.066
 thirteen-0.066
oe-0.059
Joy-0.059
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
right+0.078
 thirteen+0.076
mer+0.073
 steep+0.072
illed+0.072
Neg logit contributions
grand-0.038
 dentist-0.035
 mail-0.034
 dreaming-0.032
 grandparents-0.031
Token.
Token ID13
Feature activation-3.65e-03

Pos logit contributions
grand+0.310
 dentist+0.307
 mail+0.305
 grandma+0.294
 granddaughter+0.293
Neg logit contributions
right-0.228
 thirteen-0.217
mer-0.211
 steep-0.206
 length-0.203
Token "
Token ID366
Feature activation+5.33e-02

Pos logit contributions
grand+0.005
 mail+0.004
 dentist+0.004
 granddaughter+0.004
 grandma+0.004
Neg logit contributions
right-0.006
 thirteen-0.006
 steep-0.006
mer-0.006
 length-0.006
TokenI
Token ID40
Feature activation+8.80e-02

Pos logit contributions
right+0.074
 thirteen+0.072
mer+0.071
 steep+0.068
 length+0.067
Neg logit contributions
 mail-0.088
grand-0.087
 dentist-0.087
 granddaughter-0.085
 grandma-0.085
Token'm
Token ID1101
Feature activation+2.94e-02

Pos logit contributions
right+0.150
 thirteen+0.147
mer+0.143
 steep+0.141
 length+0.140
Neg logit contributions
 mail-0.119
grand-0.117
 dentist-0.116
 grandma-0.113
 granddaughter-0.112
Token sorry
Token ID7926
Feature activation-1.17e-02

Pos logit contributions
right+0.059
 thirteen+0.059
 steep+0.057
ancy+0.054
Joy+0.053
Neg logit contributions
grand-0.033
 dentist-0.028
 mail-0.028
 granddaughter-0.027
 cash-0.026
Token OK
Token ID7477
Feature activation+9.94e-02

Pos logit contributions
 mail+0.042
 grandma+0.041
 dentist+0.037
 granddaughter+0.036
 dreaming+0.034
Neg logit contributions
mer-0.086
 perspective-0.081
 thirteen-0.081
about-0.079
 slope-0.079
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
 thirteen+0.221
 furthe+0.215
 steep+0.212
mer+0.209
right+0.206
Neg logit contributions
 grandma-0.122
 mail-0.109
 daddy-0.100
 granddaughter-0.099
 tell-0.098
Token kids
Token ID3988
Feature activation-5.51e-03

Pos logit contributions
 thirteen+0.059
right+0.059
 steep+0.056
mer+0.055
 length+0.055
Neg logit contributions
grand-0.039
 mail-0.038
 dentist-0.037
 granddaughter-0.036
 grandma-0.034
Token,"
Token ID553
Feature activation-4.91e-02

Pos logit contributions
 mail+0.008
 grandma+0.008
grand+0.008
 dentist+0.007
 granddaughter+0.007
Neg logit contributions
 thirteen-0.009
right-0.009
 steep-0.009
 length-0.009
mer-0.009
Token she
Token ID673
Feature activation-3.30e-02

Pos logit contributions
grand+0.062
 dentist+0.056
 mail+0.054
 dreaming+0.053
 granddaughter+0.052
Neg logit contributions
right-0.086
 thirteen-0.085
 steep-0.083
 length-0.079
ancy-0.078
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
grand+0.044
 dreaming+0.039
check+0.037
 dentist+0.036
glass+0.036
Neg logit contributions
 thirteen-0.058
right-0.056
 steep-0.053
 seasons-0.051
 seventeen-0.049
Token.
Token ID13
Feature activation+7.46e-04

Pos logit contributions
 mail+0.288
grand+0.283
 dentist+0.282
 grandma+0.277
 granddaughter+0.274
Neg logit contributions
right-0.248
 thirteen-0.242
mer-0.235
 steep-0.231
 length-0.228
Token "
Token ID366
Feature activation+5.45e-02

Pos logit contributions
right+0.002
 thirteen+0.001
mer+0.001
 steep+0.001
 length+0.001
Neg logit contributions
 mail-0.001
grand-0.001
 dentist-0.001
 grandma-0.001
 granddaughter-0.001
TokenBut
Token ID1537
Feature activation+4.53e-02

Pos logit contributions
right+0.068
 thirteen+0.065
mer+0.064
 length+0.060
ancy+0.060
Neg logit contributions
 mail-0.097
grand-0.097
 dentist-0.096
 granddaughter-0.095
 grandma-0.094
Token next
Token ID1306
Feature activation+6.38e-02

Pos logit contributions
right+0.088
 thirteen+0.086
mer+0.084
 steep+0.083
 length+0.083
Neg logit contributions
 mail-0.049
grand-0.049
 dentist-0.049
 granddaughter-0.046
 grandma-0.046
Token time
Token ID640
Feature activation+1.00e-01

Pos logit contributions
mer+0.068
 thirteen+0.063
 furthe+0.060
right+0.060
chie+0.057
Neg logit contributions
 dentist-0.137
 grandma-0.136
 mail-0.136
 doctor-0.129
 dreaming-0.129
Token share
Token ID2648
Feature activation+1.17e-02

Pos logit contributions
right+0.093
 thirteen+0.091
mer+0.089
 steep+0.088
 length+0.087
Neg logit contributions
grand-0.049
 dentist-0.048
 mail-0.048
 granddaughter-0.046
 grandma-0.046
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
right+0.023
 thirteen+0.022
mer+0.022
 steep+0.022
 length+0.022
Neg logit contributions
 mail-0.013
 dentist-0.013
grand-0.013
 grandma-0.012
 granddaughter-0.012
Token car
Token ID1097
Feature activation-7.56e-02

Pos logit contributions
grand+0.011
 dentist+0.011
 granddaughter+0.010
 grandparents+0.010
 thanking+0.010
Neg logit contributions
right-0.019
 length-0.018
 thirteen-0.018
 steep-0.018
mer-0.017
Token?"
Token ID1701
Feature activation+1.25e-03

Pos logit contributions
grand+0.105
 mail+0.104
 dentist+0.102
 granddaughter+0.102
 dreaming+0.099
Neg logit contributions
right-0.121
mer-0.118
 thirteen-0.118
 steep-0.114
 length-0.110
Token Tom
Token ID4186
Feature activation+4.10e-02

Pos logit contributions
right+0.003
 thirteen+0.003
mer+0.002
 length+0.002
 steep+0.002
Neg logit contributions
 mail-0.001
 grandma-0.001
 dentist-0.001
grand-0.001
 granddaughter-0.001
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
right+0.091
 thirteen+0.089
mer+0.088
 steep+0.087
 length+0.086
Neg logit contributions
 mail-0.035
 dentist-0.034
grand-0.033
 grandma-0.032
 granddaughter-0.032
Token.
Token ID13
Feature activation-1.20e-03

Pos logit contributions
 mail+0.235
 grandma+0.227
 granddaughter+0.218
 doctor+0.215
 he+0.214
Neg logit contributions
 thirteen-0.315
right-0.309
 perspective-0.306
 steep-0.304
 furthe-0.304
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
 mail+0.001
 dentist+0.001
 grandma+0.001
 doctor+0.001
 he+0.001
Neg logit contributions
right-0.003
 thirteen-0.003
 perspective-0.003
 length-0.003
mer-0.003
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
 mail+0.019
grand+0.018
 dentist+0.018
 grandma+0.018
 granddaughter+0.016
Neg logit contributions
right-0.062
 thirteen-0.061
mer-0.060
 length-0.059
 steep-0.059
Token"
Token ID1
Feature activation+1.95e-02

Pos logit contributions
 mail+0.032
 grandma+0.029
 he+0.029
 doctor+0.029
 dentist+0.029
Neg logit contributions
 perspective-0.048
right-0.047
 furthe-0.046
iful-0.046
 thirteen-0.045
TokenOK
Token ID11380
Feature activation+1.03e-01

Pos logit contributions
right+0.032
 thirteen+0.031
 steep+0.030
ancy+0.030
 length+0.030
Neg logit contributions
 mail-0.028
grand-0.026
 grandma-0.026
 doctor-0.026
 dentist-0.026
Token I
Token ID314
Feature activation+9.35e-02

Pos logit contributions
 mail+0.068
grand+0.067
 dentist+0.067
 granddaughter+0.064
 grandma+0.064
Neg logit contributions
right-0.114
 thirteen-0.114
mer-0.110
 steep-0.109
 length-0.108
Token don
Token ID836
Feature activation+5.94e-02

Pos logit contributions
right+0.154
 thirteen+0.151
mer+0.145
 steep+0.143
 length+0.142
Neg logit contributions
 mail-0.134
grand-0.130
 dentist-0.128
 grandma-0.126
 granddaughter-0.124
Token't
Token ID470
Feature activation+5.26e-02

Pos logit contributions
right+0.121
 thirteen+0.118
mer+0.115
 steep+0.114
 length+0.113
Neg logit contributions
grand-0.059
 dentist-0.058
 mail-0.058
 granddaughter-0.057
 grandma-0.055
Token,"
Token ID553
Feature activation-4.92e-02

Pos logit contributions
mer+0.105
right+0.104
 thirteen+0.104
ancy+0.102
 steep+0.101
Neg logit contributions
 mail-0.063
 tell-0.060
 say-0.058
grand-0.056
 write-0.055
Token Tom
Token ID4186
Feature activation+3.79e-02

Pos logit contributions
 grandma+0.051
 dentist+0.050
 mail+0.049
 granddaughter+0.045
 tell+0.045
Neg logit contributions
mer-0.104
right-0.103
 thirteen-0.100
 furthe-0.098
ancy-0.098
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
right+0.091
 thirteen+0.090
 steep+0.087
mer+0.087
 length+0.086
Neg logit contributions
grand-0.024
 mail-0.023
 dentist-0.023
 grandma-0.021
 granddaughter-0.021
Token.
Token ID13
Feature activation+1.88e-03

Pos logit contributions
 mail+0.274
 dentist+0.264
 grandma+0.264
grand+0.262
 granddaughter+0.260
Neg logit contributions
right-0.263
 thirteen-0.260
mer-0.253
 steep-0.250
 length-0.245
Token
Token ID198
Feature activation-2.72e-02

Pos logit contributions
right+0.005
 thirteen+0.004
mer+0.004
 length+0.004
 steep+0.004
Neg logit contributions
 dentist-0.001
 mail-0.001
 grandma-0.001
grand-0.001
 granddaughter-0.001
Token
Token ID198
Feature activation-2.65e-02

Pos logit contributions
 mail+0.027
grand+0.027
 dentist+0.026
 granddaughter+0.026
 dreaming+0.026
Neg logit contributions
right-0.055
 thirteen-0.053
 steep-0.053
ancy-0.051
mer-0.051
TokenTheir
Token ID14574
Feature activation-5.77e-02

Pos logit contributions
 mail+0.035
 dentist+0.035
 grandma+0.035
grand+0.034
 doctor+0.033
Neg logit contributions
mer-0.046
right-0.045
 furthe-0.044
 thirteen-0.043
 perspective-0.043
Token moms
Token ID34015
Feature activation+1.52e-03

Pos logit contributions
 mail+0.054
grand+0.053
 dentist+0.053
 grandma+0.050
 granddaughter+0.050
Neg logit contributions
right-0.122
 thirteen-0.120
mer-0.117
 steep-0.116
 length-0.115
Token can
Token ID460
Feature activation+2.67e-02

Pos logit contributions
 thirteen+0.055
 steep+0.053
right+0.052
 length+0.050
 furthe+0.049
Neg logit contributions
grand-0.098
 dentist-0.093
 granddaughter-0.092
 dreaming-0.091
 grandma-0.089
Token,"
Token ID553
Feature activation-5.02e-02

Pos logit contributions
right+0.049
 thirteen+0.048
mer+0.047
 steep+0.046
 length+0.045
Neg logit contributions
 mail-0.035
 grandma-0.032
grand-0.032
 dentist-0.032
 tell-0.031
Token the
Token ID262
Feature activation-8.36e-03

Pos logit contributions
grand+0.066
 dentist+0.063
 mail+0.063
 dreaming+0.060
 granddaughter+0.060
Neg logit contributions
 thirteen-0.085
right-0.085
mer-0.084
 steep-0.081
 length-0.077
Token oct
Token ID19318
Feature activation+3.67e-03

Pos logit contributions
grand+0.014
 mail+0.013
 dentist+0.013
 granddaughter+0.013
 grandma+0.013
Neg logit contributions
right-0.012
mer-0.012
 thirteen-0.012
 length-0.011
ancy-0.011
Tokenopus
Token ID25790
Feature activation+3.82e-03

Pos logit contributions
right+0.007
 thirteen+0.007
mer+0.007
 steep+0.007
 length+0.006
Neg logit contributions
 mail-0.004
grand-0.004
 dentist-0.004
 grandma-0.004
 granddaughter-0.004
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
right+0.009
 thirteen+0.009
mer+0.009
 steep+0.009
 length+0.009
Neg logit contributions
 mail-0.002
grand-0.002
 dentist-0.002
 grandma-0.002
 granddaughter-0.002
Token.
Token ID13
Feature activation-1.80e-03

Pos logit contributions
 mail+0.273
 dentist+0.258
 grandma+0.258
 granddaughter+0.256
grand+0.252
Neg logit contributions
right-0.269
 thirteen-0.264
 steep-0.258
 length-0.254
mer-0.250
Token "
Token ID366
Feature activation+5.43e-02

Pos logit contributions
grand+0.002
 dentist+0.002
 mail+0.002
 granddaughter+0.002
 grandma+0.002
Neg logit contributions
 thirteen-0.003
right-0.003
mer-0.003
 steep-0.003
ancy-0.003
TokenWe
Token ID1135
Feature activation+5.78e-02

Pos logit contributions
right+0.076
 thirteen+0.074
mer+0.072
 steep+0.070
 length+0.069
Neg logit contributions
 mail-0.089
grand-0.088
 dentist-0.088
 grandma-0.086
 granddaughter-0.085
Token can
Token ID460
Feature activation+2.63e-02

Pos logit contributions
right+0.091
 thirteen+0.089
mer+0.086
 steep+0.085
 length+0.084
Neg logit contributions
 mail-0.086
grand-0.084
 dentist-0.083
 grandma-0.081
 granddaughter-0.081
Token share
Token ID2648
Feature activation+1.48e-02

Pos logit contributions
right+0.052
 thirteen+0.051
mer+0.050
 length+0.050
 steep+0.049
Neg logit contributions
grand-0.028
 dentist-0.027
 granddaughter-0.026
 mail-0.026
 grandma-0.025
Token too
Token ID1165
Feature activation+7.96e-02

Pos logit contributions
 perspective+0.216
 thirteen+0.215
 length+0.213
right+0.212
mer+0.202
Neg logit contributions
 grandma-0.145
 mail-0.142
 dentist-0.134
 tell-0.130
 that-0.129
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
 thirteen+0.181
right+0.178
 furthe+0.178
 perspective+0.177
about+0.174
Neg logit contributions
 grandma-0.076
 that-0.070
 mail-0.064
 tell-0.063
 he-0.063
Token Mom
Token ID11254
Feature activation+3.79e-02

Pos logit contributions
right+0.055
 thirteen+0.054
mer+0.053
 steep+0.052
 length+0.051
Neg logit contributions
 mail-0.045
grand-0.044
 dentist-0.044
 grandma-0.043
 granddaughter-0.042
Token,"
Token ID553
Feature activation-4.99e-02

Pos logit contributions
right+0.062
 thirteen+0.061
mer+0.059
 steep+0.058
 length+0.057
Neg logit contributions
 mail-0.056
 grandma-0.054
 dentist-0.054
grand-0.053
 granddaughter-0.052
Token Tom
Token ID4186
Feature activation+4.12e-02

Pos logit contributions
grand+0.059
 mail+0.058
 dentist+0.057
 granddaughter+0.055
 grandma+0.054
Neg logit contributions
right-0.093
 thirteen-0.092
 steep-0.088
mer-0.088
 length-0.087
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
right+0.097
 thirteen+0.096
 steep+0.092
 length+0.091
mer+0.091
Neg logit contributions
grand-0.029
 mail-0.027
 dentist-0.026
 dreaming-0.024
 granddaughter-0.024
Token.
Token ID13
Feature activation+1.86e-03

Pos logit contributions
 mail+0.264
 grandma+0.254
 dentist+0.249
 granddaughter+0.246
 dreaming+0.244
Neg logit contributions
right-0.277
 thirteen-0.275
mer-0.270
 steep-0.265
 perspective-0.261
Token
Token ID198
Feature activation-2.68e-02

Pos logit contributions
right+0.004
 thirteen+0.004
mer+0.004
 steep+0.004
 length+0.004
Neg logit contributions
 mail-0.001
 dentist-0.001
grand-0.001
 grandma-0.001
 granddaughter-0.001
Token
Token ID198
Feature activation-2.58e-02

Pos logit contributions
grand+0.030
 dentist+0.030
 mail+0.029
 dreaming+0.029
 granddaughter+0.029
Neg logit contributions
right-0.050
 steep-0.047
ancy-0.047
 thirteen-0.047
 length-0.046
TokenMom
Token ID29252
Feature activation+4.95e-02

Pos logit contributions
 mail+0.038
 grandma+0.037
 dentist+0.036
grand+0.036
 granddaughter+0.035
Neg logit contributions
right-0.042
 thirteen-0.041
mer-0.041
 steep-0.039
 furthe-0.039
Token sigh
Token ID19680
Feature activation+2.93e-02

Pos logit contributions
right+0.087
 thirteen+0.087
 steep+0.082
mer+0.082
 length+0.081
Neg logit contributions
grand-0.065
 mail-0.062
 dentist-0.061
 granddaughter-0.060
 grandma-0.059
Token they
Token ID484
Feature activation-1.63e-02

Pos logit contributions
right+0.031
 thirteen+0.031
mer+0.030
 steep+0.029
 length+0.029
Neg logit contributions
 mail-0.016
grand-0.016
 dentist-0.016
 grandma-0.015
 granddaughter-0.015
Token are
Token ID389
Feature activation+9.94e-03

Pos logit contributions
 mail+0.023
grand+0.023
 dentist+0.023
 grandma+0.022
 granddaughter+0.022
Neg logit contributions
right-0.026
 thirteen-0.025
mer-0.025
 steep-0.024
 length-0.024
Token hiding
Token ID11816
Feature activation+2.03e-02

Pos logit contributions
right+0.019
 thirteen+0.018
mer+0.018
 steep+0.017
 length+0.017
Neg logit contributions
 mail-0.012
 dentist-0.012
grand-0.011
 grandma-0.011
 granddaughter-0.011
Token,"
Token ID553
Feature activation-5.05e-02

Pos logit contributions
right+0.043
 thirteen+0.043
 seventeen+0.042
illed+0.040
 length+0.040
Neg logit contributions
 grandma-0.023
 mail-0.021
 that-0.020
 Brown-0.018
 he-0.017
Token Tom
Token ID4186
Feature activation+3.58e-02

Pos logit contributions
 mail+0.056
 dentist+0.055
 grandma+0.054
grand+0.054
 granddaughter+0.052
Neg logit contributions
right-0.099
 thirteen-0.097
mer-0.096
 length-0.093
 steep-0.093
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
right+0.091
 thirteen+0.090
mer+0.089
 steep+0.088
 length+0.087
Neg logit contributions
 mail-0.019
 dentist-0.017
grand-0.017
 grandma-0.016
 granddaughter-0.016
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
 mail+0.253
 grandma+0.242
 granddaughter+0.238
 dentist+0.234
 dreaming+0.232
Neg logit contributions
 thirteen-0.290
right-0.285
 perspective-0.280
 steep-0.279
mer-0.278
Token
Token ID198
Feature activation-3.06e-02

Pos logit contributions
 mail+0.001
 dentist+0.001
grand+0.001
 grandma+0.001
 granddaughter+0.001
Neg logit contributions
right-0.003
 thirteen-0.003
mer-0.003
 length-0.003
 steep-0.003
Token
Token ID198
Feature activation-2.98e-02

Pos logit contributions
 mail+0.032
grand+0.032
 dentist+0.032
 dreaming+0.031
 granddaughter+0.030
Neg logit contributions
right-0.060
 steep-0.057
 thirteen-0.057
ancy-0.056
mer-0.056
TokenHe
Token ID1544
Feature activation-3.50e-02

Pos logit contributions
 mail+0.036
 grandma+0.035
 doctor+0.034
 dentist+0.034
grand+0.034
Neg logit contributions
right-0.055
mer-0.053
 thirteen-0.053
 perspective-0.052
iful-0.052
Token opens
Token ID9808
Feature activation-1.22e-02

Pos logit contributions
 mail+0.049
 dentist+0.048
 grandma+0.046
grand+0.045
 granddaughter+0.045
Neg logit contributions
right-0.059
mer-0.057
 thirteen-0.055
 perspective-0.055
ancy-0.055
TokenOkay
Token ID16454
Feature activation+1.08e-01

Pos logit contributions
right+0.024
 thirteen+0.023
mer+0.023
 steep+0.022
 length+0.022
Neg logit contributions
 mail-0.034
grand-0.034
 dentist-0.033
 grandma-0.033
 granddaughter-0.033
Token,
Token ID11
Feature activation+3.00e-02

Pos logit contributions
right+0.181
 thirteen+0.175
mer+0.170
 steep+0.167
 length+0.167
Neg logit contributions
 mail-0.155
 grandma-0.153
 dentist-0.150
grand-0.148
 granddaughter-0.145
Token Ben
Token ID3932
Feature activation+3.99e-02

Pos logit contributions
 thirteen+0.045
right+0.044
 steep+0.042
 length+0.039
ancy+0.039
Neg logit contributions
grand-0.049
 mail-0.046
 granddaughter-0.044
 dentist-0.044
 dreaming-0.042
Token,"
Token ID553
Feature activation-4.93e-02

Pos logit contributions
right+0.065
 thirteen+0.064
 length+0.060
mer+0.060
 steep+0.059
Neg logit contributions
 mail-0.059
grand-0.057
 grandma-0.057
 dentist-0.056
 granddaughter-0.055
Token Sara
Token ID24799
Feature activation+6.57e-02

Pos logit contributions
grand+0.074
 mail+0.072
 dentist+0.070
 granddaughter+0.069
 dreaming+0.068
Neg logit contributions
 thirteen-0.074
right-0.074
 steep-0.070
ancy-0.068
Joy-0.068
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
right+0.126
mer+0.123
 thirteen+0.122
y+0.120
illed+0.119
Neg logit contributions
grand-0.068
 dentist-0.064
 mail-0.061
 dreaming-0.059
 grandparents-0.059
Token.
Token ID13
Feature activation-3.77e-03

Pos logit contributions
 mail+0.290
grand+0.286
 dentist+0.285
 grandma+0.278
 granddaughter+0.276
Neg logit contributions
right-0.246
 thirteen-0.239
mer-0.232
 steep-0.228
 length-0.225
Token "
Token ID366
Feature activation+5.28e-02

Pos logit contributions
 mail+0.003
 grandma+0.003
 dentist+0.003
 granddaughter+0.003
 dreaming+0.003
Neg logit contributions
right-0.009
 thirteen-0.008
mer-0.008
 steep-0.008
 length-0.008
TokenWe
Token ID1135
Feature activation+5.59e-02

Pos logit contributions
right+0.065
 thirteen+0.062
mer+0.061
 length+0.058
ancy+0.057
Neg logit contributions
 mail-0.095
grand-0.095
 dentist-0.095
 granddaughter-0.093
 grandma-0.093
Token can
Token ID460
Feature activation+2.40e-02

Pos logit contributions
right+0.085
 thirteen+0.083
mer+0.081
 steep+0.079
 length+0.078
Neg logit contributions
 mail-0.085
grand-0.084
 dentist-0.084
 grandma-0.081
 granddaughter-0.081
Token be
Token ID307
Feature activation+3.24e-02

Pos logit contributions
right+0.043
 thirteen+0.042
 length+0.041
oe+0.040
 steep+0.040
Neg logit contributions
grand-0.029
 dentist-0.029
 granddaughter-0.028
 grandson-0.026
 dreaming-0.026
Token okay
Token ID8788
Feature activation+9.37e-02

Pos logit contributions
 mail+0.044
 grandma+0.042
 dentist+0.037
 tell+0.036
 granddaughter+0.036
Neg logit contributions
mer-0.091
 thirteen-0.084
 perspective-0.082
 slope-0.082
right-0.081
Token,
Token ID11
Feature activation+3.08e-02

Pos logit contributions
 thirteen+0.206
 furthe+0.202
right+0.196
mer+0.195
 length+0.193
Neg logit contributions
 grandma-0.111
 he-0.098
 that-0.094
 mail-0.094
 Brown-0.090
Token Sara
Token ID24799
Feature activation+6.53e-02

Pos logit contributions
right+0.051
mer+0.049
 thirteen+0.049
 steep+0.048
 length+0.048
Neg logit contributions
 mail-0.043
 dentist-0.043
 grandma-0.042
grand-0.041
 granddaughter-0.041
Token,"
Token ID553
Feature activation-4.94e-02

Pos logit contributions
right+0.088
 thirteen+0.085
mer+0.084
 steep+0.082
 length+0.080
Neg logit contributions
 mail-0.107
grand-0.107
 dentist-0.107
 granddaughter-0.103
 grandma-0.102
Token Ben
Token ID3932
Feature activation+4.03e-02

Pos logit contributions
grand+0.073
 mail+0.069
 dentist+0.066
 granddaughter+0.066
 dreaming+0.063
Neg logit contributions
 thirteen-0.079
right-0.078
 steep-0.074
ancy-0.071
mer-0.071
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
right+0.074
 thirteen+0.073
mer+0.071
y+0.069
h+0.068
Neg logit contributions
grand-0.045
 dentist-0.042
 grandparents-0.040
 dreaming-0.039
 mail-0.039
Token.
Token ID13
Feature activation-1.96e-03

Pos logit contributions
 mail+0.281
 dentist+0.273
grand+0.273
 grandma+0.269
 granddaughter+0.267
Neg logit contributions
right-0.255
 thirteen-0.252
mer-0.242
 steep-0.241
 length-0.237
Token "
Token ID366
Feature activation+5.31e-02

Pos logit contributions
 mail+0.002
grand+0.002
 dentist+0.002
 grandma+0.002
 granddaughter+0.002
Neg logit contributions
right-0.004
 thirteen-0.004
mer-0.004
 steep-0.004
 length-0.004
TokenI
Token ID40
Feature activation+8.77e-02

Pos logit contributions
right+0.073
 thirteen+0.071
mer+0.070
 steep+0.067
 length+0.067
Neg logit contributions
 mail-0.088
grand-0.087
 dentist-0.087
 grandma-0.085
 granddaughter-0.085
Token forgive
Token ID20927
Feature activation-1.37e-02

Pos logit contributions
right+0.158
mer+0.153
 thirteen+0.152
 steep+0.151
 length+0.145
Neg logit contributions
 dentist-0.106
grand-0.103
 mail-0.102
 granddaughter-0.100
 dreaming-0.100
Token you
Token ID345
Feature activation+5.15e-02

Pos logit contributions
 grandma+0.008
 mail+0.008
 tell+0.007
 Teddy+0.007
 he+0.007
Neg logit contributions
 steep-0.035
right-0.035
mer-0.034
 length-0.034
 thirteen-0.034
TokenI
Token ID40
Feature activation+9.10e-02

Pos logit contributions
right+0.034
 thirteen+0.033
 steep+0.032
illed+0.032
 length+0.031
Neg logit contributions
 mail-0.032
 dentist-0.032
 doctor-0.031
 grandma-0.031
grand-0.031
Token'm
Token ID1101
Feature activation+3.10e-02

Pos logit contributions
right+0.129
mer+0.123
 steep+0.122
 thirteen+0.122
 length+0.117
Neg logit contributions
 dentist-0.146
 mail-0.145
grand-0.143
 granddaughter-0.139
 dreaming-0.139
Token sorry
Token ID7926
Feature activation-1.21e-02

Pos logit contributions
 thirteen+0.055
right+0.052
 steep+0.052
 centimeters+0.048
Joy+0.047
Neg logit contributions
grand-0.045
 grandparents-0.037
 dentist-0.036
 cash-0.035
 granddaughter-0.034
Token,"
Token ID553
Feature activation-5.09e-02

Pos logit contributions
 grandma+0.016
 mail+0.016
 dentist+0.015
grand+0.015
 he+0.015
Neg logit contributions
right-0.022
 thirteen-0.021
mer-0.021
 length-0.020
 steep-0.020
Token Ben
Token ID3932
Feature activation+4.03e-02

Pos logit contributions
grand+0.066
 mail+0.065
 dentist+0.064
 granddaughter+0.062
 grandma+0.061
Neg logit contributions
right-0.089
 thirteen-0.088
 steep-0.085
mer-0.084
 length-0.082
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
 thirteen+0.090
right+0.089
 steep+0.085
mer+0.084
 length+0.083
Neg logit contributions
grand-0.034
 dentist-0.030
 mail-0.030
 dreaming-0.029
 granddaughter-0.028
Token.
Token ID13
Feature activation+1.12e-04

Pos logit contributions
 mail+0.281
 dentist+0.273
grand+0.271
 grandma+0.270
 granddaughter+0.266
Neg logit contributions
right-0.257
 thirteen-0.251
mer-0.244
 steep-0.240
 length-0.238
Token "
Token ID366
Feature activation+5.50e-02

Pos logit contributions
right+0.000
 thirteen+0.000
mer+0.000
 steep+0.000
 length+0.000
Neg logit contributions
 mail-0.000
 dentist-0.000
grand-0.000
 grandma-0.000
 granddaughter-0.000
TokenI
Token ID40
Feature activation+8.95e-02

Pos logit contributions
right+0.081
 thirteen+0.079
mer+0.077
 steep+0.076
 length+0.075
Neg logit contributions
 mail-0.086
grand-0.085
 dentist-0.085
 grandma-0.083
 granddaughter-0.082
Token just
Token ID655
Feature activation+5.31e-02

Pos logit contributions
right+0.143
 thirteen+0.138
mer+0.136
 steep+0.134
 length+0.131
Neg logit contributions
 mail-0.130
 dentist-0.129
grand-0.128
 grandma-0.124
 granddaughter-0.124
Token wanted
Token ID2227
Feature activation-3.46e-02

Pos logit contributions
mer+0.097
right+0.097
 thirteen+0.096
ancy+0.095
 length+0.093
Neg logit contributions
 mail-0.069
grand-0.062
 tell-0.060
 dreaming-0.060
 grandma-0.060
Token of
Token ID286
Feature activation-1.10e-02

Pos logit contributions
 mail+0.001
grand+0.001
 grandma+0.001
 dentist+0.001
 granddaughter+0.001
Neg logit contributions
 thirteen-0.004
right-0.004
 length-0.004
ancy-0.004
mer-0.004
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
 mail+0.012
 grandma+0.011
 dentist+0.010
grand+0.010
 dreaming+0.010
Neg logit contributions
right-0.023
 thirteen-0.023
mer-0.022
 steep-0.022
illed-0.022
Token beach
Token ID10481
Feature activation-2.44e-02

Pos logit contributions
 mail+0.015
 dentist+0.013
 doctor+0.013
 grandma+0.013
 dreaming+0.012
Neg logit contributions
right-0.022
illed-0.022
 thirteen-0.022
ist-0.021
ancy-0.021
Token"
Token ID1
Feature activation+1.66e-02

Pos logit contributions
 mail+0.034
grand+0.033
 grandma+0.033
 dentist+0.032
 dreaming+0.031
Neg logit contributions
right-0.042
 thirteen-0.041
 length-0.038
ancy-0.038
illed-0.038
Token he
Token ID339
Feature activation-2.11e-02

Pos logit contributions
right+0.033
 thirteen+0.033
mer+0.032
 steep+0.032
 length+0.032
Neg logit contributions
 mail-0.017
grand-0.017
 dentist-0.017
 grandma-0.016
 granddaughter-0.016
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
 mail+0.016
grand+0.015
 dentist+0.015
 granddaughter+0.015
 grandma+0.015
Neg logit contributions
right-0.048
 thirteen-0.048
mer-0.046
 perspective-0.046
 length-0.046
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
 mail+0.224
 grandma+0.223
 he+0.222
 dreaming+0.219
 that+0.214
Neg logit contributions
right-0.306
 thirteen-0.302
mer-0.301
 steep-0.297
 perspective-0.290
Token
Token ID198
Feature activation-3.09e-02

Pos logit contributions
 mail+0.008
grand+0.007
 dentist+0.007
 granddaughter+0.007
 grandma+0.007
Neg logit contributions
right-0.015
 thirteen-0.014
mer-0.014
 steep-0.014
 length-0.014
Token
Token ID198
Feature activation-2.84e-02

Pos logit contributions
 mail+0.032
grand+0.031
 dentist+0.031
 dreaming+0.030
 granddaughter+0.029
Neg logit contributions
right-0.062
 thirteen-0.059
mer-0.058
ancy-0.058
 steep-0.058
TokenJack
Token ID14295
Feature activation+9.76e-02

Pos logit contributions
 mail+0.046
grand+0.046
 dentist+0.046
 grandma+0.044
 granddaughter+0.044
Neg logit contributions
 thirteen-0.040
right-0.040
mer-0.038
 steep-0.038
 length-0.037
Token felt
Token ID2936
Feature activation-9.20e-03

Pos logit contributions
right+0.191
 thirteen+0.185
 steep+0.182
mer+0.181
 length+0.178
Neg logit contributions
 mail-0.107
 dentist-0.103
grand-0.102
 granddaughter-0.102
 grandma-0.101
Tokenily
Token ID813
Feature activation+7.26e-02

Pos logit contributions
right+0.138
 thirteen+0.136
mer+0.133
 steep+0.131
 length+0.129
Neg logit contributions
 mail-0.135
grand-0.134
 dentist-0.134
 grandma-0.129
 granddaughter-0.128
Token,
Token ID11
Feature activation+2.91e-02

Pos logit contributions
right+0.101
 thirteen+0.099
mer+0.096
 steep+0.094
 length+0.093
Neg logit contributions
grand-0.117
 mail-0.117
 dentist-0.117
 granddaughter-0.113
 grandma-0.112
Token wait
Token ID4043
Feature activation+3.06e-02

Pos logit contributions
 thirteen+0.044
right+0.042
 steep+0.038
 furthe+0.038
 ten+0.037
Neg logit contributions
grand-0.046
 granddaughter-0.044
 grandson-0.041
 grandparents-0.041
 mail-0.040
Token!"
Token ID2474
Feature activation-9.82e-04

Pos logit contributions
mer+0.066
right+0.066
 thirteen+0.065
Joy+0.063
ancy+0.062
Neg logit contributions
 grandma-0.029
 dreaming-0.028
 mail-0.026
 tell-0.025
 he-0.025
Token he
Token ID339
Feature activation-2.15e-02

Pos logit contributions
grand+0.001
 granddaughter+0.001
 grandparents+0.001
card+0.001
eddy+0.001
Neg logit contributions
right-0.002
 steep-0.002
 thirteen-0.002
illed-0.001
 low-0.001
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
grand+0.020
 dentist+0.019
 mail+0.019
 dreaming+0.017
 grandma+0.017
Neg logit contributions
right-0.045
 thirteen-0.044
mer-0.043
 steep-0.043
 length-0.041
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
 mail+0.249
 grandma+0.241
 granddaughter+0.236
 dentist+0.234
 dreaming+0.234
Neg logit contributions
 thirteen-0.288
right-0.288
 steep-0.277
mer-0.275
 length-0.274
Token running
Token ID2491
Feature activation+6.09e-02

Pos logit contributions
right+0.055
 thirteen+0.054
mer+0.052
 steep+0.051
 length+0.051
Neg logit contributions
grand-0.036
 mail-0.036
 dentist-0.035
 grandma-0.034
 granddaughter-0.034
Token after
Token ID706
Feature activation+3.49e-03

Pos logit contributions
 thirteen+0.120
right+0.120
 length+0.114
 steep+0.113
 perspective+0.112
Neg logit contributions
 mail-0.066
 grandma-0.063
grand-0.062
 dentist-0.061
 granddaughter-0.060
Token her
Token ID607
Feature activation-4.92e-02

Pos logit contributions
right+0.007
 thirteen+0.007
mer+0.007
 steep+0.007
 length+0.007
Neg logit contributions
grand-0.003
 mail-0.003
 dentist-0.003
 granddaughter-0.003
 grandma-0.003
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
 dentist+0.093
grand+0.092
 granddaughter+0.087
 dreaming+0.087
 mail+0.086
Neg logit contributions
right-0.051
 length-0.050
mer-0.050
 perspective-0.048
 thirteen-0.047
Token't
Token ID470
Feature activation+5.34e-02

Pos logit contributions
right+0.119
 thirteen+0.116
 steep+0.112
mer+0.111
 length+0.110
Neg logit contributions
 dentist-0.065
grand-0.065
 granddaughter-0.064
 mail-0.063
 grandma-0.061
Token know
Token ID760
Feature activation-1.55e-01

Pos logit contributions
mer+0.145
 thirteen+0.144
 seventeen+0.140
 length+0.140
right+0.139
Neg logit contributions
 tell-0.030
 say-0.028
 mail-0.027
 send-0.018
grand-0.018
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
 that+0.179
 grandma+0.167
 mail+0.158
 he+0.155
 hugs+0.149
Neg logit contributions
mer-0.332
ancy-0.317
right-0.315
 thirteen-0.313
chie-0.308
Token The
Token ID383
Feature activation-3.22e-02

Pos logit contributions
 mail+0.003
grand+0.002
 dentist+0.002
 grandma+0.002
 granddaughter+0.002
Neg logit contributions
right-0.004
 thirteen-0.004
mer-0.004
 steep-0.004
 length-0.004
Token sign
Token ID1051
Feature activation-3.17e-02

Pos logit contributions
 mail+0.054
 dentist+0.050
 grandma+0.050
 tell+0.049
 doctor+0.049
Neg logit contributions
right-0.046
 thirteen-0.046
mer-0.045
ancy-0.043
illed-0.042
Token says
Token ID1139
Feature activation-1.76e-01

Pos logit contributions
 mail+0.034
grand+0.031
 dentist+0.031
 grandma+0.031
 granddaughter+0.029
Neg logit contributions
right-0.064
 thirteen-0.063
mer-0.062
ancy-0.061
 length-0.061
Token no
Token ID645
Feature activation+8.20e-02

Pos logit contributions
 mail+0.193
 grandma+0.184
 that+0.178
 doctor+0.167
 he+0.162
Neg logit contributions
right-0.362
 perspective-0.350
mer-0.346
illed-0.343
 length-0.337
Token.
Token ID13
Feature activation-2.89e-03

Pos logit contributions
 thirteen+0.173
ancy+0.167
right+0.167
mer+0.163
 furthe+0.163
Neg logit contributions
 mail-0.098
 grandma-0.091
 that-0.091
 say-0.089
 tell-0.087
Token Maybe
Token ID6674
Feature activation+1.93e-02

Pos logit contributions
 mail+0.003
grand+0.003
 dentist+0.003
 grandma+0.003
 granddaughter+0.003
Neg logit contributions
right-0.006
 thirteen-0.005
mer-0.005
 steep-0.005
 length-0.005
Token it
Token ID340
Feature activation-6.27e-03

Pos logit contributions
right+0.042
 thirteen+0.041
mer+0.040
ancy+0.039
 length+0.039
Neg logit contributions
 mail-0.019
 grandma-0.018
 dentist-0.017
grand-0.017
 granddaughter-0.016
Token is
Token ID318
Feature activation+1.32e-02

Pos logit contributions
 mail+0.008
grand+0.008
 dentist+0.008
 grandma+0.008
 granddaughter+0.008
Neg logit contributions
right-0.011
 thirteen-0.011
ancy-0.010
 perspective-0.010
mer-0.010
Token,
Token ID11
Feature activation+3.37e-02

Pos logit contributions
right+0.288
 furthe+0.277
rated+0.273
mer+0.269
ancy+0.268
Neg logit contributions
 that-0.117
 he-0.113
 friends-0.098
 grandma-0.089
 tell-0.086
Token four
Token ID1440
Feature activation+6.88e-02

Pos logit contributions
right+0.056
mer+0.056
 steep+0.053
 thirteen+0.052
illed+0.052
Neg logit contributions
 mail-0.048
 dentist-0.046
 grandma-0.045
 granddaughter-0.044
grand-0.043
Token,
Token ID11
Feature activation+3.37e-02

Pos logit contributions
right+0.158
ancy+0.151
mer+0.148
rated+0.148
amy+0.144
Neg logit contributions
 he-0.068
 that-0.065
 tell-0.057
 friends-0.056
 mail-0.055
Token five
Token ID1936
Feature activation+1.30e-01

Pos logit contributions
mer+0.055
right+0.052
illed+0.051
rated+0.049
 perspective+0.049
Neg logit contributions
 mail-0.055
 grandma-0.050
 tell-0.049
 send-0.048
 dentist-0.048
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
right+0.285
ancy+0.278
mer+0.270
rated+0.268
amy+0.268
Neg logit contributions
 he-0.139
 that-0.134
 friends-0.118
 tell-0.114
 hugs-0.108
Token six
Token ID2237
Feature activation+1.50e-01

Pos logit contributions
mer+0.052
right+0.050
illed+0.048
rated+0.047
ancy+0.047
Neg logit contributions
 mail-0.056
 grandma-0.051
 dentist-0.050
 tell-0.049
 send-0.049
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
right+0.350
mer+0.341
ancy+0.333
amy+0.331
rated+0.331
Neg logit contributions
 he-0.138
 that-0.130
 friends-0.119
 tell-0.114
 grandma-0.105
Token seven
Token ID3598
Feature activation+1.63e-01

Pos logit contributions
ide+0.043
 perspective+0.042
mer+0.041
illed+0.041
right+0.040
Neg logit contributions
 mail-0.065
 car-0.059
 grandma-0.057
 dentist-0.057
 doctor-0.057
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
mer+0.382
right+0.377
rated+0.377
ancy+0.367
illed+0.352
Neg logit contributions
 he-0.152
 that-0.147
 mail-0.134
 grandma-0.134
 tell-0.132
Token eight
Token ID3624
Feature activation+1.50e-01

Pos logit contributions
mer+0.041
illed+0.040
right+0.040
ide+0.039
rated+0.039
Neg logit contributions
 mail-0.063
 grandma-0.058
 dentist-0.057
 doctor-0.057
 cash-0.056
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
rated+0.365
ust+0.364
right+0.361
ancy+0.356
mer+0.345
Neg logit contributions
 he-0.130
 that-0.120
 hugs-0.105
 car-0.102
 mail-0.099
Token It
Token ID632
Feature activation-1.94e-02

Pos logit contributions
right+0.025
 thirteen+0.024
mer+0.024
 steep+0.024
 length+0.024
Neg logit contributions
 mail-0.011
grand-0.011
 dentist-0.011
 grandma-0.011
 granddaughter-0.010
Token had
Token ID550
Feature activation+2.96e-02

Pos logit contributions
 mail+0.025
 dentist+0.025
grand+0.024
 grandma+0.024
 granddaughter+0.023
Neg logit contributions
right-0.035
 thirteen-0.034
 perspective-0.033
mer-0.032
 length-0.032
Token ger
Token ID27602
Feature activation+7.51e-02

Pos logit contributions
mer+0.058
right+0.058
illed+0.054
 perspective+0.054
 thirteen+0.054
Neg logit contributions
 mail-0.035
 dentist-0.034
 grandma-0.033
 granddaughter-0.031
grand-0.031
Tokenms
Token ID907
Feature activation+3.74e-02

Pos logit contributions
right+0.069
 thirteen+0.066
mer+0.062
 steep+0.061
 length+0.060
Neg logit contributions
 mail-0.160
grand-0.159
 dentist-0.159
 grandma-0.155
 granddaughter-0.154
Token and
Token ID290
Feature activation-5.35e-02

Pos logit contributions
right+0.074
 thirteen+0.072
mer+0.070
 length+0.070
 perspective+0.070
Neg logit contributions
 mail-0.041
 grandma-0.038
 dentist-0.037
grand-0.037
 dreaming-0.036
Token bugs
Token ID11316
Feature activation+1.31e-01

Pos logit contributions
 mail+0.068
 grandma+0.065
 dentist+0.065
grand+0.064
 granddaughter+0.062
Neg logit contributions
right-0.098
mer-0.097
 perspective-0.093
 thirteen-0.093
illed-0.092
Token in
Token ID287
Feature activation+1.25e-01

Pos logit contributions
right+0.220
 thirteen+0.213
mer+0.205
 steep+0.203
 length+0.202
Neg logit contributions
 mail-0.184
grand-0.177
 dentist-0.177
 grandma-0.171
 dreaming-0.170
Token it
Token ID340
Feature activation-7.39e-04

Pos logit contributions
right+0.279
 thirteen+0.277
mer+0.273
 centimeters+0.266
illed+0.266
Neg logit contributions
 mail-0.115
 grandma-0.113
 dentist-0.100
 he-0.094
 Teddy-0.093
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
 mail+0.001
 he+0.001
 dentist+0.001
 that+0.001
 dreaming+0.001
Neg logit contributions
right-0.001
 thirteen-0.001
 perspective-0.001
ancy-0.001
chie-0.001
Token Lily
Token ID20037
Feature activation+4.64e-02

Pos logit contributions
right+0.028
 thirteen+0.028
mer+0.027
 steep+0.027
 length+0.027
Neg logit contributions
 mail-0.012
 dentist-0.012
grand-0.012
 grandma-0.012
 granddaughter-0.011
Token and
Token ID290
Feature activation-5.40e-02

Pos logit contributions
right+0.088
 thirteen+0.087
mer+0.085
 steep+0.084
 length+0.083
Neg logit contributions
 mail-0.053
grand-0.052
 dentist-0.052
 grandma-0.050
 granddaughter-0.049
Token cried
Token ID16896
Feature activation-6.95e-02

Pos logit contributions
 mail+0.033
 dentist+0.032
 grandma+0.031
 granddaughter+0.030
grand+0.030
Neg logit contributions
right-0.049
 perspective-0.047
 thirteen-0.046
mer-0.046
 steep-0.046
Token and
Token ID290
Feature activation-5.52e-02

Pos logit contributions
 mail+0.064
 grandma+0.063
 granddaughter+0.058
 dentist+0.057
 Teddy+0.055
Neg logit contributions
mer-0.155
 thirteen-0.154
right-0.154
 perspective-0.147
 steep-0.146
Token said
Token ID531
Feature activation-1.61e-01

Pos logit contributions
 mail+0.068
 dentist+0.066
grand+0.065
 grandma+0.064
 granddaughter+0.063
Neg logit contributions
right-0.101
 thirteen-0.099
mer-0.097
 steep-0.095
 length-0.095
Token,
Token ID11
Feature activation+3.48e-02

Pos logit contributions
 grandma+0.190
 he+0.183
 mail+0.182
 doctor+0.181
 dentist+0.181
Neg logit contributions
mer-0.331
right-0.329
 length-0.306
 perspective-0.304
rated-0.303
Token "
Token ID366
Feature activation+6.27e-02

Pos logit contributions
right+0.076
mer+0.074
 thirteen+0.073
 length+0.071
ancy+0.071
Neg logit contributions
 mail-0.031
 dentist-0.030
 grandma-0.030
 dreaming-0.029
grand-0.029
TokenO
Token ID46
Feature activation+1.56e-01

Pos logit contributions
 steep+0.090
right+0.089
mer+0.088
 thirteen+0.085
illed+0.082
Neg logit contributions
 mail-0.102
 dentist-0.101
grand-0.101
 grandma-0.099
 granddaughter-0.095
Tokenw
Token ID86
Feature activation+4.93e-02

Pos logit contributions
 furthe+0.255
 thirteen+0.255
 perspective+0.243
 length+0.239
right+0.235
Neg logit contributions
grand-0.229
 dentist-0.214
 grandma-0.209
 mail-0.206
 that-0.202
Token,
Token ID11
Feature activation+3.36e-02

Pos logit contributions
mer+0.101
right+0.100
 steep+0.098
 perspective+0.097
 thirteen+0.094
Neg logit contributions
 Teddy-0.054
 grandma-0.054
 Brown-0.052
 that-0.050
 mail-0.050
Token ow
Token ID12334
Feature activation+4.24e-02

Pos logit contributions
 thirteen+0.054
right+0.052
 steep+0.049
 length+0.048
mer+0.048
Neg logit contributions
grand-0.049
 mail-0.048
 granddaughter-0.047
 dentist-0.046
 dreaming-0.045
Token,
Token ID11
Feature activation+3.39e-02

Pos logit contributions
right+0.088
 thirteen+0.086
mer+0.086
 perspective+0.083
Joy+0.083
Neg logit contributions
 that-0.044
 he-0.044
 grandma-0.043
 mail-0.040
 dentist-0.039
Token ow
Token ID12334
Feature activation+4.23e-02

Pos logit contributions
right+0.055
 thirteen+0.055
 steep+0.052
mer+0.052
 length+0.051
Neg logit contributions
grand-0.048
 mail-0.047
 dentist-0.047
 granddaughter-0.046
 grandma-0.045
Token the
Token ID262
Feature activation-7.44e-03

Pos logit contributions
mer+0.057
right+0.057
 thirteen+0.055
ancy+0.054
 perspective+0.053
Neg logit contributions
 mail-0.032
 grandma-0.029
 dreaming-0.029
 dentist-0.029
 thanking-0.028
Token best
Token ID1266
Feature activation+2.97e-02

Pos logit contributions
 mail+0.009
 dentist+0.009
 grandma+0.009
 granddaughter+0.008
 doctor+0.008
Neg logit contributions
right-0.014
illed-0.014
ancy-0.013
mer-0.013
 thirteen-0.013
Token game
Token ID983
Feature activation-3.59e-03

Pos logit contributions
right+0.057
 thirteen+0.056
mer+0.054
 steep+0.054
 length+0.054
Neg logit contributions
grand-0.033
 mail-0.033
 dentist-0.033
 granddaughter-0.031
 grandma-0.031
Token Ruby
Token ID10888
Feature activation+7.39e-02

Pos logit contributions
 mail+0.005
grand+0.005
 grandma+0.005
 dreaming+0.005
 dentist+0.005
Neg logit contributions
 thirteen-0.005
right-0.005
ancy-0.005
 steep-0.005
 length-0.005
Token had
Token ID550
Feature activation+2.46e-02

Pos logit contributions
 thirteen+0.123
right+0.122
 length+0.115
 steep+0.114
mer+0.111
Neg logit contributions
 mail-0.104
grand-0.103
 grandma-0.101
 granddaughter-0.099
 dreaming-0.098
Token ever
Token ID1683
Feature activation+1.32e-01

Pos logit contributions
 thirteen+0.034
right+0.033
 steep+0.032
mer+0.030
 length+0.030
Neg logit contributions
 mail-0.042
 dreaming-0.041
grand-0.041
 grandma-0.040
 dentist-0.039
Token played
Token ID2826
Feature activation-5.27e-02

Pos logit contributions
right+0.263
 thirteen+0.259
ancy+0.251
mer+0.248
 steep+0.247
Neg logit contributions
 mail-0.145
 dreaming-0.143
grand-0.137
 dentist-0.131
 thanking-0.131
Token.
Token ID13
Feature activation+6.38e-03

Pos logit contributions
 mail+0.056
 dreaming+0.051
 grandma+0.051
 dentist+0.051
 doctor+0.049
Neg logit contributions
right-0.108
 thirteen-0.107
 steep-0.102
mer-0.100
 seventeen-0.099
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
right+0.012
 thirteen+0.012
mer+0.012
 steep+0.012
ancy+0.012
Neg logit contributions
 mail-0.007
grand-0.007
 dentist-0.007
 grandma-0.006
 granddaughter-0.006
Token
Token ID198
Feature activation-1.52e-02

Pos logit contributions
 mail+0.017
grand+0.017
 dentist+0.017
 granddaughter+0.016
 dreaming+0.016
Neg logit contributions
right-0.034
 thirteen-0.033
 steep-0.032
ancy-0.032
mer-0.032
TokenAt
Token ID2953
Feature activation+2.89e-02

Pos logit contributions
 mail+0.026
 dentist+0.026
grand+0.026
 grandma+0.025
 granddaughter+0.025
Neg logit contributions
right-0.020
 thirteen-0.020
mer-0.019
 steep-0.019
 length-0.019
Token Mia
Token ID32189
Feature activation+1.01e-01

Pos logit contributions
right+0.075
 thirteen+0.073
mer+0.072
 steep+0.070
 length+0.070
Neg logit contributions
 mail-0.039
grand-0.039
 dentist-0.038
 grandma-0.036
 granddaughter-0.036
Token and
Token ID290
Feature activation-5.74e-02

Pos logit contributions
right+0.175
 thirteen+0.170
mer+0.166
 steep+0.164
 length+0.162
Neg logit contributions
grand-0.133
 dentist-0.133
 mail-0.127
 grandma-0.119
 granddaughter-0.119
Token said
Token ID531
Feature activation-1.63e-01

Pos logit contributions
grand+0.050
 dentist+0.048
 mail+0.048
 granddaughter+0.042
 grandma+0.041
Neg logit contributions
right-0.123
mer-0.122
 thirteen-0.119
 steep-0.118
ancy-0.115
Token,
Token ID11
Feature activation+3.41e-02

Pos logit contributions
 he+0.182
 that+0.155
 hugs+0.135
 grandma+0.133
 mail+0.132
Neg logit contributions
right-0.366
 length-0.355
mer-0.354
ud-0.348
 perspective-0.345
Token "
Token ID366
Feature activation+6.09e-02

Pos logit contributions
right+0.080
mer+0.077
 thirteen+0.076
 length+0.076
 steep+0.075
Neg logit contributions
 mail-0.025
 grandma-0.024
 dreaming-0.024
 he-0.023
 thanking-0.023
TokenHey
Token ID10814
Feature activation+1.37e-01

Pos logit contributions
right+0.104
 thirteen+0.102
 steep+0.099
 length+0.097
mer+0.096
Neg logit contributions
 mail-0.083
grand-0.082
 dentist-0.079
 granddaughter-0.078
 grandma-0.077
Token Mia
Token ID32189
Feature activation+9.90e-02

Pos logit contributions
right+0.244
 thirteen+0.239
ancy+0.228
 length+0.228
mer+0.226
Neg logit contributions
 grandma-0.177
 mail-0.177
 dentist-0.172
 granddaughter-0.167
 tell-0.163
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
right+0.215
 thirteen+0.211
 furthe+0.204
 length+0.200
ancy+0.198
Neg logit contributions
 grandma-0.106
 Brown-0.100
 Teddy-0.092
 mail-0.091
 granddaughter-0.090
Token do
Token ID466
Feature activation-1.81e-02

Pos logit contributions
right+0.060
 thirteen+0.059
 steep+0.056
mer+0.055
 furthe+0.055
Neg logit contributions
grand-0.042
 dentist-0.041
 granddaughter-0.040
 mail-0.040
 dreaming-0.038
Token you
Token ID345
Feature activation+5.59e-02

Pos logit contributions
 mail+0.018
grand+0.018
 dentist+0.017
 grandma+0.017
 dreaming+0.017
Neg logit contributions
right-0.037
 thirteen-0.037
mer-0.036
 length-0.035
 steep-0.035
Token want
Token ID765
Feature activation-2.52e-02

Pos logit contributions
 thirteen+0.100
right+0.098
 length+0.095
mer+0.094
ancy+0.093
Neg logit contributions
 mail-0.075
 tell-0.072
grand-0.069
 dreaming-0.067
 say-0.067
Token saw
Token ID2497
Feature activation-6.22e-02

Pos logit contributions
grand+0.080
 mail+0.079
 dentist+0.078
 grandma+0.076
 granddaughter+0.076
Neg logit contributions
right-0.091
 thirteen-0.088
mer-0.086
 steep-0.085
 length-0.083
Token that
Token ID326
Feature activation-1.18e-01

Pos logit contributions
grand+0.066
 mail+0.064
 dentist+0.063
 granddaughter+0.061
 grandma+0.059
Neg logit contributions
right-0.124
 thirteen-0.122
 steep-0.118
mer-0.117
 length-0.116
Token the
Token ID262
Feature activation-3.76e-03

Pos logit contributions
 mail+0.107
 grandma+0.103
 dentist+0.103
grand+0.096
 he+0.095
Neg logit contributions
right-0.258
 thirteen-0.252
mer-0.249
ancy-0.244
illed-0.243
Token skeleton
Token ID18328
Feature activation+5.54e-03

Pos logit contributions
grand+0.003
 mail+0.003
 dentist+0.003
 grandma+0.003
 granddaughter+0.003
Neg logit contributions
right-0.008
 thirteen-0.008
mer-0.008
 steep-0.008
 length-0.008
Token was
Token ID373
Feature activation+3.08e-02

Pos logit contributions
right+0.010
 thirteen+0.010
ancy+0.010
 perspective+0.010
mer+0.010
Neg logit contributions
 mail-0.007
 dentist-0.006
 he-0.006
grand-0.006
 dreaming-0.006
Token very
Token ID845
Feature activation+4.61e-02

Pos logit contributions
mer+0.060
right+0.060
 perspective+0.058
ancy+0.057
about+0.056
Neg logit contributions
 mail-0.037
 dentist-0.035
 dreaming-0.035
 grandma-0.033
 thanking-0.033
Token patient
Token ID5827
Feature activation+3.86e-02

Pos logit contributions
 thirteen+0.089
right+0.088
 perspective+0.087
mer+0.085
about+0.084
Neg logit contributions
 mail-0.054
 he-0.049
 grandma-0.048
 tell-0.048
 dentist-0.046
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
right+0.075
 thirteen+0.074
Joy+0.073
ancy+0.072
 steep+0.071
Neg logit contributions
 mail-0.042
 he-0.041
 dreaming-0.041
 that-0.041
 grandma-0.039
Token just
Token ID655
Feature activation+5.43e-02

Pos logit contributions
right+0.056
 thirteen+0.055
mer+0.053
 steep+0.053
 length+0.052
Neg logit contributions
 mail-0.044
grand-0.043
 dentist-0.043
 grandma-0.041
 granddaughter-0.041
Token standing
Token ID5055
Feature activation+4.72e-02

Pos logit contributions
right+0.103
mer+0.103
ancy+0.101
 thirteen+0.101
Joy+0.097
Neg logit contributions
 mail-0.063
 dreaming-0.059
 thanking-0.056
 he-0.056
 grandma-0.056
Token there
Token ID612
Feature activation+7.93e-03

Pos logit contributions
aney+0.094
 perspective+0.093
right+0.092
 seventeen+0.091
 thirteen+0.089
Neg logit contributions
 dreaming-0.050
 mail-0.050
 he-0.048
 that-0.046
 grandma-0.046
Token a
Token ID257
Feature activation-1.47e-02

Pos logit contributions
grand+0.037
 mail+0.036
 dentist+0.036
 grandma+0.034
 granddaughter+0.034
Neg logit contributions
right-0.061
 thirteen-0.059
mer-0.057
 steep-0.057
 length-0.056
Token big
Token ID1263
Feature activation+1.62e-02

Pos logit contributions
grand+0.018
 dentist+0.018
 mail+0.017
 grandma+0.017
 dreaming+0.017
Neg logit contributions
right-0.026
 thirteen-0.025
 steep-0.025
mer-0.024
 length-0.024
Token hug
Token ID16225
Feature activation-6.68e-02

Pos logit contributions
right+0.041
mer+0.040
 thirteen+0.040
 length+0.039
 steep+0.039
Neg logit contributions
 mail-0.010
 grandma-0.008
 dentist-0.008
 granddaughter-0.008
 tell-0.007
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
 mail+0.105
grand+0.104
 dentist+0.103
 grandma+0.101
 granddaughter+0.100
Neg logit contributions
right-0.098
 thirteen-0.095
mer-0.092
 steep-0.091
 length-0.090
Token<|endoftext|>
Token ID50256
Feature activation-3.40e-03

Pos logit contributions
right+0.022
mer+0.021
 thirteen+0.021
 length+0.021
 steep+0.020
Neg logit contributions
 mail-0.011
 dentist-0.011
 grandma-0.011
grand-0.011
 granddaughter-0.011
TokenOnce
Token ID7454
Feature activation+5.77e-02

Pos logit contributions
 mail+0.003
 grandma+0.003
 Brown+0.003
 dentist+0.003
 Teddy+0.002
Neg logit contributions
 thirteen-0.008
right-0.008
 length-0.008
 steep-0.007
chie-0.007
Token upon
Token ID2402
Feature activation+2.93e-02

Pos logit contributions
right+0.080
 thirteen+0.076
mer+0.075
 steep+0.075
 length+0.072
Neg logit contributions
 mail-0.096
grand-0.095
 dentist-0.094
 granddaughter-0.091
 grandma-0.090
Token a
Token ID257
Feature activation-1.36e-02

Pos logit contributions
right+0.052
 thirteen+0.049
 steep+0.048
 length+0.047
ancy+0.047
Neg logit contributions
grand-0.037
 dentist-0.036
 mail-0.035
 granddaughter-0.034
 grandma-0.034
Token time
Token ID640
Feature activation+9.87e-02

Pos logit contributions
 grandma+0.024
 dreaming+0.022
 tell+0.022
 mail+0.022
 dentist+0.022
Neg logit contributions
 thirteen-0.020
chie-0.019
mer-0.019
 length-0.017
 furthe-0.017
Token there
Token ID612
Feature activation+5.56e-03

Pos logit contributions
 thirteen+0.218
 length+0.215
mer+0.212
right+0.206
rated+0.203
Neg logit contributions
 he-0.096
 grandma-0.095
 that-0.089
 Brown-0.086
 Teddy-0.083
Token was
Token ID373
Feature activation+2.83e-02

Pos logit contributions
 thirteen+0.012
right+0.012
mer+0.011
 length+0.011
 steep+0.011
Neg logit contributions
 mail-0.006
 grandma-0.005
 dentist-0.005
 he-0.005
grand-0.005
Token liked
Token ID8288
Feature activation-7.55e-03

Pos logit contributions
 mail+0.014
grand+0.013
 dentist+0.013
 grandma+0.013
 dreaming+0.013
Neg logit contributions
right-0.028
 thirteen-0.028
mer-0.026
 perspective-0.026
 length-0.026
Token to
Token ID284
Feature activation-7.05e-03

Pos logit contributions
grand+0.010
 mail+0.009
 dentist+0.009
 granddaughter+0.009
 grandma+0.008
Neg logit contributions
right-0.013
 steep-0.013
 thirteen-0.013
 length-0.012
 perspective-0.012
Token explore
Token ID7301
Feature activation-7.32e-03

Pos logit contributions
 mail+0.008
grand+0.007
 dentist+0.007
 grandma+0.007
 granddaughter+0.007
Neg logit contributions
right-0.014
 thirteen-0.014
mer-0.013
 steep-0.013
 length-0.013
Token new
Token ID649
Feature activation-2.17e-02

Pos logit contributions
 mail+0.007
 grandma+0.007
 dentist+0.006
 dreaming+0.006
grand+0.006
Neg logit contributions
 thirteen-0.016
right-0.016
mer-0.015
ancy-0.015
 length-0.015
Token places
Token ID4113
Feature activation-1.07e-02

Pos logit contributions
 mail+0.023
 grandma+0.022
grand+0.022
 dentist+0.022
 granddaughter+0.021
Neg logit contributions
 thirteen-0.045
right-0.044
 seventeen-0.042
mer-0.041
 steep-0.041
Token,
Token ID11
Feature activation+3.41e-02

Pos logit contributions
grand+0.015
 dentist+0.015
 mail+0.015
 granddaughter+0.015
 grandma+0.014
Neg logit contributions
right-0.017
 thirteen-0.017
mer-0.017
 steep-0.016
 length-0.016
Token but
Token ID475
Feature activation+2.70e-02

Pos logit contributions
right+0.044
 steep+0.043
 thirteen+0.043
mer+0.040
 length+0.040
Neg logit contributions
grand-0.060
 dentist-0.059
 granddaughter-0.059
 mail-0.057
 grandma-0.056
Token he
Token ID339
Feature activation-1.10e-02

Pos logit contributions
right+0.056
 thirteen+0.055
 steep+0.054
mer+0.053
 length+0.053
Neg logit contributions
 mail-0.026
grand-0.026
 dentist-0.025
 granddaughter-0.024
 grandma-0.024
Token had
Token ID550
Feature activation+3.12e-02

Pos logit contributions
 dentist+0.013
 mail+0.013
grand+0.013
 grandma+0.012
 dreaming+0.012
Neg logit contributions
right-0.021
 thirteen-0.021
mer-0.020
 length-0.019
ancy-0.019
Token never
Token ID1239
Feature activation+3.42e-02

Pos logit contributions
right+0.062
mer+0.061
 thirteen+0.061
ancy+0.058
 length+0.057
Neg logit contributions
 mail-0.034
 dentist-0.034
 grandma-0.032
grand-0.032
 dreaming-0.032
Token been
Token ID587
Feature activation+1.46e-02

Pos logit contributions
 thirteen+0.061
right+0.061
mer+0.060
ancy+0.058
 steep+0.056
Neg logit contributions
 mail-0.044
 dentist-0.043
grand-0.043
 dreaming-0.042
 granddaughter-0.041
Token and
Token ID290
Feature activation-5.86e-02

Pos logit contributions
right+0.048
mer+0.047
 perspective+0.047
ancy+0.046
 thirteen+0.046
Neg logit contributions
 mail-0.040
 grandma-0.037
 dentist-0.035
 dreaming-0.035
 granddaughter-0.034
Token violet
Token ID45725
Feature activation+8.40e-02

Pos logit contributions
 mail+0.075
 grandma+0.071
 dentist+0.067
grand+0.066
 dreaming+0.066
Neg logit contributions
right-0.111
mer-0.109
ancy-0.106
 thirteen-0.105
 perspective-0.105
Token is
Token ID318
Feature activation+1.72e-02

Pos logit contributions
right+0.150
 thirteen+0.148
 steep+0.143
 furthe+0.143
 length+0.140
Neg logit contributions
 mail-0.105
 grandma-0.103
grand-0.102
 that-0.102
 granddaughter-0.095
Token a
Token ID257
Feature activation-1.03e-02

Pos logit contributions
mer+0.035
right+0.033
about+0.033
 perspective+0.033
ancy+0.032
Neg logit contributions
 mail-0.020
 grandma-0.019
 dreaming-0.019
 dentist-0.019
 granddaughter-0.018
Token light
Token ID1657
Feature activation-1.41e-02

Pos logit contributions
 mail+0.013
 grandma+0.013
 dentist+0.012
 doctor+0.012
 granddaughter+0.011
Neg logit contributions
right-0.021
illed-0.021
 thirteen-0.020
mer-0.019
ancy-0.019
Token purple
Token ID14032
Feature activation+3.39e-02

Pos logit contributions
 mail+0.019
 grandma+0.017
 that+0.017
 doctor+0.016
 dentist+0.016
Neg logit contributions
right-0.026
 thirteen-0.026
illed-0.025
ancy-0.024
 furthe-0.024
Token.
Token ID13
Feature activation+9.27e-03

Pos logit contributions
right+0.066
 thirteen+0.064
illed+0.062
 steep+0.061
chie+0.061
Neg logit contributions
 mail-0.046
 grandma-0.040
 doctor-0.040
 dentist-0.039
 that-0.038
Token Do
Token ID2141
Feature activation+4.85e-02

Pos logit contributions
right+0.019
 thirteen+0.018
mer+0.018
 steep+0.018
 length+0.018
Neg logit contributions
 mail-0.010
grand-0.009
 dentist-0.009
 grandma-0.009
 granddaughter-0.009
Token you
Token ID345
Feature activation+5.84e-02

Pos logit contributions
right+0.095
 thirteen+0.094
 steep+0.090
mer+0.090
 length+0.089
Neg logit contributions
 mail-0.053
grand-0.052
 dentist-0.051
 granddaughter-0.049
 grandma-0.049
Token want
Token ID765
Feature activation-2.19e-02

Pos logit contributions
right+0.104
 thirteen+0.104
mer+0.103
 steep+0.099
 length+0.099
Neg logit contributions
 mail-0.078
 tell-0.075
 write-0.070
 say-0.070
grand-0.070
Token to
Token ID284
Feature activation-2.63e-03

Pos logit contributions
grand+0.024
 dentist+0.023
 mail+0.023
 granddaughter+0.022
 grandma+0.022
Neg logit contributions
right-0.042
 thirteen-0.042
 steep-0.040
 length-0.040
mer-0.040
Token were
Token ID547
Feature activation+1.18e-02

Pos logit contributions
mer+0.063
right+0.063
 thirteen+0.063
 steep+0.062
 length+0.060
Neg logit contributions
grand-0.055
 dentist-0.055
 mail-0.054
 granddaughter-0.052
 grandma-0.050
Token friends
Token ID2460
Feature activation-5.50e-02

Pos logit contributions
right+0.023
 thirteen+0.023
 steep+0.022
 length+0.022
mer+0.021
Neg logit contributions
grand-0.014
 dentist-0.013
 mail-0.013
 granddaughter-0.012
 grandson-0.012
Token forever
Token ID8097
Feature activation+4.35e-02

Pos logit contributions
 mail+0.088
 dentist+0.086
grand+0.086
 grandma+0.084
 granddaughter+0.083
Neg logit contributions
right-0.080
 thirteen-0.079
mer-0.075
 steep-0.075
 length-0.074
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
right+0.081
 thirteen+0.081
mer+0.078
 length+0.075
ancy+0.075
Neg logit contributions
 mail-0.053
 dreaming-0.052
 grandma-0.050
 dentist-0.049
grand-0.048
Token<|endoftext|>
Token ID50256
Feature activation-1.08e-02

Pos logit contributions
right+0.007
 steep+0.007
ancy+0.007
 thirteen+0.006
mer+0.006
Neg logit contributions
 dentist-0.006
 mail-0.005
grand-0.005
 airplane-0.005
 granddaughter-0.005
TokenOne
Token ID3198
Feature activation+8.73e-02

Pos logit contributions
 mail+0.007
 dentist+0.007
 grandma+0.007
grand+0.007
 granddaughter+0.006
Neg logit contributions
right-0.026
 thirteen-0.026
mer-0.025
 length-0.025
 steep-0.025
Token day
Token ID1110
Feature activation-3.22e-02

Pos logit contributions
 thirteen+0.165
mer+0.157
right+0.154
 length+0.152
 furthe+0.149
Neg logit contributions
 mail-0.113
 grandma-0.108
 dentist-0.104
grand-0.103
 dreaming-0.101
Token,
Token ID11
Feature activation+3.44e-02

Pos logit contributions
 he+0.032
 that+0.026
 grandma+0.025
 Teddy+0.024
 dreaming+0.023
Neg logit contributions
 thirteen-0.079
 steep-0.074
 perspective-0.074
 length-0.073
 seventeen-0.072
Token a
Token ID257
Feature activation-1.38e-02

Pos logit contributions
 thirteen+0.096
 perspective+0.092
 length+0.090
 seventeen+0.090
mer+0.090
Neg logit contributions
 he-0.016
 grandma-0.015
 mail-0.015
 that-0.014
 Brown-0.010
Token boy
Token ID2933
Feature activation-1.54e-02

Pos logit contributions
 mail+0.013
 grandma+0.013
 dentist+0.012
 granddaughter+0.012
 doctor+0.012
Neg logit contributions
 thirteen-0.032
right-0.031
illed-0.029
 perspective-0.029
 length-0.029
Token named
Token ID3706
Feature activation-5.27e-02

Pos logit contributions
 mail+0.016
grand+0.016
 dentist+0.016
 grandma+0.015
 granddaughter+0.015
Neg logit contributions
right-0.031
 thirteen-0.030
mer-0.029
 steep-0.029
 length-0.029
Token beautiful
Token ID4950
Feature activation-2.52e-02

Pos logit contributions
 mail+0.028
 dentist+0.027
 grandma+0.026
 doctor+0.025
 granddaughter+0.024
Neg logit contributions
right-0.051
 thirteen-0.049
illed-0.048
mer-0.048
ancy-0.047
Token butterfly
Token ID35113
Feature activation+6.15e-03

Pos logit contributions
 mail+0.032
 dentist+0.030
 grandma+0.030
grand+0.028
 granddaughter+0.028
Neg logit contributions
right-0.049
 thirteen-0.048
mer-0.046
illed-0.045
 steep-0.045
Token flying
Token ID7348
Feature activation-1.14e-03

Pos logit contributions
right+0.011
 thirteen+0.010
mer+0.010
 steep+0.010
 length+0.010
Neg logit contributions
 mail-0.008
 dentist-0.008
grand-0.008
 grandma-0.008
 dreaming-0.008
Token around
Token ID1088
Feature activation+2.96e-02

Pos logit contributions
 that+0.001
 mail+0.001
 he+0.001
 invitation+0.001
 for+0.001
Neg logit contributions
 thirteen-0.002
 centimeters-0.002
 seventeen-0.002
 degrees-0.002
ud-0.002
Token in
Token ID287
Feature activation+1.22e-01

Pos logit contributions
 thirteen+0.068
right+0.068
 seventeen+0.067
ust+0.065
 steep+0.065
Neg logit contributions
 that-0.027
 he-0.026
 him-0.025
 mail-0.025
 dreaming-0.024
Token the
Token ID262
Feature activation-1.14e-02

Pos logit contributions
mer+0.280
right+0.279
 thirteen+0.277
 steep+0.268
ancy+0.267
Neg logit contributions
 mail-0.111
 grandma-0.093
 dentist-0.084
 dreaming-0.084
 that-0.083
Token garden
Token ID11376
Feature activation-4.96e-02

Pos logit contributions
 mail+0.014
 dentist+0.011
 grandma+0.011
 airplane+0.010
 doctor+0.010
Neg logit contributions
right-0.024
illed-0.023
 thirteen-0.023
mer-0.023
ancy-0.022
Token.
Token ID13
Feature activation+3.44e-03

Pos logit contributions
 mail+0.068
 dreaming+0.066
 grandma+0.063
 dentist+0.062
 he+0.061
Neg logit contributions
right-0.085
 thirteen-0.084
mer-0.080
ancy-0.078
 steep-0.078
Token She
Token ID1375
Feature activation-2.93e-02

Pos logit contributions
right+0.007
 thirteen+0.007
mer+0.007
 steep+0.007
 length+0.007
Neg logit contributions
 mail-0.003
grand-0.003
 dentist-0.003
 grandma-0.003
 granddaughter-0.003
Token pointed
Token ID6235
Feature activation-7.43e-03

Pos logit contributions
 mail+0.032
 grandma+0.030
 tell+0.030
 granddaughter+0.029
 dreaming+0.029
Neg logit contributions
right-0.060
 thirteen-0.057
mer-0.057
ancy-0.055
 steep-0.055
Token to
Token ID284
Feature activation-7.43e-03

Pos logit contributions
 mail+0.007
 grandma+0.007
 granddaughter+0.007
 dentist+0.006
 thanking+0.006
Neg logit contributions
right-0.016
 thirteen-0.015
mer-0.015
 length-0.015
 steep-0.015
Token gift
Token ID6979
Feature activation-8.51e-02

Pos logit contributions
grand+0.019
 mail+0.019
 dentist+0.019
 granddaughter+0.018
 grandma+0.018
Neg logit contributions
right-0.027
 thirteen-0.027
mer-0.026
 steep-0.026
 length-0.026
Token from
Token ID422
Feature activation-1.01e-03

Pos logit contributions
grand+0.101
 granddaughter+0.099
 dentist+0.098
 dreaming+0.095
 mail+0.095
Neg logit contributions
right-0.146
mer-0.145
 thirteen-0.140
 steep-0.137
 length-0.135
Token his
Token ID465
Feature activation-4.16e-02

Pos logit contributions
 mail+0.001
grand+0.001
 dentist+0.001
 grandma+0.001
 granddaughter+0.001
Neg logit contributions
right-0.002
 thirteen-0.002
mer-0.002
 steep-0.002
 length-0.002
Token dad
Token ID9955
Feature activation-3.52e-02

Pos logit contributions
grand+0.026
 mail+0.026
 dentist+0.026
 granddaughter+0.023
 dreaming+0.022
Neg logit contributions
right-0.097
mer-0.095
 thirteen-0.095
 length-0.094
 steep-0.094
Token.
Token ID13
Feature activation+8.02e-03

Pos logit contributions
 mail+0.054
grand+0.053
 dentist+0.053
 grandma+0.051
 granddaughter+0.051
Neg logit contributions
right-0.053
 thirteen-0.052
mer-0.050
 steep-0.049
 length-0.049
Token She
Token ID1375
Feature activation-3.12e-02

Pos logit contributions
right+0.011
 steep+0.010
Joy+0.010
 thirteen+0.010
ill+0.010
Neg logit contributions
 airplane-0.012
 dreaming-0.012
 mail-0.012
grand-0.012
 delivery-0.011
Token says
Token ID1139
Feature activation-1.63e-01

Pos logit contributions
grand+0.040
 dreaming+0.035
 reunited+0.032
 granddaughter+0.032
handed+0.031
Neg logit contributions
 thirteen-0.055
right-0.050
 steep-0.048
 length-0.047
 seasons-0.047
Token that
Token ID326
Feature activation-1.19e-01

Pos logit contributions
grand+0.126
 dentist+0.113
 mail+0.109
 granddaughter+0.106
 dreaming+0.103
Neg logit contributions
 thirteen-0.364
right-0.340
 steep-0.330
h-0.326
 seventeen-0.324
Token Sam
Token ID3409
Feature activation+3.98e-02

Pos logit contributions
grand+0.099
 granddaughter+0.098
 mail+0.095
 dentist+0.093
 reunited+0.088
Neg logit contributions
 thirteen-0.230
right-0.228
 steep-0.225
 length-0.218
 seasons-0.215
Token's
Token ID338
Feature activation-3.03e-02

Pos logit contributions
right+0.059
 thirteen+0.057
mer+0.056
 steep+0.055
 length+0.053
Neg logit contributions
 dentist-0.060
 mail-0.059
grand-0.059
 granddaughter-0.058
 grandma-0.058
Token car
Token ID1097
Feature activation-6.85e-02

Pos logit contributions
grand+0.024
 dentist+0.022
 thanking+0.022
 Prince+0.021
 dreaming+0.021
Neg logit contributions
 length-0.063
right-0.063
 steep-0.061
 thirteen-0.060
mer-0.059
Token.
Token ID13
Feature activation+7.79e-03

Pos logit contributions
right+0.016
 thirteen+0.016
ancy+0.015
illed+0.015
 steep+0.015
Neg logit contributions
 mail-0.014
 dreaming-0.013
grand-0.013
 grandma-0.013
 dentist-0.013
Token The
Token ID383
Feature activation-3.04e-02

Pos logit contributions
right+0.015
 thirteen+0.015
 steep+0.014
mer+0.014
ancy+0.014
Neg logit contributions
 mail-0.008
 dentist-0.008
grand-0.008
 granddaughter-0.008
 grandma-0.008
Token little
Token ID1310
Feature activation+5.15e-03

Pos logit contributions
 mail+0.037
grand+0.036
 dentist+0.036
 grandma+0.035
 granddaughter+0.034
Neg logit contributions
right-0.056
 thirteen-0.055
mer-0.054
 steep-0.053
 length-0.053
Token girl
Token ID2576
Feature activation-3.72e-02

Pos logit contributions
right+0.012
 thirteen+0.012
 steep+0.011
 seventeen+0.011
mer+0.011
Neg logit contributions
 mail-0.005
 grandma-0.004
 granddaughter-0.004
 dentist-0.004
 dreaming-0.004
Token even
Token ID772
Feature activation+4.38e-02

Pos logit contributions
 dentist+0.053
grand+0.053
 mail+0.052
 grandma+0.050
 granddaughter+0.050
Neg logit contributions
right-0.058
 thirteen-0.057
mer-0.056
 length-0.055
 steep-0.054
Token noticed
Token ID6810
Feature activation-4.18e-02

Pos logit contributions
right+0.075
 thirteen+0.072
mer+0.071
 steep+0.069
 length+0.069
Neg logit contributions
grand-0.059
 mail-0.057
 dentist-0.057
 grandma-0.055
 granddaughter-0.053
Token new
Token ID649
Feature activation-2.39e-02

Pos logit contributions
 mail+0.030
 dentist+0.028
grand+0.028
 grandma+0.027
 granddaughter+0.026
Neg logit contributions
right-0.099
 thirteen-0.097
mer-0.096
 steep-0.094
 length-0.093
Token details
Token ID3307
Feature activation+1.04e-02

Pos logit contributions
 mail+0.028
 grandma+0.026
 dentist+0.026
grand+0.026
 granddaughter+0.025
Neg logit contributions
right-0.047
 thirteen-0.046
mer-0.045
 steep-0.045
ancy-0.043
Token about
Token ID546
Feature activation-4.40e-02

Pos logit contributions
right+0.022
 thirteen+0.021
mer+0.021
ancy+0.021
 steep+0.021
Neg logit contributions
 mail-0.010
 dentist-0.009
 grandma-0.009
 he-0.009
 dreaming-0.009
Token the
Token ID262
Feature activation-9.26e-03

Pos logit contributions
 mail+0.043
 grandma+0.040
 dreaming+0.037
 dentist+0.036
 thanking+0.036
Neg logit contributions
mer-0.098
right-0.098
 thirteen-0.094
 steep-0.093
illed-0.093
Token ornament
Token ID45618
Feature activation-8.11e-03

Pos logit contributions
 mail+0.012
 dentist+0.011
 grandma+0.011
grand+0.011
 dreaming+0.010
Neg logit contributions
right-0.018
 thirteen-0.017
mer-0.017
illed-0.016
 steep-0.016
Token "
Token ID366
Feature activation+5.63e-02

Pos logit contributions
right+0.058
 thirteen+0.057
mer+0.055
 steep+0.055
 length+0.055
Neg logit contributions
grand-0.034
 mail-0.034
 dentist-0.034
 grandma-0.032
 granddaughter-0.032
TokenCan
Token ID6090
Feature activation+4.83e-02

Pos logit contributions
right+0.083
 thirteen+0.079
 steep+0.079
mer+0.079
illed+0.075
Neg logit contributions
 mail-0.091
 dentist-0.088
grand-0.087
 grandma-0.085
 granddaughter-0.085
Token I
Token ID314
Feature activation+9.63e-02

Pos logit contributions
right+0.070
 thirteen+0.069
 steep+0.066
mer+0.064
 furthe+0.063
Neg logit contributions
grand-0.073
 dentist-0.072
 granddaughter-0.072
 mail-0.071
 thanking-0.070
Token help
Token ID1037
Feature activation-1.54e-02

Pos logit contributions
right+0.186
 thirteen+0.180
mer+0.177
 steep+0.175
 length+0.174
Neg logit contributions
 dentist-0.106
grand-0.106
 mail-0.104
 granddaughter-0.100
 grandma-0.100
Token them
Token ID606
Feature activation-9.31e-03

Pos logit contributions
grand+0.020
 dentist+0.020
 granddaughter+0.019
 dreaming+0.018
 mail+0.018
Neg logit contributions
right-0.026
 thirteen-0.025
mer-0.025
 length-0.025
ancy-0.024
Token?"
Token ID1701
Feature activation+1.25e-02

Pos logit contributions
grand+0.013
 dentist+0.013
 granddaughter+0.013
 mail+0.012
 grandma+0.012
Neg logit contributions
right-0.015
 thirteen-0.015
mer-0.014
 length-0.014
 steep-0.014
Token
Token ID198
Feature activation-2.50e-02

Pos logit contributions
right+0.022
oe+0.020
y+0.020
h+0.020
about+0.020
Neg logit contributions
grand-0.013
 dentist-0.013
 airplane-0.013
 invitation-0.013
 mail-0.012
Token His
Token ID2399
Feature activation-3.39e-02

Pos logit contributions
 dreaming+0.037
 dentist+0.037
grand+0.035
 mail+0.035
 thanking+0.035
Neg logit contributions
right-0.039
oe-0.036
ancy-0.035
h-0.035
mer-0.035
Token mom
Token ID1995
Feature activation-1.53e-02

Pos logit contributions
grand+0.029
 mail+0.025
 dentist+0.024
 dreaming+0.023
 thanking+0.022
Neg logit contributions
right-0.073
 length-0.072
 steep-0.071
ancy-0.070
mer-0.069
Token said
Token ID531
Feature activation-1.67e-01

Pos logit contributions
grand+0.014
 dentist+0.014
 mail+0.014
 grandma+0.013
 granddaughter+0.013
Neg logit contributions
right-0.032
 thirteen-0.032
mer-0.031
 length-0.030
 steep-0.030
Token,
Token ID11
Feature activation+3.21e-02

Pos logit contributions
 mail+0.199
 grandma+0.197
 he+0.195
 that+0.190
 granddaughter+0.188
Neg logit contributions
right-0.309
 thirteen-0.308
 steep-0.308
 perspective-0.304
mer-0.304
Token are
Token ID389
Feature activation+1.65e-02

Pos logit contributions
grand+0.064
 dentist+0.064
 mail+0.064
 granddaughter+0.063
 thanking+0.063
Neg logit contributions
right-0.062
 thirteen-0.060
 steep-0.058
mer-0.055
ancy-0.055
Token you
Token ID345
Feature activation+5.87e-02

Pos logit contributions
right+0.030
 thirteen+0.029
ancy+0.028
 steep+0.028
mer+0.027
Neg logit contributions
 dentist-0.019
grand-0.019
 mail-0.019
 granddaughter-0.018
 dreaming-0.018
Token?
Token ID30
Feature activation+3.78e-02

Pos logit contributions
right+0.072
 thirteen+0.067
mer+0.066
 length+0.065
ancy+0.064
Neg logit contributions
grand-0.106
 dentist-0.103
 granddaughter-0.100
 mail-0.099
 grandma-0.096
Token We
Token ID775
Feature activation+6.24e-02

Pos logit contributions
right+0.063
 thirteen+0.062
mer+0.059
ancy+0.058
 steep+0.057
Neg logit contributions
 dentist-0.050
 mail-0.048
 granddaughter-0.046
grand-0.046
 grandma-0.046
Token are
Token ID389
Feature activation+1.67e-02

Pos logit contributions
right+0.091
 thirteen+0.089
mer+0.088
 steep+0.085
 length+0.082
Neg logit contributions
 dentist-0.096
grand-0.093
 granddaughter-0.092
 grandma-0.088
 mail-0.087
Token looking
Token ID2045
Feature activation-6.47e-03

Pos logit contributions
right+0.029
 thirteen+0.029
mer+0.028
 steep+0.028
 length+0.027
Neg logit contributions
grand-0.021
 dentist-0.020
 mail-0.020
 granddaughter-0.020
 grandma-0.019
Token for
Token ID329
Feature activation-1.13e-02

Pos logit contributions
 mail+0.006
grand+0.006
 dentist+0.006
 grandma+0.006
 granddaughter+0.006
Neg logit contributions
right-0.013
 thirteen-0.013
mer-0.013
 steep-0.013
 length-0.012
Token you
Token ID345
Feature activation+5.90e-02

Pos logit contributions
 mail+0.012
grand+0.012
 dentist+0.012
 granddaughter+0.011
 grandma+0.011
Neg logit contributions
right-0.022
 thirteen-0.022
mer-0.021
 steep-0.021
 length-0.021
Token.
Token ID13
Feature activation+5.13e-03

Pos logit contributions
 thirteen+0.097
right+0.097
mer+0.091
ancy+0.089
 steep+0.089
Neg logit contributions
 mail-0.086
 grandma-0.083
 dentist-0.081
 dreaming-0.081
grand-0.079
Token We
Token ID775
Feature activation+6.23e-02

Pos logit contributions
right+0.009
 thirteen+0.009
mer+0.008
ancy+0.008
 steep+0.008
Neg logit contributions
 dentist-0.007
 mail-0.007
 dreaming-0.006
grand-0.006
 granddaughter-0.006
Token are
Token ID389
Feature activation+1.68e-02

Pos logit contributions
right+0.088
mer+0.086
 thirteen+0.085
 steep+0.082
 furthe+0.079
Neg logit contributions
 dentist-0.099
 granddaughter-0.096
grand-0.095
 grandma-0.091
 dreaming-0.090
Token catch
Token ID4929
Feature activation+2.20e-02

Pos logit contributions
 mail+0.007
grand+0.007
 dentist+0.007
 grandma+0.007
 granddaughter+0.007
Neg logit contributions
right-0.011
 thirteen-0.011
mer-0.011
 steep-0.011
 length-0.011
Token it
Token ID340
Feature activation-2.51e-03

Pos logit contributions
right+0.044
 thirteen+0.043
mer+0.042
 steep+0.042
 length+0.042
Neg logit contributions
 mail-0.023
grand-0.022
 dentist-0.022
 grandma-0.022
 granddaughter-0.021
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 mail+0.003
 he+0.003
 that+0.003
 tell+0.003
 grandma+0.003
Neg logit contributions
 thirteen-0.005
right-0.005
 seventeen-0.004
mer-0.004
ancy-0.004
Token His
Token ID2399
Feature activation-3.05e-02

Pos logit contributions
right+0.023
 thirteen+0.022
 steep+0.022
mer+0.022
 length+0.021
Neg logit contributions
grand-0.011
 dentist-0.011
 mail-0.011
 dreaming-0.010
 granddaughter-0.010
Token friend
Token ID1545
Feature activation-6.46e-02

Pos logit contributions
grand+0.027
 dentist+0.023
 mail+0.022
 dreaming+0.021
 granddaughter+0.020
Neg logit contributions
right-0.067
 length-0.066
 steep-0.066
mer-0.065
 thirteen-0.064
Token said
Token ID531
Feature activation-1.62e-01

Pos logit contributions
 mail+0.057
 grandma+0.054
 granddaughter+0.054
 dentist+0.052
grand+0.052
Neg logit contributions
right-0.143
 thirteen-0.142
mer-0.137
 length-0.137
 steep-0.137
Token,
Token ID11
Feature activation+3.32e-02

Pos logit contributions
 he+0.201
 mail+0.199
 grandma+0.192
 dentist+0.188
 doctor+0.187
Neg logit contributions
right-0.301
mer-0.291
 length-0.291
 thirteen-0.285
 perspective-0.281
Token "
Token ID366
Feature activation+6.07e-02

Pos logit contributions
right+0.068
 thirteen+0.067
mer+0.066
 steep+0.065
 length+0.065
Neg logit contributions
 mail-0.033
 dentist-0.032
grand-0.032
 grandma-0.031
 granddaughter-0.031
TokenNo
Token ID2949
Feature activation+9.47e-02

Pos logit contributions
right+0.083
 thirteen+0.081
mer+0.077
 steep+0.077
 length+0.076
Neg logit contributions
 mail-0.103
grand-0.102
 dentist-0.100
 grandma-0.098
 granddaughter-0.098
Token,
Token ID11
Feature activation+3.00e-02

Pos logit contributions
mer+0.173
right+0.172
 thirteen+0.168
ancy+0.167
illed+0.159
Neg logit contributions
 mail-0.123
 grandma-0.123
 dentist-0.120
 tell-0.117
 he-0.115
Token Tim
Token ID5045
Feature activation+5.79e-02

Pos logit contributions
 thirteen+0.049
right+0.047
 steep+0.047
 furthe+0.045
mer+0.043
Neg logit contributions
grand-0.044
 granddaughter-0.042
 dentist-0.042
 dreaming-0.040
 mail-0.039
Token little
Token ID1310
Feature activation+1.44e-02

Pos logit contributions
 mail+0.011
 dentist+0.011
 grandma+0.011
grand+0.010
 granddaughter+0.010
Neg logit contributions
right-0.020
 thirteen-0.020
mer-0.019
illed-0.018
ancy-0.018
Token girl
Token ID2576
Feature activation-2.69e-02

Pos logit contributions
right+0.033
 thirteen+0.032
ancy+0.031
illed+0.031
 length+0.030
Neg logit contributions
 mail-0.013
 grandma-0.011
 granddaughter-0.011
 dentist-0.011
 doctor-0.011
Token walking
Token ID6155
Feature activation+3.16e-02

Pos logit contributions
grand+0.033
 dentist+0.033
 granddaughter+0.031
 grandma+0.031
 mail+0.031
Neg logit contributions
right-0.047
 thirteen-0.045
mer-0.045
illed-0.044
 length-0.044
Token by
Token ID416
Feature activation+6.85e-02

Pos logit contributions
right+0.053
 thirteen+0.052
mer+0.050
 steep+0.050
 length+0.050
Neg logit contributions
 mail-0.044
grand-0.043
 dentist-0.042
 grandma-0.041
 granddaughter-0.041
Token the
Token ID262
Feature activation-2.90e-03

Pos logit contributions
 thirteen+0.153
 seventeen+0.147
mer+0.143
 steep+0.141
right+0.141
Neg logit contributions
 mail-0.070
 grandma-0.069
 dentist-0.064
 dreaming-0.063
 Brown-0.062
Token farm
Token ID5318
Feature activation-9.57e-02

Pos logit contributions
grand+0.004
 dentist+0.004
 mail+0.004
 granddaughter+0.004
 dreaming+0.004
Neg logit contributions
right-0.005
 thirteen-0.005
 steep-0.005
mer-0.005
 length-0.005
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
 mail+0.154
grand+0.152
 dentist+0.151
 grandma+0.148
 granddaughter+0.146
Neg logit contributions
right-0.137
 thirteen-0.134
mer-0.130
 steep-0.127
 length-0.126
Token She
Token ID1375
Feature activation-2.61e-02

Pos logit contributions
 steep+0.014
right+0.014
y+0.014
 low+0.013
illed+0.013
Neg logit contributions
 dentist-0.018
grand-0.017
 dreaming-0.016
 cowboy-0.016
 airplane-0.016
Token stopped
Token ID5025
Feature activation+5.87e-02

Pos logit contributions
 mail+0.030
 grandma+0.029
 dentist+0.029
grand+0.029
 granddaughter+0.028
Neg logit contributions
right-0.051
 thirteen-0.051
mer-0.048
 length-0.048
 steep-0.048
Token and
Token ID290
Feature activation-5.64e-02

Pos logit contributions
right+0.116
 thirteen+0.114
mer+0.112
 steep+0.110
 length+0.109
Neg logit contributions
 mail-0.064
 dentist-0.061
grand-0.061
 grandma-0.060
 granddaughter-0.059
Token said
Token ID531
Feature activation-1.60e-01

Pos logit contributions
grand+0.082
 dentist+0.079
 mail+0.079
 granddaughter+0.077
 grandma+0.075
Neg logit contributions
right-0.087
 thirteen-0.085
mer-0.084
 steep-0.082
 length-0.080
Token great
Token ID1049
Feature activation+4.44e-02

Pos logit contributions
 mail+0.023
 dentist+0.022
 grandma+0.022
grand+0.022
 dreaming+0.021
Neg logit contributions
right-0.029
 thirteen-0.029
illed-0.028
mer-0.028
 steep-0.027
Token book
Token ID1492
Feature activation-3.10e-02

Pos logit contributions
right+0.080
 thirteen+0.078
mer+0.076
 steep+0.075
 length+0.075
Neg logit contributions
 mail-0.056
grand-0.055
 dentist-0.055
 grandma-0.053
 granddaughter-0.052
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
 dentist+0.042
 mail+0.042
grand+0.042
 granddaughter+0.041
 dreaming+0.040
Neg logit contributions
right-0.050
 thirteen-0.048
mer-0.048
 steep-0.047
 length-0.046
Token Let
Token ID3914
Feature activation+6.25e-02

Pos logit contributions
right+0.002
 thirteen+0.002
mer+0.002
 steep+0.002
ancy+0.002
Neg logit contributions
 dentist-0.001
grand-0.001
 mail-0.001
 dreaming-0.001
 granddaughter-0.001
Token's
Token ID338
Feature activation-3.48e-02

Pos logit contributions
right+0.124
 thirteen+0.120
mer+0.119
 steep+0.116
 length+0.116
Neg logit contributions
 dentist-0.064
grand-0.063
 mail-0.062
 dreaming-0.059
 granddaughter-0.059
Token print
Token ID3601
Feature activation-9.37e-02

Pos logit contributions
grand+0.037
 dentist+0.036
 granddaughter+0.035
 mail+0.034
 dreaming+0.034
Neg logit contributions
right-0.067
 thirteen-0.066
 length-0.065
 steep-0.064
mer-0.063
Token out
Token ID503
Feature activation+3.21e-02

Pos logit contributions
grand+0.122
 dentist+0.115
 granddaughter+0.110
 dreaming+0.109
 mail+0.108
Neg logit contributions
 thirteen-0.164
right-0.160
 length-0.153
 steep-0.152
mer-0.150
Token some
Token ID617
Feature activation+2.62e-02

Pos logit contributions
 thirteen+0.058
right+0.057
 steep+0.055
 length+0.054
mer+0.053
Neg logit contributions
grand-0.039
 dentist-0.038
 mail-0.036
 granddaughter-0.035
 dreaming-0.035
Token pictures
Token ID5986
Feature activation-5.95e-02

Pos logit contributions
right+0.053
 thirteen+0.052
mer+0.051
 steep+0.050
 length+0.049
Neg logit contributions
 mail-0.027
grand-0.026
 dentist-0.026
 grandma-0.026
 granddaughter-0.025
Token from
Token ID422
Feature activation-1.14e-02

Pos logit contributions
grand+0.076
 dentist+0.076
 mail+0.075
 granddaughter+0.073
 grandma+0.071
Neg logit contributions
right-0.102
 thirteen-0.102
mer-0.098
 steep-0.096
 length-0.095
Token it
Token ID340
Feature activation-7.89e-03

Pos logit contributions
 mail+0.012
 grandma+0.012
 dentist+0.012
grand+0.011
 granddaughter+0.011
Neg logit contributions
right-0.023
 thirteen-0.022
mer-0.022
 steep-0.022
illed-0.021
Token and
Token ID290
Feature activation-5.78e-02

Pos logit contributions
grand+0.033
 mail+0.032
 dentist+0.031
 granddaughter+0.031
 grandma+0.030
Neg logit contributions
right-0.040
 thirteen-0.040
mer-0.039
 length-0.038
 steep-0.038
Token fries
Token ID31757
Feature activation-5.26e-02

Pos logit contributions
grand+0.070
 dentist+0.066
 mail+0.065
 dreaming+0.062
 granddaughter+0.061
Neg logit contributions
 thirteen-0.100
right-0.097
 length-0.097
 steep-0.093
ancy-0.092
Token,
Token ID11
Feature activation+3.38e-02

Pos logit contributions
grand+0.097
 dentist+0.093
 granddaughter+0.092
 mail+0.092
 grandma+0.090
Neg logit contributions
 thirteen-0.060
right-0.058
mer-0.058
 length-0.056
ancy-0.054
Token and
Token ID290
Feature activation-5.80e-02

Pos logit contributions
right+0.052
 thirteen+0.052
 steep+0.049
mer+0.048
 length+0.048
Neg logit contributions
grand-0.051
 mail-0.050
 dentist-0.049
 granddaughter-0.047
 grandma-0.047
Token she
Token ID673
Feature activation-2.66e-02

Pos logit contributions
grand+0.069
 mail+0.063
 dentist+0.063
 dreaming+0.059
 granddaughter+0.058
Neg logit contributions
 thirteen-0.104
right-0.100
 length-0.098
 steep-0.096
ancy-0.095
Token said
Token ID531
Feature activation-1.63e-01

Pos logit contributions
 mail+0.036
grand+0.036
 dentist+0.036
 grandma+0.034
 granddaughter+0.034
Neg logit contributions
right-0.044
 thirteen-0.043
mer-0.042
 steep-0.041
 length-0.041
Token,
Token ID11
Feature activation+3.36e-02

Pos logit contributions
 mail+0.198
 grandma+0.192
 he+0.187
 dentist+0.185
grand+0.181
Neg logit contributions
right-0.316
mer-0.305
 length-0.295
 thirteen-0.295
 steep-0.293
Token "
Token ID366
Feature activation+5.91e-02

Pos logit contributions
right+0.068
 thirteen+0.066
mer+0.065
 steep+0.064
 length+0.064
Neg logit contributions
 mail-0.035
grand-0.034
 dentist-0.034
 grandma-0.032
 granddaughter-0.032
TokenI
Token ID40
Feature activation+9.38e-02

Pos logit contributions
right+0.080
 thirteen+0.077
mer+0.075
 steep+0.075
 length+0.073
Neg logit contributions
 mail-0.101
grand-0.100
 dentist-0.099
 grandma-0.097
 granddaughter-0.096
Token want
Token ID765
Feature activation-2.71e-02

Pos logit contributions
right+0.137
 thirteen+0.132
mer+0.130
 steep+0.128
 length+0.125
Neg logit contributions
 dentist-0.146
grand-0.145
 mail-0.145
 grandma-0.141
 granddaughter-0.141
Token that
Token ID326
Feature activation-1.20e-01

Pos logit contributions
grand+0.033
 granddaughter+0.031
 dentist+0.031
 mail+0.030
 officer+0.030
Neg logit contributions
 thirteen-0.048
right-0.046
 length-0.044
ancy-0.044
 steep-0.043
Token They
Token ID1119
Feature activation-7.41e-03

Pos logit contributions
right+0.015
 thirteen+0.015
mer+0.014
 steep+0.014
 length+0.014
Neg logit contributions
grand-0.010
 mail-0.010
 dentist-0.010
 granddaughter-0.009
 grandma-0.009
Token had
Token ID550
Feature activation+3.05e-02

Pos logit contributions
 mail+0.010
 grandma+0.009
 tell+0.009
 dreaming+0.009
grand+0.009
Neg logit contributions
right-0.014
 thirteen-0.013
mer-0.013
ancy-0.013
 perspective-0.013
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
mer+0.073
right+0.072
rated+0.067
 thirteen+0.067
illed+0.067
Neg logit contributions
 mail-0.026
 grandma-0.025
 dentist-0.024
 dreaming-0.023
 granddaughter-0.022
Token big
Token ID1263
Feature activation+1.89e-02

Pos logit contributions
 mail+0.016
 grandma+0.016
 dentist+0.015
 doctor+0.014
 granddaughter+0.014
Neg logit contributions
right-0.029
 thirteen-0.029
illed-0.028
mer-0.028
ancy-0.027
Token rocket
Token ID10701
Feature activation-6.34e-03

Pos logit contributions
right+0.040
mer+0.039
 thirteen+0.038
ancy+0.037
illed+0.037
Neg logit contributions
 mail-0.021
 grandma-0.020
 dentist-0.019
 dreaming-0.018
 granddaughter-0.018
Token that
Token ID326
Feature activation-1.17e-01

Pos logit contributions
grand+0.009
 granddaughter+0.009
 dentist+0.009
 mail+0.008
 thanking+0.008
Neg logit contributions
mer-0.010
 steep-0.010
y-0.010
 thirteen-0.010
 length-0.010
Token they
Token ID484
Feature activation-8.96e-03

Pos logit contributions
 dentist+0.150
grand+0.149
 granddaughter+0.148
 mail+0.147
 thanking+0.141
Neg logit contributions
right-0.198
mer-0.193
 thirteen-0.192
 steep-0.192
 length-0.183
Token pretended
Token ID36766
Feature activation-8.57e-02

Pos logit contributions
 dentist+0.013
grand+0.013
 mail+0.013
 granddaughter+0.013
 grandma+0.013
Neg logit contributions
right-0.014
 thirteen-0.014
mer-0.013
 steep-0.013
 length-0.013
Token to
Token ID284
Feature activation-5.41e-03

Pos logit contributions
 mail+0.086
grand+0.084
 dentist+0.084
 grandma+0.081
 granddaughter+0.079
Neg logit contributions
right-0.175
 thirteen-0.171
mer-0.168
 steep-0.165
 length-0.164
Token fly
Token ID6129
Feature activation+1.03e-02

Pos logit contributions
 mail+0.006
 dentist+0.006
grand+0.006
 grandma+0.005
 granddaughter+0.005
Neg logit contributions
right-0.011
 thirteen-0.010
mer-0.010
 steep-0.010
ancy-0.010
Token to
Token ID284
Feature activation-5.62e-03

Pos logit contributions
right+0.023
illed+0.021
 thirteen+0.021
ancy+0.021
 disgust+0.021
Neg logit contributions
 mail-0.010
 that-0.010
 grandma-0.010
 he-0.010
 Brown-0.009