TOP ACTIVATIONS
MAX = 9.634

who was standing in the doorway . " What do you
kind face standing in the doorway . She shook her head
's mom called from the doorway . " Jack ! Come
She walked over to the doorway and saw a big truck
, until she found a doorway that wasn 't there before
of the bridge from the doorway . He took
his mother standing in the doorway . â
big shadow appeared in the doorway . It was the fierce
mother , standing in the doorway . " What are you
been watching Daisy from the doorway . Her
door opened and in the doorway was a beautiful fairy .
her mom appeared in the doorway and saw what was happening
year old boy in the doorway . The boy was pointing
bright smile appeared in the doorway . She said , "
, but stopped in a doorway . " John
old lady standing in the doorway . She was so surprised
box . It opened a doorway to a magical place and
the other side of the doorway . He was very happy
mom was watching from the doorway and was very happy to
young daughter appeared in the doorway , looking very hungry .

MIN ACTIVATIONS
MIN = -12.613

He made the biggest and rou g hest pile of sand
, but he was much rou g her and the car
of her favorite books got rou g hed up . She
floor and make noises . V room , v room ,
through the woods feeling the rou g her bark on the
floor and make noises . V room , v room ,
. They say , " U gly !" " Pretty !"
and make loud noises . V room , v room ,
makes a loud noise . V room , v room !
loo and pretended to be Esk im os . After a
. It looked like a V . Tom saw
!" " Pretty !" " U gly !" " Pretty !"
play with it too . V room v room !" <|endoftext|>
very fast and loud . V room , v room !"
. They make noises . V room , v room ,
are careful and happy . V room , v room ,
Maybe later , honey . V ases are not for playing
. They make noises . V room , v room !
'm a famous racer . V room , v room !"
the cars have motors . V room , v room ,

INTERVAL 4.072 - 9.634
CONTAINS 6.105%

the car . As they were unp acking ,
the milk and felt very happy . He wanted to play
up their things and started home . The nice country was
urious means to be very happy ." Ben shook
. She was persistent in her efforts , but eventually the

INTERVAL -1.490 - 4.072
CONTAINS 69.573%

a head but no body ?" The boy thought for a
and then scr abb led underneath some nearby leaves . After
the lion and the mouse and the snake happy with my
had so much fun . The water felt cool and nice
and Tom nodded and said they would . They learned their

INTERVAL -7.051 - -1.490
CONTAINS 23.902%

. They think it is a clever trick .
to play a trick on Tom . She took his star
play . It was summer , and the sun was shining
room . One day , they decide to play
ONG ! B ONG ! B ONG ! Sam

INTERVAL -12.613 - -7.051
CONTAINS 0.420%

same way ?" Sam thought about what Mom my
have buttons for you . Do you want some ?"
replied . Tim my asked , " Can
sweet ie , but I don 't have enough money to
Y ay !" J enny 's mom my asked
Token who
Token ID508
Feature activation+3.79e+00

Pos logit contributions
gh+0.065
ners+0.063
ven+0.058
irl+0.058
 tide+0.057
Neg logit contributions
 cousins-0.046
 granddaughter-0.044
 nephew-0.042
 owners-0.041
 From-0.040
Token was
Token ID373
Feature activation-1.55e-01

Pos logit contributions
 cousins+0.108
 granddaughter+0.105
 From+0.102
 nephew+0.101
 owners+0.101
Neg logit contributions
gh-0.109
ners-0.104
 tide-0.093
ven-0.092
irl-0.092
Token standing
Token ID5055
Feature activation-4.09e+00

Pos logit contributions
ners+0.005
gh+0.005
about+0.005
ven+0.005
works+0.005
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 owners-0.004
 nephew-0.003
 glad-0.003
Token in
Token ID287
Feature activation-1.65e+00

Pos logit contributions
gh+0.131
ners+0.125
works+0.116
ven+0.114
about+0.113
Neg logit contributions
 granddaughter-0.101
 cousins-0.100
 owners-0.095
 glad-0.094
 home-0.092
Token the
Token ID262
Feature activation-4.21e-01

Pos logit contributions
gh+0.053
ners+0.051
ven+0.047
irl+0.046
 tide+0.046
Neg logit contributions
 cousins-0.041
 granddaughter-0.040
 nephew-0.038
 owners-0.037
 From-0.036
Token doorway
Token ID32439
Feature activation+9.43e+00

Pos logit contributions
gh+0.014
ners+0.013
 tide+0.012
ven+0.012
irl+0.012
Neg logit contributions
 cousins-0.011
 granddaughter-0.010
 nephew-0.010
 owners-0.009
 From-0.009
Token.
Token ID13
Feature activation-6.51e-02

Pos logit contributions
 cousins+0.252
 granddaughter+0.251
 owners+0.236
 nephew+0.230
 From+0.230
Neg logit contributions
ners-0.279
gh-0.276
irl-0.255
ven-0.252
 tide-0.246
Token "
Token ID366
Feature activation-5.92e-01

Pos logit contributions
gh+0.002
ners+0.002
ven+0.002
 tide+0.002
irl+0.002
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
 From-0.001
TokenWhat
Token ID2061
Feature activation-2.04e+00

Pos logit contributions
gh+0.018
ners+0.017
irl+0.016
ven+0.015
 tide+0.015
Neg logit contributions
 granddaughter-0.016
 cousins-0.016
 nephew-0.015
 owners-0.015
 From-0.015
Token do
Token ID466
Feature activation-2.75e+00

Pos logit contributions
ners+0.070
gh+0.068
 tide+0.062
irl+0.062
about+0.062
Neg logit contributions
 cousins-0.050
 granddaughter-0.046
 nephew-0.044
 owners-0.041
 grandma-0.040
Token you
Token ID345
Feature activation-4.38e+00

Pos logit contributions
ners+0.084
gh+0.080
about+0.078
irl+0.072
 tide+0.071
Neg logit contributions
 cousins-0.073
 granddaughter-0.073
 owners-0.069
 nephew-0.068
 wife-0.065
Token kind
Token ID1611
Feature activation+1.53e+00

Pos logit contributions
 granddaughter+0.056
 cousins+0.054
 owners+0.052
 nephew+0.050
 wife+0.047
Neg logit contributions
ners-0.106
gh-0.100
ven-0.099
irl-0.097
works-0.095
Token face
Token ID1986
Feature activation+4.79e+00

Pos logit contributions
 granddaughter+0.032
 cousins+0.031
 nephew+0.029
 owners+0.029
 wife+0.026
Neg logit contributions
gh-0.057
irl-0.056
ners-0.056
ven-0.054
isting-0.051
Token standing
Token ID5055
Feature activation-4.21e+00

Pos logit contributions
 granddaughter+0.104
 owners+0.102
 From+0.099
 cousins+0.098
 knowing+0.097
Neg logit contributions
ners-0.155
irl-0.149
gh-0.145
ven-0.142
 tide-0.141
Token in
Token ID287
Feature activation-1.69e+00

Pos logit contributions
gh+0.141
works+0.140
ners+0.140
 rect+0.130
ven+0.130
Neg logit contributions
 home-0.092
 granddaughter-0.089
 husband-0.085
 owners-0.085
 owner-0.084
Token the
Token ID262
Feature activation-4.73e-01

Pos logit contributions
gh+0.062
ners+0.061
ven+0.059
irl+0.056
works+0.056
Neg logit contributions
 cousins-0.034
 granddaughter-0.034
 nephew-0.033
 owners-0.032
 pride-0.030
Token doorway
Token ID32439
Feature activation+9.41e+00

Pos logit contributions
gh+0.013
ners+0.013
 tide+0.012
about+0.012
 competition+0.011
Neg logit contributions
 cousins-0.013
 granddaughter-0.012
 nephew-0.012
 From-0.011
 owners-0.011
Token.
Token ID13
Feature activation+6.56e-02

Pos logit contributions
 granddaughter+0.255
 cousins+0.250
 owners+0.241
 nephew+0.233
 From+0.232
Neg logit contributions
ners-0.272
gh-0.264
irl-0.252
ven-0.252
works-0.241
Token She
Token ID1375
Feature activation+3.12e+00

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 nephew+0.001
 owners+0.001
 From+0.001
Neg logit contributions
gh-0.003
ners-0.003
ven-0.002
 tide-0.002
irl-0.002
Token shook
Token ID14682
Feature activation+5.97e-01

Pos logit contributions
 cousins+0.091
 granddaughter+0.087
 nephew+0.085
 owners+0.083
 From+0.081
Neg logit contributions
gh-0.087
ners-0.083
ven-0.076
 tide-0.075
irl-0.075
Token her
Token ID607
Feature activation+4.70e+00

Pos logit contributions
 cousins+0.012
 granddaughter+0.012
 nephew+0.011
 From+0.011
 owners+0.011
Neg logit contributions
gh-0.022
ners-0.021
ven-0.020
irl-0.020
 tide-0.019
Token head
Token ID1182
Feature activation+3.33e+00

Pos logit contributions
 cousins+0.102
 granddaughter+0.095
 nephew+0.091
 From+0.088
 owners+0.087
Neg logit contributions
gh-0.167
ners-0.159
ven-0.149
 tide-0.148
irl-0.148
Token's
Token ID338
Feature activation+2.65e+00

Pos logit contributions
gh+0.142
ners+0.138
works+0.135
irl+0.128
ven+0.127
Neg logit contributions
 cousins-0.083
 granddaughter-0.083
 owners-0.082
 nephew-0.078
 parent-0.071
Token mom
Token ID1995
Feature activation+3.20e+00

Pos logit contributions
 cousins+0.038
 granddaughter+0.036
 owners+0.034
 nephew+0.032
 wife+0.028
Neg logit contributions
gh-0.115
ners-0.113
ven-0.111
irl-0.106
works-0.105
Token called
Token ID1444
Feature activation+6.60e-01

Pos logit contributions
 granddaughter+0.072
 cousins+0.072
 nephew+0.067
 owners+0.064
 niece+0.060
Neg logit contributions
gh-0.116
ners-0.113
irl-0.107
 tide-0.105
works-0.103
Token from
Token ID422
Feature activation+3.44e+00

Pos logit contributions
 cousins+0.012
 granddaughter+0.012
 nephew+0.011
 owners+0.010
 From+0.010
Neg logit contributions
ners-0.025
gh-0.025
ven-0.024
irl-0.024
about-0.023
Token the
Token ID262
Feature activation-6.19e-01

Pos logit contributions
 cousins+0.097
 granddaughter+0.093
 nephew+0.089
 owners+0.088
 From+0.086
Neg logit contributions
gh-0.100
ners-0.095
ven-0.087
 tide-0.086
irl-0.086
Token doorway
Token ID32439
Feature activation+9.36e+00

Pos logit contributions
gh+0.022
ners+0.021
ven+0.020
irl+0.019
 tide+0.019
Neg logit contributions
 cousins-0.014
 granddaughter-0.013
 nephew-0.013
 owners-0.013
 From-0.012
Token.
Token ID13
Feature activation-4.29e-02

Pos logit contributions
 granddaughter+0.243
 cousins+0.240
 owners+0.223
 nephew+0.223
 From+0.216
Neg logit contributions
ners-0.294
gh-0.294
irl-0.288
ven-0.275
 tide-0.271
Token "
Token ID366
Feature activation-5.92e-01

Pos logit contributions
gh+0.001
ners+0.001
 tide+0.001
ven+0.001
about+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
ringe-0.001
TokenJack
Token ID14295
Feature activation-5.04e+00

Pos logit contributions
gh+0.010
ners+0.010
ven+0.008
irl+0.008
 tide+0.008
Neg logit contributions
 cousins-0.024
 granddaughter-0.023
 nephew-0.022
 owners-0.022
 From-0.022
Token!
Token ID0
Feature activation-9.09e-01

Pos logit contributions
ners+0.163
 rect+0.159
about+0.158
gh+0.152
 tide+0.151
Neg logit contributions
 granddaughter-0.121
 cousins-0.115
 glad-0.112
 nephew-0.112
 grandma-0.107
Token Come
Token ID7911
Feature activation-8.69e+00

Pos logit contributions
gh+0.027
ners+0.026
ven+0.024
 tide+0.024
irl+0.023
Neg logit contributions
 cousins-0.024
 granddaughter-0.023
 nephew-0.022
 owners-0.022
 From-0.020
Token She
Token ID1375
Feature activation+2.88e+00

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 From+0.001
 nephew+0.001
 owners+0.001
Neg logit contributions
gh-0.002
ners-0.002
ven-0.002
irl-0.002
 tide-0.002
Token walked
Token ID6807
Feature activation-2.55e+00

Pos logit contributions
 cousins+0.085
 granddaughter+0.082
 nephew+0.081
 owners+0.081
 From+0.080
Neg logit contributions
gh-0.081
ners-0.075
ven-0.069
irl-0.068
 tide-0.067
Token over
Token ID625
Feature activation+1.47e+00

Pos logit contributions
gh+0.073
ners+0.071
ven+0.064
 tide+0.064
irl+0.063
Neg logit contributions
 cousins-0.072
 granddaughter-0.069
 nephew-0.067
 owners-0.065
 From-0.064
Token to
Token ID284
Feature activation-6.36e-01

Pos logit contributions
 cousins+0.024
 granddaughter+0.024
 nephew+0.023
 owners+0.022
 From+0.022
Neg logit contributions
gh-0.060
ners-0.057
irl-0.055
ven-0.055
 tide-0.053
Token the
Token ID262
Feature activation-6.00e-01

Pos logit contributions
gh+0.013
 tide+0.013
irl+0.012
 squ+0.011
 competition+0.011
Neg logit contributions
 cousins-0.022
 granddaughter-0.021
ringe-0.020
 nephew-0.020
 owners-0.020
Token doorway
Token ID32439
Feature activation+9.36e+00

Pos logit contributions
gh+0.019
ners+0.018
 tide+0.018
works+0.017
about+0.017
Neg logit contributions
 cousins-0.014
 granddaughter-0.013
 From-0.013
 nephew-0.013
 owners-0.012
Token and
Token ID290
Feature activation+2.33e+00

Pos logit contributions
 granddaughter+0.195
 cousins+0.195
 From+0.179
 owners+0.179
 nephew+0.176
Neg logit contributions
gh-0.334
ners-0.329
irl-0.310
ven-0.307
 tide-0.298
Token saw
Token ID2497
Feature activation+3.31e+00

Pos logit contributions
 cousins+0.074
 granddaughter+0.069
 nephew+0.067
 owners+0.066
ringe+0.064
Neg logit contributions
gh-0.057
ners-0.054
ven-0.052
 tide-0.051
h-0.050
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
 cousins+0.090
 granddaughter+0.086
 nephew+0.083
 owners+0.081
 From+0.080
Neg logit contributions
gh-0.099
ners-0.094
 tide-0.087
ven-0.086
irl-0.085
Token big
Token ID1263
Feature activation-2.13e+00

Pos logit contributions
gh+0.044
ners+0.042
ven+0.040
irl+0.039
about+0.037
Neg logit contributions
 cousins-0.032
 granddaughter-0.031
 nephew-0.030
 owners-0.029
 grandparents-0.027
Token truck
Token ID7779
Feature activation+9.39e-01

Pos logit contributions
gh+0.072
ners+0.068
ven+0.065
irl+0.065
works+0.061
Neg logit contributions
 cousins-0.052
 granddaughter-0.049
 nephew-0.048
 heart-0.045
 aunt-0.044
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
gh+0.053
ven+0.050
works+0.049
ners+0.048
irl+0.047
Neg logit contributions
 From-0.025
 cousins-0.024
 pride-0.023
 knowing-0.022
 home-0.022
Token until
Token ID1566
Feature activation-2.78e-01

Pos logit contributions
gh+0.059
ners+0.056
ven+0.051
 tide+0.050
irl+0.050
Neg logit contributions
 cousins-0.056
 granddaughter-0.053
 nephew-0.051
 owners-0.051
 From-0.050
Token she
Token ID673
Feature activation+1.98e+00

Pos logit contributions
gh+0.009
ners+0.009
works+0.008
ven+0.008
about+0.008
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 From-0.006
 owners-0.006
 nephew-0.005
Token found
Token ID1043
Feature activation+2.61e+00

Pos logit contributions
 cousins+0.054
 granddaughter+0.052
 owners+0.050
 From+0.049
 nephew+0.049
Neg logit contributions
gh-0.057
ners-0.055
ven-0.050
 tide-0.049
works-0.049
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
 cousins+0.077
 granddaughter+0.072
 nephew+0.071
 owners+0.068
 From+0.067
Neg logit contributions
gh-0.074
ners-0.070
 tide-0.065
ven-0.064
irl-0.063
Token doorway
Token ID32439
Feature activation+9.34e+00

Pos logit contributions
ners+0.039
gh+0.039
ven+0.037
works+0.036
about+0.035
Neg logit contributions
 cousins-0.033
 granddaughter-0.032
 owners-0.030
 nephew-0.030
 heart-0.030
Token that
Token ID326
Feature activation+2.04e+00

Pos logit contributions
 cousins+0.227
 granddaughter+0.225
 owners+0.213
 nephew+0.206
 knowing+0.205
Neg logit contributions
ners-0.287
gh-0.283
ven-0.269
irl-0.269
isting-0.257
Token wasn
Token ID2492
Feature activation-4.62e+00

Pos logit contributions
 cousins+0.055
 granddaughter+0.054
 owners+0.053
 From+0.051
 nephew+0.051
Neg logit contributions
gh-0.059
ners-0.057
ven-0.052
about-0.052
isting-0.050
Token't
Token ID470
Feature activation-3.12e+00

Pos logit contributions
gh+0.161
ners+0.160
about+0.149
ven+0.147
 tide+0.146
Neg logit contributions
 cousins-0.091
 granddaughter-0.085
 From-0.084
 owners-0.082
 nephew-0.080
Token there
Token ID612
Feature activation-2.02e+00

Pos logit contributions
gh+0.096
ners+0.094
ven+0.087
irl+0.086
about+0.086
Neg logit contributions
 cousins-0.081
 granddaughter-0.079
 nephew-0.073
 owners-0.072
 glad-0.071
Token before
Token ID878
Feature activation-8.37e-01

Pos logit contributions
ners+0.069
irl+0.066
ven+0.063
gh+0.062
isting+0.061
Neg logit contributions
 home-0.044
 knowing-0.043
 cousins-0.042
 owners-0.041
 granddaughter-0.040
Token of
Token ID286
Feature activation+2.35e+00

Pos logit contributions
gh+0.037
ners+0.036
irl+0.034
isting+0.032
 seaw+0.032
Neg logit contributions
 owners-0.017
 window-0.016
 owner-0.016
 grandma-0.015
 cousins-0.015
Token the
Token ID262
Feature activation-5.90e-01

Pos logit contributions
 cousins+0.059
 granddaughter+0.056
 nephew+0.054
 owners+0.054
 From+0.052
Neg logit contributions
gh-0.077
ners-0.074
ven-0.069
irl-0.066
about-0.064
Token bridge
Token ID7696
Feature activation-1.82e+00

Pos logit contributions
gh+0.020
ners+0.018
irl+0.017
ven+0.017
h+0.016
Neg logit contributions
 cousins-0.015
 granddaughter-0.014
 owners-0.014
 nephew-0.014
 grandparents-0.013
Token from
Token ID422
Feature activation+3.39e+00

Pos logit contributions
gh+0.059
ners+0.058
irl+0.056
ven+0.052
lex+0.051
Neg logit contributions
 owners-0.038
 cousins-0.037
 From-0.037
 owner-0.036
 home-0.036
Token the
Token ID262
Feature activation-6.04e-01

Pos logit contributions
 cousins+0.081
 granddaughter+0.077
 nephew+0.075
 owners+0.075
 From+0.073
Neg logit contributions
gh-0.113
ners-0.108
ven-0.102
irl-0.095
about-0.094
Token doorway
Token ID32439
Feature activation+9.34e+00

Pos logit contributions
gh+0.020
ners+0.018
irl+0.018
ven+0.017
h+0.016
Neg logit contributions
 cousins-0.016
 owners-0.015
 granddaughter-0.015
 nephew-0.015
 owner-0.014
Token.
Token ID13
Feature activation-3.71e-02

Pos logit contributions
 cousins+0.248
 granddaughter+0.247
 owners+0.242
 From+0.232
 nephew+0.229
Neg logit contributions
ners-0.273
gh-0.266
irl-0.256
ven-0.240
 tide-0.234
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
gh+0.001
ners+0.001
ven+0.001
 tide+0.001
about+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
ringe-0.001
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
 cousins+0.023
 granddaughter+0.021
 nephew+0.020
 owners+0.019
 From+0.018
Neg logit contributions
gh-0.046
ners-0.043
ven-0.041
irl-0.041
 tide-0.040
TokenHe
Token ID1544
Feature activation+1.83e+00

Pos logit contributions
 cousins+0.028
 granddaughter+0.027
 From+0.027
 owners+0.026
 nephew+0.026
Neg logit contributions
gh-0.037
ners-0.036
 tide-0.033
irl-0.033
ven-0.033
Token took
Token ID1718
Feature activation+3.33e-01

Pos logit contributions
 cousins+0.053
 granddaughter+0.051
 owners+0.050
 nephew+0.049
 From+0.049
Neg logit contributions
gh-0.053
ners-0.050
 tide-0.044
ven-0.044
irl-0.044
Token his
Token ID465
Feature activation+3.21e+00

Pos logit contributions
 cousins+0.080
 granddaughter+0.076
 nephew+0.073
 owners+0.072
 From+0.071
Neg logit contributions
gh-0.102
ners-0.098
ven-0.091
irl-0.090
 tide-0.088
Token mother
Token ID2802
Feature activation+5.60e+00

Pos logit contributions
 cousins+0.064
 granddaughter+0.061
 nephew+0.057
 owners+0.057
 From+0.052
Neg logit contributions
gh-0.121
ners-0.118
ven-0.110
irl-0.107
 tide-0.107
Token standing
Token ID5055
Feature activation-4.18e+00

Pos logit contributions
 cousins+0.135
 granddaughter+0.134
 nephew+0.126
 owners+0.123
 niece+0.116
Neg logit contributions
ners-0.192
gh-0.187
obb-0.178
 tide-0.173
irl-0.172
Token in
Token ID287
Feature activation-1.68e+00

Pos logit contributions
gh+0.139
ners+0.130
works+0.127
ven+0.121
 matter+0.121
Neg logit contributions
 cousins-0.096
 granddaughter-0.095
 home-0.092
 owners-0.091
 glad-0.089
Token the
Token ID262
Feature activation-6.20e-01

Pos logit contributions
gh+0.061
ners+0.058
ven+0.057
irl+0.055
works+0.054
Neg logit contributions
 cousins-0.036
 granddaughter-0.034
 nephew-0.033
 owners-0.033
 From-0.031
Token doorway
Token ID32439
Feature activation+9.33e+00

Pos logit contributions
gh+0.020
ners+0.020
 tide+0.018
ven+0.018
irl+0.018
Neg logit contributions
 cousins-0.015
 granddaughter-0.014
 nephew-0.014
 owners-0.013
 From-0.013
Token.
Token ID13
Feature activation+3.93e-02

Pos logit contributions
 granddaughter+0.244
 cousins+0.242
 owners+0.230
 nephew+0.221
 heart+0.219
Neg logit contributions
ners-0.278
gh-0.270
irl-0.256
ven-0.254
works-0.246
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 From+0.001
 nephew+0.001
 owners+0.001
Neg logit contributions
gh-0.002
ners-0.002
ven-0.001
irl-0.001
 tide-0.001
Token
Token ID198
Feature activation+1.26e+00

Pos logit contributions
 From+0.014
 cousins+0.012
 granddaughter+0.012
 owners+0.012
 glad+0.011
Neg logit contributions
gh-0.058
ners-0.057
ven-0.054
 tide-0.053
irl-0.053
Tokenâ
Token ID22940
Feature activation-6.67e+00

Pos logit contributions
 From+0.036
 glad+0.033
 owners+0.032
 granddaughter+0.031
 cousins+0.031
Neg logit contributions
gh-0.036
ners-0.035
works-0.032
ven-0.031
irl-0.031
Token
Token ID26391
Feature activation-5.68e+00

Pos logit contributions
gh+0.169
ners+0.157
irl+0.150
 tide+0.146
 rect+0.140
Neg logit contributions
 From-0.206
 granddaughter-0.195
 nephew-0.186
 owners-0.180
 cousins-0.178
Token big
Token ID1263
Feature activation-2.12e+00

Pos logit contributions
gh+0.043
ners+0.041
ven+0.039
irl+0.039
about+0.038
Neg logit contributions
 cousins-0.029
 granddaughter-0.028
 nephew-0.027
 owners-0.027
 parent-0.026
Token shadow
Token ID9082
Feature activation+1.13e+00

Pos logit contributions
gh+0.079
ners+0.071
ven+0.071
irl+0.069
works+0.066
Neg logit contributions
 cousins-0.047
 nephew-0.045
 granddaughter-0.045
 heart-0.043
 aunt-0.042
Token appeared
Token ID4120
Feature activation+1.09e+00

Pos logit contributions
 cousins+0.028
 nephew+0.026
 owners+0.026
 granddaughter+0.025
 From+0.025
Neg logit contributions
gh-0.035
ners-0.033
irl-0.032
ven-0.032
isting-0.030
Token in
Token ID287
Feature activation-1.71e+00

Pos logit contributions
 From+0.020
 cousins+0.019
 knowing+0.019
 owners+0.018
 home+0.018
Neg logit contributions
gh-0.041
irl-0.040
ven-0.039
ners-0.039
 tide-0.036
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
gh+0.061
ners+0.057
ven+0.055
irl+0.053
about+0.051
Neg logit contributions
 cousins-0.039
 nephew-0.036
 granddaughter-0.036
 owners-0.036
 From-0.035
Token doorway
Token ID32439
Feature activation+9.32e+00

Pos logit contributions
ners+0.017
gh+0.017
 tide+0.016
about+0.015
ven+0.015
Neg logit contributions
 cousins-0.017
 granddaughter-0.015
 nephew-0.015
 From-0.014
 owners-0.014
Token.
Token ID13
Feature activation-8.14e-02

Pos logit contributions
 cousins+0.254
 granddaughter+0.246
 owners+0.240
 From+0.237
 nephew+0.233
Neg logit contributions
gh-0.266
ners-0.265
irl-0.250
ven-0.244
 tide-0.232
Token It
Token ID632
Feature activation-9.32e-02

Pos logit contributions
gh+0.003
ners+0.003
ven+0.003
 tide+0.003
irl+0.003
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
 From-0.001
Token was
Token ID373
Feature activation-2.51e-01

Pos logit contributions
gh+0.003
ners+0.002
ven+0.002
 tide+0.002
irl+0.002
Neg logit contributions
 cousins-0.003
 granddaughter-0.003
 nephew-0.003
 owners-0.003
 From-0.003
Token the
Token ID262
Feature activation-6.26e-01

Pos logit contributions
gh+0.007
ners+0.007
ven+0.007
irl+0.006
 tide+0.006
Neg logit contributions
 cousins-0.007
 granddaughter-0.007
 nephew-0.006
 owners-0.006
 From-0.006
Token fierce
Token ID14800
Feature activation-3.24e+00

Pos logit contributions
gh+0.021
ners+0.020
ven+0.019
irl+0.019
about+0.018
Neg logit contributions
 cousins-0.016
 granddaughter-0.015
 nephew-0.014
 owners-0.014
 niece-0.013
Token mother
Token ID2802
Feature activation+5.56e+00

Pos logit contributions
 cousins+0.056
 granddaughter+0.055
 owners+0.049
 nephew+0.049
 wife+0.045
Neg logit contributions
gh-0.138
ners-0.136
ven-0.129
h-0.126
th-0.125
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
 cousins+0.163
 granddaughter+0.163
 nephew+0.155
 owners+0.149
 From+0.147
Neg logit contributions
ners-0.163
gh-0.157
irl-0.145
 tide-0.143
about-0.143
Token standing
Token ID5055
Feature activation-4.15e+00

Pos logit contributions
gh+0.065
ners+0.063
ven+0.058
irl+0.057
 tide+0.057
Neg logit contributions
 cousins-0.049
 granddaughter-0.047
 nephew-0.045
 owners-0.044
 From-0.043
Token in
Token ID287
Feature activation-1.64e+00

Pos logit contributions
gh+0.117
ners+0.110
ven+0.099
irl+0.095
about+0.095
Neg logit contributions
 cousins-0.122
 granddaughter-0.119
 owners-0.113
 nephew-0.111
 From-0.111
Token the
Token ID262
Feature activation-5.66e-01

Pos logit contributions
gh+0.053
ners+0.051
ven+0.048
irl+0.047
 tide+0.047
Neg logit contributions
 cousins-0.041
 granddaughter-0.039
 nephew-0.037
 owners-0.036
 From-0.036
Token doorway
Token ID32439
Feature activation+9.31e+00

Pos logit contributions
gh+0.017
ners+0.016
ven+0.015
irl+0.015
 tide+0.015
Neg logit contributions
 cousins-0.015
 granddaughter-0.015
 nephew-0.014
 owners-0.014
 From-0.014
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
 cousins+0.250
 granddaughter+0.247
 From+0.231
 owners+0.231
 nephew+0.229
Neg logit contributions
gh-0.273
ners-0.273
irl-0.249
ven-0.247
 tide-0.242
Token "
Token ID366
Feature activation-7.51e-01

Pos logit contributions
gh+0.007
ners+0.007
ven+0.006
 tide+0.006
about+0.006
Neg logit contributions
 cousins-0.005
 granddaughter-0.004
 nephew-0.004
 owners-0.004
ringe-0.004
TokenWhat
Token ID2061
Feature activation-2.12e+00

Pos logit contributions
gh+0.022
ners+0.021
irl+0.019
ven+0.019
 tide+0.019
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 nephew-0.020
 owners-0.019
 From-0.019
Token are
Token ID389
Feature activation+8.42e-01

Pos logit contributions
ners+0.070
gh+0.067
 tide+0.062
h+0.061
about+0.061
Neg logit contributions
 cousins-0.054
 granddaughter-0.050
 nephew-0.048
 owners-0.045
 From-0.043
Token you
Token ID345
Feature activation-4.42e+00

Pos logit contributions
 cousins+0.023
 granddaughter+0.022
 owners+0.021
 nephew+0.021
 From+0.020
Neg logit contributions
ners-0.026
gh-0.025
about-0.023
irl-0.022
ven-0.021
Token been
Token ID587
Feature activation-5.11e-01

Pos logit contributions
gh+0.030
ners+0.028
ven+0.026
 tide+0.026
irl+0.025
Neg logit contributions
 cousins-0.027
 granddaughter-0.025
 nephew-0.025
 owners-0.024
 From-0.024
Token watching
Token ID4964
Feature activation-3.26e+00

Pos logit contributions
gh+0.016
ners+0.015
ven+0.014
irl+0.014
 tide+0.014
Neg logit contributions
 cousins-0.013
 granddaughter-0.013
 nephew-0.012
 owners-0.012
 From-0.012
Token Daisy
Token ID40355
Feature activation+5.79e-01

Pos logit contributions
gh+0.105
ners+0.101
ven+0.095
irl+0.094
 tide+0.093
Neg logit contributions
 cousins-0.082
 granddaughter-0.079
 owners-0.074
 nephew-0.074
 From-0.073
Token from
Token ID422
Feature activation+3.33e+00

Pos logit contributions
 cousins+0.016
 granddaughter+0.016
 nephew+0.015
 owners+0.015
 From+0.015
Neg logit contributions
gh-0.017
ners-0.016
ven-0.015
 tide-0.014
irl-0.014
Token the
Token ID262
Feature activation-2.99e-01

Pos logit contributions
 cousins+0.081
 granddaughter+0.078
 nephew+0.075
 owners+0.074
 From+0.071
Neg logit contributions
gh-0.112
ners-0.109
ven-0.103
about-0.099
irl-0.096
Token doorway
Token ID32439
Feature activation+9.28e+00

Pos logit contributions
gh+0.010
ners+0.009
ven+0.009
irl+0.008
about+0.008
Neg logit contributions
 cousins-0.008
 granddaughter-0.007
 owners-0.007
 nephew-0.007
 From-0.007
Token.
Token ID13
Feature activation-5.36e-02

Pos logit contributions
 cousins+0.243
 granddaughter+0.240
 owners+0.225
 nephew+0.224
 From+0.219
Neg logit contributions
gh-0.287
ners-0.279
irl-0.260
ven-0.256
 tide-0.253
Token
Token ID220
Feature activation-2.66e-01

Pos logit contributions
ners+0.001
+0.001
gh+0.001
 tide+0.001
ven+0.001
Neg logit contributions
 cousins-0.002
 nephew-0.002
ringe-0.002
 granddaughter-0.002
 client-0.002
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
gh+0.009
ners+0.009
ven+0.008
irl+0.008
h+0.008
Neg logit contributions
 cousins-0.006
 nephew-0.005
 granddaughter-0.005
 owners-0.005
ringe-0.005
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
 cousins+0.030
 granddaughter+0.028
 nephew+0.027
 owners+0.025
 client+0.023
Neg logit contributions
gh-0.056
ners-0.054
irl-0.050
ven-0.050
 tide-0.049
TokenHer
Token ID9360
Feature activation+5.99e+00

Pos logit contributions
 cousins+0.043
 granddaughter+0.041
 From+0.041
 owners+0.040
 nephew+0.040
Neg logit contributions
gh-0.042
ners-0.039
ven-0.036
 tide-0.036
irl-0.035
Token door
Token ID3420
Feature activation+4.45e+00

Pos logit contributions
gh+0.015
ners+0.014
 tide+0.013
ven+0.013
irl+0.013
Neg logit contributions
 cousins-0.011
 granddaughter-0.010
 nephew-0.010
 From-0.010
 owners-0.010
Token opened
Token ID4721
Feature activation+1.00e+00

Pos logit contributions
 cousins+0.123
 granddaughter+0.121
 owners+0.118
 nephew+0.116
 From+0.115
Neg logit contributions
gh-0.125
ners-0.121
irl-0.111
 tide-0.109
about-0.108
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
 From+0.021
 granddaughter+0.020
 cousins+0.020
 owners+0.020
 nephew+0.019
Neg logit contributions
gh-0.035
ners-0.034
irl-0.032
ven-0.031
works-0.030
Token in
Token ID287
Feature activation-1.70e+00

Pos logit contributions
 cousins+0.077
 granddaughter+0.074
 nephew+0.071
 owners+0.070
 From+0.068
Neg logit contributions
gh-0.061
ners-0.059
 tide-0.055
ven-0.055
irl-0.052
Token the
Token ID262
Feature activation-4.82e-01

Pos logit contributions
ners+0.055
gh+0.054
irl+0.052
about+0.051
ven+0.051
Neg logit contributions
 From-0.039
 cousins-0.038
 granddaughter-0.038
 owners-0.038
 nephew-0.037
Token doorway
Token ID32439
Feature activation+9.28e+00

Pos logit contributions
gh+0.015
 tide+0.015
ners+0.015
 competition+0.014
about+0.013
Neg logit contributions
 cousins-0.011
 granddaughter-0.010
 nephew-0.010
 From-0.010
 owners-0.009
Token was
Token ID373
Feature activation-1.44e-01

Pos logit contributions
 granddaughter+0.243
 cousins+0.242
 owners+0.235
 From+0.231
 nephew+0.226
Neg logit contributions
ners-0.273
gh-0.262
irl-0.247
 tide-0.238
ven-0.236
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
gh+0.004
ners+0.004
ven+0.003
irl+0.003
 tide+0.003
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 nephew-0.004
 owners-0.004
 From-0.004
Token beautiful
Token ID4950
Feature activation+4.58e-01

Pos logit contributions
gh+0.038
ners+0.036
ven+0.034
irl+0.034
about+0.033
Neg logit contributions
 cousins-0.031
 granddaughter-0.030
 nephew-0.029
 owners-0.028
 grandparents-0.027
Token fairy
Token ID25607
Feature activation+4.73e+00

Pos logit contributions
 granddaughter+0.011
 cousins+0.011
 nephew+0.010
 owners+0.010
 heart+0.009
Neg logit contributions
gh-0.016
ners-0.015
ven-0.014
irl-0.014
about-0.013
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
 cousins+0.139
 granddaughter+0.139
 nephew+0.131
 owners+0.129
 niece+0.123
Neg logit contributions
gh-0.131
ners-0.131
about-0.121
irl-0.120
ven-0.118
Token her
Token ID607
Feature activation+4.51e+00

Pos logit contributions
gh+0.061
ners+0.059
 tide+0.054
ven+0.053
irl+0.053
Neg logit contributions
 cousins-0.064
 granddaughter-0.060
 nephew-0.059
 owners-0.058
 From-0.057
Token mom
Token ID1995
Feature activation+3.23e+00

Pos logit contributions
 From+0.081
 cousins+0.080
 nephew+0.076
 granddaughter+0.075
ringe+0.069
Neg logit contributions
gh-0.167
ners-0.159
 tide-0.154
irl-0.152
ven-0.143
Token appeared
Token ID4120
Feature activation+9.82e-01

Pos logit contributions
 cousins+0.096
 granddaughter+0.092
 nephew+0.089
 owners+0.087
 From+0.085
Neg logit contributions
gh-0.090
ners-0.086
 tide-0.078
ven-0.078
irl-0.078
Token in
Token ID287
Feature activation-1.92e+00

Pos logit contributions
 cousins+0.020
 granddaughter+0.020
 From+0.019
 owners+0.018
 nephew+0.018
Neg logit contributions
gh-0.036
ners-0.035
ven-0.034
 tide-0.033
irl-0.033
Token the
Token ID262
Feature activation-7.27e-01

Pos logit contributions
gh+0.050
ners+0.047
 tide+0.043
irl+0.042
ven+0.041
Neg logit contributions
 cousins-0.059
 granddaughter-0.056
 nephew-0.055
 owners-0.054
 From-0.053
Token doorway
Token ID32439
Feature activation+9.26e+00

Pos logit contributions
ners+0.021
 tide+0.020
gh+0.020
 bubble+0.019
 competition+0.019
Neg logit contributions
 cousins-0.019
 nephew-0.018
 granddaughter-0.017
 From-0.017
 owners-0.017
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
 cousins+0.255
 granddaughter+0.249
 nephew+0.234
 owners+0.234
 From+0.233
Neg logit contributions
gh-0.273
ners-0.264
ven-0.244
irl-0.241
 tide-0.240
Token saw
Token ID2497
Feature activation+3.08e+00

Pos logit contributions
 cousins+0.067
 granddaughter+0.063
 nephew+0.062
 owners+0.061
 client+0.060
Neg logit contributions
gh-0.055
ners-0.053
ven-0.049
h-0.049
 tide-0.048
Token what
Token ID644
Feature activation+1.88e+00

Pos logit contributions
 cousins+0.073
 granddaughter+0.069
 nephew+0.066
 From+0.064
 owners+0.064
Neg logit contributions
gh-0.105
ners-0.100
ven-0.093
irl-0.092
 tide-0.092
Token was
Token ID373
Feature activation-3.57e-01

Pos logit contributions
 cousins+0.067
 granddaughter+0.066
 owners+0.063
 nephew+0.062
 From+0.062
Neg logit contributions
gh-0.039
ners-0.037
ven-0.034
 tide-0.034
about-0.033
Token happening
Token ID5836
Feature activation-2.47e+00

Pos logit contributions
ners+0.008
gh+0.008
about+0.007
ven+0.007
irl+0.007
Neg logit contributions
 cousins-0.012
 granddaughter-0.011
 glad-0.011
 owners-0.011
 nephew-0.011
Token year
Token ID614
Feature activation+6.02e-01

Pos logit contributions
ners+0.018
gh+0.017
ven+0.017
works+0.016
irl+0.016
Neg logit contributions
 cousins-0.017
 granddaughter-0.016
 nephew-0.015
 owners-0.015
 glad-0.014
Token old
Token ID1468
Feature activation+2.60e+00

Pos logit contributions
 granddaughter+0.010
 cousins+0.009
 From+0.008
 owners+0.008
 nephew+0.008
Neg logit contributions
gh-0.026
ners-0.024
irl-0.024
ven-0.023
about-0.022
Token boy
Token ID2933
Feature activation+1.21e+00

Pos logit contributions
 granddaughter+0.052
 cousins+0.051
 nephew+0.049
 owners+0.044
 niece+0.042
Neg logit contributions
gh-0.110
irl-0.107
ners-0.104
isting-0.101
ven-0.098
Token in
Token ID287
Feature activation-1.63e+00

Pos logit contributions
 cousins+0.027
 granddaughter+0.026
 nephew+0.025
 glad+0.024
 ear+0.023
Neg logit contributions
ners-0.045
obb-0.042
works-0.042
 tide-0.041
inner-0.040
Token the
Token ID262
Feature activation-6.67e-01

Pos logit contributions
gh+0.052
ners+0.050
ven+0.047
irl+0.046
 tide+0.045
Neg logit contributions
 cousins-0.042
 granddaughter-0.040
 nephew-0.039
 owners-0.038
 From-0.037
Token doorway
Token ID32439
Feature activation+9.25e+00

Pos logit contributions
gh+0.022
ners+0.021
 tide+0.020
about+0.019
 competition+0.019
Neg logit contributions
 cousins-0.016
 granddaughter-0.015
 nephew-0.014
 From-0.014
 owners-0.014
Token.
Token ID13
Feature activation+7.15e-02

Pos logit contributions
 granddaughter+0.239
 cousins+0.237
 owners+0.225
 nephew+0.220
 From+0.217
Neg logit contributions
ners-0.282
gh-0.262
ven-0.258
irl-0.257
works-0.253
Token The
Token ID383
Feature activation-1.47e-01

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 nephew+0.001
 owners+0.001
ringe+0.001
Neg logit contributions
gh-0.003
ners-0.003
ven-0.002
 tide-0.002
about-0.002
Token boy
Token ID2933
Feature activation+1.22e+00

Pos logit contributions
gh+0.004
ners+0.004
 tide+0.004
ven+0.004
about+0.004
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 nephew-0.004
 owners-0.004
 From-0.004
Token was
Token ID373
Feature activation-2.89e-01

Pos logit contributions
 cousins+0.033
 granddaughter+0.032
 From+0.031
 nephew+0.031
 owners+0.030
Neg logit contributions
gh-0.038
ners-0.037
 tide-0.034
irl-0.033
ven-0.032
Token pointing
Token ID10609
Feature activation+1.33e+00

Pos logit contributions
gh+0.009
ners+0.009
ven+0.008
irl+0.008
 tide+0.008
Neg logit contributions
 cousins-0.007
 granddaughter-0.007
 nephew-0.007
 owners-0.007
 From-0.006
Token bright
Token ID6016
Feature activation-1.30e+00

Pos logit contributions
gh+0.040
ners+0.039
ven+0.036
irl+0.036
about+0.035
Neg logit contributions
 cousins-0.035
 granddaughter-0.034
 nephew-0.033
 owners-0.032
 From-0.030
Token smile
Token ID8212
Feature activation+3.67e+00

Pos logit contributions
gh+0.043
ners+0.041
ven+0.040
irl+0.038
works+0.037
Neg logit contributions
 cousins-0.031
 granddaughter-0.031
 nephew-0.029
 heart-0.028
 owners-0.027
Token appeared
Token ID4120
Feature activation+1.10e+00

Pos logit contributions
 cousins+0.095
 granddaughter+0.093
 nephew+0.089
 owners+0.086
 ear+0.085
Neg logit contributions
gh-0.109
ven-0.108
ners-0.107
irl-0.102
redible-0.098
Token in
Token ID287
Feature activation-1.67e+00

Pos logit contributions
 cousins+0.023
 home+0.023
 owners+0.023
 granddaughter+0.023
 From+0.023
Neg logit contributions
gh-0.039
ners-0.038
irl-0.037
ven-0.037
about-0.034
Token the
Token ID262
Feature activation-6.55e-01

Pos logit contributions
gh+0.052
ners+0.049
ven+0.046
irl+0.045
 tide+0.044
Neg logit contributions
 cousins-0.045
 granddaughter-0.043
 nephew-0.041
 owners-0.040
 From-0.040
Token doorway
Token ID32439
Feature activation+9.25e+00

Pos logit contributions
gh+0.018
ners+0.018
 tide+0.016
ven+0.016
irl+0.016
Neg logit contributions
 cousins-0.019
 granddaughter-0.018
 nephew-0.018
 owners-0.017
 From-0.017
Token.
Token ID13
Feature activation-4.01e-02

Pos logit contributions
 cousins+0.260
 granddaughter+0.259
 owners+0.243
 From+0.239
 nephew+0.236
Neg logit contributions
gh-0.262
ners-0.261
ven-0.245
irl-0.244
 tide-0.229
Token She
Token ID1375
Feature activation+3.00e+00

Pos logit contributions
ners+0.001
gh+0.001
ven+0.001
 tide+0.001
about+0.001
Neg logit contributions
 cousins-0.001
ringe-0.001
 nephew-0.001
 owners-0.001
 granddaughter-0.001
Token said
Token ID531
Feature activation+4.91e+00

Pos logit contributions
 cousins+0.084
 granddaughter+0.081
 nephew+0.078
 owners+0.076
 From+0.074
Neg logit contributions
gh-0.086
ners-0.083
ven-0.076
irl-0.075
 tide-0.075
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
 cousins+0.149
 granddaughter+0.144
 From+0.140
 owners+0.136
 nephew+0.134
Neg logit contributions
ners-0.131
gh-0.125
irl-0.120
about-0.114
 tide-0.112
Token "
Token ID366
Feature activation-5.63e-01

Pos logit contributions
gh+0.071
ners+0.069
ven+0.064
irl+0.063
 tide+0.063
Neg logit contributions
 cousins-0.045
 granddaughter-0.043
 nephew-0.041
 owners-0.040
 From-0.040
Token,
Token ID11
Feature activation-2.14e+00

Pos logit contributions
 cousins+0.032
 granddaughter+0.031
 nephew+0.029
 owners+0.028
 From+0.028
Neg logit contributions
gh-0.045
ners-0.044
irl-0.041
ven-0.040
 tide-0.039
Token but
Token ID475
Feature activation-1.06e+00

Pos logit contributions
gh+0.058
ners+0.055
ven+0.050
 tide+0.050
irl+0.049
Neg logit contributions
 cousins-0.064
 granddaughter-0.062
 nephew-0.060
 owners-0.059
 From-0.058
Token stopped
Token ID5025
Feature activation+1.14e+00

Pos logit contributions
gh+0.025
 tide+0.021
ven+0.021
ners+0.021
irl+0.019
Neg logit contributions
 cousins-0.035
 granddaughter-0.034
 nephew-0.033
ringe-0.032
 owners-0.032
Token in
Token ID287
Feature activation-1.79e+00

Pos logit contributions
 cousins+0.024
 granddaughter+0.024
 nephew+0.023
 owners+0.021
 parent+0.021
Neg logit contributions
gh-0.042
ners-0.042
ven-0.039
about-0.038
irl-0.038
Token a
Token ID257
Feature activation-1.41e+00

Pos logit contributions
gh+0.063
ners+0.061
ven+0.057
 tide+0.056
irl+0.056
Neg logit contributions
 cousins-0.039
 granddaughter-0.037
 nephew-0.036
 owners-0.035
 From-0.034
Token doorway
Token ID32439
Feature activation+9.24e+00

Pos logit contributions
gh+0.045
ners+0.044
ven+0.043
irl+0.042
works+0.041
Neg logit contributions
 cousins-0.033
 granddaughter-0.033
 nephew-0.030
 owners-0.030
 glad-0.029
Token.
Token ID13
Feature activation+5.69e-03

Pos logit contributions
 granddaughter+0.241
 cousins+0.238
 nephew+0.226
 owners+0.225
 From+0.216
Neg logit contributions
ners-0.276
gh-0.266
irl-0.257
ven-0.253
redible-0.240
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
 cousins+0.000
ringe+0.000
 granddaughter+0.000
 nephew+0.000
 owners+0.000
Neg logit contributions
gh-0.000
ven-0.000
 tide-0.000
ners-0.000
about-0.000
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
 cousins+0.028
 granddaughter+0.026
 nephew+0.025
 owners+0.023
ringe+0.022
Neg logit contributions
gh-0.045
ners-0.042
ven-0.040
irl-0.039
 tide-0.038
Token"
Token ID1
Feature activation+1.63e-01

Pos logit contributions
 cousins+0.044
 granddaughter+0.042
 nephew+0.041
 owners+0.040
ringe+0.039
Neg logit contributions
gh-0.028
ners-0.025
ven-0.023
about-0.022
 tide-0.022
TokenJohn
Token ID7554
Feature activation-1.98e+00

Pos logit contributions
 cousins+0.006
 granddaughter+0.005
 From+0.005
 nephew+0.005
 owners+0.005
Neg logit contributions
gh-0.004
ners-0.003
 tide-0.003
ven-0.003
about-0.003
Token old
Token ID1468
Feature activation+2.53e+00

Pos logit contributions
gh+0.090
ners+0.086
 tide+0.085
about+0.081
th+0.078
Neg logit contributions
 cousins-0.087
 granddaughter-0.080
 nephew-0.079
 From-0.078
ringe-0.078
Token lady
Token ID10846
Feature activation+5.67e+00

Pos logit contributions
 cousins+0.050
 granddaughter+0.047
 nephew+0.045
 owners+0.044
 From+0.041
Neg logit contributions
gh-0.096
ners-0.093
ven-0.087
irl-0.086
 tide-0.084
Token standing
Token ID5055
Feature activation-4.23e+00

Pos logit contributions
 cousins+0.143
 granddaughter+0.141
 nephew+0.131
 owners+0.128
 From+0.125
Neg logit contributions
gh-0.184
ners-0.182
ven-0.166
 tide-0.165
irl-0.164
Token in
Token ID287
Feature activation-1.83e+00

Pos logit contributions
gh+0.124
ners+0.117
ven+0.107
irl+0.104
about+0.102
Neg logit contributions
 cousins-0.120
 granddaughter-0.117
 owners-0.111
 nephew-0.109
 From-0.108
Token the
Token ID262
Feature activation-6.64e-01

Pos logit contributions
gh+0.058
ners+0.056
ven+0.051
 tide+0.051
irl+0.051
Neg logit contributions
 cousins-0.047
 granddaughter-0.045
 nephew-0.043
 owners-0.042
 From-0.041
Token doorway
Token ID32439
Feature activation+9.24e+00

Pos logit contributions
gh+0.018
 tide+0.017
ners+0.017
 competition+0.016
 powder+0.016
Neg logit contributions
 cousins-0.018
 nephew-0.017
 granddaughter-0.016
ucks-0.016
 From-0.016
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
 cousins+0.254
 granddaughter+0.254
 owners+0.241
 From+0.235
 nephew+0.230
Neg logit contributions
ners-0.264
gh-0.263
irl-0.243
ven-0.241
 tide-0.232
Token She
Token ID1375
Feature activation+3.02e+00

Pos logit contributions
gh+0.009
ners+0.009
ven+0.008
 tide+0.008
about+0.008
Neg logit contributions
 cousins-0.005
 nephew-0.005
 granddaughter-0.004
 owners-0.004
ringe-0.004
Token was
Token ID373
Feature activation-3.25e-01

Pos logit contributions
 cousins+0.089
 granddaughter+0.085
 nephew+0.082
 owners+0.081
 From+0.079
Neg logit contributions
gh-0.084
ners-0.080
ven-0.073
 tide-0.072
irl-0.072
Token so
Token ID523
Feature activation+8.60e-01

Pos logit contributions
gh+0.010
ners+0.010
ven+0.009
irl+0.009
 tide+0.009
Neg logit contributions
 cousins-0.008
 granddaughter-0.008
 nephew-0.007
 owners-0.007
 From-0.007
Token surprised
Token ID6655
Feature activation+3.88e+00

Pos logit contributions
 cousins+0.019
 granddaughter+0.018
 nephew+0.018
 owners+0.017
 From+0.017
Neg logit contributions
gh-0.030
ners-0.029
ven-0.027
 tide-0.026
irl-0.026
Token box
Token ID3091
Feature activation+1.30e+00

Pos logit contributions
 granddaughter+0.051
 cousins+0.050
 owners+0.048
 nephew+0.047
 grandparents+0.046
Neg logit contributions
ners-0.080
gh-0.078
ven-0.075
irl-0.072
about-0.070
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
 cousins+0.033
 granddaughter+0.033
 owners+0.031
 nephew+0.031
 From+0.030
Neg logit contributions
ners-0.040
gh-0.040
irl-0.037
ven-0.036
 tide-0.035
Token It
Token ID632
Feature activation-1.48e-01

Pos logit contributions
gh+0.007
ners+0.007
 tide+0.006
ven+0.006
about+0.006
Neg logit contributions
 cousins-0.005
 granddaughter-0.004
 nephew-0.004
 owners-0.004
ringe-0.004
Token opened
Token ID4721
Feature activation+9.69e-01

Pos logit contributions
gh+0.004
ners+0.004
 tide+0.003
ven+0.003
irl+0.003
Neg logit contributions
 cousins-0.005
 granddaughter-0.004
 owners-0.004
 nephew-0.004
 From-0.004
Token a
Token ID257
Feature activation-1.49e+00

Pos logit contributions
 cousins+0.024
 granddaughter+0.023
 From+0.023
 owners+0.022
 nephew+0.021
Neg logit contributions
gh-0.031
ners-0.030
irl-0.027
ven-0.027
 tide-0.027
Token doorway
Token ID32439
Feature activation+9.24e+00

Pos logit contributions
gh+0.045
ners+0.044
irl+0.040
ven+0.040
about+0.039
Neg logit contributions
 cousins-0.040
 granddaughter-0.038
 owners-0.037
 nephew-0.037
 From-0.035
Token to
Token ID284
Feature activation-7.47e-01

Pos logit contributions
 cousins+0.247
 granddaughter+0.243
 nephew+0.231
 owners+0.230
 From+0.226
Neg logit contributions
ners-0.273
gh-0.270
irl-0.248
ven-0.240
 tide-0.235
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
gh+0.019
 tide+0.018
ners+0.017
irl+0.017
ven+0.016
Neg logit contributions
 cousins-0.024
 granddaughter-0.022
 nephew-0.022
 owners-0.021
 From-0.020
Token magical
Token ID10883
Feature activation+1.37e+00

Pos logit contributions
gh+0.047
ners+0.046
ven+0.042
irl+0.042
about+0.040
Neg logit contributions
 cousins-0.039
 granddaughter-0.038
 nephew-0.036
 owners-0.036
 grandparents-0.034
Token place
Token ID1295
Feature activation+2.59e+00

Pos logit contributions
 granddaughter+0.029
 cousins+0.028
 heart+0.027
 owners+0.027
 nephew+0.026
Neg logit contributions
gh-0.050
ners-0.048
ven-0.045
irl-0.044
about-0.042
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 cousins+0.068
 granddaughter+0.065
 owners+0.063
 From+0.062
 glad+0.062
Neg logit contributions
ners-0.079
gh-0.078
irl-0.071
ven-0.068
about-0.067
Token the
Token ID262
Feature activation-5.08e-01

Pos logit contributions
gh+0.106
ners+0.104
ven+0.097
irl+0.092
about+0.091
Neg logit contributions
 cousins-0.066
 From-0.062
 owners-0.062
 granddaughter-0.061
 nephew-0.059
Token other
Token ID584
Feature activation+5.11e+00

Pos logit contributions
gh+0.016
ners+0.015
irl+0.014
ven+0.014
about+0.014
Neg logit contributions
 cousins-0.013
 granddaughter-0.013
 owners-0.012
 nephew-0.012
 From-0.012
Token side
Token ID1735
Feature activation+2.45e+00

Pos logit contributions
 cousins+0.145
 granddaughter+0.140
 nephew+0.135
 From+0.129
 owners+0.127
Neg logit contributions
gh-0.141
ners-0.133
 tide-0.125
ven-0.123
about-0.121
Token of
Token ID286
Feature activation+2.44e+00

Pos logit contributions
 cousins+0.040
 granddaughter+0.039
 owners+0.037
 From+0.036
 nephew+0.035
Neg logit contributions
gh-0.097
ners-0.095
irl-0.092
ven-0.089
 tide-0.086
Token the
Token ID262
Feature activation-4.40e-01

Pos logit contributions
 cousins+0.062
 granddaughter+0.059
 owners+0.057
 nephew+0.057
 From+0.055
Neg logit contributions
gh-0.079
ners-0.076
ven-0.071
irl-0.068
about-0.067
Token doorway
Token ID32439
Feature activation+9.23e+00

Pos logit contributions
gh+0.017
ners+0.016
ven+0.015
irl+0.015
about+0.015
Neg logit contributions
 cousins-0.009
 granddaughter-0.008
 owners-0.008
 nephew-0.008
 From-0.007
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 cousins+0.267
 granddaughter+0.261
 owners+0.249
 From+0.246
 nephew+0.245
Neg logit contributions
gh-0.253
ners-0.251
irl-0.226
ven-0.225
 tide-0.219
Token He
Token ID679
Feature activation+8.09e-01

Pos logit contributions
ners+0.007
gh+0.006
ven+0.006
about+0.006
 tide+0.006
Neg logit contributions
 cousins-0.005
 granddaughter-0.004
 nephew-0.004
 owners-0.004
ringe-0.004
Token was
Token ID373
Feature activation-2.24e-01

Pos logit contributions
 cousins+0.025
 granddaughter+0.024
 nephew+0.023
 owners+0.022
 From+0.022
Neg logit contributions
gh-0.022
ners-0.021
ven-0.019
 tide-0.018
irl-0.018
Token very
Token ID845
Feature activation+5.38e-01

Pos logit contributions
gh+0.006
ners+0.006
 tide+0.006
ven+0.006
irl+0.006
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 nephew-0.006
 owners-0.006
ringe-0.005
Token happy
Token ID3772
Feature activation+4.61e+00

Pos logit contributions
 cousins+0.011
 granddaughter+0.011
 nephew+0.011
 owners+0.010
ringe+0.010
Neg logit contributions
gh-0.019
ners-0.018
 tide-0.017
irl-0.017
ven-0.017
Token mom
Token ID1995
Feature activation+4.16e+00

Pos logit contributions
gh+0.030
ners+0.030
 tide+0.028
ven+0.027
irl+0.027
Neg logit contributions
 cousins-0.017
 granddaughter-0.016
 From-0.015
 nephew-0.015
 owners-0.014
Token was
Token ID373
Feature activation+4.29e-01

Pos logit contributions
 cousins+0.111
 granddaughter+0.108
 nephew+0.102
 owners+0.099
 From+0.098
Neg logit contributions
gh-0.129
ners-0.124
ven-0.113
irl-0.112
 tide-0.112
Token watching
Token ID4964
Feature activation-2.68e+00

Pos logit contributions
 cousins+0.010
 granddaughter+0.010
 nephew+0.009
 owners+0.009
 From+0.009
Neg logit contributions
gh-0.014
ners-0.014
ven-0.013
 tide-0.012
irl-0.012
Token from
Token ID422
Feature activation+2.80e+00

Pos logit contributions
gh+0.099
ners+0.097
ven+0.094
irl+0.091
about+0.087
Neg logit contributions
 granddaughter-0.056
 cousins-0.056
 nephew-0.050
 grandma-0.047
 owners-0.047
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
 cousins+0.068
 granddaughter+0.066
 nephew+0.064
 owners+0.060
 grandma+0.059
Neg logit contributions
gh-0.098
ners-0.093
ven-0.089
irl-0.082
about-0.082
Token doorway
Token ID32439
Feature activation+9.23e+00

Pos logit contributions
gh+0.020
ners+0.019
ven+0.018
irl+0.017
h+0.017
Neg logit contributions
 cousins-0.014
 granddaughter-0.014
 nephew-0.013
 owners-0.013
 grandparents-0.012
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 cousins+0.243
 granddaughter+0.240
 nephew+0.221
 owners+0.220
 From+0.214
Neg logit contributions
gh-0.283
ners-0.277
ven-0.258
irl-0.257
 tide-0.246
Token was
Token ID373
Feature activation-6.09e-01

Pos logit contributions
 cousins+0.062
 granddaughter+0.060
 nephew+0.057
 owners+0.056
 From+0.055
Neg logit contributions
gh-0.067
ners-0.064
ven-0.058
irl-0.058
 tide-0.057
Token very
Token ID845
Feature activation+3.82e-01

Pos logit contributions
gh+0.019
ners+0.018
ven+0.016
 tide+0.016
irl+0.016
Neg logit contributions
 cousins-0.016
 granddaughter-0.015
 nephew-0.015
 owners-0.014
 From-0.014
Token happy
Token ID3772
Feature activation+4.61e+00

Pos logit contributions
 cousins+0.008
 granddaughter+0.007
 nephew+0.007
 owners+0.007
 From+0.007
Neg logit contributions
gh-0.014
ners-0.014
ven-0.013
about-0.012
irl-0.012
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
 cousins+0.135
 granddaughter+0.129
 owners+0.124
 nephew+0.122
 From+0.119
Neg logit contributions
gh-0.131
ners-0.129
ven-0.121
irl-0.119
about-0.112
Token young
Token ID1862
Feature activation+1.16e+00

Pos logit contributions
 cousins+0.087
 granddaughter+0.084
 nephew+0.078
 owners+0.076
 grandparents+0.068
Neg logit contributions
gh-0.175
ners-0.169
ven-0.159
irl-0.156
 tide-0.154
Token daughter
Token ID4957
Feature activation+7.15e+00

Pos logit contributions
 granddaughter+0.019
 cousins+0.018
 nephew+0.017
 owners+0.016
 niece+0.015
Neg logit contributions
gh-0.052
ners-0.049
irl-0.048
ven-0.048
works-0.046
Token appeared
Token ID4120
Feature activation+9.91e-01

Pos logit contributions
 granddaughter+0.185
 cousins+0.185
 nephew+0.172
 owners+0.165
 niece+0.159
Neg logit contributions
ners-0.230
gh-0.228
irl-0.221
 tide-0.215
works-0.211
Token in
Token ID287
Feature activation-1.72e+00

Pos logit contributions
 granddaughter+0.023
 cousins+0.022
 owners+0.021
 home+0.021
 nephew+0.020
Neg logit contributions
gh-0.034
ners-0.033
ven-0.032
irl-0.032
 tide-0.031
Token the
Token ID262
Feature activation-7.41e-01

Pos logit contributions
gh+0.058
ners+0.055
ven+0.052
irl+0.051
 tide+0.051
Neg logit contributions
 cousins-0.041
 granddaughter-0.039
 nephew-0.038
 owners-0.037
 From-0.036
Token doorway
Token ID32439
Feature activation+9.22e+00

Pos logit contributions
gh+0.025
ners+0.025
 tide+0.023
about+0.022
ven+0.022
Neg logit contributions
 cousins-0.017
 granddaughter-0.016
 nephew-0.015
 owners-0.015
 From-0.015
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
 granddaughter+0.252
 cousins+0.250
 owners+0.233
 nephew+0.230
 From+0.228
Neg logit contributions
gh-0.271
ners-0.266
irl-0.250
ven-0.249
 tide-0.241
Token looking
Token ID2045
Feature activation-3.48e-01

Pos logit contributions
gh+0.067
ners+0.063
ven+0.060
irl+0.059
 tide+0.058
Neg logit contributions
 cousins-0.046
 granddaughter-0.045
 nephew-0.043
 From-0.042
 owners-0.042
Token very
Token ID845
Feature activation+3.41e-01

Pos logit contributions
gh+0.012
ners+0.011
ven+0.010
irl+0.010
 tide+0.010
Neg logit contributions
 cousins-0.008
 granddaughter-0.008
 nephew-0.008
 owners-0.007
 From-0.007
Token hungry
Token ID14720
Feature activation-1.01e+00

Pos logit contributions
 glad+0.007
 cousins+0.007
 granddaughter+0.007
 From+0.006
 pride+0.006
Neg logit contributions
gh-0.013
ners-0.012
ven-0.011
works-0.011
about-0.011
Token.
Token ID13
Feature activation-5.24e-02

Pos logit contributions
irl+0.042
gh+0.039
ners+0.038
ven+0.038
isting+0.038
Neg logit contributions
 granddaughter-0.018
 owner-0.017
 knowing-0.017
 cousins-0.017
 owners-0.017
Token He
Token ID679
Feature activation+7.33e-01

Pos logit contributions
ners+0.027
gh+0.026
about+0.025
ven+0.025
 tide+0.024
Neg logit contributions
 cousins-0.023
 granddaughter-0.020
 nephew-0.020
 owners-0.019
ringe-0.018
Token made
Token ID925
Feature activation-2.21e+00

Pos logit contributions
 cousins+0.019
 granddaughter+0.019
 From+0.018
 owners+0.018
 nephew+0.018
Neg logit contributions
gh-0.023
ners-0.021
 tide-0.020
works-0.020
ven-0.019
Token the
Token ID262
Feature activation-7.66e-01

Pos logit contributions
gh+0.075
ners+0.073
ven+0.069
works+0.067
 tide+0.065
Neg logit contributions
 cousins-0.050
 granddaughter-0.050
 nephew-0.045
 owners-0.044
 From-0.044
Token biggest
Token ID4094
Feature activation+2.46e+00

Pos logit contributions
gh+0.023
ners+0.023
ven+0.021
irl+0.020
works+0.020
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 owners-0.019
 nephew-0.019
 wife-0.018
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
 cousins+0.067
 granddaughter+0.064
 nephew+0.061
 owners+0.061
 From+0.057
Neg logit contributions
gh-0.076
ners-0.072
ven-0.068
irl-0.066
works-0.064
Token rou
Token ID13805
Feature activation-1.26e+01

Pos logit contributions
 cousins+0.057
 granddaughter+0.057
 nephew+0.054
 owners+0.053
 glad+0.052
Neg logit contributions
gh-0.070
ners-0.062
irl-0.060
isting-0.059
works-0.059
Tokeng
Token ID70
Feature activation-4.74e+00

Pos logit contributions
 tide+0.361
 rep+0.317
 rect+0.296
gh+0.293
 tides+0.291
Neg logit contributions
handed-0.283
 From-0.278
 glad-0.259
ringe-0.254
rolley-0.251
Tokenhest
Token ID3634
Feature activation+2.21e+00

Pos logit contributions
gh+0.127
ners+0.119
ven+0.118
 matter+0.116
 spe+0.115
Neg logit contributions
 From-0.128
 cousins-0.119
 owners-0.116
 granddaughter-0.115
rolley-0.111
Token pile
Token ID14540
Feature activation-3.35e-01

Pos logit contributions
 cousins+0.054
 home+0.053
 friends+0.050
 family+0.050
 owners+0.049
Neg logit contributions
gh-0.071
ven-0.068
ners-0.066
irl-0.063
redible-0.062
Token of
Token ID286
Feature activation+2.17e+00

Pos logit contributions
irl+0.014
gh+0.013
ners+0.013
ven+0.013
isting+0.013
Neg logit contributions
 cousins-0.004
 home-0.004
 knowing-0.004
 owner-0.004
 granddaughter-0.004
Token sand
Token ID6450
Feature activation-2.28e+00

Pos logit contributions
 cousins+0.055
 friends+0.049
 granddaughter+0.049
 pride+0.048
 family+0.047
Neg logit contributions
gh-0.073
ven-0.066
ners-0.065
h-0.064
works-0.060
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
 cousins+0.071
 granddaughter+0.067
 owners+0.067
 knowing+0.064
 heart+0.063
Neg logit contributions
ners-0.079
irl-0.078
gh-0.077
ven-0.074
 tide-0.072
Token but
Token ID475
Feature activation-8.88e-01

Pos logit contributions
gh+0.056
ners+0.053
ven+0.048
irl+0.047
 tide+0.047
Neg logit contributions
 cousins-0.059
 granddaughter-0.057
 nephew-0.055
 owners-0.054
 From-0.053
Token he
Token ID339
Feature activation-1.53e-01

Pos logit contributions
gh+0.025
ners+0.024
ven+0.022
irl+0.022
 tide+0.022
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.023
 From-0.023
Token was
Token ID373
Feature activation-3.55e-01

Pos logit contributions
gh+0.004
ners+0.004
 tide+0.004
ven+0.004
irl+0.004
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 owners-0.004
 nephew-0.004
 From-0.004
Token much
Token ID881
Feature activation+2.35e-01

Pos logit contributions
gh+0.012
ners+0.011
ven+0.010
 tide+0.010
about+0.010
Neg logit contributions
 cousins-0.009
 granddaughter-0.008
 owners-0.008
 nephew-0.008
 From-0.008
Token rou
Token ID13805
Feature activation-1.25e+01

Pos logit contributions
 cousins+0.006
 granddaughter+0.006
 nephew+0.006
 owners+0.006
 glad+0.005
Neg logit contributions
gh-0.008
ners-0.007
ven-0.007
works-0.007
irl-0.007
Tokeng
Token ID70
Feature activation-4.55e+00

Pos logit contributions
 tide+0.365
 seaw+0.305
 rect+0.300
 tides+0.300
gh+0.296
Neg logit contributions
 From-0.323
handed-0.299
 glad-0.275
 owners-0.272
 cousins-0.271
Tokenher
Token ID372
Feature activation-7.03e-01

Pos logit contributions
 tide+0.142
gh+0.140
ners+0.135
ven+0.134
 seaw+0.132
Neg logit contributions
 From-0.107
 cousins-0.096
 owners-0.092
 granddaughter-0.091
rolley-0.087
Token and
Token ID290
Feature activation+2.47e+00

Pos logit contributions
irl+0.022
ven+0.022
ners+0.021
gh+0.021
 tide+0.021
Neg logit contributions
 cousins-0.016
 owners-0.016
 home-0.016
 granddaughter-0.016
 nephew-0.016
Token the
Token ID262
Feature activation-6.00e-01

Pos logit contributions
 cousins+0.064
 owners+0.061
 glad+0.060
 nephew+0.059
 granddaughter+0.059
Neg logit contributions
gh-0.071
ners-0.067
about-0.066
 tide-0.063
ven-0.063
Token car
Token ID1097
Feature activation+2.17e+00

Pos logit contributions
gh+0.022
ners+0.021
ven+0.021
irl+0.020
h+0.019
Neg logit contributions
 cousins-0.013
 granddaughter-0.012
 nephew-0.012
 owners-0.011
 grandparents-0.010
Token of
Token ID286
Feature activation+2.82e+00

Pos logit contributions
 cousins+0.049
 granddaughter+0.046
 owners+0.044
 nephew+0.043
 From+0.040
Neg logit contributions
gh-0.074
ners-0.071
ven-0.070
irl-0.066
h-0.064
Token her
Token ID607
Feature activation+5.23e+00

Pos logit contributions
 cousins+0.065
 granddaughter+0.062
 nephew+0.059
 owners+0.059
 From+0.057
Neg logit contributions
gh-0.097
ners-0.093
ven-0.088
irl-0.085
 tide-0.084
Token favorite
Token ID4004
Feature activation+2.99e+00

Pos logit contributions
 cousins+0.114
 granddaughter+0.109
 nephew+0.102
 owners+0.102
 From+0.093
Neg logit contributions
gh-0.185
ners-0.182
ven-0.172
 tide-0.167
irl-0.166
Token books
Token ID3835
Feature activation-7.63e-01

Pos logit contributions
 cousins+0.072
 granddaughter+0.069
 nephew+0.065
 owners+0.065
 From+0.059
Neg logit contributions
gh-0.098
ners-0.095
ven-0.092
 tide-0.087
irl-0.087
Token got
Token ID1392
Feature activation+1.23e+00

Pos logit contributions
gh+0.022
ners+0.021
 tide+0.019
irl+0.019
ven+0.019
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 owners-0.019
 From-0.019
 nephew-0.019
Token rou
Token ID13805
Feature activation-1.24e+01

Pos logit contributions
 cousins+0.030
 granddaughter+0.028
 owners+0.027
 nephew+0.026
 grandma+0.025
Neg logit contributions
gh-0.039
about-0.038
ners-0.037
irl-0.037
 tide-0.036
Tokeng
Token ID70
Feature activation-4.12e+00

Pos logit contributions
 tide+0.359
gh+0.310
 rect+0.294
 seaw+0.291
 tides+0.285
Neg logit contributions
 From-0.343
 cousins-0.314
 owners-0.308
ringe-0.305
 nephew-0.298
Tokenhed
Token ID704
Feature activation+2.92e+00

Pos logit contributions
gh+0.116
 tide+0.115
ven+0.106
 seaw+0.104
 rect+0.103
Neg logit contributions
 From-0.112
 granddaughter-0.107
 cousins-0.105
 owners-0.104
rolley-0.097
Token up
Token ID510
Feature activation+3.69e+00

Pos logit contributions
 home+0.055
 cousins+0.055
 owners+0.055
 From+0.054
 granddaughter+0.053
Neg logit contributions
gh-0.101
 tide-0.097
ven-0.096
irl-0.094
works-0.090
Token.
Token ID13
Feature activation-3.82e-02

Pos logit contributions
 cousins+0.081
 granddaughter+0.077
 owners+0.076
 home+0.076
 From+0.075
Neg logit contributions
gh-0.120
ven-0.115
ners-0.113
 tide-0.111
irl-0.109
Token She
Token ID1375
Feature activation+2.72e+00

Pos logit contributions
gh+0.001
ners+0.001
ven+0.001
irl+0.001
 tide+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
ringe-0.001
Token floor
Token ID4314
Feature activation-8.74e-01

Pos logit contributions
gh+0.042
ven+0.040
irl+0.039
ners+0.039
h+0.038
Neg logit contributions
 cousins-0.029
 owners-0.026
 granddaughter-0.026
 nephew-0.025
 aunt-0.024
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
gh+0.029
ners+0.028
ven+0.026
irl+0.026
 tide+0.026
Neg logit contributions
 cousins-0.020
 granddaughter-0.019
 owners-0.018
 From-0.017
 nephew-0.017
Token make
Token ID787
Feature activation-1.95e+00

Pos logit contributions
 cousins+0.049
 granddaughter+0.046
 nephew+0.045
 owners+0.045
 From+0.043
Neg logit contributions
gh-0.048
ners-0.045
ven-0.041
about-0.040
irl-0.040
Token noises
Token ID26782
Feature activation-2.51e+00

Pos logit contributions
gh+0.060
ners+0.057
ven+0.056
 tide+0.052
about+0.051
Neg logit contributions
 cousins-0.052
 granddaughter-0.050
 owners-0.046
 nephew-0.045
 grandma-0.045
Token.
Token ID13
Feature activation-4.78e-01

Pos logit contributions
gh+0.084
ven+0.078
ners+0.077
irl+0.075
 tide+0.072
Neg logit contributions
 cousins-0.059
 granddaughter-0.054
 owners-0.053
 knowing-0.052
 nephew-0.051
Token V
Token ID569
Feature activation-1.24e+01

Pos logit contributions
gh+0.020
ners+0.019
ven+0.018
irl+0.018
 tide+0.018
Neg logit contributions
 cousins-0.008
 granddaughter-0.007
 nephew-0.007
 owners-0.006
 From-0.006
Tokenroom
Token ID3823
Feature activation-5.93e+00

Pos logit contributions
gh+0.309
 tide+0.308
 impossible+0.303
 rou+0.300
 som+0.296
Neg logit contributions
 cousins-0.346
 granddaughter-0.314
 From-0.306
 cousin-0.298
ucks-0.297
Token,
Token ID11
Feature activation-2.65e+00

Pos logit contributions
 tide+0.186
 seaw+0.185
gh+0.180
irl+0.178
isting+0.178
Neg logit contributions
 cousins-0.126
 From-0.114
 grandma-0.112
 aunt-0.111
 granddaughter-0.110
Token v
Token ID410
Feature activation-6.82e+00

Pos logit contributions
gh+0.084
ven+0.073
ade+0.073
ners+0.071
irl+0.070
Neg logit contributions
 cousins-0.064
 glad-0.064
 heart-0.061
 ear-0.061
 granddaughter-0.060
Tokenroom
Token ID3823
Feature activation-5.92e+00

Pos logit contributions
gh+0.122
ners+0.114
ven+0.097
irl+0.095
 tide+0.095
Neg logit contributions
 cousins-0.268
 granddaughter-0.260
 nephew-0.254
 owners-0.250
 From-0.247
Token,
Token ID11
Feature activation-2.68e+00

Pos logit contributions
 tide+0.177
gh+0.175
irl+0.166
 seaw+0.165
 tides+0.162
Neg logit contributions
 cousins-0.134
 From-0.129
 granddaughter-0.118
 hugs-0.117
 grandma-0.117
Token through
Token ID832
Feature activation+1.77e+00

Pos logit contributions
gh+0.089
ners+0.085
irl+0.079
ven+0.079
 tide+0.077
Neg logit contributions
 cousins-0.061
 From-0.057
 granddaughter-0.057
 owners-0.057
 nephew-0.055
Token the
Token ID262
Feature activation-5.86e-01

Pos logit contributions
 cousins+0.038
 owners+0.036
 From+0.035
 granddaughter+0.034
 nephew+0.034
Neg logit contributions
gh-0.064
ners-0.060
ven-0.058
irl-0.056
works-0.056
Token woods
Token ID16479
Feature activation+1.72e+00

Pos logit contributions
gh+0.020
ners+0.019
ven+0.017
irl+0.017
 tide+0.017
Neg logit contributions
 cousins-0.014
 granddaughter-0.014
 nephew-0.013
 owners-0.013
 From-0.013
Token feeling
Token ID4203
Feature activation+2.73e+00

Pos logit contributions
 cousins+0.051
 granddaughter+0.049
 nephew+0.048
 owners+0.048
 From+0.047
Neg logit contributions
gh-0.047
ners-0.045
ven-0.040
irl-0.040
 tide-0.039
Token the
Token ID262
Feature activation-5.28e-01

Pos logit contributions
 glad+0.051
 cousins+0.049
 From+0.047
 granddaughter+0.046
 pride+0.045
Neg logit contributions
gh-0.105
ners-0.099
ven-0.096
works-0.094
about-0.091
Token rou
Token ID13805
Feature activation-1.24e+01

Pos logit contributions
gh+0.018
ners+0.016
ven+0.016
irl+0.015
h+0.014
Neg logit contributions
 cousins-0.013
 granddaughter-0.013
 owners-0.013
 nephew-0.012
 pride-0.012
Tokeng
Token ID70
Feature activation-4.33e+00

Pos logit contributions
 tide+0.346
gh+0.311
 rect+0.300
 rep+0.298
teen+0.288
Neg logit contributions
 From-0.305
age-0.278
handed-0.276
 owners-0.267
 glad-0.258
Tokenher
Token ID372
Feature activation-6.28e-01

Pos logit contributions
gh+0.111
ners+0.107
ven+0.106
 tide+0.105
 rect+0.102
Neg logit contributions
 From-0.122
 owners-0.110
 cousins-0.104
 granddaughter-0.102
rolley-0.100
Token bark
Token ID21405
Feature activation-1.53e+00

Pos logit contributions
irl+0.023
gh+0.022
ven+0.022
ners+0.021
isting+0.020
Neg logit contributions
 owners-0.013
 home-0.013
 cousins-0.012
 owner-0.012
 nephew-0.012
Token on
Token ID319
Feature activation-3.01e+00

Pos logit contributions
irl+0.053
ven+0.049
 tide+0.047
ners+0.047
gh+0.047
Neg logit contributions
 owners-0.030
 knowing-0.030
 back-0.028
 From-0.027
 home-0.027
Token the
Token ID262
Feature activation-4.89e-01

Pos logit contributions
ners+0.121
gh+0.120
ven+0.118
irl+0.114
works+0.112
Neg logit contributions
 cousins-0.045
 owners-0.044
 From-0.043
 granddaughter-0.040
 nephew-0.040
Token floor
Token ID4314
Feature activation-3.92e-01

Pos logit contributions
gh+0.008
ners+0.008
ven+0.007
irl+0.007
h+0.007
Neg logit contributions
 cousins-0.006
 granddaughter-0.005
 owners-0.005
 nephew-0.005
 ear-0.005
Token and
Token ID290
Feature activation+2.28e+00

Pos logit contributions
gh+0.012
ners+0.012
irl+0.011
ven+0.011
 tide+0.010
Neg logit contributions
 cousins-0.009
 granddaughter-0.009
 From-0.008
 owners-0.008
 heart-0.008
Token make
Token ID787
Feature activation-1.81e+00

Pos logit contributions
 cousins+0.066
 granddaughter+0.062
 nephew+0.060
 owners+0.059
 From+0.058
Neg logit contributions
gh-0.068
ners-0.062
ven-0.057
irl-0.055
about-0.055
Token noises
Token ID26782
Feature activation-2.39e+00

Pos logit contributions
gh+0.057
ners+0.055
ven+0.053
about+0.049
irl+0.049
Neg logit contributions
 cousins-0.047
 granddaughter-0.046
 nephew-0.042
 owners-0.042
 From-0.040
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
gh+0.076
ven+0.073
ners+0.071
irl+0.069
lex+0.066
Neg logit contributions
 cousins-0.059
 knowing-0.054
 granddaughter-0.054
 owners-0.052
 glad-0.051
Token V
Token ID569
Feature activation-1.23e+01

Pos logit contributions
gh+0.005
ners+0.005
ven+0.005
irl+0.005
 tide+0.005
Neg logit contributions
 cousins-0.002
 granddaughter-0.002
 From-0.002
 nephew-0.002
 owners-0.002
Tokenroom
Token ID3823
Feature activation-5.94e+00

Pos logit contributions
 impossible+0.289
 rou+0.288
gh+0.287
 tide+0.285
ners+0.285
Neg logit contributions
 cousins-0.345
 From-0.319
 granddaughter-0.317
 grandma-0.303
ucks-0.301
Token,
Token ID11
Feature activation-2.47e+00

Pos logit contributions
 tide+0.182
irl+0.176
gh+0.175
 tides+0.174
 seaw+0.174
Neg logit contributions
 cousins-0.124
 From-0.119
 grandma-0.110
 granddaughter-0.109
 aunt-0.109
Token v
Token ID410
Feature activation-6.76e+00

Pos logit contributions
gh+0.077
ven+0.069
ners+0.068
ade+0.067
irl+0.065
Neg logit contributions
 ear-0.060
 cousins-0.060
 glad-0.058
 heart-0.058
 From-0.056
Tokenroom
Token ID3823
Feature activation-5.95e+00

Pos logit contributions
gh+0.122
ners+0.114
ven+0.098
 tide+0.096
irl+0.096
Neg logit contributions
 cousins-0.263
 granddaughter-0.255
 nephew-0.249
 owners-0.246
 From-0.243
Token,
Token ID11
Feature activation-2.56e+00

Pos logit contributions
 tide+0.176
gh+0.170
irl+0.165
 tides+0.162
ners+0.162
Neg logit contributions
 cousins-0.131
 From-0.130
 ear-0.121
 hugs-0.119
 heart-0.118
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
gh+0.112
ners+0.107
 tide+0.103
ven+0.100
irl+0.099
Neg logit contributions
 cousins-0.071
 granddaughter-0.066
 From-0.064
 owners-0.063
 nephew-0.063
Token They
Token ID1119
Feature activation-1.88e+00

Pos logit contributions
gh+0.008
ners+0.007
irl+0.007
ven+0.007
 tide+0.007
Neg logit contributions
 cousins-0.003
 From-0.003
 granddaughter-0.003
 nephew-0.003
 owners-0.003
Token say
Token ID910
Feature activation+2.71e+00

Pos logit contributions
gh+0.060
ners+0.056
 tide+0.052
irl+0.050
ven+0.050
Neg logit contributions
 cousins-0.049
 granddaughter-0.045
 nephew-0.045
 owners-0.044
 From-0.043
Token,
Token ID11
Feature activation-2.21e+00

Pos logit contributions
 cousins+0.070
 granddaughter+0.064
 grandma+0.062
 heart+0.061
 owners+0.061
Neg logit contributions
ners-0.084
irl-0.083
gh-0.082
about-0.080
right-0.079
Token "
Token ID366
Feature activation-1.14e+00

Pos logit contributions
gh+0.094
ners+0.090
irl+0.087
 tide+0.085
ven+0.085
Neg logit contributions
 cousins-0.034
 granddaughter-0.032
 From-0.031
 nephew-0.028
 owners-0.028
TokenU
Token ID52
Feature activation-1.23e+01

Pos logit contributions
gh+0.033
 tide+0.030
ners+0.029
irl+0.028
ven+0.028
Neg logit contributions
 cousins-0.031
 granddaughter-0.031
 owners-0.030
 nephew-0.030
 From-0.028
Tokengly
Token ID10853
Feature activation-3.64e+00

Pos logit contributions
 rou+0.337
 narrower+0.324
 cloudy+0.323
 spe+0.315
sc+0.314
Neg logit contributions
 hugs-0.327
 cousins-0.326
 parents-0.302
 grandma-0.302
 granddaughter-0.289
Token!"
Token ID2474
Feature activation-1.23e-01

Pos logit contributions
gh+0.104
irl+0.100
 tide+0.099
ners+0.095
 rou+0.089
Neg logit contributions
 cousins-0.107
 granddaughter-0.101
 nephew-0.099
 owners-0.091
 grandma-0.091
Token "
Token ID366
Feature activation-9.67e-01

Pos logit contributions
gh+0.005
ners+0.004
ven+0.004
irl+0.004
 tide+0.004
Neg logit contributions
 cousins-0.002
 granddaughter-0.002
 nephew-0.002
 owners-0.002
 From-0.002
TokenPretty
Token ID35700
Feature activation-3.34e+00

Pos logit contributions
gh+0.028
ners+0.026
ven+0.025
 tide+0.025
irl+0.025
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.024
 From-0.024
Token!"
Token ID2474
Feature activation-4.10e-02

Pos logit contributions
gh+0.110
 tide+0.096
ners+0.094
obb+0.094
ulu+0.093
Neg logit contributions
 cousins-0.073
 grandma-0.070
 granddaughter-0.068
 nephew-0.067
 owner-0.066
Token and
Token ID290
Feature activation+1.78e+00

Pos logit contributions
gh+0.021
ners+0.020
irl+0.019
ven+0.019
 tide+0.018
Neg logit contributions
 cousins-0.018
 owners-0.017
 granddaughter-0.017
 owner-0.017
 From-0.016
Token make
Token ID787
Feature activation-2.04e+00

Pos logit contributions
 cousins+0.056
 granddaughter+0.055
 nephew+0.053
 owners+0.051
 From+0.051
Neg logit contributions
gh-0.044
ners-0.042
 tide-0.039
ven-0.038
irl-0.038
Token loud
Token ID7812
Feature activation-5.00e+00

Pos logit contributions
gh+0.068
ners+0.067
ven+0.064
h+0.060
isting+0.059
Neg logit contributions
 cousins-0.049
 granddaughter-0.046
 owners-0.043
 grandma-0.043
 nephew-0.041
Token noises
Token ID26782
Feature activation-2.39e+00

Pos logit contributions
gh+0.193
ners+0.180
ven+0.178
irl+0.174
kin+0.163
Neg logit contributions
 cousins-0.099
 granddaughter-0.085
 owners-0.085
 From-0.084
 nephew-0.081
Token.
Token ID13
Feature activation-4.27e-01

Pos logit contributions
gh+0.076
ven+0.071
ners+0.071
irl+0.069
 tide+0.063
Neg logit contributions
 cousins-0.062
 owners-0.057
 granddaughter-0.057
 knowing-0.054
 nephew-0.054
Token V
Token ID569
Feature activation-1.23e+01

Pos logit contributions
gh+0.016
ners+0.016
ven+0.015
 tide+0.014
irl+0.014
Neg logit contributions
 cousins-0.008
 granddaughter-0.008
 nephew-0.007
 owners-0.007
 From-0.007
Tokenroom
Token ID3823
Feature activation-5.77e+00

Pos logit contributions
ners+0.302
 impossible+0.298
 rou+0.294
 matter+0.290
gh+0.289
Neg logit contributions
 cousins-0.329
ucks-0.300
 From-0.299
 granddaughter-0.288
 owners-0.284
Token,
Token ID11
Feature activation-2.53e+00

Pos logit contributions
gh+0.178
irl+0.177
ners+0.168
 tide+0.167
isting+0.166
Neg logit contributions
 cousins-0.135
 From-0.120
 granddaughter-0.118
 grandma-0.118
 owners-0.117
Token v
Token ID410
Feature activation-6.76e+00

Pos logit contributions
gh+0.075
ven+0.065
ners+0.065
about+0.062
irl+0.062
Neg logit contributions
 glad-0.071
 cousins-0.070
 heart-0.065
 From-0.064
 granddaughter-0.064
Tokenroom
Token ID3823
Feature activation-5.76e+00

Pos logit contributions
gh+0.129
ners+0.120
ven+0.104
irl+0.102
 tide+0.102
Neg logit contributions
 cousins-0.256
 granddaughter-0.248
 nephew-0.241
 owners-0.239
 From-0.236
Token,
Token ID11
Feature activation-2.55e+00

Pos logit contributions
gh+0.172
irl+0.164
ners+0.161
 tide+0.160
ven+0.156
Neg logit contributions
 cousins-0.142
 From-0.131
 granddaughter-0.127
 owners-0.124
 grandma-0.122
Token makes
Token ID1838
Feature activation-2.94e+00

Pos logit contributions
gh+0.022
ners+0.021
ven+0.019
 tide+0.018
irl+0.018
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.024
 From-0.024
Token a
Token ID257
Feature activation-2.04e+00

Pos logit contributions
gh+0.092
ners+0.089
ven+0.083
irl+0.081
 tide+0.080
Neg logit contributions
 cousins-0.076
 granddaughter-0.073
 nephew-0.070
 owners-0.069
 From-0.068
Token loud
Token ID7812
Feature activation-4.85e+00

Pos logit contributions
gh+0.050
ners+0.047
 tide+0.042
irl+0.041
ven+0.041
Neg logit contributions
 cousins-0.066
 granddaughter-0.064
 nephew-0.062
 From-0.061
 owners-0.061
Token noise
Token ID7838
Feature activation-3.94e-01

Pos logit contributions
gh+0.155
ners+0.152
ven+0.149
irl+0.145
 tide+0.141
Neg logit contributions
 cousins-0.108
 granddaughter-0.103
 nephew-0.102
 From-0.100
 owners-0.098
Token.
Token ID13
Feature activation-2.43e-01

Pos logit contributions
gh+0.013
ners+0.013
ven+0.013
irl+0.012
redible+0.012
Neg logit contributions
 cousins-0.009
 nephew-0.008
 knowing-0.008
 owners-0.008
 granddaughter-0.008
Token V
Token ID569
Feature activation-1.23e+01

Pos logit contributions
gh+0.009
ners+0.009
ven+0.008
 tide+0.008
irl+0.008
Neg logit contributions
 cousins-0.005
 granddaughter-0.005
 nephew-0.004
 owners-0.004
 From-0.004
Tokenroom
Token ID3823
Feature activation-5.94e+00

Pos logit contributions
 impossible+0.317
 rou+0.316
 matter+0.309
ners+0.306
 tide+0.304
Neg logit contributions
 cousins-0.303
 From-0.286
ucks-0.282
ringe-0.279
 granddaughter-0.279
Token,
Token ID11
Feature activation-2.49e+00

Pos logit contributions
irl+0.189
 tide+0.189
gh+0.182
 tides+0.181
ners+0.175
Neg logit contributions
 cousins-0.114
 From-0.111
 grandma-0.105
 owner-0.104
 granddaughter-0.101
Token v
Token ID410
Feature activation-6.68e+00

Pos logit contributions
gh+0.074
ven+0.069
ners+0.069
irl+0.067
about+0.067
Neg logit contributions
 glad-0.063
 cousins-0.061
 heart-0.060
 ear-0.059
 From-0.058
Tokenroom
Token ID3823
Feature activation-5.93e+00

Pos logit contributions
gh+0.118
ners+0.110
ven+0.094
irl+0.090
 tide+0.089
Neg logit contributions
 cousins-0.266
 granddaughter-0.260
 nephew-0.255
 owners-0.248
 From-0.245
Token!
Token ID0
Feature activation-1.17e+00

Pos logit contributions
 tide+0.182
gh+0.177
irl+0.176
ners+0.168
 tides+0.168
Neg logit contributions
 cousins-0.126
 From-0.126
 hugs-0.115
 heart-0.114
 ear-0.114
Tokenloo
Token ID29680
Feature activation+7.31e-01

Pos logit contributions
gh+0.132
ners+0.126
ven+0.116
 tide+0.116
irl+0.115
Neg logit contributions
 cousins-0.101
 granddaughter-0.096
 nephew-0.093
 owners-0.090
 From-0.088
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
 cousins+0.019
 owners+0.018
 granddaughter+0.018
 nephew+0.017
 From+0.017
Neg logit contributions
gh-0.022
ners-0.021
irl-0.020
ven-0.020
 tide-0.019
Token pretended
Token ID36766
Feature activation+8.12e-01

Pos logit contributions
 cousins+0.066
 granddaughter+0.063
 nephew+0.061
 owners+0.061
 From+0.059
Neg logit contributions
gh-0.060
ners-0.058
ven-0.052
irl-0.052
 tide-0.051
Token to
Token ID284
Feature activation-9.70e-01

Pos logit contributions
 cousins+0.016
 owners+0.015
 granddaughter+0.014
 From+0.014
 grandma+0.013
Neg logit contributions
ners-0.030
irl-0.028
ven-0.028
gh-0.028
about-0.027
Token be
Token ID307
Feature activation-2.00e+00

Pos logit contributions
ners+0.032
gh+0.032
ven+0.029
irl+0.027
isting+0.027
Neg logit contributions
 cousins-0.021
 owners-0.021
 nephew-0.020
 granddaughter-0.020
 aunt-0.020
Token Esk
Token ID47534
Feature activation-1.22e+01

Pos logit contributions
gh+0.064
ners+0.061
ven+0.056
irl+0.056
 tide+0.055
Neg logit contributions
 cousins-0.051
 granddaughter-0.049
 nephew-0.047
 owners-0.046
 From-0.045
Tokenim
Token ID320
Feature activation-5.88e+00

Pos logit contributions
ners+0.299
gh+0.292
 tide+0.271
ven+0.265
th+0.258
Neg logit contributions
 cousins-0.386
 granddaughter-0.358
 owners-0.354
 From-0.350
 nephew-0.349
Tokenos
Token ID418
Feature activation-3.47e+00

Pos logit contributions
 tide+0.176
ners+0.174
gh+0.170
irl+0.163
 seaw+0.159
Neg logit contributions
 cousins-0.148
 owners-0.137
 nephew-0.135
 granddaughter-0.135
 From-0.129
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
gh+0.111
 tide+0.100
ners+0.099
irl+0.098
ven+0.098
Neg logit contributions
 cousins-0.082
 From-0.074
 granddaughter-0.072
 nephew-0.072
 owners-0.071
Token After
Token ID2293
Feature activation+2.55e+00

Pos logit contributions
gh+0.010
ners+0.010
ven+0.009
irl+0.009
 tide+0.009
Neg logit contributions
 cousins-0.007
 granddaughter-0.006
 nephew-0.006
 owners-0.006
ringe-0.005
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
 cousins+0.057
 granddaughter+0.052
 From+0.051
 owners+0.050
 nephew+0.049
Neg logit contributions
ners-0.089
gh-0.087
ven-0.083
about-0.082
works-0.080
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
 tide+0.192
gh+0.188
irl+0.182
ners+0.181
redible+0.172
Neg logit contributions
 cousins-0.178
 granddaughter-0.174
 nephew-0.162
 owners-0.162
 grandma-0.155
Token It
Token ID632
Feature activation-5.59e-01

Pos logit contributions
ners+0.011
gh+0.011
ven+0.010
irl+0.010
 tide+0.010
Neg logit contributions
 cousins-0.007
 granddaughter-0.006
 nephew-0.006
 owners-0.006
 wife-0.005
Token looked
Token ID3114
Feature activation-7.95e-01

Pos logit contributions
gh+0.016
ners+0.015
 tide+0.014
about+0.014
ven+0.013
Neg logit contributions
 cousins-0.016
 granddaughter-0.015
 owners-0.015
 nephew-0.015
 From-0.014
Token like
Token ID588
Feature activation-5.90e-01

Pos logit contributions
gh+0.024
ners+0.023
ven+0.021
about+0.021
 tide+0.021
Neg logit contributions
 cousins-0.022
 granddaughter-0.021
 From-0.020
 nephew-0.019
 glad-0.019
Token a
Token ID257
Feature activation-1.97e+00

Pos logit contributions
gh+0.021
ners+0.020
about+0.019
ven+0.019
irl+0.019
Neg logit contributions
 cousins-0.014
 granddaughter-0.013
 owners-0.012
 nephew-0.012
 grandma-0.012
Token V
Token ID569
Feature activation-1.22e+01

Pos logit contributions
gh+0.062
ners+0.061
ven+0.058
irl+0.056
about+0.055
Neg logit contributions
 cousins-0.050
 granddaughter-0.048
 nephew-0.046
 owners-0.045
 parent-0.042
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
 rou+0.338
 impossible+0.337
 narrower+0.326
 tide+0.320
 steep+0.319
Neg logit contributions
 granddaughter-0.303
 cousins-0.293
ucks-0.292
 grandma-0.281
ringe-0.280
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
ners+0.011
gh+0.011
ven+0.010
irl+0.009
 tide+0.009
Neg logit contributions
 cousins-0.008
 granddaughter-0.008
 nephew-0.007
 owners-0.007
 wife-0.007
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 cousins+0.024
 granddaughter+0.023
 nephew+0.022
 owners+0.020
 niece+0.020
Neg logit contributions
gh-0.035
ners-0.033
irl-0.032
ven-0.030
h-0.030
TokenTom
Token ID13787
Feature activation-6.28e+00

Pos logit contributions
 cousins+0.029
 granddaughter+0.028
 nephew+0.027
 From+0.027
 owners+0.027
Neg logit contributions
gh-0.028
ners-0.026
 tide-0.024
ven-0.024
irl-0.024
Token saw
Token ID2497
Feature activation+2.72e+00

Pos logit contributions
gh+0.188
ners+0.176
 tide+0.167
irl+0.161
ven+0.159
Neg logit contributions
 cousins-0.175
 granddaughter-0.170
 nephew-0.162
 owners-0.158
 From-0.158
Token!"
Token ID2474
Feature activation-1.23e-01

Pos logit contributions
gh+0.104
irl+0.100
 tide+0.099
ners+0.095
 rou+0.089
Neg logit contributions
 cousins-0.107
 granddaughter-0.101
 nephew-0.099
 owners-0.091
 grandma-0.091
Token "
Token ID366
Feature activation-9.67e-01

Pos logit contributions
gh+0.005
ners+0.004
ven+0.004
irl+0.004
 tide+0.004
Neg logit contributions
 cousins-0.002
 granddaughter-0.002
 nephew-0.002
 owners-0.002
 From-0.002
TokenPretty
Token ID35700
Feature activation-3.34e+00

Pos logit contributions
gh+0.028
ners+0.026
ven+0.025
 tide+0.025
irl+0.025
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.024
 From-0.024
Token!"
Token ID2474
Feature activation-4.10e-02

Pos logit contributions
gh+0.110
 tide+0.096
ners+0.094
obb+0.094
ulu+0.093
Neg logit contributions
 cousins-0.073
 grandma-0.070
 granddaughter-0.068
 nephew-0.067
 owner-0.066
Token "
Token ID366
Feature activation-8.53e-01

Pos logit contributions
gh+0.002
ners+0.001
ven+0.001
irl+0.001
 tide+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
 From-0.001
TokenU
Token ID52
Feature activation-1.22e+01

Pos logit contributions
gh+0.026
ners+0.023
ven+0.023
 tide+0.023
irl+0.022
Neg logit contributions
 cousins-0.022
 granddaughter-0.021
 nephew-0.021
 owners-0.021
 From-0.020
Tokengly
Token ID10853
Feature activation-3.53e+00

Pos logit contributions
 cloudy+0.336
 rou+0.331
 narrower+0.321
 spe+0.314
 rocky+0.312
Neg logit contributions
 cousins-0.312
 hugs-0.311
 parents-0.285
 granddaughter-0.282
ringe-0.279
Token!"
Token ID2474
Feature activation-1.60e-02

Pos logit contributions
gh+0.100
 tide+0.097
irl+0.093
ners+0.092
 rou+0.086
Neg logit contributions
 cousins-0.099
 granddaughter-0.095
 nephew-0.094
 From-0.088
 owners-0.086
Token "
Token ID366
Feature activation-7.70e-01

Pos logit contributions
gh+0.001
ners+0.001
ven+0.001
 tide+0.001
irl+0.001
Neg logit contributions
 cousins-0.000
 granddaughter-0.000
 nephew-0.000
 owners-0.000
 From-0.000
TokenPretty
Token ID35700
Feature activation-3.24e+00

Pos logit contributions
gh+0.024
ners+0.022
ven+0.021
 tide+0.021
irl+0.021
Neg logit contributions
 cousins-0.019
 granddaughter-0.019
 nephew-0.018
 owners-0.018
 From-0.018
Token!"
Token ID2474
Feature activation+2.44e-02

Pos logit contributions
gh+0.107
 tide+0.093
ners+0.093
irl+0.089
obb+0.089
Neg logit contributions
 cousins-0.072
 grandma-0.068
 nephew-0.068
 granddaughter-0.068
 owners-0.066
Token play
Token ID711
Feature activation-3.48e+00

Pos logit contributions
gh+0.186
ners+0.181
 tide+0.166
ven+0.163
about+0.162
Neg logit contributions
 cousins-0.143
 granddaughter-0.135
 nephew-0.129
 owners-0.128
 From-0.128
Token with
Token ID351
Feature activation+1.40e+00

Pos logit contributions
gh+0.103
ners+0.099
ven+0.091
 tide+0.090
irl+0.089
Neg logit contributions
 cousins-0.096
 granddaughter-0.091
 nephew-0.087
 owners-0.086
 From-0.085
Token it
Token ID340
Feature activation-7.52e-01

Pos logit contributions
 cousins+0.047
 granddaughter+0.047
 nephew+0.046
 From+0.044
 owners+0.044
Neg logit contributions
gh-0.032
ners-0.030
irl-0.027
ven-0.026
 tide-0.026
Token too
Token ID1165
Feature activation-1.89e+00

Pos logit contributions
gh+0.023
ners+0.022
 tide+0.021
ven+0.021
irl+0.020
Neg logit contributions
 cousins-0.020
 granddaughter-0.018
 owners-0.018
 From-0.017
 nephew-0.017
Token.
Token ID13
Feature activation-4.66e-01

Pos logit contributions
gh+0.062
ners+0.062
 tide+0.057
irl+0.057
ven+0.057
Neg logit contributions
 cousins-0.042
 From-0.039
 granddaughter-0.039
 owners-0.039
 grandma-0.038
Token V
Token ID569
Feature activation-1.22e+01

Pos logit contributions
gh+0.013
ners+0.013
ven+0.012
 tide+0.011
irl+0.011
Neg logit contributions
 cousins-0.013
 granddaughter-0.012
 owners-0.011
 nephew-0.011
rolley-0.011
Tokenroom
Token ID3823
Feature activation-5.65e+00

Pos logit contributions
 rou+0.363
 impossible+0.362
 narrower+0.347
 daunting+0.346
 curved+0.343
Neg logit contributions
 cousins-0.286
ags-0.264
 grandma-0.258
 heart-0.254
 And-0.252
Token v
Token ID410
Feature activation-6.61e+00

Pos logit contributions
 tide+0.182
irl+0.180
gh+0.173
 tides+0.172
 seaw+0.169
Neg logit contributions
 cousins-0.124
 granddaughter-0.114
 grandma-0.111
 owner-0.111
 hugs-0.109
Tokenroom
Token ID3823
Feature activation-5.64e+00

Pos logit contributions
gh+0.177
ners+0.168
ven+0.153
 tide+0.151
irl+0.151
Neg logit contributions
 cousins-0.201
 granddaughter-0.193
 nephew-0.186
 owners-0.183
 From-0.180
Token!"
Token ID2474
Feature activation-2.65e-02

Pos logit contributions
 tide+0.174
gh+0.169
irl+0.165
 tides+0.159
 seaw+0.156
Neg logit contributions
 cousins-0.133
 granddaughter-0.121
 From-0.118
 owner-0.115
 grandma-0.115
Token<|endoftext|>
Token ID50256
Feature activation+6.96e-01

Pos logit contributions
ners+0.001
gh+0.001
 soda+0.001
 tide+0.001
cl+0.001
Neg logit contributions
 cousins-0.001
 nephew-0.001
ringe-0.001
 owners-0.001
 granddaughter-0.001
Token very
Token ID845
Feature activation-2.46e-01

Pos logit contributions
 cousins+0.015
 granddaughter+0.015
 nephew+0.014
 owners+0.014
 From+0.014
Neg logit contributions
gh-0.015
ners-0.014
ven-0.013
irl-0.013
 tide-0.013
Token fast
Token ID3049
Feature activation-3.25e+00

Pos logit contributions
gh+0.008
ners+0.007
ven+0.007
about+0.007
ede+0.007
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 glad-0.006
 From-0.006
 owners-0.006
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
gh+0.110
ners+0.098
ven+0.097
irl+0.096
 tide+0.095
Neg logit contributions
 cousins-0.085
 granddaughter-0.079
 nephew-0.076
 owners-0.073
 From-0.069
Token loud
Token ID7812
Feature activation-5.03e+00

Pos logit contributions
 cousins+0.055
 granddaughter+0.051
 nephew+0.051
 owners+0.050
 glad+0.048
Neg logit contributions
gh-0.054
ners-0.049
about-0.046
ven-0.044
 tide-0.044
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
gh+0.154
irl+0.137
ners+0.137
ven+0.136
 tide+0.134
Neg logit contributions
 cousins-0.143
 granddaughter-0.133
 nephew-0.129
 From-0.126
 owners-0.124
Token V
Token ID569
Feature activation-1.21e+01

Pos logit contributions
 cousins+0.000
 granddaughter+0.000
 nephew+0.000
 owners+0.000
 From+0.000
Neg logit contributions
gh-0.000
ners-0.000
ven-0.000
irl-0.000
 tide-0.000
Tokenroom
Token ID3823
Feature activation-5.74e+00

Pos logit contributions
 impossible+0.363
 rou+0.360
 inc+0.351
 spe+0.350
 narrower+0.347
Neg logit contributions
 cousins-0.269
ucks-0.246
ringe-0.239
 And-0.235
 grandma-0.232
Token,
Token ID11
Feature activation-2.47e+00

Pos logit contributions
 tide+0.184
gh+0.177
irl+0.175
 tides+0.173
 seaw+0.170
Neg logit contributions
 cousins-0.127
 grandma-0.113
 aunt-0.110
 granddaughter-0.109
 From-0.108
Token v
Token ID410
Feature activation-6.74e+00

Pos logit contributions
gh+0.078
ade+0.073
ners+0.069
irl+0.069
ok+0.068
Neg logit contributions
 cousins-0.061
 granddaughter-0.057
 From-0.055
 heart-0.055
 grandma-0.055
Tokenroom
Token ID3823
Feature activation-5.73e+00

Pos logit contributions
gh+0.132
ners+0.122
ven+0.107
 tide+0.106
irl+0.105
Neg logit contributions
 cousins-0.252
 granddaughter-0.244
 nephew-0.237
 owners-0.235
 From-0.232
Token!"
Token ID2474
Feature activation-8.04e-02

Pos logit contributions
 tide+0.177
gh+0.175
irl+0.166
 tides+0.162
 seaw+0.160
Neg logit contributions
 cousins-0.135
 From-0.119
 grandma-0.118
 granddaughter-0.118
 aunt-0.116
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
gh+0.027
ners+0.025
 tide+0.025
irl+0.025
ven+0.023
Neg logit contributions
 cousins-0.023
 granddaughter-0.021
 owners-0.020
 nephew-0.020
 grandma-0.019
Token They
Token ID1119
Feature activation-1.68e+00

Pos logit contributions
gh+0.014
ners+0.013
ven+0.012
irl+0.012
 tide+0.012
Neg logit contributions
 cousins-0.007
 granddaughter-0.007
 nephew-0.007
 owners-0.006
 From-0.006
Token make
Token ID787
Feature activation-1.69e+00

Pos logit contributions
gh+0.052
ners+0.049
ven+0.045
 tide+0.045
irl+0.045
Neg logit contributions
 cousins-0.044
 granddaughter-0.042
 nephew-0.041
 owners-0.040
 From-0.039
Token noises
Token ID26782
Feature activation-2.12e+00

Pos logit contributions
gh+0.054
ners+0.052
ven+0.049
 tide+0.047
irl+0.046
Neg logit contributions
 cousins-0.043
 granddaughter-0.042
 nephew-0.039
 owners-0.039
 From-0.038
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
gh+0.069
ners+0.065
ven+0.063
irl+0.061
 tide+0.059
Neg logit contributions
 cousins-0.052
 granddaughter-0.048
 owners-0.046
 knowing-0.046
 nephew-0.045
Token V
Token ID569
Feature activation-1.21e+01

Pos logit contributions
gh+0.014
ners+0.013
ven+0.013
irl+0.013
 tide+0.012
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 From-0.006
 owners-0.005
 nephew-0.005
Tokenroom
Token ID3823
Feature activation-5.65e+00

Pos logit contributions
 rou+0.334
 tide+0.327
 impossible+0.325
 inc+0.324
 spe+0.323
Neg logit contributions
 cousins-0.309
 From-0.273
ucks-0.270
 And-0.265
 grandma-0.264
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
gh+0.178
 tide+0.176
irl+0.174
ners+0.166
 seaw+0.165
Neg logit contributions
 cousins-0.126
 From-0.116
 grandma-0.114
 granddaughter-0.113
 hugs-0.110
Token v
Token ID410
Feature activation-6.75e+00

Pos logit contributions
gh+0.074
ners+0.068
irl+0.066
ven+0.065
ade+0.064
Neg logit contributions
 cousins-0.066
 granddaughter-0.062
 glad-0.059
 nephew-0.059
 From-0.057
Tokenroom
Token ID3823
Feature activation-5.66e+00

Pos logit contributions
gh+0.106
ners+0.099
ven+0.081
irl+0.077
about+0.075
Neg logit contributions
 cousins-0.281
 granddaughter-0.276
 nephew-0.271
 owners-0.265
 From-0.261
Token,
Token ID11
Feature activation-2.46e+00

Pos logit contributions
gh+0.179
 tide+0.175
irl+0.171
ners+0.165
ven+0.162
Neg logit contributions
 cousins-0.129
 From-0.120
 granddaughter-0.115
 hugs-0.114
 grandma-0.114
Token are
Token ID389
Feature activation+8.55e-01

Pos logit contributions
gh+0.039
ners+0.038
ven+0.035
irl+0.035
 tide+0.034
Neg logit contributions
 cousins-0.044
 granddaughter-0.043
 nephew-0.041
 owners-0.040
 From-0.039
Token careful
Token ID8161
Feature activation-2.15e+00

Pos logit contributions
rolley+0.021
 granddaughter+0.021
 nephew+0.021
 cousins+0.021
 From+0.020
Neg logit contributions
gh-0.024
ners-0.023
irl-0.023
 tide-0.022
 curved-0.021
Token and
Token ID290
Feature activation+1.79e+00

Pos logit contributions
gh+0.069
ners+0.067
ven+0.062
 tide+0.061
irl+0.061
Neg logit contributions
 cousins-0.054
 granddaughter-0.051
 nephew-0.049
 owners-0.049
 From-0.047
Token happy
Token ID3772
Feature activation+4.37e+00

Pos logit contributions
 cousins+0.050
 granddaughter+0.048
 nephew+0.046
 owners+0.045
 From+0.045
Neg logit contributions
gh-0.052
ners-0.050
ven-0.046
 tide-0.045
irl-0.045
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
 cousins+0.141
 granddaughter+0.133
 owners+0.130
 nephew+0.128
 From+0.127
Neg logit contributions
gh-0.113
ners-0.107
ven-0.100
irl-0.095
 tide-0.093
Token V
Token ID569
Feature activation-1.21e+01

Pos logit contributions
gh+0.007
ners+0.007
 tide+0.006
ven+0.006
irl+0.006
Neg logit contributions
 cousins-0.007
 granddaughter-0.006
 nephew-0.006
ringe-0.006
 reunited-0.006
Tokenroom
Token ID3823
Feature activation-5.60e+00

Pos logit contributions
 tide+0.280
gh+0.277
ners+0.275
 som+0.268
 rou+0.261
Neg logit contributions
 cousins-0.372
 granddaughter-0.342
 From-0.328
 owners-0.328
 nephew-0.327
Token,
Token ID11
Feature activation-2.41e+00

Pos logit contributions
irl+0.180
gh+0.176
 seaw+0.174
 tide+0.174
ners+0.168
Neg logit contributions
 cousins-0.126
 owner-0.116
 grandma-0.116
 aunt-0.114
 granddaughter-0.113
Token v
Token ID410
Feature activation-6.61e+00

Pos logit contributions
gh+0.075
ven+0.068
ners+0.068
irl+0.067
about+0.066
Neg logit contributions
 cousins-0.063
 granddaughter-0.060
 glad-0.059
 nephew-0.057
 grandma-0.057
Tokenroom
Token ID3823
Feature activation-5.61e+00

Pos logit contributions
gh+0.105
ners+0.099
ven+0.081
irl+0.078
about+0.075
Neg logit contributions
 cousins-0.274
 granddaughter-0.269
 nephew-0.263
 owners-0.255
 From-0.254
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
gh+0.175
 tide+0.172
irl+0.171
 seaw+0.164
ners+0.163
Neg logit contributions
 cousins-0.130
 From-0.120
 grandma-0.117
 granddaughter-0.117
 owners-0.117
TokenMaybe
Token ID13300
Feature activation-1.08e+00

Pos logit contributions
gh+0.005
ners+0.005
irl+0.004
ven+0.004
 tide+0.004
Neg logit contributions
 cousins-0.005
 granddaughter-0.004
 nephew-0.004
 owners-0.004
 From-0.004
Token later
Token ID1568
Feature activation-1.04e+00

Pos logit contributions
ners+0.038
gh+0.038
about+0.033
isting+0.033
 hilarious+0.033
Neg logit contributions
 grandma-0.027
 granddaughter-0.026
 cousins-0.025
 owners-0.024
 aunt-0.023
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
ners+0.035
about+0.033
irl+0.032
 tide+0.031
isting+0.031
Neg logit contributions
 grandma-0.027
 cousins-0.026
 granddaughter-0.025
 uncle-0.024
 aunt-0.024
Token honey
Token ID12498
Feature activation-3.31e+00

Pos logit contributions
gh+0.076
ners+0.072
irl+0.065
ven+0.065
 tide+0.064
Neg logit contributions
 cousins-0.066
 granddaughter-0.063
 nephew-0.060
 owners-0.058
 From-0.056
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
ners+0.094
gh+0.090
 rect+0.083
irl+0.082
about+0.082
Neg logit contributions
 granddaughter-0.093
 cousins-0.092
 grandma-0.086
 aunt-0.085
 nephew-0.084
Token V
Token ID569
Feature activation-1.21e+01

Pos logit contributions
gh+0.006
ners+0.006
ven+0.005
 tide+0.005
irl+0.005
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 nephew-0.004
 owners-0.004
 From-0.003
Tokenases
Token ID1386
Feature activation-1.92e+00

Pos logit contributions
 impossible+0.329
 tide+0.328
 hilarious+0.319
 rou+0.318
 seaw+0.303
Neg logit contributions
 cousins-0.309
 granddaughter-0.305
ringe-0.296
 grandma-0.281
ucks-0.279
Token are
Token ID389
Feature activation+5.70e-01

Pos logit contributions
gh+0.059
 tide+0.054
irl+0.053
ven+0.053
ners+0.052
Neg logit contributions
 granddaughter-0.049
 cousins-0.048
 owners-0.046
 grandma-0.045
 nephew-0.045
Token not
Token ID407
Feature activation-2.76e+00

Pos logit contributions
 cousins+0.013
 granddaughter+0.012
 owners+0.011
 nephew+0.011
 grandma+0.011
Neg logit contributions
gh-0.020
ners-0.020
about-0.019
ven-0.018
 tide-0.018
Token for
Token ID329
Feature activation+1.56e+00

Pos logit contributions
gh+0.095
ners+0.095
about+0.088
ven+0.088
irl+0.086
Neg logit contributions
 cousins-0.065
 granddaughter-0.063
 nephew-0.057
 owners-0.055
 grandma-0.054
Token playing
Token ID2712
Feature activation-2.75e+00

Pos logit contributions
 cousins+0.039
 granddaughter+0.036
 grandma+0.034
 owners+0.033
 nephew+0.032
Neg logit contributions
gh-0.052
ners-0.051
about-0.048
ven-0.047
works-0.045
Token.
Token ID13
Feature activation-8.11e-02

Pos logit contributions
gh+0.031
ners+0.030
irl+0.029
ven+0.028
 tide+0.028
Neg logit contributions
 cousins-0.030
 granddaughter-0.027
 owners-0.027
 nephew-0.027
 owner-0.025
Token They
Token ID1119
Feature activation-1.46e+00

Pos logit contributions
gh+0.003
ners+0.003
ven+0.003
irl+0.003
 tide+0.003
Neg logit contributions
 cousins-0.002
 granddaughter-0.001
 nephew-0.001
 From-0.001
 owners-0.001
Token make
Token ID787
Feature activation-1.81e+00

Pos logit contributions
gh+0.043
ners+0.040
ven+0.037
 tide+0.036
irl+0.036
Neg logit contributions
 cousins-0.041
 granddaughter-0.039
 nephew-0.038
 owners-0.037
 From-0.036
Token noises
Token ID26782
Feature activation-2.15e+00

Pos logit contributions
gh+0.057
ners+0.054
ven+0.052
 tide+0.049
about+0.049
Neg logit contributions
 cousins-0.048
 granddaughter-0.046
 nephew-0.043
 owners-0.043
 From-0.040
Token.
Token ID13
Feature activation-8.93e-02

Pos logit contributions
gh+0.075
ners+0.071
ven+0.070
irl+0.070
isting+0.065
Neg logit contributions
 cousins-0.049
 knowing-0.044
 granddaughter-0.043
 owners-0.042
 nephew-0.041
Token V
Token ID569
Feature activation-1.20e+01

Pos logit contributions
gh+0.004
ners+0.003
ven+0.003
irl+0.003
 tide+0.003
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 From-0.001
 nephew-0.001
 owners-0.001
Tokenroom
Token ID3823
Feature activation-5.64e+00

Pos logit contributions
 rou+0.320
 impossible+0.313
 tide+0.310
 curved+0.298
 seaw+0.298
Neg logit contributions
 cousins-0.336
 granddaughter-0.295
ucks-0.287
 grandma-0.282
 From-0.281
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
gh+0.165
 tide+0.163
irl+0.161
ners+0.156
 seaw+0.153
Neg logit contributions
 cousins-0.144
 granddaughter-0.126
 aunt-0.124
 From-0.123
 grandma-0.123
Token v
Token ID410
Feature activation-6.64e+00

Pos logit contributions
gh+0.083
isting+0.073
ven+0.073
irl+0.071
ners+0.071
Neg logit contributions
 cousins-0.059
 granddaughter-0.053
 aunt-0.050
 ear-0.049
 grandma-0.049
Tokenroom
Token ID3823
Feature activation-5.64e+00

Pos logit contributions
gh+0.152
ners+0.149
ven+0.129
irl+0.127
about+0.124
Neg logit contributions
 cousins-0.225
 granddaughter-0.220
 nephew-0.217
 owners-0.209
 From-0.209
Token!
Token ID0
Feature activation-1.14e+00

Pos logit contributions
gh+0.163
 tide+0.160
irl+0.156
ners+0.153
ven+0.149
Neg logit contributions
 cousins-0.148
 granddaughter-0.131
 From-0.129
 owners-0.128
 nephew-0.127
Token'm
Token ID1101
Feature activation-1.75e+00

Pos logit contributions
gh+0.130
ners+0.124
 tide+0.112
ven+0.112
irl+0.109
Neg logit contributions
 cousins-0.134
 granddaughter-0.131
 nephew-0.126
 owners-0.122
 From-0.120
Token a
Token ID257
Feature activation-1.87e+00

Pos logit contributions
gh+0.046
ners+0.044
ven+0.040
 tide+0.040
irl+0.040
Neg logit contributions
 cousins-0.053
 granddaughter-0.051
 nephew-0.050
 owners-0.049
 From-0.048
Token famous
Token ID5863
Feature activation-8.43e-01

Pos logit contributions
gh+0.063
ners+0.061
ven+0.057
irl+0.056
about+0.055
Neg logit contributions
 cousins-0.045
 granddaughter-0.043
 nephew-0.042
 owners-0.039
 glad-0.037
Token racer
Token ID48233
Feature activation-7.58e-01

Pos logit contributions
gh+0.034
ners+0.032
ven+0.031
irl+0.031
h+0.030
Neg logit contributions
 granddaughter-0.016
 cousins-0.016
 nephew-0.016
 owners-0.014
 niece-0.014
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
gh+0.018
ners+0.017
irl+0.015
 tide+0.015
ven+0.015
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.023
 From-0.022
Token V
Token ID569
Feature activation-1.20e+01

Pos logit contributions
gh+0.003
ners+0.003
ven+0.003
 tide+0.003
irl+0.003
Neg logit contributions
 cousins-0.003
 granddaughter-0.003
 nephew-0.003
 owners-0.003
 From-0.003
Tokenroom
Token ID3823
Feature activation-5.70e+00

Pos logit contributions
 impossible+0.320
 rou+0.317
 tide+0.315
 curved+0.306
 hilarious+0.302
Neg logit contributions
 cousins-0.325
 granddaughter-0.293
 grandma-0.282
 cousin-0.278
ringe-0.277
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
 tide+0.163
gh+0.161
irl+0.159
ners+0.153
 seaw+0.150
Neg logit contributions
 cousins-0.152
 granddaughter-0.140
 nephew-0.138
 owners-0.136
 aunt-0.131
Token v
Token ID410
Feature activation-6.66e+00

Pos logit contributions
gh+0.070
ners+0.064
ade+0.061
irl+0.060
ven+0.060
Neg logit contributions
 cousins-0.064
 granddaughter-0.060
 nephew-0.060
 owners-0.058
 From-0.056
Tokenroom
Token ID3823
Feature activation-5.71e+00

Pos logit contributions
gh+0.133
ners+0.124
ven+0.108
 tide+0.107
irl+0.107
Neg logit contributions
 cousins-0.248
 granddaughter-0.240
 nephew-0.233
 owners-0.230
 From-0.227
Token!"
Token ID2474
Feature activation+2.68e-01

Pos logit contributions
gh+0.161
 tide+0.158
irl+0.153
ners+0.150
 seaw+0.142
Neg logit contributions
 cousins-0.156
 granddaughter-0.145
 nephew-0.142
 owners-0.140
 From-0.135
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
gh+0.020
ven+0.019
irl+0.019
ners+0.019
isting+0.018
Neg logit contributions
 cousins-0.012
 owners-0.010
 granddaughter-0.010
 nephew-0.010
 grandma-0.009
Token cars
Token ID5006
Feature activation-1.14e+00

Pos logit contributions
gh+0.045
ven+0.041
ners+0.041
irl+0.040
h+0.039
Neg logit contributions
 cousins-0.027
 owners-0.024
 granddaughter-0.024
 nephew-0.024
 parent-0.021
Token have
Token ID423
Feature activation+8.28e-01

Pos logit contributions
gh+0.036
ners+0.033
 tide+0.031
ven+0.031
irl+0.031
Neg logit contributions
 cousins-0.029
 owners-0.026
 granddaughter-0.026
 nephew-0.026
 From-0.024
Token motors
Token ID24699
Feature activation-3.34e+00

Pos logit contributions
 cousins+0.024
 owners+0.022
 granddaughter+0.022
 nephew+0.021
 From+0.021
Neg logit contributions
gh-0.024
ners-0.022
irl-0.022
 tide-0.021
ven-0.021
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
gh+0.096
ners+0.088
irl+0.084
ven+0.082
 tide+0.081
Neg logit contributions
 cousins-0.097
 granddaughter-0.091
 nephew-0.088
 owners-0.088
 From-0.082
Token V
Token ID569
Feature activation-1.20e+01

Pos logit contributions
gh+0.005
ners+0.005
ven+0.004
 tide+0.004
irl+0.004
Neg logit contributions
 cousins-0.002
 From-0.002
 granddaughter-0.002
 owners-0.002
 nephew-0.002
Tokenroom
Token ID3823
Feature activation-5.57e+00

Pos logit contributions
 rou+0.306
 impossible+0.305
 tide+0.300
 curved+0.298
 inc+0.292
Neg logit contributions
 cousins-0.324
 granddaughter-0.285
ucks-0.282
 cousin-0.280
 From-0.279
Token,
Token ID11
Feature activation-2.35e+00

Pos logit contributions
 tide+0.172
irl+0.168
gh+0.166
 seaw+0.165
right+0.160
Neg logit contributions
 cousins-0.130
 From-0.115
 granddaughter-0.115
 grandma-0.114
 aunt-0.111
Token v
Token ID410
Feature activation-6.55e+00

Pos logit contributions
gh+0.078
ade+0.068
irl+0.068
isting+0.067
ven+0.066
Neg logit contributions
 cousins-0.056
 glad-0.052
 granddaughter-0.050
 From-0.050
 grandma-0.049
Tokenroom
Token ID3823
Feature activation-5.57e+00

Pos logit contributions
gh+0.113
ners+0.107
ven+0.089
irl+0.087
 tide+0.084
Neg logit contributions
 cousins-0.262
 granddaughter-0.256
 nephew-0.250
 owners-0.245
 From-0.242
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
 tide+0.167
gh+0.164
irl+0.159
 seaw+0.154
 tides+0.152
Neg logit contributions
 cousins-0.136
 From-0.125
 granddaughter-0.122
 grandma-0.118
 owners-0.116
Token the
Token ID262
Feature activation-7.24e-01

Pos logit contributions
 cousins+0.081
 granddaughter+0.077
 nephew+0.075
 owners+0.073
 From+0.072
Neg logit contributions
gh-0.101
ners-0.097
ven-0.089
 tide-0.089
irl-0.088
Token car
Token ID1097
Feature activation+2.24e+00

Pos logit contributions
gh+0.024
ners+0.023
ven+0.022
irl+0.021
h+0.020
Neg logit contributions
 cousins-0.018
 granddaughter-0.017
 owners-0.017
 nephew-0.016
 grandparents-0.015
Token.
Token ID13
Feature activation+2.70e-01

Pos logit contributions
 cousins+0.072
 granddaughter+0.071
 nephew+0.068
 owners+0.067
 From+0.065
Neg logit contributions
gh-0.057
ners-0.054
irl-0.050
 tide-0.048
ven-0.048
Token
Token ID198
Feature activation+1.57e+00

Pos logit contributions
 cousins+0.008
 nephew+0.007
 owners+0.007
 privacy+0.007
 granddaughter+0.007
Neg logit contributions
ners-0.007
gh-0.007
about-0.007
ven-0.006
cl-0.006
Token
Token ID198
Feature activation+1.54e+00

Pos logit contributions
 cousins+0.046
 nephew+0.043
 granddaughter+0.041
ringe+0.041
 owners+0.040
Neg logit contributions
gh-0.045
ners-0.041
irl-0.040
h-0.040
 time-0.039
TokenAs
Token ID1722
Feature activation+4.62e+00

Pos logit contributions
 cousins+0.045
 granddaughter+0.045
 From+0.043
 nephew+0.042
 owners+0.042
Neg logit contributions
gh-0.042
ners-0.039
 tide-0.038
ven-0.037
irl-0.036
Token they
Token ID484
Feature activation-9.24e-01

Pos logit contributions
 cousins+0.133
 granddaughter+0.128
 nephew+0.120
 From+0.120
 owners+0.119
Neg logit contributions
gh-0.139
ners-0.132
ven-0.122
about-0.120
works-0.116
Token were
Token ID547
Feature activation+8.00e-01

Pos logit contributions
gh+0.026
ners+0.025
ven+0.022
 tide+0.022
irl+0.022
Neg logit contributions
 cousins-0.027
 granddaughter-0.026
 nephew-0.025
 owners-0.025
 From-0.024
Token unp
Token ID8593
Feature activation-6.77e+00

Pos logit contributions
 cousins+0.022
 granddaughter+0.020
 nephew+0.020
 owners+0.019
 From+0.019
Neg logit contributions
gh-0.024
ners-0.024
ven-0.021
 tide-0.021
irl-0.021
Tokenacking
Token ID5430
Feature activation-1.88e-01

Pos logit contributions
ners+0.169
gh+0.163
ven+0.146
irl+0.144
about+0.143
Neg logit contributions
 cousins-0.222
 granddaughter-0.213
 nephew-0.212
 owners-0.206
 wife-0.201
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
gh+0.005
ners+0.005
ven+0.004
irl+0.004
isting+0.004
Neg logit contributions
 cousins-0.006
 granddaughter-0.005
 From-0.005
 owners-0.005
 nephew-0.005
Token the
Token ID262
Feature activation-6.94e-01

Pos logit contributions
 cousins+0.056
 From+0.056
 granddaughter+0.056
 owners+0.053
 nephew+0.052
Neg logit contributions
gh-0.084
ners-0.080
ven-0.077
irl-0.075
 tide-0.072
Token milk
Token ID7545
Feature activation-3.49e-01

Pos logit contributions
gh+0.025
ners+0.023
ven+0.022
irl+0.021
 tide+0.021
Neg logit contributions
 granddaughter-0.015
 cousins-0.015
 owners-0.014
 nephew-0.014
 From-0.013
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
gh+0.013
ners+0.012
ven+0.012
irl+0.012
 tide+0.011
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 owners-0.006
 From-0.006
 nephew-0.006
Token felt
Token ID2936
Feature activation+3.15e+00

Pos logit contributions
 cousins+0.072
 granddaughter+0.069
 nephew+0.067
 owners+0.066
 From+0.064
Neg logit contributions
gh-0.067
ners-0.064
ven-0.058
 tide-0.058
irl-0.058
Token very
Token ID845
Feature activation+3.92e-01

Pos logit contributions
 cousins+0.082
 granddaughter+0.078
 nephew+0.076
 owners+0.073
 From+0.070
Neg logit contributions
gh-0.099
ners-0.095
 tide-0.087
irl-0.087
ven-0.086
Token happy
Token ID3772
Feature activation+4.69e+00

Pos logit contributions
 cousins+0.007
 granddaughter+0.007
 From+0.007
 glad+0.006
 owners+0.006
Neg logit contributions
gh-0.015
ners-0.014
ven-0.014
about-0.013
 tide-0.013
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
 cousins+0.151
 granddaughter+0.145
 nephew+0.139
 owners+0.138
 From+0.137
Neg logit contributions
gh-0.118
ners-0.114
ven-0.105
irl-0.102
 tide-0.100
Token He
Token ID679
Feature activation+7.04e-01

Pos logit contributions
gh+0.006
ners+0.006
about+0.005
ven+0.005
 tide+0.005
Neg logit contributions
 cousins-0.003
 granddaughter-0.003
 nephew-0.003
 owners-0.003
ringe-0.003
Token wanted
Token ID2227
Feature activation-1.49e-01

Pos logit contributions
 cousins+0.019
 granddaughter+0.018
 nephew+0.017
 owners+0.017
 From+0.016
Neg logit contributions
gh-0.022
ners-0.021
ven-0.019
irl-0.019
 tide-0.019
Token to
Token ID284
Feature activation-8.75e-01

Pos logit contributions
gh+0.005
ners+0.005
ven+0.004
irl+0.004
 tide+0.004
Neg logit contributions
 cousins-0.004
 granddaughter-0.003
 nephew-0.003
 owners-0.003
 From-0.003
Token play
Token ID711
Feature activation-3.14e+00

Pos logit contributions
gh+0.027
ners+0.026
ven+0.024
about+0.023
 tide+0.023
Neg logit contributions
 cousins-0.022
 granddaughter-0.022
 owners-0.021
 From-0.021
 nephew-0.020
Token up
Token ID510
Feature activation+3.55e+00

Pos logit contributions
 cousins+0.067
 granddaughter+0.063
 From+0.060
 owners+0.059
 nephew+0.059
Neg logit contributions
gh-0.094
ners-0.090
ven-0.085
h-0.081
 tide-0.080
Token their
Token ID511
Feature activation+3.13e+00

Pos logit contributions
 cousins+0.077
 granddaughter+0.071
 From+0.071
 owners+0.067
 grandma+0.065
Neg logit contributions
gh-0.133
ners-0.126
ven-0.119
h-0.115
isting-0.115
Token things
Token ID1243
Feature activation-8.02e-01

Pos logit contributions
 cousins+0.073
 granddaughter+0.067
 nephew+0.064
 owners+0.063
 From+0.059
Neg logit contributions
gh-0.110
ners-0.106
ven-0.101
about-0.097
irl-0.097
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
gh+0.029
ners+0.028
irl+0.026
ven+0.026
 tide+0.025
Neg logit contributions
 cousins-0.017
 granddaughter-0.015
 From-0.015
 owners-0.015
 glad-0.014
Token started
Token ID2067
Feature activation+2.75e+00

Pos logit contributions
 cousins+0.064
 granddaughter+0.062
 nephew+0.060
 owners+0.059
 From+0.058
Neg logit contributions
gh-0.057
ners-0.054
ven-0.049
 tide-0.049
irl-0.049
Token home
Token ID1363
Feature activation+6.20e+00

Pos logit contributions
 cousins+0.059
 granddaughter+0.055
 nephew+0.051
 owners+0.051
 From+0.049
Neg logit contributions
ners-0.101
gh-0.100
ven-0.093
about-0.091
works-0.091
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
 cousins+0.206
 granddaughter+0.198
 nephew+0.190
 owners+0.189
 From+0.184
Neg logit contributions
gh-0.152
ners-0.148
ven-0.131
irl-0.129
 tide-0.128
Token The
Token ID383
Feature activation-2.37e-01

Pos logit contributions
ners+0.001
gh+0.001
 tide+0.001
ven+0.001
about+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
ringe-0.001
Token nice
Token ID3621
Feature activation-4.49e-02

Pos logit contributions
gh+0.008
ners+0.008
 tide+0.007
ven+0.007
irl+0.007
Neg logit contributions
 cousins-0.005
 granddaughter-0.005
 nephew-0.005
 From-0.005
 owners-0.005
Token country
Token ID1499
Feature activation+4.78e+00

Pos logit contributions
 tide+0.001
gh+0.001
ners+0.001
 competition+0.001
irl+0.001
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 From-0.001
 nephew-0.001
ringe-0.001
Token was
Token ID373
Feature activation-3.60e-01

Pos logit contributions
 cousins+0.140
 granddaughter+0.135
 owners+0.132
 nephew+0.128
 From+0.127
Neg logit contributions
gh-0.134
ners-0.127
irl-0.115
ven-0.115
 tide-0.111
Tokenurious
Token ID16421
Feature activation+5.98e-01

Pos logit contributions
 rect+0.279
 rou+0.273
 tide+0.265
 seaw+0.248
 hilarious+0.248
Neg logit contributions
 granddaughter-0.255
ugg-0.237
 From-0.230
 ear-0.230
child-0.228
Token means
Token ID1724
Feature activation+2.31e+00

Pos logit contributions
 heart+0.013
 granddaughter+0.013
 aunt+0.013
 owners+0.013
 cousins+0.013
Neg logit contributions
irl-0.019
ners-0.019
gh-0.018
ven-0.018
 rect-0.017
Token to
Token ID284
Feature activation-7.82e-01

Pos logit contributions
 cousins+0.052
 granddaughter+0.050
 owners+0.049
 parents+0.048
 grandma+0.047
Neg logit contributions
ners-0.085
about-0.078
ade-0.074
gh-0.074
works-0.072
Token be
Token ID307
Feature activation-1.87e+00

Pos logit contributions
ners+0.025
gh+0.024
about+0.021
ven+0.021
 rect+0.021
Neg logit contributions
 cousins-0.019
 owners-0.019
 granddaughter-0.019
 nephew-0.018
 wife-0.018
Token very
Token ID845
Feature activation+2.00e-01

Pos logit contributions
gh+0.059
ners+0.059
 tide+0.054
about+0.054
ven+0.053
Neg logit contributions
 cousins-0.048
 granddaughter-0.046
 nephew-0.044
 owners-0.044
 glad-0.041
Token happy
Token ID3772
Feature activation+4.39e+00

Pos logit contributions
 cousins+0.005
 granddaughter+0.005
 glad+0.005
 wife+0.005
 owners+0.005
Neg logit contributions
ners-0.007
gh-0.006
about-0.006
works-0.006
 rect-0.006
Token."
Token ID526
Feature activation+9.07e-01

Pos logit contributions
 cousins+0.117
 granddaughter+0.109
 owners+0.106
 nephew+0.102
 grandma+0.098
Neg logit contributions
gh-0.140
ners-0.134
ven-0.126
irl-0.125
 tide-0.122
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
 cousins+0.017
 granddaughter+0.015
 nephew+0.015
 owners+0.014
 From+0.013
Neg logit contributions
gh-0.035
ners-0.034
ven-0.032
 tide-0.032
irl-0.032
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
 cousins+0.028
 nephew+0.025
 granddaughter+0.025
 owners+0.023
ringe+0.023
Neg logit contributions
gh-0.036
ners-0.033
irl-0.032
ven-0.031
h-0.030
TokenBen
Token ID11696
Feature activation-6.95e+00

Pos logit contributions
 cousins+0.035
 granddaughter+0.034
 owners+0.033
 From+0.033
 nephew+0.033
Neg logit contributions
gh-0.026
ners-0.024
 tide-0.022
ven-0.022
irl-0.021
Token shook
Token ID14682
Feature activation+5.48e-01

Pos logit contributions
gh+0.234
ners+0.220
irl+0.208
 tide+0.207
works+0.196
Neg logit contributions
 cousins-0.169
 granddaughter-0.164
 owners-0.159
 From-0.158
 nephew-0.154
Token.
Token ID13
Feature activation+4.17e-02

Pos logit contributions
irl+0.042
gh+0.039
ners+0.038
ven+0.037
isting+0.036
Neg logit contributions
 owners-0.026
 home-0.026
 owner-0.026
 granddaughter-0.025
 cousins-0.025
Token She
Token ID1375
Feature activation+3.28e+00

Pos logit contributions
 cousins+0.001
 nephew+0.001
 granddaughter+0.001
 owners+0.001
ringe+0.001
Neg logit contributions
gh-0.001
ners-0.001
ven-0.001
about-0.001
 tide-0.001
Token was
Token ID373
Feature activation-1.49e-01

Pos logit contributions
 cousins+0.096
 granddaughter+0.093
 owners+0.091
 nephew+0.090
 From+0.088
Neg logit contributions
gh-0.090
ners-0.086
ven-0.078
 tide-0.078
irl-0.077
Token persistent
Token ID16218
Feature activation-1.61e+00

Pos logit contributions
gh+0.005
ners+0.004
irl+0.004
ven+0.004
 tide+0.004
Neg logit contributions
 cousins-0.004
 granddaughter-0.004
 nephew-0.004
 owners-0.003
 From-0.003
Token in
Token ID287
Feature activation-1.68e+00

Pos logit contributions
gh+0.056
ners+0.055
 tide+0.055
irl+0.055
works+0.054
Neg logit contributions
 cousins-0.034
 granddaughter-0.033
 knowing-0.032
 owners-0.032
 nephew-0.030
Token her
Token ID607
Feature activation+4.79e+00

Pos logit contributions
ners+0.057
gh+0.057
ven+0.053
 tide+0.052
works+0.052
Neg logit contributions
 cousins-0.038
 granddaughter-0.036
 owners-0.036
 nephew-0.035
 From-0.033
Token efforts
Token ID4040
Feature activation+2.13e+00

Pos logit contributions
 cousins+0.095
 granddaughter+0.089
 owners+0.085
 nephew+0.084
 From+0.077
Neg logit contributions
gh-0.180
ners-0.177
ven-0.169
 tide-0.164
about-0.164
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
 cousins+0.055
 granddaughter+0.054
 owners+0.053
 From+0.051
 nephew+0.051
Neg logit contributions
ners-0.063
gh-0.061
irl-0.059
 tide-0.058
ven-0.056
Token but
Token ID475
Feature activation-8.90e-01

Pos logit contributions
gh+0.063
ners+0.060
ven+0.056
irl+0.055
 tide+0.054
Neg logit contributions
 cousins-0.055
 granddaughter-0.053
 owners-0.051
 nephew-0.051
 From-0.049
Token eventually
Token ID4191
Feature activation+3.16e-01

Pos logit contributions
gh+0.024
ners+0.023
ven+0.022
 tide+0.021
irl+0.021
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.024
 From-0.023
Token the
Token ID262
Feature activation-4.44e-01

Pos logit contributions
 cousins+0.009
 granddaughter+0.008
 nephew+0.008
 owners+0.008
 From+0.008
Neg logit contributions
gh-0.009
ners-0.009
ven-0.008
irl-0.008
 tide-0.008
Token a
Token ID257
Feature activation-1.57e+00

Pos logit contributions
gh+0.030
ners+0.029
about+0.027
irl+0.027
ven+0.027
Neg logit contributions
 cousins-0.020
 granddaughter-0.019
 owners-0.017
 nephew-0.017
 From-0.017
Token head
Token ID1182
Feature activation+3.14e+00

Pos logit contributions
gh+0.042
ners+0.040
ven+0.036
 tide+0.036
irl+0.036
Neg logit contributions
 cousins-0.047
 granddaughter-0.046
 nephew-0.044
 owners-0.043
 From-0.043
Token but
Token ID475
Feature activation-1.29e+00

Pos logit contributions
 cousins+0.095
 granddaughter+0.093
 nephew+0.089
 owners+0.087
 From+0.084
Neg logit contributions
gh-0.082
ners-0.079
ven-0.070
irl-0.070
about-0.069
Token no
Token ID645
Feature activation-2.41e+00

Pos logit contributions
gh+0.039
ners+0.038
about+0.034
irl+0.033
ven+0.033
Neg logit contributions
 cousins-0.035
 granddaughter-0.034
 nephew-0.033
 owners-0.033
 From-0.032
Token body
Token ID1767
Feature activation-1.03e+00

Pos logit contributions
ners+0.069
gh+0.069
ven+0.064
irl+0.061
works+0.060
Neg logit contributions
 cousins-0.067
 granddaughter-0.067
 nephew-0.065
 owners-0.061
 wife-0.059
Token?"
Token ID1701
Feature activation+9.88e-01

Pos logit contributions
gh+0.028
ners+0.027
about+0.024
ven+0.024
 tide+0.023
Neg logit contributions
 cousins-0.029
 granddaughter-0.028
 owners-0.027
 nephew-0.027
 From-0.026
Token The
Token ID383
Feature activation-3.38e-01

Pos logit contributions
 cousins+0.018
 granddaughter+0.016
 nephew+0.015
 owners+0.015
 grandparents+0.013
Neg logit contributions
gh-0.039
ners-0.038
ven-0.036
 tide-0.035
irl-0.035
Token boy
Token ID2933
Feature activation+1.14e+00

Pos logit contributions
ners+0.011
gh+0.011
 tide+0.010
about+0.010
ven+0.010
Neg logit contributions
 cousins-0.008
 granddaughter-0.008
 nephew-0.007
 owners-0.007
 From-0.007
Token thought
Token ID1807
Feature activation+1.49e+00

Pos logit contributions
 cousins+0.031
 granddaughter+0.030
 nephew+0.029
 owners+0.028
 From+0.028
Neg logit contributions
gh-0.034
ners-0.033
 tide-0.030
irl-0.030
ven-0.030
Token for
Token ID329
Feature activation+1.49e+00

Pos logit contributions
 granddaughter+0.020
 cousins+0.019
 From+0.019
 glad+0.018
 nephew+0.017
Neg logit contributions
ners-0.061
gh-0.061
about-0.057
ven-0.057
 tide-0.057
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
 cousins+0.039
 granddaughter+0.037
 nephew+0.036
 owners+0.035
 From+0.034
Neg logit contributions
gh-0.047
ners-0.045
ven-0.041
irl-0.041
 tide-0.041
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
gh+0.027
irl+0.024
works+0.024
ners+0.024
ven+0.024
Neg logit contributions
 granddaughter-0.018
 From-0.018
 glad-0.018
 owners-0.018
 cousins-0.017
Token then
Token ID788
Feature activation+8.44e-01

Pos logit contributions
 cousins+0.063
 granddaughter+0.061
 nephew+0.059
 owners+0.058
 From+0.058
Neg logit contributions
gh-0.066
ners-0.064
ven-0.058
irl-0.057
 tide-0.057
Token scr
Token ID6040
Feature activation-4.03e+00

Pos logit contributions
 granddaughter+0.020
 cousins+0.020
 owners+0.019
 nephew+0.019
 From+0.018
Neg logit contributions
gh-0.028
ners-0.027
irl-0.025
works-0.024
about-0.024
Tokenabb
Token ID6485
Feature activation-5.45e+00

Pos logit contributions
gh+0.109
ners+0.099
irl+0.095
 tide+0.092
ven+0.089
Neg logit contributions
 cousins-0.130
 granddaughter-0.124
 nephew-0.123
 owners-0.114
 niece-0.111
Tokenled
Token ID992
Feature activation+2.79e+00

Pos logit contributions
ners+0.180
gh+0.180
 competition+0.177
ven+0.173
 tide+0.170
Neg logit contributions
 From-0.092
 granddaughter-0.090
 owners-0.088
 cousins-0.087
 nephew-0.079
Token underneath
Token ID14638
Feature activation-3.63e-01

Pos logit contributions
 cousins+0.051
 granddaughter+0.051
 owners+0.048
 From+0.048
 nephew+0.048
Neg logit contributions
gh-0.104
ners-0.103
ven-0.096
irl-0.095
about-0.093
Token some
Token ID617
Feature activation+1.65e+00

Pos logit contributions
ven+0.013
ners+0.013
gh+0.013
works+0.013
irl+0.013
Neg logit contributions
 owners-0.006
 From-0.006
 granddaughter-0.006
 heart-0.006
 ear-0.006
Token nearby
Token ID6716
Feature activation-1.33e+00

Pos logit contributions
 owners+0.034
 cousins+0.034
 granddaughter+0.033
 nephew+0.032
 pride+0.029
Neg logit contributions
gh-0.059
ners-0.057
ven-0.057
irl-0.055
works-0.052
Token leaves
Token ID5667
Feature activation-1.88e+00

Pos logit contributions
irl+0.045
gh+0.044
ners+0.043
ven+0.041
isting+0.039
Neg logit contributions
 owners-0.032
 granddaughter-0.031
 nephew-0.030
 cousins-0.029
 heart-0.029
Token.
Token ID13
Feature activation-7.79e-02

Pos logit contributions
gh+0.063
irl+0.061
ners+0.061
ven+0.060
works+0.056
Neg logit contributions
 owners-0.038
 home-0.036
 owner-0.036
 knowing-0.036
 reunited-0.036
Token After
Token ID2293
Feature activation+2.72e+00

Pos logit contributions
gh+0.003
ners+0.003
ven+0.003
 tide+0.003
irl+0.003
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
 From-0.001
Token the
Token ID262
Feature activation-9.99e-01

Pos logit contributions
gh+0.049
ners+0.047
ven+0.044
irl+0.043
 tide+0.042
Neg logit contributions
 cousins-0.042
 granddaughter-0.040
 nephew-0.038
 owners-0.038
 From-0.037
Token lion
Token ID18744
Feature activation+1.71e+00

Pos logit contributions
gh+0.033
ners+0.032
 tide+0.030
ven+0.029
irl+0.029
Neg logit contributions
 cousins-0.023
 granddaughter-0.022
 nephew-0.021
 From-0.020
 owners-0.020
Token and
Token ID290
Feature activation+1.96e+00

Pos logit contributions
 cousins+0.048
 granddaughter+0.046
 nephew+0.044
 owners+0.043
 From+0.042
Neg logit contributions
gh-0.050
ners-0.048
ven-0.044
irl-0.043
 tide-0.043
Token the
Token ID262
Feature activation-9.81e-01

Pos logit contributions
 cousins+0.058
 granddaughter+0.056
 nephew+0.054
 owners+0.053
 From+0.052
Neg logit contributions
gh-0.050
ners-0.049
 tide-0.045
ven-0.045
irl-0.044
Token mouse
Token ID10211
Feature activation+3.40e+00

Pos logit contributions
gh+0.035
ners+0.034
ven+0.032
irl+0.032
about+0.031
Neg logit contributions
 cousins-0.022
 granddaughter-0.020
 nephew-0.020
 owners-0.019
 From-0.018
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
 cousins+0.103
 granddaughter+0.099
 nephew+0.095
 owners+0.095
 From+0.092
Neg logit contributions
gh-0.091
ners-0.087
ven-0.080
 tide-0.077
irl-0.077
Token the
Token ID262
Feature activation-9.53e-01

Pos logit contributions
 cousins+0.064
 granddaughter+0.063
 nephew+0.060
 owners+0.058
rolley+0.057
Neg logit contributions
gh-0.045
ners-0.045
 tide-0.042
ven-0.041
irl-0.040
Token snake
Token ID17522
Feature activation+3.81e-01

Pos logit contributions
gh+0.034
ners+0.033
ven+0.031
irl+0.031
about+0.030
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 nephew-0.020
 owners-0.019
 From-0.017
Token happy
Token ID3772
Feature activation+4.42e+00

Pos logit contributions
 cousins+0.012
 granddaughter+0.012
 nephew+0.011
 owners+0.011
 From+0.011
Neg logit contributions
gh-0.010
ners-0.009
ven-0.008
irl-0.008
 tide-0.008
Token with
Token ID351
Feature activation+1.67e+00

Pos logit contributions
 cousins+0.128
 granddaughter+0.122
 owners+0.118
 From+0.117
 nephew+0.116
Neg logit contributions
gh-0.124
ners-0.117
ven-0.109
irl-0.106
 tide-0.104
Token my
Token ID616
Feature activation+2.49e+00

Pos logit contributions
 cousins+0.046
 granddaughter+0.044
 nephew+0.042
 owners+0.041
 From+0.040
Neg logit contributions
gh-0.049
ners-0.047
ven-0.043
irl-0.043
 tide-0.043
Token had
Token ID550
Feature activation-1.33e+00

Pos logit contributions
 cousins+0.060
 From+0.055
 granddaughter+0.054
 owners+0.054
 nephew+0.054
Neg logit contributions
gh-0.065
ners-0.058
ven-0.056
about-0.054
 tide-0.053
Token so
Token ID523
Feature activation+5.45e-01

Pos logit contributions
ners+0.044
gh+0.043
works+0.042
ven+0.041
irl+0.040
Neg logit contributions
 cousins-0.031
 From-0.029
 granddaughter-0.028
 owners-0.028
 pride-0.027
Token much
Token ID881
Feature activation+6.98e-02

Pos logit contributions
 cousins+0.010
 From+0.010
 glad+0.010
 granddaughter+0.009
 owners+0.009
Neg logit contributions
gh-0.020
ners-0.020
ven-0.019
 tide-0.019
works-0.018
Token fun
Token ID1257
Feature activation-5.54e+00

Pos logit contributions
 cousins+0.002
 From+0.002
 glad+0.002
 granddaughter+0.002
 pride+0.002
Neg logit contributions
gh-0.002
ners-0.002
ven-0.002
redible-0.002
about-0.002
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
ven+0.185
gh+0.183
ners+0.181
redible+0.176
works+0.167
Neg logit contributions
 cousins-0.123
 friends-0.116
 knowing-0.115
 home-0.108
 owners-0.106
Token The
Token ID383
Feature activation-5.85e-01

Pos logit contributions
ners+0.007
gh+0.007
irl+0.006
about+0.006
ven+0.006
Neg logit contributions
 cousins-0.006
 nephew-0.006
 granddaughter-0.006
 owners-0.006
ringe-0.005
Token water
Token ID1660
Feature activation-2.44e+00

Pos logit contributions
gh+0.020
ven+0.018
h+0.018
ners+0.018
lex+0.016
Neg logit contributions
 cousins-0.015
 owners-0.014
 granddaughter-0.014
 parents-0.013
 family-0.013
Token felt
Token ID2936
Feature activation+3.00e+00

Pos logit contributions
gh+0.071
ven+0.066
ners+0.066
lex+0.060
irl+0.059
Neg logit contributions
 cousins-0.065
 owners-0.063
 granddaughter-0.058
 From-0.058
 owner-0.058
Token cool
Token ID3608
Feature activation-2.48e+00

Pos logit contributions
 cousins+0.074
 glad+0.070
 granddaughter+0.070
 From+0.070
 owners+0.068
Neg logit contributions
gh-0.094
ners-0.089
ven-0.087
about-0.083
irl-0.080
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
gh+0.106
ners+0.100
ven+0.099
irl+0.099
works+0.093
Neg logit contributions
 knowing-0.035
 cousins-0.032
 home-0.028
 granddaughter-0.028
 From-0.028
Token nice
Token ID3621
Feature activation-1.45e-01

Pos logit contributions
 cousins+0.053
 glad+0.052
 From+0.049
 owners+0.048
 granddaughter+0.048
Neg logit contributions
gh-0.070
ners-0.065
ven-0.063
about-0.062
th-0.057
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
gh+0.071
ners+0.070
ven+0.064
irl+0.063
about+0.062
Neg logit contributions
 cousins-0.063
 granddaughter-0.059
 nephew-0.058
 owners-0.056
 From-0.055
Token Tom
Token ID4186
Feature activation-5.18e+00

Pos logit contributions
 cousins+0.063
 granddaughter+0.060
 From+0.059
 nephew+0.058
 owners+0.057
Neg logit contributions
gh-0.051
ners-0.048
 tide-0.045
ven-0.043
irl-0.043
Token nodded
Token ID14464
Feature activation+3.23e+00

Pos logit contributions
gh+0.146
ners+0.139
ven+0.129
 tide+0.126
irl+0.125
Neg logit contributions
 cousins-0.147
 granddaughter-0.141
 nephew-0.136
 owners-0.133
 From-0.131
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
 cousins+0.086
 granddaughter+0.085
 owners+0.080
 nephew+0.080
 From+0.078
Neg logit contributions
gh-0.100
ners-0.096
ven-0.092
irl-0.089
 tide-0.088
Token said
Token ID531
Feature activation+4.85e+00

Pos logit contributions
 cousins+0.056
 granddaughter+0.053
 nephew+0.052
 owners+0.050
 From+0.050
Neg logit contributions
gh-0.059
ners-0.057
ven-0.052
 tide-0.052
irl-0.051
Token they
Token ID484
Feature activation-1.16e+00

Pos logit contributions
 cousins+0.094
 glad+0.091
 granddaughter+0.090
 From+0.090
 owners+0.086
Neg logit contributions
ners-0.181
gh-0.173
about-0.171
ven-0.168
 matter-0.166
Token would
Token ID561
Feature activation-1.94e+00

Pos logit contributions
gh+0.029
ners+0.028
ven+0.026
irl+0.025
 tide+0.025
Neg logit contributions
 cousins-0.036
 granddaughter-0.035
 nephew-0.033
 owners-0.032
 From-0.032
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
ners+0.065
gh+0.063
about+0.058
ven+0.057
 tide+0.056
Neg logit contributions
 cousins-0.046
 owners-0.044
 granddaughter-0.044
 nephew-0.042
 From-0.042
Token They
Token ID1119
Feature activation-1.47e+00

Pos logit contributions
ners+0.007
gh+0.007
 tide+0.006
about+0.006
ven+0.006
Neg logit contributions
 cousins-0.008
 nephew-0.007
 granddaughter-0.007
ringe-0.007
 owners-0.007
Token learned
Token ID4499
Feature activation+5.91e+00

Pos logit contributions
gh+0.038
ners+0.038
ven+0.035
irl+0.034
 tide+0.034
Neg logit contributions
 cousins-0.042
 granddaughter-0.040
 nephew-0.039
 From-0.037
 owners-0.037
Token their
Token ID511
Feature activation+2.82e+00

Pos logit contributions
 cousins+0.181
 granddaughter+0.171
 nephew+0.170
 owners+0.161
 From+0.161
Neg logit contributions
gh-0.147
ners-0.139
 tide-0.129
irl-0.125
ven-0.122
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
 cousins+0.099
 granddaughter+0.091
 From+0.090
 owners+0.090
 nephew+0.087
Neg logit contributions
gh-0.099
ners-0.092
irl-0.086
 tide-0.084
ven-0.083
Token They
Token ID1119
Feature activation-1.57e+00

Pos logit contributions
ners+0.011
gh+0.011
 tide+0.010
ven+0.010
irl+0.010
Neg logit contributions
 cousins-0.009
 granddaughter-0.009
 nephew-0.008
 owners-0.008
ringe-0.008
Token think
Token ID892
Feature activation+3.69e-01

Pos logit contributions
gh+0.048
ners+0.046
ven+0.043
irl+0.042
 tide+0.042
Neg logit contributions
 cousins-0.041
 granddaughter-0.040
 nephew-0.038
 owners-0.037
 From-0.036
Token it
Token ID340
Feature activation-6.65e-01

Pos logit contributions
 cousins+0.008
 granddaughter+0.007
 From+0.007
 grandma+0.007
 parents+0.007
Neg logit contributions
gh-0.013
ners-0.013
ven-0.012
h-0.012
redible-0.012
Token is
Token ID318
Feature activation-7.28e-01

Pos logit contributions
gh+0.020
ners+0.019
 tide+0.018
about+0.017
ven+0.017
Neg logit contributions
 cousins-0.018
 From-0.016
 granddaughter-0.016
 owners-0.016
 nephew-0.016
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
ners+0.025
about+0.025
gh+0.025
th+0.023
h+0.023
Neg logit contributions
 cousins-0.018
 granddaughter-0.016
 owners-0.016
 nephew-0.015
 grandma-0.015
Token clever
Token ID14169
Feature activation-1.92e+00

Pos logit contributions
gh+0.047
ners+0.045
ven+0.043
irl+0.040
about+0.040
Neg logit contributions
 cousins-0.049
 granddaughter-0.046
 nephew-0.045
 owners-0.044
 grandparents-0.042
Token trick
Token ID6908
Feature activation-2.33e+00

Pos logit contributions
gh+0.066
ven+0.064
ners+0.064
irl+0.062
redible+0.060
Neg logit contributions
 cousins-0.045
 granddaughter-0.045
 nephew-0.043
 wife-0.040
 owners-0.039
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
ners+0.066
gh+0.063
irl+0.062
ven+0.060
 tide+0.058
Neg logit contributions
 cousins-0.068
 granddaughter-0.065
 nephew-0.063
 owners-0.063
 niece-0.057
Token
Token ID198
Feature activation+7.43e-01

Pos logit contributions
gh+0.011
ners+0.011
 tide+0.010
ven+0.010
irl+0.010
Neg logit contributions
 cousins-0.008
 granddaughter-0.007
 nephew-0.007
 owners-0.007
ringe-0.006
Token
Token ID198
Feature activation+7.60e-01

Pos logit contributions
 cousins+0.014
 granddaughter+0.013
 nephew+0.012
 owners+0.012
 From+0.011
Neg logit contributions
gh-0.029
ners-0.028
irl-0.026
ven-0.026
 tide-0.026
Token to
Token ID284
Feature activation-8.47e-01

Pos logit contributions
gh+0.013
ners+0.012
irl+0.012
ven+0.011
about+0.011
Neg logit contributions
 cousins-0.007
 granddaughter-0.007
 nephew-0.006
 owners-0.006
 From-0.006
Token play
Token ID711
Feature activation-3.27e+00

Pos logit contributions
gh+0.026
ners+0.026
ven+0.023
works+0.023
 tide+0.023
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 nephew-0.020
 owners-0.019
 aunt-0.018
Token a
Token ID257
Feature activation-1.69e+00

Pos logit contributions
gh+0.111
ners+0.106
ven+0.101
 tide+0.099
works+0.097
Neg logit contributions
 cousins-0.073
 granddaughter-0.069
 owners-0.063
 nephew-0.063
 From-0.062
Token trick
Token ID6908
Feature activation-2.18e+00

Pos logit contributions
gh+0.039
ners+0.038
ven+0.035
irl+0.033
 tide+0.032
Neg logit contributions
 cousins-0.056
 granddaughter-0.054
 nephew-0.051
 owners-0.051
 grandparents-0.049
Token on
Token ID319
Feature activation-3.23e+00

Pos logit contributions
ners+0.068
gh+0.067
 tide+0.064
irl+0.063
ven+0.063
Neg logit contributions
 cousins-0.054
 granddaughter-0.052
 nephew-0.048
 owners-0.048
 owner-0.044
Token Tom
Token ID4186
Feature activation-4.88e+00

Pos logit contributions
gh+0.121
ners+0.116
ven+0.112
isting+0.108
 tide+0.108
Neg logit contributions
 cousins-0.063
 grandma-0.061
 granddaughter-0.060
 owners-0.058
 aunt-0.055
Token.
Token ID13
Feature activation+6.44e-02

Pos logit contributions
 tide+0.193
ners+0.188
works+0.182
obb+0.181
gh+0.180
Neg logit contributions
 cousins-0.087
 granddaughter-0.081
 nephew-0.076
 owners-0.066
 niece-0.064
Token She
Token ID1375
Feature activation+2.84e+00

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 nephew+0.001
 owners+0.001
 From+0.001
Neg logit contributions
gh-0.003
ners-0.003
ven-0.002
irl-0.002
 tide-0.002
Token took
Token ID1718
Feature activation-5.36e-02

Pos logit contributions
 cousins+0.070
 granddaughter+0.069
 nephew+0.065
 owners+0.064
 From+0.063
Neg logit contributions
gh-0.098
ners-0.086
 tide-0.084
redible-0.082
works-0.082
Token his
Token ID465
Feature activation+2.92e+00

Pos logit contributions
gh+0.002
ners+0.002
ven+0.002
irl+0.002
 tide+0.002
Neg logit contributions
 cousins-0.001
 granddaughter-0.001
 nephew-0.001
 owners-0.001
 From-0.001
Token star
Token ID3491
Feature activation+7.52e-01

Pos logit contributions
 cousins+0.084
 granddaughter+0.081
 nephew+0.077
 owners+0.075
 From+0.071
Neg logit contributions
gh-0.084
ners-0.080
ven-0.074
irl-0.072
about-0.072
Token play
Token ID711
Feature activation-3.02e+00

Pos logit contributions
gh+0.021
ners+0.021
irl+0.019
 tide+0.019
ven+0.019
Neg logit contributions
 cousins-0.023
 granddaughter-0.022
 nephew-0.021
 owners-0.021
 From-0.020
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
gh+0.085
ners+0.081
ven+0.076
 tide+0.074
works+0.072
Neg logit contributions
 cousins-0.086
 granddaughter-0.082
 From-0.079
 owners-0.079
 nephew-0.078
Token It
Token ID632
Feature activation-1.96e-01

Pos logit contributions
 cousins+0.003
 nephew+0.003
 granddaughter+0.002
 owners+0.002
 grandparents+0.002
Neg logit contributions
ners-0.005
gh-0.005
about-0.005
 tide-0.004
irl-0.004
Token was
Token ID373
Feature activation-4.14e-01

Pos logit contributions
gh+0.006
ners+0.005
ven+0.005
 tide+0.005
irl+0.005
Neg logit contributions
 cousins-0.005
 From-0.005
 owners-0.005
 granddaughter-0.005
 nephew-0.005
Token summer
Token ID3931
Feature activation-6.22e+00

Pos logit contributions
gh+0.013
ners+0.013
ven+0.012
irl+0.011
 tide+0.011
Neg logit contributions
 cousins-0.011
 granddaughter-0.010
 nephew-0.010
 owners-0.010
 From-0.010
Token,
Token ID11
Feature activation-2.20e+00

Pos logit contributions
ners+0.185
gh+0.184
irl+0.175
 tide+0.170
isting+0.168
Neg logit contributions
 cousins-0.142
 owners-0.137
 From-0.137
 grandma-0.136
 granddaughter-0.134
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
gh+0.073
ners+0.068
ven+0.067
works+0.062
irl+0.061
Neg logit contributions
 cousins-0.052
 From-0.050
 glad-0.049
 granddaughter-0.048
 owners-0.046
Token the
Token ID262
Feature activation-7.74e-01

Pos logit contributions
 cousins+0.057
 granddaughter+0.054
 owners+0.052
 From+0.051
 nephew+0.050
Neg logit contributions
gh-0.072
ners-0.067
ven-0.063
irl-0.060
 tide-0.060
Token sun
Token ID4252
Feature activation-5.85e-01

Pos logit contributions
gh+0.030
ven+0.028
ners+0.028
irl+0.026
isting+0.026
Neg logit contributions
 cousins-0.016
 owners-0.015
 granddaughter-0.015
 nephew-0.014
 owner-0.013
Token was
Token ID373
Feature activation-3.85e-01

Pos logit contributions
gh+0.016
ners+0.015
ven+0.014
 tide+0.014
irl+0.014
Neg logit contributions
 cousins-0.017
 granddaughter-0.016
 owners-0.016
 nephew-0.016
 From-0.015
Token shining
Token ID22751
Feature activation-8.35e-01

Pos logit contributions
ners+0.012
gh+0.011
ven+0.011
about+0.011
th+0.010
Neg logit contributions
 cousins-0.010
 glad-0.010
 granddaughter-0.009
 owners-0.009
 From-0.009
Token room
Token ID2119
Feature activation+2.72e+00

Pos logit contributions
 granddaughter+0.058
 From+0.058
 cousins+0.057
 knowing+0.056
 nephew+0.055
Neg logit contributions
gh-0.077
ners-0.074
irl-0.067
 competition-0.065
 time-0.065
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
 cousins+0.079
 granddaughter+0.075
 owners+0.071
 nephew+0.068
 From+0.068
Neg logit contributions
gh-0.077
ners-0.076
ven-0.068
irl-0.068
 tide-0.068
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 cousins+0.001
 granddaughter+0.001
 From+0.001
 nephew+0.001
 owners+0.001
Neg logit contributions
gh-0.002
ners-0.002
ven-0.001
irl-0.001
th-0.001
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
 From+0.013
 cousins+0.013
 granddaughter+0.012
 owners+0.011
 nephew+0.011
Neg logit contributions
gh-0.044
ners-0.043
 tide-0.042
ven-0.041
works-0.040
TokenOne
Token ID3198
Feature activation-9.50e-01

Pos logit contributions
 owners+0.021
 From+0.021
 owner+0.021
 cousins+0.021
 While+0.020
Neg logit contributions
gh-0.035
works-0.033
 happen-0.032
ners-0.032
 tide-0.031
Token day
Token ID1110
Feature activation-1.86e+00

Pos logit contributions
gh+0.025
ners+0.024
ven+0.022
irl+0.021
 tide+0.020
Neg logit contributions
 cousins-0.030
 granddaughter-0.029
 nephew-0.028
 owners-0.028
 From-0.027
Token,
Token ID11
Feature activation-2.35e+00

Pos logit contributions
ners+0.065
gh+0.063
works+0.061
rain+0.059
 rou+0.059
Neg logit contributions
 cousins-0.038
 friends-0.036
 From-0.034
 parents-0.033
 family-0.033
Token they
Token ID484
Feature activation-1.40e+00

Pos logit contributions
gh+0.078
ners+0.072
th+0.064
ven+0.064
irl+0.064
Neg logit contributions
 cousins-0.062
 granddaughter-0.059
 nephew-0.056
 owners-0.055
 wife-0.054
Token decide
Token ID5409
Feature activation+4.73e+00

Pos logit contributions
gh+0.046
ners+0.045
ven+0.041
 tide+0.041
irl+0.041
Neg logit contributions
 cousins-0.034
 granddaughter-0.032
 nephew-0.031
 owners-0.030
 From-0.029
Token to
Token ID284
Feature activation-9.66e-01

Pos logit contributions
 cousins+0.077
 granddaughter+0.076
 nephew+0.066
 owners+0.063
 grandma+0.062
Neg logit contributions
ners-0.205
gh-0.200
irl-0.191
ven-0.187
obb-0.185
Token play
Token ID711
Feature activation-3.28e+00

Pos logit contributions
gh+0.030
ners+0.028
ven+0.026
 tide+0.026
h+0.026
Neg logit contributions
 cousins-0.025
 nephew-0.023
 owners-0.023
 granddaughter-0.023
 aunt-0.022
TokenONG
Token ID18494
Feature activation-8.27e+00

Pos logit contributions
gh+0.194
ners+0.181
 tide+0.175
ven+0.168
irl+0.168
Neg logit contributions
 granddaughter-0.196
 cousins-0.193
 nephew-0.182
 From-0.180
 owners-0.176
Token!
Token ID0
Feature activation-9.07e-01

Pos logit contributions
 tide+0.211
works+0.207
 rect+0.205
 seaw+0.202
 hilarious+0.197
Neg logit contributions
 granddaughter-0.229
 home-0.220
 aunt-0.218
ringe-0.217
 ear-0.215
Token B
Token ID347
Feature activation-6.74e+00

Pos logit contributions
gh+0.028
ners+0.027
ven+0.024
irl+0.024
 tide+0.024
Neg logit contributions
 cousins-0.024
 granddaughter-0.023
 nephew-0.022
 owners-0.022
 From-0.021
TokenONG
Token ID18494
Feature activation-8.20e+00

Pos logit contributions
gh+0.188
ners+0.177
 tide+0.172
irl+0.164
ven+0.162
Neg logit contributions
 granddaughter-0.191
 cousins-0.189
 nephew-0.178
 From-0.177
 owners-0.173
Token!
Token ID0
Feature activation-9.51e-01

Pos logit contributions
 tide+0.221
 rect+0.221
works+0.218
 seaw+0.217
irl+0.206
Neg logit contributions
 home-0.208
 granddaughter-0.203
 aunt-0.201
 ear-0.198
ringe-0.195
Token B
Token ID347
Feature activation-6.65e+00

Pos logit contributions
gh+0.028
ners+0.027
ven+0.025
 tide+0.024
irl+0.024
Neg logit contributions
 cousins-0.026
 granddaughter-0.025
 nephew-0.024
 owners-0.023
 From-0.023
TokenONG
Token ID18494
Feature activation-8.15e+00

Pos logit contributions
gh+0.185
ners+0.175
 tide+0.171
irl+0.163
about+0.160
Neg logit contributions
 granddaughter-0.187
 cousins-0.185
 From-0.175
 nephew-0.174
 owners-0.170
Token!
Token ID0
Feature activation-9.81e-01

Pos logit contributions
 tide+0.226
 rect+0.225
 seaw+0.222
works+0.219
irl+0.207
Neg logit contributions
 home-0.203
 granddaughter-0.195
 aunt-0.194
 ear-0.190
ringe-0.187
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
gh+0.029
ners+0.028
ven+0.025
 tide+0.025
irl+0.025
Neg logit contributions
 cousins-0.027
 granddaughter-0.026
 nephew-0.025
 owners-0.024
 From-0.024
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 cousins+0.016
 granddaughter+0.015
 From+0.014
 nephew+0.013
 owners+0.013
Neg logit contributions
gh-0.043
ners-0.042
ven-0.040
 tide-0.039
irl-0.039
TokenSam
Token ID16305
Feature activation-7.49e+00

Pos logit contributions
 From+0.028
 owners+0.028
 cousins+0.027
 granddaughter+0.027
 nephew+0.025
Neg logit contributions
gh-0.030
ven-0.027
ners-0.027
 tide-0.027
works-0.026
Token same
Token ID976
Feature activation+6.51e-01

Pos logit contributions
gh+0.035
ven+0.032
ners+0.031
irl+0.030
about+0.030
Neg logit contributions
 cousins-0.030
 granddaughter-0.029
 nephew-0.028
 owners-0.027
 parent-0.025
Token way
Token ID835
Feature activation+7.50e-01

Pos logit contributions
 cousins+0.018
 granddaughter+0.017
 nephew+0.016
 owners+0.016
 heart+0.015
Neg logit contributions
gh-0.020
ven-0.019
ners-0.019
irl-0.018
 tide-0.017
Token?"
Token ID1701
Feature activation+5.61e-01

Pos logit contributions
 cousins+0.017
 granddaughter+0.017
 grandma+0.016
 home+0.016
 parent+0.016
Neg logit contributions
gh-0.026
ven-0.023
irl-0.022
ners-0.022
 rect-0.022
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
 cousins+0.011
 granddaughter+0.010
 nephew+0.010
 owners+0.010
 From+0.009
Neg logit contributions
gh-0.021
ners-0.020
ven-0.018
 tide-0.018
irl-0.018
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 cousins+0.020
 granddaughter+0.019
 nephew+0.018
 owners+0.017
 From+0.016
Neg logit contributions
gh-0.041
ners-0.039
irl-0.037
ven-0.037
 tide-0.036
TokenSam
Token ID16305
Feature activation-7.56e+00

Pos logit contributions
 cousins+0.037
 granddaughter+0.036
 nephew+0.036
 From+0.036
 owners+0.036
Neg logit contributions
gh-0.022
ners-0.021
 tide-0.019
works-0.019
ven-0.019
Token thought
Token ID1807
Feature activation+1.58e+00

Pos logit contributions
gh+0.213
ners+0.204
ven+0.186
irl+0.183
 tide+0.182
Neg logit contributions
 cousins-0.218
 granddaughter-0.208
 nephew-0.202
 owners-0.198
 From-0.193
Token about
Token ID546
Feature activation+4.22e+00

Pos logit contributions
 cousins+0.018
 granddaughter+0.018
 From+0.017
 glad+0.015
 grandma+0.014
Neg logit contributions
gh-0.070
ners-0.069
ven-0.066
 tide-0.065
about-0.065
Token what
Token ID644
Feature activation+1.77e+00

Pos logit contributions
 cousins+0.113
 granddaughter+0.108
 From+0.105
 nephew+0.103
 owners+0.103
Neg logit contributions
gh-0.126
ners-0.125
ven-0.115
 tide-0.111
about-0.110
Token Mom
Token ID11254
Feature activation-1.06e+00

Pos logit contributions
 cousins+0.032
 granddaughter+0.031
 nephew+0.029
 owners+0.028
 From+0.026
Neg logit contributions
ners-0.070
gh-0.069
ven-0.065
about-0.064
irl-0.064
Tokenmy
Token ID1820
Feature activation+7.02e-01

Pos logit contributions
gh+0.042
ners+0.039
 tide+0.038
irl+0.038
ven+0.037
Neg logit contributions
 cousins-0.021
 granddaughter-0.020
 nephew-0.019
 owners-0.018
 From-0.017
Token have
Token ID423
Feature activation+7.58e-01

Pos logit contributions
gh+0.167
ners+0.158
ven+0.144
 tide+0.144
irl+0.142
Neg logit contributions
 cousins-0.151
 granddaughter-0.142
 nephew-0.138
 owners-0.136
 From-0.132
Token buttons
Token ID12163
Feature activation-2.14e+00

Pos logit contributions
 cousins+0.020
 granddaughter+0.019
 nephew+0.018
 owners+0.018
 From+0.018
Neg logit contributions
gh-0.023
ners-0.023
ven-0.021
irl-0.021
 tide-0.020
Token for
Token ID329
Feature activation+1.43e+00

Pos logit contributions
gh+0.062
ners+0.058
irl+0.055
ven+0.054
 tide+0.054
Neg logit contributions
 cousins-0.061
 granddaughter-0.058
 nephew-0.056
 owners-0.055
 From-0.053
Token you
Token ID345
Feature activation-4.84e+00

Pos logit contributions
 cousins+0.039
 granddaughter+0.036
 nephew+0.035
 owners+0.034
 From+0.033
Neg logit contributions
gh-0.044
ners-0.043
ven-0.039
about-0.038
 tide-0.038
Token.
Token ID13
Feature activation-5.21e-01

Pos logit contributions
gh+0.133
ners+0.130
irl+0.117
ven+0.117
 tide+0.116
Neg logit contributions
 cousins-0.145
 granddaughter-0.138
 nephew-0.132
 owners-0.132
 From-0.128
Token Do
Token ID2141
Feature activation-7.21e+00

Pos logit contributions
gh+0.014
ners+0.014
ven+0.013
 tide+0.013
works+0.012
Neg logit contributions
 cousins-0.014
 granddaughter-0.014
 nephew-0.013
 owners-0.013
 From-0.012
Token you
Token ID345
Feature activation-4.86e+00

Pos logit contributions
gh+0.184
ners+0.182
 tide+0.160
about+0.159
ven+0.157
Neg logit contributions
 cousins-0.229
 granddaughter-0.215
 nephew-0.209
 owners-0.208
 From-0.205
Token want
Token ID765
Feature activation-2.22e-01

Pos logit contributions
gh+0.168
ners+0.161
ven+0.148
 tide+0.147
works+0.146
Neg logit contributions
 cousins-0.109
 granddaughter-0.105
 nephew-0.098
 From-0.093
 pride-0.093
Token some
Token ID617
Feature activation+1.18e+00

Pos logit contributions
gh+0.007
ners+0.007
ven+0.006
 tide+0.006
irl+0.006
Neg logit contributions
 cousins-0.006
 granddaughter-0.006
 nephew-0.005
 From-0.005
 owners-0.005
Token?"
Token ID1701
Feature activation+2.81e-01

Pos logit contributions
 cousins+0.038
 granddaughter+0.036
 nephew+0.035
 owners+0.034
 From+0.034
Neg logit contributions
gh-0.030
ners-0.028
ven-0.026
irl-0.025
 tide-0.025
Token
Token ID198
Feature activation+5.80e-01

Pos logit contributions
 cousins+0.006
 granddaughter+0.005
 nephew+0.005
 owners+0.005
ringe+0.005
Neg logit contributions
gh-0.010
ners-0.010
 tide-0.009
ven-0.009
irl-0.009
Token replied
Token ID8712
Feature activation+5.44e+00

Pos logit contributions
gh+0.071
ners+0.068
ven+0.062
irl+0.061
 tide+0.060
Neg logit contributions
 cousins-0.072
 granddaughter-0.068
 nephew-0.067
 owners-0.065
 From-0.062
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 cousins+0.162
 granddaughter+0.162
 From+0.153
 nephew+0.151
 owners+0.144
Neg logit contributions
gh-0.143
ners-0.141
ven-0.136
 tide-0.135
works-0.131
Token
Token ID220
Feature activation-3.00e-01

Pos logit contributions
 cousins+0.011
 granddaughter+0.011
 nephew+0.010
 owners+0.010
 client+0.010
Neg logit contributions
gh-0.014
ners-0.013
ven-0.012
about-0.012
h-0.012
Token
Token ID198
Feature activation+1.62e+00

Pos logit contributions
gh+0.008
ners+0.007
h+0.007
irl+0.007
cl+0.007
Neg logit contributions
 cousins-0.009
 granddaughter-0.008
 nephew-0.008
 waitress-0.008
 wife-0.008
Token
Token ID198
Feature activation+1.60e+00

Pos logit contributions
 cousins+0.049
 granddaughter+0.046
 nephew+0.046
 client+0.045
 owners+0.043
Neg logit contributions
gh-0.045
ners-0.041
h-0.041
cl-0.039
irl-0.039
TokenTim
Token ID14967
Feature activation-7.34e+00

Pos logit contributions
 cousins+0.053
 granddaughter+0.051
 nephew+0.050
 From+0.049
 owners+0.049
Neg logit contributions
gh-0.038
ners-0.036
ven-0.033
 tide-0.033
irl-0.032
Tokenmy
Token ID1820
Feature activation+1.01e+00

Pos logit contributions
gh+0.308
 tide+0.296
ners+0.294
irl+0.283
ven+0.280
Neg logit contributions
 cousins-0.114
 granddaughter-0.111
 nephew-0.104
 From-0.097
 owners-0.095
Token asked
Token ID1965
Feature activation+4.67e+00

Pos logit contributions
 cousins+0.027
 granddaughter+0.026
 From+0.025
 owners+0.025
 nephew+0.025
Neg logit contributions
gh-0.030
ners-0.029
 tide-0.027
ven-0.027
irl-0.026
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
 cousins+0.128
 granddaughter+0.123
 nephew+0.118
 owners+0.115
 From+0.115
Neg logit contributions
gh-0.140
ners-0.137
ven-0.126
irl-0.125
 tide-0.124
Token "
Token ID366
Feature activation-5.07e-01

Pos logit contributions
gh+0.077
ners+0.074
irl+0.070
ven+0.069
 tide+0.068
Neg logit contributions
 cousins-0.041
 granddaughter-0.040
 nephew-0.037
 From-0.036
 owners-0.035
TokenCan
Token ID6090
Feature activation-5.56e+00

Pos logit contributions
gh+0.015
ners+0.014
ven+0.013
irl+0.013
 tide+0.013
Neg logit contributions
 cousins-0.014
 granddaughter-0.014
 nephew-0.014
 owners-0.013
 From-0.013
Token sweet
Token ID6029
Feature activation-2.02e+00

Pos logit contributions
gh+0.066
ners+0.063
ven+0.058
irl+0.058
 tide+0.056
Neg logit contributions
 cousins-0.053
 granddaughter-0.051
 nephew-0.048
 owners-0.047
 From-0.046
Tokenie
Token ID494
Feature activation+8.36e-01

Pos logit contributions
gh+0.063
ners+0.058
ven+0.057
irl+0.056
 rect+0.054
Neg logit contributions
 granddaughter-0.059
 cousins-0.056
 nephew-0.052
 waitress-0.052
 owners-0.051
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 granddaughter+0.023
 cousins+0.023
 nephew+0.021
 owners+0.020
 niece+0.020
Neg logit contributions
gh-0.026
irl-0.025
ners-0.025
ven-0.024
 tide-0.023
Token but
Token ID475
Feature activation-1.06e+00

Pos logit contributions
gh+0.065
ners+0.062
ven+0.058
irl+0.058
about+0.055
Neg logit contributions
 cousins-0.054
 granddaughter-0.053
 glad-0.050
 owners-0.049
 From-0.049
Token I
Token ID314
Feature activation-4.74e+00

Pos logit contributions
gh+0.033
ners+0.032
about+0.028
ven+0.028
irl+0.028
Neg logit contributions
 cousins-0.029
 granddaughter-0.027
 owners-0.026
 nephew-0.026
 From-0.026
Token don
Token ID836
Feature activation-7.98e+00

Pos logit contributions
gh+0.125
ners+0.118
ven+0.107
 tide+0.106
irl+0.104
Neg logit contributions
 cousins-0.146
 granddaughter-0.141
 nephew-0.136
 owners-0.134
 From-0.133
Token't
Token ID470
Feature activation-3.34e+00

Pos logit contributions
gh+0.267
ners+0.259
 tide+0.242
ven+0.240
about+0.235
Neg logit contributions
 cousins-0.180
 granddaughter-0.169
 From-0.164
 owners-0.163
 nephew-0.161
Token have
Token ID423
Feature activation+8.40e-01

Pos logit contributions
gh+0.125
ners+0.125
works+0.117
 rect+0.115
ven+0.115
Neg logit contributions
 granddaughter-0.059
 cousins-0.059
 glad-0.056
 nephew-0.055
 owners-0.054
Token enough
Token ID1576
Feature activation+7.92e-01

Pos logit contributions
 cousins+0.020
 granddaughter+0.020
 nephew+0.018
 owners+0.018
 From+0.017
Neg logit contributions
gh-0.027
ners-0.027
works-0.025
about-0.025
ven-0.025
Token money
Token ID1637
Feature activation+3.27e+00

Pos logit contributions
 cousins+0.018
 granddaughter+0.018
 nephew+0.016
 owners+0.016
 From+0.015
Neg logit contributions
gh-0.027
ners-0.027
ven-0.025
irl-0.025
 tide-0.024
Token to
Token ID284
Feature activation-1.08e+00

Pos logit contributions
 cousins+0.075
 granddaughter+0.071
 nephew+0.066
 owners+0.066
 From+0.063
Neg logit contributions
gh-0.111
ners-0.109
irl-0.103
 tide-0.101
about-0.100
TokenY
Token ID56
Feature activation-8.07e+00

Pos logit contributions
gh+0.016
ners+0.015
ven+0.013
 tide+0.013
irl+0.013
Neg logit contributions
 cousins-0.017
 granddaughter-0.016
 nephew-0.016
 owners-0.016
 From-0.015
Tokenay
Token ID323
Feature activation-3.18e+00

Pos logit contributions
gh+0.264
ners+0.262
irl+0.248
 tide+0.244
ven+0.232
Neg logit contributions
 cousins-0.186
 granddaughter-0.180
 From-0.172
 nephew-0.168
 owners-0.159
Token!"
Token ID2474
Feature activation+4.92e-01

Pos logit contributions
ners+0.086
 rou+0.084
irl+0.081
 tide+0.078
gh+0.078
Neg logit contributions
 cousins-0.093
 granddaughter-0.082
 nephew-0.079
 reunited-0.077
 aunt-0.077
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
 cousins+0.012
 nephew+0.012
 granddaughter+0.011
 prisoners+0.011
 owners+0.011
Neg logit contributions
gh-0.015
ners-0.015
irl-0.014
about-0.014
h-0.014
Token
Token ID198
Feature activation+1.32e+00

Pos logit contributions
 cousins+0.038
 nephew+0.036
ringe+0.036
 granddaughter+0.035
 client+0.034
Neg logit contributions
gh-0.038
irl-0.034
ners-0.034
h-0.033
cl-0.032
TokenJ
Token ID41
Feature activation-7.77e+00

Pos logit contributions
 cousins+0.038
 granddaughter+0.036
 nephew+0.035
 owners+0.034
 From+0.034
Neg logit contributions
gh-0.038
ners-0.036
ven-0.033
 tide-0.033
irl-0.032
Tokenenny
Token ID11870
Feature activation-1.16e+00

Pos logit contributions
gh+0.283
ners+0.262
 tide+0.257
irl+0.257
ven+0.254
Neg logit contributions
 cousins-0.162
 granddaughter-0.160
 From-0.139
 nephew-0.139
 grandparents-0.130
Token's
Token ID338
Feature activation+2.51e+00

Pos logit contributions
gh+0.036
ners+0.035
irl+0.032
 tide+0.032
works+0.031
Neg logit contributions
 cousins-0.031
 granddaughter-0.030
 nephew-0.029
 From-0.029
 owners-0.029
Token mom
Token ID1995
Feature activation+2.98e+00

Pos logit contributions
ringe+0.060
 From+0.058
 Always+0.057
check+0.055
rolley+0.054
Neg logit contributions
gh-0.069
 time-0.064
 tide-0.062
ners-0.062
 competition-0.060
Tokenmy
Token ID1820
Feature activation+6.86e-01

Pos logit contributions
 cousins+0.058
 granddaughter+0.055
 nephew+0.052
 owners+0.049
 From+0.049
Neg logit contributions
gh-0.117
ners-0.113
 tide-0.106
irl-0.106
ven-0.103
Token asked
Token ID1965
Feature activation+4.59e+00

Pos logit contributions
 cousins+0.016
 granddaughter+0.016
 nephew+0.015
 owners+0.014
 From+0.014
Neg logit contributions
gh-0.024
ners-0.023
irl-0.022
 tide-0.021
ven-0.020