TOP ACTIVATIONS
MAX = 8.353

then , a big , boun cy ball floated by .
It was a big , boun cy ball ! When she
. It was shiny and boun cy and fun . But
up and down like a boun cy ball .
is big and round and boun cy . Can I hold
. It is big and boun cy . They grab it
she saw a big , boun cy ball ! Lily was
is shiny and round and boun cy . I like it
a shiny rock and a boun cy ball .
, boxes and even a boun cy ball .
. It was a big boun cy ball . The 3
. It was a big boun cy ball . The 3
thought for a moment before rep lying . " Yes I
He saw a big boun cy castle . He wanted
rubber and it was so boun cy . He had great
It was a big , boun cy ball ! The kid
they found a big , boun cy ball . They were
round , and ever so boun cy . Every day ,
too high and too w obb ly , and it fell
was small and red and boun cy . He went back

MIN ACTIVATIONS
MIN = -8.446

very kind and offered Hel io some of the oxygen from
she smiled and gave her daughter a big hug .
and the mum gave her daughter lots of hugs .
um smiled and gave her daughter another hug . ' That
was angry and asked her daughter why she had done something
happy that he gave his wife a big hug , and
be fun to give the lady a big hug . So
so they gave their little girl a measuring tape . She
her feet and handed the woman a piece . They both
friend suggested they ask the stranger what his name was .
him . He gave his son a big hug and thanked
ran to him and gave him a big hug . They
knowing he had given his daughter an interesting gift that would
she decided to give her daughter a teaspoon . The little
. He then gave her daughter some medicine to take every
ummy and Daddy took their daughter to the beach . It
sorry to Ben and gave him back the shiny gold block
to the boy and give him her sc ooter as a
decided to bring their little child with them to the station
ran to Max and picked him up . She checked his

INTERVAL 4.153 - 8.353
CONTAINS 0.926%

The young boy felt so sad ! He tried to patch
a time , in a small house , lived a funny
living room and saw a big , filthy cushion . She
. The bear was so surprised it just stopped still watching
L ily laughed and cl apped her hands . She

INTERVAL -0.046 - 4.153
CONTAINS 53.120%

and Lily were scared . They did not know what the
knew the mine had more to discover . He couldn 't
" Hey , you kids ! Get away from
" L ily , if you take a coin , you
Jack hopped and hopped until he found a pretty mushroom .

INTERVAL -4.246 - -0.046
CONTAINS 44.540%

take it from her . " Hey , give
Ben said yes . They shared their coal . They taught
was still messy . She was very quiet and didn 't
her to her beautiful self . <|endoftext|> Once upon a time
the math . He explains the numbers and signs . He

INTERVAL -8.446 - -4.246
CONTAINS 1.415%

decided to not put the bird in the cage anymore .
scared , but you helped me be brave ." Tim my
Can we stop soon ?" Her mom my said , "
a good friend to help him along the way . <|endoftext|>
mom asked her to help her with a task . She
Token then
Token ID788
Feature activation+2.29e+00

Pos logit contributions
 score+0.003
Run+0.003
Stop+0.003
OK+0.002
Mine+0.002
Neg logit contributions
aj-0.005
iqu-0.005
ppa-0.005
oll-0.005
omm-0.005
Token,
Token ID11
Feature activation-8.18e-01

Pos logit contributions
Stop+0.027
Mine+0.026
 score+0.025
Run+0.024
!"+0.023
Neg logit contributions
ppa-0.062
 grandmother-0.060
 granddaughter-0.059
aj-0.059
oll-0.059
Token a
Token ID257
Feature activation+2.84e+00

Pos logit contributions
ppa+0.022
aj+0.021
oll+0.021
omm+0.021
iqu+0.020
Neg logit contributions
Stop-0.010
 score-0.010
Run-0.010
!"-0.010
Mine-0.010
Token big
Token ID1263
Feature activation+6.02e+00

Pos logit contributions
Mine+0.048
Stop+0.048
 score+0.045
atch+0.044
 hill+0.044
Neg logit contributions
ppa-0.061
 grandmother-0.060
 granddaughter-0.060
apa-0.057
aj-0.055
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
Mine+0.115
ok+0.103
Stop+0.102
atch+0.100
OK+0.099
Neg logit contributions
 grandmother-0.121
 Mother-0.120
 granddaughter-0.119
 pap-0.119
 mother-0.118
Token boun
Token ID31283
Feature activation+8.35e+00

Pos logit contributions
 granddaughter+0.004
 grandmother+0.004
 Mother+0.003
ppa+0.003
 parents+0.003
Neg logit contributions
Mine-0.003
atch-0.003
Stop-0.003
 score-0.003
ok-0.003
Tokency
Token ID948
Feature activation+8.08e-01

Pos logit contributions
 wants+0.124
 AW+0.123
Stop+0.122
www+0.120
Mine+0.115
Neg logit contributions
oll-0.154
ascal-0.149
 grandmother-0.148
ppa-0.146
isk-0.145
Token ball
Token ID2613
Feature activation+1.50e+00

Pos logit contributions
Stop+0.013
Mine+0.012
OK+0.012
ok+0.011
www+0.011
Neg logit contributions
 granddaughter-0.017
 grandmother-0.017
 parents-0.017
 owners-0.017
 throat-0.016
Token floated
Token ID29742
Feature activation+2.00e+00

Pos logit contributions
Mine+0.029
www+0.028
Stop+0.028
OK+0.028
Wow+0.025
Neg logit contributions
oll-0.025
ovable-0.024
icked-0.024
 owners-0.024
ppa-0.023
Token by
Token ID416
Feature activation+4.12e-01

Pos logit contributions
Mine+0.033
Stop+0.032
www+0.032
 score+0.031
 challenge+0.031
Neg logit contributions
ppa-0.040
 grandmother-0.039
 exactly-0.039
ocular-0.039
 granddaughter-0.039
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
Mine+0.007
Stop+0.007
OK+0.006
www+0.006
 score+0.006
Neg logit contributions
ppa-0.009
 grandmother-0.009
 granddaughter-0.009
apa-0.008
aj-0.008
Token It
Token ID632
Feature activation+2.55e+00

Pos logit contributions
Mine+0.000
Stop+0.000
 hill+0.000
OK+0.000
 score+0.000
Neg logit contributions
ppa-0.000
urger-0.000
rina-0.000
aj-0.000
oll-0.000
Token was
Token ID373
Feature activation-1.50e+00

Pos logit contributions
Mine+0.043
Stop+0.042
 score+0.041
atch+0.039
 hill+0.039
Neg logit contributions
ppa-0.053
 grandmother-0.053
 granddaughter-0.052
apa-0.051
oll-0.049
Token a
Token ID257
Feature activation+1.98e+00

Pos logit contributions
ppa+0.033
 grandmother+0.031
 granddaughter+0.031
apa+0.030
aj+0.030
Neg logit contributions
Mine-0.026
Stop-0.026
 score-0.025
 race-0.024
 wind-0.024
Token big
Token ID1263
Feature activation+5.62e+00

Pos logit contributions
Stop+0.035
Mine+0.034
atch+0.033
ok+0.032
Run+0.031
Neg logit contributions
 grandmother-0.042
 granddaughter-0.040
ppa-0.039
 parents-0.037
apa-0.037
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
Stop+0.108
ok+0.105
Mine+0.100
Give+0.096
atch+0.095
Neg logit contributions
 grandmother-0.111
 granddaughter-0.101
 throat-0.099
 aunt-0.096
 Mother-0.096
Token boun
Token ID31283
Feature activation+8.18e+00

Pos logit contributions
 granddaughter+0.021
 grandmother+0.021
ppa+0.020
aj+0.019
oll+0.019
Neg logit contributions
Stop-0.017
Mine-0.017
atch-0.016
 score-0.015
OK-0.015
Tokency
Token ID948
Feature activation+3.46e-01

Pos logit contributions
 wants+0.140
Stop+0.138
 AW+0.133
 tries+0.132
 blows+0.131
Neg logit contributions
oll-0.140
 grandmother-0.126
uffy-0.124
ppa-0.124
aj-0.124
Token ball
Token ID2613
Feature activation+1.04e+00

Pos logit contributions
Stop+0.006
OK+0.005
Check+0.005
Mine+0.005
ok+0.005
Neg logit contributions
 grandmother-0.007
 granddaughter-0.007
 parents-0.007
 owners-0.007
 throat-0.007
Token!
Token ID0
Feature activation+2.08e-01

Pos logit contributions
Stop+0.022
Mine+0.020
OK+0.020
Hey+0.019
Check+0.018
Neg logit contributions
oll-0.019
 grandmother-0.017
ppa-0.017
tr-0.017
 granddaughter-0.017
Token When
Token ID1649
Feature activation+1.39e+00

Pos logit contributions
Mine+0.004
Stop+0.003
OK+0.003
 score+0.003
 hill+0.003
Neg logit contributions
ppa-0.005
 grandmother-0.004
urger-0.004
aj-0.004
 patients-0.004
Token she
Token ID673
Feature activation+1.75e+00

Pos logit contributions
Stop+0.021
Mine+0.021
 score+0.020
 hill+0.019
OK+0.019
Neg logit contributions
ppa-0.033
 grandmother-0.032
 granddaughter-0.032
aj-0.031
apa-0.031
Token.
Token ID13
Feature activation+6.16e-01

Pos logit contributions
ppa+0.009
 grandmother+0.009
 granddaughter+0.009
apa+0.009
aj+0.009
Neg logit contributions
Mine-0.005
 score-0.005
Stop-0.005
 hill-0.004
 race-0.004
Token It
Token ID632
Feature activation+2.14e+00

Pos logit contributions
Mine+0.009
Stop+0.009
OK+0.008
 score+0.008
 hill+0.008
Neg logit contributions
ppa-0.016
 grandmother-0.015
 granddaughter-0.015
apa-0.015
oll-0.015
Token was
Token ID373
Feature activation-2.29e+00

Pos logit contributions
Mine+0.031
Stop+0.031
 score+0.030
OK+0.028
 hill+0.028
Neg logit contributions
ppa-0.052
 grandmother-0.051
 granddaughter-0.050
apa-0.049
aj-0.048
Token shiny
Token ID22441
Feature activation+3.96e+00

Pos logit contributions
ppa+0.060
apa+0.059
 grandmother+0.057
uf+0.057
lor+0.057
Neg logit contributions
 score-0.029
 wind-0.028
 race-0.028
 kick-0.027
Mine-0.027
Token and
Token ID290
Feature activation-2.92e-01

Pos logit contributions
Mine+0.068
Stop+0.067
 score+0.061
www+0.061
Run+0.061
Neg logit contributions
 grandmother-0.085
 granddaughter-0.085
ppa-0.085
oll-0.084
aj-0.080
Token boun
Token ID31283
Feature activation+8.16e+00

Pos logit contributions
ppa+0.006
 grandmother+0.006
 granddaughter+0.006
aj+0.006
oll+0.006
Neg logit contributions
Stop-0.005
Mine-0.004
atch-0.004
 score-0.004
Run-0.004
Tokency
Token ID948
Feature activation-1.51e-01

Pos logit contributions
 challenge+0.107
 race+0.105
Stop+0.104
 hill+0.103
 score+0.103
Neg logit contributions
oll-0.190
ppa-0.186
aj-0.180
 granddaughter-0.179
 grandmother-0.178
Token and
Token ID290
Feature activation-9.83e-01

Pos logit contributions
ppa+0.003
 grandmother+0.003
 granddaughter+0.003
oll+0.003
aj+0.003
Neg logit contributions
Stop-0.003
Mine-0.003
Run-0.002
www-0.002
 score-0.002
Token fun
Token ID1257
Feature activation+2.10e+00

Pos logit contributions
ppa+0.022
 grandmother+0.022
 granddaughter+0.021
oll+0.021
apa+0.020
Neg logit contributions
Stop-0.015
Mine-0.015
 score-0.015
 hill-0.014
atch-0.014
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
Stop+0.046
Run+0.044
Mine+0.043
Hey+0.042
www+0.042
Neg logit contributions
 grandmother-0.035
 granddaughter-0.034
 parents-0.033
 owners-0.032
 patients-0.032
Token But
Token ID887
Feature activation+5.53e-01

Pos logit contributions
Stop+0.004
Mine+0.004
OK+0.004
Wow+0.003
No+0.003
Neg logit contributions
ppa-0.007
 grandmother-0.007
oll-0.006
apa-0.006
rina-0.006
Token up
Token ID510
Feature activation+1.11e+00

Pos logit contributions
Mine+0.042
Stop+0.039
OK+0.039
www+0.037
about+0.037
Neg logit contributions
ppa-0.038
 grandmother-0.037
 granddaughter-0.037
mary-0.036
iqu-0.036
Token and
Token ID290
Feature activation+1.23e+00

Pos logit contributions
Mine+0.026
Stop+0.025
www+0.024
Check+0.024
OK+0.023
Neg logit contributions
 grandmother-0.016
 exactly-0.016
ppa-0.015
 granddaughter-0.014
aj-0.014
Token down
Token ID866
Feature activation+2.09e+00

Pos logit contributions
Mine+0.018
 score+0.017
ok+0.017
www+0.017
atch+0.017
Neg logit contributions
 grandmother-0.027
 Chick-0.026
ppa-0.026
 granddaughter-0.026
 parents-0.026
Token like
Token ID588
Feature activation-2.09e+00

Pos logit contributions
Mine+0.047
www+0.047
Stop+0.044
ww+0.044
atch+0.044
Neg logit contributions
 exactly-0.029
 grandmother-0.029
 granddaughter-0.025
 parents-0.025
 Mother-0.025
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
ppa+0.058
aj+0.054
apa+0.053
 grandmother+0.053
rina+0.052
Neg logit contributions
Mine-0.027
Stop-0.027
 race-0.026
 hill-0.026
 wind-0.025
Token boun
Token ID31283
Feature activation+8.15e+00

Pos logit contributions
Stop+0.017
Mine+0.017
atch+0.017
ok+0.015
Run+0.015
Neg logit contributions
 grandmother-0.026
 granddaughter-0.025
ppa-0.024
 parents-0.023
 patients-0.023
Tokency
Token ID948
Feature activation+3.65e-01

Pos logit contributions
Stop+0.112
Run+0.107
Mine+0.103
www+0.100
 wants+0.098
Neg logit contributions
oll-0.180
 grandmother-0.176
ppa-0.175
aj-0.174
isk-0.172
Token ball
Token ID2613
Feature activation+9.11e-01

Pos logit contributions
Stop+0.006
Mine+0.006
atch+0.005
Check+0.005
OK+0.005
Neg logit contributions
 granddaughter-0.008
aj-0.007
 throat-0.007
 parents-0.007
 grandmother-0.007
Token.
Token ID13
Feature activation-5.72e-01

Pos logit contributions
Stop+0.022
Mine+0.021
OK+0.021
www+0.020
Wow+0.019
Neg logit contributions
oll-0.013
tr-0.013
 grandmother-0.012
 granddaughter-0.012
ppa-0.012
Token
Token ID220
Feature activation-1.18e+00

Pos logit contributions
ppa+0.011
 grandmother+0.010
 granddaughter+0.010
apa+0.010
aj+0.010
Neg logit contributions
Mine-0.011
Stop-0.011
 score-0.011
 hill-0.010
OK-0.010
Token
Token ID198
Feature activation-6.96e-01

Pos logit contributions
ppa+0.030
 granddaughter+0.029
 grandmother+0.029
 patients+0.028
 former+0.028
Neg logit contributions
!"-0.015
Stop-0.015
Mine-0.015
OK-0.014
 hill-0.014
Token is
Token ID318
Feature activation-1.43e+00

Pos logit contributions
Mine+0.028
Stop+0.027
Run+0.026
 slope+0.025
 score+0.025
Neg logit contributions
ppa-0.038
oll-0.038
 grandmother-0.038
 granddaughter-0.037
tr-0.037
Token big
Token ID1263
Feature activation+3.04e+00

Pos logit contributions
ppa+0.035
 grandmother+0.034
 granddaughter+0.034
apa+0.034
 patients+0.032
Neg logit contributions
Mine-0.020
 score-0.020
Stop-0.019
!"-0.019
 race-0.019
Token and
Token ID290
Feature activation-9.67e-01

Pos logit contributions
Mine+0.058
Stop+0.056
OK+0.051
Run+0.051
 score+0.051
Neg logit contributions
 granddaughter-0.062
 grandmother-0.060
oll-0.059
aj-0.057
ppa-0.057
Token round
Token ID2835
Feature activation+3.46e+00

Pos logit contributions
ppa+0.020
 grandmother+0.020
 granddaughter+0.020
 Chick+0.019
aj+0.019
Neg logit contributions
Stop-0.016
Mine-0.015
atch-0.014
about-0.014
ok-0.014
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
Mine+0.071
Stop+0.069
 score+0.066
Run+0.066
 challenge+0.066
Neg logit contributions
oll-0.059
 granddaughter-0.055
aj-0.054
 patients-0.053
ppa-0.053
Token boun
Token ID31283
Feature activation+8.06e+00

Pos logit contributions
ppa+0.031
 Chick+0.031
 grandmother+0.031
 granddaughter+0.030
oll+0.030
Neg logit contributions
Stop-0.023
Mine-0.022
about-0.021
atch-0.021
ok-0.020
Tokency
Token ID948
Feature activation-4.07e-01

Pos logit contributions
 challenge+0.131
Run+0.128
 race+0.126
 slope+0.124
 hill+0.123
Neg logit contributions
oll-0.155
uggets-0.147
aj-0.145
ppa-0.144
ingu-0.137
Token.
Token ID13
Feature activation+8.55e-01

Pos logit contributions
oll+0.007
ppa+0.007
 granddaughter+0.007
aj+0.007
 parents+0.007
Neg logit contributions
Stop-0.008
Mine-0.008
Run-0.007
OK-0.007
Hey-0.007
Token Can
Token ID1680
Feature activation+8.33e-01

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.008
OK+0.008
Run+0.008
Neg logit contributions
ppa-0.025
 grandmother-0.023
 granddaughter-0.023
apa-0.023
aj-0.022
Token I
Token ID314
Feature activation+8.10e-01

Pos logit contributions
Stop+0.008
Mine+0.008
 score+0.008
 hill+0.007
atch+0.007
Neg logit contributions
 grandmother-0.023
ppa-0.023
 granddaughter-0.022
apa-0.022
oll-0.021
Token hold
Token ID1745
Feature activation+2.14e-01

Pos logit contributions
Stop+0.011
Run+0.011
Mine+0.011
www+0.010
 slope+0.010
Neg logit contributions
 grandmother-0.019
ppa-0.019
 granddaughter-0.019
aj-0.018
oll-0.017
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
Stop+0.001
Mine+0.001
www+0.001
Hey+0.001
OK+0.001
Neg logit contributions
 grandmother-0.001
 granddaughter-0.001
ppa-0.001
oll-0.001
tr-0.001
Token It
Token ID632
Feature activation+1.99e+00

Pos logit contributions
Mine+0.020
Stop+0.019
 score+0.019
Run+0.018
OK+0.018
Neg logit contributions
ppa-0.032
 grandmother-0.029
oll-0.028
rina-0.028
 granddaughter-0.028
Token is
Token ID318
Feature activation-2.11e+00

Pos logit contributions
Stop+0.030
www+0.029
 gol+0.028
Mine+0.028
ok+0.027
Neg logit contributions
aj-0.043
mary-0.042
oll-0.041
 grandmother-0.041
 parents-0.041
Token big
Token ID1263
Feature activation+2.65e+00

Pos logit contributions
ppa+0.050
 grandmother+0.048
 granddaughter+0.048
apa+0.047
oll+0.046
Neg logit contributions
Mine-0.032
Stop-0.032
 score-0.031
 hill-0.030
OK-0.029
Token and
Token ID290
Feature activation-3.39e-01

Pos logit contributions
Mine+0.050
Stop+0.048
OK+0.045
www+0.044
 score+0.044
Neg logit contributions
 granddaughter-0.055
 grandmother-0.054
oll-0.052
ppa-0.052
aj-0.051
Token boun
Token ID31283
Feature activation+8.04e+00

Pos logit contributions
 grandmother+0.007
 granddaughter+0.007
aj+0.007
ppa+0.007
 Mother+0.007
Neg logit contributions
Stop-0.005
Mine-0.005
atch-0.005
 score-0.005
about-0.005
Tokency
Token ID948
Feature activation-1.02e-01

Pos logit contributions
 challenge+0.121
 race+0.115
www+0.114
 slope+0.112
Run+0.111
Neg logit contributions
oll-0.169
aj-0.157
ppa-0.156
uggets-0.152
 grandmother-0.152
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
ppa+0.002
 granddaughter+0.002
 grandmother+0.002
oll+0.002
aj+0.002
Neg logit contributions
Stop-0.002
Mine-0.002
Hey-0.002
OK-0.002
www-0.002
Token They
Token ID1119
Feature activation+1.13e+00

Pos logit contributions
Stop+0.010
Mine+0.010
Run+0.009
OK+0.009
Wow+0.009
Neg logit contributions
ppa-0.022
rina-0.020
urger-0.020
 grandmother-0.020
oll-0.020
Token grab
Token ID5552
Feature activation+1.63e+00

Pos logit contributions
www+0.018
Stop+0.016
ww+0.016
atch+0.016
Run+0.015
Neg logit contributions
ppa-0.024
 parents-0.024
 grandmother-0.024
 patients-0.023
oll-0.023
Token it
Token ID340
Feature activation-2.46e+00

Pos logit contributions
www+0.031
Stop+0.029
atch+0.029
 hill+0.029
 challenge+0.028
Neg logit contributions
 granddaughter-0.029
apa-0.028
 grandmother-0.028
 parents-0.028
 Chick-0.027
Token she
Token ID673
Feature activation+7.83e-01

Pos logit contributions
ppa+0.010
urger+0.010
aj+0.010
omm+0.010
uf+0.010
Neg logit contributions
Stop-0.004
 hill-0.004
Mine-0.004
Run-0.003
No-0.003
Token saw
Token ID2497
Feature activation-1.57e+00

Pos logit contributions
 score+0.006
Stop+0.005
 hill+0.005
Mine+0.005
 wants+0.005
Neg logit contributions
ppa-0.024
 granddaughter-0.024
 grandmother-0.023
omm-0.023
oll-0.023
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
aj+0.045
ppa+0.045
urger+0.043
ascal+0.042
 former+0.042
Neg logit contributions
 it-0.019
Stop-0.019
Mine-0.018
 turns-0.017
!"-0.016
Token big
Token ID1263
Feature activation+5.43e+00

Pos logit contributions
Stop+0.037
Mine+0.036
atch+0.036
ok+0.034
Run+0.033
Neg logit contributions
 grandmother-0.042
 granddaughter-0.040
ppa-0.038
aj-0.036
 parents-0.035
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
ok+0.118
Stop+0.115
OK+0.109
Mine+0.107
atch+0.106
Neg logit contributions
 grandmother-0.098
 granddaughter-0.088
 aunt-0.087
 throat-0.086
 dentist-0.085
Token boun
Token ID31283
Feature activation+8.03e+00

Pos logit contributions
 granddaughter+0.031
 grandmother+0.031
ppa+0.030
aj+0.029
oll+0.028
Neg logit contributions
Mine-0.023
Stop-0.023
atch-0.022
 score-0.021
OK-0.021
Tokency
Token ID948
Feature activation-1.52e-01

Pos logit contributions
 wants+0.126
Stop+0.126
Run+0.123
 AW+0.123
OK+0.120
Neg logit contributions
 grandmother-0.141
oll-0.140
 aunt-0.133
isk-0.132
ppa-0.130
Token ball
Token ID2613
Feature activation+5.09e-01

Pos logit contributions
 grandmother+0.003
 granddaughter+0.003
 parents+0.003
tr+0.003
 owners+0.003
Neg logit contributions
Stop-0.003
ok-0.002
OK-0.002
atch-0.002
Mine-0.002
Token!
Token ID0
Feature activation+5.04e-02

Pos logit contributions
Stop+0.012
OK+0.012
Mine+0.011
www+0.011
 turns+0.010
Neg logit contributions
oll-0.008
tr-0.007
 exactly-0.006
ocular-0.006
 grandmother-0.006
Token Lily
Token ID20037
Feature activation+2.85e-01

Pos logit contributions
Mine+0.001
Stop+0.001
 score+0.001
 hill+0.001
OK+0.001
Neg logit contributions
ppa-0.001
rina-0.001
 grandmother-0.001
urger-0.001
 patients-0.001
Token was
Token ID373
Feature activation-2.75e+00

Pos logit contributions
Stop+0.005
Mine+0.005
 score+0.005
Run+0.005
 hill+0.005
Neg logit contributions
ppa-0.006
 grandmother-0.005
 granddaughter-0.005
aj-0.005
apa-0.005
Token is
Token ID318
Feature activation-1.55e+00

Pos logit contributions
Mine+0.017
Stop+0.017
 slope+0.015
 score+0.015
OK+0.015
Neg logit contributions
ppa-0.026
oll-0.025
 grandmother-0.025
 granddaughter-0.025
apa-0.024
Token shiny
Token ID22441
Feature activation+4.09e+00

Pos logit contributions
ppa+0.041
 grandmother+0.040
 granddaughter+0.040
apa+0.039
aj+0.038
Neg logit contributions
Mine-0.019
Stop-0.019
 score-0.018
 hill-0.017
OK-0.017
Token and
Token ID290
Feature activation-7.69e-01

Pos logit contributions
Mine+0.084
Stop+0.081
OK+0.075
Run+0.074
 score+0.074
Neg logit contributions
oll-0.076
ppa-0.072
aj-0.071
 granddaughter-0.071
 grandmother-0.070
Token round
Token ID2835
Feature activation+3.45e+00

Pos logit contributions
ppa+0.015
 grandmother+0.014
aj+0.014
oll+0.014
 granddaughter+0.014
Neg logit contributions
Stop-0.014
about-0.013
Mine-0.013
ok-0.012
Run-0.012
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
Mine+0.070
Stop+0.069
www+0.066
OK+0.066
Run+0.066
Neg logit contributions
oll-0.059
ppa-0.055
 granddaughter-0.055
aj-0.054
 grandmother-0.052
Token boun
Token ID31283
Feature activation+8.01e+00

Pos logit contributions
ppa+0.021
 grandmother+0.021
oll+0.020
aj+0.020
 granddaughter+0.020
Neg logit contributions
Stop-0.016
Mine-0.015
about-0.015
ok-0.014
Run-0.014
Tokency
Token ID948
Feature activation-6.89e-01

Pos logit contributions
 challenge+0.128
 race+0.125
www+0.123
 hill+0.123
 slope+0.121
Neg logit contributions
oll-0.155
ppa-0.144
uggets-0.143
aj-0.139
ovable-0.137
Token.
Token ID13
Feature activation+7.39e-01

Pos logit contributions
ppa+0.013
oll+0.013
 granddaughter+0.013
aj+0.012
 parents+0.012
Neg logit contributions
Stop-0.013
Mine-0.012
OK-0.012
Hey-0.012
Run-0.012
Token I
Token ID314
Feature activation+2.18e+00

Pos logit contributions
Mine+0.008
Stop+0.008
 score+0.007
OK+0.007
Run+0.007
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.020
apa-0.020
aj-0.020
Token like
Token ID588
Feature activation-3.95e-01

Pos logit contributions
 hill+0.031
Stop+0.030
 slope+0.030
Mine+0.029
www+0.028
Neg logit contributions
ppa-0.049
 granddaughter-0.048
oll-0.048
isk-0.047
 grandmother-0.047
Token it
Token ID340
Feature activation-9.17e-01

Pos logit contributions
 grandmother+0.010
 granddaughter+0.010
ppa+0.009
 parents+0.009
 patients+0.009
Neg logit contributions
Stop-0.005
OK-0.005
Mine-0.005
Run-0.005
ok-0.005
Token a
Token ID257
Feature activation+7.06e-01

Pos logit contributions
ppa+0.042
apa+0.038
 grandmother+0.037
rina+0.037
uf+0.037
Neg logit contributions
Mine-0.026
 hill-0.025
Stop-0.024
 race-0.024
 score-0.024
Token shiny
Token ID22441
Feature activation+4.20e+00

Pos logit contributions
Stop+0.012
Mine+0.012
atch+0.011
Run+0.011
 score+0.011
Neg logit contributions
 grandmother-0.015
 granddaughter-0.015
ppa-0.015
aj-0.014
apa-0.014
Token rock
Token ID3881
Feature activation+2.69e-01

Pos logit contributions
Stop+0.084
Run+0.080
ok+0.078
Mine+0.075
atch+0.075
Neg logit contributions
 grandmother-0.086
 granddaughter-0.080
tr-0.074
aj-0.073
 aunt-0.073
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
Stop+0.005
Mine+0.005
OK+0.004
Run+0.004
No+0.004
Neg logit contributions
ppa-0.006
 grandmother-0.006
 granddaughter-0.005
oll-0.005
aj-0.005
Token a
Token ID257
Feature activation+1.19e-01

Pos logit contributions
ppa+0.033
apa+0.032
 grandmother+0.031
omm+0.031
 granddaughter+0.030
Neg logit contributions
Mine-0.025
 score-0.024
Stop-0.024
 race-0.024
 hill-0.023
Token boun
Token ID31283
Feature activation+7.97e+00

Pos logit contributions
Stop+0.002
Mine+0.002
 score+0.002
atch+0.002
OK+0.002
Neg logit contributions
 grandmother-0.003
ppa-0.003
 granddaughter-0.003
aj-0.002
apa-0.002
Tokency
Token ID948
Feature activation-5.43e-01

Pos logit contributions
Stop+0.119
Run+0.118
www+0.107
 blows+0.104
 wants+0.102
Neg logit contributions
 grandmother-0.170
oll-0.165
ppa-0.162
 granddaughter-0.162
aj-0.161
Token ball
Token ID2613
Feature activation+3.78e-01

Pos logit contributions
 grandmother+0.010
 granddaughter+0.010
aj+0.009
tr+0.009
urger+0.009
Neg logit contributions
Stop-0.012
atch-0.011
Mine-0.010
Run-0.010
OK-0.010
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
Stop+0.009
Mine+0.008
OK+0.008
Run+0.008
www+0.007
Neg logit contributions
oll-0.006
 grandmother-0.006
tr-0.006
 granddaughter-0.006
ppa-0.006
Token
Token ID198
Feature activation-8.14e-01

Pos logit contributions
ppa+0.007
 grandmother+0.007
 granddaughter+0.006
apa+0.006
aj+0.006
Neg logit contributions
Mine-0.006
Stop-0.006
 score-0.006
 hill-0.006
OK-0.006
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
ppa+0.017
 grandmother+0.017
 granddaughter+0.017
oll+0.016
apa+0.016
Neg logit contributions
Stop-0.014
Mine-0.014
 score-0.013
 hill-0.013
!"-0.013
Token,
Token ID11
Feature activation-2.51e+00

Pos logit contributions
ppa+0.013
 grandmother+0.013
 granddaughter+0.012
oll+0.012
apa+0.012
Neg logit contributions
Mine-0.009
Stop-0.009
 score-0.008
Run-0.008
OK-0.008
Token boxes
Token ID10559
Feature activation+4.35e-01

Pos logit contributions
ppa+0.058
 grandmother+0.056
 granddaughter+0.054
apa+0.054
oll+0.053
Neg logit contributions
Mine-0.039
Stop-0.039
 score-0.038
 hill-0.037
 race-0.037
Token and
Token ID290
Feature activation-4.31e-01

Pos logit contributions
Stop+0.007
Mine+0.007
Run+0.006
 score+0.006
OK+0.006
Neg logit contributions
 grandmother-0.010
ppa-0.010
 granddaughter-0.010
apa-0.010
aj-0.009
Token even
Token ID772
Feature activation+4.86e-01

Pos logit contributions
ppa+0.010
 grandmother+0.009
apa+0.009
oll+0.009
aj+0.009
Neg logit contributions
Mine-0.007
Stop-0.007
 hill-0.007
 score-0.007
 race-0.007
Token a
Token ID257
Feature activation+6.42e-01

Pos logit contributions
 race+0.007
 score+0.007
Stop+0.007
 turns+0.007
 wind+0.007
Neg logit contributions
ppa-0.012
omm-0.012
 grandmother-0.011
apa-0.011
uf-0.011
Token boun
Token ID31283
Feature activation+7.95e+00

Pos logit contributions
Stop+0.010
Mine+0.010
 score+0.010
 hill+0.009
OK+0.009
Neg logit contributions
ppa-0.014
 grandmother-0.014
 granddaughter-0.014
apa-0.013
aj-0.013
Tokency
Token ID948
Feature activation-3.41e-01

Pos logit contributions
Stop+0.119
Run+0.117
 wants+0.108
 tries+0.107
www+0.106
Neg logit contributions
 grandmother-0.168
oll-0.164
aj-0.163
uggets-0.162
 granddaughter-0.162
Token ball
Token ID2613
Feature activation+6.72e-01

Pos logit contributions
 grandmother+0.007
 granddaughter+0.007
ppa+0.007
urger+0.007
aj+0.007
Neg logit contributions
Stop-0.006
Mine-0.006
OK-0.005
Run-0.005
Check-0.005
Token.
Token ID13
Feature activation-6.31e-01

Pos logit contributions
Stop+0.013
Mine+0.013
OK+0.012
Run+0.012
Hey+0.011
Neg logit contributions
 grandmother-0.013
oll-0.013
ppa-0.013
 granddaughter-0.012
apa-0.012
Token
Token ID198
Feature activation-1.14e+00

Pos logit contributions
ppa+0.014
 grandmother+0.013
aj+0.013
 granddaughter+0.013
oll+0.013
Neg logit contributions
Mine-0.011
Stop-0.010
 hill-0.010
 score-0.010
OK-0.010
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
ppa+0.026
 grandmother+0.025
 granddaughter+0.025
oll+0.024
 patients+0.024
Neg logit contributions
Mine-0.018
Stop-0.018
 score-0.017
OK-0.017
 hill-0.017
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
Mine+0.029
Stop+0.029
 score+0.028
 hill+0.026
OK+0.026
Neg logit contributions
ppa-0.047
 grandmother-0.047
 granddaughter-0.046
apa-0.044
aj-0.044
Token It
Token ID632
Feature activation+2.86e+00

Pos logit contributions
Mine+0.021
Stop+0.020
 score+0.019
OK+0.018
 hill+0.018
Neg logit contributions
ppa-0.032
 grandmother-0.029
 granddaughter-0.028
aj-0.028
apa-0.028
Token was
Token ID373
Feature activation-1.73e+00

Pos logit contributions
Stop+0.046
Mine+0.046
 score+0.044
 hill+0.043
OK+0.042
Neg logit contributions
ppa-0.063
 grandmother-0.062
 granddaughter-0.061
apa-0.059
oll-0.058
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
ppa+0.043
aj+0.039
apa+0.039
 granddaughter+0.039
uf+0.039
Neg logit contributions
 score-0.025
!"-0.025
Mine-0.024
 race-0.023
 wind-0.023
Token big
Token ID1263
Feature activation+5.26e+00

Pos logit contributions
Stop+0.030
Mine+0.029
 score+0.028
atch+0.028
Run+0.028
Neg logit contributions
 grandmother-0.031
 granddaughter-0.030
ppa-0.030
apa-0.028
aj-0.027
Token boun
Token ID31283
Feature activation+7.92e+00

Pos logit contributions
Stop+0.111
Run+0.098
atch+0.095
Give+0.094
Mine+0.092
Neg logit contributions
 grandmother-0.114
 granddaughter-0.104
 aunt-0.097
 prayers-0.094
 parents-0.093
Tokency
Token ID948
Feature activation+4.04e-01

Pos logit contributions
Stop+0.129
Run+0.120
 wants+0.116
 AW+0.115
 blows+0.114
Neg logit contributions
oll-0.165
 grandmother-0.165
 granddaughter-0.157
ppa-0.152
aj-0.151
Token ball
Token ID2613
Feature activation+7.83e-01

Pos logit contributions
Stop+0.009
atch+0.008
Check+0.008
Run+0.008
OK+0.007
Neg logit contributions
 grandmother-0.008
 granddaughter-0.008
 parents-0.007
 cousins-0.007
 aunt-0.007
Token.
Token ID13
Feature activation+9.47e-01

Pos logit contributions
Stop+0.016
Mine+0.014
OK+0.014
Run+0.013
Hey+0.013
Neg logit contributions
oll-0.016
 grandmother-0.016
 granddaughter-0.015
ppa-0.015
tr-0.014
Token The
Token ID383
Feature activation+1.68e-01

Pos logit contributions
Mine+0.017
Stop+0.016
 score+0.016
 hill+0.015
OK+0.015
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.019
aj-0.019
apa-0.019
Token 3
Token ID513
Feature activation+5.33e-01

Pos logit contributions
Mine+0.003
Stop+0.003
 score+0.003
atch+0.003
 hill+0.003
Neg logit contributions
 grandmother-0.004
ppa-0.004
 granddaughter-0.004
apa-0.003
oll-0.003
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
Mine+0.029
Stop+0.029
 score+0.028
 hill+0.026
OK+0.026
Neg logit contributions
ppa-0.047
 grandmother-0.047
 granddaughter-0.046
apa-0.044
aj-0.044
Token It
Token ID632
Feature activation+2.86e+00

Pos logit contributions
Mine+0.021
Stop+0.020
 score+0.019
OK+0.018
 hill+0.018
Neg logit contributions
ppa-0.032
 grandmother-0.029
 granddaughter-0.028
aj-0.028
apa-0.028
Token was
Token ID373
Feature activation-1.73e+00

Pos logit contributions
Stop+0.046
Mine+0.046
 score+0.044
 hill+0.043
OK+0.042
Neg logit contributions
ppa-0.063
 grandmother-0.062
 granddaughter-0.061
apa-0.059
oll-0.058
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
ppa+0.043
aj+0.039
apa+0.039
 granddaughter+0.039
uf+0.039
Neg logit contributions
 score-0.025
!"-0.025
Mine-0.024
 race-0.023
 wind-0.023
Token big
Token ID1263
Feature activation+5.26e+00

Pos logit contributions
Stop+0.030
Mine+0.029
 score+0.028
atch+0.028
Run+0.028
Neg logit contributions
 grandmother-0.031
 granddaughter-0.030
ppa-0.030
apa-0.028
aj-0.027
Token boun
Token ID31283
Feature activation+7.92e+00

Pos logit contributions
Stop+0.111
Run+0.098
atch+0.095
Give+0.094
Mine+0.092
Neg logit contributions
 grandmother-0.114
 granddaughter-0.104
 aunt-0.097
 prayers-0.094
 parents-0.093
Tokency
Token ID948
Feature activation+4.04e-01

Pos logit contributions
Stop+0.129
Run+0.120
 wants+0.116
 AW+0.115
 blows+0.114
Neg logit contributions
oll-0.165
 grandmother-0.165
 granddaughter-0.157
ppa-0.152
aj-0.151
Token ball
Token ID2613
Feature activation+7.83e-01

Pos logit contributions
Stop+0.009
atch+0.008
Check+0.008
Run+0.008
OK+0.007
Neg logit contributions
 grandmother-0.008
 granddaughter-0.008
 parents-0.007
 cousins-0.007
 aunt-0.007
Token.
Token ID13
Feature activation+9.47e-01

Pos logit contributions
Stop+0.016
Mine+0.014
OK+0.014
Run+0.013
Hey+0.013
Neg logit contributions
oll-0.016
 grandmother-0.016
 granddaughter-0.015
ppa-0.015
tr-0.014
Token The
Token ID383
Feature activation+1.68e-01

Pos logit contributions
Mine+0.017
Stop+0.016
 score+0.016
 hill+0.015
OK+0.015
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.019
aj-0.019
apa-0.019
Token 3
Token ID513
Feature activation+5.33e-01

Pos logit contributions
Mine+0.003
Stop+0.003
 score+0.003
atch+0.003
 hill+0.003
Neg logit contributions
 grandmother-0.004
ppa-0.004
 granddaughter-0.004
apa-0.003
oll-0.003
Token thought
Token ID1807
Feature activation+1.41e+00

Pos logit contributions
Mine+0.016
Stop+0.016
 score+0.016
 hill+0.015
OK+0.015
Neg logit contributions
ppa-0.026
 grandmother-0.026
 granddaughter-0.026
apa-0.025
oll-0.024
Token for
Token ID329
Feature activation+2.66e-01

Pos logit contributions
Mine+0.021
Stop+0.021
 score+0.021
 hill+0.020
 race+0.019
Neg logit contributions
ppa-0.033
 grandmother-0.032
 granddaughter-0.032
aj-0.030
apa-0.030
Token a
Token ID257
Feature activation+2.56e+00

Pos logit contributions
 score+0.004
Mine+0.004
Stop+0.004
!"+0.004
 challenge+0.004
Neg logit contributions
ppa-0.006
aj-0.006
apa-0.006
oll-0.006
 granddaughter-0.006
Token moment
Token ID2589
Feature activation+7.12e-01

Pos logit contributions
Stop+0.024
Mine+0.022
atch+0.021
 score+0.019
 hill+0.019
Neg logit contributions
 grandmother-0.075
 granddaughter-0.074
ppa-0.072
apa-0.071
 patients-0.069
Token before
Token ID878
Feature activation+2.12e+00

Pos logit contributions
Mine+0.008
Stop+0.008
 score+0.008
 hill+0.007
OK+0.007
Neg logit contributions
ppa-0.019
 grandmother-0.019
 granddaughter-0.019
apa-0.018
aj-0.018
Token rep
Token ID1128
Feature activation+7.92e+00

Pos logit contributions
Stop+0.032
 score+0.031
Mine+0.031
 hill+0.030
Run+0.029
Neg logit contributions
ppa-0.050
 granddaughter-0.047
 grandmother-0.047
oll-0.047
aj-0.047
Tokenlying
Token ID3157
Feature activation-4.13e-01

Pos logit contributions
Stop+0.121
 hill+0.115
Run+0.113
Mine+0.112
 score+0.109
Neg logit contributions
ppa-0.184
apa-0.178
 grandmother-0.177
 granddaughter-0.177
aj-0.175
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
ppa+0.011
 granddaughter+0.011
 grandmother+0.011
aj+0.011
 patients+0.010
Neg logit contributions
 score-0.004
Mine-0.004
Stop-0.004
!"-0.004
 hill-0.004
Token "
Token ID366
Feature activation+2.49e+00

Pos logit contributions
ppa+0.003
 grandmother+0.003
 granddaughter+0.003
oll+0.003
aj+0.003
Neg logit contributions
Mine-0.003
Stop-0.003
 score-0.003
 hill-0.003
OK-0.003
TokenYes
Token ID5297
Feature activation+1.49e+00

Pos logit contributions
 hill+0.034
 score+0.033
 race+0.031
!"+0.031
Stop+0.031
Neg logit contributions
ppa-0.059
 granddaughter-0.058
 grandmother-0.058
apa-0.056
oll-0.054
Token I
Token ID314
Feature activation+2.46e+00

Pos logit contributions
Stop+0.020
Mine+0.020
 score+0.018
 hill+0.017
atch+0.017
Neg logit contributions
 grandmother-0.038
ppa-0.038
 granddaughter-0.037
apa-0.037
oll-0.036
Token
Token ID198
Feature activation+4.92e-01

Pos logit contributions
ppa+0.013
 grandmother+0.013
 granddaughter+0.013
apa+0.012
 patients+0.012
Neg logit contributions
Stop-0.009
Mine-0.009
 score-0.009
!"-0.009
 hill-0.009
TokenHe
Token ID1544
Feature activation+1.42e+00

Pos logit contributions
Mine+0.008
Stop+0.008
 score+0.008
 hill+0.008
 challenge+0.008
Neg logit contributions
ppa-0.010
 granddaughter-0.009
 grandmother-0.009
oll-0.009
aj-0.009
Token saw
Token ID2497
Feature activation-1.68e+00

Pos logit contributions
Stop+0.014
 score+0.014
Mine+0.014
 hill+0.013
 race+0.013
Neg logit contributions
ppa-0.040
 grandmother-0.039
 granddaughter-0.039
omm-0.039
oll-0.039
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
ppa+0.046
aj+0.043
omm+0.042
apa+0.042
uf+0.041
Neg logit contributions
 wind-0.024
Stop-0.023
Mine-0.022
 race-0.022
 turns-0.022
Token big
Token ID1263
Feature activation+5.27e+00

Pos logit contributions
Mine+0.036
Stop+0.036
atch+0.034
 score+0.032
ok+0.032
Neg logit contributions
 grandmother-0.046
 granddaughter-0.045
ppa-0.045
aj-0.042
 parents-0.041
Token boun
Token ID31283
Feature activation+7.92e+00

Pos logit contributions
ok+0.104
Stop+0.102
Mine+0.102
OK+0.096
atch+0.092
Neg logit contributions
 grandmother-0.108
 granddaughter-0.102
 aunt-0.098
 parents-0.097
 Mother-0.095
Tokency
Token ID948
Feature activation+1.41e-01

Pos logit contributions
Stop+0.127
Run+0.120
 wants+0.120
 AW+0.119
Mine+0.116
Neg logit contributions
 grandmother-0.152
oll-0.147
ppa-0.142
 granddaughter-0.141
 aunt-0.141
Token castle
Token ID16669
Feature activation-7.21e-01

Pos logit contributions
Stop+0.002
ok+0.002
OK+0.002
Mine+0.002
atch+0.002
Neg logit contributions
 grandmother-0.003
 granddaughter-0.003
 parents-0.003
 owners-0.003
ppa-0.003
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 grandmother+0.013
oll+0.013
tr+0.012
 granddaughter+0.012
ppa+0.012
Neg logit contributions
Mine-0.015
Stop-0.015
www-0.014
OK-0.013
Hey-0.013
Token He
Token ID679
Feature activation+1.18e+00

Pos logit contributions
Mine+0.018
Stop+0.018
 score+0.017
 hill+0.017
OK+0.016
Neg logit contributions
ppa-0.023
 grandmother-0.023
 granddaughter-0.022
apa-0.021
aj-0.021
Token wanted
Token ID2227
Feature activation+8.02e-01

Pos logit contributions
Stop+0.018
Mine+0.018
 score+0.017
atch+0.017
 hill+0.016
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
aj-0.024
apa-0.024
Token rubber
Token ID14239
Feature activation-1.48e+00

Pos logit contributions
 grandmother+0.008
 granddaughter+0.008
 parents+0.008
ppa+0.007
 owners+0.007
Neg logit contributions
Stop-0.007
atch-0.007
OK-0.007
www-0.007
ok-0.007
Token and
Token ID290
Feature activation+7.86e-01

Pos logit contributions
 parents+0.028
 owners+0.026
 grandmother+0.026
 throat+0.026
 Chick+0.025
Neg logit contributions
Stop-0.025
www-0.025
atch-0.024
OK-0.023
Check-0.023
Token it
Token ID340
Feature activation+1.83e+00

Pos logit contributions
Mine+0.013
Stop+0.013
atch+0.012
 score+0.012
ok+0.012
Neg logit contributions
ppa-0.017
 grandmother-0.017
 granddaughter-0.016
 parents-0.016
aj-0.015
Token was
Token ID373
Feature activation-2.00e+00

Pos logit contributions
Stop+0.032
Mine+0.032
 score+0.030
OK+0.029
atch+0.029
Neg logit contributions
ppa-0.038
 grandmother-0.037
 granddaughter-0.036
apa-0.036
oll-0.035
Token so
Token ID523
Feature activation+3.58e-01

Pos logit contributions
ppa+0.049
 grandmother+0.047
 granddaughter+0.046
apa+0.046
uf+0.045
Neg logit contributions
 score-0.029
!"-0.029
Mine-0.029
 wind-0.028
 race-0.027
Token boun
Token ID31283
Feature activation+7.90e+00

Pos logit contributions
Stop+0.007
 score+0.006
Mine+0.006
atch+0.006
Run+0.006
Neg logit contributions
oll-0.007
ppa-0.007
aj-0.006
 grandmother-0.006
apa-0.006
Tokency
Token ID948
Feature activation+5.06e-02

Pos logit contributions
 challenge+0.111
 race+0.111
Stop+0.109
 score+0.109
 hill+0.107
Neg logit contributions
oll-0.174
ppa-0.163
ovable-0.156
aj-0.153
isk-0.151
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
Stop+0.001
Mine+0.001
Hey+0.001
OK+0.001
Uh+0.001
Neg logit contributions
ppa-0.001
oll-0.001
 exactly-0.001
 parents-0.001
 grandmother-0.001
Token He
Token ID679
Feature activation+1.30e+00

Pos logit contributions
Mine+0.002
Stop+0.002
 score+0.002
 hill+0.002
OK+0.002
Neg logit contributions
ppa-0.003
 grandmother-0.003
 granddaughter-0.002
apa-0.002
oll-0.002
Token had
Token ID550
Feature activation+5.06e-02

Pos logit contributions
Stop+0.021
Mine+0.020
 score+0.019
 hill+0.019
atch+0.019
Neg logit contributions
ppa-0.030
 grandmother-0.028
 granddaughter-0.028
apa-0.027
aj-0.027
Token great
Token ID1049
Feature activation+1.22e+00

Pos logit contributions
Mine+0.001
Stop+0.001
 score+0.001
 hill+0.001
!"+0.001
Neg logit contributions
ppa-0.001
 grandmother-0.001
 granddaughter-0.001
apa-0.001
aj-0.001
TokenIt
Token ID1026
Feature activation+3.04e+00

Pos logit contributions
 score+0.005
 hill+0.005
 forth+0.005
 challenge+0.005
 tries+0.005
Neg logit contributions
ppa-0.004
apa-0.004
 granddaughter-0.004
 grandmother-0.004
aj-0.004
Token was
Token ID373
Feature activation-1.85e+00

Pos logit contributions
Mine+0.049
Stop+0.049
 score+0.047
 hill+0.045
OK+0.045
Neg logit contributions
ppa-0.068
 grandmother-0.067
 granddaughter-0.066
apa-0.064
oll-0.063
Token a
Token ID257
Feature activation+1.84e+00

Pos logit contributions
ppa+0.046
aj+0.042
lor+0.042
ascal+0.042
emic+0.041
Neg logit contributions
Mine-0.027
 wind-0.026
!"-0.026
 score-0.026
 race-0.025
Token big
Token ID1263
Feature activation+5.49e+00

Pos logit contributions
Stop+0.033
Mine+0.032
 score+0.031
atch+0.030
Run+0.030
Neg logit contributions
 grandmother-0.038
ppa-0.037
 granddaughter-0.037
apa-0.035
aj-0.034
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
Stop+0.113
ok+0.102
Run+0.100
Mine+0.096
atch+0.096
Neg logit contributions
 grandmother-0.112
 granddaughter-0.106
 throat-0.097
 parents-0.096
 aunt-0.094
Token boun
Token ID31283
Feature activation+7.88e+00

Pos logit contributions
 grandmother+0.027
 granddaughter+0.027
ppa+0.027
aj+0.026
oll+0.025
Neg logit contributions
Stop-0.019
Mine-0.019
 score-0.017
atch-0.017
OK-0.017
Tokency
Token ID948
Feature activation+1.63e-01

Pos logit contributions
Stop+0.124
 wants+0.124
 AW+0.122
Run+0.121
 tries+0.117
Neg logit contributions
 grandmother-0.146
 granddaughter-0.142
oll-0.141
ppa-0.139
isk-0.138
Token ball
Token ID2613
Feature activation+6.34e-01

Pos logit contributions
Stop+0.003
OK+0.002
Mine+0.002
ok+0.002
Run+0.002
Neg logit contributions
 grandmother-0.004
 granddaughter-0.004
 parents-0.003
ppa-0.003
aj-0.003
Token!
Token ID0
Feature activation+8.62e-01

Pos logit contributions
Stop+0.014
OK+0.013
Mine+0.013
Wow+0.012
Bang+0.012
Neg logit contributions
oll-0.011
 grandmother-0.011
tr-0.010
 granddaughter-0.010
ppa-0.010
Token The
Token ID383
Feature activation-2.20e-01

Pos logit contributions
Mine+0.014
Stop+0.014
 score+0.013
 hill+0.013
OK+0.013
Neg logit contributions
ppa-0.020
 grandmother-0.019
 granddaughter-0.018
rina-0.018
 patients-0.018
Token kid
Token ID5141
Feature activation-1.47e+00

Pos logit contributions
 granddaughter+0.004
 grandmother+0.004
ppa+0.004
 parents+0.004
aj+0.004
Neg logit contributions
Stop-0.004
Mine-0.004
 score-0.004
atch-0.004
www-0.004
Token they
Token ID484
Feature activation+1.97e+00

Pos logit contributions
ppa+0.008
 grandmother+0.007
 granddaughter+0.007
omm+0.007
aj+0.007
Neg logit contributions
Mine-0.004
Stop-0.004
 score-0.004
 hill-0.004
 race-0.004
Token found
Token ID1043
Feature activation-1.07e+00

Pos logit contributions
Stop+0.029
Mine+0.029
 score+0.028
OK+0.027
 hill+0.027
Neg logit contributions
ppa-0.046
 grandmother-0.045
 granddaughter-0.045
apa-0.043
oll-0.042
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
ppa+0.025
aj+0.023
 grandmother+0.022
 granddaughter+0.022
ascal+0.022
Neg logit contributions
Mine-0.018
Stop-0.017
 score-0.017
 turns-0.017
!"-0.017
Token big
Token ID1263
Feature activation+5.38e+00

Pos logit contributions
Stop+0.026
Mine+0.026
atch+0.024
Run+0.024
 score+0.024
Neg logit contributions
 grandmother-0.033
 granddaughter-0.032
ppa-0.031
apa-0.029
aj-0.029
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
Stop+0.113
ok+0.108
atch+0.098
OK+0.098
Run+0.096
Neg logit contributions
 grandmother-0.108
 granddaughter-0.097
 aunt-0.092
 throat-0.091
 Mother-0.091
Token boun
Token ID31283
Feature activation+7.87e+00

Pos logit contributions
 grandmother+0.036
 granddaughter+0.035
ppa+0.035
aj+0.033
oll+0.033
Neg logit contributions
Stop-0.025
Mine-0.024
atch-0.023
 score-0.022
OK-0.022
Tokency
Token ID948
Feature activation-2.45e-02

Pos logit contributions
Stop+0.130
 wants+0.125
Run+0.124
 AW+0.124
www+0.121
Neg logit contributions
 grandmother-0.140
oll-0.137
ail-0.129
 granddaughter-0.129
uggets-0.128
Token ball
Token ID2613
Feature activation+6.36e-01

Pos logit contributions
 grandmother+0.001
 granddaughter+0.001
tr+0.001
 parents+0.000
 owners+0.000
Neg logit contributions
Stop-0.000
OK-0.000
ok-0.000
atch-0.000
Mine-0.000
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
Stop+0.016
OK+0.014
Mine+0.014
www+0.014
Hey+0.013
Neg logit contributions
oll-0.009
tr-0.009
ocular-0.008
 grandmother-0.008
 owners-0.008
Token They
Token ID1119
Feature activation+1.49e+00

Pos logit contributions
Mine+0.004
Stop+0.004
 score+0.004
 hill+0.003
OK+0.003
Neg logit contributions
ppa-0.005
 grandmother-0.005
 granddaughter-0.005
 patients-0.005
oll-0.005
Token were
Token ID547
Feature activation-3.01e+00

Pos logit contributions
Stop+0.032
www+0.029
atch+0.029
Mine+0.029
Run+0.029
Neg logit contributions
ppa-0.026
 granddaughter-0.024
 grandmother-0.024
apa-0.023
 parents-0.022
Token round
Token ID2835
Feature activation+3.86e+00

Pos logit contributions
Stop+0.000
Mine+0.000
atch+0.000
 score+0.000
about+0.000
Neg logit contributions
ppa-0.000
 grandmother-0.000
 Chick-0.000
 granddaughter-0.000
aj-0.000
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
Mine+0.065
Stop+0.065
 score+0.061
OK+0.059
www+0.058
Neg logit contributions
ppa-0.082
 granddaughter-0.081
oll-0.080
 grandmother-0.080
aj-0.076
Token and
Token ID290
Feature activation+3.00e-01

Pos logit contributions
 exactly+0.018
 Chick+0.018
 grandmother+0.017
 Mitt+0.017
 owners+0.017
Neg logit contributions
Stop-0.021
Mine-0.020
five-0.020
atch-0.020
 challenge-0.019
Token ever
Token ID1683
Feature activation+1.18e+00

Pos logit contributions
Stop+0.004
Mine+0.004
 score+0.004
atch+0.004
 challenge+0.003
Neg logit contributions
ppa-0.007
 grandmother-0.007
 granddaughter-0.007
 Chick-0.007
oll-0.007
Token so
Token ID523
Feature activation+6.11e-02

Pos logit contributions
Mine+0.019
Stop+0.019
 score+0.018
 hill+0.017
OK+0.017
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
aj-0.025
oll-0.024
Token boun
Token ID31283
Feature activation+7.85e+00

Pos logit contributions
Stop+0.001
Mine+0.001
 score+0.001
 hill+0.001
atch+0.001
Neg logit contributions
ppa-0.001
 grandmother-0.001
 granddaughter-0.001
apa-0.001
oll-0.001
Tokency
Token ID948
Feature activation+7.35e-01

Pos logit contributions
 score+0.095
Stop+0.093
 challenge+0.093
 hill+0.092
Mine+0.092
Neg logit contributions
ppa-0.184
oll-0.182
 grandmother-0.173
ovable-0.173
apa-0.170
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
Stop+0.014
Mine+0.014
Hey+0.013
OK+0.013
 score+0.013
Neg logit contributions
ppa-0.014
 granddaughter-0.013
 grandmother-0.013
oll-0.013
 parents-0.013
Token Every
Token ID3887
Feature activation+1.81e+00

Pos logit contributions
 grandmother+0.000
 granddaughter+0.000
ppa+0.000
apa+0.000
 Mother+0.000
Neg logit contributions
Stop-0.000
 score-0.000
Mine-0.000
www-0.000
atch-0.000
Token day
Token ID1110
Feature activation+2.10e+00

Pos logit contributions
Stop+0.023
Mine+0.021
atch+0.019
 score+0.019
Run+0.018
Neg logit contributions
 grandmother-0.046
 granddaughter-0.045
oll-0.044
 owners-0.042
apa-0.042
Token,
Token ID11
Feature activation-7.80e-01

Pos logit contributions
Mine+0.036
 score+0.034
atch+0.034
Stop+0.034
 hill+0.033
Neg logit contributions
 grandmother-0.043
 granddaughter-0.042
apa-0.040
 Mother-0.040
 parents-0.040
Token too
Token ID1165
Feature activation+1.27e+00

Pos logit contributions
ppa+0.047
 grandmother+0.046
 granddaughter+0.046
apa+0.044
aj+0.043
Neg logit contributions
Mine-0.041
Stop-0.041
 score-0.041
 hill-0.039
 race-0.038
Token high
Token ID1029
Feature activation+5.07e+00

Pos logit contributions
Mine+0.025
Stop+0.024
Give+0.023
atch+0.022
Run+0.022
Neg logit contributions
ppa-0.024
aj-0.023
 granddaughter-0.023
ache-0.022
 grandmother-0.022
Token and
Token ID290
Feature activation+5.25e-01

Pos logit contributions
Mine+0.110
Stop+0.101
OK+0.100
www+0.098
Check+0.097
Neg logit contributions
oll-0.085
 grandmother-0.083
ppa-0.083
 granddaughter-0.081
isk-0.080
Token too
Token ID1165
Feature activation+2.15e+00

Pos logit contributions
Mine+0.009
Stop+0.008
atch+0.008
 score+0.008
www+0.008
Neg logit contributions
ppa-0.011
 grandmother-0.011
 granddaughter-0.010
apa-0.010
oll-0.010
Token w
Token ID266
Feature activation+3.86e+00

Pos logit contributions
Mine+0.039
Stop+0.038
atch+0.036
Give+0.036
Hey+0.034
Neg logit contributions
ache-0.040
ppa-0.040
oll-0.039
apa-0.039
 grandmother-0.038
Tokenobb
Token ID21963
Feature activation+7.84e+00

Pos logit contributions
Mine+0.076
Stop+0.075
 score+0.072
 hill+0.072
OK+0.071
Neg logit contributions
ppa-0.074
 grandmother-0.069
oll-0.069
 granddaughter-0.068
apa-0.068
Tokenly
Token ID306
Feature activation+2.28e+00

Pos logit contributions
Mine+0.141
 hill+0.138
Stop+0.130
 challenge+0.128
 score+0.125
Neg logit contributions
ppa-0.174
oll-0.162
aj-0.162
 granddaughter-0.155
ache-0.155
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
Mine+0.041
Stop+0.041
OK+0.039
www+0.038
Run+0.037
Neg logit contributions
ppa-0.046
 grandmother-0.044
 granddaughter-0.044
oll-0.042
apa-0.042
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
ppa+0.020
 grandmother+0.020
 granddaughter+0.020
oll+0.019
apa+0.019
Neg logit contributions
Mine-0.017
Stop-0.017
www-0.016
OK-0.016
atch-0.016
Token it
Token ID340
Feature activation+2.22e+00

Pos logit contributions
Mine+0.002
Stop+0.002
 score+0.002
atch+0.002
 hill+0.002
Neg logit contributions
ppa-0.003
 grandmother-0.003
 granddaughter-0.003
apa-0.002
oll-0.002
Token fell
Token ID3214
Feature activation+3.37e+00

Pos logit contributions
Mine+0.035
Stop+0.034
OK+0.033
atch+0.033
www+0.033
Neg logit contributions
oll-0.045
ppa-0.045
 grandmother-0.044
tr-0.044
 granddaughter-0.042
Token was
Token ID373
Feature activation-2.56e+00

Pos logit contributions
Mine+0.020
Stop+0.020
 score+0.019
atch+0.018
www+0.018
Neg logit contributions
 grandmother-0.026
ppa-0.026
 granddaughter-0.026
apa-0.025
oll-0.024
Token small
Token ID1402
Feature activation+2.28e+00

Pos logit contributions
ppa+0.058
 grandmother+0.057
 granddaughter+0.056
apa+0.056
 patients+0.053
Neg logit contributions
 score-0.039
Mine-0.039
Stop-0.038
 race-0.037
 hill-0.036
Token and
Token ID290
Feature activation-8.62e-02

Pos logit contributions
Stop+0.041
Mine+0.039
www+0.039
OK+0.038
Run+0.036
Neg logit contributions
 grandmother-0.049
 granddaughter-0.048
oll-0.046
 patients-0.045
ppa-0.045
Token red
Token ID2266
Feature activation+2.47e+00

Pos logit contributions
ppa+0.002
 grandmother+0.002
 granddaughter+0.002
aj+0.002
apa+0.002
Neg logit contributions
Stop-0.001
Mine-0.001
 score-0.001
atch-0.001
OK-0.001
Token and
Token ID290
Feature activation-2.63e-01

Pos logit contributions
www+0.045
Stop+0.044
Mine+0.043
OK+0.042
Run+0.041
Neg logit contributions
oll-0.048
 grandmother-0.048
 granddaughter-0.047
tr-0.045
omm-0.045
Token boun
Token ID31283
Feature activation+7.84e+00

Pos logit contributions
ppa+0.006
 grandmother+0.006
 granddaughter+0.006
apa+0.006
oll+0.006
Neg logit contributions
Stop-0.004
Mine-0.004
 score-0.003
atch-0.003
OK-0.003
Tokency
Token ID948
Feature activation-1.03e+00

Pos logit contributions
 challenge+0.113
www+0.113
 hill+0.112
 race+0.110
 slope+0.108
Neg logit contributions
oll-0.165
ppa-0.153
apa-0.151
 grandmother-0.150
aj-0.149
Token.
Token ID13
Feature activation+9.73e-01

Pos logit contributions
ppa+0.021
 grandmother+0.021
 granddaughter+0.021
oll+0.020
aj+0.020
Neg logit contributions
Stop-0.019
Mine-0.017
OK-0.017
www-0.017
Run-0.016
Token He
Token ID679
Feature activation+6.49e-01

Pos logit contributions
Mine+0.014
Stop+0.014
 score+0.013
OK+0.012
 hill+0.012
Neg logit contributions
ppa-0.025
 grandmother-0.024
 granddaughter-0.023
rina-0.022
aj-0.022
Token went
Token ID1816
Feature activation+1.35e+00

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.009
OK+0.008
 hill+0.008
Neg logit contributions
ppa-0.016
 grandmother-0.016
 granddaughter-0.016
apa-0.015
aj-0.015
Token back
Token ID736
Feature activation-1.74e+00

Pos logit contributions
Stop+0.023
Mine+0.023
 score+0.022
 hill+0.021
OK+0.021
Neg logit contributions
ppa-0.029
 grandmother-0.028
 granddaughter-0.028
apa-0.027
oll-0.027
Token very
Token ID845
Feature activation-1.80e-01

Pos logit contributions
ppa+0.039
 grandmother+0.037
aj+0.037
 granddaughter+0.036
apa+0.036
Neg logit contributions
 score-0.032
Mine-0.031
Stop-0.031
!"-0.031
 race-0.030
Token kind
Token ID1611
Feature activation+2.58e+00

Pos logit contributions
 grandmother+0.003
ppa+0.003
 granddaughter+0.003
apa+0.003
aj+0.003
Neg logit contributions
Stop-0.004
Mine-0.004
 hill-0.004
atch-0.004
 score-0.004
Token and
Token ID290
Feature activation-3.16e-01

Pos logit contributions
Mine+0.036
Stop+0.036
 score+0.034
Run+0.032
 hill+0.032
Neg logit contributions
 grandmother-0.064
ppa-0.063
 granddaughter-0.063
oll-0.060
apa-0.060
Token offered
Token ID4438
Feature activation-3.03e-01

Pos logit contributions
ppa+0.007
aj+0.007
 grandmother+0.006
apa+0.006
omm+0.006
Neg logit contributions
Stop-0.005
 score-0.005
Mine-0.005
OK-0.005
 race-0.005
Token Hel
Token ID5053
Feature activation-1.02e+00

Pos logit contributions
aj+0.008
ppa+0.008
oll+0.007
omm+0.007
rina+0.007
Neg logit contributions
Stop-0.004
 score-0.004
Mine-0.004
 turns-0.003
No-0.003
Tokenio
Token ID952
Feature activation-8.45e+00

Pos logit contributions
 grandmother+0.017
ppa+0.016
 granddaughter+0.016
 patients+0.015
apa+0.015
Neg logit contributions
 score-0.021
Stop-0.021
Mine-0.021
atch-0.021
!"-0.020
Token some
Token ID617
Feature activation-5.22e-01

Pos logit contributions
ppa+0.190
omm+0.184
aj+0.181
 former+0.180
 grandmother+0.180
Neg logit contributions
!"-0.132
 score-0.123
 turns-0.120
 wants-0.117
Mine-0.115
Token of
Token ID286
Feature activation-9.75e-01

Pos logit contributions
ppa+0.012
 grandmother+0.011
 granddaughter+0.011
aj+0.011
omm+0.011
Neg logit contributions
Mine-0.008
Stop-0.008
 score-0.008
 hill-0.008
 race-0.008
Token the
Token ID262
Feature activation+9.88e-01

Pos logit contributions
aj+0.026
ppa+0.025
omm+0.024
tr+0.024
urger+0.024
Neg logit contributions
Stop-0.012
Mine-0.012
 score-0.012
No-0.011
Hey-0.010
Token oxygen
Token ID11863
Feature activation-1.52e+00

Pos logit contributions
Mine+0.019
Stop+0.019
www+0.018
Run+0.017
atch+0.017
Neg logit contributions
 grandmother-0.021
 granddaughter-0.020
 patients-0.019
 parents-0.019
ppa-0.018
Token from
Token ID422
Feature activation-1.51e+00

Pos logit contributions
ppa+0.034
 grandmother+0.034
 granddaughter+0.034
apa+0.033
aj+0.032
Neg logit contributions
Stop-0.024
Mine-0.024
 score-0.022
atch-0.021
OK-0.021
Token she
Token ID673
Feature activation+3.38e-01

Pos logit contributions
ppa+0.032
aj+0.031
isk+0.030
urger+0.030
omm+0.029
Neg logit contributions
Stop-0.016
Mine-0.016
 score-0.016
!"-0.015
OK-0.015
Token smiled
Token ID13541
Feature activation+2.18e+00

Pos logit contributions
Stop+0.006
Mine+0.005
 score+0.005
OK+0.005
Run+0.005
Neg logit contributions
ppa-0.007
 grandmother-0.007
 granddaughter-0.007
 patients-0.007
omm-0.007
Token and
Token ID290
Feature activation-2.82e-01

Pos logit contributions
Mine+0.027
 score+0.026
Stop+0.026
OK+0.024
 hill+0.024
Neg logit contributions
ppa-0.057
 grandmother-0.054
 granddaughter-0.054
apa-0.053
aj-0.053
Token gave
Token ID2921
Feature activation-7.17e-01

Pos logit contributions
ppa+0.007
 grandmother+0.006
 granddaughter+0.006
aj+0.006
 patients+0.006
Neg logit contributions
Mine-0.004
Stop-0.004
 score-0.004
OK-0.004
 race-0.004
Token her
Token ID607
Feature activation-6.05e+00

Pos logit contributions
aj+0.016
ppa+0.016
 former+0.016
oll+0.015
ascal+0.015
Neg logit contributions
Stop-0.012
Mine-0.012
!"-0.011
 turns-0.011
OK-0.011
Token daughter
Token ID4957
Feature activation-8.34e+00

Pos logit contributions
ppa+0.150
ascal+0.137
isk+0.137
rina+0.136
omm+0.135
Neg logit contributions
 score-0.087
Mine-0.087
!"-0.085
 race-0.083
 challenge-0.083
Token a
Token ID257
Feature activation+7.23e-01

Pos logit contributions
ppa+0.191
omm+0.186
ascal+0.184
rina+0.181
 grandmother+0.177
Neg logit contributions
 score-0.130
!"-0.125
Mine-0.122
 kick-0.120
 race-0.118
Token big
Token ID1263
Feature activation+3.92e+00

Pos logit contributions
Mine+0.011
Stop+0.011
 score+0.011
 hill+0.010
OK+0.010
Neg logit contributions
ppa-0.017
 grandmother-0.016
 granddaughter-0.016
apa-0.015
aj-0.015
Token hug
Token ID16225
Feature activation+5.52e-01

Pos logit contributions
Stop+0.085
atch+0.081
Mine+0.077
 slope+0.075
Run+0.074
Neg logit contributions
 grandmother-0.073
 granddaughter-0.071
apa-0.062
 prayers-0.062
 Mother-0.062
Token.
Token ID13
Feature activation-1.00e-01

Pos logit contributions
Mine+0.006
Stop+0.006
 score+0.006
!"+0.006
OK+0.006
Neg logit contributions
ppa-0.015
 grandmother-0.014
aj-0.014
apa-0.014
 granddaughter-0.014
Token
Token ID220
Feature activation-6.15e-01

Pos logit contributions
ppa+0.002
 patients+0.002
oll+0.002
 granddaughter+0.002
 grandmother+0.002
Neg logit contributions
Mine-0.002
Stop-0.001
OK-0.001
!"-0.001
 hill-0.001
Token and
Token ID290
Feature activation-2.57e-01

Pos logit contributions
Mine+0.023
Stop+0.023
 score+0.022
 hill+0.021
OK+0.020
Neg logit contributions
ppa-0.048
 grandmother-0.047
 granddaughter-0.046
apa-0.045
aj-0.044
Token the
Token ID262
Feature activation-7.05e-01

Pos logit contributions
ppa+0.006
omm+0.006
aj+0.006
oll+0.006
uf+0.006
Neg logit contributions
OK-0.004
 score-0.004
 race-0.004
Stop-0.003
Mine-0.003
Token mum
Token ID25682
Feature activation-9.15e-01

Pos logit contributions
ppa+0.012
 grandmother+0.011
 granddaughter+0.010
apa+0.010
aj+0.010
Neg logit contributions
Mine-0.016
Stop-0.016
 score-0.015
 hill-0.015
OK-0.015
Token gave
Token ID2921
Feature activation-3.81e-01

Pos logit contributions
ppa+0.022
 grandmother+0.021
 granddaughter+0.021
aj+0.021
omm+0.020
Neg logit contributions
 score-0.012
Mine-0.012
Stop-0.012
 hill-0.012
!"-0.012
Token her
Token ID607
Feature activation-5.76e+00

Pos logit contributions
aj+0.010
ppa+0.009
omm+0.009
urger+0.009
oll+0.009
Neg logit contributions
Stop-0.006
Mine-0.005
OK-0.005
 turns-0.005
No-0.005
Token daughter
Token ID4957
Feature activation-8.24e+00

Pos logit contributions
ppa+0.144
aj+0.140
omm+0.138
ascal+0.137
iqu+0.132
Neg logit contributions
!"-0.078
 score-0.074
 turns-0.074
No-0.074
 challenge-0.073
Token lots
Token ID6041
Feature activation+1.29e+00

Pos logit contributions
ppa+0.201
omm+0.198
ascal+0.195
aj+0.187
iqu+0.186
Neg logit contributions
 score-0.117
!"-0.114
Mine-0.113
 turns-0.110
 race-0.109
Token of
Token ID286
Feature activation-6.30e-01

Pos logit contributions
Mine+0.017
Stop+0.017
 score+0.016
 hill+0.015
OK+0.015
Neg logit contributions
ppa-0.032
 grandmother-0.032
 granddaughter-0.031
apa-0.030
aj-0.030
Token hugs
Token ID40657
Feature activation+1.15e+00

Pos logit contributions
ppa+0.013
 grandmother+0.012
 granddaughter+0.012
apa+0.012
omm+0.012
Neg logit contributions
Stop-0.011
Mine-0.011
 score-0.011
OK-0.011
 race-0.011
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
Stop+0.013
Mine+0.012
OK+0.012
 score+0.011
!"+0.011
Neg logit contributions
ppa-0.032
aj-0.031
 grandmother-0.030
 granddaughter-0.030
apa-0.030
Token
Token ID198
Feature activation-1.51e+00

Pos logit contributions
Mine+0.002
Stop+0.002
OK+0.002
 hill+0.001
!"+0.001
Neg logit contributions
ppa-0.003
urger-0.003
 patients-0.003
oll-0.003
ache-0.003
Tokenum
Token ID388
Feature activation-1.16e-01

Pos logit contributions
ppa+0.023
 grandmother+0.023
 granddaughter+0.022
 patients+0.021
apa+0.021
Neg logit contributions
Mine-0.019
Stop-0.019
 score-0.019
OK-0.018
atch-0.018
Token smiled
Token ID13541
Feature activation+2.38e+00

Pos logit contributions
ppa+0.003
 grandmother+0.002
 granddaughter+0.002
apa+0.002
aj+0.002
Neg logit contributions
Mine-0.002
Stop-0.002
 score-0.002
 hill-0.002
OK-0.002
Token and
Token ID290
Feature activation+3.59e-01

Pos logit contributions
Mine+0.030
Stop+0.030
 score+0.029
OK+0.027
 hill+0.027
Neg logit contributions
ppa-0.061
 grandmother-0.059
 granddaughter-0.059
apa-0.057
aj-0.057
Token gave
Token ID2921
Feature activation+3.99e-01

Pos logit contributions
Stop+0.005
Mine+0.005
 score+0.005
OK+0.005
Run+0.005
Neg logit contributions
ppa-0.008
 grandmother-0.008
 granddaughter-0.008
aj-0.008
 patients-0.008
Token her
Token ID607
Feature activation-5.52e+00

Pos logit contributions
Stop+0.006
Mine+0.006
!"+0.006
OK+0.006
 turns+0.006
Neg logit contributions
aj-0.009
ppa-0.009
 grandmother-0.009
 patients-0.009
urger-0.008
Token daughter
Token ID4957
Feature activation-8.11e+00

Pos logit contributions
ppa+0.126
aj+0.125
ascal+0.119
isk+0.118
omm+0.118
Neg logit contributions
Mine-0.085
!"-0.084
 score-0.083
No-0.083
Stop-0.080
Token another
Token ID1194
Feature activation+9.77e-01

Pos logit contributions
ppa+0.186
ascal+0.178
omm+0.177
aj+0.177
rina+0.173
Neg logit contributions
 score-0.127
Mine-0.126
!"-0.121
 kick-0.116
 race-0.114
Token hug
Token ID16225
Feature activation+1.23e+00

Pos logit contributions
Stop+0.017
Mine+0.017
atch+0.016
 hill+0.016
 score+0.015
Neg logit contributions
 granddaughter-0.021
 grandmother-0.021
ppa-0.020
apa-0.019
oll-0.019
Token.
Token ID13
Feature activation+7.39e-01

Pos logit contributions
Mine+0.016
Stop+0.016
 score+0.015
OK+0.015
 hill+0.014
Neg logit contributions
ppa-0.031
 grandmother-0.030
 granddaughter-0.030
aj-0.029
apa-0.029
Token '
Token ID705
Feature activation+1.73e+00

Pos logit contributions
Mine+0.014
Stop+0.014
OK+0.013
 score+0.013
!"+0.013
Neg logit contributions
ppa-0.015
 grandmother-0.014
 granddaughter-0.014
 patients-0.013
oll-0.013
TokenThat
Token ID2504
Feature activation+2.42e+00

Pos logit contributions
 hill+0.024
 score+0.023
 race+0.022
Mine+0.022
 challenge+0.021
Neg logit contributions
ppa-0.043
 granddaughter-0.041
 grandmother-0.040
apa-0.038
oll-0.038
Token was
Token ID373
Feature activation-2.47e+00

Pos logit contributions
 cl+0.002
!"+0.002
 score+0.002
Stop+0.002
Give+0.002
Neg logit contributions
aj-0.004
oll-0.004
 former-0.004
 patients-0.004
ppa-0.004
Token angry
Token ID7954
Feature activation+2.97e+00

Pos logit contributions
aj+0.063
apa+0.063
 former+0.062
ometers+0.061
 Him+0.061
Neg logit contributions
 cl-0.033
!"-0.030
 score-0.029
 wave-0.027
 cheered-0.027
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
Mine+0.046
Stop+0.045
 score+0.043
OK+0.041
 hill+0.041
Neg logit contributions
 grandmother-0.069
ppa-0.069
 granddaughter-0.068
oll-0.065
apa-0.063
Token asked
Token ID1965
Feature activation-3.14e+00

Pos logit contributions
ppa+0.022
 grandmother+0.022
 granddaughter+0.022
oll+0.022
aj+0.022
Neg logit contributions
Stop-0.016
Mine-0.016
 score-0.015
Run-0.015
OK-0.015
Token her
Token ID607
Feature activation-6.67e+00

Pos logit contributions
ppa+0.079
aj+0.079
 former+0.076
oll+0.075
urger+0.075
Neg logit contributions
Stop-0.045
!"-0.042
Mine-0.039
 want-0.038
No-0.037
Token daughter
Token ID4957
Feature activation-8.04e+00

Pos logit contributions
 former+0.147
ppa+0.147
aj+0.145
ridge+0.141
lor+0.140
Neg logit contributions
!"-0.107
 want-0.102
 it-0.097
,-0.096
 thing-0.096
Token why
Token ID1521
Feature activation+5.27e-01

Pos logit contributions
ppa+0.202
aj+0.196
 former+0.191
ascal+0.191
oll+0.185
Neg logit contributions
 want-0.114
 score-0.112
!"-0.111
 kick-0.103
 race-0.100
Token she
Token ID673
Feature activation-1.14e+00

Pos logit contributions
Stop+0.007
Mine+0.007
 score+0.007
 hill+0.006
!"+0.006
Neg logit contributions
ppa-0.013
 granddaughter-0.013
 grandmother-0.013
urger-0.012
aj-0.012
Token had
Token ID550
Feature activation+6.88e-02

Pos logit contributions
ppa+0.029
 grandmother+0.028
 granddaughter+0.028
apa+0.027
aj+0.027
Neg logit contributions
Stop-0.015
Mine-0.015
 score-0.015
!"-0.014
 hill-0.014
Token done
Token ID1760
Feature activation-6.40e-01

Pos logit contributions
 score+0.001
Stop+0.001
Mine+0.001
!"+0.001
 race+0.001
Neg logit contributions
 grandmother-0.001
ppa-0.001
apa-0.001
aj-0.001
oll-0.001
Token something
Token ID1223
Feature activation-2.86e-01

Pos logit contributions
ppa+0.016
apa+0.016
aj+0.015
oll+0.015
 grandmother+0.015
Neg logit contributions
 score-0.009
Stop-0.009
Mine-0.009
!"-0.009
 race-0.008
Token happy
Token ID3772
Feature activation+2.45e+00

Pos logit contributions
ppa+0.022
 grandmother+0.021
 granddaughter+0.021
apa+0.020
aj+0.020
Neg logit contributions
Mine-0.016
Stop-0.016
 score-0.016
OK-0.015
 race-0.015
Token that
Token ID326
Feature activation+6.77e-01

Pos logit contributions
Mine+0.052
Stop+0.050
 score+0.046
OK+0.046
atch+0.046
Neg logit contributions
 grandmother-0.046
 granddaughter-0.044
ppa-0.043
 parents-0.041
oll-0.040
Token he
Token ID339
Feature activation+1.02e+00

Pos logit contributions
Stop+0.007
Mine+0.007
 score+0.007
 hill+0.007
 race+0.007
Neg logit contributions
ppa-0.019
aj-0.018
 granddaughter-0.018
apa-0.018
 grandmother-0.018
Token gave
Token ID2921
Feature activation-5.91e-01

Pos logit contributions
Mine+0.019
Stop+0.018
 score+0.018
 hill+0.017
atch+0.017
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.020
oll-0.018
apa-0.018
Token his
Token ID465
Feature activation-4.51e+00

Pos logit contributions
aj+0.016
omm+0.015
ppa+0.015
iqu+0.015
tr+0.015
Neg logit contributions
Stop-0.007
 score-0.007
 turns-0.006
 wind-0.006
Mine-0.006
Token wife
Token ID3656
Feature activation-8.03e+00

Pos logit contributions
ppa+0.048
omm+0.048
 Chick+0.047
aj+0.046
oll+0.045
Neg logit contributions
 score-0.126
 race-0.120
OK-0.119
Stop-0.119
 challenge-0.118
Token a
Token ID257
Feature activation+5.78e-02

Pos logit contributions
ppa+0.178
omm+0.174
 grandmother+0.172
iqu+0.171
aj+0.170
Neg logit contributions
 score-0.133
Mine-0.125
!"-0.122
 race-0.118
 turns-0.116
Token big
Token ID1263
Feature activation+3.13e+00

Pos logit contributions
 score+0.001
Mine+0.001
 race+0.001
 challenge+0.001
 hill+0.001
Neg logit contributions
ppa-0.002
rina-0.001
aj-0.001
 grandmother-0.001
omm-0.001
Token hug
Token ID16225
Feature activation+5.23e-01

Pos logit contributions
Stop+0.060
Mine+0.056
atch+0.055
OK+0.052
 slope+0.052
Neg logit contributions
 granddaughter-0.064
 grandmother-0.064
 parents-0.057
oll-0.057
apa-0.057
Token,
Token ID11
Feature activation-2.82e+00

Pos logit contributions
 score+0.005
Mine+0.005
Stop+0.005
!"+0.005
 hill+0.005
Neg logit contributions
ppa-0.015
aj-0.014
 grandmother-0.014
apa-0.014
 granddaughter-0.014
Token and
Token ID290
Feature activation+1.60e-01

Pos logit contributions
ppa+0.073
apa+0.071
 grandmother+0.070
 granddaughter+0.069
aj+0.068
Neg logit contributions
 score-0.034
Mine-0.033
Stop-0.033
OK-0.032
 hill-0.031
Token be
Token ID307
Feature activation-1.91e+00

Pos logit contributions
Mine+0.020
Stop+0.019
 score+0.019
 hill+0.017
OK+0.017
Neg logit contributions
ppa-0.051
 grandmother-0.050
 granddaughter-0.049
apa-0.048
aj-0.048
Token fun
Token ID1257
Feature activation+1.40e+00

Pos logit contributions
apa+0.049
lor+0.047
uf+0.047
ppa+0.047
omm+0.047
Neg logit contributions
 wind-0.025
!"-0.025
 race-0.023
Stop-0.023
 want-0.022
Token to
Token ID284
Feature activation-2.47e-01

Pos logit contributions
Mine+0.031
Stop+0.031
Run+0.029
 score+0.029
 hill+0.028
Neg logit contributions
 grandmother-0.024
 granddaughter-0.023
ppa-0.022
 patients-0.022
aj-0.021
Token give
Token ID1577
Feature activation-6.63e-01

Pos logit contributions
ppa+0.007
 grandmother+0.007
 granddaughter+0.007
apa+0.006
oll+0.006
Neg logit contributions
Mine-0.003
Stop-0.003
 score-0.003
 hill-0.002
 race-0.002
Token the
Token ID262
Feature activation-2.16e+00

Pos logit contributions
ppa+0.016
 grandmother+0.015
aj+0.015
omm+0.015
oll+0.015
Neg logit contributions
Stop-0.009
Mine-0.009
 score-0.009
!"-0.009
 turns-0.009
Token lady
Token ID10846
Feature activation-7.93e+00

Pos logit contributions
ppa+0.048
 grandmother+0.047
 granddaughter+0.047
aj+0.044
apa+0.044
Neg logit contributions
Mine-0.035
Stop-0.034
 score-0.034
atch-0.032
 hill-0.032
Token a
Token ID257
Feature activation+8.58e-01

Pos logit contributions
omm+0.184
ppa+0.182
iqu+0.178
 grandmother+0.177
aj+0.176
Neg logit contributions
 score-0.121
!"-0.118
 race-0.116
 game-0.109
 want-0.109
Token big
Token ID1263
Feature activation+3.66e+00

Pos logit contributions
Mine+0.012
Stop+0.012
 score+0.012
 hill+0.011
 race+0.011
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.020
apa-0.019
oll-0.019
Token hug
Token ID16225
Feature activation-5.75e-02

Pos logit contributions
Stop+0.078
Mine+0.072
atch+0.072
Run+0.069
Give+0.069
Neg logit contributions
 granddaughter-0.069
 grandmother-0.068
 throat-0.061
 prayers-0.059
aj-0.059
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
ppa+0.002
 grandmother+0.001
apa+0.001
 granddaughter+0.001
aj+0.001
Neg logit contributions
Mine-0.001
Stop-0.001
 score-0.001
!"-0.001
 hill-0.001
Token So
Token ID1406
Feature activation+1.17e+00

Pos logit contributions
ppa+0.008
 grandmother+0.008
 granddaughter+0.008
apa+0.008
oll+0.007
Neg logit contributions
Mine-0.006
Stop-0.006
 score-0.006
 hill-0.006
OK-0.006
Token so
Token ID523
Feature activation+1.20e+00

Pos logit contributions
ppa+0.037
 grandmother+0.036
 granddaughter+0.036
apa+0.034
aj+0.034
Neg logit contributions
Mine-0.027
Stop-0.027
 score-0.026
 hill-0.025
OK-0.024
Token they
Token ID484
Feature activation+1.61e+00

Pos logit contributions
 race+0.013
 hill+0.013
Mine+0.013
!"+0.013
 wind+0.013
Neg logit contributions
ppa-0.033
oll-0.032
isk-0.031
omm-0.031
uf-0.031
Token gave
Token ID2921
Feature activation-6.89e-01

Pos logit contributions
Mine+0.023
Stop+0.023
 score+0.022
 hill+0.021
OK+0.021
Neg logit contributions
ppa-0.038
 grandmother-0.038
 granddaughter-0.037
aj-0.036
apa-0.036
Token their
Token ID511
Feature activation-3.96e+00

Pos logit contributions
aj+0.020
ppa+0.019
oll+0.019
omm+0.018
 former+0.018
Neg logit contributions
Mine-0.008
Stop-0.008
 turns-0.008
 wind-0.007
OK-0.007
Token little
Token ID1310
Feature activation-3.43e-03

Pos logit contributions
ppa+0.082
oll+0.074
lor+0.071
rina+0.070
isk+0.070
Neg logit contributions
Mine-0.077
 hill-0.076
 race-0.074
OK-0.073
Stop-0.073
Token girl
Token ID2576
Feature activation-7.86e+00

Pos logit contributions
ppa+0.000
 grandmother+0.000
 granddaughter+0.000
apa+0.000
oll+0.000
Neg logit contributions
Mine-0.000
Stop-0.000
 score-0.000
 hill-0.000
OK-0.000
Token a
Token ID257
Feature activation+9.90e-01

Pos logit contributions
iqu+0.193
omm+0.192
ppa+0.190
 former+0.190
ascal+0.189
Neg logit contributions
!"-0.102
 score-0.093
Mine-0.088
 past-0.087
 race-0.086
Token measuring
Token ID15964
Feature activation+1.34e+00

Pos logit contributions
Mine+0.015
Stop+0.015
 score+0.015
 hill+0.014
OK+0.014
Neg logit contributions
ppa-0.023
 grandmother-0.022
 granddaughter-0.022
apa-0.021
aj-0.021
Token tape
Token ID9154
Feature activation-9.69e-01

Pos logit contributions
Stop+0.024
Mine+0.023
Run+0.022
 score+0.021
atch+0.021
Neg logit contributions
 grandmother-0.029
 granddaughter-0.028
ppa-0.027
 patients-0.026
oll-0.026
Token.
Token ID13
Feature activation+4.49e-01

Pos logit contributions
 grandmother+0.021
ppa+0.020
 granddaughter+0.020
apa+0.019
oll+0.019
Neg logit contributions
Stop-0.017
Mine-0.017
 score-0.016
Run-0.015
OK-0.015
Token She
Token ID1375
Feature activation+9.16e-01

Pos logit contributions
Mine+0.008
Stop+0.008
 hill+0.007
 score+0.007
OK+0.007
Neg logit contributions
ppa-0.010
oll-0.009
 grandmother-0.009
rina-0.009
aj-0.009
Token her
Token ID607
Feature activation-4.01e+00

Pos logit contributions
ppa+0.034
aj+0.034
omm+0.033
oll+0.033
apa+0.033
Neg logit contributions
 score-0.017
Mine-0.016
Stop-0.016
 race-0.015
 hill-0.015
Token feet
Token ID3625
Feature activation+5.06e-01

Pos logit contributions
ppa+0.090
oll+0.082
apa+0.081
 granddaughter+0.079
isk+0.079
Neg logit contributions
 hill-0.072
 score-0.070
Mine-0.067
 race-0.065
!"-0.064
Token and
Token ID290
Feature activation+4.19e-01

Pos logit contributions
Mine+0.005
Stop+0.005
 score+0.005
 hill+0.005
 race+0.005
Neg logit contributions
ppa-0.014
 grandmother-0.014
 granddaughter-0.014
apa-0.013
aj-0.013
Token handed
Token ID10158
Feature activation-4.72e-01

Pos logit contributions
Stop+0.007
Mine+0.007
 score+0.007
 hill+0.006
OK+0.006
Neg logit contributions
ppa-0.009
 grandmother-0.009
 granddaughter-0.009
apa-0.009
aj-0.008
Token the
Token ID262
Feature activation-2.09e+00

Pos logit contributions
aj+0.013
 former+0.012
ppa+0.012
oll+0.012
omm+0.012
Neg logit contributions
Stop-0.006
Mine-0.006
 score-0.005
OK-0.005
Give-0.005
Token woman
Token ID2415
Feature activation-7.82e+00

Pos logit contributions
ppa+0.046
 grandmother+0.043
apa+0.042
oll+0.042
 granddaughter+0.042
Neg logit contributions
Stop-0.036
Mine-0.035
 score-0.034
 hill-0.034
OK-0.033
Token a
Token ID257
Feature activation+6.60e-01

Pos logit contributions
aj+0.191
ppa+0.190
 grandmother+0.190
omm+0.189
iqu+0.183
Neg logit contributions
 score-0.107
!"-0.103
Mine-0.099
 kick-0.094
 race-0.091
Token piece
Token ID3704
Feature activation-5.30e-01

Pos logit contributions
Stop+0.011
Mine+0.011
atch+0.010
 score+0.010
Run+0.010
Neg logit contributions
 grandmother-0.014
ppa-0.014
 granddaughter-0.014
apa-0.013
aj-0.013
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
ppa+0.011
oll+0.011
 grandmother+0.011
 granddaughter+0.011
apa+0.010
Neg logit contributions
Stop-0.010
Mine-0.009
OK-0.009
Run-0.009
 score-0.009
Token They
Token ID1119
Feature activation+1.51e+00

Pos logit contributions
Mine+0.007
Stop+0.007
 score+0.007
 hill+0.007
OK+0.007
Neg logit contributions
ppa-0.008
oll-0.008
 grandmother-0.008
apa-0.008
 patients-0.007
Token both
Token ID1111
Feature activation+1.42e+00

Pos logit contributions
Stop+0.028
Mine+0.027
 score+0.027
 hill+0.026
OK+0.026
Neg logit contributions
ppa-0.030
 grandmother-0.029
 granddaughter-0.029
apa-0.028
oll-0.028
Token friend
Token ID1545
Feature activation-7.92e-01

Pos logit contributions
ppa+0.064
apa+0.056
isk+0.056
lor+0.054
ridge+0.053
Neg logit contributions
 race-0.074
 hill-0.074
 game-0.072
 wind-0.071
 score-0.071
Token suggested
Token ID5220
Feature activation-1.23e+00

Pos logit contributions
ppa+0.021
aj+0.021
 former+0.021
oll+0.020
 grandmother+0.020
Neg logit contributions
Stop-0.008
 race-0.008
OK-0.008
Mine-0.008
 score-0.008
Token they
Token ID484
Feature activation-1.43e+00

Pos logit contributions
ppa+0.031
aj+0.030
 grandmother+0.030
oll+0.030
 granddaughter+0.029
Neg logit contributions
Stop-0.017
 score-0.016
Mine-0.016
 race-0.015
Run-0.015
Token ask
Token ID1265
Feature activation-3.31e+00

Pos logit contributions
ppa+0.039
 grandmother+0.038
 granddaughter+0.037
apa+0.036
aj+0.036
Neg logit contributions
Mine-0.016
Stop-0.016
 score-0.016
 hill-0.015
OK-0.014
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
aj+0.087
ppa+0.085
iqu+0.084
oll+0.083
lor+0.082
Neg logit contributions
 score-0.040
Stop-0.040
!"-0.040
Mine-0.039
 race-0.038
Token stranger
Token ID16195
Feature activation-7.80e+00

Pos logit contributions
ppa+0.030
 grandmother+0.029
 granddaughter+0.029
apa+0.028
oll+0.027
Neg logit contributions
Mine-0.019
Stop-0.018
 score-0.018
 hill-0.017
atch-0.017
Token what
Token ID644
Feature activation-3.52e-01

Pos logit contributions
ppa+0.218
omm+0.208
aj+0.205
ascal+0.203
oll+0.201
Neg logit contributions
 score-0.089
!"-0.087
 want-0.084
 race-0.081
 wants-0.079
Token his
Token ID465
Feature activation-2.41e+00

Pos logit contributions
ppa+0.009
 grandmother+0.008
 granddaughter+0.008
aj+0.008
apa+0.008
Neg logit contributions
Stop-0.005
Mine-0.005
 score-0.005
 hill-0.005
 race-0.005
Token name
Token ID1438
Feature activation-1.34e+00

Pos logit contributions
ppa+0.048
apa+0.042
aj+0.041
oll+0.041
omm+0.040
Neg logit contributions
 race-0.047
 score-0.047
 hill-0.047
OK-0.047
Stop-0.046
Token was
Token ID373
Feature activation-2.71e+00

Pos logit contributions
ppa+0.032
 grandmother+0.032
 granddaughter+0.031
apa+0.030
aj+0.030
Neg logit contributions
Stop-0.019
Mine-0.019
 score-0.018
 hill-0.018
OK-0.017
Token.
Token ID13
Feature activation-9.03e-02

Pos logit contributions
ppa+0.074
apa+0.071
oll+0.070
omm+0.070
 grandmother+0.069
Neg logit contributions
!"-0.032
Stop-0.029
 wants-0.029
 score-0.029
 race-0.029
Token him
Token ID683
Feature activation-2.67e+00

Pos logit contributions
ppa+0.008
 granddaughter+0.007
 grandmother+0.007
aj+0.007
apa+0.007
Neg logit contributions
Stop-0.004
Mine-0.004
 score-0.004
 hill-0.004
!"-0.004
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
ppa+0.071
 grandmother+0.068
 granddaughter+0.067
aj+0.066
apa+0.066
Neg logit contributions
 score-0.032
!"-0.029
Mine-0.029
 hill-0.029
Stop-0.029
Token He
Token ID679
Feature activation+9.07e-01

Pos logit contributions
Stop+0.024
Mine+0.024
 score+0.023
 hill+0.022
OK+0.022
Neg logit contributions
ppa-0.030
 grandmother-0.029
 granddaughter-0.029
apa-0.028
aj-0.027
Token gave
Token ID2921
Feature activation-2.03e-02

Pos logit contributions
Mine+0.015
Stop+0.015
 score+0.014
 hill+0.014
OK+0.014
Neg logit contributions
ppa-0.020
 grandmother-0.019
 granddaughter-0.019
apa-0.018
aj-0.018
Token his
Token ID465
Feature activation-3.95e+00

Pos logit contributions
ppa+0.001
aj+0.001
 grandmother+0.001
 granddaughter+0.001
oll+0.000
Neg logit contributions
Stop-0.000
Mine-0.000
 score-0.000
 turns-0.000
!"-0.000
Token son
Token ID3367
Feature activation-7.80e+00

Pos logit contributions
ppa+0.060
isk+0.057
omm+0.056
oll+0.055
rina+0.054
Neg logit contributions
 hill-0.089
 score-0.088
Mine-0.088
 challenge-0.087
 race-0.087
Token a
Token ID257
Feature activation+6.01e-01

Pos logit contributions
ppa+0.183
 grandmother+0.176
rina+0.174
 former+0.174
omm+0.174
Neg logit contributions
 score-0.116
 kick-0.109
!"-0.109
Mine-0.107
 race-0.104
Token big
Token ID1263
Feature activation+3.66e+00

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.009
 hill+0.009
OK+0.009
Neg logit contributions
ppa-0.014
 grandmother-0.013
 granddaughter-0.013
apa-0.013
oll-0.013
Token hug
Token ID16225
Feature activation+6.13e-01

Pos logit contributions
Stop+0.079
atch+0.071
Mine+0.071
Give+0.068
OK+0.068
Neg logit contributions
 granddaughter-0.067
 grandmother-0.066
apa-0.061
aj-0.061
 parents-0.060
Token and
Token ID290
Feature activation-5.51e-01

Pos logit contributions
 score+0.005
!"+0.005
Stop+0.005
Mine+0.005
 hill+0.005
Neg logit contributions
ppa-0.018
 grandmother-0.017
aj-0.017
 granddaughter-0.017
apa-0.017
Token thanked
Token ID26280
Feature activation-2.00e+00

Pos logit contributions
ppa+0.014
 grandmother+0.013
omm+0.013
uf+0.013
ascal+0.013
Neg logit contributions
 score-0.007
Mine-0.007
 race-0.007
Stop-0.006
OK-0.006
Token ran
Token ID4966
Feature activation+2.82e+00

Pos logit contributions
ppa+0.000
 grandmother+0.000
 patients+0.000
rina+0.000
 granddaughter+0.000
Neg logit contributions
Mine-0.000
 hill-0.000
Stop-0.000
!"-0.000
OK-0.000
Token to
Token ID284
Feature activation-2.38e+00

Pos logit contributions
Mine+0.061
www+0.056
Stop+0.055
atch+0.055
Check+0.054
Neg logit contributions
 granddaughter-0.043
 grandmother-0.042
ppa-0.042
 owners-0.041
 parents-0.041
Token him
Token ID683
Feature activation-4.97e+00

Pos logit contributions
ppa+0.053
 grandmother+0.053
 granddaughter+0.052
apa+0.050
 patients+0.049
Neg logit contributions
Mine-0.037
Stop-0.037
 score-0.036
 hill-0.034
OK-0.034
Token and
Token ID290
Feature activation-7.18e-01

Pos logit contributions
ppa+0.138
 grandmother+0.133
 granddaughter+0.132
apa+0.130
omm+0.130
Neg logit contributions
 score-0.055
 hill-0.050
!"-0.049
Mine-0.049
 race-0.048
Token gave
Token ID2921
Feature activation-1.63e+00

Pos logit contributions
ppa+0.017
 grandmother+0.017
oll+0.017
 granddaughter+0.016
omm+0.016
Neg logit contributions
Stop-0.010
 score-0.010
Mine-0.010
 race-0.010
OK-0.010
Token him
Token ID683
Feature activation-7.78e+00

Pos logit contributions
ppa+0.042
aj+0.042
 grandmother+0.041
 granddaughter+0.041
omm+0.040
Neg logit contributions
Mine-0.020
Stop-0.020
 score-0.019
!"-0.018
 race-0.017
Token a
Token ID257
Feature activation-1.74e-01

Pos logit contributions
omm+0.205
ppa+0.200
 grandmother+0.195
rina+0.192
ascal+0.187
Neg logit contributions
 score-0.105
!"-0.102
 kick-0.095
Mine-0.091
 turns-0.090
Token big
Token ID1263
Feature activation+3.18e+00

Pos logit contributions
ppa+0.004
 grandmother+0.004
 granddaughter+0.004
apa+0.004
aj+0.004
Neg logit contributions
Mine-0.003
Stop-0.003
 score-0.003
 hill-0.002
OK-0.002
Token hug
Token ID16225
Feature activation+3.37e-01

Pos logit contributions
Stop+0.064
Mine+0.059
atch+0.057
OK+0.055
Give+0.055
Neg logit contributions
 granddaughter-0.062
 grandmother-0.062
 throat-0.056
aj-0.056
apa-0.055
Token.
Token ID13
Feature activation-4.94e-01

Pos logit contributions
!"+0.003
 score+0.003
Stop+0.003
Mine+0.003
 hill+0.003
Neg logit contributions
ppa-0.010
 grandmother-0.009
apa-0.009
aj-0.009
iqu-0.009
Token They
Token ID1119
Feature activation+1.63e+00

Pos logit contributions
ppa+0.012
 patients+0.011
oll+0.011
 grandmother+0.011
rina+0.011
Neg logit contributions
Mine-0.007
Stop-0.007
 hill-0.007
 score-0.007
!"-0.007
Token knowing
Token ID6970
Feature activation-7.17e-04

Pos logit contributions
ppa+0.060
apa+0.059
 grandmother+0.058
oll+0.057
 granddaughter+0.057
Neg logit contributions
Stop-0.043
Mine-0.042
 score-0.042
 hill-0.041
OK-0.040
Token he
Token ID339
Feature activation+6.45e-01

Pos logit contributions
ppa+0.000
aj+0.000
apa+0.000
omm+0.000
urger+0.000
Neg logit contributions
Stop-0.000
 score-0.000
!"-0.000
Mine-0.000
 race-0.000
Token had
Token ID550
Feature activation-8.97e-02

Pos logit contributions
Stop+0.010
Mine+0.010
 score+0.009
 hill+0.009
OK+0.009
Neg logit contributions
ppa-0.015
 grandmother-0.015
 granddaughter-0.014
aj-0.014
apa-0.014
Token given
Token ID1813
Feature activation-7.17e-01

Pos logit contributions
ppa+0.002
omm+0.002
aj+0.002
oll+0.002
apa+0.002
Neg logit contributions
 score-0.001
Stop-0.001
!"-0.001
Mine-0.001
 wind-0.001
Token his
Token ID465
Feature activation-3.74e+00

Pos logit contributions
ppa+0.017
aj+0.017
oll+0.016
apa+0.016
uf+0.016
Neg logit contributions
Stop-0.011
Mine-0.011
 score-0.011
 turns-0.010
 wind-0.010
Token daughter
Token ID4957
Feature activation-7.77e+00

Pos logit contributions
ppa+0.070
oll+0.063
 grandmother+0.062
isk+0.062
apa+0.062
Neg logit contributions
 score-0.075
Mine-0.075
Stop-0.074
 hill-0.073
 race-0.072
Token an
Token ID281
Feature activation+1.94e+00

Pos logit contributions
omm+0.184
ppa+0.180
ascal+0.172
iqu+0.168
oll+0.168
Neg logit contributions
 score-0.123
!"-0.116
 race-0.112
 kick-0.109
 want-0.108
Token interesting
Token ID3499
Feature activation+2.15e+00

Pos logit contributions
Stop+0.036
Mine+0.036
 score+0.035
 hill+0.033
OK+0.033
Neg logit contributions
 grandmother-0.038
ppa-0.038
 granddaughter-0.038
oll-0.035
apa-0.035
Token gift
Token ID6979
Feature activation-1.97e+00

Pos logit contributions
Stop+0.043
Mine+0.040
Run+0.039
atch+0.039
Hey+0.037
Neg logit contributions
 grandmother-0.047
 granddaughter-0.044
 prayers-0.041
 parents-0.040
 Mother-0.038
Token that
Token ID326
Feature activation+9.94e-01

Pos logit contributions
ppa+0.053
aj+0.051
apa+0.050
 granddaughter+0.049
uf+0.049
Neg logit contributions
 score-0.022
!"-0.022
 hill-0.021
 race-0.021
Mine-0.020
Token would
Token ID561
Feature activation+1.67e+00

Pos logit contributions
Stop+0.014
Mine+0.014
 score+0.014
 hill+0.013
OK+0.013
Neg logit contributions
ppa-0.024
 grandmother-0.023
 granddaughter-0.023
aj-0.023
apa-0.022
Token she
Token ID673
Feature activation+8.86e-01

Pos logit contributions
ppa+0.024
isk+0.023
aj+0.023
urger+0.023
 granddaughter+0.022
Neg logit contributions
www-0.013
Stop-0.013
OK-0.013
!"-0.013
Mine-0.013
Token decided
Token ID3066
Feature activation+3.81e-01

Pos logit contributions
Stop+0.014
Mine+0.013
 score+0.013
OK+0.013
 race+0.013
Neg logit contributions
 grandmother-0.019
ppa-0.019
 granddaughter-0.019
apa-0.019
aj-0.019
Token to
Token ID284
Feature activation-2.98e-01

Pos logit contributions
Stop+0.005
!"+0.005
 score+0.005
Mine+0.005
 race+0.005
Neg logit contributions
ppa-0.009
oll-0.009
apa-0.009
aj-0.009
omm-0.009
Token give
Token ID1577
Feature activation-7.92e-01

Pos logit contributions
ppa+0.007
 grandmother+0.007
 granddaughter+0.007
apa+0.006
 patients+0.006
Neg logit contributions
Stop-0.005
Mine-0.005
 score-0.004
 hill-0.004
Run-0.004
Token her
Token ID607
Feature activation-5.17e+00

Pos logit contributions
aj+0.019
ppa+0.018
iqu+0.018
lor+0.018
ovable+0.018
Neg logit contributions
Mine-0.012
Stop-0.012
 turns-0.011
 wind-0.011
!"-0.011
Token daughter
Token ID4957
Feature activation-7.76e+00

Pos logit contributions
ppa+0.135
rina+0.127
isk+0.126
ascal+0.125
omm+0.123
Neg logit contributions
 race-0.071
 game-0.069
!"-0.068
 score-0.067
Mine-0.067
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
ppa+0.206
omm+0.206
ascal+0.206
rina+0.202
iqu+0.201
Neg logit contributions
 score-0.093
!"-0.090
 race-0.086
 kick-0.084
 want-0.084
Token teaspoon
Token ID22326
Feature activation-1.74e+00

Pos logit contributions
 score+0.014
 race+0.014
Mine+0.014
 challenge+0.013
 hill+0.013
Neg logit contributions
ppa-0.026
rina-0.024
isk-0.024
oll-0.024
aj-0.024
Token.
Token ID13
Feature activation+8.69e-01

Pos logit contributions
ppa+0.046
 grandmother+0.044
aj+0.044
 granddaughter+0.043
apa+0.043
Neg logit contributions
!"-0.020
 score-0.020
 hill-0.019
Mine-0.019
 race-0.018
Token The
Token ID383
Feature activation+1.88e-01

Pos logit contributions
Mine+0.014
Stop+0.013
 hill+0.012
OK+0.012
!"+0.012
Neg logit contributions
ppa-0.022
oll-0.020
rina-0.019
ovable-0.019
urger-0.019
Token little
Token ID1310
Feature activation+2.13e+00

Pos logit contributions
Mine+0.004
OK+0.004
 hill+0.004
 wind+0.004
Stop+0.004
Neg logit contributions
ppa-0.003
isk-0.003
aj-0.003
apa-0.003
rina-0.002
Token.
Token ID13
Feature activation+6.23e-01

Pos logit contributions
Mine+0.041
Stop+0.041
 score+0.039
 hill+0.037
OK+0.037
Neg logit contributions
ppa-0.075
 grandmother-0.074
 granddaughter-0.073
apa-0.070
oll-0.070
Token He
Token ID679
Feature activation+7.02e-01

Pos logit contributions
Mine+0.012
Stop+0.012
 score+0.012
 hill+0.011
OK+0.011
Neg logit contributions
ppa-0.012
 grandmother-0.011
 granddaughter-0.011
oll-0.011
aj-0.011
Token then
Token ID788
Feature activation+1.04e+00

Pos logit contributions
Mine+0.009
 score+0.009
Stop+0.009
OK+0.009
!"+0.009
Neg logit contributions
ppa-0.017
 grandmother-0.016
 granddaughter-0.016
aj-0.016
oll-0.016
Token gave
Token ID2921
Feature activation-3.47e-01

Pos logit contributions
Stop+0.016
Mine+0.016
 score+0.016
 hill+0.015
OK+0.015
Neg logit contributions
ppa-0.024
 grandmother-0.023
 granddaughter-0.022
apa-0.022
aj-0.022
Token her
Token ID607
Feature activation-6.07e+00

Pos logit contributions
ppa+0.008
aj+0.008
 grandmother+0.008
 granddaughter+0.008
 former+0.007
Neg logit contributions
Mine-0.005
Stop-0.005
 score-0.005
!"-0.005
No-0.005
Token daughter
Token ID4957
Feature activation-7.76e+00

Pos logit contributions
ppa+0.149
aj+0.145
iqu+0.144
ascal+0.142
 former+0.142
Neg logit contributions
!"-0.081
Mine-0.081
No-0.078
 score-0.077
 turns-0.073
Token some
Token ID617
Feature activation-5.64e-01

Pos logit contributions
ppa+0.182
 grandmother+0.173
aj+0.172
rina+0.171
iqu+0.171
Neg logit contributions
 score-0.115
Mine-0.114
 kick-0.109
!"-0.108
No-0.101
Token medicine
Token ID9007
Feature activation+4.05e-02

Pos logit contributions
ppa+0.011
 grandmother+0.011
 granddaughter+0.011
apa+0.010
aj+0.010
Neg logit contributions
Mine-0.010
Stop-0.010
 score-0.010
 hill-0.010
OK-0.010
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
Mine+0.001
Stop+0.001
 score+0.001
 hill+0.001
OK+0.000
Neg logit contributions
ppa-0.001
 grandmother-0.001
 granddaughter-0.001
apa-0.001
aj-0.001
Token take
Token ID1011
Feature activation+6.79e-01

Pos logit contributions
 score+0.000
Stop+0.000
Mine+0.000
!"+0.000
 kick+0.000
Neg logit contributions
ppa-0.000
 grandmother-0.000
 granddaughter-0.000
aj-0.000
oll-0.000
Token every
Token ID790
Feature activation+5.12e-01

Pos logit contributions
Stop+0.010
Mine+0.009
 score+0.009
!"+0.009
 turns+0.009
Neg logit contributions
ppa-0.017
aj-0.016
 grandmother-0.016
 granddaughter-0.016
iqu-0.015
Tokenummy
Token ID13513
Feature activation+9.33e-01

Pos logit contributions
ppa+0.042
 patients+0.041
 granddaughter+0.039
 grandmother+0.039
isk+0.038
Neg logit contributions
www-0.027
Mine-0.027
 score-0.027
 hill-0.027
!"-0.026
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
Mine+0.015
Stop+0.015
 score+0.014
OK+0.014
No+0.013
Neg logit contributions
ppa-0.021
 grandmother-0.021
 granddaughter-0.021
apa-0.020
oll-0.019
Token Daddy
Token ID26926
Feature activation+3.22e-01

Pos logit contributions
 score+0.011
Mine+0.011
Stop+0.011
 hill+0.010
atch+0.010
Neg logit contributions
ppa-0.011
 granddaughter-0.011
 grandmother-0.011
apa-0.010
 Mother-0.010
Token took
Token ID1718
Feature activation+5.22e-01

Pos logit contributions
Mine+0.006
Stop+0.006
 score+0.006
OK+0.005
 challenge+0.005
Neg logit contributions
ppa-0.007
 grandmother-0.006
 granddaughter-0.006
apa-0.006
oll-0.006
Token their
Token ID511
Feature activation-3.22e+00

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.008
 race+0.008
 turns+0.008
Neg logit contributions
ppa-0.012
 grandmother-0.011
aj-0.011
 granddaughter-0.011
omm-0.010
Token daughter
Token ID4957
Feature activation-7.75e+00

Pos logit contributions
ppa+0.077
oll+0.070
isk+0.068
aj+0.068
omm+0.068
Neg logit contributions
 race-0.053
 hill-0.052
Mine-0.052
 score-0.051
 challenge-0.050
Token to
Token ID284
Feature activation-1.88e+00

Pos logit contributions
ppa+0.205
omm+0.190
oll+0.187
ascal+0.185
rina+0.184
Neg logit contributions
 score-0.105
 race-0.099
Mine-0.094
 hill-0.094
 kick-0.094
Token the
Token ID262
Feature activation-8.78e-01

Pos logit contributions
ppa+0.049
oll+0.047
aj+0.046
apa+0.046
 grandmother+0.046
Neg logit contributions
Mine-0.025
 score-0.025
Stop-0.024
 race-0.023
 hill-0.022
Token beach
Token ID10481
Feature activation-1.82e+00

Pos logit contributions
ppa+0.026
oll+0.025
apa+0.024
aj+0.024
rina+0.024
Neg logit contributions
Mine-0.009
 hill-0.009
Stop-0.009
 score-0.009
 race-0.009
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
ppa+0.044
 grandmother+0.043
 granddaughter+0.043
apa+0.041
oll+0.041
Neg logit contributions
Mine-0.026
Stop-0.025
 score-0.025
 hill-0.024
OK-0.023
Token It
Token ID632
Feature activation+2.48e+00

Pos logit contributions
Mine+0.024
Stop+0.024
 score+0.023
 hill+0.022
OK+0.022
Neg logit contributions
ppa-0.030
 grandmother-0.028
 granddaughter-0.028
oll-0.027
apa-0.027
Token sorry
Token ID7926
Feature activation+1.46e+00

Pos logit contributions
ppa+0.058
isk+0.056
 former+0.055
 grandmother+0.055
aj+0.055
Neg logit contributions
Stop-0.032
 score-0.030
Mine-0.029
!"-0.029
 race-0.028
Token to
Token ID284
Feature activation-2.39e+00

Pos logit contributions
Mine+0.027
Stop+0.026
 score+0.026
 hill+0.025
Run+0.025
Neg logit contributions
ppa-0.029
 grandmother-0.029
 granddaughter-0.029
aj-0.027
apa-0.027
Token Ben
Token ID3932
Feature activation-2.73e+00

Pos logit contributions
aj+0.057
oll+0.056
ppa+0.056
apa+0.056
urger+0.055
Neg logit contributions
Stop-0.037
 score-0.036
Mine-0.036
 race-0.033
 kick-0.032
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
ppa+0.081
 grandmother+0.079
aj+0.078
 granddaughter+0.078
 former+0.077
Neg logit contributions
Stop-0.023
 score-0.023
!"-0.021
Mine-0.021
OK-0.020
Token gave
Token ID2921
Feature activation-1.57e+00

Pos logit contributions
ppa+0.039
 grandmother+0.037
 granddaughter+0.036
apa+0.036
aj+0.036
Neg logit contributions
Stop-0.029
Mine-0.029
 score-0.028
OK-0.027
 race-0.027
Token him
Token ID683
Feature activation-7.75e+00

Pos logit contributions
ppa+0.040
aj+0.039
 grandmother+0.039
 former+0.038
omm+0.038
Neg logit contributions
Stop-0.022
Mine-0.022
 score-0.020
Hey-0.019
 turns-0.018
Token back
Token ID736
Feature activation-3.64e+00

Pos logit contributions
ppa+0.180
omm+0.176
 grandmother+0.175
rina+0.172
 former+0.171
Neg logit contributions
 score-0.120
Mine-0.119
Stop-0.117
!"-0.113
 turns-0.111
Token the
Token ID262
Feature activation-6.41e-01

Pos logit contributions
omm+0.104
oll+0.103
ppa+0.103
ascal+0.100
lor+0.100
Neg logit contributions
!"-0.041
 score-0.033
 past-0.033
Stop-0.033
 turns-0.033
Token shiny
Token ID22441
Feature activation+2.45e+00

Pos logit contributions
ppa+0.015
 grandmother+0.015
 granddaughter+0.014
apa+0.014
aj+0.014
Neg logit contributions
Mine-0.010
Stop-0.010
 score-0.009
 hill-0.009
OK-0.009
Token gold
Token ID3869
Feature activation-1.93e+00

Pos logit contributions
Stop+0.042
Run+0.040
www+0.040
atch+0.038
Mine+0.037
Neg logit contributions
 granddaughter-0.053
 grandmother-0.052
 parents-0.047
aj-0.046
 throat-0.045
Token block
Token ID2512
Feature activation-2.51e+00

Pos logit contributions
ppa+0.040
 grandmother+0.040
 granddaughter+0.039
oll+0.039
apa+0.038
Neg logit contributions
Stop-0.035
Mine-0.034
Run-0.032
www-0.031
atch-0.030
Token to
Token ID284
Feature activation-2.90e+00

Pos logit contributions
Stop+0.010
Mine+0.010
 score+0.010
OK+0.009
 hill+0.009
Neg logit contributions
ppa-0.016
 grandmother-0.015
 granddaughter-0.015
apa-0.014
oll-0.014
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
aj+0.072
ppa+0.071
oll+0.069
apa+0.068
iqu+0.067
Neg logit contributions
Stop-0.044
 score-0.043
Mine-0.042
 race-0.038
 kick-0.038
Token boy
Token ID2933
Feature activation-4.96e+00

Pos logit contributions
 grandmother+0.031
ppa+0.031
 granddaughter+0.030
oll+0.028
 patients+0.028
Neg logit contributions
Mine-0.017
Stop-0.016
 score-0.016
atch-0.015
www-0.015
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
ppa+0.139
aj+0.137
 grandmother+0.136
apa+0.133
omm+0.132
Neg logit contributions
 score-0.052
!"-0.048
Stop-0.047
 race-0.046
Mine-0.045
Token give
Token ID1577
Feature activation-1.12e+00

Pos logit contributions
ppa+0.041
aj+0.039
 grandmother+0.039
apa+0.039
oll+0.039
Neg logit contributions
Stop-0.027
 score-0.026
Mine-0.026
 race-0.025
OK-0.025
Token him
Token ID683
Feature activation-7.72e+00

Pos logit contributions
ppa+0.026
aj+0.025
 former+0.025
 grandmother+0.024
 granddaughter+0.024
Neg logit contributions
Stop-0.018
Mine-0.018
 score-0.017
OK-0.016
 turns-0.016
Token her
Token ID607
Feature activation-5.96e+00

Pos logit contributions
ppa+0.171
 grandmother+0.170
omm+0.167
rina+0.164
apa+0.159
Neg logit contributions
Stop-0.130
Mine-0.126
 score-0.125
 race-0.124
OK-0.120
Token sc
Token ID629
Feature activation-2.94e+00

Pos logit contributions
ppa+0.138
apa+0.133
 grandmother+0.131
omm+0.130
rina+0.128
Neg logit contributions
Stop-0.092
 race-0.091
Mine-0.090
 score-0.089
!"-0.088
Tokenooter
Token ID25141
Feature activation-3.92e+00

Pos logit contributions
 grandmother+0.053
ppa+0.052
rina+0.051
apa+0.050
 granddaughter+0.050
Neg logit contributions
 score-0.056
Stop-0.055
atch-0.055
OK-0.055
Mine-0.054
Token as
Token ID355
Feature activation-1.28e+00

Pos logit contributions
ppa+0.087
 grandmother+0.085
apa+0.082
 granddaughter+0.082
omm+0.079
Neg logit contributions
Stop-0.064
Mine-0.063
 score-0.063
OK-0.060
 race-0.060
Token a
Token ID257
Feature activation+6.63e-01

Pos logit contributions
ppa+0.028
apa+0.026
 grandmother+0.026
omm+0.026
rina+0.026
Neg logit contributions
Stop-0.022
 score-0.021
Mine-0.021
OK-0.020
 race-0.020
Token decided
Token ID3066
Feature activation+2.18e-01

Pos logit contributions
Mine+0.021
Stop+0.021
 score+0.021
 hill+0.020
atch+0.019
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
apa-0.025
oll-0.025
Token to
Token ID284
Feature activation+3.60e-01

Pos logit contributions
Stop+0.003
Mine+0.003
 score+0.003
OK+0.003
 race+0.003
Neg logit contributions
ppa-0.005
oll-0.005
 grandmother-0.005
apa-0.005
aj-0.005
Token bring
Token ID2222
Feature activation-5.88e-01

Pos logit contributions
Stop+0.005
Mine+0.005
 score+0.005
 hill+0.005
Run+0.005
Neg logit contributions
 grandmother-0.008
ppa-0.008
 granddaughter-0.008
apa-0.008
 patients-0.008
Token their
Token ID511
Feature activation-3.25e+00

Pos logit contributions
ppa+0.015
aj+0.014
omm+0.014
oll+0.014
ascal+0.014
Neg logit contributions
Mine-0.008
 turns-0.008
Stop-0.008
 score-0.008
!"-0.008
Token little
Token ID1310
Feature activation+5.00e-01

Pos logit contributions
ppa+0.079
oll+0.074
omm+0.071
apa+0.070
rina+0.068
Neg logit contributions
 race-0.053
 hill-0.050
 score-0.050
 game-0.049
 challenge-0.049
Token child
Token ID1200
Feature activation-7.71e+00

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.009
atch+0.009
OK+0.008
Neg logit contributions
 grandmother-0.010
 granddaughter-0.010
ppa-0.010
 patients-0.009
apa-0.009
Token with
Token ID351
Feature activation-1.11e+00

Pos logit contributions
ppa+0.175
omm+0.173
oll+0.164
ascal+0.163
 grandmother+0.163
Neg logit contributions
 score-0.121
 race-0.119
!"-0.114
 hill-0.111
 game-0.111
Token them
Token ID606
Feature activation-3.96e+00

Pos logit contributions
ppa+0.028
aj+0.026
omm+0.026
oll+0.025
rina+0.025
Neg logit contributions
Mine-0.017
Stop-0.016
 score-0.016
 race-0.015
 turns-0.015
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
ppa+0.115
omm+0.113
apa+0.109
iqu+0.109
oll+0.108
Neg logit contributions
!"-0.038
 score-0.033
 hill-0.033
 race-0.031
 game-0.029
Token the
Token ID262
Feature activation-8.04e-01

Pos logit contributions
ppa+0.030
oll+0.029
aj+0.029
omm+0.028
apa+0.028
Neg logit contributions
 score-0.015
 race-0.014
Mine-0.014
Stop-0.014
!"-0.013
Token station
Token ID4429
Feature activation-2.62e+00

Pos logit contributions
ppa+0.018
oll+0.017
apa+0.016
aj+0.016
 grandmother+0.016
Neg logit contributions
Stop-0.014
Mine-0.014
 score-0.014
 race-0.014
 hill-0.014
Token ran
Token ID4966
Feature activation+3.02e+00

Pos logit contributions
ppa+0.009
 grandmother+0.009
 granddaughter+0.009
aj+0.008
apa+0.008
Neg logit contributions
Stop-0.005
Mine-0.005
 score-0.005
Run-0.005
 hill-0.005
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
Mine+0.060
Stop+0.058
 score+0.057
 hill+0.056
atch+0.056
Neg logit contributions
ppa-0.055
 grandmother-0.055
 granddaughter-0.054
apa-0.050
 patients-0.050
Token Max
Token ID5436
Feature activation-3.05e+00

Pos logit contributions
ppa+0.041
 grandmother+0.040
 granddaughter+0.040
apa+0.038
aj+0.038
Neg logit contributions
Mine-0.030
Stop-0.030
 score-0.029
 hill-0.028
OK-0.027
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
ppa+0.079
 grandmother+0.077
 granddaughter+0.076
apa+0.073
oll+0.073
Neg logit contributions
Stop-0.039
Mine-0.039
 score-0.036
OK-0.035
 hill-0.034
Token picked
Token ID6497
Feature activation-5.39e-01

Pos logit contributions
 former+0.044
oll+0.043
 grandmother+0.043
omm+0.043
ppa+0.043
Neg logit contributions
Stop-0.033
 score-0.030
Mine-0.030
Run-0.030
!"-0.029
Token him
Token ID683
Feature activation-7.71e+00

Pos logit contributions
ppa+0.012
 grandmother+0.012
 granddaughter+0.011
apa+0.011
aj+0.011
Neg logit contributions
Mine-0.009
Stop-0.009
 score-0.009
 hill-0.008
OK-0.008
Token up
Token ID510
Feature activation-5.93e-01

Pos logit contributions
ppa+0.218
 grandmother+0.214
 granddaughter+0.211
aj+0.205
apa+0.205
Neg logit contributions
 score-0.079
Mine-0.078
Stop-0.078
!"-0.075
 race-0.072
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
ppa+0.017
 granddaughter+0.016
 grandmother+0.016
apa+0.016
oll+0.016
Neg logit contributions
 score-0.006
!"-0.006
 hill-0.005
Mine-0.005
OK-0.005
Token She
Token ID1375
Feature activation-4.64e-01

Pos logit contributions
Stop+0.017
Mine+0.017
OK+0.016
 hill+0.015
No+0.015
Neg logit contributions
ppa-0.027
oll-0.025
urger-0.024
apa-0.024
 granddaughter-0.024
Token checked
Token ID10667
Feature activation+3.27e-01

Pos logit contributions
 former+0.013
 Him+0.012
omm+0.012
apa+0.012
oll+0.012
Neg logit contributions
 tries-0.005
Stop-0.005
 cl-0.005
Give-0.005
OK-0.005
Token his
Token ID465
Feature activation-3.94e+00

Pos logit contributions
Stop+0.005
Mine+0.005
 score+0.005
 hill+0.005
OK+0.005
Neg logit contributions
ppa-0.008
 granddaughter-0.007
 grandmother-0.007
apa-0.007
aj-0.007
TokenThe
Token ID464
Feature activation+6.84e-02

Pos logit contributions
ppa+0.010
 grandmother+0.009
 granddaughter+0.009
apa+0.009
aj+0.009
Neg logit contributions
Mine-0.009
Stop-0.009
 score-0.009
 hill-0.009
OK-0.009
Token young
Token ID1862
Feature activation+8.82e-01

Pos logit contributions
Mine+0.001
Stop+0.001
 score+0.001
 hill+0.001
OK+0.001
Neg logit contributions
ppa-0.002
 grandmother-0.002
 granddaughter-0.002
apa-0.001
aj-0.001
Token boy
Token ID2933
Feature activation-6.09e-01

Pos logit contributions
Stop+0.012
 AW+0.012
Mine+0.012
atch+0.012
www+0.012
Neg logit contributions
 granddaughter-0.023
 grandmother-0.022
 parents-0.021
 owners-0.020
ppa-0.020
Token felt
Token ID2936
Feature activation+9.16e-01

Pos logit contributions
ppa+0.014
 grandmother+0.014
 granddaughter+0.014
apa+0.013
aj+0.013
Neg logit contributions
Mine-0.009
Stop-0.009
 score-0.009
 hill-0.008
OK-0.008
Token so
Token ID523
Feature activation-1.36e+00

Pos logit contributions
Stop+0.017
Mine+0.017
 score+0.017
 race+0.016
Run+0.016
Neg logit contributions
ppa-0.018
 grandmother-0.018
 granddaughter-0.018
aj-0.017
apa-0.017
Token sad
Token ID6507
Feature activation+4.23e+00

Pos logit contributions
ppa+0.024
 grandmother+0.023
 granddaughter+0.023
apa+0.022
oll+0.021
Neg logit contributions
Mine-0.029
Stop-0.029
 score-0.028
 hill-0.027
OK-0.027
Token!
Token ID0
Feature activation+2.75e-01

Pos logit contributions
Mine+0.076
Stop+0.074
 score+0.070
OK+0.069
atch+0.068
Neg logit contributions
 grandmother-0.088
ppa-0.086
 granddaughter-0.085
oll-0.083
apa-0.081
Token He
Token ID679
Feature activation+4.17e-01

Pos logit contributions
Mine+0.004
Stop+0.004
 score+0.003
 hill+0.003
OK+0.003
Neg logit contributions
ppa-0.007
 grandmother-0.007
 granddaughter-0.007
aj-0.007
 patients-0.007
Token tried
Token ID3088
Feature activation+2.19e+00

Pos logit contributions
Stop+0.007
Mine+0.007
 score+0.006
 hill+0.006
OK+0.006
Neg logit contributions
ppa-0.009
 grandmother-0.009
 granddaughter-0.009
apa-0.009
aj-0.009
Token to
Token ID284
Feature activation-9.09e-01

Pos logit contributions
Mine+0.038
Stop+0.038
 score+0.037
 hill+0.036
 race+0.035
Neg logit contributions
ppa-0.047
 grandmother-0.045
 granddaughter-0.044
apa-0.043
aj-0.043
Token patch
Token ID8529
Feature activation-1.10e+00

Pos logit contributions
ppa+0.022
 grandmother+0.021
 granddaughter+0.021
 patients+0.020
 parents+0.020
Neg logit contributions
Mine-0.013
Stop-0.013
www-0.012
OK-0.012
atch-0.012
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
ppa+0.041
 grandmother+0.039
apa+0.039
 granddaughter+0.038
aj+0.037
Neg logit contributions
Stop-0.041
Mine-0.041
 score-0.040
!"-0.038
Run-0.038
Token time
Token ID640
Feature activation+5.65e-01

Pos logit contributions
Stop+0.014
Mine+0.014
 score+0.013
 hill+0.012
Run+0.012
Neg logit contributions
ppa-0.028
 grandmother-0.027
 granddaughter-0.027
aj-0.027
apa-0.027
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
Mine+0.008
Stop+0.008
 score+0.007
OK+0.007
 hill+0.007
Neg logit contributions
ppa-0.014
 grandmother-0.014
 granddaughter-0.013
aj-0.013
apa-0.013
Token in
Token ID287
Feature activation+6.19e-01

Pos logit contributions
ppa+0.043
 grandmother+0.042
 granddaughter+0.042
apa+0.040
oll+0.040
Neg logit contributions
Mine-0.033
Stop-0.033
 score-0.032
 hill-0.030
atch-0.030
Token a
Token ID257
Feature activation+8.94e-01

Pos logit contributions
Mine+0.013
Stop+0.013
 score+0.012
 hill+0.012
 race+0.012
Neg logit contributions
ppa-0.011
 grandmother-0.011
 granddaughter-0.011
apa-0.010
aj-0.010
Token small
Token ID1402
Feature activation+4.64e+00

Pos logit contributions
Stop+0.016
Mine+0.016
 score+0.015
Run+0.015
www+0.015
Neg logit contributions
 grandmother-0.018
ppa-0.018
 granddaughter-0.018
 patients-0.017
apa-0.017
Token house
Token ID2156
Feature activation+2.35e-01

Pos logit contributions
Stop+0.084
Mine+0.083
Run+0.079
OK+0.078
ok+0.076
Neg logit contributions
 grandmother-0.099
 granddaughter-0.097
ppa-0.090
 patients-0.089
 parents-0.088
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
Mine+0.003
Stop+0.003
 score+0.003
OK+0.003
Run+0.003
Neg logit contributions
 grandmother-0.006
ppa-0.006
 granddaughter-0.006
apa-0.005
oll-0.005
Token lived
Token ID5615
Feature activation-3.19e+00

Pos logit contributions
 grandmother+0.043
ppa+0.043
 granddaughter+0.042
apa+0.040
oll+0.040
Neg logit contributions
Mine-0.037
 score-0.036
Stop-0.036
atch-0.034
 challenge-0.033
Token a
Token ID257
Feature activation+8.09e-01

Pos logit contributions
ppa+0.065
 grandmother+0.064
 granddaughter+0.062
apa+0.061
aj+0.060
Neg logit contributions
Stop-0.057
Mine-0.057
 score-0.056
 hill-0.054
 race-0.053
Token funny
Token ID8258
Feature activation+3.26e+00

Pos logit contributions
Stop+0.013
Mine+0.013
 score+0.013
 hill+0.012
OK+0.012
Neg logit contributions
ppa-0.018
 grandmother-0.017
 granddaughter-0.017
apa-0.017
aj-0.017
Token living
Token ID2877
Feature activation+1.54e+00

Pos logit contributions
ppa+0.005
apa+0.005
 grandmother+0.005
oll+0.005
 granddaughter+0.005
Neg logit contributions
Mine-0.003
Stop-0.003
 hill-0.003
 score-0.003
 race-0.003
Token room
Token ID2119
Feature activation-5.59e-01

Pos logit contributions
Mine+0.031
Stop+0.031
 score+0.030
Run+0.029
www+0.029
Neg logit contributions
 grandmother-0.028
ppa-0.028
 granddaughter-0.028
aj-0.026
 patients-0.025
Token and
Token ID290
Feature activation+6.13e-01

Pos logit contributions
 grandmother+0.014
ppa+0.014
 granddaughter+0.013
apa+0.013
aj+0.013
Neg logit contributions
Stop-0.008
Mine-0.008
 score-0.007
Run-0.007
OK-0.007
Token saw
Token ID2497
Feature activation-1.66e+00

Pos logit contributions
Mine+0.008
Stop+0.008
 score+0.008
 hill+0.008
 race+0.008
Neg logit contributions
ppa-0.015
 grandmother-0.015
omm-0.014
 granddaughter-0.014
apa-0.014
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
ppa+0.046
aj+0.043
isk+0.042
ascal+0.042
urger+0.042
Neg logit contributions
Mine-0.022
Stop-0.020
 wind-0.019
 it-0.019
 turns-0.019
Token big
Token ID1263
Feature activation+4.66e+00

Pos logit contributions
Stop+0.023
Mine+0.023
atch+0.022
 score+0.021
Run+0.021
Neg logit contributions
 grandmother-0.029
 granddaughter-0.028
ppa-0.028
aj-0.026
 patients-0.026
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
Stop+0.097
ok+0.094
OK+0.093
atch+0.090
Mine+0.089
Neg logit contributions
 grandmother-0.094
 granddaughter-0.085
 aunt-0.080
 parents-0.078
 Aunt-0.077
Token filthy
Token ID35677
Feature activation+3.38e+00

Pos logit contributions
 grandmother+0.037
 granddaughter+0.037
ppa+0.036
aj+0.035
oll+0.034
Neg logit contributions
Stop-0.025
Mine-0.025
 score-0.024
atch-0.024
OK-0.023
Token cushion
Token ID33579
Feature activation-1.27e-01

Pos logit contributions
Stop+0.067
Mine+0.066
OK+0.064
Run+0.063
atch+0.063
Neg logit contributions
 grandmother-0.066
 granddaughter-0.063
ppa-0.058
omm-0.058
aj-0.057
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
ppa+0.003
 grandmother+0.003
 granddaughter+0.003
oll+0.003
aj+0.002
Neg logit contributions
Stop-0.002
Mine-0.002
 score-0.002
OK-0.002
atch-0.002
Token She
Token ID1375
Feature activation+4.38e-01

Pos logit contributions
Mine+0.003
Stop+0.003
 hill+0.003
Run+0.003
OK+0.003
Neg logit contributions
ppa-0.005
urger-0.005
aj-0.005
 grandmother-0.004
oll-0.004
Token.
Token ID13
Feature activation-3.26e-01

Pos logit contributions
Mine+0.003
Stop+0.003
 score+0.002
 hill+0.002
OK+0.002
Neg logit contributions
ppa-0.004
 grandmother-0.004
 granddaughter-0.004
apa-0.004
aj-0.004
Token The
Token ID383
Feature activation+5.13e-01

Pos logit contributions
ppa+0.008
oll+0.008
urger+0.007
 patients+0.007
ascal+0.007
Neg logit contributions
Stop-0.005
Mine-0.005
OK-0.005
No-0.004
Wow-0.004
Token bear
Token ID6842
Feature activation-1.47e+00

Pos logit contributions
Stop+0.008
Mine+0.008
 hill+0.008
 score+0.008
 race+0.007
Neg logit contributions
ppa-0.012
apa-0.012
 granddaughter-0.011
 grandmother-0.011
aj-0.011
Token was
Token ID373
Feature activation-2.38e+00

Pos logit contributions
ppa+0.034
 grandmother+0.034
 granddaughter+0.033
apa+0.032
aj+0.032
Neg logit contributions
Stop-0.022
Mine-0.022
 score-0.021
 hill-0.021
OK-0.020
Token so
Token ID523
Feature activation-7.49e-01

Pos logit contributions
ppa+0.053
apa+0.051
 grandmother+0.051
 granddaughter+0.050
aj+0.050
Neg logit contributions
 score-0.040
Stop-0.037
 race-0.037
Mine-0.037
 kick-0.036
Token surprised
Token ID6655
Feature activation+4.51e+00

Pos logit contributions
ppa+0.014
 grandmother+0.014
 granddaughter+0.014
apa+0.013
aj+0.013
Neg logit contributions
Mine-0.015
Stop-0.015
 score-0.014
 hill-0.014
OK-0.014
Token it
Token ID340
Feature activation-4.90e-02

Pos logit contributions
Mine+0.087
 score+0.081
 hill+0.080
Stop+0.079
 race+0.077
Neg logit contributions
 grandmother-0.087
ppa-0.087
 granddaughter-0.085
apa-0.080
 exactly-0.079
Token just
Token ID655
Feature activation-3.34e-01

Pos logit contributions
ppa+0.001
 grandmother+0.001
 granddaughter+0.001
oll+0.001
rina+0.001
Neg logit contributions
Mine-0.001
Stop-0.001
OK-0.001
atch-0.001
 score-0.001
Token stopped
Token ID5025
Feature activation+2.68e-01

Pos logit contributions
ppa+0.007
 grandmother+0.007
 granddaughter+0.007
apa+0.006
oll+0.006
Neg logit contributions
Mine-0.006
Stop-0.006
 score-0.006
 hill-0.005
OK-0.005
Token still
Token ID991
Feature activation+9.63e-01

Pos logit contributions
Mine+0.004
Stop+0.004
 score+0.004
OK+0.004
 hill+0.004
Neg logit contributions
ppa-0.006
 grandmother-0.006
 granddaughter-0.006
apa-0.005
aj-0.005
Token watching
Token ID4964
Feature activation-5.65e-01

Pos logit contributions
Mine+0.015
Stop+0.015
OK+0.013
atch+0.013
 score+0.013
Neg logit contributions
 grandmother-0.022
ppa-0.022
 granddaughter-0.021
apa-0.020
oll-0.020
Token
Token ID198
Feature activation+7.65e-01

Pos logit contributions
ppa+0.004
 grandmother+0.003
 granddaughter+0.003
oll+0.003
apa+0.003
Neg logit contributions
Stop-0.003
Mine-0.003
 score-0.003
 hill-0.003
OK-0.003
TokenL
Token ID43
Feature activation+1.36e-03

Pos logit contributions
 score+0.013
 challenge+0.013
Mine+0.012
 race+0.012
 hill+0.012
Neg logit contributions
ppa-0.016
apa-0.015
 grandmother-0.014
 granddaughter-0.014
aj-0.014
Tokenily
Token ID813
Feature activation+1.10e+00

Pos logit contributions
Stop+0.000
Mine+0.000
 score+0.000
 hill+0.000
OK+0.000
Neg logit contributions
ppa-0.000
 grandmother-0.000
 granddaughter-0.000
apa-0.000
aj-0.000
Token laughed
Token ID13818
Feature activation+2.34e+00

Pos logit contributions
Mine+0.016
Stop+0.016
 score+0.015
 hill+0.014
OK+0.014
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
apa-0.025
aj-0.025
Token and
Token ID290
Feature activation+6.88e-01

Pos logit contributions
Mine+0.035
Stop+0.035
 score+0.033
 hill+0.032
atch+0.032
Neg logit contributions
ppa-0.054
 grandmother-0.054
 granddaughter-0.054
oll-0.051
apa-0.051
Token cl
Token ID537
Feature activation+4.22e+00

Pos logit contributions
Mine+0.011
Stop+0.011
 hill+0.010
atch+0.010
 score+0.010
Neg logit contributions
ppa-0.015
 granddaughter-0.015
 grandmother-0.015
aj-0.014
oll-0.014
Tokenapped
Token ID6320
Feature activation+1.38e+00

Pos logit contributions
Stop+0.095
 hill+0.094
Mine+0.093
 score+0.092
 race+0.089
Neg logit contributions
ppa-0.068
 granddaughter-0.066
 grandmother-0.065
oll-0.063
apa-0.063
Token her
Token ID607
Feature activation-3.23e+00

Pos logit contributions
Stop+0.029
Mine+0.029
 hill+0.028
 score+0.027
atch+0.027
Neg logit contributions
 grandmother-0.024
ppa-0.024
 granddaughter-0.023
apa-0.022
aj-0.022
Token hands
Token ID2832
Feature activation-7.53e-01

Pos logit contributions
ppa+0.058
 grandmother+0.058
 granddaughter+0.058
apa+0.055
aj+0.055
Neg logit contributions
Stop-0.067
Mine-0.066
 score-0.062
 hill-0.062
atch-0.061
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
ppa+0.016
 grandmother+0.015
 granddaughter+0.015
oll+0.015
apa+0.014
Neg logit contributions
Stop-0.013
Mine-0.013
OK-0.012
 hill-0.012
www-0.012
Token She
Token ID1375
Feature activation+1.10e+00

Pos logit contributions
Mine+0.015
Stop+0.015
 score+0.013
OK+0.013
 hill+0.013
Neg logit contributions
ppa-0.024
urger-0.022
 grandmother-0.022
rina-0.022
apa-0.021
Token and
Token ID290
Feature activation+3.10e-02

Pos logit contributions
Mine+0.020
Stop+0.020
 score+0.019
 hill+0.018
OK+0.017
Neg logit contributions
ppa-0.038
 grandmother-0.037
 granddaughter-0.036
apa-0.035
aj-0.035
Token Lily
Token ID20037
Feature activation+1.85e+00

Pos logit contributions
Stop+0.001
Mine+0.001
 score+0.000
OK+0.000
 race+0.000
Neg logit contributions
ppa-0.001
 grandmother-0.001
apa-0.001
oll-0.001
aj-0.001
Token were
Token ID547
Feature activation-2.89e+00

Pos logit contributions
Mine+0.028
Stop+0.028
 score+0.027
 hill+0.026
OK+0.026
Neg logit contributions
ppa-0.043
 grandmother-0.042
 granddaughter-0.041
apa-0.040
aj-0.039
Token scared
Token ID12008
Feature activation+3.93e+00

Pos logit contributions
ppa+0.076
apa+0.076
 grandmother+0.075
aj+0.072
 granddaughter+0.071
Neg logit contributions
 score-0.036
 race-0.036
 kick-0.034
 wind-0.034
Stop-0.034
Token.
Token ID13
Feature activation+8.76e-01

Pos logit contributions
Mine+0.059
Stop+0.057
 score+0.056
 hill+0.053
OK+0.053
Neg logit contributions
ppa-0.093
 grandmother-0.092
 granddaughter-0.091
apa-0.087
oll-0.087
Token They
Token ID1119
Feature activation+1.37e+00

Pos logit contributions
Mine+0.014
Stop+0.013
 score+0.013
OK+0.012
 hill+0.012
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.020
oll-0.019
aj-0.019
Token did
Token ID750
Feature activation-5.69e-01

Pos logit contributions
Stop+0.017
Mine+0.017
 score+0.016
 hill+0.016
OK+0.016
Neg logit contributions
ppa-0.035
 grandmother-0.035
 granddaughter-0.034
apa-0.033
aj-0.033
Token not
Token ID407
Feature activation+2.85e+00

Pos logit contributions
ppa+0.009
oll+0.008
apa+0.008
 granddaughter+0.008
 grandmother+0.008
Neg logit contributions
 score-0.014
Stop-0.014
Mine-0.013
 race-0.013
!"-0.013
Token know
Token ID760
Feature activation-1.20e+00

Pos logit contributions
Mine+0.049
 hill+0.044
Stop+0.042
 slope+0.041
Run+0.040
Neg logit contributions
ppa-0.065
 granddaughter-0.062
 patients-0.060
isk-0.059
 grandmother-0.059
Token what
Token ID644
Feature activation-6.83e-01

Pos logit contributions
ppa+0.031
 former+0.031
aj+0.031
omm+0.030
auldron+0.028
Neg logit contributions
 it-0.019
!"-0.017
 times-0.017
 thing-0.016
 you-0.016
Token the
Token ID262
Feature activation+2.75e-01

Pos logit contributions
ppa+0.019
aj+0.018
 former+0.017
omm+0.017
oll+0.017
Neg logit contributions
Stop-0.009
 race-0.009
 score-0.009
 kick-0.009
Mine-0.009
Token knew
Token ID2993
Feature activation+3.53e-01

Pos logit contributions
 score+0.018
Mine+0.018
Stop+0.018
OK+0.017
 hill+0.017
Neg logit contributions
ppa-0.024
 grandmother-0.024
 granddaughter-0.023
aj-0.023
apa-0.022
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
 score+0.004
 hill+0.004
!"+0.003
 race+0.003
Stop+0.003
Neg logit contributions
ppa-0.010
aj-0.010
 granddaughter-0.009
omm-0.009
isk-0.009
Token mine
Token ID6164
Feature activation+3.10e-01

Pos logit contributions
Stop+0.006
Mine+0.005
atch+0.005
 score+0.005
Run+0.005
Neg logit contributions
 grandmother-0.008
 granddaughter-0.008
ppa-0.008
oll-0.007
 parents-0.007
Token had
Token ID550
Feature activation+8.43e-01

Pos logit contributions
Mine+0.005
Stop+0.005
 score+0.005
 hill+0.005
OK+0.004
Neg logit contributions
ppa-0.007
 grandmother-0.007
 granddaughter-0.007
apa-0.007
aj-0.007
Token more
Token ID517
Feature activation+7.51e-01

Pos logit contributions
 score+0.014
Mine+0.014
Stop+0.014
 race+0.013
!"+0.013
Neg logit contributions
ppa-0.019
 grandmother-0.018
 granddaughter-0.018
apa-0.017
aj-0.017
Token to
Token ID284
Feature activation+1.90e+00

Pos logit contributions
Stop+0.013
Mine+0.012
Run+0.012
OK+0.011
 hill+0.011
Neg logit contributions
 grandmother-0.016
ppa-0.016
 granddaughter-0.016
oll-0.015
aj-0.015
Token discover
Token ID7073
Feature activation-1.07e+00

Pos logit contributions
Stop+0.029
Run+0.028
Mine+0.027
www+0.026
 hill+0.026
Neg logit contributions
ppa-0.044
 grandmother-0.043
 granddaughter-0.041
isk-0.040
 former-0.040
Token.
Token ID13
Feature activation-6.77e-02

Pos logit contributions
ppa+0.026
 granddaughter+0.025
 grandmother+0.025
aj+0.024
apa+0.024
Neg logit contributions
 score-0.015
Stop-0.015
Mine-0.015
!"-0.015
 hill-0.015
Token He
Token ID679
Feature activation+1.47e+00

Pos logit contributions
ppa+0.002
 grandmother+0.002
rina+0.002
aj+0.002
 patients+0.002
Neg logit contributions
Mine-0.001
Stop-0.001
 score-0.001
 hill-0.001
OK-0.001
Token couldn
Token ID3521
Feature activation+1.75e+00

Pos logit contributions
Mine+0.024
Stop+0.024
 score+0.023
 hill+0.022
OK+0.022
Neg logit contributions
ppa-0.033
 grandmother-0.032
 granddaughter-0.031
apa-0.030
aj-0.030
Token't
Token ID470
Feature activation+2.95e+00

Pos logit contributions
Stop+0.029
Mine+0.029
 score+0.028
OK+0.028
Run+0.027
Neg logit contributions
 grandmother-0.037
ppa-0.036
 granddaughter-0.035
apa-0.035
oll-0.035
Token
Token ID198
Feature activation-4.30e-01

Pos logit contributions
Stop+0.001
 wants+0.001
!"+0.001
Run+0.001
Mine+0.001
Neg logit contributions
ppa-0.002
 former-0.002
 grandmother-0.002
aj-0.002
 granddaughter-0.002
Token
Token ID198
Feature activation+7.96e-01

Pos logit contributions
ppa+0.009
 granddaughter+0.009
 grandmother+0.009
oll+0.009
 patients+0.009
Neg logit contributions
Stop-0.007
Mine-0.007
 score-0.007
 hill-0.007
!"-0.007
Token"
Token ID1
Feature activation+1.95e+00

Pos logit contributions
 score+0.011
Mine+0.011
 race+0.010
 challenge+0.010
 forth+0.010
Neg logit contributions
ppa-0.019
apa-0.018
 grandmother-0.018
 granddaughter-0.017
 Mother-0.017
TokenHey
Token ID10814
Feature activation+2.40e+00

Pos logit contributions
 score+0.029
!"+0.027
atch+0.027
 hill+0.026
 challenge+0.026
Neg logit contributions
ppa-0.044
 granddaughter-0.043
 grandmother-0.042
 patients-0.041
 owners-0.041
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
Mine+0.042
Stop+0.037
OK+0.035
ok+0.035
Check+0.035
Neg logit contributions
ppa-0.050
 granddaughter-0.049
 grandmother-0.049
apa-0.048
 parents-0.047
Token you
Token ID345
Feature activation+4.68e-01

Pos logit contributions
ppa+0.032
 grandmother+0.032
 granddaughter+0.032
apa+0.030
oll+0.030
Neg logit contributions
Mine-0.024
Stop-0.022
 score-0.022
OK-0.022
 hill-0.021
Token kids
Token ID3988
Feature activation-1.02e-01

Pos logit contributions
Stop+0.006
!"+0.006
 score+0.006
Mine+0.006
 hill+0.006
Neg logit contributions
ppa-0.012
 grandmother-0.012
 granddaughter-0.012
apa-0.011
oll-0.011
Token!
Token ID0
Feature activation+8.35e-02

Pos logit contributions
ppa+0.002
uf+0.002
 granddaughter+0.002
 grandmother+0.002
isk+0.002
Neg logit contributions
Mine-0.002
Stop-0.002
Wow-0.002
Check-0.002
Bang-0.002
Token Get
Token ID3497
Feature activation+4.02e-01

Pos logit contributions
Mine+0.001
Stop+0.001
 score+0.001
 hill+0.001
OK+0.001
Neg logit contributions
ppa-0.002
 grandmother-0.002
 granddaughter-0.002
apa-0.002
aj-0.002
Token away
Token ID1497
Feature activation+6.86e-01

Pos logit contributions
Mine+0.007
 score+0.006
Stop+0.006
 hill+0.006
OK+0.006
Neg logit contributions
ppa-0.009
 grandmother-0.009
 granddaughter-0.009
apa-0.008
oll-0.008
Token from
Token ID422
Feature activation-1.99e+00

Pos logit contributions
Mine+0.014
Stop+0.013
OK+0.013
atch+0.013
 score+0.013
Neg logit contributions
 grandmother-0.012
ppa-0.012
 granddaughter-0.012
apa-0.011
oll-0.011
Token "
Token ID366
Feature activation+3.21e+00

Pos logit contributions
ppa+0.005
iqu+0.005
rina+0.005
urger+0.005
apa+0.005
Neg logit contributions
 hill-0.002
 wind-0.002
No-0.002
OK-0.002
 score-0.002
TokenL
Token ID43
Feature activation-8.34e-02

Pos logit contributions
!"+0.045
 hill+0.044
 score+0.044
 race+0.042
 challenge+0.040
Neg logit contributions
ppa-0.076
 granddaughter-0.073
 grandmother-0.072
 patients-0.071
 owners-0.068
Tokenily
Token ID813
Feature activation+6.86e-01

Pos logit contributions
ppa+0.001
oll+0.001
 granddaughter+0.001
 grandmother+0.001
isk+0.001
Neg logit contributions
Mine-0.002
 hill-0.002
Stop-0.002
 score-0.002
Run-0.002
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
Mine+0.009
Stop+0.009
 score+0.008
Run+0.008
 hill+0.008
Neg logit contributions
ppa-0.018
 grandmother-0.017
 granddaughter-0.017
apa-0.017
aj-0.017
Token if
Token ID611
Feature activation+8.63e-01

Pos logit contributions
ppa+0.050
 grandmother+0.049
 granddaughter+0.048
apa+0.047
aj+0.047
Neg logit contributions
Stop-0.018
Mine-0.018
 score-0.017
 hill-0.016
 race-0.016
Token you
Token ID345
Feature activation+2.15e+00

Pos logit contributions
Stop+0.005
Mine+0.005
 score+0.004
OK+0.004
 hill+0.004
Neg logit contributions
ppa-0.029
 granddaughter-0.028
 grandmother-0.027
oll-0.027
aj-0.027
Token take
Token ID1011
Feature activation+9.79e-01

Pos logit contributions
Stop+0.027
Mine+0.026
 hill+0.024
www+0.024
Run+0.023
Neg logit contributions
ppa-0.056
 grandmother-0.054
 granddaughter-0.054
apa-0.053
oll-0.052
Token a
Token ID257
Feature activation+8.64e-01

Pos logit contributions
Stop+0.015
Mine+0.015
 score+0.015
!"+0.014
 turns+0.014
Neg logit contributions
ppa-0.023
 grandmother-0.022
 granddaughter-0.022
aj-0.022
apa-0.021
Token coin
Token ID10752
Feature activation-1.87e+00

Pos logit contributions
Stop+0.016
Mine+0.015
atch+0.014
Run+0.014
ok+0.014
Neg logit contributions
 grandmother-0.018
ppa-0.018
 granddaughter-0.018
apa-0.017
aj-0.017
Token,
Token ID11
Feature activation-1.89e+00

Pos logit contributions
ppa+0.045
 grandmother+0.044
 granddaughter+0.043
apa+0.042
oll+0.042
Neg logit contributions
Mine-0.028
Stop-0.028
Run-0.025
 score-0.025
OK-0.025
Token you
Token ID345
Feature activation+1.04e+00

Pos logit contributions
ppa+0.048
 grandmother+0.046
 granddaughter+0.045
apa+0.044
aj+0.044
Neg logit contributions
Stop-0.026
Mine-0.026
 score-0.025
 race-0.024
 hill-0.023
TokenJack
Token ID14295
Feature activation+1.75e+00

Pos logit contributions
 blows+0.002
 hill+0.002
 hits+0.002
 times+0.002
 forth+0.002
Neg logit contributions
ppa-0.001
apa-0.001
 grandmother-0.001
 granddaughter-0.001
Mrs-0.001
Token hopped
Token ID45230
Feature activation+2.94e+00

Pos logit contributions
Stop+0.025
Mine+0.025
 score+0.024
 hill+0.023
OK+0.023
Neg logit contributions
ppa-0.042
 grandmother-0.041
 granddaughter-0.041
apa-0.040
oll-0.039
Token and
Token ID290
Feature activation+1.22e+00

Pos logit contributions
Mine+0.061
Stop+0.057
atch+0.055
www+0.053
 score+0.052
Neg logit contributions
 grandmother-0.052
ppa-0.051
 granddaughter-0.049
oll-0.049
apa-0.048
Token hopped
Token ID45230
Feature activation+2.53e+00

Pos logit contributions
Mine+0.020
Stop+0.020
 score+0.019
atch+0.018
OK+0.018
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
aj-0.025
apa-0.025
Token until
Token ID1566
Feature activation+2.57e+00

Pos logit contributions
Mine+0.053
Stop+0.050
atch+0.047
Check+0.046
 blows+0.046
Neg logit contributions
 grandmother-0.043
ppa-0.043
oll-0.042
apa-0.041
 parents-0.041
Token he
Token ID339
Feature activation+2.85e+00

Pos logit contributions
Mine+0.042
Stop+0.041
atch+0.039
OK+0.038
 score+0.038
Neg logit contributions
 grandmother-0.056
ppa-0.055
apa-0.054
 granddaughter-0.054
oll-0.052
Token found
Token ID1043
Feature activation-1.34e+00

Pos logit contributions
 score+0.033
Stop+0.032
Mine+0.030
 hill+0.029
OK+0.029
Neg logit contributions
ppa-0.077
 grandmother-0.077
 granddaughter-0.076
 patients-0.073
aj-0.073
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
ppa+0.034
aj+0.034
rina+0.032
 grandmother+0.032
 granddaughter+0.031
Neg logit contributions
 wind-0.019
!"-0.019
Stop-0.019
 rock-0.018
 score-0.018
Token pretty
Token ID2495
Feature activation+3.72e+00

Pos logit contributions
Mine+0.038
Stop+0.038
 score+0.036
atch+0.035
Run+0.035
Neg logit contributions
 grandmother-0.048
ppa-0.048
 granddaughter-0.047
apa-0.044
aj-0.044
Token mushroom
Token ID28520
Feature activation-7.75e-01

Pos logit contributions
Stop+0.070
Mine+0.067
Run+0.065
OK+0.063
ok+0.062
Neg logit contributions
 grandmother-0.079
 granddaughter-0.075
ppa-0.073
 former-0.068
aj-0.068
Token.
Token ID13
Feature activation+5.89e-01

Pos logit contributions
oll+0.015
ppa+0.015
 grandmother+0.015
 granddaughter+0.014
apa+0.013
Neg logit contributions
Stop-0.016
Mine-0.015
OK-0.014
 score-0.013
No-0.013
Token take
Token ID1011
Feature activation+3.83e-01

Pos logit contributions
ppa+0.003
isk+0.003
 grandmother+0.003
 patients+0.003
 granddaughter+0.003
Neg logit contributions
www-0.002
ok-0.002
ww-0.002
 hill-0.002
OK-0.002
Token it
Token ID340
Feature activation-3.13e+00

Pos logit contributions
Mine+0.004
Stop+0.004
 score+0.004
OK+0.004
 race+0.004
Neg logit contributions
ppa-0.010
 grandmother-0.010
 granddaughter-0.010
aj-0.010
apa-0.010
Token from
Token ID422
Feature activation-1.80e+00

Pos logit contributions
ppa+0.074
 grandmother+0.073
 granddaughter+0.072
oll+0.069
apa+0.069
Neg logit contributions
Stop-0.046
Mine-0.045
OK-0.042
Run-0.042
 score-0.041
Token her
Token ID607
Feature activation-3.79e+00

Pos logit contributions
aj+0.044
ppa+0.044
oll+0.043
iqu+0.042
apa+0.042
Neg logit contributions
 score-0.027
Stop-0.026
Mine-0.026
 race-0.024
 kick-0.023
Token.
Token ID13
Feature activation-8.97e-03

Pos logit contributions
ppa+0.101
apa+0.094
 granddaughter+0.093
 grandmother+0.093
oll+0.093
Neg logit contributions
 score-0.049
Mine-0.046
 hill-0.045
!"-0.044
 race-0.043
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ppa+0.000
 grandmother+0.000
 granddaughter+0.000
rina+0.000
urger+0.000
Neg logit contributions
Mine-0.000
Stop-0.000
 score-0.000
OK-0.000
No-0.000
Token
Token ID198
Feature activation+9.06e-03

Pos logit contributions
ppa+0.026
 granddaughter+0.025
 grandmother+0.025
 patients+0.024
oll+0.024
Neg logit contributions
Stop-0.018
Mine-0.018
 score-0.017
!"-0.016
OK-0.016
Token"
Token ID1
Feature activation+2.34e+00

Pos logit contributions
 hill+0.000
Mine+0.000
 score+0.000
 challenge+0.000
 race+0.000
Neg logit contributions
ppa-0.000
apa-0.000
aj-0.000
 granddaughter-0.000
oll-0.000
TokenHey
Token ID10814
Feature activation+2.12e+00

Pos logit contributions
 hill+0.028
!"+0.026
 score+0.025
 race+0.024
 challenge+0.024
Neg logit contributions
ppa-0.061
 granddaughter-0.060
 grandmother-0.059
 patients-0.056
apa-0.056
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
Mine+0.032
Stop+0.030
 hill+0.028
OK+0.028
ok+0.027
Neg logit contributions
ppa-0.051
 granddaughter-0.050
 grandmother-0.050
aj-0.047
apa-0.047
Token give
Token ID1577
Feature activation+7.60e-01

Pos logit contributions
ppa+0.034
 grandmother+0.033
 granddaughter+0.033
oll+0.032
apa+0.031
Neg logit contributions
Mine-0.020
Stop-0.019
 hill-0.018
 score-0.018
OK-0.018
Token Ben
Token ID3932
Feature activation+1.92e+00

Pos logit contributions
ppa+0.019
aj+0.019
oll+0.019
lor+0.018
 former+0.018
Neg logit contributions
Mine-0.010
 score-0.010
Stop-0.010
 race-0.009
 kick-0.009
Token said
Token ID531
Feature activation-2.59e+00

Pos logit contributions
Mine+0.028
Stop+0.028
 score+0.027
 hill+0.025
OK+0.025
Neg logit contributions
ppa-0.046
 grandmother-0.045
 granddaughter-0.044
apa-0.043
aj-0.042
Token yes
Token ID3763
Feature activation+2.25e+00

Pos logit contributions
ppa+0.044
 grandmother+0.041
 granddaughter+0.040
aj+0.040
isk+0.039
Neg logit contributions
Mine-0.057
Stop-0.056
 score-0.055
OK-0.054
 race-0.054
Token.
Token ID13
Feature activation+4.75e-01

Pos logit contributions
Mine+0.033
Stop+0.032
 score+0.030
Run+0.030
 hill+0.029
Neg logit contributions
ppa-0.055
 grandmother-0.054
 granddaughter-0.053
apa-0.051
aj-0.050
Token They
Token ID1119
Feature activation+1.90e+00

Pos logit contributions
Mine+0.007
Stop+0.006
OK+0.006
 score+0.006
Wow+0.006
Neg logit contributions
ppa-0.013
oll-0.012
 grandmother-0.012
iqu-0.012
rina-0.012
Token shared
Token ID4888
Feature activation-1.33e+00

Pos logit contributions
Mine+0.028
Stop+0.027
 score+0.027
OK+0.025
 race+0.025
Neg logit contributions
ppa-0.046
 grandmother-0.045
 granddaughter-0.044
aj-0.043
apa-0.043
Token their
Token ID511
Feature activation-2.37e+00

Pos logit contributions
ppa+0.034
 grandmother+0.033
omm+0.032
apa+0.032
oll+0.032
Neg logit contributions
Mine-0.020
 score-0.019
!"-0.018
 turns-0.018
Stop-0.017
Token coal
Token ID5655
Feature activation-2.81e+00

Pos logit contributions
ppa+0.049
 grandmother+0.048
 granddaughter+0.047
apa+0.045
aj+0.045
Neg logit contributions
Stop-0.041
Mine-0.041
 score-0.040
 hill-0.038
OK-0.038
Token.
Token ID13
Feature activation+8.91e-01

Pos logit contributions
ppa+0.068
 grandmother+0.066
 granddaughter+0.065
apa+0.063
aj+0.063
Neg logit contributions
Stop-0.040
Mine-0.040
 score-0.038
 hill-0.037
OK-0.036
Token They
Token ID1119
Feature activation+2.17e+00

Pos logit contributions
Mine+0.012
Stop+0.011
OK+0.011
Wow+0.010
 score+0.010
Neg logit contributions
ppa-0.026
oll-0.023
rina-0.023
urger-0.023
ache-0.023
Token taught
Token ID7817
Feature activation-8.36e-01

Pos logit contributions
Mine+0.034
 score+0.033
Stop+0.032
OK+0.031
 hill+0.031
Neg logit contributions
ppa-0.050
 grandmother-0.050
 granddaughter-0.049
apa-0.047
aj-0.047
Token was
Token ID373
Feature activation-3.08e+00

Pos logit contributions
ppa+0.024
 grandmother+0.023
 granddaughter+0.023
apa+0.022
aj+0.022
Neg logit contributions
Mine-0.015
Stop-0.015
 score-0.014
 hill-0.014
OK-0.014
Token still
Token ID991
Feature activation-3.33e-01

Pos logit contributions
ppa+0.070
 grandmother+0.067
 granddaughter+0.067
apa+0.067
aj+0.064
Neg logit contributions
Mine-0.049
Stop-0.048
 score-0.047
 hill-0.045
 race-0.045
Token messy
Token ID24097
Feature activation+3.33e+00

Pos logit contributions
ppa+0.008
 grandmother+0.008
 granddaughter+0.007
apa+0.007
oll+0.007
Neg logit contributions
Mine-0.005
Stop-0.005
 score-0.005
 hill-0.005
OK-0.005
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
Mine+0.061
Stop+0.061
 score+0.059
Run+0.058
OK+0.057
Neg logit contributions
 grandmother-0.067
 granddaughter-0.065
ppa-0.065
oll-0.064
 patients-0.062
Token She
Token ID1375
Feature activation+1.46e-01

Pos logit contributions
Mine+0.022
Stop+0.022
 score+0.021
 hill+0.020
OK+0.020
Neg logit contributions
ppa-0.023
 grandmother-0.022
 granddaughter-0.022
apa-0.021
oll-0.021
Token was
Token ID373
Feature activation-3.03e+00

Pos logit contributions
Stop+0.002
Mine+0.002
 score+0.002
OK+0.002
Run+0.002
Neg logit contributions
ppa-0.004
 grandmother-0.003
 granddaughter-0.003
apa-0.003
oll-0.003
Token very
Token ID845
Feature activation-1.11e+00

Pos logit contributions
apa+0.072
ppa+0.069
aj+0.069
lor+0.068
urger+0.067
Neg logit contributions
 cl-0.043
 score-0.043
Stop-0.043
 race-0.042
 wind-0.042
Token quiet
Token ID5897
Feature activation+2.06e+00

Pos logit contributions
ppa+0.021
 grandmother+0.020
 granddaughter+0.020
apa+0.019
aj+0.019
Neg logit contributions
Mine-0.022
Stop-0.022
 score-0.022
 hill-0.021
atch-0.021
Token and
Token ID290
Feature activation-6.13e-01

Pos logit contributions
Mine+0.031
Stop+0.030
 score+0.029
Run+0.028
OK+0.028
Neg logit contributions
 grandmother-0.049
ppa-0.048
 granddaughter-0.048
aj-0.045
oll-0.045
Token didn
Token ID1422
Feature activation+5.32e-01

Pos logit contributions
ppa+0.015
 grandmother+0.014
apa+0.014
oll+0.014
 granddaughter+0.014
Neg logit contributions
Stop-0.009
Mine-0.008
OK-0.008
 score-0.008
 race-0.008
Token't
Token ID470
Feature activation+2.99e+00

Pos logit contributions
Mine+0.007
Stop+0.006
 score+0.006
 race+0.006
!"+0.006
Neg logit contributions
ppa-0.014
 granddaughter-0.014
 grandmother-0.014
aj-0.013
apa-0.013
Token her
Token ID607
Feature activation-3.86e+00

Pos logit contributions
Stop+0.020
Mine+0.019
 score+0.019
 hill+0.018
OK+0.018
Neg logit contributions
ppa-0.027
 grandmother-0.026
 granddaughter-0.026
apa-0.025
aj-0.025
Token to
Token ID284
Feature activation-3.92e-01

Pos logit contributions
ppa+0.088
 grandmother+0.081
 granddaughter+0.081
apa+0.081
oll+0.080
Neg logit contributions
 score-0.062
Mine-0.060
Stop-0.059
 hill-0.059
!"-0.058
Token her
Token ID607
Feature activation-4.43e+00

Pos logit contributions
ppa+0.010
 granddaughter+0.010
 grandmother+0.009
apa+0.009
oll+0.009
Neg logit contributions
Stop-0.005
 score-0.005
Mine-0.005
 race-0.005
!"-0.005
Token beautiful
Token ID4950
Feature activation+7.11e-01

Pos logit contributions
ppa+0.106
apa+0.097
oll+0.097
omm+0.096
 granddaughter+0.095
Neg logit contributions
 score-0.068
 hill-0.067
 race-0.065
!"-0.064
Mine-0.063
Token self
Token ID2116
Feature activation+8.34e-01

Pos logit contributions
Stop+0.014
Mine+0.013
Run+0.013
www+0.013
atch+0.013
Neg logit contributions
 grandmother-0.014
 granddaughter-0.014
aj-0.013
ppa-0.013
 parents-0.013
Token.
Token ID13
Feature activation-6.38e-02

Pos logit contributions
 score+0.010
Stop+0.010
Mine+0.010
!"+0.010
 hill+0.010
Neg logit contributions
ppa-0.021
 grandmother-0.020
 granddaughter-0.020
apa-0.020
aj-0.020
Token<|endoftext|>
Token ID50256
Feature activation+2.53e+00

Pos logit contributions
ppa+0.002
iqu+0.002
 patients+0.002
urger+0.002
rina+0.002
Neg logit contributions
!"-0.001
Mine-0.001
Wow-0.001
No-0.001
OK-0.001
TokenOnce
Token ID7454
Feature activation-1.50e+00

Pos logit contributions
Stop+0.035
!"+0.035
Mine+0.034
 hill+0.034
 score+0.034
Neg logit contributions
ppa-0.065
 granddaughter-0.061
 grandmother-0.060
 patients-0.059
apa-0.059
Token upon
Token ID2402
Feature activation-2.12e+00

Pos logit contributions
ppa+0.035
 granddaughter+0.033
 grandmother+0.033
 patients+0.032
 former+0.031
Neg logit contributions
 score-0.022
Mine-0.022
Stop-0.022
 hill-0.022
!"-0.021
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
ppa+0.047
apa+0.045
 grandmother+0.045
omm+0.044
aj+0.044
Neg logit contributions
Mine-0.037
Stop-0.036
 score-0.036
!"-0.035
 kick-0.034
Token time
Token ID640
Feature activation+7.51e-01

Pos logit contributions
Stop+0.015
Mine+0.015
 score+0.014
 hill+0.013
OK+0.013
Neg logit contributions
ppa-0.035
 grandmother-0.034
 granddaughter-0.034
apa-0.033
aj-0.033
Token the
Token ID262
Feature activation-7.31e-01

Pos logit contributions
ppa+0.018
 grandmother+0.017
 granddaughter+0.017
apa+0.017
aj+0.017
Neg logit contributions
Mine-0.008
Stop-0.008
 score-0.008
 hill-0.008
 race-0.007
Token math
Token ID10688
Feature activation-1.32e-01

Pos logit contributions
 grandmother+0.018
 granddaughter+0.018
 patients+0.017
 parents+0.017
 owners+0.016
Neg logit contributions
Run-0.010
Mine-0.010
Stop-0.010
atch-0.009
Wow-0.009
Token.
Token ID13
Feature activation+3.14e-01

Pos logit contributions
 grandmother+0.003
 granddaughter+0.002
 patients+0.002
oll+0.002
 parents+0.002
Neg logit contributions
Mine-0.002
Stop-0.002
Run-0.002
atch-0.002
Wow-0.002
Token He
Token ID679
Feature activation+1.13e+00

Pos logit contributions
Mine+0.004
Stop+0.004
OK+0.003
No+0.003
!"+0.003
Neg logit contributions
ppa-0.009
rina-0.008
iqu-0.008
 grandmother-0.008
urger-0.008
Token explains
Token ID6688
Feature activation-7.04e-01

Pos logit contributions
Stop+0.015
Mine+0.015
www+0.015
Run+0.015
 hill+0.014
Neg logit contributions
 grandmother-0.027
 granddaughter-0.027
oll-0.026
ppa-0.026
aj-0.026
Token the
Token ID262
Feature activation-8.48e-01

Pos logit contributions
ppa+0.018
aj+0.016
isk+0.016
 grandmother+0.016
 granddaughter+0.016
Neg logit contributions
!"-0.010
Stop-0.010
 score-0.010
Mine-0.010
 race-0.009
Token numbers
Token ID3146
Feature activation-1.19e+00

Pos logit contributions
 grandmother+0.017
 granddaughter+0.017
oll+0.016
 parents+0.015
 patients+0.015
Neg logit contributions
Run-0.015
Mine-0.015
Stop-0.014
atch-0.014
www-0.014
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
 grandmother+0.024
 granddaughter+0.023
oll+0.023
ppa+0.023
 patients+0.022
Neg logit contributions
Stop-0.021
Mine-0.020
Run-0.020
www-0.020
atch-0.019
Token signs
Token ID5895
Feature activation-1.29e+00

Pos logit contributions
 grandmother+0.023
 granddaughter+0.023
ppa+0.022
 patients+0.021
apa+0.021
Neg logit contributions
Mine-0.015
Stop-0.015
www-0.015
 hill-0.014
 score-0.014
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
ppa+0.030
 grandmother+0.030
 granddaughter+0.029
apa+0.028
oll+0.028
Neg logit contributions
Mine-0.019
Stop-0.019
 score-0.018
 hill-0.018
OK-0.017
Token He
Token ID679
Feature activation+9.39e-01

Pos logit contributions
Mine+0.003
Stop+0.003
OK+0.003
No+0.003
!"+0.003
Neg logit contributions
ppa-0.008
rina-0.008
iqu-0.008
 grandmother-0.008
urger-0.007
Token decided
Token ID3066
Feature activation+4.09e-01

Pos logit contributions
 cl+0.008
 score+0.008
OK+0.008
Run+0.008
 wants+0.008
Neg logit contributions
omm-0.017
 grandmother-0.017
ppa-0.017
uggets-0.017
apa-0.017
Token to
Token ID284
Feature activation+3.97e-01

Pos logit contributions
Stop+0.005
!"+0.005
OK+0.005
 score+0.005
Mine+0.005
Neg logit contributions
ppa-0.010
oll-0.010
omm-0.010
apa-0.010
 grandmother-0.010
Token not
Token ID407
Feature activation+2.15e+00

Pos logit contributions
Mine+0.005
Stop+0.005
 score+0.005
 hill+0.005
 race+0.004
Neg logit contributions
ppa-0.010
 grandmother-0.010
 granddaughter-0.010
apa-0.010
aj-0.010
Token put
Token ID1234
Feature activation+4.48e-01

Pos logit contributions
Mine+0.033
Stop+0.031
OK+0.030
Run+0.029
www+0.029
Neg logit contributions
ppa-0.050
 grandmother-0.048
 granddaughter-0.047
 patients-0.046
oll-0.045
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
Stop+0.006
 score+0.006
Mine+0.006
!"+0.005
 race+0.005
Neg logit contributions
ppa-0.012
 grandmother-0.011
apa-0.011
 granddaughter-0.011
aj-0.011
Token bird
Token ID6512
Feature activation-5.57e+00

Pos logit contributions
 grandmother+0.038
 granddaughter+0.038
ppa+0.038
 patients+0.035
apa+0.035
Neg logit contributions
Mine-0.028
Stop-0.028
 score-0.026
atch-0.025
www-0.025
Token in
Token ID287
Feature activation+5.24e-01

Pos logit contributions
 grandmother+0.153
omm+0.153
ppa+0.148
apa+0.148
 granddaughter+0.148
Neg logit contributions
 score-0.063
!"-0.060
 race-0.060
 hill-0.059
Stop-0.055
Token the
Token ID262
Feature activation+2.59e-01

Pos logit contributions
Mine+0.007
 score+0.007
Stop+0.007
!"+0.007
 hill+0.006
Neg logit contributions
ppa-0.014
oll-0.013
aj-0.013
apa-0.013
iqu-0.013
Token cage
Token ID17186
Feature activation-2.43e+00

Pos logit contributions
Mine+0.004
Stop+0.004
 score+0.003
 hill+0.003
OK+0.003
Neg logit contributions
ppa-0.007
 grandmother-0.006
 granddaughter-0.006
apa-0.006
aj-0.006
Token anymore
Token ID7471
Feature activation+6.51e-01

Pos logit contributions
ppa+0.068
 grandmother+0.066
omm+0.066
 granddaughter+0.066
apa+0.065
Neg logit contributions
 hill-0.025
!"-0.025
 score-0.024
OK-0.023
 race-0.022
Token.
Token ID13
Feature activation+8.86e-01

Pos logit contributions
 score+0.008
Stop+0.007
Mine+0.007
 hill+0.007
!"+0.007
Neg logit contributions
ppa-0.017
 grandmother-0.017
 granddaughter-0.017
apa-0.016
aj-0.016
Token scared
Token ID12008
Feature activation+3.21e+00

Pos logit contributions
ppa+0.060
 grandmother+0.058
 granddaughter+0.058
oll+0.054
aj+0.054
Neg logit contributions
Mine-0.065
Stop-0.064
 score-0.061
 hill-0.060
Run-0.060
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
Mine+0.073
 score+0.064
OK+0.064
Give+0.062
Stop+0.062
Neg logit contributions
 grandmother-0.053
 granddaughter-0.053
ppa-0.052
 patients-0.050
 parents-0.050
Token but
Token ID475
Feature activation+9.97e-02

Pos logit contributions
 grandmother+0.034
ppa+0.034
 granddaughter+0.034
oll+0.032
apa+0.032
Neg logit contributions
Mine-0.028
Stop-0.027
 score-0.026
atch-0.025
OK-0.025
Token you
Token ID345
Feature activation+1.47e+00

Pos logit contributions
Mine+0.001
Stop+0.001
ok+0.001
 score+0.001
about+0.001
Neg logit contributions
 grandmother-0.002
 granddaughter-0.002
 parents-0.002
ppa-0.002
 Mother-0.002
Token helped
Token ID4193
Feature activation-8.60e-02

Pos logit contributions
Mine+0.022
Stop+0.022
 score+0.021
 hill+0.020
OK+0.020
Neg logit contributions
ppa-0.034
 grandmother-0.033
 granddaughter-0.033
apa-0.032
aj-0.031
Token me
Token ID502
Feature activation-4.49e+00

Pos logit contributions
ppa+0.003
 grandmother+0.002
 granddaughter+0.002
apa+0.002
 patients+0.002
Neg logit contributions
Mine-0.001
Stop-0.001
 score-0.001
 hill-0.001
Run-0.001
Token be
Token ID307
Feature activation-2.57e+00

Pos logit contributions
ppa+0.114
 grandmother+0.111
 granddaughter+0.110
aj+0.108
oll+0.105
Neg logit contributions
Mine-0.061
Stop-0.059
Run-0.054
OK-0.051
Hey-0.051
Token brave
Token ID14802
Feature activation+2.30e+00

Pos logit contributions
ppa+0.047
 granddaughter+0.045
 grandmother+0.044
aj+0.042
 patients+0.041
Neg logit contributions
Mine-0.055
Stop-0.053
 hill-0.050
 score-0.049
Run-0.049
Token."
Token ID526
Feature activation-7.15e-01

Pos logit contributions
Mine+0.038
Stop+0.037
Run+0.034
 score+0.034
Give+0.033
Neg logit contributions
 granddaughter-0.053
ppa-0.052
 grandmother-0.052
 patients-0.049
aj-0.049
Token Tim
Token ID5045
Feature activation+1.20e-01

Pos logit contributions
ppa+0.019
 grandmother+0.018
apa+0.017
 granddaughter+0.017
oll+0.017
Neg logit contributions
Mine-0.010
Stop-0.010
 score-0.009
 hill-0.009
!"-0.009
Tokenmy
Token ID1820
Feature activation+7.29e-01

Pos logit contributions
 score+0.002
Stop+0.002
Mine+0.002
OK+0.002
!"+0.002
Neg logit contributions
 grandmother-0.003
ppa-0.003
 granddaughter-0.003
apa-0.003
 patients-0.003
TokenCan
Token ID6090
Feature activation+1.37e+00

Pos logit contributions
Stop+0.017
Mine+0.017
OK+0.015
 score+0.015
Run+0.015
Neg logit contributions
ppa-0.023
 grandmother-0.022
 granddaughter-0.022
apa-0.022
oll-0.021
Token we
Token ID356
Feature activation-4.66e-01

Pos logit contributions
Stop+0.007
Mine+0.006
 hill+0.006
 score+0.006
 race+0.005
Neg logit contributions
ppa-0.047
oll-0.045
 granddaughter-0.045
apa-0.044
urger-0.044
Token stop
Token ID2245
Feature activation+1.15e+00

Pos logit contributions
ppa+0.013
 grandmother+0.012
 granddaughter+0.012
apa+0.012
aj+0.012
Neg logit contributions
Mine-0.005
Stop-0.005
 score-0.005
 hill-0.005
 race-0.005
Token soon
Token ID2582
Feature activation+1.08e+00

Pos logit contributions
Mine+0.020
Stop+0.020
 score+0.019
OK+0.018
 hill+0.018
Neg logit contributions
ppa-0.024
 grandmother-0.024
 granddaughter-0.024
aj-0.023
apa-0.022
Token?"
Token ID1701
Feature activation-1.90e+00

Pos logit contributions
Mine+0.016
Stop+0.015
OK+0.014
Run+0.014
 score+0.014
Neg logit contributions
ppa-0.026
 grandmother-0.026
 granddaughter-0.025
aj-0.025
oll-0.024
Token Her
Token ID2332
Feature activation-4.57e+00

Pos logit contributions
ppa+0.029
oll+0.028
iqu+0.027
ovable+0.027
urger+0.027
Neg logit contributions
Mine-0.046
Stop-0.045
No-0.044
Give-0.043
OK-0.043
Token mom
Token ID1995
Feature activation-1.62e+00

Pos logit contributions
ppa+0.092
apa+0.089
lor+0.088
ridge+0.086
isk+0.082
Neg logit contributions
 hill-0.089
 game-0.088
 race-0.088
 ball-0.087
 wind-0.087
Tokenmy
Token ID1820
Feature activation+2.51e-01

Pos logit contributions
ppa+0.037
 grandmother+0.037
apa+0.037
 patients+0.036
 granddaughter+0.036
Neg logit contributions
Stop-0.023
Mine-0.023
 score-0.022
OK-0.022
!"-0.022
Token said
Token ID531
Feature activation-2.13e+00

Pos logit contributions
Mine+0.003
Stop+0.003
 score+0.003
OK+0.003
Run+0.003
Neg logit contributions
ppa-0.006
 former-0.006
 grandmother-0.006
 patients-0.006
aj-0.006
Token,
Token ID11
Feature activation-3.33e+00

Pos logit contributions
ppa+0.066
 grandmother+0.064
 granddaughter+0.064
aj+0.063
 former+0.062
Neg logit contributions
Mine-0.015
Stop-0.015
 score-0.014
!"-0.014
OK-0.013
Token "
Token ID366
Feature activation+1.77e+00

Pos logit contributions
ppa+0.076
 grandmother+0.074
apa+0.073
aj+0.073
 former+0.073
Neg logit contributions
Stop-0.050
Mine-0.049
!"-0.047
OK-0.047
Run-0.046
Token a
Token ID257
Feature activation+1.10e+00

Pos logit contributions
ppa+0.001
 grandmother+0.001
 granddaughter+0.001
apa+0.001
aj+0.001
Neg logit contributions
Mine-0.001
Stop-0.001
 score-0.001
 hill-0.001
OK-0.001
Token good
Token ID922
Feature activation+3.48e+00

Pos logit contributions
Stop+0.018
Mine+0.017
 score+0.016
atch+0.016
Run+0.016
Neg logit contributions
 grandmother-0.024
ppa-0.024
 granddaughter-0.024
apa-0.023
aj-0.022
Token friend
Token ID1545
Feature activation-1.53e+00

Pos logit contributions
Stop+0.057
Mine+0.054
atch+0.052
Run+0.052
Hey+0.050
Neg logit contributions
 grandmother-0.082
 granddaughter-0.080
 patients-0.073
 parents-0.072
 prayers-0.072
Token to
Token ID284
Feature activation-4.28e-01

Pos logit contributions
ppa+0.038
 grandmother+0.037
 granddaughter+0.036
aj+0.036
oll+0.036
Neg logit contributions
Stop-0.020
 score-0.020
Mine-0.020
!"-0.019
 race-0.019
Token help
Token ID1037
Feature activation+2.10e-01

Pos logit contributions
ppa+0.011
 grandmother+0.011
 granddaughter+0.011
aj+0.010
apa+0.010
Neg logit contributions
Mine-0.006
Stop-0.006
 score-0.005
 race-0.005
 hill-0.005
Token him
Token ID683
Feature activation-5.17e+00

Pos logit contributions
Stop+0.003
Mine+0.003
 score+0.003
 hill+0.003
 race+0.003
Neg logit contributions
ppa-0.005
 grandmother-0.005
 granddaughter-0.005
aj-0.005
oll-0.004
Token along
Token ID1863
Feature activation+8.62e-01

Pos logit contributions
ppa+0.129
 grandmother+0.125
 granddaughter+0.124
apa+0.120
aj+0.119
Neg logit contributions
 score-0.070
Mine-0.069
Stop-0.068
 hill-0.065
 race-0.064
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
Stop+0.014
Mine+0.014
 score+0.013
OK+0.013
Run+0.012
Neg logit contributions
 grandmother-0.019
ppa-0.019
 granddaughter-0.019
apa-0.018
 patients-0.018
Token way
Token ID835
Feature activation+3.29e-01

Pos logit contributions
Stop+0.000
Mine+0.000
Run+0.000
OK+0.000
Hey+0.000
Neg logit contributions
 grandmother-0.001
 granddaughter-0.001
ppa-0.001
 patients-0.001
 parents-0.001
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
Stop+0.006
Mine+0.006
 score+0.005
Run+0.005
OK+0.005
Neg logit contributions
 grandmother-0.007
ppa-0.007
 granddaughter-0.007
apa-0.007
 patients-0.006
Token<|endoftext|>
Token ID50256
Feature activation+2.65e+00

Pos logit contributions
ppa+0.003
 grandmother+0.003
oll+0.003
iqu+0.003
aj+0.003
Neg logit contributions
Mine-0.002
Stop-0.002
 hill-0.002
 score-0.002
Run-0.002
Token mom
Token ID1995
Feature activation-2.90e-01

Pos logit contributions
ppa+0.053
ridge+0.051
rina+0.050
apa+0.047
ighters+0.046
Neg logit contributions
 race-0.041
 ball-0.039
OK-0.038
 wind-0.038
 game-0.037
Token asked
Token ID1965
Feature activation-1.27e+00

Pos logit contributions
ppa+0.006
 grandmother+0.006
apa+0.006
oll+0.006
 granddaughter+0.006
Neg logit contributions
Stop-0.005
Mine-0.004
 score-0.004
!"-0.004
 race-0.004
Token her
Token ID607
Feature activation-5.73e+00

Pos logit contributions
ppa+0.027
 grandmother+0.026
 former+0.025
aj+0.025
oll+0.025
Neg logit contributions
Stop-0.023
Mine-0.022
 score-0.021
!"-0.020
atch-0.020
Token to
Token ID284
Feature activation-2.07e+00

Pos logit contributions
ppa+0.182
 former+0.181
aj+0.174
ascal+0.174
iqu+0.174
Neg logit contributions
!"-0.046
 race-0.045
"-0.044
 want-0.041
 thing-0.039
Token help
Token ID1037
Feature activation-2.06e-01

Pos logit contributions
ppa+0.055
oll+0.053
aj+0.053
 granddaughter+0.052
 grandmother+0.051
Neg logit contributions
 score-0.026
 race-0.026
Mine-0.025
Stop-0.024
 kick-0.024
Token her
Token ID607
Feature activation-5.59e+00

Pos logit contributions
ppa+0.005
 grandmother+0.005
 granddaughter+0.004
aj+0.004
oll+0.004
Neg logit contributions
Stop-0.003
Mine-0.003
 score-0.003
 race-0.003
 hill-0.003
Token with
Token ID351
Feature activation-1.38e+00

Pos logit contributions
ppa+0.158
oll+0.156
aj+0.150
ovable+0.149
abella+0.149
Neg logit contributions
 race-0.064
 score-0.063
 kick-0.062
 challenge-0.057
!"-0.057
Token a
Token ID257
Feature activation+7.40e-01

Pos logit contributions
ppa+0.034
rina+0.032
lor+0.032
oll+0.031
apa+0.031
Neg logit contributions
Mine-0.021
 race-0.020
 score-0.019
 wind-0.019
Stop-0.019
Token task
Token ID4876
Feature activation+1.50e+00

Pos logit contributions
Mine+0.012
Stop+0.012
 score+0.012
 hill+0.011
OK+0.011
Neg logit contributions
ppa-0.016
 grandmother-0.016
 granddaughter-0.016
apa-0.015
oll-0.015
Token.
Token ID13
Feature activation-3.61e-02

Pos logit contributions
Stop+0.025
Mine+0.025
 score+0.023
Run+0.023
OK+0.023
Neg logit contributions
 grandmother-0.033
ppa-0.033
 granddaughter-0.032
apa-0.031
oll-0.030
Token She
Token ID1375
Feature activation+4.74e-01

Pos logit contributions
ppa+0.001
oll+0.001
apa+0.001
ovable+0.001
urger+0.001
Neg logit contributions
Stop-0.001
Mine-0.001
 score-0.001
OK-0.001
 hill-0.001