TOP ACTIVATIONS
MAX = 0.372

knew to be careful around volcan oes . <|endoftext|> Once upon
just wanted to write about volcan oes . We like volcan
They learned a lot about volcan oes . They thanked the
volcan oes . We like volcan oes . They are rough
just wanted to write about volcan oes . We like volcan
volcan oes . We like volcan oes . They are rough
. You should not like volcan oes . You should like
big book with pictures of volcan oes . Vol can oes
learned even more about how volcan oes work . She thought
. They had enough of volcan oes for one day .
not understand . They like volcan oes . They think they
away from dangerous things like volcan oes . <|endoftext|> Once upon
teach you something special about volcan oes ." So
The book has pictures of volcan oes . Vol can oes
. You should not like volcan oes . You should like
They want to write about volcan oes . They
little girl all about how volcan oes work . She taught
Cars didn 't fly to volcan oes . <|endoftext|> L ily
. Lily and Ben think volcan oes are cool . They
They want to write about volcan oes . They

MIN ACTIVATIONS
MIN = -0.400

and luxurious ". Kim and Hank looked around the park ,
two friends , Jack and Hank . They loved to play
Hannah 's mom . But Hannah was not worried . She
was a little girl named Hannah . She loved to play
" Oh no !" exclaimed Hannah 's mom . But Hannah
One day , Hannah went to the park .
walked down the street , Hannah saw something that made her
" Thanks Dad !" and Hank was happy he had been
little girl called Hannah . Hannah was very naughty and often
But we are good jog gers ." They laugh
it smelled so delicious ! Hannah looked around and saw a
was a little girl called Hannah . Hannah was very naughty
fashion . Every day when Hannah got dressed , her mum
there were two friends , Hank and Sally . They loved
jungle . As Hannah walked further and further away
S ally and Hank hugged each other and from
. Suddenly , Hank saw something dug in the
there was a girl named Hannah . She had a cool
cup and gave it to Hannah . Suddenly , it began
all tired but happy . Hannah had a very fun day

INTERVAL 0.179 - 0.372
CONTAINS 0.377%

how he could use the avocado to make something else .
but he didn 't know how . Then one
be so mean to the statue . Tim my felt bad
to say hello . She climbed up the tree and said
back and find more pretty pe b bles and learn more

INTERVAL -0.014 - 0.179
CONTAINS 63.687%

, the goat was able to play the game with ease
says . " Oh , I see . That
come and play with him . Jill said no , she
make more ?" Lily smiles and says , " Yes ,
chair , and they read the book together . They laughed

INTERVAL -0.207 - -0.014
CONTAINS 35.842%

Maybe it is a monster ," Tom says . He is
to explore . One day , Lucy was feeling
. It was Mom . She came into the kitchen and
the ground . He reaches for the je ep . But
happened . Finally , everyone was talking about Daisy and her

INTERVAL -0.400 - -0.207
CONTAINS 0.094%

much fun ! Jimmy 's exploring didn 't stop
Tom to his bed and tucked him in . They kissed
was a lovely bear named Oscar . Oscar was always a
about we make him a sweater too ? A very it
John 's mom held the leash and lead Daisy to the
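
The four INTERVAL ranges above are consistent with splitting the span between MIN (-0.400) and MAX (0.372) into four equal-width bins of 0.193 each. The dump does not state how the ranges were chosen, so the equal-width rule is an inference from the numbers, and the helper name `interval_edges` is hypothetical. A minimal Python sketch under that assumption, which also checks that the four CONTAINS percentages account for all activations:

```python
# Sketch only: assumes the dashboard uses equal-width bins between MIN and MAX.
# `interval_edges` is a hypothetical helper, not part of any library.

def interval_edges(lo, hi, n_bins=4):
    """Return the n_bins + 1 edges of equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as reported in the dump.
edges = interval_edges(-0.400, 0.372)
print(edges)  # [-0.4, -0.207, -0.014, 0.179, 0.372]

# CONTAINS percentages as reported, ordered from lowest to highest interval.
shares = [0.094, 35.842, 63.687, 0.377]
print(round(sum(shares), 3))  # 100.0
```

The computed edges match the reported interval boundaries, and the shares sum to 100%. Almost all activations (about 99.5%) fall in the two middle bins, so both the top and the bottom examples are rare tail events.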
Token knew
Token ID2993
Feature activation+1.35e-01

Pos logit contributions
 Hank+0.076
 Fiona+0.076
 Holly+0.075
 Hannah+0.075
gers+0.074
Neg logit contributions
 volcan-0.064
degree-0.061
endo-0.060
tr-0.057
 eru-0.057
Token to
Token ID284
Feature activation+8.81e-04

Pos logit contributions
 volcan+0.145
degree+0.122
 erupt+0.113
raper+0.112
 eru+0.111
Neg logit contributions
gers-0.227
 Holly-0.225
 trainers-0.221
 Hannah-0.219
 Hank-0.218
Token be
Token ID307
Feature activation+6.58e-03

Pos logit contributions
 volcan+0.001
degree+0.001
endo+0.001
 hex+0.001
tr+0.001
Neg logit contributions
 Hank-0.001
 Fiona-0.001
 Hannah-0.001
 Holly-0.001
gers-0.001
Token careful
Token ID8161
Feature activation-5.87e-02

Pos logit contributions
 volcan+0.008
degree+0.008
endo+0.007
 eru+0.007
 clim+0.007
Neg logit contributions
 Hank-0.010
 Hannah-0.010
 Holly-0.010
 Fiona-0.009
gers-0.009
Token around
Token ID1088
Feature activation-5.66e-02

Pos logit contributions
 Holly+0.118
 Hank+0.117
 Hannah+0.115
 Fiona+0.114
 Woo+0.113
Neg logit contributions
 volcan-0.048
degree-0.036
 erupt-0.036
 how-0.034
endo-0.034
Token volcan
Token ID17516
Feature activation+3.72e-01

Pos logit contributions
 Fiona+0.074
 Hank+0.073
 Hannah+0.072
gers+0.071
 Holly+0.071
Neg logit contributions
degree-0.075
endo-0.073
 volcan-0.073
tr-0.071
 hex-0.068
Tokenoes
Token ID3028
Feature activation+1.37e-01

Pos logit contributions
degree+0.478
endo+0.451
 BO+0.447
 volcan+0.440
 hex+0.437
Neg logit contributions
gers-0.482
 Hank-0.457
 Fiona-0.453
 trainers-0.435
latable-0.421
Token.
Token ID13
Feature activation+2.50e-04

Pos logit contributions
 volcan+0.166
degree+0.157
endo+0.153
 hex+0.147
 eru+0.144
Neg logit contributions
 Hank-0.201
 Hannah-0.198
 Fiona-0.198
 Holly-0.196
gers-0.193
Token<|endoftext|>
Token ID50256
Feature activation-7.38e-02

Pos logit contributions
 volcan+0.000
degree+0.000
endo+0.000
 hex+0.000
tr+0.000
Neg logit contributions
 Hank-0.000
 Fiona-0.000
 Hannah-0.000
 Holly-0.000
 Woo-0.000
TokenOnce
Token ID7454
Feature activation+7.76e-02

Pos logit contributions
 Hank+0.133
 Hannah+0.131
gers+0.131
 Fiona+0.131
 trainers+0.130
Neg logit contributions
 volcan-0.059
endo-0.054
degree-0.052
 hex-0.050
 eru-0.050
Token upon
Token ID2402
Feature activation-5.01e-02

Pos logit contributions
 volcan+0.107
degree+0.102
endo+0.099
 hex+0.095
 eru+0.094
Neg logit contributions
 Hank-0.099
gers-0.098
 Fiona-0.097
 Hannah-0.097
 Holly-0.096
Token just
Token ID655
Feature activation+3.65e-02

Pos logit contributions
 volcan+0.015
degree+0.014
tr+0.014
endo+0.014
pre+0.013
Neg logit contributions
 Fiona-0.022
 Hank-0.022
 Holly-0.022
 Hannah-0.022
gers-0.022
Token wanted
Token ID2227
Feature activation+2.11e-02

Pos logit contributions
 volcan+0.050
degree+0.048
endo+0.047
 hex+0.045
raper+0.044
Neg logit contributions
 Hank-0.047
 Hannah-0.046
 Fiona-0.046
 Holly-0.045
gers-0.045
Token to
Token ID284
Feature activation-1.08e-03

Pos logit contributions
degree+0.028
 volcan+0.028
endo+0.027
 hex+0.026
pre+0.026
Neg logit contributions
 Hank-0.028
 Hannah-0.027
 Fiona-0.027
 Holly-0.027
gers-0.026
Token write
Token ID3551
Feature activation+5.81e-02

Pos logit contributions
 Hank+0.001
 Fiona+0.001
 Hannah+0.001
 Holly+0.001
gers+0.001
Neg logit contributions
 volcan-0.001
degree-0.001
endo-0.001
 hex-0.001
tr-0.001
Token about
Token ID546
Feature activation-6.66e-02

Pos logit contributions
 volcan+0.070
degree+0.067
endo+0.065
 hex+0.062
tr+0.062
Neg logit contributions
 Hank-0.085
 Fiona-0.084
 Hannah-0.083
 Holly-0.083
gers-0.081
Token volcan
Token ID17516
Feature activation+3.69e-01

Pos logit contributions
 Hank+0.081
 Hannah+0.081
 Fiona+0.079
 Holly+0.078
 Rover+0.076
Neg logit contributions
degree-0.096
endo-0.090
pre-0.089
tr-0.089
 volcan-0.088
Tokenoes
Token ID3028
Feature activation+1.37e-01

Pos logit contributions
degree+0.543
endo+0.513
 BO+0.510
BO+0.493
AAAA+0.488
Neg logit contributions
gers-0.408
 Hank-0.357
 jackets-0.357
 trainers-0.352
latable-0.344
Token.
Token ID13
Feature activation+7.06e-03

Pos logit contributions
 volcan+0.162
degree+0.151
endo+0.147
 hex+0.143
tr+0.141
Neg logit contributions
 Hank-0.207
 Fiona-0.204
 Hannah-0.202
 Holly-0.200
gers-0.198
Token We
Token ID775
Feature activation+1.38e-02

Pos logit contributions
degree+0.009
 volcan+0.009
endo+0.009
aph+0.009
 hex+0.009
Neg logit contributions
 Hank-0.009
 Hannah-0.009
 Fiona-0.009
gers-0.008
 Holly-0.008
Token like
Token ID588
Feature activation-5.70e-02

Pos logit contributions
 volcan+0.015
degree+0.014
tr+0.014
endo+0.014
 hex+0.013
Neg logit contributions
 Fiona-0.022
 Hank-0.021
 Holly-0.021
 Hannah-0.021
gers-0.021
Token volcan
Token ID17516
Feature activation+3.67e-01

Pos logit contributions
 Fiona+0.054
 Hank+0.052
 Rover+0.052
 Hannah+0.049
 Woo+0.049
Neg logit contributions
degree-0.087
tr-0.086
aph-0.085
endo-0.084
pre-0.082
Token They
Token ID1119
Feature activation-3.97e-02

Pos logit contributions
degree+0.009
 volcan+0.008
endo+0.008
pre+0.008
 hex+0.008
Neg logit contributions
 Hank-0.006
 Hannah-0.006
 Fiona-0.006
 Holly-0.006
gers-0.006
Token learned
Token ID4499
Feature activation+6.13e-03

Pos logit contributions
 Hank+0.057
 Fiona+0.056
 Hannah+0.056
 Holly+0.055
gers+0.055
Neg logit contributions
 volcan-0.049
degree-0.047
endo-0.046
 hex-0.044
tr-0.043
Token a
Token ID257
Feature activation+4.16e-03

Pos logit contributions
 volcan+0.008
degree+0.008
endo+0.007
 hex+0.007
tr+0.007
Neg logit contributions
 Hank-0.009
 Fiona-0.009
 Hannah-0.008
 Holly-0.008
gers-0.008
Token lot
Token ID1256
Feature activation+7.09e-02

Pos logit contributions
endo+0.007
degree+0.007
 volcan+0.007
sur+0.006
eks+0.006
Neg logit contributions
gers-0.004
 Fiona-0.004
 Hank-0.004
 Woo-0.003
 Holly-0.003
Token about
Token ID546
Feature activation-6.61e-02

Pos logit contributions
 volcan+0.092
degree+0.089
endo+0.087
 hex+0.083
tr+0.082
Neg logit contributions
 Hank-0.097
 Fiona-0.096
 Hannah-0.095
 Holly-0.095
gers-0.094
Token volcan
Token ID17516
Feature activation+3.68e-01

Pos logit contributions
 Hank+0.089
 Fiona+0.086
 Rover+0.083
 Oscar+0.082
 Otto+0.081
Neg logit contributions
degree-0.087
endo-0.078
tr-0.078
aph-0.075
pre-0.074
Tokenoes
Token ID3028
Feature activation+1.34e-01

Pos logit contributions
degree+0.501
endo+0.486
 BO+0.463
BO+0.453
 POP+0.439
Neg logit contributions
gers-0.451
 Hank-0.412
latable-0.390
 Fiona-0.387
 Oscar-0.381
Token.
Token ID13
Feature activation+2.58e-03

Pos logit contributions
 volcan+0.167
degree+0.163
endo+0.158
 hex+0.150
tr+0.149
Neg logit contributions
 Hank-0.188
 Fiona-0.186
 Hannah-0.185
 Holly-0.184
gers-0.182
Token They
Token ID1119
Feature activation-3.88e-02

Pos logit contributions
degree+0.004
 volcan+0.004
endo+0.004
pre+0.004
 hex+0.004
Neg logit contributions
 Hank-0.003
 Hannah-0.003
 Fiona-0.003
 Holly-0.003
 Woo-0.003
Token thanked
Token ID26280
Feature activation+3.98e-02

Pos logit contributions
 Hank+0.055
 Fiona+0.054
 Hannah+0.054
 Holly+0.054
gers+0.053
Neg logit contributions
 volcan-0.049
degree-0.046
endo-0.045
 hex-0.044
tr-0.043
Token the
Token ID262
Feature activation-7.21e-03

Pos logit contributions
degree+0.070
tr+0.067
endo+0.067
destruct+0.063
 hex+0.063
Neg logit contributions
 Hank-0.040
 Fiona-0.038
 Hannah-0.038
 Rover-0.037
 Woo-0.035
Token volcan
Token ID17516
Feature activation+3.69e-01

Pos logit contributions
 Hank+0.081
 Hannah+0.081
 Fiona+0.079
 Holly+0.078
 Rover+0.076
Neg logit contributions
degree-0.096
endo-0.090
pre-0.089
tr-0.089
 volcan-0.088
Tokenoes
Token ID3028
Feature activation+1.37e-01

Pos logit contributions
degree+0.543
endo+0.513
 BO+0.510
BO+0.493
AAAA+0.488
Neg logit contributions
gers-0.408
 Hank-0.357
 jackets-0.357
 trainers-0.352
latable-0.344
Token.
Token ID13
Feature activation+7.06e-03

Pos logit contributions
 volcan+0.162
degree+0.151
endo+0.147
 hex+0.143
tr+0.141
Neg logit contributions
 Hank-0.207
 Fiona-0.204
 Hannah-0.202
 Holly-0.200
gers-0.198
Token We
Token ID775
Feature activation+1.38e-02

Pos logit contributions
degree+0.009
 volcan+0.009
endo+0.009
aph+0.009
 hex+0.009
Neg logit contributions
 Hank-0.009
 Hannah-0.009
 Fiona-0.009
gers-0.008
 Holly-0.008
Token like
Token ID588
Feature activation-5.70e-02

Pos logit contributions
 volcan+0.015
degree+0.014
tr+0.014
endo+0.014
 hex+0.013
Neg logit contributions
 Fiona-0.022
 Hank-0.021
 Holly-0.021
 Hannah-0.021
gers-0.021
Token volcan
Token ID17516
Feature activation+3.67e-01

Pos logit contributions
 Fiona+0.054
 Hank+0.052
 Rover+0.052
 Hannah+0.049
 Woo+0.049
Neg logit contributions
degree-0.087
tr-0.086
aph-0.085
endo-0.084
pre-0.082
Tokenoes
Token ID3028
Feature activation+1.36e-01

Pos logit contributions
degree+0.506
endo+0.478
 BO+0.469
BO+0.457
 volcan+0.446
Neg logit contributions
gers-0.433
 Hank-0.399
 Fiona-0.392
latable-0.383
 trainers-0.382
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
 volcan+0.160
degree+0.150
endo+0.145
 hex+0.141
tr+0.139
Neg logit contributions
 Hank-0.207
 Fiona-0.204
 Hannah-0.203
 Holly-0.201
gers-0.198
Token They
Token ID1119
Feature activation-3.77e-02

Pos logit contributions
degree+0.012
endo+0.012
aph+0.012
 volcan+0.011
pre+0.011
Neg logit contributions
 Hank-0.010
 Fiona-0.010
 Hannah-0.009
gers-0.009
 Woo-0.009
Token are
Token ID389
Feature activation+1.52e-02

Pos logit contributions
 Hank+0.053
 Fiona+0.052
 Hannah+0.052
 Holly+0.052
gers+0.051
Neg logit contributions
 volcan-0.048
degree-0.046
endo-0.044
 hex-0.043
tr-0.042
Token rough
Token ID5210
Feature activation-3.40e-02

Pos logit contributions
degree+0.020
endo+0.019
 volcan+0.019
pre+0.019
tr+0.018
Neg logit contributions
 Hank-0.020
 Fiona-0.020
 Hannah-0.019
 Holly-0.019
gers-0.019
Token just
Token ID655
Feature activation+3.28e-02

Pos logit contributions
 volcan+0.009
degree+0.009
tr+0.008
endo+0.008
pre+0.008
Neg logit contributions
 Fiona-0.013
 Hank-0.013
 Holly-0.013
 Hannah-0.013
gers-0.013
Token wanted
Token ID2227
Feature activation+1.72e-02

Pos logit contributions
 volcan+0.047
degree+0.045
endo+0.044
 hex+0.043
raper+0.043
Neg logit contributions
 Hank-0.040
 Hannah-0.039
 Fiona-0.038
 Holly-0.038
gers-0.037
Token to
Token ID284
Feature activation-3.39e-03

Pos logit contributions
degree+0.028
endo+0.028
pre+0.027
aph+0.026
tr+0.026
Neg logit contributions
 Hannah-0.017
 Hank-0.017
 Fiona-0.017
 Holly-0.016
 Woo-0.016
Token write
Token ID3551
Feature activation+5.13e-02

Pos logit contributions
 Hank+0.004
 Hannah+0.004
 Fiona+0.004
 Holly+0.003
 Woo+0.003
Neg logit contributions
 volcan-0.005
endo-0.005
degree-0.005
pre-0.005
 hex-0.005
Token about
Token ID546
Feature activation-7.26e-02

Pos logit contributions
 volcan+0.063
degree+0.061
endo+0.059
 hex+0.057
tr+0.056
Neg logit contributions
 Hank-0.073
 Fiona-0.072
 Hannah-0.072
 Holly-0.071
gers-0.070
Token volcan
Token ID17516
Feature activation+3.67e-01

Pos logit contributions
 Hannah+0.076
 Hank+0.075
 Fiona+0.074
 Holly+0.072
 Rover+0.071
Neg logit contributions
degree-0.116
tr-0.109
endo-0.109
pre-0.109
aph-0.106
Tokenoes
Token ID3028
Feature activation+1.33e-01

Pos logit contributions
degree+0.526
 BO+0.501
endo+0.501
AAAA+0.487
 POP+0.477
Neg logit contributions
gers-0.409
 Hank-0.357
 jackets-0.351
latable-0.347
agging-0.347
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 volcan+0.157
degree+0.148
endo+0.144
 hex+0.139
tr+0.137
Neg logit contributions
 Hank-0.199
 Fiona-0.197
 Hannah-0.195
 Holly-0.194
gers-0.191
Token We
Token ID775
Feature activation+7.27e-03

Pos logit contributions
 Hank+0.002
 Hannah+0.002
 Fiona+0.002
 Holly+0.002
gers+0.002
Neg logit contributions
degree-0.002
 volcan-0.002
endo-0.002
aph-0.002
 hex-0.002
Token like
Token ID588
Feature activation-6.22e-02

Pos logit contributions
 volcan+0.008
degree+0.008
tr+0.007
endo+0.007
pre+0.007
Neg logit contributions
 Fiona-0.012
 Holly-0.011
 Hank-0.011
 Hannah-0.011
gers-0.011
Token volcan
Token ID17516
Feature activation+3.66e-01

Pos logit contributions
 Fiona+0.049
 Hank+0.046
 Hannah+0.045
 Rover+0.045
 Woo+0.044
Neg logit contributions
tr-0.103
aph-0.103
degree-0.102
endo-0.099
pre-0.099
Token volcan
Token ID17516
Feature activation+3.67e-01

Pos logit contributions
 Hannah+0.076
 Hank+0.075
 Fiona+0.074
 Holly+0.072
 Rover+0.071
Neg logit contributions
degree-0.116
tr-0.109
endo-0.109
pre-0.109
aph-0.106
Tokenoes
Token ID3028
Feature activation+1.33e-01

Pos logit contributions
degree+0.526
 BO+0.501
endo+0.501
AAAA+0.487
 POP+0.477
Neg logit contributions
gers-0.409
 Hank-0.357
 jackets-0.351
latable-0.347
agging-0.347
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 volcan+0.157
degree+0.148
endo+0.144
 hex+0.139
tr+0.137
Neg logit contributions
 Hank-0.199
 Fiona-0.197
 Hannah-0.195
 Holly-0.194
gers-0.191
Token We
Token ID775
Feature activation+7.27e-03

Pos logit contributions
 Hank+0.002
 Hannah+0.002
 Fiona+0.002
 Holly+0.002
gers+0.002
Neg logit contributions
degree-0.002
 volcan-0.002
endo-0.002
aph-0.002
 hex-0.002
Token like
Token ID588
Feature activation-6.22e-02

Pos logit contributions
 volcan+0.008
degree+0.008
tr+0.007
endo+0.007
pre+0.007
Neg logit contributions
 Fiona-0.012
 Holly-0.011
 Hank-0.011
 Hannah-0.011
gers-0.011
Token volcan
Token ID17516
Feature activation+3.66e-01

Pos logit contributions
 Fiona+0.049
 Hank+0.046
 Hannah+0.045
 Rover+0.045
 Woo+0.044
Neg logit contributions
tr-0.103
aph-0.103
degree-0.102
endo-0.099
pre-0.099
Tokenoes
Token ID3028
Feature activation+1.32e-01

Pos logit contributions
degree+0.486
endo+0.464
 BO+0.455
 volcan+0.442
BO+0.437
Neg logit contributions
gers-0.443
 Hank-0.412
 Fiona-0.397
latable-0.394
 trainers-0.387
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 volcan+0.151
degree+0.140
endo+0.135
 hex+0.132
tr+0.130
Neg logit contributions
 Hank-0.206
 Fiona-0.203
 Hannah-0.201
 Holly-0.199
gers-0.196
Token They
Token ID1119
Feature activation-4.33e-02

Pos logit contributions
 Hank+0.002
 Fiona+0.002
 Hannah+0.002
gers+0.002
 Woo+0.002
Neg logit contributions
degree-0.002
endo-0.002
aph-0.002
 volcan-0.002
pre-0.002
Token are
Token ID389
Feature activation+1.10e-02

Pos logit contributions
 Hank+0.059
 Fiona+0.058
 Hannah+0.058
 Holly+0.057
gers+0.057
Neg logit contributions
 volcan-0.056
degree-0.054
endo-0.053
 hex-0.050
tr-0.050
Token rough
Token ID5210
Feature activation-3.67e-02

Pos logit contributions
degree+0.015
endo+0.014
pre+0.014
aph+0.014
 volcan+0.014
Neg logit contributions
 Hank-0.014
 Fiona-0.014
 Hannah-0.014
 Holly-0.014
gers-0.013
Token.
Token ID13
Feature activation+1.44e-03

Pos logit contributions
 volcan+0.042
degree+0.040
endo+0.039
 hex+0.037
tr+0.037
Neg logit contributions
 Hank-0.052
 Fiona-0.052
 Hannah-0.051
 Holly-0.051
gers-0.050
Token You
Token ID921
Feature activation+1.16e-02

Pos logit contributions
 volcan+0.002
degree+0.001
endo+0.001
 Or+0.001
 hex+0.001
Neg logit contributions
 Hank-0.002
gers-0.002
 Fiona-0.002
 Holly-0.002
 Hannah-0.002
Token should
Token ID815
Feature activation-4.53e-02

Pos logit contributions
 volcan+0.019
degree+0.019
endo+0.019
aph+0.017
raper+0.017
Neg logit contributions
 Hank-0.011
 Fiona-0.011
 Hannah-0.011
 Holly-0.011
gers-0.010
Token not
Token ID407
Feature activation+7.11e-03

Pos logit contributions
gers+0.080
 Fiona+0.077
 Holly+0.076
 Hank+0.076
 Hannah+0.076
Neg logit contributions
 volcan-0.043
 erupt-0.040
endo-0.039
ast-0.038
tr-0.037
Token like
Token ID588
Feature activation-6.09e-02

Pos logit contributions
 volcan+0.010
degree+0.010
endo+0.010
 hex+0.009
 eru+0.009
Neg logit contributions
 Hank-0.009
 Hannah-0.008
 Fiona-0.008
 Holly-0.008
 Woo-0.008
Token volcan
Token ID17516
Feature activation+3.65e-01

Pos logit contributions
 Fiona+0.060
 Hank+0.057
 Rover+0.055
 Woo+0.055
 Hannah+0.055
Neg logit contributions
degree-0.097
tr-0.096
endo-0.091
pre-0.088
aph-0.087
Tokenoes
Token ID3028
Feature activation+1.34e-01

Pos logit contributions
degree+0.512
endo+0.497
 BO+0.476
BO+0.463
 hex+0.461
Neg logit contributions
gers-0.407
 Hank-0.368
 jackets-0.363
 Fiona-0.360
agging-0.358
Token.
Token ID13
Feature activation+2.06e-03

Pos logit contributions
 volcan+0.158
degree+0.152
endo+0.148
 hex+0.140
tr+0.139
Neg logit contributions
 Hank-0.198
 Fiona-0.196
 Hannah-0.195
 Holly-0.194
gers-0.192
Token You
Token ID921
Feature activation+1.09e-02

Pos logit contributions
 volcan+0.003
degree+0.003
endo+0.003
 hex+0.002
tr+0.002
Neg logit contributions
 Hank-0.003
 Fiona-0.003
 Hannah-0.003
 Holly-0.003
gers-0.003
Token should
Token ID815
Feature activation-4.63e-02

Pos logit contributions
 volcan+0.017
degree+0.017
endo+0.017
 hex+0.016
raper+0.016
Neg logit contributions
 Hank-0.012
 Fiona-0.011
 Hannah-0.011
 Holly-0.011
gers-0.011
Token like
Token ID588
Feature activation-6.07e-02

Pos logit contributions
gers+0.081
 Fiona+0.078
 Hank+0.077
 Holly+0.077
 Hannah+0.077
Neg logit contributions
 volcan-0.045
endo-0.041
 erupt-0.041
ast-0.039
tr-0.039
Token big
Token ID1263
Feature activation+8.65e-03

Pos logit contributions
 Hank+0.002
 Hannah+0.002
 Fiona+0.002
 Holly+0.002
gers+0.002
Neg logit contributions
 volcan-0.002
degree-0.002
endo-0.002
 eru-0.002
 hex-0.002
Token book
Token ID1492
Feature activation+7.42e-02

Pos logit contributions
 volcan+0.010
 hex+0.009
tr+0.009
degree+0.009
endo+0.009
Neg logit contributions
 Hank-0.013
 Fiona-0.013
 Hannah-0.013
 Holly-0.013
gers-0.013
Token with
Token ID351
Feature activation-4.01e-03

Pos logit contributions
 volcan+0.096
degree+0.093
endo+0.091
 hex+0.087
 eru+0.085
Neg logit contributions
 Hank-0.101
 Fiona-0.100
 Hannah-0.099
 Holly-0.099
gers-0.098
Token pictures
Token ID5986
Feature activation+1.45e-01

Pos logit contributions
 Hank+0.006
 Hannah+0.006
 Fiona+0.006
 Holly+0.006
gers+0.006
Neg logit contributions
 volcan-0.005
degree-0.005
endo-0.005
pre-0.004
 eru-0.004
Token of
Token ID286
Feature activation-7.19e-02

Pos logit contributions
 volcan+0.198
degree+0.191
endo+0.186
 hex+0.179
tr+0.178
Neg logit contributions
 Hank-0.189
 Fiona-0.186
 Hannah-0.185
 Holly-0.183
gers-0.180
Token volcan
Token ID17516
Feature activation+3.65e-01

Pos logit contributions
 Hank+0.106
 Fiona+0.105
 Hannah+0.105
 Holly+0.104
gers+0.103
Neg logit contributions
 volcan-0.084
degree-0.082
endo-0.080
 hex-0.075
tr-0.075
Tokenoes
Token ID3028
Feature activation+1.31e-01

Pos logit contributions
degree+0.553
 BO+0.534
endo+0.523
AAAA+0.514
 POP+0.514
Neg logit contributions
gers-0.384
 jackets-0.330
agging-0.328
 trainers-0.316
latable-0.314
Token.
Token ID13
Feature activation-1.78e-03

Pos logit contributions
 volcan+0.156
degree+0.146
endo+0.142
 hex+0.137
tr+0.136
Neg logit contributions
 Hank-0.194
 Fiona-0.192
 Hannah-0.191
 Holly-0.189
gers-0.187
Token Vol
Token ID4709
Feature activation+2.07e-01

Pos logit contributions
 Fiona+0.003
 Hank+0.003
 Holly+0.003
 Hannah+0.003
gers+0.003
Neg logit contributions
 volcan-0.002
degree-0.002
endo-0.002
 hex-0.002
 eru-0.002
Tokencan
Token ID5171
Feature activation+1.00e-01

Pos logit contributions
 volcan+0.234
degree+0.225
endo+0.217
 hex+0.206
 eru+0.203
Neg logit contributions
 Hank-0.319
 Fiona-0.313
 Hannah-0.313
 Holly-0.310
gers-0.310
Tokenoes
Token ID3028
Feature activation+1.30e-01

Pos logit contributions
 volcan+0.115
degree+0.108
endo+0.106
 hex+0.101
tr+0.101
Neg logit contributions
 Hank-0.153
 Fiona-0.150
 Hannah-0.150
 Holly-0.149
gers-0.147
Token learned
Token ID4499
Feature activation-6.09e-05

Pos logit contributions
 volcan+0.006
degree+0.006
endo+0.005
pre+0.005
 eru+0.005
Neg logit contributions
 Hannah-0.005
 Fiona-0.005
 Holly-0.005
 Hank-0.005
gers-0.005
Token even
Token ID772
Feature activation+2.86e-02

Pos logit contributions
 Hank+0.000
 Fiona+0.000
 Hannah+0.000
gers+0.000
 Holly+0.000
Neg logit contributions
 volcan-0.000
degree-0.000
endo-0.000
 hex-0.000
tr-0.000
Token more
Token ID517
Feature activation-3.34e-02

Pos logit contributions
endo+0.050
 volcan+0.048
pre+0.046
degree+0.045
 eru+0.044
Neg logit contributions
 Holly-0.028
 Hannah-0.027
gers-0.027
 Hank-0.026
 Fiona-0.025
Token about
Token ID546
Feature activation-6.80e-02

Pos logit contributions
 Fiona+0.032
 Hannah+0.032
 Holly+0.031
 Hank+0.031
gers+0.030
Neg logit contributions
endo-0.056
degree-0.055
 volcan-0.054
pre-0.052
tr-0.052
Token how
Token ID703
Feature activation+2.33e-01

Pos logit contributions
 Fiona+0.085
 Hannah+0.085
 Holly+0.084
 Hank+0.084
 Woo+0.079
Neg logit contributions
degree-0.097
endo-0.090
tr-0.090
pre-0.089
 volcan-0.088
Token volcan
Token ID17516
Feature activation+3.64e-01

Pos logit contributions
 volcan+0.267
degree+0.251
endo+0.237
 hex+0.229
 eru+0.224
Neg logit contributions
 Hank-0.363
 Fiona-0.353
 Hannah-0.352
 Holly-0.351
gers-0.346
Tokenoes
Token ID3028
Feature activation+1.26e-01

Pos logit contributions
degree+0.555
endo+0.546
 BO+0.537
BO+0.522
AAAA+0.498
Neg logit contributions
gers-0.359
 Hank-0.321
 Fiona-0.317
 jackets-0.300
 lately-0.294
Token work
Token ID670
Feature activation+9.35e-02

Pos logit contributions
degree+0.180
endo+0.175
 volcan+0.171
tr+0.168
raper+0.164
Neg logit contributions
 Fiona-0.150
 Holly-0.147
 Hank-0.145
 Hannah-0.144
gers-0.143
Token.
Token ID13
Feature activation-6.00e-03

Pos logit contributions
 volcan+0.122
degree+0.121
endo+0.117
 hex+0.111
tr+0.110
Neg logit contributions
 Hank-0.123
 Hannah-0.123
 Fiona-0.122
gers-0.122
 Holly-0.121
Token She
Token ID1375
Feature activation-3.27e-02

Pos logit contributions
 Fiona+0.008
 Hannah+0.008
 Holly+0.008
 Hank+0.008
 Woo+0.007
Neg logit contributions
 volcan-0.009
degree-0.008
pre-0.008
endo-0.008
 hex-0.008
Token thought
Token ID1807
Feature activation+5.06e-02

Pos logit contributions
 Hank+0.041
 Hannah+0.040
 Holly+0.040
 Fiona+0.040
gers+0.039
Neg logit contributions
 volcan-0.046
degree-0.046
endo-0.043
tr-0.041
 eru-0.041
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 volcan+0.291
degree+0.280
endo+0.271
 hex+0.260
tr+0.258
Neg logit contributions
 Hank-0.350
 Fiona-0.346
 Hannah-0.343
 Holly-0.341
gers-0.337
Token They
Token ID1119
Feature activation-4.60e-02

Pos logit contributions
 Hank+0.015
 Hannah+0.015
 Fiona+0.014
 Holly+0.014
 Woo+0.014
Neg logit contributions
degree-0.016
 volcan-0.015
endo-0.015
pre-0.014
 hex-0.014
Token had
Token ID550
Feature activation-7.88e-02

Pos logit contributions
 Fiona+0.073
 Hank+0.072
 Hannah+0.072
gers+0.072
 Holly+0.071
Neg logit contributions
 volcan-0.048
degree-0.044
endo-0.043
tr-0.043
 hex-0.042
Token enough
Token ID1576
Feature activation-9.61e-02

Pos logit contributions
 therapist+0.143
 Hank+0.143
 Fiona+0.142
gers+0.141
 Hannah+0.140
Neg logit contributions
 volcan-0.074
 hex-0.062
endo-0.059
 erupt-0.059
tr-0.057
Token of
Token ID286
Feature activation-7.51e-02

Pos logit contributions
 Hank+0.121
gers+0.120
 Fiona+0.119
 Hannah+0.117
 Holly+0.116
Neg logit contributions
degree-0.129
endo-0.126
 volcan-0.125
tr-0.121
pre-0.119
Token volcan
Token ID17516
Feature activation+3.63e-01

Pos logit contributions
 Hank+0.102
 Fiona+0.101
 Hannah+0.100
 Holly+0.099
gers+0.098
Neg logit contributions
 volcan-0.098
degree-0.096
endo-0.093
tr-0.089
 hex-0.089
Tokenoes
Token ID3028
Feature activation+1.27e-01

Pos logit contributions
degree+0.450
 volcan+0.441
endo+0.434
 BO+0.416
 hex+0.409
Neg logit contributions
gers-0.488
 Hank-0.473
 Fiona-0.453
 trainers-0.451
 Hannah-0.438
Token for
Token ID329
Feature activation-5.94e-02

Pos logit contributions
 volcan+0.166
degree+0.164
endo+0.158
 hex+0.150
 eru+0.148
Neg logit contributions
 Hank-0.169
 Fiona-0.168
 Hannah-0.167
 Holly-0.166
gers-0.163
Token one
Token ID530
Feature activation-3.30e-02

Pos logit contributions
 Hank+0.069
 Fiona+0.068
 Hannah+0.067
 Holly+0.066
 Rover+0.063
Neg logit contributions
degree-0.090
tr-0.087
endo-0.084
pre-0.083
 volcan-0.081
Token day
Token ID1110
Feature activation+6.42e-02

Pos logit contributions
gers+0.030
 Fiona+0.027
 Rover+0.027
 Hank+0.026
 Holly+0.025
Neg logit contributions
endo-0.059
degree-0.054
destruct-0.052
 Ap-0.052
pre-0.051
Token.
Token ID13
Feature activation-8.49e-03

Pos logit contributions
 volcan+0.071
degree+0.066
endo+0.062
 hex+0.061
 eru+0.060
Neg logit contributions
 Hank-0.100
 Fiona-0.099
 Hannah-0.098
 Holly-0.098
gers-0.097
Token not
Token ID407
Feature activation+4.39e-03

Pos logit contributions
 Hank+0.175
 Hannah+0.173
 Fiona+0.170
 Holly+0.170
gers+0.166
Neg logit contributions
 volcan-0.163
degree-0.158
endo-0.153
 hex-0.149
tr-0.149
Token understand
Token ID1833
Feature activation-5.73e-02

Pos logit contributions
 volcan+0.006
degree+0.006
endo+0.005
 hex+0.005
tr+0.005
Neg logit contributions
 Hank-0.006
 Fiona-0.006
 Hannah-0.006
 Holly-0.006
gers-0.006
Token.
Token ID13
Feature activation-1.23e-03

Pos logit contributions
 Hank+0.078
 Hannah+0.076
 Fiona+0.076
 Holly+0.075
gers+0.074
Neg logit contributions
 volcan-0.074
degree-0.074
endo-0.072
 hex-0.068
tr-0.068
Token They
Token ID1119
Feature activation-4.30e-02

Pos logit contributions
 Hank+0.001
 Hannah+0.001
 Fiona+0.001
 Holly+0.001
 Woo+0.001
Neg logit contributions
degree-0.002
 volcan-0.002
endo-0.002
pre-0.002
 hex-0.002
Token like
Token ID588
Feature activation-6.43e-02

Pos logit contributions
 Fiona+0.070
 Hannah+0.069
 Holly+0.069
gers+0.069
 Hank+0.069
Neg logit contributions
 volcan-0.040
degree-0.039
tr-0.038
endo-0.038
 Or-0.037
Token volcan
Token ID17516
Feature activation+3.62e-01

Pos logit contributions
 Fiona+0.058
 Hank+0.055
 Rover+0.053
 Woo+0.053
 Hannah+0.052
Neg logit contributions
degree-0.108
tr-0.101
endo-0.100
ab-0.097
ometime-0.095
Tokenoes
Token ID3028
Feature activation+1.31e-01

Pos logit contributions
degree+0.456
endo+0.446
 volcan+0.427
 hex+0.424
 BO+0.421
Neg logit contributions
gers-0.472
 Fiona-0.443
 Hank-0.443
 Hannah-0.422
 Holly-0.419
Token.
Token ID13
Feature activation-2.33e-03

Pos logit contributions
 volcan+0.149
degree+0.137
endo+0.133
 hex+0.129
tr+0.128
Neg logit contributions
 Hank-0.204
 Fiona-0.200
 Hannah-0.200
 Holly-0.198
gers-0.194
Token They
Token ID1119
Feature activation-4.42e-02

Pos logit contributions
 Hank+0.003
 Hannah+0.003
 Fiona+0.003
 Holly+0.002
 Woo+0.002
Neg logit contributions
degree-0.004
endo-0.004
 volcan-0.004
pre-0.003
 hex-0.003
Token think
Token ID892
Feature activation+1.07e-01

Pos logit contributions
 Hank+0.072
 Fiona+0.072
 Hannah+0.072
 Holly+0.071
gers+0.070
Neg logit contributions
 volcan-0.047
degree-0.042
endo-0.041
 hex-0.041
 eru-0.040
Token they
Token ID484
Feature activation+1.83e-02

Pos logit contributions
degree+0.149
endo+0.146
tr+0.141
aph+0.136
pre+0.131
Neg logit contributions
 Fiona-0.132
 Hank-0.125
gers-0.122
 Hannah-0.120
 Holly-0.117
Token away
Token ID1497
Feature activation+1.23e-02

Pos logit contributions
 Holly+0.113
liv+0.112
 Fiona+0.112
 Hank+0.111
scar+0.109
Neg logit contributions
ast-0.037
degree-0.033
 together-0.031
 how-0.030
 exc-0.030
Token from
Token ID422
Feature activation+7.82e-02

Pos logit contributions
 volcan+0.009
 hex+0.008
 erupt+0.008
degree+0.008
 eru+0.008
Neg logit contributions
 Hank-0.023
gers-0.023
 Holly-0.023
 trainers-0.023
 Hannah-0.023
Token dangerous
Token ID4923
Feature activation-2.27e-04

Pos logit contributions
 volcan+0.101
degree+0.096
endo+0.094
 hex+0.090
 eru+0.088
Neg logit contributions
 Hank-0.107
 Fiona-0.107
 Hannah-0.106
 Holly-0.105
gers-0.105
Token things
Token ID1243
Feature activation+5.37e-02

Pos logit contributions
 Holly+0.000
 Fiona+0.000
 Hank+0.000
 Hannah+0.000
liv+0.000
Neg logit contributions
 volcan-0.000
degree-0.000
 eru-0.000
endo-0.000
 towers-0.000
Token like
Token ID588
Feature activation-6.05e-02

Pos logit contributions
 volcan+0.055
degree+0.052
endo+0.049
tr+0.047
 erupt+0.046
Neg logit contributions
 Fiona-0.087
 Hank-0.086
 Holly-0.085
 Hannah-0.084
gers-0.083
Token volcan
Token ID17516
Feature activation+3.62e-01

Pos logit contributions
 Hank+0.083
 Fiona+0.082
 Hannah+0.082
 Holly+0.081
gers+0.080
Neg logit contributions
 volcan-0.077
degree-0.075
endo-0.073
 hex-0.070
tr-0.069
Tokenoes
Token ID3028
Feature activation+1.28e-01

Pos logit contributions
degree+0.495
endo+0.470
 hex+0.456
 volcan+0.452
 BO+0.450
Neg logit contributions
gers-0.435
 Hank-0.414
 Fiona-0.398
 jackets-0.385
 trainers-0.383
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
 volcan+0.153
degree+0.145
endo+0.140
 hex+0.133
 eru+0.132
Neg logit contributions
 Hank-0.189
 Fiona-0.186
 Hannah-0.186
 Holly-0.184
gers-0.181
Token<|endoftext|>
Token ID50256
Feature activation-7.45e-02

Pos logit contributions
 Hank+0.016
 Fiona+0.016
 Hannah+0.016
 Holly+0.016
 Woo+0.015
Neg logit contributions
 volcan-0.015
degree-0.015
endo-0.014
 hex-0.014
pre-0.014
TokenOnce
Token ID7454
Feature activation+7.32e-02

Pos logit contributions
 Hank+0.121
 Fiona+0.119
 Hannah+0.119
 Holly+0.118
gers+0.117
Neg logit contributions
 volcan-0.076
degree-0.073
endo-0.071
 hex-0.067
tr-0.066
Token upon
Token ID2402
Feature activation-5.52e-02

Pos logit contributions
 volcan+0.114
degree+0.111
endo+0.108
 hex+0.104
tr+0.104
Neg logit contributions
 Hank-0.081
 Fiona-0.080
 Hannah-0.079
 Holly-0.078
gers-0.077
Token teach
Token ID4545
Feature activation-4.70e-02

Pos logit contributions
 Hank+0.106
 Hannah+0.103
 Fiona+0.103
 Holly+0.100
gers+0.096
Neg logit contributions
 volcan-0.120
degree-0.118
endo-0.117
tr-0.110
 hex-0.109
Token you
Token ID345
Feature activation+7.12e-02

Pos logit contributions
 Fiona+0.071
 Hannah+0.071
 Hank+0.071
 Holly+0.071
gers+0.067
Neg logit contributions
 volcan-0.053
degree-0.053
endo-0.051
tr-0.049
pre-0.048
Token something
Token ID1223
Feature activation-5.38e-02

Pos logit contributions
 volcan+0.058
degree+0.052
endo+0.048
 hex+0.048
 eru+0.044
Neg logit contributions
 Hank-0.137
 Hannah-0.134
 Fiona-0.133
 Holly-0.132
gers-0.132
Token special
Token ID2041
Feature activation-1.73e-02

Pos logit contributions
 Fiona+0.069
 Hank+0.069
 Hannah+0.068
 Holly+0.068
gers+0.066
Neg logit contributions
 volcan-0.072
endo-0.071
degree-0.071
tr-0.066
pre-0.066
Token about
Token ID546
Feature activation-6.82e-02

Pos logit contributions
 Fiona+0.021
 Holly+0.021
 Hannah+0.021
 Hank+0.021
gers+0.021
Neg logit contributions
 volcan-0.024
degree-0.024
endo-0.023
pre-0.022
 eru-0.022
Token volcan
Token ID17516
Feature activation+3.61e-01

Pos logit contributions
 Fiona+0.080
 Holly+0.076
 Hannah+0.075
 Hank+0.074
 Rover+0.073
Neg logit contributions
degree-0.101
pre-0.098
tr-0.095
aph-0.092
endo-0.090
Tokenoes
Token ID3028
Feature activation+1.26e-01

Pos logit contributions
degree+0.524
endo+0.510
 BO+0.508
BO+0.498
AAAA+0.488
Neg logit contributions
gers-0.384
 Hank-0.345
 Fiona-0.333
agging-0.329
latable-0.328
Token."
Token ID526
Feature activation-1.99e-03

Pos logit contributions
 volcan+0.164
degree+0.159
endo+0.154
 hex+0.148
tr+0.146
Neg logit contributions
 Hank-0.169
 Fiona-0.167
 Hannah-0.166
 Holly-0.165
gers-0.163
Token\n
Token ID198
Feature activation+4.24e-02

Pos logit contributions
 Hannah+0.002
 Fiona+0.002
 Holly+0.002
 Hank+0.002
 Nora+0.002
Neg logit contributions
endo-0.003
degree-0.003
 volcan-0.003
raper-0.003
destruct-0.003
Token\n
Token ID198
Feature activation+4.21e-02

Pos logit contributions
 volcan+0.040
degree+0.038
endo+0.037
 hex+0.034
 eru+0.034
Neg logit contributions
 Hank-0.072
gers-0.071
 Hannah-0.071
 Fiona-0.071
 Holly-0.070
TokenSo
Token ID2396
Feature activation-5.58e-03

Pos logit contributions
 volcan+0.061
degree+0.059
endo+0.057
 hex+0.056
tr+0.055
Neg logit contributions
 Hank-0.052
 Fiona-0.051
 Hannah-0.051
 Holly-0.050
gers-0.050
Token The
Token ID383
Feature activation+1.62e-02

Pos logit contributions
Matthew+0.013
gers+0.012
 Fiona+0.012
 therapist+0.012
 Hank+0.012
Neg logit contributions
 volcan-0.007
 hex-0.007
 Or-0.007
endo-0.006
 eru-0.006
Token book
Token ID1492
Feature activation+6.71e-02

Pos logit contributions
endo+0.024
 volcan+0.024
pre+0.024
 eru+0.023
degree+0.022
Neg logit contributions
gers-0.019
 therapist-0.018
 Holly-0.018
 Hannah-0.017
 Rover-0.017
Token has
Token ID468
Feature activation+3.23e-03

Pos logit contributions
 volcan+0.095
degree+0.092
endo+0.090
pre+0.087
 eru+0.085
Neg logit contributions
gers-0.083
 Holly-0.082
 Hannah-0.080
 Fiona-0.080
 Hank-0.079
Token pictures
Token ID5986
Feature activation+1.36e-01

Pos logit contributions
 volcan+0.004
degree+0.004
endo+0.004
pre+0.003
 eru+0.003
Neg logit contributions
 Hank-0.005
 Hannah-0.005
 Fiona-0.005
gers-0.005
 Holly-0.005
Token of
Token ID286
Feature activation-7.71e-02

Pos logit contributions
 volcan+0.193
degree+0.187
endo+0.181
 hex+0.176
tr+0.175
Neg logit contributions
 Hank-0.170
 Fiona-0.167
 Hannah-0.166
 Holly-0.164
gers-0.162
Token volcan
Token ID17516
Feature activation+3.61e-01

Pos logit contributions
 Hank+0.110
 Fiona+0.109
 Hannah+0.108
 Holly+0.108
gers+0.105
Neg logit contributions
degree-0.093
 volcan-0.091
endo-0.089
tr-0.084
pre-0.084
Tokenoes
Token ID3028
Feature activation+1.28e-01

Pos logit contributions
degree+0.557
 BO+0.548
endo+0.529
AAAA+0.528
 volcan+0.522
Neg logit contributions
gers-0.363
agging-0.317
 jackets-0.306
latable-0.302
 trainers-0.299
Token.
Token ID13
Feature activation-8.91e-03

Pos logit contributions
 volcan+0.158
degree+0.152
endo+0.148
 hex+0.141
tr+0.140
Neg logit contributions
 Hank-0.182
 Fiona-0.180
 Hannah-0.179
 Holly-0.178
gers-0.176
Token Vol
Token ID4709
Feature activation+2.03e-01

Pos logit contributions
 Fiona+0.014
gers+0.014
Matthew+0.014
 Hank+0.014
 therapist+0.014
Neg logit contributions
 volcan-0.010
endo-0.009
 hex-0.008
degree-0.008
 eru-0.008
Tokencan
Token ID5171
Feature activation+1.00e-01

Pos logit contributions
 volcan+0.237
degree+0.229
endo+0.220
 eru+0.208
 hex+0.207
Neg logit contributions
 Hank-0.305
 Hannah-0.299
gers-0.298
 Fiona-0.296
 Holly-0.296
Tokenoes
Token ID3028
Feature activation+1.28e-01

Pos logit contributions
 volcan+0.107
degree+0.101
endo+0.098
 hex+0.094
tr+0.093
Neg logit contributions
 Hank-0.162
 Fiona-0.159
 Hannah-0.159
 Holly-0.158
gers-0.156
Token.
Token ID13
Feature activation-6.94e-03

Pos logit contributions
 volcan+0.033
degree+0.032
endo+0.031
 hex+0.030
tr+0.029
Neg logit contributions
 Hank-0.043
 Fiona-0.042
 Hannah-0.042
 Holly-0.041
gers-0.041
Token You
Token ID921
Feature activation+4.35e-03

Pos logit contributions
 Hank+0.011
 Fiona+0.011
 Holly+0.011
gers+0.011
 Hannah+0.011
Neg logit contributions
 volcan-0.008
degree-0.007
endo-0.007
 Or-0.007
 hex-0.007
Token should
Token ID815
Feature activation-5.16e-02

Pos logit contributions
 volcan+0.007
degree+0.007
endo+0.007
aph+0.007
raper+0.007
Neg logit contributions
 Hank-0.004
 Hannah-0.004
 Fiona-0.004
 Holly-0.004
 Woo-0.004
Token not
Token ID407
Feature activation+6.17e-04

Pos logit contributions
gers+0.092
 Fiona+0.089
 therapist+0.088
 Holly+0.088
 Hank+0.087
Neg logit contributions
 volcan-0.047
 erupt-0.044
endo-0.044
ast-0.043
 exc-0.041
Token like
Token ID588
Feature activation-6.74e-02

Pos logit contributions
 volcan+0.001
degree+0.001
endo+0.001
 hex+0.001
 eru+0.001
Neg logit contributions
 Hank-0.001
 Hannah-0.001
 Fiona-0.001
 Woo-0.001
 Holly-0.001
Token volcan
Token ID17516
Feature activation+3.61e-01

Pos logit contributions
 Fiona+0.063
 Hank+0.059
 Hannah+0.058
 Woo+0.057
 Rover+0.056
Neg logit contributions
degree-0.112
tr-0.110
endo-0.104
pre-0.103
aph-0.102
Tokenoes
Token ID3028
Feature activation+1.27e-01

Pos logit contributions
degree+0.513
endo+0.501
 BO+0.484
 hex+0.467
BO+0.466
Neg logit contributions
gers-0.385
agging-0.346
 jackets-0.345
 Hank-0.344
over-0.330
Token.
Token ID13
Feature activation-6.92e-03

Pos logit contributions
 volcan+0.147
degree+0.141
endo+0.136
 hex+0.130
tr+0.128
Neg logit contributions
 Hank-0.192
 Fiona-0.190
 Hannah-0.189
 Holly-0.188
gers-0.186
Token You
Token ID921
Feature activation+4.15e-03

Pos logit contributions
 Hank+0.009
 Fiona+0.009
 Hannah+0.009
 Holly+0.009
gers+0.009
Neg logit contributions
 volcan-0.009
degree-0.009
endo-0.009
 hex-0.008
tr-0.008
Token should
Token ID815
Feature activation-5.25e-02

Pos logit contributions
 volcan+0.006
degree+0.006
endo+0.006
 hex+0.006
tr+0.006
Neg logit contributions
 Hank-0.004
 Fiona-0.004
 Hannah-0.004
 Holly-0.004
gers-0.004
Token like
Token ID588
Feature activation-6.80e-02

Pos logit contributions
gers+0.093
 Fiona+0.090
 therapist+0.088
 Holly+0.088
 Hank+0.088
Neg logit contributions
 volcan-0.049
endo-0.046
 erupt-0.045
ast-0.045
tr-0.043
Token They
Token ID1119
Feature activation-4.61e-02

Pos logit contributions
 Hank+0.007
 Fiona+0.007
 Hannah+0.007
 Holly+0.007
gers+0.007
Neg logit contributions
 volcan-0.008
degree-0.007
endo-0.007
 hex-0.007
tr-0.007
Token want
Token ID765
Feature activation-2.15e-03

Pos logit contributions
 Fiona+0.075
 Hank+0.074
 Hannah+0.073
 Holly+0.072
gers+0.071
Neg logit contributions
 volcan-0.048
degree-0.045
endo-0.043
 hex-0.043
tr-0.042
Token to
Token ID284
Feature activation-8.45e-03

Pos logit contributions
 Fiona+0.002
 Hank+0.002
 Hannah+0.002
 Holly+0.002
 Woo+0.002
Neg logit contributions
degree-0.004
endo-0.004
tr-0.004
ults-0.003
pre-0.003
Token write
Token ID3551
Feature activation+4.52e-02

Pos logit contributions
 Hank+0.010
 Hannah+0.010
 Fiona+0.010
 Holly+0.010
 Woo+0.009
Neg logit contributions
 volcan-0.012
degree-0.012
endo-0.012
pre-0.011
tr-0.011
Token about
Token ID546
Feature activation-7.71e-02

Pos logit contributions
 volcan+0.060
degree+0.059
endo+0.058
 hex+0.055
tr+0.054
Neg logit contributions
 Hank-0.059
 Fiona-0.058
gers-0.058
 Hannah-0.058
 Holly-0.058
Token volcan
Token ID17516
Feature activation+3.61e-01

Pos logit contributions
 Fiona+0.082
 Hank+0.077
 Rover+0.073
 Hannah+0.072
 Holly+0.071
Neg logit contributions
degree-0.125
tr-0.117
endo-0.116
aph-0.110
destruct-0.109
Tokenoes
Token ID3028
Feature activation+1.29e-01

Pos logit contributions
degree+0.503
endo+0.481
 BO+0.480
BO+0.458
 POP+0.454
Neg logit contributions
gers-0.423
 Fiona-0.368
 jackets-0.365
 Hank-0.361
 trainers-0.360
Token.
Token ID13
Feature activation-6.83e-03

Pos logit contributions
degree+0.165
 volcan+0.163
endo+0.160
 hex+0.148
tr+0.148
Neg logit contributions
 Hank-0.174
 Fiona-0.173
 Holly-0.172
 Hannah-0.171
gers-0.170
Token\n
Token ID198
Feature activation+4.34e-02

Pos logit contributions
 Hank+0.010
 Fiona+0.010
 Hannah+0.010
 Holly+0.009
gers+0.009
Neg logit contributions
 volcan-0.009
degree-0.008
endo-0.008
 hex-0.008
tr-0.008
Token\n
Token ID198
Feature activation+4.34e-02

Pos logit contributions
 volcan+0.049
degree+0.047
endo+0.046
 hex+0.044
tr+0.043
Neg logit contributions
 Hank-0.066
 Fiona-0.066
 Hannah-0.066
 Holly-0.065
gers-0.065
TokenThey
Token ID2990
Feature activation-7.31e-02

Pos logit contributions
 volcan+0.051
degree+0.049
endo+0.047
 Or+0.047
tr+0.046
Neg logit contributions
 Fiona-0.065
 Hank-0.065
 Holly-0.064
 Hannah-0.064
gers-0.062
Token little
Token ID1310
Feature activation+5.59e-02

Pos logit contributions
 Fiona+0.014
 Hannah+0.012
 Ned+0.012
 Holly+0.011
 Nora+0.011
Neg logit contributions
degree-0.035
tr-0.035
endo-0.033
pre-0.033
imet-0.033
Token girl
Token ID2576
Feature activation-2.04e-02

Pos logit contributions
degree+0.079
 volcan+0.078
endo+0.078
tr+0.073
 hex+0.071
Neg logit contributions
 Fiona-0.070
 Hannah-0.068
 Hank-0.068
 Holly-0.068
gers-0.066
Token all
Token ID477
Feature activation+8.90e-03

Pos logit contributions
 Hank+0.039
 Fiona+0.038
 Hannah+0.038
 Holly+0.038
gers+0.038
Neg logit contributions
 volcan-0.016
degree-0.015
endo-0.014
 hex-0.013
tr-0.012
Token about
Token ID546
Feature activation-6.99e-02

Pos logit contributions
 volcan+0.012
endo+0.012
degree+0.012
tr+0.012
 hex+0.012
Neg logit contributions
 Fiona-0.011
 Hannah-0.010
 Hank-0.010
 Holly-0.010
gers-0.010
Token how
Token ID703
Feature activation+2.29e-01

Pos logit contributions
 Fiona+0.064
 Rover+0.052
 Hank+0.051
 Nora+0.050
 Das+0.050
Neg logit contributions
tr-0.119
degree-0.116
 Or-0.109
pre-0.109
aph-0.108
Token volcan
Token ID17516
Feature activation+3.60e-01

Pos logit contributions
tr+0.337
endo+0.325
degree+0.317
 hex+0.297
 Or+0.296
Neg logit contributions
 Fiona-0.269
gers-0.260
 Hank-0.251
 Hannah-0.242
 Holly-0.239
Tokenoes
Token ID3028
Feature activation+1.24e-01

Pos logit contributions
degree+0.531
endo+0.520
 BO+0.512
BO+0.492
 Or+0.472
Neg logit contributions
gers-0.359
 Hank-0.337
 Fiona-0.335
 bundled-0.310
 Ned-0.307
Token work
Token ID670
Feature activation+8.97e-02

Pos logit contributions
degree+0.181
endo+0.174
tr+0.171
raper+0.163
aph+0.161
Neg logit contributions
 Fiona-0.140
 Hank-0.135
 Holly-0.133
gers-0.132
 Hannah-0.128
Token.
Token ID13
Feature activation-5.72e-03

Pos logit contributions
degree+0.121
 volcan+0.119
endo+0.117
tr+0.110
pre+0.109
Neg logit contributions
gers-0.113
 Hank-0.113
 Fiona-0.112
 Hannah-0.112
 Holly-0.111
Token She
Token ID1375
Feature activation-3.48e-02

Pos logit contributions
 Fiona+0.006
 Hannah+0.005
 Holly+0.005
 Hank+0.005
 Nora+0.005
Neg logit contributions
degree-0.010
endo-0.009
 volcan-0.009
pre-0.009
tr-0.009
Token taught
Token ID7817
Feature activation-6.68e-02

Pos logit contributions
 Hank+0.042
 Fiona+0.042
 Hannah+0.041
gers+0.041
 Holly+0.041
Neg logit contributions
degree-0.049
 volcan-0.049
endo-0.047
 hex-0.044
 eru-0.044
Token Cars
Token ID25277
Feature activation-6.52e-02

Pos logit contributions
 Hank+0.009
 Fiona+0.009
 Hannah+0.009
 Holly+0.009
gers+0.009
Neg logit contributions
 volcan-0.010
degree-0.010
endo-0.010
 hex-0.009
tr-0.009
Token didn
Token ID1422
Feature activation+5.94e-02

Pos logit contributions
 Fiona+0.097
 Hank+0.097
 Hannah+0.096
 Holly+0.095
gers+0.094
Neg logit contributions
 volcan-0.076
degree-0.073
endo-0.070
 hex-0.068
tr-0.067
Token't
Token ID470
Feature activation+1.37e-03

Pos logit contributions
 volcan+0.090
degree+0.088
endo+0.084
pre+0.082
 hex+0.081
Neg logit contributions
 Hank-0.067
 Fiona-0.066
 Hannah-0.065
 Holly-0.064
 Woo-0.064
Token fly
Token ID6129
Feature activation+8.89e-02

Pos logit contributions
 volcan+0.002
degree+0.002
endo+0.002
 hex+0.002
tr+0.002
Neg logit contributions
 Hank-0.002
 Fiona-0.002
 Hannah-0.002
 Holly-0.002
gers-0.002
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
degree+0.121
 volcan+0.121
endo+0.119
raper+0.111
 hex+0.110
Neg logit contributions
gers-0.113
 Hank-0.112
 Fiona-0.110
 Holly-0.107
 Woo-0.106
Token volcan
Token ID17516
Feature activation+3.59e-01

Pos logit contributions
 Hank+0.014
 Fiona+0.014
 Hannah+0.014
 Woo+0.014
 Holly+0.013
Neg logit contributions
degree-0.016
endo-0.016
 volcan-0.016
pre-0.015
tr-0.015
Tokenoes
Token ID3028
Feature activation+1.25e-01

Pos logit contributions
degree+0.498
 volcan+0.481
endo+0.474
 BO+0.466
 hex+0.464
Neg logit contributions
gers-0.407
 Hank-0.390
 Fiona-0.378
 trainers-0.369
latable-0.367
Token.
Token ID13
Feature activation-9.05e-03

Pos logit contributions
 volcan+0.150
degree+0.148
endo+0.143
 hex+0.134
tr+0.133
Neg logit contributions
 Hank-0.181
 Fiona-0.178
 Hannah-0.177
 Holly-0.176
gers-0.174
Token<|endoftext|>
Token ID50256
Feature activation-8.89e-02

Pos logit contributions
 Hank+0.011
 Fiona+0.011
 Hannah+0.011
 Holly+0.011
 Woo+0.011
Neg logit contributions
degree-0.013
 volcan-0.013
endo-0.012
tr-0.012
 hex-0.012
TokenL
Token ID43
Feature activation+3.05e-02

Pos logit contributions
 Hank+0.152
 Fiona+0.150
 Hannah+0.150
 Holly+0.148
gers+0.148
Neg logit contributions
 volcan-0.084
degree-0.079
endo-0.077
 hex-0.073
tr-0.071
Tokenily
Token ID813
Feature activation-3.21e-03

Pos logit contributions
 volcan+0.038
endo+0.036
degree+0.035
 hex+0.034
od+0.033
Neg logit contributions
 Hank-0.044
 Fiona-0.043
gers-0.042
 Hannah-0.042
 therapist-0.042
Token.
Token ID13
Feature activation-9.72e-03

Pos logit contributions
 volcan+0.171
degree+0.170
endo+0.165
 hex+0.156
pre+0.154
Neg logit contributions
 Hank-0.175
 Fiona-0.173
 Hannah-0.171
gers-0.171
 Holly-0.171
Token Lily
Token ID20037
Feature activation+1.53e-02

Pos logit contributions
gers+0.016
 Fiona+0.016
 Hank+0.016
 therapist+0.015
 Hannah+0.015
Neg logit contributions
 volcan-0.010
degree-0.009
endo-0.009
 hex-0.009
 eru-0.009
Token and
Token ID290
Feature activation+2.85e-03

Pos logit contributions
 volcan+0.018
degree+0.017
endo+0.017
 hex+0.016
tr+0.016
Neg logit contributions
 Hank-0.023
 Fiona-0.022
 Hannah-0.022
 Holly-0.022
gers-0.022
Token Ben
Token ID3932
Feature activation-5.88e-02

Pos logit contributions
 volcan+0.002
endo+0.002
aph+0.002
 ding+0.002
 hex+0.002
Neg logit contributions
gers-0.005
 Fiona-0.005
 therapist-0.005
 Holly-0.005
Matthew-0.005
Token think
Token ID892
Feature activation+1.03e-01

Pos logit contributions
 Hank+0.091
 Fiona+0.090
 Hannah+0.090
 Holly+0.089
gers+0.088
Neg logit contributions
 volcan-0.066
degree-0.063
endo-0.061
 hex-0.058
 eru-0.058
Token volcan
Token ID17516
Feature activation+3.59e-01

Pos logit contributions
 volcan+0.138
degree+0.135
endo+0.132
 hex+0.126
tr+0.125
Neg logit contributions
 Hank-0.134
 Fiona-0.133
 Hannah-0.132
 Holly-0.131
gers-0.129
Tokenoes
Token ID3028
Feature activation+1.27e-01

Pos logit contributions
degree+0.474
endo+0.454
 volcan+0.450
 BO+0.446
 hex+0.426
Neg logit contributions
gers-0.439
 Fiona-0.420
 Hank-0.409
 Holly-0.385
 Hannah-0.385
Token are
Token ID389
Feature activation+3.61e-03

Pos logit contributions
degree+0.171
 volcan+0.170
endo+0.165
 hex+0.157
tr+0.156
Neg logit contributions
 Hank-0.158
 Fiona-0.157
 Hannah-0.154
gers-0.154
 Holly-0.153
Token cool
Token ID3608
Feature activation+1.27e-02

Pos logit contributions
degree+0.004
endo+0.004
 volcan+0.004
tr+0.004
pre+0.004
Neg logit contributions
 Hank-0.005
 Fiona-0.005
 Holly-0.005
 Hannah-0.005
gers-0.005
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 volcan+0.015
degree+0.014
endo+0.013
 hex+0.013
 eru+0.013
Neg logit contributions
 Hank-0.019
 Fiona-0.019
 Hannah-0.019
 Holly-0.019
gers-0.018
Token They
Token ID1119
Feature activation-5.07e-02

Pos logit contributions
gers+0.018
 Hank+0.017
 therapist+0.017
 Fiona+0.017
 trainers+0.017
Neg logit contributions
 volcan-0.011
degree-0.010
 Or-0.010
endo-0.010
 hex-0.010
Token They
Token ID1119
Feature activation-5.07e-02

Pos logit contributions
gers+0.018
 Hank+0.017
 therapist+0.017
 Fiona+0.017
 trainers+0.017
Neg logit contributions
 volcan-0.011
degree-0.010
 Or-0.010
endo-0.010
 hex-0.010
Token want
Token ID765
Feature activation-4.51e-03

Pos logit contributions
 Hank+0.075
 Fiona+0.074
 Hannah+0.074
 Holly+0.073
gers+0.073
Neg logit contributions
 volcan-0.060
degree-0.057
endo-0.056
 hex-0.053
tr-0.052
Token to
Token ID284
Feature activation-1.22e-02

Pos logit contributions
 Fiona+0.005
 Hank+0.004
 Hannah+0.004
 Holly+0.004
 Rover+0.004
Neg logit contributions
degree-0.007
endo-0.007
tr-0.007
pre-0.007
aph-0.007
Token write
Token ID3551
Feature activation+4.35e-02

Pos logit contributions
 Hank+0.014
 Fiona+0.014
 Hannah+0.014
 Holly+0.014
gers+0.013
Neg logit contributions
degree-0.018
endo-0.017
 volcan-0.017
pre-0.017
tr-0.016
Token about
Token ID546
Feature activation-8.08e-02

Pos logit contributions
 volcan+0.056
degree+0.054
endo+0.052
 hex+0.050
tr+0.049
Neg logit contributions
 Hank-0.060
 Fiona-0.060
 Hannah-0.059
 Holly-0.059
gers-0.058
Token volcan
Token ID17516
Feature activation+3.58e-01

Pos logit contributions
 Fiona+0.089
 Hank+0.084
 Rover+0.083
 Hannah+0.080
 Holly+0.078
Neg logit contributions
degree-0.125
tr-0.121
endo-0.118
pre-0.116
aph-0.114
Tokenoes
Token ID3028
Feature activation+1.26e-01

Pos logit contributions
degree+0.517
 BO+0.506
endo+0.497
AAAA+0.482
 POP+0.478
Neg logit contributions
gers-0.402
 jackets-0.342
 trainers-0.339
over-0.338
 Fiona-0.335
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 volcan+0.158
degree+0.157
endo+0.152
 hex+0.143
tr+0.143
Neg logit contributions
 Hank-0.174
 Fiona-0.172
 Hannah-0.171
 Holly-0.170
gers-0.168
Token\n
Token ID198
Feature activation+4.10e-02

Pos logit contributions
gers+0.021
 Hank+0.020
 therapist+0.020
 Fiona+0.020
 trainers+0.020
Neg logit contributions
 volcan-0.010
degree-0.009
 Or-0.009
endo-0.009
 hex-0.008
Token\n
Token ID198
Feature activation+4.09e-02

Pos logit contributions
 volcan+0.028
degree+0.025
endo+0.024
 eru+0.023
 Or+0.022
Neg logit contributions
gers-0.080
 Hank-0.079
 trainers-0.079
 Hannah-0.079
 therapist-0.078
TokenThey
Token ID2990
Feature activation-7.74e-02

Pos logit contributions
BO+0.027
 Or+0.023
degree+0.022
Or+0.022
CR+0.022
Neg logit contributions
 Fiona-0.080
 Hank-0.078
 Buster-0.077
 Hannah-0.076
 Riley-0.076
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 Hank+0.137
 Hannah+0.133
 Fiona+0.131
 Holly+0.131
 Woo+0.131
Neg logit contributions
 volcan-0.101
degree-0.089
endo-0.086
 hex-0.085
 eru-0.083
Token luxurious
Token ID35985
Feature activation-1.06e-01

Pos logit contributions
 Hank+0.024
 Fiona+0.023
 Hannah+0.023
 Holly+0.023
gers+0.023
Neg logit contributions
 volcan-0.019
degree-0.018
endo-0.018
 hex-0.017
tr-0.017
Token".
Token ID1911
Feature activation-5.80e-02

Pos logit contributions
 Hank+0.170
 Woo+0.167
Matthew+0.166
 Hannah+0.164
 Holly+0.163
Neg logit contributions
 volcan-0.115
degree-0.101
 eru-0.094
 hex-0.094
tr-0.094
Token Kim
Token ID6502
Feature activation+6.39e-02

Pos logit contributions
 Hank+0.087
gers+0.086
 Fiona+0.086
 Hannah+0.085
 Holly+0.085
Neg logit contributions
 volcan-0.064
degree-0.062
endo-0.060
 hex-0.057
tr-0.056
Token and
Token ID290
Feature activation-9.29e-03

Pos logit contributions
 volcan+0.071
degree+0.069
endo+0.069
tr+0.066
 hex+0.064
Neg logit contributions
gers-0.095
 Hank-0.095
 Fiona-0.094
 Holly-0.092
 Woo-0.092
Token Hank
Token ID24386
Feature activation-3.64e-01

Pos logit contributions
 Hank+0.013
 Fiona+0.013
 Hannah+0.013
 Holly+0.013
gers+0.012
Neg logit contributions
 volcan-0.012
degree-0.012
endo-0.011
 hex-0.011
tr-0.011
Token looked
Token ID3114
Feature activation+7.06e-02

Pos logit contributions
gers+0.571
 Fiona+0.570
 Hank+0.567
 Holly+0.561
 Woo+0.557
Neg logit contributions
 volcan-0.355
degree-0.350
endo-0.337
tr-0.327
 eru-0.317
Token around
Token ID1088
Feature activation-7.01e-02

Pos logit contributions
 volcan+0.060
degree+0.055
endo+0.051
 eru+0.050
 hex+0.046
Neg logit contributions
 Hank-0.126
 Fiona-0.126
gers-0.125
 therapist-0.125
 Woo-0.124
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
gers+0.131
scar+0.131
 therapist+0.129
liv+0.128
 Woo+0.128
Neg logit contributions
 volcan-0.057
 eru-0.044
 BO-0.041
endo-0.040
degree-0.040
Token park
Token ID3952
Feature activation-1.92e-02

Pos logit contributions
 Hank+0.017
 Fiona+0.017
 Hannah+0.017
 Holly+0.017
gers+0.017
Neg logit contributions
 volcan-0.015
degree-0.014
endo-0.013
tr-0.013
 hex-0.013
Token,
Token ID11
Feature activation-8.07e-03

Pos logit contributions
 Hank+0.035
 Woo+0.035
 Fiona+0.034
 Hannah+0.034
 Holly+0.034
Neg logit contributions
 volcan-0.016
tr-0.013
 erupt-0.013
 structure-0.013
 towers-0.012
Token two
Token ID734
Feature activation-4.87e-02

Pos logit contributions
 Hank+0.013
 Fiona+0.013
 Hannah+0.013
 Holly+0.013
gers+0.012
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.013
 hex-0.013
tr-0.013
Token friends
Token ID2460
Feature activation-5.53e-02

Pos logit contributions
 Hank+0.075
 therapist+0.074
 Fiona+0.074
 Hannah+0.073
gers+0.073
Neg logit contributions
 volcan-0.059
 eru-0.051
 hex-0.048
endo-0.048
 bricks-0.047
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
gers+0.103
 therapist+0.102
 trainers+0.099
 Hank+0.099
 Holly+0.098
Neg logit contributions
 volcan-0.048
od-0.039
 eru-0.038
degree-0.037
 bricks-0.036
Token Jack
Token ID3619
Feature activation-2.08e-02

Pos logit contributions
 Hank+0.027
gers+0.027
 Fiona+0.026
 Hannah+0.026
 Holly+0.026
Neg logit contributions
 volcan-0.019
degree-0.018
endo-0.017
 hex-0.017
tr-0.016
Token and
Token ID290
Feature activation+7.09e-03

Pos logit contributions
 Hank+0.040
gers+0.039
 Fiona+0.039
 Holly+0.038
 Hannah+0.038
Neg logit contributions
 volcan-0.016
endo-0.014
degree-0.013
ast-0.013
 hex-0.013
Token Hank
Token ID24386
Feature activation-3.63e-01

Pos logit contributions
 volcan+0.008
degree+0.007
endo+0.007
 hex+0.007
aph+0.007
Neg logit contributions
gers-0.011
 trainers-0.011
 therapist-0.010
 Hank-0.010
 Fiona-0.010
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
 Fiona+0.606
 Hank+0.601
 Holly+0.596
 Hannah+0.592
gers+0.589
Neg logit contributions
 volcan-0.348
degree-0.327
pre-0.310
 hex-0.310
endo-0.309
Token They
Token ID1119
Feature activation-4.59e-02

Pos logit contributions
 therapist+0.017
gers+0.016
 Fiona+0.015
 trainers+0.015
 shepherd+0.015
Neg logit contributions
 volcan-0.007
endo-0.006
 eru-0.006
 hex-0.006
 BO-0.006
Token loved
Token ID6151
Feature activation-4.94e-02

Pos logit contributions
 Fiona+0.072
 Hank+0.072
 Hannah+0.072
 therapist+0.071
gers+0.071
Neg logit contributions
 volcan-0.047
degree-0.044
endo-0.043
 hex-0.042
 eru-0.041
Token to
Token ID284
Feature activation-1.19e-02

Pos logit contributions
 Hank+0.060
 Fiona+0.058
 Hannah+0.058
 Holly+0.057
 Woo+0.056
Neg logit contributions
degree-0.073
 volcan-0.070
tr-0.069
endo-0.069
 hex-0.067
Token play
Token ID711
Feature activation-3.76e-02

Pos logit contributions
 Hank+0.017
 Hannah+0.017
 Fiona+0.017
 Holly+0.017
gers+0.017
Neg logit contributions
 volcan-0.015
degree-0.014
endo-0.013
 hex-0.013
tr-0.013
Token Hannah
Token ID26013
Feature activation-3.61e-01

Pos logit contributions
latable+0.014
gers+0.014
 shady+0.012
 trainers+0.012
scar+0.012
Neg logit contributions
 how-0.018
 BO-0.018
 Ap-0.018
 POP-0.018
aph-0.017
Token's
Token ID338
Feature activation-1.50e-02

Pos logit contributions
gers+0.586
 Hank+0.576
 trainers+0.572
 Fiona+0.571
 Holly+0.562
Neg logit contributions
 volcan-0.344
degree-0.331
 hex-0.319
od-0.314
tr-0.312
Token mom
Token ID1995
Feature activation+1.86e-03

Pos logit contributions
 Fiona+0.025
 Hank+0.025
 Holly+0.024
gers+0.024
 Hannah+0.024
Neg logit contributions
 volcan-0.015
degree-0.014
tr-0.013
endo-0.013
 hex-0.013
Token.
Token ID13
Feature activation-6.39e-03

Pos logit contributions
 volcan+0.002
degree+0.002
tr+0.002
 erupt+0.002
od+0.002
Neg logit contributions
gers-0.003
 Hank-0.003
 Holly-0.003
 Fiona-0.003
 Woo-0.003
Token But
Token ID887
Feature activation+3.90e-02

Pos logit contributions
gers+0.013
 trainers+0.012
latable+0.012
scar+0.011
Matthew+0.011
Neg logit contributions
 Or-0.004
 Ap-0.004
 BO-0.004
 POP-0.003
degree-0.003
Token Hannah
Token ID26013
Feature activation-3.62e-01

Pos logit contributions
 how+0.041
 when+0.034
 as+0.032
 POP+0.029
 ding+0.029
Neg logit contributions
latable-0.064
gers-0.062
 jerseys-0.059
 trainers-0.057
aders-0.055
Token was
Token ID373
Feature activation-3.30e-02

Pos logit contributions
gers+0.599
 Fiona+0.583
 Hank+0.576
 Woo+0.572
 trainers+0.569
Neg logit contributions
 volcan-0.317
pre-0.293
tr-0.292
degree-0.291
endo-0.289
Token not
Token ID407
Feature activation-8.48e-04

Pos logit contributions
gers+0.063
 Fiona+0.058
 Hank+0.057
 Buster+0.057
 Woo+0.057
Neg logit contributions
 volcan-0.027
 erupt-0.026
 clim-0.025
 painting-0.023
raper-0.022
Token worried
Token ID7960
Feature activation-5.25e-02

Pos logit contributions
 Hank+0.001
 Fiona+0.001
 Hannah+0.001
 Holly+0.001
gers+0.001
Neg logit contributions
 volcan-0.001
degree-0.001
endo-0.001
 hex-0.001
 eru-0.001
Token.
Token ID13
Feature activation-6.56e-03

Pos logit contributions
 Hank+0.082
 Fiona+0.082
gers+0.082
 Holly+0.080
 Hannah+0.080
Neg logit contributions
 volcan-0.057
degree-0.053
endo-0.050
 eru-0.049
 hex-0.048
Token She
Token ID1375
Feature activation-3.67e-02

Pos logit contributions
gers+0.011
 trainers+0.010
 Hank+0.010
Matthew+0.010
 Fiona+0.010
Neg logit contributions
 Or-0.006
endo-0.005
 volcan-0.005
degree-0.005
tr-0.005
Token was
Token ID373
Feature activation-3.93e-02

Pos logit contributions
gers+0.114
 Hank+0.114
 Fiona+0.113
 Hannah+0.112
 Holly+0.112
Neg logit contributions
 volcan-0.099
degree-0.092
endo-0.091
 hex-0.087
 eru-0.087
Token a
Token ID257
Feature activation-1.10e-02

Pos logit contributions
 Hank+0.060
 Fiona+0.059
 Hannah+0.059
gers+0.058
 Holly+0.058
Neg logit contributions
 volcan-0.045
degree-0.043
endo-0.042
 hex-0.039
 eru-0.039
Token little
Token ID1310
Feature activation+5.19e-02

Pos logit contributions
 Hank+0.016
 Hannah+0.015
 Holly+0.015
 Fiona+0.015
gers+0.015
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.013
tr-0.012
 hex-0.012
Token girl
Token ID2576
Feature activation-2.53e-02

Pos logit contributions
 volcan+0.059
degree+0.054
 hex+0.053
pre+0.053
endo+0.051
Neg logit contributions
 Hank-0.077
 Fiona-0.076
 Hannah-0.075
 Holly-0.075
gers-0.075
Token named
Token ID3706
Feature activation-3.75e-02

Pos logit contributions
 Hank+0.031
 Fiona+0.031
 Hannah+0.030
 Holly+0.030
gers+0.029
Neg logit contributions
 volcan-0.036
degree-0.036
endo-0.035
 hex-0.033
 eru-0.033
Token Hannah
Token ID26013
Feature activation-3.61e-01

Pos logit contributions
 Fiona+0.053
 Hannah+0.053
 Holly+0.052
 Hank+0.051
 Woo+0.050
Neg logit contributions
 volcan-0.049
degree-0.048
endo-0.046
 hex-0.045
pre-0.044
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
 Hank+0.481
 Hannah+0.478
 Fiona+0.477
 Holly+0.470
gers+0.463
Neg logit contributions
 volcan-0.478
degree-0.466
endo-0.460
 hex-0.434
 eru-0.425
Token She
Token ID1375
Feature activation-4.19e-02

Pos logit contributions
scar+0.034
achers+0.034
guards+0.034
cher+0.034
Matthew+0.033
Neg logit contributions
 Pretty-0.004
 At-0.002
 Or-0.002
 Ap-0.001
 Ding-0.001
Token loved
Token ID6151
Feature activation-5.20e-02

Pos logit contributions
 Hank+0.056
 Fiona+0.055
 Hannah+0.055
 Holly+0.054
gers+0.054
Neg logit contributions
 volcan-0.056
degree-0.054
endo-0.053
 hex-0.051
tr-0.051
Token to
Token ID284
Feature activation-1.53e-02

Pos logit contributions
 Woo+0.054
 Hannah+0.053
 Fiona+0.052
 Hank+0.052
 Holly+0.051
Neg logit contributions
degree-0.086
endo-0.081
 volcan-0.081
tr-0.080
aph-0.080
Token play
Token ID711
Feature activation-4.15e-02

Pos logit contributions
 Hank+0.020
 Fiona+0.020
 Hannah+0.020
 Holly+0.020
gers+0.019
Neg logit contributions
 volcan-0.021
degree-0.020
endo-0.020
 hex-0.019
tr-0.019
Token"
Token ID1
Feature activation-3.78e-02

Pos logit contributions
BO+0.033
Sur+0.033
CR+0.030
Pretty+0.030
Ps+0.029
Neg logit contributions
smanship-0.077
 avail-0.076
gers-0.075
latable-0.075
 Fiona-0.072
TokenOh
Token ID5812
Feature activation+7.79e-02

Pos logit contributions
gers+0.060
 Fiona+0.060
 Hank+0.059
 Holly+0.058
 Hannah+0.058
Neg logit contributions
 volcan-0.039
degree-0.038
endo-0.037
tr-0.035
 hex-0.034
Token no
Token ID645
Feature activation-5.89e-02

Pos logit contributions
degree+0.075
 volcan+0.074
 clim+0.069
 hex+0.068
 how+0.068
Neg logit contributions
gers-0.127
 trainers-0.118
Matthew-0.116
 therapist-0.114
 Hank-0.114
Token!"
Token ID2474
Feature activation-3.21e-02

Pos logit contributions
gers+0.084
 Holly+0.084
 Fiona+0.084
 Hannah+0.083
 Hank+0.083
Neg logit contributions
 volcan-0.071
degree-0.067
 hex-0.065
 eru-0.063
endo-0.063
Token exclaimed
Token ID33552
Feature activation-1.34e-02

Pos logit contributions
latable+0.042
gers+0.040
 trainers+0.038
agging+0.037
scar+0.037
Neg logit contributions
 BO-0.037
 POP-0.036
degree-0.035
 Ding-0.035
 pressed-0.034
Token Hannah
Token ID26013
Feature activation-3.61e-01

Pos logit contributions
latable+0.014
gers+0.014
 shady+0.012
 trainers+0.012
scar+0.012
Neg logit contributions
 how-0.018
 BO-0.018
 Ap-0.018
 POP-0.018
aph-0.017
Token's
Token ID338
Feature activation-1.50e-02

Pos logit contributions
gers+0.586
 Hank+0.576
 trainers+0.572
 Fiona+0.571
 Holly+0.562
Neg logit contributions
 volcan-0.344
degree-0.331
 hex-0.319
od-0.314
tr-0.312
Token mom
Token ID1995
Feature activation+1.86e-03

Pos logit contributions
 Fiona+0.025
 Hank+0.025
 Holly+0.024
gers+0.024
 Hannah+0.024
Neg logit contributions
 volcan-0.015
degree-0.014
tr-0.013
endo-0.013
 hex-0.013
Token.
Token ID13
Feature activation-6.39e-03

Pos logit contributions
 volcan+0.002
degree+0.002
tr+0.002
 erupt+0.002
od+0.002
Neg logit contributions
gers-0.003
 Hank-0.003
 Holly-0.003
 Fiona-0.003
 Woo-0.003
Token But
Token ID887
Feature activation+3.90e-02

Pos logit contributions
gers+0.013
 trainers+0.012
latable+0.012
scar+0.011
Matthew+0.011
Neg logit contributions
 Or-0.004
 Ap-0.004
 BO-0.004
 POP-0.003
degree-0.003
Token Hannah
Token ID26013
Feature activation-3.62e-01

Pos logit contributions
 how+0.041
 when+0.034
 as+0.032
 POP+0.029
 ding+0.029
Neg logit contributions
latable-0.064
gers-0.062
 jerseys-0.059
 trainers-0.057
aders-0.055
Token\n
Token ID198
Feature activation+3.68e-02

Pos logit contributions
gers+0.023
Matthew+0.021
 trainers+0.021
scar+0.021
 Hank+0.021
Neg logit contributions
 Or-0.007
endo-0.007
 Pretty-0.007
 Pret-0.007
 volcan-0.007
Token\n
Token ID198
Feature activation+3.65e-02

Pos logit contributions
 volcan+0.024
degree+0.023
endo+0.022
 eru+0.019
 Or+0.019
Neg logit contributions
 Hank-0.071
gers-0.070
 trainers-0.069
 Hannah-0.068
 therapist-0.068
TokenOne
Token ID3198
Feature activation-5.46e-02

Pos logit contributions
Pretty+0.016
ometime+0.016
Ps+0.016
Down+0.016
degree+0.015
Neg logit contributions
 afloat-0.071
 Oscar-0.071
 Randy-0.070
 Fiona-0.069
 Ned-0.069
Token day
Token ID1110
Feature activation+5.68e-02

Pos logit contributions
 Fiona+0.067
gers+0.067
 Hank+0.067
 Hannah+0.067
 Holly+0.065
Neg logit contributions
 volcan-0.076
endo-0.075
degree-0.074
raper-0.070
 hex-0.069
Token,
Token ID11
Feature activation-2.08e-02

Pos logit contributions
degree+0.033
 volcan+0.033
 eru+0.031
 BO+0.031
 clim+0.030
Neg logit contributions
gers-0.102
 trainers-0.100
 Fiona-0.099
 Hank-0.098
latable-0.098
Token Hannah
Token ID26013
Feature activation-3.61e-01

Pos logit contributions
gers+0.037
latable+0.037
 trainers+0.036
scar+0.036
eming+0.036
Neg logit contributions
 how-0.010
degree-0.010
 POP-0.010
 Ding-0.009
 Pretty-0.009
Token went
Token ID1816
Feature activation+1.18e-01

Pos logit contributions
 Fiona+0.618
 Hank+0.618
 Holly+0.609
 Hannah+0.603
gers+0.599
Neg logit contributions
 volcan-0.332
degree-0.314
endo-0.304
 hex-0.287
pre-0.286
Token to
Token ID284
Feature activation-1.41e-02

Pos logit contributions
degree+0.095
 volcan+0.094
endo+0.088
tr+0.087
 hex+0.084
Neg logit contributions
 Hank-0.214
 Hannah-0.204
 Fiona-0.203
gers-0.202
 Holly-0.199
Token the
Token ID262
Feature activation-1.97e-02

Pos logit contributions
 Hannah+0.017
 Fiona+0.017
 Hank+0.017
 Holly+0.017
 Woo+0.016
Neg logit contributions
 volcan-0.021
degree-0.021
endo-0.020
pre-0.020
tr-0.020
Token park
Token ID3952
Feature activation-2.50e-02

Pos logit contributions
 Hank+0.027
 Fiona+0.027
 Hannah+0.026
 Holly+0.026
gers+0.026
Neg logit contributions
 volcan-0.026
degree-0.025
endo-0.024
 hex-0.023
tr-0.023
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 Hank+0.040
 Hannah+0.039
 Fiona+0.039
 Holly+0.039
gers+0.037
Neg logit contributions
 volcan-0.027
degree-0.026
endo-0.024
tr-0.024
 hex-0.024
Token walked
Token ID6807
Feature activation+2.48e-03

Pos logit contributions
 volcan+0.020
degree+0.020
endo+0.019
 hex+0.018
tr+0.018
Neg logit contributions
 Hank-0.021
 Fiona-0.021
 Hannah-0.021
 Holly-0.021
gers-0.020
Token down
Token ID866
Feature activation+8.17e-02

Pos logit contributions
aph+0.002
 volcan+0.002
degree+0.001
 structure+0.001
 BO+0.001
Neg logit contributions
scar-0.005
 Hank-0.004
liv-0.004
 therapist-0.004
 Hannah-0.004
Token the
Token ID262
Feature activation-1.98e-02

Pos logit contributions
 volcan+0.106
degree+0.102
endo+0.099
 hex+0.095
tr+0.094
Neg logit contributions
 Hank-0.112
 Fiona-0.111
 Hannah-0.110
 Holly-0.109
gers-0.108
Token street
Token ID4675
Feature activation-1.10e-02

Pos logit contributions
 Hank+0.031
 Fiona+0.031
 Hannah+0.031
 Holly+0.030
gers+0.029
Neg logit contributions
 volcan-0.020
degree-0.019
endo-0.019
tr-0.019
 hex-0.018
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
 Hank+0.018
 Fiona+0.018
 Holly+0.018
 Hannah+0.018
gers+0.018
Neg logit contributions
 volcan-0.011
degree-0.009
 hex-0.009
tr-0.009
 clim-0.009
Token Hannah
Token ID26013
Feature activation-3.61e-01

Pos logit contributions
scar+0.030
latable+0.029
eming+0.029
agging+0.029
 ult+0.029
Neg logit contributions
 Ding-0.016
 volcan-0.015
 POP-0.015
 Pretty-0.015
 BO-0.014
Token saw
Token ID2497
Feature activation-3.36e-02

Pos logit contributions
gers+0.569
 Hank+0.563
 Fiona+0.562
 Holly+0.552
 Hannah+0.539
Neg logit contributions
 volcan-0.363
degree-0.323
endo-0.317
tr-0.316
 pressed-0.313
Token something
Token ID1223
Feature activation-5.71e-02

Pos logit contributions
gers+0.064
 Hank+0.061
uster+0.061
 Holly+0.060
 Fiona+0.060
Neg logit contributions
 volcan-0.025
 how-0.020
 lightning-0.019
 bricks-0.018
 towers-0.017
Token that
Token ID326
Feature activation-2.18e-02

Pos logit contributions
 Woo+0.097
gers+0.096
 Hank+0.096
 Hannah+0.094
 Fiona+0.094
Neg logit contributions
 volcan-0.051
 pr-0.044
degree-0.044
tr-0.043
 hex-0.043
Token made
Token ID925
Feature activation-9.30e-02

Pos logit contributions
 Hank+0.032
 Fiona+0.031
gers+0.031
 Holly+0.031
 Hannah+0.031
Neg logit contributions
 volcan-0.026
degree-0.024
tr-0.023
endo-0.023
 hex-0.022
Token her
Token ID607
Feature activation+2.53e-02

Pos logit contributions
gers+0.158
 Hank+0.154
Matthew+0.151
 therapist+0.150
 trainers+0.150
Neg logit contributions
 volcan-0.088
endo-0.073
degree-0.073
 eru-0.072
od-0.070
Token "
Token ID366
Feature activation+1.72e-02

Pos logit contributions
 Hannah+0.012
 Hank+0.011
 Holly+0.011
 Woo+0.011
 Fiona+0.011
Neg logit contributions
 volcan-0.019
degree-0.019
endo-0.018
destruct-0.018
 hex-0.018
TokenThanks
Token ID9690
Feature activation-6.58e-02

Pos logit contributions
 volcan+0.024
degree+0.023
endo+0.022
 hex+0.021
 eru+0.021
Neg logit contributions
 Hank-0.022
 Hannah-0.022
 Fiona-0.022
 Holly-0.022
gers-0.021
Token Dad
Token ID17415
Feature activation+3.42e-02

Pos logit contributions
gers+0.102
 therapist+0.098
 trainers+0.097
 bundled+0.097
Matthew+0.096
Neg logit contributions
 volcan-0.066
degree-0.063
endo-0.062
aph-0.062
tr-0.061
Token!"
Token ID2474
Feature activation-3.46e-02

Pos logit contributions
 volcan+0.033
degree+0.031
 hex+0.030
od+0.030
endo+0.029
Neg logit contributions
 Fiona-0.056
 Hank-0.055
 trainers-0.055
gers-0.054
 Holly-0.054
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 Hank+0.049
 Fiona+0.049
 Hannah+0.049
 Holly+0.048
gers+0.047
Neg logit contributions
 volcan-0.043
degree-0.042
endo-0.040
 hex-0.039
tr-0.038
Token Hank
Token ID24386
Feature activation-3.60e-01

Pos logit contributions
 volcan+0.012
degree+0.011
endo+0.011
 hex+0.011
 eru+0.010
Neg logit contributions
 Hank-0.015
 Fiona-0.015
gers-0.015
 Hannah-0.015
 Holly-0.015
Token was
Token ID373
Feature activation-3.33e-02

Pos logit contributions
 Fiona+0.548
 Hank+0.543
gers+0.539
 Holly+0.534
 Hannah+0.529
Neg logit contributions
degree-0.383
 volcan-0.383
endo-0.376
 hex-0.371
tr-0.365
Token happy
Token ID3772
Feature activation+5.20e-02

Pos logit contributions
gers+0.061
 Hank+0.059
 Fiona+0.059
 Holly+0.057
 trainers+0.057
Neg logit contributions
 volcan-0.028
degree-0.027
endo-0.026
 hex-0.025
 eru-0.024
Token he
Token ID339
Feature activation-6.90e-03

Pos logit contributions
 volcan+0.060
degree+0.056
endo+0.054
 hex+0.053
 eru+0.052
Neg logit contributions
gers-0.078
 Hank-0.077
 Fiona-0.077
 Hannah-0.076
 Holly-0.075
Token had
Token ID550
Feature activation-8.33e-02

Pos logit contributions
 Hannah+0.008
 Hank+0.008
gers+0.008
 Holly+0.007
 Fiona+0.007
Neg logit contributions
 volcan-0.011
endo-0.010
degree-0.010
 eru-0.010
raper-0.010
Token been
Token ID587
Feature activation-1.09e-02

Pos logit contributions
 Hank+0.114
 Hannah+0.113
 Fiona+0.111
 Holly+0.110
gers+0.107
Neg logit contributions
 volcan-0.110
degree-0.104
endo-0.104
tr-0.099
 hex-0.098
Token little
Token ID1310
Feature activation+5.20e-02

Pos logit contributions
 Hank+0.015
 Hannah+0.014
 Holly+0.014
 Fiona+0.014
 Woo+0.014
Neg logit contributions
degree-0.014
 volcan-0.014
endo-0.014
tr-0.013
destruct-0.013
Token girl
Token ID2576
Feature activation-2.48e-02

Pos logit contributions
 volcan+0.064
degree+0.061
endo+0.059
 hex+0.058
tr+0.057
Neg logit contributions
 Hank-0.073
 Fiona-0.073
 Hannah-0.072
 Holly-0.071
gers-0.071
Token called
Token ID1444
Feature activation-5.52e-02

Pos logit contributions
 Hank+0.030
 Fiona+0.030
 Hannah+0.030
 Holly+0.029
gers+0.029
Neg logit contributions
 volcan-0.036
degree-0.035
endo-0.034
 hex-0.033
 eru-0.032
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
 Fiona+0.080
 Hannah+0.080
 Holly+0.079
 Hank+0.079
gers+0.075
Neg logit contributions
 volcan-0.068
degree-0.066
endo-0.064
 hex-0.060
 eru-0.060
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
 Hank+0.523
 Fiona+0.516
 Hannah+0.511
 Holly+0.509
gers+0.504
Neg logit contributions
 volcan-0.438
degree-0.421
endo-0.407
 hex-0.391
tr-0.389
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
scar+0.041
 afloat+0.040
rim+0.040
Patrick+0.040
achers+0.040
Neg logit contributions
 Pretty-0.005
 At-0.003
 Ap-0.002
 Or-0.002
 Ding-0.002
Token was
Token ID373
Feature activation-4.24e-02

Pos logit contributions
 Hank+0.561
 Fiona+0.554
 Holly+0.549
 trainers+0.539
 Hannah+0.539
Neg logit contributions
 volcan-0.401
degree-0.366
endo-0.366
tr-0.365
 eru-0.348
Token very
Token ID845
Feature activation+7.51e-02

Pos logit contributions
 Hank+0.066
 Fiona+0.065
 Hannah+0.065
 Holly+0.064
gers+0.063
Neg logit contributions
 volcan-0.047
degree-0.046
endo-0.044
 hex-0.042
tr-0.041
Token naughty
Token ID45128
Feature activation-4.42e-03

Pos logit contributions
 volcan+0.092
degree+0.089
endo+0.086
 hex+0.083
 eru+0.081
Neg logit contributions
 Hank-0.107
 Fiona-0.106
 Hannah-0.104
gers-0.104
 Holly-0.103
Token and
Token ID290
Feature activation-4.25e-04

Pos logit contributions
 Hank+0.008
 Fiona+0.008
 Holly+0.008
 trainers+0.008
gers+0.008
Neg logit contributions
 volcan-0.004
degree-0.003
 clim-0.003
 eru-0.003
 hex-0.003
Token often
Token ID1690
Feature activation-6.48e-02

Pos logit contributions
 Hank+0.001
 Fiona+0.001
Matthew+0.001
 trainers+0.001
 Holly+0.001
Neg logit contributions
 volcan-0.000
degree-0.000
 exc-0.000
endo-0.000
tr-0.000
TokenBut
Token ID1537
Feature activation-2.94e-02

Pos logit contributions
 volcan+0.023
degree+0.023
endo+0.022
tr+0.021
 hex+0.021
Neg logit contributions
gers-0.013
 Hank-0.013
 Woo-0.013
 therapist-0.012
 Holly-0.012
Token we
Token ID356
Feature activation+5.23e-02

Pos logit contributions
 Hank+0.042
 Fiona+0.042
 Hannah+0.041
 Holly+0.041
gers+0.040
Neg logit contributions
 volcan-0.036
degree-0.035
endo-0.034
 hex-0.032
tr-0.032
Token are
Token ID389
Feature activation+3.16e-03

Pos logit contributions
 volcan+0.053
 eru+0.046
 erupt+0.046
 hex+0.044
 exc+0.044
Neg logit contributions
 Fiona-0.087
 Hannah-0.087
 Holly-0.086
 therapist-0.085
 Hank-0.085
Token good
Token ID922
Feature activation+6.29e-03

Pos logit contributions
 volcan+0.004
endo+0.003
degree+0.003
 hex+0.003
 erupt+0.003
Neg logit contributions
 Hank-0.005
gers-0.005
 Fiona-0.005
 Hannah-0.005
 therapist-0.005
Token jog
Token ID48342
Feature activation-1.78e-01

Pos logit contributions
 volcan+0.007
degree+0.006
endo+0.006
 eru+0.006
 hex+0.006
Neg logit contributions
 Fiona-0.010
 Hank-0.010
 Hannah-0.010
 Holly-0.010
gers-0.009
Tokengers
Token ID5355
Feature activation-3.60e-01

Pos logit contributions
 Fiona+0.337
 Hannah+0.328
 Hank+0.324
Matthew+0.319
scar+0.318
Neg logit contributions
degree-0.112
 Or-0.111
 volcan-0.110
Or-0.107
pre-0.103
Token."
Token ID526
Feature activation-1.15e-02

Pos logit contributions
 Hank+0.663
 Fiona+0.660
 Hannah+0.642
 Wendy+0.630
Matthew+0.629
Neg logit contributions
 volcan-0.301
degree-0.258
 hex-0.245
 clim-0.242
 Or-0.239
Token\n
Token ID198
Feature activation+4.12e-02

Pos logit contributions
 Hank+0.020
 Hannah+0.019
 Fiona+0.019
 Holly+0.019
gers+0.019
Neg logit contributions
 volcan-0.011
degree-0.010
endo-0.010
 eru-0.009
 hex-0.009
Token\n
Token ID198
Feature activation+4.09e-02

Pos logit contributions
 volcan+0.033
degree+0.031
endo+0.030
 eru+0.028
 Or+0.028
Neg logit contributions
 Hannah-0.076
 Hank-0.076
 Holly-0.075
 Fiona-0.075
gers-0.075
TokenThey
Token ID2990
Feature activation-7.85e-02

Pos logit contributions
 volcan+0.047
degree+0.045
 Or+0.045
BO+0.044
 hex+0.043
Neg logit contributions
 Hannah-0.065
 Fiona-0.064
 Holly-0.064
 Hank-0.062
 Nora-0.060
Token laugh
Token ID6487
Feature activation-1.34e-02

Pos logit contributions
gers+0.124
 Fiona+0.123
 therapist+0.123
 Hank+0.123
 Hannah+0.123
Neg logit contributions
aph-0.069
 exc-0.068
 volcan-0.067
endo-0.066
degree-0.066
Token it
Token ID340
Feature activation-5.91e-02

Pos logit contributions
 volcan+0.008
degree+0.007
endo+0.007
 ding+0.007
 erupt+0.007
Neg logit contributions
 Fiona-0.012
 Hank-0.011
gers-0.011
 Holly-0.011
 Woo-0.011
Token smelled
Token ID37609
Feature activation+5.26e-02

Pos logit contributions
 Hank+0.092
 Fiona+0.091
 Hannah+0.090
 Holly+0.089
 Woo+0.089
Neg logit contributions
 volcan-0.066
degree-0.060
endo-0.059
tr-0.059
 hex-0.058
Token so
Token ID523
Feature activation-3.11e-03

Pos logit contributions
 volcan+0.047
 eru+0.042
 erupt+0.040
degree+0.038
 ding+0.038
Neg logit contributions
 Fiona-0.094
 therapist-0.093
 trainers-0.092
Matthew-0.092
 Holly-0.091
Token delicious
Token ID12625
Feature activation-1.33e-02

Pos logit contributions
Matthew+0.006
 trainers+0.006
 Fiona+0.005
 Buster+0.005
 Hank+0.005
Neg logit contributions
 volcan-0.003
 eru-0.002
degree-0.002
endo-0.002
 erupt-0.002
Token!
Token ID0
Feature activation-3.24e-02

Pos logit contributions
 Fiona+0.022
 trainers+0.022
 Hank+0.022
Matthew+0.022
 Hannah+0.022
Neg logit contributions
 volcan-0.012
 eru-0.010
degree-0.010
 erupt-0.010
 hex-0.010
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
scar+0.041
Matthew+0.039
Tyler+0.038
 trainers+0.037
liv+0.036
Neg logit contributions
 Pretty-0.039
 Ap-0.039
 Or-0.039
endo-0.039
 BO-0.039
Token looked
Token ID3114
Feature activation+6.97e-02

Pos logit contributions
gers+0.559
 Hank+0.557
 Fiona+0.556
 Woo+0.543
 Holly+0.539
Neg logit contributions
 volcan-0.372
endo-0.355
degree-0.346
 hex-0.339
tr-0.327
Token around
Token ID1088
Feature activation-6.62e-02

Pos logit contributions
 volcan+0.072
degree+0.067
endo+0.065
 eru+0.062
 hex+0.060
Neg logit contributions
 Hank-0.113
 Fiona-0.112
gers-0.111
 Hannah-0.111
 Holly-0.110
Token and
Token ID290
Feature activation+7.60e-03

Pos logit contributions
 Hank+0.120
gers+0.120
scar+0.119
 Woo+0.117
 Fiona+0.115
Neg logit contributions
 volcan-0.058
 eru-0.047
 hex-0.045
degree-0.045
 BO-0.044
Token saw
Token ID2497
Feature activation-3.07e-02

Pos logit contributions
 volcan+0.010
degree+0.009
endo+0.009
 hex+0.009
 eru+0.009
Neg logit contributions
 Hank-0.010
 Fiona-0.010
gers-0.010
 Hannah-0.010
 Holly-0.010
Token a
Token ID257
Feature activation-5.69e-03

Pos logit contributions
gers+0.051
 Hank+0.050
 Fiona+0.049
 Hannah+0.048
 Holly+0.048
Neg logit contributions
 volcan-0.033
endo-0.026
degree-0.026
 hex-0.026
 eru-0.025
Token was
Token ID373
Feature activation-4.16e-02

Pos logit contributions
 Hank+0.121
gers+0.120
 Hannah+0.119
 Fiona+0.118
 Holly+0.118
Neg logit contributions
 volcan-0.094
endo-0.087
degree-0.084
 hex-0.083
 eru-0.082
Token a
Token ID257
Feature activation-1.08e-02

Pos logit contributions
 Hank+0.059
 Fiona+0.059
 Hannah+0.058
 Holly+0.058
gers+0.057
Neg logit contributions
 volcan-0.051
degree-0.050
endo-0.048
 hex-0.046
tr-0.045
Token little
Token ID1310
Feature activation+5.20e-02

Pos logit contributions
 Hank+0.015
 Hannah+0.014
 Holly+0.014
 Fiona+0.014
 Woo+0.014
Neg logit contributions
degree-0.014
 volcan-0.014
endo-0.014
tr-0.013
destruct-0.013
Token girl
Token ID2576
Feature activation-2.48e-02

Pos logit contributions
 volcan+0.064
degree+0.061
endo+0.059
 hex+0.058
tr+0.057
Neg logit contributions
 Hank-0.073
 Fiona-0.073
 Hannah-0.072
 Holly-0.071
gers-0.071
Token called
Token ID1444
Feature activation-5.52e-02

Pos logit contributions
 Hank+0.030
 Fiona+0.030
 Hannah+0.030
 Holly+0.029
gers+0.029
Neg logit contributions
 volcan-0.036
degree-0.035
endo-0.034
 hex-0.033
 eru-0.032
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
 Fiona+0.080
 Hannah+0.080
 Holly+0.079
 Hank+0.079
gers+0.075
Neg logit contributions
 volcan-0.068
degree-0.066
endo-0.064
 hex-0.060
 eru-0.060
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
 Hank+0.523
 Fiona+0.516
 Hannah+0.511
 Holly+0.509
gers+0.504
Neg logit contributions
 volcan-0.438
degree-0.421
endo-0.407
 hex-0.391
tr-0.389
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
scar+0.041
 afloat+0.040
rim+0.040
Patrick+0.040
achers+0.040
Neg logit contributions
 Pretty-0.005
 At-0.003
 Ap-0.002
 Or-0.002
 Ding-0.002
Token was
Token ID373
Feature activation-4.24e-02

Pos logit contributions
 Hank+0.561
 Fiona+0.554
 Holly+0.549
 trainers+0.539
 Hannah+0.539
Neg logit contributions
 volcan-0.401
degree-0.366
endo-0.366
tr-0.365
 eru-0.348
Token very
Token ID845
Feature activation+7.51e-02

Pos logit contributions
 Hank+0.066
 Fiona+0.065
 Hannah+0.065
 Holly+0.064
gers+0.063
Neg logit contributions
 volcan-0.047
degree-0.046
endo-0.044
 hex-0.042
tr-0.041
Token naughty
Token ID45128
Feature activation-4.42e-03

Pos logit contributions
 volcan+0.092
degree+0.089
endo+0.086
 hex+0.083
 eru+0.081
Neg logit contributions
 Hank-0.107
 Fiona-0.106
 Hannah-0.104
gers-0.104
 Holly-0.103
Token fashion
Token ID6977
Feature activation-1.65e-02

Pos logit contributions
gers+0.114
 Hank+0.109
 Fiona+0.108
Matthew+0.107
 therapist+0.106
Neg logit contributions
 volcan-0.055
endo-0.047
degree-0.045
 hex-0.045
od-0.044
Token.
Token ID13
Feature activation-4.89e-03

Pos logit contributions
 Fiona+0.026
 Hank+0.026
 Otto+0.025
 Holly+0.025
 Woo+0.024
Neg logit contributions
degree-0.016
 volcan-0.016
 hex-0.015
tr-0.015
pre-0.015
Token Every
Token ID3887
Feature activation-7.37e-02

Pos logit contributions
scar+0.009
gers+0.009
Matthew+0.009
tto+0.009
achers+0.008
Neg logit contributions
 Pretty-0.003
 Or-0.003
 Ap-0.003
endo-0.003
 At-0.003
Token day
Token ID1110
Feature activation+5.87e-02

Pos logit contributions
 Hank+0.150
 Woo+0.145
 Fiona+0.143
Matthew+0.143
 Otto+0.143
Neg logit contributions
 volcan-0.041
degree-0.040
endo-0.040
 hex-0.034
 clim-0.033
Token when
Token ID618
Feature activation+9.16e-02

Pos logit contributions
 bal+0.031
 stops+0.027
 Pretty+0.026
 Ap+0.026
 clim+0.025
Neg logit contributions
latable-0.111
scar-0.103
Matthew-0.101
eming-0.100
 afloat-0.100
Token Hannah
Token ID26013
Feature activation-3.60e-01

Pos logit contributions
 Pretty+0.066
 Ap+0.063
 birth+0.063
 Sorry+0.062
 exc+0.059
Neg logit contributions
latable-0.160
eming-0.152
uster-0.151
scar-0.143
agging-0.142
Token got
Token ID1392
Feature activation+7.65e-02

Pos logit contributions
 Hank+0.518
 Hannah+0.515
 Fiona+0.511
 Holly+0.507
gers+0.497
Neg logit contributions
 volcan-0.444
degree-0.430
endo-0.413
 hex-0.394
 eru-0.387
Token dressed
Token ID12049
Feature activation-2.34e-01

Pos logit contributions
 volcan+0.076
degree+0.071
endo+0.067
 hex+0.064
aph+0.063
Neg logit contributions
gers-0.121
 Hank-0.120
 Fiona-0.119
latable-0.117
 Rover-0.116
Token,
Token ID11
Feature activation-1.22e-02

Pos logit contributions
 Rover+0.442
scar+0.442
 Fiona+0.441
 Woo+0.440
 Otto+0.436
Neg logit contributions
 how-0.131
 ding-0.116
 volcan-0.116
 towers-0.109
degree-0.108
Token her
Token ID607
Feature activation+2.90e-02

Pos logit contributions
latable+0.028
gers+0.027
achers+0.027
agging+0.027
eming+0.027
Neg logit contributions
 how-0.004
 pr-0.002
 bal-0.001
 exc-0.001
 she-0.001
Token mum
Token ID25682
Feature activation-9.65e-02

Pos logit contributions
 volcan+0.033
degree+0.032
 hex+0.031
endo+0.031
tr+0.030
Neg logit contributions
 Woo-0.040
 Hank-0.040
 Fiona-0.040
gers-0.039
Matthew-0.039
Token there
Token ID612
Feature activation-7.50e-02

Pos logit contributions
 Hank+0.015
 Fiona+0.015
 Hannah+0.015
 Holly+0.015
gers+0.015
Neg logit contributions
 volcan-0.019
degree-0.018
endo-0.018
 hex-0.017
tr-0.017
Token were
Token ID547
Feature activation-7.10e-03

Pos logit contributions
gers+0.119
 trainers+0.117
 Hank+0.117
 Hannah+0.116
 therapist+0.116
Neg logit contributions
 volcan-0.081
endo-0.073
 hex-0.070
degree-0.070
 eru-0.070
Token two
Token ID734
Feature activation-4.21e-02

Pos logit contributions
 Hank+0.009
 Fiona+0.009
 Hannah+0.009
 Holly+0.009
gers+0.009
Neg logit contributions
 volcan-0.010
degree-0.009
endo-0.009
 hex-0.009
 eru-0.009
Token friends
Token ID2460
Feature activation-4.82e-02

Pos logit contributions
 Hank+0.064
 therapist+0.063
 Hannah+0.063
 Fiona+0.062
 Holly+0.062
Neg logit contributions
 volcan-0.052
 eru-0.045
endo-0.043
 hex-0.042
degree-0.042
Token,
Token ID11
Feature activation-1.19e-02

Pos logit contributions
 Hank+0.077
gers+0.076
 therapist+0.076
 Holly+0.075
 Hannah+0.075
Neg logit contributions
 volcan-0.055
degree-0.048
endo-0.047
 eru-0.046
 hex-0.044
Token Hank
Token ID24386
Feature activation-3.59e-01

Pos logit contributions
 Hank+0.018
 Fiona+0.018
 Hannah+0.018
gers+0.018
 Holly+0.018
Neg logit contributions
 volcan-0.013
degree-0.013
endo-0.012
 hex-0.012
tr-0.011
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 Hank+0.633
 Fiona+0.633
 Holly+0.627
 Hannah+0.620
 trainers+0.616
Neg logit contributions
 volcan-0.319
degree-0.293
endo-0.280
pre-0.279
 hex-0.270
Token Sally
Token ID25737
Feature activation-4.19e-02

Pos logit contributions
 volcan+0.009
degree+0.008
endo+0.008
 clim+0.008
aph+0.008
Neg logit contributions
gers-0.018
 therapist-0.018
 trainers-0.017
latable-0.016
 ult-0.016
Token.
Token ID13
Feature activation-6.56e-03

Pos logit contributions
 Hank+0.066
gers+0.066
 Holly+0.065
 Fiona+0.065
 Hannah+0.065
Neg logit contributions
 volcan-0.045
degree-0.041
endo-0.040
tr-0.039
 eru-0.039
Token They
Token ID1119
Feature activation-4.17e-02

Pos logit contributions
 therapist+0.014
guard+0.012
gers+0.012
Matthew+0.012
liv+0.012
Neg logit contributions
 Pretty-0.003
endo-0.003
 skysc-0.003
 Ap-0.003
 Ding-0.003
Token loved
Token ID6151
Feature activation-4.68e-02

Pos logit contributions
 Fiona+0.065
 Hank+0.065
 Hannah+0.065
 therapist+0.064
 Holly+0.064
Neg logit contributions
 volcan-0.045
degree-0.042
endo-0.041
 hex-0.040
 eru-0.039
Token jungle
Token ID20712
Feature activation+1.58e-01

Pos logit contributions
 Hank+0.022
 Fiona+0.022
 Hannah+0.022
 Holly+0.022
gers+0.021
Neg logit contributions
 volcan-0.021
degree-0.020
endo-0.020
 hex-0.019
tr-0.018
Token.
Token ID13
Feature activation-4.72e-03

Pos logit contributions
 volcan+0.185
degree+0.175
endo+0.169
 hex+0.163
tr+0.161
Neg logit contributions
 Hank-0.240
 Fiona-0.235
 Hannah-0.235
 Holly-0.233
gers-0.227
Token\n
Token ID198
Feature activation+4.21e-02

Pos logit contributions
gers+0.009
scar+0.009
Matthew+0.008
agging+0.008
smanship+0.008
Neg logit contributions
 Or-0.003
 Pretty-0.003
 Coming-0.002
degree-0.002
 Amid-0.002
Token\n
Token ID198
Feature activation+4.20e-02

Pos logit contributions
degree+0.012
 Or+0.010
endo+0.009
BO+0.008
 volcan+0.007
Neg logit contributions
gers-0.094
 trainers-0.092
 Hank-0.092
 therapist-0.090
latable-0.090
TokenAs
Token ID1722
Feature activation+2.88e-02

Pos logit contributions
Down+0.022
BO+0.022
Upon+0.021
Sur+0.021
Ps+0.020
Neg logit contributions
smanship-0.078
 Oscar-0.077
 Ned-0.077
 avail-0.077
 Hank-0.077
Token Hannah
Token ID26013
Feature activation-3.59e-01

Pos logit contributions
 Pretty+0.008
 Ap+0.006
 exc+0.006
BO+0.005
endo+0.005
Neg logit contributions
latable-0.063
eming-0.058
 trainers-0.058
 ult-0.058
gers-0.057
Token walked
Token ID6807
Feature activation+3.73e-03

Pos logit contributions
 Hank+0.579
 Fiona+0.574
 Hannah+0.573
 Holly+0.570
gers+0.559
Neg logit contributions
 volcan-0.371
degree-0.354
endo-0.347
 hex-0.325
tr-0.322
Token further
Token ID2252
Feature activation-1.23e-01

Pos logit contributions
 volcan+0.003
degree+0.003
aph+0.003
 clim+0.003
 eru+0.003
Neg logit contributions
 Hank-0.006
 Hannah-0.006
 therapist-0.006
 Woo-0.006
Matthew-0.006
Token and
Token ID290
Feature activation+7.01e-03

Pos logit contributions
 Hank+0.189
 Fiona+0.187
 Hannah+0.186
 Holly+0.185
gers+0.183
Neg logit contributions
 volcan-0.138
degree-0.133
endo-0.128
 hex-0.122
tr-0.120
Token further
Token ID2252
Feature activation-1.23e-01

Pos logit contributions
 volcan+0.012
degree+0.012
endo+0.012
pre+0.012
tr+0.012
Neg logit contributions
 Fiona-0.006
gers-0.006
 Hank-0.006
 Holly-0.006
 Hannah-0.006
Token away
Token ID1497
Feature activation+9.86e-03

Pos logit contributions
 Hank+0.193
 Hannah+0.191
 Fiona+0.189
 Holly+0.188
gers+0.184
Neg logit contributions
 volcan-0.133
degree-0.128
endo-0.120
 hex-0.115
 eru-0.115
Token\n
Token ID198
Feature activation+4.21e-02

Pos logit contributions
 Hank+0.004
 Hannah+0.004
 Fiona+0.004
 Holly+0.004
 Nora+0.003
Neg logit contributions
 volcan-0.004
degree-0.004
endo-0.004
 hex-0.004
raper-0.004
Token\n
Token ID198
Feature activation+4.24e-02

Pos logit contributions
endo+0.036
degree+0.036
 volcan+0.035
 eru+0.033
 Or+0.031
Neg logit contributions
gers-0.073
 Hank-0.071
 trainers-0.071
 Fiona-0.070
 Hannah-0.070
TokenS
Token ID50
Feature activation+6.85e-02

Pos logit contributions
degree+0.040
 volcan+0.040
endo+0.039
pre+0.036
 eru+0.036
Neg logit contributions
 Fiona-0.070
 Hank-0.070
 Hannah-0.069
 Holly-0.068
gers-0.068
Tokenally
Token ID453
Feature activation-4.27e-02

Pos logit contributions
 volcan+0.131
degree+0.123
 hex+0.121
endo+0.120
tr+0.117
Neg logit contributions
 Hank-0.056
gers-0.054
 Hannah-0.053
 Fiona-0.053
 Holly-0.051
Token and
Token ID290
Feature activation+8.52e-03

Pos logit contributions
 Hank+0.059
 Fiona+0.058
 Hannah+0.057
 Holly+0.057
gers+0.057
Neg logit contributions
 volcan-0.054
degree-0.052
endo-0.051
 hex-0.049
tr-0.049
Token Hank
Token ID24386
Feature activation-3.59e-01

Pos logit contributions
 volcan+0.012
degree+0.012
endo+0.011
 hex+0.011
tr+0.011
Neg logit contributions
 Hank-0.011
 Fiona-0.011
 Hannah-0.011
 Holly-0.011
gers-0.010
Token hugged
Token ID41130
Feature activation-3.10e-02

Pos logit contributions
 Fiona+0.610
 Hank+0.589
 Holly+0.586
gers+0.580
 Hannah+0.579
Neg logit contributions
 volcan-0.307
degree-0.303
endo-0.302
 hex-0.293
tr-0.290
Token each
Token ID1123
Feature activation-7.09e-02

Pos logit contributions
gers+0.053
 therapist+0.052
 trainers+0.052
scar+0.050
Matthew+0.050
Neg logit contributions
 eru-0.021
 volcan-0.021
endo-0.021
 how-0.020
 BO-0.020
Token other
Token ID584
Feature activation-2.37e-02

Pos logit contributions
 Hank+0.101
 Fiona+0.098
 Hannah+0.098
gers+0.097
 Holly+0.097
Neg logit contributions
 volcan-0.088
degree-0.086
endo-0.083
 hex-0.079
 eru-0.078
Token and
Token ID290
Feature activation+9.85e-03

Pos logit contributions
 Hank+0.038
 Fiona+0.037
 Holly+0.036
gers+0.036
 Hannah+0.036
Neg logit contributions
 volcan-0.024
degree-0.022
endo-0.022
tr-0.021
 eru-0.021
Token from
Token ID422
Feature activation+7.43e-02

Pos logit contributions
 volcan+0.009
degree+0.009
endo+0.008
 eru+0.008
pre+0.008
Neg logit contributions
gers-0.016
 Hank-0.016
 trainers-0.016
 Holly-0.015
 Fiona-0.015
Token.
Token ID13
Feature activation-2.90e-03

Pos logit contributions
 volcan+0.000
degree+0.000
endo+0.000
 hex+0.000
 eru+0.000
Neg logit contributions
 Hank-0.001
 Hannah-0.001
 Fiona-0.001
 Holly-0.001
gers-0.001
Token\n
Token ID198
Feature activation+4.91e-02

Pos logit contributions
gers+0.006
smanship+0.005
scar+0.005
 therapist+0.005
liv+0.005
Neg logit contributions
 Ding-0.002
 Pretty-0.002
 volcan-0.002
 BO-0.002
 Or-0.002
Token\n
Token ID198
Feature activation+4.89e-02

Pos logit contributions
 volcan+0.018
endo+0.018
degree+0.017
BO+0.015
 Or+0.015
Neg logit contributions
gers-0.107
 trainers-0.103
 therapist-0.103
 Hank-0.100
latable-0.100
TokenSuddenly
Token ID38582
Feature activation+2.82e-03

Pos logit contributions
BO+0.034
Down+0.028
degree+0.026
CR+0.025
Pretty+0.024
Neg logit contributions
 avail-0.091
gers-0.089
smanship-0.088
 therapist-0.086
 rightful-0.086
Token,
Token ID11
Feature activation-9.20e-03

Pos logit contributions
 volcan+0.002
 BO+0.001
 Ding+0.001
 hex+0.001
 POP+0.001
Neg logit contributions
latable-0.005
liv-0.005
gers-0.005
agging-0.005
 rightful-0.005
Token Hank
Token ID24386
Feature activation-3.59e-01

Pos logit contributions
gers+0.016
latable+0.015
agging+0.015
 ult+0.015
 trainers+0.014
Neg logit contributions
 volcan-0.008
 Ding-0.007
 BO-0.007
 hex-0.007
 ding-0.007
Token saw
Token ID2497
Feature activation-2.62e-02

Pos logit contributions
 Hank+0.523
 Fiona+0.522
gers+0.515
 Holly+0.514
 Hannah+0.513
Neg logit contributions
 volcan-0.414
degree-0.396
endo-0.386
 hex-0.373
tr-0.372
Token something
Token ID1223
Feature activation-4.90e-02

Pos logit contributions
gers+0.049
 Holly+0.048
 therapist+0.048
 Fiona+0.047
 Hannah+0.047
Neg logit contributions
 volcan-0.022
 how-0.015
 BO-0.015
 bricks-0.015
 hex-0.015
Token dug
Token ID18735
Feature activation+3.75e-02

Pos logit contributions
 Hannah+0.082
 Hank+0.081
 Woo+0.081
gers+0.081
 Holly+0.081
Neg logit contributions
 volcan-0.048
degree-0.042
 erupt-0.041
 hex-0.041
tr-0.040
Token in
Token ID287
Feature activation+5.04e-02

Pos logit contributions
 volcan+0.030
endo+0.026
 erupt+0.026
degree+0.025
 eru+0.024
Neg logit contributions
 Hannah-0.073
 Hank-0.071
 Holly-0.070
 therapist-0.070
 Fiona-0.070
Token the
Token ID262
Feature activation-1.29e-02

Pos logit contributions
 volcan+0.038
 erupt+0.028
 bricks+0.026
 BO+0.026
 hex+0.025
Neg logit contributions
 therapist-0.107
gers-0.106
liv-0.103
scar-0.102
 trainers-0.101
Token there
Token ID612
Feature activation-7.73e-02

Pos logit contributions
 Hank+0.021
 Fiona+0.021
 Hannah+0.021
 Holly+0.020
gers+0.020
Neg logit contributions
 volcan-0.026
degree-0.025
endo-0.025
 hex-0.024
tr-0.024
Token was
Token ID373
Feature activation-3.59e-02

Pos logit contributions
gers+0.122
 Hank+0.120
 Hannah+0.120
 trainers+0.119
 Fiona+0.119
Neg logit contributions
 volcan-0.084
endo-0.076
degree-0.073
 hex-0.073
 eru-0.072
Token a
Token ID257
Feature activation-7.76e-03

Pos logit contributions
 Hank+0.050
 Fiona+0.049
 Hannah+0.049
 Holly+0.049
gers+0.048
Neg logit contributions
 volcan-0.046
degree-0.045
endo-0.043
 hex-0.041
tr-0.041
Token girl
Token ID2576
Feature activation-2.05e-02

Pos logit contributions
 Hank+0.010
 Hannah+0.010
 Holly+0.010
 Fiona+0.010
 Woo+0.010
Neg logit contributions
degree-0.010
 volcan-0.010
endo-0.010
tr-0.009
destruct-0.009
Token named
Token ID3706
Feature activation-3.28e-02

Pos logit contributions
 Hank+0.026
 Fiona+0.026
 Hannah+0.026
 Holly+0.025
gers+0.025
Neg logit contributions
 volcan-0.029
degree-0.028
endo-0.027
 hex-0.026
tr-0.026
Token Hannah
Token ID26013
Feature activation-3.58e-01

Pos logit contributions
 Hannah+0.044
 Holly+0.044
 Fiona+0.044
 Hank+0.041
 Woo+0.041
Neg logit contributions
 volcan-0.046
degree-0.045
endo-0.044
pre-0.044
 hex-0.043
Token.
Token ID13
Feature activation-9.51e-03

Pos logit contributions
 Hank+0.477
 Hannah+0.473
 Fiona+0.472
 Holly+0.466
gers+0.461
Neg logit contributions
 volcan-0.473
degree-0.463
endo-0.455
 hex-0.430
 eru-0.422
Token She
Token ID1375
Feature activation-3.61e-02

Pos logit contributions
scar+0.021
guards+0.020
achers+0.020
rim+0.020
 afloat+0.020
Neg logit contributions
 Pretty-0.002
 At-0.002
 Ap-0.001
 Ding-0.001
 Or-0.001
Token had
Token ID550
Feature activation-8.58e-02

Pos logit contributions
 Hank+0.045
 Fiona+0.044
 Hannah+0.044
 Holly+0.043
gers+0.043
Neg logit contributions
 volcan-0.052
degree-0.050
endo-0.049
 hex-0.047
tr-0.047
Token a
Token ID257
Feature activation-8.28e-03

Pos logit contributions
 Hank+0.140
 Fiona+0.138
gers+0.137
 Holly+0.135
 Hannah+0.135
Neg logit contributions
 volcan-0.089
degree-0.084
endo-0.083
tr-0.081
 hex-0.078
Token cool
Token ID3608
Feature activation+2.09e-02

Pos logit contributions
 Hannah+0.011
 Hank+0.011
 Holly+0.011
 Fiona+0.011
gers+0.010
Neg logit contributions
 volcan-0.012
degree-0.011
endo-0.011
 eru-0.010
 hex-0.010
Token cup
Token ID6508
Feature activation-5.84e-02

Pos logit contributions
 Fiona+0.006
 Hank+0.006
 Hannah+0.006
 Holly+0.006
gers+0.006
Neg logit contributions
 volcan-0.005
degree-0.005
 hex-0.005
endo-0.005
tr-0.005
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 Hank+0.094
 Fiona+0.094
 Hannah+0.091
 Holly+0.090
gers+0.089
Neg logit contributions
 volcan-0.059
tr-0.057
degree-0.057
endo-0.053
 hex-0.052
Token gave
Token ID2921
Feature activation+5.82e-03

Pos logit contributions
 volcan+0.014
degree+0.014
endo+0.013
tr+0.013
 hex+0.013
Neg logit contributions
gers-0.017
 Fiona-0.016
 Hank-0.016
 Holly-0.016
 Hannah-0.016
Token it
Token ID340
Feature activation-5.35e-02

Pos logit contributions
 how+0.003
degree+0.003
dark+0.003
endo+0.003
sur+0.003
Neg logit contributions
latable-0.011
gers-0.011
Matthew-0.011
 trainers-0.011
 shepherd-0.011
Token to
Token ID284
Feature activation-7.24e-03

Pos logit contributions
 Hank+0.078
 Fiona+0.076
 Hannah+0.076
 Holly+0.075
gers+0.075
Neg logit contributions
 volcan-0.065
degree-0.063
endo-0.061
tr-0.058
 hex-0.058
Token Hannah
Token ID26013
Feature activation-3.58e-01

Pos logit contributions
gers+0.010
latable+0.009
 trainers+0.009
scar+0.009
Matthew+0.008
Neg logit contributions
 how-0.008
endo-0.008
 exc-0.008
degree-0.008
 Pret-0.007
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
gers+0.656
 Hank+0.632
 Fiona+0.623
 trainers+0.621
Matthew+0.615
Neg logit contributions
 volcan-0.275
ast-0.248
degree-0.245
od-0.243
tr-0.242
Token Suddenly
Token ID24975
Feature activation-3.29e-02

Pos logit contributions
scar+0.007
smanship+0.007
gers+0.007
Matthew+0.007
liv+0.006
Neg logit contributions
 Pretty-0.003
 POP-0.002
 Ding-0.002
 At-0.002
 BO-0.002
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
latable+0.080
 rightful+0.074
scar+0.074
agging+0.073
liv+0.072
Neg logit contributions
 stops-0.005
 bal-0.003
 BO-0.003
 POP-0.003
 pr-0.002
Token it
Token ID340
Feature activation-5.43e-02

Pos logit contributions
latable+0.028
agging+0.027
eming+0.026
gers+0.026
scar+0.025
Neg logit contributions
 how-0.007
 Ding-0.007
 POP-0.006
 bal-0.006
 BO-0.006
Token began
Token ID2540
Feature activation-1.98e-02

Pos logit contributions
 Hank+0.083
 Fiona+0.083
 Hannah+0.082
 Holly+0.081
gers+0.081
Neg logit contributions
 volcan-0.061
degree-0.058
endo-0.056
tr-0.056
 hex-0.055
Token all
Token ID477
Feature activation+1.14e-02

Pos logit contributions
gers+0.007
 therapist+0.006
 Hank+0.006
 Woo+0.006
 Fiona+0.006
Neg logit contributions
 volcan-0.003
 erupt-0.003
 ding-0.002
endo-0.002
 clim-0.002
Token tired
Token ID10032
Feature activation-1.03e-01

Pos logit contributions
 volcan+0.011
degree+0.010
 clim+0.010
 erupt+0.010
 ding+0.009
Neg logit contributions
gers-0.019
 Hank-0.018
 Holly-0.018
 Fiona-0.018
 therapist-0.018
Token but
Token ID475
Feature activation-5.64e-02

Pos logit contributions
 Holly+0.168
 Fiona+0.168
 Hank+0.168
 Hannah+0.165
 trainers+0.164
Neg logit contributions
 volcan-0.102
degree-0.090
 eru-0.087
tr-0.086
 erupt-0.085
Token happy
Token ID3772
Feature activation+5.36e-02

Pos logit contributions
gers+0.110
 trainers+0.105
Matthew+0.105
uster+0.104
 Woo+0.103
Neg logit contributions
 volcan-0.040
 erupt-0.034
 exc-0.032
 how-0.031
 eru-0.030
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
 volcan+0.055
degree+0.048
 eru+0.047
endo+0.045
 erupt+0.045
Neg logit contributions
 Hank-0.086
 Holly-0.085
 Fiona-0.085
gers-0.084
 Hannah-0.084
Token Hannah
Token ID26013
Feature activation-3.58e-01

Pos logit contributions
gers+0.010
 trainers+0.010
Matthew+0.010
scar+0.010
 therapist+0.009
Neg logit contributions
endo-0.006
 Ap-0.006
 Or-0.006
 volcan-0.005
 Pret-0.005
Token had
Token ID550
Feature activation-8.05e-02

Pos logit contributions
gers+0.604
 Hank+0.595
 Fiona+0.587
 Woo+0.579
 Holly+0.578
Neg logit contributions
 volcan-0.280
endo-0.273
degree-0.269
 pressed-0.266
ast-0.261
Token a
Token ID257
Feature activation-5.09e-03

Pos logit contributions
gers+0.148
 Fiona+0.145
 Hank+0.141
 Holly+0.139
 therapist+0.139
Neg logit contributions
endo-0.057
 pressed-0.056
degree-0.055
BO-0.052
 volcan-0.050
Token very
Token ID845
Feature activation+8.25e-02

Pos logit contributions
 Hank+0.007
 Fiona+0.007
 Hannah+0.007
 Holly+0.007
gers+0.007
Neg logit contributions
 volcan-0.007
degree-0.006
endo-0.006
 hex-0.006
tr-0.006
Token fun
Token ID1257
Feature activation+1.59e-03

Pos logit contributions
 volcan+0.093
degree+0.090
endo+0.087
 eru+0.082
 hex+0.080
Neg logit contributions
 Hank-0.127
 Hannah-0.125
 Fiona-0.125
 Holly-0.124
 Woo-0.121
Token day
Token ID1110
Feature activation+5.96e-02

Pos logit contributions
 volcan+0.001
degree+0.001
endo+0.001
tr+0.001
 eru+0.001
Neg logit contributions
 Hank-0.003
 Holly-0.003
 Hannah-0.003
 Fiona-0.003
gers-0.003
Token how
Token ID703
Feature activation+2.30e-01

Pos logit contributions
degree+0.041
 volcan+0.040
endo+0.040
tr+0.038
pre+0.038
Neg logit contributions
 Hank-0.035
 Hannah-0.034
 Holly-0.034
gers-0.034
 Fiona-0.034
Token he
Token ID339
Feature activation-9.55e-03

Pos logit contributions
 volcan+0.273
degree+0.260
endo+0.249
 hex+0.241
 eru+0.236
Neg logit contributions
 Hank-0.343
 Fiona-0.342
 Hannah-0.338
 Holly-0.338
gers-0.331
Token could
Token ID714
Feature activation+5.25e-02

Pos logit contributions
gers+0.011
 Hank+0.010
 therapist+0.010
 Holly+0.010
 Hannah+0.010
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.013
 eru-0.013
raper-0.013
Token use
Token ID779
Feature activation+4.09e-04

Pos logit contributions
 volcan+0.064
degree+0.061
endo+0.060
 hex+0.057
tr+0.057
Neg logit contributions
 Hank-0.076
 Fiona-0.075
 Hannah-0.075
 Holly-0.074
gers-0.073
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
tr+0.001
degree+0.001
pre+0.001
endo+0.001
aph+0.001
Neg logit contributions
 Hank-0.000
 Hannah-0.000
 Holly-0.000
 Fiona-0.000
 Rover-0.000
Token avocado
Token ID40377
Feature activation+2.17e-01

Pos logit contributions
 Hannah+0.022
 Holly+0.022
 Hank+0.022
 trainers+0.021
gers+0.021
Neg logit contributions
endo-0.024
degree-0.022
tr-0.021
ast-0.021
destruct-0.020
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
endo+0.367
degree+0.359
 eru+0.342
 volcan+0.341
 BO+0.338
Neg logit contributions
gers-0.213
 Holly-0.193
 Fiona-0.188
 therapist-0.185
 trainers-0.183
Token make
Token ID787
Feature activation-4.91e-02

Pos logit contributions
 Hank+0.013
 Hannah+0.013
 Fiona+0.013
 Holly+0.013
gers+0.013
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.014
tr-0.013
pre-0.013
Token something
Token ID1223
Feature activation-5.56e-02

Pos logit contributions
 Hank+0.055
 Hannah+0.053
 Fiona+0.052
 Holly+0.051
gers+0.049
Neg logit contributions
tr-0.076
degree-0.075
endo-0.072
 volcan-0.071
pre-0.070
Token else
Token ID2073
Feature activation+1.62e-01

Pos logit contributions
 Hank+0.069
 Holly+0.068
 Hannah+0.067
 Fiona+0.067
gers+0.065
Neg logit contributions
endo-0.078
 volcan-0.077
degree-0.076
tr-0.072
destruct-0.071
Token.
Token ID13
Feature activation-9.10e-03

Pos logit contributions
 volcan+0.207
degree+0.202
endo+0.198
 hex+0.186
tr+0.186
Neg logit contributions
 Hank-0.221
 Fiona-0.218
 Hannah-0.217
 Holly-0.216
gers-0.213
Token but
Token ID475
Feature activation-5.89e-02

Pos logit contributions
 Hank+0.019
 Fiona+0.019
 Hannah+0.018
 Holly+0.018
gers+0.018
Neg logit contributions
 volcan-0.019
degree-0.019
endo-0.018
 hex-0.018
tr-0.017
Token he
Token ID339
Feature activation-9.93e-03

Pos logit contributions
 Hank+0.077
 Fiona+0.076
 Hannah+0.076
 Holly+0.075
gers+0.074
Neg logit contributions
 volcan-0.080
degree-0.078
endo-0.076
 hex-0.073
tr-0.073
Token didn
Token ID1422
Feature activation+5.76e-02

Pos logit contributions
 Hank+0.015
 Fiona+0.015
 Hannah+0.015
 Holly+0.015
gers+0.015
Neg logit contributions
 volcan-0.011
degree-0.011
endo-0.011
 hex-0.010
tr-0.010
Token't
Token ID470
Feature activation+2.11e-04

Pos logit contributions
 volcan+0.069
degree+0.068
endo+0.067
 hex+0.063
tr+0.062
Neg logit contributions
 Hank-0.082
 Fiona-0.082
 Hannah-0.081
gers-0.081
 Holly-0.081
Token know
Token ID760
Feature activation+7.28e-02

Pos logit contributions
 volcan+0.000
degree+0.000
endo+0.000
 hex+0.000
tr+0.000
Neg logit contributions
 Hank-0.000
 Fiona-0.000
 Hannah-0.000
 Holly-0.000
gers-0.000
Token how
Token ID703
Feature activation+2.30e-01

Pos logit contributions
endo+0.059
degree+0.057
 volcan+0.055
pre+0.052
tr+0.052
Neg logit contributions
 Hank-0.133
 Fiona-0.132
 Hannah-0.130
 Holly-0.129
gers-0.126
Token.
Token ID13
Feature activation-1.88e-03

Pos logit contributions
 volcan+0.201
degree+0.185
 erupt+0.159
 exc+0.158
 eru+0.156
Neg logit contributions
 Holly-0.431
 Hannah-0.430
 Hank-0.429
 Fiona-0.417
 therapist-0.414
Token
Token ID198
Feature activation+4.17e-02

Pos logit contributions
 Hank+0.002
 Fiona+0.002
 Holly+0.002
 Hannah+0.002
 Oscar+0.002
Neg logit contributions
 volcan-0.003
degree-0.003
endo-0.003
raper-0.003
pre-0.003
Token
Token ID198
Feature activation+4.17e-02

Pos logit contributions
 volcan+0.052
degree+0.050
endo+0.048
 hex+0.047
tr+0.046
Neg logit contributions
 Hank-0.059
 Fiona-0.059
 Hannah-0.057
 Holly-0.057
gers-0.056
TokenThen
Token ID6423
Feature activation+4.01e-02

Pos logit contributions
 volcan+0.057
degree+0.055
endo+0.053
 hex+0.051
tr+0.051
Neg logit contributions
 Hank-0.054
 Fiona-0.053
gers-0.053
 Hannah-0.053
 Holly-0.053
Token one
Token ID530
Feature activation-3.82e-02

Pos logit contributions
degree+0.057
endo+0.057
 volcan+0.057
tr+0.054
pre+0.053
Neg logit contributions
 Hank-0.049
 Fiona-0.048
gers-0.048
 Hannah-0.047
 Holly-0.047
Token be
Token ID307
Feature activation-5.24e-03

Pos logit contributions
 volcan+0.000
endo+0.000
degree+0.000
 hex+0.000
 eru+0.000
Neg logit contributions
 Hank-0.000
 Fiona-0.000
 Hannah-0.000
 Woo-0.000
 Holly-0.000
Token so
Token ID523
Feature activation-5.09e-03

Pos logit contributions
 Hank+0.008
 Fiona+0.007
 Hannah+0.007
 Holly+0.007
gers+0.007
Neg logit contributions
 volcan-0.006
degree-0.006
endo-0.006
 hex-0.006
tr-0.006
Token mean
Token ID1612
Feature activation+1.82e-02

Pos logit contributions
 Hank+0.007
 Hannah+0.006
 Fiona+0.006
 Holly+0.006
gers+0.006
Neg logit contributions
 volcan-0.007
endo-0.007
degree-0.007
 hex-0.006
tr-0.006
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
 volcan+0.020
degree+0.019
endo+0.018
 hex+0.017
tr+0.017
Neg logit contributions
 Hank-0.028
 Fiona-0.028
 Hannah-0.028
 Holly-0.028
gers-0.027
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
 Hank+0.014
 Fiona+0.014
 Hannah+0.014
 Holly+0.014
 Woo+0.013
Neg logit contributions
 volcan-0.015
degree-0.014
endo-0.014
 hex-0.014
pre-0.013
Token statue
Token ID15207
Feature activation+2.03e-01

Pos logit contributions
gers+0.017
 pup+0.014
 Hank+0.014
 amusement+0.014
 Rover+0.014
Neg logit contributions
endo-0.024
bul-0.023
tr-0.023
imet-0.022
ants-0.022
Token.
Token ID13
Feature activation-1.58e-02

Pos logit contributions
 volcan+0.311
endo+0.311
degree+0.301
 eru+0.289
 hex+0.287
Neg logit contributions
gers-0.220
 Hannah-0.219
 Holly-0.218
 Fiona-0.215
 Hank-0.212
Token Tim
Token ID5045
Feature activation+8.17e-02

Pos logit contributions
 Hank+0.020
 Hannah+0.020
 Fiona+0.019
 Holly+0.019
 Woo+0.019
Neg logit contributions
 volcan-0.024
degree-0.023
 hex-0.023
raper-0.022
endo-0.022
Tokenmy
Token ID1820
Feature activation+1.81e-03

Pos logit contributions
 volcan+0.108
degree+0.103
endo+0.100
 hex+0.096
 eru+0.095
Neg logit contributions
 Hank-0.112
 Fiona-0.110
 Hannah-0.110
 Holly-0.109
gers-0.107
Token felt
Token ID2936
Feature activation-2.19e-02

Pos logit contributions
 volcan+0.003
degree+0.003
endo+0.002
 hex+0.002
raper+0.002
Neg logit contributions
 Hank-0.002
 Hannah-0.002
 Holly-0.002
 Fiona-0.002
gers-0.002
Token bad
Token ID2089
Feature activation+4.21e-02

Pos logit contributions
 Fiona+0.035
 Hank+0.035
 Hannah+0.035
 Holly+0.034
gers+0.034
Neg logit contributions
 volcan-0.023
degree-0.022
endo-0.021
 eru-0.020
 hex-0.020
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
 volcan+0.012
degree+0.012
endo+0.011
 hex+0.011
 eru+0.011
Neg logit contributions
 Hank-0.014
 Fiona-0.014
 Hannah-0.014
 Holly-0.014
gers-0.014
Token say
Token ID910
Feature activation-3.03e-02

Pos logit contributions
 Fiona+0.012
 Hank+0.011
 Holly+0.011
 trainers+0.011
 Wendy+0.011
Neg logit contributions
endo-0.020
 volcan-0.019
degree-0.018
pre-0.018
tr-0.018
Token hello
Token ID23748
Feature activation-2.95e-02

Pos logit contributions
 Fiona+0.028
 Holly+0.027
 Rover+0.027
 Hank+0.027
gers+0.026
Neg logit contributions
endo-0.051
 volcan-0.050
tr-0.048
degree-0.048
pre-0.048
Token.
Token ID13
Feature activation-9.35e-03

Pos logit contributions
 Hank+0.034
 Fiona+0.034
 Hannah+0.033
 Holly+0.033
 Woo+0.032
Neg logit contributions
endo-0.045
 volcan-0.044
degree-0.043
 hex-0.041
 eru-0.041
Token She
Token ID1375
Feature activation-3.98e-02

Pos logit contributions
 Fiona+0.012
 Hannah+0.012
 Holly+0.012
 Hank+0.011
 Wendy+0.011
Neg logit contributions
 volcan-0.015
degree-0.014
endo-0.014
 hex-0.013
raper-0.013
Token climbed
Token ID19952
Feature activation+1.82e-01

Pos logit contributions
 Hank+0.053
 Fiona+0.052
 Hannah+0.052
 Holly+0.052
gers+0.050
Neg logit contributions
 volcan-0.054
degree-0.052
endo-0.050
 hex-0.049
 eru-0.049
Token up
Token ID510
Feature activation-3.06e-02

Pos logit contributions
 volcan+0.285
degree+0.282
endo+0.275
tr+0.263
pre+0.262
Neg logit contributions
 Fiona-0.219
 Holly-0.208
 Hank-0.202
gers-0.201
 Woo-0.201
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
 Fiona+0.027
 Holly+0.025
 Wendy+0.025
 Woo+0.024
 Hannah+0.021
Neg logit contributions
degree-0.060
endo-0.058
 volcan-0.057
raper-0.056
ologist-0.056
Token tree
Token ID5509
Feature activation+7.49e-02

Pos logit contributions
 pouch+0.023
gers+0.022
 shady+0.021
 Woo+0.020
 pup+0.020
Neg logit contributions
endo-0.039
 volcan-0.037
 eru-0.035
degree-0.035
 Ap-0.034
Token and
Token ID290
Feature activation+4.46e-03

Pos logit contributions
 volcan+0.117
degree+0.116
endo+0.115
pre+0.110
 eru+0.110
Neg logit contributions
 Fiona-0.080
 Holly-0.076
gers-0.074
 Hannah-0.072
 Wendy-0.072
Token said
Token ID531
Feature activation-3.20e-02

Pos logit contributions
 volcan+0.007
endo+0.007
degree+0.006
 hex+0.006
 eru+0.006
Neg logit contributions
 Hank-0.006
 Fiona-0.006
 Hannah-0.005
 Holly-0.005
 Woo-0.005
Token back
Token ID736
Feature activation+2.58e-02

Pos logit contributions
 volcan+0.032
degree+0.030
endo+0.029
tr+0.027
 hex+0.027
Neg logit contributions
 Hank-0.054
 Hannah-0.054
 Fiona-0.053
 Holly-0.053
gers-0.052
Token and
Token ID290
Feature activation+1.41e-02

Pos logit contributions
 volcan+0.032
degree+0.031
endo+0.030
 hex+0.029
tr+0.029
Neg logit contributions
 Hank-0.036
 Fiona-0.036
 Hannah-0.036
 Holly-0.036
gers-0.035
Token find
Token ID1064
Feature activation-1.47e-02

Pos logit contributions
 volcan+0.021
degree+0.021
endo+0.021
pre+0.020
 hex+0.020
Neg logit contributions
 Fiona-0.016
 Hank-0.016
 Holly-0.015
 Hannah-0.015
 Rover-0.015
Token more
Token ID517
Feature activation-3.20e-02

Pos logit contributions
 Hank+0.021
 Fiona+0.021
 Hannah+0.021
 Holly+0.021
gers+0.021
Neg logit contributions
 volcan-0.018
degree-0.017
endo-0.017
 hex-0.016
tr-0.016
Token pretty
Token ID2495
Feature activation+5.47e-02

Pos logit contributions
 Hank+0.048
 Fiona+0.048
 Holly+0.046
 Hannah+0.046
gers+0.046
Neg logit contributions
endo-0.037
degree-0.035
tr-0.034
 volcan-0.034
raper-0.032
Token pe
Token ID613
Feature activation+2.15e-01

Pos logit contributions
endo+0.067
degree+0.067
 volcan+0.065
tr+0.064
raper+0.061
Neg logit contributions
 Fiona-0.076
 Hank-0.076
gers-0.076
 Holly-0.074
 Hannah-0.074
Tokenb
Token ID65
Feature activation+6.88e-02

Pos logit contributions
endo+0.318
tr+0.313
 volcan+0.305
degree+0.302
 Or+0.297
Neg logit contributions
gers-0.236
 Fiona-0.221
 Hank-0.213
 trainers-0.213
 therapist-0.206
Tokenbles
Token ID7689
Feature activation+6.67e-02

Pos logit contributions
 volcan+0.090
endo+0.090
degree+0.089
tr+0.083
 eru+0.083
Neg logit contributions
gers-0.087
 Fiona-0.087
 Hank-0.086
 Holly-0.082
 Hannah-0.082
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 volcan+0.079
degree+0.075
endo+0.071
 hex+0.070
 eru+0.068
Neg logit contributions
 Hank-0.100
 Fiona-0.098
 Hannah-0.098
 Holly-0.096
gers-0.095
Token learn
Token ID2193
Feature activation-1.49e-02

Pos logit contributions
 volcan+0.017
degree+0.017
endo+0.016
 hex+0.015
tr+0.015
Neg logit contributions
 Hank-0.018
 Fiona-0.018
 Hannah-0.017
 Holly-0.017
gers-0.017
Token more
Token ID517
Feature activation-3.33e-02

Pos logit contributions
 Hank+0.022
 Fiona+0.021
 Hannah+0.021
 Holly+0.021
gers+0.021
Neg logit contributions
 volcan-0.018
degree-0.017
endo-0.017
 hex-0.016
 eru-0.016
Token,
Token ID11
Feature activation-5.20e-03

Pos logit contributions
 volcan+0.024
degree+0.020
 hex+0.020
 eru+0.020
endo+0.020
Neg logit contributions
 therapist-0.039
 Hannah-0.038
 Hank-0.038
 Fiona-0.038
gers-0.038
Token the
Token ID262
Feature activation-9.66e-03

Pos logit contributions
gers+0.007
 Fiona+0.007
 Hank+0.007
 Hannah+0.007
 therapist+0.007
Neg logit contributions
 volcan-0.006
degree-0.005
endo-0.005
 hex-0.005
 eru-0.005
Token goat
Token ID26917
Feature activation-1.03e-01

Pos logit contributions
 Fiona+0.015
 Hannah+0.014
 Hank+0.014
 Holly+0.014
gers+0.014
Neg logit contributions
 volcan-0.011
 hex-0.010
degree-0.010
endo-0.010
tr-0.009
Token was
Token ID373
Feature activation-2.72e-02

Pos logit contributions
 Hank+0.157
 Fiona+0.156
 Hannah+0.156
gers+0.153
 Holly+0.153
Neg logit contributions
 volcan-0.112
degree-0.105
endo-0.102
tr-0.101
 hex-0.100
Token able
Token ID1498
Feature activation+9.88e-02

Pos logit contributions
gers+0.048
 Hank+0.045
 Fiona+0.045
 therapist+0.044
 Hannah+0.044
Neg logit contributions
 volcan-0.024
endo-0.023
 erupt-0.023
degree-0.022
 clim-0.021
Token to
Token ID284
Feature activation+3.58e-04

Pos logit contributions
 volcan+0.080
degree+0.069
 erupt+0.064
endo+0.062
 hex+0.061
Neg logit contributions
gers-0.186
 Hank-0.184
 Hannah-0.182
 Fiona-0.181
 Holly-0.181
Token play
Token ID711
Feature activation-2.54e-02

Pos logit contributions
 volcan+0.000
 erupt+0.000
 withstand+0.000
degree+0.000
 hex+0.000
Neg logit contributions
 Hannah-0.001
gers-0.001
 Hank-0.001
 Fiona-0.001
 Holly-0.001
Token the
Token ID262
Feature activation-8.47e-03

Pos logit contributions
 Fiona+0.041
 Hannah+0.040
 Hank+0.040
 therapist+0.040
Matthew+0.039
Neg logit contributions
 volcan-0.025
 hex-0.023
endo-0.023
degree-0.022
 eru-0.021
Token game
Token ID983
Feature activation+4.97e-02

Pos logit contributions
 Fiona+0.013
 Hank+0.013
 Hannah+0.012
 Holly+0.012
gers+0.012
Neg logit contributions
 volcan-0.010
degree-0.009
 hex-0.009
endo-0.009
tr-0.009
Token with
Token ID351
Feature activation-1.86e-03

Pos logit contributions
 volcan+0.057
degree+0.054
endo+0.052
 hex+0.052
tr+0.051
Neg logit contributions
 Hank-0.074
 Fiona-0.073
 Hannah-0.072
 Holly-0.071
gers-0.070
Token ease
Token ID10152
Feature activation+6.36e-02

Pos logit contributions
 Hank+0.003
 Fiona+0.003
 Hannah+0.003
gers+0.003
 Holly+0.003
Neg logit contributions
 volcan-0.002
degree-0.002
endo-0.002
 hex-0.002
 eru-0.002
Token says
Token ID1139
Feature activation+5.17e-02

Pos logit contributions
 Hank+0.079
 Fiona+0.078
 Hannah+0.077
 Holly+0.077
gers+0.076
Neg logit contributions
 volcan-0.056
degree-0.054
endo-0.053
 hex-0.050
tr-0.049
Token.
Token ID13
Feature activation-1.88e-03

Pos logit contributions
 volcan+0.069
degree+0.067
endo+0.065
 hex+0.062
tr+0.062
Neg logit contributions
 Hank-0.068
 Hannah-0.068
 Fiona-0.068
 Holly-0.067
gers-0.066
Token
Token ID198
Feature activation+5.87e-02

Pos logit contributions
 Hank+0.003
 Fiona+0.003
 Holly+0.002
 Hannah+0.002
 Woo+0.002
Neg logit contributions
degree-0.003
 volcan-0.002
endo-0.002
pre-0.002
tr-0.002
Token
Token ID198
Feature activation+5.84e-02

Pos logit contributions
 volcan+0.050
endo+0.045
degree+0.045
 eru+0.042
 Or+0.042
Neg logit contributions
gers-0.106
 Hank-0.106
 Hannah-0.105
 Fiona-0.104
 Holly-0.104
Token"
Token ID1
Feature activation-2.70e-02

Pos logit contributions
 Or+0.055
 volcan+0.055
BO+0.053
 hex+0.048
ast+0.048
Neg logit contributions
 Fiona-0.102
 Hank-0.101
 Hannah-0.101
 Holly-0.098
 Buster-0.097
TokenOh
Token ID5812
Feature activation+8.47e-02

Pos logit contributions
 Hank+0.033
gers+0.033
 Holly+0.032
 Hannah+0.032
 therapist+0.032
Neg logit contributions
 volcan-0.039
degree-0.038
endo-0.038
 eru-0.035
tr-0.035
Token,
Token ID11
Feature activation-5.22e-03

Pos logit contributions
 volcan+0.119
degree+0.119
endo+0.118
pre+0.113
tr+0.112
Neg logit contributions
 Fiona-0.105
 Hank-0.104
 Hannah-0.103
 Holly-0.101
 Woo-0.097
Token I
Token ID314
Feature activation-1.77e-02

Pos logit contributions
 Hank+0.008
gers+0.008
 Fiona+0.008
 Holly+0.008
 Hannah+0.008
Neg logit contributions
 volcan-0.006
degree-0.005
endo-0.005
 hex-0.005
 eru-0.005
Token see
Token ID766
Feature activation+5.27e-03

Pos logit contributions
gers+0.030
 Fiona+0.030
 Holly+0.029
 Hank+0.029
 Hannah+0.029
Neg logit contributions
 volcan-0.016
 hex-0.015
degree-0.015
endo-0.015
 eru-0.015
Token.
Token ID13
Feature activation-2.19e-03

Pos logit contributions
degree+0.008
endo+0.008
tr+0.008
 volcan+0.008
raper+0.007
Neg logit contributions
 Hank-0.006
 Fiona-0.006
 Hannah-0.006
 Holly-0.006
 Woo-0.006
Token That
Token ID1320
Feature activation+1.12e-02

Pos logit contributions
 Hank+0.003
 Fiona+0.003
 Hannah+0.003
 Holly+0.003
gers+0.003
Neg logit contributions
 volcan-0.003
degree-0.003
endo-0.003
 hex-0.002
 eru-0.002
Token come
Token ID1282
Feature activation+2.73e-02

Pos logit contributions
gers+0.016
 Hank+0.015
 Fiona+0.015
 Holly+0.015
 Hannah+0.015
Neg logit contributions
 volcan-0.012
endo-0.011
degree-0.011
 hex-0.010
tr-0.010
Token and
Token ID290
Feature activation+1.63e-02

Pos logit contributions
 volcan+0.026
degree+0.024
endo+0.024
tr+0.023
 hex+0.023
Neg logit contributions
 Hank-0.046
gers-0.045
 Hannah-0.044
 therapist-0.044
 Fiona-0.043
Token play
Token ID711
Feature activation-3.28e-02

Pos logit contributions
 volcan+0.022
degree+0.022
endo+0.021
 hex+0.020
 eru+0.020
Neg logit contributions
 Fiona-0.021
 Hank-0.021
 Hannah-0.021
 Holly-0.021
gers-0.020
Token with
Token ID351
Feature activation-8.68e-03

Pos logit contributions
 Hank+0.060
scar+0.059
Matthew+0.058
 Hannah+0.057
 Fiona+0.057
Neg logit contributions
 volcan-0.024
 hex-0.022
degree-0.021
 Or-0.020
 BO-0.020
Token him
Token ID683
Feature activation-3.41e-02

Pos logit contributions
 Hank+0.013
 Fiona+0.013
gers+0.013
 Hannah+0.013
 Holly+0.013
Neg logit contributions
 volcan-0.010
degree-0.009
endo-0.009
 hex-0.009
 eru-0.008
Token.
Token ID13
Feature activation+4.36e-03

Pos logit contributions
 Hank+0.056
 Fiona+0.055
 Hannah+0.055
gers+0.054
 Woo+0.054
Neg logit contributions
 volcan-0.033
degree-0.031
tr-0.030
 hex-0.029
endo-0.029
Token Jill
Token ID22772
Feature activation+1.14e-01

Pos logit contributions
 volcan+0.007
degree+0.006
endo+0.006
pre+0.006
 eru+0.006
Neg logit contributions
 Hannah-0.005
 Holly-0.005
 Hank-0.005
 Fiona-0.005
 Wendy-0.005
Token said
Token ID531
Feature activation-2.22e-02

Pos logit contributions
 volcan+0.124
degree+0.120
endo+0.119
tr+0.115
 hex+0.113
Neg logit contributions
 Hank-0.171
gers-0.168
 Fiona-0.167
 Woo-0.164
 Hannah-0.163
Token no
Token ID645
Feature activation-5.54e-02

Pos logit contributions
gers+0.035
 Hank+0.034
 Fiona+0.034
 trainers+0.034
latable+0.033
Neg logit contributions
 volcan-0.023
degree-0.021
od-0.020
 hex-0.020
endo-0.019
Token,
Token ID11
Feature activation-4.40e-03

Pos logit contributions
 Hank+0.082
 Fiona+0.082
 Holly+0.081
 Hannah+0.081
gers+0.080
Neg logit contributions
 volcan-0.066
degree-0.063
endo-0.059
 hex-0.059
tr-0.058
Token she
Token ID673
Feature activation+3.34e-02

Pos logit contributions
gers+0.006
 Hank+0.006
 Fiona+0.006
 Holly+0.006
 Hannah+0.006
Neg logit contributions
 volcan-0.005
degree-0.005
endo-0.005
 hex-0.005
 eru-0.005
Token make
Token ID787
Feature activation-4.64e-02

Pos logit contributions
 volcan+0.081
degree+0.080
endo+0.080
 eru+0.072
 hex+0.072
Neg logit contributions
 Hank-0.061
 Hannah-0.059
 Holly-0.059
 Fiona-0.059
 Woo-0.057
Token more
Token ID517
Feature activation-3.70e-02

Pos logit contributions
 Fiona+0.047
 Holly+0.047
 Hannah+0.046
 Woo+0.045
 Hank+0.045
Neg logit contributions
tr-0.074
degree-0.073
endo-0.070
 volcan-0.069
pre-0.069
Token?"
Token ID1701
Feature activation+2.09e-03

Pos logit contributions
 Fiona+0.046
gers+0.045
 Holly+0.044
 Hannah+0.044
 trainers+0.043
Neg logit contributions
endo-0.050
degree-0.048
 volcan-0.046
tr-0.046
raper-0.044
Token Lily
Token ID20037
Feature activation+1.98e-02

Pos logit contributions
 volcan+0.003
degree+0.003
endo+0.003
tr+0.003
raper+0.003
Neg logit contributions
 Hank-0.003
 Holly-0.003
 Hannah-0.003
 Fiona-0.003
 Woo-0.003
Token smiles
Token ID21845
Feature activation-3.40e-02

Pos logit contributions
 volcan+0.027
degree+0.027
endo+0.026
tr+0.025
pre+0.025
Neg logit contributions
 Hank-0.026
 Holly-0.026
 Hannah-0.026
 Fiona-0.026
gers-0.024
Token and
Token ID290
Feature activation+8.68e-03

Pos logit contributions
 Hank+0.042
 Hannah+0.042
 Holly+0.042
 Fiona+0.041
gers+0.041
Neg logit contributions
 volcan-0.049
degree-0.046
endo-0.046
 hex-0.043
raper-0.043
Token says
Token ID1139
Feature activation+4.08e-02

Pos logit contributions
 volcan+0.013
endo+0.012
degree+0.012
tr+0.012
raper+0.012
Neg logit contributions
 Hank-0.011
 Holly-0.011
 Hannah-0.010
 Fiona-0.010
 Woo-0.010
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
 volcan+0.049
degree+0.048
endo+0.046
 hex+0.045
tr+0.043
Neg logit contributions
 Hank-0.059
 Fiona-0.058
 Hannah-0.058
 Holly-0.057
gers-0.057
Token "
Token ID366
Feature activation+1.71e-02

Pos logit contributions
 Holly+0.015
 Hannah+0.014
 Hank+0.014
 Woo+0.014
 Rover+0.014
Neg logit contributions
 volcan-0.022
degree-0.021
endo-0.021
destruct-0.021
 hex-0.020
TokenYes
Token ID5297
Feature activation-4.31e-03

Pos logit contributions
 volcan+0.030
endo+0.028
 erupt+0.026
 eru+0.026
degree+0.026
Neg logit contributions
 Holly-0.017
 Hank-0.017
gers-0.017
 Hannah-0.017
 Fiona-0.017
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
gers+0.007
 Fiona+0.007
 Hank+0.007
 trainers+0.007
 therapist+0.007
Neg logit contributions
 volcan-0.004
degree-0.004
 hex-0.003
 eru-0.003
endo-0.003
Tokenchair
Token ID16337
Feature activation-2.19e-02

Pos logit contributions
 Hannah+0.022
 Fiona+0.022
gers+0.021
 Holly+0.021
 Hank+0.021
Neg logit contributions
 volcan-0.036
endo-0.035
degree-0.034
 eru-0.033
 hex-0.033
Token,
Token ID11
Feature activation-8.23e-03

Pos logit contributions
 Hank+0.031
 Fiona+0.031
 Hannah+0.031
 Holly+0.031
gers+0.030
Neg logit contributions
 volcan-0.027
degree-0.026
endo-0.025
 hex-0.024
 eru-0.024
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
 Hank+0.012
gers+0.011
 Fiona+0.011
 Hannah+0.011
 Holly+0.011
Neg logit contributions
 volcan-0.010
degree-0.010
endo-0.010
tr-0.009
 hex-0.009
Token they
Token ID484
Feature activation+2.56e-02

Pos logit contributions
 volcan+0.021
degree+0.020
endo+0.019
tr+0.019
 hex+0.019
Neg logit contributions
 Hank-0.024
 Fiona-0.024
gers-0.024
 Hannah-0.024
 Holly-0.024
Token read
Token ID1100
Feature activation+5.91e-02

Pos logit contributions
 volcan+0.030
degree+0.028
endo+0.028
tr+0.026
 hex+0.026
Neg logit contributions
 Hank-0.038
 Fiona-0.038
 Hannah-0.038
 Holly-0.037
gers-0.037
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
 volcan+0.067
degree+0.062
endo+0.060
tr+0.060
 hex+0.058
Neg logit contributions
 Hank-0.091
 Fiona-0.088
 Hannah-0.088
 Holly-0.087
gers-0.086
Token book
Token ID1492
Feature activation+7.02e-02

Pos logit contributions
 Fiona+0.016
 Hannah+0.016
 Hank+0.016
 Holly+0.015
gers+0.015
Neg logit contributions
 volcan-0.016
degree-0.015
endo-0.015
 hex-0.014
 eru-0.014
Token together
Token ID1978
Feature activation+1.25e-01

Pos logit contributions
 volcan+0.087
endo+0.082
degree+0.081
pre+0.079
 eru+0.078
Neg logit contributions
gers-0.102
 Fiona-0.100
 Holly-0.100
 Hannah-0.097
 Hank-0.093
Token.
Token ID13
Feature activation-4.80e-04

Pos logit contributions
 volcan+0.153
degree+0.147
endo+0.143
 hex+0.137
tr+0.136
Neg logit contributions
 Hank-0.179
 Fiona-0.177
 Hannah-0.176
 Holly-0.174
gers-0.173
Token They
Token ID1119
Feature activation-3.76e-02

Pos logit contributions
 Hannah+0.001
 Holly+0.001
 Hank+0.001
 Fiona+0.001
 Woo+0.001
Neg logit contributions
 volcan-0.001
degree-0.001
endo-0.001
pre-0.001
 hex-0.001
Token laughed
Token ID13818
Feature activation+1.52e-02

Pos logit contributions
 Hank+0.053
 Fiona+0.053
 Hannah+0.052
 Holly+0.052
gers+0.052
Neg logit contributions
 volcan-0.046
degree-0.045
endo-0.044
 hex-0.042
tr-0.041
TokenMaybe
Token ID13300
Feature activation+5.88e-02

Pos logit contributions
 Hank+0.056
 Fiona+0.056
 Hannah+0.055
 Holly+0.055
gers+0.054
Neg logit contributions
 volcan-0.054
degree-0.053
endo-0.051
 hex-0.049
 eru-0.048
Token it
Token ID340
Feature activation-5.64e-02

Pos logit contributions
 volcan+0.081
degree+0.079
endo+0.078
 hex+0.074
tr+0.074
Neg logit contributions
 Hank-0.075
 Fiona-0.074
 Hannah-0.073
 Holly-0.073
gers-0.072
Token is
Token ID318
Feature activation-1.81e-02

Pos logit contributions
 Fiona+0.056
gers+0.055
 Hannah+0.053
 Holly+0.053
 Hank+0.052
Neg logit contributions
 volcan-0.089
degree-0.086
endo-0.084
raper-0.084
aph-0.083
Token a
Token ID257
Feature activation-7.89e-03

Pos logit contributions
 Hank+0.026
 Fiona+0.025
 Hannah+0.025
 Holly+0.025
gers+0.025
Neg logit contributions
 volcan-0.022
degree-0.022
endo-0.021
 hex-0.020
pre-0.020
Token monster
Token ID9234
Feature activation+9.09e-02

Pos logit contributions
 Hank+0.011
 Hannah+0.010
 Fiona+0.010
 Holly+0.010
 Woo+0.010
Neg logit contributions
endo-0.010
degree-0.010
 volcan-0.010
od-0.009
destruct-0.009
Token,"
Token ID553
Feature activation-6.71e-02

Pos logit contributions
 volcan+0.133
degree+0.130
endo+0.128
 hex+0.121
 eru+0.120
Neg logit contributions
 Hank-0.107
 Fiona-0.106
 Hannah-0.105
 Holly-0.104
gers-0.104
Token Tom
Token ID4186
Feature activation+1.44e-02

Pos logit contributions
Matthew+0.099
gers+0.099
 Buster+0.098
 trainers+0.098
 Fiona+0.097
Neg logit contributions
 volcan-0.072
degree-0.065
 eru-0.063
endo-0.061
 BO-0.060
Token says
Token ID1139
Feature activation+4.20e-02

Pos logit contributions
 volcan+0.019
degree+0.018
endo+0.018
pre+0.018
 hex+0.017
Neg logit contributions
 Fiona-0.019
 Hank-0.019
 Holly-0.019
 Hannah-0.019
gers-0.019
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 volcan+0.054
degree+0.053
endo+0.052
 hex+0.049
tr+0.048
Neg logit contributions
 Hank-0.057
 Fiona-0.057
 Hannah-0.056
 Holly-0.056
gers-0.055
Token He
Token ID679
Feature activation-2.84e-02

Pos logit contributions
 Hank+0.016
 Fiona+0.016
 Hannah+0.016
 Holly+0.016
gers+0.016
Neg logit contributions
 volcan-0.013
degree-0.012
endo-0.012
 hex-0.012
tr-0.011
Token is
Token ID318
Feature activation-1.87e-02

Pos logit contributions
 Hank+0.044
 Fiona+0.044
 Hannah+0.043
 Holly+0.043
gers+0.043
Neg logit contributions
 volcan-0.031
degree-0.030
endo-0.029
 hex-0.028
tr-0.027
Token to
Token ID284
Feature activation-5.77e-03

Pos logit contributions
 Fiona+0.049
 Hannah+0.049
 Hank+0.049
 Holly+0.048
 Woo+0.047
Neg logit contributions
degree-0.064
endo-0.061
 volcan-0.061
tr-0.059
pre-0.059
Token explore
Token ID7301
Feature activation-1.20e-01

Pos logit contributions
 Hank+0.008
 Fiona+0.008
 Hannah+0.008
 Holly+0.008
gers+0.008
Neg logit contributions
 volcan-0.007
degree-0.007
endo-0.007
 hex-0.007
tr-0.007
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
 Hank+0.208
 therapist+0.201
 Hannah+0.200
 Holly+0.198
 Fiona+0.196
Neg logit contributions
 volcan-0.130
degree-0.106
 eru-0.104
endo-0.100
 hex-0.096
Token\n
Token ID198
Feature activation+4.66e-02

Pos logit contributions
 Hank+0.006
gers+0.006
 Fiona+0.006
 Hannah+0.006
 Holly+0.005
Neg logit contributions
 volcan-0.004
degree-0.004
endo-0.004
tr-0.004
 eru-0.004
Token\n
Token ID198
Feature activation+4.65e-02

Pos logit contributions
 volcan+0.039
degree+0.037
endo+0.036
tr+0.032
 eru+0.032
Neg logit contributions
 Hank-0.084
gers-0.082
 Hannah-0.082
 Holly-0.081
 Fiona-0.081
TokenOne
Token ID3198
Feature activation-4.45e-02

Pos logit contributions
 volcan+0.053
degree+0.050
endo+0.050
tr+0.048
BO+0.044
Neg logit contributions
 Hank-0.075
 Hannah-0.070
 Oscar-0.069
 Holly-0.069
gers-0.069
Token day
Token ID1110
Feature activation+6.42e-02

Pos logit contributions
 Fiona+0.059
 Hank+0.058
 Hannah+0.058
gers+0.058
 Holly+0.057
Neg logit contributions
 volcan-0.058
degree-0.057
endo-0.057
 hex-0.053
raper-0.052
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 volcan+0.090
degree+0.088
endo+0.086
 hex+0.083
tr+0.081
Neg logit contributions
 Fiona-0.082
 Hank-0.082
 Hannah-0.082
 Holly-0.081
gers-0.078
Token Lucy
Token ID22162
Feature activation-5.24e-02

Pos logit contributions
 Hank+0.013
 Fiona+0.013
 Hannah+0.013
 Holly+0.013
gers+0.013
Neg logit contributions
 volcan-0.015
degree-0.014
endo-0.014
 hex-0.013
tr-0.013
Token was
Token ID373
Feature activation-3.26e-02

Pos logit contributions
 Hank+0.088
 Fiona+0.086
 Holly+0.086
 Hannah+0.085
gers+0.084
Neg logit contributions
 volcan-0.052
degree-0.049
endo-0.048
tr-0.045
 hex-0.045
Token feeling
Token ID4203
Feature activation-1.42e-01

Pos logit contributions
 Hank+0.044
 Fiona+0.043
gers+0.043
 Hannah+0.043
 Holly+0.043
Neg logit contributions
 volcan-0.043
degree-0.041
endo-0.040
tr-0.038
 hex-0.038
Token.
Token ID13
Feature activation-2.65e-04

Pos logit contributions
endo+0.034
degree+0.033
 volcan+0.032
destruct+0.031
tr+0.031
Neg logit contributions
 Fiona-0.023
 Hank-0.023
 Hannah-0.022
gers-0.022
 Holly-0.022
Token It
Token ID632
Feature activation-3.57e-02

Pos logit contributions
 Hank+0.000
 Fiona+0.000
 Hannah+0.000
 Holly+0.000
gers+0.000
Neg logit contributions
 volcan-0.000
degree-0.000
endo-0.000
 hex-0.000
tr-0.000
Token was
Token ID373
Feature activation-2.28e-02

Pos logit contributions
 Hank+0.046
 Fiona+0.045
 Hannah+0.045
 Holly+0.044
gers+0.044
Neg logit contributions
 volcan-0.050
degree-0.048
endo-0.047
 hex-0.045
tr-0.044
Token Mom
Token ID11254
Feature activation+2.90e-02

Pos logit contributions
 Hank+0.035
 Fiona+0.034
gers+0.034
 Hannah+0.034
 Holly+0.034
Neg logit contributions
 volcan-0.027
degree-0.025
endo-0.024
 hex-0.023
tr-0.023
Token.
Token ID13
Feature activation+6.48e-04

Pos logit contributions
 volcan+0.023
 erupt+0.020
degree+0.019
tr+0.019
 ding+0.019
Neg logit contributions
gers-0.056
 Hank-0.055
 Holly-0.054
 trainers-0.053
 Fiona-0.053
Token She
Token ID1375
Feature activation-3.27e-02

Pos logit contributions
 volcan+0.001
degree+0.001
endo+0.001
 hex+0.001
tr+0.001
Neg logit contributions
 Hank-0.001
 Fiona-0.001
 Hannah-0.001
 Holly-0.001
gers-0.001
Token came
Token ID1625
Feature activation+1.10e-01

Pos logit contributions
 Hank+0.049
 Fiona+0.048
 Hannah+0.048
 Holly+0.048
gers+0.047
Neg logit contributions
 volcan-0.038
degree-0.037
endo-0.036
 hex-0.034
tr-0.034
Token into
Token ID656
Feature activation+8.62e-02

Pos logit contributions
 volcan+0.108
endo+0.098
degree+0.097
 eru+0.089
 hex+0.088
Neg logit contributions
 Hank-0.186
gers-0.184
 Hannah-0.180
 therapist-0.179
 Holly-0.179
Token the
Token ID262
Feature activation-5.89e-03

Pos logit contributions
 volcan+0.103
degree+0.093
 hex+0.092
endo+0.091
 eru+0.090
Neg logit contributions
 Hank-0.128
gers-0.127
 Fiona-0.125
 Holly-0.123
 Hannah-0.123
Token kitchen
Token ID9592
Feature activation-3.92e-02

Pos logit contributions
 Hannah+0.007
 Hank+0.006
gers+0.006
 Fiona+0.006
 Holly+0.006
Neg logit contributions
degree-0.009
 volcan-0.009
endo-0.009
aph-0.008
 hex-0.008
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
 Hank+0.071
 Fiona+0.070
 Holly+0.068
 Hannah+0.068
gers+0.068
Neg logit contributions
 volcan-0.037
degree-0.030
tr-0.030
 erupt-0.030
endo-0.029
Token the
Token ID262
Feature activation-2.12e-02

Pos logit contributions
 Hank+0.037
 Fiona+0.036
 Hannah+0.036
 Holly+0.036
gers+0.036
Neg logit contributions
 volcan-0.034
degree-0.033
endo-0.032
 hex-0.031
tr-0.030
Token ground
Token ID2323
Feature activation+1.18e-02

Pos logit contributions
 Hank+0.026
gers+0.025
 Woo+0.025
 Hannah+0.024
 Buster+0.024
Neg logit contributions
endo-0.029
 volcan-0.028
degree-0.028
aph-0.027
raper-0.027
Token.
Token ID13
Feature activation-9.70e-03

Pos logit contributions
 volcan+0.015
degree+0.015
endo+0.015
 hex+0.014
 eru+0.014
Neg logit contributions
 Hank-0.016
 Fiona-0.016
 Hannah-0.016
 Holly-0.015
gers-0.015
Token He
Token ID679
Feature activation-3.01e-02

Pos logit contributions
 Hank+0.012
 Fiona+0.012
 Holly+0.012
 Hannah+0.012
 Woo+0.011
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.014
tr-0.013
raper-0.013
Token reaches
Token ID12229
Feature activation+1.78e-01

Pos logit contributions
 Hank+0.045
 Fiona+0.045
 Hannah+0.044
 Holly+0.044
gers+0.044
Neg logit contributions
 volcan-0.035
degree-0.034
endo-0.032
 hex-0.031
tr-0.030
Token for
Token ID329
Feature activation-6.53e-02

Pos logit contributions
 volcan+0.253
degree+0.252
endo+0.246
pre+0.237
 hex+0.231
Neg logit contributions
 Fiona-0.220
 Hank-0.218
 Hannah-0.215
 Holly-0.214
gers-0.206
Token the
Token ID262
Feature activation-2.09e-02

Pos logit contributions
 Hank+0.090
 Fiona+0.089
 Hannah+0.089
 Holly+0.088
gers+0.087
Neg logit contributions
 volcan-0.084
degree-0.081
endo-0.079
 hex-0.075
tr-0.074
Token je
Token ID11223
Feature activation+1.18e-01

Pos logit contributions
 Hank+0.031
gers+0.029
 Hannah+0.029
 Woo+0.029
 Fiona+0.028
Neg logit contributions
endo-0.025
degree-0.023
 volcan-0.023
aph-0.021
raper-0.021
Tokenep
Token ID538
Feature activation+1.20e-01

Pos logit contributions
 volcan+0.157
endo+0.147
 eru+0.143
 hex+0.141
degree+0.136
Neg logit contributions
gers-0.154
Matthew-0.142
 trainers-0.139
 Hank-0.139
 Woo-0.135
Token.
Token ID13
Feature activation-9.79e-03

Pos logit contributions
 volcan+0.173
endo+0.168
degree+0.168
 eru+0.158
 hex+0.157
Neg logit contributions
 Hank-0.143
 Fiona-0.140
gers-0.140
 Hannah-0.139
 Holly-0.137
Token But
Token ID887
Feature activation+3.46e-02

Pos logit contributions
 Hank+0.012
 Holly+0.012
 Fiona+0.012
 Hannah+0.012
 Woo+0.012
Neg logit contributions
 volcan-0.014
degree-0.014
endo-0.014
raper-0.013
pre-0.013
Token happened
Token ID3022
Feature activation+1.25e-01

Pos logit contributions
 Fiona+0.098
 Hannah+0.098
 Hank+0.098
 Holly+0.097
gers+0.096
Neg logit contributions
endo-0.085
 volcan-0.084
degree-0.083
tr-0.081
raper-0.077
Token.
Token ID13
Feature activation+7.37e-03

Pos logit contributions
 volcan+0.098
od+0.084
degree+0.083
 Or+0.079
endo+0.076
Neg logit contributions
 Hank-0.238
 Fiona-0.233
 Holly-0.231
 Hannah-0.231
 Otto-0.229
Token Finally
Token ID9461
Feature activation+8.92e-02

Pos logit contributions
 volcan+0.012
degree+0.011
endo+0.011
 hex+0.011
tr+0.011
Neg logit contributions
 Fiona-0.008
 Hank-0.008
 Hannah-0.008
 Holly-0.008
gers-0.008
Token,
Token ID11
Feature activation-2.77e-03

Pos logit contributions
 volcan+0.089
degree+0.079
 eru+0.077
endo+0.076
 hex+0.074
Neg logit contributions
 Hank-0.145
gers-0.143
 trainers-0.140
Matthew-0.137
 therapist-0.137
Token everyone
Token ID2506
Feature activation+7.73e-02

Pos logit contributions
 Hank+0.003
 Fiona+0.003
 Hannah+0.003
 Holly+0.003
gers+0.003
Neg logit contributions
 volcan-0.004
degree-0.004
endo-0.004
 hex-0.004
tr-0.004
Token was
Token ID373
Feature activation-2.31e-02

Pos logit contributions
 volcan+0.090
degree+0.085
endo+0.082
 hex+0.079
 eru+0.079
Neg logit contributions
 Hank-0.116
gers-0.114
 Woo-0.114
 Hannah-0.113
 Holly-0.112
Token talking
Token ID3375
Feature activation+2.31e-02

Pos logit contributions
gers+0.039
 Hank+0.038
 Woo+0.037
 Hannah+0.037
 Holly+0.037
Neg logit contributions
 volcan-0.024
 erupt-0.022
degree-0.022
endo-0.021
od-0.020
Token about
Token ID546
Feature activation-5.75e-02

Pos logit contributions
 volcan+0.025
degree+0.024
tr+0.022
endo+0.022
 hex+0.022
Neg logit contributions
 Hank-0.036
 Hannah-0.035
 Fiona-0.035
 Holly-0.035
gers-0.035
Token Daisy
Token ID40355
Feature activation-8.41e-02

Pos logit contributions
 Hank+0.087
gers+0.086
 Hannah+0.085
 Fiona+0.085
 Holly+0.085
Neg logit contributions
 volcan-0.067
degree-0.061
endo-0.061
 hex-0.058
 eru-0.057
Token and
Token ID290
Feature activation+1.88e-02

Pos logit contributions
 Hank+0.152
gers+0.150
 Woo+0.148
 trainers+0.146
Matthew+0.142
Neg logit contributions
 volcan-0.068
od-0.063
tr-0.058
 erupt-0.057
 hex-0.057
Token her
Token ID607
Feature activation+4.08e-02

Pos logit contributions
 volcan+0.023
degree+0.021
endo+0.021
 hex+0.019
 eru+0.019
Neg logit contributions
gers-0.027
 Hank-0.027
 Hannah-0.026
 Fiona-0.026
 Holly-0.026
Token much
Token ID881
Feature activation+6.33e-02

Pos logit contributions
 volcan+0.004
degree+0.004
tr+0.004
endo+0.004
raper+0.004
Neg logit contributions
 Hank-0.005
gers-0.005
 Fiona-0.005
 Holly-0.005
 Hannah-0.005
Token fun
Token ID1257
Feature activation+7.20e-03

Pos logit contributions
 volcan+0.074
degree+0.071
endo+0.069
 hex+0.066
 eru+0.065
Neg logit contributions
 Hank-0.094
 Fiona-0.094
 Hannah-0.093
 Holly-0.093
gers-0.091
Token!
Token ID0
Feature activation-2.51e-02

Pos logit contributions
 volcan+0.009
degree+0.008
endo+0.008
 hex+0.007
 eru+0.007
Neg logit contributions
 Hank-0.011
 Hannah-0.011
 Holly-0.011
 Fiona-0.011
gers-0.010
Token\n
Token ID198
Feature activation+5.00e-02

Pos logit contributions
gers+0.043
 Fiona+0.041
Matthew+0.041
 trainers+0.040
 shepherd+0.040
Neg logit contributions
endo-0.022
 volcan-0.020
degree-0.019
 BO-0.018
 Amid-0.018
Token\n
Token ID198
Feature activation+4.97e-02

Pos logit contributions
endo+0.023
 volcan+0.022
degree+0.019
 eru+0.017
 Or+0.016
Neg logit contributions
gers-0.107
 trainers-0.104
 therapist-0.102
 Hank-0.101
 Hannah-0.101
TokenJimmy
Token ID40335
Feature activation-2.46e-01

Pos logit contributions
endo+0.037
degree+0.035
BO+0.034
ast+0.034
raper+0.032
Neg logit contributions
 Fiona-0.095
 Wendy-0.093
 Holly-0.087
 Hannah-0.086
gers-0.086
Token's
Token ID338
Feature activation-1.14e-02

Pos logit contributions
 Fiona+0.404
gers+0.400
 Hank+0.398
 Holly+0.397
 Hannah+0.397
Neg logit contributions
 volcan-0.239
endo-0.227
degree-0.224
 hex-0.214
ast-0.206
Token exploring
Token ID13504
Feature activation-1.19e-01

Pos logit contributions
 Fiona+0.018
 Hank+0.017
 Hannah+0.017
 Holly+0.017
gers+0.017
Neg logit contributions
 volcan-0.013
degree-0.012
endo-0.012
 hex-0.011
tr-0.011
Token didn
Token ID1422
Feature activation+6.27e-02

Pos logit contributions
 Hannah+0.207
 Hank+0.207
 therapist+0.204
 Holly+0.203
 Woo+0.203
Neg logit contributions
 volcan-0.115
 eru-0.091
degree-0.091
 hex-0.089
endo-0.087
Token't
Token ID470
Feature activation+6.14e-03

Pos logit contributions
 volcan+0.075
degree+0.073
endo+0.072
 hex+0.069
tr+0.067
Neg logit contributions
 Hank-0.090
 Fiona-0.089
 Hannah-0.089
gers-0.089
 Holly-0.088
Token stop
Token ID2245
Feature activation+1.07e-01

Pos logit contributions
 volcan+0.008
degree+0.008
endo+0.008
tr+0.008
pre+0.008
Neg logit contributions
 Hank-0.008
 Fiona-0.008
 Hannah-0.008
 Holly-0.008
gers-0.008
Token Tom
Token ID4186
Feature activation+1.96e-02

Pos logit contributions
 volcan+0.102
degree+0.101
endo+0.095
pre+0.095
raper+0.094
Neg logit contributions
 Hannah-0.075
 Hank-0.075
 Holly-0.072
 Woo-0.072
gers-0.070
Token to
Token ID284
Feature activation-8.71e-03

Pos logit contributions
 volcan+0.027
degree+0.026
endo+0.026
 hex+0.025
 eru+0.025
Neg logit contributions
 Hannah-0.025
 Fiona-0.025
 Hank-0.024
 Holly-0.024
gers-0.024
Token his
Token ID465
Feature activation+2.76e-02

Pos logit contributions
gers+0.015
 therapist+0.014
 Fiona+0.014
 Hank+0.014
 Holly+0.014
Neg logit contributions
 volcan-0.009
endo-0.008
 hex-0.008
degree-0.008
 eru-0.008
Token bed
Token ID3996
Feature activation-1.27e-01

Pos logit contributions
 volcan+0.034
degree+0.032
endo+0.032
tr+0.031
 hex+0.030
Neg logit contributions
 Hank-0.040
 Fiona-0.039
 Hannah-0.039
 Holly-0.038
gers-0.038
Token and
Token ID290
Feature activation+8.57e-03

Pos logit contributions
 Hank+0.195
 Fiona+0.192
 Hannah+0.189
 Holly+0.188
gers+0.185
Neg logit contributions
 volcan-0.142
degree-0.137
endo-0.132
tr-0.128
 hex-0.126
Token tucked
Token ID29779
Feature activation-2.25e-01

Pos logit contributions
 volcan+0.012
degree+0.011
endo+0.011
 hex+0.011
pre+0.011
Neg logit contributions
 Hannah-0.011
 Fiona-0.011
 Hank-0.011
 Holly-0.011
gers-0.011
Token him
Token ID683
Feature activation-3.73e-02

Pos logit contributions
 Hank+0.296
 Fiona+0.292
 Hannah+0.291
 Holly+0.288
gers+0.285
Neg logit contributions
 volcan-0.306
degree-0.297
endo-0.289
 hex-0.278
tr-0.274
Token in
Token ID287
Feature activation+4.49e-02

Pos logit contributions
gers+0.046
 Holly+0.045
 Hannah+0.045
 Fiona+0.045
 Hank+0.044
Neg logit contributions
 volcan-0.053
 hex-0.051
degree-0.051
endo-0.049
 eru-0.048
Token.
Token ID13
Feature activation-2.85e-03

Pos logit contributions
 volcan+0.039
 eru+0.032
endo+0.032
 hex+0.032
degree+0.030
Neg logit contributions
 Hank-0.081
gers-0.080
 Fiona-0.080
scar-0.079
Matthew-0.078
Token They
Token ID1119
Feature activation-4.65e-02

Pos logit contributions
 Hank+0.004
 Fiona+0.004
 Hannah+0.004
 Holly+0.004
gers+0.004
Neg logit contributions
 volcan-0.004
degree-0.004
endo-0.003
 hex-0.003
tr-0.003
Token kissed
Token ID27946
Feature activation-8.74e-02

Pos logit contributions
 Hank+0.067
 Fiona+0.066
 Hannah+0.066
 Holly+0.065
gers+0.065
Neg logit contributions
 volcan-0.056
degree-0.054
endo-0.053
tr-0.051
 hex-0.050
Token was
Token ID373
Feature activation-3.57e-02

Pos logit contributions
 Hank+0.118
gers+0.118
 Hannah+0.118
 Fiona+0.118
 Holly+0.118
Neg logit contributions
 volcan-0.091
endo-0.086
degree-0.084
 eru-0.080
tr-0.079
Token a
Token ID257
Feature activation-9.89e-03

Pos logit contributions
 Hank+0.054
 Fiona+0.053
 Hannah+0.053
 Holly+0.053
gers+0.052
Neg logit contributions
 volcan-0.041
degree-0.039
endo-0.039
 hex-0.037
tr-0.036
Token lovely
Token ID14081
Feature activation-2.53e-02

Pos logit contributions
 Hank+0.013
 Hannah+0.013
 Holly+0.012
 Fiona+0.012
 Woo+0.012
Neg logit contributions
degree-0.014
 volcan-0.013
endo-0.013
destruct-0.012
tr-0.012
Token bear
Token ID6842
Feature activation+2.35e-03

Pos logit contributions
 Hank+0.036
 Hannah+0.035
 Fiona+0.035
 Holly+0.035
gers+0.034
Neg logit contributions
 volcan-0.032
degree-0.031
endo-0.030
 hex-0.028
tr-0.028
Token named
Token ID3706
Feature activation-3.45e-02

Pos logit contributions
 volcan+0.003
degree+0.003
endo+0.003
 hex+0.003
 eru+0.003
Neg logit contributions
 Hank-0.003
 Fiona-0.003
 Hannah-0.003
gers-0.003
 Holly-0.003
Token Oscar
Token ID15694
Feature activation-3.12e-01

Pos logit contributions
 Hank+0.053
gers+0.052
 Fiona+0.052
 Hannah+0.052
 Holly+0.052
Neg logit contributions
 volcan-0.039
degree-0.037
endo-0.036
 hex-0.034
tr-0.034
Token.
Token ID13
Feature activation-2.23e-03

Pos logit contributions
 Hank+0.535
gers+0.535
 Fiona+0.534
 Hannah+0.532
 trainers+0.532
Neg logit contributions
 volcan-0.300
degree-0.256
endo-0.252
 hex-0.241
pre-0.241
Token Oscar
Token ID15694
Feature activation-3.12e-01

Pos logit contributions
 therapist+0.004
 Hanna+0.004
 versa+0.004
 Hannah+0.004
 convince+0.004
Neg logit contributions
 Pretty-0.001
 At-0.001
 BO-0.001
 Ding-0.001
 O-0.001
Token was
Token ID373
Feature activation-3.69e-02

Pos logit contributions
 Hank+0.457
 Fiona+0.456
 Hannah+0.449
 Holly+0.449
gers+0.444
Neg logit contributions
 volcan-0.373
endo-0.348
degree-0.343
tr-0.330
 hex-0.329
Token always
Token ID1464
Feature activation-7.81e-02

Pos logit contributions
gers+0.065
 Hank+0.065
 Fiona+0.064
 Hannah+0.063
 Holly+0.063
Neg logit contributions
 volcan-0.033
endo-0.031
degree-0.031
 hex-0.029
tr-0.028
Token a
Token ID257
Feature activation-1.10e-02

Pos logit contributions
 Hank+0.115
gers+0.115
 Fiona+0.114
 Hannah+0.113
 Holly+0.112
Neg logit contributions
 volcan-0.090
degree-0.086
endo-0.084
 hex-0.080
tr-0.077
Token about
Token ID546
Feature activation-6.62e-02

Pos logit contributions
 volcan+0.145
degree+0.140
endo+0.136
 hex+0.132
tr+0.130
Neg logit contributions
 Hank-0.135
 Fiona-0.133
 Hannah-0.132
 Holly-0.131
gers-0.128
Token we
Token ID356
Feature activation+5.64e-02

Pos logit contributions
gers+0.104
 Hank+0.102
 Fiona+0.101
 Holly+0.101
 Hannah+0.100
Neg logit contributions
 volcan-0.076
degree-0.068
endo-0.067
 hex-0.065
tr-0.063
Token make
Token ID787
Feature activation-4.13e-02

Pos logit contributions
 volcan+0.086
degree+0.084
endo+0.082
 hex+0.078
tr+0.077
Neg logit contributions
 Hank-0.065
 Hannah-0.063
 Fiona-0.063
 Holly-0.063
gers-0.061
Token him
Token ID683
Feature activation-2.68e-02

Pos logit contributions
 Hank+0.057
 Fiona+0.057
 Hannah+0.056
 Holly+0.056
gers+0.056
Neg logit contributions
 volcan-0.053
degree-0.051
endo-0.050
 hex-0.048
tr-0.047
Token a
Token ID257
Feature activation+2.06e-03

Pos logit contributions
 Hank+0.038
 Fiona+0.037
 Hannah+0.037
 Holly+0.037
gers+0.036
Neg logit contributions
 volcan-0.034
degree-0.032
endo-0.032
 hex-0.030
tr-0.030
Token sweater
Token ID35685
Feature activation-2.19e-01

Pos logit contributions
 volcan+0.003
degree+0.003
endo+0.003
destruct+0.002
pre+0.002
Neg logit contributions
 Hank-0.003
 Hannah-0.003
 Fiona-0.003
 Holly-0.003
gers-0.003
Token too
Token ID1165
Feature activation-1.85e-02

Pos logit contributions
 Hank+0.325
 Fiona+0.323
 Hannah+0.317
 Holly+0.316
gers+0.310
Neg logit contributions
 volcan-0.254
degree-0.248
tr-0.238
endo-0.237
 hex-0.233
Token?
Token ID30
Feature activation+6.24e-02

Pos logit contributions
 Hank+0.030
 Fiona+0.030
 Holly+0.030
 Hannah+0.029
gers+0.029
Neg logit contributions
 volcan-0.020
degree-0.018
endo-0.018
tr-0.017
 hex-0.017
Token A
Token ID317
Feature activation+6.68e-02

Pos logit contributions
 volcan+0.073
degree+0.070
endo+0.068
 hex+0.065
 eru+0.064
Neg logit contributions
 Hank-0.094
 Hannah-0.092
 Fiona-0.092
 Holly-0.091
gers-0.089
Token very
Token ID845
Feature activation+8.31e-02

Pos logit contributions
tr+0.077
degree+0.074
 hex+0.071
 volcan+0.071
endo+0.067
Neg logit contributions
 Hank-0.100
gers-0.095
 Fiona-0.092
 Holly-0.090
 Ned-0.090
Token it
Token ID340
Feature activation-4.96e-02

Pos logit contributions
 volcan+0.101
degree+0.100
endo+0.096
 hex+0.091
tr+0.090
Neg logit contributions
 Hank-0.119
 Fiona-0.118
 Hannah-0.117
 Holly-0.116
gers-0.113
Token John
Token ID1757
Feature activation-1.08e-02

Pos logit contributions
 volcan+0.002
degree+0.002
endo+0.002
 hex+0.002
tr+0.002
Neg logit contributions
 Hank-0.002
 Fiona-0.002
 Hannah-0.002
gers-0.002
 Holly-0.002
Token's
Token ID338
Feature activation-6.76e-03

Pos logit contributions
gers+0.018
 Hank+0.018
 Fiona+0.018
 Holly+0.017
 Woo+0.017
Neg logit contributions
endo-0.009
 volcan-0.009
degree-0.009
 pressed-0.009
tr-0.008
Token mom
Token ID1995
Feature activation+1.62e-02

Pos logit contributions
 Fiona+0.013
 Hank+0.012
 Holly+0.012
 Das+0.012
 Nora+0.012
Neg logit contributions
tr-0.004
 tower-0.003
 queen-0.003
 structure-0.003
 Or-0.003
Token held
Token ID2714
Feature activation-5.60e-02

Pos logit contributions
tr+0.014
 volcan+0.013
 pressed+0.013
degree+0.013
 pr+0.013
Neg logit contributions
gers-0.028
 Hank-0.027
 Fiona-0.027
 Woo-0.026
 Holly-0.026
Token the
Token ID262
Feature activation-4.09e-03

Pos logit contributions
gers+0.100
 Hank+0.098
 therapist+0.098
scar+0.097
latable+0.096
Neg logit contributions
 volcan-0.038
endo-0.037
degree-0.037
 bricks-0.031
 BO-0.031
Token leash
Token ID40529
Feature activation-2.35e-01

Pos logit contributions
 Ned+0.006
iley+0.006
 Randy+0.006
 Nora+0.006
 Fiona+0.006
Neg logit contributions
tr-0.004
 bell-0.004
 stops-0.003
degree-0.003
 sc-0.003
Token and
Token ID290
Feature activation+1.80e-02

Pos logit contributions
 Hank+0.372
 Fiona+0.370
 Holly+0.363
 Hannah+0.362
gers+0.352
Neg logit contributions
 volcan-0.237
degree-0.237
tr-0.233
endo-0.218
pre-0.207
Token lead
Token ID1085
Feature activation-9.56e-02

Pos logit contributions
 volcan+0.020
degree+0.019
endo+0.018
tr+0.017
 eru+0.017
Neg logit contributions
gers-0.027
 Hank-0.026
 Holly-0.026
 Fiona-0.026
 Hannah-0.025
Token Daisy
Token ID40355
Feature activation-8.59e-02

Pos logit contributions
gers+0.150
 trainers+0.147
 Hank+0.147
 therapist+0.145
 Woo+0.140
Neg logit contributions
 volcan-0.100
degree-0.089
endo-0.089
 eru-0.079
 erupt-0.078
Token to
Token ID284
Feature activation-2.68e-03

Pos logit contributions
 Hank+0.148
gers+0.145
 Hannah+0.140
 Fiona+0.140
 Holly+0.139
Neg logit contributions
 volcan-0.074
degree-0.074
tr-0.073
endo-0.067
 clim-0.065
Token the
Token ID262
Feature activation-5.28e-03

Pos logit contributions
gers+0.005
 therapist+0.004
 Hank+0.004
scar+0.004
 Holly+0.004
Neg logit contributions
 volcan-0.003
endo-0.002
degree-0.002
 clim-0.002
 hex-0.002