TOP ACTIVATIONS
MAX = 0.100

. They high - five each other and smile
with them . It w agged its tail and jumped up
. The 3 - year old boy was very
It saw them and w agged its tail . It l
nice ." The dog w agged his tail and l icked
. The weird - looking man said goodbye to
. Max w agged his tail . He did
The dog w agged its tail . It liked
Tim and Mia and w agged his tail . He wanted
back at her and w agged his tail . He came
play more . He w agged his tail and l icked
Max bark ed and w agged his tail . He knew
his mind . He w agged his tail and bark ed
It bark ed and w agged its tail . It wanted
Max bark ed and w agged his tail . He knew
the ball . Rex w agged his tail and ran to
not listen . He w agged his tail and kept playing
Max w agged his tail and ran around
dirty . The dog w agged its tail and l icked
dog bark ed and w agged its tail in response ,

MIN ACTIVATIONS
MIN = -0.103

the sugar would not bud ge . All it did all
rest of the day to ge h ter and , when
trunk and wouldn 't bud ge . The squirrel was hungry
but it wouldn 't bud ge . She asked her mom
but it wouldn 't bud ge . Then ,
but it wouldn 't bud ge . Tim my tried and
â t bud ge . â
the screw wouldn 't bud ge . Suddenly ,
but it wouldn 't bud ge . Tim my said ,
â t bud ge . She tried to unt
cart just wouldn 't bud ge . He pushed and he
, they wouldn 't bud ge ! Tim my realized he
but it wouldn 't bud ge . His mom saw him
but it wouldn 't bud ge . Suddenly
it still wouldn 't bud ge . After
and he couldn 't bud ge it . He was so
but it wouldn 't bud ge . Suddenly , he remembered
but it wouldn 't bud ge . Tim
but it wouldn 't bud ge . Tim my was sad
but it wouldn 't bud ge . L

INTERVAL 0.049 - 0.100
CONTAINS 0.088%

John , a 3 - year - old boy ,
said he wanted a band - aid . His
The moral of this story is that we should always
The moral of the story is that it 's important
she said , " Y ay , pancakes !" They went

INTERVAL -0.001 - 0.049
CONTAINS 56.777%

. It was a sm elly fish , but he liked
you can do it ! Now run along and have fun
ily and Ben were going on a trip with Mom and
for a walk . The sky was blue and the sun
so sad and she said that she had made a promise

INTERVAL -0.052 - -0.001
CONTAINS 43.121%

loved to play with his toy helicopter . He liked to
do it . But then she realized that it was okay
it worked . Every time she put a small stone inside
said , " Yes , let 's do that !"
her room . She was having so much fun ! She

INTERVAL -0.103 - -0.052
CONTAINS 0.014%

in the warm , so apy water . She spl ashed
steer the boat with a remote control . The boat is
too late the little girl was
the brush it wasn 't quite
out the towels and so apy water . Little one brushed
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 swollen+0.044
ge+0.043
 stret+0.041
 sy+0.038
 prope+0.037
Neg logit contributions
uck-0.024
bye-0.024
arn-0.020
iddle-0.019
enge-0.019
Token
Token ID198
Feature activation+8.92e-03

Pos logit contributions
bye+0.019
uck+0.018
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.026
 swollen-0.025
 stret-0.025
 sy-0.023
 stuffed-0.021
Token
Token ID198
Feature activation+6.78e-03

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
arn+0.012
aph+0.012
Neg logit contributions
ge-0.018
 swollen-0.018
 stret-0.018
 sy-0.016
 stuffed-0.015
TokenThey
Token ID2990
Feature activation+2.17e-02

Pos logit contributions
bye+0.012
uck+0.011
aph+0.010
enge+0.010
arn+0.010
Neg logit contributions
ge-0.013
 swollen-0.012
 stret-0.012
 sy-0.012
 stuffed-0.011
Token high
Token ID1029
Feature activation-1.36e-03

Pos logit contributions
bye+0.039
uck+0.036
enge+0.034
aph+0.033
arn+0.033
Neg logit contributions
ge-0.039
 swollen-0.038
 stret-0.038
 sy-0.034
 stuffed-0.030
Token-
Token ID12
Feature activation+9.22e-02

Pos logit contributions
ge+0.003
 stret+0.003
 swollen+0.003
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.002
arn-0.002
enge-0.002
aph-0.002
Tokenfive
Token ID13261
Feature activation-1.21e-02

Pos logit contributions
bye+0.151
uck+0.142
enge+0.132
arn+0.131
aph+0.129
Neg logit contributions
ge-0.180
 stret-0.172
 swollen-0.171
 sy-0.155
 stuffed-0.138
Token each
Token ID1123
Feature activation+1.05e-02

Pos logit contributions
ge+0.025
 stret+0.024
 swollen+0.024
 sy+0.022
 stuffed+0.021
Neg logit contributions
bye-0.018
uck-0.017
enge-0.016
aph-0.016
arn-0.016
Token other
Token ID584
Feature activation+2.98e-03

Pos logit contributions
bye+0.017
uck+0.016
enge+0.014
aph+0.014
arn+0.014
Neg logit contributions
ge-0.021
 swollen-0.020
 stret-0.020
 sy-0.019
 stuffed-0.016
Token and
Token ID290
Feature activation-9.06e-04

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
arn+0.004
aph+0.004
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.006
 stuffed-0.005
Token smile
Token ID8212
Feature activation-5.98e-03

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token with
Token ID351
Feature activation-4.12e-03

Pos logit contributions
ge+0.020
 stret+0.020
 swollen+0.019
 sy+0.018
 stuffed+0.016
Neg logit contributions
bye-0.016
uck-0.014
enge-0.013
arn-0.013
aph-0.013
Token them
Token ID606
Feature activation-1.07e-02

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.006
 stuffed+0.006
Neg logit contributions
bye-0.008
uck-0.007
enge-0.007
aph-0.007
arn-0.007
Token.
Token ID13
Feature activation+4.06e-03

Pos logit contributions
ge+0.022
 swollen+0.021
 stret+0.021
 sy+0.020
 stuffed+0.018
Neg logit contributions
bye-0.017
uck-0.015
enge-0.014
arn-0.014
aph-0.014
Token It
Token ID632
Feature activation+6.25e-03

Pos logit contributions
bye+0.008
uck+0.007
enge+0.007
iddle+0.007
aph+0.007
Neg logit contributions
 stret-0.006
 swollen-0.006
 sy-0.006
ge-0.006
Are-0.005
Token w
Token ID266
Feature activation+2.58e-04

Pos logit contributions
bye+0.010
uck+0.009
enge+0.008
aph+0.008
arn+0.008
Neg logit contributions
ge-0.013
 stret-0.012
 swollen-0.012
 sy-0.011
 flashing-0.010
Tokenagged
Token ID14655
Feature activation+9.20e-02

Pos logit contributions
bye+0.000
uck+0.000
aph+0.000
arn+0.000
enge+0.000
Neg logit contributions
 stret-0.001
 swollen-0.001
ge-0.001
 widened-0.001
 sy-0.001
Token its
Token ID663
Feature activation-1.08e-02

Pos logit contributions
bye+0.196
uck+0.192
arn+0.184
anks+0.182
aph+0.180
Neg logit contributions
 swollen-0.122
ge-0.121
 stret-0.110
 stuffed-0.095
 sy-0.094
Token tail
Token ID7894
Feature activation+4.59e-03

Pos logit contributions
ge+0.022
 stret+0.021
 swollen+0.020
 sy+0.019
 Kel+0.016
Neg logit contributions
bye-0.018
uck-0.016
enge-0.015
aph-0.015
arn-0.015
Token and
Token ID290
Feature activation-7.23e-04

Pos logit contributions
bye+0.008
uck+0.007
enge+0.007
aph+0.006
arn+0.006
Neg logit contributions
ge-0.009
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.007
Token jumped
Token ID11687
Feature activation+9.47e-03

Pos logit contributions
 swollen+0.001
ge+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
arn-0.001
aph-0.001
Token up
Token ID510
Feature activation+2.60e-03

Pos logit contributions
bye+0.016
uck+0.014
enge+0.014
arn+0.014
aph+0.013
Neg logit contributions
ge-0.018
 stret-0.018
 swollen-0.017
 sy-0.016
 stuffed-0.015
Token.
Token ID13
Feature activation+9.85e-04

Pos logit contributions
bye+0.019
uck+0.017
enge+0.015
aph+0.015
arn+0.015
Neg logit contributions
ge-0.026
 swollen-0.025
 stret-0.025
 sy-0.023
 stuffed-0.021
Token
Token ID198
Feature activation+3.59e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
arn+0.001
aph+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Token
Token ID198
Feature activation+3.85e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.005
aph+0.005
arn+0.005
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.007
 stuffed-0.006
TokenThe
Token ID464
Feature activation+8.89e-03

Pos logit contributions
bye+0.007
uck+0.006
aph+0.006
enge+0.006
arn+0.006
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.007
 stuffed-0.006
Token 3
Token ID513
Feature activation+9.51e-03

Pos logit contributions
bye+0.015
uck+0.014
enge+0.013
arn+0.013
anks+0.012
Neg logit contributions
 swollen-0.017
 stret-0.016
ge-0.016
 sy-0.015
 stuffed-0.014
Token-
Token ID12
Feature activation+9.16e-02

Pos logit contributions
bye+0.015
uck+0.014
enge+0.013
aph+0.013
arn+0.013
Neg logit contributions
ge-0.019
 swollen-0.018
 stret-0.018
 sy-0.016
 stuffed-0.015
Tokenyear
Token ID1941
Feature activation+5.75e-02

Pos logit contributions
bye+0.127
uck+0.116
enge+0.107
aph+0.104
arn+0.104
Neg logit contributions
ge-0.201
 swollen-0.194
 stret-0.190
 sy-0.177
 stuffed-0.160
Token old
Token ID1468
Feature activation-1.93e-03

Pos logit contributions
enge+0.085
bye+0.084
uck+0.083
aph+0.079
arn+0.079
Neg logit contributions
ge-0.108
 swollen-0.104
 stret-0.100
 sy-0.094
lest-0.088
Token boy
Token ID2933
Feature activation+8.66e-03

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.003
Neg logit contributions
bye-0.003
uck-0.003
enge-0.002
aph-0.002
arn-0.002
Token was
Token ID373
Feature activation+2.95e-03

Pos logit contributions
bye+0.016
uck+0.015
enge+0.014
arn+0.014
aph+0.014
Neg logit contributions
ge-0.015
 swollen-0.015
 stret-0.014
 sy-0.013
 stuffed-0.011
Token very
Token ID845
Feature activation+7.13e-03

Pos logit contributions
bye+0.005
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.005
 stuffed-0.005
Token It
Token ID632
Feature activation+8.00e-03

Pos logit contributions
bye+0.010
uck+0.009
enge+0.009
aph+0.008
arn+0.008
Neg logit contributions
 swollen-0.012
 stret-0.012
ge-0.011
 sy-0.011
 widened-0.010
Token saw
Token ID2497
Feature activation+1.99e-02

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
aph+0.011
arn+0.011
Neg logit contributions
ge-0.015
 stret-0.015
 swollen-0.014
 sy-0.013
 Kel-0.011
Token them
Token ID606
Feature activation-1.10e-02

Pos logit contributions
bye+0.032
uck+0.028
enge+0.027
arn+0.026
aph+0.026
Neg logit contributions
 swollen-0.041
ge-0.041
 stret-0.040
 sy-0.037
 stuffed-0.034
Token and
Token ID290
Feature activation+4.34e-03

Pos logit contributions
 stret+0.026
 swollen+0.025
 sy+0.024
ge+0.023
 Kel+0.022
Neg logit contributions
uck-0.012
bye-0.011
ow-0.010
apped-0.008
 Sam-0.008
Token w
Token ID266
Feature activation+3.61e-04

Pos logit contributions
bye+0.007
uck+0.006
 Sam+0.006
enge+0.006
 Fred+0.006
Neg logit contributions
ge-0.009
 stret-0.008
 swollen-0.008
 sy-0.007
lest-0.006
Tokenagged
Token ID14655
Feature activation+9.13e-02

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
aph+0.000
arn+0.000
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 stuffed-0.001
Token its
Token ID663
Feature activation-1.05e-02

Pos logit contributions
bye+0.163
uck+0.151
enge+0.140
aph+0.137
arn+0.136
Neg logit contributions
ge-0.169
 stret-0.163
 swollen-0.162
 sy-0.150
 stuffed-0.130
Token tail
Token ID7894
Feature activation+5.18e-03

Pos logit contributions
ge+0.022
 stret+0.021
 swollen+0.020
 sy+0.019
 Kel+0.017
Neg logit contributions
bye-0.017
uck-0.015
arn-0.013
aph-0.013
enge-0.013
Token.
Token ID13
Feature activation+4.30e-03

Pos logit contributions
bye+0.006
uck+0.006
arn+0.005
aph+0.005
apped+0.005
Neg logit contributions
ge-0.012
 stret-0.012
 swollen-0.012
 sy-0.011
 Kel-0.010
Token It
Token ID632
Feature activation+6.68e-03

Pos logit contributions
bye+0.008
uck+0.007
enge+0.006
aph+0.006
iddle+0.006
Neg logit contributions
 swollen-0.008
 stret-0.008
ge-0.007
 sy-0.007
Are-0.006
Token l
Token ID300
Feature activation+6.46e-03

Pos logit contributions
bye+0.010
uck+0.009
enge+0.009
aph+0.008
arn+0.008
Neg logit contributions
ge-0.014
 stret-0.013
 swollen-0.012
 sy-0.012
lest-0.010
Token nice
Token ID3621
Feature activation+2.47e-02

Pos logit contributions
 swollen+0.002
ge+0.002
 stret+0.002
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.003
uck-0.003
aph-0.003
arn-0.003
enge-0.003
Token."
Token ID526
Feature activation+1.00e-02

Pos logit contributions
uck+0.048
bye+0.048
enge+0.048
aph+0.047
arn+0.044
Neg logit contributions
ge-0.032
 swollen-0.029
 sy-0.028
 stret-0.027
 stuffed-0.026
Token The
Token ID383
Feature activation+2.35e-03

Pos logit contributions
bye+0.017
uck+0.016
enge+0.015
iddle+0.015
aph+0.014
Neg logit contributions
 stret-0.018
ge-0.018
 swollen-0.017
 sy-0.017
 stuffed-0.015
Token dog
Token ID3290
Feature activation+2.85e-03

Pos logit contributions
bye+0.004
uck+0.004
arn+0.003
enge+0.003
aph+0.003
Neg logit contributions
 swollen-0.005
ge-0.004
 stret-0.004
 sy-0.004
 stuffed-0.004
Token w
Token ID266
Feature activation-1.76e-03

Pos logit contributions
bye+0.006
uck+0.005
enge+0.005
arn+0.005
aph+0.005
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.003
Tokenagged
Token ID14655
Feature activation+9.11e-02

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 tug+0.003
 sy+0.003
Neg logit contributions
bye-0.002
aph-0.002
umm-0.002
 noon-0.002
 Queen-0.002
Token his
Token ID465
Feature activation+1.09e-05

Pos logit contributions
bye+0.200
uck+0.193
arn+0.184
anks+0.182
aph+0.181
Neg logit contributions
 swollen-0.122
ge-0.117
 stret-0.108
 sy-0.093
 stuffed-0.090
Token tail
Token ID7894
Feature activation+4.71e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
aph+0.000
arn+0.000
Neg logit contributions
ge-0.000
 swollen-0.000
 stret-0.000
 sy-0.000
 stuffed-0.000
Token and
Token ID290
Feature activation-5.60e-04

Pos logit contributions
bye+0.007
uck+0.007
enge+0.006
aph+0.006
arn+0.006
Neg logit contributions
ge-0.010
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.008
Token l
Token ID300
Feature activation+5.71e-03

Pos logit contributions
 swollen+0.001
ge+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
arn-0.001
enge-0.001
aph-0.001
Tokenicked
Token ID9484
Feature activation+4.69e-02

Pos logit contributions
bye+0.008
uck+0.006
aph+0.006
 noon+0.006
arn+0.006
Neg logit contributions
 swollen-0.012
ge-0.012
 sy-0.011
 stret-0.011
 scrap-0.010
Token.
Token ID13
Feature activation+2.72e-03

Pos logit contributions
bye+0.008
uck+0.007
aph+0.006
enge+0.006
arn+0.006
Neg logit contributions
ge-0.012
 swollen-0.012
 stret-0.012
 sy-0.011
 stuffed-0.010
Token
Token ID198
Feature activation+3.00e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
arn+0.004
aph+0.004
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.005
 stuffed-0.004
Token
Token ID198
Feature activation+3.89e-03

Pos logit contributions
bye+0.004
uck+0.004
arn+0.004
enge+0.004
aph+0.004
Neg logit contributions
ge-0.007
 swollen-0.006
 stret-0.006
 sy-0.006
 stuffed-0.005
TokenThe
Token ID464
Feature activation+9.86e-03

Pos logit contributions
bye+0.007
uck+0.007
aph+0.006
arn+0.006
enge+0.006
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.006
 stuffed-0.006
Token weird
Token ID7650
Feature activation+1.21e-02

Pos logit contributions
bye+0.016
uck+0.016
enge+0.015
arn+0.015
aph+0.014
Neg logit contributions
ge-0.018
 swollen-0.018
 stret-0.018
 sy-0.017
 stuffed-0.015
Token-
Token ID12
Feature activation+9.09e-02

Pos logit contributions
bye+0.019
uck+0.018
enge+0.017
arn+0.017
aph+0.016
Neg logit contributions
ge-0.024
 swollen-0.023
 stret-0.023
 sy-0.021
 stuffed-0.019
Tokenlooking
Token ID11534
Feature activation-2.31e-03

Pos logit contributions
bye+0.150
uck+0.140
enge+0.129
arn+0.127
aph+0.127
Neg logit contributions
ge-0.177
 swollen-0.171
 stret-0.170
 sy-0.156
 stuffed-0.139
Token man
Token ID582
Feature activation+5.56e-03

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.003
arn-0.003
aph-0.003
Token said
Token ID531
Feature activation+8.25e-03

Pos logit contributions
bye+0.011
uck+0.010
enge+0.009
aph+0.009
arn+0.009
Neg logit contributions
ge-0.009
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.007
Token goodbye
Token ID24829
Feature activation-9.74e-05

Pos logit contributions
bye+0.012
uck+0.011
enge+0.010
aph+0.010
arn+0.010
Neg logit contributions
ge-0.018
 swollen-0.017
 stret-0.017
 sy-0.016
 stuffed-0.014
Token to
Token ID284
Feature activation-1.35e-03

Pos logit contributions
ge+0.000
 stret+0.000
 swollen+0.000
 sy+0.000
 stuffed+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
aph-0.000
arn-0.000
Token.
Token ID13
Feature activation-1.35e-03

Pos logit contributions
bye+0.009
uck+0.009
arn+0.008
aph+0.007
anks+0.007
Neg logit contributions
 swollen-0.018
ge-0.018
 stret-0.017
 sy-0.017
 widened-0.015
Token
Token ID198
Feature activation-1.95e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.001
uck-0.001
 Now-0.001
enge-0.001
arn-0.001
Token
Token ID198
Feature activation-1.76e-03

Pos logit contributions
ge+0.005
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.002
uck-0.002
arn-0.002
enge-0.002
aph-0.002
TokenMax
Token ID11518
Feature activation+9.03e-03

Pos logit contributions
ge+0.004
 stret+0.004
 swollen+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.002
uck-0.002
 Now-0.002
enge-0.002
arn-0.002
Token w
Token ID266
Feature activation-9.49e-04

Pos logit contributions
bye+0.016
uck+0.015
enge+0.014
arn+0.014
aph+0.013
Neg logit contributions
ge-0.017
 swollen-0.016
 stret-0.016
 sy-0.014
 stuffed-0.013
Tokenagged
Token ID14655
Feature activation+9.09e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 scrap+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
arn-0.001
enge-0.001
Token his
Token ID465
Feature activation-8.29e-05

Pos logit contributions
bye+0.200
uck+0.190
enge+0.179
aph+0.178
arn+0.178
Neg logit contributions
ge-0.127
 swollen-0.126
 stret-0.120
 sy-0.103
 scrap-0.091
Token tail
Token ID7894
Feature activation+3.74e-03

Pos logit contributions
ge+0.000
 swollen+0.000
 stret+0.000
 sy+0.000
 Kel+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
apped-0.000
arn-0.000
Token.
Token ID13
Feature activation-9.73e-04

Pos logit contributions
bye+0.005
uck+0.005
arn+0.004
enge+0.004
aph+0.004
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.007
 Kel-0.007
Token He
Token ID679
Feature activation+5.60e-03

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token did
Token ID750
Feature activation+9.71e-03

Pos logit contributions
bye+0.009
uck+0.009
enge+0.008
aph+0.008
arn+0.008
Neg logit contributions
ge-0.011
 stret-0.010
 swollen-0.010
 sy-0.009
 drift-0.008
Token
Token ID198
Feature activation+8.96e-04

Pos logit contributions
bye+0.030
uck+0.028
enge+0.027
arn+0.026
aph+0.025
Neg logit contributions
ge-0.031
 swollen-0.031
 stret-0.031
 sy-0.029
 stuffed-0.025
Token
Token ID198
Feature activation+7.01e-04

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
TokenThe
Token ID464
Feature activation+5.95e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 stuffed-0.001
Token dog
Token ID3290
Feature activation+5.08e-03

Pos logit contributions
bye+0.009
uck+0.009
aph+0.008
arn+0.008
enge+0.008
Neg logit contributions
 swollen-0.012
 stret-0.012
ge-0.012
 sy-0.011
 stuffed-0.010
Token w
Token ID266
Feature activation-2.30e-03

Pos logit contributions
bye+0.010
uck+0.009
enge+0.009
aph+0.009
arn+0.009
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 stuffed-0.007
Tokenagged
Token ID14655
Feature activation+9.06e-02

Pos logit contributions
 stret+0.005
ge+0.005
 swollen+0.005
 sy+0.004
 tug+0.004
Neg logit contributions
bye-0.003
aph-0.003
umm-0.002
uck-0.002
 noon-0.002
Token its
Token ID663
Feature activation-1.07e-02

Pos logit contributions
bye+0.198
uck+0.193
anks+0.183
arn+0.183
aph+0.181
Neg logit contributions
 swollen-0.119
ge-0.114
 stret-0.105
 sy-0.091
 stuffed-0.088
Token tail
Token ID7894
Feature activation+5.28e-03

Pos logit contributions
ge+0.020
 stret+0.019
 swollen+0.019
 sy+0.017
 widened+0.015
Neg logit contributions
bye-0.019
uck-0.017
enge-0.016
arn-0.016
aph-0.016
Token.
Token ID13
Feature activation-9.28e-04

Pos logit contributions
bye+0.008
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.011
 swollen-0.010
 stret-0.010
 sy-0.010
 stuffed-0.009
Token It
Token ID632
Feature activation+5.37e-03

Pos logit contributions
 swollen+0.001
 stret+0.001
 sy+0.001
Are+0.001
 widened+0.001
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
iddle-0.002
arn-0.002
Token liked
Token ID8288
Feature activation+1.28e-02

Pos logit contributions
bye+0.009
uck+0.009
enge+0.008
arn+0.008
aph+0.008
Neg logit contributions
ge-0.010
 stret-0.010
 swollen-0.009
 sy-0.009
 flashing-0.007
Token Tim
Token ID5045
Feature activation-6.28e-03

Pos logit contributions
bye+0.029
uck+0.026
arn+0.024
enge+0.024
aph+0.023
Neg logit contributions
 swollen-0.037
ge-0.037
 stret-0.036
 sy-0.033
 stuffed-0.032
Token and
Token ID290
Feature activation+8.37e-04

Pos logit contributions
 swollen+0.013
ge+0.013
 stret+0.013
 sy+0.012
 widened+0.011
Neg logit contributions
bye-0.009
uck-0.008
arn-0.007
enge-0.007
aph-0.007
Token Mia
Token ID32189
Feature activation-3.61e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.000
aph+0.000
arn+0.000
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Token and
Token ID290
Feature activation+7.90e-04

Pos logit contributions
 stret+0.009
 swollen+0.009
ge+0.009
 sy+0.008
 Kel+0.008
Neg logit contributions
bye-0.003
uck-0.003
ow-0.003
apped-0.003
 Sam-0.002
Token w
Token ID266
Feature activation-1.24e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
arn+0.001
aph+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Tokenagged
Token ID14655
Feature activation+9.06e-02

Pos logit contributions
ge+0.003
 stret+0.003
 swollen+0.003
 sy+0.003
 tug+0.003
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token his
Token ID465
Feature activation+2.82e-04

Pos logit contributions
bye+0.167
uck+0.153
enge+0.143
aph+0.140
arn+0.137
Neg logit contributions
ge-0.162
 stret-0.158
 swollen-0.157
 sy-0.146
 stuffed-0.123
Token tail
Token ID7894
Feature activation+4.65e-03

Pos logit contributions
bye+0.000
uck+0.000
apped+0.000
enge+0.000
aph+0.000
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 Kel-0.000
Token.
Token ID13
Feature activation+3.72e-03

Pos logit contributions
bye+0.006
uck+0.006
aph+0.005
arn+0.005
enge+0.005
Neg logit contributions
ge-0.011
 stret-0.010
 swollen-0.010
 sy-0.010
 Kel-0.009
Token He
Token ID679
Feature activation+6.23e-03

Pos logit contributions
bye+0.007
uck+0.007
night+0.006
enge+0.006
iddle+0.006
Neg logit contributions
 stret-0.005
 swollen-0.005
ge-0.005
 Kel-0.005
 widened-0.004
Token wanted
Token ID2227
Feature activation-2.58e-03

Pos logit contributions
bye+0.012
uck+0.011
enge+0.011
arn+0.011
aph+0.011
Neg logit contributions
ge-0.010
 swollen-0.010
 stret-0.010
 sy-0.009
 stuffed-0.008
Token back
Token ID736
Feature activation+1.99e-02

Pos logit contributions
bye+0.021
uck+0.019
enge+0.016
aph+0.015
 Fred+0.014
Neg logit contributions
ge-0.066
 swollen-0.063
 stret-0.062
 sy-0.061
 prope-0.053
Token at
Token ID379
Feature activation+1.18e-02

Pos logit contributions
bye+0.035
enge+0.031
uck+0.030
arn+0.029
iddle+0.028
Neg logit contributions
ge-0.038
 swollen-0.036
 stret-0.034
 sy-0.030
 flashing-0.029
Token her
Token ID607
Feature activation-1.19e-03

Pos logit contributions
bye+0.021
uck+0.020
arn+0.019
anks+0.019
enge+0.019
Neg logit contributions
ge-0.022
 swollen-0.021
 stret-0.021
 sy-0.018
 Kel-0.017
Token and
Token ID290
Feature activation+5.25e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.002
 sy+0.002
 Kel+0.002
Neg logit contributions
bye-0.002
uck-0.002
aph-0.001
arn-0.001
anks-0.001
Token w
Token ID266
Feature activation-1.99e-03

Pos logit contributions
bye+0.009
uck+0.008
enge+0.008
aph+0.008
arn+0.008
Neg logit contributions
ge-0.010
 swollen-0.009
 stret-0.009
 sy-0.009
 stuffed-0.008
Tokenagged
Token ID14655
Feature activation+9.05e-02

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 scrap+0.004
Neg logit contributions
bye-0.003
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token his
Token ID465
Feature activation-1.42e-03

Pos logit contributions
bye+0.174
uck+0.163
enge+0.152
aph+0.150
arn+0.149
Neg logit contributions
ge-0.154
 swollen-0.148
 stret-0.147
 sy-0.134
 stuffed-0.116
Token tail
Token ID7894
Feature activation+3.69e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stal+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
arn-0.002
apped-0.002
Token.
Token ID13
Feature activation+1.77e-03

Pos logit contributions
bye+0.005
uck+0.004
arn+0.004
aph+0.004
enge+0.004
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.008
 Kel-0.007
Token He
Token ID679
Feature activation+4.26e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
 stret-0.003
 swollen-0.003
ge-0.003
 sy-0.003
 Kel-0.003
Token came
Token ID1625
Feature activation+2.45e-02

Pos logit contributions
bye+0.008
uck+0.008
enge+0.007
arn+0.007
aph+0.007
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.006
 stuffed-0.005
Token play
Token ID711
Feature activation-1.07e-02

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.007
uck-0.006
enge-0.006
aph-0.006
arn-0.006
Token more
Token ID517
Feature activation+2.41e-02

Pos logit contributions
 stret+0.020
ge+0.020
 swollen+0.020
 sy+0.018
 stuffed+0.017
Neg logit contributions
bye-0.018
uck-0.017
aph-0.016
arn-0.016
enge-0.015
Token.
Token ID13
Feature activation-1.13e-03

Pos logit contributions
bye+0.043
uck+0.042
enge+0.038
arn+0.038
aph+0.038
Neg logit contributions
ge-0.045
 stret-0.042
 swollen-0.040
 sy-0.037
 stuffed-0.035
Token He
Token ID679
Feature activation+6.86e-03

Pos logit contributions
 swollen+0.002
 stret+0.002
 sy+0.001
ge+0.001
Are+0.001
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token w
Token ID266
Feature activation-3.06e-03

Pos logit contributions
bye+0.013
uck+0.013
arn+0.012
aph+0.012
enge+0.012
Neg logit contributions
ge-0.011
 swollen-0.011
 stret-0.011
 sy-0.010
 stuffed-0.009
Tokenagged
Token ID14655
Feature activation+9.05e-02

Pos logit contributions
 stret+0.008
ge+0.007
 swollen+0.007
 sy+0.007
 tug+0.006
Neg logit contributions
bye-0.003
uck-0.003
aph-0.003
arn-0.002
enge-0.002
Token his
Token ID465
Feature activation-4.62e-04

Pos logit contributions
bye+0.215
uck+0.211
arn+0.204
aph+0.200
umm+0.198
Neg logit contributions
 swollen-0.103
ge-0.101
 stret-0.091
 sy-0.074
 scrap-0.072
Token tail
Token ID7894
Feature activation+3.22e-03

Pos logit contributions
ge+0.001
 stret+0.001
 swollen+0.001
 sy+0.001
 widened+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
apped-0.001
aph-0.001
Token and
Token ID290
Feature activation-5.38e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.007
 swollen-0.006
 stret-0.006
 sy-0.006
 stuffed-0.005
Token l
Token ID300
Feature activation+3.96e-03

Pos logit contributions
 swollen+0.008
ge+0.008
 stret+0.007
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.012
uck-0.011
arn-0.010
aph-0.010
enge-0.010
Tokenicked
Token ID9484
Feature activation+4.56e-02

Pos logit contributions
bye+0.005
uck+0.004
aph+0.004
arn+0.004
enge+0.004
Neg logit contributions
 swollen-0.009
ge-0.009
 stret-0.009
 sy-0.008
 scrap-0.008
TokenMax
Token ID11518
Feature activation+1.16e-02

Pos logit contributions
bye+0.004
uck+0.003
enge+0.003
arn+0.003
aph+0.003
Neg logit contributions
ge-0.004
 swollen-0.003
 stret-0.003
 sy-0.003
 stuffed-0.003
Token bark
Token ID21405
Feature activation-2.21e-03

Pos logit contributions
bye+0.022
uck+0.020
enge+0.019
aph+0.019
arn+0.019
Neg logit contributions
ge-0.020
 swollen-0.019
 stret-0.019
 sy-0.017
 stuffed-0.015
Tokened
Token ID276
Feature activation+3.26e-02

Pos logit contributions
ge+0.005
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.005
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
enge-0.002
arn-0.002
Token and
Token ID290
Feature activation+2.30e-03

Pos logit contributions
bye+0.048
uck+0.044
enge+0.039
aph+0.039
arn+0.039
Neg logit contributions
ge-0.070
 swollen-0.068
 stret-0.068
 sy-0.063
 stuffed-0.055
Token w
Token ID266
Feature activation-8.56e-04

Pos logit contributions
bye+0.004
uck+0.003
enge+0.003
aph+0.003
 Fred+0.003
Neg logit contributions
ge-0.005
 stret-0.004
 swollen-0.004
 sy-0.004
 prope-0.003
Tokenagged
Token ID14655
Feature activation+9.04e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 tug+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token his
Token ID465
Feature activation+2.57e-03

Pos logit contributions
bye+0.174
uck+0.163
enge+0.152
aph+0.150
arn+0.150
Neg logit contributions
ge-0.153
 swollen-0.147
 stret-0.146
 sy-0.133
 stuffed-0.115
Token tail
Token ID7894
Feature activation+7.11e-03

Pos logit contributions
bye+0.005
uck+0.004
enge+0.004
arn+0.004
aph+0.004
Neg logit contributions
ge-0.005
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.004
Token.
Token ID13
Feature activation+8.40e-03

Pos logit contributions
bye+0.010
uck+0.010
enge+0.009
aph+0.008
arn+0.008
Neg logit contributions
ge-0.015
 swollen-0.015
 stret-0.015
 sy-0.014
 stuffed-0.012
Token He
Token ID679
Feature activation+1.02e-02

Pos logit contributions
bye+0.016
uck+0.014
enge+0.013
aph+0.013
arn+0.013
Neg logit contributions
 stret-0.015
 swollen-0.014
ge-0.014
 sy-0.013
 stuffed-0.012
Token knew
Token ID2993
Feature activation+5.83e-03

Pos logit contributions
bye+0.018
uck+0.017
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.019
 stret-0.018
 swollen-0.018
 sy-0.016
 stuffed-0.014
Token his
Token ID465
Feature activation-6.24e-03

Pos logit contributions
bye+0.016
uck+0.015
enge+0.014
aph+0.013
arn+0.013
Neg logit contributions
ge-0.021
 swollen-0.020
 stret-0.020
 sy-0.018
 stuffed-0.016
Token mind
Token ID2000
Feature activation-5.94e-04

Pos logit contributions
ge+0.011
 stret+0.011
 swollen+0.011
 sy+0.010
 stuffed+0.010
Neg logit contributions
bye-0.011
uck-0.010
enge-0.009
arn-0.009
aph-0.009
Token.
Token ID13
Feature activation-2.03e-03

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token He
Token ID679
Feature activation+1.05e-03

Pos logit contributions
ge+0.004
 stret+0.004
 swollen+0.004
 sy+0.004
 Kel+0.003
Neg logit contributions
bye-0.003
uck-0.003
enge-0.003
arn-0.003
aph-0.003
Token w
Token ID266
Feature activation-5.52e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.001
Tokenagged
Token ID14655
Feature activation+9.03e-02

Pos logit contributions
ge+0.014
 stret+0.014
 swollen+0.013
 sy+0.012
 tug+0.012
Neg logit contributions
bye-0.006
uck-0.005
aph-0.005
enge-0.005
arn-0.005
Token his
Token ID465
Feature activation-6.90e-03

Pos logit contributions
bye+0.158
uck+0.146
enge+0.134
aph+0.131
arn+0.130
Neg logit contributions
ge-0.165
 swollen-0.162
 stret-0.158
 sy-0.150
 stuffed-0.125
Token tail
Token ID7894
Feature activation+2.68e-03

Pos logit contributions
ge+0.014
 stret+0.013
 swollen+0.013
 sy+0.012
 stal+0.011
Neg logit contributions
bye-0.011
uck-0.010
enge-0.009
arn-0.009
apped-0.009
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.005
 stuffed-0.004
Token bark
Token ID21405
Feature activation-6.32e-03

Pos logit contributions
ge+0.005
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.006
uck-0.005
enge-0.005
aph-0.005
arn-0.005
Tokened
Token ID276
Feature activation+3.24e-02

Pos logit contributions
ge+0.016
 stret+0.016
 swollen+0.016
 sy+0.015
 stuffed+0.014
Neg logit contributions
bye-0.006
uck-0.005
enge-0.005
aph-0.005
arn-0.005
Token It
Token ID632
Feature activation+5.08e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
iddle+0.000
arn+0.000
Neg logit contributions
 stret-0.000
 swollen-0.000
 sy-0.000
 widened-0.000
ge-0.000
Token bark
Token ID21405
Feature activation-3.64e-03

Pos logit contributions
bye+0.008
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.010
 stret-0.010
 swollen-0.009
 sy-0.009
 prope-0.008
Tokened
Token ID276
Feature activation+3.27e-02

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.009
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.004
uck-0.003
enge-0.003
aph-0.003
arn-0.003
Token and
Token ID290
Feature activation-6.38e-05

Pos logit contributions
bye+0.044
uck+0.040
enge+0.036
aph+0.035
arn+0.034
Neg logit contributions
ge-0.075
 stret-0.072
 swollen-0.072
 sy-0.067
 stuffed-0.059
Token w
Token ID266
Feature activation-7.68e-04

Pos logit contributions
ge+0.000
 stret+0.000
 swollen+0.000
 sy+0.000
 stal+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
aph-0.000
 Fred-0.000
Tokenagged
Token ID14655
Feature activation+9.03e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 widened+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token its
Token ID663
Feature activation-1.05e-02

Pos logit contributions
bye+0.171
uck+0.160
enge+0.150
arn+0.147
aph+0.147
Neg logit contributions
ge-0.156
 swollen-0.151
 stret-0.148
 sy-0.134
 stuffed-0.119
Token tail
Token ID7894
Feature activation+4.96e-03

Pos logit contributions
ge+0.021
 stret+0.020
 swollen+0.019
 sy+0.018
 Kel+0.016
Neg logit contributions
bye-0.018
uck-0.016
enge-0.015
arn-0.015
aph-0.015
Token.
Token ID13
Feature activation+1.47e-04

Pos logit contributions
bye+0.007
uck+0.006
arn+0.005
enge+0.005
aph+0.005
Neg logit contributions
ge-0.011
 stret-0.011
 swollen-0.011
 sy-0.010
 Kel-0.009
Token It
Token ID632
Feature activation+4.63e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
iddle+0.000
arn+0.000
Neg logit contributions
 widened-0.000
 swollen-0.000
Are-0.000
 stret-0.000
ge-0.000
Token wanted
Token ID2227
Feature activation-3.10e-03

Pos logit contributions
bye+0.008
uck+0.007
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.009
 stret-0.009
 swollen-0.008
 sy-0.008
 prope-0.006
TokenMax
Token ID11518
Feature activation+1.17e-02

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
arn+0.003
aph+0.003
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.003
 stuffed-0.003
Token bark
Token ID21405
Feature activation-2.32e-03

Pos logit contributions
bye+0.022
uck+0.021
enge+0.019
aph+0.019
arn+0.019
Neg logit contributions
ge-0.020
 swollen-0.019
 stret-0.019
 sy-0.017
 stuffed-0.015
Tokened
Token ID276
Feature activation+3.24e-02

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.005
 stuffed+0.005
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
enge-0.002
arn-0.002
Token and
Token ID290
Feature activation+2.38e-03

Pos logit contributions
bye+0.048
uck+0.044
enge+0.039
aph+0.039
arn+0.038
Neg logit contributions
ge-0.070
 swollen-0.067
 stret-0.067
 sy-0.062
 stuffed-0.055
Token w
Token ID266
Feature activation-1.13e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.005
 stret-0.004
 swollen-0.004
 sy-0.004
 prope-0.003
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
 swollen+0.003
ge+0.003
 stret+0.003
 sy+0.002
 tug+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token his
Token ID465
Feature activation+2.35e-03

Pos logit contributions
bye+0.174
uck+0.163
enge+0.152
aph+0.150
arn+0.150
Neg logit contributions
ge-0.152
 swollen-0.147
 stret-0.146
 sy-0.133
 stuffed-0.115
Token tail
Token ID7894
Feature activation+6.86e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
arn+0.004
aph+0.004
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.003
Token.
Token ID13
Feature activation+8.23e-03

Pos logit contributions
bye+0.010
uck+0.009
enge+0.008
arn+0.008
aph+0.008
Neg logit contributions
ge-0.015
 swollen-0.014
 stret-0.014
 sy-0.013
 stuffed-0.012
Token He
Token ID679
Feature activation+1.01e-02

Pos logit contributions
bye+0.016
uck+0.014
enge+0.013
aph+0.013
arn+0.013
Neg logit contributions
 stret-0.014
 swollen-0.014
ge-0.014
 sy-0.013
 stuffed-0.012
Token knew
Token ID2993
Feature activation+5.99e-03

Pos logit contributions
bye+0.018
uck+0.017
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.018
 stret-0.018
 swollen-0.018
 sy-0.016
 stuffed-0.014
Token the
Token ID262
Feature activation+3.64e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token ball
Token ID2613
Feature activation+3.31e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.007
 Kel-0.006
Token.
Token ID13
Feature activation+5.70e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.005
arn+0.005
aph+0.005
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.006
 stuffed-0.005
Token Rex
Token ID17853
Feature activation+2.67e-04

Pos logit contributions
bye+0.008
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.012
 stret-0.012
 swollen-0.012
 sy-0.011
 stuffed-0.010
Token w
Token ID266
Feature activation-1.93e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
arn+0.000
aph+0.000
Neg logit contributions
ge-0.001
 swollen-0.000
 stret-0.000
 sy-0.000
 flashing-0.000
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
ge+0.005
 stret+0.005
 swollen+0.005
 sy+0.004
 tug+0.004
Neg logit contributions
bye-0.002
uck-0.002
aph-0.001
enge-0.001
arn-0.001
Token his
Token ID465
Feature activation-1.15e-03

Pos logit contributions
bye+0.195
uck+0.188
arn+0.176
enge+0.175
aph+0.175
Neg logit contributions
ge-0.125
 swollen-0.124
 stret-0.117
 sy-0.100
 stuffed-0.095
Token tail
Token ID7894
Feature activation+4.21e-03

Pos logit contributions
ge+0.003
 stret+0.002
 swollen+0.002
 sy+0.002
 Kel+0.002
Neg logit contributions
bye-0.002
uck-0.001
apped-0.001
enge-0.001
arn-0.001
Token and
Token ID290
Feature activation+6.12e-03

Pos logit contributions
bye+0.006
uck+0.006
arn+0.005
enge+0.005
aph+0.005
Neg logit contributions
ge-0.009
 stret-0.009
 swollen-0.009
 sy-0.008
 Kel-0.007
Token ran
Token ID4966
Feature activation+3.55e-03

Pos logit contributions
bye+0.012
uck+0.012
enge+0.011
aph+0.011
arn+0.011
Neg logit contributions
ge-0.010
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.007
Token to
Token ID284
Feature activation-3.23e-03

Pos logit contributions
bye+0.006
uck+0.005
arn+0.005
iddle+0.005
anks+0.005
Neg logit contributions
ge-0.007
 stret-0.007
 swollen-0.006
 tug-0.006
 stuffed-0.006
Token not
Token ID407
Feature activation+3.84e-03

Pos logit contributions
bye+0.012
uck+0.011
aph+0.010
enge+0.009
arn+0.009
Neg logit contributions
ge-0.025
 stret-0.025
 swollen-0.025
 sy-0.024
 stuffed-0.021
Token listen
Token ID6004
Feature activation-5.86e-04

Pos logit contributions
bye+0.006
uck+0.006
aph+0.005
enge+0.005
arn+0.005
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.007
 stuffed-0.006
Token.
Token ID13
Feature activation+1.70e-03

Pos logit contributions
 swollen+0.001
 stret+0.001
ge+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
arn-0.000
aph-0.000
night-0.000
Token He
Token ID679
Feature activation+6.12e-03

Pos logit contributions
bye+0.004
enge+0.003
uck+0.003
iddle+0.003
arn+0.003
Neg logit contributions
 swollen-0.002
 stret-0.002
 sy-0.002
 scrap-0.002
Are-0.002
Token w
Token ID266
Feature activation-8.32e-04

Pos logit contributions
bye+0.012
uck+0.012
enge+0.011
arn+0.011
aph+0.011
Neg logit contributions
ge-0.010
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.008
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
 stret+0.002
ge+0.002
 swollen+0.002
 sy+0.002
 tug+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
arn-0.001
enge-0.001
Token his
Token ID465
Feature activation+3.01e-04

Pos logit contributions
bye+0.207
uck+0.200
arn+0.192
aph+0.189
enge+0.187
Neg logit contributions
 swollen-0.114
ge-0.113
 stret-0.102
 sy-0.085
 stuffed-0.080
Token tail
Token ID7894
Feature activation+4.15e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
aph+0.000
arn+0.000
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 widened-0.000
Token and
Token ID290
Feature activation-2.22e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.006
aph+0.006
arn+0.006
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 stuffed-0.006
Token kept
Token ID4030
Feature activation-2.72e-04

Pos logit contributions
 swollen+0.003
ge+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.005
uck-0.004
enge-0.004
arn-0.004
aph-0.004
Token playing
Token ID2712
Feature activation-2.33e-02

Pos logit contributions
ge+0.000
 swollen+0.000
 stret+0.000
 sy+0.000
 scrap+0.000
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
arn-0.001
aph-0.001
Token
Token ID220
Feature activation-1.74e-02

Pos logit contributions
bye+0.012
uck+0.011
enge+0.010
aph+0.010
arn+0.010
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 stuffed-0.006
Token
Token ID198
Feature activation-5.61e-05

Pos logit contributions
ge+0.037
 swollen+0.036
 stret+0.035
 sy+0.033
 stuffed+0.029
Neg logit contributions
bye-0.026
uck-0.024
enge-0.022
aph-0.022
arn-0.021
Token
Token ID198
Feature activation-4.28e-04

Pos logit contributions
ge+0.000
 swollen+0.000
 stret+0.000
 sy+0.000
 stuffed+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
aph-0.000
arn-0.000
TokenMax
Token ID11518
Feature activation+1.04e-02

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
arn-0.001
aph-0.001
Token w
Token ID266
Feature activation-8.57e-04

Pos logit contributions
bye+0.021
uck+0.020
enge+0.019
aph+0.019
arn+0.018
Neg logit contributions
ge-0.016
 swollen-0.015
 stret-0.015
 sy-0.014
 stuffed-0.013
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
 swollen+0.002
 stret+0.002
ge+0.002
 sy+0.002
 tug+0.002
Neg logit contributions
bye-0.001
aph-0.001
 noon-0.001
uck-0.001
 Queen-0.001
Token his
Token ID465
Feature activation+5.10e-04

Pos logit contributions
bye+0.176
uck+0.165
enge+0.154
aph+0.152
arn+0.151
Neg logit contributions
ge-0.150
 swollen-0.145
 stret-0.144
 sy-0.130
 stuffed-0.113
Token tail
Token ID7894
Feature activation+6.44e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
arn+0.001
aph+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 widened-0.001
Token and
Token ID290
Feature activation+1.59e-03

Pos logit contributions
bye+0.011
uck+0.011
enge+0.010
arn+0.010
aph+0.010
Neg logit contributions
ge-0.012
 swollen-0.011
 stret-0.011
 sy-0.010
 stuffed-0.009
Token ran
Token ID4966
Feature activation+4.20e-03

Pos logit contributions
bye+0.003
uck+0.003
arn+0.003
enge+0.003
aph+0.003
Neg logit contributions
 swollen-0.003
ge-0.003
 stret-0.002
 sy-0.002
 stuffed-0.002
Token around
Token ID1088
Feature activation+6.40e-03

Pos logit contributions
bye+0.007
uck+0.006
arn+0.006
enge+0.006
umm+0.006
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.007
 tug-0.007
 stuffed-0.007
Token dirty
Token ID11841
Feature activation-1.79e-02

Pos logit contributions
bye+0.007
uck+0.006
enge+0.006
apped+0.006
aph+0.006
Neg logit contributions
ge-0.007
 stret-0.007
 swollen-0.006
 sy-0.006
 widened-0.005
Token.
Token ID13
Feature activation+5.40e-03

Pos logit contributions
ge+0.036
 swollen+0.035
 stret+0.034
 sy+0.032
 stuffed+0.028
Neg logit contributions
bye-0.029
uck-0.026
enge-0.025
aph-0.024
arn-0.024
Token The
Token ID383
Feature activation+4.04e-03

Pos logit contributions
bye+0.010
uck+0.009
enge+0.008
iddle+0.008
aph+0.008
Neg logit contributions
 swollen-0.010
 stret-0.009
ge-0.009
 sy-0.009
Are-0.008
Token dog
Token ID3290
Feature activation+8.55e-04

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
arn+0.004
aph+0.004
Neg logit contributions
ge-0.010
 stret-0.009
 swollen-0.009
 sy-0.008
 Kel-0.008
Token w
Token ID266
Feature activation-8.49e-04

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
arn+0.001
apped+0.001
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.001
 Kel-0.001
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 scrap+0.002
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token its
Token ID663
Feature activation-1.25e-02

Pos logit contributions
bye+0.188
uck+0.180
arn+0.169
aph+0.168
enge+0.168
Neg logit contributions
 swollen-0.133
ge-0.132
 stret-0.128
 sy-0.111
 stuffed-0.102
Token tail
Token ID7894
Feature activation+3.00e-03

Pos logit contributions
ge+0.026
 stret+0.024
 swollen+0.023
 sy+0.022
 Kel+0.019
Neg logit contributions
bye-0.021
uck-0.018
enge-0.017
aph-0.017
arn-0.017
Token and
Token ID290
Feature activation+2.45e-03

Pos logit contributions
bye+0.004
uck+0.004
arn+0.003
enge+0.003
apped+0.003
Neg logit contributions
ge-0.007
 stret-0.006
 swollen-0.006
 sy-0.006
 Kel-0.005
Token l
Token ID300
Feature activation+5.33e-03

Pos logit contributions
bye+0.005
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.003
Tokenicked
Token ID9484
Feature activation+4.76e-02

Pos logit contributions
bye+0.006
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.013
 swollen-0.013
 stret-0.013
 sy-0.012
 stuffed-0.011
Token dog
Token ID3290
Feature activation+1.62e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
 swollen-0.004
ge-0.004
 stret-0.004
 sy-0.003
 stuffed-0.003
Token bark
Token ID21405
Feature activation-3.47e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.002
 stuffed-0.002
Tokened
Token ID276
Feature activation+3.41e-02

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.009
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.003
uck-0.003
enge-0.002
aph-0.002
arn-0.002
Token and
Token ID290
Feature activation+1.16e-03

Pos logit contributions
bye+0.048
uck+0.044
enge+0.039
aph+0.038
arn+0.038
Neg logit contributions
ge-0.076
 stret-0.073
 swollen-0.072
 sy-0.067
 flashing-0.059
Token w
Token ID266
Feature activation-3.74e-04

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Tokenagged
Token ID14655
Feature activation+9.02e-02

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
 tug+0.001
Neg logit contributions
bye-0.000
uck-0.000
aph-0.000
enge-0.000
arn-0.000
Token its
Token ID663
Feature activation-1.00e-02

Pos logit contributions
bye+0.163
uck+0.152
enge+0.141
aph+0.139
arn+0.138
Neg logit contributions
ge-0.163
 stret-0.157
 swollen-0.157
 sy-0.144
 stuffed-0.125
Token tail
Token ID7894
Feature activation+4.66e-03

Pos logit contributions
ge+0.019
 stret+0.019
 swollen+0.018
 sy+0.017
 widened+0.014
Neg logit contributions
bye-0.018
uck-0.016
enge-0.015
arn-0.015
aph-0.014
Token in
Token ID287
Feature activation+7.33e-03

Pos logit contributions
bye+0.006
uck+0.005
arn+0.005
enge+0.005
aph+0.005
Neg logit contributions
ge-0.011
 stret-0.011
 swollen-0.010
 sy-0.010
 Kel-0.009
Token response
Token ID2882
Feature activation-1.02e-02

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
aph+0.012
arn+0.012
Neg logit contributions
ge-0.013
 swollen-0.012
 stret-0.012
 sy-0.011
 stuffed-0.010
Token,
Token ID11
Feature activation+3.47e-03

Pos logit contributions
ge+0.020
 swollen+0.020
 stret+0.020
 sy+0.018
 stuffed+0.016
Neg logit contributions
bye-0.016
uck-0.015
enge-0.014
aph-0.014
arn-0.014
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
 swollen+0.049
ge+0.047
 stret+0.046
 sy+0.044
 widened+0.040
Neg logit contributions
bye-0.018
uck-0.018
aph-0.015
enge-0.014
anks-0.014
Token sugar
Token ID7543
Feature activation-1.58e-02

Pos logit contributions
ge+0.030
 swollen+0.029
 stret+0.029
 sy+0.026
 stuffed+0.023
Neg logit contributions
bye-0.025
uck-0.023
enge-0.021
aph-0.021
arn-0.021
Token would
Token ID561
Feature activation+1.63e-02

Pos logit contributions
ge+0.033
 swollen+0.032
 stret+0.032
 sy+0.029
 stuffed+0.026
Neg logit contributions
bye-0.024
uck-0.023
enge-0.020
arn-0.019
aph-0.019
Token not
Token ID407
Feature activation+1.23e-03

Pos logit contributions
bye+0.026
uck+0.023
arn+0.020
um+0.019
ow+0.019
Neg logit contributions
ge-0.032
 swollen-0.031
 stret-0.031
 sy-0.029
 levers-0.025
Token bud
Token ID3600
Feature activation-5.06e-03

Pos logit contributions
bye+0.002
uck+0.002
aph+0.002
enge+0.002
arn+0.002
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 Kel-0.002
Tokenge
Token ID469
Feature activation-1.03e-01

Pos logit contributions
 stret+0.005
 swollen+0.004
 sy+0.004
 stuffed+0.003
 widened+0.003
Neg logit contributions
uck-0.013
bye-0.012
aph-0.011
iddle-0.011
ow-0.011
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
ge+0.220
 swollen+0.218
 stret+0.216
 sy+0.202
 stuffed+0.180
Neg logit contributions
bye-0.146
uck-0.133
enge-0.120
arn-0.116
aph-0.116
Token All
Token ID1439
Feature activation+8.94e-03

Pos logit contributions
ge+0.007
 stret+0.007
 swollen+0.007
 sy+0.007
 Kel+0.006
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
aph-0.005
arn-0.005
Token it
Token ID340
Feature activation+1.11e-02

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
aph+0.011
arn+0.011
Neg logit contributions
ge-0.019
 swollen-0.018
 stret-0.018
 sy-0.017
 stuffed-0.015
Token did
Token ID750
Feature activation+1.61e-02

Pos logit contributions
bye+0.019
uck+0.018
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.021
 stret-0.020
 swollen-0.020
 sy-0.018
 flashing-0.016
Token all
Token ID477
Feature activation-2.12e-03

Pos logit contributions
bye+0.016
uck+0.014
 example+0.011
aph+0.011
 Fred+0.011
Neg logit contributions
 swollen-0.039
 stret-0.039
 sy-0.039
ge-0.039
 scrap-0.034
Token rest
Token ID1334
Feature activation-1.69e-02

Pos logit contributions
bye+0.002
uck+0.002
aph+0.001
enge+0.001
arn+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 flashing-0.002
Token of
Token ID286
Feature activation-1.78e-04

Pos logit contributions
ge+0.034
 swollen+0.033
 stret+0.033
 sy+0.030
 stuffed+0.027
Neg logit contributions
bye-0.027
uck-0.025
enge-0.023
aph-0.022
arn-0.022
Token the
Token ID262
Feature activation+3.90e-04

Pos logit contributions
ge+0.000
 swollen+0.000
 stret+0.000
 sy+0.000
 stuffed+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
arn-0.000
iddle-0.000
Token day
Token ID1110
Feature activation+1.60e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
aph+0.000
arn+0.000
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 stuffed-0.001
Token to
Token ID284
Feature activation-8.76e-03

Pos logit contributions
bye+0.003
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.003
 stuffed-0.003
Tokenge
Token ID469
Feature activation-9.91e-02

Pos logit contributions
ge+0.016
 swollen+0.015
 stret+0.015
 sy+0.014
 stuffed+0.012
Neg logit contributions
bye-0.016
uck-0.015
enge-0.014
aph-0.014
arn-0.013
Tokenh
Token ID71
Feature activation+3.73e-03

Pos logit contributions
ge+0.205
 swollen+0.198
 stret+0.196
 sy+0.181
 stuffed+0.164
Neg logit contributions
bye-0.152
uck-0.141
enge-0.129
arn-0.127
aph-0.126
Tokenter
Token ID353
Feature activation+3.28e-03

Pos logit contributions
bye+0.005
uck+0.005
aph+0.004
arn+0.004
enge+0.004
Neg logit contributions
 swollen-0.009
 stret-0.008
ge-0.008
 sy-0.008
 stuffed-0.007
Token and
Token ID290
Feature activation-2.26e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.005
aph+0.004
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.006
 stuffed-0.005
Token,
Token ID11
Feature activation-7.63e-04

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.004
enge-0.004
arn-0.004
aph-0.003
Token when
Token ID618
Feature activation-1.45e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.001
 scrap+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token trunk
Token ID21427
Feature activation-1.31e-02

Pos logit contributions
ge+0.008
 swollen+0.007
 stret+0.006
 Kel+0.006
 sy+0.006
Neg logit contributions
bye-0.005
uck-0.004
aph-0.004
um-0.004
olly-0.004
Token and
Token ID290
Feature activation+7.60e-05

Pos logit contributions
ge+0.028
 swollen+0.027
 stret+0.027
 sy+0.025
 stuffed+0.022
Neg logit contributions
bye-0.019
uck-0.018
aph-0.016
enge-0.016
arn-0.015
Token wouldn
Token ID3636
Feature activation+6.12e-04

Pos logit contributions
bye+0.000
uck+0.000
arn+0.000
enge+0.000
aph+0.000
Neg logit contributions
ge-0.000
 swollen-0.000
 stret-0.000
 sy-0.000
 stuffed-0.000
Token't
Token ID470
Feature activation-4.34e-03

Pos logit contributions
bye+0.001
uck+0.001
aph+0.001
arn+0.001
um+0.001
Neg logit contributions
 swollen-0.001
 stret-0.001
ge-0.001
 sy-0.001
 stuffed-0.001
Token bud
Token ID3600
Feature activation-3.08e-03

Pos logit contributions
 swollen+0.008
 stret+0.008
ge+0.008
 sy+0.008
 flashing+0.007
Neg logit contributions
bye-0.007
uck-0.006
aph-0.006
 Fred-0.005
arn-0.005
Tokenge
Token ID469
Feature activation-9.89e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.002
 prope+0.001
 levers+0.001
Neg logit contributions
uck-0.008
bye-0.008
ow-0.008
 home-0.008
aph-0.008
Token.
Token ID13
Feature activation-5.46e-04

Pos logit contributions
ge+0.212
 swollen+0.212
 stret+0.211
 sy+0.196
 stuffed+0.175
Neg logit contributions
bye-0.140
uck-0.128
enge-0.114
aph-0.112
arn-0.112
Token The
Token ID383
Feature activation-1.82e-03

Pos logit contributions
ge+0.001
 stret+0.001
 swollen+0.001
 sy+0.001
 widened+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token squirrel
Token ID33039
Feature activation-8.87e-03

Pos logit contributions
 swollen+0.003
ge+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.003
uck-0.003
enge-0.003
arn-0.003
aph-0.003
Token was
Token ID373
Feature activation-4.84e-03

Pos logit contributions
ge+0.017
 stret+0.016
 swollen+0.016
 sy+0.015
 levers+0.013
Neg logit contributions
bye-0.015
uck-0.014
enge-0.013
aph-0.012
arn-0.012
Token hungry
Token ID14720
Feature activation-1.15e-02

Pos logit contributions
ge+0.011
 stret+0.011
 swollen+0.011
 sy+0.010
 stuffed+0.009
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
arn-0.005
aph-0.005
Token but
Token ID475
Feature activation+2.08e-03

Pos logit contributions
 Sam+0.001
 Polly+0.001
uck+0.001
 Jack+0.001
 Fred+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
lest-0.002
Token it
Token ID340
Feature activation+9.31e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.004
 sy-0.004
 widened-0.004
Token wouldn
Token ID3636
Feature activation+9.27e-04

Pos logit contributions
bye+0.018
uck+0.017
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.015
 swollen-0.015
 stret-0.014
 sy-0.013
 stuffed-0.011
Token't
Token ID470
Feature activation-1.54e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
um+0.001
arn+0.001
Neg logit contributions
 stret-0.002
ge-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.002
Token bud
Token ID3600
Feature activation-3.33e-03

Pos logit contributions
 stret+0.003
 swollen+0.003
ge+0.003
 sy+0.003
 widened+0.002
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
arn-0.002
enge-0.002
Tokenge
Token ID469
Feature activation-9.87e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.002
 widened+0.001
 stuffed+0.001
Neg logit contributions
uck-0.009
bye-0.009
aph-0.008
ow-0.008
anks-0.008
Token.
Token ID13
Feature activation+5.28e-03

Pos logit contributions
 stret+0.230
 swollen+0.226
ge+0.221
 sy+0.212
 widened+0.191
Neg logit contributions
bye-0.122
uck-0.111
arn-0.095
enge-0.094
aph-0.092
Token She
Token ID1375
Feature activation-1.96e-03

Pos logit contributions
bye+0.011
enge+0.010
aph+0.010
iddle+0.010
uck+0.010
Neg logit contributions
 swollen-0.007
 stret-0.007
ge-0.007
 sy-0.007
 Kel-0.006
Token asked
Token ID1965
Feature activation+5.91e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.003
enge-0.003
aph-0.003
arn-0.003
Token her
Token ID607
Feature activation-1.45e-03

Pos logit contributions
bye+0.012
uck+0.012
aph+0.011
enge+0.011
arn+0.011
Neg logit contributions
ge-0.009
 swollen-0.009
 stret-0.009
 sy-0.008
 stuffed-0.007
Token mom
Token ID1995
Feature activation+1.52e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.002
uck-0.002
enge-0.001
arn-0.001
aph-0.001
Token but
Token ID475
Feature activation+1.38e-03

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.002
istered+0.002
Neg logit contributions
 Sam-0.002
 Jack-0.002
 Fred-0.002
 Polly-0.001
uck-0.001
Token it
Token ID340
Feature activation+8.20e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
 Sam+0.002
aph+0.001
Neg logit contributions
 stret-0.003
ge-0.003
 swollen-0.003
 sy-0.003
 widened-0.002
Token wouldn
Token ID3636
Feature activation+9.42e-04

Pos logit contributions
bye+0.016
uck+0.015
enge+0.014
aph+0.014
arn+0.014
Neg logit contributions
ge-0.014
 swollen-0.013
 stret-0.013
 sy-0.012
 stuffed-0.010
Token't
Token ID470
Feature activation-1.01e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
um+0.001
arn+0.001
Neg logit contributions
 stret-0.002
ge-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.002
Token bud
Token ID3600
Feature activation-3.75e-03

Pos logit contributions
 stret+0.002
 swollen+0.002
ge+0.002
 sy+0.002
 scrap+0.002
Neg logit contributions
bye-0.002
uck-0.001
aph-0.001
 noon-0.001
iddle-0.001
Tokenge
Token ID469
Feature activation-9.87e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.002
 widened+0.001
 stuffed+0.001
Neg logit contributions
bye-0.010
uck-0.010
aph-0.010
anks-0.010
ow-0.009
Token.
Token ID13
Feature activation+3.52e-03

Pos logit contributions
 stret+0.230
 swollen+0.228
ge+0.220
 sy+0.213
 widened+0.191
Neg logit contributions
bye-0.123
uck-0.110
enge-0.093
arn-0.092
aph-0.091
Token
Token ID198
Feature activation-5.08e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.006
aph+0.006
arn+0.006
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.005
 Kel-0.005
Token
Token ID198
Feature activation-5.10e-03

Pos logit contributions
ge+0.011
 swollen+0.010
 stret+0.010
 sy+0.010
 stuffed+0.009
Neg logit contributions
bye-0.008
uck-0.007
enge-0.006
aph-0.006
arn-0.006
TokenThen
Token ID6423
Feature activation+1.92e-03

Pos logit contributions
ge+0.009
 stret+0.009
 swollen+0.009
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.009
uck-0.009
arn-0.008
um-0.008
enge-0.007
Token,
Token ID11
Feature activation+1.37e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.003
Token but
Token ID475
Feature activation+1.68e-03

Pos logit contributions
bye+0.001
uck+0.001
aph+0.001
 Sam+0.001
 Fred+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 scrap-0.001
Token it
Token ID340
Feature activation+9.75e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.002
arn+0.002
Neg logit contributions
ge-0.003
 stret-0.003
 swollen-0.003
 sy-0.003
 stuffed-0.002
Token wouldn
Token ID3636
Feature activation+1.13e-03

Pos logit contributions
bye+0.021
uck+0.019
aph+0.018
arn+0.018
enge+0.018
Neg logit contributions
 swollen-0.015
ge-0.014
 stret-0.013
 sy-0.012
 stuffed-0.011
Token't
Token ID470
Feature activation-1.21e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
um+0.001
arn+0.001
Neg logit contributions
 stret-0.003
ge-0.003
 swollen-0.003
 sy-0.002
 stuffed-0.002
Token bud
Token ID3600
Feature activation-2.49e-03

Pos logit contributions
 stret+0.002
 swollen+0.002
ge+0.002
 sy+0.002
 scrap+0.002
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
arn-0.002
enge-0.002
Tokenge
Token ID469
Feature activation-9.86e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.001
 scrap+0.000
 stuffed+0.000
Neg logit contributions
uck-0.007
bye-0.007
aph-0.007
anks-0.007
arn-0.007
Token.
Token ID13
Feature activation+3.03e-03

Pos logit contributions
 stret+0.221
 swollen+0.220
ge+0.215
 sy+0.205
 widened+0.182
Neg logit contributions
bye-0.130
uck-0.118
enge-0.103
arn-0.102
aph-0.101
Token Tim
Token ID5045
Feature activation-1.00e-02

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.005
 sy-0.005
 stuffed-0.005
Tokenmy
Token ID1820
Feature activation+1.34e-02

Pos logit contributions
ge+0.016
 swollen+0.015
 stret+0.014
 sy+0.013
 scrap+0.011
Neg logit contributions
bye-0.021
uck-0.019
enge-0.018
aph-0.018
arn-0.018
Token tried
Token ID3088
Feature activation-1.11e-02

Pos logit contributions
bye+0.026
uck+0.024
enge+0.023
arn+0.022
aph+0.022
Neg logit contributions
ge-0.022
 swollen-0.022
 stret-0.021
 sy-0.019
 stuffed-0.017
Token and
Token ID290
Feature activation+2.63e-03

Pos logit contributions
 stret+0.027
 swollen+0.026
ge+0.026
 sy+0.024
 widened+0.022
Neg logit contributions
uck-0.012
bye-0.012
aph-0.010
arn-0.010
enge-0.010
Tokenâ
Token ID22940
Feature activation+3.09e-03

Pos logit contributions
uck+0.003
bye+0.002
aph+0.002
um+0.002
arn+0.002
Neg logit contributions
 stret-0.006
ge-0.005
 swollen-0.005
 sy-0.005
 stuffed-0.005
Token
Token ID26391
Feature activation+9.27e-03

Pos logit contributions
uck+0.004
bye+0.004
aph+0.004
 advisor+0.004
apped+0.004
Neg logit contributions
ge-0.006
 stret-0.005
 swollen-0.005
 sy-0.005
 drift-0.004
Token
Token ID8151
Feature activation+2.13e-02

Pos logit contributions
bye+0.015
uck+0.014
enge+0.013
arn+0.013
aph+0.013
Neg logit contributions
ge-0.018
 swollen-0.017
 stret-0.017
 sy-0.016
 stuffed-0.014
Tokent
Token ID83
Feature activation+9.82e-03

Pos logit contributions
bye+0.034
uck+0.032
enge+0.028
arn+0.028
aph+0.028
Neg logit contributions
ge-0.042
 swollen-0.042
 stret-0.042
 sy-0.039
 stuffed-0.035
Token bud
Token ID3600
Feature activation-2.78e-03

Pos logit contributions
bye+0.014
uck+0.014
aph+0.012
iddle+0.012
arn+0.011
Neg logit contributions
 stret-0.021
 swollen-0.020
 sy-0.018
ge-0.018
 stuffed-0.016
Tokenge
Token ID469
Feature activation-9.86e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.002
 widened+0.001
 stuffed+0.001
Neg logit contributions
uck-0.007
bye-0.007
aph-0.007
ow-0.007
anks-0.007
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
 stret+0.235
 swollen+0.227
ge+0.225
 sy+0.217
 widened+0.193
Neg logit contributions
bye-0.116
uck-0.108
enge-0.089
arn-0.088
aph-0.085
Token
Token ID198
Feature activation-4.29e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.005
 stret-0.005
 sy-0.005
 Kel-0.004
Token
Token ID198
Feature activation-4.31e-03

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.009
 sy+0.008
 stuffed+0.008
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
arn-0.005
aph-0.005
Tokenâ
Token ID22940
Feature activation+4.24e-03

Pos logit contributions
 sy+0.009
ge+0.009
 stret+0.009
 swollen+0.008
 stuffed+0.008
Neg logit contributions
uck-0.007
bye-0.006
 Now-0.006
um-0.005
arn-0.005
Token
Token ID26391
Feature activation+1.04e-02

Pos logit contributions
uck+0.006
bye+0.006
aph+0.006
enge+0.006
apped+0.005
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.007
 tug-0.006
Token the
Token ID262
Feature activation+1.65e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
 stret-0.004
ge-0.004
 swollen-0.004
 sy-0.004
 stuffed-0.003
Token screw
Token ID9580
Feature activation-2.34e-03

Pos logit contributions
bye+0.003
uck+0.003
iddle+0.003
 noon+0.003
enge+0.003
Neg logit contributions
ge-0.003
 stret-0.003
 swollen-0.002
 widened-0.002
 sy-0.002
Token wouldn
Token ID3636
Feature activation+6.64e-04

Pos logit contributions
ge+0.004
 stret+0.004
 swollen+0.004
 sy+0.004
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.004
enge-0.004
arn-0.004
aph-0.004
Token't
Token ID470
Feature activation-2.22e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
um+0.001
arn+0.001
Neg logit contributions
 stret-0.002
 swollen-0.002
ge-0.002
 sy-0.001
 stuffed-0.001
Token bud
Token ID3600
Feature activation-3.54e-03

Pos logit contributions
 stret+0.005
 swollen+0.005
 sy+0.004
ge+0.004
 scrap+0.004
Neg logit contributions
bye-0.003
uck-0.003
aph-0.003
iddle-0.003
anks-0.003
Tokenge
Token ID469
Feature activation-9.86e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.001
 stuffed+0.001
 widened+0.001
Neg logit contributions
uck-0.010
bye-0.010
aph-0.009
anks-0.009
iddle-0.009
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
 stret+0.224
 swollen+0.223
ge+0.216
 sy+0.206
 widened+0.185
Neg logit contributions
bye-0.127
uck-0.115
enge-0.100
arn-0.099
aph-0.098
Token
Token ID198
Feature activation-4.00e-03

Pos logit contributions
bye+0.004
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.003
 stret-0.003
 swollen-0.003
 sy-0.003
 Kel-0.002
Token
Token ID198
Feature activation-4.07e-03

Pos logit contributions
ge+0.008
 swollen+0.008
 stret+0.008
 sy+0.007
 stuffed+0.007
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
aph-0.005
arn-0.005
TokenSuddenly
Token ID38582
Feature activation+1.55e-02

Pos logit contributions
 swollen+0.008
ge+0.007
 stret+0.007
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.007
uck-0.007
enge-0.006
um-0.006
arn-0.006
Token,
Token ID11
Feature activation+3.64e-03

Pos logit contributions
bye+0.019
uck+0.018
enge+0.015
aph+0.015
arn+0.014
Neg logit contributions
ge-0.037
 stret-0.036
 swollen-0.036
 sy-0.034
 stuffed-0.030
Token but
Token ID475
Feature activation+3.33e-03

Pos logit contributions
bye+0.003
uck+0.003
aph+0.002
 Sam+0.002
arn+0.002
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.002
 widened-0.002
Token it
Token ID340
Feature activation+8.30e-03

Pos logit contributions
bye+0.005
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
 stret-0.007
ge-0.007
 swollen-0.007
 sy-0.007
 widened-0.006
Token wouldn
Token ID3636
Feature activation+1.66e-03

Pos logit contributions
bye+0.017
uck+0.016
enge+0.015
aph+0.015
arn+0.015
Neg logit contributions
ge-0.013
 swollen-0.013
 stret-0.012
 sy-0.011
 stuffed-0.010
Token't
Token ID470
Feature activation-2.07e-03

Pos logit contributions
uck+0.002
bye+0.002
aph+0.002
arn+0.002
um+0.002
Neg logit contributions
ge-0.004
 stret-0.004
 swollen-0.004
 sy-0.003
 stuffed-0.003
Token bud
Token ID3600
Feature activation-2.49e-03

Pos logit contributions
 stret+0.004
 swollen+0.004
ge+0.004
 sy+0.004
 scrap+0.003
Neg logit contributions
bye-0.003
uck-0.003
aph-0.003
arn-0.002
iddle-0.002
Tokenge
Token ID469
Feature activation-9.85e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.001
 widened+0.001
 scrap+0.001
Neg logit contributions
uck-0.007
bye-0.007
aph-0.007
anks-0.006
ow-0.006
Token.
Token ID13
Feature activation+4.68e-03

Pos logit contributions
 stret+0.225
 swollen+0.223
ge+0.219
 sy+0.207
 widened+0.186
Neg logit contributions
bye-0.124
uck-0.114
arn-0.098
enge-0.096
aph-0.094
Token Tim
Token ID5045
Feature activation-1.10e-02

Pos logit contributions
bye+0.007
uck+0.006
aph+0.006
enge+0.006
arn+0.005
Neg logit contributions
 swollen-0.010
ge-0.010
 stret-0.010
 sy-0.009
 Kel-0.008
Tokenmy
Token ID1820
Feature activation+1.30e-02

Pos logit contributions
ge+0.018
 swollen+0.017
 stret+0.016
 sy+0.015
 stuffed+0.013
Neg logit contributions
bye-0.022
uck-0.020
enge-0.020
aph-0.020
arn-0.019
Token said
Token ID531
Feature activation+3.09e-03

Pos logit contributions
bye+0.026
uck+0.025
enge+0.023
aph+0.023
arn+0.023
Neg logit contributions
ge-0.020
 swollen-0.020
 stret-0.019
 sy-0.018
 stuffed-0.015
Token,
Token ID11
Feature activation+1.76e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.002
apped+0.002
um+0.002
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.007
 widened-0.007
Tokenâ
Token ID22940
Feature activation+2.71e-03

Pos logit contributions
uck+0.006
bye+0.006
aph+0.005
um+0.005
arn+0.005
Neg logit contributions
ge-0.013
 swollen-0.013
 stret-0.013
 sy-0.011
 stuffed-0.011
Token
Token ID26391
Feature activation+1.01e-02

Pos logit contributions
 advisor+0.004
aph+0.004
enge+0.004
bye+0.004
arn+0.004
Neg logit contributions
 stret-0.005
ge-0.005
 swollen-0.004
 sy-0.004
 Kel-0.004
Token
Token ID8151
Feature activation+2.77e-02

Pos logit contributions
bye+0.017
uck+0.016
enge+0.015
aph+0.015
arn+0.015
Neg logit contributions
ge-0.019
 stret-0.019
 swollen-0.018
 sy-0.017
 stuffed-0.015
Tokent
Token ID83
Feature activation+1.37e-02

Pos logit contributions
bye+0.046
uck+0.043
enge+0.039
aph+0.038
arn+0.038
Neg logit contributions
ge-0.053
 swollen-0.053
 stret-0.052
 sy-0.048
 stuffed-0.044
Token bud
Token ID3600
Feature activation-8.37e-04

Pos logit contributions
bye+0.020
uck+0.019
aph+0.017
iddle+0.017
arn+0.016
Neg logit contributions
 stret-0.028
 swollen-0.028
ge-0.026
 sy-0.025
 prope-0.023
Tokenge
Token ID469
Feature activation-9.84e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.000
 scrap+0.000
 stuffed+0.000
Neg logit contributions
uck-0.002
bye-0.002
aph-0.002
ow-0.002
anks-0.002
Token.
Token ID13
Feature activation+2.41e-03

Pos logit contributions
 stret+0.217
 swollen+0.216
ge+0.215
 sy+0.202
 stuffed+0.179
Neg logit contributions
bye-0.132
uck-0.122
enge-0.106
arn-0.104
aph-0.103
Token She
Token ID1375
Feature activation-1.50e-03

Pos logit contributions
bye+0.004
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.004
 stuffed-0.004
Token tried
Token ID3088
Feature activation-9.66e-03

Pos logit contributions
ge+0.003
 stret+0.003
 swollen+0.003
 sy+0.002
 flashing+0.002
Neg logit contributions
bye-0.003
uck-0.003
enge-0.002
arn-0.002
aph-0.002
Token to
Token ID284
Feature activation-4.30e-03

Pos logit contributions
 stret+0.024
ge+0.023
 swollen+0.023
 sy+0.022
 widened+0.021
Neg logit contributions
uck-0.010
bye-0.009
aph-0.008
arn-0.007
anks-0.007
Token unt
Token ID1418
Feature activation+1.08e-02

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.009
uck-0.009
aph-0.008
enge-0.008
arn-0.008
Token cart
Token ID6383
Feature activation-3.95e-03

Pos logit contributions
bye+0.001
uck+0.001
um+0.001
aph+0.001
enge+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 widened-0.001
Token just
Token ID655
Feature activation+2.62e-05

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.006
 sy+0.006
 scrap+0.005
Neg logit contributions
bye-0.007
uck-0.007
arn-0.007
enge-0.006
aph-0.006
Token wouldn
Token ID3636
Feature activation+2.51e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
arn+0.000
aph+0.000
Neg logit contributions
ge-0.000
 swollen-0.000
 stret-0.000
 sy-0.000
 stuffed-0.000
Token't
Token ID470
Feature activation-3.66e-03

Pos logit contributions
uck+0.003
bye+0.003
aph+0.003
um+0.002
arn+0.002
Neg logit contributions
 stret-0.006
 swollen-0.006
ge-0.006
 sy-0.005
 stuffed-0.005
Token bud
Token ID3600
Feature activation-2.97e-03

Pos logit contributions
 stret+0.007
 swollen+0.007
 sy+0.007
ge+0.007
 widened+0.006
Neg logit contributions
bye-0.006
uck-0.005
aph-0.005
iddle-0.004
 Fred-0.004
Tokenge
Token ID469
Feature activation-9.84e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.001
 widened+0.001
 stuffed+0.001
Neg logit contributions
uck-0.009
bye-0.008
aph-0.008
anks-0.008
iddle-0.008
Token.
Token ID13
Feature activation+3.73e-03

Pos logit contributions
ge+0.211
 swollen+0.209
 stret+0.209
 sy+0.194
 stuffed+0.173
Neg logit contributions
bye-0.140
uck-0.127
enge-0.114
aph-0.112
arn-0.112
Token He
Token ID679
Feature activation+1.75e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.006
arn+0.006
aph+0.006
Neg logit contributions
ge-0.006
 stret-0.006
 swollen-0.006
 sy-0.005
 Kel-0.005
Token pushed
Token ID7121
Feature activation-2.36e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.003
 stuffed-0.002
Token and
Token ID290
Feature activation+4.71e-03

Pos logit contributions
ge+0.005
 swollen+0.005
 stret+0.004
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.003
arn-0.003
aph-0.003
Token he
Token ID339
Feature activation-5.20e-03

Pos logit contributions
bye+0.009
uck+0.008
aph+0.007
enge+0.007
arn+0.007
Neg logit contributions
ge-0.008
 stret-0.008
 swollen-0.008
 sy-0.007
 stuffed-0.006
Token,
Token ID11
Feature activation+2.67e-04

Pos logit contributions
bye+0.010
uck+0.010
enge+0.009
aph+0.008
arn+0.008
Neg logit contributions
ge-0.018
 swollen-0.018
 stret-0.017
 sy-0.016
 stuffed-0.015
Token they
Token ID484
Feature activation-2.00e-03

Pos logit contributions
bye+0.000
uck+0.000
aph+0.000
enge+0.000
 Fred+0.000
Neg logit contributions
ge-0.000
 stret-0.000
 swollen-0.000
 sy-0.000
 widened-0.000
Token wouldn
Token ID3636
Feature activation+7.89e-04

Pos logit contributions
ge+0.004
 stret+0.004
 swollen+0.004
 sy+0.004
 Kel+0.003
Neg logit contributions
bye-0.003
uck-0.003
enge-0.003
aph-0.003
apped-0.003
Token't
Token ID470
Feature activation-2.27e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
arn+0.001
anks+0.001
Neg logit contributions
 swollen-0.002
 stret-0.002
ge-0.002
 sy-0.002
 stuffed-0.002
Token bud
Token ID3600
Feature activation-2.03e-03

Pos logit contributions
 swollen+0.005
 stret+0.005
 sy+0.004
ge+0.004
 flashing+0.004
Neg logit contributions
bye-0.003
uck-0.003
aph-0.003
anks-0.003
iddle-0.003
Tokenge
Token ID469
Feature activation-9.83e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.001
 drift+0.000
 stuffed+0.000
Neg logit contributions
uck-0.006
bye-0.005
aph-0.005
anks-0.005
ow-0.005
Token!
Token ID0
Feature activation+6.04e-03

Pos logit contributions
 swollen+0.215
 stret+0.214
ge+0.213
 sy+0.200
 stuffed+0.178
Neg logit contributions
bye-0.134
uck-0.122
enge-0.108
aph-0.106
arn-0.104
Token Tim
Token ID5045
Feature activation-1.04e-02

Pos logit contributions
bye+0.011
uck+0.009
enge+0.009
arn+0.009
aph+0.009
Neg logit contributions
ge-0.011
 stret-0.011
 swollen-0.011
 sy-0.010
 Kel-0.009
Tokenmy
Token ID1820
Feature activation+1.35e-02

Pos logit contributions
ge+0.017
 swollen+0.017
 stret+0.016
 sy+0.015
 stuffed+0.013
Neg logit contributions
bye-0.021
uck-0.019
enge-0.018
aph-0.018
arn-0.018
Token realized
Token ID6939
Feature activation+5.23e-03

Pos logit contributions
bye+0.023
uck+0.022
enge+0.020
arn+0.019
apped+0.019
Neg logit contributions
ge-0.026
 stret-0.025
 swollen-0.024
 sy-0.022
 flashing-0.019
Token he
Token ID339
Feature activation-6.03e-03

Pos logit contributions
bye+0.006
uck+0.005
enge+0.005
aph+0.005
arn+0.005
Neg logit contributions
ge-0.013
 swollen-0.012
 stret-0.012
 sy-0.012
 stuffed-0.011
Token but
Token ID475
Feature activation+9.79e-04

Pos logit contributions
bye+0.001
 Sam+0.001
uck+0.001
 example+0.001
 Fred+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
lest-0.001
Token it
Token ID340
Feature activation+9.81e-03

Pos logit contributions
bye+0.002
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
 stret-0.002
ge-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.001
Token wouldn
Token ID3636
Feature activation+1.86e-03

Pos logit contributions
bye+0.019
uck+0.018
enge+0.017
aph+0.017
arn+0.017
Neg logit contributions
ge-0.016
 swollen-0.016
 stret-0.015
 sy-0.014
 stuffed-0.012
Token't
Token ID470
Feature activation-4.20e-04

Pos logit contributions
uck+0.002
bye+0.002
aph+0.002
um+0.002
arn+0.002
Neg logit contributions
 stret-0.004
 swollen-0.004
ge-0.004
 sy-0.004
 stuffed-0.004
Token bud
Token ID3600
Feature activation-3.83e-03

Pos logit contributions
 stret+0.001
 swollen+0.001
ge+0.001
 sy+0.001
 scrap+0.001
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
iddle-0.000
arn-0.000
Tokenge
Token ID469
Feature activation-9.82e-02

Pos logit contributions
 stret+0.003
 swollen+0.003
 sy+0.002
 stuffed+0.001
 scrap+0.001
Neg logit contributions
uck-0.010
bye-0.010
aph-0.010
ow-0.010
anks-0.010
Token.
Token ID13
Feature activation+3.95e-03

Pos logit contributions
 stret+0.227
 swollen+0.227
ge+0.217
 sy+0.211
 widened+0.188
Neg logit contributions
bye-0.123
uck-0.109
enge-0.094
arn-0.093
aph-0.092
Token His
Token ID2399
Feature activation+1.08e-02

Pos logit contributions
bye+0.006
uck+0.006
enge+0.005
arn+0.005
aph+0.005
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 stuffed-0.007
Token mom
Token ID1995
Feature activation+1.44e-03

Pos logit contributions
bye+0.017
uck+0.015
enge+0.014
arn+0.014
aph+0.014
Neg logit contributions
ge-0.023
 swollen-0.022
 stret-0.022
 sy-0.020
 stuffed-0.019
Token saw
Token ID2497
Feature activation+1.80e-02

Pos logit contributions
uck+0.002
bye+0.002
enge+0.002
arn+0.002
um+0.002
Neg logit contributions
ge-0.003
 stret-0.003
 swollen-0.003
 sy-0.003
 prope-0.002
Token him
Token ID683
Feature activation-6.01e-03

Pos logit contributions
uck+0.020
bye+0.018
arn+0.015
um+0.015
enge+0.015
Neg logit contributions
 swollen-0.043
ge-0.043
 stret-0.041
 sy-0.039
 prope-0.036
Token but
Token ID475
Feature activation+2.60e-03

Pos logit contributions
 Sam+0.003
uck+0.003
bye+0.003
 Polly+0.003
aph+0.003
Neg logit contributions
ge-0.005
 stret-0.004
 swollen-0.004
 sy-0.004
lest-0.004
Token it
Token ID340
Feature activation+1.04e-02

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
 stret-0.005
ge-0.005
 swollen-0.005
 sy-0.005
 widened-0.004
Token wouldn
Token ID3636
Feature activation+1.47e-03

Pos logit contributions
bye+0.022
uck+0.020
aph+0.019
enge+0.019
arn+0.019
Neg logit contributions
ge-0.016
 swollen-0.016
 stret-0.014
 sy-0.013
 stuffed-0.012
Token't
Token ID470
Feature activation-5.02e-04

Pos logit contributions
uck+0.002
bye+0.002
aph+0.001
um+0.001
arn+0.001
Neg logit contributions
 stret-0.003
ge-0.003
 swollen-0.003
 sy-0.003
 stuffed-0.003
Token bud
Token ID3600
Feature activation-2.35e-03

Pos logit contributions
 stret+0.001
 swollen+0.001
ge+0.001
 sy+0.001
 scrap+0.001
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
arn-0.001
enge-0.001
Tokenge
Token ID469
Feature activation-9.82e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.001
 widened+0.000
 stuffed+0.000
Neg logit contributions
bye-0.007
uck-0.007
aph-0.006
anks-0.006
ow-0.006
Token.
Token ID13
Feature activation+6.22e-03

Pos logit contributions
 stret+0.227
 swollen+0.223
ge+0.219
 sy+0.209
 widened+0.187
Neg logit contributions
bye-0.122
uck-0.111
arn-0.096
enge-0.095
aph-0.093
Token
Token ID220
Feature activation-1.92e-02

Pos logit contributions
bye+0.014
iddle+0.012
enge+0.012
aph+0.012
 advisor+0.012
Neg logit contributions
 swollen-0.008
 stret-0.007
ge-0.007
 sy-0.007
Are-0.006
Token
Token ID198
Feature activation-2.21e-03

Pos logit contributions
ge+0.038
 swollen+0.037
 stret+0.037
 sy+0.034
 stuffed+0.030
Neg logit contributions
bye-0.031
uck-0.028
enge-0.026
aph-0.025
arn-0.025
Token
Token ID198
Feature activation-2.35e-03

Pos logit contributions
ge+0.005
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.003
uck-0.003
enge-0.003
aph-0.003
arn-0.003
TokenSuddenly
Token ID38582
Feature activation+1.60e-02

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.004
arn-0.004
enge-0.004
aph-0.004
Token it
Token ID340
Feature activation+9.92e-03

Pos logit contributions
bye+0.004
uck+0.003
enge+0.003
arn+0.003
aph+0.003
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.004
 sy-0.004
 widened-0.004
Token still
Token ID991
Feature activation-2.05e-03

Pos logit contributions
bye+0.019
uck+0.018
enge+0.017
aph+0.017
arn+0.017
Neg logit contributions
ge-0.016
 swollen-0.016
 stret-0.016
 sy-0.014
 stuffed-0.013
Token wouldn
Token ID3636
Feature activation+6.74e-04

Pos logit contributions
ge+0.003
 stret+0.003
 swollen+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.004
enge-0.003
arn-0.003
aph-0.003
Token't
Token ID470
Feature activation-2.61e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
arn+0.001
um+0.001
Neg logit contributions
 stret-0.002
ge-0.002
 swollen-0.002
 sy-0.001
 stuffed-0.001
Token bud
Token ID3600
Feature activation-2.33e-03

Pos logit contributions
 stret+0.006
 swollen+0.005
ge+0.005
 sy+0.005
 prope+0.005
Neg logit contributions
bye-0.004
uck-0.003
 noon-0.003
arn-0.003
aph-0.003
Tokenge
Token ID469
Feature activation-9.82e-02

Pos logit contributions
 stret+0.002
 swollen+0.001
 sy+0.001
 widened+0.001
 prope+0.001
Neg logit contributions
uck-0.006
bye-0.006
ow-0.006
anks-0.006
aph-0.006
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
 stret+0.222
 swollen+0.219
ge+0.217
 sy+0.204
 widened+0.184
Neg logit contributions
bye-0.128
uck-0.115
arn-0.100
enge-0.099
aph-0.097
Token
Token ID220
Feature activation-2.04e-02

Pos logit contributions
bye+0.007
uck+0.007
enge+0.006
aph+0.006
iddle+0.006
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.006
 stuffed-0.006
Token
Token ID198
Feature activation-3.47e-03

Pos logit contributions
ge+0.039
 stret+0.037
 swollen+0.037
 sy+0.034
 Kel+0.030
Neg logit contributions
bye-0.035
uck-0.031
enge-0.030
aph-0.029
arn-0.028
Token
Token ID198
Feature activation-3.54e-03

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.006
 stuffed+0.006
Neg logit contributions
bye-0.005
uck-0.005
enge-0.005
aph-0.005
arn-0.004
TokenAfter
Token ID3260
Feature activation+1.74e-02

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.007
uck-0.006
arn-0.006
enge-0.006
aph-0.005
Token and
Token ID290
Feature activation+1.05e-03

Pos logit contributions
 swollen+0.029
 stret+0.028
ge+0.028
 sy+0.026
 scrap+0.024
Neg logit contributions
bye-0.014
uck-0.013
arn-0.011
anks-0.010
aph-0.010
Token he
Token ID339
Feature activation-5.64e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.001
Token couldn
Token ID3521
Feature activation-1.05e-02

Pos logit contributions
ge+0.011
 stret+0.010
 swollen+0.010
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.010
uck-0.009
enge-0.008
aph-0.008
arn-0.008
Token't
Token ID470
Feature activation-3.54e-03

Pos logit contributions
ge+0.022
 stret+0.021
 swollen+0.021
 sy+0.019
 stuffed+0.018
Neg logit contributions
bye-0.016
uck-0.015
aph-0.013
arn-0.013
enge-0.013
Token bud
Token ID3600
Feature activation-3.56e-03

Pos logit contributions
 swollen+0.007
ge+0.006
 stret+0.006
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.006
uck-0.006
aph-0.005
arn-0.005
iddle-0.005
Tokenge
Token ID469
Feature activation-9.80e-02

Pos logit contributions
 stret+0.003
 swollen+0.003
 sy+0.002
 stuffed+0.002
 widened+0.002
Neg logit contributions
uck-0.009
ow-0.009
aph-0.009
bye-0.009
 home-0.008
Token it
Token ID340
Feature activation+7.94e-03

Pos logit contributions
 stret+0.229
 swollen+0.228
ge+0.220
 sy+0.214
 widened+0.192
Neg logit contributions
bye-0.119
uck-0.106
arn-0.089
enge-0.089
aph-0.089
Token.
Token ID13
Feature activation+4.52e-03

Pos logit contributions
bye+0.012
uck+0.011
enge+0.010
aph+0.010
arn+0.010
Neg logit contributions
ge-0.016
 swollen-0.016
 stret-0.016
 sy-0.015
 stuffed-0.013
Token He
Token ID679
Feature activation+3.33e-03

Pos logit contributions
bye+0.009
uck+0.008
enge+0.007
aph+0.007
iddle+0.007
Neg logit contributions
 swollen-0.008
ge-0.007
 stret-0.007
 sy-0.007
 scrap-0.006
Token was
Token ID373
Feature activation-3.95e-03

Pos logit contributions
bye+0.006
uck+0.006
enge+0.005
aph+0.005
arn+0.005
Neg logit contributions
ge-0.006
 stret-0.006
 swollen-0.006
 sy-0.005
 stuffed-0.005
Token so
Token ID523
Feature activation-1.18e-02

Pos logit contributions
ge+0.009
 stret+0.008
 swollen+0.008
 sy+0.008
 stuffed+0.007
Neg logit contributions
uck-0.005
bye-0.005
iddle-0.004
ow-0.004
arn-0.004
Token but
Token ID475
Feature activation+2.83e-03

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
lest+0.001
Neg logit contributions
bye-0.001
 Sam-0.001
uck-0.001
 noon-0.001
 Jack-0.001
Token it
Token ID340
Feature activation+8.24e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.005
aph+0.004
arn+0.004
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.004
 stuffed-0.004
Token wouldn
Token ID3636
Feature activation+6.85e-04

Pos logit contributions
bye+0.017
uck+0.016
aph+0.015
enge+0.015
arn+0.015
Neg logit contributions
ge-0.012
 swollen-0.012
 stret-0.012
 sy-0.011
 stuffed-0.010
Token't
Token ID470
Feature activation-2.82e-03

Pos logit contributions
uck+0.001
bye+0.001
aph+0.001
arn+0.001
um+0.001
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.001
 stuffed-0.001
Token bud
Token ID3600
Feature activation-3.08e-03

Pos logit contributions
 stret+0.006
 swollen+0.006
ge+0.006
 sy+0.005
 scrap+0.005
Neg logit contributions
bye-0.004
uck-0.003
aph-0.003
arn-0.003
 noon-0.003
Tokenge
Token ID469
Feature activation-9.77e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.001
 scrap+0.001
 widened+0.001
Neg logit contributions
bye-0.009
uck-0.009
aph-0.008
ow-0.008
anks-0.008
Token.
Token ID13
Feature activation+3.20e-03

Pos logit contributions
 swollen+0.214
 stret+0.214
ge+0.212
 sy+0.199
 stuffed+0.176
Neg logit contributions
bye-0.134
uck-0.120
enge-0.107
arn-0.105
aph-0.104
Token Suddenly
Token ID24975
Feature activation+8.63e-04

Pos logit contributions
bye+0.005
uck+0.005
enge+0.005
aph+0.005
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.005
 Kel-0.005
Token,
Token ID11
Feature activation+2.89e-03

Pos logit contributions
bye+0.001
uck+0.001
aph+0.001
enge+0.001
arn+0.001
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.002
Token he
Token ID339
Feature activation-4.05e-03

Pos logit contributions
bye+0.005
uck+0.005
arn+0.004
enge+0.004
aph+0.004
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.005
 stuffed-0.004
Token remembered
Token ID12086
Feature activation+7.33e-03

Pos logit contributions
ge+0.007
 stret+0.006
 swollen+0.006
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.008
uck-0.008
enge-0.007
aph-0.007
arn-0.007
Token but
Token ID475
Feature activation+4.48e-04

Pos logit contributions
 Sam+0.001
 Fred+0.001
 Jack+0.001
 Polly+0.001
uck+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
istered-0.002
Token it
Token ID340
Feature activation+9.41e-03

Pos logit contributions
bye+0.001
uck+0.001
 Sam+0.000
enge+0.000
 noon+0.000
Neg logit contributions
 stret-0.001
ge-0.001
 swollen-0.001
 sy-0.001
 widened-0.001
Token wouldn
Token ID3636
Feature activation+3.76e-04

Pos logit contributions
bye+0.019
uck+0.017
enge+0.016
arn+0.016
aph+0.016
Neg logit contributions
ge-0.015
 swollen-0.015
 stret-0.015
 sy-0.013
 stuffed-0.012
Token't
Token ID470
Feature activation-1.51e-03

Pos logit contributions
uck+0.000
bye+0.000
aph+0.000
um+0.000
arn+0.000
Neg logit contributions
 stret-0.001
ge-0.001
 swollen-0.001
 sy-0.001
 stuffed-0.001
Token bud
Token ID3600
Feature activation-3.50e-03

Pos logit contributions
 stret+0.003
 swollen+0.003
ge+0.003
 sy+0.003
 widened+0.002
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
arn-0.002
 noon-0.002
Tokenge
Token ID469
Feature activation-9.76e-02

Pos logit contributions
 stret+0.002
 swollen+0.002
 sy+0.002
 widened+0.001
 stuffed+0.001
Neg logit contributions
bye-0.010
uck-0.010
aph-0.009
anks-0.009
ow-0.009
Token.
Token ID13
Feature activation+4.56e-03

Pos logit contributions
 stret+0.223
 swollen+0.221
ge+0.215
 sy+0.206
 widened+0.184
Neg logit contributions
bye-0.126
uck-0.114
enge-0.097
arn-0.097
aph-0.096
Token
Token ID220
Feature activation-2.01e-02

Pos logit contributions
bye+0.008
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 Kel-0.006
Token
Token ID198
Feature activation-4.26e-03

Pos logit contributions
ge+0.044
 swollen+0.043
 stret+0.042
 sy+0.039
 stuffed+0.036
Neg logit contributions
bye-0.028
uck-0.027
enge-0.024
arn-0.023
aph-0.023
Token
Token ID198
Feature activation-4.28e-03

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.009
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.006
uck-0.005
enge-0.005
arn-0.005
aph-0.005
TokenTim
Token ID14967
Feature activation-1.46e-02

Pos logit contributions
ge+0.008
 stret+0.008
 swollen+0.008
 sy+0.008
 stuffed+0.007
Neg logit contributions
bye-0.007
uck-0.007
um-0.006
arn-0.006
enge-0.006
Token but
Token ID475
Feature activation+8.27e-04

Pos logit contributions
ge+0.006
 stret+0.006
 swollen+0.006
 sy+0.006
istered+0.006
Neg logit contributions
 Sam-0.004
 Fred-0.004
 Jack-0.004
 Polly-0.003
uck-0.003
Token it
Token ID340
Feature activation+7.36e-03

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
 stret-0.002
ge-0.002
 swollen-0.002
 sy-0.002
 widened-0.001
Token wouldn
Token ID3636
Feature activation+2.08e-03

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
aph+0.012
arn+0.012
Neg logit contributions
ge-0.013
 stret-0.012
 swollen-0.012
 sy-0.011
 stuffed-0.009
Token't
Token ID470
Feature activation-1.79e-03

Pos logit contributions
uck+0.002
bye+0.002
aph+0.002
um+0.002
arn+0.002
Neg logit contributions
 stret-0.005
ge-0.005
 swollen-0.005
 sy-0.005
 stuffed-0.004
Token bud
Token ID3600
Feature activation-4.60e-03

Pos logit contributions
 stret+0.004
 swollen+0.004
ge+0.003
 sy+0.003
 Kel+0.003
Neg logit contributions
bye-0.003
uck-0.002
aph-0.002
arn-0.002
iddle-0.002
Tokenge
Token ID469
Feature activation-9.75e-02

Pos logit contributions
 stret+0.004
 swollen+0.003
 sy+0.002
 widened+0.002
 stuffed+0.002
Neg logit contributions
uck-0.012
bye-0.012
aph-0.012
ow-0.012
anks-0.011
Token.
Token ID13
Feature activation+2.46e-03

Pos logit contributions
 stret+0.226
 swollen+0.223
ge+0.217
 sy+0.210
 widened+0.187
Neg logit contributions
bye-0.123
uck-0.110
enge-0.094
arn-0.093
aph-0.091
Token Tim
Token ID5045
Feature activation-1.27e-02

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.004
 Kel-0.004
Tokenmy
Token ID1820
Feature activation+1.21e-02

Pos logit contributions
ge+0.021
 swollen+0.021
 stret+0.020
 sy+0.018
 stuffed+0.016
Neg logit contributions
bye-0.025
uck-0.023
enge-0.022
aph-0.021
arn-0.021
Token was
Token ID373
Feature activation-3.33e-03

Pos logit contributions
bye+0.022
uck+0.021
enge+0.019
arn+0.018
aph+0.018
Neg logit contributions
ge-0.022
 stret-0.021
 swollen-0.021
 sy-0.019
 flashing-0.016
Token sad
Token ID6507
Feature activation-2.57e-02

Pos logit contributions
ge+0.007
 stret+0.006
 swollen+0.006
 sy+0.006
 widened+0.005
Neg logit contributions
uck-0.005
bye-0.005
enge-0.004
arn-0.004
iddle-0.004
Token but
Token ID475
Feature activation+3.75e-03

Pos logit contributions
ge+0.058
 swollen+0.057
 stret+0.056
 sy+0.052
 stuffed+0.047
Neg logit contributions
bye-0.040
uck-0.038
enge-0.034
aph-0.034
arn-0.034
Token it
Token ID340
Feature activation+9.60e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
 stret-0.008
ge-0.008
 swollen-0.008
 sy-0.007
 widened-0.007
Token wouldn
Token ID3636
Feature activation+2.54e-03

Pos logit contributions
bye+0.019
uck+0.018
enge+0.017
arn+0.017
aph+0.016
Neg logit contributions
ge-0.016
 swollen-0.015
 stret-0.015
 sy-0.013
 stuffed-0.012
Token't
Token ID470
Feature activation-6.84e-04

Pos logit contributions
uck+0.003
bye+0.003
aph+0.002
arn+0.002
um+0.002
Neg logit contributions
 stret-0.006
ge-0.006
 swollen-0.006
 sy-0.005
 stuffed-0.005
Token bud
Token ID3600
Feature activation-1.50e-03

Pos logit contributions
 stret+0.001
 swollen+0.001
ge+0.001
 sy+0.001
 widened+0.001
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
 noon-0.001
arn-0.001
Tokenge
Token ID469
Feature activation-9.74e-02

Pos logit contributions
 stret+0.001
 swollen+0.001
 sy+0.001
 widened+0.000
 stuffed+0.000
Neg logit contributions
bye-0.004
uck-0.004
aph-0.004
anks-0.004
ow-0.004
Token.
Token ID13
Feature activation+5.16e-03

Pos logit contributions
 stret+0.221
 swollen+0.219
ge+0.215
 sy+0.204
 widened+0.183
Neg logit contributions
bye-0.125
uck-0.114
arn-0.098
enge-0.098
aph-0.096
Token
Token ID220
Feature activation-2.04e-02

Pos logit contributions
bye+0.011
enge+0.010
iddle+0.010
aph+0.010
 advisor+0.009
Neg logit contributions
 swollen-0.007
ge-0.007
 stret-0.007
 sy-0.006
 Kel-0.006
Token
Token ID198
Feature activation-3.77e-03

Pos logit contributions
ge+0.042
 swollen+0.041
 stret+0.040
 sy+0.037
 stuffed+0.033
Neg logit contributions
bye-0.032
uck-0.029
enge-0.027
aph-0.026
arn-0.026
Token
Token ID198
Feature activation-3.90e-03

Pos logit contributions
ge+0.008
 swollen+0.008
 stret+0.008
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.006
uck-0.005
enge-0.005
aph-0.005
arn-0.005
TokenL
Token ID43
Feature activation+2.21e-02

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.006
 stuffed+0.006
Neg logit contributions
bye-0.007
uck-0.007
arn-0.006
enge-0.006
aph-0.006
Token
Token ID198
Feature activation-5.97e-03

Pos logit contributions
 swollen+0.013
ge+0.013
 stret+0.013
 sy+0.013
 stuffed+0.011
Neg logit contributions
bye-0.009
uck-0.009
arn-0.008
enge-0.008
um-0.007
TokenJohn
Token ID7554
Feature activation-6.33e-03

Pos logit contributions
ge+0.014
 swollen+0.014
 stret+0.014
 sy+0.013
 prope+0.012
Neg logit contributions
uck-0.008
bye-0.007
um-0.006
 Now-0.006
arn-0.006
Token,
Token ID11
Feature activation+2.68e-03

Pos logit contributions
ge+0.012
 swollen+0.012
 stret+0.012
 sy+0.011
 flashing+0.009
Neg logit contributions
bye-0.010
uck-0.010
arn-0.009
enge-0.009
aph-0.009
Token a
Token ID257
Feature activation+1.57e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.006
 swollen-0.005
 stret-0.005
 sy-0.005
 stuffed-0.005
Token 3
Token ID513
Feature activation-9.33e-04

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.003
 widened-0.002
Token-
Token ID12
Feature activation+7.88e-02

Pos logit contributions
ge+0.002
 stret+0.002
 swollen+0.002
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Tokenyear
Token ID1941
Feature activation+4.84e-02

Pos logit contributions
bye+0.104
uck+0.094
enge+0.086
aph+0.085
arn+0.085
Neg logit contributions
ge-0.178
 swollen-0.172
 stret-0.172
 sy-0.160
 stuffed-0.145
Token-
Token ID12
Feature activation+7.86e-02

Pos logit contributions
bye+0.073
uck+0.071
enge+0.070
aph+0.067
arn+0.065
Neg logit contributions
ge-0.092
 stret-0.090
 swollen-0.089
 sy-0.083
 stuffed-0.076
Tokenold
Token ID727
Feature activation-9.15e-03

Pos logit contributions
bye+0.130
uck+0.122
aph+0.115
enge+0.114
arn+0.113
Neg logit contributions
ge-0.149
 stret-0.140
 swollen-0.139
 sy-0.128
 stuffed-0.112
Token boy
Token ID2933
Feature activation+1.61e-03

Pos logit contributions
ge+0.017
 stret+0.016
 swollen+0.016
 sy+0.015
 stuffed+0.014
Neg logit contributions
bye-0.016
uck-0.015
arn-0.014
aph-0.014
enge-0.014
Token,
Token ID11
Feature activation+1.20e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.003
 stuffed-0.003
Token said
Token ID531
Feature activation+2.58e-03

Pos logit contributions
bye+0.011
uck+0.010
enge+0.009
arn+0.009
aph+0.009
Neg logit contributions
ge-0.014
 stret-0.013
 swollen-0.013
 sy-0.012
 flashing-0.010
Token he
Token ID339
Feature activation-4.93e-03

Pos logit contributions
bye+0.005
uck+0.005
aph+0.004
enge+0.004
arn+0.004
Neg logit contributions
ge-0.005
 stret-0.004
 swollen-0.004
 sy-0.004
 stuffed-0.003
Token wanted
Token ID2227
Feature activation-3.01e-03

Pos logit contributions
ge+0.009
 stret+0.009
 swollen+0.009
 sy+0.008
 drift+0.007
Neg logit contributions
bye-0.009
uck-0.008
enge-0.008
arn-0.007
aph-0.007
Token a
Token ID257
Feature activation-4.46e-04

Pos logit contributions
 swollen+0.006
ge+0.005
 stret+0.005
 sy+0.005
 stuffed+0.005
Neg logit contributions
bye-0.005
uck-0.005
enge-0.005
aph-0.005
apped-0.004
Token band
Token ID4097
Feature activation+1.59e-02

Pos logit contributions
ge+0.001
 swollen+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token-
Token ID12
Feature activation+7.85e-02

Pos logit contributions
bye+0.025
uck+0.024
arn+0.023
aph+0.023
enge+0.023
Neg logit contributions
ge-0.031
 stret-0.030
 swollen-0.029
 sy-0.028
 stuffed-0.024
Tokenaid
Token ID1698
Feature activation+2.47e-02

Pos logit contributions
bye+0.146
uck+0.139
arn+0.134
aph+0.134
enge+0.133
Neg logit contributions
ge-0.127
 swollen-0.123
 stret-0.119
 sy-0.110
 scrap-0.093
Token.
Token ID13
Feature activation+4.45e-03

Pos logit contributions
bye+0.042
uck+0.042
enge+0.041
arn+0.040
aph+0.039
Neg logit contributions
ge-0.042
 stret-0.041
 swollen-0.039
 sy-0.039
 stuffed-0.033
Token
Token ID198
Feature activation+2.75e-03

Pos logit contributions
bye+0.009
uck+0.008
aph+0.008
enge+0.008
arn+0.007
Neg logit contributions
 swollen-0.007
 stret-0.007
ge-0.007
 sy-0.007
 scrap-0.006
Token
Token ID198
Feature activation+2.21e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.006
 swollen-0.005
 stret-0.005
 sy-0.005
 stuffed-0.004
TokenHis
Token ID6653
Feature activation+2.03e-02

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
arn+0.004
um+0.004
Neg logit contributions
ge-0.004
 stret-0.003
 swollen-0.003
 sy-0.003
 stuffed-0.003
Token
Token ID198
Feature activation-3.82e-03

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.008
 sy+0.008
 stuffed+0.007
Neg logit contributions
bye-0.005
uck-0.005
arn-0.004
enge-0.004
aph-0.004
TokenThe
Token ID464
Feature activation+3.79e-03

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.007
uck-0.006
um-0.005
enge-0.005
aph-0.005
Token moral
Token ID6573
Feature activation+2.17e-02

Pos logit contributions
bye+0.006
uck+0.006
aph+0.005
enge+0.005
arn+0.005
Neg logit contributions
ge-0.008
 stret-0.007
 swollen-0.007
 sy-0.007
 stuffed-0.006
Token of
Token ID286
Feature activation+1.80e-03

Pos logit contributions
bye+0.039
uck+0.037
aph+0.037
enge+0.036
anks+0.036
Neg logit contributions
ge-0.034
 swollen-0.033
 stret-0.031
 sy-0.030
 flashing-0.026
Token this
Token ID428
Feature activation-6.09e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.003
arn+0.003
Neg logit contributions
ge-0.003
 swollen-0.003
 stret-0.003
 sy-0.003
 stuffed-0.003
Token story
Token ID1621
Feature activation+5.09e-02

Pos logit contributions
 swollen+0.011
ge+0.011
 stret+0.010
 sy+0.010
 drift+0.009
Neg logit contributions
bye-0.010
enge-0.009
uck-0.009
aph-0.009
anks-0.008
Token is
Token ID318
Feature activation+4.22e-03

Pos logit contributions
bye+0.104
uck+0.096
aph+0.095
enge+0.094
arn+0.091
Neg logit contributions
ge-0.070
 swollen-0.066
 stret-0.058
 sy-0.058
 flashing-0.048
Token that
Token ID326
Feature activation+6.64e-03

Pos logit contributions
bye+0.006
enge+0.006
oline+0.005
uck+0.005
aph+0.005
Neg logit contributions
ge-0.008
 swollen-0.008
 sy-0.007
 stret-0.007
 pol-0.007
Token we
Token ID356
Feature activation-4.51e-03

Pos logit contributions
bye+0.010
uck+0.009
aph+0.008
enge+0.008
arn+0.008
Neg logit contributions
 stret-0.014
ge-0.014
 swollen-0.013
 sy-0.013
 widened-0.011
Token should
Token ID815
Feature activation+1.26e-02

Pos logit contributions
ge+0.009
 stret+0.009
 swollen+0.009
 sy+0.008
 stuffed+0.007
Neg logit contributions
bye-0.007
uck-0.007
enge-0.006
aph-0.006
arn-0.006
Token always
Token ID1464
Feature activation+5.01e-03

Pos logit contributions
bye+0.019
uck+0.017
enge+0.016
aph+0.015
arn+0.015
Neg logit contributions
ge-0.027
 swollen-0.026
 stret-0.026
 sy-0.024
 stuffed-0.022
Token
Token ID198
Feature activation-6.00e-03

Pos logit contributions
ge+0.016
 swollen+0.016
 stret+0.015
 sy+0.015
 stuffed+0.013
Neg logit contributions
bye-0.008
uck-0.008
arn-0.007
aph-0.007
enge-0.007
TokenThe
Token ID464
Feature activation+1.37e-03

Pos logit contributions
ge+0.012
 swollen+0.012
 stret+0.011
 sy+0.011
 stuffed+0.010
Neg logit contributions
uck-0.009
bye-0.009
aph-0.008
arn-0.008
enge-0.008
Token moral
Token ID6573
Feature activation+2.01e-02

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
 swollen-0.003
 stret-0.003
ge-0.002
 sy-0.002
 stuffed-0.002
Token of
Token ID286
Feature activation+1.07e-03

Pos logit contributions
bye+0.036
uck+0.034
aph+0.034
enge+0.033
anks+0.033
Neg logit contributions
ge-0.031
 swollen-0.030
 stret-0.029
 sy-0.028
 flashing-0.025
Token the
Token ID262
Feature activation+5.76e-04

Pos logit contributions
bye+0.002
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Token story
Token ID1621
Feature activation+5.10e-02

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
apped+0.001
arn+0.001
Neg logit contributions
 stret-0.001
 swollen-0.001
ge-0.001
 sy-0.001
 stuffed-0.001
Token is
Token ID318
Feature activation+3.89e-03

Pos logit contributions
bye+0.103
uck+0.096
aph+0.094
enge+0.092
arn+0.090
Neg logit contributions
ge-0.072
 swollen-0.068
 stret-0.064
 sy-0.061
 Kel-0.053
Token that
Token ID326
Feature activation+4.84e-03

Pos logit contributions
bye+0.006
enge+0.005
aph+0.005
uck+0.005
umm+0.005
Neg logit contributions
ge-0.008
 swollen-0.007
 stret-0.007
 sy-0.007
 Kel-0.006
Token it
Token ID340
Feature activation+8.68e-03

Pos logit contributions
bye+0.008
uck+0.007
enge+0.006
aph+0.006
arn+0.006
Neg logit contributions
ge-0.010
 stret-0.010
 swollen-0.010
 sy-0.009
 widened-0.008
Token's
Token ID338
Feature activation+7.86e-03

Pos logit contributions
bye+0.017
uck+0.016
enge+0.015
arn+0.014
aph+0.014
Neg logit contributions
ge-0.015
 stret-0.014
 swollen-0.014
 sy-0.013
 stuffed-0.011
Token important
Token ID1593
Feature activation+3.36e-02

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
aph+0.012
arn+0.012
Neg logit contributions
ge-0.015
 swollen-0.015
 stret-0.014
 sy-0.013
 stuffed-0.011
Token she
Token ID673
Feature activation-1.09e-02

Pos logit contributions
ge+0.026
 stret+0.025
 swollen+0.025
 sy+0.023
 stuffed+0.020
Neg logit contributions
bye-0.026
uck-0.024
enge-0.022
arn-0.022
aph-0.022
Token said
Token ID531
Feature activation+2.62e-04

Pos logit contributions
ge+0.017
 stret+0.017
 swollen+0.017
 sy+0.015
 stuffed+0.015
Neg logit contributions
bye-0.022
uck-0.020
aph-0.019
arn-0.019
enge-0.019
Token,
Token ID11
Feature activation-1.09e-03

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
arn+0.000
aph+0.000
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 scrap-0.000
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.001
arn-0.001
TokenY
Token ID56
Feature activation+1.53e-03

Pos logit contributions
ge+0.028
 swollen+0.027
 stret+0.027
 sy+0.024
 stuffed+0.022
Neg logit contributions
bye-0.023
uck-0.022
enge-0.020
aph-0.020
arn-0.020
Tokenay
Token ID323
Feature activation+5.16e-02

Pos logit contributions
bye+0.001
aph+0.001
enge+0.001
 advisor+0.001
iddle+0.001
Neg logit contributions
ge-0.004
 stret-0.004
 swollen-0.004
 sy-0.004
 drift-0.003
Token,
Token ID11
Feature activation-2.78e-03

Pos logit contributions
bye+0.084
uck+0.077
enge+0.072
aph+0.071
arn+0.070
Neg logit contributions
ge-0.102
 swollen-0.099
 stret-0.098
 sy-0.090
 stuffed-0.081
Token pancakes
Token ID47008
Feature activation-8.28e-03

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.005
 widened+0.005
Neg logit contributions
bye-0.004
uck-0.003
enge-0.003
arn-0.003
aph-0.003
Token!"
Token ID2474
Feature activation+1.18e-02

Pos logit contributions
ge+0.016
 swollen+0.016
 stret+0.016
 sy+0.014
 stuffed+0.013
Neg logit contributions
bye-0.014
uck-0.013
enge-0.012
aph-0.012
arn-0.011
Token They
Token ID1119
Feature activation+6.19e-03

Pos logit contributions
bye+0.021
enge+0.020
uck+0.019
aph+0.019
iddle+0.019
Neg logit contributions
 stret-0.021
ge-0.019
 sy-0.018
 swollen-0.018
 stuffed-0.017
Token went
Token ID1816
Feature activation+2.04e-02

Pos logit contributions
bye+0.009
uck+0.008
enge+0.007
apped+0.007
aph+0.007
Neg logit contributions
ge-0.013
 swollen-0.013
 stret-0.013
 sy-0.012
 Kel-0.011
Token.
Token ID13
Feature activation+3.02e-04

Pos logit contributions
ge+0.038
 stret+0.037
 swollen+0.037
 sy+0.034
 stuffed+0.030
Neg logit contributions
bye-0.025
uck-0.023
enge-0.020
aph-0.020
arn-0.020
Token It
Token ID632
Feature activation+1.33e-03

Pos logit contributions
uck+0.001
bye+0.001
enge+0.001
aph+0.000
anks+0.000
Neg logit contributions
ge-0.000
 stret-0.000
 swollen-0.000
 sy-0.000
 stuffed-0.000
Token was
Token ID373
Feature activation-2.50e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.003
 stret-0.003
 swollen-0.002
 sy-0.002
 Kel-0.002
Token a
Token ID257
Feature activation-9.13e-04

Pos logit contributions
 swollen+0.005
ge+0.004
 stret+0.004
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.004
aph-0.004
arn-0.004
Token sm
Token ID895
Feature activation+1.80e-02

Pos logit contributions
 swollen+0.001
ge+0.001
 stret+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.002
uck-0.002
arn-0.002
enge-0.002
anks-0.002
Tokenelly
Token ID6148
Feature activation-1.37e-03

Pos logit contributions
anks+0.027
bye+0.027
uck+0.026
 Day+0.026
 Fran+0.025
Neg logit contributions
ge-0.027
 swollen-0.025
 sy-0.024
 numb-0.023
 stret-0.022
Token fish
Token ID5916
Feature activation-9.83e-03

Pos logit contributions
 swollen+0.002
ge+0.002
 stret+0.002
 sy+0.002
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
arn-0.002
aph-0.002
Token,
Token ID11
Feature activation+1.63e-04

Pos logit contributions
ge+0.018
 swollen+0.017
 stret+0.016
 sy+0.016
 stuffed+0.014
Neg logit contributions
bye-0.017
uck-0.016
enge-0.015
aph-0.015
arn-0.015
Token but
Token ID475
Feature activation-4.50e-04

Pos logit contributions
bye+0.000
uck+0.000
enge+0.000
aph+0.000
 Fred+0.000
Neg logit contributions
ge-0.000
 stret-0.000
 swollen-0.000
 sy-0.000
 widened-0.000
Token he
Token ID339
Feature activation-3.39e-03

Pos logit contributions
ge+0.001
 stret+0.001
 swollen+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token liked
Token ID8288
Feature activation+7.48e-03

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.006
uck-0.006
enge-0.006
aph-0.005
arn-0.005
Token you
Token ID345
Feature activation+1.29e-03

Pos logit contributions
ge+0.034
 swollen+0.033
 stret+0.033
 sy+0.030
 stuffed+0.027
Neg logit contributions
bye-0.021
uck-0.020
enge-0.018
aph-0.017
arn-0.017
Token can
Token ID460
Feature activation+2.28e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 scrap-0.002
Token do
Token ID466
Feature activation-9.41e-03

Pos logit contributions
bye+0.004
uck+0.004
arn+0.003
enge+0.003
aph+0.003
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 stuffed-0.003
Token it
Token ID340
Feature activation+6.86e-03

Pos logit contributions
ge+0.021
 swollen+0.021
 stret+0.020
 sy+0.019
 widened+0.017
Neg logit contributions
bye-0.012
uck-0.011
arn-0.010
aph-0.010
anks-0.009
Token!
Token ID0
Feature activation-5.37e-04

Pos logit contributions
bye+0.012
uck+0.011
enge+0.011
aph+0.010
arn+0.010
Neg logit contributions
ge-0.013
 stret-0.012
 swollen-0.011
 sy-0.011
 stuffed-0.010
Token Now
Token ID2735
Feature activation+5.64e-03

Pos logit contributions
ge+0.001
 swollen+0.001
 sy+0.001
 stret+0.001
 Kel+0.001
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token run
Token ID1057
Feature activation-1.00e-02

Pos logit contributions
bye+0.009
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.012
 swollen-0.011
 stret-0.011
 sy-0.010
 stuffed-0.009
Token along
Token ID1863
Feature activation-2.64e-02

Pos logit contributions
ge+0.019
 stret+0.019
 swollen+0.017
 sy+0.016
 stuffed+0.015
Neg logit contributions
bye-0.017
uck-0.016
enge-0.015
arn-0.015
aph-0.015
Token and
Token ID290
Feature activation-3.03e-03

Pos logit contributions
ge+0.053
 stret+0.050
 swollen+0.050
 sy+0.046
 stuffed+0.043
Neg logit contributions
bye-0.043
uck-0.040
enge-0.037
arn-0.036
aph-0.036
Token have
Token ID423
Feature activation-8.83e-03

Pos logit contributions
ge+0.006
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.005
uck-0.005
enge-0.005
aph-0.005
arn-0.005
Token fun
Token ID1257
Feature activation+2.64e-02

Pos logit contributions
ge+0.019
 swollen+0.018
 stret+0.018
 sy+0.017
 stuffed+0.015
Neg logit contributions
bye-0.013
uck-0.012
enge-0.011
aph-0.011
arn-0.011
Tokenily
Token ID813
Feature activation+6.88e-03

Pos logit contributions
bye+0.024
uck+0.019
enge+0.019
aph+0.017
arn+0.017
Neg logit contributions
ge-0.054
 stret-0.051
 swollen-0.050
 sy-0.047
 stuffed-0.044
Token and
Token ID290
Feature activation-1.94e-03

Pos logit contributions
bye+0.012
uck+0.012
enge+0.011
aph+0.011
arn+0.011
Neg logit contributions
ge-0.012
 swollen-0.012
 stret-0.012
 sy-0.011
 stuffed-0.010
Token Ben
Token ID3932
Feature activation+1.02e-02

Pos logit contributions
ge+0.004
 stret+0.004
 swollen+0.004
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.003
uck-0.003
arn-0.003
enge-0.003
aph-0.003
Token were
Token ID547
Feature activation-1.16e-02

Pos logit contributions
bye+0.016
uck+0.015
enge+0.013
apped+0.012
aph+0.012
Neg logit contributions
 swollen-0.022
ge-0.021
 stret-0.021
 sy-0.019
 flashing-0.017
Token going
Token ID1016
Feature activation-1.98e-03

Pos logit contributions
ge+0.019
 swollen+0.019
 stret+0.019
 sy+0.017
 stuffed+0.016
Neg logit contributions
bye-0.022
uck-0.019
enge-0.019
aph-0.019
arn-0.019
Token on
Token ID319
Feature activation+6.51e-04

Pos logit contributions
ge+0.004
 stret+0.003
 swollen+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.003
arn-0.003
uck-0.003
enge-0.003
aph-0.003
Token a
Token ID257
Feature activation+3.02e-04

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
aph+0.001
arn+0.001
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 stuffed-0.001
Token trip
Token ID5296
Feature activation+1.74e-02

Pos logit contributions
bye+0.000
uck+0.000
aph+0.000
arn+0.000
enge+0.000
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 widened-0.000
Token with
Token ID351
Feature activation-6.61e-03

Pos logit contributions
bye+0.029
uck+0.027
enge+0.026
aph+0.026
arn+0.026
Neg logit contributions
ge-0.033
 stret-0.030
 swollen-0.027
 sy-0.026
 flashing-0.024
Token Mom
Token ID11254
Feature activation-1.36e-02

Pos logit contributions
ge+0.012
 swollen+0.011
 stret+0.011
 sy+0.010
 stuffed+0.009
Neg logit contributions
bye-0.012
uck-0.011
enge-0.011
aph-0.010
arn-0.010
Token and
Token ID290
Feature activation-2.16e-03

Pos logit contributions
 swollen+0.031
ge+0.030
 stret+0.029
 sy+0.027
 prope+0.025
Neg logit contributions
bye-0.018
uck-0.017
arn-0.015
enge-0.014
aph-0.014
Token for
Token ID329
Feature activation+8.47e-03

Pos logit contributions
bye+0.036
enge+0.033
uck+0.031
arn+0.031
aph+0.031
Neg logit contributions
 stret-0.040
ge-0.040
 sy-0.038
 swollen-0.037
 stuffed-0.034
Token a
Token ID257
Feature activation-1.23e-03

Pos logit contributions
bye+0.016
uck+0.015
enge+0.015
iddle+0.014
aph+0.014
Neg logit contributions
 stret-0.015
 swollen-0.015
ge-0.014
 sy-0.013
 stuffed-0.013
Token walk
Token ID2513
Feature activation+1.46e-03

Pos logit contributions
ge+0.002
 stret+0.002
 swollen+0.002
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.003
uck-0.003
enge-0.003
arn-0.003
aph-0.003
Token.
Token ID13
Feature activation+2.64e-03

Pos logit contributions
enge+0.003
uck+0.003
aph+0.003
bye+0.003
arn+0.003
Neg logit contributions
ge-0.002
 stret-0.002
 swollen-0.002
 sy-0.002
 stuffed-0.002
Token The
Token ID383
Feature activation-1.49e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
aph+0.004
iddle+0.004
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.005
 stuffed-0.004
Token sky
Token ID6766
Feature activation+1.64e-02

Pos logit contributions
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.003
 tissue+0.003
Neg logit contributions
aph-0.002
enge-0.002
uck-0.002
bye-0.002
 advisor-0.002
Token was
Token ID373
Feature activation-3.26e-03

Pos logit contributions
bye+0.033
uck+0.031
aph+0.029
enge+0.029
arn+0.029
Neg logit contributions
ge-0.025
 swollen-0.025
 stret-0.024
 sy-0.022
 stuffed-0.019
Token blue
Token ID4171
Feature activation+5.58e-04

Pos logit contributions
 swollen+0.005
ge+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.006
uck-0.006
enge-0.006
aph-0.006
arn-0.006
Token and
Token ID290
Feature activation+2.73e-03

Pos logit contributions
bye+0.001
enge+0.001
aph+0.001
 advisor+0.001
anks+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.000
 flashing-0.000
Token the
Token ID262
Feature activation+2.53e-03

Pos logit contributions
bye+0.005
uck+0.005
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.004
 stuffed-0.004
Token sun
Token ID4252
Feature activation+1.69e-02

Pos logit contributions
uck+0.004
bye+0.003
enge+0.003
arn+0.003
anks+0.003
Neg logit contributions
 swollen-0.005
 stret-0.005
 sy-0.005
ge-0.005
 stuffed-0.005
Token so
Token ID523
Feature activation-1.17e-02

Pos logit contributions
ge+0.023
 stret+0.022
 swollen+0.022
 sy+0.020
 stuffed+0.017
Neg logit contributions
bye-0.025
uck-0.024
enge-0.022
aph-0.022
arn-0.022
Token sad
Token ID6507
Feature activation-2.56e-02

Pos logit contributions
ge+0.017
 swollen+0.017
 stret+0.016
 sy+0.014
 stuffed+0.012
Neg logit contributions
bye-0.025
uck-0.024
enge-0.023
arn-0.022
aph-0.022
Token and
Token ID290
Feature activation+1.59e-03

Pos logit contributions
ge+0.048
 swollen+0.045
 stret+0.044
 sy+0.042
 stuffed+0.037
Neg logit contributions
bye-0.046
uck-0.041
enge-0.041
aph-0.038
arn-0.038
Token she
Token ID673
Feature activation-5.85e-03

Pos logit contributions
bye+0.003
uck+0.003
arn+0.003
enge+0.003
aph+0.003
Neg logit contributions
 swollen-0.003
ge-0.002
 stret-0.002
 sy-0.002
 scrap-0.002
Token said
Token ID531
Feature activation+4.67e-03

Pos logit contributions
ge+0.010
 swollen+0.010
 stret+0.010
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.011
uck-0.010
enge-0.009
aph-0.009
arn-0.009
Token that
Token ID326
Feature activation+7.59e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.006
arn+0.005
aph+0.005
Neg logit contributions
ge-0.010
 stret-0.010
 swollen-0.010
 sy-0.009
 stuffed-0.008
Token she
Token ID673
Feature activation-4.71e-03

Pos logit contributions
bye+0.013
uck+0.013
enge+0.012
aph+0.011
arn+0.011
Neg logit contributions
ge-0.014
 stret-0.013
 swollen-0.013
 sy-0.012
 stuffed-0.010
Token had
Token ID550
Feature activation-1.37e-02

Pos logit contributions
ge+0.008
 swollen+0.007
 stret+0.007
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.009
uck-0.009
enge-0.008
aph-0.008
arn-0.008
Token made
Token ID925
Feature activation+5.08e-03

Pos logit contributions
ge+0.028
 swollen+0.027
 stret+0.027
 sy+0.025
 stuffed+0.023
Neg logit contributions
bye-0.021
uck-0.020
enge-0.018
aph-0.018
arn-0.018
Token a
Token ID257
Feature activation-2.41e-03

Pos logit contributions
bye+0.010
enge+0.009
aph+0.008
uck+0.008
apped+0.008
Neg logit contributions
 swollen-0.008
ge-0.008
 stret-0.008
 sy-0.007
 stuffed-0.007
Token promise
Token ID6991
Feature activation-1.62e-02

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.005
uck-0.004
enge-0.004
aph-0.004
arn-0.004
Token loved
Token ID6151
Feature activation+9.52e-03

Pos logit contributions
bye+0.008
uck+0.008
aph+0.007
enge+0.007
arn+0.007
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.007
 stuffed-0.006
Token to
Token ID284
Feature activation-4.74e-03

Pos logit contributions
bye+0.013
uck+0.012
enge+0.011
arn+0.011
aph+0.011
Neg logit contributions
ge-0.021
 swollen-0.020
 stret-0.020
 sy-0.019
 stuffed-0.017
Token play
Token ID711
Feature activation-1.19e-02

Pos logit contributions
ge+0.009
 swollen+0.009
 stret+0.009
 sy+0.008
 stuffed+0.007
Neg logit contributions
bye-0.008
uck-0.007
aph-0.007
enge-0.007
arn-0.006
Token with
Token ID351
Feature activation-6.54e-03

Pos logit contributions
ge+0.025
 stret+0.025
 swollen+0.024
 sy+0.023
 stuffed+0.021
Neg logit contributions
bye-0.018
uck-0.016
enge-0.015
arn-0.015
aph-0.015
Token his
Token ID465
Feature activation-1.15e-03

Pos logit contributions
ge+0.011
 swollen+0.011
 stret+0.011
 sy+0.010
 stuffed+0.008
Neg logit contributions
bye-0.013
uck-0.012
enge-0.011
arn-0.011
aph-0.011
Token toy
Token ID13373
Feature activation-3.38e-03

Pos logit contributions
ge+0.002
 stret+0.002
 swollen+0.002
 sy+0.002
 widened+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token helicopter
Token ID15392
Feature activation-5.07e-04

Pos logit contributions
ge+0.006
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.007
uck-0.006
enge-0.006
arn-0.006
aph-0.006
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
ge+0.001
 stret+0.001
 swollen+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
arn-0.001
aph-0.001
Token He
Token ID679
Feature activation-9.01e-05

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.004
uck-0.004
enge-0.003
aph-0.003
arn-0.003
Token liked
Token ID8288
Feature activation+7.92e-03

Pos logit contributions
ge+0.000
 stret+0.000
 swollen+0.000
 sy+0.000
 Kel+0.000
Neg logit contributions
bye-0.000
enge-0.000
uck-0.000
apped-0.000
aph-0.000
Token to
Token ID284
Feature activation-4.95e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.005
aph+0.004
arn+0.004
Neg logit contributions
ge-0.022
 swollen-0.021
 stret-0.021
 sy-0.019
 widened-0.018
Token do
Token ID466
Feature activation-4.99e-03

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.005
uck-0.004
enge-0.004
arn-0.004
aph-0.004
Token it
Token ID340
Feature activation+7.98e-03

Pos logit contributions
ge+0.010
 stret+0.010
 swollen+0.010
 sy+0.009
 stuffed+0.008
Neg logit contributions
bye-0.008
uck-0.007
aph-0.007
enge-0.007
arn-0.006
Token.
Token ID13
Feature activation+5.04e-04

Pos logit contributions
bye+0.015
enge+0.013
uck+0.013
arn+0.012
aph+0.012
Neg logit contributions
ge-0.015
 swollen-0.013
 stret-0.013
 sy-0.012
 stuffed-0.011
Token But
Token ID887
Feature activation+1.17e-03

Pos logit contributions
bye+0.001
enge+0.001
uck+0.001
iddle+0.001
aph+0.001
Neg logit contributions
 swollen-0.001
 stret-0.001
ge-0.001
 sy-0.001
 scrap-0.001
Token then
Token ID788
Feature activation-4.25e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
arn+0.002
aph+0.002
Neg logit contributions
ge-0.002
 swollen-0.002
 stret-0.002
 sy-0.002
 stuffed-0.002
Token she
Token ID673
Feature activation-8.99e-03

Pos logit contributions
ge+0.008
 swollen+0.008
 stret+0.007
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.008
uck-0.007
enge-0.007
arn-0.007
aph-0.007
Token realized
Token ID6939
Feature activation+5.57e-03

Pos logit contributions
ge+0.015
 swollen+0.014
 stret+0.014
 sy+0.013
 stuffed+0.011
Neg logit contributions
bye-0.018
uck-0.017
enge-0.016
aph-0.015
arn-0.015
Token that
Token ID326
Feature activation+6.96e-03

Pos logit contributions
bye+0.007
uck+0.006
enge+0.005
aph+0.005
arn+0.005
Neg logit contributions
ge-0.013
 swollen-0.013
 stret-0.013
 sy-0.012
 stuffed-0.011
Token it
Token ID340
Feature activation+9.21e-03

Pos logit contributions
bye+0.010
uck+0.010
aph+0.008
enge+0.008
arn+0.008
Neg logit contributions
 stret-0.015
ge-0.015
 swollen-0.013
 sy-0.013
 widened-0.012
Token was
Token ID373
Feature activation-2.51e-03

Pos logit contributions
bye+0.019
uck+0.017
enge+0.016
aph+0.016
arn+0.016
Neg logit contributions
ge-0.015
 swollen-0.014
 stret-0.014
 sy-0.013
 stuffed-0.011
Token okay
Token ID8788
Feature activation+8.36e-03

Pos logit contributions
ge+0.005
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.003
aph-0.003
arn-0.003
Token it
Token ID340
Feature activation+9.97e-03

Pos logit contributions
ge+0.008
 swollen+0.008
 stret+0.008
 sy+0.007
 stuffed+0.007
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
aph-0.005
arn-0.005
Token worked
Token ID3111
Feature activation-1.25e-02

Pos logit contributions
bye+0.021
uck+0.019
arn+0.018
enge+0.018
aph+0.017
Neg logit contributions
ge-0.016
 swollen-0.015
 stret-0.014
 sy-0.014
 stuffed-0.013
Token.
Token ID13
Feature activation+4.37e-03

Pos logit contributions
ge+0.024
 swollen+0.022
 stret+0.022
 sy+0.020
 stuffed+0.018
Neg logit contributions
bye-0.022
uck-0.020
enge-0.019
arn-0.019
aph-0.018
Token Every
Token ID3887
Feature activation-8.90e-03

Pos logit contributions
bye+0.008
enge+0.007
uck+0.007
aph+0.007
arn+0.006
Neg logit contributions
ge-0.006
 swollen-0.006
 stret-0.006
 sy-0.006
 Kel-0.006
Token time
Token ID640
Feature activation+1.53e-02

Pos logit contributions
ge+0.023
 swollen+0.022
 stret+0.022
 sy+0.021
 stuffed+0.019
Neg logit contributions
bye-0.009
uck-0.008
enge-0.007
aph-0.007
arn-0.007
Token she
Token ID673
Feature activation-7.29e-03

Pos logit contributions
bye+0.028
uck+0.026
enge+0.024
aph+0.024
arn+0.024
Neg logit contributions
ge-0.027
 swollen-0.026
 stret-0.026
 sy-0.024
 stuffed-0.021
Token put
Token ID1234
Feature activation+7.29e-03

Pos logit contributions
ge+0.012
 swollen+0.011
 stret+0.011
 sy+0.010
 stuffed+0.010
Neg logit contributions
bye-0.015
uck-0.013
arn-0.013
enge-0.013
aph-0.012
Token a
Token ID257
Feature activation-9.65e-04

Pos logit contributions
bye+0.014
uck+0.012
aph+0.011
arn+0.011
enge+0.011
Neg logit contributions
 swollen-0.013
ge-0.013
 stret-0.013
 sy-0.012
 stuffed-0.011
Token small
Token ID1402
Feature activation-1.22e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.002
 Kel+0.002
 sy+0.002
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
um-0.001
aph-0.001
Token stone
Token ID7815
Feature activation-4.91e-03

Pos logit contributions
ge+0.023
 stret+0.021
 swollen+0.021
 Kel+0.018
 sy+0.018
Neg logit contributions
uck-0.021
bye-0.020
enge-0.018
um-0.017
 example-0.017
Token inside
Token ID2641
Feature activation-1.52e-02

Pos logit contributions
ge+0.011
 stret+0.011
 swollen+0.011
 sy+0.010
 widened+0.009
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
aph-0.005
arn-0.005
Token said
Token ID531
Feature activation-4.29e-05

Pos logit contributions
 swollen+0.004
ge+0.004
 stret+0.004
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.005
uck-0.005
arn-0.004
enge-0.004
aph-0.004
Token,
Token ID11
Feature activation-2.75e-03

Pos logit contributions
ge+0.000
 swollen+0.000
 stret+0.000
 sy+0.000
 stuffed+0.000
Neg logit contributions
bye-0.000
uck-0.000
enge-0.000
arn-0.000
aph-0.000
Token "
Token ID366
Feature activation-1.68e-02

Pos logit contributions
ge+0.006
 swollen+0.005
 stret+0.005
 sy+0.005
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.004
aph-0.004
arn-0.004
TokenYes
Token ID5297
Feature activation-1.52e-02

Pos logit contributions
ge+0.035
 swollen+0.033
 stret+0.033
 sy+0.031
 stuffed+0.028
Neg logit contributions
bye-0.026
uck-0.024
enge-0.022
aph-0.022
arn-0.022
Token,
Token ID11
Feature activation-3.70e-03

Pos logit contributions
ge+0.032
 swollen+0.031
 stret+0.031
 sy+0.029
 stuffed+0.026
Neg logit contributions
bye-0.022
uck-0.021
enge-0.019
aph-0.019
arn-0.018
Token let
Token ID1309
Feature activation-2.22e-02

Pos logit contributions
ge+0.008
 swollen+0.008
 stret+0.008
 sy+0.007
 stuffed+0.006
Neg logit contributions
bye-0.005
uck-0.005
enge-0.004
aph-0.004
arn-0.004
Token's
Token ID338
Feature activation+7.02e-03

Pos logit contributions
 stret+0.026
ge+0.026
 swollen+0.025
 sy+0.022
 scrap+0.018
Neg logit contributions
bye-0.053
enge-0.048
aph-0.048
uck-0.047
iddle-0.047
Token do
Token ID466
Feature activation-6.97e-03

Pos logit contributions
bye+0.014
uck+0.013
enge+0.012
arn+0.012
aph+0.012
Neg logit contributions
ge-0.011
 stret-0.011
 swollen-0.011
 sy-0.010
 stuffed-0.009
Token that
Token ID326
Feature activation+2.17e-03

Pos logit contributions
ge+0.015
 swollen+0.015
 stret+0.014
 sy+0.014
 stuffed+0.012
Neg logit contributions
bye-0.010
uck-0.009
enge-0.008
arn-0.008
aph-0.008
Token!"
Token ID2474
Feature activation+1.04e-02

Pos logit contributions
bye+0.003
uck+0.003
aph+0.002
enge+0.002
arn+0.002
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.004
 widened-0.004
Token
Token ID220
Feature activation-2.36e-02

Pos logit contributions
bye+0.019
uck+0.017
enge+0.017
arn+0.016
aph+0.016
Neg logit contributions
 stret-0.018
ge-0.018
 swollen-0.017
 sy-0.016
 widened-0.014
Token her
Token ID607
Feature activation-7.63e-04

Pos logit contributions
bye+0.011
uck+0.010
enge+0.009
aph+0.009
arn+0.009
Neg logit contributions
ge-0.015
 swollen-0.014
 stret-0.014
 sy-0.013
 stuffed-0.012
Token room
Token ID2119
Feature activation+1.62e-02

Pos logit contributions
ge+0.002
 swollen+0.002
 stret+0.001
 sy+0.001
Are+0.001
Neg logit contributions
bye-0.001
uck-0.001
aph-0.001
enge-0.001
arn-0.001
Token.
Token ID13
Feature activation+4.63e-03

Pos logit contributions
bye+0.023
uck+0.021
enge+0.020
aph+0.019
arn+0.019
Neg logit contributions
ge-0.035
 swollen-0.034
 stret-0.034
 sy-0.031
 stuffed-0.028
Token She
Token ID1375
Feature activation-6.09e-04

Pos logit contributions
bye+0.008
uck+0.007
enge+0.007
 advisor+0.007
arn+0.007
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.007
 Kel-0.006
Token was
Token ID373
Feature activation-1.51e-03

Pos logit contributions
ge+0.001
 stret+0.001
 swollen+0.001
 sy+0.001
 stuffed+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
arn-0.001
Token having
Token ID1719
Feature activation-1.44e-02

Pos logit contributions
ge+0.003
 stret+0.003
 swollen+0.003
 sy+0.003
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token so
Token ID523
Feature activation-8.29e-03

Pos logit contributions
ge+0.033
 swollen+0.032
 stret+0.032
 sy+0.029
 stuffed+0.027
Neg logit contributions
bye-0.019
uck-0.018
enge-0.016
aph-0.015
arn-0.015
Token much
Token ID881
Feature activation+8.98e-03

Pos logit contributions
ge+0.013
 swollen+0.013
 stret+0.012
 sy+0.011
 widened+0.010
Neg logit contributions
bye-0.016
uck-0.015
enge-0.015
arn-0.014
aph-0.014
Token fun
Token ID1257
Feature activation+3.11e-02

Pos logit contributions
bye+0.015
uck+0.014
enge+0.013
arn+0.012
aph+0.012
Neg logit contributions
ge-0.018
 swollen-0.017
 stret-0.017
 sy-0.016
 stuffed-0.014
Token!
Token ID0
Feature activation+6.28e-03

Pos logit contributions
bye+0.040
uck+0.037
aph+0.032
enge+0.032
arn+0.031
Neg logit contributions
 swollen-0.071
ge-0.071
 stret-0.070
 sy-0.065
 scrap-0.058
Token She
Token ID1375
Feature activation-1.18e-03

Pos logit contributions
bye+0.011
enge+0.010
uck+0.010
arn+0.009
 advisor+0.009
Neg logit contributions
ge-0.010
 swollen-0.010
 stret-0.010
 sy-0.009
 Kel-0.009
Token in
Token ID287
Feature activation+4.96e-03

Pos logit contributions
ge+0.004
 swollen+0.004
 stret+0.004
 sy+0.004
 scrap+0.003
Neg logit contributions
bye-0.003
uck-0.002
enge-0.002
arn-0.002
aph-0.002
Token the
Token ID262
Feature activation+5.85e-04

Pos logit contributions
bye+0.006
uck+0.005
 noon+0.005
arn+0.004
enge+0.004
Neg logit contributions
ge-0.012
 stret-0.012
 swollen-0.012
 sy-0.011
lest-0.010
Token warm
Token ID5814
Feature activation-4.08e-03

Pos logit contributions
bye+0.001
uck+0.001
 noon+0.001
um+0.001
night+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 sy-0.001
 Kel-0.001
Token,
Token ID11
Feature activation-9.92e-04

Pos logit contributions
ge+0.009
 swollen+0.008
 stret+0.008
 sy+0.008
 stuffed+0.007
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
arn-0.005
aph-0.005
Token so
Token ID523
Feature activation-1.41e-02

Pos logit contributions
ge+0.002
 stret+0.002
 swollen+0.002
 sy+0.002
rum+0.002
Neg logit contributions
 noon-0.001
bye-0.001
uck-0.001
 sun-0.001
 Dad-0.001
Tokenapy
Token ID12826
Feature activation-6.16e-02

Pos logit contributions
 swollen+0.022
ge+0.021
 stret+0.019
 scrap+0.017
 sy+0.017
Neg logit contributions
bye-0.026
enge-0.026
iddle-0.025
arn-0.025
anks-0.025
Token water
Token ID1660
Feature activation-5.38e-03

Pos logit contributions
ge+0.151
 stret+0.142
 swollen+0.141
 sy+0.130
 Kel+0.118
Neg logit contributions
bye-0.077
uck-0.070
enge-0.060
arn-0.057
aph-0.056
Token.
Token ID13
Feature activation-1.39e-03

Pos logit contributions
ge+0.012
 swollen+0.011
 stret+0.011
 sy+0.010
 stuffed+0.009
Neg logit contributions
bye-0.008
uck-0.007
enge-0.006
arn-0.006
aph-0.006
Token She
Token ID1375
Feature activation-4.22e-03

Pos logit contributions
 swollen+0.002
ge+0.002
 stret+0.002
Are+0.002
 Kel+0.002
Neg logit contributions
bye-0.003
enge-0.002
uck-0.002
iddle-0.002
arn-0.002
Token spl
Token ID4328
Feature activation-1.54e-03

Pos logit contributions
ge+0.009
 stret+0.008
 swollen+0.008
 sy+0.008
 flashing+0.007
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
apped-0.005
night-0.005
Tokenashed
Token ID5263
Feature activation+1.67e-02

Pos logit contributions
ge+0.004
 stret+0.003
 swollen+0.003
 sy+0.003
 stuffed+0.003
Neg logit contributions
bye-0.002
uck-0.002
aph-0.002
enge-0.002
arn-0.002
Token steer
Token ID27401
Feature activation-2.03e-02

Pos logit contributions
ge+0.010
 swollen+0.010
 stret+0.009
 sy+0.009
 scrap+0.007
Neg logit contributions
bye-0.009
uck-0.008
aph-0.008
enge-0.008
arn-0.007
Token the
Token ID262
Feature activation+1.27e-03

Pos logit contributions
ge+0.041
 swollen+0.040
 stret+0.040
 sy+0.036
 stuffed+0.033
Neg logit contributions
bye-0.032
uck-0.030
enge-0.028
arn-0.027
aph-0.027
Token boat
Token ID8848
Feature activation+2.23e-03

Pos logit contributions
bye+0.002
uck+0.002
enge+0.002
aph+0.002
arn+0.002
Neg logit contributions
ge-0.003
 swollen-0.002
 stret-0.002
 sy-0.002
 widened-0.002
Token with
Token ID351
Feature activation-8.10e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
arn+0.003
aph+0.003
Neg logit contributions
ge-0.005
 stret-0.004
 swollen-0.004
 sy-0.004
 stuffed-0.004
Token a
Token ID257
Feature activation-2.76e-03

Pos logit contributions
ge+0.015
 swollen+0.014
 stret+0.014
 sy+0.013
 stuffed+0.012
Neg logit contributions
bye-0.015
uck-0.014
enge-0.013
arn-0.013
aph-0.012
Token remote
Token ID6569
Feature activation-5.75e-02

Pos logit contributions
ge+0.006
 Kel+0.005
 swollen+0.005
 stret+0.005
 sy+0.004
Neg logit contributions
bye-0.003
 sun-0.003
 contest-0.003
aph-0.003
 home-0.003
Token control
Token ID1630
Feature activation-2.54e-02

Pos logit contributions
ge+0.105
 stret+0.101
 swollen+0.101
 sy+0.092
 stuffed+0.082
Neg logit contributions
bye-0.102
uck-0.096
enge-0.089
arn-0.088
aph-0.088
Token.
Token ID13
Feature activation-4.19e-03

Pos logit contributions
ge+0.055
 swollen+0.055
 stret+0.054
 sy+0.050
 stuffed+0.045
Neg logit contributions
bye-0.036
uck-0.032
enge-0.029
aph-0.028
arn-0.028
Token The
Token ID383
Feature activation-7.61e-04

Pos logit contributions
 stret+0.006
 swollen+0.006
ge+0.006
 levers+0.005
 Kel+0.005
Neg logit contributions
bye-0.008
uck-0.008
enge-0.007
iddle-0.007
aph-0.007
Token boat
Token ID8848
Feature activation+2.08e-03

Pos logit contributions
ge+0.002
 swollen+0.001
 stret+0.001
 sy+0.001
 Kel+0.001
Neg logit contributions
bye-0.001
uck-0.001
enge-0.001
aph-0.001
iddle-0.001
Token is
Token ID318
Feature activation+4.16e-03

Pos logit contributions
bye+0.003
uck+0.003
enge+0.003
aph+0.003
apped+0.003
Neg logit contributions
ge-0.004
 swollen-0.004
 stret-0.004
 sy-0.004
 Kel-0.003
Token too
Token ID1165
Feature activation+7.51e-03

Pos logit contributions
ge+0.007
 stret+0.007
 sy+0.007
 swollen+0.007
lest+0.006
Neg logit contributions
bye-0.002
uck-0.002
arn-0.002
 noon-0.002
iddle-0.002
Token late
Token ID2739
Feature activation-1.36e-03

Pos logit contributions
bye+0.009
uck+0.008
enge+0.007
aph+0.007
arn+0.007
Neg logit contributions
ge-0.018
 swollen-0.018
 stret-0.018
 sy-0.016
 stuffed-0.015
Token
Token ID6184
Feature activation+2.01e-02

Pos logit contributions
ge+0.003
 swollen+0.003
 stret+0.003
 sy+0.003
 stuffed+0.002
Neg logit contributions
bye-0.002
uck-0.002
enge-0.002
aph-0.002
arn-0.002
Token
Token ID95
Feature activation+2.65e-02

Pos logit contributions
bye+0.034
uck+0.031
enge+0.030
arn+0.029
aph+0.029
Neg logit contributions
ge-0.038
 stret-0.038
 swollen-0.036
 sy-0.034
 tug-0.030
Token
Token ID26391
Feature activation+1.11e-02

Pos logit contributions
bye+0.041
uck+0.038
aph+0.036
enge+0.036
apped+0.034
Neg logit contributions
 stret-0.052
ge-0.051
 swollen-0.048
 sy-0.047
 tug-0.041
Token
Token ID447
Feature activation-6.33e-02

Pos logit contributions
bye+0.020
uck+0.019
enge+0.017
aph+0.017
arn+0.017
Neg logit contributions
ge-0.020
 swollen-0.019
 stret-0.019
 sy-0.017
 stuffed-0.015
Token
Token ID250
Feature activation+1.55e-02

Pos logit contributions
 swollen+0.120
ge+0.119
 stret+0.117
 sy+0.108
 scrap+0.095
Neg logit contributions
bye-0.107
uck-0.095
enge-0.092
arn-0.092
aph-0.090
Token the
Token ID262
Feature activation+2.53e-03

Pos logit contributions
bye+0.028
uck+0.026
enge+0.025
aph+0.024
arn+0.024
Neg logit contributions
ge-0.028
 swollen-0.027
 stret-0.026
 sy-0.024
 stuffed-0.021
Token little
Token ID1310
Feature activation+3.85e-03

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
arn+0.004
aph+0.003
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.004
 stuffed-0.004
Token girl
Token ID2576
Feature activation-3.39e-03

Pos logit contributions
bye+0.006
uck+0.006
enge+0.005
arn+0.005
aph+0.005
Neg logit contributions
ge-0.008
 swollen-0.008
 stret-0.008
 sy-0.007
 stuffed-0.006
Token was
Token ID373
Feature activation-4.93e-03

Pos logit contributions
ge+0.006
 swollen+0.006
 stret+0.006
 sy+0.005
 stuffed+0.005
Neg logit contributions
bye-0.006
uck-0.006
enge-0.005
aph-0.005
arn-0.005
Token the
Token ID262
Feature activation+2.48e-03

Pos logit contributions
ge+0.007
 stret+0.007
 swollen+0.006
 sy+0.006
 stuffed+0.005
Neg logit contributions
bye-0.004
uck-0.004
arn-0.004
enge-0.004
aph-0.004
Token brush
Token ID14093
Feature activation-1.57e-02

Pos logit contributions
bye+0.004
uck+0.004
enge+0.004
aph+0.003
arn+0.003
Neg logit contributions
ge-0.005
 stret-0.005
 swollen-0.005
 sy-0.004
 widened-0.004
Token
Token ID6184
Feature activation+1.78e-02

Pos logit contributions
 stret+0.034
ge+0.033
 swollen+0.031
 sy+0.030
 prope+0.028
Neg logit contributions
uck-0.023
bye-0.020
um-0.018
arn-0.017
anks-0.016
Token
Token ID95
Feature activation+2.39e-02

Pos logit contributions
bye+0.031
enge+0.027
aph+0.026
uck+0.026
apped+0.025
Neg logit contributions
 stret-0.031
ge-0.030
 swollen-0.030
 sy-0.028
 tug-0.026
Token
Token ID26391
Feature activation+1.04e-02

Pos logit contributions
bye+0.041
aph+0.038
uck+0.037
enge+0.035
apped+0.035
Neg logit contributions
 stret-0.041
ge-0.039
 swollen-0.038
 sy-0.036
 tug-0.032
Token
Token ID447
Feature activation-6.33e-02

Pos logit contributions
bye+0.020
aph+0.018
uck+0.018
enge+0.017
iddle+0.017
Neg logit contributions
ge-0.016
 swollen-0.016
 stret-0.016
 sy-0.014
 stuffed-0.013
Token
Token ID250
Feature activation+1.55e-02

Pos logit contributions
 swollen+0.112
ge+0.106
 stret+0.104
 sy+0.097
 scrap+0.086
Neg logit contributions
bye-0.115
enge-0.101
uck-0.101
arn-0.100
aph-0.100
Token it
Token ID340
Feature activation+9.74e-03

Pos logit contributions
bye+0.028
uck+0.026
enge+0.025
aph+0.024
arn+0.024
Neg logit contributions
ge-0.028
 swollen-0.027
 stret-0.026
 sy-0.024
 stuffed-0.021
Token wasn
Token ID2492
Feature activation-6.95e-03

Pos logit contributions
bye+0.018
uck+0.018
enge+0.016
arn+0.016
aph+0.016
Neg logit contributions
ge-0.017
 stret-0.016
 swollen-0.015
 sy-0.014
 stuffed-0.012
Token't
Token ID470
Feature activation-2.53e-03

Pos logit contributions
ge+0.010
 stret+0.009
 swollen+0.009
 sy+0.008
 stuffed+0.007
Neg logit contributions
uck-0.015
bye-0.015
arn-0.013
aph-0.013
enge-0.013
Token quite
Token ID2407
Feature activation+3.87e-03

Pos logit contributions
ge+0.005
 swollen+0.005
 stret+0.005
 sy+0.004
 stuffed+0.004
Neg logit contributions
bye-0.004
uck-0.004
enge-0.004
arn-0.004
aph-0.004
Token out
Token ID503
Feature activation+1.12e-03

Pos logit contributions
uck+0.029
bye+0.029
enge+0.024
apped+0.023
aph+0.022
Neg logit contributions
ge-0.059
 swollen-0.058
 stret-0.057
 sy-0.053
 widened-0.047
Token the
Token ID262
Feature activation+6.44e-04

Pos logit contributions
uck+0.001
bye+0.001
apped+0.001
 noon+0.001
um+0.001
Neg logit contributions
 stret-0.003
ge-0.003
 swollen-0.003
 sy-0.003
Are-0.002
Token towels
Token ID38494
Feature activation-8.74e-03

Pos logit contributions
bye+0.001
uck+0.001
iddle+0.001
enge+0.001
 job+0.001
Neg logit contributions
ge-0.001
 swollen-0.001
 stret-0.001
 widened-0.001
 sy-0.001
Token and
Token ID290
Feature activation+2.74e-03

Pos logit contributions
 swollen+0.019
ge+0.019
 stret+0.018
 sy+0.017
 scrap+0.016
Neg logit contributions
bye-0.012
uck-0.011
enge-0.010
arn-0.009
aph-0.009
Token so
Token ID523
Feature activation-1.16e-02

Pos logit contributions
bye+0.005
uck+0.004
enge+0.004
aph+0.004
arn+0.004
Neg logit contributions
ge-0.005
 swollen-0.005
 stret-0.005
 sy-0.005
 widened-0.004
Tokenapy
Token ID12826
Feature activation-5.96e-02

Pos logit contributions
ge+0.020
 swollen+0.019
 stret+0.019
 sy+0.017
 stuffed+0.016
Neg logit contributions
bye-0.022
uck-0.020
enge-0.019
aph-0.019
arn-0.019
Token water
Token ID1660
Feature activation-3.25e-03

Pos logit contributions
ge+0.129
 swollen+0.120
 stret+0.114
 sy+0.107
 scrap+0.096
Neg logit contributions
bye-0.091
uck-0.082
enge-0.074
 Now-0.071
night-0.071
Token.
Token ID13
Feature activation+3.13e-03

Pos logit contributions
ge+0.007
 swollen+0.007
 stret+0.007
 sy+0.006
 stuffed+0.006
Neg logit contributions
bye-0.005
uck-0.004
enge-0.004
arn-0.004
aph-0.004
Token Little
Token ID7703
Feature activation-1.30e-02

Pos logit contributions
bye+0.004
uck+0.004
enge+0.003
arn+0.003
aph+0.003
Neg logit contributions
ge-0.007
 swollen-0.007
 stret-0.007
 sy-0.006
 stuffed-0.006
Token one
Token ID530
Feature activation+5.18e-04

Pos logit contributions
ge+0.025
 swollen+0.024
 stret+0.023
 sy+0.022
 stuffed+0.020
Neg logit contributions
bye-0.021
uck-0.019
enge-0.018
aph-0.018
anks-0.018
Token brushed
Token ID31782
Feature activation-1.78e-02

Pos logit contributions
bye+0.001
uck+0.001
enge+0.001
arn+0.001
aph+0.001
Neg logit contributions
ge-0.001
 stret-0.001
 swollen-0.001
 sy-0.001
 stuffed-0.001