TOP ACTIVATIONS
MAX = 0.114

Bang ! You 're under arrest !" Their mom
song about rain bows and umb rell as . They were
We had no jackets or umb rell as , so now
giant ! You are under arrest ! You can 't take
, " You are under arrest for harassing and stealing from
robber ! You are under arrest !" Lily shouts , holding
sad . They have no umb rell as .
jackets became heavy and their umb rell as began to dro
for them with towels and umb rell as . They hug
just like in a fair yt ale ! <|endoftext|> Once upon
the tree to its once former beauty . He was happy
were four seasons in a year : winter , spring ,
sad . He had no umb rell as or hats to
the big fork did not reapp ear . They searched for
. And I like your umb rell as . They are
got out their coats and umb rell as and they held
door . Inside were two umb rell as - one big
hard game for a three year old . She also knew
, elephants , monkeys and gir aff es . They pretend
ze br as , and gir aff es . Anna and

MIN ACTIVATIONS
MIN = -0.104

She put the Band - Aid on Lily 's knee and
Grace bought the thick f udge and the two of them
excited . She held the bar of chocolate and skipped all
and a wand . She dips the wand in the soap
make one too . He dips his brush in the red
George asked the bar ber for a new hairst
decided to cheat . He dipped his claws in ink and
outfit . She put her favourite necklace on too , for
that she gave the w affle maker a kiss .
Jack asked the mer maid why the waves were
Tim . She put the muff ins in a box and
The store keeper put the muff in on a big scale
the counter and asked the bar ista if they could print
and some soap . Lily dipped a cloth in the water
mix . They put the muff ins in the oven and
handsome . He thanked the bar ber and ran out of
Sc amp i took the muff in and hopped away ,
to the stream . She dips her hand in the water
its tail . Lily counted how many spots the dog had
and a brush . She dips the brush in the water

INTERVAL 0.059 - 0.114
CONTAINS 0.414%

his friends and did not cry when he did not get
Then , Tom and Anna heard a knock on their doors
some more . They do not lose her again . They
learned their lesson . They never went near the road again
there was a little dog named Spot . Spot loved to

INTERVAL 0.005 - 0.059
CONTAINS 37.840%

" That was a great race ! What should we do
wanted to help , but she was too small to reach
in the air all night long , until the sun rose
put her in an even more peaceful mood .
then , something happened . The elevator stopped . The lights

INTERVAL -0.050 - 0.005
CONTAINS 60.940%

was a nice toy , but reminded him to be careful
with blocks . We built a yacht . Do you like
had a wonderful day full of spinning and fun . <|endoftext|>
he knew how to softly rub her shoulders . His mom
he asked the baby to open his mouth and say '

INTERVAL -0.104 - -0.050
CONTAINS 0.805%

nice again . Would you like to help me ?"
." Mia and Tom looked at each other and knew they
hand . They asked him what happened and they started to
Tom said . He picked up the pin and held it
. He was amazed by what he found inside - a
Token Bang
Token ID9801
Feature activation-7.45e-03

Pos logit contributions
 fault+0.063
 earthquake+0.056
 fallen+0.055
 enough+0.054
erved+0.054
Neg logit contributions
 quietly-0.049
 mer-0.046
 curiously-0.046
 cheer-0.045
Sure-0.045
Token!
Token ID0
Feature activation+3.19e-02

Pos logit contributions
 quietly+0.011
fish+0.011
 carefully+0.011
 curiously+0.011
in+0.010
Neg logit contributions
 fault-0.015
 arrest-0.013
 inconvenience-0.012
erved-0.012
 Kim-0.012
Token You
Token ID921
Feature activation-9.85e-03

Pos logit contributions
 fault+0.060
 earthquake+0.051
 fallen+0.050
 enough+0.050
 supposed+0.049
Neg logit contributions
 quietly-0.049
Sure-0.047
 cheer-0.046
Yes-0.045
 mer-0.045
Token're
Token ID821
Feature activation+1.82e-02

Pos logit contributions
 carefully+0.014
 quietly+0.014
fish+0.013
 else+0.013
 cheered+0.013
Neg logit contributions
 fault-0.020
 earthquake-0.017
 inconvenience-0.017
opl-0.017
 arrest-0.016
Token under
Token ID739
Feature activation+2.63e-02

Pos logit contributions
 fault+0.035
 arrest+0.032
erved+0.031
opl+0.030
 inconvenience+0.029
Neg logit contributions
 quietly-0.026
 carefully-0.025
 mer-0.025
 cheer-0.024
 excited-0.023
Token arrest
Token ID3251
Feature activation+1.14e-01

Pos logit contributions
 fault+0.031
erved+0.023
ichever+0.021
hy+0.021
 inconvenience+0.020
Neg logit contributions
 quietly-0.061
 carefully-0.059
 mer-0.059
 curiously-0.057
fish-0.056
Token!"
Token ID2474
Feature activation-1.23e-02

Pos logit contributions
 fault+0.228
erved+0.186
ained+0.186
 vain+0.185
 Kim+0.184
Neg logit contributions
 cheer-0.164
 quietly-0.163
 carefully-0.158
 happily-0.153
 curiously-0.152
Token
Token ID198
Feature activation-2.52e-03

Pos logit contributions
 quietly+0.014
 curiously+0.014
 carefully+0.013
 eagerly+0.013
fish+0.013
Neg logit contributions
 fault-0.029
 arrest-0.027
 fallen-0.026
 Kim-0.025
 enough-0.025
Token
Token ID198
Feature activation+2.55e-03

Pos logit contributions
 curiously+0.003
fish+0.003
 quietly+0.003
 mer+0.003
 eagerly+0.003
Neg logit contributions
 Kim-0.005
 arrest-0.005
 fault-0.005
 fallen-0.004
 Wh-0.004
TokenTheir
Token ID14574
Feature activation+3.92e-02

Pos logit contributions
 arrest+0.005
 Kim+0.004
 fault+0.004
 fallen+0.004
 earthquake+0.004
Neg logit contributions
 curiously-0.004
 quietly-0.004
fish-0.004
 eagerly-0.004
 carefully-0.004
Token mom
Token ID1995
Feature activation+3.48e-02

Pos logit contributions
 fault+0.091
 arrest+0.086
 fallen+0.080
 Kim+0.079
 earthquake+0.078
Neg logit contributions
 quietly-0.047
 curiously-0.046
 carefully-0.043
 eagerly-0.042
fish-0.042
Token song
Token ID3496
Feature activation+5.74e-03

Pos logit contributions
 curiously+0.025
 quietly+0.025
 eagerly+0.024
Hop+0.023
in+0.022
Neg logit contributions
 arrest-0.023
 fault-0.023
 fallen-0.021
hy-0.021
 AW-0.020
Token about
Token ID546
Feature activation-7.84e-03

Pos logit contributions
 fault+0.011
 arrest+0.010
 enough+0.009
 Kim+0.009
 fallen+0.009
Neg logit contributions
 curiously-0.009
 quietly-0.009
Hop-0.008
 carefully-0.008
fish-0.008
Token rain
Token ID6290
Feature activation+4.11e-02

Pos logit contributions
 quietly+0.013
 curiously+0.012
 carefully+0.012
 eagerly+0.012
fish+0.011
Neg logit contributions
 fault-0.015
 arrest-0.014
 fallen-0.013
 Kim-0.013
 enough-0.013
Tokenbows
Token ID25435
Feature activation+1.79e-02

Pos logit contributions
 fault+0.055
 arrest+0.051
 fallen+0.044
 Kim+0.043
 earthquake+0.042
Neg logit contributions
 quietly-0.089
 curiously-0.087
 carefully-0.085
 eagerly-0.083
fish-0.082
Token and
Token ID290
Feature activation+2.68e-02

Pos logit contributions
 fault+0.039
 arrest+0.037
 fallen+0.034
 enough+0.034
 Kim+0.033
Neg logit contributions
 quietly-0.023
 curiously-0.023
 carefully-0.021
 eagerly-0.021
Hop-0.020
Token umb
Token ID20810
Feature activation+1.10e-01

Pos logit contributions
 fault+0.057
 arrest+0.053
erved+0.048
 fallen+0.048
 earthquake+0.048
Neg logit contributions
 quietly-0.038
 curiously-0.036
 carefully-0.036
 eagerly-0.034
fish-0.034
Tokenrell
Token ID11252
Feature activation+1.39e-02

Pos logit contributions
 fault+0.111
 arrest+0.096
 enough+0.080
 earthquake+0.074
 enemy+0.074
Neg logit contributions
 quietly-0.270
 carefully-0.268
 curiously-0.265
 eagerly-0.261
Hop-0.252
Tokenas
Token ID292
Feature activation+1.46e-02

Pos logit contributions
 fault+0.030
 arrest+0.029
 fallen+0.027
 Kim+0.026
 earthquake+0.026
Neg logit contributions
 quietly-0.018
 curiously-0.018
 carefully-0.017
 eagerly-0.016
fish-0.016
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 fault+0.032
 arrest+0.030
 fallen+0.028
 Kim+0.027
 earthquake+0.027
Neg logit contributions
 quietly-0.020
 curiously-0.019
 carefully-0.018
 eagerly-0.018
fish-0.017
Token They
Token ID1119
Feature activation+1.24e-02

Pos logit contributions
 fault+0.019
 arrest+0.017
 Kim+0.017
 fallen+0.016
 enough+0.016
Neg logit contributions
 curiously-0.019
 quietly-0.019
fish-0.018
 carefully-0.018
 eagerly-0.017
Token were
Token ID547
Feature activation+9.43e-03

Pos logit contributions
 fault+0.022
 arrest+0.021
 fallen+0.020
 Kim+0.020
 enough+0.019
Neg logit contributions
 curiously-0.020
 quietly-0.019
Hop-0.018
 eagerly-0.018
 carefully-0.018
Token We
Token ID775
Feature activation-1.42e-02

Pos logit contributions
 fault+0.009
 arrest+0.008
 fallen+0.007
 Kim+0.007
 enough+0.007
Neg logit contributions
 quietly-0.010
 curiously-0.010
 carefully-0.009
 eagerly-0.009
fish-0.009
Token had
Token ID550
Feature activation+2.72e-02

Pos logit contributions
 quietly+0.024
 curiously+0.024
 carefully+0.023
 eagerly+0.022
fish+0.022
Neg logit contributions
 fault-0.025
 arrest-0.023
 Kim-0.021
 fallen-0.021
 earthquake-0.021
Token no
Token ID645
Feature activation+3.51e-02

Pos logit contributions
opl+0.059
 earthquake+0.056
ross+0.055
 arrest+0.055
 Kim+0.054
Neg logit contributions
 quietly-0.035
 carefully-0.034
 closer-0.030
 eagerly-0.029
 excited-0.028
Token jackets
Token ID35129
Feature activation-4.50e-03

Pos logit contributions
 fault+0.062
 arrest+0.061
 Kim+0.055
 Entry+0.054
hy+0.053
Neg logit contributions
 quietly-0.057
 carefully-0.055
 curiously-0.054
 eagerly-0.053
 cheer-0.050
Token or
Token ID393
Feature activation+1.60e-02

Pos logit contributions
 quietly+0.006
 carefully+0.006
fish+0.005
 curiously+0.005
 eagerly+0.005
Neg logit contributions
 arrest-0.010
 fault-0.009
 Kim-0.009
 earthquake-0.009
 Entry-0.009
Token umb
Token ID20810
Feature activation+1.08e-01

Pos logit contributions
 fault+0.031
 arrest+0.028
opl+0.027
erved+0.026
OO+0.026
Neg logit contributions
 quietly-0.024
 carefully-0.024
 curiously-0.022
fish-0.021
 eagerly-0.021
Tokenrell
Token ID11252
Feature activation+1.07e-02

Pos logit contributions
 fault+0.134
 arrest+0.122
 enemy+0.118
 enough+0.110
 tresp+0.109
Neg logit contributions
 carefully-0.229
 eagerly-0.223
 quietly-0.221
 curiously-0.217
 around-0.211
Tokenas
Token ID292
Feature activation+1.25e-02

Pos logit contributions
 arrest+0.026
 fault+0.026
 earthquake+0.024
 fallen+0.023
 Entry+0.023
Neg logit contributions
 quietly-0.010
 carefully-0.010
fish-0.010
 curiously-0.010
Hop-0.009
Token,
Token ID11
Feature activation+1.72e-02

Pos logit contributions
 fault+0.024
 arrest+0.023
erved+0.022
 earthquake+0.021
 Kim+0.021
Neg logit contributions
 quietly-0.019
 carefully-0.019
fish-0.018
 curiously-0.018
 eagerly-0.017
Token so
Token ID523
Feature activation+6.98e-03

Pos logit contributions
 fault+0.034
 arrest+0.032
 fallen+0.030
erved+0.030
 earthquake+0.029
Neg logit contributions
 quietly-0.026
 carefully-0.024
 curiously-0.024
 eagerly-0.023
fish-0.022
Token now
Token ID783
Feature activation-6.33e-03

Pos logit contributions
 fault+0.015
 arrest+0.014
 fallen+0.013
erved+0.013
 earthquake+0.013
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.009
 eagerly-0.008
fish-0.008
Token giant
Token ID6175
Feature activation+3.67e-02

Pos logit contributions
 fault+0.070
 arrest+0.062
 fallen+0.059
erved+0.059
 enough+0.056
Neg logit contributions
 quietly-0.063
 curiously-0.061
 carefully-0.060
 eagerly-0.057
fish-0.056
Token!
Token ID0
Feature activation+3.31e-02

Pos logit contributions
 fault+0.083
erved+0.075
 arrest+0.071
 fallen+0.070
ained+0.070
Neg logit contributions
 quietly-0.047
 carefully-0.045
fish-0.044
 curiously-0.043
in-0.040
Token You
Token ID921
Feature activation-9.07e-03

Pos logit contributions
 fault+0.059
 fallen+0.051
 enough+0.050
erved+0.049
 earthquake+0.049
Neg logit contributions
 quietly-0.053
 curiously-0.051
Sure-0.048
Hop-0.048
 carefully-0.047
Token are
Token ID389
Feature activation-2.96e-03

Pos logit contributions
 carefully+0.013
 quietly+0.012
 else+0.012
 gently+0.011
Hop+0.011
Neg logit contributions
 fault-0.019
opl-0.016
 earthquake-0.016
 Entry-0.016
 inconvenience-0.016
Token under
Token ID739
Feature activation+2.23e-02

Pos logit contributions
 quietly+0.005
 carefully+0.005
 gently+0.004
 curiously+0.004
 mer+0.004
Neg logit contributions
 fault-0.006
erved-0.005
 arrest-0.005
 Entry-0.005
opl-0.005
Token arrest
Token ID3251
Feature activation+1.06e-01

Pos logit contributions
 fault+0.025
ichever+0.018
erved+0.017
EEP+0.016
upid+0.016
Neg logit contributions
 quietly-0.052
 carefully-0.051
 mer-0.050
 curiously-0.048
 dolphin-0.047
Token!
Token ID0
Feature activation+3.02e-02

Pos logit contributions
 fault+0.235
ained+0.201
erved+0.197
 supposed+0.195
 earthquake+0.195
Neg logit contributions
 quietly-0.131
 carefully-0.126
 cheer-0.125
 curiously-0.122
 happily-0.121
Token You
Token ID921
Feature activation-1.07e-02

Pos logit contributions
 fault+0.058
 enough+0.050
 fallen+0.049
erved+0.048
 earthquake+0.047
Neg logit contributions
 quietly-0.044
Hop-0.041
 curiously-0.040
Sure-0.040
Yes-0.040
Token can
Token ID460
Feature activation-6.25e-03

Pos logit contributions
 carefully+0.015
 quietly+0.015
 else+0.014
Hop+0.014
 gently+0.013
Neg logit contributions
 fault-0.022
 earthquake-0.019
opl-0.019
 appearance-0.019
 Entry-0.019
Token't
Token ID470
Feature activation+5.39e-02

Pos logit contributions
 quietly+0.014
 curiously+0.014
 carefully+0.013
 eagerly+0.013
fish+0.013
Neg logit contributions
 fault-0.008
 arrest-0.008
 fallen-0.007
 Kim-0.006
 earthquake-0.006
Token take
Token ID1011
Feature activation+2.27e-02

Pos logit contributions
 fault+0.101
 arrest+0.095
 fallen+0.087
 Kim+0.086
 earthquake+0.085
Neg logit contributions
 quietly-0.088
 curiously-0.086
 carefully-0.082
 eagerly-0.081
fish-0.080
Token,
Token ID11
Feature activation-8.03e-04

Pos logit contributions
 quietly+0.013
 carefully+0.012
 curiously+0.012
 eagerly+0.012
fish+0.011
Neg logit contributions
 fault-0.021
 arrest-0.019
erved-0.019
 earthquake-0.018
 fallen-0.018
Token "
Token ID366
Feature activation-8.52e-03

Pos logit contributions
 quietly+0.001
 curiously+0.001
 carefully+0.001
fish+0.001
 eagerly+0.001
Neg logit contributions
 fault-0.002
 arrest-0.001
 fallen-0.001
 Kim-0.001
 enough-0.001
TokenYou
Token ID1639
Feature activation-2.72e-02

Pos logit contributions
 quietly+0.012
 curiously+0.011
 carefully+0.011
 eagerly+0.010
Hop+0.010
Neg logit contributions
 fault-0.019
 arrest-0.017
 fallen-0.016
 Kim-0.016
 earthquake-0.016
Token are
Token ID389
Feature activation-4.61e-03

Pos logit contributions
 carefully+0.040
 quietly+0.039
 else+0.036
 curiously+0.036
 cheer+0.036
Neg logit contributions
 fault-0.055
opl-0.050
erved-0.047
 earthquake-0.046
 arrest-0.046
Token under
Token ID739
Feature activation+2.20e-02

Pos logit contributions
 quietly+0.007
 carefully+0.006
 curiously+0.006
Hop+0.006
 mer+0.006
Neg logit contributions
 fault-0.010
 arrest-0.009
erved-0.009
EEP-0.008
OO-0.008
Token arrest
Token ID3251
Feature activation+1.06e-01

Pos logit contributions
 fault+0.029
erved+0.023
ichever+0.022
EEP+0.022
 Entry+0.022
Neg logit contributions
 quietly-0.047
 carefully-0.045
 mer-0.044
 curiously-0.042
 happily-0.041
Token for
Token ID329
Feature activation+2.04e-02

Pos logit contributions
 fault+0.239
erved+0.203
 Kim+0.203
 Entry+0.201
ained+0.201
Neg logit contributions
 quietly-0.129
 carefully-0.125
 cheer-0.121
 curiously-0.120
 happily-0.115
Token harassing
Token ID34110
Feature activation+3.09e-02

Pos logit contributions
 fault+0.049
erved+0.041
upid+0.041
EEP+0.040
OO+0.040
Neg logit contributions
 quietly-0.027
 carefully-0.025
 cheer-0.023
 gently-0.021
 eagerly-0.021
Token and
Token ID290
Feature activation+2.84e-02

Pos logit contributions
 fault+0.063
 arrest+0.060
 fallen+0.057
 earthquake+0.056
 Entry+0.056
Neg logit contributions
 quietly-0.045
 carefully-0.041
 curiously-0.039
 eagerly-0.039
 mer-0.037
Token stealing
Token ID16568
Feature activation+2.53e-02

Pos logit contributions
 fault+0.050
 arrest+0.044
erved+0.041
 fallen+0.040
 Entry+0.040
Neg logit contributions
 quietly-0.051
 curiously-0.049
 carefully-0.049
 eagerly-0.047
 mer-0.045
Token from
Token ID422
Feature activation+3.22e-03

Pos logit contributions
 fault+0.049
 arrest+0.047
erved+0.043
 fallen+0.043
 Kim+0.043
Neg logit contributions
 quietly-0.038
 carefully-0.037
 curiously-0.035
 eagerly-0.033
fish-0.033
Token robber
Token ID29979
Feature activation+7.09e-02

Pos logit contributions
 fault+0.061
 arrest+0.048
 fallen+0.047
opl+0.047
erved+0.046
Neg logit contributions
 quietly-0.073
 carefully-0.071
Sure-0.068
 curiously-0.067
 cheer-0.066
Token!
Token ID0
Feature activation+4.59e-02

Pos logit contributions
 fault+0.139
erved+0.123
 earthquake+0.116
 arrest+0.114
ained+0.114
Neg logit contributions
 quietly-0.114
 carefully-0.108
 curiously-0.104
fish-0.100
 gently-0.099
Token You
Token ID921
Feature activation-8.10e-03

Pos logit contributions
 fault+0.086
 fallen+0.077
 done+0.074
 enough+0.074
 arrest+0.073
Neg logit contributions
 quietly-0.070
 curiously-0.069
Hop-0.067
Sure-0.067
 carefully-0.062
Token are
Token ID389
Feature activation-4.71e-03

Pos logit contributions
 quietly+0.012
 carefully+0.012
Hop+0.011
 curiously+0.011
 gently+0.011
Neg logit contributions
 fault-0.016
 Kim-0.014
erved-0.014
 earthquake-0.014
 Entry-0.014
Token under
Token ID739
Feature activation+2.09e-02

Pos logit contributions
 quietly+0.008
 carefully+0.008
 gently+0.007
 curiously+0.007
 closely+0.007
Neg logit contributions
 fault-0.009
erved-0.007
 arrest-0.007
opl-0.007
EEP-0.007
Token arrest
Token ID3251
Feature activation+1.05e-01

Pos logit contributions
 fault+0.032
upid+0.024
EEP+0.024
ah+0.024
 inconvenience+0.024
Neg logit contributions
 quietly-0.042
 carefully-0.041
 mer-0.038
 outside-0.037
 around-0.037
Token!"
Token ID2474
Feature activation+1.09e-02

Pos logit contributions
 fault+0.218
ained+0.180
 Entry+0.178
 earthquake+0.178
 supposed+0.178
Neg logit contributions
 quietly-0.145
 carefully-0.142
 curiously-0.139
 cheer-0.138
 around-0.132
Token Lily
Token ID20037
Feature activation+7.15e-03

Pos logit contributions
 fault+0.021
 arrest+0.019
 fallen+0.017
erved+0.017
 Entry+0.017
Neg logit contributions
 quietly-0.019
 curiously-0.018
 carefully-0.018
 eagerly-0.017
 closely-0.016
Token shouts
Token ID34757
Feature activation+2.33e-02

Pos logit contributions
 fault+0.014
 arrest+0.013
 fallen+0.012
 earthquake+0.012
 Kim+0.012
Neg logit contributions
 quietly-0.012
 curiously-0.011
 carefully-0.011
 eagerly-0.010
 closely-0.010
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
 fault+0.045
 arrest+0.042
 enough+0.040
 Kim+0.039
 fallen+0.039
Neg logit contributions
 quietly-0.034
fish-0.034
 curiously-0.034
Hop-0.033
 carefully-0.032
Token holding
Token ID4769
Feature activation-5.45e-02

Pos logit contributions
 fault+0.020
 arrest+0.019
 fallen+0.018
 Kim+0.017
 enough+0.017
Neg logit contributions
 quietly-0.020
 curiously-0.020
fish-0.019
 carefully-0.019
Hop-0.019
Token sad
Token ID6507
Feature activation+3.71e-02

Pos logit contributions
 quietly+0.006
 curiously+0.006
 carefully+0.006
 eagerly+0.005
fish+0.005
Neg logit contributions
 fault-0.008
 arrest-0.007
 fallen-0.007
 Kim-0.007
 earthquake-0.007
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
 fault+0.074
 arrest+0.067
 enough+0.064
 fallen+0.062
 Kim+0.062
Neg logit contributions
 curiously-0.056
 quietly-0.055
Hop-0.053
 eagerly-0.053
 carefully-0.052
Token They
Token ID1119
Feature activation+1.40e-02

Pos logit contributions
 fault+0.050
 Kim+0.043
 arrest+0.043
 enough+0.039
 fallen+0.038
Neg logit contributions
fish-0.054
 curiously-0.054
 carefully-0.052
 quietly-0.052
 eagerly-0.050
Token have
Token ID423
Feature activation+1.44e-02

Pos logit contributions
 fault+0.023
 arrest+0.022
 fallen+0.021
 Kim+0.019
 earthquake+0.019
Neg logit contributions
 quietly-0.025
 curiously-0.025
fish-0.023
 carefully-0.023
 eagerly-0.023
Token no
Token ID645
Feature activation+3.82e-02

Pos logit contributions
opl+0.023
 earthquake+0.021
 arrest+0.020
 OUT+0.020
ross+0.020
Neg logit contributions
 quietly-0.027
 carefully-0.026
 gently-0.024
 else-0.023
 excited-0.023
Token umb
Token ID20810
Feature activation+1.02e-01

Pos logit contributions
 fault+0.070
 arrest+0.069
erved+0.059
 Entry+0.059
 Kim+0.059
Neg logit contributions
 quietly-0.062
 carefully-0.059
 curiously-0.057
 eagerly-0.055
 cheer-0.053
Tokenrell
Token ID11252
Feature activation+7.88e-03

Pos logit contributions
 fault+0.123
 arrest+0.110
 enemy+0.106
 enough+0.101
 earthquake+0.092
Neg logit contributions
 carefully-0.215
 quietly-0.214
 cheer-0.209
 eagerly-0.202
 gently-0.199
Tokenas
Token ID292
Feature activation+7.98e-03

Pos logit contributions
 fault+0.019
 arrest+0.018
 earthquake+0.017
 fallen+0.017
 enough+0.017
Neg logit contributions
 quietly-0.008
 carefully-0.008
fish-0.008
 curiously-0.007
Hop-0.007
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
 fault+0.018
 arrest+0.018
erved+0.017
 earthquake+0.017
 Entry+0.017
Neg logit contributions
 quietly-0.009
 carefully-0.009
fish-0.008
 gently-0.008
 curiously-0.008
Token
Token ID198
Feature activation+2.23e-03

Pos logit contributions
 fault+0.047
 Kim+0.041
 arrest+0.040
 enough+0.037
 fallen+0.036
Neg logit contributions
 curiously-0.046
fish-0.045
 quietly-0.044
 carefully-0.044
 eagerly-0.043
Token
Token ID198
Feature activation+5.24e-03

Pos logit contributions
 Kim+0.004
 fault+0.003
 arrest+0.003
 fallen+0.003
upid+0.003
Neg logit contributions
fish-0.004
beans-0.004
 mer-0.004
 curiously-0.004
 therapist-0.004
Token jackets
Token ID35129
Feature activation-1.34e-02

Pos logit contributions
 fault+0.067
 arrest+0.065
 Kim+0.058
 Entry+0.058
 inconvenience+0.057
Neg logit contributions
 quietly-0.048
 carefully-0.045
 curiously-0.044
 eagerly-0.043
fish-0.042
Token became
Token ID2627
Feature activation+2.25e-02

Pos logit contributions
 quietly+0.018
 curiously+0.018
 carefully+0.017
 eagerly+0.017
fish+0.016
Neg logit contributions
 fault-0.028
 arrest-0.027
 Kim-0.025
 earthquake-0.024
 fallen-0.024
Token heavy
Token ID4334
Feature activation-2.11e-04

Pos logit contributions
 arrest+0.051
 fault+0.049
 inconvenience+0.044
 Entry+0.044
opl+0.044
Neg logit contributions
 quietly-0.031
 carefully-0.029
 curiously-0.028
 eagerly-0.027
 gently-0.026
Token and
Token ID290
Feature activation+9.16e-03

Pos logit contributions
 quietly+0.000
 curiously+0.000
 carefully+0.000
 eagerly+0.000
 closely+0.000
Neg logit contributions
 fault-0.000
 arrest-0.000
 Kim-0.000
 inconvenience-0.000
erved-0.000
Token their
Token ID511
Feature activation+3.94e-02

Pos logit contributions
 fault+0.019
 arrest+0.018
 fallen+0.016
erved+0.016
 Kim+0.016
Neg logit contributions
 quietly-0.013
 curiously-0.012
 carefully-0.012
 eagerly-0.012
fish-0.011
Token umb
Token ID20810
Feature activation+1.01e-01

Pos logit contributions
 fault+0.077
 arrest+0.072
 fallen+0.067
 Kim+0.065
 earthquake+0.065
Neg logit contributions
 quietly-0.061
 curiously-0.060
 carefully-0.057
 eagerly-0.056
fish-0.056
Tokenrell
Token ID11252
Feature activation+1.16e-02

Pos logit contributions
 fault+0.121
 arrest+0.106
 enemy+0.098
 enough+0.094
 inconvenience+0.091
Neg logit contributions
 carefully-0.231
 quietly-0.227
 eagerly-0.223
 curiously-0.223
 gently-0.213
Tokenas
Token ID292
Feature activation+9.32e-03

Pos logit contributions
 fault+0.027
 arrest+0.026
 earthquake+0.024
 inconvenience+0.024
 enough+0.024
Neg logit contributions
 quietly-0.013
 carefully-0.012
 curiously-0.012
fish-0.012
 eagerly-0.011
Token began
Token ID2540
Feature activation+8.02e-03

Pos logit contributions
 fault+0.020
 arrest+0.019
 earthquake+0.017
 inconvenience+0.017
 Kim+0.017
Neg logit contributions
 quietly-0.013
 carefully-0.013
 curiously-0.012
 eagerly-0.012
fish-0.012
Token to
Token ID284
Feature activation+3.22e-02

Pos logit contributions
 fault+0.015
 arrest+0.015
 earthquake+0.013
 Entry+0.013
erved+0.013
Neg logit contributions
 quietly-0.013
 curiously-0.012
 carefully-0.012
 eagerly-0.012
 gently-0.011
Token dro
Token ID3102
Feature activation+1.66e-02

Pos logit contributions
 fault+0.060
 arrest+0.056
 fallen+0.052
 Kim+0.051
 earthquake+0.051
Neg logit contributions
 quietly-0.052
 curiously-0.051
 carefully-0.049
fish-0.048
 eagerly-0.048
Token for
Token ID329
Feature activation-9.82e-03

Pos logit contributions
 quietly+0.004
 curiously+0.003
 carefully+0.003
 eagerly+0.003
fish+0.003
Neg logit contributions
 fault-0.004
 arrest-0.004
 fallen-0.004
 Kim-0.004
 enough-0.003
Token them
Token ID606
Feature activation-2.74e-03

Pos logit contributions
Hop+0.015
 curiously+0.015
fish+0.015
 quietly+0.014
Sure+0.013
Neg logit contributions
 arrest-0.017
 Kim-0.017
 fault-0.017
 enough-0.016
 fallen-0.016
Token with
Token ID351
Feature activation-3.75e-02

Pos logit contributions
 quietly+0.004
 curiously+0.003
 carefully+0.003
 eagerly+0.003
fish+0.003
Neg logit contributions
 fault-0.006
 arrest-0.006
 fallen-0.005
 Kim-0.005
 earthquake-0.005
Token towels
Token ID38494
Feature activation-4.67e-03

Pos logit contributions
 quietly+0.056
 curiously+0.054
 carefully+0.052
 eagerly+0.051
fish+0.050
Neg logit contributions
 fault-0.076
 arrest-0.071
 fallen-0.066
 Kim-0.065
 earthquake-0.064
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
 quietly+0.006
 carefully+0.005
 curiously+0.005
 eagerly+0.005
fish+0.005
Neg logit contributions
 fault-0.011
 arrest-0.011
 fallen-0.010
 earthquake-0.010
erved-0.010
Token umb
Token ID20810
Feature activation+9.81e-02

Pos logit contributions
 fault+0.027
 arrest+0.025
 earthquake+0.023
 inconvenience+0.023
 fallen+0.023
Neg logit contributions
 quietly-0.018
 carefully-0.017
 curiously-0.017
 eagerly-0.016
fish-0.016
Tokenrell
Token ID11252
Feature activation+5.29e-03

Pos logit contributions
 fault+0.103
 arrest+0.094
 enemy+0.081
 loser+0.081
 enough+0.080
Neg logit contributions
 carefully-0.232
 quietly-0.231
 curiously-0.223
 eagerly-0.221
 gently-0.215
Tokenas
Token ID292
Feature activation+9.01e-03

Pos logit contributions
 fault+0.012
 arrest+0.011
 fallen+0.010
 Kim+0.010
 earthquake+0.010
Neg logit contributions
 quietly-0.007
 curiously-0.007
 carefully-0.006
 eagerly-0.006
fish-0.006
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
 fault+0.020
 arrest+0.019
 earthquake+0.018
 fallen+0.018
erved+0.018
Neg logit contributions
 quietly-0.011
 carefully-0.011
 curiously-0.011
 eagerly-0.010
fish-0.010
Token They
Token ID1119
Feature activation+9.25e-03

Pos logit contributions
 Kim+0.022
 fault+0.021
 arrest+0.019
 enough+0.016
 fallen+0.016
Neg logit contributions
fish-0.040
 curiously-0.037
Hop-0.037
in-0.037
 carefully-0.036
Token hug
Token ID16225
Feature activation+1.75e-02

Pos logit contributions
 arrest+0.015
 fault+0.014
 Kim+0.012
 enough+0.012
 fallen+0.012
Neg logit contributions
fish-0.017
Hop-0.016
 curiously-0.016
 quietly-0.016
in-0.015
Token just
Token ID655
Feature activation+3.36e-02

Pos logit contributions
 fault+0.042
 arrest+0.040
 fallen+0.037
 Kim+0.037
 enough+0.036
Neg logit contributions
 quietly-0.024
 curiously-0.023
 carefully-0.022
 eagerly-0.022
fish-0.021
Token like
Token ID588
Feature activation-3.93e-02

Pos logit contributions
 fault+0.064
 arrest+0.060
 fallen+0.055
 Kim+0.054
 enough+0.053
Neg logit contributions
 quietly-0.054
 curiously-0.053
 carefully-0.050
 eagerly-0.049
fish-0.049
Token in
Token ID287
Feature activation-1.50e-03

Pos logit contributions
 quietly+0.058
 carefully+0.056
 curiously+0.056
 eagerly+0.053
 mer+0.049
Neg logit contributions
 fault-0.082
 arrest-0.075
erved-0.072
 fallen-0.069
 inconvenience-0.068
Token a
Token ID257
Feature activation+1.83e-02

Pos logit contributions
 quietly+0.002
 carefully+0.002
 curiously+0.002
 eagerly+0.002
 cheer+0.002
Neg logit contributions
 fault-0.003
erved-0.003
 arrest-0.003
EEP-0.003
 inconvenience-0.003
Token fair
Token ID3148
Feature activation+2.01e-02

Pos logit contributions
 fault+0.032
 arrest+0.031
 fallen+0.028
 earthquake+0.027
 enough+0.027
Neg logit contributions
 quietly-0.030
 curiously-0.030
 eagerly-0.028
 carefully-0.028
fish-0.028
Tokenyt
Token ID20760
Feature activation+9.72e-02

Pos logit contributions
 fault+0.039
 arrest+0.036
 fallen+0.033
 Kim+0.033
erved+0.032
Neg logit contributions
 quietly-0.032
 curiously-0.031
 carefully-0.030
 eagerly-0.029
fish-0.029
Tokenale
Token ID1000
Feature activation+1.46e-02

Pos logit contributions
 arrest+0.180
 tresp+0.180
 caused+0.178
 fault+0.177
 earthquake+0.175
Neg logit contributions
 carefully-0.147
 curiously-0.141
 closely-0.139
 eagerly-0.137
 quietly-0.130
Token!
Token ID0
Feature activation-5.46e-03

Pos logit contributions
 fault+0.030
 arrest+0.028
erved+0.026
 Kim+0.026
 fallen+0.026
Neg logit contributions
 quietly-0.021
 curiously-0.021
 carefully-0.020
fish-0.019
 eagerly-0.019
Token<|endoftext|>
Token ID50256
Feature activation-2.21e-02

Pos logit contributions
 quietly+0.006
 curiously+0.005
 carefully+0.005
fish+0.005
 eagerly+0.005
Neg logit contributions
 fault-0.013
 arrest-0.013
 Kim-0.012
 fallen-0.012
 enough-0.012
TokenOnce
Token ID7454
Feature activation-1.76e-02

Pos logit contributions
 quietly+0.029
 curiously+0.029
 carefully+0.028
fish+0.027
 closely+0.026
Neg logit contributions
 fault-0.045
 arrest-0.043
 fallen-0.042
 Kim-0.041
 enough-0.041
Token upon
Token ID2402
Feature activation+1.26e-02

Pos logit contributions
 quietly+0.026
 curiously+0.026
 carefully+0.025
 eagerly+0.024
fish+0.024
Neg logit contributions
 fault-0.035
 arrest-0.033
 fallen-0.030
 Kim-0.030
 enough-0.030
Token the
Token ID262
Feature activation-6.30e-03

Pos logit contributions
 quietly+0.019
 curiously+0.018
fish+0.018
Hop+0.017
 carefully+0.017
Neg logit contributions
 fault-0.019
 arrest-0.017
 enough-0.016
 Kim-0.016
 fallen-0.015
Token tree
Token ID5509
Feature activation+7.07e-03

Pos logit contributions
 quietly+0.009
 curiously+0.009
 carefully+0.009
 eagerly+0.008
fish+0.008
Neg logit contributions
 fault-0.013
 arrest-0.012
 Kim-0.011
erved-0.011
 fallen-0.011
Token to
Token ID284
Feature activation+1.33e-02

Pos logit contributions
 fault+0.016
 arrest+0.015
 fallen+0.014
 Kim+0.014
 inconvenience+0.014
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.009
 eagerly-0.008
fish-0.008
Token its
Token ID663
Feature activation+3.96e-02

Pos logit contributions
 fault+0.027
 arrest+0.024
 fallen+0.023
erved+0.022
 enough+0.022
Neg logit contributions
 quietly-0.021
 curiously-0.020
 carefully-0.020
 eagerly-0.019
fish-0.018
Token once
Token ID1752
Feature activation-9.95e-03

Pos logit contributions
 fault+0.075
 arrest+0.070
 fallen+0.064
 Kim+0.064
 earthquake+0.062
Neg logit contributions
 quietly-0.064
 curiously-0.062
 carefully-0.060
 eagerly-0.058
fish-0.057
Token former
Token ID1966
Feature activation+9.60e-02

Pos logit contributions
 quietly+0.014
 curiously+0.014
Hop+0.013
fish+0.013
 carefully+0.013
Neg logit contributions
 fault-0.020
 arrest-0.019
 fallen-0.018
 enough-0.018
 Kim-0.017
Token beauty
Token ID8737
Feature activation+1.91e-03

Pos logit contributions
 fault+0.146
 arrest+0.138
 Kim+0.133
 fallen+0.122
 inconvenience+0.120
Neg logit contributions
 quietly-0.183
 curiously-0.183
 carefully-0.182
 eagerly-0.176
 closely-0.170
Token.
Token ID13
Feature activation-3.38e-03

Pos logit contributions
 fault+0.004
 arrest+0.003
 enough+0.003
 fallen+0.003
 Kim+0.003
Neg logit contributions
 quietly-0.003
 curiously-0.003
 carefully-0.003
 eagerly-0.003
Hop-0.003
Token He
Token ID679
Feature activation-4.79e-03

Pos logit contributions
 quietly+0.005
 curiously+0.005
 carefully+0.005
 eagerly+0.005
Hop+0.004
Neg logit contributions
 fault-0.007
 arrest-0.006
 fallen-0.006
 earthquake-0.006
erved-0.006
Token was
Token ID373
Feature activation-3.93e-03

Pos logit contributions
 quietly+0.007
 curiously+0.007
fish+0.007
Hop+0.007
 mer+0.006
Neg logit contributions
 fault-0.009
 arrest-0.008
 fallen-0.008
 enough-0.008
 caused-0.007
Token happy
Token ID3772
Feature activation-9.77e-04

Pos logit contributions
 quietly+0.006
 curiously+0.006
 carefully+0.005
 eagerly+0.005
fish+0.005
Neg logit contributions
 fault-0.008
 arrest-0.008
 fallen-0.007
 earthquake-0.007
 Kim-0.007
Token were
Token ID547
Feature activation+1.57e-02

Pos logit contributions
 quietly+0.011
 curiously+0.011
 carefully+0.010
 eagerly+0.010
fish+0.010
Neg logit contributions
 fault-0.017
 arrest-0.016
 fallen-0.015
 Kim-0.015
 enough-0.015
Token four
Token ID1440
Feature activation+9.00e-03

Pos logit contributions
 arrest+0.032
 fault+0.032
erved+0.029
 AW+0.028
upid+0.027
Neg logit contributions
 quietly-0.023
 carefully-0.022
 curiously-0.021
 eagerly-0.020
 mer-0.020
Token seasons
Token ID7028
Feature activation+2.71e-02

Pos logit contributions
 fault+0.017
 arrest+0.016
 fallen+0.015
 Kim+0.015
 enough+0.015
Neg logit contributions
 quietly-0.014
 curiously-0.014
 carefully-0.013
 eagerly-0.013
fish-0.013
Token in
Token ID287
Feature activation+8.61e-03

Pos logit contributions
 fault+0.054
 arrest+0.050
 fallen+0.047
 Kim+0.046
 enough+0.046
Neg logit contributions
 quietly-0.040
 curiously-0.040
 carefully-0.037
 eagerly-0.036
fish-0.036
Token a
Token ID257
Feature activation+2.74e-02

Pos logit contributions
 fault+0.019
 arrest+0.017
erved+0.016
 fallen+0.016
EEP+0.016
Neg logit contributions
 quietly-0.012
 curiously-0.012
 carefully-0.011
 eagerly-0.011
fish-0.010
Token year
Token ID614
Feature activation+9.50e-02

Pos logit contributions
 fault+0.052
 arrest+0.050
 fallen+0.048
 earthquake+0.047
 enough+0.045
Neg logit contributions
 curiously-0.041
 quietly-0.039
 eagerly-0.038
fish-0.038
 carefully-0.037
Token:
Token ID25
Feature activation+1.16e-02

Pos logit contributions
 fault+0.203
 arrest+0.190
 fallen+0.174
erved+0.171
 Kim+0.171
Neg logit contributions
 quietly-0.136
 curiously-0.131
 carefully-0.129
 eagerly-0.122
fish-0.120
Token winter
Token ID7374
Feature activation+2.62e-02

Pos logit contributions
 fault+0.022
 arrest+0.020
 fallen+0.019
erved+0.018
 Kim+0.018
Neg logit contributions
 quietly-0.019
 curiously-0.019
 carefully-0.018
 eagerly-0.017
fish-0.017
Token,
Token ID11
Feature activation+2.40e-02

Pos logit contributions
 fault+0.056
 arrest+0.053
 fallen+0.049
 Kim+0.048
erved+0.048
Neg logit contributions
 quietly-0.036
 curiously-0.035
 carefully-0.033
 eagerly-0.032
fish-0.032
Token spring
Token ID6076
Feature activation+1.61e-02

Pos logit contributions
 fault+0.052
 arrest+0.049
 fallen+0.045
 Kim+0.045
erved+0.045
Neg logit contributions
 quietly-0.032
 curiously-0.031
 carefully-0.030
 eagerly-0.029
fish-0.028
Token,
Token ID11
Feature activation+3.18e-02

Pos logit contributions
 fault+0.034
 arrest+0.032
 fallen+0.029
 Kim+0.029
 enough+0.028
Neg logit contributions
 quietly-0.023
 curiously-0.022
 carefully-0.021
 eagerly-0.020
fish-0.020
Token sad
Token ID6507
Feature activation+2.76e-02

Pos logit contributions
 curiously+0.022
Hop+0.022
fish+0.021
 quietly+0.020
 eagerly+0.019
Neg logit contributions
 fault-0.052
 fallen-0.049
 Kim-0.049
 arrest-0.048
 earthquake-0.047
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 fault+0.061
 arrest+0.059
 fallen+0.054
erved+0.053
 inconvenience+0.053
Neg logit contributions
 quietly-0.037
 curiously-0.036
 carefully-0.035
fish-0.032
 eagerly-0.032
Token He
Token ID679
Feature activation-6.30e-03

Pos logit contributions
 fault+0.025
 arrest+0.024
 fallen+0.021
 inconvenience+0.021
 enough+0.020
Neg logit contributions
 quietly-0.021
 curiously-0.021
 carefully-0.019
 eagerly-0.019
 softly-0.018
Token had
Token ID550
Feature activation+2.85e-02

Pos logit contributions
 quietly+0.010
 curiously+0.010
 carefully+0.010
 eagerly+0.009
fish+0.009
Neg logit contributions
 fault-0.012
 arrest-0.011
 fallen-0.010
 Kim-0.010
 earthquake-0.010
Token no
Token ID645
Feature activation+2.61e-02

Pos logit contributions
opl+0.057
ross+0.051
 earthquake+0.051
 Entry+0.050
 fault+0.050
Neg logit contributions
 quietly-0.044
 carefully-0.044
 excited-0.039
 eagerly-0.038
 slowly-0.037
Token umb
Token ID20810
Feature activation+9.45e-02

Pos logit contributions
 fault+0.051
 arrest+0.049
 fallen+0.044
 Kim+0.044
erved+0.043
Neg logit contributions
 quietly-0.040
 curiously-0.039
 carefully-0.038
 eagerly-0.036
fish-0.035
Tokenrell
Token ID11252
Feature activation-2.40e-04

Pos logit contributions
 fault+0.118
 arrest+0.105
 enemy+0.100
 enough+0.096
 tresp+0.093
Neg logit contributions
 carefully-0.203
 quietly-0.198
 eagerly-0.198
 curiously-0.192
 cheer-0.189
Tokenas
Token ID292
Feature activation+1.49e-03

Pos logit contributions
 quietly+0.000
 carefully+0.000
 curiously+0.000
fish+0.000
 eagerly+0.000
Neg logit contributions
 fault-0.001
 arrest-0.001
 earthquake-0.001
 Entry-0.001
 fallen-0.001
Token or
Token ID393
Feature activation+1.80e-02

Pos logit contributions
 fault+0.003
 arrest+0.003
erved+0.003
 Entry+0.003
 inconvenience+0.003
Neg logit contributions
 quietly-0.002
 carefully-0.002
 curiously-0.002
fish-0.002
 eagerly-0.002
Token hats
Token ID23910
Feature activation+1.72e-02

Pos logit contributions
 fault+0.037
 arrest+0.033
OO+0.032
erved+0.032
opl+0.031
Neg logit contributions
 quietly-0.026
 carefully-0.025
 mer-0.023
fish-0.023
 curiously-0.023
Token to
Token ID284
Feature activation+2.28e-02

Pos logit contributions
 fault+0.040
 arrest+0.040
 earthquake+0.037
 Entry+0.037
 inconvenience+0.037
Neg logit contributions
 quietly-0.020
 carefully-0.019
 curiously-0.018
 eagerly-0.017
fish-0.016
Token the
Token ID262
Feature activation+2.25e-02

Pos logit contributions
 fault+0.005
 arrest+0.005
 fallen+0.005
 Kim+0.004
erved+0.004
Neg logit contributions
 quietly-0.004
 curiously-0.004
 carefully-0.004
 eagerly-0.004
fish-0.004
Token big
Token ID1263
Feature activation-1.12e-02

Pos logit contributions
 fault+0.041
 fallen+0.037
 arrest+0.037
 earthquake+0.037
 enough+0.035
Neg logit contributions
 quietly-0.036
 curiously-0.035
Hop-0.035
fish-0.034
 carefully-0.034
Token fork
Token ID15563
Feature activation-3.30e-02

Pos logit contributions
 quietly+0.018
 curiously+0.017
 carefully+0.017
Hop+0.017
fish+0.016
Neg logit contributions
 fault-0.021
 arrest-0.019
 fallen-0.018
 earthquake-0.018
 enough-0.018
Token did
Token ID750
Feature activation+2.17e-02

Pos logit contributions
 quietly+0.044
 curiously+0.043
 carefully+0.041
 eagerly+0.039
fish+0.039
Neg logit contributions
 fault-0.072
 arrest-0.068
 fallen-0.063
 Kim-0.063
 earthquake-0.062
Token not
Token ID407
Feature activation+6.36e-02

Pos logit contributions
 fault+0.037
 arrest+0.033
opl+0.032
 earthquake+0.032
 inconvenience+0.031
Neg logit contributions
 quietly-0.041
 carefully-0.041
 curiously-0.039
 eagerly-0.038
 closely-0.037
Token reapp
Token ID24578
Feature activation+9.30e-02

Pos logit contributions
 fault+0.139
 arrest+0.129
erved+0.120
 Entry+0.116
 fallen+0.116
Neg logit contributions
 quietly-0.090
 carefully-0.086
 curiously-0.085
 eagerly-0.081
fish-0.079
Tokenear
Token ID451
Feature activation+4.79e-03

Pos logit contributions
 fault+0.180
 inconvenience+0.162
 earthquake+0.157
 arrest+0.154
 caused+0.142
Neg logit contributions
 curiously-0.155
 carefully-0.150
 closely-0.147
 quietly-0.146
 eagerly-0.137
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
 fault+0.010
 arrest+0.010
 fallen+0.009
 inconvenience+0.009
erved+0.009
Neg logit contributions
 quietly-0.007
 curiously-0.006
 carefully-0.006
 eagerly-0.006
fish-0.006
Token They
Token ID1119
Feature activation+7.61e-03

Pos logit contributions
 quietly+0.003
 curiously+0.003
 carefully+0.003
 eagerly+0.003
fish+0.003
Neg logit contributions
 fault-0.004
 arrest-0.003
 fallen-0.003
 Kim-0.003
 earthquake-0.003
Token searched
Token ID16499
Feature activation-8.46e-03

Pos logit contributions
 fault+0.014
 arrest+0.013
 fallen+0.012
 Kim+0.012
 enough+0.012
Neg logit contributions
 quietly-0.012
 curiously-0.012
fish-0.012
 carefully-0.011
Hop-0.011
Token for
Token ID329
Feature activation-9.72e-03

Pos logit contributions
 quietly+0.011
 curiously+0.011
fish+0.011
Hop+0.011
 carefully+0.011
Neg logit contributions
 fault-0.018
 arrest-0.016
 fallen-0.015
 enough-0.015
 Kim-0.015
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
 fault+0.032
 arrest+0.030
 fallen+0.027
erved+0.027
 Kim+0.026
Neg logit contributions
 quietly-0.027
 curiously-0.026
 carefully-0.025
 eagerly-0.024
fish-0.024
Token And
Token ID843
Feature activation+1.15e-02

Pos logit contributions
 fault+0.030
 arrest+0.027
 Kim+0.025
 fallen+0.024
 earthquake+0.023
Neg logit contributions
 curiously-0.046
 quietly-0.045
 carefully-0.044
fish-0.044
 eagerly-0.043
Token I
Token ID314
Feature activation-2.21e-02

Pos logit contributions
 fault+0.016
 arrest+0.015
 enough+0.014
 Kim+0.014
 fallen+0.013
Neg logit contributions
 curiously-0.023
 quietly-0.022
fish-0.022
 eagerly-0.021
Hop-0.021
Token like
Token ID588
Feature activation-5.10e-02

Pos logit contributions
 curiously+0.041
 quietly+0.039
fish+0.037
 eagerly+0.037
 carefully+0.037
Neg logit contributions
 fault-0.038
 arrest-0.033
 fallen-0.031
 enough-0.030
 inconvenience-0.029
Token your
Token ID534
Feature activation+1.21e-02

Pos logit contributions
 quietly+0.085
 curiously+0.082
 carefully+0.080
 eagerly+0.077
fish+0.077
Neg logit contributions
 fault-0.094
 arrest-0.087
 fallen-0.080
erved-0.079
 Kim-0.078
Token umb
Token ID20810
Feature activation+9.15e-02

Pos logit contributions
 fault+0.024
 arrest+0.022
 fallen+0.021
 Kim+0.020
 earthquake+0.020
Neg logit contributions
 quietly-0.018
 curiously-0.018
 carefully-0.017
 eagerly-0.017
fish-0.016
Tokenrell
Token ID11252
Feature activation-5.64e-03

Pos logit contributions
 fault+0.089
 arrest+0.080
 enough+0.064
 fallen+0.063
 Kim+0.060
Neg logit contributions
 quietly-0.227
 carefully-0.223
 curiously-0.219
 eagerly-0.216
fish-0.212
Tokenas
Token ID292
Feature activation+2.29e-03

Pos logit contributions
 quietly+0.008
 curiously+0.008
 carefully+0.007
 eagerly+0.007
fish+0.007
Neg logit contributions
 fault-0.012
 arrest-0.011
 fallen-0.010
 Kim-0.010
 enough-0.010
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 fault+0.004
 arrest+0.004
 fallen+0.004
 Kim+0.004
erved+0.004
Neg logit contributions
 quietly-0.004
 curiously-0.003
 carefully-0.003
fish-0.003
 eagerly-0.003
Token They
Token ID1119
Feature activation+1.29e-02

Pos logit contributions
 fault+0.016
 arrest+0.015
 Kim+0.014
 fallen+0.013
 enough+0.013
Neg logit contributions
 curiously-0.025
 quietly-0.024
fish-0.024
 carefully-0.023
 eagerly-0.023
Token are
Token ID389
Feature activation-5.12e-03

Pos logit contributions
 fault+0.019
 arrest+0.018
 fallen+0.016
 enough+0.015
 Kim+0.015
Neg logit contributions
 quietly-0.026
 curiously-0.026
 carefully-0.024
 eagerly-0.024
fish-0.024
Token got
Token ID1392
Feature activation-1.95e-03

Pos logit contributions
 fault+0.003
 arrest+0.003
 enough+0.003
 fallen+0.003
 Kim+0.003
Neg logit contributions
 curiously-0.003
fish-0.003
 quietly-0.003
Hop-0.003
 closely-0.003
Token out
Token ID503
Feature activation-1.94e-02

Pos logit contributions
fish+0.003
 curiously+0.003
Hop+0.003
 quietly+0.003
 mer+0.003
Neg logit contributions
 fault-0.004
 enough-0.004
 fallen-0.003
 arrest-0.003
 Kim-0.003
Token their
Token ID511
Feature activation+1.71e-02

Pos logit contributions
 quietly+0.027
 curiously+0.026
 carefully+0.025
 eagerly+0.024
fish+0.023
Neg logit contributions
 fault-0.041
 arrest-0.039
 fallen-0.036
 Kim-0.035
erved-0.035
Token coats
Token ID30720
Feature activation+3.67e-03

Pos logit contributions
 fault+0.032
 arrest+0.030
 fallen+0.028
 Kim+0.027
 earthquake+0.027
Neg logit contributions
 quietly-0.028
 curiously-0.028
fish-0.026
 carefully-0.026
 eagerly-0.026
Token and
Token ID290
Feature activation-5.40e-03

Pos logit contributions
 fault+0.009
 arrest+0.009
 fallen+0.008
 earthquake+0.008
 inconvenience+0.008
Neg logit contributions
 quietly-0.004
 curiously-0.003
 carefully-0.003
 eagerly-0.003
fish-0.003
Token umb
Token ID20810
Feature activation+9.11e-02

Pos logit contributions
 quietly+0.008
 curiously+0.008
 carefully+0.008
 eagerly+0.007
fish+0.007
Neg logit contributions
 fault-0.011
 arrest-0.010
 fallen-0.009
 Kim-0.009
 earthquake-0.009
Tokenrell
Token ID11252
Feature activation+8.74e-04

Pos logit contributions
 fault+0.100
 arrest+0.087
 enemy+0.077
 enough+0.075
 inconvenience+0.072
Neg logit contributions
 carefully-0.218
 quietly-0.217
 curiously-0.214
 eagerly-0.212
 gently-0.201
Tokenas
Token ID292
Feature activation+1.30e-03

Pos logit contributions
 fault+0.002
 arrest+0.002
 fallen+0.002
 earthquake+0.002
 enough+0.002
Neg logit contributions
 quietly-0.001
 curiously-0.001
 carefully-0.001
 eagerly-0.001
fish-0.001
Token and
Token ID290
Feature activation+7.58e-03

Pos logit contributions
 fault+0.003
 arrest+0.003
 fallen+0.003
 Kim+0.003
 earthquake+0.003
Neg logit contributions
 quietly-0.002
 curiously-0.002
 carefully-0.001
 eagerly-0.001
fish-0.001
Token they
Token ID484
Feature activation+2.43e-02

Pos logit contributions
 fault+0.016
 arrest+0.015
 earthquake+0.014
 fallen+0.014
erved+0.013
Neg logit contributions
 quietly-0.011
 carefully-0.010
 curiously-0.010
 eagerly-0.010
fish-0.009
Token held
Token ID2714
Feature activation-4.78e-02

Pos logit contributions
 fault+0.044
 arrest+0.041
 fallen+0.039
 Kim+0.039
 enough+0.038
Neg logit contributions
 curiously-0.039
 quietly-0.039
fish-0.037
Hop-0.036
 eagerly-0.036
Token door
Token ID3420
Feature activation+2.67e-02

Pos logit contributions
 quietly+0.008
 curiously+0.007
 carefully+0.007
 eagerly+0.007
fish+0.006
Neg logit contributions
 fault-0.013
 arrest-0.013
 fallen-0.011
 earthquake-0.011
 Kim-0.011
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 fault+0.053
 arrest+0.049
 fallen+0.046
 Kim+0.046
 enough+0.046
Neg logit contributions
 quietly-0.038
 curiously-0.038
Hop-0.037
fish-0.037
 carefully-0.035
Token Inside
Token ID14384
Feature activation-4.37e-02

Pos logit contributions
fish+0.021
 quietly+0.020
 curiously+0.020
 carefully+0.019
Hop+0.019
Neg logit contributions
 fault-0.018
 Kim-0.016
 arrest-0.016
 fallen-0.015
 enough-0.015
Token were
Token ID547
Feature activation+1.55e-02

Pos logit contributions
 quietly+0.063
 curiously+0.062
 carefully+0.060
 eagerly+0.058
fish+0.057
Neg logit contributions
 fault-0.090
 arrest-0.085
 fallen-0.078
erved-0.077
 earthquake-0.077
Token two
Token ID734
Feature activation+4.27e-02

Pos logit contributions
 fault+0.030
 arrest+0.029
 earthquake+0.026
erved+0.025
 fallen+0.025
Neg logit contributions
 quietly-0.024
 curiously-0.023
 carefully-0.023
 eagerly-0.022
Hop-0.021
Token umb
Token ID20810
Feature activation+9.04e-02

Pos logit contributions
 fault+0.082
 arrest+0.077
 fallen+0.073
 enough+0.071
 Kim+0.070
Neg logit contributions
 quietly-0.066
 curiously-0.065
 carefully-0.061
fish-0.061
 eagerly-0.060
Tokenrell
Token ID11252
Feature activation-2.71e-03

Pos logit contributions
 fault+0.103
 arrest+0.086
 enemy+0.077
 enough+0.077
 loser+0.075
Neg logit contributions
 carefully-0.211
 curiously-0.207
 quietly-0.207
 eagerly-0.206
 cheer-0.194
Tokenas
Token ID292
Feature activation-4.70e-04

Pos logit contributions
 quietly+0.003
 curiously+0.003
 carefully+0.003
 eagerly+0.003
fish+0.003
Neg logit contributions
 fault-0.006
 arrest-0.006
 fallen-0.005
 Kim-0.005
 earthquake-0.005
Token -
Token ID532
Feature activation+8.06e-03

Pos logit contributions
 quietly+0.001
 curiously+0.001
 carefully+0.001
 eagerly+0.001
fish+0.001
Neg logit contributions
 fault-0.001
 arrest-0.001
 earthquake-0.001
erved-0.001
 fallen-0.001
Token one
Token ID530
Feature activation-2.39e-03

Pos logit contributions
 fault+0.016
 arrest+0.015
erved+0.014
 earthquake+0.013
 fallen+0.013
Neg logit contributions
 quietly-0.012
 carefully-0.012
 curiously-0.012
 eagerly-0.011
 gently-0.011
Token big
Token ID1263
Feature activation-8.35e-03

Pos logit contributions
 curiously+0.004
 quietly+0.004
 eagerly+0.003
 carefully+0.003
Hop+0.003
Neg logit contributions
 fault-0.005
 arrest-0.004
 fallen-0.004
 Kim-0.004
 enough-0.004
Token hard
Token ID1327
Feature activation-6.63e-03

Pos logit contributions
 curiously+0.005
 quietly+0.005
 eagerly+0.005
Hop+0.005
 carefully+0.005
Neg logit contributions
 fault-0.005
 arrest-0.004
 fallen-0.004
 earthquake-0.004
 enough-0.004
Token game
Token ID983
Feature activation+1.96e-02

Pos logit contributions
 quietly+0.011
 curiously+0.011
 carefully+0.010
 eagerly+0.010
fish+0.010
Neg logit contributions
 fault-0.012
 arrest-0.012
 Kim-0.011
 fallen-0.011
erved-0.010
Token for
Token ID329
Feature activation+1.56e-02

Pos logit contributions
 fault+0.042
 arrest+0.039
 fallen+0.036
 Kim+0.036
erved+0.036
Neg logit contributions
 quietly-0.027
 curiously-0.026
 carefully-0.025
 eagerly-0.024
fish-0.024
Token a
Token ID257
Feature activation+1.31e-02

Pos logit contributions
 fault+0.033
 arrest+0.030
 fallen+0.028
erved+0.028
 enough+0.027
Neg logit contributions
 quietly-0.023
 curiously-0.023
 carefully-0.022
 eagerly-0.021
fish-0.020
Token three
Token ID1115
Feature activation+4.68e-03

Pos logit contributions
 fault+0.023
 arrest+0.021
 fallen+0.019
 earthquake+0.019
 inconvenience+0.019
Neg logit contributions
 quietly-0.022
 curiously-0.022
 eagerly-0.021
 carefully-0.021
fish-0.020
Token year
Token ID614
Feature activation+8.98e-02

Pos logit contributions
 fault+0.008
 arrest+0.007
 fallen+0.007
 earthquake+0.006
 enough+0.006
Neg logit contributions
 quietly-0.008
 curiously-0.008
 carefully-0.008
Hop-0.008
 eagerly-0.008
Token old
Token ID1468
Feature activation+6.20e-03

Pos logit contributions
 fault+0.210
 arrest+0.201
 fallen+0.185
erved+0.180
 Kim+0.180
Neg logit contributions
 quietly-0.112
 carefully-0.106
 curiously-0.101
 eagerly-0.094
 closely-0.093
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 Kim+0.011
erved+0.011
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.008
 eagerly-0.008
fish-0.008
Token She
Token ID1375
Feature activation+1.15e-03

Pos logit contributions
 fault+0.030
 arrest+0.027
 Kim+0.025
 fallen+0.025
 enough+0.024
Neg logit contributions
 quietly-0.032
 curiously-0.032
fish-0.031
 carefully-0.030
 eagerly-0.030
Token also
Token ID635
Feature activation-8.77e-03

Pos logit contributions
 fault+0.002
 arrest+0.002
 enough+0.002
 fallen+0.002
 caused+0.002
Neg logit contributions
 curiously-0.002
fish-0.002
Hop-0.002
 quietly-0.002
 closely-0.002
Token knew
Token ID2993
Feature activation+8.91e-03

Pos logit contributions
 curiously+0.013
 quietly+0.013
fish+0.013
 carefully+0.012
Hop+0.012
Neg logit contributions
 fault-0.017
 arrest-0.016
 fallen-0.015
 Kim-0.015
 enough-0.014
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
 fault+0.085
 arrest+0.080
 fallen+0.073
 Kim+0.072
 enough+0.071
Neg logit contributions
 quietly-0.070
 curiously-0.069
 carefully-0.066
 eagerly-0.065
fish-0.064
Token elephants
Token ID25368
Feature activation+5.73e-02

Pos logit contributions
 fault+0.044
 arrest+0.038
 earthquake+0.035
erved+0.035
 fallen+0.035
Neg logit contributions
 quietly-0.054
 curiously-0.052
 carefully-0.052
fish-0.050
 eagerly-0.050
Token,
Token ID11
Feature activation+2.71e-02

Pos logit contributions
 fault+0.121
 arrest+0.113
 fallen+0.105
 Kim+0.103
 earthquake+0.103
Neg logit contributions
 quietly-0.080
 curiously-0.078
 carefully-0.075
 eagerly-0.072
fish-0.072
Token monkeys
Token ID26697
Feature activation+5.70e-02

Pos logit contributions
 fault+0.071
 arrest+0.064
 earthquake+0.063
 Kim+0.061
 inconvenience+0.061
Neg logit contributions
 quietly-0.027
 carefully-0.025
 curiously-0.024
 eagerly-0.022
fish-0.022
Token and
Token ID290
Feature activation+2.42e-02

Pos logit contributions
 fault+0.123
 arrest+0.114
 fallen+0.107
 earthquake+0.105
erved+0.105
Neg logit contributions
 quietly-0.078
 curiously-0.075
 carefully-0.073
 eagerly-0.070
fish-0.069
Token gir
Token ID37370
Feature activation+8.97e-02

Pos logit contributions
 fault+0.046
 arrest+0.038
erved+0.037
 earthquake+0.037
upid+0.037
Neg logit contributions
 quietly-0.041
 carefully-0.040
 curiously-0.039
 mer-0.038
Hop-0.038
Tokenaff
Token ID2001
Feature activation+3.36e-02

Pos logit contributions
 fault+0.191
 arrest+0.170
 earthquake+0.165
 fallen+0.164
 enough+0.161
Neg logit contributions
 quietly-0.119
 carefully-0.117
 curiously-0.114
fish-0.109
 eagerly-0.108
Tokenes
Token ID274
Feature activation+4.14e-02

Pos logit contributions
 fault+0.075
 arrest+0.068
 earthquake+0.065
 fallen+0.065
erved+0.064
Neg logit contributions
 quietly-0.045
fish-0.042
 curiously-0.042
 carefully-0.042
 eagerly-0.040
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
 fault+0.079
 arrest+0.076
 enough+0.070
 fallen+0.069
 Kim+0.068
Neg logit contributions
 curiously-0.062
 quietly-0.062
 carefully-0.057
 eagerly-0.057
fish-0.056
Token They
Token ID1119
Feature activation+1.01e-02

Pos logit contributions
 fault+0.021
 enough+0.018
 Kim+0.017
 arrest+0.017
 Entry+0.015
Neg logit contributions
fish-0.031
 mer-0.029
 curiously-0.029
 cheer-0.028
 quietly-0.028
Token pretend
Token ID16614
Feature activation-3.36e-02

Pos logit contributions
 fault+0.017
 arrest+0.016
 fallen+0.015
 Kim+0.015
 enough+0.014
Neg logit contributions
 quietly-0.017
 curiously-0.017
fish-0.016
 carefully-0.016
 eagerly-0.016
Token ze
Token ID41271
Feature activation+7.07e-02

Pos logit contributions
 fault+0.037
 arrest+0.033
 fallen+0.029
 earthquake+0.029
erved+0.029
Neg logit contributions
 quietly-0.051
 curiously-0.050
 carefully-0.049
 eagerly-0.047
fish-0.047
Tokenbr
Token ID1671
Feature activation+2.07e-02

Pos logit contributions
 fault+0.109
 arrest+0.097
 fallen+0.088
 Kim+0.086
 enough+0.086
Neg logit contributions
 quietly-0.140
 curiously-0.136
 carefully-0.133
fish-0.130
 eagerly-0.130
Tokenas
Token ID292
Feature activation+2.87e-02

Pos logit contributions
 fault+0.047
 arrest+0.045
 earthquake+0.043
 fallen+0.043
 inconvenience+0.041
Neg logit contributions
 quietly-0.025
 curiously-0.024
 carefully-0.023
 eagerly-0.022
fish-0.022
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
 fault+0.065
 arrest+0.060
 fallen+0.056
erved+0.056
 earthquake+0.056
Neg logit contributions
 quietly-0.037
 curiously-0.036
 carefully-0.035
fish-0.033
 eagerly-0.033
Token and
Token ID290
Feature activation+2.16e-02

Pos logit contributions
 fault+0.074
 arrest+0.067
 earthquake+0.065
 inconvenience+0.064
 Kim+0.064
Neg logit contributions
 quietly-0.025
 carefully-0.023
 curiously-0.023
 eagerly-0.021
fish-0.020
Token gir
Token ID37370
Feature activation+8.96e-02

Pos logit contributions
 fault+0.039
 arrest+0.032
erved+0.031
upid+0.031
 earthquake+0.030
Neg logit contributions
 quietly-0.040
 carefully-0.038
 curiously-0.038
Hop-0.036
 mer-0.036
Tokenaff
Token ID2001
Feature activation+3.39e-02

Pos logit contributions
 fault+0.192
 arrest+0.169
 earthquake+0.163
 fallen+0.163
 enough+0.160
Neg logit contributions
 quietly-0.120
 carefully-0.117
 curiously-0.114
fish-0.110
Hop-0.109
Tokenes
Token ID274
Feature activation+3.92e-02

Pos logit contributions
 fault+0.075
 arrest+0.069
 earthquake+0.066
 fallen+0.065
erved+0.065
Neg logit contributions
 quietly-0.045
 curiously-0.043
fish-0.042
 carefully-0.042
 eagerly-0.040
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
 fault+0.074
 arrest+0.071
 enough+0.065
 fallen+0.065
 Kim+0.064
Neg logit contributions
 curiously-0.059
 quietly-0.059
 carefully-0.055
 eagerly-0.054
fish-0.053
Token Anna
Token ID11735
Feature activation-2.27e-02

Pos logit contributions
 fault+0.023
 enough+0.020
 Kim+0.020
 arrest+0.019
 no+0.017
Neg logit contributions
fish-0.031
in-0.029
 carefully-0.029
Hop-0.029
 quietly-0.028
Token and
Token ID290
Feature activation+4.50e-03

Pos logit contributions
 curiously+0.034
fish+0.033
 cheer+0.033
Sure+0.033
 mer+0.032
Neg logit contributions
 fault-0.039
 arrest-0.037
 enough-0.035
 fallen-0.035
 Kim-0.035
Token She
Token ID1375
Feature activation-6.15e-04

Pos logit contributions
 quietly+0.004
fish+0.004
 curiously+0.004
 carefully+0.004
Hop+0.004
Neg logit contributions
 fault-0.004
 arrest-0.004
 Kim-0.004
 fallen-0.004
 enough-0.004
Token put
Token ID1234
Feature activation-5.22e-02

Pos logit contributions
Hop+0.001
fish+0.001
 curiously+0.001
 mer+0.001
 reefs+0.001
Neg logit contributions
 Kim-0.001
 fallen-0.001
 caused-0.001
 enough-0.001
 arrest-0.001
Token the
Token ID262
Feature activation-2.71e-02

Pos logit contributions
 quietly+0.085
 curiously+0.084
fish+0.082
 eagerly+0.078
 carefully+0.078
Neg logit contributions
 fault-0.094
 arrest-0.087
 fallen-0.086
 enough-0.086
 Kim-0.084
Token Band
Token ID10243
Feature activation-5.26e-02

Pos logit contributions
 quietly+0.042
 curiously+0.041
 carefully+0.039
 eagerly+0.038
fish+0.038
Neg logit contributions
 fault-0.053
 arrest-0.050
 fallen-0.047
 enough-0.045
 Kim-0.045
Token-
Token ID12
Feature activation-4.46e-02

Pos logit contributions
 curiously+0.080
 quietly+0.078
 carefully+0.072
 eagerly+0.072
 closely+0.071
Neg logit contributions
 fault-0.101
 arrest-0.097
 fallen-0.093
 Kim-0.089
 enough-0.089
TokenAid
Token ID44245
Feature activation-9.41e-02

Pos logit contributions
 curiously+0.064
 quietly+0.063
 eagerly+0.059
fish+0.058
 carefully+0.058
Neg logit contributions
 fault-0.088
 arrest-0.085
 fallen-0.082
 Kim-0.079
erved-0.078
Token on
Token ID319
Feature activation+3.50e-03

Pos logit contributions
 curiously+0.143
 quietly+0.137
 eagerly+0.131
Hop+0.129
 closely+0.127
Neg logit contributions
 fault-0.177
 fallen-0.164
 arrest-0.161
 enough-0.159
 Kim-0.157
Token Lily
Token ID20037
Feature activation-8.55e-03

Pos logit contributions
 fault+0.007
 arrest+0.007
 fallen+0.006
erved+0.006
 earthquake+0.006
Neg logit contributions
 quietly-0.005
 curiously-0.005
 carefully-0.005
 eagerly-0.005
 gently-0.004
Token's
Token ID338
Feature activation-4.91e-03

Pos logit contributions
 quietly+0.015
 curiously+0.014
 carefully+0.014
 eagerly+0.014
fish+0.013
Neg logit contributions
 fault-0.015
 arrest-0.014
 fallen-0.013
 Kim-0.013
 earthquake-0.013
Token knee
Token ID10329
Feature activation+2.55e-02

Pos logit contributions
 quietly+0.009
 curiously+0.009
fish+0.008
 carefully+0.008
Hop+0.008
Neg logit contributions
 fault-0.008
 arrest-0.008
 fallen-0.007
 Kim-0.007
erved-0.007
Token and
Token ID290
Feature activation-3.59e-03

Pos logit contributions
 fault+0.068
 arrest+0.066
 inconvenience+0.062
 earthquake+0.061
 Entry+0.061
Neg logit contributions
 quietly-0.024
 carefully-0.023
 curiously-0.020
 eagerly-0.020
 gently-0.019
Token Grace
Token ID16156
Feature activation-2.14e-02

Pos logit contributions
 fault+0.015
 arrest+0.014
 enough+0.013
 fallen+0.013
 Kim+0.013
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.011
fish-0.011
 eagerly-0.010
Token bought
Token ID5839
Feature activation-3.16e-02

Pos logit contributions
 quietly+0.029
 curiously+0.029
fish+0.027
 carefully+0.027
 eagerly+0.026
Neg logit contributions
 fault-0.044
 arrest-0.042
 fallen-0.039
 enough-0.039
 Kim-0.038
Token the
Token ID262
Feature activation-3.26e-02

Pos logit contributions
fish+0.061
 quietly+0.059
Hop+0.058
 curiously+0.057
 mer+0.053
Neg logit contributions
 fault-0.046
 enough-0.045
 arrest-0.044
 Kim-0.041
 fallen-0.038
Token thick
Token ID6546
Feature activation-4.98e-02

Pos logit contributions
 quietly+0.053
 curiously+0.052
 eagerly+0.050
Hop+0.048
 carefully+0.048
Neg logit contributions
 fault-0.063
 arrest-0.060
 fallen-0.056
 earthquake-0.054
 inconvenience-0.053
Token f
Token ID277
Feature activation-3.87e-02

Pos logit contributions
 quietly+0.080
 curiously+0.078
 carefully+0.074
Hop+0.073
 eagerly+0.073
Neg logit contributions
 fault-0.089
 arrest-0.088
 fallen-0.084
 earthquake-0.081
 Entry-0.078
Tokenudge
Token ID12587
Feature activation-9.24e-02

Pos logit contributions
 curiously+0.072
 quietly+0.071
 eagerly+0.066
 carefully+0.066
Hop+0.065
Neg logit contributions
 fault-0.065
 arrest-0.055
 Kim-0.053
upid-0.051
 fallen-0.051
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
 curiously+0.129
 quietly+0.127
Hop+0.122
fish+0.115
 eagerly+0.114
Neg logit contributions
 fault-0.182
 arrest-0.169
 enough-0.165
 Kim-0.162
 fallen-0.161
Token the
Token ID262
Feature activation+6.69e-03

Pos logit contributions
 quietly+0.016
 curiously+0.016
fish+0.015
 carefully+0.015
Hop+0.015
Neg logit contributions
 fault-0.024
 arrest-0.023
 fallen-0.021
 Kim-0.021
 enough-0.021
Token two
Token ID734
Feature activation+2.20e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 earthquake+0.011
 Kim+0.011
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.010
 eagerly-0.010
fish-0.010
Token of
Token ID286
Feature activation+2.22e-03

Pos logit contributions
 fault+0.040
 arrest+0.039
 fallen+0.037
 enough+0.036
 Kim+0.035
Neg logit contributions
 quietly-0.035
 curiously-0.034
Hop-0.032
fish-0.032
 carefully-0.031
Token them
Token ID606
Feature activation-3.31e-03

Pos logit contributions
 fault+0.004
 arrest+0.004
 enough+0.004
 Kim+0.004
 fallen+0.004
Neg logit contributions
 quietly-0.003
 curiously-0.003
fish-0.003
Hop-0.003
 carefully-0.003
Token excited
Token ID6568
Feature activation+1.06e-04

Pos logit contributions
Hop+0.008
fish+0.008
 curiously+0.007
 quietly+0.007
 mer+0.007
Neg logit contributions
 fault-0.014
 fallen-0.013
 Kim-0.013
 enough-0.013
 arrest-0.013
Token.
Token ID13
Feature activation-8.70e-03

Pos logit contributions
 fault+0.000
 arrest+0.000
 fallen+0.000
 Kim+0.000
erved+0.000
Neg logit contributions
 quietly-0.000
 curiously-0.000
 carefully-0.000
 eagerly-0.000
fish-0.000
Token She
Token ID1375
Feature activation-6.67e-03

Pos logit contributions
 quietly+0.015
 curiously+0.015
 carefully+0.015
 eagerly+0.014
fish+0.014
Neg logit contributions
 fault-0.015
 arrest-0.014
 fallen-0.013
 Kim-0.013
 earthquake-0.012
Token held
Token ID2714
Feature activation-5.61e-02

Pos logit contributions
fish+0.010
 curiously+0.010
 quietly+0.010
Hop+0.010
 mer+0.010
Neg logit contributions
 arrest-0.011
 fault-0.011
 fallen-0.011
 enough-0.010
 Kim-0.010
Token the
Token ID262
Feature activation-2.75e-02

Pos logit contributions
fish+0.113
Hop+0.107
 mer+0.105
 quietly+0.101
 reefs+0.099
Neg logit contributions
 enough-0.080
 Kim-0.079
 arrest-0.076
 fault-0.073
 fallen-0.069
Token bar
Token ID2318
Feature activation-9.14e-02

Pos logit contributions
 quietly+0.053
 curiously+0.052
fish+0.051
 carefully+0.050
Hop+0.050
Neg logit contributions
 fault-0.043
 arrest-0.039
 fallen-0.037
 enough-0.035
 Kim-0.035
Token of
Token ID286
Feature activation-2.74e-02

Pos logit contributions
Hop+0.134
 mer+0.128
 curiously+0.121
Would+0.117
 reef+0.115
Neg logit contributions
 fault-0.167
 enough-0.164
 arrest-0.162
 Kim-0.155
hy-0.148
Token chocolate
Token ID11311
Feature activation-5.03e-02

Pos logit contributions
 quietly+0.049
 curiously+0.048
fish+0.047
Hop+0.047
 carefully+0.045
Neg logit contributions
 fault-0.045
 arrest-0.043
 Kim-0.040
 enough-0.039
 fallen-0.039
Token and
Token ID290
Feature activation-7.91e-03

Pos logit contributions
Hop+0.069
 mer+0.065
 quietly+0.064
fish+0.063
 curiously+0.063
Neg logit contributions
 fault-0.101
 enough-0.096
 arrest-0.094
 fallen-0.091
erved-0.089
Token skipped
Token ID26684
Feature activation+1.15e-02

Pos logit contributions
 curiously+0.012
 quietly+0.012
fish+0.011
Hop+0.011
 mer+0.010
Neg logit contributions
 arrest-0.015
 fault-0.014
 fallen-0.014
 Kim-0.014
 enough-0.013
Token all
Token ID477
Feature activation+8.18e-04

Pos logit contributions
 arrest+0.021
 fault+0.021
 enough+0.021
 fallen+0.020
 Kim+0.019
Neg logit contributions
fish-0.017
 curiously-0.016
 quietly-0.016
Hop-0.015
 carefully-0.014
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 quietly+0.047
 curiously+0.047
 carefully+0.043
 eagerly+0.043
Hop+0.042
Neg logit contributions
 fault-0.092
 arrest-0.087
 fallen-0.082
 enough-0.081
 Kim-0.079
Token a
Token ID257
Feature activation-4.73e-03

Pos logit contributions
 fault+0.022
 arrest+0.020
 earthquake+0.019
 inconvenience+0.019
 fallen+0.019
Neg logit contributions
 quietly-0.014
 curiously-0.014
 carefully-0.014
 eagerly-0.013
fish-0.013
Token wand
Token ID11569
Feature activation-1.16e-02

Pos logit contributions
 curiously+0.007
 quietly+0.007
 eagerly+0.007
Hop+0.007
 carefully+0.006
Neg logit contributions
 fault-0.009
 fallen-0.008
 arrest-0.008
 enough-0.007
 loser-0.007
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
 quietly+0.016
 curiously+0.015
 carefully+0.015
 eagerly+0.014
fish+0.014
Neg logit contributions
 fault-0.025
 arrest-0.024
 fallen-0.022
 Kim-0.022
 earthquake-0.021
Token She
Token ID1375
Feature activation+6.77e-03

Pos logit contributions
 fault+0.028
 arrest+0.024
 Kim+0.023
 enough+0.023
 fallen+0.022
Neg logit contributions
fish-0.037
 quietly-0.036
 curiously-0.036
 carefully-0.035
Hop-0.035
Token dips
Token ID48904
Feature activation-9.12e-02

Pos logit contributions
 fault+0.011
 arrest+0.010
 fallen+0.009
 enough+0.009
erved+0.009
Neg logit contributions
 curiously-0.012
Hop-0.012
 quietly-0.012
Sure-0.011
fish-0.011
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 curiously+0.159
 quietly+0.155
Hop+0.151
 eagerly+0.148
fish+0.148
Neg logit contributions
 fault-0.144
 enough-0.136
 Kim-0.130
 fallen-0.128
 arrest-0.123
Token wand
Token ID11569
Feature activation-5.55e-02

Pos logit contributions
 curiously+0.033
 quietly+0.033
 eagerly+0.031
 carefully+0.029
Hop+0.028
Neg logit contributions
 fault-0.048
 arrest-0.044
 fallen-0.043
 enough-0.041
 caused-0.040
Token in
Token ID287
Feature activation-3.58e-02

Pos logit contributions
Hop+0.071
 curiously+0.067
Sure+0.065
 quietly+0.063
Would+0.061
Neg logit contributions
 fault-0.110
 enough-0.109
 fallen-0.104
 arrest-0.103
erved-0.099
Token the
Token ID262
Feature activation+9.53e-03

Pos logit contributions
 quietly+0.059
 curiously+0.058
fish+0.055
 carefully+0.055
 eagerly+0.055
Neg logit contributions
 fault-0.065
 arrest-0.061
 fallen-0.056
 Kim-0.055
 enough-0.055
Token soap
Token ID19533
Feature activation-5.10e-02

Pos logit contributions
 fault+0.018
 arrest+0.017
 fallen+0.017
 earthquake+0.016
 enough+0.016
Neg logit contributions
 curiously-0.015
 quietly-0.014
 eagerly-0.014
 carefully-0.013
Hop-0.013
Token make
Token ID787
Feature activation-2.49e-03

Pos logit contributions
 fault+0.016
 arrest+0.015
 enough+0.014
erved+0.014
 fallen+0.014
Neg logit contributions
 quietly-0.010
 carefully-0.010
 curiously-0.009
 eagerly-0.009
 cheer-0.009
Token one
Token ID530
Feature activation-1.01e-02

Pos logit contributions
 quietly+0.004
 curiously+0.004
 carefully+0.004
fish+0.004
Hop+0.004
Neg logit contributions
 fault-0.005
 arrest-0.004
 Kim-0.004
 enough-0.004
 fallen-0.004
Token too
Token ID1165
Feature activation-2.25e-03

Pos logit contributions
Hop+0.017
 curiously+0.016
 quietly+0.015
 eagerly+0.015
 mer+0.014
Neg logit contributions
 fault-0.018
 enough-0.016
 Kim-0.016
 arrest-0.015
 earthquake-0.015
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 quietly+0.003
 curiously+0.003
 carefully+0.003
 eagerly+0.002
 closely+0.002
Neg logit contributions
 fault-0.005
 arrest-0.005
 fallen-0.005
 inconvenience-0.005
 earthquake-0.005
Token He
Token ID679
Feature activation-7.65e-03

Pos logit contributions
 Kim+0.023
 fault+0.023
 enough+0.022
 arrest+0.020
 fallen+0.019
Neg logit contributions
fish-0.025
 curiously-0.024
 mer-0.024
Hop-0.023
 reef-0.022
Token dips
Token ID48904
Feature activation-9.08e-02

Pos logit contributions
 quietly+0.014
 curiously+0.014
fish+0.014
Hop+0.014
 eagerly+0.013
Neg logit contributions
 fault-0.012
 arrest-0.011
 enough-0.010
 fallen-0.009
erved-0.009
Token his
Token ID465
Feature activation-4.90e-03

Pos logit contributions
 curiously+0.170
 quietly+0.165
Hop+0.165
fish+0.158
 eagerly+0.157
Neg logit contributions
 fault-0.133
 enough-0.122
 Kim-0.121
 fallen-0.113
 arrest-0.113
Token brush
Token ID14093
Feature activation-1.58e-02

Pos logit contributions
 quietly+0.007
 curiously+0.007
 carefully+0.007
 eagerly+0.007
fish+0.007
Neg logit contributions
 fault-0.010
 arrest-0.009
 fallen-0.008
 Kim-0.008
 enough-0.008
Token in
Token ID287
Feature activation-4.05e-02

Pos logit contributions
 curiously+0.017
 quietly+0.017
Hop+0.017
fish+0.016
 eagerly+0.015
Neg logit contributions
 fault-0.036
 enough-0.033
 arrest-0.033
 fallen-0.032
erved-0.031
Token the
Token ID262
Feature activation+9.11e-03

Pos logit contributions
 quietly+0.067
 curiously+0.066
 carefully+0.063
fish+0.062
 eagerly+0.062
Neg logit contributions
 fault-0.074
 arrest-0.069
 fallen-0.064
 Kim-0.063
 enough-0.062
Token red
Token ID2266
Feature activation-2.74e-02

Pos logit contributions
 fault+0.019
 arrest+0.018
 Kim+0.016
 fallen+0.016
erved+0.016
Neg logit contributions
 quietly-0.013
 carefully-0.012
 curiously-0.012
 eagerly-0.012
fish-0.011
Token
Token ID198
Feature activation-1.79e-03

Pos logit contributions
 fault+0.003
 arrest+0.003
 Kim+0.003
 fallen+0.003
 earthquake+0.003
Neg logit contributions
fish-0.003
 quietly-0.003
 curiously-0.003
 mer-0.003
 carefully-0.003
Token
Token ID198
Feature activation+4.13e-03

Pos logit contributions
fish+0.003
 mer+0.003
beans+0.003
 curiously+0.002
ista+0.002
Neg logit contributions
 arrest-0.003
 fault-0.003
 Kim-0.003
 fallen-0.003
 caused-0.003
TokenGeorge
Token ID20191
Feature activation-2.83e-02

Pos logit contributions
 arrest+0.007
 fallen+0.007
 earthquake+0.006
 fault+0.006
 Entry+0.006
Neg logit contributions
fish-0.006
 curiously-0.006
 quietly-0.006
 mer-0.006
 eagerly-0.006
Token asked
Token ID1965
Feature activation-3.09e-02

Pos logit contributions
 quietly+0.039
 curiously+0.038
fish+0.037
 carefully+0.036
Hop+0.036
Neg logit contributions
 fault-0.056
 arrest-0.054
 fallen-0.051
 enough-0.050
 Kim-0.049
Token the
Token ID262
Feature activation-1.75e-02

Pos logit contributions
 quietly+0.053
 curiously+0.052
 carefully+0.050
fish+0.050
 eagerly+0.049
Neg logit contributions
 fault-0.054
 arrest-0.050
 Kim-0.046
 fallen-0.046
 enough-0.046
Token bar
Token ID2318
Feature activation-9.06e-02

Pos logit contributions
fish+0.024
 curiously+0.023
 quietly+0.022
Hop+0.022
 carefully+0.022
Neg logit contributions
 fault-0.036
 arrest-0.034
 enough-0.033
 fallen-0.032
 inconvenience-0.032
Tokenber
Token ID527
Feature activation-4.50e-02

Pos logit contributions
Hop+0.132
 curiously+0.125
 mer+0.120
 quietly+0.118
 cheer+0.118
Neg logit contributions
 fault-0.167
 enough-0.157
 Kim-0.150
one-0.144
 arrest-0.143
Token for
Token ID329
Feature activation-1.92e-02

Pos logit contributions
 quietly+0.070
 curiously+0.069
fish+0.068
Hop+0.068
 carefully+0.066
Neg logit contributions
 fault-0.082
 enough-0.073
 arrest-0.072
 Kim-0.071
 fallen-0.066
Token a
Token ID257
Feature activation-1.35e-03

Pos logit contributions
 quietly+0.028
 curiously+0.027
 carefully+0.027
 eagerly+0.025
fish+0.023
Neg logit contributions
 fault-0.041
 arrest-0.037
 fallen-0.035
erved-0.035
 earthquake-0.034
Token new
Token ID649
Feature activation-1.59e-02

Pos logit contributions
 quietly+0.003
 curiously+0.003
fish+0.002
 carefully+0.002
 eagerly+0.002
Neg logit contributions
 fault-0.002
 arrest-0.002
 fallen-0.002
 enough-0.002
 Kim-0.002
Token hairst
Token ID42658
Feature activation-5.04e-02

Pos logit contributions
 quietly+0.029
 curiously+0.029
fish+0.028
 carefully+0.028
Hop+0.028
Neg logit contributions
 fault-0.026
 arrest-0.023
 fallen-0.021
 enough-0.021
 Kim-0.021
Token decided
Token ID3066
Feature activation+1.28e-02

Pos logit contributions
 fault+0.009
 arrest+0.008
 fallen+0.008
 Kim+0.008
 enough+0.008
Neg logit contributions
 quietly-0.007
 curiously-0.007
 carefully-0.007
 eagerly-0.007
fish-0.007
Token to
Token ID284
Feature activation+6.70e-03

Pos logit contributions
 fault+0.029
opl+0.027
 arrest+0.027
 earthquake+0.026
upid+0.026
Neg logit contributions
 quietly-0.018
 carefully-0.017
 curiously-0.016
 eagerly-0.015
 gently-0.015
Token cheat
Token ID22705
Feature activation+1.12e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 Kim+0.011
erved+0.011
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.010
 eagerly-0.010
fish-0.010
Token.
Token ID13
Feature activation-8.59e-03

Pos logit contributions
 fault+0.026
 arrest+0.024
 fallen+0.023
 earthquake+0.022
erved+0.022
Neg logit contributions
 quietly-0.015
 curiously-0.014
 carefully-0.013
 eagerly-0.013
fish-0.012
Token He
Token ID679
Feature activation-1.37e-02

Pos logit contributions
 quietly+0.014
 curiously+0.013
 carefully+0.013
 eagerly+0.012
Hop+0.012
Neg logit contributions
 fault-0.017
 arrest-0.016
 fallen-0.015
 earthquake-0.015
 inconvenience-0.015
Token dipped
Token ID35300
Feature activation-9.02e-02

Pos logit contributions
 quietly+0.022
 curiously+0.021
 carefully+0.020
 eagerly+0.020
fish+0.019
Neg logit contributions
 fault-0.027
 arrest-0.025
 fallen-0.023
 Kim-0.023
 earthquake-0.022
Token his
Token ID465
Feature activation-1.16e-02

Pos logit contributions
 quietly+0.161
 curiously+0.159
 carefully+0.153
 eagerly+0.149
fish+0.149
Neg logit contributions
 fault-0.152
 arrest-0.140
 fallen-0.129
 Kim-0.129
 enough-0.127
Token claws
Token ID28421
Feature activation+1.56e-03

Pos logit contributions
 quietly+0.022
 curiously+0.021
 carefully+0.021
 eagerly+0.020
fish+0.020
Neg logit contributions
 fault-0.019
 arrest-0.018
 fallen-0.016
 Kim-0.016
 enough-0.015
Token in
Token ID287
Feature activation-4.94e-02

Pos logit contributions
 fault+0.004
 arrest+0.004
 Kim+0.003
 fallen+0.003
 earthquake+0.003
Neg logit contributions
 quietly-0.002
 curiously-0.002
 carefully-0.001
 eagerly-0.001
fish-0.001
Token ink
Token ID16882
Feature activation-1.45e-02

Pos logit contributions
 quietly+0.075
 curiously+0.074
 carefully+0.072
 eagerly+0.068
 closely+0.064
Neg logit contributions
 fault-0.100
 arrest-0.092
erved-0.087
 fallen-0.086
 earthquake-0.086
Token and
Token ID290
Feature activation-7.57e-03

Pos logit contributions
 quietly+0.016
 curiously+0.015
 carefully+0.015
 eagerly+0.014
fish+0.014
Neg logit contributions
 fault-0.035
 arrest-0.033
 fallen-0.031
 Kim-0.031
 earthquake-0.030
Token outfit
Token ID16313
Feature activation-1.33e-02

Pos logit contributions
 curiously+0.061
 quietly+0.059
Hop+0.056
 carefully+0.055
 eagerly+0.055
Neg logit contributions
 fault-0.072
 arrest-0.068
 enough-0.064
 fallen-0.062
erved-0.061
Token.
Token ID13
Feature activation+6.16e-03

Pos logit contributions
 quietly+0.018
 curiously+0.017
 carefully+0.017
 eagerly+0.016
fish+0.016
Neg logit contributions
 fault-0.029
 arrest-0.028
 fallen-0.025
 earthquake-0.025
 Kim-0.025
Token She
Token ID1375
Feature activation-3.96e-03

Pos logit contributions
 fault+0.010
 arrest+0.009
 Kim+0.008
 fallen+0.008
 enough+0.008
Neg logit contributions
 quietly-0.012
 curiously-0.011
fish-0.011
 carefully-0.011
 eagerly-0.011
Token put
Token ID1234
Feature activation-6.01e-02

Pos logit contributions
 quietly+0.006
 curiously+0.006
fish+0.006
Hop+0.006
Sure+0.006
Neg logit contributions
 fault-0.006
 arrest-0.006
 fallen-0.006
erved-0.006
 Kim-0.006
Token her
Token ID607
Feature activation-2.84e-02

Pos logit contributions
 curiously+0.113
fish+0.109
 quietly+0.107
Hop+0.103
rig+0.099
Neg logit contributions
 enough-0.088
 Kim-0.081
 fault-0.080
 fallen-0.079
 yours-0.075
Token favourite
Token ID12507
Feature activation-8.92e-02

Pos logit contributions
 quietly+0.042
 curiously+0.042
Hop+0.040
 carefully+0.038
fish+0.037
Neg logit contributions
 fault-0.052
 arrest-0.047
 enough-0.046
 fallen-0.046
erved-0.045
Token necklace
Token ID38987
Feature activation-2.20e-02

Pos logit contributions
 quietly+0.153
 curiously+0.150
Hop+0.147
Would+0.139
 softly+0.135
Neg logit contributions
 fallen-0.121
 fault-0.120
 mine-0.120
 arrest-0.119
 enough-0.119
Token on
Token ID319
Feature activation+7.24e-03

Pos logit contributions
 curiously+0.027
 quietly+0.026
Hop+0.024
fish+0.024
 eagerly+0.023
Neg logit contributions
 fault-0.048
 arrest-0.045
 fallen-0.044
 enough-0.043
 Kim-0.043
Token too
Token ID1165
Feature activation+9.89e-03

Pos logit contributions
 fault+0.013
 arrest+0.013
 fallen+0.012
 enough+0.012
 Kim+0.012
Neg logit contributions
 quietly-0.011
 curiously-0.011
fish-0.011
Hop-0.011
 carefully-0.010
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 fault+0.021
 arrest+0.020
 fallen+0.018
 earthquake+0.018
 inconvenience+0.018
Neg logit contributions
 quietly-0.014
 curiously-0.013
 carefully-0.013
 eagerly-0.013
fish-0.012
Token for
Token ID329
Feature activation+5.75e-03

Pos logit contributions
 fault+0.030
 arrest+0.029
 fallen+0.026
 Kim+0.026
 enough+0.026
Neg logit contributions
 quietly-0.020
 curiously-0.020
 carefully-0.019
fish-0.018
 eagerly-0.018
Token that
Token ID326
Feature activation-3.63e-02

Pos logit contributions
 quietly+0.012
 curiously+0.012
 carefully+0.011
 eagerly+0.011
fish+0.011
Neg logit contributions
 fault-0.017
 arrest-0.016
 fallen-0.015
 Kim-0.015
 earthquake-0.014
Token she
Token ID673
Feature activation-1.71e-02

Pos logit contributions
 curiously+0.053
 quietly+0.052
fish+0.049
Hop+0.049
 carefully+0.049
Neg logit contributions
 fault-0.071
 arrest-0.068
 Kim-0.065
 fallen-0.065
 enough-0.064
Token gave
Token ID2921
Feature activation-3.94e-02

Pos logit contributions
 curiously+0.029
 quietly+0.028
 carefully+0.026
fish+0.026
 closely+0.026
Neg logit contributions
 arrest-0.028
 fault-0.028
 fallen-0.027
 enough-0.025
 Kim-0.025
Token the
Token ID262
Feature activation-3.97e-02

Pos logit contributions
fish+0.074
 curiously+0.073
 quietly+0.073
Hop+0.070
 mer+0.068
Neg logit contributions
 fault-0.059
 arrest-0.055
 enough-0.055
 Kim-0.053
 fallen-0.053
Token w
Token ID266
Feature activation-5.14e-02

Pos logit contributions
 quietly+0.059
 curiously+0.058
fish+0.055
 carefully+0.054
 eagerly+0.053
Neg logit contributions
 fault-0.074
 fallen-0.067
 enough-0.067
 arrest-0.067
 Kim-0.066
Tokenaffle
Token ID30697
Feature activation-8.91e-02

Pos logit contributions
 quietly+0.098
 curiously+0.095
 carefully+0.090
 closer+0.090
 cheer+0.089
Neg logit contributions
 fault-0.081
 arrest-0.067
 earthquake-0.063
erved-0.063
opl-0.063
Token maker
Token ID16009
Feature activation-4.49e-02

Pos logit contributions
 curiously+0.142
 quietly+0.134
Hop+0.133
fish+0.126
 closely+0.124
Neg logit contributions
 fault-0.156
 arrest-0.152
 enough-0.144
 fallen-0.143
erved-0.138
Token a
Token ID257
Feature activation-2.89e-02

Pos logit contributions
 quietly+0.065
 curiously+0.064
 carefully+0.060
fish+0.059
 eagerly+0.058
Neg logit contributions
 fault-0.091
 arrest-0.086
 fallen-0.080
 enough-0.079
 Kim-0.078
Token kiss
Token ID9245
Feature activation-8.31e-03

Pos logit contributions
 curiously+0.047
 quietly+0.046
 closely+0.042
Hop+0.042
 eagerly+0.041
Neg logit contributions
 fault-0.054
 fallen-0.048
 arrest-0.047
 enough-0.045
 Kim-0.043
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
 quietly+0.010
 curiously+0.009
 carefully+0.009
 eagerly+0.009
fish+0.009
Neg logit contributions
 fault-0.019
 arrest-0.018
 earthquake-0.017
 fallen-0.017
 Kim-0.017
Token
Token ID220
Feature activation+1.26e-02

Pos logit contributions
 quietly+0.026
fish+0.026
 curiously+0.026
 carefully+0.025
Hop+0.024
Neg logit contributions
 fault-0.022
 arrest-0.021
 Kim-0.020
 fallen-0.019
 enough-0.018
Token
Token ID198
Feature activation-2.50e-03

Pos logit contributions
fish+0.013
 quietly+0.013
 curiously+0.012
Hop+0.012
 mer+0.012
Neg logit contributions
 fault-0.010
 arrest-0.009
 Kim-0.009
 enough-0.008
 fallen-0.008
Token
Token ID198
Feature activation+3.36e-03

Pos logit contributions
fish+0.004
 mer+0.004
 therapist+0.004
beans+0.004
ista+0.004
Neg logit contributions
 arrest-0.004
 Kim-0.004
 fault-0.004
EEP-0.003
upid-0.003
TokenJack
Token ID14295
Feature activation-2.67e-02

Pos logit contributions
 arrest+0.007
 fault+0.006
 fallen+0.006
 Kim+0.006
 earthquake+0.006
Neg logit contributions
fish-0.004
 quietly-0.004
 curiously-0.004
 eagerly-0.004
 mer-0.004
Token asked
Token ID1965
Feature activation-3.24e-02

Pos logit contributions
 quietly+0.037
 curiously+0.036
fish+0.034
Hop+0.034
Sure+0.033
Neg logit contributions
 fault-0.049
 arrest-0.049
 fallen-0.046
 enough-0.046
erved-0.045
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
fish+0.062
Hop+0.061
 curiously+0.060
 quietly+0.059
 carefully+0.056
Neg logit contributions
 fault-0.043
 enough-0.043
 Kim-0.042
 arrest-0.040
 earthquake-0.036
Token mer
Token ID4017
Feature activation-8.91e-02

Pos logit contributions
 curiously+0.019
rig+0.018
 quietly+0.018
 lov+0.018
 carefully+0.018
Neg logit contributions
 earthquake-0.029
 fallen-0.029
 no-0.029
 enemy-0.029
 appearance-0.028
Tokenmaid
Token ID23151
Feature activation-5.97e-02

Pos logit contributions
 quietly+0.103
Hop+0.099
 carefully+0.098
Sure+0.096
 curiously+0.095
Neg logit contributions
 fault-0.189
 arrest-0.167
 appearance-0.164
 enemy-0.163
erved-0.162
Token why
Token ID1521
Feature activation-6.00e-02

Pos logit contributions
 quietly+0.101
 curiously+0.098
Hop+0.097
 eagerly+0.094
 carefully+0.093
Neg logit contributions
 fault-0.101
 enough-0.090
 Kim-0.085
 arrest-0.082
 fallen-0.077
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
 quietly+0.076
 curiously+0.074
 carefully+0.071
 eagerly+0.068
fish+0.068
Neg logit contributions
 fault-0.135
 arrest-0.128
 fallen-0.119
erved-0.117
 Kim-0.116
Token waves
Token ID9813
Feature activation-6.07e-03

Pos logit contributions
 fault+0.027
 arrest+0.027
 fallen+0.026
 earthquake+0.025
 enough+0.024
Neg logit contributions
 curiously-0.009
 quietly-0.008
 eagerly-0.007
 carefully-0.007
Hop-0.007
Token were
Token ID547
Feature activation+6.15e-03

Pos logit contributions
 quietly+0.009
 curiously+0.009
 carefully+0.009
 eagerly+0.009
fish+0.008
Neg logit contributions
 fault-0.012
 arrest-0.011
 fallen-0.010
 Kim-0.010
 enough-0.010
Token Tim
Token ID5045
Feature activation+1.56e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 Kim+0.010
erved+0.010
Neg logit contributions
 quietly-0.017
 curiously-0.017
 carefully-0.016
 eagerly-0.016
fish-0.016
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 fault+0.038
 arrest+0.038
erved+0.035
 inconvenience+0.035
 Entry+0.035
Neg logit contributions
 quietly-0.017
 carefully-0.016
 gently-0.015
 curiously-0.014
 eagerly-0.014
Token She
Token ID1375
Feature activation+4.22e-03

Pos logit contributions
 fault+0.026
 arrest+0.025
 fallen+0.023
 inconvenience+0.022
 tresp+0.022
Neg logit contributions
 quietly-0.019
 curiously-0.018
 carefully-0.018
 eagerly-0.018
Sure-0.017
Token put
Token ID1234
Feature activation-5.57e-02

Pos logit contributions
 fault+0.006
 fallen+0.006
 arrest+0.006
 enough+0.006
 Kim+0.006
Neg logit contributions
 curiously-0.007
 quietly-0.007
fish-0.007
Hop-0.007
 mer-0.007
Token the
Token ID262
Feature activation-3.33e-02

Pos logit contributions
 curiously+0.093
 quietly+0.091
fish+0.087
 carefully+0.084
 eagerly+0.084
Neg logit contributions
 fault-0.095
 Kim-0.089
 enough-0.089
 arrest-0.089
 fallen-0.087
Token muff
Token ID27563
Feature activation-8.91e-02

Pos logit contributions
 quietly+0.043
 curiously+0.042
 eagerly+0.039
 carefully+0.039
Hop+0.038
Neg logit contributions
 fault-0.074
 arrest-0.069
 fallen-0.067
 enough-0.065
 inconvenience-0.063
Tokenins
Token ID1040
Feature activation-4.82e-02

Pos logit contributions
 curiously+0.112
 quietly+0.097
 eagerly+0.097
 closely+0.092
 mer+0.090
Neg logit contributions
 enough-0.178
 fault-0.175
one-0.170
 done-0.167
 Kim-0.166
Token in
Token ID287
Feature activation-2.39e-02

Pos logit contributions
 curiously+0.070
 quietly+0.063
Hop+0.060
 eagerly+0.058
 closely+0.058
Neg logit contributions
 fault-0.096
 enough-0.092
 arrest-0.089
 fallen-0.088
 Kim-0.085
Token a
Token ID257
Feature activation+4.42e-03

Pos logit contributions
 quietly+0.032
 carefully+0.031
 curiously+0.030
 eagerly+0.029
 gently+0.029
Neg logit contributions
 fault-0.055
 arrest-0.050
 inconvenience-0.048
erved-0.047
 earthquake-0.047
Token box
Token ID3091
Feature activation+2.06e-02

Pos logit contributions
 fault+0.008
 arrest+0.008
 fallen+0.008
 enough+0.007
 Kim+0.007
Neg logit contributions
 curiously-0.007
 quietly-0.006
 eagerly-0.006
Hop-0.006
fish-0.006
Token and
Token ID290
Feature activation+3.54e-04

Pos logit contributions
 fault+0.051
 arrest+0.049
 earthquake+0.046
 Entry+0.045
 inconvenience+0.045
Neg logit contributions
 quietly-0.023
 carefully-0.022
 curiously-0.021
 eagerly-0.020
fish-0.018
TokenThe
Token ID464
Feature activation+2.81e-03

Pos logit contributions
 fault+0.001
 arrest+0.001
 fallen+0.001
 Kim+0.001
 enough+0.001
Neg logit contributions
 quietly-0.001
 curiously-0.001
 carefully-0.001
fish-0.001
 eagerly-0.001
Token store
Token ID3650
Feature activation+2.45e-02

Pos logit contributions
 arrest+0.006
 fault+0.006
 fallen+0.006
 inconvenience+0.005
 enough+0.005
Neg logit contributions
 curiously-0.004
fish-0.003
 quietly-0.003
 carefully-0.003
Hop-0.003
Tokenkeeper
Token ID13884
Feature activation-1.49e-02

Pos logit contributions
 fault+0.056
 arrest+0.053
 fallen+0.051
 Kim+0.051
 enough+0.051
Neg logit contributions
 quietly-0.027
 curiously-0.027
fish-0.025
 carefully-0.025
Hop-0.024
Token put
Token ID1234
Feature activation-5.27e-02

Pos logit contributions
 curiously+0.021
 quietly+0.021
Hop+0.021
fish+0.021
 mer+0.020
Neg logit contributions
 fault-0.026
 arrest-0.026
 enough-0.025
 fallen-0.025
 caused-0.024
Token the
Token ID262
Feature activation-3.14e-02

Pos logit contributions
 quietly+0.090
 curiously+0.089
fish+0.086
 carefully+0.084
 eagerly+0.083
Neg logit contributions
 fault-0.091
 arrest-0.084
 enough-0.080
 fallen-0.080
 Kim-0.080
Token muff
Token ID27563
Feature activation-8.91e-02

Pos logit contributions
 quietly+0.039
 curiously+0.038
 carefully+0.035
fish+0.035
 eagerly+0.035
Neg logit contributions
 fault-0.070
 arrest-0.067
 fallen-0.063
 enough-0.062
 Kim-0.061
Tokenin
Token ID259
Feature activation-7.12e-02

Pos logit contributions
 curiously+0.083
 quietly+0.075
 mer+0.074
 eagerly+0.071
 cheer+0.069
Neg logit contributions
 enough-0.204
 fault-0.202
one-0.195
 done-0.191
 fallen-0.191
Token on
Token ID319
Feature activation+4.44e-03

Pos logit contributions
 quietly+0.089
 curiously+0.088
 carefully+0.082
 eagerly+0.080
fish+0.080
Neg logit contributions
 fault-0.158
 arrest-0.149
 fallen-0.140
 enough-0.138
 Kim-0.137
Token a
Token ID257
Feature activation+2.81e-03

Pos logit contributions
 fault+0.010
 arrest+0.009
 fallen+0.008
 inconvenience+0.008
erved+0.008
Neg logit contributions
 quietly-0.006
 curiously-0.006
 carefully-0.006
 eagerly-0.006
 gently-0.005
Token big
Token ID1263
Feature activation-4.21e-03

Pos logit contributions
 fault+0.005
 arrest+0.005
 fallen+0.005
 earthquake+0.005
 enough+0.005
Neg logit contributions
 quietly-0.004
 curiously-0.004
Hop-0.004
 eagerly-0.004
fish-0.004
Token scale
Token ID5046
Feature activation-2.93e-02

Pos logit contributions
 quietly+0.006
 curiously+0.006
 carefully+0.006
 eagerly+0.005
fish+0.005
Neg logit contributions
 fault-0.009
 arrest-0.008
 fallen-0.008
 Kim-0.008
 enough-0.007
Token the
Token ID262
Feature activation+4.78e-03

Pos logit contributions
fish+0.016
 quietly+0.016
Hop+0.016
 curiously+0.016
 carefully+0.015
Neg logit contributions
 arrest-0.014
 fault-0.014
 Kim-0.013
 Entry-0.012
 fallen-0.012
Token counter
Token ID3753
Feature activation-2.20e-02

Pos logit contributions
 fault+0.008
 arrest+0.008
 fallen+0.008
 enough+0.007
 Entry+0.007
Neg logit contributions
 quietly-0.008
 curiously-0.008
fish-0.008
Hop-0.007
 eagerly-0.007
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
Hop+0.028
fish+0.028
 quietly+0.027
 curiously+0.027
 carefully+0.025
Neg logit contributions
 fault-0.046
 arrest-0.044
 enough-0.042
 Kim-0.041
 fallen-0.040
Token asked
Token ID1965
Feature activation-3.22e-02

Pos logit contributions
fish+0.021
Hop+0.020
 curiously+0.020
 quietly+0.020
 mer+0.019
Neg logit contributions
 arrest-0.025
 Kim-0.024
 enough-0.023
 fault-0.023
 fallen-0.023
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
fish+0.053
 quietly+0.052
 curiously+0.052
Hop+0.051
 carefully+0.048
Neg logit contributions
 fault-0.054
 Kim-0.051
 enough-0.050
 arrest-0.049
 Entry-0.046
Token bar
Token ID2318
Feature activation-8.84e-02

Pos logit contributions
fish+0.023
Hop+0.022
 quietly+0.021
 curiously+0.021
utter+0.020
Neg logit contributions
 fault-0.025
 enough-0.023
 fallen-0.023
 inconvenience-0.022
 arrest-0.022
Tokenista
Token ID12523
Feature activation-7.68e-02

Pos logit contributions
Hop+0.143
 curiously+0.126
 mer+0.124
 cheer+0.120
 Bonnie+0.119
Neg logit contributions
 fault-0.145
 enough-0.141
 Kim-0.137
one-0.134
 loser-0.126
Token if
Token ID611
Feature activation-5.74e-02

Pos logit contributions
Hop+0.126
fish+0.121
 curiously+0.119
 quietly+0.118
 mer+0.112
Neg logit contributions
 enough-0.119
 fault-0.116
 Kim-0.104
one-0.100
 no-0.093
Token they
Token ID484
Feature activation+1.85e-02

Pos logit contributions
 quietly+0.074
 curiously+0.070
 carefully+0.069
 eagerly+0.066
fish+0.063
Neg logit contributions
 fault-0.133
 arrest-0.124
 fallen-0.115
erved-0.114
 earthquake-0.112
Token could
Token ID714
Feature activation+1.51e-02

Pos logit contributions
 fault+0.036
 arrest+0.034
 fallen+0.032
 enough+0.032
 Kim+0.031
Neg logit contributions
 quietly-0.027
 curiously-0.027
Hop-0.024
fish-0.024
 carefully-0.024
Token print
Token ID3601
Feature activation-1.13e-02

Pos logit contributions
 fault+0.029
 arrest+0.027
 fallen+0.025
 Kim+0.025
 enough+0.025
Neg logit contributions
 quietly-0.023
 curiously-0.023
Hop-0.021
fish-0.021
 carefully-0.021
Token and
Token ID290
Feature activation-1.35e-03

Pos logit contributions
 curiously+0.027
 quietly+0.026
 carefully+0.024
Hop+0.024
 eagerly+0.024
Neg logit contributions
 fault-0.053
 arrest-0.050
 fallen-0.047
 enough-0.047
 Kim-0.046
Token some
Token ID617
Feature activation+5.19e-03

Pos logit contributions
 curiously+0.002
 quietly+0.002
Hop+0.002
fish+0.002
 eagerly+0.002
Neg logit contributions
 fault-0.002
 arrest-0.002
 fallen-0.002
 Kim-0.002
 enough-0.002
Token soap
Token ID19533
Feature activation-3.54e-02

Pos logit contributions
 fault+0.009
 fallen+0.009
 enough+0.008
 arrest+0.008
 caused+0.008
Neg logit contributions
 curiously-0.008
Hop-0.008
 quietly-0.008
fish-0.007
 eagerly-0.007
Token.
Token ID13
Feature activation-5.40e-04

Pos logit contributions
 curiously+0.051
 quietly+0.051
Hop+0.047
 carefully+0.047
 eagerly+0.046
Neg logit contributions
 fault-0.073
 arrest-0.067
 fallen-0.064
 enough-0.062
 Kim-0.061
Token Lily
Token ID20037
Feature activation-1.42e-02

Pos logit contributions
 quietly+0.001
fish+0.001
 curiously+0.001
 carefully+0.001
Hop+0.001
Neg logit contributions
 fault-0.001
 arrest-0.001
 Kim-0.001
 fallen-0.001
 enough-0.001
Token dipped
Token ID35300
Feature activation-8.82e-02

Pos logit contributions
 curiously+0.022
fish+0.021
Hop+0.021
 quietly+0.021
 mer+0.020
Neg logit contributions
 fault-0.024
 arrest-0.023
 fallen-0.023
erved-0.022
 caused-0.022
Token a
Token ID257
Feature activation-5.72e-02

Pos logit contributions
 quietly+0.151
 curiously+0.151
fish+0.145
Hop+0.145
 eagerly+0.141
Neg logit contributions
 fault-0.146
 enough-0.133
 Kim-0.131
 fallen-0.128
 arrest-0.126
Token cloth
Token ID16270
Feature activation-3.61e-02

Pos logit contributions
Hop+0.094
 curiously+0.093
 quietly+0.093
 eagerly+0.090
Would+0.088
Neg logit contributions
 fallen-0.091
 fault-0.086
 arrest-0.080
upid-0.077
 enough-0.076
Token in
Token ID287
Feature activation-4.06e-02

Pos logit contributions
 curiously+0.044
 quietly+0.043
Hop+0.043
 eagerly+0.039
fish+0.039
Neg logit contributions
 fault-0.077
 arrest-0.073
 enough-0.072
 fallen-0.072
erved-0.068
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
fish+0.071
 quietly+0.071
Hop+0.071
 curiously+0.069
 eagerly+0.065
Neg logit contributions
 fault-0.067
 arrest-0.062
 enough-0.059
 fallen-0.059
 Kim-0.057
Token water
Token ID1660
Feature activation-2.67e-02

Pos logit contributions
 fault+0.022
 arrest+0.020
 fallen+0.020
 enough+0.019
 earthquake+0.019
Neg logit contributions
 curiously-0.014
 quietly-0.014
Hop-0.013
 eagerly-0.013
fish-0.013
Token mix
Token ID5022
Feature activation-3.75e-02

Pos logit contributions
 quietly+0.005
Hop+0.005
 curiously+0.005
fish+0.005
 eagerly+0.005
Neg logit contributions
 fault-0.007
 arrest-0.007
 fallen-0.007
 earthquake-0.006
 inconvenience-0.006
Token.
Token ID13
Feature activation-5.11e-03

Pos logit contributions
Hop+0.062
 curiously+0.061
 quietly+0.059
fish+0.055
 mer+0.054
Neg logit contributions
 fault-0.067
 enough-0.062
 fallen-0.058
 Kim-0.056
 arrest-0.056
Token They
Token ID1119
Feature activation+1.39e-03

Pos logit contributions
 quietly+0.009
 curiously+0.008
 carefully+0.008
fish+0.008
 eagerly+0.008
Neg logit contributions
 fault-0.009
 arrest-0.009
 Kim-0.008
 fallen-0.008
 earthquake-0.008
Token put
Token ID1234
Feature activation-5.35e-02

Pos logit contributions
 fallen+0.002
 enough+0.002
 fault+0.002
 arrest+0.002
 caused+0.002
Neg logit contributions
Hop-0.003
 curiously-0.002
 quietly-0.002
fish-0.002
 therapist-0.002
Token the
Token ID262
Feature activation-2.81e-02

Pos logit contributions
 curiously+0.107
Hop+0.103
fish+0.102
 quietly+0.101
 mer+0.097
Neg logit contributions
 enough-0.076
 fault-0.075
 Kim-0.068
 fallen-0.067
 arrest-0.061
Token muff
Token ID27563
Feature activation-8.78e-02

Pos logit contributions
 quietly+0.041
 curiously+0.041
Hop+0.039
 eagerly+0.038
fish+0.036
Neg logit contributions
 fault-0.056
 fallen-0.053
 arrest-0.051
 enough-0.049
 inconvenience-0.047
Tokenins
Token ID1040
Feature activation-5.04e-02

Pos logit contributions
 curiously+0.126
Hop+0.111
 quietly+0.109
 eagerly+0.109
 mer+0.109
Neg logit contributions
 enough-0.161
 fault-0.157
 done-0.156
one-0.147
 fallen-0.146
Token in
Token ID287
Feature activation-2.50e-02

Pos logit contributions
 curiously+0.069
Hop+0.062
 quietly+0.062
fish+0.057
 eagerly+0.057
Neg logit contributions
 fault-0.104
 enough-0.100
 fallen-0.096
 arrest-0.094
 Kim-0.091
Token the
Token ID262
Feature activation+2.32e-02

Pos logit contributions
 quietly+0.039
 curiously+0.038
 carefully+0.037
 eagerly+0.035
fish+0.034
Neg logit contributions
 fault-0.050
 arrest-0.047
 fallen-0.043
erved-0.043
 earthquake-0.042
Token oven
Token ID14361
Feature activation+1.59e-02

Pos logit contributions
 fault+0.043
 arrest+0.040
 fallen+0.038
 enough+0.036
 earthquake+0.036
Neg logit contributions
 quietly-0.038
 curiously-0.038
fish-0.036
 carefully-0.035
Hop-0.035
Token and
Token ID290
Feature activation+3.37e-03

Pos logit contributions
 fault+0.037
 arrest+0.035
 fallen+0.032
 Kim+0.032
 earthquake+0.032
Neg logit contributions
 quietly-0.019
 curiously-0.018
 carefully-0.018
 eagerly-0.017
fish-0.016
Token handsome
Token ID22665
Feature activation-3.97e-02

Pos logit contributions
Hop+0.074
fish+0.072
 quietly+0.069
 curiously+0.068
Sure+0.067
Neg logit contributions
 fault-0.079
 fallen-0.069
 enough-0.069
 earthquake-0.069
 arrest-0.067
Token.
Token ID13
Feature activation-4.95e-03

Pos logit contributions
 quietly+0.053
 curiously+0.052
 carefully+0.050
 eagerly+0.048
fish+0.048
Neg logit contributions
 fault-0.086
 arrest-0.081
 fallen-0.075
 Kim-0.074
 earthquake-0.074
Token He
Token ID679
Feature activation-1.29e-02

Pos logit contributions
 quietly+0.008
 curiously+0.008
 carefully+0.008
 eagerly+0.007
 closely+0.007
Neg logit contributions
 fault-0.010
 arrest-0.009
 fallen-0.009
 inconvenience-0.008
 tresp-0.008
Token thanked
Token ID26280
Feature activation-3.07e-02

Pos logit contributions
 curiously+0.019
 quietly+0.019
fish+0.018
Hop+0.018
 mer+0.018
Neg logit contributions
 arrest-0.022
 fault-0.022
 fallen-0.021
 enough-0.021
 Kim-0.020
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
fish+0.059
 curiously+0.059
Hop+0.059
 quietly+0.058
 carefully+0.055
Neg logit contributions
 arrest-0.042
 fault-0.041
 Kim-0.039
 enough-0.038
 earthquake-0.036
Token bar
Token ID2318
Feature activation-8.75e-02

Pos logit contributions
fish+0.020
 curiously+0.020
 quietly+0.019
Hop+0.018
 carefully+0.018
Neg logit contributions
 fault-0.028
 arrest-0.027
 fallen-0.026
 inconvenience-0.025
 enough-0.025
Tokenber
Token ID527
Feature activation-4.92e-02

Pos logit contributions
Hop+0.127
 curiously+0.123
 mer+0.118
 quietly+0.114
fish+0.111
Neg logit contributions
 fault-0.159
 enough-0.152
 arrest-0.151
 Kim-0.144
 loser-0.139
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
fish+0.066
Hop+0.066
 curiously+0.066
 quietly+0.065
 mer+0.063
Neg logit contributions
 fault-0.095
 enough-0.090
 arrest-0.089
 Kim-0.084
erved-0.080
Token ran
Token ID4966
Feature activation+2.90e-02

Pos logit contributions
 curiously+0.014
 quietly+0.014
fish+0.013
Hop+0.013
 carefully+0.013
Neg logit contributions
 fault-0.021
 arrest-0.021
 Kim-0.020
 fallen-0.019
 enough-0.019
Token out
Token ID503
Feature activation-2.76e-02

Pos logit contributions
 fault+0.052
 arrest+0.049
 fallen+0.045
 enough+0.045
 Kim+0.044
Neg logit contributions
 curiously-0.048
 quietly-0.048
fish-0.046
 carefully-0.045
 eagerly-0.044
Token of
Token ID286
Feature activation-1.11e-03

Pos logit contributions
Hop+0.051
fish+0.050
 quietly+0.049
 curiously+0.049
Sure+0.045
Neg logit contributions
 fault-0.042
 arrest-0.041
 Kim-0.038
 enough-0.037
 fallen-0.036
TokenSc
Token ID3351
Feature activation+2.55e-02

Pos logit contributions
 curiously+0.011
 quietly+0.010
 eagerly+0.010
fish+0.010
 carefully+0.009
Neg logit contributions
 arrest-0.013
 fault-0.012
 fallen-0.012
 Kim-0.012
 caused-0.011
Tokenamp
Token ID696
Feature activation-1.37e-02

Pos logit contributions
 fault+0.050
 arrest+0.047
 fallen+0.043
 Kim+0.042
erved+0.042
Neg logit contributions
 quietly-0.040
 curiously-0.039
 carefully-0.037
 eagerly-0.036
fish-0.036
Tokeni
Token ID72
Feature activation-2.62e-02

Pos logit contributions
 quietly+0.016
 curiously+0.016
 carefully+0.015
 eagerly+0.014
 cheer+0.014
Neg logit contributions
 fault-0.030
 arrest-0.029
 fallen-0.027
erved-0.027
 enough-0.027
Token took
Token ID1718
Feature activation-1.20e-02

Pos logit contributions
 curiously+0.033
 quietly+0.033
Hop+0.031
fish+0.031
 mer+0.030
Neg logit contributions
 fault-0.050
 arrest-0.049
 fallen-0.047
 enough-0.046
erved-0.046
Token the
Token ID262
Feature activation-2.52e-02

Pos logit contributions
 quietly+0.020
 curiously+0.019
 carefully+0.018
 eagerly+0.018
fish+0.017
Neg logit contributions
 fault-0.023
 arrest-0.021
 fallen-0.019
 Kim-0.019
 earthquake-0.019
Token muff
Token ID27563
Feature activation-8.71e-02

Pos logit contributions
 quietly+0.036
 curiously+0.033
 eagerly+0.031
 carefully+0.031
Hop+0.031
Neg logit contributions
 fault-0.052
 arrest-0.050
 fallen-0.049
 enough-0.046
 inconvenience-0.046
Tokenin
Token ID259
Feature activation-7.19e-02

Pos logit contributions
 curiously+0.093
 quietly+0.084
 eagerly+0.080
 closely+0.077
 mer+0.075
Neg logit contributions
 enough-0.190
 fault-0.186
one-0.184
 arrest-0.177
 fallen-0.175
Token and
Token ID290
Feature activation-1.57e-02

Pos logit contributions
 curiously+0.115
 quietly+0.112
fish+0.106
Hop+0.105
 mer+0.104
Neg logit contributions
 arrest-0.118
 fault-0.117
 enough-0.117
 fallen-0.106
 Kim-0.105
Token hopped
Token ID45230
Feature activation-3.83e-03

Pos logit contributions
 curiously+0.024
 quietly+0.023
fish+0.022
Hop+0.022
 closely+0.021
Neg logit contributions
 arrest-0.028
 fault-0.028
 fallen-0.027
 enough-0.027
 Kim-0.026
Token away
Token ID1497
Feature activation+2.38e-02

Pos logit contributions
fish+0.006
 curiously+0.006
 quietly+0.006
Hop+0.005
in+0.005
Neg logit contributions
 enough-0.007
 fault-0.007
 arrest-0.007
 fallen-0.006
 Kim-0.006
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
 fault+0.050
 arrest+0.047
 fallen+0.044
 Kim+0.043
 enough+0.043
Neg logit contributions
 quietly-0.033
 curiously-0.032
 carefully-0.030
 eagerly-0.029
fish-0.029
Token to
Token ID284
Feature activation-1.97e-03

Pos logit contributions
 fault+0.023
 arrest+0.023
 enough+0.021
 fallen+0.020
 Kim+0.020
Neg logit contributions
 curiously-0.035
fish-0.035
 quietly-0.033
Hop-0.032
 eagerly-0.032
Token the
Token ID262
Feature activation+3.39e-03

Pos logit contributions
fish+0.004
Hop+0.004
beans+0.004
in+0.004
 therapist+0.004
Neg logit contributions
 arrest-0.002
 Kim-0.002
 fault-0.002
 enough-0.002
 fallen-0.002
Token stream
Token ID4269
Feature activation-1.45e-03

Pos logit contributions
 fallen+0.006
 arrest+0.006
 fault+0.005
 earthquake+0.005
 enough+0.005
Neg logit contributions
 curiously-0.005
 eagerly-0.005
 quietly-0.005
 carefully-0.005
 closely-0.005
Token.
Token ID13
Feature activation+2.36e-02

Pos logit contributions
 quietly+0.002
 curiously+0.002
 carefully+0.002
 eagerly+0.002
fish+0.002
Neg logit contributions
 fault-0.003
 arrest-0.003
 fallen-0.003
 enough-0.003
 Kim-0.003
Token She
Token ID1375
Feature activation+6.62e-03

Pos logit contributions
 fault+0.037
 Kim+0.034
 arrest+0.032
 fallen+0.030
 enough+0.030
Neg logit contributions
fish-0.044
 curiously-0.043
Hop-0.042
 quietly-0.042
 carefully-0.041
Token dips
Token ID48904
Feature activation-8.60e-02

Pos logit contributions
 fault+0.009
 arrest+0.009
 fallen+0.008
 enough+0.008
erved+0.007
Neg logit contributions
 curiously-0.013
Hop-0.013
 quietly-0.012
fish-0.012
Sure-0.012
Token her
Token ID607
Feature activation-1.88e-02

Pos logit contributions
 curiously+0.155
 quietly+0.149
Hop+0.145
 eagerly+0.143
fish+0.141
Neg logit contributions
 fault-0.133
 enough-0.126
 Kim-0.122
 fallen-0.121
 arrest-0.115
Token hand
Token ID1021
Feature activation-1.58e-03

Pos logit contributions
 curiously+0.027
 quietly+0.026
Hop+0.025
 eagerly+0.024
Sure+0.024
Neg logit contributions
 fault-0.039
 fallen-0.035
 arrest-0.035
 enough-0.033
erved-0.033
Token in
Token ID287
Feature activation-3.60e-02

Pos logit contributions
Hop+0.002
 curiously+0.002
fish+0.002
Sure+0.002
 quietly+0.002
Neg logit contributions
 fallen-0.003
 fault-0.003
 enough-0.003
erved-0.003
 arrest-0.003
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
 quietly+0.060
 curiously+0.059
fish+0.056
 carefully+0.056
 eagerly+0.055
Neg logit contributions
 fault-0.065
 arrest-0.061
 fallen-0.057
 Kim-0.056
 enough-0.055
Token water
Token ID1660
Feature activation-1.90e-02

Pos logit contributions
 fault+0.024
 arrest+0.022
 fallen+0.022
 enough+0.021
 earthquake+0.020
Neg logit contributions
 curiously-0.015
 quietly-0.015
 eagerly-0.014
 carefully-0.014
Hop-0.013
Token its
Token ID663
Feature activation+4.61e-02

Pos logit contributions
 fault+0.069
 arrest+0.063
 inconvenience+0.060
 earthquake+0.056
upid+0.056
Neg logit contributions
 quietly-0.052
 curiously-0.050
 carefully-0.048
 eagerly-0.047
 happily-0.045
Token tail
Token ID7894
Feature activation+1.59e-02

Pos logit contributions
 fault+0.078
 arrest+0.072
 inconvenience+0.070
 earthquake+0.063
 Entry+0.063
Neg logit contributions
 quietly-0.086
 curiously-0.085
 carefully-0.083
 eagerly-0.082
 happily-0.080
Token.
Token ID13
Feature activation-6.38e-03

Pos logit contributions
 fault+0.035
 arrest+0.033
 Kim+0.031
 fallen+0.030
 earthquake+0.030
Neg logit contributions
 quietly-0.021
 curiously-0.021
 carefully-0.020
 eagerly-0.019
fish-0.018
Token Lily
Token ID20037
Feature activation-1.78e-02

Pos logit contributions
 quietly+0.010
 curiously+0.009
 carefully+0.009
 eagerly+0.009
 closely+0.008
Neg logit contributions
 fault-0.013
 arrest-0.013
 fallen-0.011
 tresp-0.011
 earthquake-0.011
Token counted
Token ID14789
Feature activation-1.95e-02

Pos logit contributions
 quietly+0.026
 curiously+0.026
fish+0.024
 carefully+0.024
Hop+0.023
Neg logit contributions
 fault-0.033
 arrest-0.033
 fallen-0.030
 enough-0.029
erved-0.029
Token how
Token ID703
Feature activation-8.59e-02

Pos logit contributions
 quietly+0.033
 curiously+0.032
fish+0.031
Hop+0.031
 carefully+0.031
Neg logit contributions
 fault-0.034
 arrest-0.031
 Kim-0.030
 enough-0.030
 fallen-0.029
Token many
Token ID867
Feature activation-2.04e-03

Pos logit contributions
 quietly+0.120
 curiously+0.116
 carefully+0.113
 eagerly+0.108
fish+0.106
Neg logit contributions
 fault-0.184
 arrest-0.172
 fallen-0.158
 earthquake-0.156
 Kim-0.155
Token spots
Token ID10222
Feature activation-2.77e-02

Pos logit contributions
 quietly+0.004
 curiously+0.004
 carefully+0.004
 eagerly+0.004
fish+0.004
Neg logit contributions
 fault-0.003
 arrest-0.003
 fallen-0.003
 Kim-0.002
 enough-0.002
Token the
Token ID262
Feature activation-4.27e-03

Pos logit contributions
 curiously+0.039
 quietly+0.038
Hop+0.035
 carefully+0.035
fish+0.035
Neg logit contributions
 fault-0.055
 arrest-0.052
 fallen-0.051
 enough-0.049
 Kim-0.048
Token dog
Token ID3290
Feature activation-1.08e-02

Pos logit contributions
 quietly+0.006
 carefully+0.006
 curiously+0.005
Hop+0.005
 eagerly+0.005
Neg logit contributions
 fault-0.009
 Kim-0.008
 arrest-0.008
upid-0.008
 enough-0.008
Token had
Token ID550
Feature activation+3.37e-02

Pos logit contributions
 curiously+0.016
 quietly+0.015
fish+0.015
Hop+0.014
 mer+0.014
Neg logit contributions
 fault-0.020
 arrest-0.019
 fallen-0.019
 enough-0.018
 Kim-0.018
Token and
Token ID290
Feature activation-2.87e-03

Pos logit contributions
 curiously+0.045
 quietly+0.044
Hop+0.043
 eagerly+0.041
Sure+0.040
Neg logit contributions
 fault-0.067
 arrest-0.066
 enough-0.061
 fallen-0.059
 Kim-0.058
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 quietly+0.005
 curiously+0.005
 carefully+0.004
 eagerly+0.004
fish+0.004
Neg logit contributions
 fault-0.005
 arrest-0.005
 fallen-0.005
 Kim-0.005
 earthquake-0.004
Token brush
Token ID14093
Feature activation+3.79e-03

Pos logit contributions
 quietly+0.020
 curiously+0.020
Hop+0.019
 eagerly+0.018
 carefully+0.018
Neg logit contributions
 fault-0.023
 arrest-0.021
 fallen-0.021
 Kim-0.019
 Entry-0.019
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
 fault+0.009
 arrest+0.008
 fallen+0.008
 Kim+0.008
 earthquake+0.008
Neg logit contributions
 quietly-0.004
 curiously-0.004
 carefully-0.004
 eagerly-0.004
fish-0.004
Token She
Token ID1375
Feature activation+5.73e-03

Pos logit contributions
 fault+0.030
 arrest+0.029
 fallen+0.026
 earthquake+0.025
 enough+0.025
Neg logit contributions
 quietly-0.031
 curiously-0.030
 carefully-0.029
 eagerly-0.029
fish-0.028
Token dips
Token ID48904
Feature activation-8.58e-02

Pos logit contributions
 fault+0.009
 enough+0.008
 arrest+0.008
 fallen+0.007
erved+0.007
Neg logit contributions
Hop-0.011
 curiously-0.011
 quietly-0.010
Sure-0.010
fish-0.010
Token the
Token ID262
Feature activation-1.95e-02

Pos logit contributions
 curiously+0.157
 quietly+0.153
Hop+0.150
 eagerly+0.145
fish+0.144
Neg logit contributions
 fault-0.129
 enough-0.122
 Kim-0.118
 fallen-0.113
 arrest-0.108
Token brush
Token ID14093
Feature activation-1.84e-02

Pos logit contributions
 quietly+0.031
 curiously+0.030
 carefully+0.029
 eagerly+0.028
fish+0.028
Neg logit contributions
 fault-0.037
 arrest-0.035
 fallen-0.032
 Kim-0.032
 earthquake-0.031
Token in
Token ID287
Feature activation-3.50e-02

Pos logit contributions
Hop+0.022
 curiously+0.022
 quietly+0.021
Sure+0.020
fish+0.020
Neg logit contributions
 fault-0.040
 enough-0.037
 arrest-0.037
 fallen-0.036
erved-0.035
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
 quietly+0.058
 curiously+0.057
fish+0.054
 carefully+0.054
 eagerly+0.053
Neg logit contributions
 fault-0.064
 arrest-0.060
 fallen-0.055
 Kim-0.055
 enough-0.054
Token water
Token ID1660
Feature activation-2.26e-02

Pos logit contributions
 fault+0.021
 arrest+0.020
 fallen+0.019
 earthquake+0.018
 Kim+0.018
Neg logit contributions
 quietly-0.018
 curiously-0.018
 carefully-0.017
 eagerly-0.016
fish-0.016
Token his
Token ID465
Feature activation+1.22e-02

Pos logit contributions
 quietly+0.047
 carefully+0.045
 curiously+0.045
 eagerly+0.043
 gently+0.040
Neg logit contributions
 fault-0.060
 arrest-0.055
 inconvenience-0.052
erved-0.050
 Entry-0.050
Token friends
Token ID2460
Feature activation+2.79e-02

Pos logit contributions
 fault+0.024
 arrest+0.022
 fallen+0.020
 Kim+0.020
 earthquake+0.020
Neg logit contributions
 quietly-0.019
 curiously-0.019
 carefully-0.018
Hop-0.018
 eagerly-0.017
Token and
Token ID290
Feature activation+2.35e-03

Pos logit contributions
 fault+0.059
 arrest+0.055
 fallen+0.051
 Kim+0.051
 enough+0.050
Neg logit contributions
 quietly-0.038
 curiously-0.038
 carefully-0.036
 eagerly-0.035
fish-0.035
Token did
Token ID750
Feature activation+2.01e-02

Pos logit contributions
 fault+0.005
 arrest+0.005
 fallen+0.004
erved+0.004
 earthquake+0.004
Neg logit contributions
 quietly-0.003
 carefully-0.003
 curiously-0.003
 eagerly-0.003
fish-0.003
Token not
Token ID407
Feature activation+6.93e-02

Pos logit contributions
 fault+0.038
 arrest+0.037
 fallen+0.034
 earthquake+0.033
 inconvenience+0.033
Neg logit contributions
 quietly-0.034
 carefully-0.034
 curiously-0.033
 eagerly-0.031
 closely-0.031
Token cry
Token ID3960
Feature activation+6.28e-02

Pos logit contributions
 fault+0.133
 arrest+0.123
erved+0.112
 fallen+0.112
 Kim+0.111
Neg logit contributions
 quietly-0.113
 curiously-0.109
 carefully-0.108
 eagerly-0.103
fish-0.101
Token when
Token ID618
Feature activation-1.03e-02

Pos logit contributions
 fault+0.109
 arrest+0.108
 fallen+0.097
erved+0.094
 inconvenience+0.094
Neg logit contributions
 quietly-0.113
 curiously-0.112
 carefully-0.107
 eagerly-0.102
 closely-0.101
Token he
Token ID339
Feature activation+2.68e-02

Pos logit contributions
 quietly+0.012
 curiously+0.012
 carefully+0.012
 eagerly+0.011
fish+0.010
Neg logit contributions
 fault-0.025
 arrest-0.023
erved-0.022
upid-0.021
 Entry-0.021
Token did
Token ID750
Feature activation+2.70e-02

Pos logit contributions
 fault+0.051
 arrest+0.047
 Kim+0.043
 fallen+0.043
 earthquake+0.043
Neg logit contributions
 quietly-0.044
 curiously-0.042
 carefully-0.041
 eagerly-0.040
fish-0.039
Token not
Token ID407
Feature activation+5.93e-02

Pos logit contributions
 fault+0.054
 arrest+0.052
erved+0.048
 Kim+0.047
 fallen+0.047
Neg logit contributions
 quietly-0.042
 carefully-0.040
 curiously-0.039
 eagerly-0.038
 closely-0.036
Token get
Token ID651
Feature activation+1.66e-02

Pos logit contributions
 fault+0.114
 arrest+0.107
 fallen+0.098
 Kim+0.097
 enough+0.096
Neg logit contributions
 quietly-0.094
 curiously-0.092
 carefully-0.088
 eagerly-0.086
fish-0.085
TokenThen
Token ID6423
Feature activation-1.59e-02

Pos logit contributions
 curiously+0.010
fish+0.009
 quietly+0.009
 eagerly+0.009
 closely+0.009
Neg logit contributions
 arrest-0.010
 Kim-0.010
 fault-0.009
 fallen-0.009
 inconvenience-0.008
Token,
Token ID11
Feature activation+3.38e-03

Pos logit contributions
 quietly+0.019
 carefully+0.017
 curiously+0.017
 eagerly+0.017
 gently+0.015
Neg logit contributions
 fault-0.039
 arrest-0.036
opl-0.034
erved-0.034
 fallen-0.033
Token Tom
Token ID4186
Feature activation+1.01e-02

Pos logit contributions
 fault+0.007
 arrest+0.006
 fallen+0.006
erved+0.005
 earthquake+0.005
Neg logit contributions
 quietly-0.005
 curiously-0.005
 carefully-0.005
 eagerly-0.005
fish-0.005
Token and
Token ID290
Feature activation+6.23e-03

Pos logit contributions
 fault+0.023
 arrest+0.021
 Kim+0.020
 earthquake+0.020
 fallen+0.020
Neg logit contributions
 quietly-0.013
 curiously-0.013
 carefully-0.012
 eagerly-0.012
fish-0.011
Token Anna
Token ID11735
Feature activation-6.81e-03

Pos logit contributions
 fault+0.013
 arrest+0.012
erved+0.011
 fallen+0.011
opl+0.011
Neg logit contributions
 quietly-0.010
 carefully-0.009
 curiously-0.009
 eagerly-0.009
 gently-0.008
Token heard
Token ID2982
Feature activation+6.36e-02

Pos logit contributions
 quietly+0.012
 curiously+0.012
Hop+0.011
fish+0.011
 carefully+0.011
Neg logit contributions
 fault-0.012
 arrest-0.011
 fallen-0.010
 enough-0.010
 Kim-0.010
Token a
Token ID257
Feature activation-1.43e-03

Pos logit contributions
 fault+0.116
 arrest+0.108
 Kim+0.099
 fallen+0.098
 enough+0.097
Neg logit contributions
 quietly-0.106
 curiously-0.104
 carefully-0.100
fish-0.099
 eagerly-0.098
Token knock
Token ID10643
Feature activation+1.46e-02

Pos logit contributions
Hop+0.003
 quietly+0.003
 eagerly+0.003
 curiously+0.003
fish+0.003
Neg logit contributions
 fault-0.002
 arrest-0.002
 earthquake-0.002
 fallen-0.002
 Kim-0.002
Token on
Token ID319
Feature activation-5.45e-03

Pos logit contributions
 fault+0.034
 arrest+0.032
 fallen+0.029
 Kim+0.029
erved+0.029
Neg logit contributions
 quietly-0.019
 curiously-0.017
 carefully-0.017
 eagerly-0.016
fish-0.016
Token their
Token ID511
Feature activation+8.44e-03

Pos logit contributions
 curiously+0.010
 quietly+0.010
fish+0.009
Hop+0.009
 carefully+0.009
Neg logit contributions
 fault-0.009
 arrest-0.009
 Kim-0.008
 enough-0.008
 fallen-0.007
Token doors
Token ID8215
Feature activation+4.06e-02

Pos logit contributions
 fault+0.014
 arrest+0.013
 Kim+0.012
 fallen+0.011
 earthquake+0.011
Neg logit contributions
Hop-0.016
 quietly-0.016
fish-0.016
 curiously-0.015
 eagerly-0.015
Token some
Token ID617
Feature activation+5.22e-03

Pos logit contributions
 fault+0.005
 arrest+0.005
 fallen+0.005
 Kim+0.005
 earthquake+0.005
Neg logit contributions
 quietly-0.003
 curiously-0.003
 carefully-0.003
 eagerly-0.003
fish-0.003
Token more
Token ID517
Feature activation+1.15e-02

Pos logit contributions
 fault+0.009
 arrest+0.008
 fallen+0.008
 enough+0.008
 earthquake+0.007
Neg logit contributions
 curiously-0.009
 quietly-0.009
 eagerly-0.009
Hop-0.009
fish-0.008
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 fault+0.022
 arrest+0.020
 fallen+0.019
 enough+0.018
 Kim+0.018
Neg logit contributions
 curiously-0.019
 quietly-0.018
 carefully-0.017
 eagerly-0.017
Hop-0.017
Token They
Token ID1119
Feature activation+1.68e-02

Pos logit contributions
 fault+0.030
 arrest+0.027
 Kim+0.026
 enough+0.024
 fallen+0.024
Neg logit contributions
 curiously-0.040
fish-0.039
 quietly-0.038
 carefully-0.037
 eagerly-0.036
Token do
Token ID466
Feature activation-3.74e-02

Pos logit contributions
 fault+0.027
 arrest+0.026
 fallen+0.023
 Kim+0.023
 enough+0.022
Neg logit contributions
 curiously-0.031
 quietly-0.030
fish-0.029
 eagerly-0.028
Hop-0.028
Token not
Token ID407
Feature activation+6.48e-02

Pos logit contributions
 quietly+0.081
 curiously+0.080
 carefully+0.078
 eagerly+0.076
fish+0.076
Neg logit contributions
 fault-0.050
 arrest-0.046
 fallen-0.040
 Kim-0.039
 enough-0.038
Token lose
Token ID4425
Feature activation+1.95e-02

Pos logit contributions
 fault+0.126
 arrest+0.117
 fallen+0.107
 Kim+0.106
erved+0.106
Neg logit contributions
 quietly-0.102
 curiously-0.098
 carefully-0.096
 eagerly-0.093
fish-0.092
Token her
Token ID607
Feature activation+6.27e-03

Pos logit contributions
 fault+0.039
 arrest+0.036
 fallen+0.034
erved+0.033
 Kim+0.033
Neg logit contributions
 quietly-0.030
 curiously-0.029
 carefully-0.028
 eagerly-0.027
fish-0.026
Token again
Token ID757
Feature activation+2.98e-02

Pos logit contributions
 fault+0.011
 arrest+0.010
 fallen+0.009
 Kim+0.009
 enough+0.009
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.011
 eagerly-0.010
fish-0.010
Token.
Token ID13
Feature activation+2.05e-02

Pos logit contributions
 fault+0.071
 arrest+0.067
erved+0.062
 fallen+0.062
 inconvenience+0.062
Neg logit contributions
 quietly-0.036
 curiously-0.033
 carefully-0.033
 eagerly-0.031
fish-0.030
Token They
Token ID1119
Feature activation+1.70e-02

Pos logit contributions
 fault+0.035
 arrest+0.032
 Kim+0.030
 fallen+0.029
 enough+0.028
Neg logit contributions
 curiously-0.037
 quietly-0.036
fish-0.036
 carefully-0.035
 eagerly-0.034
Token learned
Token ID4499
Feature activation+4.47e-02

Pos logit contributions
 fault+0.008
 arrest+0.008
 fallen+0.007
 enough+0.007
 Kim+0.007
Neg logit contributions
 curiously-0.006
fish-0.006
 quietly-0.006
Hop-0.006
 closely-0.006
Token their
Token ID511
Feature activation+2.03e-02

Pos logit contributions
 fault+0.089
 arrest+0.084
 fallen+0.077
 Kim+0.076
 earthquake+0.075
Neg logit contributions
 quietly-0.069
 curiously-0.067
 carefully-0.065
 eagerly-0.062
fish-0.062
Token lesson
Token ID11483
Feature activation+2.37e-02

Pos logit contributions
 fault+0.035
 arrest+0.030
 fallen+0.028
 enough+0.027
 earthquake+0.027
Neg logit contributions
 quietly-0.038
 curiously-0.038
fish-0.036
 eagerly-0.035
 carefully-0.035
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
 fault+0.050
 arrest+0.047
 fallen+0.044
 Kim+0.044
 enough+0.043
Neg logit contributions
 quietly-0.032
 curiously-0.032
 carefully-0.030
 eagerly-0.029
fish-0.029
Token They
Token ID1119
Feature activation+1.12e-02

Pos logit contributions
 fault+0.035
 arrest+0.033
 fallen+0.030
 earthquake+0.029
 Kim+0.029
Neg logit contributions
 quietly-0.030
 curiously-0.029
 carefully-0.028
 eagerly-0.027
fish-0.026
Token never
Token ID1239
Feature activation+6.33e-02

Pos logit contributions
 fault+0.020
 arrest+0.018
 fallen+0.017
 enough+0.017
 Kim+0.017
Neg logit contributions
 curiously-0.019
 quietly-0.019
fish-0.018
 carefully-0.017
 eagerly-0.017
Token went
Token ID1816
Feature activation+4.49e-02

Pos logit contributions
 fault+0.131
 arrest+0.120
opl+0.110
 Entry+0.110
 Kim+0.108
Neg logit contributions
 quietly-0.097
 carefully-0.092
 curiously-0.084
 gently-0.084
 eagerly-0.084
Token near
Token ID1474
Feature activation-1.39e-02

Pos logit contributions
 fault+0.108
 arrest+0.102
 fallen+0.094
 Kim+0.094
erved+0.093
Neg logit contributions
 quietly-0.052
 curiously-0.049
 carefully-0.047
 eagerly-0.045
fish-0.043
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
 quietly+0.023
 curiously+0.022
 carefully+0.021
 eagerly+0.021
fish+0.021
Neg logit contributions
 fault-0.026
 arrest-0.025
 fallen-0.023
 Kim-0.022
 enough-0.022
Token road
Token ID2975
Feature activation+1.88e-02

Pos logit contributions
 fault+0.031
 arrest+0.029
 Kim+0.026
 fallen+0.026
erved+0.026
Neg logit contributions
 quietly-0.027
 curiously-0.026
 carefully-0.025
fish-0.024
 eagerly-0.024
Token again
Token ID757
Feature activation+2.82e-02

Pos logit contributions
 fault+0.042
 arrest+0.040
 fallen+0.037
 Kim+0.037
erved+0.037
Neg logit contributions
 quietly-0.024
 curiously-0.023
 carefully-0.022
 eagerly-0.021
fish-0.021
Token there
Token ID612
Feature activation+6.73e-03

Pos logit contributions
 fault+0.031
 Kim+0.029
 fallen+0.028
 enough+0.028
 arrest+0.028
Neg logit contributions
 curiously-0.027
 quietly-0.026
 carefully-0.025
 eagerly-0.025
fish-0.025
Token was
Token ID373
Feature activation+2.33e-02

Pos logit contributions
 fault+0.014
 arrest+0.013
 fallen+0.013
 Kim+0.013
 enough+0.012
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.009
fish-0.008
Hop-0.008
Token a
Token ID257
Feature activation+1.33e-03

Pos logit contributions
 fault+0.037
 arrest+0.035
 fallen+0.032
 enough+0.032
 Kim+0.031
Neg logit contributions
 quietly-0.043
 curiously-0.043
fish-0.042
 carefully-0.041
 eagerly-0.040
Token little
Token ID1310
Feature activation-6.24e-03

Pos logit contributions
 earthquake+0.002
 arrest+0.002
 fallen+0.002
 Kim+0.002
 fault+0.002
Neg logit contributions
 curiously-0.003
 eagerly-0.002
 quietly-0.002
 closely-0.002
 carefully-0.002
Token dog
Token ID3290
Feature activation+2.37e-02

Pos logit contributions
 curiously+0.010
 eagerly+0.010
 quietly+0.009
 carefully+0.009
 closely+0.009
Neg logit contributions
 earthquake-0.011
 arrest-0.011
 fault-0.011
 fallen-0.010
 Kim-0.010
Token named
Token ID3706
Feature activation+7.21e-02

Pos logit contributions
 arrest+0.034
 fault+0.034
 Kim+0.031
 fallen+0.031
 earthquake+0.029
Neg logit contributions
 curiously-0.046
 quietly-0.045
 eagerly-0.043
 carefully-0.043
fish-0.041
Token Spot
Token ID15899
Feature activation-9.66e-03

Pos logit contributions
 fault+0.136
 arrest+0.122
 inconvenience+0.111
 earthquake+0.110
 fallen+0.110
Neg logit contributions
 quietly-0.124
 curiously-0.119
 carefully-0.118
Hop-0.113
 eagerly-0.110
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 quietly+0.013
 curiously+0.013
 carefully+0.012
 eagerly+0.012
fish+0.012
Neg logit contributions
 fault-0.021
 arrest-0.020
 fallen-0.018
 Kim-0.018
 earthquake-0.018
Token Spot
Token ID15899
Feature activation-1.00e-02

Pos logit contributions
 fault+0.047
 arrest+0.043
 earthquake+0.039
 fallen+0.039
 inconvenience+0.039
Neg logit contributions
 quietly-0.038
 curiously-0.036
Hop-0.035
 carefully-0.035
 eagerly-0.034
Token loved
Token ID6151
Feature activation-2.57e-02

Pos logit contributions
 curiously+0.015
 quietly+0.014
 carefully+0.013
 eagerly+0.013
fish+0.013
Neg logit contributions
 fault-0.020
 arrest-0.019
 fallen-0.018
 Kim-0.018
erved-0.017
Token to
Token ID284
Feature activation+2.77e-02

Pos logit contributions
 quietly+0.042
 carefully+0.040
 curiously+0.039
 eagerly+0.038
fish+0.037
Neg logit contributions
 fault-0.049
 arrest-0.044
 fallen-0.040
 earthquake-0.040
 inconvenience-0.040
Token "
Token ID366
Feature activation-1.34e-02

Pos logit contributions
fish+0.018
 curiously+0.018
 quietly+0.018
Hop+0.018
 carefully+0.017
Neg logit contributions
 fault-0.016
 arrest-0.016
 Kim-0.015
 fallen-0.015
 enough-0.014
TokenThat
Token ID2504
Feature activation-4.87e-02

Pos logit contributions
 quietly+0.015
 curiously+0.015
 carefully+0.014
 eagerly+0.014
Hop+0.014
Neg logit contributions
 fault-0.032
 arrest-0.030
 fallen-0.028
 earthquake-0.027
 Kim-0.027
Token was
Token ID373
Feature activation+6.69e-03

Pos logit contributions
 carefully+0.057
 quietly+0.056
 curiously+0.056
 eagerly+0.049
fish+0.049
Neg logit contributions
 fault-0.114
 arrest-0.113
 Kim-0.103
 inconvenience-0.101
erved-0.098
Token a
Token ID257
Feature activation-4.85e-03

Pos logit contributions
 arrest+0.016
 fault+0.016
erved+0.014
opl+0.014
 inconvenience+0.014
Neg logit contributions
 quietly-0.008
 carefully-0.008
 closer-0.007
 curiously-0.007
 eagerly-0.006
Token great
Token ID1049
Feature activation-7.22e-03

Pos logit contributions
 quietly+0.007
 curiously+0.007
 carefully+0.006
 eagerly+0.006
fish+0.006
Neg logit contributions
 fault-0.010
 arrest-0.009
 fallen-0.009
 earthquake-0.009
 Kim-0.009
Token race
Token ID3234
Feature activation+2.04e-02

Pos logit contributions
 quietly+0.013
 curiously+0.012
 carefully+0.012
 eagerly+0.012
fish+0.011
Neg logit contributions
 fault-0.013
 arrest-0.012
 fallen-0.011
 Kim-0.011
erved-0.010
Token!
Token ID0
Feature activation+1.77e-02

Pos logit contributions
 fault+0.038
 arrest+0.035
erved+0.033
 fallen+0.033
upid+0.031
Neg logit contributions
 quietly-0.034
 carefully-0.031
 curiously-0.030
 eagerly-0.030
fish-0.030
Token What
Token ID1867
Feature activation-5.41e-02

Pos logit contributions
 fault+0.032
 arrest+0.030
 fallen+0.028
 earthquake+0.027
erved+0.027
Neg logit contributions
 quietly-0.030
 curiously-0.028
 carefully-0.028
 eagerly-0.028
Hop-0.027
Token should
Token ID815
Feature activation+7.03e-03

Pos logit contributions
 quietly+0.080
 carefully+0.076
 curiously+0.073
fish+0.072
 eagerly+0.071
Neg logit contributions
 fault-0.109
 arrest-0.109
erved-0.099
 earthquake-0.095
 Kim-0.095
Token we
Token ID356
Feature activation+1.27e-03

Pos logit contributions
 fault+0.016
erved+0.014
OO+0.013
 arrest+0.013
opl+0.013
Neg logit contributions
 quietly-0.009
 carefully-0.009
 curiously-0.009
 eagerly-0.008
fish-0.008
Token do
Token ID466
Feature activation-3.33e-02

Pos logit contributions
 fault+0.002
 arrest+0.002
 fallen+0.002
 earthquake+0.002
 Kim+0.002
Neg logit contributions
 quietly-0.002
 curiously-0.002
fish-0.002
 carefully-0.002
 eagerly-0.002
Token wanted
Token ID2227
Feature activation-7.96e-03

Pos logit contributions
 fault+0.007
 arrest+0.006
 fallen+0.006
 Kim+0.006
 enough+0.006
Neg logit contributions
 curiously-0.007
 quietly-0.007
fish-0.006
 eagerly-0.006
 carefully-0.006
Token to
Token ID284
Feature activation+5.56e-03

Pos logit contributions
 quietly+0.013
 carefully+0.012
 curiously+0.011
 eagerly+0.011
 gently+0.011
Neg logit contributions
 fault-0.016
 arrest-0.015
 earthquake-0.014
erved-0.014
opl-0.014
Token help
Token ID1037
Feature activation+2.13e-03

Pos logit contributions
 fault+0.012
 arrest+0.011
 enough+0.011
 fallen+0.011
erved+0.010
Neg logit contributions
 quietly-0.007
 curiously-0.007
 carefully-0.007
 eagerly-0.007
fish-0.006
Token,
Token ID11
Feature activation+1.99e-03

Pos logit contributions
 fault+0.004
 arrest+0.004
 fallen+0.004
erved+0.004
 enough+0.004
Neg logit contributions
 quietly-0.003
 curiously-0.003
 carefully-0.003
 eagerly-0.003
fish-0.003
Token but
Token ID475
Feature activation-2.05e-02

Pos logit contributions
 fault+0.004
 arrest+0.004
erved+0.004
opl+0.004
 fallen+0.004
Neg logit contributions
 quietly-0.003
 carefully-0.003
 curiously-0.003
 eagerly-0.003
 gently-0.002
Token she
Token ID673
Feature activation+1.82e-02

Pos logit contributions
 quietly+0.030
 curiously+0.029
 carefully+0.028
 eagerly+0.027
fish+0.026
Neg logit contributions
 fault-0.045
 arrest-0.042
erved-0.037
 inconvenience-0.037
 fallen-0.037
Token was
Token ID373
Feature activation-1.52e-03

Pos logit contributions
 fault+0.032
 arrest+0.031
 fallen+0.030
 Kim+0.028
 enough+0.028
Neg logit contributions
 curiously-0.029
 quietly-0.029
 carefully-0.027
 eagerly-0.027
 mer-0.026
Token too
Token ID1165
Feature activation+7.77e-03

Pos logit contributions
 quietly+0.002
 curiously+0.002
 carefully+0.002
 eagerly+0.002
fish+0.002
Neg logit contributions
 fault-0.003
 arrest-0.003
 fallen-0.003
 Kim-0.003
 enough-0.003
Token small
Token ID1402
Feature activation-5.01e-03

Pos logit contributions
 fault+0.017
 arrest+0.015
 fallen+0.015
 Kim+0.014
 enough+0.014
Neg logit contributions
 quietly-0.010
 curiously-0.010
 carefully-0.010
 eagerly-0.010
fish-0.010
Token to
Token ID284
Feature activation+2.62e-02

Pos logit contributions
 quietly+0.006
 carefully+0.006
 curiously+0.005
 eagerly+0.005
 closely+0.005
Neg logit contributions
 arrest-0.012
 fault-0.011
 earthquake-0.011
 Entry-0.011
erved-0.011
Token reach
Token ID3151
Feature activation-5.56e-03

Pos logit contributions
 fault+0.058
 arrest+0.053
 fallen+0.049
 Kim+0.049
erved+0.049
Neg logit contributions
 quietly-0.036
 curiously-0.035
 carefully-0.034
 eagerly-0.032
fish-0.031
Token in
Token ID287
Feature activation-1.77e-02

Pos logit contributions
 quietly+0.027
 curiously+0.027
 carefully+0.025
 eagerly+0.024
fish+0.024
Neg logit contributions
 fault-0.058
 arrest-0.055
 fallen-0.051
 Kim-0.051
 enough-0.050
Token the
Token ID262
Feature activation+3.19e-02

Pos logit contributions
 quietly+0.027
 curiously+0.026
 carefully+0.025
 eagerly+0.025
fish+0.023
Neg logit contributions
 fault-0.036
 arrest-0.033
erved-0.031
 fallen-0.031
 Kim-0.030
Token air
Token ID1633
Feature activation-1.22e-02

Pos logit contributions
 fault+0.058
 arrest+0.053
 fallen+0.051
 earthquake+0.050
 Kim+0.048
Neg logit contributions
 quietly-0.053
 curiously-0.053
fish-0.051
 carefully-0.050
 eagerly-0.050
Token all
Token ID477
Feature activation+7.89e-03

Pos logit contributions
 quietly+0.015
 curiously+0.014
 carefully+0.014
 eagerly+0.013
fish+0.012
Neg logit contributions
 fault-0.029
 arrest-0.028
 Kim-0.026
 fallen-0.025
 earthquake-0.025
Token night
Token ID1755
Feature activation+3.07e-03

Pos logit contributions
 fault+0.017
 arrest+0.016
 fallen+0.014
 Entry+0.014
 Kim+0.014
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.011
 eagerly-0.010
fish-0.010
Token long
Token ID890
Feature activation+2.49e-02

Pos logit contributions
 fault+0.007
 arrest+0.006
 fallen+0.006
 Kim+0.006
erved+0.006
Neg logit contributions
 quietly-0.004
 curiously-0.004
 carefully-0.004
 eagerly-0.004
fish-0.004
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
 fault+0.058
 arrest+0.054
 Kim+0.053
erved+0.052
 earthquake+0.052
Neg logit contributions
 quietly-0.030
 carefully-0.028
 curiously-0.028
 eagerly-0.027
fish-0.024
Token until
Token ID1566
Feature activation+2.53e-03

Pos logit contributions
 fault+0.044
 arrest+0.041
 fallen+0.039
 enough+0.038
 Kim+0.037
Neg logit contributions
 curiously-0.031
 quietly-0.031
fish-0.029
Hop-0.028
 carefully-0.028
Token the
Token ID262
Feature activation+3.08e-02

Pos logit contributions
 fault+0.006
 arrest+0.005
 Kim+0.005
erved+0.005
 fallen+0.005
Neg logit contributions
 quietly-0.003
 curiously-0.003
 carefully-0.003
 eagerly-0.003
fish-0.003
Token sun
Token ID4252
Feature activation+8.18e-03

Pos logit contributions
 fault+0.055
 arrest+0.052
 fallen+0.049
 earthquake+0.048
 enough+0.046
Neg logit contributions
 quietly-0.053
 curiously-0.053
fish-0.050
 eagerly-0.050
 carefully-0.049
Token rose
Token ID8278
Feature activation-1.63e-03

Pos logit contributions
 fault+0.014
 fallen+0.014
 arrest+0.013
 enough+0.013
erved+0.013
Neg logit contributions
 curiously-0.013
 quietly-0.013
Hop-0.013
 carefully-0.012
fish-0.012
Token put
Token ID1234
Feature activation-2.69e-02

Pos logit contributions
 fault+0.024
 arrest+0.022
 fallen+0.021
 Kim+0.021
 enough+0.021
Neg logit contributions
 quietly-0.016
 curiously-0.016
 carefully-0.015
 eagerly-0.015
fish-0.015
Token her
Token ID607
Feature activation-1.99e-02

Pos logit contributions
 quietly+0.039
 curiously+0.039
fish+0.036
 carefully+0.036
 eagerly+0.035
Neg logit contributions
 fault-0.053
 arrest-0.050
 fallen-0.048
 enough-0.047
 Kim-0.046
Token in
Token ID287
Feature activation-3.34e-02

Pos logit contributions
 quietly+0.029
 curiously+0.028
Hop+0.026
 carefully+0.026
fish+0.026
Neg logit contributions
 fault-0.041
 arrest-0.037
 fallen-0.036
 enough-0.035
erved-0.034
Token an
Token ID281
Feature activation+2.06e-02

Pos logit contributions
 quietly+0.054
 curiously+0.053
 carefully+0.050
fish+0.050
 eagerly+0.049
Neg logit contributions
 fault-0.063
 arrest-0.058
 fallen-0.054
 Kim-0.053
 enough-0.053
Token even
Token ID772
Feature activation-7.76e-03

Pos logit contributions
 fault+0.039
 arrest+0.036
 fallen+0.034
 enough+0.033
 earthquake+0.033
Neg logit contributions
 quietly-0.033
 curiously-0.032
 carefully-0.030
fish-0.030
 eagerly-0.030
Token more
Token ID517
Feature activation+1.34e-02

Pos logit contributions
 curiously+0.011
 quietly+0.011
Hop+0.011
fish+0.011
 eagerly+0.010
Neg logit contributions
 fault-0.014
 enough-0.013
 fallen-0.013
 arrest-0.013
 earthquake-0.013
Token peaceful
Token ID12309
Feature activation+1.64e-02

Pos logit contributions
 fault+0.025
 arrest+0.023
 fallen+0.022
 enough+0.021
 earthquake+0.021
Neg logit contributions
 quietly-0.022
 curiously-0.021
 carefully-0.020
Hop-0.020
 eagerly-0.020
Token mood
Token ID10038
Feature activation+3.48e-02

Pos logit contributions
 fault+0.030
 arrest+0.028
 fallen+0.026
 enough+0.026
 earthquake+0.025
Neg logit contributions
 quietly-0.027
 curiously-0.026
 carefully-0.025
 eagerly-0.024
fish-0.024
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 fault+0.076
 arrest+0.073
 Kim+0.067
 fallen+0.066
 earthquake+0.066
Neg logit contributions
 quietly-0.046
 curiously-0.044
 carefully-0.044
 eagerly-0.042
fish-0.041
Token
Token ID198
Feature activation-2.27e-03

Pos logit contributions
 quietly+0.020
fish+0.019
 curiously+0.019
 carefully+0.018
 eagerly+0.017
Neg logit contributions
 fault-0.033
 arrest-0.031
 Kim-0.030
 fallen-0.029
 enough-0.028
Token
Token ID198
Feature activation+2.87e-03

Pos logit contributions
fish+0.003
 mer+0.003
 quietly+0.003
 curiously+0.003
beans+0.003
Neg logit contributions
 Kim-0.004
 fault-0.004
 arrest-0.004
 fallen-0.004
upid-0.003
Token then
Token ID788
Feature activation-8.82e-03

Pos logit contributions
 quietly+0.033
 curiously+0.032
 carefully+0.031
 eagerly+0.030
fish+0.030
Neg logit contributions
 fault-0.042
 arrest-0.039
 fallen-0.036
 Kim-0.035
 earthquake-0.035
Token,
Token ID11
Feature activation+7.45e-03

Pos logit contributions
 quietly+0.012
 curiously+0.011
 carefully+0.011
 eagerly+0.011
fish+0.011
Neg logit contributions
 fault-0.020
 arrest-0.018
 fallen-0.017
 earthquake-0.017
erved-0.017
Token something
Token ID1223
Feature activation-1.70e-02

Pos logit contributions
 fault+0.014
 arrest+0.013
 fallen+0.012
 earthquake+0.012
erved+0.012
Neg logit contributions
 quietly-0.012
 curiously-0.012
 carefully-0.012
 eagerly-0.011
fish-0.011
Token happened
Token ID3022
Feature activation+3.45e-03

Pos logit contributions
 curiously+0.041
 quietly+0.039
 eagerly+0.038
Hop+0.038
 carefully+0.038
Neg logit contributions
 fault-0.017
 fallen-0.016
 enough-0.014
 arrest-0.014
 caused-0.014
Token.
Token ID13
Feature activation+2.50e-02

Pos logit contributions
 fault+0.008
 arrest+0.007
 Kim+0.007
 earthquake+0.007
 fallen+0.007
Neg logit contributions
 quietly-0.004
 curiously-0.004
 carefully-0.004
 eagerly-0.004
fish-0.004
Token The
Token ID383
Feature activation+7.44e-03

Pos logit contributions
 fault+0.044
 arrest+0.040
 fallen+0.037
erved+0.036
 earthquake+0.036
Neg logit contributions
 quietly-0.045
 curiously-0.043
 carefully-0.041
 eagerly-0.040
fish-0.039
Token elevator
Token ID20932
Feature activation+2.16e-02

Pos logit contributions
 fault+0.012
 arrest+0.011
 fallen+0.010
 Kim+0.010
 earthquake+0.010
Neg logit contributions
 quietly-0.014
 curiously-0.014
 carefully-0.013
 eagerly-0.013
fish-0.013
Token stopped
Token ID5025
Feature activation+5.10e-03

Pos logit contributions
 fault+0.047
 arrest+0.040
upid+0.038
 yours+0.038
 Entry+0.037
Neg logit contributions
 quietly-0.035
 carefully-0.033
 curiously-0.031
 eagerly-0.030
fish-0.029
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
 fault+0.012
 arrest+0.011
erved+0.010
 fallen+0.010
 inconvenience+0.010
Neg logit contributions
 quietly-0.007
 curiously-0.006
 carefully-0.006
 eagerly-0.006
fish-0.006
Token The
Token ID383
Feature activation+7.16e-03

Pos logit contributions
 fault+0.037
 arrest+0.035
 fallen+0.032
erved+0.031
 earthquake+0.031
Neg logit contributions
 quietly-0.036
 curiously-0.034
 carefully-0.033
 eagerly-0.032
 softly-0.030
Token lights
Token ID7588
Feature activation+4.35e-03

Pos logit contributions
 fault+0.012
 arrest+0.011
 fallen+0.010
 earthquake+0.010
 Kim+0.010
Neg logit contributions
 quietly-0.013
 curiously-0.013
 carefully-0.012
 eagerly-0.012
fish-0.012
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
 quietly+0.020
 curiously+0.020
fish+0.019
 carefully+0.019
Hop+0.018
Neg logit contributions
 fault-0.034
 fallen-0.031
 arrest-0.031
 Kim-0.030
 enough-0.030
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
 quietly+0.012
 carefully+0.012
 curiously+0.012
 eagerly+0.011
 closely+0.011
Neg logit contributions
 arrest-0.025
 fault-0.024
 inconvenience-0.022
erved-0.022
EEP-0.021
Token nice
Token ID3621
Feature activation-2.80e-03

Pos logit contributions
 curiously+0.017
 quietly+0.017
 carefully+0.016
 eagerly+0.016
fish+0.016
Neg logit contributions
 fault-0.020
 arrest-0.019
 fallen-0.018
 earthquake-0.017
 enough-0.017
Token toy
Token ID13373
Feature activation-4.33e-03

Pos logit contributions
 quietly+0.004
 curiously+0.004
 carefully+0.004
 eagerly+0.004
fish+0.004
Neg logit contributions
 fault-0.005
 arrest-0.005
 fallen-0.005
 Kim-0.005
 earthquake-0.005
Token,
Token ID11
Feature activation+7.74e-03

Pos logit contributions
 quietly+0.006
 carefully+0.006
 curiously+0.006
fish+0.005
 eagerly+0.005
Neg logit contributions
 fault-0.009
 arrest-0.009
erved-0.008
 fallen-0.008
 inconvenience-0.008
Token but
Token ID475
Feature activation-2.32e-02

Pos logit contributions
 fault+0.016
 arrest+0.016
 fallen+0.014
 earthquake+0.014
erved+0.014
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.010
 eagerly-0.010
fish-0.009
Token reminded
Token ID14516
Feature activation+1.90e-02

Pos logit contributions
 quietly+0.032
 curiously+0.031
 carefully+0.029
fish+0.029
 eagerly+0.029
Neg logit contributions
 fault-0.049
 arrest-0.046
 fallen-0.043
 Kim-0.043
 enough-0.043
Token him
Token ID683
Feature activation-2.46e-02

Pos logit contributions
 fault+0.040
 arrest+0.037
 fallen+0.034
 inconvenience+0.033
erved+0.033
Neg logit contributions
 quietly-0.028
 curiously-0.027
 carefully-0.027
 eagerly-0.026
Hop-0.024
Token to
Token ID284
Feature activation+2.95e-02

Pos logit contributions
 carefully+0.034
 quietly+0.034
 curiously+0.030
 eagerly+0.030
fish+0.030
Neg logit contributions
 fault-0.052
 arrest-0.052
 Entry-0.049
 inconvenience-0.048
opl-0.048
Token be
Token ID307
Feature activation+1.46e-02

Pos logit contributions
 fault+0.053
 arrest+0.049
 fallen+0.045
 enough+0.044
erved+0.044
Neg logit contributions
 quietly-0.051
 carefully-0.049
 curiously-0.049
 eagerly-0.047
fish-0.045
Token careful
Token ID8161
Feature activation-1.18e-03

Pos logit contributions
 fault+0.031
 arrest+0.030
 fallen+0.027
 earthquake+0.027
 Kim+0.027
Neg logit contributions
 quietly-0.021
 curiously-0.019
 carefully-0.019
 eagerly-0.018
fish-0.018
Token with
Token ID351
Feature activation-2.65e-02

Pos logit contributions
 curiously+0.021
 quietly+0.020
 carefully+0.019
 eagerly+0.019
fish+0.019
Neg logit contributions
 fault-0.030
 arrest-0.028
 fallen-0.026
 enough-0.026
 Kim-0.026
Token blocks
Token ID7021
Feature activation+1.55e-02

Pos logit contributions
 quietly+0.049
 curiously+0.048
 carefully+0.047
fish+0.045
 eagerly+0.045
Neg logit contributions
 fault-0.044
 arrest-0.040
 fallen-0.036
erved-0.036
 earthquake-0.035
Token.
Token ID13
Feature activation+4.42e-02

Pos logit contributions
 fault+0.030
 arrest+0.029
 fallen+0.026
erved+0.026
 Kim+0.026
Neg logit contributions
 quietly-0.024
 carefully-0.023
 curiously-0.023
fish-0.022
 eagerly-0.022
Token We
Token ID775
Feature activation-7.33e-03

Pos logit contributions
 fault+0.074
 arrest+0.069
 fallen+0.064
 earthquake+0.060
erved+0.060
Neg logit contributions
 quietly-0.080
 curiously-0.078
 carefully-0.076
 eagerly-0.074
fish-0.073
Token built
Token ID3170
Feature activation+1.19e-02

Pos logit contributions
 quietly+0.012
 carefully+0.011
 curiously+0.011
 eagerly+0.011
fish+0.011
Neg logit contributions
 fault-0.014
 arrest-0.013
 Kim-0.012
erved-0.012
 fallen-0.011
Token a
Token ID257
Feature activation-7.79e-03

Pos logit contributions
 fault+0.022
 arrest+0.021
 fallen+0.019
 Kim+0.019
 earthquake+0.019
Neg logit contributions
 quietly-0.020
 curiously-0.019
 carefully-0.019
 eagerly-0.018
fish-0.018
Token yacht
Token ID43207
Feature activation+1.16e-02

Pos logit contributions
 curiously+0.013
 quietly+0.013
 eagerly+0.012
 carefully+0.012
fish+0.012
Neg logit contributions
 fault-0.014
 arrest-0.013
 fallen-0.012
 earthquake-0.012
 Kim-0.012
Token.
Token ID13
Feature activation+3.77e-02

Pos logit contributions
 fault+0.025
 arrest+0.023
 fallen+0.022
 Kim+0.021
 enough+0.021
Neg logit contributions
 quietly-0.016
 curiously-0.015
 carefully-0.014
 eagerly-0.014
fish-0.014
Token Do
Token ID2141
Feature activation-1.53e-03

Pos logit contributions
 fault+0.062
 arrest+0.057
 fallen+0.052
 Kim+0.050
 earthquake+0.050
Neg logit contributions
 quietly-0.070
 curiously-0.069
 carefully-0.066
 eagerly-0.065
fish-0.064
Token you
Token ID345
Feature activation+6.24e-03

Pos logit contributions
 quietly+0.002
 carefully+0.002
 else+0.001
fish+0.001
Sure+0.001
Neg logit contributions
 fault-0.004
 fallen-0.003
 arrest-0.003
erved-0.003
upid-0.003
Token like
Token ID588
Feature activation-4.86e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
erved+0.011
 Kim+0.011
Neg logit contributions
 quietly-0.009
 carefully-0.009
 curiously-0.009
 eagerly-0.009
fish-0.008
Token had
Token ID550
Feature activation+2.71e-02

Pos logit contributions
 curiously+0.002
 quietly+0.002
fish+0.002
 carefully+0.002
Hop+0.002
Neg logit contributions
 fault-0.003
 arrest-0.003
 fallen-0.003
 enough-0.002
 Kim-0.002
Token a
Token ID257
Feature activation+6.05e-03

Pos logit contributions
 fault+0.057
 earthquake+0.055
 arrest+0.055
 inconvenience+0.053
erved+0.050
Neg logit contributions
 carefully-0.038
 quietly-0.037
 eagerly-0.036
 cheered-0.034
 happily-0.034
Token wonderful
Token ID7932
Feature activation-1.13e-02

Pos logit contributions
 fault+0.011
 arrest+0.010
 fallen+0.009
 enough+0.009
 Kim+0.009
Neg logit contributions
 curiously-0.010
 quietly-0.010
fish-0.010
 eagerly-0.009
 carefully-0.009
Token day
Token ID1110
Feature activation-1.22e-02

Pos logit contributions
 quietly+0.025
 curiously+0.025
 carefully+0.023
fish+0.023
Hop+0.023
Neg logit contributions
 fault-0.015
 arrest-0.013
 fallen-0.012
 enough-0.011
 earthquake-0.011
Token full
Token ID1336
Feature activation+1.40e-02

Pos logit contributions
 curiously+0.018
 quietly+0.018
fish+0.017
Hop+0.016
 carefully+0.016
Neg logit contributions
 fault-0.023
 arrest-0.023
 fallen-0.021
 enough-0.021
 Kim-0.020
Token of
Token ID286
Feature activation+4.03e-03

Pos logit contributions
 fault+0.026
 arrest+0.024
 fallen+0.023
 enough+0.023
 Kim+0.022
Neg logit contributions
 quietly-0.022
 curiously-0.022
 carefully-0.021
Hop-0.021
fish-0.020
Token spinning
Token ID19493
Feature activation+6.66e-04

Pos logit contributions
 fault+0.008
 arrest+0.007
 fallen+0.007
 enough+0.006
 Kim+0.006
Neg logit contributions
 curiously-0.006
 quietly-0.006
fish-0.006
Hop-0.006
 closely-0.006
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
 fault+0.002
 arrest+0.001
 fallen+0.001
 Kim+0.001
 earthquake+0.001
Neg logit contributions
 quietly-0.001
 curiously-0.001
 carefully-0.001
 eagerly-0.001
fish-0.001
Token fun
Token ID1257
Feature activation-1.50e-03

Pos logit contributions
 fault+0.023
 arrest+0.022
 fallen+0.020
 Kim+0.020
 enough+0.019
Neg logit contributions
 quietly-0.020
 curiously-0.019
 carefully-0.018
 eagerly-0.018
fish-0.018
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 quietly+0.002
 curiously+0.002
 carefully+0.002
 eagerly+0.002
fish+0.002
Neg logit contributions
 fault-0.003
 arrest-0.003
 fallen-0.002
 Kim-0.002
 enough-0.002
Token<|endoftext|>
Token ID50256
Feature activation-2.10e-02

Pos logit contributions
 quietly+0.012
 curiously+0.012
fish+0.012
 carefully+0.011
 eagerly+0.011
Neg logit contributions
 fault-0.025
 arrest-0.024
 Kim-0.023
 fallen-0.022
 enough-0.022
Token he
Token ID339
Feature activation-1.50e-02

Pos logit contributions
 quietly+0.046
 curiously+0.046
 carefully+0.043
 eagerly+0.042
fish+0.042
Neg logit contributions
 fault-0.064
 arrest-0.061
 fallen-0.056
 Kim-0.055
 enough-0.055
Token knew
Token ID2993
Feature activation+1.08e-02

Pos logit contributions
 curiously+0.025
 quietly+0.025
fish+0.023
 carefully+0.023
 eagerly+0.023
Neg logit contributions
 fault-0.025
 arrest-0.025
 fallen-0.023
 enough-0.022
 Kim-0.022
Token how
Token ID703
Feature activation-5.83e-02

Pos logit contributions
 fault+0.022
 arrest+0.021
 fallen+0.019
erved+0.019
 earthquake+0.019
Neg logit contributions
 quietly-0.016
 curiously-0.015
 carefully-0.015
 eagerly-0.014
fish-0.014
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
 quietly+0.099
 curiously+0.097
 carefully+0.094
 eagerly+0.091
fish+0.090
Neg logit contributions
 fault-0.106
 arrest-0.099
 fallen-0.090
 Kim-0.089
 earthquake-0.088
Token softly
Token ID26625
Feature activation-1.64e-02

Pos logit contributions
 arrest+0.030
 fault+0.029
 fallen+0.027
 earthquake+0.026
 enough+0.025
Neg logit contributions
 quietly-0.025
 curiously-0.025
Hop-0.024
fish-0.024
 mer-0.024
Token rub
Token ID6437
Feature activation-3.54e-02

Pos logit contributions
 quietly+0.023
 curiously+0.022
 carefully+0.021
 eagerly+0.020
fish+0.020
Neg logit contributions
 fault-0.035
 arrest-0.033
 fallen-0.030
 Kim-0.030
 earthquake-0.030
Token her
Token ID607
Feature activation-2.05e-02

Pos logit contributions
 quietly+0.056
 curiously+0.055
 carefully+0.052
 eagerly+0.051
Hop+0.051
Neg logit contributions
 fault-0.066
 arrest-0.061
 fallen-0.057
 enough-0.057
 Kim-0.057
Token shoulders
Token ID12450
Feature activation+7.07e-03

Pos logit contributions
Hop+0.034
Sure+0.031
fish+0.031
 quietly+0.031
Would+0.030
Neg logit contributions
 fault-0.037
 enough-0.033
 fallen-0.031
 arrest-0.030
erved-0.029
Token.
Token ID13
Feature activation-1.58e-02

Pos logit contributions
 fault+0.015
 arrest+0.014
 fallen+0.014
 enough+0.013
 Kim+0.013
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.008
fish-0.008
 eagerly-0.008
Token His
Token ID2399
Feature activation+1.26e-02

Pos logit contributions
 quietly+0.016
 curiously+0.015
 carefully+0.014
 eagerly+0.014
fish+0.013
Neg logit contributions
 fault-0.040
 arrest-0.038
 fallen-0.036
 earthquake-0.035
 enough-0.035
Token mom
Token ID1995
Feature activation+1.69e-02

Pos logit contributions
 fault+0.026
 arrest+0.022
 fallen+0.021
 Kim+0.021
 enough+0.021
Neg logit contributions
fish-0.019
Hop-0.019
 quietly-0.019
 curiously-0.019
 mer-0.018
Token he
Token ID339
Feature activation+1.30e-02

Pos logit contributions
 quietly+0.014
 curiously+0.014
 carefully+0.013
fish+0.013
 eagerly+0.013
Neg logit contributions
 fault-0.020
 arrest-0.019
 fallen-0.018
 Kim-0.018
 enough-0.018
Token asked
Token ID1965
Feature activation-3.36e-02

Pos logit contributions
 fault+0.024
 arrest+0.023
 fallen+0.022
 enough+0.021
 Kim+0.021
Neg logit contributions
 curiously-0.019
 quietly-0.019
fish-0.018
Hop-0.017
 closely-0.017
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
 quietly+0.050
 curiously+0.049
 carefully+0.047
 eagerly+0.045
fish+0.043
Neg logit contributions
 fault-0.070
 arrest-0.066
 fallen-0.061
erved-0.059
 earthquake-0.059
Token baby
Token ID5156
Feature activation-2.37e-02

Pos logit contributions
fish+0.025
 curiously+0.025
 quietly+0.025
 carefully+0.024
Hop+0.024
Neg logit contributions
 fault-0.025
 arrest-0.022
 fallen-0.021
 enough-0.021
 earthquake-0.021
Token to
Token ID284
Feature activation-1.26e-03

Pos logit contributions
 quietly+0.035
 curiously+0.034
 carefully+0.032
 eagerly+0.031
fish+0.031
Neg logit contributions
 fault-0.048
 arrest-0.045
 fallen-0.042
 Kim-0.041
 enough-0.041
Token open
Token ID1280
Feature activation-3.12e-02

Pos logit contributions
 quietly+0.002
 curiously+0.002
 carefully+0.002
fish+0.002
 eagerly+0.002
Neg logit contributions
 fault-0.002
 arrest-0.002
 fallen-0.002
 Kim-0.002
 enough-0.002
Token his
Token ID465
Feature activation-4.06e-03

Pos logit contributions
 quietly+0.050
 curiously+0.049
 carefully+0.047
 eagerly+0.046
fish+0.046
Neg logit contributions
 fault-0.059
 arrest-0.055
 fallen-0.051
 Kim-0.050
 enough-0.049
Token mouth
Token ID5422
Feature activation-3.70e-04

Pos logit contributions
 quietly+0.008
 curiously+0.008
 carefully+0.008
 eagerly+0.008
fish+0.008
Neg logit contributions
 fault-0.006
 arrest-0.006
 fallen-0.005
 Kim-0.005
 enough-0.005
Token and
Token ID290
Feature activation-8.39e-03

Pos logit contributions
 quietly+0.001
 curiously+0.001
 carefully+0.000
 eagerly+0.000
fish+0.000
Neg logit contributions
 fault-0.001
 arrest-0.001
 fallen-0.001
 Kim-0.001
 enough-0.001
Token say
Token ID910
Feature activation-1.31e-02

Pos logit contributions
 curiously+0.012
 quietly+0.012
fish+0.012
Hop+0.011
 carefully+0.011
Neg logit contributions
 fault-0.016
 arrest-0.016
 Kim-0.015
 fallen-0.014
 enough-0.014
Token '
Token ID705
Feature activation-5.30e-02

Pos logit contributions
 quietly+0.018
 carefully+0.017
 curiously+0.017
 eagerly+0.016
fish+0.015
Neg logit contributions
 fault-0.029
 arrest-0.028
 fallen-0.026
erved-0.026
 earthquake-0.025
Token nice
Token ID3621
Feature activation-2.02e-02

Pos logit contributions
 fault+0.005
 arrest+0.005
erved+0.005
 fallen+0.005
upid+0.004
Neg logit contributions
 quietly-0.003
 carefully-0.003
 cheer-0.003
 curiously-0.003
 eagerly-0.002
Token again
Token ID757
Feature activation+2.14e-02

Pos logit contributions
 quietly+0.028
 curiously+0.027
 carefully+0.026
 eagerly+0.025
fish+0.025
Neg logit contributions
 fault-0.042
 arrest-0.040
 fallen-0.037
erved-0.036
 Kim-0.036
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
 fault+0.037
 arrest+0.035
 fallen+0.031
erved+0.031
 Kim+0.030
Neg logit contributions
 quietly-0.040
 carefully-0.038
 curiously-0.037
 eagerly-0.036
fish-0.035
Token Would
Token ID10928
Feature activation-6.08e-02

Pos logit contributions
 fault+0.007
 arrest+0.007
 Kim+0.006
 enough+0.006
 fallen+0.006
Neg logit contributions
 curiously-0.009
fish-0.009
 quietly-0.009
 carefully-0.009
 eagerly-0.009
Token you
Token ID345
Feature activation-1.70e-02

Pos logit contributions
 curiously+0.120
 quietly+0.119
fish+0.115
 eagerly+0.112
 mer+0.112
Neg logit contributions
 fault-0.082
 arrest-0.081
 enough-0.075
 fallen-0.073
 Kim-0.073
Token like
Token ID588
Feature activation-6.91e-02

Pos logit contributions
 curiously+0.030
 quietly+0.030
fish+0.028
Hop+0.027
 carefully+0.027
Neg logit contributions
 fault-0.029
 arrest-0.026
 enough-0.025
 fallen-0.024
 Kim-0.023
Token to
Token ID284
Feature activation+2.12e-02

Pos logit contributions
 quietly+0.119
 curiously+0.118
 carefully+0.112
 eagerly+0.110
fish+0.110
Neg logit contributions
 fault-0.123
 arrest-0.115
 fallen-0.105
 Kim-0.103
 enough-0.102
Token help
Token ID1037
Feature activation+1.41e-02

Pos logit contributions
 fault+0.043
 arrest+0.040
 fallen+0.037
 Kim+0.036
 enough+0.036
Neg logit contributions
 quietly-0.032
 curiously-0.031
 carefully-0.030
 eagerly-0.029
fish-0.029
Token me
Token ID502
Feature activation-4.40e-02

Pos logit contributions
 fault+0.023
 arrest+0.022
 Kim+0.020
 enough+0.020
 fallen+0.020
Neg logit contributions
 curiously-0.025
 quietly-0.025
fish-0.023
 eagerly-0.023
Hop-0.023
Token?"
Token ID1701
Feature activation-4.28e-02

Pos logit contributions
 curiously+0.068
 quietly+0.066
Hop+0.064
fish+0.064
 eagerly+0.062
Neg logit contributions
 fault-0.081
 arrest-0.077
 fallen-0.072
 enough-0.071
 Kim-0.070
Token
Token ID220
Feature activation+1.22e-02

Pos logit contributions
fish+0.066
 quietly+0.060
Hop+0.059
 mer+0.059
 curiously+0.057
Neg logit contributions
 Kim-0.076
 fault-0.073
 arrest-0.073
 enough-0.065
 fallen-0.065
Token."
Token ID526
Feature activation-3.05e-02

Pos logit contributions
 quietly+0.038
 carefully+0.036
 curiously+0.035
fish+0.033
 eagerly+0.033
Neg logit contributions
 fault-0.037
 arrest-0.035
erved-0.033
 inconvenience-0.030
hy-0.030
Token Mia
Token ID32189
Feature activation-2.48e-02

Pos logit contributions
 quietly+0.053
 curiously+0.052
 carefully+0.051
 eagerly+0.049
fish+0.047
Neg logit contributions
 fault-0.055
 arrest-0.051
 fallen-0.047
 enough-0.045
 earthquake-0.045
Token and
Token ID290
Feature activation-7.42e-03

Pos logit contributions
 curiously+0.040
Hop+0.039
fish+0.037
 quietly+0.037
 closely+0.036
Neg logit contributions
 fault-0.041
 arrest-0.039
 enough-0.038
 fallen-0.038
erved-0.036
Token Tom
Token ID4186
Feature activation+1.28e-04

Pos logit contributions
 quietly+0.016
 curiously+0.016
 carefully+0.016
 eagerly+0.015
 gently+0.015
Neg logit contributions
 fault-0.010
 arrest-0.009
 fallen-0.008
 inconvenience-0.008
 earthquake-0.008
Token looked
Token ID3114
Feature activation-1.33e-02

Pos logit contributions
 fault+0.000
 arrest+0.000
 fallen+0.000
 enough+0.000
erved+0.000
Neg logit contributions
 curiously-0.000
 quietly-0.000
fish-0.000
Hop-0.000
 carefully-0.000
Token at
Token ID379
Feature activation-5.07e-02

Pos logit contributions
fish+0.025
Hop+0.023
 therapist+0.023
 curiously+0.021
beans+0.021
Neg logit contributions
 fault-0.022
 enough-0.021
 fallen-0.019
erved-0.019
 earthquake-0.018
Token each
Token ID1123
Feature activation-4.05e-02

Pos logit contributions
 curiously+0.079
 quietly+0.079
fish+0.078
Hop+0.077
 eagerly+0.075
Neg logit contributions
 fault-0.097
 arrest-0.088
 Kim-0.086
 enough-0.083
 earthquake-0.080
Token other
Token ID584
Feature activation+1.71e-02

Pos logit contributions
 quietly+0.054
 curiously+0.053
 carefully+0.052
 eagerly+0.048
 closely+0.046
Neg logit contributions
 fault-0.091
 arrest-0.084
 fallen-0.078
erved-0.077
 inconvenience-0.077
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
 fault+0.039
 arrest+0.036
 enough+0.034
 fallen+0.034
 Kim+0.033
Neg logit contributions
 quietly-0.020
 curiously-0.020
fish-0.019
Hop-0.019
 carefully-0.019
Token knew
Token ID2993
Feature activation+1.39e-02

Pos logit contributions
 curiously+0.016
fish+0.015
Hop+0.015
 quietly+0.015
 carefully+0.014
Neg logit contributions
 fault-0.018
 Kim-0.017
 arrest-0.017
 enough-0.017
 fallen-0.016
Token they
Token ID484
Feature activation+3.07e-03

Pos logit contributions
 fault+0.031
 arrest+0.030
 fallen+0.028
erved+0.027
 earthquake+0.027
Neg logit contributions
 quietly-0.017
 curiously-0.017
 carefully-0.016
 eagerly-0.015
fish-0.015
Token hand
Token ID1021
Feature activation+2.15e-02

Pos logit contributions
 fault+0.036
 arrest+0.035
 inconvenience+0.032
 Entry+0.031
 earthquake+0.031
Neg logit contributions
 quietly-0.033
 curiously-0.033
 carefully-0.031
 eagerly-0.031
fish-0.030
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
 fault+0.049
 arrest+0.047
 inconvenience+0.043
 earthquake+0.043
 fallen+0.043
Neg logit contributions
 quietly-0.027
 curiously-0.026
 carefully-0.025
 eagerly-0.024
fish-0.022
Token They
Token ID1119
Feature activation+3.16e-03

Pos logit contributions
 quietly+0.022
 curiously+0.021
 carefully+0.020
 eagerly+0.020
 closely+0.019
Neg logit contributions
 fault-0.021
 arrest-0.020
 fallen-0.018
 enough-0.017
erved-0.017
Token asked
Token ID1965
Feature activation-3.19e-02

Pos logit contributions
 fault+0.006
 arrest+0.006
 fallen+0.005
 enough+0.005
 Kim+0.005
Neg logit contributions
 quietly-0.005
 curiously-0.005
fish-0.005
Hop-0.004
 carefully-0.004
Token him
Token ID683
Feature activation-3.77e-02

Pos logit contributions
 quietly+0.054
 curiously+0.053
 carefully+0.051
 eagerly+0.050
fish+0.048
Neg logit contributions
 fault-0.059
 arrest-0.055
 fallen-0.051
erved-0.049
 earthquake-0.049
Token what
Token ID644
Feature activation-5.58e-02

Pos logit contributions
 curiously+0.063
 quietly+0.062
fish+0.060
Hop+0.059
 eagerly+0.058
Neg logit contributions
 fault-0.067
 arrest-0.060
 enough-0.059
 Kim-0.058
 fallen-0.055
Token happened
Token ID3022
Feature activation-1.75e-03

Pos logit contributions
 quietly+0.092
 curiously+0.087
 carefully+0.087
fish+0.084
 eagerly+0.084
Neg logit contributions
 fault-0.105
 arrest-0.098
erved-0.091
 earthquake-0.087
 fallen-0.087
Token and
Token ID290
Feature activation-7.11e-03

Pos logit contributions
 quietly+0.002
 curiously+0.002
 carefully+0.002
 eagerly+0.002
fish+0.002
Neg logit contributions
 fault-0.004
 arrest-0.004
 fallen-0.004
 Kim-0.003
 earthquake-0.003
Token they
Token ID484
Feature activation+1.70e-02

Pos logit contributions
 quietly+0.010
 curiously+0.009
 carefully+0.009
 eagerly+0.008
fish+0.008
Neg logit contributions
 fault-0.017
 arrest-0.015
erved-0.014
 fallen-0.014
 earthquake-0.014
Token started
Token ID2067
Feature activation+7.30e-03

Pos logit contributions
 fault+0.035
 arrest+0.033
 fallen+0.030
 Kim+0.030
 enough+0.030
Neg logit contributions
 quietly-0.024
 curiously-0.024
 carefully-0.023
fish-0.022
 eagerly-0.022
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 Kim+0.011
 earthquake+0.011
Neg logit contributions
 quietly-0.013
 curiously-0.013
 carefully-0.012
 eagerly-0.012
fish-0.012
Token Tom
Token ID4186
Feature activation+1.93e-02

Pos logit contributions
 fault+0.021
 arrest+0.017
 fallen+0.016
 earthquake+0.015
upid+0.015
Neg logit contributions
 quietly-0.032
 curiously-0.032
 carefully-0.031
 eagerly-0.030
 closely-0.029
Token said
Token ID531
Feature activation+2.23e-02

Pos logit contributions
 fault+0.045
 arrest+0.043
 fallen+0.040
 Kim+0.040
 enough+0.039
Neg logit contributions
 quietly-0.023
 curiously-0.022
 carefully-0.021
fish-0.020
 eagerly-0.020
Token.
Token ID13
Feature activation+9.64e-03

Pos logit contributions
 fault+0.042
 arrest+0.039
 Kim+0.037
 fallen+0.036
 enough+0.036
Neg logit contributions
 quietly-0.034
 curiously-0.034
fish-0.033
Hop-0.033
 carefully-0.032
Token He
Token ID679
Feature activation-4.99e-03

Pos logit contributions
 fault+0.018
 arrest+0.017
 fallen+0.015
 earthquake+0.015
erved+0.015
Neg logit contributions
 quietly-0.016
 curiously-0.016
 carefully-0.015
 eagerly-0.015
fish-0.014
Token picked
Token ID6497
Feature activation-3.67e-02

Pos logit contributions
fish+0.009
Hop+0.008
 reef+0.008
 curiously+0.008
 quietly+0.008
Neg logit contributions
 arrest-0.007
 fallen-0.007
 Kim-0.006
 enough-0.006
 caused-0.006
Token up
Token ID510
Feature activation-5.80e-02

Pos logit contributions
 quietly+0.042
fish+0.040
 curiously+0.040
Hop+0.039
 eagerly+0.038
Neg logit contributions
 fault-0.084
 arrest-0.078
 enough-0.078
 Kim-0.075
 fallen-0.074
Token the
Token ID262
Feature activation-2.49e-02

Pos logit contributions
 quietly+0.093
 curiously+0.090
 carefully+0.088
 eagerly+0.085
fish+0.082
Neg logit contributions
 fault-0.113
 arrest-0.105
 fallen-0.097
 earthquake-0.096
 inconvenience-0.094
Token pin
Token ID6757
Feature activation-2.39e-02

Pos logit contributions
 quietly+0.040
 curiously+0.039
 carefully+0.038
 eagerly+0.037
fish+0.036
Neg logit contributions
 fault-0.047
 arrest-0.044
 fallen-0.041
 Kim-0.040
 enough-0.039
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
 quietly+0.028
 curiously+0.027
Hop+0.025
 eagerly+0.025
fish+0.025
Neg logit contributions
 fault-0.055
 arrest-0.051
 enough-0.049
 fallen-0.048
 Kim-0.047
Token held
Token ID2714
Feature activation-5.15e-02

Pos logit contributions
 curiously+0.017
Hop+0.017
 quietly+0.016
fish+0.016
 eagerly+0.015
Neg logit contributions
 arrest-0.019
 fault-0.019
 fallen-0.018
 Kim-0.018
 enough-0.018
Token it
Token ID340
Feature activation-2.48e-02

Pos logit contributions
Hop+0.087
fish+0.087
 quietly+0.084
 mer+0.083
 curiously+0.081
Neg logit contributions
 fault-0.083
 Kim-0.081
 enough-0.081
 arrest-0.077
 fallen-0.074
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 quietly+0.021
 curiously+0.020
 carefully+0.020
fish+0.019
 eagerly+0.019
Neg logit contributions
 fault-0.037
 arrest-0.035
 Kim-0.033
 earthquake-0.032
 fallen-0.032
Token He
Token ID679
Feature activation-8.67e-03

Pos logit contributions
 quietly+0.018
 curiously+0.017
 carefully+0.017
 eagerly+0.016
fish+0.016
Neg logit contributions
 fault-0.021
 arrest-0.019
 fallen-0.018
 earthquake-0.017
erved-0.017
Token was
Token ID373
Feature activation-1.26e-02

Pos logit contributions
 quietly+0.013
 curiously+0.013
Hop+0.012
fish+0.012
 eagerly+0.012
Neg logit contributions
 fault-0.016
 arrest-0.016
 fallen-0.015
 enough-0.014
erved-0.014
Token amazed
Token ID24562
Feature activation+6.14e-03

Pos logit contributions
 quietly+0.018
 curiously+0.018
 carefully+0.017
fish+0.017
 eagerly+0.016
Neg logit contributions
 fault-0.026
 arrest-0.024
 fallen-0.023
 Kim-0.022
 enough-0.022
Token by
Token ID416
Feature activation-3.15e-02

Pos logit contributions
 fault+0.013
 arrest+0.012
 fallen+0.011
 enough+0.011
 Kim+0.011
Neg logit contributions
 quietly-0.009
 curiously-0.009
 carefully-0.008
 eagerly-0.008
fish-0.008
Token what
Token ID644
Feature activation-5.51e-02

Pos logit contributions
 quietly+0.049
 curiously+0.049
 carefully+0.047
 eagerly+0.045
fish+0.045
Neg logit contributions
 fault-0.061
 arrest-0.057
 fallen-0.053
 Kim-0.052
erved-0.051
Token he
Token ID339
Feature activation-7.68e-03

Pos logit contributions
 quietly+0.080
 curiously+0.079
 carefully+0.075
 eagerly+0.073
fish+0.072
Neg logit contributions
 fault-0.112
 arrest-0.105
 fallen-0.098
 Kim-0.097
 enough-0.096
Token found
Token ID1043
Feature activation+7.86e-03

Pos logit contributions
 quietly+0.013
 curiously+0.013
 carefully+0.012
fish+0.012
 eagerly+0.012
Neg logit contributions
 fault-0.012
 arrest-0.012
 fallen-0.011
 enough-0.011
 caused-0.010
Token inside
Token ID2641
Feature activation-2.19e-02

Pos logit contributions
 fault+0.016
 arrest+0.015
 fallen+0.014
 Kim+0.014
 enough+0.014
Neg logit contributions
 quietly-0.011
 curiously-0.011
 carefully-0.010
 eagerly-0.010
fish-0.010
Token -
Token ID532
Feature activation-9.08e-04

Pos logit contributions
 quietly+0.030
 curiously+0.030
 carefully+0.028
 eagerly+0.027
fish+0.027
Neg logit contributions
 fault-0.047
 arrest-0.044
 fallen-0.041
erved-0.040
 Kim-0.040
Token a
Token ID257
Feature activation-9.38e-03

Pos logit contributions
 quietly+0.001
 curiously+0.001
fish+0.001
Hop+0.001
 carefully+0.001
Neg logit contributions
 fault-0.002
 arrest-0.002
 enough-0.001
 Kim-0.001
 fallen-0.001