TOP ACTIVATIONS
MAX = 0.137

sund ae with lots of topp ings . Tim my was
was proud of her big rad ish . She knew the
in being a modest little rad ish . <|endoftext|> One day
with his family of other rad ishes . One day ,
awn â s fault . The
sund ae with lots of topp ings . They
it was Tim my 's fault but Tim my felt guilty
of cheese and sauce and topp ings . The people who
ier ? You are y umm ier . You are my
there was a modest little rad ish . He lived in
, there was a kind rad ish . He liked to
, sauce , cheese and topp ings . She showed them
time there was a generous rad ish farmer . He had
, give me back my rad ish !" Ben shouted .
it away in a wheel bar row . Once
they can add their own topp ings too . Sara chooses
Once there was a clever rad ish . He lived in
knew it was his own fault . He would have to
, there was a sad rad ish that lived in a
her mom and share her rad ishes . She liked to

MIN ACTIVATIONS
MIN = -0.108

Lou put the add - on on the toy and
Max played with the mixed - up toys . They had
whenever Polly needed a pick - me - up , she
to wear and put make - up on her face .
remember to release the Wi - Fi when you 're done
push - ups and pull - ups . He worked really
needed a pick - me - up , she 'd read
. He loved playing make - bel ieve with his friends
. You cannot steal ice - cream s . You have
ez ing ! The sprinkle - covered pastry flew out of
how to use the Wi - Fi network to watch more
ie loved to play make - bel ieve . Today she
and Dad to the check - in counter . They were
! He thanked the repair - man and promised to never
He exclaimed , " Pick - a - bo o !
a zip . It zip - z ipped all around ,
up , trying the make - up like a young adult
sorry for using the past - date cheese . Sam learned
She could pick any ice - cream she wanted !
I am selling b erry - bus hes ."

INTERVAL 0.076 - 0.137
CONTAINS 0.028%

a time , there were two animals . One of them
asked his friend , a se ag ull named Tim my
The two little girls waited and waited
a time there was a rad ish farmer . He grew
, with all its different col ours , shapes and textures

INTERVAL 0.015 - 0.076
CONTAINS 16.257%

. Lily said , " Hi , my name is Lily
face and asked , " What happened , Lily ?"
the store . Can you help me ?" The
The friends couldn 't believe it . They ran inside
things . The farmer quickly grabbed the tool

INTERVAL -0.047 - 0.015
CONTAINS 83.343%

waved goodbye to the city and started his journey home .
. They began to eat and the little girl was so
some little sticks from outside the cage and carefully w ove
a blue m itten . " No , this
st eg osaurus , with a long tail and big plates

INTERVAL -0.108 - -0.047
CONTAINS 0.372%

She loved to play dress - up and wear pretty things
light message again , coming from the same spot . Cur
" Let 's spray the dog with the hose . Maybe
came ! She opened the package and what do you think
wanted to know why the dog kept complaining . After a
Token sund
Token ID37437
Feature activation+1.88e-02

Pos logit contributions
 excuse+0.054
arre+0.048
 fault+0.047
have+0.047
 consequences+0.046
Neg logit contributions
stack-0.037
 free-0.036
 valleys-0.035
tail-0.034
 low-0.034
Tokenae
Token ID3609
Feature activation+1.94e-02

Pos logit contributions
 excuse+0.037
 fault+0.031
burn+0.030
arre+0.030
resp+0.030
Neg logit contributions
 valleys-0.022
 free-0.019
tail-0.019
 Pat-0.018
 behold-0.018
Token with
Token ID351
Feature activation-2.95e-03

Pos logit contributions
 excuse+0.035
arre+0.029
resp+0.028
burn+0.027
 fault+0.027
Neg logit contributions
 valleys-0.028
 free-0.025
 delighted-0.025
tail-0.025
 rivers-0.024
Token lots
Token ID6041
Feature activation+2.29e-02

Pos logit contributions
 valleys+0.005
 rivers+0.004
 free+0.004
 Pat+0.004
 amaz+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.004
resp-0.004
arre-0.004
Token of
Token ID286
Feature activation+4.76e-02

Pos logit contributions
 excuse+0.037
burn+0.034
arre+0.032
 fault+0.031
have+0.029
Neg logit contributions
 valleys-0.036
 free-0.035
 rivers-0.032
 delight-0.031
 delighted-0.031
Token topp
Token ID23126
Feature activation+1.21e-01

Pos logit contributions
burn+0.078
 excuse+0.078
arre+0.074
resp+0.073
 fault+0.071
Neg logit contributions
 valleys-0.068
 delight-0.067
 free-0.065
 -0.062
 amaz-0.061
Tokenings
Token ID654
Feature activation-2.32e-02

Pos logit contributions
 excuse+0.219
burn+0.176
 divorce+0.175
 fault+0.172
 distur+0.171
Neg logit contributions
 valleys-0.160
 Pat-0.142
 free-0.142
 itself-0.133
 amaz-0.133
Token.
Token ID13
Feature activation+8.48e-03

Pos logit contributions
 valleys+0.029
 free+0.025
 Pat+0.025
 rivers+0.024
tail+0.023
Neg logit contributions
 excuse-0.047
 fault-0.039
burn-0.037
arre-0.036
resp-0.036
Token Tim
Token ID5045
Feature activation+1.76e-02

Pos logit contributions
 excuse+0.016
burn+0.013
arre+0.013
 fault+0.013
 haircut+0.013
Neg logit contributions
 valleys-0.012
 rivers-0.010
 free-0.009
 streams-0.009
 Pat-0.009
Tokenmy
Token ID1820
Feature activation+2.72e-02

Pos logit contributions
 excuse+0.026
resp+0.023
burn+0.019
 fault+0.019
 haircut+0.019
Neg logit contributions
 valleys-0.029
 free-0.026
 -0.026
 Pat-0.026
 behold-0.026
Token was
Token ID373
Feature activation+1.98e-02

Pos logit contributions
 excuse+0.056
burn+0.051
 fault+0.049
resp+0.048
ruck+0.047
Neg logit contributions
 delighted-0.029
 valleys-0.028
 -0.027
 behold-0.026
 free-0.026
Token was
Token ID373
Feature activation+1.98e-02

Pos logit contributions
 valleys+0.000
 rivers+0.000
 Pat+0.000
 free+0.000
 behold+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
 fault-0.000
resp-0.000
arre-0.000
Token proud
Token ID6613
Feature activation-4.99e-03

Pos logit contributions
 excuse+0.034
burn+0.031
 fault+0.029
 boo+0.028
have+0.028
Neg logit contributions
 valleys-0.033
 rivers-0.027
 streams-0.026
 forests-0.026
 Pat-0.026
Token of
Token ID286
Feature activation+4.75e-02

Pos logit contributions
 valleys+0.010
 Pat+0.009
 free+0.009
 rivers+0.009
 amaz+0.008
Neg logit contributions
 excuse-0.007
burn-0.005
 fault-0.005
resp-0.005
arre-0.005
Token her
Token ID607
Feature activation+1.25e-02

Pos logit contributions
 excuse+0.077
 fault+0.056
ins+0.055
 consequences+0.055
 dessert+0.055
Neg logit contributions
 valleys-0.092
 rivers-0.078
 dense-0.076
 streams-0.074
 majestic-0.072
Token big
Token ID1263
Feature activation+1.38e-02

Pos logit contributions
 excuse+0.022
burn+0.019
resp+0.019
arre+0.019
ins+0.018
Neg logit contributions
 valleys-0.017
 free-0.017
 amaz-0.015
 delighted-0.015
 content-0.015
Token rad
Token ID2511
Feature activation+1.20e-01

Pos logit contributions
 excuse+0.021
resp+0.019
burn+0.018
ins+0.018
have+0.017
Neg logit contributions
 valleys-0.023
 free-0.020
 amaz-0.020
 rivers-0.020
 delight-0.019
Tokenish
Token ID680
Feature activation+5.74e-02

Pos logit contributions
 excuse+0.152
burn+0.109
resp+0.106
 fault+0.106
 haircut+0.103
Neg logit contributions
 valleys-0.250
 rivers-0.219
 Pat-0.216
 free-0.216
 amaz-0.209
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 excuse+0.108
burn+0.086
arre+0.085
have+0.084
 fault+0.084
Neg logit contributions
 valleys-0.078
 free-0.075
 Pat-0.072
 rivers-0.067
tail-0.067
Token She
Token ID1375
Feature activation-1.90e-03

Pos logit contributions
 excuse+0.026
 fault+0.022
resp+0.021
burn+0.021
arre+0.021
Neg logit contributions
 valleys-0.024
 rivers-0.020
 behold-0.020
 rocky-0.019
 streams-0.019
Token knew
Token ID2993
Feature activation+8.27e-03

Pos logit contributions
 valleys+0.003
 rivers+0.003
 Pat+0.003
 streams+0.002
 behold+0.002
Neg logit contributions
 excuse-0.003
burn-0.003
resp-0.003
 fault-0.003
 haircut-0.003
Token the
Token ID262
Feature activation+1.71e-02

Pos logit contributions
 excuse+0.017
burn+0.014
 fault+0.014
resp+0.014
arre+0.014
Neg logit contributions
 valleys-0.011
 rivers-0.008
 Pat-0.008
 free-0.008
 behold-0.008
Token in
Token ID287
Feature activation+2.20e-02

Pos logit contributions
 valleys+0.013
 rivers+0.011
 Pat+0.011
 free+0.011
 behold+0.011
Neg logit contributions
 excuse-0.012
burn-0.009
 fault-0.009
resp-0.009
arre-0.009
Token being
Token ID852
Feature activation-7.15e-03

Pos logit contributions
 excuse+0.041
burn+0.033
 fault+0.033
 haircut+0.032
resp+0.032
Neg logit contributions
 valleys-0.033
 rivers-0.026
 Pat-0.025
 behold-0.025
 free-0.025
Token a
Token ID257
Feature activation+2.05e-02

Pos logit contributions
 valleys+0.011
 rivers+0.009
 Pat+0.009
 behold+0.009
stack+0.009
Neg logit contributions
 excuse-0.012
burn-0.010
ins-0.010
arre-0.010
 fault-0.010
Token modest
Token ID12949
Feature activation+3.02e-03

Pos logit contributions
 excuse+0.035
burn+0.028
resp+0.028
 fault+0.027
arre+0.027
Neg logit contributions
 valleys-0.034
 rivers-0.028
 free-0.028
 Pat-0.028
 amaz-0.027
Token little
Token ID1310
Feature activation+2.47e-02

Pos logit contributions
 excuse+0.005
burn+0.005
resp+0.005
have+0.004
ins+0.004
Neg logit contributions
 valleys-0.005
 rivers-0.004
 amaz-0.004
 free-0.004
 Pat-0.004
Token rad
Token ID2511
Feature activation+1.17e-01

Pos logit contributions
 excuse+0.042
resp+0.037
burn+0.036
arre+0.034
ins+0.034
Neg logit contributions
 valleys-0.041
 rivers-0.035
 free-0.034
 streams-0.033
 amaz-0.033
Tokenish
Token ID680
Feature activation+5.59e-02

Pos logit contributions
 excuse+0.160
burn+0.118
resp+0.117
 fault+0.114
 haircut+0.113
Neg logit contributions
 valleys-0.234
 rivers-0.204
 Pat-0.198
 free-0.198
 streams-0.194
Token.
Token ID13
Feature activation+8.99e-03

Pos logit contributions
 excuse+0.100
burn+0.081
resp+0.078
 fault+0.077
arre+0.077
Neg logit contributions
 valleys-0.086
 free-0.074
 Pat-0.074
 rivers-0.073
tail-0.069
Token<|endoftext|>
Token ID50256
Feature activation+3.00e-02

Pos logit contributions
 excuse+0.016
 fault+0.014
arre+0.013
burn+0.013
ins+0.012
Neg logit contributions
 valleys-0.013
 rivers-0.011
 free-0.011
 behold-0.010
 rocky-0.010
TokenOne
Token ID3198
Feature activation+2.25e-02

Pos logit contributions
 excuse+0.040
burn+0.033
arre+0.033
ins+0.033
 fault+0.032
Neg logit contributions
 valleys-0.051
 amaz-0.047
 winding-0.045
 majestic-0.045
 behold-0.044
Token day
Token ID1110
Feature activation-2.67e-03

Pos logit contributions
 excuse+0.039
burn+0.031
 fault+0.031
resp+0.030
 haircut+0.030
Neg logit contributions
 valleys-0.037
 rivers-0.031
 Pat-0.030
 free-0.030
 behold-0.029
Token with
Token ID351
Feature activation-1.69e-02

Pos logit contributions
 excuse+0.044
 fault+0.037
burn+0.037
arre+0.036
resp+0.036
Neg logit contributions
 valleys-0.033
 rivers-0.028
 free-0.027
 Pat-0.026
 forests-0.025
Token his
Token ID465
Feature activation-1.01e-03

Pos logit contributions
 valleys+0.031
tail+0.026
stack+0.026
 rivers+0.025
 behold+0.025
Neg logit contributions
 excuse-0.023
 consequences-0.019
burn-0.018
 dessert-0.018
 fault-0.017
Token family
Token ID1641
Feature activation-2.96e-03

Pos logit contributions
 valleys+0.002
 rivers+0.002
 behold+0.002
 Pat+0.002
 amaz+0.002
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
 haircut-0.001
resp-0.001
Token of
Token ID286
Feature activation+4.51e-02

Pos logit contributions
 valleys+0.003
 rivers+0.003
 Pat+0.003
 free+0.003
 +0.002
Neg logit contributions
 excuse-0.007
resp-0.006
arre-0.006
 fault-0.005
burn-0.005
Token other
Token ID584
Feature activation+5.36e-02

Pos logit contributions
 excuse+0.076
burn+0.062
 fault+0.060
resp+0.059
arre+0.058
Neg logit contributions
 valleys-0.077
 rivers-0.066
 free-0.064
 Pat-0.064
 streams-0.062
Token rad
Token ID2511
Feature activation+1.16e-01

Pos logit contributions
 excuse+0.091
burn+0.082
 haircut+0.079
 fault+0.078
resp+0.076
Neg logit contributions
 valleys-0.089
 rivers-0.077
 free-0.075
 streams-0.072
 forests-0.072
Tokenishes
Token ID5614
Feature activation+3.15e-02

Pos logit contributions
 excuse+0.169
resp+0.127
burn+0.127
 haircut+0.126
 fault+0.124
Neg logit contributions
 valleys-0.222
 rivers-0.193
 free-0.188
 Pat-0.187
 streams-0.182
Token.
Token ID13
Feature activation-2.57e-05

Pos logit contributions
 excuse+0.059
burn+0.049
resp+0.047
 haircut+0.047
 fault+0.047
Neg logit contributions
 valleys-0.046
 free-0.039
 rivers-0.038
 Pat-0.037
 streams-0.035
Token One
Token ID1881
Feature activation+1.18e-02

Pos logit contributions
 valleys+0.000
 rivers+0.000
 Pat+0.000
 free+0.000
 behold+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
 fault-0.000
resp-0.000
 haircut-0.000
Token day
Token ID1110
Feature activation-1.14e-02

Pos logit contributions
 excuse+0.021
burn+0.017
 fault+0.016
resp+0.016
 haircut+0.016
Neg logit contributions
 valleys-0.019
 rivers-0.016
 Pat-0.015
 free-0.015
 behold-0.015
Token,
Token ID11
Feature activation-1.64e-02

Pos logit contributions
 valleys+0.012
 Pat+0.010
 free+0.009
 rivers+0.009
 low+0.009
Neg logit contributions
 excuse-0.026
 fault-0.023
burn-0.023
arre-0.022
ins-0.022
Tokenawn
Token ID3832
Feature activation+1.27e-02

Pos logit contributions
 valleys+0.004
 Pat+0.004
 free+0.004
 rivers+0.004
 behold+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
 haircut-0.004
resp-0.004
arre-0.004
Tokenâ
Token ID22940
Feature activation+3.53e-02

Pos logit contributions
 excuse+0.022
burn+0.018
 fault+0.018
resp+0.017
 haircut+0.017
Neg logit contributions
 valleys-0.021
 free-0.018
 Pat-0.017
 rivers-0.017
 delighted-0.016
Token
Token ID26391
Feature activation-2.02e-02

Pos logit contributions
 excuse+0.064
burn+0.052
 fault+0.050
resp+0.050
arre+0.049
Neg logit contributions
 valleys-0.055
 rivers-0.046
 Pat-0.045
 free-0.044
 behold-0.043
Token
Token ID8151
Feature activation-2.76e-03

Pos logit contributions
 valleys+0.033
 rivers+0.029
 behold+0.029
 amaz+0.028
 rocky+0.028
Neg logit contributions
 excuse-0.029
burn-0.026
EEP-0.025
 fault-0.025
resp-0.025
Tokens
Token ID82
Feature activation+2.67e-02

Pos logit contributions
 valleys+0.006
 rivers+0.005
 Pat+0.005
 free+0.005
 behold+0.005
Neg logit contributions
 excuse-0.003
burn-0.002
 fault-0.002
resp-0.002
 haircut-0.002
Token fault
Token ID8046
Feature activation+1.15e-01

Pos logit contributions
 excuse+0.045
burn+0.036
resp+0.035
 fault+0.034
arre+0.034
Neg logit contributions
 valleys-0.044
 rivers-0.038
 Pat-0.037
 free-0.037
 behold-0.036
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 excuse+0.223
burn+0.183
resp+0.182
arre+0.181
have+0.177
Neg logit contributions
 valleys-0.153
 free-0.136
 Pat-0.134
 rivers-0.126
 itself-0.123
Token
Token ID220
Feature activation-1.20e-02

Pos logit contributions
 excuse+0.021
burn+0.017
 fault+0.017
resp+0.017
arre+0.017
Neg logit contributions
 valleys-0.015
 rivers-0.012
 free-0.012
 behold-0.012
 amaz-0.012
Token
Token ID198
Feature activation+4.76e-05

Pos logit contributions
 valleys+0.016
 rivers+0.013
 amaz+0.013
 behold+0.013
 free+0.013
Neg logit contributions
 excuse-0.023
burn-0.019
 fault-0.019
resp-0.019
arre-0.019
Token
Token ID198
Feature activation-1.92e-03

Pos logit contributions
 excuse+0.000
burn+0.000
have+0.000
resp+0.000
arre+0.000
Neg logit contributions
 valleys-0.000
 rivers-0.000
 amaz-0.000
 free-0.000
 streams-0.000
TokenThe
Token ID464
Feature activation+1.37e-02

Pos logit contributions
 valleys+0.003
 rivers+0.002
 Pat+0.002
 free+0.002
 behold+0.002
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
resp-0.003
 haircut-0.003
Token sund
Token ID37437
Feature activation+1.84e-02

Pos logit contributions
 low+0.002
 free+0.002
 delighted+0.002
 amaz+0.002
 majestic+0.002
Neg logit contributions
 excuse-0.004
resp-0.003
ins-0.003
 consequences-0.003
arre-0.003
Tokenae
Token ID3609
Feature activation+9.83e-03

Pos logit contributions
 excuse+0.036
 fault+0.031
arre+0.030
 consequences+0.030
resp+0.030
Neg logit contributions
 valleys-0.020
tail-0.019
 free-0.018
 behold-0.018
 Pat-0.017
Token with
Token ID351
Feature activation-2.96e-03

Pos logit contributions
 excuse+0.018
resp+0.015
arre+0.015
 fault+0.014
burn+0.014
Neg logit contributions
 valleys-0.015
 free-0.013
 rivers-0.013
 amaz-0.012
 delighted-0.012
Token lots
Token ID6041
Feature activation+1.78e-02

Pos logit contributions
 valleys+0.005
 rivers+0.004
 Pat+0.004
 behold+0.004
stack+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
 haircut-0.004
 fault-0.004
 consequences-0.003
Token of
Token ID286
Feature activation+4.58e-02

Pos logit contributions
 excuse+0.025
burn+0.022
 fault+0.020
arre+0.020
resp+0.019
Neg logit contributions
 valleys-0.033
 free-0.031
 rivers-0.030
 delighted-0.029
 amaz-0.029
Token topp
Token ID23126
Feature activation+1.14e-01

Pos logit contributions
 excuse+0.074
burn+0.070
arre+0.068
resp+0.065
 fault+0.064
Neg logit contributions
 valleys-0.074
 free-0.067
 rivers-0.065
 delight-0.065
 amaz-0.064
Tokenings
Token ID654
Feature activation-2.48e-02

Pos logit contributions
 excuse+0.209
burn+0.171
 divorce+0.166
 distur+0.165
have+0.164
Neg logit contributions
 valleys-0.153
 Pat-0.138
 free-0.131
 itself-0.129
 amaz-0.126
Token.
Token ID13
Feature activation+3.74e-03

Pos logit contributions
 valleys+0.041
 rivers+0.035
 Pat+0.034
 free+0.034
 behold+0.033
Neg logit contributions
 excuse-0.042
burn-0.033
 fault-0.033
resp-0.032
arre-0.032
Token
Token ID198
Feature activation+1.28e-03

Pos logit contributions
 excuse+0.006
arre+0.006
burn+0.006
 fault+0.006
 haircut+0.005
Neg logit contributions
 valleys-0.006
 rivers-0.005
 rocky-0.005
 majestic-0.004
 behold-0.004
Token
Token ID198
Feature activation-9.64e-04

Pos logit contributions
 excuse+0.002
burn+0.002
arre+0.002
resp+0.002
ins+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.002
 amaz-0.002
 streams-0.002
 rocky-0.001
TokenThey
Token ID2990
Feature activation-2.70e-03

Pos logit contributions
 valleys+0.002
 free+0.001
 rivers+0.001
 Pat+0.001
 behold+0.001
Neg logit contributions
 excuse-0.002
 fault-0.001
burn-0.001
resp-0.001
 haircut-0.001
Token it
Token ID340
Feature activation+1.49e-02

Pos logit contributions
 valleys+0.011
 free+0.009
 Pat+0.009
 delighted+0.009
 behold+0.008
Neg logit contributions
 excuse-0.021
burn-0.018
 fault-0.018
arre-0.017
ins-0.017
Token was
Token ID373
Feature activation-1.16e-04

Pos logit contributions
 excuse+0.031
ins+0.027
 haircut+0.026
burn+0.026
resp+0.026
Neg logit contributions
 valleys-0.015
 free-0.015
 Pat-0.014
 -0.014
 Bella-0.013
Token Tim
Token ID5045
Feature activation+1.44e-02

Pos logit contributions
 valleys+0.000
 free+0.000
 Pat+0.000
 rivers+0.000
 low+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
resp-0.000
ins-0.000
 haircut-0.000
Tokenmy
Token ID1820
Feature activation+1.92e-02

Pos logit contributions
 excuse+0.022
resp+0.021
 haircut+0.018
burn+0.017
arre+0.017
Neg logit contributions
 free-0.020
 Pat-0.019
 -0.019
 valleys-0.019
tail-0.018
Token's
Token ID338
Feature activation+3.10e-02

Pos logit contributions
 excuse+0.034
resp+0.029
burn+0.027
ins+0.027
 haircut+0.027
Neg logit contributions
 free-0.027
 valleys-0.025
 delighted-0.024
tail-0.023
 -0.023
Token fault
Token ID8046
Feature activation+1.12e-01

Pos logit contributions
 excuse+0.065
resp+0.062
ins+0.059
arre+0.057
have+0.057
Neg logit contributions
 free-0.027
 valleys-0.026
 Pat-0.026
 amaz-0.025
 delight-0.024
Token but
Token ID475
Feature activation+7.69e-03

Pos logit contributions
 excuse+0.210
have+0.183
resp+0.175
 haircut+0.175
 topp+0.169
Neg logit contributions
 valleys-0.134
 free-0.131
 Pat-0.130
 itself-0.122
 Bella-0.119
Token Tim
Token ID5045
Feature activation+1.01e-02

Pos logit contributions
 excuse+0.015
burn+0.012
 fault+0.012
resp+0.011
 haircut+0.011
Neg logit contributions
 valleys-0.012
 rivers-0.009
 Pat-0.009
 free-0.009
 streams-0.009
Tokenmy
Token ID1820
Feature activation+1.92e-02

Pos logit contributions
 excuse+0.014
resp+0.013
 haircut+0.011
burn+0.011
ruck+0.010
Neg logit contributions
 valleys-0.016
 free-0.016
 Pat-0.016
 -0.016
tail-0.015
Token felt
Token ID2936
Feature activation+1.85e-02

Pos logit contributions
 excuse+0.037
resp+0.033
burn+0.033
 haircut+0.032
ruck+0.031
Neg logit contributions
 free-0.020
 valleys-0.019
 delighted-0.019
 -0.019
 Pat-0.018
Token guilty
Token ID6717
Feature activation+3.83e-02

Pos logit contributions
 excuse+0.029
burn+0.023
 fault+0.022
resp+0.022
arre+0.022
Neg logit contributions
 valleys-0.033
 rivers-0.028
 Pat-0.028
 behold-0.027
 amaz-0.027
Token of
Token ID286
Feature activation+4.73e-02

Pos logit contributions
 valleys+0.015
 free+0.013
 rivers+0.013
 Pat+0.013
 rocky+0.013
Neg logit contributions
 excuse-0.010
burn-0.008
arre-0.007
 fault-0.007
resp-0.007
Token cheese
Token ID9891
Feature activation+2.57e-02

Pos logit contributions
 excuse+0.071
burn+0.069
resp+0.062
arre+0.061
 fault+0.057
Neg logit contributions
 valleys-0.082
 free-0.072
 rivers-0.072
 behold-0.068
 amaz-0.067
Token and
Token ID290
Feature activation-6.34e-03

Pos logit contributions
 excuse+0.041
burn+0.033
arre+0.032
 fault+0.031
 haircut+0.031
Neg logit contributions
 valleys-0.043
 free-0.038
 rivers-0.037
 Pat-0.037
 streams-0.035
Token sauce
Token ID10746
Feature activation+3.78e-02

Pos logit contributions
 valleys+0.011
 rivers+0.010
 free+0.009
 Pat+0.009
 behold+0.009
Neg logit contributions
 excuse-0.010
burn-0.008
arre-0.008
 fault-0.008
resp-0.008
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
 excuse+0.069
burn+0.059
arre+0.056
 fault+0.055
 haircut+0.055
Neg logit contributions
 valleys-0.054
 free-0.047
 rivers-0.045
 Pat-0.043
 low-0.042
Token topp
Token ID23126
Feature activation+1.11e-01

Pos logit contributions
 valleys+0.023
 rivers+0.020
 free+0.019
 Pat+0.019
 streams+0.019
Neg logit contributions
 excuse-0.021
burn-0.017
arre-0.017
resp-0.016
 fault-0.016
Tokenings
Token ID654
Feature activation-3.16e-02

Pos logit contributions
 excuse+0.187
burn+0.156
 haircut+0.148
resp+0.145
arre+0.144
Neg logit contributions
 valleys-0.168
 free-0.146
 Pat-0.145
 itself-0.141
 rivers-0.140
Token.
Token ID13
Feature activation+8.23e-04

Pos logit contributions
 valleys+0.051
 rivers+0.044
 free+0.043
 Pat+0.043
 behold+0.041
Neg logit contributions
 excuse-0.054
burn-0.044
 fault-0.043
arre-0.041
 haircut-0.041
Token The
Token ID383
Feature activation-3.09e-03

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
arre+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 free-0.001
 Pat-0.001
 behold-0.001
Token people
Token ID661
Feature activation-1.33e-02

Pos logit contributions
 valleys+0.005
 rivers+0.005
 free+0.005
 amaz+0.004
 streams+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
arre-0.004
resp-0.004
 fault-0.004
Token who
Token ID508
Feature activation-7.67e-03

Pos logit contributions
 valleys+0.020
 free+0.020
 below+0.017
 behold+0.017
 low+0.017
Neg logit contributions
 excuse-0.023
burn-0.021
resp-0.020
 fault-0.020
arre-0.018
Tokenier
Token ID959
Feature activation+1.16e-02

Pos logit contributions
 excuse+0.128
 boo+0.115
resp+0.112
arre+0.112
 divorce+0.110
Neg logit contributions
 valleys-0.145
tail-0.128
 rivers-0.123
 free-0.123
 Pat-0.122
Token?
Token ID30
Feature activation+2.16e-03

Pos logit contributions
 excuse+0.020
burn+0.017
arre+0.016
 haircut+0.016
resp+0.016
Neg logit contributions
 valleys-0.018
 rivers-0.015
 free-0.015
 Pat-0.015
 itself-0.014
Token You
Token ID921
Feature activation+8.39e-03

Pos logit contributions
 excuse+0.003
burn+0.003
 fault+0.003
resp+0.003
arre+0.003
Neg logit contributions
 valleys-0.004
 rivers-0.003
 free-0.003
 Pat-0.003
 behold-0.003
Token are
Token ID389
Feature activation+2.34e-02

Pos logit contributions
 excuse+0.013
burn+0.010
 fault+0.010
resp+0.010
 haircut+0.009
Neg logit contributions
 valleys-0.015
 rivers-0.013
 Pat-0.013
 free-0.013
 behold-0.012
Token y
Token ID331
Feature activation+3.55e-02

Pos logit contributions
 excuse+0.040
ins+0.030
 fault+0.030
resp+0.029
 haircut+0.029
Neg logit contributions
 valleys-0.041
 rivers-0.033
 Pat-0.032
 behold-0.032
 amaz-0.032
Tokenumm
Token ID13929
Feature activation+1.10e-01

Pos logit contributions
 excuse+0.050
burn+0.038
 fault+0.038
resp+0.037
 haircut+0.036
Neg logit contributions
 valleys-0.067
 rivers-0.058
 Pat-0.058
 free-0.057
 streams-0.056
Tokenier
Token ID959
Feature activation+1.90e-02

Pos logit contributions
 excuse+0.143
resp+0.131
 divorce+0.129
 boo+0.124
arre+0.122
Neg logit contributions
 valleys-0.163
tail-0.152
years-0.144
 rivers-0.141
 free-0.139
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
 excuse+0.033
burn+0.028
arre+0.027
resp+0.027
 fault+0.027
Neg logit contributions
 valleys-0.029
 rivers-0.024
 free-0.024
 Pat-0.024
 amaz-0.023
Token You
Token ID921
Feature activation+9.72e-03

Pos logit contributions
 excuse+0.023
burn+0.018
resp+0.017
 fault+0.017
arre+0.017
Neg logit contributions
 valleys-0.028
 rivers-0.024
 free-0.023
 Pat-0.023
 behold-0.023
Token are
Token ID389
Feature activation+2.21e-02

Pos logit contributions
 excuse+0.015
burn+0.012
 fault+0.012
resp+0.012
 haircut+0.011
Neg logit contributions
 valleys-0.017
 free-0.015
 rivers-0.014
 Pat-0.014
 behold-0.014
Token my
Token ID616
Feature activation+4.42e-02

Pos logit contributions
 excuse+0.037
ins+0.027
 haircut+0.027
 fault+0.027
have+0.027
Neg logit contributions
 valleys-0.039
 rivers-0.032
 Pat-0.031
 streams-0.031
 amaz-0.031
Token there
Token ID612
Feature activation-2.62e-03

Pos logit contributions
 excuse+0.003
burn+0.002
 fault+0.002
arre+0.002
ins+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.002
 amaz-0.002
 Pat-0.002
 behold-0.002
Token was
Token ID373
Feature activation-2.17e-02

Pos logit contributions
 valleys+0.006
 rivers+0.005
 free+0.005
tail+0.005
 streams+0.005
Neg logit contributions
 excuse-0.003
 fault-0.002
burn-0.002
arre-0.002
 haircut-0.002
Token a
Token ID257
Feature activation-4.54e-03

Pos logit contributions
 valleys+0.030
 rivers+0.024
 free+0.024
 Pat+0.024
 behold+0.023
Neg logit contributions
 excuse-0.044
burn-0.036
 fault-0.035
resp-0.035
 haircut-0.034
Token modest
Token ID12949
Feature activation-7.95e-03

Pos logit contributions
 valleys+0.009
 rivers+0.008
 Pat+0.008
 free+0.008
 behold+0.008
Neg logit contributions
 excuse-0.006
burn-0.004
 fault-0.004
resp-0.004
arre-0.004
Token little
Token ID1310
Feature activation+1.70e-02

Pos logit contributions
 valleys+0.013
 Pat+0.011
 rivers+0.011
 clearing+0.011
 free+0.011
Neg logit contributions
 excuse-0.013
 fault-0.010
arre-0.010
burn-0.010
ins-0.010
Token rad
Token ID2511
Feature activation+1.09e-01

Pos logit contributions
 excuse+0.023
burn+0.017
 fault+0.017
resp+0.017
arre+0.016
Neg logit contributions
 valleys-0.034
 rivers-0.029
 Pat-0.029
 free-0.029
 behold-0.028
Tokenish
Token ID680
Feature activation+4.60e-02

Pos logit contributions
 excuse+0.133
burn+0.095
 fault+0.092
resp+0.092
 haircut+0.088
Neg logit contributions
 valleys-0.231
 rivers-0.202
 Pat-0.200
 free-0.198
 behold-0.193
Token.
Token ID13
Feature activation-2.76e-03

Pos logit contributions
 excuse+0.097
 fault+0.085
arre+0.084
burn+0.079
resp+0.076
Neg logit contributions
 valleys-0.052
 Pat-0.046
 free-0.045
tail-0.044
stack-0.044
Token He
Token ID679
Feature activation-1.68e-03

Pos logit contributions
 valleys+0.004
 rivers+0.003
 free+0.003
 rocky+0.003
 streams+0.003
Neg logit contributions
 excuse-0.005
resp-0.004
reat-0.004
 haircut-0.004
burn-0.004
Token lived
Token ID5615
Feature activation-4.79e-02

Pos logit contributions
 valleys+0.003
 rivers+0.002
 Pat+0.002
 free+0.002
 behold+0.002
Neg logit contributions
 excuse-0.003
burn-0.003
 fault-0.003
resp-0.002
arre-0.002
Token in
Token ID287
Feature activation+2.43e-02

Pos logit contributions
 valleys+0.092
 rivers+0.079
 Pat+0.078
 behold+0.077
 free+0.077
Neg logit contributions
 excuse-0.069
burn-0.052
 fault-0.050
resp-0.049
arre-0.049
Token,
Token ID11
Feature activation-6.43e-03

Pos logit contributions
 valleys+0.025
 rivers+0.021
 free+0.020
 behold+0.020
tail+0.020
Neg logit contributions
 excuse-0.026
 fault-0.020
burn-0.020
resp-0.020
 haircut-0.019
Token there
Token ID612
Feature activation-3.31e-03

Pos logit contributions
 valleys+0.009
 rivers+0.008
 Pat+0.008
 free+0.007
 amaz+0.007
Neg logit contributions
 excuse-0.012
burn-0.010
 fault-0.009
resp-0.009
arre-0.009
Token was
Token ID373
Feature activation-2.23e-02

Pos logit contributions
 valleys+0.007
 rivers+0.006
tail+0.006
 free+0.006
 streams+0.006
Neg logit contributions
 excuse-0.004
 fault-0.003
burn-0.003
 haircut-0.002
arre-0.002
Token a
Token ID257
Feature activation-3.58e-03

Pos logit contributions
 valleys+0.030
 rivers+0.024
 free+0.024
 Pat+0.024
 behold+0.023
Neg logit contributions
 excuse-0.046
burn-0.038
resp-0.037
 fault-0.037
ins-0.036
Token kind
Token ID1611
Feature activation-2.93e-02

Pos logit contributions
 valleys+0.007
 rivers+0.006
 Pat+0.006
 free+0.006
 behold+0.006
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.003
resp-0.003
 haircut-0.003
Token rad
Token ID2511
Feature activation+1.08e-01

Pos logit contributions
 valleys+0.052
 Pat+0.045
 rivers+0.045
 free+0.044
 behold+0.043
Neg logit contributions
 excuse-0.045
burn-0.035
 fault-0.034
arre-0.033
resp-0.033
Tokenish
Token ID680
Feature activation+4.31e-02

Pos logit contributions
 excuse+0.136
burn+0.099
 fault+0.096
resp+0.095
arre+0.092
Neg logit contributions
 valleys-0.225
 rivers-0.197
 Pat-0.195
 free-0.193
 streams-0.188
Token.
Token ID13
Feature activation-6.59e-03

Pos logit contributions
 excuse+0.097
 fault+0.087
arre+0.084
burn+0.081
have+0.078
Neg logit contributions
 valleys-0.043
 Pat-0.037
 free-0.036
stack-0.036
tail-0.035
Token He
Token ID679
Feature activation+1.88e-03

Pos logit contributions
 valleys+0.009
 rivers+0.007
 free+0.007
 rocky+0.007
 streams+0.007
Neg logit contributions
 excuse-0.012
 haircut-0.010
resp-0.009
reat-0.009
 price-0.009
Token liked
Token ID8288
Feature activation-3.92e-03

Pos logit contributions
 excuse+0.004
burn+0.003
 fault+0.003
arre+0.003
resp+0.003
Neg logit contributions
 valleys-0.003
 free-0.002
 rivers-0.002
 Pat-0.002
 behold-0.002
Token to
Token ID284
Feature activation-3.29e-03

Pos logit contributions
 valleys+0.007
 rivers+0.006
 free+0.006
 Pat+0.006
 forests+0.006
Neg logit contributions
 excuse-0.006
burn-0.005
arre-0.005
 fault-0.005
resp-0.005
Token,
Token ID11
Feature activation+5.15e-03

Pos logit contributions
 excuse+0.048
burn+0.038
arre+0.036
 fault+0.036
have+0.036
Neg logit contributions
 valleys-0.048
 Pat-0.043
 rivers-0.042
 free-0.042
 Lou-0.040
Token sauce
Token ID10746
Feature activation+3.03e-02

Pos logit contributions
 excuse+0.009
burn+0.009
 fault+0.008
arre+0.008
resp+0.008
Neg logit contributions
 valleys-0.007
 -0.006
 free-0.006
 Pat-0.006
 rivers-0.006
Token,
Token ID11
Feature activation+6.42e-04

Pos logit contributions
 excuse+0.057
burn+0.048
have+0.046
 fault+0.045
arre+0.045
Neg logit contributions
 valleys-0.041
 free-0.036
 Pat-0.036
 rivers-0.035
 itself-0.034
Token cheese
Token ID9891
Feature activation+1.33e-02

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
arre+0.001
EEP+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 free-0.001
 -0.001
 streams-0.001
Token and
Token ID290
Feature activation-2.51e-02

Pos logit contributions
 excuse+0.025
burn+0.021
 fault+0.020
arre+0.020
have+0.020
Neg logit contributions
 valleys-0.019
 Pat-0.016
 free-0.016
 rivers-0.016
 Lou-0.015
Token topp
Token ID23126
Feature activation+1.07e-01

Pos logit contributions
 valleys+0.038
 rivers+0.034
 free+0.033
 amaz+0.031
 streams+0.031
Neg logit contributions
 excuse-0.044
burn-0.038
 fault-0.037
resp-0.036
arre-0.036
Tokenings
Token ID654
Feature activation-3.45e-02

Pos logit contributions
 excuse+0.177
burn+0.145
 haircut+0.138
 fault+0.136
resp+0.135
Neg logit contributions
 valleys-0.169
 Pat-0.148
 free-0.146
 rivers-0.142
 itself-0.141
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
 valleys+0.056
 rivers+0.047
 Pat+0.047
 free+0.046
 behold+0.044
Neg logit contributions
 excuse-0.060
burn-0.048
 fault-0.047
resp-0.046
 haircut-0.045
Token She
Token ID1375
Feature activation-1.96e-02

Pos logit contributions
 valleys+0.003
 rivers+0.002
 free+0.002
 Pat+0.002
 behold+0.002
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
resp-0.003
arre-0.003
Token showed
Token ID3751
Feature activation+8.10e-03

Pos logit contributions
 valleys+0.023
 delighted+0.022
 behold+0.022
 Pat+0.021
 free+0.021
Neg logit contributions
 excuse-0.038
burn-0.034
 fault-0.034
 consequences-0.032
resp-0.031
Token them
Token ID606
Feature activation-2.17e-02

Pos logit contributions
 excuse+0.014
burn+0.011
 fault+0.011
resp+0.010
 haircut+0.010
Neg logit contributions
 valleys-0.014
 rivers-0.011
 free-0.011
 Pat-0.011
 behold-0.011
Token time
Token ID640
Feature activation-1.29e-02

Pos logit contributions
 valleys+0.021
 low+0.019
 free+0.019
 behold+0.019
 Pat+0.019
Neg logit contributions
 excuse-0.018
burn-0.015
resp-0.015
ins-0.015
 fault-0.014
Token there
Token ID612
Feature activation+2.07e-04

Pos logit contributions
 valleys+0.022
 rivers+0.019
tail+0.017
 free+0.017
 streams+0.017
Neg logit contributions
 excuse-0.022
 fault-0.017
resp-0.016
burn-0.016
 haircut-0.016
Token was
Token ID373
Feature activation-2.07e-02

Pos logit contributions
 excuse+0.000
 fault+0.000
burn+0.000
que+0.000
 haircut+0.000
Neg logit contributions
 valleys-0.000
 rivers-0.000
tail-0.000
 streams-0.000
 free-0.000
Token a
Token ID257
Feature activation-5.94e-03

Pos logit contributions
 valleys+0.030
 rivers+0.024
 Pat+0.024
 free+0.024
 behold+0.023
Neg logit contributions
 excuse-0.040
burn-0.033
 fault-0.032
resp-0.032
arre-0.031
Token generous
Token ID14431
Feature activation-2.74e-03

Pos logit contributions
 valleys+0.013
 rivers+0.011
 Pat+0.011
 free+0.011
 behold+0.011
Neg logit contributions
 excuse-0.007
burn-0.005
 haircut-0.005
resp-0.005
 fault-0.005
Token rad
Token ID2511
Feature activation+1.07e-01

Pos logit contributions
 valleys+0.005
 Pat+0.004
 rivers+0.004
 free+0.004
 behold+0.004
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
arre-0.003
resp-0.003
Tokenish
Token ID680
Feature activation+4.30e-02

Pos logit contributions
 excuse+0.128
burn+0.091
 fault+0.087
resp+0.085
 haircut+0.083
Neg logit contributions
 valleys-0.230
 rivers-0.202
 Pat-0.200
 free-0.198
 behold-0.195
Token farmer
Token ID18739
Feature activation+2.85e-02

Pos logit contributions
 excuse+0.095
 fault+0.083
arre+0.081
burn+0.079
have+0.075
Neg logit contributions
 valleys-0.045
 Pat-0.040
 free-0.038
stack-0.038
 rivers-0.038
Token.
Token ID13
Feature activation-4.43e-03

Pos logit contributions
 excuse+0.049
burn+0.038
 fault+0.038
resp+0.037
arre+0.037
Neg logit contributions
 valleys-0.047
 rivers-0.040
 Pat-0.040
 free-0.039
 behold-0.038
Token He
Token ID679
Feature activation-1.10e-03

Pos logit contributions
 valleys+0.007
 free+0.006
 amaz+0.006
 behold+0.006
 rivers+0.006
Neg logit contributions
 excuse-0.007
 haircut-0.005
ruck-0.005
 price-0.005
reat-0.005
Token had
Token ID550
Feature activation-8.86e-03

Pos logit contributions
 valleys+0.001
 free+0.001
 Pat+0.001
 rivers+0.001
 behold+0.001
Neg logit contributions
 excuse-0.002
 fault-0.002
burn-0.002
arre-0.002
 haircut-0.002
Token,
Token ID11
Feature activation+4.84e-02

Pos logit contributions
 excuse+0.101
burn+0.081
 fault+0.079
resp+0.079
 haircut+0.078
Neg logit contributions
 valleys-0.084
 rivers-0.069
 Pat-0.067
 free-0.066
 behold-0.064
Token give
Token ID1577
Feature activation+2.29e-03

Pos logit contributions
 excuse+0.089
burn+0.081
arre+0.077
ins+0.077
 fault+0.076
Neg logit contributions
 valleys-0.062
 Pat-0.061
 free-0.059
 behold-0.056
 amaz-0.055
Token me
Token ID502
Feature activation-1.62e-02

Pos logit contributions
 excuse+0.004
burn+0.003
have+0.003
ins+0.003
 haircut+0.003
Neg logit contributions
 valleys-0.004
 amaz-0.003
 rivers-0.003
 winding-0.003
 delighted-0.003
Token back
Token ID736
Feature activation+7.58e-04

Pos logit contributions
 valleys+0.021
 free+0.018
 Pat+0.018
 rivers+0.018
 behold+0.017
Neg logit contributions
 excuse-0.032
burn-0.027
 fault-0.026
arre-0.026
resp-0.026
Token my
Token ID616
Feature activation+1.24e-02

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
arre+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 behold-0.001
Token rad
Token ID2511
Feature activation+1.06e-01

Pos logit contributions
 excuse+0.020
burn+0.017
arre+0.017
resp+0.017
have+0.016
Neg logit contributions
 free-0.019
 valleys-0.019
 amaz-0.017
 itself-0.017
 majestic-0.017
Tokenish
Token ID680
Feature activation+4.66e-02

Pos logit contributions
 excuse+0.144
resp+0.105
burn+0.104
 haircut+0.104
arre+0.101
Neg logit contributions
 valleys-0.208
 free-0.182
 rivers-0.181
 Pat-0.181
 amaz-0.173
Token!"
Token ID2474
Feature activation+4.39e-03

Pos logit contributions
 excuse+0.085
burn+0.073
arre+0.071
 fault+0.069
have+0.069
Neg logit contributions
 valleys-0.061
 free-0.061
 Pat-0.056
 itself-0.053
tail-0.053
Token Ben
Token ID3932
Feature activation+2.51e-02

Pos logit contributions
 excuse+0.008
burn+0.007
 fault+0.007
resp+0.007
 haircut+0.006
Neg logit contributions
 valleys-0.006
 rivers-0.005
 free-0.005
 Pat-0.005
 amaz-0.005
Token shouted
Token ID17293
Feature activation+2.28e-02

Pos logit contributions
 excuse+0.050
burn+0.041
resp+0.040
 fault+0.040
 haircut+0.040
Neg logit contributions
 valleys-0.034
 rivers-0.027
 Pat-0.026
 free-0.026
 amaz-0.026
Token.
Token ID13
Feature activation+8.51e-03

Pos logit contributions
 excuse+0.039
burn+0.032
 fault+0.031
resp+0.030
arre+0.030
Neg logit contributions
 valleys-0.037
 rivers-0.031
 Pat-0.031
 free-0.031
 behold-0.030
Token it
Token ID340
Feature activation+1.30e-03

Pos logit contributions
 valleys+0.019
 amaz+0.016
tail+0.016
stack+0.016
elope+0.016
Neg logit contributions
 excuse-0.023
ruck-0.019
 fault-0.019
 consequences-0.019
reat-0.019
Token away
Token ID1497
Feature activation+3.52e-03

Pos logit contributions
 excuse+0.002
burn+0.002
 fault+0.002
resp+0.002
 haircut+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.002
 Pat-0.002
 behold-0.002
 amaz-0.002
Token in
Token ID287
Feature activation+2.59e-02

Pos logit contributions
 excuse+0.005
 fault+0.004
burn+0.003
resp+0.003
ruck+0.003
Neg logit contributions
 valleys-0.007
 behold-0.006
 Pat-0.006
tail-0.006
 amaz-0.006
Token a
Token ID257
Feature activation+9.91e-03

Pos logit contributions
 excuse+0.048
 fault+0.041
ruck+0.038
burn+0.038
 consequences+0.038
Neg logit contributions
 valleys-0.035
tail-0.031
 Pat-0.029
 behold-0.029
 Alicia-0.029
Token wheel
Token ID7825
Feature activation+2.87e-02

Pos logit contributions
 excuse+0.017
 haircut+0.013
burn+0.013
 fault+0.013
arre+0.013
Neg logit contributions
 valleys-0.015
 Pat-0.014
 behold-0.013
 Lou-0.013
 rivers-0.013
Tokenbar
Token ID5657
Feature activation+1.06e-01

Pos logit contributions
 excuse+0.050
burn+0.040
resp+0.039
 fault+0.039
arre+0.038
Neg logit contributions
 valleys-0.046
 rivers-0.039
 Pat-0.038
 free-0.038
 behold-0.036
Tokenrow
Token ID808
Feature activation+9.92e-03

Pos logit contributions
 excuse+0.207
resp+0.174
 haircut+0.172
burn+0.170
arre+0.167
Neg logit contributions
 valleys-0.139
 free-0.115
 rivers-0.115
 Pat-0.111
 itself-0.107
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 excuse+0.015
 fault+0.012
burn+0.012
resp+0.011
reat+0.011
Neg logit contributions
 valleys-0.018
 behold-0.015
 rivers-0.015
 Pat-0.015
 amaz-0.015
Token
Token ID198
Feature activation+1.11e-02

Pos logit contributions
 excuse+0.021
 fault+0.018
EEP+0.018
arre+0.018
reat+0.018
Neg logit contributions
 valleys-0.012
tail-0.010
 rocky-0.010
 majestic-0.010
 behold-0.009
Token
Token ID198
Feature activation+9.10e-03

Pos logit contributions
 excuse+0.016
reat+0.012
ruck+0.012
ins+0.012
have+0.012
Neg logit contributions
 valleys-0.020
 amaz-0.018
 rivers-0.017
tail-0.017
 streams-0.017
TokenOnce
Token ID7454
Feature activation-7.59e-03

Pos logit contributions
 excuse+0.017
 fault+0.014
burn+0.014
 haircut+0.013
ins+0.013
Neg logit contributions
 valleys-0.014
 rivers-0.011
 free-0.011
 Pat-0.011
 behold-0.011
Token they
Token ID484
Feature activation+5.73e-03

Pos logit contributions
 valleys+0.018
 Pat+0.015
 rivers+0.014
 free+0.014
 behold+0.014
Neg logit contributions
 excuse-0.035
burn-0.031
arre-0.030
 fault-0.030
ruck-0.029
Token can
Token ID460
Feature activation+1.09e-02

Pos logit contributions
 excuse+0.009
burn+0.008
resp+0.008
 fault+0.007
 haircut+0.007
Neg logit contributions
 valleys-0.010
 free-0.008
 rivers-0.008
 behold-0.008
 Pat-0.008
Token add
Token ID751
Feature activation+3.02e-02

Pos logit contributions
 excuse+0.018
resp+0.015
 fault+0.015
ins+0.015
 consequences+0.015
Neg logit contributions
 valleys-0.017
 free-0.016
 Pat-0.015
tail-0.015
 behold-0.015
Token their
Token ID511
Feature activation-2.95e-02

Pos logit contributions
 excuse+0.056
burn+0.046
resp+0.045
 fault+0.044
arre+0.044
Neg logit contributions
 valleys-0.046
 rivers-0.038
 free-0.037
 Pat-0.037
 behold-0.035
Token own
Token ID898
Feature activation-2.70e-03

Pos logit contributions
 valleys+0.052
 rivers+0.046
 free+0.046
 amaz+0.044
 streams+0.043
Neg logit contributions
 excuse-0.045
burn-0.040
resp-0.036
arre-0.035
ruck-0.034
Token topp
Token ID23126
Feature activation+1.06e-01

Pos logit contributions
 valleys+0.005
 rivers+0.004
 free+0.004
 amaz+0.004
 streams+0.004
Neg logit contributions
 excuse-0.004
burn-0.004
resp-0.003
arre-0.003
que-0.003
Tokenings
Token ID654
Feature activation-2.31e-02

Pos logit contributions
 excuse+0.187
burn+0.155
resp+0.146
 haircut+0.144
have+0.144
Neg logit contributions
 valleys-0.150
 Pat-0.130
 itself-0.128
 amaz-0.127
 free-0.125
Token too
Token ID1165
Feature activation-2.41e-02

Pos logit contributions
 valleys+0.036
 Pat+0.033
 free+0.032
 rivers+0.031
 Lou+0.030
Neg logit contributions
 excuse-0.039
burn-0.031
 fault-0.031
arre-0.030
resp-0.029
Token.
Token ID13
Feature activation+6.58e-03

Pos logit contributions
 valleys+0.037
 Pat+0.030
 rivers+0.030
 free+0.030
 behold+0.028
Neg logit contributions
 excuse-0.044
burn-0.036
 fault-0.034
arre-0.034
resp-0.034
Token Sara
Token ID24799
Feature activation-1.50e-02

Pos logit contributions
 excuse+0.014
burn+0.012
 fault+0.012
resp+0.012
ruck+0.011
Neg logit contributions
 valleys-0.008
 rivers-0.006
 Pat-0.006
 free-0.006
 amaz-0.006
Token chooses
Token ID19769
Feature activation-4.49e-03

Pos logit contributions
 valleys+0.025
 Pat+0.021
 free+0.021
 Lou+0.021
 delighted+0.021
Neg logit contributions
 excuse-0.024
burn-0.021
resp-0.019
ruck-0.018
 haircut-0.018
TokenOnce
Token ID7454
Feature activation-2.28e-03

Pos logit contributions
 excuse+0.034
burn+0.028
arre+0.028
que+0.027
 fault+0.027
Neg logit contributions
 valleys-0.046
 amaz-0.042
 winding-0.041
 rocky-0.041
 majestic-0.040
Token there
Token ID612
Feature activation+5.85e-03

Pos logit contributions
 valleys+0.005
 rivers+0.005
tail+0.005
 amaz+0.005
 streams+0.005
Neg logit contributions
 excuse-0.002
 fault-0.001
ins-0.001
burn-0.001
arre-0.001
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 excuse+0.008
 fault+0.006
burn+0.006
 haircut+0.005
reat+0.005
Neg logit contributions
 valleys-0.012
 rivers-0.010
tail-0.010
 free-0.010
 streams-0.010
Token a
Token ID257
Feature activation-6.64e-03

Pos logit contributions
 valleys+0.025
 free+0.021
 Pat+0.020
 rivers+0.020
 behold+0.019
Neg logit contributions
 excuse-0.043
resp-0.036
burn-0.035
 haircut-0.034
ins-0.034
Token clever
Token ID14169
Feature activation-4.90e-03

Pos logit contributions
 valleys+0.013
 free+0.011
 rivers+0.011
 Pat+0.011
 behold+0.011
Neg logit contributions
 excuse-0.009
 fault-0.007
burn-0.007
resp-0.006
arre-0.006
Token rad
Token ID2511
Feature activation+1.05e-01

Pos logit contributions
 valleys+0.008
 Pat+0.007
 rivers+0.007
 free+0.007
 low+0.007
Neg logit contributions
 excuse-0.008
 fault-0.006
burn-0.006
arre-0.006
have-0.006
Tokenish
Token ID680
Feature activation+4.51e-02

Pos logit contributions
 excuse+0.141
resp+0.104
burn+0.103
 fault+0.101
 haircut+0.099
Neg logit contributions
 valleys-0.211
 rivers-0.184
 Pat-0.181
 free-0.179
 streams-0.176
Token.
Token ID13
Feature activation-7.24e-03

Pos logit contributions
 excuse+0.095
 fault+0.083
arre+0.082
burn+0.078
resp+0.075
Neg logit contributions
 valleys-0.051
 Pat-0.047
 free-0.044
stack-0.043
tail-0.043
Token He
Token ID679
Feature activation-5.67e-03

Pos logit contributions
 valleys+0.011
 free+0.009
 rivers+0.009
 amaz+0.009
 rocky+0.009
Neg logit contributions
 excuse-0.012
 fault-0.010
reat-0.009
burn-0.009
resp-0.009
Token lived
Token ID5615
Feature activation-5.19e-02

Pos logit contributions
 valleys+0.009
 rivers+0.007
 free+0.007
 Pat+0.007
 behold+0.007
Neg logit contributions
 excuse-0.010
burn-0.009
 fault-0.008
resp-0.008
arre-0.008
Token in
Token ID287
Feature activation+2.17e-02

Pos logit contributions
 valleys+0.097
 rivers+0.083
 Pat+0.082
 free+0.082
 behold+0.080
Neg logit contributions
 excuse-0.077
burn-0.059
 fault-0.057
resp-0.057
 haircut-0.055
Token knew
Token ID2993
Feature activation+3.79e-03

Pos logit contributions
 excuse+0.028
burn+0.022
 fault+0.022
arre+0.021
resp+0.021
Neg logit contributions
 valleys-0.028
 rivers-0.024
 Pat-0.022
 free-0.022
 streams-0.022
Token it
Token ID340
Feature activation+2.36e-02

Pos logit contributions
 excuse+0.007
burn+0.006
 fault+0.006
resp+0.006
 haircut+0.006
Neg logit contributions
 valleys-0.006
 rivers-0.005
 free-0.004
 Pat-0.004
 behold-0.004
Token was
Token ID373
Feature activation+6.38e-03

Pos logit contributions
 excuse+0.034
burn+0.026
 fault+0.026
resp+0.025
arre+0.025
Neg logit contributions
 valleys-0.045
 rivers-0.039
 Pat-0.038
 free-0.037
 behold-0.037
Token his
Token ID465
Feature activation+3.10e-02

Pos logit contributions
 excuse+0.010
burn+0.007
resp+0.007
 fault+0.007
 haircut+0.007
Neg logit contributions
 valleys-0.012
 rivers-0.010
 Pat-0.010
 free-0.010
 behold-0.010
Token own
Token ID898
Feature activation+2.09e-02

Pos logit contributions
 excuse+0.053
ins+0.047
burn+0.046
resp+0.045
arre+0.045
Neg logit contributions
 valleys-0.045
 amaz-0.043
 free-0.042
 Pat-0.041
 content-0.040
Token fault
Token ID8046
Feature activation+1.05e-01

Pos logit contributions
 excuse+0.036
resp+0.030
burn+0.030
ins+0.029
arre+0.029
Neg logit contributions
 valleys-0.032
 free-0.029
 amaz-0.028
 Pat-0.028
 rivers-0.028
Token.
Token ID13
Feature activation+6.65e-03

Pos logit contributions
 excuse+0.203
burn+0.168
resp+0.166
arre+0.165
ins+0.163
Neg logit contributions
 valleys-0.138
 Pat-0.123
 free-0.121
 rivers-0.114
 itself-0.112
Token He
Token ID679
Feature activation+8.89e-03

Pos logit contributions
 excuse+0.011
arre+0.009
 fault+0.009
EEP+0.008
reat+0.008
Neg logit contributions
 valleys-0.011
 behold-0.009
 rivers-0.009
stack-0.009
 free-0.009
Token would
Token ID561
Feature activation+2.55e-02

Pos logit contributions
 excuse+0.017
burn+0.014
 fault+0.013
resp+0.013
 haircut+0.013
Neg logit contributions
 valleys-0.013
 rivers-0.011
 Pat-0.011
 free-0.011
 behold-0.010
Token have
Token ID423
Feature activation-1.18e-02

Pos logit contributions
 excuse+0.045
ins+0.037
resp+0.037
burn+0.037
 fault+0.035
Neg logit contributions
 valleys-0.039
 free-0.035
 Pat-0.034
 behold-0.033
 amaz-0.033
Token to
Token ID284
Feature activation-6.65e-03

Pos logit contributions
 valleys+0.019
 Pat+0.016
 free+0.016
 rivers+0.016
 amaz+0.015
Neg logit contributions
 excuse-0.021
burn-0.017
resp-0.017
ins-0.016
arre-0.015
Token,
Token ID11
Feature activation-6.22e-03

Pos logit contributions
 valleys+0.025
 rivers+0.021
 free+0.020
 behold+0.020
 streams+0.019
Neg logit contributions
 excuse-0.028
 fault-0.022
burn-0.022
resp-0.022
 haircut-0.021
Token there
Token ID612
Feature activation-4.91e-03

Pos logit contributions
 valleys+0.009
 rivers+0.007
 Pat+0.007
 free+0.007
 behold+0.007
Neg logit contributions
 excuse-0.012
burn-0.010
 fault-0.010
resp-0.010
 haircut-0.010
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
 valleys+0.011
 rivers+0.010
tail+0.010
 free+0.010
 streams+0.009
Neg logit contributions
 excuse-0.005
 fault-0.003
burn-0.003
 haircut-0.003
que-0.003
Token a
Token ID257
Feature activation-5.80e-03

Pos logit contributions
 valleys+0.033
 rivers+0.027
 Pat+0.027
 free+0.026
 behold+0.026
Neg logit contributions
 excuse-0.044
burn-0.036
 fault-0.035
resp-0.035
arre-0.034
Token sad
Token ID6507
Feature activation+1.44e-03

Pos logit contributions
 valleys+0.012
 rivers+0.010
 Pat+0.010
 free+0.010
 behold+0.010
Neg logit contributions
 excuse-0.008
burn-0.006
resp-0.005
 haircut-0.005
 fault-0.005
Token rad
Token ID2511
Feature activation+1.05e-01

Pos logit contributions
 excuse+0.002
burn+0.002
have+0.002
ins+0.002
 fault+0.002
Neg logit contributions
 valleys-0.002
 Pat-0.002
 rivers-0.002
 free-0.002
 low-0.002
Tokenish
Token ID680
Feature activation+4.59e-02

Pos logit contributions
 excuse+0.132
burn+0.095
 fault+0.093
resp+0.093
 haircut+0.090
Neg logit contributions
 valleys-0.218
 rivers-0.191
 Pat-0.189
 free-0.188
 streams-0.182
Token that
Token ID326
Feature activation-2.27e-02

Pos logit contributions
 excuse+0.101
 fault+0.090
arre+0.089
burn+0.085
ins+0.082
Neg logit contributions
 valleys-0.049
 Pat-0.041
 free-0.041
 rivers-0.040
stack-0.040
Token lived
Token ID5615
Feature activation-4.28e-02

Pos logit contributions
 valleys+0.036
 rivers+0.030
 Pat+0.029
 amaz+0.029
tail+0.029
Neg logit contributions
 excuse-0.039
burn-0.030
have-0.030
 fault-0.029
 haircut-0.029
Token in
Token ID287
Feature activation+2.48e-02

Pos logit contributions
 valleys+0.070
 free+0.062
 rivers+0.059
 low+0.057
 Pat+0.057
Neg logit contributions
 excuse-0.075
burn-0.060
 fault-0.059
resp-0.059
 haircut-0.058
Token a
Token ID257
Feature activation-5.19e-04

Pos logit contributions
 excuse+0.030
 dessert+0.024
 consequences+0.024
 haircut+0.024
burn+0.023
Neg logit contributions
 valleys-0.044
stack-0.038
 delighted-0.038
 rivers-0.038
 behold-0.037
Token her
Token ID607
Feature activation-1.74e-02

Pos logit contributions
 excuse+0.030
burn+0.025
 fault+0.024
arre+0.024
resp+0.024
Neg logit contributions
 valleys-0.022
 Pat-0.019
 rivers-0.019
 free-0.019
 behold-0.018
Token mom
Token ID1995
Feature activation-8.63e-03

Pos logit contributions
 valleys+0.024
 free+0.021
 amaz+0.021
 Pat+0.020
 rivers+0.020
Neg logit contributions
 excuse-0.033
resp-0.027
arre-0.027
burn-0.027
ins-0.026
Token and
Token ID290
Feature activation+7.76e-03

Pos logit contributions
 valleys+0.015
 free+0.014
 rivers+0.013
 low+0.013
 Pat+0.012
Neg logit contributions
 excuse-0.014
resp-0.011
 fault-0.011
burn-0.010
 consequences-0.010
Token share
Token ID2648
Feature activation-2.25e-02

Pos logit contributions
 excuse+0.014
burn+0.011
resp+0.011
 fault+0.011
arre+0.011
Neg logit contributions
 valleys-0.012
 Pat-0.010
 free-0.010
 rivers-0.010
 amaz-0.010
Token her
Token ID607
Feature activation-1.12e-02

Pos logit contributions
 valleys+0.029
 free+0.024
 rivers+0.024
 Pat+0.023
 behold+0.022
Neg logit contributions
 excuse-0.046
burn-0.038
 fault-0.038
resp-0.037
arre-0.037
Token rad
Token ID2511
Feature activation+1.04e-01

Pos logit contributions
 valleys+0.017
 free+0.017
 rivers+0.015
 content+0.015
 amaz+0.015
Neg logit contributions
 excuse-0.019
burn-0.016
EEP-0.016
arre-0.016
ud-0.016
Tokenishes
Token ID5614
Feature activation+1.81e-02

Pos logit contributions
 excuse+0.138
burn+0.102
 fault+0.098
resp+0.098
 haircut+0.095
Neg logit contributions
 valleys-0.212
 rivers-0.185
 free-0.181
 Pat-0.181
 behold-0.176
Token.
Token ID13
Feature activation+5.61e-03

Pos logit contributions
 excuse+0.030
burn+0.024
 fault+0.023
resp+0.023
arre+0.022
Neg logit contributions
 valleys-0.030
 rivers-0.026
 free-0.026
 Pat-0.025
 behold-0.025
Token She
Token ID1375
Feature activation-1.40e-02

Pos logit contributions
 excuse+0.008
 fault+0.007
burn+0.007
EEP+0.006
 haircut+0.006
Neg logit contributions
 valleys-0.010
 behold-0.009
 rivers-0.009
 free-0.008
 rocky-0.008
Token liked
Token ID8288
Feature activation-1.12e-02

Pos logit contributions
 valleys+0.018
 free+0.016
 delighted+0.016
 Pat+0.016
 behold+0.015
Neg logit contributions
 excuse-0.027
burn-0.024
 fault-0.022
reat-0.022
ruck-0.022
Token to
Token ID284
Feature activation-4.68e-03

Pos logit contributions
 valleys+0.017
 rivers+0.015
 free+0.014
 Pat+0.014
 forests+0.013
Neg logit contributions
 excuse-0.021
arre-0.017
burn-0.017
EEP-0.016
resp-0.016
Token
Token ID198
Feature activation-4.26e-03

Pos logit contributions
 valleys+0.005
 amaz+0.005
 rivers+0.004
tail+0.004
 rocky+0.004
Neg logit contributions
 excuse-0.004
oral-0.003
have-0.003
arre-0.003
reat-0.003
TokenLou
Token ID24016
Feature activation+3.45e-02

Pos logit contributions
 valleys+0.007
 rivers+0.006
 free+0.006
 amaz+0.006
 rocky+0.006
Neg logit contributions
 excuse-0.007
resp-0.005
arre-0.005
have-0.005
oral-0.005
Token put
Token ID1234
Feature activation-6.00e-03

Pos logit contributions
 excuse+0.053
burn+0.040
 fault+0.038
resp+0.038
arre+0.038
Neg logit contributions
 valleys-0.062
 rivers-0.053
 Pat-0.052
 free-0.052
 amaz-0.051
Token the
Token ID262
Feature activation-2.21e-02

Pos logit contributions
 valleys+0.009
 rivers+0.007
 dense+0.007
 streams+0.007
 forests+0.007
Neg logit contributions
 excuse-0.010
 haircut-0.009
 scissors-0.008
burn-0.008
 cream-0.008
Token add
Token ID751
Feature activation+8.00e-03

Pos logit contributions
 valleys+0.035
 free+0.032
 amaz+0.031
 rivers+0.030
 behold+0.030
Neg logit contributions
 excuse-0.036
burn-0.031
arre-0.031
resp-0.030
ins-0.030
Token-
Token ID12
Feature activation-1.08e-01

Pos logit contributions
 excuse+0.014
burn+0.011
arre+0.011
resp+0.011
 fault+0.011
Neg logit contributions
 valleys-0.013
 free-0.012
 rivers-0.011
 Pat-0.011
 behold-0.011
Tokenon
Token ID261
Feature activation-1.77e-03

Pos logit contributions
 valleys+0.201
 rivers+0.173
 free+0.171
 Pat+0.170
 behold+0.168
Neg logit contributions
 excuse-0.163
burn-0.124
 fault-0.123
resp-0.118
arre-0.117
Token on
Token ID319
Feature activation-1.63e-02

Pos logit contributions
 valleys+0.003
 rivers+0.003
 free+0.003
 Pat+0.003
 behold+0.003
Neg logit contributions
 excuse-0.003
burn-0.002
resp-0.002
 fault-0.002
arre-0.002
Token the
Token ID262
Feature activation-5.08e-03

Pos logit contributions
 valleys+0.024
 rivers+0.019
 behold+0.019
 streams+0.018
tail+0.018
Neg logit contributions
 excuse-0.029
 fault-0.023
 haircut-0.023
burn-0.023
resp-0.022
Token toy
Token ID13373
Feature activation+8.38e-03

Pos logit contributions
 valleys+0.008
 free+0.007
 rivers+0.007
 amaz+0.006
 behold+0.006
Neg logit contributions
 excuse-0.009
burn-0.008
arre-0.007
ins-0.007
resp-0.007
Token and
Token ID290
Feature activation-4.27e-03

Pos logit contributions
 excuse+0.013
burn+0.010
 fault+0.010
resp+0.010
arre+0.009
Neg logit contributions
 valleys-0.015
 rivers-0.013
 Pat-0.013
 free-0.013
 behold-0.012
Token Max
Token ID5436
Feature activation+9.46e-05

Pos logit contributions
 excuse+0.043
burn+0.034
ruck+0.033
 twin+0.032
 friend+0.032
Neg logit contributions
 valleys-0.043
 majestic-0.034
 amaz-0.034
 dense-0.034
 hollow-0.034
Token played
Token ID2826
Feature activation+1.87e-02

Pos logit contributions
 excuse+0.000
 fault+0.000
burn+0.000
resp+0.000
ins+0.000
Neg logit contributions
 valleys-0.000
 rivers-0.000
 free-0.000
 Pat-0.000
 behold-0.000
Token with
Token ID351
Feature activation-2.32e-02

Pos logit contributions
 excuse+0.031
burn+0.025
 fault+0.024
resp+0.024
arre+0.023
Neg logit contributions
 valleys-0.032
 rivers-0.027
 Pat-0.026
 free-0.026
 behold-0.026
Token the
Token ID262
Feature activation+9.15e-04

Pos logit contributions
 valleys+0.032
tail+0.025
 behold+0.025
 rivers+0.025
stack+0.025
Neg logit contributions
 excuse-0.045
 fault-0.038
burn-0.037
 twin-0.036
ins-0.036
Token mixed
Token ID7668
Feature activation+2.11e-02

Pos logit contributions
resp+0.001
 excuse+0.001
burn+0.001
EEP+0.001
reat+0.001
Neg logit contributions
 valleys-0.001
 free-0.001
 low-0.001
 tall-0.001
 wide-0.001
Token-
Token ID12
Feature activation-1.08e-01

Pos logit contributions
 excuse+0.033
burn+0.027
resp+0.027
 fault+0.027
EEP+0.025
Neg logit contributions
 valleys-0.038
 rivers-0.033
 free-0.033
 streams-0.031
 low-0.031
Tokenup
Token ID929
Feature activation-1.22e-02

Pos logit contributions
 valleys+0.196
 free+0.172
 rivers+0.171
 streams+0.161
 behold+0.160
Neg logit contributions
 excuse-0.167
 fault-0.135
burn-0.122
resp-0.120
 haircut-0.116
Token toys
Token ID14958
Feature activation-2.17e-03

Pos logit contributions
 valleys+0.020
 rivers+0.017
 free+0.017
 streams+0.016
 amaz+0.016
Neg logit contributions
 excuse-0.021
burn-0.018
resp-0.016
have-0.016
 fault-0.016
Token.
Token ID13
Feature activation+3.49e-03

Pos logit contributions
 valleys+0.004
 rivers+0.003
 Pat+0.003
 behold+0.003
 amaz+0.003
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
resp-0.003
arre-0.003
Token They
Token ID1119
Feature activation-4.25e-03

Pos logit contributions
 excuse+0.005
 fault+0.004
arre+0.004
burn+0.004
resp+0.004
Neg logit contributions
 valleys-0.006
 behold-0.005
 rivers-0.005
 amaz-0.005
 free-0.005
Token had
Token ID550
Feature activation+5.34e-03

Pos logit contributions
 valleys+0.008
 rivers+0.006
 Pat+0.006
 amaz+0.006
tail+0.006
Neg logit contributions
 excuse-0.007
burn-0.005
arre-0.005
have-0.005
 fault-0.005
Token whenever
Token ID8797
Feature activation-2.88e-03

Pos logit contributions
 excuse+0.001
burn+0.001
arre+0.001
 fault+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 amaz-0.001
Token Polly
Token ID36898
Feature activation-5.01e-03

Pos logit contributions
 valleys+0.004
 rivers+0.003
 free+0.003
 Pat+0.003
 amaz+0.003
Neg logit contributions
 excuse-0.006
 haircut-0.004
burn-0.004
 fault-0.004
arre-0.004
Token needed
Token ID2622
Feature activation+2.72e-03

Pos logit contributions
 valleys+0.008
 rivers+0.006
 Pat+0.006
 free+0.006
 behold+0.006
Neg logit contributions
 excuse-0.009
burn-0.007
 fault-0.007
resp-0.007
ruck-0.007
Token a
Token ID257
Feature activation-1.75e-02

Pos logit contributions
 excuse+0.005
burn+0.004
 fault+0.004
resp+0.004
arre+0.004
Neg logit contributions
 valleys-0.004
 rivers-0.003
 Pat-0.003
 free-0.003
 behold-0.003
Token pick
Token ID2298
Feature activation+5.37e-03

Pos logit contributions
 valleys+0.031
 rivers+0.027
 free+0.027
 Pat+0.027
 amaz+0.026
Neg logit contributions
 excuse-0.027
burn-0.021
resp-0.020
 fault-0.020
ins-0.020
Token-
Token ID12
Feature activation-1.06e-01

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.007
resp+0.007
arre+0.006
Neg logit contributions
 valleys-0.009
 rivers-0.008
 Pat-0.008
 free-0.008
 behold-0.008
Tokenme
Token ID1326
Feature activation-3.96e-02

Pos logit contributions
 valleys+0.230
 rivers+0.203
 Pat+0.201
 free+0.200
 behold+0.197
Neg logit contributions
 excuse-0.125
burn-0.088
 fault-0.085
resp-0.083
arre-0.080
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.068
 free+0.060
 rivers+0.060
 Pat+0.060
 amaz+0.059
Neg logit contributions
 excuse-0.062
burn-0.048
resp-0.046
 fault-0.046
 haircut-0.044
Tokenup
Token ID929
Feature activation+2.75e-04

Pos logit contributions
 valleys+0.233
 rivers+0.206
 Pat+0.204
 free+0.203
 behold+0.200
Neg logit contributions
 excuse-0.118
burn-0.082
 fault-0.079
resp-0.077
arre-0.074
Token,
Token ID11
Feature activation-2.18e-03

Pos logit contributions
 excuse+0.000
burn+0.000
 fault+0.000
resp+0.000
arre+0.000
Neg logit contributions
 valleys-0.000
 free-0.000
 Pat-0.000
 rivers-0.000
 amaz-0.000
Token she
Token ID673
Feature activation-8.47e-03

Pos logit contributions
 valleys+0.003
 rivers+0.002
 free+0.002
 Pat+0.002
 behold+0.002
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
 haircut-0.003
resp-0.003
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
 valleys+0.015
 free+0.013
 rivers+0.013
 Pat+0.013
 amaz+0.012
Neg logit contributions
 excuse-0.013
burn-0.009
resp-0.009
 fault-0.009
arre-0.009
Token wear
Token ID5806
Feature activation-9.71e-03

Pos logit contributions
 valleys+0.022
 rivers+0.018
 rocky+0.018
 dense+0.018
 streams+0.017
Neg logit contributions
 excuse-0.019
 haircut-0.015
burn-0.014
reat-0.013
ruck-0.013
Token and
Token ID290
Feature activation-1.57e-02

Pos logit contributions
 valleys+0.018
 Pat+0.015
 rivers+0.015
 behold+0.014
 amaz+0.014
Neg logit contributions
 excuse-0.014
burn-0.011
 haircut-0.011
 fault-0.011
resp-0.011
Token put
Token ID1234
Feature activation-1.44e-02

Pos logit contributions
 valleys+0.027
 rivers+0.023
 Pat+0.022
 free+0.021
 behold+0.021
Neg logit contributions
 excuse-0.025
burn-0.020
 haircut-0.020
 fault-0.019
resp-0.019
Token make
Token ID787
Feature activation-3.25e-03

Pos logit contributions
 valleys+0.028
 amaz+0.024
 dense+0.024
 Pat+0.024
 rivers+0.024
Neg logit contributions
 excuse-0.018
burn-0.015
 haircut-0.014
ins-0.013
have-0.013
Token-
Token ID12
Feature activation-1.06e-01

Pos logit contributions
 valleys+0.005
 rivers+0.004
 Pat+0.004
 rocky+0.004
 behold+0.004
Neg logit contributions
 excuse-0.006
burn-0.005
 fault-0.005
 haircut-0.004
ins-0.004
Tokenup
Token ID929
Feature activation-6.01e-04

Pos logit contributions
 valleys+0.202
 rivers+0.174
 Pat+0.171
 free+0.170
 behold+0.167
Neg logit contributions
 excuse-0.153
burn-0.117
 fault-0.113
resp-0.111
arre-0.108
Token on
Token ID319
Feature activation-1.62e-02

Pos logit contributions
 valleys+0.001
 rivers+0.001
 free+0.001
 Pat+0.001
 amaz+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
resp-0.001
arre-0.001
Token her
Token ID607
Feature activation-2.05e-02

Pos logit contributions
 valleys+0.023
 Pat+0.018
 rivers+0.018
 behold+0.018
stack+0.017
Neg logit contributions
 excuse-0.030
 fault-0.025
burn-0.025
 haircut-0.025
ruck-0.023
Token face
Token ID1986
Feature activation-2.30e-02

Pos logit contributions
 valleys+0.036
 Pat+0.030
 rivers+0.030
 behold+0.029
 streams+0.028
Neg logit contributions
 excuse-0.033
 haircut-0.026
 fault-0.025
burn-0.025
resp-0.024
Token.
Token ID13
Feature activation-2.07e-03

Pos logit contributions
 valleys+0.041
 rivers+0.035
 Pat+0.034
 behold+0.033
 free+0.033
Neg logit contributions
 excuse-0.035
burn-0.029
 haircut-0.027
 fault-0.027
resp-0.026
Token remember
Token ID3505
Feature activation-1.90e-02

Pos logit contributions
 valleys+0.028
 rivers+0.024
 Pat+0.024
 free+0.024
 behold+0.023
Neg logit contributions
 excuse-0.024
burn-0.020
 fault-0.019
resp-0.019
arre-0.019
Token to
Token ID284
Feature activation+3.73e-03

Pos logit contributions
 valleys+0.027
 Pat+0.025
 rivers+0.024
 free+0.023
 Lou+0.022
Neg logit contributions
 excuse-0.035
burn-0.029
arre-0.029
resp-0.028
ins-0.028
Token release
Token ID2650
Feature activation-1.13e-02

Pos logit contributions
 excuse+0.005
burn+0.004
 fault+0.004
resp+0.004
arre+0.004
Neg logit contributions
 valleys-0.007
 rivers-0.006
 Pat-0.006
 free-0.006
 behold-0.006
Token the
Token ID262
Feature activation-1.58e-02

Pos logit contributions
 valleys+0.018
 rivers+0.015
 Pat+0.014
 free+0.014
 behold+0.014
Neg logit contributions
 excuse-0.020
burn-0.016
 fault-0.016
resp-0.016
 haircut-0.015
Token Wi
Token ID11759
Feature activation-6.09e-02

Pos logit contributions
 valleys+0.026
 rivers+0.023
 free+0.022
 winding+0.021
 forests+0.021
Neg logit contributions
 excuse-0.026
burn-0.022
resp-0.022
 fault-0.021
ins-0.021
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.113
 rivers+0.099
 Pat+0.097
 free+0.097
 streams+0.093
Neg logit contributions
 excuse-0.088
burn-0.069
 fault-0.067
arre-0.063
resp-0.062
TokenFi
Token ID10547
Feature activation-6.25e-02

Pos logit contributions
 valleys+0.189
 rivers+0.164
 Pat+0.164
 free+0.163
tail+0.158
Neg logit contributions
 excuse-0.159
 fault-0.128
burn-0.121
arre-0.115
resp-0.115
Token when
Token ID618
Feature activation-1.65e-02

Pos logit contributions
 valleys+0.107
 free+0.092
 rivers+0.092
 Pat+0.092
 Lou+0.087
Neg logit contributions
 excuse-0.100
burn-0.080
 fault-0.079
arre-0.075
resp-0.075
Token you
Token ID345
Feature activation-6.43e-03

Pos logit contributions
 valleys+0.027
 rivers+0.023
 Pat+0.023
 free+0.022
 behold+0.022
Neg logit contributions
 excuse-0.028
burn-0.022
 fault-0.022
resp-0.021
arre-0.021
Token're
Token ID821
Feature activation+2.60e-02

Pos logit contributions
 valleys+0.012
 rivers+0.010
 Pat+0.010
 free+0.010
 behold+0.010
Neg logit contributions
 excuse-0.010
burn-0.008
 fault-0.007
resp-0.007
 haircut-0.007
Token done
Token ID1760
Feature activation+9.91e-03

Pos logit contributions
 excuse+0.047
burn+0.038
resp+0.038
 fault+0.037
arre+0.036
Neg logit contributions
 valleys-0.040
 free-0.034
 rivers-0.033
 Pat-0.033
 behold-0.031
Token push
Token ID4574
Feature activation+7.18e-03

Pos logit contributions
 excuse+0.066
burn+0.051
 fault+0.050
resp+0.049
 haircut+0.049
Neg logit contributions
 valleys-0.070
 rivers-0.059
 Pat-0.059
 free-0.058
 behold-0.057
Token-
Token ID12
Feature activation-8.30e-02

Pos logit contributions
 excuse+0.011
burn+0.009
 fault+0.009
arre+0.008
resp+0.008
Neg logit contributions
 valleys-0.012
 rivers-0.011
 Pat-0.010
 behold-0.010
 free-0.010
Tokenups
Token ID4739
Feature activation-8.09e-03

Pos logit contributions
 valleys+0.164
 rivers+0.142
 free+0.140
 Pat+0.139
 streams+0.135
Neg logit contributions
 excuse-0.118
burn-0.088
 fault-0.086
resp-0.084
 haircut-0.081
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
 valleys+0.014
 rivers+0.012
 free+0.011
 Pat+0.011
 behold+0.011
Neg logit contributions
 excuse-0.013
burn-0.011
resp-0.010
 fault-0.010
 haircut-0.010
Token pull
Token ID2834
Feature activation-3.00e-03

Pos logit contributions
 valleys+0.038
 rivers+0.032
 Pat+0.032
 behold+0.031
 free+0.031
Neg logit contributions
 excuse-0.030
burn-0.023
 haircut-0.023
 fault-0.022
arre-0.021
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.005
 rivers+0.004
 Pat+0.004
 behold+0.004
 free+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.004
resp-0.004
arre-0.004
Tokenups
Token ID4739
Feature activation-2.14e-02

Pos logit contributions
 valleys+0.207
 rivers+0.180
 free+0.176
 Pat+0.175
 streams+0.171
Neg logit contributions
 excuse-0.150
burn-0.110
 fault-0.109
resp-0.108
 haircut-0.103
Token.
Token ID13
Feature activation-7.83e-03

Pos logit contributions
 valleys+0.035
 rivers+0.029
 Pat+0.029
 free+0.029
 behold+0.028
Neg logit contributions
 excuse-0.037
burn-0.029
 fault-0.029
resp-0.028
 haircut-0.028
Token He
Token ID679
Feature activation+5.32e-04

Pos logit contributions
 valleys+0.010
 rivers+0.008
 free+0.008
 behold+0.007
 amaz+0.007
Neg logit contributions
 excuse-0.016
 fault-0.013
burn-0.013
arre-0.013
resp-0.013
Token worked
Token ID3111
Feature activation+1.45e-02

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 behold-0.001
Token really
Token ID1107
Feature activation+5.61e-03

Pos logit contributions
 excuse+0.028
burn+0.023
 fault+0.022
resp+0.022
 haircut+0.022
Neg logit contributions
 valleys-0.021
 rivers-0.017
 Pat-0.017
 free-0.017
 behold-0.016
Token needed
Token ID2622
Feature activation+2.72e-03

Pos logit contributions
 valleys+0.008
 rivers+0.006
 Pat+0.006
 free+0.006
 behold+0.006
Neg logit contributions
 excuse-0.009
burn-0.007
 fault-0.007
resp-0.007
ruck-0.007
Token a
Token ID257
Feature activation-1.75e-02

Pos logit contributions
 excuse+0.005
burn+0.004
 fault+0.004
resp+0.004
arre+0.004
Neg logit contributions
 valleys-0.004
 rivers-0.003
 Pat-0.003
 free-0.003
 behold-0.003
Token pick
Token ID2298
Feature activation+5.37e-03

Pos logit contributions
 valleys+0.031
 rivers+0.027
 free+0.027
 Pat+0.027
 amaz+0.026
Neg logit contributions
 excuse-0.027
burn-0.021
resp-0.020
 fault-0.020
ins-0.020
Token-
Token ID12
Feature activation-1.06e-01

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.007
resp+0.007
arre+0.006
Neg logit contributions
 valleys-0.009
 rivers-0.008
 Pat-0.008
 free-0.008
 behold-0.008
Tokenme
Token ID1326
Feature activation-3.96e-02

Pos logit contributions
 valleys+0.230
 rivers+0.203
 Pat+0.201
 free+0.200
 behold+0.197
Neg logit contributions
 excuse-0.125
burn-0.088
 fault-0.085
resp-0.083
arre-0.080
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.068
 free+0.060
 rivers+0.060
 Pat+0.060
 amaz+0.059
Neg logit contributions
 excuse-0.062
burn-0.048
resp-0.046
 fault-0.046
 haircut-0.044
Tokenup
Token ID929
Feature activation+2.75e-04

Pos logit contributions
 valleys+0.233
 rivers+0.206
 Pat+0.204
 free+0.203
 behold+0.200
Neg logit contributions
 excuse-0.118
burn-0.082
 fault-0.079
resp-0.077
arre-0.074
Token,
Token ID11
Feature activation-2.18e-03

Pos logit contributions
 excuse+0.000
burn+0.000
 fault+0.000
resp+0.000
arre+0.000
Neg logit contributions
 valleys-0.000
 free-0.000
 Pat-0.000
 rivers-0.000
 amaz-0.000
Token she
Token ID673
Feature activation-8.47e-03

Pos logit contributions
 valleys+0.003
 rivers+0.002
 free+0.002
 Pat+0.002
 behold+0.002
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
 haircut-0.003
resp-0.003
Token'd
Token ID1549
Feature activation+3.59e-02

Pos logit contributions
 valleys+0.013
 Pat+0.011
 free+0.011
 rivers+0.011
 delighted+0.011
Neg logit contributions
 excuse-0.015
burn-0.012
resp-0.012
 fault-0.012
reat-0.011
Token read
Token ID1100
Feature activation-3.12e-02

Pos logit contributions
 excuse+0.062
burn+0.051
resp+0.050
ins+0.049
arre+0.047
Neg logit contributions
 valleys-0.057
 free-0.050
 rivers-0.048
 amaz-0.048
 behold-0.048
Token.
Token ID13
Feature activation-1.72e-03

Pos logit contributions
 excuse+0.050
resp+0.041
burn+0.041
 fault+0.040
arre+0.039
Neg logit contributions
 valleys-0.038
 Pat-0.032
 rivers-0.032
 free-0.031
tail-0.031
Token He
Token ID679
Feature activation-7.08e-03

Pos logit contributions
 valleys+0.002
 rivers+0.002
 Pat+0.002
 free+0.002
 behold+0.002
Neg logit contributions
 excuse-0.003
burn-0.003
 fault-0.003
resp-0.003
 haircut-0.003
Token loved
Token ID6151
Feature activation-6.59e-03

Pos logit contributions
 valleys+0.010
 Pat+0.008
 rivers+0.008
 behold+0.008
 free+0.008
Neg logit contributions
 excuse-0.014
burn-0.012
resp-0.011
 fault-0.011
ins-0.011
Token playing
Token ID2712
Feature activation+9.51e-03

Pos logit contributions
 valleys+0.012
 rivers+0.011
 free+0.010
 Pat+0.010
 forests+0.010
Neg logit contributions
 excuse-0.011
burn-0.008
resp-0.008
arre-0.008
 fault-0.008
Token make
Token ID787
Feature activation-4.01e-03

Pos logit contributions
 excuse+0.016
burn+0.013
 fault+0.012
resp+0.012
arre+0.011
Neg logit contributions
 valleys-0.016
 rivers-0.014
 free-0.014
 Pat-0.014
 behold-0.013
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.006
 rivers+0.006
 free+0.005
 Pat+0.005
 streams+0.005
Neg logit contributions
 excuse-0.007
burn-0.006
 fault-0.006
resp-0.006
 haircut-0.005
Tokenbel
Token ID6667
Feature activation+3.25e-03

Pos logit contributions
 valleys+0.162
 rivers+0.136
 free+0.135
 behold+0.131
 Pat+0.129
Neg logit contributions
 excuse-0.190
 fault-0.156
burn-0.150
resp-0.149
 haircut-0.142
Tokenieve
Token ID12311
Feature activation-2.04e-02

Pos logit contributions
 excuse+0.007
burn+0.006
 fault+0.006
resp+0.006
 haircut+0.005
Neg logit contributions
 valleys-0.004
 free-0.003
 rivers-0.003
 behold-0.003
 Pat-0.003
Token with
Token ID351
Feature activation-3.09e-02

Pos logit contributions
 valleys+0.034
 rivers+0.029
 free+0.028
 Pat+0.028
 forests+0.027
Neg logit contributions
 excuse-0.035
burn-0.030
 fault-0.028
resp-0.027
arre-0.027
Token his
Token ID465
Feature activation-5.15e-03

Pos logit contributions
 valleys+0.048
 rivers+0.040
 Pat+0.039
 free+0.039
 behold+0.038
Neg logit contributions
 excuse-0.055
burn-0.045
 fault-0.044
resp-0.043
 haircut-0.042
Token friends
Token ID2460
Feature activation-6.55e-03

Pos logit contributions
 valleys+0.010
 rivers+0.008
 Pat+0.008
 free+0.008
 amaz+0.008
Neg logit contributions
 excuse-0.008
burn-0.006
resp-0.006
arre-0.006
 fault-0.006
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
 excuse+0.041
burn+0.037
arre+0.036
resp+0.035
 fault+0.035
Neg logit contributions
 valleys-0.027
 free-0.025
 Pat-0.023
tail-0.022
 delighted-0.022
Token You
Token ID921
Feature activation+7.52e-03

Pos logit contributions
 excuse+0.022
burn+0.019
 haircut+0.017
 fault+0.017
arre+0.017
Neg logit contributions
 valleys-0.023
 Pat-0.020
 rivers-0.019
 free-0.018
 Lou-0.018
Token cannot
Token ID2314
Feature activation-5.52e-04

Pos logit contributions
 excuse+0.015
burn+0.014
ins+0.013
 haircut+0.013
 fault+0.012
Neg logit contributions
 free-0.009
 valleys-0.008
 Pat-0.007
 trapped-0.007
 -0.007
Token steal
Token ID8711
Feature activation+6.50e-03

Pos logit contributions
 free+0.001
tail+0.001
 +0.001
 Pat+0.001
 behold+0.001
Neg logit contributions
 excuse-0.001
ins-0.001
 fault-0.001
arre-0.001
resp-0.001
Token ice
Token ID4771
Feature activation-1.39e-02

Pos logit contributions
 excuse+0.012
burn+0.010
arre+0.010
resp+0.010
 fault+0.010
Neg logit contributions
 valleys-0.010
 free-0.008
 rivers-0.008
 Pat-0.008
 behold-0.008
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
 valleys+0.025
 free+0.022
 rivers+0.022
tail+0.022
 rocky+0.021
Neg logit contributions
 excuse-0.019
arre-0.015
 fault-0.014
resp-0.014
have-0.014
Tokencream
Token ID36277
Feature activation-4.68e-03

Pos logit contributions
 valleys+0.204
 free+0.178
 rivers+0.178
 Pat+0.173
 behold+0.170
Neg logit contributions
 excuse-0.149
 fault-0.108
arre-0.103
burn-0.103
resp-0.101
Tokens
Token ID82
Feature activation-3.62e-02

Pos logit contributions
 valleys+0.007
 free+0.006
 rivers+0.006
tail+0.005
 rocky+0.005
Neg logit contributions
 excuse-0.008
 fault-0.007
burn-0.007
arre-0.007
resp-0.007
Token.
Token ID13
Feature activation+5.45e-03

Pos logit contributions
 valleys+0.037
 free+0.037
 below+0.032
 itself+0.032
 low+0.031
Neg logit contributions
 excuse-0.069
arre-0.062
burn-0.061
resp-0.061
 fault-0.059
Token You
Token ID921
Feature activation+3.46e-03

Pos logit contributions
 excuse+0.009
burn+0.008
 haircut+0.007
 fault+0.007
arre+0.007
Neg logit contributions
 valleys-0.009
 Pat-0.007
 rivers-0.007
tail-0.007
 free-0.007
Token have
Token ID423
Feature activation+5.52e-03

Pos logit contributions
 excuse+0.007
burn+0.006
ins+0.006
ruck+0.006
resp+0.006
Neg logit contributions
 free-0.004
 valleys-0.004
tail-0.004
 Pat-0.004
 -0.003
Tokenez
Token ID8471
Feature activation-1.36e-02

Pos logit contributions
 excuse+0.002
burn+0.002
 fault+0.002
arre+0.002
 haircut+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.001
 Pat-0.001
 free-0.001
 amaz-0.001
Tokening
Token ID278
Feature activation+2.15e-02

Pos logit contributions
 valleys+0.025
 Pat+0.022
 free+0.022
 rivers+0.022
tail+0.021
Neg logit contributions
 excuse-0.020
 fault-0.015
burn-0.014
resp-0.014
 haircut-0.014
Token!
Token ID0
Feature activation+1.89e-02

Pos logit contributions
 excuse+0.038
burn+0.030
 fault+0.029
resp+0.029
arre+0.029
Neg logit contributions
 valleys-0.034
 rivers-0.029
 free-0.029
 Pat-0.029
 behold-0.027
Token The
Token ID383
Feature activation+8.00e-03

Pos logit contributions
 excuse+0.033
arre+0.028
 fault+0.027
burn+0.027
 haircut+0.027
Neg logit contributions
 valleys-0.031
 rivers-0.026
 rocky-0.024
 streams-0.024
 amaz-0.023
Token sprinkle
Token ID45799
Feature activation-4.43e-03

Pos logit contributions
 excuse+0.012
 fault+0.011
resp+0.011
arre+0.010
burn+0.010
Neg logit contributions
 valleys-0.012
 free-0.012
 delighted-0.012
 low-0.011
 -0.011
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
 valleys+0.007
 free+0.006
 behold+0.006
 Pat+0.006
 low+0.006
Neg logit contributions
 excuse-0.008
 fault-0.007
resp-0.006
arre-0.006
burn-0.006
Tokencovered
Token ID32111
Feature activation-2.15e-02

Pos logit contributions
 valleys+0.185
 rivers+0.158
 free+0.157
 Pat+0.156
 behold+0.153
Neg logit contributions
 excuse-0.165
 fault-0.127
burn-0.124
resp-0.122
arre-0.118
Token pastry
Token ID41738
Feature activation-1.89e-02

Pos logit contributions
 valleys+0.035
 rivers+0.030
 free+0.029
 behold+0.029
stack+0.029
Neg logit contributions
 excuse-0.036
 fault-0.030
arre-0.029
resp-0.029
burn-0.029
Token flew
Token ID13112
Feature activation-4.85e-03

Pos logit contributions
 valleys+0.027
 free+0.024
 Pat+0.023
 low+0.022
 rivers+0.022
Neg logit contributions
 excuse-0.036
 fault-0.030
resp-0.030
ins-0.028
arre-0.028
Token out
Token ID503
Feature activation+1.25e-02

Pos logit contributions
 valleys+0.007
 rivers+0.006
 Pat+0.006
 free+0.006
 behold+0.006
Neg logit contributions
 excuse-0.009
burn-0.007
 fault-0.007
resp-0.007
arre-0.007
Token of
Token ID286
Feature activation+9.54e-03

Pos logit contributions
 excuse+0.019
 fault+0.015
burn+0.014
resp+0.014
have+0.014
Neg logit contributions
 valleys-0.024
 free-0.021
 Pat-0.021
 rivers-0.021
 amaz-0.020
Token how
Token ID703
Feature activation-1.27e-02

Pos logit contributions
 valleys+0.037
 Pat+0.035
 amaz+0.032
 free+0.031
 Lou+0.030
Neg logit contributions
 excuse-0.111
EEP-0.089
arre-0.089
resp-0.087
ud-0.086
Token to
Token ID284
Feature activation-1.91e-02

Pos logit contributions
 valleys+0.025
 free+0.022
 rivers+0.022
 Pat+0.022
 behold+0.021
Neg logit contributions
 excuse-0.017
burn-0.013
 fault-0.013
resp-0.012
arre-0.012
Token use
Token ID779
Feature activation+4.36e-03

Pos logit contributions
 valleys+0.033
 free+0.030
 behold+0.029
 Pat+0.029
 rivers+0.028
Neg logit contributions
 excuse-0.029
 fault-0.025
resp-0.023
burn-0.023
ins-0.023
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
 excuse+0.008
burn+0.006
 fault+0.006
 haircut+0.006
resp+0.006
Neg logit contributions
 valleys-0.007
 rivers-0.006
 Pat-0.006
 behold-0.006
 free-0.006
Token Wi
Token ID11759
Feature activation-5.91e-02

Pos logit contributions
 valleys+0.021
 low+0.021
 free+0.021
 winding+0.020
 tall+0.020
Neg logit contributions
 excuse-0.020
resp-0.020
ins-0.020
ud-0.019
burn-0.018
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 valleys+0.102
 rivers+0.089
 Pat+0.086
 free+0.085
 forests+0.084
Neg logit contributions
 excuse-0.091
burn-0.074
 fault-0.070
arre-0.068
resp-0.065
TokenFi
Token ID10547
Feature activation-5.89e-02

Pos logit contributions
 valleys+0.186
 Pat+0.166
 rivers+0.164
tail+0.161
 free+0.161
Neg logit contributions
 excuse-0.152
 fault-0.119
burn-0.113
arre-0.111
resp-0.107
Token network
Token ID3127
Feature activation-2.16e-02

Pos logit contributions
 valleys+0.102
 Pat+0.091
 rivers+0.090
 free+0.089
tail+0.086
Neg logit contributions
 excuse-0.092
burn-0.071
arre-0.068
ins-0.068
 fault-0.067
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
 valleys+0.044
 rivers+0.038
 Pat+0.038
 behold+0.037
 free+0.037
Neg logit contributions
 excuse-0.028
burn-0.020
 fault-0.020
resp-0.020
 haircut-0.019
Token watch
Token ID2342
Feature activation-1.91e-02

Pos logit contributions
 valleys+0.024
 delighted+0.022
 rivers+0.021
 amaz+0.021
 majestic+0.021
Neg logit contributions
 excuse-0.013
 haircut-0.009
burn-0.009
reat-0.008
have-0.007
Token more
Token ID517
Feature activation-3.00e-02

Pos logit contributions
 valleys+0.035
 rivers+0.029
 Pat+0.029
 behold+0.029
 amaz+0.029
Neg logit contributions
 excuse-0.029
burn-0.023
resp-0.021
 haircut-0.021
 fault-0.021
Tokenie
Token ID494
Feature activation+1.45e-02

Pos logit contributions
 excuse+0.028
resp+0.023
 fault+0.022
 haircut+0.021
burn+0.021
Neg logit contributions
 valleys-0.028
 behold-0.024
 rivers-0.024
 delighted-0.023
 free-0.023
Token loved
Token ID6151
Feature activation+6.18e-03

Pos logit contributions
 excuse+0.021
burn+0.016
 fault+0.016
arre+0.016
resp+0.015
Neg logit contributions
 valleys-0.027
 rivers-0.023
 Pat-0.022
 free-0.022
 amaz-0.022
Token to
Token ID284
Feature activation+7.89e-03

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.007
resp+0.007
arre+0.006
Neg logit contributions
 valleys-0.012
 rivers-0.010
 Pat-0.010
 free-0.010
 amaz-0.010
Token play
Token ID711
Feature activation+1.89e-03

Pos logit contributions
 excuse+0.013
 fault+0.011
burn+0.011
ins+0.010
resp+0.010
Neg logit contributions
 valleys-0.013
 free-0.011
 rivers-0.011
 Pat-0.011
 behold-0.011
Token make
Token ID787
Feature activation+5.11e-03

Pos logit contributions
 excuse+0.003
burn+0.002
resp+0.002
 fault+0.002
 haircut+0.002
Neg logit contributions
 valleys-0.003
 rivers-0.003
 Pat-0.003
 free-0.003
 amaz-0.003
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.007
resp+0.007
 haircut+0.006
Neg logit contributions
 valleys-0.008
 rivers-0.007
 free-0.007
 Pat-0.007
 streams-0.007
Tokenbel
Token ID6667
Feature activation+5.65e-03

Pos logit contributions
 valleys+0.188
 rivers+0.163
 free+0.162
 Pat+0.159
 behold+0.158
Neg logit contributions
 excuse-0.157
 fault-0.123
burn-0.119
resp-0.115
 haircut-0.112
Tokenieve
Token ID12311
Feature activation-1.65e-02

Pos logit contributions
 excuse+0.011
burn+0.010
 fault+0.009
resp+0.009
arre+0.009
Neg logit contributions
 valleys-0.007
 rivers-0.006
 free-0.006
 Pat-0.006
 behold-0.006
Token.
Token ID13
Feature activation-8.93e-03

Pos logit contributions
 valleys+0.029
 rivers+0.025
 Pat+0.024
 free+0.024
 behold+0.023
Neg logit contributions
 excuse-0.027
burn-0.021
 fault-0.021
resp-0.020
arre-0.020
Token Today
Token ID6288
Feature activation-2.09e-02

Pos logit contributions
 valleys+0.013
 free+0.011
 rivers+0.011
 Pat+0.011
 behold+0.011
Neg logit contributions
 excuse-0.016
 fault-0.013
burn-0.013
arre-0.012
 haircut-0.012
Token she
Token ID673
Feature activation-2.89e-02

Pos logit contributions
 valleys+0.022
 Pat+0.018
 free+0.017
 rivers+0.017
 behold+0.017
Neg logit contributions
 excuse-0.046
burn-0.042
resp-0.041
ins-0.040
arre-0.039
Token and
Token ID290
Feature activation+3.88e-03

Pos logit contributions
 valleys+0.006
 rivers+0.005
 Pat+0.005
 free+0.005
 behold+0.005
Neg logit contributions
 excuse-0.003
burn-0.002
 fault-0.002
resp-0.002
 haircut-0.002
Token Dad
Token ID17415
Feature activation+1.27e-02

Pos logit contributions
 excuse+0.006
burn+0.005
 haircut+0.005
 fault+0.005
resp+0.005
Neg logit contributions
 valleys-0.007
 rivers-0.006
 behold-0.005
 Pat-0.005
 free-0.005
Token to
Token ID284
Feature activation-1.86e-03

Pos logit contributions
 excuse+0.015
burn+0.011
 fault+0.010
 haircut+0.010
reat+0.009
Neg logit contributions
 valleys-0.027
 behold-0.024
 rivers-0.024
 Pat-0.024
 amaz-0.024
Token the
Token ID262
Feature activation+4.06e-03

Pos logit contributions
 valleys+0.002
 dense+0.002
 Pat+0.002
 delighted+0.002
 rivers+0.002
Neg logit contributions
 excuse-0.004
 haircut-0.003
burn-0.003
 fault-0.003
 boo-0.003
Token check
Token ID2198
Feature activation-3.54e-02

Pos logit contributions
 excuse+0.006
 haircut+0.005
burn+0.005
 fault+0.004
 consequences+0.004
Neg logit contributions
 valleys-0.007
 Pat-0.007
 behold-0.006
 Lou-0.006
 free-0.006
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 valleys+0.065
 rivers+0.056
 Pat+0.055
 free+0.055
 behold+0.053
Neg logit contributions
 excuse-0.054
burn-0.041
 fault-0.041
resp-0.040
arre-0.040
Tokenin
Token ID259
Feature activation-2.04e-02

Pos logit contributions
 valleys+0.210
 rivers+0.183
 Pat+0.179
 free+0.178
 behold+0.175
Neg logit contributions
 excuse-0.139
 fault-0.101
burn-0.100
resp-0.097
arre-0.094
Token counter
Token ID3753
Feature activation-3.48e-03

Pos logit contributions
 valleys+0.038
 Pat+0.033
 rivers+0.033
 behold+0.032
 free+0.032
Neg logit contributions
 excuse-0.029
burn-0.022
 haircut-0.021
resp-0.020
ins-0.020
Token.
Token ID13
Feature activation+2.21e-03

Pos logit contributions
 valleys+0.006
 rivers+0.005
 Pat+0.005
 free+0.005
 behold+0.005
Neg logit contributions
 excuse-0.006
burn-0.004
 fault-0.004
resp-0.004
 haircut-0.004
Token They
Token ID1119
Feature activation-6.00e-03

Pos logit contributions
 excuse+0.002
 fault+0.002
EEP+0.002
 cousin+0.001
arre+0.001
Neg logit contributions
 valleys-0.005
tail-0.004
 free-0.004
 majestic-0.004
 rocky-0.004
Token were
Token ID547
Feature activation-4.44e-04

Pos logit contributions
 valleys+0.011
 rivers+0.009
 Pat+0.009
 free+0.008
 streams+0.008
Neg logit contributions
 excuse-0.009
burn-0.007
 haircut-0.007
resp-0.007
have-0.007
Token!
Token ID0
Feature activation+1.22e-03

Pos logit contributions
 valleys+0.001
 rivers+0.001
 free+0.001
 Pat+0.001
 amaz+0.001
Neg logit contributions
 excuse-0.002
resp-0.001
burn-0.001
arre-0.001
ins-0.001
Token He
Token ID679
Feature activation-2.39e-04

Pos logit contributions
 excuse+0.002
 fault+0.002
burn+0.002
resp+0.002
arre+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.001
 free-0.001
 behold-0.001
 Pat-0.001
Token thanked
Token ID26280
Feature activation-3.27e-02

Pos logit contributions
 valleys+0.000
 rivers+0.000
 Pat+0.000
 free+0.000
 behold+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
 fault-0.000
resp-0.000
arre-0.000
Token the
Token ID262
Feature activation+4.76e-03

Pos logit contributions
 valleys+0.044
 rivers+0.035
 Pat+0.034
 free+0.034
 behold+0.032
Neg logit contributions
 excuse-0.066
burn-0.055
 haircut-0.053
 fault-0.053
resp-0.053
Token repair
Token ID9185
Feature activation-1.39e-02

Pos logit contributions
 excuse+0.009
burn+0.007
resp+0.007
ins+0.007
 fault+0.007
Neg logit contributions
 valleys-0.007
 rivers-0.006
 free-0.006
 amaz-0.006
 streams-0.006
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 valleys+0.022
 rivers+0.019
 Pat+0.019
 free+0.019
 behold+0.019
Neg logit contributions
 excuse-0.023
burn-0.019
resp-0.019
 fault-0.019
arre-0.018
Tokenman
Token ID805
Feature activation-2.01e-03

Pos logit contributions
 valleys+0.201
 rivers+0.175
 Pat+0.173
 free+0.173
 behold+0.171
Neg logit contributions
 excuse-0.142
burn-0.108
 fault-0.106
resp-0.102
arre-0.099
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 valleys+0.004
 rivers+0.003
 Pat+0.003
 free+0.003
 amaz+0.003
Neg logit contributions
 excuse-0.003
burn-0.002
 fault-0.002
resp-0.002
arre-0.002
Token promised
Token ID8072
Feature activation-3.34e-02

Pos logit contributions
 valleys+0.038
 rivers+0.032
 Pat+0.031
 free+0.031
 streams+0.030
Neg logit contributions
 excuse-0.038
burn-0.030
 haircut-0.029
 fault-0.029
resp-0.028
Token to
Token ID284
Feature activation+1.57e-02

Pos logit contributions
 valleys+0.058
 Pat+0.051
 free+0.050
 rivers+0.050
 behold+0.048
Neg logit contributions
 excuse-0.054
burn-0.043
arre-0.041
ins-0.041
 fault-0.041
Token never
Token ID1239
Feature activation-6.87e-04

Pos logit contributions
 excuse+0.024
 fault+0.019
ins+0.019
resp+0.019
burn+0.019
Neg logit contributions
 valleys-0.028
 free-0.025
 Pat-0.024
 rivers-0.024
 behold-0.024
TokenHe
Token ID1544
Feature activation+1.42e-02

Pos logit contributions
 excuse+0.012
resp+0.009
burn+0.009
arre+0.009
have+0.009
Neg logit contributions
 valleys-0.011
 rivers-0.009
 amaz-0.009
 Pat-0.009
 free-0.009
Token exclaimed
Token ID33552
Feature activation-7.17e-03

Pos logit contributions
 excuse+0.025
burn+0.021
 fault+0.020
resp+0.020
 haircut+0.019
Neg logit contributions
 valleys-0.022
 rivers-0.019
 Pat-0.018
 free-0.018
 behold-0.018
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
 valleys+0.011
 rivers+0.009
 Pat+0.009
 free+0.009
 behold+0.008
Neg logit contributions
 excuse-0.014
burn-0.011
 fault-0.011
resp-0.011
 haircut-0.010
Token "
Token ID366
Feature activation-1.76e-02

Pos logit contributions
 excuse+0.022
burn+0.018
resp+0.018
 haircut+0.018
 fault+0.018
Neg logit contributions
 valleys-0.015
 rivers-0.012
 behold-0.011
 Pat-0.011
 free-0.011
TokenPick
Token ID31686
Feature activation+2.23e-02

Pos logit contributions
 valleys+0.029
 rivers+0.024
 behold+0.023
 Pat+0.023
 free+0.023
Neg logit contributions
 excuse-0.032
 fault-0.026
resp-0.024
burn-0.024
 haircut-0.024
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 excuse+0.038
burn+0.030
 fault+0.029
ins+0.029
resp+0.029
Neg logit contributions
 valleys-0.036
 rivers-0.030
 Pat-0.030
 free-0.029
 behold-0.029
Tokena
Token ID64
Feature activation-2.18e-02

Pos logit contributions
 valleys+0.164
 rivers+0.136
 Pat+0.134
 free+0.133
 amaz+0.132
Neg logit contributions
 excuse-0.176
burn-0.143
 fault-0.137
resp-0.136
 haircut-0.135
Token-
Token ID12
Feature activation-9.03e-02

Pos logit contributions
 valleys+0.032
 Pat+0.028
 free+0.027
 rivers+0.027
 behold+0.026
Neg logit contributions
 excuse-0.041
burn-0.033
 fault-0.033
resp-0.032
arre-0.032
Tokenbo
Token ID2127
Feature activation-1.71e-02

Pos logit contributions
 valleys+0.152
 rivers+0.128
 Pat+0.127
 free+0.125
 behold+0.123
Neg logit contributions
 excuse-0.151
burn-0.120
 fault-0.117
resp-0.115
arre-0.113
Tokeno
Token ID78
Feature activation-3.57e-02

Pos logit contributions
 valleys+0.027
 Pat+0.024
 free+0.023
 rivers+0.023
 behold+0.022
Neg logit contributions
 excuse-0.030
burn-0.024
resp-0.023
 fault-0.023
 haircut-0.022
Token!
Token ID0
Feature activation+3.32e-03

Pos logit contributions
 valleys+0.050
 Pat+0.044
 free+0.044
 rivers+0.042
tail+0.040
Neg logit contributions
 excuse-0.068
 fault-0.055
burn-0.054
resp-0.053
arre-0.053
Token a
Token ID257
Feature activation-1.63e-02

Pos logit contributions
 valleys+0.042
 free+0.034
 rivers+0.034
 Pat+0.033
 behold+0.032
Neg logit contributions
 excuse-0.074
resp-0.062
burn-0.061
ins-0.061
arre-0.060
Token zip
Token ID19974
Feature activation-8.27e-03

Pos logit contributions
 valleys+0.032
 free+0.028
 rivers+0.028
 Pat+0.028
 behold+0.027
Neg logit contributions
 excuse-0.022
 fault-0.017
burn-0.017
resp-0.016
arre-0.016
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
 valleys+0.011
 rivers+0.009
 free+0.009
 low+0.009
 Pat+0.009
Neg logit contributions
 excuse-0.017
 fault-0.014
ins-0.013
resp-0.013
arre-0.013
Token It
Token ID632
Feature activation-2.97e-02

Pos logit contributions
 valleys+0.017
 rivers+0.014
 free+0.014
 amaz+0.013
 behold+0.013
Neg logit contributions
 excuse-0.027
burn-0.023
 fault-0.023
resp-0.022
 haircut-0.022
Token zip
Token ID19974
Feature activation-2.70e-02

Pos logit contributions
 valleys+0.062
 amaz+0.056
 rivers+0.055
 Pat+0.053
 behold+0.053
Neg logit contributions
 excuse-0.034
have-0.023
burn-0.023
 distur-0.023
que-0.022
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 valleys+0.050
 amaz+0.044
 Pat+0.043
 rivers+0.043
 behold+0.043
Neg logit contributions
 excuse-0.038
burn-0.029
 fault-0.029
resp-0.028
arre-0.027
Tokenz
Token ID89
Feature activation+2.51e-02

Pos logit contributions
 valleys+0.168
 rivers+0.141
 Pat+0.141
 amaz+0.139
 free+0.138
Neg logit contributions
 excuse-0.169
burn-0.136
 fault-0.131
resp-0.130
 haircut-0.127
Tokenipped
Token ID3949
Feature activation-3.05e-02

Pos logit contributions
 excuse+0.046
burn+0.043
 fault+0.040
 sy+0.039
 distur+0.039
Neg logit contributions
 valleys-0.031
 Pat-0.027
 free-0.027
 behold-0.027
 rivers-0.027
Token all
Token ID477
Feature activation-1.94e-02

Pos logit contributions
 valleys+0.047
 amaz+0.042
 behold+0.042
tail+0.041
 Pat+0.041
Neg logit contributions
 excuse-0.050
burn-0.039
ruck-0.039
 fault-0.039
reat-0.038
Token around
Token ID1088
Feature activation-2.72e-02

Pos logit contributions
 valleys+0.035
 amaz+0.032
tail+0.032
 Pat+0.030
stack+0.030
Neg logit contributions
 excuse-0.027
ruck-0.021
burn-0.021
reat-0.020
 fault-0.019
Token,
Token ID11
Feature activation-2.42e-03

Pos logit contributions
 valleys+0.033
tail+0.031
 amaz+0.031
 delighted+0.029
 behold+0.029
Neg logit contributions
 excuse-0.050
ruck-0.040
 consequences-0.039
 fault-0.039
reat-0.038
Tokenup
Token ID929
Feature activation+3.97e-02

Pos logit contributions
 valleys+0.094
 rivers+0.084
 free+0.083
 Pat+0.083
 behold+0.081
Neg logit contributions
 excuse-0.038
burn-0.025
 fault-0.024
resp-0.023
 haircut-0.021
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
 excuse+0.063
burn+0.050
 fault+0.049
resp+0.047
 haircut+0.047
Neg logit contributions
 valleys-0.070
 rivers-0.059
 Pat-0.058
 free-0.057
 behold-0.057
Token trying
Token ID2111
Feature activation+3.37e-02

Pos logit contributions
 excuse+0.026
burn+0.021
resp+0.021
 fault+0.021
ins+0.020
Neg logit contributions
 valleys-0.025
 rivers-0.021
 free-0.021
 Pat-0.021
 amaz-0.020
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
 excuse+0.050
burn+0.040
resp+0.038
 fault+0.036
arre+0.036
Neg logit contributions
 valleys-0.062
 free-0.054
 rivers-0.054
 Pat-0.054
 amaz-0.051
Token make
Token ID787
Feature activation+8.56e-03

Pos logit contributions
 valleys+0.022
 rivers+0.019
 free+0.019
 amaz+0.019
 low+0.019
Neg logit contributions
 excuse-0.023
burn-0.021
resp-0.020
ins-0.020
ud-0.019
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 excuse+0.016
 fault+0.013
burn+0.013
 haircut+0.013
arre+0.013
Neg logit contributions
 valleys-0.013
 rivers-0.010
 Pat-0.010
 free-0.010
 behold-0.010
Tokenup
Token ID929
Feature activation-7.18e-03

Pos logit contributions
 valleys+0.212
 free+0.190
 rivers+0.187
 behold+0.185
 Pat+0.182
Neg logit contributions
 excuse-0.126
 fault-0.091
burn-0.089
resp-0.088
 haircut-0.078
Token like
Token ID588
Feature activation-1.51e-02

Pos logit contributions
 valleys+0.011
 free+0.009
 rivers+0.009
 amaz+0.009
 Pat+0.009
Neg logit contributions
 excuse-0.013
burn-0.011
resp-0.011
 fault-0.010
arre-0.010
Token a
Token ID257
Feature activation-1.68e-02

Pos logit contributions
 valleys+0.020
 rivers+0.017
 Pat+0.016
 free+0.016
 amaz+0.016
Neg logit contributions
 excuse-0.030
burn-0.025
resp-0.024
 fault-0.024
arre-0.024
Token young
Token ID1862
Feature activation-1.39e-02

Pos logit contributions
 valleys+0.026
 free+0.023
 amaz+0.022
 rivers+0.022
 Pat+0.022
Neg logit contributions
 excuse-0.029
burn-0.024
resp-0.024
ins-0.023
reat-0.023
Token adult
Token ID4044
Feature activation+2.79e-02

Pos logit contributions
 valleys+0.024
 rivers+0.021
 amaz+0.021
 free+0.020
 delighted+0.020
Neg logit contributions
 excuse-0.021
resp-0.018
burn-0.018
reat-0.017
ins-0.016
Token sorry
Token ID7926
Feature activation-5.80e-03

Pos logit contributions
 excuse+0.023
burn+0.019
 fault+0.018
resp+0.018
 haircut+0.018
Neg logit contributions
 valleys-0.016
 rivers-0.013
 Pat-0.013
 free-0.013
 behold-0.012
Token for
Token ID329
Feature activation-2.95e-03

Pos logit contributions
 valleys+0.008
 Pat+0.007
 free+0.007
 rivers+0.007
 Lou+0.007
Neg logit contributions
 excuse-0.011
burn-0.009
 haircut-0.009
resp-0.009
arre-0.009
Token using
Token ID1262
Feature activation+6.61e-03

Pos logit contributions
 valleys+0.005
 rivers+0.004
 behold+0.004
 amaz+0.004
 Pat+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.004
 haircut-0.004
resp-0.003
Token the
Token ID262
Feature activation-6.83e-03

Pos logit contributions
 excuse+0.014
burn+0.012
resp+0.011
 fault+0.011
arre+0.011
Neg logit contributions
 valleys-0.008
 rivers-0.007
 Pat-0.007
 free-0.007
 behold-0.006
Token past
Token ID1613
Feature activation-4.07e-02

Pos logit contributions
 valleys+0.010
 free+0.010
 low+0.009
 rivers+0.009
 majestic+0.009
Neg logit contributions
 excuse-0.011
burn-0.010
resp-0.010
arre-0.010
ins-0.009
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 valleys+0.069
 Pat+0.059
 rivers+0.059
 behold+0.058
 free+0.057
Neg logit contributions
 excuse-0.067
burn-0.052
resp-0.052
 fault-0.051
arre-0.050
Tokendate
Token ID4475
Feature activation-8.43e-04

Pos logit contributions
 valleys+0.173
 free+0.152
 Pat+0.151
 rivers+0.150
 behold+0.147
Neg logit contributions
 excuse-0.164
burn-0.127
 fault-0.125
arre-0.122
resp-0.122
Token cheese
Token ID9891
Feature activation+5.95e-03

Pos logit contributions
 valleys+0.001
 rivers+0.001
 Pat+0.001
 free+0.001
 behold+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
resp-0.001
arre-0.001
Token.
Token ID13
Feature activation+1.71e-03

Pos logit contributions
 excuse+0.011
burn+0.009
resp+0.008
arre+0.008
have+0.008
Neg logit contributions
 valleys-0.009
 free-0.008
 Pat-0.008
 rivers-0.008
 streams-0.007
Token Sam
Token ID3409
Feature activation-5.51e-03

Pos logit contributions
 excuse+0.002
arre+0.002
 fault+0.002
 twin+0.002
EEP+0.002
Neg logit contributions
 valleys-0.003
 rivers-0.003
tail-0.003
 free-0.002
 rocky-0.002
Token learned
Token ID4499
Feature activation+1.14e-03

Pos logit contributions
 valleys+0.007
 free+0.006
 Pat+0.006
 delighted+0.006
 behold+0.006
Neg logit contributions
 excuse-0.011
 fault-0.009
burn-0.009
resp-0.009
reat-0.009
Token She
Token ID1375
Feature activation-1.78e-02

Pos logit contributions
 excuse+0.001
 fault+0.001
arre+0.001
resp+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 free-0.000
 rivers-0.000
 behold-0.000
 rocky-0.000
Token could
Token ID714
Feature activation-1.31e-02

Pos logit contributions
 valleys+0.031
 rivers+0.027
 Pat+0.026
tail+0.025
 streams+0.025
Neg logit contributions
 excuse-0.026
 haircut-0.020
have-0.019
 boo-0.019
burn-0.019
Token pick
Token ID2298
Feature activation-7.38e-03

Pos logit contributions
 valleys+0.016
 free+0.014
 Pat+0.013
 behold+0.013
 rivers+0.012
Neg logit contributions
 excuse-0.028
burn-0.023
ins-0.023
resp-0.023
 fault-0.023
Token any
Token ID597
Feature activation-2.37e-02

Pos logit contributions
 valleys+0.010
 rivers+0.008
 Pat+0.008
 behold+0.008
 free+0.008
Neg logit contributions
 excuse-0.014
burn-0.012
 fault-0.012
 haircut-0.011
resp-0.011
Token ice
Token ID4771
Feature activation-1.33e-02

Pos logit contributions
 valleys+0.036
 amaz+0.032
 free+0.032
 rivers+0.032
 rocky+0.031
Neg logit contributions
 excuse-0.040
resp-0.037
arre-0.037
EEP-0.036
burn-0.034
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 valleys+0.025
 rivers+0.021
 free+0.021
 Pat+0.021
 rocky+0.020
Neg logit contributions
 excuse-0.020
arre-0.015
 fault-0.015
burn-0.015
resp-0.015
Tokencream
Token ID36277
Feature activation-1.10e-02

Pos logit contributions
 valleys+0.209
 rivers+0.182
 free+0.182
 Pat+0.177
 behold+0.176
Neg logit contributions
 excuse-0.135
 fault-0.097
burn-0.094
resp-0.093
arre-0.092
Token she
Token ID673
Feature activation-1.62e-02

Pos logit contributions
 valleys+0.018
 rivers+0.015
 Pat+0.015
 free+0.015
 behold+0.015
Neg logit contributions
 excuse-0.019
burn-0.015
 fault-0.014
resp-0.014
arre-0.014
Token wanted
Token ID2227
Feature activation+9.18e-03

Pos logit contributions
 valleys+0.025
 rivers+0.021
 Pat+0.020
 free+0.020
 behold+0.019
Neg logit contributions
 excuse-0.029
burn-0.023
 fault-0.023
 haircut-0.023
resp-0.023
Token!
Token ID0
Feature activation+3.11e-03

Pos logit contributions
 excuse+0.017
burn+0.014
arre+0.014
resp+0.013
ins+0.013
Neg logit contributions
 valleys-0.013
 free-0.012
 Pat-0.012
 rivers-0.012
 itself-0.011
Token
Token ID198
Feature activation-2.89e-03

Pos logit contributions
 excuse+0.006
 fault+0.005
burn+0.005
arre+0.005
 haircut+0.005
Neg logit contributions
 valleys-0.004
 rivers-0.004
 free-0.003
 rocky-0.003
 majestic-0.003
TokenI
Token ID40
Feature activation+4.37e-02

Pos logit contributions
 excuse+0.018
arre+0.016
 fault+0.015
burn+0.015
resp+0.014
Neg logit contributions
 valleys-0.016
 Pat-0.015
 rivers-0.015
 free-0.014
 forests-0.014
Token am
Token ID716
Feature activation-6.22e-03

Pos logit contributions
 excuse+0.075
resp+0.070
burn+0.065
ruck+0.063
ins+0.061
Neg logit contributions
 valleys-0.062
 free-0.060
 behold-0.059
 -0.058
 low-0.057
Token selling
Token ID6301
Feature activation+2.01e-02

Pos logit contributions
 valleys+0.009
 free+0.008
 Pat+0.008
 delighted+0.007
 rivers+0.007
Neg logit contributions
 excuse-0.011
burn-0.010
resp-0.010
arre-0.009
ruck-0.009
Token b
Token ID275
Feature activation-2.84e-02

Pos logit contributions
 excuse+0.036
burn+0.031
arre+0.030
 fault+0.030
resp+0.029
Neg logit contributions
 valleys-0.029
 rivers-0.025
 free-0.025
 Pat-0.024
 behold-0.024
Tokenerry
Token ID6996
Feature activation-2.37e-02

Pos logit contributions
 valleys+0.050
 rivers+0.044
 Pat+0.043
 streams+0.042
tail+0.042
Neg logit contributions
 excuse-0.043
 fault-0.034
resp-0.033
arre-0.033
burn-0.033
Token-
Token ID12
Feature activation-1.01e-01

Pos logit contributions
 valleys+0.038
 Pat+0.033
 rivers+0.033
 free+0.033
tail+0.031
Neg logit contributions
 excuse-0.040
 fault-0.032
burn-0.032
arre-0.031
 haircut-0.030
Tokenbus
Token ID10885
Feature activation-4.70e-02

Pos logit contributions
 valleys+0.163
 rivers+0.139
 Pat+0.138
 free+0.136
 behold+0.133
Neg logit contributions
 excuse-0.176
 fault-0.139
burn-0.138
arre-0.135
resp-0.134
Tokenhes
Token ID956
Feature activation-1.84e-02

Pos logit contributions
 valleys+0.094
 rivers+0.082
 free+0.082
 Pat+0.082
tail+0.079
Neg logit contributions
 excuse-0.063
 fault-0.044
resp-0.043
 haircut-0.042
burn-0.042
Token."
Token ID526
Feature activation+2.38e-03

Pos logit contributions
 valleys+0.029
 Pat+0.025
 free+0.025
 rivers+0.025
 behold+0.024
Neg logit contributions
 excuse-0.032
burn-0.025
 fault-0.025
resp-0.024
arre-0.024
Token
Token ID198
Feature activation-3.91e-03

Pos logit contributions
 excuse+0.005
burn+0.004
 fault+0.004
resp+0.004
arre+0.004
Neg logit contributions
 valleys-0.003
 Pat-0.002
 rivers-0.002
 free-0.002
 behold-0.002
Token
Token ID198
Feature activation-8.44e-03

Pos logit contributions
 valleys+0.006
 rivers+0.005
 amaz+0.004
 free+0.004
 streams+0.004
Neg logit contributions
 excuse-0.007
burn-0.006
resp-0.006
 fault-0.006
 haircut-0.006
Token a
Token ID257
Feature activation-1.19e-02

Pos logit contributions
 excuse+0.023
burn+0.019
 fault+0.018
resp+0.018
arre+0.018
Neg logit contributions
 valleys-0.018
 rivers-0.014
 Pat-0.014
 free-0.014
 behold-0.014
Token time
Token ID640
Feature activation-1.41e-02

Pos logit contributions
 valleys+0.019
 low+0.018
 free+0.017
 Pat+0.017
 +0.017
Neg logit contributions
 excuse-0.018
resp-0.016
burn-0.016
ins-0.015
ruck-0.015
Token,
Token ID11
Feature activation-2.75e-03

Pos logit contributions
 valleys+0.023
 rivers+0.019
 free+0.018
 behold+0.018
tail+0.018
Neg logit contributions
 excuse-0.025
 fault-0.019
burn-0.019
resp-0.019
arre-0.018
Token there
Token ID612
Feature activation-3.42e-03

Pos logit contributions
 valleys+0.004
 rivers+0.003
 Pat+0.003
 free+0.003
 amaz+0.003
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.004
resp-0.004
arre-0.004
Token were
Token ID547
Feature activation-2.86e-02

Pos logit contributions
 valleys+0.008
 rivers+0.007
tail+0.006
 free+0.006
 streams+0.006
Neg logit contributions
 excuse-0.004
burn-0.003
 fault-0.003
 haircut-0.002
arre-0.002
Token two
Token ID734
Feature activation+7.79e-02

Pos logit contributions
 valleys+0.050
 rivers+0.044
 Pat+0.042
 free+0.042
 streams+0.041
Neg logit contributions
 excuse-0.047
burn-0.036
 fault-0.036
resp-0.036
 haircut-0.035
Token animals
Token ID4695
Feature activation+2.92e-02

Pos logit contributions
 excuse+0.133
burn+0.117
resp+0.113
 fault+0.113
 haircut+0.109
Neg logit contributions
 valleys-0.120
 rivers-0.106
 Pat-0.102
 free-0.100
 streams-0.099
Token.
Token ID13
Feature activation+2.82e-04

Pos logit contributions
 excuse+0.057
burn+0.046
 fault+0.045
 haircut+0.045
resp+0.045
Neg logit contributions
 valleys-0.041
 Pat-0.039
 free-0.035
 rivers-0.035
 Lou-0.034
Token One
Token ID1881
Feature activation+1.29e-02

Pos logit contributions
 excuse+0.000
burn+0.000
resp+0.000
 twin+0.000
 fault+0.000
Neg logit contributions
 valleys-0.000
 free-0.000
 amaz-0.000
 rivers-0.000
 rocky-0.000
Token of
Token ID286
Feature activation+7.81e-03

Pos logit contributions
 excuse+0.021
burn+0.016
 fault+0.016
resp+0.016
arre+0.015
Neg logit contributions
 valleys-0.023
 rivers-0.019
 Pat-0.019
 free-0.019
 amaz-0.018
Token them
Token ID606
Feature activation-1.28e-02

Pos logit contributions
 excuse+0.013
que+0.011
ruck+0.011
ins+0.011
 twin+0.011
Neg logit contributions
 valleys-0.011
tail-0.010
 amaz-0.010
 content-0.009
 delighted-0.009
Token asked
Token ID1965
Feature activation-1.44e-02

Pos logit contributions
 excuse+0.041
 fault+0.034
burn+0.034
resp+0.032
 haircut+0.032
Neg logit contributions
 valleys-0.034
 Pat-0.029
 rivers-0.029
 free-0.028
 behold-0.027
Token his
Token ID465
Feature activation+2.56e-03

Pos logit contributions
 valleys+0.025
 rivers+0.020
 amaz+0.020
 behold+0.020
 free+0.019
Neg logit contributions
 excuse-0.025
burn-0.019
resp-0.019
 fault-0.019
reat-0.018
Token friend
Token ID1545
Feature activation-9.94e-03

Pos logit contributions
 excuse+0.003
burn+0.002
 fault+0.002
arre+0.002
resp+0.002
Neg logit contributions
 valleys-0.005
 Pat-0.005
 rivers-0.005
 free-0.005
 amaz-0.005
Token,
Token ID11
Feature activation-9.01e-03

Pos logit contributions
 valleys+0.012
 Pat+0.011
 rivers+0.010
 free+0.010
 Lou+0.010
Neg logit contributions
 excuse-0.020
burn-0.017
 fault-0.016
arre-0.016
resp-0.016
Token a
Token ID257
Feature activation-4.79e-03

Pos logit contributions
 valleys+0.013
 rivers+0.010
 amaz+0.010
 behold+0.010
 streams+0.010
Neg logit contributions
 excuse-0.018
burn-0.015
resp-0.014
 fault-0.014
 haircut-0.014
Token se
Token ID384
Feature activation+8.10e-02

Pos logit contributions
 valleys+0.008
 rivers+0.007
 Pat+0.007
 free+0.007
 behold+0.007
Neg logit contributions
 excuse-0.008
burn-0.006
 fault-0.006
resp-0.006
arre-0.006
Tokenag
Token ID363
Feature activation-9.22e-04

Pos logit contributions
 excuse+0.109
burn+0.088
 fault+0.084
resp+0.083
 haircut+0.082
Neg logit contributions
 valleys-0.146
 free-0.134
 behold-0.132
 Pat-0.131
 rivers-0.126
Tokenull
Token ID724
Feature activation+4.08e-02

Pos logit contributions
 valleys+0.002
 rivers+0.001
 free+0.001
 Pat+0.001
tail+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
resp-0.001
reat-0.001
Token named
Token ID3706
Feature activation-3.23e-02

Pos logit contributions
 excuse+0.078
 fault+0.064
resp+0.062
 haircut+0.061
arre+0.061
Neg logit contributions
 valleys-0.056
 Pat-0.050
 free-0.049
 rivers-0.047
tail-0.045
Token Tim
Token ID5045
Feature activation+6.18e-03

Pos logit contributions
 valleys+0.045
tail+0.037
 amaz+0.036
 behold+0.036
 rocky+0.034
Neg logit contributions
 excuse-0.063
 fault-0.054
reat-0.052
burn-0.051
resp-0.050
Tokenmy
Token ID1820
Feature activation+9.08e-03

Pos logit contributions
 excuse+0.008
resp+0.006
 haircut+0.005
 fault+0.005
burn+0.005
Neg logit contributions
 valleys-0.013
 Pat-0.011
 rivers-0.011
 free-0.011
 streams-0.011
Token
Token ID26391
Feature activation-5.17e-02

Pos logit contributions
 excuse+0.028
burn+0.022
arre+0.022
resp+0.022
 fault+0.022
Neg logit contributions
 valleys-0.025
 rivers-0.020
 Pat-0.020
 amaz-0.019
 free-0.019
Token
Token ID220
Feature activation-2.89e-03

Pos logit contributions
 valleys+0.094
 behold+0.088
years+0.084
 rocky+0.084
 rivers+0.083
Neg logit contributions
 excuse-0.066
/-0.057
arre-0.056
EEP-0.055
 fault-0.055
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
 valleys+0.004
 rivers+0.003
 Pat+0.003
 free+0.003
 behold+0.003
Neg logit contributions
 excuse-0.006
burn-0.005
 fault-0.005
resp-0.005
arre-0.005
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
 excuse+0.019
resp+0.015
burn+0.015
arre+0.015
 haircut+0.015
Neg logit contributions
 valleys-0.018
 rivers-0.015
 amaz-0.014
 streams-0.014
 rocky-0.014
TokenThe
Token ID464
Feature activation+2.40e-02

Pos logit contributions
 excuse+0.019
 fault+0.018
burn+0.016
 behavior+0.015
 consequences+0.015
Neg logit contributions
 valleys-0.014
 free-0.012
 behold-0.012
 rivers-0.012
 clearing-0.011
Token two
Token ID734
Feature activation+8.11e-02

Pos logit contributions
 excuse+0.037
burn+0.033
resp+0.031
ins+0.030
arre+0.030
Neg logit contributions
 valleys-0.040
 free-0.035
 rivers-0.035
 behold-0.035
 amaz-0.035
Token little
Token ID1310
Feature activation+4.94e-02

Pos logit contributions
 excuse+0.133
burn+0.128
 fault+0.127
resp+0.117
EEP+0.114
Neg logit contributions
 valleys-0.124
 rivers-0.108
 free-0.105
 Pat-0.104
 streams-0.103
Token girls
Token ID4813
Feature activation+2.02e-02

Pos logit contributions
 excuse+0.066
burn+0.054
resp+0.054
 fault+0.053
arre+0.050
Neg logit contributions
 valleys-0.096
 rivers-0.085
 Pat-0.083
 free-0.083
 streams-0.081
Token waited
Token ID13488
Feature activation+1.09e-02

Pos logit contributions
 excuse+0.037
burn+0.031
 fault+0.031
resp+0.030
arre+0.029
Neg logit contributions
 valleys-0.030
 rivers-0.025
 free-0.025
 Pat-0.025
 behold-0.024
Token and
Token ID290
Feature activation-2.66e-03

Pos logit contributions
 excuse+0.018
burn+0.015
resp+0.014
 fault+0.014
arre+0.014
Neg logit contributions
 valleys-0.018
 rivers-0.016
 free-0.015
 Pat-0.015
 amaz-0.015
Token waited
Token ID13488
Feature activation+2.57e-03

Pos logit contributions
 valleys+0.005
 dense+0.005
 majestic+0.004
tail+0.004
 rivers+0.004
Neg logit contributions
 excuse-0.003
 haircut-0.002
burn-0.002
have-0.002
 takes-0.002
Token a
Token ID257
Feature activation-1.52e-02

Pos logit contributions
 excuse+0.022
burn+0.018
 fault+0.018
resp+0.018
arre+0.017
Neg logit contributions
 valleys-0.013
 free-0.011
 rivers-0.011
 Pat-0.011
 behold-0.010
Token time
Token ID640
Feature activation-1.57e-02

Pos logit contributions
 valleys+0.025
 low+0.023
 free+0.023
 Pat+0.022
 behold+0.022
Neg logit contributions
 excuse-0.023
burn-0.020
resp-0.020
ins-0.019
ruck-0.018
Token there
Token ID612
Feature activation-1.13e-03

Pos logit contributions
 valleys+0.025
 rivers+0.021
 free+0.020
 behold+0.019
 streams+0.019
Neg logit contributions
 excuse-0.028
 fault-0.022
burn-0.022
resp-0.022
arre-0.021
Token was
Token ID373
Feature activation-2.53e-02

Pos logit contributions
 valleys+0.003
 rivers+0.002
tail+0.002
 streams+0.002
 free+0.002
Neg logit contributions
 excuse-0.001
 fault-0.001
burn-0.001
 haircut-0.001
que-0.001
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
 valleys+0.037
 rivers+0.030
 Pat+0.030
 free+0.030
 behold+0.029
Neg logit contributions
 excuse-0.048
burn-0.039
 fault-0.039
resp-0.038
arre-0.038
Token rad
Token ID2511
Feature activation+9.60e-02

Pos logit contributions
 valleys+0.026
 rivers+0.023
 Pat+0.022
 free+0.022
 behold+0.022
Neg logit contributions
 excuse-0.015
burn-0.011
resp-0.010
 fault-0.010
 haircut-0.010
Tokenish
Token ID680
Feature activation+3.55e-02

Pos logit contributions
 excuse+0.120
burn+0.085
 fault+0.083
resp+0.083
 haircut+0.080
Neg logit contributions
 valleys-0.203
 rivers-0.178
 Pat-0.176
 free-0.174
 streams-0.170
Token farmer
Token ID18739
Feature activation+2.29e-02

Pos logit contributions
 excuse+0.075
 fault+0.066
arre+0.062
burn+0.061
resp+0.059
Neg logit contributions
 valleys-0.042
 Pat-0.037
 free-0.036
 rivers-0.035
tail-0.034
Token.
Token ID13
Feature activation-9.52e-03

Pos logit contributions
 excuse+0.039
burn+0.031
 fault+0.030
resp+0.030
arre+0.029
Neg logit contributions
 valleys-0.038
 rivers-0.032
 Pat-0.032
 free-0.032
 behold-0.031
Token He
Token ID679
Feature activation-8.34e-03

Pos logit contributions
 valleys+0.014
 rivers+0.012
 amaz+0.012
 free+0.011
 rocky+0.011
Neg logit contributions
 excuse-0.016
 haircut-0.013
burn-0.013
reat-0.013
ruck-0.013
Token grew
Token ID6348
Feature activation-3.34e-02

Pos logit contributions
 valleys+0.011
 free+0.009
 rivers+0.009
 Pat+0.009
 behold+0.009
Neg logit contributions
 excuse-0.017
burn-0.014
 fault-0.014
arre-0.013
resp-0.013
Token,
Token ID11
Feature activation+8.43e-03

Pos logit contributions
 valleys+0.048
 rivers+0.041
 Pat+0.041
 behold+0.040
 free+0.040
Neg logit contributions
 excuse-0.032
burn-0.024
 fault-0.023
 haircut-0.022
resp-0.022
Token with
Token ID351
Feature activation-1.24e-02

Pos logit contributions
 excuse+0.013
burn+0.010
 fault+0.010
 haircut+0.010
resp+0.010
Neg logit contributions
 valleys-0.015
 Pat-0.012
 rivers-0.012
 behold-0.012
 streams-0.012
Token all
Token ID477
Feature activation+1.47e-02

Pos logit contributions
 valleys+0.020
stack+0.018
tail+0.017
elope+0.017
 rivers+0.016
Neg logit contributions
 excuse-0.019
 fault-0.017
 consequences-0.016
 haircut-0.016
ruck-0.016
Token its
Token ID663
Feature activation+9.07e-03

Pos logit contributions
 excuse+0.030
 fault+0.026
burn+0.025
 haircut+0.025
ruck+0.024
Neg logit contributions
 valleys-0.019
 rivers-0.014
 Pat-0.014
tail-0.014
 free-0.014
Token different
Token ID1180
Feature activation+3.22e-02

Pos logit contributions
 excuse+0.015
 fault+0.012
 consequences+0.011
ins+0.011
burn+0.011
Neg logit contributions
 valleys-0.015
 Pat-0.012
 rivers-0.012
stack-0.012
 behold-0.012
Tokencol
Token ID4033
Feature activation+8.38e-02

Pos logit contributions
 excuse+0.057
burn+0.048
resp+0.045
 fault+0.044
 haircut+0.044
Neg logit contributions
 valleys-0.052
 rivers-0.044
 free-0.042
 amaz-0.042
 Pat-0.042
Tokenours
Token ID4662
Feature activation-8.01e-04

Pos logit contributions
 excuse+0.149
burn+0.115
resp+0.112
have+0.111
 fault+0.110
Neg logit contributions
 valleys-0.129
 rivers-0.109
 free-0.106
 Pat-0.104
tail-0.103
Token,
Token ID11
Feature activation+7.55e-04

Pos logit contributions
 valleys+0.001
 Pat+0.001
 rivers+0.001
 behold+0.001
 amaz+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
resp-0.001
ins-0.001
Token shapes
Token ID15268
Feature activation+9.23e-03

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 behold-0.001
Token and
Token ID290
Feature activation-6.78e-03

Pos logit contributions
 excuse+0.016
burn+0.012
resp+0.012
arre+0.012
have+0.012
Neg logit contributions
 valleys-0.016
 rivers-0.013
 free-0.013
 Pat-0.013
 amaz-0.012
Token textures
Token ID20028
Feature activation+4.04e-03

Pos logit contributions
 valleys+0.011
 Pat+0.009
 behold+0.009
 rivers+0.009
 free+0.009
Neg logit contributions
 excuse-0.011
burn-0.009
 fault-0.009
 haircut-0.008
ruck-0.008
Token.
Token ID13
Feature activation-4.20e-03

Pos logit contributions
 valleys+0.049
 rivers+0.042
 behold+0.041
 Pat+0.041
 amaz+0.041
Neg logit contributions
 excuse-0.036
burn-0.028
 fault-0.026
ins-0.025
arre-0.025
Token Lily
Token ID20037
Feature activation-2.43e-02

Pos logit contributions
 valleys+0.010
 rivers+0.009
 rocky+0.008
field+0.008
 depths+0.008
Neg logit contributions
 excuse-0.004
 cousin-0.003
 twin-0.003
que-0.003
 friend-0.003
Token said
Token ID531
Feature activation-1.87e-02

Pos logit contributions
 valleys+0.045
 rivers+0.038
 Pat+0.036
 amaz+0.036
 free+0.036
Neg logit contributions
 excuse-0.037
burn-0.029
arre-0.028
resp-0.027
 fault-0.027
Token,
Token ID11
Feature activation+1.00e-02

Pos logit contributions
 valleys+0.028
 rivers+0.023
 Pat+0.022
 free+0.022
 behold+0.022
Neg logit contributions
 excuse-0.035
burn-0.029
 fault-0.028
resp-0.028
 haircut-0.027
Token "
Token ID366
Feature activation-3.03e-02

Pos logit contributions
 excuse+0.017
burn+0.014
resp+0.014
 haircut+0.014
reat+0.013
Neg logit contributions
 valleys-0.016
 rivers-0.013
 behold-0.013
 amaz-0.013
 streams-0.013
TokenHi
Token ID17250
Feature activation+1.94e-02

Pos logit contributions
 valleys+0.046
 rivers+0.038
 Pat+0.037
 behold+0.037
tail+0.037
Neg logit contributions
 excuse-0.057
 fault-0.049
resp-0.046
ins-0.045
burn-0.044
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
 excuse+0.032
burn+0.026
 haircut+0.024
resp+0.024
 fault+0.024
Neg logit contributions
 valleys-0.035
 rivers-0.029
 behold-0.028
stack-0.028
 amaz-0.027
Token my
Token ID616
Feature activation+2.19e-02

Pos logit contributions
 excuse+0.029
burn+0.024
 fault+0.024
ins+0.023
resp+0.023
Neg logit contributions
 valleys-0.019
 Pat-0.017
 free-0.016
 rivers-0.016
 behold-0.016
Token name
Token ID1438
Feature activation-1.07e-02

Pos logit contributions
 excuse+0.034
burn+0.026
resp+0.025
 fault+0.025
ins+0.024
Neg logit contributions
 valleys-0.038
 Pat-0.034
 free-0.034
 rivers-0.033
 amaz-0.033
Token is
Token ID318
Feature activation-3.39e-02

Pos logit contributions
 valleys+0.020
 free+0.018
 Pat+0.018
 rivers+0.017
 Lou+0.017
Neg logit contributions
 excuse-0.015
 fault-0.012
burn-0.012
arre-0.012
resp-0.011
Token Lily
Token ID20037
Feature activation-3.23e-02

Pos logit contributions
 valleys+0.058
 rivers+0.048
 amaz+0.047
 rocky+0.045
 streams+0.045
Neg logit contributions
 excuse-0.056
burn-0.046
resp-0.045
 haircut-0.044
que-0.043
Token face
Token ID1986
Feature activation-5.58e-03

Pos logit contributions
 excuse+0.012
resp+0.010
ins+0.010
 fault+0.010
w+0.010
Neg logit contributions
 valleys-0.007
 amaz-0.007
 delight-0.007
 itself-0.007
 content-0.007
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 valleys+0.009
 free+0.008
 low+0.008
tail+0.007
 rivers+0.007
Neg logit contributions
 excuse-0.010
resp-0.008
 fault-0.008
ins-0.008
burn-0.007
Token asked
Token ID1965
Feature activation-1.61e-02

Pos logit contributions
 excuse+0.019
 fault+0.016
burn+0.016
resp+0.015
arre+0.015
Neg logit contributions
 valleys-0.015
 rivers-0.013
 free-0.013
 Pat-0.012
 behold-0.012
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
 valleys+0.023
 rivers+0.019
 Pat+0.019
 free+0.019
 behold+0.018
Neg logit contributions
 excuse-0.031
burn-0.025
 fault-0.025
resp-0.024
arre-0.024
Token "
Token ID366
Feature activation-3.04e-02

Pos logit contributions
 valleys+0.015
 rivers+0.012
 Pat+0.012
 free+0.011
 behold+0.011
Neg logit contributions
 excuse-0.026
burn-0.022
 fault-0.021
resp-0.021
 haircut-0.021
TokenWhat
Token ID2061
Feature activation+2.47e-02

Pos logit contributions
 valleys+0.035
 rivers+0.029
 clearing+0.028
 behold+0.027
 free+0.026
Neg logit contributions
 fault-0.064
 excuse-0.062
resp-0.056
EEP-0.054
 behavior-0.054
Token happened
Token ID3022
Feature activation+1.22e-02

Pos logit contributions
 excuse+0.043
burn+0.040
 stret+0.037
ruck+0.036
oun+0.036
Neg logit contributions
 free-0.028
 Lou-0.027
 Pat-0.026
 -0.026
 valleys-0.025
Token,
Token ID11
Feature activation+5.06e-03

Pos logit contributions
 excuse+0.021
burn+0.018
ins+0.018
 fault+0.017
arre+0.017
Neg logit contributions
 valleys-0.017
 free-0.016
 Pat-0.016
 rivers-0.015
 Lou-0.015
Token Lily
Token ID20037
Feature activation-2.45e-02

Pos logit contributions
 excuse+0.009
burn+0.008
 fault+0.008
arre+0.008
resp+0.008
Neg logit contributions
 valleys-0.007
 free-0.006
 Pat-0.006
 behold-0.006
 amaz-0.006
Token?"
Token ID1701
Feature activation-1.06e-02

Pos logit contributions
 valleys+0.045
 free+0.042
 Pat+0.039
tail+0.039
 rivers+0.039
Neg logit contributions
 excuse-0.031
resp-0.025
 fault-0.024
 haircut-0.024
burn-0.022
Token
Token ID198
Feature activation+1.77e-03

Pos logit contributions
 valleys+0.015
 rivers+0.013
 Pat+0.012
 free+0.012
 amaz+0.012
Neg logit contributions
 excuse-0.020
burn-0.017
 fault-0.016
resp-0.016
 haircut-0.016
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
 valleys+0.016
 rivers+0.013
 delighted+0.012
 amaz+0.012
 behold+0.012
Neg logit contributions
 excuse-0.024
 haircut-0.020
burn-0.020
 fault-0.020
 boo-0.018
Token store
Token ID3650
Feature activation+2.75e-03

Pos logit contributions
 valleys+0.028
 rivers+0.024
 Pat+0.024
 free+0.023
 behold+0.023
Neg logit contributions
 excuse-0.027
burn-0.021
 fault-0.021
resp-0.020
arre-0.020
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
 excuse+0.004
burn+0.003
 fault+0.003
resp+0.003
arre+0.003
Neg logit contributions
 valleys-0.005
 rivers-0.004
 Pat-0.004
 free-0.004
 behold-0.004
Token Can
Token ID1680
Feature activation-3.11e-02

Pos logit contributions
 valleys+0.020
 rivers+0.017
 free+0.016
 streams+0.016
 behold+0.016
Neg logit contributions
 excuse-0.018
burn-0.014
resp-0.014
 haircut-0.013
 fault-0.013
Token you
Token ID345
Feature activation-2.39e-02

Pos logit contributions
 valleys+0.056
 Pat+0.048
 rivers+0.048
 free+0.047
 behold+0.047
Neg logit contributions
 excuse-0.048
burn-0.038
 fault-0.037
arre-0.036
resp-0.036
Token help
Token ID1037
Feature activation+3.02e-02

Pos logit contributions
 free+0.036
 valleys+0.035
 Pat+0.033
tail+0.033
 behold+0.032
Neg logit contributions
 excuse-0.039
 fault-0.036
ins-0.036
arre-0.033
burn-0.033
Token me
Token ID502
Feature activation-1.79e-02

Pos logit contributions
 excuse+0.055
burn+0.045
 fault+0.043
resp+0.043
 haircut+0.042
Neg logit contributions
 valleys-0.047
 rivers-0.039
 Pat-0.038
 free-0.037
 amaz-0.037
Token?"
Token ID1701
Feature activation-1.58e-02

Pos logit contributions
 free+0.031
 valleys+0.029
tail+0.026
 Pat+0.026
 behold+0.026
Neg logit contributions
 excuse-0.027
 fault-0.024
 consequences-0.022
resp-0.022
arre-0.022
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
 valleys+0.024
 rivers+0.020
 amaz+0.019
 streams+0.019
 rocky+0.019
Neg logit contributions
 excuse-0.029
 haircut-0.024
burn-0.024
resp-0.023
 fault-0.022
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
 valleys+0.017
 rivers+0.015
 amaz+0.015
 streams+0.014
 rocky+0.014
Neg logit contributions
 excuse-0.017
burn-0.013
 haircut-0.013
resp-0.013
que-0.013
TokenThe
Token ID464
Feature activation+1.11e-02

Pos logit contributions
 valleys+0.018
 rivers+0.015
 free+0.015
 Pat+0.015
 amaz+0.015
Neg logit contributions
 excuse-0.019
burn-0.015
resp-0.014
 haircut-0.014
 fault-0.014
Token
Token ID198
Feature activation-5.02e-03

Pos logit contributions
 valleys+0.009
 rivers+0.008
 amaz+0.008
 rocky+0.008
tail+0.007
Neg logit contributions
 excuse-0.009
arre-0.007
resp-0.007
ins-0.007
burn-0.007
TokenThe
Token ID464
Feature activation+9.10e-03

Pos logit contributions
 valleys+0.008
 rivers+0.006
 Pat+0.006
 free+0.006
 amaz+0.006
Neg logit contributions
 excuse-0.009
burn-0.007
resp-0.007
arre-0.007
 haircut-0.007
Token friends
Token ID2460
Feature activation+6.84e-04

Pos logit contributions
 excuse+0.013
burn+0.011
resp+0.010
 fault+0.010
arre+0.010
Neg logit contributions
 valleys-0.017
 rivers-0.015
 free-0.015
 Pat-0.014
 amaz-0.014
Token couldn
Token ID3521
Feature activation+6.35e-04

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 behold-0.001
Token't
Token ID470
Feature activation-1.19e-03

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
ins+0.001
 haircut+0.001
Neg logit contributions
 valleys-0.001
 Pat-0.001
 free-0.001
 rivers-0.001
 behold-0.001
Token believe
Token ID1975
Feature activation+1.94e-02

Pos logit contributions
 valleys+0.002
 rivers+0.001
 Pat+0.001
 free+0.001
 behold+0.001
Neg logit contributions
 excuse-0.002
burn-0.002
 fault-0.002
resp-0.002
arre-0.002
Token it
Token ID340
Feature activation+3.73e-02

Pos logit contributions
 excuse+0.036
resp+0.029
 haircut+0.028
 fault+0.028
ins+0.028
Neg logit contributions
 valleys-0.029
 rivers-0.024
tail-0.023
 dense-0.023
 clearing-0.022
Token.
Token ID13
Feature activation-8.37e-04

Pos logit contributions
 excuse+0.050
 fault+0.034
arre+0.033
burn+0.033
have+0.032
Neg logit contributions
 valleys-0.075
 rivers-0.064
 streams-0.061
 Asia-0.061
 forests-0.060
Token They
Token ID1119
Feature activation-6.24e-03

Pos logit contributions
 valleys+0.001
 free+0.001
 rocky+0.001
tail+0.001
 rivers+0.001
Neg logit contributions
 excuse-0.001
arre-0.001
 fault-0.001
resp-0.001
 twin-0.001
Token ran
Token ID4966
Feature activation+9.04e-04

Pos logit contributions
 valleys+0.011
 rivers+0.009
tail+0.009
 Pat+0.009
 streams+0.009
Neg logit contributions
 excuse-0.009
arre-0.007
 haircut-0.007
ruck-0.007
burn-0.007
Token inside
Token ID2641
Feature activation-1.17e-02

Pos logit contributions
 excuse+0.001
 fault+0.001
ruck+0.001
 haircut+0.001
burn+0.001
Neg logit contributions
 valleys-0.002
 behold-0.001
 amaz-0.001
tail-0.001
 rivers-0.001
Token things
Token ID1243
Feature activation-1.30e-03

Pos logit contributions
 valleys+0.001
 rivers+0.001
 Pat+0.001
 free+0.001
 behold+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 fault-0.001
resp-0.001
ins-0.001
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
 valleys+0.002
 free+0.002
 rivers+0.002
 Pat+0.002
 behold+0.002
Neg logit contributions
 excuse-0.002
burn-0.002
resp-0.002
 fault-0.002
 haircut-0.002
Token
Token ID220
Feature activation-1.52e-02

Pos logit contributions
 excuse+0.003
 fault+0.002
arre+0.002
resp+0.002
burn+0.002
Neg logit contributions
 valleys-0.002
 free-0.002
 rivers-0.002
 rocky-0.002
 amaz-0.001
Token
Token ID198
Feature activation-4.11e-03

Pos logit contributions
 valleys+0.021
 rivers+0.017
 free+0.016
 amaz+0.016
 behold+0.016
Neg logit contributions
 excuse-0.029
 fault-0.024
burn-0.024
arre-0.024
resp-0.024
Token
Token ID198
Feature activation-5.62e-03

Pos logit contributions
 valleys+0.006
 rivers+0.005
 amaz+0.005
 free+0.005
 Pat+0.005
Neg logit contributions
 excuse-0.008
burn-0.006
arre-0.006
resp-0.006
ins-0.006
TokenThe
Token ID464
Feature activation+1.55e-02

Pos logit contributions
 valleys+0.009
 rivers+0.007
 Pat+0.007
 behold+0.007
 free+0.007
Neg logit contributions
 excuse-0.010
burn-0.008
 fault-0.008
resp-0.008
 haircut-0.008
Token farmer
Token ID18739
Feature activation+7.46e-03

Pos logit contributions
 excuse+0.025
burn+0.021
 fault+0.020
ins+0.020
resp+0.020
Neg logit contributions
 valleys-0.026
 rivers-0.023
 free-0.022
 amaz-0.022
 behold-0.022
Token quickly
Token ID2952
Feature activation+3.83e-03

Pos logit contributions
 excuse+0.014
burn+0.012
 fault+0.011
resp+0.011
 haircut+0.011
Neg logit contributions
 valleys-0.011
 rivers-0.010
 behold-0.010
 Pat-0.009
 free-0.009
Token grabbed
Token ID13176
Feature activation+4.59e-03

Pos logit contributions
 excuse+0.007
 fault+0.006
burn+0.006
resp+0.005
ins+0.005
Neg logit contributions
 valleys-0.006
 free-0.005
 rivers-0.005
 behold-0.005
 amaz-0.005
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.007
 haircut+0.007
resp+0.007
Neg logit contributions
 valleys-0.006
 rivers-0.005
 Pat-0.005
tail-0.005
 behold-0.005
Token tool
Token ID2891
Feature activation+1.02e-02

Pos logit contributions
 valleys+0.020
 free+0.018
 rivers+0.018
 winding+0.017
 rocky+0.017
Neg logit contributions
 excuse-0.019
burn-0.018
ins-0.016
resp-0.016
arre-0.016
Token waved
Token ID26834
Feature activation-1.35e-02

Pos logit contributions
 excuse+0.003
 fault+0.002
burn+0.002
resp+0.002
arre+0.002
Neg logit contributions
 valleys-0.002
 rivers-0.002
 Pat-0.002
 free-0.002
 behold-0.002
Token goodbye
Token ID24829
Feature activation-2.22e-02

Pos logit contributions
 valleys+0.022
 rivers+0.019
 Pat+0.018
 free+0.018
 behold+0.018
Neg logit contributions
 excuse-0.023
burn-0.018
 fault-0.018
resp-0.018
 haircut-0.017
Token to
Token ID284
Feature activation-9.71e-03

Pos logit contributions
 valleys+0.048
 behold+0.042
 rivers+0.042
 amaz+0.041
 Pat+0.041
Neg logit contributions
 excuse-0.025
burn-0.018
 fault-0.018
 haircut-0.017
resp-0.017
Token the
Token ID262
Feature activation-2.76e-04

Pos logit contributions
 valleys+0.014
 rivers+0.011
 dense+0.011
tail+0.011
 rocky+0.011
Neg logit contributions
 excuse-0.018
 haircut-0.014
ruck-0.013
 fault-0.013
have-0.013
Token city
Token ID1748
Feature activation-7.41e-03

Pos logit contributions
 valleys+0.000
 rivers+0.000
 Pat+0.000
 free+0.000
 behold+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
 fault-0.000
resp-0.000
arre-0.000
Token and
Token ID290
Feature activation-2.12e-02

Pos logit contributions
 valleys+0.014
 rivers+0.012
 Pat+0.012
 free+0.012
 behold+0.011
Neg logit contributions
 excuse-0.011
burn-0.008
 fault-0.008
resp-0.008
 haircut-0.008
Token started
Token ID2067
Feature activation-2.49e-02

Pos logit contributions
 valleys+0.035
 rivers+0.030
 streams+0.028
 dense+0.028
 free+0.028
Neg logit contributions
 excuse-0.034
burn-0.027
 haircut-0.026
have-0.025
 fault-0.025
Token his
Token ID465
Feature activation+7.23e-04

Pos logit contributions
 valleys+0.046
 rivers+0.039
 behold+0.038
 Pat+0.038
 free+0.038
Neg logit contributions
 excuse-0.037
burn-0.030
 fault-0.028
 haircut-0.028
resp-0.027
Token journey
Token ID7002
Feature activation+7.17e-03

Pos logit contributions
 excuse+0.001
burn+0.001
 fault+0.001
resp+0.001
arre+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 Pat-0.001
 free-0.001
 behold-0.001
Token home
Token ID1363
Feature activation-8.24e-03

Pos logit contributions
 excuse+0.011
burn+0.009
 fault+0.008
 haircut+0.008
ruck+0.008
Neg logit contributions
 valleys-0.013
 Pat-0.011
 rivers-0.011
 amaz-0.010
 behold-0.010
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
 valleys+0.015
 rivers+0.013
 behold+0.013
 Pat+0.013
 Asia+0.013
Neg logit contributions
 excuse-0.011
burn-0.009
 haircut-0.009
 fault-0.009
reat-0.008
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
 excuse+0.016
burn+0.012
 fault+0.012
resp+0.012
arre+0.012
Neg logit contributions
 valleys-0.014
 Pat-0.012
 free-0.012
 rivers-0.012
 behold-0.011
Token They
Token ID1119
Feature activation-1.33e-03

Pos logit contributions
 excuse+0.008
 fault+0.006
burn+0.006
resp+0.006
arre+0.006
Neg logit contributions
 valleys-0.006
 rivers-0.005
 free-0.004
 rocky-0.004
 behold-0.004
Token began
Token ID2540
Feature activation-4.25e-03

Pos logit contributions
 valleys+0.002
 rivers+0.002
 Pat+0.002
tail+0.002
 forests+0.002
Neg logit contributions
 excuse-0.002
burn-0.002
 haircut-0.001
arre-0.001
ins-0.001
Token to
Token ID284
Feature activation+8.07e-03

Pos logit contributions
 valleys+0.008
 rivers+0.007
 Pat+0.007
 free+0.007
 amaz+0.007
Neg logit contributions
 excuse-0.006
resp-0.005
burn-0.005
 fault-0.004
ins-0.004
Token eat
Token ID4483
Feature activation-9.71e-03

Pos logit contributions
 excuse+0.012
burn+0.010
 fault+0.009
resp+0.009
 haircut+0.009
Neg logit contributions
 valleys-0.015
 rivers-0.012
 Pat-0.012
 free-0.012
 behold-0.012
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
 valleys+0.016
 Pat+0.013
 rivers+0.013
tail+0.013
 behold+0.013
Neg logit contributions
 excuse-0.016
burn-0.013
 fault-0.013
resp-0.012
ins-0.012
Token the
Token ID262
Feature activation-1.05e-03

Pos logit contributions
 valleys+0.019
 rivers+0.016
 dense+0.015
 rocky+0.015
 forests+0.015
Neg logit contributions
 excuse-0.019
 haircut-0.015
burn-0.015
 fault-0.014
arre-0.014
Token little
Token ID1310
Feature activation+2.86e-03

Pos logit contributions
 valleys+0.002
 Pat+0.001
 rivers+0.001
 free+0.001
 amaz+0.001
Neg logit contributions
 excuse-0.002
 haircut-0.001
burn-0.001
 fault-0.001
resp-0.001
Token girl
Token ID2576
Feature activation-1.44e-02

Pos logit contributions
 excuse+0.003
burn+0.002
 haircut+0.002
 fault+0.002
ins+0.002
Neg logit contributions
 valleys-0.006
 Pat-0.006
 free-0.006
 rivers-0.006
 behold-0.006
Token was
Token ID373
Feature activation+2.41e-02

Pos logit contributions
 valleys+0.026
 rivers+0.022
 Pat+0.021
 free+0.021
 streams+0.021
Neg logit contributions
 excuse-0.022
burn-0.017
 fault-0.017
 haircut-0.017
arre-0.016
Token so
Token ID523
Feature activation+2.58e-02

Pos logit contributions
 excuse+0.038
burn+0.035
 fault+0.034
 twin+0.032
arre+0.032
Neg logit contributions
 valleys-0.039
 rivers-0.032
 streams-0.030
 forests-0.030
 Pat-0.029
Token some
Token ID617
Feature activation+1.14e-02

Pos logit contributions
 excuse+0.031
burn+0.024
 haircut+0.023
 fault+0.023
ruck+0.023
Neg logit contributions
 valleys-0.032
stack-0.026
 rivers-0.026
 amaz-0.026
 Pat-0.026
Token little
Token ID1310
Feature activation+2.05e-03

Pos logit contributions
 excuse+0.021
burn+0.019
arre+0.019
 fault+0.019
EEP+0.017
Neg logit contributions
 valleys-0.016
 free-0.015
 rivers-0.014
 streams-0.013
 clearing-0.013
Token sticks
Token ID16461
Feature activation+6.35e-03

Pos logit contributions
 excuse+0.004
arre+0.003
burn+0.003
 fault+0.003
resp+0.003
Neg logit contributions
 valleys-0.003
 rivers-0.003
 free-0.003
 streams-0.003
 Pat-0.003
Token from
Token ID422
Feature activation-4.25e-02

Pos logit contributions
 excuse+0.009
burn+0.007
 fault+0.006
resp+0.006
 haircut+0.006
Neg logit contributions
 valleys-0.013
 rivers-0.011
 Pat-0.011
 free-0.011
 behold-0.010
Token outside
Token ID2354
Feature activation+1.07e-02

Pos logit contributions
 valleys+0.051
 amaz+0.044
 delighted+0.042
 Pat+0.041
tail+0.041
Neg logit contributions
 excuse-0.085
burn-0.071
ruck-0.069
 fault-0.069
 haircut-0.068
Token the
Token ID262
Feature activation-3.26e-04

Pos logit contributions
 excuse+0.017
burn+0.014
resp+0.014
 fault+0.013
arre+0.013
Neg logit contributions
 valleys-0.019
 rivers-0.016
 Pat-0.016
 free-0.016
 amaz-0.015
Token cage
Token ID17186
Feature activation-3.04e-02

Pos logit contributions
 valleys+0.001
 Helen+0.001
 Pat+0.000
 Lou+0.000
 behold+0.000
Neg logit contributions
 excuse-0.000
burn-0.000
 haircut-0.000
 price-0.000
que-0.000
Token and
Token ID290
Feature activation-9.30e-04

Pos logit contributions
 valleys+0.067
 behold+0.060
 rivers+0.058
 amaz+0.058
 Pat+0.057
Neg logit contributions
 excuse-0.032
 fault-0.024
burn-0.021
resp-0.020
arre-0.020
Token carefully
Token ID7773
Feature activation+7.21e-03

Pos logit contributions
 valleys+0.002
 Pat+0.001
 amaz+0.001
 rivers+0.001
 behold+0.001
Neg logit contributions
 excuse-0.001
burn-0.001
 haircut-0.001
ruck-0.001
resp-0.001
Token w
Token ID266
Feature activation-3.23e-03

Pos logit contributions
 excuse+0.012
burn+0.009
 fault+0.009
resp+0.009
 haircut+0.009
Neg logit contributions
 valleys-0.012
 rivers-0.011
 Pat-0.010
 free-0.010
 behold-0.010
Tokenove
Token ID659
Feature activation-3.20e-02

Pos logit contributions
 valleys+0.006
 rivers+0.005
 free+0.005
 Pat+0.005
 behold+0.005
Neg logit contributions
 excuse-0.005
burn-0.004
 haircut-0.004
 fault-0.004
resp-0.004
Token a
Token ID257
Feature activation-1.54e-02

Pos logit contributions
 valleys+0.034
 Pat+0.029
 behold+0.028
 rivers+0.028
stack+0.028
Neg logit contributions
 excuse-0.027
burn-0.023
 haircut-0.023
 fault-0.022
 dessert-0.021
Token blue
Token ID4171
Feature activation-1.09e-02

Pos logit contributions
 valleys+0.023
 free+0.019
 low+0.019
 rivers+0.019
 amaz+0.018
Neg logit contributions
 excuse-0.028
resp-0.023
burn-0.023
 fault-0.023
arre-0.022
Token m
Token ID285
Feature activation+2.43e-03

Pos logit contributions
 +0.014
 valleys+0.014
tail+0.014
 low+0.014
 amaz+0.013
Neg logit contributions
 excuse-0.020
 fault-0.017
resp-0.017
arre-0.017
ins-0.016
Tokenitten
Token ID2621
Feature activation-3.92e-03

Pos logit contributions
 excuse+0.003
arre+0.002
 fault+0.002
resp+0.002
burn+0.002
Neg logit contributions
 valleys-0.005
 Pat-0.004
tail-0.004
 free-0.004
 rivers-0.004
Token.
Token ID13
Feature activation-3.93e-03

Pos logit contributions
 valleys+0.006
 free+0.005
tail+0.005
 Pat+0.005
 rivers+0.005
Neg logit contributions
 excuse-0.007
resp-0.006
ins-0.006
arre-0.006
 fault-0.006
Token
Token ID198
Feature activation-2.40e-03

Pos logit contributions
 valleys+0.007
 rivers+0.005
 amaz+0.005
 behold+0.005
 streams+0.005
Neg logit contributions
 excuse-0.007
arre-0.005
burn-0.005
resp-0.005
 fault-0.005
Token
Token ID198
Feature activation-1.62e-03

Pos logit contributions
 valleys+0.003
 rivers+0.003
 Pat+0.002
 free+0.002
 amaz+0.002
Neg logit contributions
 excuse-0.005
burn-0.004
 fault-0.004
resp-0.004
arre-0.004
Token"
Token ID1
Feature activation-1.95e-02

Pos logit contributions
 valleys+0.002
 free+0.002
 rivers+0.002
 behold+0.002
 Pat+0.002
Neg logit contributions
 excuse-0.003
 fault-0.003
burn-0.002
resp-0.002
 haircut-0.002
TokenNo
Token ID2949
Feature activation+9.49e-03

Pos logit contributions
 valleys+0.031
 behold+0.027
 free+0.027
 rivers+0.027
 low+0.026
Neg logit contributions
 excuse-0.033
 fault-0.030
burn-0.027
arre-0.026
ins-0.025
Token,
Token ID11
Feature activation+2.23e-02

Pos logit contributions
 excuse+0.021
arre+0.020
resp+0.020
ins+0.020
burn+0.020
Neg logit contributions
 Pat-0.007
 valleys-0.007
 free-0.006
 Bella-0.006
 -0.006
Token this
Token ID428
Feature activation+1.85e-02

Pos logit contributions
 excuse+0.042
burn+0.035
 fault+0.034
resp+0.034
arre+0.034
Neg logit contributions
 valleys-0.032
 Pat-0.026
 rivers-0.026
 free-0.026
 behold-0.025
Token st
Token ID336
Feature activation+3.55e-03

Pos logit contributions
 excuse+0.014
burn+0.012
 fault+0.012
resp+0.011
arre+0.011
Neg logit contributions
 valleys-0.015
 rivers-0.013
 Pat-0.013
 free-0.013
 amaz-0.012
Tokeneg
Token ID1533
Feature activation+3.87e-02

Pos logit contributions
 excuse+0.004
 fault+0.003
resp+0.003
burn+0.003
EEP+0.003
Neg logit contributions
 valleys-0.007
 behold-0.007
 rivers-0.007
 amaz-0.006
 delighted-0.006
Tokenosaurus
Token ID47650
Feature activation+5.04e-02

Pos logit contributions
 excuse+0.056
 fault+0.043
burn+0.041
 consequences+0.041
 haircut+0.040
Neg logit contributions
 valleys-0.068
 Pat-0.064
 free-0.064
 Lou-0.062
tail-0.061
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
 excuse+0.092
burn+0.076
 fault+0.074
arre+0.073
have+0.072
Neg logit contributions
 valleys-0.074
 Pat-0.065
 rivers-0.063
 free-0.062
tail-0.060
Token with
Token ID351
Feature activation-1.59e-02

Pos logit contributions
 excuse+0.034
arre+0.027
resp+0.027
burn+0.027
ins+0.026
Neg logit contributions
 valleys-0.031
 Pat-0.027
 rivers-0.027
 free-0.027
 amaz-0.025
Token a
Token ID257
Feature activation-5.48e-03

Pos logit contributions
 valleys+0.027
 behold+0.023
 rivers+0.022
stack+0.022
tail+0.022
Neg logit contributions
 excuse-0.024
 fault-0.020
burn-0.019
 haircut-0.019
 consequences-0.018
Token long
Token ID890
Feature activation-8.91e-03

Pos logit contributions
 valleys+0.008
 rivers+0.007
 Pat+0.007
 free+0.007
 amaz+0.007
Neg logit contributions
 excuse-0.010
burn-0.008
 fault-0.008
resp-0.008
arre-0.008
Token tail
Token ID7894
Feature activation-2.49e-02

Pos logit contributions
 valleys+0.012
 rivers+0.010
 Pat+0.010
 free+0.010
 behold+0.009
Neg logit contributions
 excuse-0.018
burn-0.015
 fault-0.014
resp-0.014
arre-0.014
Token and
Token ID290
Feature activation+1.46e-02

Pos logit contributions
 valleys+0.039
 rivers+0.033
 Pat+0.033
 free+0.032
 streams+0.031
Neg logit contributions
 excuse-0.046
have-0.036
burn-0.036
resp-0.035
 fault-0.035
Token big
Token ID1263
Feature activation-6.86e-03

Pos logit contributions
 excuse+0.027
burn+0.022
 fault+0.021
resp+0.021
arre+0.021
Neg logit contributions
 valleys-0.022
 rivers-0.018
 Pat-0.018
 free-0.018
 behold-0.017
Token plates
Token ID13375
Feature activation+2.68e-02

Pos logit contributions
 valleys+0.011
 rivers+0.009
 Pat+0.009
 free+0.009
 amaz+0.009
Neg logit contributions
 excuse-0.012
burn-0.010
resp-0.009
 fault-0.009
arre-0.009
Token She
Token ID1375
Feature activation-1.53e-02

Pos logit contributions
 excuse+0.009
burn+0.007
que+0.007
 fault+0.007
arre+0.007
Neg logit contributions
 valleys-0.009
 rivers-0.008
 majestic-0.008
 amaz-0.008
 behold-0.007
Token loved
Token ID6151
Feature activation+8.50e-03

Pos logit contributions
 valleys+0.023
 free+0.020
 rivers+0.019
 Pat+0.019
 behold+0.019
Neg logit contributions
 excuse-0.028
burn-0.023
 fault-0.023
resp-0.022
arre-0.022
Token to
Token ID284
Feature activation+8.08e-03

Pos logit contributions
 excuse+0.013
 fault+0.010
arre+0.010
resp+0.010
burn+0.010
Neg logit contributions
 valleys-0.015
 rivers-0.014
 free-0.013
 Pat-0.013
 forests-0.013
Token play
Token ID711
Feature activation+1.66e-03

Pos logit contributions
 excuse+0.015
 fault+0.012
ins+0.012
arre+0.012
burn+0.012
Neg logit contributions
 valleys-0.012
 free-0.011
 Pat-0.011
tail-0.010
 behold-0.010
Token dress
Token ID6576
Feature activation-1.39e-02

Pos logit contributions
 excuse+0.003
 fault+0.003
burn+0.002
resp+0.002
arre+0.002
Neg logit contributions
 valleys-0.003
 free-0.002
 rivers-0.002
 Pat-0.002
tail-0.002
Token-
Token ID12
Feature activation-8.60e-02

Pos logit contributions
 valleys+0.024
 free+0.021
 rivers+0.021
 Pat+0.021
 behold+0.020
Neg logit contributions
 excuse-0.022
 fault-0.018
burn-0.017
arre-0.016
resp-0.016
Tokenup
Token ID929
Feature activation+1.07e-02

Pos logit contributions
 valleys+0.215
 rivers+0.192
 Pat+0.191
 free+0.189
 behold+0.187
Neg logit contributions
 excuse-0.073
burn-0.044
 fault-0.040
resp-0.039
 haircut-0.037
Token and
Token ID290
Feature activation-8.75e-03

Pos logit contributions
 excuse+0.017
burn+0.013
 fault+0.013
resp+0.012
arre+0.012
Neg logit contributions
 valleys-0.019
 rivers-0.016
 free-0.016
 Pat-0.016
 behold-0.016
Token wear
Token ID5806
Feature activation-1.41e-02

Pos logit contributions
 valleys+0.017
 rivers+0.015
 Pat+0.014
 free+0.014
 behold+0.014
Neg logit contributions
 excuse-0.012
burn-0.009
 fault-0.009
resp-0.009
arre-0.009
Token pretty
Token ID2495
Feature activation-3.04e-02

Pos logit contributions
 valleys+0.026
stack+0.024
 Pat+0.023
 amaz+0.022
 behold+0.022
Neg logit contributions
 excuse-0.020
 haircut-0.017
burn-0.016
have-0.015
resp-0.015
Token things
Token ID1243
Feature activation+1.18e-02

Pos logit contributions
 valleys+0.057
 rivers+0.050
 free+0.048
 Pat+0.048
 behold+0.047
Neg logit contributions
 excuse-0.045
burn-0.034
 fault-0.034
resp-0.033
arre-0.032
Token light
Token ID1657
Feature activation-2.34e-02

Pos logit contributions
 excuse+0.042
burn+0.035
 fault+0.034
resp+0.034
arre+0.033
Neg logit contributions
 valleys-0.035
 rivers-0.030
 free-0.029
 amaz-0.029
 Pat-0.029
Token message
Token ID3275
Feature activation-1.40e-02

Pos logit contributions
 valleys+0.036
 rivers+0.031
 Pat+0.030
 free+0.030
 behold+0.030
Neg logit contributions
 excuse-0.042
 fault-0.035
resp-0.035
arre-0.033
burn-0.033
Token again
Token ID757
Feature activation-3.22e-03

Pos logit contributions
 valleys+0.022
 free+0.019
 rivers+0.018
 Pat+0.018
 behold+0.018
Neg logit contributions
 excuse-0.025
burn-0.020
 fault-0.020
resp-0.020
arre-0.020
Token,
Token ID11
Feature activation-4.33e-03

Pos logit contributions
 valleys+0.006
 rivers+0.005
 Pat+0.005
 streams+0.005
stack+0.005
Neg logit contributions
 excuse-0.005
burn-0.003
 haircut-0.003
ruck-0.003
ins-0.003
Token coming
Token ID2406
Feature activation-4.62e-03

Pos logit contributions
 valleys+0.008
 rivers+0.006
 Pat+0.006
 free+0.006
 behold+0.006
Neg logit contributions
 excuse-0.007
burn-0.006
 fault-0.005
resp-0.005
 haircut-0.005
Token from
Token ID422
Feature activation-5.79e-02

Pos logit contributions
 valleys+0.008
 rivers+0.007
 Pat+0.007
 free+0.007
 behold+0.007
Neg logit contributions
 excuse-0.007
burn-0.006
 fault-0.006
resp-0.006
arre-0.006
Token the
Token ID262
Feature activation-1.33e-03

Pos logit contributions
 valleys+0.078
 Pat+0.061
 delighted+0.061
 rivers+0.061
tail+0.061
Neg logit contributions
 excuse-0.111
 fault-0.089
burn-0.089
ruck-0.088
 haircut-0.087
Token same
Token ID976
Feature activation+1.48e-02

Pos logit contributions
 valleys+0.002
 rivers+0.002
 free+0.002
 Pat+0.002
 behold+0.002
Neg logit contributions
 excuse-0.002
burn-0.002
resp-0.002
 fault-0.002
ins-0.002
Token spot
Token ID4136
Feature activation-2.80e-02

Pos logit contributions
 excuse+0.027
burn+0.022
resp+0.022
 fault+0.021
ins+0.021
Neg logit contributions
 valleys-0.023
 rivers-0.020
 free-0.019
 Pat-0.019
 amaz-0.019
Token.
Token ID13
Feature activation+5.58e-04

Pos logit contributions
 valleys+0.045
 rivers+0.038
 Pat+0.038
 free+0.037
 behold+0.036
Neg logit contributions
 excuse-0.049
burn-0.039
resp-0.038
 fault-0.038
arre-0.038
Token Cur
Token ID4424
Feature activation+1.63e-02

Pos logit contributions
 excuse+0.001
 fault+0.001
resp+0.001
arre+0.001
burn+0.001
Neg logit contributions
 valleys-0.001
 rivers-0.001
 free-0.001
 rocky-0.001
 behold-0.001
Token "
Token ID366
Feature activation-1.81e-02

Pos logit contributions
 excuse+0.041
resp+0.032
burn+0.032
 fault+0.032
reat+0.031
Neg logit contributions
 valleys-0.038
 behold-0.031
 rivers-0.031
 amaz-0.030
 majestic-0.030
TokenLet
Token ID5756
Feature activation+3.46e-02

Pos logit contributions
 valleys+0.033
 rivers+0.028
 behold+0.027
 Pat+0.027
 free+0.027
Neg logit contributions
 excuse-0.029
 fault-0.024
resp-0.023
burn-0.022
 haircut-0.022
Token's
Token ID338
Feature activation-9.73e-03

Pos logit contributions
 excuse+0.057
 fault+0.050
burn+0.049
ins+0.048
ruck+0.047
Neg logit contributions
 valleys-0.051
 free-0.049
 Pat-0.048
 -0.046
 behold-0.045
Token spray
Token ID11662
Feature activation+7.97e-03

Pos logit contributions
 valleys+0.016
 free+0.015
 Pat+0.015
 behold+0.014
tail+0.014
Neg logit contributions
 excuse-0.016
ins-0.014
resp-0.013
 fault-0.013
arre-0.013
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
 excuse+0.015
burn+0.012
 fault+0.012
resp+0.012
 haircut+0.012
Neg logit contributions
 valleys-0.012
 rivers-0.010
 Pat-0.009
 free-0.009
 behold-0.009
Token dog
Token ID3290
Feature activation-5.22e-02

Pos logit contributions
 valleys+0.049
 rivers+0.044
 free+0.043
 streams+0.042
 rocky+0.042
Neg logit contributions
 excuse-0.052
ins-0.042
arre-0.041
resp-0.041
have-0.040
Token with
Token ID351
Feature activation-1.87e-02

Pos logit contributions
 valleys+0.078
 free+0.075
 Pat+0.067
 rivers+0.066
tail+0.066
Neg logit contributions
 excuse-0.092
 fault-0.075
ruck-0.074
 haircut-0.074
resp-0.074
Token the
Token ID262
Feature activation-4.43e-03

Pos logit contributions
 valleys+0.027
 rivers+0.022
 free+0.022
 Pat+0.022
 behold+0.021
Neg logit contributions
 excuse-0.036
burn-0.030
resp-0.029
 fault-0.029
arre-0.029
Token hose
Token ID31489
Feature activation+3.47e-03

Pos logit contributions
 valleys+0.007
 free+0.006
 low+0.006
 rivers+0.006
 rocky+0.006
Neg logit contributions
 excuse-0.007
ins-0.007
arre-0.006
resp-0.006
have-0.006
Token.
Token ID13
Feature activation-2.39e-04

Pos logit contributions
 excuse+0.006
burn+0.005
 haircut+0.005
resp+0.005
have+0.005
Neg logit contributions
 valleys-0.005
 free-0.004
 rivers-0.004
 Pat-0.004
 streams-0.004
Token Maybe
Token ID6674
Feature activation-3.21e-02

Pos logit contributions
 valleys+0.000
 rivers+0.000
 Pat+0.000
 free+0.000
 behold+0.000
Neg logit contributions
 excuse-0.001
burn-0.000
 fault-0.000
resp-0.000
 haircut-0.000
Token came
Token ID1625
Feature activation-2.36e-02

Pos logit contributions
 valleys+0.016
 rivers+0.014
 Pat+0.013
 free+0.013
 streams+0.013
Neg logit contributions
 excuse-0.014
burn-0.011
arre-0.010
 haircut-0.010
 fault-0.010
Token!
Token ID0
Feature activation-7.41e-03

Pos logit contributions
 valleys+0.041
 rivers+0.034
 Pat+0.034
 free+0.034
 behold+0.033
Neg logit contributions
 excuse-0.039
burn-0.031
 fault-0.030
resp-0.030
 haircut-0.029
Token She
Token ID1375
Feature activation-1.91e-02

Pos logit contributions
 valleys+0.010
 rivers+0.008
 rocky+0.008
 free+0.008
 majestic+0.007
Neg logit contributions
 excuse-0.013
 haircut-0.011
burn-0.011
resp-0.011
arre-0.011
Token opened
Token ID4721
Feature activation-2.30e-02

Pos logit contributions
 valleys+0.033
 rivers+0.028
 Pat+0.027
 free+0.026
 streams+0.026
Neg logit contributions
 excuse-0.031
burn-0.024
 haircut-0.023
resp-0.023
 fault-0.023
Token the
Token ID262
Feature activation-2.91e-02

Pos logit contributions
 valleys+0.032
tail+0.028
stack+0.026
years+0.026
 Pat+0.026
Neg logit contributions
 excuse-0.037
 haircut-0.032
 fault-0.031
burn-0.030
ruck-0.030
Token package
Token ID5301
Feature activation-4.95e-02

Pos logit contributions
 valleys+0.053
 rivers+0.046
 amaz+0.046
 free+0.046
 behold+0.044
Neg logit contributions
 excuse-0.044
burn-0.035
resp-0.034
ins-0.033
 fault-0.033
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 valleys+0.104
 rivers+0.090
 Pat+0.089
 behold+0.088
 free+0.087
Neg logit contributions
 excuse-0.060
burn-0.044
 haircut-0.042
 fault-0.041
resp-0.039
Token what
Token ID644
Feature activation-1.98e-02

Pos logit contributions
 valleys+0.041
 rivers+0.035
 Pat+0.034
tail+0.033
 streams+0.033
Neg logit contributions
 excuse-0.040
burn-0.032
 haircut-0.032
resp-0.030
have-0.030
Token do
Token ID466
Feature activation-7.53e-03

Pos logit contributions
 valleys+0.034
 rivers+0.029
 free+0.028
 Pat+0.028
 behold+0.027
Neg logit contributions
 excuse-0.032
burn-0.025
 fault-0.025
 haircut-0.025
resp-0.024
Token you
Token ID345
Feature activation+1.64e-02

Pos logit contributions
 valleys+0.011
 Pat+0.009
 rivers+0.009
 free+0.009
 behold+0.009
Neg logit contributions
 excuse-0.014
burn-0.012
 fault-0.011
resp-0.011
arre-0.011
Token think
Token ID892
Feature activation+3.08e-02

Pos logit contributions
 excuse+0.026
burn+0.019
 fault+0.019
resp+0.019
 haircut+0.019
Neg logit contributions
 valleys-0.029
 rivers-0.025
 Pat-0.024
 streams-0.023
 forests-0.023
Token wanted
Token ID2227
Feature activation+3.80e-03

Pos logit contributions
 excuse+0.004
burn+0.003
arre+0.003
 fault+0.003
have+0.003
Neg logit contributions
 valleys-0.004
 rivers-0.003
 Pat-0.003
tail-0.003
 free-0.003
Token to
Token ID284
Feature activation-8.63e-03

Pos logit contributions
 excuse+0.006
burn+0.005
arre+0.005
resp+0.005
ins+0.005
Neg logit contributions
 valleys-0.007
 Pat-0.006
 rivers-0.006
 free-0.006
 streams-0.006
Token know
Token ID760
Feature activation-6.88e-03

Pos logit contributions
 free+0.012
 valleys+0.012
 Pat+0.012
 behold+0.012
 Lou+0.010
Neg logit contributions
 excuse-0.015
ins-0.014
burn-0.013
arre-0.013
resp-0.013
Token why
Token ID1521
Feature activation-1.36e-02

Pos logit contributions
 valleys+0.010
 Pat+0.008
 rivers+0.008
 free+0.008
 behold+0.008
Neg logit contributions
 excuse-0.013
burn-0.011
arre-0.010
 fault-0.010
resp-0.010
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 valleys+0.017
 rivers+0.013
 Pat+0.013
 free+0.013
 behold+0.013
Neg logit contributions
 excuse-0.029
burn-0.024
 fault-0.024
resp-0.023
 haircut-0.023
Token dog
Token ID3290
Feature activation-5.07e-02

Pos logit contributions
 valleys+0.025
 rivers+0.021
 free+0.021
 Pat+0.021
 amaz+0.020
Neg logit contributions
 excuse-0.019
burn-0.015
resp-0.014
 fault-0.014
ins-0.014
Token kept
Token ID4030
Feature activation-3.05e-05

Pos logit contributions
 valleys+0.077
 Pat+0.068
 free+0.067
 rivers+0.065
 behold+0.064
Neg logit contributions
 excuse-0.089
burn-0.074
 fault-0.072
resp-0.071
ins-0.070
Token complaining
Token ID18705
Feature activation-3.03e-03

Pos logit contributions
 valleys+0.000
 free+0.000
 Pat+0.000
 rivers+0.000
 amaz+0.000
Neg logit contributions
 excuse-0.000
resp-0.000
burn-0.000
 fault-0.000
ins-0.000
Token.
Token ID13
Feature activation-8.39e-04

Pos logit contributions
 valleys+0.005
 rivers+0.004
 Pat+0.004
 free+0.004
 behold+0.004
Neg logit contributions
 excuse-0.005
burn-0.004
resp-0.004
 fault-0.004
arre-0.004
Token After
Token ID2293
Feature activation-1.04e-02

Pos logit contributions
 valleys+0.001
 rivers+0.001
 free+0.001
tail+0.001
 behold+0.001
Neg logit contributions
 excuse-0.002
 fault-0.001
arre-0.001
resp-0.001
burn-0.001
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
 valleys+0.016
 rivers+0.013
 Pat+0.013
 free+0.013
 behold+0.013
Neg logit contributions
 excuse-0.018
burn-0.014
 fault-0.014
 haircut-0.014
resp-0.014