TOP ACTIVATIONS
MAX = 0.099

two were best friends ever since . <|endoftext|> One day ,
was careful with things ever since . No more things vanished
they have been friends ever since . The
the spirit with them ever since . <|endoftext|> One day ,
They have been pals ever since . <|endoftext|> Once upon a
They were best friends ever since . <|endoftext|> Jim was walking
have been best friends ever since ! <|endoftext|> Once upon a
a much better person ever since . The wise owl was
he stayed with them ever since ! <|endoftext|> One day ,
they have been friends ever since ! <|endoftext|> Once upon a
hair tied in two pig tails . " M ummy !"
tried to remember it ever since . <|endoftext|> Once upon a
living in the garden ever since . <|endoftext|> Once upon a
they have been friends ever since . <|endoftext|> Once upon a
ooming around with it ever since . <|endoftext|> One night ,
had many boxes of different sizes and shapes . They used
she had grown two pounds since her last check - up
for the family and ever since , the boy and his
have been best friends ever since . They have many more
. They had been friends since they were babies and were

MIN ACTIVATIONS
MIN = -0.113

that day on , Bun - B un didn 't want
. The grand - daughter cl apped her hands
surprise . But when Bun - B un got to the
they did . Ch oo - Ch oo was so happy
, the bird and Bun - B un became the best
castle was destroyed and no - one was able to help
Hours passed , but no - one came and it started
. One day , Bun - B un met a little
Rex . Unfortunately , T - Rex used D ippy 's
on adventures . Ch oo - Ch oo was so happy
loud sound . Ch oo - ch oo ! It goes
on , Lily always double - checked the ingredients before they
. B un - B un hopped away ,
. One day , Bun - B un was hopping through
much fun ! The black - cl ot hed child was
make noise . Ch oo - ch oo !
, " Dog . D - o - g . Dog
B un - B un felt sad and
. " Hello , Bun - Cloud . We are your
?" B un - B un said , "

INTERVAL 0.046 - 0.099
CONTAINS 0.439%

trunk . The moral of the story is that
be here to support you if you want to explore .
that she would un pack every morning . One
was sad and her brother felt clumsy . They worked together
how does the driver get in and out ?"

INTERVAL -0.007 - 0.046
CONTAINS 67.381%

eed . They saw other fish and crabs too . The
bread to eat together . <|endoftext|> Once upon a time ,
. One day , as he was digging , he made
. Jack was very excited ; he said , " This
trick on her . Lily gigg led and continued to help

INTERVAL -0.060 - -0.007
CONTAINS 31.933%

was just a trick ! Sophie had tricked them all .
felt Lily in her backpack and smiled . But
, " H oney , you were so brave , but
. S ue tries to stop him . She
a tree . When the potato looked up he saw the

INTERVAL -0.113 - -0.060
CONTAINS 0.247%

waited for their mom to come . They did not want
, she found a funny - looking machine . It was
" Wow , Sara and Tom . You are so
. L ily and Ben took turns drying their
. He heard a high - p itched noise coming from
Token two
Token ID734
Feature activation+2.66e-02

Pos logit contributions
rav+0.010
 bye+0.010
gan+0.009
ia+0.008
five+0.008
Neg logit contributions
 valleys-0.012
 universe-0.012
 sizes-0.012
 valley-0.012
 sorts-0.011
Token were
Token ID547
Feature activation+1.55e-02

Pos logit contributions
 valleys+0.037
 sizes+0.032
 valley+0.032
 planets+0.032
 sorts+0.031
Neg logit contributions
 bye-0.047
rav-0.047
gan-0.042
five-0.042
cake-0.041
Token best
Token ID1266
Feature activation-7.56e-03

Pos logit contributions
 valleys+0.021
 sizes+0.020
 sorts+0.019
 planets+0.019
 galaxies+0.019
Neg logit contributions
 bye-0.028
rav-0.028
ia-0.025
five-0.025
cake-0.024
Token friends
Token ID2460
Feature activation+1.02e-02

Pos logit contributions
 bye+0.011
rav+0.011
gan+0.010
cake+0.010
ia+0.010
Neg logit contributions
 valleys-0.012
 sorts-0.012
 planets-0.011
 universe-0.011
 sizes-0.011
Token ever
Token ID1683
Feature activation-2.06e-02

Pos logit contributions
 valleys+0.014
 sizes+0.013
 galaxies+0.012
 universe+0.012
 planets+0.012
Neg logit contributions
 bye-0.018
rav-0.018
ia-0.016
gan-0.016
cake-0.016
Token since
Token ID1201
Feature activation+9.94e-02

Pos logit contributions
 bye+0.039
rav+0.037
ia+0.035
gan+0.034
cake+0.034
Neg logit contributions
 valleys-0.028
 sizes-0.025
 sorts-0.025
 galaxies-0.024
 planets-0.024
Token.
Token ID13
Feature activation-6.96e-03

Pos logit contributions
 valleys+0.151
 sizes+0.140
 galaxies+0.138
 universe+0.137
 planets+0.136
Neg logit contributions
 bye-0.170
rav-0.159
ia-0.146
 reward-0.140
gan-0.140
Token<|endoftext|>
Token ID50256
Feature activation+2.50e-02

Pos logit contributions
 bye+0.013
rav+0.012
ia+0.011
less+0.011
 reward+0.011
Neg logit contributions
 valleys-0.009
 sizes-0.009
 sorts-0.008
 universe-0.008
 galaxies-0.008
TokenOne
Token ID3198
Feature activation+2.54e-02

Pos logit contributions
 sizes+0.036
 valleys+0.035
 galaxies+0.034
 situations+0.034
 planets+0.033
Neg logit contributions
 bye-0.041
rav-0.041
ia-0.038
cake-0.036
gan-0.035
Token day
Token ID1110
Feature activation-7.79e-03

Pos logit contributions
 valleys+0.030
 universe+0.028
 sorts+0.027
 sizes+0.027
 planets+0.026
Neg logit contributions
 bye-0.050
rav-0.049
ia-0.045
gan-0.044
cake-0.044
Token,
Token ID11
Feature activation+6.50e-04

Pos logit contributions
 bye+0.016
rav+0.016
cake+0.015
five+0.015
ia+0.015
Neg logit contributions
 valleys-0.007
 universe-0.006
 valley-0.006
 galaxies-0.006
?",-0.006
Token was
Token ID373
Feature activation+1.47e-02

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
gan+0.001
less+0.001
Neg logit contributions
 valleys-0.001
 sizes-0.000
 galaxies-0.000
 planets-0.000
 universe-0.000
Token careful
Token ID8161
Feature activation+4.17e-03

Pos logit contributions
 valleys+0.022
 sizes+0.020
 universe+0.020
 galaxies+0.020
 planets+0.019
Neg logit contributions
 bye-0.025
rav-0.025
ia-0.022
cake-0.021
five-0.021
Token with
Token ID351
Feature activation-1.32e-02

Pos logit contributions
 valleys+0.007
 universe+0.007
 sizes+0.007
 galaxies+0.007
 planets+0.007
Neg logit contributions
 bye-0.006
rav-0.006
ia-0.005
less-0.005
gan-0.005
Token things
Token ID1243
Feature activation+1.45e-02

Pos logit contributions
 bye+0.019
ia+0.016
rav+0.016
 reward+0.016
less+0.015
Neg logit contributions
 valleys-0.023
 sizes-0.021
roads-0.021
 universe-0.021
 galaxies-0.021
Token ever
Token ID1683
Feature activation-1.01e-02

Pos logit contributions
 valleys+0.022
 sizes+0.021
 galaxies+0.020
 universe+0.020
 planets+0.020
Neg logit contributions
 bye-0.024
rav-0.023
ia-0.021
less-0.020
gan-0.020
Token since
Token ID1201
Feature activation+9.82e-02

Pos logit contributions
 bye+0.016
rav+0.015
ia+0.013
gan+0.013
 reward+0.013
Neg logit contributions
 valleys-0.017
 sizes-0.016
 universe-0.015
 galaxies-0.015
 planets-0.015
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
 valleys+0.163
 sizes+0.153
 galaxies+0.152
 sorts+0.147
 planets+0.147
Neg logit contributions
 bye-0.153
rav-0.140
ia-0.129
 reward-0.125
less-0.124
Token No
Token ID1400
Feature activation+4.90e-03

Pos logit contributions
 bye+0.015
rav+0.014
ia+0.013
gan+0.013
 reward+0.013
Neg logit contributions
 valleys-0.009
 sizes-0.008
 universe-0.008
 galaxies-0.008
 planets-0.008
Token more
Token ID517
Feature activation-1.66e-02

Pos logit contributions
 valleys+0.005
 sizes+0.005
 universe+0.005
 planets+0.005
 sorts+0.005
Neg logit contributions
rav-0.010
 bye-0.010
gan-0.009
ia-0.009
cake-0.009
Token things
Token ID1243
Feature activation-1.45e-02

Pos logit contributions
 bye+0.028
rav+0.026
ia+0.024
gan+0.023
 reward+0.023
Neg logit contributions
 valleys-0.025
 sizes-0.023
 galaxies-0.023
 universe-0.023
 planets-0.023
Token vanished
Token ID23717
Feature activation-4.10e-03

Pos logit contributions
 bye+0.025
rav+0.024
ia+0.022
gan+0.021
cake+0.021
Neg logit contributions
 valleys-0.022
 sizes-0.020
 galaxies-0.019
 universe-0.019
 planets-0.019
Token they
Token ID484
Feature activation+7.70e-04

Pos logit contributions
rav+0.078
ia+0.073
 bye+0.072
cake+0.070
five+0.068
Neg logit contributions
 valleys-0.050
 sizes-0.043
 sorts-0.042
 galaxies-0.042
 planets-0.042
Token have
Token ID423
Feature activation-1.18e-02

Pos logit contributions
 valleys+0.001
 sizes+0.001
 galaxies+0.001
 universe+0.001
 planets+0.001
Neg logit contributions
 bye-0.001
rav-0.001
ia-0.001
gan-0.001
cake-0.001
Token been
Token ID587
Feature activation+1.19e-02

Pos logit contributions
 bye+0.023
rav+0.023
cake+0.021
ia+0.020
five+0.020
Neg logit contributions
 valleys-0.015
 sizes-0.014
 sorts-0.013
 planets-0.013
 galaxies-0.013
Token friends
Token ID2460
Feature activation+2.02e-02

Pos logit contributions
 valleys+0.018
 sizes+0.016
 sorts+0.016
 situations+0.016
 planets+0.016
Neg logit contributions
 bye-0.020
rav-0.020
ia-0.018
cake-0.018
 reward-0.017
Token ever
Token ID1683
Feature activation-1.44e-02

Pos logit contributions
 valleys+0.028
 sizes+0.026
 universe+0.025
 galaxies+0.025
 situations+0.025
Neg logit contributions
 bye-0.037
rav-0.035
ia-0.032
gan-0.031
less-0.031
Token since
Token ID1201
Feature activation+9.81e-02

Pos logit contributions
 bye+0.025
rav+0.025
ia+0.024
cake+0.023
gan+0.022
Neg logit contributions
 valleys-0.021
 sizes-0.019
 galaxies-0.018
 sorts-0.018
 planets-0.018
Token.
Token ID13
Feature activation-4.67e-03

Pos logit contributions
 valleys+0.162
 sizes+0.151
 galaxies+0.151
 universe+0.148
 planets+0.147
Neg logit contributions
 bye-0.156
rav-0.138
ia-0.130
 reward-0.127
less-0.124
Token
Token ID220
Feature activation-1.02e-02

Pos logit contributions
 bye+0.009
rav+0.008
ia+0.008
less+0.007
 reward+0.007
Neg logit contributions
 valleys-0.006
 sizes-0.006
 universe-0.006
 galaxies-0.006
 planets-0.006
Token
Token ID198
Feature activation-6.17e-03

Pos logit contributions
 bye+0.018
rav+0.018
ia+0.016
 reward+0.016
cake+0.016
Neg logit contributions
 valleys-0.015
 sizes-0.013
 galaxies-0.013
 universe-0.013
 situations-0.013
Token
Token ID198
Feature activation-3.68e-03

Pos logit contributions
 bye+0.011
rav+0.010
ia+0.009
gan+0.009
 reward+0.009
Neg logit contributions
 valleys-0.009
 sizes-0.008
 universe-0.008
 galaxies-0.008
 sorts-0.008
TokenThe
Token ID464
Feature activation-7.09e-03

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
gan+0.006
 reward+0.006
Neg logit contributions
 valleys-0.005
roads-0.004
 galaxies-0.004
 sizes-0.004
 universe-0.004
Token the
Token ID262
Feature activation+8.82e-03

Pos logit contributions
 valleys+0.044
 sizes+0.042
 universe+0.042
 galaxies+0.041
roads+0.041
Neg logit contributions
 bye-0.044
rav-0.041
 reward-0.037
ia-0.036
five-0.036
Token spirit
Token ID4437
Feature activation+9.61e-03

Pos logit contributions
 valleys+0.013
 universe+0.012
 sizes+0.012
 galaxies+0.012
 planets+0.012
Neg logit contributions
 bye-0.015
rav-0.015
ia-0.014
gan-0.014
cake-0.013
Token with
Token ID351
Feature activation+7.40e-03

Pos logit contributions
 valleys+0.015
 sizes+0.014
 universe+0.014
 galaxies+0.014
 situations+0.014
Neg logit contributions
 bye-0.015
rav-0.014
ia-0.013
less-0.013
 reward-0.012
Token them
Token ID606
Feature activation-6.63e-03

Pos logit contributions
 valleys+0.012
 sizes+0.011
 universe+0.011
 galaxies+0.011
 situations+0.011
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.010
 reward-0.009
less-0.009
Token ever
Token ID1683
Feature activation-1.30e-02

Pos logit contributions
 bye+0.011
rav+0.011
ia+0.010
less+0.010
gan+0.009
Neg logit contributions
 valleys-0.010
 sizes-0.009
 universe-0.009
 galaxies-0.009
 situations-0.009
Token since
Token ID1201
Feature activation+9.58e-02

Pos logit contributions
 bye+0.023
rav+0.022
ia+0.020
gan+0.020
cake+0.019
Neg logit contributions
 valleys-0.019
 sizes-0.018
 galaxies-0.017
 universe-0.017
 situations-0.017
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
 valleys+0.159
 sizes+0.148
 galaxies+0.147
 universe+0.145
 planets+0.143
Neg logit contributions
 bye-0.151
rav-0.133
 reward-0.122
ia-0.121
less-0.120
Token<|endoftext|>
Token ID50256
Feature activation+2.40e-02

Pos logit contributions
 bye+0.022
rav+0.021
ia+0.019
gan+0.019
 reward+0.018
Neg logit contributions
 valleys-0.021
 sizes-0.020
 universe-0.019
 galaxies-0.019
 situations-0.019
TokenOne
Token ID3198
Feature activation+2.40e-02

Pos logit contributions
 sizes+0.037
 valleys+0.036
 galaxies+0.035
 situations+0.035
 planets+0.034
Neg logit contributions
 bye-0.037
rav-0.037
ia-0.034
cake-0.031
gan-0.031
Token day
Token ID1110
Feature activation-4.78e-03

Pos logit contributions
 valleys+0.030
 sizes+0.027
 universe+0.027
 sorts+0.027
 galaxies+0.026
Neg logit contributions
 bye-0.046
rav-0.046
ia-0.041
gan-0.041
cake-0.040
Token,
Token ID11
Feature activation+6.77e-03

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.009
cake+0.009
five+0.009
Neg logit contributions
 valleys-0.005
 universe-0.005
 sizes-0.005
 galaxies-0.005
 situations-0.005
Token They
Token ID1119
Feature activation+1.45e-02

Pos logit contributions
 bye+0.012
rav+0.011
ia+0.010
gan+0.010
cake+0.010
Neg logit contributions
 valleys-0.012
 sizes-0.012
 galaxies-0.011
 planets-0.011
 situations-0.011
Token have
Token ID423
Feature activation-4.60e-03

Pos logit contributions
 valleys+0.017
 situations+0.016
 sizes+0.016
 universe+0.015
 galaxies+0.015
Neg logit contributions
 bye-0.029
rav-0.028
ia-0.026
gan-0.025
 reward-0.025
Token been
Token ID587
Feature activation+1.31e-02

Pos logit contributions
 bye+0.009
rav+0.008
ia+0.008
gan+0.007
cake+0.007
Neg logit contributions
 valleys-0.006
 sizes-0.006
 galaxies-0.006
 planets-0.006
 sorts-0.005
Token pals
Token ID39898
Feature activation+4.45e-02

Pos logit contributions
 valleys+0.017
 sizes+0.016
 sorts+0.015
 planets+0.015
 galaxies+0.015
Neg logit contributions
 bye-0.025
rav-0.024
ia-0.022
cake-0.022
five-0.021
Token ever
Token ID1683
Feature activation-1.54e-02

Pos logit contributions
 valleys+0.066
 sizes+0.060
 situations+0.059
 universe+0.059
 galaxies+0.059
Neg logit contributions
 bye-0.077
rav-0.074
ia-0.068
gan-0.065
less-0.065
Token since
Token ID1201
Feature activation+9.56e-02

Pos logit contributions
 bye+0.031
rav+0.030
ia+0.029
gan+0.027
 reward+0.027
Neg logit contributions
 valleys-0.018
 sizes-0.017
 galaxies-0.016
 sorts-0.016
 planets-0.016
Token.
Token ID13
Feature activation+6.52e-03

Pos logit contributions
 valleys+0.148
 sizes+0.136
 galaxies+0.134
 situations+0.132
 universe+0.132
Neg logit contributions
 bye-0.162
rav-0.151
ia-0.139
 reward-0.134
gan-0.132
Token<|endoftext|>
Token ID50256
Feature activation+2.53e-02

Pos logit contributions
 valleys+0.010
 sizes+0.009
 universe+0.009
 galaxies+0.009
 planets+0.009
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.010
gan-0.009
cake-0.009
TokenOnce
Token ID7454
Feature activation-1.61e-03

Pos logit contributions
 valleys+0.038
 sizes+0.038
 galaxies+0.036
 situations+0.036
 planets+0.036
Neg logit contributions
 bye-0.040
rav-0.039
ia-0.037
cake-0.034
gan-0.034
Token upon
Token ID2402
Feature activation-2.20e-02

Pos logit contributions
 bye+0.002
rav+0.002
ia+0.002
gan+0.002
less+0.002
Neg logit contributions
 valleys-0.003
 sizes-0.003
 situations-0.003
 galaxies-0.003
 planets-0.003
Token a
Token ID257
Feature activation-3.89e-02

Pos logit contributions
 bye+0.034
rav+0.033
ia+0.030
gan+0.029
 reward+0.028
Neg logit contributions
 valleys-0.036
 sizes-0.033
 galaxies-0.033
 situations-0.032
 universe-0.032
Token They
Token ID1119
Feature activation+1.34e-02

Pos logit contributions
rav+0.022
 bye+0.021
ia+0.020
less+0.019
gan+0.019
Neg logit contributions
 valleys-0.014
 sorts-0.013
 sizes-0.012
 universe-0.012
 galaxies-0.012
Token were
Token ID547
Feature activation+1.08e-02

Pos logit contributions
 valleys+0.019
 situations+0.018
 sizes+0.017
 universe+0.017
roads+0.017
Neg logit contributions
 bye-0.024
rav-0.023
ia-0.021
gan-0.021
 reward-0.021
Token best
Token ID1266
Feature activation-4.59e-03

Pos logit contributions
 valleys+0.013
 sizes+0.012
 universe+0.012
 galaxies+0.012
 situations+0.012
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.019
gan-0.019
 reward-0.019
Token friends
Token ID2460
Feature activation+8.11e-03

Pos logit contributions
 bye+0.007
rav+0.007
gan+0.006
cake+0.006
ia+0.006
Neg logit contributions
 valleys-0.008
 sizes-0.007
 sorts-0.007
 planets-0.007
 galaxies-0.007
Token ever
Token ID1683
Feature activation-2.54e-02

Pos logit contributions
 valleys+0.012
 sizes+0.011
 situations+0.011
 universe+0.011
 galaxies+0.011
Neg logit contributions
 bye-0.014
rav-0.013
ia-0.012
less-0.012
gan-0.011
Token since
Token ID1201
Feature activation+9.51e-02

Pos logit contributions
 bye+0.047
rav+0.045
ia+0.041
gan+0.040
less+0.040
Neg logit contributions
 valleys-0.035
 sizes-0.032
 universe-0.031
 galaxies-0.031
 situations-0.031
Token.
Token ID13
Feature activation-5.35e-03

Pos logit contributions
 valleys+0.150
 sizes+0.139
 galaxies+0.136
 situations+0.136
 universe+0.135
Neg logit contributions
 bye-0.159
rav-0.147
ia-0.136
 reward-0.130
gan-0.130
Token<|endoftext|>
Token ID50256
Feature activation+2.53e-02

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.009
 reward+0.008
gan+0.008
Neg logit contributions
 valleys-0.008
 sizes-0.007
 sorts-0.007
 universe-0.007
 galaxies-0.007
TokenJim
Token ID18050
Feature activation-7.37e-03

Pos logit contributions
 sizes+0.038
 valleys+0.038
 galaxies+0.036
 situations+0.036
 planets+0.035
Neg logit contributions
 bye-0.040
rav-0.039
ia-0.037
cake-0.034
gan-0.034
Token was
Token ID373
Feature activation-4.13e-03

Pos logit contributions
 bye+0.011
ia+0.010
rav+0.009
ara+0.008
-+0.008
Neg logit contributions
 valleys-0.013
roads-0.012
 situations-0.012
 sizes-0.012
 basement-0.012
Token walking
Token ID6155
Feature activation+7.11e-03

Pos logit contributions
rav+0.008
 bye+0.008
ia+0.007
cake+0.007
five+0.007
Neg logit contributions
 valleys-0.005
 universe-0.005
 planets-0.005
 galaxies-0.005
 sorts-0.005
Token have
Token ID423
Feature activation-5.48e-03

Pos logit contributions
 valleys+0.017
 sizes+0.015
 universe+0.015
 galaxies+0.015
 situations+0.015
Neg logit contributions
 bye-0.019
rav-0.018
ia-0.016
gan-0.016
 reward-0.016
Token been
Token ID587
Feature activation+1.54e-02

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.009
cake+0.008
gan+0.008
Neg logit contributions
 valleys-0.008
 sizes-0.007
 sorts-0.007
 galaxies-0.007
 planets-0.007
Token best
Token ID1266
Feature activation+6.45e-03

Pos logit contributions
 valleys+0.022
 sizes+0.020
 sorts+0.020
 planets+0.020
 situations+0.020
Neg logit contributions
 bye-0.027
rav-0.027
ia-0.024
cake-0.024
gan-0.023
Token friends
Token ID2460
Feature activation+1.53e-02

Pos logit contributions
 valleys+0.011
 sizes+0.011
 situations+0.011
 sorts+0.010
 planets+0.010
Neg logit contributions
 bye-0.009
rav-0.009
ia-0.008
cake-0.008
gan-0.008
Token ever
Token ID1683
Feature activation-1.81e-02

Pos logit contributions
 valleys+0.024
 sizes+0.022
 situations+0.022
 galaxies+0.022
 universe+0.022
Neg logit contributions
 bye-0.025
rav-0.024
ia-0.022
less-0.022
gan-0.021
Token since
Token ID1201
Feature activation+9.43e-02

Pos logit contributions
 bye+0.034
rav+0.033
ia+0.030
gan+0.029
less+0.029
Neg logit contributions
 valleys-0.025
 sizes-0.022
 universe-0.022
 galaxies-0.022
 planets-0.022
Token!
Token ID0
Feature activation-6.50e-03

Pos logit contributions
 valleys+0.150
 sizes+0.139
 galaxies+0.138
 universe+0.135
 planets+0.135
Neg logit contributions
 bye-0.155
rav-0.142
ia-0.133
 reward-0.129
gan-0.126
Token<|endoftext|>
Token ID50256
Feature activation+2.85e-02

Pos logit contributions
 bye+0.012
rav+0.011
ia+0.010
less+0.010
 reward+0.010
Neg logit contributions
 valleys-0.009
 sizes-0.008
 universe-0.008
 galaxies-0.008
 sorts-0.008
TokenOnce
Token ID7454
Feature activation+1.40e-03

Pos logit contributions
 sizes+0.043
 valleys+0.042
 galaxies+0.041
 situations+0.040
 planets+0.039
Neg logit contributions
 bye-0.045
rav-0.045
ia-0.042
cake-0.039
gan-0.039
Token upon
Token ID2402
Feature activation-2.15e-02

Pos logit contributions
 valleys+0.003
 sizes+0.003
 situations+0.003
 galaxies+0.003
 planets+0.002
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.002
gan-0.001
less-0.001
Token a
Token ID257
Feature activation-3.63e-02

Pos logit contributions
 bye+0.034
rav+0.032
ia+0.030
gan+0.028
 reward+0.028
Neg logit contributions
 valleys-0.035
 sizes-0.032
 galaxies-0.032
 universe-0.031
 situations-0.031
Token a
Token ID257
Feature activation-2.25e-02

Pos logit contributions
 valleys+0.019
 sizes+0.018
 sorts+0.017
 galaxies+0.017
 planets+0.017
Neg logit contributions
 bye-0.017
rav-0.016
ia-0.015
cake-0.014
gan-0.014
Token much
Token ID881
Feature activation-2.93e-02

Pos logit contributions
rav+0.032
 bye+0.032
ia+0.031
gan+0.029
cake+0.029
Neg logit contributions
 valleys-0.037
 universe-0.036
 sizes-0.035
 valley-0.035
 sorts-0.034
Token better
Token ID1365
Feature activation-2.32e-02

Pos logit contributions
rav+0.044
 bye+0.043
ia+0.040
gan+0.038
cake+0.036
Neg logit contributions
 valleys-0.049
 sizes-0.046
 sorts-0.045
 galaxies-0.044
 universe-0.044
Token person
Token ID1048
Feature activation-1.84e-02

Pos logit contributions
 bye+0.032
rav+0.029
ia+0.026
 reward+0.025
less+0.024
Neg logit contributions
 valleys-0.044
 sizes-0.041
 galaxies-0.040
 planets-0.040
 universe-0.040
Token ever
Token ID1683
Feature activation-2.05e-02

Pos logit contributions
 bye+0.033
rav+0.032
ia+0.029
gan+0.028
cake+0.028
Neg logit contributions
 valleys-0.026
 sizes-0.023
 universe-0.023
 galaxies-0.023
 planets-0.023
Token since
Token ID1201
Feature activation+9.42e-02

Pos logit contributions
 bye+0.035
rav+0.034
ia+0.031
gan+0.030
cake+0.029
Neg logit contributions
 valleys-0.031
 sizes-0.029
 galaxies-0.028
 situations-0.028
 universe-0.028
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
 valleys+0.156
 sizes+0.146
 galaxies+0.143
 planets+0.140
 universe+0.138
Neg logit contributions
 bye-0.148
rav-0.133
ia-0.127
 reward-0.123
less-0.119
Token The
Token ID383
Feature activation-1.21e-02

Pos logit contributions
 bye+0.009
rav+0.009
ia+0.008
gan+0.008
less+0.008
Neg logit contributions
 valleys-0.008
 sizes-0.007
 universe-0.007
 galaxies-0.007
 sorts-0.007
Token wise
Token ID10787
Feature activation-1.97e-02

Pos logit contributions
rav+0.016
 bye+0.016
gan+0.014
ia+0.014
cake+0.013
Neg logit contributions
 valleys-0.023
 universe-0.022
 planets-0.021
 galaxies-0.021
 sizes-0.021
Token owl
Token ID39610
Feature activation-8.49e-03

Pos logit contributions
 bye+0.033
rav+0.032
ia+0.030
gan+0.029
cake+0.028
Neg logit contributions
 valleys-0.029
 sizes-0.027
 universe-0.027
 galaxies-0.027
 planets-0.026
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
 bye+0.014
rav+0.014
ia+0.012
gan+0.012
 reward+0.012
Neg logit contributions
 valleys-0.013
 sizes-0.012
 universe-0.012
 galaxies-0.012
 situations-0.012
Token he
Token ID339
Feature activation-8.00e-03

Pos logit contributions
 bye+0.053
rav+0.052
ia+0.047
gan+0.046
cake+0.045
Neg logit contributions
 valleys-0.048
 sizes-0.044
 galaxies-0.043
 universe-0.043
 planets-0.043
Token stayed
Token ID9658
Feature activation+1.09e-02

Pos logit contributions
 bye+0.014
rav+0.014
ia+0.012
gan+0.012
 reward+0.012
Neg logit contributions
 valleys-0.011
 sizes-0.011
 universe-0.010
 galaxies-0.010
 situations-0.010
Token with
Token ID351
Feature activation-5.67e-03

Pos logit contributions
 valleys+0.018
 sizes+0.017
 universe+0.017
 galaxies+0.016
 situations+0.016
Neg logit contributions
 bye-0.017
rav-0.015
ia-0.014
less-0.014
 reward-0.014
Token them
Token ID606
Feature activation-1.83e-02

Pos logit contributions
 bye+0.008
rav+0.007
ia+0.007
 reward+0.006
five+0.006
Neg logit contributions
 valleys-0.010
 sizes-0.009
 universe-0.009
 situations-0.009
roads-0.009
Token ever
Token ID1683
Feature activation-1.85e-02

Pos logit contributions
 bye+0.031
rav+0.030
ia+0.028
gan+0.027
 reward+0.026
Neg logit contributions
 valleys-0.027
 sizes-0.025
 universe-0.024
 galaxies-0.024
 planets-0.024
Token since
Token ID1201
Feature activation+9.41e-02

Pos logit contributions
 bye+0.032
rav+0.031
ia+0.030
gan+0.028
cake+0.028
Neg logit contributions
 valleys-0.028
 sizes-0.026
 galaxies-0.025
 planets-0.025
 situations-0.025
Token!
Token ID0
Feature activation-2.24e-02

Pos logit contributions
 valleys+0.152
 sizes+0.142
 galaxies+0.140
 universe+0.138
 planets+0.137
Neg logit contributions
 bye-0.151
rav-0.134
ia-0.129
 reward-0.124
less-0.121
Token<|endoftext|>
Token ID50256
Feature activation+2.60e-02

Pos logit contributions
 bye+0.038
rav+0.036
ia+0.033
gan+0.032
cake+0.031
Neg logit contributions
 valleys-0.034
 sizes-0.032
 universe-0.031
 galaxies-0.031
 planets-0.031
TokenOne
Token ID3198
Feature activation+2.42e-02

Pos logit contributions
 sizes+0.039
 valleys+0.038
 galaxies+0.037
 situations+0.037
 planets+0.036
Neg logit contributions
 bye-0.040
rav-0.040
ia-0.038
cake-0.035
less-0.035
Token day
Token ID1110
Feature activation-7.18e-03

Pos logit contributions
 valleys+0.029
 universe+0.027
 sizes+0.027
 sorts+0.027
 galaxies+0.026
Neg logit contributions
 bye-0.047
rav-0.046
ia-0.042
gan-0.041
cake-0.041
Token,
Token ID11
Feature activation+4.38e-03

Pos logit contributions
 bye+0.015
rav+0.014
ia+0.013
cake+0.013
five+0.013
Neg logit contributions
 valleys-0.008
 universe-0.007
 galaxies-0.007
 planets-0.007
 sizes-0.007
Token they
Token ID484
Feature activation-3.61e-03

Pos logit contributions
 bye+0.054
rav+0.052
ia+0.048
gan+0.046
cake+0.046
Neg logit contributions
 valleys-0.043
 sizes-0.040
 galaxies-0.038
 universe-0.038
 planets-0.038
Token have
Token ID423
Feature activation-1.42e-02

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.007
gan+0.007
cake+0.006
Neg logit contributions
 valleys-0.004
 sizes-0.004
 universe-0.004
 galaxies-0.004
 planets-0.004
Token been
Token ID587
Feature activation+5.94e-03

Pos logit contributions
 bye+0.028
rav+0.027
cake+0.024
ia+0.024
gan+0.024
Neg logit contributions
 valleys-0.018
 sizes-0.016
 galaxies-0.016
 planets-0.016
 sorts-0.016
Token friends
Token ID2460
Feature activation+9.85e-03

Pos logit contributions
 valleys+0.009
 sizes+0.008
 universe+0.008
 galaxies+0.008
 planets+0.008
Neg logit contributions
 bye-0.011
rav-0.010
ia-0.009
gan-0.009
cake-0.009
Token ever
Token ID1683
Feature activation-1.59e-02

Pos logit contributions
 valleys+0.014
 sizes+0.013
 situations+0.012
 universe+0.012
 galaxies+0.012
Neg logit contributions
 bye-0.018
rav-0.017
ia-0.016
gan-0.015
less-0.015
Token since
Token ID1201
Feature activation+9.33e-02

Pos logit contributions
 bye+0.030
rav+0.029
ia+0.027
gan+0.026
cake+0.026
Neg logit contributions
 valleys-0.021
 sizes-0.020
 galaxies-0.019
 planets-0.019
 universe-0.019
Token!
Token ID0
Feature activation-1.39e-02

Pos logit contributions
 valleys+0.149
 sizes+0.137
 galaxies+0.136
 situations+0.134
 planets+0.132
Neg logit contributions
 bye-0.157
rav-0.142
ia-0.132
 reward-0.128
gan-0.123
Token<|endoftext|>
Token ID50256
Feature activation+2.41e-02

Pos logit contributions
 bye+0.025
rav+0.024
ia+0.022
less+0.021
 reward+0.021
Neg logit contributions
 valleys-0.019
 sizes-0.017
 universe-0.017
 sorts-0.017
 galaxies-0.017
TokenOnce
Token ID7454
Feature activation-2.40e-03

Pos logit contributions
 sizes+0.036
 valleys+0.035
 galaxies+0.034
 situations+0.034
 planets+0.033
Neg logit contributions
 bye-0.038
rav-0.038
ia-0.036
cake-0.033
gan-0.032
Token upon
Token ID2402
Feature activation-2.35e-02

Pos logit contributions
 bye+0.003
rav+0.003
ia+0.003
gan+0.003
less+0.002
Neg logit contributions
 valleys-0.005
 sizes-0.004
 situations-0.004
 galaxies-0.004
 planets-0.004
Token a
Token ID257
Feature activation-3.99e-02

Pos logit contributions
 bye+0.037
rav+0.035
ia+0.032
gan+0.031
 reward+0.030
Neg logit contributions
 valleys-0.038
 sizes-0.035
 galaxies-0.035
 universe-0.034
 situations-0.034
Token hair
Token ID4190
Feature activation+1.45e-02

Pos logit contributions
 valleys+0.011
 sizes+0.010
 galaxies+0.010
 situations+0.010
 universe+0.010
Neg logit contributions
 bye-0.009
rav-0.009
ia-0.008
gan-0.007
 reward-0.007
Token tied
Token ID8165
Feature activation+3.03e-02

Pos logit contributions
 valleys+0.022
 galaxies+0.020
 situations+0.020
 sizes+0.020
 universe+0.020
Neg logit contributions
 bye-0.024
rav-0.023
ia-0.022
gan-0.021
cake-0.021
Token in
Token ID287
Feature activation+5.16e-02

Pos logit contributions
 valleys+0.046
 sizes+0.042
 universe+0.042
 galaxies+0.042
 situations+0.042
Neg logit contributions
 bye-0.050
rav-0.048
ia-0.044
gan-0.043
less-0.043
Token two
Token ID734
Feature activation+7.55e-02

Pos logit contributions
 valleys+0.084
 galaxies+0.078
 sizes+0.077
 situations+0.077
roads+0.077
Neg logit contributions
 bye-0.078
rav-0.072
ia-0.069
 reward-0.067
less-0.065
Token pig
Token ID12967
Feature activation+7.89e-03

Pos logit contributions
 valleys+0.109
roads+0.100
 galaxies+0.099
 situations+0.098
 universe+0.098
Neg logit contributions
 bye-0.127
rav-0.119
ia-0.113
 reward-0.109
less-0.108
Tokentails
Token ID26404
Feature activation+9.32e-02

Pos logit contributions
 valleys+0.008
roads+0.007
 universe+0.007
 galaxies+0.007
 situations+0.007
Neg logit contributions
 bye-0.017
rav-0.016
ia-0.016
less-0.016
gan-0.016
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 valleys+0.139
 sizes+0.127
 galaxies+0.126
 universe+0.126
 situations+0.125
Neg logit contributions
 bye-0.157
rav-0.150
ia-0.141
gan-0.135
cake-0.134
Token "
Token ID366
Feature activation+1.52e-02

Pos logit contributions
 valleys+0.016
 universe+0.014
 sizes+0.014
 galaxies+0.014
 sorts+0.014
Neg logit contributions
 bye-0.023
rav-0.023
ia-0.021
 reward-0.020
less-0.020
TokenM
Token ID44
Feature activation+1.03e-02

Pos logit contributions
 valleys+0.022
 sizes+0.020
 universe+0.019
 galaxies+0.019
 sorts+0.019
Neg logit contributions
 bye-0.028
rav-0.026
ia-0.025
 reward-0.023
gan-0.023
Tokenummy
Token ID13513
Feature activation+2.54e-02

Pos logit contributions
?",+0.016
 universe+0.016
 valley+0.016
 situations+0.016
 valleys+0.015
Neg logit contributions
rav-0.014
five-0.014
 bye-0.013
less-0.012
eal-0.012
Token!"
Token ID2474
Feature activation+3.16e-03

Pos logit contributions
 valleys+0.036
 galaxies+0.032
 sizes+0.032
 situations+0.031
 universe+0.031
Neg logit contributions
 bye-0.047
rav-0.045
ia-0.041
 reward-0.040
gan-0.040
Token tried
Token ID3088
Feature activation+1.21e-02

Pos logit contributions
 valleys+0.004
roads+0.004
 sizes+0.004
 situations+0.004
 galaxies+0.004
Neg logit contributions
 bye-0.005
rav-0.004
ia-0.004
 reward-0.004
cake-0.004
Token to
Token ID284
Feature activation-9.59e-04

Pos logit contributions
 valleys+0.023
 sizes+0.021
roads+0.021
 universe+0.021
 galaxies+0.021
Neg logit contributions
 bye-0.016
rav-0.015
ia-0.013
less-0.013
 reward-0.012
Token remember
Token ID3505
Feature activation+1.42e-02

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
 reward+0.001
gan+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 galaxies-0.002
 planets-0.001
 universe-0.001
Token it
Token ID340
Feature activation-1.79e-02

Pos logit contributions
 valleys+0.024
 sizes+0.023
 universe+0.022
 galaxies+0.022
roads+0.022
Neg logit contributions
 bye-0.021
rav-0.019
ia-0.018
 reward-0.017
less-0.017
Token ever
Token ID1683
Feature activation-1.66e-02

Pos logit contributions
 bye+0.032
rav+0.031
ia+0.028
gan+0.027
cake+0.027
Neg logit contributions
 valleys-0.025
 sizes-0.023
 universe-0.023
 galaxies-0.023
 planets-0.023
Token since
Token ID1201
Feature activation+9.31e-02

Pos logit contributions
 bye+0.029
rav+0.028
ia+0.026
gan+0.025
cake+0.025
Neg logit contributions
 valleys-0.024
 sizes-0.022
 galaxies-0.021
 universe-0.021
 planets-0.021
Token.
Token ID13
Feature activation-8.44e-03

Pos logit contributions
 valleys+0.158
 galaxies+0.144
 sizes+0.144
 planets+0.141
 universe+0.139
Neg logit contributions
 bye-0.142
rav-0.127
ia-0.122
 reward-0.121
less-0.116
Token<|endoftext|>
Token ID50256
Feature activation+2.34e-02

Pos logit contributions
 bye+0.015
rav+0.015
ia+0.013
gan+0.013
less+0.013
Neg logit contributions
 valleys-0.012
 universe-0.011
 sorts-0.011
 sizes-0.010
 galaxies-0.010
TokenOnce
Token ID7454
Feature activation+6.19e-04

Pos logit contributions
 valleys+0.040
 sizes+0.040
 galaxies+0.038
 situations+0.038
 planets+0.038
Neg logit contributions
 bye-0.033
rav-0.032
ia-0.029
cake-0.027
gan-0.027
Token upon
Token ID2402
Feature activation-1.77e-02

Pos logit contributions
 valleys+0.001
 sizes+0.001
 situations+0.001
 galaxies+0.001
 planets+0.001
Neg logit contributions
 bye-0.001
rav-0.001
ia-0.001
gan-0.001
less-0.001
Token a
Token ID257
Feature activation-3.40e-02

Pos logit contributions
 bye+0.030
rav+0.029
ia+0.026
gan+0.026
cake+0.025
Neg logit contributions
 valleys-0.026
 sizes-0.024
 universe-0.024
 galaxies-0.024
 planets-0.024
Token living
Token ID2877
Feature activation+1.28e-02

Pos logit contributions
 valleys+0.009
 sizes+0.008
 sorts+0.008
 galaxies+0.008
 universe+0.008
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.010
cake-0.010
gan-0.010
Token in
Token ID287
Feature activation+1.71e-02

Pos logit contributions
 valleys+0.023
roads+0.023
 sizes+0.022
 galaxies+0.021
 universe+0.021
Neg logit contributions
 bye-0.017
rav-0.016
less-0.015
 reward-0.014
ia-0.014
Token the
Token ID262
Feature activation+7.85e-03

Pos logit contributions
 valleys+0.030
 sizes+0.028
roads+0.028
 situations+0.027
 galaxies+0.027
Neg logit contributions
 bye-0.025
rav-0.022
 reward-0.021
ia-0.021
less-0.020
Token garden
Token ID11376
Feature activation-2.26e-02

Pos logit contributions
 valleys+0.010
 universe+0.009
 valley+0.008
 planets+0.008
 galaxies+0.008
Neg logit contributions
 bye-0.016
rav-0.015
ia-0.014
gan-0.014
cake-0.014
Token ever
Token ID1683
Feature activation-2.27e-02

Pos logit contributions
 bye+0.036
rav+0.034
ia+0.031
less+0.031
gan+0.030
Neg logit contributions
 valleys-0.035
 situations-0.032
 sizes-0.032
 universe-0.032
 galaxies-0.032
Token since
Token ID1201
Feature activation+9.26e-02

Pos logit contributions
 bye+0.039
rav+0.038
ia+0.036
gan+0.034
cake+0.033
Neg logit contributions
 valleys-0.034
 sizes-0.032
 galaxies-0.031
 universe-0.031
 planets-0.031
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
 valleys+0.149
 sizes+0.140
 galaxies+0.136
 situations+0.135
 planets+0.132
Neg logit contributions
 bye-0.151
rav-0.136
ia-0.128
 reward-0.124
less-0.120
Token<|endoftext|>
Token ID50256
Feature activation+2.48e-02

Pos logit contributions
 bye+0.023
rav+0.022
ia+0.020
gan+0.019
cake+0.019
Neg logit contributions
 valleys-0.023
 sizes-0.021
 galaxies-0.021
 situations-0.021
 universe-0.021
TokenOnce
Token ID7454
Feature activation+1.37e-03

Pos logit contributions
 sizes+0.040
 valleys+0.038
 galaxies+0.037
 situations+0.037
 planets+0.036
Neg logit contributions
 bye-0.037
rav-0.037
ia-0.035
cake-0.032
gan-0.032
Token upon
Token ID2402
Feature activation-2.38e-02

Pos logit contributions
 valleys+0.003
 sizes+0.003
 situations+0.003
 galaxies+0.002
 planets+0.002
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.002
gan-0.001
less-0.001
Token a
Token ID257
Feature activation-4.20e-02

Pos logit contributions
 bye+0.037
rav+0.035
ia+0.032
gan+0.031
 reward+0.030
Neg logit contributions
 valleys-0.039
 sizes-0.037
 galaxies-0.035
 situations-0.035
 universe-0.035
Token they
Token ID484
Feature activation+8.44e-03

Pos logit contributions
rav+0.044
 bye+0.043
ia+0.040
less+0.038
cake+0.038
Neg logit contributions
 valleys-0.035
 sizes-0.032
 planets-0.032
 sorts-0.032
 galaxies-0.032
Token have
Token ID423
Feature activation-4.30e-03

Pos logit contributions
 valleys+0.011
 situations+0.010
 sizes+0.010
 universe+0.010
 galaxies+0.010
Neg logit contributions
 bye-0.016
rav-0.015
ia-0.014
gan-0.014
 reward-0.013
Token been
Token ID587
Feature activation+1.32e-02

Pos logit contributions
 bye+0.008
rav+0.008
ia+0.007
cake+0.007
five+0.007
Neg logit contributions
 valleys-0.006
 sizes-0.005
 planets-0.005
 sorts-0.005
 galaxies-0.005
Token friends
Token ID2460
Feature activation+2.27e-02

Pos logit contributions
 valleys+0.019
 sizes+0.017
 sorts+0.016
 planets+0.016
 galaxies+0.016
Neg logit contributions
 bye-0.024
rav-0.023
ia-0.021
cake-0.021
five-0.020
Token ever
Token ID1683
Feature activation-1.29e-02

Pos logit contributions
 valleys+0.030
 sizes+0.027
 situations+0.026
 galaxies+0.026
 universe+0.026
Neg logit contributions
 bye-0.044
rav-0.042
ia-0.038
gan-0.038
less-0.038
Token since
Token ID1201
Feature activation+9.23e-02

Pos logit contributions
 bye+0.024
rav+0.023
ia+0.022
cake+0.020
gan+0.020
Neg logit contributions
 valleys-0.018
 sizes-0.016
 galaxies-0.016
 universe-0.016
 planets-0.016
Token.
Token ID13
Feature activation+1.47e-03

Pos logit contributions
 valleys+0.141
 sizes+0.131
 galaxies+0.128
 universe+0.126
 situations+0.126
Neg logit contributions
 bye-0.157
rav-0.147
ia-0.135
 reward-0.130
gan-0.130
Token<|endoftext|>
Token ID50256
Feature activation+2.84e-02

Pos logit contributions
 valleys+0.002
 sizes+0.002
 universe+0.002
 galaxies+0.002
 sorts+0.002
Neg logit contributions
 bye-0.003
rav-0.003
ia-0.002
less-0.002
gan-0.002
TokenOnce
Token ID7454
Feature activation+2.90e-03

Pos logit contributions
 sizes+0.044
 valleys+0.043
 galaxies+0.042
 situations+0.041
 planets+0.041
Neg logit contributions
 bye-0.044
rav-0.043
ia-0.041
cake-0.038
gan-0.037
Token upon
Token ID2402
Feature activation-2.20e-02

Pos logit contributions
 valleys+0.005
 sizes+0.005
 situations+0.005
 galaxies+0.005
 universe+0.005
Neg logit contributions
 bye-0.004
rav-0.004
ia-0.004
gan-0.003
less-0.003
Token a
Token ID257
Feature activation-4.02e-02

Pos logit contributions
 bye+0.035
rav+0.033
ia+0.030
gan+0.029
 reward+0.029
Neg logit contributions
 valleys-0.036
 sizes-0.033
 galaxies-0.032
 universe-0.032
 situations-0.032
Tokenooming
Token ID30602
Feature activation-1.40e-03

Pos logit contributions
 bye+0.032
rav+0.030
ia+0.029
cake+0.029
 timer+0.029
Neg logit contributions
 valleys-0.017
roads-0.015
unks-0.014
igh-0.014
neath-0.014
Token around
Token ID1088
Feature activation-6.19e-03

Pos logit contributions
 bye+0.003
rav+0.003
ia+0.002
gan+0.002
cake+0.002
Neg logit contributions
 valleys-0.002
 sizes-0.002
 galaxies-0.002
 universe-0.002
 planets-0.002
Token with
Token ID351
Feature activation-3.77e-03

Pos logit contributions
 bye+0.010
rav+0.010
ia+0.009
less+0.008
 reward+0.008
Neg logit contributions
 valleys-0.009
 sizes-0.009
 galaxies-0.009
 universe-0.009
 situations-0.009
Token it
Token ID340
Feature activation-1.80e-02

Pos logit contributions
 bye+0.006
rav+0.006
ia+0.006
gan+0.005
cake+0.005
Neg logit contributions
 valleys-0.006
 sizes-0.005
 galaxies-0.005
 universe-0.005
 planets-0.005
Token ever
Token ID1683
Feature activation-1.87e-02

Pos logit contributions
 bye+0.030
rav+0.029
ia+0.027
 reward+0.025
less+0.025
Neg logit contributions
 valleys-0.027
 sizes-0.025
 universe-0.024
 galaxies-0.024
 situations-0.024
Token since
Token ID1201
Feature activation+9.18e-02

Pos logit contributions
 bye+0.038
rav+0.038
ia+0.036
gan+0.035
cake+0.034
Neg logit contributions
 valleys-0.022
 sizes-0.020
 galaxies-0.019
 planets-0.019
 universe-0.019
Token.
Token ID13
Feature activation-5.93e-03

Pos logit contributions
 valleys+0.148
 sizes+0.139
 galaxies+0.135
 planets+0.133
 universe+0.132
Neg logit contributions
 bye-0.150
rav-0.135
ia-0.126
 reward-0.123
less-0.121
Token<|endoftext|>
Token ID50256
Feature activation+2.73e-02

Pos logit contributions
 bye+0.010
rav+0.010
ia+0.009
gan+0.009
less+0.009
Neg logit contributions
 valleys-0.008
 sizes-0.008
 universe-0.008
 sorts-0.008
 galaxies-0.007
TokenOne
Token ID3198
Feature activation+2.98e-02

Pos logit contributions
 valleys+0.037
 sizes+0.037
 galaxies+0.035
 situations+0.035
 planets+0.035
Neg logit contributions
 bye-0.047
rav-0.046
ia-0.043
cake-0.041
gan-0.041
Token night
Token ID1755
Feature activation-1.33e-02

Pos logit contributions
 valleys+0.036
 universe+0.033
 sizes+0.033
 sorts+0.033
 galaxies+0.032
Neg logit contributions
 bye-0.058
rav-0.057
ia-0.052
gan-0.051
cake-0.051
Token,
Token ID11
Feature activation+5.19e-03

Pos logit contributions
 bye+0.025
rav+0.025
ia+0.023
gan+0.023
cake+0.022
Neg logit contributions
 valleys-0.017
 sizes-0.015
 universe-0.015
 galaxies-0.015
 planets-0.015
Token had
Token ID550
Feature activation+2.13e-02

Pos logit contributions
 situations+0.049
roads+0.047
 valleys+0.044
 galaxies+0.040
 North+0.040
Neg logit contributions
ia-0.028
 bye-0.026
rav-0.026
 reward-0.025
 slice-0.025
Token many
Token ID867
Feature activation+3.31e-02

Pos logit contributions
roads+0.036
 situations+0.036
 valleys+0.033
 affect+0.032
 sorts+0.032
Neg logit contributions
 bye-0.028
ia-0.028
rav-0.026
gan-0.025
 reward-0.024
Token boxes
Token ID10559
Feature activation+3.43e-02

Pos logit contributions
roads+0.053
 valleys+0.051
 situations+0.049
field+0.048
 universe+0.046
Neg logit contributions
ia-0.045
 bye-0.043
 reward-0.041
rav-0.041
--0.041
Token of
Token ID286
Feature activation+1.57e-02

Pos logit contributions
 valleys+0.068
 situations+0.068
 galaxies+0.063
roads+0.063
 universe+0.062
Neg logit contributions
 bye-0.039
less-0.038
ia-0.036
rav-0.036
cake-0.036
Token different
Token ID1180
Feature activation+2.69e-02

Pos logit contributions
 valleys+0.025
roads+0.024
 situations+0.023
 galaxies+0.022
 universe+0.022
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.021
 reward-0.020
less-0.020
Token sizes
Token ID10620
Feature activation+9.13e-02

Pos logit contributions
roads+0.040
 valleys+0.040
 valley+0.037
 universe+0.036
 situations+0.035
Neg logit contributions
ia-0.039
rav-0.038
 bye-0.038
less-0.036
 reward-0.035
Token and
Token ID290
Feature activation+5.60e-03

Pos logit contributions
 valleys+0.172
 situations+0.160
roads+0.160
 galaxies+0.158
 universe+0.153
Neg logit contributions
ia-0.112
 bye-0.108
cake-0.106
less-0.104
rav-0.104
Token shapes
Token ID15268
Feature activation+3.26e-02

Pos logit contributions
roads+0.005
 valleys+0.005
 situations+0.005
 valley+0.005
 universe+0.005
Neg logit contributions
 bye-0.012
rav-0.011
ia-0.011
 reward-0.011
less-0.011
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
 valleys+0.053
 situations+0.051
roads+0.049
 universe+0.049
 galaxies+0.049
Neg logit contributions
 bye-0.048
rav-0.046
ia-0.046
cake-0.043
less-0.043
Token They
Token ID1119
Feature activation+2.42e-02

Pos logit contributions
 valleys+0.031
 sizes+0.029
 universe+0.027
 planets+0.027
 sorts+0.027
Neg logit contributions
 bye-0.042
rav-0.041
less-0.037
ia-0.037
 reward-0.037
Token used
Token ID973
Feature activation+5.84e-02

Pos logit contributions
 situations+0.047
roads+0.043
 valleys+0.043
 galaxies+0.038
 valley+0.038
Neg logit contributions
ia-0.029
gan-0.028
 bye-0.028
rav-0.027
 reward-0.027
Token she
Token ID673
Feature activation-1.14e-03

Pos logit contributions
 bye+0.007
rav+0.006
ia+0.006
cake+0.005
less+0.005
Neg logit contributions
 valleys-0.008
 sizes-0.008
 situations-0.008
 universe-0.008
 galaxies-0.008
Token had
Token ID550
Feature activation+1.19e-02

Pos logit contributions
 bye+0.002
rav+0.002
ia+0.002
gan+0.002
five+0.002
Neg logit contributions
 valleys-0.001
 sizes-0.001
 universe-0.001
 galaxies-0.001
 planets-0.001
Token grown
Token ID7334
Feature activation+1.11e-02

Pos logit contributions
 valleys+0.017
 sizes+0.016
 galaxies+0.015
 planets+0.015
 universe+0.015
Neg logit contributions
 bye-0.021
rav-0.021
ia-0.019
gan-0.018
five-0.018
Token two
Token ID734
Feature activation+5.04e-02

Pos logit contributions
 valleys+0.025
 sizes+0.024
 universe+0.023
 galaxies+0.023
 planets+0.023
Neg logit contributions
 bye-0.011
rav-0.010
ia-0.008
gan-0.008
cake-0.008
Token pounds
Token ID8059
Feature activation+6.71e-04

Pos logit contributions
 valleys+0.087
 universe+0.081
 galaxies+0.080
roads+0.079
 sizes+0.079
Neg logit contributions
 bye-0.071
rav-0.068
ia-0.063
 reward-0.061
less-0.060
Token since
Token ID1201
Feature activation+9.06e-02

Pos logit contributions
 valleys+0.001
 galaxies+0.001
 universe+0.001
 sizes+0.001
 planets+0.001
Neg logit contributions
 bye-0.001
rav-0.001
ia-0.001
less-0.001
gan-0.001
Token her
Token ID607
Feature activation-5.94e-03

Pos logit contributions
 valleys+0.135
 sizes+0.125
 universe+0.122
 galaxies+0.122
 planets+0.121
Neg logit contributions
 bye-0.155
rav-0.150
ia-0.137
gan-0.132
cake-0.130
Token last
Token ID938
Feature activation+2.35e-02

Pos logit contributions
 bye+0.009
rav+0.008
ia+0.008
gan+0.007
cake+0.007
Neg logit contributions
 valleys-0.010
 sizes-0.010
 universe-0.009
 galaxies-0.009
 planets-0.009
Token check
Token ID2198
Feature activation-1.86e-02

Pos logit contributions
 valleys+0.037
 sizes+0.035
 galaxies+0.034
 planets+0.034
 universe+0.033
Neg logit contributions
 bye-0.038
rav-0.036
ia-0.033
 reward-0.032
gan-0.032
Token-
Token ID12
Feature activation-6.33e-02

Pos logit contributions
 bye+0.031
rav+0.030
ia+0.028
less+0.027
gan+0.027
Neg logit contributions
 valleys-0.028
 sizes-0.026
 galaxies-0.026
 universe-0.026
 planets-0.025
Tokenup
Token ID929
Feature activation+4.61e-03

Pos logit contributions
 bye+0.101
rav+0.095
ia+0.088
 reward+0.084
gan+0.079
Neg logit contributions
 valleys-0.106
roads-0.095
 sizes-0.094
 galaxies-0.094
 sorts-0.094
Token for
Token ID329
Feature activation-2.29e-02

Pos logit contributions
rav+0.028
 bye+0.027
ia+0.026
gan+0.024
cake+0.024
Neg logit contributions
 valleys-0.025
 sorts-0.024
 sizes-0.023
 universe-0.023
 galaxies-0.023
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
rav+0.048
 bye+0.045
cake+0.043
eal+0.042
gan+0.042
Neg logit contributions
 valleys-0.027
 planets-0.025
 galaxies-0.024
 universe-0.024
 sorts-0.023
Token family
Token ID1641
Feature activation+1.86e-02

Pos logit contributions
rav+0.022
 bye+0.021
eal+0.019
ia+0.019
gan+0.019
Neg logit contributions
 universe-0.028
 valleys-0.026
 valley-0.026
 hill-0.026
 attic-0.026
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 valleys+0.032
 sizes+0.029
 galaxies+0.029
 situations+0.029
 universe+0.029
Neg logit contributions
 bye-0.028
rav-0.026
ia-0.024
less-0.023
 reward-0.023
Token ever
Token ID1683
Feature activation-2.08e-02

Pos logit contributions
 bye+0.022
rav+0.020
ia+0.018
 reward+0.018
gan+0.017
Neg logit contributions
 valleys-0.024
 sizes-0.022
 universe-0.022
 galaxies-0.022
 situations-0.022
Token since
Token ID1201
Feature activation+9.06e-02

Pos logit contributions
 bye+0.040
rav+0.039
ia+0.039
cake+0.036
gan+0.036
Neg logit contributions
 valleys-0.025
 sizes-0.024
 sorts-0.022
 universe-0.022
 galaxies-0.022
Token,
Token ID11
Feature activation+6.37e-03

Pos logit contributions
 valleys+0.146
 sizes+0.137
 galaxies+0.133
 situations+0.133
 universe+0.131
Neg logit contributions
 bye-0.150
rav-0.135
ia-0.125
 reward-0.121
gan-0.118
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
 valleys+0.010
 sizes+0.009
 universe+0.009
 galaxies+0.009
 planets+0.009
Neg logit contributions
 bye-0.011
rav-0.010
ia-0.009
gan-0.009
cake-0.009
Token boy
Token ID2933
Feature activation-1.01e-02

Pos logit contributions
rav+0.025
 bye+0.022
gan+0.021
five+0.020
ia+0.020
Neg logit contributions
 universe-0.031
 valleys-0.030
 valley-0.030
 attic-0.029
 hill-0.029
Token and
Token ID290
Feature activation-3.39e-02

Pos logit contributions
 bye+0.017
rav+0.015
ia+0.014
 reward+0.014
gan+0.014
Neg logit contributions
 valleys-0.016
 sizes-0.015
 galaxies-0.015
 situations-0.015
 universe-0.014
Token his
Token ID465
Feature activation-3.15e-02

Pos logit contributions
 bye+0.037
rav+0.026
 goodbye+0.023
 reward+0.023
ia+0.022
Neg logit contributions
 valleys-0.077
 sizes-0.074
 galaxies-0.070
roads-0.070
 universe-0.069
Token have
Token ID423
Feature activation-8.31e-03

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.007
gan+0.006
 reward+0.006
Neg logit contributions
 valleys-0.005
 sizes-0.004
 situations-0.004
 galaxies-0.004
 universe-0.004
Token been
Token ID587
Feature activation+1.07e-02

Pos logit contributions
 bye+0.015
rav+0.015
ia+0.013
cake+0.013
gan+0.013
Neg logit contributions
 valleys-0.012
 sizes-0.011
 galaxies-0.010
 planets-0.010
 universe-0.010
Token best
Token ID1266
Feature activation+2.49e-03

Pos logit contributions
 valleys+0.016
 sizes+0.014
 universe+0.014
 sorts+0.014
 planets+0.014
Neg logit contributions
 bye-0.019
rav-0.018
ia-0.017
cake-0.016
gan-0.016
Token friends
Token ID2460
Feature activation+1.42e-02

Pos logit contributions
 valleys+0.004
 sizes+0.004
 universe+0.004
 planets+0.004
 galaxies+0.004
Neg logit contributions
rav-0.003
 bye-0.003
cake-0.003
gan-0.003
ia-0.003
Token ever
Token ID1683
Feature activation-2.39e-02

Pos logit contributions
 valleys+0.022
 sizes+0.020
 situations+0.020
 galaxies+0.020
 universe+0.019
Neg logit contributions
 bye-0.024
rav-0.023
ia-0.021
gan-0.020
less-0.020
Token since
Token ID1201
Feature activation+9.02e-02

Pos logit contributions
 bye+0.042
rav+0.040
ia+0.037
gan+0.036
cake+0.035
Neg logit contributions
 valleys-0.035
 sizes-0.033
 galaxies-0.032
 universe-0.032
 planets-0.032
Token.
Token ID13
Feature activation-3.40e-03

Pos logit contributions
 valleys+0.140
 sizes+0.129
 galaxies+0.127
 situations+0.125
 universe+0.125
Neg logit contributions
 bye-0.152
rav-0.142
ia-0.131
 reward-0.126
gan-0.123
Token They
Token ID1119
Feature activation+8.06e-03

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
less+0.006
gan+0.006
Neg logit contributions
 valleys-0.004
 sizes-0.004
 universe-0.003
 galaxies-0.003
 planets-0.003
Token have
Token ID423
Feature activation-8.41e-03

Pos logit contributions
 valleys+0.012
 situations+0.012
roads+0.011
 sizes+0.011
 galaxies+0.011
Neg logit contributions
 bye-0.014
rav-0.013
 reward-0.012
ia-0.012
five-0.011
Token many
Token ID867
Feature activation+2.21e-02

Pos logit contributions
 bye+0.015
rav+0.014
ia+0.013
gan+0.013
cake+0.013
Neg logit contributions
 valleys-0.012
 sizes-0.011
 galaxies-0.011
 universe-0.011
 planets-0.011
Token more
Token ID517
Feature activation+1.28e-02

Pos logit contributions
 valleys+0.041
 sizes+0.038
 universe+0.038
 galaxies+0.038
 planets+0.037
Neg logit contributions
 bye-0.030
rav-0.028
ia-0.025
 reward-0.024
gan-0.023
Token.
Token ID13
Feature activation+2.72e-02

Pos logit contributions
 valleys+0.036
 sizes+0.033
 galaxies+0.033
 situations+0.032
 universe+0.032
Neg logit contributions
 bye-0.043
rav-0.041
ia-0.038
gan-0.037
less-0.037
Token They
Token ID1119
Feature activation+2.15e-02

Pos logit contributions
 valleys+0.037
 situations+0.035
 galaxies+0.035
 universe+0.034
 sizes+0.034
Neg logit contributions
 bye-0.050
rav-0.046
ia-0.044
gan-0.043
five-0.042
Token had
Token ID550
Feature activation+2.01e-02

Pos logit contributions
 situations+0.041
roads+0.039
 valleys+0.038
 hallway+0.035
 basement+0.035
Neg logit contributions
 bye-0.031
rav-0.027
ia-0.025
By-0.024
gan-0.024
Token been
Token ID587
Feature activation+2.62e-02

Pos logit contributions
 valleys+0.033
 sizes+0.030
 situations+0.030
 galaxies+0.030
 universe+0.030
Neg logit contributions
 bye-0.031
rav-0.029
ia-0.027
gan-0.026
 reward-0.025
Token friends
Token ID2460
Feature activation+3.59e-02

Pos logit contributions
 valleys+0.032
 sizes+0.029
 planets+0.028
 sorts+0.027
 universe+0.027
Neg logit contributions
 bye-0.052
rav-0.051
ia-0.047
cake-0.047
five-0.046
Token since
Token ID1201
Feature activation+9.00e-02

Pos logit contributions
 valleys+0.074
 situations+0.071
 sizes+0.066
roads+0.066
 galaxies+0.066
Neg logit contributions
 bye-0.044
less-0.039
rav-0.037
 goodbye-0.036
 nicely-0.035
Token they
Token ID484
Feature activation+9.36e-03

Pos logit contributions
 valleys+0.152
 situations+0.143
 sizes+0.140
 galaxies+0.138
 universe+0.136
Neg logit contributions
 bye-0.141
rav-0.126
ia-0.119
gan-0.113
 reward-0.110
Token were
Token ID547
Feature activation+1.56e-02

Pos logit contributions
 valleys+0.014
 situations+0.013
 sizes+0.013
roads+0.013
 galaxies+0.012
Neg logit contributions
 bye-0.016
rav-0.015
 reward-0.014
ia-0.013
five-0.013
Token babies
Token ID11903
Feature activation+1.63e-02

Pos logit contributions
 valleys+0.022
 sizes+0.020
 universe+0.020
 galaxies+0.020
 planets+0.020
Neg logit contributions
 bye-0.028
rav-0.027
ia-0.025
gan-0.024
cake-0.024
Token and
Token ID290
Feature activation-4.03e-03

Pos logit contributions
 valleys+0.027
 situations+0.025
roads+0.025
 galaxies+0.024
 universe+0.024
Neg logit contributions
 bye-0.025
rav-0.023
less-0.023
ia-0.022
gan-0.022
Token were
Token ID547
Feature activation+1.18e-02

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
gan+0.006
 reward+0.006
Neg logit contributions
 valleys-0.006
roads-0.006
 situations-0.005
 universe-0.005
 galaxies-0.005
Token that
Token ID326
Feature activation+1.72e-02

Pos logit contributions
 valleys+0.041
 sizes+0.041
 situations+0.039
 sorts+0.038
 universe+0.038
Neg logit contributions
 bye-0.038
rav-0.032
ia-0.031
gan-0.029
 reward-0.029
Token day
Token ID1110
Feature activation-1.32e-02

Pos logit contributions
 valleys+0.033
 sizes+0.031
 galaxies+0.030
 situations+0.030
 planets+0.029
Neg logit contributions
 bye-0.025
rav-0.023
ia-0.021
 reward-0.020
By-0.018
Token on
Token ID319
Feature activation-1.48e-02

Pos logit contributions
 bye+0.025
rav+0.024
ia+0.021
less+0.021
 reward+0.021
Neg logit contributions
 valleys-0.018
 sizes-0.018
 galaxies-0.017
 planets-0.016
 sorts-0.016
Token,
Token ID11
Feature activation-9.41e-03

Pos logit contributions
 bye+0.019
rav+0.016
ia+0.015
 reward+0.015
 goodbye+0.014
Neg logit contributions
 sizes-0.029
 sorts-0.029
 valleys-0.029
 galaxies-0.028
 stations-0.028
Token Bun
Token ID28515
Feature activation-1.24e-02

Pos logit contributions
 bye+0.012
rav+0.010
By+0.010
 okay+0.010
 reward+0.010
Neg logit contributions
roads-0.017
 situations-0.017
 valleys-0.017
 sorts-0.017
 sizes-0.016
Token-
Token ID12
Feature activation-1.13e-01

Pos logit contributions
 bye+0.019
rav+0.019
ia+0.016
less+0.016
gan+0.015
Neg logit contributions
 valleys-0.020
 sizes-0.019
 universe-0.019
 situations-0.019
 galaxies-0.019
TokenB
Token ID33
Feature activation-2.17e-02

Pos logit contributions
 bye+0.192
rav+0.186
ia+0.171
gan+0.164
 reward+0.163
Neg logit contributions
 valleys-0.166
 sizes-0.154
 galaxies-0.151
 universe-0.150
 planets-0.149
Tokenun
Token ID403
Feature activation+1.31e-02

Pos logit contributions
 bye+0.045
rav+0.044
ia+0.040
gan+0.039
less+0.039
Neg logit contributions
 valleys-0.025
 sizes-0.022
 galaxies-0.022
 universe-0.022
 planets-0.022
Token didn
Token ID1422
Feature activation+2.65e-03

Pos logit contributions
 valleys+0.020
 sizes+0.018
 universe+0.018
 galaxies+0.018
 planets+0.018
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.019
gan-0.019
less-0.019
Token't
Token ID470
Feature activation+4.09e-03

Pos logit contributions
 valleys+0.004
 sizes+0.004
 galaxies+0.004
 planets+0.004
 sorts+0.004
Neg logit contributions
 bye-0.004
rav-0.004
ia-0.004
cake-0.004
 reward-0.004
Token want
Token ID765
Feature activation-2.00e-02

Pos logit contributions
 valleys+0.007
 galaxies+0.006
 planets+0.006
 sizes+0.006
roads+0.006
Neg logit contributions
 bye-0.007
rav-0.006
ia-0.006
 reward-0.006
five-0.005
Token.
Token ID13
Feature activation-3.05e-03

Pos logit contributions
rav+0.007
 bye+0.007
ia+0.006
 reward+0.006
eal+0.006
Neg logit contributions
 valleys-0.004
 sizes-0.003
 galaxies-0.003
 situations-0.003
 sorts-0.003
Token
Token ID198
Feature activation+3.61e-03

Pos logit contributions
rav+0.006
 bye+0.006
less+0.006
ia+0.005
 reward+0.005
Neg logit contributions
 valleys-0.004
 sorts-0.003
roads-0.003
 sizes-0.003
 universe-0.003
Token
Token ID198
Feature activation+7.70e-03

Pos logit contributions
 valleys+0.005
roads+0.004
 universe+0.004
 sorts+0.004
 galaxies+0.004
Neg logit contributions
 bye-0.007
rav-0.007
less-0.006
 reward-0.006
gan-0.006
TokenThe
Token ID464
Feature activation+2.26e-03

Pos logit contributions
 valleys+0.009
roads+0.008
 sorts+0.008
 galaxies+0.008
 universe+0.007
Neg logit contributions
 bye-0.016
rav-0.015
less-0.014
 reward-0.014
ia-0.014
Token grand
Token ID4490
Feature activation-7.03e-03

Pos logit contributions
 valleys+0.003
 universe+0.003
 valley+0.003
 planets+0.003
 galaxies+0.003
Neg logit contributions
rav-0.004
 bye-0.004
gan-0.004
less-0.003
ia-0.003
Token-
Token ID12
Feature activation-1.09e-01

Pos logit contributions
rav+0.013
 bye+0.013
less+0.012
By+0.012
gan+0.012
Neg logit contributions
 planets-0.008
 universe-0.008
 valleys-0.008
 hill-0.008
 situations-0.008
Tokendaughter
Token ID29642
Feature activation-8.58e-03

Pos logit contributions
 bye+0.194
rav+0.183
 reward+0.175
eal+0.161
 sandwich+0.157
Neg logit contributions
 sorts-0.147
 valleys-0.145
roads-0.143
 galaxies-0.132
 kinds-0.131
Token cl
Token ID537
Feature activation-3.22e-02

Pos logit contributions
rav+0.016
 bye+0.016
less+0.014
 reward+0.014
rost+0.014
Neg logit contributions
 valleys-0.011
 planets-0.010
 galaxies-0.010
 sizes-0.010
 universe-0.010
Tokenapped
Token ID6320
Feature activation+1.41e-02

Pos logit contributions
 bye+0.050
rav+0.049
 reward+0.045
 nicer+0.043
 timer+0.042
Neg logit contributions
 valleys-0.049
 sorts-0.048
tails-0.045
bowl-0.045
roads-0.045
Token her
Token ID607
Feature activation-2.75e-03

Pos logit contributions
 valleys+0.020
 sizes+0.018
 galaxies+0.018
 sorts+0.018
 planets+0.018
Neg logit contributions
rav-0.025
 bye-0.024
ia-0.022
gan-0.022
cake-0.022
Token hands
Token ID2832
Feature activation+1.95e-02

Pos logit contributions
rav+0.006
 bye+0.005
cake+0.005
ia+0.005
less+0.005
Neg logit contributions
 valleys-0.003
 sizes-0.003
 planets-0.003
 galaxies-0.003
 situations-0.003
Token surprise
Token ID5975
Feature activation-1.62e-02

Pos logit contributions
 valleys+0.020
 universe+0.019
 sizes+0.019
 planets+0.018
 galaxies+0.018
Neg logit contributions
 bye-0.027
rav-0.026
ia-0.024
gan-0.023
cake-0.023
Token.
Token ID13
Feature activation-4.04e-03

Pos logit contributions
 bye+0.029
rav+0.029
ia+0.026
gan+0.025
five+0.025
Neg logit contributions
 valleys-0.022
 sizes-0.021
 universe-0.020
 planets-0.020
 sorts-0.020
Token But
Token ID887
Feature activation+1.25e-02

Pos logit contributions
 bye+0.007
rav+0.006
 reward+0.005
gan+0.005
By+0.005
Neg logit contributions
 valleys-0.006
 sizes-0.006
 situations-0.006
 universe-0.006
 planets-0.006
Token when
Token ID618
Feature activation+2.53e-02

Pos logit contributions
 valleys+0.020
 sizes+0.019
 sorts+0.019
 situations+0.019
 universe+0.019
Neg logit contributions
 bye-0.020
rav-0.018
 reward-0.017
less-0.016
By-0.016
Token Bun
Token ID28515
Feature activation-1.48e-02

Pos logit contributions
 sizes+0.051
 sorts+0.049
 kinds+0.047
 valleys+0.047
 situations+0.046
Neg logit contributions
 bye-0.034
rav-0.028
 reward-0.027
By-0.026
 goodbye-0.024
Token-
Token ID12
Feature activation-1.08e-01

Pos logit contributions
 bye+0.025
rav+0.024
ia+0.022
gan+0.022
cake+0.021
Neg logit contributions
 valleys-0.022
 sizes-0.020
 galaxies-0.020
 universe-0.020
 situations-0.020
TokenB
Token ID33
Feature activation-2.62e-02

Pos logit contributions
 bye+0.211
rav+0.206
ia+0.190
gan+0.184
cake+0.183
Neg logit contributions
 valleys-0.134
 sizes-0.120
 universe-0.118
 galaxies-0.118
 planets-0.117
Tokenun
Token ID403
Feature activation+7.87e-03

Pos logit contributions
 bye+0.054
rav+0.053
ia+0.049
gan+0.048
less+0.047
Neg logit contributions
 valleys-0.029
 sizes-0.026
 galaxies-0.026
 universe-0.026
 planets-0.025
Token got
Token ID1392
Feature activation+3.52e-04

Pos logit contributions
 valleys+0.011
 sizes+0.010
 universe+0.010
 galaxies+0.010
 planets+0.010
Neg logit contributions
 bye-0.014
rav-0.013
ia-0.012
gan-0.012
cake-0.012
Token to
Token ID284
Feature activation-7.95e-03

Pos logit contributions
 valleys+0.001
 sizes+0.001
 galaxies+0.001
 universe+0.001
 planets+0.001
Neg logit contributions
 bye-0.001
rav-0.001
ia-0.000
gan-0.000
less-0.000
Token the
Token ID262
Feature activation-2.72e-04

Pos logit contributions
 bye+0.013
rav+0.011
 reward+0.011
ia+0.011
five+0.010
Neg logit contributions
 valleys-0.013
 sizes-0.013
 galaxies-0.012
 planets-0.012
 universe-0.011
Token they
Token ID484
Feature activation+1.15e-02

Pos logit contributions
 valleys+0.011
 sizes+0.010
 planets+0.010
 galaxies+0.010
 universe+0.010
Neg logit contributions
 bye-0.013
rav-0.013
ia-0.012
gan-0.011
 reward-0.011
Token did
Token ID750
Feature activation+4.16e-02

Pos logit contributions
 valleys+0.015
 sizes+0.014
 situations+0.013
 galaxies+0.013
 universe+0.013
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.019
 reward-0.019
gan-0.019
Token.
Token ID13
Feature activation-8.93e-03

Pos logit contributions
 valleys+0.068
 sizes+0.061
 galaxies+0.061
 universe+0.061
 situations+0.061
Neg logit contributions
 bye-0.065
rav-0.063
ia-0.059
less-0.056
 reward-0.055
Token Ch
Token ID609
Feature activation-1.18e-04

Pos logit contributions
 bye+0.016
rav+0.016
ia+0.015
gan+0.014
cake+0.014
Neg logit contributions
 valleys-0.012
 sizes-0.011
 galaxies-0.011
 universe-0.011
 planets-0.011
Tokenoo
Token ID2238
Feature activation-4.97e-02

Pos logit contributions
 bye+0.000
rav+0.000
cake+0.000
 reward+0.000
ia+0.000
Neg logit contributions
 valleys-0.000
 sizes-0.000
 situations-0.000
 universe-0.000
 planets-0.000
Token-
Token ID12
Feature activation-1.07e-01

Pos logit contributions
rav+0.073
 bye+0.072
gan+0.065
ia+0.065
 reward+0.065
Neg logit contributions
 valleys-0.078
 sizes-0.075
 planets-0.070
 galaxies-0.069
 sorts-0.069
TokenCh
Token ID1925
Feature activation-3.27e-04

Pos logit contributions
 bye+0.191
 reward+0.180
rav+0.174
ia+0.172
 apology+0.165
Neg logit contributions
 valleys-0.128
tails-0.125
 sizes-0.122
 sorts-0.119
roads-0.112
Tokenoo
Token ID2238
Feature activation-4.84e-02

Pos logit contributions
 bye+0.000
rav+0.000
cake+0.000
 reward+0.000
five+0.000
Neg logit contributions
 valleys-0.001
 situations-0.001
 sizes-0.001
roads-0.001
 universe-0.001
Token was
Token ID373
Feature activation+5.15e-03

Pos logit contributions
rav+0.069
 bye+0.067
ia+0.062
gan+0.062
 reward+0.062
Neg logit contributions
 valleys-0.077
 sizes-0.074
tails-0.069
 planets-0.069
 galaxies-0.068
Token so
Token ID523
Feature activation-5.96e-03

Pos logit contributions
 valleys+0.007
 sizes+0.006
 universe+0.006
 galaxies+0.006
 situations+0.006
Neg logit contributions
 bye-0.010
rav-0.010
ia-0.009
gan-0.009
cake-0.008
Token happy
Token ID3772
Feature activation-3.83e-02

Pos logit contributions
 bye+0.008
rav+0.008
cake+0.008
 timer+0.008
 reward+0.007
Neg logit contributions
 sorts-0.010
 valleys-0.010
 sizes-0.009
 universe-0.009
 kinds-0.009
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
 bye+0.020
rav+0.017
ia+0.016
 reward+0.016
 goodbye+0.015
Neg logit contributions
 sorts-0.030
 valleys-0.030
 sizes-0.029
roads-0.029
 galaxies-0.029
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
 bye+0.018
rav+0.016
By+0.015
 reward+0.015
less+0.014
Neg logit contributions
roads-0.025
 situations-0.024
 valleys-0.023
 lens-0.022
 basement-0.022
Token bird
Token ID6512
Feature activation-4.59e-02

Pos logit contributions
rav+0.026
 bye+0.026
ia+0.022
gan+0.022
cake+0.022
Neg logit contributions
 valleys-0.030
 universe-0.028
 planets-0.028
 sizes-0.028
 valley-0.027
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
 bye+0.074
rav+0.070
ia+0.067
gan+0.060
 reward+0.060
Neg logit contributions
 valleys-0.071
 situations-0.066
 sizes-0.066
 universe-0.066
 galaxies-0.065
Token Bun
Token ID28515
Feature activation+6.84e-03

Pos logit contributions
 bye+0.070
rav+0.062
By+0.054
 reward+0.053
ia+0.049
Neg logit contributions
roads-0.070
 valleys-0.069
 situations-0.067
 basement-0.064
 sizes-0.064
Token-
Token ID12
Feature activation-1.06e-01

Pos logit contributions
 valleys+0.010
 universe+0.010
 situations+0.010
roads+0.010
 galaxies+0.009
Neg logit contributions
 bye-0.011
rav-0.011
less-0.010
ia-0.010
eal-0.009
TokenB
Token ID33
Feature activation-1.18e-02

Pos logit contributions
 bye+0.189
rav+0.181
ia+0.167
gan+0.161
 reward+0.161
Neg logit contributions
 valleys-0.152
 sizes-0.140
 galaxies-0.137
 universe-0.136
 planets-0.136
Tokenun
Token ID403
Feature activation+2.87e-02

Pos logit contributions
 bye+0.025
rav+0.024
ia+0.022
gan+0.022
less+0.021
Neg logit contributions
 valleys-0.013
 sizes-0.012
 universe-0.011
 galaxies-0.011
 planets-0.011
Token became
Token ID2627
Feature activation+2.46e-02

Pos logit contributions
 valleys+0.036
 situations+0.034
 universe+0.034
 attic+0.033
 basement+0.033
Neg logit contributions
 bye-0.055
rav-0.055
five-0.048
less-0.047
ia-0.047
Token the
Token ID262
Feature activation-3.33e-03

Pos logit contributions
 valleys+0.045
roads+0.045
 situations+0.044
 galaxies+0.043
 sizes+0.043
Neg logit contributions
 bye-0.031
rav-0.029
 reward-0.025
ia-0.025
gan-0.025
Token best
Token ID1266
Feature activation-2.11e-02

Pos logit contributions
 bye+0.005
rav+0.005
 reward+0.005
ia+0.005
gan+0.005
Neg logit contributions
 valleys-0.005
 sizes-0.005
 situations-0.005
 galaxies-0.005
roads-0.005
Token castle
Token ID16669
Feature activation-1.95e-02

Pos logit contributions
 bye+0.036
rav+0.036
gan+0.033
cake+0.032
five+0.032
Neg logit contributions
 valleys-0.025
 universe-0.023
 planets-0.023
 galaxies-0.022
 valley-0.022
Token was
Token ID373
Feature activation+2.87e-02

Pos logit contributions
 bye+0.032
rav+0.031
ia+0.029
 reward+0.028
cake+0.028
Neg logit contributions
 valleys-0.030
 galaxies-0.027
 sizes-0.027
 universe-0.026
 situations-0.026
Token destroyed
Token ID6572
Feature activation+1.84e-02

Pos logit contributions
 valleys+0.041
 sizes+0.039
 universe+0.038
 planets+0.038
 galaxies+0.038
Neg logit contributions
 bye-0.050
rav-0.050
ia-0.044
five-0.043
gan-0.042
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
 valleys+0.030
 sizes+0.028
 galaxies+0.028
roads+0.027
 universe+0.027
Neg logit contributions
 bye-0.027
rav-0.026
ia-0.024
less-0.024
gan-0.023
Token no
Token ID645
Feature activation+2.05e-03

Pos logit contributions
 bye+0.021
rav+0.020
ia+0.018
less+0.018
gan+0.018
Neg logit contributions
 valleys-0.021
 sizes-0.019
 universe-0.019
 galaxies-0.018
 situations-0.018
Token-
Token ID12
Feature activation-1.05e-01

Pos logit contributions
 valleys+0.003
 sizes+0.003
 universe+0.003
 planets+0.003
 galaxies+0.003
Neg logit contributions
 bye-0.003
rav-0.003
ia-0.003
gan-0.003
cake-0.003
Tokenone
Token ID505
Feature activation+6.89e-03

Pos logit contributions
 bye+0.169
rav+0.161
ia+0.151
 reward+0.143
gan+0.142
Neg logit contributions
 valleys-0.171
 sizes-0.155
 galaxies-0.154
 planets-0.151
 sorts-0.151
Token was
Token ID373
Feature activation+1.99e-02

Pos logit contributions
 valleys+0.011
 sizes+0.010
 galaxies+0.010
 universe+0.010
 planets+0.010
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.010
gan-0.009
less-0.009
Token able
Token ID1498
Feature activation-2.05e-02

Pos logit contributions
 valleys+0.025
 sizes+0.022
 universe+0.021
 planets+0.021
 galaxies+0.021
Neg logit contributions
rav-0.040
 bye-0.039
ia-0.036
five-0.034
cake-0.034
Token to
Token ID284
Feature activation-1.37e-02

Pos logit contributions
 bye+0.032
rav+0.031
ia+0.029
cake+0.027
gan+0.027
Neg logit contributions
 valleys-0.032
 sizes-0.031
 situations-0.030
 sorts-0.030
 universe-0.029
Token help
Token ID1037
Feature activation+7.33e-03

Pos logit contributions
 bye+0.021
rav+0.020
 reward+0.018
ia+0.018
gan+0.018
Neg logit contributions
 valleys-0.023
 sizes-0.021
 galaxies-0.021
 planets-0.020
 sorts-0.020
Token Hours
Token ID19347
Feature activation+1.11e-02

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.009
gan+0.008
less+0.008
Neg logit contributions
 valleys-0.008
 sizes-0.007
 universe-0.007
 galaxies-0.007
 sorts-0.007
Token passed
Token ID3804
Feature activation+5.54e-03

Pos logit contributions
 valleys+0.020
roads+0.018
 universe+0.018
 situations+0.018
 sorts+0.018
Neg logit contributions
 bye-0.015
rav-0.015
ia-0.014
gan-0.013
cake-0.013
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
 valleys+0.008
 sizes+0.008
 galaxies+0.007
 universe+0.007
 planets+0.007
Neg logit contributions
 bye-0.010
rav-0.009
ia-0.008
gan-0.008
cake-0.008
Token but
Token ID475
Feature activation+1.21e-02

Pos logit contributions
 bye+0.021
rav+0.020
ia+0.019
gan+0.018
cake+0.018
Neg logit contributions
 valleys-0.014
 sizes-0.013
 galaxies-0.013
 universe-0.013
 planets-0.013
Token no
Token ID645
Feature activation-3.64e-03

Pos logit contributions
 valleys+0.018
 universe+0.016
 sizes+0.016
 galaxies+0.016
 planets+0.016
Neg logit contributions
 bye-0.021
rav-0.021
ia-0.019
cake-0.018
five-0.018
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
rav+0.007
 bye+0.007
ia+0.006
cake+0.006
gan+0.006
Neg logit contributions
 valleys-0.005
 sizes-0.005
 universe-0.005
 planets-0.004
 galaxies-0.004
Tokenone
Token ID505
Feature activation+2.98e-03

Pos logit contributions
 bye+0.170
rav+0.162
ia+0.153
 reward+0.142
gan+0.142
Neg logit contributions
 valleys-0.167
 galaxies-0.149
 sorts-0.149
roads-0.149
 situations-0.148
Token came
Token ID1625
Feature activation-3.94e-02

Pos logit contributions
 valleys+0.004
 sizes+0.004
 universe+0.004
 galaxies+0.004
 planets+0.004
Neg logit contributions
 bye-0.005
rav-0.005
ia-0.005
gan-0.004
cake-0.004
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 bye+0.056
rav+0.054
ia+0.048
cake+0.046
five+0.046
Neg logit contributions
 valleys-0.071
 sizes-0.065
 galaxies-0.064
 universe-0.064
 planets-0.064
Token it
Token ID340
Feature activation-3.45e-02

Pos logit contributions
rav+0.045
 bye+0.044
ia+0.041
cake+0.038
eal+0.038
Neg logit contributions
 valleys-0.036
 galaxies-0.032
 planets-0.031
 universe-0.031
roads-0.031
Token started
Token ID2067
Feature activation+1.67e-02

Pos logit contributions
 bye+0.070
rav+0.068
five+0.062
ia+0.060
cake+0.059
Neg logit contributions
 valleys-0.042
 sizes-0.037
 galaxies-0.035
 planets-0.035
roads-0.035
Token.
Token ID13
Feature activation-2.18e-03

Pos logit contributions
 bye+0.024
rav+0.021
ia+0.019
less+0.019
gan+0.018
Neg logit contributions
 valleys-0.026
 sizes-0.025
roads-0.024
 universe-0.024
 galaxies-0.024
Token One
Token ID1881
Feature activation+2.51e-02

Pos logit contributions
 bye+0.004
rav+0.003
ia+0.003
 reward+0.003
Of+0.003
Neg logit contributions
 valleys-0.003
 situations-0.003
roads-0.003
 galaxies-0.003
 sizes-0.003
Token day
Token ID1110
Feature activation-1.05e-02

Pos logit contributions
 valleys+0.025
 universe+0.024
 sorts+0.024
 sizes+0.023
 planets+0.022
Neg logit contributions
rav-0.053
 bye-0.052
gan-0.048
cake-0.047
five-0.047
Token,
Token ID11
Feature activation+1.96e-03

Pos logit contributions
 bye+0.017
rav+0.016
less+0.014
ia+0.014
 reward+0.014
Neg logit contributions
 valleys-0.017
 sizes-0.016
 sorts-0.016
 galaxies-0.016
roads-0.016
Token Bun
Token ID28515
Feature activation+1.48e-03

Pos logit contributions
 valleys+0.004
 situations+0.003
 sizes+0.003
roads+0.003
 sorts+0.003
Neg logit contributions
 bye-0.003
rav-0.002
 reward-0.002
By-0.002
less-0.002
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
 valleys+0.003
 sizes+0.002
roads+0.002
 universe+0.002
 situations+0.002
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.002
less-0.002
gan-0.002
TokenB
Token ID33
Feature activation-1.21e-02

Pos logit contributions
 bye+0.160
rav+0.154
ia+0.138
less+0.134
gan+0.134
Neg logit contributions
 valleys-0.174
 sizes-0.162
 situations-0.159
 galaxies-0.157
 universe-0.157
Tokenun
Token ID403
Feature activation+2.76e-02

Pos logit contributions
rav+0.023
 bye+0.023
less+0.021
ia+0.021
gan+0.020
Neg logit contributions
 valleys-0.016
roads-0.014
 sizes-0.014
 galaxies-0.014
 planets-0.014
Token met
Token ID1138
Feature activation-2.98e-02

Pos logit contributions
 valleys+0.044
 sizes+0.041
roads+0.041
 situations+0.041
 universe+0.041
Neg logit contributions
 bye-0.045
rav-0.042
ia-0.039
gan-0.037
less-0.036
Token a
Token ID257
Feature activation-3.29e-02

Pos logit contributions
 bye+0.048
rav+0.046
ia+0.042
gan+0.040
cake+0.039
Neg logit contributions
 valleys-0.048
 sizes-0.044
 universe-0.043
 galaxies-0.043
 planets-0.043
Token little
Token ID1310
Feature activation+8.67e-03

Pos logit contributions
rav+0.045
 bye+0.045
ia+0.040
gan+0.040
five+0.040
Neg logit contributions
 valleys-0.057
 universe-0.056
 valley-0.054
 sizes-0.053
 galaxies-0.053
TokenRex
Token ID47389
Feature activation+1.16e-02

Pos logit contributions
 bye+0.110
rav+0.103
ia+0.099
 reward+0.095
 timer+0.092
Neg logit contributions
 valleys-0.115
 sorts-0.108
roads-0.105
tails-0.105
 sizes-0.104
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 valleys+0.015
 sizes+0.014
 universe+0.013
 planets+0.013
 galaxies+0.013
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.020
gan-0.019
less-0.018
Token Unfortunately
Token ID8989
Feature activation+1.45e-02

Pos logit contributions
 bye+0.019
rav+0.018
ia+0.017
gan+0.016
cake+0.016
Neg logit contributions
 valleys-0.014
 sizes-0.013
 galaxies-0.013
 universe-0.013
 sorts-0.013
Token,
Token ID11
Feature activation-3.81e-03

Pos logit contributions
 valleys+0.017
 sizes+0.016
 situations+0.015
 galaxies+0.015
 planets+0.015
Neg logit contributions
 bye-0.028
rav-0.028
cake-0.026
ia-0.026
gan-0.025
Token T
Token ID309
Feature activation-1.91e-02

Pos logit contributions
 bye+0.006
rav+0.006
ia+0.006
gan+0.005
cake+0.005
Neg logit contributions
 valleys-0.006
 sizes-0.005
 universe-0.005
 galaxies-0.005
 planets-0.005
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
 slice+0.032
 timer+0.032
cake+0.030
rav+0.030
 bye+0.029
Neg logit contributions
 universe-0.022
 planets-0.022
 North-0.021
 valleys-0.021
tails-0.021
TokenRex
Token ID47389
Feature activation-1.85e-02

Pos logit contributions
 bye+0.161
rav+0.157
 reward+0.144
 timer+0.139
eal+0.136
Neg logit contributions
 valleys-0.156
 sizes-0.152
 sorts-0.148
tails-0.146
 galaxies-0.144
Token used
Token ID973
Feature activation+4.25e-02

Pos logit contributions
 bye+0.034
rav+0.034
ia+0.030
five+0.030
cake+0.029
Neg logit contributions
 valleys-0.024
 sizes-0.022
tails-0.021
 planets-0.021
 galaxies-0.020
Token D
Token ID360
Feature activation+1.32e-02

Pos logit contributions
 valleys+0.069
 sizes+0.065
 sorts+0.062
 galaxies+0.062
 planets+0.062
Neg logit contributions
 bye-0.067
rav-0.066
ia-0.059
gan-0.057
cake-0.057
Tokenippy
Token ID41214
Feature activation+1.75e-03

Pos logit contributions
 situations+0.018
 valleys+0.017
 sizes+0.016
orns+0.016
 universe+0.016
Neg logit contributions
 timer-0.021
 slice-0.020
 bye-0.019
cake-0.018
rav-0.017
Token's
Token ID338
Feature activation+1.04e-02

Pos logit contributions
 valleys+0.003
 sizes+0.003
 planets+0.003
 galaxies+0.003
 situations+0.003
Neg logit contributions
 bye-0.003
rav-0.003
ia-0.002
cake-0.002
gan-0.002
Token on
Token ID319
Feature activation+4.80e-03

Pos logit contributions
 bye+0.006
rav+0.006
ia+0.005
gan+0.005
 reward+0.005
Neg logit contributions
 valleys-0.005
 sizes-0.005
 universe-0.005
 galaxies-0.005
 situations-0.005
Token adventures
Token ID17545
Feature activation+2.55e-03

Pos logit contributions
 valleys+0.006
 sizes+0.006
 situations+0.006
roads+0.006
 galaxies+0.006
Neg logit contributions
 bye-0.009
rav-0.008
ia-0.008
 reward-0.007
gan-0.007
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
 valleys+0.004
 situations+0.004
 universe+0.004
 galaxies+0.004
 sizes+0.004
Neg logit contributions
 bye-0.004
rav-0.003
ia-0.003
five-0.003
less-0.003
Token Ch
Token ID609
Feature activation+7.45e-03

Pos logit contributions
 bye+0.011
rav+0.011
ia+0.010
gan+0.009
 reward+0.009
Neg logit contributions
 valleys-0.009
 situations-0.009
 sizes-0.009
 universe-0.008
 galaxies-0.008
Tokenoo
Token ID2238
Feature activation-4.34e-02

Pos logit contributions
 valleys+0.017
 sizes+0.016
 situations+0.016
 universe+0.016
 planets+0.015
Neg logit contributions
 bye-0.007
rav-0.006
ia-0.005
cake-0.005
 reward-0.005
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
rav+0.070
 bye+0.070
ia+0.063
 reward+0.063
gan+0.063
Neg logit contributions
 valleys-0.063
 sizes-0.060
 planets-0.057
 galaxies-0.056
 sorts-0.056
TokenCh
Token ID1925
Feature activation+7.80e-03

Pos logit contributions
 bye+0.200
 reward+0.192
rav+0.187
ia+0.183
 apology+0.174
Neg logit contributions
 valleys-0.115
 sizes-0.107
tails-0.105
 sorts-0.102
roads-0.099
Tokenoo
Token ID2238
Feature activation-4.21e-02

Pos logit contributions
 valleys+0.017
 sizes+0.016
 situations+0.016
 universe+0.016
roads+0.016
Neg logit contributions
 bye-0.007
rav-0.007
cake-0.005
 reward-0.005
ia-0.005
Token was
Token ID373
Feature activation+7.53e-03

Pos logit contributions
rav+0.068
 bye+0.067
ia+0.061
 reward+0.061
gan+0.061
Neg logit contributions
 valleys-0.062
 sizes-0.059
 planets-0.055
 galaxies-0.054
 sorts-0.054
Token so
Token ID523
Feature activation-2.61e-03

Pos logit contributions
 valleys+0.011
 sizes+0.010
 situations+0.010
 universe+0.010
 galaxies+0.010
Neg logit contributions
 bye-0.014
rav-0.013
ia-0.012
gan-0.012
less-0.011
Token happy
Token ID3772
Feature activation-3.75e-02

Pos logit contributions
 bye+0.003
rav+0.003
 reward+0.003
cake+0.003
ia+0.003
Neg logit contributions
 valleys-0.005
 sorts-0.004
 sizes-0.004
 universe-0.004
 galaxies-0.004
Token loud
Token ID7812
Feature activation-3.31e-02

Pos logit contributions
rav+0.024
gan+0.023
 bye+0.023
cake+0.023
ia+0.022
Neg logit contributions
 valleys-0.022
 universe-0.021
 planets-0.020
 valley-0.020
 sizes-0.020
Token sound
Token ID2128
Feature activation-2.34e-02

Pos logit contributions
rav+0.057
 bye+0.056
gan+0.053
ia+0.051
cake+0.050
Neg logit contributions
 valleys-0.047
 sizes-0.044
 universe-0.042
 planets-0.042
 sorts-0.042
Token.
Token ID13
Feature activation+3.71e-03

Pos logit contributions
rav+0.045
 bye+0.044
gan+0.041
ia+0.041
 reward+0.039
Neg logit contributions
 valleys-0.029
 sizes-0.027
 planets-0.026
 galaxies-0.026
 universe-0.026
Token Ch
Token ID609
Feature activation+1.16e-02

Pos logit contributions
 valleys+0.004
 universe+0.004
 planets+0.004
 sizes+0.004
 galaxies+0.004
Neg logit contributions
rav-0.008
 bye-0.007
less-0.007
gan-0.007
eal-0.006
Tokenoo
Token ID2238
Feature activation-4.93e-02

Pos logit contributions
 valleys+0.018
 situations+0.017
 sizes+0.017
 universe+0.017
 planets+0.016
Neg logit contributions
 bye-0.018
rav-0.017
 reward-0.015
cake-0.015
five-0.015
Token-
Token ID12
Feature activation-1.04e-01

Pos logit contributions
rav+0.086
 bye+0.084
 reward+0.079
less+0.077
cake+0.077
Neg logit contributions
 valleys-0.067
 sizes-0.062
 planets-0.059
 galaxies-0.056
 universe-0.056
Tokench
Token ID354
Feature activation-3.79e-02

Pos logit contributions
 bye+0.188
 reward+0.187
rav+0.174
ia+0.167
 apology+0.162
Neg logit contributions
 valleys-0.128
 sizes-0.113
 sorts-0.112
tails-0.111
roads-0.108
Tokenoo
Token ID2238
Feature activation-5.00e-02

Pos logit contributions
 bye+0.057
rav+0.055
ia+0.049
gan+0.047
cake+0.047
Neg logit contributions
 valleys-0.064
 sizes-0.060
 universe-0.058
 galaxies-0.058
 planets-0.058
Token!
Token ID0
Feature activation-1.44e-02

Pos logit contributions
rav+0.087
 bye+0.084
 reward+0.080
cake+0.078
gan+0.078
Neg logit contributions
 valleys-0.069
 sizes-0.062
 planets-0.060
 galaxies-0.057
 situations-0.057
Token It
Token ID632
Feature activation-3.70e-02

Pos logit contributions
rav+0.027
 bye+0.025
less+0.024
cake+0.024
 reward+0.024
Neg logit contributions
 valleys-0.019
 universe-0.016
 valley-0.015
 sizes-0.015
 planets-0.015
Token goes
Token ID2925
Feature activation-2.61e-02

Pos logit contributions
 bye+0.068
rav+0.065
ia+0.061
gan+0.058
five+0.058
Neg logit contributions
 valleys-0.050
 situations-0.046
 sizes-0.045
 universe-0.045
 galaxies-0.045
Token on
Token ID319
Feature activation-1.77e-02

Pos logit contributions
 bye+0.031
rav+0.030
ia+0.028
gan+0.027
 reward+0.027
Neg logit contributions
 valleys-0.018
 sizes-0.017
 galaxies-0.016
 universe-0.016
 planets-0.016
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
 bye+0.030
rav+0.028
ia+0.027
 reward+0.025
gan+0.025
Neg logit contributions
 valleys-0.028
 sizes-0.026
 galaxies-0.026
 planets-0.025
 sorts-0.025
Token Lily
Token ID20037
Feature activation-3.79e-03

Pos logit contributions
 bye+0.014
rav+0.014
ia+0.012
five+0.011
gan+0.011
Neg logit contributions
 valleys-0.031
 sizes-0.030
 universe-0.030
 sorts-0.029
 planets-0.029
Token always
Token ID1464
Feature activation+2.04e-03

Pos logit contributions
 bye+0.006
rav+0.005
ia+0.005
cake+0.005
 reward+0.005
Neg logit contributions
 valleys-0.007
roads-0.006
 situations-0.006
 galaxies-0.006
 sizes-0.006
Token double
Token ID4274
Feature activation-2.45e-02

Pos logit contributions
 valleys+0.003
 sizes+0.003
 galaxies+0.003
 universe+0.003
 planets+0.003
Neg logit contributions
 bye-0.003
rav-0.003
ia-0.003
gan-0.003
 reward-0.003
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 bye+0.029
rav+0.028
ia+0.024
gan+0.023
cake+0.023
Neg logit contributions
 valleys-0.049
 sorts-0.046
 sizes-0.045
 situations-0.045
 planets-0.044
Tokenchecked
Token ID26752
Feature activation-7.96e-03

Pos logit contributions
 bye+0.165
rav+0.155
ia+0.146
 reward+0.137
eal+0.131
Neg logit contributions
 valleys-0.155
tails-0.147
 sorts-0.144
 sizes-0.143
bowl-0.140
Token the
Token ID262
Feature activation+9.22e-03

Pos logit contributions
 bye+0.013
rav+0.013
ia+0.012
gan+0.011
cake+0.011
Neg logit contributions
 valleys-0.012
 sizes-0.011
 galaxies-0.011
 planets-0.011
 situations-0.011
Token ingredients
Token ID9391
Feature activation-1.55e-04

Pos logit contributions
 valleys+0.014
 sizes+0.014
 situations+0.013
 universe+0.013
 valley+0.012
Neg logit contributions
 bye-0.016
rav-0.016
gan-0.014
ia-0.013
eal-0.013
Token before
Token ID878
Feature activation+2.78e-02

Pos logit contributions
 bye+0.000
rav+0.000
ia+0.000
gan+0.000
 reward+0.000
Neg logit contributions
 valleys-0.000
 sizes-0.000
 universe-0.000
 galaxies-0.000
 planets-0.000
Token they
Token ID484
Feature activation+7.43e-04

Pos logit contributions
 valleys+0.039
 sizes+0.036
 universe+0.035
 galaxies+0.035
 planets+0.035
Neg logit contributions
 bye-0.050
rav-0.048
ia-0.044
gan-0.043
cake-0.042
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
rav+0.026
 bye+0.025
ia+0.024
 reward+0.022
rost+0.022
Neg logit contributions
 valleys-0.011
 sizes-0.010
tails-0.010
 sorts-0.010
bowl-0.009
Token
Token ID198
Feature activation-3.19e-03

Pos logit contributions
 bye+0.024
rav+0.024
ia+0.022
gan+0.021
 reward+0.021
Neg logit contributions
 valleys-0.023
 sizes-0.021
 galaxies-0.021
 universe-0.020
 planets-0.020
Token
Token ID198
Feature activation-9.85e-05

Pos logit contributions
 bye+0.005
rav+0.005
ia+0.005
gan+0.004
cake+0.004
Neg logit contributions
 valleys-0.005
 sizes-0.005
 universe-0.005
 galaxies-0.005
 planets-0.005
TokenB
Token ID33
Feature activation-2.87e-02

Pos logit contributions
 bye+0.000
rav+0.000
ia+0.000
gan+0.000
cake+0.000
Neg logit contributions
 valleys-0.000
 sizes-0.000
 universe-0.000
 galaxies-0.000
 planets-0.000
Tokenun
Token ID403
Feature activation+1.57e-02

Pos logit contributions
 bye+0.059
rav+0.056
ia+0.053
 reward+0.052
cake+0.052
Neg logit contributions
 valleys-0.031
 sizes-0.029
 situations-0.028
 sorts-0.027
 galaxies-0.027
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 valleys+0.021
 sizes+0.021
 sorts+0.020
tails+0.019
 galaxies+0.019
Neg logit contributions
 bye-0.027
rav-0.027
 reward-0.024
ia-0.023
five-0.023
TokenB
Token ID33
Feature activation-2.15e-02

Pos logit contributions
 bye+0.188
rav+0.177
 reward+0.164
ia+0.164
eal+0.155
Neg logit contributions
 valleys-0.138
 sizes-0.131
 sorts-0.127
 galaxies-0.127
 planets-0.125
Tokenun
Token ID403
Feature activation+2.00e-02

Pos logit contributions
 bye+0.047
rav+0.044
ia+0.042
 reward+0.041
cake+0.041
Neg logit contributions
 valleys-0.022
 sizes-0.020
 galaxies-0.019
 universe-0.018
 situations-0.018
Token hopped
Token ID45230
Feature activation+1.31e-02

Pos logit contributions
 valleys+0.028
 sizes+0.027
 sorts+0.026
tails+0.025
 galaxies+0.025
Neg logit contributions
 bye-0.034
rav-0.033
 reward-0.029
ia-0.029
gan-0.029
Token away
Token ID1497
Feature activation-1.94e-02

Pos logit contributions
 valleys+0.013
 sorts+0.011
 valley+0.011
roads+0.011
 hills+0.010
Neg logit contributions
rav-0.029
 bye-0.027
ia-0.026
gan-0.025
omy-0.025
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
rav+0.040
 bye+0.038
ia+0.035
five+0.034
eal+0.034
Neg logit contributions
 valleys-0.022
 sizes-0.019
 sorts-0.019
 valley-0.019
 galaxies-0.019
Token.
Token ID13
Feature activation+1.07e-03

Pos logit contributions
 valleys+0.018
roads+0.017
 galaxies+0.017
neath+0.016
 sorts+0.016
Neg logit contributions
 bye-0.015
rav-0.013
ia-0.013
 reward-0.012
gan-0.012
Token One
Token ID1881
Feature activation+2.53e-02

Pos logit contributions
 valleys+0.002
 situations+0.002
roads+0.002
 sizes+0.002
 galaxies+0.002
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.001
 reward-0.001
By-0.001
Token day
Token ID1110
Feature activation-9.43e-03

Pos logit contributions
 valleys+0.031
 sizes+0.029
 universe+0.028
 sorts+0.028
 galaxies+0.028
Neg logit contributions
 bye-0.049
rav-0.048
ia-0.044
gan-0.043
cake-0.043
Token,
Token ID11
Feature activation+1.03e-03

Pos logit contributions
 bye+0.016
rav+0.015
less+0.013
 reward+0.013
ia+0.013
Neg logit contributions
 valleys-0.015
 sizes-0.015
 sorts-0.014
 galaxies-0.014
roads-0.014
Token Bun
Token ID28515
Feature activation-9.72e-04

Pos logit contributions
 situations+0.002
 valleys+0.002
 sizes+0.002
roads+0.002
 sorts+0.002
Neg logit contributions
 bye-0.001
rav-0.001
By-0.001
 reward-0.001
less-0.001
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
less+0.001
-+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 universe-0.002
roads-0.002
 situations-0.002
TokenB
Token ID33
Feature activation-1.23e-02

Pos logit contributions
 bye+0.152
rav+0.145
ia+0.130
less+0.127
gan+0.126
Neg logit contributions
 valleys-0.177
 sizes-0.167
 situations-0.162
 galaxies-0.160
 planets-0.160
Tokenun
Token ID403
Feature activation+2.70e-02

Pos logit contributions
rav+0.024
 bye+0.023
less+0.021
ia+0.021
gan+0.020
Neg logit contributions
 valleys-0.016
roads-0.015
 sizes-0.015
 planets-0.014
 galaxies-0.014
Token was
Token ID373
Feature activation+7.10e-03

Pos logit contributions
 valleys+0.044
 sizes+0.043
 universe+0.042
 situations+0.042
 galaxies+0.041
Neg logit contributions
 bye-0.042
rav-0.039
ia-0.037
gan-0.034
less-0.033
Token hopping
Token ID47153
Feature activation+1.21e-02

Pos logit contributions
 valleys+0.010
 sizes+0.009
 galaxies+0.009
 universe+0.009
 situations+0.009
Neg logit contributions
 bye-0.013
rav-0.012
ia-0.012
gan-0.011
less-0.011
Token through
Token ID832
Feature activation+1.94e-02

Pos logit contributions
 situations+0.019
 sizes+0.019
 valleys+0.019
 universe+0.019
 planets+0.018
Neg logit contributions
 bye-0.019
less-0.016
rav-0.016
ia-0.016
--0.015
Token much
Token ID881
Feature activation-2.86e-02

Pos logit contributions
rav+0.012
 bye+0.012
ia+0.011
five+0.011
 reward+0.011
Neg logit contributions
 valleys-0.007
 sorts-0.007
 universe-0.006
 sizes-0.006
 kinds-0.006
Token fun
Token ID1257
Feature activation-2.82e-02

Pos logit contributions
rav+0.051
 bye+0.047
ia+0.046
gan+0.046
eal+0.043
Neg logit contributions
 sorts-0.039
 valleys-0.039
 sizes-0.038
 universe-0.038
 valley-0.036
Token!
Token ID0
Feature activation-6.25e-03

Pos logit contributions
 bye+0.046
rav+0.044
ia+0.041
less+0.039
 reward+0.038
Neg logit contributions
 valleys-0.044
 sizes-0.041
 galaxies-0.040
 universe-0.040
 planets-0.040
Token The
Token ID383
Feature activation+5.42e-03

Pos logit contributions
 bye+0.012
rav+0.012
ia+0.011
gan+0.010
 reward+0.010
Neg logit contributions
 valleys-0.008
 sizes-0.007
 universe-0.007
 galaxies-0.007
 sorts-0.007
Token black
Token ID2042
Feature activation-1.70e-02

Pos logit contributions
 valleys+0.008
 universe+0.007
 planets+0.007
 sizes+0.007
 sorts+0.007
Neg logit contributions
rav-0.009
 bye-0.009
gan-0.008
ia-0.008
cake-0.008
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 bye+0.027
rav+0.027
gan+0.025
 reward+0.024
ia+0.024
Neg logit contributions
 valleys-0.025
 universe-0.025
 planets-0.024
 sorts-0.023
 galaxies-0.023
Tokencl
Token ID565
Feature activation-4.21e-02

Pos logit contributions
 bye+0.172
 reward+0.163
 coach+0.155
rav+0.152
 Deal+0.152
Neg logit contributions
 sorts-0.150
 kinds-0.129
 valleys-0.124
roads-0.123
 sizes-0.120
Tokenot
Token ID313
Feature activation+2.42e-05

Pos logit contributions
 bye+0.072
rav+0.066
 reward+0.063
cake+0.060
ia+0.059
Neg logit contributions
 valleys-0.061
 sorts-0.057
 universe-0.056
 situations-0.055
roads-0.055
Tokenhed
Token ID704
Feature activation-1.67e-02

Pos logit contributions
 valleys+0.000
 sizes+0.000
 sorts+0.000
 planets+0.000
 universe+0.000
Neg logit contributions
rav-0.000
 bye-0.000
gan-0.000
ia-0.000
less-0.000
Token child
Token ID1200
Feature activation-2.47e-03

Pos logit contributions
 bye+0.030
rav+0.030
ia+0.027
gan+0.026
five+0.026
Neg logit contributions
 valleys-0.022
 sizes-0.021
 universe-0.020
 planets-0.020
 galaxies-0.020
Token was
Token ID373
Feature activation+1.89e-02

Pos logit contributions
 bye+0.004
rav+0.004
ia+0.003
gan+0.003
five+0.003
Neg logit contributions
 valleys-0.004
 sizes-0.004
 situations-0.004
 universe-0.004
 galaxies-0.004
Token make
Token ID787
Feature activation+4.48e-03

Pos logit contributions
 situations+0.011
 valleys+0.010
 galaxies+0.010
 attic+0.010
 sorts+0.010
Neg logit contributions
 bye-0.009
rav-0.008
five-0.008
 reward-0.007
ia-0.007
Token noise
Token ID7838
Feature activation-1.29e-02

Pos logit contributions
 valleys+0.007
 planets+0.006
 sizes+0.006
 galaxies+0.006
 universe+0.006
Neg logit contributions
rav-0.008
 bye-0.007
cake-0.007
gan-0.007
 reward-0.007
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 bye+0.024
rav+0.024
ia+0.022
gan+0.021
 reward+0.021
Neg logit contributions
 valleys-0.017
 sizes-0.016
 planets-0.015
 galaxies-0.015
 universe-0.015
Token Ch
Token ID609
Feature activation+1.46e-02

Pos logit contributions
 valleys+0.015
 sizes+0.013
 universe+0.013
 planets+0.013
 galaxies+0.013
Neg logit contributions
 bye-0.023
rav-0.023
 reward-0.021
ia-0.021
cake-0.021
Tokenoo
Token ID2238
Feature activation-4.51e-02

Pos logit contributions
 valleys+0.022
 sizes+0.021
 situations+0.021
 universe+0.020
 planets+0.020
Neg logit contributions
 bye-0.024
rav-0.022
 reward-0.020
cake-0.020
gan-0.020
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 bye+0.079
rav+0.078
ia+0.071
gan+0.070
cake+0.070
Neg logit contributions
 valleys-0.063
 sizes-0.059
 planets-0.056
 galaxies-0.055
 universe-0.055
Tokench
Token ID354
Feature activation-3.41e-02

Pos logit contributions
 bye+0.169
 reward+0.162
ia+0.151
rav+0.151
 sandwich+0.141
Neg logit contributions
 valleys-0.151
 sizes-0.141
 sorts-0.135
tails-0.132
 stations-0.132
Tokenoo
Token ID2238
Feature activation-4.54e-02

Pos logit contributions
 bye+0.057
rav+0.055
ia+0.050
gan+0.048
five+0.047
Neg logit contributions
 valleys-0.053
 sizes-0.049
 galaxies-0.048
 universe-0.048
 planets-0.047
Token!
Token ID0
Feature activation-9.50e-03

Pos logit contributions
 bye+0.080
rav+0.080
ia+0.073
gan+0.072
 reward+0.072
Neg logit contributions
 valleys-0.062
 sizes-0.057
 planets-0.054
 galaxies-0.053
 situations-0.053
Token
Token ID198
Feature activation-5.40e-04

Pos logit contributions
 bye+0.018
rav+0.018
 reward+0.016
ia+0.016
cake+0.016
Neg logit contributions
 valleys-0.012
 sizes-0.010
 universe-0.010
 planets-0.010
 galaxies-0.010
Token
Token ID198
Feature activation+3.85e-03

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
gan+0.001
five+0.001
Neg logit contributions
 valleys-0.001
 sizes-0.001
 situations-0.001
 galaxies-0.001
 universe-0.001
Token,
Token ID11
Feature activation-3.37e-02

Pos logit contributions
omy+0.076
ige+0.075
eal+0.075
oph+0.075
rav+0.074
Neg logit contributions
 sometimes-0.005
 valleys-0.005
,"-0.003
?",-0.003
tails-0.001
Token "
Token ID366
Feature activation+6.47e-03

Pos logit contributions
rav+0.063
ia+0.062
eal+0.060
gan+0.058
less+0.057
Neg logit contributions
 valleys-0.039
 hill-0.032
 sizes-0.030
 sometimes-0.030
 fields-0.030
TokenDog
Token ID32942
Feature activation-2.74e-02

Pos logit contributions
 valleys+0.010
 sizes+0.010
roads+0.009
 sorts+0.009
 universe+0.008
Neg logit contributions
ia-0.010
 bye-0.010
rav-0.009
less-0.009
gan-0.009
Token.
Token ID13
Feature activation+5.63e-04

Pos logit contributions
 bye+0.055
rav+0.055
cake+0.051
 timer+0.050
 reward+0.050
Neg logit contributions
 sizes-0.026
 valleys-0.026
tails-0.025
 sorts-0.023
 situations-0.022
Token D
Token ID360
Feature activation+3.66e-03

Pos logit contributions
 valleys+0.001
 sizes+0.001
 universe+0.001
 planets+0.001
 galaxies+0.001
Neg logit contributions
rav-0.001
ia-0.001
 bye-0.001
cake-0.001
less-0.001
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 situations+0.005
orns+0.005
roads+0.005
 sorts+0.005
 kinds+0.005
Neg logit contributions
 bye-0.005
 slice-0.005
 timer-0.005
rav-0.005
gan-0.005
Tokeno
Token ID78
Feature activation+2.35e-04

Pos logit contributions
 bye+0.141
rav+0.130
ia+0.125
 reward+0.124
eal+0.120
Neg logit contributions
 sorts-0.156
roads-0.156
 valleys-0.156
tails-0.150
 sizes-0.150
Token-
Token ID12
Feature activation-9.97e-02

Pos logit contributions
 valleys+0.000
 sizes+0.000
 galaxies+0.000
 universe+0.000
 planets+0.000
Neg logit contributions
 bye-0.000
rav-0.000
ia-0.000
gan-0.000
 reward-0.000
Tokeng
Token ID70
Feature activation-3.76e-03

Pos logit contributions
 bye+0.144
rav+0.134
 reward+0.128
ia+0.127
eal+0.122
Neg logit contributions
 valleys-0.154
roads-0.147
 sorts-0.145
tails-0.144
 sizes-0.144
Token.
Token ID13
Feature activation+5.63e-04

Pos logit contributions
 bye+0.006
rav+0.005
ia+0.005
gan+0.005
cake+0.005
Neg logit contributions
 valleys-0.006
 sizes-0.006
 galaxies-0.006
 universe-0.006
 planets-0.006
Token Dog
Token ID8532
Feature activation-2.15e-02

Pos logit contributions
 valleys+0.001
 sizes+0.001
 universe+0.001
 planets+0.001
 galaxies+0.001
Neg logit contributions
rav-0.001
 bye-0.001
ia-0.001
less-0.001
cake-0.001
Token
Token ID220
Feature activation-6.79e-03

Pos logit contributions
 bye+0.014
rav+0.012
 reward+0.011
ia+0.011
gan+0.010
Neg logit contributions
 valleys-0.017
 planets-0.016
 galaxies-0.016
 universe-0.016
 sizes-0.016
Token
Token ID198
Feature activation-6.84e-03

Pos logit contributions
 bye+0.011
rav+0.011
ia+0.010
gan+0.009
cake+0.009
Neg logit contributions
 valleys-0.011
 sizes-0.010
 universe-0.010
 galaxies-0.010
 planets-0.010
Token
Token ID198
Feature activation-1.14e-03

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
By+0.006
S+0.006
Neg logit contributions
 sizes-0.014
 valleys-0.014
 situations-0.014
 planets-0.014
 galaxies-0.013
TokenB
Token ID33
Feature activation-3.09e-02

Pos logit contributions
 bye+0.001
rav+0.001
By+0.001
 reward+0.001
S+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 sorts-0.002
 planets-0.002
 situations-0.002
Tokenun
Token ID403
Feature activation+1.04e-02

Pos logit contributions
 bye+0.065
rav+0.063
ia+0.059
gan+0.057
cake+0.056
Neg logit contributions
 valleys-0.034
 sizes-0.030
 universe-0.029
 galaxies-0.029
 planets-0.029
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 valleys+0.016
 sizes+0.015
 universe+0.014
 galaxies+0.014
 planets+0.014
Neg logit contributions
 bye-0.017
rav-0.017
ia-0.015
gan-0.015
cake-0.014
TokenB
Token ID33
Feature activation-2.22e-02

Pos logit contributions
 bye+0.176
rav+0.169
ia+0.156
 reward+0.149
gan+0.149
Neg logit contributions
 valleys-0.151
 sizes-0.138
 galaxies-0.136
 universe-0.134
 sorts-0.134
Tokenun
Token ID403
Feature activation+1.62e-02

Pos logit contributions
 bye+0.046
rav+0.045
ia+0.042
gan+0.041
less+0.040
Neg logit contributions
 valleys-0.025
 sizes-0.022
 universe-0.022
 galaxies-0.022
 planets-0.021
Token felt
Token ID2936
Feature activation+3.92e-02

Pos logit contributions
 valleys+0.024
 sizes+0.023
 universe+0.022
 galaxies+0.022
 planets+0.022
Neg logit contributions
 bye-0.027
rav-0.026
ia-0.024
gan-0.023
cake-0.023
Token sad
Token ID6507
Feature activation+1.13e-02

Pos logit contributions
 valleys+0.070
 sizes+0.068
 planets+0.066
 galaxies+0.065
 universe+0.065
Neg logit contributions
 bye-0.059
rav-0.049
 reward-0.048
ia-0.046
less-0.045
Token and
Token ID290
Feature activation-9.13e-03

Pos logit contributions
 valleys+0.021
 sizes+0.020
roads+0.019
 universe+0.019
 planets+0.019
Neg logit contributions
 bye-0.016
rav-0.014
ia-0.013
less-0.013
 reward-0.013
Token.
Token ID13
Feature activation-5.37e-03

Pos logit contributions
rav+0.007
 bye+0.007
ia+0.006
omm+0.006
gan+0.006
Neg logit contributions
 sizes-0.002
 valleys-0.002
tails-0.002
 sorts-0.001
 kinds-0.001
Token "
Token ID366
Feature activation-3.44e-03

Pos logit contributions
 bye+0.009
rav+0.008
ia+0.007
gan+0.007
cake+0.007
Neg logit contributions
 valleys-0.009
 sizes-0.008
 galaxies-0.008
 universe-0.008
 planets-0.008
TokenHello
Token ID15496
Feature activation-4.16e-02

Pos logit contributions
 bye+0.006
rav+0.006
ia+0.005
gan+0.005
less+0.005
Neg logit contributions
 valleys-0.005
 sizes-0.005
 galaxies-0.005
 universe-0.005
 planets-0.005
Token,
Token ID11
Feature activation-4.07e-02

Pos logit contributions
 bye+0.076
rav+0.074
ia+0.068
gan+0.066
cake+0.065
Neg logit contributions
 valleys-0.056
 sizes-0.052
 universe-0.051
 galaxies-0.050
 planets-0.050
Token Bun
Token ID28515
Feature activation+2.26e-02

Pos logit contributions
 bye+0.066
rav+0.065
ia+0.060
gan+0.058
cake+0.057
Neg logit contributions
 valleys-0.062
 sizes-0.058
 universe-0.058
 planets-0.057
 galaxies-0.056
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 valleys+0.034
 sizes+0.031
 galaxies+0.030
 universe+0.030
 planets+0.030
Neg logit contributions
 bye-0.039
rav-0.038
ia-0.034
gan-0.033
less-0.033
TokenCloud
Token ID18839
Feature activation+2.24e-03

Pos logit contributions
 bye+0.156
rav+0.147
ia+0.140
 reward+0.136
gan+0.134
Neg logit contributions
 valleys-0.158
 sorts-0.150
 universe-0.149
 sizes-0.147
 galaxies-0.146
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
 valleys+0.003
 galaxies+0.003
 situations+0.003
 universe+0.003
roads+0.003
Neg logit contributions
 bye-0.004
rav-0.004
ia-0.003
gan-0.003
less-0.003
Token We
Token ID775
Feature activation-2.52e-02

Pos logit contributions
 valleys+0.022
 sizes+0.021
 universe+0.020
 galaxies+0.020
 planets+0.020
Neg logit contributions
 bye-0.025
rav-0.024
ia-0.022
gan-0.021
cake-0.021
Token are
Token ID389
Feature activation-7.56e-03

Pos logit contributions
 bye+0.046
rav+0.045
 reward+0.042
ia+0.040
five+0.040
Neg logit contributions
 valleys-0.034
 situations-0.032
roads-0.031
 sizes-0.030
 galaxies-0.030
Token your
Token ID534
Feature activation-2.76e-02

Pos logit contributions
rav+0.012
 bye+0.012
ia+0.011
five+0.011
cake+0.011
Neg logit contributions
 valleys-0.012
 sizes-0.011
 planets-0.011
 universe-0.011
 galaxies-0.011
Token?"
Token ID1701
Feature activation-6.99e-03

Pos logit contributions
 valleys+0.010
 sizes+0.010
 universe+0.009
 planets+0.009
 galaxies+0.009
Neg logit contributions
 bye-0.020
rav-0.020
ia-0.018
five-0.018
gan-0.017
Token
Token ID198
Feature activation+2.81e-03

Pos logit contributions
rav+0.013
 bye+0.013
ia+0.012
less+0.012
 reward+0.011
Neg logit contributions
 valleys-0.009
 universe-0.008
 sizes-0.008
 galaxies-0.008
 planets-0.008
Token
Token ID198
Feature activation+7.72e-03

Pos logit contributions
 valleys+0.003
 universe+0.003
 sorts+0.003
 planets+0.003
 galaxies+0.003
Neg logit contributions
 bye-0.005
rav-0.005
 reward-0.005
less-0.005
gan-0.005
TokenB
Token ID33
Feature activation-1.94e-02

Pos logit contributions
 valleys+0.012
 universe+0.011
roads+0.011
 valley+0.011
tails+0.011
Neg logit contributions
 bye-0.012
rav-0.012
 reward-0.011
less-0.011
gan-0.010
Tokenun
Token ID403
Feature activation+2.22e-02

Pos logit contributions
 bye+0.032
 timer+0.029
 nicer+0.028
 reward+0.027
 waiter+0.026
Neg logit contributions
 sorts-0.025
tails-0.025
 valleys-0.024
 sizes-0.023
bowl-0.023
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
 sizes+0.027
 valleys+0.026
 sorts+0.026
tails+0.025
bowl+0.025
Neg logit contributions
rav-0.040
 bye-0.039
 reward-0.035
less-0.033
five-0.032
TokenB
Token ID33
Feature activation-1.55e-02

Pos logit contributions
 bye+0.162
rav+0.149
 reward+0.141
ia+0.132
eal+0.128
Neg logit contributions
 valleys-0.161
 sorts-0.149
 sizes-0.147
 galaxies-0.147
 planets-0.145
Tokenun
Token ID403
Feature activation+2.41e-02

Pos logit contributions
 bye+0.032
rav+0.030
ia+0.029
 reward+0.028
cake+0.028
Neg logit contributions
 valleys-0.016
 sizes-0.015
 sorts-0.015
tails-0.014
 situations-0.014
Token said
Token ID531
Feature activation-2.11e-02

Pos logit contributions
 sizes+0.031
 valleys+0.031
 sorts+0.030
tails+0.030
 kinds+0.029
Neg logit contributions
rav-0.042
 bye-0.041
 reward-0.037
 timer-0.035
less-0.035
Token,
Token ID11
Feature activation-2.96e-02

Pos logit contributions
rav+0.052
omy+0.051
 timer+0.050
eal+0.050
icious+0.050
Neg logit contributions
 valleys-0.006
?",-0.005
 sometimes-0.005
 valley-0.003
bowl-0.003
Token "
Token ID366
Feature activation+1.37e-02

Pos logit contributions
rav+0.055
ia+0.053
eal+0.050
 bye+0.049
 reward+0.049
Neg logit contributions
 valleys-0.037
 fields-0.030
 hill-0.029
 valley-0.029
 galaxies-0.029
Token trunk
Token ID21427
Feature activation+1.75e-02

Pos logit contributions
 universe+0.020
 valleys+0.019
 planets+0.018
 galaxies+0.018
 sizes+0.018
Neg logit contributions
rav-0.019
 bye-0.018
gan-0.017
ia-0.017
cake-0.016
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 valleys+0.026
 sizes+0.024
 universe+0.024
 galaxies+0.024
 planets+0.024
Neg logit contributions
 bye-0.029
rav-0.029
ia-0.026
gan-0.025
five-0.024
Token
Token ID198
Feature activation-7.85e-03

Pos logit contributions
 bye+0.016
rav+0.015
ia+0.014
gan+0.013
cake+0.013
Neg logit contributions
 valleys-0.016
 sizes-0.015
 galaxies-0.015
 universe-0.015
 planets-0.015
Token
Token ID198
Feature activation-4.93e-03

Pos logit contributions
 bye+0.011
rav+0.010
ia+0.010
five+0.009
cake+0.009
Neg logit contributions
 valleys-0.014
 sizes-0.014
 situations-0.014
 galaxies-0.013
 planets-0.013
TokenThe
Token ID464
Feature activation-1.15e-02

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
cake+0.006
five+0.006
Neg logit contributions
 valleys-0.009
 sizes-0.008
 situations-0.008
 planets-0.008
 galaxies-0.008
Token moral
Token ID6573
Feature activation+5.92e-02

Pos logit contributions
rav+0.022
eal+0.021
gan+0.020
less+0.020
 bye+0.019
Neg logit contributions
 universe-0.015
 forest-0.015
 hill-0.015
 valley-0.014
 tallest-0.014
Token of
Token ID286
Feature activation-6.35e-03

Pos logit contributions
 valleys+0.116
 sizes+0.109
 galaxies+0.105
 situations+0.104
 planets+0.104
Neg logit contributions
 bye-0.077
rav-0.071
ia-0.066
 reward-0.061
five-0.060
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.008
 reward+0.008
five+0.008
Neg logit contributions
 valleys-0.011
 sizes-0.011
 situations-0.010
 galaxies-0.010
 valley-0.010
Token story
Token ID1621
Feature activation-9.98e-03

Pos logit contributions
rav+0.029
 bye+0.029
gan+0.028
less+0.028
eal+0.028
Neg logit contributions
 universe-0.020
 valleys-0.018
 planets-0.018
 valley-0.017
 sizes-0.017
Token is
Token ID318
Feature activation-1.87e-02

Pos logit contributions
 bye+0.017
rav+0.016
ia+0.016
 reward+0.014
cake+0.014
Neg logit contributions
 valleys-0.016
 sizes-0.014
 galaxies-0.014
 situations-0.014
 valley-0.014
Token that
Token ID326
Feature activation+2.52e-02

Pos logit contributions
 bye+0.027
rav+0.024
ia+0.023
less+0.022
 reward+0.022
Neg logit contributions
 valleys-0.036
 sizes-0.034
 planets-0.032
roads-0.032
 galaxies-0.032
Token be
Token ID307
Feature activation+4.75e-03

Pos logit contributions
 bye+0.005
 reward+0.004
rav+0.004
ia+0.004
five+0.004
Neg logit contributions
 valleys-0.007
 sizes-0.007
 galaxies-0.007
 planets-0.006
 situations-0.006
Token here
Token ID994
Feature activation-1.15e-02

Pos logit contributions
 valleys+0.008
 sizes+0.008
 planets+0.008
 universe+0.008
 galaxies+0.008
Neg logit contributions
 bye-0.007
rav-0.007
ia-0.006
cake-0.006
five-0.006
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
 bye+0.017
rav+0.016
ia+0.014
gan+0.014
cake+0.014
Neg logit contributions
 valleys-0.020
 sizes-0.019
 universe-0.018
 galaxies-0.018
 planets-0.018
Token support
Token ID1104
Feature activation+2.18e-02

Pos logit contributions
 bye+0.018
rav+0.017
ia+0.016
gan+0.015
 reward+0.015
Neg logit contributions
 valleys-0.015
 sizes-0.014
 galaxies-0.014
 universe-0.014
 planets-0.014
Token you
Token ID345
Feature activation-7.02e-03

Pos logit contributions
 valleys+0.041
 sizes+0.039
 galaxies+0.038
 universe+0.038
 planets+0.038
Neg logit contributions
 bye-0.029
rav-0.027
ia-0.024
gan-0.023
 reward-0.023
Token if
Token ID611
Feature activation+4.96e-02

Pos logit contributions
 bye+0.013
rav+0.012
ia+0.011
gan+0.011
 reward+0.011
Neg logit contributions
 valleys-0.010
 sizes-0.009
 sorts-0.009
 galaxies-0.009
 universe-0.009
Token you
Token ID345
Feature activation-2.82e-02

Pos logit contributions
 valleys+0.090
 sizes+0.085
 universe+0.083
 galaxies+0.083
 planets+0.083
Neg logit contributions
 bye-0.068
rav-0.065
ia-0.058
gan-0.055
cake-0.054
Token want
Token ID765
Feature activation-2.30e-03

Pos logit contributions
 bye+0.048
rav+0.047
ia+0.043
gan+0.041
cake+0.041
Neg logit contributions
 valleys-0.041
 sizes-0.038
 universe-0.038
 galaxies-0.037
 planets-0.037
Token to
Token ID284
Feature activation+1.19e-03

Pos logit contributions
 bye+0.003
rav+0.003
 reward+0.003
ia+0.003
five+0.002
Neg logit contributions
 valleys-0.004
 sizes-0.004
 galaxies-0.004
 sorts-0.004
roads-0.004
Token explore
Token ID7301
Feature activation+1.56e-02

Pos logit contributions
 valleys+0.002
 sizes+0.002
 galaxies+0.002
 sorts+0.002
 planets+0.002
Neg logit contributions
 bye-0.002
rav-0.002
 reward-0.002
ia-0.001
gan-0.001
Token.
Token ID13
Feature activation+4.81e-03

Pos logit contributions
 valleys+0.025
 sizes+0.024
tails+0.024
 sorts+0.023
roads+0.023
Neg logit contributions
 bye-0.023
rav-0.021
ia-0.020
 reward-0.019
less-0.018
Token that
Token ID326
Feature activation+3.00e-02

Pos logit contributions
 valleys+0.070
 situations+0.065
 sizes+0.064
 galaxies+0.064
roads+0.062
Neg logit contributions
 bye-0.036
ia-0.031
 reward-0.031
less-0.030
 lady-0.030
Token she
Token ID673
Feature activation-2.64e-03

Pos logit contributions
 valleys+0.047
 situations+0.043
 sizes+0.042
 galaxies+0.042
roads+0.041
Neg logit contributions
 bye-0.050
rav-0.048
ia-0.045
gan-0.043
 reward-0.042
Token would
Token ID561
Feature activation+2.02e-02

Pos logit contributions
 bye+0.005
rav+0.005
ia+0.004
gan+0.004
 reward+0.004
Neg logit contributions
 valleys-0.004
 situations-0.004
roads-0.003
 sizes-0.003
 galaxies-0.003
Token un
Token ID555
Feature activation-9.50e-04

Pos logit contributions
 valleys+0.034
 situations+0.032
 sizes+0.031
roads+0.030
 galaxies+0.030
Neg logit contributions
 bye-0.032
rav-0.028
ia-0.028
 reward-0.027
five-0.026
Tokenpack
Token ID8002
Feature activation+2.54e-02

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
gan+0.001
cake+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.001
 situations-0.001
 galaxies-0.001
 universe-0.001
Token every
Token ID790
Feature activation+4.80e-02

Pos logit contributions
 valleys+0.041
 sizes+0.037
 universe+0.037
 galaxies+0.037
 situations+0.036
Neg logit contributions
 bye-0.040
rav-0.038
ia-0.036
less-0.034
 reward-0.034
Token morning
Token ID3329
Feature activation+1.08e-02

Pos logit contributions
 valleys+0.073
 sizes+0.068
 galaxies+0.066
 planets+0.064
 situations+0.063
Neg logit contributions
 bye-0.083
rav-0.076
 reward-0.071
ia-0.071
cake-0.068
Token.
Token ID13
Feature activation+5.17e-03

Pos logit contributions
 valleys+0.015
 sizes+0.014
 universe+0.014
 galaxies+0.014
 planets+0.014
Neg logit contributions
 bye-0.019
rav-0.019
ia-0.017
gan-0.017
cake-0.016
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
 valleys+0.007
 sizes+0.006
 universe+0.006
 galaxies+0.006
 planets+0.006
Neg logit contributions
 bye-0.010
rav-0.009
ia-0.009
gan-0.008
cake-0.008
Token
Token ID198
Feature activation+1.54e-02

Pos logit contributions
 valleys+0.019
 sizes+0.019
 situations+0.018
 galaxies+0.018
 planets+0.018
Neg logit contributions
 bye-0.014
rav-0.014
ia-0.013
gan-0.012
cake-0.011
TokenOne
Token ID3198
Feature activation+1.71e-02

Pos logit contributions
 valleys+0.021
 sizes+0.019
 galaxies+0.019
 situations+0.019
 planets+0.018
Neg logit contributions
 bye-0.028
rav-0.028
ia-0.026
gan-0.024
cake-0.024
Token was
Token ID373
Feature activation+1.02e-02

Pos logit contributions
 valleys+0.024
 situations+0.022
roads+0.022
 galaxies+0.022
 universe+0.022
Neg logit contributions
 bye-0.016
ia-0.014
rav-0.014
gan-0.014
cake-0.013
Token sad
Token ID6507
Feature activation+2.26e-03

Pos logit contributions
 valleys+0.015
 sizes+0.014
 galaxies+0.014
 situations+0.014
 universe+0.014
Neg logit contributions
 bye-0.018
rav-0.016
ia-0.015
gan-0.015
cake-0.015
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
 valleys+0.004
 sizes+0.003
 galaxies+0.003
roads+0.003
 planets+0.003
Neg logit contributions
 bye-0.004
rav-0.003
ia-0.003
less-0.003
 reward-0.003
Token her
Token ID607
Feature activation-2.34e-02

Pos logit contributions
 bye+0.027
rav+0.023
ia+0.021
gan+0.021
 reward+0.020
Neg logit contributions
 valleys-0.030
 sizes-0.028
 galaxies-0.027
 universe-0.027
 situations-0.027
Token brother
Token ID3956
Feature activation+3.74e-02

Pos logit contributions
 bye+0.035
rav+0.035
gan+0.030
cake+0.030
ia+0.030
Neg logit contributions
 valleys-0.039
 universe-0.038
 planets-0.037
 sizes-0.037
 galaxies-0.037
Token felt
Token ID2936
Feature activation+4.93e-02

Pos logit contributions
 valleys+0.056
 sizes+0.052
 galaxies+0.051
 universe+0.051
 planets+0.050
Neg logit contributions
 bye-0.064
rav-0.061
ia-0.056
gan-0.054
 reward-0.054
Token clumsy
Token ID39120
Feature activation+3.40e-02

Pos logit contributions
 valleys+0.108
 sizes+0.099
 planets+0.098
tails+0.098
 galaxies+0.098
Neg logit contributions
 bye-0.056
 reward-0.043
rav-0.043
 nicer-0.043
less-0.042
Token.
Token ID13
Feature activation-7.72e-05

Pos logit contributions
 valleys+0.062
 galaxies+0.057
 sizes+0.057
 universe+0.057
 planets+0.056
Neg logit contributions
 bye-0.048
rav-0.044
less-0.040
ia-0.039
cake-0.039
Token They
Token ID1119
Feature activation+1.00e-02

Pos logit contributions
 bye+0.000
rav+0.000
ia+0.000
 reward+0.000
less+0.000
Neg logit contributions
 valleys-0.000
 sizes-0.000
 universe-0.000
 situations-0.000
 galaxies-0.000
Token worked
Token ID3111
Feature activation+1.77e-02

Pos logit contributions
 valleys+0.017
 situations+0.017
roads+0.016
 sizes+0.016
 universe+0.015
Neg logit contributions
 bye-0.015
rav-0.014
gan-0.013
 reward-0.012
ia-0.012
Token together
Token ID1978
Feature activation+9.70e-03

Pos logit contributions
 valleys+0.034
 universe+0.032
 sizes+0.032
 situations+0.031
 galaxies+0.031
Neg logit contributions
 bye-0.023
rav-0.022
ia-0.020
less-0.019
gan-0.019
Token how
Token ID703
Feature activation-1.52e-02

Pos logit contributions
 valleys+0.011
 sizes+0.011
 planets+0.011
 universe+0.011
 galaxies+0.010
Neg logit contributions
rav-0.014
 bye-0.014
eal-0.013
cake-0.013
five-0.013
Token does
Token ID857
Feature activation-1.33e-03

Pos logit contributions
 bye+0.024
rav+0.022
ia+0.021
gan+0.019
less+0.019
Neg logit contributions
 valleys-0.026
 sizes-0.024
 situations-0.023
 galaxies-0.023
roads-0.023
Token the
Token ID262
Feature activation+7.46e-03

Pos logit contributions
 bye+0.002
rav+0.002
ia+0.002
 reward+0.002
gan+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 galaxies-0.002
 situations-0.002
 sorts-0.002
Token driver
Token ID4639
Feature activation+7.94e-03

Pos logit contributions
 valleys+0.012
 universe+0.011
 sizes+0.011
 planets+0.011
 valley+0.011
Neg logit contributions
rav-0.012
 bye-0.011
ia-0.011
gan-0.011
cake-0.010
Token get
Token ID651
Feature activation+1.11e-02

Pos logit contributions
 valleys+0.011
 sizes+0.010
 universe+0.010
 planets+0.010
 galaxies+0.009
Neg logit contributions
 bye-0.014
rav-0.014
ia-0.013
gan-0.013
cake-0.012
Token in
Token ID287
Feature activation+4.71e-02

Pos logit contributions
 valleys+0.017
 sizes+0.015
 universe+0.015
 planets+0.015
 galaxies+0.015
Neg logit contributions
 bye-0.019
rav-0.018
ia-0.017
gan-0.016
cake-0.016
Token and
Token ID290
Feature activation-9.69e-03

Pos logit contributions
 valleys+0.086
 galaxies+0.082
tails+0.079
 sizes+0.079
bowl+0.077
Neg logit contributions
 bye-0.061
 reward-0.054
rav-0.048
ia-0.048
 goodbye-0.048
Token out
Token ID503
Feature activation-1.97e-02

Pos logit contributions
 bye+0.016
rav+0.015
ia+0.014
gan+0.013
 reward+0.013
Neg logit contributions
 valleys-0.015
 sizes-0.014
 universe-0.014
 galaxies-0.014
 planets-0.014
Token?"
Token ID1701
Feature activation-1.89e-02

Pos logit contributions
 bye+0.033
rav+0.031
ia+0.028
 reward+0.027
gan+0.027
Neg logit contributions
 valleys-0.031
 sizes-0.029
 galaxies-0.028
 universe-0.028
 planets-0.027
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
rav+0.040
less+0.037
 bye+0.037
ige+0.034
 reward+0.033
Neg logit contributions
 valleys-0.023
roads-0.020
 sorts-0.018
 valley-0.018
 galaxies-0.018
Token
Token ID198
Feature activation-4.31e-03

Pos logit contributions
 bye+0.021
rav+0.020
 reward+0.019
less+0.019
eal+0.019
Neg logit contributions
 valleys-0.011
 sorts-0.009
 universe-0.009
roads-0.009
 planets-0.009
Tokeneed
Token ID2308
Feature activation+3.49e-03

Pos logit contributions
 bye+0.050
rav+0.048
cake+0.044
 reward+0.040
gan+0.040
Neg logit contributions
 valleys-0.054
 sizes-0.050
bowl-0.049
tails-0.047
 galaxies-0.047
Token.
Token ID13
Feature activation+1.07e-03

Pos logit contributions
 valleys+0.005
 sizes+0.005
 universe+0.005
 galaxies+0.005
 situations+0.005
Neg logit contributions
 bye-0.006
rav-0.006
ia-0.005
gan-0.005
cake-0.005
Token They
Token ID1119
Feature activation+1.35e-02

Pos logit contributions
 valleys+0.001
 sizes+0.001
 galaxies+0.001
 planets+0.001
 universe+0.001
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.002
less-0.002
gan-0.002
Token saw
Token ID2497
Feature activation-9.24e-03

Pos logit contributions
 situations+0.020
 valleys+0.020
roads+0.019
 sizes+0.019
 universe+0.018
Neg logit contributions
 bye-0.022
rav-0.021
ia-0.019
 reward-0.019
gan-0.019
Token other
Token ID584
Feature activation+8.20e-03

Pos logit contributions
 bye+0.016
rav+0.014
ia+0.014
gan+0.013
 reward+0.013
Neg logit contributions
 valleys-0.014
 sizes-0.013
 situations-0.013
roads-0.013
 universe-0.013
Token fish
Token ID5916
Feature activation-7.13e-04

Pos logit contributions
 valleys+0.014
 sizes+0.013
 universe+0.012
 galaxies+0.012
 planets+0.012
Neg logit contributions
 bye-0.013
rav-0.012
ia-0.011
gan-0.011
cake-0.011
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
less+0.001
 reward+0.001
Neg logit contributions
 valleys-0.001
 universe-0.001
 sizes-0.001
 galaxies-0.001
 situations-0.001
Token crabs
Token ID49172
Feature activation+1.47e-03

Pos logit contributions
 bye+0.016
rav+0.015
ia+0.014
gan+0.013
 reward+0.013
Neg logit contributions
 valleys-0.016
 universe-0.015
 sizes-0.015
 situations-0.015
roads-0.015
Token too
Token ID1165
Feature activation-2.63e-03

Pos logit contributions
 valleys+0.002
 situations+0.002
 universe+0.002
 galaxies+0.002
 sizes+0.002
Neg logit contributions
 bye-0.002
rav-0.002
ia-0.002
less-0.002
 reward-0.002
Token.
Token ID13
Feature activation+8.66e-03

Pos logit contributions
 bye+0.004
rav+0.004
ia+0.004
gan+0.004
less+0.004
Neg logit contributions
 valleys-0.004
 sizes-0.004
 situations-0.004
 universe-0.004
roads-0.004
Token The
Token ID383
Feature activation+1.34e-02

Pos logit contributions
 valleys+0.011
 sizes+0.010
 galaxies+0.010
 universe+0.010
 planets+0.010
Neg logit contributions
 bye-0.017
rav-0.016
ia-0.015
less-0.015
gan-0.014
Token bread
Token ID8509
Feature activation-2.18e-02

Pos logit contributions
 bye+0.020
rav+0.020
ia+0.017
gan+0.017
cake+0.016
Neg logit contributions
 valleys-0.032
 sizes-0.030
 situations-0.029
 universe-0.029
 planets-0.029
Token to
Token ID284
Feature activation+2.71e-03

Pos logit contributions
 bye+0.039
rav+0.038
ia+0.033
five+0.033
gan+0.033
Neg logit contributions
 valleys-0.031
 sizes-0.029
 planets-0.028
 galaxies-0.027
 sorts-0.027
Token eat
Token ID4483
Feature activation-2.01e-03

Pos logit contributions
 valleys+0.004
 sizes+0.004
 sorts+0.004
 universe+0.004
 planets+0.003
Neg logit contributions
rav-0.005
 bye-0.005
ia-0.004
gan-0.004
less-0.004
Token together
Token ID1978
Feature activation+1.74e-02

Pos logit contributions
 bye+0.004
rav+0.004
ia+0.003
gan+0.003
five+0.003
Neg logit contributions
 valleys-0.003
 sizes-0.003
 universe-0.002
 planets-0.002
 sorts-0.002
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
 valleys+0.025
 sizes+0.023
 galaxies+0.023
 universe+0.023
 planets+0.022
Neg logit contributions
 bye-0.030
rav-0.029
ia-0.027
gan-0.026
cake-0.026
Token<|endoftext|>
Token ID50256
Feature activation+9.97e-03

Pos logit contributions
 bye+0.021
rav+0.020
ia+0.019
gan+0.018
 reward+0.018
Neg logit contributions
 valleys-0.016
 sorts-0.015
 sizes-0.015
 universe-0.015
 planets-0.014
TokenOnce
Token ID7454
Feature activation-5.94e-03

Pos logit contributions
 valleys+0.018
 sizes+0.017
 galaxies+0.017
 situations+0.017
 planets+0.017
Neg logit contributions
 bye-0.013
rav-0.013
ia-0.012
gan-0.011
cake-0.011
Token upon
Token ID2402
Feature activation-2.73e-02

Pos logit contributions
 bye+0.008
rav+0.008
ia+0.007
gan+0.007
less+0.007
Neg logit contributions
 valleys-0.011
 sizes-0.010
 galaxies-0.010
 situations-0.010
 planets-0.010
Token a
Token ID257
Feature activation-3.95e-02

Pos logit contributions
 bye+0.046
rav+0.044
ia+0.040
gan+0.039
cake+0.038
Neg logit contributions
 valleys-0.041
 sizes-0.038
 universe-0.038
 galaxies-0.037
 planets-0.037
Token time
Token ID640
Feature activation-7.38e-03

Pos logit contributions
 bye+0.064
rav+0.063
cake+0.055
gan+0.055
 reward+0.054
Neg logit contributions
 valleys-0.057
 universe-0.055
 sizes-0.054
 sorts-0.054
 situations-0.052
Token,
Token ID11
Feature activation-3.18e-03

Pos logit contributions
 bye+0.015
rav+0.014
ia+0.013
cake+0.013
five+0.013
Neg logit contributions
 valleys-0.009
 universe-0.008
 sizes-0.008
 planets-0.008
 galaxies-0.008
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 valleys+0.026
 galaxies+0.025
 universe+0.024
 situations+0.024
roads+0.024
Neg logit contributions
ia-0.011
 bye-0.010
--0.010
 nicely-0.010
less-0.009
Token One
Token ID1881
Feature activation+3.30e-02

Pos logit contributions
 valleys+0.016
 galaxies+0.015
 situations+0.015
 sizes+0.015
 universe+0.015
Neg logit contributions
 bye-0.022
rav-0.020
ia-0.020
gan-0.018
five-0.017
Token day
Token ID1110
Feature activation-1.77e-03

Pos logit contributions
 valleys+0.051
 sizes+0.046
 galaxies+0.046
 planets+0.045
 situations+0.045
Neg logit contributions
 bye-0.056
rav-0.052
ia-0.052
gan-0.048
less-0.047
Token,
Token ID11
Feature activation-1.11e-03

Pos logit contributions
 bye+0.003
rav+0.003
ia+0.003
gan+0.003
 reward+0.003
Neg logit contributions
 valleys-0.003
 sizes-0.002
 universe-0.002
 galaxies-0.002
 planets-0.002
Token as
Token ID355
Feature activation+1.31e-02

Pos logit contributions
 bye+0.002
rav+0.001
ia+0.001
gan+0.001
 reward+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 universe-0.002
 galaxies-0.002
 planets-0.002
Token he
Token ID339
Feature activation+2.64e-03

Pos logit contributions
 valleys+0.022
 sizes+0.020
 universe+0.020
 galaxies+0.020
 planets+0.020
Neg logit contributions
 bye-0.020
rav-0.020
ia-0.018
gan-0.017
 reward-0.017
Token was
Token ID373
Feature activation+2.21e-02

Pos logit contributions
 valleys+0.003
 sizes+0.003
 universe+0.003
 sorts+0.003
 galaxies+0.003
Neg logit contributions
 bye-0.005
rav-0.005
ia-0.004
gan-0.004
cake-0.004
Token digging
Token ID18894
Feature activation+2.52e-02

Pos logit contributions
 valleys+0.027
 sizes+0.026
 galaxies+0.024
 universe+0.024
 planets+0.024
Neg logit contributions
 bye-0.044
rav-0.041
ia-0.040
gan-0.038
 reward-0.038
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
 sizes+0.043
 universe+0.042
tails+0.042
 valleys+0.041
 sorts+0.041
Neg logit contributions
 bye-0.033
ia-0.032
less-0.028
 reward-0.028
rav-0.028
Token he
Token ID339
Feature activation-5.73e-03

Pos logit contributions
 valleys+0.024
 sizes+0.024
 universe+0.023
 sorts+0.023
 galaxies+0.023
Neg logit contributions
 bye-0.017
rav-0.015
ia-0.014
gan-0.014
 reward-0.013
Token made
Token ID925
Feature activation+1.51e-02

Pos logit contributions
 bye+0.009
rav+0.009
ia+0.008
gan+0.008
 reward+0.008
Neg logit contributions
 valleys-0.009
 sizes-0.008
 situations-0.008
 universe-0.008
 sorts-0.008
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.006
gan+0.006
cake+0.006
Neg logit contributions
 valleys-0.006
 sizes-0.006
 universe-0.006
 galaxies-0.006
 planets-0.006
Token Jack
Token ID3619
Feature activation+2.83e-02

Pos logit contributions
 valleys+0.024
 sizes+0.022
 situations+0.022
 galaxies+0.022
 universe+0.021
Neg logit contributions
 bye-0.030
rav-0.028
ia-0.026
gan-0.025
 reward-0.025
Token was
Token ID373
Feature activation+2.01e-02

Pos logit contributions
 valleys+0.048
 situations+0.047
 galaxies+0.045
 sizes+0.045
 universe+0.045
Neg logit contributions
 bye-0.041
rav-0.039
ia-0.036
gan-0.035
cake-0.034
Token very
Token ID845
Feature activation+9.43e-03

Pos logit contributions
 valleys+0.025
 sizes+0.023
 universe+0.022
 galaxies+0.022
 planets+0.022
Neg logit contributions
 bye-0.040
rav-0.038
ia-0.035
gan-0.034
cake-0.034
Token excited
Token ID6568
Feature activation-3.42e-04

Pos logit contributions
 valleys+0.012
 sorts+0.011
 sizes+0.011
 universe+0.011
 kinds+0.011
Neg logit contributions
rav-0.017
 bye-0.016
ia-0.016
cake-0.015
five-0.015
Token;
Token ID26
Feature activation+2.37e-02

Pos logit contributions
 bye+0.000
rav+0.000
ia+0.000
less+0.000
gan+0.000
Neg logit contributions
 valleys-0.001
 galaxies-0.001
 sizes-0.001
 universe-0.001
 planets-0.001
Token he
Token ID339
Feature activation-2.10e-03

Pos logit contributions
 valleys+0.041
 sizes+0.038
 situations+0.038
 galaxies+0.037
 universe+0.037
Neg logit contributions
 bye-0.036
rav-0.033
ia-0.029
gan-0.029
 reward-0.029
Token said
Token ID531
Feature activation-2.07e-03

Pos logit contributions
 bye+0.004
rav+0.004
ia+0.004
gan+0.003
cake+0.003
Neg logit contributions
 valleys-0.003
 sizes-0.003
 universe-0.002
 galaxies-0.002
 planets-0.002
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
rav+0.003
 bye+0.003
ia+0.003
gan+0.003
eal+0.003
Neg logit contributions
 valleys-0.003
 universe-0.003
 galaxies-0.003
 planets-0.003
 sizes-0.003
Token "
Token ID366
Feature activation+2.81e-02

Pos logit contributions
rav+0.017
ia+0.017
 bye+0.016
eal+0.016
 reward+0.016
Neg logit contributions
 valleys-0.015
 sizes-0.014
 galaxies-0.013
 universe-0.013
 planets-0.013
TokenThis
Token ID1212
Feature activation+3.71e-03

Pos logit contributions
 valleys+0.038
 sorts+0.035
 universe+0.035
 sizes+0.034
 galaxies+0.034
Neg logit contributions
 bye-0.052
ia-0.049
rav-0.048
 apology-0.045
 reward-0.045
Token trick
Token ID6908
Feature activation-1.13e-02

Pos logit contributions
 valleys+0.000
 universe+0.000
 sizes+0.000
 galaxies+0.000
 sorts+0.000
Neg logit contributions
 bye-0.000
rav-0.000
gan-0.000
ia-0.000
eal-0.000
Token on
Token ID319
Feature activation+9.06e-03

Pos logit contributions
 bye+0.019
rav+0.018
ia+0.017
gan+0.016
less+0.016
Neg logit contributions
 valleys-0.017
 sizes-0.016
 galaxies-0.015
 universe-0.015
 planets-0.015
Token her
Token ID607
Feature activation+1.03e-02

Pos logit contributions
 valleys+0.015
 sizes+0.014
 galaxies+0.013
 universe+0.013
 planets+0.013
Neg logit contributions
 bye-0.014
rav-0.013
ia-0.013
gan-0.012
 reward-0.012
Token.
Token ID13
Feature activation-7.23e-05

Pos logit contributions
 valleys+0.017
 sizes+0.015
 galaxies+0.015
 universe+0.015
 sorts+0.015
Neg logit contributions
 bye-0.016
rav-0.015
ia-0.014
gan-0.014
 reward-0.013
Token Lily
Token ID20037
Feature activation+1.63e-02

Pos logit contributions
rav+0.000
 bye+0.000
less+0.000
ia+0.000
 reward+0.000
Neg logit contributions
 valleys-0.000
 sizes-0.000
 universe-0.000
 galaxies-0.000
 sorts-0.000
Token gigg
Token ID30442
Feature activation+3.28e-03

Pos logit contributions
 valleys+0.025
 sizes+0.023
 situations+0.022
 universe+0.022
 galaxies+0.022
Neg logit contributions
 bye-0.028
rav-0.026
ia-0.025
gan-0.023
cake-0.023
Tokenled
Token ID992
Feature activation+2.30e-02

Pos logit contributions
 valleys+0.005
 sizes+0.005
 galaxies+0.005
 planets+0.005
 universe+0.005
Neg logit contributions
 bye-0.005
rav-0.005
ia-0.005
 reward-0.005
gan-0.004
Token and
Token ID290
Feature activation-2.12e-02

Pos logit contributions
 valleys+0.024
 galaxies+0.021
 planets+0.020
 sorts+0.020
 sizes+0.020
Neg logit contributions
rav-0.050
 bye-0.045
ia-0.045
gan-0.044
cake-0.043
Token continued
Token ID3767
Feature activation+1.63e-02

Pos logit contributions
 bye+0.032
rav+0.030
ia+0.027
gan+0.026
 reward+0.026
Neg logit contributions
 valleys-0.036
 universe-0.033
 sizes-0.033
 situations-0.033
 galaxies-0.033
Token to
Token ID284
Feature activation-6.47e-03

Pos logit contributions
 valleys+0.027
 sizes+0.025
 universe+0.025
 galaxies+0.025
 planets+0.025
Neg logit contributions
 bye-0.025
rav-0.024
ia-0.022
gan-0.021
cake-0.020
Token help
Token ID1037
Feature activation+1.39e-02

Pos logit contributions
 bye+0.010
rav+0.009
ia+0.009
 reward+0.008
five+0.008
Neg logit contributions
 valleys-0.011
 sizes-0.010
 galaxies-0.010
 situations-0.010
 planets-0.010
Token was
Token ID373
Feature activation+1.32e-02

Pos logit contributions
 bye+0.065
rav+0.063
ia+0.058
gan+0.056
cake+0.055
Neg logit contributions
 valleys-0.048
 sizes-0.044
 galaxies-0.043
 universe-0.043
 planets-0.043
Token just
Token ID655
Feature activation-2.91e-02

Pos logit contributions
 valleys+0.019
 sizes+0.018
 universe+0.017
 galaxies+0.017
 planets+0.017
Neg logit contributions
 bye-0.023
rav-0.022
ia-0.020
gan-0.020
cake-0.019
Token a
Token ID257
Feature activation-2.23e-02

Pos logit contributions
rav+0.053
 bye+0.052
ia+0.048
cake+0.047
five+0.046
Neg logit contributions
 valleys-0.040
 sizes-0.039
 planets-0.037
 galaxies-0.037
 sorts-0.036
Token trick
Token ID6908
Feature activation-2.62e-02

Pos logit contributions
 bye+0.034
rav+0.033
ia+0.031
gan+0.031
eal+0.029
Neg logit contributions
 valleys-0.035
 universe-0.034
 sizes-0.033
 valley-0.033
 sorts-0.033
Token!
Token ID0
Feature activation-9.01e-03

Pos logit contributions
 bye+0.046
rav+0.045
ia+0.040
gan+0.040
 reward+0.039
Neg logit contributions
 valleys-0.037
 sizes-0.035
 universe-0.033
 galaxies-0.033
 planets-0.033
Token Sophie
Token ID35331
Feature activation-1.40e-02

Pos logit contributions
 bye+0.017
rav+0.017
less+0.015
ia+0.015
 reward+0.015
Neg logit contributions
 valleys-0.011
 universe-0.010
 sorts-0.010
 valley-0.009
 galaxies-0.009
Token had
Token ID550
Feature activation+1.83e-04

Pos logit contributions
 bye+0.024
rav+0.024
ia+0.021
gan+0.021
less+0.021
Neg logit contributions
 valleys-0.020
 sizes-0.019
 galaxies-0.018
 universe-0.018
 planets-0.018
Token tricked
Token ID42608
Feature activation+5.71e-03

Pos logit contributions
 valleys+0.000
 sizes+0.000
 sorts+0.000
 galaxies+0.000
 planets+0.000
Neg logit contributions
 bye-0.000
rav-0.000
ia-0.000
cake-0.000
gan-0.000
Token them
Token ID606
Feature activation-2.61e-02

Pos logit contributions
 valleys+0.008
 sizes+0.008
 universe+0.008
 situations+0.008
 planets+0.008
Neg logit contributions
 bye-0.010
rav-0.009
ia-0.008
cake-0.008
five-0.008
Token all
Token ID477
Feature activation+7.49e-03

Pos logit contributions
 bye+0.045
rav+0.043
ia+0.039
gan+0.038
less+0.038
Neg logit contributions
 valleys-0.039
 sizes-0.036
 galaxies-0.035
 universe-0.035
 planets-0.035
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
 valleys+0.014
 galaxies+0.013
 universe+0.012
 sizes+0.012
 planets+0.012
Neg logit contributions
 bye-0.010
rav-0.010
ia-0.009
less-0.008
five-0.008
Token felt
Token ID2936
Feature activation+3.99e-02

Pos logit contributions
 valleys+0.011
 planets+0.010
 sizes+0.010
 galaxies+0.009
 universe+0.009
Neg logit contributions
 bye-0.017
rav-0.017
less-0.015
 reward-0.015
cake-0.015
Token Lily
Token ID20037
Feature activation+3.03e-02

Pos logit contributions
 valleys+0.074
 situations+0.071
 universe+0.071
 sizes+0.070
 galaxies+0.070
Neg logit contributions
 bye-0.053
rav-0.044
gan-0.040
less-0.040
 reward-0.039
Token in
Token ID287
Feature activation+3.60e-02

Pos logit contributions
 valleys+0.045
 sizes+0.041
 galaxies+0.041
 universe+0.040
 situations+0.040
Neg logit contributions
 bye-0.053
rav-0.051
ia-0.046
gan-0.045
less-0.045
Token her
Token ID607
Feature activation-2.98e-03

Pos logit contributions
 valleys+0.059
 sizes+0.057
bowl+0.055
 galaxies+0.055
roads+0.055
Neg logit contributions
 bye-0.055
rav-0.049
ia-0.047
 reward-0.044
five-0.043
Token backpack
Token ID22526
Feature activation+1.34e-02

Pos logit contributions
 bye+0.005
rav+0.005
ia+0.004
cake+0.004
gan+0.004
Neg logit contributions
 valleys-0.004
 sizes-0.004
 universe-0.004
 planets-0.004
 galaxies-0.004
Token and
Token ID290
Feature activation-4.55e-02

Pos logit contributions
 valleys+0.023
 sizes+0.021
 galaxies+0.021
 situations+0.021
 universe+0.020
Neg logit contributions
 bye-0.020
rav-0.018
ia-0.018
cake-0.017
less-0.017
Token smiled
Token ID13541
Feature activation+6.17e-03

Pos logit contributions
 bye+0.074
rav+0.069
ia+0.063
gan+0.061
 reward+0.060
Neg logit contributions
 valleys-0.073
 sizes-0.067
 universe-0.067
 situations-0.066
 galaxies-0.066
Token.
Token ID13
Feature activation+7.72e-03

Pos logit contributions
 valleys+0.007
 universe+0.007
 planets+0.007
 sizes+0.007
 galaxies+0.007
Neg logit contributions
rav-0.012
 bye-0.012
ia-0.011
five-0.011
cake-0.011
Token
Token ID198
Feature activation+3.25e-03

Pos logit contributions
 valleys+0.011
 sizes+0.010
 universe+0.009
 galaxies+0.009
 planets+0.009
Neg logit contributions
 bye-0.014
rav-0.014
ia-0.013
gan-0.012
cake-0.012
Token
Token ID198
Feature activation+7.15e-03

Pos logit contributions
 valleys+0.006
 sizes+0.006
 situations+0.006
 galaxies+0.006
 planets+0.006
Neg logit contributions
 bye-0.004
rav-0.004
ia-0.004
five-0.003
gan-0.003
TokenBut
Token ID1537
Feature activation+4.26e-03

Pos logit contributions
 valleys+0.013
 sizes+0.012
 universe+0.011
 galaxies+0.011
 planets+0.011
Neg logit contributions
 bye-0.010
rav-0.010
ia-0.009
gan-0.009
 reward-0.009
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
rav+0.003
 bye+0.003
eal+0.003
ia+0.003
five+0.003
Neg logit contributions
 valleys-0.001
 situations-0.001
?",-0.001
 sizes-0.001
tails-0.001
Token "
Token ID366
Feature activation+2.56e-02

Pos logit contributions
rav+0.019
 bye+0.019
ia+0.017
eal+0.017
five+0.016
Neg logit contributions
 valleys-0.019
 sizes-0.018
 galaxies-0.017
 planets-0.016
roads-0.016
TokenH
Token ID39
Feature activation-9.02e-04

Pos logit contributions
 valleys+0.039
 sizes+0.035
 galaxies+0.034
 universe+0.034
 situations+0.034
Neg logit contributions
 bye-0.044
rav-0.042
ia-0.039
gan-0.037
 reward-0.037
Tokenoney
Token ID1419
Feature activation+1.66e-02

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
gan+0.001
less+0.001
Neg logit contributions
 valleys-0.002
 sizes-0.002
 galaxies-0.002
 universe-0.002
 planets-0.002
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
 valleys+0.023
 sizes+0.021
 universe+0.020
 galaxies+0.020
 planets+0.020
Neg logit contributions
 bye-0.030
rav-0.029
ia-0.027
gan-0.026
cake-0.026
Token you
Token ID345
Feature activation-4.76e-02

Pos logit contributions
 bye+0.019
rav+0.019
ia+0.017
gan+0.016
cake+0.016
Neg logit contributions
 valleys-0.020
 sizes-0.019
 universe-0.019
 situations-0.018
 galaxies-0.018
Token were
Token ID547
Feature activation+1.66e-02

Pos logit contributions
 bye+0.090
rav+0.083
 reward+0.079
ia+0.078
gan+0.077
Neg logit contributions
 valleys-0.068
 sizes-0.061
 galaxies-0.059
roads-0.059
 planets-0.058
Token so
Token ID523
Feature activation+1.63e-02

Pos logit contributions
 valleys+0.024
 sizes+0.022
 galaxies+0.021
 universe+0.021
 planets+0.021
Neg logit contributions
 bye-0.029
rav-0.028
ia-0.026
gan-0.025
cake-0.025
Token brave
Token ID14802
Feature activation+7.27e-03

Pos logit contributions
 valleys+0.020
 universe+0.018
 sizes+0.018
 sorts+0.017
 situations+0.017
Neg logit contributions
 bye-0.032
rav-0.031
ia-0.029
 reward-0.028
cake-0.028
Token,
Token ID11
Feature activation-1.21e-02

Pos logit contributions
 valleys+0.012
 galaxies+0.012
 universe+0.011
 sizes+0.011
 planets+0.011
Neg logit contributions
 bye-0.011
rav-0.010
less-0.009
ia-0.009
five-0.009
Token but
Token ID475
Feature activation+1.96e-02

Pos logit contributions
 bye+0.018
rav+0.018
ia+0.016
gan+0.016
cake+0.016
Neg logit contributions
 valleys-0.020
 sizes-0.019
 universe-0.018
 galaxies-0.018
 situations-0.018
Token.
Token ID13
Feature activation+4.41e-03

Pos logit contributions
 valleys+0.005
 sizes+0.004
 galaxies+0.004
 universe+0.004
 planets+0.004
Neg logit contributions
 bye-0.006
rav-0.005
ia-0.005
gan-0.005
cake-0.005
Token
Token ID198
Feature activation-3.49e-03

Pos logit contributions
 valleys+0.005
roads+0.004
 sorts+0.004
 universe+0.004
 hill+0.003
Neg logit contributions
rav-0.010
 bye-0.009
less-0.009
eal-0.009
ige-0.008
Token
Token ID198
Feature activation-2.14e-03

Pos logit contributions
 bye+0.006
rav+0.005
ia+0.005
gan+0.005
cake+0.005
Neg logit contributions
 valleys-0.006
 sizes-0.005
 galaxies-0.005
 universe-0.005
 planets-0.005
TokenS
Token ID50
Feature activation-6.62e-02

Pos logit contributions
rav+0.003
 bye+0.003
eal+0.003
 reward+0.003
ia+0.003
Neg logit contributions
 valleys-0.004
roads-0.003
 sorts-0.003
 hill-0.003
 valley-0.003
Tokenue
Token ID518
Feature activation-8.28e-03

Pos logit contributions
 bye+0.082
 coach+0.079
rav+0.074
 goodbye+0.067
 waiter+0.066
Neg logit contributions
 attic-0.107
 valleys-0.105
 basement-0.103
neath-0.103
 shed-0.101
Token tries
Token ID8404
Feature activation-1.58e-02

Pos logit contributions
 bye+0.013
rav+0.012
ia+0.012
five+0.011
gan+0.011
Neg logit contributions
 valleys-0.013
 sizes-0.013
 universe-0.013
 situations-0.013
 galaxies-0.013
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
rav+0.030
 bye+0.029
eal+0.028
ia+0.027
cake+0.026
Neg logit contributions
 valleys-0.019
 hill-0.017
 hills-0.017
tails-0.016
 sometimes-0.016
Token stop
Token ID2245
Feature activation-2.76e-02

Pos logit contributions
rav+0.025
 bye+0.024
cake+0.022
ia+0.022
less+0.022
Neg logit contributions
 valleys-0.016
 hill-0.015
roads-0.015
 universe-0.015
 sorts-0.014
Token him
Token ID683
Feature activation-1.70e-02

Pos logit contributions
rav+0.051
 bye+0.050
cake+0.046
ia+0.046
 reward+0.045
Neg logit contributions
 valleys-0.038
 planets-0.033
 sizes-0.032
 situations-0.032
 universe-0.032
Token.
Token ID13
Feature activation+3.77e-03

Pos logit contributions
 bye+0.026
rav+0.023
ia+0.022
gan+0.021
less+0.021
Neg logit contributions
 valleys-0.030
 galaxies-0.028
 planets-0.027
 sizes-0.027
 universe-0.027
Token She
Token ID1375
Feature activation+1.06e-03

Pos logit contributions
 valleys+0.004
 sizes+0.004
 universe+0.003
 situations+0.003
 galaxies+0.003
Neg logit contributions
 bye-0.008
rav-0.008
ia-0.007
 reward-0.007
less-0.007
Token a
Token ID257
Feature activation-8.93e-03

Pos logit contributions
 bye+0.005
rav+0.004
ia+0.004
gan+0.004
 reward+0.004
Neg logit contributions
 valleys-0.004
 sizes-0.004
 universe-0.004
 situations-0.004
 galaxies-0.004
Token tree
Token ID5509
Feature activation-9.96e-03

Pos logit contributions
 bye+0.015
rav+0.015
ia+0.014
gan+0.013
five+0.013
Neg logit contributions
 valleys-0.013
 universe-0.012
 sizes-0.012
 galaxies-0.012
 planets-0.012
Token.
Token ID13
Feature activation-2.19e-03

Pos logit contributions
 bye+0.015
rav+0.013
ia+0.013
less+0.013
cake+0.013
Neg logit contributions
 valleys-0.016
 galaxies-0.015
 situations-0.015
 sizes-0.015
 universe-0.015
Token When
Token ID1649
Feature activation+2.00e-02

Pos logit contributions
rav+0.004
 bye+0.004
ia+0.004
less+0.004
 reward+0.004
Neg logit contributions
 valleys-0.003
 galaxies-0.002
 sizes-0.002
 universe-0.002
 planets-0.002
Token the
Token ID262
Feature activation+4.13e-03

Pos logit contributions
 valleys+0.030
 galaxies+0.026
 planets+0.026
 universe+0.026
 valley+0.026
Neg logit contributions
rav-0.035
 bye-0.034
ia-0.032
cake-0.031
eal-0.031
Token potato
Token ID21219
Feature activation-1.28e-02

Pos logit contributions
 valleys+0.006
 universe+0.006
 valley+0.006
 planets+0.006
 sizes+0.006
Neg logit contributions
rav-0.007
 bye-0.007
ia-0.006
gan-0.006
five-0.006
Token looked
Token ID3114
Feature activation-4.99e-03

Pos logit contributions
rav+0.026
 bye+0.025
less+0.023
five+0.023
gan+0.023
Neg logit contributions
 valleys-0.016
 valley-0.014
 universe-0.014
 planets-0.014
 hill-0.013
Token up
Token ID510
Feature activation-1.54e-02

Pos logit contributions
 bye+0.008
rav+0.008
ia+0.008
gan+0.007
cake+0.007
Neg logit contributions
 valleys-0.008
 sizes-0.007
 galaxies-0.007
 universe-0.007
 planets-0.007
Token he
Token ID339
Feature activation-5.19e-03

Pos logit contributions
rav+0.030
 bye+0.030
ia+0.027
cake+0.027
five+0.027
Neg logit contributions
 valleys-0.019
 galaxies-0.016
 universe-0.016
 valley-0.016
 planets-0.016
Token saw
Token ID2497
Feature activation-7.15e-04

Pos logit contributions
 bye+0.009
rav+0.008
ia+0.008
gan+0.007
 reward+0.007
Neg logit contributions
 valleys-0.008
 sizes-0.007
 situations-0.007
 universe-0.007
 galaxies-0.007
Token the
Token ID262
Feature activation+3.60e-03

Pos logit contributions
 bye+0.001
rav+0.001
ia+0.001
gan+0.001
 reward+0.001
Neg logit contributions
 valleys-0.001
 sizes-0.001
 universe-0.001
 situations-0.001
 galaxies-0.001
Token waited
Token ID13488
Feature activation-1.29e-02

Pos logit contributions
 bye+0.042
rav+0.040
ia+0.036
gan+0.035
 reward+0.034
Neg logit contributions
 valleys-0.046
 sizes-0.043
 universe-0.042
 galaxies-0.042
 planets-0.042
Token for
Token ID329
Feature activation-3.18e-02

Pos logit contributions
rav+0.022
 bye+0.021
cake+0.019
gan+0.019
ia+0.019
Neg logit contributions
 valleys-0.018
 planets-0.016
 galaxies-0.016
 sizes-0.016
 universe-0.016
Token their
Token ID511
Feature activation-2.53e-02

Pos logit contributions
rav+0.061
cake+0.057
 bye+0.056
eal+0.054
gan+0.052
Neg logit contributions
 valleys-0.041
 planets-0.039
 sizes-0.037
 galaxies-0.036
 universe-0.036
Token mom
Token ID1995
Feature activation+2.88e-02

Pos logit contributions
 bye+0.030
rav+0.030
ia+0.027
cake+0.026
eal+0.026
Neg logit contributions
 valleys-0.047
 universe-0.046
 sizes-0.046
 planets-0.045
 situations-0.044
Token to
Token ID284
Feature activation-1.19e-02

Pos logit contributions
 valleys+0.043
 sizes+0.041
 planets+0.039
 universe+0.039
 sorts+0.039
Neg logit contributions
 bye-0.048
rav-0.046
ia-0.042
gan-0.040
 reward-0.039
Token come
Token ID1282
Feature activation-6.37e-02

Pos logit contributions
rav+0.020
 bye+0.018
ia+0.018
gan+0.017
cake+0.017
Neg logit contributions
 valleys-0.017
 universe-0.016
 situations-0.016
 valley-0.016
 hill-0.015
Token.
Token ID13
Feature activation+4.85e-03

Pos logit contributions
 bye+0.084
rav+0.082
ia+0.074
cake+0.072
five+0.069
Neg logit contributions
 valleys-0.117
 universe-0.107
 planets-0.106
 galaxies-0.104
 valley-0.104
Token They
Token ID1119
Feature activation+7.00e-03

Pos logit contributions
 valleys+0.006
 sizes+0.005
 planets+0.005
 galaxies+0.005
 universe+0.005
Neg logit contributions
 bye-0.010
rav-0.010
ia-0.009
cake-0.009
 reward-0.009
Token did
Token ID750
Feature activation-2.84e-03

Pos logit contributions
 valleys+0.011
 sizes+0.010
 situations+0.010
 galaxies+0.010
 universe+0.010
Neg logit contributions
 bye-0.012
rav-0.011
ia-0.010
gan-0.010
 reward-0.010
Token not
Token ID407
Feature activation-1.78e-02

Pos logit contributions
rav+0.005
 bye+0.005
cake+0.005
eal+0.005
less+0.004
Neg logit contributions
 valleys-0.004
 planets-0.003
 universe-0.003
 sizes-0.003
 basement-0.003
Token want
Token ID765
Feature activation-2.41e-02

Pos logit contributions
 bye+0.031
rav+0.029
ia+0.027
 reward+0.026
gan+0.026
Neg logit contributions
 valleys-0.027
 sizes-0.025
 galaxies-0.024
 planets-0.024
 situations-0.024
Token,
Token ID11
Feature activation+9.71e-03

Pos logit contributions
 valleys+0.006
roads+0.005
 galaxies+0.005
 universe+0.005
 planets+0.005
Neg logit contributions
rav-0.006
 bye-0.006
ia-0.006
 reward-0.006
less-0.006
Token she
Token ID673
Feature activation-7.21e-03

Pos logit contributions
 valleys+0.014
 sizes+0.013
 situations+0.013
 universe+0.013
 galaxies+0.013
Neg logit contributions
 bye-0.016
rav-0.016
ia-0.014
gan-0.014
cake-0.014
Token found
Token ID1043
Feature activation-4.22e-03

Pos logit contributions
 bye+0.013
rav+0.013
ia+0.012
gan+0.011
cake+0.011
Neg logit contributions
 valleys-0.010
 sizes-0.009
 galaxies-0.009
 universe-0.009
 planets-0.009
Token a
Token ID257
Feature activation-9.85e-03

Pos logit contributions
 bye+0.007
rav+0.007
ia+0.007
gan+0.006
cake+0.006
Neg logit contributions
 valleys-0.006
 sizes-0.005
 universe-0.005
 galaxies-0.005
 planets-0.005
Token funny
Token ID8258
Feature activation-1.61e-02

Pos logit contributions
 bye+0.015
rav+0.015
ia+0.013
 reward+0.013
cake+0.013
Neg logit contributions
 valleys-0.016
 sizes-0.015
 galaxies-0.015
 situations-0.015
 planets-0.015
Token-
Token ID12
Feature activation-7.21e-02

Pos logit contributions
 bye+0.027
rav+0.026
ia+0.023
gan+0.023
five+0.022
Neg logit contributions
 valleys-0.024
 universe-0.023
 sizes-0.023
 planets-0.022
 galaxies-0.022
Tokenlooking
Token ID11534
Feature activation-2.32e-02

Pos logit contributions
 bye+0.132
rav+0.125
ia+0.114
gan+0.113
 reward+0.108
Neg logit contributions
 sorts-0.099
 valleys-0.096
 universe-0.093
 sizes-0.091
 galaxies-0.091
Token machine
Token ID4572
Feature activation+9.69e-03

Pos logit contributions
 bye+0.038
rav+0.038
ia+0.034
gan+0.034
five+0.032
Neg logit contributions
 valleys-0.034
 universe-0.033
 sizes-0.033
 valley-0.032
 situations-0.032
Token.
Token ID13
Feature activation+1.54e-03

Pos logit contributions
 valleys+0.015
 galaxies+0.014
 sizes+0.014
 universe+0.014
 situations+0.014
Neg logit contributions
 bye-0.015
rav-0.015
ia-0.014
cake-0.014
less-0.013
Token It
Token ID632
Feature activation-2.53e-02

Pos logit contributions
 valleys+0.002
 sizes+0.002
 universe+0.002
 galaxies+0.002
 planets+0.002
Neg logit contributions
 bye-0.003
rav-0.002
ia-0.002
gan-0.002
 reward-0.002
Token was
Token ID373
Feature activation+1.92e-02

Pos logit contributions
 bye+0.044
rav+0.043
ia+0.040
gan+0.038
cake+0.038
Neg logit contributions
 valleys-0.036
 sizes-0.033
 galaxies-0.032
 universe-0.032
 situations-0.032
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
 valleys+0.010
 sizes+0.009
 universe+0.009
 galaxies+0.009
 planets+0.009
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.010
gan-0.009
cake-0.009
Token"
Token ID1
Feature activation+1.10e-02

Pos logit contributions
 valleys+0.019
 sizes+0.018
 universe+0.017
 galaxies+0.017
 planets+0.017
Neg logit contributions
 bye-0.015
rav-0.015
ia-0.013
gan-0.013
cake-0.012
TokenWow
Token ID22017
Feature activation-3.98e-02

Pos logit contributions
 valleys+0.013
 sizes+0.012
First+0.011
 universe+0.011
 planets+0.011
Neg logit contributions
 bye-0.022
ia-0.021
rav-0.020
less-0.020
gan-0.020
Token,
Token ID11
Feature activation-2.36e-02

Pos logit contributions
 bye+0.075
rav+0.072
ia+0.067
gan+0.065
cake+0.065
Neg logit contributions
 valleys-0.052
 sizes-0.047
 universe-0.046
 planets-0.045
 galaxies-0.045
Token Sara
Token ID24799
Feature activation-3.59e-03

Pos logit contributions
rav+0.037
cake+0.036
less+0.034
ia+0.034
 bye+0.033
Neg logit contributions
 valleys-0.033
 valley-0.031
 sizes-0.031
 universe-0.030
 planets-0.030
Token and
Token ID290
Feature activation-6.75e-02

Pos logit contributions
 bye+0.007
rav+0.006
ia+0.006
gan+0.006
cake+0.006
Neg logit contributions
 valleys-0.005
 sizes-0.005
 galaxies-0.004
 universe-0.004
 planets-0.004
Token Tom
Token ID4186
Feature activation+2.33e-02

Pos logit contributions
rav+0.104
less+0.098
cake+0.097
va+0.094
har+0.091
Neg logit contributions
 fields-0.092
 attic-0.091
 trees-0.091
 sometimes-0.090
 valleys-0.088
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
 valleys+0.037
 galaxies+0.035
 sizes+0.033
 universe+0.033
 planets+0.033
Neg logit contributions
 bye-0.039
rav-0.036
ia-0.035
 reward-0.032
gan-0.032
Token You
Token ID921
Feature activation-3.43e-02

Pos logit contributions
 valleys+0.022
 sizes+0.021
 galaxies+0.020
 universe+0.020
 planets+0.020
Neg logit contributions
 bye-0.021
rav-0.021
ia-0.018
gan-0.018
five-0.017
Token are
Token ID389
Feature activation-2.02e-02

Pos logit contributions
 bye+0.061
rav+0.058
 reward+0.054
five+0.052
ia+0.052
Neg logit contributions
 valleys-0.050
 situations-0.048
roads-0.046
 sizes-0.045
 galaxies-0.044
Token so
Token ID523
Feature activation+1.32e-03

Pos logit contributions
 bye+0.037
rav+0.036
ia+0.033
cake+0.032
five+0.032
Neg logit contributions
 valleys-0.027
 sizes-0.025
 universe-0.025
 planets-0.025
 galaxies-0.024
Token.
Token ID13
Feature activation-2.36e-03

Pos logit contributions
 bye+0.020
rav+0.019
ia+0.018
gan+0.017
cake+0.017
Neg logit contributions
 valleys-0.017
 sizes-0.015
 universe-0.015
 galaxies-0.015
 planets-0.015
Token
Token ID198
Feature activation-3.93e-03

Pos logit contributions
rav+0.004
 bye+0.004
ia+0.004
gan+0.004
 reward+0.004
Neg logit contributions
 valleys-0.003
 sizes-0.003
 universe-0.003
 galaxies-0.003
 planets-0.003
Token
Token ID198
Feature activation-2.07e-03

Pos logit contributions
 bye+0.006
rav+0.005
ia+0.005
gan+0.004
five+0.004
Neg logit contributions
 valleys-0.007
 sizes-0.007
 galaxies-0.007
 situations-0.007
 universe-0.007
TokenL
Token ID43
Feature activation-1.60e-02

Pos logit contributions
 bye+0.003
rav+0.003
ia+0.002
gan+0.002
 reward+0.002
Neg logit contributions
 valleys-0.004
 sizes-0.003
 universe-0.003
 galaxies-0.003
 planets-0.003
Tokenily
Token ID813
Feature activation+5.61e-03

Pos logit contributions
 bye+0.018
rav+0.016
 reward+0.013
 timer+0.012
less+0.012
Neg logit contributions
 valleys-0.032
 situations-0.031
 universe-0.031
 planets-0.030
 attic-0.030
Token and
Token ID290
Feature activation-7.20e-02

Pos logit contributions
 valleys+0.007
 sizes+0.006
 sorts+0.006
 planets+0.006
 universe+0.006
Neg logit contributions
 bye-0.011
rav-0.011
ia-0.009
less-0.009
 reward-0.009
Token Ben
Token ID3932
Feature activation+2.85e-02

Pos logit contributions
rav+0.126
cake+0.111
less+0.110
 bye+0.109
five+0.107
Neg logit contributions
 valleys-0.100
 planets-0.099
 universe-0.098
 fields-0.093
 valley-0.092
Token took
Token ID1718
Feature activation-1.42e-02

Pos logit contributions
 valleys+0.046
 sizes+0.043
 universe+0.042
 situations+0.042
 galaxies+0.042
Neg logit contributions
 bye-0.045
rav-0.043
ia-0.039
gan-0.038
cake-0.037
Token turns
Token ID4962
Feature activation-1.63e-02

Pos logit contributions
rav+0.026
 bye+0.025
cake+0.023
ia+0.023
gan+0.022
Neg logit contributions
 valleys-0.020
 planets-0.018
 sorts-0.017
 sizes-0.017
 universe-0.017
Token drying
Token ID29621
Feature activation-2.00e-03

Pos logit contributions
 bye+0.034
rav+0.033
ia+0.030
gan+0.029
 reward+0.029
Neg logit contributions
 valleys-0.019
 sizes-0.017
 sorts-0.016
 planets-0.016
 fields-0.015
Token their
Token ID511
Feature activation-2.01e-04

Pos logit contributions
 bye+0.003
rav+0.003
ia+0.003
cake+0.003
gan+0.003
Neg logit contributions
 valleys-0.003
 sizes-0.003
 planets-0.003
 galaxies-0.003
 universe-0.003
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
 valleys+0.036
 sizes+0.034
 universe+0.033
 galaxies+0.033
 planets+0.032
Neg logit contributions
 bye-0.041
rav-0.039
ia-0.036
gan-0.034
cake-0.034
Token He
Token ID679
Feature activation+3.76e-03

Pos logit contributions
rav+0.002
 bye+0.002
less+0.002
ia+0.002
 reward+0.002
Neg logit contributions
 valleys-0.002
 sizes-0.001
 universe-0.001
 galaxies-0.001
 planets-0.001
Token heard
Token ID2982
Feature activation-6.72e-03

Pos logit contributions
 valleys+0.005
 sizes+0.005
 galaxies+0.005
 planets+0.005
 universe+0.004
Neg logit contributions
 bye-0.007
rav-0.007
ia-0.006
cake-0.006
less-0.006
Token a
Token ID257
Feature activation-1.68e-02

Pos logit contributions
rav+0.014
 bye+0.012
gan+0.012
ia+0.011
eal+0.011
Neg logit contributions
 valleys-0.009
 galaxies-0.008
 planets-0.008
 universe-0.008
 sizes-0.007
Token high
Token ID1029
Feature activation-1.76e-02

Pos logit contributions
rav+0.027
 bye+0.026
gan+0.026
ia+0.024
cake+0.023
Neg logit contributions
 valleys-0.026
 universe-0.024
 valley-0.024
 hill-0.023
 sizes-0.023
Token-
Token ID12
Feature activation-7.37e-02

Pos logit contributions
rav+0.027
 bye+0.027
ia+0.024
gan+0.024
cake+0.023
Neg logit contributions
 valleys-0.029
 sizes-0.026
 universe-0.026
 planets-0.026
 galaxies-0.026
Tokenp
Token ID79
Feature activation-1.32e-02

Pos logit contributions
 bye+0.126
rav+0.125
 reward+0.112
ia+0.110
gan+0.110
Neg logit contributions
 valleys-0.108
roads-0.103
 sorts-0.103
 galaxies-0.097
 sizes-0.095
Tokenitched
Token ID10981
Feature activation+4.01e-02

Pos logit contributions
rav+0.024
 bye+0.023
 reward+0.021
gan+0.021
cake+0.021
Neg logit contributions
 valleys-0.018
 sizes-0.016
 planets-0.016
 galaxies-0.016
 universe-0.015
Token noise
Token ID7838
Feature activation-5.22e-03

Pos logit contributions
 valleys+0.061
 sizes+0.057
 universe+0.056
 galaxies+0.055
 planets+0.055
Neg logit contributions
rav-0.067
 bye-0.066
ia-0.058
gan-0.058
cake-0.056
Token coming
Token ID2406
Feature activation-4.81e-02

Pos logit contributions
rav+0.010
 bye+0.010
ia+0.009
gan+0.009
 reward+0.009
Neg logit contributions
 valleys-0.007
 sizes-0.006
 galaxies-0.006
 planets-0.006
 universe-0.006
Token from
Token ID422
Feature activation+1.05e-03

Pos logit contributions
 bye+0.085
rav+0.083
ia+0.076
gan+0.073
cake+0.073
Neg logit contributions
 valleys-0.069
 sizes-0.063
 universe-0.061
 galaxies-0.061
 planets-0.061