TOP ACTIVATIONS
MAX = 0.076

sand . She used her shovel to dig it up and
, if you don 't stop talking , I will have
picks up her bucket and shovel and walks to him .
. They used buckets and shovel s and shells and stones
They share their bucket and shovel with Mia . They make
They took their bucket and shovel . They found
have a bad bucket and shovel . They have no fun
He took his bucket and shovel and went to another spot
then carefully covered it with shovel s full of dirt .
the sand . They had shovel s and buckets and made
with her yellow bucket and shovel . One day , she
and touched it with his fins . The rock felt bump
angry . They throw their shovel s and hats on the
She used her bucket and shovel to help her climb the
and grabbed her bucket and shovel . As they opened the
. He shows Sara his shovel . He has filled it
have the best bucket and shovel . I have the best
It touches their masks and fins with its nose . It
heart . You should not stop singing because of one sw
And our new bucket and shovel too ." L

MIN ACTIVATIONS
MIN = -0.071

to clean your room before you can ride your motorcycle ."
, there was a place called Rainbow land . In Rainbow
there was a big mountain called a volcano . The volcano
, " Look how clever you are , you can put
was special because a man named Bob designed it . Bob
can solve any problem if you try your best ."
lived in a big house called a stable . The stable
, and they lived happily ever after . <|endoftext|> Once upon
he didn 't know what to count . Then he had
door . A little boy named Tim lived in this room
. And they lived happily ever after . <|endoftext|> One sunny
need to be ready before you go play ."
But then something strange happened . He looked out the window
he reached a tall hill . Wil f wanted to go
the box and see what we can do ," Ben says
, Spot heard a snap ! He dropped the nail and
he heard a loud crack ! The branch broke and Tweet
fast driver named Tim went for a ride in his loud
And they all lived happily ever after . <|endoftext|> Once upon
there was a special room called a chamber . The chamber

INTERVAL 0.039 - 0.076
CONTAINS 0.488%

about her adventure . <|endoftext|> Once there was a gifted bear
weak mouse being filled with strength . The mouse grate fully
orns every day . <|endoftext|> Once upon a time there was
volcano . <|endoftext|> Once upon a time , there were two
be filled with wonderful <|endoftext|> Once upon a time , there

INTERVAL 0.002 - 0.039
CONTAINS 42.213%

led and squ irmed , trying to make it more comfortable
You need to share ." " But Mom ,
his toy . Tom 's friend , Sam ,
yes and helped her get on the swing .
I forgive you . And yes , we can be friends

INTERVAL -0.034 - 0.002
CONTAINS 56.486%

boy named Tim my . He loved to play with his
the floor . Mom and Ben looked at the mess .
the backyard , playing with a toy truck . Tim my
just in time ! The party goers were able to use
help her mom . One day , her mom asked her

INTERVAL -0.071 - -0.034
CONTAINS 0.813%

do . Suddenly he heard a voice .
happy and said , " Thank you , Mom my .
were brave and clever . <|endoftext|> Anna and Ben were twins
" Wow , you two are very clever !"
But no matter how much she begged , her mom said
Token sand
Token ID6450
Feature activation+3.21e-02

Pos logit contributions
 chefs+0.004
 builders+0.004
 cooks+0.004
 knights+0.004
 helpers+0.004
Neg logit contributions
ho-0.003
 perspective-0.003
ute-0.003
ci-0.003
 way-0.003
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
ho+0.062
 perspective+0.059
ute+0.057
ci+0.056
 way+0.052
Neg logit contributions
 chefs-0.042
 knights-0.041
 cooks-0.040
 builders-0.040
 helpers-0.040
Token She
Token ID1375
Feature activation-1.15e-02

Pos logit contributions
ho+0.026
 perspective+0.024
ute+0.022
ci+0.022
 way+0.021
Neg logit contributions
 chefs-0.021
 knights-0.020
 cooks-0.020
 builders-0.020
 helpers-0.020
Token used
Token ID973
Feature activation+3.11e-02

Pos logit contributions
 chefs+0.021
 builders+0.021
 knights+0.020
 cooks+0.020
 helpers+0.020
Neg logit contributions
ho-0.017
 perspective-0.014
 way-0.014
ci-0.014
ute-0.013
Token her
Token ID607
Feature activation-2.19e-02

Pos logit contributions
ho+0.054
 perspective+0.050
ute+0.050
ci+0.050
itating+0.044
Neg logit contributions
 knights-0.047
 chefs-0.047
 helpers-0.047
 builders-0.046
 cooks-0.045
Token shovel
Token ID33677
Feature activation+7.58e-02

Pos logit contributions
 knights+0.039
 chefs+0.038
 helpers+0.038
 builders+0.038
 cooks+0.037
Neg logit contributions
ho-0.033
 perspective-0.030
ute-0.028
ci-0.028
itating-0.025
Token to
Token ID284
Feature activation-6.37e-03

Pos logit contributions
 perspective+0.129
ho+0.125
ute+0.120
ci+0.115
 du+0.110
Neg logit contributions
 builders-0.107
 chefs-0.107
 helpers-0.106
iers-0.103
 cooks-0.102
Token dig
Token ID3100
Feature activation+1.07e-02

Pos logit contributions
 chefs+0.010
 builders+0.010
 cooks+0.010
 knights+0.010
 helpers+0.010
Neg logit contributions
ho-0.011
 perspective-0.010
ute-0.009
ci-0.009
 way-0.009
Token it
Token ID340
Feature activation-1.71e-02

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.016
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.016
 knights-0.016
 builders-0.016
 cooks-0.015
 helpers-0.015
Token up
Token ID510
Feature activation+9.76e-03

Pos logit contributions
 chefs+0.025
 knights+0.025
 builders+0.024
 cooks+0.024
 helpers+0.024
Neg logit contributions
ho-0.031
 perspective-0.029
ci-0.027
ute-0.027
 way-0.025
Token and
Token ID290
Feature activation+3.21e-03

Pos logit contributions
ho+0.020
 perspective+0.018
ute+0.018
ci+0.018
 way+0.017
Neg logit contributions
 knights-0.011
 chefs-0.011
 builders-0.011
 helpers-0.011
 cooks-0.011
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
ho+0.029
 perspective+0.026
ci+0.025
ute+0.025
aman+0.023
Neg logit contributions
 cooks-0.019
 helpers-0.018
 chefs-0.018
 builders-0.018
 knights-0.018
Token if
Token ID611
Feature activation+2.97e-02

Pos logit contributions
ho+0.041
 perspective+0.038
ute+0.037
ci+0.036
yt+0.033
Neg logit contributions
 cooks-0.026
 chefs-0.026
 knights-0.026
 builders-0.026
 helpers-0.026
Token you
Token ID345
Feature activation-1.78e-02

Pos logit contributions
ho+0.063
 perspective+0.059
ute+0.058
ci+0.055
aman+0.053
Neg logit contributions
 helpers-0.035
 knights-0.034
 cooks-0.034
 builders-0.034
 chefs-0.033
Token don
Token ID836
Feature activation-2.35e-03

Pos logit contributions
 chefs+0.026
 knights+0.026
 cooks+0.026
 builders+0.026
 helpers+0.026
Neg logit contributions
ho-0.031
 perspective-0.029
ute-0.027
ci-0.027
 way-0.026
Token't
Token ID470
Feature activation+1.24e-02

Pos logit contributions
 chefs+0.004
 cooks+0.004
 builders+0.004
 knights+0.004
 helpers+0.004
Neg logit contributions
ho-0.004
 perspective-0.003
ci-0.003
ute-0.003
 way-0.003
Token stop
Token ID2245
Feature activation+7.31e-02

Pos logit contributions
ho+0.027
ci+0.024
aman+0.024
 perspective+0.024
ute+0.023
Neg logit contributions
 cooks-0.012
 helpers-0.011
 builders-0.011
 boys-0.011
 chefs-0.011
Token talking
Token ID3375
Feature activation+1.34e-02

Pos logit contributions
ho+0.149
 perspective+0.133
ute+0.130
ci+0.126
 redd+0.126
Neg logit contributions
 cooks-0.095
 pirates-0.090
 chefs-0.087
 helpers-0.087
 boys-0.086
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
ho+0.024
 perspective+0.022
ci+0.021
ute+0.021
 way+0.020
Neg logit contributions
 chefs-0.019
 knights-0.019
 cooks-0.019
 builders-0.019
 helpers-0.019
Token I
Token ID314
Feature activation+5.86e-03

Pos logit contributions
ho+0.050
 perspective+0.047
ute+0.045
ci+0.044
aman+0.040
Neg logit contributions
 builders-0.031
 cooks-0.031
 knights-0.031
 helpers-0.031
 chefs-0.031
Token will
Token ID481
Feature activation+3.28e-02

Pos logit contributions
ho+0.009
 perspective+0.009
ci+0.008
ute+0.008
 way+0.007
Neg logit contributions
 cooks-0.010
 chefs-0.009
 helpers-0.009
 builders-0.009
 knights-0.009
Token have
Token ID423
Feature activation+1.72e-02

Pos logit contributions
ho+0.068
 perspective+0.061
ci+0.061
ute+0.057
spe+0.056
Neg logit contributions
 cooks-0.039
 helpers-0.038
 chefs-0.036
 builders-0.036
 pirates-0.036
Token picks
Token ID11103
Feature activation-2.21e-03

Pos logit contributions
 chefs+0.015
 builders+0.015
 knights+0.015
 cooks+0.015
 helpers+0.015
Neg logit contributions
ho-0.017
 perspective-0.015
ute-0.015
ci-0.014
 way-0.014
Token up
Token ID510
Feature activation+5.22e-03

Pos logit contributions
 chefs+0.004
 knights+0.004
 cooks+0.004
 builders+0.004
 helpers+0.004
Neg logit contributions
ho-0.003
 perspective-0.003
ci-0.003
ute-0.003
 way-0.002
Token her
Token ID607
Feature activation-2.33e-02

Pos logit contributions
ho+0.009
 perspective+0.008
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 chefs-0.008
 builders-0.008
 cooks-0.007
 knights-0.007
 helpers-0.007
Token bucket
Token ID19236
Feature activation+3.39e-02

Pos logit contributions
 chefs+0.039
 cooks+0.039
 builders+0.039
 knights+0.038
 helpers+0.038
Neg logit contributions
ho-0.036
 perspective-0.032
ute-0.031
ci-0.031
 way-0.030
Token and
Token ID290
Feature activation+4.11e-03

Pos logit contributions
 perspective+0.070
ho+0.069
ute+0.065
ci+0.064
 way+0.062
Neg logit contributions
 knights-0.038
 chefs-0.037
 builders-0.037
 cooks-0.036
 helpers-0.036
Token shovel
Token ID33677
Feature activation+7.19e-02

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.007
 way+0.006
Neg logit contributions
 chefs-0.006
 knights-0.006
 cooks-0.006
 builders-0.006
 helpers-0.006
Token and
Token ID290
Feature activation-1.33e-03

Pos logit contributions
 perspective+0.108
ho+0.101
ute+0.099
ci+0.095
 way+0.091
Neg logit contributions
 chefs-0.121
 cooks-0.118
 knights-0.117
 helpers-0.117
 builders-0.117
Token walks
Token ID11114
Feature activation+1.27e-02

Pos logit contributions
 chefs+0.002
 knights+0.002
 builders+0.002
 cooks+0.002
 helpers+0.002
Neg logit contributions
ho-0.002
 perspective-0.002
ute-0.002
ci-0.002
 way-0.002
Token to
Token ID284
Feature activation+8.42e-03

Pos logit contributions
 perspective+0.020
ho+0.019
ute+0.019
ci+0.019
 way+0.017
Neg logit contributions
 knights-0.020
 chefs-0.020
 builders-0.020
 helpers-0.020
 cooks-0.020
Token him
Token ID683
Feature activation+2.23e-03

Pos logit contributions
ho+0.017
 perspective+0.016
ute+0.015
ci+0.015
 way+0.015
Neg logit contributions
 chefs-0.010
 builders-0.010
 knights-0.010
 cooks-0.010
 helpers-0.010
Token.
Token ID13
Feature activation+5.08e-04

Pos logit contributions
ho+0.004
 perspective+0.004
ute+0.004
ci+0.004
 way+0.004
Neg logit contributions
 knights-0.003
 chefs-0.003
 cooks-0.003
 helpers-0.003
 builders-0.003
Token.
Token ID13
Feature activation+8.87e-03

Pos logit contributions
ho+0.040
 perspective+0.036
ute+0.035
ci+0.034
 way+0.034
Neg logit contributions
 chefs-0.032
 builders-0.031
 knights-0.031
 cooks-0.031
 helpers-0.031
Token They
Token ID1119
Feature activation-1.31e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ute+0.011
 way+0.011
ci+0.010
Neg logit contributions
 builders-0.015
 helpers-0.015
 chefs-0.015
 cooks-0.015
 knights-0.015
Token used
Token ID973
Feature activation+2.78e-02

Pos logit contributions
 chefs+0.019
 knights+0.019
 builders+0.019
 cooks+0.019
 helpers+0.018
Neg logit contributions
ho-0.024
 perspective-0.021
ci-0.020
ute-0.020
 way-0.020
Token buckets
Token ID38674
Feature activation+2.81e-02

Pos logit contributions
ho+0.052
 perspective+0.048
ci+0.047
ute+0.047
itating+0.043
Neg logit contributions
 chefs-0.040
 knights-0.039
 builders-0.039
 helpers-0.039
 pirates-0.038
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
ho+0.053
 perspective+0.050
ute+0.048
ci+0.048
 way+0.046
Neg logit contributions
 chefs-0.037
 knights-0.037
 builders-0.037
 cooks-0.036
 helpers-0.036
Token shovel
Token ID33677
Feature activation+7.08e-02

Pos logit contributions
 chefs+0.029
 builders+0.028
 knights+0.028
 cooks+0.028
 helpers+0.028
Neg logit contributions
ho-0.031
 perspective-0.027
 way-0.027
ci-0.026
ute-0.026
Tokens
Token ID82
Feature activation-2.10e-02

Pos logit contributions
 perspective+0.104
ho+0.102
ute+0.096
ci+0.094
 way+0.090
Neg logit contributions
 chefs-0.118
 helpers-0.116
 builders-0.115
 cooks-0.114
iers-0.113
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
 chefs+0.034
 builders+0.034
 knights+0.034
 cooks+0.034
 helpers+0.033
Neg logit contributions
ho-0.034
 perspective-0.030
 way-0.028
ute-0.028
ci-0.028
Token shells
Token ID19679
Feature activation+2.59e-02

Pos logit contributions
 chefs+0.021
 builders+0.021
 cooks+0.021
 knights+0.021
 helpers+0.020
Neg logit contributions
ho-0.021
 perspective-0.019
 way-0.019
ci-0.018
ute-0.017
Token and
Token ID290
Feature activation-2.05e-02

Pos logit contributions
ho+0.049
 perspective+0.046
ute+0.044
ci+0.044
 way+0.042
Neg logit contributions
 chefs-0.034
 knights-0.034
 builders-0.034
 cooks-0.034
 helpers-0.033
Token stones
Token ID14966
Feature activation+3.47e-02

Pos logit contributions
 chefs+0.032
 builders+0.031
 cooks+0.031
 helpers+0.031
 knights+0.031
Neg logit contributions
ho-0.033
 perspective-0.029
 way-0.029
ci-0.028
ute-0.026
Token They
Token ID1119
Feature activation+3.72e-03

Pos logit contributions
ho+0.011
 perspective+0.010
ute+0.009
ci+0.009
 way+0.009
Neg logit contributions
 chefs-0.007
 knights-0.007
 builders-0.007
 cooks-0.007
 helpers-0.007
Token share
Token ID2648
Feature activation+1.01e-02

Pos logit contributions
ho+0.006
 perspective+0.006
ute+0.005
ci+0.005
 way+0.005
Neg logit contributions
 knights-0.006
 chefs-0.006
 builders-0.006
 cooks-0.006
 helpers-0.006
Token their
Token ID511
Feature activation-6.25e-03

Pos logit contributions
ho+0.017
 perspective+0.016
ute+0.015
ci+0.015
 way+0.014
Neg logit contributions
 chefs-0.016
 knights-0.016
 cooks-0.016
 helpers-0.016
 builders-0.016
Token bucket
Token ID19236
Feature activation+3.52e-02

Pos logit contributions
 chefs+0.009
 knights+0.009
 builders+0.009
 cooks+0.009
 helpers+0.009
Neg logit contributions
ho-0.011
 perspective-0.010
ute-0.010
ci-0.010
 way-0.010
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
ho+0.072
 perspective+0.070
ute+0.066
ci+0.065
 way+0.062
Neg logit contributions
 knights-0.041
 chefs-0.040
 builders-0.040
 helpers-0.039
 cooks-0.039
Token shovel
Token ID33677
Feature activation+7.05e-02

Pos logit contributions
ho+0.023
 perspective+0.022
ute+0.021
ci+0.020
 way+0.019
Neg logit contributions
 chefs-0.020
 knights-0.020
 builders-0.020
 cooks-0.020
 helpers-0.020
Token with
Token ID351
Feature activation+7.24e-03

Pos logit contributions
 perspective+0.065
ho+0.065
ute+0.056
 way+0.052
ci+0.050
Neg logit contributions
 chefs-0.161
 builders-0.159
 knights-0.158
 helpers-0.157
 cooks-0.156
Token Mia
Token ID32189
Feature activation+1.10e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ute+0.012
ci+0.011
aman+0.011
Neg logit contributions
 knights-0.011
 chefs-0.011
 builders-0.011
 helpers-0.010
 cooks-0.010
Token.
Token ID13
Feature activation+7.54e-03

Pos logit contributions
ho+0.020
 perspective+0.019
ute+0.018
ci+0.017
aman+0.017
Neg logit contributions
 knights-0.016
 cooks-0.015
 chefs-0.015
 helpers-0.015
 builders-0.015
Token They
Token ID1119
Feature activation-7.84e-03

Pos logit contributions
ho+0.014
 perspective+0.013
ute+0.012
ci+0.012
 way+0.012
Neg logit contributions
 chefs-0.011
 builders-0.011
 knights-0.011
 cooks-0.011
 helpers-0.010
Token make
Token ID787
Feature activation-2.04e-03

Pos logit contributions
 chefs+0.012
 knights+0.011
 builders+0.011
 cooks+0.011
 helpers+0.011
Neg logit contributions
ho-0.014
 perspective-0.013
ute-0.012
ci-0.012
 way-0.012
Token They
Token ID1119
Feature activation-7.87e-03

Pos logit contributions
ho+0.005
 perspective+0.004
ute+0.004
 way+0.004
ci+0.004
Neg logit contributions
 chefs-0.005
 cooks-0.005
 helpers-0.005
 builders-0.005
 knights-0.005
Token took
Token ID1718
Feature activation-1.10e-02

Pos logit contributions
 chefs+0.012
 builders+0.012
 knights+0.012
 cooks+0.012
 helpers+0.012
Neg logit contributions
ho-0.014
 perspective-0.012
ci-0.012
 way-0.011
ute-0.011
Token their
Token ID511
Feature activation-4.01e-03

Pos logit contributions
 chefs+0.017
 cooks+0.017
 builders+0.016
 knights+0.016
 helpers+0.016
Neg logit contributions
ho-0.018
 way-0.016
 perspective-0.016
ute-0.016
ci-0.016
Token bucket
Token ID19236
Feature activation+2.88e-02

Pos logit contributions
 chefs+0.006
 knights+0.006
 helpers+0.006
 cooks+0.006
 builders+0.006
Neg logit contributions
ho-0.007
 perspective-0.007
ute-0.006
ci-0.006
itating-0.006
Token and
Token ID290
Feature activation+4.59e-04

Pos logit contributions
ho+0.058
 perspective+0.057
ute+0.054
ci+0.053
 way+0.051
Neg logit contributions
 chefs-0.033
 knights-0.033
 builders-0.033
 helpers-0.032
 cooks-0.032
Token shovel
Token ID33677
Feature activation+7.04e-02

Pos logit contributions
ho+0.001
ci+0.001
 perspective+0.001
 way+0.001
ute+0.001
Neg logit contributions
 chefs-0.001
 builders-0.001
 knights-0.001
 cooks-0.001
 helpers-0.001
Token.
Token ID13
Feature activation+7.40e-03

Pos logit contributions
 perspective+0.117
ho+0.113
ute+0.110
ci+0.105
 way+0.101
Neg logit contributions
 chefs-0.106
 builders-0.104
 helpers-0.102
 knights-0.101
 cooks-0.100
Token
Token ID198
Feature activation-3.02e-03

Pos logit contributions
ho+0.012
 perspective+0.011
ute+0.011
 way+0.010
ci+0.010
Neg logit contributions
 chefs-0.012
 cooks-0.011
 knights-0.011
 builders-0.011
 helpers-0.011
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
 chefs+0.005
 knights+0.005
 builders+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.005
 perspective-0.004
ute-0.004
ci-0.004
 way-0.004
TokenThey
Token ID2990
Feature activation-1.64e-02

Pos logit contributions
 chefs+0.029
 builders+0.029
 knights+0.029
 cooks+0.029
 helpers+0.029
Neg logit contributions
ho-0.026
 perspective-0.023
ci-0.022
ute-0.022
 way-0.021
Token found
Token ID1043
Feature activation-7.69e-03

Pos logit contributions
 chefs+0.025
 builders+0.024
 knights+0.024
 cooks+0.024
 helpers+0.024
Neg logit contributions
ho-0.029
 perspective-0.025
ci-0.025
ute-0.024
 way-0.024
Token have
Token ID423
Feature activation+1.26e-02

Pos logit contributions
 chefs+0.011
 knights+0.011
 builders+0.011
 cooks+0.011
 helpers+0.011
Neg logit contributions
ho-0.015
 perspective-0.013
ute-0.013
ci-0.013
 way-0.012
Token a
Token ID257
Feature activation+3.20e-02

Pos logit contributions
ho+0.021
 perspective+0.019
ute+0.019
ci+0.018
 way+0.016
Neg logit contributions
 chefs-0.021
 knights-0.021
 builders-0.020
 cooks-0.020
 helpers-0.020
Token bad
Token ID2089
Feature activation+1.01e-02

Pos logit contributions
ci+0.057
ho+0.056
 perspective+0.050
ute+0.048
itating+0.047
Neg logit contributions
 chefs-0.047
 knights-0.046
 helpers-0.046
 spies-0.045
 pirates-0.045
Token bucket
Token ID19236
Feature activation+3.05e-02

Pos logit contributions
ho+0.016
 perspective+0.015
ci+0.015
ute+0.014
itating+0.014
Neg logit contributions
 knights-0.016
 chefs-0.016
 helpers-0.016
 builders-0.016
 cooks-0.016
Token and
Token ID290
Feature activation-3.61e-03

Pos logit contributions
ho+0.058
 perspective+0.054
ute+0.051
ci+0.051
 way+0.049
Neg logit contributions
 chefs-0.041
 knights-0.040
 builders-0.040
 cooks-0.040
 helpers-0.040
Token shovel
Token ID33677
Feature activation+6.99e-02

Pos logit contributions
 chefs+0.005
 builders+0.005
 knights+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.006
 perspective-0.006
ci-0.005
 way-0.005
ute-0.005
Token.
Token ID13
Feature activation+5.70e-03

Pos logit contributions
ho+0.112
 perspective+0.109
ute+0.101
ci+0.099
 way+0.098
Neg logit contributions
 chefs-0.111
 builders-0.109
 cooks-0.108
 helpers-0.107
 knights-0.107
Token They
Token ID1119
Feature activation-9.51e-03

Pos logit contributions
ho+0.009
 perspective+0.008
ute+0.008
ci+0.008
 way+0.007
Neg logit contributions
 builders-0.009
 helpers-0.009
 chefs-0.009
 cooks-0.009
 knights-0.009
Token have
Token ID423
Feature activation+1.03e-02

Pos logit contributions
 chefs+0.014
 knights+0.014
 builders+0.014
 helpers+0.014
 cooks+0.014
Neg logit contributions
ho-0.017
 perspective-0.015
ute-0.014
ci-0.014
 way-0.014
Token no
Token ID645
Feature activation-1.92e-03

Pos logit contributions
ho+0.017
 perspective+0.015
ute+0.015
ci+0.015
 way+0.013
Neg logit contributions
 chefs-0.017
 knights-0.017
 builders-0.017
 cooks-0.017
 helpers-0.017
Token fun
Token ID1257
Feature activation+1.87e-02

Pos logit contributions
 chefs+0.003
 knights+0.003
 builders+0.003
 cooks+0.003
 helpers+0.003
Neg logit contributions
ho-0.003
 perspective-0.003
ute-0.003
ci-0.003
 way-0.003
Token He
Token ID679
Feature activation-1.81e-03

Pos logit contributions
ho+0.016
 perspective+0.015
ute+0.014
ci+0.014
 way+0.014
Neg logit contributions
 chefs-0.016
 builders-0.015
 cooks-0.015
 knights-0.015
 helpers-0.015
Token took
Token ID1718
Feature activation-4.47e-03

Pos logit contributions
 chefs+0.003
 builders+0.003
 knights+0.003
 cooks+0.003
 helpers+0.003
Neg logit contributions
ho-0.003
 perspective-0.003
ci-0.003
 way-0.002
ute-0.002
Token his
Token ID465
Feature activation-2.24e-03

Pos logit contributions
 chefs+0.007
 cooks+0.007
 builders+0.007
 helpers+0.006
 knights+0.006
Neg logit contributions
ho-0.007
 way-0.007
 perspective-0.006
ci-0.006
ute-0.006
Token bucket
Token ID19236
Feature activation+3.84e-02

Pos logit contributions
 chefs+0.004
 knights+0.004
 builders+0.004
 cooks+0.004
 helpers+0.004
Neg logit contributions
ho-0.004
 perspective-0.003
ute-0.003
ci-0.003
 way-0.003
Token and
Token ID290
Feature activation-4.42e-03

Pos logit contributions
ho+0.074
 perspective+0.071
ute+0.067
ci+0.066
 way+0.064
Neg logit contributions
 chefs-0.049
 builders-0.049
 knights-0.049
 helpers-0.047
 cooks-0.047
Token shovel
Token ID33677
Feature activation+6.96e-02

Pos logit contributions
 chefs+0.007
 builders+0.007
 knights+0.007
 cooks+0.007
 helpers+0.007
Neg logit contributions
ho-0.007
 perspective-0.006
ci-0.006
 way-0.006
ute-0.006
Token and
Token ID290
Feature activation-1.15e-02

Pos logit contributions
ho+0.116
 perspective+0.115
ute+0.108
ci+0.105
 way+0.100
Neg logit contributions
 chefs-0.104
 builders-0.103
 knights-0.102
 helpers-0.101
iers-0.100
Token went
Token ID1816
Feature activation-2.81e-03

Pos logit contributions
 chefs+0.017
 builders+0.017
 cooks+0.017
 knights+0.017
 helpers+0.017
Neg logit contributions
ho-0.020
 perspective-0.018
ci-0.018
 way-0.017
ute-0.017
Token to
Token ID284
Feature activation+6.77e-04

Pos logit contributions
 chefs+0.005
 knights+0.005
 builders+0.005
 helpers+0.004
 cooks+0.004
Neg logit contributions
ho-0.005
 perspective-0.004
ci-0.004
ute-0.004
 way-0.004
Token another
Token ID1194
Feature activation+1.85e-02

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 chefs-0.001
 builders-0.001
 knights-0.001
 cooks-0.001
 helpers-0.001
Token spot
Token ID4136
Feature activation+2.53e-02

Pos logit contributions
ho+0.031
ute+0.027
 perspective+0.027
ci+0.026
aman+0.024
Neg logit contributions
 chefs-0.031
 knights-0.031
 builders-0.030
 cooks-0.029
 pirates-0.029
Token then
Token ID788
Feature activation-4.44e-03

Pos logit contributions
 chefs+0.024
 builders+0.024
 knights+0.023
 helpers+0.023
 cooks+0.023
Neg logit contributions
ho-0.029
 perspective-0.026
ci-0.025
 way-0.024
ute-0.024
Token carefully
Token ID7773
Feature activation-9.75e-03

Pos logit contributions
 chefs+0.006
 builders+0.006
 knights+0.006
 helpers+0.006
 cooks+0.006
Neg logit contributions
ho-0.008
 perspective-0.007
ci-0.007
 way-0.007
ute-0.007
Token covered
Token ID5017
Feature activation+1.13e-02

Pos logit contributions
 builders+0.015
 chefs+0.015
 helpers+0.015
 knights+0.015
 spies+0.015
Neg logit contributions
ho-0.016
 way-0.014
ci-0.013
 perspective-0.013
ute-0.013
Token it
Token ID340
Feature activation-1.02e-02

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.017
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.018
 knights-0.017
 builders-0.017
 cooks-0.017
 helpers-0.017
Token with
Token ID351
Feature activation+5.02e-03

Pos logit contributions
 chefs+0.016
 knights+0.016
 cooks+0.016
 builders+0.016
 pirates+0.016
Neg logit contributions
ho-0.017
 perspective-0.016
ute-0.015
ci-0.014
aman-0.014
Token shovel
Token ID33677
Feature activation+6.90e-02

Pos logit contributions
ho+0.009
 perspective+0.008
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 chefs-0.007
 builders-0.007
 knights-0.007
 cooks-0.007
 helpers-0.007
Tokens
Token ID82
Feature activation-8.73e-03

Pos logit contributions
 perspective+0.089
ho+0.089
ute+0.084
ci+0.079
 way+0.075
Neg logit contributions
 chefs-0.129
 builders-0.125
 helpers-0.125
 cooks-0.124
 knights-0.121
Token full
Token ID1336
Feature activation-2.29e-02

Pos logit contributions
 chefs+0.013
 builders+0.013
 cooks+0.013
 knights+0.013
 helpers+0.013
Neg logit contributions
ho-0.015
 perspective-0.014
ute-0.013
 way-0.013
ci-0.013
Token of
Token ID286
Feature activation+1.10e-03

Pos logit contributions
 chefs+0.030
 builders+0.029
 knights+0.029
 cooks+0.029
 helpers+0.029
Neg logit contributions
ho-0.044
 perspective-0.041
ute-0.039
ci-0.038
 way-0.038
Token dirt
Token ID13647
Feature activation+1.26e-02

Pos logit contributions
ho+0.002
 perspective+0.002
 way+0.002
ute+0.002
ci+0.002
Neg logit contributions
 chefs-0.002
 builders-0.002
 cooks-0.002
 knights-0.002
 helpers-0.002
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
ho+0.023
 perspective+0.021
ute+0.021
ci+0.020
 way+0.020
Neg logit contributions
 chefs-0.017
 knights-0.017
 builders-0.017
 cooks-0.017
 helpers-0.017
Token the
Token ID262
Feature activation+6.87e-03

Pos logit contributions
ho+0.013
ute+0.012
 perspective+0.012
ci+0.012
 way+0.011
Neg logit contributions
 chefs-0.008
 knights-0.008
 builders-0.008
 cooks-0.008
 helpers-0.008
Token sand
Token ID6450
Feature activation+2.42e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ci+0.012
ute+0.011
spe+0.011
Neg logit contributions
 chefs-0.010
 knights-0.010
 builders-0.009
 cooks-0.009
 pirates-0.009
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
ho+0.045
 perspective+0.041
ute+0.039
ci+0.039
 way+0.038
Neg logit contributions
 chefs-0.034
 knights-0.034
 builders-0.034
 cooks-0.033
 helpers-0.033
Token They
Token ID1119
Feature activation-8.85e-03

Pos logit contributions
ho+0.029
 perspective+0.026
ute+0.026
 way+0.025
ci+0.024
Neg logit contributions
 builders-0.029
 helpers-0.028
 chefs-0.028
 cooks-0.028
 knights-0.027
Token had
Token ID550
Feature activation+5.10e-03

Pos logit contributions
 chefs+0.012
 knights+0.012
 cooks+0.012
 builders+0.012
 helpers+0.011
Neg logit contributions
ho-0.017
 perspective-0.016
ute-0.015
ci-0.015
 way-0.014
Token shovel
Token ID33677
Feature activation+6.89e-02

Pos logit contributions
ho+0.009
ute+0.009
ci+0.009
 perspective+0.009
aman+0.008
Neg logit contributions
 knights-0.007
 chefs-0.007
 builders-0.007
 pirates-0.007
 cooks-0.007
Tokens
Token ID82
Feature activation-1.26e-02

Pos logit contributions
 perspective+0.073
ho+0.072
ute+0.065
ci+0.062
 way+0.058
Neg logit contributions
 chefs-0.146
 builders-0.144
 cooks-0.142
 helpers-0.141
 pirates-0.140
Token and
Token ID290
Feature activation+2.35e-03

Pos logit contributions
 chefs+0.018
 knights+0.017
 builders+0.017
 cooks+0.017
 helpers+0.017
Neg logit contributions
ho-0.023
 perspective-0.021
ci-0.021
ute-0.020
 way-0.019
Token buckets
Token ID38674
Feature activation+2.71e-02

Pos logit contributions
ho+0.004
 perspective+0.004
ute+0.004
ci+0.003
aman+0.003
Neg logit contributions
 chefs-0.004
 knights-0.004
 builders-0.004
 cooks-0.004
 helpers-0.004
Token and
Token ID290
Feature activation-8.86e-03

Pos logit contributions
ho+0.052
 perspective+0.048
ci+0.047
ute+0.046
 way+0.044
Neg logit contributions
 chefs-0.036
 builders-0.036
 knights-0.036
 cooks-0.035
 helpers-0.035
Token made
Token ID925
Feature activation-9.65e-03

Pos logit contributions
 chefs+0.015
 knights+0.014
 builders+0.014
 cooks+0.014
 helpers+0.014
Neg logit contributions
ho-0.014
 perspective-0.013
ute-0.013
ci-0.012
 way-0.012
Token with
Token ID351
Feature activation-5.69e-03

Pos logit contributions
ho+0.028
 perspective+0.026
ute+0.025
ci+0.024
 way+0.024
Neg logit contributions
 chefs-0.020
 knights-0.020
 cooks-0.020
 builders-0.020
 helpers-0.019
Token her
Token ID607
Feature activation-1.93e-02

Pos logit contributions
 chefs+0.008
 knights+0.008
 builders+0.008
 cooks+0.008
 helpers+0.008
Neg logit contributions
ho-0.011
 perspective-0.010
ci-0.010
ute-0.010
spe-0.009
Token yellow
Token ID7872
Feature activation-5.51e-03

Pos logit contributions
 chefs+0.027
 helpers+0.027
 builders+0.026
 knights+0.026
 boys+0.025
Neg logit contributions
ho-0.036
 perspective-0.035
aman-0.032
ute-0.031
itating-0.031
Token bucket
Token ID19236
Feature activation+3.14e-02

Pos logit contributions
 knights+0.009
 chefs+0.009
 pirates+0.009
 helpers+0.009
 builders+0.009
Neg logit contributions
ho-0.008
itating-0.008
 perspective-0.008
ci-0.008
spe-0.007
Token and
Token ID290
Feature activation+3.88e-04

Pos logit contributions
 perspective+0.070
ho+0.069
ute+0.063
aman+0.063
itating+0.061
Neg logit contributions
 knights-0.029
 helpers-0.027
 cooks-0.027
 chefs-0.027
 builders-0.027
Token shovel
Token ID33677
Feature activation+6.88e-02

Pos logit contributions
ute+0.001
ho+0.001
 perspective+0.001
itating+0.001
ci+0.001
Neg logit contributions
 chefs-0.001
 cooks-0.001
 builders-0.001
 knights-0.001
 helpers-0.001
Token.
Token ID13
Feature activation+2.50e-02

Pos logit contributions
 perspective+0.122
ho+0.115
ute+0.109
spe+0.106
ci+0.105
Neg logit contributions
 chefs-0.096
 builders-0.092
 helpers-0.092
 cooks-0.091
 boys-0.090
Token One
Token ID1881
Feature activation+4.64e-03

Pos logit contributions
ho+0.051
 perspective+0.047
spe+0.046
ci+0.045
ute+0.044
Neg logit contributions
 knights-0.030
 chefs-0.027
 cooks-0.027
 pirates-0.026
 boys-0.026
Token day
Token ID1110
Feature activation-4.09e-03

Pos logit contributions
ho+0.006
 perspective+0.005
 wrists+0.005
ute+0.005
ci+0.005
Neg logit contributions
 cooks-0.009
 chefs-0.009
 pirates-0.009
 builders-0.009
 helpers-0.009
Token,
Token ID11
Feature activation+7.46e-03

Pos logit contributions
 chefs+0.005
 knights+0.005
 cooks+0.005
 builders+0.005
 helpers+0.005
Neg logit contributions
ho-0.008
 perspective-0.007
ci-0.007
ute-0.007
 way-0.007
Token she
Token ID673
Feature activation-7.44e-03

Pos logit contributions
ho+0.013
 perspective+0.013
ci+0.012
ute+0.012
spe+0.011
Neg logit contributions
 chefs-0.011
 knights-0.011
 cooks-0.011
 builders-0.010
 helpers-0.010
Token and
Token ID290
Feature activation-5.48e-03

Pos logit contributions
 chefs+0.004
 builders+0.004
 knights+0.004
 cooks+0.004
 helpers+0.004
Neg logit contributions
ho-0.005
 perspective-0.005
ute-0.005
ci-0.005
 way-0.005
Token touched
Token ID12615
Feature activation+1.19e-02

Pos logit contributions
 builders+0.009
 chefs+0.009
 knights+0.009
 helpers+0.009
 cooks+0.009
Neg logit contributions
ho-0.009
 perspective-0.008
ci-0.008
 way-0.008
ute-0.007
Token it
Token ID340
Feature activation-5.50e-03

Pos logit contributions
ho+0.020
 perspective+0.018
ci+0.018
ute+0.018
spe+0.017
Neg logit contributions
 chefs-0.018
 cooks-0.018
 knights-0.018
 pirates-0.018
 builders-0.017
Token with
Token ID351
Feature activation+1.60e-02

Pos logit contributions
 knights+0.006
 cooks+0.006
 chefs+0.006
 pirates+0.005
 builders+0.005
Neg logit contributions
 perspective-0.012
ho-0.011
ci-0.011
aman-0.011
ute-0.010
Token his
Token ID465
Feature activation+4.05e-03

Pos logit contributions
ho+0.035
ci+0.033
 perspective+0.033
ute+0.033
spe+0.030
Neg logit contributions
 knights-0.017
 chefs-0.017
 cooks-0.016
 builders-0.016
 pirates-0.016
Token fins
Token ID42080
Feature activation+6.87e-02

Pos logit contributions
ho+0.007
 perspective+0.006
ci+0.006
ute+0.006
 way+0.006
Neg logit contributions
 chefs-0.006
 knights-0.006
 cooks-0.006
 builders-0.006
 helpers-0.006
Token.
Token ID13
Feature activation-8.32e-03

Pos logit contributions
 perspective+0.151
ci+0.149
ho+0.146
aman+0.143
ute+0.140
Neg logit contributions
 chefs-0.060
 cooks-0.060
 knights-0.058
 pirates-0.056
 boys-0.056
Token The
Token ID383
Feature activation+8.82e-03

Pos logit contributions
 chefs+0.013
 builders+0.012
 knights+0.012
 cooks+0.012
 helpers+0.012
Neg logit contributions
ho-0.015
 perspective-0.013
ute-0.013
ci-0.013
 way-0.012
Token rock
Token ID3881
Feature activation+3.15e-02

Pos logit contributions
ho+0.015
 perspective+0.014
ci+0.014
ute+0.013
 way+0.013
Neg logit contributions
 chefs-0.013
 knights-0.013
 cooks-0.013
 builders-0.013
 helpers-0.013
Token felt
Token ID2936
Feature activation-1.22e-02

Pos logit contributions
ho+0.051
 perspective+0.051
ute+0.047
ci+0.046
aman+0.044
Neg logit contributions
 knights-0.048
 cooks-0.048
 chefs-0.048
 builders-0.047
 helpers-0.046
Token bump
Token ID13852
Feature activation+1.87e-02

Pos logit contributions
 builders+0.021
 chefs+0.021
 helpers+0.021
 knights+0.020
 cooks+0.020
Neg logit contributions
ho-0.019
 perspective-0.017
 way-0.016
ute-0.016
 high-0.015
Token angry
Token ID7954
Feature activation+2.73e-03

Pos logit contributions
ho+0.024
 perspective+0.023
ute+0.022
ci+0.021
 way+0.020
Neg logit contributions
 knights-0.020
 chefs-0.020
 cooks-0.019
 builders-0.019
 helpers-0.019
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
ho+0.005
 perspective+0.004
ute+0.004
ci+0.004
 way+0.004
Neg logit contributions
 chefs-0.004
 knights-0.004
 builders-0.004
 cooks-0.004
 helpers-0.004
Token They
Token ID1119
Feature activation+1.24e-03

Pos logit contributions
ho+0.023
 perspective+0.022
ute+0.021
ci+0.021
 way+0.020
Neg logit contributions
 chefs-0.013
 knights-0.012
 builders-0.012
 cooks-0.012
 helpers-0.012
Token throw
Token ID3714
Feature activation+2.29e-02

Pos logit contributions
ho+0.002
 perspective+0.002
ute+0.002
ci+0.002
 way+0.002
Neg logit contributions
 chefs-0.002
 cooks-0.002
 knights-0.002
 builders-0.002
 helpers-0.002
Token their
Token ID511
Feature activation+2.50e-03

Pos logit contributions
ho+0.042
 perspective+0.039
ute+0.038
ci+0.038
 way+0.035
Neg logit contributions
 chefs-0.032
 knights-0.032
 builders-0.032
 cooks-0.031
 helpers-0.031
Token shovel
Token ID33677
Feature activation+6.87e-02

Pos logit contributions
ho+0.004
 perspective+0.004
ute+0.004
ci+0.004
aman+0.004
Neg logit contributions
 knights-0.004
 chefs-0.004
 builders-0.004
 helpers-0.004
 cooks-0.004
Tokens
Token ID82
Feature activation+9.01e-03

Pos logit contributions
 perspective+0.139
ho+0.135
ci+0.131
ute+0.131
aman+0.122
Neg logit contributions
 builders-0.082
 chefs-0.080
 boys-0.076
 knights-0.075
 helpers-0.075
Token and
Token ID290
Feature activation+1.94e-04

Pos logit contributions
ci+0.019
 perspective+0.018
ho+0.018
ute+0.018
aman+0.017
Neg logit contributions
 knights-0.010
 chefs-0.010
 builders-0.009
 pirates-0.009
 spies-0.009
Token hats
Token ID23910
Feature activation+3.82e-02

Pos logit contributions
ho+0.000
 perspective+0.000
ute+0.000
ci+0.000
aman+0.000
Neg logit contributions
 chefs-0.000
 knights-0.000
 builders-0.000
 cooks-0.000
 helpers-0.000
Token on
Token ID319
Feature activation+1.21e-02

Pos logit contributions
ho+0.073
 perspective+0.070
ci+0.067
ute+0.066
 way+0.063
Neg logit contributions
 chefs-0.049
 knights-0.049
 builders-0.049
 cooks-0.048
 helpers-0.047
Token the
Token ID262
Feature activation+3.58e-03

Pos logit contributions
ho+0.024
 perspective+0.022
ci+0.022
ute+0.022
 way+0.020
Neg logit contributions
 chefs-0.015
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.015
Token She
Token ID1375
Feature activation-1.10e-02

Pos logit contributions
ho+0.010
 perspective+0.009
ute+0.009
ci+0.009
 way+0.008
Neg logit contributions
 chefs-0.009
 knights-0.009
 builders-0.009
 cooks-0.009
 helpers-0.009
Token used
Token ID973
Feature activation+3.46e-02

Pos logit contributions
 cooks+0.016
 chefs+0.016
 knights+0.016
 helpers+0.016
 builders+0.016
Neg logit contributions
ho-0.019
 perspective-0.018
ute-0.017
ci-0.016
 way-0.016
Token her
Token ID607
Feature activation-1.27e-02

Pos logit contributions
ho+0.065
ute+0.063
 perspective+0.063
ci+0.062
itating+0.057
Neg logit contributions
 helpers-0.045
 chefs-0.044
 knights-0.042
 cooks-0.042
 builders-0.042
Token bucket
Token ID19236
Feature activation+4.00e-02

Pos logit contributions
 helpers+0.020
 chefs+0.020
 cooks+0.020
 knights+0.020
 builders+0.019
Neg logit contributions
ho-0.020
 perspective-0.020
ci-0.018
ute-0.018
itating-0.017
Token and
Token ID290
Feature activation-9.74e-03

Pos logit contributions
 perspective+0.069
ho+0.065
ute+0.064
ci+0.061
 photo+0.058
Neg logit contributions
 helpers-0.054
 builders-0.054
 chefs-0.054
 knights-0.053
iers-0.053
Token shovel
Token ID33677
Feature activation+6.87e-02

Pos logit contributions
 chefs+0.015
 knights+0.015
 builders+0.015
 cooks+0.015
 helpers+0.014
Neg logit contributions
ho-0.017
 perspective-0.015
ute-0.015
ci-0.014
 way-0.014
Token to
Token ID284
Feature activation+7.14e-03

Pos logit contributions
 perspective+0.116
ute+0.111
ho+0.110
ci+0.104
 du+0.100
Neg logit contributions
iers-0.098
 chefs-0.096
 helpers-0.094
 builders-0.094
 cooks-0.091
Token help
Token ID1037
Feature activation+1.52e-02

Pos logit contributions
ho+0.012
 perspective+0.012
ute+0.011
ci+0.010
spe+0.010
Neg logit contributions
 chefs-0.011
 cooks-0.010
 knights-0.010
 builders-0.010
 helpers-0.010
Token her
Token ID607
Feature activation-4.33e-03

Pos logit contributions
ho+0.023
 perspective+0.020
ute+0.019
ci+0.019
 way+0.018
Neg logit contributions
 chefs-0.027
 knights-0.027
 builders-0.027
 cooks-0.027
 helpers-0.026
Token climb
Token ID12080
Feature activation+5.34e-03

Pos logit contributions
 chefs+0.007
 helpers+0.006
 cooks+0.006
 knights+0.006
 builders+0.006
Neg logit contributions
ho-0.008
 perspective-0.007
ute-0.007
ci-0.006
aman-0.006
Token the
Token ID262
Feature activation-4.06e-03

Pos logit contributions
ho+0.009
 perspective+0.009
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 builders-0.008
 chefs-0.008
 knights-0.008
 cooks-0.007
 helpers-0.007
Token and
Token ID290
Feature activation+6.65e-04

Pos logit contributions
 knights+0.005
 chefs+0.005
 builders+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.009
 perspective-0.009
ute-0.008
 way-0.008
ci-0.008
Token grabbed
Token ID13176
Feature activation+8.68e-04

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
itating+0.001
ci+0.001
Neg logit contributions
 chefs-0.001
 knights-0.001
 cooks-0.001
 builders-0.001
 helpers-0.001
Token her
Token ID607
Feature activation-1.71e-02

Pos logit contributions
ho+0.002
 perspective+0.002
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 chefs-0.001
 knights-0.001
 helpers-0.001
 builders-0.001
 cooks-0.001
Token bucket
Token ID19236
Feature activation+2.98e-02

Pos logit contributions
 chefs+0.028
 helpers+0.027
 knights+0.027
 pirates+0.027
 cooks+0.027
Neg logit contributions
ho-0.029
 perspective-0.027
ute-0.023
ci-0.023
itating-0.023
Token and
Token ID290
Feature activation-2.77e-03

Pos logit contributions
 perspective+0.064
ho+0.064
ute+0.058
ci+0.055
Me+0.055
Neg logit contributions
 chefs-0.028
 builders-0.028
 knights-0.027
 helpers-0.027
iers-0.027
Token shovel
Token ID33677
Feature activation+6.86e-02

Pos logit contributions
 chefs+0.004
 cooks+0.004
 knights+0.004
 builders+0.004
 helpers+0.004
Neg logit contributions
ho-0.005
 perspective-0.005
ute-0.005
ci-0.004
itating-0.004
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 perspective+0.122
ho+0.117
ute+0.113
 way+0.106
ci+0.105
Neg logit contributions
 chefs-0.093
iers-0.092
 helpers-0.091
 builders-0.089
 cooks-0.088
Token As
Token ID1081
Feature activation+1.09e-02

Pos logit contributions
ho+0.023
 perspective+0.021
ci+0.020
ute+0.020
 way+0.019
Neg logit contributions
 chefs-0.018
 cooks-0.018
 knights-0.018
 builders-0.018
 helpers-0.018
Token they
Token ID484
Feature activation-2.57e-02

Pos logit contributions
ute+0.019
ho+0.018
 perspective+0.018
ci+0.017
itating+0.016
Neg logit contributions
 builders-0.017
 knights-0.016
 helpers-0.016
 chefs-0.016
 cooks-0.016
Token opened
Token ID4721
Feature activation+1.38e-02

Pos logit contributions
 chefs+0.043
 knights+0.043
 builders+0.042
 helpers+0.042
 cooks+0.041
Neg logit contributions
ho-0.042
 perspective-0.036
ci-0.035
 way-0.034
ute-0.034
Token the
Token ID262
Feature activation+4.78e-04

Pos logit contributions
ho+0.034
 perspective+0.031
ute+0.031
ci+0.029
spe+0.028
Neg logit contributions
 pirates-0.012
 chefs-0.011
 knights-0.011
 helpers-0.011
 builders-0.011
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
 chefs+0.016
 builders+0.016
 knights+0.016
 cooks+0.016
 helpers+0.015
Neg logit contributions
ho-0.018
 perspective-0.016
ute-0.015
ci-0.015
 way-0.015
Token He
Token ID679
Feature activation-3.94e-03

Pos logit contributions
 chefs+0.019
 builders+0.019
 knights+0.019
 cooks+0.019
 helpers+0.018
Neg logit contributions
ho-0.043
 perspective-0.040
ute-0.039
ci-0.038
 way-0.038
Token shows
Token ID2523
Feature activation-2.66e-04

Pos logit contributions
 chefs+0.006
 knights+0.006
 cooks+0.006
 builders+0.006
 helpers+0.006
Neg logit contributions
ho-0.007
 perspective-0.006
ute-0.006
ci-0.006
 way-0.006
Token Sara
Token ID24799
Feature activation+6.80e-04

Pos logit contributions
 chefs+0.000
 knights+0.000
 builders+0.000
 cooks+0.000
 helpers+0.000
Neg logit contributions
ho-0.001
 perspective-0.000
ute-0.000
ci-0.000
 way-0.000
Token his
Token ID465
Feature activation+6.21e-03

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
aman+0.001
Neg logit contributions
 knights-0.001
 chefs-0.001
 cooks-0.001
 pirates-0.001
 helpers-0.001
Token shovel
Token ID33677
Feature activation+6.82e-02

Pos logit contributions
ho+0.010
 perspective+0.009
ci+0.009
ute+0.009
aman+0.008
Neg logit contributions
 chefs-0.010
 knights-0.010
 helpers-0.010
 builders-0.010
 cooks-0.010
Token.
Token ID13
Feature activation-7.75e-03

Pos logit contributions
 perspective+0.144
ho+0.134
ute+0.132
ci+0.129
 du+0.127
Neg logit contributions
 chefs-0.071
 builders-0.069
 cooks-0.068
 boys-0.067
 pirates-0.066
Token He
Token ID679
Feature activation-2.40e-03

Pos logit contributions
 chefs+0.009
 knights+0.008
 cooks+0.008
 builders+0.008
 helpers+0.008
Neg logit contributions
ho-0.017
 perspective-0.016
ci-0.015
ute-0.015
 way-0.014
Token has
Token ID468
Feature activation+2.65e-03

Pos logit contributions
 chefs+0.004
 cooks+0.004
 knights+0.004
 builders+0.004
 helpers+0.004
Neg logit contributions
ho-0.004
 perspective-0.004
ute-0.004
ci-0.004
 way-0.003
Token filled
Token ID5901
Feature activation+2.86e-03

Pos logit contributions
ho+0.006
 perspective+0.006
ute+0.006
ci+0.006
itating+0.006
Neg logit contributions
 knights-0.003
 chefs-0.003
 builders-0.002
 cooks-0.002
 pirates-0.002
Token it
Token ID340
Feature activation-1.32e-02

Pos logit contributions
ho+0.005
 perspective+0.005
ute+0.005
ci+0.005
 way+0.004
Neg logit contributions
 chefs-0.004
 knights-0.004
 pirates-0.004
 cooks-0.004
 builders-0.004
Token have
Token ID423
Feature activation+1.05e-02

Pos logit contributions
 builders+0.000
 knights+0.000
 chefs+0.000
 helpers+0.000
iers+0.000
Neg logit contributions
ho-0.000
 perspective-0.000
ute-0.000
ci-0.000
 way-0.000
Token the
Token ID262
Feature activation+8.29e-03

Pos logit contributions
ho+0.020
 perspective+0.018
ute+0.017
ci+0.017
 way+0.016
Neg logit contributions
 chefs-0.015
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.015
Token best
Token ID1266
Feature activation+7.91e-03

Pos logit contributions
ho+0.013
 perspective+0.012
ci+0.011
ute+0.011
 way+0.010
Neg logit contributions
 chefs-0.014
 knights-0.014
 builders-0.014
 helpers-0.014
 cooks-0.014
Token bucket
Token ID19236
Feature activation+3.37e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ci+0.012
ute+0.012
 way+0.011
Neg logit contributions
 chefs-0.013
 builders-0.013
 knights-0.013
 cooks-0.012
 helpers-0.012
Token and
Token ID290
Feature activation+3.12e-03

Pos logit contributions
ho+0.060
 perspective+0.054
ute+0.052
ci+0.052
 way+0.051
Neg logit contributions
 chefs-0.049
 knights-0.049
 builders-0.048
 cooks-0.048
 helpers-0.048
Token shovel
Token ID33677
Feature activation+6.81e-02

Pos logit contributions
ho+0.005
 perspective+0.005
ci+0.005
ute+0.005
 way+0.005
Neg logit contributions
 chefs-0.005
 builders-0.005
 knights-0.005
 cooks-0.005
 helpers-0.005
Token.
Token ID13
Feature activation+4.44e-03

Pos logit contributions
ho+0.108
 perspective+0.104
ute+0.097
ci+0.095
 way+0.093
Neg logit contributions
 chefs-0.110
 builders-0.109
 cooks-0.108
 knights-0.108
 helpers-0.107
Token I
Token ID314
Feature activation+1.42e-03

Pos logit contributions
ho+0.007
ci+0.006
ute+0.006
 perspective+0.006
 way+0.006
Neg logit contributions
 builders-0.007
 helpers-0.007
 chefs-0.007
 cooks-0.007
 knights-0.007
Token have
Token ID423
Feature activation+9.52e-03

Pos logit contributions
ho+0.002
 perspective+0.002
ci+0.002
ute+0.002
 way+0.002
Neg logit contributions
 builders-0.002
 knights-0.002
 chefs-0.002
 helpers-0.002
 cooks-0.002
Token the
Token ID262
Feature activation+9.28e-03

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.017
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.013
 knights-0.012
 builders-0.012
 cooks-0.012
 helpers-0.012
Token best
Token ID1266
Feature activation+1.27e-02

Pos logit contributions
ho+0.014
ci+0.013
 perspective+0.012
ute+0.012
aman+0.011
Neg logit contributions
 chefs-0.017
 knights-0.017
 builders-0.016
 helpers-0.016
 cooks-0.016
Token It
Token ID632
Feature activation+1.57e-02

Pos logit contributions
ho+0.008
ute+0.007
 perspective+0.007
 way+0.006
ci+0.006
Neg logit contributions
 builders-0.009
 chefs-0.009
 helpers-0.009
 cooks-0.009
 knights-0.009
Token touches
Token ID18105
Feature activation+1.43e-02

Pos logit contributions
ho+0.026
 perspective+0.025
ute+0.023
ci+0.023
 way+0.021
Neg logit contributions
 knights-0.025
 cooks-0.025
 chefs-0.025
 helpers-0.024
 builders-0.024
Token their
Token ID511
Feature activation+4.32e-03

Pos logit contributions
ho+0.028
 perspective+0.026
ute+0.025
ci+0.025
 way+0.024
Neg logit contributions
 chefs-0.018
 knights-0.018
 cooks-0.018
 builders-0.018
 helpers-0.018
Token masks
Token ID20680
Feature activation+6.84e-03

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.007
 way+0.006
Neg logit contributions
 chefs-0.006
 knights-0.006
 cooks-0.006
 helpers-0.006
 builders-0.006
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
ho+0.014
 perspective+0.014
ci+0.013
ute+0.013
aman+0.012
Neg logit contributions
 knights-0.008
 chefs-0.007
 cooks-0.007
 builders-0.007
 pirates-0.007
Token fins
Token ID42080
Feature activation+6.81e-02

Pos logit contributions
ho+0.023
 perspective+0.022
ute+0.020
ci+0.020
aman+0.019
Neg logit contributions
 knights-0.019
 cooks-0.019
 chefs-0.018
 helpers-0.018
 builders-0.018
Token with
Token ID351
Feature activation+2.60e-02

Pos logit contributions
 perspective+0.143
ho+0.142
ci+0.141
aman+0.136
ute+0.129
Neg logit contributions
 knights-0.072
 cooks-0.070
 pirates-0.068
 chefs-0.067
 spies-0.064
Token its
Token ID663
Feature activation-2.30e-03

Pos logit contributions
ho+0.041
 perspective+0.038
ci+0.038
ute+0.036
 way+0.034
Neg logit contributions
 knights-0.044
 chefs-0.043
 cooks-0.042
 pirates-0.042
 builders-0.042
Token nose
Token ID9686
Feature activation+3.03e-02

Pos logit contributions
 knights+0.003
 chefs+0.003
 cooks+0.003
 builders+0.003
 helpers+0.003
Neg logit contributions
ho-0.004
 perspective-0.004
ci-0.004
ute-0.004
 way-0.003
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
ho+0.059
 perspective+0.057
ci+0.053
ute+0.053
 way+0.051
Neg logit contributions
 chefs-0.038
 knights-0.038
 cooks-0.037
 builders-0.036
 helpers-0.036
Token It
Token ID632
Feature activation+1.14e-02

Pos logit contributions
ho+0.009
ute+0.008
 perspective+0.008
 way+0.007
 high+0.007
Neg logit contributions
 builders-0.011
 helpers-0.011
 chefs-0.011
 cooks-0.011
 knights-0.010
Token heart
Token ID2612
Feature activation+8.60e-03

Pos logit contributions
ho+0.015
 perspective+0.013
ute+0.012
ci+0.012
aman+0.012
Neg logit contributions
 knights-0.015
 chefs-0.015
 helpers-0.014
 cooks-0.014
 builders-0.014
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
ho+0.017
 perspective+0.015
ute+0.015
ci+0.015
 way+0.014
Neg logit contributions
 chefs-0.011
 knights-0.011
 cooks-0.011
 builders-0.011
 helpers-0.011
Token You
Token ID921
Feature activation-1.45e-02

Pos logit contributions
ho+0.026
ute+0.024
 perspective+0.024
ci+0.023
 way+0.022
Neg logit contributions
 chefs-0.024
 builders-0.024
 helpers-0.023
 knights-0.023
 cooks-0.023
Token should
Token ID815
Feature activation+4.73e-02

Pos logit contributions
 chefs+0.025
 knights+0.025
 builders+0.025
 helpers+0.024
iers+0.023
Neg logit contributions
ho-0.022
 perspective-0.020
 way-0.019
ute-0.019
ci-0.019
Token not
Token ID407
Feature activation+1.84e-02

Pos logit contributions
ho+0.091
 perspective+0.085
ci+0.082
ute+0.079
spe+0.078
Neg logit contributions
 cooks-0.061
 helpers-0.060
 knights-0.060
 pirates-0.060
 chefs-0.058
Token stop
Token ID2245
Feature activation+6.80e-02

Pos logit contributions
ho+0.034
ci+0.032
 perspective+0.031
ute+0.030
aman+0.030
Neg logit contributions
 cooks-0.024
 builders-0.022
 knights-0.022
 pirates-0.022
 chefs-0.022
Token singing
Token ID13777
Feature activation+1.08e-02

Pos logit contributions
ho+0.126
 perspective+0.115
ute+0.113
ci+0.111
aman+0.110
Neg logit contributions
 cooks-0.097
 pirates-0.094
 knights-0.092
 chefs-0.092
 builders-0.091
Token because
Token ID780
Feature activation+9.50e-03

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.017
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.016
 builders-0.016
 cooks-0.016
 helpers-0.016
 knights-0.016
Token of
Token ID286
Feature activation+2.15e-02

Pos logit contributions
ho+0.016
 perspective+0.015
ci+0.014
ute+0.014
 way+0.013
Neg logit contributions
 builders-0.015
 knights-0.015
 cooks-0.015
 helpers-0.015
 chefs-0.015
Token one
Token ID530
Feature activation+2.56e-02

Pos logit contributions
ho+0.039
 perspective+0.035
ute+0.034
ci+0.034
 way+0.031
Neg logit contributions
 knights-0.032
 chefs-0.031
 helpers-0.031
 cooks-0.031
 builders-0.031
Token sw
Token ID1509
Feature activation+3.67e-02

Pos logit contributions
ho+0.051
 perspective+0.047
ci+0.045
ute+0.044
aman+0.043
Neg logit contributions
 knights-0.033
 cooks-0.032
 chefs-0.032
 helpers-0.032
 builders-0.031
Token And
Token ID843
Feature activation+1.45e-02

Pos logit contributions
ho+0.008
ute+0.007
ci+0.007
 perspective+0.007
 way+0.007
Neg logit contributions
 builders-0.009
 helpers-0.009
 chefs-0.009
 cooks-0.009
 knights-0.009
Token our
Token ID674
Feature activation+1.26e-02

Pos logit contributions
ho+0.023
 perspective+0.021
ute+0.020
ci+0.020
 way+0.019
Neg logit contributions
 chefs-0.024
 knights-0.024
 builders-0.024
 cooks-0.023
 helpers-0.023
Token new
Token ID649
Feature activation+2.02e-02

Pos logit contributions
ho+0.022
ci+0.021
 perspective+0.020
ute+0.020
aman+0.019
Neg logit contributions
 knights-0.019
 chefs-0.018
 builders-0.018
 helpers-0.018
 pirates-0.018
Token bucket
Token ID19236
Feature activation+3.73e-02

Pos logit contributions
ho+0.038
ci+0.034
aman+0.034
itating+0.033
liv+0.032
Neg logit contributions
 knights-0.031
 chefs-0.031
 pirates-0.030
 builders-0.030
 helpers-0.030
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
ho+0.057
 perspective+0.050
 way+0.048
ci+0.048
ute+0.048
Neg logit contributions
 chefs-0.064
 builders-0.063
 knights-0.063
 cooks-0.062
 helpers-0.062
Token shovel
Token ID33677
Feature activation+6.80e-02

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.017
ci+0.016
aman+0.015
Neg logit contributions
 chefs-0.018
 builders-0.017
 knights-0.017
 helpers-0.017
 cooks-0.017
Token too
Token ID1165
Feature activation-8.38e-03

Pos logit contributions
ho+0.101
 perspective+0.093
ute+0.087
ci+0.086
 way+0.083
Neg logit contributions
 chefs-0.118
 builders-0.117
 knights-0.116
 cooks-0.116
 helpers-0.115
Token."
Token ID526
Feature activation+9.11e-04

Pos logit contributions
 chefs+0.013
 knights+0.012
 builders+0.012
 helpers+0.012
 cooks+0.012
Neg logit contributions
ho-0.015
 perspective-0.013
ute-0.013
 way-0.013
ci-0.012
Token
Token ID198
Feature activation+5.36e-03

Pos logit contributions
ho+0.001
ute+0.001
 perspective+0.001
 way+0.001
 high+0.001
Neg logit contributions
 builders-0.002
 cooks-0.002
 chefs-0.002
 helpers-0.002
 knights-0.002
Token
Token ID198
Feature activation-6.50e-03

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
 way+0.006
ci+0.006
Neg logit contributions
 chefs-0.010
 builders-0.010
 helpers-0.010
 cooks-0.010
 knights-0.009
TokenL
Token ID43
Feature activation-2.14e-02

Pos logit contributions
 knights+0.010
 chefs+0.010
 builders+0.010
 cooks+0.010
 helpers+0.009
Neg logit contributions
ho-0.011
 perspective-0.010
ci-0.010
ute-0.010
 way-0.010
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
 chefs+0.033
 knights+0.033
 cooks+0.033
 builders+0.033
 helpers+0.032
Neg logit contributions
ho-0.018
 perspective-0.017
 way-0.016
ute-0.015
ci-0.015
Token clean
Token ID3424
Feature activation-2.25e-02

Pos logit contributions
 chefs+0.023
 knights+0.023
 builders+0.023
 cooks+0.023
 helpers+0.022
Neg logit contributions
ho-0.026
 perspective-0.024
ute-0.023
ci-0.022
 way-0.022
Token your
Token ID534
Feature activation-2.24e-02

Pos logit contributions
 chefs+0.044
 builders+0.044
 knights+0.044
 cooks+0.044
 helpers+0.043
Neg logit contributions
ho-0.028
 perspective-0.025
ci-0.024
ute-0.024
 way-0.023
Token room
Token ID2119
Feature activation-1.30e-02

Pos logit contributions
iers+0.039
 chefs+0.038
 cooks+0.037
 builders+0.037
 knights+0.036
Neg logit contributions
 way-0.032
ci-0.029
 perspective-0.028
ute-0.028
ho-0.028
Token before
Token ID878
Feature activation-3.23e-02

Pos logit contributions
 chefs+0.019
 knights+0.019
 builders+0.019
 cooks+0.019
 helpers+0.019
Neg logit contributions
ho-0.023
 perspective-0.021
ute-0.020
ci-0.020
 way-0.019
Token you
Token ID345
Feature activation-7.10e-02

Pos logit contributions
 chefs+0.050
 knights+0.049
 builders+0.049
 cooks+0.048
 helpers+0.048
Neg logit contributions
ho-0.055
 perspective-0.050
ute-0.048
ci-0.048
 way-0.047
Token can
Token ID460
Feature activation-1.55e-02

Pos logit contributions
 chefs+0.154
 knights+0.151
 spies+0.147
 helpers+0.147
 builders+0.146
Neg logit contributions
ho-0.069
 way-0.068
ci-0.067
ute-0.067
 high-0.060
Token ride
Token ID6594
Feature activation-1.26e-02

Pos logit contributions
 chefs+0.027
 knights+0.027
 builders+0.026
 helpers+0.026
 cooks+0.026
Neg logit contributions
ho-0.023
 perspective-0.021
ute-0.021
ci-0.020
 way-0.020
Token your
Token ID534
Feature activation-2.85e-02

Pos logit contributions
 cooks+0.022
 chefs+0.022
 helpers+0.022
 builders+0.022
 knights+0.022
Neg logit contributions
ho-0.017
ci-0.016
ute-0.015
 way-0.015
 perspective-0.015
Token motorcycle
Token ID18757
Feature activation-9.60e-03

Pos logit contributions
 chefs+0.048
 cooks+0.048
 builders+0.047
iers+0.047
 knights+0.047
Neg logit contributions
ho-0.041
 way-0.039
 perspective-0.038
ci-0.037
ute-0.037
Token."
Token ID526
Feature activation-1.68e-02

Pos logit contributions
 chefs+0.016
 cooks+0.015
 builders+0.015
 helpers+0.015
 knights+0.015
Neg logit contributions
ho-0.015
ci-0.014
 way-0.014
ute-0.013
 perspective-0.013
Token,
Token ID11
Feature activation+9.20e-03

Pos logit contributions
ho+0.004
 perspective+0.004
ci+0.004
ute+0.004
 way+0.004
Neg logit contributions
 knights-0.003
 chefs-0.003
 builders-0.003
 cooks-0.003
 helpers-0.003
Token there
Token ID612
Feature activation+1.12e-02

Pos logit contributions
ho+0.016
 perspective+0.015
ci+0.015
ute+0.014
 way+0.014
Neg logit contributions
 chefs-0.014
 knights-0.014
 builders-0.013
 cooks-0.013
 pirates-0.013
Token was
Token ID373
Feature activation-1.79e-02

Pos logit contributions
ho+0.018
 perspective+0.017
ute+0.016
ci+0.016
 way+0.015
Neg logit contributions
 chefs-0.018
 knights-0.018
 cooks-0.018
 builders-0.017
 helpers-0.017
Token a
Token ID257
Feature activation+3.50e-03

Pos logit contributions
iers+0.025
 helpers+0.025
 builders+0.024
 cooks+0.024
 warriors+0.024
Neg logit contributions
ho-0.030
 perspective-0.028
 way-0.028
ute-0.027
 photo-0.027
Token place
Token ID1295
Feature activation+4.58e-03

Pos logit contributions
ho+0.006
 perspective+0.005
ute+0.005
ci+0.005
 way+0.005
Neg logit contributions
 chefs-0.005
 builders-0.005
 knights-0.005
 cooks-0.005
 helpers-0.005
Token called
Token ID1444
Feature activation-6.82e-02

Pos logit contributions
 perspective+0.010
aman+0.009
ho+0.009
heit+0.009
burn+0.009
Neg logit contributions
 builders-0.005
 chefs-0.005
 pirates-0.005
 cooks-0.005
 knights-0.005
Token Rainbow
Token ID19909
Feature activation-2.28e-02

Pos logit contributions
 chefs+0.094
 cooks+0.094
 builders+0.092
 helpers+0.092
 knights+0.089
Neg logit contributions
ho-0.116
 way-0.108
 perspective-0.105
ute-0.104
 photo-0.102
Tokenland
Token ID1044
Feature activation-3.44e-02

Pos logit contributions
 knights+0.032
 chefs+0.032
 builders+0.032
 cooks+0.032
 helpers+0.031
Neg logit contributions
ho-0.041
 perspective-0.038
ute-0.036
ci-0.036
 way-0.035
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
 chefs+0.050
 knights+0.049
 builders+0.049
 cooks+0.049
 helpers+0.048
Neg logit contributions
ho-0.062
 perspective-0.057
ute-0.054
ci-0.054
 way-0.052
Token In
Token ID554
Feature activation+1.10e-02

Pos logit contributions
 chefs+0.009
 knights+0.009
 builders+0.009
 cooks+0.009
 helpers+0.009
Neg logit contributions
ho-0.010
 perspective-0.010
ute-0.009
ci-0.009
 way-0.009
Token Rainbow
Token ID19909
Feature activation-1.40e-02

Pos logit contributions
ho+0.024
 perspective+0.023
ute+0.022
itating+0.022
ci+0.022
Neg logit contributions
 pirates-0.011
 knights-0.011
 helpers-0.010
 cooks-0.010
 builders-0.010
Token there
Token ID612
Feature activation+1.59e-02

Pos logit contributions
ho+0.024
 perspective+0.022
ci+0.021
ute+0.021
 way+0.020
Neg logit contributions
 chefs-0.021
 knights-0.021
 builders-0.020
 cooks-0.020
 helpers-0.020
Token was
Token ID373
Feature activation-1.29e-02

Pos logit contributions
ho+0.025
 perspective+0.023
ute+0.022
ci+0.022
 way+0.020
Neg logit contributions
 chefs-0.026
 cooks-0.025
 knights-0.025
 builders-0.025
 helpers-0.025
Token a
Token ID257
Feature activation+4.09e-03

Pos logit contributions
iers+0.018
 helpers+0.017
 builders+0.017
 cooks+0.017
 warriors+0.017
Neg logit contributions
ho-0.022
 perspective-0.021
 way-0.020
ute-0.020
 high-0.020
Token big
Token ID1263
Feature activation-9.68e-03

Pos logit contributions
ho+0.007
 perspective+0.006
ute+0.006
 way+0.006
ci+0.006
Neg logit contributions
 chefs-0.006
 builders-0.006
 knights-0.006
 cooks-0.006
 helpers-0.006
Token mountain
Token ID8598
Feature activation+1.65e-02

Pos logit contributions
 chefs+0.016
 builders+0.016
 cooks+0.016
 knights+0.016
iers+0.015
Neg logit contributions
ho-0.015
 perspective-0.014
ute-0.013
ci-0.013
 way-0.013
Token called
Token ID1444
Feature activation-6.79e-02

Pos logit contributions
 perspective+0.036
aman+0.035
 wrists+0.035
ho+0.034
heit+0.033
Neg logit contributions
 knights-0.015
 pirates-0.014
 builders-0.014
 cooks-0.013
 helpers-0.012
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
 cooks+0.093
 chefs+0.093
 helpers+0.089
 builders+0.088
 knights+0.088
Neg logit contributions
ho-0.119
 perspective-0.109
 way-0.107
ute-0.106
ci-0.101
Token volcano
Token ID32410
Feature activation-6.78e-03

Pos logit contributions
 chefs+0.041
 knights+0.041
 builders+0.040
 cooks+0.040
 helpers+0.040
Neg logit contributions
ho-0.037
 perspective-0.033
ute-0.032
 way-0.031
ci-0.030
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
 cooks+0.008
 knights+0.008
 builders+0.008
 chefs+0.008
 helpers+0.008
Neg logit contributions
ho-0.013
 perspective-0.013
ute-0.012
ci-0.012
aman-0.012
Token The
Token ID383
Feature activation-2.70e-02

Pos logit contributions
ier+0.037
 chefs+0.035
 helpers+0.035
 spies+0.033
 cooks+0.033
Neg logit contributions
 high-0.019
 perspective-0.019
ho-0.018
 way-0.018
 stair-0.017
Token volcano
Token ID32410
Feature activation+3.82e-03

Pos logit contributions
iers+0.054
paste+0.053
 chefs+0.053
 spies+0.051
 knights+0.051
Neg logit contributions
ho-0.032
 way-0.030
 perspective-0.028
ute-0.025
 high-0.025
Token,
Token ID11
Feature activation-5.37e-03

Pos logit contributions
 chefs+0.011
 knights+0.010
 builders+0.010
 cooks+0.010
 helpers+0.010
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.010
ci-0.010
 way-0.010
Token "
Token ID366
Feature activation+1.89e-02

Pos logit contributions
 chefs+0.008
 knights+0.008
 cooks+0.008
 builders+0.008
 helpers+0.008
Neg logit contributions
ho-0.009
 perspective-0.009
ci-0.008
ute-0.008
 way-0.008
TokenLook
Token ID8567
Feature activation+4.52e-03

Pos logit contributions
ho+0.033
 perspective+0.029
ute+0.029
 way+0.029
ci+0.028
Neg logit contributions
 chefs-0.029
 builders-0.029
 cooks-0.029
 knights-0.028
iers-0.027
Token how
Token ID703
Feature activation+1.27e-02

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.007
 way+0.006
Neg logit contributions
 chefs-0.007
 knights-0.007
 builders-0.007
 cooks-0.007
 helpers-0.007
Token clever
Token ID14169
Feature activation+5.95e-03

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.016
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.022
 knights-0.022
 builders-0.022
 cooks-0.022
 helpers-0.022
Token you
Token ID345
Feature activation-6.72e-02

Pos logit contributions
ho+0.010
 perspective+0.009
ci+0.009
ute+0.009
 way+0.008
Neg logit contributions
 chefs-0.009
 builders-0.009
 knights-0.009
 cooks-0.009
 helpers-0.009
Token are
Token ID389
Feature activation-5.17e-02

Pos logit contributions
 footsteps+0.133
 spies+0.126
 knights+0.126
paste+0.124
 doctors+0.122
Neg logit contributions
 way-0.079
 would-0.068
ho-0.067
 high-0.064
 lot-0.062
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 knights+0.091
 footsteps+0.091
 spies+0.087
 chefs+0.087
iers+0.086
Neg logit contributions
 way-0.064
ho-0.062
 perspective-0.056
ute-0.055
ci-0.051
Token you
Token ID345
Feature activation-3.62e-02

Pos logit contributions
ho+0.020
 perspective+0.018
ute+0.018
ci+0.017
 way+0.017
Neg logit contributions
 chefs-0.017
 builders-0.017
 knights-0.017
 cooks-0.017
 helpers-0.016
Token can
Token ID460
Feature activation-1.15e-03

Pos logit contributions
 knights+0.062
 chefs+0.062
iddles+0.061
iers+0.061
 builders+0.061
Neg logit contributions
 way-0.048
ho-0.042
 perspective-0.040
ute-0.039
 lot-0.039
Token put
Token ID1234
Feature activation-6.21e-03

Pos logit contributions
 chefs+0.002
 knights+0.002
 helpers+0.002
 builders+0.002
 spies+0.002
Neg logit contributions
ho-0.002
ute-0.002
ci-0.002
 way-0.002
 perspective-0.002
Token was
Token ID373
Feature activation-2.35e-02

Pos logit contributions
 builders+0.012
 cooks+0.012
 knights+0.012
 helpers+0.012
 chefs+0.012
Neg logit contributions
ho-0.014
 perspective-0.014
ute-0.012
ci-0.012
aman-0.012
Token special
Token ID2041
Feature activation-5.35e-02

Pos logit contributions
 chefs+0.043
 knights+0.042
 cooks+0.042
 builders+0.042
 helpers+0.041
Neg logit contributions
ho-0.033
 perspective-0.030
ute-0.028
 way-0.028
ci-0.028
Token because
Token ID780
Feature activation-2.46e-02

Pos logit contributions
iers+0.093
 knights+0.092
 cooks+0.090
 chefs+0.089
 builders+0.089
Neg logit contributions
ho-0.071
 way-0.068
 perspective-0.067
ci-0.063
ute-0.062
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
 chefs+0.045
 knights+0.044
 builders+0.044
 cooks+0.044
 helpers+0.044
Neg logit contributions
ho-0.035
 perspective-0.031
ute-0.029
ci-0.029
 way-0.028
Token man
Token ID582
Feature activation-2.25e-02

Pos logit contributions
 chefs+0.023
 builders+0.023
 knights+0.022
 helpers+0.022
 cooks+0.022
Neg logit contributions
ho-0.021
 perspective-0.019
ci-0.018
ute-0.018
aman-0.016
Token named
Token ID3706
Feature activation-6.71e-02

Pos logit contributions
 builders+0.039
 chefs+0.039
 knights+0.038
 helpers+0.038
 cooks+0.037
Neg logit contributions
ho-0.034
ci-0.030
 perspective-0.029
 way-0.028
ute-0.028
Token Bob
Token ID5811
Feature activation-1.13e-02

Pos logit contributions
 chefs+0.095
 cooks+0.093
 helpers+0.089
 knights+0.087
paste+0.085
Neg logit contributions
ho-0.113
 way-0.108
 perspective-0.102
ci-0.101
ute-0.097
Token designed
Token ID3562
Feature activation-3.37e-02

Pos logit contributions
 builders+0.019
 chefs+0.019
 knights+0.019
 helpers+0.018
 cooks+0.018
Neg logit contributions
ho-0.018
ci-0.015
 perspective-0.015
ute-0.014
 way-0.014
Token it
Token ID340
Feature activation-4.03e-02

Pos logit contributions
 chefs+0.054
 cooks+0.054
 builders+0.052
 helpers+0.052
 knights+0.051
Neg logit contributions
 way-0.048
ho-0.047
 perspective-0.045
ute-0.044
 high-0.043
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 chefs+0.067
 cooks+0.066
 builders+0.065
 knights+0.065
 helpers+0.065
Neg logit contributions
ho-0.063
 perspective-0.056
ute-0.055
 way-0.054
ci-0.054
Token Bob
Token ID5811
Feature activation+2.31e-04

Pos logit contributions
 chefs+0.019
 builders+0.019
 knights+0.019
 helpers+0.019
 cooks+0.019
Neg logit contributions
ho-0.019
 perspective-0.017
 way-0.017
ute-0.016
ci-0.016
Token can
Token ID460
Feature activation-3.32e-03

Pos logit contributions
 chefs+0.082
 knights+0.079
 builders+0.077
 spies+0.076
 helpers+0.076
Neg logit contributions
ho-0.055
 way-0.052
 perspective-0.049
ci-0.047
ute-0.046
Token solve
Token ID8494
Feature activation-5.27e-03

Pos logit contributions
 chefs+0.005
 knights+0.005
 helpers+0.005
 cooks+0.005
 builders+0.005
Neg logit contributions
ho-0.005
 perspective-0.005
 way-0.005
ute-0.005
ci-0.005
Token any
Token ID597
Feature activation-1.07e-02

Pos logit contributions
 chefs+0.008
 cooks+0.008
 builders+0.008
 knights+0.008
 helpers+0.008
Neg logit contributions
ho-0.009
 perspective-0.008
ute-0.008
ci-0.008
 way-0.008
Token problem
Token ID1917
Feature activation-8.45e-03

Pos logit contributions
 chefs+0.020
 cooks+0.020
 knights+0.020
 builders+0.020
 helpers+0.019
Neg logit contributions
ho-0.014
 perspective-0.013
 way-0.012
ute-0.012
ci-0.012
Token if
Token ID611
Feature activation-6.68e-03

Pos logit contributions
 chefs+0.014
 cooks+0.013
 knights+0.013
 helpers+0.013
 builders+0.013
Neg logit contributions
ho-0.012
 perspective-0.012
 way-0.011
ute-0.011
ci-0.011
Token you
Token ID345
Feature activation-6.70e-02

Pos logit contributions
 chefs+0.010
 knights+0.010
 builders+0.010
 cooks+0.010
 helpers+0.010
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.010
ci-0.010
 way-0.010
Token try
Token ID1949
Feature activation-3.85e-03

Pos logit contributions
 chefs+0.120
 knights+0.112
 spies+0.112
 builders+0.110
 helpers+0.109
Neg logit contributions
ho-0.095
 way-0.090
 perspective-0.087
ci-0.081
ute-0.079
Token your
Token ID534
Feature activation-3.48e-02

Pos logit contributions
 chefs+0.006
 knights+0.006
 builders+0.006
 cooks+0.006
 helpers+0.006
Neg logit contributions
ho-0.006
 perspective-0.006
ute-0.005
ci-0.005
 way-0.005
Token best
Token ID1266
Feature activation+2.73e-03

Pos logit contributions
 chefs+0.066
 builders+0.065
 cooks+0.065
 knights+0.064
 helpers+0.063
Neg logit contributions
ho-0.044
 perspective-0.041
 way-0.040
ute-0.039
ci-0.037
Token."
Token ID526
Feature activation+3.12e-03

Pos logit contributions
ho+0.004
 perspective+0.004
ute+0.004
 way+0.004
ci+0.004
Neg logit contributions
 chefs-0.004
 cooks-0.004
 knights-0.004
 builders-0.004
 helpers-0.004
Token
Token ID198
Feature activation-2.69e-03

Pos logit contributions
ho+0.004
 perspective+0.003
ci+0.003
ute+0.003
 way+0.003
Neg logit contributions
 chefs-0.006
 builders-0.006
 knights-0.006
 cooks-0.006
 helpers-0.006
Token lived
Token ID5615
Feature activation-1.37e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ute+0.012
aman+0.011
ci+0.011
Neg logit contributions
 knights-0.011
 cooks-0.011
 chefs-0.010
 builders-0.010
 pirates-0.010
Token in
Token ID287
Feature activation-2.08e-02

Pos logit contributions
 chefs+0.023
 knights+0.023
 builders+0.022
 cooks+0.022
 helpers+0.022
Neg logit contributions
ho-0.022
 perspective-0.020
ute-0.019
ci-0.018
 way-0.018
Token a
Token ID257
Feature activation+1.95e-02

Pos logit contributions
 chefs+0.030
 builders+0.030
 cooks+0.029
 helpers+0.029
iers+0.029
Neg logit contributions
 way-0.034
ho-0.033
 perspective-0.031
 high-0.031
ci-0.030
Token big
Token ID1263
Feature activation+1.42e-02

Pos logit contributions
ho+0.034
 perspective+0.031
ci+0.030
ute+0.028
spe+0.027
Neg logit contributions
 knights-0.032
 chefs-0.030
 pirates-0.030
 cooks-0.030
 builders-0.029
Token house
Token ID2156
Feature activation-1.51e-02

Pos logit contributions
ho+0.023
 perspective+0.020
ute+0.019
ci+0.019
itating+0.018
Neg logit contributions
 knights-0.025
 builders-0.024
 chefs-0.024
 pirates-0.023
 cooks-0.023
Token called
Token ID1444
Feature activation-6.70e-02

Pos logit contributions
 knights+0.019
 builders+0.018
 cooks+0.018
 chefs+0.018
 pirates+0.018
Neg logit contributions
ho-0.030
 perspective-0.029
aman-0.028
ute-0.028
ci-0.026
Token a
Token ID257
Feature activation-1.36e-02

Pos logit contributions
 cooks+0.104
 chefs+0.103
 builders+0.101
 helpers+0.099
 knights+0.097
Neg logit contributions
ho-0.110
ute-0.096
 perspective-0.095
 way-0.094
ci-0.090
Token stable
Token ID8245
Feature activation-1.43e-02

Pos logit contributions
 chefs+0.024
 knights+0.024
 builders+0.024
 cooks+0.024
 helpers+0.024
Neg logit contributions
ho-0.020
 perspective-0.018
ute-0.017
ci-0.017
 way-0.016
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
 knights+0.020
 chefs+0.020
 builders+0.020
 helpers+0.019
 cooks+0.019
Neg logit contributions
ho-0.026
 perspective-0.025
ute-0.023
ci-0.023
 way-0.022
Token The
Token ID383
Feature activation-1.65e-02

Pos logit contributions
 chefs+0.046
 cooks+0.045
 builders+0.044
 helpers+0.044
 spies+0.043
Neg logit contributions
ho-0.038
 way-0.037
 perspective-0.035
ci-0.034
ute-0.032
Token stable
Token ID8245
Feature activation-2.79e-03

Pos logit contributions
 chefs+0.028
 knights+0.028
 builders+0.028
 cooks+0.028
 helpers+0.028
Neg logit contributions
ho-0.025
 perspective-0.023
ute-0.022
ci-0.022
 way-0.021
Token,
Token ID11
Feature activation+8.66e-03

Pos logit contributions
ho+0.004
ci+0.003
 perspective+0.003
ute+0.003
aman+0.003
Neg logit contributions
 knights-0.002
 pirates-0.002
 chefs-0.002
 builders-0.002
 cooks-0.002
Token and
Token ID290
Feature activation-3.55e-03

Pos logit contributions
ho+0.017
 perspective+0.017
ute+0.016
ci+0.016
aman+0.015
Neg logit contributions
 knights-0.011
 chefs-0.011
 pirates-0.010
 builders-0.010
 spies-0.010
Token they
Token ID484
Feature activation-7.37e-03

Pos logit contributions
 knights+0.005
 chefs+0.005
 builders+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.007
 perspective-0.006
ute-0.006
ci-0.006
aman-0.005
Token lived
Token ID5615
Feature activation-1.36e-02

Pos logit contributions
 knights+0.010
 builders+0.010
 pirates+0.009
 cooks+0.009
 chefs+0.009
Neg logit contributions
ho-0.013
 perspective-0.013
ute-0.013
aman-0.012
ci-0.012
Token happily
Token ID18177
Feature activation-4.41e-02

Pos logit contributions
 chefs+0.023
 cooks+0.023
 builders+0.023
 helpers+0.023
 knights+0.023
Neg logit contributions
ho-0.020
 perspective-0.018
ute-0.017
 way-0.016
ci-0.016
Token ever
Token ID1683
Feature activation-6.56e-02

Pos logit contributions
 chefs+0.081
 cooks+0.078
iers+0.077
 builders+0.077
 helpers+0.076
Neg logit contributions
ho-0.064
 perspective-0.051
ci-0.050
 way-0.048
ute-0.047
Token after
Token ID706
Feature activation-2.95e-02

Pos logit contributions
iers+0.117
 chefs+0.114
 helpers+0.112
 cooks+0.109
 builders+0.109
Neg logit contributions
ho-0.102
 perspective-0.088
 way-0.085
ci-0.077
 photo-0.073
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 chefs+0.043
 knights+0.043
 builders+0.043
 cooks+0.043
 helpers+0.042
Neg logit contributions
ho-0.052
 perspective-0.048
ute-0.046
ci-0.046
 way-0.045
Token<|endoftext|>
Token ID50256
Feature activation-2.55e-02

Pos logit contributions
 chefs+0.020
 knights+0.020
 builders+0.020
 cooks+0.019
 helpers+0.019
Neg logit contributions
ho-0.029
 perspective-0.027
ute-0.026
ci-0.026
 way-0.025
TokenOnce
Token ID7454
Feature activation+3.99e-02

Pos logit contributions
 builders+0.051
 helpers+0.051
 chefs+0.051
 cooks+0.049
 knights+0.049
Neg logit contributions
ho-0.030
 way-0.027
 perspective-0.027
ute-0.026
ci-0.024
Token upon
Token ID2402
Feature activation-2.89e-02

Pos logit contributions
ho+0.071
ci+0.071
 wrists+0.070
 perspective+0.069
ute+0.063
Neg logit contributions
 knights-0.050
 boys-0.050
 helpers-0.050
 builders-0.049
 pirates-0.047
Token he
Token ID339
Feature activation-2.02e-02

Pos logit contributions
 chefs+0.017
 cooks+0.017
 builders+0.017
 knights+0.016
 helpers+0.016
Neg logit contributions
ho-0.018
 perspective-0.015
 way-0.015
ci-0.015
ute-0.015
Token didn
Token ID1422
Feature activation-1.18e-02

Pos logit contributions
 chefs+0.039
 builders+0.039
 knights+0.038
iers+0.037
 cooks+0.037
Neg logit contributions
ho-0.028
 way-0.023
ci-0.022
 perspective-0.022
ute-0.020
Token't
Token ID470
Feature activation-2.84e-02

Pos logit contributions
 chefs+0.022
 knights+0.022
 builders+0.021
 cooks+0.021
 helpers+0.021
Neg logit contributions
ho-0.017
 perspective-0.015
ute-0.014
ci-0.014
 way-0.013
Token know
Token ID760
Feature activation-1.27e-02

Pos logit contributions
 chefs+0.044
 knights+0.042
 cooks+0.041
 builders+0.041
 helpers+0.040
Neg logit contributions
 way-0.047
ho-0.044
 perspective-0.043
 high-0.040
ute-0.039
Token what
Token ID644
Feature activation-1.54e-02

Pos logit contributions
 chefs+0.016
 knights+0.016
 builders+0.015
 helpers+0.015
 cooks+0.015
Neg logit contributions
ho-0.026
 perspective-0.024
ute-0.023
ci-0.023
 way-0.022
Token to
Token ID284
Feature activation-6.56e-02

Pos logit contributions
 chefs+0.025
 knights+0.025
 builders+0.025
 cooks+0.025
 helpers+0.025
Neg logit contributions
ho-0.025
 perspective-0.023
ute-0.021
ci-0.021
 way-0.020
Token count
Token ID954
Feature activation+7.40e-03

Pos logit contributions
 chefs+0.112
iers+0.110
paste+0.107
 knights+0.106
 warriors+0.105
Neg logit contributions
 way-0.095
ho-0.094
 perspective-0.085
 high-0.078
 angle-0.074
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
ho+0.013
 perspective+0.012
ute+0.011
ci+0.011
 way+0.011
Neg logit contributions
 chefs-0.011
 cooks-0.011
 builders-0.011
 knights-0.011
 helpers-0.011
Token Then
Token ID3244
Feature activation-2.77e-02

Pos logit contributions
 chefs+0.025
 cooks+0.023
 builders+0.023
 helpers+0.023
 spies+0.023
Neg logit contributions
 way-0.028
ho-0.028
 perspective-0.027
ci-0.025
ute-0.024
Token he
Token ID339
Feature activation-3.40e-02

Pos logit contributions
 chefs+0.052
 cooks+0.051
 builders+0.050
 helpers+0.049
 knights+0.049
Neg logit contributions
ho-0.038
 way-0.034
 perspective-0.033
ci-0.031
ute-0.029
Token had
Token ID550
Feature activation-1.24e-02

Pos logit contributions
 chefs+0.061
 builders+0.059
iers+0.059
 helpers+0.058
 cooks+0.058
Neg logit contributions
ho-0.049
 way-0.043
ci-0.040
 perspective-0.038
 often-0.035
Token door
Token ID3420
Feature activation-2.70e-02

Pos logit contributions
 chefs+0.014
 knights+0.014
 builders+0.014
 cooks+0.014
 helpers+0.013
Neg logit contributions
ho-0.013
 perspective-0.012
ute-0.011
ci-0.011
 way-0.011
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
 builders+0.033
 cooks+0.032
 knights+0.031
 helpers+0.031
 chefs+0.031
Neg logit contributions
ho-0.054
 perspective-0.053
ute-0.048
ci-0.047
aman-0.046
Token A
Token ID317
Feature activation-2.44e-02

Pos logit contributions
ier+0.051
 chefs+0.048
 helpers+0.045
 spies+0.045
ps+0.045
Neg logit contributions
 high-0.028
 way-0.028
 -0.027
Every-0.027
ho-0.026
Token little
Token ID1310
Feature activation-3.03e-02

Pos logit contributions
 chefs+0.040
 knights+0.040
 builders+0.040
 cooks+0.040
 helpers+0.039
Neg logit contributions
ho-0.038
 perspective-0.034
 way-0.033
ute-0.033
ci-0.032
Token boy
Token ID2933
Feature activation-1.05e-02

Pos logit contributions
 chefs+0.051
 cooks+0.051
 builders+0.050
 knights+0.050
 helpers+0.049
Neg logit contributions
ho-0.046
 perspective-0.041
 way-0.040
ci-0.039
ute-0.039
Token named
Token ID3706
Feature activation-6.52e-02

Pos logit contributions
 chefs+0.019
 builders+0.019
 knights+0.019
 helpers+0.019
 cooks+0.019
Neg logit contributions
ho-0.015
ci-0.013
 perspective-0.013
ute-0.012
 way-0.012
Token Tim
Token ID5045
Feature activation+3.92e-04

Pos logit contributions
paste+0.104
ier+0.098
 cooks+0.097
bread+0.097
iers+0.096
Neg logit contributions
 way-0.095
ho-0.095
 high-0.077
spe-0.074
ci-0.073
Token lived
Token ID5615
Feature activation-2.61e-02

Pos logit contributions
ho+0.001
 way+0.000
ci+0.000
 perspective+0.000
ute+0.000
Neg logit contributions
 builders-0.001
 chefs-0.001
 helpers-0.001
 knights-0.001
iers-0.001
Token in
Token ID287
Feature activation-3.21e-02

Pos logit contributions
 chefs+0.042
 builders+0.042
 cooks+0.042
 knights+0.042
 helpers+0.042
Neg logit contributions
ho-0.040
 perspective-0.035
ute-0.035
ci-0.034
 way-0.033
Token this
Token ID428
Feature activation-9.07e-03

Pos logit contributions
 chefs+0.055
 cooks+0.054
 builders+0.054
iers+0.053
 knights+0.053
Neg logit contributions
 way-0.045
ho-0.043
 perspective-0.040
ci-0.039
 high-0.039
Token room
Token ID2119
Feature activation-6.37e-03

Pos logit contributions
 chefs+0.012
 cooks+0.011
 knights+0.011
 helpers+0.011
 builders+0.011
Neg logit contributions
ho-0.019
 perspective-0.017
ci-0.016
ute-0.015
 wrists-0.015
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 knights+0.006
 builders+0.006
 chefs+0.006
 pirates+0.006
 cooks+0.006
Neg logit contributions
ho-0.011
 perspective-0.010
ute-0.010
ci-0.010
aman-0.010
Token And
Token ID843
Feature activation-1.22e-02

Pos logit contributions
 chefs+0.016
 builders+0.016
 knights+0.016
 cooks+0.016
 helpers+0.016
Neg logit contributions
ho-0.027
 perspective-0.024
ci-0.023
ute-0.023
 way-0.023
Token they
Token ID484
Feature activation-1.53e-02

Pos logit contributions
 chefs+0.020
 knights+0.020
 builders+0.020
 helpers+0.020
 cooks+0.020
Neg logit contributions
ho-0.018
 perspective-0.018
ute-0.017
ci-0.016
 way-0.015
Token lived
Token ID5615
Feature activation-1.08e-02

Pos logit contributions
 chefs+0.028
 builders+0.027
 cooks+0.027
 knights+0.027
 helpers+0.027
Neg logit contributions
ho-0.022
 perspective-0.020
ci-0.019
ute-0.019
 way-0.018
Token happily
Token ID18177
Feature activation-4.53e-02

Pos logit contributions
 cooks+0.019
 builders+0.019
 chefs+0.019
iers+0.019
 helpers+0.019
Neg logit contributions
ho-0.015
 perspective-0.013
ute-0.013
 way-0.012
ci-0.012
Token ever
Token ID1683
Feature activation-6.52e-02

Pos logit contributions
 chefs+0.089
iers+0.086
renches+0.086
 cooks+0.084
iddles+0.084
Neg logit contributions
ho-0.060
ci-0.046
 perspective-0.045
 way-0.041
ute-0.040
Token after
Token ID706
Feature activation-2.54e-02

Pos logit contributions
iers+0.122
 chefs+0.117
 helpers+0.115
 cooks+0.111
 knights+0.110
Neg logit contributions
ho-0.098
 perspective-0.083
 way-0.079
ci-0.072
 photo-0.069
Token.
Token ID13
Feature activation-6.77e-03

Pos logit contributions
 chefs+0.042
iers+0.042
 builders+0.042
 cooks+0.041
 knights+0.041
Neg logit contributions
ho-0.038
 perspective-0.036
 way-0.034
ute-0.033
ci-0.033
Token<|endoftext|>
Token ID50256
Feature activation-1.70e-02

Pos logit contributions
 chefs+0.011
 builders+0.011
 knights+0.011
 cooks+0.011
 helpers+0.010
Neg logit contributions
ho-0.011
 perspective-0.010
ci-0.010
ute-0.010
 way-0.010
TokenOne
Token ID3198
Feature activation+1.51e-02

Pos logit contributions
 builders+0.033
 chefs+0.033
 helpers+0.033
 cooks+0.032
 knights+0.032
Neg logit contributions
ho-0.021
 perspective-0.019
 way-0.019
ute-0.018
ci-0.017
Token sunny
Token ID27737
Feature activation-3.34e-03

Pos logit contributions
ho+0.016
 perspective+0.013
ute+0.012
ci+0.012
 way+0.010
Neg logit contributions
 knights-0.033
 chefs-0.033
 cooks-0.033
 builders-0.033
 helpers-0.033
Token need
Token ID761
Feature activation-1.46e-02

Pos logit contributions
 chefs+0.048
 knights+0.048
 builders+0.048
iers+0.047
 helpers+0.046
Neg logit contributions
ho-0.040
 way-0.037
 perspective-0.035
ute-0.035
ci-0.034
Token to
Token ID284
Feature activation-1.14e-02

Pos logit contributions
 chefs+0.029
 knights+0.029
 builders+0.029
 cooks+0.029
 helpers+0.028
Neg logit contributions
ho-0.018
 perspective-0.016
ute-0.015
 way-0.015
ci-0.015
Token be
Token ID307
Feature activation-2.45e-02

Pos logit contributions
 chefs+0.017
 knights+0.017
 builders+0.017
 cooks+0.017
 helpers+0.016
Neg logit contributions
ho-0.019
 perspective-0.018
ute-0.017
 way-0.017
ci-0.017
Token ready
Token ID3492
Feature activation-2.00e-03

Pos logit contributions
 cooks+0.044
iers+0.044
 knights+0.044
 builders+0.044
 footsteps+0.044
Neg logit contributions
 way-0.028
 high-0.025
ho-0.025
 perspective-0.024
ute-0.024
Token before
Token ID878
Feature activation-2.24e-02

Pos logit contributions
 chefs+0.003
 builders+0.003
 knights+0.003
 cooks+0.003
 helpers+0.003
Neg logit contributions
ho-0.003
 perspective-0.003
 way-0.003
ute-0.003
ci-0.003
Token you
Token ID345
Feature activation-6.49e-02

Pos logit contributions
 chefs+0.034
 knights+0.033
 builders+0.033
 cooks+0.033
 helpers+0.032
Neg logit contributions
ho-0.039
 perspective-0.036
ute-0.034
ci-0.034
 way-0.033
Token go
Token ID467
Feature activation-1.07e-02

Pos logit contributions
 chefs+0.132
iers+0.130
 knights+0.128
 builders+0.127
 spies+0.127
Neg logit contributions
ho-0.069
 way-0.065
ute-0.059
 high-0.056
ci-0.056
Token play
Token ID711
Feature activation-2.88e-02

Pos logit contributions
 chefs+0.020
 knights+0.020
 builders+0.020
 helpers+0.019
 cooks+0.019
Neg logit contributions
ho-0.014
 perspective-0.012
ute-0.012
 way-0.012
ci-0.012
Token."
Token ID526
Feature activation-1.46e-02

Pos logit contributions
iers+0.052
 builders+0.051
 cooks+0.051
 helpers+0.051
 chefs+0.048
Neg logit contributions
ute-0.036
 way-0.032
ho-0.031
 perspective-0.031
ci-0.031
Token
Token ID198
Feature activation+5.62e-03

Pos logit contributions
 builders+0.024
 chefs+0.024
 knights+0.024
 helpers+0.024
 cooks+0.023
Neg logit contributions
ho-0.024
 perspective-0.021
ute-0.020
ci-0.020
 way-0.019
Token
Token ID198
Feature activation-2.43e-02

Pos logit contributions
ho+0.011
 perspective+0.010
ci+0.010
ute+0.010
 way+0.010
Neg logit contributions
 chefs-0.007
 knights-0.007
 cooks-0.007
 helpers-0.006
 builders-0.006
TokenBut
Token ID1537
Feature activation-2.66e-02

Pos logit contributions
 chefs+0.055
 knights+0.054
 cooks+0.054
 builders+0.053
 helpers+0.052
Neg logit contributions
ho-0.055
 perspective-0.052
ci-0.052
ute-0.049
 wrists-0.045
Token then
Token ID788
Feature activation-3.26e-02

Pos logit contributions
 chefs+0.032
 knights+0.032
 builders+0.032
 cooks+0.031
 helpers+0.031
Neg logit contributions
ho-0.054
 perspective-0.050
ute-0.048
ci-0.048
 way-0.047
Token something
Token ID1223
Feature activation-1.85e-02

Pos logit contributions
 chefs+0.060
 builders+0.060
iers+0.060
 cooks+0.060
 helpers+0.059
Neg logit contributions
ho-0.045
 way-0.037
 perspective-0.037
ci-0.033
ute-0.033
Token strange
Token ID6283
Feature activation-3.83e-02

Pos logit contributions
 chefs+0.033
 builders+0.032
 cooks+0.032
iers+0.032
 knights+0.032
Neg logit contributions
ho-0.027
 way-0.025
 perspective-0.023
ute-0.023
 high-0.022
Token happened
Token ID3022
Feature activation-1.02e-02

Pos logit contributions
 builders+0.077
 knights+0.077
 chefs+0.077
iers+0.075
 helpers+0.074
Neg logit contributions
ho-0.050
 way-0.041
ci-0.039
 would-0.036
ute-0.036
Token.
Token ID13
Feature activation-6.49e-02

Pos logit contributions
 chefs+0.013
 knights+0.013
 builders+0.012
 cooks+0.012
 helpers+0.012
Neg logit contributions
ho-0.020
 perspective-0.019
ute-0.018
ci-0.018
 way-0.017
Token He
Token ID679
Feature activation-2.36e-02

Pos logit contributions
ieve+0.108
ier+0.108
iers+0.107
 helpers+0.106
 chefs+0.105
Neg logit contributions
ho-0.096
 high-0.086
 way-0.080
 perspective-0.076
 lot-0.076
Token looked
Token ID3114
Feature activation+6.16e-03

Pos logit contributions
 chefs+0.046
 builders+0.045
iers+0.045
 helpers+0.045
 knights+0.043
Neg logit contributions
ho-0.032
 way-0.027
ci-0.026
 perspective-0.025
ute-0.023
Token out
Token ID503
Feature activation+1.47e-02

Pos logit contributions
ho+0.009
 perspective+0.009
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 chefs-0.011
 knights-0.010
 builders-0.010
 cooks-0.010
 helpers-0.010
Token the
Token ID262
Feature activation-7.88e-03

Pos logit contributions
ho+0.027
 perspective+0.025
ute+0.024
ci+0.024
 way+0.023
Neg logit contributions
 chefs-0.020
 cooks-0.020
 builders-0.020
 knights-0.020
 helpers-0.020
Token window
Token ID4324
Feature activation+5.91e-03

Pos logit contributions
 chefs+0.012
 knights+0.012
 builders+0.012
 cooks+0.012
 helpers+0.011
Neg logit contributions
ho-0.014
 perspective-0.013
ci-0.012
ute-0.012
 way-0.011
Token he
Token ID339
Feature activation-1.74e-02

Pos logit contributions
 chefs+0.062
 knights+0.061
 builders+0.061
 cooks+0.061
 helpers+0.060
Neg logit contributions
ho-0.065
 perspective-0.060
ute-0.057
ci-0.056
 way-0.055
Token reached
Token ID4251
Feature activation-1.14e-02

Pos logit contributions
 chefs+0.031
 builders+0.030
 helpers+0.030
iers+0.030
 knights+0.030
Neg logit contributions
ho-0.027
 way-0.024
 perspective-0.022
ci-0.020
yt-0.020
Token a
Token ID257
Feature activation-7.46e-04

Pos logit contributions
 chefs+0.016
 builders+0.016
 cooks+0.016
 helpers+0.015
iers+0.015
Neg logit contributions
ho-0.020
 way-0.019
 perspective-0.019
ute-0.018
ci-0.018
Token tall
Token ID7331
Feature activation-2.03e-02

Pos logit contributions
 chefs+0.001
 knights+0.001
 builders+0.001
 cooks+0.001
 helpers+0.001
Neg logit contributions
ho-0.001
 perspective-0.001
ci-0.001
ute-0.001
 way-0.001
Token hill
Token ID12788
Feature activation+1.12e-02

Pos logit contributions
 chefs+0.040
 builders+0.040
 knights+0.040
 cooks+0.040
 helpers+0.039
Neg logit contributions
ho-0.025
 perspective-0.023
ute-0.021
ci-0.021
 way-0.020
Token.
Token ID13
Feature activation-6.49e-02

Pos logit contributions
ho+0.022
ute+0.022
 perspective+0.022
ci+0.022
 wrists+0.021
Neg logit contributions
 builders-0.012
 knights-0.012
 cooks-0.011
 pirates-0.011
 chefs-0.011
Token Wil
Token ID5187
Feature activation+3.43e-03

Pos logit contributions
 chefs+0.095
 knights+0.092
 cooks+0.090
 helpers+0.089
 spies+0.088
Neg logit contributions
ho-0.118
 way-0.104
 perspective-0.103
ute-0.096
 high-0.093
Tokenf
Token ID69
Feature activation-2.07e-02

Pos logit contributions
ho+0.005
 perspective+0.005
ute+0.005
 way+0.004
ci+0.004
Neg logit contributions
 cooks-0.006
 chefs-0.006
 helpers-0.006
 builders-0.006
 knights-0.005
Token wanted
Token ID2227
Feature activation-2.29e-02

Pos logit contributions
 chefs+0.036
 helpers+0.033
 cooks+0.033
 spies+0.032
 pirates+0.031
Neg logit contributions
ho-0.034
 perspective-0.030
ute-0.029
 way-0.026
yt-0.025
Token to
Token ID284
Feature activation-1.92e-02

Pos logit contributions
 chefs+0.043
 builders+0.042
 knights+0.042
 cooks+0.042
 helpers+0.042
Neg logit contributions
ho-0.032
 perspective-0.029
ute-0.028
ci-0.027
 way-0.025
Token go
Token ID467
Feature activation-2.09e-02

Pos logit contributions
 chefs+0.032
 builders+0.031
 knights+0.031
 cooks+0.031
 helpers+0.031
Neg logit contributions
ho-0.031
 perspective-0.028
ute-0.027
ci-0.027
 way-0.025
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
 chefs+0.039
 cooks+0.038
 knights+0.038
 helpers+0.038
 builders+0.038
Neg logit contributions
ho-0.031
 perspective-0.029
ute-0.029
ci-0.028
 way-0.028
Token box
Token ID3091
Feature activation-4.98e-03

Pos logit contributions
 chefs+0.026
 knights+0.025
 cooks+0.025
 builders+0.025
 helpers+0.025
Neg logit contributions
ho-0.048
 perspective-0.045
ute-0.043
 way-0.043
ci-0.042
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 chefs+0.008
 knights+0.008
 cooks+0.008
 builders+0.008
 helpers+0.007
Neg logit contributions
ho-0.008
 perspective-0.007
ute-0.007
ci-0.007
 way-0.007
Token see
Token ID766
Feature activation-2.00e-02

Pos logit contributions
 chefs+0.027
iers+0.026
 cooks+0.026
 helpers+0.026
 knights+0.026
Neg logit contributions
ho-0.026
 way-0.024
 perspective-0.024
ci-0.023
ute-0.022
Token what
Token ID644
Feature activation-2.78e-03

Pos logit contributions
 cooks+0.029
 chefs+0.029
 builders+0.029
 knights+0.029
 helpers+0.029
Neg logit contributions
 perspective-0.030
ho-0.030
 way-0.028
ute-0.028
ci-0.027
Token we
Token ID356
Feature activation-6.48e-02

Pos logit contributions
 chefs+0.005
 knights+0.005
 builders+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.004
 perspective-0.004
ute-0.004
ci-0.004
 way-0.003
Token can
Token ID460
Feature activation-3.30e-02

Pos logit contributions
 chefs+0.122
 helpers+0.119
 knights+0.118
 firefighters+0.117
ers+0.116
Neg logit contributions
ho-0.074
 would-0.068
 high-0.067
 way-0.066
 often-0.062
Token do
Token ID466
Feature activation-3.08e-02

Pos logit contributions
 chefs+0.057
 helpers+0.054
 knights+0.054
iers+0.053
 builders+0.053
Neg logit contributions
ho-0.047
 perspective-0.045
 way-0.043
ute-0.043
ci-0.041
Token,"
Token ID553
Feature activation+5.21e-03

Pos logit contributions
 chefs+0.051
 helpers+0.051
 knights+0.051
 cooks+0.051
 builders+0.050
Neg logit contributions
ho-0.041
 perspective-0.039
ute-0.038
ci-0.037
 way-0.036
Token Ben
Token ID3932
Feature activation-3.31e-03

Pos logit contributions
ho+0.011
 perspective+0.010
 way+0.009
ci+0.009
ute+0.009
Neg logit contributions
 builders-0.006
 chefs-0.006
 knights-0.006
 cooks-0.006
 helpers-0.006
Token says
Token ID1139
Feature activation-1.48e-02

Pos logit contributions
 builders+0.005
 chefs+0.005
 knights+0.005
 cooks+0.005
 pirates+0.005
Neg logit contributions
ho-0.005
 way-0.005
ci-0.004
ute-0.004
 perspective-0.004
Token,
Token ID11
Feature activation-2.74e-02

Pos logit contributions
 chefs+0.042
iers+0.042
 builders+0.041
 cooks+0.041
 helpers+0.041
Neg logit contributions
ho-0.023
 high-0.018
 way-0.017
 perspective-0.017
ute-0.016
Token Spot
Token ID15899
Feature activation-8.43e-03

Pos logit contributions
 cooks+0.040
 builders+0.039
iers+0.039
 chefs+0.038
 helpers+0.038
Neg logit contributions
ho-0.047
 way-0.042
 high-0.041
 lot-0.040
 perspective-0.040
Token heard
Token ID2982
Feature activation-3.79e-02

Pos logit contributions
 builders+0.016
 chefs+0.016
 helpers+0.016
iers+0.016
 knights+0.015
Neg logit contributions
ho-0.012
 way-0.010
ci-0.009
 perspective-0.009
ute-0.008
Token a
Token ID257
Feature activation-9.90e-03

Pos logit contributions
iers+0.072
 chefs+0.071
 builders+0.071
 cooks+0.070
 helpers+0.068
Neg logit contributions
 lot-0.044
ho-0.041
 high-0.041
 way-0.040
 poppy-0.039
Token snap
Token ID11495
Feature activation-9.57e-03

Pos logit contributions
 chefs+0.015
 builders+0.015
 knights+0.015
 cooks+0.015
 helpers+0.014
Neg logit contributions
ho-0.018
 perspective-0.016
ute-0.015
 way-0.015
ci-0.014
Token!
Token ID0
Feature activation-6.48e-02

Pos logit contributions
 chefs+0.014
 builders+0.014
 knights+0.014
 cooks+0.014
 helpers+0.014
Neg logit contributions
ho-0.017
 perspective-0.016
ute-0.015
ci-0.015
 way-0.015
Token He
Token ID679
Feature activation-2.34e-02

Pos logit contributions
 builders+0.101
 chefs+0.100
 helpers+0.098
ieve+0.097
 cooks+0.096
Neg logit contributions
ho-0.110
 pouch-0.094
 way-0.093
 high-0.092
 tight-0.091
Token dropped
Token ID5710
Feature activation-5.32e-04

Pos logit contributions
iers+0.049
 chefs+0.049
 builders+0.048
 helpers+0.048
 knights+0.046
Neg logit contributions
ho-0.030
 way-0.023
yt-0.020
 high-0.020
 perspective-0.020
Token the
Token ID262
Feature activation+6.66e-03

Pos logit contributions
 chefs+0.001
 knights+0.001
 builders+0.001
 cooks+0.001
 helpers+0.001
Neg logit contributions
ho-0.001
 perspective-0.001
ute-0.001
 way-0.001
ci-0.001
Token nail
Token ID17864
Feature activation+2.56e-02

Pos logit contributions
ho+0.009
ci+0.008
 perspective+0.008
ute+0.008
aman+0.007
Neg logit contributions
 builders-0.012
 chefs-0.012
 helpers-0.012
 knights-0.012
 cooks-0.012
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 perspective+0.058
ho+0.057
ute+0.055
ci+0.053
aman+0.053
Neg logit contributions
 knights-0.022
 chefs-0.020
 builders-0.020
 pirates-0.020
 spies-0.019
Token he
Token ID339
Feature activation-2.01e-02

Pos logit contributions
 cooks+0.060
 chefs+0.059
iers+0.058
 helpers+0.058
 builders+0.058
Neg logit contributions
ho-0.052
 high-0.043
 way-0.042
 lot-0.041
 perspective-0.039
Token heard
Token ID2982
Feature activation-4.55e-02

Pos logit contributions
 chefs+0.035
 builders+0.035
 helpers+0.034
iers+0.033
 knights+0.033
Neg logit contributions
ho-0.032
 perspective-0.027
 way-0.027
ute-0.026
ci-0.026
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
 chefs+0.082
iers+0.080
 helpers+0.077
 cooks+0.077
 builders+0.076
Neg logit contributions
ho-0.056
 high-0.056
 way-0.055
 perspective-0.052
 lot-0.051
Token loud
Token ID7812
Feature activation-2.86e-02

Pos logit contributions
 chefs+0.018
 builders+0.017
 knights+0.017
 helpers+0.017
 cooks+0.017
Neg logit contributions
ho-0.020
 perspective-0.018
ute-0.017
 way-0.017
ci-0.016
Token crack
Token ID8469
Feature activation-3.64e-02

Pos logit contributions
 chefs+0.043
 builders+0.041
 helpers+0.040
iers+0.039
 knights+0.039
Neg logit contributions
ho-0.056
 perspective-0.047
 way-0.047
ute-0.047
ci-0.044
Token!
Token ID0
Feature activation-6.47e-02

Pos logit contributions
 chefs+0.053
 knights+0.053
 cooks+0.052
 builders+0.052
 helpers+0.051
Neg logit contributions
ho-0.064
 perspective-0.060
ute-0.056
ci-0.056
 way-0.055
Token The
Token ID383
Feature activation-3.10e-02

Pos logit contributions
 chefs+0.117
 builders+0.113
 helpers+0.112
 cooks+0.109
ier+0.106
Neg logit contributions
ho-0.102
 high-0.081
 pouch-0.079
 way-0.078
 lot-0.078
Token branch
Token ID8478
Feature activation-8.80e-03

Pos logit contributions
 chefs+0.053
iers+0.052
 knights+0.050
 spies+0.050
 builders+0.050
Neg logit contributions
ho-0.049
 way-0.045
 perspective-0.044
ute-0.039
 high-0.039
Token broke
Token ID6265
Feature activation-1.52e-02

Pos logit contributions
 chefs+0.016
 builders+0.015
 knights+0.015
 helpers+0.015
 cooks+0.015
Neg logit contributions
ho-0.014
 perspective-0.012
 way-0.011
ute-0.011
ci-0.011
Token and
Token ID290
Feature activation-2.87e-02

Pos logit contributions
 helpers+0.025
 cooks+0.025
 chefs+0.025
 knights+0.024
 builders+0.024
Neg logit contributions
ho-0.024
 perspective-0.021
 way-0.021
 high-0.020
ute-0.019
Token Tweet
Token ID18752
Feature activation-2.27e-02

Pos logit contributions
 chefs+0.053
iers+0.051
 builders+0.051
 helpers+0.051
 cooks+0.051
Neg logit contributions
ho-0.044
 way-0.037
 perspective-0.033
ci-0.032
 high-0.032
Token fast
Token ID3049
Feature activation+1.44e-03

Pos logit contributions
ho+0.007
 perspective+0.006
ute+0.006
ci+0.006
 way+0.006
Neg logit contributions
 chefs-0.007
 builders-0.007
 cooks-0.007
 knights-0.007
 helpers-0.007
Token driver
Token ID4639
Feature activation-1.41e-02

Pos logit contributions
ho+0.003
 perspective+0.002
ci+0.002
ute+0.002
 way+0.002
Neg logit contributions
 knights-0.002
 builders-0.002
 chefs-0.002
 helpers-0.002
 cooks-0.002
Token named
Token ID3706
Feature activation-5.94e-02

Pos logit contributions
 chefs+0.025
 builders+0.023
 knights+0.023
 helpers+0.023
 spies+0.023
Neg logit contributions
ho-0.021
ci-0.018
 way-0.018
 perspective-0.018
ute-0.017
Token Tim
Token ID5045
Feature activation-3.30e-03

Pos logit contributions
 chefs+0.074
 cooks+0.072
 helpers+0.072
 knights+0.068
iers+0.068
Neg logit contributions
ho-0.118
 way-0.109
 perspective-0.104
ci-0.101
ute-0.100
Token went
Token ID1816
Feature activation-3.86e-02

Pos logit contributions
 chefs+0.006
 knights+0.006
 builders+0.006
 helpers+0.006
 spies+0.006
Neg logit contributions
ho-0.004
 way-0.004
 perspective-0.004
ci-0.004
ute-0.004
Token for
Token ID329
Feature activation-6.46e-02

Pos logit contributions
 chefs+0.070
 knights+0.069
 builders+0.069
 cooks+0.068
 helpers+0.068
Neg logit contributions
ho-0.055
 perspective-0.049
ute-0.047
ci-0.047
 way-0.045
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
ives+0.121
iers+0.120
 cooks+0.118
ier+0.118
paste+0.116
Neg logit contributions
 way-0.074
 high-0.072
 long-0.072
 lot-0.071
Spr-0.061
Token ride
Token ID6594
Feature activation-1.86e-02

Pos logit contributions
 knights+0.025
 chefs+0.025
 builders+0.025
 cooks+0.024
 pirates+0.024
Neg logit contributions
ho-0.020
 perspective-0.018
ute-0.017
ci-0.016
 wrists-0.015
Token in
Token ID287
Feature activation-1.57e-02

Pos logit contributions
 knights+0.024
 builders+0.023
 pirates+0.023
 chefs+0.022
 helpers+0.022
Neg logit contributions
ho-0.037
 perspective-0.035
aman-0.032
ute-0.031
 way-0.031
Token his
Token ID465
Feature activation-8.87e-03

Pos logit contributions
 chefs+0.020
 knights+0.020
 builders+0.020
 cooks+0.019
 helpers+0.019
Neg logit contributions
ho-0.032
 perspective-0.029
ute-0.028
ci-0.028
 way-0.026
Token loud
Token ID7812
Feature activation-1.71e-02

Pos logit contributions
 knights+0.015
 chefs+0.015
 builders+0.015
 helpers+0.015
 cooks+0.015
Neg logit contributions
ho-0.014
 perspective-0.013
ute-0.011
ci-0.011
 way-0.010
Token And
Token ID843
Feature activation-1.23e-02

Pos logit contributions
 chefs+0.018
 builders+0.017
 cooks+0.017
 helpers+0.017
 knights+0.017
Neg logit contributions
ho-0.025
 perspective-0.023
 way-0.023
ute-0.023
ci-0.022
Token they
Token ID484
Feature activation-1.20e-02

Pos logit contributions
 chefs+0.019
 knights+0.019
 builders+0.018
 helpers+0.018
 cooks+0.018
Neg logit contributions
ho-0.021
 perspective-0.020
ute-0.019
ci-0.018
aman-0.017
Token all
Token ID477
Feature activation-9.76e-03

Pos logit contributions
 knights+0.019
 chefs+0.019
 cooks+0.019
 builders+0.019
 helpers+0.019
Neg logit contributions
ho-0.019
 perspective-0.018
ute-0.017
ci-0.017
 way-0.016
Token lived
Token ID5615
Feature activation-8.01e-03

Pos logit contributions
 chefs+0.016
 knights+0.016
 builders+0.016
 cooks+0.016
 helpers+0.015
Neg logit contributions
ho-0.016
 perspective-0.014
ute-0.014
ci-0.013
 way-0.013
Token happily
Token ID18177
Feature activation-4.10e-02

Pos logit contributions
 chefs+0.014
 cooks+0.014
 builders+0.014
 helpers+0.013
 knights+0.013
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.010
ci-0.010
 way-0.010
Token ever
Token ID1683
Feature activation-6.46e-02

Pos logit contributions
 chefs+0.074
 builders+0.070
 cooks+0.070
iers+0.070
 helpers+0.070
Neg logit contributions
ho-0.061
ci-0.050
 perspective-0.050
ute-0.047
 way-0.046
Token after
Token ID706
Feature activation-3.24e-02

Pos logit contributions
iers+0.118
 chefs+0.113
 helpers+0.112
 cooks+0.109
 knights+0.109
Neg logit contributions
ho-0.097
 perspective-0.084
 way-0.079
ci-0.074
 photo-0.072
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
 chefs+0.053
 builders+0.053
iers+0.053
 knights+0.053
 cooks+0.052
Neg logit contributions
ho-0.048
 perspective-0.046
ute-0.043
 way-0.043
ci-0.042
Token<|endoftext|>
Token ID50256
Feature activation-1.41e-02

Pos logit contributions
 chefs+0.013
 builders+0.012
 helpers+0.012
 cooks+0.012
 knights+0.012
Neg logit contributions
ho-0.011
 perspective-0.010
ute-0.010
ci-0.010
 way-0.010
TokenOnce
Token ID7454
Feature activation+4.16e-02

Pos logit contributions
 builders+0.027
 chefs+0.026
 helpers+0.026
 knights+0.026
 cooks+0.026
Neg logit contributions
ho-0.019
 perspective-0.017
 way-0.016
ute-0.016
ci-0.015
Token upon
Token ID2402
Feature activation-2.30e-02

Pos logit contributions
ho+0.072
ci+0.072
 perspective+0.070
 wrists+0.070
ute+0.064
Neg logit contributions
 knights-0.055
 helpers-0.054
 boys-0.054
 builders-0.053
 cooks-0.052
Token there
Token ID612
Feature activation-6.75e-03

Pos logit contributions
 chefs+0.032
 cooks+0.032
 builders+0.032
 knights+0.032
 helpers+0.032
Neg logit contributions
ho-0.027
 perspective-0.024
ute-0.023
ci-0.023
 way-0.023
Token was
Token ID373
Feature activation-4.24e-02

Pos logit contributions
 builders+0.014
 chefs+0.014
 helpers+0.014
 knights+0.014
 warriors+0.014
Neg logit contributions
ho-0.008
 perspective-0.007
 way-0.007
ute-0.006
ci-0.006
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
iers+0.055
 chefs+0.055
 cooks+0.055
 helpers+0.054
 builders+0.053
Neg logit contributions
ho-0.078
 way-0.073
 perspective-0.070
 high-0.067
ute-0.065
Token special
Token ID2041
Feature activation-3.81e-02

Pos logit contributions
 chefs+0.023
 knights+0.023
 builders+0.023
 cooks+0.022
 helpers+0.022
Neg logit contributions
ho-0.021
 perspective-0.019
ute-0.018
ci-0.018
 way-0.017
Token room
Token ID2119
Feature activation-1.31e-02

Pos logit contributions
 chefs+0.060
 builders+0.059
 knights+0.059
 cooks+0.059
 helpers+0.058
Neg logit contributions
ho-0.063
 perspective-0.057
ute-0.055
ci-0.054
 way-0.053
Token called
Token ID1444
Feature activation-6.45e-02

Pos logit contributions
 chefs+0.017
 builders+0.017
 cooks+0.016
 knights+0.016
 helpers+0.016
Neg logit contributions
ho-0.025
 perspective-0.025
ute-0.023
aman-0.022
 way-0.022
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
 chefs+0.086
 cooks+0.085
 builders+0.084
 helpers+0.082
 knights+0.082
Neg logit contributions
ho-0.116
ute-0.107
 perspective-0.104
 way-0.103
ci-0.101
Token chamber
Token ID11847
Feature activation-1.58e-02

Pos logit contributions
 chefs+0.026
 knights+0.026
 builders+0.025
 cooks+0.025
 helpers+0.025
Neg logit contributions
ho-0.026
 perspective-0.023
ci-0.022
ute-0.022
 way-0.021
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 chefs+0.022
 knights+0.022
 builders+0.022
 cooks+0.022
 helpers+0.022
Neg logit contributions
ho-0.029
 perspective-0.026
ute-0.025
ci-0.025
 way-0.024
Token The
Token ID383
Feature activation-1.70e-02

Pos logit contributions
ier+0.030
 helpers+0.029
 chefs+0.028
 builders+0.028
ps+0.028
Neg logit contributions
 way-0.017
 -0.017
 perspective-0.017
Every-0.017
 high-0.017
Token chamber
Token ID11847
Feature activation+2.48e-04

Pos logit contributions
iers+0.032
paste+0.029
 cooks+0.029
 chefs+0.029
 builders+0.029
Neg logit contributions
 way-0.023
ho-0.020
 photo-0.019
 perspective-0.018
 high-0.018
Token about
Token ID546
Feature activation+1.08e-02

Pos logit contributions
 knights+0.004
 helpers+0.004
 chefs+0.004
 cooks+0.004
 builders+0.004
Neg logit contributions
ho-0.009
 perspective-0.009
ute-0.009
ci-0.008
 way-0.008
Token her
Token ID607
Feature activation-1.91e-02

Pos logit contributions
ho+0.021
ute+0.020
 perspective+0.019
ci+0.018
itating+0.017
Neg logit contributions
 chefs-0.014
 knights-0.014
 helpers-0.014
 builders-0.013
 cooks-0.013
Token adventure
Token ID8855
Feature activation+1.30e-03

Pos logit contributions
 knights+0.031
 chefs+0.031
 helpers+0.031
 builders+0.031
 cooks+0.031
Neg logit contributions
ho-0.031
 perspective-0.027
ute-0.026
ci-0.026
 way-0.024
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
ho+0.002
 perspective+0.002
ute+0.002
ci+0.002
 way+0.002
Neg logit contributions
 chefs-0.002
 knights-0.002
 builders-0.002
 cooks-0.002
 helpers-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.07e-02

Pos logit contributions
ho+0.025
 perspective+0.024
ci+0.023
 angle+0.021
ute+0.021
Neg logit contributions
 helpers-0.011
 cooks-0.010
 Doctor-0.010
 Someone-0.010
 chefs-0.010
TokenOnce
Token ID7454
Feature activation+5.02e-02

Pos logit contributions
 builders+0.017
 chefs+0.017
 helpers+0.017
 knights+0.017
 cooks+0.017
Neg logit contributions
ho-0.017
 perspective-0.016
ute-0.015
 way-0.015
ci-0.015
Token there
Token ID612
Feature activation+2.63e-02

Pos logit contributions
ho+0.093
ci+0.092
 wrists+0.089
 perspective+0.087
ute+0.083
Neg logit contributions
 boys-0.059
 helpers-0.058
 knights-0.058
ily-0.056
 builders-0.055
Token was
Token ID373
Feature activation-5.85e-03

Pos logit contributions
ho+0.046
ute+0.044
 perspective+0.044
ci+0.041
 wrists+0.039
Neg logit contributions
 chefs-0.036
 cooks-0.035
 knights-0.035
 helpers-0.034
 boys-0.034
Token a
Token ID257
Feature activation+1.48e-02

Pos logit contributions
 helpers+0.007
 builders+0.007
 cooks+0.007
 chefs+0.007
 knights+0.007
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.011
 way-0.010
ci-0.010
Token gifted
Token ID22527
Feature activation-1.48e-02

Pos logit contributions
ho+0.028
 perspective+0.026
ci+0.025
ute+0.025
 way+0.024
Neg logit contributions
 chefs-0.020
 knights-0.020
 cooks-0.020
 builders-0.019
 helpers-0.019
Token bear
Token ID6842
Feature activation+7.69e-03

Pos logit contributions
 chefs+0.023
 builders+0.022
 knights+0.022
 cooks+0.022
 helpers+0.022
Neg logit contributions
ho-0.025
 perspective-0.023
ute-0.022
ci-0.022
 way-0.021
Token weak
Token ID4939
Feature activation+4.72e-03

Pos logit contributions
 chefs+0.011
 knights+0.011
 builders+0.011
 cooks+0.010
 helpers+0.010
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.010
ci-0.010
 way-0.010
Token mouse
Token ID10211
Feature activation+6.37e-03

Pos logit contributions
ho+0.009
 perspective+0.008
ute+0.008
ci+0.007
 way+0.007
Neg logit contributions
 chefs-0.007
 builders-0.007
 knights-0.007
 cooks-0.007
 helpers-0.006
Token being
Token ID852
Feature activation-7.67e-03

Pos logit contributions
ho+0.011
ute+0.010
 perspective+0.010
ci+0.010
aman+0.009
Neg logit contributions
 knights-0.009
 cooks-0.009
 chefs-0.009
 helpers-0.009
 builders-0.009
Token filled
Token ID5901
Feature activation+1.24e-02

Pos logit contributions
 chefs+0.011
 builders+0.011
 knights+0.010
 cooks+0.010
 helpers+0.010
Neg logit contributions
ho-0.014
 perspective-0.013
ci-0.012
 way-0.012
ute-0.012
Token with
Token ID351
Feature activation+1.07e-02

Pos logit contributions
ho+0.022
 perspective+0.021
ute+0.020
ci+0.019
 way+0.019
Neg logit contributions
 knights-0.018
 chefs-0.017
 pirates-0.017
 cooks-0.017
 builders-0.017
Token strength
Token ID4202
Feature activation+4.65e-02

Pos logit contributions
ho+0.021
 perspective+0.019
ute+0.019
ci+0.018
 way+0.017
Neg logit contributions
 chefs-0.014
 knights-0.014
 builders-0.014
 cooks-0.013
 helpers-0.013
Token.
Token ID13
Feature activation-4.92e-03

Pos logit contributions
ho+0.098
 perspective+0.092
ute+0.092
ci+0.089
 photo+0.084
Neg logit contributions
 knights-0.046
 builders-0.045
 helpers-0.044
iers-0.044
 cooks-0.044
Token The
Token ID383
Feature activation+9.28e-03

Pos logit contributions
 chefs+0.006
 builders+0.006
 cooks+0.006
 helpers+0.006
 knights+0.006
Neg logit contributions
ho-0.010
 perspective-0.009
ute-0.009
ci-0.009
 way-0.009
Token mouse
Token ID10211
Feature activation+1.53e-02

Pos logit contributions
ho+0.018
 perspective+0.016
 way+0.016
ci+0.015
ute+0.015
Neg logit contributions
 chefs-0.012
 builders-0.012
 cooks-0.012
iers-0.012
 knights-0.012
Token grate
Token ID48612
Feature activation+1.08e-02

Pos logit contributions
ho+0.026
 perspective+0.025
ute+0.024
ci+0.023
 way+0.021
Neg logit contributions
 knights-0.023
 cooks-0.023
 helpers-0.022
 builders-0.022
 chefs-0.022
Tokenfully
Token ID2759
Feature activation+1.14e-02

Pos logit contributions
ho+0.019
 perspective+0.018
ute+0.017
ci+0.017
 way+0.017
Neg logit contributions
 chefs-0.015
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.015
Tokenorns
Token ID19942
Feature activation+4.42e-03

Pos logit contributions
 chefs+0.018
 cooks+0.018
 builders+0.018
 knights+0.018
 helpers+0.018
Neg logit contributions
ho-0.024
 perspective-0.022
ci-0.022
ute-0.022
 way-0.020
Token every
Token ID790
Feature activation+1.28e-02

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.007
 way+0.007
Neg logit contributions
 knights-0.006
 chefs-0.006
 builders-0.006
 helpers-0.006
 cooks-0.006
Token day
Token ID1110
Feature activation+3.37e-03

Pos logit contributions
ho+0.017
ute+0.015
itating+0.015
 perspective+0.015
ci+0.014
Neg logit contributions
 knights-0.023
 cooks-0.022
 pirates-0.022
 helpers-0.022
 builders-0.022
Token.
Token ID13
Feature activation+7.43e-03

Pos logit contributions
ho+0.006
 perspective+0.006
ute+0.006
ci+0.006
 way+0.005
Neg logit contributions
 knights-0.004
 builders-0.004
 chefs-0.004
 helpers-0.004
 cooks-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.24e-02

Pos logit contributions
ho+0.014
 perspective+0.012
ute+0.012
ci+0.012
 way+0.012
Neg logit contributions
 chefs-0.010
 knights-0.010
 builders-0.010
 cooks-0.010
 helpers-0.010
TokenOnce
Token ID7454
Feature activation+4.27e-02

Pos logit contributions
 builders+0.044
 helpers+0.043
 chefs+0.043
 knights+0.042
 cooks+0.042
Neg logit contributions
ho-0.027
 perspective-0.025
 way-0.024
ute-0.023
ci-0.021
Token upon
Token ID2402
Feature activation-1.56e-02

Pos logit contributions
ho+0.075
ci+0.073
 wrists+0.072
 perspective+0.071
ute+0.066
Neg logit contributions
 knights-0.056
 helpers-0.054
 builders-0.053
 boys-0.053
 pirates-0.053
Token a
Token ID257
Feature activation+3.30e-02

Pos logit contributions
 builders+0.024
 helpers+0.024
iers+0.024
 cooks+0.023
 chefs+0.023
Neg logit contributions
 perspective-0.023
ho-0.023
 high-0.022
 way-0.022
ute-0.021
Token time
Token ID640
Feature activation+4.30e-03

Pos logit contributions
ho+0.059
 perspective+0.052
ci+0.052
ute+0.050
 wrists+0.046
Neg logit contributions
 knights-0.049
 chefs-0.048
 cooks-0.047
 builders-0.046
 helpers-0.046
Token there
Token ID612
Feature activation+1.42e-02

Pos logit contributions
ho+0.008
 perspective+0.007
ci+0.007
ute+0.007
 way+0.007
Neg logit contributions
 chefs-0.006
 knights-0.006
 builders-0.006
 cooks-0.006
 helpers-0.006
Token was
Token ID373
Feature activation-9.30e-03

Pos logit contributions
ho+0.023
 perspective+0.021
ute+0.020
ci+0.020
 way+0.018
Neg logit contributions
 chefs-0.023
 cooks-0.022
 knights-0.022
 builders-0.022
 helpers-0.022
Token volcano
Token ID32410
Feature activation+1.98e-02

Pos logit contributions
 chefs+0.001
 knights+0.001
 builders+0.001
 cooks+0.001
 helpers+0.001
Neg logit contributions
ho-0.001
 perspective-0.001
ute-0.001
ci-0.001
 way-0.001
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
ho+0.038
 perspective+0.036
ute+0.035
ci+0.033
aman+0.032
Neg logit contributions
 knights-0.024
 builders-0.024
 chefs-0.024
 cooks-0.024
 helpers-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.75e-02

Pos logit contributions
ho+0.022
 perspective+0.021
ute+0.020
ci+0.020
 way+0.019
Neg logit contributions
 chefs-0.017
 knights-0.017
 cooks-0.017
 helpers-0.016
 builders-0.016
TokenOnce
Token ID7454
Feature activation+4.70e-02

Pos logit contributions
 builders+0.029
 helpers+0.029
 chefs+0.029
 knights+0.028
 cooks+0.028
Neg logit contributions
ho-0.028
 perspective-0.025
 way-0.024
ute-0.024
ci-0.023
Token upon
Token ID2402
Feature activation-1.42e-02

Pos logit contributions
ho+0.083
ci+0.081
 wrists+0.080
 perspective+0.078
ute+0.075
Neg logit contributions
 knights-0.060
 helpers-0.059
 boys-0.059
 cooks-0.057
 builders-0.057
Token a
Token ID257
Feature activation+4.02e-02

Pos logit contributions
 builders+0.022
 helpers+0.022
iers+0.021
 cooks+0.020
 chefs+0.020
Neg logit contributions
ho-0.021
 perspective-0.021
 way-0.021
 high-0.020
ute-0.020
Token time
Token ID640
Feature activation+7.91e-03

Pos logit contributions
ho+0.080
ci+0.072
 perspective+0.071
ute+0.070
spe+0.065
Neg logit contributions
 knights-0.050
 chefs-0.049
 cooks-0.048
 pirates-0.046
 helpers-0.045
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
ho+0.015
 perspective+0.014
ci+0.014
ute+0.014
 way+0.013
Neg logit contributions
 knights-0.010
 chefs-0.010
 builders-0.010
 cooks-0.010
 helpers-0.010
Token there
Token ID612
Feature activation+1.72e-02

Pos logit contributions
ho+0.029
 perspective+0.027
ci+0.026
ute+0.026
 way+0.025
Neg logit contributions
 knights-0.022
 chefs-0.022
 pirates-0.021
 cooks-0.021
 builders-0.021
Token were
Token ID547
Feature activation-5.91e-03

Pos logit contributions
ho+0.028
 perspective+0.026
ute+0.026
ci+0.025
 way+0.023
Neg logit contributions
 chefs-0.027
 cooks-0.026
 knights-0.026
 builders-0.025
 helpers-0.025
Token two
Token ID734
Feature activation-4.74e-04

Pos logit contributions
 chefs+0.008
 builders+0.008
 knights+0.008
 helpers+0.008
 cooks+0.008
Neg logit contributions
ho-0.010
 perspective-0.009
ute-0.009
 way-0.009
ci-0.009
Token be
Token ID307
Feature activation-1.43e-02

Pos logit contributions
ho+0.016
 perspective+0.015
ute+0.015
ci+0.014
 way+0.013
Neg logit contributions
 builders-0.012
 cooks-0.012
 helpers-0.012
 knights-0.012
 chefs-0.012
Token filled
Token ID5901
Feature activation+1.02e-02

Pos logit contributions
 chefs+0.023
 cooks+0.022
 builders+0.022
 knights+0.022
 helpers+0.022
Neg logit contributions
ho-0.023
 perspective-0.021
 way-0.021
ci-0.020
ute-0.019
Token with
Token ID351
Feature activation+6.22e-04

Pos logit contributions
ho+0.020
 perspective+0.018
ute+0.017
aman+0.017
burn+0.016
Neg logit contributions
 chefs-0.013
 pirates-0.013
 knights-0.013
 builders-0.013
 cooks-0.012
Token wonderful
Token ID7932
Feature activation-1.21e-02

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 chefs-0.001
 knights-0.001
 builders-0.001
 cooks-0.001
 helpers-0.001
Token<|endoftext|>
Token ID50256
Feature activation-2.61e-02

Pos logit contributions
 chefs+0.018
 builders+0.018
 cooks+0.018
 knights+0.018
 helpers+0.018
Neg logit contributions
ho-0.020
 perspective-0.019
 way-0.018
ute-0.018
ci-0.018
TokenOnce
Token ID7454
Feature activation+4.56e-02

Pos logit contributions
 builders+0.037
 chefs+0.036
 helpers+0.036
 knights+0.035
 cooks+0.035
Neg logit contributions
ho-0.047
 perspective-0.044
 way-0.042
ute-0.041
ci-0.041
Token upon
Token ID2402
Feature activation-9.42e-03

Pos logit contributions
ho+0.082
 wrists+0.082
ci+0.081
 perspective+0.080
burn+0.078
Neg logit contributions
 knights-0.056
 boys-0.055
 helpers-0.054
 pirates-0.053
 builders-0.053
Token a
Token ID257
Feature activation+3.46e-02

Pos logit contributions
 builders+0.015
 helpers+0.014
iers+0.014
 cooks+0.014
 chefs+0.014
Neg logit contributions
ho-0.014
 perspective-0.014
 way-0.014
 high-0.014
ute-0.013
Token time
Token ID640
Feature activation+1.06e-02

Pos logit contributions
ho+0.063
 perspective+0.056
ci+0.055
ute+0.054
spe+0.050
Neg logit contributions
 knights-0.050
 chefs-0.050
 cooks-0.048
 pirates-0.047
 builders-0.047
Token,
Token ID11
Feature activation+2.50e-02

Pos logit contributions
ho+0.021
 perspective+0.021
ci+0.020
ute+0.020
 wrists+0.019
Neg logit contributions
 knights-0.012
 chefs-0.012
 builders-0.011
 pirates-0.011
 cooks-0.011
Token there
Token ID612
Feature activation+2.57e-02

Pos logit contributions
ho+0.050
 perspective+0.050
ci+0.046
aman+0.045
ute+0.044
Neg logit contributions
 knights-0.030
 chefs-0.029
 pirates-0.029
 builders-0.027
 cooks-0.027
Tokenled
Token ID992
Feature activation+1.20e-02

Pos logit contributions
ho+0.042
 perspective+0.039
ute+0.038
ci+0.038
 way+0.037
Neg logit contributions
 chefs-0.018
 knights-0.018
 builders-0.018
 cooks-0.018
 helpers-0.017
Token and
Token ID290
Feature activation-8.66e-04

Pos logit contributions
ho+0.019
 perspective+0.018
ute+0.017
ci+0.016
 way+0.016
Neg logit contributions
 knights-0.019
 chefs-0.019
 builders-0.019
 cooks-0.019
 helpers-0.019
Token squ
Token ID2809
Feature activation+1.59e-02

Pos logit contributions
 chefs+0.001
 knights+0.001
 cooks+0.001
 helpers+0.001
 builders+0.001
Neg logit contributions
ho-0.002
 perspective-0.002
ute-0.001
ci-0.001
 way-0.001
Tokenirmed
Token ID15491
Feature activation+2.78e-02

Pos logit contributions
ho+0.027
ute+0.025
 perspective+0.025
ci+0.025
 wrists+0.022
Neg logit contributions
 builders-0.024
 chefs-0.024
 knights-0.023
 helpers-0.023
 cooks-0.023
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
ho+0.054
 perspective+0.051
ci+0.051
ute+0.050
 photo+0.047
Neg logit contributions
 knights-0.034
 chefs-0.033
 builders-0.033
 helpers-0.032
iers-0.031
Token trying
Token ID2111
Feature activation+3.49e-02

Pos logit contributions
ho+0.031
 perspective+0.030
ute+0.029
ci+0.027
 photo+0.026
Neg logit contributions
 knights-0.023
 chefs-0.022
 cooks-0.022
 helpers-0.021
 builders-0.021
Token to
Token ID284
Feature activation+6.20e-03

Pos logit contributions
ho+0.074
ci+0.070
ute+0.067
 perspective+0.066
spe+0.065
Neg logit contributions
 knights-0.038
 chefs-0.036
 spies-0.035
 helpers-0.034
 pirates-0.034
Token make
Token ID787
Feature activation-2.33e-03

Pos logit contributions
ho+0.010
 perspective+0.009
ute+0.009
ci+0.009
 way+0.009
Neg logit contributions
 chefs-0.010
 knights-0.010
 builders-0.010
 cooks-0.010
 helpers-0.010
Token it
Token ID340
Feature activation-4.30e-03

Pos logit contributions
 chefs+0.004
 builders+0.004
 knights+0.004
 cooks+0.004
 helpers+0.004
Neg logit contributions
ho-0.003
 perspective-0.003
ute-0.003
ci-0.003
 way-0.003
Token more
Token ID517
Feature activation-4.43e-03

Pos logit contributions
 chefs+0.007
 knights+0.007
 builders+0.006
 cooks+0.006
 helpers+0.006
Neg logit contributions
ho-0.007
 perspective-0.007
ute-0.006
ci-0.006
 way-0.006
Token comfortable
Token ID6792
Feature activation-1.40e-02

Pos logit contributions
 chefs+0.007
 knights+0.007
 builders+0.007
 cooks+0.007
 helpers+0.007
Neg logit contributions
ho-0.007
 perspective-0.007
ute-0.006
ci-0.006
 way-0.006
Token You
Token ID921
Feature activation-3.06e-02

Pos logit contributions
 helpers+0.005
 chefs+0.005
 knights+0.005
 cooks+0.005
 builders+0.005
Neg logit contributions
ho-0.004
ute-0.004
ci-0.004
 perspective-0.003
 photo-0.003
Token need
Token ID761
Feature activation+8.52e-03

Pos logit contributions
 knights+0.052
 chefs+0.052
 helpers+0.051
iddles+0.050
 footsteps+0.050
Neg logit contributions
ho-0.042
 way-0.038
ci-0.036
ute-0.036
 perspective-0.036
Token to
Token ID284
Feature activation+1.71e-02

Pos logit contributions
ho+0.012
 perspective+0.011
ute+0.010
ci+0.010
 way+0.010
Neg logit contributions
 chefs-0.015
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.015
Token share
Token ID2648
Feature activation+2.25e-02

Pos logit contributions
ho+0.033
 perspective+0.030
ute+0.028
ci+0.028
 way+0.027
Neg logit contributions
 chefs-0.023
 builders-0.023
 knights-0.023
 cooks-0.023
 helpers-0.023
Token."
Token ID526
Feature activation+4.60e-03

Pos logit contributions
ho+0.043
 perspective+0.039
ci+0.038
ute+0.038
 way+0.036
Neg logit contributions
 chefs-0.031
 knights-0.030
 builders-0.030
 cooks-0.030
 helpers-0.030
Token
Token ID198
Feature activation+3.79e-03

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.006
 way+0.006
Neg logit contributions
 chefs-0.007
 knights-0.007
 builders-0.007
 cooks-0.007
 helpers-0.007
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
ho+0.006
 perspective+0.006
ute+0.005
ci+0.005
 way+0.005
Neg logit contributions
 chefs-0.006
 knights-0.006
 builders-0.006
 cooks-0.006
 helpers-0.006
Token"
Token ID1
Feature activation+1.56e-02

Pos logit contributions
 builders+0.005
 chefs+0.005
 knights+0.005
 helpers+0.005
 cooks+0.005
Neg logit contributions
ho-0.006
 perspective-0.005
ci-0.005
ute-0.005
 way-0.005
TokenBut
Token ID1537
Feature activation+1.03e-02

Pos logit contributions
ho+0.025
 perspective+0.022
ute+0.021
ci+0.021
 way+0.020
Neg logit contributions
 chefs-0.026
 builders-0.026
 knights-0.026
 cooks-0.026
 helpers-0.026
Token Mom
Token ID11254
Feature activation-4.22e-03

Pos logit contributions
ho+0.016
 way+0.014
 perspective+0.014
ci+0.014
ute+0.013
Neg logit contributions
 knights-0.017
 chefs-0.017
 builders-0.017
 cooks-0.016
iers-0.016
Token,
Token ID11
Feature activation-1.57e-03

Pos logit contributions
 knights+0.007
 chefs+0.007
 builders+0.007
 helpers+0.007
 cooks+0.007
Neg logit contributions
ho-0.007
 perspective-0.006
ci-0.006
 way-0.006
ute-0.005
Token his
Token ID465
Feature activation-7.69e-03

Pos logit contributions
 cooks+0.024
 builders+0.024
 chefs+0.024
 helpers+0.024
 knights+0.023
Neg logit contributions
ho-0.025
 perspective-0.023
 way-0.023
ci-0.022
ute-0.022
Token toy
Token ID13373
Feature activation+2.67e-03

Pos logit contributions
 chefs+0.012
 knights+0.012
 helpers+0.012
 builders+0.012
 cooks+0.012
Neg logit contributions
ho-0.013
 perspective-0.012
ute-0.011
ci-0.011
aman-0.010
Token.
Token ID13
Feature activation+2.82e-03

Pos logit contributions
ho+0.005
 perspective+0.004
ute+0.004
ci+0.004
 way+0.004
Neg logit contributions
 chefs-0.004
 knights-0.004
 builders-0.004
 cooks-0.004
 helpers-0.004
Token
Token ID198
Feature activation+5.44e-03

Pos logit contributions
ho+0.005
 perspective+0.005
 way+0.005
ute+0.005
ci+0.005
Neg logit contributions
 chefs-0.004
 builders-0.004
 cooks-0.004
 knights-0.004
 helpers-0.004
Token
Token ID198
Feature activation-9.85e-03

Pos logit contributions
ho+0.012
ci+0.011
 perspective+0.011
ute+0.010
 wrists+0.010
Neg logit contributions
 knights-0.006
 cooks-0.005
 builders-0.005
 chefs-0.005
 pirates-0.005
TokenTom
Token ID13787
Feature activation+1.26e-02

Pos logit contributions
 knights+0.015
 builders+0.014
 cooks+0.014
 pirates+0.014
ily+0.014
Neg logit contributions
ho-0.017
ci-0.016
 perspective-0.016
 poppy-0.015
 wrists-0.014
Token's
Token ID338
Feature activation+9.60e-03

Pos logit contributions
ho+0.017
ci+0.015
 way+0.015
 perspective+0.014
ute+0.014
Neg logit contributions
 chefs-0.024
 builders-0.024
 helpers-0.023
 knights-0.023
 cooks-0.023
Token friend
Token ID1545
Feature activation-9.70e-03

Pos logit contributions
ho+0.017
 perspective+0.016
ci+0.015
ute+0.015
itating+0.014
Neg logit contributions
 knights-0.015
 helpers-0.014
 chefs-0.014
 builders-0.014
 pirates-0.014
Token,
Token ID11
Feature activation-2.25e-03

Pos logit contributions
 chefs+0.017
 builders+0.016
 cooks+0.016
 knights+0.016
 helpers+0.016
Neg logit contributions
ho-0.015
 way-0.013
ci-0.013
 perspective-0.013
ute-0.012
Token Sam
Token ID3409
Feature activation+1.10e-02

Pos logit contributions
 chefs+0.002
 cooks+0.002
 builders+0.002
 knights+0.002
 helpers+0.002
Neg logit contributions
ho-0.005
 perspective-0.004
ci-0.004
 way-0.004
ute-0.004
Token,
Token ID11
Feature activation+3.06e-03

Pos logit contributions
ho+0.018
 perspective+0.015
ci+0.015
 way+0.015
ute+0.014
Neg logit contributions
 chefs-0.018
 builders-0.018
 cooks-0.018
 knights-0.018
 helpers-0.018
Token yes
Token ID3763
Feature activation-3.12e-03

Pos logit contributions
 helpers+0.009
 chefs+0.009
 knights+0.009
 cooks+0.009
 builders+0.009
Neg logit contributions
 perspective-0.015
ho-0.014
ci-0.014
ute-0.014
 way-0.013
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 chefs+0.004
 cooks+0.004
 knights+0.004
 builders+0.004
 helpers+0.004
Neg logit contributions
ho-0.006
 perspective-0.006
ute-0.005
ci-0.005
 way-0.005
Token helped
Token ID4193
Feature activation+7.10e-03

Pos logit contributions
 chefs+0.018
 knights+0.018
 builders+0.018
 cooks+0.018
 helpers+0.018
Neg logit contributions
ho-0.014
 perspective-0.012
ci-0.012
ute-0.012
 way-0.012
Token her
Token ID607
Feature activation-1.26e-02

Pos logit contributions
ho+0.009
 perspective+0.008
ci+0.008
ute+0.008
 way+0.007
Neg logit contributions
 chefs-0.014
 builders-0.014
 knights-0.014
 helpers-0.014
 cooks-0.014
Token get
Token ID651
Feature activation+2.55e-04

Pos logit contributions
 chefs+0.019
 cooks+0.019
 helpers+0.019
 builders+0.019
 knights+0.019
Neg logit contributions
ho-0.023
 perspective-0.021
ute-0.019
ci-0.018
 way-0.017
Token on
Token ID319
Feature activation+2.63e-03

Pos logit contributions
ho+0.000
 perspective+0.000
ute+0.000
ci+0.000
 way+0.000
Neg logit contributions
 chefs-0.000
 builders-0.000
 cooks-0.000
 helpers-0.000
 knights-0.000
Token the
Token ID262
Feature activation-3.22e-03

Pos logit contributions
ho+0.007
ute+0.006
 perspective+0.006
ci+0.006
itating+0.006
Neg logit contributions
 builders-0.002
 chefs-0.002
 helpers-0.002
 boys-0.002
 spies-0.002
Token swing
Token ID9628
Feature activation+6.31e-03

Pos logit contributions
 boys+0.006
 builders+0.006
 knights+0.006
 helpers+0.006
 cooks+0.006
Neg logit contributions
ho-0.005
 perspective-0.004
ci-0.004
spe-0.004
ute-0.003
Token.
Token ID13
Feature activation+4.85e-03

Pos logit contributions
ho+0.013
 perspective+0.012
ute+0.011
ci+0.011
 way+0.011
Neg logit contributions
 builders-0.007
 knights-0.007
 chefs-0.007
 cooks-0.007
 helpers-0.007
Token
Token ID198
Feature activation+6.79e-03

Pos logit contributions
ho+0.008
 perspective+0.007
ute+0.007
ci+0.007
 way+0.007
Neg logit contributions
 chefs-0.008
 knights-0.008
 builders-0.008
 cooks-0.008
 helpers-0.007
Token
Token ID198
Feature activation-1.24e-02

Pos logit contributions
ho+0.014
 perspective+0.012
ci+0.012
ute+0.012
 way+0.011
Neg logit contributions
 chefs-0.008
 cooks-0.008
 knights-0.008
 builders-0.008
 helpers-0.008
Token I
Token ID314
Feature activation-4.17e-03

Pos logit contributions
 chefs+0.001
 builders+0.001
 cooks+0.001
 knights+0.001
 helpers+0.001
Neg logit contributions
ho-0.001
 perspective-0.001
 way-0.001
ute-0.001
ci-0.001
Token forgive
Token ID20927
Feature activation-1.10e-02

Pos logit contributions
 chefs+0.007
 knights+0.007
 builders+0.007
 cooks+0.006
 helpers+0.006
Neg logit contributions
ho-0.007
 perspective-0.006
ute-0.006
 way-0.006
ci-0.006
Token you
Token ID345
Feature activation-2.85e-02

Pos logit contributions
 builders+0.019
 chefs+0.018
 helpers+0.018
 knights+0.018
iers+0.018
Neg logit contributions
ho-0.017
 perspective-0.015
ci-0.014
 way-0.014
ute-0.014
Token.
Token ID13
Feature activation+8.19e-04

Pos logit contributions
 chefs+0.045
 builders+0.044
 cooks+0.044
 knights+0.043
 helpers+0.043
Neg logit contributions
ho-0.047
 perspective-0.043
ute-0.041
 way-0.041
ci-0.040
Token And
Token ID843
Feature activation+1.18e-02

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 chefs-0.001
 knights-0.001
 builders-0.001
 cooks-0.001
 helpers-0.001
Token yes
Token ID3763
Feature activation+8.75e-03

Pos logit contributions
ho+0.022
 perspective+0.021
ute+0.020
ci+0.019
 way+0.018
Neg logit contributions
 chefs-0.017
 knights-0.017
 helpers-0.016
 builders-0.016
 cooks-0.016
Token,
Token ID11
Feature activation+6.52e-03

Pos logit contributions
ho+0.017
 perspective+0.016
ci+0.015
ute+0.015
 way+0.014
Neg logit contributions
 chefs-0.012
 knights-0.012
 cooks-0.012
 builders-0.012
 helpers-0.011
Token we
Token ID356
Feature activation-3.07e-03

Pos logit contributions
ho+0.011
 perspective+0.010
ute+0.010
ci+0.010
 way+0.010
Neg logit contributions
 chefs-0.010
 knights-0.010
 builders-0.010
 cooks-0.010
 helpers-0.010
Token can
Token ID460
Feature activation+2.97e-02

Pos logit contributions
 chefs+0.005
 knights+0.005
 builders+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.005
 perspective-0.005
ute-0.004
 way-0.004
ci-0.004
Token be
Token ID307
Feature activation-6.36e-03

Pos logit contributions
ho+0.064
 perspective+0.059
ci+0.055
ute+0.055
spe+0.055
Neg logit contributions
 cooks-0.033
 knights-0.033
 builders-0.033
 chefs-0.033
 helpers-0.032
Token friends
Token ID2460
Feature activation-8.45e-03

Pos logit contributions
 cooks+0.010
iers+0.010
 builders+0.010
 chefs+0.010
 helpers+0.010
Neg logit contributions
ho-0.009
 way-0.008
ci-0.008
 perspective-0.008
ute-0.008
Token boy
Token ID2933
Feature activation+1.19e-02

Pos logit contributions
 chefs+0.019
 knights+0.019
 builders+0.019
 cooks+0.019
 helpers+0.019
Neg logit contributions
ho-0.021
 perspective-0.019
ute-0.018
ci-0.018
 way-0.018
Token named
Token ID3706
Feature activation-3.46e-02

Pos logit contributions
ho+0.022
 perspective+0.022
ute+0.021
aman+0.020
 way+0.020
Neg logit contributions
 knights-0.016
 chefs-0.016
 cooks-0.015
 pirates-0.015
 builders-0.015
Token Tim
Token ID5045
Feature activation+2.52e-02

Pos logit contributions
 helpers+0.046
iers+0.045
 cooks+0.044
 builders+0.044
 chefs+0.043
Neg logit contributions
ho-0.067
 way-0.061
ci-0.057
 perspective-0.056
 high-0.055
Tokenmy
Token ID1820
Feature activation+1.26e-02

Pos logit contributions
ho+0.043
 perspective+0.039
ute+0.038
ci+0.037
 way+0.035
Neg logit contributions
 chefs-0.039
 knights-0.039
 cooks-0.038
 builders-0.038
 helpers-0.037
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
ho+0.026
 perspective+0.024
ute+0.023
ci+0.022
 way+0.022
Neg logit contributions
 chefs-0.015
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.014
Token He
Token ID679
Feature activation+1.17e-03

Pos logit contributions
 helpers+0.020
 cooks+0.019
 chefs+0.019
 knights+0.019
 builders+0.019
Neg logit contributions
ho-0.035
 perspective-0.032
 way-0.031
ci-0.031
ute-0.030
Token loved
Token ID6151
Feature activation-1.80e-02

Pos logit contributions
ho+0.001
ci+0.001
 perspective+0.001
 way+0.001
ute+0.001
Neg logit contributions
 builders-0.002
 chefs-0.002
 helpers-0.002
iers-0.002
 knights-0.002
Token to
Token ID284
Feature activation-3.29e-03

Pos logit contributions
 chefs+0.034
 knights+0.033
 builders+0.033
 cooks+0.033
 helpers+0.033
Neg logit contributions
ho-0.025
 perspective-0.022
ute-0.021
ci-0.021
 way-0.020
Token play
Token ID711
Feature activation-1.65e-02

Pos logit contributions
 chefs+0.006
 builders+0.006
 knights+0.006
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.005
 perspective-0.005
ci-0.004
ute-0.004
 way-0.004
Token with
Token ID351
Feature activation-1.96e-02

Pos logit contributions
iers+0.030
 helpers+0.030
 cooks+0.029
 builders+0.029
 warriors+0.028
Neg logit contributions
ute-0.021
ho-0.020
ci-0.019
 way-0.018
 high-0.018
Token his
Token ID465
Feature activation-1.36e-02

Pos logit contributions
 chefs+0.024
 knights+0.024
 builders+0.024
 cooks+0.024
 helpers+0.024
Neg logit contributions
ho-0.039
 perspective-0.036
ute-0.035
ci-0.035
 way-0.034
Token the
Token ID262
Feature activation-7.77e-03

Pos logit contributions
 cooks+0.003
 builders+0.002
 helpers+0.002
 chefs+0.002
iers+0.002
Neg logit contributions
ho-0.002
 perspective-0.002
 way-0.002
 high-0.002
 photo-0.002
Token floor
Token ID4314
Feature activation+2.26e-02

Pos logit contributions
 chefs+0.011
 knights+0.011
 builders+0.010
 cooks+0.010
 helpers+0.010
Neg logit contributions
ho-0.015
 perspective-0.014
ci-0.013
ute-0.013
aman-0.012
Token.
Token ID13
Feature activation-4.99e-04

Pos logit contributions
ho+0.043
 perspective+0.040
ci+0.039
ute+0.039
 way+0.037
Neg logit contributions
 chefs-0.030
 knights-0.029
 builders-0.029
 cooks-0.029
 helpers-0.028
Token Mom
Token ID11254
Feature activation-2.90e-03

Pos logit contributions
 builders+0.001
 helpers+0.001
 chefs+0.001
 knights+0.001
 cooks+0.001
Neg logit contributions
ho-0.001
ute-0.001
 perspective-0.001
 way-0.001
 high-0.001
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
 builders+0.005
 chefs+0.005
 helpers+0.005
 knights+0.005
 spies+0.005
Neg logit contributions
ho-0.004
 way-0.003
ci-0.003
 perspective-0.003
s-0.003
Token Ben
Token ID3932
Feature activation+1.33e-03

Pos logit contributions
iers+0.023
 builders+0.023
 knights+0.023
 chefs+0.023
 cooks+0.023
Neg logit contributions
ho-0.036
 way-0.032
ci-0.031
 perspective-0.031
 photo-0.030
Token looked
Token ID3114
Feature activation+1.08e-02

Pos logit contributions
ho+0.002
 perspective+0.002
 way+0.002
ute+0.002
ci+0.002
Neg logit contributions
 builders-0.002
 chefs-0.002
 helpers-0.002
 knights-0.002
 cooks-0.002
Token at
Token ID379
Feature activation+2.05e-03

Pos logit contributions
ho+0.019
 perspective+0.017
ute+0.016
ci+0.016
 way+0.016
Neg logit contributions
 chefs-0.017
 knights-0.016
 builders-0.016
 cooks-0.016
 helpers-0.016
Token the
Token ID262
Feature activation-4.48e-03

Pos logit contributions
ho+0.003
 perspective+0.003
 way+0.003
ute+0.003
ci+0.003
Neg logit contributions
 chefs-0.003
 builders-0.003
 cooks-0.003
 knights-0.003
 helpers-0.003
Token mess
Token ID2085
Feature activation-8.64e-03

Pos logit contributions
 chefs+0.006
 knights+0.006
 builders+0.006
 cooks+0.006
 helpers+0.006
Neg logit contributions
ho-0.008
 perspective-0.008
ute-0.007
ci-0.007
 way-0.007
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
 chefs+0.014
 knights+0.014
 builders+0.014
 helpers+0.014
 cooks+0.014
Neg logit contributions
ho-0.013
 perspective-0.011
ute-0.011
ci-0.011
 way-0.011
Token the
Token ID262
Feature activation+3.93e-03

Pos logit contributions
 builders+0.011
 chefs+0.011
 cooks+0.011
 knights+0.011
 helpers+0.011
Neg logit contributions
ho-0.016
 perspective-0.015
 way-0.015
ci-0.014
ute-0.014
Token backyard
Token ID24296
Feature activation+1.84e-02

Pos logit contributions
ho+0.007
 perspective+0.006
ute+0.006
ci+0.006
spe+0.006
Neg logit contributions
 chefs-0.006
 knights-0.006
 pirates-0.006
 cooks-0.006
 builders-0.006
Token,
Token ID11
Feature activation-4.11e-03

Pos logit contributions
ho+0.038
ute+0.036
 perspective+0.036
ci+0.034
 wrists+0.033
Neg logit contributions
 knights-0.021
 chefs-0.021
 cooks-0.021
 builders-0.020
 pirates-0.020
Token playing
Token ID2712
Feature activation-1.43e-02

Pos logit contributions
 knights+0.004
 chefs+0.004
 pirates+0.004
 cooks+0.004
 spies+0.004
Neg logit contributions
ho-0.009
 perspective-0.009
ute-0.008
ci-0.008
spe-0.008
Token with
Token ID351
Feature activation-1.93e-02

Pos logit contributions
 helpers+0.026
 builders+0.025
 cooks+0.025
iers+0.025
 chefs+0.024
Neg logit contributions
ho-0.019
 perspective-0.018
ute-0.017
 way-0.017
ci-0.016
Token a
Token ID257
Feature activation+2.34e-03

Pos logit contributions
 chefs+0.024
 builders+0.024
 cooks+0.024
 knights+0.024
 helpers+0.023
Neg logit contributions
ho-0.038
 perspective-0.035
ute-0.034
ci-0.034
 way-0.033
Token toy
Token ID13373
Feature activation+6.78e-03

Pos logit contributions
ho+0.004
ci+0.004
 perspective+0.003
aman+0.003
itating+0.003
Neg logit contributions
 chefs-0.004
 knights-0.004
 builders-0.004
 spies-0.004
 pirates-0.004
Token truck
Token ID7779
Feature activation-7.94e-04

Pos logit contributions
ho+0.011
 perspective+0.010
ci+0.009
ute+0.009
 way+0.009
Neg logit contributions
 chefs-0.011
 knights-0.011
 builders-0.011
 cooks-0.011
 helpers-0.011
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
 chefs+0.001
 knights+0.001
 builders+0.001
 cooks+0.001
 helpers+0.001
Neg logit contributions
ho-0.002
 perspective-0.001
ute-0.001
ci-0.001
 way-0.001
Token Tim
Token ID5045
Feature activation+1.42e-02

Pos logit contributions
 helpers+0.031
 chefs+0.030
 cooks+0.029
 spies+0.029
 knights+0.029
Neg logit contributions
ho-0.039
 way-0.039
 perspective-0.036
 high-0.034
ute-0.034
Tokenmy
Token ID1820
Feature activation+4.86e-04

Pos logit contributions
ho+0.022
 perspective+0.020
ute+0.019
ci+0.019
 way+0.018
Neg logit contributions
 chefs-0.024
 knights-0.024
 builders-0.024
 cooks-0.024
 helpers-0.024
Token just
Token ID655
Feature activation-1.01e-02

Pos logit contributions
 chefs+0.008
 knights+0.008
 builders+0.008
 cooks+0.008
 helpers+0.008
Neg logit contributions
ho-0.010
 perspective-0.009
ute-0.009
ci-0.009
 way-0.008
Token in
Token ID287
Feature activation+1.47e-02

Pos logit contributions
 chefs+0.015
 builders+0.014
 knights+0.014
iers+0.014
 cooks+0.014
Neg logit contributions
ho-0.016
 way-0.015
 perspective-0.015
ci-0.014
 high-0.013
Token time
Token ID640
Feature activation+1.82e-02

Pos logit contributions
ho+0.030
ute+0.027
ci+0.026
 perspective+0.026
 way+0.024
Neg logit contributions
 chefs-0.019
 helpers-0.018
 knights-0.018
 builders-0.018
 cooks-0.018
Token!
Token ID0
Feature activation+3.78e-04

Pos logit contributions
ho+0.035
 perspective+0.033
ute+0.032
ci+0.032
 way+0.029
Neg logit contributions
 chefs-0.023
 builders-0.023
 knights-0.023
 helpers-0.023
 cooks-0.023
Token The
Token ID383
Feature activation+4.92e-03

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 builders-0.000
 chefs-0.000
 knights-0.000
 cooks-0.000
 helpers-0.000
Token party
Token ID2151
Feature activation-1.25e-03

Pos logit contributions
ho+0.009
 perspective+0.008
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 chefs-0.007
 builders-0.007
 knights-0.007
 cooks-0.007
 helpers-0.007
Tokengoers
Token ID31006
Feature activation-4.73e-03

Pos logit contributions
 chefs+0.002
 knights+0.002
 builders+0.002
 cooks+0.002
 helpers+0.002
Neg logit contributions
ho-0.002
 perspective-0.002
ute-0.002
ci-0.002
 way-0.002
Token were
Token ID547
Feature activation-9.09e-04

Pos logit contributions
 chefs+0.007
 builders+0.007
 knights+0.007
 cooks+0.007
 helpers+0.007
Neg logit contributions
ho-0.008
 perspective-0.007
ci-0.007
ute-0.007
 way-0.007
Token able
Token ID1498
Feature activation+2.89e-02

Pos logit contributions
 chefs+0.001
 builders+0.001
 knights+0.001
 cooks+0.001
 helpers+0.001
Neg logit contributions
ho-0.002
 perspective-0.002
ute-0.001
ci-0.001
 way-0.001
Token to
Token ID284
Feature activation-1.02e-02

Pos logit contributions
ho+0.047
ute+0.045
ci+0.043
 perspective+0.043
aman+0.037
Neg logit contributions
 builders-0.048
 chefs-0.048
 helpers-0.048
 knights-0.047
 cooks-0.047
Token use
Token ID779
Feature activation+4.31e-03

Pos logit contributions
 knights+0.017
 chefs+0.017
 builders+0.016
 helpers+0.016
 cooks+0.016
Neg logit contributions
ho-0.016
 perspective-0.015
 way-0.015
ci-0.014
ute-0.014
Token help
Token ID1037
Feature activation+2.20e-02

Pos logit contributions
 chefs+0.002
 knights+0.002
 builders+0.002
 cooks+0.002
 helpers+0.002
Neg logit contributions
ho-0.002
 perspective-0.002
ute-0.002
ci-0.002
 way-0.002
Token her
Token ID607
Feature activation-2.04e-03

Pos logit contributions
ho+0.034
 perspective+0.031
ute+0.029
ci+0.029
 way+0.026
Neg logit contributions
 chefs-0.039
 cooks-0.038
 knights-0.038
 builders-0.037
 helpers-0.037
Token mom
Token ID1995
Feature activation-7.85e-03

Pos logit contributions
 chefs+0.003
 helpers+0.003
 knights+0.003
 cooks+0.003
 builders+0.003
Neg logit contributions
ho-0.004
 perspective-0.004
ute-0.003
aman-0.003
ci-0.003
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
 chefs+0.012
 builders+0.012
 knights+0.012
 cooks+0.012
 helpers+0.012
Neg logit contributions
ho-0.013
 perspective-0.012
ci-0.011
ute-0.011
 way-0.011
Token One
Token ID1881
Feature activation+4.44e-03

Pos logit contributions
ho+0.039
 perspective+0.035
ute+0.034
ci+0.034
 way+0.032
Neg logit contributions
 chefs-0.037
 knights-0.037
 cooks-0.036
 builders-0.036
 helpers-0.036
Token day
Token ID1110
Feature activation-1.93e-03

Pos logit contributions
ho+0.005
 perspective+0.005
ute+0.004
 wrists+0.004
ci+0.004
Neg logit contributions
 cooks-0.009
 helpers-0.009
 chefs-0.009
 builders-0.009
 knights-0.009
Token,
Token ID11
Feature activation+9.06e-03

Pos logit contributions
 chefs+0.002
 cooks+0.002
 knights+0.002
 builders+0.002
 helpers+0.002
Neg logit contributions
ho-0.004
 perspective-0.004
ci-0.004
ute-0.004
spe-0.003
Token her
Token ID607
Feature activation-7.56e-03

Pos logit contributions
ho+0.015
 perspective+0.015
ute+0.014
ci+0.014
spe+0.013
Neg logit contributions
 chefs-0.015
 cooks-0.014
 helpers-0.014
 knights-0.014
 builders-0.014
Token mom
Token ID1995
Feature activation-8.54e-03

Pos logit contributions
 chefs+0.012
 helpers+0.012
 cooks+0.012
 builders+0.012
 knights+0.012
Neg logit contributions
ho-0.013
 perspective-0.012
ute-0.011
ci-0.011
 way-0.010
Token asked
Token ID1965
Feature activation-1.59e-02

Pos logit contributions
 knights+0.014
 chefs+0.014
 builders+0.014
 spies+0.014
iers+0.014
Neg logit contributions
ho-0.013
ci-0.011
 way-0.011
 perspective-0.010
ute-0.010
Token her
Token ID607
Feature activation-1.65e-02

Pos logit contributions
 builders+0.034
 chefs+0.033
 knights+0.033
 cooks+0.033
 helpers+0.033
Neg logit contributions
ho-0.018
 perspective-0.014
 way-0.014
ci-0.013
ute-0.013
Token do
Token ID466
Feature activation-1.08e-02

Pos logit contributions
 knights+0.055
 builders+0.055
 chefs+0.055
 cooks+0.054
 helpers+0.054
Neg logit contributions
ho-0.071
 perspective-0.065
 way-0.063
ute-0.063
ci-0.062
Token.
Token ID13
Feature activation+7.29e-05

Pos logit contributions
 chefs+0.017
 knights+0.017
 builders+0.016
 cooks+0.016
 helpers+0.016
Neg logit contributions
ho-0.018
 perspective-0.017
ute-0.016
ci-0.016
 way-0.016
Token
Token ID220
Feature activation-4.73e-03

Pos logit contributions
ho+0.000
 perspective+0.000
 way+0.000
ci+0.000
ute+0.000
Neg logit contributions
 chefs-0.000
 builders-0.000
 cooks-0.000
 knights-0.000
 helpers-0.000
Token
Token ID198
Feature activation-7.07e-03

Pos logit contributions
 chefs+0.008
 builders+0.008
 cooks+0.008
 knights+0.008
 helpers+0.008
Neg logit contributions
ho-0.007
 perspective-0.006
ute-0.006
 way-0.006
ci-0.006
Token
Token ID198
Feature activation-2.52e-02

Pos logit contributions
 chefs+0.011
 knights+0.011
 builders+0.011
 cooks+0.011
 helpers+0.011
Neg logit contributions
ho-0.012
 perspective-0.011
ute-0.010
ci-0.010
 way-0.010
TokenSuddenly
Token ID38582
Feature activation-3.49e-02

Pos logit contributions
 chefs+0.043
 knights+0.043
 builders+0.043
 cooks+0.042
 helpers+0.042
Neg logit contributions
ho-0.039
 perspective-0.035
ute-0.033
ci-0.033
 way-0.032
Token he
Token ID339
Feature activation-1.53e-02

Pos logit contributions
 chefs+0.071
 builders+0.069
 cooks+0.068
iddles+0.067
 spies+0.066
Neg logit contributions
ho-0.037
 way-0.035
 high-0.031
 perspective-0.031
ute-0.029
Token heard
Token ID2982
Feature activation-4.30e-02

Pos logit contributions
 chefs+0.027
 builders+0.026
 helpers+0.026
 cooks+0.026
 knights+0.025
Neg logit contributions
ho-0.024
 way-0.020
 perspective-0.020
ci-0.019
ute-0.018
Token a
Token ID257
Feature activation-3.51e-03

Pos logit contributions
 chefs+0.076
 builders+0.073
 cooks+0.071
iers+0.069
 helpers+0.069
Neg logit contributions
 way-0.055
 lot-0.053
ho-0.052
 perspective-0.051
 high-0.050
Token voice
Token ID3809
Feature activation-4.58e-02

Pos logit contributions
 chefs+0.005
 builders+0.005
 knights+0.005
 cooks+0.005
 helpers+0.005
Neg logit contributions
ho-0.007
 perspective-0.006
 way-0.006
ute-0.006
ci-0.005
Token.
Token ID13
Feature activation-3.19e-02

Pos logit contributions
 chefs+0.080
 builders+0.078
 knights+0.078
 cooks+0.075
 helpers+0.075
Neg logit contributions
ho-0.073
ci-0.062
 perspective-0.060
 way-0.060
ute-0.059
Token happy
Token ID3772
Feature activation-1.27e-02

Pos logit contributions
ho+0.022
 perspective+0.021
ute+0.020
ci+0.020
 way+0.019
Neg logit contributions
 chefs-0.019
 cooks-0.018
 knights-0.018
 builders-0.018
 helpers-0.018
Token and
Token ID290
Feature activation+6.89e-03

Pos logit contributions
 chefs+0.021
 builders+0.021
 knights+0.021
 cooks+0.021
 helpers+0.021
Neg logit contributions
ho-0.020
 perspective-0.018
ci-0.017
ute-0.017
 way-0.017
Token said
Token ID531
Feature activation-4.96e-03

Pos logit contributions
ho+0.012
 perspective+0.011
ute+0.011
ci+0.010
 way+0.010
Neg logit contributions
 chefs-0.010
 knights-0.010
 cooks-0.010
 builders-0.010
 helpers-0.010
Token,
Token ID11
Feature activation+7.30e-03

Pos logit contributions
 knights+0.006
 builders+0.006
 chefs+0.006
 helpers+0.006
 cooks+0.006
Neg logit contributions
 perspective-0.010
ho-0.010
ci-0.009
ute-0.009
aman-0.008
Token "
Token ID366
Feature activation+1.69e-02

Pos logit contributions
 perspective+0.015
ho+0.014
ci+0.014
ute+0.013
 way+0.013
Neg logit contributions
 chefs-0.009
 cooks-0.009
 knights-0.009
 builders-0.009
 helpers-0.008
TokenThank
Token ID10449
Feature activation-3.77e-02

Pos logit contributions
ho+0.035
ute+0.033
 perspective+0.032
 way+0.031
ci+0.031
Neg logit contributions
 chefs-0.020
 builders-0.019
 cooks-0.019
 knights-0.018
 helpers-0.017
Token you
Token ID345
Feature activation-2.99e-02

Pos logit contributions
 chefs+0.068
 cooks+0.067
 knights+0.066
 builders+0.066
 helpers+0.066
Neg logit contributions
ho-0.054
ci-0.045
ute-0.044
 perspective-0.043
 way-0.039
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
 chefs+0.051
 cooks+0.051
 builders+0.050
 knights+0.050
 helpers+0.049
Neg logit contributions
ho-0.045
 perspective-0.039
 way-0.038
ci-0.038
ute-0.038
Token Mom
Token ID11254
Feature activation-6.55e-03

Pos logit contributions
ho+0.025
 perspective+0.024
ute+0.022
ci+0.022
 way+0.020
Neg logit contributions
 chefs-0.017
 knights-0.017
 helpers-0.017
 builders-0.016
 cooks-0.016
Tokenmy
Token ID1820
Feature activation-7.88e-03

Pos logit contributions
 chefs+0.011
 knights+0.011
 builders+0.011
 cooks+0.011
 helpers+0.011
Neg logit contributions
ho-0.010
 perspective-0.009
ci-0.008
ute-0.008
 way-0.008
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 chefs+0.013
 knights+0.013
 builders+0.013
 cooks+0.013
 helpers+0.013
Neg logit contributions
ho-0.012
 perspective-0.011
ci-0.011
 way-0.010
ute-0.010
Token were
Token ID547
Feature activation-7.32e-03

Pos logit contributions
 chefs+0.017
 builders+0.017
 knights+0.017
 cooks+0.017
 helpers+0.017
Neg logit contributions
ho-0.018
 perspective-0.016
ci-0.015
 way-0.015
ute-0.015
Token brave
Token ID14802
Feature activation+1.40e-03

Pos logit contributions
 chefs+0.012
 builders+0.012
 cooks+0.012
 knights+0.012
 helpers+0.012
Neg logit contributions
ho-0.011
 perspective-0.010
 way-0.010
ute-0.010
ci-0.010
Token and
Token ID290
Feature activation-8.20e-03

Pos logit contributions
ho+0.002
 perspective+0.002
 way+0.002
yt+0.002
ci+0.002
Neg logit contributions
 cooks-0.002
 chefs-0.002
iers-0.002
 builders-0.002
renches-0.002
Token clever
Token ID14169
Feature activation-8.38e-03

Pos logit contributions
 chefs+0.013
 knights+0.012
 builders+0.012
 cooks+0.012
 helpers+0.012
Neg logit contributions
ho-0.014
ci-0.012
 way-0.012
 perspective-0.011
ute-0.011
Token.
Token ID13
Feature activation+5.53e-03

Pos logit contributions
 chefs+0.015
 cooks+0.014
 builders+0.014
iers+0.014
renches+0.014
Neg logit contributions
ho-0.011
 perspective-0.010
 way-0.009
ute-0.008
ci-0.008
Token<|endoftext|>
Token ID50256
Feature activation-3.61e-02

Pos logit contributions
ho+0.009
 perspective+0.009
ute+0.008
ci+0.008
 way+0.008
Neg logit contributions
 chefs-0.009
 knights-0.008
 builders-0.008
 cooks-0.008
 helpers-0.008
TokenAnna
Token ID31160
Feature activation+1.06e-02

Pos logit contributions
 builders+0.055
 helpers+0.053
 chefs+0.051
 knights+0.051
 mechanics+0.050
Neg logit contributions
ho-0.059
 perspective-0.053
ute-0.053
 way-0.051
Today-0.048
Token and
Token ID290
Feature activation+6.59e-04

Pos logit contributions
ho+0.015
 way+0.013
ci+0.012
 perspective+0.011
ute+0.011
Neg logit contributions
 builders-0.020
iers-0.019
 chefs-0.019
 helpers-0.019
 knights-0.019
Token Ben
Token ID3932
Feature activation+2.04e-02

Pos logit contributions
ho+0.001
 perspective+0.001
ute+0.001
ci+0.001
 way+0.001
Neg logit contributions
 chefs-0.001
 knights-0.001
 builders-0.001
 cooks-0.001
 helpers-0.001
Token were
Token ID547
Feature activation+1.29e-02

Pos logit contributions
ho+0.028
 way+0.023
 perspective+0.022
ci+0.021
 photo+0.020
Neg logit contributions
 builders-0.038
 helpers-0.038
iers-0.038
 knights-0.038
 chefs-0.037
Token twins
Token ID20345
Feature activation+1.23e-02

Pos logit contributions
ho+0.021
 perspective+0.020
ci+0.017
ute+0.017
spe+0.016
Neg logit contributions
 knights-0.023
 chefs-0.023
 helpers-0.023
 cooks-0.023
 builders-0.023
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
ho+0.017
 perspective+0.015
ute+0.014
ci+0.014
 way+0.014
Neg logit contributions
 chefs-0.014
 knights-0.014
 builders-0.014
 cooks-0.014
 helpers-0.014
Token
Token ID198
Feature activation+1.01e-04

Pos logit contributions
ho+0.018
 perspective+0.017
ci+0.016
ute+0.016
 way+0.015
Neg logit contributions
 chefs-0.016
 knights-0.015
 builders-0.015
 cooks-0.015
 helpers-0.015
Token"
Token ID1
Feature activation+2.08e-02

Pos logit contributions
ho+0.000
 perspective+0.000
ci+0.000
ute+0.000
 way+0.000
Neg logit contributions
 chefs-0.000
 builders-0.000
 knights-0.000
 cooks-0.000
 helpers-0.000
TokenWow
Token ID22017
Feature activation+8.71e-03

Pos logit contributions
ho+0.032
 perspective+0.029
ute+0.027
ci+0.027
 way+0.026
Neg logit contributions
 chefs-0.037
 cooks-0.036
 builders-0.036
 knights-0.036
 helpers-0.036
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
ho+0.014
 perspective+0.012
ute+0.012
 way+0.012
ci+0.012
Neg logit contributions
 chefs-0.014
 knights-0.014
 builders-0.014
 cooks-0.014
 helpers-0.014
Token you
Token ID345
Feature activation-5.28e-02

Pos logit contributions
ho+0.028
 perspective+0.026
ute+0.024
ci+0.024
 way+0.023
Neg logit contributions
 chefs-0.024
 knights-0.024
 builders-0.024
 cooks-0.024
 helpers-0.023
Token two
Token ID734
Feature activation-1.18e-02

Pos logit contributions
iers+0.104
 firefighters+0.100
iddles+0.099
ores+0.098
paste+0.096
Neg logit contributions
 way-0.063
 lot-0.049
 photo-0.047
 high-0.047
ho-0.044
Token are
Token ID389
Feature activation-2.77e-02

Pos logit contributions
 chefs+0.020
 knights+0.019
 builders+0.019
 helpers+0.019
 cooks+0.019
Neg logit contributions
ho-0.017
 perspective-0.016
 way-0.016
ute-0.014
ci-0.014
Token very
Token ID845
Feature activation-2.53e-02

Pos logit contributions
iers+0.055
paste+0.052
 opener+0.052
renches+0.051
aid+0.051
Neg logit contributions
 way-0.035
 high-0.028
ho-0.026
 photo-0.024
yt-0.022
Token clever
Token ID14169
Feature activation-7.08e-03

Pos logit contributions
 chefs+0.044
 knights+0.043
 builders+0.043
iers+0.043
 helpers+0.043
Neg logit contributions
ho-0.037
 way-0.031
 perspective-0.030
ute-0.030
ci-0.029
Token!"
Token ID2474
Feature activation+2.11e-02

Pos logit contributions
 chefs+0.011
 cooks+0.011
iers+0.011
 builders+0.011
 knights+0.011
Neg logit contributions
ho-0.010
 perspective-0.010
 way-0.009
yt-0.008
ute-0.008
Token But
Token ID887
Feature activation+5.80e-03

Pos logit contributions
ho+0.007
 perspective+0.006
ute+0.006
ci+0.006
 way+0.005
Neg logit contributions
 cooks-0.005
 chefs-0.005
 helpers-0.005
 knights-0.005
 builders-0.005
Token no
Token ID645
Feature activation+1.63e-03

Pos logit contributions
ho+0.010
ute+0.010
 perspective+0.010
ci+0.009
yt+0.009
Neg logit contributions
 helpers-0.008
 knights-0.007
 cooks-0.007
 chefs-0.007
 builders-0.007
Token matter
Token ID2300
Feature activation+1.61e-03

Pos logit contributions
ho+0.003
ute+0.002
 perspective+0.002
ci+0.002
spe+0.002
Neg logit contributions
 helpers-0.003
 chefs-0.003
 knights-0.003
 cooks-0.003
 builders-0.003
Token how
Token ID703
Feature activation+7.26e-03

Pos logit contributions
ho+0.003
 perspective+0.003
ute+0.003
ci+0.003
 way+0.003
Neg logit contributions
 chefs-0.002
 knights-0.002
 builders-0.002
 helpers-0.002
 cooks-0.002
Token much
Token ID881
Feature activation-2.55e-02

Pos logit contributions
ho+0.011
 perspective+0.009
ci+0.009
ute+0.009
 way+0.008
Neg logit contributions
 builders-0.013
 helpers-0.013
 knights-0.013
 cooks-0.013
 chefs-0.013
Token she
Token ID673
Feature activation-4.20e-02

Pos logit contributions
 chefs+0.047
 knights+0.047
 builders+0.047
 cooks+0.046
 helpers+0.046
Neg logit contributions
ho-0.035
 perspective-0.032
ute-0.029
ci-0.029
 way-0.029
Token begged
Token ID32946
Feature activation+1.54e-03

Pos logit contributions
 builders+0.086
iers+0.085
 knights+0.084
 chefs+0.084
 spies+0.082
Neg logit contributions
ho-0.052
 way-0.051
ci-0.050
 would-0.041
 skipped-0.040
Token,
Token ID11
Feature activation+5.06e-03

Pos logit contributions
ho+0.003
 perspective+0.002
ute+0.002
ci+0.002
 way+0.002
Neg logit contributions
 chefs-0.002
 knights-0.002
 builders-0.002
 cooks-0.002
 helpers-0.002
Token her
Token ID607
Feature activation-3.16e-03

Pos logit contributions
ho+0.008
ute+0.008
 perspective+0.008
ci+0.007
 photo+0.006
Neg logit contributions
 helpers-0.008
 chefs-0.008
 cooks-0.008
 builders-0.008
 knights-0.008
Token mom
Token ID1995
Feature activation-6.62e-03

Pos logit contributions
 helpers+0.005
 knights+0.004
 builders+0.004
 chefs+0.004
 cooks+0.004
Neg logit contributions
ho-0.006
 perspective-0.005
ci-0.005
ute-0.005
itating-0.005
Token said
Token ID531
Feature activation+3.22e-04

Pos logit contributions
 chefs+0.010
 cooks+0.010
 helpers+0.010
 knights+0.010
 builders+0.010
Neg logit contributions
ho-0.011
 perspective-0.010
ute-0.010
ci-0.009
 way-0.009