TOP ACTIVATIONS
MAX = 0.081

, Lily and Tom liked each other more and more .
of vegetables . You can each take one and keep it
sweet ie . You can each choose one balloon ," Mom
man and the girl liked each other very much . They
zy . They both loved each other very much . One
were so happy to share each other 's company . They
" Let 's give each other a haircut !" Tom
and dad said they could each choose one toy . Tom
They were friendly and liked each other very much . They
friends . They both loved each other very much and were
children suggested that they should each tell a lie , and
happy together . They loved each other very much and wanted
my and the kid loved each other very much . They
twins . You should love each other ." Sam
. She worked hard on each of her building projects and
. Sam and Emily loved each other very much and they
getting married ! They loved each other very much . They
and sister . They loved each other very much , but
disagreed on what to do each day . One day Mark
handsome boy . They loved each other , so they got

MIN ACTIVATIONS
MIN = -0.133

When Daisy put the magn ifying glass down , the cater
the necklace and the magn ifying glass and started to examine
hat and grabbed her magn ifying glass . She was ready
He took out a magn ifying glass and a notebook and
the watch with the magn ifying glass . He saw the
She takes out a magn ifying glass . She looks at
and handed him the magn ifying glass . He
, next to her magn ifying glass . She looked at
the necklace and the magn ifying glass and told him that
cried , dropping the magn ifying glass . "
examine it with the magn ifying glass ?" "
said , holding the magn ifying glass over a black ant
, handing him the magn ifying glass . She looked around
flashlight , and a magn ifying glass . As
She took out a magn ifying glass and looked very carefully
He pointed his magn ifying glass at different classmates and
it she had a magn ifying glass , a tiny torch
She had a big magn ifying glass that made things look
. Lily had a magn ifying glass that she found in
a flashlight and a magn ifying glass to examine their finds

INTERVAL 0.028 - 0.081
CONTAINS 3.171%

, Fiona found a delicious bunch of bananas . She excited
Anna are scared . They nod . They are sorry .
sad . He wanted to cross the bridge just like all
nice . You are my best friend ," Sara said .
. After , she picked up her bag and continued on

INTERVAL -0.026 - 0.028
CONTAINS 91.636%

bad wolf ! " Oh no !" Lily cried . But
and it felt perfect . But then a strange
saw a big white factory . She was curious and went
stuck in the branches ! The brave lion was
so tiny before , so she decided to remove it .

INTERVAL -0.079 - -0.026
CONTAINS 5.191%

with the children and the toys . They have fun .
started eating her ice - cream . After
have some crack ers and cheese while we wait for the
It roared and f lic kered and reached far up into
They lost their homes and belongings . They hoped the rain

INTERVAL -0.133 - -0.079
CONTAINS 0.003%

pulled and pulled but it wouldn 't open . Suddenly ,
YOU WILL GET IN TR OU BLE !" L
but it felt tight and wouldn 't bud ge . Sam
throw the ball and he fet ches it . They take
pushed and pulled until it wouldn 't go any further .
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
 courtyard+0.056
 neighborhood+0.053
castle+0.052
 classroom+0.052
 builder+0.052
Neg logit contributions
text-0.066
-0.055
ifying-0.054
 AW-0.051
 eight-0.051
Token Lily
Token ID20037
Feature activation-3.76e-04

Pos logit contributions
 classroom+0.036
 courtyard+0.035
 neighborhood+0.034
 painter+0.034
 builder+0.034
Neg logit contributions
text-0.048
ifying-0.040
-0.040
 AW-0.038
 tal-0.036
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
text+0.001
+0.001
ifying+0.001
 eight+0.001
 Sn+0.001
Neg logit contributions
 courtyard-0.001
 classroom-0.001
castle-0.001
 builder-0.001
 neighborhood-0.001
Token Tom
Token ID4186
Feature activation+2.18e-03

Pos logit contributions
text+0.041
+0.034
ifying+0.031
 AW+0.031
 Sn+0.030
Neg logit contributions
 courtyard-0.059
 neighborhood-0.056
 builder-0.056
castle-0.056
 classroom-0.056
Token liked
Token ID8288
Feature activation+1.21e-02

Pos logit contributions
 courtyard+0.003
 classroom+0.003
castle+0.003
 builder+0.003
 studio+0.003
Neg logit contributions
text-0.004
-0.004
 Sn-0.003
 eight-0.003
ifying-0.003
Token each
Token ID1123
Feature activation+8.13e-02

Pos logit contributions
 courtyard+0.024
 neighborhood+0.022
 grounds+0.022
 classroom+0.022
castle+0.022
Neg logit contributions
text-0.019
-0.016
 Sn-0.015
 eight-0.015
ifying-0.015
Token other
Token ID584
Feature activation-1.17e-02

Pos logit contributions
 courtyard+0.178
castle+0.174
 neighborhood+0.166
 studio+0.166
 grounds+0.165
Neg logit contributions
text-0.118
-0.096
 eight-0.085
 Sn-0.081
ifying-0.080
Token more
Token ID517
Feature activation+2.71e-02

Pos logit contributions
text+0.023
+0.020
ifying+0.018
 Sn+0.017
 AW+0.017
Neg logit contributions
 courtyard-0.020
 builder-0.020
castle-0.019
 classroom-0.019
 painter-0.019
Token and
Token ID290
Feature activation+8.80e-03

Pos logit contributions
 courtyard+0.053
 builder+0.051
 classroom+0.050
 neighborhood+0.050
castle+0.050
Neg logit contributions
text-0.048
-0.038
ifying-0.033
 AW-0.033
 Sn-0.032
Token more
Token ID517
Feature activation+2.04e-02

Pos logit contributions
 courtyard+0.016
utter+0.016
 neighborhood+0.015
 builder+0.015
icker+0.015
Neg logit contributions
text-0.014
-0.013
 rib-0.012
 Fl-0.011
 Sn-0.011
Token.
Token ID13
Feature activation+2.11e-03

Pos logit contributions
 courtyard+0.038
 classroom+0.036
castle+0.036
 neighborhood+0.035
 builder+0.035
Neg logit contributions
text-0.036
-0.030
ifying-0.027
 Sn-0.026
 AW-0.026
Token of
Token ID286
Feature activation+7.34e-03

Pos logit contributions
text+0.023
+0.019
ifying+0.019
 AW+0.018
 Sn+0.018
Neg logit contributions
 courtyard-0.018
 classroom-0.018
 neighborhood-0.017
 builder-0.017
 painter-0.017
Token vegetables
Token ID13701
Feature activation-2.19e-02

Pos logit contributions
 classroom+0.011
 courtyard+0.011
 neighborhood+0.011
 painter+0.011
castle+0.011
Neg logit contributions
text-0.016
 AW-0.013
ifying-0.013
-0.012
 Sn-0.012
Token.
Token ID13
Feature activation+5.32e-03

Pos logit contributions
text+0.044
+0.038
ifying+0.037
 AW+0.035
 Sn+0.035
Neg logit contributions
 courtyard-0.035
 classroom-0.034
 neighborhood-0.034
 painter-0.033
 builder-0.033
Token You
Token ID921
Feature activation-3.71e-02

Pos logit contributions
 classroom+0.007
 neighborhood+0.007
 courtyard+0.007
 builder+0.007
 painter+0.007
Neg logit contributions
text-0.012
 AW-0.010
ifying-0.010
 rib-0.010
 tal-0.009
Token can
Token ID460
Feature activation-7.11e-03

Pos logit contributions
text+0.070
ifying+0.056
 AW+0.056
 Sn+0.053
 rib+0.051
Neg logit contributions
 courtyard-0.065
 builder-0.062
 neighborhood-0.061
 classroom-0.061
 artist-0.061
Token each
Token ID1123
Feature activation+8.10e-02

Pos logit contributions
text+0.014
ifying+0.012
+0.012
 AW+0.011
 Sn+0.011
Neg logit contributions
 courtyard-0.012
 classroom-0.011
 neighborhood-0.011
castle-0.011
 builder-0.011
Token take
Token ID1011
Feature activation-1.66e-02

Pos logit contributions
 courtyard+0.122
 neighborhood+0.118
 classroom+0.118
 painter+0.117
castle+0.117
Neg logit contributions
text-0.176
ifying-0.148
 AW-0.142
-0.141
 Sn-0.138
Token one
Token ID530
Feature activation+4.23e-02

Pos logit contributions
text+0.034
ifying+0.029
 AW+0.028
 Sn+0.027
Fl+0.025
Neg logit contributions
 artist-0.021
 courtyard-0.021
 neighborhood-0.021
 outdoors-0.021
 classroom-0.020
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 courtyard+0.061
 classroom+0.060
castle+0.060
 neighborhood+0.058
 painter+0.058
Neg logit contributions
text-0.093
 AW-0.078
ifying-0.075
-0.075
 Sn-0.074
Token keep
Token ID1394
Feature activation+9.25e-03

Pos logit contributions
text+0.028
 AW+0.023
ifying+0.022
 Sn+0.021
+0.020
Neg logit contributions
 courtyard-0.019
 classroom-0.018
castle-0.018
 builder-0.018
 painter-0.018
Token it
Token ID340
Feature activation+9.65e-03

Pos logit contributions
 neighborhood+0.013
 artist+0.013
 painter+0.013
 courtyard+0.013
 classroom+0.013
Neg logit contributions
text-0.021
ifying-0.016
 AW-0.016
-0.015
Fl-0.015
Token sweet
Token ID6029
Feature activation+1.15e-02

Pos logit contributions
 artist+0.005
 classroom+0.005
 painter+0.005
castle+0.005
 builder+0.005
Neg logit contributions
text-0.008
ifying-0.007
 AW-0.007
 tal-0.007
-0.006
Tokenie
Token ID494
Feature activation-1.04e-02

Pos logit contributions
 courtyard+0.009
 classroom+0.008
 neighborhood+0.008
 painter+0.008
castle+0.008
Neg logit contributions
text-0.033
ifying-0.030
-0.029
 AW-0.029
 Sn-0.028
Token.
Token ID13
Feature activation+2.83e-03

Pos logit contributions
text+0.024
ifying+0.021
 AW+0.020
+0.019
 Sn+0.019
Neg logit contributions
 courtyard-0.014
 classroom-0.013
 painter-0.013
 neighborhood-0.013
 builder-0.013
Token You
Token ID921
Feature activation-3.46e-02

Pos logit contributions
 classroom+0.004
 neighborhood+0.004
 builder+0.004
 painter+0.004
 courtyard+0.004
Neg logit contributions
text-0.006
 AW-0.005
ifying-0.004
 tal-0.004
-0.004
Token can
Token ID460
Feature activation-1.31e-02

Pos logit contributions
text+0.063
ifying+0.050
 AW+0.049
 Sn+0.047
+0.046
Neg logit contributions
 courtyard-0.063
 builder-0.063
 painter-0.062
 classroom-0.062
 neighborhood-0.062
Token each
Token ID1123
Feature activation+8.06e-02

Pos logit contributions
text+0.027
ifying+0.023
+0.022
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.021
 classroom-0.020
 neighborhood-0.020
castle-0.020
 painter-0.019
Token choose
Token ID3853
Feature activation-1.32e-03

Pos logit contributions
 courtyard+0.129
 painter+0.128
 neighborhood+0.128
 classroom+0.128
 artist+0.125
Neg logit contributions
text-0.166
ifying-0.139
-0.132
 AW-0.131
 Sn-0.129
Token one
Token ID530
Feature activation+4.02e-02

Pos logit contributions
text+0.003
 AW+0.002
ifying+0.002
+0.002
 tal+0.002
Neg logit contributions
 classroom-0.002
castle-0.002
 neighborhood-0.002
 courtyard-0.002
 painter-0.002
Token balloon
Token ID21190
Feature activation+6.17e-03

Pos logit contributions
 courtyard+0.066
 classroom+0.065
 neighborhood+0.064
castle+0.063
 painter+0.063
Neg logit contributions
text-0.080
ifying-0.066
-0.066
 AW-0.064
 Sn-0.062
Token,"
Token ID553
Feature activation+2.25e-02

Pos logit contributions
 courtyard+0.011
 classroom+0.010
castle+0.010
 neighborhood+0.010
 painter+0.010
Neg logit contributions
text-0.012
ifying-0.010
-0.010
 AW-0.009
 Sn-0.009
Token Mom
Token ID11254
Feature activation+3.35e-03

Pos logit contributions
 classroom+0.033
 painter+0.032
 neighborhood+0.030
 artist+0.029
 studio+0.029
Neg logit contributions
text-0.050
 AW-0.044
ifying-0.044
 tal-0.039
ified-0.038
Token man
Token ID582
Feature activation+2.97e-02

Pos logit contributions
text+0.003
+0.002
 rib+0.002
 eight+0.002
 ho+0.002
Neg logit contributions
 courtyard-0.002
castle-0.002
ook-0.002
 studio-0.002
utter-0.002
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 courtyard+0.051
 classroom+0.050
 neighborhood+0.048
castle+0.048
 studio+0.048
Neg logit contributions
text-0.056
-0.049
ifying-0.044
 Sn-0.043
 AW-0.043
Token the
Token ID262
Feature activation-4.31e-03

Pos logit contributions
text+0.033
+0.026
ifying+0.026
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.038
 classroom-0.037
 neighborhood-0.037
 painter-0.036
 builder-0.036
Token girl
Token ID2576
Feature activation+1.53e-02

Pos logit contributions
text+0.007
 rib+0.006
+0.006
 eight+0.006
 ho+0.006
Neg logit contributions
castle-0.007
ook-0.007
 courtyard-0.007
utter-0.007
ashing-0.007
Token liked
Token ID8288
Feature activation+9.10e-03

Pos logit contributions
 courtyard+0.025
 classroom+0.025
castle+0.024
 builder+0.024
 neighborhood+0.024
Neg logit contributions
text-0.030
-0.026
ifying-0.024
 Sn-0.023
 eight-0.023
Token each
Token ID1123
Feature activation+8.05e-02

Pos logit contributions
 courtyard+0.018
 classroom+0.017
 neighborhood+0.017
castle+0.017
 builder+0.017
Neg logit contributions
text-0.015
-0.013
ifying-0.012
 Sn-0.012
 AW-0.011
Token other
Token ID584
Feature activation-1.55e-02

Pos logit contributions
 courtyard+0.169
castle+0.167
 neighborhood+0.156
 grounds+0.155
pit+0.154
Neg logit contributions
text-0.127
-0.109
 eight-0.094
 AW-0.092
ifying-0.091
Token very
Token ID845
Feature activation+2.31e-02

Pos logit contributions
text+0.032
+0.027
ifying+0.026
 Sn+0.025
 AW+0.025
Neg logit contributions
 courtyard-0.025
 classroom-0.024
castle-0.023
 neighborhood-0.023
 builder-0.023
Token much
Token ID881
Feature activation+3.76e-03

Pos logit contributions
 courtyard+0.050
 studio+0.049
 neighborhood+0.049
 classroom+0.048
castle+0.048
Neg logit contributions
text-0.035
-0.029
ifying-0.024
 AW-0.024
 Sn-0.023
Token.
Token ID13
Feature activation+4.35e-03

Pos logit contributions
 courtyard+0.007
 classroom+0.007
castle+0.007
 neighborhood+0.007
 studio+0.007
Neg logit contributions
text-0.006
-0.006
ifying-0.005
 Sn-0.005
 rib-0.005
Token They
Token ID1119
Feature activation+1.45e-02

Pos logit contributions
 courtyard+0.008
castle+0.007
 classroom+0.007
 neighborhood+0.007
 painter+0.007
Neg logit contributions
text-0.008
-0.007
ifying-0.007
 Sn-0.006
 AW-0.006
Tokenzy
Token ID7357
Feature activation+4.39e-03

Pos logit contributions
text+0.062
ifying+0.052
+0.052
 AW+0.050
 Sn+0.048
Neg logit contributions
 courtyard-0.053
 classroom-0.052
 neighborhood-0.051
castle-0.050
 painter-0.050
Token.
Token ID13
Feature activation-7.70e-03

Pos logit contributions
 courtyard+0.007
 classroom+0.007
 neighborhood+0.007
castle+0.006
 builder+0.006
Neg logit contributions
text-0.009
-0.008
ifying-0.008
 AW-0.007
 Sn-0.007
Token They
Token ID1119
Feature activation+1.65e-02

Pos logit contributions
text+0.014
+0.012
ifying+0.011
 AW+0.011
 Sn+0.011
Neg logit contributions
 courtyard-0.014
castle-0.014
 builder-0.014
 painter-0.013
 artist-0.013
Token both
Token ID1111
Feature activation+1.80e-02

Pos logit contributions
 courtyard+0.022
 classroom+0.021
castle+0.021
 neighborhood+0.021
 studio+0.020
Neg logit contributions
text-0.038
-0.034
ifying-0.033
 AW-0.031
 Sn-0.031
Token loved
Token ID6151
Feature activation+1.80e-02

Pos logit contributions
 courtyard+0.016
 classroom+0.015
 neighborhood+0.015
 painter+0.015
 artist+0.014
Neg logit contributions
text-0.049
ifying-0.044
 AW-0.042
-0.042
 Sn-0.041
Token each
Token ID1123
Feature activation+7.95e-02

Pos logit contributions
 courtyard+0.029
 classroom+0.028
 neighborhood+0.028
castle+0.028
 builder+0.027
Neg logit contributions
text-0.036
-0.031
ifying-0.030
 AW-0.029
 Sn-0.028
Token other
Token ID584
Feature activation-1.13e-02

Pos logit contributions
 courtyard+0.147
castle+0.141
 classroom+0.141
 neighborhood+0.140
 builder+0.137
Neg logit contributions
text-0.145
-0.120
ifying-0.115
 AW-0.111
 Sn-0.109
Token very
Token ID845
Feature activation+1.70e-02

Pos logit contributions
text+0.024
+0.021
ifying+0.020
 AW+0.019
 Sn+0.019
Neg logit contributions
 courtyard-0.017
 classroom-0.017
 neighborhood-0.016
castle-0.016
 builder-0.016
Token much
Token ID881
Feature activation-3.40e-04

Pos logit contributions
 courtyard+0.035
 studio+0.035
 neighborhood+0.034
 classroom+0.034
castle+0.034
Neg logit contributions
text-0.027
-0.022
ifying-0.019
 AW-0.019
 Sn-0.018
Token.
Token ID13
Feature activation-2.22e-03

Pos logit contributions
text+0.001
+0.001
ifying+0.000
 Sn+0.000
 rib+0.000
Neg logit contributions
 courtyard-0.001
 classroom-0.001
castle-0.001
 painter-0.001
 builder-0.001
Token One
Token ID1881
Feature activation+1.83e-02

Pos logit contributions
text+0.004
+0.003
ifying+0.003
 AW+0.003
 Sn+0.003
Neg logit contributions
 courtyard-0.004
 classroom-0.004
castle-0.004
 builder-0.004
 neighborhood-0.004
Token were
Token ID547
Feature activation+2.15e-02

Pos logit contributions
 courtyard+0.010
 classroom+0.010
 neighborhood+0.010
castle+0.009
 builder+0.009
Neg logit contributions
text-0.016
-0.014
ifying-0.014
 AW-0.013
 Sn-0.013
Token so
Token ID523
Feature activation+8.89e-03

Pos logit contributions
 courtyard+0.032
 neighborhood+0.031
 classroom+0.031
 builder+0.030
 painter+0.030
Neg logit contributions
text-0.046
ifying-0.040
-0.039
 AW-0.038
 Sn-0.037
Token happy
Token ID3772
Feature activation+1.77e-02

Pos logit contributions
 courtyard+0.009
 classroom+0.009
 neighborhood+0.009
castle+0.008
 builder+0.008
Neg logit contributions
text-0.023
-0.021
ifying-0.020
 AW-0.020
 Sn-0.019
Token to
Token ID284
Feature activation+3.99e-03

Pos logit contributions
 courtyard+0.036
 classroom+0.033
 grounds+0.033
 studio+0.033
 builder+0.032
Neg logit contributions
text-0.028
-0.025
 rib-0.021
ifying-0.021
 eight-0.021
Token share
Token ID2648
Feature activation+1.98e-02

Pos logit contributions
 courtyard+0.008
 neighborhood+0.007
 classroom+0.007
castle+0.007
 grounds+0.007
Neg logit contributions
text-0.007
-0.006
ifying-0.006
 AW-0.005
 eight-0.005
Token each
Token ID1123
Feature activation+7.92e-02

Pos logit contributions
 courtyard+0.039
 builder+0.037
castle+0.037
 classroom+0.036
 grounds+0.036
Neg logit contributions
text-0.034
-0.029
ifying-0.027
 rib-0.025
 Sn-0.025
Token other
Token ID584
Feature activation-4.50e-04

Pos logit contributions
 courtyard+0.138
 classroom+0.133
 neighborhood+0.132
castle+0.130
 builder+0.129
Neg logit contributions
text-0.152
-0.126
ifying-0.125
 AW-0.120
 Sn-0.116
Token's
Token ID338
Feature activation-3.67e-03

Pos logit contributions
text+0.001
ifying+0.001
 AW+0.001
+0.001
 Sn+0.001
Neg logit contributions
 courtyard-0.001
 classroom-0.001
 neighborhood-0.001
 painter-0.001
castle-0.001
Token company
Token ID1664
Feature activation+5.97e-03

Pos logit contributions
text+0.008
+0.007
ifying+0.007
 AW+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.006
 classroom-0.005
 neighborhood-0.005
castle-0.005
 builder-0.005
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
 courtyard+0.010
 classroom+0.010
 neighborhood+0.010
 painter+0.009
 builder+0.009
Neg logit contributions
text-0.012
-0.010
ifying-0.010
 Sn-0.009
 AW-0.009
Token They
Token ID1119
Feature activation+8.06e-03

Pos logit contributions
text+0.007
ifying+0.006
 AW+0.006
 rib+0.006
+0.006
Neg logit contributions
 neighborhood-0.007
 classroom-0.006
 courtyard-0.006
 painter-0.006
 builder-0.006
Token
Token ID198
Feature activation-2.39e-03

Pos logit contributions
text+0.020
ifying+0.019
 AW+0.017
 rib+0.017
 tal+0.016
Neg logit contributions
 neighborhood-0.015
 classroom-0.014
 courtyard-0.014
 painter-0.013
 builder-0.013
Token"
Token ID1
Feature activation+6.79e-03

Pos logit contributions
text+0.005
ifying+0.004
 tal+0.004
 rib+0.003
 AW+0.003
Neg logit contributions
 classroom-0.004
 courtyard-0.004
 neighborhood-0.004
 painter-0.004
 builder-0.004
TokenLet
Token ID5756
Feature activation-7.62e-03

Pos logit contributions
 courtyard+0.012
 classroom+0.011
 neighborhood+0.011
 painter+0.011
 builder+0.011
Neg logit contributions
text-0.013
ifying-0.011
-0.011
 AW-0.011
 Sn-0.010
Token's
Token ID338
Feature activation+1.91e-02

Pos logit contributions
ifying+0.018
text+0.016
 Sn+0.015
 eight+0.014
 reins+0.014
Neg logit contributions
 painter-0.010
 neighborhood-0.009
 classroom-0.009
 artist-0.009
 courtyard-0.009
Token give
Token ID1577
Feature activation-4.60e-03

Pos logit contributions
 courtyard+0.026
 classroom+0.026
 painter+0.025
 builder+0.025
 artist+0.025
Neg logit contributions
text-0.045
ifying-0.037
 AW-0.036
-0.035
 Sn-0.035
Token each
Token ID1123
Feature activation+7.87e-02

Pos logit contributions
text+0.011
 AW+0.009
ifying+0.009
 tal+0.008
ified+0.008
Neg logit contributions
 painter-0.006
 artist-0.005
 classroom-0.005
 outside-0.005
 building-0.005
Token other
Token ID584
Feature activation-1.38e-02

Pos logit contributions
 painter+0.099
 builder+0.092
 artist+0.091
 classroom+0.087
 scientist+0.084
Neg logit contributions
text-0.196
ifying-0.175
 AW-0.162
 Sn-0.157
ified-0.152
Token a
Token ID257
Feature activation-7.04e-03

Pos logit contributions
text+0.027
ifying+0.022
 AW+0.022
 Sn+0.021
+0.020
Neg logit contributions
 classroom-0.023
 painter-0.022
 courtyard-0.022
 artist-0.021
 neighborhood-0.021
Token haircut
Token ID45548
Feature activation+1.28e-02

Pos logit contributions
text+0.015
ifying+0.012
+0.012
 AW+0.011
 Sn+0.011
Neg logit contributions
 courtyard-0.011
 classroom-0.011
 builder-0.011
 painter-0.011
 studio-0.011
Token!"
Token ID2474
Feature activation+1.92e-02

Pos logit contributions
 courtyard+0.027
 classroom+0.027
castle+0.027
 builder+0.026
 neighborhood+0.026
Neg logit contributions
text-0.019
ifying-0.015
-0.014
 AW-0.014
 Sn-0.013
Token Tom
Token ID4186
Feature activation-8.59e-04

Pos logit contributions
 classroom+0.029
 courtyard+0.028
 painter+0.027
 neighborhood+0.027
 studio+0.027
Neg logit contributions
text-0.041
ifying-0.036
 AW-0.033
-0.032
 tal-0.031
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
 courtyard+0.043
 classroom+0.042
 neighborhood+0.041
 painter+0.040
castle+0.040
Neg logit contributions
text-0.059
ifying-0.050
-0.049
 AW-0.048
 Sn-0.046
Token dad
Token ID9955
Feature activation+2.09e-02

Pos logit contributions
text+0.027
 AW+0.024
ifying+0.023
+0.022
 Sn+0.021
Neg logit contributions
 painter-0.016
 classroom-0.015
 courtyard-0.015
 builder-0.015
 neighborhood-0.015
Token said
Token ID531
Feature activation-3.20e-03

Pos logit contributions
 courtyard+0.034
 classroom+0.033
 neighborhood+0.033
castle+0.032
 painter+0.032
Neg logit contributions
text-0.042
ifying-0.035
-0.035
 AW-0.034
 Sn-0.033
Token they
Token ID484
Feature activation+3.66e-03

Pos logit contributions
text+0.007
ifying+0.006
 AW+0.006
 tal+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.004
 painter-0.004
 builder-0.004
castle-0.004
 classroom-0.004
Token could
Token ID714
Feature activation-5.03e-03

Pos logit contributions
 courtyard+0.006
 classroom+0.006
 neighborhood+0.006
 builder+0.006
castle+0.006
Neg logit contributions
text-0.007
ifying-0.006
-0.006
 AW-0.006
 Sn-0.006
Token each
Token ID1123
Feature activation+7.85e-02

Pos logit contributions
text+0.011
ifying+0.009
+0.009
 AW+0.009
 Sn+0.008
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.007
 builder-0.007
Token choose
Token ID3853
Feature activation-2.38e-03

Pos logit contributions
 courtyard+0.125
 classroom+0.121
 neighborhood+0.120
castle+0.117
 painter+0.116
Neg logit contributions
text-0.163
ifying-0.137
-0.136
 AW-0.131
 Sn-0.127
Token one
Token ID530
Feature activation+4.31e-02

Pos logit contributions
text+0.005
 AW+0.004
ifying+0.004
+0.004
 eight+0.004
Neg logit contributions
 courtyard-0.003
 classroom-0.003
castle-0.003
 neighborhood-0.003
 builder-0.003
Token toy
Token ID13373
Feature activation+4.56e-03

Pos logit contributions
 courtyard+0.065
 classroom+0.063
 neighborhood+0.062
castle+0.061
 builder+0.061
Neg logit contributions
text-0.092
ifying-0.078
-0.077
 AW-0.075
 Sn-0.073
Token.
Token ID13
Feature activation-5.62e-03

Pos logit contributions
 courtyard+0.007
 classroom+0.007
castle+0.007
 neighborhood+0.007
 builder+0.006
Neg logit contributions
text-0.010
ifying-0.008
-0.008
 AW-0.008
 Sn-0.008
Token Tom
Token ID4186
Feature activation+4.67e-03

Pos logit contributions
text+0.012
 AW+0.010
ifying+0.010
 rib+0.010
 tal+0.009
Neg logit contributions
 classroom-0.008
 neighborhood-0.008
 courtyard-0.008
 painter-0.007
 studio-0.007
Token They
Token ID1119
Feature activation+1.36e-02

Pos logit contributions
text+0.015
+0.013
ifying+0.013
 AW+0.012
 Sn+0.012
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.012
 builder-0.011
 painter-0.011
Token were
Token ID547
Feature activation+2.08e-02

Pos logit contributions
 courtyard+0.019
 classroom+0.018
castle+0.018
 neighborhood+0.018
 studio+0.018
Neg logit contributions
text-0.031
-0.027
ifying-0.026
 AW-0.025
 Sn-0.025
Token friendly
Token ID8030
Feature activation+1.77e-03

Pos logit contributions
 classroom+0.025
 courtyard+0.025
 neighborhood+0.025
 painter+0.024
 builder+0.024
Neg logit contributions
text-0.051
ifying-0.045
 AW-0.044
-0.043
 rib-0.041
Token and
Token ID290
Feature activation-5.81e-03

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 neighborhood+0.003
castle+0.003
 builder+0.003
Neg logit contributions
text-0.004
-0.003
ifying-0.003
 AW-0.003
 Sn-0.003
Token liked
Token ID8288
Feature activation+6.87e-03

Pos logit contributions
text+0.014
+0.012
ifying+0.012
 Sn+0.012
 eight+0.011
Neg logit contributions
 courtyard-0.007
 neighborhood-0.007
castle-0.007
 classroom-0.007
 studio-0.007
Token each
Token ID1123
Feature activation+7.82e-02

Pos logit contributions
 courtyard+0.012
 classroom+0.012
 neighborhood+0.011
castle+0.011
 builder+0.011
Neg logit contributions
text-0.013
-0.011
ifying-0.010
 AW-0.010
 Sn-0.010
Token other
Token ID584
Feature activation-1.54e-02

Pos logit contributions
 courtyard+0.159
castle+0.156
 classroom+0.150
 studio+0.149
 neighborhood+0.149
Neg logit contributions
text-0.128
-0.107
ifying-0.096
 eight-0.095
 Sn-0.094
Token very
Token ID845
Feature activation+1.39e-02

Pos logit contributions
text+0.031
+0.027
ifying+0.025
 Sn+0.024
 AW+0.023
Neg logit contributions
 courtyard-0.026
castle-0.025
 classroom-0.025
 builder-0.025
 painter-0.024
Token much
Token ID881
Feature activation-5.40e-03

Pos logit contributions
 courtyard+0.030
 studio+0.029
 neighborhood+0.029
castle+0.029
 classroom+0.028
Neg logit contributions
text-0.021
-0.017
ifying-0.015
 Sn-0.015
 AW-0.014
Token.
Token ID13
Feature activation-1.90e-03

Pos logit contributions
text+0.009
+0.008
 Sn+0.007
ifying+0.007
 AW+0.006
Neg logit contributions
 courtyard-0.011
 classroom-0.010
 neighborhood-0.010
castle-0.010
 painter-0.010
Token They
Token ID1119
Feature activation+1.83e-02

Pos logit contributions
text+0.003
+0.003
ifying+0.003
 AW+0.003
 Sn+0.003
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
castle-0.003
 builder-0.003
Token friends
Token ID2460
Feature activation+2.11e-02

Pos logit contributions
text+0.036
+0.031
ifying+0.030
 AW+0.029
 Sn+0.029
Neg logit contributions
 courtyard-0.022
 classroom-0.021
 neighborhood-0.020
castle-0.020
 builder-0.020
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 courtyard+0.037
 grounds+0.036
castle+0.035
 classroom+0.035
 studio+0.035
Neg logit contributions
text-0.040
-0.035
 eight-0.031
 Sn-0.031
ifying-0.031
Token They
Token ID1119
Feature activation+1.32e-02

Pos logit contributions
text+0.022
ifying+0.019
 AW+0.019
+0.019
 rib+0.017
Neg logit contributions
 classroom-0.017
 courtyard-0.017
 neighborhood-0.016
 builder-0.016
 painter-0.016
Token both
Token ID1111
Feature activation+1.88e-02

Pos logit contributions
 courtyard+0.016
 classroom+0.016
 neighborhood+0.016
castle+0.015
 builder+0.015
Neg logit contributions
text-0.032
-0.027
ifying-0.027
 AW-0.026
 Sn-0.026
Token loved
Token ID6151
Feature activation+1.44e-02

Pos logit contributions
 courtyard+0.015
 classroom+0.014
 neighborhood+0.014
 painter+0.014
 artist+0.014
Neg logit contributions
text-0.052
ifying-0.047
 AW-0.047
-0.045
 Sn-0.045
Token each
Token ID1123
Feature activation+7.82e-02

Pos logit contributions
 courtyard+0.022
 classroom+0.021
 neighborhood+0.021
castle+0.021
 builder+0.021
Neg logit contributions
text-0.030
-0.026
ifying-0.025
 AW-0.024
 Sn-0.024
Token other
Token ID584
Feature activation-1.22e-02

Pos logit contributions
 courtyard+0.149
castle+0.143
 classroom+0.142
 neighborhood+0.141
 studio+0.138
Neg logit contributions
text-0.139
-0.115
ifying-0.108
 AW-0.105
 Sn-0.103
Token very
Token ID845
Feature activation+1.55e-02

Pos logit contributions
text+0.025
+0.022
ifying+0.020
 Sn+0.020
 AW+0.019
Neg logit contributions
 courtyard-0.019
castle-0.019
 classroom-0.019
 builder-0.019
 painter-0.018
Token much
Token ID881
Feature activation-5.25e-05

Pos logit contributions
 courtyard+0.033
 studio+0.033
 neighborhood+0.033
castle+0.032
pit+0.032
Neg logit contributions
text-0.024
-0.019
ifying-0.016
 AW-0.016
 Sn-0.016
Token and
Token ID290
Feature activation+3.97e-03

Pos logit contributions
text+0.000
+0.000
 Sn+0.000
 rib+0.000
 Fl+0.000
Neg logit contributions
 courtyard-0.000
 studio-0.000
 classroom-0.000
 painter-0.000
 builder-0.000
Token were
Token ID547
Feature activation+3.07e-02

Pos logit contributions
 courtyard+0.006
 classroom+0.006
 neighborhood+0.006
castle+0.005
 builder+0.005
Neg logit contributions
text-0.008
-0.008
ifying-0.007
 Sn-0.007
 rib-0.007
Token children
Token ID1751
Feature activation+2.20e-02

Pos logit contributions
 courtyard+0.023
castle+0.022
 classroom+0.021
 studio+0.021
 builder+0.021
Neg logit contributions
text-0.022
-0.019
 eight-0.018
 AW-0.017
ifying-0.017
Token suggested
Token ID5220
Feature activation-9.62e-04

Pos logit contributions
 courtyard+0.036
 classroom+0.035
castle+0.035
 neighborhood+0.034
 builder+0.034
Neg logit contributions
text-0.043
-0.038
ifying-0.035
 AW-0.033
 eight-0.033
Token that
Token ID326
Feature activation+2.28e-02

Pos logit contributions
text+0.002
ifying+0.002
+0.002
 AW+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.001
 classroom-0.001
 neighborhood-0.001
 painter-0.001
castle-0.001
Token they
Token ID484
Feature activation+1.28e-02

Pos logit contributions
 courtyard+0.041
 classroom+0.039
 neighborhood+0.039
castle+0.038
 builder+0.037
Neg logit contributions
text-0.042
-0.036
ifying-0.034
 Sn-0.033
 AW-0.033
Token should
Token ID815
Feature activation-7.59e-03

Pos logit contributions
 courtyard+0.019
 classroom+0.018
 neighborhood+0.018
 painter+0.018
 builder+0.018
Neg logit contributions
text-0.028
ifying-0.023
-0.023
 AW-0.022
 Sn-0.022
Token each
Token ID1123
Feature activation+7.80e-02

Pos logit contributions
text+0.016
ifying+0.013
+0.013
 AW+0.013
 Sn+0.012
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.011
castle-0.011
 painter-0.011
Token tell
Token ID1560
Feature activation-6.88e-03

Pos logit contributions
 courtyard+0.124
 classroom+0.121
 neighborhood+0.119
castle+0.117
 builder+0.117
Neg logit contributions
text-0.161
-0.135
ifying-0.134
 AW-0.129
 Sn-0.125
Token a
Token ID257
Feature activation-4.11e-03

Pos logit contributions
text+0.014
ifying+0.012
+0.011
 AW+0.011
 Sn+0.011
Neg logit contributions
 courtyard-0.011
 classroom-0.011
 neighborhood-0.011
 painter-0.011
 studio-0.010
Token lie
Token ID6486
Feature activation+2.67e-02

Pos logit contributions
text+0.010
ifying+0.008
+0.008
 AW+0.008
 Sn+0.008
Neg logit contributions
 classroom-0.005
 courtyard-0.005
 neighborhood-0.005
 painter-0.005
 builder-0.005
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 courtyard+0.042
 classroom+0.041
 neighborhood+0.041
castle+0.040
 builder+0.040
Neg logit contributions
text-0.055
-0.046
ifying-0.046
 AW-0.044
 Sn-0.043
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 neighborhood+0.016
 painter+0.016
 builder+0.016
Neg logit contributions
text-0.026
ifying-0.022
-0.021
 AW-0.021
 Sn-0.020
Token happy
Token ID3772
Feature activation-4.69e-04

Pos logit contributions
 courtyard+0.019
castle+0.018
 neighborhood+0.018
 classroom+0.017
 studio+0.017
Neg logit contributions
text-0.031
-0.028
ifying-0.027
 Sn-0.027
 eight-0.027
Token together
Token ID1978
Feature activation+5.35e-03

Pos logit contributions
text+0.001
+0.001
 Sn+0.001
ifying+0.001
 eight+0.001
Neg logit contributions
 courtyard-0.001
 grounds-0.001
 neighborhood-0.001
castle-0.001
 classroom-0.001
Token.
Token ID13
Feature activation-5.71e-03

Pos logit contributions
 courtyard+0.008
 classroom+0.008
 neighborhood+0.008
castle+0.007
 painter+0.007
Neg logit contributions
text-0.012
ifying-0.010
-0.010
 AW-0.010
 Sn-0.009
Token They
Token ID1119
Feature activation+1.53e-02

Pos logit contributions
text+0.009
+0.008
 Sn+0.008
ifying+0.008
 Fl+0.007
Neg logit contributions
 courtyard-0.011
castle-0.011
 builder-0.011
 artist-0.010
 grounds-0.010
Token loved
Token ID6151
Feature activation+1.82e-02

Pos logit contributions
 courtyard+0.023
 classroom+0.022
 neighborhood+0.021
castle+0.021
 builder+0.021
Neg logit contributions
text-0.033
-0.029
ifying-0.028
 AW-0.027
 Sn-0.027
Token each
Token ID1123
Feature activation+7.76e-02

Pos logit contributions
 courtyard+0.029
 classroom+0.029
 neighborhood+0.028
castle+0.028
 painter+0.028
Neg logit contributions
text-0.037
-0.031
ifying-0.031
 AW-0.030
 Sn-0.029
Token other
Token ID584
Feature activation-1.16e-02

Pos logit contributions
 courtyard+0.142
castle+0.136
 classroom+0.136
 neighborhood+0.135
 builder+0.132
Neg logit contributions
text-0.143
-0.119
ifying-0.114
 AW-0.109
 Sn-0.108
Token very
Token ID845
Feature activation+1.79e-02

Pos logit contributions
text+0.026
ifying+0.022
+0.022
 AW+0.021
 Sn+0.021
Neg logit contributions
 courtyard-0.016
 classroom-0.016
 neighborhood-0.016
castle-0.015
 painter-0.015
Token much
Token ID881
Feature activation+8.94e-04

Pos logit contributions
 courtyard+0.037
 neighborhood+0.036
 classroom+0.035
castle+0.035
 studio+0.035
Neg logit contributions
text-0.029
-0.024
ifying-0.022
 AW-0.021
 Sn-0.021
Token and
Token ID290
Feature activation+5.12e-03

Pos logit contributions
 courtyard+0.002
 classroom+0.002
castle+0.002
 painter+0.002
 neighborhood+0.002
Neg logit contributions
text-0.002
-0.001
 Sn-0.001
ifying-0.001
 AW-0.001
Token wanted
Token ID2227
Feature activation+5.21e-03

Pos logit contributions
 courtyard+0.007
castle+0.007
 builder+0.007
 classroom+0.007
 neighborhood+0.007
Neg logit contributions
text-0.011
-0.010
ifying-0.009
 Sn-0.009
 rib-0.009
Tokenmy
Token ID1820
Feature activation+7.04e-03

Pos logit contributions
 courtyard+0.043
 classroom+0.040
castle+0.040
 studio+0.040
 grounds+0.039
Neg logit contributions
text-0.060
-0.055
 Sn-0.049
ifying-0.048
 eight-0.047
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
 courtyard+0.011
 classroom+0.010
castle+0.010
 neighborhood+0.010
 studio+0.010
Neg logit contributions
text-0.015
-0.013
ifying-0.012
 Sn-0.012
 AW-0.012
Token the
Token ID262
Feature activation-2.61e-03

Pos logit contributions
text+0.033
ifying+0.029
 AW+0.028
+0.028
 tal+0.026
Neg logit contributions
 painter-0.020
 classroom-0.020
 builder-0.019
 courtyard-0.018
 artist-0.018
Token kid
Token ID5141
Feature activation+2.81e-02

Pos logit contributions
text+0.006
ifying+0.005
 AW+0.005
+0.005
 Sn+0.005
Neg logit contributions
 classroom-0.004
 courtyard-0.003
 neighborhood-0.003
 painter-0.003
 builder-0.003
Token loved
Token ID6151
Feature activation+1.53e-02

Pos logit contributions
 courtyard+0.039
castle+0.038
 classroom+0.038
 builder+0.037
 neighborhood+0.036
Neg logit contributions
text-0.062
-0.056
ifying-0.051
 Sn-0.050
 eight-0.049
Token each
Token ID1123
Feature activation+7.72e-02

Pos logit contributions
 courtyard+0.026
 classroom+0.025
 neighborhood+0.025
castle+0.024
 builder+0.024
Neg logit contributions
text-0.030
-0.025
ifying-0.025
 AW-0.024
 Sn-0.023
Token other
Token ID584
Feature activation-1.35e-02

Pos logit contributions
 courtyard+0.138
 classroom+0.133
 neighborhood+0.132
castle+0.131
 builder+0.129
Neg logit contributions
text-0.145
-0.121
ifying-0.118
 AW-0.113
 Sn-0.111
Token very
Token ID845
Feature activation+1.88e-02

Pos logit contributions
text+0.030
ifying+0.026
+0.026
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.019
 classroom-0.018
 neighborhood-0.018
castle-0.018
 builder-0.018
Token much
Token ID881
Feature activation-1.40e-03

Pos logit contributions
 courtyard+0.038
 neighborhood+0.037
 classroom+0.037
castle+0.037
 studio+0.037
Neg logit contributions
text-0.031
-0.025
ifying-0.023
 Sn-0.022
 AW-0.022
Token.
Token ID13
Feature activation-1.71e-03

Pos logit contributions
text+0.003
+0.002
ifying+0.002
 Sn+0.002
 AW+0.002
Neg logit contributions
 courtyard-0.003
 classroom-0.002
castle-0.002
 neighborhood-0.002
 painter-0.002
Token They
Token ID1119
Feature activation+1.90e-02

Pos logit contributions
text+0.003
ifying+0.003
+0.003
 AW+0.003
 Sn+0.002
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
 painter-0.003
 builder-0.003
Token twins
Token ID20345
Feature activation-3.05e-03

Pos logit contributions
 courtyard+0.015
 classroom+0.014
 neighborhood+0.014
castle+0.014
 builder+0.013
Neg logit contributions
text-0.021
-0.018
ifying-0.017
 AW-0.017
 Sn-0.016
Token.
Token ID13
Feature activation+7.94e-03

Pos logit contributions
text+0.006
+0.005
ifying+0.005
 AW+0.004
 Sn+0.004
Neg logit contributions
 courtyard-0.005
 classroom-0.005
 neighborhood-0.005
castle-0.005
 builder-0.005
Token You
Token ID921
Feature activation-3.90e-02

Pos logit contributions
 classroom+0.014
 artist+0.014
 painter+0.014
 neighborhood+0.014
 writer+0.013
Neg logit contributions
text-0.014
 AW-0.012
ifying-0.012
 rib-0.011
 tal-0.011
Token should
Token ID815
Feature activation+3.37e-04

Pos logit contributions
text+0.071
ifying+0.057
 AW+0.057
 Sn+0.054
+0.054
Neg logit contributions
 neighborhood-0.068
 courtyard-0.068
 painter-0.068
 classroom-0.068
 artist-0.067
Token love
Token ID1842
Feature activation+2.60e-02

Pos logit contributions
 courtyard+0.001
 grounds+0.001
 neighborhood+0.001
 builder+0.001
 studio+0.001
Neg logit contributions
text-0.001
-0.001
ifying-0.001
 AW-0.001
 Sn-0.001
Token each
Token ID1123
Feature activation+7.72e-02

Pos logit contributions
 classroom+0.041
 neighborhood+0.040
 painter+0.038
 courtyard+0.038
 artist+0.038
Neg logit contributions
text-0.055
ifying-0.047
 AW-0.047
-0.045
 Sn-0.041
Token other
Token ID584
Feature activation-1.55e-02

Pos logit contributions
 courtyard+0.127
 classroom+0.125
 neighborhood+0.124
 painter+0.122
 builder+0.121
Neg logit contributions
text-0.154
ifying-0.131
-0.128
 AW-0.124
 Sn-0.119
Token."
Token ID526
Feature activation+1.58e-02

Pos logit contributions
text+0.031
ifying+0.026
+0.025
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.026
 classroom-0.025
 neighborhood-0.025
 painter-0.024
 builder-0.024
Token
Token ID198
Feature activation-7.40e-03

Pos logit contributions
 courtyard+0.036
castle+0.035
 artist+0.034
 builder+0.034
 classroom+0.034
Neg logit contributions
text-0.022
-0.019
 eight-0.015
ifying-0.015
 Sn-0.015
Token
Token ID198
Feature activation-3.18e-03

Pos logit contributions
text+0.013
+0.010
ifying+0.010
 AW+0.010
 Sn+0.009
Neg logit contributions
 courtyard-0.014
 classroom-0.014
 neighborhood-0.014
castle-0.014
 builder-0.013
TokenSam
Token ID16305
Feature activation-1.23e-02

Pos logit contributions
text+0.006
+0.005
 eight+0.004
Fl+0.004
ifying+0.004
Neg logit contributions
 courtyard-0.006
 artist-0.006
castle-0.006
 builder-0.006
 classroom-0.006
Token.
Token ID13
Feature activation-1.58e-04

Pos logit contributions
 writer+0.033
ashing+0.032
 studio+0.032
 artist+0.032
 painter+0.032
Neg logit contributions
text-0.020
-0.018
 rib-0.016
 AW-0.015
 lun-0.014
Token She
Token ID1375
Feature activation+9.34e-03

Pos logit contributions
text+0.000
+0.000
 Sn+0.000
ifying+0.000
 rib+0.000
Neg logit contributions
castle-0.000
 courtyard-0.000
 builder-0.000
 painter-0.000
 artist-0.000
Token worked
Token ID3111
Feature activation+1.31e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.017
 neighborhood+0.017
castle+0.017
 studio+0.016
Neg logit contributions
text-0.016
-0.014
 eight-0.013
ifying-0.012
 AW-0.012
Token hard
Token ID1327
Feature activation+1.95e-02

Pos logit contributions
 courtyard+0.031
 painter+0.030
 neighborhood+0.030
 artist+0.029
castle+0.029
Neg logit contributions
text-0.014
-0.013
 eight-0.012
 AW-0.011
 rib-0.010
Token on
Token ID319
Feature activation-6.80e-04

Pos logit contributions
 courtyard+0.040
castle+0.038
 classroom+0.038
 neighborhood+0.038
 painter+0.037
Neg logit contributions
text-0.028
-0.024
 eight-0.022
 AW-0.022
ifying-0.021
Token each
Token ID1123
Feature activation+7.72e-02

Pos logit contributions
text+0.001
+0.001
 eight+0.001
 rib+0.001
 AW+0.001
Neg logit contributions
 courtyard-0.001
castle-0.001
ashing-0.001
 neighborhood-0.001
 artist-0.001
Token of
Token ID286
Feature activation-2.36e-03

Pos logit contributions
 courtyard+0.144
ashing+0.136
 neighborhood+0.136
castle+0.136
 writer+0.132
Neg logit contributions
text-0.121
-0.115
 eight-0.104
 AW-0.088
 ho-0.084
Token her
Token ID607
Feature activation+2.42e-03

Pos logit contributions
text+0.003
 eight+0.003
+0.003
 rib+0.003
 Sn+0.003
Neg logit contributions
ashing-0.005
 courtyard-0.005
castle-0.005
 writer-0.004
 neighborhood-0.004
Token building
Token ID2615
Feature activation+1.00e-02

Pos logit contributions
 courtyard+0.003
 writer+0.003
 artist+0.003
castle+0.003
 studio+0.003
Neg logit contributions
text-0.005
-0.004
 eight-0.004
 AW-0.004
 rib-0.004
Token projects
Token ID4493
Feature activation-5.86e-03

Pos logit contributions
 writer+0.019
 artist+0.018
 reporter+0.018
ashing+0.018
 scientist+0.018
Neg logit contributions
text-0.013
-0.013
 eight-0.012
 AW-0.010
 rib-0.010
Token and
Token ID290
Feature activation+1.32e-03

Pos logit contributions
+0.008
text+0.007
 rib+0.006
 Sn+0.005
ifying+0.005
Neg logit contributions
 courtyard-0.012
 painter-0.011
 swamp-0.011
 neighborhood-0.011
 writer-0.011
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
text+0.035
ifying+0.031
 AW+0.030
+0.029
 Sn+0.028
Neg logit contributions
 courtyard-0.018
 classroom-0.018
 neighborhood-0.018
 painter-0.017
 builder-0.017
Token Sam
Token ID3409
Feature activation+6.83e-03

Pos logit contributions
text+0.016
+0.014
 AW+0.013
 eight+0.012
ifying+0.012
Neg logit contributions
 courtyard-0.022
castle-0.021
 artist-0.021
 builder-0.021
 painter-0.021
Token and
Token ID290
Feature activation-1.93e-02

Pos logit contributions
 courtyard+0.014
castle+0.013
 studio+0.013
 builder+0.013
 classroom+0.013
Neg logit contributions
text-0.011
-0.010
 AW-0.008
 Sn-0.008
 eight-0.007
Token Emily
Token ID17608
Feature activation-8.16e-03

Pos logit contributions
text+0.040
+0.035
 Sn+0.033
 AW+0.032
ifying+0.032
Neg logit contributions
 courtyard-0.031
 neighborhood-0.029
castle-0.029
 grounds-0.028
 classroom-0.028
Token loved
Token ID6151
Feature activation+1.55e-02

Pos logit contributions
text+0.016
+0.014
ifying+0.013
 AW+0.013
 Sn+0.012
Neg logit contributions
 courtyard-0.013
 classroom-0.013
castle-0.013
 builder-0.013
 studio-0.013
Token each
Token ID1123
Feature activation+7.71e-02

Pos logit contributions
 courtyard+0.029
 grounds+0.027
castle+0.027
 neighborhood+0.027
 classroom+0.027
Neg logit contributions
text-0.027
-0.022
 rib-0.021
ifying-0.021
 eight-0.021
Token other
Token ID584
Feature activation-1.30e-02

Pos logit contributions
 courtyard+0.152
castle+0.147
 classroom+0.144
 neighborhood+0.143
 studio+0.141
Neg logit contributions
text-0.132
-0.111
ifying-0.101
 AW-0.098
 eight-0.098
Token very
Token ID845
Feature activation+1.74e-02

Pos logit contributions
text+0.026
+0.023
ifying+0.021
 Sn+0.021
 AW+0.020
Neg logit contributions
 courtyard-0.021
castle-0.021
 classroom-0.020
 builder-0.020
 painter-0.020
Token much
Token ID881
Feature activation-1.33e-03

Pos logit contributions
 studio+0.040
 courtyard+0.040
pit+0.040
 neighborhood+0.039
 classroom+0.039
Neg logit contributions
text-0.024
-0.019
ifying-0.015
 Sn-0.014
 eight-0.014
Token and
Token ID290
Feature activation+9.28e-03

Pos logit contributions
text+0.002
+0.002
 Sn+0.002
ifying+0.002
 rib+0.002
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
castle-0.003
 painter-0.003
Token they
Token ID484
Feature activation+1.66e-02

Pos logit contributions
 courtyard+0.015
 neighborhood+0.014
 studio+0.014
 classroom+0.014
castle+0.014
Neg logit contributions
text-0.018
-0.017
 rib-0.015
 Sn-0.015
ifying-0.014
Token getting
Token ID1972
Feature activation+9.91e-03

Pos logit contributions
 courtyard+0.041
 classroom+0.039
 neighborhood+0.038
castle+0.038
 grounds+0.037
Neg logit contributions
text-0.052
-0.045
ifying-0.044
 Sn-0.042
 AW-0.042
Token married
Token ID6405
Feature activation-8.03e-03

Pos logit contributions
 courtyard+0.015
 classroom+0.015
 neighborhood+0.015
 builder+0.014
 painter+0.014
Neg logit contributions
text-0.021
ifying-0.018
-0.018
 AW-0.017
 Sn-0.016
Token!
Token ID0
Feature activation-7.42e-03

Pos logit contributions
text+0.016
+0.014
ifying+0.014
 AW+0.013
 Sn+0.013
Neg logit contributions
 courtyard-0.013
 classroom-0.013
 neighborhood-0.012
castle-0.012
 builder-0.012
Token They
Token ID1119
Feature activation+1.35e-02

Pos logit contributions
text+0.015
+0.012
ifying+0.012
 AW+0.012
 Sn+0.011
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.012
 painter-0.012
 builder-0.012
Token loved
Token ID6151
Feature activation+1.96e-02

Pos logit contributions
 courtyard+0.019
 classroom+0.019
 neighborhood+0.018
 painter+0.018
 builder+0.018
Neg logit contributions
text-0.030
ifying-0.025
-0.025
 AW-0.025
 Sn-0.024
Token each
Token ID1123
Feature activation+7.71e-02

Pos logit contributions
 classroom+0.028
 neighborhood+0.028
 courtyard+0.028
 painter+0.027
 artist+0.027
Neg logit contributions
text-0.043
ifying-0.038
-0.037
 AW-0.037
 tal-0.033
Token other
Token ID584
Feature activation-1.39e-02

Pos logit contributions
 courtyard+0.122
 classroom+0.120
 neighborhood+0.120
 painter+0.118
 artist+0.115
Neg logit contributions
text-0.157
ifying-0.135
-0.133
 AW-0.132
 Sn-0.122
Token very
Token ID845
Feature activation+1.96e-02

Pos logit contributions
text+0.032
ifying+0.028
 AW+0.027
+0.027
 eight+0.026
Neg logit contributions
 courtyard-0.018
 neighborhood-0.018
 classroom-0.018
 painter-0.017
 artist-0.016
Token much
Token ID881
Feature activation+4.94e-04

Pos logit contributions
 courtyard+0.035
 classroom+0.034
 neighborhood+0.034
castle+0.033
 builder+0.033
Neg logit contributions
text-0.036
-0.030
ifying-0.029
 AW-0.028
 Sn-0.027
Token.
Token ID13
Feature activation-4.11e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 neighborhood+0.001
castle+0.001
 painter+0.001
Neg logit contributions
text-0.001
-0.001
ifying-0.001
 Sn-0.001
 AW-0.001
Token They
Token ID1119
Feature activation+1.67e-02

Pos logit contributions
text+0.008
 AW+0.006
ifying+0.006
+0.006
 eight+0.006
Neg logit contributions
 classroom-0.007
 neighborhood-0.007
 courtyard-0.007
 painter-0.007
 studio-0.007
Token and
Token ID290
Feature activation-2.11e-02

Pos logit contributions
text+0.053
+0.045
ifying+0.044
 AW+0.042
 Sn+0.041
Neg logit contributions
 courtyard-0.041
 classroom-0.040
 neighborhood-0.039
castle-0.038
 builder-0.038
Token sister
Token ID6621
Feature activation-9.68e-03

Pos logit contributions
text+0.044
ifying+0.037
+0.037
 AW+0.036
 Sn+0.034
Neg logit contributions
 courtyard-0.032
 classroom-0.032
 neighborhood-0.031
 painter-0.031
 builder-0.031
Token.
Token ID13
Feature activation-8.55e-03

Pos logit contributions
text+0.020
ifying+0.017
+0.017
 AW+0.016
 Sn+0.016
Neg logit contributions
 courtyard-0.015
 classroom-0.015
 neighborhood-0.014
 painter-0.014
castle-0.014
Token They
Token ID1119
Feature activation+1.56e-02

Pos logit contributions
text+0.012
+0.011
ifying+0.010
 eight+0.008
 Sn+0.008
Neg logit contributions
 courtyard-0.018
castle-0.018
 builder-0.018
 writer-0.017
 grounds-0.017
Token loved
Token ID6151
Feature activation+1.86e-02

Pos logit contributions
 courtyard+0.023
 classroom+0.022
castle+0.022
 neighborhood+0.022
 studio+0.022
Neg logit contributions
text-0.034
-0.031
ifying-0.029
 Sn-0.027
 AW-0.027
Token each
Token ID1123
Feature activation+7.70e-02

Pos logit contributions
 courtyard+0.030
 classroom+0.029
 neighborhood+0.029
castle+0.028
 builder+0.028
Neg logit contributions
text-0.037
-0.032
ifying-0.031
 AW-0.030
 Sn-0.029
Token other
Token ID584
Feature activation-1.40e-02

Pos logit contributions
 courtyard+0.131
 classroom+0.128
 neighborhood+0.127
 painter+0.124
 builder+0.124
Neg logit contributions
text-0.150
ifying-0.125
-0.124
 AW-0.120
 Sn-0.115
Token very
Token ID845
Feature activation+1.92e-02

Pos logit contributions
text+0.030
+0.025
ifying+0.025
 AW+0.024
 Sn+0.023
Neg logit contributions
 courtyard-0.021
 classroom-0.021
 neighborhood-0.020
castle-0.020
 builder-0.020
Token much
Token ID881
Feature activation-9.02e-04

Pos logit contributions
 courtyard+0.038
 neighborhood+0.037
 classroom+0.037
castle+0.036
 builder+0.036
Neg logit contributions
text-0.033
-0.027
ifying-0.025
 AW-0.024
 Sn-0.024
Token,
Token ID11
Feature activation+2.67e-02

Pos logit contributions
text+0.002
+0.001
ifying+0.001
 Sn+0.001
 AW+0.001
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
castle-0.001
 painter-0.001
Token but
Token ID475
Feature activation+9.18e-03

Pos logit contributions
 courtyard+0.036
 classroom+0.034
 neighborhood+0.034
 builder+0.033
castle+0.033
Neg logit contributions
text-0.062
-0.053
ifying-0.052
 AW-0.050
 Sn-0.050
Token disagreed
Token ID27884
Feature activation+1.42e-02

Pos logit contributions
text+0.053
+0.045
ifying+0.044
 AW+0.043
 Sn+0.042
Neg logit contributions
 courtyard-0.042
 classroom-0.040
 neighborhood-0.040
castle-0.039
 builder-0.039
Token on
Token ID319
Feature activation+9.04e-04

Pos logit contributions
 courtyard+0.019
 neighborhood+0.018
 classroom+0.017
 painter+0.017
castle+0.017
Neg logit contributions
text-0.034
-0.030
ifying-0.029
 Sn-0.028
 eight-0.027
Token what
Token ID644
Feature activation-1.48e-02

Pos logit contributions
 classroom+0.001
 courtyard+0.001
 neighborhood+0.001
castle+0.001
 painter+0.001
Neg logit contributions
text-0.002
ifying-0.002
-0.001
 AW-0.001
 Sn-0.001
Token to
Token ID284
Feature activation+2.32e-02

Pos logit contributions
text+0.029
+0.024
ifying+0.023
 AW+0.023
 Sn+0.022
Neg logit contributions
 courtyard-0.025
 classroom-0.024
 neighborhood-0.024
castle-0.024
 builder-0.023
Token do
Token ID466
Feature activation+1.88e-02

Pos logit contributions
 courtyard+0.043
 studio+0.041
 builder+0.040
 neighborhood+0.040
ashing+0.040
Neg logit contributions
text-0.041
-0.036
 rib-0.030
 AW-0.030
 Sn-0.030
Token each
Token ID1123
Feature activation+7.70e-02

Pos logit contributions
 courtyard+0.032
 classroom+0.030
 neighborhood+0.030
 painter+0.030
 builder+0.030
Neg logit contributions
text-0.037
-0.032
ifying-0.030
 AW-0.029
 Sn-0.029
Token day
Token ID1110
Feature activation+1.32e-03

Pos logit contributions
 courtyard+0.146
 neighborhood+0.137
castle+0.135
 classroom+0.135
 grounds+0.133
Neg logit contributions
text-0.136
-0.117
 AW-0.105
 eight-0.104
ifying-0.104
Token.
Token ID13
Feature activation-2.89e-03

Pos logit contributions
 courtyard+0.002
 classroom+0.002
 neighborhood+0.002
castle+0.002
 painter+0.002
Neg logit contributions
text-0.003
ifying-0.002
-0.002
 AW-0.002
 Sn-0.002
Token One
Token ID1881
Feature activation+1.46e-02

Pos logit contributions
text+0.004
+0.003
 AW+0.003
 eight+0.003
ifying+0.003
Neg logit contributions
 courtyard-0.006
castle-0.006
 artist-0.006
 builder-0.006
 painter-0.006
Token day
Token ID1110
Feature activation+6.51e-03

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 neighborhood+0.016
castle+0.016
 builder+0.016
Neg logit contributions
text-0.036
-0.032
ifying-0.031
 AW-0.030
 eight-0.029
Token Mark
Token ID2940
Feature activation+1.18e-02

Pos logit contributions
 courtyard+0.011
 neighborhood+0.010
 classroom+0.010
castle+0.010
 painter+0.010
Neg logit contributions
text-0.013
-0.011
ifying-0.011
 AW-0.010
 eight-0.010
Token handsome
Token ID22665
Feature activation-1.23e-02

Pos logit contributions
text+0.074
+0.065
ifying+0.062
 Sn+0.061
 eight+0.059
Neg logit contributions
 courtyard-0.073
castle-0.070
 grounds-0.069
 neighborhood-0.068
 classroom-0.068
Token boy
Token ID2933
Feature activation+2.72e-02

Pos logit contributions
text+0.027
ifying+0.023
+0.023
 Sn+0.022
 AW+0.022
Neg logit contributions
 courtyard-0.017
 neighborhood-0.016
 classroom-0.016
castle-0.016
 grounds-0.016
Token.
Token ID13
Feature activation-5.11e-03

Pos logit contributions
 courtyard+0.046
castle+0.044
 classroom+0.043
 neighborhood+0.043
 grounds+0.043
Neg logit contributions
text-0.053
-0.046
ifying-0.044
 Sn-0.042
 AW-0.041
Token They
Token ID1119
Feature activation+1.58e-02

Pos logit contributions
text+0.009
+0.007
 Sn+0.007
 Fl+0.007
ifying+0.006
Neg logit contributions
castle-0.010
 courtyard-0.010
 artist-0.009
 builder-0.009
 grounds-0.009
Token loved
Token ID6151
Feature activation+1.89e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 neighborhood+0.016
 painter+0.015
 builder+0.015
Neg logit contributions
text-0.041
ifying-0.036
-0.035
 AW-0.035
 Sn-0.034
Token each
Token ID1123
Feature activation+7.70e-02

Pos logit contributions
 courtyard+0.034
castle+0.032
 classroom+0.032
 neighborhood+0.032
 builder+0.031
Neg logit contributions
text-0.035
-0.029
ifying-0.028
 Sn-0.027
 rib-0.027
Token other
Token ID584
Feature activation-1.37e-02

Pos logit contributions
 courtyard+0.143
castle+0.139
 classroom+0.137
 neighborhood+0.136
 builder+0.133
Neg logit contributions
text-0.140
-0.116
ifying-0.110
 AW-0.106
 Sn-0.105
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
text+0.029
+0.025
ifying+0.024
 AW+0.023
 Sn+0.023
Neg logit contributions
 courtyard-0.021
 classroom-0.020
castle-0.020
 builder-0.020
 neighborhood-0.020
Token so
Token ID523
Feature activation+1.20e-02

Pos logit contributions
 courtyard+0.038
 neighborhood+0.036
 classroom+0.036
 builder+0.035
castle+0.035
Neg logit contributions
text-0.063
-0.054
ifying-0.052
 Sn-0.051
 AW-0.051
Token they
Token ID484
Feature activation+1.39e-02

Pos logit contributions
 classroom+0.014
 courtyard+0.013
 neighborhood+0.013
 painter+0.013
castle+0.012
Neg logit contributions
text-0.030
 AW-0.026
ifying-0.026
-0.025
 GO-0.024
Token got
Token ID1392
Feature activation+2.00e-02

Pos logit contributions
 courtyard+0.019
 classroom+0.019
 neighborhood+0.018
 builder+0.018
 painter+0.018
Neg logit contributions
text-0.031
ifying-0.027
-0.026
 AW-0.026
 Sn-0.025
Token When
Token ID1649
Feature activation-1.67e-02

Pos logit contributions
 classroom+0.014
 neighborhood+0.014
 courtyard+0.014
 painter+0.013
 builder+0.013
Neg logit contributions
text-0.024
 AW-0.020
ifying-0.020
 rib-0.019
 tal-0.019
Token Daisy
Token ID40355
Feature activation+2.11e-03

Pos logit contributions
text+0.041
ifying+0.035
 AW+0.034
+0.033
 tal+0.032
Neg logit contributions
 classroom-0.021
 courtyard-0.021
castle-0.019
 builder-0.019
 studio-0.019
Token put
Token ID1234
Feature activation+7.99e-03

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 neighborhood+0.003
 painter+0.003
 grounds+0.003
Neg logit contributions
text-0.004
ifying-0.004
 AW-0.004
 Sn-0.003
 rib-0.003
Token the
Token ID262
Feature activation+3.13e-02

Pos logit contributions
 courtyard+0.013
 classroom+0.013
 studio+0.013
 neighborhood+0.012
castle+0.012
Neg logit contributions
text-0.016
ifying-0.013
-0.013
 AW-0.013
 eight-0.012
Token magn
Token ID7842
Feature activation-5.19e-02

Pos logit contributions
 courtyard+0.047
 classroom+0.047
 painter+0.046
 neighborhood+0.044
 builder+0.044
Neg logit contributions
text-0.072
-0.057
 AW-0.057
ifying-0.056
 Sn-0.054
Tokenifying
Token ID4035
Feature activation-1.33e-01

Pos logit contributions
text+0.051
 AW+0.030
 Sn+0.029
Fl+0.026
 tal+0.024
Neg logit contributions
 courtyard-0.136
 builder-0.132
castle-0.132
 grounds-0.131
 studio-0.130
Token glass
Token ID5405
Feature activation-5.73e-03

Pos logit contributions
text+0.262
 AW+0.209
+0.206
 Sn+0.198
 rib+0.193
Neg logit contributions
 courtyard-0.226
 classroom-0.222
castle-0.214
 studio-0.209
 neighborhood-0.208
Token down
Token ID866
Feature activation+1.03e-02

Pos logit contributions
text+0.012
+0.010
ifying+0.010
 AW+0.010
 Sn+0.009
Neg logit contributions
 courtyard-0.009
 classroom-0.009
 neighborhood-0.009
castle-0.008
 builder-0.008
Token,
Token ID11
Feature activation+2.31e-02

Pos logit contributions
 courtyard+0.015
 classroom+0.014
 neighborhood+0.014
castle+0.014
 builder+0.014
Neg logit contributions
text-0.023
ifying-0.019
-0.019
 AW-0.019
 Sn-0.018
Token the
Token ID262
Feature activation-4.36e-03

Pos logit contributions
 courtyard+0.022
 classroom+0.021
 neighborhood+0.020
 painter+0.019
 builder+0.019
Neg logit contributions
text-0.063
ifying-0.055
-0.053
 AW-0.053
 Sn-0.052
Token cater
Token ID20825
Feature activation-4.01e-02

Pos logit contributions
text+0.010
+0.008
ifying+0.008
 AW+0.008
 Sn+0.008
Neg logit contributions
 courtyard-0.006
 classroom-0.006
 neighborhood-0.006
 painter-0.006
 builder-0.006
Token the
Token ID262
Feature activation+8.67e-03

Pos logit contributions
 classroom+0.002
 courtyard+0.002
 painter+0.002
 builder+0.002
 neighborhood+0.002
Neg logit contributions
text-0.005
 AW-0.005
ifying-0.004
-0.004
 tal-0.004
Token necklace
Token ID38987
Feature activation-1.07e-02

Pos logit contributions
 courtyard+0.011
 classroom+0.010
 neighborhood+0.010
 painter+0.010
castle+0.009
Neg logit contributions
text-0.022
ifying-0.019
-0.019
 AW-0.018
 Sn-0.018
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
text+0.020
+0.017
ifying+0.017
 Sn+0.016
 AW+0.016
Neg logit contributions
 courtyard-0.019
 classroom-0.018
 neighborhood-0.018
 painter-0.018
 builder-0.017
Token the
Token ID262
Feature activation-9.40e-03

Pos logit contributions
text+0.033
ifying+0.028
 AW+0.027
+0.027
 Sn+0.025
Neg logit contributions
 courtyard-0.027
 classroom-0.026
 neighborhood-0.025
castle-0.025
 grounds-0.024
Token magn
Token ID7842
Feature activation-6.10e-02

Pos logit contributions
text+0.022
ifying+0.019
+0.018
 AW+0.018
 Sn+0.017
Neg logit contributions
 courtyard-0.013
 classroom-0.013
 painter-0.013
 neighborhood-0.013
castle-0.012
Tokenifying
Token ID4035
Feature activation-1.32e-01

Pos logit contributions
text+0.065
 AW+0.046
+0.045
 Sn+0.043
Fl+0.039
Neg logit contributions
 courtyard-0.149
utter-0.146
castle-0.144
uttering-0.143
 grounds-0.142
Token glass
Token ID5405
Feature activation-1.67e-02

Pos logit contributions
text+0.252
+0.207
 AW+0.204
ifying+0.191
 Sn+0.189
Neg logit contributions
 courtyard-0.230
 classroom-0.225
castle-0.214
 neighborhood-0.213
 studio-0.208
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
text+0.036
+0.030
ifying+0.030
 AW+0.029
 Sn+0.028
Neg logit contributions
 courtyard-0.025
 classroom-0.024
castle-0.024
 neighborhood-0.024
 builder-0.023
Token started
Token ID2067
Feature activation+1.13e-02

Pos logit contributions
text+0.028
ifying+0.024
+0.023
 AW+0.023
 Sn+0.022
Neg logit contributions
 courtyard-0.023
 classroom-0.022
 neighborhood-0.022
castle-0.021
 grounds-0.021
Token to
Token ID284
Feature activation-6.93e-03

Pos logit contributions
 courtyard+0.016
 classroom+0.016
 neighborhood+0.016
 painter+0.015
castle+0.015
Neg logit contributions
text-0.026
ifying-0.022
-0.021
 AW-0.021
 Sn-0.019
Token examine
Token ID10716
Feature activation+1.63e-02

Pos logit contributions
text+0.015
+0.013
ifying+0.012
 AW+0.012
 Sn+0.012
Neg logit contributions
 courtyard-0.010
 neighborhood-0.010
 classroom-0.010
 builder-0.010
castle-0.010
Token hat
Token ID6877
Feature activation-2.71e-02

Pos logit contributions
text+0.010
+0.008
ifying+0.008
 AW+0.008
 eight+0.007
Neg logit contributions
 classroom-0.006
 studio-0.006
 courtyard-0.006
 neighborhood-0.006
 painter-0.006
Token and
Token ID290
Feature activation-3.14e-02

Pos logit contributions
text+0.054
+0.046
ifying+0.045
 AW+0.042
 Sn+0.042
Neg logit contributions
 courtyard-0.044
 classroom-0.043
 neighborhood-0.043
 builder-0.042
 painter-0.042
Token grabbed
Token ID13176
Feature activation+5.78e-03

Pos logit contributions
text+0.066
ifying+0.058
 AW+0.056
+0.055
 Sn+0.051
Neg logit contributions
 courtyard-0.048
 classroom-0.048
 painter-0.045
castle-0.045
 studio-0.045
Token her
Token ID607
Feature activation-1.71e-02

Pos logit contributions
 artist+0.006
 classroom+0.006
 painter+0.006
 courtyard+0.006
 builder+0.006
Neg logit contributions
text-0.015
 AW-0.012
ifying-0.012
-0.012
haw-0.011
Token magn
Token ID7842
Feature activation-6.71e-02

Pos logit contributions
text+0.040
 AW+0.033
+0.032
ifying+0.031
 GO+0.030
Neg logit contributions
 classroom-0.024
 studio-0.023
 neighborhood-0.023
 painter-0.023
 courtyard-0.022
Tokenifying
Token ID4035
Feature activation-1.29e-01

Pos logit contributions
text+0.068
 AW+0.050
haw+0.042
heit+0.040
+0.039
Neg logit contributions
utter-0.167
 courtyard-0.163
uttering-0.161
pit-0.161
castle-0.160
Token glass
Token ID5405
Feature activation-2.67e-02

Pos logit contributions
text+0.248
+0.205
 AW+0.203
ifying+0.195
 Sn+0.185
Neg logit contributions
 classroom-0.223
 courtyard-0.223
 studio-0.211
 neighborhood-0.208
castle-0.205
Token.
Token ID13
Feature activation-1.02e-03

Pos logit contributions
text+0.058
+0.049
 AW+0.049
ifying+0.049
 Sn+0.046
Neg logit contributions
 courtyard-0.039
 classroom-0.038
castle-0.037
 neighborhood-0.037
 builder-0.037
Token She
Token ID1375
Feature activation-5.27e-03

Pos logit contributions
text+0.002
ifying+0.002
+0.002
 AW+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.001
 classroom-0.001
 neighborhood-0.001
 painter-0.001
 builder-0.001
Token was
Token ID373
Feature activation+1.23e-02

Pos logit contributions
text+0.011
ifying+0.010
 AW+0.009
+0.009
 Sn+0.009
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.008
 painter-0.008
Token ready
Token ID3492
Feature activation-1.71e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 neighborhood+0.016
castle+0.015
 builder+0.015
Neg logit contributions
text-0.028
-0.024
ifying-0.024
 AW-0.023
 Sn-0.023
Token He
Token ID679
Feature activation-6.33e-03

Pos logit contributions
 courtyard+0.035
castle+0.035
 builder+0.034
 artist+0.034
 neighborhood+0.033
Neg logit contributions
text-0.027
-0.023
ifying-0.020
 eight-0.020
 AW-0.019
Token took
Token ID1718
Feature activation-1.32e-02

Pos logit contributions
text+0.013
ifying+0.011
+0.011
 AW+0.010
 Sn+0.010
Neg logit contributions
 courtyard-0.010
 classroom-0.010
 neighborhood-0.010
 builder-0.010
castle-0.010
Token out
Token ID503
Feature activation+6.91e-03

Pos logit contributions
text+0.025
ifying+0.022
+0.021
 AW+0.020
 Sn+0.020
Neg logit contributions
 courtyard-0.023
 classroom-0.022
 builder-0.022
 neighborhood-0.021
 painter-0.021
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
 courtyard+0.012
 classroom+0.012
 builder+0.012
 neighborhood+0.011
 painter+0.011
Neg logit contributions
text-0.013
 AW-0.011
ifying-0.011
-0.011
 Sn-0.010
Token magn
Token ID7842
Feature activation-5.42e-02

Pos logit contributions
text+0.026
+0.022
ifying+0.022
 AW+0.021
 eight+0.020
Neg logit contributions
 courtyard-0.021
 neighborhood-0.020
 classroom-0.020
castle-0.020
 builder-0.019
Tokenifying
Token ID4035
Feature activation-1.26e-01

Pos logit contributions
text+0.042
 AW+0.022
+0.021
 Sn+0.019
Fl+0.018
Neg logit contributions
 courtyard-0.155
 builder-0.151
castle-0.150
 studio-0.150
pit-0.149
Token glass
Token ID5405
Feature activation-1.56e-02

Pos logit contributions
text+0.233
+0.189
 AW+0.183
ifying+0.181
 Sn+0.173
Neg logit contributions
 courtyard-0.231
 classroom-0.228
 neighborhood-0.220
castle-0.217
 studio-0.216
Token and
Token ID290
Feature activation-9.02e-03

Pos logit contributions
text+0.031
+0.026
ifying+0.026
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.025
 classroom-0.025
 neighborhood-0.025
 painter-0.024
 builder-0.024
Token a
Token ID257
Feature activation-3.22e-02

Pos logit contributions
text+0.017
+0.015
ifying+0.014
 rib+0.013
 AW+0.013
Neg logit contributions
 courtyard-0.015
 neighborhood-0.015
 classroom-0.015
castle-0.015
 painter-0.015
Token notebook
Token ID20922
Feature activation-3.09e-02

Pos logit contributions
text+0.063
+0.055
ifying+0.054
 AW+0.051
 eight+0.050
Neg logit contributions
 courtyard-0.052
 neighborhood-0.050
 classroom-0.050
castle-0.049
 builder-0.048
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
text+0.060
+0.052
ifying+0.050
 AW+0.048
 Sn+0.047
Neg logit contributions
 courtyard-0.050
 neighborhood-0.049
 classroom-0.049
 painter-0.048
 artist-0.046
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 courtyard+0.055
 classroom+0.053
 neighborhood+0.053
castle+0.052
 builder+0.052
Neg logit contributions
text-0.044
-0.035
ifying-0.034
 AW-0.032
 Sn-0.032
Token watch
Token ID2342
Feature activation-1.35e-02

Pos logit contributions
 courtyard+0.020
castle+0.019
 neighborhood+0.019
 builder+0.019
 classroom+0.019
Neg logit contributions
text-0.028
-0.025
ifying-0.025
 AW-0.023
 rib-0.023
Token with
Token ID351
Feature activation-3.31e-03

Pos logit contributions
text+0.028
+0.023
ifying+0.023
 AW+0.023
 Sn+0.022
Neg logit contributions
 courtyard-0.021
 classroom-0.021
 neighborhood-0.020
castle-0.020
 builder-0.020
Token the
Token ID262
Feature activation+9.94e-03

Pos logit contributions
text+0.005
+0.004
ifying+0.004
 AW+0.004
 Sn+0.004
Neg logit contributions
 courtyard-0.007
 neighborhood-0.007
 classroom-0.007
castle-0.007
 painter-0.007
Token magn
Token ID7842
Feature activation-5.31e-02

Pos logit contributions
 courtyard+0.015
 neighborhood+0.014
 classroom+0.014
castle+0.014
 builder+0.014
Neg logit contributions
text-0.020
-0.018
ifying-0.018
 AW-0.017
 eight-0.017
Tokenifying
Token ID4035
Feature activation-1.25e-01

Pos logit contributions
text+0.045
 AW+0.023
 Sn+0.021
+0.019
 tal+0.017
Neg logit contributions
 courtyard-0.149
castle-0.146
 builder-0.145
 studio-0.145
 grounds-0.144
Token glass
Token ID5405
Feature activation-1.38e-02

Pos logit contributions
text+0.235
+0.189
 AW+0.188
ifying+0.179
 Sn+0.175
Neg logit contributions
 courtyard-0.222
 classroom-0.220
 studio-0.209
 neighborhood-0.208
castle-0.205
Token.
Token ID13
Feature activation+3.57e-03

Pos logit contributions
text+0.028
+0.023
ifying+0.023
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.022
 classroom-0.022
 neighborhood-0.022
 painter-0.021
 builder-0.021
Token He
Token ID679
Feature activation+1.34e-03

Pos logit contributions
 courtyard+0.007
 classroom+0.006
 neighborhood+0.006
 artist+0.006
castle+0.006
Neg logit contributions
text-0.006
-0.005
ifying-0.005
 Sn-0.005
 eight-0.005
Token saw
Token ID2497
Feature activation+1.60e-02

Pos logit contributions
 courtyard+0.002
 classroom+0.002
castle+0.002
 builder+0.002
 artist+0.002
Neg logit contributions
text-0.003
-0.003
ifying-0.002
 eight-0.002
 AW-0.002
Token the
Token ID262
Feature activation+6.60e-03

Pos logit contributions
 courtyard+0.035
 neighborhood+0.035
 classroom+0.035
 studio+0.034
castle+0.034
Neg logit contributions
text-0.022
-0.018
ifying-0.017
 eight-0.016
 Sn-0.016
Token She
Token ID1375
Feature activation-4.96e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 neighborhood+0.001
 painter+0.001
 builder+0.001
Neg logit contributions
text-0.002
ifying-0.001
-0.001
 AW-0.001
 Sn-0.001
Token takes
Token ID2753
Feature activation-1.97e-02

Pos logit contributions
text+0.009
ifying+0.007
 Sn+0.007
 AW+0.007
+0.007
Neg logit contributions
 courtyard-0.009
 classroom-0.009
 neighborhood-0.009
 builder-0.009
 painter-0.009
Token out
Token ID503
Feature activation+5.33e-03

Pos logit contributions
text+0.045
ifying+0.042
 AW+0.039
+0.036
Fl+0.036
Neg logit contributions
 builder-0.025
 classroom-0.024
 painter-0.024
 courtyard-0.024
 artist-0.024
Token a
Token ID257
Feature activation-9.87e-03

Pos logit contributions
 classroom+0.009
 courtyard+0.009
 builder+0.009
 painter+0.009
 studio+0.009
Neg logit contributions
text-0.010
ifying-0.008
 AW-0.008
-0.008
 Sn-0.007
Token magn
Token ID7842
Feature activation-5.34e-02

Pos logit contributions
text+0.018
+0.016
ifying+0.016
 AW+0.015
 eight+0.015
Neg logit contributions
 courtyard-0.016
 neighborhood-0.016
castle-0.016
 classroom-0.016
 artist-0.016
Tokenifying
Token ID4035
Feature activation-1.22e-01

Pos logit contributions
text+0.052
Fl+0.030
 Sn+0.030
 AW+0.029
 Fl+0.026
Neg logit contributions
 builder-0.137
 courtyard-0.136
utter-0.135
castle-0.134
uttering-0.132
Token glass
Token ID5405
Feature activation-1.08e-02

Pos logit contributions
text+0.229
+0.182
 AW+0.178
ifying+0.172
 Sn+0.170
Neg logit contributions
 courtyard-0.221
 classroom-0.218
castle-0.209
 neighborhood-0.208
 studio-0.206
Token.
Token ID13
Feature activation+3.24e-03

Pos logit contributions
text+0.021
+0.018
ifying+0.017
 AW+0.017
 Sn+0.016
Neg logit contributions
 courtyard-0.018
 classroom-0.018
 neighborhood-0.018
 painter-0.017
castle-0.017
Token She
Token ID1375
Feature activation+1.77e-03

Pos logit contributions
 courtyard+0.005
 classroom+0.005
 neighborhood+0.005
 builder+0.005
 painter+0.005
Neg logit contributions
text-0.007
ifying-0.006
-0.006
 AW-0.005
 Sn-0.005
Token looks
Token ID3073
Feature activation+3.07e-02

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 neighborhood+0.003
 builder+0.003
 painter+0.003
Neg logit contributions
text-0.003
ifying-0.003
-0.002
 AW-0.002
 Sn-0.002
Token at
Token ID379
Feature activation+3.13e-02

Pos logit contributions
 neighborhood+0.051
 courtyard+0.051
castle+0.050
 classroom+0.050
 painter+0.048
Neg logit contributions
text-0.058
-0.053
ifying-0.050
 AW-0.048
 rib-0.048
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
text+0.054
+0.046
ifying+0.043
 AW+0.042
 Sn+0.041
Neg logit contributions
 courtyard-0.047
 classroom-0.045
 neighborhood-0.044
 builder-0.044
castle-0.044
Token handed
Token ID10158
Feature activation-1.30e-02

Pos logit contributions
text+0.043
ifying+0.037
+0.035
 AW+0.035
 Sn+0.033
Neg logit contributions
 courtyard-0.035
 classroom-0.034
 builder-0.033
 painter-0.033
castle-0.033
Token him
Token ID683
Feature activation+5.08e-03

Pos logit contributions
text+0.029
 AW+0.025
ifying+0.025
 tal+0.023
+0.023
Neg logit contributions
 classroom-0.018
 courtyard-0.018
 painter-0.017
 builder-0.017
 artist-0.017
Token the
Token ID262
Feature activation+1.36e-02

Pos logit contributions
 courtyard+0.008
 classroom+0.008
 builder+0.008
 painter+0.008
 neighborhood+0.008
Neg logit contributions
text-0.010
 AW-0.008
ifying-0.008
-0.007
 Sn-0.007
Token magn
Token ID7842
Feature activation-5.06e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 painter+0.016
 artist+0.016
 neighborhood+0.016
Neg logit contributions
text-0.033
ifying-0.028
 Sn-0.027
-0.026
 AW-0.026
Tokenifying
Token ID4035
Feature activation-1.21e-01

Pos logit contributions
text+0.055
 AW+0.036
 Sn+0.036
Fl+0.033
haw+0.032
Neg logit contributions
 courtyard-0.125
castle-0.124
 artist-0.122
utter-0.122
 builder-0.121
Token glass
Token ID5405
Feature activation-1.40e-02

Pos logit contributions
text+0.239
 AW+0.196
+0.193
ifying+0.181
 Sn+0.179
Neg logit contributions
 classroom-0.206
 courtyard-0.205
 neighborhood-0.194
 studio-0.193
 painter-0.190
Token.
Token ID13
Feature activation+2.05e-03

Pos logit contributions
text+0.030
ifying+0.025
+0.025
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.021
 classroom-0.020
castle-0.020
 builder-0.020
 neighborhood-0.020
Token
Token ID198
Feature activation-6.98e-03

Pos logit contributions
 courtyard+0.004
castle+0.004
 neighborhood+0.004
 classroom+0.004
 builder+0.004
Neg logit contributions
text-0.003
-0.003
ifying-0.003
 Sn-0.003
 AW-0.002
Token
Token ID198
Feature activation+7.63e-04

Pos logit contributions
text+0.014
ifying+0.013
 AW+0.012
+0.011
 rib+0.011
Neg logit contributions
 neighborhood-0.011
 courtyard-0.011
 classroom-0.011
 painter-0.010
 grounds-0.010
TokenHe
Token ID1544
Feature activation-7.44e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 neighborhood+0.001
castle+0.001
 builder+0.001
Neg logit contributions
text-0.001
-0.001
ifying-0.001
 AW-0.001
 Sn-0.001
Token,
Token ID11
Feature activation+8.23e-03

Pos logit contributions
text+0.040
+0.035
ifying+0.033
 AW+0.031
 Sn+0.031
Neg logit contributions
 courtyard-0.035
 neighborhood-0.034
 classroom-0.034
 painter-0.034
 builder-0.033
Token next
Token ID1306
Feature activation-5.25e-03

Pos logit contributions
 courtyard+0.013
castle+0.013
 builder+0.013
 classroom+0.013
 neighborhood+0.013
Neg logit contributions
text-0.016
-0.014
ifying-0.014
 Sn-0.014
 rib-0.013
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
text+0.011
+0.009
ifying+0.009
 AW+0.009
 Sn+0.008
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.008
 painter-0.008
Token her
Token ID607
Feature activation-1.72e-02

Pos logit contributions
text+0.035
+0.030
ifying+0.029
 AW+0.028
 Sn+0.027
Neg logit contributions
 courtyard-0.029
 classroom-0.028
 neighborhood-0.028
castle-0.028
 builder-0.027
Token magn
Token ID7842
Feature activation-5.51e-02

Pos logit contributions
text+0.036
+0.030
ifying+0.030
 AW+0.028
 Sn+0.028
Neg logit contributions
 courtyard-0.027
 classroom-0.026
 neighborhood-0.026
castle-0.025
 builder-0.025
Tokenifying
Token ID4035
Feature activation-1.21e-01

Pos logit contributions
text+0.047
 AW+0.031
+0.024
Fl+0.022
 Sn+0.022
Neg logit contributions
 courtyard-0.150
 studio-0.147
utter-0.146
pit-0.145
castle-0.145
Token glass
Token ID5405
Feature activation-1.33e-02

Pos logit contributions
text+0.230
 AW+0.190
+0.187
ifying+0.176
 rib+0.170
Neg logit contributions
 classroom-0.218
 courtyard-0.214
 neighborhood-0.205
 studio-0.204
castle-0.199
Token.
Token ID13
Feature activation+2.73e-03

Pos logit contributions
text+0.027
+0.023
ifying+0.023
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.021
 classroom-0.021
 neighborhood-0.021
castle-0.020
 builder-0.020
Token She
Token ID1375
Feature activation-6.21e-05

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 neighborhood+0.003
 studio+0.003
 painter+0.003
Neg logit contributions
text-0.007
ifying-0.006
 AW-0.006
-0.006
 Sn-0.006
Token looked
Token ID3114
Feature activation+2.59e-02

Pos logit contributions
text+0.000
ifying+0.000
+0.000
 AW+0.000
 Sn+0.000
Neg logit contributions
 courtyard-0.000
 classroom-0.000
 neighborhood-0.000
 builder-0.000
castle-0.000
Token at
Token ID379
Feature activation+2.85e-02

Pos logit contributions
 courtyard+0.041
castle+0.041
 classroom+0.040
 neighborhood+0.040
 builder+0.040
Neg logit contributions
text-0.051
-0.046
ifying-0.045
 rib-0.042
 AW-0.042
Token the
Token ID262
Feature activation+2.05e-02

Pos logit contributions
 courtyard+0.026
 classroom+0.025
 neighborhood+0.025
 builder+0.025
castle+0.025
Neg logit contributions
text-0.029
ifying-0.024
-0.023
 AW-0.023
 Sn-0.022
Token necklace
Token ID38987
Feature activation-3.24e-03

Pos logit contributions
 classroom+0.029
 courtyard+0.029
 neighborhood+0.029
 painter+0.028
 artist+0.028
Neg logit contributions
text-0.048
ifying-0.041
-0.038
 Sn-0.038
 AW-0.038
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
text+0.006
+0.005
ifying+0.005
 AW+0.005
 Sn+0.004
Neg logit contributions
 courtyard-0.006
 neighborhood-0.006
 classroom-0.006
 studio-0.005
 builder-0.005
Token the
Token ID262
Feature activation-3.65e-03

Pos logit contributions
text+0.025
+0.021
ifying+0.020
 Sn+0.020
 AW+0.019
Neg logit contributions
 courtyard-0.024
 neighborhood-0.023
 classroom-0.023
castle-0.023
 builder-0.022
Token magn
Token ID7842
Feature activation-5.71e-02

Pos logit contributions
text+0.008
ifying+0.007
 AW+0.007
+0.007
 Sn+0.007
Neg logit contributions
 classroom-0.005
 courtyard-0.005
 painter-0.005
 neighborhood-0.005
 artist-0.005
Tokenifying
Token ID4035
Feature activation-1.21e-01

Pos logit contributions
text+0.060
 AW+0.040
+0.039
 Sn+0.039
Fl+0.037
Neg logit contributions
 courtyard-0.142
castle-0.139
utter-0.138
 builder-0.136
 grounds-0.135
Token glass
Token ID5405
Feature activation-1.43e-02

Pos logit contributions
text+0.238
+0.195
 AW+0.193
ifying+0.179
 Sn+0.178
Neg logit contributions
 courtyard-0.206
 classroom-0.204
 neighborhood-0.192
 studio-0.189
castle-0.189
Token and
Token ID290
Feature activation-7.03e-03

Pos logit contributions
text+0.030
+0.025
ifying+0.025
 AW+0.024
 Sn+0.023
Neg logit contributions
 courtyard-0.022
 classroom-0.021
castle-0.021
 neighborhood-0.021
 builder-0.021
Token told
Token ID1297
Feature activation-2.36e-02

Pos logit contributions
text+0.013
+0.011
ifying+0.011
 AW+0.010
 Sn+0.010
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.012
castle-0.012
 builder-0.012
Token him
Token ID683
Feature activation-1.11e-03

Pos logit contributions
text+0.048
ifying+0.042
+0.040
 AW+0.039
 Sn+0.037
Neg logit contributions
 courtyard-0.037
 classroom-0.037
 painter-0.035
 builder-0.035
 artist-0.035
Token that
Token ID326
Feature activation+4.83e-03

Pos logit contributions
text+0.002
+0.002
ifying+0.002
 AW+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
castle-0.002
 builder-0.002
Token cried
Token ID16896
Feature activation-3.60e-02

Pos logit contributions
text+0.017
+0.015
ifying+0.014
 AW+0.014
 Sn+0.014
Neg logit contributions
 courtyard-0.011
 classroom-0.010
castle-0.010
 neighborhood-0.010
 builder-0.010
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
text+0.075
+0.063
ifying+0.063
 AW+0.060
 Sn+0.058
Neg logit contributions
 courtyard-0.057
 classroom-0.055
 neighborhood-0.054
castle-0.053
 builder-0.053
Token dropping
Token ID12047
Feature activation+1.01e-02

Pos logit contributions
 courtyard+0.016
 classroom+0.016
castle+0.015
 neighborhood+0.015
 artist+0.015
Neg logit contributions
text-0.021
-0.019
ifying-0.018
 Sn-0.018
 eight-0.017
Token the
Token ID262
Feature activation+2.16e-02

Pos logit contributions
 classroom+0.016
 courtyard+0.016
 painter+0.015
 builder+0.015
 neighborhood+0.015
Neg logit contributions
text-0.021
ifying-0.017
-0.017
 AW-0.017
 eight-0.015
Token magn
Token ID7842
Feature activation-5.17e-02

Pos logit contributions
 courtyard+0.032
 classroom+0.031
 neighborhood+0.031
castle+0.030
 builder+0.030
Neg logit contributions
text-0.046
-0.040
ifying-0.040
 AW-0.038
 eight-0.037
Tokenifying
Token ID4035
Feature activation-1.20e-01

Pos logit contributions
text+0.049
 AW+0.028
 Sn+0.026
+0.024
haw+0.023
Neg logit contributions
 courtyard-0.133
 builder-0.133
castle-0.132
utter-0.131
pit-0.131
Token glass
Token ID5405
Feature activation-1.09e-02

Pos logit contributions
text+0.222
+0.182
 AW+0.181
ifying+0.172
 rib+0.165
Neg logit contributions
 courtyard-0.215
 classroom-0.214
 neighborhood-0.204
 studio-0.202
 painter-0.200
Token.
Token ID13
Feature activation+8.52e-04

Pos logit contributions
text+0.023
+0.019
ifying+0.019
 AW+0.019
 Sn+0.018
Neg logit contributions
 courtyard-0.016
 classroom-0.016
castle-0.016
 builder-0.015
 neighborhood-0.015
Token
Token ID198
Feature activation-7.05e-03

Pos logit contributions
 courtyard+0.002
castle+0.001
 classroom+0.001
 artist+0.001
 neighborhood+0.001
Neg logit contributions
text-0.002
-0.001
ifying-0.001
 AW-0.001
 Sn-0.001
Token
Token ID198
Feature activation-1.57e-04

Pos logit contributions
text+0.015
ifying+0.013
 rib+0.012
 AW+0.012
+0.011
Neg logit contributions
 neighborhood-0.011
 classroom-0.010
 courtyard-0.010
 painter-0.010
 builder-0.010
Token"
Token ID1
Feature activation-3.92e-03

Pos logit contributions
text+0.000
ifying+0.000
 AW+0.000
+0.000
 rib+0.000
Neg logit contributions
 courtyard-0.000
 neighborhood-0.000
 classroom-0.000
 painter-0.000
 builder-0.000
Token examine
Token ID10716
Feature activation+2.41e-02

Pos logit contributions
 courtyard+0.044
 neighborhood+0.042
 classroom+0.042
castle+0.041
 grounds+0.041
Neg logit contributions
text-0.056
-0.050
ifying-0.048
 AW-0.045
 Sn-0.044
Token it
Token ID340
Feature activation+7.51e-03

Pos logit contributions
 courtyard+0.037
 artist+0.037
 classroom+0.037
 painter+0.036
castle+0.036
Neg logit contributions
text-0.050
 AW-0.040
ifying-0.040
-0.039
 Sn-0.036
Token with
Token ID351
Feature activation+1.56e-03

Pos logit contributions
 courtyard+0.014
 neighborhood+0.014
 classroom+0.013
 painter+0.013
 builder+0.013
Neg logit contributions
text-0.012
-0.011
ifying-0.010
 Sn-0.010
 rib-0.010
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 artist+0.003
 painter+0.002
 neighborhood+0.002
Neg logit contributions
text-0.003
ifying-0.002
 AW-0.002
-0.002
 Sn-0.002
Token magn
Token ID7842
Feature activation-5.13e-02

Pos logit contributions
 courtyard+0.021
 classroom+0.021
 neighborhood+0.021
 painter+0.020
 builder+0.020
Neg logit contributions
text-0.030
-0.025
ifying-0.024
 AW-0.023
 Sn-0.023
Tokenifying
Token ID4035
Feature activation-1.19e-01

Pos logit contributions
text+0.052
 AW+0.030
Fl+0.029
 Sn+0.028
+0.027
Neg logit contributions
castle-0.132
 artist-0.131
 courtyard-0.131
 studio-0.130
 builder-0.129
Token glass
Token ID5405
Feature activation-7.88e-03

Pos logit contributions
text+0.229
 AW+0.184
+0.182
ifying+0.177
 Sn+0.169
Neg logit contributions
 courtyard-0.208
 classroom-0.206
 studio-0.195
 neighborhood-0.193
castle-0.191
Token?"
Token ID1701
Feature activation+3.35e-02

Pos logit contributions
text+0.016
+0.013
ifying+0.013
 AW+0.013
 Sn+0.012
Neg logit contributions
 courtyard-0.013
 classroom-0.013
 neighborhood-0.012
castle-0.012
 builder-0.012
Token
Token ID198
Feature activation-7.35e-03

Pos logit contributions
 classroom+0.051
 neighborhood+0.050
 courtyard+0.050
 painter+0.049
 builder+0.049
Neg logit contributions
text-0.071
ifying-0.064
 AW-0.059
-0.056
 rib-0.055
Token
Token ID198
Feature activation-1.33e-03

Pos logit contributions
text+0.016
ifying+0.014
 AW+0.013
 rib+0.013
+0.013
Neg logit contributions
 neighborhood-0.011
 classroom-0.010
 courtyard-0.010
 builder-0.010
 painter-0.010
Token"
Token ID1
Feature activation+1.13e-03

Pos logit contributions
text+0.002
ifying+0.002
 AW+0.002
+0.002
 rib+0.002
Neg logit contributions
 courtyard-0.002
 neighborhood-0.002
 builder-0.002
 classroom-0.002
castle-0.002
Token said
Token ID531
Feature activation-1.14e-02

Pos logit contributions
text+0.014
+0.012
ifying+0.011
 AW+0.011
 Sn+0.011
Neg logit contributions
 courtyard-0.014
 classroom-0.014
 neighborhood-0.014
castle-0.014
 builder-0.014
Token,
Token ID11
Feature activation+5.70e-03

Pos logit contributions
text+0.020
+0.018
ifying+0.016
 Sn+0.016
 eight+0.016
Neg logit contributions
 courtyard-0.021
 neighborhood-0.021
 classroom-0.020
castle-0.020
 grounds-0.020
Token holding
Token ID4769
Feature activation+1.22e-02

Pos logit contributions
 courtyard+0.009
 neighborhood+0.008
 classroom+0.008
castle+0.008
 grounds+0.008
Neg logit contributions
text-0.011
-0.011
 eight-0.010
 Sn-0.010
ifying-0.010
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
 classroom+0.018
 courtyard+0.018
 painter+0.018
 neighborhood+0.018
 builder+0.018
Neg logit contributions
text-0.026
ifying-0.022
 AW-0.021
-0.021
 Sn-0.019
Token magn
Token ID7842
Feature activation-5.16e-02

Pos logit contributions
 courtyard+0.031
 classroom+0.030
 neighborhood+0.030
 painter+0.029
 builder+0.029
Neg logit contributions
text-0.049
-0.041
ifying-0.041
 AW-0.040
 Sn-0.039
Tokenifying
Token ID4035
Feature activation-1.19e-01

Pos logit contributions
text+0.046
 AW+0.024
 Sn+0.023
+0.018
Fl+0.018
Neg logit contributions
 courtyard-0.139
castle-0.138
 builder-0.138
utter-0.137
pit-0.135
Token glass
Token ID5405
Feature activation-1.01e-02

Pos logit contributions
text+0.222
+0.178
 AW+0.177
ifying+0.169
 Sn+0.163
Neg logit contributions
 courtyard-0.213
 classroom-0.213
 neighborhood-0.201
 studio-0.201
 painter-0.200
Token over
Token ID625
Feature activation+1.90e-02

Pos logit contributions
text+0.021
+0.018
ifying+0.017
 AW+0.017
 Sn+0.016
Neg logit contributions
 courtyard-0.016
 classroom-0.015
 neighborhood-0.015
castle-0.015
 builder-0.015
Token a
Token ID257
Feature activation-2.49e-02

Pos logit contributions
 courtyard+0.029
 classroom+0.029
 neighborhood+0.028
 painter+0.028
 builder+0.028
Neg logit contributions
text-0.041
 AW-0.034
ifying-0.033
-0.033
 Sn-0.030
Token black
Token ID2042
Feature activation-2.29e-03

Pos logit contributions
text+0.053
ifying+0.044
+0.044
 AW+0.042
 Sn+0.041
Neg logit contributions
 courtyard-0.038
 classroom-0.037
 neighborhood-0.037
 painter-0.036
 builder-0.036
Token ant
Token ID1885
Feature activation+7.90e-03

Pos logit contributions
text+0.005
+0.004
ifying+0.004
 AW+0.004
 Sn+0.004
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
 painter-0.003
castle-0.003
Token,
Token ID11
Feature activation+3.24e-03

Pos logit contributions
text+0.020
ifying+0.017
+0.017
 AW+0.017
 Sn+0.016
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 builder-0.012
 painter-0.011
castle-0.011
Token handing
Token ID22786
Feature activation-2.72e-03

Pos logit contributions
 courtyard+0.005
 classroom+0.005
 neighborhood+0.005
castle+0.005
 builder+0.005
Neg logit contributions
text-0.007
-0.006
ifying-0.006
 AW-0.006
 Sn-0.005
Token him
Token ID683
Feature activation+7.42e-03

Pos logit contributions
text+0.006
ifying+0.005
 AW+0.005
+0.005
 eight+0.005
Neg logit contributions
 classroom-0.004
 painter-0.004
 courtyard-0.003
 neighborhood-0.003
 artist-0.003
Token the
Token ID262
Feature activation+2.09e-02

Pos logit contributions
 classroom+0.011
 courtyard+0.011
 painter+0.011
 builder+0.011
 neighborhood+0.010
Neg logit contributions
text-0.015
 AW-0.013
ifying-0.013
-0.012
 GO-0.011
Token magn
Token ID7842
Feature activation-5.13e-02

Pos logit contributions
 courtyard+0.031
 classroom+0.031
 neighborhood+0.030
 painter+0.030
 builder+0.029
Neg logit contributions
text-0.046
-0.038
ifying-0.038
 AW-0.037
 Sn-0.036
Tokenifying
Token ID4035
Feature activation-1.19e-01

Pos logit contributions
text+0.042
 AW+0.022
 Sn+0.020
+0.019
Fl+0.017
Neg logit contributions
castle-0.142
 courtyard-0.142
 builder-0.140
 studio-0.138
pit-0.137
Token glass
Token ID5405
Feature activation-1.12e-02

Pos logit contributions
text+0.228
+0.183
 AW+0.181
ifying+0.171
 Sn+0.170
Neg logit contributions
 classroom-0.208
 courtyard-0.207
 studio-0.196
 neighborhood-0.195
castle-0.194
Token.
Token ID13
Feature activation+4.58e-03

Pos logit contributions
text+0.024
ifying+0.020
+0.020
 AW+0.020
 Sn+0.019
Neg logit contributions
 courtyard-0.017
 classroom-0.016
castle-0.016
 builder-0.016
 neighborhood-0.016
Token She
Token ID1375
Feature activation-6.78e-04

Pos logit contributions
 courtyard+0.008
 classroom+0.008
 neighborhood+0.007
castle+0.007
 builder+0.007
Neg logit contributions
text-0.009
-0.007
ifying-0.007
 AW-0.007
 Sn-0.007
Token looked
Token ID3114
Feature activation+2.35e-02

Pos logit contributions
text+0.001
ifying+0.001
+0.001
 AW+0.001
 Sn+0.001
Neg logit contributions
 courtyard-0.001
 classroom-0.001
 neighborhood-0.001
castle-0.001
 painter-0.001
Token around
Token ID1088
Feature activation+1.81e-02

Pos logit contributions
 courtyard+0.041
castle+0.040
 classroom+0.040
 builder+0.039
 neighborhood+0.039
Neg logit contributions
text-0.043
-0.039
ifying-0.037
 AW-0.035
 rib-0.034
Token flashlight
Token ID38371
Feature activation-3.30e-02

Pos logit contributions
text+0.075
ifying+0.061
+0.060
 AW+0.059
 Sn+0.057
Neg logit contributions
 courtyard-0.054
 classroom-0.053
 neighborhood-0.052
 builder-0.051
 painter-0.051
Token,
Token ID11
Feature activation+7.69e-03

Pos logit contributions
text+0.067
+0.057
ifying+0.056
 AW+0.054
 Sn+0.052
Neg logit contributions
 courtyard-0.052
 classroom-0.051
 neighborhood-0.050
 painter-0.049
 builder-0.049
Token and
Token ID290
Feature activation-6.51e-03

Pos logit contributions
 courtyard+0.012
 classroom+0.011
 neighborhood+0.011
 builder+0.011
 painter+0.011
Neg logit contributions
text-0.016
ifying-0.014
 AW-0.014
-0.013
 Sn-0.013
Token a
Token ID257
Feature activation-3.33e-02

Pos logit contributions
text+0.013
ifying+0.010
 AW+0.010
+0.010
 Sn+0.010
Neg logit contributions
 courtyard-0.011
 classroom-0.011
 neighborhood-0.011
 builder-0.011
 painter-0.010
Token magn
Token ID7842
Feature activation-5.50e-02

Pos logit contributions
text+0.071
+0.059
ifying+0.059
 AW+0.057
 Sn+0.055
Neg logit contributions
 courtyard-0.051
 classroom-0.050
 neighborhood-0.049
 builder-0.048
 painter-0.048
Tokenifying
Token ID4035
Feature activation-1.17e-01

Pos logit contributions
text+0.060
 AW+0.037
Fl+0.036
 Sn+0.035
+0.034
Neg logit contributions
 courtyard-0.140
 builder-0.138
castle-0.137
 grounds-0.135
pit-0.133
Token glass
Token ID5405
Feature activation-5.18e-03

Pos logit contributions
text+0.226
+0.181
 AW+0.179
 Sn+0.171
ifying+0.170
Neg logit contributions
 courtyard-0.207
 classroom-0.202
castle-0.196
 neighborhood-0.192
 studio-0.189
Token.
Token ID13
Feature activation+3.13e-03

Pos logit contributions
text+0.011
+0.009
ifying+0.009
 AW+0.008
 Sn+0.008
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.008
 builder-0.008
Token
Token ID198
Feature activation-2.17e-03

Pos logit contributions
 classroom+0.004
 courtyard+0.004
 neighborhood+0.004
 builder+0.004
 studio+0.004
Neg logit contributions
text-0.008
 AW-0.007
ifying-0.006
 tal-0.006
-0.006
Token
Token ID198
Feature activation+3.65e-03

Pos logit contributions
text+0.004
ifying+0.004
 AW+0.004
 rib+0.003
 Sn+0.003
Neg logit contributions
 neighborhood-0.004
 courtyard-0.004
 classroom-0.004
 painter-0.003
 builder-0.003
TokenAs
Token ID1722
Feature activation-1.85e-03

Pos logit contributions
 courtyard+0.006
 classroom+0.006
 neighborhood+0.006
 painter+0.006
 builder+0.006
Neg logit contributions
text-0.007
ifying-0.006
 AW-0.005
 rib-0.005
 Sn-0.005
Token She
Token ID1375
Feature activation+1.02e-02

Pos logit contributions
 courtyard+0.008
castle+0.008
 classroom+0.008
 neighborhood+0.008
 painter+0.007
Neg logit contributions
text-0.015
-0.013
ifying-0.012
 AW-0.012
 Sn-0.012
Token took
Token ID1718
Feature activation-5.23e-03

Pos logit contributions
 courtyard+0.016
 classroom+0.016
 neighborhood+0.015
castle+0.015
 painter+0.015
Neg logit contributions
text-0.021
-0.018
ifying-0.017
 AW-0.017
 Sn-0.016
Token out
Token ID503
Feature activation+1.47e-02

Pos logit contributions
text+0.010
ifying+0.009
 AW+0.009
+0.009
 Sn+0.008
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 builder-0.008
 neighborhood-0.007
 artist-0.007
Token a
Token ID257
Feature activation-4.71e-03

Pos logit contributions
 courtyard+0.025
 classroom+0.025
 neighborhood+0.024
 builder+0.024
 painter+0.024
Neg logit contributions
text-0.028
 AW-0.024
ifying-0.024
-0.023
 Sn-0.021
Token magn
Token ID7842
Feature activation-4.54e-02

Pos logit contributions
text+0.009
+0.008
ifying+0.008
 AW+0.007
 eight+0.007
Neg logit contributions
 courtyard-0.008
 neighborhood-0.008
 classroom-0.008
castle-0.008
 artist-0.007
Tokenifying
Token ID4035
Feature activation-1.17e-01

Pos logit contributions
text+0.035
 AW+0.018
 Sn+0.016
+0.016
Fl+0.013
Neg logit contributions
 courtyard-0.129
 builder-0.126
castle-0.125
 grounds-0.124
 studio-0.123
Token glass
Token ID5405
Feature activation-2.81e-03

Pos logit contributions
text+0.215
+0.173
 AW+0.170
ifying+0.166
 Sn+0.158
Neg logit contributions
 courtyard-0.216
 classroom-0.212
 neighborhood-0.205
 studio-0.202
 painter-0.201
Token and
Token ID290
Feature activation-1.40e-03

Pos logit contributions
text+0.005
+0.004
ifying+0.004
 AW+0.004
 eight+0.004
Neg logit contributions
 courtyard-0.005
 classroom-0.005
 neighborhood-0.005
 painter-0.005
 artist-0.005
Token looked
Token ID3114
Feature activation+3.60e-02

Pos logit contributions
text+0.003
+0.002
ifying+0.002
 AW+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
castle-0.002
 builder-0.002
Token very
Token ID845
Feature activation+2.32e-02

Pos logit contributions
 classroom+0.060
 neighborhood+0.059
castle+0.058
 courtyard+0.058
 painter+0.058
Neg logit contributions
text-0.066
-0.062
ifying-0.060
 rib-0.056
 AW-0.055
Token carefully
Token ID7773
Feature activation-7.22e-03

Pos logit contributions
 classroom+0.041
castle+0.040
pit+0.040
 neighborhood+0.040
 painter+0.040
Neg logit contributions
-0.037
 eight-0.033
text-0.033
ifying-0.031
 rib-0.030
Token
Token ID198
Feature activation+3.02e-03

Pos logit contributions
text+0.002
ifying+0.002
 AW+0.002
+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 neighborhood-0.002
 classroom-0.002
 painter-0.002
 builder-0.002
TokenHe
Token ID1544
Feature activation-8.53e-03

Pos logit contributions
 courtyard+0.006
 classroom+0.005
 neighborhood+0.005
castle+0.005
 builder+0.005
Neg logit contributions
text-0.006
-0.005
ifying-0.005
 AW-0.004
 Sn-0.004
Token pointed
Token ID6235
Feature activation+2.49e-02

Pos logit contributions
text+0.018
ifying+0.015
+0.014
 AW+0.014
 Sn+0.014
Neg logit contributions
 courtyard-0.014
 classroom-0.013
 neighborhood-0.013
 painter-0.013
 builder-0.013
Token his
Token ID465
Feature activation+8.56e-03

Pos logit contributions
 courtyard+0.041
 classroom+0.040
 neighborhood+0.040
castle+0.039
 builder+0.039
Neg logit contributions
text-0.049
-0.042
ifying-0.041
 AW-0.039
 Sn-0.038
Token magn
Token ID7842
Feature activation-5.41e-02

Pos logit contributions
 classroom+0.014
 courtyard+0.014
 painter+0.013
 neighborhood+0.013
 builder+0.013
Neg logit contributions
text-0.018
-0.015
ifying-0.014
 Sn-0.014
 AW-0.014
Tokenifying
Token ID4035
Feature activation-1.16e-01

Pos logit contributions
text+0.055
 AW+0.033
Fl+0.032
 Sn+0.030
 Fl+0.028
Neg logit contributions
 courtyard-0.136
utter-0.134
castle-0.134
 builder-0.133
pit-0.133
Token glass
Token ID5405
Feature activation-1.86e-02

Pos logit contributions
text+0.217
+0.177
 AW+0.172
ifying+0.167
 Sn+0.161
Neg logit contributions
 classroom-0.208
 courtyard-0.208
 neighborhood-0.198
 studio-0.196
castle-0.194
Token at
Token ID379
Feature activation+1.96e-02

Pos logit contributions
text+0.038
+0.032
ifying+0.032
 AW+0.031
 Sn+0.030
Neg logit contributions
 courtyard-0.030
 classroom-0.029
 neighborhood-0.028
castle-0.028
 builder-0.028
Token different
Token ID1180
Feature activation+2.41e-02

Pos logit contributions
 courtyard+0.040
 classroom+0.039
 neighborhood+0.039
castle+0.039
 builder+0.038
Neg logit contributions
text-0.031
-0.025
ifying-0.024
 AW-0.022
 Sn-0.022
Token classmates
Token ID28999
Feature activation+5.99e-03

Pos logit contributions
 courtyard+0.037
castle+0.035
 neighborhood+0.035
 classroom+0.035
 builder+0.035
Neg logit contributions
text-0.050
-0.044
ifying-0.043
 AW-0.040
 rib-0.040
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 courtyard+0.010
 classroom+0.010
 neighborhood+0.010
castle+0.010
 builder+0.010
Neg logit contributions
text-0.011
-0.010
ifying-0.010
 Sn-0.009
 AW-0.009
Token it
Token ID340
Feature activation+2.62e-02

Pos logit contributions
text+0.031
ifying+0.026
+0.025
 AW+0.025
 Sn+0.023
Neg logit contributions
 classroom-0.026
 courtyard-0.026
 neighborhood-0.025
 studio-0.024
 painter-0.024
Token she
Token ID673
Feature activation+1.29e-02

Pos logit contributions
 artist+0.053
 painter+0.053
castle+0.053
 writer+0.052
 scientist+0.051
Neg logit contributions
-0.039
text-0.038
 Fl-0.029
 Sn-0.028
 AW-0.027
Token had
Token ID550
Feature activation-3.28e-02

Pos logit contributions
 courtyard+0.018
 classroom+0.018
 neighborhood+0.018
 painter+0.018
castle+0.017
Neg logit contributions
text-0.028
-0.024
ifying-0.024
 AW-0.023
 Sn-0.022
Token a
Token ID257
Feature activation-1.51e-02

Pos logit contributions
text+0.067
ifying+0.061
 AW+0.059
raw+0.054
 Sn+0.054
Neg logit contributions
 courtyard-0.054
 neighborhood-0.051
 grounds-0.049
 classroom-0.049
 neighbourhood-0.044
Token magn
Token ID7842
Feature activation-4.71e-02

Pos logit contributions
text+0.026
+0.023
ifying+0.022
 AW+0.021
 Sn+0.020
Neg logit contributions
 courtyard-0.027
castle-0.026
 classroom-0.026
 neighborhood-0.026
 artist-0.026
Tokenifying
Token ID4035
Feature activation-1.13e-01

Pos logit contributions
text+0.039
 AW+0.021
+0.019
 Sn+0.018
Fl+0.016
Neg logit contributions
 courtyard-0.133
 builder-0.129
castle-0.128
 grounds-0.127
 neighborhood-0.127
Token glass
Token ID5405
Feature activation-3.99e-03

Pos logit contributions
text+0.205
+0.165
 AW+0.162
ifying+0.160
 Sn+0.151
Neg logit contributions
 courtyard-0.211
 classroom-0.205
 neighborhood-0.200
castle-0.197
 studio-0.194
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
text+0.008
+0.007
ifying+0.006
 AW+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.007
 classroom-0.007
 neighborhood-0.007
 painter-0.006
 builder-0.006
Token a
Token ID257
Feature activation-3.33e-02

Pos logit contributions
 courtyard+0.034
 classroom+0.033
 neighborhood+0.033
 painter+0.032
 builder+0.032
Neg logit contributions
text-0.042
ifying-0.035
-0.034
 AW-0.034
 Sn-0.032
Token tiny
Token ID7009
Feature activation-1.93e-02

Pos logit contributions
text+0.066
+0.055
ifying+0.055
 AW+0.052
 Sn+0.051
Neg logit contributions
 courtyard-0.055
 classroom-0.054
 neighborhood-0.053
castle-0.052
 builder-0.052
Token torch
Token ID28034
Feature activation-5.23e-03

Pos logit contributions
text+0.040
+0.033
ifying+0.033
 AW+0.032
 Sn+0.031
Neg logit contributions
 courtyard-0.030
 classroom-0.029
 neighborhood-0.029
castle-0.029
 builder-0.029
Token She
Token ID1375
Feature activation+5.47e-03

Pos logit contributions
text+0.004
+0.003
ifying+0.003
 AW+0.003
 Sn+0.003
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
castle-0.003
 builder-0.003
Token had
Token ID550
Feature activation-2.19e-02

Pos logit contributions
 courtyard+0.008
 classroom+0.008
castle+0.008
 scientist+0.008
 builder+0.008
Neg logit contributions
text-0.011
-0.010
 rib-0.009
 eight-0.009
 Sn-0.009
Token a
Token ID257
Feature activation-1.03e-02

Pos logit contributions
text+0.037
ifying+0.029
+0.029
 AW+0.028
 Sn+0.027
Neg logit contributions
 courtyard-0.044
 classroom-0.043
 neighborhood-0.043
 painter-0.041
castle-0.041
Token big
Token ID1263
Feature activation+1.95e-02

Pos logit contributions
text+0.015
+0.013
 eight+0.013
ifying+0.012
 rib+0.012
Neg logit contributions
 courtyard-0.021
castle-0.020
 grounds-0.020
ashing-0.019
 artist-0.019
Token magn
Token ID7842
Feature activation-5.26e-02

Pos logit contributions
 courtyard+0.032
 grounds+0.031
ashing+0.030
castle+0.029
 builder+0.029
Neg logit contributions
text-0.036
-0.031
 rib-0.030
 ho-0.030
 AW-0.030
Tokenifying
Token ID4035
Feature activation-1.12e-01

Pos logit contributions
text+0.041
+0.022
 AW+0.020
 Sn+0.016
Fl+0.014
Neg logit contributions
 courtyard-0.150
castle-0.148
 builder-0.147
 studio-0.145
 classroom-0.145
Token glass
Token ID5405
Feature activation-6.60e-03

Pos logit contributions
text+0.197
+0.159
ifying+0.156
 AW+0.152
 Sn+0.144
Neg logit contributions
 courtyard-0.215
 classroom-0.210
 neighborhood-0.206
castle-0.203
 painter-0.201
Token that
Token ID326
Feature activation+3.41e-03

Pos logit contributions
text+0.012
ifying+0.010
+0.010
 eight+0.009
 rib+0.009
Neg logit contributions
 courtyard-0.012
 neighborhood-0.011
 classroom-0.011
 painter-0.011
 artist-0.011
Token made
Token ID925
Feature activation-1.72e-02

Pos logit contributions
 courtyard+0.006
 neighborhood+0.006
 builder+0.006
 classroom+0.006
castle+0.006
Neg logit contributions
text-0.006
-0.005
 rib-0.005
 Sn-0.004
 Fl-0.004
Token things
Token ID1243
Feature activation-1.46e-02

Pos logit contributions
text+0.034
+0.029
ifying+0.029
 AW+0.027
 Sn+0.027
Neg logit contributions
 courtyard-0.028
 classroom-0.027
 neighborhood-0.027
castle-0.026
 builder-0.026
Token look
Token ID804
Feature activation+1.20e-02

Pos logit contributions
+0.025
text+0.023
ifying+0.022
 rib+0.021
 ho+0.019
Neg logit contributions
 neighborhood-0.027
 courtyard-0.027
 classroom-0.026
 studio-0.026
 painter-0.025
Token.
Token ID13
Feature activation+2.10e-03

Pos logit contributions
text+0.040
+0.034
ifying+0.033
 AW+0.032
 Sn+0.031
Neg logit contributions
 courtyard-0.031
 classroom-0.030
 neighborhood-0.029
castle-0.029
 builder-0.029
Token Lily
Token ID20037
Feature activation+5.69e-04

Pos logit contributions
 courtyard+0.004
 classroom+0.004
 neighborhood+0.003
castle+0.003
 painter+0.003
Neg logit contributions
text-0.004
-0.003
ifying-0.003
 AW-0.003
 Sn-0.003
Token had
Token ID550
Feature activation-2.52e-02

Pos logit contributions
 courtyard+0.001
castle+0.001
 classroom+0.001
 painter+0.001
 studio+0.001
Neg logit contributions
text-0.001
-0.001
 eight-0.001
 rib-0.001
 Sn-0.001
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
text+0.044
ifying+0.035
 AW+0.034
+0.034
 Sn+0.032
Neg logit contributions
 courtyard-0.050
 classroom-0.049
 neighborhood-0.048
 builder-0.046
 grounds-0.046
Token magn
Token ID7842
Feature activation-5.33e-02

Pos logit contributions
text+0.021
+0.019
ifying+0.019
 AW+0.018
 eight+0.018
Neg logit contributions
 courtyard-0.023
 neighborhood-0.022
castle-0.022
 grounds-0.022
 artist-0.021
Tokenifying
Token ID4035
Feature activation-1.12e-01

Pos logit contributions
text+0.042
 AW+0.022
+0.021
Fl+0.017
 Sn+0.017
Neg logit contributions
 courtyard-0.150
 builder-0.149
castle-0.148
pit-0.145
 classroom-0.144
Token glass
Token ID5405
Feature activation-6.09e-03

Pos logit contributions
text+0.197
+0.160
ifying+0.155
 AW+0.153
 Sn+0.145
Neg logit contributions
 courtyard-0.212
 classroom-0.208
 neighborhood-0.201
castle-0.199
 studio-0.197
Token that
Token ID326
Feature activation+4.59e-03

Pos logit contributions
text+0.012
+0.010
ifying+0.010
 AW+0.009
 Sn+0.009
Neg logit contributions
 courtyard-0.010
 neighborhood-0.010
 classroom-0.010
 painter-0.009
 builder-0.009
Token she
Token ID673
Feature activation+4.57e-03

Pos logit contributions
 courtyard+0.008
 neighborhood+0.008
 classroom+0.008
 painter+0.008
 builder+0.008
Neg logit contributions
text-0.009
-0.008
ifying-0.007
 Sn-0.007
 rib-0.006
Token found
Token ID1043
Feature activation+1.21e-02

Pos logit contributions
 courtyard+0.007
 classroom+0.007
castle+0.007
 neighborhood+0.007
 builder+0.006
Neg logit contributions
text-0.010
-0.008
ifying-0.008
 eight-0.008
 rib-0.008
Token in
Token ID287
Feature activation-1.09e-02

Pos logit contributions
 courtyard+0.018
 neighborhood+0.017
 grounds+0.017
 classroom+0.017
castle+0.016
Neg logit contributions
text-0.027
 AW-0.023
ifying-0.022
-0.022
 Sn-0.021
Token a
Token ID257
Feature activation-1.09e-02

Pos logit contributions
text+0.033
ifying+0.028
 AW+0.027
+0.026
 Sn+0.025
Neg logit contributions
 courtyard-0.035
 neighborhood-0.035
 classroom-0.034
 grounds-0.033
 builder-0.032
Token flashlight
Token ID38371
Feature activation-1.74e-02

Pos logit contributions
text+0.020
+0.017
ifying+0.016
 AW+0.016
 Sn+0.015
Neg logit contributions
 courtyard-0.019
 neighborhood-0.018
 classroom-0.018
castle-0.018
 artist-0.018
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
text+0.029
ifying+0.026
+0.026
 eight+0.023
 rib+0.023
Neg logit contributions
 courtyard-0.031
 neighborhood-0.031
 classroom-0.031
 painter-0.030
 artist-0.029
Token a
Token ID257
Feature activation-2.96e-02

Pos logit contributions
text+0.035
ifying+0.028
 AW+0.027
+0.027
 Sn+0.026
Neg logit contributions
 courtyard-0.040
 classroom-0.039
 neighborhood-0.038
 painter-0.037
 builder-0.037
Token magn
Token ID7842
Feature activation-5.36e-02

Pos logit contributions
text+0.058
+0.049
ifying+0.049
 AW+0.047
 Sn+0.045
Neg logit contributions
 courtyard-0.048
 neighborhood-0.046
 classroom-0.046
castle-0.046
 artist-0.045
Tokenifying
Token ID4035
Feature activation-1.11e-01

Pos logit contributions
text+0.044
+0.025
 AW+0.025
 Sn+0.020
 tal+0.017
Neg logit contributions
 courtyard-0.151
 builder-0.148
castle-0.148
 grounds-0.145
pit-0.145
Token glass
Token ID5405
Feature activation-9.06e-03

Pos logit contributions
text+0.200
+0.164
ifying+0.159
 AW+0.157
 Sn+0.149
Neg logit contributions
 courtyard-0.205
 classroom-0.200
 neighborhood-0.195
castle-0.193
 studio-0.190
Token to
Token ID284
Feature activation-6.95e-03

Pos logit contributions
text+0.018
ifying+0.015
+0.015
 AW+0.014
 Sn+0.014
Neg logit contributions
 courtyard-0.015
 neighborhood-0.014
 classroom-0.014
 painter-0.014
 builder-0.014
Token examine
Token ID10716
Feature activation+1.47e-02

Pos logit contributions
text+0.013
+0.011
ifying+0.011
 rib+0.011
 AW+0.010
Neg logit contributions
 neighborhood-0.012
 courtyard-0.012
 classroom-0.011
castle-0.011
 builder-0.011
Token their
Token ID511
Feature activation+8.24e-03

Pos logit contributions
 courtyard+0.020
 classroom+0.020
 neighborhood+0.020
 grounds+0.019
 builder+0.019
Neg logit contributions
text-0.032
 AW-0.028
ifying-0.027
-0.026
 eight-0.026
Token finds
Token ID7228
Feature activation+5.09e-03

Pos logit contributions
 courtyard+0.011
 classroom+0.010
 neighborhood+0.010
castle+0.010
 builder+0.010
Neg logit contributions
text-0.019
-0.017
ifying-0.017
 AW-0.016
 Sn-0.016
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
 courtyard+0.005
 classroom+0.005
 neighborhood+0.005
 builder+0.004
utter+0.004
Neg logit contributions
text-0.013
ifying-0.012
 AW-0.011
-0.011
 tal-0.011
Token Fiona
Token ID42207
Feature activation+1.16e-02

Pos logit contributions
 neighborhood+0.028
 courtyard+0.028
 classroom+0.028
 studio+0.027
utter+0.027
Neg logit contributions
text-0.042
ifying-0.036
 AW-0.035
-0.034
raw-0.033
Token found
Token ID1043
Feature activation+1.39e-02

Pos logit contributions
 courtyard+0.017
 classroom+0.016
 neighborhood+0.016
castle+0.016
 builder+0.016
Neg logit contributions
text-0.025
-0.021
ifying-0.021
 AW-0.020
 Sn-0.020
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
 courtyard+0.031
 classroom+0.031
 neighborhood+0.030
castle+0.030
 builder+0.030
Neg logit contributions
text-0.019
-0.015
ifying-0.015
 AW-0.014
 Sn-0.013
Token delicious
Token ID12625
Feature activation+2.45e-02

Pos logit contributions
text+0.020
+0.018
ifying+0.017
 eight+0.016
 AW+0.016
Neg logit contributions
 courtyard-0.022
 classroom-0.021
 neighborhood-0.021
castle-0.021
 artist-0.021
Token bunch
Token ID7684
Feature activation+3.17e-02

Pos logit contributions
 courtyard+0.046
 classroom+0.045
 studio+0.044
 neighborhood+0.044
 builder+0.044
Neg logit contributions
text-0.040
-0.034
ifying-0.032
 AW-0.032
 eight-0.031
Token of
Token ID286
Feature activation+1.35e-02

Pos logit contributions
 courtyard+0.058
 classroom+0.057
 neighborhood+0.057
castle+0.055
 builder+0.055
Neg logit contributions
text-0.057
-0.048
ifying-0.046
 AW-0.043
 Sn-0.042
Token bananas
Token ID35484
Feature activation-2.44e-03

Pos logit contributions
 courtyard+0.025
 neighborhood+0.024
 classroom+0.024
 painter+0.024
 builder+0.024
Neg logit contributions
text-0.025
ifying-0.021
-0.020
 AW-0.020
 Sn-0.018
Token.
Token ID13
Feature activation-4.21e-04

Pos logit contributions
text+0.005
+0.004
ifying+0.004
 AW+0.004
 Sn+0.004
Neg logit contributions
 courtyard-0.004
 classroom-0.004
 neighborhood-0.004
 painter-0.004
 builder-0.004
Token She
Token ID1375
Feature activation+7.52e-03

Pos logit contributions
text+0.001
ifying+0.001
 AW+0.001
+0.001
 tal+0.001
Neg logit contributions
 courtyard-0.000
 classroom-0.000
 neighborhood-0.000
 studio-0.000
 grounds-0.000
Token excited
Token ID6568
Feature activation-4.02e-03

Pos logit contributions
 courtyard+0.011
 neighborhood+0.010
 classroom+0.010
 painter+0.010
 builder+0.010
Neg logit contributions
text-0.017
ifying-0.015
 AW-0.014
-0.014
 Sn-0.013
Token Anna
Token ID11735
Feature activation-2.57e-02

Pos logit contributions
text+0.061
ifying+0.052
 AW+0.050
+0.050
 tal+0.047
Neg logit contributions
 classroom-0.050
 painter-0.049
 builder-0.048
 courtyard-0.048
 studio-0.048
Token are
Token ID389
Feature activation+7.08e-03

Pos logit contributions
text+0.046
+0.040
ifying+0.038
 AW+0.036
 Sn+0.035
Neg logit contributions
 courtyard-0.047
 classroom-0.046
 neighborhood-0.045
castle-0.045
 builder-0.045
Token scared
Token ID12008
Feature activation+8.76e-03

Pos logit contributions
 courtyard+0.011
 artist+0.011
castle+0.011
 studio+0.010
 writer+0.010
Neg logit contributions
text-0.015
-0.014
 eight-0.012
ifying-0.012
 Sn-0.012
Token.
Token ID13
Feature activation+2.04e-03

Pos logit contributions
 courtyard+0.017
castle+0.016
 classroom+0.016
 studio+0.016
 neighborhood+0.016
Neg logit contributions
text-0.016
-0.013
ifying-0.012
 Sn-0.012
 eight-0.012
Token They
Token ID1119
Feature activation+2.36e-03

Pos logit contributions
 courtyard+0.004
castle+0.004
 artist+0.004
 classroom+0.004
 neighborhood+0.004
Neg logit contributions
text-0.004
-0.003
ifying-0.003
 Sn-0.003
 eight-0.003
Token nod
Token ID18666
Feature activation+3.09e-02

Pos logit contributions
 courtyard+0.003
 classroom+0.003
 neighborhood+0.003
 painter+0.003
 builder+0.003
Neg logit contributions
text-0.005
ifying-0.004
-0.004
 AW-0.004
 Sn-0.004
Token.
Token ID13
Feature activation+5.05e-03

Pos logit contributions
 courtyard+0.051
 classroom+0.049
 neighborhood+0.049
castle+0.048
 builder+0.048
Neg logit contributions
text-0.062
-0.053
ifying-0.052
 AW-0.049
 Sn-0.048
Token They
Token ID1119
Feature activation+4.95e-03

Pos logit contributions
 courtyard+0.008
 classroom+0.008
 neighborhood+0.008
castle+0.008
 builder+0.008
Neg logit contributions
text-0.010
-0.009
ifying-0.008
 AW-0.008
 Sn-0.008
Token are
Token ID389
Feature activation+1.86e-02

Pos logit contributions
 courtyard+0.007
 classroom+0.007
 neighborhood+0.007
 painter+0.006
 builder+0.006
Neg logit contributions
text-0.011
ifying-0.009
-0.009
 AW-0.009
 Sn-0.009
Token sorry
Token ID7926
Feature activation+5.34e-03

Pos logit contributions
 courtyard+0.030
 classroom+0.028
castle+0.028
 neighborhood+0.028
 grounds+0.028
Neg logit contributions
text-0.037
-0.033
ifying-0.032
 eight-0.031
 Sn-0.031
Token.
Token ID13
Feature activation+9.33e-03

Pos logit contributions
 courtyard+0.009
 classroom+0.008
 neighborhood+0.008
castle+0.008
 builder+0.008
Neg logit contributions
text-0.011
-0.009
ifying-0.009
 AW-0.009
 Sn-0.008
Token sad
Token ID6507
Feature activation+4.98e-03

Pos logit contributions
 courtyard+0.016
 classroom+0.016
castle+0.016
 neighborhood+0.015
 painter+0.015
Neg logit contributions
text-0.026
-0.023
ifying-0.023
 AW-0.022
 Sn-0.021
Token.
Token ID13
Feature activation+1.36e-03

Pos logit contributions
 courtyard+0.011
 grounds+0.010
 classroom+0.010
 studio+0.010
castle+0.010
Neg logit contributions
text-0.007
-0.007
ifying-0.006
 Sn-0.006
 Fl-0.006
Token He
Token ID679
Feature activation+4.71e-03

Pos logit contributions
 courtyard+0.002
 classroom+0.002
 neighborhood+0.002
 builder+0.002
 painter+0.002
Neg logit contributions
text-0.003
ifying-0.002
-0.002
 AW-0.002
 Sn-0.002
Token wanted
Token ID2227
Feature activation+5.47e-03

Pos logit contributions
 courtyard+0.007
 neighborhood+0.007
 classroom+0.006
 painter+0.006
 builder+0.006
Neg logit contributions
text-0.011
ifying-0.009
-0.009
 AW-0.009
 Sn-0.008
Token to
Token ID284
Feature activation+1.80e-02

Pos logit contributions
 courtyard+0.009
 classroom+0.009
 neighborhood+0.009
castle+0.009
 builder+0.009
Neg logit contributions
text-0.011
-0.009
ifying-0.009
 AW-0.009
 Sn-0.008
Token cross
Token ID3272
Feature activation+3.42e-02

Pos logit contributions
 courtyard+0.029
 classroom+0.028
 neighborhood+0.028
castle+0.028
 builder+0.027
Neg logit contributions
text-0.036
-0.031
ifying-0.030
 AW-0.029
 Sn-0.028
Token the
Token ID262
Feature activation+2.01e-02

Pos logit contributions
 courtyard+0.069
 classroom+0.067
 neighborhood+0.067
castle+0.066
 builder+0.066
Neg logit contributions
text-0.055
-0.044
ifying-0.044
 AW-0.042
 Sn-0.040
Token bridge
Token ID7696
Feature activation+1.15e-02

Pos logit contributions
 courtyard+0.025
 writer+0.024
 classroom+0.024
 artist+0.024
castle+0.023
Neg logit contributions
text-0.044
-0.041
 AW-0.039
 eight-0.039
ifying-0.038
Token just
Token ID655
Feature activation-5.22e-03

Pos logit contributions
 courtyard+0.020
 classroom+0.020
 neighborhood+0.019
 painter+0.019
 artist+0.019
Neg logit contributions
text-0.020
-0.019
ifying-0.017
 AW-0.016
 Sn-0.016
Token like
Token ID588
Feature activation-6.55e-03

Pos logit contributions
text+0.011
+0.010
ifying+0.009
 AW+0.009
 Sn+0.009
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
 builder-0.007
 painter-0.007
Token all
Token ID477
Feature activation+3.22e-02

Pos logit contributions
text+0.014
ifying+0.012
 AW+0.012
+0.011
raw+0.011
Neg logit contributions
 courtyard-0.010
 builder-0.009
 neighborhood-0.009
 classroom-0.009
 painter-0.009
Token nice
Token ID3621
Feature activation+2.25e-03

Pos logit contributions
text+0.024
ifying+0.020
 AW+0.019
+0.019
 Sn+0.018
Neg logit contributions
 courtyard-0.013
 classroom-0.013
 neighborhood-0.012
 artist-0.012
 painter-0.012
Token.
Token ID13
Feature activation+2.83e-03

Pos logit contributions
 courtyard+0.004
 classroom+0.004
 neighborhood+0.004
 artist+0.004
 painter+0.004
Neg logit contributions
text-0.004
ifying-0.004
 AW-0.004
-0.004
 Sn-0.003
Token You
Token ID921
Feature activation-3.76e-02

Pos logit contributions
 writer+0.004
 artist+0.004
 classroom+0.004
 painter+0.004
 builder+0.004
Neg logit contributions
text-0.006
 AW-0.005
ifying-0.005
heit-0.004
 tal-0.004
Token are
Token ID389
Feature activation+1.78e-02

Pos logit contributions
text+0.065
ifying+0.053
+0.051
 AW+0.051
 Sn+0.048
Neg logit contributions
 courtyard-0.072
 classroom-0.069
 neighborhood-0.069
 builder-0.068
 painter-0.068
Token my
Token ID616
Feature activation-2.99e-03

Pos logit contributions
 courtyard+0.032
 classroom+0.031
 painter+0.031
 artist+0.031
 neighborhood+0.031
Neg logit contributions
text-0.034
ifying-0.027
 AW-0.027
-0.026
 Sn-0.024
Token best
Token ID1266
Feature activation+3.57e-02

Pos logit contributions
text+0.007
ifying+0.006
 AW+0.006
+0.006
raw+0.006
Neg logit contributions
 classroom-0.004
 neighborhood-0.004
 builder-0.004
 painter-0.004
 artist-0.003
Token friend
Token ID1545
Feature activation+1.90e-03

Pos logit contributions
 builder+0.028
 artist+0.028
 painter+0.028
 classroom+0.027
 neighborhood+0.026
Neg logit contributions
text-0.104
 AW-0.094
ifying-0.093
-0.091
 rib-0.085
Token,"
Token ID553
Feature activation+2.80e-02

Pos logit contributions
 courtyard+0.003
 artist+0.003
 neighborhood+0.003
 classroom+0.003
 painter+0.003
Neg logit contributions
text-0.004
ifying-0.003
 AW-0.003
-0.003
 Sn-0.003
Token Sara
Token ID24799
Feature activation-1.20e-02

Pos logit contributions
 courtyard+0.046
 classroom+0.044
 neighborhood+0.044
castle+0.043
 builder+0.043
Neg logit contributions
text-0.056
-0.048
ifying-0.046
 AW-0.045
 Sn-0.044
Token said
Token ID531
Feature activation-1.25e-02

Pos logit contributions
text+0.021
+0.017
ifying+0.017
 AW+0.016
 Sn+0.016
Neg logit contributions
 courtyard-0.023
 classroom-0.022
 neighborhood-0.022
castle-0.021
 builder-0.021
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
text+0.026
ifying+0.022
+0.022
 AW+0.021
 Sn+0.021
Neg logit contributions
 courtyard-0.019
 classroom-0.019
 neighborhood-0.019
castle-0.018
 builder-0.018
Token.
Token ID13
Feature activation-1.79e-03

Pos logit contributions
 courtyard+0.009
 classroom+0.009
 neighborhood+0.009
castle+0.009
 builder+0.009
Neg logit contributions
text-0.012
-0.010
ifying-0.010
 AW-0.009
 Sn-0.009
Token After
Token ID2293
Feature activation-1.21e-02

Pos logit contributions
text+0.004
+0.003
ifying+0.003
 AW+0.003
 Sn+0.003
Neg logit contributions
 courtyard-0.003
 classroom-0.003
 neighborhood-0.003
castle-0.003
 builder-0.003
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
text+0.023
ifying+0.019
+0.019
 AW+0.019
 Sn+0.017
Neg logit contributions
 courtyard-0.021
 classroom-0.020
 neighborhood-0.020
 builder-0.020
 painter-0.020
Token she
Token ID673
Feature activation+3.72e-04

Pos logit contributions
 courtyard+0.045
castle+0.045
ashing+0.044
pit+0.043
 neighborhood+0.043
Neg logit contributions
text-0.039
-0.039
ifying-0.034
 AW-0.032
 eight-0.032
Token picked
Token ID6497
Feature activation+1.70e-02

Pos logit contributions
 courtyard+0.001
 neighborhood+0.001
 classroom+0.001
castle+0.001
 builder+0.001
Neg logit contributions
text-0.001
ifying-0.001
 AW-0.001
-0.001
 Sn-0.001
Token up
Token ID510
Feature activation+4.39e-02

Pos logit contributions
 courtyard+0.024
 classroom+0.023
 neighborhood+0.023
 builder+0.022
 painter+0.022
Neg logit contributions
text-0.039
ifying-0.033
-0.033
 AW-0.032
 Sn-0.031
Token her
Token ID607
Feature activation+1.87e-04

Pos logit contributions
 courtyard+0.076
 classroom+0.075
 neighborhood+0.074
 painter+0.073
 builder+0.072
Neg logit contributions
text-0.084
ifying-0.069
 AW-0.068
-0.066
 Sn-0.062
Token bag
Token ID6131
Feature activation-2.34e-02

Pos logit contributions
 classroom+0.000
 courtyard+0.000
 neighborhood+0.000
castle+0.000
 painter+0.000
Neg logit contributions
text-0.000
-0.000
ifying-0.000
 AW-0.000
 Sn-0.000
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
text+0.046
+0.040
ifying+0.038
 AW+0.036
 Sn+0.036
Neg logit contributions
 courtyard-0.038
 classroom-0.038
 neighborhood-0.037
 painter-0.037
 builder-0.036
Token continued
Token ID3767
Feature activation+1.96e-02

Pos logit contributions
text+0.028
ifying+0.023
+0.022
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.024
 classroom-0.024
 neighborhood-0.023
castle-0.023
 builder-0.023
Token on
Token ID319
Feature activation-1.71e-02

Pos logit contributions
 courtyard+0.033
 classroom+0.032
 neighborhood+0.032
castle+0.032
 painter+0.031
Neg logit contributions
text-0.038
-0.032
ifying-0.031
 AW-0.030
 Sn-0.030
Token bad
Token ID2089
Feature activation+1.63e-02

Pos logit contributions
 courtyard+0.030
 studio+0.029
 classroom+0.028
 builder+0.028
 grounds+0.028
Neg logit contributions
text-0.032
-0.029
 AW-0.027
 rib-0.027
 tal-0.026
Token wolf
Token ID17481
Feature activation-7.71e-03

Pos logit contributions
 studio+0.029
 courtyard+0.029
icker+0.027
castle+0.027
 classroom+0.027
Neg logit contributions
text-0.027
-0.024
 ho-0.023
 Sn-0.023
 lun-0.023
Token!
Token ID0
Feature activation+9.23e-04

Pos logit contributions
text+0.016
ifying+0.013
 AW+0.013
+0.012
 Sn+0.012
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.012
 painter-0.012
castle-0.012
Token "
Token ID366
Feature activation-4.55e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 neighborhood+0.001
 builder+0.001
 painter+0.001
Neg logit contributions
text-0.002
ifying-0.002
-0.002
 AW-0.002
 rib-0.002
TokenOh
Token ID5812
Feature activation-1.63e-02

Pos logit contributions
text+0.009
 tal+0.007
 AW+0.007
ifying+0.007
 eight+0.007
Neg logit contributions
 painter-0.007
 neighborhood-0.007
 artist-0.007
 courtyard-0.007
 classroom-0.007
Token no
Token ID645
Feature activation-1.90e-04

Pos logit contributions
text+0.030
ifying+0.026
+0.025
raw+0.024
 reins+0.023
Neg logit contributions
 courtyard-0.026
 builder-0.026
castle-0.026
 structure-0.025
 painter-0.025
Token!"
Token ID2474
Feature activation+2.28e-02

Pos logit contributions
text+0.000
ifying+0.000
+0.000
 Sn+0.000
 AW+0.000
Neg logit contributions
 courtyard-0.000
 classroom-0.000
 painter-0.000
castle-0.000
 builder-0.000
Token Lily
Token ID20037
Feature activation-8.93e-03

Pos logit contributions
 courtyard+0.034
 classroom+0.033
 neighborhood+0.031
 painter+0.031
 builder+0.031
Neg logit contributions
text-0.050
ifying-0.043
-0.041
 AW-0.040
 Sn-0.038
Token cried
Token ID16896
Feature activation-3.19e-02

Pos logit contributions
text+0.018
ifying+0.016
 AW+0.015
 rib+0.014
 Sn+0.014
Neg logit contributions
 courtyard-0.015
 neighborhood-0.014
 classroom-0.014
 painter-0.014
 builder-0.014
Token.
Token ID13
Feature activation+9.25e-04

Pos logit contributions
text+0.068
ifying+0.057
+0.057
 AW+0.055
 Sn+0.053
Neg logit contributions
 courtyard-0.048
 classroom-0.047
 neighborhood-0.046
 builder-0.046
 painter-0.045
Token But
Token ID887
Feature activation-7.35e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
castle+0.001
 neighborhood+0.001
 builder+0.001
Neg logit contributions
text-0.002
-0.002
ifying-0.002
 Sn-0.002
 AW-0.002
Token and
Token ID290
Feature activation+1.78e-03

Pos logit contributions
text+0.015
ifying+0.012
+0.012
 AW+0.012
 Sn+0.011
Neg logit contributions
 courtyard-0.010
 classroom-0.010
 neighborhood-0.010
castle-0.010
 painter-0.010
Token it
Token ID340
Feature activation-1.81e-04

Pos logit contributions
 courtyard+0.003
 neighborhood+0.003
 classroom+0.003
castle+0.003
 builder+0.003
Neg logit contributions
text-0.003
-0.003
ifying-0.003
 AW-0.003
 rib-0.003
Token felt
Token ID2936
Feature activation-3.14e-03

Pos logit contributions
text+0.000
+0.000
ifying+0.000
 AW+0.000
 Sn+0.000
Neg logit contributions
 courtyard-0.000
 classroom-0.000
 neighborhood-0.000
 painter-0.000
castle-0.000
Token perfect
Token ID2818
Feature activation-6.41e-04

Pos logit contributions
text+0.007
ifying+0.006
+0.006
 AW+0.005
 Sn+0.005
Neg logit contributions
 courtyard-0.005
 classroom-0.005
 neighborhood-0.005
 builder-0.004
castle-0.004
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
text+0.001
+0.001
ifying+0.001
 AW+0.001
 Sn+0.001
Neg logit contributions
 courtyard-0.001
 classroom-0.001
 neighborhood-0.001
 painter-0.001
 builder-0.001
Token
Token ID198
Feature activation-7.87e-03

Pos logit contributions
text+0.005
ifying+0.004
 AW+0.004
 rib+0.004
 ho+0.004
Neg logit contributions
 neighborhood-0.004
 classroom-0.003
 courtyard-0.003
 neighbourhood-0.003
 studio-0.003
Token
Token ID198
Feature activation-1.23e-03

Pos logit contributions
text+0.017
ifying+0.017
 AW+0.015
 rib+0.015
 ho+0.014
Neg logit contributions
 neighborhood-0.011
 courtyard-0.010
 classroom-0.010
 neighbourhood-0.009
 grounds-0.009
TokenBut
Token ID1537
Feature activation-1.94e-03

Pos logit contributions
text+0.002
ifying+0.002
 tal+0.002
 jaws+0.002
 AW+0.002
Neg logit contributions
 neighborhood-0.002
 courtyard-0.002
 grounds-0.002
 classroom-0.002
 neighbourhood-0.002
Token then
Token ID788
Feature activation-6.92e-03

Pos logit contributions
text+0.004
ifying+0.004
 AW+0.004
+0.003
 eight+0.003
Neg logit contributions
 courtyard-0.003
 neighborhood-0.003
 classroom-0.003
 painter-0.003
 artist-0.003
Token a
Token ID257
Feature activation-2.68e-02

Pos logit contributions
text+0.016
ifying+0.015
 AW+0.014
 tal+0.013
 eight+0.013
Neg logit contributions
 courtyard-0.008
 neighborhood-0.008
 classroom-0.008
 painter-0.008
 artist-0.008
Token strange
Token ID6283
Feature activation+1.02e-02

Pos logit contributions
text+0.057
+0.048
ifying+0.048
 AW+0.046
 Sn+0.045
Neg logit contributions
 courtyard-0.041
 classroom-0.039
 neighborhood-0.039
castle-0.038
 builder-0.038
Token saw
Token ID2497
Feature activation+2.52e-02

Pos logit contributions
text+0.024
+0.022
 Sn+0.021
 Fl+0.021
ifying+0.020
Neg logit contributions
castle-0.018
 builder-0.017
 painter-0.017
 artist-0.017
 courtyard-0.016
Token a
Token ID257
Feature activation-1.67e-02

Pos logit contributions
 courtyard+0.057
 classroom+0.056
 neighborhood+0.055
castle+0.055
 builder+0.055
Neg logit contributions
text-0.035
-0.027
ifying-0.026
 Sn-0.024
 AW-0.024
Token big
Token ID1263
Feature activation+1.74e-02

Pos logit contributions
text+0.028
+0.025
ifying+0.025
 Sn+0.023
 AW+0.023
Neg logit contributions
 courtyard-0.030
castle-0.028
 neighborhood-0.028
 classroom-0.028
 grounds-0.028
Token white
Token ID2330
Feature activation+6.22e-03

Pos logit contributions
 courtyard+0.028
 grounds+0.027
 neighborhood+0.027
 builder+0.026
castle+0.026
Neg logit contributions
text-0.032
-0.029
ifying-0.028
 Sn-0.027
 AW-0.027
Token factory
Token ID8860
Feature activation-6.47e-03

Pos logit contributions
 courtyard+0.010
 classroom+0.009
 neighborhood+0.009
castle+0.009
 builder+0.009
Neg logit contributions
text-0.012
ifying-0.011
-0.011
 AW-0.010
 Sn-0.010
Token.
Token ID13
Feature activation+4.21e-04

Pos logit contributions
text+0.011
+0.010
ifying+0.010
 Sn+0.009
 AW+0.009
Neg logit contributions
 courtyard-0.011
 neighborhood-0.011
 classroom-0.011
 artist-0.011
castle-0.011
Token She
Token ID1375
Feature activation+8.28e-03

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 neighborhood+0.001
castle+0.001
 builder+0.001
Neg logit contributions
text-0.001
-0.001
ifying-0.001
 AW-0.001
 Sn-0.001
Token was
Token ID373
Feature activation+2.25e-02

Pos logit contributions
 courtyard+0.012
 classroom+0.012
 neighborhood+0.012
castle+0.012
 builder+0.012
Neg logit contributions
text-0.018
-0.015
ifying-0.015
 AW-0.014
 Sn-0.014
Token curious
Token ID11040
Feature activation+1.09e-02

Pos logit contributions
castle+0.042
 courtyard+0.041
 studio+0.039
 writer+0.039
pit+0.038
Neg logit contributions
-0.037
text-0.034
 eight-0.034
 Sn-0.032
 Fl-0.030
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
 courtyard+0.019
castle+0.019
 builder+0.018
 studio+0.018
 classroom+0.018
Neg logit contributions
text-0.020
-0.019
ifying-0.016
 rib-0.015
 Sn-0.015
Token went
Token ID1816
Feature activation+1.20e-02

Pos logit contributions
castle+0.025
 courtyard+0.024
 classroom+0.023
 neighborhood+0.023
pit+0.023
Neg logit contributions
text-0.023
-0.023
 eight-0.021
 Sn-0.020
 rib-0.020
Token stuck
Token ID7819
Feature activation-1.53e-02

Pos logit contributions
text+0.008
ifying+0.006
 AW+0.006
+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.005
castle-0.005
 builder-0.005
 classroom-0.005
 neighborhood-0.005
Token in
Token ID287
Feature activation-1.62e-02

Pos logit contributions
text+0.036
 AW+0.029
ifying+0.028
+0.028
 Sn+0.027
Neg logit contributions
 courtyard-0.021
castle-0.020
 builder-0.020
 neighborhood-0.020
 classroom-0.019
Token the
Token ID262
Feature activation+1.33e-02

Pos logit contributions
text+0.035
 AW+0.028
ifying+0.027
+0.025
haw+0.025
Neg logit contributions
 classroom-0.025
 courtyard-0.025
 neighborhood-0.025
 building-0.024
 builder-0.024
Token branches
Token ID13737
Feature activation-3.31e-03

Pos logit contributions
 courtyard+0.020
 neighborhood+0.019
 classroom+0.019
 painter+0.018
 builder+0.018
Neg logit contributions
text-0.030
ifying-0.025
 AW-0.025
-0.024
 Sn-0.024
Token!
Token ID0
Feature activation-3.64e-03

Pos logit contributions
text+0.007
ifying+0.006
+0.006
 AW+0.006
 Sn+0.005
Neg logit contributions
 courtyard-0.005
 classroom-0.005
 neighborhood-0.005
castle-0.005
 builder-0.005
Token
Token ID198
Feature activation-4.57e-03

Pos logit contributions
text+0.008
 AW+0.007
 rib+0.007
ifying+0.007
 ho+0.006
Neg logit contributions
 neighborhood-0.005
 classroom-0.005
 courtyard-0.005
 builder-0.004
 neighbourhood-0.004
Token
Token ID198
Feature activation+7.70e-05

Pos logit contributions
text+0.010
ifying+0.009
 AW+0.009
 rib+0.009
 ho+0.008
Neg logit contributions
 neighborhood-0.006
 courtyard-0.006
 classroom-0.006
 neighbourhood-0.006
 builder-0.006
TokenThe
Token ID464
Feature activation-5.37e-03

Pos logit contributions
 neighborhood+0.000
 courtyard+0.000
 builder+0.000
 classroom+0.000
 painter+0.000
Neg logit contributions
text-0.000
ifying-0.000
 AW-0.000
 rib-0.000
 tal-0.000
Token brave
Token ID14802
Feature activation-1.51e-02

Pos logit contributions
text+0.012
ifying+0.010
 AW+0.010
+0.010
 Sn+0.009
Neg logit contributions
 courtyard-0.008
 neighborhood-0.008
 painter-0.008
 builder-0.008
 classroom-0.008
Token lion
Token ID18744
Feature activation+1.15e-03

Pos logit contributions
text+0.032
ifying+0.027
+0.026
 AW+0.026
 Sn+0.025
Neg logit contributions
 courtyard-0.023
 builder-0.023
 painter-0.023
 neighborhood-0.023
 classroom-0.023
Token was
Token ID373
Feature activation+1.63e-02

Pos logit contributions
 courtyard+0.002
 neighborhood+0.002
castle+0.002
 classroom+0.002
 builder+0.002
Neg logit contributions
text-0.002
ifying-0.002
 AW-0.002
-0.002
 Sn-0.002
Token so
Token ID523
Feature activation+7.07e-03

Pos logit contributions
text+0.034
+0.031
ifying+0.029
 AW+0.028
 Sn+0.027
Neg logit contributions
 courtyard-0.025
 neighborhood-0.024
 classroom-0.024
castle-0.024
 builder-0.023
Token tiny
Token ID7009
Feature activation+7.11e-03

Pos logit contributions
 courtyard+0.011
 classroom+0.011
 neighborhood+0.011
castle+0.010
 builder+0.010
Neg logit contributions
text-0.015
-0.013
ifying-0.013
 AW-0.012
 eight-0.011
Token before
Token ID878
Feature activation-3.53e-02

Pos logit contributions
 courtyard+0.011
 classroom+0.010
 painter+0.010
 artist+0.010
 neighborhood+0.010
Neg logit contributions
text-0.015
 AW-0.012
ifying-0.012
-0.012
 Sn-0.012
Token,
Token ID11
Feature activation+1.91e-02

Pos logit contributions
text+0.077
ifying+0.065
 AW+0.063
+0.063
 Sn+0.060
Neg logit contributions
 courtyard-0.052
 classroom-0.051
 neighborhood-0.051
castle-0.048
 painter-0.048
Token so
Token ID523
Feature activation+1.88e-03

Pos logit contributions
 courtyard+0.024
 classroom+0.023
 neighborhood+0.023
 painter+0.022
 grounds+0.022
Neg logit contributions
text-0.047
ifying-0.039
 AW-0.038
 Sn-0.036
-0.036
Token she
Token ID673
Feature activation-2.14e-03

Pos logit contributions
 courtyard+0.002
 classroom+0.002
 neighborhood+0.002
 builder+0.002
 painter+0.002
Neg logit contributions
text-0.005
ifying-0.004
-0.004
 AW-0.004
 Sn-0.004
Token decided
Token ID3066
Feature activation-4.88e-03

Pos logit contributions
text+0.005
ifying+0.004
 AW+0.004
 Sn+0.004
+0.004
Neg logit contributions
 courtyard-0.003
 neighborhood-0.003
 painter-0.003
 classroom-0.003
 grounds-0.003
Token to
Token ID284
Feature activation+6.71e-03

Pos logit contributions
text+0.010
+0.008
ifying+0.008
 AW+0.008
 Sn+0.007
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.008
 builder-0.008
Token remove
Token ID4781
Feature activation+1.30e-02

Pos logit contributions
 courtyard+0.011
 classroom+0.011
 neighborhood+0.011
castle+0.011
 builder+0.011
Neg logit contributions
text-0.013
-0.011
ifying-0.011
 AW-0.010
 Sn-0.010
Token it
Token ID340
Feature activation+3.68e-03

Pos logit contributions
 classroom+0.020
 courtyard+0.020
 painter+0.019
 artist+0.018
 studio+0.018
Neg logit contributions
text-0.028
ifying-0.023
 AW-0.022
-0.020
 Sn-0.020
Token.
Token ID13
Feature activation-1.40e-04

Pos logit contributions
 courtyard+0.006
 classroom+0.006
 neighborhood+0.006
castle+0.006
 builder+0.006
Neg logit contributions
text-0.007
ifying-0.006
-0.006
 AW-0.006
 Sn-0.005
Token with
Token ID351
Feature activation+2.75e-03

Pos logit contributions
 courtyard+0.033
castle+0.031
 painter+0.031
 classroom+0.031
 builder+0.031
Neg logit contributions
text-0.037
-0.031
 AW-0.029
 rib-0.028
 Sn-0.028
Token the
Token ID262
Feature activation+6.52e-03

Pos logit contributions
 courtyard+0.006
 classroom+0.005
 neighborhood+0.005
castle+0.005
 builder+0.005
Neg logit contributions
text-0.005
-0.004
ifying-0.004
 AW-0.003
 Sn-0.003
Token children
Token ID1751
Feature activation+4.85e-03

Pos logit contributions
 courtyard+0.011
 classroom+0.010
 neighborhood+0.010
castle+0.010
 builder+0.010
Neg logit contributions
text-0.013
-0.011
ifying-0.011
 AW-0.011
 Sn-0.010
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
 courtyard+0.008
 classroom+0.008
castle+0.008
 neighborhood+0.008
 painter+0.008
Neg logit contributions
text-0.009
-0.008
ifying-0.007
 Sn-0.007
 eight-0.007
Token the
Token ID262
Feature activation-6.05e-03

Pos logit contributions
text+0.033
+0.027
ifying+0.026
 AW+0.025
 Sn+0.024
Neg logit contributions
 courtyard-0.041
 neighborhood-0.039
 classroom-0.039
castle-0.039
 builder-0.038
Token toys
Token ID14958
Feature activation-3.00e-02

Pos logit contributions
text+0.013
+0.011
ifying+0.011
 AW+0.010
 Sn+0.010
Neg logit contributions
 courtyard-0.009
 classroom-0.009
 neighborhood-0.009
 painter-0.009
castle-0.009
Token.
Token ID13
Feature activation+1.31e-03

Pos logit contributions
text+0.044
+0.039
ifying+0.034
 rib+0.034
 Sn+0.034
Neg logit contributions
 courtyard-0.060
 classroom-0.059
 neighborhood-0.058
 painter-0.057
 artist-0.057
Token They
Token ID1119
Feature activation-4.82e-03

Pos logit contributions
 courtyard+0.002
 classroom+0.002
 neighborhood+0.002
 painter+0.002
 builder+0.002
Neg logit contributions
text-0.003
ifying-0.002
-0.002
 AW-0.002
 Sn-0.002
Token have
Token ID423
Feature activation-2.83e-02

Pos logit contributions
text+0.008
+0.007
ifying+0.006
 Sn+0.006
 AW+0.006
Neg logit contributions
 courtyard-0.009
 neighborhood-0.009
 classroom-0.009
castle-0.009
 builder-0.009
Token fun
Token ID1257
Feature activation+3.59e-02

Pos logit contributions
text+0.067
ifying+0.062
 AW+0.059
+0.058
 Sn+0.055
Neg logit contributions
 courtyard-0.036
 neighborhood-0.036
 classroom-0.035
castle-0.032
 painter-0.032
Token.
Token ID13
Feature activation+4.55e-03

Pos logit contributions
 courtyard+0.065
 classroom+0.062
castle+0.061
 neighborhood+0.060
 grounds+0.060
Neg logit contributions
text-0.065
-0.056
ifying-0.054
 Sn-0.051
 AW-0.050
Token started
Token ID2067
Feature activation+2.16e-02

Pos logit contributions
text+0.002
ifying+0.002
 AW+0.002
+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
castle-0.002
 painter-0.002
Token eating
Token ID6600
Feature activation+1.65e-02

Pos logit contributions
 courtyard+0.034
 classroom+0.033
 neighborhood+0.033
castle+0.032
 painter+0.032
Neg logit contributions
text-0.045
-0.038
ifying-0.038
 AW-0.036
 Sn-0.035
Token her
Token ID607
Feature activation+1.44e-02

Pos logit contributions
 courtyard+0.031
 classroom+0.030
 neighborhood+0.030
 builder+0.030
castle+0.030
Neg logit contributions
text-0.029
-0.023
ifying-0.023
 AW-0.022
 Sn-0.021
Token ice
Token ID4771
Feature activation+3.23e-02

Pos logit contributions
 courtyard+0.021
 classroom+0.021
 neighborhood+0.020
castle+0.020
 painter+0.020
Neg logit contributions
text-0.033
ifying-0.027
 AW-0.027
-0.026
 Sn-0.025
Token-
Token ID12
Feature activation-1.13e-02

Pos logit contributions
 courtyard+0.047
 classroom+0.045
 neighborhood+0.045
castle+0.044
 builder+0.044
Neg logit contributions
text-0.071
-0.060
ifying-0.060
 AW-0.058
 Sn-0.056
Tokencream
Token ID36277
Feature activation-5.02e-02

Pos logit contributions
text+0.020
ifying+0.015
 AW+0.015
 eight+0.014
+0.014
Neg logit contributions
 courtyard-0.021
castle-0.021
 artist-0.021
 painter-0.020
 builder-0.020
Token.
Token ID13
Feature activation-5.98e-03

Pos logit contributions
text+0.099
+0.085
ifying+0.084
 AW+0.079
 Sn+0.078
Neg logit contributions
 courtyard-0.082
 classroom-0.080
 neighborhood-0.079
 painter-0.077
 builder-0.077
Token
Token ID220
Feature activation+3.89e-03

Pos logit contributions
text+0.011
+0.009
ifying+0.009
 AW+0.008
 Sn+0.008
Neg logit contributions
 courtyard-0.011
 classroom-0.011
 neighborhood-0.011
castle-0.011
 builder-0.010
Token
Token ID198
Feature activation-9.21e-03

Pos logit contributions
 neighborhood+0.007
 courtyard+0.007
 classroom+0.006
 builder+0.006
 painter+0.006
Neg logit contributions
text-0.007
ifying-0.006
 AW-0.006
-0.006
 rib-0.006
Token
Token ID198
Feature activation-2.68e-03

Pos logit contributions
text+0.017
ifying+0.014
 AW+0.013
+0.013
 Sn+0.013
Neg logit contributions
 courtyard-0.016
 classroom-0.016
 neighborhood-0.016
 painter-0.016
 builder-0.016
TokenAfter
Token ID3260
Feature activation-1.58e-02

Pos logit contributions
text+0.006
ifying+0.005
+0.005
 AW+0.005
 Sn+0.005
Neg logit contributions
 courtyard-0.004
 classroom-0.004
 neighborhood-0.004
 painter-0.004
 builder-0.003
Token have
Token ID423
Feature activation-7.22e-03

Pos logit contributions
text+0.027
ifying+0.023
+0.022
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.018
 classroom-0.017
 neighborhood-0.017
castle-0.017
 builder-0.017
Token some
Token ID617
Feature activation+7.46e-04

Pos logit contributions
text+0.016
ifying+0.014
 AW+0.014
se+0.013
ified+0.013
Neg logit contributions
 courtyard-0.009
 neighborhood-0.009
 classroom-0.009
 building-0.008
 meeting-0.008
Token crack
Token ID8469
Feature activation+2.08e-04

Pos logit contributions
 courtyard+0.001
 neighborhood+0.001
 classroom+0.001
 studio+0.001
 artist+0.001
Neg logit contributions
text-0.002
ifying-0.001
 AW-0.001
-0.001
 Sn-0.001
Tokeners
Token ID364
Feature activation-1.37e-02

Pos logit contributions
castle+0.000
 courtyard+0.000
 builder+0.000
 classroom+0.000
 grounds+0.000
Neg logit contributions
text-0.000
ifying-0.000
 AW-0.000
-0.000
 eight-0.000
Token and
Token ID290
Feature activation-1.97e-02

Pos logit contributions
text+0.028
ifying+0.023
 AW+0.023
+0.023
 eight+0.021
Neg logit contributions
 courtyard-0.022
 classroom-0.021
castle-0.021
 neighborhood-0.021
 builder-0.021
Token cheese
Token ID9891
Feature activation-3.19e-02

Pos logit contributions
text+0.042
 AW+0.034
ifying+0.033
+0.030
 Sn+0.030
Neg logit contributions
 classroom-0.031
 courtyard-0.031
 builder-0.030
 painter-0.030
castle-0.030
Token while
Token ID981
Feature activation-1.77e-02

Pos logit contributions
text+0.064
ifying+0.055
+0.053
 AW+0.053
 Sn+0.050
Neg logit contributions
 courtyard-0.051
castle-0.050
 classroom-0.049
 neighborhood-0.049
 builder-0.049
Token we
Token ID356
Feature activation-1.06e-02

Pos logit contributions
text+0.033
 AW+0.027
ifying+0.026
+0.025
 tal+0.024
Neg logit contributions
 classroom-0.033
castle-0.031
 courtyard-0.031
 painter-0.031
 studio-0.031
Token wait
Token ID4043
Feature activation-3.06e-03

Pos logit contributions
text+0.021
 AW+0.017
ifying+0.016
+0.016
 Sn+0.015
Neg logit contributions
 courtyard-0.018
 builder-0.017
 classroom-0.017
 neighborhood-0.017
castle-0.017
Token for
Token ID329
Feature activation-1.62e-02

Pos logit contributions
text+0.007
ifying+0.006
 AW+0.006
+0.006
 tal+0.005
Neg logit contributions
 classroom-0.004
 courtyard-0.004
castle-0.004
 studio-0.004
 builder-0.004
Token the
Token ID262
Feature activation+1.41e-04

Pos logit contributions
text+0.036
ifying+0.031
 AW+0.031
+0.029
se+0.028
Neg logit contributions
 classroom-0.023
 building-0.022
 meeting-0.021
 courtyard-0.021
 studio-0.021
Token It
Token ID632
Feature activation+6.06e-03

Pos logit contributions
castle+0.010
 courtyard+0.009
 builder+0.009
 neighborhood+0.009
pit+0.008
Neg logit contributions
text-0.011
-0.011
ifying-0.009
 eight-0.009
 AW-0.009
Token roared
Token ID45615
Feature activation-2.21e-03

Pos logit contributions
 classroom+0.011
 courtyard+0.011
 neighborhood+0.011
castle+0.011
 painter+0.011
Neg logit contributions
text-0.011
-0.010
ifying-0.009
 AW-0.008
 Sn-0.008
Token and
Token ID290
Feature activation-2.32e-02

Pos logit contributions
text+0.004
+0.004
ifying+0.004
 Sn+0.003
 eight+0.003
Neg logit contributions
 courtyard-0.004
 classroom-0.004
 neighborhood-0.004
castle-0.004
 painter-0.004
Token f
Token ID277
Feature activation-1.87e-02

Pos logit contributions
text+0.042
+0.035
ifying+0.034
 AW+0.033
 Sn+0.033
Neg logit contributions
 courtyard-0.042
 neighborhood-0.041
 classroom-0.041
castle-0.040
 painter-0.040
Tokenlic
Token ID677
Feature activation-4.06e-02

Pos logit contributions
text+0.036
ifying+0.030
+0.030
 AW+0.028
 Sn+0.027
Neg logit contributions
 courtyard-0.033
 classroom-0.032
 neighborhood-0.032
 painter-0.031
castle-0.031
Tokenkered
Token ID28970
Feature activation-3.23e-02

Pos logit contributions
text+0.065
+0.053
ifying+0.053
 AW+0.050
 Sn+0.049
Neg logit contributions
 courtyard-0.081
 classroom-0.078
 neighborhood-0.077
 builder-0.077
castle-0.077
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
text+0.065
+0.055
ifying+0.055
 AW+0.052
 Sn+0.051
Neg logit contributions
 courtyard-0.052
 classroom-0.051
 neighborhood-0.051
 painter-0.049
 builder-0.049
Token reached
Token ID4251
Feature activation-4.47e-03

Pos logit contributions
text+0.042
+0.036
ifying+0.035
 AW+0.033
 Sn+0.033
Neg logit contributions
 courtyard-0.042
 neighborhood-0.042
 classroom-0.041
castle-0.041
 painter-0.040
Token far
Token ID1290
Feature activation-1.32e-03

Pos logit contributions
text+0.008
+0.007
ifying+0.007
 AW+0.007
 Sn+0.006
Neg logit contributions
 courtyard-0.008
 classroom-0.008
 neighborhood-0.008
castle-0.007
 painter-0.007
Token up
Token ID510
Feature activation+2.18e-02

Pos logit contributions
text+0.003
+0.002
ifying+0.002
 AW+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
 builder-0.002
castle-0.002
Token into
Token ID656
Feature activation+3.19e-03

Pos logit contributions
 courtyard+0.035
 classroom+0.034
 neighborhood+0.034
castle+0.033
 builder+0.033
Neg logit contributions
text-0.045
ifying-0.037
-0.037
 AW-0.036
 Sn-0.034
Token They
Token ID1119
Feature activation+6.39e-03

Pos logit contributions
text+0.006
ifying+0.005
+0.005
 AW+0.005
 Sn+0.005
Neg logit contributions
 courtyard-0.005
 classroom-0.005
 neighborhood-0.005
 builder-0.005
 painter-0.005
Token lost
Token ID2626
Feature activation-4.02e-03

Pos logit contributions
castle+0.012
 classroom+0.012
 courtyard+0.012
 studio+0.011
 neighborhood+0.011
Neg logit contributions
text-0.011
-0.010
ifying-0.009
 eight-0.009
 Sn-0.009
Token their
Token ID511
Feature activation+2.87e-02

Pos logit contributions
text+0.008
+0.007
ifying+0.007
 AW+0.007
 Sn+0.006
Neg logit contributions
 courtyard-0.006
 classroom-0.006
 neighborhood-0.006
castle-0.006
 builder-0.006
Token homes
Token ID5682
Feature activation+6.73e-03

Pos logit contributions
 courtyard+0.037
 builder+0.035
castle+0.035
 classroom+0.035
 artist+0.035
Neg logit contributions
text-0.064
-0.055
ifying-0.054
 rib-0.052
 AW-0.052
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
 courtyard+0.012
 classroom+0.012
 neighborhood+0.011
 painter+0.011
 artist+0.011
Neg logit contributions
text-0.012
-0.011
ifying-0.010
 Sn-0.009
 eight-0.009
Token belongings
Token ID30985
Feature activation-3.74e-02

Pos logit contributions
text+0.042
+0.035
ifying+0.035
 AW+0.033
 Sn+0.033
Neg logit contributions
 courtyard-0.036
 classroom-0.035
 neighborhood-0.035
castle-0.035
 builder-0.034
Token.
Token ID13
Feature activation-4.58e-03

Pos logit contributions
text+0.075
+0.063
ifying+0.062
 AW+0.060
 Sn+0.058
Neg logit contributions
 courtyard-0.061
 classroom-0.059
 neighborhood-0.059
 painter-0.058
castle-0.057
Token They
Token ID1119
Feature activation+4.26e-03

Pos logit contributions
text+0.009
ifying+0.007
+0.007
 AW+0.007
 rib+0.006
Neg logit contributions
 courtyard-0.008
 neighborhood-0.008
 classroom-0.008
 builder-0.008
 painter-0.008
Token hoped
Token ID10719
Feature activation-1.75e-02

Pos logit contributions
castle+0.008
 classroom+0.008
 courtyard+0.008
 neighborhood+0.008
 painter+0.008
Neg logit contributions
text-0.008
-0.007
ifying-0.006
 Sn-0.006
 eight-0.006
Token the
Token ID262
Feature activation+1.75e-03

Pos logit contributions
text+0.032
+0.026
ifying+0.025
 AW+0.024
 Sn+0.024
Neg logit contributions
 courtyard-0.032
 classroom-0.031
 neighborhood-0.031
castle-0.031
 painter-0.030
Token rain
Token ID6290
Feature activation-1.09e-02

Pos logit contributions
 courtyard+0.002
 studio+0.002
 classroom+0.002
castle+0.002
pit+0.002
Neg logit contributions
text-0.004
-0.003
ifying-0.003
 eight-0.003
 Sn-0.003
Token pulled
Token ID5954
Feature activation+6.64e-03

Pos logit contributions
 courtyard+0.004
 neighborhood+0.003
 classroom+0.003
 builder+0.003
 painter+0.003
Neg logit contributions
text-0.005
ifying-0.004
 AW-0.004
-0.004
 Sn-0.004
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
 classroom+0.009
 builder+0.009
 painter+0.009
 courtyard+0.009
 neighborhood+0.009
Neg logit contributions
text-0.015
ifying-0.013
 AW-0.012
haw-0.011
 Sn-0.011
Token pulled
Token ID5954
Feature activation+9.72e-04

Pos logit contributions
text+0.038
ifying+0.035
raw+0.029
 AW+0.029
+0.029
Neg logit contributions
 courtyard-0.024
 builder-0.024
 classroom-0.023
 building-0.022
 painter-0.022
Token but
Token ID475
Feature activation-1.03e-02

Pos logit contributions
 courtyard+0.001
 classroom+0.001
 builder+0.001
 neighborhood+0.001
 painter+0.001
Neg logit contributions
text-0.002
ifying-0.002
 AW-0.002
-0.002
 Sn-0.002
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
text+0.017
+0.014
ifying+0.014
 AW+0.013
 Sn+0.013
Neg logit contributions
 courtyard-0.021
 classroom-0.020
 neighborhood-0.020
castle-0.020
 painter-0.020
Token wouldn
Token ID3636
Feature activation-8.89e-02

Pos logit contributions
text+0.021
ifying+0.018
+0.017
 AW+0.017
 Sn+0.016
Neg logit contributions
 courtyard-0.016
 builder-0.016
castle-0.015
 neighborhood-0.015
 classroom-0.015
Token't
Token ID470
Feature activation-1.38e-02

Pos logit contributions
text+0.194
haw+0.189
ifying+0.187
 tal+0.183
 rib+0.179
Neg logit contributions
 builder-0.080
utter-0.076
 painter-0.075
 outside-0.073
line-0.073
Token open
Token ID1280
Feature activation-5.86e-03

Pos logit contributions
text+0.028
ifying+0.023
+0.022
 AW+0.022
 Sn+0.021
Neg logit contributions
 courtyard-0.023
 classroom-0.022
 builder-0.022
 neighborhood-0.021
castle-0.021
Token.
Token ID13
Feature activation-5.12e-03

Pos logit contributions
text+0.014
 AW+0.012
ifying+0.012
+0.012
 eight+0.011
Neg logit contributions
 courtyard-0.007
 classroom-0.007
 neighborhood-0.007
castle-0.007
 grounds-0.007
Token Suddenly
Token ID24975
Feature activation-3.20e-03

Pos logit contributions
text+0.009
+0.008
ifying+0.008
 AW+0.007
 Sn+0.007
Neg logit contributions
 courtyard-0.009
 classroom-0.009
 neighborhood-0.009
castle-0.009
 builder-0.009
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
text+0.008
ifying+0.007
+0.006
 AW+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.004
 classroom-0.004
 builder-0.004
 neighborhood-0.004
 painter-0.004
Token YOU
Token ID7013
Feature activation-5.72e-02

Pos logit contributions
text+0.081
ifying+0.067
 tal+0.066
 reins+0.058
 jaws+0.058
Neg logit contributions
 classroom-0.069
 studio-0.066
 meeting-0.065
 courtyard-0.063
 gym-0.063
Token WILL
Token ID17682
Feature activation-4.62e-02

Pos logit contributions
text+0.095
ifying+0.081
 Sn+0.076
 tal+0.074
 rib+0.072
Neg logit contributions
 courtyard-0.094
 studio-0.091
 grounds-0.090
 classroom-0.089
 meeting-0.088
Token GET
Token ID17151
Feature activation-8.58e-02

Pos logit contributions
text+0.070
 pounds+0.067
 degrees+0.059
 rib+0.057
 spe+0.057
Neg logit contributions
 classroom-0.076
 meeting-0.067
 building-0.066
 artist-0.066
 painter-0.064
Token IN
Token ID3268
Feature activation-5.64e-02

Pos logit contributions
text+0.148
ifying+0.123
+0.119
 Deal+0.096
Fl+0.093
Neg logit contributions
ashing-0.134
 artist-0.132
 classroom-0.127
 studio-0.125
ook-0.125
Token TR
Token ID7579
Feature activation-4.14e-02

Pos logit contributions
text+0.090
ifying+0.077
 rib+0.071
Fl+0.066
 spe+0.064
Neg logit contributions
 classroom-0.099
 studio-0.096
 building-0.093
 meeting-0.093
 neighborhood-0.091
TokenOU
Token ID2606
Feature activation-9.13e-02

Pos logit contributions
text+0.072
ifying+0.061
+0.059
 tal+0.059
 rib+0.058
Neg logit contributions
 classroom-0.072
 neighborhood-0.069
 studio-0.068
 courtyard-0.067
 painter-0.067
TokenBLE
Token ID19146
Feature activation-4.97e-02

Pos logit contributions
text+0.142
 rib+0.108
 tal+0.102
 eight+0.098
 tentacles+0.097
Neg logit contributions
ashing-0.158
 classroom-0.156
date-0.154
 painter-0.153
 neighborhood-0.151
Token!"
Token ID2474
Feature activation+1.47e-02

Pos logit contributions
text+0.090
 tal+0.065
+0.065
ifying+0.065
 rib+0.064
Neg logit contributions
castle-0.088
 courtyard-0.086
 classroom-0.086
 studio-0.084
ashing-0.084
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
 classroom+0.020
 neighborhood+0.018
 painter+0.017
 studio+0.017
 courtyard+0.016
Neg logit contributions
text-0.034
ifying-0.031
 AW-0.030
ified-0.027
 rib-0.027
Token
Token ID198
Feature activation-8.41e-03

Pos logit contributions
text+0.035
ifying+0.034
 AW+0.032
 rib+0.029
ified+0.029
Neg logit contributions
 neighborhood-0.019
 classroom-0.018
 painter-0.016
 neighbourhood-0.016
 courtyard-0.016
TokenL
Token ID43
Feature activation-3.35e-02

Pos logit contributions
text+0.018
ifying+0.016
 AW+0.016
 tal+0.015
 rib+0.014
Neg logit contributions
 classroom-0.013
 neighborhood-0.012
 painter-0.012
 courtyard-0.011
 builder-0.011
Token but
Token ID475
Feature activation-4.14e-03

Pos logit contributions
 courtyard+0.013
 classroom+0.012
 neighborhood+0.012
 builder+0.012
 painter+0.012
Neg logit contributions
text-0.027
ifying-0.023
-0.022
 AW-0.022
 Sn-0.021
Token it
Token ID340
Feature activation-9.83e-03

Pos logit contributions
text+0.008
ifying+0.007
+0.007
 AW+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.007
 classroom-0.007
 neighborhood-0.007
 builder-0.007
castle-0.007
Token felt
Token ID2936
Feature activation-1.31e-02

Pos logit contributions
text+0.020
ifying+0.017
 AW+0.016
+0.016
 Sn+0.015
Neg logit contributions
 courtyard-0.016
 neighborhood-0.015
 classroom-0.015
castle-0.015
 builder-0.015
Token tight
Token ID5381
Feature activation-2.32e-02

Pos logit contributions
text+0.029
 AW+0.024
ifying+0.023
+0.023
 Sn+0.022
Neg logit contributions
 courtyard-0.019
 classroom-0.018
 neighborhood-0.018
 builder-0.017
 grounds-0.017
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
text+0.054
ifying+0.045
 AW+0.044
+0.043
 eight+0.042
Neg logit contributions
 courtyard-0.031
 neighborhood-0.029
 classroom-0.029
castle-0.029
 grounds-0.029
Token wouldn
Token ID3636
Feature activation-8.05e-02

Pos logit contributions
text+0.048
ifying+0.039
 AW+0.038
+0.038
 Sn+0.036
Neg logit contributions
 courtyard-0.030
 classroom-0.029
 neighborhood-0.028
castle-0.027
 grounds-0.027
Token't
Token ID470
Feature activation-6.00e-03

Pos logit contributions
text+0.169
haw+0.152
 rib+0.148
ifying+0.147
 tal+0.145
Neg logit contributions
 builder-0.093
utter-0.091
 painter-0.087
line-0.086
icker-0.085
Token bud
Token ID3600
Feature activation-9.05e-03

Pos logit contributions
text+0.011
+0.010
ifying+0.009
 AW+0.009
 Sn+0.009
Neg logit contributions
 courtyard-0.011
 neighborhood-0.010
 classroom-0.010
 studio-0.010
castle-0.010
Tokenge
Token ID469
Feature activation-2.72e-02

Pos logit contributions
text+0.018
+0.015
ifying+0.015
 AW+0.014
 Sn+0.014
Neg logit contributions
 courtyard-0.015
 classroom-0.015
 neighborhood-0.015
 painter-0.014
 builder-0.014
Token.
Token ID13
Feature activation-9.02e-04

Pos logit contributions
text+0.067
ifying+0.057
 AW+0.056
+0.054
 eight+0.053
Neg logit contributions
 courtyard-0.031
 classroom-0.030
 neighborhood-0.030
castle-0.030
 builder-0.029
Token Sam
Token ID3409
Feature activation-1.31e-03

Pos logit contributions
text+0.002
ifying+0.002
 AW+0.002
 rib+0.002
+0.002
Neg logit contributions
 neighborhood-0.001
 courtyard-0.001
 classroom-0.001
 builder-0.001
 grounds-0.001
Token throw
Token ID3714
Feature activation+6.64e-04

Pos logit contributions
text+0.002
ifying+0.002
 AW+0.002
+0.002
 Sn+0.002
Neg logit contributions
 courtyard-0.002
 classroom-0.002
 neighborhood-0.002
 builder-0.002
 painter-0.002
Token the
Token ID262
Feature activation+9.97e-03

Pos logit contributions
 classroom+0.001
 courtyard+0.001
 structure+0.001
 neighborhood+0.001
 builder+0.001
Neg logit contributions
text-0.001
ifying-0.001
 AW-0.001
-0.001
 Sn-0.001
Token ball
Token ID2613
Feature activation-7.29e-03

Pos logit contributions
 classroom+0.011
 neighborhood+0.011
 painter+0.011
 structure+0.011
 courtyard+0.010
Neg logit contributions
text-0.025
-0.021
 Sn-0.021
ifying-0.020
 AW-0.020
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
text+0.014
+0.012
ifying+0.012
 AW+0.011
 eight+0.011
Neg logit contributions
 courtyard-0.012
 classroom-0.012
 neighborhood-0.012
 painter-0.012
castle-0.012
Token he
Token ID339
Feature activation-1.06e-02

Pos logit contributions
text+0.047
ifying+0.040
 AW+0.036
E+0.036
ified+0.036
Neg logit contributions
 classroom-0.027
 courtyard-0.026
 structure-0.025
castle-0.024
 painter-0.024
Token fet
Token ID11351
Feature activation-8.73e-02

Pos logit contributions
text+0.018
ifying+0.016
 AW+0.014
+0.014
 Sn+0.013
Neg logit contributions
 courtyard-0.020
 classroom-0.019
 neighborhood-0.019
 builder-0.019
 studio-0.019
Tokenches
Token ID2052
Feature activation-2.33e-02

Pos logit contributions
ifying+0.138
text+0.120
 Sn+0.119
 sle+0.115
+0.113
Neg logit contributions
 agree-0.149
icker-0.134
ook-0.130
 builder-0.129
 studio-0.129
Token it
Token ID340
Feature activation-1.88e-02

Pos logit contributions
text+0.048
ifying+0.044
 AW+0.039
 Sn+0.039
 rib+0.038
Neg logit contributions
 classroom-0.031
 courtyard-0.031
 studio-0.029
 structure-0.029
 builder-0.029
Token.
Token ID13
Feature activation-2.53e-03

Pos logit contributions
text+0.038
+0.032
ifying+0.031
 AW+0.030
 Sn+0.029
Neg logit contributions
 courtyard-0.031
 classroom-0.030
 neighborhood-0.030
 painter-0.029
castle-0.029
Token They
Token ID1119
Feature activation-4.01e-03

Pos logit contributions
text+0.006
+0.005
ifying+0.005
 AW+0.005
 rib+0.005
Neg logit contributions
 classroom-0.003
 courtyard-0.003
 neighborhood-0.003
 builder-0.003
castle-0.003
Token take
Token ID1011
Feature activation-3.04e-02

Pos logit contributions
text+0.009
ifying+0.007
 AW+0.007
+0.007
 Sn+0.007
Neg logit contributions
 courtyard-0.006
 classroom-0.006
 neighborhood-0.006
castle-0.005
 builder-0.005
Token pushed
Token ID7121
Feature activation+2.08e-02

Pos logit contributions
 courtyard+0.012
 neighborhood+0.011
 classroom+0.011
 builder+0.011
castle+0.011
Neg logit contributions
text-0.014
ifying-0.013
-0.012
 AW-0.012
 Sn-0.011
Token and
Token ID290
Feature activation-9.59e-03

Pos logit contributions
 painter+0.029
 classroom+0.029
 courtyard+0.028
 artist+0.028
 builder+0.028
Neg logit contributions
text-0.044
ifying-0.041
 AW-0.037
 Sn-0.035
-0.033
Token pulled
Token ID5954
Feature activation+4.81e-03

Pos logit contributions
text+0.020
ifying+0.017
 AW+0.015
 Sn+0.015
+0.015
Neg logit contributions
 courtyard-0.015
 classroom-0.014
 builder-0.014
 neighborhood-0.014
 grounds-0.014
Token until
Token ID1566
Feature activation-2.60e-02

Pos logit contributions
 courtyard+0.006
 classroom+0.006
 builder+0.006
 neighborhood+0.006
 painter+0.006
Neg logit contributions
text-0.011
ifying-0.010
 AW-0.009
-0.009
 Sn-0.009
Token it
Token ID340
Feature activation-1.99e-02

Pos logit contributions
text+0.058
+0.050
ifying+0.049
 AW+0.048
 Sn+0.047
Neg logit contributions
 courtyard-0.037
 classroom-0.036
 neighborhood-0.035
castle-0.035
 painter-0.034
Token wouldn
Token ID3636
Feature activation-9.37e-02

Pos logit contributions
text+0.039
ifying+0.032
+0.032
 AW+0.031
 Sn+0.030
Neg logit contributions
 courtyard-0.034
 classroom-0.033
 neighborhood-0.033
castle-0.032
 builder-0.032
Token't
Token ID470
Feature activation-1.08e-02

Pos logit contributions
text+0.190
ifying+0.178
haw+0.177
 tal+0.173
 dollars+0.170
Neg logit contributions
 builder-0.104
utter-0.104
 outside-0.101
 painter-0.096
line-0.095
Token go
Token ID467
Feature activation-1.51e-02

Pos logit contributions
text+0.020
+0.017
 AW+0.016
ifying+0.016
 eight+0.015
Neg logit contributions
 courtyard-0.019
 classroom-0.019
 neighborhood-0.019
 studio-0.018
 painter-0.018
Token any
Token ID597
Feature activation-3.36e-03

Pos logit contributions
text+0.035
ifying+0.028
+0.028
 AW+0.027
 eight+0.027
Neg logit contributions
 courtyard-0.020
 classroom-0.020
 neighborhood-0.020
castle-0.019
 builder-0.019
Token further
Token ID2252
Feature activation+2.71e-02

Pos logit contributions
text+0.008
+0.007
ifying+0.006
 AW+0.006
 Sn+0.006
Neg logit contributions
 courtyard-0.005
 classroom-0.004
 neighborhood-0.004
castle-0.004
 builder-0.004
Token.
Token ID13
Feature activation-4.63e-03

Pos logit contributions
 courtyard+0.043
 classroom+0.041
 neighborhood+0.041
castle+0.040
 builder+0.040
Neg logit contributions
text-0.056
-0.048
ifying-0.047
 AW-0.045
 Sn-0.044