TOP ACTIVATIONS
MAX = 9.965

was a little girl called Hanna . She was three years
. One day , when Hanna was playing in the garden
right into the garden . Hanna was very surprised and excited
One day , both Hannah and her mum went to
much . One day , Hannah wanted to go for a
organized ! ” added Hannah . The friends
. From then on , Hannah was determined to always succeed
As they walked around , Hannah 's eyes lit up as
was a little girl named Hannah . She was three years
was a little girl called Hannah . She loved animals very
and the other was called Hannah . One day , they
from happening . So , Hannah 's mom taught her to
. ' You look incredible Hannah !' she would say .
With her coat on , Hannah had a wonderful day in
eat lots of food and Hannah just wanted to be silly
, Tommy , Jeffrey and Hannah . They were all so
was a little bird called Hannah . She flew around all
They were all looking at Hannah and the leader of the
dad . Every day , Hannah loved to play in the
big , hairy cat . Hannah was so excited . She

MIN ACTIVATIONS
MIN = -8.899

and said , " V anish !" The toy car was
and said , " V anish !" Suddenly , his toy
They had to eat their salad . They did not like
I 'll make you a salad for dinner ." Jack thanked
She said , " V anish !" and then waited .
magic word , " V anish !" When he opened his
him a big bowl of salad . Max ate it all
When the salad was ready , Bella and
, " What a beautiful salad !" The boys began to
something and say ' v anish !' and then come out
and said , " V anish !" The couch was gone
They said , " V anish !" each time . All
sad because she loved her salad and wanted to eat all
bird found a bright yellow salad . The bird thought it
do not like to eat salad . Salad is green and
, I 'd love a salad !" His mom
Let 's make a tomato salad !" Mouse said .
had to eat soup and salad . Sam and
did not like to eat salad . One day , his
, he was making a salad . He filled it with

INTERVAL 5.249 - 9.965
CONTAINS 0.188%

in the park . The puppy was sad and hungry .
. The puppy was much cheer full er
was Grand ma ! Little B was so excited to see
and put on their p aj amas . They each pick
. She showed it to Tommy and said , " Do

INTERVAL 0.533 - 5.249
CONTAINS 26.662%

the story is that sharing can bring happiness and fun to
sit and watch the world around him . One night ,
pulled , but it was too rough for him to break
the end , they all waved goodbye and wished each other
all the coins you love counting ". M ia

INTERVAL -4.183 - 0.533
CONTAINS 72.519%

business . After he was done , Max felt much better
time , there was a very gifted dinosaur named D ippy
and played until it was time for bed .
. Her friends saw the change in her and soon she
upon a time there was a happy little girl who loved

INTERVAL -8.899 - -4.183
CONTAINS 0.631%

looked different , they still tasted just as delicious as before
The salmon used all of its strength and started to swim
, the tall man was able to grab the balloon .
it . He took a knife and cut the top off
Mr . Snow man doesn 't say anything .
Token was
Token ID373
Feature activation+5.69e-01

Pos logit contributions
 Kel+0.070
 Kenny+0.070
 F+0.068
 Hanna+0.068
 Buster+0.067
Neg logit contributions
castle-0.096
ide-0.089
 structures-0.089
 salad-0.085
ull-0.085
Token a
Token ID257
Feature activation+3.08e-01

Pos logit contributions
 Kel+0.035
 Hanna+0.034
 Kenny+0.034
 Buster+0.034
 F+0.034
Neg logit contributions
castle-0.060
ide-0.057
 structures-0.056
room-0.054
ull-0.054
Token little
Token ID1310
Feature activation+1.75e-01

Pos logit contributions
 Hanna+0.021
 Kel+0.021
 pup+0.021
 Buster+0.021
 Kenny+0.021
Neg logit contributions
castle-0.030
ide-0.028
 structures-0.028
room-0.027
ull-0.027
Token girl
Token ID2576
Feature activation+4.88e-01

Pos logit contributions
 Kel+0.014
 Hanna+0.014
 Kenny+0.014
 hesitation+0.014
 Buster+0.014
Neg logit contributions
castle-0.015
ide-0.014
 structures-0.014
 salad-0.014
ull-0.013
Token called
Token ID1444
Feature activation+1.63e+00

Pos logit contributions
 Kel+0.035
 Hanna+0.035
 F+0.034
 Kenny+0.034
 pup+0.034
Neg logit contributions
castle-0.046
ide-0.043
 structures-0.043
room-0.041
 salad-0.041
Token Hanna
Token ID43766
Feature activation+9.96e+00

Pos logit contributions
 Kel+0.152
 Hanna+0.151
 Kenny+0.151
 Buster+0.150
 buck+0.148
Neg logit contributions
castle-0.115
ide-0.110
 structures-0.105
room-0.103
ull-0.101
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 Kel+0.787
 Hanna+0.780
 Kenny+0.778
 Buster+0.771
 Hannah+0.764
Neg logit contributions
castle-0.858
ide-0.826
 structures-0.785
room-0.779
ull-0.774
Token She
Token ID1375
Feature activation-2.63e-01

Pos logit contributions
 hesitation+0.011
 training+0.011
ging+0.010
Matthew+0.010
Michael+0.010
Neg logit contributions
ide-0.005
room-0.005
tub-0.003
 salad-0.003
 bloom-0.003
Token was
Token ID373
Feature activation+5.72e-01

Pos logit contributions
castle+0.024
ide+0.023
 structures+0.023
room+0.021
 salad+0.021
Neg logit contributions
 Hanna-0.020
 F-0.020
 Kel-0.020
 buck-0.019
 Hannah-0.019
Token three
Token ID1115
Feature activation+2.59e+00

Pos logit contributions
 Kel+0.042
 Kenny+0.041
 Hanna+0.041
 Buster+0.041
 hesitation+0.041
Neg logit contributions
castle-0.051
ide-0.049
 structures-0.048
room-0.047
 salad-0.046
Token years
Token ID812
Feature activation+2.42e+00

Pos logit contributions
 Kel+0.124
 pup+0.122
 Hanna+0.122
 F+0.122
 Kenny+0.121
Neg logit contributions
castle-0.303
ide-0.287
 structures-0.280
room-0.279
ull-0.278
Token.
Token ID13
Feature activation+9.84e-02

Pos logit contributions
castle+0.052
ide+0.050
 structures+0.048
room+0.047
ull+0.047
Neg logit contributions
 Kel-0.047
 Hanna-0.046
 Kenny-0.046
 Buster-0.046
 Cindy-0.045
Token One
Token ID1881
Feature activation+9.00e-01

Pos logit contributions
 hesitation+0.011
ging+0.011
 training+0.010
leaf+0.010
uster+0.010
Neg logit contributions
ide-0.004
room-0.004
 salad-0.003
tub-0.003
ectar-0.002
Token day
Token ID1110
Feature activation-6.26e-01

Pos logit contributions
 pup+0.055
 Kel+0.054
 F+0.054
 Hanna+0.053
 Kenny+0.053
Neg logit contributions
castle-0.095
ide-0.090
 structures-0.090
room-0.087
ull-0.085
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
room+0.023
ide+0.023
ics+0.017
 salad+0.016
kins+0.015
Neg logit contributions
 hesitation-0.070
 reins-0.069
uster-0.068
leaf-0.067
 Kel-0.066
Token when
Token ID618
Feature activation-1.42e+00

Pos logit contributions
 hesitation+0.020
leaf+0.018
 grizz+0.018
 reins+0.018
 training+0.018
Neg logit contributions
ide-0.011
room-0.011
 salad-0.010
tub-0.009
kins-0.009
Token Hanna
Token ID43766
Feature activation+9.96e+00

Pos logit contributions
ide+0.060
room+0.060
 bloom+0.058
ics+0.056
 salad+0.056
Neg logit contributions
leaf-0.145
uster-0.142
 reins-0.139
 hesitation-0.138
ion-0.129
Token was
Token ID373
Feature activation+5.68e-01

Pos logit contributions
 Hanna+0.697
 buck+0.690
 Hannah+0.685
 pup+0.683
 F+0.679
Neg logit contributions
castle-0.957
 structures-0.924
ide-0.904
 salad-0.863
ull-0.846
Token playing
Token ID2712
Feature activation+1.02e+00

Pos logit contributions
 Kel+0.043
 Hanna+0.043
 Kenny+0.043
 Buster+0.042
 Cindy+0.042
Neg logit contributions
castle-0.051
ide-0.048
 structures-0.047
room-0.046
ull-0.046
Token in
Token ID287
Feature activation-4.66e-01

Pos logit contributions
 Hannah+0.067
 Kenny+0.066
 F+0.066
 Kel+0.066
 Buster+0.066
Neg logit contributions
castle-0.100
ide-0.095
ull-0.093
 structures-0.093
room-0.087
Token the
Token ID262
Feature activation-5.19e-02

Pos logit contributions
castle+0.043
ide+0.041
 structures+0.040
room+0.039
ull+0.039
Neg logit contributions
 Kel-0.034
 Hanna-0.034
 Kenny-0.033
 Buster-0.033
 Hannah-0.033
Token garden
Token ID11376
Feature activation-6.04e-01

Pos logit contributions
castle+0.005
ide+0.005
ull+0.005
anish+0.004
room+0.004
Neg logit contributions
 pup-0.004
 Kel-0.003
 F-0.003
 Hanna-0.003
 Hannah-0.003
Token right
Token ID826
Feature activation+1.65e+00

Pos logit contributions
castle+0.277
ide+0.271
 structures+0.259
ull+0.256
room+0.253
Neg logit contributions
 Hanna-0.276
 Kel-0.272
 Kenny-0.271
 Hannah-0.270
 hesitation-0.268
Token into
Token ID656
Feature activation-8.27e-01

Pos logit contributions
 Kel+0.144
 Hanna+0.143
 Kenny+0.143
 Buster+0.143
 Hannah+0.141
Neg logit contributions
castle-0.135
ide-0.126
 structures-0.121
ull-0.119
 salad-0.114
Token the
Token ID262
Feature activation-5.96e-02

Pos logit contributions
ide+0.053
 salad+0.052
ull+0.052
room+0.051
 structures+0.050
Neg logit contributions
uster-0.076
 Kenny-0.075
 Buster-0.074
Michael-0.073
Matthew-0.073
Token garden
Token ID11376
Feature activation-6.08e-01

Pos logit contributions
castle+0.005
ide+0.005
room+0.005
 structures+0.005
ull+0.005
Neg logit contributions
 pup-0.005
 Kel-0.004
 Hanna-0.004
 Cindy-0.004
 F-0.004
Token.
Token ID13
Feature activation+8.66e-02

Pos logit contributions
castle+0.057
ide+0.054
 structures+0.053
room+0.052
ull+0.051
Neg logit contributions
 Kel-0.043
 Hanna-0.042
 F-0.042
 buck-0.042
 Kenny-0.041
Token Hanna
Token ID43766
Feature activation+9.95e+00

Pos logit contributions
 hesitation+0.009
 training+0.008
Michael+0.008
Matthew+0.008
Tyler+0.008
Neg logit contributions
ide-0.004
room-0.004
 salad-0.004
 bloom-0.004
tub-0.004
Token was
Token ID373
Feature activation+5.51e-01

Pos logit contributions
 Kel+0.715
 Hanna+0.706
 F+0.699
 buck+0.696
 Kenny+0.695
Neg logit contributions
castle-0.935
 structures-0.883
ide-0.880
 salad-0.847
room-0.845
Token very
Token ID845
Feature activation-5.59e-01

Pos logit contributions
 Kel+0.042
 Hanna+0.041
 Kenny+0.041
 Buster+0.041
 Hannah+0.040
Neg logit contributions
castle-0.049
ide-0.047
 structures-0.046
ull-0.045
room-0.045
Token surprised
Token ID6655
Feature activation-1.90e+00

Pos logit contributions
castle+0.053
ide+0.051
 structures+0.049
room+0.049
 salad+0.047
Neg logit contributions
 Kel-0.041
 F-0.040
 Cindy-0.040
 Hanna-0.040
 Hannah-0.040
Token and
Token ID290
Feature activation+1.92e-01

Pos logit contributions
castle+0.155
ide+0.154
 structures+0.142
room+0.140
ull+0.138
Neg logit contributions
 Kel-0.159
 Hanna-0.158
 Buster-0.155
 Kenny-0.155
 buck-0.152
Token excited
Token ID6568
Feature activation-3.40e-01

Pos logit contributions
 Kel+0.015
 Hanna+0.015
 Kenny+0.015
 Buster+0.015
 Hannah+0.014
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.016
room-0.015
ull-0.015
Token
Token ID198
Feature activation-4.70e-01

Pos logit contributions
ide+0.034
room+0.031
ull+0.030
castle+0.029
 structures+0.027
Neg logit contributions
 hesitation-0.042
 buck-0.042
 luggage-0.042
 Kel-0.042
 training-0.041
TokenOne
Token ID3198
Feature activation+1.40e+00

Pos logit contributions
ide+0.039
castle+0.037
room+0.037
ull+0.034
 structures+0.034
Neg logit contributions
 Kel-0.039
 hesitation-0.039
 buck-0.038
 Kenny-0.038
 luggage-0.038
Token day
Token ID1110
Feature activation-5.84e-01

Pos logit contributions
 Kel+0.126
 Buster+0.125
 Kenny+0.125
 Cindy+0.123
 Hanna+0.122
Neg logit contributions
ide-0.107
castle-0.104
ull-0.099
 salad-0.097
room-0.094
Token,
Token ID11
Feature activation+2.58e-01

Pos logit contributions
ide+0.033
 salad+0.032
room+0.031
castle+0.028
kins+0.026
Neg logit contributions
 reins-0.061
uster-0.056
 training-0.055
 buck-0.054
 Kel-0.054
Token both
Token ID1111
Feature activation-1.03e+00

Pos logit contributions
 reins+0.016
 training+0.015
 Kel+0.015
 hesitation+0.015
uster+0.015
Neg logit contributions
ide-0.024
castle-0.024
 salad-0.024
room-0.023
 structures-0.022
Token Hannah
Token ID26013
Feature activation+9.88e+00

Pos logit contributions
castle+0.090
ide+0.087
 structures+0.082
ull+0.081
room+0.081
Neg logit contributions
 Kel-0.079
 Kenny-0.078
 Buster-0.078
 Hanna-0.077
 F-0.076
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
 Kel+0.796
 Hanna+0.789
 Kenny+0.783
 Buster+0.778
 Cindy+0.772
Neg logit contributions
castle-0.836
ide-0.795
 structures-0.766
room-0.751
ull-0.744
Token her
Token ID607
Feature activation-9.14e-01

Pos logit contributions
uster+0.026
 reins+0.025
 Kel+0.025
 buck+0.025
 training+0.024
Neg logit contributions
ide-0.014
room-0.013
castle-0.013
 salad-0.012
 structures-0.012
Token mum
Token ID25682
Feature activation+2.22e+00

Pos logit contributions
castle+0.069
 salad+0.063
ide+0.062
 structures+0.061
room+0.059
Neg logit contributions
 Kel-0.085
uster-0.081
 Hanna-0.080
 Kenny-0.080
 Buster-0.079
Token went
Token ID1816
Feature activation-2.20e+00

Pos logit contributions
 Kel+0.190
 Hanna+0.187
 Kenny+0.185
 Buster+0.183
 Cindy+0.182
Neg logit contributions
castle-0.179
ide-0.172
room-0.162
 structures-0.159
ull-0.158
Token to
Token ID284
Feature activation+3.47e-02

Pos logit contributions
castle+0.115
 salad+0.106
ide+0.100
 structures+0.099
ics+0.096
Neg logit contributions
 Kel-0.242
Michael-0.240
 hesitation-0.227
uster-0.225
 Ethan-0.225
Token much
Token ID881
Feature activation-3.99e-01

Pos logit contributions
castle+0.051
ide+0.047
 structures+0.047
room+0.046
ull+0.045
Neg logit contributions
 Kel-0.034
 Hanna-0.033
 F-0.033
 Kenny-0.033
 Cindy-0.032
Token.
Token ID13
Feature activation+1.44e-01

Pos logit contributions
castle+0.029
ide+0.029
 structures+0.028
 salad+0.027
room+0.026
Neg logit contributions
 Kel-0.037
 Kenny-0.036
 Hanna-0.036
 Buster-0.035
Michael-0.035
Token One
Token ID1881
Feature activation+9.42e-01

Pos logit contributions
 hesitation+0.013
 Kel+0.012
 buck+0.012
 training+0.012
 Kenny+0.012
Neg logit contributions
ide-0.011
room-0.010
 salad-0.010
 structures-0.010
castle-0.010
Token day
Token ID1110
Feature activation-5.85e-01

Pos logit contributions
 Kel+0.064
 Hanna+0.063
 Kenny+0.062
 Buster+0.062
 F+0.062
Neg logit contributions
castle-0.093
ide-0.089
 structures-0.087
room-0.085
ull-0.084
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
ide+0.026
room+0.024
 salad+0.024
castle+0.020
ics+0.019
Neg logit contributions
 reins-0.066
uster-0.064
 hesitation-0.062
 Kel-0.062
Michael-0.062
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
 hesitation+0.026
 reins+0.026
Michael+0.026
 Kel+0.025
 Kenny+0.025
Neg logit contributions
 salad-0.011
ide-0.011
room-0.010
kins-0.009
ectar-0.009
Token wanted
Token ID2227
Feature activation-7.60e-01

Pos logit contributions
 Hanna+0.749
 pup+0.741
 buck+0.737
 Hannah+0.734
 F+0.733
Neg logit contributions
castle-0.894
ide-0.849
 structures-0.844
ull-0.793
room-0.791
Token to
Token ID284
Feature activation+2.64e-02

Pos logit contributions
castle+0.062
ide+0.059
 salad+0.059
 structures+0.059
ull+0.057
Neg logit contributions
 Kel-0.063
 Kenny-0.062
 Hanna-0.062
uster-0.062
 Buster-0.061
Token go
Token ID467
Feature activation-1.92e+00

Pos logit contributions
 F+0.002
 Hanna+0.002
 Buster+0.002
 Hannah+0.002
 Cindy+0.002
Neg logit contributions
castle-0.002
ide-0.002
 structures-0.002
ull-0.002
ocado-0.002
Token for
Token ID329
Feature activation+5.84e-01

Pos logit contributions
castle+0.141
 salad+0.130
ide+0.122
ull+0.122
 structures+0.121
Neg logit contributions
 Kel-0.179
Michael-0.178
 Hannah-0.170
 Hanna-0.169
 Haley-0.168
Token a
Token ID257
Feature activation+3.51e-01

Pos logit contributions
uster+0.067
ging+0.059
 Kel+0.059
Michael+0.058
 Kenny+0.057
Neg logit contributions
 salad-0.029
 structures-0.027
 dragons-0.026
ics-0.024
 pie-0.023
Token organized
Token ID8389
Feature activation+6.51e-01

Pos logit contributions
 Kel+0.055
uster+0.055
 Kenny+0.054
 Hanna+0.054
 reins+0.053
Neg logit contributions
castle-0.054
 structures-0.048
ull-0.048
ide-0.047
 salad-0.047
Token!
Token ID0
Feature activation+6.83e-01

Pos logit contributions
 Kel+0.063
 hesitation+0.061
 Hanna+0.060
uster+0.060
 Kenny+0.060
Neg logit contributions
castle-0.045
 structures-0.043
ide-0.041
room-0.040
 salad-0.040
Tokenâ
Token ID22940
Feature activation+3.59e+00

Pos logit contributions
 hesitation+0.062
 buck+0.058
 luggage+0.058
 training+0.058
 Kel+0.054
Neg logit contributions
room-0.050
ide-0.044
castle-0.044
ull-0.043
 structures-0.042
Token
Token ID26391
Feature activation+1.14e+00

Pos logit contributions
 luggage+0.322
 hesitation+0.306
 reins+0.302
 training+0.301
 Hanna+0.298
Neg logit contributions
castle-0.282
room-0.276
ide-0.273
ull-0.246
ics-0.235
Token added
Token ID2087
Feature activation-2.27e+00

Pos logit contributions
 hesitation+0.093
 Kel+0.092
 Hanna+0.091
 training+0.090
 Buster+0.090
Neg logit contributions
castle-0.096
ide-0.090
room-0.089
 structures-0.086
ull-0.083
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
castle+0.182
 salad+0.177
 structures+0.173
room+0.170
ide+0.167
Neg logit contributions
 Kel-0.193
 Kenny-0.188
 Buster-0.185
 Hanna-0.185
 hesitation-0.183
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
 Kel+0.803
 Hanna+0.786
 Kenny+0.785
 hesitation+0.773
 Buster+0.772
Neg logit contributions
castle-0.842
ide-0.792
 structures-0.771
room-0.762
 salad-0.753
Token
Token ID198
Feature activation-4.62e-01

Pos logit contributions
 hesitation+0.014
 buck+0.013
 luggage+0.013
 training+0.013
 reins+0.012
Neg logit contributions
room-0.010
ide-0.009
castle-0.009
ill-0.008
 structures-0.008
Token
Token ID198
Feature activation-4.52e-01

Pos logit contributions
ide+0.027
ull+0.026
room+0.025
castle+0.021
ics+0.020
Neg logit contributions
 hesitation-0.050
 luggage-0.048
 training-0.048
 reins-0.046
 buck-0.046
TokenThe
Token ID464
Feature activation-5.81e-02

Pos logit contributions
ide+0.030
room+0.028
ull+0.025
castle+0.024
ics+0.022
Neg logit contributions
 hesitation-0.047
 luggage-0.045
 buck-0.043
 training-0.043
 reins-0.041
Token friends
Token ID2460
Feature activation+1.37e-01

Pos logit contributions
castle+0.004
 salad+0.004
 tower+0.004
 nation+0.004
 structure+0.004
Neg logit contributions
 Kel-0.005
uster-0.005
 Kathy-0.005
 Kenny-0.005
 Hanna-0.005
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
 Kel+0.048
 Hanna+0.048
 Buster+0.047
 Kenny+0.047
Michael+0.047
Neg logit contributions
castle-0.033
ide-0.030
 structures-0.030
room-0.028
 nation-0.027
Token From
Token ID3574
Feature activation+4.21e-01

Pos logit contributions
 buck+0.012
 hesitation+0.012
 training+0.012
 Kel+0.012
 luggage+0.012
Neg logit contributions
ide-0.011
castle-0.011
room-0.011
 structures-0.011
ull-0.010
Token then
Token ID788
Feature activation+3.26e+00

Pos logit contributions
uster+0.039
 reins+0.036
 training+0.034
 hesitation+0.034
 Kel+0.033
Neg logit contributions
castle-0.031
ide-0.028
ics-0.027
 structures-0.027
ate-0.027
Token on
Token ID319
Feature activation+1.83e+00

Pos logit contributions
uster+0.282
 Kel+0.280
 Hanna+0.278
 hesitation+0.275
 Hannah+0.272
Neg logit contributions
room-0.234
castle-0.209
ide-0.208
anish-0.206
ics-0.197
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
uster+0.174
 reins+0.168
 buck+0.166
 Kel+0.160
 hesitation+0.160
Neg logit contributions
ide-0.127
castle-0.127
 structures-0.121
room-0.116
 salad-0.114
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
uster+0.020
 buck+0.020
 Kel+0.020
 hesitation+0.020
 reins+0.020
Neg logit contributions
castle-0.020
ide-0.020
 structures-0.019
room-0.019
 salad-0.018
Token was
Token ID373
Feature activation+6.04e-01

Pos logit contributions
 Kel+0.889
 hesitation+0.887
 Kenny+0.872
uster+0.869
 Hanna+0.862
Neg logit contributions
castle-0.731
ide-0.689
room-0.670
ull-0.630
 structures-0.626
Token determined
Token ID5295
Feature activation+2.70e+00

Pos logit contributions
uster+0.055
 Kenny+0.054
 luggage+0.054
 Kel+0.053
 Buster+0.053
Neg logit contributions
castle-0.045
ide-0.040
ull-0.040
room-0.038
ics-0.037
Token to
Token ID284
Feature activation+3.18e-02

Pos logit contributions
 Kel+0.242
 Kenny+0.237
 Hanna+0.237
 Buster+0.232
uster+0.230
Neg logit contributions
castle-0.216
ide-0.201
 structures-0.191
 salad-0.187
ull-0.184
Token always
Token ID1464
Feature activation+3.35e-01

Pos logit contributions
 Kel+0.003
 Hanna+0.003
 Kenny+0.003
 Hannah+0.003
 Cindy+0.003
Neg logit contributions
castle-0.002
ide-0.002
room-0.002
 structures-0.002
ull-0.002
Token succeed
Token ID6758
Feature activation-4.76e+00

Pos logit contributions
 Kel+0.032
uster+0.031
 Hanna+0.031
 Kenny+0.031
 Hannah+0.030
Neg logit contributions
castle-0.025
ide-0.021
room-0.021
 structures-0.021
 salad-0.020
Token As
Token ID1081
Feature activation+8.35e-01

Pos logit contributions
 hesitation+0.011
 training+0.011
Michael+0.011
 buck+0.011
 Kel+0.010
Neg logit contributions
room-0.011
ide-0.010
tub-0.010
 salad-0.010
 structures-0.010
Token they
Token ID484
Feature activation-3.62e-01

Pos logit contributions
 reins+0.109
uster+0.101
 brakes+0.099
 hesitation+0.098
leaf+0.096
Neg logit contributions
ide-0.028
 salad-0.020
 nation-0.019
ics-0.019
room-0.016
Token walked
Token ID6807
Feature activation+1.25e+00

Pos logit contributions
ide+0.024
castle+0.023
room+0.021
 salad+0.019
ill+0.018
Neg logit contributions
 reins-0.035
 Kel-0.035
uster-0.035
 Buster-0.034
 pup-0.034
Token around
Token ID1088
Feature activation+1.82e+00

Pos logit contributions
Michael+0.129
 hesitation+0.128
 Kel+0.124
 reins+0.123
 training+0.122
Neg logit contributions
castle-0.073
ide-0.066
room-0.063
 structure-0.059
 structures-0.058
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 reins+0.182
uster+0.177
Matthew+0.172
Michael+0.169
 Buster+0.169
Neg logit contributions
room-0.120
ide-0.118
castle-0.115
 structures-0.108
 nation-0.106
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
 reins+0.015
 training+0.013
Michael+0.012
 hesitation+0.012
 saddle+0.012
Neg logit contributions
ide-0.022
 salad-0.022
kins-0.022
tub-0.022
room-0.022
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 Kel+0.874
 hesitation+0.848
 Kenny+0.847
 Hanna+0.841
 Buster+0.838
Neg logit contributions
castle-0.766
ide-0.717
room-0.675
 structures-0.663
 salad-0.641
Token eyes
Token ID2951
Feature activation-2.54e-01

Pos logit contributions
castle+0.079
 salad+0.070
 structures+0.067
ide+0.066
room+0.064
Neg logit contributions
 Kel-0.094
ha-0.090
Michael-0.090
 Hanna-0.089
 Kenny-0.088
Token lit
Token ID6578
Feature activation-6.99e-01

Pos logit contributions
castle+0.018
ide+0.017
room+0.016
ull+0.016
 structures+0.016
Neg logit contributions
 Kel-0.024
 Kenny-0.024
 Buster-0.023
 Hanna-0.023
 Hannah-0.023
Token up
Token ID510
Feature activation-3.09e-01

Pos logit contributions
ide+0.049
castle+0.048
 structures+0.045
room+0.042
 salad+0.042
Neg logit contributions
 hesitation-0.067
uster-0.066
Michael-0.066
 Buster-0.065
 training-0.065
Token as
Token ID355
Feature activation+8.87e-02

Pos logit contributions
castle+0.024
ide+0.023
 structures+0.022
room+0.021
 salad+0.021
Neg logit contributions
 Kel-0.028
 Hanna-0.027
 Buster-0.027
 Kenny-0.027
uster-0.027
Token was
Token ID373
Feature activation+6.04e-01

Pos logit contributions
 Kel+0.060
 Hanna+0.060
 Kenny+0.059
 Hannah+0.059
 Cindy+0.059
Neg logit contributions
castle-0.066
 structures-0.062
ide-0.062
ull-0.059
room-0.059
Token a
Token ID257
Feature activation+3.56e-01

Pos logit contributions
 Hanna+0.034
 Kel+0.034
 Cindy+0.033
 F+0.033
 Buster+0.033
Neg logit contributions
castle-0.066
ide-0.063
 structures-0.062
room-0.059
ull-0.058
Token little
Token ID1310
Feature activation+2.18e-01

Pos logit contributions
 Kel+0.026
 Hanna+0.026
 Kenny+0.026
 Buster+0.026
 Hannah+0.026
Neg logit contributions
castle-0.032
ide-0.031
 structures-0.030
room-0.029
ull-0.029
Token girl
Token ID2576
Feature activation+5.24e-01

Pos logit contributions
 Kel+0.018
 Hanna+0.018
 Kenny+0.018
 hesitation+0.018
 Buster+0.017
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.016
 salad-0.016
ull-0.016
Token named
Token ID3706
Feature activation+1.12e+00

Pos logit contributions
 Kel+0.048
 Kenny+0.048
uster+0.047
 hesitation+0.047
 Buster+0.047
Neg logit contributions
castle-0.038
ide-0.037
room-0.034
ull-0.034
 structures-0.033
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
 buck+0.094
 hesitation+0.094
 Kel+0.094
 Buster+0.094
 Kenny+0.094
Neg logit contributions
castle-0.085
ide-0.085
 structures-0.080
room-0.080
ull-0.076
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 hesitation+0.933
 training+0.919
 Kenny+0.913
 Kel+0.906
 reins+0.904
Neg logit contributions
ide-0.698
castle-0.672
room-0.623
ull-0.616
 structures-0.585
Token She
Token ID1375
Feature activation-2.31e-01

Pos logit contributions
 hesitation+0.018
 training+0.017
 buck+0.017
 reins+0.017
Michael+0.016
Neg logit contributions
ide-0.006
room-0.005
ill-0.004
ectar-0.003
 salad-0.003
Token was
Token ID373
Feature activation+6.10e-01

Pos logit contributions
castle+0.017
ide+0.017
ull+0.016
room+0.016
 structures+0.015
Neg logit contributions
 hesitation-0.021
 Kel-0.021
uster-0.021
 Kenny-0.020
 Buster-0.020
Token three
Token ID1115
Feature activation+2.63e+00

Pos logit contributions
uster+0.047
 hesitation+0.046
 Kenny+0.046
 Buster+0.046
Matthew+0.046
Neg logit contributions
castle-0.050
ide-0.049
room-0.048
 structures-0.047
ull-0.046
Token years
Token ID812
Feature activation+2.45e+00

Pos logit contributions
 Kel+0.158
 Hanna+0.156
 Buster+0.156
 Kenny+0.155
 Cindy+0.151
Neg logit contributions
castle-0.276
ide-0.271
 structures-0.264
room-0.256
 salad-0.253
Token was
Token ID373
Feature activation+6.05e-01

Pos logit contributions
 Kenny+0.061
 Kel+0.061
 F+0.060
 Hannah+0.057
 Hanna+0.056
Neg logit contributions
castle-0.114
 structures-0.108
 salad-0.105
ide-0.103
ull-0.098
Token a
Token ID257
Feature activation+3.43e-01

Pos logit contributions
 Hanna+0.035
 Kel+0.035
 Cindy+0.035
 Kenny+0.034
 Hannah+0.034
Neg logit contributions
castle-0.066
ide-0.062
 structures-0.061
room-0.059
ull-0.059
Token little
Token ID1310
Feature activation+2.06e-01

Pos logit contributions
 Hanna+0.024
 Kel+0.024
 Buster+0.023
 pup+0.023
 Kenny+0.023
Neg logit contributions
castle-0.033
ide-0.032
 structures-0.031
room-0.030
ull-0.030
Token girl
Token ID2576
Feature activation+5.20e-01

Pos logit contributions
 Kel+0.016
 Kenny+0.016
 Hanna+0.016
 hesitation+0.016
 Buster+0.016
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.016
 salad-0.016
ull-0.016
Token called
Token ID1444
Feature activation+1.66e+00

Pos logit contributions
 Kel+0.039
 Hanna+0.038
 Kenny+0.038
 F+0.038
 Buster+0.038
Neg logit contributions
castle-0.048
ide-0.045
 structures-0.044
room-0.043
ull-0.042
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
 Kel+0.152
 Kenny+0.151
 Buster+0.150
 buck+0.150
 Hanna+0.150
Neg logit contributions
castle-0.117
ide-0.115
 structures-0.108
room-0.108
ull-0.104
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
 Kel+0.730
 Hanna+0.725
 Kenny+0.714
 F+0.713
 Buster+0.711
Neg logit contributions
castle-0.913
ide-0.858
 structures-0.844
room-0.817
ull-0.811
Token She
Token ID1375
Feature activation-2.24e-01

Pos logit contributions
 hesitation+0.016
 Kel+0.015
Michael+0.015
 training+0.015
 reins+0.015
Neg logit contributions
room-0.006
ide-0.006
 salad-0.004
 structures-0.004
ectar-0.004
Token loved
Token ID6151
Feature activation-2.15e-01

Pos logit contributions
castle+0.020
ide+0.019
 structures+0.019
room+0.018
 salad+0.018
Neg logit contributions
 Hanna-0.017
 Kel-0.017
 F-0.017
 Kenny-0.017
 Hannah-0.017
Token animals
Token ID4695
Feature activation-1.62e-01

Pos logit contributions
castle+0.021
ide+0.021
 structures+0.020
room+0.020
ull+0.019
Neg logit contributions
 Kel-0.014
 Hanna-0.014
 Kenny-0.014
 F-0.014
 Cindy-0.014
Token very
Token ID845
Feature activation-5.08e-01

Pos logit contributions
castle+0.015
ide+0.014
 structures+0.014
room+0.014
ull+0.014
Neg logit contributions
 Kel-0.012
 Hanna-0.012
 F-0.011
 Kenny-0.011
 Buster-0.011
Token and
Token ID290
Feature activation+2.53e-01

Pos logit contributions
 Kel+0.225
 Kenny+0.222
 Hanna+0.221
 Buster+0.220
 hesitation+0.218
Neg logit contributions
castle-0.216
ide-0.215
room-0.198
 structures-0.196
ull-0.196
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
 Kel+0.020
 Hanna+0.020
 Kenny+0.020
 Buster+0.020
 Hannah+0.020
Neg logit contributions
castle-0.022
ide-0.021
 structures-0.020
room-0.020
ull-0.019
Token other
Token ID584
Feature activation+1.21e+00

Pos logit contributions
castle+0.002
ide+0.002
 structures+0.002
room+0.002
ull+0.002
Neg logit contributions
 Hanna-0.002
 Kel-0.002
 F-0.002
 Cindy-0.002
 pup-0.002
Token was
Token ID373
Feature activation+6.13e-01

Pos logit contributions
 Kel+0.098
 Hanna+0.097
 Kenny+0.096
 Buster+0.096
 Hannah+0.095
Neg logit contributions
castle-0.102
ide-0.097
 structures-0.094
room-0.092
ull-0.091
Token called
Token ID1444
Feature activation+1.67e+00

Pos logit contributions
 Kel+0.045
 Kenny+0.044
 Hanna+0.044
 Buster+0.044
 Hannah+0.043
Neg logit contributions
castle-0.057
ide-0.054
 structures-0.053
room-0.051
ull-0.051
Token Hannah
Token ID26013
Feature activation+9.87e+00

Pos logit contributions
 Kel+0.131
 Hanna+0.129
 Kenny+0.129
 buck+0.129
 Buster+0.128
Neg logit contributions
castle-0.142
ide-0.139
 structures-0.132
room-0.131
ull-0.128
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
 Kel+0.836
 Hanna+0.824
 Kenny+0.820
 Buster+0.807
 buck+0.806
Neg logit contributions
castle-0.789
ide-0.777
 structures-0.724
room-0.715
ull-0.707
Token One
Token ID1881
Feature activation+9.45e-01

Pos logit contributions
 training+0.016
 hesitation+0.016
 buck+0.015
 reins+0.015
 luggage+0.015
Neg logit contributions
ide-0.010
room-0.008
 structures-0.008
 salad-0.007
castle-0.007
Token day
Token ID1110
Feature activation-5.91e-01

Pos logit contributions
 Kel+0.063
 Hanna+0.063
 Kenny+0.062
 Buster+0.062
 F+0.062
Neg logit contributions
castle-0.094
ide-0.089
 structures-0.088
room-0.085
ull-0.084
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
ide+0.036
room+0.032
 salad+0.031
 structures+0.029
ics+0.028
Neg logit contributions
uster-0.062
 reins-0.061
 training-0.057
 buck-0.057
 hesitation-0.057
Token they
Token ID484
Feature activation-3.57e-01

Pos logit contributions
uster+0.024
 hesitation+0.024
 reins+0.023
 training+0.023
 buck+0.023
Neg logit contributions
ide-0.018
 salad-0.016
room-0.015
castle-0.015
 structures-0.015
Token from
Token ID422
Feature activation-5.17e-01

Pos logit contributions
castle+0.148
ide+0.148
 structures+0.137
room+0.134
 salad+0.132
Neg logit contributions
 Hanna-0.158
 Kel-0.158
 Kenny-0.156
 Cindy-0.155
 pup-0.154
Token happening
Token ID5836
Feature activation-1.98e+00

Pos logit contributions
castle+0.046
ide+0.044
 structures+0.043
room+0.041
 salad+0.041
Neg logit contributions
 Hanna-0.039
 Kel-0.039
 Kenny-0.039
uster-0.038
 buck-0.038
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
ide+0.145
castle+0.142
 structures+0.133
room+0.132
ull+0.126
Neg logit contributions
 Kel-0.175
 buck-0.175
 Hanna-0.174
 hesitation-0.173
 Kenny-0.172
Token So
Token ID1406
Feature activation-1.14e+00

Pos logit contributions
 buck+0.012
 Kel+0.012
 hesitation+0.011
 Hanna+0.011
 training+0.011
Neg logit contributions
castle-0.011
ide-0.011
 structures-0.010
room-0.010
 salad-0.010
Token,
Token ID11
Feature activation+2.57e-01

Pos logit contributions
castle+0.081
ide+0.078
 salad+0.075
 structures+0.072
room+0.071
Neg logit contributions
uster-0.104
 reins-0.102
 training-0.098
 buck-0.098
Michael-0.097
Token Hannah
Token ID26013
Feature activation+9.86e+00

Pos logit contributions
uster+0.021
 buck+0.021
 training+0.021
 reins+0.020
 luggage+0.020
Neg logit contributions
castle-0.020
ide-0.020
 salad-0.018
room-0.018
 structures-0.018
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 Kel+0.862
 Kenny+0.847
 hesitation+0.839
 training+0.831
 Hanna+0.827
Neg logit contributions
castle-0.763
ide-0.707
ull-0.680
room-0.673
 salad-0.655
Token mom
Token ID1995
Feature activation+2.85e-01

Pos logit contributions
castle+0.065
 salad+0.064
 tower+0.061
ull+0.055
 dragons+0.054
Neg logit contributions
 Kel-0.104
 Hanna-0.102
 Eleanor-0.100
 Cindy-0.099
uster-0.097
Token taught
Token ID7817
Feature activation+1.19e+00

Pos logit contributions
 Kel+0.024
 Hanna+0.023
 Kenny+0.023
 luggage+0.023
 F+0.023
Neg logit contributions
castle-0.023
ide-0.022
ull-0.021
room-0.021
 salad-0.020
Token her
Token ID607
Feature activation-9.20e-01

Pos logit contributions
 reins+0.143
 buck+0.140
 saddle+0.138
 Taxi+0.135
 training+0.134
Neg logit contributions
ide-0.048
room-0.039
iggle-0.032
 salad-0.032
ics-0.029
Token to
Token ID284
Feature activation+3.15e-02

Pos logit contributions
castle+0.068
ide+0.067
ull+0.065
 structures+0.065
 salad+0.064
Neg logit contributions
 Hanna-0.081
 Kel-0.080
 Kenny-0.080
 Buster-0.079
 Cindy-0.078
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
 Kel+0.152
 buck+0.149
 Hanna+0.149
uster+0.149
 Buster+0.149
Neg logit contributions
castle-0.136
ide-0.136
room-0.128
 structures-0.127
 salad-0.123
Token '
Token ID705
Feature activation+1.24e+00

Pos logit contributions
 hesitation+0.009
 buck+0.009
 Kel+0.009
 training+0.008
 luggage+0.008
Neg logit contributions
ide-0.008
castle-0.008
 structures-0.008
room-0.008
 salad-0.008
TokenYou
Token ID1639
Feature activation+7.86e-01

Pos logit contributions
 luggage+0.106
 Buster+0.104
 hesitation+0.103
 Kel+0.102
 Kenny+0.100
Neg logit contributions
ide-0.097
room-0.097
castle-0.092
ull-0.088
anish-0.085
Token look
Token ID804
Feature activation-1.51e+00

Pos logit contributions
 Kel+0.060
 Hanna+0.058
 Kenny+0.058
 Buster+0.057
 Cindy+0.057
Neg logit contributions
castle-0.070
ide-0.068
room-0.065
 structures-0.064
 salad-0.063
Token incredible
Token ID8082
Feature activation+3.25e-01

Pos logit contributions
castle+0.128
ide+0.121
 structures+0.120
 salad+0.120
room+0.114
Neg logit contributions
 Kel-0.122
 Buster-0.117
 Kenny-0.117
 Hanna-0.117
 buck-0.116
Token Hannah
Token ID26013
Feature activation+9.86e+00

Pos logit contributions
 Kel+0.028
 Kenny+0.027
 Hanna+0.027
 Buster+0.027
 buck+0.026
Neg logit contributions
castle-0.027
ide-0.025
 structures-0.024
 salad-0.024
room-0.023
Token!'
Token ID13679
Feature activation-2.05e-01

Pos logit contributions
 Kel+0.740
 Hanna+0.728
 Kenny+0.724
 Buster+0.718
 F+0.711
Neg logit contributions
castle-0.892
ide-0.853
 structures-0.821
room-0.810
ull-0.797
Token she
Token ID673
Feature activation+3.08e-01

Pos logit contributions
castle+0.015
ide+0.015
 salad+0.014
ull+0.014
room+0.014
Neg logit contributions
 Kel-0.017
 Kenny-0.017
uster-0.017
 luggage-0.016
 training-0.016
Token would
Token ID561
Feature activation-6.12e-01

Pos logit contributions
 Kel+0.021
 Hanna+0.021
 Kenny+0.020
 Buster+0.020
 Hannah+0.020
Neg logit contributions
castle-0.030
ide-0.029
 structures-0.028
room-0.027
ull-0.027
Token say
Token ID910
Feature activation+1.15e+00

Pos logit contributions
castle+0.051
ide+0.048
room+0.045
 structures+0.045
 salad+0.045
Neg logit contributions
 Kel-0.051
 Hanna-0.050
 Kenny-0.050
 Buster-0.050
 pup-0.049
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
 Kel+0.090
 Kenny+0.088
 Buster+0.087
 buck+0.087
 Hanna+0.086
Neg logit contributions
castle-0.097
ide-0.094
room-0.090
 structures-0.088
ull-0.088
TokenWith
Token ID3152
Feature activation-1.02e+00

Pos logit contributions
castle+0.043
ide+0.042
 structures+0.040
room+0.040
ull+0.039
Neg logit contributions
 Kel-0.035
 Hanna-0.035
 Kenny-0.034
 Buster-0.034
 hesitation-0.034
Token her
Token ID607
Feature activation-9.16e-01

Pos logit contributions
 salad+0.067
ide+0.062
castle+0.061
 structures+0.059
room+0.058
Neg logit contributions
uster-0.099
 Kel-0.094
 buck-0.092
 Hanna-0.092
 Eleanor-0.091
Token coat
Token ID13209
Feature activation+5.70e+00

Pos logit contributions
castle+0.073
 salad+0.069
 tower+0.065
 structures+0.064
 structure+0.063
Neg logit contributions
 Kel-0.080
 Kenny-0.078
 Hanna-0.077
 Cindy-0.076
Matthew-0.076
Token on
Token ID319
Feature activation+1.82e+00

Pos logit contributions
 Hanna+0.476
 Kel+0.472
 Hannah+0.466
 Hank+0.463
 Haley+0.462
Neg logit contributions
castle-0.461
ide-0.417
room-0.416
 structures-0.396
ics-0.390
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 buck+0.149
 Kel+0.147
 Hanna+0.146
 Kenny+0.146
 hesitation+0.145
Neg logit contributions
castle-0.148
ide-0.144
room-0.137
 structures-0.136
 salad-0.133
Token Hannah
Token ID26013
Feature activation+9.86e+00

Pos logit contributions
uster+0.020
 training+0.019
 hesitation+0.019
 buck+0.019
 reins+0.019
Neg logit contributions
castle-0.019
ide-0.019
 structures-0.018
 salad-0.018
room-0.018
Token had
Token ID550
Feature activation+2.85e+00

Pos logit contributions
 Kel+0.826
 Kenny+0.813
 Hanna+0.803
 Buster+0.802
 Cindy+0.797
Neg logit contributions
castle-0.807
ide-0.764
room-0.727
 structures-0.712
ull-0.706
Token a
Token ID257
Feature activation+3.52e-01

Pos logit contributions
uster+0.235
 Kel+0.233
 Kenny+0.230
 Buster+0.227
 Hanna+0.223
Neg logit contributions
castle-0.229
ide-0.212
 salad-0.209
 structures-0.208
room-0.205
Token wonderful
Token ID7932
Feature activation+3.46e+00

Pos logit contributions
 Kel+0.029
uster+0.029
 Kenny+0.027
 F+0.027
 Hanna+0.027
Neg logit contributions
castle-0.029
 salad-0.028
 nation-0.026
 tower-0.026
 structure-0.026
Token day
Token ID1110
Feature activation-5.90e-01

Pos logit contributions
 Kel+0.287
 Hanna+0.284
uster+0.283
 F+0.279
 Kenny+0.278
Neg logit contributions
castle-0.305
ide-0.278
ull-0.269
 salad-0.267
room-0.262
Token in
Token ID287
Feature activation-4.11e-01

Pos logit contributions
castle+0.038
ide+0.036
room+0.034
 salad+0.032
ull+0.032
Neg logit contributions
 Kel-0.060
 Eleanor-0.058
 Hanna-0.058
 Kenny-0.058
Michael-0.057
Token eat
Token ID4483
Feature activation-2.83e+00

Pos logit contributions
uster+0.003
 Kel+0.003
 Kenny+0.003
 Hanna+0.003
 Buster+0.003
Neg logit contributions
castle-0.002
ide-0.002
room-0.002
ull-0.002
 structures-0.002
Token lots
Token ID6041
Feature activation-1.28e+00

Pos logit contributions
castle+0.233
ide+0.226
 salad+0.223
 structures+0.219
room+0.210
Neg logit contributions
 Kel-0.238
 Hanna-0.232
uster-0.230
 Kenny-0.229
 buck-0.226
Token of
Token ID286
Feature activation+8.36e-01

Pos logit contributions
ide+0.099
castle+0.095
 salad+0.091
 structures+0.091
room+0.086
Neg logit contributions
 Kel-0.117
 Hanna-0.114
 Kenny-0.112
 Buster-0.112
 buck-0.111
Token food
Token ID2057
Feature activation-5.44e-01

Pos logit contributions
uster+0.079
 Kel+0.078
 Hanna+0.077
 Kenny+0.077
 Buster+0.076
Neg logit contributions
 salad-0.061
 structures-0.058
castle-0.058
ide-0.056
ull-0.052
Token and
Token ID290
Feature activation+2.40e-01

Pos logit contributions
castle+0.049
ide+0.047
 structures+0.045
room+0.044
 salad+0.044
Neg logit contributions
 Kel-0.042
 Hanna-0.041
 Kenny-0.041
 Buster-0.040
 Cindy-0.040
Token Hannah
Token ID26013
Feature activation+9.86e+00

Pos logit contributions
 Kel+0.020
 Hanna+0.020
 Kenny+0.020
 Buster+0.020
 Hannah+0.020
Neg logit contributions
castle-0.020
ide-0.019
 structures-0.018
room-0.018
ull-0.017
Token just
Token ID655
Feature activation+1.26e+00

Pos logit contributions
 Kel+0.806
 Kenny+0.783
 Hanna+0.779
 Buster+0.772
uster+0.768
Neg logit contributions
castle-0.818
ide-0.798
room-0.736
 structures-0.731
 salad-0.729
Token wanted
Token ID2227
Feature activation-7.71e-01

Pos logit contributions
 Kel+0.110
 Hanna+0.109
 Kenny+0.108
 Buster+0.108
 Hannah+0.106
Neg logit contributions
castle-0.098
ide-0.093
ull-0.087
 salad-0.087
 structures-0.086
Token to
Token ID284
Feature activation+1.51e-02

Pos logit contributions
 salad+0.057
ide+0.054
castle+0.052
 structures+0.051
ull+0.050
Neg logit contributions
uster-0.074
 Kel-0.073
 Kenny-0.069
 Haley-0.069
 Hanna-0.069
Token be
Token ID307
Feature activation+2.72e-01

Pos logit contributions
 Kel+0.001
 Hanna+0.001
 Kenny+0.001
 Buster+0.001
 Hannah+0.001
Neg logit contributions
castle-0.001
ide-0.001
 structures-0.001
room-0.001
ull-0.001
Token silly
Token ID14397
Feature activation-2.28e-01

Pos logit contributions
 Hanna+0.019
 Hannah+0.018
 F+0.018
 Buster+0.018
 Cindy+0.018
Neg logit contributions
castle-0.026
ide-0.026
 structures-0.025
room-0.024
ull-0.023
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
 Kel+0.009
 Kenny+0.009
 F+0.009
 Buster+0.009
 Hanna+0.009
Neg logit contributions
castle-0.012
 structures-0.011
ide-0.011
room-0.010
 salad-0.010
Token Tommy
Token ID19919
Feature activation+5.34e+00

Pos logit contributions
 Kel+0.022
 Hanna+0.022
 Kenny+0.022
 Buster+0.022
 Hannah+0.021
Neg logit contributions
castle-0.019
ide-0.018
 structures-0.018
room-0.017
ull-0.017
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
 Kel+0.321
 pup+0.319
 buck+0.318
 F+0.318
 Buster+0.318
Neg logit contributions
castle-0.565
 structures-0.531
ide-0.520
room-0.507
 salad-0.497
Token Jeffrey
Token ID19627
Feature activation+4.07e-01

Pos logit contributions
 Hanna+0.018
 Kel+0.018
 F+0.018
 Kenny+0.018
 Buster+0.018
Neg logit contributions
castle-0.023
 structures-0.022
ide-0.021
room-0.020
ull-0.020
Token and
Token ID290
Feature activation+2.44e-01

Pos logit contributions
 Kel+0.035
 Kenny+0.035
 Hanna+0.035
 Buster+0.034
 Cindy+0.034
Neg logit contributions
castle-0.031
ide-0.031
ull-0.030
room-0.029
 salad-0.028
Token Hannah
Token ID26013
Feature activation+9.86e+00

Pos logit contributions
 Hanna+0.020
 Kel+0.019
 Buster+0.019
 Cindy+0.019
 Kenny+0.019
Neg logit contributions
castle-0.022
ide-0.021
 structures-0.020
room-0.019
ull-0.019
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
 Hanna+0.853
 Kel+0.850
 Kenny+0.843
 Buster+0.835
 Hannah+0.829
Neg logit contributions
castle-0.755
ide-0.753
 structures-0.705
room-0.699
ull-0.694
Token They
Token ID1119
Feature activation-2.31e-01

Pos logit contributions
 Kel+0.014
 Hanna+0.014
 Kenny+0.014
 Buster+0.014
 F+0.014
Neg logit contributions
castle-0.010
ide-0.010
 structures-0.009
ull-0.009
room-0.009
Token were
Token ID547
Feature activation-1.24e+00

Pos logit contributions
castle+0.019
 structures+0.017
ide+0.017
room+0.017
ull+0.016
Neg logit contributions
 F-0.020
 Hanna-0.020
 Kel-0.019
 Cindy-0.019
 Buster-0.019
Token all
Token ID477
Feature activation-1.48e+00

Pos logit contributions
castle+0.120
ide+0.115
 structures+0.110
room+0.108
ull+0.107
Neg logit contributions
 Kel-0.088
 Hanna-0.087
 F-0.087
 Kenny-0.087
 Buster-0.086
Token so
Token ID523
Feature activation-6.74e-01

Pos logit contributions
castle+0.147
ide+0.140
 structures+0.135
room+0.133
ull+0.131
Neg logit contributions
 Kel-0.101
 Hanna-0.099
 F-0.099
 buck-0.099
 Hannah-0.097
Token was
Token ID373
Feature activation+6.03e-01

Pos logit contributions
 Kenny+0.063
 Kel+0.063
 F+0.063
 Hannah+0.060
 Hanna+0.059
Neg logit contributions
castle-0.111
 structures-0.104
 salad-0.100
ide-0.100
ull-0.097
Token a
Token ID257
Feature activation+3.39e-01

Pos logit contributions
 Kel+0.036
 Hanna+0.036
 Kenny+0.035
 Buster+0.035
 Hannah+0.035
Neg logit contributions
castle-0.064
ide-0.061
 structures-0.059
room-0.058
ull-0.058
Token little
Token ID1310
Feature activation+2.08e-01

Pos logit contributions
 Hanna+0.023
 Kel+0.023
 pup+0.023
 Buster+0.023
 Hannah+0.023
Neg logit contributions
castle-0.033
ide-0.032
 structures-0.031
room-0.030
ull-0.030
Token bird
Token ID6512
Feature activation-2.33e+00

Pos logit contributions
 Kel+0.016
 Kenny+0.016
 Hanna+0.016
 hesitation+0.016
 Buster+0.016
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.017
 salad-0.017
ull-0.016
Token called
Token ID1444
Feature activation+1.66e+00

Pos logit contributions
castle+0.263
 structures+0.260
 salad+0.243
 pulp+0.235
room+0.234
Neg logit contributions
 F-0.119
 buck-0.116
 pup-0.114
 Kel-0.109
 Mag-0.107
Token Hannah
Token ID26013
Feature activation+9.85e+00

Pos logit contributions
 Kel+0.135
 Hanna+0.134
 Kenny+0.133
 Buster+0.132
 Hannah+0.131
Neg logit contributions
castle-0.139
ide-0.133
 structures-0.128
room-0.125
ull-0.124
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
 Kel+0.750
 Hanna+0.744
 Kenny+0.737
 Buster+0.733
 Hannah+0.728
Neg logit contributions
castle-0.884
ide-0.840
 structures-0.817
room-0.797
ull-0.791
Token She
Token ID1375
Feature activation-2.39e-01

Pos logit contributions
 hesitation+0.011
 Kel+0.011
 training+0.010
 reins+0.010
Michael+0.010
Neg logit contributions
room-0.009
ide-0.009
 salad-0.009
castle-0.008
 structures-0.008
Token flew
Token ID13112
Feature activation-3.34e+00

Pos logit contributions
castle+0.022
 structures+0.021
ide+0.021
room+0.020
 salad+0.019
Neg logit contributions
 F-0.018
 buck-0.018
 Hanna-0.018
 Kel-0.018
 Buster-0.017
Token around
Token ID1088
Feature activation+1.80e+00

Pos logit contributions
castle+0.332
ide+0.311
 structures+0.306
room+0.302
ull+0.298
Neg logit contributions
 Kel-0.227
 F-0.226
 Hanna-0.226
 Cindy-0.225
 Kenny-0.223
Token all
Token ID477
Feature activation-1.50e+00

Pos logit contributions
 F+0.097
 Cindy+0.087
 pup+0.086
 training+0.085
 buck+0.085
Neg logit contributions
castle-0.214
 structures-0.190
ull-0.189
ide-0.186
ocado-0.183
Token They
Token ID1119
Feature activation-2.33e-01

Pos logit contributions
 buck+0.012
 training+0.012
 hesitation+0.011
Michael+0.011
 Kel+0.011
Neg logit contributions
ide-0.009
room-0.008
 salad-0.008
 structures-0.008
castle-0.008
Token were
Token ID547
Feature activation-1.25e+00

Pos logit contributions
castle+0.019
ide+0.018
 structures+0.018
room+0.017
ull+0.017
Neg logit contributions
 Kel-0.019
 Hanna-0.019
 Kenny-0.019
 Buster-0.019
 F-0.019
Token all
Token ID477
Feature activation-1.49e+00

Pos logit contributions
castle+0.110
ide+0.104
 structures+0.102
room+0.098
ull+0.098
Neg logit contributions
 Kel-0.096
 Hanna-0.096
 Kenny-0.095
 Buster-0.095
 Hannah-0.094
Token looking
Token ID2045
Feature activation-3.83e-01

Pos logit contributions
castle+0.135
ide+0.128
 structures+0.124
ull+0.121
room+0.121
Neg logit contributions
 Kel-0.112
 Hanna-0.112
 Kenny-0.111
 Buster-0.110
 Hannah-0.109
Token at
Token ID379
Feature activation-1.42e+00

Pos logit contributions
castle+0.032
ide+0.031
 structures+0.030
 salad+0.029
ull+0.029
Neg logit contributions
 Kel-0.031
 Hanna-0.031
 Kenny-0.031
 buck-0.030
 Hannah-0.030
Token Hannah
Token ID26013
Feature activation+9.85e+00

Pos logit contributions
castle+0.117
ide+0.115
 structures+0.111
room+0.107
ull+0.106
Neg logit contributions
 Hanna-0.116
 Kel-0.115
uster-0.113
 Buster-0.113
 buck-0.113
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
 Kel+0.819
 Hanna+0.810
 Kenny+0.801
 Buster+0.792
 Cindy+0.789
Neg logit contributions
castle-0.812
ide-0.791
 structures-0.744
room-0.727
 salad-0.719
Token the
Token ID262
Feature activation-3.41e-02

Pos logit contributions
 Kel+0.018
 Hanna+0.018
 Kenny+0.018
 Buster+0.018
 Hannah+0.018
Neg logit contributions
castle-0.020
ide-0.019
 structures-0.019
room-0.018
ull-0.018
Token leader
Token ID3554
Feature activation-1.67e+00

Pos logit contributions
castle+0.003
 structures+0.003
ide+0.002
 salad+0.002
room+0.002
Neg logit contributions
 Kel-0.003
 Hanna-0.003
uster-0.003
 Hannah-0.003
 Haley-0.003
Token of
Token ID286
Feature activation+8.12e-01

Pos logit contributions
castle+0.141
ide+0.136
 structures+0.128
room+0.126
ull+0.126
Neg logit contributions
 Kel-0.137
 Hanna-0.136
 Buster-0.134
 Kenny-0.133
 Cindy-0.132
Token the
Token ID262
Feature activation-4.39e-02

Pos logit contributions
uster+0.080
 Kel+0.078
 buck+0.077
Michael+0.076
 reins+0.075
Neg logit contributions
ide-0.048
 structures-0.046
 salad-0.044
 dragons-0.044
room-0.044
Token dad
Token ID9955
Feature activation-5.44e-01

Pos logit contributions
 Kel+0.020
 Hanna+0.020
 Hannah+0.019
 Buster+0.019
 Kenny+0.019
Neg logit contributions
castle-0.021
ide-0.020
 structures-0.019
room-0.019
ull-0.019
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
ide+0.035
castle+0.032
room+0.031
 salad+0.030
 structures+0.030
Neg logit contributions
 Kel-0.056
 Hanna-0.055
 Kenny-0.055
uster-0.055
Michael-0.055
Token Every
Token ID3887
Feature activation+2.59e-01

Pos logit contributions
 hesitation+0.014
 training+0.014
 reins+0.014
ha+0.014
 buck+0.014
Neg logit contributions
ide-0.007
room-0.007
 salad-0.006
tub-0.006
 structures-0.006
Token day
Token ID1110
Feature activation-6.03e-01

Pos logit contributions
 Kel+0.019
 Kenny+0.019
 Hanna+0.019
 Buster+0.019
 Cindy+0.018
Neg logit contributions
castle-0.024
ide-0.023
ull-0.022
room-0.022
 structures-0.022
Token,
Token ID11
Feature activation+2.40e-01

Pos logit contributions
ide+0.026
room+0.025
 salad+0.021
ap+0.020
ics+0.019
Neg logit contributions
 reins-0.070
uster-0.065
 hesitation-0.065
Michael-0.064
 Kenny-0.064
Token Hannah
Token ID26013
Feature activation+9.85e+00

Pos logit contributions
 reins+0.024
uster+0.023
 hesitation+0.023
 grizz+0.023
 training+0.023
Neg logit contributions
 salad-0.013
ide-0.013
room-0.011
castle-0.011
tub-0.010
Token loved
Token ID6151
Feature activation-2.40e-01

Pos logit contributions
 Hanna+0.759
 Kel+0.746
 buck+0.744
 Hannah+0.740
 Buster+0.738
Neg logit contributions
castle-0.868
ide-0.823
 structures-0.811
ull-0.782
room-0.776
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
castle+0.022
ide+0.021
 structures+0.021
room+0.020
ull+0.020
Neg logit contributions
 Kel-0.017
 Hanna-0.017
 Kenny-0.017
 Buster-0.017
 Hannah-0.017
Token play
Token ID711
Feature activation+2.91e-01

Pos logit contributions
 Hanna+0.001
 Kel+0.001
 Buster+0.001
 buck+0.001
 Kenny+0.001
Neg logit contributions
castle-0.001
ide-0.001
 structures-0.001
ull-0.001
room-0.001
Token in
Token ID287
Feature activation-4.45e-01

Pos logit contributions
 Hannah+0.020
 pup+0.020
 Buster+0.020
 Hanna+0.020
 Kenny+0.019
Neg logit contributions
castle-0.028
ull-0.026
ide-0.026
 structures-0.025
 pulp-0.024
Token the
Token ID262
Feature activation-4.04e-02

Pos logit contributions
castle+0.043
ide+0.041
 structures+0.040
ull+0.039
room+0.039
Neg logit contributions
 Kel-0.031
 Hanna-0.031
 Kenny-0.030
 Hannah-0.030
 Buster-0.030
Token big
Token ID1263
Feature activation-7.83e-01

Pos logit contributions
 Kel+0.029
 hesitation+0.028
 Hanna+0.028
 Kenny+0.028
uster+0.028
Neg logit contributions
castle-0.027
 salad-0.024
ide-0.024
ull-0.023
 structures-0.023
Token,
Token ID11
Feature activation+2.25e-01

Pos logit contributions
castle+0.062
 salad+0.056
 structures+0.055
ide+0.054
room+0.052
Neg logit contributions
 Kel-0.070
 Hanna-0.070
uster-0.068
 Kenny-0.068
 hesitation-0.067
Token hairy
Token ID43873
Feature activation+1.79e+00

Pos logit contributions
uster+0.021
 Kenny+0.020
 Hanna+0.020
 Kel+0.020
 hesitation+0.020
Neg logit contributions
 salad-0.016
castle-0.016
 structure-0.015
 tower-0.015
 structures-0.014
Token cat
Token ID3797
Feature activation-9.93e-01

Pos logit contributions
 Kel+0.149
 Hanna+0.148
 Kenny+0.145
 hesitation+0.144
 Buster+0.144
Neg logit contributions
castle-0.151
ide-0.140
 structures-0.136
room-0.134
 salad-0.134
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
castle+0.082
ide+0.079
 structures+0.075
room+0.075
ull+0.073
Neg logit contributions
 Hanna-0.082
 Kel-0.081
 hesitation-0.081
uster-0.080
 Buster-0.080
Token Hannah
Token ID26013
Feature activation+9.84e+00

Pos logit contributions
 hesitation+0.011
 training+0.011
 Kel+0.011
 buck+0.011
 Kenny+0.011
Neg logit contributions
 salad-0.006
room-0.006
ide-0.006
castle-0.006
 structures-0.005
Token was
Token ID373
Feature activation+5.82e-01

Pos logit contributions
 Kel+0.705
 F+0.692
 Kenny+0.690
 Hanna+0.689
 Cindy+0.682
Neg logit contributions
castle-0.948
ide-0.885
 structures-0.879
room-0.854
ull-0.846
Token so
Token ID523
Feature activation-7.13e-01

Pos logit contributions
 Hanna+0.043
 Kel+0.043
 Kenny+0.042
 Buster+0.042
 Hannah+0.042
Neg logit contributions
castle-0.053
ide-0.051
 structures-0.049
room-0.048
ull-0.048
Token excited
Token ID6568
Feature activation-3.10e-01

Pos logit contributions
castle+0.082
 structures+0.076
ide+0.073
 salad+0.072
room+0.072
Neg logit contributions
 Kel-0.043
 F-0.042
 Cindy-0.041
 buck-0.039
 Kenny-0.039
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
castle+0.027
ide+0.026
 structures+0.025
 salad+0.024
room+0.024
Neg logit contributions
 Kel-0.024
 Hanna-0.024
 Buster-0.024
 Kenny-0.024
uster-0.023
Token She
Token ID1375
Feature activation-2.50e-01

Pos logit contributions
 hesitation+0.012
 training+0.012
 Kel+0.011
 buck+0.011
 luggage+0.011
Neg logit contributions
ide-0.006
castle-0.006
room-0.006
 structures-0.006
 salad-0.006
Token and
Token ID290
Feature activation+1.92e-01

Pos logit contributions
 F+0.138
 Kenny+0.138
 buck+0.137
 Buster+0.136
 Hanna+0.136
Neg logit contributions
castle-0.202
ide-0.191
 structures-0.187
ull-0.187
 salad-0.182
Token said
Token ID531
Feature activation+6.87e-01

Pos logit contributions
 buck+0.013
 Kenny+0.012
 F+0.012
 Kel+0.012
 Buster+0.012
Neg logit contributions
ide-0.020
castle-0.019
room-0.019
 structures-0.018
ull-0.018
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
 F+0.038
 Kenny+0.036
 Kel+0.036
 Cindy+0.036
 training+0.035
Neg logit contributions
castle-0.075
ide-0.074
 structures-0.071
room-0.070
ull-0.068
Token "
Token ID366
Feature activation-1.02e+00

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.017
 Buster+0.017
 Hannah+0.017
Neg logit contributions
castle-0.014
ide-0.013
 structures-0.013
ull-0.012
 salad-0.012
TokenV
Token ID53
Feature activation+2.54e+00

Pos logit contributions
castle+0.110
ide+0.104
 structures+0.104
 salad+0.100
ull+0.100
Neg logit contributions
 Hanna-0.061
 Kel-0.061
 Kenny-0.060
 Buster-0.059
 F-0.059
Tokenanish
Token ID7115
Feature activation-8.43e+00

Pos logit contributions
 pup+0.299
Michael+0.298
 F+0.296
 Kenny+0.296
 Hannah+0.294
Neg logit contributions
 pulp-0.088
castle-0.081
 salad-0.081
 structures-0.077
pole-0.064
Token!"
Token ID2474
Feature activation-6.70e-01

Pos logit contributions
castle+0.859
 structures+0.804
ide+0.794
ull+0.787
 salad+0.786
Neg logit contributions
 Kenny-0.555
 F-0.554
 Kel-0.539
 Hannah-0.536
 Hanna-0.533
Token The
Token ID383
Feature activation-5.49e-01

Pos logit contributions
castle+0.080
ide+0.075
 structures+0.073
 nation+0.073
ocado+0.071
Neg logit contributions
 F-0.035
 Kenny-0.034
 buck-0.032
 Kel-0.031
 Buster-0.030
Token toy
Token ID13373
Feature activation+6.26e-01

Pos logit contributions
castle+0.050
ide+0.048
 structures+0.046
room+0.045
ull+0.045
Neg logit contributions
 Kel-0.041
 Hanna-0.041
 Kenny-0.040
 Buster-0.040
 Hannah-0.040
Token car
Token ID1097
Feature activation+2.06e+00

Pos logit contributions
 buck+0.047
 pup+0.046
 F+0.046
 Kenny+0.046
 luggage+0.045
Neg logit contributions
ide-0.055
castle-0.055
 structures-0.050
 salad-0.049
ull-0.049
Token was
Token ID373
Feature activation+5.39e-01

Pos logit contributions
 buck+0.143
 F+0.132
 Kenny+0.131
 Kel+0.128
 training+0.127
Neg logit contributions
castle-0.200
ide-0.197
 salad-0.188
 structures-0.188
ull-0.185
Token and
Token ID290
Feature activation+2.06e-01

Pos logit contributions
 Kel+0.084
 Hanna+0.083
 Kenny+0.082
 Hannah+0.081
 Cindy+0.081
Neg logit contributions
castle-0.105
ide-0.101
 structures-0.097
room-0.096
ull-0.095
Token said
Token ID531
Feature activation+7.10e-01

Pos logit contributions
 buck+0.014
 F+0.014
 Kenny+0.014
 Kel+0.014
 Hanna+0.014
Neg logit contributions
castle-0.020
ide-0.020
room-0.018
 structures-0.018
ull-0.018
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
 F+0.042
 Kel+0.041
 Kenny+0.041
 Cindy+0.040
 Buster+0.040
Neg logit contributions
castle-0.075
ide-0.073
 structures-0.070
room-0.069
ull-0.067
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
 Kel+0.019
 Hanna+0.019
 Kenny+0.019
 Hannah+0.019
 Buster+0.019
Neg logit contributions
castle-0.015
ide-0.014
 structures-0.014
 salad-0.013
ull-0.013
TokenV
Token ID53
Feature activation+2.55e+00

Pos logit contributions
castle+0.103
ide+0.099
 structures+0.096
room+0.094
ull+0.094
Neg logit contributions
 Kel-0.064
 Hanna-0.064
 Kenny-0.063
 Buster-0.063
 Hannah-0.062
Tokenanish
Token ID7115
Feature activation-8.43e+00

Pos logit contributions
 pup+0.308
Michael+0.308
 Kenny+0.308
 F+0.306
 Peggy+0.299
Neg logit contributions
 pulp-0.080
 salad-0.073
castle-0.071
 structures-0.069
ull-0.053
Token!"
Token ID2474
Feature activation-6.60e-01

Pos logit contributions
castle+0.828
 structures+0.772
ide+0.769
ull+0.753
 salad+0.753
Neg logit contributions
 Kenny-0.577
 F-0.574
 Kel-0.571
 Hanna-0.566
 Hannah-0.563
Token Suddenly
Token ID24975
Feature activation+8.85e-01

Pos logit contributions
castle+0.068
ide+0.065
ull+0.063
 structures+0.061
 salad+0.059
Neg logit contributions
 F-0.043
 Kenny-0.043
 buck-0.042
 Peggy-0.040
 Hannah-0.040
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
 Hanna+0.071
 Kel+0.070
 Buster+0.069
 Hannah+0.069
 Cindy+0.068
Neg logit contributions
castle-0.076
ide-0.071
 structures-0.071
room-0.069
ull-0.068
Token his
Token ID465
Feature activation-1.08e+00

Pos logit contributions
 Kel+0.014
 Hanna+0.014
 Buster+0.014
 Kenny+0.013
 Hannah+0.013
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.017
room-0.017
 salad-0.016
Token toy
Token ID13373
Feature activation+6.46e-01

Pos logit contributions
castle+0.091
ide+0.086
 structures+0.084
room+0.081
ull+0.081
Neg logit contributions
 Kel-0.088
 Hanna-0.088
 Kenny-0.086
 Buster-0.086
 Cindy-0.086
Token They
Token ID1119
Feature activation-8.34e-01

Pos logit contributions
 Kel+0.007
 Kenny+0.007
 Buster+0.007
 Hanna+0.007
 buck+0.007
Neg logit contributions
castle-0.005
ide-0.004
 structures-0.004
room-0.004
 salad-0.004
Token had
Token ID550
Feature activation+2.44e+00

Pos logit contributions
castle+0.070
ide+0.066
 structures+0.064
room+0.062
ull+0.062
Neg logit contributions
 Kel-0.068
 Hanna-0.067
 Kenny-0.067
 Buster-0.067
 Hannah-0.066
Token to
Token ID284
Feature activation-2.14e-01

Pos logit contributions
 Kel+0.175
 Hanna+0.174
 Kenny+0.172
 Buster+0.171
 Hannah+0.170
Neg logit contributions
castle-0.228
ide-0.218
 structures-0.211
room-0.207
ull-0.205
Token eat
Token ID4483
Feature activation-3.05e+00

Pos logit contributions
castle+0.017
ide+0.016
room+0.015
 structures+0.015
 salad+0.015
Neg logit contributions
 Kel-0.018
 Kenny-0.018
 Hanna-0.018
 Buster-0.018
uster-0.018
Token their
Token ID511
Feature activation-5.00e-01

Pos logit contributions
castle+0.281
ide+0.267
 structures+0.257
room+0.254
ull+0.252
Neg logit contributions
 Kel-0.221
 Hanna-0.220
 F-0.219
 Kenny-0.219
 Buster-0.218
Token salad
Token ID20698
Feature activation-8.43e+00

Pos logit contributions
castle+0.038
ide+0.037
 structures+0.035
room+0.034
ull+0.034
Neg logit contributions
 Kel-0.044
 Hanna-0.043
 Kenny-0.043
 Buster-0.043
 Hannah-0.043
Token.
Token ID13
Feature activation+4.55e-02

Pos logit contributions
ull+0.884
 structures+0.854
castle+0.834
room+0.822
roc+0.817
Neg logit contributions
 Hannah-0.421
 F-0.415
 ahead-0.413
 pup-0.408
 buck-0.401
Token They
Token ID1119
Feature activation-2.92e-01

Pos logit contributions
 Kel+0.004
 hesitation+0.004
 Hanna+0.004
 buck+0.004
 Kenny+0.004
Neg logit contributions
castle-0.003
ide-0.003
 structures-0.003
room-0.003
ull-0.003
Token did
Token ID750
Feature activation-1.92e+00

Pos logit contributions
castle+0.028
 structures+0.027
ide+0.026
room+0.026
ull+0.025
Neg logit contributions
 buck-0.021
 F-0.020
 Hanna-0.020
 Hannah-0.020
 Cindy-0.020
Token not
Token ID407
Feature activation-2.46e+00

Pos logit contributions
room+0.169
castle+0.167
ide+0.166
ull+0.162
ase+0.156
Neg logit contributions
 buck-0.133
 hesitation-0.130
 F-0.129
 ahead-0.128
 training-0.128
Token like
Token ID588
Feature activation+1.84e+00

Pos logit contributions
castle+0.267
ide+0.257
 structures+0.253
room+0.251
ics+0.245
Neg logit contributions
 hesitation-0.141
 buck-0.136
 training-0.135
 ahead-0.131
 Hannah-0.124
Token I
Token ID314
Feature activation-4.31e-01

Pos logit contributions
 Kenny+0.064
 Kel+0.064
 F+0.063
 Hanna+0.062
 Hannah+0.061
Neg logit contributions
castle-0.146
 structures-0.133
 salad-0.131
ide-0.127
ull-0.125
Token'll
Token ID1183
Feature activation+1.36e+00

Pos logit contributions
castle+0.037
ide+0.035
 salad+0.033
ull+0.033
room+0.033
Neg logit contributions
 Kel-0.035
 Kenny-0.035
 Buster-0.034
 Hanna-0.034
uster-0.033
Token make
Token ID787
Feature activation-1.89e+00

Pos logit contributions
 Kel+0.117
 Kenny+0.115
 Hanna+0.112
 Buster+0.111
 pup+0.110
Neg logit contributions
castle-0.109
ide-0.103
 salad-0.098
room-0.097
ull-0.095
Token you
Token ID345
Feature activation-7.78e-01

Pos logit contributions
castle+0.154
ide+0.147
 structures+0.145
 salad+0.144
room+0.140
Neg logit contributions
 Kel-0.154
 Hanna-0.150
 Kenny-0.150
 hesitation-0.149
 buck-0.149
Token a
Token ID257
Feature activation+1.97e-01

Pos logit contributions
castle+0.066
ide+0.065
 structures+0.063
room+0.061
ull+0.060
Neg logit contributions
 Hanna-0.059
 buck-0.059
 Hannah-0.059
 training-0.059
 Kel-0.059
Token salad
Token ID20698
Feature activation-8.43e+00

Pos logit contributions
 Kel+0.015
 Hanna+0.015
 Kenny+0.015
 Buster+0.015
 F+0.015
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.016
room-0.016
 salad-0.016
Token for
Token ID329
Feature activation+4.84e-01

Pos logit contributions
 structures+0.807
ull+0.795
ide+0.794
castle+0.777
room+0.758
Neg logit contributions
 Hannah-0.497
 Hanna-0.494
 buck-0.492
 F-0.486
 ahead-0.484
Token dinner
Token ID8073
Feature activation-1.49e+00

Pos logit contributions
 Hanna+0.034
 Hannah+0.034
 Buster+0.034
 Cindy+0.034
 F+0.034
Neg logit contributions
castle-0.046
ide-0.044
 structures-0.042
room-0.041
ull-0.041
Token."
Token ID526
Feature activation-1.41e+00

Pos logit contributions
castle+0.128
ide+0.121
 structures+0.121
ull+0.118
room+0.114
Neg logit contributions
 Hanna-0.114
 Hannah-0.113
 F-0.111
 Buster-0.111
 pup-0.110
Token Jack
Token ID3619
Feature activation-1.97e+00

Pos logit contributions
castle+0.132
ide+0.126
 structures+0.122
room+0.120
ull+0.119
Neg logit contributions
 Kel-0.102
 Hanna-0.102
 Kenny-0.101
 Buster-0.100
 Hannah-0.100
Token thanked
Token ID26280
Feature activation-1.26e+00

Pos logit contributions
castle+0.204
 structures+0.198
ide+0.194
room+0.191
ull+0.185
Neg logit contributions
 Hannah-0.120
 buck-0.118
 Hanna-0.118
 Kel-0.115
 Kenny-0.115
Token She
Token ID1375
Feature activation-2.58e-01

Pos logit contributions
 Kel+0.009
 Buster+0.009
 Kenny+0.009
 hesitation+0.009
 Hanna+0.009
Neg logit contributions
castle-0.008
ide-0.008
 structures-0.008
room-0.008
 salad-0.008
Token said
Token ID531
Feature activation+7.30e-01

Pos logit contributions
castle+0.023
ide+0.022
 structures+0.021
room+0.021
ull+0.021
Neg logit contributions
 Kel-0.020
 Hanna-0.020
 Kenny-0.019
 Buster-0.019
 Hannah-0.019
Token,
Token ID11
Feature activation+2.22e-01

Pos logit contributions
 Kel+0.048
 Hanna+0.048
 Hannah+0.047
 Cindy+0.047
 Kenny+0.047
Neg logit contributions
castle-0.073
ide-0.070
 structures-0.068
room-0.066
ull-0.066
Token "
Token ID366
Feature activation-9.95e-01

Pos logit contributions
 Kel+0.019
 Kenny+0.019
 Hanna+0.019
 buck+0.019
 Buster+0.019
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.015
room-0.015
 salad-0.015
TokenV
Token ID53
Feature activation+2.56e+00

Pos logit contributions
castle+0.081
room+0.081
ide+0.081
ull+0.074
 salad+0.072
Neg logit contributions
 Buster-0.081
 Kel-0.081
 Kenny-0.081
 hesitation-0.080
 luggage-0.078
Tokenanish
Token ID7115
Feature activation-8.41e+00

Pos logit contributions
 Hannah+0.322
 Hanna+0.321
 Kenny+0.319
 Kel+0.319
 pup+0.319
Neg logit contributions
castle-0.095
 structures-0.084
ide-0.080
 salad-0.075
ull-0.072
Token!"
Token ID2474
Feature activation-6.33e-01

Pos logit contributions
ide+0.692
room+0.663
castle+0.657
 structures+0.584
anish+0.579
Neg logit contributions
 buck-0.700
 Kel-0.696
 luggage-0.693
 hesitation-0.684
 Buster-0.682
Token and
Token ID290
Feature activation+2.17e-01

Pos logit contributions
castle+0.056
ide+0.054
 structures+0.052
room+0.051
ull+0.050
Neg logit contributions
 Kel-0.049
 Hanna-0.048
 Kenny-0.048
 Buster-0.047
 Hannah-0.047
Token then
Token ID788
Feature activation+3.20e+00

Pos logit contributions
 Hannah+0.016
 Hanna+0.015
 Kel+0.015
 Cindy+0.015
 buck+0.015
Neg logit contributions
castle-0.021
ide-0.021
 structures-0.020
ull-0.020
room-0.019
Token waited
Token ID13488
Feature activation-4.36e-01

Pos logit contributions
 Kel+0.250
 Buster+0.249
uster+0.246
 Hanna+0.246
 Kenny+0.245
Neg logit contributions
castle-0.275
ide-0.258
 structures-0.251
room-0.247
 salad-0.243
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
castle+0.038
ide+0.036
room+0.034
 structures+0.034
 salad+0.034
Neg logit contributions
 Kel-0.035
 Kenny-0.034
 Buster-0.034
 Hanna-0.034
 pup-0.034
Token magic
Token ID5536
Feature activation-1.02e+00

Pos logit contributions
castle+0.004
 salad+0.003
 structures+0.003
 nation+0.003
ull+0.003
Neg logit contributions
 Kel-0.004
 Kenny-0.004
 Kathy-0.004
 Buster-0.004
 Hanna-0.004
Token word
Token ID1573
Feature activation-5.57e-01

Pos logit contributions
castle+0.084
 structures+0.075
ull+0.075
ide+0.075
room+0.073
Neg logit contributions
 Kel-0.089
 Cindy-0.088
 Hanna-0.087
 Kenny-0.087
 Buster-0.087
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
castle+0.047
ide+0.045
 structures+0.043
room+0.043
ull+0.042
Neg logit contributions
 Hanna-0.045
 Kel-0.045
 Cindy-0.044
 Hannah-0.044
 Buster-0.044
Token "
Token ID366
Feature activation-9.86e-01

Pos logit contributions
 Kel+0.019
 Hanna+0.019
 Kenny+0.019
 Buster+0.019
 Hannah+0.019
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.015
ull-0.015
room-0.015
TokenV
Token ID53
Feature activation+2.57e+00

Pos logit contributions
castle+0.092
ide+0.089
 structures+0.085
room+0.085
ull+0.084
Neg logit contributions
 Kel-0.071
 Hanna-0.070
 Kenny-0.070
 Buster-0.069
 Hannah-0.069
Tokenanish
Token ID7115
Feature activation-8.41e+00

Pos logit contributions
 Hannah+0.278
 pup+0.276
 F+0.273
 Hank+0.271
Michael+0.270
Neg logit contributions
 structures-0.116
castle-0.113
 salad-0.108
ide-0.102
 pulp-0.099
Token!"
Token ID2474
Feature activation-6.46e-01

Pos logit contributions
castle+0.717
ide+0.698
room+0.666
 structures+0.655
ull+0.640
Neg logit contributions
 Kel-0.672
 Hanna-0.664
 hesitation-0.655
 Kenny-0.655
 Buster-0.654
Token When
Token ID1649
Feature activation+3.75e-01

Pos logit contributions
castle+0.059
ide+0.057
 structures+0.055
ull+0.053
room+0.052
Neg logit contributions
 Kenny-0.048
 Kel-0.048
 Hanna-0.047
 F-0.047
 Buster-0.047
Token he
Token ID339
Feature activation+9.27e-01

Pos logit contributions
uster+0.037
 reins+0.035
 Kel+0.034
Matthew+0.033
 Hanna+0.033
Neg logit contributions
castle-0.028
 salad-0.025
 structures-0.025
 dragons-0.024
 tower-0.024
Token opened
Token ID4721
Feature activation-1.21e+00

Pos logit contributions
 buck+0.064
 Hanna+0.063
 Kenny+0.063
 Hannah+0.063
 F+0.062
Neg logit contributions
castle-0.089
 structures-0.086
ide-0.085
room-0.080
ull-0.080
Token his
Token ID465
Feature activation-1.06e+00

Pos logit contributions
castle+0.086
ide+0.080
 structures+0.079
 salad+0.077
ull+0.073
Neg logit contributions
 Hanna-0.112
 pup-0.111
 Kel-0.111
uster-0.111
 hesitation-0.110
Token him
Token ID683
Feature activation-4.56e-01

Pos logit contributions
 salad+0.034
castle+0.034
ide+0.034
ull+0.033
 structures+0.032
Neg logit contributions
uster-0.042
 buck-0.042
 hesitation-0.042
Michael-0.041
 Kel-0.041
Token a
Token ID257
Feature activation+1.08e-01

Pos logit contributions
castle+0.041
ide+0.039
 structures+0.038
 salad+0.038
ull+0.038
Neg logit contributions
 Kel-0.034
 Hanna-0.033
 Kenny-0.033
 Buster-0.033
 Cindy-0.032
Token big
Token ID1263
Feature activation-9.86e-01

Pos logit contributions
 Kel+0.010
 Kenny+0.009
 Hanna+0.009
 F+0.009
 Eleanor+0.009
Neg logit contributions
 salad-0.009
castle-0.008
ull-0.008
ide-0.008
 tower-0.007
Token bowl
Token ID9396
Feature activation-3.20e+00

Pos logit contributions
castle+0.085
 salad+0.083
ull+0.077
ide+0.075
room+0.075
Neg logit contributions
 Kel-0.082
 Hanna-0.079
 Kenny-0.078
 Haley-0.077
 Eleanor-0.077
Token of
Token ID286
Feature activation+7.05e-01

Pos logit contributions
castle+0.302
ide+0.290
 structures+0.287
room+0.276
ull+0.273
Neg logit contributions
 Kel-0.219
 Hanna-0.218
 Kenny-0.216
 Buster-0.216
 F-0.215
Token salad
Token ID20698
Feature activation-8.39e+00

Pos logit contributions
 Kel+0.066
 Hanna+0.065
 Kenny+0.065
uster+0.065
 Haley+0.064
Neg logit contributions
 salad-0.051
castle-0.050
 structures-0.047
ide-0.046
ull-0.045
Token.
Token ID13
Feature activation+7.60e-02

Pos logit contributions
castle+0.793
 structures+0.782
ide+0.780
room+0.759
ull+0.754
Neg logit contributions
 F-0.532
 buck-0.527
 Hanna-0.526
 Hannah-0.525
 pup-0.522
Token Max
Token ID5436
Feature activation+3.91e-01

Pos logit contributions
 Kel+0.006
 Hanna+0.006
 hesitation+0.006
 buck+0.006
 Kenny+0.006
Neg logit contributions
castle-0.006
ide-0.006
 structures-0.006
room-0.006
ull-0.006
Token ate
Token ID15063
Feature activation-3.08e+00

Pos logit contributions
 F+0.026
 buck+0.026
 Hannah+0.026
 Kenny+0.026
 Buster+0.026
Neg logit contributions
castle-0.039
ide-0.036
 structures-0.036
ull-0.036
room-0.035
Token it
Token ID340
Feature activation+7.13e-01

Pos logit contributions
castle+0.320
ide+0.303
room+0.294
 structures+0.291
ull+0.290
Neg logit contributions
 F-0.189
 Hannah-0.189
 Cindy-0.187
 Buster-0.186
 Kenny-0.185
Token all
Token ID477
Feature activation-1.52e+00

Pos logit contributions
 F+0.041
 Kenny+0.040
 Buster+0.039
 Hannah+0.039
 pup+0.039
Neg logit contributions
castle-0.073
 structures-0.070
ull-0.070
room-0.067
ide-0.066
Token
Token ID220
Feature activation+8.59e-01

Pos logit contributions
castle+0.038
ide+0.037
 structures+0.035
room+0.035
ull+0.034
Neg logit contributions
 Kel-0.038
 Hanna-0.038
 hesitation-0.037
 Kenny-0.037
 buck-0.037
Token
Token ID198
Feature activation-7.07e-01

Pos logit contributions
 Kel+0.058
 Kenny+0.058
 Hanna+0.058
 Buster+0.057
 Hannah+0.057
Neg logit contributions
castle-0.085
ide-0.079
 structures-0.078
 salad-0.076
room-0.075
Token
Token ID198
Feature activation-6.85e-01

Pos logit contributions
ide+0.059
castle+0.059
room+0.055
ull+0.055
 structures+0.055
Neg logit contributions
 Kel-0.056
 Hanna-0.055
 hesitation-0.055
 buck-0.055
 luggage-0.054
TokenWhen
Token ID2215
Feature activation+9.63e-01

Pos logit contributions
ide+0.055
castle+0.055
room+0.052
 structures+0.051
ull+0.050
Neg logit contributions
 hesitation-0.056
 Kel-0.056
 Hanna-0.056
 luggage-0.055
 buck-0.055
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
uster+0.087
 Kel+0.082
 hesitation+0.082
 reins+0.081
 buck+0.081
Neg logit contributions
castle-0.074
 salad-0.069
ide-0.067
 structures-0.065
room-0.064
Token salad
Token ID20698
Feature activation-8.39e+00

Pos logit contributions
castle+0.012
 salad+0.012
 structures+0.011
 tower+0.010
 structure+0.010
Neg logit contributions
 Kel-0.014
 Hanna-0.013
uster-0.013
 Kenny-0.013
 Eleanor-0.013
Token was
Token ID373
Feature activation+5.41e-01

Pos logit contributions
 structures+0.843
ide+0.831
castle+0.814
room+0.791
ull+0.786
Neg logit contributions
 buck-0.481
 F-0.464
 Hannah-0.459
 Hanna-0.457
 Kenny-0.457
Token ready
Token ID3492
Feature activation+5.22e-01

Pos logit contributions
 Hanna+0.035
 buck+0.035
 hesitation+0.034
 Kel+0.034
 Cindy+0.034
Neg logit contributions
castle-0.052
ide-0.051
 structures-0.051
room-0.050
ull-0.048
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
 Kel+0.042
 Hanna+0.042
 Kenny+0.041
 buck+0.041
 Buster+0.041
Neg logit contributions
castle-0.044
ide-0.042
 salad-0.040
room-0.039
 structures-0.039
Token Bella
Token ID39304
Feature activation+1.41e+00

Pos logit contributions
uster+0.018
 hesitation+0.018
 training+0.018
 Kel+0.018
 buck+0.018
Neg logit contributions
castle-0.017
ide-0.016
 salad-0.016
 structures-0.016
room-0.015
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
 Hanna+0.106
 Kel+0.106
 Buster+0.104
 Hannah+0.104
 Kenny+0.104
Neg logit contributions
castle-0.126
ide-0.120
 structures-0.118
room-0.115
ull-0.113
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
 Kel+0.012
 Hanna+0.012
 Kenny+0.012
 buck+0.012
 Buster+0.011
Neg logit contributions
castle-0.013
ide-0.012
ull-0.012
room-0.012
 structures-0.012
Token "
Token ID366
Feature activation-1.20e+00

Pos logit contributions
castle+0.013
ide+0.012
ull+0.012
 salad+0.011
 structures+0.011
Neg logit contributions
 Kel-0.014
 Hanna-0.014
 Kenny-0.014
 luggage-0.014
 hesitation-0.014
TokenWhat
Token ID2061
Feature activation+1.39e+00

Pos logit contributions
castle+0.104
ide+0.102
room+0.098
ull+0.097
 structures+0.093
Neg logit contributions
 luggage-0.093
 hesitation-0.091
 Kel-0.090
 Kenny-0.090
 Hanna-0.089
Token a
Token ID257
Feature activation+1.97e-01

Pos logit contributions
+0.077
 pup+0.076
 luggage+0.075
 F+0.075
 buck+0.072
Neg logit contributions
castle-0.151
ide-0.140
 structures-0.130
ull-0.129
ectar-0.127
Token beautiful
Token ID4950
Feature activation+5.46e-01

Pos logit contributions
 Kel+0.017
 Kenny+0.016
 Hanna+0.016
uster+0.015
 buck+0.015
Neg logit contributions
castle-0.017
 salad-0.016
 structures-0.015
ull-0.015
ide-0.015
Token salad
Token ID20698
Feature activation-8.39e+00

Pos logit contributions
 Kel+0.048
 Kenny+0.047
 Hanna+0.047
 Buster+0.047
 F+0.046
Neg logit contributions
castle-0.043
ide-0.040
 structures-0.039
 salad-0.038
ull-0.038
Token!"
Token ID2474
Feature activation-6.70e-01

Pos logit contributions
 structures+0.681
castle+0.676
ide+0.676
ull+0.651
room+0.631
Neg logit contributions
 Hannah-0.632
 Hanna-0.631
 F-0.629
 pup-0.623
 Buster-0.621
Token The
Token ID383
Feature activation-5.40e-01

Pos logit contributions
castle+0.064
ide+0.060
 structures+0.059
room+0.057
ull+0.056
Neg logit contributions
 Hanna-0.048
 Kel-0.048
 Hannah-0.048
 Kenny-0.048
 Cindy-0.048
Token boys
Token ID6510
Feature activation+7.67e-01

Pos logit contributions
castle+0.046
ide+0.043
 structures+0.042
 salad+0.042
ull+0.041
Neg logit contributions
 Kel-0.044
 Hanna-0.043
 Kenny-0.043
 Buster-0.042
 F-0.042
Token began
Token ID2540
Feature activation+1.34e+00

Pos logit contributions
 Kel+0.058
 Hanna+0.057
 Kenny+0.057
 Hannah+0.057
 Buster+0.057
Neg logit contributions
castle-0.068
ide-0.064
 structures-0.064
room-0.061
ull-0.061
Token to
Token ID284
Feature activation-4.07e-03

Pos logit contributions
 hesitation+0.068
 buck+0.064
 training+0.061
 F+0.061
 Hannah+0.059
Neg logit contributions
castle-0.153
room-0.149
ide-0.148
 structures-0.141
anish-0.137
Token something
Token ID1223
Feature activation+5.79e-01

Pos logit contributions
castle+0.111
ide+0.106
 structures+0.103
room+0.101
ull+0.100
Neg logit contributions
 Kel-0.088
 Hanna-0.087
 Kenny-0.086
 Buster-0.086
 Hannah-0.085
Token and
Token ID290
Feature activation+2.49e-01

Pos logit contributions
 Hanna+0.047
 Kel+0.047
 Kenny+0.047
 F+0.046
 Hannah+0.046
Neg logit contributions
castle-0.048
ide-0.045
 structures-0.044
room-0.043
ull-0.043
Token say
Token ID910
Feature activation+1.16e+00

Pos logit contributions
 Kel+0.021
 Hanna+0.020
 Kenny+0.020
 Buster+0.020
 Hannah+0.020
Neg logit contributions
castle-0.021
ide-0.019
 structures-0.019
 salad-0.018
room-0.018
Token '
Token ID705
Feature activation+1.27e+00

Pos logit contributions
 Kel+0.083
 Kenny+0.080
 Buster+0.080
uster+0.079
 Hanna+0.079
Neg logit contributions
castle-0.108
ide-0.103
 structures-0.102
 salad-0.101
room-0.100
Tokenv
Token ID85
Feature activation-6.01e-01

Pos logit contributions
 Kel+0.102
 Buster+0.101
 Hanna+0.100
 Kenny+0.100
 hesitation+0.100
Neg logit contributions
ide-0.104
castle-0.104
room-0.101
ull-0.097
 structures-0.093
Tokenanish
Token ID7115
Feature activation-8.37e+00

Pos logit contributions
castle+0.039
 structures+0.036
 salad+0.034
ide+0.034
ull+0.030
Neg logit contributions
 Hannah-0.058
 pup-0.057
 Hanna-0.057
 Buster-0.056
 F-0.056
Token!'
Token ID13679
Feature activation-2.01e-01

Pos logit contributions
ide+0.699
castle+0.657
room+0.655
 structures+0.607
ull+0.600
Neg logit contributions
 Kel-0.697
 Kenny-0.689
 buck-0.681
 Buster-0.662
 luggage-0.662
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
castle+0.014
ide+0.014
room+0.014
 salad+0.013
 structures+0.013
Neg logit contributions
 Kel-0.018
 Kenny-0.018
 Buster-0.018
 buck-0.018
 Hanna-0.017
Token then
Token ID788
Feature activation+3.25e+00

Pos logit contributions
 Kel+0.016
 Hanna+0.015
 Buster+0.015
 Kenny+0.015
uster+0.015
Neg logit contributions
castle-0.024
ide-0.023
 structures-0.022
 salad-0.022
room-0.022
Token come
Token ID1282
Feature activation-5.69e-01

Pos logit contributions
 Kel+0.255
uster+0.250
 Buster+0.250
 Kenny+0.249
 Hanna+0.246
Neg logit contributions
castle-0.279
ide-0.260
 structures-0.253
 salad-0.252
room-0.248
Token out
Token ID503
Feature activation-1.75e+00

Pos logit contributions
castle+0.046
ide+0.042
room+0.042
ull+0.041
 structures+0.040
Neg logit contributions
 hesitation-0.049
 Kel-0.048
 Hanna-0.047
 Hannah-0.047
Michael-0.047
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
 F+0.097
 pup+0.097
 buck+0.096
 Kenny+0.095
 hesitation+0.094
Neg logit contributions
castle-0.154
ull-0.146
 structures-0.146
ide-0.145
room-0.139
Token said
Token ID531
Feature activation+7.51e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.017
 F+0.017
 buck+0.017
Neg logit contributions
castle-0.024
ide-0.024
 structures-0.023
room-0.023
ull-0.022
Token,
Token ID11
Feature activation+2.57e-01

Pos logit contributions
 Kel+0.050
 F+0.049
 Hanna+0.049
 Kenny+0.049
 Cindy+0.048
Neg logit contributions
castle-0.074
ide-0.072
 structures-0.070
room-0.068
ull-0.067
Token "
Token ID366
Feature activation-9.57e-01

Pos logit contributions
 Kel+0.025
 Hanna+0.025
 luggage+0.024
 Kenny+0.024
 pup+0.024
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.015
ull-0.015
 salad-0.015
TokenV
Token ID53
Feature activation+2.59e+00

Pos logit contributions
castle+0.095
ide+0.093
room+0.090
ull+0.088
 structures+0.088
Neg logit contributions
 hesitation-0.062
 Kel-0.061
 Buster-0.061
 Kenny-0.061
 Hanna-0.061
Tokenanish
Token ID7115
Feature activation-8.37e+00

Pos logit contributions
Michael+0.326
 Hannah+0.325
 Kenny+0.321
 pup+0.320
 Eleanor+0.318
Neg logit contributions
castle-0.074
 salad-0.067
 structures-0.065
 pulp-0.064
ull-0.051
Token!"
Token ID2474
Feature activation-6.09e-01

Pos logit contributions
castle+0.749
ide+0.732
room+0.701
 structures+0.688
ull+0.669
Neg logit contributions
 Kel-0.632
 Hanna-0.631
 hesitation-0.621
 Buster-0.621
 buck-0.620
Token The
Token ID383
Feature activation-4.95e-01

Pos logit contributions
castle+0.056
ide+0.054
 structures+0.052
ull+0.050
room+0.050
Neg logit contributions
 Kel-0.045
 Kenny-0.045
 F-0.044
 Hanna-0.044
 Buster-0.044
Token couch
Token ID18507
Feature activation+1.57e+00

Pos logit contributions
 salad+0.033
 structures+0.033
castle+0.032
room+0.032
ide+0.031
Neg logit contributions
uster-0.048
 Kel-0.047
Matthew-0.047
 Buster-0.046
 Hank-0.046
Token was
Token ID373
Feature activation+6.03e-01

Pos logit contributions
 buck+0.103
 Kel+0.101
 F+0.101
 Kenny+0.100
 pup+0.099
Neg logit contributions
castle-0.146
ide-0.142
 structures-0.141
ull-0.137
 salad-0.135
Token gone
Token ID3750
Feature activation-2.52e+00

Pos logit contributions
 Kel+0.047
 Hanna+0.046
 Kenny+0.046
 Buster+0.046
 Hannah+0.045
Neg logit contributions
castle-0.053
ide-0.051
 structures-0.049
room-0.048
ull-0.048
Token They
Token ID1119
Feature activation-2.33e-01

Pos logit contributions
 Kel+0.014
 Hanna+0.013
 Kenny+0.013
 Buster+0.013
 buck+0.013
Neg logit contributions
castle-0.009
ide-0.008
 structures-0.008
room-0.008
ull-0.008
Token said
Token ID531
Feature activation+7.61e-01

Pos logit contributions
castle+0.021
ide+0.020
 structures+0.020
room+0.019
ull+0.019
Neg logit contributions
 buck-0.017
 Kel-0.017
 Hanna-0.017
 F-0.017
 Kenny-0.017
Token,
Token ID11
Feature activation+2.49e-01

Pos logit contributions
 Kel+0.058
 Hanna+0.058
 Kenny+0.058
 Buster+0.057
uster+0.057
Neg logit contributions
castle-0.067
ide-0.064
 structures-0.063
 salad-0.061
ull-0.061
Token "
Token ID366
Feature activation-9.59e-01

Pos logit contributions
 Kel+0.021
 Hanna+0.021
 Kenny+0.021
 Buster+0.021
 Hannah+0.020
Neg logit contributions
castle-0.020
ide-0.019
 structures-0.018
room-0.018
ull-0.018
TokenV
Token ID53
Feature activation+2.61e+00

Pos logit contributions
castle+0.086
ide+0.084
room+0.082
 structures+0.079
ull+0.078
Neg logit contributions
 Kel-0.073
 Buster-0.072
 Kenny-0.072
 Hanna-0.071
 hesitation-0.071
Tokenanish
Token ID7115
Feature activation-8.37e+00

Pos logit contributions
 pup+0.313
Michael+0.310
 F+0.310
 Kenny+0.308
 Hannah+0.306
Neg logit contributions
 salad-0.087
castle-0.084
 pulp-0.076
 structures-0.067
ull-0.058
Token!"
Token ID2474
Feature activation-6.01e-01

Pos logit contributions
castle+0.714
ide+0.705
room+0.670
 structures+0.649
ull+0.633
Neg logit contributions
 Kel-0.671
 Hanna-0.657
 hesitation-0.656
 buck-0.654
 Buster-0.653
Token each
Token ID1123
Feature activation-1.64e+00

Pos logit contributions
castle+0.050
ide+0.048
 structures+0.046
room+0.046
ull+0.045
Neg logit contributions
 Kel-0.049
 Hanna-0.048
 Kenny-0.048
 Buster-0.048
 pup-0.047
Token time
Token ID640
Feature activation+4.63e-01

Pos logit contributions
castle+0.150
ide+0.145
room+0.138
 structures+0.138
ull+0.138
Neg logit contributions
 Kel-0.122
 Hanna-0.120
 Kenny-0.120
 Buster-0.119
 Cindy-0.118
Token.
Token ID13
Feature activation+1.41e-01

Pos logit contributions
uster+0.041
 Kel+0.040
 Hanna+0.039
 reins+0.039
 Haley+0.038
Neg logit contributions
castle-0.034
room-0.033
ide-0.033
ull-0.031
ics-0.030
Token All
Token ID1439
Feature activation+5.13e-01

Pos logit contributions
 Kel+0.014
 hesitation+0.013
 Hanna+0.013
 Buster+0.013
 Kenny+0.013
Neg logit contributions
castle-0.009
ide-0.009
room-0.009
 structures-0.009
ull-0.008
Token sad
Token ID6507
Feature activation-9.81e-01

Pos logit contributions
castle+0.137
 salad+0.133
 structures+0.131
ull+0.129
ide+0.128
Neg logit contributions
 Kel-0.186
 Hanna-0.183
 Kenny-0.183
Michael-0.181
uster-0.179
Token because
Token ID780
Feature activation-5.43e-01

Pos logit contributions
castle+0.079
ide+0.075
 salad+0.071
 structures+0.071
room+0.069
Neg logit contributions
 Kel-0.086
 Hanna-0.084
 Kenny-0.082
 Buster-0.081
Michael-0.081
Token she
Token ID673
Feature activation+2.57e-01

Pos logit contributions
castle+0.048
ide+0.046
 structures+0.045
 salad+0.044
ull+0.044
Neg logit contributions
 Kel-0.041
 Hanna-0.041
uster-0.040
 buck-0.040
 Kenny-0.040
Token loved
Token ID6151
Feature activation-2.88e-01

Pos logit contributions
 Kenny+0.019
 Kel+0.019
 Hannah+0.018
 Hanna+0.018
 Buster+0.018
Neg logit contributions
castle-0.024
ide-0.023
 structures-0.022
room-0.022
ull-0.022
Token her
Token ID607
Feature activation-9.76e-01

Pos logit contributions
castle+0.027
ide+0.027
room+0.026
ull+0.025
ics+0.023
Neg logit contributions
 Hannah-0.020
 Cindy-0.020
 hesitation-0.020
 F-0.020
 Buster-0.020
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
ide+0.096
room+0.091
castle+0.085
ics+0.084
ull+0.083
Neg logit contributions
 hesitation-0.063
 pup-0.062
 luggage-0.059
 training-0.059
 courage-0.057
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
castle+0.846
 structures+0.826
ull+0.824
ide+0.806
room+0.803
Neg logit contributions
 Hannah-0.494
 pup-0.485
 hesitation-0.479
 F-0.477
 Hanna-0.476
Token wanted
Token ID2227
Feature activation-8.32e-01

Pos logit contributions
 Hannah+0.012
 Cindy+0.012
 Kel+0.012
 Buster+0.012
 Kenny+0.012
Neg logit contributions
ide-0.017
castle-0.017
room-0.017
ull-0.016
 structures-0.016
Token to
Token ID284
Feature activation-3.45e-02

Pos logit contributions
castle+0.081
ide+0.077
 structures+0.075
room+0.074
ull+0.073
Neg logit contributions
 Kel-0.057
 Hanna-0.057
 Kenny-0.056
 Buster-0.056
 Hannah-0.056
Token eat
Token ID4483
Feature activation-2.90e+00

Pos logit contributions
castle+0.004
ide+0.004
 structures+0.004
ull+0.003
room+0.003
Neg logit contributions
 F-0.002
 Hannah-0.002
 Kenny-0.002
 Cindy-0.002
 Kel-0.002
Token all
Token ID477
Feature activation-1.56e+00

Pos logit contributions
castle+0.341
ide+0.329
room+0.322
 structures+0.316
ull+0.315
Neg logit contributions
 Hannah-0.134
 Cindy-0.133
 F-0.129
 hesitation-0.126
 Hanna-0.125
Token bird
Token ID6512
Feature activation-2.39e+00

Pos logit contributions
castle+0.007
ide+0.007
 structures+0.007
room+0.006
ull+0.006
Neg logit contributions
 Kel-0.006
 Hanna-0.006
 pup-0.006
 F-0.006
 Kenny-0.006
Token found
Token ID1043
Feature activation-2.03e+00

Pos logit contributions
castle+0.238
 structures+0.231
ide+0.226
room+0.224
 salad+0.217
Neg logit contributions
 buck-0.164
 pup-0.152
 F-0.151
 Hanna-0.147
 Buster-0.147
Token a
Token ID257
Feature activation+2.83e-01

Pos logit contributions
castle+0.244
ide+0.235
 structures+0.224
room+0.220
 salad+0.207
Neg logit contributions
 F-0.104
 Kel-0.098
 pup-0.097
 Buster-0.097
 Cindy-0.097
Token bright
Token ID6016
Feature activation+2.61e-01

Pos logit contributions
 Kel+0.022
 Hanna+0.022
 Kenny+0.022
 Buster+0.021
 Hannah+0.021
Neg logit contributions
castle-0.025
ide-0.024
 structures-0.023
room-0.022
ull-0.022
Token yellow
Token ID7872
Feature activation+9.43e-01

Pos logit contributions
 Kel+0.021
 Hanna+0.021
 Kenny+0.021
 Buster+0.021
 Hannah+0.021
Neg logit contributions
castle-0.022
ide-0.021
 structures-0.020
ull-0.020
room-0.020
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
 Kel+0.090
 Hanna+0.089
 Kenny+0.089
 Buster+0.088
 Hannah+0.087
Neg logit contributions
castle-0.068
ide-0.063
 structures-0.060
room-0.060
ull-0.059
Token.
Token ID13
Feature activation+7.38e-02

Pos logit contributions
castle+0.818
ide+0.797
 structures+0.791
room+0.766
ull+0.756
Neg logit contributions
 pup-0.527
 F-0.527
 Hanna-0.521
 buck-0.521
 Buster-0.518
Token The
Token ID383
Feature activation-5.48e-01

Pos logit contributions
 F+0.004
 Cindy+0.004
 Buster+0.004
 Kenny+0.004
 Kel+0.004
Neg logit contributions
castle-0.009
ide-0.008
 structures-0.008
ull-0.008
room-0.007
Token bird
Token ID6512
Feature activation-2.40e+00

Pos logit contributions
ide+0.067
room+0.066
iders+0.062
ics+0.061
imm+0.060
Neg logit contributions
 hesitation-0.025
 pup-0.024
 luggage-0.021
 reins-0.021
 leather-0.019
Token thought
Token ID1807
Feature activation+1.77e-01

Pos logit contributions
room+0.280
castle+0.277
 structures+0.274
ide+0.260
anish+0.255
Neg logit contributions
 buck-0.126
 Kel-0.097
 F-0.097
ha-0.096
 ahead-0.095
Token it
Token ID340
Feature activation+6.65e-01

Pos logit contributions
 F+0.007
 hesitation+0.007
 Kel+0.007
 luggage+0.007
 Buster+0.007
Neg logit contributions
castle-0.022
ide-0.021
room-0.020
 structures-0.020
ull-0.019
Token do
Token ID466
Feature activation+4.32e-01

Pos logit contributions
castle+0.035
ide+0.034
 structures+0.033
room+0.032
 salad+0.031
Neg logit contributions
 Kel-0.034
 Hanna-0.034
 Kenny-0.034
 Buster-0.034
 Hannah-0.034
Token not
Token ID407
Feature activation-2.50e+00

Pos logit contributions
 Kel+0.042
 Hanna+0.042
 Kenny+0.042
 F+0.042
 buck+0.041
Neg logit contributions
castle-0.029
ide-0.026
ull-0.025
 structures-0.025
room-0.025
Token like
Token ID588
Feature activation+1.80e+00

Pos logit contributions
castle+0.252
 structures+0.236
ide+0.233
room+0.226
 salad+0.225
Neg logit contributions
 buck-0.164
 Hanna-0.160
 F-0.159
 training-0.158
 Kel-0.156
Token to
Token ID284
Feature activation-3.97e-02

Pos logit contributions
 Kel+0.118
 F+0.117
 Kenny+0.117
 buck+0.117
 Hanna+0.116
Neg logit contributions
castle-0.178
ide-0.171
room-0.162
 structures-0.162
ull-0.162
Token eat
Token ID4483
Feature activation-2.89e+00

Pos logit contributions
castle+0.003
 structures+0.003
ull+0.003
ide+0.003
 salad+0.003
Neg logit contributions
 buck-0.003
 training-0.003
 F-0.003
 Hanna-0.003
 Kel-0.003
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
castle+0.262
ide+0.248
ull+0.237
 structures+0.234
room+0.234
Neg logit contributions
 Kel-0.215
 Kenny-0.212
 Hanna-0.212
 F-0.212
 Buster-0.210
Token.
Token ID13
Feature activation+7.05e-02

Pos logit contributions
ull+0.861
castle+0.857
 structures+0.819
ide+0.794
room+0.790
Neg logit contributions
 F-0.473
 Hannah-0.453
 pup-0.450
 buck-0.448
 luggage-0.444
Token Salad
Token ID42706
Feature activation-4.86e+00

Pos logit contributions
 Kel+0.005
 Kenny+0.005
 Hanna+0.005
 F+0.005
 Cindy+0.005
Neg logit contributions
castle-0.006
ide-0.006
 structures-0.006
ull-0.006
room-0.006
Token is
Token ID318
Feature activation-2.84e-02

Pos logit contributions
castle+0.540
ide+0.523
ull+0.518
 structures+0.506
room+0.495
Neg logit contributions
 buck-0.195
 F-0.191
 sk-0.182
 Hannah-0.180
 Hanna-0.174
Token green
Token ID4077
Feature activation-1.45e-01

Pos logit contributions
castle+0.003
ide+0.003
room+0.003
ull+0.003
 structures+0.003
Neg logit contributions
 F-0.002
 hesitation-0.002
 training-0.002
 buck-0.002
 Hannah-0.002
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
castle+0.015
ide+0.015
ull+0.014
 structures+0.014
room+0.014
Neg logit contributions
 F-0.008
 buck-0.008
 pup-0.008
 luggage-0.008
 Hannah-0.007
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Buster+0.017
 Kenny+0.017
 Hannah+0.016
Neg logit contributions
castle-0.017
ide-0.017
room-0.016
 structures-0.016
 salad-0.015
Token I
Token ID314
Feature activation+1.04e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.017
 Haley+0.017
uster+0.016
Neg logit contributions
castle-0.014
 salad-0.014
ide-0.013
 structures-0.012
room-0.012
Token'd
Token ID1549
Feature activation+8.27e-01

Pos logit contributions
 Kel+0.009
 Cindy+0.009
 Hannah+0.009
 Hanna+0.009
 Kenny+0.009
Neg logit contributions
castle-0.008
room-0.008
ull-0.008
ide-0.008
 salad-0.007
Token love
Token ID1842
Feature activation+4.94e+00

Pos logit contributions
 Kel+0.067
 Kenny+0.065
 Hanna+0.063
 Haley+0.063
 Buster+0.063
Neg logit contributions
castle-0.074
ide-0.069
 salad-0.069
room-0.067
 structures-0.067
Token a
Token ID257
Feature activation+2.87e-01

Pos logit contributions
 Buster+0.322
 F+0.321
 Kenny+0.321
 pup+0.319
 Kel+0.319
Neg logit contributions
castle-0.485
ide-0.468
room-0.450
 structures-0.444
ull-0.443
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
 pup+0.024
 hesitation+0.023
 Hannah+0.022
 training+0.022
 Cindy+0.022
Neg logit contributions
ide-0.023
 structures-0.022
castle-0.022
room-0.022
ics-0.021
Token!"
Token ID2474
Feature activation-6.75e-01

Pos logit contributions
castle+0.707
 structures+0.685
ide+0.677
ull+0.663
room+0.650
Neg logit contributions
 Hanna-0.635
 F-0.634
 Hannah-0.632
 pup-0.628
 buck-0.622
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
castle+0.051
ide+0.049
room+0.049
 structures+0.048
 salad+0.046
Neg logit contributions
 hesitation-0.059
 Kel-0.058
 Hanna-0.058
 Cindy-0.056
 F-0.055
Token
Token ID198
Feature activation-5.44e-01

Pos logit contributions
ide+0.042
castle+0.042
room+0.041
ull+0.039
 structures+0.039
Neg logit contributions
 hesitation-0.047
 Kel-0.045
 Hanna-0.045
 Cindy-0.044
 pup-0.044
TokenHis
Token ID6653
Feature activation-4.13e-01

Pos logit contributions
castle+0.046
ide+0.045
room+0.044
 structures+0.042
ull+0.040
Neg logit contributions
 hesitation-0.045
 Kel-0.044
 Hanna-0.044
 Cindy-0.043
 Hannah-0.042
Token mom
Token ID1995
Feature activation+2.09e-01

Pos logit contributions
castle+0.031
 structures+0.029
 salad+0.028
ide+0.028
room+0.027
Neg logit contributions
 Kel-0.039
 Hanna-0.038
 Buster-0.038
 Cindy-0.037
uster-0.037
TokenLet
Token ID5756
Feature activation+4.05e-01

Pos logit contributions
 Hanna+0.030
 hesitation+0.030
 Kel+0.030
 luggage+0.030
 Kenny+0.030
Neg logit contributions
castle-0.037
ide-0.036
 structures-0.035
room-0.035
ull-0.035
Token's
Token ID338
Feature activation-1.07e+00

Pos logit contributions
 Kel+0.039
 Hanna+0.039
 Kenny+0.039
 buck+0.038
 Buster+0.038
Neg logit contributions
castle-0.027
ide-0.026
 structures-0.025
room-0.024
ull-0.024
Token make
Token ID787
Feature activation-1.70e+00

Pos logit contributions
castle+0.092
ide+0.091
 structures+0.087
ull+0.087
room+0.083
Neg logit contributions
 Buster-0.080
 F-0.079
 Hannah-0.079
 Kenny-0.078
 Cindy-0.078
Token a
Token ID257
Feature activation+2.90e-01

Pos logit contributions
castle+0.198
ide+0.194
room+0.182
ull+0.175
ill+0.172
Neg logit contributions
 Buster-0.082
 F-0.081
 Hannah-0.081
 Cindy-0.079
 Kenny-0.078
Token tomato
Token ID24240
Feature activation-6.64e+00

Pos logit contributions
 luggage+0.026
 pup+0.026
 hesitation+0.025
 Hannah+0.024
 buck+0.024
Neg logit contributions
ide-0.020
room-0.020
ull-0.018
ics-0.018
castle-0.018
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
ide+0.476
room+0.470
 structures+0.425
ateful+0.421
ull+0.409
Neg logit contributions
-0.507
 ahead-0.503
 buck-0.500
 hesitation-0.496
 pup-0.494
Token!"
Token ID2474
Feature activation-6.66e-01

Pos logit contributions
 structures+0.797
ide+0.796
ull+0.796
room+0.795
castle+0.782
Neg logit contributions
 buck-0.487
 ahead-0.453
 Hannah-0.451
 pup-0.449
 Cindy-0.449
Token Mouse
Token ID21839
Feature activation+2.14e+00

Pos logit contributions
ide+0.057
castle+0.056
 structures+0.055
room+0.054
ull+0.053
Neg logit contributions
 Hanna-0.049
uster-0.048
 Hannah-0.048
 hesitation-0.047
 Cindy-0.047
Token said
Token ID531
Feature activation+6.96e-01

Pos logit contributions
 Kel+0.139
 Hanna+0.137
 Kenny+0.136
 Buster+0.135
 Hannah+0.134
Neg logit contributions
castle-0.216
ide-0.204
 structures-0.201
room-0.197
ull-0.194
Token.
Token ID13
Feature activation+8.43e-02

Pos logit contributions
 Hanna+0.055
 Kel+0.055
 buck+0.054
 Kenny+0.053
 luggage+0.053
Neg logit contributions
castle-0.057
ide-0.057
 structures-0.053
ull-0.053
room-0.053
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
 Hanna+0.007
 luggage+0.007
 hesitation+0.006
 Hannah+0.006
 buck+0.006
Neg logit contributions
ide-0.007
castle-0.007
 structures-0.007
room-0.006
ull-0.006
Token had
Token ID550
Feature activation+2.79e+00

Pos logit contributions
castle+0.030
ide+0.029
 salad+0.029
room+0.028
ull+0.028
Neg logit contributions
 Kel-0.038
 Hanna-0.038
 hesitation-0.037
uster-0.037
 Cindy-0.037
Token to
Token ID284
Feature activation-3.92e-02

Pos logit contributions
 Kel+0.199
 Hanna+0.198
 Kenny+0.196
 Buster+0.195
 Hannah+0.193
Neg logit contributions
castle-0.262
ide-0.250
 structures-0.243
room-0.238
ull-0.236
Token eat
Token ID4483
Feature activation-2.89e+00

Pos logit contributions
castle+0.003
ide+0.003
 structures+0.003
ull+0.003
room+0.003
Neg logit contributions
 Hanna-0.003
 Kel-0.003
 F-0.003
 Buster-0.003
 Hannah-0.003
Token soup
Token ID17141
Feature activation-4.38e+00

Pos logit contributions
castle+0.269
ide+0.254
 structures+0.244
ull+0.242
room+0.242
Neg logit contributions
 Hanna-0.204
 Hannah-0.203
 F-0.202
 Kel-0.201
 Cindy-0.200
Token and
Token ID290
Feature activation+1.92e-01

Pos logit contributions
castle+0.430
 structures+0.411
ull+0.409
ide+0.401
room+0.388
Neg logit contributions
 Hanna-0.277
 F-0.276
 Hannah-0.273
 buck-0.272
 Buster-0.265
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
 Kel+0.016
 Hanna+0.016
 Hannah+0.016
 buck+0.016
 F+0.016
Neg logit contributions
castle-0.015
ide-0.015
 structures-0.014
room-0.014
ull-0.014
Token.
Token ID13
Feature activation+7.92e-02

Pos logit contributions
castle+0.805
 structures+0.765
ide+0.759
ull+0.758
room+0.738
Neg logit contributions
 Hanna-0.555
 F-0.546
 buck-0.545
 Hannah-0.544
 Kel-0.537
Token
Token ID198
Feature activation-5.30e-01

Pos logit contributions
 Kel+0.006
 Kenny+0.006
 F+0.006
 Hannah+0.006
 Hanna+0.006
Neg logit contributions
castle-0.008
ide-0.007
 structures-0.007
ull-0.007
 salad-0.007
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
castle+0.044
ide+0.044
room+0.042
ull+0.041
 structures+0.041
Neg logit contributions
 Kel-0.042
 Hanna-0.042
 pup-0.042
 Buster-0.041
 hesitation-0.041
TokenSam
Token ID16305
Feature activation-1.65e+00

Pos logit contributions
castle+0.037
ide+0.035
 structures+0.033
room+0.033
ull+0.033
Neg logit contributions
 Hanna-0.050
 Kel-0.050
 Buster-0.049
 Kenny-0.049
 buck-0.049
Token and
Token ID290
Feature activation+1.84e-01

Pos logit contributions
castle+0.157
ide+0.149
 structures+0.147
room+0.142
ull+0.142
Neg logit contributions
 Kel-0.116
 Hanna-0.115
 Kenny-0.114
 Hannah-0.114
 Buster-0.113
Token did
Token ID750
Feature activation-1.93e+00

Pos logit contributions
 buck+0.009
 F+0.009
 Kel+0.008
 Hanna+0.008
 Buster+0.008
Neg logit contributions
castle-0.012
ide-0.012
 structures-0.012
room-0.011
 salad-0.011
Token not
Token ID407
Feature activation-2.49e+00

Pos logit contributions
castle+0.164
ull+0.159
ide+0.159
room+0.149
 pulp+0.146
Neg logit contributions
 buck-0.146
 F-0.144
 training-0.142
 Kenny-0.140
 Kel-0.138
Token like
Token ID588
Feature activation+1.81e+00

Pos logit contributions
castle+0.237
ide+0.225
 structures+0.220
room+0.214
ull+0.212
Neg logit contributions
 Kel-0.175
 Hanna-0.175
 Buster-0.172
 Kenny-0.172
 F-0.170
Token to
Token ID284
Feature activation-4.47e-02

Pos logit contributions
 buck+0.108
 F+0.107
 training+0.103
 Kenny+0.102
 Buster+0.100
Neg logit contributions
castle-0.194
ide-0.188
room-0.178
ull-0.178
 structures-0.175
Token eat
Token ID4483
Feature activation-2.90e+00

Pos logit contributions
castle+0.004
 structures+0.004
ull+0.004
ide+0.004
ocado+0.004
Neg logit contributions
 buck-0.003
 F-0.003
 training-0.003
 Hanna-0.003
 Hannah-0.003
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
castle+0.281
ide+0.269
ull+0.259
room+0.256
 structures+0.250
Neg logit contributions
 F-0.198
 Kenny-0.192
 buck-0.191
 luggage-0.191
 Kel-0.190
Token.
Token ID13
Feature activation+7.90e-02

Pos logit contributions
ull+0.900
castle+0.880
 structures+0.857
 pulp+0.843
ide+0.817
Neg logit contributions
 F-0.454
 buck-0.418
 Hannah-0.411
 ahead-0.411
 pup-0.409
Token One
Token ID1881
Feature activation+8.72e-01

Pos logit contributions
 Kenny+0.005
 F+0.005
 Kel+0.005
 Cindy+0.004
 Hank+0.004
Neg logit contributions
castle-0.009
ide-0.008
ull-0.008
ics-0.008
 pulp-0.007
Token day
Token ID1110
Feature activation-6.62e-01

Pos logit contributions
 pup+0.031
 F+0.026
 ahead+0.024
 buck+0.022
 luggage+0.019
Neg logit contributions
castle-0.116
 structures-0.113
ics-0.108
ide-0.107
room-0.106
Token,
Token ID11
Feature activation+1.80e-01

Pos logit contributions
castle+0.064
ide+0.061
 structures+0.060
room+0.058
ull+0.058
Neg logit contributions
 Kel-0.045
 Kenny-0.045
 Hanna-0.044
 F-0.044
 Buster-0.044
Token his
Token ID465
Feature activation-1.10e+00

Pos logit contributions
 Kenny+0.013
 Kel+0.013
 F+0.013
 Buster+0.013
 Cindy+0.013
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.015
ull-0.015
room-0.015
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
castle+0.051
ide+0.048
room+0.047
 structures+0.046
 salad+0.046
Neg logit contributions
 Kel-0.057
 Hanna-0.056
 Kenny-0.055
uster-0.055
 Buster-0.055
Token he
Token ID339
Feature activation+8.91e-01

Pos logit contributions
 Kel+0.014
 Hanna+0.013
 Kenny+0.013
 Buster+0.013
 Hannah+0.013
Neg logit contributions
castle-0.019
ide-0.018
 structures-0.018
room-0.017
ull-0.017
Token was
Token ID373
Feature activation+5.43e-01

Pos logit contributions
 buck+0.062
 pup+0.059
 F+0.058
 Hanna+0.058
 Buster+0.057
Neg logit contributions
castle-0.087
 structures-0.085
ide-0.085
room-0.078
 salad-0.077
Token making
Token ID1642
Feature activation-1.91e+00

Pos logit contributions
 Kel+0.035
 Hanna+0.035
 Kenny+0.035
 Buster+0.034
 Hannah+0.034
Neg logit contributions
castle-0.055
ide-0.052
 structures-0.051
room-0.050
ull-0.050
Token a
Token ID257
Feature activation+2.77e-01

Pos logit contributions
castle+0.190
ide+0.183
 structures+0.176
room+0.174
ull+0.171
Neg logit contributions
 Kel-0.125
 Hanna-0.124
 Kenny-0.123
 Buster-0.123
 F-0.122
Token salad
Token ID20698
Feature activation-8.36e+00

Pos logit contributions
 pup+0.020
 luggage+0.019
 Buster+0.019
 Hanna+0.019
 Kenny+0.019
Neg logit contributions
ide-0.026
castle-0.025
room-0.024
ull-0.024
 structures-0.024
Token.
Token ID13
Feature activation+7.90e-02

Pos logit contributions
 structures+0.886
ide+0.879
ull+0.869
castle+0.856
room+0.845
Neg logit contributions
 buck-0.425
 ahead-0.416
 pup-0.415
 F-0.402
 training-0.400
Token He
Token ID679
Feature activation+1.15e-01

Pos logit contributions
 F+0.006
 Kenny+0.006
 Kel+0.006
 Buster+0.006
 Hanna+0.006
Neg logit contributions
castle-0.008
ide-0.007
 structures-0.007
ull-0.007
room-0.006
Token filled
Token ID5901
Feature activation-4.66e-02

Pos logit contributions
 buck+0.008
 F+0.007
 pup+0.007
 Buster+0.006
 Hanna+0.006
Neg logit contributions
castle-0.012
 structures-0.012
ide-0.011
room-0.011
ics-0.010
Token it
Token ID340
Feature activation+6.65e-01

Pos logit contributions
castle+0.006
ide+0.005
ill+0.005
iders+0.005
anish+0.005
Neg logit contributions
 buck-0.002
 ahead-0.001
 Cindy-0.001
 Robbie-0.001
 F-0.001
Token with
Token ID351
Feature activation-1.60e+00

Pos logit contributions
 buck+0.035
 ahead+0.034
 wrapping+0.031
 F+0.030
 training+0.030
Neg logit contributions
ull-0.066
castle-0.065
room-0.065
roc-0.064
ill-0.062
Token in
Token ID287
Feature activation-4.66e-01

Pos logit contributions
 Kel+0.478
 Hanna+0.475
 Kenny+0.457
 Cindy+0.456
 Hannah+0.454
Neg logit contributions
castle-0.471
ide-0.453
 structures-0.432
room-0.429
 salad-0.427
Token the
Token ID262
Feature activation-5.19e-02

Pos logit contributions
castle+0.034
 structures+0.033
ide+0.033
room+0.032
 salad+0.032
Neg logit contributions
 Hanna-0.041
uster-0.041
 Kel-0.041
 hesitation-0.040
 Cindy-0.040
Token park
Token ID3952
Feature activation+7.10e-01

Pos logit contributions
castle+0.004
ide+0.004
 structures+0.004
room+0.004
ull+0.004
Neg logit contributions
 Kel-0.004
 Hanna-0.004
 Kenny-0.004
 Buster-0.004
 Hannah-0.004
Token.
Token ID13
Feature activation+9.54e-02

Pos logit contributions
 F+0.043
 pup+0.042
 buck+0.041
 Kenny+0.041
 Buster+0.040
Neg logit contributions
castle-0.073
ide-0.068
 structures-0.068
ull-0.068
 salad-0.066
Token The
Token ID383
Feature activation-5.30e-01

Pos logit contributions
 Kenny+0.006
 F+0.006
 Kel+0.006
 Buster+0.006
 Cindy+0.006
Neg logit contributions
castle-0.010
ide-0.010
 structures-0.009
ull-0.009
 salad-0.009
Token puppy
Token ID26188
Feature activation+5.67e+00

Pos logit contributions
castle+0.051
 salad+0.049
room+0.049
 structures+0.048
ide+0.047
Neg logit contributions
 hesitation-0.034
 Kel-0.033
uster-0.033
 Hanna-0.032
Michael-0.032
Token was
Token ID373
Feature activation+5.59e-01

Pos logit contributions
 Kel+0.434
 Hanna+0.432
 Kenny+0.423
 hesitation+0.421
 Buster+0.421
Neg logit contributions
castle-0.500
ide-0.480
 structures-0.463
room-0.452
ull-0.450
Token sad
Token ID6507
Feature activation-9.63e-01

Pos logit contributions
 Hanna+0.047
 Kel+0.046
uster+0.046
 Kenny+0.045
Michael+0.045
Neg logit contributions
castle-0.043
ide-0.041
ull-0.040
 salad-0.040
 structures-0.040
Token and
Token ID290
Feature activation+2.01e-01

Pos logit contributions
castle+0.083
ide+0.079
 structures+0.076
room+0.074
ull+0.074
Neg logit contributions
 Kel-0.077
 Hanna-0.077
 Kenny-0.075
 Buster-0.075
 Cindy-0.075
Token hungry
Token ID14720
Feature activation-2.34e+00

Pos logit contributions
 Kel+0.015
 Kenny+0.015
 Buster+0.015
 Hanna+0.015
 F+0.015
Neg logit contributions
castle-0.018
ide-0.018
 structures-0.017
room-0.017
ull-0.016
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
castle+0.183
ide+0.181
 structures+0.171
room+0.170
 salad+0.169
Neg logit contributions
 Kel-0.206
 Hanna-0.205
 Cindy-0.197
 Haley-0.197
 Kenny-0.196
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
 Kel+0.083
 Kenny+0.080
 Hanna+0.078
 Buster+0.078
 Eleanor+0.077
Neg logit contributions
castle-0.064
ide-0.062
 salad-0.058
room-0.058
ull-0.056
Token
Token ID220
Feature activation+1.27e+00

Pos logit contributions
 Hanna+0.010
 Kel+0.009
 Kenny+0.009
 Buster+0.009
 Hannah+0.009
Neg logit contributions
castle-0.015
ide-0.014
 structures-0.014
room-0.013
ull-0.013
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
 Kel+0.084
 Kenny+0.084
 Hanna+0.083
 Buster+0.083
 F+0.082
Neg logit contributions
castle-0.130
ide-0.121
 structures-0.120
 salad-0.116
ull-0.115
Token
Token ID198
Feature activation-4.69e-01

Pos logit contributions
castle+0.048
ide+0.044
 structures+0.044
room+0.042
 salad+0.042
Neg logit contributions
 Hanna-0.032
 Kel-0.032
 Kenny-0.032
 Buster-0.031
 Hannah-0.031
TokenThe
Token ID464
Feature activation-6.37e-02

Pos logit contributions
castle+0.042
ide+0.040
 structures+0.039
room+0.038
 salad+0.037
Neg logit contributions
 Hanna-0.035
 Kel-0.035
 Buster-0.035
 Kenny-0.035
 Hannah-0.035
Token puppy
Token ID26188
Feature activation+5.71e+00

Pos logit contributions
 salad+0.004
 structure+0.004
 team+0.004
 nation+0.004
 image+0.004
Neg logit contributions
uster-0.006
 Kel-0.006
 deter-0.005
 Hanna-0.005
 buck-0.005
Token was
Token ID373
Feature activation+6.00e-01

Pos logit contributions
 Kel+0.456
 Hanna+0.445
 Kenny+0.444
 Buster+0.441
 Cindy+0.439
Neg logit contributions
castle-0.488
ide-0.465
ull-0.437
 structures-0.436
 salad-0.433
Token much
Token ID881
Feature activation-4.17e-01

Pos logit contributions
 Kel+0.049
 Kenny+0.049
 Buster+0.048
 Hanna+0.048
 pup+0.048
Neg logit contributions
castle-0.049
ide-0.047
ull-0.045
room-0.043
 salad-0.043
Token cheer
Token ID14042
Feature activation+3.60e-01

Pos logit contributions
castle+0.036
 salad+0.034
ide+0.033
ull+0.033
 structures+0.033
Neg logit contributions
 Kel-0.034
 Kenny-0.032
 Hanna-0.032
 Buster-0.032
 F-0.032
Tokenfull
Token ID12853
Feature activation-5.68e-01

Pos logit contributions
 Kel+0.027
 Hanna+0.026
 buck+0.026
 Kenny+0.026
 hesitation+0.026
Neg logit contributions
castle-0.033
ide-0.032
room-0.030
 structures-0.030
ull-0.030
Tokener
Token ID263
Feature activation+5.16e-01

Pos logit contributions
castle+0.043
ide+0.042
ull+0.038
 salad+0.038
room+0.038
Neg logit contributions
 Kel-0.052
 Kenny-0.051
 Hanna-0.050
 Buster-0.050
 Cindy-0.050
Token was
Token ID373
Feature activation+5.70e-01

Pos logit contributions
uster+0.063
 reins+0.060
 hesitation+0.060
 training+0.059
 luggage+0.058
Neg logit contributions
castle-0.041
ide-0.041
ull-0.039
room-0.036
 structures-0.035
Token Grand
Token ID5675
Feature activation+3.03e+00

Pos logit contributions
uster+0.051
 reins+0.050
 Eleanor+0.048
ha+0.047
 Kel+0.047
Neg logit contributions
ull-0.039
castle-0.039
ide-0.038
room-0.037
 salad-0.035
Tokenma
Token ID2611
Feature activation+1.04e+00

Pos logit contributions
 reins+0.269
 Buster+0.269
uster+0.264
 Hanna+0.263
 Bruno+0.260
Neg logit contributions
castle-0.244
ide-0.232
room-0.227
ull-0.218
ill-0.204
Token!
Token ID0
Feature activation+6.55e-01

Pos logit contributions
 Kel+0.099
 hesitation+0.099
 Hanna+0.099
uster+0.099
 reins+0.098
Neg logit contributions
ide-0.067
castle-0.065
room-0.058
ull-0.057
ics-0.053
Token Little
Token ID7703
Feature activation+2.79e+00

Pos logit contributions
 hesitation+0.069
 training+0.065
 luggage+0.063
 buck+0.062
 reins+0.062
Neg logit contributions
room-0.035
ide-0.031
ull-0.031
castle-0.028
ill-0.028
Token B
Token ID347
Feature activation+5.38e+00

Pos logit contributions
 reins+0.222
 training+0.208
 hesitation+0.200
 saddle+0.195
leaf+0.189
Neg logit contributions
castle-0.196
room-0.180
ull-0.169
ide-0.167
imm-0.165
Token was
Token ID373
Feature activation+5.58e-01

Pos logit contributions
 hesitation+0.453
 training+0.402
 reins+0.389
 luggage+0.383
 buck+0.379
Neg logit contributions
castle-0.476
ide-0.468
room-0.455
ull-0.424
ics-0.401
Token so
Token ID523
Feature activation-7.28e-01

Pos logit contributions
uster+0.050
 reins+0.048
ha+0.047
 Kel+0.047
 Eleanor+0.047
Neg logit contributions
castle-0.039
ide-0.037
ull-0.037
room-0.036
ics-0.034
Token excited
Token ID6568
Feature activation-3.27e-01

Pos logit contributions
castle+0.053
ide+0.050
ull+0.048
room+0.047
 structures+0.047
Neg logit contributions
 Buster-0.065
 Kel-0.065
 Hanna-0.065
 Kenny-0.065
uster-0.065
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
ide+0.023
castle+0.023
 salad+0.020
 structures+0.020
room+0.019
Neg logit contributions
 reins-0.031
uster-0.030
 Hanna-0.030
Char-0.030
 training-0.029
Token see
Token ID766
Feature activation-7.62e-01

Pos logit contributions
castle+0.001
ide+0.001
room+0.001
ull+0.001
 structures+0.001
Neg logit contributions
 Kel-0.001
 Hanna-0.001
uster-0.001
 Kenny-0.001
 Buster-0.001
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
castle+0.114
ide+0.109
 structures+0.106
ull+0.104
room+0.103
Neg logit contributions
 Kel-0.088
 Hanna-0.087
 Kenny-0.086
 Buster-0.085
 Hannah-0.085
Token put
Token ID1234
Feature activation+1.31e+00

Pos logit contributions
 Kel+0.017
 Kenny+0.016
 Hanna+0.016
 Buster+0.016
 Cindy+0.016
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.017
 salad-0.016
room-0.016
Token on
Token ID319
Feature activation+1.78e+00

Pos logit contributions
 Kel+0.116
Michael+0.114
 Kenny+0.114
 hesitation+0.113
uster+0.112
Neg logit contributions
castle-0.100
 salad-0.095
 structures-0.094
ide-0.091
ull-0.091
Token their
Token ID511
Feature activation-3.83e-01

Pos logit contributions
 Kel+0.180
uster+0.178
 Kenny+0.178
 Hanna+0.177
 hesitation+0.177
Neg logit contributions
castle-0.112
 structures-0.108
ide-0.105
 salad-0.105
room-0.098
Token p
Token ID279
Feature activation+2.28e+00

Pos logit contributions
castle+0.035
ide+0.033
 structures+0.033
 salad+0.032
room+0.032
Neg logit contributions
 Kel-0.028
 Hanna-0.028
 Kenny-0.028
 Buster-0.028
 Hannah-0.027
Tokenaj
Token ID1228
Feature activation+5.55e+00

Pos logit contributions
 Kel+0.137
 Hanna+0.136
 buck+0.133
 Kenny+0.133
 luggage+0.131
Neg logit contributions
castle-0.232
ide-0.227
ull-0.220
room-0.218
 structures-0.217
Tokenamas
Token ID17485
Feature activation-3.09e+00

Pos logit contributions
 Hanna+0.442
 Hannah+0.426
 Kenny+0.425
 Buster+0.424
 Cindy+0.419
Neg logit contributions
castle-0.463
ull-0.453
ide-0.433
 structures-0.423
 salad-0.402
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
castle+0.240
ide+0.238
room+0.225
 structures+0.207
ull+0.207
Neg logit contributions
 Kel-0.276
 Hanna-0.268
 Kenny-0.268
 Buster-0.267
 hesitation-0.266
Token They
Token ID1119
Feature activation-2.65e-01

Pos logit contributions
 Kel+0.009
 Kenny+0.009
 Buster+0.009
 Hanna+0.009
 hesitation+0.009
Neg logit contributions
castle-0.007
ide-0.007
 structures-0.007
room-0.007
 salad-0.007
Token each
Token ID1123
Feature activation-1.69e+00

Pos logit contributions
castle+0.021
ide+0.020
 salad+0.020
 structures+0.019
room+0.019
Neg logit contributions
 Kel-0.022
 Kenny-0.022
 Buster-0.022
 Cindy-0.022
 pup-0.021
Token pick
Token ID2298
Feature activation+1.37e+00

Pos logit contributions
ide+0.124
castle+0.123
room+0.122
 salad+0.118
ull+0.115
Neg logit contributions
 Kenny-0.151
 Kel-0.150
 Buster-0.146
 Cindy-0.146
 Hannah-0.142
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
castle+0.359
ide+0.342
 structures+0.333
ull+0.325
room+0.323
Neg logit contributions
 Hanna-0.296
 Kel-0.294
 Kenny-0.291
 Buster-0.290
 Hannah-0.290
Token She
Token ID1375
Feature activation-2.18e-01

Pos logit contributions
 Hanna+0.011
 Kel+0.011
 Kenny+0.011
 Hannah+0.011
 Cindy+0.011
Neg logit contributions
castle-0.013
ide-0.012
 structures-0.012
ull-0.012
room-0.011
Token showed
Token ID3751
Feature activation-1.23e+00

Pos logit contributions
castle+0.019
ide+0.018
 structures+0.017
room+0.017
ull+0.017
Neg logit contributions
 Kel-0.017
 Hanna-0.017
 Kenny-0.017
 Buster-0.017
 Hannah-0.017
Token it
Token ID340
Feature activation+7.42e-01

Pos logit contributions
castle+0.102
ide+0.099
 structures+0.097
room+0.094
ull+0.093
Neg logit contributions
 Kel-0.098
 Hanna-0.097
 hesitation-0.095
 Buster-0.095
 Kenny-0.095
Token to
Token ID284
Feature activation+4.13e-02

Pos logit contributions
 F+0.036
 training+0.035
 buck+0.035
 Hanna+0.035
 pup+0.034
Neg logit contributions
castle-0.088
ide-0.082
room-0.081
ull-0.080
 structures-0.077
Token Tommy
Token ID19919
Feature activation+5.34e+00

Pos logit contributions
 F+0.003
 Hannah+0.003
 Cindy+0.003
 Hanna+0.003
 Kenny+0.003
Neg logit contributions
castle-0.004
ide-0.004
ull-0.004
 structures-0.004
room-0.004
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
 Hanna+0.324
 Hannah+0.323
 buck+0.321
 Buster+0.316
 Cindy+0.313
Neg logit contributions
castle-0.562
ull-0.528
 structures-0.526
room-0.512
ide-0.511
Token said
Token ID531
Feature activation+7.64e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.016
 Buster+0.016
 Hannah+0.016
Neg logit contributions
castle-0.025
ide-0.024
 structures-0.023
room-0.023
ull-0.022
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
 Cindy+0.044
 Hannah+0.044
 Hanna+0.044
 F+0.043
 Kel+0.043
Neg logit contributions
castle-0.081
ide-0.080
 structures-0.077
room-0.076
ull-0.075
Token "
Token ID366
Feature activation-9.52e-01

Pos logit contributions
 Kel+0.020
 Hanna+0.020
 Kenny+0.020
 Buster+0.020
 Hannah+0.020
Neg logit contributions
castle-0.021
ide-0.020
 structures-0.020
room-0.019
ull-0.019
TokenDo
Token ID5211
Feature activation+1.27e+00

Pos logit contributions
room+0.074
ide+0.072
castle+0.071
ull+0.065
anish+0.063
Neg logit contributions
 hesitation-0.084
 Kel-0.082
 Buster-0.079
 Kenny-0.079
 luggage-0.078
Token the
Token ID262
Feature activation-9.57e-03

Pos logit contributions
 Kel+0.048
 Hannah+0.048
 F+0.047
 Kenny+0.047
 Buster+0.047
Neg logit contributions
castle-0.095
ide-0.087
 structures-0.086
room-0.082
 salad-0.081
Token story
Token ID1621
Feature activation-1.43e+00

Pos logit contributions
 salad+0.001
castle+0.001
 structures+0.000
ide+0.000
ull+0.000
Neg logit contributions
uster-0.001
 Kel-0.001
 Hanna-0.001
 Kenny-0.001
 Haley-0.001
Token is
Token ID318
Feature activation+5.34e-02

Pos logit contributions
castle+0.146
ide+0.136
ull+0.133
 salad+0.133
 structures+0.132
Neg logit contributions
 F-0.088
 Kenny-0.088
 Hannah-0.087
 Kel-0.087
 Hanna-0.085
Token that
Token ID326
Feature activation+1.05e+00

Pos logit contributions
 Kel+0.003
 Hanna+0.003
 Kenny+0.003
 Buster+0.003
 Hannah+0.003
Neg logit contributions
castle-0.005
ide-0.005
 structures-0.005
room-0.005
ull-0.005
Token sharing
Token ID7373
Feature activation-3.91e+00

Pos logit contributions
 F+0.088
 Kel+0.087
 Buster+0.087
 Kenny+0.087
 pup+0.086
Neg logit contributions
castle-0.087
ide-0.082
room-0.078
 structures-0.076
ull-0.076
Token can
Token ID460
Feature activation+1.59e+00

Pos logit contributions
castle+0.427
ide+0.403
ull+0.381
room+0.369
ics+0.364
Neg logit contributions
 Hannah-0.222
 pup-0.217
 Cindy-0.212
 Buster-0.211
 F-0.205
Token bring
Token ID2222
Feature activation+7.96e-01

Pos logit contributions
 Kel+0.132
 Hanna+0.131
 Buster+0.130
 Kenny+0.130
 Hannah+0.130
Neg logit contributions
castle-0.130
ide-0.122
 structures-0.120
ull-0.116
room-0.115
Token happiness
Token ID12157
Feature activation+1.04e+00

Pos logit contributions
 Kel+0.059
 Hanna+0.058
 Kenny+0.058
 Buster+0.057
 buck+0.057
Neg logit contributions
castle-0.072
ide-0.069
 structures-0.068
room-0.066
 salad-0.065
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
 Hanna+0.074
 Kel+0.074
 Buster+0.073
 Kenny+0.073
 Hannah+0.073
Neg logit contributions
castle-0.097
ide-0.092
 structures-0.089
room-0.088
ull-0.087
Token fun
Token ID1257
Feature activation+1.00e+00

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.017
 Buster+0.017
 Hannah+0.017
Neg logit contributions
castle-0.025
ide-0.024
 structures-0.023
room-0.023
ull-0.023
Token to
Token ID284
Feature activation+2.64e-02

Pos logit contributions
 Kel+0.083
 Hanna+0.082
 Kenny+0.082
 Buster+0.081
 Cindy+0.080
Neg logit contributions
castle-0.084
ide-0.079
 structures-0.077
room-0.075
ull-0.074
Token sit
Token ID1650
Feature activation+9.75e-01

Pos logit contributions
castle+0.060
 structures+0.059
ide+0.056
ull+0.055
room+0.055
Neg logit contributions
 buck-0.044
 training-0.042
 Hanna-0.042
 Buster-0.041
 pup-0.041
Token and
Token ID290
Feature activation+2.28e-01

Pos logit contributions
 buck+0.054
 Hanna+0.052
 F+0.052
 pup+0.052
 Kenny+0.052
Neg logit contributions
castle-0.106
ide-0.105
 structures-0.104
ull-0.098
room-0.098
Token watch
Token ID2342
Feature activation+1.20e+00

Pos logit contributions
 Kel+0.018
 Hanna+0.018
 Kenny+0.018
 Buster+0.018
 Hannah+0.018
Neg logit contributions
castle-0.019
ide-0.018
 structures-0.018
room-0.017
 salad-0.017
Token the
Token ID262
Feature activation-5.05e-02

Pos logit contributions
 Hanna+0.079
 Kenny+0.079
 F+0.078
 Buster+0.078
 Kel+0.078
Neg logit contributions
castle-0.122
ide-0.114
room-0.109
 structures-0.109
ull-0.107
Token world
Token ID995
Feature activation-1.77e+00

Pos logit contributions
castle+0.005
ide+0.004
 structures+0.004
room+0.004
ull+0.004
Neg logit contributions
 pup-0.004
 Hanna-0.004
 Kel-0.004
 Buster-0.004
 F-0.004
Token around
Token ID1088
Feature activation+1.79e+00

Pos logit contributions
castle+0.177
ide+0.162
 structures+0.160
ull+0.159
room+0.158
Neg logit contributions
 Hanna-0.116
 Kel-0.116
 F-0.115
 Kenny-0.114
 buck-0.114
Token him
Token ID683
Feature activation-6.12e-02

Pos logit contributions
 F+0.103
 Buster+0.100
 Kenny+0.098
 Hanna+0.098
 pup+0.097
Neg logit contributions
castle-0.202
ide-0.183
 structures-0.180
ull-0.175
room-0.175
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
castle+0.006
ide+0.006
 structures+0.006
ull+0.006
room+0.006
Neg logit contributions
 F-0.004
 Hanna-0.004
 pup-0.004
 buck-0.004
 Kenny-0.004
Token One
Token ID1881
Feature activation+9.12e-01

Pos logit contributions
 Kel+0.007
 Hanna+0.007
 Kenny+0.007
 Buster+0.007
 Hannah+0.007
Neg logit contributions
castle-0.010
ide-0.010
 structures-0.010
room-0.009
ull-0.009
Token night
Token ID1755
Feature activation-4.98e-01

Pos logit contributions
 Kel+0.055
 pup+0.055
 Hanna+0.055
 F+0.055
 Kenny+0.054
Neg logit contributions
castle-0.096
 structures-0.091
ide-0.090
room-0.087
 salad-0.086
Token,
Token ID11
Feature activation+2.23e-01

Pos logit contributions
castle+0.041
ide+0.040
ull+0.038
 structures+0.038
room+0.038
Neg logit contributions
 Kel-0.040
 Hanna-0.040
 Kenny-0.040
 Buster-0.039
 hesitation-0.039
Token pulled
Token ID5954
Feature activation-1.00e-02

Pos logit contributions
uster+0.020
 hesitation+0.019
 reins+0.019
 luggage+0.018
 training+0.018
Neg logit contributions
castle-0.013
 salad-0.012
ide-0.012
ull-0.011
room-0.010
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
ide+0.000
room+0.000
ull+0.000
 structures+0.000
 dragons+0.000
Neg logit contributions
 Hanna-0.001
 luggage-0.001
Matthew-0.001
 buck-0.001
 Haley-0.001
Token but
Token ID475
Feature activation+7.52e-01

Pos logit contributions
 luggage+0.020
 Hanna+0.019
 Kel+0.019
 Kenny+0.019
 Haley+0.018
Neg logit contributions
castle-0.014
ide-0.013
 salad-0.012
 structures-0.012
ull-0.012
Token it
Token ID340
Feature activation+7.01e-01

Pos logit contributions
uster+0.072
leaf+0.071
 hesitation+0.069
 reins+0.069
 freshly+0.066
Neg logit contributions
castle-0.037
ull-0.034
ide-0.033
room-0.029
ics-0.029
Token was
Token ID373
Feature activation+5.59e-01

Pos logit contributions
 hesitation+0.059
 Hanna+0.059
 Hannah+0.058
 Cindy+0.058
uster+0.057
Neg logit contributions
castle-0.056
ide-0.054
room-0.050
ull-0.050
 structures-0.049
Token too
Token ID1165
Feature activation+1.08e+00

Pos logit contributions
 Hanna+0.046
 Kel+0.045
 Haley+0.045
uster+0.045
 Hannah+0.045
Neg logit contributions
castle-0.044
ide-0.043
 salad-0.042
 structures-0.040
ull-0.039
Token rough
Token ID5210
Feature activation-8.34e-02

Pos logit contributions
 Kel+0.088
 Kenny+0.087
 Hanna+0.086
 Hannah+0.084
 hesitation+0.084
Neg logit contributions
castle-0.089
ide-0.088
ull-0.084
 salad-0.083
 structures-0.080
Token for
Token ID329
Feature activation+5.43e-01

Pos logit contributions
ide+0.006
castle+0.006
room+0.005
 structures+0.005
 salad+0.005
Neg logit contributions
 Hanna-0.008
 Kel-0.008
 Haley-0.008
 Eleanor-0.008
 Cindy-0.008
Token him
Token ID683
Feature activation-7.68e-02

Pos logit contributions
uster+0.056
fy+0.055
ience+0.054
 reins+0.053
 pup+0.052
Neg logit contributions
ide-0.023
room-0.022
 dragons-0.021
ull-0.019
ics-0.019
Token to
Token ID284
Feature activation-1.09e-02

Pos logit contributions
castle+0.006
ide+0.006
 structures+0.005
room+0.005
ull+0.005
Neg logit contributions
 Hanna-0.007
 Kel-0.007
 Cindy-0.007
 Haley-0.007
 Eleanor-0.007
Token break
Token ID2270
Feature activation-1.43e+00

Pos logit contributions
castle+0.001
ide+0.001
room+0.001
ull+0.001
 structures+0.001
Neg logit contributions
 Hanna-0.001
 Kel-0.001
 luggage-0.001
 Kenny-0.001
uster-0.001
Token the
Token ID262
Feature activation-4.29e-02

Pos logit contributions
 Kel+0.010
uster+0.010
 reins+0.010
 Hanna+0.009
 Kenny+0.009
Neg logit contributions
castle-0.007
 salad-0.006
room-0.006
 structures-0.006
ull-0.006
Token end
Token ID886
Feature activation-4.23e-01

Pos logit contributions
castle+0.003
 salad+0.003
 nation+0.003
 structures+0.003
room+0.003
Neg logit contributions
 Kel-0.004
uster-0.004
 Kenny-0.004
 Hanna-0.004
 Eleanor-0.004
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
castle+0.033
 structures+0.032
ide+0.032
ull+0.030
 salad+0.030
Neg logit contributions
 reins-0.038
 Hanna-0.037
 Kel-0.037
 buck-0.037
uster-0.037
Token they
Token ID484
Feature activation-3.86e-01

Pos logit contributions
 Buster+0.019
 Kenny+0.019
 Hannah+0.019
 Cindy+0.019
 Kel+0.019
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.015
room-0.015
ull-0.015
Token all
Token ID477
Feature activation-1.51e+00

Pos logit contributions
castle+0.031
ide+0.030
room+0.028
ull+0.028
 structures+0.028
Neg logit contributions
 Kel-0.033
 Hanna-0.032
 Kenny-0.032
 Buster-0.032
 pup-0.031
Token waved
Token ID26834
Feature activation+1.52e+00

Pos logit contributions
castle+0.125
ide+0.119
ull+0.113
 structures+0.112
room+0.112
Neg logit contributions
 Kel-0.124
 Kenny-0.123
 Hanna-0.123
 Buster-0.122
 hesitation-0.121
Token goodbye
Token ID24829
Feature activation+3.45e+00

Pos logit contributions
 Kel+0.100
 Hanna+0.096
Michael+0.095
 Kenny+0.094
 Eleanor+0.093
Neg logit contributions
castle-0.149
 structures-0.141
 salad-0.141
ide-0.141
room-0.139
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
 Kel+0.284
 Hanna+0.281
 Kenny+0.279
 Buster+0.274
 Hannah+0.272
Neg logit contributions
castle-0.291
ide-0.275
 structures-0.263
room-0.256
ull-0.255
Token wished
Token ID16555
Feature activation+1.60e+00

Pos logit contributions
 Kel+0.019
 Hanna+0.019
uster+0.019
 Kenny+0.018
 hesitation+0.018
Neg logit contributions
castle-0.018
ide-0.016
 structures-0.016
 salad-0.016
ull-0.016
Token each
Token ID1123
Feature activation-1.69e+00

Pos logit contributions
 Hanna+0.127
 Kel+0.125
 buck+0.124
 Haley+0.121
 Cindy+0.121
Neg logit contributions
castle-0.139
ide-0.133
 structures-0.130
room-0.126
ull-0.125
Token other
Token ID584
Feature activation+1.16e+00

Pos logit contributions
ide+0.129
 salad+0.125
castle+0.122
ull+0.121
 structures+0.118
Neg logit contributions
 Kel-0.159
 Hanna-0.150
 Kenny-0.149
 Cindy-0.147
 buck-0.146
Token all
Token ID477
Feature activation-1.54e+00

Pos logit contributions
castle+0.113
 salad+0.113
ide+0.109
 structures+0.106
room+0.104
Neg logit contributions
 Kel-0.138
uster-0.138
 Kenny-0.135
Michael-0.134
 Haley-0.134
Token the
Token ID262
Feature activation-5.19e-02

Pos logit contributions
castle+0.123
ide+0.120
 structures+0.117
 salad+0.114
ull+0.112
Neg logit contributions
 Hanna-0.130
 Kenny-0.130
 Buster-0.129
 Kel-0.128
uster-0.128
Token coins
Token ID10796
Feature activation+2.48e+00

Pos logit contributions
castle+0.004
 salad+0.004
 structures+0.004
 tower+0.004
 nation+0.004
Neg logit contributions
 Haley-0.004
 Hanna-0.004
uster-0.004
 Kenny-0.004
 Kel-0.004
Token you
Token ID345
Feature activation-6.04e-01

Pos logit contributions
 Kel+0.217
 Kenny+0.213
 Hanna+0.211
 Buster+0.209
 Haley+0.206
Neg logit contributions
castle-0.204
ide-0.190
 structures-0.180
 salad-0.179
ull-0.175
Token love
Token ID1842
Feature activation+4.96e+00

Pos logit contributions
castle+0.046
ide+0.044
 salad+0.043
 structures+0.042
room+0.041
Neg logit contributions
 Kel-0.056
 Kenny-0.054
 Haley-0.052
 Buster-0.052
 Hanna-0.052
Token counting
Token ID14143
Feature activation+6.21e-01

Pos logit contributions
 Kel+0.410
 Hanna+0.405
 Kenny+0.403
 Buster+0.400
 Cindy+0.397
Neg logit contributions
castle-0.413
ide-0.394
 structures-0.379
room-0.371
ull-0.367
Token".
Token ID1911
Feature activation+8.14e-01

Pos logit contributions
 Kel+0.056
 Hanna+0.055
 Buster+0.055
 Kenny+0.055
 Haley+0.054
Neg logit contributions
castle-0.048
ide-0.046
 structures-0.045
room-0.043
 salad-0.043
Token
Token ID198
Feature activation-5.12e-01

Pos logit contributions
 F+0.043
 Kenny+0.042
 Kel+0.040
 Hannah+0.040
 Haley+0.039
Neg logit contributions
castle-0.101
ide-0.088
 structures-0.087
ics-0.086
 nation-0.084
Token
Token ID198
Feature activation-5.14e-01

Pos logit contributions
castle+0.059
 structures+0.054
ide+0.053
 salad+0.051
room+0.051
Neg logit contributions
 Kenny-0.028
 Kel-0.028
 F-0.028
 Hanna-0.027
 Hannah-0.027
TokenM
Token ID44
Feature activation-3.06e+00

Pos logit contributions
castle+0.052
 structures+0.048
 salad+0.045
ide+0.044
ics+0.044
Neg logit contributions
Michael-0.035
 Kel-0.033
 Haley-0.033
 Kenny-0.033
 Hanna-0.033
Tokenia
Token ID544
Feature activation-3.53e+00

Pos logit contributions
castle+0.170
 structures+0.155
ide+0.151
room+0.143
 salad+0.143
Neg logit contributions
 Kel-0.325
 hesitation-0.321
 Kenny-0.318
 Hanna-0.318
 Haley-0.315
Token business
Token ID1597
Feature activation+1.04e+00

Pos logit contributions
castle+0.093
ide+0.088
 structures+0.085
room+0.083
ull+0.083
Neg logit contributions
 Kel-0.083
 Hanna-0.083
 Kenny-0.082
 Buster-0.082
 Hannah-0.081
Token.
Token ID13
Feature activation+9.47e-02

Pos logit contributions
 Kel+0.078
 Hanna+0.077
 Kenny+0.076
 Buster+0.076
 F+0.075
Neg logit contributions
castle-0.093
ide-0.089
 structures-0.086
ull-0.084
room-0.084
Token After
Token ID2293
Feature activation+1.67e+00

Pos logit contributions
 Kel+0.008
 hesitation+0.008
 Hanna+0.008
 buck+0.008
 pup+0.008
Neg logit contributions
castle-0.007
 structures-0.007
ide-0.007
 salad-0.007
room-0.007
Token he
Token ID339
Feature activation+9.15e-01

Pos logit contributions
uster+0.152
 Hanna+0.147
 Kel+0.146
 reins+0.145
 pup+0.142
Neg logit contributions
castle-0.126
 salad-0.125
 structures-0.116
room-0.115
ull-0.112
Token was
Token ID373
Feature activation+5.69e-01

Pos logit contributions
 Kel+0.067
 Hanna+0.067
 Kenny+0.066
 Buster+0.066
 Hannah+0.065
Neg logit contributions
castle-0.085
ide-0.081
 structures-0.079
room-0.077
ull-0.076
Token done
Token ID1760
Feature activation-1.07e+00

Pos logit contributions
 Hanna+0.042
 Kel+0.042
 Kenny+0.041
 Buster+0.041
 Hannah+0.041
Neg logit contributions
castle-0.052
ide-0.050
 structures-0.049
room-0.048
ull-0.047
Token,
Token ID11
Feature activation+2.15e-01

Pos logit contributions
castle+0.083
ide+0.074
room+0.074
 structures+0.073
 salad+0.072
Neg logit contributions
uster-0.097
 reins-0.095
 pup-0.094
 Hanna-0.093
 Kel-0.092
Token Max
Token ID5436
Feature activation+4.07e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
uster+0.017
 hesitation+0.016
 buck+0.016
Neg logit contributions
castle-0.018
 structures-0.017
 salad-0.017
ide-0.017
room-0.016
Token felt
Token ID2936
Feature activation-1.98e+00

Pos logit contributions
 Hanna+0.030
 buck+0.030
 Kel+0.030
 Kenny+0.030
 F+0.030
Neg logit contributions
castle-0.037
ide-0.036
 structures-0.035
room-0.034
ull-0.034
Token much
Token ID881
Feature activation-4.54e-01

Pos logit contributions
castle+0.184
ide+0.175
 structures+0.170
room+0.167
ull+0.167
Neg logit contributions
 Kel-0.144
 Hanna-0.142
 Kenny-0.141
 Buster-0.140
 Cindy-0.139
Token better
Token ID1365
Feature activation+1.06e+00

Pos logit contributions
castle+0.045
 salad+0.043
ide+0.042
 structures+0.042
ull+0.041
Neg logit contributions
 Kel-0.030
 Kenny-0.029
 Hanna-0.029
 Buster-0.029
 Cindy-0.029
Token time
Token ID640
Feature activation+4.61e-01

Pos logit contributions
 pup+0.011
 Hanna+0.011
Michael+0.011
 Hannah+0.010
 F+0.010
Neg logit contributions
castle-0.046
 structures-0.043
ull-0.042
room-0.041
ics-0.041
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
 F+0.025
 Kel+0.025
 Kenny+0.025
 Cindy+0.024
 Hannah+0.024
Neg logit contributions
castle-0.052
 structures-0.049
 salad-0.048
ull-0.047
ide-0.047
Token there
Token ID612
Feature activation+1.04e+00

Pos logit contributions
 Kel+0.016
 Hanna+0.016
 Kenny+0.015
 F+0.015
 Hannah+0.015
Neg logit contributions
castle-0.027
 structures-0.026
ide-0.026
 salad-0.025
room-0.025
Token was
Token ID373
Feature activation+6.09e-01

Pos logit contributions
 F+0.056
 Kenny+0.054
 Kel+0.054
 Hannah+0.050
 Hanna+0.049
Neg logit contributions
castle-0.121
 structures-0.116
 salad-0.111
ide-0.108
 pulp-0.106
Token a
Token ID257
Feature activation+3.62e-01

Pos logit contributions
 Hanna+0.034
 Kel+0.034
 F+0.033
 Cindy+0.033
 Hannah+0.033
Neg logit contributions
castle-0.069
ide-0.065
 structures-0.063
room-0.061
ull-0.061
Token very
Token ID845
Feature activation-5.09e-01

Pos logit contributions
 pup+0.024
 Hanna+0.023
 Hannah+0.023
 Cindy+0.023
 Kel+0.023
Neg logit contributions
castle-0.036
ide-0.035
 structures-0.034
room-0.034
ull-0.033
Token gifted
Token ID22527
Feature activation+9.18e-01

Pos logit contributions
castle+0.049
ide+0.048
 structures+0.046
room+0.046
ull+0.045
Neg logit contributions
 Hanna-0.034
 Kel-0.034
 pup-0.034
 Cindy-0.033
 Hannah-0.033
Token dinosaur
Token ID28257
Feature activation-1.51e+00

Pos logit contributions
 Kel+0.070
 Hanna+0.069
 Kenny+0.069
 Buster+0.068
 Hannah+0.068
Neg logit contributions
castle-0.082
ide-0.078
 structures-0.075
room-0.074
ull-0.073
Token named
Token ID3706
Feature activation+1.12e+00

Pos logit contributions
castle+0.157
 structures+0.150
ide+0.150
room+0.143
ics+0.140
Neg logit contributions
 F-0.089
 pup-0.088
 buck-0.081
 Buster-0.080
 Kenny-0.080
Token D
Token ID360
Feature activation+2.49e+00

Pos logit contributions
 Kel+0.088
 Hanna+0.088
 Kenny+0.087
 Buster+0.086
 Hannah+0.086
Neg logit contributions
castle-0.097
ide-0.093
 structures-0.090
room-0.088
ull-0.087
Tokenippy
Token ID41214
Feature activation+2.99e+00

Pos logit contributions
 F+0.201
 Hanna+0.201
 Hannah+0.201
 Buster+0.199
 pup+0.199
Neg logit contributions
castle-0.210
 structures-0.203
ide-0.180
 salad-0.177
ull-0.177
Token and
Token ID290
Feature activation+2.05e-01

Pos logit contributions
 Kel+0.014
 Hanna+0.014
 buck+0.014
 Kenny+0.014
 Buster+0.014
Neg logit contributions
castle-0.020
ide-0.019
 structures-0.018
room-0.018
ull-0.018
Token played
Token ID2826
Feature activation-1.33e+00

Pos logit contributions
 Kel+0.016
 Hanna+0.016
 Buster+0.015
 Kenny+0.015
 Hannah+0.015
Neg logit contributions
castle-0.018
ide-0.017
 structures-0.017
room-0.016
 salad-0.016
Token until
Token ID1566
Feature activation+6.28e-01

Pos logit contributions
castle+0.115
ull+0.107
ide+0.106
 structures+0.100
 salad+0.098
Neg logit contributions
 Kenny-0.101
 Hanna-0.101
 F-0.101
 Hannah-0.100
 Buster-0.100
Token it
Token ID340
Feature activation+6.92e-01

Pos logit contributions
 Kel+0.048
uster+0.047
 Kenny+0.047
 Buster+0.047
 Hanna+0.046
Neg logit contributions
castle-0.057
ide-0.053
 salad-0.052
 structures-0.052
ull-0.051
Token was
Token ID373
Feature activation+5.57e-01

Pos logit contributions
 F+0.039
 Hanna+0.038
 buck+0.037
 Kel+0.037
 Kenny+0.037
Neg logit contributions
 structures-0.076
castle-0.074
 salad-0.070
ide-0.069
ull-0.066
Token time
Token ID640
Feature activation+4.06e-01

Pos logit contributions
 Hanna+0.027
 Cindy+0.026
 Buster+0.025
 Hannah+0.025
 F+0.025
Neg logit contributions
castle-0.066
 structures-0.063
ide-0.063
room-0.062
ull-0.060
Token for
Token ID329
Feature activation+5.40e-01

Pos logit contributions
 Kel+0.038
 Kenny+0.038
 Hanna+0.037
 Buster+0.037
Michael+0.037
Neg logit contributions
castle-0.030
ide-0.028
 salad-0.027
ull-0.026
 structures-0.026
Token bed
Token ID3996
Feature activation+1.44e+00

Pos logit contributions
 Kel+0.043
uster+0.041
 Kenny+0.041
 pup+0.041
 Buster+0.041
Neg logit contributions
castle-0.045
ide-0.044
 salad-0.043
 structures-0.043
room-0.042
Token.
Token ID13
Feature activation+8.28e-02

Pos logit contributions
 Kel+0.111
 Hanna+0.110
 Kenny+0.109
 Buster+0.109
 Cindy+0.108
Neg logit contributions
castle-0.127
ide-0.121
 structures-0.117
room-0.114
ull-0.113
Token\n
Token ID198
Feature activation-5.13e-01

Pos logit contributions
 Hannah+0.006
 Hanna+0.006
 Cindy+0.006
 F+0.006
 Kel+0.006
Neg logit contributions
castle-0.009
ide-0.008
 structures-0.008
ull-0.008
 salad-0.007
Token\n
Token ID198
Feature activation-5.16e-01

Pos logit contributions
castle+0.053
 structures+0.049
ide+0.049
 salad+0.047
room+0.046
Neg logit contributions
 Kel-0.034
 Kenny-0.034
 Hanna-0.033
 Buster-0.033
 F-0.033
Token.
Token ID13
Feature activation+8.84e-02

Pos logit contributions
 Kel+0.141
 Hanna+0.137
 Kenny+0.137
uster+0.137
 Buster+0.135
Neg logit contributions
castle-0.125
ide-0.113
 structures-0.112
 salad-0.108
room-0.107
Token Her
Token ID2332
Feature activation-1.06e+00

Pos logit contributions
 Kenny+0.007
 F+0.007
 Cindy+0.007
 Hannah+0.007
 Kel+0.007
Neg logit contributions
castle-0.009
ide-0.008
 structures-0.008
ull-0.008
 salad-0.007
Token friends
Token ID2460
Feature activation+7.36e-02

Pos logit contributions
castle+0.089
 structures+0.082
 salad+0.080
 nation+0.076
ull+0.075
Neg logit contributions
 Hanna-0.093
 Kel-0.092
uster-0.091
 Kenny-0.088
 Hannah-0.088
Token saw
Token ID2497
Feature activation-9.12e-01

Pos logit contributions
 Kel+0.007
 Hanna+0.007
uster+0.007
 Buster+0.007
 Kenny+0.007
Neg logit contributions
castle-0.005
ide-0.005
room-0.005
ull-0.004
 structures-0.004
Token the
Token ID262
Feature activation-7.46e-02

Pos logit contributions
castle+0.087
ide+0.084
 structures+0.081
room+0.079
ull+0.079
Neg logit contributions
 Kel-0.064
 Hanna-0.063
 Kenny-0.063
 Buster-0.062
 Hannah-0.062
Token change
Token ID1487
Feature activation-2.50e-01

Pos logit contributions
castle+0.007
 salad+0.006
 structures+0.006
ide+0.006
ull+0.006
Neg logit contributions
 Kel-0.006
 Kenny-0.006
 Hanna-0.006
 F-0.006
 Eleanor-0.006
Token in
Token ID287
Feature activation-4.77e-01

Pos logit contributions
castle+0.022
ide+0.021
 structures+0.020
room+0.020
ull+0.020
Neg logit contributions
 Kel-0.019
 Hanna-0.019
 Kenny-0.019
 Buster-0.019
 F-0.019
Token her
Token ID607
Feature activation-9.77e-01

Pos logit contributions
 salad+0.035
castle+0.034
 structures+0.034
room+0.034
ide+0.032
Neg logit contributions
uster-0.042
 Hanna-0.041
 Kel-0.041
 Kenny-0.039
Michael-0.039
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
castle+0.077
 structures+0.070
 salad+0.068
ide+0.067
 nation+0.067
Neg logit contributions
 Kel-0.088
 Kenny-0.086
 Hanna-0.086
 Buster-0.084
Michael-0.084
Token soon
Token ID2582
Feature activation+1.19e+00

Pos logit contributions
 Kel+0.014
 Hanna+0.014
 Kenny+0.014
 Buster+0.014
 Hannah+0.014
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.016
room-0.016
ull-0.015
Token she
Token ID673
Feature activation+2.45e-01

Pos logit contributions
 Hanna+0.094
 Kel+0.094
 Hannah+0.092
 Cindy+0.092
 Buster+0.092
Neg logit contributions
castle-0.104
ide-0.099
 structures-0.096
room-0.095
ull-0.093
Token upon
Token ID2402
Feature activation+1.67e+00

Pos logit contributions
castle+0.078
ide+0.074
 structures+0.073
room+0.072
ull+0.071
Neg logit contributions
 Kel-0.053
 Hanna-0.053
 Kenny-0.052
 Hannah-0.052
 Buster-0.052
Token a
Token ID257
Feature activation+3.15e-01

Pos logit contributions
 Hanna+0.096
 Kel+0.095
 Kenny+0.094
 F+0.094
 Hannah+0.093
Neg logit contributions
castle-0.181
ide-0.171
 structures-0.169
room-0.163
ull-0.162
Token time
Token ID640
Feature activation+4.33e-01

Pos logit contributions
 Kel+0.017
 Hanna+0.017
 Kenny+0.016
 Buster+0.016
 Hannah+0.016
Neg logit contributions
castle-0.035
ide-0.033
 structures-0.033
room-0.032
ull-0.032
Token there
Token ID612
Feature activation+1.00e+00

Pos logit contributions
 Kel+0.031
 Hanna+0.031
 Kenny+0.031
 Buster+0.031
 Hannah+0.031
Neg logit contributions
castle-0.040
ide-0.038
 structures-0.037
room-0.036
ull-0.036
Token was
Token ID373
Feature activation+5.71e-01

Pos logit contributions
 Kel+0.070
 Kenny+0.070
 F+0.068
 Hanna+0.068
 Hannah+0.067
Neg logit contributions
castle-0.097
 structures-0.091
ide-0.090
 salad-0.087
ull-0.086
Token a
Token ID257
Feature activation+3.07e-01

Pos logit contributions
 Kel+0.039
 Hanna+0.038
 Kenny+0.038
 Buster+0.037
 Hannah+0.037
Neg logit contributions
castle-0.056
ide-0.054
 structures-0.052
room-0.051
ull-0.051
Token happy
Token ID3772
Feature activation-2.20e+00

Pos logit contributions
 Kel+0.023
 Hanna+0.022
 Kenny+0.022
 Buster+0.022
 Cindy+0.022
Neg logit contributions
castle-0.028
ide-0.027
 structures-0.026
room-0.026
ull-0.025
Token little
Token ID1310
Feature activation+1.85e-01

Pos logit contributions
castle+0.209
ide+0.200
 structures+0.196
room+0.190
ull+0.187
Neg logit contributions
 Hanna-0.155
 pup-0.155
 Kel-0.154
 Cindy-0.153
 Hannah-0.153
Token girl
Token ID2576
Feature activation+4.93e-01

Pos logit contributions
 Kel+0.016
 Kenny+0.015
 hesitation+0.015
 Hanna+0.015
 training+0.015
Neg logit contributions
castle-0.015
 salad-0.014
ide-0.014
ull-0.013
 structures-0.013
Token who
Token ID508
Feature activation-1.96e-01

Pos logit contributions
 Kel+0.043
 Kenny+0.042
 Hanna+0.042
 Buster+0.042
 hesitation+0.042
Neg logit contributions
castle-0.038
ide-0.037
room-0.036
ull-0.035
 structures-0.035
Token loved
Token ID6151
Feature activation-2.55e-01

Pos logit contributions
ide+0.012
castle+0.012
room+0.011
 structures+0.011
ull+0.011
Neg logit contributions
 hesitation-0.020
uster-0.020
 Kel-0.020
 Kenny-0.020
 Buster-0.019
Token looked
Token ID3114
Feature activation-1.90e+00

Pos logit contributions
castle+0.037
 structures+0.036
room+0.034
ide+0.034
ull+0.033
Neg logit contributions
 F-0.028
 hesitation-0.028
 Hannah-0.027
 Kel-0.027
 Kenny-0.027
Token different
Token ID1180
Feature activation+9.30e-01

Pos logit contributions
castle+0.165
ide+0.156
 structures+0.153
 salad+0.149
ull+0.146
Neg logit contributions
 Kel-0.151
 Hanna-0.148
 Kenny-0.148
 Buster-0.146
 F-0.144
Token,
Token ID11
Feature activation+2.23e-01

Pos logit contributions
 Kel+0.071
 Hanna+0.071
 Kenny+0.070
 Buster+0.070
 Hannah+0.069
Neg logit contributions
castle-0.083
ide-0.079
 structures-0.077
room-0.075
ull-0.074
Token they
Token ID484
Feature activation-3.89e-01

Pos logit contributions
uster+0.018
 Kel+0.018
 Hanna+0.018
 training+0.018
 buck+0.017
Neg logit contributions
castle-0.018
 salad-0.017
 structures-0.017
ide-0.017
ull-0.016
Token still
Token ID991
Feature activation+1.36e+00

Pos logit contributions
castle+0.036
 structures+0.036
ide+0.035
room+0.035
ull+0.033
Neg logit contributions
 hesitation-0.026
 buck-0.026
 Buster-0.026
 Hanna-0.026
 Cindy-0.026
Token tasted
Token ID29187
Feature activation-5.05e+00

Pos logit contributions
 Kel+0.106
 Hanna+0.105
 Kenny+0.105
 Buster+0.104
 Hannah+0.104
Neg logit contributions
castle-0.118
ide-0.113
 structures-0.110
room-0.108
ull-0.106
Token just
Token ID655
Feature activation+1.23e+00

Pos logit contributions
castle+0.457
ide+0.439
 structures+0.426
 salad+0.414
ull+0.413
Neg logit contributions
 Kel-0.377
 Hanna-0.373
 Kenny-0.370
 Buster-0.366
 Hannah-0.362
Token as
Token ID355
Feature activation+5.71e-02

Pos logit contributions
 Kenny+0.098
 Kel+0.098
 Hanna+0.097
 Buster+0.097
Michael+0.095
Neg logit contributions
castle-0.107
 salad-0.100
 structures-0.100
ide-0.100
ull-0.095
Token delicious
Token ID12625
Feature activation-6.80e-01

Pos logit contributions
 Hannah+0.004
 Hanna+0.004
 Cindy+0.004
 buck+0.004
 F+0.004
Neg logit contributions
castle-0.005
room-0.005
ide-0.005
 structures-0.005
anish-0.005
Token as
Token ID355
Feature activation+6.98e-02

Pos logit contributions
castle+0.065
ide+0.062
 structures+0.060
 salad+0.060
ull+0.058
Neg logit contributions
 Kel-0.049
 Hanna-0.048
 Kenny-0.048
 Buster-0.047
 buck-0.046
Token before
Token ID878
Feature activation+1.21e+00

Pos logit contributions
 Kel+0.006
 Hanna+0.006
 Kenny+0.006
 Buster+0.006
 Cindy+0.005
Neg logit contributions
castle-0.006
ide-0.006
 structures-0.005
 salad-0.005
ull-0.005
Token The
Token ID383
Feature activation-5.24e-01

Pos logit contributions
 hesitation+0.009
 buck+0.009
 Kel+0.009
 training+0.009
 luggage+0.009
Neg logit contributions
castle-0.008
ide-0.008
room-0.008
 structures-0.008
 salad-0.007
Token salmon
Token ID24535
Feature activation+1.27e+00

Pos logit contributions
castle+0.038
 salad+0.036
 structures+0.034
ull+0.033
 pie+0.032
Neg logit contributions
Michael-0.051
 Hanna-0.051
 Kel-0.051
 Kenny-0.051
 Kathy-0.050
Token used
Token ID973
Feature activation+3.43e-02

Pos logit contributions
 Hanna+0.101
 Kel+0.101
 Kenny+0.099
 Cindy+0.098
 Hannah+0.098
Neg logit contributions
castle-0.110
ide-0.104
 structures-0.100
room-0.098
ull-0.098
Token all
Token ID477
Feature activation-1.53e+00

Pos logit contributions
 Hanna+0.003
 Kel+0.003
 Haley+0.003
 Kenny+0.003
Michael+0.003
Neg logit contributions
castle-0.003
ide-0.003
 structures-0.003
 salad-0.002
room-0.002
Token of
Token ID286
Feature activation+8.02e-01

Pos logit contributions
ide+0.117
castle+0.113
 structures+0.110
room+0.108
ull+0.107
Neg logit contributions
 Hanna-0.133
 Cindy-0.128
 Kel-0.128
 Hannah-0.127
 Kenny-0.127
Token its
Token ID663
Feature activation-4.41e+00

Pos logit contributions
 Hanna+0.065
 Kel+0.065
 buck+0.065
 Kenny+0.064
 pup+0.064
Neg logit contributions
ide-0.065
castle-0.063
 structures-0.061
room-0.060
ull-0.059
Token strength
Token ID4202
Feature activation+1.79e+00

Pos logit contributions
castle+0.383
 structures+0.369
ide+0.363
 salad+0.354
ull+0.341
Neg logit contributions
 Hanna-0.349
 Kel-0.347
 Hannah-0.342
 Kenny-0.340
 Cindy-0.337
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
 Hanna+0.160
 Kel+0.157
 Kenny+0.154
 Haley+0.154
 Eleanor+0.153
Neg logit contributions
castle-0.138
ide-0.134
 structures-0.122
room-0.120
ull-0.118
Token started
Token ID2067
Feature activation+6.49e-01

Pos logit contributions
uster+0.017
 Hanna+0.017
 Haley+0.016
 Kel+0.016
 Hannah+0.016
Neg logit contributions
castle-0.017
 structures-0.016
ide-0.015
 salad-0.015
ull-0.015
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 Hanna+0.056
uster+0.055
 Kel+0.053
 luggage+0.053
 Haley+0.053
Neg logit contributions
castle-0.051
 structures-0.048
ide-0.047
ull-0.044
ics-0.043
Token swim
Token ID9422
Feature activation+2.08e-01

Pos logit contributions
castle+0.001
ide+0.001
 structures+0.001
room+0.001
ull+0.001
Neg logit contributions
 Hanna-0.001
uster-0.001
 Kel-0.001
 Hannah-0.001
 luggage-0.001
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
Michael+0.147
 buck+0.146
 Hanna+0.145
 luggage+0.144
 reins+0.144
Neg logit contributions
ide-0.124
castle-0.120
room-0.117
ull-0.110
 structures-0.105
Token the
Token ID262
Feature activation-5.26e-02

Pos logit contributions
 reins+0.020
uster+0.020
 training+0.019
 hesitation+0.019
 luggage+0.019
Neg logit contributions
castle-0.012
 salad-0.012
room-0.011
ide-0.011
ull-0.011
Token tall
Token ID7331
Feature activation-1.65e+00

Pos logit contributions
castle+0.004
 salad+0.003
 structure+0.003
 nation+0.003
 tower+0.003
Neg logit contributions
 Kel-0.005
 Kenny-0.005
 Buster-0.005
uster-0.005
 Hanna-0.005
Token man
Token ID582
Feature activation+7.22e-01

Pos logit contributions
castle+0.134
ide+0.119
 structures+0.119
 salad+0.117
 tower+0.116
Neg logit contributions
 Kenny-0.145
 Kel-0.145
 Buster-0.143
 Hanna-0.142
 F-0.139
Token was
Token ID373
Feature activation+5.66e-01

Pos logit contributions
 Kel+0.059
 Hanna+0.058
 Kenny+0.058
 Cindy+0.057
 Buster+0.057
Neg logit contributions
castle-0.061
ide-0.057
ull-0.055
room-0.053
 salad-0.052
Token able
Token ID1498
Feature activation-4.73e+00

Pos logit contributions
 Kel+0.047
 luggage+0.047
uster+0.047
 Kenny+0.046
 pup+0.045
Neg logit contributions
castle-0.045
ull-0.043
ide-0.041
room-0.039
ics-0.039
Token to
Token ID284
Feature activation-1.30e-02

Pos logit contributions
castle+0.321
 salad+0.298
ide+0.295
ull+0.288
 structures+0.277
Neg logit contributions
 Hanna-0.457
 Kel-0.448
 Kenny-0.446
uster-0.444
 Eleanor-0.443
Token grab
Token ID5552
Feature activation+1.61e+00

Pos logit contributions
castle+0.001
ide+0.001
room+0.001
ull+0.001
 salad+0.001
Neg logit contributions
 Kel-0.001
 luggage-0.001
 Hanna-0.001
 Kenny-0.001
 Cindy-0.001
Token the
Token ID262
Feature activation-6.29e-02

Pos logit contributions
Michael+0.155
uster+0.153
Matthew+0.152
 reins+0.152
 Hanna+0.151
Neg logit contributions
room-0.090
 structures-0.086
 salad-0.084
ide-0.083
castle-0.081
Token balloon
Token ID21190
Feature activation-8.77e-01

Pos logit contributions
castle+0.005
 salad+0.004
 structures+0.004
 tower+0.004
 structure+0.004
Neg logit contributions
 Kel-0.006
 Kenny-0.006
 Hanna-0.006
 Buster-0.006
 Cindy-0.006
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
castle+0.080
ide+0.076
 structures+0.075
room+0.073
ull+0.073
Neg logit contributions
 Kel-0.063
 Hanna-0.063
 Kenny-0.063
 Buster-0.062
 pup-0.062
Token it
Token ID340
Feature activation+6.80e-01

Pos logit contributions
 training+0.026
Matthew+0.026
 pup+0.026
 Buster+0.026
 hesitation+0.026
Neg logit contributions
castle-0.019
ide-0.019
 salad-0.019
 structures-0.018
room-0.018
Token.
Token ID13
Feature activation+8.54e-02

Pos logit contributions
 Kel+0.054
 Hanna+0.053
 Kenny+0.053
 Buster+0.052
 Cindy+0.052
Neg logit contributions
castle-0.059
ide-0.057
 structures-0.055
room-0.054
 salad-0.053
Token He
Token ID679
Feature activation+1.26e-01

Pos logit contributions
 Hannah+0.006
 Hanna+0.006
 Cindy+0.006
 Kenny+0.006
 Kel+0.006
Neg logit contributions
castle-0.008
ull-0.008
ide-0.008
 structures-0.007
room-0.007
Token took
Token ID1718
Feature activation-1.20e+00

Pos logit contributions
 Kel+0.008
 buck+0.008
 Kenny+0.008
 Hanna+0.008
 F+0.008
Neg logit contributions
castle-0.012
ide-0.012
 structures-0.012
room-0.011
ull-0.011
Token a
Token ID257
Feature activation+2.88e-01

Pos logit contributions
castle+0.102
ide+0.099
 structures+0.097
room+0.096
 salad+0.095
Neg logit contributions
 Kel-0.092
 Hanna-0.091
 hesitation-0.091
 Kenny-0.091
 Buster-0.090
Token knife
Token ID9845
Feature activation-4.84e+00

Pos logit contributions
 Kel+0.022
 Hanna+0.022
 Kenny+0.022
 F+0.022
 Buster+0.022
Neg logit contributions
castle-0.025
ide-0.024
 salad-0.023
 structures-0.023
ull-0.023
Token and
Token ID290
Feature activation+1.81e-01

Pos logit contributions
castle+0.439
ide+0.420
 structures+0.405
room+0.398
ull+0.395
Neg logit contributions
 Kel-0.361
 Hanna-0.359
 Kenny-0.355
 Buster-0.353
 Hannah-0.351
Token cut
Token ID2005
Feature activation-2.32e+00

Pos logit contributions
 Kel+0.013
 Hanna+0.013
 Kenny+0.013
 buck+0.013
 Buster+0.013
Neg logit contributions
castle-0.017
ide-0.016
 structures-0.016
room-0.015
ull-0.015
Token the
Token ID262
Feature activation-7.66e-02

Pos logit contributions
castle+0.227
ide+0.218
 structures+0.211
room+0.208
ull+0.206
Neg logit contributions
 Kel-0.156
 Hanna-0.155
 Kenny-0.153
 Buster-0.152
 Hannah-0.151
Token top
Token ID1353
Feature activation-1.24e+00

Pos logit contributions
ide+0.006
room+0.005
castle+0.005
 structures+0.005
ics+0.005
Neg logit contributions
 luggage-0.007
 buck-0.007
 pup-0.006
 hesitation-0.006
 reins-0.006
Token off
Token ID572
Feature activation+1.12e+00

Pos logit contributions
castle+0.122
ide+0.118
 structures+0.117
ull+0.117
room+0.116
Neg logit contributions
 Kel-0.073
 buck-0.073
 F-0.072
 ahead-0.070
 Kenny-0.069
Token\n
Token ID198
Feature activation-5.06e-01

Pos logit contributions
castle+0.093
 salad+0.091
 structures+0.089
ide+0.088
room+0.085
Neg logit contributions
uster-0.132
 Kel-0.130
 hesitation-0.127
 Kenny-0.127
 Buster-0.126
Token\n
Token ID198
Feature activation-5.16e-01

Pos logit contributions
castle+0.050
ide+0.047
 structures+0.046
room+0.044
 salad+0.044
Neg logit contributions
 Kel-0.035
 Hanna-0.035
 Kenny-0.034
 Buster-0.034
 F-0.034
TokenMr
Token ID5246
Feature activation+1.38e+00

Pos logit contributions
castle+0.040
ide+0.038
room+0.037
 structures+0.036
ull+0.036
Neg logit contributions
 Kel-0.045
 Kenny-0.044
 hesitation-0.044
 Hanna-0.044
 Buster-0.044
Token.
Token ID13
Feature activation+9.38e-02

Pos logit contributions
 Kel+0.106
 Hanna+0.105
 Kenny+0.104
 Buster+0.104
 Hannah+0.103
Neg logit contributions
castle-0.122
ide-0.116
 structures-0.112
room-0.110
ull-0.109
Token Snow
Token ID7967
Feature activation+6.48e+00

Pos logit contributions
 hesitation+0.005
leaf+0.005
 Kel+0.005
uster+0.005
 training+0.005
Neg logit contributions
castle-0.009
room-0.009
ull-0.009
ide-0.009
 dragons-0.008
Tokenman
Token ID805
Feature activation-4.75e+00

Pos logit contributions
 pup+0.604
 Hanna+0.599
 buck+0.584
 Hannah+0.582
 Kenny+0.576
Neg logit contributions
castle-0.449
 structures-0.443
ide-0.420
room-0.396
 salad-0.391
Token doesn
Token ID1595
Feature activation-4.49e-01

Pos logit contributions
castle+0.586
ide+0.580
room+0.551
 structures+0.551
ull+0.536
Neg logit contributions
 buck-0.200
 F-0.185
 Hanna-0.183
 Kenny-0.183
 pup-0.183
Token't
Token ID470
Feature activation+1.29e-02

Pos logit contributions
castle+0.042
ide+0.041
 structures+0.039
room+0.039
ull+0.038
Neg logit contributions
 Kel-0.032
 Hanna-0.032
 Kenny-0.031
 pup-0.031
 Buster-0.031
Token say
Token ID910
Feature activation+1.11e+00

Pos logit contributions
 Kel+0.001
 Hanna+0.001
 Kenny+0.001
 Buster+0.001
 Hannah+0.001
Neg logit contributions
castle-0.001
ide-0.001
 structures-0.001
room-0.001
ull-0.001
Token anything
Token ID1997
Feature activation-2.25e+00

Pos logit contributions
 Kel+0.089
 Hanna+0.088
 Buster+0.087
 Kenny+0.087
 buck+0.086
Neg logit contributions
castle-0.095
ide-0.090
 structures-0.087
room-0.085
ull-0.085
Token.
Token ID13
Feature activation+9.29e-02

Pos logit contributions
ide+0.149
castle+0.147
 structures+0.139
 salad+0.138
ull+0.131
Neg logit contributions
uster-0.223
 Hanna-0.216
 Kel-0.215
 Buster-0.213
 reins-0.213