TOP ACTIVATIONS
MAX = 0.097

and always tried to out run everyone else . One day
silly to try and out run a vehicle ! Always think
and Jimmy 's face sc run ched up as he ate
never be able to out run the lion . He lay
could , trying to out run the backwards me adow .
went home and enjoyed his freshly caught fish . <|endoftext|> Once
to play and explore the freshly organized lawn . <|endoftext|> Once
ble , and it sh imm ered in the sunshine .
was really excited about his upcoming adventure . <|endoftext|> Once upon
. He tried to out run the train but he was
the road and it gl imm ered in the bright sunshine
led and her dress gl imm ered in the sun .
but she couldn 't out run the man . He quickly
her goal ! <|endoftext|> J imm ie was scared . He
covered the ground and gl imm ered in the sunlight .
bright colors and painting her masterpiece . She couldn 't contain
you ?" B run o said , " Of
his kick - and - ch ase game , and knew
her candy , she sc run ched her eyes tight .
was dark but it gl imm ered in the night sky

MIN ACTIVATIONS
MIN = -0.091

hopped over to the b unn ies . They were so
took off the oven m itt s and opened the oven
She grabbed oven m itt s and carefully pulled out
Bob smiled at the b unn ies and asked : "
and get it , k itt ies !" she called ,
to the cage of b unn ies .
They put on oven m itt s and took out the
ummy food for the b unn ies . It started with
. But the b unn ies didn 't wake up
grass with the loud b unn ies . <|endoftext|> Once upon
. It made the b unn ies smile and cheer .
the house saw the b unn ies up in the nest
the honey to the b unn ies ! â €
learned not to judge by appearances . They learned to help
the nest . The b unn ies could see the baby
. The b unn ies showed the elderly rabbit
put on an oven m itt and a a pron .
on top . The b unn ies wanted to have a
vegetables ! All the b unn ies were over joy ed
The b unn ies happily hopped back to

INTERVAL 0.050 - 0.097
CONTAINS 0.023%

for bright ening up his day . <|endoftext|> Once there was
sky and they spent the day playing tag and laughing in
safe journey and for their day together . When they were
special roof rack and a super bright head light . He
began to shine and tw inkle . Lily and her friend

INTERVAL 0.003 - 0.050
CONTAINS 41.139%

. Just stay still and the dog will go away .
floated and bob bed on the water . They touched the
liked to play make - bel ieve . One
So , Lily went to find her mom and told
have spent time together . The end . <|endoftext|> Once upon

INTERVAL -0.044 - 0.003
CONTAINS 58.763%

Daisy to the farm , mom ?" Anna asks .
barking and running around . It looked wild and scary .
M ama smiled and said , " That 's a great
, " I see something healthy and delicious !"
it with his foot . The hut fell down . He

INTERVAL -0.091 - -0.044
CONTAINS 0.074%

Tom went back to the sea with his new boat .
lesson . They would not hide behind the curtain again .
The le opard cub gul ped but he was
. She learned that accidents can happen , but it 's
had no skin , no flesh , no hair . It
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
 court+0.000
 rooms+0.000
 peel+0.000
iffs+0.000
 slide+0.000
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
 lifetime-0.001
immer-0.001
 lifelong-0.001
Token always
Token ID1464
Feature activation+3.31e-02

Pos logit contributions
 unforgettable+0.025
imm+0.024
 newfound+0.023
immer+0.023
 brand+0.022
Neg logit contributions
 court-0.017
iffs-0.016
 rooms-0.016
 seeker-0.016
 peel-0.016
Token tried
Token ID3088
Feature activation-9.22e-03

Pos logit contributions
 unforgettable+0.049
 newfound+0.046
imm+0.044
immer+0.044
 brand+0.043
Neg logit contributions
 seeker-0.044
 court-0.043
iffs-0.043
 rooms-0.041
 slams-0.041
Token to
Token ID284
Feature activation-1.57e-03

Pos logit contributions
iffs+0.011
 court+0.011
 rooms+0.010
 peel+0.010
 seeker+0.010
Neg logit contributions
 unforgettable-0.015
imm-0.014
 newfound-0.014
immer-0.014
b-0.013
Token out
Token ID503
Feature activation+1.30e-02

Pos logit contributions
 court+0.002
 peel+0.001
 seeker+0.001
iffs+0.001
 slide+0.001
Neg logit contributions
 unforgettable-0.003
imm-0.003
 newfound-0.003
 brand-0.003
 lifetime-0.003
Tokenrun
Token ID5143
Feature activation+9.67e-02

Pos logit contributions
imm+0.024
 unforgettable+0.024
immer+0.023
 newfound+0.022
 lifetime+0.022
Neg logit contributions
 rooms-0.013
 court-0.012
 tables-0.012
 peel-0.012
 slams-0.011
Token everyone
Token ID2506
Feature activation-2.54e-04

Pos logit contributions
 unforgettable+0.171
 newfound+0.161
imm+0.159
immer+0.158
 lifetime+0.156
Neg logit contributions
iffs-0.107
 court-0.095
 rooms-0.093
 Enjoy-0.092
 seeker-0.091
Token else
Token ID2073
Feature activation+3.86e-03

Pos logit contributions
 court+0.000
 rooms+0.000
 peel+0.000
iffs+0.000
 slide+0.000
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
immer-0.000
imm-0.000
 brand-0.000
Token.
Token ID13
Feature activation+5.94e-03

Pos logit contributions
 unforgettable+0.008
imm+0.008
 newfound+0.008
immer+0.008
 brand+0.007
Neg logit contributions
 court-0.003
 peel-0.003
 rooms-0.003
iffs-0.002
 tables-0.002
Token One
Token ID1881
Feature activation+1.27e-03

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
b+0.009
 thrilling+0.008
imm+0.008
Neg logit contributions
iffs-0.007
 seeker-0.007
 slams-0.007
 Gard-0.007
 court-0.006
Token day
Token ID1110
Feature activation+1.45e-02

Pos logit contributions
 unforgettable+0.002
imm+0.002
immer+0.002
 brand+0.002
 newfound+0.002
Neg logit contributions
 seeker-0.002
 court-0.002
iffs-0.002
 peel-0.002
 rooms-0.002
Token silly
Token ID14397
Feature activation-2.32e-03

Pos logit contributions
 unforgettable+0.002
imm+0.002
b+0.002
 lifetime+0.001
 newfound+0.001
Neg logit contributions
 seeker-0.001
 slams-0.001
 court-0.001
 rooms-0.001
 Gard-0.001
Token to
Token ID284
Feature activation+8.47e-05

Pos logit contributions
iffs+0.002
 seeker+0.002
 rooms+0.002
 court+0.002
 slams+0.002
Neg logit contributions
 unforgettable-0.004
 newfound-0.004
imm-0.004
b-0.004
 thrilling-0.004
Token try
Token ID1949
Feature activation-1.64e-02

Pos logit contributions
 unforgettable+0.000
imm+0.000
 newfound+0.000
b+0.000
 thrilling+0.000
Neg logit contributions
 court-0.000
iffs-0.000
 seeker-0.000
 peel-0.000
 Gard-0.000
Token and
Token ID290
Feature activation+5.56e-03

Pos logit contributions
iffs+0.019
 court+0.019
 peel+0.017
 rooms+0.017
 seeker+0.017
Neg logit contributions
 unforgettable-0.028
imm-0.027
 newfound-0.026
immer-0.025
 thrilling-0.025
Token out
Token ID503
Feature activation+2.43e-02

Pos logit contributions
imm+0.011
 unforgettable+0.010
b+0.010
 newfound+0.010
immer+0.010
Neg logit contributions
 peel-0.005
 court-0.005
iffs-0.005
 slams-0.004
 rooms-0.004
Tokenrun
Token ID5143
Feature activation+9.42e-02

Pos logit contributions
 unforgettable+0.048
imm+0.048
 newfound+0.045
b+0.044
 lifetime+0.044
Neg logit contributions
 peel-0.019
 court-0.019
 slams-0.019
 Enjoy-0.019
iffs-0.018
Token a
Token ID257
Feature activation+1.85e-02

Pos logit contributions
 unforgettable+0.166
 brand+0.161
 newfound+0.160
 lifetime+0.158
 thrilling+0.152
Neg logit contributions
iffs-0.115
 Enjoy-0.097
other-0.092
 court-0.091
atches-0.091
Token vehicle
Token ID4038
Feature activation+3.17e-02

Pos logit contributions
imm+0.035
 unforgettable+0.032
ittered+0.031
b+0.030
immer+0.029
Neg logit contributions
 court-0.021
 peel-0.021
 slide-0.019
 sink-0.019
 kitchen-0.019
Token!
Token ID0
Feature activation+1.13e-02

Pos logit contributions
 newfound+0.073
 unforgettable+0.069
 lifelong+0.068
 thrilling+0.065
 lifetime+0.065
Neg logit contributions
 peel-0.024
 court-0.023
 slide-0.022
 bar-0.020
 sink-0.019
Token Always
Token ID16622
Feature activation+2.33e-03

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
b+0.021
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.010
 seeker-0.009
 court-0.009
 Gard-0.009
 slams-0.009
Token think
Token ID892
Feature activation+4.45e-03

Pos logit contributions
 unforgettable+0.004
imm+0.004
 newfound+0.004
 thrilling+0.004
 brand+0.004
Neg logit contributions
 seeker-0.002
 peel-0.002
iffs-0.002
 court-0.002
 slams-0.002
Token and
Token ID290
Feature activation-1.13e-03

Pos logit contributions
iffs+0.006
 seeker+0.006
 slams+0.006
 court+0.006
 Gard+0.006
Neg logit contributions
 unforgettable-0.007
 newfound-0.006
b-0.006
imm-0.006
 thrilling-0.006
Token Jimmy
Token ID12963
Feature activation+3.82e-04

Pos logit contributions
 peel+0.001
 sink+0.001
 rooms+0.001
 wasting+0.001
 tables+0.001
Neg logit contributions
immer-0.002
imm-0.002
 lifetime-0.002
 unforgettable-0.002
 newfound-0.002
Token's
Token ID338
Feature activation+4.83e-03

Pos logit contributions
 unforgettable+0.001
immer+0.001
 newfound+0.001
 thrilling+0.001
imm+0.001
Neg logit contributions
 peel-0.000
 rooms-0.000
 Gard-0.000
 sink-0.000
 yog-0.000
Token face
Token ID1986
Feature activation+1.14e-02

Pos logit contributions
pr+0.011
imm+0.011
b+0.011
 unforgettable+0.010
immer+0.010
Neg logit contributions
 peel-0.003
 ear-0.003
 rooms-0.003
 kitchen-0.002
 stomach-0.002
Token sc
Token ID629
Feature activation+2.37e-02

Pos logit contributions
 unforgettable+0.018
 newfound+0.016
 lifetime+0.016
 thrilling+0.016
 lifelong+0.015
Neg logit contributions
 peel-0.016
 sink-0.013
 tables-0.013
 rooms-0.013
iffs-0.013
Tokenrun
Token ID5143
Feature activation+9.36e-02

Pos logit contributions
 unforgettable+0.035
 newfound+0.033
 thrilling+0.032
b+0.031
imm+0.031
Neg logit contributions
iffs-0.033
 court-0.029
 slams-0.029
 peel-0.029
 seeker-0.029
Tokenched
Token ID1740
Feature activation-9.89e-03

Pos logit contributions
 unforgettable+0.157
 lifetime+0.151
 newfound+0.151
 brand+0.148
 thrilling+0.143
Neg logit contributions
iffs-0.123
other-0.104
ud-0.102
 peel-0.101
 Enjoy-0.101
Token up
Token ID510
Feature activation+6.20e-03

Pos logit contributions
 peel+0.008
iffs+0.008
 court+0.008
 tables+0.008
 rooms+0.007
Neg logit contributions
 unforgettable-0.019
 newfound-0.018
 lifetime-0.018
imm-0.018
 thrilling-0.018
Token as
Token ID355
Feature activation-1.02e-02

Pos logit contributions
 unforgettable+0.013
b+0.013
imm+0.013
 lifetime+0.013
 newfound+0.013
Neg logit contributions
 court-0.004
 peel-0.004
 tables-0.003
 rooms-0.003
 slams-0.003
Token he
Token ID339
Feature activation+3.04e-04

Pos logit contributions
 peel+0.010
 court+0.010
 seeker+0.010
 tables+0.010
 rooms+0.010
Neg logit contributions
imm-0.017
 unforgettable-0.017
b-0.016
 newfound-0.016
immer-0.016
Token ate
Token ID15063
Feature activation-1.41e-02

Pos logit contributions
 unforgettable+0.001
 lifelong+0.001
 newfound+0.001
immer+0.000
 possible+0.000
Neg logit contributions
 peel-0.000
 bar-0.000
 eats-0.000
 sav-0.000
 yog-0.000
Token never
Token ID1239
Feature activation+1.37e-02

Pos logit contributions
iffs+0.022
 seeker+0.022
 court+0.021
 peel+0.020
 Gard+0.020
Neg logit contributions
 unforgettable-0.034
 newfound-0.033
imm-0.032
b-0.032
 thrilling-0.031
Token be
Token ID307
Feature activation+3.49e-03

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
imm+0.020
b+0.020
 brand+0.020
Neg logit contributions
iffs-0.017
 seeker-0.016
 court-0.016
 peel-0.015
 slams-0.015
Token able
Token ID1498
Feature activation-2.34e-02

Pos logit contributions
 unforgettable+0.006
 newfound+0.006
imm+0.006
 lifetime+0.006
 brand+0.006
Neg logit contributions
 seeker-0.004
iffs-0.003
 slams-0.003
 rooms-0.003
 court-0.003
Token to
Token ID284
Feature activation-7.23e-03

Pos logit contributions
iffs+0.023
 court+0.022
 rooms+0.021
 seeker+0.021
 peel+0.020
Neg logit contributions
imm-0.043
 unforgettable-0.043
immer-0.041
 newfound-0.040
b-0.039
Token out
Token ID503
Feature activation+7.03e-03

Pos logit contributions
iffs+0.008
 seeker+0.007
 court+0.007
 slams+0.007
 Gard+0.007
Neg logit contributions
 unforgettable-0.013
 newfound-0.012
b-0.012
imm-0.012
 thrilling-0.011
Tokenrun
Token ID5143
Feature activation+9.26e-02

Pos logit contributions
imm+0.014
 unforgettable+0.014
 newfound+0.013
b+0.013
immer+0.013
Neg logit contributions
 rooms-0.006
 court-0.006
iffs-0.005
 peel-0.005
 tables-0.005
Token the
Token ID262
Feature activation+9.04e-03

Pos logit contributions
 unforgettable+0.174
 newfound+0.167
 brand+0.166
imm+0.165
immer+0.160
Neg logit contributions
iffs-0.098
 court-0.084
 Enjoy-0.081
 seeker-0.080
 yourselves-0.079
Token lion
Token ID18744
Feature activation-1.55e-02

Pos logit contributions
 unforgettable+0.013
 newfound+0.012
 thrilling+0.012
b+0.012
 remarkable+0.011
Neg logit contributions
iffs-0.013
 slams-0.012
 seeker-0.012
 court-0.011
 Gard-0.011
Token.
Token ID13
Feature activation+5.43e-03

Pos logit contributions
 court+0.014
 peel+0.013
iffs+0.012
 rooms+0.012
 seeker+0.012
Neg logit contributions
 unforgettable-0.031
 newfound-0.030
imm-0.029
b-0.029
 thrilling-0.028
Token He
Token ID679
Feature activation-7.96e-03

Pos logit contributions
 unforgettable+0.008
 newfound+0.007
b+0.007
imm+0.007
 thrilling+0.007
Neg logit contributions
 seeker-0.008
iffs-0.007
 slams-0.007
 Gard-0.007
 peel-0.007
Token lay
Token ID3830
Feature activation-2.58e-02

Pos logit contributions
 slams+0.009
 rooms+0.009
 seeker+0.009
 peel+0.009
 Gard+0.008
Neg logit contributions
 unforgettable-0.013
 brand-0.013
immer-0.013
 newfound-0.013
imm-0.012
Token could
Token ID714
Feature activation-1.69e-03

Pos logit contributions
 unforgettable+0.010
 lifelong+0.009
immer+0.009
 coll+0.009
 newfound+0.009
Neg logit contributions
 peel-0.004
 seeker-0.004
 Ben-0.004
 But-0.004
 Raz-0.004
Token,
Token ID11
Feature activation+1.16e-03

Pos logit contributions
 court+0.001
 peel+0.001
 seeker+0.001
iffs+0.001
 slide+0.001
Neg logit contributions
 unforgettable-0.004
imm-0.003
 lifetime-0.003
 newfound-0.003
 brand-0.003
Token trying
Token ID2111
Feature activation-1.37e-02

Pos logit contributions
imm+0.002
 unforgettable+0.002
immer+0.002
 lifelong+0.002
ittered+0.002
Neg logit contributions
 court-0.001
 peel-0.001
 tables-0.001
 seeker-0.001
 rooms-0.001
Token to
Token ID284
Feature activation-5.89e-03

Pos logit contributions
 court+0.012
 peel+0.012
iffs+0.011
 rooms+0.011
 slams+0.010
Neg logit contributions
 unforgettable-0.026
imm-0.026
immer-0.024
 lifetime-0.024
 newfound-0.024
Token out
Token ID503
Feature activation+6.93e-03

Pos logit contributions
 peel+0.005
 court+0.005
 seeker+0.005
 slide+0.004
 slams+0.004
Neg logit contributions
imm-0.012
 unforgettable-0.011
 lifetime-0.011
 newfound-0.011
 brand-0.011
Tokenrun
Token ID5143
Feature activation+8.80e-02

Pos logit contributions
imm+0.013
 unforgettable+0.013
 lifetime+0.012
immer+0.012
 newfound+0.012
Neg logit contributions
 rooms-0.006
 slams-0.005
 tables-0.005
 court-0.005
 peel-0.005
Token the
Token ID262
Feature activation+6.09e-03

Pos logit contributions
 unforgettable+0.153
 brand+0.145
 newfound+0.144
 lifetime+0.140
imm+0.137
Neg logit contributions
iffs-0.105
 seeker-0.089
 court-0.087
 peel-0.087
 Enjoy-0.085
Token backwards
Token ID16196
Feature activation+2.34e-02

Pos logit contributions
imm+0.012
immer+0.011
ittered+0.011
 unforgettable+0.011
 newfound+0.010
Neg logit contributions
 seeker-0.006
 rooms-0.006
 peel-0.006
 tables-0.006
 court-0.006
Token me
Token ID502
Feature activation+1.10e-03

Pos logit contributions
 unforgettable+0.047
 newfound+0.047
 lifelong+0.045
imm+0.044
immer+0.043
Neg logit contributions
 court-0.020
 peel-0.020
 rooms-0.020
 tables-0.020
 slide-0.019
Tokenadow
Token ID4584
Feature activation+5.16e-03

Pos logit contributions
 unforgettable+0.002
 newfound+0.002
 lifelong+0.002
 lifetime+0.002
imm+0.002
Neg logit contributions
 court-0.001
 seeker-0.001
 peel-0.001
iffs-0.001
 rooms-0.001
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.011
 lifelong+0.010
imm+0.010
b+0.010
Neg logit contributions
 court-0.004
 rooms-0.003
 peel-0.003
 seeker-0.003
 tables-0.003
Token went
Token ID1816
Feature activation-9.43e-03

Pos logit contributions
 peel+0.020
 court+0.018
 sink+0.018
 Ben+0.017
 gard+0.017
Neg logit contributions
 unforgettable-0.032
 thrilling-0.030
 brand-0.030
 newfound-0.029
imm-0.029
Token home
Token ID1363
Feature activation-2.15e-02

Pos logit contributions
iffs+0.012
 seeker+0.012
 Gard+0.012
 slams+0.012
 court+0.011
Neg logit contributions
 unforgettable-0.014
 newfound-0.013
b-0.013
imm-0.013
 thrilling-0.013
Token and
Token ID290
Feature activation+2.56e-03

Pos logit contributions
 court+0.019
iffs+0.018
 rooms+0.018
 peel+0.018
 seeker+0.017
Neg logit contributions
 unforgettable-0.041
imm-0.039
 newfound-0.038
b-0.038
 thrilling-0.038
Token enjoyed
Token ID8359
Feature activation+1.03e-02

Pos logit contributions
imm+0.004
 unforgettable+0.004
immer+0.004
 thrilling+0.004
 newfound+0.004
Neg logit contributions
 court-0.003
 rooms-0.003
 peel-0.003
 slams-0.003
 sink-0.003
Token his
Token ID465
Feature activation-1.36e-02

Pos logit contributions
imm+0.020
 unforgettable+0.019
immer+0.019
b+0.018
chie+0.018
Neg logit contributions
 court-0.009
 rooms-0.008
 peel-0.008
 Enjoy-0.008
iffs-0.008
Token freshly
Token ID29026
Feature activation+8.80e-02

Pos logit contributions
 slide+0.013
 peel+0.013
 court+0.013
 sink+0.012
 kitchen+0.012
Neg logit contributions
imm-0.025
immer-0.024
bound-0.023
b-0.023
 beating-0.023
Token caught
Token ID4978
Feature activation+2.27e-02

Pos logit contributions
 unforgettable+0.140
 newfound+0.126
 thrilling+0.125
immer+0.124
 brand+0.122
Neg logit contributions
iffs-0.112
 peel-0.109
 Enjoy-0.101
 court-0.097
 seeker-0.094
Token fish
Token ID5916
Feature activation-2.06e-03

Pos logit contributions
 unforgettable+0.042
 lifetime+0.042
imm+0.041
 newfound+0.040
chie+0.039
Neg logit contributions
 peel-0.020
 rooms-0.019
 tables-0.019
 chefs-0.019
 court-0.018
Token.
Token ID13
Feature activation+7.23e-03

Pos logit contributions
 court+0.002
 peel+0.002
iffs+0.002
 seeker+0.002
 rooms+0.002
Neg logit contributions
 unforgettable-0.004
 happened-0.003
imm-0.003
 newfound-0.003
 thrilling-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.40e-02

Pos logit contributions
 unforgettable+0.011
 newfound+0.011
 thrilling+0.010
 brand+0.010
 lifetime+0.010
Neg logit contributions
 seeker-0.009
iffs-0.009
 slams-0.009
 Gard-0.008
 peel-0.008
TokenOnce
Token ID7454
Feature activation-9.83e-03

Pos logit contributions
iffs+0.016
 seeker+0.015
 court+0.015
 slams+0.015
 Gard+0.015
Neg logit contributions
 unforgettable-0.023
 newfound-0.022
b-0.021
imm-0.021
 thrilling-0.021
Token to
Token ID284
Feature activation+1.35e-02

Pos logit contributions
imm+0.059
immer+0.055
b+0.053
 unforgettable+0.053
ittered+0.051
Neg logit contributions
 court-0.018
 rooms-0.014
iffs-0.014
 peel-0.013
 seeker-0.013
Token play
Token ID711
Feature activation+2.42e-02

Pos logit contributions
imm+0.025
b+0.025
 unforgettable+0.025
immer+0.024
 thrilling+0.023
Neg logit contributions
 court-0.014
 rooms-0.012
 Enjoy-0.012
iffs-0.012
 peel-0.012
Token and
Token ID290
Feature activation+1.64e-02

Pos logit contributions
 unforgettable+0.052
b+0.052
 newfound+0.050
 lifetime+0.050
imm+0.049
Neg logit contributions
 court-0.017
 rooms-0.014
 seeker-0.014
 peel-0.013
 tables-0.012
Token explore
Token ID7301
Feature activation+1.71e-02

Pos logit contributions
imm+0.030
 unforgettable+0.029
b+0.028
immer+0.028
 newfound+0.028
Neg logit contributions
 court-0.016
iffs-0.015
 rooms-0.015
 seeker-0.015
 peel-0.015
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
imm+0.033
 unforgettable+0.032
b+0.031
immer+0.031
 happened+0.030
Neg logit contributions
 court-0.014
 rooms-0.014
 tables-0.013
iffs-0.013
 peel-0.011
Token freshly
Token ID29026
Feature activation+8.46e-02

Pos logit contributions
 unforgettable+0.034
imm+0.033
b+0.033
 newfound+0.032
immer+0.031
Neg logit contributions
iffs-0.018
 rooms-0.018
 seeker-0.018
 court-0.018
 Gard-0.017
Token organized
Token ID8389
Feature activation+3.70e-03

Pos logit contributions
 unforgettable+0.130
 newfound+0.118
 thrilling+0.115
immer+0.113
 lifelong+0.110
Neg logit contributions
iffs-0.115
 rooms-0.105
 court-0.103
 peel-0.102
 seeker-0.102
Token lawn
Token ID21133
Feature activation+1.19e-02

Pos logit contributions
 unforgettable+0.008
b+0.008
imm+0.008
 newfound+0.008
immer+0.008
Neg logit contributions
 rooms-0.002
 court-0.002
 seeker-0.002
iffs-0.002
 tables-0.002
Token.
Token ID13
Feature activation+8.68e-03

Pos logit contributions
 newfound+0.024
 unforgettable+0.024
imm+0.023
b+0.023
 thrilling+0.023
Neg logit contributions
 court-0.009
 rooms-0.008
 peel-0.008
iffs-0.008
 seeker-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.56e-02

Pos logit contributions
 unforgettable+0.013
 newfound+0.013
 lifelong+0.012
b+0.012
 thrilling+0.011
Neg logit contributions
 seeker-0.011
 slams-0.011
iffs-0.011
 peel-0.010
 rooms-0.010
TokenOnce
Token ID7454
Feature activation-1.10e-02

Pos logit contributions
iffs+0.016
 seeker+0.014
 court+0.014
 rooms+0.013
 Gard+0.013
Neg logit contributions
 unforgettable-0.030
 newfound-0.028
b-0.027
imm-0.027
 thrilling-0.027
Tokenble
Token ID903
Feature activation-1.01e-02

Pos logit contributions
 unforgettable+0.043
 newfound+0.040
 thrilling+0.038
b+0.038
imm+0.038
Neg logit contributions
iffs-0.041
 seeker-0.039
 court-0.037
 Gard-0.037
 peel-0.037
Token,
Token ID11
Feature activation+6.80e-03

Pos logit contributions
 peel+0.008
 court+0.008
 seeker+0.008
iffs+0.007
 rooms+0.007
Neg logit contributions
 unforgettable-0.021
 newfound-0.020
imm-0.019
 lifelong-0.019
 thrilling-0.019
Token and
Token ID290
Feature activation+8.30e-03

Pos logit contributions
 unforgettable+0.012
 newfound+0.011
imm+0.011
b+0.011
 lifetime+0.011
Neg logit contributions
iffs-0.007
 seeker-0.007
 peel-0.007
 slams-0.007
 court-0.007
Token it
Token ID340
Feature activation+2.07e-02

Pos logit contributions
immer+0.016
imm+0.016
 unforgettable+0.015
 lifetime+0.014
 lifelong+0.014
Neg logit contributions
 rooms-0.007
 ear-0.007
 peel-0.007
 seeker-0.007
 yourselves-0.007
Token sh
Token ID427
Feature activation+2.09e-02

Pos logit contributions
 unforgettable+0.023
 newfound+0.021
imm+0.020
 thrilling+0.020
b+0.020
Neg logit contributions
iffs-0.035
 seeker-0.033
 rooms-0.033
 slams-0.033
 Gard-0.033
Tokenimm
Token ID8608
Feature activation+8.24e-02

Pos logit contributions
 unforgettable+0.013
imm+0.012
b+0.012
 newfound+0.011
 thrilling+0.010
Neg logit contributions
iffs-0.046
 seeker-0.044
 court-0.044
 Gard-0.044
 rooms-0.044
Tokenered
Token ID1068
Feature activation+1.01e-02

Pos logit contributions
 unforgettable+0.115
 lifetime+0.110
 newfound+0.106
 thrilling+0.103
 lifelong+0.103
Neg logit contributions
iffs-0.119
 rooms-0.107
 slams-0.106
atters-0.106
 yourselves-0.104
Token in
Token ID287
Feature activation-2.52e-03

Pos logit contributions
 unforgettable+0.020
 newfound+0.019
imm+0.018
 lifelong+0.018
 lifetime+0.018
Neg logit contributions
iffs-0.008
 rooms-0.008
 seeker-0.008
 tables-0.008
 peel-0.007
Token the
Token ID262
Feature activation+6.62e-03

Pos logit contributions
iffs+0.003
 seeker+0.003
 court+0.003
 rooms+0.003
 Gard+0.003
Neg logit contributions
 unforgettable-0.004
 newfound-0.004
b-0.004
imm-0.004
 thrilling-0.003
Token sunshine
Token ID34488
Feature activation+1.26e-02

Pos logit contributions
 unforgettable+0.010
 newfound+0.010
b+0.010
 thrilling+0.009
 brand+0.009
Neg logit contributions
iffs-0.008
 slams-0.008
 seeker-0.008
 Gard-0.008
 court-0.007
Token.
Token ID13
Feature activation+6.26e-03

Pos logit contributions
 unforgettable+0.024
 newfound+0.023
 thrilling+0.022
b+0.022
imm+0.022
Neg logit contributions
iffs-0.011
 court-0.011
 seeker-0.010
 peel-0.010
 rooms-0.010
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
immer+0.039
 unforgettable+0.039
imm+0.038
 newfound+0.038
 thrilling+0.038
Neg logit contributions
 court-0.019
 Ben-0.019
 peel-0.018
."-0.018
 eats-0.018
Token really
Token ID1107
Feature activation-4.96e-04

Pos logit contributions
imm+0.019
b+0.018
immer+0.018
 powers+0.018
 awaited+0.017
Neg logit contributions
 seeker-0.009
 ear-0.009
 Enjoy-0.008
 Sara-0.008
 kitchen-0.008
Token excited
Token ID6568
Feature activation-3.38e-03

Pos logit contributions
 Enjoy+0.000
 seeker+0.000
 court+0.000
 gard+0.000
 slams+0.000
Neg logit contributions
imm-0.001
 unforgettable-0.001
immer-0.001
 stre-0.001
 newfound-0.001
Token about
Token ID546
Feature activation+2.50e-02

Pos logit contributions
 court+0.003
iffs+0.003
 rooms+0.003
 peel+0.003
 seeker+0.003
Neg logit contributions
 unforgettable-0.006
 newfound-0.006
b-0.006
imm-0.006
 lifelong-0.006
Token his
Token ID465
Feature activation-2.08e-02

Pos logit contributions
imm+0.046
 unforgettable+0.043
immer+0.043
b+0.043
 newfound+0.040
Neg logit contributions
 court-0.027
 rooms-0.025
 Enjoy-0.022
 peel-0.022
 tables-0.022
Token upcoming
Token ID7865
Feature activation+8.18e-02

Pos logit contributions
 court+0.024
 rooms+0.023
 seeker+0.021
 kitchen+0.020
 church+0.020
Neg logit contributions
imm-0.044
immer-0.039
lic-0.037
ittered-0.037
pr-0.037
Token adventure
Token ID8855
Feature activation+3.15e-02

Pos logit contributions
imm+0.144
b+0.136
immer+0.135
ittered+0.131
 unforgettable+0.127
Neg logit contributions
 court-0.102
 rooms-0.089
 seeker-0.087
 peel-0.085
 church-0.083
Token.
Token ID13
Feature activation+2.31e-03

Pos logit contributions
 newfound+0.072
 unforgettable+0.070
imm+0.069
immer+0.069
 lifelong+0.068
Neg logit contributions
 court-0.019
 rooms-0.013
 tables-0.011
 seeker-0.011
 peel-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.41e-02

Pos logit contributions
 unforgettable+0.004
 newfound+0.004
 thrilling+0.004
 brand+0.004
 lifelong+0.004
Neg logit contributions
 seeker-0.002
iffs-0.002
 slams-0.002
 rooms-0.002
 peel-0.002
TokenOnce
Token ID7454
Feature activation-9.92e-03

Pos logit contributions
iffs+0.016
 seeker+0.015
 court+0.014
 rooms+0.014
 Gard+0.014
Neg logit contributions
 unforgettable-0.024
 newfound-0.023
b-0.022
imm-0.022
 thrilling-0.022
Token upon
Token ID2402
Feature activation-3.07e-04

Pos logit contributions
iffs+0.014
 seeker+0.013
 court+0.013
 rooms+0.013
 slams+0.012
Neg logit contributions
 unforgettable-0.015
imm-0.013
 newfound-0.013
b-0.013
 lifetime-0.013
Token.
Token ID13
Feature activation+1.98e-03

Pos logit contributions
iffs+0.007
 court+0.007
 seeker+0.007
 peel+0.007
 rooms+0.006
Neg logit contributions
 unforgettable-0.016
imm-0.014
b-0.014
 newfound-0.014
 brand-0.014
Token He
Token ID679
Feature activation-1.38e-02

Pos logit contributions
 unforgettable+0.004
 newfound+0.004
 thrilling+0.004
 brand+0.004
imm+0.004
Neg logit contributions
iffs-0.002
 seeker-0.002
 Gard-0.002
 slams-0.001
 peel-0.001
Token tried
Token ID3088
Feature activation-3.70e-02

Pos logit contributions
 rooms+0.014
 slams+0.014
 tables+0.013
 seeker+0.013
 slides+0.013
Neg logit contributions
immer-0.024
imm-0.023
 unforgettable-0.023
 lifelong-0.021
pr-0.021
Token to
Token ID284
Feature activation-2.17e-02

Pos logit contributions
iffs+0.052
 seeker+0.050
 court+0.048
 Gard+0.048
 slams+0.048
Neg logit contributions
 unforgettable-0.052
 newfound-0.049
b-0.048
 thrilling-0.047
imm-0.046
Token out
Token ID503
Feature activation-5.43e-03

Pos logit contributions
iffs+0.022
 seeker+0.021
 court+0.020
 slams+0.020
 Gard+0.019
Neg logit contributions
 unforgettable-0.039
 newfound-0.037
b-0.037
imm-0.037
 thrilling-0.036
Tokenrun
Token ID5143
Feature activation+8.13e-02

Pos logit contributions
 rooms+0.005
 tables+0.004
 slams+0.004
 court+0.004
 peel+0.004
Neg logit contributions
imm-0.010
 unforgettable-0.010
 lifetime-0.010
b-0.010
immer-0.010
Token the
Token ID262
Feature activation-7.02e-04

Pos logit contributions
 unforgettable+0.136
 newfound+0.128
 brand+0.127
 lifetime+0.125
imm+0.123
Neg logit contributions
iffs-0.101
 seeker-0.086
 court-0.083
 Enjoy-0.083
 peel-0.082
Token train
Token ID4512
Feature activation+3.77e-03

Pos logit contributions
 seeker+0.001
iffs+0.001
 rooms+0.001
 court+0.001
 Gard+0.001
Neg logit contributions
 unforgettable-0.001
imm-0.001
 newfound-0.001
b-0.001
 thrilling-0.001
Token but
Token ID475
Feature activation-1.64e-02

Pos logit contributions
 unforgettable+0.008
 newfound+0.008
 lifelong+0.008
imm+0.008
b+0.008
Neg logit contributions
 court-0.003
 peel-0.002
 tables-0.002
 slams-0.002
 rooms-0.002
Token he
Token ID339
Feature activation-5.75e-03

Pos logit contributions
 seeker+0.017
 court+0.016
iffs+0.015
 slams+0.015
 rooms+0.015
Neg logit contributions
imm-0.030
immer-0.028
 unforgettable-0.028
 coll-0.027
 lifelong-0.027
Token was
Token ID373
Feature activation-1.29e-02

Pos logit contributions
 seeker+0.008
 peel+0.007
 court+0.007
 slams+0.007
 rooms+0.007
Neg logit contributions
 unforgettable-0.009
immer-0.008
 lifelong-0.008
 newfound-0.008
 coll-0.008
Token the
Token ID262
Feature activation-5.70e-04

Pos logit contributions
 unforgettable+0.035
imm+0.035
b+0.034
 newfound+0.033
immer+0.033
Neg logit contributions
 court-0.023
 seeker-0.022
iffs-0.022
 peel-0.021
 rooms-0.021
Token road
Token ID2975
Feature activation-2.72e-02

Pos logit contributions
iffs+0.001
 slams+0.001
 seeker+0.001
 Gard+0.001
atters+0.001
Neg logit contributions
 unforgettable-0.001
b-0.001
 newfound-0.001
 thrilling-0.001
 brand-0.001
Token and
Token ID290
Feature activation+4.99e-03

Pos logit contributions
iffs+0.032
 seeker+0.030
 Gard+0.029
 slams+0.029
 court+0.029
Neg logit contributions
 unforgettable-0.045
 newfound-0.042
b-0.042
imm-0.041
 thrilling-0.040
Token it
Token ID340
Feature activation+1.89e-02

Pos logit contributions
 unforgettable+0.007
 newfound+0.007
imm+0.007
immer+0.007
 lifelong+0.007
Neg logit contributions
iffs-0.007
 seeker-0.007
 rooms-0.006
 court-0.006
 slams-0.006
Token gl
Token ID1278
Feature activation+2.57e-02

Pos logit contributions
 unforgettable+0.027
 newfound+0.026
immer+0.025
 lifelong+0.024
 lifetime+0.023
Neg logit contributions
 rooms-0.026
 peel-0.025
 slams-0.025
iffs-0.025
 seeker-0.025
Tokenimm
Token ID8608
Feature activation+8.10e-02

Pos logit contributions
 unforgettable+0.026
 newfound+0.025
 thrilling+0.022
 lifelong+0.021
 lifetime+0.021
Neg logit contributions
iffs-0.048
 seeker-0.044
atters-0.043
 slams-0.043
 rooms-0.042
Tokenered
Token ID1068
Feature activation+1.04e-02

Pos logit contributions
 unforgettable+0.106
 newfound+0.098
 lifetime+0.096
 thrilling+0.093
 lifelong+0.092
Neg logit contributions
iffs-0.124
 rooms-0.115
 slams-0.113
 seeker-0.109
 yourselves-0.109
Token in
Token ID287
Feature activation-3.48e-03

Pos logit contributions
 unforgettable+0.019
 newfound+0.018
imm+0.017
b+0.017
 thrilling+0.017
Neg logit contributions
iffs-0.010
 seeker-0.010
 rooms-0.010
 court-0.009
 peel-0.009
Token the
Token ID262
Feature activation+4.35e-03

Pos logit contributions
iffs+0.004
 rooms+0.004
 seeker+0.004
 court+0.004
 peel+0.004
Neg logit contributions
 unforgettable-0.006
imm-0.005
 newfound-0.005
b-0.005
immer-0.005
Token bright
Token ID6016
Feature activation-8.69e-03

Pos logit contributions
 unforgettable+0.007
 newfound+0.007
b+0.007
 thrilling+0.007
imm+0.007
Neg logit contributions
iffs-0.005
 seeker-0.005
 slams-0.005
 court-0.005
 Gard-0.005
Token sunshine
Token ID34488
Feature activation+1.04e-02

Pos logit contributions
iffs+0.011
 seeker+0.010
 court+0.010
 rooms+0.010
 slams+0.010
Neg logit contributions
 unforgettable-0.013
 newfound-0.013
b-0.012
imm-0.012
 thrilling-0.012
Tokenled
Token ID992
Feature activation+1.49e-03

Pos logit contributions
 unforgettable+0.029
 lifetime+0.028
 thrilling+0.027
 newfound+0.027
 lifelong+0.026
Neg logit contributions
der-0.010
 rooms-0.008
 yog-0.008
 peel-0.008
 all-0.008
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
 unforgettable+0.003
 lifetime+0.003
b+0.003
imm+0.003
str+0.003
Neg logit contributions
 seeker-0.001
 rooms-0.001
 tables-0.001
 gard-0.001
 peel-0.001
Token her
Token ID607
Feature activation+1.27e-02

Pos logit contributions
immer+0.027
 unforgettable+0.025
imm+0.025
chie+0.024
 newfound+0.023
Neg logit contributions
 rooms-0.020
 seeker-0.020
 tables-0.019
 ear-0.019
 slams-0.019
Token dress
Token ID6576
Feature activation+3.58e-02

Pos logit contributions
 unforgettable+0.025
immer+0.024
imm+0.023
pr+0.023
b+0.023
Neg logit contributions
 rooms-0.012
 seeker-0.011
 ear-0.010
 tables-0.010
 peel-0.009
Token gl
Token ID1278
Feature activation+3.51e-02

Pos logit contributions
 unforgettable+0.062
 newfound+0.057
immer+0.057
 lifelong+0.056
 thrilling+0.056
Neg logit contributions
 rooms-0.037
 floor-0.034
 tables-0.033
 slide-0.033
 peel-0.033
Tokenimm
Token ID8608
Feature activation+8.08e-02

Pos logit contributions
 unforgettable+0.037
 newfound+0.036
 happened+0.033
 thrilling+0.032
 lifetime+0.032
Neg logit contributions
iffs-0.060
atters-0.056
ast-0.052
arl-0.051
 seeker-0.051
Tokenered
Token ID1068
Feature activation+4.26e-03

Pos logit contributions
 unforgettable+0.114
 lifetime+0.112
 newfound+0.103
 thrilling+0.102
 lifelong+0.099
Neg logit contributions
iffs-0.111
 rooms-0.104
 yourselves-0.100
 slams-0.099
atters-0.094
Token in
Token ID287
Feature activation-7.22e-03

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
 lifetime+0.008
b+0.008
imm+0.008
Neg logit contributions
 rooms-0.002
 tables-0.002
 peel-0.002
 all-0.002
 seeker-0.002
Token the
Token ID262
Feature activation+1.53e-03

Pos logit contributions
 rooms+0.006
 tables+0.006
 court+0.006
 seeker+0.006
 peel+0.006
Neg logit contributions
imm-0.014
 unforgettable-0.014
b-0.013
 lifetime-0.013
 newfound-0.012
Token sun
Token ID4252
Feature activation+1.45e-02

Pos logit contributions
 unforgettable+0.003
imm+0.003
 newfound+0.003
b+0.003
 thrilling+0.003
Neg logit contributions
 seeker-0.001
iffs-0.001
 rooms-0.001
 court-0.001
 slams-0.001
Token.
Token ID13
Feature activation+5.07e-03

Pos logit contributions
 unforgettable+0.032
 newfound+0.030
 thrilling+0.030
 remarkable+0.029
b+0.028
Neg logit contributions
 peel-0.009
iffs-0.008
 seeker-0.008
 court-0.008
 tables-0.008
Token but
Token ID475
Feature activation-1.04e-02

Pos logit contributions
iffs+0.001
 court+0.001
 peel+0.001
 slams+0.001
 seeker+0.001
Neg logit contributions
 unforgettable-0.002
imm-0.002
 newfound-0.002
immer-0.002
 lifelong-0.002
Token she
Token ID673
Feature activation+5.69e-03

Pos logit contributions
 seeker+0.012
 court+0.011
 slams+0.011
 tables+0.010
 peel+0.010
Neg logit contributions
imm-0.019
 unforgettable-0.017
immer-0.017
b-0.016
 brand-0.016
Token couldn
Token ID3521
Feature activation+1.29e-02

Pos logit contributions
 unforgettable+0.009
 lifelong+0.009
immer+0.008
 newfound+0.008
imm+0.008
Neg logit contributions
 seeker-0.007
 peel-0.007
 slams-0.007
 rooms-0.006
 court-0.006
Token't
Token ID470
Feature activation+1.88e-03

Pos logit contributions
 unforgettable+0.025
 newfound+0.024
b+0.023
imm+0.023
 thrilling+0.023
Neg logit contributions
iffs-0.012
 seeker-0.011
 court-0.011
 slams-0.010
 peel-0.010
Token out
Token ID503
Feature activation-1.06e-02

Pos logit contributions
 unforgettable+0.003
 newfound+0.003
b+0.003
 thrilling+0.003
imm+0.003
Neg logit contributions
iffs-0.002
 seeker-0.002
 slams-0.002
 rooms-0.002
 Gard-0.002
Tokenrun
Token ID5143
Feature activation+8.06e-02

Pos logit contributions
 rooms+0.007
 court+0.006
 slams+0.006
 peel+0.006
 tables+0.005
Neg logit contributions
 unforgettable-0.024
imm-0.023
b-0.022
 lifetime-0.022
 newfound-0.022
Token the
Token ID262
Feature activation-1.46e-03

Pos logit contributions
 brand+0.151
 unforgettable+0.151
 lifetime+0.146
 newfound+0.145
 spectacular+0.137
Neg logit contributions
iffs-0.087
 Enjoy-0.071
other-0.069
 court-0.067
 seeker-0.067
Token man
Token ID582
Feature activation-1.13e-02

Pos logit contributions
 seeker+0.002
 court+0.002
 peel+0.001
 rooms+0.001
 tables+0.001
Neg logit contributions
imm-0.003
 unforgettable-0.003
b-0.002
immer-0.002
ittered-0.002
Token.
Token ID13
Feature activation+3.85e-03

Pos logit contributions
 court+0.009
 peel+0.009
 slide+0.007
 slams+0.006
 eat+0.006
Neg logit contributions
 unforgettable-0.025
imm-0.024
 newfound-0.024
 thrilling-0.023
 lifelong-0.022
Token He
Token ID679
Feature activation-9.93e-03

Pos logit contributions
 unforgettable+0.008
 newfound+0.008
b+0.008
imm+0.008
 thrilling+0.008
Neg logit contributions
iffs-0.003
 seeker-0.002
 court-0.002
 slams-0.002
 Gard-0.002
Token quickly
Token ID2952
Feature activation+2.01e-03

Pos logit contributions
 slams+0.011
 bar+0.010
 peel+0.010
 rooms+0.009
 ear+0.009
Neg logit contributions
immer-0.017
imm-0.017
 unforgettable-0.017
 lifelong-0.017
 newfound-0.016
Token her
Token ID607
Feature activation-3.27e-02

Pos logit contributions
imm+0.008
b+0.008
chie+0.008
 unforgettable+0.008
ch+0.008
Neg logit contributions
 rooms-0.005
 court-0.004
 tables-0.004
 slams-0.004
 seeker-0.004
Token goal
Token ID3061
Feature activation+1.05e-02

Pos logit contributions
 court+0.038
 seeker+0.038
 slams+0.037
 rooms+0.036
 peel+0.035
Neg logit contributions
imm-0.053
 unforgettable-0.051
b-0.051
immer-0.049
ittered-0.047
Token!
Token ID0
Feature activation+1.48e-02

Pos logit contributions
 unforgettable+0.020
imm+0.019
b+0.019
 newfound+0.019
 lifelong+0.018
Neg logit contributions
 court-0.011
 peel-0.009
 rooms-0.009
 seeker-0.008
iffs-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.70e-02

Pos logit contributions
 unforgettable+0.020
 newfound+0.020
 lifelong+0.018
imm+0.018
 thrilling+0.018
Neg logit contributions
 seeker-0.021
 slams-0.020
 rooms-0.020
iffs-0.020
 peel-0.019
TokenJ
Token ID41
Feature activation-2.45e-02

Pos logit contributions
iffs+0.018
 seeker+0.016
 court+0.016
 rooms+0.015
 Gard+0.015
Neg logit contributions
 unforgettable-0.032
 newfound-0.030
b-0.029
imm-0.029
 thrilling-0.028
Tokenimm
Token ID8608
Feature activation+8.01e-02

Pos logit contributions
iffs+0.027
 court+0.024
 seeker+0.024
 peel+0.023
 Gard+0.023
Neg logit contributions
 unforgettable-0.044
 newfound-0.042
b-0.041
 thrilling-0.040
 brand-0.039
Tokenie
Token ID494
Feature activation+5.81e-03

Pos logit contributions
 unforgettable+0.111
 lifetime+0.103
 thrilling+0.099
 newfound+0.098
 beating+0.096
Neg logit contributions
iffs-0.116
 rooms-0.113
 slams-0.113
 yog-0.105
atches-0.105
Token was
Token ID373
Feature activation+1.69e-02

Pos logit contributions
immer+0.010
 unforgettable+0.010
 newfound+0.009
imm+0.009
 lifelong+0.009
Neg logit contributions
 rooms-0.007
 court-0.007
 tables-0.007
iffs-0.006
 slams-0.006
Token scared
Token ID12008
Feature activation-3.41e-03

Pos logit contributions
 unforgettable+0.029
b+0.029
 lifetime+0.028
imm+0.028
chie+0.028
Neg logit contributions
 seeker-0.017
 kitchen-0.016
 rooms-0.016
 gard-0.015
 court-0.015
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
 court+0.002
 rooms+0.002
 seeker+0.002
 peel+0.002
 tables+0.002
Neg logit contributions
b-0.007
 unforgettable-0.007
 newfound-0.007
imm-0.007
 lifelong-0.006
Token He
Token ID679
Feature activation-2.13e-02

Pos logit contributions
iffs+0.001
 seeker+0.001
 slams+0.001
 court+0.001
 Gard+0.001
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
b-0.003
imm-0.003
 thrilling-0.003
Token covered
Token ID5017
Feature activation-8.50e-03

Pos logit contributions
 court+0.045
 peel+0.043
 tables+0.042
 sink+0.041
 rooms+0.040
Neg logit contributions
 unforgettable-0.073
 newfound-0.066
 lifelong-0.065
immer-0.065
 thrilling-0.063
Token the
Token ID262
Feature activation-9.15e-03

Pos logit contributions
 rooms+0.008
 tables+0.008
iffs+0.007
 yourselves+0.007
 court+0.007
Neg logit contributions
 unforgettable-0.016
 lifetime-0.016
 newfound-0.015
 thrilling-0.015
b-0.015
Token ground
Token ID2323
Feature activation-2.31e-02

Pos logit contributions
iffs+0.011
 slams+0.011
 seeker+0.010
 Gard+0.010
 court+0.010
Neg logit contributions
 unforgettable-0.014
 newfound-0.014
b-0.014
 brand-0.013
 thrilling-0.013
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
iffs+0.018
 peel+0.018
 rooms+0.017
 slams+0.017
 court+0.017
Neg logit contributions
 unforgettable-0.046
 newfound-0.044
 lifetime-0.042
b-0.042
imm-0.042
Token gl
Token ID1278
Feature activation+2.39e-02

Pos logit contributions
immer+0.025
 unforgettable+0.025
imm+0.025
 newfound+0.023
pr+0.023
Neg logit contributions
 rooms-0.018
 tables-0.018
 court-0.017
 seeker-0.016
 peel-0.016
Tokenimm
Token ID8608
Feature activation+7.99e-02

Pos logit contributions
 unforgettable+0.031
 newfound+0.030
 thrilling+0.028
 lifetime+0.027
 lifelong+0.027
Neg logit contributions
iffs-0.037
atters-0.034
 seeker-0.031
 peel-0.031
 slams-0.031
Tokenered
Token ID1068
Feature activation+8.83e-03

Pos logit contributions
 unforgettable+0.109
 lifetime+0.103
 newfound+0.100
 thrilling+0.097
 lifelong+0.095
Neg logit contributions
iffs-0.115
 rooms-0.110
 yourselves-0.103
 slams-0.102
atters-0.102
Token in
Token ID287
Feature activation-5.35e-03

Pos logit contributions
 unforgettable+0.018
 newfound+0.017
 lifetime+0.017
b+0.016
imm+0.016
Neg logit contributions
 rooms-0.007
 tables-0.006
iffs-0.005
 peel-0.005
 court-0.005
Token the
Token ID262
Feature activation+2.35e-03

Pos logit contributions
 rooms+0.005
 court+0.005
 tables+0.005
 peel+0.005
 seeker+0.005
Neg logit contributions
 unforgettable-0.010
imm-0.010
b-0.010
 newfound-0.009
 lifetime-0.009
Token sunlight
Token ID19606
Feature activation+1.99e-03

Pos logit contributions
 unforgettable+0.004
 newfound+0.004
b+0.003
 thrilling+0.003
 brand+0.003
Neg logit contributions
iffs-0.003
 seeker-0.003
 slams-0.003
 court-0.003
 Gard-0.003
Token.
Token ID13
Feature activation+4.57e-03

Pos logit contributions
 unforgettable+0.004
 newfound+0.004
b+0.004
 thrilling+0.004
imm+0.004
Neg logit contributions
iffs-0.002
 peel-0.002
 court-0.002
 seeker-0.002
 rooms-0.002
Token bright
Token ID6016
Feature activation+5.79e-03

Pos logit contributions
 unforgettable+0.028
imm+0.027
 newfound+0.026
b+0.026
immer+0.026
Neg logit contributions
 rooms-0.018
 court-0.017
 tables-0.017
 peel-0.016
 kitchen-0.016
Token colors
Token ID7577
Feature activation+2.34e-02

Pos logit contributions
 unforgettable+0.009
 newfound+0.008
 lifetime+0.008
b+0.008
 thrilling+0.008
Neg logit contributions
 rooms-0.008
 tables-0.007
 peel-0.007
 court-0.007
iffs-0.007
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
 unforgettable+0.049
 newfound+0.048
 lifetime+0.046
 lifelong+0.044
b+0.044
Neg logit contributions
 peel-0.016
 court-0.014
iffs-0.014
 tables-0.013
 slide-0.013
Token painting
Token ID12036
Feature activation+4.70e-02

Pos logit contributions
immer+0.025
imm+0.024
 unforgettable+0.024
 newfound+0.022
 lifelong+0.022
Neg logit contributions
 tables-0.019
 rooms-0.019
 court-0.017
 peel-0.016
 kitchen-0.016
Token her
Token ID607
Feature activation+4.64e-03

Pos logit contributions
 unforgettable+0.099
 newfound+0.097
 lifetime+0.095
immer+0.093
b+0.092
Neg logit contributions
 tables-0.034
 rooms-0.032
 peel-0.027
 court-0.025
 She-0.024
Token masterpiece
Token ID30669
Feature activation+7.97e-02

Pos logit contributions
immer+0.009
b+0.009
imm+0.009
 unforgettable+0.008
boat+0.008
Neg logit contributions
 rooms-0.005
 court-0.004
 kitchen-0.004
 tables-0.004
 slide-0.004
Token.
Token ID13
Feature activation+5.82e-03

Pos logit contributions
 unforgettable+0.185
 newfound+0.182
 lifelong+0.172
immer+0.171
 thrilling+0.171
Neg logit contributions
 peel-0.044
 court-0.040
 rooms-0.034
 tables-0.034
 slide-0.031
Token She
Token ID1375
Feature activation+2.44e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.011
 brand+0.010
 thrilling+0.010
imm+0.010
Neg logit contributions
 seeker-0.005
iffs-0.005
 slams-0.004
 court-0.004
 peel-0.004
Token couldn
Token ID3521
Feature activation+1.03e-03

Pos logit contributions
immer+0.004
 unforgettable+0.004
imm+0.004
 lifetime+0.003
 newfound+0.003
Neg logit contributions
 rooms-0.003
 yog-0.003
 peel-0.003
 slide-0.003
 tables-0.002
Token't
Token ID470
Feature activation-9.68e-03

Pos logit contributions
 unforgettable+0.002
 newfound+0.002
b+0.002
imm+0.002
 thrilling+0.002
Neg logit contributions
iffs-0.001
 seeker-0.001
 court-0.001
 slams-0.001
 peel-0.001
Token contain
Token ID3994
Feature activation-4.83e-02

Pos logit contributions
iffs+0.012
 seeker+0.011
 slams+0.011
 court+0.011
 rooms+0.011
Neg logit contributions
 unforgettable-0.015
 newfound-0.014
b-0.014
imm-0.014
 thrilling-0.014
Token you
Token ID345
Feature activation+1.69e-02

Pos logit contributions
iffs+0.009
 Enjoy+0.008
 peel+0.008
 court+0.008
 yourselves+0.008
Neg logit contributions
 unforgettable-0.018
imm-0.018
 thrilling-0.017
 newfound-0.017
immer-0.017
Token?"
Token ID1701
Feature activation-8.67e-03

Pos logit contributions
 unforgettable+0.036
imm+0.036
 newfound+0.035
b+0.034
 thrilling+0.034
Neg logit contributions
iffs-0.012
 seeker-0.012
 court-0.011
 rooms-0.011
 peel-0.010
Token
Token ID198
Feature activation+4.99e-03

Pos logit contributions
iffs+0.010
 seeker+0.009
 court+0.009
 slams+0.009
 Gard+0.009
Neg logit contributions
 unforgettable-0.015
 newfound-0.014
b-0.014
imm-0.013
 thrilling-0.013
Token
Token ID198
Feature activation+3.38e-03

Pos logit contributions
 unforgettable+0.011
imm+0.010
 newfound+0.010
 thrilling+0.010
immer+0.010
Neg logit contributions
iffs-0.004
 court-0.003
 peel-0.003
 rooms-0.003
 seeker-0.003
TokenB
Token ID33
Feature activation-1.66e-02

Pos logit contributions
 unforgettable+0.006
imm+0.006
 newfound+0.006
 happened+0.005
 thrilling+0.005
Neg logit contributions
iffs-0.004
 court-0.004
 peel-0.004
 seeker-0.003
 Gard-0.003
Tokenrun
Token ID5143
Feature activation+7.89e-02

Pos logit contributions
iffs+0.021
 seeker+0.018
 peel+0.017
 court+0.017
 Gard+0.017
Neg logit contributions
 unforgettable-0.029
 newfound-0.027
 thrilling-0.025
b-0.025
 remarkable-0.025
Tokeno
Token ID78
Feature activation+1.18e-02

Pos logit contributions
 unforgettable+0.132
 lifetime+0.129
 newfound+0.124
 brand+0.123
 thrilling+0.119
Neg logit contributions
iffs-0.117
ud-0.092
atters-0.090
other-0.088
atches-0.086
Token said
Token ID531
Feature activation+2.04e-02

Pos logit contributions
 unforgettable+0.019
 newfound+0.018
 thrilling+0.018
 lifelong+0.018
 lifetime+0.017
Neg logit contributions
 peel-0.015
iffs-0.014
 court-0.013
 slams-0.013
 yog-0.013
Token,
Token ID11
Feature activation-2.22e-03

Pos logit contributions
 unforgettable+0.039
 newfound+0.036
 thrilling+0.035
imm+0.035
 lifetime+0.035
Neg logit contributions
iffs-0.020
 seeker-0.018
 peel-0.017
 court-0.017
 gard-0.017
Token "
Token ID366
Feature activation+1.11e-02

Pos logit contributions
iffs+0.003
 seeker+0.003
 slams+0.003
 rooms+0.003
 court+0.003
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
b-0.003
imm-0.003
 thrilling-0.003
TokenOf
Token ID5189
Feature activation+4.03e-03

Pos logit contributions
 unforgettable+0.018
 newfound+0.018
imm+0.017
 thrilling+0.017
 brand+0.017
Neg logit contributions
iffs-0.013
 peel-0.012
 seeker-0.011
 Gard-0.011
 court-0.011
Token his
Token ID465
Feature activation-3.07e-02

Pos logit contributions
iffs+0.025
 court+0.025
 peel+0.024
 seeker+0.024
 rooms+0.023
Neg logit contributions
 unforgettable-0.038
imm-0.037
immer-0.036
 newfound-0.036
b-0.035
Token kick
Token ID4829
Feature activation-1.25e-03

Pos logit contributions
 court+0.027
 peel+0.025
 seeker+0.024
 slide+0.023
 sink+0.022
Neg logit contributions
imm-0.063
immer-0.060
b-0.059
 god-0.059
pr-0.058
Token-
Token ID12
Feature activation+4.17e-02

Pos logit contributions
 court+0.001
 peel+0.001
 seeker+0.001
iffs+0.001
 slide+0.001
Neg logit contributions
 unforgettable-0.003
imm-0.003
immer-0.003
 lifetime-0.003
 remarkable-0.003
Tokenand
Token ID392
Feature activation-3.98e-03

Pos logit contributions
 unforgettable+0.063
 newfound+0.061
imm+0.059
 brand+0.058
immer+0.058
Neg logit contributions
iffs-0.059
 peel-0.050
 court-0.049
 Enjoy-0.048
 seeker-0.048
Token-
Token ID12
Feature activation+3.64e-02

Pos logit contributions
iffs+0.007
 peel+0.006
 court+0.006
 seeker+0.006
 Enjoy+0.006
Neg logit contributions
 unforgettable-0.004
imm-0.004
 newfound-0.004
 lifelong-0.004
immer-0.004
Tokench
Token ID354
Feature activation+7.87e-02

Pos logit contributions
 unforgettable+0.057
 newfound+0.056
imm+0.054
immer+0.053
 brand+0.053
Neg logit contributions
iffs-0.049
 court-0.043
 seeker-0.042
 peel-0.041
 Enjoy-0.041
Tokenase
Token ID589
Feature activation-5.25e-03

Pos logit contributions
 unforgettable+0.155
 lifetime+0.145
 remarkable+0.144
 brand+0.141
 newfound+0.140
Neg logit contributions
iffs-0.073
 court-0.072
 peel-0.069
 seeker-0.067
 slide-0.060
Token game
Token ID983
Feature activation+4.51e-03

Pos logit contributions
 court+0.005
 peel+0.004
 seeker+0.004
 slide+0.004
iffs+0.004
Neg logit contributions
 unforgettable-0.011
b-0.010
 newfound-0.010
immer-0.010
 lifetime-0.010
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 unforgettable+0.010
 newfound+0.009
immer+0.009
imm+0.009
 lifelong+0.009
Neg logit contributions
 court-0.004
 peel-0.003
iffs-0.003
 seeker-0.003
 rooms-0.003
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
 court+0.009
 seeker+0.009
 peel+0.009
iffs+0.008
 slams+0.008
Neg logit contributions
imm-0.020
 unforgettable-0.019
immer-0.019
b-0.018
 newfound-0.018
Token knew
Token ID2993
Feature activation+8.27e-04

Pos logit contributions
 seeker+0.005
iffs+0.005
 court+0.005
 rooms+0.005
 slams+0.005
Neg logit contributions
 unforgettable-0.007
imm-0.007
immer-0.007
 newfound-0.007
b-0.007
Token her
Token ID607
Feature activation-2.23e-02

Pos logit contributions
 unforgettable+0.022
 newfound+0.020
b+0.020
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.014
 court-0.013
 seeker-0.013
 slams-0.013
 peel-0.012
Token candy
Token ID18550
Feature activation-7.63e-03

Pos logit contributions
 court+0.021
 peel+0.021
iffs+0.019
 kitchen+0.017
 seeker+0.017
Neg logit contributions
imm-0.044
 unforgettable-0.043
b-0.043
chie-0.041
immer-0.041
Token,
Token ID11
Feature activation+4.38e-03

Pos logit contributions
 court+0.007
 peel+0.006
 tables+0.005
iffs+0.005
 slams+0.005
Neg logit contributions
 unforgettable-0.016
b-0.015
 newfound-0.015
 thrilling-0.015
 lifelong-0.014
Token she
Token ID673
Feature activation-1.72e-03

Pos logit contributions
 unforgettable+0.008
imm+0.007
b+0.007
immer+0.007
 thrilling+0.007
Neg logit contributions
 seeker-0.005
 court-0.005
 rooms-0.004
 tables-0.004
 slams-0.004
Token sc
Token ID629
Feature activation+1.47e-02

Pos logit contributions
 peel+0.002
 rooms+0.002
 court+0.002
 slams+0.002
 tables+0.002
Neg logit contributions
 unforgettable-0.003
imm-0.003
 newfound-0.003
 thrilling-0.003
 lifelong-0.003
Tokenrun
Token ID5143
Feature activation+7.81e-02

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
b+0.020
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.020
 seeker-0.018
 court-0.018
 slams-0.018
 peel-0.018
Tokenched
Token ID1740
Feature activation-1.36e-02

Pos logit contributions
 unforgettable+0.133
 brand+0.129
 newfound+0.126
 lifetime+0.124
 thrilling+0.123
Neg logit contributions
iffs-0.105
ud-0.092
other-0.088
atters-0.082
atches-0.081
Token her
Token ID607
Feature activation-1.37e-02

Pos logit contributions
iffs+0.014
 seeker+0.013
 court+0.013
 peel+0.013
 rooms+0.012
Neg logit contributions
 unforgettable-0.024
imm-0.023
 thrilling-0.022
 newfound-0.022
 lifetime-0.022
Token eyes
Token ID2951
Feature activation-1.02e-03

Pos logit contributions
 peel+0.013
 ear+0.012
 court+0.012
 seeker+0.012
 kitchen+0.012
Neg logit contributions
imm-0.027
 unforgettable-0.026
b-0.025
immer-0.025
chie-0.025
Token tight
Token ID5381
Feature activation+1.67e-02

Pos logit contributions
 peel+0.001
iffs+0.001
 court+0.001
 seeker+0.001
 tables+0.001
Neg logit contributions
 unforgettable-0.002
 newfound-0.002
b-0.002
imm-0.002
 lifelong-0.002
Token.
Token ID13
Feature activation+2.08e-03

Pos logit contributions
 unforgettable+0.034
 newfound+0.033
 thrilling+0.031
b+0.031
imm+0.031
Neg logit contributions
iffs-0.013
 peel-0.012
 court-0.012
 seeker-0.012
 slams-0.012
Token was
Token ID373
Feature activation-3.54e-03

Pos logit contributions
 slide+0.007
 peel+0.007
 rooms+0.007
 slams+0.007
 pit+0.007
Neg logit contributions
 unforgettable-0.012
 lifelong-0.011
chie-0.011
immer-0.011
 thrilling-0.011
Token dark
Token ID3223
Feature activation-6.64e-04

Pos logit contributions
 many+0.003
 all+0.003
 rooms+0.003
 seeker+0.003
 kitchen+0.003
Neg logit contributions
 lifetime-0.007
imm-0.006
chie-0.006
agles-0.006
making-0.006
Token but
Token ID475
Feature activation-1.00e-02

Pos logit contributions
iffs+0.001
 seeker+0.001
 court+0.001
 slams+0.001
 rooms+0.001
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
b-0.001
imm-0.001
 thrilling-0.001
Token it
Token ID340
Feature activation+2.37e-03

Pos logit contributions
iffs+0.012
 rooms+0.012
 seeker+0.012
 slams+0.012
 tables+0.011
Neg logit contributions
 unforgettable-0.015
imm-0.015
b-0.014
 newfound-0.014
immer-0.014
Token gl
Token ID1278
Feature activation+2.06e-02

Pos logit contributions
 unforgettable+0.004
 lifelong+0.003
 newfound+0.003
immer+0.003
 lifetime+0.003
Neg logit contributions
 peel-0.003
 slams-0.003
 sink-0.003
 rooms-0.003
 seeker-0.002
Tokenimm
Token ID8608
Feature activation+7.78e-02

Pos logit contributions
 unforgettable+0.023
 newfound+0.022
 lifelong+0.020
 lifetime+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.035
atters-0.033
 slams-0.031
 seeker-0.030
ast-0.029
Tokenered
Token ID1068
Feature activation-1.48e-03

Pos logit contributions
 unforgettable+0.110
 lifetime+0.107
 newfound+0.102
 lifelong+0.100
 thrilling+0.099
Neg logit contributions
iffs-0.106
 rooms-0.097
 slams-0.097
 yourselves-0.095
atters-0.093
Token in
Token ID287
Feature activation-6.99e-03

Pos logit contributions
 tables+0.001
 rooms+0.001
 peel+0.001
 sink+0.000
 slams+0.000
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
 lifetime-0.003
imm-0.003
 lifelong-0.003
Token the
Token ID262
Feature activation+1.36e-03

Pos logit contributions
 rooms+0.006
 tables+0.006
 court+0.006
 peel+0.006
 seeker+0.005
Neg logit contributions
imm-0.013
 unforgettable-0.013
b-0.013
 lifetime-0.012
 newfound-0.012
Token night
Token ID1755
Feature activation+7.84e-03

Pos logit contributions
 unforgettable+0.002
imm+0.002
 newfound+0.002
b+0.002
 thrilling+0.002
Neg logit contributions
 rooms-0.001
iffs-0.001
 seeker-0.001
 court-0.001
 Gard-0.001
Token sky
Token ID6766
Feature activation-1.53e-02

Pos logit contributions
 unforgettable+0.015
 newfound+0.014
imm+0.014
b+0.014
 thrilling+0.014
Neg logit contributions
 court-0.007
 peel-0.007
 rooms-0.007
iffs-0.007
 tables-0.007
Token hopped
Token ID45230
Feature activation-1.04e-02

Pos logit contributions
 peel+0.005
 sav+0.004
 gard+0.004
 court+0.004
 bar+0.004
Neg logit contributions
 unforgettable-0.009
 lifelong-0.009
immer-0.008
 newfound-0.008
 possible-0.008
Token over
Token ID625
Feature activation+2.08e-02

Pos logit contributions
iffs+0.012
 seeker+0.011
 court+0.011
 peel+0.011
 Gard+0.010
Neg logit contributions
 unforgettable-0.018
 newfound-0.017
imm-0.016
b-0.016
 lifetime-0.016
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
imm+0.041
 unforgettable+0.041
immer+0.039
 lifetime+0.038
b+0.038
Neg logit contributions
iffs-0.019
 court-0.017
 peel-0.017
 rooms-0.017
 tables-0.017
Token the
Token ID262
Feature activation-3.96e-03

Pos logit contributions
 unforgettable+0.018
 newfound+0.017
b+0.017
imm+0.017
 thrilling+0.016
Neg logit contributions
iffs-0.013
 seeker-0.013
 court-0.012
 slams-0.012
 Gard-0.012
Token b
Token ID275
Feature activation-2.81e-02

Pos logit contributions
 seeker+0.004
 rooms+0.004
iffs+0.004
 court+0.003
 gard+0.003
Neg logit contributions
imm-0.007
 unforgettable-0.007
 newfound-0.007
b-0.007
 happened-0.007
Tokenunn
Token ID20935
Feature activation-9.14e-02

Pos logit contributions
 seeker+0.043
 slams+0.043
 court+0.041
 rooms+0.041
iffs+0.040
Neg logit contributions
b-0.034
 unforgettable-0.034
imm-0.032
 newfound-0.032
immer-0.031
Tokenies
Token ID444
Feature activation-1.30e-02

Pos logit contributions
iffs+0.157
 seeker+0.149
 Gard+0.149
 court+0.148
 rooms+0.147
Neg logit contributions
 unforgettable-0.103
 newfound-0.092
 thrilling-0.089
imm-0.086
 brand-0.086
Token.
Token ID13
Feature activation+4.78e-03

Pos logit contributions
 court+0.010
 peel+0.010
iffs+0.009
 tables+0.008
 gard+0.008
Neg logit contributions
 unforgettable-0.027
imm-0.026
 newfound-0.025
 thrilling-0.025
 lifelong-0.025
Token They
Token ID1119
Feature activation-1.95e-03

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
b+0.008
imm+0.008
 thrilling+0.008
Neg logit contributions
iffs-0.004
 seeker-0.004
 court-0.004
 slams-0.004
 Gard-0.004
Token were
Token ID547
Feature activation-1.09e-02

Pos logit contributions
 peel+0.002
 court+0.002
 gard+0.002
 tables+0.002
 slide+0.002
Neg logit contributions
imm-0.003
 unforgettable-0.003
immer-0.003
 lifelong-0.003
 powers-0.003
Token so
Token ID523
Feature activation-5.04e-03

Pos logit contributions
 rooms+0.011
 seeker+0.010
 gard+0.009
 tables+0.009
 slams+0.009
Neg logit contributions
immer-0.019
imm-0.019
 unforgettable-0.018
b-0.018
 lifetime-0.018
Token took
Token ID1718
Feature activation-1.55e-02

Pos logit contributions
 slams+0.001
 peel+0.001
 rooms+0.001
 Gard+0.001
 tables+0.001
Neg logit contributions
 unforgettable-0.002
 newfound-0.002
 lifelong-0.002
imm-0.002
 thrilling-0.002
Token off
Token ID572
Feature activation-1.17e-02

Pos logit contributions
iffs+0.019
 seeker+0.018
 court+0.018
 slams+0.018
 rooms+0.018
Neg logit contributions
 unforgettable-0.024
 newfound-0.023
b-0.023
imm-0.022
 thrilling-0.022
Token the
Token ID262
Feature activation-8.44e-03

Pos logit contributions
iffs+0.018
 seeker+0.018
 Gard+0.017
 slams+0.016
 rooms+0.016
Neg logit contributions
 unforgettable-0.014
 newfound-0.014
b-0.013
 thrilling-0.013
 brand-0.012
Token oven
Token ID14361
Feature activation-4.25e-03

Pos logit contributions
iffs+0.008
 seeker+0.007
 court+0.007
 Gard+0.007
 slams+0.007
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
imm-0.015
 thrilling-0.014
Token m
Token ID285
Feature activation-1.22e-02

Pos logit contributions
iffs+0.005
 seeker+0.005
 court+0.005
 slams+0.005
 Gard+0.004
Neg logit contributions
 unforgettable-0.007
 newfound-0.007
b-0.006
imm-0.006
 thrilling-0.006
Tokenitt
Token ID715
Feature activation-8.59e-02

Pos logit contributions
 seeker+0.019
 court+0.017
 rooms+0.017
 slams+0.017
iffs+0.017
Neg logit contributions
imm-0.015
 unforgettable-0.015
immer-0.015
b-0.014
 newfound-0.014
Tokens
Token ID82
Feature activation-2.16e-02

Pos logit contributions
iffs+0.136
 court+0.127
 peel+0.126
 seeker+0.126
 Gard+0.125
Neg logit contributions
 unforgettable-0.109
 newfound-0.100
 thrilling-0.095
imm-0.094
b-0.092
Token and
Token ID290
Feature activation-9.48e-03

Pos logit contributions
 peel+0.016
iffs+0.014
 rooms+0.014
 court+0.014
 tables+0.014
Neg logit contributions
 unforgettable-0.048
 lifetime-0.046
 thrilling-0.045
 newfound-0.045
 lifelong-0.045
Token opened
Token ID4721
Feature activation+1.11e-03

Pos logit contributions
iffs+0.012
 rooms+0.012
 slams+0.012
 peel+0.012
 court+0.012
Neg logit contributions
 unforgettable-0.014
 newfound-0.013
imm-0.013
b-0.013
 thrilling-0.013
Token the
Token ID262
Feature activation+3.24e-03

Pos logit contributions
 unforgettable+0.001
 newfound+0.001
b+0.001
 thrilling+0.001
imm+0.001
Neg logit contributions
iffs-0.002
 seeker-0.002
 court-0.002
 Gard-0.002
 slams-0.002
Token oven
Token ID14361
Feature activation+7.28e-03

Pos logit contributions
 unforgettable+0.006
 newfound+0.006
b+0.006
 brand+0.006
 remarkable+0.006
Neg logit contributions
iffs-0.002
 seeker-0.002
 court-0.002
 Gard-0.002
 slams-0.002
Token
Token ID220
Feature activation+1.20e-02

Pos logit contributions
 unforgettable+0.019
 newfound+0.018
imm+0.018
 thrilling+0.017
 lifetime+0.017
Neg logit contributions
 seeker-0.009
iffs-0.008
 peel-0.008
 court-0.007
 slams-0.007
Token She
Token ID1375
Feature activation+2.29e-03

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
b+0.021
 thrilling+0.020
imm+0.020
Neg logit contributions
iffs-0.012
 court-0.011
 seeker-0.011
 slams-0.011
 Gard-0.010
Token grabbed
Token ID13176
Feature activation-1.09e-02

Pos logit contributions
 unforgettable+0.003
immer+0.003
imm+0.003
 newfound+0.003
 lifelong+0.003
Neg logit contributions
 peel-0.003
 Gard-0.003
 gard-0.003
 slams-0.002
 rooms-0.002
Token oven
Token ID14361
Feature activation-1.01e-02

Pos logit contributions
iffs+0.015
 seeker+0.014
 court+0.013
 slams+0.013
 Gard+0.013
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
imm-0.015
 thrilling-0.015
Token m
Token ID285
Feature activation-8.43e-03

Pos logit contributions
 peel+0.007
iffs+0.007
 tables+0.007
 court+0.007
 rooms+0.007
Neg logit contributions
 unforgettable-0.021
 newfound-0.020
 lifetime-0.020
 thrilling-0.020
b-0.020
Tokenitt
Token ID715
Feature activation-8.49e-02

Pos logit contributions
 seeker+0.008
iffs+0.007
 rooms+0.007
 court+0.007
 slams+0.007
Neg logit contributions
 unforgettable-0.016
imm-0.015
b-0.015
 newfound-0.015
immer-0.015
Tokens
Token ID82
Feature activation-1.70e-02

Pos logit contributions
iffs+0.130
 peel+0.123
 Gard+0.118
 court+0.116
 rooms+0.115
Neg logit contributions
 unforgettable-0.115
 newfound-0.102
 thrilling-0.100
 brand-0.096
 remarkable-0.096
Token and
Token ID290
Feature activation-8.30e-03

Pos logit contributions
 peel+0.010
 kitchen+0.007
 tables+0.007
iffs+0.007
 rooms+0.007
Neg logit contributions
 unforgettable-0.041
 lifetime-0.039
 thrilling-0.038
 lifelong-0.038
imm-0.038
Token carefully
Token ID7773
Feature activation+6.83e-03

Pos logit contributions
 peel+0.010
 tables+0.010
 rooms+0.010
 slams+0.010
 Gard+0.010
Neg logit contributions
 unforgettable-0.013
imm-0.012
 newfound-0.012
immer-0.012
b-0.012
Token pulled
Token ID5954
Feature activation+3.58e-03

Pos logit contributions
 unforgettable+0.012
 lifelong+0.011
 thrilling+0.011
 newfound+0.011
 lifetime+0.011
Neg logit contributions
 peel-0.009
 tables-0.008
 gard-0.007
 court-0.007
iffs-0.007
Token out
Token ID503
Feature activation+7.59e-03

Pos logit contributions
 unforgettable+0.007
 lifetime+0.007
imm+0.006
 newfound+0.006
 thrilling+0.006
Neg logit contributions
 peel-0.003
iffs-0.003
 tables-0.003
 Gard-0.003
 seeker-0.003
TokenBob
Token ID18861
Feature activation-5.69e-04

Pos logit contributions
 unforgettable+0.009
 newfound+0.008
b+0.008
 thrilling+0.008
imm+0.008
Neg logit contributions
iffs-0.009
 seeker-0.009
 slams-0.008
 court-0.008
 rooms-0.008
Token smiled
Token ID13541
Feature activation-9.34e-03

Pos logit contributions
 peel+0.001
 court+0.001
 rooms+0.001
 slams+0.001
 Gard+0.001
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
 lifelong-0.001
 brand-0.001
imm-0.001
Token at
Token ID379
Feature activation+4.64e-03

Pos logit contributions
iffs+0.004
 seeker+0.004
 gard+0.004
 rooms+0.004
 She+0.004
Neg logit contributions
 unforgettable-0.021
b-0.020
imm-0.020
 lifetime-0.020
 newfound-0.020
Token the
Token ID262
Feature activation-7.50e-03

Pos logit contributions
 unforgettable+0.007
 newfound+0.007
b+0.007
imm+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.006
 seeker-0.006
 court-0.006
 slams-0.006
 Gard-0.006
Token b
Token ID275
Feature activation-2.35e-02

Pos logit contributions
 rooms+0.008
 seeker+0.007
 court+0.007
 tables+0.007
 Gard+0.007
Neg logit contributions
imm-0.014
 unforgettable-0.013
immer-0.013
b-0.013
 newfound-0.013
Tokenunn
Token ID20935
Feature activation-8.45e-02

Pos logit contributions
 seeker+0.030
iffs+0.029
 slams+0.029
 court+0.029
 rooms+0.027
Neg logit contributions
b-0.035
 unforgettable-0.034
imm-0.033
 newfound-0.032
immer-0.031
Tokenies
Token ID444
Feature activation-7.90e-03

Pos logit contributions
iffs+0.154
 seeker+0.149
 court+0.146
 Gard+0.146
 rooms+0.146
Neg logit contributions
 unforgettable-0.084
 newfound-0.076
b-0.072
 thrilling-0.071
imm-0.071
Token and
Token ID290
Feature activation+1.86e-03

Pos logit contributions
 court+0.006
 peel+0.006
 slams+0.005
 rooms+0.005
 Gard+0.005
Neg logit contributions
 unforgettable-0.016
imm-0.016
b-0.015
 newfound-0.015
 thrilling-0.015
Token asked
Token ID1965
Feature activation+4.76e-04

Pos logit contributions
 unforgettable+0.002
immer+0.002
imm+0.002
 newfound+0.002
 thrilling+0.002
Neg logit contributions
 gard-0.003
 Gard-0.003
 court-0.003
 slams-0.003
 rooms-0.003
Token:
Token ID25
Feature activation+5.41e-03

Pos logit contributions
 unforgettable+0.001
imm+0.001
 lifetime+0.001
b+0.001
 newfound+0.001
Neg logit contributions
iffs-0.000
 court-0.000
 rooms-0.000
 Gard-0.000
 Enjoy-0.000
Token "
Token ID366
Feature activation+9.80e-03

Pos logit contributions
 unforgettable+0.010
imm+0.009
 newfound+0.009
b+0.009
 thrilling+0.009
Neg logit contributions
iffs-0.006
 court-0.005
 Gard-0.005
 seeker-0.005
 peel-0.005
Token and
Token ID290
Feature activation+6.66e-03

Pos logit contributions
iffs+0.013
 seeker+0.013
 peel+0.012
 court+0.012
 Enjoy+0.011
Neg logit contributions
 unforgettable-0.021
imm-0.020
 newfound-0.019
b-0.019
 thrilling-0.019
Token get
Token ID651
Feature activation+1.59e-02

Pos logit contributions
 unforgettable+0.012
 newfound+0.011
b+0.011
imm+0.011
 thrilling+0.011
Neg logit contributions
iffs-0.007
 seeker-0.006
 court-0.006
 slams-0.006
 peel-0.006
Token it
Token ID340
Feature activation+3.09e-02

Pos logit contributions
imm+0.030
 unforgettable+0.028
 newfound+0.027
immer+0.027
 lifelong+0.027
Neg logit contributions
iffs-0.018
 Enjoy-0.016
 seeker-0.015
 peel-0.015
 yourselves-0.015
Token,
Token ID11
Feature activation-1.92e-03

Pos logit contributions
 newfound+0.072
 unforgettable+0.069
 remarkable+0.067
 lifelong+0.065
b+0.064
Neg logit contributions
."-0.018
 seeker-0.015
iffs-0.014
 slide-0.014
 peel-0.013
Token k
Token ID479
Feature activation-1.13e-02

Pos logit contributions
 seeker+0.002
 peel+0.002
 gard+0.002
 Gard+0.002
iffs+0.002
Neg logit contributions
imm-0.003
b-0.003
 unforgettable-0.003
 newfound-0.003
 lifetime-0.003
Tokenitt
Token ID715
Feature activation-8.44e-02

Pos logit contributions
iffs+0.018
 seeker+0.017
 court+0.016
 slams+0.016
 peel+0.016
Neg logit contributions
 unforgettable-0.015
 newfound-0.014
 thrilling-0.013
b-0.013
imm-0.013
Tokenies
Token ID444
Feature activation-9.87e-03

Pos logit contributions
 peel+0.113
iffs+0.111
 eats+0.102
ud+0.098
itt+0.096
Neg logit contributions
 unforgettable-0.138
 newfound-0.124
 brand-0.123
 thrilling-0.121
 lifelong-0.119
Token!"
Token ID2474
Feature activation-8.30e-03

Pos logit contributions
 seeker+0.008
 court+0.008
 peel+0.007
iffs+0.007
 gard+0.007
Neg logit contributions
 unforgettable-0.020
 newfound-0.020
imm-0.019
 brand-0.019
 thrilling-0.019
Token she
Token ID673
Feature activation+1.37e-02

Pos logit contributions
 court+0.008
iffs+0.008
 slams+0.007
 Gard+0.007
 peel+0.007
Neg logit contributions
 unforgettable-0.015
imm-0.015
b-0.015
 newfound-0.015
 thrilling-0.014
Token called
Token ID1444
Feature activation+1.95e-02

Pos logit contributions
 unforgettable+0.024
 lifetime+0.023
 newfound+0.023
 lifelong+0.022
 thrilling+0.022
Neg logit contributions
 peel-0.015
 yog-0.015
 Ben-0.015
 Anna-0.015
 slams-0.015
Token,
Token ID11
Feature activation+3.86e-03

Pos logit contributions
 unforgettable+0.036
b+0.034
 newfound+0.034
imm+0.034
 thrilling+0.033
Neg logit contributions
 seeker-0.020
 court-0.019
iffs-0.019
 peel-0.018
 slams-0.018
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
imm+0.042
 unforgettable+0.042
immer+0.040
b+0.040
 newfound+0.038
Neg logit contributions
 court-0.020
iffs-0.020
 rooms-0.019
 tables-0.019
 peel-0.018
Token the
Token ID262
Feature activation-1.83e-03

Pos logit contributions
 unforgettable+0.017
 newfound+0.016
b+0.015
 thrilling+0.015
imm+0.015
Neg logit contributions
iffs-0.013
 seeker-0.012
 slams-0.012
 Gard-0.012
 rooms-0.012
Token cage
Token ID17186
Feature activation-3.50e-03

Pos logit contributions
iffs+0.002
 seeker+0.002
 court+0.002
 rooms+0.002
 slams+0.002
Neg logit contributions
 unforgettable-0.003
imm-0.003
b-0.003
 newfound-0.003
 thrilling-0.003
Token of
Token ID286
Feature activation+3.18e-02

Pos logit contributions
 court+0.003
 rooms+0.002
 slams+0.002
iffs+0.002
 peel+0.002
Neg logit contributions
 unforgettable-0.007
 newfound-0.007
b-0.007
 lifelong-0.007
imm-0.007
Token b
Token ID275
Feature activation-1.98e-02

Pos logit contributions
imm+0.067
b+0.064
 unforgettable+0.062
immer+0.061
 lifetime+0.061
Neg logit contributions
 court-0.029
 peel-0.025
 rooms-0.025
 tables-0.025
iffs-0.024
Tokenunn
Token ID20935
Feature activation-8.39e-02

Pos logit contributions
 seeker+0.026
iffs+0.025
 slams+0.025
 rooms+0.025
 Gard+0.024
Neg logit contributions
 unforgettable-0.028
b-0.028
imm-0.027
 newfound-0.026
immer-0.026
Tokenies
Token ID444
Feature activation-9.16e-03

Pos logit contributions
iffs+0.133
 rooms+0.129
 court+0.128
 seeker+0.126
 Gard+0.126
Neg logit contributions
 unforgettable-0.108
 newfound-0.096
 thrilling-0.095
imm-0.091
 brand-0.090
Token.
Token ID13
Feature activation+4.91e-03

Pos logit contributions
 court+0.007
 peel+0.005
 sink+0.005
 tables+0.005
 kitchen+0.005
Neg logit contributions
 unforgettable-0.020
imm-0.020
 newfound-0.019
b-0.019
 thrilling-0.019
Token
Token ID220
Feature activation+4.08e-03

Pos logit contributions
 unforgettable+0.008
 newfound+0.008
 brand+0.007
 thrilling+0.007
 happened+0.007
Neg logit contributions
 seeker-0.006
iffs-0.006
 peel-0.006
 slams-0.005
 court-0.005
Token
Token ID198
Feature activation+8.52e-03

Pos logit contributions
 unforgettable+0.007
 newfound+0.006
b+0.006
imm+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.005
 seeker-0.004
 court-0.004
 slams-0.004
 Gard-0.004
Token
Token ID198
Feature activation+9.05e-03

Pos logit contributions
 unforgettable+0.014
 newfound+0.013
b+0.013
imm+0.013
 thrilling+0.013
Neg logit contributions
iffs-0.010
 seeker-0.009
 slams-0.009
 court-0.009
 Gard-0.009
Token They
Token ID1119
Feature activation-1.63e-02

Pos logit contributions
 seeker+0.007
 Gard+0.006
 peel+0.006
ud+0.006
 yog+0.006
Neg logit contributions
 unforgettable-0.007
 printed-0.007
 happened-0.007
 brand-0.007
 newfound-0.007
Token put
Token ID1234
Feature activation-1.84e-02

Pos logit contributions
iffs+0.029
 seeker+0.028
 court+0.027
 Gard+0.027
 slams+0.026
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
 thrilling-0.014
imm-0.014
Token on
Token ID319
Feature activation+1.22e-02

Pos logit contributions
iffs+0.022
 seeker+0.019
 slams+0.019
 peel+0.019
 rooms+0.019
Neg logit contributions
 unforgettable-0.032
 newfound-0.030
imm-0.030
 thrilling-0.030
b-0.029
Token oven
Token ID14361
Feature activation-1.97e-03

Pos logit contributions
 unforgettable+0.012
 newfound+0.011
 brand+0.011
 thrilling+0.010
 coloured+0.010
Neg logit contributions
 seeker-0.022
iffs-0.021
 Gard-0.020
 court-0.020
 slams-0.019
Token m
Token ID285
Feature activation-2.51e-03

Pos logit contributions
iffs+0.002
 rooms+0.002
 peel+0.002
 tables+0.002
 slams+0.002
Neg logit contributions
 unforgettable-0.004
 newfound-0.004
 lifetime-0.004
b-0.004
 thrilling-0.004
Tokenitt
Token ID715
Feature activation-8.16e-02

Pos logit contributions
 seeker+0.003
 slams+0.003
 rooms+0.003
 court+0.003
 gard+0.003
Neg logit contributions
 unforgettable-0.004
imm-0.004
b-0.004
immer-0.004
 newfound-0.004
Tokens
Token ID82
Feature activation-1.33e-02

Pos logit contributions
iffs+0.129
 peel+0.120
 court+0.117
 rooms+0.116
 Gard+0.116
Neg logit contributions
 unforgettable-0.108
 newfound-0.098
 thrilling-0.093
imm-0.092
 lifelong-0.091
Token and
Token ID290
Feature activation-5.63e-03

Pos logit contributions
iffs+0.012
 rooms+0.012
 peel+0.011
 tables+0.011
 court+0.010
Neg logit contributions
 unforgettable-0.028
 newfound-0.026
 lifetime-0.026
 thrilling-0.026
 lifelong-0.025
Token took
Token ID1718
Feature activation-1.49e-03

Pos logit contributions
iffs+0.007
 rooms+0.007
 peel+0.007
 seeker+0.007
 slams+0.007
Neg logit contributions
 unforgettable-0.009
 newfound-0.008
imm-0.008
b-0.008
 thrilling-0.008
Token out
Token ID503
Feature activation+1.09e-02

Pos logit contributions
iffs+0.002
 rooms+0.002
 peel+0.002
 seeker+0.002
 slams+0.002
Neg logit contributions
 unforgettable-0.002
 newfound-0.002
imm-0.002
b-0.002
 thrilling-0.002
Token the
Token ID262
Feature activation+3.71e-03

Pos logit contributions
 unforgettable+0.018
imm+0.017
 newfound+0.016
b+0.016
 thrilling+0.016
Neg logit contributions
iffs-0.013
 rooms-0.012
 slams-0.012
 court-0.012
 seeker-0.012
Tokenummy
Token ID13513
Feature activation-1.09e-02

Pos logit contributions
 seeker+0.046
 slams+0.043
iffs+0.041
 principal+0.041
 court+0.040
Neg logit contributions
imm-0.030
 unforgettable-0.028
immer-0.025
 newfound-0.025
 thrilling-0.025
Token food
Token ID2057
Feature activation-1.35e-02

Pos logit contributions
iffs+0.012
 rooms+0.011
 peel+0.011
 court+0.011
 tables+0.011
Neg logit contributions
 unforgettable-0.019
imm-0.018
 newfound-0.017
b-0.017
 thrilling-0.017
Token for
Token ID329
Feature activation+8.30e-04

Pos logit contributions
 peel+0.012
 court+0.011
iffs+0.010
 sink+0.010
 rooms+0.010
Neg logit contributions
 unforgettable-0.027
 newfound-0.026
imm-0.025
b-0.025
 lifetime-0.025
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 unforgettable+0.001
imm+0.001
b+0.001
 newfound+0.001
immer+0.001
Neg logit contributions
 court-0.001
 rooms-0.001
iffs-0.001
 tables-0.001
 seeker-0.001
Token b
Token ID275
Feature activation-2.69e-02

Pos logit contributions
 seeker+0.011
 court+0.011
iffs+0.011
 rooms+0.011
 peel+0.011
Neg logit contributions
 unforgettable-0.020
imm-0.019
 newfound-0.019
b-0.018
immer-0.018
Tokenunn
Token ID20935
Feature activation-8.16e-02

Pos logit contributions
 slams+0.037
 seeker+0.036
iffs+0.034
 rooms+0.033
 Gard+0.033
Neg logit contributions
 unforgettable-0.037
b-0.037
imm-0.035
 newfound-0.034
 thrilling-0.033
Tokenies
Token ID444
Feature activation-9.84e-03

Pos logit contributions
iffs+0.143
 seeker+0.138
 rooms+0.136
 court+0.136
 Gard+0.135
Neg logit contributions
 unforgettable-0.088
 newfound-0.080
 thrilling-0.075
imm-0.075
 brand-0.074
Token.
Token ID13
Feature activation+5.93e-03

Pos logit contributions
 court+0.007
 peel+0.007
 rooms+0.006
iffs+0.006
 seeker+0.006
Neg logit contributions
 unforgettable-0.021
imm-0.020
 newfound-0.020
 thrilling-0.020
immer-0.019
Token It
Token ID632
Feature activation+6.46e-03

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
 thrilling+0.009
imm+0.009
b+0.008
Neg logit contributions
 seeker-0.007
iffs-0.007
 slams-0.007
 Gard-0.007
 court-0.006
Token started
Token ID2067
Feature activation-1.85e-02

Pos logit contributions
 unforgettable+0.008
immer+0.008
 newfound+0.008
 lifetime+0.008
 brand+0.008
Neg logit contributions
 peel-0.009
 Gard-0.009
 rooms-0.009
 gard-0.009
 sink-0.009
Token with
Token ID351
Feature activation-2.29e-02

Pos logit contributions
 peel+0.017
 rooms+0.017
iffs+0.016
 tables+0.016
 sink+0.016
Neg logit contributions
 unforgettable-0.036
immer-0.033
 lifetime-0.033
imm-0.033
 newfound-0.032
Token.
Token ID13
Feature activation+4.40e-03

Pos logit contributions
 rooms+0.007
 tables+0.007
 sink+0.006
 court+0.005
 peel+0.005
Neg logit contributions
 unforgettable-0.054
 newfound-0.051
b-0.051
imm-0.051
 lifelong-0.050
Token
Token ID220
Feature activation+5.19e-03

Pos logit contributions
 unforgettable+0.009
b+0.009
 newfound+0.009
imm+0.009
 thrilling+0.009
Neg logit contributions
iffs-0.003
 court-0.003
 Gard-0.003
 rooms-0.002
 seeker-0.002
Token But
Token ID887
Feature activation-3.42e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.010
b+0.010
 thrilling+0.010
imm+0.010
Neg logit contributions
iffs-0.005
 court-0.004
 rooms-0.004
 Gard-0.004
atters-0.004
Token the
Token ID262
Feature activation-8.01e-03

Pos logit contributions
 seeker+0.003
 But+0.002
 court+0.002
 slams+0.002
 Ben+0.002
Neg logit contributions
imm-0.007
b-0.006
 unforgettable-0.006
immer-0.006
 thrilling-0.006
Token b
Token ID275
Feature activation-2.38e-02

Pos logit contributions
 rooms+0.008
 gard+0.008
 seeker+0.007
 tables+0.007
 Gard+0.007
Neg logit contributions
imm-0.016
b-0.015
immer-0.014
 unforgettable-0.014
ittered-0.014
Tokenunn
Token ID20935
Feature activation-8.13e-02

Pos logit contributions
iffs+0.020
 Gard+0.017
 rooms+0.016
 seeker+0.016
 peel+0.016
Neg logit contributions
 unforgettable-0.047
 newfound-0.045
 thrilling-0.044
b-0.044
 brand-0.044
Tokenies
Token ID444
Feature activation-1.47e-02

Pos logit contributions
 Gard+0.113
iffs+0.109
 gard+0.108
 rooms+0.105
 seeker+0.101
Neg logit contributions
 unforgettable-0.122
 thrilling-0.113
 brand-0.109
 newfound-0.104
 remarkable-0.104
Token didn
Token ID1422
Feature activation-1.45e-02

Pos logit contributions
 court+0.016
 peel+0.016
 slams+0.015
 rooms+0.015
 gard+0.015
Neg logit contributions
 unforgettable-0.024
 brand-0.023
imm-0.022
 lifelong-0.022
 thrilling-0.022
Token't
Token ID470
Feature activation-1.89e-02

Pos logit contributions
iffs+0.011
 court+0.010
 peel+0.010
 seeker+0.010
 slams+0.009
Neg logit contributions
 unforgettable-0.029
b-0.028
 newfound-0.028
imm-0.028
 lifetime-0.027
Token wake
Token ID7765
Feature activation-2.76e-02

Pos logit contributions
iffs+0.019
 seeker+0.019
 court+0.019
 peel+0.018
 Gard+0.018
Neg logit contributions
 unforgettable-0.033
 newfound-0.031
imm-0.031
b-0.031
 brand-0.030
Token up
Token ID510
Feature activation-1.31e-02

Pos logit contributions
 tables+0.015
 rooms+0.015
 slams+0.014
 sink+0.014
iffs+0.014
Neg logit contributions
 unforgettable-0.059
 lifetime-0.057
b-0.057
imm-0.056
 newfound-0.056
Token grass
Token ID8701
Feature activation-4.26e-02

Pos logit contributions
 unforgettable+0.008
imm+0.008
b+0.008
 newfound+0.008
immer+0.007
Neg logit contributions
iffs-0.005
 seeker-0.005
 court-0.005
 rooms-0.005
 Gard-0.005
Token with
Token ID351
Feature activation-9.00e-03

Pos logit contributions
 court+0.033
 seeker+0.031
iffs+0.030
 peel+0.030
 rooms+0.028
Neg logit contributions
 unforgettable-0.088
 newfound-0.087
b-0.082
 thrilling-0.080
imm-0.080
Token the
Token ID262
Feature activation+1.16e-03

Pos logit contributions
 court+0.007
 rooms+0.007
iffs+0.006
 peel+0.006
 Ben+0.006
Neg logit contributions
imm-0.018
 unforgettable-0.017
b-0.017
immer-0.017
 lifetime-0.017
Token loud
Token ID7812
Feature activation+1.16e-02

Pos logit contributions
imm+0.002
immer+0.002
 unforgettable+0.002
ittered+0.002
 happened+0.002
Neg logit contributions
 gard-0.001
 seeker-0.001
 rooms-0.001
 Gard-0.001
 yog-0.001
Token b
Token ID275
Feature activation-2.13e-02

Pos logit contributions
 unforgettable+0.022
imm+0.021
b+0.020
immer+0.020
pr+0.020
Neg logit contributions
 rooms-0.011
 tables-0.010
 ear-0.010
 peel-0.010
 gard-0.010
Tokenunn
Token ID20935
Feature activation-8.07e-02

Pos logit contributions
 seeker+0.027
iffs+0.027
 slams+0.027
 court+0.026
 rooms+0.025
Neg logit contributions
 unforgettable-0.032
b-0.030
 newfound-0.030
imm-0.030
 thrilling-0.029
Tokenies
Token ID444
Feature activation-5.23e-03

Pos logit contributions
iffs+0.130
 Gard+0.124
 rooms+0.123
 seeker+0.122
 court+0.121
Neg logit contributions
 unforgettable-0.100
 newfound-0.088
 thrilling-0.087
 brand-0.085
imm-0.083
Token.
Token ID13
Feature activation+9.16e-03

Pos logit contributions
 court+0.004
 peel+0.003
 seeker+0.003
iffs+0.003
 rooms+0.003
Neg logit contributions
imm-0.011
 unforgettable-0.011
 newfound-0.011
 thrilling-0.010
b-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.36e-02

Pos logit contributions
 unforgettable+0.014
 newfound+0.013
 thrilling+0.012
imm+0.012
b+0.012
Neg logit contributions
 seeker-0.012
iffs-0.011
 slams-0.011
 rooms-0.011
 peel-0.011
TokenOnce
Token ID7454
Feature activation-9.31e-03

Pos logit contributions
iffs+0.014
 seeker+0.013
 court+0.012
 rooms+0.012
 Gard+0.012
Neg logit contributions
 unforgettable-0.025
 newfound-0.024
b-0.023
imm-0.023
 thrilling-0.023
Token upon
Token ID2402
Feature activation-2.52e-04

Pos logit contributions
iffs+0.013
 seeker+0.012
 court+0.012
 rooms+0.012
 slams+0.012
Neg logit contributions
 unforgettable-0.014
imm-0.012
 newfound-0.012
b-0.012
 lifetime-0.012
Token.
Token ID13
Feature activation+8.47e-03

Pos logit contributions
 peel+0.004
iffs+0.004
 court+0.004
 chefs+0.004
 rooms+0.004
Neg logit contributions
 unforgettable-0.011
 newfound-0.010
 thrilling-0.010
b-0.010
 remarkable-0.010
Token It
Token ID632
Feature activation+2.70e-03

Pos logit contributions
 unforgettable+0.015
 newfound+0.014
imm+0.013
 thrilling+0.013
 lifelong+0.013
Neg logit contributions
 seeker-0.009
iffs-0.008
 slams-0.008
 peel-0.008
 court-0.008
Token made
Token ID925
Feature activation+1.78e-03

Pos logit contributions
 unforgettable+0.004
immer+0.004
 newfound+0.004
 lifetime+0.003
b+0.003
Neg logit contributions
 peel-0.004
 Gard-0.004
 rooms-0.003
 yog-0.003
 eats-0.003
Token the
Token ID262
Feature activation-2.04e-02

Pos logit contributions
b+0.003
 unforgettable+0.003
 lifetime+0.003
immer+0.003
imm+0.003
Neg logit contributions
 rooms-0.002
 tables-0.002
iffs-0.001
 Enjoy-0.001
 yourselves-0.001
Token b
Token ID275
Feature activation-2.71e-02

Pos logit contributions
iffs+0.032
 slams+0.030
 seeker+0.029
 Gard+0.029
 court+0.028
Neg logit contributions
 unforgettable-0.027
 newfound-0.025
b-0.024
 thrilling-0.023
 lifelong-0.023
Tokenunn
Token ID20935
Feature activation-8.01e-02

Pos logit contributions
 slams+0.037
 seeker+0.037
 court+0.035
iffs+0.035
 rooms+0.034
Neg logit contributions
 unforgettable-0.039
b-0.038
imm-0.037
 newfound-0.036
immer-0.034
Tokenies
Token ID444
Feature activation-7.43e-03

Pos logit contributions
iffs+0.147
 seeker+0.142
 court+0.139
 Gard+0.139
 rooms+0.139
Neg logit contributions
 unforgettable-0.078
 newfound-0.070
b-0.067
imm-0.066
 thrilling-0.066
Token smile
Token ID8212
Feature activation+1.49e-03

Pos logit contributions
 peel+0.008
 court+0.008
iffs+0.008
 rooms+0.008
 gard+0.007
Neg logit contributions
 unforgettable-0.012
imm-0.011
b-0.011
 thrilling-0.011
 happened-0.011
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
 unforgettable+0.003
b+0.003
 newfound+0.003
chie+0.003
 thrilling+0.003
Neg logit contributions
 peel-0.001
 seeker-0.000
iffs-0.000
der-0.000
 She-0.000
Token cheer
Token ID14042
Feature activation+1.37e-03

Pos logit contributions
imm+0.026
 unforgettable+0.026
immer+0.026
b+0.026
 thrilling+0.025
Neg logit contributions
iffs-0.018
 peel-0.018
 Gard-0.018
 rooms-0.017
 court-0.017
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
b+0.002
 unforgettable+0.002
 newfound+0.002
 thrilling+0.002
 lifetime+0.002
Neg logit contributions
iffs-0.001
 seeker-0.001
 rooms-0.001
 yourselves-0.001
 ear-0.001
Token the
Token ID262
Feature activation-4.52e-03

Pos logit contributions
 court+0.006
 rooms+0.005
 slams+0.005
 ear+0.005
 arrest+0.005
Neg logit contributions
b-0.018
imm-0.018
 unforgettable-0.018
immer-0.017
 lifetime-0.017
Token house
Token ID2156
Feature activation-1.87e-02

Pos logit contributions
 rooms+0.004
 court+0.003
 seeker+0.003
 Gard+0.003
 kitchen+0.003
Neg logit contributions
imm-0.009
 unforgettable-0.009
immer-0.009
 happened-0.009
ittered-0.009
Token saw
Token ID2497
Feature activation-8.78e-03

Pos logit contributions
 rooms+0.022
 tables+0.020
 court+0.020
 slams+0.019
 sink+0.018
Neg logit contributions
b-0.029
 unforgettable-0.029
immer-0.029
imm-0.028
 lifelong-0.028
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 Ben+0.006
 rooms+0.006
 court+0.006
 tables+0.006
 Sara+0.005
Neg logit contributions
 unforgettable-0.018
imm-0.018
 lifetime-0.018
b-0.018
immer-0.018
Token b
Token ID275
Feature activation-2.56e-02

Pos logit contributions
 seeker+0.015
 rooms+0.015
 gard+0.014
 Gard+0.013
 sink+0.013
Neg logit contributions
imm-0.024
 unforgettable-0.022
immer-0.021
b-0.021
ittered-0.021
Tokenunn
Token ID20935
Feature activation-7.99e-02

Pos logit contributions
iffs+0.027
 seeker+0.026
 court+0.025
 slams+0.024
 rooms+0.024
Neg logit contributions
 unforgettable-0.045
 newfound-0.043
b-0.042
imm-0.042
 thrilling-0.041
Tokenies
Token ID444
Feature activation-8.80e-03

Pos logit contributions
iffs+0.118
 Gard+0.118
 rooms+0.117
 gard+0.113
 seeker+0.112
Neg logit contributions
 unforgettable-0.107
 thrilling-0.096
 newfound-0.093
 brand-0.092
 remarkable-0.089
Token up
Token ID510
Feature activation-1.54e-03

Pos logit contributions
 court+0.007
 peel+0.007
 sink+0.007
 kitchen+0.006
 tables+0.006
Neg logit contributions
imm-0.017
 unforgettable-0.017
b-0.016
 thrilling-0.015
 newfound-0.015
Token in
Token ID287
Feature activation-2.50e-03

Pos logit contributions
 court+0.001
 peel+0.001
 slams+0.001
 seeker+0.001
iffs+0.001
Neg logit contributions
 unforgettable-0.003
imm-0.003
b-0.003
 lifetime-0.003
bors-0.003
Token the
Token ID262
Feature activation-1.65e-03

Pos logit contributions
 court+0.002
 rooms+0.002
 tables+0.002
 peel+0.002
 seeker+0.002
Neg logit contributions
imm-0.005
b-0.005
 unforgettable-0.005
immer-0.005
 lifetime-0.005
Token nest
Token ID16343
Feature activation-1.53e-02

Pos logit contributions
 rooms+0.002
 seeker+0.001
 court+0.001
 slams+0.001
 Gard+0.001
Neg logit contributions
 unforgettable-0.003
imm-0.003
b-0.003
 newfound-0.003
immer-0.003
Token the
Token ID262
Feature activation+5.39e-03

Pos logit contributions
imm+0.048
immer+0.046
 lifetime+0.045
 beating+0.045
 happened+0.044
Neg logit contributions
 Ben-0.015
 yourselves-0.015
 Sara-0.014
 Enjoy-0.014
 court-0.013
Token honey
Token ID12498
Feature activation+1.97e-02

Pos logit contributions
imm+0.011
ittered+0.010
 unforgettable+0.010
immer+0.010
 beating+0.009
Neg logit contributions
 seeker-0.005
 gard-0.005
 yog-0.005
 rooms-0.005
 court-0.005
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
 newfound+0.040
 unforgettable+0.040
 thrilling+0.038
imm+0.038
 remarkable+0.037
Neg logit contributions
iffs-0.015
 peel-0.015
 court-0.014
 seeker-0.014
 sink-0.012
Token the
Token ID262
Feature activation-2.25e-03

Pos logit contributions
imm+0.021
b+0.021
 unforgettable+0.021
 lifetime+0.020
 thrilling+0.020
Neg logit contributions
 court-0.010
 peel-0.010
iffs-0.009
 Enjoy-0.009
 yourselves-0.009
Token b
Token ID275
Feature activation-2.12e-02

Pos logit contributions
 seeker+0.002
 court+0.002
 rooms+0.002
 gard+0.002
 Gard+0.002
Neg logit contributions
imm-0.004
 unforgettable-0.004
b-0.004
 newfound-0.004
ittered-0.004
Tokenunn
Token ID20935
Feature activation-7.99e-02

Pos logit contributions
iffs+0.029
 seeker+0.028
 court+0.027
 slams+0.027
 rooms+0.027
Neg logit contributions
 unforgettable-0.031
 newfound-0.029
b-0.028
imm-0.028
 thrilling-0.027
Tokenies
Token ID444
Feature activation-3.02e-03

Pos logit contributions
iffs+0.116
 Gard+0.112
 rooms+0.112
 seeker+0.110
 gard+0.108
Neg logit contributions
 unforgettable-0.113
 thrilling-0.103
 newfound-0.102
 brand-0.100
 remarkable-0.097
Token!
Token ID0
Feature activation+1.08e-02

Pos logit contributions
 court+0.002
 peel+0.002
iffs+0.002
 seeker+0.002
 rooms+0.002
Neg logit contributions
 unforgettable-0.006
 newfound-0.006
imm-0.006
 thrilling-0.006
b-0.006
Tokenâ
Token ID22940
Feature activation+2.40e-02

Pos logit contributions
 unforgettable+0.021
imm+0.020
b+0.020
 newfound+0.020
 thrilling+0.019
Neg logit contributions
iffs-0.010
 court-0.009
 seeker-0.009
 Gard-0.008
 slams-0.008
Token€
Token ID26391
Feature activation+2.02e-02

Pos logit contributions
 unforgettable+0.033
 newfound+0.031
b+0.030
 thrilling+0.029
 brand+0.029
Neg logit contributions
iffs-0.035
 seeker-0.033
 court-0.032
 slams-0.032
 Gard-0.032
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
 unforgettable+0.032
 newfound+0.030
b+0.029
imm+0.029
 thrilling+0.029
Neg logit contributions
iffs-0.025
 seeker-0.024
 court-0.023
 slams-0.023
 Gard-0.023
Token learned
Token ID4499
Feature activation-1.28e-02

Pos logit contributions
iffs+0.027
 seeker+0.027
 court+0.027
 Gard+0.026
 slams+0.026
Neg logit contributions
 unforgettable-0.032
 newfound-0.030
imm-0.030
b-0.030
 thrilling-0.029
Token not
Token ID407
Feature activation-1.85e-03

Pos logit contributions
iffs+0.013
 rooms+0.013
 slams+0.013
 court+0.013
 seeker+0.012
Neg logit contributions
imm-0.022
 unforgettable-0.022
 newfound-0.021
immer-0.021
b-0.021
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
iffs+0.002
 seeker+0.002
 court+0.002
 slams+0.002
 rooms+0.002
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
imm-0.003
b-0.003
 thrilling-0.002
Token judge
Token ID5052
Feature activation-2.99e-02

Pos logit contributions
iffs+0.034
 seeker+0.032
 Gard+0.031
 slams+0.031
 rooms+0.031
Neg logit contributions
 unforgettable-0.030
b-0.028
 newfound-0.028
 thrilling-0.027
imm-0.026
Token by
Token ID416
Feature activation-2.78e-03

Pos logit contributions
 rooms+0.032
 court+0.031
iffs+0.029
 yourselves+0.028
 slams+0.028
Neg logit contributions
 unforgettable-0.055
imm-0.053
immer-0.052
 lifetime-0.051
 newfound-0.050
Token appearances
Token ID11057
Feature activation-7.82e-02

Pos logit contributions
iffs+0.003
 seeker+0.003
 court+0.003
 rooms+0.003
 Gard+0.003
Neg logit contributions
 unforgettable-0.005
 newfound-0.005
imm-0.005
b-0.005
 thrilling-0.004
Token.
Token ID13
Feature activation-8.21e-03

Pos logit contributions
iffs+0.063
 rooms+0.061
 court+0.060
 seeker+0.057
 slams+0.056
Neg logit contributions
 unforgettable-0.161
 newfound-0.155
imm-0.148
b-0.147
 lifetime-0.145
Token They
Token ID1119
Feature activation-2.33e-02

Pos logit contributions
 peel+0.008
 seeker+0.008
iffs+0.007
 Gard+0.007
 slams+0.007
Neg logit contributions
 unforgettable-0.015
 newfound-0.014
 lifelong-0.014
 thrilling-0.013
 brand-0.013
Token learned
Token ID4499
Feature activation-1.54e-02

Pos logit contributions
iffs+0.029
 seeker+0.029
 court+0.028
 slams+0.028
 Gard+0.027
Neg logit contributions
 unforgettable-0.035
 newfound-0.033
imm-0.033
b-0.033
 thrilling-0.032
Token to
Token ID284
Feature activation-1.26e-02

Pos logit contributions
iffs+0.017
 rooms+0.016
 court+0.016
 slams+0.016
 seeker+0.015
Neg logit contributions
 unforgettable-0.026
imm-0.026
 newfound-0.025
b-0.025
immer-0.025
Token help
Token ID1037
Feature activation-2.75e-02

Pos logit contributions
iffs+0.019
 seeker+0.018
 slams+0.018
 rooms+0.018
 Gard+0.018
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
 thrilling-0.014
imm-0.014
Token the
Token ID262
Feature activation-9.90e-04

Pos logit contributions
 court+0.004
 rooms+0.004
 peel+0.003
 tables+0.003
 arrest+0.003
Neg logit contributions
imm-0.009
 unforgettable-0.009
b-0.008
immer-0.008
 lifetime-0.008
Token nest
Token ID16343
Feature activation-7.86e-03

Pos logit contributions
 rooms+0.001
 seeker+0.001
 court+0.001
 peel+0.001
iffs+0.001
Neg logit contributions
 unforgettable-0.002
imm-0.002
b-0.002
 newfound-0.002
immer-0.002
Token.
Token ID13
Feature activation+4.70e-04

Pos logit contributions
 court+0.005
 peel+0.004
 sink+0.004
 rooms+0.004
 slams+0.004
Neg logit contributions
 newfound-0.017
 unforgettable-0.017
 lifelong-0.016
 thrilling-0.016
 remarkable-0.016
Token The
Token ID383
Feature activation-3.53e-03

Pos logit contributions
 unforgettable+0.001
 newfound+0.001
b+0.001
imm+0.001
 thrilling+0.001
Neg logit contributions
iffs-0.001
 seeker-0.001
 slams-0.001
 court-0.001
 rooms-0.001
Token b
Token ID275
Feature activation-2.10e-02

Pos logit contributions
 seeker+0.003
 court+0.003
 gard+0.003
 Gard+0.002
 rooms+0.002
Neg logit contributions
imm-0.008
 unforgettable-0.007
immer-0.007
b-0.007
 newfound-0.007
Tokenunn
Token ID20935
Feature activation-7.77e-02

Pos logit contributions
iffs+0.025
 seeker+0.024
 court+0.023
 slams+0.023
 Gard+0.023
Neg logit contributions
 unforgettable-0.034
 newfound-0.032
b-0.032
imm-0.031
 thrilling-0.031
Tokenies
Token ID444
Feature activation-7.25e-03

Pos logit contributions
iffs+0.115
 Gard+0.113
 seeker+0.110
 gard+0.108
 rooms+0.108
Neg logit contributions
 unforgettable-0.109
 thrilling-0.097
 brand-0.096
 newfound-0.095
 remarkable-0.090
Token could
Token ID714
Feature activation-2.24e-02

Pos logit contributions
 court+0.008
 peel+0.008
 gard+0.008
 Gard+0.008
 slams+0.007
Neg logit contributions
 unforgettable-0.012
imm-0.011
 thrilling-0.011
 brand-0.011
 lifelong-0.011
Token see
Token ID766
Feature activation-1.84e-02

Pos logit contributions
 peel+0.019
 court+0.018
iffs+0.018
 seeker+0.017
 slams+0.017
Neg logit contributions
 unforgettable-0.044
b-0.042
imm-0.042
 lifetime-0.042
 newfound-0.041
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
iffs+0.017
 Gard+0.015
 rooms+0.015
 Enjoy+0.015
 Ben+0.015
Neg logit contributions
 unforgettable-0.034
imm-0.033
 lifetime-0.032
b-0.032
 thrilling-0.031
Token baby
Token ID5156
Feature activation-1.26e-02

Pos logit contributions
 seeker+0.015
 rooms+0.015
 gard+0.014
 peel+0.014
 Gard+0.014
Neg logit contributions
imm-0.027
 unforgettable-0.027
immer-0.025
 newfound-0.025
b-0.025
Token.
Token ID13
Feature activation+7.17e-03

Pos logit contributions
 unforgettable+0.012
imm+0.012
b+0.012
 newfound+0.012
 brand+0.012
Neg logit contributions
 court-0.004
 peel-0.004
iffs-0.004
 seeker-0.003
 Gard-0.003
Token
Token ID198
Feature activation+9.43e-03

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
imm+0.009
b+0.008
 lifelong+0.008
Neg logit contributions
 seeker-0.011
 slams-0.010
 peel-0.010
iffs-0.009
 Gard-0.009
Token
Token ID198
Feature activation+8.40e-03

Pos logit contributions
 unforgettable+0.016
 newfound+0.015
b+0.015
imm+0.015
 thrilling+0.015
Neg logit contributions
iffs-0.010
 seeker-0.010
 court-0.009
 slams-0.009
 rooms-0.009
TokenThe
Token ID464
Feature activation-2.71e-03

Pos logit contributions
 unforgettable+0.012
 newfound+0.011
b+0.011
 thrilling+0.011
imm+0.011
Neg logit contributions
iffs-0.012
 seeker-0.011
 slams-0.011
 court-0.011
 Gard-0.011
Token b
Token ID275
Feature activation-1.94e-02

Pos logit contributions
 seeker+0.003
 gard+0.003
 Gard+0.003
 rooms+0.003
 court+0.003
Neg logit contributions
imm-0.005
ittered-0.005
immer-0.005
b-0.005
 unforgettable-0.005
Tokenunn
Token ID20935
Feature activation-7.75e-02

Pos logit contributions
iffs+0.021
 seeker+0.020
 court+0.020
 slams+0.020
 rooms+0.019
Neg logit contributions
 unforgettable-0.034
 newfound-0.032
b-0.032
imm-0.031
 thrilling-0.030
Tokenies
Token ID444
Feature activation-1.55e-03

Pos logit contributions
iffs+0.137
 seeker+0.132
 Gard+0.131
 rooms+0.130
 court+0.129
Neg logit contributions
 unforgettable-0.082
 newfound-0.073
 thrilling-0.071
b-0.069
imm-0.069
Token showed
Token ID3751
Feature activation-2.52e-02

Pos logit contributions
 court+0.002
 rooms+0.002
 Gard+0.002
 gard+0.002
 peel+0.002
Neg logit contributions
 unforgettable-0.003
 brand-0.002
 thrilling-0.002
imm-0.002
 newfound-0.002
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
iffs+0.027
 rooms+0.027
 court+0.026
 slams+0.025
 tables+0.025
Neg logit contributions
 unforgettable-0.043
imm-0.041
b-0.040
immer-0.040
 lifetime-0.040
Token elderly
Token ID13830
Feature activation-1.20e-02

Pos logit contributions
 Gard+0.018
 rooms+0.018
 seeker+0.018
 gard+0.018
 court+0.017
Neg logit contributions
imm-0.034
 unforgettable-0.033
immer-0.032
b-0.032
ittered-0.031
Token rabbit
Token ID22746
Feature activation-1.94e-02

Pos logit contributions
iffs+0.016
 seeker+0.015
 court+0.015
 Gard+0.015
 rooms+0.015
Neg logit contributions
 unforgettable-0.018
 newfound-0.017
imm-0.016
b-0.016
 thrilling-0.016
Token put
Token ID1234
Feature activation-1.62e-02

Pos logit contributions
 rooms+0.009
 slams+0.009
 peel+0.009
 Gard+0.008
 seeker+0.008
Neg logit contributions
immer-0.012
 unforgettable-0.012
imm-0.011
 newfound-0.011
 lifelong-0.011
Token on
Token ID319
Feature activation+7.51e-03

Pos logit contributions
iffs+0.015
 seeker+0.013
 peel+0.013
 tables+0.013
 slams+0.012
Neg logit contributions
 unforgettable-0.033
 lifetime-0.033
 thrilling-0.031
 newfound-0.031
 remarkable-0.030
Token an
Token ID281
Feature activation-4.75e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.010
b+0.010
imm+0.009
 thrilling+0.009
Neg logit contributions
iffs-0.011
 seeker-0.010
 court-0.010
 Gard-0.010
 slams-0.010
Token oven
Token ID14361
Feature activation+1.07e-02

Pos logit contributions
 rooms+0.006
 peel+0.006
 court+0.006
 tables+0.006
 ear+0.006
Neg logit contributions
 lifetime-0.007
 lifelong-0.007
imm-0.007
b-0.007
 newfound-0.006
Token m
Token ID285
Feature activation-5.36e-03

Pos logit contributions
 unforgettable+0.021
 newfound+0.021
 lifelong+0.020
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.009
 peel-0.009
 court-0.009
 seeker-0.009
 rooms-0.009
Tokenitt
Token ID715
Feature activation-7.71e-02

Pos logit contributions
 seeker+0.007
iffs+0.006
 court+0.006
 slams+0.006
 rooms+0.006
Neg logit contributions
 unforgettable-0.008
b-0.008
imm-0.008
immer-0.007
 newfound-0.007
Token and
Token ID290
Feature activation-9.01e-03

Pos logit contributions
iffs+0.086
 peel+0.085
 court+0.078
 tables+0.076
 rooms+0.074
Neg logit contributions
 unforgettable-0.139
 newfound-0.127
 thrilling-0.123
 lifelong-0.123
 remarkable-0.121
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
iffs+0.011
 rooms+0.011
 peel+0.011
 slams+0.011
 seeker+0.011
Neg logit contributions
 unforgettable-0.014
imm-0.013
 newfound-0.013
b-0.013
 thrilling-0.013
Token a
Token ID257
Feature activation+1.60e-02

Pos logit contributions
imm+0.033
 unforgettable+0.032
b+0.030
ittered+0.029
ully+0.029
Neg logit contributions
 peel-0.013
 kitchen-0.013
 sink-0.012
 refrigerator-0.012
 gard-0.011
Tokenpron
Token ID31186
Feature activation+4.16e-02

Pos logit contributions
imm+0.033
 unforgettable+0.032
b+0.029
ittered+0.029
ully+0.029
Neg logit contributions
 peel-0.015
 kitchen-0.014
 sink-0.014
 refrigerator-0.014
 gard-0.013
Token.
Token ID13
Feature activation-7.51e-03

Pos logit contributions
 unforgettable+0.091
 newfound+0.090
 lifelong+0.088
 thrilling+0.086
 remarkable+0.084
Neg logit contributions
 peel-0.034
 seeker-0.028
 sink-0.028
 court-0.028
 table-0.027
Token on
Token ID319
Feature activation-4.74e-04

Pos logit contributions
 unforgettable+0.009
imm+0.009
immer+0.008
 lifetime+0.008
 newfound+0.008
Neg logit contributions
 rooms-0.005
 tables-0.005
 court-0.004
 peel-0.004
 chefs-0.004
Token top
Token ID1353
Feature activation-3.42e-02

Pos logit contributions
 rooms+0.000
 court+0.000
 tables+0.000
 slams+0.000
 yourselves+0.000
Neg logit contributions
imm-0.001
b-0.001
 unforgettable-0.001
immer-0.001
 lifetime-0.001
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 peel+0.021
 rooms+0.020
 slide+0.019
iffs+0.019
 tables+0.019
Neg logit contributions
 unforgettable-0.077
 newfound-0.077
 lifelong-0.074
 brand-0.073
 lifetime-0.072
Token The
Token ID383
Feature activation-5.80e-05

Pos logit contributions
iffs+0.008
 court+0.008
 Enjoy+0.007
 She+0.007
 rooms+0.007
Neg logit contributions
 unforgettable-0.035
b-0.035
imm-0.035
 newfound-0.034
 remarkable-0.033
Token b
Token ID275
Feature activation-1.58e-02

Pos logit contributions
 court+0.000
 seeker+0.000
 peel+0.000
 rooms+0.000
 gard+0.000
Neg logit contributions
imm-0.000
 unforgettable-0.000
immer-0.000
ittered-0.000
b-0.000
Tokenunn
Token ID20935
Feature activation-7.69e-02

Pos logit contributions
iffs+0.019
 seeker+0.018
 Gard+0.018
 court+0.018
 slams+0.017
Neg logit contributions
 unforgettable-0.025
 newfound-0.024
b-0.023
 thrilling-0.023
imm-0.023
Tokenies
Token ID444
Feature activation+4.17e-03

Pos logit contributions
 Gard+0.103
 rooms+0.102
iffs+0.102
 gard+0.099
 court+0.097
Neg logit contributions
 unforgettable-0.116
 brand-0.110
 thrilling-0.107
 newfound-0.104
 remarkable-0.101
Token wanted
Token ID2227
Feature activation-1.23e-02

Pos logit contributions
 unforgettable+0.007
imm+0.007
 newfound+0.007
 brand+0.006
 thrilling+0.006
Neg logit contributions
 court-0.005
 peel-0.004
 slams-0.004
 rooms-0.004
 Gard-0.004
Token to
Token ID284
Feature activation-1.03e-02

Pos logit contributions
 tables+0.009
 rooms+0.009
 court+0.009
 Sara+0.008
 Ben+0.008
Neg logit contributions
imm-0.028
immer-0.028
b-0.025
ittered-0.025
 lifetime-0.025
Token have
Token ID423
Feature activation+1.79e-02

Pos logit contributions
 peel+0.010
 court+0.010
 seeker+0.009
 rooms+0.009
iffs+0.009
Neg logit contributions
 unforgettable-0.019
imm-0.018
b-0.018
 newfound-0.018
 lifetime-0.018
Token a
Token ID257
Feature activation-1.34e-02

Pos logit contributions
imm+0.040
immer+0.038
b+0.035
ittered+0.034
ully+0.033
Neg logit contributions
 rooms-0.017
 tables-0.015
 many-0.015
 slides-0.014
 court-0.013
Token vegetables
Token ID13701
Feature activation-2.64e-02

Pos logit contributions
imm+0.025
immer+0.022
ittered+0.022
chie+0.022
b+0.021
Neg logit contributions
 rooms-0.015
 tables-0.015
 peel-0.014
 kitchen-0.013
 court-0.013
Token!
Token ID0
Feature activation+1.20e-02

Pos logit contributions
 peel+0.026
 court+0.025
 sink+0.023
iffs+0.022
 seeker+0.022
Neg logit contributions
 unforgettable-0.050
 newfound-0.047
imm-0.046
 lifelong-0.045
b-0.045
Token All
Token ID1439
Feature activation-1.96e-02

Pos logit contributions
 unforgettable+0.025
b+0.024
 newfound+0.024
imm+0.024
 thrilling+0.023
Neg logit contributions
iffs-0.009
 court-0.009
 rooms-0.008
 Gard-0.008
 slams-0.008
Token the
Token ID262
Feature activation-3.02e-03

Pos logit contributions
 rooms+0.014
 slams+0.012
 tables+0.011
 Ben+0.010
 Mia+0.010
Neg logit contributions
immer-0.042
 lifetime-0.041
 unforgettable-0.040
 brand-0.038
 newfound-0.037
Token b
Token ID275
Feature activation-1.98e-02

Pos logit contributions
 rooms+0.003
 tables+0.003
 court+0.003
 Gard+0.003
 yog+0.003
Neg logit contributions
 unforgettable-0.005
imm-0.005
immer-0.005
b-0.005
 newfound-0.005
Tokenunn
Token ID20935
Feature activation-7.63e-02

Pos logit contributions
iffs+0.019
 seeker+0.017
 court+0.017
 Gard+0.017
 rooms+0.016
Neg logit contributions
 unforgettable-0.037
 newfound-0.035
b-0.035
imm-0.034
 thrilling-0.034
Tokenies
Token ID444
Feature activation-4.16e-03

Pos logit contributions
 Gard+0.108
iffs+0.107
 rooms+0.106
 court+0.102
 gard+0.101
Neg logit contributions
 unforgettable-0.107
 thrilling-0.096
 newfound-0.095
 brand-0.093
 remarkable-0.091
Token were
Token ID547
Feature activation-1.67e-02

Pos logit contributions
 peel+0.004
 court+0.004
 gard+0.004
 rooms+0.004
 tables+0.004
Neg logit contributions
 unforgettable-0.007
imm-0.006
 lifelong-0.006
 thrilling-0.006
 newfound-0.006
Token over
Token ID625
Feature activation+1.40e-02

Pos logit contributions
 rooms+0.018
 chefs+0.016
 tables+0.016
 gard+0.016
 peel+0.015
Neg logit contributions
imm-0.029
 unforgettable-0.028
 lifetime-0.028
b-0.027
ch-0.027
Tokenjoy
Token ID2633
Feature activation+5.97e-03

Pos logit contributions
imm+0.029
 unforgettable+0.028
b+0.028
immer+0.027
 lifetime+0.026
Neg logit contributions
 court-0.010
 rooms-0.010
 tables-0.010
 seeker-0.010
 chefs-0.009
Tokened
Token ID276
Feature activation-6.76e-04

Pos logit contributions
 brand+0.014
 unforgettable+0.013
 newfound+0.013
 remarkable+0.013
 thrilling+0.012
Neg logit contributions
iffs-0.004
ud-0.004
other-0.004
itt-0.003
ives-0.003
Token
Token ID220
Feature activation+6.49e-03

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
b+0.020
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.010
 seeker-0.009
 court-0.009
 Gard-0.009
 rooms-0.009
Token
Token ID198
Feature activation+1.36e-02

Pos logit contributions
 unforgettable+0.013
 newfound+0.012
b+0.012
 thrilling+0.012
imm+0.012
Neg logit contributions
iffs-0.006
 court-0.005
 Gard-0.005
 rooms-0.005
 slams-0.005
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
 unforgettable+0.025
 newfound+0.024
imm+0.023
b+0.023
 thrilling+0.023
Neg logit contributions
iffs-0.014
 court-0.012
 seeker-0.012
 Gard-0.012
 rooms-0.012
TokenThe
Token ID464
Feature activation+3.44e-03

Pos logit contributions
 unforgettable+0.023
imm+0.022
 newfound+0.022
 brand+0.021
 happened+0.020
Neg logit contributions
iffs-0.016
 court-0.015
 peel-0.014
 Gard-0.014
 seeker-0.014
Token b
Token ID275
Feature activation-1.69e-02

Pos logit contributions
imm+0.007
 unforgettable+0.007
 thrilling+0.006
b+0.006
immer+0.006
Neg logit contributions
 gard-0.003
 rooms-0.003
 seeker-0.003
 Gard-0.003
 yog-0.003
Tokenunn
Token ID20935
Feature activation-7.63e-02

Pos logit contributions
iffs+0.014
 Gard+0.012
 seeker+0.011
 rooms+0.011
 court+0.011
Neg logit contributions
 unforgettable-0.034
 newfound-0.033
 thrilling-0.032
 brand-0.032
b-0.032
Tokenies
Token ID444
Feature activation+1.13e-03

Pos logit contributions
 Gard+0.114
iffs+0.112
 gard+0.109
 rooms+0.108
 seeker+0.106
Neg logit contributions
 unforgettable-0.105
 thrilling-0.096
 brand-0.093
 newfound-0.091
 remarkable-0.088
Token happily
Token ID18177
Feature activation+5.08e-03

Pos logit contributions
 unforgettable+0.002
imm+0.002
 thrilling+0.002
 brand+0.002
 lifelong+0.002
Neg logit contributions
 court-0.001
 peel-0.001
 gard-0.001
 Gard-0.001
 rooms-0.001
Token hopped
Token ID45230
Feature activation-8.26e-03

Pos logit contributions
 unforgettable+0.008
 newfound+0.008
 brand+0.008
 thrilling+0.008
 lifelong+0.008
Neg logit contributions
iffs-0.006
 peel-0.006
 court-0.005
 gard-0.005
 seeker-0.005
Token back
Token ID736
Feature activation+5.39e-03

Pos logit contributions
iffs+0.006
 court+0.006
 seeker+0.005
 tables+0.005
 peel+0.005
Neg logit contributions
 unforgettable-0.018
imm-0.017
 lifetime-0.017
 happened-0.016
 beating-0.016
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
 unforgettable+0.010
imm+0.009
 newfound+0.009
immer+0.009
b+0.009
Neg logit contributions
iffs-0.006
 court-0.005
 rooms-0.005
 slams-0.005
 seeker-0.005
Token for
Token ID329
Feature activation+9.13e-03

Pos logit contributions
 unforgettable+0.033
 remarkable+0.033
 happened+0.033
b+0.033
 thrilling+0.032
Neg logit contributions
 slide-0.010
 peel-0.010
 Lee-0.010
 seeker-0.010
 court-0.009
Token bright
Token ID6016
Feature activation-1.96e-02

Pos logit contributions
imm+0.017
b+0.016
 unforgettable+0.016
immer+0.016
 brand+0.015
Neg logit contributions
 court-0.009
 seeker-0.008
 Enjoy-0.008
 tables-0.008
 peel-0.008
Tokenening
Token ID3101
Feature activation-6.47e-03

Pos logit contributions
 rooms+0.018
 court+0.017
 tables+0.017
 slide+0.016
 yog+0.016
Neg logit contributions
b-0.036
 lifetime-0.036
immer-0.035
imm-0.035
 unforgettable-0.034
Token up
Token ID510
Feature activation+9.17e-03

Pos logit contributions
 court+0.005
 rooms+0.005
 tables+0.005
 slide+0.004
 slams+0.004
Neg logit contributions
immer-0.013
 newfound-0.012
imm-0.012
 unforgettable-0.012
b-0.012
Token his
Token ID465
Feature activation-1.08e-02

Pos logit contributions
 unforgettable+0.020
b+0.020
imm+0.020
immer+0.020
 brand+0.019
Neg logit contributions
 court-0.004
 seeker-0.003
 peel-0.003
iffs-0.003
 rooms-0.002
Token day
Token ID1110
Feature activation+5.18e-02

Pos logit contributions
 ear+0.010
 court+0.010
 peel+0.010
 slide+0.009
 rooms+0.009
Neg logit contributions
imm-0.020
pr-0.020
b-0.020
immer-0.020
 stre-0.019
Token.
Token ID13
Feature activation+8.85e-03

Pos logit contributions
imm+0.115
b+0.113
 unforgettable+0.112
immer+0.109
pr+0.108
Neg logit contributions
 court-0.034
 peel-0.024
 tables-0.023
 kitchen-0.018
 rooms-0.018
Token<|endoftext|>
Token ID50256
Feature activation-1.22e-02

Pos logit contributions
 unforgettable+0.015
 newfound+0.014
b+0.014
imm+0.013
 thrilling+0.013
Neg logit contributions
iffs-0.010
 seeker-0.010
 slams-0.009
 court-0.009
 peel-0.009
TokenOnce
Token ID7454
Feature activation-9.74e-03

Pos logit contributions
iffs+0.012
 court+0.011
 seeker+0.011
 rooms+0.010
 Gard+0.010
Neg logit contributions
 unforgettable-0.024
 newfound-0.022
b-0.022
imm-0.022
 thrilling-0.022
Token there
Token ID612
Feature activation-5.53e-03

Pos logit contributions
iffs+0.014
 seeker+0.013
 court+0.013
 rooms+0.013
 slams+0.013
Neg logit contributions
 unforgettable-0.014
imm-0.012
 newfound-0.012
b-0.012
 lifetime-0.012
Token was
Token ID373
Feature activation+1.56e-02

Pos logit contributions
 rooms+0.005
 slide+0.005
 Sara+0.005
 But+0.005
 seeker+0.005
Neg logit contributions
 unforgettable-0.010
 lifetime-0.009
immer-0.009
 coll-0.009
 beating-0.009
Token sky
Token ID6766
Feature activation-6.86e-03

Pos logit contributions
b+0.024
 unforgettable+0.024
imm+0.023
immer+0.023
 brand+0.023
Neg logit contributions
 rooms-0.017
 court-0.017
 slide-0.016
 tables-0.016
 seeker-0.016
Token and
Token ID290
Feature activation+8.18e-03

Pos logit contributions
 all+0.002
 sink+0.002
 and+0.002
 slide+0.001
 peel+0.001
Neg logit contributions
b-0.016
 unforgettable-0.015
chie-0.015
immer-0.015
 lifetime-0.014
Token they
Token ID484
Feature activation+9.59e-03

Pos logit contributions
immer+0.016
imm+0.015
b+0.014
pr+0.014
chie+0.013
Neg logit contributions
 rooms-0.008
 tables-0.007
 slide-0.007
 court-0.007
 eats-0.007
Token spent
Token ID3377
Feature activation-1.13e-04

Pos logit contributions
immer+0.016
 brand+0.016
imm+0.016
 unforgettable+0.016
 thrilling+0.015
Neg logit contributions
 all-0.009
 gard-0.009
 seeker-0.008
 slide-0.008
 ear-0.008
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 rooms+0.000
 court+0.000
iffs+0.000
 seeker+0.000
 tables+0.000
Neg logit contributions
 unforgettable-0.000
immer-0.000
b-0.000
imm-0.000
 brand-0.000
Token day
Token ID1110
Feature activation+5.53e-02

Pos logit contributions
 unforgettable+0.020
imm+0.020
b+0.020
 brand+0.019
 newfound+0.019
Neg logit contributions
 seeker-0.014
iffs-0.014
 court-0.013
 rooms-0.013
 peel-0.013
Token playing
Token ID2712
Feature activation+2.77e-02

Pos logit contributions
 unforgettable+0.082
imm+0.076
 newfound+0.074
 brand+0.072
b+0.072
Neg logit contributions
 court-0.069
 rooms-0.067
iffs-0.067
 seeker-0.064
 tables-0.064
Token tag
Token ID7621
Feature activation+4.88e-03

Pos logit contributions
 unforgettable+0.050
b+0.048
 newfound+0.047
imm+0.047
immer+0.046
Neg logit contributions
 court-0.027
iffs-0.026
 seeker-0.026
 rooms-0.026
 peel-0.025
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
 unforgettable+0.010
 newfound+0.010
imm+0.010
b+0.010
immer+0.009
Neg logit contributions
 court-0.003
 rooms-0.003
 tables-0.003
iffs-0.003
 peel-0.003
Token laughing
Token ID14376
Feature activation+3.80e-02

Pos logit contributions
 unforgettable+0.031
imm+0.031
immer+0.031
b+0.029
 newfound+0.029
Neg logit contributions
 court-0.018
 rooms-0.018
 seeker-0.017
iffs-0.016
 Gard-0.016
Token in
Token ID287
Feature activation+1.35e-02

Pos logit contributions
 unforgettable+0.076
b+0.076
immer+0.074
imm+0.073
 newfound+0.073
Neg logit contributions
 rooms-0.026
 court-0.025
 tables-0.024
 seeker-0.023
iffs-0.023
Token safe
Token ID3338
Feature activation-6.35e-03

Pos logit contributions
 kitchen+0.009
 ear+0.009
 rooms+0.009
 court+0.008
 peel+0.008
Neg logit contributions
imm-0.022
b-0.019
immer-0.019
ittered-0.019
 perched-0.017
Token journey
Token ID7002
Feature activation-1.23e-03

Pos logit contributions
 rooms+0.005
 court+0.004
 ear+0.004
 seeker+0.004
 slams+0.004
Neg logit contributions
 unforgettable-0.013
b-0.013
imm-0.013
 thrilling-0.013
 remarkable-0.012
Token and
Token ID290
Feature activation+4.58e-03

Pos logit contributions
 court+0.001
 slams+0.000
 rooms+0.000
iffs+0.000
 tables+0.000
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
imm-0.003
b-0.003
 lifelong-0.003
Token for
Token ID329
Feature activation+2.49e-02

Pos logit contributions
imm+0.007
immer+0.007
b+0.007
chie+0.007
 unforgettable+0.007
Neg logit contributions
 slams-0.005
 court-0.005
 seeker-0.005
 rooms-0.005
 tables-0.005
Token their
Token ID511
Feature activation-8.73e-03

Pos logit contributions
imm+0.046
b+0.044
immer+0.042
 unforgettable+0.041
ittered+0.038
Neg logit contributions
 court-0.025
 rooms-0.025
 seeker-0.024
 Enjoy-0.024
 tables-0.023
Token day
Token ID1110
Feature activation+5.65e-02

Pos logit contributions
 rooms+0.009
 ear+0.009
 kitchen+0.009
 court+0.008
 seeker+0.008
Neg logit contributions
imm-0.018
immer-0.016
b-0.016
ittered-0.015
pr-0.014
Token together
Token ID1978
Feature activation+9.41e-03

Pos logit contributions
imm+0.127
b+0.124
 unforgettable+0.122
immer+0.116
 thrilling+0.115
Neg logit contributions
 court-0.031
 tables-0.029
 rooms-0.028
 peel-0.021
 kitchen-0.020
Token.
Token ID13
Feature activation+7.34e-03

Pos logit contributions
 unforgettable+0.023
b+0.023
imm+0.023
chie+0.022
 newfound+0.022
Neg logit contributions
 court-0.004
 peel-0.003
 rooms-0.002
 tables-0.002
 Ben-0.002
Token When
Token ID1649
Feature activation+9.31e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.011
 brand+0.010
 lifelong+0.010
 thrilling+0.010
Neg logit contributions
 seeker-0.009
 peel-0.008
iffs-0.008
 slams-0.008
 Gard-0.008
Token they
Token ID484
Feature activation-1.57e-02

Pos logit contributions
 unforgettable+0.014
imm+0.014
b+0.013
 newfound+0.013
immer+0.013
Neg logit contributions
 seeker-0.012
 court-0.012
iffs-0.012
 rooms-0.012
 slams-0.012
Token were
Token ID547
Feature activation-7.24e-03

Pos logit contributions
iffs+0.023
 court+0.023
 seeker+0.022
 peel+0.022
 slams+0.022
Neg logit contributions
 unforgettable-0.021
b-0.019
imm-0.019
 newfound-0.019
 thrilling-0.018
Token special
Token ID2041
Feature activation+7.41e-03

Pos logit contributions
imm+0.021
ittered+0.019
 unforgettable+0.019
 happened+0.017
immer+0.017
Neg logit contributions
 court-0.012
 peel-0.012
 rooms-0.011
 slide-0.011
 kitchen-0.011
Token roof
Token ID9753
Feature activation-1.20e-03

Pos logit contributions
imm+0.014
 unforgettable+0.013
ittered+0.013
pr+0.012
med+0.012
Neg logit contributions
 court-0.008
 slide-0.008
 peel-0.007
 rooms-0.007
 sink-0.007
Token rack
Token ID19127
Feature activation+2.17e-02

Pos logit contributions
iffs+0.001
 peel+0.001
 rooms+0.001
 court+0.001
 tables+0.001
Neg logit contributions
 unforgettable-0.003
 newfound-0.003
 lifelong-0.002
 lifetime-0.002
 remarkable-0.002
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
 newfound+0.039
 unforgettable+0.039
 lifelong+0.036
 thrilling+0.035
 remarkable+0.035
Neg logit contributions
iffs-0.023
 court-0.020
 rooms-0.020
 peel-0.020
 tables-0.019
Token a
Token ID257
Feature activation+3.47e-02

Pos logit contributions
imm+0.019
immer+0.019
 unforgettable+0.018
 lifelong+0.017
 newfound+0.017
Neg logit contributions
 rooms-0.015
 tables-0.015
 court-0.014
 peel-0.013
 sink-0.012
Token super
Token ID2208
Feature activation+5.06e-02

Pos logit contributions
imm+0.070
ittered+0.062
 unforgettable+0.060
immer+0.058
 happened+0.056
Neg logit contributions
 kitchen-0.036
 court-0.036
 gard-0.035
 sink-0.035
 slide-0.035
Token bright
Token ID6016
Feature activation+9.40e-03

Pos logit contributions
imm+0.088
immer+0.082
 unforgettable+0.081
pr+0.080
 hovering+0.078
Neg logit contributions
 rooms-0.059
 slide-0.054
 sink-0.052
 peel-0.052
 court-0.050
Token head
Token ID1182
Feature activation+1.08e-02

Pos logit contributions
 unforgettable+0.016
imm+0.016
b+0.015
pr+0.015
 lifelong+0.015
Neg logit contributions
 rooms-0.010
 sink-0.009
 slide-0.009
 court-0.009
 peel-0.009
Tokenlight
Token ID2971
Feature activation+3.40e-02

Pos logit contributions
 unforgettable+0.024
 newfound+0.023
 lifelong+0.022
 thrilling+0.022
 remarkable+0.021
Neg logit contributions
 peel-0.007
 sink-0.006
 slide-0.005
 court-0.005
 seeker-0.005
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
 unforgettable+0.079
 newfound+0.077
 lifelong+0.077
 thrilling+0.075
 lifetime+0.073
Neg logit contributions
 sink-0.016
 peel-0.014
 table-0.011
 kitchen-0.011
 tables-0.011
Token He
Token ID679
Feature activation-5.48e-03

Pos logit contributions
 unforgettable+0.016
 newfound+0.015
b+0.015
imm+0.015
 thrilling+0.015
Neg logit contributions
iffs-0.008
 seeker-0.008
 court-0.008
 Gard-0.007
 rooms-0.007
Token began
Token ID2540
Feature activation-2.20e-02

Pos logit contributions
 unforgettable+0.017
 newfound+0.016
b+0.016
immer+0.016
 lifetime+0.016
Neg logit contributions
 rooms-0.011
 tables-0.010
 sink-0.008
 court-0.008
 slams-0.008
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
 rooms+0.016
 tables+0.014
 sink+0.013
 kitchen+0.012
 peel+0.012
Neg logit contributions
 unforgettable-0.047
imm-0.045
immer-0.044
 newfound-0.043
b-0.042
Token shine
Token ID18340
Feature activation+1.03e-02

Pos logit contributions
 peel+0.015
 sink+0.014
 tables+0.014
 seeker+0.014
 rooms+0.014
Neg logit contributions
 unforgettable-0.026
imm-0.025
b-0.025
 newfound-0.025
 lifetime-0.024
Token and
Token ID290
Feature activation+2.25e-02

Pos logit contributions
 unforgettable+0.020
 newfound+0.020
 lifetime+0.020
b+0.019
imm+0.019
Neg logit contributions
 rooms-0.007
 tables-0.007
 seeker-0.006
 sink-0.006
 slams-0.006
Token tw
Token ID665
Feature activation+1.06e-02

Pos logit contributions
imm+0.041
immer+0.039
b+0.038
 unforgettable+0.038
 newfound+0.036
Neg logit contributions
 rooms-0.024
 tables-0.024
 slams-0.022
 sink-0.021
 peel-0.020
Tokeninkle
Token ID19894
Feature activation+5.33e-02

Pos logit contributions
 unforgettable+0.011
 newfound+0.010
 lifetime+0.010
 thrilling+0.010
 lifelong+0.009
Neg logit contributions
iffs-0.018
 rooms-0.017
 slams-0.017
atters-0.017
 seeker-0.017
Token.
Token ID13
Feature activation+3.14e-03

Pos logit contributions
 unforgettable+0.111
b+0.106
 lifetime+0.105
 newfound+0.103
 thrilling+0.102
Neg logit contributions
 tables-0.035
 rooms-0.034
 gard-0.029
 peel-0.028
 chefs-0.026
Token Lily
Token ID20037
Feature activation+8.96e-03

Pos logit contributions
 unforgettable+0.007
b+0.007
 newfound+0.006
imm+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.002
 court-0.002
 rooms-0.002
 seeker-0.002
 slams-0.002
Token and
Token ID290
Feature activation-9.44e-03

Pos logit contributions
 unforgettable+0.015
 newfound+0.014
 lifelong+0.014
immer+0.014
imm+0.013
Neg logit contributions
 rooms-0.010
 slams-0.010
 peel-0.010
 seeker-0.009
 tables-0.009
Token her
Token ID607
Feature activation+9.48e-03

Pos logit contributions
iffs+0.011
 seeker+0.010
 court+0.010
 slams+0.010
 Gard+0.010
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
imm-0.015
 thrilling-0.015
Token friend
Token ID1545
Feature activation+1.51e-02

Pos logit contributions
immer+0.020
b+0.020
imm+0.020
 unforgettable+0.019
pr+0.018
Neg logit contributions
 rooms-0.009
 seeker-0.008
 ear-0.007
 court-0.007
 kitchen-0.007
Token.
Token ID13
Feature activation+2.63e-03

Pos logit contributions
 court+0.009
 rooms+0.008
 slams+0.008
 peel+0.008
iffs+0.007
Neg logit contributions
imm-0.023
 unforgettable-0.023
 newfound-0.022
 lifelong-0.022
b-0.022
Token Just
Token ID2329
Feature activation+3.50e-04

Pos logit contributions
 unforgettable+0.004
 happened+0.004
 responded+0.004
 replied+0.004
 thrilling+0.004
Neg logit contributions
 seeker-0.003
iffs-0.003
 rooms-0.002
 slams-0.002
 peel-0.002
Token stay
Token ID2652
Feature activation-8.55e-03

Pos logit contributions
 lifelong+0.001
 lifetime+0.001
 unforgettable+0.001
imm+0.001
 brand+0.001
Neg logit contributions
 peel-0.000
 seeker-0.000
 ear-0.000
 court-0.000
 gard-0.000
Token still
Token ID991
Feature activation+6.70e-03

Pos logit contributions
 seeker+0.009
iffs+0.008
 slams+0.008
 rooms+0.008
 court+0.007
Neg logit contributions
 unforgettable-0.016
imm-0.015
 newfound-0.015
 lifetime-0.015
 happened-0.014
Token and
Token ID290
Feature activation+6.31e-03

Pos logit contributions
imm+0.014
 unforgettable+0.014
b+0.014
 newfound+0.013
 brand+0.013
Neg logit contributions
 seeker-0.005
 court-0.005
 peel-0.004
 slams-0.004
iffs-0.004
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
 unforgettable+0.010
imm+0.009
 newfound+0.009
immer+0.009
b+0.009
Neg logit contributions
 slams-0.008
 court-0.008
 seeker-0.007
iffs-0.007
 peel-0.007
Token dog
Token ID3290
Feature activation-9.55e-04

Pos logit contributions
 unforgettable+0.026
imm+0.025
b+0.025
 newfound+0.025
immer+0.024
Neg logit contributions
 seeker-0.012
iffs-0.012
 court-0.012
 peel-0.011
 rooms-0.011
Token will
Token ID481
Feature activation-3.80e-03

Pos logit contributions
 court+0.001
 slams+0.001
 peel+0.001
 rooms+0.001
 seeker+0.001
Neg logit contributions
 unforgettable-0.002
 newfound-0.002
immer-0.002
 brand-0.002
imm-0.002
Token go
Token ID467
Feature activation+3.73e-03

Pos logit contributions
 peel+0.004
 seeker+0.003
 court+0.003
 gard+0.003
 sink+0.003
Neg logit contributions
 unforgettable-0.007
imm-0.007
b-0.007
 brand-0.007
 lifetime-0.007
Token away
Token ID1497
Feature activation-8.37e-03

Pos logit contributions
 unforgettable+0.007
imm+0.006
 newfound+0.006
 happened+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.004
 court-0.004
 seeker-0.004
 peel-0.004
 slams-0.004
Token.
Token ID13
Feature activation+6.02e-03

Pos logit contributions
 seeker+0.006
 court+0.006
iffs+0.006
 rooms+0.006
 peel+0.005
Neg logit contributions
 unforgettable-0.017
imm-0.017
 newfound-0.017
b-0.016
immer-0.016
Token floated
Token ID29742
Feature activation+1.60e-02

Pos logit contributions
 unforgettable+0.024
 newfound+0.023
 lifelong+0.022
 thrilling+0.022
immer+0.022
Neg logit contributions
 peel-0.020
 tables-0.019
 sink-0.019
 court-0.018
 rooms-0.018
Token and
Token ID290
Feature activation+1.90e-02

Pos logit contributions
 newfound+0.037
 unforgettable+0.037
 lifetime+0.037
 lifelong+0.034
inations+0.034
Neg logit contributions
 tables-0.007
 sink-0.007
iffs-0.007
 peel-0.006
 refrigerator-0.005
Token bob
Token ID29202
Feature activation+2.28e-02

Pos logit contributions
imm+0.032
 unforgettable+0.032
immer+0.031
 newfound+0.031
 lifelong+0.030
Neg logit contributions
 tables-0.022
 peel-0.021
 rooms-0.021
 sink-0.020
iffs-0.019
Tokenbed
Token ID3077
Feature activation-6.79e-03

Pos logit contributions
 unforgettable+0.037
 newfound+0.035
 thrilling+0.034
 lifetime+0.033
 lifelong+0.032
Neg logit contributions
iffs-0.028
 seeker-0.026
 peel-0.025
 court-0.024
 yourselves-0.024
Token on
Token ID319
Feature activation+2.63e-02

Pos logit contributions
 tables+0.005
 peel+0.005
iffs+0.005
 rooms+0.005
 sink+0.004
Neg logit contributions
 unforgettable-0.014
 lifetime-0.013
 newfound-0.013
imm-0.013
 thrilling-0.013
Token the
Token ID262
Feature activation+9.12e-03

Pos logit contributions
 unforgettable+0.046
imm+0.045
b+0.044
 newfound+0.043
 lifetime+0.042
Neg logit contributions
 peel-0.028
iffs-0.028
 tables-0.028
 rooms-0.027
 court-0.026
Token water
Token ID1660
Feature activation-4.81e-03

Pos logit contributions
 unforgettable+0.019
 newfound+0.018
b+0.018
 thrilling+0.017
 brand+0.017
Neg logit contributions
iffs-0.007
 seeker-0.007
 slams-0.006
 Gard-0.006
 court-0.006
Token.
Token ID13
Feature activation+3.37e-03

Pos logit contributions
 peel+0.004
iffs+0.004
 court+0.004
 rooms+0.004
 tables+0.004
Neg logit contributions
 unforgettable-0.010
 newfound-0.009
 thrilling-0.009
 lifelong-0.009
imm-0.009
Token They
Token ID1119
Feature activation-8.25e-03

Pos logit contributions
 unforgettable+0.007
 newfound+0.006
b+0.006
imm+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.003
 seeker-0.003
 Gard-0.002
 court-0.002
 slams-0.002
Token touched
Token ID12615
Feature activation+1.50e-03

Pos logit contributions
 seeker+0.011
 peel+0.010
 rooms+0.010
 court+0.010
 tables+0.010
Neg logit contributions
 unforgettable-0.012
imm-0.012
 newfound-0.011
 lifelong-0.011
immer-0.011
Token the
Token ID262
Feature activation-5.08e-03

Pos logit contributions
 unforgettable+0.003
 lifetime+0.003
imm+0.003
 newfound+0.003
immer+0.002
Neg logit contributions
iffs-0.002
 tables-0.001
 peel-0.001
 rooms-0.001
 seeker-0.001
Token liked
Token ID8288
Feature activation+8.90e-04

Pos logit contributions
 unforgettable+0.007
 newfound+0.007
imm+0.007
b+0.007
immer+0.007
Neg logit contributions
iffs-0.007
 seeker-0.007
 slams-0.007
 court-0.006
 rooms-0.006
Token to
Token ID284
Feature activation+5.31e-03

Pos logit contributions
 unforgettable+0.002
immer+0.001
imm+0.001
 newfound+0.001
b+0.001
Neg logit contributions
 rooms-0.001
 court-0.001
iffs-0.001
 seeker-0.001
 tables-0.001
Token play
Token ID711
Feature activation+2.23e-02

Pos logit contributions
 unforgettable+0.009
 newfound+0.009
b+0.009
imm+0.009
 thrilling+0.008
Neg logit contributions
 seeker-0.006
iffs-0.005
 court-0.005
 peel-0.005
 rooms-0.005
Token make
Token ID787
Feature activation+1.52e-02

Pos logit contributions
 unforgettable+0.031
imm+0.030
b+0.030
 newfound+0.030
 brand+0.029
Neg logit contributions
iffs-0.030
 Gard-0.029
 slams-0.027
 seeker-0.027
 court-0.026
Token-
Token ID12
Feature activation+3.59e-02

Pos logit contributions
 unforgettable+0.022
 newfound+0.021
b+0.021
imm+0.020
 thrilling+0.020
Neg logit contributions
iffs-0.020
 seeker-0.019
 court-0.019
 slams-0.019
 Gard-0.019
Tokenbel
Token ID6667
Feature activation+4.93e-02

Pos logit contributions
 unforgettable+0.038
 newfound+0.036
imm+0.033
 thrilling+0.031
 brand+0.031
Neg logit contributions
iffs-0.064
 peel-0.061
 court-0.060
 rooms-0.059
 seeker-0.058
Tokenieve
Token ID12311
Feature activation+4.36e-03

Pos logit contributions
 unforgettable+0.061
 newfound+0.056
imm+0.055
 lifetime+0.053
 thrilling+0.053
Neg logit contributions
iffs-0.075
 rooms-0.073
 court-0.073
 slams-0.071
 peel-0.071
Token.
Token ID13
Feature activation+1.42e-03

Pos logit contributions
 unforgettable+0.008
 newfound+0.007
b+0.007
imm+0.007
 thrilling+0.007
Neg logit contributions
iffs-0.004
 rooms-0.004
 seeker-0.004
 court-0.004
 slams-0.004
Token
Token ID198
Feature activation+4.01e-03

Pos logit contributions
 unforgettable+0.002
 newfound+0.002
 belonged+0.001
 lifelong+0.001
 thrilling+0.001
Neg logit contributions
 seeker-0.002
iffs-0.002
 arrest-0.002
 slams-0.002
 Gard-0.002
Token
Token ID198
Feature activation+4.33e-03

Pos logit contributions
 unforgettable+0.007
 newfound+0.006
b+0.006
imm+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.005
 seeker-0.004
 court-0.004
 slams-0.004
 Gard-0.004
TokenOne
Token ID3198
Feature activation-3.16e-03

Pos logit contributions
 unforgettable+0.004
 thrilling+0.004
 newfound+0.004
 remarkable+0.004
 lifetime+0.004
Neg logit contributions
iffs-0.008
 slams-0.008
 seeker-0.008
 Gard-0.007
 peel-0.007
Token
Token ID198
Feature activation+8.53e-03

Pos logit contributions
 unforgettable+0.017
 newfound+0.016
imm+0.016
b+0.016
 thrilling+0.015
Neg logit contributions
iffs-0.009
 court-0.008
 seeker-0.008
 rooms-0.008
 slams-0.008
TokenSo
Token ID2396
Feature activation+1.31e-02

Pos logit contributions
 unforgettable+0.013
 newfound+0.012
b+0.012
imm+0.012
 thrilling+0.012
Neg logit contributions
iffs-0.011
 seeker-0.011
 court-0.010
 slams-0.010
 Gard-0.010
Token,
Token ID11
Feature activation-1.40e-04

Pos logit contributions
 unforgettable+0.019
 newfound+0.019
b+0.018
imm+0.017
 thrilling+0.017
Neg logit contributions
iffs-0.017
 seeker-0.016
 court-0.016
 Gard-0.016
 rooms-0.016
Token Lily
Token ID20037
Feature activation+1.01e-02

Pos logit contributions
iffs+0.000
 seeker+0.000
 court+0.000
 slams+0.000
 peel+0.000
Neg logit contributions
 unforgettable-0.000
imm-0.000
b-0.000
 thrilling-0.000
 newfound-0.000
Token went
Token ID1816
Feature activation-9.68e-03

Pos logit contributions
 unforgettable+0.016
 newfound+0.015
 thrilling+0.014
imm+0.014
b+0.014
Neg logit contributions
 court-0.012
 peel-0.012
iffs-0.012
 seeker-0.012
 slams-0.012
Token to
Token ID284
Feature activation+4.25e-03

Pos logit contributions
 Gard+0.014
iffs+0.014
 seeker+0.014
 yourselves+0.014
 rooms+0.014
Neg logit contributions
b-0.011
 unforgettable-0.011
 newfound-0.011
 brand-0.010
 thrilling-0.010
Token find
Token ID1064
Feature activation+5.23e-04

Pos logit contributions
 unforgettable+0.006
 newfound+0.006
b+0.005
 thrilling+0.005
 brand+0.005
Neg logit contributions
iffs-0.006
 seeker-0.006
 Gard-0.005
 slams-0.005
 rooms-0.005
Token her
Token ID607
Feature activation-1.02e-02

Pos logit contributions
 unforgettable+0.001
imm+0.001
b+0.001
 lifetime+0.001
 newfound+0.001
Neg logit contributions
iffs-0.001
 seeker-0.001
 rooms-0.001
 court-0.001
 slams-0.001
Token mom
Token ID1995
Feature activation+1.48e-02

Pos logit contributions
 seeker+0.009
 court+0.008
 rooms+0.008
 slams+0.007
 peel+0.007
Neg logit contributions
imm-0.022
immer-0.021
 unforgettable-0.021
b-0.021
chie-0.020
Token and
Token ID290
Feature activation+5.76e-03

Pos logit contributions
 unforgettable+0.035
imm+0.034
 thrilling+0.033
chie+0.033
 newfound+0.032
Neg logit contributions
 court-0.008
 peel-0.008
 kitchen-0.008
 sink-0.005
 eat-0.005
Token told
Token ID1297
Feature activation+6.02e-03

Pos logit contributions
 unforgettable+0.010
imm+0.010
 thrilling+0.009
immer+0.009
b+0.009
Neg logit contributions
 court-0.007
 rooms-0.006
 gard-0.006
 Gard-0.006
 seeker-0.006
Token have
Token ID423
Feature activation+2.46e-02

Pos logit contributions
 unforgettable+0.016
b+0.015
imm+0.015
 newfound+0.015
 thrilling+0.014
Neg logit contributions
 seeker-0.009
iffs-0.009
 court-0.008
 peel-0.008
 slams-0.008
Token spent
Token ID3377
Feature activation-1.00e-02

Pos logit contributions
b+0.045
immer+0.045
imm+0.042
chie+0.041
 newfound+0.041
Neg logit contributions
 court-0.023
 rooms-0.022
 Ben-0.022
 chefs-0.022
 Tom-0.021
Token time
Token ID640
Feature activation+1.45e-02

Pos logit contributions
iffs+0.012
 seeker+0.012
 court+0.012
 rooms+0.012
 slams+0.011
Neg logit contributions
 unforgettable-0.016
b-0.015
 newfound-0.015
imm-0.015
 thrilling-0.014
Token together
Token ID1978
Feature activation-8.96e-03

Pos logit contributions
 unforgettable+0.028
imm+0.027
 newfound+0.026
 remarkable+0.026
 thrilling+0.026
Neg logit contributions
 court-0.013
iffs-0.012
 peel-0.011
 seeker-0.011
 rooms-0.011
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
 court+0.007
 peel+0.007
 rooms+0.007
 seeker+0.006
iffs+0.006
Neg logit contributions
 unforgettable-0.018
imm-0.018
b-0.017
 newfound-0.017
 thrilling-0.017
Token The
Token ID383
Feature activation+6.72e-03

Pos logit contributions
 unforgettable+0.014
 newfound+0.013
 thrilling+0.013
b+0.012
 brand+0.012
Neg logit contributions
iffs-0.010
 seeker-0.010
 slams-0.010
 Gard-0.010
 court-0.009
Token end
Token ID886
Feature activation+1.52e-02

Pos logit contributions
imm+0.012
 unforgettable+0.012
b+0.012
immer+0.011
 newfound+0.011
Neg logit contributions
 seeker-0.007
 court-0.007
 peel-0.007
 rooms-0.007
iffs-0.006
Token.
Token ID13
Feature activation+8.15e-03

Pos logit contributions
 unforgettable+0.032
 newfound+0.031
b+0.031
imm+0.030
 lifelong+0.030
Neg logit contributions
iffs-0.011
 slams-0.008
 peel-0.008
 rooms-0.008
 court-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.60e-02

Pos logit contributions
 unforgettable+0.014
b+0.014
 newfound+0.013
imm+0.013
 thrilling+0.013
Neg logit contributions
iffs-0.009
 seeker-0.008
 court-0.008
 Gard-0.008
 slams-0.008
TokenOnce
Token ID7454
Feature activation-1.15e-02

Pos logit contributions
iffs+0.019
 seeker+0.018
 slams+0.018
 Gard+0.018
 court+0.017
Neg logit contributions
 unforgettable-0.025
 newfound-0.024
b-0.024
imm-0.023
 thrilling-0.023
Token upon
Token ID2402
Feature activation-2.85e-03

Pos logit contributions
iffs+0.017
 seeker+0.016
 rooms+0.016
 court+0.016
 slams+0.015
Neg logit contributions
 unforgettable-0.016
imm-0.014
b-0.014
 newfound-0.014
 thrilling-0.014
Token Daisy
Token ID40355
Feature activation+8.15e-03

Pos logit contributions
imm+0.019
b+0.019
 lifetime+0.019
 unforgettable+0.019
 newfound+0.019
Neg logit contributions
iffs-0.009
 court-0.008
 peel-0.008
 yourselves-0.008
 tables-0.008
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
 unforgettable+0.017
imm+0.017
 newfound+0.017
b+0.017
 lifetime+0.016
Neg logit contributions
 seeker-0.007
iffs-0.007
 court-0.006
 peel-0.006
 kitchen-0.005
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
imm+0.036
b+0.034
 unforgettable+0.033
 lifetime+0.032
 remarkable+0.032
Neg logit contributions
 court-0.015
 peel-0.013
iffs-0.013
 seeker-0.013
 eat-0.013
Token farm
Token ID5318
Feature activation-1.54e-02

Pos logit contributions
imm+0.028
 unforgettable+0.026
b+0.026
immer+0.025
ittered+0.025
Neg logit contributions
 rooms-0.014
 seeker-0.014
 court-0.014
 gard-0.013
 Gard-0.013
Token,
Token ID11
Feature activation+1.53e-03

Pos logit contributions
 court+0.012
 seeker+0.011
iffs+0.010
 peel+0.009
 sink+0.008
Neg logit contributions
 unforgettable-0.034
 newfound-0.033
b-0.033
imm-0.033
 thrilling-0.032
Token mom
Token ID1995
Feature activation-3.01e-03

Pos logit contributions
b+0.003
imm+0.003
 lifetime+0.003
 unforgettable+0.003
ittered+0.003
Neg logit contributions
 seeker-0.001
 Ben-0.001
 court-0.001
 peel-0.001
iffs-0.001
Token?"
Token ID1701
Feature activation-1.15e-02

Pos logit contributions
iffs+0.003
 seeker+0.003
 peel+0.002
 court+0.002
 rooms+0.002
Neg logit contributions
 unforgettable-0.006
 newfound-0.006
 thrilling-0.005
 remarkable-0.005
imm-0.005
Token Anna
Token ID11735
Feature activation+2.26e-02

Pos logit contributions
iffs+0.006
 seeker+0.005
 court+0.005
 slams+0.005
 rooms+0.005
Neg logit contributions
 unforgettable-0.026
 newfound-0.025
b-0.025
imm-0.025
 thrilling-0.025
Token asks
Token ID7893
Feature activation+6.39e-03

Pos logit contributions
 unforgettable+0.041
b+0.040
 newfound+0.039
imm+0.038
 happened+0.038
Neg logit contributions
iffs-0.021
 seeker-0.020
 court-0.018
 Gard-0.018
 rooms-0.017
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
 unforgettable+0.011
 newfound+0.011
b+0.010
imm+0.010
 thrilling+0.010
Neg logit contributions
iffs-0.007
 seeker-0.007
 court-0.006
 slams-0.006
 rooms-0.006
Token
Token ID198
Feature activation+2.18e-03

Pos logit contributions
 unforgettable+0.006
 newfound+0.006
b+0.006
 thrilling+0.006
imm+0.006
Neg logit contributions
iffs-0.005
 seeker-0.005
 slams-0.004
 court-0.004
 rooms-0.004
Token barking
Token ID48961
Feature activation+2.04e-02

Pos logit contributions
 seeker+0.009
 slams+0.009
iffs+0.008
 rooms+0.008
 peel+0.008
Neg logit contributions
 unforgettable-0.014
 lifetime-0.013
imm-0.013
 newfound-0.013
immer-0.013
Token and
Token ID290
Feature activation+1.18e-02

Pos logit contributions
 unforgettable+0.037
 newfound+0.035
b+0.034
imm+0.034
 thrilling+0.033
Neg logit contributions
iffs-0.021
 seeker-0.019
 court-0.019
 slams-0.019
 rooms-0.019
Token running
Token ID2491
Feature activation+1.95e-02

Pos logit contributions
 unforgettable+0.023
imm+0.022
immer+0.022
 newfound+0.021
 lifelong+0.021
Neg logit contributions
 peel-0.011
iffs-0.011
 slams-0.011
 seeker-0.010
 court-0.010
Token around
Token ID1088
Feature activation+3.57e-03

Pos logit contributions
 unforgettable+0.034
 newfound+0.032
imm+0.031
b+0.030
 lifetime+0.030
Neg logit contributions
iffs-0.022
 seeker-0.020
 court-0.020
 slams-0.019
 rooms-0.019
Token.
Token ID13
Feature activation-7.10e-04

Pos logit contributions
 unforgettable+0.007
imm+0.007
 newfound+0.007
b+0.007
immer+0.007
Neg logit contributions
 court-0.003
iffs-0.003
 seeker-0.003
 peel-0.003
 rooms-0.003
Token It
Token ID632
Feature activation-1.09e-03

Pos logit contributions
iffs+0.001
 seeker+0.001
 court+0.001
 slams+0.001
 Gard+0.001
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
b-0.001
imm-0.001
 thrilling-0.001
Token looked
Token ID3114
Feature activation-2.66e-04

Pos logit contributions
 bar+0.001
 slams+0.001
 peel+0.001
iffs+0.001
 rooms+0.001
Neg logit contributions
 unforgettable-0.002
immer-0.002
 lifetime-0.002
 newfound-0.002
imm-0.002
Token wild
Token ID4295
Feature activation+2.54e-02

Pos logit contributions
 seeker+0.000
 slams+0.000
iffs+0.000
 rooms+0.000
 Enjoy+0.000
Neg logit contributions
imm-0.000
 lifetime-0.000
immer-0.000
 unforgettable-0.000
b-0.000
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
 unforgettable+0.046
 newfound+0.045
b+0.043
imm+0.043
 thrilling+0.042
Neg logit contributions
iffs-0.025
 seeker-0.024
 rooms-0.023
 slams-0.023
 court-0.023
Token scary
Token ID14343
Feature activation+2.58e-02

Pos logit contributions
immer+0.023
imm+0.022
 unforgettable+0.021
 lifetime+0.020
 lifelong+0.020
Neg logit contributions
 slams-0.014
 rooms-0.013
 bar-0.013
 Ben-0.013
 peel-0.012
Token.
Token ID13
Feature activation-4.25e-03

Pos logit contributions
 newfound+0.055
 unforgettable+0.055
imm+0.052
 lifetime+0.052
 lifelong+0.051
Neg logit contributions
 rooms-0.018
iffs-0.017
 seeker-0.017
 slams-0.016
 court-0.016
TokenM
Token ID44
Feature activation+6.62e-04

Pos logit contributions
 unforgettable+0.024
imm+0.023
b+0.022
 newfound+0.022
 thrilling+0.022
Neg logit contributions
iffs-0.012
 Gard-0.011
 peel-0.011
 Enjoy-0.010
 gard-0.010
Tokenama
Token ID1689
Feature activation+2.07e-02

Pos logit contributions
 unforgettable+0.001
b+0.001
 newfound+0.001
 brand+0.001
 thrilling+0.001
Neg logit contributions
 Gard-0.001
iffs-0.001
 peel-0.001
ud-0.001
 gard-0.001
Token smiled
Token ID13541
Feature activation-1.27e-02

Pos logit contributions
 unforgettable+0.035
 newfound+0.034
 lifelong+0.034
 lifetime+0.033
 thrilling+0.033
Neg logit contributions
 peel-0.024
 gard-0.023
 Gard-0.022
 yog-0.022
 court-0.021
Token and
Token ID290
Feature activation-3.58e-03

Pos logit contributions
 gard+0.005
 She+0.004
iffs+0.004
 Gard+0.004
 ear+0.004
Neg logit contributions
b-0.029
 unforgettable-0.029
 lifetime-0.028
imm-0.028
 thrilling-0.027
Token said
Token ID531
Feature activation+3.27e-02

Pos logit contributions
 gard+0.005
 Gard+0.005
 yog+0.005
iffs+0.005
 peel+0.005
Neg logit contributions
 unforgettable-0.005
immer-0.005
 thrilling-0.005
 lifetime-0.005
imm-0.005
Token,
Token ID11
Feature activation+1.26e-03

Pos logit contributions
 unforgettable+0.064
b+0.061
 newfound+0.060
 thrilling+0.059
imm+0.059
Neg logit contributions
iffs-0.027
 court-0.025
 gard-0.025
 Gard-0.025
 Enjoy-0.024
Token "
Token ID366
Feature activation+1.58e-02

Pos logit contributions
 unforgettable+0.002
 newfound+0.002
b+0.002
imm+0.002
 thrilling+0.002
Neg logit contributions
iffs-0.001
 seeker-0.001
 court-0.001
 Gard-0.001
 peel-0.001
TokenThat
Token ID2504
Feature activation+3.78e-02

Pos logit contributions
 newfound+0.029
immer+0.029
 brand+0.028
 happened+0.028
imm+0.028
Neg logit contributions
iffs-0.014
 Enjoy-0.012
 Gard-0.011
 gard-0.011
 peel-0.010
Token's
Token ID338
Feature activation+1.42e-02

Pos logit contributions
 unforgettable+0.070
imm+0.069
 newfound+0.069
immer+0.069
b+0.067
Neg logit contributions
 rooms-0.033
 kitchen-0.032
 Gard-0.031
 peel-0.031
 tables-0.030
Token a
Token ID257
Feature activation+1.85e-02

Pos logit contributions
b+0.019
 newfound+0.019
 unforgettable+0.019
 lifetime+0.019
imm+0.018
Neg logit contributions
 seeker-0.019
 yourselves-0.017
 Enjoy-0.017
iffs-0.017
 Gard-0.017
Token great
Token ID1049
Feature activation+2.38e-02

Pos logit contributions
imm+0.023
 unforgettable+0.022
b+0.021
immer+0.021
 happened+0.021
Neg logit contributions
 seeker-0.027
 peel-0.027
 Gard-0.026
 gard-0.026
 court-0.026
Token,
Token ID11
Feature activation+7.30e-03

Pos logit contributions
 unforgettable+0.040
 newfound+0.037
b+0.037
 thrilling+0.036
imm+0.036
Neg logit contributions
iffs-0.024
 court-0.022
 seeker-0.022
 Gard-0.022
 peel-0.021
Token "
Token ID366
Feature activation+1.66e-02

Pos logit contributions
 unforgettable+0.014
 newfound+0.013
b+0.013
imm+0.013
 thrilling+0.013
Neg logit contributions
iffs-0.006
 court-0.006
 Gard-0.006
 peel-0.006
 gard-0.006
TokenI
Token ID40
Feature activation-1.07e-02

Pos logit contributions
 unforgettable+0.027
b+0.025
 newfound+0.025
imm+0.024
 thrilling+0.024
Neg logit contributions
iffs-0.020
 seeker-0.019
 slams-0.019
 rooms-0.019
 court-0.019
Token see
Token ID766
Feature activation+1.58e-02

Pos logit contributions
 seeker+0.012
iffs+0.012
 peel+0.011
 Gard+0.011
 court+0.011
Neg logit contributions
 unforgettable-0.018
 newfound-0.017
imm-0.017
b-0.017
 thrilling-0.017
Token something
Token ID1223
Feature activation+1.30e-02

Pos logit contributions
 unforgettable+0.027
imm+0.027
 newfound+0.026
immer+0.025
b+0.025
Neg logit contributions
 rooms-0.017
 tables-0.017
 yourselves-0.016
iffs-0.016
 court-0.016
Token healthy
Token ID5448
Feature activation-2.17e-03

Pos logit contributions
imm+0.020
 unforgettable+0.020
 newfound+0.019
immer+0.019
 lifetime+0.019
Neg logit contributions
 rooms-0.016
 peel-0.016
iffs-0.015
 court-0.015
 slams-0.015
Token and
Token ID290
Feature activation+2.66e-02

Pos logit contributions
 seeker+0.002
 court+0.002
iffs+0.002
 peel+0.002
 yourselves+0.002
Neg logit contributions
 newfound-0.004
 unforgettable-0.004
imm-0.004
 lifetime-0.004
 lifelong-0.004
Token delicious
Token ID12625
Feature activation+2.63e-02

Pos logit contributions
imm+0.039
 unforgettable+0.036
immer+0.036
b+0.035
 newfound+0.035
Neg logit contributions
iffs-0.034
 rooms-0.034
 peel-0.034
 court-0.033
 tables-0.033
Token!"
Token ID2474
Feature activation+2.01e-03

Pos logit contributions
 unforgettable+0.049
 newfound+0.048
imm+0.046
b+0.046
 lifelong+0.045
Neg logit contributions
iffs-0.026
 seeker-0.025
 peel-0.024
 court-0.023
 Gard-0.022
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
 unforgettable+0.004
 newfound+0.003
b+0.003
imm+0.003
 thrilling+0.003
Neg logit contributions
iffs-0.002
 court-0.002
 seeker-0.002
 Gard-0.002
 rooms-0.002
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
 unforgettable+0.032
 newfound+0.030
imm+0.030
b+0.029
 thrilling+0.029
Neg logit contributions
iffs-0.016
 court-0.014
 Gard-0.013
 rooms-0.013
 peel-0.013
Token it
Token ID340
Feature activation+6.74e-03

Pos logit contributions
iffs+0.018
 seeker+0.017
 Gard+0.017
 slams+0.017
 court+0.017
Neg logit contributions
 unforgettable-0.016
 newfound-0.015
b-0.015
 thrilling-0.014
imm-0.014
Token with
Token ID351
Feature activation-2.16e-02

Pos logit contributions
 unforgettable+0.010
 newfound+0.010
b+0.010
imm+0.010
 thrilling+0.010
Neg logit contributions
iffs-0.008
 seeker-0.008
 Gard-0.008
 court-0.008
 rooms-0.008
Token his
Token ID465
Feature activation-2.54e-02

Pos logit contributions
iffs+0.024
 peel+0.022
 seeker+0.022
 court+0.022
 rooms+0.022
Neg logit contributions
 unforgettable-0.037
imm-0.035
b-0.035
 lifetime-0.034
 newfound-0.034
Token foot
Token ID2366
Feature activation+6.19e-03

Pos logit contributions
 peel+0.018
 ear+0.017
 seeker+0.016
 court+0.015
iffs+0.014
Neg logit contributions
 unforgettable-0.057
imm-0.056
pr-0.054
immer-0.053
 lifetime-0.053
Token.
Token ID13
Feature activation-7.20e-03

Pos logit contributions
 unforgettable+0.013
 newfound+0.012
 lifelong+0.012
 lifetime+0.012
imm+0.012
Neg logit contributions
 peel-0.005
iffs-0.005
 slams-0.004
 court-0.004
 seeker-0.004
Token The
Token ID383
Feature activation-4.97e-03

Pos logit contributions
 seeker+0.009
 Gard+0.009
 court+0.008
 gard+0.008
 anth+0.008
Neg logit contributions
 unforgettable-0.009
 newfound-0.009
 brand-0.009
 thrilling-0.009
 happened-0.009
Token hut
Token ID40812
Feature activation-2.77e-02

Pos logit contributions
iffs+0.006
 slams+0.006
 seeker+0.006
 court+0.005
 Gard+0.005
Neg logit contributions
 unforgettable-0.008
 newfound-0.008
 thrilling-0.007
b-0.007
 brand-0.007
Token fell
Token ID3214
Feature activation-2.59e-02

Pos logit contributions
 peel+0.040
 slams+0.038
 rooms+0.038
 tables+0.037
iffs+0.036
Neg logit contributions
 unforgettable-0.041
 newfound-0.037
 lifetime-0.037
 lifelong-0.036
 thrilling-0.035
Token down
Token ID866
Feature activation-1.36e-02

Pos logit contributions
iffs+0.029
 seeker+0.028
 court+0.026
 slams+0.026
 rooms+0.026
Neg logit contributions
 unforgettable-0.044
 newfound-0.042
b-0.040
imm-0.040
 thrilling-0.040
Token.
Token ID13
Feature activation-7.01e-03

Pos logit contributions
iffs+0.017
 seeker+0.016
 Gard+0.015
 slams+0.015
 court+0.015
Neg logit contributions
 unforgettable-0.021
 newfound-0.020
b-0.020
 thrilling-0.019
imm-0.019
Token He
Token ID679
Feature activation-2.99e-02

Pos logit contributions
 seeker+0.010
 Gard+0.009
 gard+0.009
 anth+0.008
 court+0.008
Neg logit contributions
 unforgettable-0.009
 newfound-0.009
 happened-0.008
 brand-0.008
 thrilling-0.008
TokenTom
Token ID13787
Feature activation+5.67e-04

Pos logit contributions
 Gard+0.000
 gard+0.000
 slams+0.000
 rooms+0.000
 chefs+0.000
Neg logit contributions
b-0.000
 thrilling-0.000
 unforgettable-0.000
ch-0.000
 beating-0.000
Token went
Token ID1816
Feature activation-1.28e-02

Pos logit contributions
 unforgettable+0.001
 newfound+0.001
 thrilling+0.001
 lifelong+0.001
immer+0.001
Neg logit contributions
 peel-0.001
 rooms-0.001
 court-0.001
 slams-0.001
 seeker-0.001
Token back
Token ID736
Feature activation-4.41e-03

Pos logit contributions
 Gard+0.020
iffs+0.020
 seeker+0.019
 slams+0.019
 rooms+0.018
Neg logit contributions
 unforgettable-0.015
b-0.015
 newfound-0.014
 thrilling-0.014
 brand-0.014
Token to
Token ID284
Feature activation+9.87e-03

Pos logit contributions
iffs+0.006
 seeker+0.006
 Gard+0.006
 slams+0.006
 court+0.006
Neg logit contributions
 unforgettable-0.006
 newfound-0.005
b-0.005
imm-0.005
 thrilling-0.005
Token the
Token ID262
Feature activation+2.97e-03

Pos logit contributions
 newfound+0.011
 unforgettable+0.010
 thrilling+0.009
 happened+0.009
b+0.009
Neg logit contributions
iffs-0.016
 slams-0.015
 Gard-0.015
 seeker-0.015
itt-0.015
Token sea
Token ID5417
Feature activation-4.81e-02

Pos logit contributions
 unforgettable+0.005
imm+0.005
 newfound+0.005
b+0.005
immer+0.005
Neg logit contributions
iffs-0.003
 seeker-0.003
 court-0.003
 rooms-0.003
 peel-0.003
Token with
Token ID351
Feature activation-9.12e-03

Pos logit contributions
iffs+0.058
 seeker+0.055
 slams+0.054
 Gard+0.053
 court+0.051
Neg logit contributions
 unforgettable-0.076
 newfound-0.073
b-0.073
imm-0.070
 thrilling-0.069
Token his
Token ID465
Feature activation-1.32e-02

Pos logit contributions
iffs+0.013
 Gard+0.012
 seeker+0.012
 slams+0.012
 court+0.012
Neg logit contributions
 unforgettable-0.013
 newfound-0.013
b-0.012
 thrilling-0.012
 brand-0.012
Token new
Token ID649
Feature activation+1.00e-02

Pos logit contributions
 rooms+0.012
 court+0.011
 slide+0.011
 chefs+0.011
 peel+0.011
Neg logit contributions
immer-0.026
imm-0.026
 unforgettable-0.026
pr-0.024
bound-0.023
Token boat
Token ID8848
Feature activation+5.83e-03

Pos logit contributions
 unforgettable+0.019
imm+0.019
immer+0.018
pr+0.017
 thrilling+0.017
Neg logit contributions
 rooms-0.010
 court-0.009
 slide-0.009
 gard-0.009
 sink-0.009
Token.
Token ID13
Feature activation+1.92e-04

Pos logit contributions
 unforgettable+0.014
 thrilling+0.014
 newfound+0.013
 lifelong+0.013
 remarkable+0.013
Neg logit contributions
 court-0.003
 peel-0.003
 rooms-0.002
 sink-0.002
 seeker-0.002
Token lesson
Token ID11483
Feature activation-1.60e-03

Pos logit contributions
iffs+0.014
 seeker+0.014
 slams+0.014
 rooms+0.013
 court+0.013
Neg logit contributions
 unforgettable-0.007
 newfound-0.007
 thrilling-0.006
b-0.006
 remarkable-0.006
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
iffs+0.002
 seeker+0.002
 Gard+0.002
 slams+0.002
 court+0.002
Neg logit contributions
 unforgettable-0.003
b-0.002
 newfound-0.002
 thrilling-0.002
imm-0.002
Token They
Token ID1119
Feature activation-2.72e-02

Pos logit contributions
 peel+0.014
 seeker+0.014
 yog+0.013
 court+0.013
 slams+0.013
Neg logit contributions
 unforgettable-0.015
 newfound-0.014
 true-0.014
 lifelong-0.014
 happened-0.014
Token would
Token ID561
Feature activation-3.19e-02

Pos logit contributions
iffs+0.040
 seeker+0.038
 court+0.037
 slams+0.037
 rooms+0.037
Neg logit contributions
 unforgettable-0.037
 newfound-0.034
b-0.033
imm-0.033
 thrilling-0.032
Token not
Token ID407
Feature activation+3.79e-03

Pos logit contributions
iffs+0.042
 seeker+0.038
 rooms+0.037
 Gard+0.037
 slams+0.037
Neg logit contributions
 unforgettable-0.048
 newfound-0.045
b-0.044
 thrilling-0.043
 brand-0.042
Token hide
Token ID7808
Feature activation-4.51e-02

Pos logit contributions
 unforgettable+0.006
b+0.005
 newfound+0.005
imm+0.005
 thrilling+0.005
Neg logit contributions
iffs-0.005
 seeker-0.005
 rooms-0.005
 Gard-0.005
 slams-0.005
Token behind
Token ID2157
Feature activation-3.32e-03

Pos logit contributions
iffs+0.044
 seeker+0.042
 rooms+0.041
 slams+0.040
 court+0.040
Neg logit contributions
 unforgettable-0.083
 newfound-0.079
imm-0.077
b-0.077
 thrilling-0.076
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
 rooms+0.003
 tables+0.003
 court+0.003
 yourselves+0.003
 slams+0.003
Neg logit contributions
imm-0.007
 unforgettable-0.007
immer-0.006
b-0.006
 newfound-0.006
Token curtain
Token ID29461
Feature activation-1.86e-02

Pos logit contributions
iffs+0.021
 Enjoy+0.020
 seeker+0.020
 Gard+0.020
 slams+0.019
Neg logit contributions
 brand-0.023
 thrilling-0.023
 newfound-0.023
 unforgettable-0.022
 remarkable-0.022
Token again
Token ID757
Feature activation-1.39e-02

Pos logit contributions
iffs+0.022
 seeker+0.021
 Gard+0.020
 court+0.020
 slams+0.020
Neg logit contributions
 unforgettable-0.030
 newfound-0.028
b-0.028
imm-0.028
 thrilling-0.027
Token.
Token ID13
Feature activation-9.03e-03

Pos logit contributions
iffs+0.012
 court+0.012
 seeker+0.012
 rooms+0.011
 peel+0.011
Neg logit contributions
 unforgettable-0.027
imm-0.026
 newfound-0.026
b-0.025
 brand-0.025
Token
Token ID198
Feature activation+1.23e-02

Pos logit contributions
 unforgettable+0.013
 newfound+0.012
b+0.012
imm+0.012
 thrilling+0.012
Neg logit contributions
iffs-0.007
 court-0.006
 seeker-0.006
 Gard-0.006
 slams-0.006
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 unforgettable+0.026
imm+0.024
 newfound+0.024
 brand+0.023
b+0.023
Neg logit contributions
iffs-0.011
 court-0.009
 peel-0.008
 Gard-0.008
 Enjoy-0.008
TokenThe
Token ID464
Feature activation+4.01e-03

Pos logit contributions
 unforgettable+0.020
imm+0.019
 brand+0.019
 newfound+0.019
 happened+0.019
Neg logit contributions
iffs-0.014
 court-0.013
 peel-0.013
 Gard-0.012
 Enjoy-0.011
Token le
Token ID443
Feature activation+5.57e-03

Pos logit contributions
imm+0.008
 unforgettable+0.008
ittered+0.008
immer+0.007
 thrilling+0.007
Neg logit contributions
 gard-0.004
 peel-0.004
 Gard-0.004
 court-0.004
 slide-0.003
Tokenopard
Token ID15478
Feature activation+7.53e-04

Pos logit contributions
 unforgettable+0.010
 newfound+0.009
 thrilling+0.009
 brand+0.009
 lifelong+0.009
Neg logit contributions
iffs-0.006
 Gard-0.006
 peel-0.006
 seeker-0.006
 court-0.006
Token cub
Token ID13617
Feature activation-4.56e-02

Pos logit contributions
 unforgettable+0.001
 brand+0.001
immer+0.001
 thrilling+0.001
 lifelong+0.001
Neg logit contributions
 peel-0.001
 gard-0.001
 court-0.001
 Gard-0.001
 slams-0.001
Token gul
Token ID47161
Feature activation-2.83e-02

Pos logit contributions
 court+0.046
 peel+0.046
itt+0.043
 Gard+0.042
 yog+0.041
Neg logit contributions
 unforgettable-0.077
 brand-0.074
 thrilling-0.070
imm-0.068
immer-0.068
Tokenped
Token ID9124
Feature activation-1.72e-02

Pos logit contributions
iffs+0.045
 peel+0.041
 Gard+0.041
 court+0.041
 slams+0.040
Neg logit contributions
 unforgettable-0.037
 newfound-0.033
 brand-0.033
 lifetime-0.033
 thrilling-0.033
Token but
Token ID475
Feature activation-1.62e-02

Pos logit contributions
 peel+0.012
 court+0.012
iffs+0.011
 ear+0.011
 tables+0.010
Neg logit contributions
 unforgettable-0.035
imm-0.034
b-0.033
 lifetime-0.033
 newfound-0.032
Token he
Token ID339
Feature activation-2.97e-03

Pos logit contributions
 court+0.014
 seeker+0.014
 gard+0.014
 Gard+0.013
 Ben+0.013
Neg logit contributions
imm-0.029
 unforgettable-0.028
immer-0.028
 brand-0.028
 thrilling-0.027
Token was
Token ID373
Feature activation-8.95e-03

Pos logit contributions
 peel+0.003
 bar+0.003
 Gard+0.003
 gard+0.003
 yog+0.003
Neg logit contributions
 unforgettable-0.005
 lifelong-0.005
 brand-0.005
 coll-0.005
immer-0.005
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
 unforgettable+0.005
 newfound+0.005
b+0.005
 thrilling+0.004
imm+0.004
Neg logit contributions
iffs-0.004
 seeker-0.004
 Gard-0.004
 rooms-0.004
 slams-0.004
Token She
Token ID1375
Feature activation-1.14e-02

Pos logit contributions
iffs+0.015
 seeker+0.015
 peel+0.014
 Gard+0.013
 court+0.013
Neg logit contributions
 unforgettable-0.030
 newfound-0.028
 brand-0.028
 happened-0.027
 thrilling-0.027
Token learned
Token ID4499
Feature activation-1.34e-02

Pos logit contributions
 slams+0.015
 rooms+0.015
iffs+0.015
 seeker+0.015
 peel+0.014
Neg logit contributions
 unforgettable-0.017
 newfound-0.016
imm-0.016
 lifelong-0.015
b-0.015
Token that
Token ID326
Feature activation-5.09e-04

Pos logit contributions
iffs+0.018
 seeker+0.017
 court+0.017
 slams+0.017
 rooms+0.016
Neg logit contributions
 unforgettable-0.020
 newfound-0.019
imm-0.019
b-0.018
 thrilling-0.018
Token accidents
Token ID17390
Feature activation-8.65e-03

Pos logit contributions
iffs+0.001
 seeker+0.001
 court+0.001
 rooms+0.001
 slams+0.001
Neg logit contributions
 unforgettable-0.001
 newfound-0.001
imm-0.001
b-0.001
 thrilling-0.001
Token can
Token ID460
Feature activation-5.11e-02

Pos logit contributions
iffs+0.011
 slams+0.011
 rooms+0.011
 peel+0.010
 court+0.010
Neg logit contributions
 unforgettable-0.013
 newfound-0.012
 lifetime-0.012
imm-0.012
immer-0.012
Token happen
Token ID1645
Feature activation-1.20e-02

Pos logit contributions
iffs+0.065
 peel+0.065
 court+0.065
 Gard+0.061
 seeker+0.061
Neg logit contributions
 unforgettable-0.078
 newfound-0.073
imm-0.071
 lifetime-0.070
 thrilling-0.069
Token,
Token ID11
Feature activation-9.04e-03

Pos logit contributions
iffs+0.010
 rooms+0.010
 court+0.009
 yourselves+0.008
 seeker+0.008
Neg logit contributions
 unforgettable-0.024
 newfound-0.023
imm-0.023
b-0.022
 lifetime-0.022
Token but
Token ID475
Feature activation-1.37e-02

Pos logit contributions
 seeker+0.008
iffs+0.008
 rooms+0.007
 court+0.007
 slams+0.007
Neg logit contributions
 unforgettable-0.017
imm-0.017
 newfound-0.017
b-0.016
 lifetime-0.016
Token it
Token ID340
Feature activation+8.64e-03

Pos logit contributions
iffs+0.018
 seeker+0.018
 court+0.017
 rooms+0.017
 slams+0.017
Neg logit contributions
 unforgettable-0.020
 newfound-0.019
imm-0.019
b-0.018
 thrilling-0.018
Token's
Token ID338
Feature activation+1.27e-02

Pos logit contributions
 unforgettable+0.011
b+0.010
 newfound+0.010
imm+0.010
 thrilling+0.009
Neg logit contributions
iffs-0.013
 seeker-0.013
 court-0.013
 Gard-0.012
 rooms-0.012
Token had
Token ID550
Feature activation-3.30e-03

Pos logit contributions
 peel+0.015
 slams+0.015
iffs+0.015
 bar+0.014
 rooms+0.014
Neg logit contributions
 unforgettable-0.018
 lifetime-0.016
 newfound-0.016
immer-0.016
 lifelong-0.016
Token no
Token ID645
Feature activation+3.96e-03

Pos logit contributions
 rooms+0.004
iffs+0.004
 seeker+0.004
 peel+0.004
 court+0.004
Neg logit contributions
 unforgettable-0.005
imm-0.005
 newfound-0.005
b-0.005
immer-0.005
Token skin
Token ID4168
Feature activation-2.49e-02

Pos logit contributions
 unforgettable+0.007
imm+0.006
 newfound+0.006
b+0.006
 thrilling+0.006
Neg logit contributions
iffs-0.004
 seeker-0.004
 court-0.004
 rooms-0.004
 peel-0.004
Token,
Token ID11
Feature activation-1.78e-02

Pos logit contributions
iffs+0.026
 seeker+0.024
 peel+0.024
 court+0.023
 slams+0.023
Neg logit contributions
 unforgettable-0.045
 newfound-0.043
imm-0.041
 thrilling-0.041
b-0.041
Token no
Token ID645
Feature activation+8.08e-03

Pos logit contributions
iffs+0.021
 seeker+0.021
 court+0.020
 peel+0.020
 slams+0.020
Neg logit contributions
 unforgettable-0.028
imm-0.027
 newfound-0.027
b-0.026
 thrilling-0.025
Token flesh
Token ID11222
Feature activation-5.26e-02

Pos logit contributions
 unforgettable+0.014
imm+0.014
 newfound+0.013
b+0.013
 thrilling+0.013
Neg logit contributions
iffs-0.008
 seeker-0.008
 rooms-0.008
 court-0.008
 peel-0.008
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
iffs+0.053
 seeker+0.050
 peel+0.049
 court+0.049
 rooms+0.048
Neg logit contributions
 unforgettable-0.097
 newfound-0.091
 thrilling-0.088
imm-0.088
b-0.087
Token no
Token ID645
Feature activation+9.77e-03

Pos logit contributions
iffs+0.015
 seeker+0.015
 peel+0.015
 court+0.014
 rooms+0.014
Neg logit contributions
 unforgettable-0.021
imm-0.020
 newfound-0.020
b-0.019
 lifetime-0.019
Token hair
Token ID4190
Feature activation-6.57e-03

Pos logit contributions
 unforgettable+0.017
imm+0.016
 newfound+0.016
b+0.016
 thrilling+0.016
Neg logit contributions
iffs-0.010
 seeker-0.010
 rooms-0.010
 court-0.009
 peel-0.009
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
iffs+0.006
 peel+0.006
 court+0.006
 seeker+0.006
 slams+0.005
Neg logit contributions
 unforgettable-0.013
 newfound-0.012
 thrilling-0.012
imm-0.012
 lifelong-0.012
Token It
Token ID632
Feature activation-1.90e-02

Pos logit contributions
iffs+0.018
 seeker+0.018
 slams+0.017
 rooms+0.016
 court+0.016
Neg logit contributions
 unforgettable-0.023
 newfound-0.022
imm-0.021
b-0.021
 brand-0.021