TOP ACTIVATIONS
MAX = 7.117

found the truck many years ago and worked hard to fix
You 're a good h ider , Tim my !" Tim
You 're the best h ider , Ben , but the
by a famous artist long ago . One
had been made many years ago , but it still looked
was such a good h ider . After
imagined the tart was long ago , when she was a
velvet box all those years ago . They were glad that
this house a long time ago ." L ily
- six , forty - seven , forty - eight ,
I was the best h ider !" Tom saw
big farm a long time ago . Lily listened as the
You 're a good h ider , Lily ."
in the park many years ago . Toby was amazed !
- six , twenty - seven , twenty - eight ,
he had seen a year ago . He went home ,
and Tom is the h ider . Tom runs to a
You 're a great h ider . Let 's play again
- six , thirty - seven , thirty - eight ,
and Ben is the h ider . Lily counts to ten

MIN ACTIVATIONS
MIN = -7.237

sk ates and helmets and pads . They use a puck
. <|endoftext|> M ama and pap a were busy in the
to obey m ama and pap a .
. We have chicken and potatoes and carrots and a large
mom cooked dinner and put onions in the food . Tim
. It has carrots and potatoes and cheese . You like
. It has carrots and potatoes and chicken . You like
asked her m ama and pap a if she could go
help her m ama and pap a . So
put on her helmet and pads . They were very gentle
. They wear helmets and pads to be safe .
. She added spices and herbs to make the food taste
. It has carrots and potatoes and chicken . You like
cooked things like chicken and potatoes . Sometimes she served salad
gave them medicine and band ages . " Tom
pulled up some carrots and potatoes . He also picked some
. He saw carrots and potatoes and chicken in the soup
carrots , potatoes , and onions and put them in a
cooked something with carrots and potatoes . It was y ummy
prefer to eat fruits and vegetables . They are good for

INTERVAL 3.529 - 7.117
CONTAINS 0.346%

the moon . One day , they take their
like to play in the park . One day , they
very much . One day , Ben sees a
You are the best niece ever ." They laughed and drew
That â € ™ s not real gold ! It â

INTERVAL -0.060 - 3.529
CONTAINS 49.856%

high and bright . It still said , " Come to
make too much noise . His mom smiled , happy to
child . The man and the child smiled at each other
help . She found a grown - up who was deaf
L ily : " This flower is rotting . Can

INTERVAL -3.648 - -0.060
CONTAINS 49.640%

The End . <|endoftext|> John was weak .
Like chess . Or knitting . Or sleeping ."
surprised ! He drove as fast as he could and soon
around the lamp , feeling very happy . Next
. One day , he took a walk in the woods

INTERVAL -7.237 - -3.648
CONTAINS 0.158%

to Jill , " Let 's go play !" But Jill
salad . It had red tomatoes , green lettuce , and
" Yeah , but let 's add some onion to make
, the boss y duck ling said , " Sc atter
dad said , " Let 's add some fire work to
Token found
Token ID1043
Feature activation+1.31e+00

Pos logit contributions
 first+0.032
 Mia+0.031
 moment+0.031
 Sue+0.030
 Jen+0.030
Neg logit contributions
hing-0.064
 streaming-0.059
ming-0.058
onder-0.054
cient-0.053
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
 first+0.031
 moment+0.030
 Mia+0.030
 Jen+0.029
 Sue+0.028
Neg logit contributions
hing-0.073
 streaming-0.070
ming-0.069
onder-0.064
cient-0.063
Token truck
Token ID7779
Feature activation+2.53e+00

Pos logit contributions
 Mia+0.004
 first+0.004
 Jen+0.004
 moment+0.004
 Sue+0.004
Neg logit contributions
hing-0.007
 streaming-0.007
ming-0.007
onder-0.006
cient-0.006
Token many
Token ID867
Feature activation+1.59e+00

Pos logit contributions
 first+0.078
 moment+0.076
 Mia+0.074
 Jen+0.073
 Sue+0.072
Neg logit contributions
hing-0.123
 streaming-0.117
ming-0.114
onder-0.105
cient-0.103
Token years
Token ID812
Feature activation+1.65e+00

Pos logit contributions
 first+0.049
 moment+0.048
 Mia+0.046
 Jen+0.045
 Sue+0.045
Neg logit contributions
hing-0.079
 streaming-0.075
ming-0.073
onder-0.067
cient-0.066
Token ago
Token ID2084
Feature activation+7.12e+00

Pos logit contributions
 moment+0.037
 Mia+0.037
 first+0.036
 Jen+0.035
 Sue+0.034
Neg logit contributions
hing-0.090
 streaming-0.089
ming-0.086
onder-0.081
ug-0.079
Token and
Token ID290
Feature activation+5.14e-01

Pos logit contributions
 moment+0.178
 Mia+0.178
 first+0.177
 Jen+0.171
 Sue+0.165
Neg logit contributions
hing-0.371
 streaming-0.361
ming-0.350
onder-0.325
ning-0.316
Token worked
Token ID3111
Feature activation+6.54e-01

Pos logit contributions
 first+0.015
 Mia+0.014
 moment+0.014
 Jen+0.014
 Sue+0.014
Neg logit contributions
hing-0.027
 streaming-0.025
ming-0.025
onder-0.023
severe-0.023
Token hard
Token ID1327
Feature activation-1.00e+00

Pos logit contributions
 Mia+0.021
 Jen+0.021
 moment+0.021
 first+0.020
ila+0.020
Neg logit contributions
 streaming-0.029
hing-0.029
ming-0.028
onder-0.025
ning-0.025
Token to
Token ID284
Feature activation+5.52e-01

Pos logit contributions
hing+0.053
 streaming+0.050
ming+0.049
onder+0.046
cient+0.045
Neg logit contributions
 first-0.028
 moment-0.027
 Mia-0.026
 Jen-0.025
 Sue-0.025
Token fix
Token ID4259
Feature activation+1.07e+00

Pos logit contributions
 first+0.018
 moment+0.018
 Mia+0.017
 Jen+0.017
 Sue+0.017
Neg logit contributions
hing-0.026
 streaming-0.025
ming-0.024
onder-0.022
cient-0.021
TokenYou
Token ID1639
Feature activation-6.25e-01

Pos logit contributions
hing+0.041
 streaming+0.037
ming+0.037
ting+0.033
onder+0.032
Neg logit contributions
 first-0.033
 Mia-0.032
 Jen-0.031
 moment-0.031
 Sue-0.031
Token're
Token ID821
Feature activation-2.17e+00

Pos logit contributions
hing+0.033
 streaming+0.032
ming+0.032
ning+0.030
ting+0.029
Neg logit contributions
 first-0.019
 Mia-0.017
 Jen-0.016
 Sue-0.016
 moment-0.016
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
hing+0.112
 streaming+0.108
ming+0.105
onder+0.097
cient+0.095
Neg logit contributions
 moment-0.059
 first-0.057
 Mia-0.055
 Jen-0.054
 Sue-0.053
Token good
Token ID922
Feature activation-1.65e-01

Pos logit contributions
 first+0.036
 moment+0.036
 Jen+0.035
 Mia+0.035
 Sue+0.035
Neg logit contributions
hing-0.054
 streaming-0.051
ming-0.050
onder-0.046
cient-0.045
Token h
Token ID289
Feature activation+1.20e-01

Pos logit contributions
hing+0.008
 streaming+0.007
ming+0.007
onder+0.007
severe+0.006
Neg logit contributions
 first-0.005
 moment-0.005
 Mia-0.005
 Jen-0.005
 Sue-0.005
Tokenider
Token ID1304
Feature activation+7.01e+00

Pos logit contributions
 moment+0.003
 first+0.003
 Mia+0.003
 Sue+0.003
 Jen+0.003
Neg logit contributions
hing-0.007
 streaming-0.006
ming-0.006
onder-0.006
ug-0.006
Token,
Token ID11
Feature activation-4.43e-01

Pos logit contributions
 moment+0.218
ila+0.210
 Jen+0.207
 first+0.206
 Sue+0.202
Neg logit contributions
hing-0.288
 streaming-0.282
ming-0.277
onder-0.273
ug-0.253
Token Tim
Token ID5045
Feature activation+1.25e+00

Pos logit contributions
hing+0.023
 streaming+0.022
ming+0.022
onder+0.020
severe+0.020
Neg logit contributions
 moment-0.012
 first-0.011
 Mia-0.011
 Jen-0.011
 Sue-0.010
Tokenmy
Token ID1820
Feature activation-1.25e+00

Pos logit contributions
 moment+0.046
 first+0.045
 Mia+0.044
 Jen+0.044
 Sue+0.043
Neg logit contributions
hing-0.051
ming-0.049
 streaming-0.048
ning-0.044
onder-0.043
Token!"
Token ID2474
Feature activation+1.48e+00

Pos logit contributions
hing+0.071
 streaming+0.068
ming+0.065
onder+0.061
severe+0.060
Neg logit contributions
 first-0.030
 Mia-0.028
 moment-0.028
 Jen-0.027
 Sue-0.027
Token Tim
Token ID5045
Feature activation+7.29e-01

Pos logit contributions
 first+0.040
 moment+0.038
ila+0.036
 Jen+0.035
 minute+0.034
Neg logit contributions
hing-0.077
 streaming-0.075
ming-0.074
onder-0.066
cient-0.065
Token You
Token ID921
Feature activation+4.84e-01

Pos logit contributions
hing+0.042
 streaming+0.041
ming+0.039
severe+0.036
ting+0.036
Neg logit contributions
 first-0.028
 moment-0.027
oe-0.025
 minute-0.024
 next-0.023
Token're
Token ID821
Feature activation-2.15e+00

Pos logit contributions
 first+0.016
 Mia+0.014
 Sue+0.014
 Jen+0.014
 moment+0.013
Neg logit contributions
 streaming-0.023
hing-0.023
ming-0.023
ning-0.022
onder-0.021
Token the
Token ID262
Feature activation+5.17e-02

Pos logit contributions
hing+0.112
 streaming+0.108
ming+0.105
onder+0.097
cient+0.095
Neg logit contributions
 moment-0.058
 first-0.058
 Mia-0.055
 Jen-0.054
 Sue-0.053
Token best
Token ID1266
Feature activation-1.07e+00

Pos logit contributions
 first+0.002
 moment+0.002
 Mia+0.002
 Jen+0.002
 Sue+0.002
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.002
cient-0.002
Token h
Token ID289
Feature activation+1.87e-01

Pos logit contributions
hing+0.048
 streaming+0.046
ming+0.044
onder+0.041
ning+0.040
Neg logit contributions
 first-0.036
 moment-0.036
 Mia-0.035
 Jen-0.034
 Sue-0.033
Tokenider
Token ID1304
Feature activation+6.97e+00

Pos logit contributions
 moment+0.006
 first+0.006
 Mia+0.006
 Sue+0.006
 Jen+0.006
Neg logit contributions
hing-0.009
 streaming-0.008
ming-0.008
onder-0.008
ug-0.007
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
 moment+0.212
 first+0.206
ila+0.201
 Jen+0.201
 Mia+0.198
Neg logit contributions
hing-0.296
 streaming-0.289
ming-0.281
onder-0.275
ug-0.257
Token Ben
Token ID3932
Feature activation-2.05e-01

Pos logit contributions
hing+0.007
 streaming+0.007
ming+0.007
ting+0.006
severe+0.006
Neg logit contributions
 moment-0.004
 first-0.004
oe-0.004
 Mia-0.004
 Jen-0.004
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
hing+0.011
 streaming+0.011
ming+0.011
onder+0.010
severe+0.010
Neg logit contributions
 first-0.005
 Mia-0.005
 moment-0.005
 Jen-0.005
 Sue-0.005
Token but
Token ID475
Feature activation-1.50e+00

Pos logit contributions
hing+0.009
 streaming+0.009
ming+0.009
onder+0.008
cient+0.008
Neg logit contributions
 first-0.005
 moment-0.005
 Mia-0.005
 Jen-0.005
 Sue-0.005
Token the
Token ID262
Feature activation+9.70e-02

Pos logit contributions
hing+0.088
 streaming+0.085
ming+0.083
onder+0.077
cient+0.076
Neg logit contributions
 moment-0.030
 first-0.030
 Mia-0.028
 Jen-0.028
 Sue-0.027
Token by
Token ID416
Feature activation+4.76e-01

Pos logit contributions
hing+0.015
 streaming+0.015
ming+0.014
cient+0.013
onder+0.013
Neg logit contributions
 first-0.009
 Mia-0.009
 Jen-0.009
 Sue-0.009
 moment-0.008
Token a
Token ID257
Feature activation+1.05e-01

Pos logit contributions
 first+0.016
 moment+0.016
 Mia+0.015
oe+0.015
 Jen+0.015
Neg logit contributions
hing-0.022
 streaming-0.020
ming-0.020
onder-0.018
ning-0.017
Token famous
Token ID5863
Feature activation+5.00e-02

Pos logit contributions
 first+0.003
 moment+0.003
 Mia+0.003
 Jen+0.003
 Sue+0.003
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.005
onder-0.005
cient-0.004
Token artist
Token ID6802
Feature activation+1.43e+00

Pos logit contributions
 moment+0.002
 first+0.002
 Mia+0.002
 Jen+0.002
ila+0.002
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.002
cient-0.002
Token long
Token ID890
Feature activation-5.85e-01

Pos logit contributions
 first+0.029
 moment+0.027
 Mia+0.027
 Jen+0.025
 Sue+0.025
Neg logit contributions
hing-0.086
 streaming-0.083
ming-0.080
onder-0.076
severe-0.075
Token ago
Token ID2084
Feature activation+6.87e+00

Pos logit contributions
hing+0.031
 streaming+0.029
ming+0.029
onder+0.027
cient+0.026
Neg logit contributions
 first-0.016
 moment-0.016
 Mia-0.015
 Jen-0.015
 Sue-0.015
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
 moment+0.158
 first+0.156
 Mia+0.154
 Jen+0.149
 Sue+0.145
Neg logit contributions
hing-0.374
 streaming-0.363
ming-0.353
onder-0.330
ning-0.322
Token
Token ID220
Feature activation+3.76e-01

Pos logit contributions
 first+0.006
 moment+0.005
oe+0.005
ila+0.005
 next+0.005
Neg logit contributions
hing-0.008
 streaming-0.007
ming-0.007
 Australian-0.006
cient-0.006
Token
Token ID198
Feature activation-5.90e-02

Pos logit contributions
 moment+0.010
 first+0.010
 minute+0.009
 Mia+0.009
 Jen+0.009
Neg logit contributions
hing-0.019
 streaming-0.017
ming-0.017
onder-0.016
ug-0.016
Token
Token ID198
Feature activation+7.21e-02

Pos logit contributions
hing+0.002
ming+0.002
 streaming+0.002
 Australian+0.002
ug+0.002
Neg logit contributions
 first-0.002
 Jen-0.002
ider-0.002
 moment-0.002
oe-0.002
TokenOne
Token ID3198
Feature activation+4.53e+00

Pos logit contributions
 Jen+0.002
 first+0.002
 Mia+0.002
 moment+0.002
 Sue+0.002
Neg logit contributions
hing-0.004
ming-0.003
 Australian-0.003
 streaming-0.003
ug-0.003
Token had
Token ID550
Feature activation+4.34e-01

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
onder+0.001
cient+0.001
Neg logit contributions
 first-0.001
 moment-0.001
 Mia-0.001
 Jen-0.001
 Sue-0.001
Token been
Token ID587
Feature activation+6.85e-01

Pos logit contributions
 first+0.010
 Jen+0.009
 Mia+0.009
 Sue+0.009
 moment+0.009
Neg logit contributions
hing-0.026
 streaming-0.024
ming-0.024
cient-0.022
onder-0.022
Token made
Token ID925
Feature activation+2.28e-01

Pos logit contributions
 first+0.024
 moment+0.023
 Mia+0.023
 Jen+0.022
 Sue+0.022
Neg logit contributions
hing-0.031
 streaming-0.030
ming-0.029
onder-0.026
cient-0.026
Token many
Token ID867
Feature activation+1.07e+00

Pos logit contributions
 first+0.007
 moment+0.007
 Mia+0.007
 Jen+0.007
 next+0.007
Neg logit contributions
hing-0.011
 streaming-0.011
ming-0.010
cient-0.009
onder-0.009
Token years
Token ID812
Feature activation+1.54e+00

Pos logit contributions
 first+0.023
 next+0.022
 moment+0.020
 minute+0.019
 new+0.018
Neg logit contributions
hing-0.065
 streaming-0.063
cient-0.061
severe-0.061
ming-0.060
Token ago
Token ID2084
Feature activation+6.86e+00

Pos logit contributions
 first+0.029
 moment+0.029
 Mia+0.028
 Jen+0.027
 Sue+0.026
Neg logit contributions
hing-0.092
 streaming-0.089
ming-0.087
onder-0.082
cient-0.080
Token,
Token ID11
Feature activation+5.99e-01

Pos logit contributions
 first+0.148
 moment+0.148
 Mia+0.145
 Jen+0.140
 Sue+0.136
Neg logit contributions
hing-0.388
 streaming-0.375
ming-0.366
onder-0.341
ning-0.333
Token but
Token ID475
Feature activation-8.35e-01

Pos logit contributions
 moment+0.018
oe+0.017
 first+0.017
 Mia+0.016
ila+0.016
Neg logit contributions
hing-0.028
 streaming-0.028
ming-0.027
onder-0.025
ning-0.024
Token it
Token ID340
Feature activation-1.66e-02

Pos logit contributions
hing+0.038
 streaming+0.038
ming+0.036
onder+0.033
ning+0.033
Neg logit contributions
 moment-0.025
 Mia-0.023
 first-0.023
 Jen-0.023
oe-0.023
Token still
Token ID991
Feature activation+1.42e+00

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
severe+0.001
onder+0.001
Neg logit contributions
 first-0.001
 moment-0.000
 Mia-0.000
 Jen-0.000
 Sue-0.000
Token looked
Token ID3114
Feature activation+1.65e+00

Pos logit contributions
 first+0.051
 moment+0.051
 Mia+0.050
 Jen+0.049
 Sue+0.048
Neg logit contributions
hing-0.063
 streaming-0.058
ming-0.057
severe-0.053
onder-0.052
Token was
Token ID373
Feature activation+5.67e-01

Pos logit contributions
 first+0.009
 Mia+0.008
 moment+0.008
 Sue+0.008
 next+0.008
Neg logit contributions
hing-0.014
 streaming-0.013
ming-0.013
onder-0.012
ning-0.011
Token such
Token ID884
Feature activation+1.28e-01

Pos logit contributions
 moment+0.019
 Jen+0.018
 Mia+0.018
 first+0.018
ila+0.017
Neg logit contributions
 streaming-0.025
hing-0.025
ming-0.023
onder-0.021
ting-0.021
Token a
Token ID257
Feature activation+8.12e-01

Pos logit contributions
 first+0.002
 moment+0.002
 Mia+0.002
 Jen+0.002
 Sue+0.002
Neg logit contributions
hing-0.008
 streaming-0.008
ming-0.007
onder-0.007
cient-0.007
Token good
Token ID922
Feature activation-3.74e-01

Pos logit contributions
 Jen+0.027
 Mia+0.027
ila+0.026
 Sue+0.025
 first+0.025
Neg logit contributions
hing-0.034
 streaming-0.032
ming-0.031
severe-0.029
onder-0.029
Token h
Token ID289
Feature activation+3.74e-02

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.016
onder+0.015
severe+0.014
Neg logit contributions
 first-0.012
 moment-0.012
 Mia-0.012
 Jen-0.012
 Sue-0.011
Tokenider
Token ID1304
Feature activation+6.86e+00

Pos logit contributions
 moment+0.001
 first+0.001
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.002
ug-0.002
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
 moment+0.179
 first+0.169
 Jen+0.167
ila+0.165
 Mia+0.163
Neg logit contributions
 streaming-0.325
hing-0.322
ming-0.311
onder-0.301
ug-0.289
Token
Token ID220
Feature activation-1.92e-01

Pos logit contributions
 streaming+0.009
hing+0.009
ming+0.008
 Australian+0.008
cient+0.008
Neg logit contributions
 moment-0.006
 first-0.006
ila-0.006
 Jen-0.005
oe-0.005
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
cient+0.009
Neg logit contributions
 moment-0.004
 first-0.004
 Mia-0.004
 Jen-0.004
 Sue-0.004
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
hing+0.015
ming+0.014
 streaming+0.014
 Australian+0.013
ug+0.013
Neg logit contributions
 moment-0.010
 first-0.010
 Jen-0.009
 Mia-0.009
 minute-0.009
TokenAfter
Token ID3260
Feature activation-6.38e-01

Pos logit contributions
hing+0.008
 streaming+0.007
ming+0.007
onder+0.006
cient+0.006
Neg logit contributions
 first-0.006
 moment-0.006
 Mia-0.006
 Sue-0.006
 Jen-0.006
Token imagined
Token ID15758
Feature activation+7.72e-01

Pos logit contributions
hing+0.007
 streaming+0.006
ming+0.006
severe+0.006
onder+0.005
Neg logit contributions
 first-0.004
 Mia-0.004
 moment-0.004
 Jen-0.004
 Sue-0.004
Token the
Token ID262
Feature activation+8.09e-03

Pos logit contributions
 moment+0.027
oe+0.025
 minute+0.025
 Jen+0.025
 first+0.024
Neg logit contributions
 streaming-0.032
hing-0.032
ming-0.031
ting-0.028
onder-0.028
Token tart
Token ID35842
Feature activation-1.20e+00

Pos logit contributions
 Jen+0.000
ila+0.000
 Mia+0.000
 Lia+0.000
 Sue+0.000
Neg logit contributions
hing-0.000
 streaming-0.000
ming-0.000
onder-0.000
ning-0.000
Token was
Token ID373
Feature activation+1.70e+00

Pos logit contributions
hing+0.059
 streaming+0.056
ming+0.054
onder+0.050
ning+0.049
Neg logit contributions
 first-0.034
 Jen-0.034
 moment-0.034
 Mia-0.033
 Sue-0.032
Token long
Token ID890
Feature activation-9.23e-02

Pos logit contributions
 moment+0.050
 first+0.048
 Mia+0.048
 Jen+0.048
ila+0.046
Neg logit contributions
 streaming-0.082
hing-0.082
ming-0.078
onder-0.071
ning-0.070
Token ago
Token ID2084
Feature activation+6.84e+00

Pos logit contributions
hing+0.005
 streaming+0.004
ming+0.004
onder+0.004
ning+0.004
Neg logit contributions
 first-0.003
 moment-0.003
 Mia-0.003
 Jen-0.003
 Sue-0.002
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
 moment+0.175
 first+0.175
 Mia+0.173
 Jen+0.170
 Sue+0.164
Neg logit contributions
hing-0.352
 streaming-0.344
ming-0.333
onder-0.309
ning-0.300
Token when
Token ID618
Feature activation-8.00e-01

Pos logit contributions
 moment+0.050
ila+0.050
 Jen+0.049
oe+0.049
 first+0.048
Neg logit contributions
 streaming-0.083
hing-0.079
ming-0.077
ting-0.070
onder-0.069
Token she
Token ID673
Feature activation-5.93e-01

Pos logit contributions
hing+0.042
 streaming+0.042
ming+0.040
onder+0.038
ning+0.037
Neg logit contributions
 moment-0.018
 first-0.018
 Jen-0.017
 Mia-0.017
ila-0.016
Token was
Token ID373
Feature activation-1.08e-02

Pos logit contributions
hing+0.027
 streaming+0.025
ming+0.025
severe+0.022
onder+0.022
Neg logit contributions
 first-0.021
 Mia-0.021
 moment-0.020
 Jen-0.020
 Sue-0.019
Token a
Token ID257
Feature activation+8.56e-01

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.000
cient+0.000
onder+0.000
Neg logit contributions
 first-0.000
 Mia-0.000
 moment-0.000
 Sue-0.000
 Jen-0.000
Token velvet
Token ID47750
Feature activation+1.01e-01

Pos logit contributions
 streaming+0.023
hing+0.022
ming+0.021
onder+0.019
cient+0.019
Neg logit contributions
 Mia-0.020
 Jen-0.020
ila-0.018
 Sue-0.018
 Jill-0.018
Token box
Token ID3091
Feature activation+1.04e+00

Pos logit contributions
 first+0.003
 moment+0.003
 Mia+0.003
 Jen+0.003
 Sue+0.003
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.004
onder-0.004
cient-0.004
Token all
Token ID477
Feature activation-1.93e+00

Pos logit contributions
 Mia+0.028
 Jen+0.028
 moment+0.028
ila+0.027
 first+0.027
Neg logit contributions
hing-0.052
 streaming-0.051
ming-0.049
ning-0.044
onder-0.044
Token those
Token ID883
Feature activation-2.93e-02

Pos logit contributions
hing+0.063
 streaming+0.058
ming+0.055
ning+0.048
ting+0.046
Neg logit contributions
 Jen-0.087
 Mia-0.087
ila-0.087
 first-0.085
 Lia-0.084
Token years
Token ID812
Feature activation+1.30e+00

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
onder+0.001
severe+0.001
Neg logit contributions
 first-0.001
 moment-0.001
 Mia-0.001
 Jen-0.001
 Sue-0.001
Token ago
Token ID2084
Feature activation+6.80e+00

Pos logit contributions
 moment+0.030
 Mia+0.030
 first+0.030
 Jen+0.029
 Sue+0.028
Neg logit contributions
hing-0.069
 streaming-0.069
ming-0.065
onder-0.062
ug-0.060
Token.
Token ID13
Feature activation-4.22e-01

Pos logit contributions
 Mia+0.153
 first+0.152
 moment+0.152
 Jen+0.146
 Sue+0.142
Neg logit contributions
hing-0.365
 streaming-0.359
ming-0.348
onder-0.327
ning-0.316
Token They
Token ID1119
Feature activation-7.11e-01

Pos logit contributions
hing+0.018
 streaming+0.017
ming+0.017
onder+0.016
severe+0.015
Neg logit contributions
 first-0.015
 moment-0.015
 minute-0.013
oe-0.013
 next-0.013
Token were
Token ID547
Feature activation-1.45e+00

Pos logit contributions
hing+0.032
ming+0.030
 streaming+0.030
ning+0.029
onder+0.028
Neg logit contributions
 first-0.026
 moment-0.023
 Mia-0.023
 Sue-0.022
 Jen-0.022
Token glad
Token ID9675
Feature activation-9.42e-01

Pos logit contributions
hing+0.072
ming+0.065
 streaming+0.065
severe+0.061
onder+0.061
Neg logit contributions
 first-0.048
 Mia-0.045
 Sue-0.044
 Jen-0.044
 moment-0.044
Token that
Token ID326
Feature activation-3.64e-01

Pos logit contributions
hing+0.050
 streaming+0.048
ming+0.047
onder+0.044
cient+0.043
Neg logit contributions
 first-0.024
 moment-0.024
 Mia-0.023
 Jen-0.023
 Sue-0.022
Token this
Token ID428
Feature activation+6.95e-01

Pos logit contributions
 streaming+0.004
hing+0.004
ming+0.004
onder+0.004
ning+0.004
Neg logit contributions
 moment-0.003
ila-0.003
 Jen-0.003
 Mia-0.003
oe-0.003
Token house
Token ID2156
Feature activation+1.35e+00

Pos logit contributions
 Mia+0.023
 Jen+0.023
 moment+0.023
 first+0.023
ila+0.023
Neg logit contributions
hing-0.029
 streaming-0.028
ming-0.028
onder-0.026
cient-0.026
Token a
Token ID257
Feature activation+6.90e-01

Pos logit contributions
 moment+0.038
 Jen+0.036
 Mia+0.036
 first+0.035
 Sue+0.035
Neg logit contributions
hing-0.067
ming-0.063
 streaming-0.063
onder-0.057
cient-0.057
Token long
Token ID890
Feature activation+2.69e-01

Pos logit contributions
 first+0.021
 moment+0.021
 Mia+0.021
 Jen+0.021
 Sue+0.020
Neg logit contributions
hing-0.033
 streaming-0.031
ming-0.030
onder-0.028
cient-0.028
Token time
Token ID640
Feature activation+2.41e+00

Pos logit contributions
 first+0.008
 moment+0.008
 Mia+0.008
 Jen+0.008
 Sue+0.008
Neg logit contributions
hing-0.013
 streaming-0.012
ming-0.012
onder-0.011
cient-0.011
Token ago
Token ID2084
Feature activation+6.80e+00

Pos logit contributions
 moment+0.058
 Jen+0.056
 Mia+0.056
 first+0.055
 Sue+0.055
Neg logit contributions
hing-0.127
 streaming-0.123
ming-0.120
onder-0.112
ning-0.108
Token."
Token ID526
Feature activation+1.81e+00

Pos logit contributions
 moment+0.177
 Mia+0.173
 first+0.168
 Jen+0.168
ila+0.162
Neg logit contributions
hing-0.338
 streaming-0.329
ming-0.322
onder-0.302
ning-0.295
Token
Token ID198
Feature activation+3.09e-02

Pos logit contributions
 moment+0.058
 first+0.057
 minute+0.052
ila+0.052
 common+0.049
Neg logit contributions
 streaming-0.075
ming-0.073
hing-0.072
usted-0.069
onder-0.067
Token
Token ID198
Feature activation+7.55e-02

Pos logit contributions
 first+0.001
 moment+0.001
 Jen+0.001
 minute+0.001
 Mia+0.001
Neg logit contributions
hing-0.001
ming-0.001
 streaming-0.001
onder-0.001
 Australian-0.001
TokenL
Token ID43
Feature activation+1.30e+00

Pos logit contributions
 moment+0.002
 Jen+0.002
 first+0.002
 Mia+0.002
 Sue+0.002
Neg logit contributions
hing-0.004
ming-0.004
 streaming-0.004
 Australian-0.003
onder-0.003
Tokenily
Token ID813
Feature activation+1.74e+00

Pos logit contributions
 moment+0.038
 first+0.036
 Jen+0.032
 ranger+0.032
 minute+0.032
Neg logit contributions
hing-0.063
 streaming-0.059
ils-0.057
ming-0.057
usted-0.057
Token-
Token ID12
Feature activation+1.50e+00

Pos logit contributions
 moment+0.243
 Mia+0.239
 Sue+0.237
 first+0.235
 Jen+0.234
Neg logit contributions
hing-0.187
ming-0.180
 streaming-0.167
onder-0.160
ning-0.152
Tokensix
Token ID19412
Feature activation+5.14e+00

Pos logit contributions
 moment+0.047
 Mia+0.044
 first+0.043
 Sue+0.043
 Jen+0.042
Neg logit contributions
hing-0.071
ming-0.068
 streaming-0.062
ting-0.060
ning-0.059
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 first+0.159
 moment+0.157
 Mia+0.153
 Jen+0.150
 Sue+0.148
Neg logit contributions
hing-0.248
 streaming-0.237
ming-0.231
onder-0.211
cient-0.207
Token forty
Token ID16571
Feature activation+5.50e+00

Pos logit contributions
 moment+0.049
 Mia+0.047
 Jen+0.047
 Sue+0.046
 first+0.046
Neg logit contributions
hing-0.085
ming-0.081
 streaming-0.080
ning-0.072
ting-0.071
Token-
Token ID12
Feature activation+1.55e+00

Pos logit contributions
 moment+0.232
 Mia+0.227
 first+0.225
 Sue+0.225
 Jen+0.222
Neg logit contributions
hing-0.190
ming-0.183
 streaming-0.172
onder-0.163
ning-0.157
Tokenseven
Token ID26548
Feature activation+6.76e+00

Pos logit contributions
 moment+0.049
 Mia+0.046
 first+0.045
 Sue+0.044
 Jen+0.043
Neg logit contributions
hing-0.074
ming-0.071
 streaming-0.066
ning-0.063
ting-0.062
Token,
Token ID11
Feature activation+1.54e+00

Pos logit contributions
 first+0.208
 moment+0.206
 Mia+0.200
 Jen+0.196
 Sue+0.193
Neg logit contributions
hing-0.327
 streaming-0.312
ming-0.304
onder-0.279
ning-0.271
Token forty
Token ID16571
Feature activation+5.35e+00

Pos logit contributions
 moment+0.051
 Sue+0.048
oe+0.048
 Jen+0.047
 Mia+0.045
Neg logit contributions
hing-0.065
ming-0.064
 streaming-0.063
onder-0.055
ning-0.055
Token-
Token ID12
Feature activation+6.43e-01

Pos logit contributions
 moment+0.231
 first+0.226
 Mia+0.225
 Sue+0.223
 Jen+0.220
Neg logit contributions
hing-0.179
ming-0.171
 streaming-0.163
onder-0.153
ug-0.144
Tokeneight
Token ID26022
Feature activation+4.51e+00

Pos logit contributions
 moment+0.023
 first+0.022
 Mia+0.021
 Sue+0.021
 Jen+0.020
Neg logit contributions
hing-0.028
ming-0.026
 streaming-0.024
onder-0.023
se-0.022
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 moment+0.156
 first+0.150
 Mia+0.149
 Jen+0.147
 Sue+0.145
Neg logit contributions
hing-0.191
ming-0.187
 streaming-0.186
onder-0.167
ning-0.166
Token I
Token ID314
Feature activation+8.01e-01

Pos logit contributions
 moment+0.004
 first+0.003
oe+0.003
 Mia+0.003
 minute+0.003
Neg logit contributions
hing-0.006
 streaming-0.006
ming-0.006
severe-0.005
cient-0.005
Token was
Token ID373
Feature activation-7.84e-02

Pos logit contributions
 first+0.027
 moment+0.027
 Mia+0.026
 Jen+0.026
 Sue+0.025
Neg logit contributions
hing-0.036
 streaming-0.034
ming-0.033
onder-0.030
cient-0.029
Token the
Token ID262
Feature activation-2.01e-01

Pos logit contributions
hing+0.004
 streaming+0.004
ming+0.003
onder+0.003
ting+0.003
Neg logit contributions
 moment-0.003
 first-0.002
 Jen-0.002
 Mia-0.002
 minute-0.002
Token best
Token ID1266
Feature activation-1.01e+00

Pos logit contributions
hing+0.009
 streaming+0.009
ming+0.008
onder+0.008
cient+0.007
Neg logit contributions
 moment-0.007
 first-0.007
 Mia-0.007
 Jen-0.006
 Sue-0.006
Token h
Token ID289
Feature activation+8.92e-02

Pos logit contributions
hing+0.045
 streaming+0.043
ming+0.042
 echoed+0.038
onder+0.038
Neg logit contributions
 first-0.031
 moment-0.031
 Jen-0.030
 Mia-0.029
ila-0.029
Tokenider
Token ID1304
Feature activation+6.76e+00

Pos logit contributions
 moment+0.002
 first+0.002
 Mia+0.002
 Sue+0.002
 minute+0.002
Neg logit contributions
hing-0.005
ming-0.005
 streaming-0.005
ug-0.005
onder-0.005
Token!"
Token ID2474
Feature activation+1.68e+00

Pos logit contributions
 moment+0.192
ila+0.179
 Jen+0.179
 first+0.177
 Sue+0.175
Neg logit contributions
hing-0.289
 streaming-0.284
ming-0.277
onder-0.273
ug-0.262
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
 moment+0.052
 first+0.051
 minute+0.047
oe+0.046
ila+0.044
Neg logit contributions
 streaming-0.075
hing-0.071
ming-0.070
onder-0.064
ned-0.063
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
 moment+0.005
 first+0.005
 minute+0.005
 Jen+0.005
 Mia+0.005
Neg logit contributions
hing-0.010
 streaming-0.009
ming-0.009
onder-0.008
ug-0.008
TokenTom
Token ID13787
Feature activation+2.44e+00

Pos logit contributions
 first+0.010
 moment+0.010
 Mia+0.010
 Jen+0.010
 Sue+0.009
Neg logit contributions
hing-0.016
 streaming-0.015
ming-0.015
onder-0.014
severe-0.013
Token saw
Token ID2497
Feature activation+1.05e+00

Pos logit contributions
 first+0.072
 Mia+0.071
 Jen+0.066
 Sue+0.066
 moment+0.066
Neg logit contributions
hing-0.130
 streaming-0.122
ming-0.116
severe-0.111
cient-0.110
Token big
Token ID1263
Feature activation-9.86e-01

Pos logit contributions
 first+0.015
 Mia+0.015
 Jen+0.015
 moment+0.014
 Sue+0.014
Neg logit contributions
hing-0.024
 streaming-0.023
ming-0.022
onder-0.021
cient-0.020
Token farm
Token ID5318
Feature activation+4.51e-01

Pos logit contributions
hing+0.042
 streaming+0.040
ming+0.039
onder+0.035
severe+0.035
Neg logit contributions
 first-0.033
 Jen-0.032
ila-0.032
 Mia-0.032
 moment-0.031
Token a
Token ID257
Feature activation+1.37e+00

Pos logit contributions
 moment+0.012
 first+0.012
 Mia+0.012
 Jen+0.012
 Sue+0.012
Neg logit contributions
hing-0.023
 streaming-0.022
ming-0.021
onder-0.020
cient-0.019
Token long
Token ID890
Feature activation+1.73e-01

Pos logit contributions
 first+0.041
 Mia+0.041
 Jen+0.040
 moment+0.040
 Sue+0.039
Neg logit contributions
hing-0.065
 streaming-0.062
ming-0.061
onder-0.056
cient-0.055
Token time
Token ID640
Feature activation+2.38e+00

Pos logit contributions
 first+0.005
 moment+0.005
 Mia+0.005
 Jen+0.005
 Sue+0.005
Neg logit contributions
hing-0.009
 streaming-0.008
ming-0.008
onder-0.007
cient-0.007
Token ago
Token ID2084
Feature activation+6.73e+00

Pos logit contributions
 moment+0.059
 Mia+0.057
 first+0.057
 Sue+0.056
 Jen+0.056
Neg logit contributions
hing-0.123
 streaming-0.119
ming-0.116
onder-0.107
ning-0.104
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
 moment+0.168
 Mia+0.165
 first+0.164
 Jen+0.160
 Sue+0.155
Neg logit contributions
hing-0.347
 streaming-0.338
ming-0.329
onder-0.307
ning-0.299
Token Lily
Token ID20037
Feature activation+1.62e+00

Pos logit contributions
 moment+0.001
 first+0.001
ila+0.001
oe+0.001
 next+0.001
Neg logit contributions
 streaming-0.003
hing-0.003
ming-0.003
usted-0.003
 echoed-0.003
Token listened
Token ID16399
Feature activation+4.83e-01

Pos logit contributions
 first+0.049
 Mia+0.045
 moment+0.045
 next+0.043
 Sue+0.041
Neg logit contributions
hing-0.085
 streaming-0.082
ming-0.076
onder-0.073
severe-0.073
Token as
Token ID355
Feature activation-9.52e-01

Pos logit contributions
 Jen+0.017
ila+0.016
 Mia+0.016
 Sue+0.016
 moment+0.016
Neg logit contributions
hing-0.020
 streaming-0.020
ming-0.020
onder-0.017
ning-0.017
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
hing+0.048
 streaming+0.047
ming+0.046
ug+0.042
onder+0.042
Neg logit contributions
 moment-0.024
 first-0.023
 minute-0.022
oe-0.022
 Jen-0.021
TokenYou
Token ID1639
Feature activation-7.79e-01

Pos logit contributions
hing+0.049
 streaming+0.045
ming+0.045
onder+0.040
severe+0.039
Neg logit contributions
 first-0.034
 Mia-0.032
 Jen-0.032
 moment-0.032
 Sue-0.031
Token're
Token ID821
Feature activation-2.45e+00

Pos logit contributions
 streaming+0.041
hing+0.041
ming+0.040
ning+0.039
ting+0.038
Neg logit contributions
 first-0.022
 Mia-0.020
 Sue-0.019
 Jen-0.018
 moment-0.018
Token a
Token ID257
Feature activation+9.82e-01

Pos logit contributions
hing+0.131
 streaming+0.125
ming+0.122
onder+0.113
cient+0.110
Neg logit contributions
 first-0.065
 moment-0.064
 Mia-0.062
 Jen-0.060
 Sue-0.059
Token good
Token ID922
Feature activation-3.62e-01

Pos logit contributions
 first+0.030
 moment+0.029
 Jen+0.029
 Mia+0.029
 Sue+0.028
Neg logit contributions
hing-0.047
 streaming-0.044
ming-0.044
onder-0.040
cient-0.039
Token h
Token ID289
Feature activation-1.79e-01

Pos logit contributions
hing+0.018
 streaming+0.018
ming+0.017
onder+0.016
cient+0.015
Neg logit contributions
 first-0.011
 moment-0.011
 Mia-0.011
 Jen-0.010
 Sue-0.010
Tokenider
Token ID1304
Feature activation+6.72e+00

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
ug+0.009
Neg logit contributions
 moment-0.004
 first-0.003
 Mia-0.003
 Sue-0.003
 Jen-0.003
Token,
Token ID11
Feature activation-4.36e-01

Pos logit contributions
 moment+0.230
ila+0.225
 first+0.220
 Jen+0.217
 Mia+0.212
Neg logit contributions
hing-0.246
 streaming-0.244
ming-0.239
onder-0.238
ug-0.223
Token Lily
Token ID20037
Feature activation+1.55e+00

Pos logit contributions
hing+0.025
ming+0.024
 streaming+0.023
 echoed+0.021
severe+0.021
Neg logit contributions
 moment-0.008
oe-0.008
 Jen-0.008
ila-0.008
 first-0.008
Token."
Token ID526
Feature activation+1.52e+00

Pos logit contributions
 first+0.041
 moment+0.040
 Mia+0.039
 Jen+0.038
 Sue+0.038
Neg logit contributions
hing-0.082
 streaming-0.079
ming-0.077
onder-0.071
cient-0.070
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 first+0.037
 moment+0.036
ila+0.034
 Jen+0.033
 Mia+0.032
Neg logit contributions
hing-0.080
 streaming-0.078
ming-0.078
onder-0.071
cient-0.070
Token
Token ID198
Feature activation-2.87e-02

Pos logit contributions
hing+0.006
 streaming+0.006
ming+0.006
onder+0.005
ug+0.005
Neg logit contributions
 first-0.003
 moment-0.003
 Jen-0.003
 Mia-0.003
 minute-0.003
Token in
Token ID287
Feature activation-1.85e-01

Pos logit contributions
 Jen+0.094
 Mia+0.092
 moment+0.090
ila+0.090
 Sue+0.089
Neg logit contributions
hing-0.147
 streaming-0.145
ming-0.140
onder-0.127
ning-0.125
Token the
Token ID262
Feature activation-7.48e-01

Pos logit contributions
 streaming+0.008
hing+0.008
ming+0.007
ting+0.007
 Buster+0.006
Neg logit contributions
ila-0.006
ider-0.006
 moment-0.006
oe-0.006
ia-0.006
Token park
Token ID3952
Feature activation+3.54e+00

Pos logit contributions
hing+0.030
 streaming+0.030
ming+0.028
 Australian+0.026
onder+0.026
Neg logit contributions
 Mia-0.027
 first-0.026
 Jen-0.026
 moment-0.026
 Sue-0.026
Token many
Token ID867
Feature activation+1.01e+00

Pos logit contributions
 moment+0.092
 Jen+0.089
 Mia+0.089
 first+0.087
ila+0.087
Neg logit contributions
hing-0.181
 streaming-0.174
ming-0.172
ning-0.156
ting-0.156
Token years
Token ID812
Feature activation+9.87e-01

Pos logit contributions
 first+0.035
 moment+0.034
 Mia+0.034
 Jen+0.033
 Sue+0.032
Neg logit contributions
hing-0.046
 streaming-0.043
ming-0.042
onder-0.038
cient-0.037
Token ago
Token ID2084
Feature activation+6.71e+00

Pos logit contributions
 moment+0.026
 Mia+0.025
 first+0.024
 Jen+0.024
 Sue+0.024
Neg logit contributions
hing-0.049
 streaming-0.049
ming-0.047
onder-0.044
ug-0.043
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 first+0.148
 Mia+0.148
 moment+0.148
 Jen+0.142
 Sue+0.137
Neg logit contributions
hing-0.367
 streaming-0.361
ming-0.349
onder-0.325
ning-0.318
Token Toby
Token ID37578
Feature activation-2.09e+00

Pos logit contributions
hing+0.009
 streaming+0.008
ming+0.008
onder+0.007
ning+0.007
Neg logit contributions
 first-0.008
 Mia-0.008
 moment-0.008
 Jen-0.007
 Sue-0.007
Token was
Token ID373
Feature activation-3.76e-01

Pos logit contributions
hing+0.111
 streaming+0.105
ming+0.100
onder+0.096
severe+0.095
Neg logit contributions
 first-0.062
 moment-0.056
 Mia-0.055
 next-0.053
 Sue-0.051
Token amazed
Token ID24562
Feature activation-3.16e-01

Pos logit contributions
hing+0.019
 streaming+0.017
ming+0.017
onder+0.016
cient+0.016
Neg logit contributions
 first-0.012
 Mia-0.012
 Sue-0.011
 Jen-0.011
 moment-0.011
Token!
Token ID0
Feature activation-8.54e-01

Pos logit contributions
hing+0.016
 streaming+0.015
ming+0.015
onder+0.014
severe+0.014
Neg logit contributions
 first-0.009
 moment-0.009
 Mia-0.009
 Sue-0.009
 Jen-0.009
Token-
Token ID12
Feature activation+8.25e-01

Pos logit contributions
 moment+0.217
 Sue+0.211
 Jill+0.207
 Jen+0.206
 ranger+0.205
Neg logit contributions
hing-0.178
ming-0.177
ils-0.168
onder-0.163
ug-0.163
Tokensix
Token ID19412
Feature activation+4.85e+00

Pos logit contributions
 moment+0.030
 first+0.026
 Sue+0.026
 Mia+0.026
 minute+0.025
Neg logit contributions
hing-0.033
ming-0.033
ting-0.029
check-0.029
se-0.028
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 moment+0.171
 first+0.165
 Mia+0.160
 Jen+0.160
 Sue+0.158
Neg logit contributions
hing-0.201
 streaming-0.195
ming-0.192
onder-0.176
cient-0.170
Token twenty
Token ID8208
Feature activation+5.24e+00

Pos logit contributions
 moment+0.041
oe+0.037
 Sue+0.036
 Jen+0.036
ila+0.036
Neg logit contributions
hing-0.065
ming-0.062
 streaming-0.061
ning-0.056
ting-0.056
Token-
Token ID12
Feature activation+7.43e-01

Pos logit contributions
 moment+0.181
 Sue+0.175
 Jen+0.171
 Jill+0.170
 ranger+0.169
Neg logit contributions
ming-0.188
hing-0.185
ils-0.179
ning-0.176
onder-0.175
Tokenseven
Token ID26548
Feature activation+6.71e+00

Pos logit contributions
 moment+0.026
 first+0.023
 Sue+0.023
 Mia+0.022
 minute+0.022
Neg logit contributions
ming-0.031
hing-0.030
se-0.027
ting-0.027
check-0.027
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 moment+0.236
 first+0.229
 Jen+0.219
 Mia+0.217
 Sue+0.217
Neg logit contributions
hing-0.282
 streaming-0.272
ming-0.268
onder-0.247
ning-0.232
Token twenty
Token ID8208
Feature activation+5.58e+00

Pos logit contributions
 moment+0.060
oe+0.060
 ranger+0.056
 minute+0.053
 Sue+0.053
Neg logit contributions
hing-0.063
 streaming-0.063
ming-0.062
ning-0.055
ting-0.054
Token-
Token ID12
Feature activation+3.64e-01

Pos logit contributions
 moment+0.215
 Sue+0.205
 first+0.203
 Jen+0.201
 Jill+0.200
Neg logit contributions
hing-0.188
ming-0.187
ils-0.174
 streaming-0.171
ug-0.169
Tokeneight
Token ID26022
Feature activation+4.38e+00

Pos logit contributions
 moment+0.015
 first+0.014
 Sue+0.013
 Mia+0.013
 minute+0.013
Neg logit contributions
hing-0.014
ming-0.013
ils-0.011
se-0.011
ting-0.011
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 moment+0.178
 minute+0.164
 first+0.160
ila+0.159
 Jen+0.157
Neg logit contributions
ming-0.150
 streaming-0.142
hing-0.139
ning-0.132
onder-0.127
Token he
Token ID339
Feature activation+9.61e-01

Pos logit contributions
 moment+0.077
 first+0.076
 Jen+0.075
 Mia+0.075
 Sue+0.073
Neg logit contributions
hing-0.103
 streaming-0.099
ming-0.097
ning-0.086
onder-0.085
Token had
Token ID550
Feature activation+2.40e+00

Pos logit contributions
 moment+0.033
 first+0.033
 Mia+0.033
 Jen+0.032
 Sue+0.032
Neg logit contributions
hing-0.042
 streaming-0.039
ming-0.039
onder-0.035
cient-0.034
Token seen
Token ID1775
Feature activation+1.49e+00

Pos logit contributions
 moment+0.079
 Mia+0.078
 first+0.077
 Jen+0.075
 Sue+0.075
Neg logit contributions
hing-0.108
 streaming-0.104
ming-0.101
onder-0.091
ning-0.088
Token a
Token ID257
Feature activation+5.92e-01

Pos logit contributions
 first+0.036
 moment+0.035
 Mia+0.034
 Jen+0.033
 Sue+0.033
Neg logit contributions
hing-0.082
 streaming-0.079
ming-0.077
onder-0.071
cient-0.070
Token year
Token ID614
Feature activation+4.09e+00

Pos logit contributions
 first+0.023
 moment+0.022
 Mia+0.021
 Jen+0.021
 Sue+0.021
Neg logit contributions
hing-0.025
 streaming-0.024
ming-0.022
onder-0.021
severe-0.020
Token ago
Token ID2084
Feature activation+6.70e+00

Pos logit contributions
 Mia+0.108
 moment+0.107
 Jen+0.105
 first+0.104
 Sue+0.103
Neg logit contributions
hing-0.214
 streaming-0.203
ming-0.201
onder-0.185
cient-0.180
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 moment+0.153
 Mia+0.153
 first+0.152
 Jen+0.146
 Sue+0.142
Neg logit contributions
hing-0.366
 streaming-0.355
ming-0.346
onder-0.322
ning-0.315
Token He
Token ID679
Feature activation+5.30e-02

Pos logit contributions
 streaming+0.009
hing+0.009
ming+0.008
onder+0.008
 Australian+0.008
Neg logit contributions
 moment-0.007
 first-0.007
oe-0.006
ila-0.006
 minute-0.006
Token went
Token ID1816
Feature activation-1.37e+00

Pos logit contributions
 first+0.002
 Mia+0.002
 moment+0.002
 Jen+0.002
 Sue+0.002
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.002
cient-0.002
Token home
Token ID1363
Feature activation-2.78e-01

Pos logit contributions
hing+0.053
ming+0.052
 streaming+0.052
ting+0.046
ning+0.045
Neg logit contributions
 moment-0.049
 Mia-0.049
 Jen-0.049
ila-0.048
 Sue-0.047
Token,
Token ID11
Feature activation-4.55e-01

Pos logit contributions
hing+0.012
 streaming+0.012
ming+0.012
onder+0.011
ning+0.010
Neg logit contributions
 moment-0.009
 Mia-0.009
 Jen-0.009
 first-0.008
ila-0.008
Token and
Token ID290
Feature activation+1.44e+00

Pos logit contributions
 moment+0.083
 first+0.082
 Jen+0.076
 Mia+0.076
 minute+0.075
Neg logit contributions
hing-0.131
 streaming-0.128
ming-0.124
onder-0.115
ning-0.112
Token Tom
Token ID4186
Feature activation+2.49e+00

Pos logit contributions
 moment+0.038
 first+0.031
 minute+0.030
oe+0.029
 next+0.027
Neg logit contributions
hing-0.073
 streaming-0.072
ming-0.071
onder-0.065
 echoed-0.064
Token is
Token ID318
Feature activation+1.70e+00

Pos logit contributions
 moment+0.064
 first+0.063
 Jen+0.058
 Mia+0.058
 minute+0.057
Neg logit contributions
hing-0.131
 streaming-0.124
ming-0.123
onder-0.112
ning-0.110
Token the
Token ID262
Feature activation+6.08e-01

Pos logit contributions
 moment+0.047
 first+0.045
 Jen+0.042
 Mia+0.042
 minute+0.042
Neg logit contributions
hing-0.084
 streaming-0.082
ming-0.082
ning-0.073
onder-0.073
Token h
Token ID289
Feature activation-3.34e-01

Pos logit contributions
 moment+0.018
 Jen+0.017
 first+0.017
oe+0.017
 Sue+0.017
Neg logit contributions
hing-0.027
 streaming-0.026
ming-0.026
onder-0.024
ning-0.024
Tokenider
Token ID1304
Feature activation+6.69e+00

Pos logit contributions
hing+0.020
ming+0.020
 streaming+0.020
onder+0.019
ug+0.018
Neg logit contributions
 moment-0.006
 first-0.006
 Sue-0.005
 Mia-0.005
 Jen-0.005
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
 moment+0.164
 first+0.158
 Jen+0.148
 Sue+0.147
 Mia+0.144
Neg logit contributions
hing-0.330
 streaming-0.329
ming-0.318
onder-0.305
 echoed-0.293
Token Tom
Token ID4186
Feature activation+1.36e+00

Pos logit contributions
 streaming+0.006
ming+0.006
 Australian+0.006
 Buster+0.005
ning+0.005
Neg logit contributions
 moment-0.005
 first-0.004
 apology-0.004
 minute-0.004
ila-0.004
Token runs
Token ID4539
Feature activation+2.15e-01

Pos logit contributions
 first+0.037
 moment+0.037
 Mia+0.036
 Jen+0.035
 Sue+0.034
Neg logit contributions
hing-0.071
 streaming-0.068
ming-0.066
onder-0.061
cient-0.060
Token to
Token ID284
Feature activation+1.35e-01

Pos logit contributions
 moment+0.008
 first+0.008
 Jen+0.008
 Mia+0.008
ila+0.007
Neg logit contributions
hing-0.009
 streaming-0.008
ming-0.008
ting-0.007
onder-0.007
Token a
Token ID257
Feature activation-2.89e-01

Pos logit contributions
ider+0.006
onna+0.006
iw+0.006
 moment+0.005
ila+0.005
Neg logit contributions
 off-0.003
hing-0.003
check-0.003
ting-0.003
 Matthew-0.003
Token You
Token ID921
Feature activation+1.19e-01

Pos logit contributions
hing+0.047
 streaming+0.046
ming+0.045
onder+0.041
cient+0.041
Neg logit contributions
 moment-0.027
 first-0.027
 Mia-0.023
 minute-0.023
 Jen-0.023
Token're
Token ID821
Feature activation-2.63e+00

Pos logit contributions
 first+0.004
 Mia+0.003
 moment+0.003
 Sue+0.003
 Jen+0.003
Neg logit contributions
ming-0.006
hing-0.006
 streaming-0.006
ning-0.005
hat-0.005
Token a
Token ID257
Feature activation+6.83e-01

Pos logit contributions
hing+0.140
 streaming+0.132
ming+0.130
onder+0.120
cient+0.118
Neg logit contributions
 first-0.076
 Mia-0.072
 moment-0.070
 Sue-0.069
 Jen-0.069
Token great
Token ID1049
Feature activation+1.24e-01

Pos logit contributions
 first+0.021
 moment+0.020
 Mia+0.020
 Jen+0.019
 Sue+0.019
Neg logit contributions
hing-0.034
 streaming-0.032
ming-0.031
onder-0.029
cient-0.028
Token h
Token ID289
Feature activation-4.40e-01

Pos logit contributions
 first+0.005
 moment+0.004
 Mia+0.004
 Jen+0.004
 Sue+0.004
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.005
onder-0.004
severe-0.004
Tokenider
Token ID1304
Feature activation+6.68e+00

Pos logit contributions
hing+0.024
 streaming+0.023
ming+0.022
onder+0.021
ug+0.021
Neg logit contributions
 moment-0.011
 first-0.011
 Mia-0.011
 Sue-0.010
 Jen-0.010
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
 moment+0.203
 first+0.199
 Jen+0.195
ila+0.193
 Mia+0.192
Neg logit contributions
hing-0.290
 streaming-0.285
ming-0.277
onder-0.267
ug-0.251
Token Let
Token ID3914
Feature activation+1.00e+00

Pos logit contributions
hing+0.029
 streaming+0.028
ming+0.027
onder+0.025
severe+0.024
Neg logit contributions
 moment-0.017
 first-0.017
 Mia-0.016
 Jen-0.016
 Sue-0.016
Token's
Token ID338
Feature activation-4.01e+00

Pos logit contributions
 first+0.038
 Mia+0.035
 Sue+0.035
 moment+0.035
 Jen+0.034
Neg logit contributions
hing-0.045
 streaming-0.043
ming-0.041
onder-0.037
severe-0.037
Token play
Token ID711
Feature activation+3.73e-01

Pos logit contributions
hing+0.209
 streaming+0.203
ming+0.200
ning+0.185
cient+0.181
Neg logit contributions
 first-0.126
 Mia-0.116
 Sue-0.111
 Jen-0.109
 moment-0.104
Token again
Token ID757
Feature activation+1.95e+00

Pos logit contributions
 moment+0.011
 Jen+0.011
 first+0.011
 Sue+0.010
 Mia+0.010
Neg logit contributions
hing-0.018
ming-0.017
 streaming-0.017
onder-0.016
ting-0.015
Token-
Token ID12
Feature activation+1.58e+00

Pos logit contributions
 moment+0.252
 Mia+0.246
 first+0.245
 Sue+0.244
 Jen+0.240
Neg logit contributions
hing-0.211
ming-0.203
 streaming-0.196
onder-0.181
ning-0.175
Tokensix
Token ID19412
Feature activation+5.17e+00

Pos logit contributions
 moment+0.050
 Mia+0.047
 first+0.046
 Sue+0.046
 Jen+0.045
Neg logit contributions
hing-0.075
ming-0.072
 streaming-0.067
ting-0.064
ning-0.063
Token,
Token ID11
Feature activation+1.81e+00

Pos logit contributions
 first+0.157
 moment+0.154
 Mia+0.151
 Jen+0.147
 Sue+0.145
Neg logit contributions
hing-0.255
 streaming-0.242
ming-0.235
onder-0.215
cient-0.212
Token thirty
Token ID12277
Feature activation+5.89e+00

Pos logit contributions
 moment+0.053
 first+0.051
 Jen+0.049
 Sue+0.049
 Mia+0.049
Neg logit contributions
hing-0.087
 streaming-0.084
ming-0.082
ting-0.074
onder-0.073
Token-
Token ID12
Feature activation+1.45e+00

Pos logit contributions
 moment+0.241
 first+0.236
 Mia+0.235
 Sue+0.232
 Jen+0.229
Neg logit contributions
hing-0.211
ming-0.202
 streaming-0.197
onder-0.181
ning-0.176
Tokenseven
Token ID26548
Feature activation+6.66e+00

Pos logit contributions
 moment+0.045
 Mia+0.042
 first+0.042
 Sue+0.042
 Jen+0.041
Neg logit contributions
hing-0.069
ming-0.066
 streaming-0.062
ning-0.059
ting-0.059
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
 first+0.199
 moment+0.197
 Mia+0.191
 Jen+0.187
 Sue+0.184
Neg logit contributions
hing-0.329
 streaming-0.313
ming-0.306
onder-0.281
cient-0.273
Token thirty
Token ID12277
Feature activation+5.85e+00

Pos logit contributions
 moment+0.053
oe+0.049
 Sue+0.048
 Jen+0.047
 first+0.046
Neg logit contributions
hing-0.067
 streaming-0.067
ming-0.066
onder-0.058
ning-0.057
Token-
Token ID12
Feature activation+6.71e-01

Pos logit contributions
 moment+0.249
 first+0.245
 Mia+0.242
 Sue+0.239
 Jen+0.236
Neg logit contributions
hing-0.200
ming-0.190
 streaming-0.185
onder-0.169
ug-0.161
Tokeneight
Token ID26022
Feature activation+4.56e+00

Pos logit contributions
 moment+0.024
 first+0.023
 Mia+0.022
 Sue+0.022
 Jen+0.021
Neg logit contributions
hing-0.029
ming-0.027
 streaming-0.026
onder-0.024
ting-0.024
Token,
Token ID11
Feature activation+1.50e+00

Pos logit contributions
 moment+0.151
 first+0.148
 Mia+0.144
 Jen+0.141
 Sue+0.139
Neg logit contributions
hing-0.205
 streaming-0.198
ming-0.195
onder-0.177
ning-0.174
Token and
Token ID290
Feature activation+1.54e+00

Pos logit contributions
 moment+0.080
 first+0.079
 Jen+0.074
 Mia+0.074
 minute+0.072
Neg logit contributions
hing-0.129
 streaming-0.126
ming-0.122
onder-0.112
ning-0.110
Token Ben
Token ID3932
Feature activation+6.06e-01

Pos logit contributions
 moment+0.044
 first+0.040
oe+0.037
 minute+0.037
 Jen+0.036
Neg logit contributions
hing-0.074
 streaming-0.072
ming-0.071
onder-0.065
 echoed-0.064
Token is
Token ID318
Feature activation+1.90e+00

Pos logit contributions
 first+0.016
 moment+0.016
 Mia+0.015
 Jen+0.015
 Sue+0.014
Neg logit contributions
hing-0.032
 streaming-0.030
ming-0.030
onder-0.027
ning-0.027
Token the
Token ID262
Feature activation+6.56e-01

Pos logit contributions
 first+0.048
 moment+0.048
 Mia+0.045
 Jen+0.044
 Sue+0.043
Neg logit contributions
hing-0.102
 streaming-0.098
ming-0.096
onder-0.088
ning-0.087
Token h
Token ID289
Feature activation-3.02e-01

Pos logit contributions
 moment+0.021
 Jen+0.020
 first+0.020
oe+0.019
 Sue+0.019
Neg logit contributions
hing-0.028
 streaming-0.028
ming-0.027
onder-0.024
ning-0.024
Tokenider
Token ID1304
Feature activation+6.62e+00

Pos logit contributions
hing+0.019
 streaming+0.018
ming+0.018
onder+0.017
ug+0.017
Neg logit contributions
 moment-0.006
 first-0.005
 Sue-0.004
 Mia-0.004
 Jen-0.004
Token.
Token ID13
Feature activation-9.98e-02

Pos logit contributions
 moment+0.162
 first+0.157
 Jen+0.148
 Sue+0.144
ila+0.144
Neg logit contributions
hing-0.328
 streaming-0.325
ming-0.314
onder-0.301
 echoed-0.290
Token Lily
Token ID20037
Feature activation+1.20e+00

Pos logit contributions
 streaming+0.004
ming+0.004
 Australian+0.004
hing+0.004
onder+0.004
Neg logit contributions
 moment-0.003
 first-0.003
oe-0.003
 minute-0.003
ila-0.003
Token counts
Token ID9853
Feature activation+2.02e+00

Pos logit contributions
 first+0.032
 Mia+0.031
 moment+0.029
 Jen+0.028
 Sue+0.028
Neg logit contributions
hing-0.067
 streaming-0.064
ming-0.061
onder-0.058
cient-0.056
Token to
Token ID284
Feature activation+8.22e-01

Pos logit contributions
 first+0.056
 moment+0.055
 Mia+0.054
 Jen+0.052
 Sue+0.052
Neg logit contributions
hing-0.105
 streaming-0.100
ming-0.097
onder-0.090
cient-0.088
Token ten
Token ID3478
Feature activation+2.53e+00

Pos logit contributions
oe+0.025
 moment+0.025
ila+0.023
 minute+0.023
 Jen+0.022
Neg logit contributions
hing-0.035
ming-0.034
 streaming-0.033
ting-0.030
onder-0.030
Token sk
Token ID1341
Feature activation-1.86e+00

Pos logit contributions
hing+0.012
 streaming+0.011
ming+0.011
onder+0.010
cient+0.010
Neg logit contributions
 first-0.007
 moment-0.007
 Mia-0.007
 Jen-0.007
 Sue-0.007
Tokenates
Token ID689
Feature activation-2.37e+00

Pos logit contributions
hing+0.066
 streaming+0.064
ming+0.059
 echoed+0.054
cient+0.053
Neg logit contributions
 first-0.080
 Mia-0.076
 Jen-0.076
ila-0.075
 moment-0.074
Token and
Token ID290
Feature activation-6.69e-02

Pos logit contributions
hing+0.106
 streaming+0.100
ming+0.099
onder+0.089
cient+0.088
Neg logit contributions
 moment-0.078
 Mia-0.075
 Jen-0.075
 Sue-0.073
 first-0.073
Token helmets
Token ID28359
Feature activation-1.53e+00

Pos logit contributions
hing+0.003
 streaming+0.003
ming+0.003
onder+0.003
severe+0.002
Neg logit contributions
 first-0.002
 moment-0.002
 Mia-0.002
 Jen-0.002
 Sue-0.002
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
hing+0.087
 streaming+0.085
ming+0.082
onder+0.077
 Australian+0.077
Neg logit contributions
 first-0.036
 moment-0.033
 Mia-0.031
 Jen-0.030
 Sue-0.029
Token pads
Token ID21226
Feature activation-7.24e+00

Pos logit contributions
hing+0.005
 streaming+0.005
ming+0.004
onder+0.004
cient+0.004
Neg logit contributions
 first-0.004
 moment-0.004
 Mia-0.004
 Jen-0.003
 Sue-0.003
Token.
Token ID13
Feature activation-7.21e-03

Pos logit contributions
hing+0.384
 streaming+0.353
ming+0.353
onder+0.323
cient+0.320
Neg logit contributions
 moment-0.189
 Mia-0.186
 Jen-0.179
 Sue-0.174
 first-0.174
Token They
Token ID1119
Feature activation-3.15e-01

Pos logit contributions
hing+0.000
 streaming+0.000
ming+0.000
onder+0.000
severe+0.000
Neg logit contributions
 first-0.000
 moment-0.000
 Mia-0.000
 Jen-0.000
 Sue-0.000
Token use
Token ID779
Feature activation+1.11e-01

Pos logit contributions
hing+0.016
 streaming+0.016
ming+0.016
onder+0.015
ning+0.015
Neg logit contributions
 first-0.009
 moment-0.008
 Mia-0.008
 Jen-0.008
 Sue-0.007
Token a
Token ID257
Feature activation-1.60e-01

Pos logit contributions
 first+0.003
 moment+0.003
 Mia+0.003
 Jen+0.003
 Sue+0.003
Neg logit contributions
hing-0.006
 streaming-0.006
ming-0.006
onder-0.005
cient-0.005
Token puck
Token ID20695
Feature activation-2.00e-01

Pos logit contributions
hing+0.009
 streaming+0.008
ming+0.008
onder+0.007
ting+0.007
Neg logit contributions
 moment-0.004
 first-0.004
 Mia-0.004
 minute-0.004
 Jen-0.004
Token.
Token ID13
Feature activation-4.38e-01

Pos logit contributions
 first+0.009
 moment+0.009
 Mia+0.009
 Jen+0.009
ila+0.008
Neg logit contributions
hing-0.016
 streaming-0.016
ming-0.016
ning-0.014
onder-0.014
Token<|endoftext|>
Token ID50256
Feature activation-3.51e-01

Pos logit contributions
hing+0.023
 streaming+0.022
ming+0.022
onder+0.020
cient+0.020
Neg logit contributions
 first-0.012
 moment-0.011
 Mia-0.011
 Jen-0.010
 Sue-0.010
TokenM
Token ID44
Feature activation-2.05e+00

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.016
onder+0.015
 Australian+0.014
Neg logit contributions
 first-0.010
 moment-0.009
 Jen-0.009
 Mia-0.009
 minute-0.009
Tokenama
Token ID1689
Feature activation-4.15e+00

Pos logit contributions
hing+0.068
 streaming+0.067
ming+0.060
onder+0.060
cient+0.058
Neg logit contributions
 moment-0.088
 Mia-0.083
 Jen-0.082
 first-0.082
 Sue-0.081
Token and
Token ID290
Feature activation-7.88e-01

Pos logit contributions
hing+0.213
 streaming+0.203
ming+0.197
severe+0.185
onder+0.183
Neg logit contributions
 first-0.120
 Mia-0.118
 moment-0.114
 Jen-0.112
 Sue-0.112
Token pap
Token ID20461
Feature activation-7.04e+00

Pos logit contributions
hing+0.048
ming+0.046
 streaming+0.044
onder+0.042
severe+0.042
Neg logit contributions
 Sue-0.019
 Mia-0.019
 Jen-0.018
 Sara-0.017
 first-0.017
Tokena
Token ID64
Feature activation-1.77e+00

Pos logit contributions
hing+0.292
 streaming+0.278
ming+0.271
onder+0.240
severe+0.237
Neg logit contributions
 first-0.271
 moment-0.266
 Mia-0.257
 Jen-0.254
 Sue-0.249
Token were
Token ID547
Feature activation-1.98e+00

Pos logit contributions
hing+0.080
 streaming+0.076
ming+0.073
ning+0.068
onder+0.068
Neg logit contributions
 first-0.064
 Mia-0.061
 moment-0.059
 Jen-0.058
 Sue-0.057
Token busy
Token ID8179
Feature activation-1.26e+00

Pos logit contributions
hing+0.090
 streaming+0.080
ming+0.080
severe+0.072
onder+0.072
Neg logit contributions
 first-0.075
 Mia-0.069
 moment-0.069
 Sue-0.067
 next-0.067
Token in
Token ID287
Feature activation+2.77e-01

Pos logit contributions
hing+0.061
 streaming+0.056
ming+0.055
severe+0.051
onder+0.051
Neg logit contributions
 first-0.042
 moment-0.040
 Mia-0.039
 Jen-0.038
 next-0.037
Token the
Token ID262
Feature activation-4.01e-01

Pos logit contributions
 moment+0.008
 first+0.008
ila+0.007
 Jen+0.007
 Mia+0.007
Neg logit contributions
hing-0.013
 streaming-0.013
ming-0.012
onder-0.011
 Australian-0.011
Token to
Token ID284
Feature activation+2.62e-01

Pos logit contributions
hing+0.083
 streaming+0.077
ming+0.074
cient+0.071
onder+0.070
Neg logit contributions
 first-0.053
 moment-0.051
 Mia-0.048
 next-0.048
 Jen-0.047
Token obey
Token ID22389
Feature activation-4.59e-01

Pos logit contributions
 first+0.008
 moment+0.007
 Mia+0.007
 Jen+0.007
 Sue+0.007
Neg logit contributions
hing-0.013
 streaming-0.013
ming-0.012
onder-0.012
cient-0.012
Token m
Token ID285
Feature activation-3.25e+00

Pos logit contributions
hing+0.024
 streaming+0.023
ming+0.022
cient+0.021
onder+0.021
Neg logit contributions
 first-0.013
 Mia-0.013
 moment-0.013
 Sue-0.012
 Jen-0.012
Tokenama
Token ID1689
Feature activation-2.92e+00

Pos logit contributions
hing+0.110
 streaming+0.104
ming+0.099
severe+0.085
onder+0.085
Neg logit contributions
 first-0.148
 Jen-0.143
 moment-0.141
 Mia-0.141
 Sue-0.138
Token and
Token ID290
Feature activation-2.70e-02

Pos logit contributions
hing+0.142
 streaming+0.135
ming+0.128
onder+0.120
severe+0.117
Neg logit contributions
 first-0.095
 moment-0.091
 Mia-0.089
 Jen-0.086
 Sue-0.084
Token pap
Token ID20461
Feature activation-7.01e+00

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
onder+0.001
 Australian+0.001
Neg logit contributions
 first-0.001
 moment-0.001
 Mia-0.001
 next-0.001
 Jen-0.001
Tokena
Token ID64
Feature activation-1.65e+00

Pos logit contributions
hing+0.280
 streaming+0.260
ming+0.254
onder+0.226
cient+0.223
Neg logit contributions
 first-0.280
 moment-0.277
 Mia-0.275
 Jen-0.270
 Sue-0.268
Token.
Token ID13
Feature activation-7.06e-01

Pos logit contributions
hing+0.083
 streaming+0.079
ming+0.076
onder+0.072
ning+0.070
Neg logit contributions
 first-0.050
 moment-0.048
 Mia-0.046
 Jen-0.044
 Sue-0.043
Token
Token ID220
Feature activation-2.56e-01

Pos logit contributions
hing+0.034
 streaming+0.033
ming+0.032
onder+0.030
 Australian+0.029
Neg logit contributions
 first-0.021
 moment-0.019
ila-0.018
oe-0.018
 next-0.017
Token
Token ID198
Feature activation-5.92e-01

Pos logit contributions
hing+0.015
 streaming+0.015
ming+0.014
onder+0.013
cient+0.013
Neg logit contributions
 first-0.005
 moment-0.005
 Mia-0.005
 Jen-0.005
 Sue-0.004
Token
Token ID198
Feature activation-4.03e-01

Pos logit contributions
hing+0.030
ming+0.028
 streaming+0.027
 Australian+0.025
onder+0.025
Neg logit contributions
 first-0.017
 moment-0.016
 Mia-0.016
 Jen-0.016
 Sue-0.015
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
hing+0.058
 streaming+0.056
ming+0.054
onder+0.051
ning+0.049
Neg logit contributions
 moment-0.025
 Jen-0.025
 first-0.025
 Mia-0.025
 Sue-0.024
Token We
Token ID775
Feature activation+6.85e-01

Pos logit contributions
 first+0.008
 moment+0.008
oe+0.007
 Mia+0.007
 Jen+0.007
Neg logit contributions
hing-0.011
 streaming-0.011
ming-0.010
onder-0.010
severe-0.010
Token have
Token ID423
Feature activation+1.18e+00

Pos logit contributions
 first+0.017
 moment+0.016
 Mia+0.016
 Jen+0.015
 Sue+0.015
Neg logit contributions
hing-0.037
 streaming-0.037
ming-0.037
ning-0.034
onder-0.033
Token chicken
Token ID9015
Feature activation-4.05e+00

Pos logit contributions
 moment+0.036
ila+0.034
 Jen+0.034
 first+0.033
 Mia+0.033
Neg logit contributions
 streaming-0.052
hing-0.052
ming-0.051
onder-0.047
cient-0.046
Token and
Token ID290
Feature activation-5.70e-01

Pos logit contributions
hing+0.205
 streaming+0.196
ming+0.191
onder+0.175
cient+0.172
Neg logit contributions
 first-0.117
 moment-0.115
 Mia-0.112
 Jen-0.109
 Sue-0.108
Token potatoes
Token ID18821
Feature activation-6.99e+00

Pos logit contributions
hing+0.029
 streaming+0.027
ming+0.027
onder+0.024
cient+0.024
Neg logit contributions
 first-0.016
 moment-0.016
 Mia-0.016
 Jen-0.015
 Sue-0.015
Token and
Token ID290
Feature activation-5.89e-01

Pos logit contributions
hing+0.348
 streaming+0.328
ming+0.324
onder+0.295
cient+0.290
Neg logit contributions
 moment-0.201
 Mia-0.196
 first-0.196
 Jen-0.195
 Sue-0.189
Token carrots
Token ID34397
Feature activation-5.99e+00

Pos logit contributions
hing+0.029
 streaming+0.028
ming+0.027
onder+0.025
cient+0.024
Neg logit contributions
 first-0.018
 moment-0.017
 Mia-0.017
 Jen-0.016
 Sue-0.016
Token and
Token ID290
Feature activation-3.79e-01

Pos logit contributions
hing+0.284
 streaming+0.266
ming+0.262
onder+0.239
cient+0.234
Neg logit contributions
 Jen-0.179
 moment-0.179
 Mia-0.178
 Sue-0.171
 first-0.168
Token a
Token ID257
Feature activation+1.13e-01

Pos logit contributions
hing+0.018
 streaming+0.018
ming+0.017
onder+0.016
cient+0.015
Neg logit contributions
 first-0.011
 moment-0.011
 Mia-0.011
 Jen-0.011
 Sue-0.011
Token large
Token ID1588
Feature activation-1.23e+00

Pos logit contributions
 Jen+0.003
 first+0.003
 Mia+0.003
 moment+0.003
 Sue+0.003
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.005
onder-0.004
cient-0.004
Token mom
Token ID1995
Feature activation+2.70e-01

Pos logit contributions
hing+0.064
 streaming+0.059
ming+0.059
ting+0.054
severe+0.053
Neg logit contributions
 first-0.025
 next-0.021
 moment-0.021
 Mia-0.020
 Jen-0.019
Token cooked
Token ID15847
Feature activation-9.09e-01

Pos logit contributions
 first+0.009
 Mia+0.008
 moment+0.008
 Sue+0.008
 next+0.008
Neg logit contributions
hing-0.014
 streaming-0.013
ming-0.012
severe-0.012
onder-0.011
Token dinner
Token ID8073
Feature activation-1.16e+00

Pos logit contributions
hing+0.051
 streaming+0.048
ming+0.047
onder+0.044
cient+0.043
Neg logit contributions
 first-0.022
 moment-0.022
 Mia-0.021
 Jen-0.020
 Sue-0.020
Token and
Token ID290
Feature activation-2.68e-01

Pos logit contributions
hing+0.058
 streaming+0.055
ming+0.055
onder+0.049
ning+0.049
Neg logit contributions
 moment-0.033
 first-0.033
 Jen-0.032
 Mia-0.032
 Sue-0.031
Token put
Token ID1234
Feature activation-9.26e-01

Pos logit contributions
hing+0.013
 streaming+0.013
ming+0.012
severe+0.011
onder+0.011
Neg logit contributions
 first-0.009
 Mia-0.008
 moment-0.008
 Sue-0.008
 Jen-0.008
Token onions
Token ID24681
Feature activation-6.94e+00

Pos logit contributions
hing+0.043
 streaming+0.041
ming+0.040
onder+0.036
ning+0.035
Neg logit contributions
 moment-0.030
 first-0.030
 Mia-0.029
 Jen-0.028
ila-0.027
Token in
Token ID287
Feature activation-4.34e-01

Pos logit contributions
hing+0.291
 streaming+0.274
ming+0.265
onder+0.241
severe+0.235
Neg logit contributions
 first-0.271
 moment-0.259
 Mia-0.253
 Jen-0.246
 Sue-0.245
Token the
Token ID262
Feature activation-3.96e-01

Pos logit contributions
hing+0.018
 streaming+0.017
ming+0.015
onder+0.015
ning+0.014
Neg logit contributions
ider-0.015
ila-0.015
ia-0.015
 Lia-0.015
 moment-0.014
Token food
Token ID2057
Feature activation-1.98e+00

Pos logit contributions
hing+0.021
 streaming+0.020
ming+0.019
onder+0.018
severe+0.018
Neg logit contributions
 first-0.011
 moment-0.011
 Mia-0.011
 Sue-0.010
 Jen-0.010
Token.
Token ID13
Feature activation-3.13e-02

Pos logit contributions
hing+0.122
 streaming+0.117
ming+0.115
onder+0.107
severe+0.106
Neg logit contributions
 first-0.038
 moment-0.035
 Mia-0.034
 Jen-0.032
 Sue-0.032
Token Tim
Token ID5045
Feature activation+1.08e+00

Pos logit contributions
hing+0.002
 streaming+0.002
ming+0.001
onder+0.001
cient+0.001
Neg logit contributions
 first-0.001
 moment-0.001
 Mia-0.001
 Jen-0.001
 Sue-0.001
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 moment+0.049
 Jen+0.046
 Mia+0.046
 first+0.045
 Sue+0.045
Neg logit contributions
hing-0.099
 streaming-0.096
ming-0.091
onder-0.091
cient-0.086
Token It
Token ID632
Feature activation+2.65e-01

Pos logit contributions
 moment+0.008
 first+0.007
oe+0.007
 minute+0.007
 Mia+0.006
Neg logit contributions
hing-0.010
 streaming-0.009
ming-0.009
onder-0.008
severe-0.008
Token has
Token ID468
Feature activation+9.82e-01

Pos logit contributions
 moment+0.008
 Jen+0.008
 first+0.008
 Sue+0.008
 Mia+0.008
Neg logit contributions
hing-0.012
 streaming-0.012
ming-0.012
cient-0.011
ning-0.010
Token carrots
Token ID34397
Feature activation-5.27e+00

Pos logit contributions
 moment+0.029
 first+0.027
 Jen+0.026
oe+0.026
 Mia+0.026
Neg logit contributions
hing-0.046
 streaming-0.046
ming-0.043
onder-0.041
cient-0.039
Token and
Token ID290
Feature activation-7.59e-02

Pos logit contributions
hing+0.248
 streaming+0.235
ming+0.228
onder+0.211
ting+0.205
Neg logit contributions
 moment-0.163
 Mia-0.159
 first-0.159
 Jen-0.158
 Sue-0.155
Token potatoes
Token ID18821
Feature activation-6.86e+00

Pos logit contributions
hing+0.003
 streaming+0.003
ming+0.003
onder+0.002
cient+0.002
Neg logit contributions
 moment-0.003
 first-0.003
oe-0.003
 Jen-0.003
 Sue-0.003
Token and
Token ID290
Feature activation-3.62e-01

Pos logit contributions
hing+0.352
 streaming+0.334
ming+0.327
onder+0.301
cient+0.295
Neg logit contributions
 moment-0.188
 first-0.184
 Mia-0.181
 Jen-0.180
 Sue-0.176
Token cheese
Token ID9891
Feature activation-3.60e+00

Pos logit contributions
hing+0.015
 streaming+0.014
ming+0.014
onder+0.013
cient+0.012
Neg logit contributions
 moment-0.013
 first-0.013
 Jen-0.013
 Mia-0.013
 Sue-0.012
Token.
Token ID13
Feature activation+6.91e-02

Pos logit contributions
hing+0.189
 streaming+0.181
ming+0.177
onder+0.164
cient+0.160
Neg logit contributions
 moment-0.093
 first-0.090
 Jen-0.088
 Mia-0.088
 Sue-0.086
Token You
Token ID921
Feature activation+2.67e-01

Pos logit contributions
 moment+0.002
 first+0.002
oe+0.002
 minute+0.002
 Mia+0.002
Neg logit contributions
hing-0.003
 streaming-0.003
ming-0.003
onder-0.003
severe-0.003
Token like
Token ID588
Feature activation-4.09e-02

Pos logit contributions
 first+0.007
 Mia+0.006
 moment+0.006
 Jen+0.006
 Sue+0.006
Neg logit contributions
hing-0.015
 streaming-0.015
ming-0.014
ning-0.013
onder-0.013
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
 moment+0.043
 first+0.042
 Jen+0.039
 Mia+0.039
 Sue+0.038
Neg logit contributions
hing-0.108
 streaming-0.104
ming-0.103
onder-0.095
ning-0.094
Token It
Token ID632
Feature activation-8.36e-03

Pos logit contributions
hing+0.005
 streaming+0.005
ming+0.004
onder+0.004
cient+0.004
Neg logit contributions
 moment-0.004
 first-0.004
oe-0.003
 minute-0.003
ila-0.003
Token has
Token ID468
Feature activation+7.87e-01

Pos logit contributions
hing+0.000
 streaming+0.000
ming+0.000
onder+0.000
cient+0.000
Neg logit contributions
 first-0.000
 moment-0.000
 Mia-0.000
 Jen-0.000
 Sue-0.000
Token carrots
Token ID34397
Feature activation-5.25e+00

Pos logit contributions
 moment+0.023
 first+0.023
 Jen+0.022
 Mia+0.022
 Sue+0.021
Neg logit contributions
hing-0.037
 streaming-0.037
ming-0.035
onder-0.033
ning-0.032
Token and
Token ID290
Feature activation-1.98e-01

Pos logit contributions
hing+0.255
 streaming+0.243
ming+0.236
onder+0.217
cient+0.211
Neg logit contributions
 moment-0.158
 first-0.155
 Mia-0.154
 Jen-0.152
 Sue-0.149
Token potatoes
Token ID18821
Feature activation-6.80e+00

Pos logit contributions
hing+0.008
 streaming+0.008
ming+0.007
onder+0.007
cient+0.007
Neg logit contributions
 moment-0.007
 first-0.007
oe-0.007
 Jen-0.007
ila-0.007
Token and
Token ID290
Feature activation-4.03e-01

Pos logit contributions
hing+0.374
 streaming+0.357
ming+0.349
onder+0.323
cient+0.317
Neg logit contributions
 first-0.163
 moment-0.163
 Mia-0.158
 Jen-0.155
 Sue-0.151
Token chicken
Token ID9015
Feature activation-3.88e+00

Pos logit contributions
hing+0.018
 streaming+0.017
ming+0.016
onder+0.015
cient+0.015
Neg logit contributions
 moment-0.014
 first-0.014
 Jen-0.013
 Mia-0.013
 Sue-0.013
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
hing+0.217
 streaming+0.208
ming+0.202
onder+0.187
severe+0.185
Neg logit contributions
 first-0.095
 moment-0.090
 Mia-0.090
 Jen-0.086
 Sue-0.084
Token You
Token ID921
Feature activation+6.45e-02

Pos logit contributions
hing+0.014
 streaming+0.014
ming+0.013
onder+0.012
ting+0.012
Neg logit contributions
 moment-0.009
 first-0.009
oe-0.009
 Mia-0.008
 minute-0.008
Token like
Token ID588
Feature activation-1.49e-01

Pos logit contributions
 first+0.002
 Mia+0.001
 moment+0.001
 Sue+0.001
 Jen+0.001
Neg logit contributions
 streaming-0.004
hing-0.004
ming-0.004
ning-0.003
onder-0.003
Token asked
Token ID1965
Feature activation-7.78e-01

Pos logit contributions
hing+0.012
 streaming+0.011
ming+0.011
onder+0.010
severe+0.010
Neg logit contributions
 first-0.009
 moment-0.009
 Mia-0.008
 Jen-0.008
 next-0.008
Token her
Token ID607
Feature activation-9.19e-01

Pos logit contributions
hing+0.049
 streaming+0.046
ming+0.045
onder+0.042
severe+0.042
Neg logit contributions
 first-0.017
 Mia-0.016
 Sue-0.016
 moment-0.015
 Jen-0.015
Token m
Token ID285
Feature activation-2.85e+00

Pos logit contributions
hing+0.050
 streaming+0.047
ming+0.046
onder+0.043
severe+0.042
Neg logit contributions
 first-0.025
 moment-0.024
 Mia-0.024
 Jen-0.023
 Sue-0.023
Tokenama
Token ID1689
Feature activation-3.12e+00

Pos logit contributions
hing+0.087
 streaming+0.080
ming+0.077
onder+0.066
cient+0.064
Neg logit contributions
 first-0.140
 moment-0.139
 Mia-0.137
 Jen-0.135
 Sue-0.134
Token and
Token ID290
Feature activation+3.16e-01

Pos logit contributions
hing+0.143
 streaming+0.137
ming+0.137
cient+0.123
ning+0.122
Neg logit contributions
 moment-0.097
 first-0.094
 Jen-0.094
 Mia-0.093
 Sue-0.091
Token pap
Token ID20461
Feature activation-6.76e+00

Pos logit contributions
 first+0.012
 Mia+0.011
 Jen+0.011
 Sue+0.011
 moment+0.010
Neg logit contributions
hing-0.015
 streaming-0.014
ming-0.014
severe-0.013
onder-0.012
Tokena
Token ID64
Feature activation-1.54e+00

Pos logit contributions
hing+0.254
 streaming+0.230
ming+0.223
cient+0.201
onder+0.198
Neg logit contributions
 Mia-0.286
 first-0.286
 Jen-0.282
 moment-0.281
 Sue-0.280
Token if
Token ID611
Feature activation-3.15e-01

Pos logit contributions
hing+0.073
 streaming+0.070
ming+0.068
cient+0.062
onder+0.061
Neg logit contributions
 moment-0.047
 first-0.046
 Jen-0.046
 Mia-0.046
 Sue-0.045
Token she
Token ID673
Feature activation-1.64e-01

Pos logit contributions
hing+0.019
 streaming+0.018
ming+0.018
onder+0.017
cient+0.017
Neg logit contributions
 first-0.006
 moment-0.006
 Mia-0.005
 Jen-0.005
 Sue-0.005
Token could
Token ID714
Feature activation-1.43e+00

Pos logit contributions
hing+0.006
 streaming+0.005
ming+0.005
onder+0.004
cient+0.004
Neg logit contributions
 first-0.007
 moment-0.007
 Mia-0.007
 Jen-0.007
 Sue-0.007
Token go
Token ID467
Feature activation-7.80e-01

Pos logit contributions
hing+0.066
 streaming+0.063
ming+0.062
onder+0.056
ning+0.055
Neg logit contributions
 first-0.050
 moment-0.048
 Mia-0.047
 Jen-0.046
 Sue-0.045
Token help
Token ID1037
Feature activation+7.48e-02

Pos logit contributions
 first+0.027
 moment+0.025
 Mia+0.025
 Jen+0.024
 Sue+0.024
Neg logit contributions
hing-0.049
 streaming-0.048
ming-0.046
ning-0.043
onder-0.043
Token her
Token ID607
Feature activation-7.95e-01

Pos logit contributions
 first+0.002
 Mia+0.001
 moment+0.001
 Sue+0.001
 Jen+0.001
Neg logit contributions
hing-0.005
 streaming-0.004
ming-0.004
severe-0.004
cient-0.004
Token m
Token ID285
Feature activation-2.58e+00

Pos logit contributions
hing+0.044
 streaming+0.040
ming+0.038
cient+0.036
ug+0.036
Neg logit contributions
 first-0.026
 moment-0.024
 next-0.023
 one-0.020
 Mia-0.020
Tokenama
Token ID1689
Feature activation-2.88e+00

Pos logit contributions
hing+0.088
 streaming+0.085
ming+0.080
ting+0.070
 Australian+0.069
Neg logit contributions
 first-0.118
 moment-0.111
 Jen-0.110
 next-0.109
oe-0.108
Token and
Token ID290
Feature activation+5.00e-01

Pos logit contributions
hing+0.154
 streaming+0.147
ming+0.142
onder+0.133
cient+0.131
Neg logit contributions
 first-0.079
 moment-0.075
 Mia-0.074
 Jen-0.071
 Sue-0.070
Token pap
Token ID20461
Feature activation-6.74e+00

Pos logit contributions
 first+0.016
 Mia+0.015
 Jen+0.015
 Sue+0.014
 moment+0.014
Neg logit contributions
hing-0.027
 streaming-0.024
ming-0.024
severe-0.022
 Australian-0.022
Tokena
Token ID64
Feature activation-1.58e+00

Pos logit contributions
hing+0.274
 streaming+0.253
ming+0.248
onder+0.221
cient+0.219
Neg logit contributions
 first-0.264
 moment-0.261
 Mia-0.260
 Jen-0.256
 Sue-0.254
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
hing+0.079
 streaming+0.075
ming+0.073
onder+0.068
cient+0.066
Neg logit contributions
 first-0.047
 moment-0.046
 Mia-0.045
 Jen-0.043
 Sue-0.043
Token
Token ID198
Feature activation-6.17e-01

Pos logit contributions
hing+0.030
ming+0.028
 streaming+0.028
ug+0.026
ning+0.025
Neg logit contributions
 Mia-0.009
 moment-0.009
 first-0.008
 Sue-0.008
 Jen-0.008
Token
Token ID198
Feature activation-4.26e-01

Pos logit contributions
hing+0.037
 streaming+0.036
ming+0.035
onder+0.032
severe+0.032
Neg logit contributions
 first-0.013
 moment-0.012
 Mia-0.012
 Jen-0.011
 Sue-0.011
TokenSo
Token ID2396
Feature activation-7.82e-01

Pos logit contributions
hing+0.022
 streaming+0.022
ming+0.021
ning+0.020
ug+0.019
Neg logit contributions
 first-0.012
 moment-0.012
 next-0.011
 minute-0.011
 Mia-0.010
Token put
Token ID1234
Feature activation-4.54e-01

Pos logit contributions
hing+0.084
 streaming+0.081
severe+0.075
ming+0.074
onder+0.073
Neg logit contributions
 first-0.044
 Mia-0.041
 moment-0.038
 Jen-0.038
 next-0.037
Token on
Token ID319
Feature activation-1.17e+00

Pos logit contributions
hing+0.022
 streaming+0.021
ming+0.020
ting+0.018
ning+0.018
Neg logit contributions
 moment-0.014
 first-0.013
 Mia-0.013
 Jen-0.013
 Sue-0.012
Token her
Token ID607
Feature activation-1.08e+00

Pos logit contributions
 streaming+0.055
hing+0.055
ming+0.054
onder+0.048
ning+0.047
Neg logit contributions
 moment-0.035
 first-0.032
ila-0.032
 minute-0.031
 Jen-0.031
Token helmet
Token ID14335
Feature activation+3.49e-01

Pos logit contributions
hing+0.044
ming+0.043
 streaming+0.042
onder+0.037
ning+0.037
Neg logit contributions
 Jen-0.039
 moment-0.038
 Sue-0.038
 Mia-0.038
 first-0.037
Token and
Token ID290
Feature activation-6.25e-01

Pos logit contributions
 first+0.009
 moment+0.008
 next+0.007
 Mia+0.007
 Sue+0.007
Neg logit contributions
hing-0.020
 streaming-0.019
ming-0.018
onder-0.017
severe-0.017
Token pads
Token ID21226
Feature activation-6.73e+00

Pos logit contributions
hing+0.034
 streaming+0.032
cient+0.030
onder+0.030
ming+0.030
Neg logit contributions
 Mia-0.018
 first-0.018
 Jen-0.017
 Sue-0.017
 moment-0.016
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
hing+0.384
 streaming+0.370
ming+0.360
onder+0.336
cient+0.333
Neg logit contributions
 first-0.156
 moment-0.147
 Mia-0.143
 Jen-0.139
 Sue-0.136
Token They
Token ID1119
Feature activation-5.07e-01

Pos logit contributions
hing+0.008
ming+0.007
 streaming+0.007
cient+0.006
severe+0.006
Neg logit contributions
 Mia-0.004
 first-0.003
 Jen-0.003
 Sue-0.003
 moment-0.003
Token were
Token ID547
Feature activation-1.07e+00

Pos logit contributions
hing+0.022
 streaming+0.022
ning+0.021
onder+0.021
adoes+0.021
Neg logit contributions
 first-0.019
 Mia-0.017
 Jen-0.017
 one-0.016
 moment-0.016
Token very
Token ID845
Feature activation-1.20e-01

Pos logit contributions
hing+0.059
cient+0.054
severe+0.053
 streaming+0.051
ors+0.050
Neg logit contributions
 first-0.032
 Jen-0.030
 Mia-0.030
 Sue-0.029
 Sara-0.028
Token gentle
Token ID10296
Feature activation-8.94e-01

Pos logit contributions
hing+0.007
 streaming+0.007
ming+0.006
severe+0.006
ting+0.006
Neg logit contributions
 first-0.003
 Mia-0.003
 Jen-0.003
 moment-0.003
 Sue-0.003
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 first+0.033
 moment+0.032
 Mia+0.031
 Jen+0.031
 Sue+0.030
Neg logit contributions
hing-0.085
 streaming-0.081
ming-0.079
onder-0.074
cient-0.073
Token They
Token ID1119
Feature activation+7.83e-02

Pos logit contributions
 first+0.008
 moment+0.008
 minute+0.007
 next+0.007
oe+0.007
Neg logit contributions
hing-0.012
 streaming-0.012
ming-0.011
onder-0.011
 Australian-0.011
Token wear
Token ID5806
Feature activation+2.16e-01

Pos logit contributions
 first+0.003
 Mia+0.002
 moment+0.002
 Jen+0.002
 Sue+0.002
Neg logit contributions
 streaming-0.004
hing-0.004
ming-0.004
ning-0.004
onder-0.003
Token helmets
Token ID28359
Feature activation-7.76e-01

Pos logit contributions
 first+0.007
 moment+0.007
 Mia+0.007
 Jen+0.007
 Sue+0.007
Neg logit contributions
hing-0.010
 streaming-0.010
ming-0.009
onder-0.009
cient-0.009
Token and
Token ID290
Feature activation-9.38e-02

Pos logit contributions
hing+0.043
 streaming+0.041
ming+0.039
onder+0.038
 Australian+0.037
Neg logit contributions
 first-0.021
 moment-0.019
 Mia-0.017
 next-0.017
 Jen-0.017
Token pads
Token ID21226
Feature activation-6.69e+00

Pos logit contributions
hing+0.004
 streaming+0.004
ming+0.004
onder+0.003
cient+0.003
Neg logit contributions
 first-0.004
 moment-0.004
 Mia-0.003
 Jen-0.003
 Sue-0.003
Token to
Token ID284
Feature activation+1.84e-01

Pos logit contributions
hing+0.347
 streaming+0.323
ming+0.322
onder+0.288
ug+0.284
Neg logit contributions
 moment-0.183
 Mia-0.181
 first-0.175
 Jen-0.173
 Sue-0.169
Token be
Token ID307
Feature activation+9.66e-01

Pos logit contributions
 first+0.007
 moment+0.006
 Mia+0.006
 Jen+0.006
 Sue+0.006
Neg logit contributions
hing-0.008
 streaming-0.008
ming-0.007
onder-0.007
cient-0.007
Token safe
Token ID3338
Feature activation-2.17e+00

Pos logit contributions
 first+0.032
 Mia+0.030
 Jen+0.030
 moment+0.030
 Sue+0.030
Neg logit contributions
hing-0.047
 streaming-0.044
ming-0.041
onder-0.040
cient-0.040
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
hing+0.137
 streaming+0.130
ming+0.127
onder+0.120
cient+0.119
Neg logit contributions
 first-0.041
 moment-0.037
 Mia-0.035
 Jen-0.034
 Sue-0.033
Token
Token ID198
Feature activation+2.29e-01

Pos logit contributions
 moment+0.008
 first+0.008
 minute+0.007
oe+0.007
ila+0.007
Neg logit contributions
 streaming-0.010
hing-0.010
ming-0.010
 Australian-0.009
ug-0.009
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
hing+0.034
ming+0.033
 streaming+0.033
onder+0.030
ning+0.030
Neg logit contributions
 moment-0.020
 Jen-0.019
 first-0.019
ila-0.019
 minute-0.018
Token She
Token ID1375
Feature activation-6.02e-01

Pos logit contributions
hing+0.013
 streaming+0.012
ming+0.012
onder+0.010
ug+0.010
Neg logit contributions
 Mia-0.006
 first-0.006
 Sue-0.006
 Jen-0.006
 moment-0.006
Token added
Token ID2087
Feature activation-8.41e-01

Pos logit contributions
hing+0.030
 streaming+0.028
ming+0.027
severe+0.025
onder+0.025
Neg logit contributions
 first-0.020
 Mia-0.019
 moment-0.019
 Jen-0.018
 Sue-0.018
Token spices
Token ID27952
Feature activation-4.80e+00

Pos logit contributions
hing+0.050
 streaming+0.048
ming+0.046
onder+0.043
cient+0.043
Neg logit contributions
 first-0.018
 moment-0.017
 Mia-0.017
 Jen-0.016
 Sue-0.016
Token and
Token ID290
Feature activation-4.46e-01

Pos logit contributions
hing+0.260
 streaming+0.248
ming+0.240
severe+0.224
cient+0.222
Neg logit contributions
 first-0.131
 moment-0.123
 Mia-0.123
 Jen-0.116
 Sue-0.115
Token herbs
Token ID25411
Feature activation-6.65e+00

Pos logit contributions
hing+0.022
 streaming+0.020
ming+0.020
severe+0.018
onder+0.018
Neg logit contributions
 first-0.015
 Mia-0.014
 moment-0.014
 Jen-0.014
 Sue-0.013
Token to
Token ID284
Feature activation-1.68e-01

Pos logit contributions
hing+0.367
 streaming+0.348
ming+0.341
severe+0.322
 Australian+0.317
Neg logit contributions
 first-0.177
 moment-0.161
 Mia-0.160
 Sue-0.149
 Jen-0.149
Token make
Token ID787
Feature activation-6.98e-01

Pos logit contributions
hing+0.008
 streaming+0.008
ming+0.008
onder+0.007
severe+0.007
Neg logit contributions
 first-0.005
 Mia-0.005
 moment-0.005
 Jen-0.005
 Sue-0.005
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
hing+0.038
 streaming+0.036
ming+0.035
severe+0.033
onder+0.032
Neg logit contributions
 first-0.019
 Mia-0.018
 moment-0.018
 Jen-0.017
 Sue-0.017
Token food
Token ID2057
Feature activation-2.69e+00

Pos logit contributions
hing+0.066
 streaming+0.063
ming+0.061
onder+0.055
severe+0.054
Neg logit contributions
 first-0.048
 moment-0.046
 Mia-0.044
 Jen-0.042
 Sue-0.042
Token taste
Token ID6938
Feature activation-2.66e-01

Pos logit contributions
hing+0.145
 streaming+0.138
ming+0.135
onder+0.125
cient+0.122
Neg logit contributions
 first-0.069
 moment-0.068
 Mia-0.066
 Jen-0.064
 Sue-0.063
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 moment+0.046
 Jen+0.043
 Mia+0.043
 first+0.042
 Sue+0.042
Neg logit contributions
hing-0.116
 streaming-0.112
ming-0.110
onder-0.103
cient-0.101
Token It
Token ID632
Feature activation+3.55e-01

Pos logit contributions
 moment+0.008
 first+0.008
oe+0.007
 Mia+0.007
 Sue+0.007
Neg logit contributions
hing-0.011
 streaming-0.011
ming-0.010
onder-0.010
severe-0.009
Token has
Token ID468
Feature activation+1.21e+00

Pos logit contributions
 first+0.010
 moment+0.010
 Mia+0.010
 Jen+0.010
 Sue+0.010
Neg logit contributions
hing-0.018
 streaming-0.017
ming-0.016
onder-0.015
cient-0.015
Token carrots
Token ID34397
Feature activation-5.00e+00

Pos logit contributions
 moment+0.035
 first+0.034
 Mia+0.032
 Jen+0.032
 Sue+0.032
Neg logit contributions
hing-0.060
 streaming-0.058
ming-0.056
onder-0.052
cient-0.051
Token and
Token ID290
Feature activation+4.18e-03

Pos logit contributions
hing+0.248
 streaming+0.235
ming+0.229
onder+0.211
cient+0.206
Neg logit contributions
 moment-0.147
 first-0.146
 Mia-0.143
 Jen-0.141
 Sue-0.138
Token potatoes
Token ID18821
Feature activation-6.65e+00

Pos logit contributions
 moment+0.000
 first+0.000
oe+0.000
 Jen+0.000
 Sue+0.000
Neg logit contributions
hing-0.000
 streaming-0.000
ming-0.000
onder-0.000
cient-0.000
Token and
Token ID290
Feature activation-2.87e-01

Pos logit contributions
hing+0.360
 streaming+0.344
ming+0.336
onder+0.311
cient+0.305
Neg logit contributions
 first-0.167
 moment-0.165
 Mia-0.160
 Jen-0.156
 Sue-0.153
Token chicken
Token ID9015
Feature activation-3.73e+00

Pos logit contributions
hing+0.013
 streaming+0.012
ming+0.012
onder+0.011
cient+0.010
Neg logit contributions
 first-0.010
 moment-0.010
 Mia-0.009
 Jen-0.009
 Sue-0.009
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
hing+0.210
 streaming+0.201
ming+0.195
onder+0.181
severe+0.179
Neg logit contributions
 first-0.091
 Mia-0.085
 moment-0.085
 Jen-0.081
 Sue-0.079
Token You
Token ID921
Feature activation+2.76e-01

Pos logit contributions
hing+0.006
 streaming+0.005
ming+0.005
onder+0.005
ting+0.005
Neg logit contributions
 moment-0.003
 first-0.003
oe-0.003
 Mia-0.003
 Jen-0.003
Token like
Token ID588
Feature activation+5.08e-02

Pos logit contributions
 first+0.007
 Mia+0.006
 moment+0.006
 Jen+0.005
 minute+0.005
Neg logit contributions
 streaming-0.016
ming-0.015
hing-0.015
ning-0.015
onder-0.014
Token cooked
Token ID15847
Feature activation-1.48e+00

Pos logit contributions
hing+0.026
 streaming+0.025
ming+0.024
severe+0.022
onder+0.022
Neg logit contributions
 first-0.018
 Mia-0.017
 moment-0.017
 Jen-0.016
 Sue-0.016
Token things
Token ID1243
Feature activation-3.94e+00

Pos logit contributions
hing+0.077
 streaming+0.073
ming+0.072
onder+0.066
ning+0.064
Neg logit contributions
 first-0.040
 moment-0.040
 Jen-0.038
 Mia-0.038
 Sue-0.037
Token like
Token ID588
Feature activation+2.05e-01

Pos logit contributions
hing+0.186
 streaming+0.177
ming+0.173
onder+0.156
cient+0.152
Neg logit contributions
 moment-0.124
 first-0.124
 Mia-0.121
 Jen-0.120
 Sue-0.117
Token chicken
Token ID9015
Feature activation-3.55e+00

Pos logit contributions
 first+0.006
 moment+0.006
 Mia+0.005
 Jen+0.005
 Sue+0.005
Neg logit contributions
hing-0.010
 streaming-0.010
ming-0.010
onder-0.009
cient-0.009
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
hing+0.195
 streaming+0.186
ming+0.179
severe+0.168
onder+0.168
Neg logit contributions
 first-0.091
 moment-0.087
 Mia-0.087
 Sue-0.082
 Jen-0.081
Token potatoes
Token ID18821
Feature activation-6.62e+00

Pos logit contributions
hing+0.017
 streaming+0.015
ming+0.015
severe+0.014
onder+0.014
Neg logit contributions
 Mia-0.010
 first-0.010
 moment-0.010
 Sue-0.009
 Jen-0.009
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
hing+0.391
 streaming+0.375
ming+0.367
onder+0.342
cient+0.337
Neg logit contributions
 first-0.136
 moment-0.132
 Mia-0.127
 Jen-0.122
 Sue-0.120
Token Sometimes
Token ID8975
Feature activation+1.81e+00

Pos logit contributions
hing+0.019
 streaming+0.018
ming+0.018
onder+0.016
cient+0.016
Neg logit contributions
 first-0.014
 moment-0.014
 Mia-0.013
 Jen-0.013
 Sue-0.013
Token she
Token ID673
Feature activation-1.12e+00

Pos logit contributions
 first+0.043
 Mia+0.042
 moment+0.042
 Sue+0.040
 Jen+0.040
Neg logit contributions
hing-0.106
 streaming-0.099
ming-0.097
severe-0.090
onder-0.090
Token served
Token ID4983
Feature activation-1.11e+00

Pos logit contributions
hing+0.050
 streaming+0.049
ming+0.046
severe+0.043
onder+0.043
Neg logit contributions
 first-0.040
 Mia-0.039
 moment-0.038
 Sue-0.036
 Jen-0.036
Token salad
Token ID20698
Feature activation-2.86e+00

Pos logit contributions
hing+0.058
 streaming+0.055
ming+0.053
severe+0.050
onder+0.049
Neg logit contributions
 first-0.032
 Mia-0.031
 moment-0.031
 Sue-0.029
 Jen-0.029
Token gave
Token ID2921
Feature activation-6.94e-01

Pos logit contributions
hing+0.016
 streaming+0.014
ming+0.014
onder+0.013
cient+0.013
Neg logit contributions
 first-0.012
 Mia-0.011
 moment-0.011
 Jen-0.011
 Sue-0.011
Token them
Token ID606
Feature activation-3.69e-02

Pos logit contributions
hing+0.030
 streaming+0.029
ming+0.028
ting+0.026
onder+0.025
Neg logit contributions
 moment-0.024
 first-0.023
 minute-0.022
 Mia-0.022
 Jen-0.022
Token medicine
Token ID9007
Feature activation-2.99e+00

Pos logit contributions
hing+0.002
 streaming+0.002
ming+0.002
onder+0.002
usted+0.002
Neg logit contributions
 moment-0.001
 first-0.001
ila-0.001
 minute-0.001
oe-0.001
Token and
Token ID290
Feature activation-5.36e-01

Pos logit contributions
hing+0.167
 streaming+0.160
ming+0.157
onder+0.144
ning+0.141
Neg logit contributions
 moment-0.068
 first-0.067
 Mia-0.066
 Jen-0.064
 Sue-0.062
Token band
Token ID4097
Feature activation-9.35e-01

Pos logit contributions
hing+0.029
 streaming+0.027
ming+0.027
onder+0.025
cient+0.024
Neg logit contributions
 first-0.014
 moment-0.014
 Mia-0.013
 Jen-0.013
 Sue-0.013
Tokenages
Token ID1095
Feature activation-6.48e+00

Pos logit contributions
hing+0.021
 streaming+0.019
ming+0.017
onder+0.014
cient+0.013
Neg logit contributions
 first-0.054
 moment-0.053
 Mia-0.052
 Jen-0.052
 Sue-0.052
Token.
Token ID13
Feature activation-6.04e-02

Pos logit contributions
hing+0.377
 streaming+0.362
ming+0.353
onder+0.328
cient+0.322
Neg logit contributions
 first-0.137
 moment-0.135
 Mia-0.131
 Jen-0.127
 Sue-0.124
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
 streaming+0.002
hing+0.002
ming+0.002
 Australian+0.002
 echoed+0.002
Neg logit contributions
 moment-0.002
 first-0.002
 minute-0.002
oe-0.002
 next-0.002
Token
Token ID198
Feature activation+3.54e-01

Pos logit contributions
 first+0.003
 moment+0.003
 minute+0.003
 Mia+0.003
 Jen+0.003
Neg logit contributions
hing-0.006
 streaming-0.006
ming-0.006
onder-0.005
 Australian-0.005
Token"
Token ID1
Feature activation+5.12e-01

Pos logit contributions
 moment+0.012
 first+0.011
 Mia+0.011
 Jen+0.010
 minute+0.010
Neg logit contributions
hing-0.016
ming-0.015
 streaming-0.015
 Australian-0.014
onder-0.014
TokenTom
Token ID13787
Feature activation+2.78e+00

Pos logit contributions
 Mia+0.021
 Jen+0.020
 first+0.020
 moment+0.019
 Sue+0.019
Neg logit contributions
hing-0.019
 streaming-0.017
ming-0.017
ting-0.017
check-0.015
Token pulled
Token ID5954
Feature activation-2.26e-01

Pos logit contributions
hing+0.004
 streaming+0.004
ming+0.004
severe+0.003
onder+0.003
Neg logit contributions
 first-0.003
 Mia-0.003
 moment-0.003
 Jen-0.003
 Sue-0.003
Token up
Token ID510
Feature activation-1.27e+00

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
ning+0.009
Neg logit contributions
 moment-0.007
 first-0.007
 Mia-0.007
 Jen-0.007
 Sue-0.007
Token some
Token ID617
Feature activation+7.16e-01

Pos logit contributions
 streaming+0.057
hing+0.056
ming+0.054
ning+0.049
onder+0.047
Neg logit contributions
ila-0.040
 moment-0.039
 Jen-0.037
 minute-0.036
 Mia-0.035
Token carrots
Token ID34397
Feature activation-4.26e+00

Pos logit contributions
 first+0.023
 moment+0.022
 Mia+0.021
 Sue+0.020
 Jen+0.020
Neg logit contributions
hing-0.035
 streaming-0.033
ming-0.033
severe-0.030
cient-0.030
Token and
Token ID290
Feature activation-5.08e-03

Pos logit contributions
hing+0.230
 streaming+0.220
ming+0.215
cient+0.200
severe+0.200
Neg logit contributions
 first-0.121
 moment-0.108
 Mia-0.104
 next-0.100
 Sue-0.098
Token potatoes
Token ID18821
Feature activation-6.45e+00

Pos logit contributions
hing+0.000
 streaming+0.000
ming+0.000
onder+0.000
cient+0.000
Neg logit contributions
 first-0.000
 Mia-0.000
 moment-0.000
 Jen-0.000
 Sue-0.000
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
hing+0.354
 streaming+0.339
ming+0.332
severe+0.319
 Australian+0.319
Neg logit contributions
 first-0.171
 moment-0.148
 Mia-0.143
 next-0.141
 Sue-0.133
Token He
Token ID679
Feature activation-3.04e-01

Pos logit contributions
hing+0.016
 streaming+0.015
ming+0.015
onder+0.014
severe+0.014
Neg logit contributions
 first-0.009
 Mia-0.009
 moment-0.009
 Jen-0.009
 Sue-0.008
Token also
Token ID635
Feature activation+2.78e-01

Pos logit contributions
hing+0.015
 streaming+0.014
ming+0.013
severe+0.013
cient+0.012
Neg logit contributions
 first-0.011
 Mia-0.010
 moment-0.010
 Jen-0.009
 Sue-0.009
Token picked
Token ID6497
Feature activation-1.10e-01

Pos logit contributions
 first+0.010
 moment+0.010
 Mia+0.010
 Jen+0.010
 Sue+0.010
Neg logit contributions
hing-0.012
 streaming-0.011
ming-0.011
onder-0.010
severe-0.010
Token some
Token ID617
Feature activation+4.98e-01

Pos logit contributions
hing+0.006
 streaming+0.006
ming+0.006
onder+0.005
ning+0.005
Neg logit contributions
 moment-0.003
 Jen-0.003
 first-0.003
 Mia-0.003
 Sue-0.003
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
hing+0.041
ming+0.038
 streaming+0.038
onder+0.035
ning+0.035
Neg logit contributions
 Jen-0.025
 moment-0.024
 Sue-0.024
ila-0.024
 first-0.024
Token He
Token ID679
Feature activation+9.39e-02

Pos logit contributions
 moment+0.009
 first+0.009
oe+0.008
 minute+0.008
ila+0.008
Neg logit contributions
hing-0.008
 streaming-0.008
ming-0.008
onder-0.007
severe-0.007
Token saw
Token ID2497
Feature activation+1.18e+00

Pos logit contributions
 first+0.003
 Mia+0.003
 moment+0.003
 Jen+0.003
 Sue+0.003
Neg logit contributions
hing-0.004
 streaming-0.004
ming-0.004
onder-0.003
severe-0.003
Token carrots
Token ID34397
Feature activation-4.80e+00

Pos logit contributions
 first+0.029
 moment+0.029
 Mia+0.027
 Jen+0.027
 Sue+0.026
Neg logit contributions
hing-0.064
 streaming-0.061
ming-0.060
onder-0.056
ning-0.055
Token and
Token ID290
Feature activation-6.09e-02

Pos logit contributions
hing+0.244
 streaming+0.233
ming+0.227
onder+0.210
severe+0.208
Neg logit contributions
 first-0.142
 moment-0.136
 Mia-0.133
 Jen-0.128
 Sue-0.127
Token potatoes
Token ID18821
Feature activation-6.44e+00

Pos logit contributions
hing+0.003
 streaming+0.002
ming+0.002
onder+0.002
cient+0.002
Neg logit contributions
 first-0.002
 moment-0.002
 Mia-0.002
 Jen-0.002
 Sue-0.002
Token and
Token ID290
Feature activation-3.49e-01

Pos logit contributions
hing+0.370
 streaming+0.355
ming+0.346
onder+0.323
severe+0.320
Neg logit contributions
 first-0.146
 moment-0.138
 Mia-0.135
 Jen-0.129
 Sue-0.127
Token chicken
Token ID9015
Feature activation-3.73e+00

Pos logit contributions
hing+0.016
 streaming+0.015
ming+0.015
onder+0.013
cient+0.013
Neg logit contributions
 first-0.012
 moment-0.012
 Mia-0.012
 Jen-0.011
 Sue-0.011
Token in
Token ID287
Feature activation-1.61e-01

Pos logit contributions
hing+0.231
 streaming+0.221
ming+0.215
severe+0.203
onder+0.202
Neg logit contributions
 first-0.070
 Mia-0.065
 moment-0.064
 Jen-0.060
 Sue-0.059
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
hing+0.006
 streaming+0.006
ming+0.006
onder+0.005
ning+0.005
Neg logit contributions
ider-0.006
oe-0.005
 moment-0.005
ila-0.005
 first-0.005
Token soup
Token ID17141
Feature activation-2.09e+00

Pos logit contributions
hing+0.006
 streaming+0.006
ming+0.006
onder+0.006
cient+0.005
Neg logit contributions
 first-0.004
 moment-0.004
 Jen-0.004
 Sue-0.004
 Mia-0.004
Token carrots
Token ID34397
Feature activation-3.70e+00

Pos logit contributions
hing+0.026
 streaming+0.025
ming+0.024
severe+0.022
ug+0.022
Neg logit contributions
 first-0.018
 moment-0.017
 Mia-0.016
 next-0.016
 minute-0.015
Token,
Token ID11
Feature activation+3.96e-01

Pos logit contributions
hing+0.187
 streaming+0.179
ming+0.175
onder+0.159
severe+0.159
Neg logit contributions
 first-0.115
 moment-0.108
 Mia-0.105
 Sue-0.100
 Jen-0.099
Token potatoes
Token ID18821
Feature activation-5.41e+00

Pos logit contributions
 first+0.014
 moment+0.014
 Mia+0.014
 Jen+0.014
 Sue+0.013
Neg logit contributions
hing-0.017
 streaming-0.016
ming-0.016
onder-0.014
cient-0.014
Token,
Token ID11
Feature activation+1.36e-01

Pos logit contributions
hing+0.297
 streaming+0.289
ming+0.279
severe+0.266
 Australian+0.259
Neg logit contributions
 first-0.148
 moment-0.129
 Mia-0.125
 next-0.124
 Sue-0.116
Token and
Token ID290
Feature activation-9.49e-02

Pos logit contributions
 first+0.004
 moment+0.004
 Mia+0.004
 Jen+0.004
 Sue+0.004
Neg logit contributions
hing-0.007
 streaming-0.007
ming-0.006
onder-0.006
severe-0.006
Token onions
Token ID24681
Feature activation-6.44e+00

Pos logit contributions
hing+0.005
 streaming+0.004
ming+0.004
severe+0.004
onder+0.004
Neg logit contributions
 first-0.003
 Mia-0.003
 moment-0.003
 Jen-0.003
 Sue-0.003
Token and
Token ID290
Feature activation-2.77e-01

Pos logit contributions
hing+0.375
 streaming+0.363
ming+0.351
severe+0.338
 Australian+0.332
Neg logit contributions
 first-0.151
 moment-0.131
 Mia-0.126
 next-0.121
 Jen-0.117
Token put
Token ID1234
Feature activation-1.47e+00

Pos logit contributions
hing+0.013
 streaming+0.012
ming+0.012
severe+0.011
onder+0.011
Neg logit contributions
 first-0.009
 Mia-0.009
 moment-0.009
 Jen-0.009
 Sue-0.008
Token them
Token ID606
Feature activation-1.61e+00

Pos logit contributions
hing+0.069
 streaming+0.065
ming+0.063
onder+0.058
ting+0.056
Neg logit contributions
 moment-0.047
 first-0.046
 Jen-0.046
 Mia-0.045
 Sue-0.044
Token in
Token ID287
Feature activation-4.92e-01

Pos logit contributions
hing+0.057
 streaming+0.054
ming+0.051
onder+0.044
cient+0.044
Neg logit contributions
 moment-0.070
 Jen-0.069
 Mia-0.069
 first-0.068
 Sue-0.068
Token a
Token ID257
Feature activation-3.28e-01

Pos logit contributions
 streaming+0.021
hing+0.020
ming+0.018
onder+0.018
ning+0.017
Neg logit contributions
ila-0.017
 Jen-0.015
ider-0.015
 Lia-0.015
oe-0.015
Token cooked
Token ID15847
Feature activation-9.37e-01

Pos logit contributions
hing+0.013
 streaming+0.012
ming+0.012
severe+0.010
onder+0.010
Neg logit contributions
 first-0.010
 Mia-0.009
 moment-0.009
 Jen-0.009
 Sue-0.009
Token something
Token ID1223
Feature activation-1.90e+00

Pos logit contributions
hing+0.050
 streaming+0.048
ming+0.047
onder+0.044
cient+0.043
Neg logit contributions
 first-0.024
 moment-0.024
 Mia-0.023
 Jen-0.022
 Sue-0.022
Token with
Token ID351
Feature activation+1.89e+00

Pos logit contributions
hing+0.104
 streaming+0.099
ming+0.096
severe+0.089
onder+0.089
Neg logit contributions
 first-0.049
 moment-0.047
 Mia-0.047
 Jen-0.045
 Sue-0.044
Token carrots
Token ID34397
Feature activation-4.32e+00

Pos logit contributions
 first+0.049
 moment+0.048
 Mia+0.047
 Jen+0.046
 Sue+0.045
Neg logit contributions
hing-0.101
 streaming-0.097
ming-0.094
onder-0.087
cient-0.085
Token and
Token ID290
Feature activation+7.37e-02

Pos logit contributions
hing+0.229
 streaming+0.218
ming+0.213
onder+0.196
severe+0.193
Neg logit contributions
 first-0.119
 moment-0.114
 Mia-0.112
 Jen-0.108
 Sue-0.107
Token potatoes
Token ID18821
Feature activation-6.36e+00

Pos logit contributions
 first+0.003
 Mia+0.003
 moment+0.002
 Jen+0.002
 Sue+0.002
Neg logit contributions
hing-0.003
 streaming-0.003
ming-0.003
severe-0.003
onder-0.003
Token.
Token ID13
Feature activation-2.34e-01

Pos logit contributions
hing+0.382
 streaming+0.367
ming+0.359
onder+0.334
severe+0.333
Neg logit contributions
 first-0.130
 moment-0.121
 Mia-0.116
 Jen-0.109
 Sue-0.108
Token It
Token ID632
Feature activation-1.35e-01

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
cient+0.009
Neg logit contributions
 first-0.008
 moment-0.008
 Mia-0.007
 Jen-0.007
ila-0.007
Token was
Token ID373
Feature activation-1.81e-01

Pos logit contributions
hing+0.007
 streaming+0.006
ming+0.006
onder+0.006
severe+0.006
Neg logit contributions
 first-0.004
 moment-0.004
 Mia-0.004
 Jen-0.004
 Sue-0.004
Token y
Token ID331
Feature activation-1.74e+00

Pos logit contributions
hing+0.009
 streaming+0.009
ming+0.009
onder+0.008
cient+0.008
Neg logit contributions
 first-0.005
 moment-0.005
 Mia-0.005
 Jen-0.005
 Sue-0.005
Tokenummy
Token ID13513
Feature activation-1.90e+00

Pos logit contributions
hing+0.078
 streaming+0.072
ming+0.072
severe+0.065
cient+0.064
Neg logit contributions
 first-0.061
 Mia-0.059
 moment-0.058
 Jen-0.058
 Sue-0.057
Token prefer
Token ID4702
Feature activation-3.53e-01

Pos logit contributions
hing+0.009
 streaming+0.009
ming+0.009
onder+0.008
ning+0.008
Neg logit contributions
 first-0.006
 moment-0.006
 Mia-0.006
 Jen-0.006
 Sue-0.006
Token to
Token ID284
Feature activation-2.38e-01

Pos logit contributions
hing+0.020
 streaming+0.018
ming+0.018
onder+0.017
cient+0.017
Neg logit contributions
 first-0.009
 moment-0.009
 Mia-0.009
 Jen-0.008
 Sue-0.008
Token eat
Token ID4483
Feature activation-2.58e-01

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
cient+0.009
Neg logit contributions
 moment-0.008
 first-0.008
 Mia-0.007
 Jen-0.007
ila-0.007
Token fruits
Token ID15921
Feature activation-3.60e+00

Pos logit contributions
hing+0.013
 streaming+0.012
ming+0.012
onder+0.011
ning+0.011
Neg logit contributions
 moment-0.007
 first-0.007
 Mia-0.007
 Jen-0.007
ila-0.006
Token and
Token ID290
Feature activation-2.82e-01

Pos logit contributions
hing+0.171
 streaming+0.162
ming+0.159
onder+0.142
ting+0.141
Neg logit contributions
 moment-0.111
 Mia-0.109
 Jen-0.108
 first-0.107
 Sue-0.102
Token vegetables
Token ID13701
Feature activation-6.31e+00

Pos logit contributions
hing+0.013
 streaming+0.013
ming+0.012
onder+0.011
cient+0.011
Neg logit contributions
 moment-0.009
 first-0.009
 Mia-0.008
 Jen-0.008
oe-0.008
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
hing+0.315
 streaming+0.301
ming+0.294
ting+0.268
onder+0.266
Neg logit contributions
 Jen-0.173
 Mia-0.171
 moment-0.165
 Sue-0.161
ila-0.159
Token They
Token ID1119
Feature activation-3.77e-01

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
 Australian+0.001
onder+0.001
Neg logit contributions
 first-0.001
 moment-0.001
ila-0.001
oe-0.001
 Mia-0.001
Token are
Token ID389
Feature activation-3.99e-01

Pos logit contributions
hing+0.018
 streaming+0.017
ming+0.016
onder+0.015
cient+0.014
Neg logit contributions
 moment-0.012
 first-0.012
 Mia-0.012
 Jen-0.012
 Sue-0.012
Token good
Token ID922
Feature activation-2.07e+00

Pos logit contributions
hing+0.020
 streaming+0.019
ming+0.019
onder+0.017
cient+0.017
Neg logit contributions
 moment-0.011
 first-0.011
 Mia-0.011
 Jen-0.010
 Sue-0.010
Token for
Token ID329
Feature activation-6.60e-01

Pos logit contributions
hing+0.095
 streaming+0.086
ming+0.084
cient+0.077
onder+0.076
Neg logit contributions
 first-0.077
 Mia-0.072
 Jen-0.071
 next-0.071
 moment-0.070
Token the
Token ID262
Feature activation-5.02e-01

Pos logit contributions
 moment+0.016
 first+0.016
 Mia+0.015
 Jen+0.015
ila+0.015
Neg logit contributions
hing-0.031
 streaming-0.029
ming-0.029
onder-0.026
severe-0.026
Token moon
Token ID8824
Feature activation-1.57e-02

Pos logit contributions
hing+0.025
 streaming+0.025
ming+0.023
onder+0.022
ug+0.021
Neg logit contributions
 first-0.016
 moment-0.015
 next-0.014
 Mia-0.014
 minute-0.013
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
onder+0.001
cient+0.001
Neg logit contributions
 first-0.000
 moment-0.000
 Mia-0.000
 Jen-0.000
 Sue-0.000
Token
Token ID198
Feature activation+1.38e-01

Pos logit contributions
 moment+0.007
 first+0.007
oe+0.007
 next+0.007
 minute+0.006
Neg logit contributions
hing-0.009
ming-0.009
 streaming-0.009
 Australian-0.009
onder-0.009
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
ider+0.005
 first+0.004
 moment+0.004
 Jen+0.004
 ranger+0.004
Neg logit contributions
hing-0.005
ming-0.005
ug-0.004
 Australian-0.004
itated-0.004
TokenOne
Token ID3198
Feature activation+4.35e+00

Pos logit contributions
 Jen+0.007
 moment+0.007
 Sue+0.007
 first+0.007
 Jill+0.007
Neg logit contributions
hing-0.014
ming-0.013
 Australian-0.012
 streaming-0.012
onder-0.012
Token day
Token ID1110
Feature activation+8.71e-03

Pos logit contributions
 first+0.116
 moment+0.114
 Mia+0.111
 Jen+0.107
 Sue+0.105
Neg logit contributions
hing-0.234
 streaming-0.224
ming-0.218
onder-0.201
cient-0.198
Token,
Token ID11
Feature activation-4.58e-01

Pos logit contributions
 first+0.000
 moment+0.000
 Mia+0.000
 Jen+0.000
 Sue+0.000
Neg logit contributions
hing-0.000
 streaming-0.000
ming-0.000
onder-0.000
cient-0.000
Token they
Token ID484
Feature activation-2.71e-01

Pos logit contributions
hing+0.023
 streaming+0.022
ming+0.022
onder+0.020
 echoed+0.020
Neg logit contributions
 moment-0.012
 first-0.011
 Jen-0.010
oe-0.010
 minute-0.010
Token take
Token ID1011
Feature activation-1.44e-01

Pos logit contributions
ming+0.015
 streaming+0.014
ning+0.014
hing+0.014
onder+0.013
Neg logit contributions
 first-0.008
 Mia-0.007
 Jill-0.007
 next-0.006
 Jen-0.006
Token their
Token ID511
Feature activation-9.77e-01

Pos logit contributions
hing+0.009
 streaming+0.009
ming+0.009
cient+0.008
onder+0.008
Neg logit contributions
 Mia-0.003
 first-0.003
 Jen-0.002
 moment-0.002
 Sue-0.002
Token like
Token ID588
Feature activation+2.59e-02

Pos logit contributions
hing+0.042
 streaming+0.040
ming+0.039
onder+0.036
ning+0.035
Neg logit contributions
 first-0.028
 moment-0.027
 Mia-0.027
 Jen-0.026
 Sue-0.026
Token to
Token ID284
Feature activation+7.85e-02

Pos logit contributions
 first+0.001
 moment+0.001
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.001
 streaming-0.001
ming-0.001
onder-0.001
severe-0.001
Token play
Token ID711
Feature activation+1.20e+00

Pos logit contributions
 moment+0.003
 first+0.003
ila+0.003
 Mia+0.003
 Jen+0.003
Neg logit contributions
hing-0.003
 streaming-0.003
ming-0.003
onder-0.002
 echoed-0.002
Token in
Token ID287
Feature activation-3.14e-01

Pos logit contributions
 first+0.044
 Mia+0.042
 moment+0.042
 Sue+0.040
 next+0.040
Neg logit contributions
hing-0.053
 streaming-0.052
ming-0.049
onder-0.045
severe-0.045
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
hing+0.010
 streaming+0.009
ming+0.009
 helicopters+0.009
 Australian+0.009
Neg logit contributions
ider-0.014
ila-0.012
oe-0.012
 minute-0.012
ia-0.012
Token park
Token ID3952
Feature activation+3.56e+00

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.016
onder+0.014
 Australian+0.014
Neg logit contributions
 first-0.013
 moment-0.013
 Jen-0.012
 Mia-0.012
 Sue-0.012
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
ila+0.117
 moment+0.112
 Jen+0.111
 apology+0.109
onna+0.105
Neg logit contributions
hing-0.146
ming-0.139
 streaming-0.131
ning-0.125
ug-0.122
Token One
Token ID1881
Feature activation+3.26e+00

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
 Australian+0.010
cient+0.010
Neg logit contributions
 first-0.007
 moment-0.007
oe-0.006
 next-0.006
ila-0.006
Token day
Token ID1110
Feature activation-9.60e-02

Pos logit contributions
 first+0.108
 moment+0.104
onna+0.103
 Jill+0.103
 Mia+0.103
Neg logit contributions
hing-0.130
 streaming-0.113
ming-0.109
 Australian-0.108
ug-0.108
Token,
Token ID11
Feature activation-4.51e-01

Pos logit contributions
hing+0.005
 streaming+0.005
ming+0.005
onder+0.004
severe+0.004
Neg logit contributions
 first-0.003
 moment-0.003
 Mia-0.003
 Jen-0.003
 Sue-0.003
Token they
Token ID484
Feature activation-2.46e-01

Pos logit contributions
hing+0.024
 streaming+0.023
ming+0.023
ning+0.021
onder+0.021
Neg logit contributions
 moment-0.010
 first-0.010
 minute-0.009
oe-0.009
ila-0.008
Token very
Token ID845
Feature activation+4.43e-02

Pos logit contributions
hing+0.019
 streaming+0.018
ming+0.018
onder+0.016
ning+0.016
Neg logit contributions
 moment-0.010
ila-0.009
 Jen-0.009
 Mia-0.009
oe-0.009
Token much
Token ID881
Feature activation+5.87e-01

Pos logit contributions
 moment+0.002
 first+0.002
 Jen+0.002
ila+0.002
 Mia+0.002
Neg logit contributions
hing-0.002
 streaming-0.001
ming-0.001
ug-0.001
 echoed-0.001
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 moment+0.016
 Jen+0.015
ila+0.015
 Mia+0.015
 first+0.015
Neg logit contributions
hing-0.028
 streaming-0.027
ming-0.027
ting-0.025
onder-0.025
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
hing+0.009
 streaming+0.008
ming+0.008
 Australian+0.008
cient+0.008
Neg logit contributions
 moment-0.007
 first-0.007
oe-0.006
 next-0.006
 minute-0.006
Token
Token ID198
Feature activation+1.29e-02

Pos logit contributions
hing+0.005
ming+0.005
 Australian+0.005
 streaming+0.004
ug+0.004
Neg logit contributions
 first-0.004
 Jen-0.004
ider-0.004
 moment-0.004
 Jill-0.004
TokenOne
Token ID3198
Feature activation+4.45e+00

Pos logit contributions
 Jen+0.000
 moment+0.000
 Sue+0.000
 first+0.000
 Mia+0.000
Neg logit contributions
hing-0.001
ming-0.001
 streaming-0.001
 Australian-0.001
onder-0.001
Token day
Token ID1110
Feature activation+1.53e-01

Pos logit contributions
 first+0.149
 Jill+0.145
 Jen+0.145
 Mia+0.144
oe+0.143
Neg logit contributions
hing-0.178
 streaming-0.161
ming-0.157
 Australian-0.150
ug-0.149
Token,
Token ID11
Feature activation-3.34e-01

Pos logit contributions
 first+0.004
 Mia+0.004
 moment+0.004
 Jen+0.004
 Sue+0.004
Neg logit contributions
hing-0.008
 streaming-0.008
ming-0.007
onder-0.007
cient-0.007
Token Ben
Token ID3932
Feature activation+2.81e-02

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.016
onder+0.015
cient+0.014
Neg logit contributions
 moment-0.009
 first-0.009
 Jen-0.008
 Sue-0.008
 Mia-0.008
Token sees
Token ID7224
Feature activation+2.16e+00

Pos logit contributions
 first+0.001
 Mia+0.001
 moment+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.001
 streaming-0.001
ming-0.001
onder-0.001
ning-0.001
Token a
Token ID257
Feature activation+2.61e-01

Pos logit contributions
 first+0.040
 Mia+0.039
 moment+0.038
 Jen+0.037
 Sue+0.036
Neg logit contributions
hing-0.134
 streaming-0.128
ming-0.125
onder-0.117
cient-0.116
TokenYou
Token ID1639
Feature activation-5.30e-01

Pos logit contributions
hing+0.038
ming+0.035
 streaming+0.034
ting+0.031
severe+0.030
Neg logit contributions
 first-0.029
 Mia-0.029
 Jen-0.028
 Sue-0.028
 moment-0.027
Token are
Token ID389
Feature activation+1.32e-01

Pos logit contributions
 streaming+0.026
hing+0.026
ming+0.026
ning+0.025
ting+0.024
Neg logit contributions
 first-0.016
 moment-0.014
 Mia-0.014
 Sue-0.013
 Jen-0.013
Token the
Token ID262
Feature activation-2.35e-01

Pos logit contributions
 moment+0.003
 first+0.003
 Mia+0.003
 Jen+0.003
 Sue+0.003
Neg logit contributions
hing-0.007
 streaming-0.007
ming-0.007
onder-0.006
cient-0.006
Token best
Token ID1266
Feature activation-1.22e+00

Pos logit contributions
hing+0.010
 streaming+0.009
ming+0.009
onder+0.008
cient+0.008
Neg logit contributions
 moment-0.009
 Mia-0.008
 first-0.008
 Jen-0.008
 Sue-0.008
Token niece
Token ID41803
Feature activation+1.52e+00

Pos logit contributions
hing+0.054
 streaming+0.051
ming+0.050
onder+0.046
ning+0.045
Neg logit contributions
 first-0.042
 moment-0.041
 Mia-0.040
 Jen-0.039
 Sue-0.039
Token ever
Token ID1683
Feature activation+3.53e+00

Pos logit contributions
 moment+0.043
 first+0.042
 Jen+0.040
 Mia+0.040
ila+0.040
Neg logit contributions
hing-0.071
ming-0.069
 streaming-0.069
ting-0.062
onder-0.060
Token."
Token ID526
Feature activation+1.80e+00

Pos logit contributions
 first+0.096
 moment+0.094
 Mia+0.094
 Jen+0.092
 Sue+0.090
Neg logit contributions
hing-0.185
 streaming-0.175
ming-0.172
onder-0.159
cient-0.156
Token They
Token ID1119
Feature activation-4.97e-02

Pos logit contributions
 first+0.055
 moment+0.053
 minute+0.048
ila+0.048
oe+0.046
Neg logit contributions
hing-0.084
 streaming-0.083
ming-0.081
onder-0.073
 Australian-0.072
Token laughed
Token ID13818
Feature activation-2.71e-01

Pos logit contributions
hing+0.002
ming+0.002
 streaming+0.002
ning+0.002
onder+0.002
Neg logit contributions
 first-0.002
 Mia-0.002
 Sue-0.002
 Jen-0.002
 Sara-0.002
Token and
Token ID290
Feature activation+5.75e-01

Pos logit contributions
hing+0.016
 streaming+0.015
ming+0.014
cient+0.014
onder+0.014
Neg logit contributions
 first-0.006
 moment-0.006
 Mia-0.006
 next-0.006
 Sue-0.005
Token drew
Token ID9859
Feature activation+1.37e+00

Pos logit contributions
 Mia+0.021
 first+0.021
 Sue+0.021
 moment+0.021
 Jen+0.021
Neg logit contributions
hing-0.025
 streaming-0.022
ming-0.022
onder-0.021
cient-0.021
TokenThat
Token ID2504
Feature activation+1.85e-01

Pos logit contributions
hing+0.005
 streaming+0.004
ming+0.004
ned+0.004
amb+0.004
Neg logit contributions
 first-0.004
 Mia-0.004
 Sue-0.004
oe-0.004
 Lia-0.004
Tokenâ
Token ID22940
Feature activation+8.19e-01

Pos logit contributions
 first+0.007
 moment+0.007
 Mia+0.007
 Jen+0.007
 Sue+0.007
Neg logit contributions
hing-0.008
 streaming-0.007
ming-0.007
onder-0.006
cient-0.006
Token€
Token ID26391
Feature activation+8.63e-01

Pos logit contributions
 Mia+0.031
 Jen+0.030
 first+0.030
 moment+0.029
 Sue+0.029
Neg logit contributions
hing-0.035
ming-0.031
 streaming-0.030
ting-0.027
onder-0.027
Tokenâ„¢
Token ID8151
Feature activation-3.82e-01

Pos logit contributions
 first+0.032
 moment+0.032
 Mia+0.030
 Jen+0.030
 Sue+0.029
Neg logit contributions
hing-0.036
 streaming-0.034
ming-0.033
onder-0.030
cient-0.029
Tokens
Token ID82
Feature activation-1.29e+00

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.015
onder+0.014
cient+0.014
Neg logit contributions
 moment-0.013
 first-0.013
 Mia-0.012
 Jen-0.012
 Sue-0.012
Token not
Token ID407
Feature activation+3.57e+00

Pos logit contributions
hing+0.064
 streaming+0.061
ming+0.059
onder+0.055
severe+0.054
Neg logit contributions
 first-0.039
 Mia-0.038
 moment-0.037
 Jen-0.037
 Sue-0.036
Token real
Token ID1103
Feature activation-3.28e-01

Pos logit contributions
 first+0.104
 moment+0.101
 Mia+0.099
 Jen+0.097
 Sue+0.096
Neg logit contributions
hing-0.182
 streaming-0.174
ming-0.169
onder-0.155
cient-0.153
Token gold
Token ID3869
Feature activation+9.47e-01

Pos logit contributions
hing+0.015
 streaming+0.015
ming+0.014
onder+0.013
cient+0.013
Neg logit contributions
 first-0.010
 moment-0.010
 Mia-0.010
 Sue-0.010
 Jen-0.010
Token!
Token ID0
Feature activation-6.51e-01

Pos logit contributions
 moment+0.027
 first+0.027
 Mia+0.027
 Jen+0.026
 Sue+0.026
Neg logit contributions
hing-0.047
 streaming-0.045
ming-0.044
onder-0.041
cient-0.040
Token It
Token ID632
Feature activation+2.83e-01

Pos logit contributions
hing+0.030
 streaming+0.029
ming+0.028
onder+0.026
severe+0.025
Neg logit contributions
 moment-0.022
 first-0.021
 Mia-0.019
oe-0.019
 Sue-0.019
Tokenâ
Token ID22940
Feature activation+3.66e-02

Pos logit contributions
 moment+0.011
 first+0.011
 Mia+0.010
 Jen+0.010
 Sue+0.010
Neg logit contributions
hing-0.012
 streaming-0.011
ming-0.011
onder-0.009
cient-0.009
Token high
Token ID1029
Feature activation-6.98e-01

Pos logit contributions
hing+0.003
 streaming+0.003
ming+0.003
onder+0.003
ning+0.003
Neg logit contributions
 moment-0.002
 first-0.002
 Mia-0.001
 Jen-0.001
 minute-0.001
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
hing+0.030
 streaming+0.028
ming+0.028
ning+0.026
ting+0.026
Neg logit contributions
 Mia-0.024
 Jen-0.024
 moment-0.024
 Sue-0.023
 minute-0.023
Token bright
Token ID6016
Feature activation-1.19e+00

Pos logit contributions
ila+0.061
 moment+0.059
 apology+0.059
 first+0.059
 minute+0.056
Neg logit contributions
 streaming-0.080
hing-0.079
ming-0.073
 echoed-0.071
 streamed-0.066
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
hing+0.063
 streaming+0.062
ming+0.059
onder+0.055
ning+0.055
Neg logit contributions
 first-0.030
 moment-0.030
 Mia-0.029
 Jen-0.029
ila-0.028
Token It
Token ID632
Feature activation+1.96e-01

Pos logit contributions
 first+0.017
 moment+0.017
oe+0.015
 minute+0.015
 next+0.015
Neg logit contributions
 streaming-0.015
hing-0.015
ming-0.013
 echoed-0.013
severe-0.013
Token still
Token ID991
Feature activation+5.97e-01

Pos logit contributions
 moment+0.007
 first+0.007
 minute+0.007
 Jen+0.007
ila+0.007
Neg logit contributions
hing-0.007
 streaming-0.007
ming-0.007
 echoed-0.007
ning-0.006
Token said
Token ID531
Feature activation-3.59e-01

Pos logit contributions
 first+0.020
 moment+0.020
 Mia+0.019
 Jen+0.019
 minute+0.019
Neg logit contributions
hing-0.026
 streaming-0.026
ming-0.024
 echoed-0.022
onder-0.022
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
hing+0.015
 streaming+0.014
ming+0.014
 echoed+0.013
elcome+0.012
Neg logit contributions
 moment-0.012
oe-0.012
 first-0.011
ila-0.011
iw-0.010
Token "
Token ID366
Feature activation-6.31e-01

Pos logit contributions
 first+0.029
oe+0.028
 moment+0.028
ila+0.028
 Mia+0.027
Neg logit contributions
hing-0.038
 streaming-0.037
ming-0.036
ting-0.033
cient-0.032
TokenCome
Token ID16773
Feature activation-1.15e+00

Pos logit contributions
hing+0.026
ming+0.023
 streaming+0.022
ting+0.020
severe+0.020
Neg logit contributions
 first-0.025
 Mia-0.025
 Sue-0.024
 Jen-0.023
 Jill-0.023
Token to
Token ID284
Feature activation-7.66e-01

Pos logit contributions
hing+0.042
 streaming+0.039
ming+0.039
ting+0.034
ning+0.034
Neg logit contributions
 moment-0.047
 Mia-0.046
 Jen-0.045
 Sue-0.045
ila-0.045
Token make
Token ID787
Feature activation-2.05e-01

Pos logit contributions
 first+0.019
 Mia+0.018
 Sue+0.018
 moment+0.018
 Jen+0.018
Neg logit contributions
hing-0.030
 streaming-0.029
ming-0.028
cient-0.026
onder-0.026
Token too
Token ID1165
Feature activation+1.70e+00

Pos logit contributions
hing+0.011
 streaming+0.011
ming+0.010
onder+0.009
cient+0.009
Neg logit contributions
 first-0.005
 Mia-0.005
 moment-0.005
 Jen-0.005
 Sue-0.005
Token much
Token ID881
Feature activation-1.77e+00

Pos logit contributions
 first+0.042
 moment+0.041
 Mia+0.040
 Jen+0.039
 Sue+0.039
Neg logit contributions
hing-0.095
 streaming-0.090
ming-0.089
onder-0.082
cient-0.082
Token noise
Token ID7838
Feature activation-4.81e-01

Pos logit contributions
hing+0.087
 streaming+0.081
ming+0.080
cient+0.077
 Australian+0.075
Neg logit contributions
 moment-0.054
 first-0.053
 Mia-0.052
 Sue-0.051
 Jen-0.049
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
hing+0.027
 streaming+0.026
ming+0.025
cient+0.024
 Australian+0.024
Neg logit contributions
 first-0.012
 Mia-0.011
 moment-0.011
 Sue-0.010
 Jen-0.010
Token His
Token ID2399
Feature activation+1.79e-01

Pos logit contributions
hing+0.015
 streaming+0.014
ming+0.013
onder+0.012
severe+0.012
Neg logit contributions
 first-0.010
 moment-0.010
 Mia-0.010
 Jen-0.009
 Sue-0.009
Token mom
Token ID1995
Feature activation-8.81e-02

Pos logit contributions
 first+0.005
 moment+0.005
 Mia+0.005
 Jen+0.005
 Sue+0.005
Neg logit contributions
hing-0.009
 streaming-0.009
ming-0.008
onder-0.008
cient-0.008
Token smiled
Token ID13541
Feature activation+9.91e-01

Pos logit contributions
hing+0.004
 streaming+0.004
severe+0.004
ming+0.004
 Australian+0.004
Neg logit contributions
 first-0.003
 moment-0.003
 Mia-0.003
 next-0.002
 Sue-0.002
Token,
Token ID11
Feature activation+9.28e-02

Pos logit contributions
 first+0.023
 moment+0.022
 next+0.021
 Mia+0.020
 one+0.019
Neg logit contributions
hing-0.059
 streaming-0.053
ming-0.053
cient-0.052
 Australian-0.051
Token happy
Token ID3772
Feature activation-2.20e-01

Pos logit contributions
 first+0.003
 moment+0.003
 Mia+0.003
ila+0.003
oe+0.003
Neg logit contributions
 streaming-0.004
hing-0.004
ming-0.004
onder-0.003
ning-0.003
Token to
Token ID284
Feature activation+1.64e+00

Pos logit contributions
hing+0.011
 streaming+0.010
ming+0.010
onder+0.009
cient+0.009
Neg logit contributions
 first-0.007
 moment-0.007
 Mia-0.006
 Jen-0.006
 Sue-0.006
Token child
Token ID1200
Feature activation-1.01e-01

Pos logit contributions
hing+0.006
 streaming+0.005
ming+0.005
severe+0.005
onder+0.005
Neg logit contributions
 first-0.003
 moment-0.002
 Mia-0.002
 Jen-0.002
 Sue-0.002
Token.
Token ID13
Feature activation-4.16e-01

Pos logit contributions
hing+0.004
 streaming+0.004
ming+0.004
onder+0.004
cient+0.004
Neg logit contributions
 first-0.004
 moment-0.004
 Mia-0.003
 Jen-0.003
 Sue-0.003
Token The
Token ID383
Feature activation+1.19e+00

Pos logit contributions
hing+0.024
 streaming+0.022
ming+0.022
onder+0.020
ning+0.020
Neg logit contributions
 Mia-0.010
 first-0.010
 Sue-0.010
 Jen-0.009
 moment-0.009
Token man
Token ID582
Feature activation+2.92e-01

Pos logit contributions
 Sue+0.047
 Mia+0.046
 Jen+0.046
ila+0.045
oe+0.045
Neg logit contributions
hing-0.043
ming-0.040
 streaming-0.040
cient-0.038
onder-0.036
Token and
Token ID290
Feature activation-3.38e-01

Pos logit contributions
 first+0.009
 moment+0.008
 Mia+0.008
 Sue+0.008
 next+0.008
Neg logit contributions
hing-0.015
 streaming-0.014
ming-0.013
onder-0.013
severe-0.013
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
hing+0.018
 streaming+0.017
ming+0.017
onder+0.015
cient+0.015
Neg logit contributions
 first-0.009
 moment-0.009
 Mia-0.008
 Jen-0.008
 Sue-0.008
Token child
Token ID1200
Feature activation-3.13e-01

Pos logit contributions
 Sue+0.011
 Mia+0.011
 Jen+0.011
oe+0.011
 moment+0.011
Neg logit contributions
hing-0.011
 streaming-0.010
ming-0.010
cient-0.010
ting-0.009
Token smiled
Token ID13541
Feature activation+5.43e-01

Pos logit contributions
hing+0.014
 streaming+0.013
ming+0.013
onder+0.013
severe+0.012
Neg logit contributions
 first-0.011
 Mia-0.010
 moment-0.010
 next-0.010
 Sue-0.010
Token at
Token ID379
Feature activation+5.38e-01

Pos logit contributions
 first+0.014
 next+0.013
 moment+0.013
 one+0.012
 Mia+0.012
Neg logit contributions
hing-0.031
ming-0.028
 streaming-0.028
ils-0.026
 Australian-0.026
Token each
Token ID1123
Feature activation-2.10e+00

Pos logit contributions
 first+0.018
 Mia+0.017
 moment+0.017
 Jen+0.017
 Sue+0.016
Neg logit contributions
hing-0.026
 streaming-0.025
ming-0.024
onder-0.022
severe-0.022
Token other
Token ID584
Feature activation+1.13e+00

Pos logit contributions
hing+0.109
 streaming+0.103
ming+0.101
onder+0.092
cient+0.090
Neg logit contributions
 first-0.058
 moment-0.057
 Mia-0.057
 Jen-0.056
 Sue-0.055
Token help
Token ID1037
Feature activation+4.43e-01

Pos logit contributions
hing+0.010
 streaming+0.010
ming+0.009
ting+0.008
onder+0.008
Neg logit contributions
 moment-0.007
 first-0.007
 Mia-0.006
ila-0.006
 minute-0.006
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
 moment+0.010
 first+0.010
 Mia+0.009
 Jen+0.009
ila+0.009
Neg logit contributions
hing-0.025
 streaming-0.024
ming-0.024
onder-0.022
ning-0.021
Token She
Token ID1375
Feature activation-7.36e-01

Pos logit contributions
hing+0.027
 streaming+0.025
ming+0.024
onder+0.022
severe+0.022
Neg logit contributions
 first-0.013
 Mia-0.013
 Sue-0.013
 Jen-0.013
 moment-0.012
Token found
Token ID1043
Feature activation+2.08e-01

Pos logit contributions
hing+0.040
 streaming+0.036
ming+0.034
severe+0.033
onder+0.032
Neg logit contributions
 first-0.023
 Mia-0.023
 Sue-0.021
 Jen-0.021
 next-0.020
Token a
Token ID257
Feature activation+3.26e-01

Pos logit contributions
 first+0.005
 Mia+0.005
 Jen+0.005
 Sue+0.005
 moment+0.005
Neg logit contributions
hing-0.012
 streaming-0.011
ming-0.011
onder-0.010
severe-0.010
Token grown
Token ID7334
Feature activation+2.13e+00

Pos logit contributions
 moment+0.009
 first+0.009
 Mia+0.008
 Sue+0.008
 minute+0.008
Neg logit contributions
hing-0.018
 streaming-0.017
ming-0.017
ug-0.015
ting-0.015
Token-
Token ID12
Feature activation+3.31e-01

Pos logit contributions
 first+0.079
 moment+0.078
 Mia+0.077
 Jen+0.075
 Sue+0.074
Neg logit contributions
hing-0.091
 streaming-0.086
ming-0.083
onder-0.075
cient-0.074
Tokenup
Token ID929
Feature activation+1.26e+00

Pos logit contributions
 first+0.015
 moment+0.014
 Mia+0.014
 Jen+0.014
 Sue+0.014
Neg logit contributions
hing-0.012
 streaming-0.011
ming-0.011
onder-0.009
cient-0.009
Token who
Token ID508
Feature activation+6.52e-01

Pos logit contributions
 first+0.041
 moment+0.040
 Mia+0.039
 Jen+0.038
 Sue+0.038
Neg logit contributions
hing-0.060
 streaming-0.057
ming-0.055
onder-0.051
severe-0.049
Token was
Token ID373
Feature activation+1.92e-01

Pos logit contributions
 first+0.022
 Mia+0.021
 Sue+0.021
 Jen+0.021
 moment+0.021
Neg logit contributions
hing-0.030
 streaming-0.028
ming-0.028
severe-0.026
onder-0.026
Token deaf
Token ID21815
Feature activation+2.94e-01

Pos logit contributions
 first+0.006
 moment+0.006
 Mia+0.006
 Jen+0.006
 Sue+0.006
Neg logit contributions
hing-0.009
 streaming-0.009
ming-0.008
onder-0.008
cient-0.008
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 first+0.003
 moment+0.002
 Jen+0.002
ider+0.002
 minute+0.002
Neg logit contributions
hing-0.004
ming-0.003
 streaming-0.003
onder-0.003
ug-0.003
TokenL
Token ID43
Feature activation+1.21e+00

Pos logit contributions
 first+0.004
 moment+0.004
 Jen+0.004
 Mia+0.004
 Sue+0.004
Neg logit contributions
hing-0.008
ming-0.007
 streaming-0.007
onder-0.007
cient-0.007
Tokenily
Token ID813
Feature activation+1.56e+00

Pos logit contributions
 moment+0.031
 first+0.029
 Jen+0.026
 minute+0.026
 second+0.024
Neg logit contributions
hing-0.065
 streaming-0.059
ming-0.058
onder-0.057
ils-0.057
Token:
Token ID25
Feature activation+7.56e-01

Pos logit contributions
 first+0.048
 Mia+0.045
 moment+0.043
 Sue+0.041
 next+0.041
Neg logit contributions
hing-0.081
 streaming-0.073
ming-0.072
ug-0.069
severe-0.067
Token "
Token ID366
Feature activation-1.32e+00

Pos logit contributions
 first+0.019
 moment+0.018
 Mia+0.018
 Jen+0.017
 Sue+0.017
Neg logit contributions
hing-0.041
 streaming-0.040
ming-0.038
onder-0.036
cient-0.035
TokenThis
Token ID1212
Feature activation+1.14e+00

Pos logit contributions
hing+0.061
 streaming+0.058
ming+0.054
 Australian+0.049
onder+0.049
Neg logit contributions
 first-0.046
 moment-0.045
 Mia-0.044
 Jen-0.043
 Sue-0.043
Token flower
Token ID15061
Feature activation+6.15e-02

Pos logit contributions
 Jen+0.033
 first+0.032
 Mia+0.031
 Jill+0.031
 Sue+0.031
Neg logit contributions
hing-0.054
 streaming-0.051
ming-0.051
ning-0.048
cient-0.048
Token is
Token ID318
Feature activation+1.42e+00

Pos logit contributions
 first+0.001
 moment+0.001
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.004
 streaming-0.003
ming-0.003
onder-0.003
cient-0.003
Token rotting
Token ID44494
Feature activation-1.18e+00

Pos logit contributions
 moment+0.039
 first+0.038
 Mia+0.037
 Jen+0.037
 Sue+0.035
Neg logit contributions
hing-0.073
 streaming-0.071
ming-0.069
onder-0.064
cient-0.063
Token.
Token ID13
Feature activation-9.05e-02

Pos logit contributions
hing+0.069
 streaming+0.066
ming+0.065
onder+0.060
severe+0.060
Neg logit contributions
 first-0.026
 moment-0.024
 Mia-0.023
 Sue-0.022
 Jen-0.022
Token Can
Token ID1680
Feature activation+1.29e+00

Pos logit contributions
hing+0.004
 streaming+0.004
ming+0.004
cient+0.004
onder+0.004
Neg logit contributions
 first-0.003
 moment-0.003
 Mia-0.003
 Jen-0.003
ila-0.003
Token
Token ID220
Feature activation-4.53e-01

Pos logit contributions
hing+0.021
 streaming+0.021
ming+0.019
onder+0.018
 Australian+0.018
Neg logit contributions
 first-0.019
 moment-0.018
ila-0.017
oe-0.017
 Jen-0.017
Token
Token ID198
Feature activation-7.64e-01

Pos logit contributions
hing+0.026
 streaming+0.025
ming+0.025
onder+0.023
cient+0.023
Neg logit contributions
 first-0.010
 moment-0.010
 Mia-0.009
 Jen-0.009
 Sue-0.009
Token
Token ID198
Feature activation-6.12e-01

Pos logit contributions
hing+0.038
 streaming+0.035
ming+0.035
onder+0.031
 Australian+0.031
Neg logit contributions
 first-0.023
 moment-0.022
 Mia-0.022
 Jen-0.022
 Sue-0.021
TokenThe
Token ID464
Feature activation+8.89e-01

Pos logit contributions
hing+0.026
 streaming+0.023
ming+0.023
 Australian+0.020
onder+0.020
Neg logit contributions
 first-0.023
 Jen-0.023
 Mia-0.023
 moment-0.023
 Sue-0.022
Token End
Token ID5268
Feature activation+2.85e+00

Pos logit contributions
 Jen+0.033
 Mia+0.031
 Sue+0.030
 Jill+0.029
ila+0.029
Neg logit contributions
hing-0.038
 streaming-0.037
ming-0.034
cient-0.032
 Australian-0.032
Token.
Token ID13
Feature activation-6.48e-01

Pos logit contributions
 first+0.088
 moment+0.079
 Jen+0.074
oe+0.074
 Sue+0.073
Neg logit contributions
hing-0.131
 streaming-0.129
ming-0.127
 echoed-0.117
onder-0.116
Token<|endoftext|>
Token ID50256
Feature activation-4.84e-01

Pos logit contributions
hing+0.033
 streaming+0.032
ming+0.030
onder+0.028
 Australian+0.028
Neg logit contributions
 first-0.018
 moment-0.017
 Jen-0.016
 Mia-0.016
oe-0.015
TokenJohn
Token ID7554
Feature activation+3.89e-01

Pos logit contributions
hing+0.025
 streaming+0.024
ming+0.023
onder+0.021
ting+0.021
Neg logit contributions
 first-0.013
 Jen-0.012
 moment-0.012
 Mia-0.012
 Sue-0.012
Token was
Token ID373
Feature activation-1.43e+00

Pos logit contributions
 first+0.011
 moment+0.011
 Mia+0.011
 Jen+0.010
 next+0.010
Neg logit contributions
hing-0.020
 streaming-0.019
ming-0.018
severe-0.017
onder-0.017
Token weak
Token ID4939
Feature activation-2.27e+00

Pos logit contributions
hing+0.071
ming+0.064
 streaming+0.064
onder+0.059
severe+0.059
Neg logit contributions
 first-0.048
 Mia-0.044
 moment-0.044
 next-0.043
 Sue-0.043
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
hing+0.132
 streaming+0.121
ming+0.120
severe+0.113
onder+0.113
Neg logit contributions
 first-0.057
 moment-0.051
 Mia-0.048
 next-0.047
 Sue-0.045
Token Like
Token ID4525
Feature activation+1.06e+00

Pos logit contributions
 moment+0.013
 first+0.013
oe+0.012
 Mia+0.011
 Jen+0.011
Neg logit contributions
hing-0.018
 streaming-0.017
ming-0.016
severe-0.015
onder-0.015
Token chess
Token ID19780
Feature activation-4.27e-01

Pos logit contributions
 first+0.030
 moment+0.029
 Mia+0.029
 Jen+0.028
 Sue+0.027
Neg logit contributions
hing-0.055
 streaming-0.052
ming-0.051
onder-0.047
cient-0.046
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
hing+0.023
 streaming+0.022
ming+0.022
onder+0.020
cient+0.020
Neg logit contributions
 first-0.011
 moment-0.011
 Mia-0.010
 Jen-0.010
 Sue-0.010
Token Or
Token ID1471
Feature activation+5.90e-01

Pos logit contributions
hing+0.010
 streaming+0.010
ming+0.009
severe+0.009
onder+0.009
Neg logit contributions
 moment-0.007
 first-0.007
oe-0.006
 Mia-0.006
 Sue-0.006
Token knitting
Token ID41577
Feature activation-1.76e+00

Pos logit contributions
 first+0.014
 moment+0.014
 Mia+0.013
 Jen+0.013
 Sue+0.013
Neg logit contributions
hing-0.033
 streaming-0.032
ming-0.031
onder-0.029
cient-0.028
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
hing+0.094
ming+0.089
 streaming+0.087
onder+0.080
severe+0.079
Neg logit contributions
 moment-0.044
 Jen-0.044
 Sue-0.043
 first-0.043
 Mia-0.042
Token Or
Token ID1471
Feature activation+5.74e-01

Pos logit contributions
hing+0.012
 streaming+0.012
ming+0.011
onder+0.010
severe+0.010
Neg logit contributions
 moment-0.008
 first-0.008
 Mia-0.008
 Jen-0.008
 Sue-0.008
Token sleeping
Token ID11029
Feature activation-1.71e+00

Pos logit contributions
 first+0.015
 moment+0.014
 Mia+0.014
 Jen+0.013
 Sue+0.013
Neg logit contributions
hing-0.032
 streaming-0.030
ming-0.029
onder-0.027
cient-0.027
Token."
Token ID526
Feature activation+1.79e+00

Pos logit contributions
hing+0.095
 streaming+0.093
ming+0.089
onder+0.082
severe+0.081
Neg logit contributions
 first-0.044
 moment-0.040
 Mia-0.040
 next-0.037
 Jen-0.037
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
 moment+0.058
 first+0.057
 minute+0.052
ila+0.051
oe+0.051
Neg logit contributions
 streaming-0.076
hing-0.073
ming-0.069
cient-0.067
 Buster-0.067
Token
Token ID198
Feature activation+5.62e-01

Pos logit contributions
 moment+0.013
 first+0.013
 minute+0.012
 Jen+0.011
oe+0.011
Neg logit contributions
hing-0.019
ming-0.017
 streaming-0.017
ug-0.016
onder-0.016
Token surprised
Token ID6655
Feature activation+2.70e-02

Pos logit contributions
 first+0.002
 moment+0.002
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.003
 streaming-0.003
ming-0.003
onder-0.003
cient-0.003
Token!
Token ID0
Feature activation-6.49e-01

Pos logit contributions
 first+0.001
 moment+0.001
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.001
 streaming-0.001
ming-0.001
onder-0.001
cient-0.001
Token He
Token ID679
Feature activation-1.93e-01

Pos logit contributions
hing+0.033
 streaming+0.032
ming+0.031
onder+0.028
cient+0.028
Neg logit contributions
 first-0.019
 moment-0.018
 Mia-0.017
 Jen-0.017
 Sue-0.017
Token drove
Token ID10357
Feature activation+4.30e-02

Pos logit contributions
hing+0.009
 streaming+0.009
onder+0.008
ming+0.008
cient+0.008
Neg logit contributions
 first-0.006
 moment-0.006
 Mia-0.005
 next-0.005
 Jen-0.005
Token as
Token ID355
Feature activation-1.60e+00

Pos logit contributions
 first+0.002
 moment+0.002
 Mia+0.002
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.002
cient-0.002
Token fast
Token ID3049
Feature activation-2.09e+00

Pos logit contributions
hing+0.090
 streaming+0.084
ming+0.083
ug+0.076
severe+0.076
Neg logit contributions
 moment-0.035
 first-0.034
 Mia-0.034
 Jen-0.033
 Sue-0.032
Token as
Token ID355
Feature activation-1.40e+00

Pos logit contributions
hing+0.106
 streaming+0.101
ming+0.096
cient+0.095
onder+0.094
Neg logit contributions
 first-0.062
 moment-0.058
 Mia-0.058
 Sue-0.057
 Jen-0.056
Token he
Token ID339
Feature activation-1.31e+00

Pos logit contributions
hing+0.074
 streaming+0.070
ming+0.069
ug+0.063
onder+0.063
Neg logit contributions
 moment-0.035
 first-0.033
 Mia-0.032
 Jen-0.031
 minute-0.031
Token could
Token ID714
Feature activation-1.33e+00

Pos logit contributions
hing+0.050
 streaming+0.048
ming+0.044
onder+0.041
cient+0.039
Neg logit contributions
 first-0.058
 moment-0.054
 Mia-0.054
 Jen-0.053
 next-0.052
Token and
Token ID290
Feature activation-5.78e-01

Pos logit contributions
hing+0.068
 streaming+0.063
ming+0.063
onder+0.057
severe+0.056
Neg logit contributions
 moment-0.036
 Mia-0.036
 first-0.035
 Jen-0.035
 Sue-0.034
Token soon
Token ID2582
Feature activation+5.40e-01

Pos logit contributions
hing+0.028
 streaming+0.027
ming+0.026
onder+0.025
cient+0.024
Neg logit contributions
 first-0.019
 moment-0.018
 Mia-0.018
 Jen-0.017
 Sue-0.017
Token around
Token ID1088
Feature activation-2.16e+00

Pos logit contributions
hing+0.021
 streaming+0.019
ming+0.018
onder+0.016
cient+0.016
Neg logit contributions
 first-0.025
 moment-0.025
 Mia-0.025
 Jen-0.024
 Sue-0.024
Token the
Token ID262
Feature activation+9.60e-02

Pos logit contributions
hing+0.099
 streaming+0.095
ming+0.094
ting+0.084
onder+0.082
Neg logit contributions
 Jen-0.068
 first-0.066
 Sue-0.066
 moment-0.065
ila-0.065
Token lamp
Token ID20450
Feature activation+1.43e+00

Pos logit contributions
 first+0.003
 moment+0.003
 next+0.003
 Mia+0.003
 minute+0.002
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.005
severe-0.004
ug-0.004
Token,
Token ID11
Feature activation+6.90e-03

Pos logit contributions
 first+0.038
 moment+0.037
 Mia+0.037
 Jen+0.036
 Sue+0.035
Neg logit contributions
hing-0.076
 streaming-0.072
ming-0.070
onder-0.065
cient-0.064
Token feeling
Token ID4203
Feature activation-2.13e+00

Pos logit contributions
 first+0.000
 moment+0.000
 Jen+0.000
 Mia+0.000
 Sue+0.000
Neg logit contributions
hing-0.000
 streaming-0.000
ming-0.000
onder-0.000
ning-0.000
Token very
Token ID845
Feature activation-8.18e-01

Pos logit contributions
hing+0.115
 streaming+0.109
ming+0.106
cient+0.101
onder+0.100
Neg logit contributions
 first-0.057
 moment-0.055
 Mia-0.055
 Jen-0.052
 Sue-0.051
Token happy
Token ID3772
Feature activation+6.37e-01

Pos logit contributions
hing+0.041
 streaming+0.039
ming+0.038
onder+0.035
cient+0.034
Neg logit contributions
 first-0.024
 moment-0.023
 Mia-0.023
 Jen-0.022
 Sue-0.022
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 first+0.013
 moment+0.013
 Mia+0.012
 Jen+0.011
 Sue+0.011
Neg logit contributions
hing-0.039
 streaming-0.037
ming-0.036
cient-0.034
onder-0.034
Token
Token ID198
Feature activation-2.84e-02

Pos logit contributions
 first+0.003
 moment+0.003
 Jen+0.003
 Mia+0.003
 Sue+0.003
Neg logit contributions
hing-0.006
 streaming-0.006
ming-0.005
onder-0.005
cient-0.005
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
hing+0.001
ming+0.001
 streaming+0.001
onder+0.001
 Australian+0.001
Neg logit contributions
 first-0.001
 moment-0.001
 Jen-0.001
 Mia-0.001
 minute-0.001
TokenNext
Token ID10019
Feature activation-1.34e-02

Pos logit contributions
 first+0.004
 moment+0.003
 Mia+0.003
 Sue+0.003
 Jen+0.003
Neg logit contributions
hing-0.005
 streaming-0.005
ming-0.004
onder-0.004
cient-0.004
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
 first+0.011
 moment+0.010
 Mia+0.010
 Sue+0.010
 Jen+0.009
Neg logit contributions
hing-0.032
 streaming-0.031
ming-0.030
severe-0.028
onder-0.028
Token One
Token ID1881
Feature activation+3.00e+00

Pos logit contributions
hing+0.014
 streaming+0.013
ming+0.013
onder+0.012
severe+0.012
Neg logit contributions
 first-0.009
 moment-0.009
 Mia-0.009
ila-0.008
 Jen-0.008
Token day
Token ID1110
Feature activation-2.02e-01

Pos logit contributions
 Lia+0.104
 first+0.102
ila+0.101
oe+0.100
 Mia+0.100
Neg logit contributions
hing-0.115
 streaming-0.107
ming-0.099
 Australian-0.096
onder-0.094
Token,
Token ID11
Feature activation-6.70e-01

Pos logit contributions
hing+0.012
ming+0.012
 streaming+0.011
severe+0.011
cient+0.011
Neg logit contributions
 Mia-0.004
 first-0.004
 moment-0.004
 Sue-0.004
 Jen-0.004
Token he
Token ID339
Feature activation-8.38e-01

Pos logit contributions
hing+0.038
 streaming+0.037
ming+0.036
onder+0.033
cient+0.033
Neg logit contributions
 first-0.015
 moment-0.015
 Mia-0.015
 Jen-0.014
 Sue-0.014
Token took
Token ID1718
Feature activation-3.44e-01

Pos logit contributions
hing+0.037
 streaming+0.034
ming+0.033
onder+0.030
cient+0.029
Neg logit contributions
 first-0.032
 moment-0.031
 Mia-0.031
 Jen-0.030
 Sue-0.030
Token a
Token ID257
Feature activation-3.59e-01

Pos logit contributions
hing+0.024
cient+0.023
 streaming+0.023
ming+0.022
itated+0.021
Neg logit contributions
 Mia-0.005
 first-0.005
 one-0.005
 Sue-0.005
 Jen-0.004
Token walk
Token ID2513
Feature activation+1.93e+00

Pos logit contributions
hing+0.017
 streaming+0.016
ming+0.016
onder+0.014
cient+0.014
Neg logit contributions
 first-0.012
 moment-0.011
 Mia-0.011
 Jen-0.011
 Sue-0.011
Token in
Token ID287
Feature activation-5.45e-02

Pos logit contributions
 first+0.077
 moment+0.076
 Mia+0.075
 Jen+0.073
 Sue+0.072
Neg logit contributions
hing-0.077
 streaming-0.073
ming-0.070
onder-0.063
cient-0.061
Token the
Token ID262
Feature activation-4.41e-01

Pos logit contributions
hing+0.002
 streaming+0.002
ming+0.002
 Australian+0.002
ning+0.002
Neg logit contributions
ider-0.002
oe-0.002
ila-0.002
 moment-0.002
 first-0.002
Token woods
Token ID16479
Feature activation-1.86e-01

Pos logit contributions
hing+0.021
 streaming+0.020
ming+0.020
ug+0.018
onder+0.018
Neg logit contributions
 first-0.015
 moment-0.014
 Mia-0.014
 Sue-0.014
 Jen-0.014
Token to
Token ID284
Feature activation+8.77e-01

Pos logit contributions
hing+0.033
 streaming+0.031
ming+0.031
onder+0.028
cient+0.028
Neg logit contributions
 first-0.019
 moment-0.019
 Mia-0.018
 Jen-0.018
 Sue-0.018
Token Jill
Token ID22772
Feature activation+1.54e+00

Pos logit contributions
ider+0.022
 moment+0.021
iw+0.020
 minute+0.020
oe+0.018
Neg logit contributions
hing-0.044
 streaming-0.041
ming-0.038
ug-0.037
check-0.037
Token,
Token ID11
Feature activation+2.89e-01

Pos logit contributions
 first+0.048
 moment+0.047
 Mia+0.046
 Jen+0.045
 Sue+0.044
Neg logit contributions
hing-0.075
 streaming-0.071
ming-0.069
onder-0.063
cient-0.062
Token "
Token ID366
Feature activation-8.63e-01

Pos logit contributions
 first+0.008
 moment+0.008
 Jen+0.008
 Mia+0.008
ila+0.008
Neg logit contributions
hing-0.014
 streaming-0.014
ming-0.013
onder-0.012
cient-0.012
TokenLet
Token ID5756
Feature activation+1.16e-01

Pos logit contributions
hing+0.041
 streaming+0.039
ming+0.038
onder+0.034
ning+0.034
Neg logit contributions
 first-0.028
 Mia-0.027
 moment-0.027
 Jen-0.027
 Sue-0.026
Token's
Token ID338
Feature activation-3.75e+00

Pos logit contributions
 first+0.004
 Mia+0.004
 moment+0.004
 Sue+0.004
 next+0.004
Neg logit contributions
hing-0.006
 streaming-0.005
ming-0.005
ug-0.005
severe-0.005
Token go
Token ID467
Feature activation-9.49e-01

Pos logit contributions
hing+0.201
 streaming+0.199
ming+0.197
ning+0.185
ug+0.185
Neg logit contributions
 first-0.112
 Mia-0.093
 Sue-0.093
 next-0.086
 one-0.085
Token play
Token ID711
Feature activation+1.58e+00

Pos logit contributions
hing+0.038
ming+0.036
 streaming+0.034
ting+0.031
ning+0.031
Neg logit contributions
 moment-0.035
 Mia-0.034
 Jen-0.034
ila-0.034
 Sue-0.033
Token!"
Token ID2474
Feature activation+1.44e+00

Pos logit contributions
 first+0.050
 moment+0.050
 Mia+0.048
 Jen+0.047
 Sue+0.047
Neg logit contributions
hing-0.076
 streaming-0.072
ming-0.070
onder-0.064
cient-0.063
Token But
Token ID887
Feature activation-3.79e-01

Pos logit contributions
 first+0.033
ider+0.033
 moment+0.032
ila+0.032
 minute+0.031
Neg logit contributions
hing-0.069
ming-0.067
 streaming-0.066
 echoed-0.062
onder-0.062
Token Jill
Token ID22772
Feature activation+8.63e-01

Pos logit contributions
 streaming+0.019
hing+0.018
ming+0.018
 echoed+0.017
elcome+0.017
Neg logit contributions
ila-0.008
 moment-0.008
onna-0.007
iw-0.007
oe-0.007
Token salad
Token ID20698
Feature activation-1.46e-01

Pos logit contributions
 Jen+0.008
 Mia+0.007
ila+0.007
 Sue+0.007
 Lia+0.007
Neg logit contributions
hing-0.010
 streaming-0.010
ming-0.010
onder-0.009
cient-0.009
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
hing+0.007
 streaming+0.007
ming+0.007
ning+0.006
onder+0.006
Neg logit contributions
 Jen-0.004
 moment-0.004
 Mia-0.004
 Sue-0.004
 first-0.004
Token It
Token ID632
Feature activation+4.93e-01

Pos logit contributions
 moment+0.011
 first+0.011
oe+0.010
 minute+0.010
ila+0.010
Neg logit contributions
 streaming-0.010
hing-0.009
onder-0.008
ming-0.008
cient-0.008
Token had
Token ID550
Feature activation+6.93e-01

Pos logit contributions
 Jen+0.018
 moment+0.018
 first+0.018
 minute+0.017
ila+0.017
Neg logit contributions
hing-0.019
 streaming-0.018
ming-0.018
 echoed-0.016
ning-0.016
Token red
Token ID2266
Feature activation-9.57e-01

Pos logit contributions
 moment+0.022
 first+0.020
oe+0.020
 minute+0.019
ila+0.019
Neg logit contributions
 streaming-0.031
hing-0.031
ming-0.029
onder-0.027
 echoed-0.026
Token tomatoes
Token ID23972
Feature activation-4.26e+00

Pos logit contributions
hing+0.042
 streaming+0.041
ming+0.039
onder+0.037
ning+0.034
Neg logit contributions
 moment-0.032
 first-0.032
 Jen-0.032
ila-0.031
 Mia-0.031
Token,
Token ID11
Feature activation+5.80e-01

Pos logit contributions
hing+0.213
 streaming+0.204
ming+0.199
onder+0.182
ning+0.177
Neg logit contributions
 moment-0.123
 first-0.123
 Mia-0.119
 Jen-0.118
 Sue-0.115
Token green
Token ID4077
Feature activation-8.52e-01

Pos logit contributions
oe+0.018
 moment+0.018
ila+0.017
 Jen+0.017
 minute+0.017
Neg logit contributions
 streaming-0.025
hing-0.024
ming-0.024
onder-0.021
opes-0.020
Token lettuce
Token ID39406
Feature activation-3.51e+00

Pos logit contributions
hing+0.037
 streaming+0.036
ming+0.035
onder+0.032
ning+0.030
Neg logit contributions
 Jen-0.028
 first-0.028
 moment-0.028
ila-0.027
 Mia-0.027
Token,
Token ID11
Feature activation+7.51e-01

Pos logit contributions
hing+0.164
 streaming+0.159
ming+0.154
onder+0.142
ning+0.138
Neg logit contributions
 moment-0.109
 first-0.107
 Jen-0.105
 Mia-0.105
 Sue-0.102
Token and
Token ID290
Feature activation+6.70e-01

Pos logit contributions
 moment+0.025
 Jen+0.024
oe+0.024
ila+0.024
 minute+0.024
Neg logit contributions
 streaming-0.031
hing-0.030
ming-0.030
onder-0.027
ting-0.026
Token "
Token ID366
Feature activation-9.58e-01

Pos logit contributions
ila+0.006
 Jen+0.006
 moment+0.006
 Mia+0.006
 first+0.006
Neg logit contributions
hing-0.010
 streaming-0.010
ming-0.010
ting-0.009
cient-0.009
TokenYeah
Token ID10995
Feature activation-4.27e-01

Pos logit contributions
hing+0.047
ming+0.043
 streaming+0.042
severe+0.038
ting+0.038
Neg logit contributions
 Mia-0.030
 first-0.030
 Jen-0.030
 Sue-0.029
 moment-0.028
Token,
Token ID11
Feature activation-5.76e-01

Pos logit contributions
hing+0.022
 streaming+0.021
ming+0.020
onder+0.019
severe+0.018
Neg logit contributions
 first-0.012
 moment-0.012
 Mia-0.012
 Jen-0.011
 Sue-0.011
Token but
Token ID475
Feature activation-2.10e+00

Pos logit contributions
hing+0.032
 streaming+0.031
ming+0.030
onder+0.028
severe+0.027
Neg logit contributions
 first-0.014
 moment-0.014
 Mia-0.013
 Jen-0.013
 Sue-0.013
Token let
Token ID1309
Feature activation-6.67e-01

Pos logit contributions
hing+0.121
 streaming+0.116
ming+0.113
onder+0.106
cient+0.105
Neg logit contributions
 first-0.044
 moment-0.044
 Mia-0.042
 Jen-0.041
 Sue-0.040
Token's
Token ID338
Feature activation-3.84e+00

Pos logit contributions
 streaming+0.034
hing+0.034
ming+0.029
amb+0.029
 streamed+0.029
Neg logit contributions
 first-0.023
 Mia-0.021
 Sue-0.020
 one-0.019
 next-0.019
Token add
Token ID751
Feature activation+6.14e-01

Pos logit contributions
 streaming+0.207
hing+0.205
ming+0.203
ting+0.188
ning+0.187
Neg logit contributions
 first-0.117
 Mia-0.093
 next-0.091
 Sue-0.088
 moment-0.086
Token some
Token ID617
Feature activation+7.79e-01

Pos logit contributions
 first+0.013
 moment+0.013
 Mia+0.012
 Jen+0.012
 Sue+0.012
Neg logit contributions
hing-0.036
 streaming-0.035
ming-0.034
onder-0.031
cient-0.031
Token onion
Token ID21670
Feature activation-3.89e+00

Pos logit contributions
 first+0.023
 moment+0.022
 Mia+0.021
 minute+0.020
 next+0.020
Neg logit contributions
hing-0.041
 streaming-0.039
ming-0.038
severe-0.034
cient-0.034
Token to
Token ID284
Feature activation+3.23e-02

Pos logit contributions
hing+0.206
 streaming+0.197
ming+0.191
onder+0.176
severe+0.173
Neg logit contributions
 first-0.106
 moment-0.102
 Mia-0.101
 Jen-0.097
 Sue-0.095
Token make
Token ID787
Feature activation-2.51e-01

Pos logit contributions
 first+0.001
 moment+0.001
 Mia+0.001
 Jen+0.001
 Sue+0.001
Neg logit contributions
hing-0.002
 streaming-0.002
ming-0.002
onder-0.001
severe-0.001
Token,
Token ID11
Feature activation-8.95e-01

Pos logit contributions
hing+0.005
 streaming+0.004
ming+0.004
onder+0.004
cient+0.004
Neg logit contributions
 first-0.002
 moment-0.002
 Mia-0.002
 Jen-0.002
 Sue-0.002
Token the
Token ID262
Feature activation+3.90e-01

Pos logit contributions
hing+0.053
 streaming+0.051
ming+0.049
onder+0.046
cient+0.045
Neg logit contributions
 first-0.019
 moment-0.018
 Mia-0.018
 Jen-0.017
 Sue-0.017
Token boss
Token ID6478
Feature activation+4.56e-01

Pos logit contributions
 first+0.014
 moment+0.013
 Mia+0.013
 Sue+0.012
 Jen+0.012
Neg logit contributions
hing-0.018
 streaming-0.017
ming-0.017
onder-0.015
ning-0.015
Tokeny
Token ID88
Feature activation-5.36e-01

Pos logit contributions
 first+0.018
 moment+0.018
 Jen+0.018
 Mia+0.018
 Sue+0.018
Neg logit contributions
hing-0.017
 streaming-0.016
ming-0.016
 Australian-0.014
cient-0.014
Token duck
Token ID22045
Feature activation-3.71e-02

Pos logit contributions
hing+0.026
 streaming+0.025
ming+0.024
onder+0.022
severe+0.021
Neg logit contributions
 first-0.018
 moment-0.017
 Mia-0.017
 Jen-0.016
 Sue-0.016
Tokenling
Token ID1359
Feature activation-3.69e+00

Pos logit contributions
hing+0.001
 streaming+0.001
ming+0.001
severe+0.001
onder+0.001
Neg logit contributions
 first-0.002
 moment-0.001
 Mia-0.001
 Jen-0.001
 Sue-0.001
Token said
Token ID531
Feature activation-6.32e-01

Pos logit contributions
hing+0.153
 streaming+0.139
ming+0.138
onder+0.129
cient+0.127
Neg logit contributions
 first-0.146
 moment-0.142
 Mia-0.137
 next-0.134
 Sue-0.128
Token,
Token ID11
Feature activation+4.22e-01

Pos logit contributions
hing+0.033
 streaming+0.031
ming+0.030
onder+0.028
cient+0.027
Neg logit contributions
 first-0.018
 moment-0.018
 Mia-0.018
 Jen-0.017
 Sue-0.017
Token "
Token ID366
Feature activation-9.18e-01

Pos logit contributions
 first+0.012
 moment+0.012
 Mia+0.012
 Jen+0.012
 Sue+0.011
Neg logit contributions
hing-0.021
 streaming-0.020
ming-0.020
onder-0.018
cient-0.018
TokenSc
Token ID3351
Feature activation-6.23e-01

Pos logit contributions
hing+0.041
 streaming+0.038
ming+0.037
ting+0.033
onder+0.033
Neg logit contributions
 first-0.033
 Mia-0.032
 Jen-0.031
 Sue-0.031
 moment-0.031
Tokenatter
Token ID1436
Feature activation-5.44e-01

Pos logit contributions
hing+0.022
 streaming+0.020
ming+0.020
onder+0.017
cient+0.017
Neg logit contributions
 first-0.028
 moment-0.027
 Mia-0.027
 Jen-0.027
 Sue-0.026
Token dad
Token ID9955
Feature activation-7.20e-01

Pos logit contributions
 first+0.001
 moment+0.001
 next+0.001
 Jen+0.001
 Mia+0.001
Neg logit contributions
hing-0.001
ming-0.001
 streaming-0.001
severe-0.001
ting-0.001
Token said
Token ID531
Feature activation-4.24e-01

Pos logit contributions
hing+0.042
 streaming+0.040
severe+0.039
ming+0.038
 Australian+0.037
Neg logit contributions
 first-0.017
 Mia-0.015
 Sue-0.015
 moment-0.015
 next-0.015
Token,
Token ID11
Feature activation+5.70e-01

Pos logit contributions
hing+0.021
 streaming+0.020
ming+0.019
onder+0.017
cient+0.017
Neg logit contributions
 first-0.013
 moment-0.013
 Mia-0.013
 Jen-0.012
 Sue-0.012
Token "
Token ID366
Feature activation-7.40e-01

Pos logit contributions
 first+0.017
 moment+0.016
ila+0.016
 Mia+0.016
 Jen+0.016
Neg logit contributions
hing-0.027
 streaming-0.027
ming-0.026
onder-0.023
cient-0.023
TokenLet
Token ID5756
Feature activation+8.69e-02

Pos logit contributions
hing+0.034
ming+0.031
 streaming+0.029
ting+0.026
ning+0.026
Neg logit contributions
 first-0.026
 Mia-0.025
 Jen-0.024
 Sue-0.024
 moment-0.024
Token's
Token ID338
Feature activation-3.74e+00

Pos logit contributions
 first+0.003
 Mia+0.003
 moment+0.003
 Sue+0.003
 next+0.003
Neg logit contributions
hing-0.004
 streaming-0.004
ming-0.004
severe-0.003
ug-0.003
Token add
Token ID751
Feature activation+8.86e-01

Pos logit contributions
 streaming+0.195
hing+0.194
ming+0.191
ting+0.180
ning+0.177
Neg logit contributions
 first-0.118
 Mia-0.097
 Sue-0.096
 Jen-0.095
 next-0.093
Token some
Token ID617
Feature activation+9.37e-01

Pos logit contributions
 first+0.019
 moment+0.019
 Mia+0.018
 Jen+0.018
 Sue+0.017
Neg logit contributions
hing-0.051
 streaming-0.049
ming-0.048
onder-0.045
cient-0.044
Token fire
Token ID2046
Feature activation-7.66e-02

Pos logit contributions
 first+0.025
 moment+0.023
 Mia+0.023
 Jen+0.022
 Sue+0.022
Neg logit contributions
hing-0.051
 streaming-0.049
ming-0.048
severe-0.044
cient-0.044
Tokenwork
Token ID1818
Feature activation-1.07e-01

Pos logit contributions
 streaming+0.004
hing+0.004
ming+0.004
 Australian+0.004
severe+0.004
Neg logit contributions
 first-0.002
 Mia-0.002
 moment-0.002
 next-0.002
 Jen-0.002
Token to
Token ID284
Feature activation+1.51e-02

Pos logit contributions
hing+0.006
 streaming+0.006
ming+0.005
 Australian+0.005
severe+0.005
Neg logit contributions
 first-0.003
 moment-0.003
 Mia-0.003
 next-0.002
 Jen-0.002