TOP ACTIVATIONS
MAX = 8.103

instinct can often save us unnecessary danger . <|endoftext|> Once upon
your parents and not take unnecessary risks . <|endoftext|> One day
asked her mother what she meant . Her mother smiled and
Jack loved printing , which meant that he could make lots
had resisted , as it meant she could play the piano
learned that sometimes panic is unnecessary . Everything was okay ,
friends , and never take unnecessary risks . <|endoftext|> Once upon
't like all of the unnecessary extravag ance .
find out what it all meant . She
be prepared and not take unnecessary risks . <|endoftext|> Once upon
the project and hadn 't meant to make a mess .
thought that life was just meant to be fun and laughing
be careful and not take unnecessary risks . The moral of
why and what it all meant , but her mom and
't know what decor ate meant , but he wanted to
and friends and not take unnecessary risks . We should always
remember how much it had meant to her . From then
B enny knew that meant it might rain soon .
more cautious and not take unnecessary risks . The memory of
It was meant for a princess . â

MIN ACTIVATIONS
MIN = -6.993

. The more , the mer rier . Come to my
passport belonged to a magical mer maid who had been visiting
many more adventures with the mer maid . The
was so thankful that the mer maid had forgiven him .
was happy and sailed away mer rily . <|endoftext|> Once there
man watched as she skipped mer rily away into the sunset
too late . The star fish poison is strong . Tom
! Ben touched a jelly fish ! He is hurt !"
flake is small and has eight points . They are both
Let 's pretend we are mer maid s and swim in
" J elly fish are very interesting , but
. He touches the star fish . The star fish moves
, six , seven , eight , nine , ten ,
They left the jelly fish on the sand and ran
Trust me . The jelly fish is not a good friend
Can we see a sword fish , Mom ?"
I still think the jelly fish is pretty ." L ila
didn 't need the jelly fish . They had each other
" And I 'm a mer maid !" Lily said .
in the sea that the mer maid carefully chose . It

INTERVAL 4.329 - 8.103
CONTAINS 0.231%

thing to study and learn about with his friends . <|endoftext|>
wet for a carrot . Then it tried the playground ,
. Every time it r ained , he would go outside
. They made up stories about the animals and people who
L ily thought about what Tim my said and

INTERVAL 0.555 - 4.329
CONTAINS 31.813%

and chew . He che wed on sticks , balls ,
After a long day of flying , Buzz y was
are happy when they make nice pictures . One
Once upon a time , there was a little boy named
special ," she said . The rabbit hopped away

INTERVAL -3.219 - 0.555
CONTAINS 66.949%

was a big ocean , so it was hard to know
. Mama , Papa and even Grand ma were very impressed
the door and pushed the box inside . She saw her
" I know you wanted to have fun , but you
. She was crying . " Too bad !"

INTERVAL -6.993 - -3.219
CONTAINS 1.007%

are a great cook . I am very proud of you
Tom and Mia waved goodbye to the guard and ran
her eyes and soon fell asleep . In the morning ,
box and found an old bar of soap . Together they
day ever ! <|endoftext|> Once upon a time , there was
Token instinct
Token ID13311
Feature activation+4.62e-01

Pos logit contributions
 la+0.026
opus+0.025
 microphone+0.025
 bishop+0.024
 player+0.024
Neg logit contributions
 Timothy-0.023
 Karen-0.020
 Kitty-0.019
 Bill-0.019
 Samantha-0.019
Token can
Token ID460
Feature activation-7.25e-01

Pos logit contributions
 Timothy+0.027
 Karen+0.024
 about+0.023
 Bill+0.023
 Samantha+0.023
Neg logit contributions
 microphone-0.032
opus-0.032
 la-0.032
 rook-0.031
 bishop-0.031
Token often
Token ID1690
Feature activation-2.29e+00

Pos logit contributions
 la+0.049
 microphone+0.049
opus+0.048
 rook+0.048
 bishop+0.047
Neg logit contributions
 Timothy-0.046
 Karen-0.041
 Bill-0.039
 Kitty-0.038
 Samantha-0.038
Token save
Token ID3613
Feature activation+5.85e-01

Pos logit contributions
opus+0.155
 microphone+0.149
 player+0.145
 la+0.145
 rook+0.142
Neg logit contributions
 Timothy-0.143
 Karen-0.128
 Bill-0.124
 about-0.124
 Samantha-0.122
Token us
Token ID514
Feature activation+1.30e+00

Pos logit contributions
 Timothy+0.038
 Karen+0.034
 Samantha+0.032
 Su+0.032
 Kitty+0.032
Neg logit contributions
 microphone-0.039
 la-0.037
 player-0.036
 dancer-0.036
opus-0.036
Token unnecessary
Token ID13114
Feature activation+8.10e+00

Pos logit contributions
 Timothy+0.077
 Karen+0.067
 Samantha+0.063
 about+0.063
 Bill+0.063
Neg logit contributions
opus-0.091
 microphone-0.091
 la-0.091
 player-0.088
 dancer-0.088
Token danger
Token ID3514
Feature activation+2.30e+00

Pos logit contributions
 Timothy+0.439
 Karen+0.378
 Kitty+0.353
 Samantha+0.352
 Bill+0.347
Neg logit contributions
 microphone-0.615
opus-0.612
 la-0.600
 player-0.596
 thumb-0.593
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
 Timothy+0.127
 Karen+0.112
 about+0.109
 Kitty+0.106
 Bill+0.103
Neg logit contributions
 microphone-0.169
opus-0.168
 thumb-0.166
 rook-0.165
 bishop-0.164
Token<|endoftext|>
Token ID50256
Feature activation-2.81e+00

Pos logit contributions
 Timothy+0.024
 Karen+0.021
 Samantha+0.020
 Su+0.020
 Bill+0.019
Neg logit contributions
 microphone-0.031
opus-0.030
 thumb-0.029
 la-0.029
 player-0.029
TokenOnce
Token ID7454
Feature activation-1.89e+00

Pos logit contributions
 microphone+0.190
 la+0.184
 player+0.184
opus+0.182
 dancer+0.181
Neg logit contributions
 Timothy-0.179
 Karen-0.157
 Samantha-0.149
 Su-0.146
 Kitty-0.145
Token upon
Token ID2402
Feature activation-3.43e+00

Pos logit contributions
 microphone+0.109
 la+0.105
opus+0.102
 player+0.101
 rook+0.101
Neg logit contributions
 Timothy-0.143
 Karen-0.129
 Su-0.125
 Samantha-0.124
 Sue-0.123
Token your
Token ID534
Feature activation-1.91e+00

Pos logit contributions
 Timothy+0.004
 Karen+0.003
 Samantha+0.003
 Kitty+0.003
 Su+0.003
Neg logit contributions
 microphone-0.003
opus-0.003
 la-0.003
 player-0.003
 dancer-0.003
Token parents
Token ID3397
Feature activation+7.98e-01

Pos logit contributions
 la+0.125
opus+0.123
 microphone+0.123
 player+0.118
 dancer+0.116
Neg logit contributions
 Timothy-0.121
 Karen-0.109
 Samantha-0.104
 Kitty-0.104
 Bill-0.102
Token and
Token ID290
Feature activation+3.97e-01

Pos logit contributions
 Timothy+0.044
 about+0.042
 Karen+0.039
 meant+0.038
 Samantha+0.037
Neg logit contributions
opus-0.057
 la-0.056
 microphone-0.055
 dancer-0.054
 rook-0.053
Token not
Token ID407
Feature activation+5.58e-01

Pos logit contributions
 Timothy+0.023
 Karen+0.021
 Samantha+0.020
 Kitty+0.020
 Bill+0.020
Neg logit contributions
opus-0.028
 microphone-0.028
 la-0.028
 player-0.027
 dancer-0.026
Token take
Token ID1011
Feature activation-9.43e-01

Pos logit contributions
 Timothy+0.036
 Karen+0.032
 Kitty+0.031
 Samantha+0.031
 Bill+0.031
Neg logit contributions
opus-0.036
 microphone-0.036
 la-0.035
 player-0.034
 rook-0.034
Token unnecessary
Token ID13114
Feature activation+7.78e+00

Pos logit contributions
 microphone+0.064
 la+0.060
 brain+0.060
 dancer+0.060
 player+0.059
Neg logit contributions
 Timothy-0.062
 Karen-0.054
 Su-0.051
Kim-0.051
 Samantha-0.051
Token risks
Token ID7476
Feature activation+3.85e+00

Pos logit contributions
 Timothy+0.389
 Karen+0.329
 Samantha+0.307
 Kitty+0.304
 Su+0.297
Neg logit contributions
 microphone-0.626
opus-0.617
 la-0.611
 player-0.606
 dancer-0.604
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 Timothy+0.205
 about+0.178
 Karen+0.178
 Kitty+0.167
 Samantha+0.164
Neg logit contributions
opus-0.290
 microphone-0.287
 la-0.286
 thumb-0.279
 rook-0.278
Token<|endoftext|>
Token ID50256
Feature activation-2.95e+00

Pos logit contributions
 Timothy+0.009
 Karen+0.008
 Samantha+0.008
 Su+0.008
 Kitty+0.007
Neg logit contributions
 microphone-0.010
 la-0.010
 player-0.010
opus-0.010
 dancer-0.010
TokenOne
Token ID3198
Feature activation-5.99e-03

Pos logit contributions
 microphone+0.184
 player+0.180
 la+0.175
opus+0.174
 dancer+0.174
Neg logit contributions
 Timothy-0.205
 Karen-0.180
 Samantha-0.173
 Su-0.168
 Kitty-0.165
Token day
Token ID1110
Feature activation-1.02e+00

Pos logit contributions
 microphone+0.000
opus+0.000
 la+0.000
 rook+0.000
 bishop+0.000
Neg logit contributions
 Timothy-0.000
 Karen-0.000
 Sue-0.000
 about-0.000
 Su-0.000
Token asked
Token ID1965
Feature activation+2.78e+00

Pos logit contributions
 Timothy+0.088
 Karen+0.079
 theirs+0.078
 Samantha+0.077
 Su+0.074
Neg logit contributions
opus-0.052
 microphone-0.051
 player-0.050
 la-0.050
 team-0.050
Token her
Token ID607
Feature activation+5.70e-01

Pos logit contributions
 Timothy+0.167
 Karen+0.146
 Samantha+0.137
 Su+0.135
 Kitty+0.135
Neg logit contributions
 microphone-0.197
 la-0.190
 player-0.188
 dancer-0.187
 thumb-0.187
Token mother
Token ID2802
Feature activation+2.36e+00

Pos logit contributions
 Timothy+0.040
 Karen+0.036
 Samantha+0.034
 Su+0.034
 Kitty+0.034
Neg logit contributions
 microphone-0.035
 la-0.034
opus-0.033
 player-0.033
 dancer-0.033
Token what
Token ID644
Feature activation+4.19e+00

Pos logit contributions
 Timothy+0.142
 Karen+0.125
 Samantha+0.118
 Su+0.116
 Kitty+0.116
Neg logit contributions
 microphone-0.168
 la-0.163
opus-0.163
 player-0.161
 dancer-0.161
Token she
Token ID673
Feature activation+7.85e-01

Pos logit contributions
 Timothy+0.293
 Karen+0.251
 Samantha+0.241
Kim+0.239
 Su+0.238
Neg logit contributions
 microphone-0.261
 player-0.247
 dancer-0.244
 thumb-0.239
 la-0.239
Token meant
Token ID4001
Feature activation+7.77e+00

Pos logit contributions
 Timothy+0.058
 Karen+0.051
 Samantha+0.049
 Su+0.049
Kim+0.048
Neg logit contributions
 microphone-0.044
 player-0.044
 la-0.043
 thumb-0.043
opus-0.043
Token.
Token ID13
Feature activation+8.70e-01

Pos logit contributions
 Timothy+0.466
 Karen+0.408
 Samantha+0.386
 Su+0.382
 about+0.379
Neg logit contributions
 microphone-0.551
opus-0.548
 la-0.547
 dancer-0.530
 player-0.528
Token Her
Token ID2332
Feature activation+1.16e+00

Pos logit contributions
 Timothy+0.058
 Karen+0.052
 Samantha+0.049
 Su+0.048
 Kitty+0.048
Neg logit contributions
 microphone-0.057
 la-0.055
opus-0.055
 player-0.054
 dancer-0.054
Token mother
Token ID2802
Feature activation+1.93e+00

Pos logit contributions
 Timothy+0.090
 Karen+0.080
 Samantha+0.076
 Su+0.076
 Kitty+0.075
Neg logit contributions
 microphone-0.064
 thumb-0.061
 la-0.059
 player-0.059
 dancer-0.059
Token smiled
Token ID13541
Feature activation-1.64e+00

Pos logit contributions
 Timothy+0.112
 Karen+0.097
 Samantha+0.092
 Su+0.090
 Kitty+0.090
Neg logit contributions
 microphone-0.142
 la-0.138
opus-0.136
 player-0.136
 dancer-0.135
Token and
Token ID290
Feature activation+9.60e-01

Pos logit contributions
opus+0.120
 microphone+0.117
 la+0.116
 rook+0.113
 player+0.111
Neg logit contributions
 Timothy-0.095
 Karen-0.084
 about-0.082
 Samantha-0.081
 Bill-0.080
Token Jack
Token ID3619
Feature activation-7.01e-01

Pos logit contributions
 Timothy+0.084
 Karen+0.073
 Samantha+0.069
 Kitty+0.069
 Su+0.069
Neg logit contributions
 microphone-0.096
 la-0.092
opus-0.092
 dancer-0.091
 player-0.090
Token loved
Token ID6151
Feature activation-1.60e+00

Pos logit contributions
 microphone+0.048
 la+0.047
opus+0.046
 dancer+0.046
 rook+0.045
Neg logit contributions
 Timothy-0.044
 Karen-0.039
 Kitty-0.037
 Samantha-0.037
 Su-0.036
Token printing
Token ID13570
Feature activation+1.94e-01

Pos logit contributions
 microphone+0.106
 la+0.101
 dancer+0.100
 thumb+0.100
 player+0.099
Neg logit contributions
 Timothy-0.106
 Karen-0.093
 Samantha-0.089
 Su-0.087
Kim-0.085
Token,
Token ID11
Feature activation+4.61e-01

Pos logit contributions
 Timothy+0.012
 Karen+0.011
 Samantha+0.010
 Kitty+0.010
 Su+0.010
Neg logit contributions
 microphone-0.014
 la-0.013
 player-0.013
opus-0.013
 dancer-0.013
Token which
Token ID543
Feature activation+3.05e+00

Pos logit contributions
 Timothy+0.030
 Karen+0.026
 Kitty+0.025
 Su+0.025
 Bill+0.024
Neg logit contributions
 microphone-0.033
opus-0.032
 dancer-0.031
 player-0.031
 la-0.031
Token meant
Token ID4001
Feature activation+7.75e+00

Pos logit contributions
 Timothy+0.203
 Karen+0.179
 Kitty+0.170
 Samantha+0.168
 Bill+0.167
Neg logit contributions
 la-0.201
opus-0.195
 microphone-0.195
 dancer-0.189
 bishop-0.185
Token that
Token ID326
Feature activation+4.57e-01

Pos logit contributions
 Timothy+0.474
 Karen+0.413
 Su+0.389
 Bill+0.387
 Samantha+0.387
Neg logit contributions
 microphone-0.539
 la-0.533
opus-0.532
 dancer-0.520
 player-0.516
Token he
Token ID339
Feature activation-4.59e-01

Pos logit contributions
 Timothy+0.030
 Karen+0.027
 Kitty+0.025
 Samantha+0.025
 Bill+0.025
Neg logit contributions
 microphone-0.030
 la-0.030
opus-0.029
 dancer-0.028
 thumb-0.028
Token could
Token ID714
Feature activation+5.67e-01

Pos logit contributions
 microphone+0.034
 la+0.034
opus+0.033
 cheek+0.033
 dancer+0.033
Neg logit contributions
 Timothy-0.025
 Karen-0.021
 meant-0.020
 Su-0.020
 Bill-0.020
Token make
Token ID787
Feature activation-2.02e-01

Pos logit contributions
 Timothy+0.035
 Karen+0.031
 Bill+0.030
 Su+0.030
 about+0.030
Neg logit contributions
opus-0.038
 la-0.038
 microphone-0.035
 cheek-0.035
 dancer-0.034
Token lots
Token ID6041
Feature activation+3.04e-01

Pos logit contributions
 microphone+0.014
 la+0.014
opus+0.014
 player+0.014
 dancer+0.013
Neg logit contributions
 Timothy-0.012
 Karen-0.011
 Samantha-0.010
 Su-0.010
 Kitty-0.010
Token had
Token ID550
Feature activation+4.68e-01

Pos logit contributions
 microphone+0.010
opus+0.010
 rook+0.009
 thumb+0.009
 la+0.009
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 about-0.006
 Samantha-0.006
 Su-0.006
Token resisted
Token ID26643
Feature activation+1.01e+00

Pos logit contributions
 Timothy+0.027
 Karen+0.023
 Samantha+0.022
 Su+0.022
 Bill+0.021
Neg logit contributions
 microphone-0.035
opus-0.034
 la-0.034
 player-0.034
 dancer-0.033
Token,
Token ID11
Feature activation-3.08e-01

Pos logit contributions
 Timothy+0.064
 Karen+0.056
 Samantha+0.054
 Su+0.053
 Kitty+0.053
Neg logit contributions
 microphone-0.069
 la-0.067
opus-0.066
 player-0.066
 dancer-0.066
Token as
Token ID355
Feature activation+1.30e+00

Pos logit contributions
 microphone+0.021
opus+0.021
 la+0.020
 player+0.020
 dancer+0.020
Neg logit contributions
 Timothy-0.020
 Karen-0.017
 Samantha-0.017
 Su-0.016
 Kitty-0.016
Token it
Token ID340
Feature activation+1.08e+00

Pos logit contributions
 Timothy+0.088
 Karen+0.079
 Samantha+0.075
 Su+0.074
 Kitty+0.074
Neg logit contributions
 microphone-0.084
opus-0.082
 la-0.081
 player-0.080
 thumb-0.079
Token meant
Token ID4001
Feature activation+7.71e+00

Pos logit contributions
 Timothy+0.062
 Karen+0.054
 Samantha+0.051
 Su+0.051
 Kitty+0.051
Neg logit contributions
 microphone-0.080
 la-0.077
opus-0.077
 thumb-0.075
 rook-0.075
Token she
Token ID673
Feature activation+1.69e-01

Pos logit contributions
 Timothy+0.498
 Karen+0.435
 Samantha+0.414
 Su+0.408
 Kitty+0.404
Neg logit contributions
 microphone-0.515
 la-0.495
 player-0.488
 dancer-0.487
 thumb-0.482
Token could
Token ID714
Feature activation+5.67e-01

Pos logit contributions
 Timothy+0.010
 Karen+0.009
 Samantha+0.008
 Su+0.008
 Kitty+0.008
Neg logit contributions
 microphone-0.012
 la-0.012
opus-0.012
 dancer-0.012
 player-0.011
Token play
Token ID711
Feature activation-1.01e+00

Pos logit contributions
 Timothy+0.036
 Karen+0.032
 Samantha+0.031
 Kitty+0.030
 Bill+0.030
Neg logit contributions
 microphone-0.038
opus-0.038
 la-0.037
 rook-0.036
 bishop-0.036
Token the
Token ID262
Feature activation+7.92e-01

Pos logit contributions
opus+0.069
 thumb+0.066
 microphone+0.066
 la+0.065
 rook+0.065
Neg logit contributions
 Timothy-0.063
 Karen-0.055
 Sue-0.054
 Kitty-0.054
 about-0.054
Token piano
Token ID19132
Feature activation-2.11e+00

Pos logit contributions
 Timothy+0.052
 Karen+0.046
 Kitty+0.043
 Su+0.043
 Samantha+0.043
Neg logit contributions
 la-0.044
opus-0.043
 microphone-0.042
 rook-0.041
acy-0.041
Token learned
Token ID4499
Feature activation-7.68e-01

Pos logit contributions
 microphone+0.048
opus+0.047
 la+0.046
 dancer+0.045
 rook+0.045
Neg logit contributions
 Timothy-0.036
 Karen-0.032
 Su-0.030
 Samantha-0.030
 Bill-0.030
Token that
Token ID326
Feature activation-1.28e-01

Pos logit contributions
 microphone+0.047
 dancer+0.042
 guitar+0.042
 tut+0.041
 brain+0.041
Neg logit contributions
 Timothy-0.057
 Karen-0.049
 Samantha-0.048
Kim-0.048
 Su-0.047
Token sometimes
Token ID3360
Feature activation-1.08e+00

Pos logit contributions
opus+0.010
 la+0.010
 microphone+0.009
 player+0.009
ater+0.009
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Kitty-0.006
 about-0.006
 Samantha-0.006
Token panic
Token ID13619
Feature activation-2.52e-01

Pos logit contributions
opus+0.076
 microphone+0.075
 la+0.074
 player+0.073
 thumb+0.071
Neg logit contributions
 Timothy-0.064
 Karen-0.057
 Samantha-0.054
 Kitty-0.053
 Su-0.053
Token is
Token ID318
Feature activation-8.56e-02

Pos logit contributions
 microphone+0.015
 la+0.014
opus+0.014
 player+0.014
 dancer+0.014
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Samantha-0.015
 Kitty-0.015
 Su-0.015
Token unnecessary
Token ID13114
Feature activation+7.67e+00

Pos logit contributions
 microphone+0.006
 brain+0.005
 dancer+0.005
 thumb+0.005
 guitar+0.005
Neg logit contributions
 Timothy-0.006
 Karen-0.005
Kim-0.005
 Samantha-0.005
 Su-0.005
Token.
Token ID13
Feature activation+3.84e-01

Pos logit contributions
 Timothy+0.493
 Karen+0.431
 Samantha+0.411
 Su+0.405
 Kitty+0.399
Neg logit contributions
 microphone-0.525
 la-0.502
 player-0.498
 dancer-0.495
opus-0.493
Token Everything
Token ID11391
Feature activation+3.79e+00

Pos logit contributions
 Timothy+0.024
 Karen+0.021
 Samantha+0.020
 Su+0.020
 Kitty+0.020
Neg logit contributions
 microphone-0.027
opus-0.026
 la-0.026
fish-0.025
 player-0.025
Token was
Token ID373
Feature activation+9.47e-01

Pos logit contributions
 Timothy+0.240
 Karen+0.215
 Kitty+0.202
 Samantha+0.201
 Su+0.200
Neg logit contributions
 microphone-0.246
opus-0.245
 la-0.245
 player-0.237
 rook-0.237
Token okay
Token ID8788
Feature activation+4.40e+00

Pos logit contributions
 Timothy+0.065
ibi+0.059
cream+0.059
 Karen+0.058
Kim+0.057
Neg logit contributions
 microphone-0.051
 alone-0.047
 guitar-0.045
 thumb-0.045
 star-0.045
Token,
Token ID11
Feature activation-4.61e-01

Pos logit contributions
 Timothy+0.349
 Karen+0.303
 Samantha+0.296
 Su+0.287
Kim+0.285
Neg logit contributions
 microphone-0.241
 player-0.227
 thumb-0.226
 brain-0.226
 dancer-0.226
Token friends
Token ID2460
Feature activation+1.55e+00

Pos logit contributions
 microphone+0.123
 la+0.120
opus+0.118
 player+0.117
 dancer+0.117
Neg logit contributions
 Timothy-0.123
 Karen-0.109
 Samantha-0.103
 Su-0.102
 Kitty-0.102
Token,
Token ID11
Feature activation-6.94e-01

Pos logit contributions
 Timothy+0.085
 about+0.080
 Karen+0.073
 Bill+0.071
 meant+0.070
Neg logit contributions
 la-0.112
opus-0.111
 microphone-0.110
 rook-0.109
 dancer-0.108
Token and
Token ID290
Feature activation+3.72e-01

Pos logit contributions
 microphone+0.050
 la+0.048
opus+0.048
 player+0.047
 dancer+0.047
Neg logit contributions
 Timothy-0.042
 Karen-0.036
 Samantha-0.034
 Su-0.034
 Kitty-0.034
Token never
Token ID1239
Feature activation-1.05e+00

Pos logit contributions
 Timothy+0.021
 Karen+0.018
 Bill+0.017
 Samantha+0.017
 Kitty+0.017
Neg logit contributions
 microphone-0.028
 la-0.027
opus-0.027
 dancer-0.026
 player-0.026
Token take
Token ID1011
Feature activation-8.52e-01

Pos logit contributions
 microphone+0.061
 la+0.058
 player+0.058
opus+0.058
 dancer+0.057
Neg logit contributions
 Timothy-0.078
 Karen-0.070
 Samantha-0.067
 Su-0.066
 Kitty-0.066
Token unnecessary
Token ID13114
Feature activation+7.66e+00

Pos logit contributions
 microphone+0.062
 brain+0.060
 la+0.059
 thumb+0.059
 dancer+0.058
Neg logit contributions
 Timothy-0.051
 Karen-0.044
 Samantha-0.041
Kim-0.041
 Su-0.041
Token risks
Token ID7476
Feature activation+3.75e+00

Pos logit contributions
 Timothy+0.389
 Karen+0.330
 Samantha+0.307
 Su+0.301
 Kitty+0.298
Neg logit contributions
 microphone-0.622
 la-0.604
opus-0.596
 player-0.596
 dancer-0.594
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
 Timothy+0.196
 about+0.170
 Karen+0.164
 Su+0.154
 Bill+0.154
Neg logit contributions
 microphone-0.289
 la-0.285
opus-0.285
 rook-0.278
 thumb-0.278
Token<|endoftext|>
Token ID50256
Feature activation-2.60e+00

Pos logit contributions
 Timothy+0.018
 Karen+0.016
 Samantha+0.015
 Su+0.015
 Kitty+0.014
Neg logit contributions
 microphone-0.017
 player-0.017
 la-0.017
opus-0.016
 dancer-0.016
TokenOnce
Token ID7454
Feature activation-1.72e+00

Pos logit contributions
 microphone+0.158
 player+0.154
opus+0.150
 la+0.149
 dancer+0.148
Neg logit contributions
 Timothy-0.181
 Karen-0.160
 Samantha-0.154
 Su-0.150
 theirs-0.148
Token upon
Token ID2402
Feature activation-3.47e+00

Pos logit contributions
 microphone+0.075
 la+0.071
opus+0.071
 player+0.070
 dancer+0.070
Neg logit contributions
 Timothy-0.150
 Karen-0.137
 Samantha-0.132
 Kitty-0.130
 Su-0.130
Token't
Token ID470
Feature activation-1.24e+00

Pos logit contributions
 Timothy+0.104
 Karen+0.095
 Samantha+0.093
 Su+0.092
 Kitty+0.091
Neg logit contributions
 microphone-0.041
opus-0.038
 player-0.037
 la-0.037
 dancer-0.037
Token like
Token ID588
Feature activation+3.23e-01

Pos logit contributions
 microphone+0.083
 la+0.083
 dancer+0.082
 brain+0.081
 player+0.081
Neg logit contributions
 Timothy-0.080
 Karen-0.069
 Samantha-0.065
 Bill-0.064
 Kitty-0.063
Token all
Token ID477
Feature activation-1.54e+00

Pos logit contributions
 Timothy+0.020
 Karen+0.017
 Samantha+0.016
 Su+0.016
 Kitty+0.016
Neg logit contributions
 microphone-0.023
opus-0.022
 la-0.022
 player-0.022
 dancer-0.022
Token of
Token ID286
Feature activation+6.58e-01

Pos logit contributions
 microphone+0.099
 dancer+0.094
 player+0.092
 la+0.091
 brain+0.090
Neg logit contributions
 Timothy-0.103
Kim-0.089
 Karen-0.087
 resolved-0.084
 Samantha-0.084
Token the
Token ID262
Feature activation+6.31e-01

Pos logit contributions
 Timothy+0.042
 Karen+0.037
 Samantha+0.035
Kim+0.035
 Su+0.034
Neg logit contributions
 microphone-0.044
 dancer-0.041
 la-0.041
 thumb-0.041
 brain-0.041
Token unnecessary
Token ID13114
Feature activation+7.66e+00

Pos logit contributions
 Timothy+0.042
 Karen+0.037
 Samantha+0.035
 Su+0.034
 Kitty+0.034
Neg logit contributions
 microphone-0.042
 la-0.040
opus-0.040
 player-0.040
 dancer-0.040
Token extravag
Token ID36827
Feature activation+5.71e-01

Pos logit contributions
 Timothy+0.473
 Karen+0.416
 Samantha+0.391
 Kitty+0.388
 Su+0.387
Neg logit contributions
 microphone-0.530
opus-0.518
 la-0.515
 player-0.510
 dancer-0.506
Tokenance
Token ID590
Feature activation+1.63e+00

Pos logit contributions
 Timothy+0.031
 Karen+0.027
 Samantha+0.025
 Kitty+0.025
 Su+0.025
Neg logit contributions
 microphone-0.044
 la-0.043
opus-0.043
 player-0.043
 dancer-0.042
Token.
Token ID13
Feature activation+8.36e-01

Pos logit contributions
 Timothy+0.101
 Karen+0.089
 Samantha+0.084
 Su+0.084
 Kitty+0.083
Neg logit contributions
 microphone-0.111
opus-0.110
 la-0.110
 thumb-0.107
 rook-0.105
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 Timothy+0.056
 Karen+0.048
 Samantha+0.046
 Su+0.045
 Kitty+0.045
Neg logit contributions
 microphone-0.053
 la-0.051
 player-0.051
 dancer-0.050
opus-0.050
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
 Timothy+0.085
 Karen+0.074
 Samantha+0.072
 Su+0.070
 theirs+0.070
Neg logit contributions
 microphone-0.064
opus-0.062
 la-0.061
 team-0.061
 player-0.061
Token find
Token ID1064
Feature activation+4.39e-01

Pos logit contributions
 Timothy+0.065
 Karen+0.058
 Samantha+0.055
 Su+0.055
 Bill+0.054
Neg logit contributions
 microphone-0.062
opus-0.060
 la-0.060
 player-0.059
 dancer-0.059
Token out
Token ID503
Feature activation+1.12e+00

Pos logit contributions
 Timothy+0.029
 Karen+0.026
Kim+0.025
 Samantha+0.024
 Su+0.024
Neg logit contributions
 microphone-0.029
 dancer-0.027
 player-0.026
 thumb-0.026
 brain-0.026
Token what
Token ID644
Feature activation+4.17e+00

Pos logit contributions
 Timothy+0.052
 Karen+0.043
 Samantha+0.040
 Su+0.039
 Kitty+0.039
Neg logit contributions
 microphone-0.096
 la-0.092
 dancer-0.092
 player-0.092
opus-0.090
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
 Timothy+0.282
 Karen+0.247
 Samantha+0.235
 Su+0.233
 Kitty+0.230
Neg logit contributions
 microphone-0.269
 player-0.257
 la-0.256
 dancer-0.255
opus-0.254
Token all
Token ID477
Feature activation-1.08e+00

Pos logit contributions
 Timothy+0.089
 Karen+0.076
 Samantha+0.073
 Su+0.072
Kim+0.070
Neg logit contributions
 microphone-0.084
 player-0.083
 dancer-0.082
 la-0.081
opus-0.080
Token meant
Token ID4001
Feature activation+7.66e+00

Pos logit contributions
 microphone+0.069
 player+0.066
 dancer+0.065
 la+0.065
 thumb+0.063
Neg logit contributions
 Timothy-0.074
 Karen-0.064
 Samantha-0.062
Kim-0.060
 Su-0.060
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
 Timothy+0.485
 Karen+0.427
 Samantha+0.403
 Su+0.400
 Kitty+0.398
Neg logit contributions
 microphone-0.521
opus-0.507
 la-0.507
 dancer-0.497
 player-0.496
Token
Token ID220
Feature activation-1.47e+00

Pos logit contributions
 Timothy+0.095
 Karen+0.083
 Samantha+0.079
 Kitty+0.079
 Su+0.078
Neg logit contributions
 microphone-0.082
 la-0.081
 player-0.080
opus-0.079
 dancer-0.079
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 microphone+0.078
 player+0.077
 la+0.076
opus+0.074
 team+0.073
Neg logit contributions
 Timothy-0.105
 Karen-0.093
 Samantha-0.089
 Su-0.087
 Kitty-0.086
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 Timothy+0.088
 Karen+0.078
 Samantha+0.074
 Su+0.073
 Kitty+0.073
Neg logit contributions
 microphone-0.075
opus-0.074
 la-0.073
 player-0.073
 dancer-0.073
TokenShe
Token ID3347
Feature activation+2.83e-01

Pos logit contributions
 Timothy+0.070
 Karen+0.061
 Samantha+0.058
 Su+0.057
 Kitty+0.056
Neg logit contributions
 microphone-0.087
 la-0.084
opus-0.083
 player-0.083
 thumb-0.082
Token be
Token ID307
Feature activation-1.21e-01

Pos logit contributions
 microphone+0.003
opus+0.002
 la+0.002
 player+0.002
 dancer+0.002
Neg logit contributions
 Timothy-0.003
 Karen-0.002
 Samantha-0.002
 Kitty-0.002
 Su-0.002
Token prepared
Token ID5597
Feature activation-4.25e-01

Pos logit contributions
 microphone+0.008
 la+0.008
opus+0.008
 player+0.008
 dancer+0.008
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Samantha-0.007
 Kitty-0.006
 Su-0.006
Token and
Token ID290
Feature activation+5.24e-01

Pos logit contributions
 microphone+0.030
 la+0.029
opus+0.029
 dancer+0.028
 player+0.028
Neg logit contributions
 Timothy-0.026
 Karen-0.023
 Samantha-0.022
 Su-0.021
 Kitty-0.021
Token not
Token ID407
Feature activation+5.89e-01

Pos logit contributions
 Timothy+0.031
 Karen+0.028
 Kitty+0.026
 Bill+0.026
 Samantha+0.026
Neg logit contributions
opus-0.037
 la-0.036
 microphone-0.036
 player-0.035
 dancer-0.035
Token take
Token ID1011
Feature activation-9.25e-01

Pos logit contributions
 Timothy+0.039
 Karen+0.035
 Samantha+0.033
 Kitty+0.033
 Bill+0.033
Neg logit contributions
 microphone-0.038
opus-0.038
 la-0.037
 player-0.036
 dancer-0.036
Token unnecessary
Token ID13114
Feature activation+7.56e+00

Pos logit contributions
 microphone+0.068
 la+0.065
 player+0.064
 thumb+0.064
 dancer+0.064
Neg logit contributions
 Timothy-0.055
 Karen-0.048
 Samantha-0.045
 Su-0.045
 Kitty-0.044
Token risks
Token ID7476
Feature activation+3.61e+00

Pos logit contributions
 Timothy+0.361
 Karen+0.302
 Samantha+0.281
 Kitty+0.275
 Su+0.274
Neg logit contributions
 microphone-0.630
opus-0.614
 la-0.613
 player-0.609
 dancer-0.606
Token.
Token ID13
Feature activation+3.86e-01

Pos logit contributions
 Timothy+0.193
 Karen+0.164
 about+0.164
 Bill+0.154
 Kitty+0.152
Neg logit contributions
 microphone-0.272
opus-0.271
 la-0.271
 dancer-0.263
 thumb-0.262
Token<|endoftext|>
Token ID50256
Feature activation-2.71e+00

Pos logit contributions
 Timothy+0.023
 Karen+0.020
 Samantha+0.019
 Kitty+0.019
 Su+0.019
Neg logit contributions
 microphone-0.028
 la-0.027
opus-0.027
 dancer-0.027
 player-0.027
TokenOnce
Token ID7454
Feature activation-1.79e+00

Pos logit contributions
 microphone+0.170
 player+0.166
 la+0.162
opus+0.162
 dancer+0.160
Neg logit contributions
 Timothy-0.183
 Karen-0.162
 Samantha-0.155
 Su-0.151
 Kitty-0.150
Token upon
Token ID2402
Feature activation-3.48e+00

Pos logit contributions
 microphone+0.089
 la+0.085
opus+0.083
 player+0.083
 dancer+0.082
Neg logit contributions
 Timothy-0.148
 Karen-0.134
 Samantha-0.128
 Su-0.128
 Kitty-0.127
Token the
Token ID262
Feature activation+8.11e-01

Pos logit contributions
 Timothy+0.369
 Karen+0.324
 Kitty+0.308
 Bill+0.304
 Samantha+0.304
Neg logit contributions
opus-0.453
 microphone-0.451
 la-0.439
 player-0.436
 thumb-0.431
Token project
Token ID1628
Feature activation+2.38e-01

Pos logit contributions
 Timothy+0.056
 Karen+0.050
 Kitty+0.048
 Samantha+0.048
 Bill+0.047
Neg logit contributions
 la-0.046
opus-0.046
 microphone-0.045
 rook-0.044
 bishop-0.043
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
 Timothy+0.014
 Karen+0.013
 Su+0.012
 Bill+0.012
 Samantha+0.012
Neg logit contributions
 microphone-0.017
 la-0.017
opus-0.017
 dancer-0.016
 thumb-0.016
Token hadn
Token ID8020
Feature activation+3.27e+00

Pos logit contributions
 Timothy+0.062
 Karen+0.055
 Samantha+0.052
 Su+0.052
 Kitty+0.051
Neg logit contributions
 microphone-0.070
opus-0.068
 la-0.068
 player-0.067
 thumb-0.066
Token't
Token ID470
Feature activation-8.80e-01

Pos logit contributions
 Timothy+0.286
 Karen+0.260
 Samantha+0.252
 Su+0.248
 Kitty+0.248
Neg logit contributions
 microphone-0.144
 la-0.137
opus-0.134
 player-0.133
 dancer-0.132
Token meant
Token ID4001
Feature activation+7.55e+00

Pos logit contributions
 microphone+0.060
 dancer+0.058
 la+0.058
 player+0.058
 brain+0.058
Neg logit contributions
 Timothy-0.055
 Karen-0.047
 Kitty-0.045
 Samantha-0.044
 Su-0.043
Token to
Token ID284
Feature activation+4.93e-01

Pos logit contributions
 Timothy+0.479
 Karen+0.420
 Samantha+0.400
 Su+0.392
 Kitty+0.391
Neg logit contributions
 microphone-0.513
 la-0.490
 player-0.487
 dancer-0.485
 thumb-0.483
Token make
Token ID787
Feature activation-1.72e-01

Pos logit contributions
 Timothy+0.035
 Karen+0.031
 Samantha+0.029
 Su+0.029
 Bill+0.029
Neg logit contributions
 microphone-0.031
opus-0.030
 la-0.030
 player-0.029
 dancer-0.029
Token a
Token ID257
Feature activation-6.41e-02

Pos logit contributions
 microphone+0.012
 dancer+0.011
 player+0.011
 la+0.011
 thumb+0.011
Neg logit contributions
 Timothy-0.011
 Karen-0.009
 Samantha-0.009
 Kitty-0.009
 Su-0.009
Token mess
Token ID2085
Feature activation+2.29e+00

Pos logit contributions
 la+0.005
opus+0.005
 microphone+0.004
 player+0.004
 rook+0.004
Neg logit contributions
 Timothy-0.004
 Karen-0.003
 Su-0.003
 Samantha-0.003
 Kitty-0.003
Token.
Token ID13
Feature activation+8.53e-01

Pos logit contributions
 Timothy+0.130
 Karen+0.111
 about+0.109
 Su+0.107
 Samantha+0.103
Neg logit contributions
opus-0.167
 la-0.167
 microphone-0.167
 rook-0.163
 bishop-0.162
Token thought
Token ID1807
Feature activation-8.97e-01

Pos logit contributions
 Timothy+0.027
 Karen+0.024
 Bill+0.023
 Samantha+0.023
 Kitty+0.023
Neg logit contributions
 microphone-0.031
 la-0.031
opus-0.031
 dancer-0.030
fish-0.030
Token that
Token ID326
Feature activation+3.21e-01

Pos logit contributions
 microphone+0.060
 la+0.058
 dancer+0.056
opus+0.056
 player+0.056
Neg logit contributions
 Timothy-0.057
 Karen-0.050
 Samantha-0.048
Kim-0.046
 Su-0.045
Token life
Token ID1204
Feature activation+2.11e-01

Pos logit contributions
 Timothy+0.021
 Karen+0.018
 Samantha+0.017
 Su+0.017
 Kitty+0.017
Neg logit contributions
 microphone-0.022
 la-0.021
 player-0.020
 thumb-0.020
 dancer-0.020
Token was
Token ID373
Feature activation+1.26e+00

Pos logit contributions
 Timothy+0.013
 Karen+0.012
 Samantha+0.011
 Kitty+0.011
 Su+0.011
Neg logit contributions
 microphone-0.014
 la-0.014
opus-0.014
 player-0.013
 dancer-0.013
Token just
Token ID655
Feature activation-5.69e-02

Pos logit contributions
 Timothy+0.082
 Karen+0.073
 Samantha+0.069
Kim+0.068
 Su+0.067
Neg logit contributions
 microphone-0.084
 player-0.078
 thumb-0.078
 brain-0.077
 dancer-0.076
Token meant
Token ID4001
Feature activation+7.52e+00

Pos logit contributions
 microphone+0.004
 la+0.004
 player+0.004
 thumb+0.004
 dancer+0.004
Neg logit contributions
 Timothy-0.003
 Karen-0.003
 Samantha-0.003
 Kitty-0.003
 Su-0.003
Token to
Token ID284
Feature activation+3.76e-01

Pos logit contributions
 Timothy+0.479
 Karen+0.420
 Samantha+0.399
 Su+0.392
 Kitty+0.391
Neg logit contributions
 microphone-0.511
 la-0.491
 player-0.487
opus-0.485
 dancer-0.484
Token be
Token ID307
Feature activation+5.87e-01

Pos logit contributions
 Timothy+0.024
 Karen+0.021
 Bill+0.021
 Samantha+0.020
 Su+0.020
Neg logit contributions
opus-0.026
 microphone-0.025
 la-0.025
 player-0.024
 rook-0.024
Token fun
Token ID1257
Feature activation+4.54e+00

Pos logit contributions
 Timothy+0.040
 Karen+0.036
 Samantha+0.034
 Kitty+0.033
 Su+0.033
Neg logit contributions
 microphone-0.037
 la-0.035
 player-0.035
 thumb-0.035
 dancer-0.035
Token and
Token ID290
Feature activation+8.52e-01

Pos logit contributions
 Timothy+0.257
 Karen+0.225
 about+0.222
 Su+0.212
 Bill+0.211
Neg logit contributions
opus-0.328
 la-0.321
 microphone-0.320
 rook-0.310
 thumb-0.308
Token laughing
Token ID14376
Feature activation-1.03e+00

Pos logit contributions
 Timothy+0.054
 Karen+0.048
 Samantha+0.045
 Kitty+0.045
 Su+0.045
Neg logit contributions
 microphone-0.058
opus-0.056
 la-0.056
 player-0.055
 dancer-0.055
Token be
Token ID307
Feature activation-1.39e-01

Pos logit contributions
 microphone+0.007
 la+0.007
 dancer+0.007
 thumb+0.007
 brain+0.007
Neg logit contributions
 Timothy-0.010
 Karen-0.009
 Samantha-0.009
 Su-0.008
 Kitty-0.008
Token careful
Token ID8161
Feature activation-1.48e+00

Pos logit contributions
 microphone+0.007
 brain+0.007
 player+0.007
 thumb+0.007
 dancer+0.007
Neg logit contributions
 Timothy-0.011
 Karen-0.010
 Samantha-0.009
 Su-0.009
 Bill-0.009
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
 microphone+0.102
 la+0.102
opus+0.102
 dancer+0.097
 rook+0.097
Neg logit contributions
 Timothy-0.091
 about-0.080
 Karen-0.079
 Kitty-0.075
 Samantha-0.075
Token not
Token ID407
Feature activation+4.13e-01

Pos logit contributions
 Timothy+0.013
 Karen+0.011
 Kitty+0.010
 Samantha+0.010
 Bill+0.010
Neg logit contributions
 microphone-0.016
opus-0.016
 la-0.015
 player-0.015
 dancer-0.015
Token take
Token ID1011
Feature activation-9.64e-01

Pos logit contributions
 Timothy+0.028
 Karen+0.025
 Kitty+0.024
 Bill+0.024
 Su+0.024
Neg logit contributions
 microphone-0.026
opus-0.025
 la-0.025
 player-0.024
 dancer-0.024
Token unnecessary
Token ID13114
Feature activation+7.51e+00

Pos logit contributions
 microphone+0.060
 brain+0.059
 la+0.057
 thumb+0.056
 dancer+0.055
Neg logit contributions
 Timothy-0.066
 Karen-0.057
Kim-0.056
 Su-0.055
 Samantha-0.055
Token risks
Token ID7476
Feature activation+3.60e+00

Pos logit contributions
 Timothy+0.380
 Karen+0.321
 Samantha+0.300
 Su+0.294
 Kitty+0.292
Neg logit contributions
 microphone-0.610
 la-0.592
opus-0.586
 player-0.585
 dancer-0.583
Token.
Token ID13
Feature activation-8.48e-02

Pos logit contributions
 Timothy+0.194
 Karen+0.169
 about+0.165
 Kitty+0.157
 Samantha+0.155
Neg logit contributions
 microphone-0.270
opus-0.270
 la-0.267
 thumb-0.262
 rook-0.261
Token The
Token ID383
Feature activation-8.16e-01

Pos logit contributions
 la+0.004
 player+0.004
 microphone+0.004
 brain+0.004
 thumb+0.004
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Samantha-0.006
 theirs-0.006
 Bill-0.005
Token moral
Token ID6573
Feature activation+1.46e+00

Pos logit contributions
 microphone+0.048
 la+0.046
opus+0.046
 player+0.046
 dancer+0.045
Neg logit contributions
 Timothy-0.060
 Karen-0.053
 Samantha-0.051
 Kitty-0.050
 Su-0.050
Token of
Token ID286
Feature activation+2.16e-01

Pos logit contributions
 Timothy+0.127
 Karen+0.112
 Samantha+0.109
 Su+0.107
 Wil+0.107
Neg logit contributions
 microphone-0.069
 player-0.067
 dancer-0.063
 brain-0.063
 team-0.061
Token why
Token ID1521
Feature activation+3.37e+00

Pos logit contributions
 Timothy+0.161
 Karen+0.141
 Samantha+0.133
 Kitty+0.132
 Su+0.131
Neg logit contributions
 microphone-0.190
opus-0.185
 la-0.184
 player-0.181
 dancer-0.181
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 Timothy+0.212
 Karen+0.188
 Samantha+0.179
 Su+0.178
 Kitty+0.177
Neg logit contributions
 microphone-0.233
opus-0.227
 la-0.225
 player-0.222
 thumb-0.219
Token what
Token ID644
Feature activation+3.82e+00

Pos logit contributions
 Timothy+0.075
 Karen+0.067
 Samantha+0.063
 Kitty+0.062
 Su+0.062
Neg logit contributions
 microphone-0.100
opus-0.100
 la-0.097
 player-0.097
 rook-0.096
Token it
Token ID340
Feature activation+9.77e-01

Pos logit contributions
 Timothy+0.254
 Karen+0.221
 Samantha+0.210
 Su+0.207
 Kitty+0.205
Neg logit contributions
 microphone-0.254
 player-0.242
 la-0.241
 dancer-0.241
opus-0.239
Token all
Token ID477
Feature activation-1.31e+00

Pos logit contributions
 Timothy+0.061
 Karen+0.053
 Samantha+0.050
 Su+0.049
 Bill+0.048
Neg logit contributions
 microphone-0.068
 player-0.066
 dancer-0.065
 thumb-0.065
 la-0.065
Token meant
Token ID4001
Feature activation+7.50e+00

Pos logit contributions
 microphone+0.081
 la+0.078
opus+0.078
 player+0.077
 dancer+0.077
Neg logit contributions
 Timothy-0.091
 Karen-0.081
 Samantha-0.078
 Su-0.077
 Kitty-0.076
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
 Timothy+0.477
 Karen+0.420
 Samantha+0.398
 Su+0.393
 Kitty+0.393
Neg logit contributions
 microphone-0.507
 la-0.493
opus-0.492
 player-0.483
 dancer-0.483
Token but
Token ID475
Feature activation+1.17e+00

Pos logit contributions
 microphone+0.007
 la+0.007
 player+0.007
opus+0.007
 dancer+0.007
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Samantha-0.006
 Su-0.006
 Kitty-0.006
Token her
Token ID607
Feature activation+3.59e-01

Pos logit contributions
 Timothy+0.077
 Karen+0.068
 Samantha+0.065
 Kitty+0.064
 Su+0.063
Neg logit contributions
 microphone-0.078
 la-0.075
opus-0.075
 player-0.074
 dancer-0.073
Token mom
Token ID1995
Feature activation+8.20e-01

Pos logit contributions
 Timothy+0.024
 Karen+0.022
 Samantha+0.021
 Kitty+0.020
 Bill+0.020
Neg logit contributions
 microphone-0.023
 la-0.022
opus-0.022
 dancer-0.021
 player-0.021
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
 Timothy+0.049
 Karen+0.042
 Samantha+0.040
 Kitty+0.040
 Su+0.040
Neg logit contributions
 microphone-0.057
 thumb-0.056
 player-0.055
 la-0.055
opus-0.055
Token't
Token ID470
Feature activation-1.08e+00

Pos logit contributions
 Timothy+0.131
 Karen+0.119
 Samantha+0.114
 Su+0.113
 Kitty+0.113
Neg logit contributions
 microphone-0.071
 la-0.068
opus-0.067
 player-0.067
 dancer-0.066
Token know
Token ID760
Feature activation+2.61e+00

Pos logit contributions
 microphone+0.069
opus+0.066
 la+0.066
 player+0.065
 dancer+0.064
Neg logit contributions
 Timothy-0.075
 Karen-0.066
 Samantha-0.063
 Kitty-0.063
 Su-0.063
Token what
Token ID644
Feature activation+4.00e+00

Pos logit contributions
 Timothy+0.121
 Karen+0.100
 Samantha+0.094
 Su+0.090
 Kitty+0.088
Neg logit contributions
 microphone-0.222
 la-0.216
 dancer-0.213
 player-0.213
opus-0.212
Token decor
Token ID11705
Feature activation-9.28e-01

Pos logit contributions
 Timothy+0.233
 Karen+0.203
 Samantha+0.191
 Kitty+0.189
 Su+0.188
Neg logit contributions
 microphone-0.293
 la-0.285
opus-0.283
 player-0.280
 dancer-0.280
Tokenate
Token ID378
Feature activation+1.90e+00

Pos logit contributions
 microphone+0.072
 player+0.068
 dancer+0.068
opus+0.068
 thumb+0.068
Neg logit contributions
 Timothy-0.052
 Karen-0.044
 Samantha-0.042
 Kitty-0.041
 Su-0.041
Token meant
Token ID4001
Feature activation+7.50e+00

Pos logit contributions
 Timothy+0.117
 Karen+0.103
 Samantha+0.099
 Su+0.098
 Bill+0.097
Neg logit contributions
 microphone-0.130
 la-0.128
opus-0.128
 player-0.124
 rook-0.124
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
 Timothy+0.457
 Karen+0.399
 Samantha+0.376
 Su+0.374
 Bill+0.372
Neg logit contributions
 microphone-0.526
opus-0.521
 la-0.518
 dancer-0.501
 player-0.499
Token but
Token ID475
Feature activation+1.31e+00

Pos logit contributions
 microphone+0.007
opus+0.007
 la+0.007
 player+0.007
 dancer+0.007
Neg logit contributions
 Timothy-0.006
 Karen-0.005
 Samantha-0.005
 Kitty-0.005
 Su-0.005
Token he
Token ID339
Feature activation-9.75e-01

Pos logit contributions
 Timothy+0.081
 Karen+0.071
 Samantha+0.067
 Kitty+0.066
 Bill+0.066
Neg logit contributions
 microphone-0.093
opus-0.091
 la-0.091
 player-0.089
 dancer-0.088
Token wanted
Token ID2227
Feature activation+1.26e+00

Pos logit contributions
 microphone+0.073
 la+0.071
opus+0.069
 dancer+0.068
 player+0.068
Neg logit contributions
 Timothy-0.056
 Karen-0.048
 Samantha-0.046
 Bill-0.045
 Su-0.045
Token to
Token ID284
Feature activation+6.50e-01

Pos logit contributions
 Timothy+0.092
 Karen+0.081
 Samantha+0.078
Kim+0.077
 Su+0.075
Neg logit contributions
 microphone-0.076
 thumb-0.071
 brain-0.070
 player-0.069
 dancer-0.068
Token and
Token ID290
Feature activation+3.18e-01

Pos logit contributions
 Timothy+0.034
 about+0.032
 Karen+0.029
 Bill+0.028
 Samantha+0.028
Neg logit contributions
 la-0.050
opus-0.050
 microphone-0.049
 dancer-0.048
 rook-0.047
Token friends
Token ID2460
Feature activation+1.53e+00

Pos logit contributions
 Timothy+0.020
 Karen+0.018
 Kitty+0.017
 Samantha+0.017
 Bill+0.017
Neg logit contributions
 microphone-0.021
 la-0.021
opus-0.021
 dancer-0.020
 player-0.020
Token and
Token ID290
Feature activation+3.39e-01

Pos logit contributions
 Timothy+0.076
 about+0.073
 Karen+0.064
 meant+0.062
 Bill+0.061
Neg logit contributions
opus-0.119
 la-0.118
 rook-0.115
 microphone-0.114
 bishop-0.113
Token not
Token ID407
Feature activation+4.07e-01

Pos logit contributions
 Timothy+0.020
 Karen+0.018
 Kitty+0.017
 Bill+0.017
 Samantha+0.017
Neg logit contributions
opus-0.024
 la-0.023
 microphone-0.023
 dancer-0.022
 player-0.022
Token take
Token ID1011
Feature activation-1.01e+00

Pos logit contributions
 Timothy+0.027
 Karen+0.024
 Kitty+0.024
 Bill+0.023
 Sue+0.023
Neg logit contributions
 microphone-0.026
opus-0.025
 la-0.025
 dancer-0.024
 player-0.024
Token unnecessary
Token ID13114
Feature activation+7.48e+00

Pos logit contributions
 microphone+0.068
 brain+0.065
 thumb+0.063
 team+0.062
 la+0.062
Neg logit contributions
 Timothy-0.066
 Karen-0.060
Kim-0.057
 Samantha-0.056
 Su-0.056
Token risks
Token ID7476
Feature activation+3.57e+00

Pos logit contributions
 Timothy+0.401
 Karen+0.344
 Samantha+0.323
 Su+0.317
 Kitty+0.310
Neg logit contributions
 microphone-0.591
 la-0.572
 player-0.563
 brain-0.561
 dancer-0.561
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 Timothy+0.192
 about+0.165
 Karen+0.163
 theirs+0.153
 Kitty+0.152
Neg logit contributions
 microphone-0.270
 la-0.269
opus-0.269
 thumb-0.261
 dancer-0.261
Token We
Token ID775
Feature activation-1.26e+00

Pos logit contributions
 Timothy+0.007
 Karen+0.006
 Samantha+0.006
 Su+0.006
 Bill+0.006
Neg logit contributions
 microphone-0.006
 player-0.006
 la-0.006
 thumb-0.006
 team-0.006
Token should
Token ID815
Feature activation+3.03e+00

Pos logit contributions
 la+0.100
 microphone+0.100
opus+0.099
 dancer+0.098
 player+0.097
Neg logit contributions
 Timothy-0.065
 Karen-0.055
 Kitty-0.051
 Samantha-0.051
 Bill-0.050
Token always
Token ID1464
Feature activation-1.36e+00

Pos logit contributions
 Timothy+0.175
 Karen+0.151
 Samantha+0.143
 Kitty+0.140
 Bill+0.139
Neg logit contributions
 microphone-0.226
 la-0.221
opus-0.220
 player-0.217
 dancer-0.216
Token remember
Token ID3505
Feature activation+8.88e-01

Pos logit contributions
 microphone+0.011
opus+0.011
 la+0.011
 player+0.011
 dancer+0.011
Neg logit contributions
 Timothy-0.012
 Karen-0.010
 Samantha-0.010
 Bill-0.010
 Kitty-0.010
Token how
Token ID703
Feature activation+1.50e+00

Pos logit contributions
 Timothy+0.053
 Karen+0.046
 Samantha+0.044
 Su+0.043
 Kitty+0.043
Neg logit contributions
 microphone-0.064
 la-0.062
 player-0.061
 dancer-0.061
opus-0.061
Token much
Token ID881
Feature activation+2.75e+00

Pos logit contributions
 Timothy+0.084
 Karen+0.076
 Kitty+0.072
 Samantha+0.072
 Su+0.071
Neg logit contributions
 la-0.111
 microphone-0.110
opus-0.109
 player-0.108
 rook-0.108
Token it
Token ID340
Feature activation+9.04e-01

Pos logit contributions
 Timothy+0.206
 Karen+0.182
Kim+0.174
 Su+0.174
 Samantha+0.173
Neg logit contributions
 microphone-0.155
 dancer-0.151
 player-0.146
 la-0.146
 brain-0.145
Token had
Token ID550
Feature activation+9.85e-01

Pos logit contributions
 Timothy+0.051
 Karen+0.045
 Kitty+0.042
 Samantha+0.042
 meant+0.042
Neg logit contributions
 microphone-0.067
opus-0.065
 la-0.065
 player-0.063
 rook-0.063
Token meant
Token ID4001
Feature activation+7.48e+00

Pos logit contributions
 Timothy+0.060
 Karen+0.052
 Samantha+0.049
 Su+0.048
 Kitty+0.048
Neg logit contributions
 microphone-0.070
 la-0.067
 player-0.067
 dancer-0.067
 thumb-0.066
Token to
Token ID284
Feature activation+5.08e-01

Pos logit contributions
 Timothy+0.451
 Karen+0.397
 Samantha+0.376
 Kitty+0.371
 Su+0.370
Neg logit contributions
opus-0.523
 microphone-0.523
 la-0.516
 player-0.504
 dancer-0.502
Token her
Token ID607
Feature activation+3.01e-01

Pos logit contributions
 Timothy+0.032
 Karen+0.029
 Bill+0.028
 Samantha+0.027
 Kitty+0.027
Neg logit contributions
opus-0.036
 microphone-0.034
 la-0.034
 player-0.033
 rook-0.033
Token.
Token ID13
Feature activation+8.38e-01

Pos logit contributions
 Timothy+0.019
 Karen+0.018
 Kitty+0.017
 Samantha+0.017
 Bill+0.017
Neg logit contributions
opus-0.019
 la-0.019
 microphone-0.018
 player-0.018
 rook-0.018
Token From
Token ID3574
Feature activation+1.53e+00

Pos logit contributions
 Timothy+0.055
 Karen+0.048
 Samantha+0.046
 Su+0.045
 Kitty+0.045
Neg logit contributions
 microphone-0.055
 la-0.053
 player-0.053
opus-0.052
 dancer-0.052
Token then
Token ID788
Feature activation+1.91e+00

Pos logit contributions
 Timothy+0.102
 Karen+0.088
 Su+0.086
Kim+0.085
 Samantha+0.085
Neg logit contributions
 microphone-0.097
 dancer-0.092
 la-0.091
opus-0.091
 player-0.090
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
 Timothy+0.098
 Karen+0.087
 Samantha+0.084
 Su+0.081
 Bill+0.081
Neg logit contributions
 microphone-0.095
opus-0.092
 la-0.092
 player-0.092
 dancer-0.090
TokenB
Token ID33
Feature activation+6.77e-01

Pos logit contributions
 Timothy+0.095
 Karen+0.079
Kim+0.079
 Kitty+0.078
 Su+0.076
Neg logit contributions
 microphone-0.096
 la-0.091
 dancer-0.090
 rook-0.087
 player-0.087
Tokenenny
Token ID11870
Feature activation-1.03e+00

Pos logit contributions
 Timothy+0.060
 Karen+0.054
 Su+0.052
 Samantha+0.052
 Kitty+0.052
Neg logit contributions
 microphone-0.031
 la-0.029
 dancer-0.028
opus-0.028
 player-0.027
Token knew
Token ID2993
Feature activation-1.03e-01

Pos logit contributions
 microphone+0.074
 la+0.071
 player+0.070
 dancer+0.070
opus+0.070
Neg logit contributions
 Timothy-0.061
 Karen-0.054
 Kitty-0.051
 Samantha-0.051
 Bill-0.051
Token that
Token ID326
Feature activation+5.15e-01

Pos logit contributions
 microphone+0.005
 thumb+0.005
 dancer+0.005
opus+0.005
 brain+0.005
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Samantha-0.007
Kim-0.007
ained-0.007
Token meant
Token ID4001
Feature activation+7.47e+00

Pos logit contributions
 Timothy+0.035
 Karen+0.031
 Samantha+0.029
 Su+0.029
 Kitty+0.029
Neg logit contributions
 microphone-0.033
 la-0.032
 player-0.032
 dancer-0.031
 thumb-0.031
Token it
Token ID340
Feature activation+9.07e-01

Pos logit contributions
 Timothy+0.477
 Karen+0.416
 Samantha+0.397
 Su+0.388
 Kitty+0.382
Neg logit contributions
 microphone-0.512
 la-0.487
 player-0.484
 dancer-0.483
 thumb-0.481
Token might
Token ID1244
Feature activation+1.54e+00

Pos logit contributions
 Timothy+0.066
 Karen+0.058
 Samantha+0.058
 Su+0.056
Kim+0.054
Neg logit contributions
 microphone-0.054
 thumb-0.053
 player-0.053
 brain-0.052
 dancer-0.049
Token rain
Token ID6290
Feature activation+2.00e+00

Pos logit contributions
 Timothy+0.118
 Karen+0.106
 Samantha+0.102
 Su+0.100
 Kitty+0.100
Neg logit contributions
 microphone-0.085
 la-0.081
 dancer-0.080
 player-0.080
 thumb-0.079
Token soon
Token ID2582
Feature activation+1.04e+00

Pos logit contributions
 Timothy+0.146
 Samantha+0.128
 Karen+0.126
 Su+0.119
 Bill+0.119
Neg logit contributions
 microphone-0.117
 player-0.112
 la-0.111
 team-0.111
 dancer-0.110
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 Timothy+0.078
 Karen+0.071
 Samantha+0.068
 Su+0.065
Kim+0.065
Neg logit contributions
 microphone-0.059
 thumb-0.056
 brain-0.055
 player-0.055
 dancer-0.055
Token more
Token ID517
Feature activation+3.14e+00

Pos logit contributions
 player+0.009
 microphone+0.009
 brain+0.009
 team+0.008
 dancer+0.008
Neg logit contributions
 Timothy-0.019
 Karen-0.017
 Su-0.016
 Samantha-0.016
 Wil-0.016
Token cautious
Token ID21205
Feature activation-1.30e+00

Pos logit contributions
 Timothy+0.262
 Karen+0.240
 Su+0.227
 Samantha+0.226
 Kitty+0.222
Neg logit contributions
 brain-0.156
 microphone-0.153
 player-0.152
 dancer-0.151
 thumb-0.151
Token and
Token ID290
Feature activation+4.13e-01

Pos logit contributions
 microphone+0.089
 la+0.086
opus+0.086
 dancer+0.084
 thumb+0.083
Neg logit contributions
 Timothy-0.083
 Karen-0.072
 Kitty-0.068
 Samantha-0.068
 about-0.068
Token not
Token ID407
Feature activation+4.31e-01

Pos logit contributions
 Timothy+0.025
 Karen+0.022
 Samantha+0.021
 Su+0.020
 Kitty+0.020
Neg logit contributions
 microphone-0.029
 la-0.028
opus-0.028
 player-0.028
 dancer-0.028
Token take
Token ID1011
Feature activation-1.01e+00

Pos logit contributions
 Timothy+0.026
 Karen+0.023
 Kitty+0.022
 Samantha+0.022
 Bill+0.022
Neg logit contributions
 microphone-0.030
opus-0.030
 la-0.029
 player-0.029
 dancer-0.029
Token unnecessary
Token ID13114
Feature activation+7.47e+00

Pos logit contributions
 brain+0.067
 microphone+0.067
 thumb+0.063
 player+0.063
 team+0.063
Neg logit contributions
 Timothy-0.065
 Karen-0.058
Kim-0.056
 Samantha-0.056
 Su-0.055
Token risks
Token ID7476
Feature activation+3.54e+00

Pos logit contributions
 Timothy+0.376
 Karen+0.318
 Samantha+0.297
 Su+0.291
 Kitty+0.287
Neg logit contributions
 microphone-0.610
 la-0.592
 player-0.585
opus-0.584
 dancer-0.582
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 Timothy+0.207
 Karen+0.179
 about+0.168
 Kitty+0.168
 Samantha+0.166
Neg logit contributions
 microphone-0.256
 la-0.252
opus-0.251
 dancer-0.246
 thumb-0.245
Token The
Token ID383
Feature activation-7.21e-01

Pos logit contributions
 Timothy+0.015
 Karen+0.014
 Samantha+0.014
 theirs+0.013
 meant+0.012
Neg logit contributions
 player-0.008
 team-0.008
 brain-0.008
 thumb-0.008
 la-0.008
Token memory
Token ID4088
Feature activation+7.42e-03

Pos logit contributions
 microphone+0.042
 la+0.040
opus+0.040
 player+0.040
 dancer+0.039
Neg logit contributions
 Timothy-0.054
 Karen-0.048
 Samantha-0.046
 Su-0.046
 Kitty-0.045
Token of
Token ID286
Feature activation+2.15e-01

Pos logit contributions
 Timothy+0.000
 Karen+0.000
 Samantha+0.000
 Kitty+0.000
 Su+0.000
Neg logit contributions
 microphone-0.000
 la-0.000
opus-0.000
 thumb-0.000
 dancer-0.000
Token
Token ID26391
Feature activation+2.92e+00

Pos logit contributions
 microphone+0.102
 la+0.098
opus+0.098
 player+0.097
 dancer+0.097
Neg logit contributions
 Timothy-0.090
 Karen-0.078
 Samantha-0.074
 Su-0.073
 Kitty-0.072
Token
Token ID129
Feature activation+4.52e-01

Pos logit contributions
 Timothy+0.183
 Karen+0.161
 Samantha+0.153
 Su+0.151
 Bill+0.151
Neg logit contributions
 microphone-0.206
 dancer-0.195
 player-0.194
 thumb-0.193
 la-0.193
Token
Token ID241
Feature activation+1.01e+00

Pos logit contributions
 Timothy+0.029
 Karen+0.026
 Samantha+0.024
 Kitty+0.024
 Su+0.024
Neg logit contributions
 microphone-0.029
 la-0.029
opus-0.028
 player-0.028
 dancer-0.028
TokenIt
Token ID1026
Feature activation+9.49e-01

Pos logit contributions
 Timothy+0.067
 Karen+0.059
 Samantha+0.056
 Su+0.055
 Kitty+0.055
Neg logit contributions
 microphone-0.067
 la-0.064
opus-0.064
 player-0.064
 dancer-0.063
Token was
Token ID373
Feature activation+1.42e+00

Pos logit contributions
 Timothy+0.055
 Karen+0.049
 meant+0.047
 Kitty+0.047
 about+0.045
Neg logit contributions
 microphone-0.068
 rook-0.067
opus-0.067
 la-0.065
 bishop-0.065
Token meant
Token ID4001
Feature activation+7.45e+00

Pos logit contributions
 Timothy+0.088
 Karen+0.078
 Samantha+0.073
 Kitty+0.071
 Su+0.071
Neg logit contributions
 microphone-0.094
 la-0.093
 player-0.091
 dancer-0.090
 thumb-0.089
Token for
Token ID329
Feature activation+1.31e+00

Pos logit contributions
 Timothy+0.509
 Karen+0.452
 Samantha+0.429
 Su+0.426
 Kitty+0.424
Neg logit contributions
 microphone-0.474
 la-0.457
opus-0.456
 player-0.450
 dancer-0.447
Token a
Token ID257
Feature activation-2.72e-01

Pos logit contributions
 Timothy+0.086
 Karen+0.076
 Samantha+0.072
 Su+0.071
 Kitty+0.071
Neg logit contributions
 microphone-0.087
 la-0.084
opus-0.084
 player-0.083
 dancer-0.082
Token princess
Token ID21752
Feature activation-6.89e-01

Pos logit contributions
 microphone+0.015
opus+0.015
 bishop+0.015
mine+0.014
ater+0.014
Neg logit contributions
 Timothy-0.020
 Karen-0.018
 Kitty-0.017
 Su-0.017
 Samantha-0.017
Token.
Token ID13
Feature activation+7.49e-01

Pos logit contributions
 microphone+0.049
opus+0.049
 rook+0.048
 thumb+0.047
 la+0.047
Neg logit contributions
 Timothy-0.039
 Karen-0.035
 Kitty-0.033
 about-0.033
 Samantha-0.033
Tokenâ
Token ID22940
Feature activation-1.10e+00

Pos logit contributions
 Timothy+0.051
 Karen+0.045
 Samantha+0.043
 Su+0.042
 Kitty+0.042
Neg logit contributions
 microphone-0.049
opus-0.047
 la-0.046
 player-0.046
 dancer-0.046
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
 microphone+0.229
 la+0.221
opus+0.220
 player+0.218
 dancer+0.217
Neg logit contributions
 Timothy-0.224
 Karen-0.198
 Samantha-0.188
 Kitty-0.185
 Su-0.185
Token The
Token ID383
Feature activation-4.07e-01

Pos logit contributions
 Timothy+0.022
 Karen+0.020
 Kitty+0.020
 Bill+0.019
 Sue+0.019
Neg logit contributions
 microphone-0.018
opus-0.017
 thumb-0.016
 dancer-0.016
fish-0.016
Token more
Token ID517
Feature activation+3.28e+00

Pos logit contributions
opus+0.022
 la+0.021
mine+0.020
ater+0.020
 microphone+0.020
Neg logit contributions
 Timothy-0.029
 Karen-0.026
 Kitty-0.025
 Bill-0.025
 Su-0.025
Token,
Token ID11
Feature activation-5.72e-01

Pos logit contributions
 Timothy+0.206
 Karen+0.193
 Sue+0.188
 Kitty+0.184
 Su+0.180
Neg logit contributions
opus-0.211
 microphone-0.206
 la-0.202
 thumb-0.202
 rook-0.198
Token the
Token ID262
Feature activation+3.11e-01

Pos logit contributions
opus+0.038
 microphone+0.036
 thumb+0.035
 la+0.035
 player+0.034
Neg logit contributions
 Timothy-0.038
 Karen-0.034
 Kitty-0.033
 Su-0.033
 Sue-0.032
Token mer
Token ID4017
Feature activation-6.86e+00

Pos logit contributions
 Timothy+0.021
 Karen+0.020
 Kitty+0.019
 Sue+0.019
 Su+0.018
Neg logit contributions
opus-0.017
 la-0.017
mine-0.016
 microphone-0.016
ater-0.016
Tokenrier
Token ID5277
Feature activation-2.16e+00

Pos logit contributions
 thumb+0.299
nam+0.289
Ol+0.276
 brain+0.276
 microphone+0.275
Neg logit contributions
 about-0.507
 Timothy-0.489
 theirs-0.479
 Bill-0.453
 Karen-0.452
Token.
Token ID13
Feature activation+3.24e-02

Pos logit contributions
 microphone+0.152
opus+0.151
 la+0.149
 rook+0.146
 thumb+0.146
Neg logit contributions
 Timothy-0.122
 Karen-0.108
 about-0.106
 Bill-0.105
 Sue-0.104
Token Come
Token ID7911
Feature activation+7.47e-01

Pos logit contributions
 Timothy+0.002
 Karen+0.002
 Bill+0.002
 Su+0.002
 Kitty+0.002
Neg logit contributions
 microphone-0.002
opus-0.002
 thumb-0.002
 guitar-0.002
 dancer-0.002
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
 Timothy+0.055
 Karen+0.049
 Samantha+0.047
 Su+0.046
 Kitty+0.046
Neg logit contributions
 microphone-0.044
 la-0.043
 player-0.042
opus-0.042
 dancer-0.042
Token my
Token ID616
Feature activation-1.88e+00

Pos logit contributions
 microphone+0.001
opus+0.001
 la+0.001
 player+0.001
 thumb+0.001
Neg logit contributions
 Timothy-0.001
 Karen-0.001
 Kitty-0.001
 Bill-0.001
 Su-0.001
Token passport
Token ID17981
Feature activation-7.85e-01

Pos logit contributions
 la+0.040
opus+0.039
mine+0.039
ater+0.038
 microphone+0.037
Neg logit contributions
 Timothy-0.058
 Karen-0.053
 Kitty-0.051
 Sue-0.050
 Bill-0.050
Token belonged
Token ID19611
Feature activation+3.53e+00

Pos logit contributions
opus+0.057
 thumb+0.057
 la+0.057
 microphone+0.057
 rook+0.057
Neg logit contributions
 Timothy-0.039
 Karen-0.036
 about-0.035
 meant-0.035
 Sue-0.034
Token to
Token ID284
Feature activation+1.82e-01

Pos logit contributions
 Timothy+0.229
 Karen+0.201
 Samantha+0.192
 Su+0.187
 Kitty+0.187
Neg logit contributions
 microphone-0.237
 la-0.228
 player-0.226
opus-0.225
 dancer-0.224
Token a
Token ID257
Feature activation-4.43e-01

Pos logit contributions
 Timothy+0.011
 Karen+0.010
 Samantha+0.009
 Su+0.009
 Kitty+0.009
Neg logit contributions
 microphone-0.013
 la-0.012
 player-0.012
opus-0.012
 dancer-0.012
Token magical
Token ID10883
Feature activation+8.03e-01

Pos logit contributions
mine+0.028
opus+0.027
 la+0.027
 thumb+0.026
acy+0.026
Neg logit contributions
 Timothy-0.027
 Karen-0.025
 Kitty-0.023
 Samantha-0.023
 Su-0.023
Token mer
Token ID4017
Feature activation-6.40e+00

Pos logit contributions
 Timothy+0.040
 Karen+0.036
 about+0.035
 Samantha+0.034
 Su+0.033
Neg logit contributions
 la-0.060
opus-0.058
mine-0.057
 bishop-0.057
 rook-0.057
Tokenmaid
Token ID23151
Feature activation-2.59e+00

Pos logit contributions
nam+0.276
 rook+0.271
Ol+0.269
 thumb+0.267
 la+0.267
Neg logit contributions
 about-0.466
 Timothy-0.451
 theirs-0.431
 exactly-0.420
 Karen-0.417
Token who
Token ID508
Feature activation-1.36e+00

Pos logit contributions
opus+0.186
 rook+0.186
 microphone+0.184
 brain+0.184
 la+0.184
Neg logit contributions
 Timothy-0.133
 about-0.122
 Karen-0.121
 Su-0.114
 Kitty-0.114
Token had
Token ID550
Feature activation+4.84e-01

Pos logit contributions
 microphone+0.103
 la+0.101
opus+0.101
 thumb+0.100
 rook+0.099
Neg logit contributions
 Timothy-0.072
 Karen-0.064
 meant-0.061
 Bill-0.061
 Su-0.059
Token been
Token ID587
Feature activation+1.20e+00

Pos logit contributions
 Timothy+0.028
 Karen+0.024
 Samantha+0.023
 Su+0.023
 Kitty+0.023
Neg logit contributions
 microphone-0.036
 la-0.034
 player-0.034
 dancer-0.034
opus-0.034
Token visiting
Token ID10013
Feature activation+4.62e-01

Pos logit contributions
 Timothy+0.071
 Karen+0.062
 Samantha+0.059
 Su+0.058
 Bill+0.058
Neg logit contributions
opus-0.086
 microphone-0.086
 la-0.085
 player-0.084
 dancer-0.083
Token many
Token ID867
Feature activation+7.19e-01

Pos logit contributions
 Timothy+0.049
 Karen+0.042
Kim+0.041
 Su+0.040
ibi+0.040
Neg logit contributions
 microphone-0.044
 audience-0.042
 team-0.041
 guitar-0.041
 thumb-0.041
Token more
Token ID517
Feature activation+3.73e+00

Pos logit contributions
 Timothy+0.039
 Karen+0.033
 Samantha+0.032
Kim+0.031
 Su+0.030
Neg logit contributions
 microphone-0.056
 brain-0.054
 la-0.054
 dancer-0.053
 team-0.053
Token adventures
Token ID17545
Feature activation+2.77e+00

Pos logit contributions
 Timothy+0.181
 Karen+0.152
 Samantha+0.141
 Su+0.138
 Kitty+0.138
Neg logit contributions
 microphone-0.311
 la-0.302
opus-0.299
 player-0.299
 dancer-0.299
Token with
Token ID351
Feature activation-1.96e+00

Pos logit contributions
 Timothy+0.184
 Karen+0.163
 Samantha+0.154
 Kitty+0.152
 Su+0.152
Neg logit contributions
 microphone-0.182
 la-0.177
opus-0.176
 dancer-0.173
 player-0.172
Token the
Token ID262
Feature activation+8.02e-01

Pos logit contributions
 microphone+0.132
 la+0.126
 player+0.125
 dancer+0.125
 thumb+0.124
Neg logit contributions
 Timothy-0.124
 Karen-0.109
 Samantha-0.103
 Su-0.102
 Kitty-0.100
Token mer
Token ID4017
Feature activation-6.30e+00

Pos logit contributions
 Timothy+0.052
 Karen+0.046
 Samantha+0.043
 Bill+0.043
 Kitty+0.043
Neg logit contributions
opus-0.052
 la-0.051
 microphone-0.051
 player-0.049
 dancer-0.049
Tokenmaid
Token ID23151
Feature activation-2.59e+00

Pos logit contributions
 la+0.262
nam+0.257
 thumb+0.254
 microphone+0.254
 player+0.251
Neg logit contributions
 Timothy-0.470
 about-0.460
 theirs-0.435
 Karen-0.427
 Bill-0.425
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
 microphone+0.172
 la+0.172
opus+0.170
 rook+0.168
 thumb+0.165
Neg logit contributions
 Timothy-0.158
 Karen-0.143
 Samantha-0.136
 Kitty-0.135
 about-0.135
Token
Token ID198
Feature activation+8.03e-01

Pos logit contributions
 Timothy+0.042
 Karen+0.036
 Samantha+0.034
 Su+0.034
 Kitty+0.034
Neg logit contributions
 microphone-0.047
 la-0.046
 player-0.045
opus-0.045
 dancer-0.045
Token
Token ID198
Feature activation+6.75e-01

Pos logit contributions
 Timothy+0.054
 Karen+0.048
 Samantha+0.046
 Su+0.045
 Bill+0.044
Neg logit contributions
 microphone-0.049
opus-0.048
 la-0.048
 player-0.047
 dancer-0.047
TokenThe
Token ID464
Feature activation-4.43e-01

Pos logit contributions
 Timothy+0.041
 Karen+0.036
 Samantha+0.034
 Kitty+0.033
 Su+0.033
Neg logit contributions
 microphone-0.050
 thumb-0.047
 la-0.047
 brain-0.047
 player-0.046
Token was
Token ID373
Feature activation+8.53e-01

Pos logit contributions
 microphone+0.037
 la+0.036
opus+0.036
 player+0.036
 dancer+0.036
Neg logit contributions
 Timothy-0.034
 Karen-0.030
 Samantha-0.029
 Su-0.028
 Kitty-0.028
Token so
Token ID523
Feature activation+1.37e-01

Pos logit contributions
 Timothy+0.058
 Karen+0.052
 Samantha+0.049
 Su+0.049
 Kitty+0.048
Neg logit contributions
 microphone-0.055
opus-0.054
 la-0.054
 player-0.052
 dancer-0.052
Token thankful
Token ID25535
Feature activation-1.96e+00

Pos logit contributions
 Timothy+0.008
 Karen+0.007
 Su+0.007
 Samantha+0.007
 Sue+0.007
Neg logit contributions
 microphone-0.010
opus-0.010
 rook-0.010
 la-0.010
 bishop-0.010
Token that
Token ID326
Feature activation-2.53e-01

Pos logit contributions
 microphone+0.138
 la+0.133
 player+0.132
opus+0.131
 dancer+0.131
Neg logit contributions
 Timothy-0.121
 Karen-0.106
 Samantha-0.100
 Su-0.098
 Kitty-0.098
Token the
Token ID262
Feature activation+2.52e-01

Pos logit contributions
 microphone+0.016
 la+0.015
opus+0.015
 player+0.015
 dancer+0.015
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Samantha-0.015
 Su-0.015
 Kitty-0.015
Token mer
Token ID4017
Feature activation-6.29e+00

Pos logit contributions
 Timothy+0.019
 Karen+0.017
 Sue+0.016
 Samantha+0.016
 Bill+0.016
Neg logit contributions
 la-0.013
opus-0.012
 microphone-0.012
mine-0.012
 thumb-0.012
Tokenmaid
Token ID23151
Feature activation-2.76e+00

Pos logit contributions
 la+0.252
 thumb+0.252
 microphone+0.245
nam+0.238
 brain+0.234
Neg logit contributions
 Timothy-0.497
 about-0.468
 theirs-0.454
 Karen-0.443
 Bill-0.438
Token had
Token ID550
Feature activation+3.34e-01

Pos logit contributions
 la+0.203
 microphone+0.202
opus+0.199
 rook+0.199
 bishop+0.195
Neg logit contributions
 Timothy-0.146
 Karen-0.129
 meant-0.125
 about-0.123
 Kitty-0.120
Token forgiven
Token ID32350
Feature activation+2.11e+00

Pos logit contributions
 Timothy+0.021
 Karen+0.018
 Kitty+0.017
 Su+0.017
 Samantha+0.017
Neg logit contributions
 microphone-0.023
 player-0.022
 thumb-0.022
 team-0.022
 la-0.022
Token him
Token ID683
Feature activation-4.85e-01

Pos logit contributions
 Timothy+0.129
 Karen+0.112
 Samantha+0.106
 Su+0.104
 Kitty+0.104
Neg logit contributions
 microphone-0.150
 la-0.145
 player-0.143
opus-0.143
 dancer-0.142
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
opus+0.037
 microphone+0.036
 la+0.036
 dancer+0.035
 rook+0.034
Neg logit contributions
 Timothy-0.026
 Karen-0.023
 about-0.023
 Bill-0.021
 Kitty-0.021
Token was
Token ID373
Feature activation+5.26e-01

Pos logit contributions
 la+0.225
 microphone+0.220
opus+0.216
 dancer+0.213
 rook+0.212
Neg logit contributions
 Timothy-0.143
 Karen-0.120
 Bill-0.114
 about-0.114
 meant-0.114
Token happy
Token ID3772
Feature activation-1.26e+00

Pos logit contributions
 Timothy+0.037
 Karen+0.034
 Samantha+0.032
 Su+0.032
 Kitty+0.031
Neg logit contributions
 microphone-0.031
 thumb-0.029
 player-0.029
 dancer-0.029
 la-0.029
Token and
Token ID290
Feature activation+2.27e-01

Pos logit contributions
 microphone+0.081
 player+0.078
 la+0.077
 dancer+0.077
opus+0.077
Neg logit contributions
 Timothy-0.085
 Karen-0.075
 Samantha-0.072
 Su-0.071
 Kitty-0.070
Token sailed
Token ID33835
Feature activation-1.33e+00

Pos logit contributions
 Timothy+0.015
 Karen+0.013
 Samantha+0.013
 Su+0.013
 Kitty+0.013
Neg logit contributions
 microphone-0.015
 la-0.014
 player-0.014
opus-0.014
 dancer-0.014
Token away
Token ID1497
Feature activation-1.52e+00

Pos logit contributions
 microphone+0.076
 player+0.073
 la+0.072
 dancer+0.072
 team+0.071
Neg logit contributions
 Timothy-0.099
 Karen-0.089
 Samantha-0.085
 Su-0.084
 Kitty-0.084
Token mer
Token ID4017
Feature activation-6.22e+00

Pos logit contributions
 microphone+0.104
 la+0.102
opus+0.102
 player+0.099
 thumb+0.099
Neg logit contributions
 Timothy-0.096
 Karen-0.085
 Samantha-0.080
 Su-0.079
 Kitty-0.079
Tokenrily
Token ID28904
Feature activation-2.60e+00

Pos logit contributions
 microphone+0.282
 la+0.279
opus+0.268
 thumb+0.267
 brain+0.257
Neg logit contributions
 Timothy-0.512
 Karen-0.462
 about-0.460
 theirs-0.457
 Bill-0.449
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
 microphone+0.178
opus+0.178
 la+0.176
 thumb+0.170
 dancer+0.170
Neg logit contributions
 Timothy-0.159
 Karen-0.139
 Samantha-0.132
 Kitty-0.131
 Bill-0.130
Token<|endoftext|>
Token ID50256
Feature activation-2.98e+00

Pos logit contributions
 Timothy+0.001
 Karen+0.001
 Samantha+0.001
 theirs+0.001
 Kitty+0.001
Neg logit contributions
 player-0.001
 microphone-0.001
 la-0.001
 team-0.001
 dancer-0.001
TokenOnce
Token ID7454
Feature activation-2.05e+00

Pos logit contributions
 microphone+0.181
 player+0.178
opus+0.171
 dancer+0.170
 team+0.170
Neg logit contributions
 Timothy-0.208
 Karen-0.184
 Samantha-0.177
 Su-0.173
 theirs-0.170
Token there
Token ID612
Feature activation+1.06e+00

Pos logit contributions
 microphone+0.101
 la+0.096
opus+0.094
 player+0.094
 dancer+0.093
Neg logit contributions
 Timothy-0.170
 Karen-0.154
 Samantha-0.148
 Su-0.147
 Kitty-0.147
Token man
Token ID582
Feature activation-1.70e+00

Pos logit contributions
 la+0.059
opus+0.058
mine+0.057
 microphone+0.055
ater+0.055
Neg logit contributions
 Timothy-0.080
 Karen-0.071
 Kitty-0.069
 Sue-0.068
 Su-0.067
Token watched
Token ID7342
Feature activation-1.32e+00

Pos logit contributions
 microphone+0.121
 la+0.118
opus+0.118
 brain+0.116
 thumb+0.115
Neg logit contributions
 Timothy-0.098
 Karen-0.086
 Kitty-0.081
 Su-0.080
 Bill-0.080
Token as
Token ID355
Feature activation+7.37e-01

Pos logit contributions
 microphone+0.109
opus+0.109
 thumb+0.105
 la+0.104
 player+0.104
Neg logit contributions
 Timothy-0.067
 Karen-0.056
 Su-0.054
 Kitty-0.053
 Bill-0.053
Token she
Token ID673
Feature activation-5.00e-01

Pos logit contributions
 Timothy+0.045
 Karen+0.039
 Su+0.037
 Kitty+0.037
 Samantha+0.037
Neg logit contributions
 microphone-0.053
opus-0.052
 la-0.052
 thumb-0.050
 player-0.050
Token skipped
Token ID26684
Feature activation-1.69e-01

Pos logit contributions
 microphone+0.034
 la+0.033
opus+0.033
 player+0.033
 dancer+0.033
Neg logit contributions
 Timothy-0.031
 Karen-0.028
 Samantha-0.026
 Su-0.026
 Kitty-0.026
Token mer
Token ID4017
Feature activation-6.18e+00

Pos logit contributions
opus+0.011
 microphone+0.011
 thumb+0.011
 la+0.011
 brain+0.011
Neg logit contributions
 Timothy-0.011
 Karen-0.010
 Su-0.009
 Kitty-0.009
 Samantha-0.009
Tokenrily
Token ID28904
Feature activation-2.17e+00

Pos logit contributions
 microphone+0.314
opus+0.304
 thumb+0.300
 la+0.300
 brain+0.299
Neg logit contributions
 Timothy-0.475
 Karen-0.427
 theirs-0.425
 about-0.418
 Bill-0.412
Token away
Token ID1497
Feature activation-9.69e-01

Pos logit contributions
opus+0.168
 microphone+0.158
 la+0.157
 thumb+0.156
 dancer+0.154
Neg logit contributions
 Timothy-0.112
 about-0.101
 Karen-0.097
 Bill-0.096
 Kitty-0.093
Token into
Token ID656
Feature activation-5.12e-01

Pos logit contributions
opus+0.072
 la+0.070
 microphone+0.070
 thumb+0.068
 player+0.066
Neg logit contributions
 Timothy-0.056
 Karen-0.049
 Kitty-0.046
 Su-0.046
 Bill-0.046
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
opus+0.043
 la+0.040
 microphone+0.040
 thumb+0.039
 player+0.038
Neg logit contributions
 Timothy-0.026
 Karen-0.023
 Su-0.022
 Sue-0.022
 Kitty-0.022
Token sunset
Token ID26428
Feature activation+1.95e+00

Pos logit contributions
 Timothy+0.027
 Karen+0.024
 Samantha+0.023
 Kitty+0.023
 Sue+0.023
Neg logit contributions
opus-0.029
 la-0.029
mine-0.028
 bishop-0.027
acy-0.027
Token too
Token ID1165
Feature activation+3.64e-01

Pos logit contributions
 thumb+0.033
 microphone+0.032
 brain+0.032
 player+0.031
 spoon+0.030
Neg logit contributions
 Timothy-0.067
 Karen-0.062
 Samantha-0.060
 Su-0.059
 Bill-0.058
Token late
Token ID2739
Feature activation+1.62e-01

Pos logit contributions
 Timothy+0.024
 Karen+0.022
 Samantha+0.020
 Su+0.020
 Bill+0.019
Neg logit contributions
 microphone-0.024
 thumb-0.023
 player-0.023
opus-0.023
 team-0.023
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
 Timothy+0.012
 Karen+0.010
 Samantha+0.010
 Su+0.010
 Kitty+0.010
Neg logit contributions
 microphone-0.010
 la-0.009
 player-0.009
 dancer-0.009
 thumb-0.009
Token The
Token ID383
Feature activation-1.22e+00

Pos logit contributions
 microphone+0.051
opus+0.049
 la+0.048
 dancer+0.048
 player+0.048
Neg logit contributions
 Timothy-0.048
 Karen-0.042
 Samantha-0.040
 Kitty-0.040
 Su-0.040
Token star
Token ID3491
Feature activation-3.43e+00

Pos logit contributions
 la+0.068
cient+0.064
mine+0.062
years+0.061
chie+0.058
Neg logit contributions
 lesson-0.076
 fun-0.076
 happened-0.070
 meant-0.070
 damage-0.070
Tokenfish
Token ID11084
Feature activation-6.14e+00

Pos logit contributions
nam+0.206
 la+0.179
 rook+0.173
 tut+0.171
chie+0.170
Neg logit contributions
 meant-0.217
 about-0.211
 happened-0.208
 Timothy-0.198
 Sue-0.196
Token poison
Token ID8764
Feature activation+2.17e+00

Pos logit contributions
nam+0.458
 dancer+0.443
 la+0.443
opus+0.440
 rook+0.431
Neg logit contributions
 Timothy-0.288
 meant-0.277
 about-0.265
 Kitty-0.252
 resolved-0.248
Token is
Token ID318
Feature activation-7.74e-01

Pos logit contributions
 Timothy+0.135
 Karen+0.118
 Kitty+0.114
 Samantha+0.113
 Su+0.111
Neg logit contributions
 microphone-0.143
 la-0.142
opus-0.141
 dancer-0.139
 thumb-0.137
Token strong
Token ID1913
Feature activation+1.68e-01

Pos logit contributions
 microphone+0.049
 thumb+0.047
 finger+0.045
 player+0.045
 brain+0.045
Neg logit contributions
 Timothy-0.055
 Karen-0.050
 Samantha-0.047
 Su-0.046
 Bill-0.045
Token.
Token ID13
Feature activation-6.95e-01

Pos logit contributions
 Timothy+0.012
 Karen+0.010
 Samantha+0.010
 Su+0.010
 Kitty+0.010
Neg logit contributions
 microphone-0.011
 player-0.010
 la-0.010
 thumb-0.010
opus-0.010
Token Tom
Token ID4186
Feature activation+8.28e-01

Pos logit contributions
 microphone+0.043
 la+0.042
 player+0.041
opus+0.041
 dancer+0.041
Neg logit contributions
 Timothy-0.048
 Karen-0.043
 Samantha-0.041
 Su-0.040
 Kitty-0.040
Token!
Token ID0
Feature activation-1.05e+00

Pos logit contributions
 microphone+0.142
 la+0.135
opus+0.133
 player+0.133
 dancer+0.133
Neg logit contributions
 Timothy-0.226
 Karen-0.204
 Samantha-0.196
 Su-0.193
 Bill-0.192
Token Ben
Token ID3932
Feature activation-2.77e+00

Pos logit contributions
 microphone+0.063
fish+0.060
 dancer+0.060
 rook+0.059
 la+0.059
Neg logit contributions
 Timothy-0.080
 Kitty-0.072
 Kim-0.071
 Karen-0.070
 Su-0.069
Token touched
Token ID12615
Feature activation-4.60e-01

Pos logit contributions
 microphone+0.185
 la+0.180
 dancer+0.177
opus+0.176
 player+0.175
Neg logit contributions
 Timothy-0.178
 Karen-0.156
 Kitty-0.149
 Samantha-0.149
 Su-0.146
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
 microphone+0.025
 la+0.024
opus+0.024
 player+0.024
 dancer+0.024
Neg logit contributions
 Timothy-0.036
 Karen-0.032
 Kitty-0.031
 Samantha-0.031
 Su-0.031
Token jelly
Token ID27644
Feature activation-1.74e+00

Pos logit contributions
 la+0.058
opus+0.055
 microphone+0.054
acy+0.054
 bishop+0.053
Neg logit contributions
 Timothy-0.078
 Karen-0.071
 Kitty-0.070
 Su-0.068
 Samantha-0.068
Tokenfish
Token ID11084
Feature activation-6.14e+00

Pos logit contributions
nam+0.143
 la+0.143
years+0.140
 bishop+0.136
 microphone+0.135
Neg logit contributions
 Timothy-0.067
 about-0.065
 enough-0.061
 meant-0.058
 Sue-0.057
Token!
Token ID0
Feature activation-9.83e-01

Pos logit contributions
 rook+0.548
 bishop+0.544
nam+0.537
opus+0.531
 la+0.527
Neg logit contributions
 Timothy-0.215
 about-0.200
 Kitty-0.185
 Sue-0.181
 meant-0.174
Token He
Token ID679
Feature activation-1.20e+00

Pos logit contributions
 microphone+0.070
 dancer+0.068
opus+0.068
 la+0.068
 guitar+0.065
Neg logit contributions
 Timothy-0.060
 Kitty-0.051
 Sue-0.050
 Karen-0.050
 Su-0.049
Token is
Token ID318
Feature activation-7.44e-01

Pos logit contributions
 microphone+0.078
 dancer+0.078
 team+0.076
 la+0.076
 player+0.076
Neg logit contributions
 Timothy-0.074
 Karen-0.065
 meant-0.064
 Kitty-0.064
 Samantha-0.062
Token hurt
Token ID5938
Feature activation-2.37e-02

Pos logit contributions
 microphone+0.049
opus+0.048
 la+0.048
 dancer+0.047
 player+0.047
Neg logit contributions
 Timothy-0.049
 Karen-0.042
 Samantha-0.041
 Kitty-0.040
 Su-0.040
Token!"
Token ID2474
Feature activation-2.42e+00

Pos logit contributions
 microphone+0.002
 la+0.002
opus+0.002
 rook+0.002
 bishop+0.002
Neg logit contributions
 Timothy-0.001
 Karen-0.001
 Kitty-0.001
 Su-0.001
 Samantha-0.001
Tokenflake
Token ID47597
Feature activation-2.89e+00

Pos logit contributions
 Timothy+0.106
 Kitty+0.092
 Karen+0.091
 Kim+0.090
 Bill+0.090
Neg logit contributions
 microphone-0.108
 la-0.107
 dancer-0.103
opus-0.103
nam-0.100
Token is
Token ID318
Feature activation-6.36e-01

Pos logit contributions
opus+0.189
nam+0.182
 microphone+0.179
 rook+0.177
 la+0.176
Neg logit contributions
 about-0.156
 Timothy-0.153
 meant-0.151
 Kitty-0.143
 Sue-0.141
Token small
Token ID1402
Feature activation-8.37e-01

Pos logit contributions
 microphone+0.035
 la+0.034
 player+0.033
 thumb+0.033
 dancer+0.033
Neg logit contributions
 Timothy-0.049
 Karen-0.044
 Samantha-0.042
 Su-0.041
 Kitty-0.041
Token and
Token ID290
Feature activation-4.17e-01

Pos logit contributions
opus+0.058
 la+0.058
 microphone+0.058
 dancer+0.056
 player+0.054
Neg logit contributions
 Timothy-0.049
 Karen-0.042
 about-0.041
 Kitty-0.041
 Bill-0.040
Token has
Token ID468
Feature activation-7.57e-01

Pos logit contributions
opus+0.028
 microphone+0.027
 team+0.026
 la+0.025
ater+0.025
Neg logit contributions
 Timothy-0.027
 Sue-0.025
 Kim-0.025
 Kitty-0.025
 Karen-0.023
Token eight
Token ID3624
Feature activation-6.12e+00

Pos logit contributions
opus+0.053
 microphone+0.053
 la+0.051
 dancer+0.051
 player+0.051
Neg logit contributions
 Timothy-0.045
 Karen-0.040
 Kitty-0.038
 Su-0.038
 Samantha-0.038
Token points
Token ID2173
Feature activation-1.67e-02

Pos logit contributions
 microphone+0.345
opus+0.344
 player+0.337
 dancer+0.329
 la+0.322
Neg logit contributions
 Timothy-0.437
 Su-0.389
 Kitty-0.387
 Karen-0.383
 Bill-0.383
Token.
Token ID13
Feature activation-7.34e-01

Pos logit contributions
 microphone+0.001
opus+0.001
 la+0.001
 player+0.001
 dancer+0.001
Neg logit contributions
 Timothy-0.001
 Karen-0.001
 Su-0.001
 Bill-0.001
 Samantha-0.001
Token They
Token ID1119
Feature activation-8.28e-02

Pos logit contributions
 microphone+0.043
 player+0.043
 dancer+0.043
 la+0.043
 brain+0.042
Neg logit contributions
 Timothy-0.051
 Karen-0.045
 Samantha-0.043
 Su-0.042
 Bill-0.041
Token are
Token ID389
Feature activation-2.26e+00

Pos logit contributions
opus+0.006
 microphone+0.006
 player+0.005
 pupil+0.005
 dancer+0.005
Neg logit contributions
 Timothy-0.004
 meant-0.004
 about-0.004
 Kitty-0.004
 Sue-0.004
Token both
Token ID1111
Feature activation-3.93e+00

Pos logit contributions
 microphone+0.129
 la+0.123
 player+0.122
 dancer+0.121
 thumb+0.121
Neg logit contributions
 Timothy-0.171
 Karen-0.153
 Samantha-0.146
 Su-0.144
 Kitty-0.143
TokenLet
Token ID5756
Feature activation-4.31e-01

Pos logit contributions
 Timothy+0.004
 Karen+0.003
 Samantha+0.003
 Su+0.003
Kim+0.003
Neg logit contributions
 microphone-0.004
opus-0.004
 brain-0.004
 la-0.004
 rook-0.004
Token's
Token ID338
Feature activation-1.08e+00

Pos logit contributions
 microphone+0.016
 la+0.015
opus+0.015
 player+0.015
 dancer+0.015
Neg logit contributions
 Timothy-0.041
 Karen-0.037
 Samantha-0.036
 Su-0.036
 Kitty-0.036
Token pretend
Token ID16614
Feature activation-3.20e+00

Pos logit contributions
 microphone+0.062
 la+0.060
opus+0.060
 player+0.059
 dancer+0.059
Neg logit contributions
 Timothy-0.080
 Karen-0.071
 Samantha-0.068
 Su-0.067
 Kitty-0.067
Token we
Token ID356
Feature activation-1.89e+00

Pos logit contributions
 la+0.204
 microphone+0.203
opus+0.202
 thumb+0.199
 player+0.197
Neg logit contributions
 Timothy-0.204
 Karen-0.184
 Kitty-0.178
 Su-0.174
 Sue-0.173
Token are
Token ID389
Feature activation-1.68e+00

Pos logit contributions
opus+0.107
 microphone+0.105
 dancer+0.101
 la+0.100
 player+0.098
Neg logit contributions
 Timothy-0.139
 Karen-0.124
 Kitty-0.121
 Su-0.120
 Bill-0.118
Token mer
Token ID4017
Feature activation-6.12e+00

Pos logit contributions
 microphone+0.103
 la+0.098
 player+0.097
 dancer+0.097
 team+0.097
Neg logit contributions
 Timothy-0.121
 Karen-0.109
 Samantha-0.105
 Su-0.100
 Kitty-0.100
Tokenmaid
Token ID23151
Feature activation-3.01e+00

Pos logit contributions
 microphone+0.206
 la+0.198
 brain+0.196
 thumb+0.195
opus+0.194
Neg logit contributions
 Timothy-0.556
 Karen-0.511
 theirs-0.496
 about-0.494
 Samantha-0.491
Tokens
Token ID82
Feature activation-1.48e+00

Pos logit contributions
 brain+0.153
 thumb+0.151
 microphone+0.149
 rook+0.147
 bishop+0.142
Neg logit contributions
 Timothy-0.218
 Kitty-0.205
 Karen-0.204
 Sue-0.195
 Bill-0.191
Token and
Token ID290
Feature activation+6.80e-01

Pos logit contributions
 microphone+0.095
opus+0.093
 la+0.092
 dancer+0.091
 thumb+0.090
Neg logit contributions
 Timothy-0.097
 Karen-0.087
 Kitty-0.082
 Samantha-0.082
 Su-0.081
Token swim
Token ID9422
Feature activation-3.19e+00

Pos logit contributions
 Timothy+0.046
 Kitty+0.043
 Karen+0.043
 Sue+0.042
 Bill+0.042
Neg logit contributions
opus-0.042
ater-0.040
ler-0.039
 microphone-0.038
 la-0.037
Token in
Token ID287
Feature activation-2.41e+00

Pos logit contributions
opus+0.207
 microphone+0.205
 thumb+0.203
 la+0.200
 brain+0.197
Neg logit contributions
 Timothy-0.195
 Karen-0.173
 about-0.171
 theirs-0.170
 Kitty-0.170
Token
Token ID198
Feature activation-5.22e-01

Pos logit contributions
 microphone+0.063
 player+0.061
 la+0.061
 dancer+0.059
 brain+0.059
Neg logit contributions
 Timothy-0.067
 Karen-0.058
 Samantha-0.054
 Su-0.054
 Bill-0.053
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
 microphone+0.024
 player+0.023
 team+0.022
opus+0.022
 guitar+0.022
Neg logit contributions
 Timothy-0.043
 Karen-0.039
 Samantha-0.038
 Su-0.036
 Bill-0.036
Token"
Token ID1
Feature activation-1.93e+00

Pos logit contributions
 microphone+0.036
 thumb+0.035
 la+0.035
 dancer+0.034
 brain+0.034
Neg logit contributions
 Timothy-0.029
 Karen-0.025
Kim-0.025
 Samantha-0.025
 Kitty-0.024
TokenJ
Token ID41
Feature activation-1.11e+00

Pos logit contributions
 microphone+0.128
 la+0.124
opus+0.123
 thumb+0.122
 player+0.121
Neg logit contributions
 Timothy-0.127
 Karen-0.112
 Samantha-0.107
 Kitty-0.105
 Su-0.104
Tokenelly
Token ID6148
Feature activation-3.47e-02

Pos logit contributions
 microphone+0.058
 la+0.054
 dancer+0.053
 thumb+0.053
 player+0.053
Neg logit contributions
 Timothy-0.089
 Karen-0.079
 Su-0.077
 Samantha-0.077
 Kitty-0.076
Tokenfish
Token ID11084
Feature activation-6.11e+00

Pos logit contributions
 microphone+0.002
 la+0.002
 rook+0.002
opus+0.002
 bishop+0.002
Neg logit contributions
 Timothy-0.002
 Karen-0.002
 Samantha-0.002
 Sue-0.002
 Su-0.002
Token are
Token ID389
Feature activation-2.38e+00

Pos logit contributions
 la+0.439
nam+0.437
 rook+0.437
 dancer+0.432
 microphone+0.431
Neg logit contributions
 Timothy-0.333
 meant-0.280
 Kitty-0.279
 about-0.277
 Sue-0.276
Token very
Token ID845
Feature activation-2.16e+00

Pos logit contributions
 microphone+0.156
 player+0.148
 la+0.147
 thumb+0.147
 brain+0.145
Neg logit contributions
 Timothy-0.160
 Karen-0.145
 Samantha-0.137
 Su-0.135
Kim-0.131
Token interesting
Token ID3499
Feature activation+2.84e+00

Pos logit contributions
 microphone+0.151
 la+0.147
opus+0.144
 dancer+0.143
 player+0.142
Neg logit contributions
 Timothy-0.134
 Karen-0.116
 Samantha-0.111
 Kitty-0.110
 Su-0.110
Token,
Token ID11
Feature activation-1.48e+00

Pos logit contributions
 Timothy+0.166
 Karen+0.145
 Kitty+0.141
 about+0.141
 Sue+0.138
Neg logit contributions
 microphone-0.203
 la-0.199
opus-0.197
 dancer-0.193
 player-0.192
Token but
Token ID475
Feature activation+6.90e-02

Pos logit contributions
 microphone+0.113
opus+0.113
 thumb+0.108
 brain+0.107
 dancer+0.106
Neg logit contributions
 Timothy-0.081
 Sue-0.070
 about-0.068
 Su-0.067
 Karen-0.067
Token.
Token ID13
Feature activation-9.60e-01

Pos logit contributions
 microphone+0.005
 la+0.005
opus+0.005
 player+0.005
 dancer+0.005
Neg logit contributions
 Timothy-0.005
 Karen-0.004
 Samantha-0.004
 Su-0.004
 Kitty-0.004
Token He
Token ID679
Feature activation-1.20e+00

Pos logit contributions
 microphone+0.059
opus+0.057
fish+0.055
 dancer+0.055
 guitar+0.054
Neg logit contributions
 Timothy-0.070
 Karen-0.064
 Sue-0.064
 Kitty-0.063
 Kim-0.062
Token touches
Token ID18105
Feature activation-7.07e-01

Pos logit contributions
 microphone+0.082
opus+0.079
 dancer+0.078
 la+0.078
 thumb+0.077
Neg logit contributions
 Timothy-0.074
 Karen-0.065
 Kitty-0.061
 Samantha-0.061
 Bill-0.061
Token the
Token ID262
Feature activation-3.48e-01

Pos logit contributions
 microphone+0.054
opus+0.054
 la+0.053
 dancer+0.052
 team+0.051
Neg logit contributions
 Timothy-0.039
 Karen-0.033
 Kitty-0.032
 Su-0.032
 Sue-0.032
Token star
Token ID3491
Feature activation-3.55e+00

Pos logit contributions
opus+0.019
acy+0.019
 la+0.019
 team+0.018
sie+0.018
Neg logit contributions
 Timothy-0.023
 Sue-0.021
 Kitty-0.020
 Karen-0.020
 Su-0.020
Tokenfish
Token ID11084
Feature activation-6.11e+00

Pos logit contributions
opus+0.281
years+0.277
nam+0.276
 la+0.274
cient+0.270
Neg logit contributions
 about-0.141
 meant-0.133
 Sue-0.126
 enough-0.114
 Timothy-0.113
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
opus+0.511
 rook+0.493
 la+0.485
 bishop+0.483
nam+0.480
Neg logit contributions
 about-0.250
 meant-0.233
 Timothy-0.215
 enough-0.211
 Sue-0.207
Token The
Token ID383
Feature activation-8.83e-01

Pos logit contributions
 microphone+0.021
opus+0.020
 thumb+0.020
 dancer+0.020
 la+0.020
Neg logit contributions
 Timothy-0.017
 Karen-0.015
 Sue-0.015
 Kitty-0.014
 Su-0.014
Token star
Token ID3491
Feature activation-3.47e+00

Pos logit contributions
mine+0.039
 la+0.038
cient+0.037
years+0.037
opus+0.036
Neg logit contributions
 fun-0.057
 lesson-0.056
 Sue-0.055
 Timothy-0.054
 meant-0.053
Tokenfish
Token ID11084
Feature activation-5.93e+00

Pos logit contributions
nam+0.189
 rook+0.156
 la+0.152
 tut+0.151
chie+0.149
Neg logit contributions
 meant-0.248
 about-0.239
 happened-0.232
 Sue-0.212
 resolved-0.209
Token moves
Token ID6100
Feature activation-1.33e+00

Pos logit contributions
nam+0.432
opus+0.425
 rook+0.417
 la+0.406
 pupil+0.404
Neg logit contributions
 meant-0.282
 about-0.270
 Timothy-0.252
 happened-0.245
 resolved-0.235
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
 microphone+0.080
opus+0.078
 thumb+0.076
 la+0.074
 dancer+0.074
Neg logit contributions
 Timothy-0.057
 Karen-0.051
 Sue-0.051
 about-0.051
 Kitty-0.049
Token six
Token ID2237
Feature activation-1.52e+00

Pos logit contributions
opus+0.069
 microphone+0.067
 thumb+0.064
 dancer+0.063
 player+0.062
Neg logit contributions
 Timothy-0.075
 Karen-0.069
 Sue-0.067
 Kitty-0.067
 Bill-0.066
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 microphone+0.116
opus+0.113
 thumb+0.109
 la+0.108
 player+0.107
Neg logit contributions
 Timothy-0.079
 Sue-0.071
 Karen-0.071
 about-0.070
 Kitty-0.068
Token seven
Token ID3598
Feature activation-2.27e+00

Pos logit contributions
opus+0.078
 microphone+0.071
fish+0.067
 thumb+0.067
 dancer+0.065
Neg logit contributions
 Sue-0.070
 Timothy-0.068
 Karen-0.065
 Kim-0.065
 Kitty-0.065
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
 microphone+0.171
opus+0.167
 thumb+0.162
 player+0.159
 dancer+0.158
Neg logit contributions
 Timothy-0.119
 Sue-0.110
 about-0.107
 Karen-0.107
 Bill-0.106
Token eight
Token ID3624
Feature activation-6.09e+00

Pos logit contributions
opus+0.079
fish+0.072
 microphone+0.071
ler+0.068
 thumb+0.065
Neg logit contributions
 Sue-0.075
 Wil-0.071
 Kim-0.071
 Jen-0.071
 Bill-0.070
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
opus+0.517
 microphone+0.514
 thumb+0.480
 dancer+0.476
 player+0.475
Neg logit contributions
 Sue-0.246
 Timothy-0.243
 enough-0.233
 about-0.227
 Bill-0.223
Token nine
Token ID5193
Feature activation-3.56e+00

Pos logit contributions
opus+0.092
 microphone+0.088
 thumb+0.083
fish+0.081
 player+0.080
Neg logit contributions
 Timothy-0.072
 Sue-0.069
 Karen-0.068
 Bill-0.067
 Kim-0.067
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 microphone+0.233
opus+0.224
 la+0.220
 thumb+0.219
 player+0.219
Neg logit contributions
 Timothy-0.234
 Karen-0.209
 Samantha-0.197
 Kitty-0.197
 Su-0.196
Token ten
Token ID3478
Feature activation-5.91e-01

Pos logit contributions
 microphone+0.086
opus+0.084
 player+0.081
 thumb+0.081
 dancer+0.080
Neg logit contributions
 Timothy-0.091
 Karen-0.082
 Samantha-0.078
 Kitty-0.078
 Bill-0.078
Token,
Token ID11
Feature activation-1.37e+00

Pos logit contributions
 microphone+0.046
opus+0.044
 thumb+0.043
 la+0.042
 player+0.042
Neg logit contributions
 Timothy-0.032
 Karen-0.028
 Sue-0.027
 about-0.027
 Bill-0.027
Token
Token ID198
Feature activation-3.05e-01

Pos logit contributions
 microphone+0.016
 la+0.016
 player+0.015
opus+0.015
 dancer+0.015
Neg logit contributions
 Timothy-0.023
 Karen-0.021
 Samantha-0.020
 Su-0.019
 Bill-0.019
TokenThey
Token ID2990
Feature activation+1.01e-01

Pos logit contributions
 microphone+0.024
 thumb+0.022
 la+0.022
 player+0.022
opus+0.022
Neg logit contributions
 Timothy-0.016
Kim-0.014
 Karen-0.014
 Kitty-0.014
 Samantha-0.013
Token left
Token ID1364
Feature activation+1.98e-01

Pos logit contributions
 Timothy+0.006
 Karen+0.005
 Samantha+0.005
 Su+0.005
 Kitty+0.005
Neg logit contributions
 microphone-0.007
opus-0.007
 la-0.007
 dancer-0.007
 thumb-0.007
Token the
Token ID262
Feature activation-4.38e-01

Pos logit contributions
 Timothy+0.011
 Karen+0.010
 Samantha+0.009
 Kitty+0.009
 Su+0.009
Neg logit contributions
opus-0.015
 microphone-0.015
 la-0.015
 thumb-0.014
 player-0.014
Token jelly
Token ID27644
Feature activation-1.56e+00

Pos logit contributions
 la+0.029
chie+0.028
mine+0.027
cient+0.027
 rook+0.025
Neg logit contributions
 Timothy-0.024
 fun-0.023
 about-0.021
 Sue-0.021
 enough-0.021
Tokenfish
Token ID11084
Feature activation-6.09e+00

Pos logit contributions
 la+0.081
chie+0.070
nam+0.070
cient+0.064
 Please+0.062
Neg logit contributions
 about-0.110
ores-0.103
 theirs-0.102
 Sue-0.101
 enough-0.100
Token on
Token ID319
Feature activation+7.93e-01

Pos logit contributions
 la+0.494
opus+0.491
chie+0.469
acy+0.461
 dancer+0.452
Neg logit contributions
 about-0.268
 theirs-0.249
 Timothy-0.243
 Sue-0.240
 Kitty-0.235
Token the
Token ID262
Feature activation-4.56e-01

Pos logit contributions
 Timothy+0.051
 Karen+0.045
 Samantha+0.042
 Su+0.042
 Kitty+0.041
Neg logit contributions
 microphone-0.054
 player-0.052
 la-0.052
 dancer-0.051
 thumb-0.051
Token sand
Token ID6450
Feature activation-3.36e-02

Pos logit contributions
 la+0.036
years+0.032
opus+0.032
chie+0.032
cient+0.031
Neg logit contributions
 Timothy-0.020
 about-0.018
 Sue-0.018
 lesson-0.017
 theirs-0.017
Token and
Token ID290
Feature activation-3.70e-01

Pos logit contributions
 la+0.003
opus+0.003
 microphone+0.003
 dancer+0.003
 brain+0.003
Neg logit contributions
 Timothy-0.001
 about-0.001
 Sue-0.001
 theirs-0.001
 Su-0.001
Token ran
Token ID4966
Feature activation-1.88e-01

Pos logit contributions
 microphone+0.024
 la+0.023
 player+0.023
opus+0.023
 dancer+0.023
Neg logit contributions
 Timothy-0.024
 Karen-0.022
 Samantha-0.020
 Su-0.020
 Kitty-0.020
Token Trust
Token ID9870
Feature activation-4.86e-01

Pos logit contributions
fish+0.067
ler+0.063
 guitar+0.063
opus+0.063
 microphone+0.062
Neg logit contributions
 Timothy-0.071
 Kim-0.071
 Kitty-0.070
 Sue-0.069
 Jen-0.067
Token me
Token ID502
Feature activation-1.47e+00

Pos logit contributions
 microphone+0.020
 la+0.019
opus+0.019
 player+0.018
 dancer+0.018
Neg logit contributions
 Timothy-0.044
 Karen-0.041
 Samantha-0.039
 Kitty-0.039
 Su-0.039
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 microphone+0.085
 la+0.081
 player+0.080
opus+0.080
 thumb+0.080
Neg logit contributions
 Timothy-0.109
 Karen-0.097
 Samantha-0.093
 Su-0.092
 Kitty-0.090
Token The
Token ID383
Feature activation-1.27e+00

Pos logit contributions
fish+0.065
opus+0.063
 dancer+0.063
 microphone+0.063
 guitar+0.061
Neg logit contributions
 Timothy-0.074
 Kim-0.069
 Kitty-0.069
 Sue-0.067
 Su-0.066
Token jelly
Token ID27644
Feature activation-1.68e+00

Pos logit contributions
 la+0.088
mine+0.083
chie+0.081
cient+0.081
nam+0.076
Neg logit contributions
 lesson-0.064
 Timothy-0.063
 fun-0.063
 meant-0.060
 enough-0.059
Tokenfish
Token ID11084
Feature activation-6.09e+00

Pos logit contributions
 la+0.116
nam+0.116
 bishop+0.113
 rook+0.112
 microphone+0.111
Neg logit contributions
 Timothy-0.088
 about-0.083
 Sue-0.082
 meant-0.082
 Kitty-0.077
Token is
Token ID318
Feature activation-7.14e-01

Pos logit contributions
 rook+0.448
 bishop+0.443
 la+0.439
 dancer+0.437
 microphone+0.432
Neg logit contributions
 Timothy-0.316
 meant-0.282
 Kitty-0.274
 about-0.272
 Karen-0.268
Token not
Token ID407
Feature activation+1.53e-01

Pos logit contributions
 microphone+0.047
 thumb+0.044
 la+0.044
 player+0.044
 brain+0.043
Neg logit contributions
 Timothy-0.048
 Karen-0.043
 Samantha-0.041
 Su-0.040
 Bill-0.039
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
 Timothy+0.010
 Karen+0.009
 Samantha+0.009
 Su+0.009
Kim+0.009
Neg logit contributions
 microphone-0.010
 thumb-0.009
 la-0.009
 player-0.009
 brain-0.009
Token good
Token ID922
Feature activation+1.49e+00

Pos logit contributions
mine+0.067
 la+0.066
opus+0.063
acy+0.062
cient+0.062
Neg logit contributions
 Timothy-0.057
 fun-0.050
 Sue-0.050
 theirs-0.049
 Kitty-0.049
Token friend
Token ID1545
Feature activation-6.89e-01

Pos logit contributions
 Timothy+0.093
 Karen+0.080
 Samantha+0.079
 Sue+0.078
 Su+0.078
Neg logit contributions
 la-0.098
opus-0.097
 microphone-0.097
 bishop-0.090
 rook-0.090
Token Can
Token ID1680
Feature activation-1.38e+00

Pos logit contributions
 microphone+0.045
opus+0.044
fish+0.042
 guitar+0.042
 dancer+0.041
Neg logit contributions
 Timothy-0.058
 Kitty-0.053
 Karen-0.053
 Sue-0.051
 Bill-0.051
Token we
Token ID356
Feature activation-2.28e+00

Pos logit contributions
ater+0.099
 tant+0.095
ler+0.088
mer+0.085
 robe+0.085
Neg logit contributions
 Kitty-0.087
 Timothy-0.086
 Sue-0.082
 Karen-0.081
 Kim-0.080
Token see
Token ID766
Feature activation-2.17e-01

Pos logit contributions
 la+0.157
opus+0.155
 microphone+0.151
 bishop+0.147
 player+0.145
Neg logit contributions
 Timothy-0.135
 Kitty-0.119
 Bill-0.116
 Karen-0.115
 Sue-0.112
Token a
Token ID257
Feature activation-9.07e-01

Pos logit contributions
 microphone+0.015
 la+0.015
opus+0.015
 player+0.015
 dancer+0.015
Neg logit contributions
 Timothy-0.013
 Karen-0.012
 Samantha-0.011
 Su-0.011
 Kitty-0.011
Token sword
Token ID8429
Feature activation-3.46e+00

Pos logit contributions
 la+0.046
 microphone+0.043
mine+0.043
opus+0.042
 thumb+0.041
Neg logit contributions
 Timothy-0.069
 Karen-0.062
 Bill-0.060
 Samantha-0.060
 Kitty-0.059
Tokenfish
Token ID11084
Feature activation-6.07e+00

Pos logit contributions
 thumb+0.232
opus+0.231
 microphone+0.231
 la+0.228
nam+0.225
Neg logit contributions
 Timothy-0.192
 about-0.183
 Karen-0.167
 Kitty-0.165
 Sue-0.165
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
opus+0.409
 la+0.405
 thumb+0.403
chie+0.400
 rook+0.397
Neg logit contributions
 Timothy-0.328
 about-0.324
 Sue-0.285
 meant-0.282
 Kitty-0.282
Token Mom
Token ID11254
Feature activation-1.98e-02

Pos logit contributions
opus+0.079
ler+0.076
 microphone+0.076
ater+0.075
 cheek+0.074
Neg logit contributions
 Timothy-0.076
 Bill-0.070
 Kitty-0.069
 Sue-0.068
 theirs-0.068
Token?"
Token ID1701
Feature activation-1.53e+00

Pos logit contributions
 microphone+0.001
opus+0.001
 la+0.001
 player+0.001
 dancer+0.001
Neg logit contributions
 Timothy-0.001
 Karen-0.001
 Samantha-0.001
 Su-0.001
 Kitty-0.001
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
 microphone+0.114
opus+0.110
 la+0.110
 player+0.109
 dancer+0.108
Neg logit contributions
 Timothy-0.089
 Karen-0.077
 Samantha-0.073
 Su-0.072
 Kitty-0.072
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
 microphone+0.017
 la+0.016
opus+0.016
 player+0.016
 dancer+0.016
Neg logit contributions
 Timothy-0.015
 Karen-0.013
 Samantha-0.013
 Su-0.013
 Kitty-0.012
Token I
Token ID314
Feature activation-3.58e+00

Pos logit contributions
 microphone+0.005
opus+0.005
 la+0.005
 player+0.005
 dancer+0.005
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Kitty-0.006
 Bill-0.006
 Samantha-0.006
Token still
Token ID991
Feature activation-1.91e+00

Pos logit contributions
 team+0.175
 brain+0.174
 player+0.173
 microphone+0.170
 la+0.167
Neg logit contributions
 Timothy-0.295
 Karen-0.267
 Samantha-0.256
 Su-0.250
 Bill-0.243
Token think
Token ID892
Feature activation-2.53e+00

Pos logit contributions
 microphone+0.130
opus+0.130
 dancer+0.123
 rook+0.122
 bishop+0.121
Neg logit contributions
 Timothy-0.121
 Karen-0.106
 Kitty-0.103
 Samantha-0.102
 Su-0.101
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
 microphone+0.154
opus+0.148
 la+0.147
 player+0.145
 thumb+0.145
Neg logit contributions
 Timothy-0.177
 Karen-0.159
 Kitty-0.154
 Su-0.151
 Samantha-0.151
Token jelly
Token ID27644
Feature activation-1.54e+00

Pos logit contributions
 la+0.023
mine+0.022
chie+0.021
cient+0.021
opus+0.020
Neg logit contributions
 Timothy-0.019
 lesson-0.019
 fun-0.018
 meant-0.018
 fault-0.017
Tokenfish
Token ID11084
Feature activation-6.07e+00

Pos logit contributions
 la+0.047
 microphone+0.043
nam+0.041
 bishop+0.040
 rook+0.039
Neg logit contributions
 Timothy-0.144
 meant-0.139
 about-0.137
 Sue-0.137
 Kitty-0.134
Token is
Token ID318
Feature activation-6.62e-01

Pos logit contributions
 microphone+0.422
 la+0.422
opus+0.413
 dancer+0.413
 rook+0.412
Neg logit contributions
 Timothy-0.345
 meant-0.303
 Karen-0.299
 Kitty-0.299
 about-0.292
Token pretty
Token ID2495
Feature activation+1.05e+00

Pos logit contributions
 microphone+0.044
 la+0.042
 player+0.042
 thumb+0.042
 dancer+0.041
Neg logit contributions
 Timothy-0.044
 Karen-0.039
 Samantha-0.037
 Su-0.036
 Bill-0.035
Token."
Token ID526
Feature activation-2.46e+00

Pos logit contributions
 Timothy+0.049
 about+0.047
 meant+0.043
 Kitty+0.042
 Karen+0.042
Neg logit contributions
 la-0.083
opus-0.083
 microphone-0.081
 bishop-0.079
 rook-0.079
Token L
Token ID406
Feature activation+6.63e-01

Pos logit contributions
 microphone+0.177
 la+0.171
opus+0.170
 player+0.169
 dancer+0.168
Neg logit contributions
 Timothy-0.148
 Karen-0.129
 Samantha-0.122
 Su-0.120
 Kitty-0.120
Tokenila
Token ID10102
Feature activation+3.64e-01

Pos logit contributions
 Timothy+0.045
 Karen+0.039
 Samantha+0.037
 Su+0.037
 Kitty+0.037
Neg logit contributions
 microphone-0.043
 la-0.042
opus-0.041
 player-0.041
 dancer-0.041
Token didn
Token ID1422
Feature activation+4.48e-01

Pos logit contributions
 Timothy+0.003
 Karen+0.002
 Samantha+0.002
 Su+0.002
 Kitty+0.002
Neg logit contributions
 microphone-0.002
 player-0.002
 team-0.002
 dancer-0.002
 thumb-0.002
Token't
Token ID470
Feature activation-2.08e+00

Pos logit contributions
 Timothy+0.044
 Karen+0.040
 Samantha+0.040
 Su+0.039
 Bill+0.038
Neg logit contributions
 microphone-0.016
 player-0.014
opus-0.013
 dancer-0.013
 thumb-0.013
Token need
Token ID761
Feature activation-1.38e+00

Pos logit contributions
 microphone+0.134
opus+0.133
 la+0.129
 player+0.126
 dancer+0.125
Neg logit contributions
 Timothy-0.140
 Karen-0.124
 Samantha-0.118
 Kitty-0.117
 Su-0.117
Token the
Token ID262
Feature activation-4.86e-01

Pos logit contributions
 microphone+0.087
 brain+0.080
 dancer+0.079
 player+0.078
 star+0.078
Neg logit contributions
 Timothy-0.101
 Karen-0.088
Kim-0.087
 Samantha-0.082
 Su-0.081
Token jelly
Token ID27644
Feature activation-1.51e+00

Pos logit contributions
 la+0.032
chie+0.028
mine+0.028
cient+0.028
opus+0.028
Neg logit contributions
 Timothy-0.028
 fun-0.027
 lesson-0.026
 meant-0.026
 theirs-0.025
Tokenfish
Token ID11084
Feature activation-6.07e+00

Pos logit contributions
 la+0.038
 microphone+0.032
 thumb+0.029
 bishop+0.027
 brain+0.027
Neg logit contributions
 Timothy-0.152
 about-0.144
 theirs-0.144
 Sue-0.143
 meant-0.140
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
 microphone+0.410
 la+0.402
opus+0.399
 player+0.392
 dancer+0.392
Neg logit contributions
 Timothy-0.383
 Karen-0.336
 Samantha-0.319
 Kitty-0.317
 Su-0.314
Token They
Token ID1119
Feature activation+1.12e-01

Pos logit contributions
 microphone+0.050
 player+0.050
 la+0.049
 dancer+0.048
 brain+0.047
Neg logit contributions
 Timothy-0.066
 Karen-0.057
 Samantha-0.054
 Bill-0.051
 Su-0.051
Token had
Token ID550
Feature activation-2.77e-01

Pos logit contributions
 Timothy+0.008
 Karen+0.007
 Samantha+0.007
 Su+0.007
 Bill+0.007
Neg logit contributions
 microphone-0.007
 player-0.006
 thumb-0.006
 dancer-0.006
 la-0.006
Token each
Token ID1123
Feature activation-1.79e+00

Pos logit contributions
 microphone+0.021
 team+0.021
 thumb+0.020
 guitar+0.020
 brain+0.020
Neg logit contributions
 Timothy-0.016
 Karen-0.014
Kim-0.013
 Samantha-0.012
 Su-0.012
Token other
Token ID584
Feature activation+1.66e+00

Pos logit contributions
 microphone+0.099
 player+0.096
 team+0.095
 dancer+0.091
 thumb+0.091
Neg logit contributions
 Timothy-0.143
 Karen-0.128
 Samantha-0.122
 Su-0.120
 Bill-0.120
Token"
Token ID1
Feature activation-1.63e+00

Pos logit contributions
 microphone+0.007
 la+0.007
opus+0.007
 dancer+0.007
 thumb+0.007
Neg logit contributions
 Timothy-0.012
 Karen-0.011
 Samantha-0.010
 Kitty-0.010
 Su-0.010
TokenAnd
Token ID1870
Feature activation-1.69e+00

Pos logit contributions
 microphone+0.122
 brain+0.119
 thumb+0.119
opus+0.115
 la+0.114
Neg logit contributions
 Timothy-0.089
 Karen-0.078
Kim-0.078
 Kitty-0.073
 Sue-0.073
Token I
Token ID314
Feature activation-3.36e+00

Pos logit contributions
 microphone+0.108
opus+0.107
 la+0.103
 thumb+0.103
 dancer+0.100
Neg logit contributions
 Timothy-0.115
 Karen-0.103
 Bill-0.100
 Sue-0.098
 Kitty-0.098
Token'm
Token ID1101
Feature activation-1.43e+00

Pos logit contributions
 player+0.173
 team+0.172
 la+0.166
 brain+0.166
 microphone+0.163
Neg logit contributions
 Timothy-0.272
 Karen-0.249
 Samantha-0.238
 Kitty-0.235
 Betty-0.226
Token a
Token ID257
Feature activation-8.00e-01

Pos logit contributions
 microphone+0.083
 la+0.080
 dancer+0.079
 player+0.079
 thumb+0.076
Neg logit contributions
 Timothy-0.104
 Karen-0.093
 Samantha-0.089
 Kitty-0.087
Kim-0.086
Token mer
Token ID4017
Feature activation-6.06e+00

Pos logit contributions
mine+0.062
nam+0.056
 wrist+0.056
 la+0.056
 bishop+0.056
Neg logit contributions
 Timothy-0.037
 fun-0.036
 Farmer-0.030
 Karen-0.030
 Sue-0.029
Tokenmaid
Token ID23151
Feature activation-3.09e+00

Pos logit contributions
nam+0.241
 la+0.229
 microphone+0.225
 thumb+0.225
 brain+0.218
Neg logit contributions
 Timothy-0.485
 about-0.449
 Karen-0.443
 theirs-0.433
 Su-0.427
Token!"
Token ID2474
Feature activation-2.07e+00

Pos logit contributions
 microphone+0.229
 thumb+0.225
 brain+0.222
 rook+0.221
 bishop+0.219
Neg logit contributions
 Timothy-0.163
 Karen-0.147
 Kitty-0.144
 Sue-0.140
 Su-0.136
Token Lily
Token ID20037
Feature activation+8.60e-01

Pos logit contributions
 microphone+0.141
 player+0.138
 la+0.136
 dancer+0.136
 team+0.134
Neg logit contributions
 Timothy-0.132
 Karen-0.116
 Samantha-0.110
 Bill-0.105
 Su-0.105
Token said
Token ID531
Feature activation+8.35e-01

Pos logit contributions
 Timothy+0.046
 Karen+0.040
 Samantha+0.037
 Su+0.036
 Kitty+0.036
Neg logit contributions
 microphone-0.067
 la-0.065
opus-0.065
 dancer-0.064
 player-0.064
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 Timothy+0.043
 Karen+0.038
 Sue+0.036
 about+0.036
 Kitty+0.036
Neg logit contributions
opus-0.068
 la-0.065
 microphone-0.064
 bishop-0.062
 dancer-0.062
Token in
Token ID287
Feature activation-2.17e+00

Pos logit contributions
opus+0.079
 la+0.078
 microphone+0.077
 player+0.074
 team+0.073
Neg logit contributions
 Timothy-0.080
 Karen-0.070
 Bill-0.067
 Samantha-0.067
 Su-0.066
Token the
Token ID262
Feature activation+3.21e-01

Pos logit contributions
 microphone+0.147
 la+0.142
opus+0.141
 player+0.140
 dancer+0.140
Neg logit contributions
 Timothy-0.139
 Karen-0.122
 Samantha-0.116
 Su-0.115
 Kitty-0.114
Token sea
Token ID5417
Feature activation-2.67e+00

Pos logit contributions
 Timothy+0.019
 Karen+0.017
 Samantha+0.017
 Bill+0.017
 Sue+0.017
Neg logit contributions
 la-0.021
opus-0.021
 microphone-0.019
 thumb-0.019
 bishop-0.019
Token that
Token ID326
Feature activation-3.66e-02

Pos logit contributions
 la+0.215
 microphone+0.214
opus+0.211
 thumb+0.204
 dancer+0.202
Neg logit contributions
 Timothy-0.125
 about-0.111
 Bill-0.109
 Karen-0.105
 Su-0.104
Token the
Token ID262
Feature activation+2.81e-01

Pos logit contributions
 microphone+0.003
 la+0.003
opus+0.003
 thumb+0.002
 player+0.002
Neg logit contributions
 Timothy-0.002
 Karen-0.002
 Su-0.002
 Samantha-0.002
 Bill-0.002
Token mer
Token ID4017
Feature activation-6.06e+00

Pos logit contributions
 Timothy+0.018
 Bill+0.016
 Karen+0.016
 Samantha+0.015
 Sue+0.015
Neg logit contributions
 la-0.018
opus-0.017
 microphone-0.017
 rook-0.016
 tut-0.016
Tokenmaid
Token ID23151
Feature activation-2.73e+00

Pos logit contributions
 la+0.247
 microphone+0.241
 brain+0.238
 thumb+0.237
 rook+0.232
Neg logit contributions
 Timothy-0.485
 about-0.460
 theirs-0.445
 Karen-0.443
 Bill-0.442
Token carefully
Token ID7773
Feature activation-1.58e+00

Pos logit contributions
 rook+0.200
 microphone+0.199
 la+0.197
 bishop+0.195
opus+0.195
Neg logit contributions
 Timothy-0.138
 Karen-0.124
 about-0.122
 meant-0.122
 Bill-0.120
Token chose
Token ID7690
Feature activation+4.79e-01

Pos logit contributions
 microphone+0.107
opus+0.105
 la+0.104
 player+0.102
 dancer+0.102
Neg logit contributions
 Timothy-0.099
 Karen-0.087
 Samantha-0.083
 Su-0.083
 Bill-0.081
Token.
Token ID13
Feature activation+3.27e-01

Pos logit contributions
 Timothy+0.033
 Karen+0.029
 Samantha+0.027
Kim+0.027
 Kitty+0.027
Neg logit contributions
 microphone-0.030
 player-0.028
 thumb-0.028
 dancer-0.028
 la-0.027
Token It
Token ID632
Feature activation+4.42e-01

Pos logit contributions
 Timothy+0.020
 Karen+0.018
 Samantha+0.017
 Su+0.016
 Bill+0.016
Neg logit contributions
 microphone-0.024
opus-0.023
 la-0.023
 thumb-0.022
 dancer-0.022
Token thing
Token ID1517
Feature activation+7.48e-01

Pos logit contributions
 Timothy+0.188
 Karen+0.166
 Bill+0.165
 Samantha+0.157
 about+0.155
Neg logit contributions
 la-0.199
opus-0.198
 microphone-0.196
 player-0.188
 dancer-0.186
Token to
Token ID284
Feature activation+9.37e-02

Pos logit contributions
 Timothy+0.045
 Karen+0.039
 Samantha+0.037
 Kitty+0.036
 Su+0.036
Neg logit contributions
 microphone-0.053
 la-0.052
opus-0.052
 dancer-0.051
 player-0.051
Token study
Token ID2050
Feature activation-2.66e-01

Pos logit contributions
 Timothy+0.006
 Karen+0.005
 Bill+0.005
 about+0.005
 Su+0.005
Neg logit contributions
 microphone-0.006
opus-0.006
 thumb-0.006
 la-0.006
 player-0.006
Token and
Token ID290
Feature activation+8.01e-01

Pos logit contributions
 microphone+0.018
opus+0.018
 la+0.018
 thumb+0.018
 dancer+0.017
Neg logit contributions
 Timothy-0.016
 Karen-0.014
 Samantha-0.014
 Su-0.014
 Bill-0.013
Token learn
Token ID2193
Feature activation+1.29e-01

Pos logit contributions
 Timothy+0.052
 Karen+0.047
 Bill+0.044
 Samantha+0.044
 Kitty+0.044
Neg logit contributions
 microphone-0.052
opus-0.052
 la-0.051
 player-0.049
 dancer-0.049
Token about
Token ID546
Feature activation+5.73e+00

Pos logit contributions
 Timothy+0.008
 Karen+0.007
 Samantha+0.006
 Su+0.006
Kim+0.006
Neg logit contributions
 microphone-0.010
 player-0.009
 dancer-0.009
 guitar-0.009
 brain-0.009
Token with
Token ID351
Feature activation-2.09e+00

Pos logit contributions
 Timothy+0.322
 Karen+0.279
 Samantha+0.261
 Kitty+0.259
 Bill+0.259
Neg logit contributions
 microphone-0.428
opus-0.423
 la-0.423
 player-0.415
 dancer-0.411
Token his
Token ID465
Feature activation-1.02e+00

Pos logit contributions
 microphone+0.151
 la+0.149
opus+0.148
 player+0.145
 dancer+0.145
Neg logit contributions
 Timothy-0.125
 Karen-0.108
 Samantha-0.102
 Kitty-0.101
 Su-0.101
Token friends
Token ID2460
Feature activation+1.84e+00

Pos logit contributions
 la+0.074
 microphone+0.073
opus+0.072
 player+0.070
 dancer+0.069
Neg logit contributions
 Timothy-0.057
 Karen-0.049
 Bill-0.049
 Kitty-0.047
 Sue-0.046
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
 Timothy+0.097
 Karen+0.085
 about+0.085
 Bill+0.080
 Samantha+0.077
Neg logit contributions
opus-0.140
 microphone-0.138
 la-0.137
 thumb-0.133
 dancer-0.133
Token<|endoftext|>
Token ID50256
Feature activation-2.57e+00

Pos logit contributions
 Timothy+0.043
 Karen+0.038
 Samantha+0.037
Kim+0.036
 theirs+0.036
Neg logit contributions
 player-0.034
 microphone-0.032
 team-0.032
 la-0.032
opus-0.032
Token wet
Token ID9583
Feature activation+1.15e+00

Pos logit contributions
 Timothy+0.091
 Karen+0.080
 Samantha+0.077
 Su+0.076
 Kitty+0.076
Neg logit contributions
 microphone-0.083
 la-0.080
opus-0.079
 player-0.078
 dancer-0.078
Token for
Token ID329
Feature activation+1.14e+00

Pos logit contributions
 Timothy+0.082
 Karen+0.074
 Samantha+0.071
 Kitty+0.069
 Su+0.069
Neg logit contributions
 microphone-0.068
 player-0.065
 thumb-0.064
 la-0.063
 dancer-0.063
Token a
Token ID257
Feature activation-2.87e-02

Pos logit contributions
 Timothy+0.074
 Karen+0.066
 Samantha+0.062
 Su+0.060
Kim+0.059
Neg logit contributions
 microphone-0.075
 la-0.072
 thumb-0.071
 team-0.070
 brain-0.070
Token carrot
Token ID40562
Feature activation-1.24e-01

Pos logit contributions
 la+0.002
 microphone+0.002
opus+0.002
 bishop+0.002
 player+0.002
Neg logit contributions
 Timothy-0.002
 Kitty-0.002
 Karen-0.002
 Sue-0.002
 Su-0.002
Token.
Token ID13
Feature activation+9.47e-01

Pos logit contributions
 microphone+0.008
 la+0.008
opus+0.008
 dancer+0.008
 player+0.008
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Samantha-0.007
 Kitty-0.006
 Su-0.006
Token Then
Token ID3244
Feature activation+4.35e+00

Pos logit contributions
 Timothy+0.063
 Karen+0.056
 Samantha+0.053
 Su+0.052
 Kitty+0.052
Neg logit contributions
 microphone-0.061
 la-0.059
 player-0.059
opus-0.058
 thumb-0.058
Token it
Token ID340
Feature activation+5.79e-01

Pos logit contributions
 Timothy+0.318
 Karen+0.284
 Samantha+0.272
 Su+0.268
 Kitty+0.266
Neg logit contributions
 microphone-0.258
 la-0.247
 player-0.244
opus-0.243
 dancer-0.242
Token tried
Token ID3088
Feature activation-2.47e-01

Pos logit contributions
 Timothy+0.037
 Karen+0.032
 Samantha+0.031
 Kitty+0.031
 Su+0.030
Neg logit contributions
 microphone-0.039
 la-0.038
opus-0.038
 dancer-0.037
 player-0.037
Token the
Token ID262
Feature activation+6.84e-01

Pos logit contributions
 microphone+0.015
 thumb+0.015
 brain+0.014
 player+0.014
 dancer+0.014
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Samantha-0.015
Kim-0.015
 Su-0.015
Token playground
Token ID24817
Feature activation+1.53e-01

Pos logit contributions
 Timothy+0.048
 Karen+0.043
 Samantha+0.041
 Kitty+0.040
 Su+0.040
Neg logit contributions
 microphone-0.042
 la-0.041
opus-0.040
 player-0.040
 dancer-0.040
Token,
Token ID11
Feature activation-8.35e-02

Pos logit contributions
 Timothy+0.011
 Karen+0.010
 Samantha+0.009
 Kitty+0.009
 Su+0.009
Neg logit contributions
 microphone-0.009
 player-0.009
 thumb-0.008
 dancer-0.008
 team-0.008
Token.
Token ID13
Feature activation+5.76e-01

Pos logit contributions
 Timothy+0.097
 Karen+0.086
 Su+0.081
 Kitty+0.080
 Samantha+0.079
Neg logit contributions
 microphone-0.110
opus-0.108
 la-0.108
 thumb-0.105
 rook-0.104
Token Every
Token ID3887
Feature activation+6.86e-01

Pos logit contributions
 Timothy+0.040
 Karen+0.035
 Samantha+0.034
 Su+0.033
 Kitty+0.033
Neg logit contributions
 microphone-0.035
 player-0.035
 la-0.035
 dancer-0.034
 thumb-0.034
Token time
Token ID640
Feature activation+2.50e+00

Pos logit contributions
 Timothy+0.040
 Karen+0.034
 Su+0.033
 Kitty+0.033
 Samantha+0.033
Neg logit contributions
 microphone-0.050
 la-0.049
opus-0.049
 rook-0.048
 bishop-0.048
Token it
Token ID340
Feature activation+4.88e-01

Pos logit contributions
 Timothy+0.183
 Karen+0.163
 Samantha+0.157
 Su+0.154
 Kitty+0.153
Neg logit contributions
 microphone-0.146
 la-0.140
 player-0.139
opus-0.139
 dancer-0.138
Token r
Token ID374
Feature activation+1.59e+00

Pos logit contributions
 Timothy+0.033
 Karen+0.029
 Samantha+0.028
 Su+0.027
 Kitty+0.027
Neg logit contributions
 microphone-0.031
 la-0.030
 player-0.030
 dancer-0.030
opus-0.030
Tokenained
Token ID1328
Feature activation+5.59e+00

Pos logit contributions
 Timothy+0.069
 Karen+0.059
 Samantha+0.059
 Bill+0.054
 Kitty+0.051
Neg logit contributions
ater-0.127
 microphone-0.125
mine-0.123
ubby-0.122
 team-0.122
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 Timothy+0.415
 Karen+0.374
 Samantha+0.363
 Su+0.351
 Bill+0.349
Neg logit contributions
 microphone-0.316
 player-0.301
 la-0.301
 dancer-0.295
opus-0.293
Token he
Token ID339
Feature activation-8.93e-01

Pos logit contributions
 microphone+0.015
 la+0.015
opus+0.015
ater+0.015
 player+0.014
Neg logit contributions
 Timothy-0.011
 Karen-0.009
 Bill-0.009
 Su-0.009
 Kitty-0.009
Token would
Token ID561
Feature activation+3.40e-02

Pos logit contributions
 microphone+0.072
opus+0.072
 la+0.071
 rook+0.070
 cheek+0.069
Neg logit contributions
 Timothy-0.041
 Karen-0.035
 Su-0.034
 Kitty-0.034
 meant-0.033
Token go
Token ID467
Feature activation+1.27e+00

Pos logit contributions
 Timothy+0.002
 Su+0.002
 Karen+0.002
 Kitty+0.002
 about+0.002
Neg logit contributions
opus-0.003
 rook-0.002
 microphone-0.002
 player-0.002
 bishop-0.002
Token outside
Token ID2354
Feature activation-1.67e+00

Pos logit contributions
 Timothy+0.100
 Samantha+0.090
 Karen+0.090
 Bill+0.086
 Kitty+0.082
Neg logit contributions
 microphone-0.069
 dancer-0.064
 la-0.062
 player-0.061
 thumb-0.060
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
 microphone+0.043
 thumb+0.043
opus+0.043
 la+0.043
 dancer+0.042
Neg logit contributions
 Timothy-0.030
 about-0.027
 Karen-0.027
 Bill-0.026
 Sue-0.026
Token They
Token ID1119
Feature activation+1.15e+00

Pos logit contributions
 Timothy+0.029
 supposed+0.028
 secure+0.026
 Karen+0.026
 resolved+0.026
Neg logit contributions
 player-0.017
 team-0.016
 la-0.016
 microphone-0.015
'-0.014
Token made
Token ID925
Feature activation-5.91e-01

Pos logit contributions
 Timothy+0.055
 Karen+0.050
 meant+0.049
 Su+0.049
 Bill+0.048
Neg logit contributions
opus-0.095
 cheek-0.090
 microphone-0.089
 la-0.085
 dancer-0.085
Token up
Token ID510
Feature activation+1.56e+00

Pos logit contributions
 microphone+0.039
 la+0.037
 player+0.037
 dancer+0.036
 team+0.036
Neg logit contributions
 Timothy-0.040
 Karen-0.035
 Samantha-0.034
 Su-0.032
 Kitty-0.032
Token stories
Token ID3923
Feature activation+1.21e+00

Pos logit contributions
 Timothy+0.096
 Karen+0.083
 Samantha+0.079
 Su+0.076
 Kitty+0.076
Neg logit contributions
 microphone-0.112
 la-0.107
 player-0.107
 dancer-0.106
 team-0.105
Token about
Token ID546
Feature activation+5.50e+00

Pos logit contributions
 Timothy+0.077
 Karen+0.065
 Samantha+0.064
 Su+0.062
 Kitty+0.060
Neg logit contributions
 microphone-0.086
 player-0.084
 la-0.082
 dancer-0.082
 team-0.080
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
 Timothy+0.351
 Karen+0.312
 Samantha+0.294
 Kitty+0.293
 Su+0.292
Neg logit contributions
 microphone-0.366
opus-0.360
 la-0.357
 player-0.352
 dancer-0.350
Token animals
Token ID4695
Feature activation+3.73e+00

Pos logit contributions
 Timothy+0.018
 Karen+0.016
 Bill+0.015
 Kitty+0.015
 Sue+0.015
Neg logit contributions
opus-0.019
 thumb-0.018
 la-0.018
 microphone-0.018
mine-0.018
Token and
Token ID290
Feature activation+6.22e-01

Pos logit contributions
 Timothy+0.221
 Karen+0.199
 about+0.195
 Bill+0.190
 Kitty+0.189
Neg logit contributions
opus-0.249
 microphone-0.248
 la-0.246
 thumb-0.244
 dancer-0.239
Token people
Token ID661
Feature activation+2.41e+00

Pos logit contributions
 Timothy+0.039
 Karen+0.035
 Samantha+0.033
 Kitty+0.033
 Bill+0.033
Neg logit contributions
 microphone-0.042
opus-0.042
 la-0.041
 player-0.040
 thumb-0.040
Token who
Token ID508
Feature activation-1.56e+00

Pos logit contributions
 Timothy+0.146
 Karen+0.133
 about+0.126
 Kitty+0.123
 Bill+0.123
Neg logit contributions
opus-0.160
 la-0.159
 microphone-0.159
 thumb-0.157
 brain-0.155
Token
Token ID198
Feature activation+9.22e-01

Pos logit contributions
 player+0.081
opus+0.080
 microphone+0.080
 la+0.079
 team+0.077
Neg logit contributions
 Timothy-0.135
 Karen-0.123
 Samantha-0.120
 Su-0.115
 theirs-0.114
Token
Token ID198
Feature activation+8.96e-01

Pos logit contributions
 Timothy+0.063
 Karen+0.056
 Samantha+0.054
 Bill+0.052
 Su+0.051
Neg logit contributions
 microphone-0.057
 la-0.056
 player-0.055
opus-0.055
 dancer-0.054
TokenL
Token ID43
Feature activation+3.01e-01

Pos logit contributions
 Timothy+0.070
 Karen+0.063
 Samantha+0.060
 Su+0.060
 Kitty+0.059
Neg logit contributions
 microphone-0.049
 la-0.047
opus-0.046
 player-0.046
 dancer-0.046
Tokenily
Token ID813
Feature activation+1.05e+00

Pos logit contributions
 Timothy+0.023
 Karen+0.021
 Samantha+0.020
 Bill+0.020
 Kitty+0.020
Neg logit contributions
 player-0.016
 dancer-0.016
 la-0.016
 microphone-0.016
opus-0.016
Token thought
Token ID1807
Feature activation-2.25e+00

Pos logit contributions
 Timothy+0.070
 Karen+0.062
 Samantha+0.059
 Su+0.058
 Kitty+0.058
Neg logit contributions
 microphone-0.068
 la-0.066
opus-0.064
 dancer-0.064
 player-0.063
Token about
Token ID546
Feature activation+5.77e+00

Pos logit contributions
 microphone+0.204
 la+0.199
 dancer+0.195
 player+0.195
 thumb+0.195
Neg logit contributions
 Timothy-0.091
 Karen-0.074
 Samantha-0.071
Kim-0.066
 Bill-0.062
Token what
Token ID644
Feature activation+3.54e+00

Pos logit contributions
 Timothy+0.394
Kim+0.344
 Karen+0.344
 Samantha+0.338
ained+0.327
Neg logit contributions
 microphone-0.355
 thumb-0.327
 la-0.321
 guitar-0.319
 dancer-0.319
Token Tim
Token ID5045
Feature activation+2.61e+00

Pos logit contributions
 Timothy+0.192
 Karen+0.165
 Samantha+0.162
Kim+0.155
ained+0.154
Neg logit contributions
 microphone-0.263
 thumb-0.255
 player-0.251
 guitar-0.244
 brain-0.242
Tokenmy
Token ID1820
Feature activation+1.18e+00

Pos logit contributions
 Timothy+0.201
 Karen+0.180
 Samantha+0.173
 Su+0.168
 Kitty+0.164
Neg logit contributions
 thumb-0.142
 la-0.141
 microphone-0.141
 player-0.140
 brain-0.135
Token said
Token ID531
Feature activation+1.56e+00

Pos logit contributions
 Timothy+0.083
 Karen+0.072
 Su+0.069
 Samantha+0.069
Kim+0.068
Neg logit contributions
 microphone-0.074
 thumb-0.072
 player-0.072
 la-0.069
opus-0.069
Token and
Token ID290
Feature activation+6.89e-01

Pos logit contributions
 Timothy+0.103
 Karen+0.091
 Samantha+0.086
 Su+0.085
 Kitty+0.084
Neg logit contributions
 microphone-0.103
 la-0.100
opus-0.100
 dancer-0.098
 player-0.098
Token and
Token ID290
Feature activation+5.03e-01

Pos logit contributions
opus+0.052
 microphone+0.051
acy+0.051
 rook+0.048
 dancer+0.048
Neg logit contributions
 Timothy-0.035
 Sue-0.033
 Kitty-0.033
 Karen-0.030
 about-0.030
Token chew
Token ID34722
Feature activation-9.02e-01

Pos logit contributions
 Timothy+0.034
 Kitty+0.032
 Karen+0.031
 Sue+0.031
 Bill+0.029
Neg logit contributions
opus-0.033
 microphone-0.032
ater-0.031
 thumb-0.030
 dancer-0.029
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 microphone+0.055
 la+0.053
 player+0.053
opus+0.052
 dancer+0.052
Neg logit contributions
 Timothy-0.064
 Karen-0.057
 Samantha-0.054
 Su-0.054
 Bill-0.053
Token He
Token ID679
Feature activation-4.02e-01

Pos logit contributions
 Timothy+0.014
 Karen+0.013
 Samantha+0.012
 supposed+0.012
 Su+0.012
Neg logit contributions
 la-0.011
 player-0.011
 team-0.011
 microphone-0.011
 brain-0.011
Token che
Token ID1125
Feature activation-1.99e+00

Pos logit contributions
 microphone+0.027
opus+0.026
 la+0.026
 thumb+0.026
 dancer+0.025
Neg logit contributions
 Timothy-0.026
 Karen-0.023
 Kitty-0.022
 Samantha-0.022
 Su-0.021
Tokenwed
Token ID19103
Feature activation+1.22e+00

Pos logit contributions
 la+0.149
 microphone+0.148
 player+0.146
fish+0.146
ater+0.146
Neg logit contributions
 Timothy-0.111
 Karen-0.099
 Samantha-0.090
 Su-0.088
 Bill-0.087
Token on
Token ID319
Feature activation+1.07e+00

Pos logit contributions
 Timothy+0.089
 Karen+0.082
 Samantha+0.079
 Su+0.078
 Bill+0.075
Neg logit contributions
 microphone-0.069
 player-0.067
 la-0.067
 thumb-0.064
 brain-0.064
Token sticks
Token ID16461
Feature activation+2.04e+00

Pos logit contributions
 Timothy+0.064
 Karen+0.055
 Samantha+0.052
 Su+0.051
 Kitty+0.051
Neg logit contributions
 microphone-0.078
 la-0.075
opus-0.075
 player-0.074
 dancer-0.074
Token,
Token ID11
Feature activation-7.38e-01

Pos logit contributions
 Timothy+0.120
 about+0.102
 Sue+0.100
 Karen+0.100
 Kitty+0.100
Neg logit contributions
 microphone-0.149
opus-0.148
 la-0.146
 dancer-0.144
 thumb-0.141
Token balls
Token ID11333
Feature activation+2.48e+00

Pos logit contributions
 microphone+0.052
opus+0.052
 dancer+0.049
 player+0.049
 la+0.049
Neg logit contributions
 Timothy-0.045
 Sue-0.039
 Kitty-0.039
 Karen-0.039
 Samantha-0.037
Token,
Token ID11
Feature activation-7.44e-01

Pos logit contributions
 Timothy+0.143
 about+0.124
 Sue+0.121
 Karen+0.120
 Kitty+0.120
Neg logit contributions
 microphone-0.182
opus-0.179
 la-0.177
 dancer-0.177
 thumb-0.176
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
 Timothy+0.062
 Karen+0.054
 Samantha+0.051
 Su+0.050
 Kitty+0.050
Neg logit contributions
 microphone-0.073
 la-0.070
opus-0.070
 player-0.070
 dancer-0.069
TokenAfter
Token ID3260
Feature activation+2.54e+00

Pos logit contributions
 Timothy+0.041
 Kitty+0.037
 about+0.035
 Karen+0.035
 Sue+0.034
Neg logit contributions
 microphone-0.088
 thumb-0.087
 finger-0.084
 cheek-0.084
 la-0.083
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
 Timothy+0.156
 Karen+0.136
 Samantha+0.129
 Su+0.127
 Kitty+0.127
Neg logit contributions
 microphone-0.180
 la-0.174
opus-0.173
 player-0.172
 dancer-0.171
Token long
Token ID890
Feature activation+1.10e-01

Pos logit contributions
 microphone+0.009
 la+0.008
opus+0.008
 thumb+0.008
 player+0.008
Neg logit contributions
 Timothy-0.010
 Karen-0.009
 Kitty-0.009
 Samantha-0.009
 Su-0.009
Token day
Token ID1110
Feature activation-1.19e+00

Pos logit contributions
 Timothy+0.005
 Kitty+0.005
 Karen+0.004
 Sue+0.004
 Betty+0.004
Neg logit contributions
 microphone-0.009
opus-0.009
 la-0.009
 player-0.009
 dancer-0.009
Token of
Token ID286
Feature activation+5.71e-01

Pos logit contributions
 microphone+0.071
 la+0.068
opus+0.067
 player+0.067
 dancer+0.067
Neg logit contributions
 Timothy-0.086
 Karen-0.077
 Samantha-0.073
 Su-0.072
 Kitty-0.072
Token flying
Token ID7348
Feature activation+5.70e-01

Pos logit contributions
 Timothy+0.044
 Karen+0.039
 Samantha+0.037
 Su+0.037
Kim+0.037
Neg logit contributions
 microphone-0.033
 la-0.030
 brain-0.030
 dancer-0.030
 thumb-0.030
Token,
Token ID11
Feature activation-3.01e-01

Pos logit contributions
 Timothy+0.034
 Sue+0.030
 Kitty+0.030
 Karen+0.030
 about+0.029
Neg logit contributions
opus-0.040
 microphone-0.040
 la-0.039
 thumb-0.039
 rook-0.038
Token Buzz
Token ID15753
Feature activation-1.43e+00

Pos logit contributions
 microphone+0.021
 la+0.020
opus+0.020
 thumb+0.020
ater+0.020
Neg logit contributions
 Timothy-0.019
 Kitty-0.017
 Karen-0.017
 Sue-0.016
 Samantha-0.016
Tokeny
Token ID88
Feature activation-3.04e+00

Pos logit contributions
 microphone+0.056
 la+0.054
opus+0.052
 player+0.052
 rook+0.051
Neg logit contributions
 Timothy-0.125
 Karen-0.115
 Kitty-0.114
 meant-0.114
 Samantha-0.114
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
 microphone+0.203
 la+0.199
opus+0.197
 rook+0.195
 player+0.193
Neg logit contributions
 Timothy-0.174
 Karen-0.159
 Samantha-0.157
 meant-0.155
 about-0.155
Token are
Token ID389
Feature activation-1.78e+00

Pos logit contributions
 meant+0.044
 about+0.042
 Timothy+0.041
 Bill+0.039
 Sue+0.039
Neg logit contributions
opus-0.083
 cheek-0.073
 microphone-0.071
ternally-0.068
ler-0.068
Token happy
Token ID3772
Feature activation-1.08e+00

Pos logit contributions
 microphone+0.097
 la+0.092
 player+0.091
 dancer+0.091
 thumb+0.089
Neg logit contributions
 Timothy-0.141
 Karen-0.126
 Samantha-0.121
 Su-0.119
 Kitty-0.118
Token when
Token ID618
Feature activation-3.60e-01

Pos logit contributions
 microphone+0.075
opus+0.074
 la+0.073
 thumb+0.072
 player+0.071
Neg logit contributions
 Timothy-0.065
 Karen-0.058
 Kitty-0.055
 Su-0.055
 about-0.054
Token they
Token ID484
Feature activation-7.26e-01

Pos logit contributions
 microphone+0.027
 la+0.026
opus+0.026
 player+0.026
 dancer+0.025
Neg logit contributions
 Timothy-0.021
 Karen-0.018
 Samantha-0.017
 Su-0.017
 Kitty-0.017
Token make
Token ID787
Feature activation-7.02e-01

Pos logit contributions
opus+0.049
 microphone+0.047
 cheek+0.046
 thumb+0.046
 player+0.045
Neg logit contributions
 Timothy-0.041
 Karen-0.037
 Su-0.036
 meant-0.036
 about-0.036
Token nice
Token ID3621
Feature activation+2.03e+00

Pos logit contributions
 microphone+0.046
 team+0.043
 guitar+0.042
 player+0.042
 thumb+0.042
Neg logit contributions
 Timothy-0.049
 Karen-0.044
 Samantha-0.041
 Kitty-0.040
Kim-0.040
Token pictures
Token ID5986
Feature activation+2.59e+00

Pos logit contributions
 Timothy+0.136
 Karen+0.118
 Samantha+0.113
 Kitty+0.109
 Su+0.108
Neg logit contributions
 microphone-0.138
 la-0.131
 dancer-0.131
 player-0.130
 team-0.129
Token.
Token ID13
Feature activation+3.83e-01

Pos logit contributions
 Timothy+0.172
 Karen+0.151
 Samantha+0.144
 Su+0.142
 Kitty+0.141
Neg logit contributions
 microphone-0.170
 la-0.163
opus-0.162
 player-0.161
 dancer-0.161
Token
Token ID198
Feature activation+7.46e-01

Pos logit contributions
 Timothy+0.024
 Karen+0.021
 Samantha+0.020
 Su+0.020
 Kitty+0.019
Neg logit contributions
 microphone-0.026
 la-0.026
 player-0.025
opus-0.025
 dancer-0.025
Token
Token ID198
Feature activation+7.79e-01

Pos logit contributions
 Timothy+0.052
 Karen+0.045
 Samantha+0.044
 Su+0.043
 Bill+0.043
Neg logit contributions
 microphone-0.047
 player-0.045
 la-0.045
 dancer-0.045
opus-0.044
TokenOne
Token ID3198
Feature activation-1.61e-01

Pos logit contributions
 Timothy+0.039
 Karen+0.034
 Samantha+0.033
 about+0.032
 Kitty+0.032
Neg logit contributions
 microphone-0.062
 thumb-0.061
 rook-0.060
 brain-0.059
 la-0.059
TokenOnce
Token ID7454
Feature activation-1.79e+00

Pos logit contributions
 player+0.139
 microphone+0.136
 team+0.130
 dancer+0.128
 la+0.125
Neg logit contributions
 Timothy-0.209
 Karen-0.186
 Samantha-0.180
 Su-0.176
 theirs-0.173
Token upon
Token ID2402
Feature activation-3.54e+00

Pos logit contributions
 microphone+0.087
 la+0.083
opus+0.081
 player+0.081
 dancer+0.080
Neg logit contributions
 Timothy-0.149
 Karen-0.135
 Samantha-0.130
 Su-0.129
 Kitty-0.129
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
 microphone+0.255
 la+0.247
 player+0.244
opus+0.244
 dancer+0.243
Neg logit contributions
 Timothy-0.211
 Karen-0.183
 Samantha-0.173
 Su-0.170
 Kitty-0.169
Token time
Token ID640
Feature activation+2.55e+00

Pos logit contributions
 Timothy+0.006
 Karen+0.005
 Samantha+0.005
 Su+0.005
 Kitty+0.005
Neg logit contributions
 microphone-0.011
 la-0.011
 player-0.010
opus-0.010
 dancer-0.010
Token,
Token ID11
Feature activation-3.03e-01

Pos logit contributions
 Timothy+0.162
 Karen+0.142
 Kitty+0.136
 Sue+0.135
 Bill+0.135
Neg logit contributions
 microphone-0.179
opus-0.170
 player-0.169
 dancer-0.169
 thumb-0.169
Token there
Token ID612
Feature activation+1.33e+00

Pos logit contributions
 microphone+0.021
opus+0.021
 la+0.020
 dancer+0.020
 rook+0.020
Neg logit contributions
 Timothy-0.019
 Karen-0.017
 Kitty-0.016
 Sue-0.016
 Samantha-0.016
Token was
Token ID373
Feature activation+1.44e+00

Pos logit contributions
 Timothy+0.091
 Karen+0.080
 Samantha+0.077
 theirs+0.076
Kim+0.075
Neg logit contributions
 player-0.082
 microphone-0.080
 team-0.079
 dancer-0.078
 thumb-0.077
Token a
Token ID257
Feature activation+1.29e-01

Pos logit contributions
 Timothy+0.103
 Karen+0.093
Kim+0.087
 Samantha+0.087
 Su+0.086
Neg logit contributions
 microphone-0.085
 player-0.081
 guitar-0.081
 la-0.080
 team-0.080
Token little
Token ID1310
Feature activation-1.12e+00

Pos logit contributions
 Timothy+0.008
 Kitty+0.008
 Sue+0.008
 Karen+0.007
 Su+0.007
Neg logit contributions
mine-0.008
chie-0.007
 microphone-0.007
acy-0.007
 la-0.007
Token boy
Token ID2933
Feature activation-1.06e-02

Pos logit contributions
acy+0.065
 la+0.063
chie+0.062
mine+0.062
 bishop+0.062
Neg logit contributions
 Timothy-0.073
 Kitty-0.070
 Sue-0.067
 Karen-0.066
 Sally-0.064
Token named
Token ID3706
Feature activation+3.54e-01

Pos logit contributions
opus+0.001
 microphone+0.001
 la+0.001
 dancer+0.001
 rook+0.001
Neg logit contributions
 Timothy-0.001
 Kitty-0.000
 Sue-0.000
 Karen-0.000
 Kim-0.000
Token special
Token ID2041
Feature activation-1.33e-01

Pos logit contributions
 microphone+0.118
 la+0.112
 dancer+0.111
opus+0.111
mine+0.110
Neg logit contributions
 Timothy-0.093
 Karen-0.080
 Sue-0.079
 Kitty-0.079
 Bill-0.076
Token,"
Token ID553
Feature activation-7.87e-01

Pos logit contributions
 microphone+0.009
 la+0.009
opus+0.009
 player+0.008
 dancer+0.008
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Kitty-0.007
 Samantha-0.007
 Bill-0.007
Token she
Token ID673
Feature activation-4.00e-01

Pos logit contributions
 microphone+0.064
opus+0.063
 la+0.062
 rook+0.061
ater+0.061
Neg logit contributions
 Timothy-0.042
 Karen-0.036
 Samantha-0.035
 Su-0.035
 Sue-0.035
Token said
Token ID531
Feature activation+1.32e+00

Pos logit contributions
 microphone+0.035
opus+0.034
 la+0.034
 dancer+0.033
 player+0.033
Neg logit contributions
 Timothy-0.018
 Karen-0.015
 Samantha-0.014
 Su-0.014
 Bill-0.014
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
 Timothy+0.070
 Karen+0.061
 Samantha+0.059
 about+0.059
 Su+0.059
Neg logit contributions
opus-0.108
 la-0.105
 microphone-0.102
 dancer-0.100
 bishop-0.100
Token
Token ID198
Feature activation+5.75e-01

Pos logit contributions
 Timothy+0.022
 Karen+0.020
 Samantha+0.019
 Kitty+0.019
 Su+0.019
Neg logit contributions
 microphone-0.026
opus-0.025
 dancer-0.024
 la-0.024
 thumb-0.024
Token
Token ID198
Feature activation+5.13e-01

Pos logit contributions
 Timothy+0.036
 Karen+0.031
 Samantha+0.030
 Su+0.029
 Kitty+0.029
Neg logit contributions
 microphone-0.039
 la-0.038
opus-0.038
 player-0.038
 dancer-0.037
TokenThe
Token ID464
Feature activation-6.03e-01

Pos logit contributions
 Timothy+0.029
 Karen+0.026
Kim+0.025
 Samantha+0.024
 Kitty+0.024
Neg logit contributions
 microphone-0.040
 thumb-0.038
 brain-0.038
 rook-0.037
 player-0.037
Token rabbit
Token ID22746
Feature activation-3.30e-01

Pos logit contributions
 la+0.032
 microphone+0.031
opus+0.031
 player+0.029
mine+0.029
Neg logit contributions
 Timothy-0.046
 Karen-0.041
 Kitty-0.040
 Bill-0.039
 Betty-0.039
Token hopped
Token ID45230
Feature activation-7.28e-01

Pos logit contributions
 microphone+0.024
opus+0.023
 la+0.023
 dancer+0.022
 rook+0.022
Neg logit contributions
 Timothy-0.019
 Karen-0.017
 Kitty-0.016
 Bill-0.016
 Samantha-0.016
Token away
Token ID1497
Feature activation-1.19e+00

Pos logit contributions
 microphone+0.049
opus+0.048
 la+0.048
 thumb+0.047
 dancer+0.047
Neg logit contributions
 Timothy-0.048
 Karen-0.042
 Samantha-0.039
 Su-0.039
 Kitty-0.039
Token was
Token ID373
Feature activation+1.31e+00

Pos logit contributions
 Timothy+0.046
 Karen+0.040
 Kitty+0.038
 Samantha+0.038
 Su+0.038
Neg logit contributions
 microphone-0.052
 la-0.051
opus-0.050
 dancer-0.049
 player-0.049
Token a
Token ID257
Feature activation-1.10e-01

Pos logit contributions
 Timothy+0.107
 Karen+0.098
 Samantha+0.094
 Su+0.092
Kim+0.091
Neg logit contributions
 microphone-0.066
 thumb-0.062
 player-0.061
 brain-0.060
 guitar-0.058
Token big
Token ID1263
Feature activation-4.62e-02

Pos logit contributions
mine+0.009
 la+0.008
acy+0.008
opus+0.008
per+0.008
Neg logit contributions
 Timothy-0.005
 fun-0.005
 Kitty-0.004
 Sue-0.004
 Karen-0.004
Token ocean
Token ID9151
Feature activation-3.04e+00

Pos logit contributions
acy+0.003
mine+0.003
 la+0.003
opus+0.003
chie+0.003
Neg logit contributions
 Timothy-0.003
 Bill-0.002
 Sally-0.002
 Sue-0.002
 about-0.002
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
 la+0.216
 microphone+0.210
 dancer+0.202
opus+0.202
 thumb+0.197
Neg logit contributions
 Timothy-0.183
 about-0.157
 Karen-0.156
 Kitty-0.153
 Sue-0.152
Token so
Token ID523
Feature activation+2.57e-01

Pos logit contributions
 microphone+0.025
 la+0.024
opus+0.024
 player+0.024
 dancer+0.023
Neg logit contributions
 Timothy-0.027
 Karen-0.024
 Samantha-0.023
 Su-0.022
 Kitty-0.022
Token it
Token ID340
Feature activation+4.16e-01

Pos logit contributions
 Timothy+0.016
 Karen+0.014
 Kitty+0.013
 Samantha+0.013
 Su+0.013
Neg logit contributions
 microphone-0.018
 la-0.018
opus-0.018
 player-0.017
 dancer-0.017
Token was
Token ID373
Feature activation+1.27e+00

Pos logit contributions
 Timothy+0.025
 Karen+0.021
 meant+0.021
 Kitty+0.021
 Su+0.021
Neg logit contributions
 microphone-0.029
 la-0.029
opus-0.028
 dancer-0.028
 rook-0.027
Token hard
Token ID1327
Feature activation+8.10e-01

Pos logit contributions
 Timothy+0.108
 Karen+0.099
 Samantha+0.095
Kim+0.092
 Su+0.092
Neg logit contributions
 microphone-0.060
 thumb-0.056
 player-0.055
 brain-0.055
 la-0.054
Token to
Token ID284
Feature activation-5.98e-03

Pos logit contributions
 Timothy+0.048
 Karen+0.042
 Samantha+0.040
 Kitty+0.039
 Su+0.039
Neg logit contributions
 microphone-0.058
 la-0.056
opus-0.056
 dancer-0.055
 player-0.055
Token know
Token ID760
Feature activation+1.83e+00

Pos logit contributions
 microphone+0.000
opus+0.000
 la+0.000
 player+0.000
 dancer+0.000
Neg logit contributions
 Timothy-0.000
 Karen-0.000
 Bill-0.000
 Kitty-0.000
 Su-0.000
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 microphone+0.038
 la+0.037
opus+0.037
 dancer+0.036
 player+0.036
Neg logit contributions
 Timothy-0.037
 Karen-0.032
 Samantha-0.030
 Su-0.030
 Kitty-0.030
Token Mama
Token ID40084
Feature activation+1.83e+00

Pos logit contributions
 Timothy+0.019
 Karen+0.016
 Samantha+0.016
 Bill+0.015
 Su+0.015
Neg logit contributions
 microphone-0.022
 la-0.021
opus-0.021
 dancer-0.021
 thumb-0.020
Token,
Token ID11
Feature activation-5.93e-01

Pos logit contributions
 Timothy+0.113
 Karen+0.101
 Su+0.095
 Samantha+0.095
 Kitty+0.094
Neg logit contributions
 microphone-0.127
 la-0.123
opus-0.122
 brain-0.121
 dancer-0.119
Token Papa
Token ID42328
Feature activation-1.25e+00

Pos logit contributions
opus+0.037
 microphone+0.037
 la+0.035
 brain+0.034
 player+0.034
Neg logit contributions
 Timothy-0.043
 Karen-0.039
 Samantha-0.037
 Su-0.037
 Bill-0.037
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
 microphone+0.088
 la+0.086
opus+0.085
 brain+0.085
 dancer+0.084
Neg logit contributions
 Timothy-0.077
 Karen-0.068
 Samantha-0.065
 Bill-0.064
 Su-0.064
Token even
Token ID772
Feature activation-1.57e-01

Pos logit contributions
 Timothy+0.033
 Karen+0.029
 Samantha+0.029
 Bill+0.028
 Su+0.028
Neg logit contributions
opus-0.034
 microphone-0.034
 la-0.034
 player-0.032
 dancer-0.032
Token Grand
Token ID5675
Feature activation+9.84e-01

Pos logit contributions
 microphone+0.011
opus+0.011
 la+0.011
 thumb+0.010
 dancer+0.010
Neg logit contributions
 Timothy-0.009
 Karen-0.008
 Su-0.008
 Samantha-0.008
 Kitty-0.008
Tokenma
Token ID2611
Feature activation-4.97e-01

Pos logit contributions
 Timothy+0.059
 Karen+0.052
 Samantha+0.049
 Kitty+0.048
 Su+0.048
Neg logit contributions
 microphone-0.071
 la-0.069
opus-0.068
 player-0.068
 dancer-0.067
Token were
Token ID547
Feature activation+1.35e-01

Pos logit contributions
 microphone+0.035
 la+0.034
opus+0.034
 dancer+0.034
 player+0.033
Neg logit contributions
 Timothy-0.030
 Karen-0.026
 Samantha-0.025
 Su-0.025
 Bill-0.025
Token very
Token ID845
Feature activation-1.75e+00

Pos logit contributions
 Timothy+0.009
 Karen+0.008
 Samantha+0.008
 Su+0.007
 Bill+0.007
Neg logit contributions
opus-0.009
 la-0.009
 microphone-0.009
 player-0.009
 thumb-0.008
Token impressed
Token ID12617
Feature activation-3.73e-01

Pos logit contributions
 la+0.118
 microphone+0.118
opus+0.116
 thumb+0.113
 player+0.111
Neg logit contributions
 Timothy-0.111
 Karen-0.098
 Su-0.096
 Samantha-0.096
 Bill-0.092
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
 Timothy+0.025
 Karen+0.022
 Samantha+0.021
 Su+0.020
 Kitty+0.020
Neg logit contributions
 microphone-0.025
 la-0.024
 dancer-0.024
 player-0.024
 thumb-0.023
Token door
Token ID3420
Feature activation+1.23e-01

Pos logit contributions
 la+0.011
opus+0.011
 dancer+0.011
fish+0.010
mine+0.010
Neg logit contributions
 Timothy-0.019
 Karen-0.017
 Sue-0.017
 Kitty-0.017
 Bill-0.017
Token and
Token ID290
Feature activation-9.78e-02

Pos logit contributions
 Timothy+0.009
 Karen+0.008
 Samantha+0.007
 Su+0.007
 Kitty+0.007
Neg logit contributions
 microphone-0.007
 la-0.007
 player-0.007
 dancer-0.007
opus-0.007
Token pushed
Token ID7121
Feature activation-2.05e-01

Pos logit contributions
 microphone+0.006
 la+0.006
opus+0.006
 player+0.006
 dancer+0.006
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Samantha-0.005
 Su-0.005
 Kitty-0.005
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
 microphone+0.012
 player+0.011
 la+0.011
 dancer+0.011
 thumb+0.011
Neg logit contributions
 Timothy-0.015
 Karen-0.013
 Samantha-0.013
Kim-0.012
 resolved-0.012
Token box
Token ID3091
Feature activation-2.03e+00

Pos logit contributions
 la+0.012
opus+0.011
 dancer+0.011
acy+0.011
mine+0.011
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Kitty-0.016
Kim-0.016
 Sue-0.016
Token inside
Token ID2641
Feature activation+1.97e+00

Pos logit contributions
 microphone+0.125
 player+0.120
 la+0.118
 dancer+0.118
 thumb+0.117
Neg logit contributions
 Timothy-0.144
 Karen-0.127
 Samantha-0.122
 Su-0.119
 Kitty-0.118
Token.
Token ID13
Feature activation-5.47e-01

Pos logit contributions
 Timothy+0.132
 Karen+0.116
 Samantha+0.110
 Su+0.108
 Kitty+0.108
Neg logit contributions
 microphone-0.128
 la-0.123
 player-0.122
 dancer-0.122
opus-0.122
Token She
Token ID1375
Feature activation-1.08e-01

Pos logit contributions
 microphone+0.030
 la+0.029
 player+0.029
 dancer+0.029
 brain+0.028
Neg logit contributions
 Timothy-0.042
 Karen-0.037
 Samantha-0.035
 Su-0.034
 Kitty-0.034
Token saw
Token ID2497
Feature activation-1.10e+00

Pos logit contributions
 microphone+0.007
 la+0.007
opus+0.007
 dancer+0.007
 thumb+0.007
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Su-0.006
 Samantha-0.006
 Kitty-0.006
Token her
Token ID607
Feature activation-7.25e-01

Pos logit contributions
 microphone+0.070
 player+0.065
 dancer+0.065
 la+0.063
 thumb+0.063
Neg logit contributions
 Timothy-0.078
 Karen-0.067
 Samantha-0.065
Kim-0.062
 Su-0.062
Token"
Token ID1
Feature activation-2.21e+00

Pos logit contributions
 microphone+0.052
 thumb+0.050
opus+0.049
 dancer+0.049
 brain+0.048
Neg logit contributions
 Timothy-0.038
 Karen-0.033
Kim-0.031
 Kitty-0.031
 Samantha-0.031
TokenI
Token ID40
Feature activation-2.08e+00

Pos logit contributions
 microphone+0.145
 la+0.138
opus+0.138
 thumb+0.138
 brain+0.137
Neg logit contributions
 Timothy-0.149
 Karen-0.133
 Samantha-0.127
 Kitty-0.124
 Su-0.123
Token know
Token ID760
Feature activation+1.57e+00

Pos logit contributions
 microphone+0.132
opus+0.129
 la+0.125
 player+0.123
 dancer+0.123
Neg logit contributions
 Timothy-0.144
 Karen-0.128
 Samantha-0.122
 Su-0.122
 Kitty-0.119
Token you
Token ID345
Feature activation-2.32e+00

Pos logit contributions
 Timothy+0.106
 Karen+0.094
 Samantha+0.089
 Kitty+0.089
 Su+0.088
Neg logit contributions
 microphone-0.101
opus-0.097
 la-0.097
 player-0.097
 dancer-0.096
Token wanted
Token ID2227
Feature activation+7.18e-01

Pos logit contributions
 microphone+0.148
opus+0.144
 la+0.140
 dancer+0.140
 player+0.139
Neg logit contributions
 Timothy-0.156
 Karen-0.139
 Bill-0.133
 Samantha-0.132
 Su-0.130
Token to
Token ID284
Feature activation+3.81e-01

Pos logit contributions
 Timothy+0.043
 Karen+0.038
 Samantha+0.036
 Kitty+0.035
 Su+0.035
Neg logit contributions
 microphone-0.051
 la-0.050
opus-0.050
 player-0.049
 dancer-0.049
Token have
Token ID423
Feature activation+5.69e-01

Pos logit contributions
 Timothy+0.027
 Karen+0.024
 Samantha+0.023
 Bill+0.023
 Kitty+0.022
Neg logit contributions
 microphone-0.024
opus-0.023
 la-0.023
 player-0.023
 dancer-0.022
Token fun
Token ID1257
Feature activation+4.49e+00

Pos logit contributions
 Timothy+0.036
 Karen+0.031
Kim+0.029
 Su+0.029
 Samantha+0.029
Neg logit contributions
 microphone-0.039
 la-0.037
 thumb-0.036
 guitar-0.036
 brain-0.036
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
 Timothy+0.269
 Karen+0.235
 about+0.231
 Kitty+0.221
 Bill+0.219
Neg logit contributions
opus-0.317
 microphone-0.306
 la-0.302
 dancer-0.297
 player-0.297
Token but
Token ID475
Feature activation+1.00e+00

Pos logit contributions
 microphone+0.023
opus+0.022
 la+0.022
 player+0.021
 thumb+0.021
Neg logit contributions
 Timothy-0.020
 Karen-0.018
 Samantha-0.017
 Kitty-0.017
 Sue-0.017
Token you
Token ID345
Feature activation-1.84e+00

Pos logit contributions
 Timothy+0.086
 Karen+0.078
 Kitty+0.076
 Samantha+0.075
 Bill+0.075
Neg logit contributions
 microphone-0.047
opus-0.045
 la-0.044
 player-0.043
 dancer-0.042
Token.
Token ID13
Feature activation-7.29e-01

Pos logit contributions
 Timothy+0.036
 Karen+0.031
 Samantha+0.029
 about+0.029
 Bill+0.029
Neg logit contributions
opus-0.050
 microphone-0.049
 la-0.049
 dancer-0.048
 player-0.047
Token She
Token ID1375
Feature activation-9.47e-02

Pos logit contributions
 microphone+0.048
opus+0.047
 la+0.046
 dancer+0.046
 thumb+0.046
Neg logit contributions
 Timothy-0.048
 Karen-0.043
 Kitty-0.041
 Samantha-0.041
 Bill-0.041
Token was
Token ID373
Feature activation+4.07e-01

Pos logit contributions
 microphone+0.006
 la+0.006
opus+0.006
 dancer+0.006
 thumb+0.006
Neg logit contributions
 Timothy-0.006
 Karen-0.005
 Samantha-0.005
 Kitty-0.005
 Su-0.005
Token crying
Token ID13774
Feature activation-1.94e+00

Pos logit contributions
 Timothy+0.032
 Karen+0.029
 Samantha+0.027
 Su+0.027
 Bill+0.027
Neg logit contributions
 microphone-0.022
 la-0.021
 player-0.021
 dancer-0.021
 thumb-0.020
Token.
Token ID13
Feature activation-7.59e-01

Pos logit contributions
 microphone+0.131
 la+0.128
opus+0.127
 dancer+0.125
 player+0.125
Neg logit contributions
 Timothy-0.125
 Karen-0.109
 Samantha-0.104
 Kitty-0.103
 Su-0.102
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 microphone+0.046
 la+0.045
 player+0.044
 dancer+0.044
opus+0.044
Neg logit contributions
 Timothy-0.053
 Karen-0.046
 Samantha-0.044
 Su-0.043
 Kitty-0.043
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
 microphone+0.012
opus+0.011
 la+0.011
 player+0.011
 team+0.011
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Samantha-0.016
 theirs-0.015
 Su-0.015
Token"
Token ID1
Feature activation-1.74e+00

Pos logit contributions
 microphone+0.013
 la+0.013
opus+0.013
 player+0.013
 dancer+0.013
Neg logit contributions
 Timothy-0.017
 Karen-0.016
 Samantha-0.015
 Kitty-0.015
 Su-0.015
TokenToo
Token ID23307
Feature activation+5.41e-01

Pos logit contributions
 microphone+0.112
 la+0.107
opus+0.106
 thumb+0.105
 player+0.104
Neg logit contributions
 Timothy-0.119
 Karen-0.106
 Samantha-0.102
 Kitty-0.100
 Su-0.099
Token bad
Token ID2089
Feature activation+2.16e+00

Pos logit contributions
 Timothy+0.037
 Karen+0.034
 Su+0.032
 Samantha+0.032
 Bill+0.030
Neg logit contributions
 microphone-0.034
 brain-0.033
 player-0.032
 la-0.032
 thumb-0.032
Token!"
Token ID2474
Feature activation-2.32e+00

Pos logit contributions
 Timothy+0.157
 Karen+0.140
 Samantha+0.134
 Su+0.133
 Kitty+0.131
Neg logit contributions
 microphone-0.128
 la-0.124
 player-0.122
opus-0.122
 dancer-0.121
Token are
Token ID389
Feature activation-2.12e+00

Pos logit contributions
 microphone+0.103
 la+0.099
opus+0.099
 dancer+0.098
 player+0.097
Neg logit contributions
 Timothy-0.115
 Karen-0.102
 Samantha-0.097
 Kitty-0.096
 Su-0.096
Token a
Token ID257
Feature activation-8.62e-01

Pos logit contributions
 microphone+0.136
 player+0.130
 la+0.129
 dancer+0.128
 thumb+0.127
Neg logit contributions
 Timothy-0.147
 Karen-0.130
 Samantha-0.124
 Su-0.123
 Bill-0.120
Token great
Token ID1049
Feature activation+2.45e+00

Pos logit contributions
mine+0.074
opus+0.073
 la+0.073
 microphone+0.071
 bishop+0.071
Neg logit contributions
 Timothy-0.030
 fun-0.025
 Karen-0.023
 Kitty-0.023
 Samantha-0.023
Token cook
Token ID4255
Feature activation-1.98e+00

Pos logit contributions
 Timothy+0.177
 Su+0.157
 Bill+0.155
 about+0.154
 Karen+0.154
Neg logit contributions
 microphone-0.129
nam-0.128
 la-0.128
opus-0.120
chie-0.119
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
 microphone+0.142
 la+0.142
opus+0.140
 thumb+0.139
nam+0.135
Neg logit contributions
 Timothy-0.107
 about-0.095
 Sue-0.094
 Karen-0.091
 Kitty-0.089
Token I
Token ID314
Feature activation-3.45e+00

Pos logit contributions
 microphone+0.034
 la+0.033
 player+0.032
opus+0.032
 dancer+0.032
Neg logit contributions
 Timothy-0.046
 Karen-0.042
 Samantha-0.040
 Su-0.039
 Kitty-0.039
Token am
Token ID716
Feature activation-3.90e-01

Pos logit contributions
 microphone+0.205
 player+0.201
 la+0.198
 dancer+0.195
 team+0.195
Neg logit contributions
 Timothy-0.254
 Karen-0.226
 Samantha-0.216
 Bill-0.211
 Kitty-0.209
Token very
Token ID845
Feature activation-2.01e+00

Pos logit contributions
 microphone+0.024
opus+0.023
 la+0.023
 player+0.023
 dancer+0.022
Neg logit contributions
 Timothy-0.028
 Karen-0.025
 Samantha-0.024
 Su-0.023
 Kitty-0.023
Token proud
Token ID6613
Feature activation-2.39e+00

Pos logit contributions
 microphone+0.156
opus+0.150
 rook+0.150
 thumb+0.148
 la+0.147
Neg logit contributions
 Timothy-0.102
 Sue-0.093
 about-0.092
 Su-0.091
 Karen-0.088
Token of
Token ID286
Feature activation+3.70e-03

Pos logit contributions
 microphone+0.185
opus+0.182
 la+0.178
 thumb+0.177
 rook+0.176
Neg logit contributions
 about-0.119
 Timothy-0.115
 Sue-0.106
 Kitty-0.102
 Karen-0.101
Token you
Token ID345
Feature activation-2.49e+00

Pos logit contributions
 Timothy+0.000
 Kitty+0.000
 Sue+0.000
 Karen+0.000
 Samantha+0.000
Neg logit contributions
 microphone-0.000
opus-0.000
 thumb-0.000
 bishop-0.000
 rook-0.000
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
 microphone+0.012
 player+0.012
 team+0.012
opus+0.012
 la+0.012
Neg logit contributions
 Timothy-0.017
 Karen-0.015
 Samantha-0.015
 Su-0.014
 Bill-0.014
TokenTom
Token ID13787
Feature activation+9.95e-01

Pos logit contributions
 player+0.015
 la+0.015
 microphone+0.015
opus+0.015
 team+0.015
Neg logit contributions
 Timothy-0.013
 Karen-0.012
 Samantha-0.011
 Su-0.011
 Kitty-0.011
Token and
Token ID290
Feature activation-2.63e-01

Pos logit contributions
 Timothy+0.055
 Karen+0.048
 Kitty+0.046
 Bill+0.046
 Samantha+0.046
Neg logit contributions
 microphone-0.074
opus-0.070
 la-0.069
 dancer-0.068
 brain-0.068
Token Mia
Token ID32189
Feature activation-6.02e-01

Pos logit contributions
 microphone+0.022
opus+0.022
 thumb+0.021
 guitar+0.021
 la+0.021
Neg logit contributions
 Timothy-0.012
 Sue-0.011
 Karen-0.011
 Kitty-0.011
 Su-0.011
Token waved
Token ID26834
Feature activation-7.19e-01

Pos logit contributions
opus+0.046
 microphone+0.046
 rook+0.043
 brain+0.043
 bishop+0.043
Neg logit contributions
 Timothy-0.030
 Karen-0.028
 Su-0.027
 Samantha-0.027
 Bill-0.027
Token goodbye
Token ID24829
Feature activation-3.97e+00

Pos logit contributions
opus+0.047
 microphone+0.045
 la+0.043
 thumb+0.043
 rook+0.042
Neg logit contributions
 Timothy-0.048
 Karen-0.044
 about-0.043
 Samantha-0.042
 Kitty-0.042
Token to
Token ID284
Feature activation-4.52e-01

Pos logit contributions
 microphone+0.275
 la+0.267
opus+0.266
 dancer+0.263
 player+0.263
Neg logit contributions
 Timothy-0.248
 Karen-0.218
 Samantha-0.206
 Kitty-0.204
 Su-0.203
Token the
Token ID262
Feature activation-2.20e-01

Pos logit contributions
 microphone+0.030
 la+0.028
 player+0.028
 dancer+0.028
 team+0.028
Neg logit contributions
 Timothy-0.030
 Karen-0.026
 Samantha-0.025
 Su-0.025
Kim-0.025
Token guard
Token ID4860
Feature activation-1.12e+00

Pos logit contributions
 microphone+0.013
 la+0.012
opus+0.012
 thumb+0.012
 guitar+0.012
Neg logit contributions
 Timothy-0.016
 Karen-0.015
 Sue-0.014
 Kitty-0.014
 Bill-0.014
Token and
Token ID290
Feature activation-2.68e-01

Pos logit contributions
opus+0.089
 la+0.086
 microphone+0.086
 dancer+0.084
 thumb+0.084
Neg logit contributions
 Timothy-0.052
 about-0.051
 Karen-0.047
 Kitty-0.046
 Sue-0.046
Token ran
Token ID4966
Feature activation-5.72e-02

Pos logit contributions
 microphone+0.018
 player+0.017
 la+0.017
 dancer+0.017
 team+0.017
Neg logit contributions
 Timothy-0.018
 Karen-0.016
 Samantha-0.015
 Su-0.015
 Kitty-0.014
Token her
Token ID607
Feature activation-5.15e-02

Pos logit contributions
 microphone+0.009
 la+0.009
opus+0.008
 player+0.008
 dancer+0.008
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Samantha-0.007
 Su-0.007
 Kitty-0.007
Token eyes
Token ID2951
Feature activation+3.65e-01

Pos logit contributions
opus+0.004
 la+0.003
fish+0.003
mine+0.003
 dancer+0.003
Neg logit contributions
 Timothy-0.003
 about-0.003
 Kitty-0.003
 Karen-0.003
 Kim-0.003
Token and
Token ID290
Feature activation+7.30e-01

Pos logit contributions
 Timothy+0.022
 Karen+0.020
 Samantha+0.019
 Kitty+0.019
 Su+0.019
Neg logit contributions
 microphone-0.025
opus-0.025
 la-0.025
 dancer-0.024
 player-0.024
Token soon
Token ID2582
Feature activation+5.56e-01

Pos logit contributions
 Timothy+0.049
 Karen+0.044
 Samantha+0.042
 Kitty+0.041
 Su+0.041
Neg logit contributions
 microphone-0.047
opus-0.046
 la-0.045
 player-0.044
 dancer-0.044
Token fell
Token ID3214
Feature activation+5.85e-01

Pos logit contributions
 Timothy+0.042
 Karen+0.037
 Samantha+0.035
Kim+0.035
 Su+0.035
Neg logit contributions
 microphone-0.031
 dancer-0.031
 la-0.030
 player-0.030
 brain-0.029
Token asleep
Token ID16039
Feature activation-3.28e+00

Pos logit contributions
 Timothy+0.053
 Karen+0.049
 Samantha+0.047
 Kitty+0.047
 Su+0.047
Neg logit contributions
 microphone-0.023
opus-0.022
 la-0.022
 player-0.022
 dancer-0.021
Token.
Token ID13
Feature activation+6.26e-01

Pos logit contributions
opus+0.245
 la+0.234
 microphone+0.232
 rook+0.229
 bishop+0.229
Neg logit contributions
 Timothy-0.182
 Karen-0.165
 about-0.156
 Kitty-0.155
 Su-0.151
Token In
Token ID554
Feature activation+1.97e+00

Pos logit contributions
 Timothy+0.041
 Karen+0.036
 Samantha+0.034
 Su+0.033
 Bill+0.033
Neg logit contributions
 microphone-0.042
 la-0.041
 player-0.040
 dancer-0.040
opus-0.039
Token the
Token ID262
Feature activation+5.41e-01

Pos logit contributions
 Timothy+0.110
 Karen+0.098
 Samantha+0.092
 Kitty+0.092
 Su+0.090
Neg logit contributions
opus-0.150
 la-0.146
 microphone-0.145
 team-0.141
 player-0.141
Token morning
Token ID3329
Feature activation-1.78e+00

Pos logit contributions
 Timothy+0.029
 Karen+0.026
 Kitty+0.025
 Samantha+0.025
 about+0.024
Neg logit contributions
opus-0.039
 microphone-0.039
 la-0.039
 rook-0.038
fish-0.038
Token,
Token ID11
Feature activation-2.82e-01

Pos logit contributions
 microphone+0.103
 la+0.099
 dancer+0.098
 player+0.097
opus+0.097
Neg logit contributions
 Timothy-0.132
 Karen-0.118
 Samantha-0.113
 Su-0.111
 Bill-0.111
Token box
Token ID3091
Feature activation-1.71e+00

Pos logit contributions
opus+0.001
 la+0.001
 microphone+0.001
 bishop+0.001
acy+0.001
Neg logit contributions
 Timothy-0.001
 Karen-0.001
 Sue-0.001
 Su-0.001
 Kitty-0.001
Token and
Token ID290
Feature activation+9.40e-01

Pos logit contributions
 la+0.123
opus+0.121
 microphone+0.120
 dancer+0.118
 brain+0.115
Neg logit contributions
 Timothy-0.101
 Karen-0.088
 about-0.086
 Bill-0.085
 Su-0.083
Token found
Token ID1043
Feature activation+2.89e-01

Pos logit contributions
 Timothy+0.060
 Karen+0.053
 Samantha+0.050
 Kitty+0.049
 Su+0.049
Neg logit contributions
 microphone-0.063
 player-0.061
 la-0.060
 thumb-0.060
 dancer-0.060
Token an
Token ID281
Feature activation-1.05e-01

Pos logit contributions
 Timothy+0.020
 Karen+0.018
 Samantha+0.017
Kim+0.017
 Su+0.016
Neg logit contributions
 microphone-0.018
 player-0.017
 thumb-0.017
 finger-0.016
 spoon-0.016
Token old
Token ID1468
Feature activation+1.81e+00

Pos logit contributions
 microphone+0.007
 thumb+0.006
 player+0.006
 dancer+0.006
 brain+0.006
Neg logit contributions
 Timothy-0.007
 Karen-0.006
 Samantha-0.006
 Kitty-0.006
 Su-0.006
Token bar
Token ID2318
Feature activation-3.28e+00

Pos logit contributions
 Timothy+0.145
 Karen+0.130
 Samantha+0.125
 Su+0.125
 Kitty+0.124
Neg logit contributions
 la-0.094
opus-0.090
 microphone-0.085
acy-0.083
 dancer-0.083
Token of
Token ID286
Feature activation+6.28e-01

Pos logit contributions
 la+0.196
 microphone+0.188
 thumb+0.175
 tut+0.174
 brain+0.172
Neg logit contributions
 Timothy-0.226
 about-0.205
 Karen-0.202
 Bill-0.199
 Su-0.197
Token soap
Token ID19533
Feature activation+1.57e-01

Pos logit contributions
 Timothy+0.044
 Karen+0.039
 Samantha+0.037
 Su+0.037
 Kitty+0.037
Neg logit contributions
opus-0.038
 la-0.038
 microphone-0.038
 dancer-0.037
 player-0.036
Token.
Token ID13
Feature activation+7.50e-01

Pos logit contributions
 Timothy+0.009
 about+0.007
 Karen+0.007
 Su+0.007
 meant+0.007
Neg logit contributions
 la-0.012
opus-0.011
 microphone-0.011
 dancer-0.011
 brain-0.011
Token Together
Token ID17083
Feature activation+8.05e-01

Pos logit contributions
 Timothy+0.036
 Karen+0.031
 Samantha+0.028
 Kitty+0.028
 Su+0.027
Neg logit contributions
 microphone-0.060
 player-0.059
 la-0.059
opus-0.059
 thumb-0.058
Token they
Token ID484
Feature activation-1.83e-01

Pos logit contributions
 Timothy+0.063
 Karen+0.057
Kim+0.057
ained+0.057
 supposed+0.056
Neg logit contributions
 team-0.039
 microphone-0.039
 player-0.037
 thumb-0.036
 spoon-0.035
Token day
Token ID1110
Feature activation-9.80e-01

Pos logit contributions
 Timothy+0.218
 Karen+0.193
 Samantha+0.183
 Su+0.181
 Kitty+0.180
Neg logit contributions
 microphone-0.220
 la-0.216
opus-0.214
 thumb-0.210
 player-0.208
Token ever
Token ID1683
Feature activation-2.43e+00

Pos logit contributions
 microphone+0.071
 brain+0.069
opus+0.069
 thumb+0.069
 rook+0.068
Neg logit contributions
 Timothy-0.058
 Karen-0.052
 Su-0.050
 Samantha-0.049
 Bill-0.049
Token!
Token ID0
Feature activation+1.72e-01

Pos logit contributions
 microphone+0.161
 la+0.157
 player+0.155
 dancer+0.154
opus+0.154
Neg logit contributions
 Timothy-0.160
 Karen-0.140
 Samantha-0.133
 Kitty-0.130
 Su-0.130
Token<|endoftext|>
Token ID50256
Feature activation-2.78e+00

Pos logit contributions
 Timothy+0.011
 Karen+0.010
 Samantha+0.009
 Su+0.009
 Kitty+0.009
Neg logit contributions
 microphone-0.011
 la-0.011
 player-0.011
 dancer-0.011
opus-0.011
TokenOnce
Token ID7454
Feature activation-1.68e+00

Pos logit contributions
 microphone+0.178
 player+0.172
 la+0.169
 dancer+0.168
opus+0.168
Neg logit contributions
 Timothy-0.189
 Karen-0.165
 Samantha-0.158
 Su-0.154
 Kitty-0.152
Token upon
Token ID2402
Feature activation-3.29e+00

Pos logit contributions
 microphone+0.094
 la+0.092
opus+0.089
 rook+0.089
 player+0.088
Neg logit contributions
 Timothy-0.127
 Karen-0.116
 Sue-0.111
 Su-0.111
 Kitty-0.111
Token a
Token ID257
Feature activation+1.97e-01

Pos logit contributions
 microphone+0.241
 la+0.234
opus+0.232
 player+0.231
 dancer+0.230
Neg logit contributions
 Timothy-0.192
 Karen-0.166
 Samantha-0.157
 Su-0.154
 Kitty-0.154
Token time
Token ID640
Feature activation+2.89e+00

Pos logit contributions
 Timothy+0.009
 Karen+0.007
 Samantha+0.007
 Kitty+0.007
 Su+0.006
Neg logit contributions
 microphone-0.017
opus-0.017
 la-0.017
 dancer-0.016
 player-0.016
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
 Timothy+0.180
 Karen+0.158
 Kitty+0.152
 Sue+0.150
 Bill+0.149
Neg logit contributions
 microphone-0.207
 thumb-0.197
 player-0.197
opus-0.196
 dancer-0.196
Token there
Token ID612
Feature activation+1.28e+00

Pos logit contributions
 microphone+0.009
opus+0.009
 la+0.008
 rook+0.008
 dancer+0.008
Neg logit contributions
 Timothy-0.008
 Karen-0.007
 Kitty-0.007
 Sue-0.007
 Su-0.007
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
 Timothy+0.093
 Karen+0.082
 Samantha+0.080
 theirs+0.079
 Su+0.076
Neg logit contributions
 player-0.074
 team-0.071
 microphone-0.071
 dancer-0.070
 thumb-0.069