TOP ACTIVATIONS
MAX = 0.077

and replied with a mis chie vous gl int in his
The cat grinned mis chie v ously . " Never
With a mis chie vous gl int in her
your parents and not take unnecessary risks . <|endoftext|> One day
be prepared and not take unnecessary risks . <|endoftext|> Once upon
hot sun and felt very thirsty . Suddenly , she saw
cat loved to explore and roam around the neighbourhood . One
and the sun flower felt thirsty . A
Ros ie nodded and smiled as she ran away . She
elephant liked to explore and roam the plains . One day
too , and she smiled as she listened to the bird
elephant and he liked to roam around the forests and watch
pink , and she grinned as it took shape .
more cautious and not take unnecessary risks . The memory of
<|endoftext|> Peter was a mis chie vous 3 year old who
Sus ie smiled as she looked around the beach
M ia smiled as she slid all the way
The little boy smiled mis chie v ously and said ,
the phone . Sara grinned as her daddy passed the phone
Mike smiled as he ate the muff in

MIN ACTIVATIONS
MIN = -0.090

want . All of these perf umes smell so good .
hopped closer to the story te ller and the crowd that
stirred the bowl together under Grand ma 's careful watch .
The day finally arrived when grandma came to visit . Tim
When L ola finished , she looked up and
wanted to peek inside her Grand ma 's room . Grand
When Lily finished her salad , she noticed
. When mom finished her work , she went
. When Sue finished , she focused her attention
. When Lucy finished cleaning the penny , she
. When Sue finished eating , she went outside
When Lucy finished eating , she put her
the story . When Mia finished reading , Dad asked her
the bell . When her grandma came in , she gave
When Tim my finished his sandwich , he said
. When Lily finished painting , she looked at
When her dad finished , the girl thanked him
ed out from under the t arp . They saw the
blew fiercely outside . Inside Grand ma 's house , Doug
sailed for days , exploring unch arted waters until he came

INTERVAL 0.035 - 0.077
CONTAINS 1.668%

the magic pen and felt guilty for causing so much damage
three !" they shouted , as they counted out loud .
" Wow !" said Andy .
â t make one happy â
rub it ." " Me too !" Lily said

INTERVAL -0.007 - 0.035
CONTAINS 63.726%

it . But you have to give it back to me
One day the jet started flying lower and lower . It
hopped into the forest to find some food . Suddenly ,
looked around and saw a bull . The bull was very
, " I 'm glad you 're having a good time

INTERVAL -0.048 - -0.007
CONTAINS 34.393%

!" After the exam , Tim felt very happy
were very happy . One day , Abby was
They have fun . They are patient . They know that
her explor ations . When Lisa got there ,
attack . John was so amazed that he cl

INTERVAL -0.090 - -0.048
CONTAINS 0.213%

welcome , Lily . I love you . You are the
mom and dad , who were sitting on a blanket .
she looked out at the beautiful view . She was so
She said , " What are you doing here ?"
and the papers with the cr ay on . He smiled
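
The four INTERVAL blocks above look like equal-width bins spanning MIN to MAX, and their CONTAINS shares appear to sum to 100%. The sketch below checks both readings. It assumes four equal-width bins, which is inferred from the printed numbers rather than stated by the dashboard. The helper name `equal_width_edges` and the variable names are hypothetical. The printed edges are rounded to three decimals, so the comparison uses a tolerance of 0.001.

```python
# Values copied from the dashboard text above (already rounded to 3 decimals).
act_min, act_max = -0.090, 0.077
displayed_edges = [-0.090, -0.048, -0.007, 0.035, 0.077]
interval_share = {
    (0.035, 0.077): 1.668,
    (-0.007, 0.035): 63.726,
    (-0.048, -0.007): 34.393,
    (-0.090, -0.048): 0.213,
}


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Assumption: the dashboard uses 4 equal-width bins between MIN and MAX.
edges = equal_width_edges(act_min, act_max, 4)

# Largest gap between a computed edge and the edge printed in the dashboard.
max_edge_error = max(abs(e - d) for e, d in zip(edges, displayed_edges))

# Total share of tokens covered by the four intervals, in percent.
total_share = sum(interval_share.values())

print("computed edges:", [round(e, 5) for e in edges])
print("max deviation from displayed edges:", round(max_edge_error, 5))
print("total share (%):", round(total_share, 3))
```

If the deviation stays within the rounding tolerance and the shares total 100%, the equal-width reading is consistent with the numbers shown. Under that reading, only about 1.7% of tokens fall in the top activation bin.
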
Token and
Token ID290
Feature activation+2.32e-02

Pos logit contributions
omm+0.041
igsaw+0.033
+0.033
stand+0.032
ifying+0.030
Neg logit contributions
 feat-0.028
 observ-0.026
 cautious-0.025
 intelligent-0.025
 courageous-0.025
Token replied
Token ID8712
Feature activation+1.45e-02

Pos logit contributions
 feat+0.029
 vines+0.029
 dangers+0.028
cut+0.028
 observ+0.027
Neg logit contributions
omm-0.041
stand-0.033
-0.031
bell-0.031
igsaw-0.031
Token with
Token ID351
Feature activation-6.30e-03

Pos logit contributions
 feat+0.027
 observ+0.025
 agile+0.025
 cautious+0.025
 vines+0.025
Neg logit contributions
omm-0.020
igsaw-0.014
stand-0.014
-0.013
ifying-0.012
Token a
Token ID257
Feature activation+1.42e-02

Pos logit contributions
omm+0.013
stand+0.011
+0.011
igsaw+0.010
work+0.010
Neg logit contributions
 feat-0.007
 observ-0.007
 courageous-0.007
 intelligent-0.007
 wond-0.007
Token mis
Token ID2984
Feature activation+2.63e-02

Pos logit contributions
 feat+0.016
ts+0.016
 vines+0.016
 observ+0.016
cut+0.016
Neg logit contributions
omm-0.027
igsaw-0.022
stand-0.022
-0.022
bell-0.020
Tokenchie
Token ID3043
Feature activation+7.69e-02

Pos logit contributions
 feat+0.039
 vines+0.036
 agile+0.036
 wond+0.036
 intelligent+0.035
Neg logit contributions
omm-0.045
igsaw-0.035
stand-0.033
-0.032
work-0.031
Tokenvous
Token ID31222
Feature activation+3.48e-02

Pos logit contributions
 wounds+0.100
 feat+0.087
 unnecessary+0.085
ts+0.084
 instincts+0.082
Neg logit contributions
omm-0.128
igsaw-0.111
work-0.110
bell-0.108
stand-0.107
Token gl
Token ID1278
Feature activation-4.22e-03

Pos logit contributions
 feat+0.051
 observ+0.049
 agile+0.048
 cautious+0.047
 vines+0.047
Neg logit contributions
omm-0.060
igsaw-0.046
stand-0.045
-0.044
bell-0.041
Tokenint
Token ID600
Feature activation+2.30e-02

Pos logit contributions
omm+0.008
igsaw+0.006
stand+0.006
+0.006
work+0.005
Neg logit contributions
 feat-0.006
 observ-0.006
 agile-0.006
 cautious-0.006
 vines-0.006
Token in
Token ID287
Feature activation-9.90e-03

Pos logit contributions
 feat+0.031
 observ+0.030
 wond+0.030
 agile+0.029
 intelligent+0.029
Neg logit contributions
omm-0.041
ifying-0.032
work-0.032
stand-0.031
bell-0.031
Token his
Token ID465
Feature activation+8.22e-03

Pos logit contributions
omm+0.021
stand+0.018
igsaw+0.018
+0.017
ifying+0.017
Neg logit contributions
 feat-0.010
 observ-0.010
 cautious-0.009
 agile-0.009
 courageous-0.009
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
omm+0.007
stand+0.006
+0.005
igsaw+0.005
 Today+0.004
Neg logit contributions
 feat-0.010
 redd-0.010
 agile-0.009
 observ-0.009
 courageous-0.009
TokenThe
Token ID464
Feature activation+2.49e-03

Pos logit contributions
omm+0.024
stand+0.020
+0.020
igsaw+0.020
ifying+0.019
Neg logit contributions
 feat-0.010
 observ-0.009
 wond-0.009
 redd-0.009
 agile-0.009
Token cat
Token ID3797
Feature activation-2.93e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 agile+0.003
 wond+0.003
 redd+0.003
Neg logit contributions
omm-0.005
-0.004
stand-0.003
igsaw-0.003
bell-0.003
Token grinned
Token ID36268
Feature activation-2.54e-02

Pos logit contributions
omm+0.005
stand+0.004
+0.004
bell+0.004
work+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 redd-0.004
 intelligent-0.004
 agile-0.004
Token mis
Token ID2984
Feature activation+2.59e-02

Pos logit contributions
omm+0.049
igsaw+0.039
stand+0.039
+0.038
ifying+0.036
Neg logit contributions
 feat-0.034
 observ-0.031
 cautious-0.031
 intelligent-0.030
 courageous-0.030
Tokenchie
Token ID3043
Feature activation+7.41e-02

Pos logit contributions
 feat+0.040
 wond+0.038
 biggest+0.037
 intelligent+0.037
 agile+0.037
Neg logit contributions
omm-0.042
igsaw-0.031
stand-0.030
work-0.029
ifying-0.029
Tokenv
Token ID85
Feature activation-2.45e-03

Pos logit contributions
 feat+0.080
 wounds+0.076
ts+0.072
 redd+0.072
 wond+0.071
Neg logit contributions
omm-0.147
work-0.119
igsaw-0.117
stand-0.116
bell-0.116
Tokenously
Token ID3481
Feature activation+1.30e-02

Pos logit contributions
omm+0.004
igsaw+0.003
stand+0.003
+0.003
ifying+0.003
Neg logit contributions
 feat-0.003
 observ-0.003
 agile-0.003
 cautious-0.003
 courageous-0.003
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
 feat+0.019
 observ+0.018
 cautious+0.018
 courageous+0.018
 agile+0.018
Neg logit contributions
omm-0.022
igsaw-0.017
stand-0.017
-0.017
ifying-0.016
Token "
Token ID366
Feature activation+2.11e-02

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.010
+0.010
ifying+0.009
Neg logit contributions
 feat-0.011
 observ-0.010
 cautious-0.010
 courageous-0.010
 agile-0.010
TokenNever
Token ID12295
Feature activation+6.70e-03

Pos logit contributions
 feat+0.025
 observ+0.022
 intelligent+0.022
 vines+0.022
 sturdy+0.022
Neg logit contributions
omm-0.042
-0.034
stand-0.034
ifying-0.033
igsaw-0.033
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
omm+0.022
stand+0.016
+0.016
igsaw+0.015
bell+0.014
Neg logit contributions
 feat-0.025
 agile-0.024
 courageous-0.023
 observ-0.023
 cautious-0.023
Token
Token ID198
Feature activation-1.68e-02

Pos logit contributions
omm+0.018
stand+0.014
igsaw+0.013
+0.013
bell+0.013
Neg logit contributions
 feat-0.021
 agile-0.020
 observ-0.020
 cautious-0.019
cut-0.019
TokenWith
Token ID3152
Feature activation-1.31e-02

Pos logit contributions
omm+0.031
stand+0.024
igsaw+0.024
+0.023
bell+0.023
Neg logit contributions
 feat-0.023
 observ-0.021
 cautious-0.021
 agile-0.021
 courageous-0.021
Token a
Token ID257
Feature activation+1.90e-02

Pos logit contributions
omm+0.029
stand+0.023
igsaw+0.023
+0.023
ifying+0.022
Neg logit contributions
 feat-0.014
 observ-0.013
 intelligent-0.013
 courageous-0.013
 strong-0.013
Token mis
Token ID2984
Feature activation+2.88e-02

Pos logit contributions
 feat+0.019
 observ+0.017
 agile+0.017
 cautious+0.016
 vines+0.016
Neg logit contributions
omm-0.041
stand-0.034
igsaw-0.034
-0.034
ifying-0.032
Tokenchie
Token ID3043
Feature activation+7.37e-02

Pos logit contributions
 feat+0.038
 observ+0.036
 cautious+0.035
 courageous+0.035
 intelligent+0.035
Neg logit contributions
omm-0.055
stand-0.043
igsaw-0.043
-0.042
ifying-0.040
Tokenvous
Token ID31222
Feature activation+4.10e-02

Pos logit contributions
 wounds+0.088
 feat+0.083
 unnecessary+0.076
 redd+0.075
ts+0.074
Neg logit contributions
omm-0.130
bell-0.114
work-0.112
stand-0.112
igsaw-0.108
Token gl
Token ID1278
Feature activation-3.53e-03

Pos logit contributions
 feat+0.053
 observ+0.049
 cautious+0.049
 courageous+0.048
 intelligent+0.048
Neg logit contributions
omm-0.080
stand-0.063
igsaw-0.062
-0.062
ifying-0.060
Tokenint
Token ID600
Feature activation+2.82e-02

Pos logit contributions
omm+0.007
igsaw+0.005
stand+0.005
+0.005
work+0.005
Neg logit contributions
 feat-0.005
 observ-0.004
 agile-0.004
 cautious-0.004
 courageous-0.004
Token in
Token ID287
Feature activation-1.07e-02

Pos logit contributions
 feat+0.039
 wond+0.038
 observ+0.037
 agile+0.037
 courageous+0.036
Neg logit contributions
omm-0.049
work-0.039
bell-0.038
igsaw-0.037
stand-0.037
Token her
Token ID607
Feature activation+5.01e-03

Pos logit contributions
omm+0.019
igsaw+0.016
stand+0.016
bell+0.015
+0.015
Neg logit contributions
 feat-0.014
 observ-0.014
cut-0.014
 cautious-0.013
 agile-0.013
Token your
Token ID534
Feature activation+1.31e-02

Pos logit contributions
 feat+0.054
cut+0.051
 agile+0.050
 cautious+0.050
 redd+0.049
Neg logit contributions
omm-0.068
stand-0.054
igsaw-0.052
work-0.047
ifying-0.046
Token parents
Token ID3397
Feature activation+8.23e-03

Pos logit contributions
 feat+0.018
 observ+0.017
 cautious+0.017
 courageous+0.017
 intelligent+0.017
Neg logit contributions
omm-0.024
igsaw-0.019
stand-0.019
-0.018
ifying-0.018
Token and
Token ID290
Feature activation+1.72e-03

Pos logit contributions
 feat+0.010
 cautious+0.010
 observ+0.010
 intelligent+0.010
 courageous+0.010
Neg logit contributions
omm-0.016
igsaw-0.013
-0.012
stand-0.012
ifying-0.012
Token not
Token ID407
Feature activation+3.75e-03

Pos logit contributions
 feat+0.002
 observ+0.002
 agile+0.002
 courageous+0.002
 cautious+0.002
Neg logit contributions
omm-0.003
stand-0.002
igsaw-0.002
-0.002
work-0.002
Token take
Token ID1011
Feature activation+2.12e-02

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
 intelligent+0.004
 courageous+0.004
Neg logit contributions
omm-0.008
igsaw-0.006
-0.006
stand-0.006
ifying-0.006
Token unnecessary
Token ID13114
Feature activation+7.32e-02

Pos logit contributions
 feat+0.022
 observ+0.020
 cautious+0.019
 courageous+0.019
 agile+0.019
Neg logit contributions
omm-0.047
igsaw-0.038
stand-0.038
-0.037
ifying-0.036
Token risks
Token ID7476
Feature activation+4.91e-02

Pos logit contributions
 feat+0.079
 redd+0.074
cut+0.074
 observ+0.073
 intelligent+0.070
Neg logit contributions
omm-0.140
stand-0.126
igsaw-0.123
-0.116
ifying-0.114
Token.
Token ID13
Feature activation+3.41e-03

Pos logit contributions
 feat+0.068
 observ+0.065
 cautious+0.062
 courageous+0.062
 agile+0.062
Neg logit contributions
omm-0.086
stand-0.071
igsaw-0.069
-0.067
ifying-0.065
Token<|endoftext|>
Token ID50256
Feature activation+3.35e-03

Pos logit contributions
 feat+0.002
 intelligent+0.002
 cautious+0.002
 observ+0.002
 wise+0.002
Neg logit contributions
omm-0.008
ifying-0.007
work-0.007
igsaw-0.007
bell-0.007
TokenOne
Token ID3198
Feature activation+2.24e-02

Pos logit contributions
 feat+0.005
 observ+0.005
 cautious+0.005
 agile+0.005
 courageous+0.005
Neg logit contributions
omm-0.006
stand-0.005
-0.005
igsaw-0.004
ifying-0.004
Token day
Token ID1110
Feature activation-1.89e-02

Pos logit contributions
 feat+0.032
 observ+0.031
cut+0.030
 cautious+0.030
 agile+0.030
Neg logit contributions
omm-0.039
igsaw-0.031
stand-0.031
-0.029
bell-0.028
Token be
Token ID307
Feature activation+2.45e-02

Pos logit contributions
 feat+0.017
 cautious+0.015
 observ+0.015
 careful+0.015
 brave+0.015
Neg logit contributions
omm-0.032
igsaw-0.026
ifying-0.026
stand-0.025
-0.025
Token prepared
Token ID5597
Feature activation+8.30e-03

Pos logit contributions
 redd+0.022
 feat+0.022
 vines+0.022
rated+0.020
 agile+0.020
Neg logit contributions
omm-0.045
work-0.039
stand-0.039
 ago-0.037
ifying-0.037
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 feat+0.012
 observ+0.011
 agile+0.011
 redd+0.011
 cautious+0.011
Neg logit contributions
omm-0.014
stand-0.011
igsaw-0.011
work-0.010
ifying-0.010
Token not
Token ID407
Feature activation+3.74e-03

Pos logit contributions
 feat+0.018
 observ+0.017
 agile+0.017
 cautious+0.016
 vines+0.016
Neg logit contributions
omm-0.025
stand-0.020
igsaw-0.020
-0.019
work-0.019
Token take
Token ID1011
Feature activation+2.31e-02

Pos logit contributions
 feat+0.005
 observ+0.005
 agile+0.005
 cautious+0.005
 courageous+0.005
Neg logit contributions
omm-0.006
stand-0.005
igsaw-0.005
-0.005
ifying-0.005
Token unnecessary
Token ID13114
Feature activation+7.31e-02

Pos logit contributions
 feat+0.025
cut+0.024
 observ+0.023
 agile+0.023
 cautious+0.023
Neg logit contributions
omm-0.046
stand-0.039
igsaw-0.038
ifying-0.037
-0.036
Token risks
Token ID7476
Feature activation+5.18e-02

Pos logit contributions
 feat+0.072
cut+0.072
 redd+0.070
 observ+0.068
 intelligent+0.066
Neg logit contributions
omm-0.136
stand-0.127
igsaw-0.125
ifying-0.120
-0.116
Token.
Token ID13
Feature activation+2.87e-03

Pos logit contributions
 feat+0.073
 observ+0.069
 cautious+0.066
 intelligent+0.065
 courageous+0.065
Neg logit contributions
omm-0.090
stand-0.074
igsaw-0.072
-0.070
ifying-0.069
Token<|endoftext|>
Token ID50256
Feature activation-8.77e-04

Pos logit contributions
 feat+0.003
 cautious+0.003
 observ+0.003
 intelligent+0.003
 courageous+0.003
Neg logit contributions
omm-0.006
igsaw-0.005
bell-0.005
stand-0.005
ifying-0.005
TokenOnce
Token ID7454
Feature activation-1.22e-02

Pos logit contributions
omm+0.002
+0.001
stand+0.001
igsaw+0.001
ifying+0.001
Neg logit contributions
 feat-0.001
 observ-0.001
 cautious-0.001
 agile-0.001
 courageous-0.001
Token upon
Token ID2402
Feature activation+1.23e-02

Pos logit contributions
omm+0.022
igsaw+0.017
stand+0.016
+0.016
bell+0.015
Neg logit contributions
 feat-0.018
 observ-0.017
 cautious-0.017
 agile-0.017
 courageous-0.017
Token hot
Token ID3024
Feature activation-1.34e-02

Pos logit contributions
 tall+0.023
 biggest+0.022
 feat+0.022
 graceful+0.022
 strong+0.022
Neg logit contributions
omm-0.063
-0.052
igsaw-0.052
work-0.051
etti-0.051
Token sun
Token ID4252
Feature activation-2.51e-02

Pos logit contributions
omm+0.028
igsaw+0.023
+0.022
ifying+0.022
work+0.022
Neg logit contributions
 feat-0.015
 strong-0.014
 intelligent-0.014
 tall-0.014
 brave-0.014
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
omm+0.059
igsaw+0.053
+0.045
ifying+0.045
bell+0.044
Neg logit contributions
 feat-0.023
ling-0.020
 biggest-0.020
 strong-0.019
cut-0.019
Token felt
Token ID2936
Feature activation+2.33e-02

Pos logit contributions
 feat+0.020
cut+0.018
 agile+0.018
 observ+0.018
 courageous+0.018
Neg logit contributions
omm-0.027
stand-0.023
-0.021
work-0.021
bell-0.021
Token very
Token ID845
Feature activation+4.11e-02

Pos logit contributions
 feat+0.032
 vines+0.028
cut+0.028
 agile+0.027
 observ+0.027
Neg logit contributions
omm-0.042
stand-0.034
ifying-0.033
-0.032
igsaw-0.031
Token thirsty
Token ID47124
Feature activation+7.29e-02

Pos logit contributions
 feat+0.056
 vines+0.052
cut+0.051
 agile+0.048
 observ+0.047
Neg logit contributions
omm-0.075
stand-0.057
-0.056
igsaw-0.055
ifying-0.055
Token.
Token ID13
Feature activation-7.24e-03

Pos logit contributions
 feat+0.062
 strong+0.061
 intelligent+0.059
 brave+0.058
 cautious+0.057
Neg logit contributions
omm-0.177
igsaw-0.147
stand-0.139
-0.138
ifying-0.135
Token Suddenly
Token ID24975
Feature activation+1.13e-02

Pos logit contributions
omm+0.011
igsaw+0.009
work+0.009
ifying+0.009
stand+0.008
Neg logit contributions
 feat-0.011
 intelligent-0.011
 observ-0.011
 cautious-0.010
 biggest-0.010
Token,
Token ID11
Feature activation+3.99e-03

Pos logit contributions
 feat+0.015
 brave+0.014
 strong+0.014
 intelligent+0.014
 clever+0.014
Neg logit contributions
omm-0.021
igsaw-0.017
-0.016
work-0.016
ifying-0.016
Token she
Token ID673
Feature activation-2.83e-03

Pos logit contributions
 feat+0.005
 observ+0.005
 intelligent+0.004
 courageous+0.004
 cautious+0.004
Neg logit contributions
omm-0.008
igsaw-0.006
stand-0.006
-0.006
ifying-0.006
Token saw
Token ID2497
Feature activation+3.06e-02

Pos logit contributions
omm+0.007
stand+0.006
igsaw+0.006
+0.006
ifying+0.005
Neg logit contributions
 feat-0.002
 observ-0.002
 cautious-0.002
 courageous-0.002
 agile-0.002
Token cat
Token ID3797
Feature activation-5.42e-03

Pos logit contributions
omm+0.010
stand+0.008
+0.008
igsaw+0.008
ifying+0.007
Neg logit contributions
 feat-0.005
 observ-0.005
 cautious-0.005
 agile-0.005
 courageous-0.005
Token loved
Token ID6151
Feature activation-2.95e-02

Pos logit contributions
omm+0.011
stand+0.009
+0.008
igsaw+0.008
bell+0.008
Neg logit contributions
 feat-0.006
 observ-0.006
 agile-0.006
 courageous-0.006
 unnecessary-0.006
Token to
Token ID284
Feature activation+1.27e-02

Pos logit contributions
omm+0.049
stand+0.038
igsaw+0.036
+0.035
work+0.035
Neg logit contributions
 feat-0.043
 courageous-0.041
 agile-0.040
 cautious-0.040
cut-0.040
Token explore
Token ID7301
Feature activation+3.41e-02

Pos logit contributions
 feat+0.018
 observ+0.017
 cautious+0.017
 courageous+0.017
 agile+0.016
Neg logit contributions
omm-0.022
stand-0.018
-0.017
igsaw-0.017
bell-0.016
Token and
Token ID290
Feature activation+9.04e-03

Pos logit contributions
 agile+0.062
 redd+0.062
cut+0.061
 cautious+0.060
 observ+0.059
Neg logit contributions
:-0.030
 Wed-0.026
 today-0.023
,-0.023
 TV-0.022
Token roam
Token ID35563
Feature activation+7.27e-02

Pos logit contributions
 feat+0.011
 observ+0.010
 agile+0.010
 cautious+0.010
 courageous+0.010
Neg logit contributions
omm-0.016
stand-0.014
-0.013
igsaw-0.013
work-0.013
Token around
Token ID1088
Feature activation-8.32e-03

Pos logit contributions
 feat+0.116
 instincts+0.111
 wounds+0.110
 redd+0.107
 observ+0.106
Neg logit contributions
|-0.074
work-0.069
stand-0.068
 ago-0.067
 They-0.066
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
omm+0.013
work+0.011
stand+0.011
+0.011
 ago+0.011
Neg logit contributions
 redd-0.011
 feat-0.011
 observ-0.011
 agile-0.011
 cautious-0.011
Token neighbourhood
Token ID19495
Feature activation+5.16e-03

Pos logit contributions
 feat+0.021
 agile+0.019
 observ+0.019
 courageous+0.019
cut+0.019
Neg logit contributions
omm-0.026
stand-0.021
igsaw-0.020
-0.019
ifying-0.019
Token.
Token ID13
Feature activation-4.88e-03

Pos logit contributions
 feat+0.007
 observ+0.007
 agile+0.007
 courageous+0.007
 redd+0.007
Neg logit contributions
omm-0.009
stand-0.007
work-0.007
-0.007
igsaw-0.007
Token One
Token ID1881
Feature activation+2.55e-02

Pos logit contributions
omm+0.009
igsaw+0.007
stand+0.007
work+0.006
+0.006
Neg logit contributions
 feat-0.007
 observ-0.007
 cautious-0.006
 intelligent-0.006
 agile-0.006
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
omm+0.004
igsaw+0.003
stand+0.003
+0.003
acey+0.003
Neg logit contributions
 feat-0.002
 strong-0.002
 vines-0.002
 intelligent-0.002
 tall-0.002
Token the
Token ID262
Feature activation+2.31e-02

Pos logit contributions
 feat+0.018
 observ+0.017
 cautious+0.017
 courageous+0.016
cut+0.016
Neg logit contributions
omm-0.029
stand-0.023
-0.023
ifying-0.022
work-0.022
Token sun
Token ID4252
Feature activation-1.07e-02

Pos logit contributions
 unnecessary+0.050
 agile+0.050
vered+0.048
 observ+0.048
 courageous+0.048
Neg logit contributions
omm-0.020
work-0.010
-0.010
stand-0.010
 frost-0.009
Tokenflower
Token ID25547
Feature activation+2.70e-02

Pos logit contributions
omm+0.018
+0.013
bell+0.013
stand+0.013
work+0.013
Neg logit contributions
 observ-0.017
 unnecessary-0.016
 courageous-0.016
 feat-0.016
 agile-0.016
Token felt
Token ID2936
Feature activation+1.87e-02

Pos logit contributions
 unnecessary+0.047
 observ+0.047
 feat+0.045
 agile+0.043
 courageous+0.043
Neg logit contributions
omm-0.035
bell-0.029
-0.028
stand-0.027
work-0.025
Token thirsty
Token ID47124
Feature activation+7.24e-02

Pos logit contributions
 feat+0.029
 vines+0.025
 observ+0.025
cut+0.025
 agile+0.024
Neg logit contributions
omm-0.031
stand-0.024
ifying-0.024
-0.022
bell-0.021
Token.
Token ID13
Feature activation-4.80e-03

Pos logit contributions
 feat+0.068
 strong+0.066
 intelligent+0.063
 brave+0.062
 cautious+0.061
Neg logit contributions
omm-0.170
igsaw-0.140
stand-0.132
-0.131
ifying-0.129
Token
Token ID220
Feature activation-7.60e-03

Pos logit contributions
omm+0.008
stand+0.006
+0.006
igsaw+0.006
ifying+0.005
Neg logit contributions
 feat-0.008
 observ-0.007
 courageous-0.007
 cautious-0.007
 agile-0.007
Token
Token ID198
Feature activation-7.31e-03

Pos logit contributions
omm+0.010
stand+0.007
+0.007
 Today+0.006
igsaw+0.006
Neg logit contributions
 feat-0.015
 agile-0.015
 courageous-0.015
 observ-0.015
 cautious-0.014
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
omm+0.009
stand+0.007
+0.006
 Today+0.006
igsaw+0.006
Neg logit contributions
 feat-0.014
 agile-0.014
 observ-0.014
 courageous-0.014
 redd-0.013
TokenA
Token ID32
Feature activation-1.44e-02

Pos logit contributions
omm+0.029
stand+0.024
igsaw+0.024
+0.023
ifying+0.023
Neg logit contributions
 feat-0.014
 observ-0.013
 courageous-0.012
 cautious-0.012
 agile-0.012
TokenRos
Token ID35740
Feature activation+8.57e-04

Pos logit contributions
omm+0.003
stand+0.002
igsaw+0.002
+0.002
bell+0.002
Neg logit contributions
 feat-0.003
 agile-0.003
 redd-0.003
 courageous-0.003
 vines-0.003
Tokenie
Token ID494
Feature activation+1.77e-02

Pos logit contributions
 feat+0.001
 observ+0.001
 cautious+0.001
 courageous+0.001
 agile+0.001
Neg logit contributions
omm-0.002
igsaw-0.001
stand-0.001
-0.001
ifying-0.001
Token nodded
Token ID14464
Feature activation-8.62e-03

Pos logit contributions
 feat+0.022
 observ+0.021
 agile+0.021
 courageous+0.020
 wond+0.020
Neg logit contributions
omm-0.034
stand-0.026
-0.026
igsaw-0.026
work-0.025
Token and
Token ID290
Feature activation+2.05e-02

Pos logit contributions
omm+0.016
igsaw+0.012
stand+0.012
+0.012
ifying+0.012
Neg logit contributions
 feat-0.012
 observ-0.011
 cautious-0.011
 courageous-0.011
 intelligent-0.011
Token smiled
Token ID13541
Feature activation-5.97e-03

Pos logit contributions
 feat+0.027
 observ+0.026
 agile+0.026
 intelligent+0.025
 cautious+0.025
Neg logit contributions
omm-0.037
stand-0.029
-0.028
igsaw-0.028
work-0.027
Token as
Token ID355
Feature activation+7.24e-02

Pos logit contributions
omm+0.011
igsaw+0.009
stand+0.009
+0.009
ifying+0.008
Neg logit contributions
 feat-0.008
 observ-0.007
 cautious-0.007
 agile-0.007
 courageous-0.007
Token she
Token ID673
Feature activation+1.08e-03

Pos logit contributions
 feat+0.118
 cautious+0.117
 observ+0.114
 vines+0.114
cut+0.113
Neg logit contributions
omm-0.106
igsaw-0.080
stand-0.079
bell-0.075
-0.074
Token ran
Token ID4966
Feature activation+1.88e-02

Pos logit contributions
cut+0.001
 feat+0.001
 observ+0.001
 unnecessary+0.001
 biggest+0.001
Neg logit contributions
omm-0.002
-0.002
stand-0.001
bell-0.001
igsaw-0.001
Token away
Token ID1497
Feature activation-1.26e-02

Pos logit contributions
 strong+0.018
 free+0.018
 feat+0.016
 brave+0.016
 independent+0.016
Neg logit contributions
omm-0.045
-0.034
ifying-0.034
work-0.034
igsaw-0.033
Token.
Token ID13
Feature activation-4.22e-04

Pos logit contributions
omm+0.021
stand+0.017
igsaw+0.017
+0.017
work+0.016
Neg logit contributions
 feat-0.018
 observ-0.017
 agile-0.017
 redd-0.017
 vines-0.017
Token She
Token ID1375
Feature activation+3.42e-03

Pos logit contributions
omm+0.001
igsaw+0.001
stand+0.001
+0.000
ifying+0.000
Neg logit contributions
 feat-0.001
 observ-0.001
 cautious-0.001
 intelligent-0.001
 courageous-0.001
Token elephant
Token ID20950
Feature activation+7.56e-03

Pos logit contributions
omm+0.002
igsaw+0.002
+0.002
stand+0.002
work+0.002
Neg logit contributions
 feat-0.002
 cautious-0.001
 observ-0.001
 agile-0.001
 courageous-0.001
Token liked
Token ID8288
Feature activation-2.27e-02

Pos logit contributions
 feat+0.010
 observ+0.009
 unnecessary+0.009
 agile+0.009
 courageous+0.009
Neg logit contributions
omm-0.014
stand-0.011
work-0.011
bell-0.011
-0.011
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
omm+0.032
stand+0.025
work+0.023
igsaw+0.022
 Wed+0.022
Neg logit contributions
 feat-0.036
 cautious-0.036
 courageous-0.035
 observ-0.035
 secure-0.035
Token explore
Token ID7301
Feature activation+3.33e-02

Pos logit contributions
 feat+0.016
 observ+0.015
 cautious+0.015
 courageous+0.015
 agile+0.015
Neg logit contributions
omm-0.019
stand-0.015
igsaw-0.015
-0.015
bell-0.014
Token and
Token ID290
Feature activation+9.23e-03

Pos logit contributions
 cautious+0.059
 observ+0.058
 agile+0.058
 unnecessary+0.057
 redd+0.056
Neg logit contributions
:-0.033
 Wed-0.028
 theater-0.028
 TV-0.028
 television-0.028
Token roam
Token ID35563
Feature activation+7.22e-02

Pos logit contributions
 feat+0.012
 agile+0.011
 observ+0.011
 cautious+0.011
 courageous+0.011
Neg logit contributions
omm-0.015
stand-0.013
work-0.013
bell-0.013
igsaw-0.012
Token the
Token ID262
Feature activation+2.36e-02

Pos logit contributions
 feat+0.112
 wounds+0.110
 instincts+0.109
 observ+0.108
 oh+0.108
Neg logit contributions
work-0.081
|-0.076
bell-0.073
stand-0.072
 ago-0.071
Token plains
Token ID36149
Feature activation+2.89e-02

Pos logit contributions
 feat+0.028
 observ+0.026
 cautious+0.026
 agile+0.025
 courageous+0.025
Neg logit contributions
omm-0.047
stand-0.039
igsaw-0.038
-0.037
work-0.036
Token.
Token ID13
Feature activation-7.70e-03

Pos logit contributions
 observ+0.046
 feat+0.045
 wounds+0.043
 unnecessary+0.043
 wond+0.043
Neg logit contributions
omm-0.039
work-0.035
stand-0.034
bell-0.032
 ago-0.031
Token One
Token ID1881
Feature activation+2.67e-02

Pos logit contributions
omm+0.009
stand+0.006
igsaw+0.006
+0.006
ifying+0.005
Neg logit contributions
 feat-0.016
 observ-0.015
 cautious-0.015
 courageous-0.015
 agile-0.015
Token day
Token ID1110
Feature activation-1.01e-02

Pos logit contributions
 feat+0.038
 observ+0.037
 agile+0.036
 cautious+0.035
 wounds+0.035
Neg logit contributions
omm-0.045
igsaw-0.035
stand-0.035
work-0.033
bell-0.033
Token too
Token ID1165
Feature activation+3.77e-02

Pos logit contributions
omm+0.062
igsaw+0.049
stand+0.048
+0.048
ifying+0.046
Neg logit contributions
 feat-0.042
 observ-0.040
 cautious-0.039
 courageous-0.038
 intelligent-0.038
Token,
Token ID11
Feature activation+1.54e-02

Pos logit contributions
 feat+0.046
 intelligent+0.043
 strong+0.042
 brave+0.042
 cautious+0.041
Neg logit contributions
omm-0.080
igsaw-0.064
stand-0.062
ifier-0.061
-0.061
Token and
Token ID290
Feature activation+6.24e-03

Pos logit contributions
 feat+0.023
cut+0.022
 observ+0.022
ts+0.021
 wond+0.021
Neg logit contributions
omm-0.024
stand-0.020
-0.018
igsaw-0.018
bell-0.017
Token she
Token ID673
Feature activation+2.43e-02

Pos logit contributions
cut+0.010
 unnecessary+0.010
 dangers+0.010
 feat+0.010
 vines+0.010
Neg logit contributions
omm-0.008
stand-0.006
bell-0.006
 Av-0.006
-0.006
Token smiled
Token ID13541
Feature activation-6.22e-03

Pos logit contributions
 feat+0.030
 observ+0.029
cut+0.028
 unnecessary+0.028
 cautious+0.028
Neg logit contributions
omm-0.046
stand-0.036
-0.036
bell-0.034
work-0.033
Token as
Token ID355
Feature activation+7.18e-02

Pos logit contributions
omm+0.011
igsaw+0.008
stand+0.008
+0.008
work+0.008
Neg logit contributions
 feat-0.009
 observ-0.009
cut-0.009
 vines-0.009
 agile-0.008
Token she
Token ID673
Feature activation+2.87e-03

Pos logit contributions
cut+0.125
 wounds+0.120
 cautious+0.118
cking+0.118
cod+0.115
Neg logit contributions
omm-0.090
stand-0.072
igsaw-0.065
bell-0.065
 Av-0.064
Token listened
Token ID16399
Feature activation+2.00e-03

Pos logit contributions
 feat+0.004
 observ+0.004
cut+0.004
 redd+0.003
 courageous+0.003
Neg logit contributions
omm-0.005
-0.004
stand-0.004
ifying-0.004
bell-0.004
Token to
Token ID284
Feature activation+3.45e-02

Pos logit contributions
 feat+0.003
 observ+0.003
 agile+0.002
 cautious+0.002
cut+0.002
Neg logit contributions
omm-0.004
stand-0.003
igsaw-0.003
-0.003
ifying-0.003
Token the
Token ID262
Feature activation+6.09e-03

Pos logit contributions
 feat+0.047
 cautious+0.045
cut+0.044
 agile+0.044
 observ+0.044
Neg logit contributions
omm-0.064
stand-0.050
igsaw-0.049
-0.048
work-0.045
Token bird
Token ID6512
Feature activation-5.31e-03

Pos logit contributions
 agile+0.008
 observ+0.007
 unnecessary+0.007
ts+0.007
bel+0.007
Neg logit contributions
omm-0.011
work-0.009
-0.008
bell-0.008
stand-0.008
Token elephant
Token ID20950
Feature activation+1.87e-02

Pos logit contributions
omm+0.054
igsaw+0.044
stand+0.043
+0.043
work+0.042
Neg logit contributions
 feat-0.020
 observ-0.020
 cautious-0.020
 agile-0.020
 courageous-0.019
Token and
Token ID290
Feature activation+1.43e-02

Pos logit contributions
 feat+0.028
 observ+0.027
 redd+0.026
 wond+0.026
 agile+0.026
Neg logit contributions
omm-0.031
stand-0.024
work-0.023
-0.023
igsaw-0.022
Token he
Token ID339
Feature activation+2.48e-02

Pos logit contributions
 feat+0.015
 unnecessary+0.014
 observ+0.014
 cautious+0.014
 courageous+0.014
Neg logit contributions
omm-0.029
stand-0.023
-0.023
work-0.022
bell-0.022
Token liked
Token ID8288
Feature activation-1.57e-02

Pos logit contributions
 feat+0.030
 unnecessary+0.028
 redd+0.027
 observ+0.027
 wounds+0.027
Neg logit contributions
omm-0.047
-0.037
work-0.037
stand-0.037
bell-0.036
Token to
Token ID284
Feature activation+1.70e-02

Pos logit contributions
omm+0.017
 Wed+0.013
stand+0.013
work+0.012
+0.011
Neg logit contributions
rated-0.028
 cautious-0.028
 redd-0.028
 courageous-0.028
wing-0.028
Token roam
Token ID35563
Feature activation+7.15e-02

Pos logit contributions
 observ+0.025
 feat+0.025
 cautious+0.024
 agile+0.024
 courageous+0.024
Neg logit contributions
omm-0.026
stand-0.021
-0.021
igsaw-0.020
bell-0.020
Token around
Token ID1088
Feature activation-9.05e-03

Pos logit contributions
 feat+0.110
 wounds+0.108
 observ+0.108
 unnecessary+0.103
 instincts+0.103
Neg logit contributions
work-0.081
|-0.074
bell-0.073
stand-0.072
 Today-0.072
Token the
Token ID262
Feature activation+1.28e-02

Pos logit contributions
omm+0.016
stand+0.013
work+0.013
+0.013
igsaw+0.013
Neg logit contributions
 observ-0.011
 feat-0.011
 redd-0.011
 cautious-0.011
 agile-0.011
Token forests
Token ID17039
Feature activation+2.40e-02

Pos logit contributions
 feat+0.015
 observ+0.014
 cautious+0.014
 agile+0.014
 courageous+0.014
Neg logit contributions
omm-0.026
igsaw-0.021
stand-0.021
-0.020
ifying-0.019
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 observ+0.035
 feat+0.035
 wond+0.033
 cautious+0.033
 agile+0.033
Neg logit contributions
omm-0.038
stand-0.031
work-0.030
-0.029
igsaw-0.029
Token watch
Token ID2342
Feature activation+9.04e-03

Pos logit contributions
 feat+0.020
 agile+0.019
 observ+0.019
 cautious+0.019
 wounds+0.018
Neg logit contributions
omm-0.023
work-0.022
bell-0.021
stand-0.020
-0.019
Token pink
Token ID11398
Feature activation-1.58e-02

Pos logit contributions
 feat+0.005
 strong+0.004
 vines+0.004
 intelligent+0.004
 observ+0.004
Neg logit contributions
omm-0.015
igsaw-0.012
stand-0.012
-0.012
ifying-0.012
Token,
Token ID11
Feature activation+3.53e-03

Pos logit contributions
omm+0.032
igsaw+0.026
stand+0.025
ifying+0.024
+0.024
Neg logit contributions
 feat-0.019
 vines-0.019
 strong-0.017
 agile-0.017
 intelligent-0.017
Token and
Token ID290
Feature activation+2.76e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 intelligent+0.003
 cautious+0.003
 agile+0.003
Neg logit contributions
omm-0.008
igsaw-0.007
stand-0.007
-0.007
ifying-0.006
Token she
Token ID673
Feature activation+1.68e-02

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 courageous+0.003
 agile+0.003
Neg logit contributions
omm-0.006
stand-0.005
igsaw-0.005
-0.004
ifying-0.004
Token grinned
Token ID36268
Feature activation-1.11e-02

Pos logit contributions
 unnecessary+0.026
 feat+0.025
 prey+0.024
 courageous+0.024
 tranquil+0.024
Neg logit contributions
omm-0.027
bell-0.021
work-0.020
stand-0.019
-0.019
Token as
Token ID355
Feature activation+7.14e-02

Pos logit contributions
omm+0.022
igsaw+0.017
stand+0.017
+0.017
ifying+0.016
Neg logit contributions
 feat-0.014
 observ-0.013
 cautious-0.013
 courageous-0.013
 agile-0.013
Token it
Token ID340
Feature activation-8.27e-03

Pos logit contributions
cut+0.123
 wounds+0.123
 cautious+0.121
 feat+0.119
cking+0.118
Neg logit contributions
omm-0.093
bell-0.069
stand-0.069
igsaw-0.068
-0.064
Token took
Token ID1718
Feature activation+1.56e-02

Pos logit contributions
omm+0.013
bell+0.010
work+0.010
+0.010
stand+0.010
Neg logit contributions
 feat-0.013
 unnecessary-0.012
 observ-0.012
 courageous-0.012
 cautious-0.012
Token shape
Token ID5485
Feature activation-5.33e-03

Pos logit contributions
 feat+0.019
 observ+0.018
 cautious+0.018
 courageous+0.018
 intelligent+0.018
Neg logit contributions
omm-0.031
igsaw-0.024
stand-0.024
-0.024
ifying-0.023
Token.
Token ID13
Feature activation-5.99e-03

Pos logit contributions
omm+0.010
stand+0.008
igsaw+0.008
+0.008
ifying+0.008
Neg logit contributions
 feat-0.007
 observ-0.006
 cautious-0.006
 courageous-0.006
 agile-0.006
Token
Token ID198
Feature activation-8.50e-03

Pos logit contributions
omm+0.010
+0.007
stand+0.007
igsaw+0.007
bell+0.007
Neg logit contributions
 feat-0.009
 agile-0.009
 observ-0.009
 courageous-0.009
 cautious-0.009
Token more
Token ID517
Feature activation+2.56e-02

Pos logit contributions
 feat+0.026
cut+0.025
 gra+0.024
rated+0.024
 vines+0.024
Neg logit contributions
omm-0.043
 too-0.034
 already-0.034
stand-0.033
 School-0.032
Token cautious
Token ID21205
Feature activation+4.91e-02

Pos logit contributions
 gra+0.020
 feat+0.020
 vines+0.020
ts+0.019
cut+0.018
Neg logit contributions
omm-0.052
stand-0.043
igsaw-0.042
arre-0.042
 already-0.041
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
 feat+0.081
 observ+0.076
 redd+0.075
 wond+0.075
 tranquil+0.075
Neg logit contributions
omm-0.073
stand-0.057
work-0.054
igsaw-0.053
-0.050
Token not
Token ID407
Feature activation+4.43e-03

Pos logit contributions
 feat+0.029
 vines+0.027
 wond+0.026
 redd+0.026
 agile+0.026
Neg logit contributions
omm-0.030
stand-0.024
work-0.023
igsaw-0.022
-0.022
Token take
Token ID1011
Feature activation+2.28e-02

Pos logit contributions
 feat+0.006
 observ+0.006
 courageous+0.005
 wond+0.005
 agile+0.005
Neg logit contributions
omm-0.008
stand-0.006
igsaw-0.006
-0.006
work-0.006
Token unnecessary
Token ID13114
Feature activation+7.12e-02

Pos logit contributions
 feat+0.025
cut+0.024
 wond+0.022
 redd+0.022
 observ+0.022
Neg logit contributions
omm-0.045
stand-0.038
igsaw-0.037
ifying-0.035
-0.035
Token risks
Token ID7476
Feature activation+4.98e-02

Pos logit contributions
cut+0.075
 feat+0.074
 redd+0.074
 observ+0.068
 intelligent+0.067
Neg logit contributions
omm-0.126
stand-0.120
igsaw-0.116
ifying-0.110
-0.105
Token.
Token ID13
Feature activation+2.31e-03

Pos logit contributions
 feat+0.075
 observ+0.071
 redd+0.068
 cautious+0.068
 wond+0.067
Neg logit contributions
omm-0.080
stand-0.067
igsaw-0.064
-0.063
work-0.061
Token The
Token ID383
Feature activation+6.81e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 agile+0.003
 courageous+0.003
Neg logit contributions
omm-0.004
igsaw-0.003
stand-0.003
-0.003
ifying-0.003
Token memory
Token ID4088
Feature activation-2.62e-02

Pos logit contributions
 feat+0.013
 observ+0.012
 agile+0.012
 cautious+0.012
 courageous+0.012
Neg logit contributions
omm-0.009
igsaw-0.006
stand-0.006
-0.006
ifying-0.006
Token of
Token ID286
Feature activation-1.73e-02

Pos logit contributions
omm+0.050
stand+0.039
igsaw+0.039
+0.038
work+0.037
Neg logit contributions
 feat-0.034
 observ-0.032
 cautious-0.031
 agile-0.031
 courageous-0.031
Token<|endoftext|>
Token ID50256
Feature activation+6.18e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 courageous+0.002
 intelligent+0.002
Neg logit contributions
omm-0.004
stand-0.003
igsaw-0.003
-0.003
ifying-0.003
TokenPeter
Token ID19727
Feature activation-7.56e-05

Pos logit contributions
 feat+0.009
 observ+0.009
 cautious+0.008
 agile+0.008
 courageous+0.008
Neg logit contributions
omm-0.011
-0.009
stand-0.009
igsaw-0.009
ifying-0.008
Token was
Token ID373
Feature activation-2.39e-02

Pos logit contributions
omm+0.000
stand+0.000
+0.000
igsaw+0.000
bell+0.000
Neg logit contributions
 feat-0.000
 observ-0.000
 agile-0.000
 courageous-0.000
 cautious-0.000
Token a
Token ID257
Feature activation+2.79e-02

Pos logit contributions
omm+0.045
stand+0.034
igsaw+0.034
+0.034
ifying+0.032
Neg logit contributions
 feat-0.032
 observ-0.030
 agile-0.029
 vines-0.029
 cautious-0.029
Token mis
Token ID2984
Feature activation+3.02e-02

Pos logit contributions
 feat+0.024
 observ+0.022
 cautious+0.021
 courageous+0.021
 intelligent+0.021
Neg logit contributions
omm-0.066
igsaw-0.054
stand-0.054
-0.053
ifying-0.051
Tokenchie
Token ID3043
Feature activation+7.11e-02

Pos logit contributions
 feat+0.036
 observ+0.035
 cautious+0.033
 courageous+0.033
 intelligent+0.033
Neg logit contributions
omm-0.063
stand-0.051
-0.050
igsaw-0.049
ifying-0.047
Tokenvous
Token ID31222
Feature activation+4.48e-02

Pos logit contributions
 wounds+0.075
 feat+0.071
 unnecessary+0.068
 vines+0.067
 agile+0.064
Neg logit contributions
omm-0.144
work-0.123
stand-0.123
igsaw-0.122
bell-0.120
Token 3
Token ID513
Feature activation-6.51e-04

Pos logit contributions
 feat+0.041
 observ+0.037
 cautious+0.036
 courageous+0.035
 intelligent+0.035
Neg logit contributions
omm-0.104
stand-0.085
igsaw-0.085
-0.084
ifying-0.081
Token year
Token ID614
Feature activation-2.65e-02

Pos logit contributions
omm+0.001
igsaw+0.001
stand+0.001
+0.001
bell+0.001
Neg logit contributions
 feat-0.001
 agile-0.001
 redd-0.001
 observ-0.001
 tranquil-0.001
Token old
Token ID1468
Feature activation+1.54e-02

Pos logit contributions
omm+0.069
stand+0.058
igsaw+0.058
+0.058
ifying+0.056
Neg logit contributions
 feat-0.016
 observ-0.014
 cautious-0.013
 intelligent-0.013
 courageous-0.013
Token who
Token ID508
Feature activation+1.36e-02

Pos logit contributions
 feat+0.017
 observ+0.016
 agile+0.016
 cautious+0.016
 courageous+0.015
Neg logit contributions
omm-0.032
stand-0.025
igsaw-0.025
-0.025
work-0.025
Token
Token ID198
Feature activation-1.42e-03

Pos logit contributions
omm+0.020
+0.015
stand+0.015
igsaw+0.015
ifying+0.013
Neg logit contributions
 feat-0.020
 observ-0.019
 agile-0.019
 courageous-0.019
 cautious-0.019
Token
Token ID198
Feature activation-7.12e-03

Pos logit contributions
omm+0.002
stand+0.002
+0.001
igsaw+0.001
bell+0.001
Neg logit contributions
 feat-0.003
 agile-0.002
 vines-0.002
 redd-0.002
 unnecessary-0.002
TokenSus
Token ID30746
Feature activation-1.88e-02

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.009
+0.009
bell+0.009
Neg logit contributions
 feat-0.011
 agile-0.010
 vines-0.010
 observ-0.010
 cautious-0.010
Tokenie
Token ID494
Feature activation+1.64e-02

Pos logit contributions
omm+0.037
stand+0.029
igsaw+0.029
+0.027
work+0.027
Neg logit contributions
 feat-0.024
 observ-0.023
 agile-0.022
 cautious-0.022
 courageous-0.022
Token smiled
Token ID13541
Feature activation-1.39e-02

Pos logit contributions
 feat+0.021
 observ+0.020
 agile+0.019
 cautious+0.019
 redd+0.019
Neg logit contributions
omm-0.031
stand-0.024
-0.024
igsaw-0.023
work-0.023
Token as
Token ID355
Feature activation+7.09e-02

Pos logit contributions
omm+0.026
igsaw+0.021
stand+0.021
+0.020
ifying+0.019
Neg logit contributions
 feat-0.019
 observ-0.018
 cautious-0.018
 courageous-0.017
 intelligent-0.017
Token she
Token ID673
Feature activation+6.27e-04

Pos logit contributions
 feat+0.117
cut+0.114
 cautious+0.112
 vines+0.111
 wounds+0.110
Neg logit contributions
omm-0.105
igsaw-0.082
stand-0.079
-0.076
bell-0.076
Token looked
Token ID3114
Feature activation+5.38e-03

Pos logit contributions
 feat+0.001
 observ+0.001
cut+0.001
 intelligent+0.001
 cautious+0.001
Neg logit contributions
omm-0.001
-0.001
stand-0.001
ifying-0.001
bell-0.001
Token around
Token ID1088
Feature activation-1.56e-02

Pos logit contributions
 feat+0.008
cut+0.007
 agile+0.007
 observ+0.007
 vines+0.007
Neg logit contributions
omm-0.008
ifying-0.007
igsaw-0.007
bell-0.007
work-0.007
Token the
Token ID262
Feature activation-4.19e-03

Pos logit contributions
omm+0.027
stand+0.021
igsaw+0.021
+0.021
ifying+0.020
Neg logit contributions
 feat-0.022
 observ-0.021
 agile-0.021
cut-0.020
 cautious-0.020
Token beach
Token ID10481
Feature activation-4.91e-03

Pos logit contributions
omm+0.007
igsaw+0.005
stand+0.005
+0.005
ifying+0.005
Neg logit contributions
 feat-0.006
 observ-0.006
 intelligent-0.006
 cautious-0.006
 courageous-0.006
Token
Token ID198
Feature activation-9.00e-04

Pos logit contributions
omm+0.016
+0.012
stand+0.012
igsaw+0.012
etti+0.010
Neg logit contributions
 feat-0.017
 courageous-0.016
 observ-0.016
 agile-0.016
 cautious-0.016
Token
Token ID198
Feature activation-7.57e-03

Pos logit contributions
omm+0.001
stand+0.001
+0.001
igsaw+0.001
 Today+0.001
Neg logit contributions
 feat-0.002
 agile-0.002
 redd-0.002
 courageous-0.002
 unnecessary-0.002
TokenM
Token ID44
Feature activation-1.22e-02

Pos logit contributions
omm+0.013
stand+0.010
+0.010
igsaw+0.010
ifying+0.009
Neg logit contributions
 feat-0.011
 observ-0.010
 agile-0.010
 courageous-0.010
 sturdy-0.010
Tokenia
Token ID544
Feature activation+7.44e-03

Pos logit contributions
omm+0.024
stand+0.021
ifying+0.020
igsaw+0.020
work+0.020
Neg logit contributions
 feat-0.014
 observ-0.013
 intelligent-0.013
 cautious-0.013
 courageous-0.012
Token smiled
Token ID13541
Feature activation-1.35e-02

Pos logit contributions
 feat+0.010
 observ+0.010
 redd+0.010
 agile+0.009
 unnecessary+0.009
Neg logit contributions
omm-0.013
stand-0.010
-0.010
igsaw-0.010
bell-0.010
Token as
Token ID355
Feature activation+7.08e-02

Pos logit contributions
omm+0.025
igsaw+0.020
stand+0.020
+0.019
ifying+0.018
Neg logit contributions
 feat-0.019
 observ-0.017
 cautious-0.017
 courageous-0.017
 intelligent-0.017
Token she
Token ID673
Feature activation+1.46e-03

Pos logit contributions
 feat+0.124
cut+0.122
 wounds+0.120
 cautious+0.119
 observ+0.118
Neg logit contributions
omm-0.098
igsaw-0.073
stand-0.072
-0.070
bell-0.063
Token slid
Token ID27803
Feature activation-1.05e-02

Pos logit contributions
 feat+0.002
 redd+0.002
 observ+0.002
cut+0.002
 unnecessary+0.002
Neg logit contributions
omm-0.002
-0.002
stand-0.002
bell-0.002
ifying-0.002
Token all
Token ID477
Feature activation-2.45e-03

Pos logit contributions
omm+0.021
stand+0.018
igsaw+0.017
+0.017
ifying+0.016
Neg logit contributions
 observ-0.012
 feat-0.011
 cautious-0.011
 courageous-0.011
 wond-0.011
Token the
Token ID262
Feature activation+9.46e-03

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
+0.004
work+0.004
Neg logit contributions
 feat-0.003
 observ-0.003
 cautious-0.003
 courageous-0.003
 wond-0.002
Token way
Token ID835
Feature activation-1.51e-02

Pos logit contributions
 feat+0.013
 cautious+0.013
 courageous+0.012
 sly+0.012
 observ+0.012
Neg logit contributions
omm-0.013
stand-0.012
igsaw-0.012
-0.011
 Wed-0.010
Token The
Token ID383
Feature activation+1.50e-03

Pos logit contributions
omm+0.007
+0.006
stand+0.006
igsaw+0.006
ifying+0.005
Neg logit contributions
 feat-0.007
 observ-0.006
 agile-0.006
 cautious-0.006
 courageous-0.006
Token little
Token ID1310
Feature activation-3.10e-03

Pos logit contributions
 feat+0.002
 observ+0.002
cking+0.002
 agile+0.001
cest+0.001
Neg logit contributions
omm-0.003
stand-0.003
-0.003
igsaw-0.002
bell-0.002
Token boy
Token ID2933
Feature activation+6.98e-03

Pos logit contributions
omm+0.006
igsaw+0.005
stand+0.005
bell+0.005
work+0.005
Neg logit contributions
 observ-0.003
cking-0.003
 agile-0.003
 wond-0.003
 feat-0.003
Token smiled
Token ID13541
Feature activation-1.46e-02

Pos logit contributions
 feat+0.008
 observ+0.008
 cautious+0.008
 courageous+0.008
 agile+0.008
Neg logit contributions
omm-0.014
igsaw-0.011
stand-0.011
-0.011
ifying-0.010
Token mis
Token ID2984
Feature activation+2.84e-02

Pos logit contributions
omm+0.029
igsaw+0.024
stand+0.023
+0.023
ifying+0.022
Neg logit contributions
 feat-0.019
 cautious-0.017
 intelligent-0.017
 observ-0.017
 courageous-0.017
Tokenchie
Token ID3043
Feature activation+7.07e-02

Pos logit contributions
 feat+0.041
 wond+0.037
 agile+0.037
 intelligent+0.037
 vines+0.037
Neg logit contributions
omm-0.050
igsaw-0.038
stand-0.037
-0.036
work-0.036
Tokenv
Token ID85
Feature activation+2.48e-03

Pos logit contributions
 wounds+0.083
 feat+0.083
 unnecessary+0.079
 agile+0.076
ts+0.076
Neg logit contributions
omm-0.130
work-0.111
igsaw-0.106
bell-0.105
stand-0.101
Tokenously
Token ID3481
Feature activation+1.38e-02

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 agile+0.003
 courageous+0.003
Neg logit contributions
omm-0.005
igsaw-0.004
stand-0.004
-0.003
ifying-0.003
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
 feat+0.020
 observ+0.019
 agile+0.018
 cautious+0.018
 courageous+0.018
Neg logit contributions
omm-0.024
stand-0.019
igsaw-0.019
-0.018
ifying-0.017
Token said
Token ID531
Feature activation+2.42e-02

Pos logit contributions
 feat+0.013
 observ+0.012
 agile+0.012
 vines+0.012
 intelligent+0.012
Neg logit contributions
omm-0.021
stand-0.016
-0.016
igsaw-0.016
work-0.015
Token,
Token ID11
Feature activation-5.20e-03

Pos logit contributions
 feat+0.048
 vines+0.046
 observ+0.045
 agile+0.045
 wond+0.045
Neg logit contributions
omm-0.030
stand-0.019
igsaw-0.019
-0.018
work-0.017
Token the
Token ID262
Feature activation-2.51e-03

Pos logit contributions
omm+0.042
igsaw+0.034
stand+0.034
+0.034
ifying+0.033
Neg logit contributions
 feat-0.018
 observ-0.016
 cautious-0.015
 courageous-0.015
 agile-0.015
Token phone
Token ID3072
Feature activation-6.90e-03

Pos logit contributions
omm+0.003
stand+0.003
bell+0.003
work+0.002
igsaw+0.002
Neg logit contributions
 feat-0.004
 agile-0.004
 cautious-0.004
 observ-0.004
 courageous-0.004
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
omm+0.015
igsaw+0.012
+0.012
stand+0.011
ifying+0.011
Neg logit contributions
 feat-0.008
 observ-0.007
 cautious-0.007
 intelligent-0.007
 courageous-0.007
Token Sara
Token ID24799
Feature activation+1.14e-02

Pos logit contributions
omm+0.007
igsaw+0.006
stand+0.005
work+0.005
+0.005
Neg logit contributions
 feat-0.005
 intelligent-0.005
 observ-0.005
 cautious-0.005
 wond-0.005
Token grinned
Token ID36268
Feature activation-1.79e-02

Pos logit contributions
 feat+0.014
 observ+0.013
 cautious+0.013
 courageous+0.013
 agile+0.013
Neg logit contributions
omm-0.023
stand-0.018
igsaw-0.018
-0.018
work-0.017
Token as
Token ID355
Feature activation+7.07e-02

Pos logit contributions
omm+0.038
+0.033
igsaw+0.033
stand+0.032
ifying+0.030
Neg logit contributions
 feat-0.020
 intelligent-0.018
 strong-0.018
 brave-0.018
 observ-0.017
Token her
Token ID607
Feature activation-4.30e-03

Pos logit contributions
 feat+0.117
 vines+0.113
 cautious+0.113
 observ+0.112
 wounds+0.110
Neg logit contributions
omm-0.104
stand-0.081
igsaw-0.078
bell-0.074
-0.072
Token daddy
Token ID35822
Feature activation-3.24e-02

Pos logit contributions
omm+0.008
stand+0.006
igsaw+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.006
 intelligent-0.006
 observ-0.006
 courageous-0.006
 cautious-0.006
Token passed
Token ID3804
Feature activation-3.83e-03

Pos logit contributions
omm+0.063
stand+0.050
+0.049
igsaw+0.049
work+0.047
Neg logit contributions
 feat-0.040
 observ-0.038
 cautious-0.037
 agile-0.036
 courageous-0.036
Token the
Token ID262
Feature activation-9.89e-03

Pos logit contributions
omm+0.008
stand+0.007
igsaw+0.007
+0.006
bell+0.006
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 agile-0.004
 courageous-0.004
Token phone
Token ID3072
Feature activation-4.31e-03

Pos logit contributions
omm+0.015
stand+0.012
igsaw+0.011
+0.011
bell+0.011
Neg logit contributions
 feat-0.016
 observ-0.016
 agile-0.016
 cautious-0.015
 courageous-0.015
Token
Token ID220
Feature activation-1.37e-03

Pos logit contributions
 feat+0.004
 observ+0.003
 cautious+0.003
 courageous+0.003
 intelligent+0.003
Neg logit contributions
omm-0.005
igsaw-0.004
stand-0.004
work-0.003
ifying-0.003
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
omm+0.002
+0.002
stand+0.001
igsaw+0.001
 Today+0.001
Neg logit contributions
 feat-0.002
 observ-0.002
 courageous-0.002
 agile-0.002
 cautious-0.002
Token
Token ID198
Feature activation+2.55e-03

Pos logit contributions
 feat+0.018
 redd+0.018
 vines+0.018
 agile+0.017
 observ+0.017
Neg logit contributions
omm-0.015
stand-0.012
-0.011
igsaw-0.009
ifying-0.009
TokenMike
Token ID16073
Feature activation+3.54e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 vines+0.003
 intelligent+0.003
 cautious+0.003
Neg logit contributions
omm-0.005
stand-0.004
-0.004
igsaw-0.004
ifying-0.004
Token smiled
Token ID13541
Feature activation-1.11e-02

Pos logit contributions
 feat+0.006
 observ+0.006
 agile+0.006
 courageous+0.006
 cautious+0.006
Neg logit contributions
omm-0.005
stand-0.004
igsaw-0.004
-0.004
work-0.003
Token as
Token ID355
Feature activation+7.04e-02

Pos logit contributions
omm+0.020
igsaw+0.015
stand+0.015
+0.015
ifying+0.014
Neg logit contributions
 feat-0.016
 observ-0.015
 cautious-0.015
 agile-0.015
 courageous-0.015
Token he
Token ID339
Feature activation+7.96e-03

Pos logit contributions
 feat+0.087
 observ+0.081
 cautious+0.079
 vines+0.078
 agile+0.077
Neg logit contributions
omm-0.139
igsaw-0.110
stand-0.110
-0.108
ifying-0.104
Token ate
Token ID15063
Feature activation-1.08e-02

Pos logit contributions
 biggest+0.011
 unnecessary+0.011
 observ+0.011
 feat+0.011
 redd+0.011
Neg logit contributions
omm-0.013
-0.011
bell-0.010
stand-0.010
ifying-0.009
Token the
Token ID262
Feature activation-9.02e-03

Pos logit contributions
omm+0.023
stand+0.019
igsaw+0.019
+0.019
ifying+0.018
Neg logit contributions
 feat-0.011
 observ-0.011
 cautious-0.010
 agile-0.010
 courageous-0.010
Token muff
Token ID27563
Feature activation-2.05e-02

Pos logit contributions
omm+0.013
+0.009
stand+0.008
igsaw+0.008
work+0.007
Neg logit contributions
 feat-0.017
 agile-0.016
 cautious-0.016
 observ-0.016
 courageous-0.016
Tokenin
Token ID259
Feature activation-6.31e-03

Pos logit contributions
omm+0.046
igsaw+0.037
+0.037
stand+0.037
ifying+0.035
Neg logit contributions
 feat-0.021
 observ-0.019
 cautious-0.018
 wond-0.018
 agile-0.018
Token want
Token ID765
Feature activation-1.30e-02

Pos logit contributions
omm+0.021
+0.018
igsaw+0.017
stand+0.016
bell+0.016
Neg logit contributions
 feat-0.017
cut-0.016
 agile-0.016
 observ-0.015
 cautious-0.015
Token.
Token ID13
Feature activation-6.31e-03

Pos logit contributions
omm+0.027
stand+0.022
igsaw+0.021
+0.021
ifying+0.020
Neg logit contributions
 feat-0.015
 observ-0.014
 cautious-0.014
 agile-0.014
 courageous-0.014
Token All
Token ID1439
Feature activation-9.78e-03

Pos logit contributions
omm+0.012
igsaw+0.011
stand+0.010
+0.010
ifying+0.010
Neg logit contributions
 observ-0.007
 cautious-0.007
 feat-0.007
 intelligent-0.007
 courageous-0.007
Token of
Token ID286
Feature activation-1.63e-02

Pos logit contributions
omm+0.015
work+0.012
igsaw+0.012
stand+0.012
ifying+0.012
Neg logit contributions
 feat-0.014
 courageous-0.013
 cautious-0.013
 prey-0.013
 wond-0.013
Token these
Token ID777
Feature activation-1.07e-03

Pos logit contributions
omm+0.036
igsaw+0.029
stand+0.028
+0.028
ifying+0.027
Neg logit contributions
 feat-0.017
 observ-0.016
 intelligent-0.015
 cautious-0.015
 courageous-0.015
Token perf
Token ID23035
Feature activation-8.99e-02

Pos logit contributions
omm+0.002
stand+0.002
+0.002
igsaw+0.002
ifying+0.002
Neg logit contributions
 feat-0.001
 agile-0.001
 cautious-0.001
cut-0.001
 courageous-0.001
Tokenumes
Token ID8139
Feature activation-4.57e-03

Pos logit contributions
omm+0.165
igsaw+0.135
stand+0.131
 Today+0.126
+0.119
Neg logit contributions
 observ-0.116
 feat-0.107
 redd-0.104
 biggest-0.102
 careful-0.102
Token smell
Token ID8508
Feature activation+9.00e-03

Pos logit contributions
omm+0.008
stand+0.006
igsaw+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.006
 observ-0.006
 cautious-0.006
 agile-0.006
 courageous-0.006
Token so
Token ID523
Feature activation+4.61e-02

Pos logit contributions
 feat+0.012
 observ+0.011
 cautious+0.011
 agile+0.011
 courageous+0.011
Neg logit contributions
omm-0.017
stand-0.014
igsaw-0.013
-0.013
ifying-0.013
Token good
Token ID922
Feature activation-1.52e-02

Pos logit contributions
 feat+0.050
 observ+0.045
 cautious+0.045
 agile+0.044
cut+0.044
Neg logit contributions
omm-0.098
igsaw-0.079
-0.078
stand-0.078
ifying-0.075
Token.
Token ID13
Feature activation-5.32e-03

Pos logit contributions
omm+0.030
stand+0.024
igsaw+0.024
+0.023
work+0.023
Neg logit contributions
 feat-0.018
 observ-0.017
 cautious-0.017
 agile-0.017
 courageous-0.017
Token hopped
Token ID45230
Feature activation+2.33e-02

Pos logit contributions
 feat+0.019
 observ+0.018
 agile+0.017
 redd+0.017
 cautious+0.017
Neg logit contributions
omm-0.028
stand-0.022
bell-0.021
igsaw-0.021
-0.021
Token closer
Token ID5699
Feature activation+2.36e-03

Pos logit contributions
 feat+0.024
 observ+0.022
 cautious+0.022
 courageous+0.021
 intelligent+0.021
Neg logit contributions
omm-0.052
igsaw-0.042
stand-0.042
-0.041
ifying-0.040
Token to
Token ID284
Feature activation+9.14e-03

Pos logit contributions
 feat+0.004
 observ+0.004
cut+0.004
 redd+0.004
 agile+0.004
Neg logit contributions
omm-0.003
stand-0.003
igsaw-0.003
work-0.002
bell-0.002
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
 agile+0.013
 cautious+0.012
 feat+0.012
 observ+0.012
 redd+0.012
Neg logit contributions
omm-0.014
stand-0.012
work-0.011
bell-0.011
igsaw-0.011
Token story
Token ID1621
Feature activation-3.93e-02

Pos logit contributions
omm+0.033
stand+0.026
igsaw+0.026
+0.025
work+0.025
Neg logit contributions
 feat-0.019
 observ-0.018
 agile-0.018
 cautious-0.018
 courageous-0.017
Tokente
Token ID660
Feature activation-8.75e-02

Pos logit contributions
omm+0.083
+0.066
igsaw+0.065
ifying+0.064
ifier+0.062
Neg logit contributions
 feat-0.044
 strong-0.042
 intelligent-0.041
 brave-0.041
 tall-0.040
Tokenller
Token ID6051
Feature activation-2.99e-02

Pos logit contributions
omm+0.154
igsaw+0.120
stand+0.115
+0.115
ifying+0.113
Neg logit contributions
 feat-0.129
 observ-0.118
cut-0.116
 intelligent-0.115
 cautious-0.115
Token and
Token ID290
Feature activation-9.04e-04

Pos logit contributions
omm+0.057
igsaw+0.044
stand+0.043
+0.043
ifying+0.041
Neg logit contributions
 feat-0.041
 observ-0.037
 cautious-0.037
 courageous-0.037
 intelligent-0.037
Token the
Token ID262
Feature activation+1.00e-03

Pos logit contributions
omm+0.002
stand+0.001
work+0.001
bell+0.001
ifying+0.001
Neg logit contributions
 feat-0.001
 redd-0.001
 vines-0.001
 agile-0.001
cut-0.001
Token crowd
Token ID4315
Feature activation-3.07e-03

Pos logit contributions
 feat+0.001
 agile+0.001
 observ+0.001
 cautious+0.001
 sturdy+0.001
Neg logit contributions
omm-0.002
stand-0.002
work-0.002
igsaw-0.002
-0.002
Token that
Token ID326
Feature activation+2.93e-02

Pos logit contributions
omm+0.005
stand+0.004
work+0.004
bell+0.004
+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 redd-0.004
 unnecessary-0.004
 cautious-0.004
Token stirred
Token ID33091
Feature activation-2.19e-02

Pos logit contributions
omm+0.001
stand+0.001
igsaw+0.001
+0.001
ifying+0.000
Neg logit contributions
 feat-0.000
 observ-0.000
 cautious-0.000
 courageous-0.000
 agile-0.000
Token the
Token ID262
Feature activation-6.25e-03

Pos logit contributions
omm+0.044
stand+0.036
+0.035
igsaw+0.035
work+0.033
Neg logit contributions
 observ-0.026
 feat-0.026
 cautious-0.025
 courageous-0.025
 agile-0.025
Token bowl
Token ID9396
Feature activation-1.54e-02

Pos logit contributions
omm+0.009
stand+0.007
+0.007
igsaw+0.006
 frost+0.006
Neg logit contributions
 observ-0.011
 cautious-0.011
 courageous-0.011
 agile-0.011
 feat-0.011
Token together
Token ID1978
Feature activation-1.95e-02

Pos logit contributions
omm+0.029
igsaw+0.023
stand+0.023
+0.023
ifying+0.022
Neg logit contributions
 feat-0.020
 observ-0.019
 cautious-0.018
 agile-0.018
 courageous-0.018
Token under
Token ID739
Feature activation-1.14e-02

Pos logit contributions
omm+0.034
stand+0.027
igsaw+0.027
+0.026
work+0.025
Neg logit contributions
 feat-0.028
 observ-0.027
 cautious-0.026
 agile-0.026
 courageous-0.026
Token Grand
Token ID5675
Feature activation-8.39e-02

Pos logit contributions
omm+0.027
igsaw+0.022
stand+0.022
+0.022
ifying+0.021
Neg logit contributions
 feat-0.010
 observ-0.009
 cautious-0.008
 intelligent-0.008
 courageous-0.008
Tokenma
Token ID2611
Feature activation-6.19e-02

Pos logit contributions
omm+0.138
ifying+0.137
igsaw+0.116
stand+0.116
ifier+0.115
Neg logit contributions
 feat-0.109
 intelligent-0.091
ts-0.090
 clever-0.089
 careful-0.088
Token's
Token ID338
Feature activation-3.87e-02

Pos logit contributions
omm+0.112
igsaw+0.086
+0.083
stand+0.081
ifying+0.079
Neg logit contributions
 feat-0.088
 observ-0.082
 cautious-0.081
 intelligent-0.080
 strong-0.080
Token careful
Token ID8161
Feature activation-1.03e-02

Pos logit contributions
omm+0.063
stand+0.048
igsaw+0.047
+0.046
ifying+0.043
Neg logit contributions
 feat-0.061
 observ-0.060
 agile-0.059
 cautious-0.059
 courageous-0.058
Token watch
Token ID2342
Feature activation-7.80e-03

Pos logit contributions
omm+0.019
stand+0.015
igsaw+0.015
+0.014
work+0.014
Neg logit contributions
 feat-0.014
 observ-0.013
 agile-0.013
 courageous-0.013
 cautious-0.013
Token.
Token ID13
Feature activation-9.81e-03

Pos logit contributions
omm+0.015
stand+0.012
igsaw+0.012
+0.012
ifying+0.011
Neg logit contributions
 feat-0.010
 observ-0.010
 cautious-0.009
 courageous-0.009
 agile-0.009
TokenThe
Token ID464
Feature activation-1.42e-02

Pos logit contributions
omm+0.018
stand+0.015
+0.014
bell+0.013
igsaw+0.013
Neg logit contributions
 feat-0.015
 courageous-0.014
 observ-0.014
 wond-0.014
 unnecessary-0.013
Token day
Token ID1110
Feature activation-8.10e-03

Pos logit contributions
omm+0.032
igsaw+0.027
+0.026
stand+0.026
ifying+0.025
Neg logit contributions
 feat-0.014
 intelligent-0.013
 observ-0.013
 courageous-0.012
 strong-0.012
Token finally
Token ID3443
Feature activation+4.64e-03

Pos logit contributions
omm+0.016
igsaw+0.013
stand+0.013
+0.013
ifying+0.012
Neg logit contributions
 feat-0.010
 observ-0.009
 intelligent-0.009
 cautious-0.009
 courageous-0.009
Token arrived
Token ID5284
Feature activation-3.72e-03

Pos logit contributions
 feat+0.005
 strong+0.005
 flies+0.005
 brave+0.005
 intelligent+0.005
Neg logit contributions
omm-0.009
ifying-0.008
igsaw-0.008
-0.008
stand-0.007
Token when
Token ID618
Feature activation+1.82e-02

Pos logit contributions
omm+0.008
igsaw+0.006
stand+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 intelligent-0.004
 courageous-0.004
Token grandma
Token ID49890
Feature activation-8.32e-02

Pos logit contributions
 feat+0.015
 strong+0.015
 intelligent+0.013
 brave+0.013
 clever+0.013
Neg logit contributions
omm-0.045
stand-0.037
igsaw-0.037
ifying-0.036
ifier-0.036
Token came
Token ID1625
Feature activation-3.18e-02

Pos logit contributions
omm+0.171
igsaw+0.143
stand+0.132
ifying+0.130
+0.129
Neg logit contributions
 feat-0.099
 brave-0.087
 intelligent-0.085
 observ-0.085
 clever-0.085
Token to
Token ID284
Feature activation-6.39e-03

Pos logit contributions
omm+0.076
igsaw+0.063
+0.062
stand+0.060
ifying+0.058
Neg logit contributions
 feat-0.032
 strong-0.028
 intelligent-0.027
 brave-0.027
 free-0.026
Token visit
Token ID3187
Feature activation-7.26e-03

Pos logit contributions
omm+0.012
stand+0.010
igsaw+0.010
+0.010
work+0.009
Neg logit contributions
 feat-0.008
 observ-0.008
 cautious-0.008
 agile-0.007
 courageous-0.007
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
omm+0.012
stand+0.010
+0.009
igsaw+0.009
bell+0.009
Neg logit contributions
 feat-0.011
 observ-0.011
 agile-0.011
 cautious-0.011
 courageous-0.010
Token Tim
Token ID5045
Feature activation+4.78e-03

Pos logit contributions
omm+0.031
stand+0.026
work+0.025
ifying+0.025
igsaw+0.025
Neg logit contributions
 intelligent-0.022
 strong-0.021
 feat-0.021
 sturdy-0.021
 independent-0.021
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
omm+0.018
igsaw+0.013
stand+0.013
+0.013
ifying+0.013
Neg logit contributions
 feat-0.016
 observ-0.016
 cautious-0.015
 intelligent-0.015
 agile-0.015
Token
Token ID198
Feature activation-2.05e-02

Pos logit contributions
omm+0.023
stand+0.018
+0.016
igsaw+0.016
bell+0.015
Neg logit contributions
 feat-0.027
 agile-0.026
 courageous-0.026
 observ-0.026
cut-0.026
TokenWhen
Token ID2215
Feature activation+3.48e-05

Pos logit contributions
omm+0.042
igsaw+0.033
stand+0.033
+0.033
ifying+0.032
Neg logit contributions
 feat-0.024
 observ-0.023
 cautious-0.022
 courageous-0.022
 agile-0.022
Token L
Token ID406
Feature activation-2.48e-02

Pos logit contributions
 feat+0.000
 strong+0.000
 graceful+0.000
 clever+0.000
 intelligent+0.000
Neg logit contributions
omm-0.000
igsaw-0.000
ifier-0.000
ifying-0.000
work-0.000
Tokenola
Token ID5708
Feature activation-2.94e-02

Pos logit contributions
omm+0.038
ifying+0.029
work+0.028
ifier+0.028
+0.028
Neg logit contributions
 feat-0.037
 observ-0.037
 intelligent-0.037
 clever-0.036
 careful-0.036
Token finished
Token ID5201
Feature activation-8.29e-02

Pos logit contributions
omm+0.057
igsaw+0.046
stand+0.045
+0.043
ifying+0.042
Neg logit contributions
 feat-0.038
 observ-0.035
 cautious-0.035
 intelligent-0.034
 courageous-0.034
Token,
Token ID11
Feature activation+5.82e-03

Pos logit contributions
omm+0.166
igsaw+0.131
stand+0.131
ifying+0.130
+0.128
Neg logit contributions
 feat-0.102
 observ-0.093
 strong-0.093
 intelligent-0.093
 cautious-0.091
Token she
Token ID673
Feature activation+1.97e-03

Pos logit contributions
 feat+0.009
cut+0.009
 cautious+0.009
 observ+0.009
 courageous+0.008
Neg logit contributions
omm-0.009
stand-0.007
-0.007
igsaw-0.007
bell-0.007
Token looked
Token ID3114
Feature activation+5.78e-03

Pos logit contributions
 feat+0.002
 observ+0.002
 cautious+0.002
 courageous+0.002
 agile+0.002
Neg logit contributions
omm-0.004
stand-0.003
-0.003
bell-0.003
igsaw-0.003
Token up
Token ID510
Feature activation+2.23e-02

Pos logit contributions
 feat+0.008
cut+0.007
 agile+0.007
 observ+0.007
 cautious+0.007
Neg logit contributions
omm-0.010
ifying-0.008
igsaw-0.008
bell-0.008
work-0.008
Token and
Token ID290
Feature activation+1.18e-03

Pos logit contributions
 feat+0.029
 observ+0.027
 cautious+0.027
 agile+0.027
 courageous+0.026
Neg logit contributions
omm-0.042
igsaw-0.033
stand-0.033
-0.033
ifying-0.031
Token wanted
Token ID2227
Feature activation+1.50e-03

Pos logit contributions
omm+0.012
stand+0.009
igsaw+0.009
+0.009
bell+0.009
Neg logit contributions
 feat-0.008
 observ-0.008
 agile-0.008
 courageous-0.008
 cautious-0.008
Token to
Token ID284
Feature activation-4.12e-03

Pos logit contributions
 feat+0.002
 observ+0.001
 intelligent+0.001
 cautious+0.001
 strong+0.001
Neg logit contributions
omm-0.003
-0.003
igsaw-0.003
ifying-0.003
stand-0.003
Token peek
Token ID27185
Feature activation+2.33e-02

Pos logit contributions
omm+0.009
igsaw+0.007
ifying+0.007
+0.007
stand+0.007
Neg logit contributions
 feat-0.005
 strong-0.004
 intelligent-0.004
 observ-0.004
 brave-0.004
Token inside
Token ID2641
Feature activation-8.78e-03

Pos logit contributions
 feat+0.032
cut+0.031
 agile+0.030
 observ+0.029
 cautious+0.029
Neg logit contributions
omm-0.039
bell-0.032
ifying-0.032
igsaw-0.032
stand-0.031
Token her
Token ID607
Feature activation-2.67e-02

Pos logit contributions
omm+0.018
stand+0.015
igsaw+0.015
+0.014
ifying+0.014
Neg logit contributions
 feat-0.010
 observ-0.009
 agile-0.009
 cautious-0.009
 courageous-0.009
Token Grand
Token ID5675
Feature activation-8.26e-02

Pos logit contributions
omm+0.051
stand+0.040
+0.040
igsaw+0.039
ifying+0.037
Neg logit contributions
 feat-0.037
 intelligent-0.034
 biggest-0.033
 clever-0.033
 observ-0.033
Tokenma
Token ID2611
Feature activation-5.73e-02

Pos logit contributions
omm+0.145
ifying+0.135
igsaw+0.123
+0.119
stand+0.115
Neg logit contributions
 feat-0.112
 intelligent-0.098
 observ-0.096
 wond-0.096
 daring-0.094
Token's
Token ID338
Feature activation-4.08e-02

Pos logit contributions
omm+0.123
igsaw+0.096
+0.092
stand+0.086
ifying+0.085
Neg logit contributions
 feat-0.065
 strong-0.058
 intelligent-0.058
 brave-0.058
 free-0.055
Token room
Token ID2119
Feature activation-2.93e-02

Pos logit contributions
omm+0.063
stand+0.046
igsaw+0.046
+0.046
ifying+0.043
Neg logit contributions
 feat-0.069
 observ-0.065
 cautious-0.064
 intelligent-0.064
 courageous-0.064
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
omm+0.061
igsaw+0.049
stand+0.048
+0.048
ifying+0.046
Neg logit contributions
 feat-0.033
 observ-0.030
 cautious-0.030
 intelligent-0.030
 courageous-0.030
Token Grand
Token ID5675
Feature activation-4.61e-02

Pos logit contributions
omm+0.016
+0.013
igsaw+0.013
ifying+0.012
work+0.012
Neg logit contributions
 feat-0.015
 observ-0.015
 intelligent-0.015
 strong-0.014
 wond-0.014
Token
Token ID220
Feature activation-1.07e-02

Pos logit contributions
omm+0.019
igsaw+0.016
work+0.015
ifying+0.015
stand+0.015
Neg logit contributions
 feat-0.013
 intelligent-0.012
 observ-0.012
 strong-0.012
 cautious-0.012
Token\n
Token ID198
Feature activation-9.38e-03

Pos logit contributions
omm+0.017
stand+0.013
+0.012
igsaw+0.012
bell+0.011
Neg logit contributions
 feat-0.018
 observ-0.017
 agile-0.017
 courageous-0.017
 cautious-0.017
Token\n
Token ID198
Feature activation-1.59e-02

Pos logit contributions
omm+0.014
stand+0.011
+0.010
igsaw+0.010
bell+0.009
Neg logit contributions
 feat-0.016
 observ-0.016
 agile-0.015
 courageous-0.015
 cautious-0.015
TokenWhen
Token ID2215
Feature activation-1.77e-03

Pos logit contributions
omm+0.033
igsaw+0.027
+0.027
work+0.026
stand+0.026
Neg logit contributions
 feat-0.018
 observ-0.017
 intelligent-0.017
 cautious-0.017
 careful-0.016
Token Lily
Token ID20037
Feature activation-3.67e-02

Pos logit contributions
omm+0.004
igsaw+0.004
ifying+0.004
work+0.004
ifier+0.003
Neg logit contributions
 strong-0.001
 new-0.001
 clever-0.001
 healthy-0.001
 feat-0.001
Token finished
Token ID5201
Feature activation-8.24e-02

Pos logit contributions
omm+0.086
igsaw+0.076
ifying+0.071
work+0.070
stand+0.067
Neg logit contributions
 brave-0.029
 feat-0.028
 strong-0.027
 clever-0.027
 saw-0.026
Token her
Token ID607
Feature activation-3.31e-02

Pos logit contributions
omm+0.177
ifying+0.159
ifier+0.145
stand+0.144
bell+0.141
Neg logit contributions
 strong-0.081
 feat-0.079
 clever-0.077
 healthy-0.075
 intelligent-0.074
Token salad
Token ID20698
Feature activation-2.49e-02

Pos logit contributions
omm+0.054
igsaw+0.041
stand+0.040
+0.039
work+0.036
Neg logit contributions
 feat-0.053
 observ-0.051
 cautious-0.050
 agile-0.050
 courageous-0.049
Token,
Token ID11
Feature activation+9.11e-03

Pos logit contributions
omm+0.039
igsaw+0.028
stand+0.028
+0.027
ifying+0.026
Neg logit contributions
 feat-0.041
 observ-0.039
 cautious-0.039
 courageous-0.038
 intelligent-0.038
Token she
Token ID673
Feature activation+7.83e-03

Pos logit contributions
 feat+0.013
 observ+0.013
 cautious+0.012
 courageous+0.012
 agile+0.012
Neg logit contributions
omm-0.016
stand-0.012
igsaw-0.012
-0.012
ifying-0.011
Token noticed
Token ID6810
Feature activation+2.10e-02

Pos logit contributions
 feat+0.008
 observ+0.008
 cautious+0.008
 intelligent+0.007
 courageous+0.007
Neg logit contributions
omm-0.017
igsaw-0.014
stand-0.014
-0.014
ifying-0.013
Token.
Token ID13
Feature activation+2.36e-03

Pos logit contributions
omm+0.069
igsaw+0.055
+0.053
stand+0.053
ifying+0.052
Neg logit contributions
 feat-0.038
 observ-0.035
 cautious-0.035
 intelligent-0.034
 courageous-0.034
Token\n
Token ID198
Feature activation+2.27e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 intelligent+0.003
 cautious+0.003
 strong+0.003
Neg logit contributions
omm-0.005
igsaw-0.004
work-0.004
stand-0.004
ifying-0.004
Token\n
Token ID198
Feature activation-4.57e-03

Pos logit contributions
 feat+0.004
 redd+0.004
 agile+0.004
 observ+0.004
 courageous+0.004
Neg logit contributions
omm-0.003
stand-0.002
-0.002
igsaw-0.002
 Today-0.002
TokenWhen
Token ID2215
Feature activation+4.24e-03

Pos logit contributions
omm+0.010
igsaw+0.008
+0.008
work+0.008
stand+0.008
Neg logit contributions
 feat-0.005
 observ-0.004
 cautious-0.004
 intelligent-0.004
 courageous-0.004
Token mom
Token ID1995
Feature activation-6.15e-02

Pos logit contributions
 strong+0.004
 new+0.004
 clever+0.003
 tall+0.003
 graceful+0.003
Neg logit contributions
omm-0.010
igsaw-0.008
work-0.008
ifying-0.008
ann-0.008
Token finished
Token ID5201
Feature activation-8.21e-02

Pos logit contributions
omm+0.124
igsaw+0.100
stand+0.097
+0.097
ifying+0.093
Neg logit contributions
 feat-0.074
 observ-0.068
 cautious-0.067
 intelligent-0.066
 courageous-0.066
Token her
Token ID607
Feature activation-3.56e-02

Pos logit contributions
omm+0.171
ifying+0.151
ifier+0.140
igsaw+0.138
 unison+0.133
Neg logit contributions
 strong-0.085
 feat-0.082
 new-0.080
 tall-0.080
 healthy-0.079
Token work
Token ID670
Feature activation-1.27e-02

Pos logit contributions
omm+0.062
igsaw+0.047
stand+0.046
+0.045
ifying+0.044
Neg logit contributions
 feat-0.054
 observ-0.051
 intelligent-0.050
 cautious-0.050
 courageous-0.049
Token,
Token ID11
Feature activation+1.78e-02

Pos logit contributions
omm+0.023
ifying+0.018
igsaw+0.017
ifier+0.017
+0.016
Neg logit contributions
 feat-0.018
 strong-0.017
 intelligent-0.017
 brave-0.016
 clever-0.016
Token she
Token ID673
Feature activation+1.00e-03

Pos logit contributions
 feat+0.025
 observ+0.024
 intelligent+0.023
 cautious+0.023
 courageous+0.023
Neg logit contributions
omm-0.032
igsaw-0.025
stand-0.025
-0.024
ifying-0.023
Token went
Token ID1816
Feature activation-6.85e-03

Pos logit contributions
 feat+0.001
 observ+0.001
 cautious+0.001
 courageous+0.001
 agile+0.001
Neg logit contributions
omm-0.002
stand-0.002
igsaw-0.002
-0.002
ifying-0.002
Token.
Token ID13
Feature activation-5.65e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
 courageous+0.004
 agile+0.004
Neg logit contributions
omm-0.007
igsaw-0.006
stand-0.006
-0.005
ifying-0.005
Token\n
Token ID198
Feature activation-7.31e-03

Pos logit contributions
omm+0.009
igsaw+0.006
stand+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.009
 observ-0.009
 cautious-0.009
 courageous-0.009
 agile-0.009
Token\n
Token ID198
Feature activation-1.44e-02

Pos logit contributions
omm+0.011
stand+0.008
+0.008
igsaw+0.007
bell+0.007
Neg logit contributions
 feat-0.013
 redd-0.013
 agile-0.012
 observ-0.012
 courageous-0.012
TokenWhen
Token ID2215
Feature activation+2.11e-03

Pos logit contributions
omm+0.028
stand+0.022
+0.021
igsaw+0.021
ifying+0.020
Neg logit contributions
 feat-0.019
 observ-0.017
 cautious-0.017
 agile-0.017
 courageous-0.017
Token Sue
Token ID26089
Feature activation-2.22e-02

Pos logit contributions
 feat+0.003
 observ+0.002
 cautious+0.002
 courageous+0.002
 agile+0.002
Neg logit contributions
omm-0.004
igsaw-0.003
stand-0.003
-0.003
ifying-0.003
Token finished
Token ID5201
Feature activation-8.19e-02

Pos logit contributions
omm+0.039
stand+0.030
igsaw+0.030
+0.030
ifying+0.028
Neg logit contributions
 feat-0.032
 observ-0.030
 cautious-0.030
 courageous-0.030
 agile-0.029
Token,
Token ID11
Feature activation+5.49e-03

Pos logit contributions
omm+0.171
ifying+0.136
igsaw+0.134
ifier+0.132
stand+0.132
Neg logit contributions
 feat-0.097
 observ-0.087
 strong-0.087
 intelligent-0.086
 careful-0.086
Token she
Token ID673
Feature activation-8.47e-04

Pos logit contributions
 feat+0.009
 observ+0.009
cut+0.009
 cautious+0.009
 agile+0.009
Neg logit contributions
omm-0.008
-0.006
stand-0.006
igsaw-0.006
bell-0.005
Token focused
Token ID5670
Feature activation-2.28e-02

Pos logit contributions
omm+0.001
stand+0.001
+0.001
bell+0.001
igsaw+0.001
Neg logit contributions
 feat-0.001
 observ-0.001
cut-0.001
 cautious-0.001
 courageous-0.001
Token her
Token ID607
Feature activation-2.10e-02

Pos logit contributions
omm+0.042
stand+0.037
+0.035
igsaw+0.034
bell+0.032
Neg logit contributions
 feat-0.029
 observ-0.027
 redd-0.026
 biggest-0.026
cut-0.025
Token attention
Token ID3241
Feature activation-3.82e-02

Pos logit contributions
omm+0.036
+0.030
igsaw+0.029
stand+0.027
bell+0.026
Neg logit contributions
cut-0.030
 observ-0.030
 cautious-0.028
 feat-0.028
 redd-0.028
Token.
Token ID13
Feature activation-5.94e-03

Pos logit contributions
omm+0.030
stand+0.025
+0.024
igsaw+0.022
bell+0.022
Neg logit contributions
cut-0.036
 observ-0.036
 feat-0.035
 cautious-0.034
 courageous-0.034
Token\n
Token ID198
Feature activation-1.10e-02

Pos logit contributions
omm+0.011
igsaw+0.008
stand+0.008
+0.008
ifying+0.008
Neg logit contributions
 feat-0.008
 observ-0.008
 cautious-0.008
 intelligent-0.008
 courageous-0.008
Token\n
Token ID198
Feature activation-1.65e-02

Pos logit contributions
omm+0.018
stand+0.014
+0.014
igsaw+0.013
bell+0.012
Neg logit contributions
 feat-0.018
 agile-0.017
 observ-0.017
 cautious-0.017
 courageous-0.017
TokenWhen
Token ID2215
Feature activation+7.34e-04

Pos logit contributions
omm+0.035
stand+0.028
igsaw+0.028
+0.027
ifying+0.026
Neg logit contributions
 feat-0.019
 observ-0.017
 cautious-0.017
 agile-0.017
 courageous-0.017
Token Lucy
Token ID22162
Feature activation-3.63e-02

Pos logit contributions
 feat+0.001
 strong+0.001
 intelligent+0.001
 clever+0.001
 brave+0.001
Neg logit contributions
omm-0.002
igsaw-0.001
stand-0.001
ifying-0.001
work-0.001
Token finished
Token ID5201
Feature activation-8.17e-02

Pos logit contributions
omm+0.066
igsaw+0.052
stand+0.050
ifying+0.048
+0.048
Neg logit contributions
 feat-0.051
 observ-0.047
 cautious-0.047
 courageous-0.046
 intelligent-0.046
Token cleaning
Token ID12724
Feature activation-2.94e-02

Pos logit contributions
omm+0.162
ifying+0.140
ifier+0.133
work+0.130
igsaw+0.129
Neg logit contributions
 feat-0.095
 strong-0.094
 careful-0.091
 clever-0.088
 daring-0.088
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
omm+0.054
igsaw+0.041
stand+0.041
+0.040
ifying+0.039
Neg logit contributions
 feat-0.041
 observ-0.039
 cautious-0.038
 courageous-0.038
 agile-0.038
Token penny
Token ID26369
Feature activation-2.34e-02

Pos logit contributions
omm+0.022
igsaw+0.017
stand+0.017
+0.016
ifying+0.016
Neg logit contributions
 feat-0.015
 observ-0.014
 intelligent-0.013
 cautious-0.013
 courageous-0.013
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
omm+0.033
stand+0.023
+0.023
igsaw+0.023
work+0.020
Neg logit contributions
 feat-0.043
 observ-0.042
 cautious-0.041
 courageous-0.040
 agile-0.040
Token she
Token ID673
Feature activation+7.84e-03

Pos logit contributions
 feat+0.023
 observ+0.022
 cautious+0.022
 agile+0.021
rows+0.021
Neg logit contributions
omm-0.019
stand-0.014
-0.013
igsaw-0.013
bell-0.012
Token.
Token ID13
Feature activation-5.90e-03

Pos logit contributions
omm+0.021
igsaw+0.017
stand+0.017
+0.016
ifying+0.016
Neg logit contributions
 feat-0.013
 observ-0.012
 cautious-0.012
 courageous-0.012
 agile-0.012
Token\n
Token ID198
Feature activation-2.26e-03

Pos logit contributions
omm+0.008
+0.005
igsaw+0.005
stand+0.005
bell+0.005
Neg logit contributions
 feat-0.011
 observ-0.011
 cautious-0.011
 courageous-0.011
 agile-0.011
Token\n
Token ID198
Feature activation-8.90e-03

Pos logit contributions
omm+0.003
stand+0.002
+0.002
 Today+0.002
igsaw+0.002
Neg logit contributions
 feat-0.004
 agile-0.004
 redd-0.004
 observ-0.004
 courageous-0.004
TokenWhen
Token ID2215
Feature activation+2.18e-03

Pos logit contributions
omm+0.017
stand+0.014
+0.013
igsaw+0.013
bell+0.013
Neg logit contributions
 feat-0.011
 observ-0.010
 agile-0.010
 cautious-0.010
 courageous-0.010
Token Sue
Token ID26089
Feature activation-2.15e-02

Pos logit contributions
 feat+0.003
 observ+0.002
 cautious+0.002
 courageous+0.002
 agile+0.002
Neg logit contributions
omm-0.004
-0.004
igsaw-0.004
stand-0.004
ifying-0.003
Token finished
Token ID5201
Feature activation-8.16e-02

Pos logit contributions
omm+0.044
igsaw+0.037
stand+0.034
ifying+0.034
+0.033
Neg logit contributions
 feat-0.025
 brave-0.023
 strong-0.023
 intelligent-0.023
 clever-0.023
Token eating
Token ID6600
Feature activation-1.61e-02

Pos logit contributions
omm+0.177
ifying+0.158
ifier+0.150
igsaw+0.148
stand+0.141
Neg logit contributions
 strong-0.078
 feat-0.077
 healthy-0.074
 tall-0.073
 new-0.071
Token,
Token ID11
Feature activation+7.88e-03

Pos logit contributions
omm+0.032
igsaw+0.026
stand+0.025
+0.025
ifying+0.024
Neg logit contributions
 feat-0.020
 observ-0.018
 intelligent-0.018
 cautious-0.018
 courageous-0.018
Token she
Token ID673
Feature activation+3.58e-03

Pos logit contributions
 feat+0.016
cut+0.016
rows+0.015
 observ+0.015
cest+0.015
Neg logit contributions
omm-0.008
-0.006
bell-0.005
stand-0.004
 lipstick-0.004
Token went
Token ID1816
Feature activation-7.94e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
 courageous+0.004
 agile+0.004
Neg logit contributions
omm-0.007
stand-0.006
igsaw-0.006
-0.006
ifying-0.005
Token outside
Token ID2354
Feature activation-6.11e-03

Pos logit contributions
omm+0.019
igsaw+0.015
+0.015
ifying+0.015
stand+0.014
Neg logit contributions
 feat-0.007
 strong-0.007
 intelligent-0.007
 free-0.007
 brave-0.007
Token
Token ID220
Feature activation-1.28e-02

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
+0.004
ifying+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 courageous-0.004
 intelligent-0.004
Token\n
Token ID198
Feature activation-3.50e-03

Pos logit contributions
omm+0.020
+0.015
stand+0.014
igsaw+0.013
etti+0.012
Neg logit contributions
 feat-0.023
 agile-0.022
cut-0.022
 courageous-0.022
 observ-0.022
Token\n
Token ID198
Feature activation-1.11e-02

Pos logit contributions
omm+0.005
stand+0.004
+0.004
igsaw+0.003
 Today+0.003
Neg logit contributions
 feat-0.006
 agile-0.006
 redd-0.006
 observ-0.006
 cautious-0.006
TokenWhen
Token ID2215
Feature activation+4.76e-03

Pos logit contributions
omm+0.021
stand+0.017
+0.017
igsaw+0.016
ifying+0.016
Neg logit contributions
 feat-0.015
 observ-0.013
 agile-0.013
 cautious-0.013
 intelligent-0.013
Token Lucy
Token ID22162
Feature activation-3.37e-02

Pos logit contributions
 feat+0.005
 strong+0.004
 brave+0.004
 intelligent+0.004
 clever+0.004
Neg logit contributions
omm-0.011
igsaw-0.009
stand-0.009
ifying-0.008
work-0.008
Token finished
Token ID5201
Feature activation-8.14e-02

Pos logit contributions
omm+0.069
igsaw+0.058
stand+0.054
ifying+0.053
+0.052
Neg logit contributions
 feat-0.039
 brave-0.035
 strong-0.034
 cautious-0.034
 courageous-0.034
Token eating
Token ID6600
Feature activation-1.78e-02

Pos logit contributions
omm+0.177
ifying+0.151
ifier+0.141
igsaw+0.141
work+0.140
Neg logit contributions
 feat-0.085
 strong-0.078
 tall-0.074
 healthy-0.073
 new-0.073
Token,
Token ID11
Feature activation+6.57e-03

Pos logit contributions
omm+0.034
igsaw+0.027
stand+0.027
+0.026
ifying+0.025
Neg logit contributions
 feat-0.023
 observ-0.022
 cautious-0.021
 courageous-0.021
 intelligent-0.021
Token she
Token ID673
Feature activation+5.31e-03

Pos logit contributions
 feat+0.012
 observ+0.011
 agile+0.011
cut+0.011
 cautious+0.011
Neg logit contributions
omm-0.009
stand-0.007
-0.007
igsaw-0.006
bell-0.006
Token put
Token ID1234
Feature activation+5.86e-03

Pos logit contributions
 feat+0.006
 observ+0.006
 cautious+0.006
 agile+0.006
 courageous+0.006
Neg logit contributions
omm-0.011
stand-0.009
igsaw-0.009
-0.008
ifying-0.008
Token her
Token ID607
Feature activation-3.20e-03

Pos logit contributions
 observ+0.008
 feat+0.008
 cautious+0.008
 agile+0.008
 courageous+0.008
Neg logit contributions
omm-0.010
-0.009
stand-0.008
igsaw-0.008
ifier-0.008
Token the
Token ID262
Feature activation-1.27e-03

Pos logit contributions
omm+0.032
stand+0.026
igsaw+0.025
+0.024
ifying+0.024
Neg logit contributions
 feat-0.023
 observ-0.022
 cautious-0.021
 agile-0.021
 courageous-0.021
Token story
Token ID1621
Feature activation-3.13e-02

Pos logit contributions
omm+0.002
stand+0.001
igsaw+0.001
work+0.001
bell+0.001
Neg logit contributions
 feat-0.002
 agile-0.002
 cautious-0.002
cut-0.002
 observ-0.002
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
omm+0.067
+0.053
igsaw+0.052
stand+0.051
ifying+0.050
Neg logit contributions
 feat-0.035
 intelligent-0.031
 observ-0.031
 courageous-0.031
 agile-0.031
Token When
Token ID1649
Feature activation+7.05e-03

Pos logit contributions
omm+0.018
igsaw+0.014
stand+0.014
+0.014
ifying+0.013
Neg logit contributions
 feat-0.016
 observ-0.015
 cautious-0.015
 agile-0.015
 courageous-0.015
Token Mia
Token ID32189
Feature activation-2.37e-02

Pos logit contributions
 feat+0.009
 observ+0.008
 cautious+0.008
 agile+0.008
 intelligent+0.008
Neg logit contributions
omm-0.014
igsaw-0.011
stand-0.011
-0.011
ifying-0.010
Token finished
Token ID5201
Feature activation-8.14e-02

Pos logit contributions
omm+0.048
igsaw+0.039
stand+0.037
ifying+0.037
+0.036
Neg logit contributions
 feat-0.028
 observ-0.026
 brave-0.026
 cautious-0.025
 strong-0.025
Token reading
Token ID3555
Feature activation-3.53e-02

Pos logit contributions
omm+0.161
ifying+0.133
ifier+0.123
+0.123
igsaw+0.122
Neg logit contributions
 feat-0.099
 strong-0.095
 tall-0.090
 daring-0.090
 intelligent-0.090
Token,
Token ID11
Feature activation+9.54e-03

Pos logit contributions
omm+0.068
igsaw+0.052
stand+0.051
+0.051
ifying+0.050
Neg logit contributions
 feat-0.046
 courageous-0.042
 observ-0.042
 intelligent-0.042
 strong-0.042
Token Dad
Token ID17415
Feature activation-3.11e-02

Pos logit contributions
 feat+0.014
 observ+0.013
 cautious+0.013
 courageous+0.013
 agile+0.012
Neg logit contributions
omm-0.017
stand-0.013
igsaw-0.013
-0.013
ifying-0.012
Token asked
Token ID1965
Feature activation-1.15e-03

Pos logit contributions
omm+0.066
igsaw+0.054
stand+0.053
+0.053
ifying+0.051
Neg logit contributions
 feat-0.034
 observ-0.032
 cautious-0.031
 intelligent-0.030
 courageous-0.030
Token her
Token ID607
Feature activation-2.29e-02

Pos logit contributions
omm+0.002
stand+0.001
igsaw+0.001
bell+0.001
work+0.001
Neg logit contributions
 feat-0.002
 agile-0.002
 observ-0.002
 courageous-0.002
 cautious-0.002
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
omm+0.026
stand+0.021
igsaw+0.020
bell+0.020
+0.020
Neg logit contributions
 feat-0.014
 observ-0.013
 cautious-0.013
 agile-0.013
 courageous-0.012
Token bell
Token ID8966
Feature activation-2.46e-02

Pos logit contributions
omm+0.013
stand+0.009
bell+0.009
igsaw+0.008
work+0.007
Neg logit contributions
 feat-0.023
 agile-0.022
 cautious-0.022
 observ-0.022
 courageous-0.021
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
omm+0.051
igsaw+0.041
+0.040
stand+0.039
ifying+0.037
Neg logit contributions
 feat-0.029
 observ-0.026
 intelligent-0.026
 cautious-0.025
 strong-0.025
Token When
Token ID1649
Feature activation+5.04e-03

Pos logit contributions
omm+0.021
igsaw+0.018
work+0.018
ifying+0.017
stand+0.017
Neg logit contributions
 feat-0.012
 strong-0.012
 observ-0.012
 intelligent-0.012
 clever-0.011
Token her
Token ID607
Feature activation-4.41e-02

Pos logit contributions
 strong+0.004
 feat+0.004
 brave+0.004
 clever+0.004
 free+0.004
Neg logit contributions
omm-0.012
igsaw-0.010
work-0.009
ifying-0.009
stand-0.009
Token grandma
Token ID49890
Feature activation-8.11e-02

Pos logit contributions
omm+0.080
igsaw+0.065
stand+0.063
+0.063
ifying+0.061
Neg logit contributions
 feat-0.062
 intelligent-0.057
 observ-0.057
 clever-0.055
 strong-0.055
Token came
Token ID1625
Feature activation-3.30e-02

Pos logit contributions
omm+0.152
igsaw+0.122
stand+0.117
+0.116
ifying+0.112
Neg logit contributions
 feat-0.109
 observ-0.101
 cautious-0.098
 courageous-0.098
 intelligent-0.098
Token in
Token ID287
Feature activation-3.96e-02

Pos logit contributions
omm+0.078
igsaw+0.065
+0.065
stand+0.061
ifying+0.060
Neg logit contributions
 feat-0.031
 brave-0.028
 free-0.028
 strong-0.027
 careful-0.027
Token,
Token ID11
Feature activation+2.36e-03

Pos logit contributions
omm+0.087
igsaw+0.068
work+0.066
ifying+0.066
+0.065
Neg logit contributions
 feat-0.043
 strong-0.040
 brave-0.039
 intelligent-0.038
 clever-0.037
Token she
Token ID673
Feature activation-1.79e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
 agile+0.004
 courageous+0.004
Neg logit contributions
omm-0.003
stand-0.002
-0.002
bell-0.002
igsaw-0.002
Token gave
Token ID2921
Feature activation-1.80e-02

Pos logit contributions
omm+0.004
stand+0.003
bell+0.003
+0.003
work+0.003
Neg logit contributions
 feat-0.002
 cautious-0.002
 observ-0.002
 courageous-0.002
 unnecessary-0.002
Token\n
Token ID198
Feature activation-4.76e-03

Pos logit contributions
omm+0.027
+0.020
stand+0.020
igsaw+0.019
ifying+0.017
Neg logit contributions
 feat-0.026
 observ-0.025
 cautious-0.025
 courageous-0.025
 agile-0.025
Token\n
Token ID198
Feature activation-1.05e-02

Pos logit contributions
omm+0.007
stand+0.006
+0.005
igsaw+0.005
bell+0.005
Neg logit contributions
 feat-0.008
 observ-0.008
 agile-0.008
 courageous-0.008
 cautious-0.008
TokenWhen
Token ID2215
Feature activation+4.81e-04

Pos logit contributions
omm+0.022
stand+0.018
igsaw+0.017
+0.017
ifying+0.016
Neg logit contributions
 feat-0.012
 observ-0.011
 cautious-0.011
 courageous-0.011
 agile-0.011
Token Tim
Token ID5045
Feature activation-1.98e-02

Pos logit contributions
 strong+0.000
 feat+0.000
 new+0.000
 clever+0.000
 free+0.000
Neg logit contributions
omm-0.001
igsaw-0.001
stand-0.001
ifier-0.001
ifying-0.001
Tokenmy
Token ID1820
Feature activation-5.57e-02

Pos logit contributions
omm+0.036
igsaw+0.030
ifying+0.029
stand+0.028
+0.027
Neg logit contributions
 feat-0.027
 brave-0.024
 intelligent-0.024
 strong-0.023
 clever-0.023
Token finished
Token ID5201
Feature activation-8.01e-02

Pos logit contributions
omm+0.127
igsaw+0.108
ifying+0.102
ifier+0.099
work+0.097
Neg logit contributions
 feat-0.052
 flies-0.046
 clever-0.046
 brave-0.045
 strong-0.045
Token his
Token ID465
Feature activation-2.41e-02

Pos logit contributions
omm+0.175
ifying+0.163
ifier+0.152
 unison+0.145
igsaw+0.141
Neg logit contributions
 feat-0.075
 strong-0.072
 new-0.070
 tall-0.069
 healthy-0.067
Token sandwich
Token ID20433
Feature activation-1.71e-02

Pos logit contributions
omm+0.037
igsaw+0.027
stand+0.027
+0.027
ifying+0.025
Neg logit contributions
 feat-0.040
 observ-0.038
 cautious-0.038
 courageous-0.037
 agile-0.037
Token,
Token ID11
Feature activation+9.47e-03

Pos logit contributions
omm+0.026
igsaw+0.019
stand+0.019
+0.018
ifying+0.017
Neg logit contributions
 feat-0.029
 observ-0.028
 cautious-0.027
 courageous-0.027
 intelligent-0.027
Token he
Token ID339
Feature activation+1.23e-02

Pos logit contributions
 feat+0.013
 observ+0.013
 cautious+0.013
 agile+0.012
 courageous+0.012
Neg logit contributions
omm-0.016
stand-0.012
-0.012
igsaw-0.012
work-0.012
Token said
Token ID531
Feature activation+1.68e-02

Pos logit contributions
 feat+0.015
 observ+0.014
 cautious+0.014
 courageous+0.014
 agile+0.013
Neg logit contributions
omm-0.025
stand-0.020
igsaw-0.019
-0.019
ifying-0.018
Token.
Token ID13
Feature activation-7.79e-03

Pos logit contributions
omm+0.074
igsaw+0.060
+0.058
stand+0.058
ifying+0.056
Neg logit contributions
 feat-0.036
 cautious-0.032
 observ-0.032
 intelligent-0.032
 agile-0.031
Token\n
Token ID198
Feature activation-1.01e-02

Pos logit contributions
omm+0.013
stand+0.010
+0.010
igsaw+0.009
bell+0.009
Neg logit contributions
 feat-0.012
 observ-0.012
 agile-0.012
 courageous-0.012
 cautious-0.012
Token\n
Token ID198
Feature activation-1.49e-02

Pos logit contributions
omm+0.013
stand+0.010
+0.009
igsaw+0.009
 Today+0.009
Neg logit contributions
 feat-0.019
 agile-0.019
 observ-0.018
 courageous-0.018
 redd-0.018
TokenWhen
Token ID2215
Feature activation+2.31e-03

Pos logit contributions
omm+0.028
stand+0.022
igsaw+0.021
+0.021
ifying+0.020
Neg logit contributions
 feat-0.021
 observ-0.019
 agile-0.019
 courageous-0.019
 cautious-0.019
Token Lily
Token ID20037
Feature activation-3.32e-02

Pos logit contributions
 strong+0.002
 feat+0.001
 new+0.001
 brave+0.001
 graceful+0.001
Neg logit contributions
omm-0.006
igsaw-0.005
work-0.005
ifying-0.005
ifier-0.005
Token finished
Token ID5201
Feature activation-8.00e-02

Pos logit contributions
omm+0.065
igsaw+0.052
stand+0.050
ifying+0.049
+0.048
Neg logit contributions
 feat-0.043
 observ-0.039
 intelligent-0.038
 cautious-0.037
 clever-0.037
Token painting
Token ID12036
Feature activation-3.05e-02

Pos logit contributions
omm+0.153
ifying+0.127
ifier+0.123
stand+0.118
igsaw+0.118
Neg logit contributions
 feat-0.103
 strong-0.097
 intelligent-0.096
 observ-0.095
 clever-0.094
Token,
Token ID11
Feature activation+9.91e-03

Pos logit contributions
omm+0.056
stand+0.043
igsaw+0.043
+0.042
ifying+0.040
Neg logit contributions
 feat-0.042
 observ-0.040
 cautious-0.039
 courageous-0.039
 agile-0.039
Token she
Token ID673
Feature activation+4.44e-03

Pos logit contributions
 feat+0.017
 observ+0.016
 cautious+0.016
 agile+0.015
 courageous+0.015
Neg logit contributions
omm-0.015
stand-0.011
-0.011
igsaw-0.011
bell-0.010
Token looked
Token ID3114
Feature activation+7.65e-03

Pos logit contributions
 feat+0.006
 courageous+0.006
cut+0.006
 cautious+0.005
 observ+0.005
Neg logit contributions
omm-0.008
stand-0.006
bell-0.006
-0.006
work-0.006
Token at
Token ID379
Feature activation+2.31e-02

Pos logit contributions
 feat+0.011
cut+0.010
 agile+0.010
 cautious+0.010
 observ+0.010
Neg logit contributions
omm-0.013
igsaw-0.010
bell-0.010
ifying-0.010
work-0.010
Token\n
Token ID198
Feature activation-8.14e-03

Pos logit contributions
omm+0.023
stand+0.017
+0.017
igsaw+0.016
 Today+0.014
Neg logit contributions
 feat-0.029
 observ-0.028
 agile-0.028
 courageous-0.028
 cautious-0.027
Token\n
Token ID198
Feature activation-1.35e-02

Pos logit contributions
omm+0.011
stand+0.009
+0.008
igsaw+0.008
bell+0.007
Neg logit contributions
 feat-0.015
 agile-0.014
 courageous-0.014
 observ-0.014
 dangers-0.014
TokenWhen
Token ID2215
Feature activation+4.94e-03

Pos logit contributions
omm+0.030
igsaw+0.025
stand+0.025
+0.024
ifying+0.024
Neg logit contributions
 feat-0.013
 observ-0.012
 cautious-0.012
 intelligent-0.012
 courageous-0.012
Token her
Token ID607
Feature activation-3.70e-02

Pos logit contributions
 feat+0.003
 strong+0.003
 vines+0.003
 clever+0.003
 intelligent+0.003
Neg logit contributions
omm-0.013
igsaw-0.011
stand-0.010
ifying-0.010
work-0.010
Token dad
Token ID9955
Feature activation-4.90e-02

Pos logit contributions
omm+0.070
stand+0.057
ifying+0.057
igsaw+0.056
+0.055
Neg logit contributions
 feat-0.050
 intelligent-0.047
 clever-0.045
 strong-0.045
 brave-0.044
Token finished
Token ID5201
Feature activation-7.97e-02

Pos logit contributions
omm+0.097
stand+0.076
igsaw+0.076
+0.075
bell+0.071
Neg logit contributions
 feat-0.061
 observ-0.058
 cautious-0.057
 agile-0.056
 courageous-0.056
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
omm+0.171
ifying+0.148
ifier+0.140
stand+0.138
igsaw+0.133
Neg logit contributions
 strong-0.081
 feat-0.081
 intelligent-0.076
 clever-0.075
 tall-0.073
Token the
Token ID262
Feature activation+1.08e-03

Pos logit contributions
 feat+0.016
 observ+0.016
 cautious+0.015
 agile+0.015
 courageous+0.015
Neg logit contributions
omm-0.023
stand-0.018
igsaw-0.018
-0.018
ifying-0.017
Token girl
Token ID2576
Feature activation-1.33e-02

Pos logit contributions
 feat+0.001
 observ+0.001
 agile+0.001
 cautious+0.001
cut+0.001
Neg logit contributions
omm-0.003
igsaw-0.002
stand-0.002
-0.002
bell-0.002
Token thanked
Token ID26280
Feature activation+5.49e-03

Pos logit contributions
omm+0.025
stand+0.020
+0.019
igsaw+0.019
bell+0.018
Neg logit contributions
 feat-0.017
 observ-0.017
 agile-0.016
 cautious-0.016
 courageous-0.016
Token him
Token ID683
Feature activation+1.74e-03

Pos logit contributions
 feat+0.007
 observ+0.007
 intelligent+0.006
 cautious+0.006
 courageous+0.006
Neg logit contributions
omm-0.011
igsaw-0.008
stand-0.008
-0.008
ifying-0.008
Tokened
Token ID276
Feature activation-1.57e-02

Pos logit contributions
 feat+0.028
cut+0.026
 agile+0.026
 observ+0.026
 cautious+0.025
Neg logit contributions
omm-0.037
ifying-0.031
bell-0.030
stand-0.030
work-0.030
Token out
Token ID503
Feature activation+5.03e-03

Pos logit contributions
omm+0.034
igsaw+0.027
stand+0.027
+0.026
ifying+0.025
Neg logit contributions
 feat-0.018
 observ-0.016
 cautious-0.016
 courageous-0.016
 intelligent-0.016
Token from
Token ID422
Feature activation-8.51e-03

Pos logit contributions
 observ+0.008
 feat+0.007
 redd+0.007
 cautious+0.007
 agile+0.007
Neg logit contributions
omm-0.008
stand-0.006
igsaw-0.006
bell-0.006
ifying-0.006
Token under
Token ID739
Feature activation-1.06e-02

Pos logit contributions
omm+0.018
igsaw+0.016
stand+0.015
work+0.015
+0.015
Neg logit contributions
 feat-0.008
 observ-0.008
 cautious-0.007
 agile-0.007
 courageous-0.007
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
omm+0.030
igsaw+0.024
stand+0.024
+0.024
bell+0.024
Neg logit contributions
 feat-0.006
 vines-0.004
 strong-0.004
 brave-0.004
 intelligent-0.004
Token t
Token ID256
Feature activation-7.95e-02

Pos logit contributions
omm+0.025
igsaw+0.019
stand+0.019
+0.019
ifying+0.018
Neg logit contributions
 feat-0.016
 observ-0.015
 intelligent-0.015
 vines-0.015
 cautious-0.015
Tokenarp
Token ID5117
Feature activation-3.56e-02

Pos logit contributions
omm+0.168
igsaw+0.154
stand+0.146
+0.146
 School+0.141
Neg logit contributions
rows-0.076
 feat-0.067
an-0.062
ling-0.059
cest-0.056
Token.
Token ID13
Feature activation-8.87e-04

Pos logit contributions
omm+0.082
igsaw+0.070
+0.067
ifying+0.066
stand+0.065
Neg logit contributions
cut-0.032
 feat-0.032
 vines-0.028
 intelligent-0.027
 strong-0.027
Token They
Token ID1119
Feature activation-3.53e-03

Pos logit contributions
omm+0.002
stand+0.001
igsaw+0.001
ifying+0.001
+0.001
Neg logit contributions
 feat-0.001
 vines-0.001
 observ-0.001
 cautious-0.001
 intelligent-0.001
Token saw
Token ID2497
Feature activation+2.67e-02

Pos logit contributions
omm+0.008
stand+0.007
igsaw+0.007
+0.007
ifying+0.007
Neg logit contributions
 feat-0.003
 observ-0.003
 cautious-0.003
 courageous-0.003
 agile-0.003
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
 feat+0.036
 observ+0.035
 cautious+0.034
 agile+0.034
 courageous+0.034
Neg logit contributions
omm-0.046
work-0.037
stand-0.037
igsaw-0.036
ifying-0.034
Token blew
Token ID17948
Feature activation-1.92e-03

Pos logit contributions
omm+0.053
stand+0.042
+0.041
igsaw+0.041
work+0.040
Neg logit contributions
 feat-0.034
 observ-0.033
 courageous-0.031
 cautious-0.031
 agile-0.031
Token fiercely
Token ID33285
Feature activation+1.16e-02

Pos logit contributions
omm+0.003
igsaw+0.003
stand+0.003
+0.003
ifying+0.003
Neg logit contributions
 feat-0.003
 observ-0.003
 cautious-0.002
 agile-0.002
 courageous-0.002
Token outside
Token ID2354
Feature activation+5.85e-03

Pos logit contributions
 feat+0.015
 observ+0.014
 cautious+0.014
 courageous+0.014
 agile+0.014
Neg logit contributions
omm-0.022
igsaw-0.017
stand-0.017
-0.017
ifying-0.016
Token.
Token ID13
Feature activation-7.69e-03

Pos logit contributions
 feat+0.009
 observ+0.009
 cautious+0.009
 agile+0.009
 courageous+0.009
Neg logit contributions
omm-0.009
stand-0.007
-0.007
igsaw-0.007
ifying-0.007
Token Inside
Token ID14384
Feature activation-9.29e-03

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.010
+0.009
ifying+0.009
Neg logit contributions
 feat-0.011
 intelligent-0.011
 observ-0.011
 agile-0.010
 cautious-0.010
Token Grand
Token ID5675
Feature activation-7.95e-02

Pos logit contributions
omm+0.017
+0.014
stand+0.014
igsaw+0.014
work+0.013
Neg logit contributions
 observ-0.012
 feat-0.012
 cautious-0.012
 courageous-0.012
 agile-0.012
Tokenma
Token ID2611
Feature activation-5.74e-02

Pos logit contributions
omm+0.138
ifying+0.121
stand+0.110
igsaw+0.109
+0.108
Neg logit contributions
 feat-0.113
 intelligent-0.101
 wond-0.098
 careful-0.096
 clever-0.096
Token's
Token ID338
Feature activation-3.51e-02

Pos logit contributions
omm+0.090
stand+0.068
igsaw+0.066
+0.065
bell+0.061
Neg logit contributions
 feat-0.093
 observ-0.090
 cautious-0.088
 agile-0.088
 courageous-0.087
Token house
Token ID2156
Feature activation-2.59e-02

Pos logit contributions
omm+0.063
igsaw+0.048
stand+0.048
+0.047
ifying+0.045
Neg logit contributions
 feat-0.051
 observ-0.047
 cautious-0.047
 intelligent-0.047
 courageous-0.047
Token,
Token ID11
Feature activation-1.86e-03

Pos logit contributions
omm+0.037
stand+0.025
+0.024
igsaw+0.024
bell+0.024
Neg logit contributions
 feat-0.047
 observ-0.045
 agile-0.045
 cautious-0.045
 courageous-0.044
Token Doug
Token ID15115
Feature activation+7.19e-03

Pos logit contributions
omm+0.004
stand+0.003
igsaw+0.003
+0.003
ifying+0.003
Neg logit contributions
 feat-0.002
 observ-0.002
 cautious-0.002
 agile-0.002
 courageous-0.002
Token sailed
Token ID33835
Feature activation+2.23e-02

Pos logit contributions
 observ+0.016
 redd+0.016
 agile+0.016
 vines+0.016
 courageous+0.015
Neg logit contributions
omm-0.014
stand-0.011
work-0.011
bell-0.011
acey-0.010
Token for
Token ID329
Feature activation-1.49e-02

Pos logit contributions
 observ+0.036
cod+0.036
 feat+0.035
 unnecessary+0.035
 harmful+0.034
Neg logit contributions
omm-0.026
stand-0.024
:-0.022
bell-0.021
 Ryan-0.020
Token days
Token ID1528
Feature activation-2.17e-02

Pos logit contributions
omm+0.029
stand+0.024
igsaw+0.024
+0.023
ifying+0.022
Neg logit contributions
 feat-0.018
 observ-0.017
cut-0.016
 cautious-0.016
 agile-0.016
Token,
Token ID11
Feature activation+2.98e-03

Pos logit contributions
omm+0.040
igsaw+0.030
stand+0.030
+0.030
ifying+0.028
Neg logit contributions
 feat-0.031
 observ-0.029
 cautious-0.028
 courageous-0.028
 agile-0.028
Token exploring
Token ID13504
Feature activation+1.48e-02

Pos logit contributions
 feat+0.004
 redd+0.004
 unnecessary+0.004
 observ+0.004
 dangers+0.004
Neg logit contributions
omm-0.004
stand-0.004
work-0.003
-0.003
 ago-0.003
Token unch
Token ID16258
Feature activation-7.95e-02

Pos logit contributions
 observ+0.025
 redd+0.025
 cautious+0.024
 unnecessary+0.024
cut+0.024
Neg logit contributions
omm-0.013
:-0.013
 theater-0.013
 Wed-0.012
igsaw-0.012
Tokenarted
Token ID19112
Feature activation+2.49e-03

Pos logit contributions
omm+0.168
+0.160
stand+0.158
ifying+0.143
ifier+0.135
Neg logit contributions
 feat-0.077
 vines-0.072
 intelligent-0.072
 observ-0.067
 agile-0.067
Token waters
Token ID10150
Feature activation-5.51e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 redd+0.003
 cautious+0.003
 intelligent+0.003
Neg logit contributions
omm-0.004
stand-0.004
igsaw-0.004
work-0.004
ifying-0.004
Token until
Token ID1566
Feature activation-4.92e-03

Pos logit contributions
omm+0.008
stand+0.007
ifying+0.006
igsaw+0.006
work+0.006
Neg logit contributions
 observ-0.009
 feat-0.009
 courageous-0.008
 cautious-0.008
 redd-0.008
Token he
Token ID339
Feature activation-1.11e-02

Pos logit contributions
omm+0.011
igsaw+0.009
stand+0.009
+0.009
ifying+0.008
Neg logit contributions
 feat-0.005
 observ-0.005
 cautious-0.005
 courageous-0.005
 agile-0.005
Token came
Token ID1625
Feature activation-1.12e-02

Pos logit contributions
omm+0.020
stand+0.016
+0.015
work+0.015
igsaw+0.014
Neg logit contributions
 feat-0.015
 observ-0.015
 cautious-0.014
 agile-0.014
 courageous-0.014
Token the
Token ID262
Feature activation-4.64e-03

Pos logit contributions
omm+0.002
igsaw+0.002
stand+0.002
+0.002
ifying+0.002
Neg logit contributions
 feat-0.001
 observ-0.001
 cautious-0.001
 courageous-0.001
 agile-0.001
Token magic
Token ID5536
Feature activation-1.30e-02

Pos logit contributions
omm+0.008
stand+0.006
igsaw+0.006
+0.006
work+0.005
Neg logit contributions
 feat-0.007
 observ-0.006
cking-0.006
 agile-0.006
 cautious-0.006
Token pen
Token ID3112
Feature activation+6.55e-03

Pos logit contributions
omm+0.026
stand+0.021
igsaw+0.021
+0.020
ifying+0.020
Neg logit contributions
 feat-0.016
 observ-0.015
 cautious-0.014
 agile-0.014
 courageous-0.014
Token and
Token ID290
Feature activation-2.53e-03

Pos logit contributions
 feat+0.009
 observ+0.009
 cautious+0.008
 wond+0.008
 agile+0.008
Neg logit contributions
omm-0.012
stand-0.009
igsaw-0.009
-0.009
ifying-0.009
Token felt
Token ID2936
Feature activation+2.05e-02

Pos logit contributions
omm+0.005
stand+0.004
igsaw+0.004
work+0.004
+0.004
Neg logit contributions
 feat-0.003
 observ-0.003
 cautious-0.003
 courageous-0.003
 agile-0.003
Token guilty
Token ID6717
Feature activation+3.65e-02

Pos logit contributions
 feat+0.031
 vines+0.027
 observ+0.026
 redd+0.026
rows+0.025
Neg logit contributions
omm-0.032
ifying-0.027
stand-0.025
igsaw-0.024
work-0.024
Token for
Token ID329
Feature activation-8.59e-03

Pos logit contributions
 feat+0.051
 observ+0.047
 agile+0.046
 cautious+0.046
 courageous+0.045
Neg logit contributions
omm-0.068
stand-0.053
igsaw-0.052
-0.050
work-0.050
Token causing
Token ID6666
Feature activation+3.34e-03

Pos logit contributions
omm+0.014
stand+0.011
igsaw+0.011
+0.010
work+0.010
Neg logit contributions
 feat-0.013
 redd-0.012
 cautious-0.012
 observ-0.012
 agile-0.012
Token so
Token ID523
Feature activation+3.92e-02

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
cut+0.004
 agile+0.004
Neg logit contributions
omm-0.007
stand-0.005
igsaw-0.005
-0.005
ifying-0.005
Token much
Token ID881
Feature activation-4.17e-03

Pos logit contributions
 feat+0.043
 observ+0.040
 cautious+0.039
 agile+0.038
 wond+0.038
Neg logit contributions
omm-0.082
igsaw-0.067
stand-0.066
-0.065
ifying-0.063
Token damage
Token ID2465
Feature activation-4.57e-03

Pos logit contributions
omm+0.009
stand+0.007
igsaw+0.007
+0.007
ifying+0.006
Neg logit contributions
 feat-0.005
 observ-0.005
 cautious-0.004
 agile-0.004
 courageous-0.004
Token three
Token ID1115
Feature activation-6.28e-03

Pos logit contributions
omm+0.023
stand+0.018
igsaw+0.017
+0.017
bell+0.015
Neg logit contributions
 feat-0.019
 observ-0.018
 intelligent-0.017
 strong-0.017
 courageous-0.017
Token!"
Token ID2474
Feature activation+3.30e-03

Pos logit contributions
omm+0.008
igsaw+0.005
 ago+0.005
stand+0.005
+0.005
Neg logit contributions
 observ-0.011
 redd-0.011
 unnecessary-0.011
 cautious-0.011
 secure-0.011
Token they
Token ID484
Feature activation+1.12e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 intelligent+0.004
 cautious+0.004
 courageous+0.004
Neg logit contributions
omm-0.006
igsaw-0.005
-0.005
stand-0.005
ifying-0.005
Token shouted
Token ID17293
Feature activation+7.80e-03

Pos logit contributions
 feat+0.001
 observ+0.001
 cautious+0.001
 courageous+0.001
 agile+0.001
Neg logit contributions
omm-0.002
igsaw-0.002
stand-0.002
-0.002
ifying-0.002
Token,
Token ID11
Feature activation-7.51e-03

Pos logit contributions
 feat+0.010
 observ+0.009
 cautious+0.009
 intelligent+0.009
 courageous+0.009
Neg logit contributions
omm-0.016
stand-0.012
igsaw-0.012
-0.012
ifying-0.012
Token as
Token ID355
Feature activation+5.34e-02

Pos logit contributions
omm+0.015
igsaw+0.012
stand+0.012
+0.012
ifying+0.011
Neg logit contributions
 feat-0.009
 observ-0.009
 cautious-0.008
 intelligent-0.008
 courageous-0.008
Token they
Token ID484
Feature activation-1.55e-02

Pos logit contributions
 feat+0.086
 observ+0.082
 cautious+0.081
 courageous+0.080
 agile+0.080
Neg logit contributions
omm-0.086
igsaw-0.064
stand-0.063
-0.062
ifying-0.059
Token counted
Token ID14789
Feature activation-3.32e-02

Pos logit contributions
omm+0.031
igsaw+0.025
stand+0.025
+0.024
ifier+0.023
Neg logit contributions
 feat-0.019
 observ-0.018
 agile-0.018
 cautious-0.018
 vines-0.017
Token out
Token ID503
Feature activation+5.06e-03

Pos logit contributions
omm+0.064
stand+0.050
igsaw+0.050
+0.049
work+0.047
Neg logit contributions
 feat-0.042
 observ-0.041
 cautious-0.040
 courageous-0.040
 agile-0.039
Token loud
Token ID7812
Feature activation+3.87e-03

Pos logit contributions
 feat+0.007
 observ+0.006
 cautious+0.006
 courageous+0.006
 agile+0.006
Neg logit contributions
omm-0.010
igsaw-0.008
stand-0.008
-0.007
ifying-0.007
Token.
Token ID13
Feature activation-8.72e-03

Pos logit contributions
 feat+0.005
 observ+0.005
 cautious+0.005
 courageous+0.005
 agile+0.005
Neg logit contributions
omm-0.007
igsaw-0.005
stand-0.005
-0.005
ifying-0.005
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
omm+0.030
+0.022
stand+0.022
igsaw+0.021
etti+0.019
Neg logit contributions
 feat-0.034
 observ-0.033
 agile-0.033
 courageous-0.032
 cautious-0.032
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
omm+0.015
+0.012
stand+0.011
igsaw+0.010
ifying+0.010
Neg logit contributions
 feat-0.018
 observ-0.017
 agile-0.017
 unnecessary-0.017
 courageous-0.017
Token"
Token ID1
Feature activation+3.53e-02

Pos logit contributions
omm+0.022
+0.018
stand+0.017
ifying+0.017
igsaw+0.017
Neg logit contributions
 feat-0.017
 observ-0.016
 biggest-0.016
 unnecessary-0.016
 freedom-0.016
TokenWow
Token ID22017
Feature activation-3.37e-02

Pos logit contributions
 feat+0.040
 observ+0.038
 cautious+0.037
 courageous+0.036
 agile+0.036
Neg logit contributions
omm-0.074
igsaw-0.059
stand-0.059
-0.058
ifying-0.056
Token!"
Token ID2474
Feature activation+1.89e-02

Pos logit contributions
omm+0.047
igsaw+0.035
etti+0.035
+0.034
work+0.033
Neg logit contributions
 cautious-0.058
 observ-0.057
cut-0.057
 feat-0.057
 agile-0.056
Token said
Token ID531
Feature activation+3.93e-02

Pos logit contributions
 feat+0.022
 observ+0.021
 cautious+0.020
 agile+0.020
 courageous+0.020
Neg logit contributions
omm-0.038
stand-0.031
igsaw-0.030
-0.030
ifying-0.029
Token Andy
Token ID12382
Feature activation+1.83e-02

Pos logit contributions
 feat+0.054
 observ+0.052
 cautious+0.051
 agile+0.050
 wond+0.050
Neg logit contributions
omm-0.071
igsaw-0.054
-0.054
stand-0.053
work-0.053
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
 feat+0.028
 vines+0.027
 observ+0.027
 wond+0.027
 dangers+0.027
Neg logit contributions
omm-0.030
stand-0.023
igsaw-0.022
work-0.021
bell-0.021
Token
Token ID220
Feature activation-1.20e-02

Pos logit contributions
omm+0.033
stand+0.026
+0.026
igsaw+0.025
ifying+0.024
Neg logit contributions
 feat-0.021
 observ-0.020
 cautious-0.019
 agile-0.019
 courageous-0.019
Token
Token ID198
Feature activation-4.08e-03

Pos logit contributions
omm+0.017
stand+0.012
+0.012
igsaw+0.011
 Today+0.010
Neg logit contributions
 observ-0.022
 feat-0.022
 agile-0.021
 courageous-0.021
 redd-0.021
Token
Token ID198
Feature activation-8.03e-03

Pos logit contributions
omm+0.006
stand+0.005
+0.005
igsaw+0.004
bell+0.004
Neg logit contributions
 redd-0.007
 feat-0.007
 unnecessary-0.007
 observ-0.007
 agile-0.007
Tokenâ
Token ID22940
Feature activation+8.37e-03

Pos logit contributions
omm+0.057
igsaw+0.044
stand+0.044
+0.043
ifying+0.042
Neg logit contributions
 feat-0.042
 observ-0.039
 cautious-0.039
 courageous-0.038
 intelligent-0.038
Token
Token ID26391
Feature activation-1.82e-02

Pos logit contributions
 feat+0.011
 observ+0.010
 cautious+0.010
 courageous+0.010
 intelligent+0.010
Neg logit contributions
omm-0.016
ifying-0.012
-0.012
igsaw-0.012
stand-0.012
Token
Token ID8151
Feature activation-9.91e-03

Pos logit contributions
omm+0.031
igsaw+0.023
stand+0.022
+0.022
ifying+0.022
Neg logit contributions
 feat-0.028
 observ-0.026
 cautious-0.026
 courageous-0.026
 agile-0.025
Tokent
Token ID83
Feature activation-6.63e-03

Pos logit contributions
omm+0.015
igsaw+0.011
ifier+0.010
stand+0.010
+0.010
Neg logit contributions
 feat-0.016
 intelligent-0.015
 strong-0.015
 courageous-0.015
 observ-0.015
Token make
Token ID787
Feature activation+1.73e-02

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.010
+0.010
ifying+0.009
Neg logit contributions
 feat-0.009
 observ-0.008
 cautious-0.008
 intelligent-0.008
 courageous-0.008
Token one
Token ID530
Feature activation+5.14e-02

Pos logit contributions
 feat+0.021
 observ+0.020
 cautious+0.020
cut+0.019
 agile+0.019
Neg logit contributions
omm-0.034
stand-0.028
igsaw-0.028
-0.025
work-0.025
Token happy
Token ID3772
Feature activation-2.62e-02

Pos logit contributions
 vines+0.073
 observ+0.072
 feat+0.072
ts+0.071
 agile+0.070
Neg logit contributions
omm-0.082
igsaw-0.070
stand-0.067
work-0.066
bell-0.064
Tokenâ
Token ID22940
Feature activation+1.77e-02

Pos logit contributions
omm+0.045
stand+0.036
igsaw+0.035
+0.034
work+0.033
Neg logit contributions
 feat-0.037
 observ-0.037
 agile-0.036
 cautious-0.035
 vines-0.035
Token
Token ID26391
Feature activation-1.11e-02

Pos logit contributions
 feat+0.025
 observ+0.023
 cautious+0.023
 courageous+0.023
 agile+0.023
Neg logit contributions
omm-0.033
igsaw-0.025
stand-0.025
-0.025
ifying-0.023
Token
Token ID447
Feature activation+1.37e-02

Pos logit contributions
omm+0.019
stand+0.015
+0.014
igsaw+0.014
ifying+0.013
Neg logit contributions
 feat-0.017
 observ-0.016
 cautious-0.016
cut-0.016
 agile-0.016
Token
Token ID251
Feature activation+1.53e-02

Pos logit contributions
 feat+0.018
 observ+0.018
 cautious+0.017
 wond+0.017
 agile+0.017
Neg logit contributions
omm-0.026
-0.022
igsaw-0.020
stand-0.020
ifying-0.019
Token rub
Token ID6437
Feature activation-7.20e-03

Pos logit contributions
 feat+0.004
 observ+0.003
 cautious+0.003
 courageous+0.003
 intelligent+0.003
Neg logit contributions
omm-0.007
igsaw-0.005
stand-0.005
-0.005
ifying-0.005
Token it
Token ID340
Feature activation+2.78e-04

Pos logit contributions
omm+0.014
stand+0.012
ifying+0.011
igsaw+0.011
+0.011
Neg logit contributions
 feat-0.009
 wond-0.009
 observ-0.009
 cautious-0.008
 courageous-0.008
Token."
Token ID526
Feature activation+1.70e-03

Pos logit contributions
 wond+0.000
 feat+0.000
 observ+0.000
 cautious+0.000
 courageous+0.000
Neg logit contributions
omm-0.000
stand-0.000
bell-0.000
 Today-0.000
work-0.000
Token
Token ID198
Feature activation+1.81e-03

Pos logit contributions
 feat+0.003
 observ+0.003
cut+0.003
 cautious+0.003
 wond+0.003
Neg logit contributions
omm-0.002
stand-0.002
-0.002
igsaw-0.002
bell-0.002
Token
Token ID198
Feature activation-3.21e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 vines+0.004
 courageous+0.004
 unnecessary+0.004
Neg logit contributions
omm-0.002
stand-0.002
-0.001
bell-0.001
 Today-0.001
Token"
Token ID1
Feature activation+4.32e-02

Pos logit contributions
omm+0.005
stand+0.004
bell+0.004
igsaw+0.004
+0.004
Neg logit contributions
 feat-0.005
 observ-0.005
 wond-0.005
 unnecessary-0.005
 vines-0.005
TokenMe
Token ID5308
Feature activation-2.42e-02

Pos logit contributions
 feat+0.065
 secure+0.058
cut+0.058
ling+0.058
 traps+0.058
Neg logit contributions
omm-0.066
-0.061
stand-0.056
ifying-0.053
bell-0.050
Token too
Token ID1165
Feature activation+4.31e-02

Pos logit contributions
omm+0.027
+0.017
igsaw+0.016
stand+0.016
work+0.015
Neg logit contributions
 feat-0.051
 observ-0.049
 wond-0.049
 cautious-0.048
 agile-0.048
Token!"
Token ID2474
Feature activation+8.40e-03

Pos logit contributions
 observ+0.053
 feat+0.052
 wond+0.050
 cautious+0.049
cut+0.048
Neg logit contributions
omm-0.083
stand-0.064
igsaw-0.064
-0.062
bell-0.062
Token Lily
Token ID20037
Feature activation+1.81e-02

Pos logit contributions
 feat+0.016
 observ+0.016
cut+0.015
 cautious+0.015
 wond+0.015
Neg logit contributions
omm-0.011
stand-0.008
igsaw-0.008
-0.007
bell-0.007
Token said
Token ID531
Feature activation+3.31e-02

Pos logit contributions
 feat+0.021
 observ+0.020
 courageous+0.019
 wond+0.019
 agile+0.019
Neg logit contributions
omm-0.037
stand-0.030
igsaw-0.029
-0.029
bell-0.027
Token it
Token ID340
Feature activation+1.57e-02

Pos logit contributions
 feat+0.010
 observ+0.009
 cautious+0.009
 courageous+0.008
 agile+0.008
Neg logit contributions
omm-0.021
igsaw-0.017
stand-0.017
-0.017
ifying-0.016
Token.
Token ID13
Feature activation+1.19e-03

Pos logit contributions
 feat+0.020
 observ+0.019
 cautious+0.019
 courageous+0.018
 agile+0.018
Neg logit contributions
omm-0.030
igsaw-0.024
stand-0.024
-0.023
ifying-0.022
Token But
Token ID887
Feature activation+1.35e-02

Pos logit contributions
 feat+0.001
 observ+0.001
 cautious+0.001
 agile+0.001
 courageous+0.001
Neg logit contributions
omm-0.002
igsaw-0.002
stand-0.002
work-0.002
bell-0.002
Token you
Token ID345
Feature activation-2.37e-03

Pos logit contributions
 feat+0.019
 observ+0.018
 cautious+0.018
 intelligent+0.018
 agile+0.018
Neg logit contributions
omm-0.024
igsaw-0.019
stand-0.019
-0.018
ifying-0.018
Token have
Token ID423
Feature activation-1.79e-02

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
ifying+0.004
work+0.004
Neg logit contributions
 strong-0.003
 feat-0.003
 brave-0.003
 clever-0.003
 intelligent-0.003
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
omm+0.035
stand+0.028
igsaw+0.028
+0.027
ifying+0.026
Neg logit contributions
 feat-0.023
 observ-0.021
 cautious-0.021
 courageous-0.021
 agile-0.020
Token give
Token ID1577
Feature activation-1.53e-02

Pos logit contributions
 feat+0.013
 careful+0.012
 cautious+0.012
 strong+0.012
 observ+0.011
Neg logit contributions
omm-0.022
igsaw-0.018
ifying-0.017
work-0.017
-0.017
Token it
Token ID340
Feature activation+9.17e-03

Pos logit contributions
omm+0.028
stand+0.023
igsaw+0.022
ifier+0.021
+0.021
Neg logit contributions
 feat-0.020
 observ-0.019
 cautious-0.019
 wond-0.019
 courageous-0.019
Token back
Token ID736
Feature activation-9.58e-03

Pos logit contributions
 feat+0.011
 observ+0.011
 cautious+0.010
 courageous+0.010
 intelligent+0.010
Neg logit contributions
omm-0.018
igsaw-0.014
stand-0.014
-0.014
ifying-0.014
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
omm+0.018
stand+0.014
igsaw+0.014
+0.013
ifying+0.013
Neg logit contributions
 feat-0.013
 observ-0.012
 cautious-0.012
 courageous-0.012
 agile-0.012
Token me
Token ID502
Feature activation+2.02e-02

Pos logit contributions
 feat+0.049
 observ+0.046
 cautious+0.046
 courageous+0.046
 agile+0.045
Neg logit contributions
omm-0.047
stand-0.035
igsaw-0.033
-0.033
ifying-0.031
Token One
Token ID1881
Feature activation+3.01e-02

Pos logit contributions
omm+0.002
stand+0.001
+0.001
igsaw+0.001
ifying+0.001
Neg logit contributions
 feat-0.002
 observ-0.002
 cautious-0.002
 courageous-0.002
 agile-0.002
Token day
Token ID1110
Feature activation-8.58e-03

Pos logit contributions
 feat+0.047
 observ+0.047
 wounds+0.046
 agile+0.046
 vines+0.045
Neg logit contributions
omm-0.043
stand-0.034
work-0.034
bell-0.033
igsaw-0.033
Token the
Token ID262
Feature activation-3.70e-03

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.010
+0.009
ifying+0.009
Neg logit contributions
 feat-0.015
 observ-0.014
 cautious-0.014
 courageous-0.014
 agile-0.014
Token jet
Token ID12644
Feature activation-1.67e-02

Pos logit contributions
omm+0.006
stand+0.005
+0.004
work+0.004
igsaw+0.004
Neg logit contributions
 feat-0.005
 observ-0.005
 cautious-0.005
 agile-0.005
 courageous-0.005
Token started
Token ID2067
Feature activation-9.46e-03

Pos logit contributions
omm+0.022
bell+0.020
work+0.020
stand+0.019
 ago+0.017
Neg logit contributions
 observ-0.027
 redd-0.026
 feat-0.026
 unnecessary-0.026
 wounds-0.025
Token flying
Token ID7348
Feature activation+1.85e-02

Pos logit contributions
omm+0.016
stand+0.013
work+0.012
igsaw+0.012
+0.012
Neg logit contributions
 feat-0.013
 observ-0.013
 courageous-0.012
 redd-0.012
 agile-0.012
Token lower
Token ID2793
Feature activation-1.77e-02

Pos logit contributions
 observ+0.030
 feat+0.028
 wounds+0.027
 redd+0.027
cut+0.027
Neg logit contributions
omm-0.022
work-0.021
bell-0.019
:-0.019
ifying-0.019
Token and
Token ID290
Feature activation+2.20e-02

Pos logit contributions
omm+0.025
stand+0.019
work+0.019
ifying+0.018
igsaw+0.018
Neg logit contributions
 observ-0.029
 feat-0.029
 courageous-0.028
 wond-0.028
 cautious-0.027
Token lower
Token ID2793
Feature activation-4.60e-04

Pos logit contributions
 feat+0.024
 observ+0.024
 cautious+0.022
cut+0.021
 courageous+0.021
Neg logit contributions
omm-0.037
work-0.036
bell-0.035
stand-0.033
ifying-0.032
Token.
Token ID13
Feature activation-9.05e-03

Pos logit contributions
omm+0.001
work+0.001
stand+0.001
ifying+0.000
 ago+0.000
Neg logit contributions
 observ-0.001
 courageous-0.001
 feat-0.001
 wond-0.001
 redd-0.001
Token It
Token ID632
Feature activation-1.49e-02

Pos logit contributions
omm+0.014
stand+0.010
+0.010
igsaw+0.010
ifying+0.009
Neg logit contributions
 feat-0.015
 observ-0.015
 cautious-0.014
 courageous-0.014
 agile-0.014
Token hopped
Token ID45230
Feature activation+2.23e-02

Pos logit contributions
omm+0.012
stand+0.010
+0.009
bell+0.009
work+0.009
Neg logit contributions
 observ-0.009
 feat-0.009
 agile-0.009
 sturdy-0.009
 redd-0.009
Token into
Token ID656
Feature activation+8.84e-03

Pos logit contributions
 feat+0.024
 observ+0.023
 cautious+0.022
 courageous+0.022
 agile+0.022
Neg logit contributions
omm-0.048
igsaw-0.039
stand-0.038
-0.038
ifying-0.036
Token the
Token ID262
Feature activation+6.28e-03

Pos logit contributions
 feat+0.009
 observ+0.009
 cautious+0.008
 courageous+0.008
 agile+0.008
Neg logit contributions
omm-0.019
stand-0.016
igsaw-0.016
-0.015
ifying-0.015
Token forest
Token ID8222
Feature activation+8.14e-03

Pos logit contributions
 feat+0.007
 intelligent+0.006
 strong+0.006
 observ+0.006
 cautious+0.006
Neg logit contributions
omm-0.014
-0.011
igsaw-0.011
stand-0.011
work-0.011
Token to
Token ID284
Feature activation+6.30e-03

Pos logit contributions
 feat+0.011
 observ+0.011
 agile+0.011
 wond+0.011
 redd+0.011
Neg logit contributions
omm-0.013
stand-0.011
igsaw-0.010
-0.010
work-0.010
Token find
Token ID1064
Feature activation+2.76e-02

Pos logit contributions
 feat+0.008
 observ+0.008
 agile+0.008
cut+0.008
 cautious+0.008
Neg logit contributions
omm-0.011
stand-0.009
igsaw-0.009
-0.009
work-0.008
Token some
Token ID617
Feature activation+1.35e-02

Pos logit contributions
bel+0.052
 agile+0.051
rated+0.051
cking+0.050
cut+0.050
Neg logit contributions
omm-0.029
ifier-0.023
stand-0.018
igsaw-0.017
 bath-0.016
Token food
Token ID2057
Feature activation+1.41e-02

Pos logit contributions
cut+0.013
 observ+0.013
cking+0.013
 agile+0.012
 feat+0.012
Neg logit contributions
omm-0.026
igsaw-0.021
stand-0.021
ifying-0.020
-0.020
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 feat+0.020
 observ+0.019
 agile+0.019
 courageous+0.018
 cautious+0.018
Neg logit contributions
omm-0.025
stand-0.020
-0.019
igsaw-0.019
work-0.018
Token Suddenly
Token ID24975
Feature activation+6.41e-03

Pos logit contributions
omm+0.021
igsaw+0.016
stand+0.016
+0.015
work+0.015
Neg logit contributions
 feat-0.017
 observ-0.016
 cautious-0.016
 intelligent-0.016
 courageous-0.016
Token,
Token ID11
Feature activation+1.77e-03

Pos logit contributions
 feat+0.013
 observ+0.012
 cautious+0.012
 agile+0.012
cut+0.012
Neg logit contributions
omm-0.008
stand-0.005
-0.005
igsaw-0.005
ifying-0.004
Token looked
Token ID3114
Feature activation+8.82e-03

Pos logit contributions
omm+0.000
stand+0.000
igsaw+0.000
+0.000
bell+0.000
Neg logit contributions
 feat-0.000
 observ-0.000
 cautious-0.000
 agile-0.000
 courageous-0.000
Token around
Token ID1088
Feature activation-1.09e-02

Pos logit contributions
 feat+0.010
 observ+0.009
cut+0.009
 agile+0.009
 cautious+0.009
Neg logit contributions
omm-0.016
ifying-0.014
bell-0.014
igsaw-0.013
work-0.013
Token and
Token ID290
Feature activation+2.69e-03

Pos logit contributions
omm+0.013
stand+0.010
 ago+0.009
igsaw+0.009
+0.009
Neg logit contributions
 redd-0.020
 observ-0.020
 feat-0.019
 agile-0.019
cut-0.018
Token saw
Token ID2497
Feature activation+2.73e-02

Pos logit contributions
 feat+0.003
 redd+0.003
cut+0.003
 vines+0.003
 biggest+0.003
Neg logit contributions
omm-0.005
stand-0.004
bell-0.004
ifying-0.004
work-0.004
Token a
Token ID257
Feature activation+7.28e-03

Pos logit contributions
 feat+0.046
 cautious+0.044
 observ+0.043
 agile+0.043
rated+0.043
Neg logit contributions
omm-0.036
work-0.027
stand-0.026
ifying-0.025
igsaw-0.024
Token bull
Token ID6473
Feature activation+4.40e-03

Pos logit contributions
 feat+0.006
 observ+0.006
 cautious+0.005
 agile+0.005
 courageous+0.005
Neg logit contributions
omm-0.017
stand-0.014
igsaw-0.014
-0.014
ifying-0.014
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
 feat+0.006
 observ+0.005
 agile+0.005
 cautious+0.005
 courageous+0.005
Neg logit contributions
omm-0.008
stand-0.007
-0.006
igsaw-0.006
ifying-0.006
Token The
Token ID383
Feature activation-1.06e-02

Pos logit contributions
omm+0.020
+0.016
stand+0.014
igsaw+0.014
bell+0.013
Neg logit contributions
 feat-0.024
 observ-0.023
 cautious-0.023
 courageous-0.023
 agile-0.022
Token bull
Token ID6473
Feature activation-5.97e-03

Pos logit contributions
omm+0.021
work+0.017
+0.016
 Today+0.016
bell+0.016
Neg logit contributions
 observ-0.011
 feat-0.011
 agile-0.011
ts-0.010
 secure-0.010
Token was
Token ID373
Feature activation-2.70e-02

Pos logit contributions
omm+0.011
+0.008
stand+0.008
bell+0.008
work+0.008
Neg logit contributions
 feat-0.008
 observ-0.008
 redd-0.007
 agile-0.007
 cautious-0.007
Token very
Token ID845
Feature activation+2.02e-02

Pos logit contributions
omm+0.060
igsaw+0.050
stand+0.049
+0.049
ifying+0.046
Neg logit contributions
 feat-0.027
 observ-0.025
 cautious-0.025
 intelligent-0.025
 strong-0.025
Token,
Token ID11
Feature activation-7.09e-03

Pos logit contributions
 feat+0.040
 vines+0.040
 observ+0.039
 cautious+0.039
 wond+0.039
Neg logit contributions
omm-0.033
stand-0.024
-0.024
igsaw-0.023
bell-0.021
Token "
Token ID366
Feature activation+1.53e-02

Pos logit contributions
omm+0.014
stand+0.011
+0.011
igsaw+0.010
bell+0.010
Neg logit contributions
 feat-0.009
 observ-0.009
 courageous-0.008
 agile-0.008
 wond-0.008
TokenI
Token ID40
Feature activation-3.66e-03

Pos logit contributions
 feat+0.021
 wounds+0.019
 vines+0.019
 cautious+0.019
 observ+0.018
Neg logit contributions
omm-0.027
stand-0.022
-0.022
igsaw-0.021
ifying-0.020
Token'm
Token ID1101
Feature activation-1.60e-02

Pos logit contributions
omm+0.008
igsaw+0.006
stand+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 courageous-0.004
 intelligent-0.004
Token glad
Token ID9675
Feature activation-1.39e-02

Pos logit contributions
omm+0.038
igsaw+0.031
stand+0.031
+0.031
ifying+0.030
Neg logit contributions
 feat-0.014
 observ-0.013
 cautious-0.012
 intelligent-0.012
 courageous-0.012
Token you
Token ID345
Feature activation+1.41e-02

Pos logit contributions
omm+0.020
stand+0.014
+0.014
igsaw+0.013
work+0.013
Neg logit contributions
cut-0.024
 feat-0.024
 cautious-0.023
 observ-0.023
 courageous-0.023
Token're
Token ID821
Feature activation-4.95e-03

Pos logit contributions
 feat+0.019
 observ+0.018
 cautious+0.018
 courageous+0.018
 intelligent+0.018
Neg logit contributions
omm-0.027
igsaw-0.021
stand-0.020
-0.020
ifying-0.019
Token having
Token ID1719
Feature activation-3.89e-03

Pos logit contributions
omm+0.010
stand+0.007
+0.007
igsaw+0.007
work+0.007
Neg logit contributions
 feat-0.006
 vines-0.006
 observ-0.006
 wond-0.006
 agile-0.006
Token a
Token ID257
Feature activation+2.24e-02

Pos logit contributions
omm+0.008
stand+0.006
+0.006
igsaw+0.006
ifying+0.006
Neg logit contributions
 feat-0.004
cut-0.004
 observ-0.004
 cautious-0.004
 agile-0.004
Token good
Token ID922
Feature activation-7.35e-03

Pos logit contributions
 feat+0.016
 intelligent+0.014
 observ+0.014
 strong+0.013
 clever+0.013
Neg logit contributions
omm-0.057
stand-0.048
igsaw-0.047
ifying-0.046
-0.046
Token time
Token ID640
Feature activation+2.72e-03

Pos logit contributions
omm+0.014
igsaw+0.011
ifying+0.011
stand+0.011
+0.011
Neg logit contributions
 feat-0.010
 intelligent-0.009
 observ-0.009
 cautious-0.009
 strong-0.009
Token!"
Token ID2474
Feature activation-2.99e-03

Pos logit contributions
 feat+0.004
 observ+0.003
 cautious+0.003
 courageous+0.003
 agile+0.003
Neg logit contributions
omm-0.006
stand-0.005
igsaw-0.005
-0.005
work-0.004
Token
Token ID198
Feature activation+4.91e-03

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
+0.004
ifying+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 intelligent-0.004
 courageous-0.004
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
 agile+0.009
 observ+0.009
 redd+0.009
 feat+0.009
 courageous+0.009
Neg logit contributions
omm-0.007
stand-0.005
-0.005
igsaw-0.005
bell-0.004
TokenAfter
Token ID3260
Feature activation-2.32e-02

Pos logit contributions
omm+0.007
stand+0.005
igsaw+0.005
+0.005
bell+0.005
Neg logit contributions
 feat-0.004
 observ-0.004
 courageous-0.004
 agile-0.004
 cautious-0.004
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
omm+0.052
ifying+0.040
igsaw+0.040
bell+0.039
stand+0.039
Neg logit contributions
 feat-0.024
 daring-0.023
 strong-0.023
 brave-0.023
 clever-0.022
Token exam
Token ID2814
Feature activation-1.47e-02

Pos logit contributions
omm+0.016
igsaw+0.011
stand+0.011
+0.010
bell+0.010
Neg logit contributions
 feat-0.024
 observ-0.023
 cautious-0.023
 agile-0.023
cut-0.022
Token,
Token ID11
Feature activation+1.78e-02

Pos logit contributions
omm+0.024
igsaw+0.017
stand+0.017
+0.017
ifying+0.016
Neg logit contributions
 feat-0.024
 observ-0.022
 cautious-0.022
 courageous-0.022
 agile-0.022
Token Tim
Token ID5045
Feature activation+1.44e-02

Pos logit contributions
 feat+0.026
 observ+0.024
 cautious+0.024
 courageous+0.024
 agile+0.024
Neg logit contributions
omm-0.031
igsaw-0.024
stand-0.024
-0.023
ifying-0.022
Token felt
Token ID2936
Feature activation+1.26e-02

Pos logit contributions
 feat+0.024
 observ+0.023
 cautious+0.022
 courageous+0.022
 agile+0.022
Neg logit contributions
omm-0.023
stand-0.017
igsaw-0.017
-0.016
ifying-0.015
Token very
Token ID845
Feature activation+4.22e-02

Pos logit contributions
 feat+0.018
 vines+0.015
 observ+0.015
cut+0.015
 agile+0.014
Neg logit contributions
omm-0.023
stand-0.019
-0.018
igsaw-0.018
ifying-0.017
Token happy
Token ID3772
Feature activation-2.72e-02

Pos logit contributions
 vines+0.048
 feat+0.046
cut+0.043
 prey+0.040
 deer+0.040
Neg logit contributions
omm-0.083
-0.068
igsaw-0.065
stand-0.065
bell-0.064
Token were
Token ID547
Feature activation-2.58e-02

Pos logit contributions
 feat+0.011
 observ+0.011
 cautious+0.010
 intelligent+0.010
 courageous+0.010
Neg logit contributions
omm-0.013
igsaw-0.010
stand-0.010
-0.010
ifying-0.009
Token very
Token ID845
Feature activation+2.85e-02

Pos logit contributions
omm+0.045
stand+0.032
work+0.032
+0.031
igsaw+0.031
Neg logit contributions
 feat-0.036
cut-0.035
 vines-0.034
 observ-0.034
 agile-0.034
Token happy
Token ID3772
Feature activation-2.38e-02

Pos logit contributions
 feat+0.021
 observ+0.018
 vines+0.018
 cautious+0.018
cut+0.017
Neg logit contributions
omm-0.070
igsaw-0.058
stand-0.057
-0.057
ifying-0.055
Token.
Token ID13
Feature activation+8.18e-04

Pos logit contributions
omm+0.047
igsaw+0.037
stand+0.037
+0.036
ifying+0.035
Neg logit contributions
 feat-0.030
 observ-0.028
 cautious-0.028
 agile-0.027
 courageous-0.027
Token\n
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 feat+0.001
 wond+0.001
 intelligent+0.001
 cautious+0.001
 observ+0.001
Neg logit contributions
omm-0.002
igsaw-0.001
stand-0.001
work-0.001
ifier-0.001
Token\n
Token ID198
Feature activation-1.71e-02

Pos logit contributions
omm+0.025
igsaw+0.020
stand+0.019
+0.019
ifying+0.018
Neg logit contributions
 feat-0.016
 observ-0.015
 cautious-0.015
 intelligent-0.015
 courageous-0.015
TokenOne
Token ID3198
Feature activation+1.69e-02

Pos logit contributions
omm+0.031
igsaw+0.025
stand+0.024
+0.024
work+0.023
Neg logit contributions
 feat-0.024
 observ-0.023
 cautious-0.023
 intelligent-0.023
 courageous-0.022
Token day
Token ID1110
Feature activation-1.06e-02

Pos logit contributions
 feat+0.021
 observ+0.019
 cautious+0.019
 intelligent+0.019
 courageous+0.019
Neg logit contributions
omm-0.034
stand-0.027
igsaw-0.027
-0.027
ifying-0.025
Token,
Token ID11
Feature activation+4.03e-03

Pos logit contributions
omm+0.019
igsaw+0.016
stand+0.015
+0.014
work+0.014
Neg logit contributions
 feat-0.015
 brave-0.014
 intelligent-0.014
 observ-0.014
 strong-0.014
Token Abby
Token ID49220
Feature activation-1.81e-03

Pos logit contributions
 feat+0.006
 intelligent+0.005
 observ+0.005
 strong+0.005
 courageous+0.005
Neg logit contributions
omm-0.007
igsaw-0.006
stand-0.006
-0.005
ifier-0.005
Token was
Token ID373
Feature activation-2.36e-02

Pos logit contributions
omm+0.003
stand+0.003
+0.003
bell+0.003
igsaw+0.003
Neg logit contributions
 feat-0.002
 observ-0.002
 agile-0.002
 courageous-0.002
 cautious-0.002
Token They
Token ID1119
Feature activation+4.70e-03

Pos logit contributions
 feat+0.030
 strong+0.029
 intelligent+0.029
 graceful+0.029
 tranquil+0.028
Neg logit contributions
omm-0.039
igsaw-0.030
work-0.029
stand-0.028
-0.027
Token have
Token ID423
Feature activation-1.64e-02

Pos logit contributions
 feat+0.008
 observ+0.007
 cautious+0.007
 courageous+0.007
 agile+0.007
Neg logit contributions
omm-0.007
-0.006
stand-0.005
igsaw-0.005
bell-0.005
Token fun
Token ID1257
Feature activation-5.64e-03

Pos logit contributions
omm+0.032
stand+0.027
+0.026
igsaw+0.025
ifying+0.024
Neg logit contributions
cut-0.019
 feat-0.019
 observ-0.018
 courageous-0.017
 cautious-0.017
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
omm+0.009
+0.008
stand+0.007
igsaw+0.007
work+0.007
Neg logit contributions
 observ-0.008
 feat-0.008
 redd-0.008
 courageous-0.008
 cautious-0.008
Token They
Token ID1119
Feature activation+6.12e-03

Pos logit contributions
 feat+0.030
 intelligent+0.030
 strong+0.030
 graceful+0.029
 clever+0.029
Neg logit contributions
omm-0.046
igsaw-0.037
work-0.035
-0.034
stand-0.034
Token are
Token ID389
Feature activation-2.75e-02

Pos logit contributions
 feat+0.010
 observ+0.010
 cautious+0.010
 courageous+0.010
 agile+0.009
Neg logit contributions
omm-0.009
-0.007
stand-0.007
igsaw-0.006
ifying-0.006
Token patient
Token ID5827
Feature activation+5.91e-03

Pos logit contributions
omm+0.055
stand+0.043
work+0.041
+0.040
etti+0.039
Neg logit contributions
cut-0.029
 feat-0.028
 wond-0.028
 vines-0.028
worm-0.027
Token.
Token ID13
Feature activation+2.58e-02

Pos logit contributions
 feat+0.008
 observ+0.007
 cautious+0.007
 agile+0.007
 courageous+0.007
Neg logit contributions
omm-0.011
stand-0.009
igsaw-0.009
-0.008
work-0.008
Token They
Token ID1119
Feature activation+5.27e-03

Pos logit contributions
 feat+0.029
 intelligent+0.029
 strong+0.028
 graceful+0.027
 clever+0.027
Neg logit contributions
omm-0.052
igsaw-0.043
work-0.040
stand-0.039
bell-0.038
Token know
Token ID760
Feature activation+1.18e-02

Pos logit contributions
 feat+0.009
 observ+0.008
 cautious+0.008
 courageous+0.008
 agile+0.008
Neg logit contributions
omm-0.008
-0.006
stand-0.006
igsaw-0.006
ifying-0.006
Token that
Token ID326
Feature activation+2.53e-02

Pos logit contributions
 feat+0.017
 observ+0.016
 cautious+0.015
 courageous+0.015
 agile+0.015
Neg logit contributions
omm-0.022
stand-0.017
igsaw-0.017
-0.016
ifying-0.016
Token her
Token ID607
Feature activation+3.98e-03

Pos logit contributions
omm+0.021
stand+0.016
igsaw+0.016
+0.016
ifying+0.015
Neg logit contributions
 feat-0.015
 observ-0.014
 cautious-0.014
 courageous-0.014
 intelligent-0.014
Token explor
Token ID39180
Feature activation+2.88e-04

Pos logit contributions
 feat+0.004
 observ+0.004
cut+0.004
 agile+0.004
 cautious+0.004
Neg logit contributions
omm-0.009
igsaw-0.007
stand-0.007
-0.007
work-0.007
Tokenations
Token ID602
Feature activation+2.55e-02

Pos logit contributions
 vines+0.000
 observ+0.000
 feat+0.000
 agile+0.000
 cautious+0.000
Neg logit contributions
omm-0.001
stand-0.000
work-0.000
bell-0.000
igsaw-0.000
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
 observ+0.038
 feat+0.037
 cautious+0.036
 agile+0.036
 redd+0.036
Neg logit contributions
omm-0.039
stand-0.033
igsaw-0.031
-0.031
work-0.030
Token\n
Token ID198
Feature activation-5.58e-04

Pos logit contributions
omm+0.006
igsaw+0.005
work+0.005
+0.005
stand+0.005
Neg logit contributions
 feat-0.003
 observ-0.003
 intelligent-0.003
 cautious-0.003
 courageous-0.003
Token\n
Token ID198
Feature activation-7.54e-03

Pos logit contributions
omm+0.001
stand+0.001
igsaw+0.001
+0.001
bell+0.001
Neg logit contributions
 feat-0.001
 redd-0.001
 agile-0.001
 observ-0.001
 cautious-0.001
TokenWhen
Token ID2215
Feature activation+7.12e-03

Pos logit contributions
omm+0.015
stand+0.012
igsaw+0.012
+0.011
ifying+0.011
Neg logit contributions
 feat-0.009
 observ-0.009
 cautious-0.009
 agile-0.008
 courageous-0.008
Token Lisa
Token ID15378
Feature activation-3.62e-02

Pos logit contributions
 feat+0.007
 strong+0.007
 clever+0.007
 intelligent+0.007
 brave+0.007
Neg logit contributions
omm-0.016
igsaw-0.013
-0.012
stand-0.012
ifying-0.012
Token got
Token ID1392
Feature activation-1.96e-02

Pos logit contributions
omm+0.068
igsaw+0.054
stand+0.051
+0.051
ifying+0.049
Neg logit contributions
 feat-0.050
 observ-0.045
 cautious-0.045
 courageous-0.045
 wond-0.044
Token there
Token ID612
Feature activation+2.10e-03

Pos logit contributions
omm+0.044
igsaw+0.036
stand+0.036
+0.035
ifying+0.033
Neg logit contributions
 feat-0.020
 strong-0.019
 intelligent-0.019
 brave-0.019
 careful-0.018
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
 feat+0.004
 observ+0.004
 agile+0.004
 cautious+0.004
cut+0.004
Neg logit contributions
omm-0.003
stand-0.002
-0.002
igsaw-0.002
bell-0.002
Token attack
Token ID1368
Feature activation+2.22e-02

Pos logit contributions
omm+0.021
stand+0.017
+0.017
bell+0.016
igsaw+0.015
Neg logit contributions
 feat-0.020
 observ-0.020
 cautious-0.019
 agile-0.019
 intelligent-0.019
Token.
Token ID13
Feature activation-2.80e-03

Pos logit contributions
 feat+0.037
 observ+0.036
 cautious+0.036
 agile+0.035
 redd+0.034
Neg logit contributions
omm-0.033
stand-0.026
igsaw-0.024
-0.023
ifying-0.023
Token\n
Token ID198
Feature activation-7.95e-03

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
+0.004
ifying+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 courageous-0.004
 agile-0.004
Token\n
Token ID198
Feature activation-1.29e-02

Pos logit contributions
omm+0.013
stand+0.010
+0.010
igsaw+0.010
bell+0.009
Neg logit contributions
 feat-0.013
 observ-0.012
 agile-0.012
 redd-0.012
 cautious-0.012
TokenJohn
Token ID7554
Feature activation-8.84e-03

Pos logit contributions
omm+0.022
stand+0.017
igsaw+0.017
+0.016
ifying+0.016
Neg logit contributions
 feat-0.020
 observ-0.018
 agile-0.018
 cautious-0.018
 redd-0.017
Token was
Token ID373
Feature activation-1.38e-02

Pos logit contributions
omm+0.017
stand+0.013
+0.012
bell+0.012
igsaw+0.012
Neg logit contributions
 feat-0.012
 observ-0.011
 redd-0.011
 agile-0.011
 vines-0.011
Token so
Token ID523
Feature activation+2.74e-02

Pos logit contributions
omm+0.026
stand+0.021
igsaw+0.021
+0.020
ifying+0.020
Neg logit contributions
 feat-0.018
 observ-0.017
 agile-0.016
 cautious-0.016
 courageous-0.016
Token amazed
Token ID24562
Feature activation-3.49e-02

Pos logit contributions
 feat+0.028
 vines+0.025
 observ+0.024
 agile+0.024
cut+0.024
Neg logit contributions
omm-0.060
stand-0.049
igsaw-0.048
-0.048
ifying-0.047
Token that
Token ID326
Feature activation+2.99e-02

Pos logit contributions
omm+0.054
igsaw+0.040
stand+0.040
work+0.039
+0.039
Neg logit contributions
 feat-0.054
 observ-0.053
 vines-0.052
 cautious-0.052
 redd-0.051
Token he
Token ID339
Feature activation+1.90e-02

Pos logit contributions
 feat+0.038
 observ+0.036
 vines+0.036
 agile+0.035
 cautious+0.035
Neg logit contributions
omm-0.057
stand-0.046
igsaw-0.044
-0.043
work-0.041
Token cl
Token ID537
Feature activation-2.08e-02

Pos logit contributions
 feat+0.029
 redd+0.027
 vines+0.027
 observ+0.027
cut+0.026
Neg logit contributions
omm-0.030
stand-0.023
bell-0.023
ifying-0.022
-0.022
Token welcome
Token ID7062
Feature activation-2.22e-03

Pos logit contributions
omm+0.037
igsaw+0.033
bell+0.032
stand+0.032
+0.032
Neg logit contributions
 intelligent-0.011
 brave-0.010
 observ-0.010
 clever-0.010
 strong-0.010
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
omm+0.003
igsaw+0.003
stand+0.003
+0.002
ifying+0.002
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.003
 courageous-0.003
 agile-0.003
Token Lily
Token ID20037
Feature activation+3.05e-02

Pos logit contributions
 feat+0.027
 intelligent+0.026
 observ+0.025
 courageous+0.025
 cautious+0.025
Neg logit contributions
omm-0.034
-0.027
stand-0.027
igsaw-0.027
ifying-0.026
Token.
Token ID13
Feature activation-1.66e-03

Pos logit contributions
 observ+0.049
 vines+0.048
 wond+0.048
 unnecessary+0.048
 redd+0.048
Neg logit contributions
omm-0.046
stand-0.035
igsaw-0.032
work-0.031
-0.030
Token I
Token ID314
Feature activation+2.50e-04

Pos logit contributions
omm+0.003
igsaw+0.002
stand+0.002
+0.002
bell+0.002
Neg logit contributions
 feat-0.002
 observ-0.002
 cautious-0.002
 courageous-0.002
 intelligent-0.002
Token love
Token ID1842
Feature activation-5.27e-02

Pos logit contributions
 feat+0.000
 observ+0.000
 cautious+0.000
 courageous+0.000
 intelligent+0.000
Neg logit contributions
omm-0.000
igsaw-0.000
stand-0.000
-0.000
ifying-0.000
Token you
Token ID345
Feature activation+1.48e-02

Pos logit contributions
omm+0.099
igsaw+0.077
stand+0.077
+0.075
ifying+0.073
Neg logit contributions
 feat-0.071
 observ-0.067
 cautious-0.065
 intelligent-0.065
 courageous-0.065
Token.
Token ID13
Feature activation+2.72e-03

Pos logit contributions
 feat+0.021
 strong+0.021
 brave+0.021
 intelligent+0.020
 careful+0.020
Neg logit contributions
omm-0.026
igsaw-0.022
stand-0.020
-0.020
ifying-0.019
Token You
Token ID921
Feature activation-5.77e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 intelligent+0.003
 courageous+0.003
Neg logit contributions
omm-0.005
igsaw-0.004
stand-0.004
bell-0.004
-0.004
Token are
Token ID389
Feature activation-2.14e-02

Pos logit contributions
omm+0.009
igsaw+0.006
stand+0.006
+0.006
ifying+0.006
Neg logit contributions
 feat-0.010
 observ-0.009
 cautious-0.009
 agile-0.009
 courageous-0.009
Token the
Token ID262
Feature activation+3.86e-02

Pos logit contributions
omm+0.041
stand+0.031
igsaw+0.031
+0.030
ifying+0.029
Neg logit contributions
 feat-0.028
 observ-0.026
 cautious-0.026
cut-0.025
 agile-0.025
Token mom
Token ID1995
Feature activation-3.35e-02

Pos logit contributions
omm+0.037
work+0.027
stand+0.027
igsaw+0.025
+0.024
Neg logit contributions
 agile-0.038
 observ-0.038
 courageous-0.037
 redd-0.037
 wond-0.036
Token and
Token ID290
Feature activation-2.62e-02

Pos logit contributions
omm+0.056
stand+0.045
+0.042
bell+0.041
igsaw+0.041
Neg logit contributions
 feat-0.050
 observ-0.049
 agile-0.049
 courageous-0.047
 wond-0.047
Token dad
Token ID9955
Feature activation-1.48e-02

Pos logit contributions
omm+0.040
stand+0.030
igsaw+0.028
+0.028
work+0.026
Neg logit contributions
 feat-0.044
 observ-0.041
 agile-0.041
 cautious-0.041
 courageous-0.041
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
omm+0.026
stand+0.021
+0.020
igsaw+0.020
bell+0.019
Neg logit contributions
 feat-0.021
 observ-0.020
 agile-0.020
 courageous-0.019
 cautious-0.019
Token who
Token ID508
Feature activation-1.34e-02

Pos logit contributions
omm+0.024
stand+0.021
+0.018
ifier+0.018
ifying+0.017
Neg logit contributions
 feat-0.018
 redd-0.017
 agile-0.017
 observ-0.016
cut-0.016
Token were
Token ID547
Feature activation-4.92e-02

Pos logit contributions
omm+0.023
+0.018
igsaw+0.017
stand+0.017
bell+0.017
Neg logit contributions
 feat-0.019
 observ-0.019
 agile-0.018
 cautious-0.018
 unnecessary-0.018
Token sitting
Token ID5586
Feature activation+2.29e-02

Pos logit contributions
omm+0.093
stand+0.073
+0.070
work+0.068
bell+0.067
Neg logit contributions
 feat-0.063
 agile-0.058
 vines-0.058
 observ-0.058
 redd-0.055
Token on
Token ID319
Feature activation+8.64e-04

Pos logit contributions
 wond+0.033
 feat+0.033
 observ+0.031
 agile+0.031
 redd+0.030
Neg logit contributions
omm-0.036
-0.030
ifying-0.029
work-0.027
stand-0.027
Token a
Token ID257
Feature activation-5.12e-03

Pos logit contributions
 feat+0.001
 vines+0.001
 strong+0.001
 tall+0.001
 intelligent+0.001
Neg logit contributions
omm-0.002
igsaw-0.002
ifying-0.002
-0.002
stand-0.002
Token blanket
Token ID18447
Feature activation-2.11e-02

Pos logit contributions
omm+0.011
stand+0.009
igsaw+0.009
+0.008
ifying+0.008
Neg logit contributions
 feat-0.006
 observ-0.005
 cautious-0.005
 courageous-0.005
 agile-0.005
Token.
Token ID13
Feature activation+2.39e-03

Pos logit contributions
omm+0.042
igsaw+0.033
stand+0.032
+0.032
ifying+0.031
Neg logit contributions
 feat-0.026
 observ-0.025
 cautious-0.024
 courageous-0.024
 agile-0.024
Token she
Token ID673
Feature activation+7.53e-03

Pos logit contributions
 feat+0.033
 observ+0.031
 cautious+0.031
cut+0.031
 agile+0.030
Neg logit contributions
omm-0.032
stand-0.024
-0.024
igsaw-0.023
ifying-0.022
Token looked
Token ID3114
Feature activation+1.45e-02

Pos logit contributions
 feat+0.009
 unnecessary+0.009
 observ+0.009
 redd+0.009
cut+0.009
Neg logit contributions
omm-0.013
bell-0.011
stand-0.011
work-0.010
ifying-0.010
Token out
Token ID503
Feature activation+3.10e-02

Pos logit contributions
 feat+0.020
cut+0.019
 observ+0.018
Bee+0.018
 agile+0.018
Neg logit contributions
omm-0.022
ifying-0.019
bell-0.019
work-0.018
igsaw-0.018
Token at
Token ID379
Feature activation+1.72e-02

Pos logit contributions
cut+0.047
 observ+0.045
 feat+0.045
cod+0.043
 unnecessary+0.042
Neg logit contributions
omm-0.043
bell-0.039
ifying-0.037
igsaw-0.034
stand-0.034
Token the
Token ID262
Feature activation-7.41e-03

Pos logit contributions
 observ+0.020
 feat+0.020
 cautious+0.019
 courageous+0.019
 agile+0.019
Neg logit contributions
omm-0.033
igsaw-0.028
stand-0.028
-0.028
work-0.026
Token beautiful
Token ID4950
Feature activation-4.90e-02

Pos logit contributions
omm+0.010
ifying+0.008
+0.008
work+0.008
stand+0.008
Neg logit contributions
 feat-0.012
 observ-0.012
 cautious-0.011
cut-0.011
 agile-0.011
Token view
Token ID1570
Feature activation-2.14e-02

Pos logit contributions
omm+0.085
igsaw+0.073
stand+0.070
work+0.070
bell+0.069
Neg logit contributions
 observ-0.062
 cautious-0.059
 feat-0.057
cut-0.056
 courageous-0.056
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
omm+0.044
igsaw+0.035
stand+0.035
+0.034
ifying+0.033
Neg logit contributions
 feat-0.025
 observ-0.023
 cautious-0.023
 intelligent-0.023
 agile-0.022
Token She
Token ID1375
Feature activation-3.79e-03

Pos logit contributions
omm+0.015
stand+0.011
igsaw+0.011
work+0.011
ifying+0.010
Neg logit contributions
 feat-0.017
 observ-0.017
 cautious-0.016
 intelligent-0.016
 courageous-0.016
Token was
Token ID373
Feature activation-1.51e-02

Pos logit contributions
omm+0.007
bell+0.005
stand+0.005
work+0.005
+0.005
Neg logit contributions
 feat-0.005
 unnecessary-0.005
cut-0.005
 observ-0.005
 redd-0.004
Token so
Token ID523
Feature activation+2.65e-02

Pos logit contributions
omm+0.027
stand+0.021
igsaw+0.020
+0.020
work+0.020
Neg logit contributions
 feat-0.021
 observ-0.019
cut-0.019
 vines-0.019
 agile-0.019
Token She
Token ID1375
Feature activation-4.77e-03

Pos logit contributions
omm+0.013
igsaw+0.010
stand+0.010
+0.010
ifying+0.009
Neg logit contributions
 feat-0.013
 observ-0.012
 cautious-0.012
 courageous-0.012
 agile-0.012
Token said
Token ID531
Feature activation+1.58e-02

Pos logit contributions
omm+0.009
stand+0.007
+0.007
bell+0.007
igsaw+0.007
Neg logit contributions
 feat-0.006
 observ-0.006
 cautious-0.005
 courageous-0.005
 agile-0.005
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
 feat+0.029
 observ+0.027
 wond+0.027
 agile+0.027
 vines+0.027
Neg logit contributions
omm-0.021
stand-0.014
igsaw-0.013
work-0.013
bell-0.013
Token "
Token ID366
Feature activation+2.82e-03

Pos logit contributions
omm+0.023
stand+0.018
igsaw+0.018
+0.018
ifying+0.017
Neg logit contributions
 feat-0.014
 observ-0.013
 cautious-0.012
 agile-0.012
 courageous-0.012
TokenWhat
Token ID2061
Feature activation+2.23e-03

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 intelligent+0.003
 courageous+0.003
Neg logit contributions
omm-0.006
stand-0.005
igsaw-0.004
-0.004
ifying-0.004
Token are
Token ID389
Feature activation-5.50e-02

Pos logit contributions
 feat+0.003
 observ+0.003
 cautious+0.003
 courageous+0.003
 agile+0.003
Neg logit contributions
omm-0.004
igsaw-0.003
stand-0.003
-0.003
ifying-0.003
Token you
Token ID345
Feature activation-5.62e-03

Pos logit contributions
omm+0.096
igsaw+0.073
stand+0.072
+0.071
ifying+0.068
Neg logit contributions
 feat-0.082
 cautious-0.076
 observ-0.076
 courageous-0.075
 agile-0.075
Token doing
Token ID1804
Feature activation-8.01e-06

Pos logit contributions
omm+0.012
igsaw+0.010
stand+0.010
+0.010
ifying+0.010
Neg logit contributions
 feat-0.006
 intelligent-0.006
 observ-0.006
 cautious-0.005
 clever-0.005
Token here
Token ID994
Feature activation+1.33e-02

Feature not active on previous token; no predictions were affected.
Token?"
Token ID1701
Feature activation-2.89e-03

Pos logit contributions
 feat+0.020
 observ+0.019
 cautious+0.019
 agile+0.019
 courageous+0.018
Neg logit contributions
omm-0.023
igsaw-0.018
stand-0.018
-0.017
work-0.016
Token\n
Token ID198
Feature activation-1.49e-03

Pos logit contributions
omm+0.005
igsaw+0.004
stand+0.004
+0.004
ifying+0.004
Neg logit contributions
 feat-0.004
 observ-0.004
 cautious-0.004
 courageous-0.004
 agile-0.004
Token and
Token ID290
Feature activation+3.64e-03

Pos logit contributions
omm+0.009
igsaw+0.007
+0.007
stand+0.007
ifier+0.007
Neg logit contributions
 feat-0.004
 strong-0.004
cut-0.004
 intelligent-0.004
 tall-0.004
Token the
Token ID262
Feature activation+7.32e-03

Pos logit contributions
 feat+0.004
 observ+0.004
 cautious+0.004
 courageous+0.004
 agile+0.004
Neg logit contributions
omm-0.008
stand-0.006
igsaw-0.006
-0.006
ifying-0.006
Token papers
Token ID9473
Feature activation+4.87e-03

Pos logit contributions
 feat+0.011
 observ+0.010
 cautious+0.010
 courageous+0.010
 agile+0.010
Neg logit contributions
omm-0.012
stand-0.010
igsaw-0.010
ifying-0.009
work-0.009
Token with
Token ID351
Feature activation-2.00e-02

Pos logit contributions
 feat+0.005
 intelligent+0.004
 observ+0.004
 cautious+0.004
 strong+0.004
Neg logit contributions
omm-0.011
igsaw-0.009
-0.009
stand-0.009
ifying-0.008
Token the
Token ID262
Feature activation+6.48e-03

Pos logit contributions
omm+0.045
+0.036
igsaw+0.036
stand+0.035
ifying+0.033
Neg logit contributions
 feat-0.021
 observ-0.019
 vines-0.019
 intelligent-0.019
 agile-0.018
Token cr
Token ID1067
Feature activation-5.07e-02

Pos logit contributions
 feat+0.009
 observ+0.009
 cautious+0.009
 courageous+0.009
 agile+0.009
Neg logit contributions
omm-0.011
stand-0.009
igsaw-0.009
ifying-0.009
work-0.008
Tokenay
Token ID323
Feature activation-4.33e-02

Pos logit contributions
omm+0.084
ework+0.078
+0.077
 salon+0.077
 School+0.075
Neg logit contributions
rows-0.055
 vines-0.050
 biggest-0.047
 strong-0.044
 tall-0.044
Tokenon
Token ID261
Feature activation-2.42e-02

Pos logit contributions
omm+0.098
+0.081
igsaw+0.078
ifying+0.076
ifier+0.073
Neg logit contributions
 vines-0.038
 feat-0.036
cut-0.035
b-0.035
ts-0.035
Token.
Token ID13
Feature activation+5.27e-03

Pos logit contributions
omm+0.056
igsaw+0.046
+0.045
stand+0.042
ifying+0.041
Neg logit contributions
 feat-0.022
 strong-0.020
cut-0.020
 vines-0.020
 intelligent-0.020
Token He
Token ID679
Feature activation+5.85e-04

Pos logit contributions
 feat+0.008
 observ+0.007
 cautious+0.007
 intelligent+0.007
 agile+0.007
Neg logit contributions
omm-0.009
igsaw-0.007
stand-0.007
-0.007
work-0.007
Token smiled
Token ID13541
Feature activation-1.18e-02

Pos logit contributions
 feat+0.001
 observ+0.001
 cautious+0.001
 agile+0.001
 intelligent+0.001
Neg logit contributions
omm-0.001
igsaw-0.001
stand-0.001
-0.001
ifying-0.001