TOP ACTIVATIONS
MAX = 0.082

ie . They both had birth days coming up , and
Spot . There are balls and bones and ropes and sque
my took Joe out for a walk in the garden .
his friends dropped her lip bal m on the ground .
dance around ! Em ma was over joy ed
new symbol ! Em ma 's friends cl apped
Goose ' and ' Hide and Seek ' ?"
and his mom went for a walk . It was a
things . There are hats and shoes and dresses and coats
with me ?" Mel ody was very excited and
The monster thought for a moment and then said
nice . Some are black and disgusting and mean ." Mia
here . Em ma had a great time
place . Em ma was so proud of
!" Mom thought for a moment . Then she
think you 're doing ?!" Fox y looked up and saw
us . We are kings and queens of this castle .
does . She learned that birth days are not about cakes
time there were two friends , Bob and Jack , who
ribbon for being the best bal ancer in the competition .

MIN ACTIVATIONS
MIN = -0.084

. Every day , the family would give Strip lots of
. Every day , the mail man would put letters inside
So every evening , the lawyer would go to his big
. Every day , the mail man would leave the mail
L ily decided to keep an eye on the field to
inside it , so she let herself in . She stepped
They found a hotel and let themselves into the lobby .
. Every day , the clock went " tick - t
small . Every day the piece went on adventures , looking
From then on , Tim my never played with ropes again
. Every day , the clock would tick - t ock
she didn 't want to let it go . She held
Every day , the painter liked to play with children
It made loud noises and let out smoke when it zoom
gate . She smiled and let them in . They saw
that day on , Tim my learned to be more careful
She opened the door and let the bird out . The
was all dry when she let go ! She put on
their food on it and let it cook .
calm and peaceful . They let their little dog run around

INTERVAL 0.041 - 0.082
CONTAINS 0.304%

day , Freddy went for a swim in the swamp and
!" Sue says . " Do you think it has toys
't you both read your books and then tell each other
Sam says , " Thank you , Mom ! I
But one day , Lily couldn 't find her necklace .

INTERVAL -0.001 - 0.041
CONTAINS 51.168%

to see the crocod ile up close . After
elling flowers ! Finally , Daisy and her
today . She had a new friend to play with .
" I 'm the best slider here , the judge said
bow was so excited . He wanted to tell everyone the

INTERVAL -0.042 - -0.001
CONTAINS 48.202%

adding new ideas to their game . Summer was the best
um , which road do I use to get to the
its head . The bird flew down and landed on the
She said it was a memory of a war . She
and Johnny pulled Bob 's hand as they started walking home

INTERVAL -0.084 - -0.042
CONTAINS 0.326%

They use the bread to make more blocks . They add
on , Tim my knew that when he needed help ,
- being . <|endoftext|> L ily loved to paint . She
car . It can make sounds and lights . Do you
his house . He would watch the boat go on the
Tokenie
Token ID494
Feature activation+1.29e-02

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.015
 hips+0.015
Neg logit contributions
 idea-0.012
crow-0.010
 bal-0.010
 doctors-0.010
 people-0.009
Token.
Token ID13
Feature activation+9.79e-03

Pos logit contributions
ifying+0.030
igsaw+0.026
years+0.025
chie+0.025
jamin+0.023
Neg logit contributions
 idea-0.014
 bal-0.011
 doctors-0.010
crow-0.010
 cows-0.010
Token They
Token ID1119
Feature activation+4.99e-03

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.013
crow-0.011
 bal-0.010
 doctors-0.010
 cows-0.010
Token both
Token ID1111
Feature activation+1.16e-02

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
 hips+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token had
Token ID550
Feature activation+2.24e-02

Pos logit contributions
ifying+0.025
igsaw+0.022
years+0.021
chie+0.021
 hips+0.020
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.010
 people-0.010
Token birth
Token ID4082
Feature activation+8.24e-02

Pos logit contributions
ifying+0.045
igsaw+0.041
chie+0.037
years+0.037
 hips+0.035
Neg logit contributions
 idea-0.030
 cows-0.026
 doctors-0.026
 people-0.026
 time-0.024
Tokendays
Token ID12545
Feature activation+2.29e-02

Pos logit contributions
ifying+0.118
igsaw+0.115
jamin+0.102
 hips+0.099
 hind+0.096
Neg logit contributions
 idea-0.129
 if-0.122
 whenever-0.120
 understand-0.119
 at-0.114
Token coming
Token ID2406
Feature activation+2.48e-02

Pos logit contributions
ifying+0.052
igsaw+0.044
chie+0.043
years+0.043
jamin+0.041
Neg logit contributions
 idea-0.027
 bal-0.020
 people-0.019
 doctors-0.019
 day-0.019
Token up
Token ID510
Feature activation+2.60e-02

Pos logit contributions
ifying+0.052
igsaw+0.045
years+0.043
chie+0.042
jamin+0.041
Neg logit contributions
 idea-0.032
 people-0.026
 doctors-0.025
 cows-0.025
 understand-0.024
Token,
Token ID11
Feature activation+3.31e-04

Pos logit contributions
ifying+0.057
chie+0.048
igsaw+0.047
years+0.047
jamin+0.045
Neg logit contributions
 idea-0.034
 people-0.026
 time-0.025
 day-0.025
 doctors-0.024
Token and
Token ID290
Feature activation+6.83e-03

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
 hips+0.000
Neg logit contributions
 idea-0.000
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token Spot
Token ID15899
Feature activation+5.33e-03

Pos logit contributions
ifying+0.089
igsaw+0.074
chie+0.072
jamin+0.071
oured+0.069
Neg logit contributions
 idea-0.032
 people-0.029
 doctors-0.027
 cows-0.023
 time-0.021
Token.
Token ID13
Feature activation+5.03e-04

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token There
Token ID1318
Feature activation-2.68e-02

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
 hips+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token are
Token ID389
Feature activation+2.41e-02

Pos logit contributions
 idea+0.048
 bal+0.045
 doctors+0.044
crow+0.043
 paramedics+0.043
Neg logit contributions
ifying-0.035
igsaw-0.029
years-0.025
chie-0.025
 hips-0.025
Token balls
Token ID11333
Feature activation+6.29e-03

Pos logit contributions
ifying+0.051
igsaw+0.045
chie+0.042
years+0.041
jamin+0.039
Neg logit contributions
 idea-0.031
 doctors-0.026
 cows-0.026
 people-0.025
 bal-0.024
Token and
Token ID290
Feature activation+7.75e-02

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 people-0.005
Token bones
Token ID11945
Feature activation+4.05e-03

Pos logit contributions
ifying+0.190
igsaw+0.180
ll+0.174
oured+0.172
years+0.172
Neg logit contributions
 people-0.076
 cows-0.072
 doctors-0.065
 idea-0.060
 sun-0.057
Token and
Token ID290
Feature activation+5.20e-02

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token ropes
Token ID31001
Feature activation-8.75e-03

Pos logit contributions
ifying+0.117
igsaw+0.109
chie+0.105
years+0.105
ll+0.102
Neg logit contributions
 idea-0.060
 people-0.057
 doctors-0.054
 cows-0.054
 sun-0.045
Token and
Token ID290
Feature activation+5.04e-02

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 doctors+0.009
 cows+0.008
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.015
chie-0.014
 hips-0.014
Token sque
Token ID35776
Feature activation+2.82e-02

Pos logit contributions
ifying+0.112
igsaw+0.103
years+0.098
chie+0.097
ll+0.096
Neg logit contributions
 idea-0.062
 people-0.056
 doctors-0.053
 cows-0.051
 wind-0.045
Tokenmy
Token ID1820
Feature activation-5.85e-03

Pos logit contributions
ifying+0.035
igsaw+0.029
chie+0.027
years+0.026
 hips+0.024
Neg logit contributions
 idea-0.037
 bal-0.031
crow-0.031
 cows-0.031
 people-0.031
Token took
Token ID1718
Feature activation-2.01e-02

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.009
chie-0.009
jamin-0.009
Token Joe
Token ID5689
Feature activation+2.09e-02

Pos logit contributions
 idea+0.031
cut+0.026
crow+0.025
 bal+0.025
 doctors+0.025
Neg logit contributions
ifying-0.036
igsaw-0.031
years-0.029
jamin-0.027
chie-0.027
Token out
Token ID503
Feature activation+3.26e-02

Pos logit contributions
ifying+0.046
igsaw+0.039
years+0.038
chie+0.038
 hips+0.035
Neg logit contributions
 idea-0.024
 bal-0.019
 cows-0.019
crow-0.019
 people-0.018
Token for
Token ID329
Feature activation+2.85e-02

Pos logit contributions
ifying+0.068
igsaw+0.058
chie+0.058
years+0.056
jamin+0.055
Neg logit contributions
 idea-0.041
 people-0.031
 bal-0.031
 cows-0.031
 doctors-0.030
Token a
Token ID257
Feature activation+7.62e-02

Pos logit contributions
ifying+0.059
igsaw+0.050
chie+0.049
years+0.048
jamin+0.046
Neg logit contributions
 idea-0.037
 doctors-0.031
 people-0.031
 time-0.030
 cows-0.030
Token walk
Token ID2513
Feature activation-5.39e-04

Pos logit contributions
ifying+0.186
years+0.181
chie+0.163
hetically+0.153
 seeking+0.151
Neg logit contributions
 idea-0.070
 day-0.067
 time-0.064
 morning-0.056
 bal-0.051
Token in
Token ID287
Feature activation+7.46e-03

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token the
Token ID262
Feature activation+4.60e-03

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.013
chie+0.013
jamin+0.012
Neg logit contributions
 idea-0.009
crow-0.007
 bal-0.007
 doctors-0.007
 people-0.007
Token garden
Token ID11376
Feature activation-8.37e-03

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
 hips+0.009
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.003
 cows-0.003
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.008
 understand+0.008
Neg logit contributions
ifying-0.017
igsaw-0.015
years-0.014
chie-0.014
 hips-0.013
Token his
Token ID465
Feature activation+3.26e-03

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token friends
Token ID2460
Feature activation-2.43e-02

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token dropped
Token ID5710
Feature activation+6.81e-03

Pos logit contributions
 idea+0.039
 bal+0.032
crow+0.031
cut+0.031
 doctors+0.031
Neg logit contributions
ifying-0.042
years-0.034
chie-0.032
jamin-0.031
igsaw-0.031
Token her
Token ID607
Feature activation+1.84e-02

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.009
crow-0.007
 bal-0.007
 doctors-0.007
 cows-0.007
Token lip
Token ID10645
Feature activation+3.92e-03

Pos logit contributions
ifying+0.038
igsaw+0.034
years+0.034
chie+0.032
jamin+0.030
Neg logit contributions
 idea-0.024
 bal-0.019
 doctors-0.018
crow-0.018
 people-0.018
Token bal
Token ID3652
Feature activation+7.49e-02

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Tokenm
Token ID76
Feature activation+1.52e-02

Pos logit contributions
igsaw+0.115
ifying+0.095
 hind+0.091
 unison+0.090
 hips+0.086
Neg logit contributions
 bal-0.113
ying-0.109
crow-0.103
 if-0.103
umber-0.099
Token on
Token ID319
Feature activation-3.18e-03

Pos logit contributions
ifying+0.029
igsaw+0.024
years+0.024
chie+0.023
jamin+0.021
Neg logit contributions
 idea-0.023
 bal-0.019
crow-0.019
 doctors-0.018
 people-0.018
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
 idea+0.004
 bal+0.004
crow+0.004
 doctors+0.003
 understand+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.005
Token ground
Token ID2323
Feature activation+1.81e-02

Pos logit contributions
ifying+0.033
igsaw+0.030
years+0.029
chie+0.028
run+0.027
Neg logit contributions
 idea-0.012
 doctors-0.008
 bal-0.008
crow-0.008
 cows-0.008
Token.
Token ID13
Feature activation+2.70e-03

Pos logit contributions
ifying+0.043
igsaw+0.038
years+0.037
chie+0.036
jamin+0.034
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.013
 doctors-0.013
 cows-0.013
Token dance
Token ID9280
Feature activation+6.92e-03

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.020
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.016
 bal-0.013
crow-0.013
 doctors-0.013
 people-0.013
Token around
Token ID1088
Feature activation+6.20e-03

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.012
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 people-0.006
Token!
Token ID0
Feature activation+1.96e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 people-0.006
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
ifying+0.039
years+0.033
chie+0.032
igsaw+0.032
 hips+0.029
Neg logit contributions
 idea-0.027
 bal-0.022
crow-0.022
 doctors-0.021
 people-0.021
Token
Token ID198
Feature activation+2.04e-02

Pos logit contributions
ifying+0.033
igsaw+0.028
years+0.027
chie+0.027
 hips+0.025
Neg logit contributions
 idea-0.017
 bal-0.013
crow-0.013
 understand-0.013
 doctors-0.013
TokenEm
Token ID10161
Feature activation+7.43e-02

Pos logit contributions
ifying+0.032
igsaw+0.025
chie+0.024
years+0.024
 hips+0.023
Neg logit contributions
 idea-0.035
 bal-0.031
 doctors-0.031
 people-0.030
 understand-0.030
Tokenma
Token ID2611
Feature activation+2.07e-02

Pos logit contributions
ifying+0.180
 hind+0.156
igsaw+0.151
ider+0.143
lic+0.142
Neg logit contributions
 idea-0.046
 time-0.045
 day-0.042
est-0.041
pa-0.040
Token was
Token ID373
Feature activation-2.24e-03

Pos logit contributions
ifying+0.037
igsaw+0.032
years+0.029
chie+0.028
 hips+0.026
Neg logit contributions
 idea-0.033
 bal-0.028
crow-0.027
 people-0.027
 cows-0.026
Token over
Token ID625
Feature activation+9.56e-03

Pos logit contributions
 idea+0.004
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.003
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.003
jamin-0.002
Tokenjoy
Token ID2633
Feature activation+8.09e-03

Pos logit contributions
ifying+0.013
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.019
 bal-0.017
crow-0.016
 doctors-0.016
 people-0.016
Tokened
Token ID276
Feature activation+7.44e-03

Pos logit contributions
ifying+0.014
igsaw+0.011
years+0.011
chie+0.011
 hips+0.010
Neg logit contributions
 idea-0.013
 bal-0.011
crow-0.011
 doctors-0.011
 people-0.011
Token new
Token ID649
Feature activation-1.23e-02

Pos logit contributions
ifying+0.041
years+0.034
igsaw+0.033
chie+0.032
jamin+0.032
Neg logit contributions
 idea-0.025
 bal-0.019
crow-0.018
 thing-0.018
 doctors-0.018
Token symbol
Token ID6194
Feature activation-7.95e-03

Pos logit contributions
 idea+0.015
crow+0.013
 doctors+0.013
 bal+0.012
 cows+0.012
Neg logit contributions
ifying-0.025
igsaw-0.022
chie-0.021
years-0.020
 hips-0.019
Token!
Token ID0
Feature activation+2.97e-02

Pos logit contributions
 idea+0.011
crow+0.009
 doctors+0.009
 bal+0.009
 cows+0.008
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.013
jamin-0.012
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
ifying+0.059
years+0.052
chie+0.051
igsaw+0.048
ll+0.045
Neg logit contributions
 idea-0.040
 People-0.032
 people-0.031
 bal-0.031
crow-0.030
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.024
chie+0.024
 hips+0.023
Neg logit contributions
 idea-0.011
 bal-0.008
 doctors-0.008
crow-0.008
 people-0.007
TokenEm
Token ID10161
Feature activation+7.34e-02

Pos logit contributions
ifying+0.029
igsaw+0.023
chie+0.022
years+0.022
 hips+0.021
Neg logit contributions
 idea-0.032
 bal-0.028
 doctors-0.028
 people-0.027
 understand-0.027
Tokenma
Token ID2611
Feature activation+2.60e-02

Pos logit contributions
ifying+0.180
igsaw+0.151
 hind+0.150
lic+0.141
run+0.140
Neg logit contributions
 idea-0.052
 people-0.047
ying-0.046
 day-0.044
est-0.044
Token's
Token ID338
Feature activation-2.32e-03

Pos logit contributions
ifying+0.048
igsaw+0.042
years+0.037
chie+0.037
 hips+0.033
Neg logit contributions
 idea-0.040
 bal-0.034
crow-0.033
 people-0.033
 understand-0.032
Token friends
Token ID2460
Feature activation+3.11e-02

Pos logit contributions
 idea+0.003
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
 hips-0.004
Token cl
Token ID537
Feature activation+2.39e-02

Pos logit contributions
ifying+0.058
igsaw+0.053
years+0.046
chie+0.045
 hips+0.042
Neg logit contributions
 idea-0.046
 understand-0.039
 people-0.038
 bal-0.037
crow-0.037
Tokenapped
Token ID6320
Feature activation-1.21e-03

Pos logit contributions
ifying+0.051
igsaw+0.045
 hips+0.042
years+0.041
 seeking+0.041
Neg logit contributions
 idea-0.027
 bal-0.022
crow-0.022
 understand-0.020
ook-0.020
Token Goose
Token ID46317
Feature activation+5.58e-03

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.004
crow-0.003
 doctors-0.003
 cows-0.003
Token'
Token ID6
Feature activation-1.28e-02

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.008
 hips+0.008
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.006
 cows-0.006
Token and
Token ID290
Feature activation+7.22e-03

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.014
cut+0.014
Neg logit contributions
ifying-0.024
years-0.020
igsaw-0.019
chie-0.019
 hips-0.018
Token '
Token ID705
Feature activation+3.10e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 cows-0.009
TokenHide
Token ID38518
Feature activation+1.67e-02

Pos logit contributions
ifying+0.057
igsaw+0.050
 hips+0.045
years+0.045
chie+0.044
Neg logit contributions
 idea-0.044
crow-0.039
 bal-0.039
 people-0.037
 doctors-0.037
Token and
Token ID290
Feature activation+7.25e-02

Pos logit contributions
ifying+0.036
igsaw+0.031
years+0.030
chie+0.029
 hips+0.028
Neg logit contributions
 idea-0.020
crow-0.016
 people-0.016
 bal-0.016
 doctors-0.016
Token Seek
Token ID40749
Feature activation+2.80e-02

Pos logit contributions
ifying+0.136
igsaw+0.109
�+0.105
chie+0.100
years+0.100
Neg logit contributions
 idea-0.108
 people-0.108
 day-0.100
 wind-0.098
 sun-0.096
Token'
Token ID6
Feature activation+6.74e-03

Pos logit contributions
ifying+0.056
years+0.050
igsaw+0.049
chie+0.048
jamin+0.044
Neg logit contributions
 idea-0.036
 people-0.028
 doctors-0.028
crow-0.028
 bal-0.028
Token?"
Token ID1701
Feature activation-6.76e-03

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token
Token ID198
Feature activation-4.47e-03

Pos logit contributions
 idea+0.011
crow+0.009
 doctors+0.009
 bal+0.009
 pilot+0.009
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.009
 hips-0.009
jamin-0.009
Token
Token ID198
Feature activation+4.33e-04

Pos logit contributions
 idea+0.007
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.007
jamin-0.006
Token and
Token ID290
Feature activation-4.63e-03

Pos logit contributions
 idea+0.038
 bal+0.031
 doctors+0.031
crow+0.031
 paramedics+0.030
Neg logit contributions
ifying-0.044
years-0.035
jamin-0.035
igsaw-0.034
chie-0.033
Token his
Token ID465
Feature activation+2.53e-02

Pos logit contributions
 idea+0.007
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.009
igsaw-0.007
years-0.007
chie-0.007
jamin-0.007
Token mom
Token ID1995
Feature activation-4.39e-03

Pos logit contributions
ifying+0.066
igsaw+0.059
years+0.059
chie+0.058
jamin+0.055
Neg logit contributions
 idea-0.021
 people-0.014
 cows-0.013
 bal-0.013
 pilot-0.013
Token went
Token ID1816
Feature activation+3.27e-02

Pos logit contributions
 idea+0.007
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.006
jamin-0.006
Token for
Token ID329
Feature activation+3.58e-02

Pos logit contributions
ifying+0.074
years+0.064
chie+0.063
igsaw+0.063
jamin+0.058
Neg logit contributions
 idea-0.037
 people-0.028
 day-0.028
 cows-0.027
 doctors-0.027
Token a
Token ID257
Feature activation+7.16e-02

Pos logit contributions
ifying+0.084
chie+0.071
years+0.070
igsaw+0.069
jamin+0.067
Neg logit contributions
 idea-0.040
 people-0.035
 doctors-0.034
 wheat-0.033
 cows-0.032
Token walk
Token ID2513
Feature activation+5.85e-03

Pos logit contributions
ifying+0.178
years+0.166
chie+0.155
jamin+0.148
run+0.143
Neg logit contributions
 idea-0.070
 day-0.066
 time-0.059
 morning-0.053
 go-0.048
Token.
Token ID13
Feature activation-1.64e-03

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.010
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 cows-0.005
Token It
Token ID632
Feature activation-3.20e-03

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.002
jamin-0.002
Token was
Token ID373
Feature activation-2.05e-02

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.006
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token a
Token ID257
Feature activation+1.81e-02

Pos logit contributions
 idea+0.037
crow+0.036
 bal+0.035
cut+0.034
ighters+0.034
Neg logit contributions
ifying-0.028
years-0.021
jamin-0.020
igsaw-0.020
chie-0.018
Token things
Token ID1243
Feature activation-7.51e-04

Pos logit contributions
 idea+0.003
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.003
years-0.003
chie-0.003
 hips-0.003
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token There
Token ID1318
Feature activation-2.40e-02

Pos logit contributions
ifying+0.030
igsaw+0.025
years+0.025
chie+0.025
jamin+0.023
Neg logit contributions
 idea-0.018
crow-0.014
 bal-0.014
 cows-0.014
 doctors-0.014
Token are
Token ID389
Feature activation+1.78e-02

Pos logit contributions
 idea+0.043
 bal+0.040
 doctors+0.039
crow+0.038
 paramedics+0.037
Neg logit contributions
ifying-0.033
igsaw-0.026
chie-0.025
years-0.024
 hips-0.023
Token hats
Token ID23910
Feature activation+2.82e-03

Pos logit contributions
ifying+0.038
igsaw+0.034
years+0.031
chie+0.031
jamin+0.030
Neg logit contributions
 idea-0.023
 doctors-0.018
 people-0.018
 cows-0.018
 bal-0.017
Token and
Token ID290
Feature activation+7.16e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.003
 bal-0.003
crow-0.003
 doctors-0.002
 cows-0.002
Token shoes
Token ID10012
Feature activation+2.03e-02

Pos logit contributions
ifying+0.154
igsaw+0.151
�+0.145
years+0.143
chie+0.137
Neg logit contributions
 people-0.079
 cows-0.078
 idea-0.077
 doctors-0.068
 sun-0.065
Token and
Token ID290
Feature activation+5.25e-02

Pos logit contributions
ifying+0.048
igsaw+0.043
years+0.043
chie+0.041
jamin+0.039
Neg logit contributions
 idea-0.021
 bal-0.015
 people-0.014
crow-0.014
 doctors-0.014
Token dresses
Token ID27309
Feature activation+6.81e-03

Pos logit contributions
ifying+0.113
igsaw+0.106
years+0.102
chie+0.101
�+0.099
Neg logit contributions
 idea-0.064
 people-0.057
 cows-0.055
 doctors-0.054
 wheat-0.047
Token and
Token ID290
Feature activation+4.65e-02

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.013
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token coats
Token ID30720
Feature activation+8.87e-03

Pos logit contributions
ifying+0.101
igsaw+0.092
years+0.088
chie+0.088
�+0.085
Neg logit contributions
 idea-0.059
 people-0.051
 doctors-0.049
 cows-0.048
 understand-0.043
Token with
Token ID351
Feature activation+5.23e-03

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token me
Token ID502
Feature activation+2.16e-02

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token?"
Token ID1701
Feature activation+1.49e-02

Pos logit contributions
ifying+0.049
igsaw+0.043
years+0.042
chie+0.041
jamin+0.038
Neg logit contributions
 idea-0.024
 bal-0.018
 doctors-0.018
 people-0.017
crow-0.017
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
ifying+0.032
igsaw+0.027
years+0.027
chie+0.026
jamin+0.024
Neg logit contributions
 idea-0.019
crow-0.015
 people-0.014
 bal-0.014
 doctors-0.014
Token
Token ID198
Feature activation+3.16e-02

Pos logit contributions
ifying+0.032
igsaw+0.026
years+0.026
chie+0.025
jamin+0.024
Neg logit contributions
 idea-0.018
crow-0.014
 bal-0.014
 people-0.014
 doctors-0.013
TokenMel
Token ID21102
Feature activation+7.16e-02

Pos logit contributions
ifying+0.063
igsaw+0.050
years+0.048
chie+0.048
run+0.046
Neg logit contributions
 idea-0.045
 people-0.038
crow-0.037
 understand-0.037
 bal-0.035
Tokenody
Token ID1118
Feature activation+2.89e-02

Pos logit contributions
ifying+0.144
�+0.113
igsaw+0.111
 hind+0.109
run+0.106
Neg logit contributions
 idea-0.095
 people-0.086
aw-0.085
crow-0.085
 thing-0.080
Token was
Token ID373
Feature activation-6.86e-03

Pos logit contributions
ifying+0.054
igsaw+0.045
years+0.042
chie+0.041
run+0.037
Neg logit contributions
 idea-0.046
 bal-0.038
crow-0.036
 understand-0.036
 people-0.035
Token very
Token ID845
Feature activation-5.35e-03

Pos logit contributions
 idea+0.011
crow+0.010
 bal+0.010
 doctors+0.010
 cows+0.009
Neg logit contributions
ifying-0.011
years-0.009
igsaw-0.009
chie-0.009
jamin-0.008
Token excited
Token ID6568
Feature activation+8.19e-03

Pos logit contributions
 idea+0.006
crow+0.005
 doctors+0.005
 bal+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.010
chie-0.009
jamin-0.009
Token and
Token ID290
Feature activation+5.78e-03

Pos logit contributions
ifying+0.018
igsaw+0.016
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.010
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
ifying+0.098
chie+0.086
years+0.085
igsaw+0.079
 brief+0.073
Neg logit contributions
 idea-0.045
 People-0.036
 But-0.035
 bal-0.035
 whenever-0.035
Token
Token ID198
Feature activation+2.76e-02

Pos logit contributions
ifying+0.044
igsaw+0.037
years+0.037
chie+0.036
 hips+0.033
Neg logit contributions
 idea-0.021
 bal-0.016
crow-0.016
 understand-0.015
 people-0.015
TokenThe
Token ID464
Feature activation+8.60e-03

Pos logit contributions
ifying+0.049
igsaw+0.040
chie+0.039
years+0.038
oured+0.036
Neg logit contributions
 idea-0.041
 bal-0.038
crow-0.036
 understand-0.036
 doctors-0.035
Token monster
Token ID9234
Feature activation-3.74e-03

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.015
chie+0.015
 hips+0.014
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 people-0.008
Token thought
Token ID1807
Feature activation+6.75e-03

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token for
Token ID329
Feature activation+7.12e-02

Pos logit contributions
ifying+0.017
igsaw+0.014
years+0.014
chie+0.014
jamin+0.013
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.004
 people-0.004
Token a
Token ID257
Feature activation+2.48e-02

Pos logit contributions
ifying+0.189
chie+0.166
igsaw+0.159
run+0.158
years+0.155
Neg logit contributions
 idea-0.070
 time-0.061
 people-0.049
 it-0.047
 understand-0.044
Token moment
Token ID2589
Feature activation-2.62e-03

Pos logit contributions
ifying+0.048
years+0.041
igsaw+0.039
chie+0.036
jamin+0.035
Neg logit contributions
 idea-0.039
 understand-0.031
 bal-0.031
 time-0.031
 day-0.030
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 idea+0.003
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.002
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.004
 hips-0.004
Token then
Token ID788
Feature activation-1.30e-02

Pos logit contributions
ifying+0.027
igsaw+0.023
chie+0.022
years+0.022
 hips+0.020
Neg logit contributions
 idea-0.017
 bal-0.014
crow-0.014
 doctors-0.013
 people-0.013
Token said
Token ID531
Feature activation-5.09e-03

Pos logit contributions
 idea+0.021
crow+0.017
 bal+0.017
 doctors+0.017
 cows+0.016
Neg logit contributions
ifying-0.022
igsaw-0.018
years-0.018
chie-0.017
jamin-0.017
Token nice
Token ID3621
Feature activation+6.75e-03

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token.
Token ID13
Feature activation+6.09e-03

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.012
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 people-0.006
Token Some
Token ID2773
Feature activation+2.33e-02

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token are
Token ID389
Feature activation+1.64e-02

Pos logit contributions
ifying+0.052
igsaw+0.046
years+0.045
chie+0.043
jamin+0.042
Neg logit contributions
 idea-0.027
 people-0.022
crow-0.022
 doctors-0.021
 cows-0.021
Token black
Token ID2042
Feature activation+1.93e-02

Pos logit contributions
ifying+0.033
igsaw+0.029
years+0.028
chie+0.027
jamin+0.025
Neg logit contributions
 idea-0.022
 doctors-0.018
 bal-0.018
 cows-0.018
crow-0.018
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
ifying+0.046
igsaw+0.040
years+0.039
chie+0.039
jamin+0.036
Neg logit contributions
 idea-0.020
crow-0.015
 bal-0.015
 people-0.015
 doctors-0.015
Token disgusting
Token ID23374
Feature activation-1.03e-03

Pos logit contributions
ifying+0.145
igsaw+0.142
enny+0.134
jamin+0.132
oured+0.131
Neg logit contributions
 idea-0.074
 wheat-0.074
 people-0.071
 cows-0.070
 doctors-0.068
Token and
Token ID290
Feature activation+5.73e-02

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token mean
Token ID1612
Feature activation-1.56e-02

Pos logit contributions
ifying+0.120
igsaw+0.114
years+0.105
jamin+0.104
oured+0.103
Neg logit contributions
 idea-0.066
 people-0.059
 doctors-0.057
 wheat-0.056
 cows-0.055
Token."
Token ID526
Feature activation+2.57e-03

Pos logit contributions
 idea+0.020
 bal+0.016
crow+0.016
 doctors+0.015
 pilot+0.015
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.025
chie-0.025
 hips-0.024
Token Mia
Token ID32189
Feature activation-2.08e-03

Pos logit contributions
ifying+0.004
igsaw+0.003
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token here
Token ID994
Feature activation+1.31e-03

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.005
Token.
Token ID13
Feature activation+3.16e-02

Pos logit contributions
ifying+0.003
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token
Token ID220
Feature activation-3.17e-04

Pos logit contributions
ifying+0.063
chie+0.056
years+0.055
igsaw+0.052
 seeking+0.048
Neg logit contributions
 idea-0.041
 People-0.035
 whenever-0.033
 people-0.033
 bal-0.032
Token
Token ID198
Feature activation+2.22e-02

Pos logit contributions
 idea+0.000
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.000
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
ifying+0.052
chie+0.045
igsaw+0.044
years+0.043
 hips+0.040
Neg logit contributions
 idea-0.023
 understand-0.018
 people-0.017
 bal-0.017
 doctors-0.016
TokenEm
Token ID10161
Feature activation+7.05e-02

Pos logit contributions
ifying+0.023
igsaw+0.018
chie+0.018
years+0.017
 hips+0.016
Neg logit contributions
 idea-0.025
 bal-0.022
 doctors-0.021
 people-0.021
 understand-0.021
Tokenma
Token ID2611
Feature activation+3.20e-03

Pos logit contributions
ifying+0.166
 hind+0.141
igsaw+0.136
run+0.129
chie+0.129
Neg logit contributions
 idea-0.060
 day-0.056
 time-0.054
est-0.051
 people-0.048
Token had
Token ID550
Feature activation+1.58e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token a
Token ID257
Feature activation+1.76e-02

Pos logit contributions
ifying+0.031
igsaw+0.027
years+0.025
chie+0.024
 hips+0.023
Neg logit contributions
 idea-0.023
 bal-0.018
 doctors-0.018
 people-0.018
crow-0.018
Token great
Token ID1049
Feature activation-2.21e-02

Pos logit contributions
ifying+0.039
years+0.034
igsaw+0.033
chie+0.032
 hips+0.030
Neg logit contributions
 idea-0.022
 bal-0.016
 doctors-0.015
 understand-0.015
 pilot-0.015
Token time
Token ID640
Feature activation-8.05e-03

Pos logit contributions
ighters+0.014
crow+0.014
cut+0.013
 doctors+0.013
 idea+0.013
Neg logit contributions
ifying-0.053
igsaw-0.052
jamin-0.046
�-0.045
chie-0.045
Token place
Token ID1295
Feature activation+2.39e-02

Pos logit contributions
 idea+0.036
ighters+0.034
 bal+0.034
 doctors+0.032
crow+0.031
Neg logit contributions
ifying-0.073
igsaw-0.067
chie-0.062
 seeking-0.061
jamin-0.061
Token.
Token ID13
Feature activation+3.43e-02

Pos logit contributions
ifying+0.050
igsaw+0.043
years+0.043
chie+0.041
jamin+0.039
Neg logit contributions
 idea-0.031
 bal-0.023
crow-0.023
 people-0.022
 doctors-0.022
Token
Token ID220
Feature activation-1.47e-03

Pos logit contributions
ifying+0.075
chie+0.069
years+0.067
igsaw+0.064
 brief+0.060
Neg logit contributions
 idea-0.035
 People-0.030
 whenever-0.028
 Everyone-0.027
 bal-0.026
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
ifying+0.027
igsaw+0.023
years+0.023
chie+0.023
 hips+0.021
Neg logit contributions
 idea-0.013
 bal-0.010
crow-0.010
 doctors-0.010
 people-0.010
TokenEm
Token ID10161
Feature activation+7.03e-02

Pos logit contributions
ifying+0.033
igsaw+0.027
chie+0.026
years+0.026
 hips+0.025
Neg logit contributions
 idea-0.023
 bal-0.019
 doctors-0.019
crow-0.019
 people-0.019
Tokenma
Token ID2611
Feature activation+1.65e-02

Pos logit contributions
ifying+0.169
igsaw+0.141
 hind+0.138
lic+0.130
run+0.128
Neg logit contributions
 idea-0.059
 time-0.052
 day-0.049
ying-0.047
 people-0.047
Token was
Token ID373
Feature activation-8.53e-04

Pos logit contributions
ifying+0.034
igsaw+0.029
years+0.027
chie+0.027
 hips+0.025
Neg logit contributions
 idea-0.022
 bal-0.018
crow-0.018
 doctors-0.017
 people-0.017
Token so
Token ID523
Feature activation+9.24e-03

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token proud
Token ID6613
Feature activation-7.71e-04

Pos logit contributions
ifying+0.022
igsaw+0.019
years+0.018
chie+0.018
 hips+0.017
Neg logit contributions
 idea-0.010
 bal-0.007
crow-0.007
 doctors-0.007
 understand-0.007
Token of
Token ID286
Feature activation+3.33e-02

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token!"
Token ID2474
Feature activation-1.14e-02

Pos logit contributions
ifying+0.003
igsaw+0.003
years+0.003
chie+0.003
 hips+0.002
Neg logit contributions
 idea-0.002
crow-0.002
 bal-0.002
 doctors-0.002
 cows-0.001
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
 idea+0.019
crow+0.016
cut+0.015
 bal+0.015
 doctors+0.015
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
 hips-0.015
jamin-0.014
Token
Token ID198
Feature activation-2.69e-03

Pos logit contributions
 idea+0.028
crow+0.023
 bal+0.023
 doctors+0.023
 pilot+0.022
Neg logit contributions
ifying-0.027
igsaw-0.022
years-0.021
 hips-0.020
jamin-0.020
TokenMom
Token ID29252
Feature activation+3.30e-02

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token thought
Token ID1807
Feature activation-1.51e-02

Pos logit contributions
ifying+0.067
igsaw+0.057
chie+0.052
years+0.048
�+0.044
Neg logit contributions
 idea-0.049
 bal-0.043
 doctors-0.042
 people-0.041
 understand-0.040
Token for
Token ID329
Feature activation+7.02e-02

Pos logit contributions
 idea+0.019
 bal+0.017
crow+0.017
 pilot+0.016
cut+0.016
Neg logit contributions
ifying-0.028
igsaw-0.025
years-0.024
run-0.023
chie-0.023
Token a
Token ID257
Feature activation+3.65e-02

Pos logit contributions
ifying+0.185
chie+0.173
run+0.169
igsaw+0.162
ll+0.157
Neg logit contributions
 idea-0.057
 doctors-0.054
 people-0.049
 time-0.048
 sun-0.039
Token moment
Token ID2589
Feature activation+5.03e-03

Pos logit contributions
ifying+0.079
years+0.071
chie+0.063
igsaw+0.062
oured+0.058
Neg logit contributions
 idea-0.055
 understand-0.042
 time-0.042
 day-0.041
 thing-0.040
Token.
Token ID13
Feature activation-3.38e-03

Pos logit contributions
ifying+0.011
igsaw+0.010
years+0.009
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.004
 doctors-0.004
 cows-0.004
Token Then
Token ID3244
Feature activation-4.70e-03

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
cut+0.004
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token she
Token ID673
Feature activation+1.60e-02

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
cut+0.005
 doctors+0.005
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.008
chie-0.007
 hips-0.007
Token think
Token ID892
Feature activation+5.70e-05

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token you
Token ID345
Feature activation+4.82e-03

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token're
Token ID821
Feature activation-2.02e-02

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token doing
Token ID1804
Feature activation-1.71e-02

Pos logit contributions
 idea+0.029
 bal+0.027
crow+0.027
 pilot+0.025
 paramedics+0.025
Neg logit contributions
ifying-0.035
igsaw-0.029
years-0.028
 seeking-0.028
chie-0.027
Token?!"
Token ID30823
Feature activation+2.49e-02

Pos logit contributions
 idea+0.025
crow+0.022
 bal+0.022
 paramedics+0.021
 doctors+0.021
Neg logit contributions
ifying-0.031
igsaw-0.026
years-0.024
chie-0.023
jamin-0.023
Token Fox
Token ID5426
Feature activation+7.02e-02

Pos logit contributions
ifying+0.056
igsaw+0.049
years+0.048
chie+0.047
 hips+0.044
Neg logit contributions
 idea-0.028
crow-0.024
 people-0.022
 bal-0.021
 doctors-0.021
Tokeny
Token ID88
Feature activation+5.31e-02

Pos logit contributions
ifying+0.132
igsaw+0.125
�+0.107
years+0.106
chie+0.098
Neg logit contributions
 idea-0.100
crow-0.088
 understand-0.083
cut-0.080
 people-0.077
Token looked
Token ID3114
Feature activation-2.53e-03

Pos logit contributions
ifying+0.113
igsaw+0.099
years+0.090
chie+0.086
run+0.079
Neg logit contributions
 idea-0.073
 people-0.058
crow-0.058
 understand-0.057
 bal-0.057
Token up
Token ID510
Feature activation+5.73e-03

Pos logit contributions
 idea+0.003
 bal+0.002
 doctors+0.002
crow+0.002
 cows+0.002
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token and
Token ID290
Feature activation+3.30e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 people-0.005
Token saw
Token ID2497
Feature activation+1.65e-02

Pos logit contributions
ifying+0.074
igsaw+0.067
chie+0.059
years+0.058
�+0.056
Neg logit contributions
 idea-0.037
 people-0.031
crow-0.030
 cows-0.030
 bal-0.030
Token us
Token ID514
Feature activation+1.65e-02

Pos logit contributions
ifying+0.042
igsaw+0.035
chie+0.034
years+0.034
jamin+0.032
Neg logit contributions
 idea-0.031
 people-0.027
 doctors-0.027
 cows-0.026
crow-0.025
Token.
Token ID13
Feature activation-2.44e-04

Pos logit contributions
ifying+0.038
igsaw+0.033
years+0.032
chie+0.032
jamin+0.030
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.013
 people-0.013
 doctors-0.013
Token We
Token ID775
Feature activation-2.79e-02

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token are
Token ID389
Feature activation-1.39e-02

Pos logit contributions
 idea+0.050
 doctors+0.047
 cows+0.046
 paramedics+0.046
crow+0.045
Neg logit contributions
ifying-0.042
chie-0.031
years-0.028
igsaw-0.028
 total-0.026
Token kings
Token ID21557
Feature activation-6.86e-05

Pos logit contributions
 idea+0.023
crow+0.021
 bal+0.020
cut+0.019
 understand+0.019
Neg logit contributions
ifying-0.022
years-0.019
igsaw-0.018
chie-0.017
 seeking-0.017
Token and
Token ID290
Feature activation+6.99e-02

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token queens
Token ID38771
Feature activation+1.78e-02

Pos logit contributions
ifying+0.113
igsaw+0.098
 hips+0.092
�+0.091
jamin+0.089
Neg logit contributions
 people-0.130
 doctors-0.123
 pilot-0.116
 even-0.114
 cows-0.113
Token of
Token ID286
Feature activation+3.12e-02

Pos logit contributions
ifying+0.041
igsaw+0.036
years+0.036
chie+0.035
jamin+0.033
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.014
 people-0.014
 doctors-0.013
Token this
Token ID428
Feature activation-8.20e-04

Pos logit contributions
ifying+0.074
igsaw+0.064
jamin+0.061
chie+0.061
years+0.060
Neg logit contributions
 idea-0.032
 people-0.029
 doctors-0.026
 cows-0.026
crow-0.023
Token castle
Token ID16669
Feature activation+9.71e-03

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.018
chie+0.018
jamin+0.017
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 people-0.008
Token does
Token ID857
Feature activation+1.55e-02

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.017
chie+0.017
jamin+0.016
Neg logit contributions
 idea-0.012
 bal-0.010
crow-0.009
 doctors-0.009
 people-0.009
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
ifying+0.032
igsaw+0.028
years+0.028
chie+0.026
jamin+0.025
Neg logit contributions
 idea-0.021
 people-0.016
 doctors-0.016
 understand-0.016
 bal-0.016
Token She
Token ID1375
Feature activation-9.96e-04

Pos logit contributions
ifying+0.003
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token learned
Token ID4499
Feature activation+1.89e-02

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.001
jamin-0.001
Token that
Token ID326
Feature activation-3.15e-02

Pos logit contributions
ifying+0.044
igsaw+0.037
chie+0.037
years+0.036
jamin+0.034
Neg logit contributions
 idea-0.022
 doctors-0.017
 people-0.017
 bal-0.016
 understand-0.016
Token birth
Token ID4082
Feature activation+6.94e-02

Pos logit contributions
 idea+0.049
crow+0.044
 bal+0.040
 paramedics+0.039
aw+0.038
Neg logit contributions
ifying-0.049
years-0.043
igsaw-0.042
 seeking-0.042
 hips-0.038
Tokendays
Token ID12545
Feature activation+2.50e-02

Pos logit contributions
ifying+0.114
igsaw+0.107
 hips+0.096
 hind+0.095
chie+0.095
Neg logit contributions
 idea-0.105
 if-0.097
 understand-0.096
 whenever-0.092
 thing-0.089
Token are
Token ID389
Feature activation+1.73e-03

Pos logit contributions
ifying+0.050
igsaw+0.042
years+0.042
chie+0.040
jamin+0.038
Neg logit contributions
 idea-0.036
 people-0.029
crow-0.028
 doctors-0.028
 bal-0.028
Token not
Token ID407
Feature activation-5.59e-03

Pos logit contributions
ifying+0.004
igsaw+0.003
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token about
Token ID546
Feature activation+2.27e-02

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.009
chie-0.009
jamin-0.009
Token cakes
Token ID30849
Feature activation+6.59e-03

Pos logit contributions
ifying+0.047
igsaw+0.039
years+0.038
chie+0.037
jamin+0.034
Neg logit contributions
 idea-0.032
 people-0.027
 doctors-0.026
 cows-0.025
 bal-0.025
Token time
Token ID640
Feature activation+1.99e-02

Pos logit contributions
ifying+0.043
years+0.038
igsaw+0.038
chie+0.038
 hips+0.035
Neg logit contributions
 idea-0.005
 bal-0.001
crow-0.000
 time-0.000
 doctors+0.000
Token there
Token ID612
Feature activation-2.29e-02

Pos logit contributions
ifying+0.050
years+0.042
igsaw+0.042
chie+0.041
jamin+0.039
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.013
 people-0.012
 doctors-0.012
Token were
Token ID547
Feature activation+1.09e-02

Pos logit contributions
 idea+0.041
 doctors+0.038
 bal+0.037
 paramedics+0.036
 cows+0.036
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.023
jamin-0.023
�-0.022
Token two
Token ID734
Feature activation+1.51e-02

Pos logit contributions
ifying+0.019
igsaw+0.015
years+0.014
chie+0.014
 hips+0.013
Neg logit contributions
 idea-0.018
 bal-0.015
 doctors-0.015
crow-0.015
 cows-0.015
Token friends
Token ID2460
Feature activation+2.20e-02

Pos logit contributions
ifying+0.035
igsaw+0.031
years+0.029
chie+0.029
jamin+0.028
Neg logit contributions
 idea-0.017
 doctors-0.013
 cows-0.012
 people-0.012
 bal-0.012
Token,
Token ID11
Feature activation+6.93e-02

Pos logit contributions
ifying+0.055
igsaw+0.048
chie+0.045
years+0.045
jamin+0.042
Neg logit contributions
 idea-0.021
 doctors-0.016
crow-0.016
 people-0.016
 cows-0.015
Token Bob
Token ID5811
Feature activation+1.31e-02

Pos logit contributions
ifying+0.164
chie+0.144
igsaw+0.142
years+0.135
ll+0.134
Neg logit contributions
 idea-0.067
 cows-0.058
 time-0.051
 people-0.051
 doctors-0.051
Token and
Token ID290
Feature activation+3.46e-02

Pos logit contributions
ifying+0.031
igsaw+0.027
years+0.026
chie+0.026
 hips+0.024
Neg logit contributions
 idea-0.014
 cows-0.010
crow-0.010
 bal-0.010
 doctors-0.010
Token Jack
Token ID3619
Feature activation+2.09e-02

Pos logit contributions
ifying+0.079
igsaw+0.063
years+0.062
chie+0.060
oured+0.057
Neg logit contributions
 idea-0.045
crow-0.035
 doctors-0.035
 bal-0.035
 people-0.035
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
ifying+0.053
igsaw+0.046
years+0.044
chie+0.044
 hips+0.042
Neg logit contributions
 idea-0.020
 cows-0.013
 bal-0.013
crow-0.013
 people-0.013
Token who
Token ID508
Feature activation-1.27e-02

Pos logit contributions
ifying+0.054
igsaw+0.047
chie+0.045
years+0.044
 hips+0.040
Neg logit contributions
 idea-0.040
 cows-0.032
 bal-0.032
 doctors-0.032
 people-0.031
Token ribbon
Token ID29092
Feature activation+2.23e-02

Pos logit contributions
ifying+0.063
years+0.057
igsaw+0.054
chie+0.053
ll+0.048
Neg logit contributions
 idea-0.048
 bal-0.039
 thing-0.036
 people-0.035
 pilot-0.034
Token for
Token ID329
Feature activation+3.00e-02

Pos logit contributions
ifying+0.049
igsaw+0.044
years+0.043
chie+0.042
jamin+0.039
Neg logit contributions
 idea-0.025
 bal-0.020
crow-0.018
 people-0.018
 cows-0.017
Token being
Token ID852
Feature activation+1.75e-02

Pos logit contributions
ifying+0.070
igsaw+0.059
chie+0.056
years+0.055
jamin+0.053
Neg logit contributions
 idea-0.034
 people-0.028
 doctors-0.026
 bal-0.025
crow-0.024
Token the
Token ID262
Feature activation-2.63e-02

Pos logit contributions
ifying+0.034
igsaw+0.028
years+0.026
chie+0.026
 hips+0.025
Neg logit contributions
 idea-0.026
 bal-0.021
crow-0.021
 doctors-0.021
 people-0.021
Token best
Token ID1266
Feature activation-1.90e-02

Pos logit contributions
 idea+0.033
crow+0.030
 bal+0.028
 paramedics+0.028
 cows+0.028
Neg logit contributions
ifying-0.051
igsaw-0.042
chie-0.042
years-0.040
jamin-0.039
Token bal
Token ID3652
Feature activation+6.93e-02

Pos logit contributions
 idea+0.022
crow+0.020
cut+0.020
 cows+0.019
 doctors+0.019
Neg logit contributions
ifying-0.038
igsaw-0.035
chie-0.031
run-0.030
years-0.030
Tokenancer
Token ID8250
Feature activation+1.30e-02

Pos logit contributions
ifying+0.122
igsaw+0.120
 hind+0.111
 hips+0.108
 unison+0.106
Neg logit contributions
ying-0.082
 bal-0.080
 if-0.073
crow-0.072
 idea-0.071
Token in
Token ID287
Feature activation+1.97e-02

Pos logit contributions
ifying+0.027
igsaw+0.023
years+0.022
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.017
 bal-0.013
crow-0.013
 doctors-0.013
 cows-0.013
Token the
Token ID262
Feature activation+9.01e-03

Pos logit contributions
ifying+0.043
igsaw+0.037
years+0.037
chie+0.036
 hips+0.034
Neg logit contributions
 idea-0.023
crow-0.018
 people-0.018
 bal-0.017
 doctors-0.017
Token competition
Token ID5449
Feature activation-1.29e-03

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.018
chie+0.017
 hips+0.017
Neg logit contributions
 idea-0.010
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
 idea+0.002
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token Every
Token ID3887
Feature activation-1.22e-02

Pos logit contributions
 idea+0.037
 bal+0.032
 doctors+0.030
crow+0.030
cut+0.030
Neg logit contributions
ifying-0.041
igsaw-0.033
jamin-0.031
years-0.031
 hips-0.030
Token day
Token ID1110
Feature activation-2.93e-02

Pos logit contributions
 idea+0.007
 doctors+0.005
 bal+0.005
crow+0.005
 paramedics+0.004
Neg logit contributions
ifying-0.032
igsaw-0.029
years-0.027
chie-0.027
jamin-0.027
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
 idea+0.050
cut+0.039
 paramedics+0.038
 understand+0.038
 doctors+0.037
Neg logit contributions
ifying-0.052
years-0.039
igsaw-0.036
 eight-0.036
 hips-0.034
Token the
Token ID262
Feature activation-3.20e-02

Pos logit contributions
 idea+0.028
crow+0.023
 bal+0.023
 doctors+0.022
 understand+0.022
Neg logit contributions
ifying-0.026
years-0.020
igsaw-0.020
chie-0.018
 hips-0.018
Token family
Token ID1641
Feature activation-8.38e-02

Pos logit contributions
 idea+0.042
ook+0.039
ighters+0.038
crow+0.038
ying+0.037
Neg logit contributions
ifying-0.059
chie-0.045
jamin-0.044
�-0.044
years-0.044
Token would
Token ID561
Feature activation-2.13e-02

Pos logit contributions
 idea+0.168
 impression+0.152
 decisions+0.149
ainer+0.148
eeper+0.148
Neg logit contributions
ll-0.074
o-0.063
m-0.061
ifying-0.060
 looked-0.058
Token give
Token ID1577
Feature activation-1.65e-02

Pos logit contributions
 idea+0.029
crow+0.025
 bal+0.024
 doctors+0.024
ighters+0.022
Neg logit contributions
ifying-0.040
igsaw-0.032
chie-0.032
years-0.032
jamin-0.031
Token Strip
Token ID18508
Feature activation+2.20e-02

Pos logit contributions
 idea+0.027
 bal+0.022
crow+0.021
cut+0.021
 doctors+0.020
Neg logit contributions
ifying-0.028
years-0.024
igsaw-0.022
chie-0.021
jamin-0.021
Token lots
Token ID6041
Feature activation-1.62e-04

Pos logit contributions
ifying+0.034
igsaw+0.028
years+0.026
chie+0.025
 hips+0.023
Neg logit contributions
 idea-0.039
 bal-0.035
 cows-0.035
 people-0.035
crow-0.035
Token of
Token ID286
Feature activation-1.99e-02

Pos logit contributions
 idea+0.000
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 idea+0.024
 bal+0.020
crow+0.019
 understand+0.019
 doctors+0.019
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.025
chie-0.024
 seeking-0.023
Token Every
Token ID3887
Feature activation-1.86e-02

Pos logit contributions
 idea+0.021
 bal+0.019
 doctors+0.018
cut+0.018
crow+0.018
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.018
jamin-0.017
 hips-0.017
Token day
Token ID1110
Feature activation-3.43e-02

Pos logit contributions
 idea+0.012
ighters+0.010
 bal+0.010
 paramedics+0.009
 doctors+0.009
Neg logit contributions
ifying-0.045
igsaw-0.041
chie-0.038
years-0.038
jamin-0.037
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
 idea+0.064
cut+0.058
 impression+0.056
 paramedics+0.056
 understand+0.054
Neg logit contributions
ifying-0.050
 eight-0.033
years-0.033
igsaw-0.031
�-0.029
Token the
Token ID262
Feature activation-4.12e-02

Pos logit contributions
 idea+0.026
 bal+0.022
crow+0.022
cut+0.021
 paramedics+0.020
Neg logit contributions
ifying-0.030
years-0.024
igsaw-0.024
chie-0.023
 hips-0.021
Token mail
Token ID6920
Feature activation-8.30e-02

Pos logit contributions
crow+0.050
 bal+0.050
 idea+0.047
ook+0.042
ighters+0.042
Neg logit contributions
ifying-0.075
igsaw-0.060
 total-0.059
chie-0.059
�-0.059
Tokenman
Token ID805
Feature activation-3.12e-02

Pos logit contributions
ubs+0.147
 mischief+0.147
dd+0.136
 impression+0.135
 exception+0.130
Neg logit contributions
work-0.080
ifying-0.080
 looked-0.073
 Claus-0.067
fly-0.064
Token would
Token ID561
Feature activation-2.94e-02

Pos logit contributions
 idea+0.050
 bal+0.042
crow+0.042
 paramedics+0.040
 doctors+0.038
Neg logit contributions
ifying-0.052
years-0.042
chie-0.038
 seeking-0.037
jamin-0.037
Token put
Token ID1234
Feature activation-2.57e-02

Pos logit contributions
 idea+0.044
crow+0.042
 bal+0.042
 doctors+0.039
 cows+0.038
Neg logit contributions
ifying-0.048
chie-0.038
igsaw-0.038
 hips-0.037
years-0.036
Token letters
Token ID7475
Feature activation-2.03e-02

Pos logit contributions
 idea+0.038
 bal+0.034
crow+0.033
 understand+0.032
 paramedics+0.031
Neg logit contributions
ifying-0.047
years-0.036
igsaw-0.036
chie-0.034
 hips-0.033
Token inside
Token ID2641
Feature activation-1.17e-02

Pos logit contributions
 idea+0.033
crow+0.029
 bal+0.027
 doctors+0.027
 paramedics+0.026
Neg logit contributions
ifying-0.037
years-0.028
igsaw-0.027
chie-0.027
 hips-0.026
Token So
Token ID1406
Feature activation-1.88e-02

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.005
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.006
jamin-0.006
Token every
Token ID790
Feature activation-2.24e-02

Pos logit contributions
 idea+0.028
crow+0.025
ighters+0.025
 bal+0.024
 paramedics+0.024
Neg logit contributions
ifying-0.032
igsaw-0.025
years-0.025
 hips-0.023
chie-0.023
Token evening
Token ID6180
Feature activation-5.87e-02

Pos logit contributions
 idea+0.016
ighters+0.013
crow+0.012
 paramedics+0.011
 bal+0.011
Neg logit contributions
ifying-0.055
igsaw-0.050
years-0.046
jamin-0.045
run-0.044
Token,
Token ID11
Feature activation-2.30e-02

Pos logit contributions
 idea+0.114
ighters+0.101
adders+0.095
its+0.095
 itch+0.095
Neg logit contributions
ifying-0.071
igsaw-0.047
 pawn-0.046
va-0.045
 bit-0.045
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
 idea+0.039
crow+0.032
 bal+0.032
ook+0.031
ying+0.031
Neg logit contributions
ifying-0.037
years-0.029
igsaw-0.028
chie-0.026
 hips-0.025
Token lawyer
Token ID6853
Feature activation-7.96e-02

Pos logit contributions
 idea+0.025
crow+0.022
 bal+0.021
ook+0.019
ighters+0.019
Neg logit contributions
ifying-0.032
years-0.028
igsaw-0.027
chie-0.025
 hips-0.023
Token would
Token ID561
Feature activation-1.65e-02

Pos logit contributions
 idea+0.138
ubs+0.123
crow+0.119
 paramedics+0.115
 People+0.114
Neg logit contributions
ifying-0.096
jamin-0.081
 bit-0.077
years-0.074
ll-0.074
Token go
Token ID467
Feature activation+1.90e-02

Pos logit contributions
 idea+0.022
crow+0.019
 bal+0.019
 doctors+0.018
 cows+0.018
Neg logit contributions
ifying-0.032
igsaw-0.026
chie-0.025
years-0.025
 hips-0.024
Token to
Token ID284
Feature activation+3.22e-03

Pos logit contributions
ifying+0.043
igsaw+0.036
years+0.035
chie+0.035
jamin+0.032
Neg logit contributions
 idea-0.024
 doctors-0.019
 people-0.018
 hospital-0.018
 bal-0.018
Token his
Token ID465
Feature activation+7.77e-03

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token big
Token ID1263
Feature activation-1.02e-02

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.008
 doctors-0.006
 bal-0.006
crow-0.006
 cows-0.006
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
 hips-0.009
Token Every
Token ID3887
Feature activation-1.38e-02

Pos logit contributions
 idea+0.044
 bal+0.040
 doctors+0.038
crow+0.038
 infection+0.037
Neg logit contributions
ifying-0.032
igsaw-0.026
�-0.024
jamin-0.024
 hips-0.023
Token day
Token ID1110
Feature activation-4.09e-02

Pos logit contributions
 idea+0.008
 bal+0.006
crow+0.006
 doctors+0.006
ighters+0.006
Neg logit contributions
ifying-0.036
igsaw-0.032
years-0.030
chie-0.030
jamin-0.030
Token,
Token ID11
Feature activation-1.86e-02

Pos logit contributions
 idea+0.083
ighters+0.073
 paramedics+0.070
 impression+0.069
adders+0.069
Neg logit contributions
ifying-0.052
 bit-0.035
 eight-0.034
�-0.031
years-0.030
Token the
Token ID262
Feature activation-2.38e-02

Pos logit contributions
 idea+0.031
crow+0.026
 bal+0.026
 understand+0.024
 paramedics+0.023
Neg logit contributions
ifying-0.031
years-0.024
igsaw-0.023
chie-0.022
�-0.021
Token mail
Token ID6920
Feature activation-7.87e-02

Pos logit contributions
 idea+0.029
ighters+0.026
crow+0.026
ook+0.025
 bal+0.024
Neg logit contributions
ifying-0.044
igsaw-0.037
years-0.035
�-0.035
 seeking-0.035
Tokenman
Token ID805
Feature activation-1.79e-02

Pos logit contributions
ubs+0.147
 mischief+0.142
 idea+0.139
ighters+0.134
 itch+0.134
Neg logit contributions
ifying-0.072
work-0.070
 Claus-0.065
fly-0.064
 recorder-0.051
Token would
Token ID561
Feature activation-1.27e-02

Pos logit contributions
 idea+0.026
 bal+0.022
crow+0.021
 doctors+0.020
 paramedics+0.020
Neg logit contributions
ifying-0.034
years-0.027
chie-0.026
igsaw-0.026
jamin-0.025
Token leave
Token ID2666
Feature activation-1.96e-02

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.015
 cows+0.014
Neg logit contributions
ifying-0.024
igsaw-0.020
chie-0.020
years-0.019
 hips-0.018
Token the
Token ID262
Feature activation-2.86e-02

Pos logit contributions
 idea+0.030
 bal+0.026
crow+0.023
 doctors+0.022
 paramedics+0.022
Neg logit contributions
ifying-0.036
years-0.030
igsaw-0.029
chie-0.028
 hips-0.027
Token mail
Token ID6920
Feature activation-4.39e-02

Pos logit contributions
 idea+0.040
crow+0.033
ighters+0.033
 bal+0.031
ying+0.031
Neg logit contributions
ifying-0.052
igsaw-0.043
years-0.042
 seeking-0.041
jamin-0.040
TokenL
Token ID43
Feature activation+9.26e-03

Pos logit contributions
 idea+0.000
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Tokenily
Token ID813
Feature activation+2.47e-03

Pos logit contributions
ifying+0.024
igsaw+0.021
years+0.021
chie+0.020
 hips+0.020
Neg logit contributions
 idea-0.008
crow-0.005
 bal-0.005
 doctors-0.005
 people-0.004
Token decided
Token ID3066
Feature activation-1.44e-02

Pos logit contributions
ifying+0.004
igsaw+0.004
years+0.004
chie+0.004
jamin+0.003
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token to
Token ID284
Feature activation-2.49e-02

Pos logit contributions
 idea+0.028
crow+0.024
 bal+0.024
cut+0.023
 cows+0.022
Neg logit contributions
ifying-0.019
igsaw-0.015
years-0.015
 hips-0.014
chie-0.014
Token keep
Token ID1394
Feature activation-5.74e-03

Pos logit contributions
 idea+0.036
 bal+0.032
crow+0.032
 doctors+0.031
 cows+0.030
Neg logit contributions
ifying-0.043
igsaw-0.036
chie-0.034
 hips-0.033
jamin-0.033
Token an
Token ID281
Feature activation-7.82e-02

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 understand+0.006
Neg logit contributions
ifying-0.011
years-0.009
igsaw-0.009
chie-0.009
 hips-0.008
Token eye
Token ID4151
Feature activation-5.59e-02

Pos logit contributions
 bal+0.090
car+0.081
 People+0.080
crow+0.080
ves+0.077
Neg logit contributions
chie-0.115
igsaw-0.114
years-0.112
ifying-0.111
oured-0.099
Token on
Token ID319
Feature activation-1.47e-02

Pos logit contributions
 idea+0.084
 People+0.072
 countryside+0.069
 rarely+0.069
 Vol+0.068
Neg logit contributions
ifying-0.103
 hips-0.072
igsaw-0.069
chie-0.069
years-0.068
Token the
Token ID262
Feature activation-3.78e-02

Pos logit contributions
 idea+0.021
 bal+0.018
crow+0.017
 understand+0.016
 pilot+0.016
Neg logit contributions
ifying-0.028
igsaw-0.023
years-0.022
 hips-0.021
chie-0.021
Token field
Token ID2214
Feature activation-1.16e-02

Pos logit contributions
 bal+0.037
 idea+0.033
crow+0.030
 clim+0.029
car+0.026
Neg logit contributions
ifying-0.083
chie-0.068
 hips-0.066
 freshly-0.065
�-0.062
Token to
Token ID284
Feature activation-4.84e-03

Pos logit contributions
 idea+0.016
 bal+0.013
crow+0.013
 doctors+0.013
 cows+0.012
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.019
chie-0.018
 hips-0.018
Token inside
Token ID2641
Feature activation-9.37e-03

Pos logit contributions
 idea+0.036
 bal+0.034
crow+0.033
 paramedics+0.032
 understand+0.031
Neg logit contributions
ifying-0.048
 seeking-0.042
igsaw-0.041
chie-0.040
years-0.039
Token it
Token ID340
Feature activation+5.95e-03

Pos logit contributions
 idea+0.013
 bal+0.011
crow+0.010
 doctors+0.010
 pilot+0.010
Neg logit contributions
ifying-0.019
igsaw-0.015
years-0.015
chie-0.015
 hips-0.014
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 people-0.005
Token so
Token ID523
Feature activation-1.70e-02

Pos logit contributions
 idea+0.028
crow+0.025
 bal+0.025
 doctors+0.023
cut+0.023
Neg logit contributions
ifying-0.021
years-0.016
 hips-0.015
igsaw-0.015
chie-0.015
Token she
Token ID673
Feature activation+1.24e-03

Pos logit contributions
 idea+0.027
crow+0.024
cut+0.022
 bal+0.022
 paramedics+0.021
Neg logit contributions
ifying-0.029
years-0.024
igsaw-0.022
 hips-0.021
chie-0.021
Token let
Token ID1309
Feature activation-7.73e-02

Pos logit contributions
ifying+0.003
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token herself
Token ID5223
Feature activation-4.55e-03

Pos logit contributions
cut+0.131
ooks+0.128
 bal+0.123
 impression+0.122
aw+0.120
Neg logit contributions
ifying-0.069
 front-0.068
 looked-0.060
 Rose-0.060
chie-0.058
Token in
Token ID287
Feature activation+1.07e-02

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.006
chie-0.006
 hips-0.006
Token.
Token ID13
Feature activation-9.10e-03

Pos logit contributions
ifying+0.024
igsaw+0.021
years+0.020
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.013
crow-0.010
 doctors-0.010
 people-0.009
 bal-0.009
Token She
Token ID1375
Feature activation+4.77e-03

Pos logit contributions
 idea+0.015
 doctors+0.012
 bal+0.012
crow+0.012
 pilot+0.012
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.012
chie-0.012
 hips-0.012
Token stepped
Token ID10764
Feature activation+6.01e-03

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token They
Token ID1119
Feature activation-2.98e-04

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
cut+0.006
Neg logit contributions
ifying-0.009
igsaw-0.007
years-0.007
chie-0.006
jamin-0.006
Token found
Token ID1043
Feature activation-1.50e-02

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token a
Token ID257
Feature activation-1.53e-03

Pos logit contributions
 idea+0.026
 bal+0.024
crow+0.023
cut+0.022
 understand+0.021
Neg logit contributions
ifying-0.024
years-0.018
igsaw-0.017
chie-0.016
jamin-0.015
Token hotel
Token ID7541
Feature activation+1.84e-03

Pos logit contributions
 idea+0.002
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token and
Token ID290
Feature activation+4.22e-03

Pos logit contributions
ifying+0.004
igsaw+0.003
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token let
Token ID1309
Feature activation-7.71e-02

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token themselves
Token ID2405
Feature activation-6.38e-03

Pos logit contributions
cut+0.135
 bal+0.121
 idea+0.113
opl+0.109
 clim+0.107
Neg logit contributions
ifying-0.097
chie-0.070
 front-0.069
 looks-0.069
 Alex-0.066
Token into
Token ID656
Feature activation+1.21e-02

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.005
 doctors+0.005
cut+0.005
Neg logit contributions
ifying-0.014
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
ifying+0.025
igsaw+0.022
years+0.021
chie+0.021
jamin+0.020
Neg logit contributions
 idea-0.016
 doctors-0.012
 people-0.012
crow-0.012
 cows-0.012
Token lobby
Token ID10866
Feature activation-4.95e-03

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.012
 understand+0.011
cut+0.011
Neg logit contributions
ifying-0.020
igsaw-0.016
years-0.016
chie-0.015
 hips-0.014
Token.
Token ID13
Feature activation-8.60e-03

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.010
igsaw-0.009
years-0.008
chie-0.008
 hips-0.008
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
 idea+0.009
crow+0.008
 bal+0.008
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token Every
Token ID3887
Feature activation-1.87e-02

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 doctors+0.009
 pilot+0.008
Neg logit contributions
ifying-0.013
igsaw-0.011
years-0.010
chie-0.010
 hips-0.010
Token day
Token ID1110
Feature activation-3.32e-02

Pos logit contributions
 idea+0.012
 bal+0.010
 paramedics+0.010
crow+0.010
ighters+0.009
Neg logit contributions
ifying-0.046
igsaw-0.041
years-0.038
jamin-0.038
chie-0.037
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
 idea+0.062
cut+0.054
 impression+0.053
 understand+0.052
 paramedics+0.052
Neg logit contributions
ifying-0.051
 eight-0.034
years-0.033
igsaw-0.031
 total-0.031
Token the
Token ID262
Feature activation-2.47e-02

Pos logit contributions
 idea+0.019
crow+0.016
 bal+0.016
cut+0.015
 understand+0.015
Neg logit contributions
ifying-0.024
years-0.019
igsaw-0.019
chie-0.019
 hips-0.017
Token clock
Token ID8801
Feature activation-7.71e-02

Pos logit contributions
 idea+0.033
crow+0.032
 bal+0.029
ook+0.027
 paramedics+0.026
Neg logit contributions
ifying-0.046
igsaw-0.037
years-0.036
 hips-0.035
chie-0.034
Token went
Token ID1816
Feature activation+6.71e-03

Pos logit contributions
 idea+0.135
ubs+0.131
ooks+0.125
 paramedics+0.122
ook+0.122
Neg logit contributions
ifying-0.083
work-0.071
 bit-0.070
 looked-0.066
fly-0.065
Token "
Token ID366
Feature activation-8.45e-03

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.009
 bal-0.007
 doctors-0.007
crow-0.007
 cows-0.007
Tokentick
Token ID42298
Feature activation-1.35e-02

Pos logit contributions
 idea+0.014
 doctors+0.011
crow+0.011
 bal+0.011
 cows+0.010
Neg logit contributions
ifying-0.015
years-0.012
igsaw-0.012
chie-0.012
jamin-0.011
Token-
Token ID12
Feature activation-9.76e-03

Pos logit contributions
 idea+0.018
 bal+0.015
 doctors+0.014
 paramedics+0.014
crow+0.014
Neg logit contributions
ifying-0.025
igsaw-0.023
years-0.022
chie-0.021
jamin-0.020
Tokent
Token ID83
Feature activation-9.71e-03

Pos logit contributions
 idea+0.019
 doctors+0.016
 bal+0.016
crow+0.016
 cows+0.016
Neg logit contributions
ifying-0.014
igsaw-0.011
years-0.011
chie-0.011
run-0.009
Token small
Token ID1402
Feature activation+2.68e-03

Pos logit contributions
 idea+0.002
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.003
 hips-0.002
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token Every
Token ID3887
Feature activation-1.80e-02

Pos logit contributions
 idea+0.019
 bal+0.016
 doctors+0.015
crow+0.015
cut+0.015
Neg logit contributions
ifying-0.021
igsaw-0.018
years-0.017
chie-0.016
jamin-0.016
Token day
Token ID1110
Feature activation-4.08e-02

Pos logit contributions
 idea+0.011
 doctors+0.009
 paramedics+0.009
ighters+0.009
 bal+0.009
Neg logit contributions
ifying-0.045
igsaw-0.041
jamin-0.038
chie-0.038
years-0.038
Token the
Token ID262
Feature activation-8.94e-03

Pos logit contributions
 idea+0.078
 impression+0.067
cut+0.066
ighters+0.065
 paramedics+0.065
Neg logit contributions
ifying-0.060
 eight-0.040
years-0.040
 total-0.035
 Alex-0.035
Token piece
Token ID3704
Feature activation-7.67e-02

Pos logit contributions
 idea+0.013
crow+0.011
 bal+0.010
 paramedics+0.010
 doctors+0.010
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.014
chie-0.013
 hips-0.013
Token went
Token ID1816
Feature activation+2.26e-04

Pos logit contributions
 idea+0.110
ubs+0.103
 paramedics+0.101
 Vol+0.101
 impression+0.100
Neg logit contributions
ifying-0.097
 nowhere-0.089
ll-0.082
 seeking-0.080
years-0.079
Token on
Token ID319
Feature activation-4.84e-04

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
 hips+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token adventures
Token ID17545
Feature activation-7.44e-03

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
 idea+0.010
 bal+0.008
crow+0.008
 doctors+0.008
 paramedics+0.008
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token looking
Token ID2045
Feature activation-2.00e-02

Pos logit contributions
 idea+0.042
crow+0.037
 bal+0.036
 paramedics+0.034
 doctors+0.033
Neg logit contributions
ifying-0.038
years-0.032
 seeking-0.030
 hips-0.030
chie-0.030
Token From
Token ID3574
Feature activation-9.95e-03

Pos logit contributions
 idea+0.022
crow+0.018
 bal+0.018
 doctors+0.018
 pilot+0.018
Neg logit contributions
ifying-0.020
igsaw-0.017
years-0.017
run-0.015
jamin-0.015
Token then
Token ID788
Feature activation-2.31e-02

Pos logit contributions
 idea+0.016
 bal+0.013
crow+0.013
 understand+0.012
 doctors+0.012
Neg logit contributions
ifying-0.017
years-0.014
igsaw-0.014
chie-0.013
jamin-0.013
Token on
Token ID319
Feature activation-4.41e-03

Pos logit contributions
 idea+0.037
 cows+0.030
 doctors+0.030
crow+0.030
 paramedics+0.030
Neg logit contributions
ifying-0.040
years-0.032
igsaw-0.032
chie-0.030
run-0.029
Token,
Token ID11
Feature activation-2.79e-02

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.007
chie-0.007
 hips-0.007
Token Tim
Token ID5045
Feature activation-7.19e-03

Pos logit contributions
 idea+0.048
crow+0.044
 pilot+0.040
ighters+0.040
 paramedics+0.039
Neg logit contributions
ifying-0.039
years-0.033
igsaw-0.033
�-0.032
run-0.029
Tokenmy
Token ID1820
Feature activation-7.63e-02

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.013
 cows+0.013
 doctors+0.013
Neg logit contributions
ifying-0.008
years-0.006
igsaw-0.006
chie-0.006
jamin-0.006
Token never
Token ID1239
Feature activation+1.39e-02

Pos logit contributions
 idea+0.127
 paramedics+0.116
crow+0.109
 impression+0.108
 hurricane+0.107
Neg logit contributions
ll-0.094
years-0.087
jamin-0.082
ifying-0.078
 seeking-0.077
Token played
Token ID2826
Feature activation-2.27e-02

Pos logit contributions
ifying+0.028
igsaw+0.023
years+0.022
chie+0.022
 hips+0.020
Neg logit contributions
 idea-0.020
 bal-0.017
 doctors-0.016
crow-0.016
 understand-0.016
Token with
Token ID351
Feature activation-6.05e-04

Pos logit contributions
 idea+0.032
crow+0.028
 bal+0.027
cut+0.026
 cows+0.026
Neg logit contributions
ifying-0.039
igsaw-0.034
chie-0.031
years-0.031
 seeking-0.031
Token ropes
Token ID31001
Feature activation-1.60e-02

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token again
Token ID757
Feature activation+1.24e-02

Pos logit contributions
 idea+0.027
crow+0.022
 cows+0.022
 doctors+0.022
 bal+0.021
Neg logit contributions
ifying-0.027
igsaw-0.022
years-0.021
chie-0.020
 hips-0.019
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
 idea+0.031
crow+0.025
 doctors+0.024
 paramedics+0.024
 bal+0.023
Neg logit contributions
ifying-0.041
igsaw-0.031
years-0.030
chie-0.029
run-0.028
Token Every
Token ID3887
Feature activation-1.28e-02

Pos logit contributions
 idea+0.029
 bal+0.025
crow+0.024
cut+0.024
 doctors+0.023
Neg logit contributions
ifying-0.034
igsaw-0.030
years-0.027
 hips-0.027
jamin-0.027
Token day
Token ID1110
Feature activation-3.49e-02

Pos logit contributions
 idea+0.008
crow+0.005
 bal+0.005
 doctors+0.005
 paramedics+0.005
Neg logit contributions
ifying-0.033
igsaw-0.030
years-0.028
chie-0.028
jamin-0.028
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 idea+0.067
cut+0.058
 impression+0.058
 paramedics+0.056
 understand+0.056
Neg logit contributions
ifying-0.051
 eight-0.035
years-0.035
igsaw-0.034
 total-0.031
Token the
Token ID262
Feature activation-3.69e-02

Pos logit contributions
 idea+0.023
crow+0.019
 bal+0.019
cut+0.017
 understand+0.017
Neg logit contributions
ifying-0.032
igsaw-0.026
years-0.026
chie-0.024
 hips-0.023
Token clock
Token ID8801
Feature activation-7.59e-02

Pos logit contributions
crow+0.052
 idea+0.050
 bal+0.047
ook+0.045
ger+0.042
Neg logit contributions
ifying-0.062
igsaw-0.050
years-0.049
 hips-0.049
chie-0.046
Token would
Token ID561
Feature activation-3.04e-02

Pos logit contributions
 idea+0.139
ubs+0.135
ooks+0.130
ook+0.127
 impression+0.127
Neg logit contributions
ifying-0.077
 bit-0.066
work-0.063
 looked-0.061
fly-0.058
Token tick
Token ID4378
Feature activation-5.80e-03

Pos logit contributions
 idea+0.043
crow+0.041
 bal+0.038
 doctors+0.038
 cows+0.037
Neg logit contributions
ifying-0.054
chie-0.043
 hips-0.041
igsaw-0.041
years-0.040
Token-
Token ID12
Feature activation-6.04e-03

Pos logit contributions
 idea+0.008
crow+0.006
 bal+0.006
 doctors+0.006
 paramedics+0.006
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.009
jamin-0.009
Tokent
Token ID83
Feature activation-1.65e-02

Pos logit contributions
 idea+0.010
 bal+0.008
 doctors+0.008
crow+0.008
 cows+0.007
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Tokenock
Token ID735
Feature activation-7.61e-03

Pos logit contributions
 idea+0.024
 bal+0.019
 doctors+0.018
crow+0.018
 paramedics+0.018
Neg logit contributions
ifying-0.031
years-0.026
igsaw-0.026
chie-0.025
jamin-0.023
Token she
Token ID673
Feature activation-2.32e-02

Pos logit contributions
 idea+0.078
ans+0.073
crow+0.072
 paramedics+0.072
cut+0.071
Neg logit contributions
ifying-0.045
years-0.040
 seeking-0.037
 nowhere-0.037
 Rose-0.032
Token didn
Token ID1422
Feature activation+7.53e-03

Pos logit contributions
 idea+0.035
crow+0.030
 cows+0.028
 emergency+0.028
 bal+0.028
Neg logit contributions
ifying-0.039
years-0.035
jamin-0.032
chie-0.031
 hips-0.031
Token't
Token ID470
Feature activation-2.44e-02

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.014
 bal-0.012
crow-0.012
 doctors-0.012
 people-0.012
Token want
Token ID765
Feature activation-2.54e-02

Pos logit contributions
 idea+0.030
 cows+0.026
 paramedics+0.025
cut+0.025
crow+0.025
Neg logit contributions
ifying-0.048
chie-0.041
igsaw-0.039
years-0.038
 hips-0.036
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
 idea+0.045
cut+0.041
 paramedics+0.038
 pilot+0.037
 bal+0.037
Neg logit contributions
ifying-0.039
years-0.032
igsaw-0.030
 seeking-0.028
run-0.028
Token let
Token ID1309
Feature activation-7.55e-02

Pos logit contributions
 idea+0.022
crow+0.019
 cows+0.019
cut+0.019
 bal+0.018
Neg logit contributions
ifying-0.026
igsaw-0.020
chie-0.020
years-0.020
 hips-0.019
Token it
Token ID340
Feature activation-5.27e-03

Pos logit contributions
cut+0.143
ooks+0.121
aver+0.120
 paramedics+0.119
 idea+0.116
Neg logit contributions
 Rose-0.083
ifying-0.074
 sigh-0.066
 Emily-0.066
 Ruby-0.065
Token go
Token ID467
Feature activation+4.18e-03

Pos logit contributions
 idea+0.006
crow+0.005
 doctors+0.005
 bal+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.009
chie-0.009
years-0.009
 hips-0.008
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token She
Token ID1375
Feature activation-2.03e-02

Pos logit contributions
 idea+0.029
cut+0.026
crow+0.025
 pilot+0.024
 bal+0.024
Neg logit contributions
ifying-0.030
igsaw-0.026
years-0.025
 hips-0.024
jamin-0.023
Token held
Token ID2714
Feature activation-1.60e-02

Pos logit contributions
 idea+0.033
crow+0.030
 bal+0.028
cut+0.028
 emergency+0.027
Neg logit contributions
ifying-0.031
years-0.027
jamin-0.025
 hips-0.025
chie-0.024
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
 idea+0.029
 bal+0.025
 doctors+0.025
crow+0.024
 paramedics+0.024
Neg logit contributions
ifying-0.029
igsaw-0.025
years-0.024
chie-0.023
run-0.022
TokenEvery
Token ID6109
Feature activation-7.85e-03

Pos logit contributions
 idea+0.030
 bal+0.026
crow+0.025
 doctors+0.025
 paramedics+0.024
Neg logit contributions
ifying-0.020
years-0.016
igsaw-0.016
chie-0.015
jamin-0.014
Token day
Token ID1110
Feature activation-3.19e-02

Pos logit contributions
 idea+0.005
 bal+0.003
crow+0.003
 doctors+0.003
 cows+0.002
Neg logit contributions
ifying-0.021
igsaw-0.019
years-0.018
chie-0.018
jamin-0.018
Token,
Token ID11
Feature activation+3.71e-03

Pos logit contributions
 idea+0.056
ighters+0.047
cut+0.047
 itch+0.047
 understand+0.046
Neg logit contributions
ifying-0.051
years-0.035
igsaw-0.034
 eight-0.033
 Alex-0.032
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token painter
Token ID34537
Feature activation-7.54e-02

Pos logit contributions
 idea+0.017
crow+0.015
 bal+0.014
 cows+0.013
 paramedics+0.013
Neg logit contributions
ifying-0.031
igsaw-0.028
years-0.027
chie-0.026
 hips-0.024
Token liked
Token ID8288
Feature activation-6.14e-03

Pos logit contributions
 idea+0.135
 paramedics+0.120
ubs+0.118
 impression+0.117
 People+0.117
Neg logit contributions
ifying-0.083
years-0.071
jamin-0.071
 bit-0.069
ll-0.068
Token to
Token ID284
Feature activation-1.61e-02

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 understand+0.009
 doctors+0.009
Neg logit contributions
ifying-0.010
igsaw-0.008
years-0.008
chie-0.007
 hips-0.007
Token play
Token ID711
Feature activation-1.90e-02

Pos logit contributions
 idea+0.022
crow+0.020
 bal+0.019
 cows+0.019
 doctors+0.018
Neg logit contributions
ifying-0.029
igsaw-0.025
chie-0.024
years-0.023
 hips-0.023
Token with
Token ID351
Feature activation-1.04e-02

Pos logit contributions
 idea+0.027
crow+0.025
 bal+0.024
 paramedics+0.024
 cows+0.023
Neg logit contributions
ifying-0.031
igsaw-0.028
years-0.026
chie-0.025
 seeking-0.025
Token children
Token ID1751
Feature activation-6.55e-03

Pos logit contributions
 idea+0.016
crow+0.013
 bal+0.013
 doctors+0.012
 paramedics+0.012
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.016
chie-0.015
 hips-0.014
Token It
Token ID632
Feature activation-8.44e-03

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.012
 doctors+0.012
cut+0.012
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.015
chie-0.014
jamin-0.014
Token made
Token ID925
Feature activation-3.61e-02

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
 doctors+0.012
 cows+0.012
Neg logit contributions
ifying-0.013
years-0.010
igsaw-0.010
chie-0.010
jamin-0.009
Token loud
Token ID7812
Feature activation-1.04e-02

Pos logit contributions
 idea+0.062
crow+0.060
 bal+0.060
cut+0.057
 bidder+0.057
Neg logit contributions
ifying-0.044
years-0.040
igsaw-0.035
chie-0.035
run-0.032
Token noises
Token ID26782
Feature activation-3.24e-02

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.012
 doctors+0.011
 paramedics+0.011
Neg logit contributions
ifying-0.020
igsaw-0.017
years-0.016
chie-0.016
 hips-0.015
Token and
Token ID290
Feature activation-1.24e-02

Pos logit contributions
 idea+0.044
crow+0.038
 paramedics+0.038
 bidder+0.036
 bal+0.036
Neg logit contributions
ifying-0.059
years-0.048
 seeking-0.045
chie-0.043
run-0.043
Token let
Token ID1309
Feature activation-7.53e-02

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.014
 paramedics+0.014
Neg logit contributions
ifying-0.023
years-0.019
igsaw-0.018
chie-0.018
 hips-0.017
Token out
Token ID503
Feature activation+1.48e-02

Pos logit contributions
cut+0.109
 idea+0.107
ooks+0.106
 bal+0.105
 impression+0.103
Neg logit contributions
ifying-0.096
chie-0.078
 looks-0.075
of-0.071
 eight-0.071
Token smoke
Token ID7523
Feature activation-2.87e-02

Pos logit contributions
ifying+0.029
igsaw+0.024
years+0.023
chie+0.023
jamin+0.022
Neg logit contributions
 idea-0.022
 people-0.018
 doctors-0.018
 bal-0.018
crow-0.018
Token when
Token ID618
Feature activation-1.45e-02

Pos logit contributions
 idea+0.041
 bal+0.035
crow+0.035
 doctors+0.033
 pilot+0.032
Neg logit contributions
ifying-0.053
chie-0.040
 hips-0.039
years-0.038
igsaw-0.038
Token it
Token ID340
Feature activation-1.75e-02

Pos logit contributions
 idea+0.015
 bal+0.012
crow+0.012
cut+0.011
 understand+0.010
Neg logit contributions
ifying-0.033
igsaw-0.027
years-0.027
chie-0.027
 hips-0.026
Token zoom
Token ID19792
Feature activation-1.05e-02

Pos logit contributions
 idea+0.027
 bal+0.023
crow+0.023
 understand+0.022
 doctors+0.022
Neg logit contributions
ifying-0.031
chie-0.023
years-0.023
igsaw-0.022
run-0.022
Token gate
Token ID8946
Feature activation-7.45e-03

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 idea+0.009
 bal+0.008
crow+0.008
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.012
 hips-0.012
Token She
Token ID1375
Feature activation+2.61e-02

Pos logit contributions
 idea+0.030
 bal+0.025
cut+0.024
crow+0.024
 doctors+0.024
Neg logit contributions
ifying-0.028
igsaw-0.023
years-0.021
�-0.020
run-0.019
Token smiled
Token ID13541
Feature activation-4.56e-03

Pos logit contributions
ifying+0.057
igsaw+0.051
chie+0.046
years+0.044
 hips+0.041
Neg logit contributions
 idea-0.034
 doctors-0.028
 people-0.027
 cows-0.027
 bal-0.027
Token and
Token ID290
Feature activation+2.88e-02

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
cut+0.004
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.008
chie-0.007
jamin-0.007
Token let
Token ID1309
Feature activation-7.49e-02

Pos logit contributions
ifying+0.063
igsaw+0.056
chie+0.053
years+0.050
 hips+0.045
Neg logit contributions
 idea-0.036
 bal-0.031
 doctors-0.030
 people-0.030
 cows-0.030
Token them
Token ID606
Feature activation-5.91e-03

Pos logit contributions
cut+0.117
 bal+0.111
 rarely+0.110
ooks+0.109
 idea+0.109
Neg logit contributions
ifying-0.079
of-0.073
 first-0.071
 Rose-0.070
 sigh-0.063
Token in
Token ID287
Feature activation+4.74e-03

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.005
cut+0.005
Neg logit contributions
ifying-0.013
igsaw-0.011
chie-0.010
years-0.010
 hips-0.010
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token They
Token ID1119
Feature activation+1.03e-02

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
cut+0.014
 doctors+0.014
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
�-0.014
 hips-0.014
Token saw
Token ID2497
Feature activation-2.17e-02

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.015
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.012
Token that
Token ID326
Feature activation-2.50e-02

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.006
jamin+0.005
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token day
Token ID1110
Feature activation-8.30e-03

Pos logit contributions
 idea+0.020
crow+0.018
ighters+0.018
 paramedics+0.018
aw+0.017
Neg logit contributions
ifying-0.056
igsaw-0.053
jamin-0.050
years-0.050
chie-0.047
Token on
Token ID319
Feature activation+1.78e-04

Pos logit contributions
 idea+0.012
crow+0.009
 doctors+0.009
 bal+0.009
 cows+0.009
Neg logit contributions
ifying-0.017
years-0.014
igsaw-0.013
chie-0.013
 hips-0.012
Token,
Token ID11
Feature activation-2.55e-02

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
 hips+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token Tim
Token ID5045
Feature activation-1.53e-02

Pos logit contributions
 idea+0.043
crow+0.041
 paramedics+0.036
ighters+0.036
aw+0.036
Neg logit contributions
ifying-0.037
years-0.031
igsaw-0.030
�-0.028
 total-0.026
Tokenmy
Token ID1820
Feature activation-7.49e-02

Pos logit contributions
 idea+0.034
crow+0.031
 bal+0.030
 paramedics+0.030
 cows+0.030
Neg logit contributions
ifying-0.015
years-0.011
jamin-0.011
igsaw-0.011
chie-0.010
Token learned
Token ID4499
Feature activation+8.53e-03

Pos logit contributions
 idea+0.132
 paramedics+0.126
crow+0.122
 hurricane+0.117
adders+0.116
Neg logit contributions
ll-0.080
jamin-0.074
years-0.070
ifying-0.065
 seeking-0.064
Token to
Token ID284
Feature activation-3.79e-03

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.014
chie+0.014
jamin+0.013
Neg logit contributions
 idea-0.012
 bal-0.009
 doctors-0.009
crow-0.009
 people-0.009
Token be
Token ID307
Feature activation-1.65e-02

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.007
igsaw-0.006
years-0.006
chie-0.006
jamin-0.006
Token more
Token ID517
Feature activation-3.32e-02

Pos logit contributions
 idea+0.024
crow+0.021
 cows+0.020
 bal+0.020
cut+0.019
Neg logit contributions
ifying-0.029
years-0.026
chie-0.024
igsaw-0.024
run-0.022
Token careful
Token ID8161
Feature activation-2.88e-02

Pos logit contributions
 idea+0.049
crow+0.047
 cows+0.044
 bidder+0.043
 People+0.043
Neg logit contributions
ifying-0.049
years-0.047
run-0.043
igsaw-0.043
ll-0.041
Token She
Token ID1375
Feature activation+7.97e-03

Pos logit contributions
ifying+0.032
igsaw+0.026
years+0.026
chie+0.026
 hips+0.024
Neg logit contributions
 idea-0.019
crow-0.016
 bal-0.015
 doctors-0.015
 people-0.015
Token opened
Token ID4721
Feature activation-1.25e-02

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.013
chie+0.013
 hips+0.012
Neg logit contributions
 idea-0.011
 bal-0.009
 doctors-0.009
crow-0.009
 cows-0.009
Token the
Token ID262
Feature activation+9.28e-03

Pos logit contributions
 idea+0.019
 bal+0.016
crow+0.016
 doctors+0.016
 paramedics+0.015
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.018
chie-0.017
 hips-0.017
Token door
Token ID3420
Feature activation+1.53e-02

Pos logit contributions
ifying+0.019
igsaw+0.016
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.013
crow-0.010
 doctors-0.010
 bal-0.010
 cows-0.010
Token and
Token ID290
Feature activation-4.52e-03

Pos logit contributions
ifying+0.036
igsaw+0.031
chie+0.031
years+0.031
jamin+0.029
Neg logit contributions
 idea-0.016
 bal-0.012
crow-0.011
 people-0.011
 doctors-0.011
Token let
Token ID1309
Feature activation-7.42e-02

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
years-0.007
igsaw-0.007
chie-0.006
jamin-0.006
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
ooks+0.142
 bal+0.136
cut+0.135
ained+0.135
 idea+0.133
Neg logit contributions
ifying-0.047
 Ruby-0.042
of-0.040
 Rose-0.040
 looks-0.039
Token bird
Token ID6512
Feature activation-6.62e-03

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.011
 doctors+0.011
cut+0.011
Neg logit contributions
ifying-0.025
chie-0.022
igsaw-0.021
years-0.021
 seeking-0.019
Token out
Token ID503
Feature activation+1.46e-02

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.005
Neg logit contributions
ifying-0.015
igsaw-0.012
chie-0.012
years-0.012
 hips-0.011
Token.
Token ID13
Feature activation-2.72e-03

Pos logit contributions
ifying+0.035
igsaw+0.031
years+0.030
chie+0.029
jamin+0.028
Neg logit contributions
 idea-0.015
 bal-0.011
crow-0.011
 people-0.010
 doctors-0.010
Token The
Token ID383
Feature activation-1.72e-02

Pos logit contributions
 idea+0.004
 bal+0.004
crow+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
 hips-0.004
Token was
Token ID373
Feature activation-1.20e-02

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token all
Token ID477
Feature activation+9.59e-05

Pos logit contributions
 idea+0.017
crow+0.015
 bal+0.014
cut+0.014
 cows+0.013
Neg logit contributions
ifying-0.023
years-0.019
chie-0.018
igsaw-0.018
 hips-0.017
Token dry
Token ID5894
Feature activation+3.44e-03

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token when
Token ID618
Feature activation-1.00e-02

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token she
Token ID673
Feature activation+4.85e-03

Pos logit contributions
 idea+0.011
crow+0.008
 bal+0.008
cut+0.008
 paramedics+0.007
Neg logit contributions
ifying-0.023
years-0.020
igsaw-0.019
chie-0.018
 hips-0.018
Token let
Token ID1309
Feature activation-7.38e-02

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token go
Token ID467
Feature activation+1.62e-02

Pos logit contributions
cut+0.121
ooks+0.109
 idea+0.108
amb+0.103
ook+0.103
Neg logit contributions
ifying-0.087
 Rose-0.074
 sigh-0.072
 Alex-0.068
 Emily-0.065
Token!
Token ID0
Feature activation-1.19e-03

Pos logit contributions
ifying+0.038
igsaw+0.033
years+0.033
chie+0.032
jamin+0.031
Neg logit contributions
 idea-0.017
 bal-0.013
crow-0.012
 doctors-0.012
 people-0.012
Token She
Token ID1375
Feature activation-8.20e-03

Pos logit contributions
 idea+0.002
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token put
Token ID1234
Feature activation-1.52e-02

Pos logit contributions
 idea+0.014
crow+0.012
 bal+0.011
 doctors+0.011
 cows+0.011
Neg logit contributions
ifying-0.014
years-0.011
igsaw-0.011
chie-0.010
jamin-0.010
Token on
Token ID319
Feature activation-8.04e-03

Pos logit contributions
 idea+0.019
crow+0.014
 bal+0.014
 cows+0.014
cut+0.014
Neg logit contributions
ifying-0.032
years-0.027
igsaw-0.026
chie-0.025
 hips-0.025
Token their
Token ID511
Feature activation+3.80e-03

Pos logit contributions
 idea+0.041
 bal+0.039
 pilot+0.037
 ending+0.036
crow+0.036
Neg logit contributions
ifying-0.055
igsaw-0.044
 hips-0.042
years-0.041
chie-0.039
Token food
Token ID2057
Feature activation-7.90e-03

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token on
Token ID319
Feature activation-2.14e-02

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.010
 doctors+0.010
 paramedics+0.010
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.011
chie-0.011
 hips-0.011
Token it
Token ID340
Feature activation-2.11e-02

Pos logit contributions
 idea+0.031
 bal+0.028
crow+0.027
 doctors+0.026
 paramedics+0.026
Neg logit contributions
ifying-0.041
igsaw-0.033
 hips-0.030
years-0.029
chie-0.029
Token and
Token ID290
Feature activation-2.17e-02

Pos logit contributions
 idea+0.029
crow+0.026
 paramedics+0.026
 doctors+0.025
 bal+0.025
Neg logit contributions
ifying-0.040
igsaw-0.033
 hips-0.031
years-0.031
chie-0.030
Token let
Token ID1309
Feature activation-7.34e-02

Pos logit contributions
 idea+0.031
crow+0.027
 bal+0.026
 doctors+0.024
 paramedics+0.023
Neg logit contributions
ifying-0.041
igsaw-0.032
years-0.031
 hips-0.031
jamin-0.030
Token it
Token ID340
Feature activation-5.30e-03

Pos logit contributions
cut+0.116
 bal+0.106
ooks+0.100
aver+0.099
 idea+0.098
Neg logit contributions
ifying-0.085
 looks-0.072
igsaw-0.071
 Alex-0.070
chie-0.065
Token cook
Token ID4255
Feature activation-3.10e-02

Pos logit contributions
 idea+0.006
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.012
igsaw-0.010
chie-0.010
years-0.010
 hips-0.009
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 idea+0.044
crow+0.043
 bal+0.041
 pilot+0.040
 paramedics+0.039
Neg logit contributions
ifying-0.055
igsaw-0.043
 hips-0.041
years-0.040
chie-0.037
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
 idea+0.034
 bal+0.030
 doctors+0.029
cut+0.028
crow+0.028
Neg logit contributions
ifying-0.037
igsaw-0.031
�-0.028
years-0.027
jamin-0.027
Token
Token ID198
Feature activation-6.59e-03

Pos logit contributions
 idea+0.015
 bal+0.013
 doctors+0.013
crow+0.012
 cows+0.012
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.015
chie-0.014
jamin-0.014
Token calm
Token ID9480
Feature activation-5.83e-03

Pos logit contributions
 idea+0.017
crow+0.016
 bal+0.015
 doctors+0.015
 cows+0.014
Neg logit contributions
ifying-0.022
igsaw-0.018
years-0.018
chie-0.017
jamin-0.016
Token and
Token ID290
Feature activation-6.14e-03

Pos logit contributions
 idea+0.008
crow+0.006
 bal+0.006
 doctors+0.006
 understand+0.006
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.009
chie-0.009
jamin-0.009
Token peaceful
Token ID12309
Feature activation-2.92e-03

Pos logit contributions
 idea+0.009
crow+0.007
 bal+0.007
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.012
years-0.010
igsaw-0.010
chie-0.009
jamin-0.009
Token.
Token ID13
Feature activation-8.13e-04

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token They
Token ID1119
Feature activation-5.08e-03

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token let
Token ID1309
Feature activation-7.30e-02

Pos logit contributions
 idea+0.008
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.010
years-0.008
igsaw-0.008
chie-0.008
jamin-0.007
Token their
Token ID511
Feature activation+1.36e-02

Pos logit contributions
cut+0.129
 idea+0.129
 clim+0.110
adders+0.109
 bidder+0.109
Neg logit contributions
ifying-0.082
jamin-0.066
of-0.062
 Bear-0.059
 Alex-0.056
Token little
Token ID1310
Feature activation-1.49e-02

Pos logit contributions
ifying+0.033
igsaw+0.030
years+0.029
chie+0.028
jamin+0.027
Neg logit contributions
 idea-0.013
 doctors-0.009
 people-0.009
 bal-0.009
 cows-0.009
Token dog
Token ID3290
Feature activation-6.97e-03

Pos logit contributions
 idea+0.023
crow+0.020
 bal+0.019
 doctors+0.019
cut+0.018
Neg logit contributions
ifying-0.026
igsaw-0.021
years-0.020
chie-0.020
jamin-0.019
Token run
Token ID1057
Feature activation+5.35e-03

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 doctors+0.008
cut+0.007
Neg logit contributions
ifying-0.014
igsaw-0.011
years-0.011
chie-0.011
jamin-0.010
Token around
Token ID1088
Feature activation+4.50e-03

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.004
Token day
Token ID1110
Feature activation+8.42e-03

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
ifying+0.019
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token Freddy
Token ID45437
Feature activation-9.65e-03

Pos logit contributions
ifying+0.027
igsaw+0.022
years+0.021
chie+0.021
jamin+0.019
Neg logit contributions
 idea-0.024
 bal-0.020
crow-0.020
 people-0.019
 doctors-0.019
Token went
Token ID1816
Feature activation+2.58e-02

Pos logit contributions
 idea+0.015
 bal+0.012
crow+0.012
 doctors+0.012
 paramedics+0.012
Neg logit contributions
ifying-0.017
years-0.014
igsaw-0.014
jamin-0.013
chie-0.013
Token for
Token ID329
Feature activation+2.86e-02

Pos logit contributions
ifying+0.064
igsaw+0.054
chie+0.053
years+0.052
jamin+0.049
Neg logit contributions
 idea-0.028
 doctors-0.021
crow-0.020
 people-0.020
 cows-0.020
Token a
Token ID257
Feature activation+4.59e-02

Pos logit contributions
ifying+0.067
igsaw+0.055
chie+0.054
years+0.052
jamin+0.050
Neg logit contributions
 idea-0.036
 doctors-0.032
 people-0.030
 time-0.029
 cows-0.029
Token swim
Token ID9422
Feature activation+2.09e-02

Pos logit contributions
ifying+0.106
years+0.091
igsaw+0.084
jamin+0.083
chie+0.082
Neg logit contributions
 idea-0.058
 day-0.047
 time-0.046
 bal-0.045
 understand-0.043
Token in
Token ID287
Feature activation+8.91e-03

Pos logit contributions
ifying+0.046
igsaw+0.041
years+0.040
chie+0.038
jamin+0.036
Neg logit contributions
 idea-0.026
crow-0.020
 bal-0.020
 people-0.019
 thing-0.019
Token the
Token ID262
Feature activation+2.06e-02

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.011
crow-0.008
 doctors-0.008
 bal-0.008
 people-0.008
Token swamp
Token ID26837
Feature activation+8.59e-03

Pos logit contributions
ifying+0.045
igsaw+0.040
years+0.038
chie+0.037
run+0.036
Neg logit contributions
 idea-0.026
crow-0.021
 people-0.021
 doctors-0.020
 hospital-0.020
Token and
Token ID290
Feature activation+3.13e-03

Pos logit contributions
ifying+0.019
igsaw+0.016
years+0.016
chie+0.015
jamin+0.015
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.008
 doctors-0.008
 people-0.008
Token!"
Token ID2474
Feature activation-4.86e-03

Pos logit contributions
crow+0.048
 idea+0.046
 bal+0.045
 paramedics+0.045
 doctors+0.044
Neg logit contributions
ifying-0.040
igsaw-0.030
 hips-0.030
chie-0.029
 seeking-0.027
Token Sue
Token ID26089
Feature activation-1.00e-02

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.006
 hips-0.006
Token says
Token ID1139
Feature activation+2.86e-03

Pos logit contributions
 idea+0.014
 doctors+0.012
crow+0.011
 bal+0.011
 cows+0.011
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
jamin-0.015
chie-0.015
Token.
Token ID13
Feature activation-4.11e-03

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
 hips+0.005
Neg logit contributions
 idea-0.003
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.002
Token "
Token ID366
Feature activation-7.28e-03

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
cut+0.004
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.006
chie-0.006
 hips-0.006
TokenDo
Token ID5211
Feature activation+4.46e-02

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.010
 doctors+0.010
 cows+0.009
Neg logit contributions
ifying-0.013
years-0.010
igsaw-0.010
chie-0.010
jamin-0.009
Token you
Token ID345
Feature activation-2.01e-02

Pos logit contributions
ifying+0.108
igsaw+0.092
oured+0.090
years+0.090
jamin+0.085
Neg logit contributions
 idea-0.055
 people-0.052
 time-0.047
 day-0.044
 it-0.043
Token think
Token ID892
Feature activation-1.09e-02

Pos logit contributions
 idea+0.023
 cows+0.021
 doctors+0.021
cut+0.021
crow+0.021
Neg logit contributions
ifying-0.041
chie-0.034
years-0.034
igsaw-0.033
 hips-0.032
Token it
Token ID340
Feature activation+4.73e-03

Pos logit contributions
 idea+0.013
crow+0.010
 bal+0.009
 doctors+0.009
cut+0.009
Neg logit contributions
ifying-0.024
igsaw-0.020
years-0.020
chie-0.019
run-0.019
Token has
Token ID468
Feature activation+3.23e-03

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token toys
Token ID14958
Feature activation+1.47e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token't
Token ID470
Feature activation+1.37e-02

Pos logit contributions
ifying+0.040
years+0.033
igsaw+0.032
chie+0.031
jamin+0.029
Neg logit contributions
 idea-0.040
 people-0.034
 bal-0.034
 doctors-0.034
crow-0.034
Token you
Token ID345
Feature activation+5.36e-03

Pos logit contributions
ifying+0.022
igsaw+0.018
years+0.018
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.025
 people-0.021
 bal-0.021
 doctors-0.021
crow-0.020
Token both
Token ID1111
Feature activation+3.38e-03

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
jamin+0.008
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 people-0.006
Token read
Token ID1100
Feature activation+1.48e-02

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token your
Token ID534
Feature activation+5.99e-03

Pos logit contributions
ifying+0.031
igsaw+0.026
years+0.026
chie+0.026
jamin+0.024
Neg logit contributions
 idea-0.019
 doctors-0.015
 bal-0.015
 people-0.015
 cows-0.015
Token books
Token ID3835
Feature activation+4.30e-02

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
ifying+0.095
years+0.085
chie+0.081
igsaw+0.081
jamin+0.074
Neg logit contributions
 idea-0.053
 bal-0.037
 day-0.037
 people-0.037
 whenever-0.037
Token then
Token ID788
Feature activation-4.84e-03

Pos logit contributions
ifying+0.029
igsaw+0.024
years+0.024
chie+0.024
jamin+0.022
Neg logit contributions
 idea-0.019
 doctors-0.015
 bal-0.015
 people-0.015
crow-0.015
Token tell
Token ID1560
Feature activation-4.30e-03

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.008
chie-0.007
 hips-0.007
Token each
Token ID1123
Feature activation-1.17e-02

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.004
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.007
 hips-0.006
Token other
Token ID584
Feature activation+9.37e-03

Pos logit contributions
 idea+0.012
 bal+0.011
crow+0.010
 doctors+0.009
cut+0.009
Neg logit contributions
ifying-0.025
igsaw-0.022
years-0.022
chie-0.021
jamin-0.020
Token
Token ID198
Feature activation+7.35e-03

Pos logit contributions
 idea+0.004
 bal+0.003
crow+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
TokenSam
Token ID16305
Feature activation+2.21e-02

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
 hips+0.010
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 people-0.009
Token says
Token ID1139
Feature activation+1.88e-02

Pos logit contributions
ifying+0.047
igsaw+0.038
years+0.038
chie+0.038
run+0.034
Neg logit contributions
 idea-0.031
 bal-0.024
 people-0.023
crow-0.023
 understand-0.023
Token,
Token ID11
Feature activation+5.64e-03

Pos logit contributions
ifying+0.048
igsaw+0.041
years+0.040
chie+0.039
jamin+0.038
Neg logit contributions
 idea-0.018
 bal-0.013
 people-0.012
crow-0.012
 doctors-0.011
Token "
Token ID366
Feature activation-1.57e-03

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 cows-0.005
TokenThank
Token ID10449
Feature activation+4.24e-02

Pos logit contributions
 idea+0.002
crow+0.002
 doctors+0.002
 bal+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token you
Token ID345
Feature activation+2.73e-02

Pos logit contributions
ifying+0.093
igsaw+0.079
oured+0.076
jamin+0.073
 hips+0.072
Neg logit contributions
 people-0.049
 idea-0.047
crow-0.045
 bal-0.044
 thing-0.042
Token,
Token ID11
Feature activation-8.77e-03

Pos logit contributions
ifying+0.068
igsaw+0.059
years+0.059
chie+0.057
oured+0.054
Neg logit contributions
 idea-0.027
 bal-0.018
 people-0.018
 doctors-0.018
 cows-0.016
Token Mom
Token ID11254
Feature activation+2.04e-02

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
 cows+0.008
Neg logit contributions
ifying-0.018
igsaw-0.016
years-0.015
chie-0.015
jamin-0.014
Token!
Token ID0
Feature activation+4.54e-03

Pos logit contributions
ifying+0.044
igsaw+0.037
years+0.037
chie+0.036
 hips+0.033
Neg logit contributions
 idea-0.027
 bal-0.021
 doctors-0.020
 people-0.020
crow-0.020
Token I
Token ID314
Feature activation-3.38e-03

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
TokenBut
Token ID1537
Feature activation-2.76e-02

Pos logit contributions
 idea+0.045
crow+0.040
 crops+0.038
cut+0.037
 bal+0.037
Neg logit contributions
ifying-0.033
igsaw-0.025
years-0.025
jamin-0.024
An-0.022
Token one
Token ID530
Feature activation-2.13e-02

Pos logit contributions
 idea+0.044
crow+0.038
 understand+0.036
 bal+0.035
ighters+0.035
Neg logit contributions
ifying-0.047
years-0.038
igsaw-0.036
 hips-0.035
jamin-0.034
Token day
Token ID1110
Feature activation-1.64e-02

Pos logit contributions
 idea+0.015
ighters+0.012
 cows+0.012
crow+0.012
 paramedics+0.012
Neg logit contributions
ifying-0.052
igsaw-0.047
years-0.046
jamin-0.043
run-0.042
Token,
Token ID11
Feature activation-1.67e-03

Pos logit contributions
 idea+0.025
cut+0.021
crow+0.020
 cows+0.019
 paramedics+0.019
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.024
jamin-0.022
chie-0.022
Token Lily
Token ID20037
Feature activation-1.91e-02

Pos logit contributions
 idea+0.003
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.002
 hips-0.002
Token couldn
Token ID3521
Feature activation+4.85e-02

Pos logit contributions
 idea+0.033
crow+0.029
cut+0.028
 bal+0.027
 cows+0.027
Neg logit contributions
ifying-0.030
years-0.023
jamin-0.023
igsaw-0.022
 hips-0.021
Token't
Token ID470
Feature activation-1.15e-02

Pos logit contributions
ifying+0.100
years+0.087
chie+0.080
igsaw+0.079
jamin+0.076
Neg logit contributions
 idea-0.064
 bal-0.055
 hospital-0.053
 People-0.052
 doctors-0.052
Token find
Token ID1064
Feature activation+2.89e-03

Pos logit contributions
 idea+0.015
crow+0.013
 cows+0.013
 bal+0.012
 doctors+0.012
Neg logit contributions
ifying-0.023
igsaw-0.019
chie-0.019
years-0.019
 hips-0.018
Token her
Token ID607
Feature activation-3.81e-03

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token necklace
Token ID38987
Feature activation-2.74e-03

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.007
igsaw-0.006
years-0.006
chie-0.006
 hips-0.006
Token.
Token ID13
Feature activation-1.70e-03

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
 hips-0.004
Token to
Token ID284
Feature activation-7.73e-03

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.010
 doctors+0.009
 cows+0.009
Neg logit contributions
ifying-0.012
years-0.009
chie-0.009
igsaw-0.009
 hips-0.008
Token see
Token ID766
Feature activation-1.70e-02

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 doctors+0.008
cut+0.008
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token the
Token ID262
Feature activation-8.75e-03

Pos logit contributions
 idea+0.028
 bal+0.023
crow+0.023
 ending+0.022
 paramedics+0.021
Neg logit contributions
ifying-0.029
years-0.024
igsaw-0.022
chie-0.021
run-0.021
Token crocod
Token ID37565
Feature activation+1.52e-02

Pos logit contributions
 idea+0.012
 bal+0.010
crow+0.009
 doctors+0.009
 understand+0.009
Neg logit contributions
ifying-0.018
igsaw-0.014
years-0.014
chie-0.014
 seeking-0.013
Tokenile
Token ID576
Feature activation+6.53e-03

Pos logit contributions
ifying+0.034
igsaw+0.030
years+0.029
chie+0.028
 hips+0.028
Neg logit contributions
 idea-0.017
crow-0.014
 bal-0.013
 doctors-0.013
 cows-0.013
Token up
Token ID510
Feature activation+2.73e-03

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token close
Token ID1969
Feature activation+2.40e-03

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token.
Token ID13
Feature activation-6.44e-03

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token
Token ID198
Feature activation-1.78e-02

Pos logit contributions
 idea+0.010
crow+0.009
 doctors+0.009
 bal+0.008
 pilot+0.008
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Token
Token ID198
Feature activation-2.04e-02

Pos logit contributions
 idea+0.028
 bal+0.025
 doctors+0.024
crow+0.024
 paramedics+0.023
Neg logit contributions
ifying-0.030
igsaw-0.025
years-0.023
run-0.022
chie-0.022
TokenAfter
Token ID3260
Feature activation+2.28e-02

Pos logit contributions
 idea+0.038
crow+0.032
 bal+0.031
ighters+0.031
 infection+0.030
Neg logit contributions
ifying-0.030
years-0.023
igsaw-0.022
lic-0.022
chie-0.021
Tokenelling
Token ID9417
Feature activation-1.96e-02

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.005
 bal-0.005
crow-0.004
 doctors-0.004
 cows-0.004
Token flowers
Token ID12734
Feature activation+2.38e-02

Pos logit contributions
 idea+0.023
 bal+0.017
crow+0.017
 understand+0.017
 doctors+0.016
Neg logit contributions
ifying-0.043
igsaw-0.036
years-0.034
chie-0.034
jamin-0.032
Token!
Token ID0
Feature activation+2.18e-02

Pos logit contributions
ifying+0.054
igsaw+0.048
years+0.047
chie+0.047
jamin+0.043
Neg logit contributions
 idea-0.026
 bal-0.019
 cows-0.019
 people-0.018
crow-0.018
Token
Token ID220
Feature activation-1.55e-02

Pos logit contributions
ifying+0.045
years+0.039
chie+0.038
igsaw+0.038
 hips+0.034
Neg logit contributions
 idea-0.028
crow-0.023
 bal-0.022
 cows-0.021
 People-0.021
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
 idea+0.024
 doctors+0.020
crow+0.020
 bal+0.020
 pilot+0.019
Neg logit contributions
ifying-0.028
igsaw-0.024
years-0.022
chie-0.022
run-0.021
Token
Token ID198
Feature activation+8.57e-03

Pos logit contributions
ifying+0.030
igsaw+0.026
years+0.025
chie+0.025
 hips+0.023
Neg logit contributions
 idea-0.016
 bal-0.013
crow-0.013
 cows-0.012
 understand-0.012
TokenFinally
Token ID11158
Feature activation+2.25e-04

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
 hips+0.010
Neg logit contributions
 idea-0.014
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.012
Token,
Token ID11
Feature activation+9.48e-03

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token Daisy
Token ID40355
Feature activation-2.60e-03

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.011
 people-0.011
Token and
Token ID290
Feature activation-2.51e-02

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token her
Token ID607
Feature activation-3.28e-03

Pos logit contributions
 idea+0.039
crow+0.034
ighters+0.034
 bal+0.033
cut+0.033
Neg logit contributions
ifying-0.039
igsaw-0.034
jamin-0.033
years-0.032
 hips-0.032
Token today
Token ID1909
Feature activation-1.74e-02

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token.
Token ID13
Feature activation-7.28e-03

Pos logit contributions
 idea+0.023
crow+0.019
 bal+0.019
 doctors+0.019
cut+0.018
Neg logit contributions
ifying-0.034
igsaw-0.028
years-0.027
chie-0.026
jamin-0.026
Token She
Token ID1375
Feature activation-5.50e-04

Pos logit contributions
 idea+0.011
 doctors+0.009
 bal+0.009
crow+0.009
cut+0.009
Neg logit contributions
ifying-0.013
igsaw-0.011
years-0.010
chie-0.010
jamin-0.010
Token had
Token ID550
Feature activation+2.81e-03

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token a
Token ID257
Feature activation+1.63e-02

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token new
Token ID649
Feature activation+3.11e-03

Pos logit contributions
ifying+0.034
years+0.029
igsaw+0.028
chie+0.028
 hips+0.026
Neg logit contributions
 idea-0.022
 bal-0.017
 doctors-0.017
 cows-0.016
 people-0.016
Token friend
Token ID1545
Feature activation+2.51e-03

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token to
Token ID284
Feature activation-1.08e-03

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.002
Token play
Token ID711
Feature activation+3.82e-04

Pos logit contributions
 idea+0.002
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token with
Token ID351
Feature activation+4.45e-03

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token.
Token ID13
Feature activation-6.65e-03

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token "
Token ID366
Feature activation+2.10e-04

Pos logit contributions
ifying+0.028
igsaw+0.025
years+0.024
chie+0.023
jamin+0.022
Neg logit contributions
 idea-0.015
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.011
TokenI
Token ID40
Feature activation-2.72e-02

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token'm
Token ID1101
Feature activation-1.25e-02

Pos logit contributions
 idea+0.043
 cows+0.041
 doctors+0.040
cut+0.040
 paramedics+0.039
Neg logit contributions
ifying-0.044
igsaw-0.033
chie-0.033
run-0.032
years-0.032
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
 idea+0.018
crow+0.015
 bal+0.015
cut+0.015
 understand+0.014
Neg logit contributions
ifying-0.023
years-0.020
igsaw-0.019
chie-0.019
 seeking-0.018
Token best
Token ID1266
Feature activation-1.63e-02

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.021
chie+0.020
 hips+0.019
Neg logit contributions
 idea-0.013
 bal-0.010
crow-0.010
 doctors-0.010
 people-0.010
Token slider
Token ID28982
Feature activation+1.95e-02

Pos logit contributions
 idea+0.020
cut+0.018
ighters+0.017
 cows+0.017
 doctors+0.017
Neg logit contributions
ifying-0.030
igsaw-0.028
chie-0.026
years-0.024
 seeking-0.024
Token here
Token ID994
Feature activation+1.26e-02

Pos logit contributions
ifying+0.039
igsaw+0.033
years+0.032
chie+0.030
jamin+0.030
Neg logit contributions
 idea-0.028
 bal-0.023
crow-0.022
 people-0.022
 doctors-0.021
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
ifying+0.024
igsaw+0.020
years+0.020
chie+0.019
jamin+0.018
Neg logit contributions
 idea-0.019
 bal-0.015
crow-0.015
 doctors-0.015
 people-0.015
Token the
Token ID262
Feature activation+4.59e-03

Pos logit contributions
 idea+0.016
crow+0.014
 bal+0.013
 doctors+0.013
cut+0.013
Neg logit contributions
ifying-0.018
igsaw-0.015
chie-0.014
years-0.014
jamin-0.013
Token judge
Token ID5052
Feature activation+3.31e-03

Pos logit contributions
ifying+0.010
igsaw+0.009
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.006
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token said
Token ID531
Feature activation-6.46e-04

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token bow
Token ID9563
Feature activation-2.06e-02

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token was
Token ID373
Feature activation-1.58e-02

Pos logit contributions
 idea+0.031
crow+0.026
 doctors+0.026
 bal+0.026
 cows+0.025
Neg logit contributions
ifying-0.036
chie-0.028
igsaw-0.028
years-0.028
 hips-0.027
Token so
Token ID523
Feature activation-9.47e-03

Pos logit contributions
 idea+0.027
crow+0.024
 bal+0.024
 doctors+0.023
 paramedics+0.022
Neg logit contributions
ifying-0.025
years-0.020
igsaw-0.019
chie-0.019
 hips-0.018
Token excited
Token ID6568
Feature activation-2.31e-03

Pos logit contributions
 idea+0.013
crow+0.011
 bal+0.011
 doctors+0.011
 cows+0.010
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
chie-0.015
jamin-0.014
Token.
Token ID13
Feature activation-7.12e-04

Pos logit contributions
 idea+0.003
 bal+0.003
crow+0.003
 doctors+0.003
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.004
years-0.004
chie-0.004
jamin-0.003
Token He
Token ID679
Feature activation+1.26e-02

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token wanted
Token ID2227
Feature activation-3.67e-03

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.020
chie+0.020
jamin+0.018
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.014
 doctors-0.014
 people-0.014
Token to
Token ID284
Feature activation-8.80e-03

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
cut+0.005
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.004
Token tell
Token ID1560
Feature activation-8.45e-03

Pos logit contributions
 idea+0.012
 bal+0.010
crow+0.010
 doctors+0.010
 cows+0.010
Neg logit contributions
ifying-0.017
igsaw-0.014
chie-0.014
years-0.013
 hips-0.013
Token everyone
Token ID2506
Feature activation+1.56e-02

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 cows+0.008
 doctors+0.008
Neg logit contributions
ifying-0.017
igsaw-0.015
years-0.014
chie-0.014
 hips-0.013
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
ifying+0.036
igsaw+0.030
years+0.030
chie+0.029
jamin+0.027
Neg logit contributions
 idea-0.018
 bal-0.013
crow-0.013
 people-0.013
 doctors-0.013
Token adding
Token ID4375
Feature activation-1.28e-03

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
 doctors+0.012
 paramedics+0.011
Neg logit contributions
ifying-0.018
years-0.014
igsaw-0.014
chie-0.013
 hips-0.013
Token new
Token ID649
Feature activation+8.97e-03

Pos logit contributions
 idea+0.002
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token ideas
Token ID4213
Feature activation+2.71e-03

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.018
chie+0.018
jamin+0.017
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token to
Token ID284
Feature activation+1.54e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token their
Token ID511
Feature activation+2.41e-03

Pos logit contributions
ifying+0.034
igsaw+0.028
years+0.028
chie+0.027
jamin+0.026
Neg logit contributions
 idea-0.019
 people-0.015
 doctors-0.015
crow-0.014
 understand-0.014
Token game
Token ID983
Feature activation-9.73e-03

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token.
Token ID13
Feature activation-1.37e-03

Pos logit contributions
 idea+0.013
 bal+0.010
crow+0.010
 doctors+0.010
 cows+0.009
Neg logit contributions
ifying-0.020
igsaw-0.017
years-0.016
chie-0.016
 hips-0.015
Token Summer
Token ID10216
Feature activation+9.49e-03

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 people+0.001
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token was
Token ID373
Feature activation-1.84e-02

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.014
chie+0.014
jamin+0.013
Neg logit contributions
 idea-0.014
crow-0.012
 bal-0.012
 doctors-0.011
 cows-0.011
Token the
Token ID262
Feature activation-2.04e-03

Pos logit contributions
 idea+0.031
crow+0.028
 bal+0.027
 doctors+0.026
cut+0.025
Neg logit contributions
ifying-0.030
years-0.024
igsaw-0.023
chie-0.023
jamin-0.021
Token best
Token ID1266
Feature activation-1.07e-02

Pos logit contributions
 idea+0.002
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.004
years-0.004
chie-0.004
jamin-0.003
Tokenum
Token ID388
Feature activation+6.37e-03

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 cows-0.009
Token,
Token ID11
Feature activation+2.97e-03

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.012
 hips+0.011
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token which
Token ID543
Feature activation-1.10e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token road
Token ID2975
Feature activation+1.24e-02

Pos logit contributions
 idea+0.017
 bal+0.015
 doctors+0.014
 understand+0.014
crow+0.014
Neg logit contributions
ifying-0.019
igsaw-0.016
chie-0.015
years-0.014
jamin-0.014
Token do
Token ID466
Feature activation+1.16e-02

Pos logit contributions
ifying+0.024
igsaw+0.020
years+0.020
chie+0.019
jamin+0.018
Neg logit contributions
 idea-0.018
crow-0.015
 doctors-0.015
 bal-0.015
 people-0.014
Token I
Token ID314
Feature activation-5.60e-03

Pos logit contributions
ifying+0.023
igsaw+0.020
years+0.019
chie+0.018
jamin+0.017
Neg logit contributions
 idea-0.017
 doctors-0.013
 people-0.013
 bal-0.013
crow-0.013
Token use
Token ID779
Feature activation-3.02e-03

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Token to
Token ID284
Feature activation-2.06e-02

Pos logit contributions
 idea+0.004
 bal+0.003
crow+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.004
Token get
Token ID651
Feature activation-1.60e-03

Pos logit contributions
 idea+0.027
 bal+0.024
crow+0.024
 doctors+0.022
 cows+0.022
Neg logit contributions
ifying-0.039
chie-0.033
igsaw-0.033
years-0.031
 hips-0.031
Token to
Token ID284
Feature activation-2.80e-02

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
 hips-0.002
Token the
Token ID262
Feature activation-2.38e-02

Pos logit contributions
 idea+0.041
 bal+0.038
crow+0.036
cut+0.034
ighters+0.032
Neg logit contributions
ifying-0.048
igsaw-0.040
chie-0.039
years-0.036
 hips-0.036
Token its
Token ID663
Feature activation+5.65e-03

Pos logit contributions
 idea+0.037
 doctors+0.032
 bal+0.032
 paramedics+0.032
crow+0.031
Neg logit contributions
ifying-0.056
igsaw-0.044
�-0.042
 hips-0.041
chie-0.041
Token head
Token ID1182
Feature activation-1.24e-02

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
jamin+0.008
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.006
 doctors-0.006
 cows-0.006
Token.
Token ID13
Feature activation+4.71e-03

Pos logit contributions
 idea+0.017
 doctors+0.013
crow+0.013
 bal+0.013
 people+0.012
Neg logit contributions
ifying-0.026
igsaw-0.020
years-0.020
chie-0.020
 hips-0.019
Token The
Token ID383
Feature activation-8.92e-03

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token bird
Token ID6512
Feature activation+8.37e-03

Pos logit contributions
 idea+0.010
 bal+0.008
crow+0.008
 doctors+0.007
 understand+0.007
Neg logit contributions
ifying-0.019
chie-0.016
igsaw-0.016
years-0.016
 hips-0.015
Token flew
Token ID13112
Feature activation-6.05e-03

Pos logit contributions
ifying+0.015
igsaw+0.012
years+0.012
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.011
 people-0.011
Token down
Token ID866
Feature activation+1.54e-02

Pos logit contributions
 idea+0.009
 bal+0.007
 doctors+0.007
crow+0.007
 cows+0.007
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.010
jamin-0.009
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
ifying+0.036
igsaw+0.031
years+0.030
chie+0.030
jamin+0.028
Neg logit contributions
 idea-0.017
crow-0.013
 bal-0.012
 doctors-0.012
 people-0.012
Token landed
Token ID11406
Feature activation+1.19e-03

Pos logit contributions
ifying+0.031
igsaw+0.026
chie+0.025
years+0.024
jamin+0.022
Neg logit contributions
 idea-0.022
 bal-0.018
crow-0.018
 doctors-0.018
 cows-0.017
Token on
Token ID319
Feature activation+1.68e-02

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
 hips+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token the
Token ID262
Feature activation+7.39e-03

Pos logit contributions
ifying+0.035
igsaw+0.030
years+0.030
chie+0.029
run+0.027
Neg logit contributions
 idea-0.022
 people-0.017
crow-0.017
 doctors-0.017
 cows-0.017
Token She
Token ID1375
Feature activation+8.77e-03

Pos logit contributions
 idea+0.016
crow+0.013
 bal+0.013
 doctors+0.013
 cows+0.013
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.014
chie-0.013
jamin-0.013
Token said
Token ID531
Feature activation-1.76e-02

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.014
chie+0.014
 hips+0.013
Neg logit contributions
 idea-0.012
 bal-0.010
 doctors-0.010
crow-0.010
 people-0.009
Token it
Token ID340
Feature activation+6.98e-03

Pos logit contributions
 idea+0.026
crow+0.019
 bal+0.019
 cows+0.019
 countryside+0.018
Neg logit contributions
ifying-0.033
igsaw-0.028
years-0.028
chie-0.026
 hips-0.025
Token was
Token ID373
Feature activation+9.61e-03

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.008
 doctors-0.008
 cows-0.008
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
ifying+0.019
igsaw+0.016
years+0.016
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.014
 doctors-0.011
 bal-0.011
crow-0.011
 cows-0.010
Token memory
Token ID4088
Feature activation-1.16e-02

Pos logit contributions
ifying+0.023
igsaw+0.019
years+0.019
chie+0.018
jamin+0.017
Neg logit contributions
 idea-0.014
 bal-0.011
 doctors-0.011
crow-0.011
 people-0.010
Token of
Token ID286
Feature activation+1.83e-02

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
 doctors+0.011
 cows+0.011
Neg logit contributions
ifying-0.024
igsaw-0.020
years-0.019
chie-0.019
 hips-0.018
Token a
Token ID257
Feature activation-8.81e-03

Pos logit contributions
ifying+0.040
igsaw+0.034
years+0.033
chie+0.033
jamin+0.031
Neg logit contributions
 idea-0.023
 people-0.018
 doctors-0.017
crow-0.017
 cows-0.017
Token war
Token ID1175
Feature activation-1.52e-02

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.010
 doctors+0.010
 cows+0.010
Neg logit contributions
ifying-0.017
igsaw-0.015
years-0.014
chie-0.013
 hips-0.013
Token.
Token ID13
Feature activation-9.25e-03

Pos logit contributions
 idea+0.020
 bal+0.017
crow+0.016
 doctors+0.016
 cows+0.015
Neg logit contributions
ifying-0.030
igsaw-0.025
chie-0.024
years-0.024
 seeking-0.023
Token She
Token ID1375
Feature activation+3.28e-03

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
 doctors+0.012
 cows+0.011
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.013
chie-0.013
jamin-0.012
Token and
Token ID290
Feature activation+1.05e-03

Pos logit contributions
ifying+0.026
igsaw+0.022
years+0.021
chie+0.021
jamin+0.019
Neg logit contributions
 idea-0.016
 bal-0.013
crow-0.012
 doctors-0.012
 people-0.012
Token Johnny
Token ID15470
Feature activation-9.00e-03

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token pulled
Token ID5954
Feature activation-1.67e-02

Pos logit contributions
 idea+0.014
crow+0.011
 bal+0.011
 doctors+0.011
cut+0.011
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.013
jamin-0.012
Token Bob
Token ID5811
Feature activation+1.11e-02

Pos logit contributions
 idea+0.024
 bal+0.021
crow+0.021
 doctors+0.021
cut+0.020
Neg logit contributions
ifying-0.030
igsaw-0.025
years-0.024
chie-0.024
 hips-0.024
Token's
Token ID338
Feature activation+4.19e-03

Pos logit contributions
ifying+0.022
igsaw+0.019
years+0.019
chie+0.018
jamin+0.017
Neg logit contributions
 idea-0.015
 bal-0.012
crow-0.012
 doctors-0.012
 people-0.011
Token hand
Token ID1021
Feature activation-9.63e-03

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token as
Token ID355
Feature activation-3.29e-03

Pos logit contributions
 idea+0.013
crow+0.011
 doctors+0.010
 bal+0.010
 cows+0.010
Neg logit contributions
ifying-0.020
igsaw-0.016
years-0.016
chie-0.015
 hips-0.015
Token they
Token ID484
Feature activation+1.38e-02

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.004
Token started
Token ID2067
Feature activation+5.30e-03

Pos logit contributions
ifying+0.027
igsaw+0.024
chie+0.022
years+0.022
jamin+0.021
Neg logit contributions
 idea-0.019
 bal-0.015
crow-0.015
 doctors-0.015
 cows-0.015
Token walking
Token ID6155
Feature activation-3.98e-03

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token home
Token ID1363
Feature activation+1.24e-02

Pos logit contributions
 idea+0.005
 bal+0.004
 doctors+0.004
crow+0.004
 cows+0.004
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.007
 hips-0.006
Token They
Token ID1119
Feature activation-1.42e-02

Pos logit contributions
 idea+0.017
crow+0.014
cut+0.014
 bal+0.014
 doctors+0.014
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.012
 hips-0.012
jamin-0.012
Token use
Token ID779
Feature activation-1.43e-02

Pos logit contributions
 idea+0.021
crow+0.019
 bal+0.018
 doctors+0.018
 cows+0.017
Neg logit contributions
ifying-0.026
igsaw-0.021
years-0.020
chie-0.020
 hips-0.019
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
 idea+0.022
 bal+0.019
crow+0.019
 doctors+0.018
 pilot+0.017
Neg logit contributions
ifying-0.026
igsaw-0.021
years-0.020
 hips-0.019
chie-0.019
Token bread
Token ID8509
Feature activation-8.45e-03

Pos logit contributions
 idea+0.023
 bal+0.021
crow+0.021
 doctors+0.019
ook+0.019
Neg logit contributions
ifying-0.027
igsaw-0.021
chie-0.020
 hips-0.020
years-0.019
Token to
Token ID284
Feature activation-5.95e-03

Pos logit contributions
 idea+0.012
crow+0.010
 doctors+0.010
 bal+0.010
 pilot+0.009
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.013
 hips-0.012
Token make
Token ID787
Feature activation-4.83e-02

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.007
 cows+0.006
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.009
chie-0.009
 hips-0.009
Token more
Token ID517
Feature activation-4.42e-02

Pos logit contributions
crow+0.080
 idea+0.076
 bal+0.074
cut+0.071
 ending+0.071
Neg logit contributions
ifying-0.064
igsaw-0.051
years-0.048
 hips-0.045
 forks-0.045
Token blocks
Token ID7021
Feature activation-1.74e-02

Pos logit contributions
crow+0.057
 idea+0.056
 bal+0.053
 rarely+0.052
 paramedics+0.049
Neg logit contributions
ifying-0.074
igsaw-0.064
 forks-0.053
 sigh-0.053
 hips-0.053
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
 idea+0.023
crow+0.021
 doctors+0.019
 bal+0.019
 paramedics+0.018
Neg logit contributions
ifying-0.033
igsaw-0.028
chie-0.026
years-0.025
 hips-0.025
Token They
Token ID1119
Feature activation-1.55e-02

Pos logit contributions
 idea+0.031
cut+0.027
crow+0.026
 bal+0.026
 pilot+0.026
Neg logit contributions
ifying-0.027
igsaw-0.021
�-0.018
 hips-0.018
years-0.018
Token add
Token ID751
Feature activation-1.75e-02

Pos logit contributions
 idea+0.023
crow+0.021
 bal+0.020
 doctors+0.020
 cows+0.019
Neg logit contributions
ifying-0.028
igsaw-0.022
years-0.021
chie-0.021
 hips-0.020
Token on
Token ID319
Feature activation+2.29e-03

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token Tim
Token ID5045
Feature activation-3.57e-03

Pos logit contributions
 idea+0.035
crow+0.033
 bal+0.029
 paramedics+0.029
 doctors+0.029
Neg logit contributions
ifying-0.033
igsaw-0.028
years-0.027
�-0.026
chie-0.024
Tokenmy
Token ID1820
Feature activation-5.77e-02

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.005
igsaw-0.003
years-0.003
chie-0.003
jamin-0.003
Token knew
Token ID2993
Feature activation-6.29e-03

Pos logit contributions
 idea+0.093
 paramedics+0.087
crow+0.086
ighters+0.079
 hurricane+0.077
Neg logit contributions
years-0.074
ll-0.073
ifying-0.071
jamin-0.067
 seeking-0.062
Token that
Token ID326
Feature activation-5.71e-02

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 cows+0.007
 doctors+0.007
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
run-0.008
Token when
Token ID618
Feature activation-1.05e-02

Pos logit contributions
 idea+0.098
crow+0.097
aw+0.087
isc+0.086
oster+0.085
Neg logit contributions
 seeking-0.065
years-0.061
 nowhere-0.057
ifying-0.055
ll-0.054
Token he
Token ID339
Feature activation-3.99e-03

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
cut+0.011
 paramedics+0.011
Neg logit contributions
ifying-0.020
igsaw-0.017
years-0.017
chie-0.016
 seeking-0.015
Token needed
Token ID2622
Feature activation-1.21e-02

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.007
igsaw-0.006
years-0.006
chie-0.006
jamin-0.006
Token help
Token ID1037
Feature activation-1.91e-02

Pos logit contributions
 idea+0.021
crow+0.018
 bal+0.017
cut+0.016
 pilot+0.016
Neg logit contributions
ifying-0.019
igsaw-0.015
years-0.015
chie-0.015
 hips-0.014
Token,
Token ID11
Feature activation-2.07e-02

Pos logit contributions
 idea+0.025
crow+0.020
 bal+0.020
 paramedics+0.019
 pilot+0.018
Neg logit contributions
ifying-0.036
igsaw-0.031
chie-0.030
years-0.029
 hips-0.029
Token-
Token ID12
Feature activation+2.92e-03

Pos logit contributions
ifying+0.066
igsaw+0.058
years+0.058
chie+0.057
jamin+0.053
Neg logit contributions
 idea-0.029
 bal-0.021
 people-0.020
 doctors-0.019
crow-0.019
Tokenbeing
Token ID11873
Feature activation+1.90e-02

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
ifying+0.044
igsaw+0.040
years+0.038
chie+0.037
jamin+0.035
Neg logit contributions
 idea-0.021
 bal-0.015
 people-0.015
 doctors-0.015
crow-0.014
Token<|endoftext|>
Token ID50256
Feature activation+2.56e-02

Pos logit contributions
ifying+0.023
igsaw+0.020
years+0.020
chie+0.019
 hips+0.018
Neg logit contributions
 idea-0.016
crow-0.013
 bal-0.013
 doctors-0.012
 people-0.012
TokenL
Token ID43
Feature activation+6.90e-03

Pos logit contributions
ifying+0.050
years+0.043
chie+0.043
igsaw+0.042
 hips+0.040
Neg logit contributions
 idea-0.036
crow-0.030
 bal-0.029
 people-0.029
 time-0.028
Tokenily
Token ID813
Feature activation-5.33e-02

Pos logit contributions
ifying+0.015
igsaw+0.012
years+0.012
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.009
crow-0.007
 bal-0.007
 doctors-0.007
 people-0.007
Token loved
Token ID6151
Feature activation-9.28e-03

Pos logit contributions
 idea+0.084
crow+0.072
 paramedics+0.072
 bal+0.070
 doctors+0.070
Neg logit contributions
ifying-0.082
jamin-0.074
igsaw-0.067
years-0.064
 seeking-0.062
Token to
Token ID284
Feature activation+1.44e-02

Pos logit contributions
 idea+0.016
crow+0.013
 bal+0.013
 doctors+0.012
 understand+0.012
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.012
jamin-0.011
Token paint
Token ID7521
Feature activation-8.15e-03

Pos logit contributions
ifying+0.030
years+0.025
igsaw+0.025
chie+0.025
jamin+0.023
Neg logit contributions
 idea-0.019
 bal-0.015
 understand-0.014
 doctors-0.014
crow-0.014
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 doctors+0.008
 cows+0.008
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.014
chie-0.013
 hips-0.013
Token She
Token ID1375
Feature activation-2.99e-02

Pos logit contributions
 idea+0.022
cut+0.019
 bal+0.018
crow+0.018
 doctors+0.018
Neg logit contributions
ifying-0.025
igsaw-0.022
years-0.020
chie-0.019
jamin-0.019
Token car
Token ID1097
Feature activation-5.41e-03

Pos logit contributions
 idea+0.032
crow+0.032
ighters+0.030
 doctors+0.030
 bal+0.030
Neg logit contributions
ifying-0.040
igsaw-0.036
jamin-0.031
chie-0.031
 hips-0.031
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
 idea+0.007
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.009
chie-0.009
 hips-0.009
Token It
Token ID632
Feature activation+1.81e-02

Pos logit contributions
 idea+0.024
 bal+0.019
crow+0.019
 doctors+0.019
 cows+0.019
Neg logit contributions
ifying-0.029
igsaw-0.023
years-0.022
chie-0.021
jamin-0.020
Token can
Token ID460
Feature activation+1.29e-02

Pos logit contributions
ifying+0.037
igsaw+0.032
years+0.031
chie+0.029
 hips+0.028
Neg logit contributions
 idea-0.025
 bal-0.020
 doctors-0.020
 people-0.020
crow-0.020
Token make
Token ID787
Feature activation-3.18e-02

Pos logit contributions
ifying+0.027
igsaw+0.024
years+0.023
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.017
 doctors-0.013
 bal-0.013
 people-0.013
crow-0.013
Token sounds
Token ID5238
Feature activation-4.26e-02

Pos logit contributions
 idea+0.056
crow+0.052
 bal+0.050
cut+0.049
 paramedics+0.048
Neg logit contributions
ifying-0.045
igsaw-0.036
years-0.035
chie-0.034
run-0.031
Token and
Token ID290
Feature activation+6.09e-03

Pos logit contributions
crow+0.066
 idea+0.066
 paramedics+0.063
 paramed+0.059
 infection+0.057
Neg logit contributions
ifying-0.070
igsaw-0.051
 seeking-0.047
years-0.046
run-0.046
Token lights
Token ID7588
Feature activation-1.64e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.008
 bal-0.006
 doctors-0.006
crow-0.006
 people-0.006
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
 idea+0.022
crow+0.018
 bal+0.018
 doctors+0.017
 paramedics+0.017
Neg logit contributions
ifying-0.034
igsaw-0.027
chie-0.026
years-0.026
 hips-0.025
Token Do
Token ID2141
Feature activation+2.22e-02

Pos logit contributions
 idea+0.034
crow+0.027
 bal+0.027
cut+0.026
 doctors+0.026
Neg logit contributions
ifying-0.033
igsaw-0.024
years-0.024
chie-0.023
jamin-0.022
Token you
Token ID345
Feature activation-1.56e-02

Pos logit contributions
ifying+0.043
igsaw+0.035
chie+0.033
years+0.033
oured+0.030
Neg logit contributions
 idea-0.035
 people-0.031
 doctors-0.030
 bal-0.029
 cows-0.029
Token his
Token ID465
Feature activation+1.06e-03

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.007
cut+0.006
Neg logit contributions
ifying-0.010
igsaw-0.009
years-0.008
chie-0.008
jamin-0.008
Token house
Token ID2156
Feature activation-1.90e-02

Pos logit contributions
ifying+0.003
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.001
crow-0.001
 bal-0.001
 doctors-0.001
 cows-0.001
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 idea+0.027
 bal+0.024
crow+0.024
 doctors+0.022
 paramedics+0.022
Neg logit contributions
ifying-0.035
igsaw-0.028
years-0.027
chie-0.027
 hips-0.027
Token He
Token ID679
Feature activation-1.74e-02

Pos logit contributions
 idea+0.023
 bal+0.020
 doctors+0.020
crow+0.020
 pilot+0.019
Neg logit contributions
ifying-0.027
igsaw-0.023
years-0.021
run-0.021
jamin-0.021
Token would
Token ID561
Feature activation-1.90e-02

Pos logit contributions
 idea+0.028
crow+0.025
 bal+0.024
 doctors+0.023
 paramedics+0.023
Neg logit contributions
ifying-0.029
years-0.024
chie-0.023
jamin-0.022
 hips-0.022
Token watch
Token ID2342
Feature activation-4.26e-02

Pos logit contributions
 idea+0.024
crow+0.023
 bal+0.022
 doctors+0.021
 paramedics+0.020
Neg logit contributions
ifying-0.037
chie-0.030
years-0.029
igsaw-0.029
 hips-0.029
Token the
Token ID262
Feature activation-3.46e-02

Pos logit contributions
 idea+0.070
crow+0.068
 bal+0.064
umber+0.061
 paramedics+0.061
Neg logit contributions
ifying-0.063
years-0.046
igsaw-0.046
 hips-0.046
 eight-0.045
Token boat
Token ID8848
Feature activation-2.06e-02

Pos logit contributions
 idea+0.047
 bal+0.046
crow+0.044
ook+0.043
 understand+0.042
Neg logit contributions
ifying-0.063
chie-0.046
igsaw-0.041
 eight-0.040
 total-0.040
Token go
Token ID467
Feature activation-1.58e-03

Pos logit contributions
 idea+0.025
crow+0.021
 bal+0.021
 doctors+0.020
 paramedics+0.019
Neg logit contributions
ifying-0.042
chie-0.034
igsaw-0.034
years-0.033
 hips-0.032
Token on
Token ID319
Feature activation+1.21e-03

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.003
 hips-0.002
Token the
Token ID262
Feature activation+4.03e-03

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
 hips+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001