TOP ACTIVATIONS
MAX = 3.328

ie . They both had birth days coming up , and
Spot . There are balls and bones and ropes and sque
my took Joe out for a walk in the garden .
his friends dropped her lip bal m on the ground .
dance around ! Em ma was over joy ed
new symbol ! Em ma 's friends cl apped
Goose ' and ' Hide and Seek ' ?"
things . There are hats and shoes and dresses and coats
and his mom went for a walk . It was a
The monster thought for a moment and then said
with me ?" Mel ody was very excited and
nice . Some are black and disgusting and mean ." Mia
!" Mom thought for a moment . Then she
here . Em ma had a great time
place . Em ma was so proud of
think you 're doing ?!" Fox y looked up and saw
us . We are kings and queens of this castle .
time there were two friends , Bob and Jack , who
she declared , " My toes are better now than before
does . She learned that birth days are not about cakes

MIN ACTIVATIONS
MIN = -3.408

. Every day , the family would give Strip lots of
. Every day , the mail man would put letters inside
So every evening , the lawyer would go to his big
L ily decided to keep an eye on the field to
. Every day , the mail man would leave the mail
inside it , so she let herself in . She stepped
They found a hotel and let themselves into the lobby .
. Every day , the clock went " tick - t
small . Every day the piece went on adventures , looking
From then on , Tim my never played with ropes again
. Every day , the clock would tick - t ock
she didn 't want to let it go . She held
Every day , the painter liked to play with children
It made loud noises and let out smoke when it zoom
gate . She smiled and let them in . They saw
that day on , Tim my learned to be more careful
She opened the door and let the bird out . The
was all dry when she let go ! She put on
their food on it and let it cook .
calm and peaceful . They let their little dog run around

INTERVAL 1.644 - 3.328
CONTAINS 0.323%

toys . There were balls , dolls , cars , and
day , Sophie put on her new robe , stepped outside
cries . She came in and saw the mess . She
better to share with friends than to be selfish . <|endoftext|>
man appeared behind her . He said , " That 's

INTERVAL -0.040 - 1.644
CONTAINS 51.912%

Tommy 's mom and dad came to find him . They
Good night , mom . Good night , duck ," Lily
little girl with a friendly face . " I
thought , " I should give the bear some food ."
! <|endoftext|> Once upon a time there was a big truck

INTERVAL -1.724 - -0.040
CONTAINS 47.447%

when she unp acked it . She decided to give it
after playing with them . She felt much better knowing where
of the television . <|endoftext|> Once upon a time , there
run !" They grabbed their snacks and ran out
, the mystery was never solved . <|endoftext|> One day ,

INTERVAL -3.408 - -1.724
CONTAINS 0.318%

the bench , they saw an ice cream truck . Lily
's mom my told her that they were going to have
and tried to make him feel better . The
day on , Lily knew that whenever she felt scared or
see it , please ?" Scar lett proudly handed
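
NOTE ON THE INTERVALS
The four INTERVAL ranges above appear to be equal-width bins spanning MIN = -3.408 to MAX = 3.328, and the four CONTAINS percentages add up to 100%. The sketch below only checks that arithmetic. It assumes equal-width binning, which the dump does not state, and the names equal_width_edges and shares are hypothetical, not part of whatever tool produced this output.

```python
# Minimal sketch: rebuild the interval edges from MIN and MAX, assuming
# four equal-width bins (an assumption inferred from the numbers above).

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as reported in this dump.
edges = equal_width_edges(-3.408, 3.328, 4)

# CONTAINS percentages as reported, listed from the lowest bin to the highest.
shares = [0.318, 47.447, 51.912, 0.323]
total = sum(shares)

for (left, right), share in zip(zip(edges, edges[1:]), shares):
    print(f"{left:+.3f} to {right:+.3f}: {share:.3f}%")
print(f"total: {total:.3f}%")
```

Under that assumption each bin is 1.684 wide, and the two outer bins together hold well under 1% of the activations, so the contexts listed under TOP ACTIVATIONS and MIN ACTIVATIONS are rare tail cases rather than typical behavior.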
Tokenie
Token ID494
Feature activation+5.32e-01

Pos logit contributions
ifying+0.019
igsaw+0.016
years+0.015
chie+0.015
 hips+0.014
Neg logit contributions
 idea-0.012
crow-0.010
 bal-0.009
 doctors-0.009
 people-0.009
Token.
Token ID13
Feature activation+4.09e-01

Pos logit contributions
ifying+0.030
igsaw+0.026
years+0.025
chie+0.025
jamin+0.024
Neg logit contributions
 idea-0.015
 bal-0.011
crow-0.011
 doctors-0.011
 cows-0.010
Token They
Token ID1119
Feature activation+2.09e-01

Pos logit contributions
ifying+0.021
igsaw+0.017
years+0.017
chie+0.017
jamin+0.015
Neg logit contributions
 idea-0.014
crow-0.011
 bal-0.011
 doctors-0.010
 cows-0.010
Token both
Token ID1111
Feature activation+4.70e-01

Pos logit contributions
ifying+0.010
igsaw+0.009
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token had
Token ID550
Feature activation+9.13e-01

Pos logit contributions
ifying+0.025
igsaw+0.022
years+0.021
chie+0.020
 hips+0.019
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.010
 cows-0.010
Token birth
Token ID4082
Feature activation+3.33e+00

Pos logit contributions
ifying+0.045
igsaw+0.040
chie+0.037
years+0.036
 hips+0.035
Neg logit contributions
 idea-0.031
 cows-0.027
 doctors-0.026
 people-0.026
 bal-0.024
Tokendays
Token ID12545
Feature activation+9.27e-01

Pos logit contributions
ifying+0.118
igsaw+0.114
jamin+0.102
 hips+0.099
+0.097
Neg logit contributions
 idea-0.128
 if-0.121
 understand-0.118
 whenever-0.118
 at-0.112
Token coming
Token ID2406
Feature activation+1.01e+00

Pos logit contributions
ifying+0.051
igsaw+0.044
chie+0.043
years+0.043
jamin+0.041
Neg logit contributions
 idea-0.027
 bal-0.020
 doctors-0.019
 people-0.019
 day-0.019
Token up
Token ID510
Feature activation+1.05e+00

Pos logit contributions
ifying+0.052
igsaw+0.045
years+0.042
chie+0.042
jamin+0.041
Neg logit contributions
 idea-0.032
 people-0.026
 cows-0.025
 doctors-0.025
 understand-0.025
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
ifying+0.057
chie+0.047
years+0.047
igsaw+0.047
jamin+0.045
Neg logit contributions
 idea-0.034
 people-0.025
 time-0.025
 day-0.025
 doctors-0.024
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
ifying+0.001
igsaw+0.000
years+0.000
chie+0.000
 hips+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token Spot
Token ID15899
Feature activation+2.08e-01

Pos logit contributions
ifying+0.088
igsaw+0.073
chie+0.072
jamin+0.071
oured+0.068
Neg logit contributions
 idea-0.032
 people-0.029
 doctors-0.026
 cows-0.023
 time-0.021
Token.
Token ID13
Feature activation+3.16e-02

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token There
Token ID1318
Feature activation-1.09e+00

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
 hips+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token are
Token ID389
Feature activation+9.78e-01

Pos logit contributions
 idea+0.048
 bal+0.046
 doctors+0.045
crow+0.043
 paramedics+0.043
Neg logit contributions
ifying-0.035
igsaw-0.028
years-0.025
chie-0.025
 hips-0.025
Token balls
Token ID11333
Feature activation+2.55e-01

Pos logit contributions
ifying+0.051
igsaw+0.045
chie+0.042
years+0.041
+0.039
Neg logit contributions
 idea-0.031
 doctors-0.026
 cows-0.026
 people-0.025
 bal-0.024
Token and
Token ID290
Feature activation+3.15e+00

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 cows-0.005
Token bones
Token ID11945
Feature activation+1.58e-01

Pos logit contributions
ifying+0.190
igsaw+0.179
ll+0.174
oured+0.172
years+0.171
Neg logit contributions
 people-0.076
 cows-0.072
 doctors-0.065
 idea-0.060
 sun-0.056
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token ropes
Token ID31001
Feature activation-3.52e-01

Pos logit contributions
ifying+0.117
igsaw+0.108
chie+0.105
years+0.104
ll+0.102
Neg logit contributions
 idea-0.060
 people-0.057
 doctors-0.054
 cows-0.054
 thing-0.045
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 doctors+0.009
 cows+0.008
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.014
chie-0.014
 hips-0.014
Token sque
Token ID35776
Feature activation+1.12e+00

Pos logit contributions
ifying+0.112
igsaw+0.102
years+0.097
chie+0.097
ll+0.096
Neg logit contributions
 idea-0.062
 people-0.056
 doctors-0.053
 cows-0.051
 wind-0.045
Tokenmy
Token ID1820
Feature activation-2.29e-01

Pos logit contributions
ifying+0.035
igsaw+0.028
chie+0.027
years+0.027
 hips+0.024
Neg logit contributions
 idea-0.038
 bal-0.032
crow-0.031
 cows-0.031
 doctors-0.031
Token took
Token ID1718
Feature activation-8.27e-01

Pos logit contributions
 idea+0.008
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Token Joe
Token ID5689
Feature activation+8.61e-01

Pos logit contributions
 idea+0.032
cut+0.026
crow+0.026
 bal+0.025
 doctors+0.025
Neg logit contributions
ifying-0.037
igsaw-0.031
years-0.029
jamin-0.027
chie-0.027
Token out
Token ID503
Feature activation+1.33e+00

Pos logit contributions
ifying+0.046
igsaw+0.040
years+0.039
chie+0.038
 hips+0.036
Neg logit contributions
 idea-0.025
 bal-0.020
crow-0.019
 cows-0.019
 doctors-0.019
Token for
Token ID329
Feature activation+1.15e+00

Pos logit contributions
ifying+0.068
chie+0.058
igsaw+0.058
years+0.056
jamin+0.055
Neg logit contributions
 idea-0.042
 bal-0.031
 people-0.031
 cows-0.031
 doctors-0.031
Token a
Token ID257
Feature activation+3.08e+00

Pos logit contributions
ifying+0.059
igsaw+0.049
chie+0.049
years+0.047
jamin+0.045
Neg logit contributions
 idea-0.037
 doctors-0.031
 people-0.031
 cows-0.030
 time-0.030
Token walk
Token ID2513
Feature activation-2.00e-02

Pos logit contributions
ifying+0.185
years+0.179
chie+0.161
hetically+0.152
 seeking+0.149
Neg logit contributions
 idea-0.070
 day-0.066
 time-0.064
 morning-0.056
 bal-0.052
Token in
Token ID287
Feature activation+3.08e-01

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.013
chie+0.013
jamin+0.012
Neg logit contributions
 idea-0.010
crow-0.008
 bal-0.008
 doctors-0.007
 cows-0.007
Token garden
Token ID11376
Feature activation-3.23e-01

Pos logit contributions
ifying+0.011
igsaw+0.010
years+0.010
chie+0.009
 hips+0.009
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 doctors+0.008
 understand+0.007
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.013
chie-0.013
 hips-0.013
Token his
Token ID465
Feature activation+1.43e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token friends
Token ID2460
Feature activation-9.89e-01

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token dropped
Token ID5710
Feature activation+2.63e-01

Pos logit contributions
 idea+0.039
 bal+0.032
crow+0.031
cut+0.031
 doctors+0.031
Neg logit contributions
ifying-0.042
years-0.034
chie-0.032
jamin-0.031
igsaw-0.030
Token her
Token ID607
Feature activation+7.51e-01

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token lip
Token ID10645
Feature activation+1.51e-01

Pos logit contributions
ifying+0.039
years+0.034
igsaw+0.034
chie+0.032
jamin+0.030
Neg logit contributions
 idea-0.024
 bal-0.019
 doctors-0.018
crow-0.018
 cows-0.018
Token bal
Token ID3652
Feature activation+3.02e+00

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.003
 cows-0.003
Tokenm
Token ID76
Feature activation+6.09e-01

Pos logit contributions
igsaw+0.113
ifying+0.095
 hind+0.090
 unison+0.089
 hips+0.084
Neg logit contributions
 bal-0.113
ying-0.109
crow-0.103
 if-0.103
umber-0.099
Token on
Token ID319
Feature activation-1.31e-01

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.023
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.023
 bal-0.019
crow-0.018
 doctors-0.018
 people-0.017
Token the
Token ID262
Feature activation+5.43e-01

Pos logit contributions
 idea+0.004
 bal+0.004
crow+0.004
 doctors+0.003
 understand+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.005
Token ground
Token ID2323
Feature activation+7.40e-01

Pos logit contributions
ifying+0.034
igsaw+0.030
years+0.029
chie+0.028
jamin+0.027
Neg logit contributions
 idea-0.012
 doctors-0.008
 bal-0.008
crow-0.008
 cows-0.008
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
ifying+0.043
igsaw+0.038
years+0.037
chie+0.036
jamin+0.034
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.013
 doctors-0.013
 cows-0.013
Token dance
Token ID9280
Feature activation+2.79e-01

Pos logit contributions
ifying+0.024
igsaw+0.021
years+0.020
chie+0.019
jamin+0.018
Neg logit contributions
 idea-0.016
 bal-0.013
crow-0.013
 doctors-0.013
 people-0.013
Token around
Token ID1088
Feature activation+2.53e-01

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.012
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token!
Token ID0
Feature activation+7.98e-01

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token
Token ID198
Feature activation+6.03e-01

Pos logit contributions
ifying+0.039
years+0.032
chie+0.032
igsaw+0.032
 hips+0.029
Neg logit contributions
 idea-0.027
 bal-0.022
crow-0.022
 doctors-0.021
 people-0.021
Token
Token ID198
Feature activation+8.29e-01

Pos logit contributions
ifying+0.033
igsaw+0.027
years+0.027
chie+0.027
 hips+0.025
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.013
 understand-0.013
 doctors-0.013
TokenEm
Token ID10161
Feature activation+3.00e+00

Pos logit contributions
ifying+0.032
igsaw+0.025
chie+0.024
years+0.024
 hips+0.023
Neg logit contributions
 idea-0.035
 bal-0.031
 doctors-0.031
 understand-0.030
 people-0.030
Tokenma
Token ID2611
Feature activation+8.46e-01

Pos logit contributions
ifying+0.178
 hind+0.155
igsaw+0.149
ider+0.143
+0.141
Neg logit contributions
 idea-0.046
 time-0.045
 day-0.042
est-0.041
ying-0.040
Token was
Token ID373
Feature activation-8.76e-02

Pos logit contributions
ifying+0.037
igsaw+0.031
years+0.029
chie+0.028
 hips+0.026
Neg logit contributions
 idea-0.033
 bal-0.029
crow-0.027
 people-0.027
 cows-0.026
Token over
Token ID625
Feature activation+4.00e-01

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.002
chie-0.002
jamin-0.002
Tokenjoy
Token ID2633
Feature activation+3.25e-01

Pos logit contributions
ifying+0.014
igsaw+0.011
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.020
 bal-0.017
crow-0.017
 doctors-0.017
 people-0.016
Tokened
Token ID276
Feature activation+3.19e-01

Pos logit contributions
ifying+0.014
igsaw+0.011
years+0.011
chie+0.010
 hips+0.010
Neg logit contributions
 idea-0.013
 bal-0.011
crow-0.011
 doctors-0.011
 people-0.011
Token new
Token ID649
Feature activation-5.15e-01

Pos logit contributions
ifying+0.039
years+0.033
igsaw+0.032
chie+0.031
jamin+0.031
Neg logit contributions
 idea-0.025
 bal-0.019
crow-0.018
 thing-0.018
 doctors-0.018
Token symbol
Token ID6194
Feature activation-3.21e-01

Pos logit contributions
 idea+0.015
crow+0.013
 doctors+0.013
 bal+0.013
 cows+0.013
Neg logit contributions
ifying-0.025
igsaw-0.022
chie-0.021
years-0.020
 hips-0.019
Token!
Token ID0
Feature activation+1.20e+00

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
 cows+0.008
Neg logit contributions
ifying-0.016
igsaw-0.013
chie-0.012
years-0.012
jamin-0.012
Token
Token ID198
Feature activation+4.63e-01

Pos logit contributions
ifying+0.058
years+0.051
chie+0.050
igsaw+0.048
ll+0.045
Neg logit contributions
 idea-0.039
 People-0.032
 bal-0.031
 people-0.031
crow-0.030
Token
Token ID198
Feature activation+7.40e-01

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.023
chie+0.023
 hips+0.022
Neg logit contributions
 idea-0.011
 bal-0.008
crow-0.007
 doctors-0.007
 understand-0.007
TokenEm
Token ID10161
Feature activation+2.97e+00

Pos logit contributions
ifying+0.029
igsaw+0.023
chie+0.022
years+0.022
 hips+0.021
Neg logit contributions
 idea-0.032
 bal-0.028
 doctors-0.028
 understand-0.027
 people-0.027
Tokenma
Token ID2611
Feature activation+1.06e+00

Pos logit contributions
ifying+0.179
 hind+0.150
igsaw+0.149
lic+0.141
run+0.138
Neg logit contributions
 idea-0.052
ying-0.046
 people-0.046
 day-0.043
est-0.043
Token's
Token ID338
Feature activation-9.63e-02

Pos logit contributions
ifying+0.048
igsaw+0.041
years+0.037
chie+0.036
 hips+0.033
Neg logit contributions
 idea-0.040
 bal-0.034
crow-0.033
 understand-0.033
 people-0.033
Token friends
Token ID2460
Feature activation+1.26e+00

Pos logit contributions
 idea+0.003
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
 hips-0.004
Token cl
Token ID537
Feature activation+9.56e-01

Pos logit contributions
ifying+0.058
igsaw+0.052
years+0.045
chie+0.045
 hips+0.041
Neg logit contributions
 idea-0.046
 understand-0.039
 people-0.038
 bal-0.037
crow-0.037
Tokenapped
Token ID6320
Feature activation-4.21e-02

Pos logit contributions
ifying+0.050
igsaw+0.044
 hips+0.041
years+0.041
 seeking+0.040
Neg logit contributions
 idea-0.027
 bal-0.022
crow-0.022
 understand-0.020
ook-0.019
Token Goose
Token ID46317
Feature activation+2.31e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.004
crow-0.004
 doctors-0.003
 cows-0.003
Token'
Token ID6
Feature activation-5.17e-01

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
jamin+0.008
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.006
Token and
Token ID290
Feature activation+2.93e-01

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.014
cut+0.014
Neg logit contributions
ifying-0.024
years-0.020
chie-0.019
igsaw-0.019
 hips-0.018
Token '
Token ID705
Feature activation+1.25e+00

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 cows-0.009
TokenHide
Token ID38518
Feature activation+6.66e-01

Pos logit contributions
ifying+0.056
igsaw+0.049
 hips+0.044
years+0.044
chie+0.044
Neg logit contributions
 idea-0.044
crow-0.039
 bal-0.039
 doctors-0.037
 people-0.037
Token and
Token ID290
Feature activation+2.94e+00

Pos logit contributions
ifying+0.035
igsaw+0.030
years+0.029
chie+0.029
 hips+0.027
Neg logit contributions
 idea-0.020
crow-0.016
 bal-0.016
 doctors-0.015
 people-0.015
Token Seek
Token ID40749
Feature activation+1.13e+00

Pos logit contributions
ifying+0.135
igsaw+0.107
+0.105
chie+0.099
years+0.099
Neg logit contributions
 idea-0.108
 people-0.107
 day-0.100
 wind-0.098
 sun-0.095
Token'
Token ID6
Feature activation+2.58e-01

Pos logit contributions
ifying+0.056
years+0.049
igsaw+0.048
chie+0.047
jamin+0.044
Neg logit contributions
 idea-0.035
 people-0.028
crow-0.028
 bal-0.028
 doctors-0.028
Token?"
Token ID1701
Feature activation-2.72e-01

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.006
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
 pilot+0.009
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.009
jamin-0.008
 hips-0.008
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.006
jamin-0.006
Token things
Token ID1243
Feature activation-4.19e-02

Pos logit contributions
 idea+0.003
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.003
 hips-0.003
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
 hips-0.002
Token There
Token ID1318
Feature activation-9.83e-01

Pos logit contributions
ifying+0.030
igsaw+0.025
years+0.025
chie+0.025
jamin+0.023
Neg logit contributions
 idea-0.018
crow-0.014
 bal-0.014
 cows-0.014
 doctors-0.014
Token are
Token ID389
Feature activation+7.22e-01

Pos logit contributions
 idea+0.043
 bal+0.041
 doctors+0.039
crow+0.038
 paramedics+0.038
Neg logit contributions
ifying-0.033
igsaw-0.026
chie-0.025
years-0.024
 hips-0.023
Token hats
Token ID23910
Feature activation+1.23e-01

Pos logit contributions
ifying+0.038
igsaw+0.033
years+0.031
chie+0.031
jamin+0.030
Neg logit contributions
 idea-0.023
 doctors-0.018
 people-0.018
 cows-0.018
 bal-0.017
Token and
Token ID290
Feature activation+2.91e+00

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token shoes
Token ID10012
Feature activation+8.25e-01

Pos logit contributions
ifying+0.154
igsaw+0.150
+0.146
years+0.143
chie+0.137
Neg logit contributions
 people-0.078
 cows-0.078
 idea-0.077
 doctors-0.068
 wheat-0.064
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
ifying+0.048
igsaw+0.042
years+0.042
chie+0.041
jamin+0.039
Neg logit contributions
 idea-0.021
 bal-0.015
crow-0.014
 people-0.014
 doctors-0.014
Token dresses
Token ID27309
Feature activation+2.76e-01

Pos logit contributions
ifying+0.113
igsaw+0.105
years+0.101
chie+0.100
+0.099
Neg logit contributions
 idea-0.064
 people-0.057
 cows-0.055
 doctors-0.054
 wheat-0.047
Token and
Token ID290
Feature activation+1.89e+00

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.012
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token coats
Token ID30720
Feature activation+3.62e-01

Pos logit contributions
ifying+0.100
igsaw+0.091
years+0.087
chie+0.087
+0.085
Neg logit contributions
 idea-0.059
 people-0.051
 doctors-0.049
 cows-0.048
 understand-0.043
Token and
Token ID290
Feature activation-1.92e-01

Pos logit contributions
 idea+0.038
 bal+0.031
 doctors+0.031
crow+0.031
 paramedics+0.030
Neg logit contributions
ifying-0.044
years-0.034
jamin-0.034
igsaw-0.033
chie-0.032
Token his
Token ID465
Feature activation+1.04e+00

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.007
chie-0.007
jamin-0.007
Token mom
Token ID1995
Feature activation-1.69e-01

Pos logit contributions
ifying+0.067
igsaw+0.059
years+0.059
chie+0.058
jamin+0.055
Neg logit contributions
 idea-0.021
 people-0.014
 cows-0.014
 bal-0.013
 pilot-0.013
Token went
Token ID1816
Feature activation+1.33e+00

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.008
igsaw-0.006
years-0.006
chie-0.006
jamin-0.006
Token for
Token ID329
Feature activation+1.45e+00

Pos logit contributions
ifying+0.074
years+0.063
chie+0.063
igsaw+0.063
jamin+0.058
Neg logit contributions
 idea-0.037
 people-0.028
 day-0.028
 cows-0.027
 doctors-0.027
Token a
Token ID257
Feature activation+2.90e+00

Pos logit contributions
ifying+0.083
chie+0.071
years+0.069
igsaw+0.068
jamin+0.066
Neg logit contributions
 idea-0.040
 people-0.035
 doctors-0.034
 wheat-0.033
 cows-0.032
Token walk
Token ID2513
Feature activation+2.43e-01

Pos logit contributions
ifying+0.177
years+0.165
chie+0.154
jamin+0.147
hetically+0.142
Neg logit contributions
 idea-0.070
 day-0.066
 time-0.059
 morning-0.052
 understand-0.049
Token.
Token ID13
Feature activation-6.67e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token It
Token ID632
Feature activation-1.29e-01

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.002
chie-0.002
jamin-0.002
Token was
Token ID373
Feature activation-8.20e-01

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token a
Token ID257
Feature activation+7.28e-01

Pos logit contributions
 idea+0.036
crow+0.036
 bal+0.035
cut+0.034
ighters+0.033
Neg logit contributions
ifying-0.027
years-0.020
jamin-0.019
igsaw-0.019
chie-0.018
Token
Token ID198
Feature activation+7.79e-01

Pos logit contributions
ifying+0.097
chie+0.085
years+0.083
igsaw+0.077
 brief+0.071
Neg logit contributions
 idea-0.045
 People-0.035
 bal-0.035
 But-0.035
 whenever-0.034
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
ifying+0.044
igsaw+0.037
years+0.036
chie+0.036
 hips+0.033
Neg logit contributions
 idea-0.021
 bal-0.016
crow-0.016
 understand-0.016
 cows-0.015
TokenThe
Token ID464
Feature activation+3.54e-01

Pos logit contributions
ifying+0.049
igsaw+0.039
chie+0.038
years+0.038
oured+0.036
Neg logit contributions
 idea-0.041
 bal-0.038
crow-0.036
 understand-0.036
 doctors-0.035
Token monster
Token ID9234
Feature activation-1.55e-01

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.015
chie+0.015
 hips+0.014
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 people-0.008
Token thought
Token ID1807
Feature activation+2.65e-01

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token for
Token ID329
Feature activation+2.90e+00

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.014
chie+0.013
jamin+0.013
Neg logit contributions
 idea-0.006
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
ifying+0.189
chie+0.166
igsaw+0.157
run+0.156
ll+0.154
Neg logit contributions
 idea-0.071
 time-0.061
 people-0.049
 it-0.048
 understand-0.045
Token moment
Token ID2589
Feature activation-1.06e-01

Pos logit contributions
ifying+0.048
years+0.040
igsaw+0.038
chie+0.036
jamin+0.034
Neg logit contributions
 idea-0.039
 understand-0.031
 bal-0.031
 time-0.030
 day-0.030
Token and
Token ID290
Feature activation+5.23e-01

Pos logit contributions
 idea+0.003
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.002
Neg logit contributions
ifying-0.005
igsaw-0.005
years-0.004
chie-0.004
jamin-0.004
Token then
Token ID788
Feature activation-5.32e-01

Pos logit contributions
ifying+0.027
igsaw+0.022
chie+0.021
years+0.021
 hips+0.020
Neg logit contributions
 idea-0.017
 bal-0.014
crow-0.014
 doctors-0.014
 people-0.013
Token said
Token ID531
Feature activation-2.17e-01

Pos logit contributions
 idea+0.021
crow+0.017
 bal+0.017
 doctors+0.017
 cows+0.017
Neg logit contributions
ifying-0.022
igsaw-0.018
years-0.018
chie-0.017
jamin-0.017
Token with
Token ID351
Feature activation+2.11e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token me
Token ID502
Feature activation+8.72e-01

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token?"
Token ID1701
Feature activation+6.19e-01

Pos logit contributions
ifying+0.049
igsaw+0.043
years+0.041
chie+0.040
jamin+0.038
Neg logit contributions
 idea-0.024
 bal-0.018
 doctors-0.018
 people-0.017
crow-0.017
Token
Token ID198
Feature activation+6.02e-01

Pos logit contributions
ifying+0.033
igsaw+0.027
years+0.027
chie+0.027
jamin+0.025
Neg logit contributions
 idea-0.019
crow-0.015
 bal-0.015
 people-0.014
 understand-0.014
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
ifying+0.032
igsaw+0.027
years+0.026
chie+0.026
jamin+0.024
Neg logit contributions
 idea-0.018
crow-0.014
 bal-0.014
 people-0.014
 understand-0.014
TokenMel
Token ID21102
Feature activation+2.89e+00

Pos logit contributions
ifying+0.064
igsaw+0.050
years+0.048
chie+0.048
 hips+0.046
Neg logit contributions
 idea-0.045
 people-0.038
 understand-0.038
crow-0.038
 bal-0.036
Tokenody
Token ID1118
Feature activation+1.16e+00

Pos logit contributions
ifying+0.143
+0.113
igsaw+0.109
 hind+0.108
 Deal+0.106
Neg logit contributions
 idea-0.094
aw-0.084
 people-0.084
crow-0.084
 thing-0.079
Token was
Token ID373
Feature activation-2.77e-01

Pos logit contributions
ifying+0.053
igsaw+0.044
years+0.041
chie+0.040
+0.036
Neg logit contributions
 idea-0.046
 bal-0.038
crow-0.036
 understand-0.036
 people-0.035
Token very
Token ID845
Feature activation-2.13e-01

Pos logit contributions
 idea+0.011
crow+0.010
 bal+0.010
 doctors+0.010
 cows+0.009
Neg logit contributions
ifying-0.011
years-0.009
igsaw-0.009
chie-0.009
jamin-0.008
Token excited
Token ID6568
Feature activation+3.34e-01

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.010
years-0.009
chie-0.009
jamin-0.009
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
ifying+0.018
igsaw+0.016
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.010
 bal-0.007
crow-0.007
 doctors-0.007
 people-0.007
Token nice
Token ID3621
Feature activation+2.72e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.012
chie+0.012
jamin+0.012
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token Some
Token ID2773
Feature activation+9.60e-01

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token are
Token ID389
Feature activation+6.77e-01

Pos logit contributions
ifying+0.053
igsaw+0.047
years+0.045
chie+0.044
jamin+0.042
Neg logit contributions
 idea-0.028
crow-0.022
 people-0.022
 bal-0.021
 doctors-0.021
Token black
Token ID2042
Feature activation+7.79e-01

Pos logit contributions
ifying+0.034
igsaw+0.029
years+0.028
chie+0.027
jamin+0.026
Neg logit contributions
 idea-0.023
 doctors-0.018
 bal-0.018
crow-0.018
 cows-0.018
Token and
Token ID290
Feature activation+2.87e+00

Pos logit contributions
ifying+0.045
igsaw+0.040
years+0.039
chie+0.038
jamin+0.036
Neg logit contributions
 idea-0.020
crow-0.015
 bal-0.015
 doctors-0.015
 people-0.014
Token disgusting
Token ID23374
Feature activation-4.77e-02

Pos logit contributions
ifying+0.145
igsaw+0.140
enny+0.133
jamin+0.131
oured+0.130
Neg logit contributions
 idea-0.074
 wheat-0.074
 people-0.071
 cows-0.070
 doctors-0.069
Token and
Token ID290
Feature activation+2.33e+00

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token mean
Token ID1612
Feature activation-6.37e-01

Pos logit contributions
ifying+0.120
igsaw+0.113
years+0.104
jamin+0.103
oured+0.103
Neg logit contributions
 idea-0.066
 people-0.059
 doctors-0.057
 wheat-0.056
 cows-0.055
Token."
Token ID526
Feature activation+1.13e-01

Pos logit contributions
 idea+0.021
 bal+0.016
crow+0.016
 pilot+0.015
 doctors+0.015
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.025
chie-0.025
 hips-0.024
Token Mia
Token ID32189
Feature activation-8.81e-02

Pos logit contributions
ifying+0.004
igsaw+0.004
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token!"
Token ID2474
Feature activation-4.55e-01

Pos logit contributions
ifying+0.003
igsaw+0.003
years+0.002
chie+0.002
 hips+0.002
Neg logit contributions
 idea-0.002
crow-0.002
 bal-0.002
 doctors-0.001
 cows-0.001
Token
Token ID198
Feature activation-6.78e-01

Pos logit contributions
 idea+0.018
crow+0.016
cut+0.015
 bal+0.015
 doctors+0.015
Neg logit contributions
ifying-0.019
igsaw-0.015
years-0.015
 hips-0.015
jamin-0.014
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 idea+0.028
 bal+0.024
crow+0.023
 doctors+0.023
 pilot+0.022
Neg logit contributions
ifying-0.027
igsaw-0.022
years-0.021
 hips-0.020
jamin-0.020
TokenMom
Token ID29252
Feature activation+1.34e+00

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token thought
Token ID1807
Feature activation-6.15e-01

Pos logit contributions
ifying+0.067
igsaw+0.056
chie+0.052
years+0.048
+0.045
Neg logit contributions
 idea-0.049
 bal-0.043
 doctors-0.042
 people-0.041
 understand-0.041
Token for
Token ID329
Feature activation+2.86e+00

Pos logit contributions
 idea+0.019
 bal+0.017
crow+0.017
 pilot+0.016
cut+0.016
Neg logit contributions
ifying-0.028
igsaw-0.025
years-0.023
run-0.023
chie-0.023
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
ifying+0.185
chie+0.172
run+0.168
igsaw+0.161
ll+0.158
Neg logit contributions
 idea-0.058
 doctors-0.054
 people-0.048
 time-0.047
 understand-0.039
Token moment
Token ID2589
Feature activation+2.08e-01

Pos logit contributions
ifying+0.078
years+0.070
chie+0.062
igsaw+0.061
oured+0.057
Neg logit contributions
 idea-0.055
 understand-0.042
 time-0.041
 day-0.040
 thing-0.040
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
ifying+0.011
igsaw+0.010
years+0.009
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.004
 cows-0.004
Token Then
Token ID3244
Feature activation-1.73e-01

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
cut+0.004
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.004
Token she
Token ID673
Feature activation+6.47e-01

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
cut+0.004
 doctors+0.004
Neg logit contributions
ifying-0.009
igsaw-0.007
years-0.007
chie-0.007
 hips-0.007
Token here
Token ID994
Feature activation+4.42e-02

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.007
igsaw-0.006
years-0.005
chie-0.005
 hips-0.005
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
 hips+0.002
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token
Token ID220
Feature activation-1.25e-02

Pos logit contributions
ifying+0.063
chie+0.056
years+0.055
igsaw+0.052
ll+0.048
Neg logit contributions
 idea-0.041
 People-0.035
 understand-0.033
 whenever-0.033
 bal-0.033
Token
Token ID198
Feature activation+9.02e-01

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.000
chie-0.000
jamin-0.000
Token
Token ID198
Feature activation+5.78e-01

Pos logit contributions
ifying+0.052
chie+0.044
igsaw+0.044
years+0.043
 hips+0.040
Neg logit contributions
 idea-0.023
 understand-0.018
 bal-0.017
 people-0.017
 doctors-0.016
TokenEm
Token ID10161
Feature activation+2.85e+00

Pos logit contributions
ifying+0.023
igsaw+0.018
chie+0.018
years+0.017
 hips+0.016
Neg logit contributions
 idea-0.025
 bal-0.022
 doctors-0.021
 understand-0.021
crow-0.021
Tokenma
Token ID2611
Feature activation+1.35e-01

Pos logit contributions
ifying+0.165
 hind+0.140
igsaw+0.135
run+0.128
chie+0.128
Neg logit contributions
 idea-0.059
 day-0.055
 time-0.053
est-0.050
 morning-0.047
Token had
Token ID550
Feature activation+6.37e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token a
Token ID257
Feature activation+7.15e-01

Pos logit contributions
ifying+0.031
igsaw+0.026
years+0.024
chie+0.024
 hips+0.023
Neg logit contributions
 idea-0.023
 bal-0.019
 doctors-0.018
crow-0.018
 people-0.018
Token great
Token ID1049
Feature activation-9.03e-01

Pos logit contributions
ifying+0.039
years+0.033
igsaw+0.033
chie+0.032
 hips+0.030
Neg logit contributions
 idea-0.022
 bal-0.016
 doctors-0.015
 understand-0.015
 pilot-0.015
Token time
Token ID640
Feature activation-3.23e-01

Pos logit contributions
ighters+0.014
crow+0.014
cut+0.013
 doctors+0.013
 idea+0.013
Neg logit contributions
ifying-0.053
igsaw-0.051
jamin-0.046
-0.045
chie-0.044
Token place
Token ID1295
Feature activation+9.67e-01

Pos logit contributions
 idea+0.036
ighters+0.035
 bal+0.035
crow+0.032
 doctors+0.032
Neg logit contributions
ifying-0.073
igsaw-0.066
chie-0.062
 seeking-0.061
jamin-0.060
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
ifying+0.050
years+0.043
igsaw+0.043
chie+0.040
jamin+0.039
Neg logit contributions
 idea-0.031
 bal-0.024
crow-0.023
 doctors-0.022
 people-0.022
Token
Token ID220
Feature activation-6.23e-02

Pos logit contributions
ifying+0.075
chie+0.068
years+0.067
igsaw+0.063
 brief+0.059
Neg logit contributions
 idea-0.035
 People-0.030
 whenever-0.028
 Everyone-0.027
 bal-0.027
Token
Token ID198
Feature activation+4.77e-01

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.002
chie-0.002
jamin-0.002
Token
Token ID198
Feature activation+6.79e-01

Pos logit contributions
ifying+0.027
igsaw+0.023
years+0.022
chie+0.022
 hips+0.021
Neg logit contributions
 idea-0.013
 bal-0.010
crow-0.010
 doctors-0.010
 understand-0.010
TokenEm
Token ID10161
Feature activation+2.84e+00

Pos logit contributions
ifying+0.033
igsaw+0.027
years+0.026
chie+0.026
 hips+0.025
Neg logit contributions
 idea-0.023
 bal-0.019
 doctors-0.019
 understand-0.019
crow-0.019
Tokenma
Token ID2611
Feature activation+6.81e-01

Pos logit contributions
ifying+0.168
igsaw+0.139
 hind+0.137
lic+0.130
run+0.126
Neg logit contributions
 idea-0.059
 time-0.051
 day-0.049
ying-0.048
est-0.046
Token was
Token ID373
Feature activation-3.45e-02

Pos logit contributions
ifying+0.034
igsaw+0.029
years+0.027
chie+0.027
 hips+0.025
Neg logit contributions
 idea-0.023
 bal-0.019
crow-0.018
 doctors-0.018
 cows-0.018
Token so
Token ID523
Feature activation+3.75e-01

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token proud
Token ID6613
Feature activation-1.69e-02

Pos logit contributions
ifying+0.022
igsaw+0.019
years+0.018
chie+0.018
 hips+0.017
Neg logit contributions
 idea-0.010
 bal-0.007
crow-0.007
 understand-0.007
 doctors-0.007
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
 idea+0.001
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token think
Token ID892
Feature activation-1.28e-03

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token you
Token ID345
Feature activation+1.96e-01

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token're
Token ID821
Feature activation-8.20e-01

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token doing
Token ID1804
Feature activation-6.97e-01

Pos logit contributions
 idea+0.029
 bal+0.028
crow+0.027
 pilot+0.025
 paramedics+0.025
Neg logit contributions
ifying-0.035
igsaw-0.029
years-0.028
 seeking-0.027
chie-0.027
Token?!"
Token ID30823
Feature activation+1.01e+00

Pos logit contributions
 idea+0.025
 bal+0.023
crow+0.023
 paramedics+0.021
 doctors+0.021
Neg logit contributions
ifying-0.031
igsaw-0.026
years-0.023
jamin-0.023
chie-0.023
Token Fox
Token ID5426
Feature activation+2.83e+00

Pos logit contributions
ifying+0.056
igsaw+0.049
years+0.048
chie+0.047
 hips+0.043
Neg logit contributions
 idea-0.028
crow-0.024
 people-0.022
 bal-0.022
 understand-0.022
Tokeny
Token ID88
Feature activation+2.15e+00

Pos logit contributions
ifying+0.132
igsaw+0.123
+0.107
years+0.104
chie+0.097
Neg logit contributions
 idea-0.099
crow-0.087
 understand-0.083
cut-0.080
 bal-0.077
Token looked
Token ID3114
Feature activation-1.07e-01

Pos logit contributions
ifying+0.112
igsaw+0.098
years+0.089
chie+0.086
run+0.078
Neg logit contributions
 idea-0.072
 people-0.058
crow-0.058
 understand-0.057
 bal-0.057
Token up
Token ID510
Feature activation+2.44e-01

Pos logit contributions
 idea+0.003
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token and
Token ID290
Feature activation+1.34e+00

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token saw
Token ID2497
Feature activation+6.71e-01

Pos logit contributions
ifying+0.074
igsaw+0.066
chie+0.058
years+0.058
+0.056
Neg logit contributions
 idea-0.037
 people-0.031
crow-0.031
 bal-0.030
 cows-0.030
Token us
Token ID514
Feature activation+6.62e-01

Pos logit contributions
ifying+0.041
igsaw+0.034
chie+0.033
years+0.033
jamin+0.032
Neg logit contributions
 idea-0.031
 people-0.027
 doctors-0.026
 cows-0.026
crow-0.025
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
ifying+0.037
igsaw+0.032
years+0.031
chie+0.031
jamin+0.029
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.013
 doctors-0.013
 people-0.013
Token We
Token ID775
Feature activation-1.13e+00

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token are
Token ID389
Feature activation-5.62e-01

Pos logit contributions
 idea+0.050
 doctors+0.047
 cows+0.047
 paramedics+0.046
crow+0.046
Neg logit contributions
ifying-0.042
chie-0.030
years-0.028
igsaw-0.027
 total-0.026
Token kings
Token ID21557
Feature activation-7.63e-03

Pos logit contributions
 idea+0.023
crow+0.021
 bal+0.020
cut+0.019
 understand+0.019
Neg logit contributions
ifying-0.022
years-0.018
igsaw-0.017
chie-0.017
 seeking-0.017
Token and
Token ID290
Feature activation+2.83e+00

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token queens
Token ID38771
Feature activation+7.14e-01

Pos logit contributions
ifying+0.113
igsaw+0.096
+0.091
 hips+0.091
jamin+0.088
Neg logit contributions
 people-0.128
 doctors-0.122
 pilot-0.116
 even-0.114
 cows-0.113
Token of
Token ID286
Feature activation+1.26e+00

Pos logit contributions
ifying+0.041
igsaw+0.035
years+0.035
chie+0.034
jamin+0.033
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.014
 people-0.013
 doctors-0.013
Token this
Token ID428
Feature activation-3.85e-02

Pos logit contributions
ifying+0.073
igsaw+0.063
jamin+0.060
chie+0.060
years+0.059
Neg logit contributions
 idea-0.032
 people-0.029
 doctors-0.026
 cows-0.026
crow-0.023
Token castle
Token ID16669
Feature activation+3.91e-01

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
jamin-0.001
Token.
Token ID13
Feature activation-4.28e-01

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.018
chie+0.017
jamin+0.016
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.008
 cows-0.008
Token time
Token ID640
Feature activation+7.96e-01

Pos logit contributions
ifying+0.043
years+0.038
igsaw+0.037
chie+0.037
 hips+0.035
Neg logit contributions
 idea-0.005
 bal-0.001
crow-0.000
 time-0.000
 doctors+0.000
Token there
Token ID612
Feature activation-9.36e-01

Pos logit contributions
ifying+0.049
years+0.041
igsaw+0.041
chie+0.040
jamin+0.038
Neg logit contributions
 idea-0.019
 bal-0.014
crow-0.013
 people-0.012
 doctors-0.012
Token were
Token ID547
Feature activation+4.22e-01

Pos logit contributions
 idea+0.042
 doctors+0.038
 bal+0.038
 paramedics+0.037
 cows+0.037
Neg logit contributions
ifying-0.031
igsaw-0.024
jamin-0.023
years-0.023
-0.022
Token two
Token ID734
Feature activation+6.34e-01

Pos logit contributions
ifying+0.018
igsaw+0.014
years+0.014
chie+0.014
 hips+0.013
Neg logit contributions
 idea-0.017
 bal-0.015
 doctors-0.014
crow-0.014
 cows-0.014
Token friends
Token ID2460
Feature activation+9.06e-01

Pos logit contributions
ifying+0.036
igsaw+0.031
years+0.030
chie+0.030
jamin+0.028
Neg logit contributions
 idea-0.017
 doctors-0.013
 cows-0.013
 bal-0.013
 people-0.013
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
ifying+0.056
igsaw+0.048
chie+0.045
years+0.045
jamin+0.043
Neg logit contributions
 idea-0.022
crow-0.016
 doctors-0.016
 people-0.016
 cows-0.015
Token Bob
Token ID5811
Feature activation+5.42e-01

Pos logit contributions
ifying+0.164
chie+0.143
igsaw+0.141
years+0.134
ll+0.133
Neg logit contributions
 idea-0.067
 cows-0.058
 time-0.051
 doctors-0.051
 people-0.050
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
ifying+0.031
igsaw+0.027
years+0.026
chie+0.026
 hips+0.024
Neg logit contributions
 idea-0.014
crow-0.011
 bal-0.011
 cows-0.010
 doctors-0.010
Token Jack
Token ID3619
Feature activation+8.53e-01

Pos logit contributions
ifying+0.078
igsaw+0.062
years+0.061
chie+0.059
oured+0.057
Neg logit contributions
 idea-0.045
crow-0.035
 bal-0.034
 doctors-0.034
 understand-0.034
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
ifying+0.053
igsaw+0.046
years+0.044
chie+0.044
 hips+0.042
Neg logit contributions
 idea-0.020
 bal-0.013
 cows-0.013
crow-0.013
 understand-0.013
Token who
Token ID508
Feature activation-5.20e-01

Pos logit contributions
ifying+0.053
igsaw+0.046
chie+0.044
years+0.044
 hips+0.039
Neg logit contributions
 idea-0.039
 bal-0.032
 cows-0.032
 doctors-0.031
 understand-0.031
Token she
Token ID673
Feature activation+4.65e-01

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.017
chie+0.016
 hips+0.015
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.011
 people-0.011
Token declared
Token ID6875
Feature activation-3.69e-02

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
 hips+0.014
Neg logit contributions
 idea-0.019
 bal-0.016
crow-0.015
 doctors-0.015
 understand-0.015
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.002
years-0.002
chie-0.002
jamin-0.001
Token "
Token ID366
Feature activation+2.58e-01

Pos logit contributions
 idea+0.001
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
TokenMy
Token ID3666
Feature activation-4.32e-03

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.009
chie+0.009
 hips+0.008
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.008
 doctors-0.008
 cows-0.008
Token toes
Token ID23932
Feature activation+2.81e+00

Pos logit contributions
 idea+0.000
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
 hips-0.000
Token are
Token ID389
Feature activation+8.75e-01

Pos logit contributions
ifying+0.137
years+0.128
igsaw+0.127
jamin+0.110
chie+0.110
Neg logit contributions
 idea-0.094
 understand-0.068
 bal-0.067
 people-0.067
 thing-0.065
Token better
Token ID1365
Feature activation+2.07e-01

Pos logit contributions
ifying+0.040
igsaw+0.034
years+0.032
chie+0.031
 hips+0.029
Neg logit contributions
 idea-0.033
 bal-0.027
 people-0.026
 doctors-0.026
 cows-0.025
Token now
Token ID783
Feature activation+2.48e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token than
Token ID621
Feature activation+1.63e+00

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.006
Token before
Token ID878
Feature activation-2.24e-01

Pos logit contributions
ifying+0.077
igsaw+0.064
years+0.062
jamin+0.060
chie+0.059
Neg logit contributions
 idea-0.056
 people-0.050
 doctors-0.048
 bal-0.047
 time-0.046
Token does
Token ID857
Feature activation+6.50e-01

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.017
chie+0.017
jamin+0.016
Neg logit contributions
 idea-0.013
 bal-0.010
crow-0.010
 doctors-0.009
 people-0.009
Token.
Token ID13
Feature activation+5.02e-02

Pos logit contributions
ifying+0.033
years+0.028
igsaw+0.028
chie+0.027
jamin+0.026
Neg logit contributions
 idea-0.022
 people-0.017
 doctors-0.017
 understand-0.017
 bal-0.016
Token She
Token ID1375
Feature activation-4.00e-02

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.001
 cows-0.001
Token learned
Token ID4499
Feature activation+7.70e-01

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token that
Token ID326
Feature activation-1.30e+00

Pos logit contributions
ifying+0.044
igsaw+0.037
chie+0.037
years+0.036
jamin+0.034
Neg logit contributions
 idea-0.022
 doctors-0.017
 people-0.017
 bal-0.016
 understand-0.016
Token birth
Token ID4082
Feature activation+2.80e+00

Pos logit contributions
 idea+0.050
crow+0.044
 bal+0.041
aw+0.039
 paramedics+0.039
Neg logit contributions
ifying-0.049
years-0.043
 seeking-0.042
igsaw-0.042
 hips-0.038
Tokendays
Token ID12545
Feature activation+1.02e+00

Pos logit contributions
ifying+0.113
igsaw+0.105
 hips+0.096
 hind+0.095
chie+0.094
Neg logit contributions
 idea-0.104
 if-0.096
 understand-0.095
 whenever-0.091
 about-0.088
Token are
Token ID389
Feature activation+9.57e-02

Pos logit contributions
ifying+0.050
igsaw+0.042
years+0.042
chie+0.040
jamin+0.037
Neg logit contributions
 idea-0.036
 people-0.029
crow-0.028
 doctors-0.028
 bal-0.028
Token not
Token ID407
Feature activation-2.28e-01

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.003
crow-0.002
 doctors-0.002
 cows-0.002
Token about
Token ID546
Feature activation+9.22e-01

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.012
igsaw-0.010
chie-0.009
years-0.009
jamin-0.009
Token cakes
Token ID30849
Feature activation+2.69e-01

Pos logit contributions
ifying+0.046
igsaw+0.038
years+0.038
chie+0.036
jamin+0.034
Neg logit contributions
 idea-0.032
 people-0.026
 doctors-0.026
 cows-0.025
 bal-0.025
Token.
Token ID13
Feature activation-9.44e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token Every
Token ID3887
Feature activation-5.03e-01

Pos logit contributions
 idea+0.037
 bal+0.032
 doctors+0.030
crow+0.030
cut+0.030
Neg logit contributions
ifying-0.040
igsaw-0.032
jamin-0.031
years-0.030
 hips-0.029
Token day
Token ID1110
Feature activation-1.17e+00

Pos logit contributions
 idea+0.007
 bal+0.005
 doctors+0.005
crow+0.005
ighters+0.005
Neg logit contributions
ifying-0.032
igsaw-0.029
years-0.027
jamin-0.027
chie-0.027
Token,
Token ID11
Feature activation-6.24e-01

Pos logit contributions
 idea+0.049
cut+0.038
 understand+0.037
 paramedics+0.037
crow+0.037
Neg logit contributions
ifying-0.051
years-0.038
 eight-0.035
igsaw-0.035
 hips-0.034
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
 idea+0.027
 bal+0.022
crow+0.022
 understand+0.021
 doctors+0.021
Neg logit contributions
ifying-0.025
years-0.020
igsaw-0.019
chie-0.018
 hips-0.017
Token family
Token ID1641
Feature activation-3.41e+00

Pos logit contributions
 idea+0.042
ook+0.038
ighters+0.038
crow+0.037
ying+0.037
Neg logit contributions
ifying-0.058
chie-0.044
-0.044
jamin-0.043
years-0.043
Token would
Token ID561
Feature activation-8.61e-01

Pos logit contributions
 idea+0.169
 impression+0.154
 decisions+0.150
ainer+0.149
eeper+0.148
Neg logit contributions
ll-0.074
o-0.062
m-0.061
ifying-0.059
 looked-0.057
Token give
Token ID1577
Feature activation-6.68e-01

Pos logit contributions
 idea+0.029
crow+0.025
 bal+0.025
 doctors+0.024
ighters+0.023
Neg logit contributions
ifying-0.040
igsaw-0.032
chie-0.032
years-0.031
jamin-0.031
Token Strip
Token ID18508
Feature activation+8.87e-01

Pos logit contributions
 idea+0.027
 bal+0.022
crow+0.021
cut+0.021
 doctors+0.020
Neg logit contributions
ifying-0.028
years-0.024
igsaw-0.022
chie-0.021
jamin-0.021
Token lots
Token ID6041
Feature activation+4.73e-03

Pos logit contributions
ifying+0.034
igsaw+0.027
years+0.025
chie+0.025
 hips+0.023
Neg logit contributions
 idea-0.039
 bal-0.035
 cows-0.035
crow-0.034
 people-0.034
Token of
Token ID286
Feature activation-7.96e-01

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token.
Token ID13
Feature activation-5.45e-01

Pos logit contributions
 idea+0.023
 bal+0.020
crow+0.019
 understand+0.019
 doctors+0.018
Neg logit contributions
ifying-0.031
years-0.024
igsaw-0.024
chie-0.023
 seeking-0.023
Token Every
Token ID3887
Feature activation-7.58e-01

Pos logit contributions
 idea+0.021
 bal+0.019
 doctors+0.018
cut+0.018
crow+0.018
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.017
jamin-0.017
 hips-0.017
Token day
Token ID1110
Feature activation-1.38e+00

Pos logit contributions
 idea+0.012
ighters+0.010
 bal+0.010
 paramedics+0.009
crow+0.009
Neg logit contributions
ifying-0.045
igsaw-0.041
chie-0.038
years-0.037
-0.037
Token,
Token ID11
Feature activation-6.80e-01

Pos logit contributions
 idea+0.064
cut+0.058
 impression+0.056
 paramedics+0.055
 understand+0.054
Neg logit contributions
ifying-0.049
 eight-0.033
years-0.032
igsaw-0.030
-0.029
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
 idea+0.026
 bal+0.022
crow+0.022
cut+0.021
 paramedics+0.020
Neg logit contributions
ifying-0.030
years-0.023
igsaw-0.023
chie-0.022
 hips-0.021
Token mail
Token ID6920
Feature activation-3.35e+00

Pos logit contributions
 bal+0.049
crow+0.049
 idea+0.046
ook+0.042
ighters+0.042
Neg logit contributions
ifying-0.074
 total-0.059
igsaw-0.059
-0.059
chie-0.058
Tokenman
Token ID805
Feature activation-1.25e+00

Pos logit contributions
ubs+0.147
 mischief+0.146
dd+0.136
 impression+0.135
 exception+0.130
Neg logit contributions
ifying-0.079
work-0.079
 looked-0.073
 Claus-0.066
 had-0.064
Token would
Token ID561
Feature activation-1.20e+00

Pos logit contributions
 idea+0.049
 bal+0.042
crow+0.041
 paramedics+0.040
 doctors+0.038
Neg logit contributions
ifying-0.051
years-0.041
chie-0.037
 seeking-0.036
jamin-0.036
Token put
Token ID1234
Feature activation-1.05e+00

Pos logit contributions
 idea+0.044
crow+0.043
 bal+0.042
 doctors+0.039
 cows+0.038
Neg logit contributions
ifying-0.048
chie-0.038
igsaw-0.037
 hips-0.037
years-0.036
Token letters
Token ID7475
Feature activation-8.26e-01

Pos logit contributions
 idea+0.039
 bal+0.035
crow+0.033
 understand+0.032
 paramedics+0.032
Neg logit contributions
ifying-0.047
years-0.036
igsaw-0.036
chie-0.034
 hips-0.033
Token inside
Token ID2641
Feature activation-4.71e-01

Pos logit contributions
 idea+0.033
crow+0.029
 bal+0.028
 doctors+0.027
 paramedics+0.027
Neg logit contributions
ifying-0.037
years-0.027
igsaw-0.027
chie-0.027
 hips-0.026
Token So
Token ID1406
Feature activation-7.48e-01

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.006
jamin-0.006
Token every
Token ID790
Feature activation-9.14e-01

Pos logit contributions
 idea+0.028
ighters+0.025
crow+0.025
 bal+0.024
 paramedics+0.024
Neg logit contributions
ifying-0.031
igsaw-0.025
years-0.024
 hips-0.023
chie-0.022
Token evening
Token ID6180
Feature activation-2.36e+00

Pos logit contributions
 idea+0.016
ighters+0.013
crow+0.012
 paramedics+0.012
 bal+0.012
Neg logit contributions
ifying-0.055
igsaw-0.050
years-0.045
jamin-0.045
run-0.044
Token,
Token ID11
Feature activation-9.11e-01

Pos logit contributions
 idea+0.113
ighters+0.101
adders+0.095
 itch+0.094
its+0.094
Neg logit contributions
ifying-0.070
igsaw-0.046
 pawn-0.045
 bit-0.045
va-0.044
Token the
Token ID262
Feature activation-7.03e-01

Pos logit contributions
 idea+0.038
crow+0.032
 bal+0.031
ook+0.030
ying+0.030
Neg logit contributions
ifying-0.036
years-0.028
igsaw-0.027
chie-0.025
 hips-0.024
Token lawyer
Token ID6853
Feature activation-3.24e+00

Pos logit contributions
 idea+0.025
crow+0.022
 bal+0.021
ook+0.019
ighters+0.019
Neg logit contributions
ifying-0.031
years-0.027
igsaw-0.027
chie-0.025
-0.023
Token would
Token ID561
Feature activation-6.74e-01

Pos logit contributions
 idea+0.138
ubs+0.125
crow+0.120
 paramedics+0.116
 impression+0.115
Neg logit contributions
ifying-0.095
jamin-0.080
 bit-0.077
ll-0.074
 Claus-0.074
Token go
Token ID467
Feature activation+7.78e-01

Pos logit contributions
 idea+0.022
crow+0.020
 bal+0.019
 doctors+0.018
 cows+0.018
Neg logit contributions
ifying-0.032
igsaw-0.026
chie-0.025
years-0.025
 hips-0.024
Token to
Token ID284
Feature activation+1.21e-01

Pos logit contributions
ifying+0.043
igsaw+0.036
years+0.035
chie+0.035
jamin+0.032
Neg logit contributions
 idea-0.024
 doctors-0.019
 people-0.018
 hospital-0.018
 bal-0.018
Token his
Token ID465
Feature activation+3.16e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token big
Token ID1263
Feature activation-4.30e-01

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.015
chie+0.015
jamin+0.014
Neg logit contributions
 idea-0.008
 bal-0.006
 doctors-0.006
crow-0.006
 cows-0.006
TokenL
Token ID43
Feature activation+3.57e-01

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Tokenily
Token ID813
Feature activation+1.02e-01

Pos logit contributions
ifying+0.023
igsaw+0.020
years+0.019
chie+0.019
 hips+0.019
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.004
 people-0.004
Token decided
Token ID3066
Feature activation-5.85e-01

Pos logit contributions
ifying+0.005
igsaw+0.004
years+0.004
chie+0.004
jamin+0.003
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token to
Token ID284
Feature activation-9.98e-01

Pos logit contributions
 idea+0.028
crow+0.024
 bal+0.024
cut+0.023
ighters+0.023
Neg logit contributions
ifying-0.019
igsaw-0.014
years-0.014
 hips-0.014
chie-0.014
Token keep
Token ID1394
Feature activation-2.28e-01

Pos logit contributions
 idea+0.035
 bal+0.032
crow+0.031
 doctors+0.030
 cows+0.030
Neg logit contributions
ifying-0.042
igsaw-0.035
chie-0.033
 hips-0.032
jamin-0.032
Token an
Token ID281
Feature activation-3.18e+00

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 understand+0.006
Neg logit contributions
ifying-0.011
years-0.009
igsaw-0.009
chie-0.008
 hips-0.008
Token eye
Token ID4151
Feature activation-2.27e+00

Pos logit contributions
 bal+0.091
car+0.082
crow+0.080
 People+0.080
 matters+0.078
Neg logit contributions
chie-0.115
igsaw-0.114
years-0.112
ifying-0.111
oured-0.099
Token on
Token ID319
Feature activation-5.90e-01

Pos logit contributions
 idea+0.084
 People+0.072
 rarely+0.069
 countryside+0.069
 Vol+0.069
Neg logit contributions
ifying-0.103
 hips-0.071
chie-0.068
igsaw-0.068
years-0.068
Token the
Token ID262
Feature activation-1.52e+00

Pos logit contributions
 idea+0.021
 bal+0.018
crow+0.017
 understand+0.017
 pilot+0.016
Neg logit contributions
ifying-0.027
igsaw-0.022
years-0.021
 hips-0.021
chie-0.020
Token field
Token ID2214
Feature activation-4.79e-01

Pos logit contributions
 bal+0.037
 idea+0.033
crow+0.030
 clim+0.029
car+0.026
Neg logit contributions
ifying-0.082
chie-0.068
 hips-0.066
 freshly-0.065
-0.062
Token to
Token ID284
Feature activation-1.89e-01

Pos logit contributions
 idea+0.016
 bal+0.013
crow+0.013
 doctors+0.013
 cows+0.012
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.019
chie-0.018
 hips-0.018
Token.
Token ID13
Feature activation-9.63e-01

Pos logit contributions
 idea+0.008
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
 hips-0.009
Token Every
Token ID3887
Feature activation-5.65e-01

Pos logit contributions
 idea+0.044
 bal+0.040
 doctors+0.039
crow+0.038
 infection+0.037
Neg logit contributions
ifying-0.032
igsaw-0.026
-0.024
jamin-0.023
 hips-0.023
Token day
Token ID1110
Feature activation-1.64e+00

Pos logit contributions
 idea+0.009
 bal+0.007
crow+0.006
ighters+0.006
 doctors+0.006
Neg logit contributions
ifying-0.036
igsaw-0.032
years-0.030
chie-0.030
jamin-0.030
Token,
Token ID11
Feature activation-7.32e-01

Pos logit contributions
 idea+0.082
ighters+0.073
adders+0.069
 impression+0.069
 paramedics+0.069
Neg logit contributions
ifying-0.052
 bit-0.035
 eight-0.034
-0.031
years-0.029
Token the
Token ID262
Feature activation-9.49e-01

Pos logit contributions
 idea+0.030
crow+0.026
 bal+0.026
 understand+0.024
ying+0.023
Neg logit contributions
ifying-0.030
years-0.023
igsaw-0.022
chie-0.021
-0.021
Token mail
Token ID6920
Feature activation-3.18e+00

Pos logit contributions
 idea+0.029
ighters+0.026
crow+0.025
ook+0.025
 bal+0.024
Neg logit contributions
ifying-0.043
igsaw-0.036
-0.035
years-0.035
 seeking-0.034
Tokenman
Token ID805
Feature activation-7.15e-01

Pos logit contributions
ubs+0.147
 mischief+0.141
 idea+0.138
ear+0.134
ighters+0.134
Neg logit contributions
ifying-0.072
work-0.069
 Claus-0.064
fly-0.063
 recorder-0.051
Token would
Token ID561
Feature activation-5.12e-01

Pos logit contributions
 idea+0.026
 bal+0.021
crow+0.021
 doctors+0.020
 paramedics+0.019
Neg logit contributions
ifying-0.033
years-0.026
chie-0.026
igsaw-0.025
jamin-0.024
Token leave
Token ID2666
Feature activation-7.96e-01

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.014
 cows+0.014
Neg logit contributions
ifying-0.024
igsaw-0.019
chie-0.019
years-0.019
 hips-0.018
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
 idea+0.030
 bal+0.026
crow+0.024
 doctors+0.022
 paramedics+0.022
Neg logit contributions
ifying-0.036
years-0.030
igsaw-0.028
chie-0.027
 hips-0.027
Token mail
Token ID6920
Feature activation-1.78e+00

Pos logit contributions
 idea+0.039
ighters+0.033
crow+0.033
 bal+0.031
ying+0.031
Neg logit contributions
ifying-0.052
igsaw-0.042
years-0.041
 seeking-0.041
jamin-0.040
Token inside
Token ID2641
Feature activation-3.74e-01

Pos logit contributions
 idea+0.036
 bal+0.034
crow+0.033
 paramedics+0.032
 understand+0.031
Neg logit contributions
ifying-0.049
 seeking-0.042
igsaw-0.041
chie-0.041
years-0.039
Token it
Token ID340
Feature activation+2.49e-01

Pos logit contributions
 idea+0.013
 bal+0.011
crow+0.010
 doctors+0.010
 pilot+0.010
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.015
chie-0.014
 hips-0.014
Token,
Token ID11
Feature activation-6.10e-01

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 cows-0.005
Token so
Token ID523
Feature activation-6.81e-01

Pos logit contributions
 idea+0.028
crow+0.025
 bal+0.025
 doctors+0.024
cut+0.024
Neg logit contributions
ifying-0.021
years-0.016
 hips-0.015
igsaw-0.015
chie-0.015
Token she
Token ID673
Feature activation+3.98e-02

Pos logit contributions
 idea+0.027
crow+0.024
cut+0.022
 bal+0.021
 paramedics+0.021
Neg logit contributions
ifying-0.029
years-0.024
igsaw-0.022
 hips-0.021
chie-0.020
Token let
Token ID1309
Feature activation-3.14e+00

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token herself
Token ID5223
Feature activation-1.86e-01

Pos logit contributions
cut+0.131
ooks+0.129
 bal+0.123
 impression+0.122
aw+0.120
Neg logit contributions
 front-0.069
ifying-0.068
 looked-0.060
 Rose-0.059
chie-0.058
Token in
Token ID287
Feature activation+4.35e-01

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.006
chie-0.006
 hips-0.006
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
ifying+0.024
igsaw+0.020
years+0.020
chie+0.020
jamin+0.018
Neg logit contributions
 idea-0.013
crow-0.010
 bal-0.010
 doctors-0.010
 understand-0.009
Token She
Token ID1375
Feature activation+1.79e-01

Pos logit contributions
 idea+0.015
 doctors+0.012
 bal+0.012
crow+0.012
 pilot+0.012
Neg logit contributions
ifying-0.015
igsaw-0.013
years-0.012
chie-0.012
 hips-0.011
Token stepped
Token ID10764
Feature activation+2.41e-01

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token They
Token ID1119
Feature activation-1.38e-02

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.006
 doctors+0.006
cut+0.006
Neg logit contributions
ifying-0.009
igsaw-0.007
years-0.007
chie-0.007
jamin-0.006
Token found
Token ID1043
Feature activation-6.13e-01

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
years-0.001
igsaw-0.001
chie-0.001
jamin-0.001
Token a
Token ID257
Feature activation-6.99e-02

Pos logit contributions
 idea+0.027
 bal+0.024
crow+0.023
cut+0.022
 clim+0.022
Neg logit contributions
ifying-0.024
years-0.018
igsaw-0.017
chie-0.016
jamin-0.015
Token hotel
Token ID7541
Feature activation+8.25e-02

Pos logit contributions
 idea+0.003
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.002
jamin-0.002
Token and
Token ID290
Feature activation+1.69e-01

Pos logit contributions
ifying+0.004
igsaw+0.004
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token let
Token ID1309
Feature activation-3.13e+00

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token themselves
Token ID2405
Feature activation-2.57e-01

Pos logit contributions
cut+0.136
 bal+0.122
 idea+0.113
opl+0.110
 clim+0.108
Neg logit contributions
ifying-0.097
 front-0.070
chie-0.069
 looks-0.068
 Alex-0.065
Token into
Token ID656
Feature activation+4.84e-01

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.005
 doctors+0.005
cut+0.005
Neg logit contributions
ifying-0.014
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.020
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.016
 doctors-0.012
 people-0.012
crow-0.012
 bal-0.012
Token lobby
Token ID10866
Feature activation-1.92e-01

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.011
 understand+0.011
cut+0.011
Neg logit contributions
ifying-0.020
igsaw-0.016
years-0.015
chie-0.015
jamin-0.014
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.004
Neg logit contributions
ifying-0.010
igsaw-0.008
years-0.008
chie-0.008
 hips-0.008
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
 idea+0.009
crow+0.008
 bal+0.008
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token Every
Token ID3887
Feature activation-7.64e-01

Pos logit contributions
 idea+0.011
 bal+0.009
crow+0.009
 doctors+0.009
 pilot+0.008
Neg logit contributions
ifying-0.013
igsaw-0.011
years-0.011
chie-0.010
 hips-0.010
Token day
Token ID1110
Feature activation-1.34e+00

Pos logit contributions
 idea+0.012
 bal+0.010
 paramedics+0.010
crow+0.010
ighters+0.010
Neg logit contributions
ifying-0.046
igsaw-0.041
years-0.038
jamin-0.038
chie-0.037
Token,
Token ID11
Feature activation-5.15e-01

Pos logit contributions
 idea+0.061
cut+0.054
 impression+0.053
 understand+0.052
 paramedics+0.052
Neg logit contributions
ifying-0.050
 eight-0.033
years-0.032
 total-0.030
igsaw-0.030
Token the
Token ID262
Feature activation-9.88e-01

Pos logit contributions
 idea+0.019
 bal+0.016
crow+0.016
 understand+0.015
cut+0.014
Neg logit contributions
ifying-0.024
years-0.019
igsaw-0.019
chie-0.018
 hips-0.017
Token clock
Token ID8801
Feature activation-3.13e+00

Pos logit contributions
 idea+0.032
crow+0.031
 bal+0.028
ook+0.027
ighters+0.025
Neg logit contributions
ifying-0.045
igsaw-0.036
years-0.036
 hips-0.035
chie-0.034
Token went
Token ID1816
Feature activation+2.78e-01

Pos logit contributions
 idea+0.135
ubs+0.132
ooks+0.126
ook+0.123
 impression+0.123
Neg logit contributions
ifying-0.083
work-0.070
 bit-0.069
 looked-0.066
fly-0.064
Token "
Token ID366
Feature activation-3.53e-01

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.009
 bal-0.007
 doctors-0.007
crow-0.007
 cows-0.007
Tokentick
Token ID42298
Feature activation-5.52e-01

Pos logit contributions
 idea+0.014
crow+0.011
 bal+0.011
 doctors+0.011
 cows+0.011
Neg logit contributions
ifying-0.016
years-0.013
igsaw-0.013
chie-0.012
jamin-0.011
Token-
Token ID12
Feature activation-3.94e-01

Pos logit contributions
 idea+0.018
 bal+0.015
 doctors+0.015
 paramedics+0.014
crow+0.014
Neg logit contributions
ifying-0.026
igsaw-0.023
years-0.022
chie-0.021
jamin-0.020
Tokent
Token ID83
Feature activation-3.98e-01

Pos logit contributions
 idea+0.019
 bal+0.016
 doctors+0.016
crow+0.016
 cows+0.016
Neg logit contributions
ifying-0.014
igsaw-0.011
years-0.011
chie-0.010
run-0.009
Token small
Token ID1402
Feature activation+1.07e-01

Pos logit contributions
 idea+0.002
crow+0.002
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.003
years-0.003
chie-0.003
jamin-0.002
Token.
Token ID13
Feature activation-4.86e-01

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token Every
Token ID3887
Feature activation-7.37e-01

Pos logit contributions
 idea+0.019
 bal+0.016
 doctors+0.015
crow+0.015
cut+0.015
Neg logit contributions
ifying-0.021
igsaw-0.018
years-0.017
chie-0.016
jamin-0.016
Token day
Token ID1110
Feature activation-1.64e+00

Pos logit contributions
 idea+0.012
ighters+0.009
 doctors+0.009
 paramedics+0.009
 bal+0.009
Neg logit contributions
ifying-0.045
igsaw-0.040
jamin-0.038
chie-0.038
years-0.037
Token the
Token ID262
Feature activation-3.52e-01

Pos logit contributions
 idea+0.077
 impression+0.067
cut+0.065
ighters+0.065
 paramedics+0.064
Neg logit contributions
ifying-0.059
 eight-0.040
years-0.039
 total-0.035
 Alex-0.034
Token piece
Token ID3704
Feature activation-3.12e+00

Pos logit contributions
 idea+0.012
crow+0.011
 bal+0.010
 paramedics+0.010
 doctors+0.010
Neg logit contributions
ifying-0.016
igsaw-0.014
years-0.013
chie-0.013
 hips-0.012
Token went
Token ID1816
Feature activation+2.06e-02

Pos logit contributions
 idea+0.111
ubs+0.104
 Vol+0.102
 paramedics+0.102
 impression+0.102
Neg logit contributions
ifying-0.096
 nowhere-0.088
ll-0.082
 seeking-0.079
years-0.078
Token on
Token ID319
Feature activation-2.16e-02

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token adventures
Token ID17545
Feature activation-3.10e-01

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 idea+0.010
 bal+0.009
crow+0.009
 doctors+0.008
 paramedics+0.008
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.012
chie-0.012
 hips-0.011
Token looking
Token ID2045
Feature activation-8.05e-01

Pos logit contributions
 idea+0.042
crow+0.037
 bal+0.036
 paramedics+0.034
 doctors+0.034
Neg logit contributions
ifying-0.038
years-0.032
 seeking-0.030
 hips-0.029
chie-0.029
Token From
Token ID3574
Feature activation-3.90e-01

Pos logit contributions
 idea+0.022
crow+0.018
 bal+0.018
 doctors+0.018
 pilot+0.018
Neg logit contributions
ifying-0.021
igsaw-0.017
years-0.017
run-0.015
jamin-0.015
Token then
Token ID788
Feature activation-9.31e-01

Pos logit contributions
 idea+0.016
 bal+0.013
crow+0.012
 understand+0.012
 cows+0.012
Neg logit contributions
ifying-0.017
years-0.013
igsaw-0.013
chie-0.012
jamin-0.012
Token on
Token ID319
Feature activation-1.69e-01

Pos logit contributions
 idea+0.037
 cows+0.030
 doctors+0.030
crow+0.029
 paramedics+0.029
Neg logit contributions
ifying-0.040
years-0.032
igsaw-0.031
chie-0.029
run-0.028
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 idea+0.006
 bal+0.004
crow+0.004
 doctors+0.004
 understand+0.004
Neg logit contributions
ifying-0.008
igsaw-0.007
years-0.007
chie-0.007
 hips-0.007
Token Tim
Token ID5045
Feature activation-2.94e-01

Pos logit contributions
 idea+0.047
crow+0.044
 pilot+0.040
ighters+0.040
 paramedics+0.039
Neg logit contributions
ifying-0.039
years-0.032
igsaw-0.032
-0.032
ll-0.029
Tokenmy
Token ID1820
Feature activation-3.09e+00

Pos logit contributions
 idea+0.015
 bal+0.014
crow+0.014
 cows+0.013
 doctors+0.013
Neg logit contributions
ifying-0.008
years-0.006
igsaw-0.006
chie-0.006
jamin-0.006
Token never
Token ID1239
Feature activation+5.52e-01

Pos logit contributions
 idea+0.127
 paramedics+0.115
crow+0.108
 impression+0.108
 hurricane+0.107
Neg logit contributions
ll-0.093
years-0.087
jamin-0.081
ifying-0.078
 seeking-0.076
Token played
Token ID2826
Feature activation-9.17e-01

Pos logit contributions
ifying+0.027
igsaw+0.023
years+0.021
chie+0.021
 hips+0.020
Neg logit contributions
 idea-0.020
 bal-0.017
 doctors-0.016
crow-0.016
 understand-0.016
Token with
Token ID351
Feature activation-2.68e-02

Pos logit contributions
 idea+0.032
crow+0.028
 bal+0.027
cut+0.027
 ostr+0.026
Neg logit contributions
ifying-0.039
igsaw-0.033
chie-0.031
years-0.031
 seeking-0.031
Token ropes
Token ID31001
Feature activation-6.48e-01

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token again
Token ID757
Feature activation+4.99e-01

Pos logit contributions
 idea+0.027
crow+0.022
 cows+0.022
 doctors+0.022
 bal+0.021
Neg logit contributions
ifying-0.027
igsaw-0.022
years-0.021
chie-0.020
 hips-0.019
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
 idea+0.031
crow+0.025
 doctors+0.024
 paramedics+0.024
 bal+0.023
Neg logit contributions
ifying-0.040
igsaw-0.030
years-0.030
chie-0.028
run-0.027
Token Every
Token ID3887
Feature activation-5.24e-01

Pos logit contributions
 idea+0.029
 bal+0.026
crow+0.025
cut+0.024
 doctors+0.024
Neg logit contributions
ifying-0.034
igsaw-0.030
years-0.027
 hips-0.027
jamin-0.026
Token day
Token ID1110
Feature activation-1.41e+00

Pos logit contributions
 idea+0.008
crow+0.006
 bal+0.005
 doctors+0.005
 paramedics+0.005
Neg logit contributions
ifying-0.033
igsaw-0.030
years-0.028
chie-0.028
jamin-0.028
Token,
Token ID11
Feature activation-6.55e-01

Pos logit contributions
 idea+0.067
cut+0.058
 impression+0.058
 paramedics+0.056
 understand+0.056
Neg logit contributions
ifying-0.051
 eight-0.035
years-0.035
igsaw-0.033
 total-0.030
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
 idea+0.022
crow+0.019
 bal+0.019
cut+0.017
 understand+0.017
Neg logit contributions
ifying-0.031
years-0.025
igsaw-0.025
chie-0.024
 hips-0.023
Token clock
Token ID8801
Feature activation-3.08e+00

Pos logit contributions
crow+0.051
 idea+0.049
 bal+0.046
ook+0.044
ger+0.041
Neg logit contributions
ifying-0.062
igsaw-0.049
years-0.049
 hips-0.048
chie-0.046
Token would
Token ID561
Feature activation-1.24e+00

Pos logit contributions
 idea+0.139
ubs+0.137
ooks+0.131
 impression+0.129
ook+0.129
Neg logit contributions
ifying-0.076
 bit-0.065
work-0.062
 looked-0.061
fly-0.057
Token tick
Token ID4378
Feature activation-2.45e-01

Pos logit contributions
 idea+0.044
crow+0.041
 bal+0.038
 doctors+0.038
 cows+0.037
Neg logit contributions
ifying-0.053
chie-0.042
 hips-0.041
igsaw-0.040
years-0.039
Token-
Token ID12
Feature activation-2.48e-01

Pos logit contributions
 idea+0.008
 bal+0.006
crow+0.006
 doctors+0.006
 paramedics+0.006
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.010
jamin-0.009
Tokent
Token ID83
Feature activation-6.83e-01

Pos logit contributions
 idea+0.010
 bal+0.008
 doctors+0.008
crow+0.008
 cows+0.007
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Tokenock
Token ID735
Feature activation-3.16e-01

Pos logit contributions
 idea+0.025
 bal+0.019
 doctors+0.018
crow+0.018
 paramedics+0.018
Neg logit contributions
ifying-0.031
years-0.026
igsaw-0.026
chie-0.026
jamin-0.023
Token she
Token ID673
Feature activation-9.47e-01

Pos logit contributions
 idea+0.078
ans+0.073
crow+0.073
 paramedics+0.072
cut+0.071
Neg logit contributions
ifying-0.044
years-0.040
 seeking-0.036
 nowhere-0.036
 Rose-0.032
Token didn
Token ID1422
Feature activation+2.98e-01

Pos logit contributions
 idea+0.036
crow+0.031
 bal+0.028
 emergency+0.028
 cows+0.028
Neg logit contributions
ifying-0.040
years-0.034
jamin-0.032
chie-0.031
 hips-0.031
Token't
Token ID470
Feature activation-1.01e+00

Pos logit contributions
ifying+0.011
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.014
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.012
Token want
Token ID765
Feature activation-1.03e+00

Pos logit contributions
 idea+0.031
 cows+0.027
 paramedics+0.026
cut+0.026
crow+0.025
Neg logit contributions
ifying-0.049
chie-0.041
igsaw-0.039
years-0.038
 hips-0.036
Token to
Token ID284
Feature activation-5.89e-01

Pos logit contributions
 idea+0.045
cut+0.041
 paramedics+0.038
 pilot+0.038
 bal+0.037
Neg logit contributions
ifying-0.039
years-0.031
igsaw-0.030
 seeking-0.028
 hips-0.028
Token let
Token ID1309
Feature activation-3.07e+00

Pos logit contributions
 idea+0.022
crow+0.019
 cows+0.019
cut+0.019
 bal+0.019
Neg logit contributions
ifying-0.026
igsaw-0.020
chie-0.020
years-0.019
 hips-0.019
Token it
Token ID340
Feature activation-2.23e-01

Pos logit contributions
cut+0.143
ooks+0.122
aver+0.120
 paramedics+0.119
 idea+0.116
Neg logit contributions
 Rose-0.083
ifying-0.074
 sigh-0.066
 Emily-0.066
 Ruby-0.065
Token go
Token ID467
Feature activation+1.64e-01

Pos logit contributions
 idea+0.007
crow+0.005
 doctors+0.005
 bal+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.010
chie-0.009
years-0.009
 hips-0.009
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token She
Token ID1375
Feature activation-8.38e-01

Pos logit contributions
 idea+0.029
cut+0.026
crow+0.026
 bal+0.025
 pilot+0.025
Neg logit contributions
ifying-0.030
igsaw-0.026
years-0.025
 hips-0.024
jamin-0.023
Token held
Token ID2714
Feature activation-6.47e-01

Pos logit contributions
 idea+0.034
crow+0.030
 bal+0.029
cut+0.028
 emergency+0.027
Neg logit contributions
ifying-0.031
years-0.028
jamin-0.025
 hips-0.025
chie-0.024
Token\n
Token ID198
Feature activation-6.04e-01

Pos logit contributions
 idea+0.029
 bal+0.026
 doctors+0.025
crow+0.025
 paramedics+0.024
Neg logit contributions
ifying-0.029
igsaw-0.024
years-0.024
chie-0.022
jamin-0.022
TokenEvery
Token ID6109
Feature activation-3.19e-01

Pos logit contributions
 idea+0.030
 bal+0.026
crow+0.025
 doctors+0.025
 paramedics+0.024
Neg logit contributions
ifying-0.020
years-0.016
igsaw-0.015
chie-0.014
jamin-0.014
Token day
Token ID1110
Feature activation-1.28e+00

Pos logit contributions
 idea+0.005
 bal+0.003
crow+0.003
 doctors+0.003
cut+0.002
Neg logit contributions
ifying-0.021
igsaw-0.019
years-0.018
chie-0.018
jamin-0.018
Token,
Token ID11
Feature activation+1.67e-01

Pos logit contributions
 idea+0.056
ighters+0.047
cut+0.047
 itch+0.046
 understand+0.046
Neg logit contributions
ifying-0.050
years-0.034
igsaw-0.033
 eight-0.032
 Alex-0.031
Token the
Token ID262
Feature activation-6.01e-01

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token painter
Token ID34537
Feature activation-3.06e+00

Pos logit contributions
 idea+0.016
crow+0.014
 bal+0.014
 cows+0.013
 paramedics+0.012
Neg logit contributions
ifying-0.030
igsaw-0.027
years-0.027
chie-0.026
 hips-0.024
Token liked
Token ID8288
Feature activation-2.56e-01

Pos logit contributions
 idea+0.135
 paramedics+0.120
ubs+0.119
 impression+0.118
 People+0.117
Neg logit contributions
ifying-0.082
years-0.070
jamin-0.070
 bit-0.069
ll-0.068
Token to
Token ID284
Feature activation-6.49e-01

Pos logit contributions
 idea+0.011
crow+0.010
 bal+0.009
 understand+0.009
 doctors+0.009
Neg logit contributions
ifying-0.010
igsaw-0.008
years-0.008
chie-0.007
 hips-0.007
Token play
Token ID711
Feature activation-7.80e-01

Pos logit contributions
 idea+0.022
crow+0.020
 bal+0.020
 cows+0.019
 doctors+0.018
Neg logit contributions
ifying-0.029
igsaw-0.025
chie-0.024
years-0.023
 hips-0.023
Token with
Token ID351
Feature activation-4.21e-01

Pos logit contributions
 idea+0.027
crow+0.025
 bal+0.025
 paramedics+0.024
 cows+0.023
Neg logit contributions
ifying-0.032
igsaw-0.028
years-0.026
chie-0.025
 seeking-0.025
Token children
Token ID1751
Feature activation-2.75e-01

Pos logit contributions
 idea+0.016
crow+0.013
 bal+0.013
 understand+0.012
 doctors+0.012
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
chie-0.015
 hips-0.014
Token It
Token ID632
Feature activation-3.41e-01

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.012
 doctors+0.012
cut+0.012
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.014
chie-0.014
jamin-0.014
Token made
Token ID925
Feature activation-1.46e+00

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
 doctors+0.012
 cows+0.012
Neg logit contributions
ifying-0.013
years-0.010
chie-0.010
igsaw-0.010
jamin-0.009
Token loud
Token ID7812
Feature activation-4.31e-01

Pos logit contributions
 idea+0.062
crow+0.060
 bal+0.060
cut+0.057
 bidder+0.057
Neg logit contributions
ifying-0.044
years-0.039
igsaw-0.034
chie-0.034
 eight-0.031
Token noises
Token ID26782
Feature activation-1.31e+00

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.012
 doctors+0.011
 understand+0.011
Neg logit contributions
ifying-0.020
igsaw-0.017
years-0.016
chie-0.016
 hips-0.015
Token and
Token ID290
Feature activation-5.04e-01

Pos logit contributions
 idea+0.044
crow+0.038
 paramedics+0.037
 bidder+0.036
 bal+0.036
Neg logit contributions
ifying-0.058
years-0.048
 seeking-0.044
chie-0.043
run-0.042
Token let
Token ID1309
Feature activation-3.06e+00

Pos logit contributions
 idea+0.018
 bal+0.016
crow+0.015
 doctors+0.014
 paramedics+0.014
Neg logit contributions
ifying-0.022
years-0.018
igsaw-0.018
chie-0.018
 hips-0.017
Token out
Token ID503
Feature activation+6.01e-01

Pos logit contributions
cut+0.110
ooks+0.107
 idea+0.107
 bal+0.107
 impression+0.104
Neg logit contributions
ifying-0.095
chie-0.077
 looks-0.075
 eight-0.071
of-0.071
Token smoke
Token ID7523
Feature activation-1.16e+00

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.023
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.022
 bal-0.018
crow-0.018
 doctors-0.018
 people-0.018
Token when
Token ID618
Feature activation-5.88e-01

Pos logit contributions
 idea+0.041
 bal+0.036
crow+0.035
 doctors+0.033
 pilot+0.033
Neg logit contributions
ifying-0.053
chie-0.039
 hips-0.038
years-0.038
igsaw-0.038
Token it
Token ID340
Feature activation-7.02e-01

Pos logit contributions
 idea+0.015
 bal+0.012
crow+0.012
cut+0.011
 understand+0.010
Neg logit contributions
ifying-0.033
igsaw-0.027
years-0.027
chie-0.027
 hips-0.026
Token zoom
Token ID19792
Feature activation-4.27e-01

Pos logit contributions
 idea+0.027
 bal+0.023
crow+0.023
 understand+0.022
 doctors+0.021
Neg logit contributions
ifying-0.030
chie-0.023
years-0.022
igsaw-0.022
 hips-0.021
Token gate
Token ID8946
Feature activation-3.04e-01

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.001
 cows-0.001
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
 idea+0.010
 bal+0.008
crow+0.008
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.016
igsaw-0.013
years-0.013
chie-0.012
 hips-0.012
Token She
Token ID1375
Feature activation+1.06e+00

Pos logit contributions
 idea+0.030
 bal+0.026
cut+0.025
crow+0.024
 doctors+0.024
Neg logit contributions
ifying-0.028
igsaw-0.023
years-0.021
-0.020
run-0.019
Token smiled
Token ID13541
Feature activation-1.76e-01

Pos logit contributions
ifying+0.057
igsaw+0.050
chie+0.046
years+0.044
 hips+0.041
Neg logit contributions
 idea-0.034
 doctors-0.028
 bal-0.027
 cows-0.027
 people-0.027
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.004
 doctors+0.004
cut+0.004
Neg logit contributions
ifying-0.009
years-0.007
igsaw-0.007
chie-0.007
jamin-0.007
Token let
Token ID1309
Feature activation-3.04e+00

Pos logit contributions
ifying+0.063
igsaw+0.055
chie+0.052
years+0.049
 hips+0.045
Neg logit contributions
 idea-0.036
 bal-0.031
 doctors-0.030
 cows-0.030
 people-0.030
Token them
Token ID606
Feature activation-2.34e-01

Pos logit contributions
cut+0.118
 bal+0.112
 rarely+0.111
ooks+0.110
 idea+0.109
Neg logit contributions
ifying-0.078
of-0.072
 first-0.072
 Rose-0.069
 sigh-0.062
Token in
Token ID287
Feature activation+1.93e-01

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.005
 doctors+0.005
cut+0.005
Neg logit contributions
ifying-0.012
igsaw-0.010
chie-0.010
years-0.010
 hips-0.010
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token They
Token ID1119
Feature activation+4.17e-01

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
cut+0.014
 doctors+0.014
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
-0.014
jamin-0.014
Token saw
Token ID2497
Feature activation-8.78e-01

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
jamin+0.014
Neg logit contributions
 idea-0.015
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.012
Token that
Token ID326
Feature activation-1.01e+00

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token day
Token ID1110
Feature activation-3.24e-01

Pos logit contributions
 idea+0.020
crow+0.018
ighters+0.018
 paramedics+0.017
aw+0.017
Neg logit contributions
ifying-0.056
igsaw-0.052
jamin-0.049
years-0.049
chie-0.047
Token on
Token ID319
Feature activation+1.56e-02

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
 cows+0.009
Neg logit contributions
ifying-0.016
years-0.013
igsaw-0.013
chie-0.012
 hips-0.012
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
 hips+0.001
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token Tim
Token ID5045
Feature activation-6.24e-01

Pos logit contributions
 idea+0.042
crow+0.040
ighters+0.036
 paramedics+0.036
aw+0.036
Neg logit contributions
ifying-0.036
years-0.031
igsaw-0.029
-0.028
 total-0.026
Tokenmy
Token ID1820
Feature activation-3.04e+00

Pos logit contributions
 idea+0.034
crow+0.031
 bal+0.031
 paramedics+0.030
 cows+0.030
Neg logit contributions
ifying-0.015
years-0.011
jamin-0.010
igsaw-0.010
chie-0.010
Token learned
Token ID4499
Feature activation+3.50e-01

Pos logit contributions
 idea+0.132
 paramedics+0.125
crow+0.122
adders+0.117
 hurricane+0.117
Neg logit contributions
ll-0.080
jamin-0.073
years-0.070
ifying-0.065
 seeking-0.064
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
ifying+0.018
igsaw+0.015
years+0.014
chie+0.014
jamin+0.013
Neg logit contributions
 idea-0.012
 bal-0.009
 doctors-0.009
crow-0.009
 people-0.009
Token be
Token ID307
Feature activation-6.63e-01

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.007
igsaw-0.006
years-0.006
chie-0.006
jamin-0.005
Token more
Token ID517
Feature activation-1.35e+00

Pos logit contributions
 idea+0.024
crow+0.021
 bal+0.020
 cows+0.020
cut+0.019
Neg logit contributions
ifying-0.029
years-0.026
chie-0.024
igsaw-0.023
run-0.022
Token careful
Token ID8161
Feature activation-1.16e+00

Pos logit contributions
 idea+0.048
crow+0.047
 cows+0.044
 bidder+0.043
 People+0.042
Neg logit contributions
ifying-0.049
years-0.047
run-0.042
igsaw-0.042
ll-0.041
Token She
Token ID1375
Feature activation+3.28e-01

Pos logit contributions
ifying+0.032
igsaw+0.026
years+0.026
chie+0.026
 hips+0.024
Neg logit contributions
 idea-0.020
crow-0.016
 bal-0.015
 doctors-0.015
 cows-0.015
Token opened
Token ID4721
Feature activation-5.08e-01

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.013
chie+0.013
 hips+0.012
Neg logit contributions
 idea-0.011
 bal-0.009
crow-0.009
 doctors-0.009
 cows-0.009
Token the
Token ID262
Feature activation+3.85e-01

Pos logit contributions
 idea+0.019
 bal+0.016
crow+0.016
 doctors+0.016
 understand+0.015
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.018
chie-0.017
 hips-0.017
Token door
Token ID3420
Feature activation+6.30e-01

Pos logit contributions
ifying+0.019
igsaw+0.016
years+0.016
chie+0.015
jamin+0.015
Neg logit contributions
 idea-0.013
crow-0.011
 doctors-0.010
 bal-0.010
 cows-0.010
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
ifying+0.037
igsaw+0.032
chie+0.031
years+0.031
jamin+0.029
Neg logit contributions
 idea-0.017
 bal-0.012
crow-0.011
 doctors-0.011
 people-0.011
Token let
Token ID1309
Feature activation-3.01e+00

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.008
years-0.007
igsaw-0.007
chie-0.006
jamin-0.006
Token the
Token ID262
Feature activation-4.88e-01

Pos logit contributions
ooks+0.143
 bal+0.137
cut+0.136
ained+0.135
 impression+0.133
Neg logit contributions
ifying-0.046
 Ruby-0.041
of-0.039
 looks-0.039
 Rose-0.039
Token bird
Token ID6512
Feature activation-2.55e-01

Pos logit contributions
 idea+0.013
 bal+0.012
crow+0.011
cut+0.010
 doctors+0.010
Neg logit contributions
ifying-0.025
chie-0.021
igsaw-0.021
years-0.020
 hips-0.019
Token out
Token ID503
Feature activation+5.92e-01

Pos logit contributions
 idea+0.007
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.014
igsaw-0.012
chie-0.012
years-0.011
 hips-0.011
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
ifying+0.035
igsaw+0.030
years+0.030
chie+0.029
jamin+0.028
Neg logit contributions
 idea-0.015
 bal-0.011
crow-0.011
 doctors-0.010
 people-0.010
Token The
Token ID383
Feature activation-6.85e-01

Pos logit contributions
 idea+0.004
 bal+0.003
crow+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
 hips-0.003
Token was
Token ID373
Feature activation-4.94e-01

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token all
Token ID477
Feature activation-5.13e-03

Pos logit contributions
 idea+0.017
crow+0.015
 bal+0.014
cut+0.014
 cows+0.014
Neg logit contributions
ifying-0.023
years-0.019
chie-0.018
igsaw-0.018
 hips-0.017
Token dry
Token ID5894
Feature activation+1.34e-01

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.000
igsaw-0.000
years-0.000
chie-0.000
jamin-0.000
Token when
Token ID618
Feature activation-4.10e-01

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token she
Token ID673
Feature activation+1.97e-01

Pos logit contributions
 idea+0.011
crow+0.008
 bal+0.008
cut+0.008
 paramedics+0.007
Neg logit contributions
ifying-0.023
years-0.020
igsaw-0.019
chie-0.018
 hips-0.018
Token let
Token ID1309
Feature activation-3.01e+00

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token go
Token ID467
Feature activation+6.49e-01

Pos logit contributions
cut+0.121
ooks+0.110
 idea+0.109
amb+0.105
ook+0.104
Neg logit contributions
ifying-0.086
 Rose-0.074
 sigh-0.071
 Alex-0.067
 Emily-0.065
Token!
Token ID0
Feature activation-5.84e-02

Pos logit contributions
ifying+0.038
igsaw+0.032
years+0.032
chie+0.031
jamin+0.030
Neg logit contributions
 idea-0.017
 bal-0.013
crow-0.012
 doctors-0.012
 people-0.012
Token She
Token ID1375
Feature activation-3.45e-01

Pos logit contributions
 idea+0.002
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.003
igsaw-0.002
years-0.002
chie-0.002
jamin-0.002
Token put
Token ID1234
Feature activation-6.14e-01

Pos logit contributions
 idea+0.014
crow+0.012
 bal+0.011
 doctors+0.011
 cows+0.011
Neg logit contributions
ifying-0.014
years-0.011
igsaw-0.011
jamin-0.011
chie-0.011
Token on
Token ID319
Feature activation-3.31e-01

Pos logit contributions
 idea+0.018
crow+0.014
 bal+0.014
cut+0.014
 cows+0.014
Neg logit contributions
ifying-0.032
years-0.027
igsaw-0.026
chie-0.025
 hips-0.025
Token their
Token ID511
Feature activation+1.72e-01

Pos logit contributions
 idea+0.041
 bal+0.039
 pilot+0.037
 ending+0.037
 clim+0.037
Neg logit contributions
ifying-0.055
igsaw-0.043
 hips-0.041
years-0.040
chie-0.039
Token food
Token ID2057
Feature activation-3.24e-01

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token on
Token ID319
Feature activation-8.71e-01

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.010
 doctors+0.010
 understand+0.010
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.011
chie-0.011
 hips-0.011
Token it
Token ID340
Feature activation-8.57e-01

Pos logit contributions
 idea+0.031
 bal+0.028
crow+0.028
 doctors+0.026
 paramedics+0.026
Neg logit contributions
ifying-0.041
igsaw-0.033
 hips-0.030
years-0.029
chie-0.028
Token and
Token ID290
Feature activation-8.83e-01

Pos logit contributions
 idea+0.029
crow+0.027
 paramedics+0.026
 doctors+0.025
 bal+0.025
Neg logit contributions
ifying-0.040
igsaw-0.033
 hips-0.031
years-0.030
chie-0.030
Token let
Token ID1309
Feature activation-2.99e+00

Pos logit contributions
 idea+0.031
crow+0.028
 bal+0.027
 doctors+0.024
 paramedics+0.023
Neg logit contributions
ifying-0.041
igsaw-0.031
years-0.031
 hips-0.031
jamin-0.029
Token it
Token ID340
Feature activation-2.21e-01

Pos logit contributions
cut+0.117
 bal+0.107
ooks+0.102
aver+0.100
 idea+0.098
Neg logit contributions
ifying-0.084
 looks-0.072
igsaw-0.070
 Alex-0.069
chie-0.064
Token cook
Token ID4255
Feature activation-1.27e+00

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.012
igsaw-0.011
chie-0.010
years-0.010
 hips-0.010
Token.
Token ID13
Feature activation-8.82e-01

Pos logit contributions
 idea+0.045
crow+0.044
 bal+0.042
 pilot+0.041
 paramedics+0.039
Neg logit contributions
ifying-0.055
igsaw-0.043
 hips-0.041
years-0.039
chie-0.037
Token\n
Token ID198
Feature activation-4.06e-01

Pos logit contributions
 idea+0.034
 bal+0.031
 doctors+0.029
cut+0.029
 infection+0.029
Neg logit contributions
ifying-0.037
igsaw-0.031
-0.028
jamin-0.027
years-0.027
Token\n
Token ID198
Feature activation-2.59e-01

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.012
 doctors+0.012
 cows+0.012
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.015
chie-0.014
jamin-0.013
Token calm
Token ID9480
Feature activation-2.34e-01

Pos logit contributions
 idea+0.017
crow+0.016
 bal+0.015
 doctors+0.015
 cows+0.015
Neg logit contributions
ifying-0.022
years-0.017
igsaw-0.017
chie-0.017
jamin-0.016
Token and
Token ID290
Feature activation-2.45e-01

Pos logit contributions
 idea+0.008
 bal+0.006
crow+0.006
 doctors+0.006
 understand+0.006
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.009
chie-0.009
jamin-0.009
Token peaceful
Token ID12309
Feature activation-1.23e-01

Pos logit contributions
 idea+0.009
crow+0.007
 bal+0.007
 doctors+0.007
 cows+0.006
Neg logit contributions
ifying-0.012
years-0.010
igsaw-0.009
chie-0.009
jamin-0.009
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
 idea+0.004
crow+0.003
 bal+0.003
 doctors+0.003
 understand+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token They
Token ID1119
Feature activation-2.23e-01

Pos logit contributions
 idea+0.001
crow+0.001
 bal+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token let
Token ID1309
Feature activation-2.97e+00

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.010
years-0.009
igsaw-0.008
chie-0.008
jamin-0.008
Token their
Token ID511
Feature activation+5.61e-01

Pos logit contributions
cut+0.130
 idea+0.129
 clim+0.111
adders+0.111
 bidder+0.110
Neg logit contributions
ifying-0.081
jamin-0.066
of-0.061
 Bear-0.059
 Alex-0.055
Token little
Token ID1310
Feature activation-6.01e-01

Pos logit contributions
ifying+0.034
igsaw+0.030
years+0.029
chie+0.029
jamin+0.027
Neg logit contributions
 idea-0.013
 doctors-0.010
 bal-0.009
 people-0.009
 cows-0.009
Token dog
Token ID3290
Feature activation-2.81e-01

Pos logit contributions
 idea+0.022
crow+0.020
 bal+0.019
 doctors+0.018
cut+0.018
Neg logit contributions
ifying-0.026
igsaw-0.021
years-0.020
chie-0.020
jamin-0.019
Token run
Token ID1057
Feature activation+2.10e-01

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 doctors+0.008
cut+0.007
Neg logit contributions
ifying-0.013
igsaw-0.011
years-0.011
chie-0.010
jamin-0.010
Token around
Token ID1088
Feature activation+1.73e-01

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.004
 cows-0.004
Token toys
Token ID14958
Feature activation+5.19e-01

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
ifying+0.029
igsaw+0.025
years+0.024
chie+0.024
jamin+0.023
Neg logit contributions
 idea-0.014
 bal-0.011
crow-0.011
 doctors-0.011
 cows-0.010
Token There
Token ID1318
Feature activation-1.31e+00

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token were
Token ID547
Feature activation+5.42e-01

Pos logit contributions
 idea+0.061
 bal+0.060
 doctors+0.057
crow+0.056
 pilot+0.056
Neg logit contributions
ifying-0.035
igsaw-0.025
years-0.025
-0.024
 seeking-0.024
Token balls
Token ID11333
Feature activation+1.55e-01

Pos logit contributions
ifying+0.027
igsaw+0.023
years+0.022
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.018
 doctors-0.014
 cows-0.014
 bal-0.014
crow-0.014
Token,
Token ID11
Feature activation+2.33e+00

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.007
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.003
 cows-0.003
Token dolls
Token ID36062
Feature activation+3.46e-01

Pos logit contributions
ifying+0.126
igsaw+0.123
chie+0.117
ll+0.112
years+0.112
Neg logit contributions
 people-0.060
 idea-0.058
 cows-0.058
 doctors-0.055
 sun-0.048
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
ifying+0.019
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token cars
Token ID5006
Feature activation+2.25e-01

Pos logit contributions
ifying+0.089
igsaw+0.084
chie+0.080
years+0.076
+0.072
Neg logit contributions
 idea-0.043
 cows-0.042
 people-0.042
 doctors-0.040
 wheat-0.034
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
ifying+0.012
igsaw+0.011
years+0.010
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token and
Token ID290
Feature activation+1.76e+00

Pos logit contributions
ifying+0.082
igsaw+0.076
chie+0.072
years+0.071
+0.066
Neg logit contributions
 idea-0.033
 cows-0.031
 people-0.030
 doctors-0.029
 wheat-0.025
Token day
Token ID1110
Feature activation-7.91e-01

Pos logit contributions
ifying+0.075
years+0.066
 hips+0.064
chie+0.064
igsaw+0.063
Neg logit contributions
 idea-0.006
 day-0.001
 time+0.001
 bal+0.001
 morning+0.001
Token,
Token ID11
Feature activation-8.25e-02

Pos logit contributions
 idea+0.028
cut+0.023
crow+0.021
 doctors+0.021
 bal+0.020
Neg logit contributions
ifying-0.039
igsaw-0.032
years-0.031
jamin-0.029
chie-0.028
Token Sophie
Token ID35331
Feature activation-7.03e-01

Pos logit contributions
 idea+0.003
crow+0.003
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.003
years-0.003
chie-0.003
jamin-0.003
Token put
Token ID1234
Feature activation+2.26e-01

Pos logit contributions
 idea+0.025
crow+0.021
 bal+0.020
 doctors+0.020
cut+0.019
Neg logit contributions
ifying-0.032
igsaw-0.026
years-0.026
jamin-0.025
chie-0.025
Token on
Token ID319
Feature activation+2.18e-02

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.010
jamin+0.009
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token her
Token ID607
Feature activation+2.08e+00

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.000
Token new
Token ID649
Feature activation+9.33e-01

Pos logit contributions
ifying+0.122
years+0.111
chie+0.108
igsaw+0.105
ll+0.098
Neg logit contributions
 idea-0.057
 time-0.039
 people-0.039
 day-0.038
 bal-0.037
Token robe
Token ID33192
Feature activation+7.02e-01

Pos logit contributions
ifying+0.049
years+0.041
igsaw+0.040
chie+0.039
jamin+0.035
Neg logit contributions
 idea-0.032
 bal-0.024
 doctors-0.024
 people-0.023
 thing-0.023
Token,
Token ID11
Feature activation+7.56e-02

Pos logit contributions
ifying+0.039
igsaw+0.033
years+0.033
chie+0.032
jamin+0.030
Neg logit contributions
 idea-0.019
 bal-0.015
crow-0.014
 doctors-0.014
 cows-0.014
Token stepped
Token ID10764
Feature activation+2.45e-01

Pos logit contributions
ifying+0.004
igsaw+0.003
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.003
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token outside
Token ID2354
Feature activation+4.81e-01

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.010
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token cries
Token ID24691
Feature activation+3.67e-01

Pos logit contributions
ifying+0.034
igsaw+0.029
years+0.029
chie+0.028
jamin+0.027
Neg logit contributions
 idea-0.016
 doctors-0.012
 people-0.011
 cows-0.011
 bal-0.011
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
ifying+0.020
igsaw+0.018
years+0.017
chie+0.017
jamin+0.016
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.008
 doctors-0.008
 cows-0.007
Token She
Token ID1375
Feature activation+1.41e+00

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.008
crow-0.007
 bal-0.007
 doctors-0.007
 cows-0.007
Token came
Token ID1625
Feature activation+9.52e-01

Pos logit contributions
ifying+0.081
igsaw+0.074
chie+0.063
years+0.063
 hips+0.062
Neg logit contributions
 idea-0.040
 doctors-0.035
 people-0.034
 pilot-0.033
crow-0.032
Token in
Token ID287
Feature activation+7.17e-01

Pos logit contributions
ifying+0.055
igsaw+0.048
years+0.043
+0.042
chie+0.041
Neg logit contributions
 idea-0.027
 people-0.023
 doctors-0.022
 cows-0.021
crow-0.021
Token and
Token ID290
Feature activation+1.78e+00

Pos logit contributions
ifying+0.043
years+0.037
igsaw+0.036
chie+0.036
jamin+0.034
Neg logit contributions
 idea-0.018
 people-0.013
crow-0.013
 doctors-0.013
 understand-0.012
Token saw
Token ID2497
Feature activation-1.33e-01

Pos logit contributions
ifying+0.112
igsaw+0.104
+0.091
chie+0.089
years+0.083
Neg logit contributions
 people-0.036
 doctors-0.035
crow-0.032
 idea-0.032
 cows-0.030
Token the
Token ID262
Feature activation-6.48e-01

Pos logit contributions
 idea+0.005
 bal+0.004
crow+0.004
 doctors+0.004
 understand+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
jamin-0.005
Token mess
Token ID2085
Feature activation+7.96e-01

Pos logit contributions
 idea+0.024
crow+0.023
 bal+0.023
 clim+0.021
 cows+0.020
Neg logit contributions
ifying-0.027
igsaw-0.022
chie-0.020
years-0.019
 hips-0.019
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
ifying+0.047
igsaw+0.041
years+0.040
chie+0.039
jamin+0.037
Neg logit contributions
 idea-0.020
 people-0.015
 bal-0.015
 doctors-0.015
crow-0.014
Token She
Token ID1375
Feature activation+2.04e+00

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.017
chie+0.017
 hips+0.015
Neg logit contributions
 idea-0.013
crow-0.011
 bal-0.011
 doctors-0.011
 people-0.010
Token better
Token ID1365
Feature activation+6.25e-01

Pos logit contributions
 idea+0.013
crow+0.011
 bal+0.011
 cows+0.010
 paramedics+0.009
Neg logit contributions
ifying-0.034
years-0.029
igsaw-0.029
chie-0.028
jamin-0.028
Token to
Token ID284
Feature activation+4.50e-01

Pos logit contributions
ifying+0.031
igsaw+0.026
years+0.025
chie+0.024
jamin+0.023
Neg logit contributions
 idea-0.022
 bal-0.018
crow-0.018
 doctors-0.018
 people-0.018
Token share
Token ID2648
Feature activation+2.51e-01

Pos logit contributions
ifying+0.023
igsaw+0.019
years+0.019
chie+0.018
 hips+0.017
Neg logit contributions
 idea-0.015
crow-0.012
 bal-0.012
 understand-0.012
 doctors-0.011
Token with
Token ID351
Feature activation+1.12e+00

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.010
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.005
Token friends
Token ID2460
Feature activation+8.24e-01

Pos logit contributions
ifying+0.080
igsaw+0.071
years+0.066
chie+0.066
oured+0.065
Neg logit contributions
 idea-0.017
 people-0.012
 doctors-0.010
 understand-0.010
 thing-0.008
Token than
Token ID621
Feature activation+1.67e+00

Pos logit contributions
ifying+0.047
igsaw+0.041
years+0.040
chie+0.040
 hips+0.037
Neg logit contributions
 idea-0.021
 bal-0.016
crow-0.016
 people-0.016
 thing-0.015
Token to
Token ID284
Feature activation+7.05e-01

Pos logit contributions
ifying+0.089
igsaw+0.077
jamin+0.071
years+0.071
chie+0.069
Neg logit contributions
 idea-0.051
 people-0.046
 doctors-0.044
 understand-0.043
 time-0.039
Token be
Token ID307
Feature activation+1.94e-01

Pos logit contributions
ifying+0.037
years+0.031
igsaw+0.031
chie+0.029
oured+0.027
Neg logit contributions
 idea-0.024
 understand-0.019
 people-0.018
crow-0.018
 doctors-0.018
Token selfish
Token ID20363
Feature activation+8.40e-01

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.005
 doctors-0.005
 cows-0.005
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
ifying+0.048
igsaw+0.042
years+0.041
chie+0.041
 hips+0.038
Neg logit contributions
 idea-0.022
 bal-0.017
 people-0.016
 doctors-0.016
crow-0.016
Token<|endoftext|>
Token ID50256
Feature activation+7.62e-01

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token man
Token ID582
Feature activation+9.53e-01

Pos logit contributions
 idea+0.010
 bal+0.008
crow+0.008
 doctors+0.008
 understand+0.007
Neg logit contributions
ifying-0.021
igsaw-0.018
years-0.017
chie-0.017
jamin-0.017
Token appeared
Token ID4120
Feature activation+1.34e+00

Pos logit contributions
ifying+0.049
igsaw+0.045
chie+0.041
years+0.040
 hips+0.039
Neg logit contributions
 idea-0.027
crow-0.024
 doctors-0.023
 bal-0.023
 cows-0.023
Token behind
Token ID2157
Feature activation+5.33e-01

Pos logit contributions
ifying+0.082
igsaw+0.076
chie+0.073
years+0.072
 hips+0.068
Neg logit contributions
 idea-0.027
crow-0.019
 people-0.018
 at-0.018
 bal-0.018
Token her
Token ID607
Feature activation+1.24e+00

Pos logit contributions
ifying+0.029
igsaw+0.025
years+0.024
chie+0.024
jamin+0.022
Neg logit contributions
 idea-0.016
crow-0.012
 cows-0.012
 doctors-0.012
 people-0.012
Token.
Token ID13
Feature activation+6.98e-01

Pos logit contributions
ifying+0.071
years+0.065
chie+0.061
igsaw+0.061
jamin+0.057
Neg logit contributions
 idea-0.031
 people-0.024
 bal-0.024
 cows-0.023
 pilot-0.022
Token He
Token ID679
Feature activation+1.85e+00

Pos logit contributions
ifying+0.036
igsaw+0.029
years+0.029
chie+0.028
 hips+0.026
Neg logit contributions
 idea-0.024
crow-0.019
 cows-0.018
 bal-0.018
 people-0.018
Token said
Token ID531
Feature activation-2.20e-01

Pos logit contributions
ifying+0.107
igsaw+0.097
chie+0.085
years+0.084
 hips+0.081
Neg logit contributions
 idea-0.049
 people-0.039
 pilot-0.039
 man-0.038
 cows-0.037
Token,
Token ID11
Feature activation+5.31e-01

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.006
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Token "
Token ID366
Feature activation+5.15e-01

Pos logit contributions
ifying+0.029
igsaw+0.025
years+0.024
chie+0.023
jamin+0.022
Neg logit contributions
 idea-0.015
 bal-0.012
crow-0.012
 doctors-0.012
 cows-0.012
TokenThat
Token ID2504
Feature activation+5.46e-01

Pos logit contributions
ifying+0.023
igsaw+0.019
years+0.018
 hips+0.017
chie+0.017
Neg logit contributions
 idea-0.020
crow-0.017
 bal-0.017
 doctors-0.016
 people-0.016
Token's
Token ID338
Feature activation+3.38e-01

Pos logit contributions
ifying+0.027
years+0.022
igsaw+0.022
chie+0.021
 hips+0.020
Neg logit contributions
 idea-0.019
 bal-0.015
crow-0.015
 doctors-0.015
 people-0.015
Token Tommy
Token ID19919
Feature activation-2.68e-01

Pos logit contributions
 idea+0.014
crow+0.012
 bal+0.012
cut+0.011
 doctors+0.011
Neg logit contributions
ifying-0.018
igsaw-0.014
years-0.014
chie-0.013
jamin-0.013
Token's
Token ID338
Feature activation-4.85e-01

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
cut+0.008
Neg logit contributions
ifying-0.011
years-0.009
igsaw-0.009
chie-0.008
jamin-0.008
Token mom
Token ID1995
Feature activation+2.17e-01

Pos logit contributions
 idea+0.014
crow+0.012
 bal+0.011
cut+0.011
 doctors+0.010
Neg logit contributions
ifying-0.025
igsaw-0.021
years-0.020
chie-0.020
 hips-0.020
Token and
Token ID290
Feature activation-3.31e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.007
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token dad
Token ID9955
Feature activation-4.83e-01

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 cows+0.008
cut+0.008
Neg logit contributions
ifying-0.016
years-0.013
igsaw-0.013
chie-0.013
jamin-0.012
Token came
Token ID1625
Feature activation+1.13e+00

Pos logit contributions
 idea+0.018
 bal+0.015
crow+0.015
 doctors+0.015
 cows+0.014
Neg logit contributions
ifying-0.021
years-0.017
chie-0.016
igsaw-0.016
 hips-0.016
Token to
Token ID284
Feature activation+4.65e-01

Pos logit contributions
ifying+0.067
igsaw+0.059
years+0.053
+0.052
chie+0.052
Neg logit contributions
 idea-0.029
 people-0.024
 doctors-0.023
 cows-0.023
 understand-0.022
Token find
Token ID1064
Feature activation-1.16e+00

Pos logit contributions
ifying+0.024
igsaw+0.020
years+0.020
chie+0.019
jamin+0.018
Neg logit contributions
 idea-0.015
crow-0.012
 doctors-0.012
 bal-0.012
 people-0.012
Token him
Token ID683
Feature activation+2.95e-01

Pos logit contributions
 idea+0.046
 bal+0.040
 pilot+0.037
 rarely+0.037
 bidder+0.036
Neg logit contributions
ifying-0.047
years-0.037
igsaw-0.036
run-0.036
chie-0.034
Token.
Token ID13
Feature activation+5.57e-01

Pos logit contributions
ifying+0.016
igsaw+0.014
years+0.014
chie+0.013
jamin+0.013
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token They
Token ID1119
Feature activation+4.19e-01

Pos logit contributions
ifying+0.028
igsaw+0.024
chie+0.024
years+0.024
jamin+0.021
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.014
 doctors-0.014
 cows-0.014
TokenGood
Token ID10248
Feature activation-9.00e-01

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
 hips+0.008
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token night
Token ID1755
Feature activation-1.03e-01

Pos logit contributions
cut+0.018
 idea+0.017
ighters+0.017
 cows+0.017
crow+0.017
Neg logit contributions
ifying-0.048
igsaw-0.042
chie-0.040
jamin-0.039
years-0.038
Token,
Token ID11
Feature activation+2.67e-02

Pos logit contributions
 idea+0.003
crow+0.003
 bal+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.005
igsaw-0.005
years-0.004
chie-0.004
 hips-0.004
Token mom
Token ID1995
Feature activation-2.78e-01

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
crow-0.001
 bal-0.001
 doctors-0.001
 cows-0.001
Token.
Token ID13
Feature activation+9.18e-03

Pos logit contributions
 idea+0.010
crow+0.008
 bal+0.008
 cows+0.008
 doctors+0.008
Neg logit contributions
ifying-0.013
igsaw-0.011
years-0.011
chie-0.010
jamin-0.010
Token Good
Token ID4599
Feature activation-2.32e-02

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token night
Token ID1755
Feature activation-2.65e-02

Pos logit contributions
 idea+0.000
 bal+0.000
crow+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.002
igsaw-0.001
years-0.001
chie-0.001
jamin-0.001
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 idea+0.001
 bal+0.001
crow+0.001
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token duck
Token ID22045
Feature activation+2.07e-01

Pos logit contributions
ifying+0.011
igsaw+0.009
years+0.009
chie+0.009
jamin+0.008
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.007
Token,"
Token ID553
Feature activation+1.39e-01

Pos logit contributions
ifying+0.010
igsaw+0.009
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.005
 cows-0.005
Token Lily
Token ID20037
Feature activation+2.96e-01

Pos logit contributions
ifying+0.007
igsaw+0.005
years+0.005
chie+0.005
jamin+0.005
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token little
Token ID1310
Feature activation+7.16e-01

Pos logit contributions
ifying+0.054
years+0.046
igsaw+0.046
chie+0.043
 hips+0.042
Neg logit contributions
 idea-0.035
 bal-0.028
 pilot-0.028
 people-0.027
 doctors-0.027
Token girl
Token ID2576
Feature activation+1.48e-02

Pos logit contributions
ifying+0.040
igsaw+0.034
years+0.034
chie+0.032
 hips+0.031
Neg logit contributions
 idea-0.020
crow-0.016
 bal-0.015
 doctors-0.015
 people-0.015
Token with
Token ID351
Feature activation+1.22e+00

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
ifying+0.060
igsaw+0.053
chie+0.050
ll+0.050
years+0.048
Neg logit contributions
 idea-0.041
 people-0.033
 understand-0.033
 bal-0.032
 doctors-0.032
Token friendly
Token ID8030
Feature activation+2.07e-01

Pos logit contributions
ifying+0.072
years+0.065
igsaw+0.061
chie+0.060
ll+0.054
Neg logit contributions
 idea-0.051
 bal-0.040
 understand-0.038
 pilot-0.038
 thing-0.037
Token face
Token ID1986
Feature activation+7.25e-02

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.007
 cows-0.006
Token.
Token ID13
Feature activation+8.81e-01

Pos logit contributions
ifying+0.004
igsaw+0.003
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token\n
Token ID198
Feature activation+1.99e-01

Pos logit contributions
ifying+0.045
chie+0.037
years+0.036
igsaw+0.036
 hips+0.032
Neg logit contributions
 idea-0.029
crow-0.023
 bal-0.023
 people-0.022
 cows-0.022
Token\n
Token ID198
Feature activation+9.80e-01

Pos logit contributions
ifying+0.010
igsaw+0.008
years+0.008
chie+0.008
jamin+0.008
Neg logit contributions
 idea-0.007
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token"
Token ID1
Feature activation+4.88e-01

Pos logit contributions
ifying+0.049
igsaw+0.038
chie+0.038
years+0.038
 hips+0.036
Neg logit contributions
 idea-0.033
crow-0.027
 bal-0.027
 people-0.027
 understand-0.026
TokenI
Token ID40
Feature activation-3.54e-01

Pos logit contributions
ifying+0.021
igsaw+0.018
years+0.016
chie+0.016
 hips+0.016
Neg logit contributions
 idea-0.019
 bal-0.017
crow-0.017
 doctors-0.016
 understand-0.016
Token thought
Token ID1807
Feature activation-1.15e+00

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.010
chie+0.010
 hips+0.010
Neg logit contributions
 idea-0.008
 bal-0.007
crow-0.007
 doctors-0.006
 cows-0.006
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
 idea+0.036
crow+0.034
 bal+0.032
 paramedics+0.030
 countryside+0.030
Neg logit contributions
ifying-0.050
igsaw-0.045
years-0.042
 hips-0.042
run-0.041
Token "
Token ID366
Feature activation-2.70e-01

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
TokenI
Token ID40
Feature activation-1.19e+00

Pos logit contributions
 idea+0.012
 bal+0.010
 doctors+0.010
crow+0.010
 cows+0.010
Neg logit contributions
ifying-0.011
years-0.008
igsaw-0.008
chie-0.008
jamin-0.008
Token should
Token ID815
Feature activation+1.85e-01

Pos logit contributions
 cows+0.041
 doctors+0.040
 paramedics+0.040
 idea+0.039
crow+0.038
Neg logit contributions
ifying-0.049
igsaw-0.037
years-0.037
chie-0.037
ll-0.036
Token give
Token ID1577
Feature activation+1.46e-01

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.007
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token the
Token ID262
Feature activation-3.42e-01

Pos logit contributions
ifying+0.008
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token bear
Token ID6842
Feature activation+1.20e+00

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.013
 doctors+0.012
 understand+0.012
Neg logit contributions
ifying-0.013
igsaw-0.010
years-0.010
chie-0.010
 hips-0.009
Token some
Token ID617
Feature activation-6.57e-01

Pos logit contributions
ifying+0.060
igsaw+0.051
years+0.051
chie+0.048
 hips+0.047
Neg logit contributions
 idea-0.039
 bal-0.032
crow-0.032
 thing-0.031
 people-0.031
Token food
Token ID2057
Feature activation+1.15e-01

Pos logit contributions
 idea+0.022
crow+0.018
 bal+0.018
 doctors+0.017
 pilot+0.017
Neg logit contributions
ifying-0.032
igsaw-0.026
years-0.025
chie-0.024
 hips-0.023
Token."
Token ID526
Feature activation+8.57e-03

Pos logit contributions
ifying+0.006
igsaw+0.005
years+0.005
chie+0.005
jamin+0.004
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token!
Token ID0
Feature activation+3.93e-01

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.021
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.016
 bal-0.013
crow-0.012
 doctors-0.012
 people-0.012
Token<|endoftext|>
Token ID50256
Feature activation+4.99e-01

Pos logit contributions
ifying+0.020
igsaw+0.016
years+0.016
chie+0.016
 hips+0.015
Neg logit contributions
 idea-0.013
 bal-0.011
crow-0.011
 doctors-0.010
 cows-0.010
TokenOnce
Token ID7454
Feature activation+2.35e-01

Pos logit contributions
ifying+0.023
igsaw+0.019
years+0.019
chie+0.019
 hips+0.017
Neg logit contributions
 idea-0.018
crow-0.015
 bal-0.015
 doctors-0.014
 people-0.014
Token upon
Token ID2402
Feature activation+1.42e+00

Pos logit contributions
ifying+0.007
igsaw+0.005
years+0.004
chie+0.004
jamin+0.004
Neg logit contributions
 idea-0.013
 bal-0.011
crow-0.011
 doctors-0.011
 cows-0.011
Token a
Token ID257
Feature activation+5.58e-02

Pos logit contributions
ifying+0.077
igsaw+0.066
chie+0.065
years+0.062
 hips+0.060
Neg logit contributions
 idea-0.045
 understand-0.036
 doctors-0.036
 people-0.036
 time-0.034
Token time
Token ID640
Feature activation+9.54e-01

Pos logit contributions
ifying+0.004
igsaw+0.004
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.001
crow-0.000
 bal-0.000
 doctors-0.000
 cows-0.000
Token there
Token ID612
Feature activation-8.54e-01

Pos logit contributions
ifying+0.060
years+0.051
igsaw+0.050
chie+0.049
jamin+0.047
Neg logit contributions
 idea-0.021
 bal-0.016
crow-0.014
 people-0.013
 time-0.013
Token was
Token ID373
Feature activation-7.23e-01

Pos logit contributions
 idea+0.038
 doctors+0.035
 bal+0.034
 paramedics+0.034
crow+0.034
Neg logit contributions
ifying-0.029
igsaw-0.022
years-0.021
jamin-0.021
chie-0.020
Token a
Token ID257
Feature activation-2.85e-01

Pos logit contributions
 idea+0.034
crow+0.031
cut+0.029
 bal+0.029
 paramedics+0.028
Neg logit contributions
ifying-0.025
years-0.019
igsaw-0.019
jamin-0.017
chie-0.016
Token big
Token ID1263
Feature activation-5.04e-01

Pos logit contributions
 idea+0.011
crow+0.009
 bal+0.009
 doctors+0.009
 understand+0.009
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.009
jamin-0.009
Token truck
Token ID7779
Feature activation-4.27e-01

Pos logit contributions
 idea+0.016
 bal+0.014
crow+0.014
 understand+0.013
 doctors+0.013
Neg logit contributions
ifying-0.024
igsaw-0.021
years-0.019
chie-0.019
jamin-0.019
Token when
Token ID618
Feature activation-4.21e-01

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 understand+0.003
Neg logit contributions
ifying-0.006
igsaw-0.005
years-0.005
chie-0.005
 hips-0.004
Token she
Token ID673
Feature activation+1.06e+00

Pos logit contributions
 idea+0.015
crow+0.012
 bal+0.012
cut+0.012
 paramedics+0.011
Neg logit contributions
ifying-0.020
years-0.017
igsaw-0.016
chie-0.015
 hips-0.015
Token unp
Token ID8593
Feature activation+7.69e-01

Pos logit contributions
ifying+0.062
igsaw+0.056
chie+0.052
years+0.048
 hips+0.047
Neg logit contributions
 idea-0.027
 doctors-0.021
 bal-0.021
 people-0.020
 hospital-0.019
Tokenacked
Token ID6021
Feature activation-1.08e+00

Pos logit contributions
ifying+0.039
chie+0.032
igsaw+0.031
years+0.031
 hips+0.030
Neg logit contributions
 idea-0.025
crow-0.021
 bal-0.019
 doctors-0.018
 people-0.018
Token it
Token ID340
Feature activation+2.50e-01

Pos logit contributions
 idea+0.034
crow+0.028
 bal+0.028
 paramedics+0.028
cut+0.027
Neg logit contributions
ifying-0.053
igsaw-0.044
years-0.044
 hips-0.043
chie-0.042
Token.
Token ID13
Feature activation-4.54e-01

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.007
 bal-0.006
crow-0.005
 doctors-0.005
 cows-0.005
Token She
Token ID1375
Feature activation+3.45e-02

Pos logit contributions
 idea+0.019
crow+0.016
 bal+0.016
 doctors+0.016
cut+0.016
Neg logit contributions
ifying-0.019
igsaw-0.016
years-0.015
jamin-0.014
chie-0.014
Token decided
Token ID3066
Feature activation-3.38e-01

Pos logit contributions
ifying+0.002
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
crow-0.001
 bal-0.001
 doctors-0.001
 cows-0.001
Token to
Token ID284
Feature activation-5.84e-01

Pos logit contributions
 idea+0.015
crow+0.013
 bal+0.013
cut+0.012
 doctors+0.012
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.009
 hips-0.009
Token give
Token ID1577
Feature activation-1.43e+00

Pos logit contributions
 idea+0.020
 bal+0.018
crow+0.018
 doctors+0.017
 cows+0.017
Neg logit contributions
ifying-0.027
igsaw-0.023
chie-0.021
years-0.021
jamin-0.020
Token it
Token ID340
Feature activation+3.45e-02

Pos logit contributions
 idea+0.052
cut+0.045
 paramedics+0.044
 hurricane+0.043
 emergency+0.042
Neg logit contributions
ifying-0.058
years-0.055
oured-0.048
igsaw-0.048
 hips-0.046
Token after
Token ID706
Feature activation+1.68e-01

Pos logit contributions
ifying+0.056
igsaw+0.049
years+0.048
chie+0.048
jamin+0.045
Neg logit contributions
 idea-0.026
 bal-0.019
 people-0.019
 doctors-0.019
 whenever-0.018
Token playing
Token ID2712
Feature activation-4.91e-01

Pos logit contributions
ifying+0.009
igsaw+0.007
years+0.007
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.004
Token with
Token ID351
Feature activation-2.73e-01

Pos logit contributions
 idea+0.017
crow+0.015
 bal+0.014
 paramedics+0.014
 cows+0.013
Neg logit contributions
ifying-0.023
igsaw-0.019
years-0.018
chie-0.018
 hips-0.017
Token them
Token ID606
Feature activation-1.30e-02

Pos logit contributions
 idea+0.009
 bal+0.007
crow+0.007
 understand+0.006
cut+0.006
Neg logit contributions
ifying-0.014
igsaw-0.012
years-0.012
chie-0.011
 hips-0.011
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
 idea+0.000
crow+0.000
 bal+0.000
 doctors+0.000
 cows+0.000
Neg logit contributions
ifying-0.001
igsaw-0.001
years-0.001
chie-0.001
 hips-0.001
Token She
Token ID1375
Feature activation-7.99e-01

Pos logit contributions
 idea+0.029
cut+0.026
crow+0.026
 pilot+0.025
 doctors+0.025
Neg logit contributions
ifying-0.025
igsaw-0.022
years-0.020
jamin-0.019
 hips-0.018
Token felt
Token ID2936
Feature activation-1.23e+00

Pos logit contributions
 idea+0.030
crow+0.027
 cows+0.025
 bal+0.025
 paramedics+0.025
Neg logit contributions
ifying-0.031
years-0.028
jamin-0.026
igsaw-0.025
chie-0.025
Token much
Token ID881
Feature activation-8.48e-01

Pos logit contributions
 idea+0.047
 cows+0.044
crow+0.044
 bidder+0.043
 pilot+0.041
Neg logit contributions
ifying-0.048
years-0.043
igsaw-0.037
chie-0.035
jamin-0.033
Token better
Token ID1365
Feature activation+2.41e-01

Pos logit contributions
 idea+0.026
crow+0.025
 cows+0.024
cut+0.022
 bal+0.022
Neg logit contributions
ifying-0.039
igsaw-0.033
years-0.032
chie-0.030
jamin-0.029
Token knowing
Token ID6970
Feature activation-2.58e-01

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.011
chie+0.011
jamin+0.011
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 people-0.004
Token where
Token ID810
Feature activation-4.09e-01

Pos logit contributions
 idea+0.009
crow+0.008
 bal+0.008
 cows+0.007
 doctors+0.007
Neg logit contributions
ifying-0.012
igsaw-0.010
years-0.010
chie-0.009
 hips-0.009
Token of
Token ID286
Feature activation+5.91e-01

Pos logit contributions
ifying+0.000
igsaw+0.000
years+0.000
chie+0.000
 hips+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
Token the
Token ID262
Feature activation+2.63e-01

Pos logit contributions
ifying+0.033
igsaw+0.028
years+0.027
chie+0.027
jamin+0.025
Neg logit contributions
 idea-0.017
 people-0.013
 doctors-0.013
crow-0.013
 cows-0.013
Token television
Token ID5581
Feature activation+6.26e-01

Pos logit contributions
ifying+0.012
igsaw+0.010
years+0.010
chie+0.009
jamin+0.009
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.008
 doctors-0.008
 cows-0.008
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
ifying+0.034
igsaw+0.029
years+0.029
chie+0.029
jamin+0.027
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.013
 doctors-0.013
 cows-0.013
Token<|endoftext|>
Token ID50256
Feature activation+4.26e-01

Pos logit contributions
 idea+0.012
 bal+0.010
 doctors+0.010
crow+0.010
 pilot+0.009
Neg logit contributions
ifying-0.014
igsaw-0.012
years-0.011
chie-0.010
jamin-0.010
TokenOnce
Token ID7454
Feature activation-2.62e-01

Pos logit contributions
ifying+0.022
igsaw+0.019
years+0.018
chie+0.018
 hips+0.017
Neg logit contributions
 idea-0.013
crow-0.011
 bal-0.010
 doctors-0.010
 people-0.010
Token upon
Token ID2402
Feature activation+9.65e-01

Pos logit contributions
 idea+0.014
 bal+0.013
crow+0.013
 doctors+0.013
 cows+0.012
Neg logit contributions
ifying-0.007
igsaw-0.005
years-0.005
chie-0.004
jamin-0.004
Token a
Token ID257
Feature activation-2.20e-01

Pos logit contributions
ifying+0.048
igsaw+0.041
chie+0.040
years+0.039
 hips+0.037
Neg logit contributions
 idea-0.034
 understand-0.027
 doctors-0.027
 people-0.027
 bal-0.027
Token time
Token ID640
Feature activation+9.87e-01

Pos logit contributions
 idea+0.003
crow+0.002
 bal+0.002
 doctors+0.001
 cows+0.001
Neg logit contributions
ifying-0.015
igsaw-0.013
years-0.013
chie-0.013
jamin-0.013
Token,
Token ID11
Feature activation+5.42e-01

Pos logit contributions
ifying+0.062
years+0.053
igsaw+0.053
chie+0.052
jamin+0.049
Neg logit contributions
 idea-0.022
 bal-0.016
 time-0.014
crow-0.014
 people-0.013
Token there
Token ID612
Feature activation-6.01e-01

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.023
chie+0.022
 hips+0.021
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.013
 doctors-0.013
 people-0.013
Token run
Token ID1057
Feature activation-4.05e-01

Pos logit contributions
 idea+0.042
crow+0.033
 bal+0.033
ook+0.032
cut+0.032
Neg logit contributions
ifying-0.039
chie-0.030
igsaw-0.028
years-0.027
 seeking-0.027
Token!"
Token ID2474
Feature activation+6.29e-01

Pos logit contributions
 idea+0.016
 bal+0.013
 understand+0.013
 doctors+0.012
crow+0.012
Neg logit contributions
ifying-0.018
chie-0.014
igsaw-0.014
years-0.014
 hips-0.013
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
ifying+0.035
years+0.030
igsaw+0.030
chie+0.030
jamin+0.027
Neg logit contributions
 idea-0.018
 bal-0.014
crow-0.014
 people-0.014
 doctors-0.013
Token
Token ID198
Feature activation+5.99e-01

Pos logit contributions
ifying+0.001
igsaw+0.000
years+0.000
chie+0.000
jamin+0.000
Neg logit contributions
 idea-0.000
 bal-0.000
crow-0.000
 doctors-0.000
 cows-0.000
TokenThey
Token ID2990
Feature activation+5.57e-01

Pos logit contributions
ifying+0.031
igsaw+0.027
years+0.026
chie+0.025
oured+0.024
Neg logit contributions
 idea-0.018
 bal-0.016
 people-0.016
 doctors-0.016
crow-0.016
Token grabbed
Token ID13176
Feature activation-3.82e-01

Pos logit contributions
ifying+0.026
igsaw+0.023
years+0.021
chie+0.021
jamin+0.019
Neg logit contributions
 idea-0.020
 bal-0.017
crow-0.017
 doctors-0.016
 people-0.016
Token their
Token ID511
Feature activation-3.12e-01

Pos logit contributions
 idea+0.014
 bal+0.012
crow+0.012
 doctors+0.012
cut+0.011
Neg logit contributions
ifying-0.017
igsaw-0.014
years-0.013
chie-0.013
 hips-0.012
Token snacks
Token ID28487
Feature activation-6.02e-01

Pos logit contributions
 idea+0.012
crow+0.011
 bal+0.010
 doctors+0.010
 understand+0.010
Neg logit contributions
ifying-0.014
igsaw-0.011
chie-0.011
years-0.010
 hips-0.010
Token and
Token ID290
Feature activation+2.63e-02

Pos logit contributions
 idea+0.021
 bal+0.017
crow+0.017
 doctors+0.017
 pilot+0.016
Neg logit contributions
ifying-0.029
igsaw-0.023
 hips-0.022
years-0.022
chie-0.022
Token ran
Token ID4966
Feature activation-3.65e-01

Pos logit contributions
ifying+0.001
igsaw+0.001
years+0.001
chie+0.001
 hips+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token out
Token ID503
Feature activation+6.40e-01

Pos logit contributions
 idea+0.016
 bal+0.013
crow+0.013
 doctors+0.013
 cows+0.012
Neg logit contributions
ifying-0.015
igsaw-0.012
years-0.011
chie-0.011
 hips-0.010
Token,
Token ID11
Feature activation-2.14e-01

Pos logit contributions
 idea+0.006
 bal+0.005
crow+0.005
 doctors+0.005
 cows+0.004
Neg logit contributions
ifying-0.010
igsaw-0.008
years-0.008
chie-0.008
jamin-0.008
Token the
Token ID262
Feature activation+6.45e-01

Pos logit contributions
 idea+0.007
 bal+0.006
crow+0.006
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.008
jamin-0.008
Token mystery
Token ID10715
Feature activation-1.19e+00

Pos logit contributions
ifying+0.032
igsaw+0.026
years+0.026
chie+0.025
jamin+0.024
Neg logit contributions
 idea-0.023
 bal-0.018
 doctors-0.018
 people-0.018
crow-0.017
Token was
Token ID373
Feature activation-7.84e-02

Pos logit contributions
 idea+0.046
 bal+0.041
 cows+0.040
crow+0.040
 doctors+0.039
Neg logit contributions
ifying-0.046
igsaw-0.039
chie-0.037
years-0.035
 seeking-0.034
Token never
Token ID1239
Feature activation+5.41e-01

Pos logit contributions
 idea+0.003
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.003
years-0.003
chie-0.003
jamin-0.003
Token solved
Token ID16019
Feature activation-8.72e-02

Pos logit contributions
ifying+0.026
igsaw+0.022
years+0.021
chie+0.021
jamin+0.020
Neg logit contributions
 idea-0.019
 bal-0.015
 doctors-0.015
crow-0.015
 people-0.015
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 idea+0.003
 bal+0.003
crow+0.003
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.003
years-0.003
chie-0.003
jamin-0.003
Token<|endoftext|>
Token ID50256
Feature activation+6.45e-01

Pos logit contributions
ifying+0.062
years+0.052
chie+0.051
igsaw+0.051
 hips+0.049
Neg logit contributions
 idea-0.028
crow-0.023
 bal-0.022
 People-0.022
 understand-0.021
TokenOne
Token ID3198
Feature activation+4.27e-01

Pos logit contributions
ifying+0.033
igsaw+0.027
years+0.027
chie+0.027
 hips+0.025
Neg logit contributions
 idea-0.021
crow-0.017
 bal-0.016
 people-0.016
 doctors-0.016
Token day
Token ID1110
Feature activation+2.29e-01

Pos logit contributions
ifying+0.032
igsaw+0.028
years+0.028
chie+0.028
 hips+0.026
Neg logit contributions
 idea-0.004
 bal-0.001
crow-0.001
 doctors-0.001
 people-0.001
Token,
Token ID11
Feature activation+5.16e-01

Pos logit contributions
ifying+0.013
igsaw+0.011
years+0.011
chie+0.010
jamin+0.010
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token the
Token ID262
Feature activation-5.48e-01

Pos logit contributions
 idea+0.015
 bal+0.013
crow+0.013
cut+0.012
 doctors+0.012
Neg logit contributions
ifying-0.018
igsaw-0.015
years-0.014
 seeking-0.014
jamin-0.013
Token bench
Token ID7624
Feature activation+1.67e-01

Pos logit contributions
 idea+0.018
crow+0.014
 bal+0.014
cut+0.013
ighters+0.013
Neg logit contributions
ifying-0.028
igsaw-0.022
years-0.021
chie-0.021
jamin-0.020
Token,
Token ID11
Feature activation-9.18e-02

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.005
 bal-0.004
crow-0.004
 doctors-0.004
 cows-0.003
Token they
Token ID484
Feature activation-4.50e-01

Pos logit contributions
 idea+0.003
crow+0.003
 bal+0.003
 doctors+0.003
 cows+0.003
Neg logit contributions
ifying-0.004
igsaw-0.003
years-0.003
chie-0.003
 hips-0.003
Token saw
Token ID2497
Feature activation-3.93e-01

Pos logit contributions
 idea+0.019
crow+0.016
 bal+0.016
 doctors+0.015
cut+0.015
Neg logit contributions
ifying-0.019
years-0.015
igsaw-0.014
chie-0.014
jamin-0.014
Token an
Token ID281
Feature activation-2.02e+00

Pos logit contributions
 idea+0.017
 bal+0.015
crow+0.014
cut+0.014
 pilot+0.013
Neg logit contributions
ifying-0.015
years-0.012
igsaw-0.012
chie-0.010
jamin-0.010
Token ice
Token ID4771
Feature activation+2.04e-01

Pos logit contributions
 idea+0.068
 bal+0.066
ighters+0.065
crow+0.064
 whenever+0.064
Neg logit contributions
ifying-0.070
years-0.068
chie-0.064
igsaw-0.063
jamin-0.056
Token cream
Token ID8566
Feature activation-3.66e-01

Pos logit contributions
ifying+0.009
igsaw+0.008
years+0.008
chie+0.007
jamin+0.007
Neg logit contributions
 idea-0.008
 bal-0.006
crow-0.006
 doctors-0.006
 cows-0.006
Token truck
Token ID7779
Feature activation+1.37e-01

Pos logit contributions
 idea+0.014
crow+0.011
 understand+0.011
 bal+0.011
 doctors+0.011
Neg logit contributions
ifying-0.017
igsaw-0.013
years-0.013
chie-0.013
 hips-0.012
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.006
jamin+0.005
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token Lily
Token ID20037
Feature activation+3.38e-01

Pos logit contributions
 idea+0.009
 bal+0.007
crow+0.007
 doctors+0.007
 understand+0.007
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
Token's
Token ID338
Feature activation-3.53e-01

Pos logit contributions
ifying+0.004
igsaw+0.004
years+0.003
chie+0.003
jamin+0.003
Neg logit contributions
 idea-0.003
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.002
Token mom
Token ID1995
Feature activation+3.20e-01

Pos logit contributions
 idea+0.009
crow+0.008
 bal+0.007
 understand+0.007
cut+0.007
Neg logit contributions
ifying-0.019
igsaw-0.017
years-0.016
chie-0.015
 hips-0.015
Tokenmy
Token ID1820
Feature activation-1.21e-01

Pos logit contributions
ifying+0.016
igsaw+0.013
years+0.012
chie+0.012
 hips+0.011
Neg logit contributions
 idea-0.011
 bal-0.009
 doctors-0.009
crow-0.009
 cows-0.009
Token told
Token ID1297
Feature activation+1.30e-01

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.005
igsaw-0.004
years-0.004
chie-0.004
jamin-0.004
Token her
Token ID607
Feature activation+6.65e-02

Pos logit contributions
ifying+0.007
igsaw+0.006
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.004
 bal-0.003
crow-0.003
 doctors-0.003
 cows-0.003
Token that
Token ID326
Feature activation-1.73e+00

Pos logit contributions
ifying+0.003
igsaw+0.003
years+0.003
chie+0.003
jamin+0.002
Neg logit contributions
 idea-0.002
 bal-0.002
crow-0.002
 doctors-0.002
 cows-0.002
Token they
Token ID484
Feature activation+6.05e-01

Pos logit contributions
 idea+0.070
crow+0.060
 bal+0.055
age+0.055
aw+0.055
Neg logit contributions
ifying-0.063
igsaw-0.053
 seeking-0.053
years-0.053
 nowhere-0.048
Token were
Token ID547
Feature activation-7.34e-01

Pos logit contributions
ifying+0.028
igsaw+0.024
years+0.022
chie+0.022
jamin+0.021
Neg logit contributions
 idea-0.023
 understand-0.018
 doctors-0.018
 people-0.018
 bal-0.018
Token going
Token ID1016
Feature activation+9.00e-01

Pos logit contributions
 idea+0.026
crow+0.023
 bal+0.023
cut+0.021
 understand+0.020
Neg logit contributions
ifying-0.034
years-0.029
igsaw-0.026
chie-0.026
 seeking-0.026
Token to
Token ID284
Feature activation-2.24e-01

Pos logit contributions
ifying+0.046
igsaw+0.040
years+0.038
chie+0.038
jamin+0.036
Neg logit contributions
 idea-0.029
 doctors-0.025
 people-0.024
 cows-0.023
 bal-0.023
Token have
Token ID423
Feature activation+6.78e-01

Pos logit contributions
 idea+0.008
 bal+0.007
crow+0.007
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.010
igsaw-0.009
chie-0.008
years-0.008
jamin-0.008
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
ifying+0.014
igsaw+0.012
years+0.012
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.009
 bal-0.007
crow-0.007
 doctors-0.006
 cows-0.006
Token tried
Token ID3088
Feature activation+5.21e-01

Pos logit contributions
ifying+0.020
igsaw+0.017
years+0.016
chie+0.016
jamin+0.015
Neg logit contributions
 idea-0.015
 bal-0.013
 doctors-0.013
crow-0.012
 cows-0.012
Token to
Token ID284
Feature activation-1.28e-01

Pos logit contributions
ifying+0.026
igsaw+0.021
years+0.021
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.018
 bal-0.015
 doctors-0.015
 understand-0.015
crow-0.015
Token make
Token ID787
Feature activation-1.26e+00

Pos logit contributions
 idea+0.005
crow+0.004
 bal+0.004
 doctors+0.004
 cows+0.004
Neg logit contributions
ifying-0.006
igsaw-0.005
chie-0.005
years-0.005
 hips-0.004
Token him
Token ID683
Feature activation-3.25e-01

Pos logit contributions
 idea+0.043
crow+0.040
cut+0.039
 bal+0.037
 pilot+0.037
Neg logit contributions
ifying-0.050
igsaw-0.045
years-0.043
chie-0.041
 hips-0.040
Token feel
Token ID1254
Feature activation-2.06e+00

Pos logit contributions
 idea+0.012
crow+0.010
 bal+0.009
 cows+0.009
 doctors+0.009
Neg logit contributions
ifying-0.015
igsaw-0.012
chie-0.012
years-0.012
 hips-0.011
Token better
Token ID1365
Feature activation+4.25e-02

Pos logit contributions
 bidder+0.079
 pilot+0.078
 idea+0.078
ook+0.076
 cows+0.075
Neg logit contributions
ifying-0.067
run-0.053
 sigh-0.052
 swollen-0.047
 freshly-0.046
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
ifying+0.002
igsaw+0.002
years+0.002
chie+0.002
 hips+0.002
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token
Token ID198
Feature activation-2.44e-01

Pos logit contributions
 idea+0.013
 bal+0.010
crow+0.010
 doctors+0.010
 pilot+0.010
Neg logit contributions
ifying-0.014
igsaw-0.012
years-0.012
 hips-0.011
chie-0.011
Token
Token ID198
Feature activation+4.56e-01

Pos logit contributions
 idea+0.009
 bal+0.007
crow+0.007
 doctors+0.007
 cows+0.007
Neg logit contributions
ifying-0.011
igsaw-0.009
years-0.009
chie-0.009
jamin-0.008
TokenThe
Token ID464
Feature activation-7.45e-01

Pos logit contributions
ifying+0.025
igsaw+0.021
years+0.020
chie+0.020
jamin+0.019
Neg logit contributions
 idea-0.013
crow-0.011
 bal-0.011
 doctors-0.010
 people-0.010
Token day
Token ID1110
Feature activation-6.01e-01

Pos logit contributions
 idea+0.016
 paramedics+0.014
 doctors+0.013
crow+0.013
ighters+0.013
Neg logit contributions
ifying-0.045
igsaw-0.042
jamin-0.038
years-0.038
chie-0.037
Token on
Token ID319
Feature activation-8.37e-02

Pos logit contributions
 idea+0.023
 cows+0.018
 doctors+0.018
cut+0.018
crow+0.018
Neg logit contributions
ifying-0.028
years-0.022
igsaw-0.022
chie-0.021
 hips-0.020
Token,
Token ID11
Feature activation-6.38e-01

Pos logit contributions
 idea+0.003
 bal+0.002
crow+0.002
 doctors+0.002
 cows+0.002
Neg logit contributions
ifying-0.004
igsaw-0.004
years-0.003
chie-0.003
 hips-0.003
Token Lily
Token ID20037
Feature activation-1.34e+00

Pos logit contributions
 idea+0.027
crow+0.024
 bal+0.023
 cows+0.023
cut+0.023
Neg logit contributions
ifying-0.024
igsaw-0.021
years-0.019
chie-0.018
-0.017
Token knew
Token ID2993
Feature activation-3.71e-01

Pos logit contributions
 idea+0.053
 paramedics+0.044
crow+0.042
 doctors+0.041
 emergency+0.041
Neg logit contributions
ifying-0.052
years-0.046
jamin-0.044
ll-0.043
chie-0.041
Token that
Token ID326
Feature activation-2.32e+00

Pos logit contributions
 idea+0.014
crow+0.011
 bal+0.011
 cows+0.011
 doctors+0.011
Neg logit contributions
ifying-0.016
years-0.013
igsaw-0.013
chie-0.013
run-0.012
Token whenever
Token ID8797
Feature activation-8.79e-01

Pos logit contributions
 idea+0.092
crow+0.078
isc+0.073
ook+0.073
 paramedics+0.071
Neg logit contributions
ifying-0.077
years-0.074
 seeking-0.074
 nowhere-0.071
 queens-0.062
Token she
Token ID673
Feature activation+6.01e-01

Pos logit contributions
 idea+0.033
crow+0.027
cut+0.026
 bal+0.026
 paramedics+0.025
Neg logit contributions
ifying-0.037
years-0.032
igsaw-0.032
chie-0.030
 seeking-0.029
Token felt
Token ID2936
Feature activation-1.26e+00

Pos logit contributions
ifying+0.030
igsaw+0.026
years+0.024
chie+0.024
 hips+0.023
Neg logit contributions
 idea-0.020
 bal-0.016
crow-0.016
 doctors-0.016
 people-0.016
Token scared
Token ID12008
Feature activation+3.25e-02

Pos logit contributions
 idea+0.044
cut+0.037
 pilot+0.036
crow+0.036
 bidder+0.036
Neg logit contributions
ifying-0.056
igsaw-0.044
years-0.043
chie-0.042
run-0.042
Token or
Token ID393
Feature activation+4.54e-01

Pos logit contributions
ifying+0.002
igsaw+0.001
years+0.001
chie+0.001
jamin+0.001
Neg logit contributions
 idea-0.001
 bal-0.001
crow-0.001
 doctors-0.001
 cows-0.001
Token see
Token ID766
Feature activation-3.83e-01

Pos logit contributions
ifying+0.015
igsaw+0.013
years+0.013
chie+0.012
jamin+0.011
Neg logit contributions
 idea-0.010
 bal-0.008
crow-0.007
 doctors-0.007
 understand-0.007
Token it
Token ID340
Feature activation+6.85e-01

Pos logit contributions
 idea+0.011
crow+0.008
 bal+0.008
 paramedics+0.007
 doctors+0.007
Neg logit contributions
ifying-0.021
years-0.018
igsaw-0.018
chie-0.017
 hips-0.016
Token,
Token ID11
Feature activation+1.69e-01

Pos logit contributions
ifying+0.037
igsaw+0.032
years+0.031
chie+0.030
jamin+0.029
Neg logit contributions
 idea-0.021
 bal-0.016
 doctors-0.015
 people-0.015
crow-0.015
Token please
Token ID3387
Feature activation-1.79e-01

Pos logit contributions
ifying+0.008
igsaw+0.007
years+0.006
chie+0.006
jamin+0.006
Neg logit contributions
 idea-0.006
 bal-0.005
crow-0.005
 doctors-0.005
 cows-0.005
Token?"
Token ID1701
Feature activation+6.06e-01

Pos logit contributions
 idea+0.006
crow+0.005
 bal+0.005
 doctors+0.005
 cows+0.005
Neg logit contributions
ifying-0.009
igsaw-0.007
years-0.007
chie-0.007
 hips-0.006
Token
Token ID220
Feature activation-1.95e+00

Pos logit contributions
ifying+0.030
chie+0.025
years+0.025
igsaw+0.024
run+0.021
Neg logit contributions
 idea-0.021
 bal-0.017
crow-0.017
 doctors-0.017
 people-0.016
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
 idea+0.089
 ostr+0.081
crow+0.081
 doctors+0.080
 bal+0.078
Neg logit contributions
ifying-0.063
igsaw-0.053
run-0.046
jamin-0.046
-0.045
TokenScar
Token ID44433
Feature activation+1.89e+00

Pos logit contributions
 idea+0.007
crow+0.006
 bal+0.006
 doctors+0.006
 cows+0.006
Neg logit contributions
ifying-0.009
igsaw-0.008
years-0.008
chie-0.007
jamin-0.007
Tokenlett
Token ID15503
Feature activation+8.94e-01

Pos logit contributions
ifying+0.081
igsaw+0.068
chie+0.064
years+0.064
run+0.057
Neg logit contributions
 idea-0.067
cat-0.061
 pilot-0.061
 understand-0.059
cut-0.059
Token proudly
Token ID23017
Feature activation+4.69e-01

Pos logit contributions
ifying+0.042
igsaw+0.035
years+0.034
chie+0.033
run+0.029
Neg logit contributions
 idea-0.034
 bal-0.028
crow-0.027
 understand-0.026
 people-0.026
Token handed
Token ID10158
Feature activation-1.35e-01

Pos logit contributions
ifying+0.022
igsaw+0.018
years+0.017
chie+0.017
jamin+0.015
Neg logit contributions
 idea-0.018
 bal-0.015
crow-0.014
 doctors-0.014
 understand-0.014