TOP ACTIVATIONS
MAX = 0.086

to yawn . It was summer and it was a very
ended . Finally it was summer and the garden was full
a forest . It was summer and the weather was nice
able sack . It was summer , so the sack and
night , Sara . Good night , Fl uffy . I
said , " It 's summer !" "
, t eddy . Good night , mom my . Good
love . That evening , Amy went for a
" I wish it was summer . I don 't like
When it was night , Tim saw the lights
to play . It was summer , and the sun was
night , Will . Good night , Mr . Bear ,"
. That evening , Th ump was talking
upset . That night , May smelled something funny
. That night , the boy fell asleep
tomorrow . That night , Bob had a dream
help . That evening , they started to increase
her mom . It was noon and the sun was shining
joy . That night , Maya was visited by
together . That night , the crab went to

MIN ACTIVATIONS
MIN = -0.109

noticed something strange . A gol fer was hitting golf balls
wanted to be a good gol fer like his dad .
! I 'm the best gol fer in the world and
! I 'm the best gol fer in the world !"
. But we need more le mons ." Jane and her
one of Mary 's most tre asured possessions and she never
to school with a blue er aser . He wanted to
pond ?" The gol fer replied , " I
Tom called out to the gol fer " Hey , why
. They asked a zo oke eper , who pointed them
, she found a big er aser on her teacher 's
. They ask a zo oke eper what it means .
and circles and triangles and rect angles . She made her
room and saw a big er aser on the teacher 's
named Tim found a big er aser . It was not
you name all of these le mons ?" Lily looked at
. I have a new er aser for you . It
took a deep breath and le aped over the stack .
Sus ie had a furry er aser in her hand and
of the house 's most tre asured treasures . <|endoftext|> Once

INTERVAL 0.037 - 0.086
CONTAINS 0.712%

" How much is this dress ?" she asked the shop
you and I won 't let anything bad happen ."
the morning . But the wife didn 't agree . She
to do . There were flowers to smell , trees to
up and down . It makes a nice sound .

INTERVAL -0.012 - 0.037
CONTAINS 80.550%

You can have some soup . You are a good girl
my smiled and said , " I know , sweet ie
nodded and said , " OK , Mom my ." <|endoftext|>
the snacks . Soon Mo started to understand
felt funny . The letter had different shapes on it .

INTERVAL -0.060 - -0.012
CONTAINS 18.709%

are very kind and smart . Let 's go and pay
<|endoftext|> One day , George was playing in the living room
walking would be fun so he opened the door and left
to help him find his l oll ipop .
just like the sock ! <|endoftext|> Once upon a time ,

INTERVAL -0.109 - -0.060
CONTAINS 0.029%

But one day , Lily accidentally stepped on Buzz . She
was a charming three - year old who loved playing with
your hand ?" the zo oke eper asked .
They wanted to find something fun . When
can I slide with the er aser too ?"
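
The four INTERVAL ranges above look like equal-width bins spanning MIN (-0.109) to MAX (0.086). This is an inference from the printed numbers, not something the dashboard states. A minimal Python sketch, using a hypothetical helper named `equal_width_bins`, recomputes the bin edges so they can be compared with the printed ones (they agree to within three-decimal rounding) and checks that the four CONTAINS shares add up to 100%:

```python
def equal_width_bins(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width (lower, upper) ranges.

    Ranges are returned from the highest bin to the lowest, matching the
    order in which the INTERVAL blocks are listed. The helper name and the
    equal-width assumption are illustrative, not taken from the dashboard.
    """
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    return [(edges[i], edges[i + 1]) for i in reversed(range(n_bins))]


# MIN and MAX as printed in the listing.
bins = equal_width_bins(-0.109, 0.086)
for lower, upper in bins:
    print(f"INTERVAL {lower:.5f} - {upper:.5f}")

# CONTAINS percentages as printed, highest interval first.
shares = [0.712, 80.550, 18.709, 0.029]
print(f"total share: {sum(shares):.3f}%")
```

Under this reading, roughly 99% of activations fall in the two middle bins, so the TOP and MIN examples describe rare tail behaviour of the feature.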
Token to
Token ID284
Feature activation-5.02e-03

Pos logit contributions
ris+0.043
spe+0.039
 gol+0.038
 narrative+0.038
 nations+0.037
Neg logit contributions
cel-0.049
 forehead-0.045
 lap-0.044
 chin-0.042
tail-0.040
Token yawn
Token ID40908
Feature activation+3.56e-02

Pos logit contributions
ris+0.011
 narrative+0.010
spe+0.010
 gol+0.010
 nations+0.010
Neg logit contributions
cel-0.006
 lap-0.005
 forehead-0.005
 chin-0.004
amas-0.004
Token.
Token ID13
Feature activation+6.25e-03

Pos logit contributions
cel+0.065
 lap+0.060
 chin+0.058
 forehead+0.057
chief+0.054
Neg logit contributions
ris-0.052
spe-0.050
ole-0.046
 gol-0.046
 narrative-0.042
Token It
Token ID632
Feature activation+1.07e-02

Pos logit contributions
cel+0.010
 forehead+0.009
tail+0.008
 chin+0.008
 lap+0.008
Neg logit contributions
ris-0.012
spe-0.011
 gol-0.011
 narrative-0.010
 nations-0.010
Token was
Token ID373
Feature activation+1.85e-02

Pos logit contributions
cel+0.020
 lap+0.018
 forehead+0.018
 chin+0.017
amas+0.017
Neg logit contributions
ris-0.016
spe-0.015
 narrative-0.014
igators-0.013
ile-0.013
Token summer
Token ID3931
Feature activation+8.60e-02

Pos logit contributions
cel+0.026
 forehead+0.023
 lap+0.023
 chin+0.021
tail+0.020
Neg logit contributions
ris-0.038
 gol-0.035
 narrative-0.034
spe-0.034
 nations-0.033
Token and
Token ID290
Feature activation+2.22e-02

Pos logit contributions
cel+0.139
 forehead+0.125
 lap+0.122
 chin+0.115
tail+0.112
Neg logit contributions
ris-0.153
spe-0.138
 gol-0.135
 narrative-0.131
 nations-0.124
Token it
Token ID340
Feature activation+1.83e-02

Pos logit contributions
cel+0.036
 forehead+0.032
 lap+0.032
 chin+0.030
tail+0.029
Neg logit contributions
ris-0.039
spe-0.037
 narrative-0.036
 gol-0.036
 nations-0.035
Token was
Token ID373
Feature activation+1.14e-02

Pos logit contributions
cel+0.032
 lap+0.030
 forehead+0.030
 chin+0.028
tail+0.027
Neg logit contributions
ris-0.029
spe-0.027
 narrative-0.026
 gol-0.025
igators-0.024
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
cel+0.017
 lap+0.015
 forehead+0.015
 chin+0.014
tail+0.013
Neg logit contributions
ris-0.022
 gol-0.020
spe-0.020
 narrative-0.020
 nations-0.019
Token very
Token ID845
Feature activation+2.12e-02

Pos logit contributions
cel+0.021
 forehead+0.020
 lap+0.019
 chin+0.018
 cheeks+0.016
Neg logit contributions
ris-0.036
 gol-0.033
 narrative-0.032
spe-0.032
 nations-0.031
Token ended
Token ID4444
Feature activation+1.94e-02

Pos logit contributions
cel+0.081
 forehead+0.077
 lap+0.076
 chin+0.070
tail+0.068
Neg logit contributions
ris-0.075
spe-0.068
 gol-0.068
 narrative-0.066
 nations-0.063
Token.
Token ID13
Feature activation+4.91e-03

Pos logit contributions
cel+0.028
 lap+0.026
 forehead+0.025
 chin+0.023
 cheeks+0.022
Neg logit contributions
ris-0.039
spe-0.035
 gol-0.034
 narrative-0.034
 nations-0.033
Token Finally
Token ID9461
Feature activation+4.74e-03

Pos logit contributions
cel+0.007
 forehead+0.006
 chin+0.006
 lap+0.005
ze+0.005
Neg logit contributions
ris-0.010
 gol-0.009
spe-0.009
 narrative-0.009
 nations-0.008
Token it
Token ID340
Feature activation+1.27e-02

Pos logit contributions
cel+0.006
 forehead+0.005
 lap+0.005
 chin+0.005
 cheeks+0.004
Neg logit contributions
ris-0.010
 gol-0.010
 narrative-0.009
spe-0.009
 nations-0.009
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
cel+0.022
 lap+0.021
 forehead+0.021
 chin+0.019
amas+0.018
Neg logit contributions
ris-0.022
spe-0.019
 narrative-0.019
 gol-0.018
 nations-0.018
Token summer
Token ID3931
Feature activation+8.45e-02

Pos logit contributions
cel+0.031
 forehead+0.029
 lap+0.029
 chin+0.028
 cheeks+0.026
Neg logit contributions
ris-0.027
 gol-0.025
 narrative-0.024
spe-0.024
 nations-0.023
Token and
Token ID290
Feature activation+2.69e-02

Pos logit contributions
cel+0.133
 forehead+0.126
 lap+0.126
 chin+0.116
tail+0.111
Neg logit contributions
ris-0.152
 gol-0.136
spe-0.133
 narrative-0.132
 nations-0.123
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
cel+0.044
 lap+0.040
 forehead+0.039
 chin+0.036
amas+0.035
Neg logit contributions
ris-0.047
spe-0.046
 gol-0.044
 narrative-0.044
 nations-0.043
Token garden
Token ID11376
Feature activation+3.99e-02

Pos logit contributions
cel+0.014
 forehead+0.013
 lap+0.013
 chin+0.012
 cheeks+0.011
Neg logit contributions
ris-0.023
 gol-0.021
 narrative-0.021
spe-0.021
 nations-0.020
Token was
Token ID373
Feature activation+1.50e-02

Pos logit contributions
cel+0.064
 forehead+0.060
 lap+0.059
 chin+0.056
tail+0.052
Neg logit contributions
ris-0.071
spe-0.065
 narrative-0.064
 gol-0.064
 nations-0.061
Token full
Token ID1336
Feature activation+3.29e-02

Pos logit contributions
cel+0.022
 forehead+0.020
 lap+0.020
 chin+0.018
 cheeks+0.017
Neg logit contributions
ris-0.030
 gol-0.027
 narrative-0.027
spe-0.027
 nations-0.026
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
cel+0.017
 forehead+0.016
 lap+0.015
 chin+0.015
 cheeks+0.014
Neg logit contributions
ris-0.019
 gol-0.018
spe-0.017
 narrative-0.017
 nations-0.017
Token forest
Token ID8222
Feature activation+3.96e-03

Pos logit contributions
cel+0.025
 forehead+0.023
 lap+0.023
 chin+0.021
 cheeks+0.020
Neg logit contributions
ris-0.039
 gol-0.036
spe-0.035
 narrative-0.035
 nations-0.034
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
cel+0.003
 lap+0.003
 forehead+0.003
 chin+0.003
tail+0.002
Neg logit contributions
ris-0.010
 gol-0.010
 narrative-0.010
 nations-0.009
spe-0.009
Token It
Token ID632
Feature activation+1.26e-02

Pos logit contributions
cel+0.016
 forehead+0.015
 lap+0.014
 chin+0.013
ze+0.012
Neg logit contributions
ris-0.022
 gol-0.020
spe-0.020
 narrative-0.020
 nations-0.019
Token was
Token ID373
Feature activation+2.14e-02

Pos logit contributions
cel+0.018
 lap+0.016
 forehead+0.016
 chin+0.014
tail+0.013
Neg logit contributions
ris-0.026
 gol-0.025
 narrative-0.024
 nations-0.023
spe-0.023
Token summer
Token ID3931
Feature activation+8.35e-02

Pos logit contributions
cel+0.023
 lap+0.022
 forehead+0.021
 chin+0.019
 warm+0.019
Neg logit contributions
ris-0.050
 gol-0.046
 narrative-0.046
spe-0.045
 nations-0.044
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
cel+0.139
 forehead+0.138
 lap+0.127
 chin+0.121
chief+0.118
Neg logit contributions
ris-0.139
 gol-0.122
spe-0.119
 nations-0.117
igators-0.115
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
cel+0.035
 forehead+0.033
 lap+0.032
 chin+0.030
tail+0.028
Neg logit contributions
spe-0.037
ris-0.036
 narrative-0.035
 nations-0.034
igators-0.034
Token weather
Token ID6193
Feature activation+4.74e-02

Pos logit contributions
cel+0.029
 lap+0.027
 forehead+0.026
tail+0.025
 chin+0.025
Neg logit contributions
ris-0.028
 narrative-0.028
 gol-0.028
 nations-0.028
spe-0.026
Token was
Token ID373
Feature activation+1.42e-02

Pos logit contributions
cel+0.078
 forehead+0.074
 lap+0.072
 chin+0.068
tail+0.064
Neg logit contributions
ris-0.084
spe-0.076
 gol-0.075
 narrative-0.074
 nations-0.071
Token nice
Token ID3621
Feature activation+3.46e-02

Pos logit contributions
cel+0.020
 forehead+0.018
 lap+0.018
 chin+0.017
 cheeks+0.016
Neg logit contributions
ris-0.029
 gol-0.026
 narrative-0.026
spe-0.026
 nations-0.025
Tokenable
Token ID540
Feature activation+1.03e-02

Pos logit contributions
ris+0.007
spe+0.006
 narrative+0.006
 gol+0.006
 nations+0.006
Neg logit contributions
cel-0.013
 lap-0.012
 forehead-0.012
 chin-0.011
tail-0.011
Token sack
Token ID23704
Feature activation-3.71e-03

Pos logit contributions
cel+0.012
 forehead+0.011
 lap+0.011
 chin+0.010
 shoulder+0.009
Neg logit contributions
ris-0.023
 narrative-0.022
spe-0.022
 gol-0.021
 nations-0.021
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
ris+0.009
 narrative+0.008
spe+0.008
 gol+0.008
 nations+0.008
Neg logit contributions
cel-0.004
 lap-0.004
 forehead-0.004
 shoulder-0.003
 chin-0.003
Token It
Token ID632
Feature activation+1.24e-02

Pos logit contributions
cel+0.023
 forehead+0.022
 lap+0.019
 chin+0.019
 cheek+0.017
Neg logit contributions
ris-0.043
 gol-0.040
spe-0.039
 nations-0.038
 narrative-0.038
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
cel+0.017
 lap+0.015
 forehead+0.014
 chin+0.013
 cozy+0.012
Neg logit contributions
ris-0.026
 gol-0.025
 narrative-0.024
 nations-0.024
spe-0.024
Token summer
Token ID3931
Feature activation+8.26e-02

Pos logit contributions
cel+0.024
 forehead+0.022
 lap+0.022
 chin+0.019
 warm+0.018
Neg logit contributions
ris-0.048
 narrative-0.045
 gol-0.045
spe-0.044
 nations-0.043
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
cel+0.150
 forehead+0.144
 lap+0.135
 chin+0.130
chief+0.128
Neg logit contributions
ris-0.129
 gol-0.114
spe-0.108
 narrative-0.106
 nations-0.103
Token so
Token ID523
Feature activation+3.05e-02

Pos logit contributions
cel+0.027
 forehead+0.024
 lap+0.024
 chin+0.022
 cheeks+0.021
Neg logit contributions
ris-0.041
 gol-0.038
 narrative-0.038
spe-0.037
 nations-0.036
Token the
Token ID262
Feature activation+6.64e-03

Pos logit contributions
cel+0.052
 forehead+0.047
 lap+0.047
 chin+0.044
amas+0.042
Neg logit contributions
ris-0.053
 gol-0.050
spe-0.050
 narrative-0.048
 nations-0.047
Token sack
Token ID23704
Feature activation+3.50e-03

Pos logit contributions
cel+0.010
 forehead+0.009
 lap+0.009
 chin+0.008
tail+0.008
Neg logit contributions
ris-0.013
 gol-0.012
 narrative-0.012
 nations-0.012
spe-0.011
Token and
Token ID290
Feature activation+5.58e-03

Pos logit contributions
cel+0.004
 lap+0.004
 forehead+0.004
 chin+0.004
 shoulder+0.003
Neg logit contributions
ris-0.008
 gol-0.007
 narrative-0.007
 nations-0.007
spe-0.007
Token night
Token ID1755
Feature activation+5.95e-02

Pos logit contributions
cel+0.022
 forehead+0.021
 lap+0.020
 chin+0.020
 cheeks+0.018
Neg logit contributions
ris-0.029
spe-0.027
 gol-0.026
 narrative-0.026
 nations-0.026
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
cel+0.089
 forehead+0.081
 lap+0.080
 chin+0.073
tail+0.071
Neg logit contributions
ris-0.111
 gol-0.101
spe-0.100
 narrative-0.096
 nations-0.094
Token Sara
Token ID24799
Feature activation+3.38e-02

Pos logit contributions
cel+0.013
 forehead+0.012
 lap+0.011
 chin+0.010
 cheeks+0.010
Neg logit contributions
ris-0.025
 gol-0.023
 narrative-0.023
spe-0.023
 nations-0.022
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
cel+0.056
 lap+0.053
 forehead+0.051
 cheeks+0.048
 chin+0.046
Neg logit contributions
ris-0.059
 gol-0.051
 narrative-0.049
ile-0.049
spe-0.048
Token Good
Token ID4599
Feature activation+3.27e-02

Pos logit contributions
ris+0.009
 gol+0.009
 narrative+0.009
spe+0.008
 nations+0.008
Neg logit contributions
 forehead-0.007
cel-0.007
 lap-0.007
 chin-0.007
 hips-0.006
Token night
Token ID1755
Feature activation+8.16e-02

Pos logit contributions
cel+0.062
 lap+0.055
 forehead+0.052
 chin+0.050
chief+0.049
Neg logit contributions
ris-0.052
 narrative-0.048
 gol-0.047
spe-0.045
 nations-0.044
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
cel+0.134
 lap+0.119
 forehead+0.119
chief+0.113
 chin+0.111
Neg logit contributions
ris-0.132
spe-0.123
 narrative-0.118
 gol-0.114
 nations-0.110
Token Fl
Token ID1610
Feature activation+1.14e-02

Pos logit contributions
cel+0.018
 forehead+0.016
 lap+0.016
 chin+0.014
tail+0.014
Neg logit contributions
ris-0.026
 gol-0.024
 narrative-0.024
spe-0.024
 nations-0.023
Tokenuffy
Token ID15352
Feature activation+2.06e-02

Pos logit contributions
cel+0.020
 lap+0.019
 forehead+0.019
 chin+0.017
tail+0.017
Neg logit contributions
ris-0.019
 gol-0.017
 narrative-0.016
spe-0.016
 nations-0.015
Token.
Token ID13
Feature activation-2.12e-03

Pos logit contributions
cel+0.030
 lap+0.028
 forehead+0.027
 cheeks+0.025
 chin+0.025
Neg logit contributions
ris-0.037
ile-0.033
spe-0.033
 narrative-0.033
 gol-0.032
Token I
Token ID314
Feature activation+1.17e-02

Pos logit contributions
ris+0.004
 narrative+0.003
 gol+0.003
spe+0.003
igators+0.003
Neg logit contributions
cel-0.003
 forehead-0.003
 lap-0.003
 chin-0.003
 hips-0.003
Token said
Token ID531
Feature activation-2.61e-03

Pos logit contributions
ris+0.005
 gol+0.005
spe+0.004
 nations+0.004
 narrative+0.004
Neg logit contributions
cel-0.007
 lap-0.007
 forehead-0.007
 chin-0.006
amas-0.006
Token,
Token ID11
Feature activation-5.72e-03

Pos logit contributions
ris+0.004
spe+0.004
 gol+0.004
 narrative+0.004
 nations+0.004
Neg logit contributions
cel-0.004
 lap-0.004
 forehead-0.004
 chin-0.004
amas-0.003
Token "
Token ID366
Feature activation+2.10e-02

Pos logit contributions
 gol+0.008
spe+0.008
ris+0.007
 narrative+0.007
 nations+0.006
Neg logit contributions
cel-0.010
tail-0.009
 lap-0.009
 forehead-0.009
 chin-0.009
TokenIt
Token ID1026
Feature activation+6.25e-03

Pos logit contributions
cel+0.038
 forehead+0.037
 lap+0.036
 chin+0.034
 cheeks+0.033
Neg logit contributions
ris-0.034
 gol-0.031
 narrative-0.030
spe-0.030
 nations-0.027
Token's
Token ID338
Feature activation+1.89e-02

Pos logit contributions
cel+0.009
 lap+0.009
 forehead+0.008
 chin+0.008
 cheeks+0.007
Neg logit contributions
ris-0.011
spe-0.011
igators-0.010
 nations-0.010
 narrative-0.010
Token summer
Token ID3931
Feature activation+8.07e-02

Pos logit contributions
cel+0.032
 lap+0.030
 forehead+0.030
 chin+0.029
 cheeks+0.027
Neg logit contributions
ris-0.032
 narrative-0.028
 gol-0.027
spe-0.027
 nations-0.027
Token!"
Token ID2474
Feature activation+8.68e-03

Pos logit contributions
cel+0.143
 forehead+0.135
 lap+0.128
chief+0.123
 chin+0.121
Neg logit contributions
ris-0.122
spe-0.106
igators-0.104
 narrative-0.098
 nations-0.097
Token
Token ID220
Feature activation+1.87e-02

Pos logit contributions
cel+0.012
 forehead+0.010
tail+0.010
 chin+0.010
ze+0.009
Neg logit contributions
ris-0.015
 gol-0.015
 narrative-0.014
spe-0.013
uzz-0.013
Token\n
Token ID198
Feature activation+5.37e-03

Pos logit contributions
cel+0.030
 forehead+0.028
 chin+0.026
 lap+0.025
 cheeks+0.025
Neg logit contributions
ris-0.034
 gol-0.030
spe-0.030
 narrative-0.029
 nations-0.028
Token\n
Token ID198
Feature activation+1.62e-02

Pos logit contributions
cel+0.008
 forehead+0.008
 chin+0.007
 lap+0.007
 cheeks+0.007
Neg logit contributions
ris-0.010
 gol-0.009
spe-0.009
 narrative-0.009
 nations-0.008
Token"
Token ID1
Feature activation+1.69e-02

Pos logit contributions
cel+0.013
 lap+0.012
 forehead+0.011
 chin+0.010
tail+0.009
Neg logit contributions
ris-0.042
 narrative-0.039
 gol-0.039
spe-0.038
 nations-0.038
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
cel+0.112
 lap+0.109
 forehead+0.109
 upper+0.101
chief+0.099
Neg logit contributions
ris-0.091
 gol-0.079
spe-0.079
guard-0.070
 nations-0.067
Token t
Token ID256
Feature activation-6.86e-03

Pos logit contributions
cel+0.016
 forehead+0.014
 lap+0.014
 chin+0.013
tail+0.012
Neg logit contributions
ris-0.029
 gol-0.027
 narrative-0.027
spe-0.027
 nations-0.026
Tokeneddy
Token ID21874
Feature activation+1.56e-02

Pos logit contributions
ris+0.011
 gol+0.010
 narrative+0.010
spe+0.010
 nations+0.010
Neg logit contributions
cel-0.012
 forehead-0.011
 lap-0.011
 chin-0.011
 cheeks-0.010
Token.
Token ID13
Feature activation-2.40e-03

Pos logit contributions
 lap+0.023
cel+0.022
 hips+0.020
 cheeks+0.020
 chin+0.020
Neg logit contributions
ris-0.027
ile-0.026
 gol-0.023
 narrative-0.023
guard-0.023
Token Good
Token ID4599
Feature activation+3.30e-02

Pos logit contributions
ris+0.004
 narrative+0.004
 gol+0.004
spe+0.003
igators+0.003
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.003
 chin-0.003
 hips-0.003
Token night
Token ID1755
Feature activation+8.00e-02

Pos logit contributions
cel+0.068
 lap+0.060
 forehead+0.055
 hips+0.055
chief+0.054
Neg logit contributions
ris-0.047
 narrative-0.044
 gol-0.042
 nations-0.039
ile-0.038
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
cel+0.122
 lap+0.118
 forehead+0.116
 cheeks+0.113
chief+0.112
Neg logit contributions
ris-0.125
spe-0.112
 gol-0.106
 narrative-0.105
guard-0.102
Token mom
Token ID1995
Feature activation-4.30e-03

Pos logit contributions
cel+0.024
 lap+0.022
 forehead+0.021
 chin+0.020
tail+0.020
Neg logit contributions
ris-0.035
 gol-0.033
 narrative-0.032
spe-0.032
 nations-0.030
Tokenmy
Token ID1820
Feature activation+9.37e-03

Pos logit contributions
ris+0.007
spe+0.006
 nations+0.006
 gol+0.006
ile+0.006
Neg logit contributions
cel-0.008
 lap-0.007
 forehead-0.007
 cheeks-0.007
 chin-0.007
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
cel+0.013
 lap+0.012
 forehead+0.011
 cheeks+0.011
 chin+0.011
Neg logit contributions
ris-0.019
 gol-0.017
spe-0.016
ile-0.016
 nations-0.016
Token Good
Token ID4599
Feature activation+2.17e-02

Pos logit contributions
ris+0.019
 narrative+0.017
 gol+0.016
spe+0.016
igators+0.014
Neg logit contributions
cel-0.019
 forehead-0.018
 hips-0.017
 lap-0.017
 chin-0.017
Token love
Token ID1842
Feature activation+1.39e-02

Pos logit contributions
cel+0.017
 forehead+0.015
 lap+0.015
 chin+0.014
amas+0.014
Neg logit contributions
ris-0.017
 gol-0.015
 narrative-0.015
spe-0.015
 nations-0.014
Token.
Token ID13
Feature activation-2.24e-03

Pos logit contributions
cel+0.016
 forehead+0.015
 lap+0.015
 chin+0.013
 cheeks+0.012
Neg logit contributions
ris-0.031
 gol-0.029
 narrative-0.029
spe-0.028
 nations-0.028
Token\n
Token ID198
Feature activation+9.14e-03

Pos logit contributions
ris+0.005
 gol+0.005
 narrative+0.005
spe+0.005
 nations+0.004
Neg logit contributions
cel-0.003
 forehead-0.002
 lap-0.002
 chin-0.002
 cheeks-0.002
Token\n
Token ID198
Feature activation+2.06e-02

Pos logit contributions
cel+0.013
 lap+0.012
 forehead+0.012
 chin+0.011
tail+0.010
Neg logit contributions
ris-0.018
 gol-0.017
 narrative-0.017
spe-0.017
 nations-0.016
TokenThat
Token ID2504
Feature activation+2.48e-02

Pos logit contributions
 lap+0.022
cel+0.021
ze+0.019
tail+0.018
 forehead+0.018
Neg logit contributions
 gol-0.044
ris-0.044
spe-0.041
 nations-0.041
 narrative-0.040
Token evening
Token ID6180
Feature activation+7.98e-02

Pos logit contributions
cel+0.030
 forehead+0.029
 lap+0.029
 chin+0.027
tail+0.025
Neg logit contributions
ris-0.052
 gol-0.050
spe-0.047
 narrative-0.046
 nations-0.045
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
cel+0.103
 forehead+0.093
 lap+0.093
 chin+0.085
 cheeks+0.078
Neg logit contributions
ris-0.170
 gol-0.157
 narrative-0.155
spe-0.154
 nations-0.150
Token Amy
Token ID14235
Feature activation+5.36e-03

Pos logit contributions
cel+0.031
 forehead+0.029
 lap+0.029
 chin+0.027
 cheeks+0.025
Neg logit contributions
ris-0.042
 gol-0.038
 narrative-0.038
spe-0.037
 nations-0.036
Token went
Token ID1816
Feature activation+1.37e-03

Pos logit contributions
cel+0.009
 forehead+0.008
 lap+0.008
 chin+0.007
amas+0.007
Neg logit contributions
ris-0.010
spe-0.009
 gol-0.008
 narrative-0.008
 nations-0.008
Token for
Token ID329
Feature activation+2.06e-03

Pos logit contributions
cel+0.002
 forehead+0.002
 lap+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.002
 gol-0.002
 narrative-0.002
spe-0.002
 nations-0.002
Token a
Token ID257
Feature activation-3.34e-03

Pos logit contributions
cel+0.004
 chin+0.003
amas+0.003
 lap+0.003
 forehead+0.003
Neg logit contributions
ris-0.003
 narrative-0.003
 gol-0.003
 nations-0.003
 bomb-0.003
Token "
Token ID366
Feature activation+1.36e-02

Pos logit contributions
ris+0.040
 gol+0.038
spe+0.037
 narrative+0.035
 nations+0.035
Neg logit contributions
cel-0.026
 lap-0.025
 forehead-0.025
 chin-0.023
tail-0.021
TokenI
Token ID40
Feature activation+3.20e-03

Pos logit contributions
cel+0.022
 forehead+0.020
 lap+0.020
 chin+0.019
 cheeks+0.018
Neg logit contributions
ris-0.025
 gol-0.023
 narrative-0.022
spe-0.022
 nations-0.021
Token wish
Token ID4601
Feature activation+2.19e-03

Pos logit contributions
cel+0.004
 lap+0.003
 forehead+0.003
 chin+0.003
 feel+0.003
Neg logit contributions
ris-0.007
 narrative-0.007
 nations-0.007
 gol-0.006
spe-0.006
Token it
Token ID340
Feature activation+1.02e-02

Pos logit contributions
cel+0.003
 lap+0.003
 forehead+0.003
 chin+0.003
tail+0.003
Neg logit contributions
 nations-0.003
ris-0.003
 narrative-0.003
 gol-0.003
spe-0.003
Token was
Token ID373
Feature activation+1.25e-02

Pos logit contributions
 lap+0.023
 forehead+0.023
cel+0.023
 chin+0.022
 cheeks+0.021
Neg logit contributions
ris-0.010
spe-0.009
 planned-0.008
igators-0.008
 nations-0.008
Token summer
Token ID3931
Feature activation+7.97e-02

Pos logit contributions
cel+0.022
 lap+0.021
 forehead+0.020
 chin+0.020
 cheeks+0.019
Neg logit contributions
ris-0.020
 gol-0.017
 nations-0.017
 narrative-0.017
spe-0.017
Token.
Token ID13
Feature activation-2.14e-03

Pos logit contributions
cel+0.144
 forehead+0.134
 lap+0.128
chief+0.123
 chin+0.121
Neg logit contributions
ris-0.127
 gol-0.111
spe-0.107
igators-0.103
ile-0.102
Token I
Token ID314
Feature activation+2.38e-03

Pos logit contributions
ris+0.004
 gol+0.003
 narrative+0.003
spe+0.003
igators+0.003
Neg logit contributions
cel-0.003
 forehead-0.003
 chin-0.003
chief-0.003
 lap-0.003
Token don
Token ID836
Feature activation-9.76e-03

Pos logit contributions
cel+0.003
 forehead+0.003
 lap+0.003
 chin+0.003
 feel+0.003
Neg logit contributions
ris-0.005
 narrative-0.005
 gol-0.005
 nations-0.005
spe-0.004
Token't
Token ID470
Feature activation-3.90e-02

Pos logit contributions
ris+0.016
 gol+0.015
 narrative+0.014
spe+0.014
 nations+0.013
Neg logit contributions
cel-0.016
 forehead-0.016
 lap-0.016
 chin-0.015
 cheeks-0.014
Token like
Token ID588
Feature activation-3.48e-03

Pos logit contributions
ris+0.072
 gol+0.065
 narrative+0.062
spe+0.060
 nations+0.059
Neg logit contributions
cel-0.064
 forehead-0.058
 lap-0.058
 chin-0.055
 cheeks-0.053
Token\n
Token ID198
Feature activation-3.22e-03

Pos logit contributions
ris+0.030
 gol+0.027
 narrative+0.027
spe+0.027
 nations+0.026
Neg logit contributions
cel-0.018
 forehead-0.017
 lap-0.017
 chin-0.015
 cheeks-0.014
Token\n
Token ID198
Feature activation+7.94e-03

Pos logit contributions
ris+0.006
 gol+0.006
 narrative+0.006
spe+0.006
 nations+0.005
Neg logit contributions
cel-0.005
 lap-0.004
 forehead-0.004
 chin-0.004
tail-0.004
TokenWhen
Token ID2215
Feature activation-6.46e-03

Pos logit contributions
cel+0.007
 lap+0.006
ze+0.005
tail+0.005
Fl+0.005
Neg logit contributions
ris-0.018
 gol-0.017
 nations-0.017
spe-0.017
 narrative-0.017
Token it
Token ID340
Feature activation+2.53e-02

Pos logit contributions
ris+0.013
 narrative+0.012
spe+0.011
 gol+0.011
ile+0.011
Neg logit contributions
cel-0.009
 lap-0.008
 forehead-0.008
 chin-0.008
 Mitt-0.008
Token was
Token ID373
Feature activation+2.64e-02

Pos logit contributions
cel+0.041
 forehead+0.038
 lap+0.038
 chin+0.035
tail+0.033
Neg logit contributions
ris-0.046
 gol-0.042
spe-0.041
 narrative-0.041
 nations-0.039
Token night
Token ID1755
Feature activation+7.96e-02

Pos logit contributions
cel+0.040
 forehead+0.040
 lap+0.039
 warm+0.037
 chin+0.036
Neg logit contributions
ris-0.047
spe-0.045
 gol-0.043
 narrative-0.042
 nations-0.041
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
cel+0.096
 forehead+0.087
 lap+0.087
 chin+0.079
 cheeks+0.072
Neg logit contributions
ris-0.176
 gol-0.163
 narrative-0.161
spe-0.160
 nations-0.156
Token Tim
Token ID5045
Feature activation-4.93e-03

Pos logit contributions
cel+0.031
 lap+0.028
 forehead+0.028
 chin+0.027
 cheeks+0.025
Neg logit contributions
ris-0.037
 narrative-0.034
 gol-0.034
spe-0.033
 nations-0.032
Token saw
Token ID2497
Feature activation-2.62e-03

Pos logit contributions
ris+0.009
 gol+0.007
spe+0.007
 nations+0.007
 narrative+0.007
Neg logit contributions
cel-0.008
 forehead-0.008
 lap-0.008
 cheeks-0.007
tail-0.007
Token the
Token ID262
Feature activation+4.54e-03

Pos logit contributions
ris+0.006
 narrative+0.005
 gol+0.005
spe+0.005
 nations+0.005
Neg logit contributions
cel-0.003
 forehead-0.003
 lap-0.003
 chin-0.003
 cheeks-0.002
Token lights
Token ID7588
Feature activation+3.47e-03

Pos logit contributions
 forehead+0.005
cel+0.005
 lap+0.005
 chin+0.005
 cheek+0.004
Neg logit contributions
ris-0.010
spe-0.009
 gol-0.009
 narrative-0.009
 nations-0.009
Token to
Token ID284
Feature activation+4.11e-03

Pos logit contributions
ris+0.006
 gol+0.005
 narrative+0.005
spe+0.005
 nations+0.005
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.004
 chin-0.003
 cheeks-0.003
Token play
Token ID711
Feature activation+1.69e-02

Pos logit contributions
cel+0.007
 forehead+0.007
amas+0.006
 cheeks+0.006
 chin+0.006
Neg logit contributions
ris-0.007
 gol-0.007
igators-0.006
 nations-0.006
 narrative-0.006
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
cel+0.027
 forehead+0.026
 lap+0.024
 chin+0.024
 cheeks+0.022
Neg logit contributions
ris-0.030
 gol-0.028
 narrative-0.027
 nations-0.026
ile-0.025
Token It
Token ID632
Feature activation+8.58e-03

Pos logit contributions
cel+0.008
 forehead+0.007
ze+0.007
chief+0.007
ridge+0.007
Neg logit contributions
ris-0.005
 gol-0.005
 clubhouse-0.004
 about-0.004
 planned-0.004
Token was
Token ID373
Feature activation+1.68e-02

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.014
 chin+0.013
 cheeks+0.013
Neg logit contributions
ris-0.014
spe-0.013
 narrative-0.013
 gol-0.013
 nations-0.012
Token summer
Token ID3931
Feature activation+7.94e-02

Pos logit contributions
cel+0.020
 lap+0.018
 forehead+0.018
 chin+0.016
tail+0.015
Neg logit contributions
ris-0.038
 gol-0.035
spe-0.035
 narrative-0.034
 nations-0.033
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
cel+0.131
 forehead+0.127
 lap+0.115
 chin+0.110
chief+0.109
Neg logit contributions
ris-0.139
 gol-0.124
spe-0.113
 narrative-0.113
 nations-0.110
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
cel+0.017
 forehead+0.015
 lap+0.015
 chin+0.014
tail+0.013
Neg logit contributions
ris-0.028
 gol-0.027
 narrative-0.027
spe-0.026
 nations-0.026
Token the
Token ID262
Feature activation+8.28e-03

Pos logit contributions
cel+0.030
 forehead+0.030
 lap+0.029
amas+0.027
 chin+0.027
Neg logit contributions
 narrative-0.024
 nations-0.022
 gol-0.022
spe-0.021
ris-0.021
Token sun
Token ID4252
Feature activation+4.11e-02

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.013
chief+0.013
 chin+0.013
Neg logit contributions
 gol-0.013
ris-0.013
 narrative-0.012
 nations-0.012
spe-0.011
Token was
Token ID373
Feature activation+1.20e-02

Pos logit contributions
cel+0.082
 forehead+0.082
 lap+0.077
 hips+0.074
 chin+0.072
Neg logit contributions
ris-0.053
spe-0.046
ile-0.042
 narrative-0.041
 gol-0.041
Token night
Token ID1755
Feature activation+5.97e-02

Pos logit contributions
cel+0.017
 lap+0.015
 forehead+0.015
 chin+0.014
 cheeks+0.013
Neg logit contributions
ris-0.018
 gol-0.016
 narrative-0.016
spe-0.016
 nations-0.015
Token,
Token ID11
Feature activation+7.03e-03

Pos logit contributions
cel+0.087
 lap+0.078
 forehead+0.078
 chin+0.070
tail+0.068
Neg logit contributions
ris-0.116
 gol-0.105
spe-0.103
 narrative-0.100
 nations-0.098
Token Will
Token ID2561
Feature activation+2.22e-02

Pos logit contributions
cel+0.009
 lap+0.008
 forehead+0.008
 chin+0.007
tail+0.007
Neg logit contributions
ris-0.015
 gol-0.014
 narrative-0.014
spe-0.014
 nations-0.014
Token.
Token ID13
Feature activation+1.06e-03

Pos logit contributions
cel+0.035
 lap+0.033
 forehead+0.031
 chin+0.028
amas+0.027
Neg logit contributions
ris-0.043
 gol-0.037
spe-0.036
 narrative-0.036
 nations-0.036
Token Good
Token ID4599
Feature activation+3.25e-02

Pos logit contributions
cel+0.002
 forehead+0.002
 lap+0.001
 sweat+0.001
 chin+0.001
Neg logit contributions
ris-0.002
 gol-0.002
 narrative-0.002
spe-0.002
uzz-0.001
Token night
Token ID1755
Feature activation+7.92e-02

Pos logit contributions
cel+0.066
 lap+0.059
 forehead+0.055
chief+0.054
 hips+0.054
Neg logit contributions
ris-0.047
 narrative-0.043
 gol-0.041
spe-0.039
 nations-0.039
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
cel+0.127
 lap+0.115
 forehead+0.114
 chin+0.108
chief+0.106
Neg logit contributions
ris-0.135
spe-0.121
 gol-0.119
 narrative-0.118
 nations-0.110
Token Mr
Token ID1770
Feature activation+4.66e-03

Pos logit contributions
cel+0.018
 lap+0.017
 forehead+0.016
 chin+0.016
tail+0.015
Neg logit contributions
ris-0.031
 gol-0.029
 narrative-0.029
spe-0.028
 nations-0.027
Token.
Token ID13
Feature activation+2.89e-02

Pos logit contributions
cel+0.006
 lap+0.006
 forehead+0.005
 chin+0.005
 cheeks+0.005
Neg logit contributions
ris-0.010
 narrative-0.009
 gol-0.009
 nations-0.009
spe-0.009
Token Bear
Token ID14732
Feature activation+2.48e-02

Pos logit contributions
cel+0.036
 forehead+0.035
 lap+0.035
 chin+0.031
tail+0.031
Neg logit contributions
ris-0.057
 narrative-0.054
 gol-0.054
spe-0.053
 nations-0.050
Token,"
Token ID553
Feature activation+4.21e-03

Pos logit contributions
cel+0.036
 lap+0.032
 forehead+0.032
 chin+0.030
 cheeks+0.029
Neg logit contributions
ris-0.049
 gol-0.044
spe-0.044
 narrative-0.044
 nations-0.043
Token.
Token ID13
Feature activation-8.24e-04

Pos logit contributions
cel+0.013
 forehead+0.011
 lap+0.011
 chin+0.011
 cheeks+0.010
Neg logit contributions
ris-0.019
 gol-0.018
spe-0.018
 narrative-0.018
 nations-0.017
Token
Token ID220
Feature activation+1.55e-02

Pos logit contributions
ris+0.002
 gol+0.002
spe+0.002
 narrative+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 forehead-0.001
 lap-0.001
 chin-0.001
tail-0.001
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
cel+0.021
 forehead+0.020
 lap+0.018
 chin+0.018
 cheeks+0.017
Neg logit contributions
ris-0.032
 gol-0.029
spe-0.028
 narrative-0.028
 nations-0.027
Token
Token ID198
Feature activation+2.25e-02

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.013
 chin+0.013
 cheeks+0.012
Neg logit contributions
ris-0.025
 gol-0.022
spe-0.022
 narrative-0.022
 nations-0.021
TokenThat
Token ID2504
Feature activation+2.48e-02

Pos logit contributions
cel+0.029
 lap+0.026
 forehead+0.025
 chin+0.023
ze+0.022
Neg logit contributions
ris-0.047
 narrative-0.044
 gol-0.044
 nations-0.044
spe-0.042
Token evening
Token ID6180
Feature activation+7.91e-02

Pos logit contributions
cel+0.023
 forehead+0.023
ze+0.021
 nose+0.021
 feel+0.021
Neg logit contributions
ris-0.056
 gol-0.056
spe-0.052
uzz-0.050
 nations-0.049
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
cel+0.099
 forehead+0.090
 lap+0.089
 chin+0.082
 cheeks+0.075
Neg logit contributions
ris-0.171
 gol-0.158
 narrative-0.157
spe-0.155
 nations-0.151
Token Th
Token ID536
Feature activation-1.97e-02

Pos logit contributions
cel+0.030
 forehead+0.028
 lap+0.027
 chin+0.025
tail+0.023
Neg logit contributions
ris-0.044
 gol-0.040
spe-0.040
 narrative-0.039
 nations-0.038
Tokenump
Token ID931
Feature activation+7.18e-03

Pos logit contributions
ris+0.033
 gol+0.029
 narrative+0.029
spe+0.029
 nations+0.028
Neg logit contributions
cel-0.035
 forehead-0.033
 lap-0.033
 chin-0.031
 cheeks-0.029
Token was
Token ID373
Feature activation+1.88e-02

Pos logit contributions
cel+0.012
 forehead+0.011
 lap+0.011
 chin+0.010
 cheeks+0.010
Neg logit contributions
ris-0.013
 gol-0.012
 narrative-0.011
spe-0.011
 nations-0.011
Token talking
Token ID3375
Feature activation+3.28e-02

Pos logit contributions
cel+0.027
 forehead+0.026
 lap+0.025
 chin+0.023
ze+0.022
Neg logit contributions
ris-0.037
 gol-0.034
 narrative-0.034
spe-0.033
 nations-0.033
Token upset
Token ID9247
Feature activation-4.77e-03

Pos logit contributions
cel+0.010
 forehead+0.009
 lap+0.009
 chin+0.008
 cheeks+0.008
Neg logit contributions
ris-0.014
 gol-0.013
 narrative-0.013
spe-0.013
 nations-0.013
Token.
Token ID13
Feature activation+3.63e-03

Pos logit contributions
ris+0.010
 gol+0.009
 narrative+0.009
spe+0.009
 nations+0.008
Neg logit contributions
cel-0.007
 forehead-0.006
 lap-0.006
 chin-0.006
tail-0.005
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
cel+0.005
 forehead+0.005
 chin+0.005
 lap+0.005
ze+0.005
Neg logit contributions
ris-0.006
spe-0.005
 narrative-0.005
 gol-0.005
 nations-0.005
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
cel+0.017
 forehead+0.017
 chin+0.015
 lap+0.015
 cheeks+0.015
Neg logit contributions
ris-0.023
 narrative-0.020
spe-0.020
 gol-0.020
 nations-0.019
TokenThat
Token ID2504
Feature activation+2.32e-02

Pos logit contributions
cel+0.035
 forehead+0.033
 lap+0.033
 chin+0.032
 cheeks+0.030
Neg logit contributions
ris-0.038
spe-0.034
 narrative-0.034
 gol-0.033
 nations-0.032
Token night
Token ID1755
Feature activation+7.86e-02

Pos logit contributions
's+0.020
 feel+0.017
 nose+0.017
 warm+0.017
 forehead+0.016
Neg logit contributions
ris-0.057
 gol-0.055
uzz-0.051
ole-0.050
spe-0.049
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
cel+0.106
 lap+0.099
 forehead+0.098
 chin+0.091
tail+0.083
Neg logit contributions
ris-0.158
spe-0.147
 gol-0.144
 narrative-0.143
 nations-0.137
Token May
Token ID1737
Feature activation+2.58e-02

Pos logit contributions
 lap+0.028
cel+0.028
 chin+0.027
 forehead+0.026
 shoulder+0.024
Neg logit contributions
ris-0.031
spe-0.030
 narrative-0.029
 gol-0.027
 nations-0.027
Token smelled
Token ID37609
Feature activation+1.79e-02

Pos logit contributions
cel+0.040
 lap+0.038
 forehead+0.037
 chin+0.034
amas+0.033
Neg logit contributions
ris-0.047
spe-0.043
 gol-0.043
 narrative-0.042
 nations-0.040
Token something
Token ID1223
Feature activation-1.62e-03

Pos logit contributions
cel+0.028
 lap+0.026
 forehead+0.025
 chin+0.025
tail+0.023
Neg logit contributions
ris-0.031
 narrative-0.030
 gol-0.029
 nations-0.029
spe-0.028
Token funny
Token ID8258
Feature activation-1.91e-02

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.002
 cheeks-0.002
Token.
Token ID13
Feature activation-1.92e-03

Pos logit contributions
ris+0.015
 narrative+0.015
 gol+0.014
spe+0.014
 nations+0.014
Neg logit contributions
cel-0.007
 lap-0.007
 forehead-0.007
 chin-0.006
 cheeks-0.005
Token
Token ID220
Feature activation+1.43e-02

Pos logit contributions
ris+0.005
 gol+0.004
 narrative+0.004
spe+0.004
 nations+0.004
Neg logit contributions
cel-0.002
 lap-0.002
 forehead-0.002
 chin-0.002
 cheeks-0.001
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
cel+0.018
 forehead+0.016
 lap+0.016
 chin+0.015
 cheeks+0.014
Neg logit contributions
ris-0.031
 gol-0.029
spe-0.028
 narrative-0.028
 nations-0.027
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
cel+0.013
 lap+0.012
 forehead+0.012
 chin+0.010
tail+0.009
Neg logit contributions
ris-0.029
 gol-0.027
 narrative-0.026
spe-0.026
 nations-0.026
TokenThat
Token ID2504
Feature activation+2.33e-02

Pos logit contributions
cel+0.028
 lap+0.027
 forehead+0.023
tail+0.022
ze+0.021
Neg logit contributions
ris-0.049
 narrative-0.047
 nations-0.047
 gol-0.047
spe-0.044
Token night
Token ID1755
Feature activation+7.86e-02

Pos logit contributions
 forehead+0.025
cel+0.025
 lap+0.024
 chin+0.023
 feel+0.022
Neg logit contributions
ris-0.049
 gol-0.048
 narrative-0.045
spe-0.044
 nations-0.044
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
cel+0.109
 lap+0.101
 forehead+0.100
 chin+0.093
tail+0.087
Neg logit contributions
ris-0.157
spe-0.144
 gol-0.143
 narrative-0.140
 nations-0.135
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
cel+0.033
 forehead+0.031
 lap+0.031
 chin+0.028
 cheeks+0.026
Neg logit contributions
ris-0.046
 gol-0.042
 narrative-0.042
spe-0.042
 nations-0.040
Token boy
Token ID2933
Feature activation+1.43e-02

Pos logit contributions
cel+0.016
 forehead+0.015
 lap+0.014
 chin+0.013
 cheeks+0.012
Neg logit contributions
ris-0.027
 gol-0.024
 narrative-0.024
spe-0.024
 nations-0.023
Token fell
Token ID3214
Feature activation-1.08e-03

Pos logit contributions
cel+0.023
 forehead+0.021
 lap+0.021
 chin+0.020
amas+0.019
Neg logit contributions
ris-0.026
 gol-0.023
spe-0.023
 narrative-0.023
 nations-0.022
Token asleep
Token ID16039
Feature activation+2.97e-02

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 forehead-0.001
 lap-0.001
 chin-0.001
 cheeks-0.001
Token tomorrow
Token ID9439
Feature activation+2.74e-02

Pos logit contributions
ris+0.007
spe+0.007
 nations+0.006
 gol+0.006
 narrative+0.006
Neg logit contributions
cel-0.006
 forehead-0.005
 lap-0.005
 chin-0.005
amas-0.005
Token.
Token ID13
Feature activation-4.37e-03

Pos logit contributions
cel+0.041
 forehead+0.037
 lap+0.035
 chin+0.033
 cheeks+0.031
Neg logit contributions
ris-0.054
spe-0.049
 gol-0.048
 nations-0.047
 narrative-0.047
Token
Token ID198
Feature activation+5.95e-03

Pos logit contributions
ris+0.011
 gol+0.010
 narrative+0.010
spe+0.010
 nations+0.010
Neg logit contributions
cel-0.004
 lap-0.004
 forehead-0.004
 chin-0.003
 cheeks-0.003
Token
Token ID198
Feature activation+1.69e-02

Pos logit contributions
cel+0.006
 lap+0.006
 forehead+0.005
 chin+0.005
tail+0.005
Neg logit contributions
ris-0.014
 gol-0.013
 narrative-0.013
spe-0.013
 nations-0.012
TokenThat
Token ID2504
Feature activation+2.44e-02

Pos logit contributions
 lap+0.016
ze+0.016
Fl+0.016
cel+0.015
 warm+0.014
Neg logit contributions
 gol-0.038
 nations-0.034
 narrative-0.034
ris-0.033
spe-0.032
Token night
Token ID1755
Feature activation+7.85e-02

Pos logit contributions
's+0.016
 nose+0.015
 feel+0.013
 lap+0.013
 warm+0.013
Neg logit contributions
ris-0.063
 gol-0.063
spe-0.060
uzz-0.058
 narrative-0.057
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
cel+0.087
 forehead+0.079
 lap+0.078
 chin+0.071
 cheeks+0.066
Neg logit contributions
ris-0.182
 gol-0.169
 narrative-0.168
spe-0.165
 nations-0.162
Token Bob
Token ID5811
Feature activation+2.01e-02

Pos logit contributions
cel+0.028
 lap+0.028
 forehead+0.027
 chin+0.026
 cheeks+0.025
Neg logit contributions
ris-0.040
 narrative-0.037
 gol-0.037
spe-0.035
 nations-0.034
Token had
Token ID550
Feature activation-6.49e-03

Pos logit contributions
cel+0.030
 forehead+0.028
 lap+0.027
 chin+0.025
 cheeks+0.024
Neg logit contributions
ris-0.039
 gol-0.035
spe-0.035
 narrative-0.035
 nations-0.034
Token a
Token ID257
Feature activation-1.04e-03

Pos logit contributions
ris+0.014
 narrative+0.013
 gol+0.013
spe+0.013
 nations+0.012
Neg logit contributions
cel-0.008
 forehead-0.007
 lap-0.007
 chin-0.006
 shoulder-0.006
Token dream
Token ID4320
Feature activation+4.13e-02

Pos logit contributions
ris+0.002
spe+0.002
 narrative+0.002
 gol+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
 chin-0.001
 cozy-0.001
Token help
Token ID1037
Feature activation+7.60e-03

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
 chin-0.001
 cheeks-0.001
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
cel+0.012
 forehead+0.010
 lap+0.010
 chin+0.010
tail+0.009
Neg logit contributions
ris-0.014
 gol-0.013
spe-0.013
 nations-0.012
ole-0.012
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
ris+0.003
 gol+0.003
spe+0.003
 narrative+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.001
 cheeks-0.001
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
cel+0.011
 lap+0.010
 forehead+0.009
 chin+0.008
tail+0.007
Neg logit contributions
ris-0.028
 gol-0.027
 narrative-0.027
spe-0.026
 nations-0.026
TokenThat
Token ID2504
Feature activation+2.12e-02

Pos logit contributions
cel+0.032
 lap+0.031
 forehead+0.028
tail+0.026
 chin+0.026
Neg logit contributions
ris-0.041
 narrative-0.040
 gol-0.039
 nations-0.039
spe-0.037
Token evening
Token ID6180
Feature activation+7.85e-02

Pos logit contributions
 forehead+0.020
 nose+0.019
's+0.018
 lap+0.018
 warm+0.018
Neg logit contributions
 gol-0.047
ris-0.047
 narrative-0.043
 nations-0.042
uzz-0.042
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
cel+0.096
 forehead+0.088
 lap+0.087
 chin+0.080
 cheeks+0.075
Neg logit contributions
ris-0.171
 gol-0.159
 narrative-0.158
spe-0.154
 nations-0.152
Token they
Token ID484
Feature activation+3.83e-03

Pos logit contributions
cel+0.035
 forehead+0.033
 lap+0.033
 chin+0.031
 cheeks+0.030
Neg logit contributions
ris-0.033
 narrative-0.030
 gol-0.029
spe-0.028
 nations-0.028
Token started
Token ID2067
Feature activation-1.61e-02

Pos logit contributions
cel+0.005
 forehead+0.005
 lap+0.005
 chin+0.005
 feel+0.005
Neg logit contributions
ris-0.007
 gol-0.007
 narrative-0.007
spe-0.006
 nations-0.006
Token to
Token ID284
Feature activation+8.78e-04

Pos logit contributions
ris+0.032
 narrative+0.030
 gol+0.029
 nations+0.028
spe+0.028
Neg logit contributions
cel-0.023
 lap-0.022
 forehead-0.022
 chin-0.020
 cheeks-0.019
Token increase
Token ID2620
Feature activation+1.90e-03

Pos logit contributions
cel+0.001
 forehead+0.001
 lap+0.001
 chin+0.001
 cheeks+0.001
Neg logit contributions
ris-0.002
 gol-0.002
spe-0.002
 narrative-0.002
 nations-0.002
Token her
Token ID607
Feature activation+2.19e-02

Pos logit contributions
cel+0.008
 forehead+0.008
amas+0.007
 chin+0.007
 cheek+0.007
Neg logit contributions
ris-0.006
 nations-0.006
 narrative-0.006
 gol-0.006
igators-0.006
Token mom
Token ID1995
Feature activation+3.33e-03

Pos logit contributions
cel+0.025
 forehead+0.023
 lap+0.023
 chin+0.021
 cheeks+0.019
Neg logit contributions
ris-0.050
 gol-0.046
 narrative-0.046
spe-0.045
 nations-0.044
Token.
Token ID13
Feature activation+6.29e-04

Pos logit contributions
cel+0.005
 forehead+0.005
 lap+0.005
 cheeks+0.005
 chin+0.005
Neg logit contributions
ris-0.006
 gol-0.005
spe-0.005
 nations-0.005
 narrative-0.005
Token It
Token ID632
Feature activation+6.53e-03

Pos logit contributions
cel+0.001
 forehead+0.001
ze+0.001
 chin+0.001
 cheek+0.001
Neg logit contributions
ris-0.001
 gol-0.001
spe-0.001
 nations-0.001
 planned-0.001
Token was
Token ID373
Feature activation+1.27e-02

Pos logit contributions
cel+0.012
 forehead+0.012
 lap+0.012
 chin+0.011
 cheeks+0.011
Neg logit contributions
ris-0.009
spe-0.009
 narrative-0.008
 gol-0.008
igators-0.008
Token noon
Token ID19613
Feature activation+7.84e-02

Pos logit contributions
cel+0.014
 lap+0.013
 forehead+0.012
 chin+0.011
 warm+0.010
Neg logit contributions
ris-0.030
 gol-0.028
spe-0.027
 narrative-0.027
 nations-0.027
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
cel+0.124
 forehead+0.120
 lap+0.112
 chin+0.108
chief+0.103
Neg logit contributions
ris-0.135
 gol-0.119
spe-0.115
 narrative-0.112
 nations-0.112
Token the
Token ID262
Feature activation+7.52e-03

Pos logit contributions
cel+0.037
 forehead+0.035
 lap+0.035
 chin+0.033
tail+0.031
Neg logit contributions
spe-0.031
ris-0.030
 narrative-0.030
 gol-0.030
igators-0.029
Token sun
Token ID4252
Feature activation+4.45e-02

Pos logit contributions
cel+0.012
 forehead+0.010
 lap+0.010
tail+0.010
 chin+0.010
Neg logit contributions
ris-0.013
 narrative-0.013
 gol-0.013
 nations-0.012
spe-0.012
Token was
Token ID373
Feature activation+1.17e-02

Pos logit contributions
cel+0.087
 forehead+0.084
 lap+0.082
 hips+0.076
 chin+0.076
Neg logit contributions
ris-0.060
spe-0.054
 narrative-0.049
 gol-0.048
ile-0.047
Token shining
Token ID22751
Feature activation+4.30e-02

Pos logit contributions
cel+0.017
 forehead+0.015
 lap+0.015
 chin+0.014
tail+0.014
Neg logit contributions
ris-0.023
 gol-0.021
 narrative-0.021
spe-0.021
 nations-0.020
Token joy
Token ID8716
Feature activation-4.32e-03

Pos logit contributions
cel+0.002
 lap+0.002
 forehead+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.004
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token.
Token ID13
Feature activation+5.42e-03

Pos logit contributions
ris+0.009
 gol+0.008
spe+0.008
 narrative+0.008
 nations+0.008
Neg logit contributions
cel-0.006
 lap-0.005
 forehead-0.005
 chin-0.005
tail-0.005
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
cel+0.007
 forehead+0.007
 lap+0.006
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.011
 narrative-0.010
spe-0.010
 gol-0.010
 nations-0.010
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
cel+0.019
 lap+0.018
 forehead+0.018
 chin+0.017
tail+0.016
Neg logit contributions
ris-0.020
 gol-0.019
 narrative-0.018
spe-0.018
 nations-0.017
TokenThat
Token ID2504
Feature activation+2.12e-02

Pos logit contributions
cel+0.028
 lap+0.025
 forehead+0.024
ze+0.022
tail+0.021
Neg logit contributions
ris-0.048
 gol-0.047
spe-0.044
 narrative-0.043
 nations-0.043
Token night
Token ID1755
Feature activation+7.84e-02

Pos logit contributions
cel+0.022
 forehead+0.021
 lap+0.020
 feel+0.019
 chin+0.019
Neg logit contributions
ris-0.047
 gol-0.045
 narrative-0.042
spe-0.041
 nations-0.040
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
cel+0.118
 lap+0.112
 forehead+0.109
 chin+0.105
tail+0.098
Neg logit contributions
ris-0.144
spe-0.135
 narrative-0.130
 gol-0.127
 nations-0.123
Token Maya
Token ID26041
Feature activation+3.84e-02

Pos logit contributions
cel+0.033
 lap+0.031
 forehead+0.030
 chin+0.029
tail+0.027
Neg logit contributions
ris-0.043
spe-0.041
 narrative-0.041
 gol-0.039
 nations-0.039
Token was
Token ID373
Feature activation+8.78e-03

Pos logit contributions
cel+0.064
 lap+0.060
 forehead+0.059
 chin+0.055
amas+0.053
Neg logit contributions
ris-0.067
spe-0.060
 gol-0.059
 narrative-0.058
 nations-0.056
Token visited
Token ID8672
Feature activation+4.20e-02

Pos logit contributions
cel+0.014
 lap+0.012
 forehead+0.012
 chin+0.012
 cheeks+0.011
Neg logit contributions
ris-0.016
 gol-0.015
spe-0.015
 narrative-0.015
 nations-0.014
Token by
Token ID416
Feature activation-2.75e-03

Pos logit contributions
cel+0.068
 forehead+0.062
 lap+0.062
 chin+0.061
 cheeks+0.057
Neg logit contributions
ris-0.075
spe-0.069
 narrative-0.066
 gol-0.066
 nations-0.065
Token together
Token ID1978
Feature activation-1.64e-02

Pos logit contributions
cel+0.022
 forehead+0.020
 lap+0.020
 chin+0.019
 cheeks+0.017
Neg logit contributions
ris-0.033
 gol-0.031
 narrative-0.030
spe-0.030
 nations-0.029
Token.
Token ID13
Feature activation+8.63e-03

Pos logit contributions
ris+0.037
 gol+0.035
 narrative+0.035
spe+0.034
 nations+0.033
Neg logit contributions
cel-0.018
 lap-0.016
 forehead-0.016
 chin-0.015
tail-0.013
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
cel+0.011
 forehead+0.010
 lap+0.009
 chin+0.009
 cheeks+0.008
Neg logit contributions
ris-0.019
 gol-0.018
spe-0.017
 narrative-0.017
 nations-0.016
Token
Token ID198
Feature activation+2.43e-02

Pos logit contributions
cel+0.015
 lap+0.013
 forehead+0.013
 chin+0.012
tail+0.011
Neg logit contributions
ris-0.033
 gol-0.031
 narrative-0.031
spe-0.030
 nations-0.030
TokenThat
Token ID2504
Feature activation+2.36e-02

Pos logit contributions
cel+0.029
 lap+0.026
 forehead+0.024
tail+0.022
 chin+0.022
Neg logit contributions
ris-0.052
 narrative-0.049
 nations-0.049
 gol-0.048
spe-0.047
Token night
Token ID1755
Feature activation+7.78e-02

Pos logit contributions
cel+0.032
 forehead+0.030
 lap+0.030
 chin+0.028
tail+0.026
Neg logit contributions
ris-0.046
 gol-0.044
 narrative-0.042
spe-0.042
 nations-0.041
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
cel+0.110
 lap+0.104
 forehead+0.103
 chin+0.095
tail+0.090
Neg logit contributions
ris-0.153
 gol-0.142
spe-0.141
 narrative-0.137
 nations-0.130
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
cel+0.032
 forehead+0.029
 lap+0.029
 chin+0.027
tail+0.026
Neg logit contributions
ris-0.042
 gol-0.039
spe-0.038
 narrative-0.037
 nations-0.036
Token crab
Token ID32202
Feature activation+1.45e-02

Pos logit contributions
cel+0.028
 lap+0.025
 forehead+0.024
 chin+0.023
tail+0.022
Neg logit contributions
ris-0.028
 gol-0.027
 narrative-0.026
 nations-0.025
spe-0.025
Token went
Token ID1816
Feature activation+1.13e-02

Pos logit contributions
cel+0.022
 forehead+0.020
 lap+0.020
 chin+0.019
 cheeks+0.017
Neg logit contributions
ris-0.027
 gol-0.025
 narrative-0.025
spe-0.024
 nations-0.024
Token to
Token ID284
Feature activation+9.29e-03

Pos logit contributions
cel+0.015
 lap+0.014
 forehead+0.014
 chin+0.013
tail+0.012
Neg logit contributions
ris-0.024
 narrative-0.022
 gol-0.022
spe-0.022
 nations-0.021
Token noticed
Token ID6810
Feature activation-5.38e-03

Pos logit contributions
cel+0.003
 forehead+0.003
 lap+0.003
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.003
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token something
Token ID1223
Feature activation-2.05e-02

Pos logit contributions
ris+0.012
 narrative+0.012
spe+0.012
 gol+0.011
 nations+0.011
Neg logit contributions
cel-0.006
 lap-0.005
 forehead-0.005
 chin-0.004
ze-0.004
Token strange
Token ID6283
Feature activation-4.24e-02

Pos logit contributions
ris+0.048
 narrative+0.044
 gol+0.043
spe+0.043
 nations+0.042
Neg logit contributions
cel-0.023
 forehead-0.021
 lap-0.020
 warm-0.020
 chin-0.019
Token.
Token ID13
Feature activation-1.60e-03

Pos logit contributions
ris+0.099
 gol+0.093
 narrative+0.092
spe+0.091
 nations+0.089
Neg logit contributions
cel-0.045
 forehead-0.041
 lap-0.040
 chin-0.037
 cheeks-0.034
Token A
Token ID317
Feature activation-2.99e-03

Pos logit contributions
ris+0.004
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.002
 cheeks-0.001
Token gol
Token ID43791
Feature activation-1.05e-01

Pos logit contributions
ris+0.007
 narrative+0.007
spe+0.006
 nations+0.006
 gol+0.006
Neg logit contributions
cel-0.003
 lap-0.003
 forehead-0.003
 chin-0.003
 shoulder-0.002
Tokenfer
Token ID2232
Feature activation-4.14e-03

Pos logit contributions
ris+0.185
 narrative+0.172
 nations+0.168
spe+0.162
igators+0.162
Neg logit contributions
cel-0.177
tail-0.146
 lap-0.145
 forehead-0.145
 chin-0.145
Token was
Token ID373
Feature activation-1.49e-02

Pos logit contributions
ris+0.008
 narrative+0.008
 gol+0.007
spe+0.007
 nations+0.007
Neg logit contributions
cel-0.006
 lap-0.005
 forehead-0.005
 chin-0.005
ze-0.005
Token hitting
Token ID9008
Feature activation+8.99e-03

Pos logit contributions
ris+0.034
 narrative+0.034
 nations+0.031
oral+0.031
spe+0.031
Neg logit contributions
cel-0.017
ze-0.015
 warm-0.015
 forehead-0.015
 lap-0.014
Token golf
Token ID13126
Feature activation-2.80e-02

Pos logit contributions
cel+0.010
 lap+0.010
 forehead+0.009
 chin+0.009
 cheeks+0.008
Neg logit contributions
ris-0.020
 narrative-0.018
 gol-0.018
spe-0.018
 nations-0.017
Token balls
Token ID11333
Feature activation-1.45e-02

Pos logit contributions
ris+0.064
 narrative+0.058
spe+0.057
 gol+0.056
 nations+0.055
Neg logit contributions
cel-0.029
 lap-0.028
 chin-0.026
ze-0.026
 forehead-0.026
Token wanted
Token ID2227
Feature activation-6.24e-03

Pos logit contributions
 narrative+0.018
ris+0.018
 gol+0.017
ole+0.016
 nations+0.016
Neg logit contributions
cel-0.010
 lap-0.009
 feel-0.008
 chin-0.008
 sn-0.008
Token to
Token ID284
Feature activation+3.75e-04

Pos logit contributions
ris+0.013
 narrative+0.013
spe+0.012
 gol+0.012
ile+0.011
Neg logit contributions
cel-0.007
 lap-0.007
 forehead-0.006
 chin-0.006
 feel-0.006
Token be
Token ID307
Feature activation-1.25e-02

Pos logit contributions
cel+0.000
 lap+0.000
 forehead+0.000
 chin+0.000
tail+0.000
Neg logit contributions
ris-0.001
 narrative-0.001
spe-0.001
 gol-0.001
 nations-0.001
Token a
Token ID257
Feature activation-4.84e-03

Pos logit contributions
ris+0.026
 gol+0.024
 narrative+0.024
spe+0.024
 nations+0.023
Neg logit contributions
cel-0.016
 forehead-0.015
 lap-0.015
 chin-0.014
 cheeks-0.013
Token good
Token ID922
Feature activation-7.84e-03

Pos logit contributions
ris+0.009
 narrative+0.009
spe+0.009
igators+0.008
 nations+0.008
Neg logit contributions
cel-0.007
 lap-0.006
 forehead-0.006
 chin-0.006
 warm-0.006
Token gol
Token ID43791
Feature activation-1.03e-01

Pos logit contributions
ris+0.014
 narrative+0.014
spe+0.014
igators+0.013
 nations+0.012
Neg logit contributions
cel-0.011
 lap-0.011
 chin-0.010
 feel-0.010
 nose-0.010
Tokenfer
Token ID2232
Feature activation-1.22e-02

Pos logit contributions
ris+0.161
 narrative+0.144
spe+0.140
 gol+0.138
 nations+0.137
Neg logit contributions
cel-0.195
 lap-0.177
 forehead-0.176
 chin-0.169
tail-0.164
Token like
Token ID588
Feature activation-9.70e-03

Pos logit contributions
ris+0.025
 gol+0.023
 narrative+0.023
spe+0.023
 nations+0.022
Neg logit contributions
cel-0.017
 forehead-0.015
 lap-0.015
 chin-0.014
tail-0.013
Token his
Token ID465
Feature activation+1.56e-03

Pos logit contributions
ris+0.019
 gol+0.018
spe+0.017
 narrative+0.017
 nations+0.017
Neg logit contributions
cel-0.014
 forehead-0.013
 lap-0.013
 chin-0.012
 cheeks-0.011
Token dad
Token ID9955
Feature activation+1.61e-02

Pos logit contributions
cel+0.002
 lap+0.002
 forehead+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.003
 narrative-0.003
spe-0.003
 gol-0.003
igators-0.003
Token.
Token ID13
Feature activation-2.15e-02

Pos logit contributions
cel+0.024
 forehead+0.023
 lap+0.022
amas+0.020
 chin+0.020
Neg logit contributions
ris-0.031
 gol-0.028
spe-0.027
 nations-0.027
 narrative-0.026
Token!
Token ID0
Feature activation-2.84e-03

Pos logit contributions
ris+0.058
 narrative+0.054
 gol+0.053
spe+0.053
 nations+0.052
Neg logit contributions
cel-0.033
 lap-0.030
 forehead-0.030
 chin-0.028
 cheeks-0.025
Token I
Token ID314
Feature activation+9.91e-04

Pos logit contributions
ris+0.006
 gol+0.006
spe+0.005
 narrative+0.005
 nations+0.005
Neg logit contributions
cel-0.004
 forehead-0.003
 lap-0.003
 chin-0.003
 cheeks-0.003
Token'm
Token ID1101
Feature activation-1.20e-04

Pos logit contributions
cel+0.001
 lap+0.001
 forehead+0.001
 feel+0.001
 chin+0.001
Neg logit contributions
 narrative-0.002
 nations-0.002
ris-0.002
 gol-0.002
spe-0.002
Token the
Token ID262
Feature activation+2.20e-03

Pos logit contributions
ris+0.000
 narrative+0.000
spe+0.000
 nations+0.000
 gol+0.000
Neg logit contributions
cel-0.000
 forehead-0.000
 lap-0.000
 warm-0.000
 chin-0.000
Token best
Token ID1266
Feature activation-1.17e-02

Pos logit contributions
cel+0.003
 forehead+0.003
 lap+0.003
 shoulder+0.002
 feel+0.002
Neg logit contributions
ris-0.005
spe-0.004
igators-0.004
uzz-0.004
 narrative-0.004
Token gol
Token ID43791
Feature activation-9.15e-02

Pos logit contributions
ris+0.017
 narrative+0.015
 nations+0.013
igators+0.013
spe+0.013
Neg logit contributions
 lap-0.021
cel-0.021
 feel-0.020
 chin-0.020
 shoulder-0.020
Tokenfer
Token ID2232
Feature activation-3.54e-03

Pos logit contributions
ris+0.147
 narrative+0.129
spe+0.126
 nations+0.124
 gol+0.124
Neg logit contributions
cel-0.169
 lap-0.152
 forehead-0.152
 chin-0.145
tail-0.141
Token in
Token ID287
Feature activation-2.30e-02

Pos logit contributions
ris+0.006
 narrative+0.006
 gol+0.006
spe+0.005
 nations+0.005
Neg logit contributions
cel-0.006
 forehead-0.005
 lap-0.005
 chin-0.005
tail-0.005
Token the
Token ID262
Feature activation-3.36e-05

Pos logit contributions
ris+0.055
 narrative+0.052
 gol+0.051
spe+0.051
 nations+0.048
Neg logit contributions
cel-0.022
 lap-0.020
 forehead-0.019
 chin-0.018
tail-0.016
Token world
Token ID995
Feature activation-5.54e-03

Pos logit contributions
ris+0.000
spe+0.000
 gol+0.000
 narrative+0.000
igators+0.000
Neg logit contributions
cel-0.000
 lap-0.000
 forehead-0.000
 chin-0.000
 shoulder-0.000
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
ris+0.013
 narrative+0.012
 gol+0.012
spe+0.012
 nations+0.012
Neg logit contributions
cel-0.006
 lap-0.005
 forehead-0.005
 chin-0.005
tail-0.004
Token!
Token ID0
Feature activation-7.08e-03

Pos logit contributions
ris+0.025
 gol+0.023
spe+0.021
 narrative+0.020
 nations+0.020
Neg logit contributions
cel-0.019
 forehead-0.018
 lap-0.017
 chin-0.015
amas-0.015
Token I
Token ID314
Feature activation+1.93e-03

Pos logit contributions
ris+0.014
 gol+0.014
spe+0.013
 narrative+0.013
 nations+0.012
Neg logit contributions
cel-0.010
 forehead-0.009
 lap-0.008
 chin-0.008
 cheeks-0.008
Token'm
Token ID1101
Feature activation+3.92e-03

Pos logit contributions
cel+0.003
 lap+0.002
 feel+0.002
 forehead+0.002
 chin+0.002
Neg logit contributions
 nations-0.004
 narrative-0.004
ris-0.004
spe-0.003
 gol-0.003
Token the
Token ID262
Feature activation+6.96e-03

Pos logit contributions
cel+0.005
 forehead+0.005
 lap+0.004
 chin+0.004
 warm+0.004
Neg logit contributions
ris-0.009
 narrative-0.008
spe-0.008
 nations-0.008
 gol-0.008
Token best
Token ID1266
Feature activation-8.56e-03

Pos logit contributions
cel+0.009
 lap+0.008
 forehead+0.008
 chin+0.008
 shoulder+0.008
Neg logit contributions
ris-0.014
spe-0.013
 narrative-0.012
igators-0.012
 nations-0.012
Token gol
Token ID43791
Feature activation-9.09e-02

Pos logit contributions
ris+0.014
 narrative+0.013
spe+0.012
igators+0.011
 nations+0.011
Neg logit contributions
 lap-0.014
 feel-0.013
cel-0.013
 shoulder-0.013
 chin-0.012
Tokenfer
Token ID2232
Feature activation-2.59e-03

Pos logit contributions
ris+0.150
 narrative+0.132
spe+0.129
 nations+0.127
 gol+0.126
Neg logit contributions
cel-0.165
 lap-0.147
 forehead-0.146
 chin-0.140
tail-0.137
Token in
Token ID287
Feature activation-2.17e-02

Pos logit contributions
ris+0.004
 narrative+0.004
 gol+0.004
spe+0.004
 nations+0.004
Neg logit contributions
cel-0.005
 forehead-0.004
 lap-0.004
 chin-0.004
tail-0.004
Token the
Token ID262
Feature activation+3.18e-03

Pos logit contributions
ris+0.051
 narrative+0.048
 gol+0.047
spe+0.047
 nations+0.045
Neg logit contributions
cel-0.022
 lap-0.020
 forehead-0.019
 chin-0.018
tail-0.016
Token world
Token ID995
Feature activation-4.21e-03

Pos logit contributions
cel+0.005
 lap+0.004
 forehead+0.004
 chin+0.004
 shoulder+0.004
Neg logit contributions
ris-0.006
spe-0.006
 gol-0.005
 narrative-0.005
 nations-0.005
Token!"
Token ID2474
Feature activation-1.94e-03

Pos logit contributions
ris+0.010
 narrative+0.009
 gol+0.009
spe+0.009
 nations+0.009
Neg logit contributions
cel-0.004
 lap-0.004
 forehead-0.004
 chin-0.004
tail-0.003
Token.
Token ID13
Feature activation-3.98e-03

Pos logit contributions
ris+0.030
 gol+0.028
 narrative+0.027
spe+0.027
 nations+0.027
Neg logit contributions
cel-0.020
 forehead-0.019
 lap-0.018
 chin-0.017
tail-0.016
Token But
Token ID887
Feature activation-1.03e-03

Pos logit contributions
ris+0.008
 gol+0.008
 narrative+0.007
spe+0.007
 nations+0.007
Neg logit contributions
cel-0.005
 forehead-0.005
 lap-0.005
 chin-0.005
 cheeks-0.004
Token we
Token ID356
Feature activation-7.54e-03

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.002
 lap-0.001
 forehead-0.001
 chin-0.001
amas-0.001
Token need
Token ID761
Feature activation-2.53e-02

Pos logit contributions
spe+0.019
ris+0.017
 gol+0.017
 narrative+0.017
 nations+0.016
Neg logit contributions
 feel-0.007
cel-0.006
 make-0.005
 warm-0.005
 cover-0.005
Token more
Token ID517
Feature activation-2.83e-02

Pos logit contributions
ris+0.056
spe+0.054
 narrative+0.053
 gol+0.052
 nations+0.048
Neg logit contributions
 lap-0.029
cel-0.028
 forehead-0.027
 chin-0.025
 warm-0.025
Token le
Token ID443
Feature activation-8.73e-02

Pos logit contributions
ris+0.059
spe+0.057
 narrative+0.055
 gol+0.055
 nations+0.050
Neg logit contributions
 lap-0.035
cel-0.034
 chin-0.032
 forehead-0.031
 cheeks-0.030
Tokenmons
Token ID11567
Feature activation-3.57e-02

Pos logit contributions
ris+0.148
spe+0.140
 narrative+0.135
 nations+0.128
 gol+0.127
Neg logit contributions
cel-0.141
 lap-0.134
 forehead-0.129
 chin-0.128
tail-0.122
Token."
Token ID526
Feature activation-1.38e-03

Pos logit contributions
ris+0.071
 narrative+0.066
spe+0.066
 gol+0.066
 nations+0.063
Neg logit contributions
cel-0.049
 lap-0.046
 forehead-0.045
 chin-0.043
tail-0.039
Token Jane
Token ID12091
Feature activation-1.25e-02

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.002
tail-0.002
Token and
Token ID290
Feature activation+1.44e-03

Pos logit contributions
ris+0.026
 narrative+0.025
 gol+0.025
 nations+0.024
spe+0.024
Neg logit contributions
cel-0.015
 forehead-0.015
 chin-0.015
 lap-0.015
 cheeks-0.014
Token her
Token ID607
Feature activation-2.47e-03

Pos logit contributions
cel+0.002
 forehead+0.001
 lap+0.001
 chin+0.001
tail+0.001
Neg logit contributions
ris-0.003
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token one
Token ID530
Feature activation+1.81e-02

Pos logit contributions
cel+0.008
 lap+0.008
 forehead+0.007
 chin+0.007
tail+0.006
Neg logit contributions
ris-0.016
 narrative-0.015
 gol-0.015
spe-0.015
 nations-0.014
Token of
Token ID286
Feature activation+5.12e-03

Pos logit contributions
 lap+0.034
cel+0.033
 forehead+0.033
 chin+0.032
tail+0.030
Neg logit contributions
 narrative-0.025
ris-0.025
 gol-0.024
spe-0.023
 nations-0.022
Token Mary
Token ID5335
Feature activation+2.39e-02

Pos logit contributions
 lap+0.003
 herself+0.003
 forehead+0.003
 warm+0.003
 skin+0.002
Neg logit contributions
ris-0.013
spe-0.013
 narrative-0.013
 gol-0.012
 nations-0.012
Token's
Token ID338
Feature activation+1.17e-03

Pos logit contributions
cel+0.026
 forehead+0.024
 lap+0.023
 chin+0.020
amas+0.020
Neg logit contributions
ris-0.056
spe-0.051
 gol-0.050
 nations-0.050
 narrative-0.049
Token most
Token ID749
Feature activation-2.79e-02

Pos logit contributions
cel+0.002
 forehead+0.002
 lap+0.002
tail+0.001
amas+0.001
Neg logit contributions
ris-0.002
 gol-0.002
 nations-0.002
spe-0.002
 narrative-0.002
Token tre
Token ID2054
Feature activation-8.57e-02

Pos logit contributions
 narrative+0.057
ris+0.055
igators+0.054
 nations+0.053
spe+0.052
Neg logit contributions
cel-0.033
 lap-0.030
 forehead-0.029
 warm-0.028
 cozy-0.028
Tokenasured
Token ID34006
Feature activation-3.15e-02

Pos logit contributions
ris+0.124
 narrative+0.118
spe+0.117
 nations+0.117
 gol+0.113
Neg logit contributions
cel-0.158
 forehead-0.151
 lap-0.148
 chin-0.142
 shoulder-0.136
Token possessions
Token ID23309
Feature activation-1.46e-02

Pos logit contributions
ris+0.054
 narrative+0.052
 gol+0.050
spe+0.048
 nations+0.047
Neg logit contributions
cel-0.052
 lap-0.050
 forehead-0.050
 chin-0.047
 shoulder-0.044
Token and
Token ID290
Feature activation-5.82e-04

Pos logit contributions
ris+0.033
 gol+0.031
 narrative+0.031
spe+0.030
 nations+0.029
Neg logit contributions
cel-0.017
 forehead-0.015
 lap-0.015
 chin-0.014
 cheeks-0.012
Token she
Token ID673
Feature activation-1.30e-02

Pos logit contributions
ris+0.001
 gol+0.001
spe+0.001
 narrative+0.001
 nations+0.001
Neg logit contributions
cel-0.001
 forehead-0.001
 lap-0.001
 chin-0.001
amas-0.001
Token never
Token ID1239
Feature activation-3.77e-02

Pos logit contributions
ris+0.024
 gol+0.022
 narrative+0.022
spe+0.021
 nations+0.021
Neg logit contributions
cel-0.020
 forehead-0.019
 lap-0.019
 chin-0.018
 cheeks-0.017
Token to
Token ID284
Feature activation-6.13e-03

Pos logit contributions
ris+0.018
 narrative+0.017
spe+0.016
 gol+0.016
 nations+0.015
Neg logit contributions
cel-0.014
 lap-0.013
 forehead-0.013
 chin-0.012
 shoulder-0.011
Token school
Token ID1524
Feature activation-1.37e-03

Pos logit contributions
ris+0.012
 gol+0.011
 nations+0.011
 narrative+0.010
spe+0.010
Neg logit contributions
cel-0.009
 forehead-0.008
 lap-0.008
 chin-0.008
 cheeks-0.007
Token with
Token ID351
Feature activation-3.41e-03

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 lap-0.002
 forehead-0.002
 chin-0.001
 cheeks-0.001
Token a
Token ID257
Feature activation-1.84e-02

Pos logit contributions
ris+0.006
 gol+0.005
 nations+0.005
 narrative+0.005
spe+0.005
Neg logit contributions
cel-0.006
 forehead-0.006
 lap-0.005
 chin-0.005
tail-0.005
Token blue
Token ID4171
Feature activation-1.95e-02

Pos logit contributions
ris+0.042
 narrative+0.038
spe+0.038
 gol+0.037
 nations+0.037
Neg logit contributions
 lap-0.020
cel-0.019
 forehead-0.019
 chin-0.018
 shoulder-0.016
Token er
Token ID1931
Feature activation-8.51e-02

Pos logit contributions
ris+0.044
spe+0.041
 narrative+0.040
 gol+0.039
igators+0.038
Neg logit contributions
 lap-0.023
cel-0.022
 forehead-0.021
 chin-0.020
 cheeks-0.019
Tokenaser
Token ID6005
Feature activation-5.14e-02

Pos logit contributions
ris+0.123
spe+0.119
 narrative+0.115
 gol+0.110
 nations+0.108
Neg logit contributions
cel-0.163
 forehead-0.148
 lap-0.145
tail-0.140
 chin-0.139
Token.
Token ID13
Feature activation-9.37e-03

Pos logit contributions
ris+0.133
 narrative+0.129
spe+0.127
 gol+0.125
 nations+0.124
Neg logit contributions
cel-0.042
 lap-0.038
 forehead-0.034
 chin-0.032
tail-0.029
Token He
Token ID679
Feature activation-8.42e-03

Pos logit contributions
ris+0.018
 gol+0.017
spe+0.016
 nations+0.016
 narrative+0.015
Neg logit contributions
cel-0.015
 forehead-0.013
 lap-0.012
 chin-0.012
tail-0.012
Token wanted
Token ID2227
Feature activation-7.19e-03

Pos logit contributions
ris+0.015
 gol+0.013
 narrative+0.013
spe+0.013
 nations+0.013
Neg logit contributions
cel-0.014
 forehead-0.013
 lap-0.013
 chin-0.012
 cheeks-0.011
Token to
Token ID284
Feature activation-2.73e-03

Pos logit contributions
ris+0.014
 narrative+0.014
spe+0.013
 gol+0.013
 nations+0.012
Neg logit contributions
cel-0.010
 lap-0.009
 forehead-0.009
 chin-0.008
 shoulder-0.008
Token pond
Token ID16723
Feature activation+5.93e-04

Pos logit contributions
cel+0.016
 forehead+0.014
 lap+0.013
 chin+0.013
tail+0.012
Neg logit contributions
ris-0.019
 gol-0.019
 narrative-0.018
 nations-0.017
spe-0.017
Token?"
Token ID1701
Feature activation-6.55e-03

Pos logit contributions
cel+0.001
 forehead+0.001
 lap+0.001
 chin+0.001
 cheeks+0.001
Neg logit contributions
ris-0.001
 gol-0.001
 narrative-0.001
spe-0.001
 nations-0.001
Token
Token ID198
Feature activation+1.06e-03

Pos logit contributions
ris+0.016
 gol+0.015
spe+0.015
 narrative+0.014
 nations+0.014
Neg logit contributions
cel-0.007
 forehead-0.006
 lap-0.005
 chin-0.005
amas-0.005
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
cel+0.002
 forehead+0.002
 cheeks+0.001
 chin+0.001
 lap+0.001
Neg logit contributions
ris-0.002
 gol-0.002
spe-0.002
 narrative-0.002
 nations-0.002
TokenThe
Token ID464
Feature activation+7.60e-04

Pos logit contributions
cel+0.017
 forehead+0.016
 lap+0.016
 chin+0.014
 cheeks+0.013
Neg logit contributions
ris-0.025
 gol-0.023
 narrative-0.022
spe-0.022
 nations-0.022
Token gol
Token ID43791
Feature activation-8.44e-02

Pos logit contributions
cel+0.002
 forehead+0.001
 lap+0.001
 shoulder+0.001
 cheek+0.001
Neg logit contributions
ris-0.001
spe-0.001
 narrative-0.001
ile-0.001
 nations-0.001
Tokenfer
Token ID2232
Feature activation+4.87e-03

Pos logit contributions
ris+0.148
 narrative+0.134
 nations+0.132
igators+0.130
spe+0.128
Neg logit contributions
cel-0.141
 lap-0.114
 forehead-0.113
 chin-0.112
tail-0.111
Token replied
Token ID8712
Feature activation+5.35e-03

Pos logit contributions
cel+0.007
 forehead+0.007
 lap+0.007
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.009
 narrative-0.009
 gol-0.009
 nations-0.008
spe-0.008
Token,
Token ID11
Feature activation-6.43e-04

Pos logit contributions
cel+0.007
 lap+0.007
 forehead+0.007
 chin+0.006
tail+0.006
Neg logit contributions
ris-0.011
 gol-0.010
spe-0.010
 narrative-0.010
 nations-0.009
Token "
Token ID366
Feature activation+2.03e-02

Pos logit contributions
ris+0.001
 gol+0.001
spe+0.001
 narrative+0.001
 nations+0.001
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
tail-0.001
 chin-0.001
TokenI
Token ID40
Feature activation-1.40e-03

Pos logit contributions
cel+0.026
 lap+0.023
 forehead+0.022
 chin+0.021
tail+0.020
Neg logit contributions
ris-0.043
 nations-0.039
 narrative-0.038
 gol-0.037
spe-0.036
TokenTom
Token ID13787
Feature activation+4.81e-03

Pos logit contributions
cel+0.033
 lap+0.032
 forehead+0.030
 chin+0.029
tail+0.028
Neg logit contributions
ris-0.033
 narrative-0.031
 gol-0.031
spe-0.030
 nations-0.029
Token called
Token ID1444
Feature activation-1.58e-02

Pos logit contributions
cel+0.006
 forehead+0.005
 lap+0.005
 chin+0.005
 cheeks+0.005
Neg logit contributions
ris-0.010
 narrative-0.010
 gol-0.010
spe-0.009
 nations-0.009
Token out
Token ID503
Feature activation-1.51e-02

Pos logit contributions
ris+0.037
 narrative+0.035
 gol+0.035
spe+0.034
 nations+0.033
Neg logit contributions
cel-0.016
 lap-0.016
 forehead-0.016
 chin-0.013
 shoulder-0.013
Token to
Token ID284
Feature activation+1.69e-04

Pos logit contributions
 narrative+0.035
ris+0.034
 gol+0.032
 nations+0.031
spe+0.031
Neg logit contributions
cel-0.016
 lap-0.015
 forehead-0.015
 chin-0.014
 cheeks-0.013
Token the
Token ID262
Feature activation-8.79e-03

Pos logit contributions
cel+0.000
 forehead+0.000
 lap+0.000
 chin+0.000
amas+0.000
Neg logit contributions
ris-0.000
 gol-0.000
 nations-0.000
 narrative-0.000
spe-0.000
Token gol
Token ID43791
Feature activation-8.38e-02

Pos logit contributions
ris+0.017
spe+0.015
 narrative+0.014
ile+0.013
igators+0.013
Neg logit contributions
 feel-0.012
 shoulder-0.012
 forehead-0.012
 lap-0.012
 warm-0.012
Tokenfer
Token ID2232
Feature activation+5.88e-03

Pos logit contributions
ris+0.141
 narrative+0.128
spe+0.123
 nations+0.122
igators+0.121
Neg logit contributions
cel-0.148
 lap-0.127
 forehead-0.125
 chin-0.123
tail-0.122
Token "
Token ID366
Feature activation+1.60e-02

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
 cheeks+0.005
Neg logit contributions
ris-0.013
 gol-0.012
 narrative-0.012
spe-0.012
 nations-0.012
TokenHey
Token ID10814
Feature activation-1.18e-02

Pos logit contributions
cel+0.026
 forehead+0.025
 lap+0.025
 chin+0.023
 cheeks+0.023
Neg logit contributions
ris-0.029
 gol-0.027
spe-0.025
 narrative-0.025
ile-0.024
Token,
Token ID11
Feature activation+6.03e-03

Pos logit contributions
ris+0.024
 gol+0.022
 narrative+0.022
spe+0.022
 nations+0.021
Neg logit contributions
cel-0.016
 forehead-0.015
 lap-0.015
 chin-0.013
 cheeks-0.012
Token why
Token ID1521
Feature activation+2.59e-02

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 shoulder+0.005
 feel+0.005
Neg logit contributions
ris-0.014
 narrative-0.013
spe-0.013
 nations-0.012
igators-0.011
Token.
Token ID13
Feature activation+2.11e-03

Pos logit contributions
ris+0.012
 gol+0.011
 narrative+0.011
spe+0.011
 nations+0.010
Neg logit contributions
cel-0.008
 forehead-0.007
 lap-0.007
 chin-0.006
tail-0.006
Token They
Token ID1119
Feature activation-2.77e-03

Pos logit contributions
cel+0.003
 forehead+0.003
 lap+0.003
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.004
 gol-0.004
 narrative-0.004
spe-0.004
 nations-0.004
Token asked
Token ID1965
Feature activation-2.06e-03

Pos logit contributions
ris+0.005
 gol+0.005
 narrative+0.005
spe+0.005
 nations+0.005
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.004
 chin-0.003
 feel-0.003
Token a
Token ID257
Feature activation-2.55e-02

Pos logit contributions
ris+0.003
 nations+0.003
 narrative+0.003
spe+0.003
 gol+0.003
Neg logit contributions
cel-0.004
 forehead-0.003
 lap-0.003
 chin-0.003
amas-0.003
Token zo
Token ID40565
Feature activation-4.65e-02

Pos logit contributions
ris+0.050
 narrative+0.046
 gol+0.045
spe+0.044
 nations+0.043
Neg logit contributions
cel-0.037
 lap-0.035
 forehead-0.034
 chin-0.032
 cheeks-0.030
Tokenoke
Token ID2088
Feature activation-8.35e-02

Pos logit contributions
ris+0.061
 narrative+0.061
 gol+0.060
spe+0.057
 lasted+0.050
Neg logit contributions
cel-0.082
 forehead-0.075
 lap-0.075
 cheek-0.071
tail-0.069
Tokeneper
Token ID5723
Feature activation-4.35e-02

Pos logit contributions
 gol+0.132
ris+0.128
spe+0.122
 narrative+0.114
 clubhouse+0.102
Neg logit contributions
cel-0.140
 forehead-0.126
ze-0.126
 shoulder-0.124
 lap-0.124
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
ris+0.089
 gol+0.085
 narrative+0.084
spe+0.081
 nations+0.077
Neg logit contributions
cel-0.057
 lap-0.052
 forehead-0.051
 chin-0.047
 shoulder-0.044
Token who
Token ID508
Feature activation-1.99e-02

Pos logit contributions
ris+0.053
 gol+0.048
spe+0.045
 nations+0.044
 narrative+0.044
Neg logit contributions
cel-0.059
 forehead-0.056
 lap-0.055
 chin-0.054
 cheeks-0.052
Token pointed
Token ID6235
Feature activation-2.87e-02

Pos logit contributions
 gol+0.041
ris+0.041
 narrative+0.041
 nations+0.038
ole+0.037
Neg logit contributions
cel-0.023
 feel-0.018
 sn-0.016
 chin-0.016
 lap-0.016
Token them
Token ID606
Feature activation-8.34e-03

Pos logit contributions
ris+0.056
 narrative+0.052
 gol+0.052
spe+0.050
 nations+0.046
Neg logit contributions
 lap-0.040
cel-0.039
 forehead-0.039
 chin-0.037
 shoulder-0.034
Token,
Token ID11
Feature activation-1.44e-04

Pos logit contributions
cel+0.034
 forehead+0.031
 lap+0.031
 chin+0.028
tail+0.026
Neg logit contributions
ris-0.057
 gol-0.053
 narrative-0.052
spe-0.051
 nations-0.050
Token she
Token ID673
Feature activation-4.70e-03

Pos logit contributions
ris+0.000
 gol+0.000
 narrative+0.000
spe+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 forehead-0.000
 lap-0.000
 chin-0.000
tail-0.000
Token found
Token ID1043
Feature activation-1.94e-02

Pos logit contributions
ris+0.010
 narrative+0.009
 gol+0.009
spe+0.009
igators+0.009
Neg logit contributions
cel-0.006
 lap-0.005
 chin-0.005
 forehead-0.005
 feel-0.005
Token a
Token ID257
Feature activation-1.36e-02

Pos logit contributions
ris+0.047
spe+0.043
 narrative+0.043
 gol+0.043
 nations+0.041
Neg logit contributions
cel-0.020
 forehead-0.018
 lap-0.018
 chin-0.015
tail-0.014
Token big
Token ID1263
Feature activation-1.91e-02

Pos logit contributions
ris+0.033
 narrative+0.031
igators+0.029
spe+0.029
 nations+0.029
Neg logit contributions
cel-0.013
 forehead-0.013
 lap-0.012
 chin-0.012
 shoulder-0.011
Token er
Token ID1931
Feature activation-8.34e-02

Pos logit contributions
ris+0.040
spe+0.037
 narrative+0.037
 gol+0.034
igators+0.034
Neg logit contributions
cel-0.025
 lap-0.024
 forehead-0.024
 chin-0.024
 nose-0.022
Tokenaser
Token ID6005
Feature activation-4.57e-02

Pos logit contributions
spe+0.130
ris+0.127
igators+0.118
 narrative+0.115
 nations+0.115
Neg logit contributions
cel-0.154
 forehead-0.133
 lap-0.130
tail-0.130
 chin-0.125
Token on
Token ID319
Feature activation-1.41e-02

Pos logit contributions
ris+0.102
spe+0.101
 nations+0.097
 narrative+0.096
 gol+0.095
Neg logit contributions
cel-0.057
 lap-0.048
 forehead-0.046
tail-0.045
 chin-0.045
Token her
Token ID607
Feature activation+8.78e-03

Pos logit contributions
ris+0.035
 narrative+0.033
spe+0.033
 gol+0.033
 nations+0.031
Neg logit contributions
cel-0.013
 lap-0.011
 forehead-0.011
 chin-0.010
tail-0.009
Token teacher
Token ID4701
Feature activation-1.69e-02

Pos logit contributions
cel+0.013
 forehead+0.010
 lap+0.010
chief+0.010
amas+0.010
Neg logit contributions
ris-0.017
 gol-0.016
 nations-0.015
 narrative-0.015
spe-0.015
Token's
Token ID338
Feature activation-1.98e-02

Pos logit contributions
ris+0.044
 gol+0.041
 narrative+0.041
spe+0.040
 nations+0.039
Neg logit contributions
cel-0.014
 lap-0.012
 forehead-0.012
 chin-0.011
tail-0.009
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
cel+0.004
 forehead+0.004
 lap+0.004
 chin+0.003
tail+0.003
Neg logit contributions
ris-0.007
 gol-0.006
spe-0.006
 narrative-0.006
 nations-0.006
Token They
Token ID1119
Feature activation-1.08e-02

Pos logit contributions
ris+0.019
 gol+0.017
spe+0.016
 nations+0.016
 narrative+0.016
Neg logit contributions
cel-0.025
 forehead-0.024
 lap-0.022
 chin-0.021
 cheek-0.021
Token ask
Token ID1265
Feature activation+2.07e-03

Pos logit contributions
ris+0.027
spe+0.026
 gol+0.026
 narrative+0.026
ile+0.025
Neg logit contributions
cel-0.008
 feel-0.008
 forehead-0.008
 lap-0.007
 chin-0.007
Token a
Token ID257
Feature activation-2.33e-02

Pos logit contributions
cel+0.004
 forehead+0.003
 lap+0.003
 chin+0.003
amas+0.003
Neg logit contributions
ris-0.003
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token zo
Token ID40565
Feature activation-4.48e-02

Pos logit contributions
ris+0.041
 narrative+0.038
 gol+0.037
spe+0.036
 nations+0.035
Neg logit contributions
cel-0.037
 lap-0.035
 forehead-0.035
 chin-0.033
 shoulder-0.031
Tokenoke
Token ID2088
Feature activation-8.30e-02

Pos logit contributions
ris+0.056
 narrative+0.055
 gol+0.053
spe+0.051
 virtue+0.042
Neg logit contributions
cel-0.085
 lap-0.079
 forehead-0.079
 cheek-0.074
tail-0.074
Tokeneper
Token ID5723
Feature activation-3.88e-02

Pos logit contributions
 gol+0.129
ris+0.127
spe+0.120
 narrative+0.114
 virtue+0.101
Neg logit contributions
cel-0.139
 forehead-0.126
 lap-0.125
ze-0.124
 nose-0.124
Token what
Token ID644
Feature activation-1.18e-02

Pos logit contributions
ris+0.079
 gol+0.075
 narrative+0.074
spe+0.072
 nations+0.068
Neg logit contributions
cel-0.052
 lap-0.048
 forehead-0.047
 chin-0.043
 shoulder-0.040
Token it
Token ID340
Feature activation-2.97e-02

Pos logit contributions
ris+0.026
 gol+0.024
 narrative+0.023
spe+0.023
ole+0.021
Neg logit contributions
cel-0.013
 lap-0.012
tail-0.011
 chin-0.011
 forehead-0.010
Token means
Token ID1724
Feature activation-1.72e-02

Pos logit contributions
ris+0.055
 gol+0.050
 narrative+0.050
spe+0.049
 nations+0.048
Neg logit contributions
cel-0.047
 forehead-0.043
 lap-0.043
 chin-0.040
 cheeks-0.038
Token.
Token ID13
Feature activation-7.80e-03

Pos logit contributions
ris+0.042
 gol+0.039
 narrative+0.039
spe+0.037
ile+0.035
Neg logit contributions
cel-0.016
 lap-0.015
 forehead-0.014
 chin-0.013
 cheeks-0.012
Token and
Token ID290
Feature activation-2.55e-03

Pos logit contributions
ris+0.043
 gol+0.041
 narrative+0.041
spe+0.040
 nations+0.037
Neg logit contributions
cel-0.019
 lap-0.018
 chin-0.016
 forehead-0.016
 cheeks-0.015
Token circles
Token ID13332
Feature activation+4.27e-03

Pos logit contributions
ris+0.004
 gol+0.004
 narrative+0.004
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.005
 forehead-0.004
 lap-0.004
 chin-0.004
 cheeks-0.004
Token and
Token ID290
Feature activation-7.62e-03

Pos logit contributions
cel+0.006
 forehead+0.005
 lap+0.005
 chin+0.005
 cheeks+0.004
Neg logit contributions
ris-0.009
 gol-0.008
 narrative-0.008
spe-0.008
 nations-0.008
Token triangles
Token ID44360
Feature activation-2.16e-02

Pos logit contributions
ris+0.013
 gol+0.012
 narrative+0.012
spe+0.011
 nations+0.011
Neg logit contributions
cel-0.013
 lap-0.012
 forehead-0.012
 chin-0.012
 cheeks-0.011
Token and
Token ID290
Feature activation-1.24e-02

Pos logit contributions
ris+0.049
 narrative+0.047
 gol+0.047
spe+0.045
 nations+0.043
Neg logit contributions
cel-0.023
 lap-0.021
 forehead-0.020
 chin-0.020
tail-0.018
Token rect
Token ID13621
Feature activation-8.21e-02

Pos logit contributions
ris+0.021
 gol+0.019
 narrative+0.019
spe+0.019
 nations+0.018
Neg logit contributions
cel-0.021
 forehead-0.020
 lap-0.020
 chin-0.019
 cheeks-0.018
Tokenangles
Token ID27787
Feature activation-2.27e-02

Pos logit contributions
 gol+0.097
ris+0.091
 narrative+0.091
spe+0.088
guard+0.075
Neg logit contributions
cel-0.165
 lap-0.158
amas-0.156
 chin-0.154
 forehead-0.153
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
ris+0.050
 gol+0.047
 narrative+0.046
spe+0.045
 nations+0.044
Neg logit contributions
cel-0.027
 lap-0.025
 forehead-0.025
 chin-0.023
 cheeks-0.021
Token She
Token ID1375
Feature activation-2.37e-03

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.003
 forehead-0.003
 chin-0.003
 lap-0.003
 cheek-0.002
Token made
Token ID925
Feature activation+1.81e-02

Pos logit contributions
ris+0.005
 gol+0.005
 narrative+0.005
ole+0.005
 nations+0.005
Neg logit contributions
cel-0.003
 feel-0.002
 lap-0.002
 chin-0.002
 nose-0.002
Token her
Token ID607
Feature activation+1.87e-02

Pos logit contributions
cel+0.027
 forehead+0.025
 lap+0.025
 chin+0.023
tail+0.021
Neg logit contributions
ris-0.035
 gol-0.032
spe-0.031
 nations-0.031
 narrative-0.031
Token room
Token ID2119
Feature activation-3.03e-03

Pos logit contributions
ris+0.003
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.002
 lap-0.002
 forehead-0.002
 chin-0.002
 cheeks-0.001
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
ris+0.007
 gol+0.006
 narrative+0.006
spe+0.006
 nations+0.006
Neg logit contributions
cel-0.004
 lap-0.003
 forehead-0.003
 chin-0.003
 cheeks-0.003
Token saw
Token ID2497
Feature activation-2.42e-02

Pos logit contributions
ris+0.028
 gol+0.026
 narrative+0.026
spe+0.025
 nations+0.025
Neg logit contributions
cel-0.026
 forehead-0.024
 lap-0.024
 chin-0.022
amas-0.022
Token a
Token ID257
Feature activation-1.34e-02

Pos logit contributions
ris+0.050
 gol+0.046
 narrative+0.045
spe+0.045
 nations+0.044
Neg logit contributions
cel-0.033
 forehead-0.030
 lap-0.030
 chin-0.028
tail-0.026
Token big
Token ID1263
Feature activation-1.48e-02

Pos logit contributions
ris+0.030
 narrative+0.027
igators+0.027
 nations+0.027
spe+0.026
Neg logit contributions
cel-0.016
 lap-0.015
 forehead-0.015
 chin-0.014
 cheek-0.013
Token er
Token ID1931
Feature activation-8.20e-02

Pos logit contributions
ris+0.029
 narrative+0.026
spe+0.026
 gol+0.026
 nations+0.025
Neg logit contributions
cel-0.022
 lap-0.020
 forehead-0.020
 chin-0.019
 cheeks-0.018
Tokenaser
Token ID6005
Feature activation-4.22e-02

Pos logit contributions
ris+0.116
spe+0.114
 narrative+0.104
 gol+0.104
igators+0.103
Neg logit contributions
cel-0.164
 forehead-0.145
 lap-0.143
tail-0.138
 chin-0.138
Token on
Token ID319
Feature activation-1.01e-02

Pos logit contributions
ris+0.100
spe+0.099
 gol+0.094
 narrative+0.093
 nations+0.092
Neg logit contributions
cel-0.045
 lap-0.039
 chin-0.037
tail-0.036
 forehead-0.035
Token the
Token ID262
Feature activation-4.74e-04

Pos logit contributions
ris+0.024
 narrative+0.023
 gol+0.023
spe+0.023
 nations+0.022
Neg logit contributions
cel-0.010
 lap-0.009
 forehead-0.009
 chin-0.008
tail-0.007
Token teacher
Token ID4701
Feature activation-1.58e-02

Pos logit contributions
ris+0.001
 narrative+0.001
 gol+0.001
 nations+0.001
spe+0.001
Neg logit contributions
cel-0.001
chief-0.001
 forehead-0.001
 lap-0.001
tail-0.001
Token's
Token ID338
Feature activation-1.85e-02

Pos logit contributions
ris+0.039
 gol+0.037
 narrative+0.037
spe+0.036
 nations+0.035
Neg logit contributions
cel-0.015
 lap-0.013
 forehead-0.013
 chin-0.011
tail-0.010
Token named
Token ID3706
Feature activation-3.09e-02

Pos logit contributions
ris+0.019
 narrative+0.017
 gol+0.017
spe+0.016
 nations+0.016
Neg logit contributions
cel-0.024
 lap-0.022
 forehead-0.022
 chin-0.021
tail-0.020
Token Tim
Token ID5045
Feature activation-1.94e-02

Pos logit contributions
ris+0.050
 gol+0.045
 narrative+0.043
spe+0.043
 nations+0.043
Neg logit contributions
cel-0.056
 forehead-0.053
 lap-0.052
 chin-0.049
tail-0.047
Token found
Token ID1043
Feature activation-1.39e-02

Pos logit contributions
ris+0.035
 gol+0.032
 narrative+0.032
spe+0.032
 nations+0.031
Neg logit contributions
cel-0.031
 forehead-0.029
 lap-0.028
 chin-0.027
 cheeks-0.025
Token a
Token ID257
Feature activation-1.53e-02

Pos logit contributions
ris+0.031
 gol+0.029
 narrative+0.029
spe+0.028
 nations+0.028
Neg logit contributions
cel-0.016
 forehead-0.015
 lap-0.015
 chin-0.013
 cheeks-0.012
Token big
Token ID1263
Feature activation-1.73e-02

Pos logit contributions
ris+0.037
 narrative+0.035
 nations+0.034
spe+0.034
 gol+0.033
Neg logit contributions
cel-0.015
 forehead-0.014
 lap-0.014
 chin-0.013
 cheek-0.012
Token er
Token ID1931
Feature activation-8.09e-02

Pos logit contributions
ris+0.038
 narrative+0.035
spe+0.035
 nations+0.033
 gol+0.032
Neg logit contributions
cel-0.021
 lap-0.021
 chin-0.021
 forehead-0.021
 nose-0.019
Tokenaser
Token ID6005
Feature activation-4.05e-02

Pos logit contributions
spe+0.130
ris+0.125
igators+0.120
 narrative+0.120
 nations+0.119
Neg logit contributions
cel-0.140
 forehead-0.123
 lap-0.117
tail-0.117
 chin-0.113
Token.
Token ID13
Feature activation-4.05e-03

Pos logit contributions
ris+0.092
spe+0.090
 nations+0.090
 narrative+0.089
 gol+0.086
Neg logit contributions
cel-0.049
 lap-0.043
 forehead-0.040
 chin-0.039
tail-0.037
Token It
Token ID632
Feature activation+3.77e-04

Pos logit contributions
ris+0.008
 gol+0.008
spe+0.007
 narrative+0.007
 nations+0.007
Neg logit contributions
cel-0.006
 forehead-0.005
 lap-0.005
tail-0.004
ze-0.004
Token was
Token ID373
Feature activation+8.91e-04

Pos logit contributions
cel+0.001
 lap+0.000
 forehead+0.000
 chin+0.000
 cheeks+0.000
Neg logit contributions
ris-0.001
 gol-0.001
 narrative-0.001
spe-0.001
 nations-0.001
Token not
Token ID407
Feature activation-1.61e-02

Pos logit contributions
 warm+0.001
cel+0.001
 forehead+0.001
 lap+0.001
 cozy+0.001
Neg logit contributions
ris-0.002
spe-0.002
 narrative-0.002
 gol-0.002
 nations-0.002
Token you
Token ID345
Feature activation-8.82e-04

Pos logit contributions
cel+0.036
 forehead+0.035
 lap+0.034
 chin+0.031
amas+0.029
Neg logit contributions
ris-0.041
 narrative-0.041
 gol-0.040
spe-0.040
 nations-0.038
Token name
Token ID1438
Feature activation+2.11e-02

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 forehead-0.001
 lap-0.001
 chin-0.001
 cheeks-0.001
Token all
Token ID477
Feature activation-4.49e-02

Pos logit contributions
 forehead+0.041
 lap+0.040
 chin+0.037
cel+0.037
 cheeks+0.035
Neg logit contributions
ris-0.027
 nations-0.027
ile-0.026
 gol-0.026
igators-0.025
Token of
Token ID286
Feature activation-1.77e-02

Pos logit contributions
ris+0.090
 gol+0.082
 narrative+0.082
spe+0.081
 nations+0.078
Neg logit contributions
cel-0.064
 forehead-0.058
 lap-0.058
 chin-0.054
 cheeks-0.050
Token these
Token ID777
Feature activation-1.86e-02

Pos logit contributions
ris+0.026
 nations+0.026
 narrative+0.025
 gol+0.025
ole+0.024
Neg logit contributions
 forehead-0.031
 lap-0.030
cel-0.029
 chin-0.027
amas-0.026
Token le
Token ID443
Feature activation-8.05e-02

Pos logit contributions
ris+0.027
 gol+0.026
 nations+0.025
 narrative+0.025
spe+0.023
Neg logit contributions
 forehead-0.035
cel-0.034
 lap-0.034
 chin-0.032
tail-0.030
Tokenmons
Token ID11567
Feature activation-2.49e-02

Pos logit contributions
ris+0.129
spe+0.117
 narrative+0.115
 gol+0.111
 nations+0.109
Neg logit contributions
cel-0.146
 lap-0.133
 forehead-0.131
 chin-0.127
tail-0.121
Token?"
Token ID1701
Feature activation-7.60e-03

Pos logit contributions
ris+0.043
 gol+0.040
 narrative+0.040
 nations+0.038
spe+0.038
Neg logit contributions
cel-0.041
 forehead-0.039
 lap-0.038
 chin-0.035
 cheeks-0.033
Token Lily
Token ID20037
Feature activation-1.46e-02

Pos logit contributions
ris+0.009
 gol+0.008
 narrative+0.007
 planned+0.007
 nations+0.007
Neg logit contributions
 forehead-0.014
cel-0.013
 chin-0.013
ze-0.013
 upper-0.013
Token looked
Token ID3114
Feature activation-1.99e-02

Pos logit contributions
ris+0.023
 gol+0.021
spe+0.020
 narrative+0.019
 nations+0.019
Neg logit contributions
cel-0.026
 forehead-0.025
 lap-0.024
 chin-0.022
tail-0.022
Token at
Token ID379
Feature activation-1.69e-02

Pos logit contributions
ris+0.038
 narrative+0.035
spe+0.034
 gol+0.034
 nations+0.032
Neg logit contributions
cel-0.030
 forehead-0.028
 lap-0.028
 chin-0.026
 feel-0.025
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
ris+0.004
 gol+0.004
 nations+0.004
 narrative+0.004
spe+0.004
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.003
 cheeks-0.003
 chin-0.003
Token I
Token ID314
Feature activation-6.97e-03

Pos logit contributions
ris+0.055
 gol+0.051
 narrative+0.051
spe+0.049
 nations+0.048
Neg logit contributions
cel-0.032
 forehead-0.030
 lap-0.028
 chin-0.026
amas-0.025
Token have
Token ID423
Feature activation-2.74e-02

Pos logit contributions
ris+0.013
 narrative+0.013
 gol+0.012
 nations+0.012
spe+0.012
Neg logit contributions
cel-0.010
 lap-0.009
 forehead-0.009
 chin-0.008
 feel-0.008
Token a
Token ID257
Feature activation-2.00e-02

Pos logit contributions
ris+0.058
 gol+0.054
 narrative+0.053
spe+0.053
 nations+0.051
Neg logit contributions
cel-0.035
 lap-0.033
 forehead-0.033
 chin-0.030
 cheeks-0.028
Token new
Token ID649
Feature activation-2.74e-02

Pos logit contributions
ris+0.041
spe+0.038
 narrative+0.038
 nations+0.037
 gol+0.036
Neg logit contributions
 lap-0.027
 forehead-0.026
 chin-0.024
cel-0.024
 shoulder-0.023
Token er
Token ID1931
Feature activation-8.00e-02

Pos logit contributions
ris+0.056
spe+0.052
 narrative+0.051
 nations+0.049
igators+0.048
Neg logit contributions
 lap-0.038
 forehead-0.036
 chin-0.034
 shoulder-0.034
 nose-0.033
Tokenaser
Token ID6005
Feature activation-4.30e-02

Pos logit contributions
ris+0.115
spe+0.110
 narrative+0.105
 nations+0.101
 gol+0.101
Neg logit contributions
cel-0.155
 forehead-0.141
 lap-0.137
tail-0.132
 chin-0.130
Token for
Token ID329
Feature activation-1.08e-02

Pos logit contributions
ris+0.099
 narrative+0.093
 gol+0.092
spe+0.092
 nations+0.089
Neg logit contributions
cel-0.047
 lap-0.043
 forehead-0.043
 chin-0.039
tail-0.037
Token you
Token ID345
Feature activation-1.58e-02

Pos logit contributions
ris+0.021
 gol+0.020
 nations+0.020
 narrative+0.020
spe+0.019
Neg logit contributions
cel-0.015
 lap-0.013
 forehead-0.013
 chin-0.012
amas-0.011
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
ris+0.035
 gol+0.032
 narrative+0.031
spe+0.031
 nations+0.031
Neg logit contributions
cel-0.020
 forehead-0.017
 lap-0.017
 chin-0.016
 cheeks-0.014
Token It
Token ID632
Feature activation-2.01e-03

Pos logit contributions
ris+0.051
 gol+0.047
 narrative+0.047
spe+0.046
 nations+0.045
Neg logit contributions
cel-0.026
 forehead-0.024
 lap-0.024
 chin-0.022
 cheeks-0.020
Token took
Token ID1718
Feature activation+1.86e-02

Pos logit contributions
ris+0.028
 gol+0.026
spe+0.025
 narrative+0.025
 nations+0.024
Neg logit contributions
cel-0.020
 forehead-0.018
 lap-0.018
 chin-0.016
tail-0.015
Token a
Token ID257
Feature activation-9.58e-03

Pos logit contributions
cel+0.029
 forehead+0.027
 lap+0.026
 chin+0.024
amas+0.023
Neg logit contributions
ris-0.036
 gol-0.033
 narrative-0.031
spe-0.030
 nations-0.030
Token deep
Token ID2769
Feature activation+7.74e-03

Pos logit contributions
ris+0.018
 gol+0.016
spe+0.015
 narrative+0.015
 nations+0.015
Neg logit contributions
cel-0.015
 forehead-0.013
 lap-0.013
 chin-0.013
amas-0.012
Token breath
Token ID8033
Feature activation+4.56e-03

Pos logit contributions
cel+0.008
 lap+0.008
 forehead+0.008
 chin+0.008
 cheeks+0.007
Neg logit contributions
ris-0.018
 narrative-0.017
spe-0.017
 gol-0.016
 nations-0.016
Token and
Token ID290
Feature activation+3.92e-03

Pos logit contributions
cel+0.005
 lap+0.005
 forehead+0.005
 chin+0.004
 shoulder+0.004
Neg logit contributions
ris-0.010
 narrative-0.010
 gol-0.009
spe-0.009
 nations-0.009
Token le
Token ID443
Feature activation-7.99e-02

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
amas+0.006
Neg logit contributions
ris-0.006
 gol-0.006
spe-0.006
 narrative-0.005
 nations-0.005
Tokenaped
Token ID5813
Feature activation-1.78e-02

Pos logit contributions
ris+0.105
 gol+0.092
 narrative+0.090
spe+0.089
 nations+0.085
Neg logit contributions
cel-0.168
 forehead-0.159
 lap-0.158
 chin-0.151
 cheeks-0.144
Token over
Token ID625
Feature activation-1.30e-03

Pos logit contributions
ris+0.040
 narrative+0.038
 gol+0.037
spe+0.036
 nations+0.036
Neg logit contributions
cel-0.020
 lap-0.020
 forehead-0.020
 chin-0.018
 cheeks-0.016
Token the
Token ID262
Feature activation-3.63e-03

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 lap-0.001
 forehead-0.001
 chin-0.001
 cheeks-0.001
Token stack
Token ID8931
Feature activation-2.08e-02

Pos logit contributions
ris+0.006
spe+0.005
 nations+0.005
 narrative+0.005
 gol+0.005
Neg logit contributions
cel-0.006
 forehead-0.006
tail-0.006
 lap-0.006
chief-0.005
Token.
Token ID13
Feature activation-5.86e-03

Pos logit contributions
ris+0.037
 narrative+0.036
 gol+0.035
 nations+0.034
spe+0.032
Neg logit contributions
 lap-0.032
 forehead-0.030
cel-0.030
 chin-0.029
 shoulder-0.027
Token Sus
Token ID8932
Feature activation-2.18e-02

Pos logit contributions
cel+0.000
 forehead+0.000
 lap+0.000
 chin+0.000
tail+0.000
Neg logit contributions
ris-0.000
 gol-0.000
spe-0.000
 narrative-0.000
 nations-0.000
Tokenie
Token ID494
Feature activation+8.80e-03

Pos logit contributions
ris+0.044
 gol+0.041
 narrative+0.041
spe+0.039
 nations+0.038
Neg logit contributions
cel-0.030
 forehead-0.027
 lap-0.027
 chin-0.025
 cheeks-0.023
Token had
Token ID550
Feature activation-1.76e-02

Pos logit contributions
cel+0.013
 lap+0.011
 forehead+0.011
 chin+0.011
 cheeks+0.010
Neg logit contributions
ris-0.017
 narrative-0.017
 gol-0.016
spe-0.016
 nations-0.015
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
ris+0.038
 gol+0.036
 narrative+0.036
spe+0.035
 nations+0.034
Neg logit contributions
cel-0.022
 lap-0.020
 forehead-0.020
 chin-0.018
tail-0.017
Token furry
Token ID40857
Feature activation-1.11e-02

Pos logit contributions
ris+0.028
 narrative+0.026
spe+0.026
 nations+0.026
 gol+0.025
Neg logit contributions
cel-0.014
 lap-0.014
 forehead-0.013
 chin-0.012
 nose-0.012
Token er
Token ID1931
Feature activation-7.98e-02

Pos logit contributions
ris+0.025
 gol+0.023
 narrative+0.023
spe+0.023
 nations+0.022
Neg logit contributions
cel-0.013
 forehead-0.012
 lap-0.012
 chin-0.011
 cheeks-0.010
Tokenaser
Token ID6005
Feature activation-4.12e-02

Pos logit contributions
spe+0.118
ris+0.117
 narrative+0.114
 nations+0.112
igators+0.109
Neg logit contributions
cel-0.149
 forehead-0.134
 lap-0.132
tail-0.129
 chin-0.126
Token in
Token ID287
Feature activation-8.40e-03

Pos logit contributions
ris+0.094
 narrative+0.092
 gol+0.089
spe+0.089
 nations+0.088
Neg logit contributions
cel-0.048
 lap-0.043
 forehead-0.040
 chin-0.039
tail-0.037
Token her
Token ID607
Feature activation+1.53e-02

Pos logit contributions
ris+0.021
 gol+0.020
 narrative+0.020
spe+0.019
 nations+0.019
Neg logit contributions
cel-0.008
 lap-0.007
 forehead-0.006
 chin-0.006
tail-0.005
Token hand
Token ID1021
Feature activation+2.81e-02

Pos logit contributions
cel+0.020
 forehead+0.017
 heal+0.016
amas+0.016
chief+0.015
Neg logit contributions
ris-0.030
 gol-0.027
 nations-0.027
 narrative-0.027
spe-0.026
Token and
Token ID290
Feature activation-4.50e-03

Pos logit contributions
cel+0.040
 forehead+0.037
 lap+0.035
 chin+0.033
amas+0.031
Neg logit contributions
ris-0.055
 gol-0.050
spe-0.049
 narrative-0.047
 nations-0.046
Token of
Token ID286
Feature activation-8.12e-03

Pos logit contributions
cel+0.020
 lap+0.019
 forehead+0.018
 chin+0.017
tail+0.016
Neg logit contributions
ris-0.021
 narrative-0.020
 gol-0.020
spe-0.020
 nations-0.019
Token the
Token ID262
Feature activation-2.33e-03

Pos logit contributions
ris+0.023
spe+0.022
 narrative+0.021
 gol+0.021
 nations+0.020
Neg logit contributions
 herself-0.004
cel-0.003
 lap-0.003
 forehead-0.003
 Mitt-0.003
Token house
Token ID2156
Feature activation+1.26e-02

Pos logit contributions
ris+0.005
 gol+0.004
spe+0.004
igators+0.004
 narrative+0.004
Neg logit contributions
cel-0.003
 forehead-0.003
 lap-0.003
 chin-0.002
 shoulder-0.002
Token's
Token ID338
Feature activation+4.43e-03

Pos logit contributions
cel+0.020
 forehead+0.019
 lap+0.019
 chin+0.018
amas+0.017
Neg logit contributions
ris-0.023
spe-0.021
 gol-0.020
 narrative-0.020
 nations-0.019
Token most
Token ID749
Feature activation-2.02e-02

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
 cheeks+0.005
Neg logit contributions
ris-0.008
 gol-0.008
 narrative-0.008
spe-0.008
 nations-0.007
Token tre
Token ID2054
Feature activation-7.98e-02

Pos logit contributions
 narrative+0.043
ris+0.043
igators+0.041
 nations+0.040
spe+0.040
Neg logit contributions
cel-0.021
 warm-0.019
 lap-0.019
 cozy-0.018
 forehead-0.018
Tokenasured
Token ID34006
Feature activation-2.70e-02

Pos logit contributions
ris+0.112
 narrative+0.107
spe+0.107
 gol+0.104
 nations+0.104
Neg logit contributions
cel-0.150
 forehead-0.139
 lap-0.137
 chin-0.131
 shoulder-0.127
Token treasures
Token ID29561
Feature activation-1.72e-02

Pos logit contributions
 narrative+0.050
ris+0.050
 gol+0.046
igators+0.044
 nations+0.043
Neg logit contributions
 forehead-0.036
cel-0.036
 lap-0.036
 chin-0.033
 shoulder-0.033
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
ris+0.036
 gol+0.034
 narrative+0.034
spe+0.033
 nations+0.032
Neg logit contributions
cel-0.022
 forehead-0.020
 lap-0.020
 chin-0.018
tail-0.017
Token<|endoftext|>
Token ID50256
Feature activation-3.80e-02

Pos logit contributions
ris+0.017
 gol+0.015
spe+0.015
 narrative+0.015
 nations+0.014
Neg logit contributions
 forehead-0.015
cel-0.015
 chin-0.014
 lap-0.013
 cheek-0.013
TokenOnce
Token ID7454
Feature activation-2.53e-02

Pos logit contributions
ris+0.075
 gol+0.066
 narrative+0.066
spe+0.062
 nations+0.061
Neg logit contributions
 forehead-0.057
cel-0.055
 lap-0.051
 chin-0.050
 cheeks-0.050
Token"
Token ID1
Feature activation+1.97e-02

Pos logit contributions
cel+0.027
 lap+0.025
 forehead+0.023
 chin+0.021
tail+0.021
Neg logit contributions
ris-0.048
 narrative-0.045
 gol-0.045
 nations-0.044
spe-0.043
TokenHow
Token ID2437
Feature activation+2.24e-02

Pos logit contributions
cel+0.027
 lap+0.023
 forehead+0.023
 chin+0.021
tail+0.020
Neg logit contributions
ris-0.041
 nations-0.037
 gol-0.037
spe-0.037
 narrative-0.036
Token much
Token ID881
Feature activation-7.02e-03

Pos logit contributions
cel+0.039
 forehead+0.036
 lap+0.036
 chin+0.034
 cheeks+0.032
Neg logit contributions
ris-0.037
 gol-0.034
 narrative-0.033
spe-0.033
 nations-0.031
Token is
Token ID318
Feature activation-2.52e-02

Pos logit contributions
ris+0.014
 gol+0.013
 narrative+0.012
spe+0.012
 nations+0.012
Neg logit contributions
cel-0.010
 forehead-0.010
 lap-0.009
 chin-0.009
 cheeks-0.008
Token this
Token ID428
Feature activation+1.02e-02

Pos logit contributions
ris+0.053
 narrative+0.048
 gol+0.048
spe+0.048
 nations+0.046
Neg logit contributions
cel-0.034
 forehead-0.031
 lap-0.030
 chin-0.028
 cheeks-0.026
Token dress
Token ID6576
Feature activation+4.87e-02

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.014
 chin+0.013
 cheeks+0.012
Neg logit contributions
ris-0.019
 gol-0.018
 narrative-0.018
spe-0.017
 nations-0.017
Token?"
Token ID1701
Feature activation+1.25e-02

Pos logit contributions
cel+0.084
 lap+0.078
 forehead+0.077
 chin+0.074
 cheeks+0.069
Neg logit contributions
ris-0.080
spe-0.074
 gol-0.072
 narrative-0.070
 nations-0.068
Token she
Token ID673
Feature activation-2.92e-05

Pos logit contributions
cel+0.022
 forehead+0.020
 lap+0.020
 chin+0.019
amas+0.018
Neg logit contributions
ris-0.020
 gol-0.019
spe-0.018
 narrative-0.018
 nations-0.017
Token asked
Token ID1965
Feature activation-5.41e-04

Pos logit contributions
ris+0.000
 gol+0.000
spe+0.000
 narrative+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 lap-0.000
 forehead-0.000
amas-0.000
 chin-0.000
Token the
Token ID262
Feature activation-9.84e-03

Pos logit contributions
ris+0.001
spe+0.001
 gol+0.001
 narrative+0.001
 nations+0.001
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
 chin-0.001
amas-0.001
Token shop
Token ID6128
Feature activation-7.80e-03

Pos logit contributions
ris+0.019
 gol+0.017
 narrative+0.017
spe+0.016
 nations+0.016
Neg logit contributions
cel-0.015
 forehead-0.014
 lap-0.014
 chin-0.013
 cheeks-0.012
Token you
Token ID345
Feature activation-2.38e-03

Pos logit contributions
cel+0.004
 lap+0.003
 forehead+0.003
 chin+0.003
amas+0.003
Neg logit contributions
ris-0.004
 gol-0.003
 nations-0.003
 narrative-0.003
spe-0.003
Token and
Token ID290
Feature activation-9.71e-04

Pos logit contributions
ris+0.004
spe+0.004
 nations+0.004
 gol+0.004
 narrative+0.004
Neg logit contributions
cel-0.004
 forehead-0.003
 lap-0.003
chief-0.003
 chin-0.003
Token I
Token ID314
Feature activation+1.56e-03

Pos logit contributions
ris+0.002
 gol+0.002
spe+0.002
 narrative+0.002
 nations+0.002
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
 chin-0.001
amas-0.001
Token won
Token ID1839
Feature activation+3.72e-03

Pos logit contributions
cel+0.002
 forehead+0.002
 lap+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.003
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token't
Token ID470
Feature activation-2.85e-02

Pos logit contributions
 lap+0.007
 forehead+0.007
cel+0.007
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.005
 gol-0.005
spe-0.004
 narrative-0.004
igators-0.004
Token let
Token ID1309
Feature activation+4.94e-02

Pos logit contributions
ris+0.060
 gol+0.056
spe+0.054
 narrative+0.053
 nations+0.052
Neg logit contributions
cel-0.038
 lap-0.034
 forehead-0.034
 chin-0.031
amas-0.029
Token anything
Token ID1997
Feature activation-8.33e-03

Pos logit contributions
cel+0.087
 lap+0.080
 forehead+0.078
 chin+0.073
 hips+0.067
Neg logit contributions
ris-0.076
 gol-0.073
igators-0.071
 nations-0.071
spe-0.069
Token bad
Token ID2089
Feature activation-4.19e-02

Pos logit contributions
ris+0.015
 gol+0.014
spe+0.014
 narrative+0.013
 nations+0.013
Neg logit contributions
cel-0.014
 lap-0.012
 forehead-0.012
 chin-0.012
amas-0.011
Token happen
Token ID1645
Feature activation+5.67e-03

Pos logit contributions
ris+0.091
 gol+0.085
 narrative+0.084
spe+0.083
 nations+0.081
Neg logit contributions
cel-0.052
 forehead-0.047
 lap-0.046
 chin-0.042
 cheeks-0.039
Token."
Token ID526
Feature activation-1.52e-03

Pos logit contributions
cel+0.007
 forehead+0.007
 lap+0.007
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.012
spe-0.011
 gol-0.011
 narrative-0.011
 nations-0.011
Token (space)
Token ID220
Feature activation+3.50e-03

Pos logit contributions
ris+0.003
spe+0.003
 gol+0.003
 narrative+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.002
tail-0.002
Token the
Token ID262
Feature activation+5.70e-03

Pos logit contributions
ris+0.001
 gol+0.000
 narrative+0.000
spe+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 lap-0.000
 forehead-0.000
 chin-0.000
 cheeks-0.000
Token morning
Token ID3329
Feature activation+3.24e-02

Pos logit contributions
cel+0.009
 forehead+0.008
 lap+0.008
 chin+0.007
tail+0.007
Neg logit contributions
ris-0.010
 narrative-0.010
 nations-0.009
 gol-0.009
spe-0.009
Token.
Token ID13
Feature activation-2.86e-03

Pos logit contributions
cel+0.053
 forehead+0.049
chief+0.043
 chin+0.042
 lap+0.042
Neg logit contributions
ris-0.053
spe-0.047
 nations-0.047
 gol-0.045
igators-0.044
Token But
Token ID887
Feature activation+1.47e-02

Pos logit contributions
ris+0.007
 gol+0.006
 narrative+0.006
spe+0.006
 nations+0.006
Neg logit contributions
cel-0.003
 forehead-0.003
 lap-0.003
 chin-0.003
 cheeks-0.002
Token the
Token ID262
Feature activation+4.75e-04

Pos logit contributions
cel+0.013
 forehead+0.013
 chin+0.013
 lap+0.012
 cheeks+0.012
Neg logit contributions
ris-0.035
 gol-0.031
 narrative-0.031
 nations-0.030
spe-0.030
Token wife
Token ID3656
Feature activation+4.11e-02

Pos logit contributions
cel+0.001
 lap+0.001
 forehead+0.001
 chin+0.001
tail+0.001
Neg logit contributions
ris-0.001
 gol-0.001
 narrative-0.001
spe-0.001
 nations-0.001
Token didn
Token ID1422
Feature activation-5.66e-04

Pos logit contributions
cel+0.063
 forehead+0.058
 lap+0.057
 chin+0.055
 cheeks+0.052
Neg logit contributions
ris-0.076
 gol-0.072
 narrative-0.070
spe-0.067
 nations-0.067
Token't
Token ID470
Feature activation-3.14e-02

Pos logit contributions
ris+0.001
spe+0.001
 gol+0.001
 nations+0.001
 narrative+0.001
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.000
 chin-0.000
tail-0.000
Token agree
Token ID4236
Feature activation-1.20e-02

Pos logit contributions
ris+0.069
 narrative+0.064
spe+0.064
 gol+0.063
 nations+0.061
Neg logit contributions
cel-0.036
 forehead-0.035
 lap-0.034
 feel-0.031
 chin-0.031
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
ris+0.022
 gol+0.019
 narrative+0.019
spe+0.019
 nations+0.019
Neg logit contributions
cel-0.020
 forehead-0.018
 lap-0.018
 chin-0.017
 cheeks-0.016
Token She
Token ID1375
Feature activation+9.16e-03

Pos logit contributions
ris+0.007
 gol+0.007
spe+0.006
 narrative+0.006
 nations+0.006
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.004
 chin-0.004
 cheeks-0.003
Token to
Token ID284
Feature activation-1.01e-03

Pos logit contributions
ris+0.050
 narrative+0.047
 gol+0.046
spe+0.044
 nations+0.044
Neg logit contributions
cel-0.036
 lap-0.034
 forehead-0.033
 chin-0.031
tail-0.029
Token do
Token ID466
Feature activation-1.19e-03

Pos logit contributions
ris+0.002
 nations+0.002
 narrative+0.002
 gol+0.002
igators+0.001
Neg logit contributions
cel-0.002
 forehead-0.002
amas-0.002
 chin-0.001
 lap-0.001
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
ris+0.002
 gol+0.002
 narrative+0.002
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.002
 forehead-0.001
 lap-0.001
 chin-0.001
 cheeks-0.001
Token There
Token ID1318
Feature activation+1.33e-02

Pos logit contributions
cel+0.010
 forehead+0.009
 lap+0.008
 chin+0.008
 cheek+0.008
Neg logit contributions
ris-0.017
 gol-0.015
spe-0.015
 narrative-0.015
 nations-0.015
Token were
Token ID547
Feature activation+2.32e-02

Pos logit contributions
cel+0.020
 lap+0.019
 forehead+0.018
 chin+0.017
 cheeks+0.016
Neg logit contributions
ris-0.026
 gol-0.024
 narrative-0.023
spe-0.022
 nations-0.022
Token flowers
Token ID12734
Feature activation+3.77e-02

Pos logit contributions
cel+0.030
 lap+0.028
 forehead+0.027
 chin+0.025
 cheeks+0.024
Neg logit contributions
ris-0.049
 gol-0.045
 narrative-0.044
spe-0.042
 nations-0.040
Token to
Token ID284
Feature activation+1.65e-02

Pos logit contributions
cel+0.052
 forehead+0.048
 lap+0.047
 chin+0.044
 cheeks+0.040
Neg logit contributions
ris-0.076
 gol-0.070
 narrative-0.069
spe-0.069
 nations-0.067
Token smell
Token ID8508
Feature activation+2.53e-02

Pos logit contributions
cel+0.025
amas+0.024
 forehead+0.023
 chin+0.022
 lap+0.022
Neg logit contributions
ris-0.030
 nations-0.028
 narrative-0.027
 gol-0.027
igators-0.027
Token,
Token ID11
Feature activation+1.07e-02

Pos logit contributions
cel+0.038
 forehead+0.036
 lap+0.035
 chin+0.033
tail+0.031
Neg logit contributions
ris-0.048
 gol-0.043
 narrative-0.043
spe-0.043
 nations-0.042
Token trees
Token ID7150
Feature activation+1.18e-02

Pos logit contributions
cel+0.016
 forehead+0.014
 lap+0.014
 chin+0.013
amas+0.013
Neg logit contributions
ris-0.020
 gol-0.018
spe-0.018
 nations-0.018
 narrative-0.018
Token to
Token ID284
Feature activation+9.26e-03

Pos logit contributions
cel+0.016
 forehead+0.015
 lap+0.015
 chin+0.013
amas+0.013
Neg logit contributions
ris-0.024
 gol-0.022
spe-0.022
 narrative-0.022
 nations-0.021
Token up
Token ID510
Feature activation+8.30e-03

Pos logit contributions
cel+0.012
 warm+0.012
 lap+0.012
 forehead+0.012
 down+0.011
Neg logit contributions
ris-0.032
 narrative-0.031
 gol-0.028
 nations-0.028
oral-0.027
Token and
Token ID290
Feature activation+5.09e-03

Pos logit contributions
 warm+0.005
 lap+0.005
 chin+0.005
 free+0.004
 down+0.004
Neg logit contributions
 narrative-0.022
 gol-0.021
ris-0.021
 nations-0.020
oral-0.019
Token down
Token ID866
Feature activation-4.64e-03

Pos logit contributions
cel+0.006
 forehead+0.006
 lap+0.006
 chin+0.005
 cheeks+0.005
Neg logit contributions
ris-0.011
 gol-0.010
 narrative-0.010
spe-0.010
 nations-0.010
Token.
Token ID13
Feature activation-8.18e-03

Pos logit contributions
ris+0.012
 gol+0.012
 narrative+0.012
spe+0.011
 nations+0.011
Neg logit contributions
 lap-0.003
cel-0.003
 forehead-0.003
 chin-0.003
 warm-0.002
Token It
Token ID632
Feature activation-5.68e-03

Pos logit contributions
ris+0.016
 narrative+0.015
 gol+0.015
spe+0.015
 nations+0.015
Neg logit contributions
cel-0.012
 forehead-0.010
 lap-0.010
 chin-0.009
tail-0.009
Token makes
Token ID1838
Feature activation+4.47e-02

Pos logit contributions
ris+0.013
 gol+0.012
 narrative+0.012
spe+0.011
 nations+0.011
Neg logit contributions
cel-0.007
 lap-0.006
 forehead-0.006
 chin-0.006
 cheeks-0.005
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
cel+0.071
 forehead+0.062
 lap+0.062
 chin+0.060
tail+0.055
Neg logit contributions
ris-0.082
 nations-0.077
 gol-0.076
spe-0.075
 narrative-0.072
Token nice
Token ID3621
Feature activation+6.97e-03

Pos logit contributions
 forehead+0.010
 lap+0.010
cel+0.010
 chin+0.009
 cheek+0.009
Neg logit contributions
ris-0.024
 gol-0.022
 narrative-0.022
spe-0.021
 nations-0.020
Token sound
Token ID2128
Feature activation+1.81e-02

Pos logit contributions
 lap+0.007
 forehead+0.007
cel+0.007
 chin+0.006
 cheek+0.006
Neg logit contributions
ris-0.017
 narrative-0.016
 gol-0.015
spe-0.015
 nations-0.015
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
cel+0.026
 forehead+0.024
 lap+0.024
 chin+0.022
amas+0.021
Neg logit contributions
ris-0.035
 gol-0.032
spe-0.032
 narrative-0.031
 nations-0.031
Token (newline)
Token ID198
Feature activation-4.71e-03

Pos logit contributions
ris+0.022
 narrative+0.021
 gol+0.020
 nations+0.020
spe+0.020
Neg logit contributions
cel-0.017
 forehead-0.015
 lap-0.013
 chin-0.013
tail-0.013
Token You
Token ID921
Feature activation+4.26e-03

Pos logit contributions
ris+0.014
 gol+0.013
 narrative+0.013
 nations+0.012
spe+0.012
Neg logit contributions
cel-0.011
 forehead-0.011
 lap-0.010
ze-0.009
 chin-0.009
Token can
Token ID460
Feature activation+5.31e-03

Pos logit contributions
cel+0.006
 forehead+0.005
 lap+0.005
 chin+0.005
tail+0.005
Neg logit contributions
ris-0.009
spe-0.008
 gol-0.008
 narrative-0.008
 nations-0.008
Token have
Token ID423
Feature activation-3.24e-03

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
 cheeks+0.005
Neg logit contributions
ris-0.011
 gol-0.010
 narrative-0.010
spe-0.010
 nations-0.010
Token some
Token ID617
Feature activation+1.92e-02

Pos logit contributions
ris+0.006
 narrative+0.005
 nations+0.005
 gol+0.005
spe+0.005
Neg logit contributions
cel-0.005
 forehead-0.005
 lap-0.005
 chin-0.004
 shoulder-0.004
Token soup
Token ID17141
Feature activation+3.03e-02

Pos logit contributions
cel+0.028
 forehead+0.026
 lap+0.026
 chin+0.024
 cheeks+0.022
Neg logit contributions
ris-0.037
 gol-0.034
 narrative-0.034
spe-0.034
 nations-0.033
Token.
Token ID13
Feature activation-2.49e-03

Pos logit contributions
cel+0.043
 forehead+0.039
 lap+0.039
 chin+0.036
 cheeks+0.034
Neg logit contributions
ris-0.061
 gol-0.056
 narrative-0.055
spe-0.054
 nations-0.053
Token You
Token ID921
Feature activation+1.05e-03

Pos logit contributions
ris+0.005
 narrative+0.004
 gol+0.004
 nations+0.004
igators+0.004
Neg logit contributions
cel-0.004
 forehead-0.003
 lap-0.003
 chin-0.003
amas-0.003
Token are
Token ID389
Feature activation-2.81e-03

Pos logit contributions
cel+0.002
 forehead+0.001
 lap+0.001
 chin+0.001
tail+0.001
Neg logit contributions
ris-0.002
 gol-0.002
spe-0.002
 narrative-0.002
 nations-0.002
Token a
Token ID257
Feature activation+2.04e-04

Pos logit contributions
ris+0.006
spe+0.006
 gol+0.006
 narrative+0.006
 nations+0.005
Neg logit contributions
cel-0.004
 forehead-0.003
 lap-0.003
 chin-0.003
tail-0.003
Token good
Token ID922
Feature activation-1.24e-03

Pos logit contributions
cel+0.000
 forehead+0.000
 lap+0.000
 chin+0.000
 cheek+0.000
Neg logit contributions
ris-0.000
spe-0.000
 gol-0.000
 narrative-0.000
 nations-0.000
Token girl
Token ID2576
Feature activation+6.92e-03

Pos logit contributions
ris+0.002
spe+0.002
 narrative+0.002
 nations+0.002
 gol+0.002
Neg logit contributions
 forehead-0.002
cel-0.002
 lap-0.002
 chin-0.002
 cheek-0.001
Tokenmy
Token ID1820
Feature activation+2.26e-03

Pos logit contributions
ris+0.009
 gol+0.008
 narrative+0.008
spe+0.008
 nations+0.008
Neg logit contributions
cel-0.007
 forehead-0.007
 lap-0.007
 chin-0.006
 cheeks-0.006
Token smiled
Token ID13541
Feature activation+3.45e-03

Pos logit contributions
cel+0.004
 lap+0.003
 forehead+0.003
 chin+0.003
tail+0.003
Neg logit contributions
ris-0.004
 gol-0.004
 narrative-0.004
spe-0.004
 nations-0.003
Token and
Token ID290
Feature activation+1.35e-02

Pos logit contributions
cel+0.006
 lap+0.005
 forehead+0.005
 chin+0.005
tail+0.005
Neg logit contributions
ris-0.006
 gol-0.006
spe-0.005
 narrative-0.005
 nations-0.005
Token said
Token ID531
Feature activation-4.65e-05

Pos logit contributions
cel+0.022
 forehead+0.021
 lap+0.020
 chin+0.019
 cheeks+0.018
Neg logit contributions
ris-0.024
 gol-0.022
 narrative-0.022
spe-0.021
 nations-0.021
Token,
Token ID11
Feature activation-6.89e-03

Pos logit contributions
ris+0.000
 gol+0.000
spe+0.000
 narrative+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 lap-0.000
 forehead-0.000
 chin-0.000
amas-0.000
Token "
Token ID366
Feature activation+1.96e-02

Pos logit contributions
ris+0.011
 gol+0.011
spe+0.011
 narrative+0.010
ole+0.009
Neg logit contributions
cel-0.011
 lap-0.010
tail-0.010
 forehead-0.010
 chin-0.009
TokenI
Token ID40
Feature activation+7.36e-03

Pos logit contributions
cel+0.030
 lap+0.028
 forehead+0.028
 chin+0.026
tail+0.025
Neg logit contributions
ris-0.036
 gol-0.033
 narrative-0.033
spe-0.032
 nations-0.032
Token know
Token ID760
Feature activation+2.06e-03

Pos logit contributions
cel+0.009
 lap+0.008
 forehead+0.008
 chin+0.008
 feel+0.007
Neg logit contributions
ris-0.015
 narrative-0.015
spe-0.014
 gol-0.014
 nations-0.014
Token,
Token ID11
Feature activation+5.60e-03

Pos logit contributions
cel+0.003
 lap+0.003
 forehead+0.003
 chin+0.002
amas+0.002
Neg logit contributions
ris-0.004
 gol-0.004
 nations-0.004
spe-0.004
 narrative-0.004
Token sweet
Token ID6029
Feature activation+1.01e-02

Pos logit contributions
cel+0.007
 lap+0.006
 forehead+0.006
 chin+0.005
tail+0.005
Neg logit contributions
ris-0.012
 gol-0.012
spe-0.011
 narrative-0.011
 nations-0.011
Tokenie
Token ID494
Feature activation+2.77e-02

Pos logit contributions
cel+0.018
 lap+0.017
 forehead+0.016
 chin+0.015
 hips+0.015
Neg logit contributions
ris-0.017
 gol-0.016
spe-0.015
 nations-0.015
 narrative-0.015
Token nodded
Token ID14464
Feature activation+7.81e-05

Pos logit contributions
cel+0.024
 forehead+0.023
 lap+0.023
 chin+0.021
 cheeks+0.020
Neg logit contributions
ris-0.034
 gol-0.032
 narrative-0.032
 nations-0.031
spe-0.031
Token and
Token ID290
Feature activation+6.91e-03

Pos logit contributions
cel+0.000
 lap+0.000
 forehead+0.000
tail+0.000
amas+0.000
Neg logit contributions
ris-0.000
spe-0.000
 gol-0.000
 narrative-0.000
ole-0.000
Token said
Token ID531
Feature activation-7.76e-03

Pos logit contributions
cel+0.010
 forehead+0.010
 lap+0.010
 chin+0.009
 cheeks+0.009
Neg logit contributions
ris-0.013
 narrative-0.012
 gol-0.012
spe-0.012
 nations-0.011
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
ris+0.016
 gol+0.015
spe+0.015
 narrative+0.015
 nations+0.014
Neg logit contributions
cel-0.010
 lap-0.009
 forehead-0.009
 chin-0.009
tail-0.008
Token "
Token ID366
Feature activation+1.83e-02

Pos logit contributions
ris+0.019
spe+0.018
 gol+0.018
 narrative+0.017
 nations+0.015
Neg logit contributions
cel-0.017
tail-0.016
 lap-0.015
 forehead-0.015
 chin-0.015
TokenOK
Token ID11380
Feature activation+3.00e-02

Pos logit contributions
cel+0.031
 forehead+0.029
 lap+0.029
 chin+0.027
tail+0.025
Neg logit contributions
ris-0.031
 gol-0.028
 narrative-0.028
 nations-0.027
spe-0.027
Token,
Token ID11
Feature activation-2.37e-03

Pos logit contributions
cel+0.050
 lap+0.044
 forehead+0.044
tail+0.042
 chin+0.040
Neg logit contributions
ris-0.053
 gol-0.046
spe-0.046
 narrative-0.044
ole-0.043
Token Mom
Token ID11254
Feature activation+7.76e-03

Pos logit contributions
ris+0.005
 gol+0.005
 narrative+0.005
spe+0.005
 nations+0.005
Neg logit contributions
cel-0.003
 forehead-0.002
 lap-0.002
 chin-0.002
tail-0.002
Tokenmy
Token ID1820
Feature activation+2.28e-02

Pos logit contributions
cel+0.014
 forehead+0.013
 lap+0.013
 chin+0.012
 cheeks+0.012
Neg logit contributions
ris-0.013
spe-0.011
 gol-0.010
 narrative-0.010
 nations-0.010
Token."
Token ID526
Feature activation+1.34e-02

Pos logit contributions
cel+0.034
 lap+0.031
 forehead+0.030
 chin+0.027
amas+0.027
Neg logit contributions
ris-0.045
spe-0.040
 gol-0.039
 narrative-0.037
 nations-0.037
Token<|endoftext|>
Token ID50256
Feature activation-2.32e-02

Pos logit contributions
cel+0.020
 forehead+0.017
tail+0.016
 lap+0.016
 chin+0.015
Neg logit contributions
ris-0.027
 gol-0.025
spe-0.024
 narrative-0.024
 nations-0.022
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
ris+0.008
 gol+0.007
 narrative+0.007
spe+0.007
ile+0.007
Neg logit contributions
cel-0.006
 forehead-0.006
 lap-0.006
 chin-0.005
 cheeks-0.005
Token snacks
Token ID28487
Feature activation+4.43e-03

Pos logit contributions
ris+0.036
 gol+0.031
spe+0.031
 narrative+0.030
ile+0.029
Neg logit contributions
cel-0.022
 forehead-0.021
 lap-0.021
 chin-0.019
 cheeks-0.018
Token.
Token ID13
Feature activation-5.74e-03

Pos logit contributions
cel+0.006
 forehead+0.005
 lap+0.005
 chin+0.005
 cheeks+0.004
Neg logit contributions
ris-0.009
 gol-0.009
 narrative-0.009
spe-0.008
 nations-0.008
Token (space)
Token ID220
Feature activation+1.94e-03

Pos logit contributions
ris+0.011
 gol+0.010
spe+0.010
 narrative+0.010
 nations+0.009
Neg logit contributions
cel-0.008
 forehead-0.008
 lap-0.008
 chin-0.007
 cheek-0.007
Token (newline)
Token ID198
Feature activation-3.00e-03

Pos logit contributions
 forehead+0.003
cel+0.003
 chin+0.003
 lap+0.003
 cheeks+0.003
Neg logit contributions
ris-0.004
 gol-0.003
spe-0.003
 narrative-0.003
 nations-0.003
Token (newline)
Token ID198
Feature activation+9.00e-03

Pos logit contributions
ris+0.006
 gol+0.004
spe+0.004
 narrative+0.004
 nations+0.004
Neg logit contributions
 forehead-0.005
cel-0.005
 chin-0.005
 cheeks-0.004
 lap-0.004
TokenSoon
Token ID28093
Feature activation+1.33e-02

Pos logit contributions
cel+0.014
 lap+0.012
 forehead+0.012
 chin+0.011
tail+0.011
Neg logit contributions
ris-0.017
 gol-0.016
 narrative-0.015
spe-0.015
 nations-0.015
Token Mo
Token ID4270
Feature activation+2.13e-02

Pos logit contributions
cel+0.015
 forehead+0.012
 lap+0.012
 chin+0.011
 cheeks+0.011
Neg logit contributions
ris-0.031
 gol-0.029
 narrative-0.029
spe-0.028
 nations-0.027
Token started
Token ID2067
Feature activation-2.43e-02

Pos logit contributions
cel+0.031
 lap+0.029
 forehead+0.029
 chin+0.026
tail+0.025
Neg logit contributions
ris-0.042
 gol-0.039
spe-0.038
 narrative-0.037
 nations-0.037
Token to
Token ID284
Feature activation+5.56e-04

Pos logit contributions
ris+0.049
 narrative+0.045
 gol+0.043
ile+0.041
 nations+0.041
Neg logit contributions
cel-0.035
 lap-0.034
 forehead-0.033
 chin-0.031
 cheeks-0.030
Token understand
Token ID1833
Feature activation+5.12e-03

Pos logit contributions
cel+0.001
 forehead+0.001
 lap+0.001
 chin+0.000
 cheeks+0.000
Neg logit contributions
ris-0.001
 gol-0.001
 narrative-0.001
spe-0.001
 nations-0.001
Token felt
Token ID2936
Feature activation+8.93e-03

Pos logit contributions
ris+0.015
 gol+0.013
 narrative+0.013
spe+0.013
 nations+0.012
Neg logit contributions
cel-0.014
 forehead-0.013
 lap-0.013
 chin-0.012
tail-0.011
Token funny
Token ID8258
Feature activation+1.00e-02

Pos logit contributions
cel+0.014
 lap+0.013
 forehead+0.012
 chin+0.012
amas+0.012
Neg logit contributions
ris-0.015
 gol-0.015
 narrative-0.015
spe-0.014
 nations-0.014
Token.
Token ID13
Feature activation+1.09e-03

Pos logit contributions
cel+0.011
 forehead+0.010
 lap+0.010
 chin+0.009
 cheeks+0.008
Neg logit contributions
ris-0.023
 gol-0.022
 narrative-0.022
spe-0.021
 nations-0.021
Token The
Token ID383
Feature activation+2.89e-03

Pos logit contributions
cel+0.002
 forehead+0.002
ze+0.002
tail+0.002
 chin+0.002
Neg logit contributions
 gol-0.001
ris-0.001
way-0.001
 nations-0.001
spe-0.001
Token letter
Token ID3850
Feature activation+5.27e-03

Pos logit contributions
cel+0.005
 forehead+0.004
 lap+0.004
 chin+0.004
 cheek+0.004
Neg logit contributions
ris-0.005
 gol-0.005
spe-0.005
 narrative-0.005
 nations-0.004
Token had
Token ID550
Feature activation-1.41e-03

Pos logit contributions
cel+0.006
 forehead+0.006
 lap+0.005
 chin+0.005
 nose+0.005
Neg logit contributions
ris-0.012
 narrative-0.012
 gol-0.011
 nations-0.011
igators-0.011
Token different
Token ID1180
Feature activation-1.60e-02

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.001
 cheeks-0.001
Token shapes
Token ID15268
Feature activation-5.55e-03

Pos logit contributions
ris+0.030
 gol+0.028
 narrative+0.028
spe+0.027
 nations+0.025
Neg logit contributions
cel-0.024
 forehead-0.023
 lap-0.023
 chin-0.021
 cheeks-0.021
Token on
Token ID319
Feature activation+5.78e-03

Pos logit contributions
ris+0.013
 narrative+0.013
 gol+0.013
spe+0.012
 nations+0.012
Neg logit contributions
cel-0.006
 lap-0.005
 chin-0.005
 forehead-0.005
 cheeks-0.004
Token it
Token ID340
Feature activation-5.59e-03

Pos logit contributions
cel+0.009
 forehead+0.008
 lap+0.008
 chin+0.007
 cheeks+0.007
Neg logit contributions
ris-0.011
 gol-0.010
 narrative-0.010
spe-0.010
 nations-0.010
Token.
Token ID13
Feature activation+9.29e-04

Pos logit contributions
ris+0.011
 gol+0.010
spe+0.010
 narrative+0.010
 nations+0.010
Neg logit contributions
cel-0.008
 forehead-0.008
 lap-0.007
 chin-0.007
amas-0.006
Token are
Token ID389
Feature activation-8.65e-03

Pos logit contributions
spe+0.022
ris+0.021
 narrative+0.021
 gol+0.021
ile+0.020
Neg logit contributions
cel-0.011
 feel-0.009
 lap-0.009
 forehead-0.008
 chin-0.008
Token very
Token ID845
Feature activation+8.43e-03

Pos logit contributions
ris+0.019
spe+0.018
 gol+0.018
 narrative+0.018
 nations+0.017
Neg logit contributions
cel-0.011
 forehead-0.009
 lap-0.009
 chin-0.008
 cheek-0.008
Token kind
Token ID1611
Feature activation-1.44e-03

Pos logit contributions
cel+0.012
 forehead+0.011
 lap+0.011
 chin+0.010
 cheeks+0.010
Neg logit contributions
ris-0.017
 gol-0.015
 narrative-0.015
spe-0.015
 nations-0.015
Token and
Token ID290
Feature activation+8.59e-03

Pos logit contributions
ris+0.003
 narrative+0.003
 gol+0.003
spe+0.003
 nations+0.003
Neg logit contributions
cel-0.001
 lap-0.001
 forehead-0.001
 chin-0.001
 cheeks-0.001
Token smart
Token ID4451
Feature activation-1.38e-02

Pos logit contributions
cel+0.013
 lap+0.012
 forehead+0.012
 chin+0.011
amas+0.011
Neg logit contributions
ris-0.016
 gol-0.015
 narrative-0.015
spe-0.014
 nations-0.014
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
ris+0.032
 gol+0.030
 narrative+0.030
spe+0.030
 nations+0.029
Neg logit contributions
cel-0.015
 forehead-0.013
 lap-0.013
 chin-0.012
 cheeks-0.011
Token Let
Token ID3914
Feature activation+3.86e-02

Pos logit contributions
ris+0.030
 gol+0.028
 narrative+0.027
spe+0.026
 nations+0.026
Neg logit contributions
cel-0.017
 forehead-0.017
 lap-0.016
 chin-0.015
 cheeks-0.014
Token's
Token ID338
Feature activation-1.44e-02

Pos logit contributions
cel+0.056
 forehead+0.054
 lap+0.052
 chin+0.049
 cheeks+0.045
Neg logit contributions
ris-0.070
 narrative-0.070
 gol-0.066
 nations-0.065
spe-0.061
Token go
Token ID467
Feature activation+7.22e-04

Pos logit contributions
ris+0.034
 gol+0.032
 narrative+0.032
spe+0.031
 nations+0.030
Neg logit contributions
cel-0.015
 lap-0.013
 forehead-0.013
 chin-0.012
tail-0.011
Token and
Token ID290
Feature activation-6.89e-03

Pos logit contributions
cel+0.001
 forehead+0.001
 cheeks+0.001
 chin+0.001
 lap+0.001
Neg logit contributions
ris-0.001
 gol-0.001
spe-0.001
 nations-0.001
 narrative-0.001
Token pay
Token ID1414
Feature activation-4.39e-03

Pos logit contributions
ris+0.016
 gol+0.015
 narrative+0.015
spe+0.015
 nations+0.014
Neg logit contributions
cel-0.008
 forehead-0.007
 lap-0.007
 chin-0.006
 cheeks-0.005
Token<|endoftext|>
Token ID50256
Feature activation-2.66e-02

Pos logit contributions
cel+0.011
 forehead+0.010
 lap+0.009
 chin+0.009
tail+0.008
Neg logit contributions
ris-0.017
 gol-0.015
spe-0.015
 narrative-0.014
 nations-0.014
TokenOne
Token ID3198
Feature activation-2.30e-02

Pos logit contributions
ris+0.053
 gol+0.049
 narrative+0.048
spe+0.047
 nations+0.046
Neg logit contributions
cel-0.038
 forehead-0.037
 lap-0.034
 chin-0.033
 cheeks-0.031
Token day
Token ID1110
Feature activation+1.60e-02

Pos logit contributions
ris+0.037
 narrative+0.033
 gol+0.033
spe+0.032
 nations+0.031
Neg logit contributions
cel-0.040
 lap-0.039
 forehead-0.039
 chin-0.037
 cheeks-0.034
Token,
Token ID11
Feature activation-8.75e-03

Pos logit contributions
cel+0.012
 forehead+0.011
 lap+0.010
 chin+0.010
 cheeks+0.009
Neg logit contributions
ris-0.042
 narrative-0.039
spe-0.039
 gol-0.039
 nations-0.038
Token George
Token ID4502
Feature activation+3.18e-03

Pos logit contributions
ris+0.019
 narrative+0.018
 gol+0.017
spe+0.017
 nations+0.017
Neg logit contributions
cel-0.010
 lap-0.009
 forehead-0.009
 chin-0.009
 cheeks-0.008
Token was
Token ID373
Feature activation-1.33e-02

Pos logit contributions
cel+0.005
 forehead+0.005
 lap+0.005
 chin+0.004
 cheeks+0.004
Neg logit contributions
ris-0.006
 gol-0.005
spe-0.005
 narrative-0.005
 nations-0.005
Token playing
Token ID2712
Feature activation+6.18e-03

Pos logit contributions
ris+0.029
spe+0.027
 narrative+0.027
 gol+0.026
 nations+0.026
Neg logit contributions
cel-0.017
 lap-0.015
 forehead-0.015
 chin-0.014
tail-0.013
Token in
Token ID287
Feature activation+1.88e-03

Pos logit contributions
cel+0.009
 lap+0.008
 forehead+0.008
 chin+0.007
tail+0.007
Neg logit contributions
ris-0.013
 narrative-0.012
spe-0.012
 gol-0.011
 nations-0.011
Token the
Token ID262
Feature activation+7.14e-03

Pos logit contributions
cel+0.002
 forehead+0.002
 lap+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.004
 gol-0.004
 narrative-0.004
spe-0.004
 nations-0.004
Token living
Token ID2877
Feature activation-9.47e-03

Pos logit contributions
cel+0.011
 lap+0.011
 forehead+0.010
 chin+0.010
tail+0.009
Neg logit contributions
ris-0.013
spe-0.012
 narrative-0.012
 gol-0.012
 nations-0.011
Token room
Token ID2119
Feature activation-9.75e-04

Pos logit contributions
ris+0.016
 gol+0.015
spe+0.015
 nations+0.014
 narrative+0.014
Neg logit contributions
cel-0.016
 forehead-0.016
 lap-0.015
 chin-0.014
 cheeks-0.014
Token walking
Token ID6155
Feature activation+7.46e-03

Pos logit contributions
cel+0.025
 forehead+0.024
 lap+0.024
 chin+0.023
 cheeks+0.021
Neg logit contributions
ris-0.023
 gol-0.021
 narrative-0.020
spe-0.020
 nations-0.019
Token would
Token ID561
Feature activation+2.31e-02

Pos logit contributions
cel+0.012
 forehead+0.011
 lap+0.011
 chin+0.010
 cheeks+0.009
Neg logit contributions
ris-0.014
 gol-0.013
 narrative-0.013
spe-0.013
 nations-0.012
Token be
Token ID307
Feature activation+1.27e-02

Pos logit contributions
cel+0.030
 forehead+0.026
 lap+0.025
 chin+0.024
amas+0.023
Neg logit contributions
ris-0.050
spe-0.046
 gol-0.046
 narrative-0.045
 nations-0.044
Token fun
Token ID1257
Feature activation+6.04e-03

Pos logit contributions
cel+0.017
 forehead+0.016
 lap+0.016
 chin+0.014
tail+0.013
Neg logit contributions
ris-0.026
 gol-0.024
 narrative-0.024
spe-0.024
 nations-0.023
Token so
Token ID523
Feature activation+1.94e-02

Pos logit contributions
cel+0.007
 lap+0.007
 forehead+0.007
 chin+0.006
tail+0.006
Neg logit contributions
ris-0.013
 narrative-0.012
 gol-0.012
spe-0.012
 nations-0.011
Token he
Token ID339
Feature activation-2.22e-02

Pos logit contributions
cel+0.026
 lap+0.024
 forehead+0.024
 chin+0.023
 cheeks+0.021
Neg logit contributions
ris-0.039
 narrative-0.036
 gol-0.035
 nations-0.035
spe-0.034
Token opened
Token ID4721
Feature activation-1.87e-02

Pos logit contributions
ris+0.039
 narrative+0.036
 gol+0.036
spe+0.034
 nations+0.034
Neg logit contributions
cel-0.037
 forehead-0.034
 lap-0.034
 chin-0.032
 cheeks-0.030
Token the
Token ID262
Feature activation-1.43e-03

Pos logit contributions
ris+0.044
 narrative+0.042
 gol+0.042
spe+0.041
 nations+0.039
Neg logit contributions
cel-0.019
 lap-0.018
 forehead-0.017
 chin-0.016
chief-0.014
Token door
Token ID3420
Feature activation-1.14e-02

Pos logit contributions
ris+0.003
 narrative+0.003
 gol+0.003
spe+0.003
 nations+0.002
Neg logit contributions
cel-0.002
 lap-0.002
 forehead-0.002
 chin-0.002
 shoulder-0.002
Token and
Token ID290
Feature activation+4.62e-03

Pos logit contributions
ris+0.024
 gol+0.022
 narrative+0.022
spe+0.022
 nations+0.021
Neg logit contributions
cel-0.015
 lap-0.014
 forehead-0.014
 chin-0.012
 cheeks-0.011
Token left
Token ID1364
Feature activation-3.30e-03

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.009
 gol-0.008
 narrative-0.008
spe-0.008
 nations-0.008
Token to
Token ID284
Feature activation-1.38e-02

Pos logit contributions
ris+0.032
 narrative+0.029
 gol+0.029
spe+0.028
 nations+0.028
Neg logit contributions
cel-0.029
 forehead-0.028
 lap-0.027
 chin-0.026
 cheeks-0.024
Token help
Token ID1037
Feature activation+7.63e-03

Pos logit contributions
ris+0.032
 gol+0.030
 narrative+0.030
spe+0.030
 nations+0.029
Neg logit contributions
cel-0.015
 forehead-0.013
 lap-0.013
 chin-0.011
tail-0.010
Token him
Token ID683
Feature activation-4.30e-03

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.014
amas+0.013
 chin+0.013
Neg logit contributions
ris-0.009
 gol-0.009
ole-0.008
guard-0.007
spe-0.007
Token find
Token ID1064
Feature activation-5.20e-03

Pos logit contributions
ris+0.008
spe+0.007
 gol+0.006
ole+0.006
igators+0.006
Neg logit contributions
cel-0.007
 forehead-0.006
 lap-0.006
 chin-0.006
amas-0.006
Token his
Token ID465
Feature activation-1.64e-05

Pos logit contributions
ris+0.007
 gol+0.006
 bomb+0.006
 theirs+0.006
ile+0.005
Neg logit contributions
cel-0.010
 forehead-0.010
amas-0.009
 chin-0.009
 lap-0.009
Token l
Token ID300
Feature activation-3.91e-02

Pos logit contributions
ris+0.000
 gol+0.000
 narrative+0.000
ole+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 forehead-0.000
amas-0.000
 lap-0.000
ze-0.000
Tokenoll
Token ID692
Feature activation-1.84e-02

Pos logit contributions
ris+0.072
spe+0.070
 gol+0.068
 narrative+0.067
 nations+0.066
Neg logit contributions
cel-0.059
 lap-0.056
 forehead-0.055
 chin-0.051
tail-0.049
Tokenipop
Token ID42800
Feature activation+3.83e-03

Pos logit contributions
ris+0.027
 gol+0.021
 narrative+0.021
ole+0.020
ile+0.020
Neg logit contributions
cel-0.036
 lap-0.035
 forehead-0.034
amas-0.031
ze-0.031
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
cel+0.005
 forehead+0.004
 lap+0.004
 chin+0.004
 cheeks+0.004
Neg logit contributions
ris-0.008
 gol-0.008
 narrative-0.008
spe-0.007
 nations-0.007
Token\n
Token ID198
Feature activation+6.70e-03

Pos logit contributions
ris+0.011
 gol+0.010
spe+0.010
igators+0.009
 nations+0.009
Neg logit contributions
cel-0.010
 forehead-0.009
ze-0.008
 chin-0.008
 cheeks-0.008
Token\n
Token ID198
Feature activation+1.78e-02

Pos logit contributions
cel+0.010
 forehead+0.010
 chin+0.009
 cheeks+0.009
 lap+0.009
Neg logit contributions
ris-0.014
 gol-0.011
spe-0.011
 narrative-0.011
 nations-0.011
Token just
Token ID655
Feature activation-3.78e-03

Pos logit contributions
cel+0.004
 forehead+0.004
 lap+0.004
 chin+0.003
 cheeks+0.003
Neg logit contributions
ris-0.006
 gol-0.005
 narrative-0.005
spe-0.005
 nations-0.005
Token like
Token ID588
Feature activation-3.34e-03

Pos logit contributions
ris+0.006
 gol+0.006
 narrative+0.006
spe+0.005
igators+0.005
Neg logit contributions
cel-0.007
amas-0.006
 lap-0.006
 forehead-0.005
 chin-0.005
Token the
Token ID262
Feature activation-4.51e-03

Pos logit contributions
ris+0.006
 gol+0.006
 narrative+0.006
spe+0.006
 nations+0.006
Neg logit contributions
cel-0.005
 forehead-0.005
 lap-0.005
 chin-0.004
tail-0.004
Token sock
Token ID32263
Feature activation+5.26e-03

Pos logit contributions
ris+0.009
 gol+0.008
spe+0.008
 narrative+0.008
 nations+0.008
Neg logit contributions
cel-0.006
 forehead-0.006
 lap-0.006
 chin-0.005
 cheeks-0.005
Token!
Token ID0
Feature activation-8.50e-03

Pos logit contributions
cel+0.007
 forehead+0.006
 lap+0.006
 chin+0.006
 cheeks+0.005
Neg logit contributions
ris-0.011
 gol-0.010
spe-0.010
 narrative-0.010
 nations-0.009
Token<|endoftext|>
Token ID50256
Feature activation-2.88e-02

Pos logit contributions
ris+0.019
 gol+0.018
spe+0.017
 narrative+0.017
 nations+0.017
Neg logit contributions
cel-0.010
 forehead-0.009
 lap-0.009
 chin-0.008
 cheeks-0.007
TokenOnce
Token ID7454
Feature activation-1.17e-02

Pos logit contributions
ris+0.056
 gol+0.049
 narrative+0.048
spe+0.046
 nations+0.045
Neg logit contributions
 forehead-0.046
cel-0.044
 lap-0.040
 cheeks-0.040
 chin-0.040
Token upon
Token ID2402
Feature activation+5.71e-03

Pos logit contributions
ris+0.021
 gol+0.019
 narrative+0.019
spe+0.018
 nations+0.017
Neg logit contributions
cel-0.018
 lap-0.017
 forehead-0.017
 chin-0.016
 feel-0.015
Token a
Token ID257
Feature activation+7.03e-03

Pos logit contributions
cel+0.010
 chin+0.009
 forehead+0.009
 lap+0.009
 cheeks+0.009
Neg logit contributions
ris-0.009
 gol-0.008
 narrative-0.008
 nations-0.008
ile-0.007
Token time
Token ID640
Feature activation-5.69e-03

Pos logit contributions
cel+0.015
 forehead+0.014
 lap+0.014
 chin+0.013
 cheeks+0.013
Neg logit contributions
ris-0.009
 gol-0.008
 narrative-0.008
spe-0.008
 nations-0.007
Token,
Token ID11
Feature activation+9.63e-03

Pos logit contributions
ris+0.012
 gol+0.011
 narrative+0.011
spe+0.011
 nations+0.011
Neg logit contributions
cel-0.007
 forehead-0.007
 lap-0.007
 chin-0.006
tail-0.006
TokenBut
Token ID1537
Feature activation+4.91e-03

Pos logit contributions
cel+0.011
 lap+0.010
 forehead+0.009
 chin+0.009
tail+0.008
Neg logit contributions
ris-0.013
 gol-0.012
spe-0.012
 narrative-0.011
 nations-0.011
Token one
Token ID530
Feature activation+3.03e-03

Pos logit contributions
cel+0.006
 forehead+0.006
 lap+0.005
 chin+0.005
 cheeks+0.005
Neg logit contributions
ris-0.011
 gol-0.010
spe-0.010
 narrative-0.010
 nations-0.010
Token day
Token ID1110
Feature activation+2.71e-02

Pos logit contributions
 lap+0.004
 forehead+0.004
cel+0.004
 warm+0.004
 chin+0.004
Neg logit contributions
ris-0.006
 narrative-0.005
spe-0.005
 gol-0.005
 nations-0.005
Token,
Token ID11
Feature activation-4.15e-03

Pos logit contributions
cel+0.037
 forehead+0.035
 lap+0.034
 chin+0.031
amas+0.029
Neg logit contributions
ris-0.054
 gol-0.051
 narrative-0.050
spe-0.049
 nations-0.048
Token Lily
Token ID20037
Feature activation-7.03e-03

Pos logit contributions
ris+0.009
 gol+0.008
 narrative+0.008
spe+0.008
 nations+0.008
Neg logit contributions
cel-0.005
 forehead-0.005
 lap-0.005
 chin-0.004
 cheeks-0.004
Token accidentally
Token ID14716
Feature activation-6.13e-02

Pos logit contributions
ris+0.014
 gol+0.013
 narrative+0.013
spe+0.013
 nations+0.012
Neg logit contributions
cel-0.010
 forehead-0.009
 lap-0.009
 chin-0.008
 cheeks-0.008
Token stepped
Token ID10764
Feature activation+4.70e-05

Pos logit contributions
ris+0.101
 gol+0.092
spe+0.089
 narrative+0.088
 nations+0.085
Neg logit contributions
cel-0.104
 lap-0.099
 forehead-0.098
 chin-0.091
amas-0.088
Token on
Token ID319
Feature activation-1.49e-03

Pos logit contributions
cel+0.000
 forehead+0.000
 lap+0.000
 chin+0.000
tail+0.000
Neg logit contributions
ris-0.000
 gol-0.000
spe-0.000
 narrative-0.000
 nations-0.000
Token Buzz
Token ID15753
Feature activation+6.74e-03

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.002
 nations+0.002
spe+0.002
Neg logit contributions
cel-0.002
 forehead-0.002
 lap-0.002
 chin-0.002
amas-0.002
Token.
Token ID13
Feature activation+5.14e-03

Pos logit contributions
cel+0.009
 forehead+0.008
 lap+0.007
 chin+0.006
 cheeks+0.006
Neg logit contributions
ris-0.014
 gol-0.013
spe-0.013
 nations-0.013
 narrative-0.013
Token She
Token ID1375
Feature activation+1.19e-03

Pos logit contributions
cel+0.009
 forehead+0.009
 lap+0.008
 chin+0.008
tail+0.008
Neg logit contributions
 gol-0.007
ris-0.007
igators-0.007
spe-0.006
 nations-0.006
Token was
Token ID373
Feature activation+5.23e-03

Pos logit contributions
cel+0.025
 forehead+0.023
 lap+0.023
 chin+0.021
 cheeks+0.020
Neg logit contributions
ris-0.031
 gol-0.028
 narrative-0.028
spe-0.028
 nations-0.027
Token a
Token ID257
Feature activation-8.25e-03

Pos logit contributions
cel+0.006
 forehead+0.006
 lap+0.006
 chin+0.005
tail+0.005
Neg logit contributions
ris-0.012
 narrative-0.011
spe-0.011
 gol-0.011
 nations-0.011
Token charming
Token ID23332
Feature activation-8.87e-04

Pos logit contributions
ris+0.017
 gol+0.016
spe+0.016
 narrative+0.016
 nations+0.015
Neg logit contributions
cel-0.011
 forehead-0.010
 lap-0.010
 chin-0.009
amas-0.009
Token three
Token ID1115
Feature activation-2.00e-02

Pos logit contributions
ris+0.002
 narrative+0.002
spe+0.002
 nations+0.002
 gol+0.002
Neg logit contributions
cel-0.001
 forehead-0.001
 lap-0.001
 chin-0.001
 shoulder-0.001
Token-
Token ID12
Feature activation-2.96e-02

Pos logit contributions
ris+0.032
 gol+0.030
 narrative+0.029
spe+0.028
 nations+0.027
Neg logit contributions
cel-0.035
 lap-0.034
 forehead-0.033
 chin-0.031
 cheeks-0.030
Tokenyear
Token ID1941
Feature activation-6.32e-02

Pos logit contributions
 narrative+0.028
ris+0.028
 gol+0.025
 nations+0.025
spe+0.023
Neg logit contributions
cel-0.068
 forehead-0.065
 lap-0.064
tail-0.061
 cheek-0.059
Token old
Token ID1468
Feature activation-3.06e-02

Pos logit contributions
ris+0.139
 narrative+0.134
 gol+0.128
spe+0.124
 nations+0.124
Neg logit contributions
cel-0.070
 forehead-0.069
 lap-0.069
 chin-0.063
 cheeks-0.059
Token who
Token ID508
Feature activation-7.60e-03

Pos logit contributions
ris+0.068
 narrative+0.063
 gol+0.062
spe+0.061
 nations+0.059
Neg logit contributions
cel-0.036
 forehead-0.034
 lap-0.033
 chin-0.031
 cheeks-0.029
Token loved
Token ID6151
Feature activation-2.76e-02

Pos logit contributions
ris+0.013
 gol+0.012
 narrative+0.012
spe+0.012
 nations+0.011
Neg logit contributions
cel-0.013
 forehead-0.012
 lap-0.012
 chin-0.011
 cheeks-0.010
Token playing
Token ID2712
Feature activation+1.83e-03

Pos logit contributions
ris+0.053
 narrative+0.048
 gol+0.047
spe+0.045
 nations+0.044
Neg logit contributions
cel-0.041
 lap-0.040
 forehead-0.039
 chin-0.037
 cheeks-0.035
Token with
Token ID351
Feature activation+3.79e-03

Pos logit contributions
cel+0.003
 forehead+0.002
 lap+0.002
 chin+0.002
 cheeks+0.002
Neg logit contributions
ris-0.004
 gol-0.003
 narrative-0.003
spe-0.003
 nations-0.003
Token your
Token ID534
Feature activation+7.93e-03

Pos logit contributions
ris+0.074
 narrative+0.067
 gol+0.067
spe+0.067
 nations+0.064
Neg logit contributions
cel-0.026
 lap-0.025
 forehead-0.023
 chin-0.023
tail-0.021
Token hand
Token ID1021
Feature activation+3.68e-02

Pos logit contributions
cel+0.012
 forehead+0.010
 lap+0.009
amas+0.009
 chin+0.009
Neg logit contributions
ris-0.015
 narrative-0.014
 gol-0.014
 nations-0.014
spe-0.014
Token?"
Token ID1701
Feature activation-9.76e-03

Pos logit contributions
cel+0.069
 forehead+0.063
 lap+0.061
 chin+0.057
 cheeks+0.055
Neg logit contributions
ris-0.055
 gol-0.049
 narrative-0.048
spe-0.048
 nations-0.047
Token the
Token ID262
Feature activation-6.06e-03

Pos logit contributions
ris+0.021
 gol+0.020
spe+0.019
 narrative+0.019
 nations+0.018
Neg logit contributions
cel-0.012
 forehead-0.011
 lap-0.011
 chin-0.010
amas-0.009
Token zo
Token ID40565
Feature activation-3.39e-02

Pos logit contributions
ris+0.014
 gol+0.013
 narrative+0.013
spe+0.013
 nations+0.012
Neg logit contributions
cel-0.007
 lap-0.006
 forehead-0.006
 chin-0.005
tail-0.005
Tokenoke
Token ID2088
Feature activation-6.85e-02

Pos logit contributions
ris+0.041
 narrative+0.038
 gol+0.037
spe+0.032
 bomb+0.030
Neg logit contributions
cel-0.069
 forehead-0.065
 lap-0.064
 chin-0.061
 cheeks-0.060
Tokeneper
Token ID5723
Feature activation-2.53e-02

Pos logit contributions
ris+0.084
 gol+0.078
 narrative+0.074
spe+0.068
 nations+0.065
Neg logit contributions
cel-0.146
 forehead-0.140
 lap-0.136
 chin-0.133
 shoulder-0.128
Token asked
Token ID1965
Feature activation+2.12e-05

Pos logit contributions
ris+0.047
 gol+0.044
 narrative+0.043
spe+0.042
 nations+0.041
Neg logit contributions
cel-0.039
 forehead-0.036
 lap-0.036
 chin-0.034
 cheeks-0.032
Token.
Token ID13
Feature activation-5.99e-03

Pos logit contributions
cel+0.000
 lap+0.000
 forehead+0.000
 chin+0.000
amas+0.000
Neg logit contributions
ris-0.000
spe-0.000
 gol-0.000
 nations-0.000
 narrative-0.000
Token\n
Token ID198
Feature activation-2.95e-03

Pos logit contributions
ris+0.013
 gol+0.013
spe+0.012
 narrative+0.012
 nations+0.012
Neg logit contributions
cel-0.007
 forehead-0.007
 lap-0.006
 chin-0.006
 cheeks-0.005
Token\n
Token ID198
Feature activation+7.34e-03

Pos logit contributions
ris+0.007
 narrative+0.007
 gol+0.007
spe+0.007
 nations+0.007
Neg logit contributions
cel-0.003
 lap-0.002
 forehead-0.002
 chin-0.002
tail-0.002
Token They
Token ID1119
Feature activation-1.06e-02

Pos logit contributions
ris+0.000
 gol+0.000
spe+0.000
 narrative+0.000
 nations+0.000
Neg logit contributions
cel-0.000
 forehead-0.000
 lap-0.000
 chin-0.000
 cheeks-0.000
Token wanted
Token ID2227
Feature activation+3.31e-03

Pos logit contributions
ris+0.019
 gol+0.018
 narrative+0.018
spe+0.017
 nations+0.017
Neg logit contributions
cel-0.017
 forehead-0.016
 lap-0.016
 chin-0.015
 cheeks-0.014
Token to
Token ID284
Feature activation+5.18e-03

Pos logit contributions
cel+0.005
 lap+0.005
 forehead+0.004
 chin+0.004
 cheeks+0.004
Neg logit contributions
ris-0.006
 narrative-0.006
 gol-0.006
spe-0.006
 nations-0.005
Token find
Token ID1064
Feature activation+2.33e-04

Pos logit contributions
cel+0.008
 forehead+0.007
amas+0.006
 lap+0.006
 chin+0.006
Neg logit contributions
ris-0.010
 gol-0.009
 nations-0.009
spe-0.009
 narrative-0.009
Token something
Token ID1223
Feature activation-2.98e-02

Pos logit contributions
cel+0.000
 forehead+0.000
 lap+0.000
 chin+0.000
amas+0.000
Neg logit contributions
ris-0.000
 gol-0.000
 nations-0.000
 narrative-0.000
spe-0.000
Token fun
Token ID1257
Feature activation-6.26e-02

Pos logit contributions
ris+0.064
 narrative+0.061
 gol+0.061
spe+0.059
 nations+0.057
Neg logit contributions
cel-0.036
 lap-0.034
 forehead-0.033
 chin-0.031
 warm-0.030
Token.
Token ID13
Feature activation-3.40e-03

Pos logit contributions
ris+0.127
 gol+0.122
 narrative+0.122
spe+0.119
 nations+0.112
Neg logit contributions
cel-0.082
 lap-0.078
 forehead-0.073
 chin-0.070
 shoulder-0.064
Token
Token ID220
Feature activation+6.67e-03

Pos logit contributions
ris+0.007
 gol+0.007
 narrative+0.007
spe+0.007
 nations+0.007
Neg logit contributions
cel-0.004
 forehead-0.004
 lap-0.004
 chin-0.003
 cheeks-0.003
Token\n
Token ID198
Feature activation+3.86e-03

Pos logit contributions
cel+0.010
 forehead+0.009
 chin+0.009
 lap+0.008
 cheeks+0.008
Neg logit contributions
ris-0.014
 gol-0.012
spe-0.012
 narrative-0.012
 nations-0.011
Token\n
Token ID198
Feature activation+1.48e-02

Pos logit contributions
cel+0.005
 forehead+0.005
 lap+0.005
 chin+0.005
 cheeks+0.005
Neg logit contributions
ris-0.008
spe-0.007
 gol-0.007
 narrative-0.007
 nations-0.007
TokenWhen
Token ID2215
Feature activation-8.44e-03

Pos logit contributions
cel+0.020
 lap+0.018
 forehead+0.017
 chin+0.016
tail+0.015
Neg logit contributions
ris-0.030
 narrative-0.029
 gol-0.028
 nations-0.027
spe-0.027
Token can
Token ID460
Feature activation+9.56e-03

Pos logit contributions
ris+0.010
 gol+0.010
 narrative+0.010
spe+0.009
 nations+0.009
Neg logit contributions
cel-0.008
 lap-0.007
 forehead-0.007
 chin-0.007
amas-0.007
Token I
Token ID314
Feature activation+9.25e-03

Pos logit contributions
cel+0.017
 forehead+0.017
 lap+0.016
amas+0.016
 chin+0.015
Neg logit contributions
igators-0.012
 narrative-0.012
spe-0.011
 gol-0.011
guard-0.011
Token slide
Token ID10649
Feature activation-1.64e-03

Pos logit contributions
cel+0.010
 forehead+0.009
 lap+0.009
 chin+0.008
 cheeks+0.008
Neg logit contributions
ris-0.021
 gol-0.020
 narrative-0.020
spe-0.019
 nations-0.019
Token with
Token ID351
Feature activation-5.53e-03

Pos logit contributions
ris+0.003
 gol+0.003
 narrative+0.003
spe+0.002
 nations+0.002
Neg logit contributions
cel-0.003
 forehead-0.003
 lap-0.003
 chin-0.002
 cheeks-0.002
Token the
Token ID262
Feature activation-4.01e-03

Pos logit contributions
ris+0.010
 gol+0.009
 nations+0.009
igators+0.009
 narrative+0.009
Neg logit contributions
cel-0.009
 forehead-0.008
 lap-0.008
 chin-0.007
amas-0.007
Token er
Token ID1931
Feature activation-6.53e-02

Pos logit contributions
ris+0.005
 gol+0.005
 narrative+0.005
 nations+0.004
spe+0.004
Neg logit contributions
cel-0.008
 forehead-0.008
 lap-0.008
 chin-0.007
tail-0.007
Tokenaser
Token ID6005
Feature activation-2.39e-02

Pos logit contributions
ris+0.086
spe+0.078
 narrative+0.075
 gol+0.074
 nations+0.071
Neg logit contributions
cel-0.138
 forehead-0.127
 lap-0.126
 chin-0.121
tail-0.117
Token too
Token ID1165
Feature activation-2.35e-02

Pos logit contributions
ris+0.046
spe+0.042
 gol+0.042
 narrative+0.042
 nations+0.040
Neg logit contributions
cel-0.035
 lap-0.033
 forehead-0.033
 chin-0.031
tail-0.029
Token?"
Token ID1701
Feature activation-9.78e-03

Pos logit contributions
ris+0.046
 gol+0.042
 narrative+0.042
spe+0.042
 nations+0.040
Neg logit contributions
cel-0.033
 lap-0.032
 forehead-0.031
 chin-0.029
tail-0.027
Token\n
Token ID198
Feature activation-3.14e-03

Pos logit contributions
ris+0.015
 gol+0.013
spe+0.013
igators+0.012
 narrative+0.011
Neg logit contributions
cel-0.018
 forehead-0.017
 chin-0.016
 lap-0.016
 hips-0.015
Token\n
Token ID198
Feature activation+9.11e-03

Pos logit contributions
ris+0.006
 gol+0.005
spe+0.005
 narrative+0.005
ile+0.004
Neg logit contributions
 forehead-0.005
cel-0.005
 chin-0.005
 cheeks-0.005
 lap-0.005