TOP ACTIVATIONS
MAX = 7.662

in her hand , what shapes were in the sky ,
find out what all the comm otion was about . When
blinked her eyes and imm ag ined she could see the
ice cream , and bought souven irs . They
He always felt happy and g iddy when he was sliding
Mary loved her ch ubby plate and ran around
sand cast les and se ag ull s . Jane was
were crashing and the se ag ull s were squ aw
play and bark and w ag their tails .
both loved to laugh and g iggle . They were the
the sand and the se ag ull s . They run
colors and had shiny car v ings on it .
L - V . Lil v ! That is not your
waves and wind and se ag ull s .
she couldn 't help but g iggle with joy .
wearing his bath robe and g iggle with joy .
elephants , lions and ac rob ats . They are amazed
mixed its own colors , scenes and music , all by
each other . They w ag their tails and lick each
noises like waves and se ag ull s . They have

MIN ACTIVATIONS
MIN = -7.320

. He pulled the c ork and a puff of smoke
She was able to scoop it up with her net and
the sky before they walked fore shadow ed the down pour
face twist . She spat it out and felt a pain
. She tried to spit it out . "
like it . He spat it out and threw it away
mouth , but he spat it out . He tasted something
very sour . They spit it out . Mom
. She quickly gob bled it up and felt so happy
at all . He spat it out and made a face
them cough . They spit it out and make faces .
unch up . She spat it out and said , "
her tongue . She spat it out and cried .
bitter ! He quickly spat it out . The
coffee down . He spat it out , then ran to
like it . She spit it out and said , "
mouth . You should spit it out . The teacher will
Her mom tried to pull it out too , but it
. Tim tried to pull it out , but it was
so sour that he spit it out and made a funny

INTERVAL 3.916 - 7.662
CONTAINS 0.438%

happy and safe from scary creatures . <|endoftext|> Once upon a
looked at the boy and cl ucked . The boy laughed
them a picture of a solar eclipse in the book .
In the distance , the lights from a house were flickering
with her family , the animals had already destroyed the store

INTERVAL 0.171 - 3.916
CONTAINS 45.541%

are the best . Do you want to keep the fish
I am here for you . I 'll be your friend
run . <|endoftext|> Once upon a time , there was a
set , Lily and her mom my went back outside to
the store to buy the wood and tools they needed .

INTERVAL -3.575 - 0.171
CONTAINS 52.172%

fun . But then , a big boy came
my calmed down and said , " I like puzzles and
wonder . The tree just kept growing , reaching
and hugged them tight . Now , she had two things
out of the ground . It looked like a step .

INTERVAL -7.320 - -3.575
CONTAINS 1.849%

hugged mom and said , " Thank you , mom .
fairy . She said , " I see you are feeling
was too fast . He got to his car and drove
As a treat , Lily got to watch one more cartoon
tall man . He was getting ready to take a bath
Token in
Token ID287
Feature activation+2.48e+00

Pos logit contributions
kin+0.024
 wear+0.022
apy+0.022
inton+0.022
 toes+0.022
Neg logit contributions
 further-0.018
 harder-0.017
 again-0.017
 tighter-0.016
 worse-0.016
Token her
Token ID607
Feature activation+3.21e+00

Pos logit contributions
kin+0.062
 wear+0.059
inton+0.057
 hardly+0.057
 toes+0.057
Neg logit contributions
 further-0.049
 harder-0.048
 tighter-0.047
 again-0.047
 aback-0.046
Token hand
Token ID1021
Feature activation+5.60e-01

Pos logit contributions
kin+0.071
 hardly+0.067
apy+0.066
inton+0.065
orses+0.065
Neg logit contributions
 further-0.071
 again-0.070
 harder-0.068
 tighter-0.067
 cal-0.065
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
kin+0.014
inton+0.013
 hardly+0.013
 wear+0.013
apy+0.013
Neg logit contributions
 again-0.011
 further-0.011
 harder-0.011
 tighter-0.010
 worse-0.010
Token what
Token ID644
Feature activation+1.98e+00

Pos logit contributions
kin+0.013
 wear+0.013
 hardly+0.013
 toes+0.013
inton+0.012
Neg logit contributions
 further-0.010
 harder-0.010
 tighter-0.010
 again-0.010
 cal-0.010
Token shapes
Token ID15268
Feature activation+7.66e+00

Pos logit contributions
kin+0.046
 wear+0.044
 hardly+0.042
 toes+0.042
inton+0.042
Neg logit contributions
 further-0.042
 tighter-0.041
 harder-0.041
 again-0.040
 cal-0.040
Token were
Token ID547
Feature activation+1.53e+00

Pos logit contributions
kin+0.155
inton+0.148
 personality+0.148
ippers+0.147
 toes+0.145
Neg logit contributions
 further-0.180
 again-0.178
 harder-0.169
 worse-0.168
 cal-0.162
Token in
Token ID287
Feature activation+3.14e+00

Pos logit contributions
kin+0.040
 wear+0.038
 personality+0.037
inton+0.037
 hardly+0.037
Neg logit contributions
 further-0.029
 harder-0.028
 again-0.027
 tighter-0.027
 worse-0.026
Token the
Token ID262
Feature activation+4.37e-01

Pos logit contributions
kin+0.076
 wear+0.071
inton+0.070
 hardly+0.070
 personality+0.069
Neg logit contributions
 further-0.065
 harder-0.064
 again-0.063
 tighter-0.061
 aback-0.060
Token sky
Token ID6766
Feature activation+3.40e+00

Pos logit contributions
kin+0.010
 wear+0.010
 toes+0.009
 personality+0.009
inton+0.009
Neg logit contributions
 further-0.009
 tighter-0.009
 harder-0.009
 again-0.009
 aback-0.009
Token,
Token ID11
Feature activation+5.67e-01

Pos logit contributions
kin+0.085
 hardly+0.080
 wear+0.080
inton+0.079
 personality+0.078
Neg logit contributions
 again-0.067
 further-0.066
 harder-0.065
 tighter-0.062
 worse-0.062
Token find
Token ID1064
Feature activation-1.71e+00

Pos logit contributions
 further+0.042
 harder+0.042
 tighter+0.042
 aback+0.041
 worse+0.041
Neg logit contributions
 wear-0.025
kin-0.024
 hardly-0.023
 toes-0.022
 personality-0.022
Token out
Token ID503
Feature activation+6.80e-01

Pos logit contributions
 tighter+0.031
lim+0.031
 harder+0.030
 further+0.030
 again+0.029
Neg logit contributions
 wear-0.041
kin-0.041
 toes-0.040
 hardly-0.039
 personality-0.039
Token what
Token ID644
Feature activation+7.31e-01

Pos logit contributions
kin+0.017
 wear+0.016
 hardly+0.016
 toes+0.016
inton+0.016
Neg logit contributions
 further-0.013
 harder-0.013
 tighter-0.013
 again-0.013
 cal-0.012
Token all
Token ID477
Feature activation+2.63e+00

Pos logit contributions
kin+0.017
ippers+0.016
 wear+0.016
inton+0.016
 toes+0.016
Neg logit contributions
 further-0.016
 again-0.015
 harder-0.015
 worse-0.015
 tighter-0.015
Token the
Token ID262
Feature activation+2.43e+00

Pos logit contributions
kin+0.064
 wear+0.061
 hardly+0.059
inton+0.059
 toes+0.059
Neg logit contributions
 further-0.054
 harder-0.052
 again-0.052
 tighter-0.052
 cal-0.051
Token comm
Token ID725
Feature activation+7.55e+00

Pos logit contributions
kin+0.053
 wear+0.052
 toes+0.050
 hardly+0.050
 personality+0.049
Neg logit contributions
 tighter-0.053
 further-0.053
 harder-0.052
 aback-0.052
 again-0.052
Tokenotion
Token ID9650
Feature activation+1.47e+00

Pos logit contributions
 personality+0.174
pron+0.169
 beard+0.160
omo+0.157
anas+0.157
Neg logit contributions
 again-0.162
ter-0.158
 further-0.157
 aback-0.142
 iv-0.140
Token was
Token ID373
Feature activation+5.11e-01

Pos logit contributions
kin+0.034
 wear+0.032
ippers+0.032
 personality+0.032
inton+0.032
Neg logit contributions
 again-0.031
 further-0.031
 harder-0.030
 worse-0.029
 tighter-0.029
Token about
Token ID546
Feature activation+1.52e+00

Pos logit contributions
kin+0.012
apy+0.011
 toes+0.011
 wear+0.011
ippers+0.011
Neg logit contributions
 further-0.011
 again-0.011
 harder-0.011
 worse-0.010
 tighter-0.010
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
kin+0.038
 wear+0.035
inton+0.035
 hardly+0.034
apy+0.034
Neg logit contributions
 further-0.031
 again-0.031
 harder-0.031
 tighter-0.029
 cal-0.028
Token When
Token ID1649
Feature activation-6.17e-01

Pos logit contributions
 wear+0.003
 hardly+0.003
kin+0.003
 toes+0.003
 personality+0.003
Neg logit contributions
 further-0.007
 tighter-0.007
 harder-0.007
 worse-0.007
 cal-0.007
Token blinked
Token ID40360
Feature activation+1.09e+00

Pos logit contributions
 wear+0.022
 hardly+0.021
kin+0.021
 toes+0.021
 personality+0.020
Neg logit contributions
 aback-0.029
 further-0.028
 tighter-0.028
 harder-0.028
 worse-0.028
Token her
Token ID607
Feature activation+9.71e-02

Pos logit contributions
 toes+0.023
 wear+0.022
kin+0.022
 personality+0.021
 hardly+0.021
Neg logit contributions
 tighter-0.024
 further-0.024
 worse-0.024
 cal-0.024
lim-0.024
Token eyes
Token ID2951
Feature activation+6.58e-01

Pos logit contributions
kin+0.002
inton+0.002
 wear+0.002
 hardly+0.002
orses+0.002
Neg logit contributions
 further-0.002
 harder-0.002
 tighter-0.002
 again-0.002
 cal-0.002
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
 wear+0.015
kin+0.015
 hardly+0.015
 toes+0.015
 ago+0.014
Neg logit contributions
 further-0.013
 tighter-0.013
 cal-0.013
 worse-0.013
 aback-0.013
Token imm
Token ID2296
Feature activation+3.96e+00

Pos logit contributions
kin+0.008
 wear+0.008
 toes+0.008
 hardly+0.008
 personality+0.007
Neg logit contributions
 further-0.008
 tighter-0.008
 harder-0.008
 aback-0.008
 worse-0.008
Tokenag
Token ID363
Feature activation+7.40e+00

Pos logit contributions
kin+0.089
 wear+0.085
 hardly+0.085
 toes+0.083
 personality+0.082
Neg logit contributions
 further-0.087
 harder-0.084
 tighter-0.084
 again-0.082
 worse-0.081
Tokenined
Token ID1389
Feature activation+3.44e+00

Pos logit contributions
kin+0.177
 wear+0.177
ippers+0.171
inton+0.171
 hardly+0.170
Neg logit contributions
 again-0.139
 further-0.135
 harder-0.134
 tighter-0.130
 aback-0.129
Token she
Token ID673
Feature activation-4.56e-01

Pos logit contributions
kin+0.071
 toes+0.070
 wear+0.069
 personality+0.066
 hardly+0.066
Neg logit contributions
 further-0.077
 tighter-0.076
 harder-0.075
lim-0.074
 aback-0.074
Token could
Token ID714
Feature activation+1.49e+00

Pos logit contributions
 further+0.010
 harder+0.010
 tighter+0.010
 again+0.009
 cal+0.009
Neg logit contributions
kin-0.010
 wear-0.010
 hardly-0.010
 toes-0.010
inton-0.010
Token see
Token ID766
Feature activation+5.28e-01

Pos logit contributions
kin+0.028
inton+0.026
ippers+0.025
 personality+0.024
 importantly+0.024
Neg logit contributions
 again-0.040
 further-0.040
 harder-0.039
 closer-0.038
 cal-0.037
Token the
Token ID262
Feature activation-6.20e-01

Pos logit contributions
 wear+0.012
kin+0.012
 toes+0.012
 personality+0.011
 hardly+0.011
Neg logit contributions
 tighter-0.011
 further-0.011
 harder-0.011
 aback-0.011
lim-0.010
Token ice
Token ID4771
Feature activation-7.98e-01

Pos logit contributions
 aback+0.005
 tighter+0.005
 further+0.005
lim+0.005
minded+0.005
Neg logit contributions
kin-0.005
 toes-0.005
 wear-0.005
apy-0.005
 ago-0.005
Token cream
Token ID8566
Feature activation-3.02e-03

Pos logit contributions
 further+0.016
 aback+0.016
 cal+0.016
 tighter+0.015
 worse+0.015
Neg logit contributions
kin-0.021
 wear-0.019
 toes-0.019
inton-0.018
ippers-0.018
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 further+0.000
 aback+0.000
 tighter+0.000
 harder+0.000
 cal+0.000
Neg logit contributions
kin-0.000
 wear-0.000
 toes-0.000
inton-0.000
 hardly-0.000
Token and
Token ID290
Feature activation+7.81e-01

Pos logit contributions
kin+0.046
 wear+0.044
inton+0.043
 hardly+0.043
 toes+0.043
Neg logit contributions
 further-0.026
 harder-0.025
 tighter-0.025
 again-0.025
 cal-0.024
Token bought
Token ID5839
Feature activation-1.62e+00

Pos logit contributions
kin+0.018
 wear+0.017
inton+0.017
 hardly+0.017
ippers+0.017
Neg logit contributions
 further-0.017
 harder-0.016
 again-0.016
 tighter-0.016
 cal-0.016
Token souven
Token ID45043
Feature activation+7.33e+00

Pos logit contributions
 further+0.034
 aback+0.033
 tighter+0.033
lim+0.032
 harder+0.032
Neg logit contributions
 toes-0.036
 wear-0.036
kin-0.036
inton-0.034
 hardly-0.034
Tokenirs
Token ID17062
Feature activation+2.17e+00

Pos logit contributions
 personality+0.147
kin+0.143
inton+0.134
 toes+0.132
 hardly+0.132
Neg logit contributions
 again-0.173
 further-0.171
lim-0.170
 harder-0.166
 tighter-0.156
Token.
Token ID13
Feature activation+8.08e-01

Pos logit contributions
kin+0.051
 wear+0.048
 hardly+0.047
 personality+0.047
inton+0.047
Neg logit contributions
 further-0.046
 again-0.045
 harder-0.044
 tighter-0.044
 cal-0.042
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
kin+0.021
 toes+0.018
ippers+0.018
inton+0.018
 wear+0.017
Neg logit contributions
 Owl-0.017
 again-0.016
 harder-0.015
 cal-0.015
 further-0.015
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 again+0.003
 further+0.003
 harder+0.003
 tighter+0.003
 cal+0.003
Neg logit contributions
kin-0.005
 toes-0.005
 personality-0.005
ippers-0.005
 wear-0.004
TokenThey
Token ID2990
Feature activation+2.13e+00

Pos logit contributions
 further+0.023
 harder+0.022
 tighter+0.022
lim+0.021
 farther+0.021
Neg logit contributions
 toes-0.036
kin-0.034
 personality-0.034
ippers-0.033
 imagination-0.033
Token He
Token ID679
Feature activation+4.02e-01

Pos logit contributions
kin+0.027
inton+0.024
 wear+0.024
 toes+0.023
ippers+0.023
Neg logit contributions
 again-0.025
 further-0.024
 harder-0.024
 Owl-0.023
 aback-0.022
Token always
Token ID1464
Feature activation+1.50e+00

Pos logit contributions
kin+0.006
 wear+0.006
 hardly+0.006
 toes+0.006
inton+0.006
Neg logit contributions
 further-0.011
 tighter-0.011
 aback-0.011
 harder-0.011
 worse-0.011
Token felt
Token ID2936
Feature activation+5.45e-01

Pos logit contributions
kin+0.031
 wear+0.028
inton+0.028
 hardly+0.028
 toes+0.028
Neg logit contributions
 further-0.036
 harder-0.036
 tighter-0.035
 again-0.035
 cal-0.035
Token happy
Token ID3772
Feature activation+9.80e-01

Pos logit contributions
kin+0.012
 wear+0.011
inton+0.011
 toes+0.011
 hardly+0.011
Neg logit contributions
 further-0.013
 harder-0.013
 tighter-0.012
 cal-0.012
 again-0.012
Token and
Token ID290
Feature activation-3.69e-01

Pos logit contributions
kin+0.026
 wear+0.025
 toes+0.025
 hardly+0.025
inton+0.024
Neg logit contributions
 further-0.018
 tighter-0.017
 harder-0.017
 cal-0.016
 aback-0.016
Token g
Token ID308
Feature activation+7.28e+00

Pos logit contributions
 further+0.009
 harder+0.009
 again+0.008
 tighter+0.008
 cal+0.008
Neg logit contributions
kin-0.008
ilot-0.008
ippers-0.008
 ago-0.007
inton-0.007
Tokeniddy
Token ID34208
Feature activation-4.46e-02

Pos logit contributions
kin+0.129
 wear+0.123
pron+0.115
 arrive+0.112
 beard+0.109
Neg logit contributions
 again-0.181
lim-0.176
ter-0.167
 harder-0.160
ocular-0.158
Token when
Token ID618
Feature activation+7.94e-01

Pos logit contributions
 further+0.001
 again+0.001
 harder+0.001
 tighter+0.001
 worse+0.001
Neg logit contributions
kin-0.001
 wear-0.001
 hardly-0.001
 toes-0.001
inton-0.001
Token he
Token ID339
Feature activation-5.75e-01

Pos logit contributions
kin+0.019
inton+0.018
ippers+0.018
 wear+0.018
 personality+0.017
Neg logit contributions
 again-0.018
 further-0.018
 harder-0.017
 worse-0.016
 cal-0.016
Token was
Token ID373
Feature activation-1.23e+00

Pos logit contributions
 tighter+0.014
 further+0.014
 aback+0.014
 harder+0.013
 worse+0.013
Neg logit contributions
 wear-0.012
kin-0.011
 hardly-0.011
 toes-0.010
orses-0.010
Token sliding
Token ID22292
Feature activation+3.54e-01

Pos logit contributions
 further+0.028
 harder+0.028
 again+0.027
 tighter+0.027
 cal+0.027
Neg logit contributions
kin-0.027
 wear-0.025
inton-0.025
 toes-0.025
 personality-0.024
Token
Token ID198
Feature activation+5.05e-01

Pos logit contributions
kin+0.044
 personality+0.040
 wear+0.040
 toes+0.040
inton+0.038
Neg logit contributions
 again-0.037
 harder-0.036
 further-0.036
 aback-0.035
 cal-0.034
Token
Token ID198
Feature activation-7.83e-01

Pos logit contributions
kin+0.013
 toes+0.012
 personality+0.012
inton+0.012
 wear+0.012
Neg logit contributions
 again-0.010
 further-0.010
 harder-0.010
 cal-0.009
 aback-0.009
TokenMary
Token ID24119
Feature activation-3.32e-01

Pos logit contributions
 further+0.017
 harder+0.016
 tighter+0.016
 again+0.016
 cal+0.016
Neg logit contributions
kin-0.018
 wear-0.017
 toes-0.017
 personality-0.017
inton-0.017
Token loved
Token ID6151
Feature activation+1.79e+00

Pos logit contributions
 further+0.008
 harder+0.007
 tighter+0.007
 worse+0.007
 aback+0.007
Neg logit contributions
kin-0.007
 wear-0.007
 hardly-0.007
 toes-0.007
inton-0.007
Token her
Token ID607
Feature activation+2.56e+00

Pos logit contributions
kin+0.042
 wear+0.040
inton+0.039
 hardly+0.039
 toes+0.039
Neg logit contributions
 further-0.038
 harder-0.037
 tighter-0.037
 again-0.037
 cal-0.036
Token ch
Token ID442
Feature activation+7.26e+00

Pos logit contributions
kin+0.054
apy+0.054
ilot+0.053
inton+0.053
 hardly+0.053
Neg logit contributions
 again-0.057
 harder-0.056
 further-0.055
 tighter-0.054
 cal-0.053
Tokenubby
Token ID38393
Feature activation+2.84e+00

Pos logit contributions
 hardly+0.102
 Their+0.099
 wear+0.096
 arrive+0.094
apy+0.093
Neg logit contributions
 harder-0.200
 again-0.198
 tighter-0.197
 aback-0.197
 further-0.195
Token plate
Token ID7480
Feature activation-1.08e+00

Pos logit contributions
apy+0.056
kin+0.056
 wear+0.056
 hardly+0.055
inton+0.055
Neg logit contributions
 again-0.067
 harder-0.065
 tighter-0.064
 further-0.062
 worse-0.060
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
 further+0.021
 harder+0.021
 tighter+0.020
 again+0.020
 aback+0.020
Neg logit contributions
kin-0.027
 wear-0.026
 toes-0.025
 hardly-0.025
inton-0.024
Token ran
Token ID4966
Feature activation-2.21e+00

Pos logit contributions
kin+0.026
ippers+0.025
inton+0.024
ilot+0.024
 ago+0.023
Neg logit contributions
 again-0.023
 further-0.022
 tighter-0.022
 harder-0.022
 cal-0.021
Token around
Token ID1088
Feature activation+1.43e+00

Pos logit contributions
 further+0.033
 tighter+0.032
 harder+0.032
 again+0.031
 cal+0.031
Neg logit contributions
kin-0.065
 wear-0.063
 hardly-0.061
 toes-0.061
inton-0.061
Token sand
Token ID6450
Feature activation+1.93e+00

Pos logit contributions
 wear+0.029
 toes+0.029
kin+0.028
inton+0.027
 ago+0.026
Neg logit contributions
 tighter-0.025
 aback-0.025
 worse-0.025
 harder-0.025
 further-0.024
Tokencast
Token ID2701
Feature activation+2.46e+00

Pos logit contributions
kin+0.044
 wear+0.042
 toes+0.041
 hardly+0.041
inton+0.041
Neg logit contributions
 further-0.042
 tighter-0.041
 harder-0.041
 again-0.040
 cal-0.039
Tokenles
Token ID829
Feature activation+3.31e+00

Pos logit contributions
kin+0.073
 hardly+0.069
 personality+0.069
inton+0.069
apy+0.068
Neg logit contributions
 further-0.038
 again-0.037
 harder-0.037
 tighter-0.035
 farther-0.032
Token and
Token ID290
Feature activation-2.28e-01

Pos logit contributions
kin+0.099
 wear+0.095
 personality+0.094
 hardly+0.093
inton+0.093
Neg logit contributions
 further-0.050
 again-0.049
 harder-0.047
 tighter-0.043
 farther-0.042
Token se
Token ID384
Feature activation+1.39e+00

Pos logit contributions
 tighter+0.004
 further+0.004
 harder+0.004
 aback+0.004
 cal+0.004
Neg logit contributions
kin-0.006
 wear-0.005
 toes-0.005
 hardly-0.005
inton-0.005
Tokenag
Token ID363
Feature activation+7.22e+00

Pos logit contributions
kin+0.028
 wear+0.026
 hardly+0.025
 toes+0.025
inton+0.025
Neg logit contributions
 further-0.034
 harder-0.034
 tighter-0.034
 again-0.033
 aback-0.032
Tokenull
Token ID724
Feature activation+4.45e-01

Pos logit contributions
 wear+0.180
kin+0.174
 personality+0.173
pron+0.168
 hardly+0.168
Neg logit contributions
 again-0.132
 further-0.128
 harder-0.126
 tighter-0.116
st-0.114
Tokens
Token ID82
Feature activation+2.55e+00

Pos logit contributions
kin+0.013
 wear+0.012
 personality+0.011
 hardly+0.011
 ago+0.011
Neg logit contributions
 again-0.008
 further-0.008
 harder-0.007
 tighter-0.007
st-0.007
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
kin+0.064
 wear+0.064
 personality+0.061
 hardly+0.060
apy+0.058
Neg logit contributions
 again-0.051
 further-0.049
 harder-0.048
 tighter-0.044
 farther-0.044
Token Jane
Token ID12091
Feature activation+1.05e-01

Pos logit contributions
kin+0.008
 wear+0.007
inton+0.007
ippers+0.007
 toes+0.007
Neg logit contributions
 further-0.009
 again-0.009
 harder-0.009
 tighter-0.009
 aback-0.009
Token was
Token ID373
Feature activation+6.51e-01

Pos logit contributions
kin+0.003
 wear+0.002
 hardly+0.002
 toes+0.002
inton+0.002
Neg logit contributions
 further-0.002
 harder-0.002
 tighter-0.002
 again-0.002
 cal-0.002
Token were
Token ID547
Feature activation+3.43e-01

Pos logit contributions
kin+0.059
ippers+0.059
 toes+0.057
 personality+0.057
inton+0.056
Neg logit contributions
 again-0.056
 further-0.055
 harder-0.052
 worse-0.051
 farther-0.049
Token crashing
Token ID21899
Feature activation+1.11e+00

Pos logit contributions
kin+0.008
 wear+0.008
 toes+0.008
 hardly+0.008
inton+0.008
Neg logit contributions
 further-0.007
 tighter-0.007
 aback-0.007
 again-0.007
 harder-0.007
Token and
Token ID290
Feature activation+4.39e-01

Pos logit contributions
kin+0.029
 wear+0.028
inton+0.028
 toes+0.028
 hardly+0.027
Neg logit contributions
 further-0.020
 cal-0.019
 tighter-0.019
 aback-0.019
 harder-0.019
Token the
Token ID262
Feature activation+3.38e-01

Pos logit contributions
kin+0.010
 wear+0.010
 toes+0.009
 hardly+0.009
inton+0.009
Neg logit contributions
 further-0.010
 tighter-0.009
 harder-0.009
 aback-0.009
 again-0.009
Token se
Token ID384
Feature activation+1.19e+00

Pos logit contributions
kin+0.009
 wear+0.008
 hardly+0.008
apy+0.008
inton+0.008
Neg logit contributions
 further-0.006
 harder-0.006
 again-0.006
 tighter-0.006
 cal-0.006
Tokenag
Token ID363
Feature activation+7.21e+00

Pos logit contributions
kin+0.019
 toes+0.019
 wear+0.018
 hardly+0.017
apy+0.017
Neg logit contributions
 further-0.033
 harder-0.033
 tighter-0.033
 again-0.032
 aback-0.032
Tokenull
Token ID724
Feature activation+1.25e+00

Pos logit contributions
 wear+0.203
 personality+0.200
 importantly+0.195
 blonde+0.195
 classmates+0.194
Neg logit contributions
 again-0.108
st-0.099
 further-0.097
ter-0.090
 harder-0.088
Tokens
Token ID82
Feature activation+3.17e+00

Pos logit contributions
kin+0.033
ippers+0.031
 personality+0.031
 ago+0.031
 wear+0.031
Neg logit contributions
 again-0.023
 further-0.023
 harder-0.022
 worse-0.022
 tighter-0.022
Token were
Token ID547
Feature activation+7.90e-01

Pos logit contributions
kin+0.070
 wear+0.068
 personality+0.066
ippers+0.065
inton+0.065
Neg logit contributions
 again-0.070
 further-0.069
 harder-0.069
 worse-0.067
 tighter-0.066
Token squ
Token ID2809
Feature activation+1.06e+00

Pos logit contributions
kin+0.021
 wear+0.021
 personality+0.020
ippers+0.020
inton+0.020
Neg logit contributions
 further-0.014
 harder-0.014
 tighter-0.014
 again-0.014
 worse-0.014
Tokenaw
Token ID707
Feature activation-1.75e-01

Pos logit contributions
inton+0.019
ippers+0.018
 wear+0.018
 toes+0.018
kin+0.018
Neg logit contributions
 Owl-0.028
 further-0.027
 again-0.027
 harder-0.027
 worse-0.026
Token play
Token ID711
Feature activation+1.82e+00

Pos logit contributions
 further+0.027
 again+0.027
 harder+0.026
 tighter+0.026
 farther+0.026
Neg logit contributions
kin-0.019
ippers-0.017
inton-0.016
ilot-0.015
 importantly-0.015
Token and
Token ID290
Feature activation-4.51e-01

Pos logit contributions
kin+0.043
 wear+0.041
 hardly+0.040
 toes+0.040
inton+0.040
Neg logit contributions
 further-0.038
 harder-0.037
 again-0.037
 tighter-0.037
 cal-0.036
Token bark
Token ID21405
Feature activation+2.33e+00

Pos logit contributions
 further+0.013
 again+0.013
 harder+0.012
 cal+0.012
 farther+0.012
Neg logit contributions
kin-0.009
ippers-0.008
inton-0.007
ilot-0.007
apy-0.007
Token and
Token ID290
Feature activation+1.88e-01

Pos logit contributions
kin+0.048
 wear+0.045
 toes+0.044
inton+0.044
 hardly+0.044
Neg logit contributions
 further-0.055
 tighter-0.054
 harder-0.053
 worse-0.052
 cal-0.052
Token w
Token ID266
Feature activation+4.63e+00

Pos logit contributions
kin+0.004
ippers+0.004
inton+0.004
apy+0.003
ilot+0.003
Neg logit contributions
 again-0.004
 further-0.004
 harder-0.004
 tighter-0.004
 cal-0.004
Tokenag
Token ID363
Feature activation+7.19e+00

Pos logit contributions
kin+0.086
 wear+0.081
 hardly+0.081
 toes+0.079
 ago+0.078
Neg logit contributions
 cal-0.115
 further-0.115
 harder-0.114
lim-0.113
 again-0.111
Token their
Token ID511
Feature activation+2.16e+00

Pos logit contributions
 wear+0.168
kin+0.165
 hardly+0.163
 ago+0.161
 personality+0.160
Neg logit contributions
 again-0.145
 harder-0.136
 further-0.135
 worse-0.128
 tighter-0.127
Token tails
Token ID30514
Feature activation+5.31e+00

Pos logit contributions
 hardly+0.045
kin+0.044
pron+0.042
orse+0.041
 wear+0.041
Neg logit contributions
 again-0.049
 tighter-0.049
 further-0.047
 harder-0.047
 farther-0.045
Token.
Token ID13
Feature activation+8.56e-01

Pos logit contributions
kin+0.136
 hardly+0.132
apy+0.132
 wear+0.131
orse+0.130
Neg logit contributions
 again-0.105
 further-0.099
 harder-0.098
 tighter-0.091
 farther-0.088
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
kin+0.019
 wear+0.017
inton+0.016
 hardly+0.016
 toes+0.016
Neg logit contributions
 further-0.020
 harder-0.020
 again-0.020
 tighter-0.019
 aback-0.019
Token
Token ID198
Feature activation-1.80e+00

Pos logit contributions
 further+0.008
 harder+0.008
 again+0.008
 tighter+0.007
 closer+0.007
Neg logit contributions
kin-0.008
 toes-0.008
 wear-0.008
 personality-0.008
inton-0.007
Token both
Token ID1111
Feature activation+2.58e+00

Pos logit contributions
kin+0.022
ippers+0.021
inton+0.019
 importantly+0.019
 ago+0.019
Neg logit contributions
 again-0.025
 harder-0.024
 further-0.024
 tighter-0.023
 cal-0.022
Token loved
Token ID6151
Feature activation+1.08e+00

Pos logit contributions
kin+0.061
ippers+0.060
ittens+0.054
inton+0.053
 ago+0.053
Neg logit contributions
 again-0.055
 tighter-0.053
 harder-0.053
 cal-0.051
 further-0.051
Token to
Token ID284
Feature activation+9.84e-01

Pos logit contributions
kin+0.031
 wear+0.029
inton+0.029
ippers+0.028
 hardly+0.028
Neg logit contributions
 further-0.018
 again-0.017
 harder-0.017
 tighter-0.017
 cal-0.016
Token laugh
Token ID6487
Feature activation+2.74e+00

Pos logit contributions
kin+0.021
 importantly+0.019
inton+0.019
 ago+0.017
ippers+0.017
Neg logit contributions
 again-0.024
 further-0.023
 ac-0.023
 harder-0.023
 cal-0.022
Token and
Token ID290
Feature activation+2.68e-01

Pos logit contributions
kin+0.066
 wear+0.064
 hardly+0.062
 toes+0.062
inton+0.061
Neg logit contributions
 further-0.056
 harder-0.054
 tighter-0.054
 again-0.053
 cal-0.052
Token g
Token ID308
Feature activation+7.15e+00

Pos logit contributions
kin+0.006
ippers+0.005
inton+0.005
ilot+0.005
ulp+0.005
Neg logit contributions
 again-0.007
 harder-0.006
 Owl-0.006
 closer-0.006
 further-0.006
Tokeniggle
Token ID24082
Feature activation+2.24e+00

Pos logit contributions
kin+0.147
 Have+0.132
 arrive+0.131
pron+0.129
 wear+0.122
Neg logit contributions
 again-0.156
lim-0.153
 up-0.146
aut-0.139
ter-0.135
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
kin+0.054
 wear+0.051
 hardly+0.050
inton+0.049
orses+0.049
Neg logit contributions
 again-0.047
 harder-0.046
 further-0.046
 tighter-0.045
 worse-0.043
Token They
Token ID1119
Feature activation+2.57e+00

Pos logit contributions
kin+0.037
 wear+0.029
pron+0.029
inton+0.029
ippers+0.029
Neg logit contributions
 Owl-0.023
 again-0.023
 harder-0.022
 further-0.022
 aback-0.022
Token were
Token ID547
Feature activation+1.26e+00

Pos logit contributions
kin+0.058
ippers+0.056
 ago+0.051
inton+0.051
apy+0.050
Neg logit contributions
 again-0.060
 harder-0.058
 further-0.058
 tighter-0.058
 cal-0.055
Token the
Token ID262
Feature activation-7.98e-01

Pos logit contributions
kin+0.036
ippers+0.033
 ago+0.032
apy+0.031
orse+0.030
Neg logit contributions
 harder-0.023
 further-0.022
 again-0.022
 cal-0.022
 faster-0.022
Token the
Token ID262
Feature activation-4.04e-01

Pos logit contributions
 further+0.000
 harder+0.000
 again+0.000
 tighter+0.000
 cal+0.000
Neg logit contributions
kin-0.000
inton-0.000
 wear-0.000
 hardly-0.000
ippers-0.000
Token sand
Token ID6450
Feature activation+1.35e+00

Pos logit contributions
 further+0.007
 harder+0.007
 again+0.007
 tighter+0.007
 cal+0.007
Neg logit contributions
kin-0.011
 wear-0.011
 hardly-0.011
apy-0.010
inton-0.010
Token and
Token ID290
Feature activation+9.02e-01

Pos logit contributions
 personality+0.032
 wear+0.032
 hardly+0.032
kin+0.032
apy+0.031
Neg logit contributions
 again-0.028
 further-0.026
 harder-0.025
 farther-0.024
 tighter-0.024
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
kin+0.024
inton+0.022
 hardly+0.022
ippers+0.022
 wear+0.022
Neg logit contributions
 further-0.017
 harder-0.017
 again-0.016
 tighter-0.016
 cal-0.016
Token se
Token ID384
Feature activation+1.17e+00

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.004
 hardly-0.004
 wear-0.004
apy-0.004
inton-0.004
Tokenag
Token ID363
Feature activation+7.14e+00

Pos logit contributions
kin+0.019
 toes+0.018
 wear+0.018
 hardly+0.018
apy+0.017
Neg logit contributions
 further-0.032
 harder-0.032
 tighter-0.032
 again-0.031
 cal-0.031
Tokenull
Token ID724
Feature activation+3.66e-01

Pos logit contributions
 blonde+0.226
 wear+0.222
 personality+0.221
 importantly+0.219
 age+0.215
Neg logit contributions
 again-0.087
st-0.075
 further-0.073
ter-0.068
 harder-0.067
Tokens
Token ID82
Feature activation+2.52e+00

Pos logit contributions
kin+0.012
pron+0.011
 personality+0.011
 classmates+0.011
loe+0.011
Neg logit contributions
 again-0.006
 further-0.006
 harder-0.005
 closer-0.005
 farther-0.005
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
kin+0.067
 wear+0.064
 hardly+0.063
 personality+0.062
 classmates+0.061
Neg logit contributions
 again-0.050
 harder-0.048
 further-0.047
 tighter-0.043
 cal-0.043
Token They
Token ID1119
Feature activation+2.65e+00

Pos logit contributions
 harder+0.015
 again+0.015
 further+0.014
 tighter+0.014
 cal+0.014
Neg logit contributions
kin-0.011
 wear-0.010
 hardly-0.010
orses-0.010
inton-0.010
Token run
Token ID1057
Feature activation-1.53e+00

Pos logit contributions
kin+0.060
apy+0.054
 importantly+0.053
ittens+0.053
ippers+0.052
Neg logit contributions
 further-0.065
 harder-0.063
 cal-0.063
 again-0.061
 closer-0.061
Token colors
Token ID7577
Feature activation+4.49e+00

Pos logit contributions
 wear+0.033
 toes+0.033
kin+0.033
 personality+0.032
 ago+0.032
Neg logit contributions
 further-0.032
 tighter-0.032
 worse-0.032
 harder-0.031
 cal-0.030
Token and
Token ID290
Feature activation-1.10e-01

Pos logit contributions
kin+0.114
 wear+0.109
 hardly+0.106
inton+0.106
 personality+0.106
Neg logit contributions
 further-0.086
 again-0.085
 harder-0.085
 tighter-0.082
 worse-0.079
Token had
Token ID550
Feature activation-1.79e+00

Pos logit contributions
 further+0.002
 harder+0.002
 tighter+0.002
 again+0.002
 cal+0.002
Neg logit contributions
kin-0.003
 wear-0.002
 toes-0.002
 hardly-0.002
inton-0.002
Token shiny
Token ID22441
Feature activation+1.17e+00

Pos logit contributions
 further+0.033
 tighter+0.032
 harder+0.032
 again+0.031
 aback+0.031
Neg logit contributions
 hardly-0.045
kin-0.044
 wear-0.044
 toes-0.044
 personality-0.043
Token car
Token ID1097
Feature activation-1.32e+00

Pos logit contributions
 toes+0.028
 wear+0.027
 ago+0.027
 personality+0.027
kin+0.026
Neg logit contributions
 tighter-0.023
 further-0.023
 worse-0.023
 cal-0.023
minded-0.022
Tokenv
Token ID85
Feature activation+7.13e+00

Pos logit contributions
 further+0.030
 harder+0.029
 tighter+0.029
 again+0.029
 worse+0.028
Neg logit contributions
kin-0.030
 wear-0.028
 hardly-0.027
inton-0.027
 toes-0.027
Tokenings
Token ID654
Feature activation+1.57e+00

Pos logit contributions
 wear+0.161
 personality+0.156
pron+0.152
 hardly+0.152
orses+0.147
Neg logit contributions
 again-0.155
 aback-0.144
 further-0.136
 harder-0.135
ter-0.133
Token on
Token ID319
Feature activation+3.32e+00

Pos logit contributions
kin+0.036
 wear+0.034
 hardly+0.034
inton+0.033
 toes+0.033
Neg logit contributions
 further-0.034
 harder-0.033
 again-0.033
 tighter-0.033
 aback-0.032
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
kin+0.086
 wear+0.082
inton+0.080
 hardly+0.079
 toes+0.079
Neg logit contributions
 further-0.063
 harder-0.061
 again-0.060
 tighter-0.060
 cal-0.057
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 further+0.028
 again+0.028
 harder+0.028
 tighter+0.027
 worse+0.026
Neg logit contributions
kin-0.029
 wear-0.027
inton-0.027
 personality-0.027
apy-0.027
Token
Token ID198
Feature activation+8.71e-01

Pos logit contributions
kin+0.020
 wear+0.019
 hardly+0.018
 toes+0.018
inton+0.018
Neg logit contributions
 further-0.030
 harder-0.029
 tighter-0.029
 again-0.029
 cal-0.028
TokenL
Token ID43
Feature activation+1.86e-01

Pos logit contributions
 further+0.052
 cal+0.051
 tighter+0.049
 harder+0.049
 worse+0.048
Neg logit contributions
 wear-0.068
ana-0.066
kin-0.065
 hardly-0.063
 personality-0.062
Token-
Token ID12
Feature activation-2.74e+00

Pos logit contributions
kin+0.004
 wear+0.003
 hardly+0.003
 toes+0.003
inton+0.003
Neg logit contributions
 further-0.005
 harder-0.005
 tighter-0.005
 again-0.005
 cal-0.004
TokenV
Token ID53
Feature activation+2.94e+00

Pos logit contributions
 further+0.052
 cal+0.051
 tighter+0.049
 harder+0.049
 worse+0.048
Neg logit contributions
 wear-0.069
ana-0.066
kin-0.065
 hardly-0.063
 personality-0.062
Token.
Token ID13
Feature activation-2.83e-01

Pos logit contributions
kin+0.052
 hardly+0.052
 wear+0.052
 personality+0.050
 toes+0.050
Neg logit contributions
 again-0.075
 further-0.075
 harder-0.074
 tighter-0.072
 worse-0.071
Token Lil
Token ID16342
Feature activation-1.96e-01

Pos logit contributions
 again+0.006
 harder+0.005
 aback+0.005
 further+0.005
 cal+0.005
Neg logit contributions
kin-0.007
 hardly-0.007
inton-0.006
orses-0.006
 toes-0.006
Tokenv
Token ID85
Feature activation+7.10e+00

Pos logit contributions
 again+0.004
 further+0.004
 harder+0.004
 tighter+0.004
 aback+0.004
Neg logit contributions
kin-0.005
 hardly-0.004
 toes-0.004
apy-0.004
 personality-0.004
Token!
Token ID0
Feature activation-4.18e-01

Pos logit contributions
 hardly+0.126
 wear+0.121
 ago+0.120
apy+0.118
 importantly+0.117
Neg logit contributions
 again-0.188
 harder-0.176
 further-0.172
 aback-0.171
st-0.170
Token That
Token ID1320
Feature activation-1.84e+00

Pos logit contributions
 again+0.008
 harder+0.008
 aback+0.007
 cal+0.007
ray+0.007
Neg logit contributions
kin-0.011
 hardly-0.010
inton-0.010
 toes-0.010
orses-0.010
Token is
Token ID318
Feature activation+1.62e+00

Pos logit contributions
 further+0.037
 tighter+0.036
 harder+0.035
 aback+0.034
 cal+0.034
Neg logit contributions
kin-0.046
 wear-0.044
 hardly-0.042
inton-0.042
 toes-0.042
Token not
Token ID407
Feature activation+2.47e-01

Pos logit contributions
kin+0.039
 wear+0.038
 hardly+0.036
 toes+0.036
inton+0.036
Neg logit contributions
 further-0.033
 tighter-0.032
 again-0.031
 aback-0.031
 harder-0.031
Token your
Token ID534
Feature activation+2.84e+00

Pos logit contributions
kin+0.006
 wear+0.006
 toes+0.006
 hardly+0.006
inton+0.006
Neg logit contributions
 further-0.005
 harder-0.005
 tighter-0.005
 again-0.005
 cal-0.005
Token waves
Token ID9813
Feature activation+1.50e+00

Pos logit contributions
kin+0.010
 wear+0.010
 hardly+0.009
inton+0.009
 toes+0.009
Neg logit contributions
 further-0.010
 harder-0.009
 tighter-0.009
 again-0.009
 cal-0.009
Token and
Token ID290
Feature activation+3.75e-01

Pos logit contributions
kin+0.047
 hardly+0.045
 wear+0.044
apy+0.044
 personality+0.044
Neg logit contributions
 further-0.021
 again-0.021
 harder-0.020
 tighter-0.020
 farther-0.018
Token wind
Token ID2344
Feature activation+1.30e+00

Pos logit contributions
kin+0.008
ippers+0.008
inton+0.007
 hardly+0.007
ittens+0.007
Neg logit contributions
 further-0.009
 harder-0.009
 again-0.009
 tighter-0.009
 farther-0.009
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
kin+0.033
 wear+0.032
 hardly+0.031
 personality+0.031
 classmates+0.031
Neg logit contributions
 again-0.026
 harder-0.024
 further-0.023
 tighter-0.022
 worse-0.022
Token se
Token ID384
Feature activation+1.41e+00

Pos logit contributions
kin+0.009
ippers+0.008
inton+0.008
 hardly+0.008
 wear+0.008
Neg logit contributions
 further-0.009
 harder-0.009
 again-0.009
 tighter-0.008
 worse-0.008
Tokenag
Token ID363
Feature activation+7.09e+00

Pos logit contributions
kin+0.031
 wear+0.029
 hardly+0.029
 toes+0.029
 ago+0.028
Neg logit contributions
 further-0.031
 tighter-0.031
 harder-0.031
 again-0.030
 aback-0.030
Tokenull
Token ID724
Feature activation+4.74e-01

Pos logit contributions
 wear+0.183
 personality+0.173
 hardly+0.173
pron+0.171
kin+0.170
Neg logit contributions
 again-0.127
 harder-0.118
 further-0.112
 worse-0.108
st-0.107
Tokens
Token ID82
Feature activation+2.20e+00

Pos logit contributions
kin+0.014
 wear+0.013
 importantly+0.013
 personality+0.013
 ago+0.013
Neg logit contributions
 again-0.008
 harder-0.007
 further-0.007
 tighter-0.007
 farther-0.006
Token.
Token ID13
Feature activation+9.64e-01

Pos logit contributions
kin+0.057
 wear+0.055
 hardly+0.053
ippers+0.052
ittens+0.052
Neg logit contributions
 again-0.044
 harder-0.042
 further-0.041
 tighter-0.038
 cal-0.038
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
kin+0.026
 wear+0.024
ippers+0.023
 complexion+0.023
inations+0.023
Neg logit contributions
 again-0.020
 harder-0.019
 further-0.017
 tighter-0.016
 cal-0.016
Token
Token ID198
Feature activation-1.61e+00

Pos logit contributions
 again+0.004
 harder+0.004
 further+0.004
 closer+0.003
 tighter+0.003
Neg logit contributions
kin-0.006
 personality-0.006
 toes-0.006
 beard-0.006
 wear-0.006
Token she
Token ID673
Feature activation-8.23e-01

Pos logit contributions
 again+0.005
 further+0.004
 harder+0.004
 tighter+0.004
 cal+0.004
Neg logit contributions
kin-0.007
ilot-0.006
ippers-0.006
inton-0.006
orses-0.006
Token couldn
Token ID3521
Feature activation-1.26e+00

Pos logit contributions
 further+0.019
 harder+0.018
 again+0.018
 tighter+0.018
 cal+0.018
Neg logit contributions
kin-0.018
 wear-0.017
 toes-0.017
inton-0.016
ippers-0.016
Token't
Token ID470
Feature activation-8.00e-01

Pos logit contributions
 further+0.037
 tighter+0.036
 harder+0.036
 cal+0.035
 aback+0.035
Neg logit contributions
kin-0.019
 wear-0.019
 hardly-0.018
inton-0.017
apy-0.017
Token help
Token ID1037
Feature activation+1.30e+00

Pos logit contributions
 again+0.020
 further+0.020
 harder+0.019
 worse+0.018
 closer+0.018
Neg logit contributions
kin-0.017
ippers-0.016
apy-0.016
inton-0.015
ilot-0.015
Token but
Token ID475
Feature activation+1.47e-01

Pos logit contributions
kin+0.030
 wear+0.030
 hardly+0.029
 toes+0.029
inton+0.028
Neg logit contributions
 further-0.027
 tighter-0.027
 harder-0.026
 aback-0.026
 cal-0.026
Token g
Token ID308
Feature activation+7.09e+00

Pos logit contributions
kin+0.003
ippers+0.003
orses+0.002
ilot+0.002
inton+0.002
Neg logit contributions
 again-0.004
 further-0.004
 harder-0.004
 tighter-0.004
 closer-0.004
Tokeniggle
Token ID24082
Feature activation+1.50e+00

Pos logit contributions
kin+0.135
 wear+0.123
ilot+0.119
 arrive+0.119
 Have+0.116
Neg logit contributions
 again-0.177
lim-0.164
 harder-0.160
 further-0.154
aut-0.153
Token with
Token ID351
Feature activation+2.47e+00

Pos logit contributions
kin+0.037
 wear+0.035
 hardly+0.034
inton+0.034
 toes+0.034
Neg logit contributions
 further-0.030
 harder-0.029
 again-0.029
 tighter-0.029
 cal-0.028
Token joy
Token ID8716
Feature activation+4.75e+00

Pos logit contributions
kin+0.056
ilot+0.052
orses+0.051
orse+0.051
ippers+0.050
Neg logit contributions
 further-0.059
 harder-0.058
 aback-0.057
 cal-0.057
 tighter-0.057
Token.
Token ID13
Feature activation+9.08e-01

Pos logit contributions
kin+0.127
 wear+0.118
orses+0.118
orse+0.118
apy+0.117
Neg logit contributions
 again-0.095
 harder-0.086
 further-0.084
 tighter-0.082
 worse-0.079
Token
Token ID198
Feature activation+7.45e-01

Pos logit contributions
kin+0.022
 wear+0.020
inton+0.019
ippers+0.019
 toes+0.018
Neg logit contributions
 again-0.020
 further-0.020
 harder-0.020
 aback-0.019
 tighter-0.019
Token wearing
Token ID5762
Feature activation+6.79e-01

Pos logit contributions
kin+0.009
ilot+0.009
 personality+0.009
inton+0.009
ana+0.009
Neg logit contributions
 again-0.015
 harder-0.013
 further-0.013
 worse-0.013
 Owl-0.012
Token his
Token ID465
Feature activation+2.14e+00

Pos logit contributions
kin+0.017
 ago+0.017
ilot+0.016
inton+0.016
orses+0.016
Neg logit contributions
 again-0.014
 harder-0.014
 further-0.014
 tighter-0.013
 cal-0.013
Token bath
Token ID7837
Feature activation+1.99e+00

Pos logit contributions
 ago+0.051
asts+0.049
orses+0.048
ilot+0.047
orse+0.047
Neg logit contributions
 again-0.043
 tighter-0.042
 harder-0.042
 further-0.042
 worse-0.041
Tokenrobe
Token ID25481
Feature activation-1.26e+00

Pos logit contributions
 wear+0.033
 hardly+0.033
orses+0.032
 ago+0.032
kin+0.032
Neg logit contributions
 again-0.054
 further-0.052
 harder-0.051
 tighter-0.050
 worse-0.049
Token and
Token ID290
Feature activation+5.54e-01

Pos logit contributions
 further+0.029
 tighter+0.028
 harder+0.028
 cal+0.028
 aback+0.027
Neg logit contributions
kin-0.027
 wear-0.026
 hardly-0.025
 toes-0.025
inton-0.024
Token g
Token ID308
Feature activation+7.08e+00

Pos logit contributions
kin+0.013
ilot+0.012
ippers+0.012
inton+0.012
 ago+0.012
Neg logit contributions
 again-0.013
 harder-0.012
 further-0.012
 tighter-0.012
 worse-0.012
Tokeniggle
Token ID24082
Feature activation+2.32e+00

Pos logit contributions
 arrive+0.132
kin+0.124
 Have+0.123
 tow+0.119
ilot+0.117
Neg logit contributions
 again-0.162
lim-0.151
aut-0.142
 harder-0.140
 up-0.140
Token with
Token ID351
Feature activation+2.86e+00

Pos logit contributions
kin+0.058
 ago+0.056
 wear+0.055
orses+0.054
 hardly+0.054
Neg logit contributions
 again-0.047
 harder-0.045
 further-0.044
 tighter-0.043
 worse-0.042
Token joy
Token ID8716
Feature activation+4.93e+00

Pos logit contributions
kin+0.076
 ago+0.073
ilot+0.072
orses+0.067
ippers+0.067
Neg logit contributions
 harder-0.057
 again-0.056
 further-0.055
 tighter-0.055
 cal-0.055
Token.
Token ID13
Feature activation+1.79e+00

Pos logit contributions
kin+0.135
 wear+0.127
apy+0.127
 hardly+0.126
 ago+0.125
Neg logit contributions
 again-0.099
 harder-0.083
 tighter-0.078
 further-0.077
 worse-0.076
Token
Token ID220
Feature activation+3.01e+00

Pos logit contributions
kin+0.048
inton+0.042
 toes+0.041
ippers+0.041
 wear+0.041
Neg logit contributions
 again-0.038
 harder-0.035
 further-0.034
 aback-0.032
 tighter-0.032
Token elephants
Token ID25368
Feature activation+3.72e+00

Pos logit contributions
kin+0.031
inton+0.029
 wear+0.028
 hardly+0.028
ippers+0.028
Neg logit contributions
 further-0.023
 harder-0.022
 again-0.022
 tighter-0.022
 cal-0.021
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
kin+0.103
 hardly+0.094
 personality+0.094
inton+0.094
apy+0.093
Neg logit contributions
 again-0.070
 further-0.068
 harder-0.066
 cal-0.061
 farther-0.060
Token lions
Token ID31420
Feature activation+4.17e+00

Pos logit contributions
 wear+0.039
 toes+0.039
kin+0.038
 ago+0.037
 personality+0.036
Neg logit contributions
 further-0.030
 aback-0.030
 tighter-0.029
 worse-0.029
lim-0.028
Token and
Token ID290
Feature activation-4.66e-02

Pos logit contributions
kin+0.106
 hardly+0.100
 wear+0.100
 personality+0.099
apy+0.098
Neg logit contributions
 again-0.082
 further-0.081
 harder-0.079
 cal-0.074
 tighter-0.073
Token ac
Token ID936
Feature activation-1.01e-01

Pos logit contributions
 aback+0.001
 further+0.001
 tighter+0.001
 harder+0.001
lim+0.001
Neg logit contributions
 wear-0.001
 toes-0.001
kin-0.001
 personality-0.001
 ago-0.001
Tokenrob
Token ID22609
Feature activation+7.07e+00

Pos logit contributions
 further+0.002
 worse+0.002
 tighter+0.002
 harder+0.002
 again+0.002
Neg logit contributions
kin-0.003
 wear-0.003
ippers-0.003
orses-0.003
inton-0.003
Tokenats
Token ID1381
Feature activation+3.77e+00

Pos logit contributions
kin+0.166
pron+0.149
 arrive+0.149
 hardly+0.149
apy+0.147
Neg logit contributions
lim-0.130
 harder-0.129
 tighter-0.127
st-0.126
 again-0.125
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
kin+0.099
 personality+0.089
 wear+0.089
apy+0.088
inton+0.087
Neg logit contributions
 again-0.076
 further-0.074
 harder-0.072
 tighter-0.068
 cal-0.067
Token They
Token ID1119
Feature activation+2.65e+00

Pos logit contributions
 further+0.004
 harder+0.004
 again+0.004
 tighter+0.004
 cal+0.004
Neg logit contributions
kin-0.002
 wear-0.002
inton-0.002
 hardly-0.002
 toes-0.002
Token are
Token ID389
Feature activation+1.08e+00

Pos logit contributions
kin+0.058
apy+0.050
ippers+0.050
ittens+0.049
pron+0.048
Neg logit contributions
 further-0.067
 cal-0.065
 harder-0.064
 again-0.064
 closer-0.063
Token amazed
Token ID24562
Feature activation+1.05e+00

Pos logit contributions
kin+0.030
ippers+0.027
 personality+0.027
 wear+0.027
apy+0.027
Neg logit contributions
 further-0.020
 harder-0.020
 bigger-0.019
 faster-0.019
 again-0.019
Token mixed
Token ID7668
Feature activation-1.00e+00

Pos logit contributions
 further+0.019
 tighter+0.018
 harder+0.018
 aback+0.017
minded+0.017
Neg logit contributions
 wear-0.015
 hardly-0.015
kin-0.014
 have-0.014
 imagination-0.014
Token its
Token ID663
Feature activation-1.14e+00

Pos logit contributions
 Owl+0.021
minded+0.021
 cal+0.021
 nearest+0.021
lim+0.020
Neg logit contributions
kin-0.022
 toes-0.021
 wear-0.020
apy-0.019
 time-0.019
Token own
Token ID898
Feature activation-1.38e-01

Pos logit contributions
 further+0.025
 harder+0.024
 tighter+0.024
 aback+0.024
 cal+0.023
Neg logit contributions
kin-0.026
 wear-0.025
 toes-0.024
 personality-0.024
 hardly-0.023
Token colors
Token ID7577
Feature activation+3.08e+00

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.003
 wear-0.003
 hardly-0.003
 toes-0.003
inton-0.003
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
kin+0.081
 wear+0.077
 toes+0.075
 hardly+0.074
inton+0.074
Neg logit contributions
 further-0.057
 tighter-0.055
 harder-0.055
 cal-0.053
 again-0.053
Token scenes
Token ID8188
Feature activation+7.06e+00

Pos logit contributions
kin+0.038
 wear+0.037
inton+0.036
 hardly+0.036
 toes+0.035
Neg logit contributions
 further-0.028
 harder-0.027
 again-0.027
 tighter-0.027
 worse-0.026
Token and
Token ID290
Feature activation+7.54e-01

Pos logit contributions
kin+0.194
inton+0.190
ippers+0.188
 personality+0.187
orses+0.186
Neg logit contributions
 again-0.129
 further-0.119
 harder-0.115
 aback-0.104
 cal-0.103
Token music
Token ID2647
Feature activation+2.98e+00

Pos logit contributions
kin+0.018
 wear+0.017
inton+0.017
 hardly+0.017
 toes+0.016
Neg logit contributions
 further-0.016
 harder-0.015
 tighter-0.015
 again-0.015
 cal-0.015
Token,
Token ID11
Feature activation+1.53e+00

Pos logit contributions
kin+0.073
 wear+0.070
 hardly+0.069
inton+0.069
ippers+0.068
Neg logit contributions
 again-0.061
 further-0.060
 harder-0.059
 tighter-0.056
 cal-0.056
Token all
Token ID477
Feature activation+1.65e+00

Pos logit contributions
kin+0.040
 wear+0.039
 hardly+0.038
 toes+0.038
inton+0.038
Neg logit contributions
 further-0.028
 harder-0.027
 tighter-0.027
 again-0.026
 cal-0.026
Token by
Token ID416
Feature activation+2.91e+00

Pos logit contributions
kin+0.047
ippers+0.046
inton+0.045
ittens+0.044
 hardly+0.044
Neg logit contributions
 again-0.029
 further-0.029
 harder-0.027
 cal-0.027
 aback-0.026
Token each
Token ID1123
Feature activation+1.37e+00

Pos logit contributions
kin+0.032
 wear+0.030
ippers+0.029
inton+0.029
 hardly+0.029
Neg logit contributions
 further-0.031
 harder-0.031
 again-0.031
 tighter-0.030
 cal-0.029
Token other
Token ID584
Feature activation+2.47e-02

Pos logit contributions
kin+0.040
apy+0.038
ippers+0.037
orse+0.037
 ago+0.037
Neg logit contributions
 harder-0.025
 again-0.025
 further-0.024
 tighter-0.023
 worse-0.022
Token.
Token ID13
Feature activation+2.11e-01

Pos logit contributions
kin+0.001
 hardly+0.001
 wear+0.001
apy+0.001
inton+0.001
Neg logit contributions
 again-0.001
 harder-0.001
 further-0.001
 tighter-0.001
 cal-0.001
Token They
Token ID1119
Feature activation+2.12e+00

Pos logit contributions
kin+0.006
pron+0.005
inton+0.005
 wear+0.005
inations+0.005
Neg logit contributions
 again-0.004
 harder-0.004
 cal-0.004
 further-0.004
 Owl-0.004
Token w
Token ID266
Feature activation+4.79e+00

Pos logit contributions
kin+0.041
apy+0.039
ippers+0.038
 ago+0.037
 importantly+0.036
Neg logit contributions
 cal-0.056
 again-0.055
 further-0.055
 harder-0.055
 tighter-0.053
Tokenag
Token ID363
Feature activation+7.06e+00

Pos logit contributions
kin+0.090
 ago+0.082
 hardly+0.082
 wear+0.081
 toes+0.081
Neg logit contributions
 cal-0.120
 harder-0.119
 further-0.117
 again-0.114
lim-0.114
Token their
Token ID511
Feature activation+1.64e+00

Pos logit contributions
 wear+0.154
kin+0.151
 hardly+0.151
pron+0.148
ippers+0.146
Neg logit contributions
 again-0.151
 harder-0.146
 further-0.141
 tighter-0.137
 worse-0.136
Token tails
Token ID30514
Feature activation+4.71e+00

Pos logit contributions
 hardly+0.034
kin+0.033
pron+0.033
asts+0.032
orse+0.032
Neg logit contributions
 tighter-0.037
 again-0.036
 harder-0.035
 berries-0.034
 further-0.033
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
kin+0.138
 hardly+0.136
 wear+0.133
apy+0.133
inton+0.133
Neg logit contributions
 again-0.076
 harder-0.072
 further-0.069
 tighter-0.066
 cal-0.062
Token lick
Token ID36728
Feature activation+4.17e+00

Pos logit contributions
kin+0.031
inton+0.029
ippers+0.029
 ago+0.028
 hardly+0.028
Neg logit contributions
 tighter-0.028
 harder-0.028
 again-0.028
 cal-0.027
 further-0.027
Token each
Token ID1123
Feature activation-7.80e-01

Pos logit contributions
kin+0.063
 wear+0.058
 hardly+0.056
inton+0.056
 toes+0.055
Neg logit contributions
 further-0.123
 harder-0.122
 tighter-0.121
 again-0.121
 cal-0.118
Token noises
Token ID26782
Feature activation+2.30e+00

Pos logit contributions
kin+0.013
 toes+0.013
 wear+0.013
inton+0.012
 hardly+0.012
Neg logit contributions
 aback-0.010
 further-0.010
 tighter-0.010
lim-0.010
 Owl-0.010
Token like
Token ID588
Feature activation+1.07e+00

Pos logit contributions
kin+0.066
 wear+0.063
 hardly+0.062
 personality+0.062
inton+0.061
Neg logit contributions
 further-0.039
 again-0.038
 harder-0.038
 tighter-0.036
 cal-0.035
Token waves
Token ID9813
Feature activation+2.36e+00

Pos logit contributions
kin+0.028
 wear+0.026
 hardly+0.026
inton+0.026
 toes+0.025
Neg logit contributions
 further-0.021
 harder-0.020
 again-0.020
 tighter-0.020
 cal-0.019
Token and
Token ID290
Feature activation+2.93e-01

Pos logit contributions
kin+0.071
 hardly+0.070
 wear+0.068
 personality+0.067
apy+0.067
Neg logit contributions
 again-0.037
 harder-0.035
 further-0.035
 tighter-0.033
 worse-0.031
Token se
Token ID384
Feature activation+1.15e+00

Pos logit contributions
kin+0.006
 wear+0.006
inton+0.006
 hardly+0.006
ippers+0.006
Neg logit contributions
 further-0.007
 harder-0.007
 again-0.007
 tighter-0.007
 cal-0.006
Tokenag
Token ID363
Feature activation+7.05e+00

Pos logit contributions
kin+0.024
 wear+0.023
 hardly+0.022
inton+0.022
 toes+0.022
Neg logit contributions
 further-0.028
 harder-0.027
 tighter-0.027
 again-0.027
 cal-0.026
Tokenull
Token ID724
Feature activation-2.54e-01

Pos logit contributions
 wear+0.174
kin+0.167
 personality+0.166
pron+0.166
 hardly+0.165
Neg logit contributions
 again-0.137
 harder-0.131
 further-0.123
 worse-0.119
 tighter-0.115
Tokens
Token ID82
Feature activation+2.01e+00

Pos logit contributions
 again+0.004
 harder+0.004
 further+0.004
 tighter+0.004
 cal+0.004
Neg logit contributions
kin-0.007
 wear-0.007
 hardly-0.007
 personality-0.006
 ago-0.006
Token.
Token ID13
Feature activation+4.00e-01

Pos logit contributions
kin+0.048
 wear+0.046
 hardly+0.045
 personality+0.044
inton+0.044
Neg logit contributions
 again-0.043
 harder-0.042
 further-0.041
 tighter-0.040
 cal-0.039
Token They
Token ID1119
Feature activation+2.27e+00

Pos logit contributions
kin+0.010
 wear+0.009
inton+0.009
 hardly+0.009
lf+0.009
Neg logit contributions
 harder-0.009
 again-0.009
 further-0.008
 cal-0.008
 tighter-0.007
Token have
Token ID423
Feature activation-1.86e+00

Pos logit contributions
ittens+0.047
kin+0.047
apy+0.045
 importantly+0.045
ippers+0.044
Neg logit contributions
 harder-0.056
 cal-0.056
 further-0.055
 again-0.055
 farther-0.053
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
kin+0.005
 wear+0.005
 personality+0.005
 hardly+0.005
inton+0.005
Neg logit contributions
 again-0.005
 harder-0.005
 further-0.005
 tighter-0.005
 worse-0.004
Token He
Token ID679
Feature activation-7.15e-01

Pos logit contributions
kin+0.004
 wear+0.004
 toes+0.004
inton+0.004
 hardly+0.004
Neg logit contributions
 further-0.006
 harder-0.006
 again-0.006
 tighter-0.006
 aback-0.005
Token pulled
Token ID5954
Feature activation-7.97e-01

Pos logit contributions
 further+0.020
 worse+0.019
minded+0.019
 harder+0.018
 cal+0.018
Neg logit contributions
kin-0.013
 hardly-0.012
 wear-0.012
 toes-0.011
apy-0.011
Token the
Token ID262
Feature activation-2.06e+00

Pos logit contributions
minded+0.021
lim+0.020
 nearest+0.019
 cal+0.019
ocular+0.018
Neg logit contributions
 wear-0.014
 toes-0.014
kin-0.013
 have-0.012
apy-0.012
Token c
Token ID269
Feature activation-4.30e-01

Pos logit contributions
 further+0.044
minded+0.044
lim+0.043
 tighter+0.043
 cal+0.042
Neg logit contributions
kin-0.050
 wear-0.047
 toes-0.043
ippers-0.042
 hardly-0.042
Tokenork
Token ID967
Feature activation-7.32e+00

Pos logit contributions
 further+0.012
 harder+0.011
 tighter+0.011
 cal+0.011
 aback+0.011
Neg logit contributions
kin-0.009
ippers-0.008
 wear-0.008
ittens-0.007
ana-0.007
Token and
Token ID290
Feature activation-6.86e-01

Pos logit contributions
 ang+0.155
 cal+0.154
ican+0.148
minded+0.148
ter+0.145
Neg logit contributions
 wear-0.152
ilot-0.149
kin-0.146
 ago-0.145
 toes-0.144
Token a
Token ID257
Feature activation+1.37e+00

Pos logit contributions
minded+0.017
lim+0.017
 further+0.017
 aback+0.017
 cal+0.017
Neg logit contributions
kin-0.013
 wear-0.013
apy-0.012
 toes-0.012
 hardly-0.012
Token puff
Token ID33679
Feature activation+1.21e+00

Pos logit contributions
kin+0.032
inton+0.031
 wear+0.031
 hardly+0.030
 toes+0.030
Neg logit contributions
 further-0.029
 harder-0.028
 tighter-0.028
 again-0.027
 worse-0.026
Token of
Token ID286
Feature activation+5.62e-01

Pos logit contributions
kin+0.020
 wear+0.019
 toes+0.018
 hardly+0.018
inton+0.018
Neg logit contributions
 further-0.034
 tighter-0.033
 harder-0.033
 again-0.032
 cal-0.032
Token smoke
Token ID7523
Feature activation+9.09e-01

Pos logit contributions
 wear+0.013
kin+0.013
 toes+0.013
apy+0.012
 beard+0.012
Neg logit contributions
 further-0.012
minded-0.012
lim-0.011
 tighter-0.011
 again-0.011
Token She
Token ID1375
Feature activation-2.90e-01

Pos logit contributions
 aback+0.003
 again+0.003
 further+0.003
 harder+0.003
 cal+0.003
Neg logit contributions
kin-0.004
 wear-0.004
inton-0.004
ippers-0.004
orse-0.003
Token was
Token ID373
Feature activation+4.29e-01

Pos logit contributions
 further+0.007
 harder+0.007
 worse+0.007
 tighter+0.007
 aback+0.007
Neg logit contributions
 wear-0.006
 hardly-0.006
kin-0.006
orse-0.005
 toes-0.005
Token able
Token ID1498
Feature activation-2.27e+00

Pos logit contributions
kin+0.013
 wear+0.012
 hardly+0.012
 toes+0.012
inton+0.012
Neg logit contributions
 further-0.007
 harder-0.006
 tighter-0.006
 again-0.006
 cal-0.006
Token to
Token ID284
Feature activation-8.43e-01

Pos logit contributions
 further+0.035
 harder+0.034
 tighter+0.034
 again+0.033
 cal+0.032
Neg logit contributions
kin-0.065
 wear-0.063
 hardly-0.062
 toes-0.062
inton-0.061
Token scoop
Token ID23529
Feature activation-1.28e+00

Pos logit contributions
 further+0.020
 harder+0.019
 tighter+0.019
 again+0.019
 cal+0.019
Neg logit contributions
kin-0.018
 wear-0.017
 hardly-0.016
inton-0.016
 toes-0.016
Token it
Token ID340
Feature activation-6.95e+00

Pos logit contributions
lim+0.024
 Owl+0.024
 worse+0.024
 nearest+0.023
 cal+0.023
Neg logit contributions
kin-0.029
 toes-0.028
 wear-0.027
apy-0.027
inton-0.026
Token up
Token ID510
Feature activation-2.39e+00

Pos logit contributions
minded+0.122
 Owl+0.114
 cal+0.112
lim+0.110
 worse+0.104
Neg logit contributions
 wear-0.183
kin-0.178
 hardly-0.172
 toes-0.170
inton-0.170
Token with
Token ID351
Feature activation+7.39e-01

Pos logit contributions
lim+0.051
minded+0.051
 worse+0.050
ray+0.049
 cal+0.049
Neg logit contributions
 wear-0.050
kin-0.050
 toes-0.049
 hardly-0.046
inton-0.045
Token her
Token ID607
Feature activation+1.38e+00

Pos logit contributions
kin+0.018
 wear+0.017
inton+0.016
 hardly+0.016
 toes+0.016
Neg logit contributions
 further-0.015
 harder-0.015
 tighter-0.015
 again-0.015
 cal-0.014
Token net
Token ID2010
Feature activation-1.59e+00

Pos logit contributions
apy+0.034
kin+0.033
loe+0.032
inton+0.032
 personality+0.032
Neg logit contributions
 again-0.027
 tighter-0.026
 further-0.025
 harder-0.025
 closer-0.024
Token and
Token ID290
Feature activation+4.42e-01

Pos logit contributions
 further+0.029
 harder+0.028
 tighter+0.028
 again+0.028
 cal+0.027
Neg logit contributions
kin-0.042
 wear-0.040
inton-0.039
 hardly-0.039
 toes-0.039
Token the
Token ID262
Feature activation-1.74e+00

Pos logit contributions
kin+0.002
orses+0.002
orse+0.002
inton+0.002
ippers+0.002
Neg logit contributions
 further-0.001
 again-0.001
 harder-0.001
 aback-0.001
 tighter-0.001
Token sky
Token ID6766
Feature activation+1.78e+00

Pos logit contributions
 further+0.038
 tighter+0.038
 harder+0.038
 aback+0.037
 again+0.037
Neg logit contributions
kin-0.038
 wear-0.036
 toes-0.035
 personality-0.035
 ago-0.034
Token before
Token ID878
Feature activation-5.78e-01

Pos logit contributions
kin+0.044
 wear+0.042
 hardly+0.042
inton+0.042
ippers+0.041
Neg logit contributions
 again-0.035
 further-0.035
 harder-0.034
 tighter-0.033
 cal-0.033
Token they
Token ID484
Feature activation+6.77e-01

Pos logit contributions
 further+0.012
 harder+0.011
 again+0.011
 tighter+0.011
 cal+0.011
Neg logit contributions
kin-0.014
 wear-0.014
inton-0.013
 toes-0.013
 hardly-0.013
Token walked
Token ID6807
Feature activation-2.06e+00

Pos logit contributions
kin+0.014
inton+0.013
 toes+0.013
ippers+0.013
 personality+0.013
Neg logit contributions
 further-0.016
 again-0.016
 harder-0.016
 tighter-0.015
 cal-0.015
Token fore
Token ID1674
Feature activation-6.94e+00

Pos logit contributions
 further+0.030
 tighter+0.029
 harder+0.029
 cal+0.029
 worse+0.029
Neg logit contributions
kin-0.060
 wear-0.059
 toes-0.057
 hardly-0.057
inton-0.056
Tokenshadow
Token ID19106
Feature activation-2.37e+00

Pos logit contributions
 berries+0.131
 iv+0.125
 cal+0.124
 Owl+0.120
 aback+0.119
Neg logit contributions
 wear-0.156
 ago-0.152
inton-0.150
 hardly-0.149
orses-0.148
Tokened
Token ID276
Feature activation-2.56e+00

Pos logit contributions
 further+0.038
 cal+0.037
 tighter+0.036
 worse+0.036
 harder+0.036
Neg logit contributions
kin-0.063
 toes-0.060
inton-0.060
 ago-0.059
ilot-0.059
Token the
Token ID262
Feature activation-1.40e+00

Pos logit contributions
 further+0.045
 harder+0.044
 again+0.044
 tighter+0.043
 cal+0.041
Neg logit contributions
kin-0.069
 wear-0.066
 hardly-0.065
inton-0.065
 toes-0.064
Token down
Token ID866
Feature activation-2.18e+00

Pos logit contributions
 further+0.030
 aback+0.030
 tighter+0.030
 harder+0.029
 again+0.028
Neg logit contributions
kin-0.032
 wear-0.030
 ago-0.029
 personality-0.029
 toes-0.029
Tokenpour
Token ID48681
Feature activation-2.13e+00

Pos logit contributions
 further+0.042
 again+0.041
 harder+0.041
 tighter+0.040
 cal+0.039
Neg logit contributions
kin-0.055
 wear-0.053
inton-0.052
 hardly-0.052
 toes-0.052
Token face
Token ID1986
Feature activation-2.53e-01

Pos logit contributions
 further+0.020
 harder+0.019
 tighter+0.019
 again+0.019
 cal+0.019
Neg logit contributions
kin-0.016
 wear-0.015
 toes-0.014
 hardly-0.014
inton-0.014
Token twist
Token ID14528
Feature activation+5.46e-01

Pos logit contributions
 further+0.006
 harder+0.005
 tighter+0.005
 again+0.005
 worse+0.005
Neg logit contributions
kin-0.006
 wear-0.005
 hardly-0.005
inton-0.005
 toes-0.005
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
kin+0.012
 toes+0.011
 wear+0.011
inton+0.011
apy+0.011
Neg logit contributions
 further-0.012
 cal-0.012
lim-0.012
 aback-0.012
 tighter-0.012
Token She
Token ID1375
Feature activation-7.35e-02

Pos logit contributions
kin+0.005
 wear+0.005
 toes+0.005
inton+0.005
 personality+0.005
Neg logit contributions
 harder-0.005
 again-0.005
 further-0.005
 aback-0.004
 tighter-0.004
Token spat
Token ID15246
Feature activation+1.69e+00

Pos logit contributions
 further+0.002
 harder+0.002
 tighter+0.002
 again+0.002
 aback+0.002
Neg logit contributions
kin-0.002
 wear-0.002
 hardly-0.001
 toes-0.001
inton-0.001
Token it
Token ID340
Feature activation-6.90e+00

Pos logit contributions
kin+0.041
 wear+0.040
 hardly+0.038
 toes+0.038
inton+0.038
Neg logit contributions
 further-0.034
 tighter-0.033
 harder-0.032
 cal-0.032
 aback-0.032
Token out
Token ID503
Feature activation-5.62e-01

Pos logit contributions
minded+0.148
lim+0.143
 further+0.140
 Owl+0.137
 cal+0.133
Neg logit contributions
 wear-0.158
kin-0.150
apy-0.148
 hardly-0.145
 toes-0.141
Token and
Token ID290
Feature activation-3.91e-01

Pos logit contributions
 further+0.012
lim+0.012
 aback+0.012
minded+0.012
 tighter+0.011
Neg logit contributions
kin-0.012
 toes-0.012
 wear-0.012
apy-0.012
 hardly-0.011
Token felt
Token ID2936
Feature activation-9.32e-02

Pos logit contributions
 further+0.009
 harder+0.008
 tighter+0.008
 again+0.008
 aback+0.008
Neg logit contributions
kin-0.009
 wear-0.009
 toes-0.008
 hardly-0.008
inton-0.008
Token a
Token ID257
Feature activation+1.58e+00

Pos logit contributions
 further+0.002
 harder+0.002
 tighter+0.002
 again+0.002
 worse+0.002
Neg logit contributions
kin-0.002
 wear-0.002
inton-0.002
 hardly-0.002
 toes-0.002
Token pain
Token ID2356
Feature activation+1.98e-01

Pos logit contributions
kin+0.036
 wear+0.035
 ago+0.035
inton+0.035
ippers+0.034
Neg logit contributions
 harder-0.033
 further-0.033
 tighter-0.033
 again-0.032
 worse-0.032
Token.
Token ID13
Feature activation+8.78e-01

Pos logit contributions
 further+0.006
 tighter+0.006
 aback+0.006
 cal+0.006
lim+0.006
Neg logit contributions
kin-0.004
 wear-0.004
 toes-0.004
inton-0.004
 ago-0.004
Token She
Token ID1375
Feature activation+6.31e-01

Pos logit contributions
kin+0.020
 wear+0.018
 hardly+0.018
inton+0.018
orses+0.017
Neg logit contributions
 again-0.020
 harder-0.020
 further-0.019
 tighter-0.019
 aback-0.019
Token tried
Token ID3088
Feature activation-7.39e-01

Pos logit contributions
kin+0.013
 wear+0.012
inton+0.012
ippers+0.012
 toes+0.012
Neg logit contributions
 harder-0.014
 tighter-0.014
 further-0.014
 again-0.014
 worse-0.014
Token to
Token ID284
Feature activation-1.08e+00

Pos logit contributions
lim+0.014
 further+0.014
 aback+0.014
 cal+0.014
minded+0.013
Neg logit contributions
 wear-0.018
 toes-0.017
kin-0.017
 hardly-0.017
 personality-0.017
Token spit
Token ID27591
Feature activation+5.20e-01

Pos logit contributions
 further+0.028
 harder+0.027
 tighter+0.027
 again+0.027
 cal+0.026
Neg logit contributions
kin-0.021
 wear-0.019
 toes-0.019
 hardly-0.019
inton-0.019
Token it
Token ID340
Feature activation-6.87e+00

Pos logit contributions
 wear+0.012
kin+0.012
 hardly+0.011
 toes+0.010
apy+0.010
Neg logit contributions
 further-0.011
 tighter-0.011
lim-0.011
minded-0.011
 cal-0.011
Token out
Token ID503
Feature activation-5.81e-01

Pos logit contributions
lim+0.143
minded+0.143
 further+0.134
 Owl+0.132
 nearest+0.131
Neg logit contributions
 wear-0.165
kin-0.150
 hardly-0.142
apy-0.141
 toes-0.139
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 further+0.014
 tighter+0.013
 harder+0.013
 again+0.013
 cal+0.013
Neg logit contributions
kin-0.012
 wear-0.012
 toes-0.011
 hardly-0.011
inton-0.011
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
kin+0.030
 wear+0.027
inton+0.027
 hardly+0.027
orses+0.026
Neg logit contributions
 again-0.037
 harder-0.037
 further-0.036
 tighter-0.035
 aback-0.035
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
kin+0.005
 toes+0.005
 personality+0.005
inton+0.005
 wear+0.005
Neg logit contributions
 again-0.004
 harder-0.004
 further-0.004
 tighter-0.004
 aback-0.004
Token"
Token ID1
Feature activation-4.99e+00

Pos logit contributions
 harder+0.019
 tighter+0.018
 again+0.018
 further+0.017
 worse+0.016
Neg logit contributions
kin-0.032
 hardly-0.031
ilot-0.031
orses-0.031
 classmates-0.031
Token like
Token ID588
Feature activation+1.95e+00

Pos logit contributions
 further+0.039
 harder+0.039
 tighter+0.038
 again+0.038
 cal+0.037
Neg logit contributions
kin-0.027
 wear-0.025
 toes-0.024
 hardly-0.024
inton-0.024
Token it
Token ID340
Feature activation-2.21e+00

Pos logit contributions
kin+0.051
 wear+0.049
inton+0.048
 hardly+0.047
ippers+0.047
Neg logit contributions
 further-0.036
 harder-0.035
 again-0.035
 tighter-0.035
 worse-0.034
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
 again+0.048
 harder+0.048
 further+0.048
 tighter+0.047
 worse+0.047
Neg logit contributions
kin-0.051
 wear-0.048
 personality-0.047
 hardly-0.047
 toes-0.047
Token He
Token ID679
Feature activation-2.55e-01

Pos logit contributions
kin+0.009
 wear+0.009
 toes+0.009
inton+0.009
 personality+0.008
Neg logit contributions
 harder-0.009
 again-0.009
 further-0.009
 aback-0.009
 worse-0.009
Token spat
Token ID15246
Feature activation+1.82e+00

Pos logit contributions
 further+0.006
 aback+0.006
 harder+0.006
 tighter+0.006
 again+0.006
Neg logit contributions
kin-0.005
 wear-0.005
 hardly-0.005
 toes-0.005
inton-0.005
Token it
Token ID340
Feature activation-6.86e+00

Pos logit contributions
kin+0.047
 wear+0.045
 hardly+0.044
 toes+0.044
inton+0.043
Neg logit contributions
 further-0.034
 tighter-0.033
 harder-0.032
 cal-0.032
 aback-0.032
Token out
Token ID503
Feature activation-4.61e-01

Pos logit contributions
minded+0.139
 further+0.135
lim+0.131
 cal+0.130
 Owl+0.125
Neg logit contributions
 wear-0.163
kin-0.152
apy-0.150
 hardly-0.146
inton-0.143
Token and
Token ID290
Feature activation-5.17e-01

Pos logit contributions
 further+0.010
 tighter+0.010
 aback+0.010
 harder+0.010
lim+0.010
Neg logit contributions
kin-0.010
 wear-0.010
 toes-0.009
 hardly-0.009
inton-0.009
Token threw
Token ID9617
Feature activation-1.10e+00

Pos logit contributions
 further+0.012
lim+0.011
 aback+0.011
minded+0.011
 tighter+0.011
Neg logit contributions
 wear-0.011
kin-0.011
apy-0.011
 toes-0.011
 hardly-0.011
Token it
Token ID340
Feature activation-4.74e+00

Pos logit contributions
minded+0.030
lim+0.029
 Owl+0.028
 further+0.028
 cal+0.028
Neg logit contributions
 toes-0.018
kin-0.017
 wear-0.016
apy-0.016
 imagination-0.014
Token away
Token ID1497
Feature activation-5.05e-01

Pos logit contributions
 further+0.097
minded+0.096
 cal+0.096
 aback+0.095
 Owl+0.093
Neg logit contributions
kin-0.103
 wear-0.102
 toes-0.101
 hardly-0.094
inton-0.094
Token mouth
Token ID5422
Feature activation-1.44e+00

Pos logit contributions
kin+0.023
 wear+0.022
 hardly+0.022
apy+0.021
inton+0.021
Neg logit contributions
 tighter-0.032
 further-0.032
 harder-0.032
 again-0.032
 aback-0.030
Token,
Token ID11
Feature activation+4.57e-01

Pos logit contributions
 further+0.034
lim+0.033
 cal+0.033
minded+0.033
 tighter+0.033
Neg logit contributions
kin-0.029
 wear-0.027
 toes-0.027
inton-0.025
 hardly-0.025
Token but
Token ID475
Feature activation+7.46e-01

Pos logit contributions
kin+0.012
inton+0.012
 wear+0.011
ippers+0.011
 personality+0.011
Neg logit contributions
 harder-0.008
 further-0.008
 tighter-0.008
 again-0.008
 worse-0.008
Token he
Token ID339
Feature activation+4.27e-01

Pos logit contributions
kin+0.017
 personality+0.016
 wear+0.016
inton+0.016
ippers+0.016
Neg logit contributions
 harder-0.016
 further-0.016
 again-0.016
 tighter-0.016
 worse-0.016
Token spat
Token ID15246
Feature activation+1.63e+00

Pos logit contributions
kin+0.009
 wear+0.009
 hardly+0.008
 toes+0.008
inton+0.008
Neg logit contributions
 further-0.010
 harder-0.010
 tighter-0.010
 again-0.010
 cal-0.009
Token it
Token ID340
Feature activation-6.85e+00

Pos logit contributions
kin+0.041
 wear+0.039
 hardly+0.038
 toes+0.038
inton+0.037
Neg logit contributions
 further-0.032
 tighter-0.031
 harder-0.031
 again-0.030
 cal-0.030
Token out
Token ID503
Feature activation-3.22e-01

Pos logit contributions
minded+0.126
lim+0.126
 further+0.125
 cal+0.121
 Owl+0.120
Neg logit contributions
 wear-0.171
kin-0.169
inton-0.156
apy-0.156
 hardly-0.156
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
 further+0.008
 tighter+0.008
 harder+0.008
 again+0.007
 aback+0.007
Neg logit contributions
kin-0.006
 wear-0.006
 toes-0.006
 hardly-0.006
inton-0.006
Token He
Token ID679
Feature activation-5.88e-01

Pos logit contributions
kin+0.001
 wear+0.001
 hardly+0.001
 toes+0.001
inton+0.001
Neg logit contributions
 further-0.002
 tighter-0.002
 harder-0.002
 again-0.002
 cal-0.002
Token tasted
Token ID29187
Feature activation-5.57e-01

Pos logit contributions
 further+0.015
 aback+0.014
 harder+0.014
 tighter+0.014
 again+0.014
Neg logit contributions
kin-0.012
 wear-0.012
 hardly-0.011
 toes-0.011
inton-0.011
Token something
Token ID1223
Feature activation-1.18e-01

Pos logit contributions
 further+0.012
 tighter+0.011
 aback+0.011
 harder+0.011
 cal+0.011
Neg logit contributions
kin-0.013
 wear-0.013
 toes-0.012
inton-0.012
 hardly-0.012
Token very
Token ID845
Feature activation-6.77e-01

Pos logit contributions
kin+0.014
 wear+0.013
inton+0.013
 personality+0.013
ippers+0.013
Neg logit contributions
 harder-0.011
 tighter-0.010
 further-0.010
 again-0.010
 worse-0.010
Token sour
Token ID11348
Feature activation-5.15e-01

Pos logit contributions
 tighter+0.012
 harder+0.012
 further+0.012
 worse+0.012
 again+0.012
Neg logit contributions
kin-0.017
 wear-0.017
 personality-0.017
 ago-0.017
 hardly-0.017
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
 further+0.014
 aback+0.014
 tighter+0.014
 cal+0.013
lim+0.013
Neg logit contributions
kin-0.009
 wear-0.008
 toes-0.008
inton-0.008
 ago-0.008
Token They
Token ID1119
Feature activation+1.80e+00

Pos logit contributions
kin+0.004
 wear+0.004
inton+0.004
 toes+0.004
 hardly+0.004
Neg logit contributions
 harder-0.004
 again-0.004
 cal-0.004
 further-0.004
 worse-0.004
Token spit
Token ID27591
Feature activation+5.00e-01

Pos logit contributions
kin+0.039
ippers+0.036
 ago+0.035
ilot+0.034
 importantly+0.034
Neg logit contributions
 harder-0.043
 again-0.042
 cal-0.042
 worse-0.042
 tighter-0.042
Token it
Token ID340
Feature activation-6.84e+00

Pos logit contributions
kin+0.012
 wear+0.012
 hardly+0.011
 toes+0.011
inton+0.011
Neg logit contributions
 further-0.010
 tighter-0.010
 cal-0.010
 harder-0.010
 aback-0.009
Token out
Token ID503
Feature activation-6.03e-01

Pos logit contributions
lim+0.150
 further+0.149
minded+0.147
 Owl+0.145
 cal+0.143
Neg logit contributions
 wear-0.148
kin-0.144
apy-0.137
 hardly-0.133
inton-0.131
Token.
Token ID13
Feature activation+3.89e-01

Pos logit contributions
 further+0.014
 aback+0.013
 tighter+0.013
lim+0.013
 harder+0.013
Neg logit contributions
kin-0.013
 wear-0.012
 toes-0.012
inton-0.012
apy-0.012
Token
Token ID198
Feature activation-5.80e-01

Pos logit contributions
kin+0.009
 wear+0.008
inton+0.008
 toes+0.008
 hardly+0.008
Neg logit contributions
 again-0.009
 harder-0.009
 cal-0.008
 further-0.008
 worse-0.008
Token
Token ID198
Feature activation-1.77e+00

Pos logit contributions
 again+0.011
 harder+0.011
 further+0.011
 cal+0.010
 tighter+0.010
Neg logit contributions
kin-0.015
 toes-0.015
 personality-0.015
 wear-0.014
inton-0.014
TokenMom
Token ID29252
Feature activation-2.39e+00

Pos logit contributions
 harder+0.031
 tighter+0.030
 further+0.030
 again+0.029
 closer+0.028
Neg logit contributions
 personality-0.048
kin-0.046
 imagination-0.046
 toes-0.045
 wear-0.044
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
kin+0.016
 wear+0.015
 hardly+0.015
 personality+0.015
apy+0.015
Neg logit contributions
 again-0.014
 further-0.014
 harder-0.014
 tighter-0.013
 cal-0.013
Token She
Token ID1375
Feature activation-1.89e-01

Pos logit contributions
kin+0.005
 wear+0.005
inton+0.005
 toes+0.005
ippers+0.005
Neg logit contributions
 again-0.005
 further-0.005
 harder-0.005
 aback-0.005
 tighter-0.005
Token quickly
Token ID2952
Feature activation-1.81e+00

Pos logit contributions
 further+0.004
 harder+0.004
 tighter+0.004
 again+0.004
 cal+0.004
Neg logit contributions
kin-0.004
 wear-0.004
 hardly-0.004
 toes-0.004
inton-0.004
Token gob
Token ID48484
Feature activation+1.86e+00

Pos logit contributions
 further+0.041
 harder+0.041
 tighter+0.040
 again+0.040
 cal+0.039
Neg logit contributions
kin-0.039
 wear-0.037
 hardly-0.036
 toes-0.036
inton-0.036
Tokenbled
Token ID9342
Feature activation-3.85e+00

Pos logit contributions
kin+0.033
 wear+0.030
 toes+0.029
inton+0.029
 hardly+0.029
Neg logit contributions
 further-0.051
 tighter-0.050
 harder-0.049
 cal-0.049
 aback-0.048
Token it
Token ID340
Feature activation-6.84e+00

Pos logit contributions
 Owl+0.086
lim+0.086
minded+0.083
 cal+0.082
 further+0.081
Neg logit contributions
 toes-0.072
 wear-0.072
kin-0.070
 hardly-0.066
inton-0.066
Token up
Token ID510
Feature activation-1.40e+00

Pos logit contributions
minded+0.106
 Owl+0.105
 cal+0.100
lim+0.096
 Eagle+0.091
Neg logit contributions
kin-0.204
 wear-0.195
 hardly-0.191
 toes-0.191
inton-0.184
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
 further+0.031
minded+0.030
 aback+0.030
lim+0.030
 cal+0.030
Neg logit contributions
kin-0.031
 wear-0.029
 toes-0.029
 hardly-0.028
inton-0.027
Token felt
Token ID2936
Feature activation+1.21e-01

Pos logit contributions
 further+0.027
 harder+0.026
 tighter+0.026
lim+0.025
 aback+0.025
Neg logit contributions
kin-0.026
 wear-0.026
 hardly-0.024
 toes-0.024
apy-0.024
Token so
Token ID523
Feature activation+1.52e-01

Pos logit contributions
kin+0.003
 wear+0.003
ippers+0.003
inton+0.003
 toes+0.003
Neg logit contributions
 further-0.002
 harder-0.002
 again-0.002
 tighter-0.002
 cal-0.002
Token happy
Token ID3772
Feature activation-2.36e-02

Pos logit contributions
ippers+0.003
kin+0.003
ilot+0.003
 classmates+0.003
 personality+0.003
Neg logit contributions
 cal-0.003
 worse-0.003
 harder-0.003
 further-0.003
 tighter-0.003
Token at
Token ID379
Feature activation+1.28e+00

Pos logit contributions
 further+0.047
 harder+0.046
 again+0.046
 tighter+0.045
 worse+0.045
Neg logit contributions
kin-0.051
 wear-0.048
 hardly-0.047
inton-0.047
 personality-0.047
Token all
Token ID477
Feature activation+9.18e-01

Pos logit contributions
kin+0.031
 wear+0.031
 toes+0.029
 hardly+0.029
 ago+0.028
Neg logit contributions
 tighter-0.025
 aback-0.025
 cal-0.025
minded-0.025
 further-0.025
Token.
Token ID13
Feature activation+9.36e-01

Pos logit contributions
kin+0.024
ippers+0.023
 hardly+0.023
 personality+0.022
ilot+0.022
Neg logit contributions
 again-0.019
 further-0.018
 harder-0.018
 worse-0.017
 tighter-0.017
Token He
Token ID679
Feature activation-3.29e-01

Pos logit contributions
kin+0.020
 wear+0.018
 toes+0.018
inton+0.017
 hardly+0.017
Neg logit contributions
 further-0.022
 harder-0.022
 again-0.022
 tighter-0.022
 aback-0.022
Token spat
Token ID15246
Feature activation+1.97e+00

Pos logit contributions
 further+0.008
 aback+0.008
 tighter+0.008
 harder+0.008
 again+0.008
Neg logit contributions
kin-0.007
 wear-0.006
 hardly-0.006
 toes-0.006
inton-0.006
Token it
Token ID340
Feature activation-6.84e+00

Pos logit contributions
kin+0.047
 wear+0.046
 hardly+0.044
 toes+0.044
inton+0.043
Neg logit contributions
 further-0.041
 tighter-0.039
 cal-0.039
 aback-0.038
 harder-0.038
Token out
Token ID503
Feature activation-5.03e-01

Pos logit contributions
minded+0.132
lim+0.129
 further+0.129
 Owl+0.125
 cal+0.125
Neg logit contributions
 wear-0.166
kin-0.163
apy-0.154
 hardly-0.153
inton-0.152
Token and
Token ID290
Feature activation-3.84e-01

Pos logit contributions
 further+0.010
lim+0.010
 tighter+0.010
 aback+0.010
minded+0.010
Neg logit contributions
kin-0.012
 wear-0.011
 toes-0.011
 hardly-0.011
apy-0.011
Token made
Token ID925
Feature activation+3.74e-01

Pos logit contributions
 further+0.008
 tighter+0.008
 harder+0.008
 aback+0.008
lim+0.008
Neg logit contributions
kin-0.009
 wear-0.008
 hardly-0.008
 toes-0.008
apy-0.008
Token a
Token ID257
Feature activation-3.05e-02

Pos logit contributions
kin+0.009
 wear+0.009
 toes+0.009
 hardly+0.009
apy+0.009
Neg logit contributions
 further-0.007
 tighter-0.007
 aback-0.007
lim-0.007
 cal-0.006
Token face
Token ID1986
Feature activation-3.76e-01

Pos logit contributions
 further+0.001
 harder+0.001
 tighter+0.001
 again+0.001
 aback+0.001
Neg logit contributions
kin-0.001
 wear-0.001
 hardly-0.001
 toes-0.001
inton-0.001
Token them
Token ID606
Feature activation-1.70e+00

Pos logit contributions
kin+0.020
 wear+0.020
 toes+0.019
 hardly+0.019
inton+0.019
Neg logit contributions
 further-0.017
 harder-0.016
 tighter-0.016
 again-0.016
 cal-0.016
Token cough
Token ID22094
Feature activation+7.10e-01

Pos logit contributions
 worse+0.036
 harder+0.036
 tighter+0.036
 again+0.036
 further+0.035
Neg logit contributions
kin-0.040
 ago-0.039
ippers-0.037
 personality-0.037
 importantly-0.036
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
kin+0.014
 wear+0.013
inton+0.013
 toes+0.013
 hardly+0.013
Neg logit contributions
 further-0.018
lim-0.017
 tighter-0.017
 aback-0.017
 cal-0.016
Token They
Token ID1119
Feature activation+2.05e+00

Pos logit contributions
 harder+0.003
 again+0.003
 further+0.003
 aback+0.003
 cal+0.003
Neg logit contributions
kin-0.002
 wear-0.002
ilot-0.002
 personality-0.002
inton-0.002
Token spit
Token ID27591
Feature activation+6.69e-01

Pos logit contributions
kin+0.041
ippers+0.037
inton+0.036
 ago+0.036
 personality+0.035
Neg logit contributions
 further-0.052
 harder-0.052
 tighter-0.051
 again-0.051
 cal-0.051
Token it
Token ID340
Feature activation-6.81e+00

Pos logit contributions
kin+0.017
 wear+0.017
 hardly+0.016
 toes+0.016
inton+0.016
Neg logit contributions
 further-0.013
 tighter-0.012
 harder-0.012
 again-0.012
 cal-0.012
Token out
Token ID503
Feature activation-1.00e+00

Pos logit contributions
lim+0.146
minded+0.146
 further+0.138
 cal+0.137
 Owl+0.135
Neg logit contributions
 wear-0.155
kin-0.147
apy-0.142
 hardly-0.139
inton-0.136
Token and
Token ID290
Feature activation-4.46e-01

Pos logit contributions
 further+0.019
 tighter+0.018
 aback+0.018
lim+0.017
 harder+0.017
Neg logit contributions
kin-0.026
 wear-0.025
 toes-0.024
inton-0.024
apy-0.024
Token make
Token ID787
Feature activation+5.87e-01

Pos logit contributions
 harder+0.011
 further+0.011
 again+0.011
 tighter+0.011
 cal+0.011
Neg logit contributions
kin-0.009
inton-0.008
ippers-0.008
 ago-0.008
ilot-0.008
Token faces
Token ID6698
Feature activation+2.71e+00

Pos logit contributions
kin+0.014
 wear+0.014
 toes+0.014
apy+0.014
 hardly+0.014
Neg logit contributions
 further-0.012
 tighter-0.011
 aback-0.011
lim-0.011
 harder-0.011
Token.
Token ID13
Feature activation+2.54e-01

Pos logit contributions
kin+0.066
 wear+0.063
 personality+0.062
 hardly+0.062
ippers+0.062
Neg logit contributions
 again-0.058
 harder-0.056
 further-0.054
 worse-0.053
 tighter-0.053
Tokenunch
Token ID3316
Feature activation+3.54e-01

Pos logit contributions
kin+0.041
 wear+0.038
 hardly+0.036
 toes+0.036
inton+0.036
Neg logit contributions
 further-0.079
 harder-0.078
 tighter-0.077
 again-0.077
 cal-0.076
Token up
Token ID510
Feature activation-1.55e+00

Pos logit contributions
kin+0.009
 wear+0.008
 toes+0.008
inton+0.007
apy+0.007
Neg logit contributions
 further-0.007
 aback-0.007
lim-0.007
minded-0.007
 cal-0.007
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
lim+0.040
 aback+0.040
 further+0.040
 Owl+0.039
minded+0.039
Neg logit contributions
kin-0.027
 wear-0.027
 toes-0.025
inton-0.023
 hardly-0.023
Token She
Token ID1375
Feature activation+1.49e-01

Pos logit contributions
 further+0.003
 tighter+0.003
 harder+0.003
 farther+0.003
 cal+0.003
Neg logit contributions
 wear-0.002
kin-0.002
 personality-0.002
 toes-0.002
 hardly-0.002
Token spat
Token ID15246
Feature activation+1.97e+00

Pos logit contributions
 wear+0.003
kin+0.003
 hardly+0.003
 toes+0.003
 personality+0.003
Neg logit contributions
 further-0.004
 aback-0.004
 farther-0.004
minded-0.004
 tighter-0.004
Token it
Token ID340
Feature activation-6.81e+00

Pos logit contributions
kin+0.046
 wear+0.045
 hardly+0.043
 toes+0.042
apy+0.042
Neg logit contributions
 further-0.042
 tighter-0.040
 cal-0.040
minded-0.039
 aback-0.039
Token out
Token ID503
Feature activation-6.26e-01

Pos logit contributions
minded+0.139
lim+0.136
 further+0.134
 Owl+0.130
 cal+0.129
Neg logit contributions
 wear-0.163
kin-0.156
apy-0.147
 hardly-0.147
 toes-0.144
Token and
Token ID290
Feature activation-2.99e-01

Pos logit contributions
 further+0.013
lim+0.012
minded+0.012
 aback+0.012
 tighter+0.012
Neg logit contributions
kin-0.014
 wear-0.014
 toes-0.014
apy-0.013
 hardly-0.013
Token said
Token ID531
Feature activation-1.50e-01

Pos logit contributions
 further+0.007
 harder+0.006
 tighter+0.006
 aback+0.006
 again+0.006
Neg logit contributions
kin-0.007
 wear-0.007
 toes-0.006
 hardly-0.006
apy-0.006
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
 further+0.004
 tighter+0.004
 harder+0.004
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.003
 wear-0.003
 hardly-0.003
 toes-0.003
inton-0.003
Token "
Token ID366
Feature activation-3.86e+00

Pos logit contributions
 further+0.042
 harder+0.042
 again+0.041
 tighter+0.041
 cal+0.040
Neg logit contributions
kin-0.028
 wear-0.026
inton-0.025
 toes-0.025
 hardly-0.024
Token her
Token ID607
Feature activation+4.82e-01

Pos logit contributions
 toes+0.010
kin+0.009
 wear+0.009
inton+0.008
apy+0.008
Neg logit contributions
lim-0.015
 cal-0.015
 further-0.015
minded-0.014
 aback-0.014
Token tongue
Token ID11880
Feature activation-4.21e-01

Pos logit contributions
kin+0.008
 wear+0.008
 hardly+0.008
inton+0.007
 personality+0.007
Neg logit contributions
 again-0.013
 tighter-0.013
 harder-0.013
 further-0.013
 worse-0.013
Token.
Token ID13
Feature activation+5.70e-01

Pos logit contributions
 further+0.009
 aback+0.008
lim+0.008
minded+0.008
 cal+0.008
Neg logit contributions
kin-0.010
 wear-0.010
 toes-0.010
 hardly-0.009
inton-0.009
Token She
Token ID1375
Feature activation+1.12e-01

Pos logit contributions
kin+0.013
inton+0.012
 wear+0.012
 toes+0.012
 hardly+0.012
Neg logit contributions
 harder-0.012
 again-0.012
 further-0.012
 aback-0.012
 tighter-0.012
Token spat
Token ID15246
Feature activation+1.87e+00

Pos logit contributions
kin+0.003
 wear+0.002
 hardly+0.002
inton+0.002
 toes+0.002
Neg logit contributions
 further-0.002
 harder-0.002
 tighter-0.002
 again-0.002
 cal-0.002
Token it
Token ID340
Feature activation-6.78e+00

Pos logit contributions
kin+0.047
 wear+0.046
 hardly+0.044
 toes+0.044
inton+0.044
Neg logit contributions
 further-0.036
 tighter-0.035
 harder-0.034
 cal-0.034
 again-0.034
Token out
Token ID503
Feature activation-4.83e-01

Pos logit contributions
minded+0.138
lim+0.135
 further+0.135
 cal+0.130
 Owl+0.128
Neg logit contributions
 wear-0.161
kin-0.158
apy-0.150
 toes-0.149
 hardly-0.148
Token and
Token ID290
Feature activation-3.55e-01

Pos logit contributions
 further+0.012
 aback+0.011
lim+0.011
 tighter+0.011
minded+0.011
Neg logit contributions
kin-0.010
 toes-0.009
 wear-0.009
 hardly-0.009
apy-0.009
Token cried
Token ID16896
Feature activation+1.50e+00

Pos logit contributions
 further+0.008
 harder+0.008
 tighter+0.008
 again+0.007
 aback+0.007
Neg logit contributions
kin-0.008
 wear-0.008
 toes-0.007
 hardly-0.007
inton-0.007
Token.
Token ID13
Feature activation+3.43e-01

Pos logit contributions
kin+0.025
 toes+0.024
 wear+0.023
apy+0.021
inton+0.021
Neg logit contributions
 further-0.041
lim-0.040
 aback-0.040
 cal-0.040
 tighter-0.039
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
kin+0.009
 wear+0.008
inton+0.008
 toes+0.007
orses+0.007
Neg logit contributions
 again-0.007
 harder-0.007
 further-0.007
 aback-0.007
 worse-0.007
Token bitter
Token ID12922
Feature activation-9.32e-01

Pos logit contributions
kin+0.020
 wear+0.019
 hardly+0.019
 toes+0.019
 ago+0.019
Neg logit contributions
 further-0.014
 tighter-0.013
minded-0.013
 aback-0.013
 again-0.012
Token!
Token ID0
Feature activation+9.50e-01

Pos logit contributions
 tighter+0.020
 further+0.020
 cal+0.019
 harder+0.019
 aback+0.019
Neg logit contributions
kin-0.021
 wear-0.020
 toes-0.020
inton-0.020
 hardly-0.019
Token He
Token ID679
Feature activation-1.61e-01

Pos logit contributions
kin+0.020
 wear+0.018
inton+0.017
 toes+0.017
 personality+0.017
Neg logit contributions
 further-0.023
 again-0.023
 harder-0.023
 aback-0.022
 tighter-0.022
Token quickly
Token ID2952
Feature activation-1.14e+00

Pos logit contributions
 further+0.004
 harder+0.004
 tighter+0.004
 again+0.003
 aback+0.003
Neg logit contributions
kin-0.004
 wear-0.003
 hardly-0.003
 toes-0.003
inton-0.003
Token spat
Token ID15246
Feature activation+2.14e+00

Pos logit contributions
 further+0.025
 harder+0.025
 tighter+0.024
 again+0.024
 cal+0.024
Neg logit contributions
kin-0.026
 wear-0.024
ippers-0.024
inton-0.024
 toes-0.024
Token it
Token ID340
Feature activation-6.75e+00

Pos logit contributions
kin+0.053
 wear+0.051
 hardly+0.050
 toes+0.049
inton+0.049
Neg logit contributions
 further-0.042
 tighter-0.041
 harder-0.040
 cal-0.039
 aback-0.039
Token out
Token ID503
Feature activation-3.91e-01

Pos logit contributions
minded+0.123
 further+0.120
lim+0.120
 cal+0.117
 Owl+0.113
Neg logit contributions
 wear-0.170
kin-0.165
 hardly-0.163
apy-0.159
 toes-0.158
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
 further+0.008
 tighter+0.007
 harder+0.007
 cal+0.007
 aback+0.007
Neg logit contributions
kin-0.010
 wear-0.009
 toes-0.009
 hardly-0.009
inton-0.009
Token
Token ID198
Feature activation+8.11e-01

Pos logit contributions
kin+0.028
 wear+0.026
inton+0.025
 toes+0.025
 personality+0.024
Neg logit contributions
 further-0.031
 again-0.031
 harder-0.031
 aback-0.031
 tighter-0.030
Token
Token ID198
Feature activation-6.17e-01

Pos logit contributions
kin+0.020
 toes+0.019
 personality+0.019
 wear+0.019
inton+0.018
Neg logit contributions
 again-0.016
 further-0.016
 harder-0.016
 aback-0.016
 cal-0.015
TokenThe
Token ID464
Feature activation-3.49e-01

Pos logit contributions
 further+0.011
 harder+0.011
 tighter+0.011
 again+0.011
 cal+0.010
Neg logit contributions
kin-0.016
 toes-0.016
 ago-0.016
 wear-0.016
 personality-0.016
Token coffee
Token ID6891
Feature activation+4.50e-02

Pos logit contributions
 further+0.040
 harder+0.039
 tighter+0.039
 again+0.039
 cal+0.038
Neg logit contributions
kin-0.046
 wear-0.044
 hardly-0.042
 toes-0.042
inton-0.042
Token down
Token ID866
Feature activation-1.39e+00

Pos logit contributions
kin+0.001
 wear+0.001
 hardly+0.001
inton+0.001
 toes+0.001
Neg logit contributions
 further-0.001
 again-0.001
 harder-0.001
 tighter-0.001
 worse-0.001
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
 further+0.026
 harder+0.025
 tighter+0.025
 again+0.025
 cal+0.024
Neg logit contributions
kin-0.036
 wear-0.035
 hardly-0.034
inton-0.034
 toes-0.034
Token He
Token ID679
Feature activation-8.28e-01

Pos logit contributions
 again+0.000
 harder+0.000
 further+0.000
 aback+0.000
 Owl+0.000
Neg logit contributions
kin-0.000
 wear-0.000
inton-0.000
ippers-0.000
 toes-0.000
Token spat
Token ID15246
Feature activation+1.63e+00

Pos logit contributions
 further+0.017
 harder+0.017
 again+0.017
 tighter+0.017
 cal+0.016
Neg logit contributions
kin-0.020
 wear-0.018
ippers-0.018
inton-0.018
 toes-0.018
Token it
Token ID340
Feature activation-6.74e+00

Pos logit contributions
kin+0.041
 wear+0.039
 personality+0.039
 ago+0.038
inton+0.038
Neg logit contributions
 harder-0.031
 again-0.031
 further-0.031
 tighter-0.030
 worse-0.029
Token out
Token ID503
Feature activation-1.22e-01

Pos logit contributions
 further+0.102
 tighter+0.098
 cal+0.096
 harder+0.095
lim+0.094
Neg logit contributions
kin-0.196
 wear-0.192
 toes-0.184
 hardly-0.184
inton-0.183
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
 further+0.002
 again+0.002
 harder+0.002
 tighter+0.002
 worse+0.002
Neg logit contributions
kin-0.003
 wear-0.003
 personality-0.003
inton-0.003
 hardly-0.003
Token then
Token ID788
Feature activation-2.03e+00

Pos logit contributions
 harder+0.007
 again+0.007
 further+0.007
 tighter+0.006
 worse+0.006
Neg logit contributions
kin-0.011
 wear-0.010
inton-0.010
ippers-0.010
ilot-0.010
Token ran
Token ID4966
Feature activation-2.96e+00

Pos logit contributions
 again+0.045
 further+0.045
 harder+0.044
 tighter+0.043
 cal+0.042
Neg logit contributions
kin-0.046
inton-0.043
 wear-0.043
 hardly-0.043
 personality-0.042
Token to
Token ID284
Feature activation-9.78e-01

Pos logit contributions
 further+0.038
 tighter+0.037
 harder+0.037
 cal+0.036
 worse+0.035
Neg logit contributions
kin-0.091
 wear-0.089
 hardly-0.087
 toes-0.087
inton-0.087
Token like
Token ID588
Feature activation+2.37e+00

Pos logit contributions
 harder+0.031
 further+0.031
 again+0.030
 worse+0.030
 tighter+0.030
Neg logit contributions
kin-0.024
apy-0.023
ippers-0.023
 hardly-0.022
inton-0.022
Token it
Token ID340
Feature activation-2.19e+00

Pos logit contributions
kin+0.059
 wear+0.056
 hardly+0.054
 toes+0.054
inton+0.054
Neg logit contributions
 further-0.047
 harder-0.046
 tighter-0.045
 again-0.045
 cal-0.044
Token.
Token ID13
Feature activation+4.13e-01

Pos logit contributions
 further+0.050
 harder+0.049
 again+0.049
 tighter+0.048
 worse+0.047
Neg logit contributions
kin-0.048
 wear-0.045
 hardly-0.044
inton-0.044
 toes-0.044
Token She
Token ID1375
Feature activation+1.58e-01

Pos logit contributions
kin+0.010
 wear+0.010
inton+0.010
 hardly+0.010
 toes+0.009
Neg logit contributions
 harder-0.008
 further-0.008
 again-0.008
 tighter-0.008
 aback-0.008
Token spit
Token ID27591
Feature activation+7.48e-01

Pos logit contributions
kin+0.003
 wear+0.003
 hardly+0.003
 toes+0.003
inton+0.003
Neg logit contributions
 further-0.004
 harder-0.004
 tighter-0.004
 aback-0.004
 again-0.004
Token it
Token ID340
Feature activation-6.74e+00

Pos logit contributions
kin+0.019
 wear+0.019
 hardly+0.018
 toes+0.018
apy+0.018
Neg logit contributions
 further-0.014
 tighter-0.013
lim-0.013
 cal-0.013
minded-0.013
Token out
Token ID503
Feature activation-4.67e-01

Pos logit contributions
minded+0.150
lim+0.148
 further+0.138
 cal+0.138
 Owl+0.137
Neg logit contributions
 wear-0.151
kin-0.145
 hardly-0.134
apy-0.133
 toes-0.132
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
 further+0.010
lim+0.010
 aback+0.010
minded+0.010
 tighter+0.009
Neg logit contributions
kin-0.010
 wear-0.010
 toes-0.010
apy-0.010
 hardly-0.010
Token said
Token ID531
Feature activation+2.61e-01

Pos logit contributions
 further+0.008
 harder+0.008
 tighter+0.008
 again+0.008
 aback+0.008
Neg logit contributions
kin-0.008
 wear-0.008
 toes-0.008
 hardly-0.007
inton-0.007
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
kin+0.006
 wear+0.006
inton+0.006
 hardly+0.005
 toes+0.005
Neg logit contributions
 again-0.006
 harder-0.006
 further-0.006
 worse-0.005
 tighter-0.005
Token "
Token ID366
Feature activation-3.81e+00

Pos logit contributions
 again+0.037
 harder+0.036
 further+0.034
 tighter+0.034
 cal+0.033
Neg logit contributions
kin-0.036
 wear-0.033
inton-0.033
ippers-0.032
 toes-0.031
Token mouth
Token ID5422
Feature activation+1.70e-01

Pos logit contributions
kin+0.022
 hardly+0.021
inton+0.021
ilot+0.021
apy+0.021
Neg logit contributions
 tighter-0.028
 harder-0.028
 again-0.028
 further-0.028
 cal-0.027
Token.
Token ID13
Feature activation+6.48e-01

Pos logit contributions
kin+0.004
 wear+0.004
 toes+0.003
inton+0.003
 ago+0.003
Neg logit contributions
 further-0.004
 cal-0.004
minded-0.004
lim-0.004
 tighter-0.004
Token You
Token ID921
Feature activation+2.21e+00

Pos logit contributions
kin+0.013
 wear+0.013
 hardly+0.012
inton+0.012
 toes+0.012
Neg logit contributions
 further-0.015
 harder-0.015
 tighter-0.015
 again-0.015
 cal-0.015
Token should
Token ID815
Feature activation+5.57e-01

Pos logit contributions
 wear+0.036
kin+0.035
 hardly+0.033
inton+0.031
 toes+0.031
Neg logit contributions
 further-0.064
 tighter-0.062
 harder-0.062
 again-0.061
 cal-0.060
Token spit
Token ID27591
Feature activation+1.35e+00

Pos logit contributions
kin+0.011
ilot+0.011
apy+0.011
 ago+0.010
 toes+0.010
Neg logit contributions
 harder-0.013
 again-0.013
 tighter-0.013
 worse-0.013
 cal-0.013
Token it
Token ID340
Feature activation-6.73e+00

Pos logit contributions
 wear+0.029
kin+0.027
 hardly+0.025
 toes+0.025
 personality+0.025
Neg logit contributions
 further-0.031
 cal-0.030
lim-0.030
 tighter-0.030
minded-0.029
Token out
Token ID503
Feature activation-1.56e-01

Pos logit contributions
lim+0.127
minded+0.123
 Owl+0.123
 cal+0.123
 further+0.120
Neg logit contributions
 wear-0.173
kin-0.160
 hardly-0.147
 toes-0.145
inton-0.142
Token.
Token ID13
Feature activation+8.59e-01

Pos logit contributions
 further+0.003
 tighter+0.003
 harder+0.003
 cal+0.003
 aback+0.003
Neg logit contributions
kin-0.004
 wear-0.004
 toes-0.003
 hardly-0.003
 ago-0.003
Token The
Token ID383
Feature activation+1.73e-01

Pos logit contributions
kin+0.016
 wear+0.016
 hardly+0.015
 toes+0.015
 personality+0.015
Neg logit contributions
 further-0.022
 tighter-0.021
 harder-0.021
 again-0.021
 cal-0.021
Token teacher
Token ID4701
Feature activation+3.02e-01

Pos logit contributions
kin+0.004
 wear+0.004
 toes+0.004
 hardly+0.004
 personality+0.004
Neg logit contributions
 further-0.003
 tighter-0.003
 harder-0.003
 again-0.003
 aback-0.003
Token will
Token ID481
Feature activation+1.69e+00

Pos logit contributions
kin+0.007
 toes+0.007
inton+0.007
 wear+0.007
 hardly+0.007
Neg logit contributions
 further-0.006
 again-0.006
 harder-0.006
 tighter-0.006
 cal-0.006
TokenHer
Token ID9360
Feature activation+1.06e+00

Pos logit contributions
 tighter+0.001
 again+0.001
 harder+0.001
 further+0.001
lim+0.001
Neg logit contributions
kin-0.001
 toes-0.001
 ago-0.001
inton-0.001
orses-0.001
Token mom
Token ID1995
Feature activation-1.04e+00

Pos logit contributions
kin+0.018
orses+0.017
 wear+0.017
 hardly+0.017
 ago+0.017
Neg logit contributions
 again-0.029
 tighter-0.029
 harder-0.029
 further-0.028
 worse-0.028
Token tried
Token ID3088
Feature activation-1.32e+00

Pos logit contributions
 further+0.028
 farther+0.027
minded+0.026
 worse+0.025
 cal+0.025
Neg logit contributions
kin-0.018
 wear-0.017
 hardly-0.017
orse-0.016
ana-0.016
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
ter+0.030
 cal+0.030
minded+0.030
lim+0.030
 tast+0.028
Neg logit contributions
 wear-0.026
 hardly-0.025
 toes-0.023
 Their-0.023
 organizing-0.023
Token pull
Token ID2834
Feature activation-1.11e+00

Pos logit contributions
minded+0.053
 cal+0.050
lim+0.050
 further+0.050
 again+0.049
Neg logit contributions
 wear-0.034
 hardly-0.027
kin-0.026
 arrive-0.025
 have-0.025
Token it
Token ID340
Feature activation-6.72e+00

Pos logit contributions
 cal+0.030
minded+0.030
lim+0.030
 nearest+0.029
 further+0.029
Neg logit contributions
 wear-0.018
 toes-0.016
kin-0.016
 Their-0.015
apy-0.015
Token out
Token ID503
Feature activation-9.38e-01

Pos logit contributions
minded+0.164
 cal+0.157
lim+0.150
 berries+0.141
 nearest+0.140
Neg logit contributions
 wear-0.134
 hardly-0.126
kin-0.122
 toes-0.118
apy-0.115
Token too
Token ID1165
Feature activation-1.94e+00

Pos logit contributions
 further+0.023
 cal+0.023
 tighter+0.023
 harder+0.022
minded+0.022
Neg logit contributions
kin-0.018
 wear-0.017
 toes-0.016
 hardly-0.016
apy-0.016
Token,
Token ID11
Feature activation-5.96e-01

Pos logit contributions
minded+0.055
 cal+0.054
 further+0.054
lim+0.053
 tighter+0.053
Neg logit contributions
kin-0.027
 wear-0.027
 hardly-0.027
 ago-0.025
apy-0.025
Token but
Token ID475
Feature activation-4.98e-01

Pos logit contributions
lim+0.016
minded+0.016
 cal+0.016
 further+0.016
 aback+0.015
Neg logit contributions
 wear-0.010
 hardly-0.009
 toes-0.009
 personality-0.009
 ago-0.008
Token it
Token ID340
Feature activation-4.09e-01

Pos logit contributions
 further+0.012
 cal+0.012
minded+0.012
 aback+0.012
lim+0.012
Neg logit contributions
kin-0.010
 wear-0.010
 hardly-0.009
apy-0.009
 toes-0.009
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
kin+0.002
 toes+0.001
inton+0.001
 ago+0.001
 wear+0.001
Neg logit contributions
 cal-0.002
 tighter-0.002
 further-0.002
 aback-0.002
lim-0.002
Token Tim
Token ID5045
Feature activation-1.20e+00

Pos logit contributions
 further+0.018
 harder+0.018
 tighter+0.018
 again+0.018
 cal+0.017
Neg logit contributions
kin-0.014
 wear-0.013
 toes-0.013
inton-0.013
 hardly-0.012
Token tried
Token ID3088
Feature activation-1.00e+00

Pos logit contributions
 further+0.030
 aback+0.027
minded+0.027
 cal+0.027
 nearer+0.027
Neg logit contributions
 wear-0.024
 hardly-0.023
kin-0.023
 toes-0.023
inton-0.023
Token to
Token ID284
Feature activation-1.23e+00

Pos logit contributions
lim+0.021
minded+0.021
 cal+0.021
 further+0.020
ter+0.020
Neg logit contributions
 hardly-0.021
 wear-0.020
 toes-0.019
 Their-0.019
apy-0.018
Token pull
Token ID2834
Feature activation-7.21e-01

Pos logit contributions
minded+0.032
 further+0.032
 tighter+0.032
 aback+0.032
 harder+0.032
Neg logit contributions
 wear-0.022
kin-0.021
 hardly-0.020
 toes-0.019
inton-0.019
Token it
Token ID340
Feature activation-6.72e+00

Pos logit contributions
minded+0.019
 cal+0.019
lim+0.018
 nearest+0.018
 further+0.017
Neg logit contributions
 toes-0.011
 wear-0.011
kin-0.011
apy-0.011
 Their-0.010
Token out
Token ID503
Feature activation-8.82e-01

Pos logit contributions
minded+0.151
 cal+0.142
lim+0.137
 Owl+0.125
 nearest+0.125
Neg logit contributions
 wear-0.141
 toes-0.136
 hardly-0.136
kin-0.133
apy-0.131
Token,
Token ID11
Feature activation-4.38e-01

Pos logit contributions
 further+0.021
 tighter+0.020
 harder+0.020
 again+0.020
 cal+0.020
Neg logit contributions
kin-0.018
 wear-0.017
 toes-0.017
 hardly-0.017
inton-0.017
Token but
Token ID475
Feature activation-3.12e-01

Pos logit contributions
 further+0.010
 tighter+0.010
 cal+0.010
lim+0.009
minded+0.009
Neg logit contributions
kin-0.009
 wear-0.009
 hardly-0.009
 toes-0.009
 personality-0.008
Token it
Token ID340
Feature activation-3.09e-01

Pos logit contributions
 further+0.007
 tighter+0.007
 harder+0.007
 cal+0.007
 aback+0.007
Neg logit contributions
kin-0.007
 wear-0.007
 hardly-0.006
 toes-0.006
apy-0.006
Token was
Token ID373
Feature activation-9.48e-02

Pos logit contributions
minded+0.009
 further+0.008
 aback+0.008
 cal+0.008
 tighter+0.008
Neg logit contributions
kin-0.006
 hardly-0.006
apy-0.005
 toes-0.005
 wear-0.005
Token so
Token ID523
Feature activation-1.35e-01

Pos logit contributions
kin+0.011
 wear+0.010
 hardly+0.010
 toes+0.010
 ago+0.010
Neg logit contributions
 further-0.006
 aback-0.006
minded-0.006
 again-0.006
lim-0.006
Token sour
Token ID11348
Feature activation-9.08e-01

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.003
 wear-0.003
 toes-0.003
 hardly-0.003
inton-0.003
Token that
Token ID326
Feature activation-2.21e-04

Pos logit contributions
 tighter+0.020
 further+0.020
 aback+0.019
 cal+0.019
lim+0.019
Neg logit contributions
kin-0.019
 wear-0.019
inton-0.018
 toes-0.018
 ago-0.018
Token he
Token ID339
Feature activation-1.52e+00

Pos logit contributions
 further+0.000
 again+0.000
 harder+0.000
 tighter+0.000
 worse+0.000
Neg logit contributions
kin-0.000
 wear-0.000
inton-0.000
 personality-0.000
 toes-0.000
Token spit
Token ID27591
Feature activation+3.75e-01

Pos logit contributions
 further+0.036
 harder+0.036
 tighter+0.036
 again+0.035
 cal+0.034
Neg logit contributions
kin-0.031
 wear-0.030
 hardly-0.029
 toes-0.029
inton-0.029
Token it
Token ID340
Feature activation-6.72e+00

Pos logit contributions
kin+0.009
 wear+0.009
 hardly+0.008
apy+0.008
 toes+0.008
Neg logit contributions
 further-0.008
lim-0.007
 tighter-0.007
minded-0.007
 cal-0.007
Token out
Token ID503
Feature activation-1.22e-01

Pos logit contributions
lim+0.133
 Owl+0.128
minded+0.128
 further+0.128
 cal+0.127
Neg logit contributions
 wear-0.160
kin-0.154
 hardly-0.143
inton-0.140
 toes-0.140
Token and
Token ID290
Feature activation-3.13e-01

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.002
Neg logit contributions
kin-0.003
 wear-0.003
 toes-0.003
 hardly-0.003
inton-0.003
Token made
Token ID925
Feature activation-3.43e-02

Pos logit contributions
 further+0.007
 harder+0.007
 tighter+0.007
 aback+0.007
 again+0.007
Neg logit contributions
kin-0.007
 wear-0.007
 hardly-0.006
 toes-0.006
inton-0.006
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
 further+0.001
 tighter+0.001
 aback+0.001
lim+0.001
 cal+0.001
Neg logit contributions
kin-0.001
 wear-0.001
 hardly-0.001
apy-0.001
 toes-0.001
Token funny
Token ID8258
Feature activation+1.35e+00

Pos logit contributions
 further+0.001
 aback+0.001
 tighter+0.001
 harder+0.001
 again+0.001
Neg logit contributions
kin-0.001
 wear-0.001
 hardly-0.001
 toes-0.001
inton-0.001
Token happy
Token ID3772
Feature activation+8.28e-01

Pos logit contributions
kin+0.036
ippers+0.035
 wear+0.034
 ago+0.034
 personality+0.033
Neg logit contributions
 again-0.026
 further-0.026
 cal-0.026
 harder-0.026
 tighter-0.025
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
kin+0.022
 wear+0.021
 hardly+0.021
inton+0.021
 toes+0.021
Neg logit contributions
 further-0.015
 harder-0.014
 again-0.014
 tighter-0.014
 cal-0.014
Token safe
Token ID3338
Feature activation+1.67e-02

Pos logit contributions
 again+0.004
 further+0.004
 cal+0.004
 tighter+0.004
 harder+0.004
Neg logit contributions
kin-0.004
ippers-0.003
 ago-0.003
inton-0.003
ilot-0.003
Token from
Token ID422
Feature activation-1.18e-01

Pos logit contributions
kin+0.000
 wear+0.000
 toes+0.000
 hardly+0.000
inton+0.000
Neg logit contributions
 further-0.000
 tighter-0.000
 harder-0.000
 cal-0.000
 aback-0.000
Token scary
Token ID14343
Feature activation+9.69e-01

Pos logit contributions
 further+0.002
 again+0.002
 harder+0.002
 tighter+0.002
 cal+0.002
Neg logit contributions
kin-0.003
ippers-0.003
inton-0.003
 wear-0.003
 hardly-0.003
Token creatures
Token ID8109
Feature activation+4.34e+00

Pos logit contributions
kin+0.026
 wear+0.025
 hardly+0.024
inton+0.024
 toes+0.024
Neg logit contributions
 further-0.018
 harder-0.017
 tighter-0.017
 again-0.017
 cal-0.016
Token.
Token ID13
Feature activation+7.90e-01

Pos logit contributions
kin+0.116
ippers+0.109
inton+0.108
 wear+0.107
apy+0.106
Neg logit contributions
 again-0.088
 further-0.084
 harder-0.077
 tighter-0.076
 cal-0.076
Token<|endoftext|>
Token ID50256
Feature activation-3.53e-01

Pos logit contributions
kin+0.020
ippers+0.018
 toes+0.018
 wear+0.018
inton+0.018
Neg logit contributions
 again-0.016
 cal-0.015
 Owl-0.015
 aback-0.015
 harder-0.015
TokenOnce
Token ID7454
Feature activation+9.10e-01

Pos logit contributions
 again+0.008
 further+0.008
 harder+0.007
 tighter+0.007
 aback+0.007
Neg logit contributions
kin-0.008
 wear-0.008
 toes-0.008
 hardly-0.008
 personality-0.008
Token upon
Token ID2402
Feature activation+4.24e-01

Pos logit contributions
 personality+0.023
anas+0.023
 beard+0.022
asts+0.022
kin+0.022
Neg logit contributions
 again-0.020
 further-0.018
 closer-0.017
 harder-0.017
 up-0.016
Token a
Token ID257
Feature activation+9.66e-01

Pos logit contributions
kin+0.013
 wear+0.013
 toes+0.013
inton+0.013
 hardly+0.013
Neg logit contributions
 further-0.005
 harder-0.005
 again-0.005
 tighter-0.005
 worse-0.005
Token looked
Token ID3114
Feature activation-5.40e-01

Pos logit contributions
kin+0.041
 wear+0.040
 hardly+0.040
 toes+0.039
inton+0.038
Neg logit contributions
 further-0.041
 tighter-0.041
 harder-0.040
 aback-0.040
 worse-0.039
Token at
Token ID379
Feature activation-4.09e-01

Pos logit contributions
 further+0.012
 harder+0.012
 tighter+0.012
 again+0.012
 cal+0.011
Neg logit contributions
kin-0.012
 wear-0.011
 hardly-0.011
 toes-0.011
inton-0.011
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 further+0.010
 tighter+0.010
 cal+0.010
lim+0.010
 aback+0.010
Neg logit contributions
 wear-0.008
 toes-0.007
kin-0.007
 hardly-0.007
 Their-0.007
Token boy
Token ID2933
Feature activation-2.25e+00

Pos logit contributions
 aback+0.030
 further+0.030
lim+0.028
 again+0.028
 cal+0.028
Neg logit contributions
kin-0.033
 wear-0.032
 toes-0.031
inton-0.030
 personality-0.030
Token and
Token ID290
Feature activation+5.55e-01

Pos logit contributions
 further+0.036
 harder+0.035
 tighter+0.035
 again+0.035
 cal+0.033
Neg logit contributions
kin-0.064
 wear-0.062
 hardly-0.060
inton-0.060
 toes-0.060
Token cl
Token ID537
Feature activation+4.36e+00

Pos logit contributions
 wear+0.011
kin+0.011
 toes+0.011
 hardly+0.011
apy+0.011
Neg logit contributions
 further-0.013
 tighter-0.012
 harder-0.012
 aback-0.012
lim-0.012
Tokenucked
Token ID17758
Feature activation-1.21e+00

Pos logit contributions
kin+0.080
 hardly+0.080
 wear+0.079
pron+0.074
 ago+0.073
Neg logit contributions
 aback-0.108
 again-0.108
minded-0.102
 further-0.102
lim-0.102
Token.
Token ID13
Feature activation+8.09e-01

Pos logit contributions
 Owl+0.032
lim+0.032
 further+0.031
 least+0.030
minded+0.030
Neg logit contributions
 toes-0.019
apy-0.017
kin-0.017
inton-0.017
 wear-0.016
Token The
Token ID383
Feature activation-4.30e-01

Pos logit contributions
kin+0.016
 wear+0.015
inton+0.014
 toes+0.014
 personality+0.014
Neg logit contributions
 further-0.020
 harder-0.020
 again-0.020
 tighter-0.020
 aback-0.019
Token boy
Token ID2933
Feature activation-8.52e-01

Pos logit contributions
 further+0.009
 aback+0.009
 tighter+0.008
 again+0.008
 harder+0.008
Neg logit contributions
kin-0.011
 wear-0.010
 toes-0.010
inton-0.010
 personality-0.009
Token laughed
Token ID13818
Feature activation+2.34e+00

Pos logit contributions
 further+0.020
 tighter+0.019
 aback+0.019
 harder+0.019
 worse+0.018
Neg logit contributions
 wear-0.018
kin-0.018
 hardly-0.018
 toes-0.017
apy-0.017
Token them
Token ID606
Feature activation-1.05e+00

Pos logit contributions
 further+0.031
 tighter+0.030
 harder+0.030
 cal+0.030
lim+0.029
Neg logit contributions
 wear-0.025
kin-0.024
 toes-0.023
inton-0.023
 personality-0.022
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 further+0.020
 tighter+0.020
 harder+0.020
 again+0.019
 aback+0.019
Neg logit contributions
kin-0.026
 wear-0.025
 toes-0.025
 hardly-0.024
inton-0.024
Token picture
Token ID4286
Feature activation-2.34e-01

Pos logit contributions
kin+0.025
inton+0.024
 wear+0.024
 toes+0.024
 hardly+0.023
Neg logit contributions
 further-0.024
 harder-0.024
 tighter-0.023
 again-0.023
 cal-0.023
Token of
Token ID286
Feature activation+2.89e+00

Pos logit contributions
 further+0.006
 again+0.006
 harder+0.006
 tighter+0.006
 worse+0.006
Neg logit contributions
kin-0.004
 wear-0.004
inton-0.004
 hardly-0.004
 personality-0.004
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
kin+0.085
 wear+0.081
inton+0.081
 hardly+0.080
ippers+0.079
Neg logit contributions
 further-0.043
 harder-0.042
 again-0.042
 tighter-0.041
 cal-0.040
Token solar
Token ID6591
Feature activation+4.00e+00

Pos logit contributions
kin+0.034
inton+0.033
 wear+0.032
ippers+0.032
 hardly+0.032
Neg logit contributions
 further-0.030
 again-0.030
 harder-0.030
 tighter-0.029
 cal-0.028
Token eclipse
Token ID25872
Feature activation+4.26e+00

Pos logit contributions
kin+0.093
 wear+0.089
inton+0.086
 hardly+0.086
 toes+0.086
Neg logit contributions
 further-0.085
 harder-0.084
 again-0.083
 tighter-0.082
 cal-0.080
Token in
Token ID287
Feature activation+2.49e+00

Pos logit contributions
kin+0.103
inton+0.098
 wear+0.098
 hardly+0.098
ippers+0.095
Neg logit contributions
 again-0.087
 further-0.084
 harder-0.084
 tighter-0.080
 worse-0.078
Token the
Token ID262
Feature activation-3.67e-01

Pos logit contributions
kin+0.069
inton+0.068
 hardly+0.066
orses+0.066
 wear+0.066
Neg logit contributions
 further-0.042
 harder-0.042
 again-0.041
 tighter-0.040
 cal-0.038
Token book
Token ID1492
Feature activation+3.10e-01

Pos logit contributions
 further+0.008
 harder+0.008
 tighter+0.008
 again+0.008
 aback+0.008
Neg logit contributions
kin-0.008
 wear-0.008
 toes-0.007
 personality-0.007
 hardly-0.007
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
kin+0.007
 wear+0.007
inton+0.007
 personality+0.007
 hardly+0.007
Neg logit contributions
 again-0.007
 harder-0.007
 further-0.007
 tighter-0.006
 worse-0.006
Token In
Token ID554
Feature activation+1.04e+00

Pos logit contributions
kin+0.021
 wear+0.019
inton+0.018
ippers+0.018
orses+0.018
Neg logit contributions
 harder-0.023
 again-0.023
 further-0.023
 aback-0.021
 tighter-0.021
Token the
Token ID262
Feature activation+5.19e-01

Pos logit contributions
kin+0.029
inton+0.028
 wear+0.028
ippers+0.027
 personality+0.027
Neg logit contributions
 further-0.018
 harder-0.018
 again-0.017
 tighter-0.016
 cal-0.016
Token distance
Token ID5253
Feature activation+2.20e+00

Pos logit contributions
kin+0.012
 wear+0.012
 toes+0.012
 hardly+0.012
inton+0.011
Neg logit contributions
 further-0.010
 tighter-0.010
 harder-0.010
 again-0.010
 aback-0.010
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
kin+0.052
 wear+0.049
inton+0.048
 hardly+0.048
 personality+0.047
Neg logit contributions
 further-0.048
 again-0.046
 harder-0.046
 tighter-0.045
 worse-0.044
Token the
Token ID262
Feature activation+6.40e-01

Pos logit contributions
kin+0.006
inton+0.006
 wear+0.006
 personality+0.006
ippers+0.006
Neg logit contributions
 further-0.004
 again-0.004
 harder-0.004
 worse-0.004
 tighter-0.004
Token lights
Token ID7588
Feature activation+4.75e+00

Pos logit contributions
kin+0.016
 wear+0.015
 toes+0.015
inton+0.015
 hardly+0.015
Neg logit contributions
 further-0.012
 tighter-0.012
 harder-0.012
 aback-0.012
 again-0.012
Token from
Token ID422
Feature activation-1.56e+00

Pos logit contributions
kin+0.112
ippers+0.108
 personality+0.106
 classmates+0.105
inton+0.105
Neg logit contributions
 again-0.104
 further-0.103
 harder-0.098
 worse-0.094
 tighter-0.091
Token a
Token ID257
Feature activation+3.68e-01

Pos logit contributions
 aback+0.033
 tighter+0.033
 cal+0.032
 further+0.032
minded+0.032
Neg logit contributions
kin-0.033
 wear-0.033
 toes-0.033
 ago-0.032
 personality-0.031
Token house
Token ID2156
Feature activation+1.87e+00

Pos logit contributions
kin+0.009
 wear+0.009
inton+0.008
 hardly+0.008
 toes+0.008
Neg logit contributions
 further-0.007
 harder-0.007
 tighter-0.007
 again-0.007
 cal-0.007
Token were
Token ID547
Feature activation-7.29e-01

Pos logit contributions
kin+0.050
ippers+0.048
 personality+0.048
 wear+0.048
inton+0.047
Neg logit contributions
 again-0.035
 further-0.034
 harder-0.033
 worse-0.031
 cal-0.029
Token flickering
Token ID50104
Feature activation+9.08e-01

Pos logit contributions
 further+0.014
 harder+0.014
 tighter+0.014
 again+0.013
 worse+0.013
Neg logit contributions
kin-0.019
 wear-0.018
inton-0.017
 hardly-0.017
 personality-0.017
Token with
Token ID351
Feature activation+5.46e-01

Pos logit contributions
 tighter+0.070
 harder+0.069
 worse+0.069
 cal+0.069
lim+0.068
Neg logit contributions
 wear-0.042
kin-0.040
 toes-0.038
 hardly-0.038
 ago-0.037
Token her
Token ID607
Feature activation+1.28e+00

Pos logit contributions
kin+0.015
 wear+0.014
inton+0.014
 hardly+0.014
 toes+0.014
Neg logit contributions
 further-0.009
 harder-0.009
 tighter-0.009
 again-0.009
 cal-0.009
Token family
Token ID1641
Feature activation-1.15e-01

Pos logit contributions
kin+0.029
inton+0.028
orses+0.027
 hardly+0.027
apy+0.027
Neg logit contributions
 further-0.028
 again-0.027
 tighter-0.027
 harder-0.026
 cal-0.026
Token,
Token ID11
Feature activation+5.06e-01

Pos logit contributions
 further+0.002
 again+0.002
 harder+0.002
 tighter+0.002
 aback+0.002
Neg logit contributions
kin-0.003
 personality-0.003
inton-0.003
apy-0.003
ippers-0.003
Token the
Token ID262
Feature activation-4.42e-01

Pos logit contributions
kin+0.009
 wear+0.008
 hardly+0.008
 toes+0.008
apy+0.008
Neg logit contributions
 further-0.014
 tighter-0.013
 harder-0.013
 aback-0.013
 cal-0.013
Token animals
Token ID4695
Feature activation+4.48e+00

Pos logit contributions
 further+0.008
 tighter+0.008
 harder+0.008
 again+0.007
 aback+0.007
Neg logit contributions
kin-0.012
 wear-0.011
 personality-0.011
 toes-0.011
inton-0.011
Token had
Token ID550
Feature activation-2.07e+00

Pos logit contributions
kin+0.101
 personality+0.093
 wear+0.093
 toes+0.093
ippers+0.092
Neg logit contributions
 further-0.104
 again-0.101
 harder-0.098
 tighter-0.096
 worse-0.096
Token already
Token ID1541
Feature activation-2.28e-01

Pos logit contributions
 further+0.038
 tighter+0.037
 harder+0.037
 again+0.035
 cal+0.035
Neg logit contributions
kin-0.053
 wear-0.052
 hardly-0.051
 toes-0.050
inton-0.050
Token destroyed
Token ID6572
Feature activation-1.49e+00

Pos logit contributions
 further+0.005
 again+0.005
 harder+0.004
 tighter+0.004
 cal+0.004
Neg logit contributions
kin-0.006
 hardly-0.005
 wear-0.005
 personality-0.005
apy-0.005
Token the
Token ID262
Feature activation-9.74e-01

Pos logit contributions
 tighter+0.036
lim+0.035
minded+0.035
 aback+0.035
 further+0.034
Neg logit contributions
 toes-0.027
kin-0.027
 wear-0.026
 ago-0.025
inton-0.024
Token store
Token ID3650
Feature activation-2.30e+00

Pos logit contributions
 again+0.022
 tighter+0.022
 aback+0.022
 worse+0.021
 harder+0.021
Neg logit contributions
 wear-0.020
 ago-0.019
 personality-0.018
 imagination-0.018
 age-0.018
Token are
Token ID389
Feature activation+2.02e+00

Pos logit contributions
kin+0.030
 wear+0.029
 hardly+0.027
inton+0.027
 toes+0.026
Neg logit contributions
 further-0.048
 tighter-0.047
 harder-0.047
 again-0.046
 cal-0.045
Token the
Token ID262
Feature activation+1.08e-01

Pos logit contributions
kin+0.056
 wear+0.054
 toes+0.052
inton+0.052
 hardly+0.052
Neg logit contributions
 further-0.034
 harder-0.033
 tighter-0.033
 again-0.033
 cal-0.032
Token best
Token ID1266
Feature activation+2.36e+00

Pos logit contributions
kin+0.002
 wear+0.002
inton+0.002
 toes+0.002
 hardly+0.002
Neg logit contributions
 further-0.003
 tighter-0.003
 again-0.003
 harder-0.003
 aback-0.003
Token.
Token ID13
Feature activation+4.81e-02

Pos logit contributions
 hardly+0.055
kin+0.054
 wear+0.054
apy+0.054
 personality+0.053
Neg logit contributions
 again-0.049
 harder-0.047
 further-0.045
 cal-0.043
 worse-0.043
Token Do
Token ID2141
Feature activation+4.37e-01

Pos logit contributions
kin+0.001
inton+0.001
 toes+0.001
 hardly+0.001
 wear+0.001
Neg logit contributions
 aback-0.001
 Owl-0.001
 cal-0.001
 again-0.001
 harder-0.001
Token you
Token ID345
Feature activation+1.22e+00

Pos logit contributions
kin+0.011
 wear+0.011
 toes+0.011
inton+0.011
 hardly+0.011
Neg logit contributions
 further-0.008
 harder-0.008
 tighter-0.008
 again-0.008
 worse-0.008
Token want
Token ID765
Feature activation-6.54e-01

Pos logit contributions
apy+0.025
 toes+0.024
kin+0.023
 hardly+0.023
ippers+0.023
Neg logit contributions
 harder-0.031
 again-0.030
 cal-0.029
 worse-0.029
 further-0.029
Token to
Token ID284
Feature activation+8.23e-01

Pos logit contributions
 tighter+0.013
 further+0.013
 aback+0.013
 cal+0.012
lim+0.012
Neg logit contributions
 wear-0.015
 imagination-0.014
 hardly-0.014
 Their-0.014
kin-0.014
Token keep
Token ID1394
Feature activation+1.08e+00

Pos logit contributions
kin+0.015
 toes+0.013
inton+0.013
 personality+0.013
 ago+0.013
Neg logit contributions
 again-0.022
 further-0.022
 harder-0.022
 cal-0.021
 worse-0.021
Token the
Token ID262
Feature activation-1.25e-01

Pos logit contributions
 wear+0.021
 hardly+0.020
 toes+0.020
kin+0.020
orses+0.019
Neg logit contributions
 tighter-0.026
 further-0.026
 aback-0.026
 cal-0.025
 harder-0.025
Token fish
Token ID5916
Feature activation-9.08e-01

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.002
Neg logit contributions
kin-0.003
 wear-0.003
 hardly-0.003
 toes-0.003
inton-0.003
Token I
Token ID314
Feature activation+2.07e+00

Pos logit contributions
kin+0.015
inton+0.013
 wear+0.013
 toes+0.013
 hardly+0.013
Neg logit contributions
 again-0.014
 aback-0.014
 further-0.014
 harder-0.014
 cal-0.014
Token am
Token ID716
Feature activation+1.21e+00

Pos logit contributions
kin+0.046
 wear+0.043
 toes+0.043
inton+0.042
 hardly+0.042
Neg logit contributions
 further-0.046
 harder-0.045
 tighter-0.045
 again-0.045
 cal-0.044
Token here
Token ID994
Feature activation+5.24e-01

Pos logit contributions
kin+0.031
 toes+0.029
 wear+0.028
inton+0.028
ippers+0.028
Neg logit contributions
 further-0.024
 harder-0.023
 again-0.023
 cal-0.023
 worse-0.023
Token for
Token ID329
Feature activation+3.75e-01

Pos logit contributions
kin+0.016
 wear+0.015
inton+0.015
ippers+0.015
 hardly+0.015
Neg logit contributions
 again-0.007
 further-0.007
 harder-0.007
 tighter-0.007
 cal-0.007
Token you
Token ID345
Feature activation-2.11e-01

Pos logit contributions
kin+0.009
 wear+0.009
 hardly+0.008
 toes+0.008
 personality+0.008
Neg logit contributions
 further-0.008
 tighter-0.007
 harder-0.007
 aback-0.007
 again-0.007
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
 again+0.004
 further+0.004
 harder+0.004
 tighter+0.004
 worse+0.004
Neg logit contributions
kin-0.005
apy-0.005
 wear-0.005
inton-0.005
 hardly-0.005
Token I
Token ID314
Feature activation+2.23e+00

Pos logit contributions
kin+0.016
inton+0.015
 wear+0.014
 toes+0.014
 hardly+0.014
Neg logit contributions
 further-0.016
 aback-0.016
 again-0.016
 harder-0.016
 cal-0.016
Token'll
Token ID1183
Feature activation+1.35e+00

Pos logit contributions
kin+0.049
 wear+0.046
 toes+0.046
inton+0.045
 hardly+0.045
Neg logit contributions
 further-0.050
 harder-0.048
 again-0.048
 tighter-0.048
 cal-0.047
Token be
Token ID307
Feature activation-1.72e-01

Pos logit contributions
kin+0.031
 toes+0.028
inton+0.028
apy+0.028
ippers+0.027
Neg logit contributions
 further-0.030
 again-0.030
 cal-0.029
 harder-0.029
 worse-0.029
Token your
Token ID534
Feature activation+1.77e+00

Pos logit contributions
 further+0.003
 harder+0.003
 tighter+0.003
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.004
 wear-0.004
 hardly-0.004
 toes-0.004
inton-0.004
Token friend
Token ID1545
Feature activation+1.35e-01

Pos logit contributions
kin+0.042
apy+0.041
 hardly+0.040
inton+0.039
 wear+0.039
Neg logit contributions
 further-0.037
 tighter-0.036
 again-0.036
 harder-0.036
 farther-0.034
Token run
Token ID1057
Feature activation-1.77e+00

Pos logit contributions
kin+0.090
ilot+0.088
apy+0.082
orses+0.081
ippers+0.081
Neg logit contributions
 again-0.061
 enough-0.045
 further-0.045
 harder-0.043
 anyway-0.041
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
 further+0.036
 harder+0.035
 again+0.034
 tighter+0.034
 cal+0.033
Neg logit contributions
kin-0.044
 wear-0.042
 hardly-0.041
 toes-0.040
inton-0.040
Token<|endoftext|>
Token ID50256
Feature activation-3.54e-01

Pos logit contributions
kin+0.016
 toes+0.014
inton+0.014
 ago+0.014
 wear+0.013
Neg logit contributions
 again-0.011
 harder-0.010
 aback-0.010
 tighter-0.010
 Owl-0.010
TokenOnce
Token ID7454
Feature activation+7.65e-01

Pos logit contributions
 again+0.007
 further+0.007
 harder+0.006
 aback+0.006
 tighter+0.006
Neg logit contributions
kin-0.009
 wear-0.009
 toes-0.009
 hardly-0.009
 personality-0.009
Token upon
Token ID2402
Feature activation+3.17e-01

Pos logit contributions
 personality+0.020
anas+0.019
kin+0.019
 beard+0.019
asts+0.019
Neg logit contributions
 again-0.016
 further-0.014
 closer-0.014
 harder-0.013
 up-0.013
Token a
Token ID257
Feature activation+6.17e-01

Pos logit contributions
kin+0.010
 wear+0.010
inton+0.010
 toes+0.010
 personality+0.010
Neg logit contributions
 further-0.004
 harder-0.004
 again-0.004
 tighter-0.004
 worse-0.004
Token time
Token ID640
Feature activation+5.38e-01

Pos logit contributions
kin+0.009
apy+0.009
ippers+0.009
 toes+0.008
 classmates+0.008
Neg logit contributions
 harder-0.018
 again-0.018
 further-0.018
 tighter-0.018
 cal-0.017
Token,
Token ID11
Feature activation+9.58e-01

Pos logit contributions
kin+0.014
 personality+0.014
 wear+0.014
inton+0.014
 hardly+0.013
Neg logit contributions
 again-0.010
 further-0.010
 harder-0.009
 aback-0.009
 closer-0.008
Token there
Token ID612
Feature activation+3.57e+00

Pos logit contributions
 hardly+0.028
 personality+0.028
ippers+0.028
kin+0.027
inton+0.027
Neg logit contributions
 further-0.015
 again-0.015
 harder-0.014
 closer-0.014
 worse-0.013
Token was
Token ID373
Feature activation-2.62e+00

Pos logit contributions
ippers+0.121
kin+0.118
ilot+0.116
 personality+0.116
inton+0.115
Neg logit contributions
 again-0.043
 further-0.035
 harder-0.033
 aback-0.030
 worse-0.029
Token a
Token ID257
Feature activation-8.55e-01

Pos logit contributions
 further+0.041
 tighter+0.039
 harder+0.038
 again+0.038
 cal+0.037
Neg logit contributions
kin-0.077
 wear-0.074
 toes-0.072
 hardly-0.072
inton-0.072
Token set
Token ID900
Feature activation-1.84e-01

Pos logit contributions
 tighter+0.011
 aback+0.011
 further+0.011
minded+0.011
 harder+0.011
Neg logit contributions
 wear-0.011
kin-0.011
 hardly-0.010
 toes-0.010
inton-0.010
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
minded+0.005
 tighter+0.005
 further+0.005
 cal+0.005
 harder+0.005
Neg logit contributions
kin-0.003
 wear-0.003
 hardly-0.003
 toes-0.003
apy-0.002
Token Lily
Token ID20037
Feature activation-6.53e-01

Pos logit contributions
 further+0.009
 harder+0.009
 tighter+0.009
 again+0.009
 cal+0.009
Neg logit contributions
kin-0.008
 wear-0.007
 hardly-0.007
 toes-0.007
inton-0.007
Token and
Token ID290
Feature activation+1.95e+00

Pos logit contributions
 further+0.012
 harder+0.011
 tighter+0.011
 again+0.011
 aback+0.011
Neg logit contributions
kin-0.017
 wear-0.017
 hardly-0.016
 toes-0.016
inton-0.016
Token her
Token ID607
Feature activation+2.11e+00

Pos logit contributions
kin+0.049
inton+0.046
ippers+0.046
ilot+0.045
 hardly+0.045
Neg logit contributions
 again-0.039
 tighter-0.038
 harder-0.038
 further-0.038
 worse-0.037
Token mom
Token ID1995
Feature activation+7.09e-01

Pos logit contributions
 ago+0.046
orses+0.044
 hardly+0.042
kin+0.042
inton+0.042
Neg logit contributions
 again-0.050
 tighter-0.046
 harder-0.045
 worse-0.044
 cal-0.043
Tokenmy
Token ID1820
Feature activation+1.58e+00

Pos logit contributions
kin+0.025
 hardly+0.023
ippers+0.023
 wear+0.023
 ago+0.023
Neg logit contributions
 again-0.007
 tighter-0.007
 further-0.007
 harder-0.007
 cal-0.006
Token went
Token ID1816
Feature activation-3.83e+00

Pos logit contributions
kin+0.038
ippers+0.036
 wear+0.035
 ago+0.035
 hardly+0.035
Neg logit contributions
 further-0.032
 again-0.032
 tighter-0.032
 harder-0.032
 cal-0.031
Token back
Token ID736
Feature activation-4.50e+00

Pos logit contributions
 further+0.058
 harder+0.056
 tighter+0.056
 again+0.054
 cal+0.053
Neg logit contributions
kin-0.111
 wear-0.109
 toes-0.105
 hardly-0.105
inton-0.105
Token outside
Token ID2354
Feature activation+2.23e+00

Pos logit contributions
 tighter+0.082
 further+0.081
 harder+0.081
 worse+0.081
 cal+0.079
Neg logit contributions
 wear-0.116
 toes-0.108
kin-0.107
inton-0.106
 ago-0.105
Token to
Token ID284
Feature activation+1.15e-01

Pos logit contributions
kin+0.080
ippers+0.072
inton+0.072
orses+0.071
ilot+0.071
Neg logit contributions
 again-0.033
 further-0.026
 harder-0.024
 tighter-0.022
 cal-0.021
Token the
Token ID262
Feature activation-2.55e+00

Pos logit contributions
 further+0.027
 harder+0.026
 again+0.026
 tighter+0.025
 aback+0.024
Neg logit contributions
kin-0.040
inton-0.036
 personality-0.036
orses-0.036
 toes-0.036
Token store
Token ID3650
Feature activation-2.22e+00

Pos logit contributions
 aback+0.064
 tighter+0.063
 harder+0.062
 cal+0.062
 further+0.061
Neg logit contributions
 wear-0.048
 hardly-0.044
kin-0.042
inton-0.041
 arrive-0.041
Token to
Token ID284
Feature activation+8.92e-01

Pos logit contributions
 further+0.041
 cal+0.040
 harder+0.040
 tighter+0.040
 worse+0.039
Neg logit contributions
kin-0.055
 wear-0.055
inton-0.053
 ago-0.053
 toes-0.053
Token buy
Token ID2822
Feature activation-2.07e+00

Pos logit contributions
kin+0.017
 toes+0.015
inton+0.015
 personality+0.015
 wear+0.015
Neg logit contributions
 further-0.024
 harder-0.023
 again-0.023
 tighter-0.023
 worse-0.022
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
lim+0.041
 Owl+0.041
 aback+0.040
 tighter+0.040
 cal+0.040
Neg logit contributions
 wear-0.049
 toes-0.047
kin-0.046
apy-0.046
 Their-0.045
Token wood
Token ID4898
Feature activation+1.99e-01

Pos logit contributions
 further+0.003
 tighter+0.003
 harder+0.003
 again+0.003
 cal+0.003
Neg logit contributions
kin-0.003
 wear-0.003
 toes-0.003
 hardly-0.003
inton-0.003
Token and
Token ID290
Feature activation+5.58e-01

Pos logit contributions
kin+0.004
 wear+0.004
 toes+0.004
 ago+0.004
 hardly+0.004
Neg logit contributions
 further-0.004
 tighter-0.004
 harder-0.004
 cal-0.004
lim-0.004
Token tools
Token ID4899
Feature activation+1.76e+00

Pos logit contributions
kin+0.012
 wear+0.011
inton+0.011
 personality+0.011
 hardly+0.011
Neg logit contributions
 further-0.013
 harder-0.013
 tighter-0.013
 again-0.013
 worse-0.012
Token they
Token ID484
Feature activation+2.48e+00

Pos logit contributions
kin+0.040
 wear+0.038
 hardly+0.037
 toes+0.037
inton+0.037
Neg logit contributions
 further-0.038
 harder-0.037
 tighter-0.037
 again-0.037
 cal-0.036
Token needed
Token ID2622
Feature activation-8.66e-01

Pos logit contributions
kin+0.058
 wear+0.053
inton+0.053
 personality+0.053
 toes+0.053
Neg logit contributions
 further-0.053
 tighter-0.052
 harder-0.052
 again-0.051
 worse-0.050
Token.
Token ID13
Feature activation+5.95e-01

Pos logit contributions
 further+0.015
 harder+0.015
 tighter+0.014
 again+0.014
 cal+0.014
Neg logit contributions
kin-0.024
 wear-0.023
 hardly-0.022
inton-0.022
 toes-0.022
Token fun
Token ID1257
Feature activation+1.76e+00

Pos logit contributions
kin+0.022
 wear+0.022
ippers+0.021
inton+0.021
 personality+0.021
Neg logit contributions
 harder-0.013
 further-0.013
 tighter-0.012
 again-0.012
 worse-0.012
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
kin+0.051
 wear+0.048
inton+0.046
 hardly+0.046
 personality+0.046
Neg logit contributions
 further-0.029
 again-0.029
 harder-0.029
 tighter-0.027
 worse-0.026
Token
Token ID198
Feature activation-2.93e-01

Pos logit contributions
kin+0.014
 wear+0.013
 toes+0.013
 personality+0.013
inton+0.013
Neg logit contributions
 again-0.012
 harder-0.012
 further-0.011
 tighter-0.011
 aback-0.011
Token
Token ID198
Feature activation-1.87e+00

Pos logit contributions
 again+0.006
 further+0.006
 harder+0.006
 tighter+0.006
 aback+0.006
Neg logit contributions
kin-0.007
 toes-0.007
 personality-0.006
 wear-0.006
inton-0.006
TokenBut
Token ID1537
Feature activation-2.62e-01

Pos logit contributions
 further+0.038
 harder+0.038
 tighter+0.037
 again+0.036
 worse+0.035
Neg logit contributions
kin-0.045
 wear-0.044
 toes-0.044
 personality-0.043
 hardly-0.043
Token then
Token ID788
Feature activation-2.05e+00

Pos logit contributions
 further+0.005
 tighter+0.004
 harder+0.004
 again+0.004
 cal+0.004
Neg logit contributions
kin-0.007
 wear-0.007
 toes-0.007
 hardly-0.007
inton-0.007
Token,
Token ID11
Feature activation-6.04e-01

Pos logit contributions
 further+0.051
 tighter+0.050
 harder+0.050
 aback+0.049
 again+0.049
Neg logit contributions
kin-0.040
 wear-0.038
 toes-0.037
 hardly-0.036
inton-0.036
Token a
Token ID257
Feature activation+8.13e-01

Pos logit contributions
 further+0.013
 harder+0.013
 tighter+0.013
 again+0.013
 cal+0.013
Neg logit contributions
kin-0.014
 wear-0.013
 hardly-0.013
 toes-0.012
inton-0.012
Token big
Token ID1263
Feature activation-1.00e+00

Pos logit contributions
kin+0.018
 wear+0.017
 personality+0.016
 hardly+0.016
apy+0.016
Neg logit contributions
 further-0.018
 tighter-0.018
 harder-0.018
 again-0.018
 aback-0.018
Token boy
Token ID2933
Feature activation-5.90e-01

Pos logit contributions
 further+0.022
 tighter+0.022
 aback+0.021
minded+0.021
 harder+0.021
Neg logit contributions
kin-0.023
 wear-0.022
 personality-0.021
 toes-0.020
 ago-0.020
Token came
Token ID1625
Feature activation-3.45e+00

Pos logit contributions
 further+0.012
 again+0.012
 harder+0.011
 cal+0.011
 tighter+0.011
Neg logit contributions
kin-0.015
 hardly-0.014
 personality-0.014
inton-0.014
 toes-0.014
Tokenmy
Token ID1820
Feature activation-1.12e+00

Pos logit contributions
 further+0.029
 farther+0.026
 harder+0.026
 worse+0.026
 tighter+0.025
Neg logit contributions
kin-0.043
 wear-0.042
inton-0.041
 toes-0.040
 personality-0.040
Token calmed
Token ID49566
Feature activation-2.89e-01

Pos logit contributions
 further+0.038
 farther+0.035
 Owl+0.035
wood+0.034
 nearer+0.034
Neg logit contributions
 wear-0.012
 hardly-0.012
kin-0.011
 Their-0.011
ana-0.011
Token down
Token ID866
Feature activation-2.21e+00

Pos logit contributions
 further+0.004
 harder+0.004
 tighter+0.004
 cal+0.004
 aback+0.004
Neg logit contributions
kin-0.009
 wear-0.009
 toes-0.008
 hardly-0.008
inton-0.008
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
 further+0.045
 aback+0.044
lim+0.044
 tighter+0.041
minded+0.041
Neg logit contributions
kin-0.052
 wear-0.051
 hardly-0.048
apy-0.047
 Their-0.047
Token said
Token ID531
Feature activation-4.21e-01

Pos logit contributions
lim+0.036
 Owl+0.036
 further+0.036
 aback+0.035
st+0.034
Neg logit contributions
 wear-0.023
 Their-0.021
apy-0.020
 classmates-0.019
kin-0.018
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 further+0.010
 again+0.010
 harder+0.010
 tighter+0.009
 worse+0.009
Neg logit contributions
kin-0.009
 wear-0.009
 hardly-0.008
inton-0.008
 toes-0.008
Token "
Token ID366
Feature activation-4.55e+00

Pos logit contributions
 again+0.034
 harder+0.033
 further+0.032
 tighter+0.032
 cal+0.031
Neg logit contributions
kin-0.031
 wear-0.029
inton-0.028
ippers-0.028
 toes-0.028
TokenI
Token ID40
Feature activation+1.20e+00

Pos logit contributions
 further+0.089
 harder+0.086
 tighter+0.086
 again+0.085
 cal+0.083
Neg logit contributions
kin-0.115
 wear-0.110
 hardly-0.107
inton-0.106
 toes-0.106
Token like
Token ID588
Feature activation+1.87e+00

Pos logit contributions
kin+0.020
 wear+0.018
 toes+0.017
 hardly+0.017
inton+0.017
Neg logit contributions
 further-0.034
 harder-0.034
 tighter-0.033
 again-0.033
 cal-0.033
Token puzzles
Token ID24367
Feature activation+1.59e+00

Pos logit contributions
kin+0.044
 wear+0.042
 hardly+0.040
 toes+0.040
inton+0.040
Neg logit contributions
 further-0.040
 harder-0.039
 tighter-0.038
 again-0.038
 cal-0.037
Token and
Token ID290
Feature activation+7.57e-01

Pos logit contributions
kin+0.038
 hardly+0.037
 wear+0.037
inton+0.037
 toes+0.036
Neg logit contributions
 further-0.032
 again-0.032
 harder-0.031
 tighter-0.031
 worse-0.030
Token wonder
Token ID4240
Feature activation+1.25e+00

Pos logit contributions
kin+0.009
ippers+0.008
inton+0.008
 wear+0.008
ilot+0.008
Neg logit contributions
 further-0.009
 again-0.009
 harder-0.009
 tighter-0.009
 cal-0.009
Token.
Token ID13
Feature activation+8.60e-01

Pos logit contributions
kin+0.034
inton+0.032
 hardly+0.031
apy+0.031
ippers+0.031
Neg logit contributions
 again-0.025
 further-0.024
 harder-0.023
 tighter-0.022
 cal-0.022
Token
Token ID198
Feature activation+6.83e-01

Pos logit contributions
kin+0.021
 wear+0.019
inton+0.018
 toes+0.018
ippers+0.018
Neg logit contributions
 again-0.019
 further-0.018
 harder-0.018
 aback-0.018
 cal-0.017
Token
Token ID198
Feature activation-6.05e-01

Pos logit contributions
kin+0.017
 toes+0.017
 personality+0.016
 wear+0.016
ippers+0.016
Neg logit contributions
 again-0.014
 further-0.014
 harder-0.013
 cal-0.013
 tighter-0.012
TokenThe
Token ID464
Feature activation-2.50e-01

Pos logit contributions
 further+0.012
 harder+0.012
 tighter+0.011
 again+0.011
 cal+0.011
Neg logit contributions
kin-0.015
 toes-0.015
 wear-0.015
 personality-0.014
ippers-0.014
Token tree
Token ID5509
Feature activation+5.56e-03

Pos logit contributions
 aback+0.005
 further+0.005
 tighter+0.005
 harder+0.005
 worse+0.005
Neg logit contributions
kin-0.006
 wear-0.006
 toes-0.006
 personality-0.006
 hardly-0.006
Token just
Token ID655
Feature activation+7.04e-01

Pos logit contributions
kin+0.000
 wear+0.000
 hardly+0.000
 toes+0.000
inton+0.000
Neg logit contributions
 further-0.000
 harder-0.000
 tighter-0.000
 aback-0.000
 worse-0.000
Token kept
Token ID4030
Feature activation-8.11e-01

Pos logit contributions
kin+0.016
 wear+0.015
 toes+0.015
 hardly+0.014
 personality+0.014
Neg logit contributions
 further-0.015
 tighter-0.015
 harder-0.015
 again-0.015
 worse-0.015
Token growing
Token ID3957
Feature activation-1.89e+00

Pos logit contributions
 further+0.018
 harder+0.018
 tighter+0.018
 again+0.017
 cal+0.017
Neg logit contributions
kin-0.018
 wear-0.017
 hardly-0.016
 toes-0.016
inton-0.016
Token,
Token ID11
Feature activation+6.29e-01

Pos logit contributions
lim+0.036
minded+0.036
 aback+0.036
 tighter+0.035
 Owl+0.035
Neg logit contributions
kin-0.043
 wear-0.041
 toes-0.041
 ago-0.038
 hardly-0.037
Token reaching
Token ID8978
Feature activation-7.03e-01

Pos logit contributions
kin+0.016
 wear+0.015
 hardly+0.015
inton+0.015
 toes+0.015
Neg logit contributions
 further-0.012
 harder-0.012
 tighter-0.012
 again-0.012
 cal-0.011
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
inton+0.047
kin+0.045
 hardly+0.045
ana+0.043
orses+0.043
Neg logit contributions
 again-0.041
 tighter-0.039
 further-0.038
 harder-0.037
 closer-0.036
Token hugged
Token ID41130
Feature activation+2.84e+00

Pos logit contributions
kin+0.028
inton+0.026
 wear+0.026
ippers+0.026
 hardly+0.026
Neg logit contributions
 further-0.025
 tighter-0.025
 harder-0.025
 again-0.025
 cal-0.024
Token them
Token ID606
Feature activation-6.66e-01

Pos logit contributions
kin+0.060
 wear+0.059
 toes+0.056
 hardly+0.056
 personality+0.056
Neg logit contributions
 further-0.065
 harder-0.064
 tighter-0.063
 cal-0.062
 worse-0.062
Token tight
Token ID5381
Feature activation-1.25e+00

Pos logit contributions
 again+0.013
 tighter+0.013
 further+0.013
 harder+0.013
 cal+0.012
Neg logit contributions
kin-0.017
 ago-0.017
inton-0.016
 wear-0.016
orses-0.016
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 further+0.029
 harder+0.028
 tighter+0.028
 again+0.028
 cal+0.027
Neg logit contributions
kin-0.027
 wear-0.026
 hardly-0.025
inton-0.025
 toes-0.025
Token Now
Token ID2735
Feature activation-5.82e-01

Pos logit contributions
kin+0.024
inton+0.022
 wear+0.022
 toes+0.022
 hardly+0.021
Neg logit contributions
 further-0.022
 again-0.022
 harder-0.022
 tighter-0.022
 aback-0.022
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
 further+0.014
 harder+0.013
 tighter+0.013
 again+0.013
 cal+0.013
Neg logit contributions
kin-0.012
 wear-0.012
 hardly-0.011
 toes-0.011
inton-0.011
Token she
Token ID673
Feature activation+3.89e-01

Pos logit contributions
kin+0.005
 wear+0.005
 hardly+0.004
 toes+0.004
inton+0.004
Neg logit contributions
 further-0.005
 harder-0.005
 tighter-0.005
 again-0.005
 cal-0.005
Token had
Token ID550
Feature activation-2.06e+00

Pos logit contributions
kin+0.009
inton+0.008
 wear+0.008
 toes+0.008
ippers+0.008
Neg logit contributions
 further-0.009
 again-0.009
 harder-0.008
 tighter-0.008
 cal-0.008
Token two
Token ID734
Feature activation+1.38e+00

Pos logit contributions
 further+0.033
 harder+0.032
 tighter+0.032
 again+0.031
 cal+0.030
Neg logit contributions
kin-0.059
 wear-0.057
 hardly-0.055
 toes-0.055
inton-0.055
Token things
Token ID1243
Feature activation+2.14e+00

Pos logit contributions
kin+0.033
 hardly+0.031
inton+0.030
 wear+0.030
ippers+0.030
Neg logit contributions
 further-0.030
 cal-0.029
 again-0.029
 tighter-0.029
 harder-0.028
Token out
Token ID503
Feature activation-3.95e-01

Pos logit contributions
minded+0.048
 aback+0.048
 cal+0.047
 tighter+0.045
 worse+0.044
Neg logit contributions
 toes-0.054
 hardly-0.052
 ago-0.050
inton-0.049
ilot-0.049
Token of
Token ID286
Feature activation+5.81e-01

Pos logit contributions
 further+0.011
 harder+0.010
 tighter+0.010
 again+0.010
 cal+0.010
Neg logit contributions
kin-0.007
 wear-0.007
inton-0.006
 hardly-0.006
 personality-0.006
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
kin+0.014
 wear+0.013
 toes+0.013
 ago+0.012
 hardly+0.012
Neg logit contributions
 further-0.012
 tighter-0.012
 harder-0.012
 cal-0.012
 again-0.012
Token ground
Token ID2323
Feature activation+1.13e-01

Pos logit contributions
minded+0.027
 aback+0.026
 cal+0.026
 further+0.025
 tighter+0.025
Neg logit contributions
 toes-0.038
 wear-0.038
kin-0.036
 ago-0.036
 hardly-0.035
Token.
Token ID13
Feature activation-9.30e-01

Pos logit contributions
kin+0.002
 wear+0.002
 toes+0.002
 ago+0.002
inton+0.002
Neg logit contributions
 further-0.003
 tighter-0.003
 harder-0.003
 cal-0.003
 aback-0.003
Token It
Token ID632
Feature activation-4.38e-01

Pos logit contributions
 harder+0.024
 again+0.024
 further+0.024
 aback+0.023
 tighter+0.023
Neg logit contributions
kin-0.018
 wear-0.016
ippers-0.015
 personality-0.015
orses-0.015
Token looked
Token ID3114
Feature activation-9.01e-01

Pos logit contributions
minded+0.014
 further+0.013
 aback+0.013
lim+0.012
 tighter+0.012
Neg logit contributions
 hardly-0.007
kin-0.007
apy-0.007
 wear-0.006
 lived-0.006
Token like
Token ID588
Feature activation+1.09e-01

Pos logit contributions
lim+0.021
 further+0.021
minded+0.021
 tighter+0.021
 again+0.021
Neg logit contributions
 wear-0.017
kin-0.017
 toes-0.017
 hardly-0.016
 personality-0.016
Token a
Token ID257
Feature activation+1.84e+00

Pos logit contributions
kin+0.003
 toes+0.003
 wear+0.003
 hardly+0.003
apy+0.003
Neg logit contributions
minded-0.002
 aback-0.002
 tighter-0.002
 further-0.002
 again-0.002
Token step
Token ID2239
Feature activation-1.06e+00

Pos logit contributions
kin+0.044
 wear+0.042
 hardly+0.040
 toes+0.040
 personality+0.040
Neg logit contributions
 further-0.038
 tighter-0.037
 aback-0.037
 harder-0.037
 again-0.036
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
 further+0.022
 harder+0.022
 tighter+0.022
 again+0.022
 worse+0.021
Neg logit contributions
kin-0.025
 wear-0.024
 hardly-0.023
inton-0.023
 toes-0.023
Token hugged
Token ID41130
Feature activation+2.34e+00

Pos logit contributions
 further+0.008
 tighter+0.008
 harder+0.008
 aback+0.008
 worse+0.008
Neg logit contributions
kin-0.007
 wear-0.007
 hardly-0.007
inton-0.006
 personality-0.006
Token mom
Token ID1995
Feature activation-2.81e+00

Pos logit contributions
 wear+0.040
kin+0.036
 personality+0.035
 toes+0.035
 classmates+0.035
Neg logit contributions
 further-0.064
 aback-0.062
lim-0.061
 cal-0.061
 farther-0.060
Token and
Token ID290
Feature activation+6.42e-01

Pos logit contributions
 further+0.059
 worse+0.058
 aback+0.056
lim+0.056
 nearer+0.055
Neg logit contributions
 wear-0.064
kin-0.061
 toes-0.059
 personality-0.058
ana-0.057
Token said
Token ID531
Feature activation+3.94e-01

Pos logit contributions
kin+0.012
 wear+0.012
 hardly+0.011
 toes+0.011
inton+0.011
Neg logit contributions
 further-0.017
 harder-0.016
 aback-0.016
 tighter-0.016
 worse-0.015
Token,
Token ID11
Feature activation-6.63e-01

Pos logit contributions
kin+0.008
inton+0.008
 toes+0.007
 hardly+0.007
 wear+0.007
Neg logit contributions
 again-0.010
 harder-0.010
 further-0.010
 worse-0.009
 cal-0.009
Token "
Token ID366
Feature activation-3.88e+00

Pos logit contributions
 again+0.016
 harder+0.016
 tighter+0.016
 further+0.015
 cal+0.015
Neg logit contributions
kin-0.014
inton-0.013
 toes-0.013
 wear-0.013
ippers-0.012
TokenThank
Token ID10449
Feature activation+8.53e-01

Pos logit contributions
 further+0.084
 harder+0.081
 worse+0.079
 again+0.079
 tighter+0.079
Neg logit contributions
kin-0.089
 wear-0.085
 hardly-0.083
 personality-0.082
inton-0.081
Token you
Token ID345
Feature activation-1.19e+00

Pos logit contributions
kin+0.020
 wear+0.019
 toes+0.018
 hardly+0.018
inton+0.018
Neg logit contributions
 further-0.018
 harder-0.018
 tighter-0.018
 aback-0.017
 again-0.017
Token,
Token ID11
Feature activation+4.24e-01

Pos logit contributions
 further+0.028
 tighter+0.027
 harder+0.027
 aback+0.026
 worse+0.026
Neg logit contributions
kin-0.025
 wear-0.024
 toes-0.023
 hardly-0.023
inton-0.023
Token mom
Token ID1995
Feature activation-1.45e+00

Pos logit contributions
kin+0.007
inton+0.006
 wear+0.006
 toes+0.006
 hardly+0.006
Neg logit contributions
 further-0.012
 harder-0.012
 again-0.012
 tighter-0.012
 cal-0.012
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 further+0.035
 tighter+0.033
 harder+0.033
 aback+0.033
 worse+0.032
Neg logit contributions
kin-0.030
 wear-0.029
 toes-0.027
inton-0.027
 hardly-0.027
Token fairy
Token ID25607
Feature activation+1.80e-01

Pos logit contributions
 tighter+0.022
 further+0.022
minded+0.022
 harder+0.022
 worse+0.022
Neg logit contributions
kin-0.025
 wear-0.024
 ago-0.024
 personality-0.024
 toes-0.024
Token.
Token ID13
Feature activation-5.27e-01

Pos logit contributions
kin+0.004
 wear+0.004
inton+0.004
 hardly+0.004
 toes+0.004
Neg logit contributions
 further-0.004
 again-0.004
 harder-0.004
 tighter-0.004
 cal-0.003
Token She
Token ID1375
Feature activation+5.32e-01

Pos logit contributions
 further+0.011
 again+0.011
 harder+0.011
 aback+0.011
 cal+0.011
Neg logit contributions
kin-0.012
inton-0.011
 wear-0.011
 toes-0.011
 hardly-0.011
Token said
Token ID531
Feature activation+3.90e-02

Pos logit contributions
kin+0.011
 wear+0.011
 hardly+0.010
 ago+0.010
 personality+0.010
Neg logit contributions
 further-0.013
 harder-0.012
 tighter-0.012
 aback-0.012
 worse-0.012
Token,
Token ID11
Feature activation-6.39e-01

Pos logit contributions
kin+0.001
 wear+0.001
 hardly+0.001
 toes+0.001
apy+0.001
Neg logit contributions
 tighter-0.001
 further-0.001
 harder-0.001
 aback-0.001
 farther-0.001
Token "
Token ID366
Feature activation-3.83e+00

Pos logit contributions
 further+0.017
 harder+0.017
 again+0.016
 tighter+0.016
 cal+0.016
Neg logit contributions
kin-0.012
 wear-0.011
inton-0.011
 toes-0.011
 hardly-0.011
TokenI
Token ID40
Feature activation+1.46e+00

Pos logit contributions
 harder+0.070
 worse+0.070
 further+0.068
 tighter+0.066
 cal+0.066
Neg logit contributions
 wear-0.096
kin-0.091
ana-0.089
 personality-0.088
 hardly-0.088
Token see
Token ID766
Feature activation-5.45e-01

Pos logit contributions
kin+0.028
 toes+0.026
 wear+0.026
inton+0.026
 hardly+0.025
Neg logit contributions
 further-0.037
 harder-0.036
 again-0.036
 tighter-0.035
 cal-0.035
Token you
Token ID345
Feature activation-2.42e-01

Pos logit contributions
 tighter+0.012
 further+0.011
 aback+0.011
 harder+0.011
 worse+0.011
Neg logit contributions
 wear-0.012
 toes-0.012
 ago-0.012
kin-0.011
 hardly-0.011
Token are
Token ID389
Feature activation+1.73e-01

Pos logit contributions
 further+0.005
 again+0.005
 harder+0.005
 tighter+0.005
 cal+0.005
Neg logit contributions
kin-0.006
inton-0.006
apy-0.006
 hardly-0.006
 wear-0.006
Token feeling
Token ID4203
Feature activation-2.47e-01

Pos logit contributions
kin+0.005
 wear+0.004
 hardly+0.004
 toes+0.004
inton+0.004
Neg logit contributions
 further-0.003
 harder-0.003
 tighter-0.003
 again-0.003
 cal-0.003
Token was
Token ID373
Feature activation-7.92e-02

Pos logit contributions
 further+0.010
 tighter+0.010
 aback+0.009
 harder+0.009
minded+0.009
Neg logit contributions
kin-0.008
 wear-0.008
 hardly-0.008
inton-0.008
 toes-0.007
Token too
Token ID1165
Feature activation-9.88e-01

Pos logit contributions
 further+0.002
 tighter+0.002
 harder+0.002
 again+0.002
 aback+0.001
Neg logit contributions
kin-0.002
 wear-0.002
 hardly-0.002
inton-0.002
 toes-0.002
Token fast
Token ID3049
Feature activation-2.03e+00

Pos logit contributions
 further+0.018
 harder+0.018
 tighter+0.017
 again+0.017
 cal+0.017
Neg logit contributions
kin-0.027
 wear-0.026
 personality-0.025
 toes-0.025
 hardly-0.025
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
 again+0.036
 further+0.035
 harder+0.035
 tighter+0.034
 worse+0.032
Neg logit contributions
kin-0.057
 hardly-0.053
 wear-0.053
apy-0.053
 toes-0.052
Token He
Token ID679
Feature activation-4.66e-01

Pos logit contributions
 harder+0.008
 again+0.008
 further+0.008
 tighter+0.008
 aback+0.008
Neg logit contributions
kin-0.008
 wear-0.008
 toes-0.007
inton-0.007
 personality-0.007
Token got
Token ID1392
Feature activation-5.06e+00

Pos logit contributions
 further+0.012
 harder+0.011
 aback+0.011
 tighter+0.011
 again+0.011
Neg logit contributions
kin-0.009
 wear-0.009
 hardly-0.009
inton-0.008
 toes-0.008
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
lim+0.106
st+0.103
minded+0.103
aut+0.097
 Owl+0.097
Neg logit contributions
 wear-0.110
apy-0.106
 Their-0.103
 hardly-0.102
 toes-0.101
Token his
Token ID465
Feature activation-1.14e+00

Pos logit contributions
 further+0.026
 tighter+0.026
 harder+0.026
 aback+0.025
 again+0.025
Neg logit contributions
 wear-0.022
kin-0.021
 hardly-0.020
inton-0.020
 toes-0.019
Token car
Token ID1097
Feature activation-3.13e+00

Pos logit contributions
 further+0.026
 harder+0.026
 tighter+0.025
 again+0.025
 aback+0.025
Neg logit contributions
kin-0.024
 wear-0.023
 toes-0.023
inton-0.022
 hardly-0.022
Token and
Token ID290
Feature activation+5.27e-01

Pos logit contributions
 further+0.060
 aback+0.059
 cal+0.059
lim+0.058
minded+0.057
Neg logit contributions
kin-0.074
 wear-0.073
 ago-0.068
 hardly-0.068
 toes-0.067
Token drove
Token ID10357
Feature activation-4.09e-01

Pos logit contributions
 wear+0.010
kin+0.010
 hardly+0.009
 toes+0.009
apy+0.009
Neg logit contributions
 further-0.013
lim-0.013
 harder-0.013
 aback-0.013
 tighter-0.013
Token As
Token ID1081
Feature activation-1.36e+00

Pos logit contributions
kin+0.014
 toes+0.013
orse+0.013
ilot+0.013
orses+0.013
Neg logit contributions
 again-0.010
 harder-0.010
 aback-0.009
 tighter-0.009
 further-0.009
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
 again+0.031
 harder+0.030
 further+0.030
 tighter+0.029
 worse+0.028
Neg logit contributions
kin-0.032
inton-0.031
ippers-0.031
ilot-0.031
 personality-0.030
Token treat
Token ID2190
Feature activation+1.01e+00

Pos logit contributions
kin+0.026
inton+0.025
 toes+0.025
ippers+0.025
 wear+0.025
Neg logit contributions
 harder-0.028
 further-0.028
 again-0.028
 tighter-0.028
 cal-0.027
Token,
Token ID11
Feature activation+9.37e-02

Pos logit contributions
inton+0.023
kin+0.023
orses+0.023
 personality+0.023
 toes+0.022
Neg logit contributions
 again-0.021
 further-0.021
 harder-0.021
 tighter-0.020
 worse-0.020
Token Lily
Token ID20037
Feature activation-8.60e-01

Pos logit contributions
kin+0.002
 wear+0.002
inton+0.002
 toes+0.002
 hardly+0.002
Neg logit contributions
 further-0.002
 harder-0.002
 again-0.002
 tighter-0.002
 worse-0.002
Token got
Token ID1392
Feature activation-4.46e+00

Pos logit contributions
 further+0.018
 tighter+0.017
 harder+0.017
 aback+0.017
 again+0.017
Neg logit contributions
kin-0.020
 wear-0.020
 hardly-0.019
 toes-0.019
inton-0.019
Token to
Token ID284
Feature activation+4.99e-03

Pos logit contributions
lim+0.092
 further+0.088
 Owl+0.088
 tighter+0.084
 farther+0.083
Neg logit contributions
 wear-0.108
 toes-0.096
apy-0.095
kin-0.094
 hardly-0.093
Token watch
Token ID2342
Feature activation+2.41e+00

Pos logit contributions
kin+0.000
ilot+0.000
inton+0.000
ippers+0.000
 ago+0.000
Neg logit contributions
 again-0.000
 harder-0.000
 cal-0.000
 further-0.000
 worse-0.000
Token one
Token ID530
Feature activation-1.57e+00

Pos logit contributions
kin+0.059
 wear+0.056
inton+0.055
 hardly+0.054
 toes+0.054
Neg logit contributions
 further-0.048
 harder-0.048
 again-0.048
 tighter-0.047
 worse-0.046
Token more
Token ID517
Feature activation+7.33e-01

Pos logit contributions
 further+0.032
 harder+0.031
 tighter+0.031
 again+0.031
 cal+0.030
Neg logit contributions
kin-0.038
 wear-0.036
 hardly-0.035
inton-0.035
 toes-0.035
Token cartoon
Token ID16251
Feature activation+1.72e+00

Pos logit contributions
kin+0.016
orse+0.015
orses+0.015
ippers+0.015
 ago+0.014
Neg logit contributions
 again-0.018
 harder-0.018
 worse-0.017
 further-0.017
 cal-0.017
Token tall
Token ID7331
Feature activation-7.43e-01

Pos logit contributions
 further+0.014
 harder+0.013
 tighter+0.013
 again+0.013
 cal+0.013
Neg logit contributions
kin-0.014
 wear-0.013
 hardly-0.013
 toes-0.013
inton-0.013
Token man
Token ID582
Feature activation-2.61e-01

Pos logit contributions
 further+0.013
 tighter+0.013
 harder+0.013
 aback+0.013
 worse+0.012
Neg logit contributions
kin-0.020
 wear-0.019
 toes-0.019
 personality-0.018
 hardly-0.018
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
 again+0.006
 further+0.005
 tighter+0.005
 harder+0.005
st+0.005
Neg logit contributions
kin-0.006
 hardly-0.006
apy-0.006
inton-0.006
 toes-0.006
Token He
Token ID679
Feature activation+1.14e+00

Pos logit contributions
 again+0.005
 harder+0.005
 Owl+0.005
 further+0.005
 aback+0.005
Neg logit contributions
kin-0.007
inton-0.006
ippers-0.006
 wear-0.006
 hardly-0.006
Token was
Token ID373
Feature activation+6.04e-01

Pos logit contributions
kin+0.020
ippers+0.020
 toes+0.020
inton+0.019
 personality+0.018
Neg logit contributions
 again-0.028
 harder-0.028
 further-0.028
 cal-0.027
 worse-0.027
Token getting
Token ID1972
Feature activation-4.75e+00

Pos logit contributions
kin+0.017
 wear+0.016
 hardly+0.016
 toes+0.016
inton+0.016
Neg logit contributions
 further-0.010
 harder-0.010
 tighter-0.009
 again-0.009
 cal-0.009
Token ready
Token ID3492
Feature activation-2.00e+00

Pos logit contributions
 aback+0.118
 iv+0.116
 again+0.116
lim+0.114
 cal+0.112
Neg logit contributions
 wear-0.087
 toes-0.078
apy-0.074
 hardly-0.074
kin-0.073
Token to
Token ID284
Feature activation-8.15e-01

Pos logit contributions
 further+0.030
 harder+0.029
 again+0.029
 tighter+0.029
 worse+0.027
Neg logit contributions
kin-0.060
 wear-0.056
inton-0.056
 hardly-0.055
 toes-0.055
Token take
Token ID1011
Feature activation-1.60e+00

Pos logit contributions
 further+0.018
 again+0.018
 harder+0.018
 cal+0.017
 worse+0.017
Neg logit contributions
kin-0.019
inton-0.017
ippers-0.017
 importantly-0.017
 personality-0.017
Token a
Token ID257
Feature activation-8.05e-01

Pos logit contributions
 tighter+0.030
 further+0.030
 harder+0.029
 worse+0.029
 cal+0.028
Neg logit contributions
 wear-0.041
kin-0.038
 toes-0.038
 hardly-0.038
 personality-0.037
Token bath
Token ID7837
Feature activation+9.69e-01

Pos logit contributions
 further+0.022
 harder+0.022
 tighter+0.022
 again+0.022
 aback+0.021
Neg logit contributions
kin-0.013
 wear-0.013
 hardly-0.012
 toes-0.012
inton-0.012