TOP ACTIVATIONS
MAX = 0.192

" A br ac ad ab ra !" L
" A br ac ad ab ra !" Nothing
" A br ac ad ab ra !" But nothing happened
and even toys ! Jo anna was so happy , she
wall reapp eared . Jo anna smiled as she went to
say " ab rac ad ab ra " all the time
alone . Cr ab saw Sara and Ben .
" A br ac ad ab ra !" He
and went home . Jo anna shouted as usual but no
" A br ac ad ab ra !" "
" A br ac ad ab ra !" Suddenly , the
" A br ac ad ab ra , I turn you
" A br ac ad ab ra , make a friend
really special inside ." Jo anna was very curious and couldn
in the sky and Jo anna said to her M ummy
it was time for Jo anna to go to bed .
a little girl named Jo anna . Jo anna lived in
" A br ac ad ab ra ! I 'm normal
funny and cute . Jo anna thought it was the best
John and Jo anna had a lot of fun

MIN ACTIVATIONS
MIN = -0.237

very scared and begged for mercy . He said , '
he learned an important moral lesson that day . He learned
your fault and ask for mercy . I will spare you
learned a very important moral lesson that day and never forgot
again , understanding the moral lesson that she 'd just taught
Tom cried and begged for mercy . They wished they had
Andy took this moral lesson to heart and promised to
simple story with a moral lesson showed the generosity and caring
little boy learnt an important lesson that day - stealing is
Butter i learnt an important lesson that day : that spicy
little girl learned an important lesson that day � � €
. It had a moral lesson : never explore something that
for teaching him this valuable lesson . He was so proud
that he learned an important lesson - don 't be so
They all cherished the moral lesson - to always share and
little girl learned an important lesson : never give up and
The boy learned an important lesson that day : weapons can
little girl learned an important lesson - be careful what you
the party learned an important lesson about helping others . <|endoftext|>
. Jane learned an important lesson to always listen to her

INTERVAL 0.085 - 0.192
CONTAINS 1.438%

the little bird and she began to comfort it . She
can find in the woods ." They walked deeper
are you there ?" Ben says . He is lonely .
what 's wrong ?" Lily asked . " Why can 't
is like a picture . See , I made it with

INTERVAL -0.022 - 0.085
CONTAINS 74.887%

like to play in the garden . One day , they
eat their lunch and enjoy it . They wash their hands
and covered it with some leaves . " There
borrow her pencil . Lily said yes and gave her friend
also learned to be safe and to ask for help when

INTERVAL -0.129 - -0.022
CONTAINS 23.555%

, " It means to write your name so everyone knows
and helping them when they need it ."
opened it and took out the things . S
fun at the beach ! <|endoftext|> Once upon a time there
slowly crept out of the bush , and looked around the

INTERVAL -0.237 - -0.129
CONTAINS 0.120%

. You learned a good lesson today . Sharing is caring
less gr umpy . The moral of the story is that
. At the end of the day , Jane
and cl apped ! The hard fire work looked beautiful in
yarn . She worked very hard . But , eventually she
Token "
Token ID366
Feature activation+2.15e-02

Pos logit contributions
 Mil+0.005
 imagine+0.005
Gu+0.005
 shades+0.005
nes+0.005
Neg logit contributions
oral-0.006
 narrative-0.006
der-0.005
oni-0.005
smart-0.005
TokenA
Token ID32
Feature activation+1.70e-02

Pos logit contributions
 Mil+0.035
 Mir+0.032
 N+0.032
 Ell+0.031
 Del+0.031
Neg logit contributions
 narrative-0.036
oral-0.033
 severe-0.026
smart-0.024
ery-0.024
Tokenbr
Token ID1671
Feature activation+6.18e-02

Pos logit contributions
 Mil+0.034
 Mir+0.032
 Del+0.032
 N+0.032
 But+0.032
Neg logit contributions
 narrative-0.019
oral-0.019
 severe-0.014
 obstacle-0.014
 problem-0.013
Tokenac
Token ID330
Feature activation+9.65e-03

Pos logit contributions
 Mil+0.102
 Mir+0.097
 But+0.093
 Del+0.093
 N+0.093
Neg logit contributions
 narrative-0.096
oral-0.091
 severe-0.077
 obstacle-0.071
smart-0.069
Tokenad
Token ID324
Feature activation-3.24e-02

Pos logit contributions
 Mil+0.014
 Mir+0.013
 Del+0.012
Gu+0.012
 N+0.012
Neg logit contributions
 narrative-0.017
oral-0.016
 severe-0.014
 obstacle-0.013
der-0.013
Tokenab
Token ID397
Feature activation+1.91e-01

Pos logit contributions
 narrative+0.064
oral+0.061
smart+0.053
 severe+0.052
 problem+0.051
Neg logit contributions
 Mil-0.039
 Mir-0.035
 N-0.034
 But-0.034
Gu-0.034
Tokenra
Token ID430
Feature activation+4.02e-02

Pos logit contributions
Gu+0.211
 Mil+0.206
 But+0.195
 Suddenly+0.192
 Mir+0.188
Neg logit contributions
oral-0.362
 narrative-0.362
smart-0.304
iness-0.303
 problem-0.300
Token!"
Token ID2474
Feature activation+7.15e-02

Pos logit contributions
 Mil+0.072
 Mir+0.068
 But+0.066
Gu+0.066
 Del+0.066
Neg logit contributions
 narrative-0.055
oral-0.052
 severe-0.042
smart-0.040
 problem-0.039
Token
Token ID198
Feature activation+2.06e-02

Pos logit contributions
uries+0.133
astical+0.132
nes+0.128
 shades+0.126
Gu+0.126
Neg logit contributions
oral-0.071
 narrative-0.068
ls-0.065
der-0.059
oni-0.057
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
 Mil+0.029
Gu+0.028
 Mir+0.027
 Del+0.027
 blonde+0.027
Neg logit contributions
oral-0.034
 narrative-0.033
 severe-0.027
smart-0.027
 story-0.026
TokenL
Token ID43
Feature activation+2.29e-02

Pos logit contributions
 Mil+0.021
 Mir+0.020
 Del+0.020
 Ell+0.020
 N+0.019
Neg logit contributions
oral-0.023
 narrative-0.023
 severe-0.018
in-0.018
smart-0.017
Token "
Token ID366
Feature activation+2.25e-02

Pos logit contributions
nes+0.005
Gu+0.005
 Clare+0.005
 Sony+0.005
 blonde+0.005
Neg logit contributions
oral-0.006
 narrative-0.005
der-0.005
oni-0.005
:-0.004
TokenA
Token ID32
Feature activation+1.57e-02

Pos logit contributions
 Mil+0.032
 Mir+0.030
 N+0.030
 Ell+0.029
 Del+0.028
Neg logit contributions
 narrative-0.040
oral-0.038
smart-0.030
 severe-0.030
 obstacle-0.029
Tokenbr
Token ID1671
Feature activation+6.44e-02

Pos logit contributions
 Mil+0.028
 Mir+0.027
 N+0.026
 Del+0.026
 But+0.026
Neg logit contributions
 narrative-0.022
oral-0.021
 severe-0.017
 obstacle-0.016
 problem-0.016
Tokenac
Token ID330
Feature activation+1.05e-02

Pos logit contributions
 Mil+0.101
 Mir+0.096
 Del+0.091
 But+0.089
 N+0.088
Neg logit contributions
 narrative-0.106
oral-0.100
 severe-0.092
 obstacle-0.083
 pistols-0.081
Tokenad
Token ID324
Feature activation-3.28e-02

Pos logit contributions
 Mil+0.013
 Mir+0.012
 Del+0.012
Gu+0.012
 sill+0.011
Neg logit contributions
 narrative-0.020
oral-0.019
 severe-0.017
 pistols-0.016
der-0.016
Tokenab
Token ID397
Feature activation+1.90e-01

Pos logit contributions
 narrative+0.066
oral+0.063
 severe+0.055
 obstacle+0.052
smart+0.052
Neg logit contributions
 Mil-0.039
 Mir-0.036
 N-0.034
 But-0.034
 Del-0.034
Tokenra
Token ID430
Feature activation+3.72e-02

Pos logit contributions
 Mil+0.196
Gu+0.190
 But+0.177
 N+0.175
 Mir+0.175
Neg logit contributions
 narrative-0.387
oral-0.384
smart-0.319
 severe-0.317
 problem-0.317
Token!"
Token ID2474
Feature activation+6.62e-02

Pos logit contributions
 Mil+0.062
 Mir+0.059
 But+0.057
 N+0.057
 Del+0.057
Neg logit contributions
 narrative-0.056
oral-0.054
 severe-0.044
 obstacle-0.041
smart-0.041
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
uries+0.091
rings+0.085
nes+0.082
acher+0.082
astical+0.082
Neg logit contributions
oral-0.106
 narrative-0.096
ls-0.090
der-0.088
:-0.087
Token
Token ID198
Feature activation+9.61e-03

Pos logit contributions
 Mil+0.022
 blonde+0.022
Gu+0.021
 blond+0.021
 N+0.021
Neg logit contributions
oral-0.027
 narrative-0.026
 severe-0.021
smart-0.021
 story-0.020
TokenNothing
Token ID18465
Feature activation-2.54e-02

Pos logit contributions
 Mil+0.018
 N+0.018
 Ell+0.018
 Just+0.018
 My+0.018
Neg logit contributions
oral-0.011
 narrative-0.011
 severe-0.007
smart-0.007
in-0.007
Token "
Token ID366
Feature activation+1.67e-03

Pos logit contributions
 narrative+0.018
oral+0.017
 severe+0.015
 obstacle+0.014
iness+0.014
Neg logit contributions
 Mil-0.011
 Mir-0.010
 But-0.010
 N-0.010
 Just-0.010
TokenA
Token ID32
Feature activation+1.07e-02

Pos logit contributions
 Mil+0.003
 Ell+0.003
 N+0.002
 imagining+0.002
 pretending+0.002
Neg logit contributions
 narrative-0.003
oral-0.003
 severe-0.002
 obstacle-0.002
smart-0.002
Tokenbr
Token ID1671
Feature activation+5.98e-02

Pos logit contributions
 Mil+0.017
 Mir+0.016
 But+0.016
Gu+0.016
 My+0.016
Neg logit contributions
 narrative-0.018
oral-0.016
 severe-0.014
iness-0.013
der-0.012
Tokenac
Token ID330
Feature activation+5.25e-03

Pos logit contributions
 Mil+0.089
 Mir+0.085
 Del+0.079
 But+0.077
 L+0.076
Neg logit contributions
 narrative-0.104
oral-0.097
 severe-0.092
 obstacle-0.083
 pistols-0.081
Tokenad
Token ID324
Feature activation-3.83e-02

Pos logit contributions
 Mil+0.006
 Mir+0.006
 Del+0.006
Gu+0.005
 sill+0.005
Neg logit contributions
 narrative-0.010
oral-0.010
 severe-0.009
 pistols-0.008
der-0.008
Tokenab
Token ID397
Feature activation+1.90e-01

Pos logit contributions
 narrative+0.078
oral+0.075
 severe+0.065
 obstacle+0.062
smart+0.060
Neg logit contributions
 Mil-0.045
 Mir-0.042
 But-0.040
 N-0.039
 Del-0.039
Tokenra
Token ID430
Feature activation+3.43e-02

Pos logit contributions
 Mil+0.184
Gu+0.177
 But+0.168
 Mir+0.167
 N+0.165
Neg logit contributions
 narrative-0.397
oral-0.392
 severe-0.334
smart-0.331
iness-0.326
Token!"
Token ID2474
Feature activation+6.83e-02

Pos logit contributions
 Mil+0.059
 Mir+0.056
 But+0.055
 N+0.054
 Del+0.054
Neg logit contributions
 narrative-0.050
oral-0.047
 severe-0.039
 obstacle-0.036
smart-0.035
Token But
Token ID887
Feature activation+6.17e-02

Pos logit contributions
 Mil+0.093
Gu+0.090
nes+0.085
 Del+0.085
 Mir+0.085
Neg logit contributions
oral-0.113
 narrative-0.113
 severe-0.090
der-0.089
smart-0.089
Token nothing
Token ID2147
Feature activation-6.44e-02

Pos logit contributions
Gu+0.092
 Mil+0.085
 Just+0.083
 But+0.081
 Pand+0.080
Neg logit contributions
 narrative-0.100
oral-0.091
 severe-0.076
der-0.076
 problem-0.076
Token happened
Token ID3022
Feature activation-1.53e-02

Pos logit contributions
 narrative+0.093
oral+0.088
 severe+0.071
 obstacle+0.066
smart+0.063
Neg logit contributions
 Mil-0.113
 Mir-0.108
 But-0.105
Gu-0.104
 Del-0.103
Token and
Token ID290
Feature activation+8.22e-03

Pos logit contributions
 narrative+0.062
oral+0.059
 severe+0.049
 obstacle+0.046
smart+0.045
Neg logit contributions
 Mil-0.061
 Mir-0.057
 But-0.055
 Del-0.055
 N-0.055
Token even
Token ID772
Feature activation-3.52e-03

Pos logit contributions
 Just+0.012
 Mil+0.012
Gu+0.012
 But+0.011
 My+0.011
Neg logit contributions
 narrative-0.013
oral-0.011
 severe-0.011
 problem-0.010
 obstacle-0.010
Token toys
Token ID14958
Feature activation+2.95e-02

Pos logit contributions
 narrative+0.006
oral+0.005
 severe+0.005
smart+0.005
 obstacle+0.005
Neg logit contributions
 Mil-0.005
Gu-0.005
 Just-0.005
 But-0.005
 Mir-0.005
Token!
Token ID0
Feature activation-2.58e-02

Pos logit contributions
 Mil+0.047
 Mir+0.044
 But+0.044
 N+0.043
 Del+0.043
Neg logit contributions
 narrative-0.048
oral-0.046
 severe-0.037
 obstacle-0.035
smart-0.033
Token Jo
Token ID5302
Feature activation+5.50e-02

Pos logit contributions
oral+0.044
 narrative+0.041
:+0.035
ay+0.035
 story+0.035
Neg logit contributions
Gu-0.032
 groom-0.028
 imagine-0.028
 Mil-0.028
 dipped-0.027
Tokenanna
Token ID7697
Feature activation+1.87e-01

Pos logit contributions
 Mil+0.060
Gu+0.056
 Mir+0.055
 But+0.054
 Del+0.052
Neg logit contributions
 narrative-0.109
oral-0.108
iness-0.095
 severe-0.095
ience-0.090
Token was
Token ID373
Feature activation+1.35e-02

Pos logit contributions
 Mil+0.317
Gu+0.311
 Mir+0.305
 Pand+0.295
 My+0.289
Neg logit contributions
 narrative-0.256
oral-0.247
 severe-0.207
iness-0.194
oni-0.191
Token so
Token ID523
Feature activation+4.30e-02

Pos logit contributions
Gu+0.024
 Mil+0.024
 Mir+0.023
 But+0.022
 My+0.022
Neg logit contributions
 narrative-0.016
 severe-0.014
oral-0.014
 problem-0.013
smart-0.012
Token happy
Token ID3772
Feature activation+4.48e-02

Pos logit contributions
Gu+0.054
 Mil+0.050
 My+0.050
 Mir+0.049
rings+0.048
Neg logit contributions
 narrative-0.079
oral-0.075
 severe-0.070
 hard-0.066
 bad-0.066
Token,
Token ID11
Feature activation+8.60e-05

Pos logit contributions
 Mil+0.082
Gu+0.078
 Mir+0.078
 Del+0.076
 N+0.075
Neg logit contributions
 narrative-0.055
oral-0.054
 severe-0.043
der-0.041
smart-0.040
Token she
Token ID673
Feature activation+1.11e-02

Pos logit contributions
Gu+0.000
 Mil+0.000
 Mir+0.000
 Just+0.000
 Del+0.000
Neg logit contributions
 narrative-0.000
oral-0.000
 severe-0.000
 story-0.000
iness-0.000
Token wall
Token ID3355
Feature activation-2.07e-02

Pos logit contributions
 narrative+0.019
oral+0.018
 severe+0.015
 obstacle+0.015
 problem+0.014
Neg logit contributions
 Mil-0.016
 Mir-0.015
Gu-0.014
 But-0.014
 Del-0.014
Token reapp
Token ID24578
Feature activation-2.10e-02

Pos logit contributions
 narrative+0.032
oral+0.030
 severe+0.025
 obstacle+0.024
smart+0.024
Neg logit contributions
 Mil-0.033
Gu-0.031
 Mir-0.031
 N-0.030
 But-0.030
Tokeneared
Token ID3380
Feature activation+5.64e-02

Pos logit contributions
oral+0.034
 narrative+0.034
smart+0.031
 obstacle+0.030
 severe+0.029
Neg logit contributions
 Mil-0.030
 Mir-0.029
 But-0.028
 N-0.026
 ask-0.026
Token.
Token ID13
Feature activation-3.65e-03

Pos logit contributions
 Mil+0.099
 Mir+0.094
 N+0.091
 But+0.091
 Del+0.091
Neg logit contributions
 narrative-0.081
oral-0.077
 severe-0.062
 obstacle-0.058
smart-0.056
Token Jo
Token ID5302
Feature activation+4.54e-02

Pos logit contributions
oral+0.005
 narrative+0.005
smart+0.004
:+0.004
 story+0.004
Neg logit contributions
 imagine-0.005
 dipped-0.005
Gu-0.005
 pretended-0.005
 pretend-0.005
Tokenanna
Token ID7697
Feature activation+1.87e-01

Pos logit contributions
Gu+0.073
 Mil+0.072
 Pand+0.071
 Mir+0.069
 But+0.067
Neg logit contributions
oral-0.058
 narrative-0.057
ience-0.052
 severe-0.050
smart-0.049
Token smiled
Token ID13541
Feature activation+2.71e-03

Pos logit contributions
 Mil+0.316
Gu+0.310
 Mir+0.304
 But+0.291
 My+0.290
Neg logit contributions
 narrative-0.257
oral-0.243
 severe-0.206
iness-0.193
ience-0.193
Token as
Token ID355
Feature activation+6.07e-02

Pos logit contributions
 Mil+0.005
 Mir+0.004
Gu+0.004
 But+0.004
 Del+0.004
Neg logit contributions
 narrative-0.004
oral-0.004
 severe-0.003
der-0.003
 obstacle-0.003
Token she
Token ID673
Feature activation+1.14e-02

Pos logit contributions
Gu+0.067
 Mil+0.064
 Just+0.060
 Mir+0.059
 Pand+0.059
Neg logit contributions
 narrative-0.119
oral-0.111
 severe-0.099
iness-0.096
smart-0.096
Token went
Token ID1816
Feature activation-3.16e-02

Pos logit contributions
 Mil+0.014
 Mir+0.013
 But+0.013
 N+0.013
 Del+0.012
Neg logit contributions
 narrative-0.022
oral-0.021
 severe-0.018
 obstacle-0.018
smart-0.017
Token to
Token ID284
Feature activation+1.66e-02

Pos logit contributions
 narrative+0.042
oral+0.040
 severe+0.034
smart+0.032
 obstacle+0.031
Neg logit contributions
 Mil-0.057
 Mir-0.054
Gu-0.053
 Del-0.052
 N-0.052
Token say
Token ID910
Feature activation-2.37e-02

Pos logit contributions
 Mil+0.003
Gu+0.003
 Mir+0.003
 N+0.003
 Del+0.003
Neg logit contributions
 narrative-0.006
oral-0.005
 obstacle-0.005
 severe-0.005
 mercy-0.005
Token "
Token ID366
Feature activation-2.51e-02

Pos logit contributions
 narrative+0.045
oral+0.043
 severe+0.037
 obstacle+0.035
iness+0.034
Neg logit contributions
 Mil-0.032
 Mir-0.030
 But-0.029
 Del-0.029
 N-0.029
Tokenab
Token ID397
Feature activation+1.58e-01

Pos logit contributions
 narrative+0.041
oral+0.040
ball+0.032
in+0.032
ery+0.032
Neg logit contributions
 Mil-0.037
 Mir-0.036
 ask-0.032
 Clara-0.031
 But-0.031
Tokenrac
Token ID11510
Feature activation-5.33e-02

Pos logit contributions
 Mil+0.236
 Mir+0.221
Gu+0.216
 But+0.215
 N+0.213
Neg logit contributions
 narrative-0.264
oral-0.255
 severe-0.212
 obstacle-0.201
smart-0.199
Tokenad
Token ID324
Feature activation-4.94e-02

Pos logit contributions
 narrative+0.077
oral+0.077
 severe+0.059
smart+0.056
 obstacle+0.055
Neg logit contributions
 Mil-0.095
 Mir-0.090
 But-0.087
 N-0.086
Gu-0.084
Tokenab
Token ID397
Feature activation+1.87e-01

Pos logit contributions
 narrative+0.105
oral+0.101
 severe+0.089
 obstacle+0.085
smart+0.084
Neg logit contributions
 Mil-0.053
 Mir-0.048
 But-0.046
 N-0.045
 Del-0.045
Tokenra
Token ID430
Feature activation+2.68e-02

Pos logit contributions
 Mil+0.197
Gu+0.192
 But+0.178
 Mir+0.178
 N+0.174
Neg logit contributions
 narrative-0.374
oral-0.374
 severe-0.311
oni-0.308
smart-0.307
Token"
Token ID1
Feature activation+4.71e-02

Pos logit contributions
 Mil+0.042
 Mir+0.040
 But+0.038
 N+0.038
 Del+0.038
Neg logit contributions
 narrative-0.043
oral-0.041
 severe-0.035
 obstacle-0.032
smart-0.032
Token all
Token ID477
Feature activation-3.44e-02

Pos logit contributions
 Mil+0.076
 Mir+0.071
 But+0.070
 N+0.069
 Del+0.069
Neg logit contributions
 narrative-0.075
oral-0.072
 severe-0.059
 obstacle-0.056
smart-0.054
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
 narrative+0.056
oral+0.053
smart+0.044
iness+0.044
 obstacle+0.043
Neg logit contributions
 Mil-0.053
 Mir-0.049
Gu-0.049
 Just-0.049
 N-0.048
Token time
Token ID640
Feature activation+4.15e-02

Pos logit contributions
 Mil+0.022
 Just+0.020
 My+0.020
 Suddenly+0.020
 Ph+0.020
Neg logit contributions
 narrative-0.017
 story-0.014
oral-0.013
 problem-0.013
 obstacle-0.012
Token alone
Token ID3436
Feature activation+6.77e-02

Pos logit contributions
 Mil+0.015
 But+0.014
Gu+0.014
 Mir+0.014
 Del+0.014
Neg logit contributions
 narrative-0.013
oral-0.012
 severe-0.012
smart-0.010
 mercy-0.010
Token.
Token ID13
Feature activation+3.05e-02

Pos logit contributions
 Mil+0.081
 But+0.079
 Mir+0.078
 N+0.077
 Just+0.075
Neg logit contributions
oral-0.132
 narrative-0.131
 severe-0.105
oni-0.105
 obstacle-0.102
Token
Token ID198
Feature activation+3.45e-02

Pos logit contributions
Gu+0.044
 imagine+0.042
 Mil+0.042
nes+0.042
rings+0.041
Neg logit contributions
 narrative-0.047
oral-0.045
 severe-0.035
der-0.033
 story-0.032
Token
Token ID198
Feature activation+2.76e-02

Pos logit contributions
 Mil+0.050
 Mir+0.048
 But+0.046
 N+0.045
 Del+0.045
Neg logit contributions
 narrative-0.061
oral-0.057
 severe-0.049
 obstacle-0.048
iness-0.047
TokenCr
Token ID13916
Feature activation-1.49e-02

Pos logit contributions
 Mil+0.044
 Mir+0.042
 Del+0.041
Gu+0.041
 Just+0.041
Neg logit contributions
 narrative-0.043
oral-0.043
 severe-0.036
oni-0.031
 obstacle-0.031
Tokenab
Token ID397
Feature activation+1.87e-01

Pos logit contributions
 narrative+0.025
oral+0.022
 severe+0.022
 mercy+0.021
smart+0.021
Neg logit contributions
 Mil-0.020
Gu-0.019
 Just-0.018
 Mir-0.018
 Del-0.018
Token saw
Token ID2497
Feature activation+4.48e-02

Pos logit contributions
 Mil+0.285
Gu+0.270
 Mir+0.269
 But+0.268
 N+0.261
Neg logit contributions
 narrative-0.295
oral-0.289
 severe-0.236
iness-0.222
oni-0.222
Token Sara
Token ID24799
Feature activation+6.87e-02

Pos logit contributions
 Mil+0.045
 Mir+0.041
Gu+0.041
 Just+0.041
 But+0.040
Neg logit contributions
 narrative-0.096
oral-0.093
 severe-0.081
 obstacle-0.077
oni-0.077
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
 Mil+0.102
 Mir+0.097
 But+0.093
 N+0.093
 Just+0.092
Neg logit contributions
 narrative-0.119
oral-0.114
 severe-0.093
 obstacle-0.090
smart-0.086
Token Ben
Token ID3932
Feature activation+5.32e-03

Pos logit contributions
Gu+0.019
 Mil+0.018
 Just+0.018
 But+0.018
 Del+0.017
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.037
oni-0.036
 pistols-0.035
Token.
Token ID13
Feature activation+2.47e-02

Pos logit contributions
 Mil+0.008
 Mir+0.008
 But+0.007
 N+0.007
 Del+0.007
Neg logit contributions
 narrative-0.009
oral-0.009
 severe-0.007
 obstacle-0.007
smart-0.007
Token "
Token ID366
Feature activation+1.47e-03

Pos logit contributions
 narrative+0.022
oral+0.022
der+0.019
 severe+0.019
smart+0.019
Neg logit contributions
 Mil-0.025
Gu-0.024
 Mir-0.023
nes-0.023
 Del-0.023
TokenA
Token ID32
Feature activation+3.78e-03

Pos logit contributions
 Mil+0.003
 pretending+0.003
 imagining+0.002
 Ell+0.002
 Sus+0.002
Neg logit contributions
oral-0.002
 narrative-0.002
in-0.001
 severe-0.001
 moral-0.001
Tokenbr
Token ID1671
Feature activation+5.39e-02

Pos logit contributions
 Mil+0.006
 Mir+0.006
 But+0.006
 N+0.006
 Del+0.006
Neg logit contributions
 narrative-0.006
oral-0.005
 severe-0.005
 obstacle-0.004
smart-0.004
Tokenac
Token ID330
Feature activation-1.70e-03

Pos logit contributions
 Mil+0.081
 Mir+0.077
 Del+0.073
 N+0.071
 But+0.070
Neg logit contributions
 narrative-0.092
oral-0.085
 severe-0.081
 obstacle-0.073
 pistols-0.072
Tokenad
Token ID324
Feature activation-3.96e-02

Pos logit contributions
 narrative+0.003
oral+0.003
 severe+0.003
 pistols+0.003
der+0.003
Neg logit contributions
 Mil-0.002
 Mir-0.002
 Del-0.002
Gu-0.002
 sill-0.002
Tokenab
Token ID397
Feature activation+1.86e-01

Pos logit contributions
 narrative+0.082
oral+0.079
 severe+0.068
smart+0.066
 obstacle+0.066
Neg logit contributions
 Mil-0.044
 Mir-0.040
 But-0.039
 N-0.039
 Del-0.039
Tokenra
Token ID430
Feature activation+3.18e-02

Pos logit contributions
 Mil+0.183
Gu+0.174
 But+0.166
 Mir+0.165
 Suddenly+0.161
Neg logit contributions
 narrative-0.386
oral-0.384
 severe-0.321
smart-0.319
oni-0.315
Token!"
Token ID2474
Feature activation+6.72e-02

Pos logit contributions
 Mil+0.054
 Mir+0.051
 But+0.050
 N+0.050
 Del+0.049
Neg logit contributions
 narrative-0.047
oral-0.045
 severe-0.037
 obstacle-0.034
smart-0.034
Token
Token ID198
Feature activation+2.52e-02

Pos logit contributions
 Mil+0.083
Gu+0.075
 Mir+0.075
 Del+0.073
 N+0.073
Neg logit contributions
 narrative-0.127
oral-0.126
 severe-0.107
smart-0.102
der-0.100
Token
Token ID198
Feature activation+1.99e-02

Pos logit contributions
 Mil+0.024
 Mir+0.023
 But+0.022
 N+0.021
 Del+0.020
Neg logit contributions
 narrative-0.057
oral-0.054
 severe-0.049
 obstacle-0.047
iness-0.046
TokenHe
Token ID1544
Feature activation-3.85e-02

Pos logit contributions
 Mil+0.028
 Mir+0.026
 N+0.026
 Del+0.026
 But+0.025
Neg logit contributions
 narrative-0.036
oral-0.035
 severe-0.029
smart-0.027
 obstacle-0.027
Token and
Token ID290
Feature activation-2.19e-02

Pos logit contributions
oral+0.058
 narrative+0.055
der+0.048
 severe+0.047
oni+0.044
Neg logit contributions
Gu-0.100
 Mil-0.098
 Mir-0.092
Whe-0.092
 Del-0.090
Token went
Token ID1816
Feature activation-4.86e-02

Pos logit contributions
 narrative+0.035
oral+0.031
 severe+0.028
 mercy+0.026
smart+0.026
Neg logit contributions
Gu-0.032
 Mil-0.032
 Mir-0.030
 My-0.030
 But-0.030
Token home
Token ID1363
Feature activation+1.35e-02

Pos logit contributions
 narrative+0.062
oral+0.060
 severe+0.049
smart+0.045
 obstacle+0.043
Neg logit contributions
 Mil-0.090
 Mir-0.086
Gu-0.084
 Del-0.083
 But-0.082
Token.
Token ID13
Feature activation-7.29e-03

Pos logit contributions
 Mil+0.023
Gu+0.023
 Mir+0.021
 My+0.021
 Del+0.021
Neg logit contributions
 narrative-0.018
oral-0.017
 severe-0.017
smart-0.015
 problem-0.014
Token Jo
Token ID5302
Feature activation+4.64e-02

Pos logit contributions
oral+0.009
 narrative+0.009
smart+0.007
 story+0.007
 severe+0.007
Neg logit contributions
 dipped-0.011
Gu-0.011
 Mil-0.011
 shades-0.011
 imagine-0.011
Tokenanna
Token ID7697
Feature activation+1.85e-01

Pos logit contributions
 Mil+0.079
Gu+0.077
 Mir+0.076
 Pand+0.075
 Del+0.072
Neg logit contributions
oral-0.056
 narrative-0.055
ience-0.050
 severe-0.049
 story-0.046
Token shouted
Token ID17293
Feature activation+9.74e-03

Pos logit contributions
 Mil+0.320
Gu+0.315
 Mir+0.307
 But+0.293
 My+0.292
Neg logit contributions
 narrative-0.243
oral-0.234
 severe-0.203
ience-0.186
iness-0.178
Token as
Token ID355
Feature activation+4.92e-02

Pos logit contributions
 Mil+0.018
Gu+0.017
 Mir+0.017
 Del+0.016
 Pand+0.016
Neg logit contributions
 narrative-0.012
oral-0.012
 severe-0.010
der-0.009
smart-0.009
Token usual
Token ID6678
Feature activation-1.99e-02

Pos logit contributions
Gu+0.055
 Mil+0.051
 Mir+0.047
 My+0.047
 Pand+0.047
Neg logit contributions
 narrative-0.099
oral-0.093
 severe-0.083
 obstacle-0.078
iness-0.078
Token but
Token ID475
Feature activation+1.24e-02

Pos logit contributions
 narrative+0.030
oral+0.028
 severe+0.024
smart+0.024
 obstacle+0.023
Neg logit contributions
 Mil-0.032
Gu-0.031
 Mir-0.029
 Del-0.029
 My-0.028
Token no
Token ID645
Feature activation-5.01e-02

Pos logit contributions
Gu+0.018
 Mil+0.016
 Pand+0.015
 Mir+0.015
 My+0.015
Neg logit contributions
 narrative-0.021
oral-0.019
 severe-0.017
 problem-0.016
iness-0.016
Token "
Token ID366
Feature activation+1.43e-02

Pos logit contributions
oral+0.007
der+0.006
 narrative+0.006
oni+0.006
:+0.006
Neg logit contributions
nes-0.008
 imagine-0.008
 seen-0.008
Gu-0.008
 sill-0.008
TokenA
Token ID32
Feature activation+1.15e-02

Pos logit contributions
 Mil+0.022
 Ell+0.021
 pretending+0.021
 Mir+0.021
 N+0.020
Neg logit contributions
 narrative-0.023
oral-0.021
smart-0.015
 severe-0.015
Ever-0.015
Tokenbr
Token ID1671
Feature activation+5.96e-02

Pos logit contributions
 Mil+0.023
 Mir+0.022
 Del+0.022
 N+0.022
 But+0.022
Neg logit contributions
 narrative-0.013
oral-0.012
 severe-0.009
 problem-0.009
 obstacle-0.009
Tokenac
Token ID330
Feature activation+5.82e-03

Pos logit contributions
 Mil+0.097
 Mir+0.092
 Del+0.088
 But+0.087
 N+0.087
Neg logit contributions
 narrative-0.094
oral-0.089
 severe-0.077
 obstacle-0.071
smart-0.069
Tokenad
Token ID324
Feature activation-3.63e-02

Pos logit contributions
 Mil+0.008
 Mir+0.008
 Del+0.008
Gu+0.008
 N+0.007
Neg logit contributions
 narrative-0.010
oral-0.010
 severe-0.008
 obstacle-0.008
der-0.008
Tokenab
Token ID397
Feature activation+1.85e-01

Pos logit contributions
 narrative+0.070
oral+0.068
smart+0.058
 severe+0.058
 problem+0.057
Neg logit contributions
 Mil-0.044
 Mir-0.040
 N-0.039
 But-0.039
 Del-0.039
Tokenra
Token ID430
Feature activation+3.39e-02

Pos logit contributions
Gu+0.212
 Mil+0.207
 But+0.195
 Suddenly+0.193
 Mir+0.189
Neg logit contributions
oral-0.345
 narrative-0.344
iness-0.289
smart-0.287
 story-0.284
Token!"
Token ID2474
Feature activation+6.23e-02

Pos logit contributions
 Mil+0.061
 Mir+0.058
 But+0.057
Gu+0.057
 Del+0.056
Neg logit contributions
 narrative-0.044
oral-0.042
 severe-0.034
smart-0.033
iness-0.032
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
uries+0.107
astical+0.099
Gu+0.095
nes+0.094
 blu+0.091
Neg logit contributions
ls-0.070
:-0.069
 on-0.069
oral-0.068
der-0.067
Token
Token ID198
Feature activation+6.06e-03

Pos logit contributions
 Mil+0.019
Gu+0.019
 blonde+0.019
 blond+0.018
 sill+0.018
Neg logit contributions
oral-0.019
 narrative-0.017
 severe-0.014
oni-0.014
 story-0.014
Token"
Token ID1
Feature activation+3.83e-02

Pos logit contributions
 Ell+0.012
 Mil+0.012
 pretending+0.012
 Mir+0.011
 Del+0.011
Neg logit contributions
oral-0.007
 narrative-0.006
in-0.005
 severe-0.005
Together-0.004
Token "
Token ID366
Feature activation-1.32e-02

Pos logit contributions
oral+0.054
 narrative+0.053
 severe+0.044
der+0.044
smart+0.043
Neg logit contributions
 Mil-0.058
 Mir-0.053
Gu-0.053
 Del-0.051
 N-0.051
TokenA
Token ID32
Feature activation+2.72e-03

Pos logit contributions
 narrative+0.022
oral+0.021
 severe+0.016
orn+0.016
oni+0.015
Neg logit contributions
 Mil-0.021
 Mir-0.020
 N-0.018
 pretending-0.018
 But-0.018
Tokenbr
Token ID1671
Feature activation+4.71e-02

Pos logit contributions
 Mil+0.005
 Mir+0.004
 But+0.004
 N+0.004
 Del+0.004
Neg logit contributions
 narrative-0.004
oral-0.004
 severe-0.003
 obstacle-0.003
 problem-0.003
Tokenac
Token ID330
Feature activation-1.71e-03

Pos logit contributions
 Mil+0.064
 Mir+0.058
 Del+0.055
 L+0.054
 N+0.052
Neg logit contributions
 narrative-0.088
 severe-0.083
oral-0.081
 pistols-0.074
 obstacle-0.072
Tokenad
Token ID324
Feature activation-4.61e-02

Pos logit contributions
 narrative+0.003
oral+0.003
 severe+0.003
 pistols+0.003
 obstacle+0.003
Neg logit contributions
 Mil-0.002
 Del-0.002
 Mir-0.002
Gu-0.002
 Just-0.002
Tokenab
Token ID397
Feature activation+1.84e-01

Pos logit contributions
 narrative+0.095
oral+0.091
 severe+0.080
 obstacle+0.076
smart+0.075
Neg logit contributions
 Mil-0.053
 Mir-0.048
 But-0.046
 N-0.046
 Del-0.046
Tokenra
Token ID430
Feature activation+3.12e-02

Pos logit contributions
 Mil+0.176
Gu+0.166
 Mir+0.162
 But+0.160
 N+0.153
Neg logit contributions
 narrative-0.390
oral-0.386
 severe-0.327
smart-0.323
 obstacle-0.321
Token!"
Token ID2474
Feature activation+5.01e-02

Pos logit contributions
 Mil+0.057
 Mir+0.054
 But+0.052
 N+0.052
 Del+0.052
Neg logit contributions
 narrative-0.043
oral-0.040
 severe-0.033
 obstacle-0.030
smart-0.030
Token Suddenly
Token ID24975
Feature activation+6.92e-02

Pos logit contributions
 Mil+0.059
 Mir+0.053
Gu+0.052
 Del+0.050
 N+0.048
Neg logit contributions
oral-0.093
 narrative-0.092
der-0.081
 severe-0.081
smart-0.079
Token,
Token ID11
Feature activation-6.71e-03

Pos logit contributions
 Mil+0.137
Gu+0.135
 Mir+0.128
 Just+0.128
 But+0.128
Neg logit contributions
 narrative-0.073
oral-0.068
 severe-0.058
 problem-0.053
smart-0.052
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
 narrative+0.012
oral+0.011
 severe+0.010
 obstacle+0.009
smart+0.009
Neg logit contributions
 Mil-0.009
Gu-0.008
 Just-0.008
 Mir-0.008
 But-0.008
Token "
Token ID366
Feature activation-1.31e-02

Pos logit contributions
 narrative+0.056
oral+0.056
der+0.046
 severe+0.046
smart+0.044
Neg logit contributions
 Mil-0.056
 Mir-0.052
Gu-0.052
 Del-0.050
 N-0.050
TokenA
Token ID32
Feature activation+7.10e-04

Pos logit contributions
 narrative+0.020
oral+0.018
in+0.013
ball+0.013
Ever+0.012
Neg logit contributions
 Mil-0.024
 Mir-0.023
 pretending-0.022
 imagining-0.022
 Ell-0.021
Tokenbr
Token ID1671
Feature activation+4.53e-02

Pos logit contributions
 Mil+0.001
 Mir+0.001
 But+0.001
 N+0.001
 Del+0.001
Neg logit contributions
 narrative-0.001
oral-0.001
 severe-0.001
 obstacle-0.001
smart-0.001
Tokenac
Token ID330
Feature activation-3.62e-03

Pos logit contributions
 Mil+0.061
 Mir+0.056
 Del+0.053
 L+0.052
 N+0.050
Neg logit contributions
 narrative-0.083
 severe-0.080
oral-0.077
 pistols-0.069
 obstacle-0.069
Tokenad
Token ID324
Feature activation-4.59e-02

Pos logit contributions
 narrative+0.007
oral+0.007
 severe+0.006
der+0.005
 pistols+0.005
Neg logit contributions
 Mil-0.005
 Del-0.004
 Mir-0.004
Gu-0.004
 N-0.004
Tokenab
Token ID397
Feature activation+1.84e-01

Pos logit contributions
 narrative+0.093
oral+0.090
 severe+0.079
 obstacle+0.075
smart+0.073
Neg logit contributions
 Mil-0.053
 Mir-0.049
 But-0.047
 N-0.047
 Del-0.047
Tokenra
Token ID430
Feature activation+2.91e-02

Pos logit contributions
 Mil+0.191
Gu+0.183
 But+0.175
 Mir+0.175
 Suddenly+0.169
Neg logit contributions
 narrative-0.373
oral-0.369
smart-0.309
 severe-0.306
iness-0.305
Token,
Token ID11
Feature activation-2.11e-03

Pos logit contributions
 Mil+0.052
 Mir+0.049
 But+0.048
 Del+0.047
 N+0.047
Neg logit contributions
 narrative-0.041
oral-0.039
 severe-0.031
 obstacle-0.029
smart-0.029
Token I
Token ID314
Feature activation+4.74e-02

Pos logit contributions
 narrative+0.004
oral+0.004
 severe+0.003
 obstacle+0.003
smart+0.003
Neg logit contributions
 Mil-0.003
 Mir-0.003
 But-0.002
Gu-0.002
 N-0.002
Token turn
Token ID1210
Feature activation+1.08e-02

Pos logit contributions
 Mil+0.070
 Mir+0.068
 Del+0.063
 N+0.062
Gu+0.062
Neg logit contributions
 narrative-0.079
oral-0.076
 severe-0.064
 problem-0.062
smart-0.062
Token you
Token ID345
Feature activation+3.30e-02

Pos logit contributions
 Mil+0.017
 Mir+0.016
 But+0.016
Gu+0.016
 Del+0.016
Neg logit contributions
 narrative-0.017
oral-0.016
 severe-0.014
 obstacle-0.013
smart-0.013
Token "
Token ID366
Feature activation-6.98e-03

Pos logit contributions
 narrative+0.039
oral+0.039
der+0.032
 severe+0.031
smart+0.030
Neg logit contributions
 Mil-0.041
 Mir-0.038
Gu-0.038
 Del-0.037
 N-0.037
TokenA
Token ID32
Feature activation+4.76e-03

Pos logit contributions
 narrative+0.011
oral+0.010
ball+0.007
Ever+0.007
ative+0.006
Neg logit contributions
 Mil-0.012
 Mir-0.011
 Ab-0.011
 Sus-0.011
 pretending-0.011
Tokenbr
Token ID1671
Feature activation+4.95e-02

Pos logit contributions
 Mil+0.008
 Mir+0.007
Gu+0.007
 N+0.007
 But+0.007
Neg logit contributions
oral-0.007
 narrative-0.007
 problem-0.006
 story-0.006
 obstacle-0.005
Tokenac
Token ID330
Feature activation-2.43e-03

Pos logit contributions
 Mil+0.072
 Mir+0.068
 Del+0.065
 N+0.063
 L+0.062
Neg logit contributions
 narrative-0.085
 severe-0.079
oral-0.079
 obstacle-0.070
 pistols-0.069
Tokenad
Token ID324
Feature activation-4.46e-02

Pos logit contributions
 narrative+0.004
oral+0.004
 severe+0.004
 pistols+0.004
der+0.004
Neg logit contributions
 Mil-0.003
 Mir-0.003
 Del-0.003
Gu-0.003
 N-0.003
Tokenab
Token ID397
Feature activation+1.84e-01

Pos logit contributions
 narrative+0.089
oral+0.085
 severe+0.073
smart+0.071
 obstacle+0.070
Neg logit contributions
 Mil-0.053
 Mir-0.049
 But-0.047
 Del-0.047
 N-0.047
Tokenra
Token ID430
Feature activation+2.97e-02

Pos logit contributions
Gu+0.205
 Mil+0.204
 But+0.191
 Suddenly+0.191
 Mir+0.187
Neg logit contributions
 narrative-0.347
oral-0.347
 story-0.287
smart-0.287
iness-0.285
Token,
Token ID11
Feature activation+4.54e-03

Pos logit contributions
 Mil+0.052
 Mir+0.049
 But+0.048
 Del+0.048
Gu+0.048
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.033
smart-0.030
 obstacle-0.030
Token make
Token ID787
Feature activation+7.27e-03

Pos logit contributions
 Mil+0.005
Gu+0.005
 Mir+0.005
 But+0.005
 Just+0.004
Neg logit contributions
 narrative-0.009
oral-0.009
 severe-0.008
smart-0.007
 obstacle-0.007
Token a
Token ID257
Feature activation+1.78e-02

Pos logit contributions
 But+0.010
 Mil+0.010
Gu+0.010
 Just+0.010
 Pand+0.009
Neg logit contributions
 narrative-0.011
oral-0.010
 story-0.010
 obstacle-0.010
 problem-0.010
Token friend
Token ID1545
Feature activation+3.78e-02

Pos logit contributions
 Mil+0.026
 Mir+0.024
 But+0.024
Gu+0.023
 N+0.023
Neg logit contributions
 narrative-0.031
oral-0.029
 severe-0.025
 obstacle-0.024
 problem-0.023
Token really
Token ID1107
Feature activation-1.89e-03

Pos logit contributions
 narrative+0.004
oral+0.004
 severe+0.004
 obstacle+0.003
 problem+0.003
Neg logit contributions
 Mil-0.004
 Mir-0.004
Gu-0.004
 Del-0.004
 But-0.004
Token special
Token ID2041
Feature activation+1.91e-02

Pos logit contributions
 narrative+0.003
oral+0.003
 severe+0.002
 obstacle+0.002
 problem+0.002
Neg logit contributions
 Mil-0.003
 Mir-0.003
Gu-0.003
 Del-0.003
 N-0.003
Token inside
Token ID2641
Feature activation+5.07e-02

Pos logit contributions
 Mil+0.028
 Mir+0.026
 But+0.025
 N+0.025
 Del+0.025
Neg logit contributions
 narrative-0.033
oral-0.032
 severe-0.027
 obstacle-0.025
smart-0.025
Token."
Token ID526
Feature activation+6.62e-02

Pos logit contributions
 Mil+0.070
 Mir+0.066
 N+0.064
 But+0.063
 Del+0.062
Neg logit contributions
 narrative-0.093
oral-0.089
 severe-0.075
 obstacle-0.071
smart-0.069
Token Jo
Token ID5302
Feature activation+4.31e-02

Pos logit contributions
Gu+0.104
 Mil+0.102
 imagine+0.097
 blu+0.097
nes+0.096
Neg logit contributions
oral-0.087
 narrative-0.079
 severe-0.064
 story-0.064
smart-0.064
Tokenanna
Token ID7697
Feature activation+1.83e-01

Pos logit contributions
Gu+0.068
 Mil+0.066
 Pand+0.064
Ready+0.062
 Mir+0.062
Neg logit contributions
oral-0.059
 narrative-0.055
ience-0.053
 severe-0.051
:-0.049
Token was
Token ID373
Feature activation+1.68e-02

Pos logit contributions
Gu+0.296
 Mil+0.274
 My+0.266
 Mir+0.263
Whe+0.261
Neg logit contributions
oral-0.246
 narrative-0.241
 severe-0.211
ience-0.209
der-0.202
Token very
Token ID845
Feature activation-1.05e-02

Pos logit contributions
Gu+0.029
 Mil+0.027
Ready+0.027
Space+0.027
rings+0.027
Neg logit contributions
 narrative-0.018
 severe-0.017
:-0.017
oral-0.016
 hard-0.016
Token curious
Token ID11040
Feature activation+6.79e-03

Pos logit contributions
 hard+0.015
 severe+0.015
 narrative+0.015
oral+0.015
 bad+0.015
Neg logit contributions
rings-0.015
Gu-0.014
Whe-0.014
 My-0.014
Space-0.013
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
 Mil+0.011
 Mir+0.010
 But+0.010
 N+0.010
 Del+0.010
Neg logit contributions
 narrative-0.011
oral-0.010
 severe-0.008
 obstacle-0.008
smart-0.008
Token couldn
Token ID3521
Feature activation-5.50e-02

Pos logit contributions
oral+0.018
 narrative+0.017
ience+0.016
 severe+0.016
 hard+0.015
Neg logit contributions
Whe-0.017
 Mil-0.017
Gu-0.016
 Pand-0.016
 Mir-0.016
Token in
Token ID287
Feature activation+1.25e-02

Pos logit contributions
 Mil+0.030
 But+0.030
 Mir+0.028
 N+0.028
 Ell+0.027
Neg logit contributions
 narrative-0.044
oral-0.044
 severe-0.036
 mercy-0.034
 obstacle-0.034
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
 Mil+0.018
 Mir+0.016
 But+0.016
Gu+0.016
 Just+0.016
Neg logit contributions
 narrative-0.022
oral-0.021
 severe-0.018
 obstacle-0.017
smart-0.017
Token sky
Token ID6766
Feature activation+5.43e-02

Pos logit contributions
 narrative+0.026
oral+0.025
 severe+0.022
 obstacle+0.021
 problem+0.020
Neg logit contributions
 Mil-0.019
 Mir-0.018
Gu-0.017
 Just-0.016
 Del-0.016
Token and
Token ID290
Feature activation-7.74e-03

Pos logit contributions
 Mil+0.112
Gu+0.109
 Mir+0.107
nes+0.102
 Pand+0.101
Neg logit contributions
 narrative-0.048
oral-0.048
der-0.039
 severe-0.037
iness-0.036
Token Jo
Token ID5302
Feature activation+5.04e-02

Pos logit contributions
 narrative+0.009
 all+0.008
oral+0.008
 problem+0.008
 on+0.007
Neg logit contributions
 Just-0.012
Gu-0.012
uries-0.012
 Pand-0.012
 My-0.012
Tokenanna
Token ID7697
Feature activation+1.83e-01

Pos logit contributions
 Mil+0.051
 Mir+0.047
 But+0.046
Gu+0.045
 Del+0.044
Neg logit contributions
 narrative-0.101
oral-0.101
 severe-0.090
iness-0.089
ience-0.085
Token said
Token ID531
Feature activation-6.13e-03

Pos logit contributions
 Mil+0.300
 Mir+0.288
Gu+0.285
 My+0.277
 But+0.276
Neg logit contributions
 narrative-0.266
oral-0.255
 severe-0.219
iness-0.202
 problem-0.195
Token to
Token ID284
Feature activation-3.37e-02

Pos logit contributions
 narrative+0.008
oral+0.008
 severe+0.006
 problem+0.006
smart+0.006
Neg logit contributions
 Mil-0.011
 Mir-0.010
Gu-0.010
 N-0.010
 Del-0.010
Token her
Token ID607
Feature activation+1.70e-02

Pos logit contributions
 narrative+0.060
oral+0.058
 severe+0.050
 story+0.048
 obstacle+0.048
Neg logit contributions
 Mil-0.041
Gu-0.040
 Pand-0.037
 Mir-0.037
 Just-0.037
Token M
Token ID337
Feature activation-2.81e-02

Pos logit contributions
 Mil+0.019
 Mir+0.018
 N+0.017
 Del+0.017
 But+0.016
Neg logit contributions
 narrative-0.037
oral-0.035
 severe-0.030
 obstacle-0.029
smart-0.029
Tokenummy
Token ID13513
Feature activation-7.81e-02

Pos logit contributions
 narrative+0.044
oral+0.041
 severe+0.035
 obstacle+0.033
smart+0.032
Neg logit contributions
 Mil-0.046
 Mir-0.043
 N-0.042
 But-0.042
 Del-0.042
Token it
Token ID340
Feature activation+1.78e-03

Pos logit contributions
Gu+0.008
 Just+0.008
 Mil+0.008
 My+0.008
 Pand+0.008
Neg logit contributions
 narrative-0.008
oral-0.008
 problem-0.006
 severe-0.006
 obstacle-0.006
Token was
Token ID373
Feature activation+7.49e-03

Pos logit contributions
 Mil+0.003
 Mir+0.003
 But+0.003
 N+0.003
Gu+0.003
Neg logit contributions
 narrative-0.003
oral-0.002
 severe-0.002
 obstacle-0.002
 problem-0.002
Token time
Token ID640
Feature activation+4.42e-02

Pos logit contributions
 Mil+0.012
 Mir+0.011
 But+0.011
 N+0.011
 Del+0.011
Neg logit contributions
 narrative-0.012
oral-0.011
 severe-0.009
 obstacle-0.009
smart-0.009
Token for
Token ID329
Feature activation-5.86e-02

Pos logit contributions
 Mil+0.067
 Mir+0.063
 But+0.062
 Just+0.061
 N+0.061
Neg logit contributions
 narrative-0.076
oral-0.072
 severe-0.061
 obstacle-0.058
smart-0.056
Token Jo
Token ID5302
Feature activation+4.68e-02

Pos logit contributions
 narrative+0.093
oral+0.091
 severe+0.079
 story+0.076
 mercy+0.074
Neg logit contributions
Gu-0.081
 Mil-0.079
 Just-0.079
 My-0.078
 But-0.077
Tokenanna
Token ID7697
Feature activation+1.81e-01

Pos logit contributions
 Mil+0.053
Gu+0.049
 But+0.049
 Mir+0.048
 Del+0.047
Neg logit contributions
oral-0.086
 narrative-0.083
 severe-0.080
iness-0.077
smart-0.076
Token to
Token ID284
Feature activation+2.58e-02

Pos logit contributions
 Mil+0.304
 Mir+0.287
Gu+0.279
 Del+0.274
 N+0.272
Neg logit contributions
 narrative-0.271
oral-0.259
 severe-0.226
smart-0.203
 problem-0.201
Token go
Token ID467
Feature activation-3.25e-02

Pos logit contributions
 Mil+0.035
 Mir+0.032
 But+0.031
Gu+0.031
 N+0.031
Neg logit contributions
 narrative-0.048
oral-0.046
 severe-0.039
 obstacle-0.037
smart-0.037
Token to
Token ID284
Feature activation+1.74e-02

Pos logit contributions
 narrative+0.057
oral+0.055
 severe+0.047
 obstacle+0.044
smart+0.044
Neg logit contributions
 Mil-0.047
 Mir-0.044
 But-0.042
 N-0.042
 Del-0.042
Token bed
Token ID3996
Feature activation-3.10e-02

Pos logit contributions
 Mil+0.023
Gu+0.022
 Mir+0.021
 Just+0.021
 But+0.021
Neg logit contributions
 narrative-0.031
oral-0.030
 severe-0.026
 obstacle-0.025
 story-0.024
Token.
Token ID13
Feature activation-6.22e-03

Pos logit contributions
 narrative+0.054
oral+0.052
 obstacle+0.042
 severe+0.042
 pistols+0.039
Neg logit contributions
 Mil-0.047
 N-0.044
 But-0.043
 Mir-0.043
 Just-0.042
Token a
Token ID257
Feature activation+2.22e-04

Pos logit contributions
 narrative+0.025
oral+0.023
 severe+0.020
ery+0.019
oni+0.019
Neg logit contributions
 Mil-0.026
 Mir-0.024
 N-0.024
Gu-0.024
 Del-0.024
Token little
Token ID1310
Feature activation+3.29e-02

Pos logit contributions
 Mil+0.000
 Mir+0.000
 N+0.000
 But+0.000
 Del+0.000
Neg logit contributions
 narrative-0.000
oral-0.000
 severe-0.000
 obstacle-0.000
smart-0.000
Token girl
Token ID2576
Feature activation+4.77e-02

Pos logit contributions
 Mil+0.039
 Mir+0.035
 But+0.034
 N+0.034
 Del+0.034
Neg logit contributions
 narrative-0.067
oral-0.064
 severe-0.056
 obstacle-0.053
smart-0.053
Token named
Token ID3706
Feature activation+9.95e-02

Pos logit contributions
 Mil+0.069
 Mir+0.064
Gu+0.064
 Del+0.063
 But+0.061
Neg logit contributions
 narrative-0.080
oral-0.076
 severe-0.065
smart-0.064
 obstacle-0.060
Token Jo
Token ID5302
Feature activation+4.04e-02

Pos logit contributions
 Mil+0.111
Gu+0.105
Whe+0.104
Whoever+0.100
 Mir+0.097
Neg logit contributions
 narrative-0.169
oral-0.165
 problem-0.147
 story-0.146
 hard-0.144
Tokenanna
Token ID7697
Feature activation+1.81e-01

Pos logit contributions
 Mil+0.077
Gu+0.075
 Del+0.073
 Mir+0.073
 But+0.072
Neg logit contributions
 narrative-0.044
oral-0.044
 severe-0.036
iness-0.033
smart-0.033
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 Mil+0.348
Gu+0.335
 Mir+0.332
 Del+0.319
 But+0.318
Neg logit contributions
 narrative-0.209
oral-0.201
 severe-0.163
smart-0.141
 problem-0.141
Token Jo
Token ID5302
Feature activation+4.44e-02

Pos logit contributions
 narrative+0.042
oral+0.041
 story+0.035
smart+0.034
iness+0.033
Neg logit contributions
 Mil-0.021
Gu-0.020
 My-0.019
 Mir-0.018
 Just-0.018
Tokenanna
Token ID7697
Feature activation+1.79e-01

Pos logit contributions
 Mil+0.052
 Mir+0.048
Gu+0.046
 Del+0.046
 But+0.046
Neg logit contributions
 narrative-0.088
oral-0.085
 severe-0.074
iness-0.070
 obstacle-0.070
Token lived
Token ID5615
Feature activation+3.74e-02

Pos logit contributions
 Mil+0.334
Gu+0.329
 Mir+0.325
 My+0.308
 Del+0.306
Neg logit contributions
 narrative-0.220
oral-0.205
 severe-0.159
iness-0.150
 story-0.149
Token in
Token ID287
Feature activation+2.19e-02

Pos logit contributions
 Mil+0.065
 Mir+0.061
 But+0.060
 Del+0.059
Gu+0.059
Neg logit contributions
 narrative-0.054
oral-0.051
 severe-0.042
smart-0.038
 obstacle-0.038
Token "
Token ID366
Feature activation-1.47e-02

Pos logit contributions
oral+0.065
 narrative+0.062
der+0.051
oni+0.049
smart+0.049
Neg logit contributions
 Mil-0.074
Gu-0.069
 Mir-0.068
nes-0.066
 N-0.066
TokenA
Token ID32
Feature activation+1.72e-03

Pos logit contributions
 narrative+0.023
oral+0.021
smart+0.015
 obstacle+0.015
Ever+0.015
Neg logit contributions
 Mil-0.024
 Mir-0.023
 ask-0.021
 pretending-0.021
 But-0.021
Tokenbr
Token ID1671
Feature activation+4.37e-02

Pos logit contributions
 Mil+0.003
 Mir+0.003
 N+0.003
 But+0.002
 Del+0.002
Neg logit contributions
 narrative-0.003
oral-0.003
 severe-0.002
 obstacle-0.002
smart-0.002
Tokenac
Token ID330
Feature activation-4.90e-03

Pos logit contributions
 Mil+0.064
 Mir+0.060
 Del+0.058
 N+0.056
 L+0.054
Neg logit contributions
 narrative-0.077
oral-0.072
 severe-0.070
 pistols-0.063
 obstacle-0.061
Tokenad
Token ID324
Feature activation-4.92e-02

Pos logit contributions
 narrative+0.009
oral+0.008
 severe+0.007
 obstacle+0.007
 pistols+0.007
Neg logit contributions
 Mil-0.007
 Mir-0.006
 Del-0.006
Gu-0.006
 N-0.006
Tokenab
Token ID397
Feature activation+1.81e-01

Pos logit contributions
 narrative+0.099
oral+0.096
 severe+0.082
smart+0.081
 obstacle+0.080
Neg logit contributions
 Mil-0.056
 Mir-0.052
 But-0.050
 N-0.049
Gu-0.049
Tokenra
Token ID430
Feature activation+2.81e-02

Pos logit contributions
 Mil+0.181
Gu+0.173
 Mir+0.167
 But+0.164
 Suddenly+0.158
Neg logit contributions
 narrative-0.371
oral-0.367
smart-0.305
 severe-0.305
 problem-0.302
Token!
Token ID0
Feature activation-3.13e-02

Pos logit contributions
 Mil+0.050
 Mir+0.048
 But+0.047
Gu+0.046
 Del+0.046
Neg logit contributions
 narrative-0.038
oral-0.037
 severe-0.029
smart-0.028
 obstacle-0.027
Token I
Token ID314
Feature activation+5.11e-02

Pos logit contributions
oral+0.056
 narrative+0.054
smart+0.048
ay+0.045
 story+0.044
Neg logit contributions
 Mil-0.036
 dipped-0.033
 shades-0.032
 seen-0.032
 imagining-0.032
Token'm
Token ID1101
Feature activation+4.20e-02

Pos logit contributions
 Mil+0.077
 Mir+0.074
 My+0.069
 N+0.068
Gu+0.068
Neg logit contributions
 narrative-0.081
oral-0.079
 severe-0.065
 problem-0.064
smart-0.064
Token normal
Token ID3487
Feature activation+2.31e-02

Pos logit contributions
ro+0.076
Gu+0.076
Space+0.075
 But+0.074
raid+0.073
Neg logit contributions
 all-0.037
 hard-0.036
 bad-0.035
 narrative-0.035
 on-0.035
Token funny
Token ID8258
Feature activation+1.48e-02

Pos logit contributions
 Mil+0.061
Gu+0.060
 Just+0.059
 My+0.059
Whe+0.058
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.034
 hard-0.033
 story-0.031
Token and
Token ID290
Feature activation+3.97e-03

Pos logit contributions
 Mil+0.029
Gu+0.028
 Mir+0.028
 Del+0.028
 N+0.027
Neg logit contributions
 narrative-0.016
oral-0.015
 severe-0.011
smart-0.011
der-0.011
Token cute
Token ID13779
Feature activation+1.99e-02

Pos logit contributions
 Mil+0.006
 Just+0.006
Gu+0.006
 Mir+0.006
 But+0.005
Neg logit contributions
 narrative-0.006
oral-0.005
 severe-0.005
smart-0.005
 obstacle-0.005
Token.
Token ID13
Feature activation-6.88e-03

Pos logit contributions
 Mil+0.036
 Mir+0.035
 But+0.034
 N+0.034
 Del+0.033
Neg logit contributions
 narrative-0.028
oral-0.027
 severe-0.021
 obstacle-0.020
smart-0.019
Token Jo
Token ID5302
Feature activation+4.73e-02

Pos logit contributions
oral+0.010
 narrative+0.009
:+0.009
 story+0.009
 all+0.008
Neg logit contributions
Gu-0.009
 groom-0.008
acher-0.008
nes-0.008
 Mil-0.008
Tokenanna
Token ID7697
Feature activation+1.80e-01

Pos logit contributions
 Mil+0.046
Gu+0.042
 Mir+0.041
 But+0.040
 Del+0.039
Neg logit contributions
 narrative-0.096
oral-0.096
 severe-0.086
ience-0.085
iness-0.084
Token thought
Token ID1807
Feature activation-6.85e-02

Pos logit contributions
 Mil+0.286
Gu+0.277
 Mir+0.274
 My+0.260
 Pand+0.258
Neg logit contributions
 narrative-0.273
oral-0.258
 severe-0.215
iness-0.205
der-0.205
Token it
Token ID340
Feature activation-4.12e-02

Pos logit contributions
 narrative+0.131
oral+0.126
 severe+0.109
 obstacle+0.105
smart+0.104
Neg logit contributions
 Mil-0.086
 Mir-0.079
Gu-0.078
 But-0.076
 N-0.076
Token was
Token ID373
Feature activation-2.70e-02

Pos logit contributions
 narrative+0.065
oral+0.061
 severe+0.051
 obstacle+0.048
smart+0.046
Neg logit contributions
 Mil-0.068
 Mir-0.064
 But-0.062
 N-0.062
 Del-0.062
Token the
Token ID262
Feature activation+6.11e-03

Pos logit contributions
 narrative+0.040
oral+0.037
 severe+0.032
 obstacle+0.031
 problem+0.030
Neg logit contributions
 Mil-0.043
 Mir-0.041
Gu-0.040
 Just-0.039
 But-0.039
Token best
Token ID1266
Feature activation-3.07e-02

Pos logit contributions
 Mil+0.010
 Mir+0.009
Gu+0.009
 Just+0.009
 N+0.009
Neg logit contributions
 narrative-0.010
oral-0.009
 problem-0.008
 obstacle-0.007
 story-0.007
Token
Token ID198
Feature activation-6.39e-03

Pos logit contributions
:+0.020
oral+0.018
 all+0.016
 barrel+0.014
der+0.014
Neg logit contributions
 groom-0.028
Flor-0.028
Gu-0.027
 Flor-0.026
 Mil-0.026
Token
Token ID198
Feature activation-1.14e-02

Pos logit contributions
oral+0.010
 narrative+0.009
 severe+0.008
 story+0.008
smart+0.008
Neg logit contributions
 Mil-0.010
Gu-0.009
 Pand-0.009
 N-0.009
 My-0.009
TokenJohn
Token ID7554
Feature activation+9.98e-03

Pos logit contributions
oral+0.015
 narrative+0.014
 severe+0.012
smart+0.011
:+0.010
Neg logit contributions
 Mil-0.020
 Flor-0.019
 Av-0.018
 N-0.018
 Mir-0.018
Token and
Token ID290
Feature activation-7.40e-04

Pos logit contributions
 Mil+0.015
Gu+0.014
 Mir+0.014
Whe+0.014
 My+0.014
Neg logit contributions
oral-0.014
 narrative-0.014
 severe-0.013
ience-0.013
oni-0.013
Token Jo
Token ID5302
Feature activation+4.66e-02

Pos logit contributions
oral+0.001
 narrative+0.001
smart+0.001
 story+0.001
iness+0.001
Neg logit contributions
 Just-0.001
Gu-0.001
 My-0.001
Ready-0.001
 blu-0.001
Tokenanna
Token ID7697
Feature activation+1.80e-01

Pos logit contributions
Gu+0.082
 Mil+0.079
 Mir+0.077
 My+0.076
 Watch+0.076
Neg logit contributions
 narrative-0.060
oral-0.059
iness-0.054
 severe-0.045
ience-0.044
Token had
Token ID550
Feature activation-4.87e-02

Pos logit contributions
 Mil+0.293
 Mir+0.278
Gu+0.272
 But+0.266
 Del+0.264
Neg logit contributions
 narrative-0.280
oral-0.262
 severe-0.221
 obstacle-0.202
iness-0.201
Token a
Token ID257
Feature activation+3.85e-03

Pos logit contributions
 narrative+0.080
oral+0.075
 severe+0.067
 problem+0.061
 pistols+0.060
Neg logit contributions
 Mil-0.073
 Mir-0.068
 But-0.065
 N-0.065
 My-0.065
Token lot
Token ID1256
Feature activation-6.12e-02

Pos logit contributions
 Mil+0.007
 Mir+0.006
 But+0.006
 Del+0.006
 N+0.006
Neg logit contributions
 narrative-0.005
oral-0.005
 severe-0.004
smart-0.004
iness-0.004
Token of
Token ID286
Feature activation+6.38e-02

Pos logit contributions
 narrative+0.095
oral+0.089
 severe+0.074
 obstacle+0.070
ience+0.066
Neg logit contributions
 Mil-0.102
 Mir-0.096
 But-0.094
 Del-0.093
 N-0.092
Token fun
Token ID1257
Feature activation+3.03e-02

Pos logit contributions
 Mil+0.082
 Mir+0.076
 But+0.074
 Del+0.073
 N+0.072
Neg logit contributions
 narrative-0.122
oral-0.117
 severe-0.102
 obstacle-0.096
smart-0.095
Token very
Token ID845
Feature activation-9.50e-03

Pos logit contributions
 Mil+0.016
 Mir+0.015
 But+0.015
Gu+0.015
 Del+0.015
Neg logit contributions
 narrative-0.018
oral-0.017
 severe-0.015
smart-0.014
 obstacle-0.014
Token scared
Token ID12008
Feature activation+2.76e-02

Pos logit contributions
 narrative+0.015
oral+0.014
 severe+0.013
 problem+0.012
smart+0.011
Neg logit contributions
 Mil-0.015
Gu-0.014
 Mir-0.014
 Del-0.014
 But-0.014
Token and
Token ID290
Feature activation+2.02e-03

Pos logit contributions
 Mil+0.043
 Mir+0.041
 But+0.041
 N+0.040
 Just+0.039
Neg logit contributions
 narrative-0.047
oral-0.045
 severe-0.037
 obstacle-0.034
oni-0.033
Token begged
Token ID32946
Feature activation+4.81e-02

Pos logit contributions
 Mil+0.003
 Mir+0.003
Gu+0.002
 Del+0.002
 But+0.002
Neg logit contributions
 narrative-0.004
oral-0.003
 severe-0.003
 obstacle-0.003
smart-0.003
Token for
Token ID329
Feature activation-8.16e-02

Pos logit contributions
 Mil+0.070
 Mir+0.065
 But+0.063
Gu+0.063
 N+0.063
Neg logit contributions
 narrative-0.084
oral-0.081
 severe-0.069
 obstacle-0.065
smart-0.064
Token mercy
Token ID17703
Feature activation-2.20e-01

Pos logit contributions
 narrative+0.142
oral+0.136
 severe+0.117
 obstacle+0.109
smart+0.108
Neg logit contributions
 Mil-0.119
 Mir-0.110
 But-0.108
Gu-0.107
 Del-0.107
Token.
Token ID13
Feature activation-9.60e-03

Pos logit contributions
 narrative+0.327
oral+0.310
 severe+0.255
 obstacle+0.238
smart+0.229
Neg logit contributions
 Mil-0.376
 Mir-0.356
 But-0.346
 N-0.345
 Del-0.342
Token He
Token ID679
Feature activation-5.49e-02

Pos logit contributions
oral+0.016
 narrative+0.015
oni+0.014
smart+0.013
der+0.013
Neg logit contributions
 shades-0.011
 imagine-0.011
 Mil-0.011
Gu-0.011
 blu-0.011
Token said
Token ID531
Feature activation+1.22e-02

Pos logit contributions
 narrative+0.112
oral+0.108
 severe+0.094
 obstacle+0.089
smart+0.088
Neg logit contributions
 Mil-0.064
 Mir-0.058
 But-0.056
 N-0.056
 Del-0.055
Token,
Token ID11
Feature activation-2.98e-02

Pos logit contributions
 Mil+0.017
 Mir+0.016
 But+0.015
 N+0.015
 Del+0.015
Neg logit contributions
 narrative-0.023
oral-0.022
 severe-0.018
 obstacle-0.018
iness-0.017
Token '
Token ID705
Feature activation+6.52e-03

Pos logit contributions
 narrative+0.057
oral+0.055
 severe+0.047
 obstacle+0.045
smart+0.044
Neg logit contributions
 Mil-0.038
 Mir-0.036
 But-0.035
 N-0.034
 Del-0.034
Token he
Token ID339
Feature activation-1.80e-02

Pos logit contributions
 narrative+0.036
oral+0.033
 severe+0.031
smart+0.029
 obstacle+0.028
Neg logit contributions
 Mil-0.029
Gu-0.029
 Mir-0.027
 Just-0.027
 But-0.026
Token learned
Token ID4499
Feature activation-7.44e-02

Pos logit contributions
 narrative+0.029
oral+0.028
 severe+0.024
ience+0.023
oni+0.022
Neg logit contributions
 Mil-0.026
Gu-0.025
 Mir-0.025
 But-0.024
 Del-0.024
Token an
Token ID281
Feature activation-8.05e-02

Pos logit contributions
 narrative+0.150
oral+0.142
iness+0.118
 severe+0.117
 obstacle+0.114
Neg logit contributions
 Mil-0.092
 But-0.088
 Mir-0.087
 N-0.086
 Del-0.083
Token important
Token ID1593
Feature activation-8.98e-02

Pos logit contributions
 narrative+0.142
oral+0.121
 severe+0.111
ls+0.106
smart+0.104
Neg logit contributions
 But-0.104
 ask-0.103
 Just-0.096
 N-0.094
 Ell-0.094
Token moral
Token ID6573
Feature activation-1.80e-01

Pos logit contributions
 narrative+0.116
oral+0.088
 severe+0.080
iness+0.078
smart+0.076
Neg logit contributions
 But-0.182
 Just-0.168
 Suddenly-0.161
 Mil-0.161
 Mir-0.159
Token lesson
Token ID11483
Feature activation-2.18e-01

Pos logit contributions
 narrative+0.312
oral+0.287
 severe+0.263
oni+0.249
iness+0.247
Neg logit contributions
 But-0.243
 Mil-0.228
 Just-0.223
 N-0.223
 Suddenly-0.220
Token that
Token ID326
Feature activation-7.72e-02

Pos logit contributions
 narrative+0.335
oral+0.300
 severe+0.253
 obstacle+0.237
ery+0.230
Neg logit contributions
 But-0.359
 Mil-0.346
 Just-0.343
 Mir-0.340
 N-0.334
Token day
Token ID1110
Feature activation-7.29e-02

Pos logit contributions
 narrative+0.141
oral+0.136
 severe+0.122
der+0.114
 mercy+0.113
Neg logit contributions
 Mil-0.097
 But-0.097
 Mir-0.093
 Del-0.090
 N-0.089
Token.
Token ID13
Feature activation-3.49e-02

Pos logit contributions
 narrative+0.132
oral+0.128
 severe+0.110
oni+0.105
 obstacle+0.105
Neg logit contributions
 But-0.100
 Just-0.098
 Mil-0.093
 N-0.092
 My-0.092
Token He
Token ID679
Feature activation-6.36e-02

Pos logit contributions
oral+0.060
 narrative+0.059
smart+0.049
 severe+0.046
 story+0.046
Neg logit contributions
 Mil-0.050
 Mir-0.044
 pretend-0.043
 pretended-0.043
 dipped-0.043
Token learned
Token ID4499
Feature activation-7.95e-02

Pos logit contributions
 narrative+0.104
oral+0.098
 severe+0.085
 obstacle+0.080
smart+0.078
Neg logit contributions
 Mil-0.096
 But-0.092
 Mir-0.091
 Just-0.090
 N-0.088
Token your
Token ID534
Feature activation+4.75e-02

Pos logit contributions
 narrative+0.067
oral+0.063
 severe+0.051
 obstacle+0.049
iness+0.047
Neg logit contributions
 Mil-0.067
 Mir-0.063
 But-0.063
 N-0.062
 My-0.061
Token fault
Token ID8046
Feature activation-1.04e-01

Pos logit contributions
 Mil+0.067
 Mir+0.065
 But+0.063
 N+0.062
 Del+0.061
Neg logit contributions
 narrative-0.084
oral-0.078
 severe-0.067
smart-0.064
 obstacle-0.063
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
 narrative+0.181
oral+0.176
 severe+0.148
 obstacle+0.141
smart+0.132
Neg logit contributions
 Mil-0.151
 But-0.148
 Mir-0.144
 Just-0.142
 N-0.140
Token ask
Token ID1265
Feature activation+8.54e-02

Pos logit contributions
 narrative+0.036
oral+0.035
 severe+0.031
 obstacle+0.029
smart+0.029
Neg logit contributions
 Mil-0.019
 Mir-0.018
 But-0.017
 N-0.017
 Del-0.016
Token for
Token ID329
Feature activation-8.60e-02

Pos logit contributions
 Mil+0.111
 Mir+0.102
Gu+0.100
 Del+0.098
 But+0.098
Neg logit contributions
 narrative-0.160
oral-0.154
 severe-0.133
 obstacle-0.126
smart-0.126
Token mercy
Token ID17703
Feature activation-2.17e-01

Pos logit contributions
 narrative+0.150
oral+0.145
 severe+0.127
 mercy+0.120
smart+0.118
Neg logit contributions
 Mil-0.124
 Mir-0.113
 But-0.111
 Del-0.111
Gu-0.110
Token.
Token ID13
Feature activation-4.85e-03

Pos logit contributions
 narrative+0.361
oral+0.342
 severe+0.291
 obstacle+0.273
oni+0.262
Neg logit contributions
 Mil-0.335
 Mir-0.317
 N-0.313
 But-0.313
 Just-0.302
Token I
Token ID314
Feature activation+4.58e-02

Pos logit contributions
 narrative+0.008
oral+0.008
smart+0.007
der+0.007
 severe+0.006
Neg logit contributions
 Mil-0.006
nes-0.006
 dipped-0.006
 shades-0.006
 dip-0.006
Token will
Token ID481
Feature activation-2.47e-03

Pos logit contributions
 Mil+0.086
 Mir+0.082
 Del+0.079
 N+0.078
Gu+0.078
Neg logit contributions
 narrative-0.061
oral-0.058
 severe-0.048
smart-0.042
 problem-0.042
Token spare
Token ID13952
Feature activation+6.39e-03

Pos logit contributions
 narrative+0.005
oral+0.004
 severe+0.004
 mercy+0.004
 story+0.004
Neg logit contributions
 Mil-0.003
 Mir-0.003
Gu-0.003
 Del-0.003
 N-0.003
Token you
Token ID345
Feature activation+4.94e-02

Pos logit contributions
 Mil+0.007
 Mir+0.006
 But+0.006
 N+0.006
 Del+0.006
Neg logit contributions
 narrative-0.014
oral-0.013
 severe-0.012
 obstacle-0.011
smart-0.011
Token learned
Token ID4499
Feature activation-6.73e-02

Pos logit contributions
 narrative+0.029
oral+0.028
 severe+0.024
 obstacle+0.023
smart+0.023
Neg logit contributions
 Mil-0.021
 Mir-0.020
 N-0.019
 But-0.019
 Del-0.019
Token a
Token ID257
Feature activation-4.52e-02

Pos logit contributions
 narrative+0.113
oral+0.107
iness+0.088
 severe+0.087
 obstacle+0.086
Neg logit contributions
 Mil-0.102
 But-0.099
 Mir-0.097
 N-0.097
 Just-0.095
Token very
Token ID845
Feature activation-7.03e-02

Pos logit contributions
 narrative+0.077
iness+0.061
ils+0.061
oral+0.059
wy+0.056
Neg logit contributions
 But-0.078
 Just-0.068
 Suddenly-0.066
 Mil-0.065
 Mir-0.064
Token important
Token ID1593
Feature activation-8.50e-02

Pos logit contributions
 narrative+0.099
oral+0.084
 pistols+0.074
 severe+0.072
iness+0.071
Neg logit contributions
 Mil-0.125
 But-0.125
 Just-0.117
 Mir-0.117
 N-0.115
Token moral
Token ID6573
Feature activation-1.78e-01

Pos logit contributions
 narrative+0.105
oral+0.080
smart+0.072
iness+0.070
 severe+0.070
Neg logit contributions
 But-0.175
 Just-0.163
 Mil-0.157
 Mir-0.155
 Suddenly-0.155
Token lesson
Token ID11483
Feature activation-2.14e-01

Pos logit contributions
 narrative+0.298
oral+0.273
 severe+0.244
 pistols+0.228
 obstacle+0.226
Neg logit contributions
 Mil-0.254
 But-0.254
 N-0.241
 Mir-0.240
 Del-0.238
Token that
Token ID326
Feature activation-6.91e-02

Pos logit contributions
 narrative+0.317
oral+0.286
 severe+0.240
 obstacle+0.229
ery+0.216
Neg logit contributions
 But-0.362
 Mil-0.359
 Mir-0.353
 Just-0.347
 N-0.341
Token day
Token ID1110
Feature activation-7.26e-02

Pos logit contributions
 narrative+0.120
oral+0.115
 severe+0.100
 obstacle+0.093
smart+0.092
Neg logit contributions
 Mil-0.097
 But-0.093
 Mir-0.092
 N-0.089
 Del-0.088
Token and
Token ID290
Feature activation-4.16e-02

Pos logit contributions
 narrative+0.121
oral+0.120
 obstacle+0.107
 severe+0.105
oni+0.105
Neg logit contributions
 But-0.112
 Just-0.109
 I-0.099
 My-0.096
 N-0.096
Token never
Token ID1239
Feature activation-5.91e-02

Pos logit contributions
 narrative+0.074
oral+0.071
 severe+0.060
 obstacle+0.057
smart+0.056
Neg logit contributions
 Mil-0.059
 Mir-0.055
Gu-0.053
 N-0.053
 Del-0.053
Token forgot
Token ID16453
Feature activation-9.62e-03

Pos logit contributions
 narrative+0.106
oral+0.099
 problem+0.088
 severe+0.087
 obstacle+0.086
Neg logit contributions
 Mil-0.081
 Mir-0.077
Gu-0.072
 Del-0.070
 N-0.069
Token again
Token ID757
Feature activation-8.03e-02

Pos logit contributions
 Mil+0.016
 Mir+0.015
 But+0.014
 N+0.014
 Del+0.014
Neg logit contributions
 narrative-0.015
oral-0.014
 severe-0.012
 pistols-0.012
oni-0.011
Token,
Token ID11
Feature activation-8.58e-03

Pos logit contributions
 narrative+0.138
oral+0.129
 severe+0.110
 obstacle+0.104
 pistols+0.100
Neg logit contributions
 Mil-0.121
 But-0.117
 Mir-0.116
 N-0.114
 Del-0.111
Token understanding
Token ID4547
Feature activation-8.94e-02

Pos logit contributions
 narrative+0.011
oral+0.011
 severe+0.009
smart+0.009
der+0.008
Neg logit contributions
 Mil-0.015
Gu-0.014
 Mir-0.014
 Del-0.014
 N-0.014
Token the
Token ID262
Feature activation-4.50e-02

Pos logit contributions
 narrative+0.175
oral+0.167
 severe+0.142
 obstacle+0.133
ience+0.131
Neg logit contributions
 But-0.111
 Mil-0.109
 N-0.109
 Mir-0.104
 L-0.104
Token moral
Token ID6573
Feature activation-1.75e-01

Pos logit contributions
 narrative+0.070
oral+0.067
 severe+0.056
 obstacle+0.052
smart+0.052
Neg logit contributions
 Mil-0.073
 Mir-0.069
 But-0.067
 N-0.067
 Del-0.066
Token lesson
Token ID11483
Feature activation-2.11e-01

Pos logit contributions
 narrative+0.285
oral+0.264
 severe+0.233
oni+0.224
 pistols+0.222
Neg logit contributions
 Mil-0.252
 But-0.250
 N-0.240
 Mir-0.237
 Del-0.234
Token that
Token ID326
Feature activation-7.72e-02

Pos logit contributions
 narrative+0.337
oral+0.306
 severe+0.257
 obstacle+0.247
ery+0.233
Neg logit contributions
 Mil-0.335
 But-0.333
 Mir-0.327
 N-0.318
 Just-0.317
Token she
Token ID673
Feature activation-3.52e-02

Pos logit contributions
 narrative+0.139
oral+0.133
 severe+0.115
 obstacle+0.108
smart+0.106
Neg logit contributions
 Mil-0.106
 Mir-0.100
 But-0.098
 N-0.096
 Del-0.095
Token'd
Token ID1549
Feature activation-2.83e-02

Pos logit contributions
 narrative+0.053
oral+0.052
 severe+0.044
 obstacle+0.043
smart+0.042
Neg logit contributions
 Mil-0.054
 But-0.054
 Just-0.053
 ask-0.051
 Mir-0.051
Token just
Token ID655
Feature activation+5.30e-02

Pos logit contributions
 narrative+0.057
oral+0.055
 severe+0.048
 obstacle+0.047
smart+0.046
Neg logit contributions
 Mil-0.032
 But-0.030
 Mir-0.029
 N-0.028
 Del-0.028
Token taught
Token ID7817
Feature activation-4.68e-03

Pos logit contributions
 Mil+0.070
 Mir+0.065
 But+0.062
 N+0.062
 Del+0.062
Neg logit contributions
 narrative-0.100
oral-0.096
 severe-0.082
 obstacle-0.078
smart-0.077
Token Tom
Token ID4186
Feature activation+4.10e-03

Pos logit contributions
 Mil+0.012
Gu+0.010
 Mir+0.010
 Del+0.010
 But+0.010
Neg logit contributions
 narrative-0.033
oral-0.031
 severe-0.028
oni-0.027
 mercy-0.027
Token cried
Token ID16896
Feature activation+2.82e-03

Pos logit contributions
 Mil+0.006
 peek+0.005
 Mir+0.005
 N+0.005
 But+0.005
Neg logit contributions
 narrative-0.008
oral-0.007
 obstacle-0.006
 severe-0.006
smart-0.006
Token and
Token ID290
Feature activation-8.98e-03

Pos logit contributions
 But+0.004
 Just+0.003
 L+0.003
 Suddenly+0.003
 Mir+0.003
Neg logit contributions
 narrative-0.006
oral-0.006
iflower-0.005
ience-0.005
 obstacle-0.005
Token begged
Token ID32946
Feature activation+5.31e-02

Pos logit contributions
 narrative+0.017
oral+0.017
 severe+0.015
 obstacle+0.014
smart+0.014
Neg logit contributions
 Mil-0.011
 Mir-0.010
 But-0.010
 N-0.010
 Del-0.010
Token for
Token ID329
Feature activation-7.54e-02

Pos logit contributions
 Mil+0.075
 Mir+0.072
 But+0.072
 N+0.070
 L+0.068
Neg logit contributions
 narrative-0.097
oral-0.090
 obstacle-0.080
ience-0.079
 severe-0.079
Token mercy
Token ID17703
Feature activation-2.10e-01

Pos logit contributions
 narrative+0.148
oral+0.142
 obstacle+0.118
 severe+0.118
iness+0.115
Neg logit contributions
 Mil-0.097
 Mir-0.093
 N-0.089
 But-0.086
 Del-0.085
Token.
Token ID13
Feature activation+4.64e-03

Pos logit contributions
 narrative+0.381
oral+0.359
 severe+0.311
 obstacle+0.295
ience+0.285
Neg logit contributions
 Mil-0.305
 Mir-0.288
 N-0.284
 But-0.283
 Just-0.271
Token They
Token ID1119
Feature activation-4.40e-03

Pos logit contributions
 Mil+0.006
Gu+0.006
 N+0.005
 Del+0.005
 blu+0.005
Neg logit contributions
 narrative-0.008
oral-0.008
 severe-0.007
der-0.007
oni-0.006
Token wished
Token ID16555
Feature activation-1.10e-02

Pos logit contributions
oral+0.010
 narrative+0.010
 obstacle+0.009
iflower+0.008
smart+0.008
Neg logit contributions
 Mil-0.004
 asked-0.004
 begged-0.004
 ask-0.004
 But-0.004
Token they
Token ID484
Feature activation-3.08e-02

Pos logit contributions
 narrative+0.019
oral+0.019
 severe+0.016
 obstacle+0.015
smart+0.015
Neg logit contributions
 Mil-0.016
 Mir-0.015
Gu-0.015
 Del-0.014
 But-0.014
Token had
Token ID550
Feature activation-7.55e-02

Pos logit contributions
oral+0.060
 narrative+0.060
 obstacle+0.055
oni+0.049
smart+0.048
Neg logit contributions
 But-0.035
 Mil-0.035
 Just-0.035
 asked-0.034
 Ell-0.034
Token
Token ID198
Feature activation-8.23e-03

Pos logit contributions
 narrative+0.006
oral+0.006
 severe+0.005
 obstacle+0.005
iness+0.005
Neg logit contributions
 Mil-0.004
 Mir-0.004
 But-0.004
 Del-0.004
 Just-0.004
TokenAndy
Token ID35314
Feature activation-3.38e-02

Pos logit contributions
 narrative+0.014
oral+0.013
 severe+0.012
 obstacle+0.011
iness+0.011
Neg logit contributions
 Mil-0.012
 Mir-0.011
 But-0.011
Gu-0.011
 Del-0.011
Token took
Token ID1718
Feature activation-6.37e-02

Pos logit contributions
 narrative+0.054
oral+0.052
 severe+0.043
 obstacle+0.040
ience+0.040
Neg logit contributions
 Mil-0.053
 Mir-0.050
Gu-0.049
 But-0.048
 Del-0.047
Token this
Token ID428
Feature activation-2.32e-02

Pos logit contributions
 narrative+0.109
oral+0.104
 severe+0.090
 obstacle+0.086
smart+0.085
Neg logit contributions
 Mil-0.093
 Mir-0.085
Gu-0.084
 N-0.083
 But-0.082
Token moral
Token ID6573
Feature activation-1.71e-01

Pos logit contributions
 narrative+0.043
oral+0.040
 severe+0.033
oni+0.031
der+0.031
Neg logit contributions
 Mil-0.031
 But-0.029
 Mir-0.029
 N-0.028
 Del-0.028
Token lesson
Token ID11483
Feature activation-2.09e-01

Pos logit contributions
 narrative+0.291
oral+0.266
 severe+0.230
oni+0.221
iness+0.219
Neg logit contributions
 But-0.242
 Mil-0.240
 Just-0.228
 Mir-0.227
 N-0.227
Token to
Token ID284
Feature activation-4.64e-02

Pos logit contributions
 narrative+0.313
oral+0.281
 severe+0.228
 obstacle+0.221
iness+0.207
Neg logit contributions
 Mil-0.357
 But-0.352
 Mir-0.347
 Just-0.342
 N-0.336
Token heart
Token ID2612
Feature activation-1.01e-01

Pos logit contributions
 narrative+0.093
oral+0.090
 severe+0.078
 obstacle+0.075
smart+0.073
Neg logit contributions
 Mil-0.055
 Mir-0.051
 But-0.050
 N-0.049
 Del-0.049
Token and
Token ID290
Feature activation-3.76e-02

Pos logit contributions
 narrative+0.162
oral+0.150
 severe+0.124
 obstacle+0.120
 problem+0.112
Neg logit contributions
 Mil-0.163
 Mir-0.154
 But-0.151
 N-0.149
 Just-0.147
Token promised
Token ID8072
Feature activation-6.61e-02

Pos logit contributions
 narrative+0.066
oral+0.062
 severe+0.053
 mercy+0.050
 obstacle+0.050
Neg logit contributions
 Mil-0.055
 Mir-0.050
Gu-0.050
 Del-0.048
 N-0.048
Token to
Token ID284
Feature activation-3.81e-02

Pos logit contributions
 narrative+0.099
oral+0.098
 severe+0.084
 mercy+0.077
der+0.076
Neg logit contributions
 Mil-0.106
Gu-0.102
 Mir-0.099
 Del-0.096
 N-0.095
Token simple
Token ID2829
Feature activation-1.27e-02

Pos logit contributions
 narrative+0.067
smart+0.062
 severe+0.060
oral+0.060
 pistols+0.055
Neg logit contributions
 Suddenly-0.060
 Mil-0.060
 Del-0.059
 peek-0.059
 But-0.058
Token story
Token ID1621
Feature activation-1.42e-01

Pos logit contributions
 narrative+0.020
oral+0.019
 severe+0.015
 obstacle+0.014
smart+0.014
Neg logit contributions
 Mil-0.021
 Mir-0.020
 But-0.019
 N-0.019
Gu-0.019
Token with
Token ID351
Feature activation+2.37e-02

Pos logit contributions
 narrative+0.184
oral+0.167
 severe+0.153
 pistols+0.143
 obstacle+0.137
Neg logit contributions
 Mil-0.248
 But-0.247
 Del-0.240
 Just-0.240
 Mir-0.240
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
 Mil+0.029
 Mir+0.027
 N+0.026
 Del+0.026
 But+0.026
Neg logit contributions
 narrative-0.047
oral-0.046
 severe-0.039
 obstacle-0.038
iness-0.036
Token moral
Token ID6573
Feature activation-1.72e-01

Pos logit contributions
 narrative+0.052
oral+0.048
ils+0.040
oni+0.040
der+0.040
Neg logit contributions
 Mil-0.040
 But-0.038
 Mir-0.038
 Del-0.037
 Suddenly-0.037
Token lesson
Token ID11483
Feature activation-2.09e-01

Pos logit contributions
 narrative+0.281
oral+0.256
 severe+0.231
oni+0.220
 pistols+0.217
Neg logit contributions
 But-0.252
 Mil-0.248
 Just-0.239
 Mir-0.237
 N-0.235
Token showed
Token ID3751
Feature activation+7.82e-04

Pos logit contributions
 narrative+0.289
oral+0.263
 severe+0.217
 obstacle+0.201
iness+0.187
Neg logit contributions
 Mil-0.378
 Mir-0.368
 But-0.367
 Just-0.357
 N-0.354
Token the
Token ID262
Feature activation-7.20e-02

Pos logit contributions
 Mil+0.001
 Mir+0.001
 But+0.001
 N+0.001
 Del+0.001
Neg logit contributions
 narrative-0.002
oral-0.001
 severe-0.001
 obstacle-0.001
iness-0.001
Token generosity
Token ID31569
Feature activation-1.44e-01

Pos logit contributions
 narrative+0.115
oral+0.110
 severe+0.093
smart+0.088
 obstacle+0.086
Neg logit contributions
 Mil-0.112
 Mir-0.106
 N-0.104
 Del-0.103
 But-0.102
Token and
Token ID290
Feature activation-4.31e-02

Pos logit contributions
 narrative+0.239
oral+0.222
 severe+0.193
 obstacle+0.176
 pistols+0.168
Neg logit contributions
 Mil-0.216
 But-0.213
 Mir-0.206
 N-0.198
 My-0.197
Token caring
Token ID18088
Feature activation-8.53e-02

Pos logit contributions
 narrative+0.072
oral+0.069
 severe+0.058
 obstacle+0.054
smart+0.053
Neg logit contributions
 Mil-0.065
 Mir-0.061
 But-0.060
 N-0.059
 Del-0.059
Token little
Token ID1310
Feature activation+3.91e-02

Pos logit contributions
 Mil+0.031
 Mir+0.029
 N+0.028
 But+0.028
 Del+0.028
Neg logit contributions
 narrative-0.037
oral-0.036
 severe-0.031
smart-0.029
 obstacle-0.029
Token boy
Token ID2933
Feature activation+1.12e-02

Pos logit contributions
 Mil+0.049
Gu+0.046
 Mir+0.045
 Just+0.044
 But+0.044
Neg logit contributions
 narrative-0.075
oral-0.070
 obstacle-0.061
 severe-0.060
 problem-0.059
Token learnt
Token ID26338
Feature activation-7.45e-02

Pos logit contributions
 Mil+0.016
 Mir+0.015
Gu+0.015
 Del+0.014
 N+0.014
Neg logit contributions
 narrative-0.020
oral-0.019
 severe-0.016
smart-0.015
 obstacle-0.015
Token an
Token ID281
Feature activation-8.57e-02

Pos logit contributions
 narrative+0.150
oral+0.143
 severe+0.118
 obstacle+0.116
iness+0.115
Neg logit contributions
 Mil-0.089
 But-0.085
 Mir-0.084
 N-0.082
 Just-0.081
Token important
Token ID1593
Feature activation-1.00e-01

Pos logit contributions
 narrative+0.148
oral+0.123
oni+0.112
 severe+0.108
iness+0.106
Neg logit contributions
 But-0.121
 ask-0.116
 Just-0.113
 Suddenly-0.110
 shades-0.104
Token lesson
Token ID11483
Feature activation-2.09e-01

Pos logit contributions
 narrative+0.131
oral+0.104
oni+0.091
iness+0.089
ils+0.089
Neg logit contributions
 But-0.202
 Just-0.186
 Suddenly-0.180
 ask-0.179
 peek-0.171
Token that
Token ID326
Feature activation-6.28e-02

Pos logit contributions
 narrative+0.346
oral+0.321
 severe+0.277
 obstacle+0.264
oni+0.252
Neg logit contributions
 But-0.323
 Mil-0.309
 Just-0.307
 Mir-0.300
 N-0.292
Token day
Token ID1110
Feature activation-6.74e-02

Pos logit contributions
 narrative+0.112
oral+0.109
 severe+0.096
oni+0.090
 obstacle+0.089
Neg logit contributions
 Mil-0.083
 But-0.082
 Mir-0.078
 Del-0.075
 N-0.075
Token -
Token ID532
Feature activation-9.80e-03

Pos logit contributions
 narrative+0.105
oral+0.100
 severe+0.083
 obstacle+0.079
oni+0.075
Neg logit contributions
 Mil-0.108
 But-0.105
 Just-0.103
 Mir-0.103
 N-0.102
Token stealing
Token ID16568
Feature activation-1.32e-01

Pos logit contributions
 narrative+0.018
oral+0.017
 severe+0.015
 obstacle+0.014
smart+0.014
Neg logit contributions
 Mil-0.014
 Mir-0.012
Gu-0.012
 Del-0.012
 N-0.012
Token is
Token ID318
Feature activation-3.35e-02

Pos logit contributions
 narrative+0.234
oral+0.234
 severe+0.190
 obstacle+0.175
oni+0.173
Neg logit contributions
 But-0.172
 Mil-0.172
 Mir-0.163
 N-0.161
 My-0.160
Token Butter
Token ID18971
Feature activation+4.38e-02

Pos logit contributions
 Mil+0.007
Gu+0.007
 Mir+0.006
 dipped+0.006
 shades+0.006
Neg logit contributions
oral-0.008
 narrative-0.008
smart-0.006
:-0.006
der-0.006
Tokeni
Token ID72
Feature activation+1.22e-02

Pos logit contributions
 Mil+0.070
 Mir+0.066
 But+0.064
Gu+0.064
 N+0.064
Neg logit contributions
 narrative-0.069
oral-0.066
 severe-0.055
 obstacle-0.051
smart-0.050
Token learnt
Token ID26338
Feature activation-7.10e-02

Pos logit contributions
 Mil+0.018
 Mir+0.017
 But+0.016
 N+0.016
 Del+0.016
Neg logit contributions
 narrative-0.021
oral-0.020
 severe-0.017
 obstacle-0.016
smart-0.016
Token an
Token ID281
Feature activation-8.62e-02

Pos logit contributions
 narrative+0.135
oral+0.128
 severe+0.105
 obstacle+0.103
iness+0.101
Neg logit contributions
 Mil-0.093
 But-0.089
 Mir-0.088
 N-0.087
 Just-0.085
Token important
Token ID1593
Feature activation-9.61e-02

Pos logit contributions
 narrative+0.131
oral+0.113
 severe+0.095
oni+0.089
smart+0.088
Neg logit contributions
 But-0.138
 Mil-0.137
 ask-0.133
 N-0.133
 Just-0.132
Token lesson
Token ID11483
Feature activation-2.07e-01

Pos logit contributions
 narrative+0.122
oral+0.097
 severe+0.080
oni+0.079
smart+0.076
Neg logit contributions
 But-0.199
 Just-0.183
 Mil-0.179
 Suddenly-0.179
 ask-0.176
Token that
Token ID326
Feature activation-5.78e-02

Pos logit contributions
 narrative+0.330
oral+0.309
 severe+0.256
 obstacle+0.246
oni+0.231
Neg logit contributions
 But-0.331
 Mil-0.327
 Just-0.316
 Mir-0.313
 N-0.310
Token day
Token ID1110
Feature activation-6.22e-02

Pos logit contributions
 narrative+0.100
oral+0.097
 severe+0.083
 obstacle+0.077
oni+0.077
Neg logit contributions
 Mil-0.082
 But-0.078
 Mir-0.078
 N-0.075
 Del-0.074
Token:
Token ID25
Feature activation-4.94e-02

Pos logit contributions
 narrative+0.095
oral+0.091
 severe+0.075
 obstacle+0.071
oni+0.069
Neg logit contributions
 Mil-0.101
 But-0.100
 Just-0.098
 N-0.096
 Mir-0.096
Token that
Token ID326
Feature activation-3.48e-02

Pos logit contributions
 narrative+0.092
oral+0.090
 severe+0.077
smart+0.076
 obstacle+0.073
Neg logit contributions
 Mil-0.063
 Mir-0.056
Gu-0.056
 Del-0.054
nes-0.052
Token spicy
Token ID26880
Feature activation-5.11e-02

Pos logit contributions
 narrative+0.063
oral+0.060
 severe+0.052
 obstacle+0.049
smart+0.048
Neg logit contributions
 Mil-0.048
 Mir-0.045
Gu-0.043
 Del-0.043
 N-0.043
Token little
Token ID1310
Feature activation+3.21e-02

Pos logit contributions
 narrative+0.039
oral+0.037
 severe+0.033
smart+0.031
 obstacle+0.031
Neg logit contributions
 Mil-0.024
 Mir-0.022
 N-0.022
 But-0.022
 Del-0.021
Token girl
Token ID2576
Feature activation+3.90e-02

Pos logit contributions
 Mil+0.046
 Mir+0.046
nes+0.044
 Just+0.044
 My+0.044
Neg logit contributions
 narrative-0.053
oral-0.048
 problem-0.044
 obstacle-0.042
 severe-0.039
Token learned
Token ID4499
Feature activation-7.72e-02

Pos logit contributions
 Mil+0.061
Gu+0.060
 Mir+0.058
 But+0.056
 Pand+0.055
Neg logit contributions
 narrative-0.058
oral-0.055
ience-0.046
 severe-0.045
 problem-0.043
Token an
Token ID281
Feature activation-8.05e-02

Pos logit contributions
 narrative+0.160
oral+0.154
 severe+0.129
iness+0.128
 obstacle+0.128
Neg logit contributions
 Mil-0.091
 N-0.085
 But-0.084
 Mir-0.084
 Del-0.080
Token important
Token ID1593
Feature activation-9.60e-02

Pos logit contributions
 narrative+0.144
oral+0.122
 severe+0.108
ls+0.106
smart+0.103
Neg logit contributions
 But-0.106
 ask-0.104
 Just-0.101
 N-0.100
 Del-0.100
Token lesson
Token ID11483
Feature activation-2.07e-01

Pos logit contributions
 narrative+0.122
oral+0.093
 severe+0.082
iness+0.078
smart+0.078
Neg logit contributions
 But-0.198
 Just-0.182
 Mil-0.180
 Suddenly-0.176
 Mir-0.175
Token that
Token ID326
Feature activation-5.90e-02

Pos logit contributions
 narrative+0.311
oral+0.293
 severe+0.247
 obstacle+0.231
ery+0.217
Neg logit contributions
 Mil-0.353
 But-0.342
 Mir-0.337
 N-0.332
 Just-0.326
Token day
Token ID1110
Feature activation-6.25e-02

Pos logit contributions
 narrative+0.101
oral+0.098
 severe+0.085
 obstacle+0.078
smart+0.077
Neg logit contributions
 Mil-0.085
 But-0.080
 Mir-0.080
 N-0.078
 Del-0.077
Token �
Token ID6184
Feature activation-3.68e-02

Pos logit contributions
 narrative+0.096
oral+0.093
 severe+0.078
 obstacle+0.074
oni+0.071
Neg logit contributions
 Mil-0.101
 But-0.101
 Just-0.098
 N-0.098
 Mir-0.095
Token�
Token ID95
Feature activation+1.86e-02

Pos logit contributions
 narrative+0.062
oral+0.060
 severe+0.050
 obstacle+0.047
smart+0.046
Neg logit contributions
 Mil-0.056
 Mir-0.053
 Del-0.050
 N-0.050
Gu-0.050
Token€
Token ID26391
Feature activation+2.32e-02

Pos logit contributions
 Mil+0.026
 Mir+0.024
 N+0.022
 Del+0.022
 Ell+0.022
Neg logit contributions
 narrative-0.035
oral-0.033
 severe-0.028
 obstacle-0.027
smart-0.027
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 narrative+0.076
oral+0.076
 severe+0.060
oni+0.054
 obstacle+0.054
Neg logit contributions
 Mil-0.061
 But-0.060
 Mir-0.058
 Just-0.057
 N-0.057
Token It
Token ID632
Feature activation+2.46e-02

Pos logit contributions
oral+0.030
 narrative+0.030
der+0.026
smart+0.025
 severe+0.025
Neg logit contributions
 Mil-0.017
Gu-0.015
 Mir-0.015
 imagine-0.015
 pretend-0.014
Token had
Token ID550
Feature activation-6.02e-02

Pos logit contributions
 Mil+0.036
 Mir+0.034
 But+0.033
 N+0.033
 Del+0.033
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.034
 obstacle-0.032
smart-0.031
Token a
Token ID257
Feature activation-5.24e-03

Pos logit contributions
 narrative+0.105
oral+0.102
 severe+0.092
 problem+0.083
 obstacle+0.083
Neg logit contributions
 Mil-0.086
 Mir-0.079
 N-0.076
 But-0.076
 Del-0.075
Token moral
Token ID6573
Feature activation-1.70e-01

Pos logit contributions
 narrative+0.009
oral+0.008
iness+0.006
oni+0.006
smart+0.006
Neg logit contributions
 Mil-0.008
 But-0.008
 Mir-0.008
 Suddenly-0.008
 Just-0.007
Token lesson
Token ID11483
Feature activation-2.06e-01

Pos logit contributions
 narrative+0.273
oral+0.252
 severe+0.213
oni+0.203
iness+0.199
Neg logit contributions
 Mil-0.260
 But-0.252
 Mir-0.248
 Del-0.242
 N-0.241
Token:
Token ID25
Feature activation-7.15e-02

Pos logit contributions
 narrative+0.280
oral+0.254
 severe+0.211
 obstacle+0.192
iness+0.186
Neg logit contributions
 Mil-0.371
 But-0.364
 Mir-0.363
 Just-0.356
 Del-0.349
Token never
Token ID1239
Feature activation-7.11e-02

Pos logit contributions
oral+0.124
 narrative+0.121
smart+0.102
 severe+0.097
 story+0.097
Neg logit contributions
 Mil-0.103
Gu-0.092
 Mir-0.092
Whe-0.092
 N-0.087
Token explore
Token ID7301
Feature activation+1.04e-02

Pos logit contributions
 narrative+0.129
oral+0.122
 severe+0.108
 obstacle+0.106
 problem+0.105
Neg logit contributions
 Mil-0.097
 Mir-0.090
 N-0.085
Gu-0.085
 Del-0.084
Token something
Token ID1223
Feature activation-3.19e-03

Pos logit contributions
 Mil+0.012
 But+0.012
 Mir+0.012
 N+0.011
 Ell+0.011
Neg logit contributions
 narrative-0.021
oral-0.021
 severe-0.017
oni-0.017
iness-0.016
Token that
Token ID326
Feature activation-3.84e-03

Pos logit contributions
 narrative+0.005
oral+0.005
 severe+0.004
oni+0.004
 obstacle+0.004
Neg logit contributions
 Mil-0.005
 But-0.004
 Mir-0.004
 N-0.004
 Just-0.004
Token for
Token ID329
Feature activation-8.34e-02

Pos logit contributions
 narrative+0.048
oral+0.043
 severe+0.036
 obstacle+0.034
 pistols+0.032
Neg logit contributions
 Mil-0.043
 N-0.043
 But-0.042
 Just-0.041
 Mir-0.041
Token teaching
Token ID7743
Feature activation-4.43e-02

Pos logit contributions
 narrative+0.162
oral+0.159
 obstacle+0.130
 severe+0.129
ience+0.129
Neg logit contributions
 Mil-0.105
 Mir-0.100
 N-0.097
 But-0.094
 Del-0.094
Token him
Token ID683
Feature activation-4.26e-02

Pos logit contributions
 narrative+0.081
oral+0.077
 severe+0.066
 obstacle+0.063
smart+0.061
Neg logit contributions
 Mil-0.061
 Mir-0.057
 But-0.055
 N-0.055
 Del-0.054
Token this
Token ID428
Feature activation-4.25e-02

Pos logit contributions
 narrative+0.077
oral+0.071
 severe+0.058
 obstacle+0.055
 problem+0.054
Neg logit contributions
 Mil-0.063
 Mir-0.058
 N-0.058
 But-0.057
 Del-0.057
Token valuable
Token ID8119
Feature activation-1.32e-01

Pos logit contributions
 narrative+0.073
oral+0.065
 severe+0.059
oni+0.054
iness+0.053
Neg logit contributions
 But-0.059
 Mil-0.057
 Just-0.057
 Del-0.054
 peek-0.054
Token lesson
Token ID11483
Feature activation-2.06e-01

Pos logit contributions
 narrative+0.169
oral+0.136
 severe+0.121
smart+0.104
oni+0.103
Neg logit contributions
 But-0.257
 Mil-0.242
 Just-0.238
 Mir-0.234
 Ell-0.233
Token.
Token ID13
Feature activation-3.36e-02

Pos logit contributions
 narrative+0.362
oral+0.349
 severe+0.300
 obstacle+0.283
 problem+0.261
Neg logit contributions
 Mil-0.289
 But-0.287
 N-0.276
 Just-0.272
 Mir-0.269
Token He
Token ID679
Feature activation-6.51e-02

Pos logit contributions
oral+0.053
 narrative+0.052
smart+0.043
der+0.040
 severe+0.040
Neg logit contributions
 Mil-0.050
Gu-0.047
 Mir-0.045
 imagine-0.044
nes-0.044
Token was
Token ID373
Feature activation-1.08e-02

Pos logit contributions
 narrative+0.123
oral+0.116
 severe+0.101
 obstacle+0.097
iness+0.094
Neg logit contributions
 Mil-0.085
 Mir-0.079
 But-0.079
 N-0.078
 Just-0.077
Token so
Token ID523
Feature activation+2.07e-02

Pos logit contributions
 narrative+0.017
oral+0.016
 severe+0.014
smart+0.013
 obstacle+0.013
Neg logit contributions
 Mil-0.017
 Mir-0.016
Gu-0.015
 But-0.015
 N-0.015
Token proud
Token ID6613
Feature activation-8.40e-03

Pos logit contributions
 Mil+0.032
 Mir+0.031
Gu+0.029
 But+0.029
 Del+0.029
Neg logit contributions
 narrative-0.033
oral-0.032
 severe-0.029
smart-0.026
 obstacle-0.024
Token that
Token ID326
Feature activation-3.73e-02

Pos logit contributions
 Mil+0.063
 Mir+0.060
Gu+0.060
 Del+0.058
 N+0.057
Neg logit contributions
 narrative-0.039
oral-0.038
 severe-0.030
 problem-0.029
der-0.029
Token he
Token ID339
Feature activation-2.94e-02

Pos logit contributions
 narrative+0.071
oral+0.068
 severe+0.059
 obstacle+0.056
smart+0.055
Neg logit contributions
 Mil-0.048
 Mir-0.045
Gu-0.044
 But-0.042
 Del-0.042
Token learned
Token ID4499
Feature activation-7.95e-02

Pos logit contributions
 narrative+0.044
oral+0.043
 severe+0.038
ience+0.037
 obstacle+0.035
Neg logit contributions
 Mil-0.044
 Mir-0.043
Gu-0.042
 Del-0.040
 But-0.040
Token an
Token ID281
Feature activation-8.30e-02

Pos logit contributions
 narrative+0.168
oral+0.161
 severe+0.131
iness+0.131
 obstacle+0.127
Neg logit contributions
 Mil-0.090
 But-0.087
 N-0.086
 Mir-0.083
 Just-0.081
Token important
Token ID1593
Feature activation-9.69e-02

Pos logit contributions
 narrative+0.151
oral+0.125
ls+0.118
es+0.113
 severe+0.113
Neg logit contributions
 ask-0.104
 But-0.103
 Just-0.097
 Del-0.093
 Suddenly-0.093
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.118
oral+0.092
 severe+0.076
smart+0.074
iness+0.073
Neg logit contributions
 But-0.200
 Just-0.189
 Mil-0.185
 Suddenly-0.183
 Mir-0.182
Token -
Token ID532
Feature activation-1.42e-02

Pos logit contributions
 narrative+0.331
oral+0.312
 severe+0.262
 obstacle+0.244
ery+0.231
Neg logit contributions
 But-0.331
 Mil-0.327
 Just-0.315
 Mir-0.313
 N-0.309
Token don
Token ID836
Feature activation-4.66e-02

Pos logit contributions
 narrative+0.024
oral+0.023
 severe+0.020
smart+0.019
 obstacle+0.019
Neg logit contributions
 Mil-0.021
 Mir-0.019
Gu-0.019
 Del-0.018
 N-0.018
Token't
Token ID470
Feature activation-2.46e-02

Pos logit contributions
 narrative+0.124
oral+0.115
ils+0.104
 severe+0.103
iness+0.102
Neg logit contributions
 But-0.026
 Just-0.025
 My-0.022
 Mil-0.022
 Mir-0.020
Token be
Token ID307
Feature activation-2.94e-02

Pos logit contributions
 narrative+0.048
oral+0.046
 severe+0.041
 obstacle+0.039
 problem+0.038
Neg logit contributions
 Mil-0.030
 Mir-0.028
Gu-0.027
 Del-0.026
 N-0.026
Token so
Token ID523
Feature activation-1.40e-03

Pos logit contributions
 narrative+0.053
oral+0.050
 severe+0.043
 obstacle+0.041
smart+0.040
Neg logit contributions
 Mil-0.042
 Mir-0.039
 But-0.038
 N-0.037
 Del-0.037
Token They
Token ID1119
Feature activation-1.67e-02

Pos logit contributions
 narrative+0.020
oral+0.019
 severe+0.017
 obstacle+0.017
 problem+0.016
Neg logit contributions
 Mil-0.009
 Mir-0.008
 But-0.008
 N-0.008
 Del-0.007
Token all
Token ID477
Feature activation-3.08e-02

Pos logit contributions
oral+0.029
 narrative+0.028
 severe+0.026
 obstacle+0.025
smart+0.024
Neg logit contributions
 Mil-0.023
 Ell-0.022
 Del-0.021
 Mir-0.021
 asked-0.021
Token cherished
Token ID41238
Feature activation+7.17e-04

Pos logit contributions
 narrative+0.053
oral+0.052
 severe+0.044
 obstacle+0.042
 pistols+0.041
Neg logit contributions
 Mil-0.046
 Mir-0.042
 N-0.042
 But-0.041
 Del-0.041
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
 But+0.001
 Mil+0.001
 Mir+0.001
 Ell+0.001
 N+0.001
Neg logit contributions
oral-0.001
 narrative-0.001
 severe-0.001
 obstacle-0.001
iness-0.001
Token moral
Token ID6573
Feature activation-1.66e-01

Pos logit contributions
 narrative+0.029
oral+0.028
 severe+0.023
smart+0.022
der+0.022
Neg logit contributions
 Mil-0.023
 But-0.022
 Mir-0.022
 N-0.022
Gu-0.021
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.274
oral+0.253
 severe+0.228
oni+0.214
 pistols+0.213
Neg logit contributions
 But-0.238
 Mil-0.228
 Just-0.222
 Suddenly-0.218
 Del-0.218
Token -
Token ID532
Feature activation-3.06e-02

Pos logit contributions
 narrative+0.342
oral+0.309
 severe+0.276
ery+0.260
 obstacle+0.255
Neg logit contributions
 But-0.306
 Just-0.289
 Mir-0.284
 Mil-0.282
 N-0.274
Token to
Token ID284
Feature activation-2.04e-02

Pos logit contributions
 narrative+0.050
oral+0.049
 severe+0.040
smart+0.038
 obstacle+0.038
Neg logit contributions
 Mil-0.047
 Mir-0.044
Gu-0.043
 N-0.042
 Del-0.042
Token always
Token ID1464
Feature activation+1.58e-02

Pos logit contributions
 narrative+0.042
oral+0.041
 severe+0.036
 obstacle+0.035
smart+0.034
Neg logit contributions
 Mil-0.022
 Mir-0.021
 But-0.020
 N-0.020
 Del-0.020
Token share
Token ID2648
Feature activation-3.66e-02

Pos logit contributions
 Mil+0.015
 Mir+0.014
 But+0.013
 N+0.013
 Del+0.013
Neg logit contributions
 narrative-0.035
oral-0.034
 severe-0.030
 obstacle-0.029
smart-0.028
Token and
Token ID290
Feature activation-3.02e-02

Pos logit contributions
 narrative+0.069
oral+0.068
 obstacle+0.055
 severe+0.055
iness+0.054
Neg logit contributions
 But-0.046
 Mil-0.046
 Mir-0.044
 N-0.043
 ask-0.042
Token little
Token ID1310
Feature activation+2.73e-02

Pos logit contributions
 narrative+0.046
oral+0.044
smart+0.043
eds+0.041
 severe+0.041
Neg logit contributions
 Clara-0.033
 N-0.032
 Clare-0.032
 Mil-0.032
 But-0.031
Token girl
Token ID2576
Feature activation+3.69e-02

Pos logit contributions
 Mil+0.031
 Mir+0.029
 Del+0.027
 But+0.027
Gu+0.027
Neg logit contributions
 narrative-0.057
oral-0.054
 severe-0.047
 obstacle-0.046
 problem-0.044
Token learned
Token ID4499
Feature activation-7.73e-02

Pos logit contributions
 Mil+0.057
 Mir+0.054
Gu+0.054
 But+0.052
 N+0.052
Neg logit contributions
 narrative-0.058
oral-0.056
 severe-0.046
ience-0.043
smart-0.043
Token an
Token ID281
Feature activation-8.35e-02

Pos logit contributions
 narrative+0.154
oral+0.147
iness+0.124
 obstacle+0.122
 severe+0.121
Neg logit contributions
 Mil-0.096
 But-0.092
 N-0.091
 Mir-0.090
 Ell-0.089
Token important
Token ID1593
Feature activation-9.62e-02

Pos logit contributions
 narrative+0.146
oral+0.123
 severe+0.111
ls+0.109
smart+0.108
Neg logit contributions
 But-0.110
 ask-0.108
 Just-0.103
 Ell-0.100
 Del-0.100
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.122
oral+0.091
iness+0.082
 pistols+0.082
 severe+0.081
Neg logit contributions
 But-0.197
 Just-0.182
 Suddenly-0.174
 Mil-0.174
 Mir-0.172
Token:
Token ID25
Feature activation-5.34e-02

Pos logit contributions
 narrative+0.316
oral+0.299
 severe+0.251
 obstacle+0.239
ery+0.227
Neg logit contributions
 But-0.340
 Mil-0.338
 Mir-0.324
 Just-0.321
 N-0.317
Token never
Token ID1239
Feature activation-6.15e-02

Pos logit contributions
 narrative+0.095
oral+0.093
smart+0.077
 severe+0.076
der+0.073
Neg logit contributions
 Mil-0.075
 Mir-0.068
Gu-0.067
 Del-0.064
 N-0.063
Token give
Token ID1577
Feature activation-1.58e-02

Pos logit contributions
 narrative+0.124
oral+0.120
 severe+0.104
 obstacle+0.099
smart+0.097
Neg logit contributions
 Mil-0.073
 Mir-0.067
 But-0.065
 N-0.064
 Del-0.064
Token up
Token ID510
Feature activation+2.59e-02

Pos logit contributions
 narrative+0.024
oral+0.023
 severe+0.021
 obstacle+0.019
smart+0.019
Neg logit contributions
 Mil-0.025
 Mir-0.023
Gu-0.023
 But-0.022
 Del-0.022
Token and
Token ID290
Feature activation-4.42e-02

Pos logit contributions
 Mil+0.037
 N+0.035
 But+0.035
 Mir+0.034
 Del+0.033
Neg logit contributions
 narrative-0.046
oral-0.044
 severe-0.036
 obstacle-0.034
oni-0.034
Token The
Token ID383
Feature activation-2.33e-03

Pos logit contributions
oral+0.028
 narrative+0.026
:+0.022
smart+0.021
der+0.021
Neg logit contributions
Gu-0.024
 Mil-0.024
 dipped-0.023
 imagine-0.022
 styl-0.022
Token boy
Token ID2933
Feature activation+1.42e-03

Pos logit contributions
 narrative+0.004
oral+0.004
 severe+0.003
 obstacle+0.003
smart+0.003
Neg logit contributions
 Mil-0.003
 Mir-0.003
 But-0.003
Gu-0.003
 Del-0.003
Token learned
Token ID4499
Feature activation-7.82e-02

Pos logit contributions
 Mil+0.002
Gu+0.002
 Mir+0.002
 Del+0.002
 Pand+0.002
Neg logit contributions
 narrative-0.002
oral-0.002
ience-0.002
 severe-0.001
smart-0.001
Token an
Token ID281
Feature activation-8.47e-02

Pos logit contributions
 narrative+0.159
oral+0.151
iness+0.125
 severe+0.122
 obstacle+0.121
Neg logit contributions
 Mil-0.094
 But-0.092
 N-0.089
 Mir-0.088
 Just-0.086
Token important
Token ID1593
Feature activation-9.54e-02

Pos logit contributions
 narrative+0.147
oral+0.124
 severe+0.111
oni+0.106
smart+0.105
Neg logit contributions
 But-0.120
 ask-0.118
 Just-0.111
 Suddenly-0.108
 Mil-0.105
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.126
oral+0.096
oni+0.085
 severe+0.083
iness+0.082
Neg logit contributions
 But-0.198
 Just-0.182
 Suddenly-0.175
 ask-0.170
 peek-0.167
Token that
Token ID326
Feature activation-5.99e-02

Pos logit contributions
 narrative+0.323
oral+0.305
 severe+0.255
 obstacle+0.238
oni+0.227
Neg logit contributions
 But-0.338
 Mil-0.330
 Just-0.322
 Mir-0.316
 N-0.311
Token day
Token ID1110
Feature activation-6.51e-02

Pos logit contributions
 narrative+0.100
oral+0.096
 severe+0.081
 obstacle+0.076
smart+0.075
Neg logit contributions
 Mil-0.090
 Mir-0.085
 But-0.084
 N-0.082
 Del-0.082
Token:
Token ID25
Feature activation-4.59e-02

Pos logit contributions
 narrative+0.098
oral+0.094
 severe+0.077
 obstacle+0.072
oni+0.070
Neg logit contributions
 Mil-0.108
 But-0.106
 Just-0.104
 Mir-0.103
 N-0.103
Token weapons
Token ID3777
Feature activation-1.02e-01

Pos logit contributions
 narrative+0.080
oral+0.080
smart+0.067
 severe+0.065
 story+0.064
Neg logit contributions
 Mil-0.064
 Mir-0.058
Gu-0.057
 Del-0.055
nes-0.054
Token can
Token ID460
Feature activation+4.23e-03

Pos logit contributions
 narrative+0.147
oral+0.141
 severe+0.114
 obstacle+0.104
oni+0.100
Neg logit contributions
 Mil-0.174
 But-0.167
 Mir-0.165
 Just-0.162
 N-0.160
Token little
Token ID1310
Feature activation+3.69e-02

Pos logit contributions
 narrative+0.017
oral+0.016
 severe+0.013
 obstacle+0.013
 problem+0.012
Neg logit contributions
 Mil-0.014
 Mir-0.013
 But-0.013
Gu-0.013
 Del-0.013
Token girl
Token ID2576
Feature activation+4.66e-02

Pos logit contributions
 Mil+0.054
Gu+0.051
 Mir+0.050
 Just+0.049
 Del+0.048
Neg logit contributions
 narrative-0.060
oral-0.057
 problem-0.051
 obstacle-0.051
 severe-0.048
Token learned
Token ID4499
Feature activation-7.14e-02

Pos logit contributions
 Mil+0.075
 Mir+0.071
Gu+0.071
 Del+0.067
 N+0.067
Neg logit contributions
 narrative-0.071
oral-0.068
 severe-0.058
ience-0.055
 problem-0.055
Token an
Token ID281
Feature activation-8.17e-02

Pos logit contributions
 narrative+0.141
oral+0.133
 obstacle+0.109
iness+0.109
 severe+0.107
Neg logit contributions
 Mil-0.089
 But-0.086
 Mir-0.084
 N-0.083
 Just-0.082
Token important
Token ID1593
Feature activation-9.34e-02

Pos logit contributions
 narrative+0.136
oral+0.114
 severe+0.100
 pistols+0.097
oni+0.097
Neg logit contributions
 But-0.119
 ask-0.116
 Just-0.113
 Suddenly-0.108
 Mil-0.107
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.118
oral+0.092
oni+0.082
ils+0.081
smart+0.079
Neg logit contributions
 But-0.193
 Just-0.179
 Suddenly-0.170
 ask-0.168
 Mil-0.165
Token -
Token ID532
Feature activation-9.71e-03

Pos logit contributions
 narrative+0.322
oral+0.303
 severe+0.253
 obstacle+0.243
oni+0.229
Neg logit contributions
 But-0.335
 Mil-0.326
 Just-0.320
 Mir-0.313
 N-0.308
Token be
Token ID307
Feature activation-2.10e-02

Pos logit contributions
 narrative+0.017
oral+0.016
 severe+0.014
smart+0.013
 obstacle+0.013
Neg logit contributions
 Mil-0.014
 Mir-0.013
Gu-0.013
 Del-0.012
 N-0.012
Token careful
Token ID8161
Feature activation+1.80e-02

Pos logit contributions
 narrative+0.039
oral+0.038
 severe+0.035
smart+0.033
 obstacle+0.031
Neg logit contributions
 Mil-0.028
 Mir-0.025
 N-0.024
 But-0.024
 Del-0.024
Token what
Token ID644
Feature activation-7.22e-02

Pos logit contributions
 Mil+0.028
 Mir+0.027
 N+0.026
Gu+0.026
 Del+0.026
Neg logit contributions
 narrative-0.029
oral-0.028
 severe-0.023
 obstacle-0.022
smart-0.022
Token you
Token ID345
Feature activation+2.37e-02

Pos logit contributions
 narrative+0.141
oral+0.137
 severe+0.119
 obstacle+0.112
oni+0.111
Neg logit contributions
 Mil-0.087
 Mir-0.082
 But-0.081
 N-0.078
 Del-0.077
Token the
Token ID262
Feature activation-3.77e-02

Pos logit contributions
 Mil+0.061
 Mir+0.056
 But+0.055
 N+0.055
 Del+0.053
Neg logit contributions
 narrative-0.111
oral-0.109
 severe-0.094
ience-0.088
 obstacle-0.087
Token party
Token ID2151
Feature activation+1.21e-02

Pos logit contributions
 narrative+0.070
oral+0.067
 severe+0.057
 obstacle+0.055
smart+0.053
Neg logit contributions
 Mil-0.051
 Mir-0.048
 But-0.046
 N-0.046
 Del-0.045
Token learned
Token ID4499
Feature activation-8.26e-02

Pos logit contributions
 Mil+0.019
 Mir+0.018
 But+0.018
 N+0.018
 Del+0.018
Neg logit contributions
 narrative-0.019
oral-0.018
 severe-0.015
 obstacle-0.015
smart-0.014
Token an
Token ID281
Feature activation-8.44e-02

Pos logit contributions
 narrative+0.171
oral+0.163
iness+0.139
 obstacle+0.136
 severe+0.134
Neg logit contributions
 But-0.100
 N-0.089
 Just-0.088
 L-0.088
 Ell-0.087
Token important
Token ID1593
Feature activation-9.42e-02

Pos logit contributions
 narrative+0.141
oral+0.121
 severe+0.110
ls+0.105
 pistols+0.104
Neg logit contributions
 But-0.113
 ask-0.112
 Ell-0.109
 Just-0.109
 N-0.106
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.126
oral+0.093
iness+0.087
 severe+0.087
smart+0.085
Neg logit contributions
 But-0.191
 Just-0.176
 Suddenly-0.165
 Mil-0.161
 ask-0.161
Token about
Token ID546
Feature activation+3.05e-03

Pos logit contributions
 narrative+0.360
oral+0.338
 severe+0.298
 obstacle+0.283
ery+0.267
Neg logit contributions
 But-0.301
 Mil-0.286
 Just-0.282
 N-0.276
 L-0.275
Token helping
Token ID5742
Feature activation-7.45e-02

Pos logit contributions
 Mil+0.004
 Mir+0.004
 But+0.004
 N+0.004
 Del+0.004
Neg logit contributions
 narrative-0.006
oral-0.006
 severe-0.005
 obstacle-0.004
iness-0.004
Token others
Token ID1854
Feature activation+2.00e-03

Pos logit contributions
 narrative+0.161
oral+0.155
 severe+0.134
 obstacle+0.129
smart+0.126
Neg logit contributions
 Mil-0.078
 Mir-0.072
 But-0.070
 N-0.069
 Del-0.067
Token.
Token ID13
Feature activation-2.87e-02

Pos logit contributions
 Mil+0.003
 But+0.003
 N+0.003
 Just+0.002
 Mir+0.002
Neg logit contributions
 narrative-0.004
oral-0.004
 severe-0.003
 obstacle-0.003
 mercy-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.06e-01

Pos logit contributions
oral+0.049
 narrative+0.049
 severe+0.039
smart+0.039
 story+0.038
Neg logit contributions
 Mil-0.041
 Mir-0.037
Gu-0.036
 Del-0.035
 blonde-0.035
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
 narrative+0.049
oral+0.048
 severe+0.037
 pistols+0.035
 obstacle+0.034
Neg logit contributions
 Mil-0.048
 Mir-0.045
 But-0.045
 N-0.044
 Del-0.044
Token Jane
Token ID12091
Feature activation+3.96e-02

Pos logit contributions
oral+0.028
 narrative+0.026
smart+0.021
:+0.021
der+0.021
Neg logit contributions
Gu-0.024
 dipped-0.024
 imagine-0.024
 Mil-0.023
 pretending-0.023
Token learned
Token ID4499
Feature activation-7.89e-02

Pos logit contributions
 Mil+0.061
 Mir+0.057
Gu+0.056
 But+0.055
 Del+0.055
Neg logit contributions
 narrative-0.064
oral-0.062
 severe-0.052
 obstacle-0.048
smart-0.048
Token an
Token ID281
Feature activation-8.73e-02

Pos logit contributions
 narrative+0.151
oral+0.144
iness+0.120
 obstacle+0.119
 severe+0.115
Neg logit contributions
 Mil-0.103
 But-0.100
 Mir-0.097
 N-0.096
 Just-0.093
Token important
Token ID1593
Feature activation-9.81e-02

Pos logit contributions
 narrative+0.146
oral+0.125
 severe+0.112
smart+0.111
 pistols+0.111
Neg logit contributions
 But-0.122
 ask-0.118
 Just-0.115
 Mil-0.110
 Ell-0.109
Token lesson
Token ID11483
Feature activation-2.05e-01

Pos logit contributions
 narrative+0.122
oral+0.094
 pistols+0.083
smart+0.081
 severe+0.080
Neg logit contributions
 But-0.206
 Just-0.190
 Suddenly-0.182
 Mil-0.182
 Mir-0.178
Token to
Token ID284
Feature activation-3.16e-02

Pos logit contributions
 narrative+0.299
oral+0.283
 severe+0.235
 obstacle+0.221
oni+0.212
Neg logit contributions
 But-0.351
 Mil-0.351
 Just-0.336
 Mir-0.335
 N-0.327
Token always
Token ID1464
Feature activation+1.44e-02

Pos logit contributions
 narrative+0.062
oral+0.060
 severe+0.052
 obstacle+0.049
 mercy+0.048
Neg logit contributions
 Mil-0.039
 Mir-0.036
Gu-0.035
 Del-0.034
 N-0.034
Token listen
Token ID6004
Feature activation-1.02e-01

Pos logit contributions
 Mil+0.015
 Mir+0.013
 But+0.012
 N+0.012
 Del+0.012
Neg logit contributions
 narrative-0.032
oral-0.031
 severe-0.027
 obstacle-0.026
smart-0.025
Token to
Token ID284
Feature activation-2.03e-02

Pos logit contributions
 narrative+0.197
oral+0.187
 severe+0.159
 obstacle+0.153
smart+0.148
Neg logit contributions
 Mil-0.131
 But-0.122
 Mir-0.121
 N-0.119
 Just-0.118
Token her
Token ID607
Feature activation+6.94e-03

Pos logit contributions
 narrative+0.042
oral+0.040
 severe+0.035
 obstacle+0.033
smart+0.033
Neg logit contributions
 Mil-0.023
 Mir-0.022
 N-0.021
 But-0.021
 Del-0.020
Token the
Token ID262
Feature activation-1.71e-02

Pos logit contributions
 narrative+0.101
oral+0.095
 severe+0.082
smart+0.079
 obstacle+0.078
Neg logit contributions
 Mil-0.087
Gu-0.081
 Mir-0.080
 Just-0.079
 But-0.078
Token little
Token ID1310
Feature activation+2.39e-02

Pos logit contributions
 narrative+0.028
oral+0.027
 problem+0.023
 severe+0.022
 obstacle+0.022
Neg logit contributions
 Mil-0.027
 But-0.026
 Just-0.025
 My-0.025
 Suddenly-0.025
Token bird
Token ID6512
Feature activation+1.17e-02

Pos logit contributions
 Just+0.038
 Mil+0.037
 Suddenly+0.037
nes+0.036
 But+0.036
Neg logit contributions
 narrative-0.033
 problem-0.032
oral-0.030
 obstacle-0.029
 mistake-0.028
Token and
Token ID290
Feature activation+1.41e-03

Pos logit contributions
 Mil+0.020
 Mir+0.019
Gu+0.019
 But+0.018
 Del+0.018
Neg logit contributions
 narrative-0.017
oral-0.016
 severe-0.013
 obstacle-0.012
smart-0.012
Token she
Token ID673
Feature activation+1.12e-02

Pos logit contributions
 Mil+0.002
 Mir+0.002
Gu+0.002
 Del+0.002
 But+0.002
Neg logit contributions
 narrative-0.003
oral-0.002
 severe-0.002
smart-0.002
 obstacle-0.002
Token began
Token ID2540
Feature activation+8.92e-02

Pos logit contributions
 Mil+0.016
 Mir+0.015
 But+0.014
 N+0.014
 Del+0.014
Neg logit contributions
 narrative-0.020
oral-0.019
 severe-0.016
 obstacle-0.016
smart-0.015
Token to
Token ID284
Feature activation+3.48e-02

Pos logit contributions
 Mil+0.132
 Mir+0.123
Gu+0.118
 Del+0.118
 N+0.118
Neg logit contributions
 narrative-0.151
oral-0.146
 severe-0.128
smart-0.119
 obstacle-0.116
Token comfort
Token ID4467
Feature activation-2.14e-02

Pos logit contributions
 Mil+0.044
 Mir+0.040
Gu+0.038
 N+0.038
 Del+0.038
Neg logit contributions
 narrative-0.068
oral-0.065
 severe-0.058
 obstacle-0.054
smart-0.054
Token it
Token ID340
Feature activation-1.63e-02

Pos logit contributions
 narrative+0.038
oral+0.036
 severe+0.031
smart+0.030
 obstacle+0.029
Neg logit contributions
 Mil-0.030
 Mir-0.028
Gu-0.027
 Del-0.027
 But-0.026
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
 narrative+0.023
oral+0.022
 severe+0.018
 obstacle+0.017
smart+0.016
Neg logit contributions
 Mil-0.029
 Mir-0.028
Gu-0.027
 Del-0.027
 N-0.027
Token She
Token ID1375
Feature activation-1.61e-02

Pos logit contributions
oral+0.025
 narrative+0.024
der+0.021
smart+0.021
:+0.020
Neg logit contributions
 Mil-0.017
Gu-0.015
nes-0.015
Whe-0.015
 Mir-0.015
Token can
Token ID460
Feature activation+3.15e-02

Pos logit contributions
 Mil+0.051
 Mir+0.048
 But+0.047
 N+0.046
 Del+0.046
Neg logit contributions
 narrative-0.065
oral-0.063
 severe-0.053
 obstacle-0.051
smart-0.050
Token find
Token ID1064
Feature activation+3.74e-02

Pos logit contributions
 Mil+0.030
 Mir+0.028
Gu+0.026
 N+0.026
 Del+0.025
Neg logit contributions
 narrative-0.068
oral-0.067
 severe-0.059
oni-0.057
 obstacle-0.057
Token in
Token ID287
Feature activation+8.08e-03

Pos logit contributions
 Mil+0.058
 Mir+0.054
Gu+0.053
 Del+0.053
 N+0.052
Neg logit contributions
 narrative-0.060
oral-0.057
 severe-0.048
smart-0.046
 obstacle-0.045
Token the
Token ID262
Feature activation-8.21e-03

Pos logit contributions
 Mil+0.009
 Mir+0.008
 N+0.008
 But+0.008
 Del+0.008
Neg logit contributions
 narrative-0.017
oral-0.017
 severe-0.014
iness-0.014
der-0.013
Token woods
Token ID16479
Feature activation+1.36e-02

Pos logit contributions
oral+0.015
 narrative+0.015
 severe+0.013
smart+0.013
iness+0.013
Neg logit contributions
 But-0.010
 Mil-0.010
 Suddenly-0.010
 Just-0.009
 ask-0.009
Token."
Token ID526
Feature activation+9.45e-02

Pos logit contributions
 Mil+0.021
 Mir+0.019
 But+0.019
 N+0.019
 Del+0.018
Neg logit contributions
 narrative-0.023
oral-0.022
 severe-0.018
 obstacle-0.017
smart-0.017
Token
Token ID198
Feature activation+2.90e-02

Pos logit contributions
nes+0.138
Gu+0.135
uries+0.128
astical+0.127
 Del+0.126
Neg logit contributions
 narrative-0.149
oral-0.148
 story-0.117
ls-0.117
oni-0.113
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
 Mil+0.026
 Mir+0.024
 But+0.023
 N+0.021
 L+0.021
Neg logit contributions
 narrative-0.069
oral-0.064
 severe-0.059
 obstacle-0.058
iness-0.057
TokenThey
Token ID2990
Feature activation+4.97e-03

Pos logit contributions
 Mil+0.030
 Mir+0.028
 But+0.027
 N+0.026
 Del+0.026
Neg logit contributions
 narrative-0.039
oral-0.038
 severe-0.033
 obstacle-0.031
smart-0.030
Token walked
Token ID6807
Feature activation-1.33e-02

Pos logit contributions
 Mil+0.006
 Mir+0.005
 N+0.005
 Del+0.005
 But+0.005
Neg logit contributions
 narrative-0.010
oral-0.010
 severe-0.009
 obstacle-0.008
smart-0.008
Token deeper
Token ID9211
Feature activation+1.52e-02

Pos logit contributions
 narrative+0.019
oral+0.019
 severe+0.015
 obstacle+0.014
smart+0.014
Neg logit contributions
 Mil-0.022
Gu-0.021
 My-0.021
 Mir-0.020
 N-0.020
Token are
Token ID389
Feature activation-1.19e-02

Pos logit contributions
Gu+0.019
 Mil+0.019
 Suddenly+0.018
 Del+0.018
 Just+0.018
Neg logit contributions
 narrative-0.032
oral-0.030
 story-0.026
 mercy-0.026
 problem-0.026
Token you
Token ID345
Feature activation+6.04e-02

Pos logit contributions
 narrative+0.023
oral+0.020
 mercy+0.019
smart+0.018
 pistols+0.018
Neg logit contributions
Gu-0.014
 Mil-0.013
 shades-0.013
 sill-0.013
 Del-0.013
Token there
Token ID612
Feature activation+1.03e-01

Pos logit contributions
 Mil+0.095
 Mir+0.089
 But+0.086
 Del+0.086
Gu+0.084
Neg logit contributions
 narrative-0.097
oral-0.092
 severe-0.081
 problem-0.079
smart-0.075
Token?"
Token ID1701
Feature activation+9.90e-02

Pos logit contributions
 Mil+0.167
 Mir+0.158
 N+0.156
 L+0.154
 Del+0.153
Neg logit contributions
 narrative-0.174
oral-0.161
 severe-0.133
ience-0.124
 obstacle-0.123
Token Ben
Token ID3932
Feature activation+2.53e-02

Pos logit contributions
nes+0.174
uries+0.169
Whoever+0.150
water+0.145
astical+0.145
Neg logit contributions
 narrative-0.152
oral-0.131
 bad-0.126
 problem-0.116
:-0.114
Token says
Token ID1139
Feature activation+8.61e-02

Pos logit contributions
 Mil+0.029
 Del+0.027
 My+0.027
 Mir+0.026
 Ell+0.026
Neg logit contributions
 narrative-0.050
oral-0.046
 problem-0.040
 story-0.040
 obstacle-0.039
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
 Mil+0.158
 Mir+0.150
Gu+0.148
 Del+0.147
 N+0.146
Neg logit contributions
 narrative-0.115
oral-0.109
 severe-0.087
smart-0.082
 obstacle-0.080
Token He
Token ID679
Feature activation-3.05e-02

Pos logit contributions
 Mil+0.024
 Mir+0.022
 But+0.021
 N+0.021
 Del+0.021
Neg logit contributions
 narrative-0.041
oral-0.040
 severe-0.035
 obstacle-0.033
smart-0.032
Token is
Token ID318
Feature activation+3.02e-02

Pos logit contributions
 narrative+0.049
oral+0.048
 severe+0.042
iness+0.038
 obstacle+0.037
Neg logit contributions
 Mil-0.048
 Mir-0.045
 But-0.044
 Just-0.044
 N-0.044
Token lonely
Token ID21757
Feature activation+7.49e-02

Pos logit contributions
 Mil+0.050
Gu+0.048
 But+0.047
 Mir+0.047
 Del+0.047
Neg logit contributions
 narrative-0.043
oral-0.037
 severe-0.035
smart-0.033
 problem-0.032
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
 But+0.084
 Mil+0.080
 L+0.078
 sill+0.077
 ask+0.077
Neg logit contributions
oral-0.152
 narrative-0.149
 severe-0.128
ience-0.122
oni-0.121
Token what
Token ID644
Feature activation-5.56e-02

Pos logit contributions
 Mil+0.019
 Mir+0.017
 But+0.017
Gu+0.017
 Del+0.017
Neg logit contributions
 narrative-0.025
oral-0.024
 severe-0.020
 obstacle-0.019
smart-0.019
Token's
Token ID338
Feature activation+5.19e-02

Pos logit contributions
 narrative+0.083
oral+0.080
 severe+0.066
 obstacle+0.062
smart+0.059
Neg logit contributions
 Mil-0.094
 Mir-0.090
 But-0.087
 N-0.086
 Del-0.086
Token wrong
Token ID2642
Feature activation-4.84e-02

Pos logit contributions
 Mil+0.097
 Mir+0.094
 But+0.093
 N+0.090
 Just+0.090
Neg logit contributions
 narrative-0.068
oral-0.068
 severe-0.051
 obstacle-0.047
smart-0.047
Token?"
Token ID1701
Feature activation+9.01e-02

Pos logit contributions
 narrative+0.083
oral+0.077
 severe+0.065
ience+0.062
 obstacle+0.061
Neg logit contributions
 Mil-0.076
 Mir-0.073
 But-0.072
 N-0.070
 Del-0.069
Token Lily
Token ID20037
Feature activation+8.18e-02

Pos logit contributions
Gu+0.116
nes+0.112
 shades+0.106
 blu+0.106
Whe+0.105
Neg logit contributions
 narrative-0.143
oral-0.141
 severe-0.132
 pistols-0.128
 bad-0.126
Token asked
Token ID1965
Feature activation+1.37e-01

Pos logit contributions
 Mil+0.075
Gu+0.065
 Del+0.065
 Mir+0.064
 My+0.061
Neg logit contributions
 narrative-0.178
oral-0.167
 severe-0.158
 pistols-0.150
smart-0.148
Token.
Token ID13
Feature activation+5.85e-03

Pos logit contributions
 Mil+0.255
Gu+0.244
 Mir+0.237
 Del+0.235
 But+0.234
Neg logit contributions
 narrative-0.175
oral-0.163
 severe-0.135
smart-0.125
 mercy-0.121
Token "
Token ID366
Feature activation+1.83e-02

Pos logit contributions
 Mil+0.008
Gu+0.007
 Del+0.007
nes+0.007
 N+0.007
Neg logit contributions
 narrative-0.010
oral-0.010
 severe-0.008
smart-0.008
der-0.008
TokenWhy
Token ID5195
Feature activation-1.06e-02

Pos logit contributions
 Mil+0.027
 Mir+0.025
 Del+0.024
 N+0.024
 But+0.024
Neg logit contributions
 narrative-0.032
oral-0.030
 severe-0.026
smart-0.024
 obstacle-0.023
Token can
Token ID460
Feature activation+2.14e-02

Pos logit contributions
 narrative+0.018
oral+0.018
 obstacle+0.015
 severe+0.015
smart+0.014
Neg logit contributions
 Mil-0.016
 Mir-0.015
 But-0.015
 N-0.014
 L-0.014
Token't
Token ID470
Feature activation+1.73e-02

Pos logit contributions
 Mil+0.024
 Mir+0.021
Gu+0.021
 Del+0.021
 N+0.021
Neg logit contributions
 narrative-0.043
oral-0.042
 severe-0.037
 mercy-0.035
 pistols-0.035
Token is
Token ID318
Feature activation+2.39e-02

Pos logit contributions
 Mil+0.070
 Mir+0.067
 N+0.066
Gu+0.065
 Del+0.065
Neg logit contributions
 narrative-0.038
oral-0.034
 severe-0.026
 obstacle-0.024
 problem-0.024
Token like
Token ID588
Feature activation+6.82e-02

Pos logit contributions
 Mil+0.035
 Mir+0.033
Gu+0.032
 But+0.032
 Del+0.032
Neg logit contributions
 narrative-0.041
oral-0.038
 severe-0.033
 obstacle-0.031
 problem-0.031
Token a
Token ID257
Feature activation+4.99e-04

Pos logit contributions
 Mil+0.090
 Mir+0.083
Gu+0.080
 But+0.080
 Del+0.079
Neg logit contributions
 narrative-0.127
oral-0.122
 severe-0.105
 obstacle-0.101
smart-0.099
Token picture
Token ID4286
Feature activation-9.86e-03

Pos logit contributions
 Mil+0.001
 Mir+0.001
 But+0.001
 N+0.001
 Del+0.001
Neg logit contributions
 narrative-0.001
oral-0.001
 severe-0.001
 obstacle-0.001
smart-0.001
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
 narrative+0.015
oral+0.014
 severe+0.012
 obstacle+0.011
smart+0.011
Neg logit contributions
 Mil-0.016
 Mir-0.015
 But-0.015
 N-0.015
 Del-0.015
Token See
Token ID4091
Feature activation+9.05e-02

Pos logit contributions
 Mil+0.021
 imagining+0.019
nes+0.019
 dipped+0.018
 peek+0.018
Neg logit contributions
 narrative-0.028
oral-0.025
smart-0.023
der-0.021
icks-0.020
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
 Mil+0.159
 Mir+0.150
Gu+0.149
 Del+0.145
 But+0.145
Neg logit contributions
 narrative-0.128
oral-0.120
 severe-0.097
 obstacle-0.094
 problem-0.093
Token I
Token ID314
Feature activation+5.09e-02

Pos logit contributions
 Mil+0.014
 Mir+0.013
Gu+0.013
 Del+0.012
 Just+0.012
Neg logit contributions
 narrative-0.020
oral-0.019
 severe-0.017
 pistols-0.016
 problem-0.016
Token made
Token ID925
Feature activation+1.41e-02

Pos logit contributions
 Mil+0.083
 Mir+0.080
Gu+0.075
 N+0.074
 Del+0.074
Neg logit contributions
 narrative-0.077
oral-0.076
 severe-0.062
smart-0.059
 problem-0.057
Token it
Token ID340
Feature activation+2.00e-02

Pos logit contributions
 Mil+0.022
Gu+0.021
 blonde+0.021
 But+0.020
 blond+0.020
Neg logit contributions
oral-0.020
 narrative-0.019
oni-0.018
 story-0.017
 problem-0.017
Token with
Token ID351
Feature activation+6.22e-02

Pos logit contributions
 Mil+0.035
Gu+0.033
 Mir+0.032
 N+0.032
 But+0.032
Neg logit contributions
 narrative-0.026
oral-0.026
 severe-0.023
 story-0.021
oni-0.021
Token like
Token ID588
Feature activation+6.95e-02

Pos logit contributions
 narrative+0.001
oral+0.001
 severe+0.001
iness+0.000
oni+0.000
Neg logit contributions
 Mil-0.000
 Mir-0.000
Gu-0.000
 Del-0.000
 My-0.000
Token to
Token ID284
Feature activation+3.00e-02

Pos logit contributions
 Mil+0.094
 Mir+0.088
 But+0.084
 N+0.084
 Del+0.083
Neg logit contributions
 narrative-0.129
oral-0.123
 severe-0.106
 obstacle-0.100
smart-0.098
Token play
Token ID711
Feature activation+2.89e-02

Pos logit contributions
 Mil+0.029
 Mir+0.027
Gu+0.025
 N+0.025
 Del+0.024
Neg logit contributions
 narrative-0.067
oral-0.064
 severe-0.056
 obstacle-0.054
smart-0.054
Token in
Token ID287
Feature activation+4.37e-03

Pos logit contributions
 Mil+0.037
 But+0.035
 Mir+0.035
 Just+0.033
 N+0.033
Neg logit contributions
 narrative-0.055
oral-0.055
 severe-0.049
iness-0.045
 obstacle-0.044
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
 Mil+0.006
 Mir+0.006
 But+0.006
Gu+0.006
 N+0.006
Neg logit contributions
 narrative-0.008
oral-0.007
 severe-0.006
 obstacle-0.006
smart-0.006
Token garden
Token ID11376
Feature activation+4.62e-02

Pos logit contributions
 narrative+0.033
oral+0.032
 severe+0.028
smart+0.027
iness+0.026
Neg logit contributions
 Mil-0.019
 Mir-0.017
 But-0.017
 N-0.017
 Del-0.016
Token.
Token ID13
Feature activation-9.96e-03

Pos logit contributions
 Mil+0.075
 Mir+0.071
 But+0.070
 N+0.069
 Del+0.069
Neg logit contributions
 narrative-0.072
oral-0.069
 severe-0.059
 obstacle-0.054
smart-0.053
Token One
Token ID1881
Feature activation+2.88e-02

Pos logit contributions
 narrative+0.017
oral+0.017
 story+0.013
 pistols+0.013
 obstacle+0.013
Neg logit contributions
Gu-0.012
 Mil-0.012
nes-0.012
 Mir-0.012
 Del-0.011
Token day
Token ID1110
Feature activation-3.41e-02

Pos logit contributions
 Mil+0.041
 Mir+0.038
 But+0.038
 Del+0.037
 N+0.036
Neg logit contributions
 narrative-0.052
oral-0.049
 severe-0.043
smart-0.040
 mercy-0.040
Token,
Token ID11
Feature activation-4.07e-03

Pos logit contributions
 narrative+0.035
oral+0.034
 problem+0.024
 pistols+0.023
 severe+0.023
Neg logit contributions
 Mil-0.068
Gu-0.067
 sill-0.066
 Mir-0.065
 Del-0.065
Token they
Token ID484
Feature activation-7.13e-03

Pos logit contributions
 narrative+0.004
oral+0.004
 pistols+0.003
oni+0.003
ls+0.003
Neg logit contributions
Gu-0.008
 sill-0.007
 Mil-0.007
 Just-0.007
 Mir-0.007
Token eat
Token ID4483
Feature activation-3.45e-02

Pos logit contributions
 Mil+0.010
 Mir+0.009
 N+0.009
 But+0.008
 Del+0.008
Neg logit contributions
 narrative-0.020
oral-0.020
 severe-0.017
 obstacle-0.017
smart-0.016
Token their
Token ID511
Feature activation+3.08e-02

Pos logit contributions
 narrative+0.077
oral+0.074
 obstacle+0.065
 severe+0.065
smart+0.064
Neg logit contributions
 Mil-0.041
 Sara-0.038
 N-0.037
 Mir-0.037
 L-0.035
Token lunch
Token ID9965
Feature activation-4.05e-02

Pos logit contributions
 Mil+0.047
 Mir+0.044
 N+0.043
 But+0.043
 Del+0.043
Neg logit contributions
 narrative-0.052
oral-0.049
smart-0.041
 severe-0.041
 obstacle-0.039
Token and
Token ID290
Feature activation+8.90e-03

Pos logit contributions
 narrative+0.074
oral+0.072
 obstacle+0.064
 severe+0.063
smart+0.059
Neg logit contributions
 Mil-0.057
 Mir-0.053
 But-0.053
 N-0.051
 L-0.050
Token enjoy
Token ID2883
Feature activation+1.96e-02

Pos logit contributions
 Mil+0.010
 Mir+0.010
 But+0.009
 N+0.009
 Del+0.009
Neg logit contributions
 narrative-0.018
oral-0.018
 severe-0.015
 obstacle-0.015
smart-0.014
Token it
Token ID340
Feature activation+1.86e-02

Pos logit contributions
 Mil+0.024
 Mir+0.022
 N+0.022
 L+0.021
 Sara+0.021
Neg logit contributions
 narrative-0.041
oral-0.040
 severe-0.035
 obstacle-0.034
iness-0.034
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
 Mil+0.024
 Mir+0.023
 But+0.023
 N+0.023
 Ell+0.022
Neg logit contributions
 narrative-0.036
oral-0.035
 severe-0.030
 obstacle-0.029
smart-0.027
Token They
Token ID1119
Feature activation+1.21e-02

Pos logit contributions
 Mil+0.034
Gu+0.033
 Del+0.031
nes+0.031
 Mir+0.030
Neg logit contributions
 narrative-0.036
oral-0.035
 severe-0.028
der-0.027
 story-0.027
Token wash
Token ID13502
Feature activation+1.09e-02

Pos logit contributions
 Mil+0.016
 Mir+0.015
 N+0.015
 But+0.015
 Del+0.014
Neg logit contributions
 narrative-0.022
oral-0.022
 severe-0.019
smart-0.018
 obstacle-0.018
Token their
Token ID511
Feature activation+5.28e-02

Pos logit contributions
 Mil+0.016
 Mir+0.015
 N+0.014
 But+0.014
 Del+0.013
Neg logit contributions
 narrative-0.020
oral-0.020
 severe-0.016
 obstacle-0.016
smart-0.016
Token hands
Token ID2832
Feature activation-4.52e-02

Pos logit contributions
 Mil+0.063
 Mir+0.058
 But+0.056
 N+0.056
 Del+0.055
Neg logit contributions
 narrative-0.106
oral-0.103
 severe-0.089
smart-0.086
 obstacle-0.085
Token and
Token ID290
Feature activation-1.19e-03

Pos logit contributions
 narrative+0.066
oral+0.063
 severe+0.052
 obstacle+0.048
smart+0.047
Neg logit contributions
 Mil-0.079
 Mir-0.075
 But-0.072
 N-0.072
 Del-0.072
Token covered
Token ID5017
Feature activation+4.58e-02

Pos logit contributions
 narrative+0.002
 severe+0.001
 pistols+0.001
oral+0.001
 all+0.001
Neg logit contributions
Gu-0.002
 My-0.002
 Mil-0.002
 Pand-0.002
 Just-0.002
Token it
Token ID340
Feature activation+1.64e-02

Pos logit contributions
Gu+0.064
 Mil+0.060
 Just+0.060
 My+0.058
 But+0.058
Neg logit contributions
 narrative-0.074
oral-0.068
 severe-0.062
der-0.060
oni-0.060
Token with
Token ID351
Feature activation+6.35e-02

Pos logit contributions
 Mil+0.021
Gu+0.021
 Mir+0.020
 N+0.019
 Del+0.019
Neg logit contributions
 narrative-0.029
oral-0.028
 severe-0.025
 obstacle-0.024
der-0.023
Token some
Token ID617
Feature activation+2.42e-02

Pos logit contributions
 Mil+0.090
 Mir+0.084
Gu+0.082
 Del+0.081
 But+0.080
Neg logit contributions
 narrative-0.112
oral-0.106
 severe-0.094
 obstacle-0.088
smart-0.085
Token leaves
Token ID5667
Feature activation+2.37e-02

Pos logit contributions
 Mil+0.036
 Mir+0.033
Gu+0.032
 Del+0.032
 N+0.032
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.034
 obstacle-0.032
smart-0.031
Token.
Token ID13
Feature activation+7.19e-03

Pos logit contributions
 Mil+0.036
 Mir+0.034
 But+0.033
 N+0.032
 Just+0.032
Neg logit contributions
 narrative-0.042
oral-0.040
 severe-0.032
 obstacle-0.031
smart-0.029
Token
Token ID198
Feature activation+1.10e-02

Pos logit contributions
 imagine+0.009
nes+0.009
Gu+0.009
 seen+0.008
 shades+0.008
Neg logit contributions
oral-0.012
 narrative-0.011
der-0.010
oni-0.010
:-0.010
Token
Token ID198
Feature activation+5.29e-03

Pos logit contributions
 Mil+0.015
Gu+0.014
 N+0.014
 Mir+0.014
 Del+0.014
Neg logit contributions
oral-0.019
 narrative-0.018
 severe-0.016
oni-0.015
smart-0.014
Token"
Token ID1
Feature activation+4.96e-02

Pos logit contributions
 Mil+0.010
 Mir+0.010
 Pand+0.010
 Del+0.010
 N+0.010
Neg logit contributions
oral-0.006
 narrative-0.006
 severe-0.005
oni-0.004
in-0.004
TokenThere
Token ID1858
Feature activation+7.10e-02

Pos logit contributions
 Mil+0.071
 My+0.068
 Just+0.067
 Mir+0.067
 But+0.065
Neg logit contributions
 narrative-0.091
oral-0.082
 severe-0.063
oni-0.059
 moral-0.059
Token borrow
Token ID8804
Feature activation-6.11e-02

Pos logit contributions
 narrative+0.028
oral+0.027
smart+0.024
 severe+0.023
 story+0.022
Neg logit contributions
 Mil-0.022
 Mir-0.021
Gu-0.020
 N-0.019
 Suddenly-0.019
Token her
Token ID607
Feature activation+9.25e-03

Pos logit contributions
 narrative+0.109
oral+0.103
 severe+0.092
 pistols+0.092
smart+0.091
Neg logit contributions
 Mil-0.077
Gu-0.070
 Mir-0.070
 Just-0.070
 But-0.067
Token pencil
Token ID21613
Feature activation-8.89e-02

Pos logit contributions
 Mil+0.014
 Mir+0.013
 But+0.013
 N+0.013
 Del+0.013
Neg logit contributions
 narrative-0.016
oral-0.015
 severe-0.012
 obstacle-0.012
smart-0.012
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
 narrative+0.141
oral+0.134
 severe+0.111
 obstacle+0.105
 mercy+0.099
Neg logit contributions
 Mil-0.144
 Mir-0.136
 But-0.134
 N-0.133
 Del-0.132
Token Lily
Token ID20037
Feature activation+4.48e-02

Pos logit contributions
oral+0.032
:+0.027
 narrative+0.027
 story+0.027
oni+0.025
Neg logit contributions
 blu-0.034
 Mil-0.033
nes-0.032
 seen-0.032
foot-0.031
Token said
Token ID531
Feature activation+5.44e-03

Pos logit contributions
 Mil+0.061
 Mir+0.057
 But+0.056
 Del+0.055
 N+0.055
Neg logit contributions
 narrative-0.082
oral-0.078
 severe-0.068
 obstacle-0.065
smart-0.063
Token yes
Token ID3763
Feature activation-5.34e-02

Pos logit contributions
 Mil+0.008
 Mir+0.007
 But+0.007
 N+0.007
 Del+0.007
Neg logit contributions
 narrative-0.010
oral-0.009
 severe-0.008
 obstacle-0.008
smart-0.007
Token and
Token ID290
Feature activation-3.49e-02

Pos logit contributions
 narrative+0.090
oral+0.083
 severe+0.069
 obstacle+0.066
smart+0.064
Neg logit contributions
 Mil-0.084
 But-0.082
 Mir-0.079
 Just-0.079
 N-0.078
Token gave
Token ID2921
Feature activation-3.78e-03

Pos logit contributions
 narrative+0.069
oral+0.067
 severe+0.059
smart+0.056
oni+0.056
Neg logit contributions
 Mil-0.039
 Mir-0.036
 But-0.035
Gu-0.034
 Just-0.034
Token her
Token ID607
Feature activation+7.92e-03

Pos logit contributions
 narrative+0.008
oral+0.007
 severe+0.006
oni+0.006
smart+0.006
Neg logit contributions
 Mil-0.004
 Mir-0.004
Gu-0.004
 Del-0.004
 But-0.004
Token friend
Token ID1545
Feature activation-6.70e-03

Pos logit contributions
 Mil+0.010
 Mir+0.009
 But+0.009
 Del+0.009
 N+0.009
Neg logit contributions
 narrative-0.015
oral-0.015
 severe-0.013
 obstacle-0.012
iness-0.012
Token also
Token ID635
Feature activation+3.86e-02

Pos logit contributions
 Mil+0.011
 But+0.010
 Mir+0.010
 N+0.010
 Del+0.009
Neg logit contributions
 narrative-0.013
oral-0.012
 severe-0.011
 obstacle-0.010
smart-0.010
Token learned
Token ID4499
Feature activation-5.98e-02

Pos logit contributions
 Mil+0.055
 Mir+0.052
 But+0.050
 N+0.050
 Del+0.049
Neg logit contributions
 narrative-0.068
oral-0.066
 severe-0.056
 obstacle-0.053
smart-0.052
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
 narrative+0.127
oral+0.120
iness+0.105
 obstacle+0.103
 severe+0.101
Neg logit contributions
 But-0.065
 Mil-0.064
 Mir-0.063
 N-0.061
 L-0.061
Token be
Token ID307
Feature activation-1.36e-02

Pos logit contributions
 narrative+0.020
oral+0.019
 severe+0.016
 story+0.016
 mercy+0.016
Neg logit contributions
 Mil-0.014
Gu-0.013
 Mir-0.013
 My-0.012
 Del-0.012
Token safe
Token ID3338
Feature activation-3.90e-03

Pos logit contributions
 narrative+0.024
oral+0.023
 severe+0.021
smart+0.020
 obstacle+0.019
Neg logit contributions
 Mil-0.019
 Mir-0.017
 Del-0.017
 But-0.017
 N-0.017
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 narrative+0.006
oral+0.006
 severe+0.005
 obstacle+0.005
smart+0.005
Neg logit contributions
 Mil-0.006
 Mir-0.006
 But-0.006
 N-0.006
 Just-0.005
Token to
Token ID284
Feature activation+2.86e-02

Pos logit contributions
 Mil+0.023
 Mir+0.021
 Del+0.020
 N+0.020
Gu+0.020
Neg logit contributions
 narrative-0.028
oral-0.026
 severe-0.024
smart-0.022
 obstacle-0.021
Token ask
Token ID1265
Feature activation+8.82e-02

Pos logit contributions
 Mil+0.030
 Mir+0.027
Gu+0.027
 Del+0.026
 N+0.026
Neg logit contributions
 narrative-0.061
oral-0.058
 severe-0.052
 obstacle-0.050
smart-0.049
Token for
Token ID329
Feature activation-8.06e-02

Pos logit contributions
 Mil+0.142
 Mir+0.135
 N+0.132
 But+0.131
 Del+0.128
Neg logit contributions
 narrative-0.142
oral-0.135
 severe-0.113
 obstacle-0.106
smart-0.102
Token help
Token ID1037
Feature activation-7.09e-02

Pos logit contributions
 narrative+0.140
oral+0.134
 severe+0.115
 obstacle+0.108
smart+0.107
Neg logit contributions
 Mil-0.117
 Mir-0.109
 But-0.106
Gu-0.105
 Del-0.105
Token when
Token ID618
Feature activation+6.34e-02

Pos logit contributions
 narrative+0.102
oral+0.097
 severe+0.080
 obstacle+0.074
smart+0.073
Neg logit contributions
 Mil-0.124
 Mir-0.117
Gu-0.115
 Del-0.114
 But-0.114
Token,
Token ID11
Feature activation-3.27e-02

Pos logit contributions
 narrative+0.076
oral+0.071
 severe+0.060
 obstacle+0.057
ery+0.055
Neg logit contributions
 Mil-0.070
 Mir-0.066
 N-0.066
 But-0.065
 L-0.064
Token "
Token ID366
Feature activation+7.34e-04

Pos logit contributions
 narrative+0.059
oral+0.057
 severe+0.049
 obstacle+0.046
smart+0.046
Neg logit contributions
 Mil-0.044
 Mir-0.041
Gu-0.040
 N-0.040
 Del-0.039
TokenIt
Token ID1026
Feature activation-1.03e-02

Pos logit contributions
 Mil+0.001
 Mir+0.001
 pretending+0.001
 N+0.001
 Ell+0.001
Neg logit contributions
oral-0.001
 narrative-0.001
in-0.001
 severe-0.001
smart-0.001
Token means
Token ID1724
Feature activation+2.22e-02

Pos logit contributions
 narrative+0.014
oral+0.014
 severe+0.011
smart+0.011
ience+0.011
Neg logit contributions
 Mil-0.018
 Mir-0.017
 N-0.016
Gu-0.016
foot-0.016
Token to
Token ID284
Feature activation+1.85e-02

Pos logit contributions
 Mil+0.029
Gu+0.027
 Mir+0.027
 But+0.025
 Del+0.025
Neg logit contributions
 narrative-0.040
oral-0.039
 severe-0.034
 problem-0.033
smart-0.033
Token write
Token ID3551
Feature activation-3.66e-02

Pos logit contributions
 Mil+0.022
Gu+0.020
 Mir+0.020
 Just+0.019
 My+0.019
Neg logit contributions
 narrative-0.037
oral-0.035
 severe-0.031
 obstacle-0.029
 mercy-0.029
Token your
Token ID534
Feature activation+2.89e-02

Pos logit contributions
 narrative+0.064
oral+0.061
 severe+0.052
 obstacle+0.049
smart+0.047
Neg logit contributions
 Mil-0.053
 Mir-0.050
 But-0.049
 N-0.048
 Del-0.048
Token name
Token ID1438
Feature activation+2.84e-02

Pos logit contributions
 Mil+0.032
 Mir+0.029
Gu+0.027
 Ph+0.027
 Ell+0.026
Neg logit contributions
 narrative-0.059
oral-0.059
 story-0.050
 problem-0.050
 severe-0.049
Token so
Token ID523
Feature activation-2.47e-02

Pos logit contributions
 Mil+0.041
 Mir+0.039
 N+0.039
 But+0.039
 My+0.039
Neg logit contributions
 narrative-0.050
oral-0.046
 severe-0.040
 obstacle-0.038
 mercy-0.036
Token everyone
Token ID2506
Feature activation-2.80e-02

Pos logit contributions
 narrative+0.048
oral+0.045
 severe+0.039
 obstacle+0.037
smart+0.036
Neg logit contributions
 Mil-0.032
 Mir-0.030
 N-0.030
 But-0.030
 Del-0.029
Token knows
Token ID4206
Feature activation-3.78e-02

Pos logit contributions
 narrative+0.047
oral+0.045
 severe+0.038
 obstacle+0.036
smart+0.036
Neg logit contributions
 Mil-0.042
 Mir-0.039
 But-0.038
Gu-0.038
 Del-0.038
Token and
Token ID290
Feature activation-1.25e-02

Pos logit contributions
 Mil+0.030
 Mir+0.028
Gu+0.027
 Del+0.027
 N+0.027
Neg logit contributions
 narrative-0.029
oral-0.028
 severe-0.024
 obstacle-0.022
smart-0.022
Token helping
Token ID5742
Feature activation-6.58e-02

Pos logit contributions
 narrative+0.022
oral+0.021
 severe+0.018
smart+0.017
 obstacle+0.017
Neg logit contributions
 Mil-0.019
 Mir-0.017
 N-0.016
 Del-0.016
Gu-0.016
Token them
Token ID606
Feature activation+5.71e-02

Pos logit contributions
 narrative+0.122
oral+0.118
 severe+0.103
 obstacle+0.098
smart+0.097
Neg logit contributions
 Mil-0.086
 Mir-0.079
Gu-0.078
 Del-0.076
 Just-0.075
Token when
Token ID618
Feature activation+5.83e-02

Pos logit contributions
 Mil+0.078
Gu+0.072
 Mir+0.072
 N+0.067
 Del+0.066
Neg logit contributions
 narrative-0.100
oral-0.099
 severe-0.086
smart-0.082
 obstacle-0.082
Token they
Token ID484
Feature activation-1.67e-02

Pos logit contributions
 Mil+0.074
 Mir+0.069
Gu+0.068
 Del+0.066
 Just+0.065
Neg logit contributions
 narrative-0.110
oral-0.106
 severe-0.091
 problem-0.091
 obstacle-0.088
Token need
Token ID761
Feature activation-3.63e-02

Pos logit contributions
 narrative+0.030
oral+0.029
 severe+0.025
 obstacle+0.023
smart+0.023
Neg logit contributions
 Mil-0.023
 Mir-0.022
 But-0.021
 N-0.021
 Del-0.021
Token it
Token ID340
Feature activation-4.37e-05

Pos logit contributions
oral+0.059
 narrative+0.059
 severe+0.051
 problem+0.049
 story+0.048
Neg logit contributions
 Mil-0.055
 Mir-0.050
Gu-0.048
 Del-0.048
Whe-0.047
Token."
Token ID526
Feature activation+6.33e-02

Pos logit contributions
 narrative+0.000
oral+0.000
 severe+0.000
 obstacle+0.000
smart+0.000
Neg logit contributions
 Mil-0.000
 Mir-0.000
Gu-0.000
 Del-0.000
 N-0.000
Token
Token ID220
Feature activation-5.54e-03

Pos logit contributions
 Mil+0.080
Gu+0.076
 Mir+0.075
 Del+0.070
 blu+0.069
Neg logit contributions
 narrative-0.110
oral-0.110
der-0.089
 story-0.088
 severe-0.085
Token
Token ID198
Feature activation-6.65e-03

Pos logit contributions
oral+0.009
 narrative+0.009
 severe+0.007
oni+0.007
smart+0.007
Neg logit contributions
 Mil-0.008
Gu-0.008
 N-0.008
 Mir-0.007
 Del-0.007
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 narrative+0.013
oral+0.013
 severe+0.011
smart+0.010
 obstacle+0.010
Neg logit contributions
 Mil-0.008
 Mir-0.008
 N-0.007
Gu-0.007
 Del-0.007
Token opened
Token ID4721
Feature activation+3.82e-02

Pos logit contributions
 Mil+0.006
 Mir+0.005
Gu+0.005
 But+0.005
 Del+0.005
Neg logit contributions
 narrative-0.009
oral-0.008
 severe-0.007
 obstacle-0.007
smart-0.007
Token it
Token ID340
Feature activation-1.76e-02

Pos logit contributions
 Mil+0.055
Gu+0.052
 Mir+0.051
 Just+0.051
 My+0.051
Neg logit contributions
 narrative-0.063
oral-0.059
 severe-0.051
 obstacle-0.049
 problem-0.048
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
 narrative+0.030
oral+0.029
 severe+0.024
 obstacle+0.023
iness+0.022
Neg logit contributions
 Mil-0.027
 Mir-0.026
 But-0.026
 Del-0.025
 Just-0.025
Token took
Token ID1718
Feature activation-6.20e-02

Pos logit contributions
 narrative+0.021
 on+0.018
oral+0.018
ils+0.018
ls+0.017
Neg logit contributions
Gu-0.020
 My-0.019
 Just-0.019
 Mil-0.018
 Pand-0.018
Token out
Token ID503
Feature activation+1.44e-02

Pos logit contributions
 narrative+0.070
ils+0.062
 on+0.061
 hard+0.060
 problem+0.059
Neg logit contributions
Gu-0.111
 My-0.103
 Just-0.101
 Mil-0.101
 Pand-0.099
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
 Mil+0.022
Gu+0.021
 But+0.020
 Mir+0.020
 Just+0.020
Neg logit contributions
 narrative-0.023
oral-0.022
 severe-0.019
smart-0.018
 obstacle-0.018
Token things
Token ID1243
Feature activation-1.24e-02

Pos logit contributions
 narrative+0.029
oral+0.026
 severe+0.020
 obstacle+0.019
 problem+0.019
Neg logit contributions
 Mil-0.049
 Mir-0.046
Gu-0.045
 Del-0.045
 N-0.045
Token.
Token ID13
Feature activation+6.99e-04

Pos logit contributions
 narrative+0.018
oral+0.017
 severe+0.014
smart+0.013
 obstacle+0.013
Neg logit contributions
 Mil-0.021
Gu-0.020
 Mir-0.020
 Del-0.020
 But-0.019
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
 imagine+0.001
 artist+0.001
 imagining+0.001
acher+0.001
 pretending+0.001
Neg logit contributions
oral-0.001
 narrative-0.001
:-0.001
smart-0.001
der-0.000
Token
Token ID198
Feature activation+6.89e-03

Pos logit contributions
 Mil+0.016
Gu+0.015
 N+0.015
 My+0.015
 Del+0.015
Neg logit contributions
oral-0.020
 narrative-0.019
 severe-0.016
smart-0.015
 story-0.015
TokenS
Token ID50
Feature activation-1.02e-02

Pos logit contributions
 Mil+0.014
 pretending+0.014
 Just+0.014
 imagining+0.014
 N+0.014
Neg logit contributions
oral-0.007
 narrative-0.006
 severe-0.005
in-0.004
Together-0.004
Token fun
Token ID1257
Feature activation+1.69e-02

Pos logit contributions
 narrative+0.096
oral+0.092
 severe+0.079
 obstacle+0.075
smart+0.074
Neg logit contributions
 Mil-0.073
 Mir-0.069
 Del-0.065
 N-0.065
 But-0.065
Token at
Token ID379
Feature activation+4.08e-02

Pos logit contributions
 Mil+0.027
 Mir+0.025
 But+0.024
 N+0.024
 Del+0.024
Neg logit contributions
 narrative-0.028
oral-0.027
 severe-0.022
 obstacle-0.021
smart-0.020
Token the
Token ID262
Feature activation-4.12e-02

Pos logit contributions
 Mil+0.050
 Mir+0.046
 N+0.043
 Del+0.042
 Ell+0.042
Neg logit contributions
 narrative-0.087
oral-0.085
 severe-0.073
ience-0.069
iness-0.068
Token beach
Token ID10481
Feature activation+7.22e-02

Pos logit contributions
 narrative+0.082
oral+0.079
 severe+0.068
 obstacle+0.067
 problem+0.065
Neg logit contributions
 Mil-0.049
 Mir-0.046
 But-0.044
Gu-0.043
 Del-0.043
Token!
Token ID0
Feature activation-2.54e-02

Pos logit contributions
 Mil+0.123
 Mir+0.116
 But+0.113
 N+0.113
 Del+0.112
Neg logit contributions
 narrative-0.110
oral-0.105
 severe-0.084
 obstacle-0.078
smart-0.076
Token<|endoftext|>
Token ID50256
Feature activation-9.19e-02

Pos logit contributions
 narrative+0.047
oral+0.047
 severe+0.038
oni+0.037
der+0.036
Neg logit contributions
 Mil-0.030
Gu-0.029
 Mir-0.028
 Pand-0.027
 My-0.027
TokenOnce
Token ID7454
Feature activation-7.49e-03

Pos logit contributions
 narrative+0.174
oral+0.167
 severe+0.147
 obstacle+0.140
smart+0.135
Neg logit contributions
 Mil-0.121
 Mir-0.112
 But-0.110
 N-0.109
 Del-0.108
Token upon
Token ID2402
Feature activation+1.71e-02

Pos logit contributions
 narrative+0.017
oral+0.016
 severe+0.014
 obstacle+0.014
smart+0.014
Neg logit contributions
 Mil-0.008
 Mir-0.007
 But-0.007
 N-0.006
 Del-0.006
Token a
Token ID257
Feature activation-3.66e-02

Pos logit contributions
 Mil+0.026
 Mir+0.024
 Del+0.023
 N+0.022
 But+0.022
Neg logit contributions
 narrative-0.027
oral-0.026
 severe-0.023
 story-0.021
 mercy-0.021
Token time
Token ID640
Feature activation+2.84e-03

Pos logit contributions
 narrative+0.063
oral+0.060
 severe+0.051
 obstacle+0.048
smart+0.048
Neg logit contributions
 Mil-0.054
 But-0.051
 Mir-0.051
 Just-0.049
 N-0.049
Token there
Token ID612
Feature activation+7.78e-02

Pos logit contributions
 Mil+0.006
Gu+0.005
 Mir+0.005
 N+0.005
 Del+0.005
Neg logit contributions
 narrative-0.003
oral-0.003
 problem-0.002
smart-0.002
 severe-0.002
Token slowly
Token ID6364
Feature activation+1.90e-03

Pos logit contributions
 narrative+0.085
oral+0.081
 severe+0.069
 obstacle+0.065
smart+0.064
Neg logit contributions
 Mil-0.071
 Mir-0.066
 But-0.064
 N-0.064
 Del-0.064
Token crept
Token ID47092
Feature activation-3.06e-03

Pos logit contributions
 Mil+0.003
Gu+0.002
 My+0.002
 But+0.002
 Mir+0.002
Neg logit contributions
 narrative-0.003
oral-0.003
 severe-0.003
smart-0.003
 obstacle-0.003
Token out
Token ID503
Feature activation+2.38e-02

Pos logit contributions
 narrative+0.004
oral+0.004
 severe+0.003
smart+0.003
 mercy+0.003
Neg logit contributions
 Mil-0.005
Gu-0.005
 Mir-0.005
 My-0.005
Whe-0.005
Token of
Token ID286
Feature activation+1.79e-02

Pos logit contributions
 Mil+0.037
 Mir+0.035
 But+0.033
 Del+0.033
 L+0.033
Neg logit contributions
 narrative-0.043
oral-0.040
 severe-0.033
oni-0.032
ience-0.031
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
 Mil+0.023
 Mir+0.021
 But+0.020
 N+0.020
 Del+0.020
Neg logit contributions
 narrative-0.036
oral-0.035
 severe-0.029
ience-0.028
der-0.028
Token bush
Token ID24484
Feature activation-4.16e-02

Pos logit contributions
 narrative+0.029
oral+0.027
 severe+0.022
 obstacle+0.020
smart+0.020
Neg logit contributions
 Mil-0.040
 Mir-0.038
 But-0.037
 N-0.037
 Del-0.036
Token,
Token ID11
Feature activation-2.28e-03

Pos logit contributions
 narrative+0.065
oral+0.062
 severe+0.050
 obstacle+0.048
smart+0.046
Neg logit contributions
 Mil-0.069
 Mir-0.065
 But-0.064
 N-0.063
 Del-0.063
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
 severe+0.003
 narrative+0.003
oral+0.002
smart+0.002
der+0.002
Neg logit contributions
 Mil-0.004
Gu-0.004
Whe-0.004
nes-0.004
 Del-0.004
Token looked
Token ID3114
Feature activation-3.59e-02

Pos logit contributions
 narrative+0.024
 severe+0.022
oral+0.021
smart+0.020
 mercy+0.019
Neg logit contributions
Gu-0.026
 Mil-0.026
 My-0.024
 Pand-0.024
 Mir-0.024
Token around
Token ID1088
Feature activation+5.01e-02

Pos logit contributions
 narrative+0.059
oral+0.056
 severe+0.047
 obstacle+0.044
smart+0.043
Neg logit contributions
 Mil-0.056
 Mir-0.053
 But-0.051
 N-0.051
 Del-0.051
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
 Mil+0.086
 Mir+0.081
Gu+0.079
 Del+0.078
 N+0.078
Neg logit contributions
 narrative-0.074
oral-0.070
 severe-0.058
 obstacle-0.054
smart-0.053
Token.
Token ID13
Feature activation+2.43e-03

Pos logit contributions
 narrative+0.064
oral+0.061
 severe+0.052
 obstacle+0.049
smart+0.046
Neg logit contributions
 Mil-0.060
 Mir-0.056
 But-0.056
 N-0.056
 Just-0.054
Token You
Token ID921
Feature activation-2.05e-03

Pos logit contributions
 Mil+0.003
 peek+0.003
nes+0.003
 Del+0.003
 begged+0.003
Neg logit contributions
 narrative-0.005
oral-0.004
smart-0.004
der-0.003
ience-0.003
Token learned
Token ID4499
Feature activation-6.57e-02

Pos logit contributions
 narrative+0.003
oral+0.003
 severe+0.003
 obstacle+0.003
smart+0.002
Neg logit contributions
 Mil-0.003
 But-0.003
 Mir-0.003
 N-0.003
 Del-0.003
Token a
Token ID257
Feature activation-4.22e-02

Pos logit contributions
 narrative+0.144
oral+0.137
iness+0.127
 severe+0.120
 obstacle+0.114
Neg logit contributions
 But-0.068
 N-0.067
 L-0.067
 Mil-0.067
 Ell-0.066
Token good
Token ID922
Feature activation-9.46e-02

Pos logit contributions
 narrative+0.075
iness+0.064
ils+0.060
 ling+0.057
agles+0.057
Neg logit contributions
 But-0.068
 ask-0.056
 Just-0.056
 pretend-0.054
 Suddenly-0.053
Token lesson
Token ID11483
Feature activation-1.84e-01

Pos logit contributions
 narrative+0.133
ience+0.108
oral+0.107
 severe+0.106
smart+0.103
Neg logit contributions
 But-0.178
 Just-0.159
 ask-0.158
 Mir-0.155
 Mil-0.155
Token today
Token ID1909
Feature activation+3.34e-02

Pos logit contributions
 narrative+0.315
oral+0.303
 severe+0.274
 obstacle+0.258
ience+0.239
Neg logit contributions
 Mil-0.267
 But-0.262
 Mir-0.257
 N-0.252
 Just-0.247
Token.
Token ID13
Feature activation-3.07e-03

Pos logit contributions
 Mil+0.048
 But+0.047
 Mir+0.045
 N+0.045
 Just+0.044
Neg logit contributions
 narrative-0.058
oral-0.056
 severe-0.050
 obstacle-0.047
 mercy-0.044
Token Sharing
Token ID36644
Feature activation+4.47e-03

Pos logit contributions
 narrative+0.006
oral+0.005
smart+0.004
der+0.004
ience+0.004
Neg logit contributions
 Mil-0.004
 peek-0.004
nes-0.004
 imagine-0.004
 pretended-0.004
Token is
Token ID318
Feature activation-1.61e-02

Pos logit contributions
 But+0.007
 My+0.007
 Mil+0.007
 ask+0.006
 Mir+0.006
Neg logit contributions
 narrative-0.007
oral-0.007
 severe-0.006
 obstacle-0.006
 mercy-0.006
Token caring
Token ID18088
Feature activation-7.15e-02

Pos logit contributions
 narrative+0.029
oral+0.028
iness+0.025
 severe+0.023
 obstacle+0.023
Neg logit contributions
 ask-0.022
 But-0.021
 Mir-0.021
 Mil-0.021
 L-0.020
Token less
Token ID1342
Feature activation+5.39e-02

Pos logit contributions
 narrative+0.046
oral+0.044
 severe+0.034
 obstacle+0.033
smart+0.032
Neg logit contributions
 Mil-0.053
 But-0.051
 Mir-0.050
 N-0.050
 Just-0.049
Token gr
Token ID1036
Feature activation+2.31e-02

Pos logit contributions
 Mil+0.057
 But+0.053
 N+0.052
 Mir+0.051
 Just+0.050
Neg logit contributions
 narrative-0.114
oral-0.112
 severe-0.093
 obstacle-0.090
oni-0.089
Tokenumpy
Token ID32152
Feature activation+1.34e-02

Pos logit contributions
 Mil+0.029
 But+0.028
 Just+0.027
 N+0.027
 Suddenly+0.026
Neg logit contributions
 narrative-0.042
oral-0.041
 severe-0.034
 mercy-0.033
 obstacle-0.032
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
 But+0.018
 Mil+0.018
 Just+0.017
 N+0.017
 Suddenly+0.016
Neg logit contributions
 narrative-0.024
oral-0.024
 severe-0.020
 obstacle-0.018
oni-0.018
Token The
Token ID383
Feature activation-2.63e-04

Pos logit contributions
 narrative+0.025
oral+0.024
 severe+0.021
 obstacle+0.020
smart+0.019
Neg logit contributions
 Mil-0.017
 Mir-0.016
 But-0.016
 Del-0.015
 N-0.015
Token moral
Token ID6573
Feature activation-1.68e-01

Pos logit contributions
 narrative+0.000
smart+0.000
oral+0.000
 severe+0.000
der+0.000
Neg logit contributions
 Mil-0.000
 But-0.000
 N-0.000
 ask-0.000
Gu-0.000
Token of
Token ID286
Feature activation-2.79e-02

Pos logit contributions
 narrative+0.263
oral+0.245
 severe+0.209
 obstacle+0.200
ience+0.193
Neg logit contributions
 Mil-0.267
 But-0.258
 Mir-0.253
Gu-0.246
 Del-0.246
Token the
Token ID262
Feature activation-7.81e-02

Pos logit contributions
 narrative+0.068
oral+0.064
der+0.062
ience+0.061
 obstacle+0.060
Neg logit contributions
 Mir-0.022
 Mil-0.021
 But-0.020
 L-0.020
 Sara-0.019
Token story
Token ID1621
Feature activation-1.68e-01

Pos logit contributions
 narrative+0.065
der+0.059
 severe+0.059
smart+0.058
oral+0.056
Neg logit contributions
 But-0.158
 Del-0.153
 Mil-0.153
Gu-0.151
 N-0.150
Token is
Token ID318
Feature activation-4.54e-02

Pos logit contributions
 severe+0.317
 narrative+0.300
 obstacle+0.284
agons+0.283
oral+0.282
Neg logit contributions
 But-0.223
 Just-0.179
 I-0.174
 ask-0.166
 My-0.164
Token that
Token ID326
Feature activation-2.92e-02

Pos logit contributions
 narrative+0.091
oral+0.079
 severe+0.072
iness+0.072
 obstacle+0.072
Neg logit contributions
 But-0.059
 Mil-0.056
 Just-0.055
 Mir-0.053
 ask-0.053
Token.
Token ID13
Feature activation-2.31e-02

Pos logit contributions
 narrative+0.078
oral+0.074
 severe+0.060
 obstacle+0.056
smart+0.055
Neg logit contributions
 Mil-0.102
 Mir-0.096
Gu-0.094
 But-0.093
 N-0.093
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
oral+0.034
 narrative+0.031
:+0.025
der+0.023
 story+0.023
Neg logit contributions
 Mil-0.035
 groom-0.034
Gu-0.034
 imagine-0.033
 pretended-0.033
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
oral+0.021
 narrative+0.020
 severe+0.017
smart+0.015
 obstacle+0.015
Neg logit contributions
 Mil-0.020
 Mir-0.019
 N-0.018
Gu-0.018
 Del-0.018
TokenAt
Token ID2953
Feature activation+4.30e-02

Pos logit contributions
oral+0.030
 narrative+0.029
 severe+0.024
smart+0.021
 obstacle+0.021
Neg logit contributions
 Mil-0.025
 Mir-0.023
 N-0.023
 Pand-0.022
 But-0.022
Token the
Token ID262
Feature activation-3.46e-02

Pos logit contributions
 Mil+0.055
 Mir+0.052
 Del+0.048
 N+0.048
 But+0.047
Neg logit contributions
 narrative-0.086
oral-0.083
 severe-0.072
smart-0.068
iness-0.068
Token end
Token ID886
Feature activation-1.31e-01

Pos logit contributions
 narrative+0.025
oral+0.023
 severe+0.012
 obstacle+0.012
 problem+0.011
Neg logit contributions
 Mil-0.086
 Mir-0.082
Gu-0.081
 Just-0.081
 But-0.080
Token of
Token ID286
Feature activation+2.07e-02

Pos logit contributions
oral+0.113
 narrative+0.105
 story+0.073
 problem+0.073
 severe+0.073
Neg logit contributions
Gu-0.297
 Mil-0.285
Whe-0.278
 N-0.274
 Just-0.270
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 Mil+0.023
 Mir+0.022
 But+0.020
 Del+0.020
 N+0.019
Neg logit contributions
 narrative-0.045
oral-0.044
 severe-0.038
ience-0.038
 obstacle-0.038
Token day
Token ID1110
Feature activation-3.04e-02

Pos logit contributions
 narrative+0.035
oral+0.033
 severe+0.026
 obstacle+0.025
 problem+0.025
Neg logit contributions
 Mil-0.040
 Mir-0.038
Gu-0.037
 Del-0.036
 But-0.036
Token,
Token ID11
Feature activation-1.97e-02

Pos logit contributions
 narrative+0.023
oral+0.022
 story+0.016
 problem+0.016
ing+0.016
Neg logit contributions
Gu-0.069
 Mil-0.063
Whe-0.062
 Pand-0.062
 Mir-0.060
Token Jane
Token ID12091
Feature activation+4.10e-02

Pos logit contributions
 narrative+0.042
oral+0.040
 severe+0.036
iness+0.034
 problem+0.034
Neg logit contributions
Gu-0.019
 Mil-0.018
 Pand-0.017
 Mir-0.016
 Just-0.016
Token and
Token ID290
Feature activation-4.24e-02

Pos logit contributions
 Mil+0.054
 Mir+0.051
 But+0.049
 N+0.049
Gu+0.049
Neg logit contributions
 narrative-0.040
oral-0.038
 severe-0.031
 obstacle-0.028
smart-0.028
Token cl
Token ID537
Feature activation+2.38e-02

Pos logit contributions
 narrative+0.079
oral+0.075
 severe+0.066
 obstacle+0.062
smart+0.061
Neg logit contributions
 Mil-0.056
 Mir-0.052
Gu-0.050
 But-0.050
 N-0.050
Tokenapped
Token ID6320
Feature activation-3.06e-03

Pos logit contributions
 Mil+0.040
 Mir+0.038
Gu+0.037
 But+0.036
 N+0.035
Neg logit contributions
 narrative-0.037
oral-0.035
 severe-0.028
smart-0.028
iness-0.027
Token!
Token ID0
Feature activation-4.45e-02

Pos logit contributions
 narrative+0.005
oral+0.005
 severe+0.004
 obstacle+0.004
oni+0.004
Neg logit contributions
 Mil-0.005
 Mir-0.005
 But-0.005
 N-0.004
 Del-0.004
Token The
Token ID383
Feature activation-1.03e-02

Pos logit contributions
 narrative+0.089
oral+0.087
 severe+0.073
smart+0.071
 obstacle+0.069
Neg logit contributions
 Mil-0.052
 Mir-0.047
Gu-0.047
 Del-0.044
 N-0.044
Token hard
Token ID1327
Feature activation-1.41e-01

Pos logit contributions
 narrative+0.019
oral+0.018
 severe+0.016
smart+0.016
iness+0.015
Neg logit contributions
 Mil-0.013
 Mir-0.012
 But-0.012
 Del-0.012
 N-0.012
Token fire
Token ID2046
Feature activation+4.34e-02

Pos logit contributions
 narrative+0.233
oral+0.222
 severe+0.187
 obstacle+0.175
smart+0.172
Neg logit contributions
 Mil-0.217
 Mir-0.205
 But-0.198
 N-0.197
 Del-0.196
Tokenwork
Token ID1818
Feature activation-3.27e-02

Pos logit contributions
 Mil+0.076
 Mir+0.072
 But+0.070
 Del+0.070
 N+0.069
Neg logit contributions
 narrative-0.063
oral-0.059
 severe-0.049
 obstacle-0.045
smart-0.044
Token looked
Token ID3114
Feature activation-3.00e-02

Pos logit contributions
 narrative+0.054
oral+0.052
 severe+0.044
 obstacle+0.041
smart+0.041
Neg logit contributions
 Mil-0.050
 Mir-0.047
 But-0.045
 N-0.045
 Del-0.045
Token beautiful
Token ID4950
Feature activation+2.01e-02

Pos logit contributions
 narrative+0.056
oral+0.054
 severe+0.045
 obstacle+0.041
smart+0.041
Neg logit contributions
 Mil-0.041
 But-0.039
 N-0.038
 Mir-0.038
 Del-0.038
Token in
Token ID287
Feature activation+7.57e-03

Pos logit contributions
 Mil+0.029
 But+0.028
 N+0.027
 Mir+0.027
 Just+0.027
Neg logit contributions
 narrative-0.036
oral-0.036
 severe-0.029
 obstacle-0.027
smart-0.026
Token yarn
Token ID21181
Feature activation-1.05e-02

Pos logit contributions
 narrative+0.005
oral+0.004
 severe+0.004
 obstacle+0.004
 problem+0.003
Neg logit contributions
 Mil-0.004
 Mir-0.004
 Del-0.004
 N-0.004
Gu-0.004
Token.
Token ID13
Feature activation-8.78e-03

Pos logit contributions
 narrative+0.017
oral+0.017
 severe+0.014
 obstacle+0.013
smart+0.013
Neg logit contributions
 Mil-0.016
 Mir-0.016
 But-0.015
 N-0.015
 Just-0.015
Token She
Token ID1375
Feature activation-1.41e-02

Pos logit contributions
oral+0.015
 narrative+0.015
der+0.014
oni+0.013
:+0.013
Neg logit contributions
nes-0.010
 Mil-0.010
Gu-0.009
 Mir-0.009
 seen-0.008
Token worked
Token ID3111
Feature activation-9.19e-02

Pos logit contributions
 narrative+0.026
oral+0.025
 severe+0.021
oni+0.020
ience+0.020
Neg logit contributions
 Mil-0.018
Gu-0.017
 Mir-0.017
 Del-0.016
 N-0.016
Token very
Token ID845
Feature activation-7.52e-02

Pos logit contributions
 narrative+0.158
oral+0.154
 obstacle+0.120
 severe+0.120
 mercy+0.119
Neg logit contributions
 But-0.150
 Mil-0.142
 Suddenly-0.142
 Just-0.137
 L-0.135
Token hard
Token ID1327
Feature activation-1.68e-01

Pos logit contributions
 narrative+0.093
oral+0.087
 severe+0.067
 pistols+0.064
 obstacle+0.064
Neg logit contributions
 But-0.151
 Mil-0.148
 Mir-0.142
 Just-0.138
 Suddenly-0.137
Token.
Token ID13
Feature activation-9.82e-03

Pos logit contributions
 narrative+0.346
oral+0.333
 severe+0.277
 mercy+0.270
 obstacle+0.267
Neg logit contributions
 But-0.222
 Suddenly-0.200
 Just-0.199
 Mil-0.198
 Mir-0.187
Token But
Token ID887
Feature activation+5.02e-02

Pos logit contributions
oral+0.016
 narrative+0.016
der+0.015
:+0.014
oni+0.014
Neg logit contributions
nes-0.013
 Mil-0.011
Gu-0.011
 Mir-0.010
 mate-0.010
Token,
Token ID11
Feature activation-6.82e-03

Pos logit contributions
 Mil+0.070
Gu+0.068
 Mir+0.065
 Del+0.061
 Pand+0.061
Neg logit contributions
 narrative-0.086
oral-0.080
 severe-0.071
 problem-0.070
der-0.066
Token eventually
Token ID4191
Feature activation+8.03e-03

Pos logit contributions
 narrative+0.014
oral+0.014
 severe+0.012
 obstacle+0.011
 problem+0.011
Neg logit contributions
 Mil-0.007
 Mir-0.007
Gu-0.007
 Del-0.006
 But-0.006
Token she
Token ID673
Feature activation+4.75e-03

Pos logit contributions
 Mil+0.014
 Mir+0.013
Gu+0.013
 But+0.013
 Just+0.013
Neg logit contributions
 narrative-0.012
oral-0.011
 severe-0.009
 problem-0.008
 obstacle-0.008