TOP ACTIVATIONS
MAX = 0.199

a tall tree . Un bek now st to him ,
Jane hopped on the sees aw and before long she was
, slide and see - saw . But their favorite thing
seat on the see - saw . She ran to it
down on the see - saw . After
, slide and see - saw . But when
t about the see - saw and how much fun it
It sounded like " c aw , c aw , c
They get off the sees aw . They run to the
the slide and the sees aw . But there was one
They balance on the sees aw . They cl ap .
blew and made the sees aw go up and down very
" c aw , c aw , c aw ".
pulled . The pigeon squ aw ked and bit her finger
the rock like a sees aw . She puts the jar
They run to the sees aw . They sit on each
, c aw , c aw ". L ily
the slide and the sees aw . They also liked to
want to make the sees aw stay still . They move
want to try the sees aw . They run

MIN ACTIVATIONS
MIN = -0.212

. He would take big breaths of the fresh air and
could take lots of deep breaths and get lots of oxygen
. She kept taking deep breaths until all her energy was
eyes and took some deep breaths . She felt the warm
feet . She took big breaths of fresh air and her
he tried to take deep breaths to calm down . Tim
with some big , big breaths that filled his balloon with
They could hear their own breaths . " Wow
moments and took some deep breaths , before carefully putting her
girl took a few deep breaths and started to feel better
his muscles , taking deep breaths as he did so .
to take nice , deep breaths . Slowly , he felt
face and he took deep breaths . At the
the big boy take deep breaths and count to ten instead
her eyes and took deep breaths . Soon , she felt
to trem ble and their breaths became slow .
they do it . He imag ines he is one of
listen . She threw a tant rum in the store .
kept crying and throwing a tant rum . Finally
them and never threw a tant rum again . <|endoftext|> Once

INTERVAL 0.096 - 0.199
CONTAINS 0.550%

thank God for making the temple . They say goodbye to
delicious everything inside the grill smelled ! " Do
c uddled . They forgot about the clock and the cold
are you ? I 'm coming to find you !" She
he was glad he had come out to enjoy it .

INTERVAL -0.007 - 0.096
CONTAINS 60.729%

The fish said , " Let 's find some food
believe . The lions are awesome . Thank you for taking
!" So Bob and the balloon flew up in the sky
lush green patch of grass and plants in the middle of
They look at each other . They look at the empty

INTERVAL -0.109 - -0.007
CONTAINS 38.367%

ily felt sad and worried that her mom would be upset
upon a time , there was a helpful lawyer named Tim
, there was a little boy named Tim my . Tim
the house . They were fast . They made
it heard the sound of music coming from the lake .

INTERVAL -0.212 - -0.109
CONTAINS 0.353%

" It 's shampoo ," replied mom my .
scared that he started to cry . A
soon . E lla smiled as she realized
so happy that she fell asleep with a big smile on
It 's okay , Tom ," Ben says . " We
Token a
Token ID257
Feature activation+3.53e-04

Pos logit contributions
 exactly+0.060
ook+0.056
ooks+0.054
bles+0.052
ents+0.051
Neg logit contributions
Mine-0.056
 care-0.052
Stop-0.049
hered-0.047
 breaths-0.047
Token tall
Token ID7331
Feature activation+4.84e-02

Pos logit contributions
 exactly+0.001
ooks+0.001
ook+0.001
ents+0.001
ups+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
 bath-0.001
 shock-0.001
Token tree
Token ID5509
Feature activation+2.86e-02

Pos logit contributions
 exactly+0.087
ook+0.079
ooks+0.078
bles+0.074
ossom+0.073
Neg logit contributions
Mine-0.084
 care-0.079
Stop-0.074
hered-0.071
 breaths-0.070
Token.
Token ID13
Feature activation+4.39e-03

Pos logit contributions
 exactly+0.059
ooks+0.056
ook+0.055
ae+0.053
ossom+0.053
Neg logit contributions
Mine-0.037
 care-0.034
hered-0.033
|-0.031
Stop-0.030
Token Un
Token ID791
Feature activation+1.05e-01

Pos logit contributions
 exactly+0.010
ook+0.009
ooks+0.009
anut+0.009
 impressed+0.009
Neg logit contributions
Mine-0.006
 care-0.005
Stop-0.005
|-0.004
hered-0.004
Tokenbek
Token ID47083
Feature activation+1.99e-01

Pos logit contributions
 exactly+0.181
ooks+0.168
ook+0.157
iring+0.154
ae+0.154
Neg logit contributions
Mine-0.184
 care-0.174
|-0.170
hered-0.168
Stop-0.164
Tokennow
Token ID2197
Feature activation-1.59e-02

Pos logit contributions
 exactly+0.392
ooks+0.375
 impressed+0.361
ae+0.358
anut+0.355
Neg logit contributions
Mine-0.271
ened-0.237
hered-0.234
 breaths-0.233
 care-0.230
Tokenst
Token ID301
Feature activation+2.75e-02

Pos logit contributions
Mine+0.019
 care+0.019
Stop+0.017
hered+0.016
 breaths+0.016
Neg logit contributions
 exactly-0.035
ooks-0.034
ook-0.034
anut-0.031
ossom-0.031
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
 exactly+0.051
ook+0.047
ooks+0.046
anut+0.045
 impressed+0.045
Neg logit contributions
Mine-0.043
 care-0.041
Stop-0.037
hered-0.036
 breaths-0.036
Token him
Token ID683
Feature activation-4.89e-02

Pos logit contributions
Mine+0.027
 care+0.026
Stop+0.024
hered+0.023
 breaths+0.022
Neg logit contributions
 exactly-0.034
ook-0.031
ooks-0.031
bles-0.029
ossom-0.029
Token,
Token ID11
Feature activation+3.83e-03

Pos logit contributions
Mine+0.068
 care+0.067
Stop+0.058
hered+0.058
 breaths+0.056
Neg logit contributions
 exactly-0.097
ooks-0.094
ook-0.094
bles-0.088
anut-0.088
Token Jane
Token ID12091
Feature activation+5.86e-02

Pos logit contributions
 care+0.001
Mine+0.001
Stop+0.001
 imag+0.001
 breaths+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
ae-0.001
anut-0.001
Token hopped
Token ID45230
Feature activation+3.96e-02

Pos logit contributions
 exactly+0.107
ook+0.096
ents+0.091
ooks+0.090
ae+0.090
Neg logit contributions
Mine-0.099
 care-0.090
Stop-0.088
hered-0.084
 breaths-0.084
Token on
Token ID319
Feature activation+3.63e-02

Pos logit contributions
 exactly+0.064
ook+0.057
bles+0.054
ooks+0.054
ossom+0.053
Neg logit contributions
Mine-0.075
 care-0.070
Stop-0.067
 imag-0.063
 breaths-0.063
Token the
Token ID262
Feature activation+2.86e-02

Pos logit contributions
 exactly+0.068
ook+0.060
ooks+0.058
 impressed+0.056
ents+0.056
Neg logit contributions
Mine-0.058
 care-0.055
Stop-0.053
hered-0.051
|-0.051
Token sees
Token ID7224
Feature activation+1.77e-02

Pos logit contributions
 exactly+0.049
ook+0.044
ooks+0.043
bles+0.042
ents+0.041
Neg logit contributions
Mine-0.051
 care-0.048
Stop-0.045
hered-0.044
|-0.043
Tokenaw
Token ID707
Feature activation+1.89e-01

Pos logit contributions
 exactly+0.031
ook+0.029
ooks+0.028
bles+0.026
ents+0.026
Neg logit contributions
Mine-0.031
 care-0.029
Stop-0.027
hered-0.026
 breaths-0.026
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
 exactly+0.387
ooks+0.383
ook+0.376
 impressed+0.336
anut+0.334
Neg logit contributions
Mine-0.246
hered-0.239
 care-0.229
Stop-0.222
 him-0.214
Token before
Token ID878
Feature activation+2.99e-02

Pos logit contributions
 exactly+0.030
ook+0.029
ooks+0.028
iring+0.027
bles+0.026
Neg logit contributions
Mine-0.022
 care-0.021
Stop-0.019
hered-0.018
 breaths-0.018
Token long
Token ID890
Feature activation-4.87e-02

Pos logit contributions
 exactly+0.050
ook+0.044
ooks+0.042
 impressed+0.042
ents+0.042
Neg logit contributions
Mine-0.055
 care-0.051
Stop-0.048
|-0.047
 breaths-0.047
Token she
Token ID673
Feature activation+2.48e-02

Pos logit contributions
Mine+0.093
 care+0.083
Stop+0.082
|+0.082
hered+0.076
Neg logit contributions
 exactly-0.080
ook-0.070
ents-0.067
ooks-0.067
bles-0.066
Token was
Token ID373
Feature activation-2.63e-02

Pos logit contributions
 exactly+0.048
ook+0.045
ooks+0.043
bles+0.042
ents+0.041
Neg logit contributions
Mine-0.039
 care-0.036
Stop-0.034
hered-0.032
 breaths-0.032
Token,
Token ID11
Feature activation+5.01e-03

Pos logit contributions
 exactly+0.070
ooks+0.067
ook+0.065
 impressed+0.064
ossom+0.063
Neg logit contributions
Mine-0.045
 care-0.043
hered-0.039
Stop-0.038
|-0.037
Token slide
Token ID10649
Feature activation+2.71e-02

Pos logit contributions
 exactly+0.010
ook+0.009
ooks+0.009
ossom+0.009
iring+0.008
Neg logit contributions
Mine-0.008
 care-0.008
Stop-0.007
|-0.007
hered-0.007
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 exactly+0.060
ooks+0.058
ossom+0.056
ook+0.056
ae+0.056
Neg logit contributions
Mine-0.034
 care-0.032
Stop-0.030
 him-0.028
|-0.027
Token see
Token ID766
Feature activation+3.89e-02

Pos logit contributions
 exactly+0.020
ook+0.018
ooks+0.017
ents+0.017
bles+0.017
Neg logit contributions
Mine-0.016
 care-0.015
Stop-0.014
hered-0.014
 breaths-0.014
Token-
Token ID12
Feature activation+3.32e-02

Pos logit contributions
ossom+0.076
ae+0.075
ooks+0.075
ook+0.074
 exactly+0.073
Neg logit contributions
Mine-0.057
 care-0.052
 him-0.051
Stop-0.050
 imag-0.042
Tokensaw
Token ID43439
Feature activation+1.87e-01

Pos logit contributions
 exactly+0.034
ooks+0.026
ook+0.026
aze+0.023
ae+0.023
Neg logit contributions
Mine-0.087
 care-0.084
Stop-0.083
|-0.078
hered-0.077
Token.
Token ID13
Feature activation+6.62e-03

Pos logit contributions
 exactly+0.378
ooks+0.359
ook+0.359
ossom+0.343
ae+0.335
Neg logit contributions
Mine-0.272
 care-0.259
Stop-0.234
hered-0.222
 breaths-0.218
Token But
Token ID887
Feature activation-2.37e-02

Pos logit contributions
 exactly+0.015
ook+0.014
ooks+0.013
bles+0.013
ents+0.013
Neg logit contributions
Mine-0.009
 care-0.008
Stop-0.007
hered-0.007
 breaths-0.007
Token their
Token ID511
Feature activation-1.97e-02

Pos logit contributions
Mine+0.034
 care+0.032
Stop+0.029
hered+0.028
 breaths+0.028
Neg logit contributions
 exactly-0.049
ook-0.046
ooks-0.044
bles-0.043
ents-0.042
Token favorite
Token ID4004
Feature activation-2.58e-02

Pos logit contributions
Mine+0.044
 care+0.042
Stop+0.039
 breaths+0.038
 imag+0.038
Neg logit contributions
 exactly-0.026
ook-0.024
ooks-0.023
ossom-0.021
ents-0.021
Token thing
Token ID1517
Feature activation+1.17e-02

Pos logit contributions
Mine+0.043
 care+0.040
Stop+0.038
hered+0.037
 breaths+0.037
Neg logit contributions
 exactly-0.047
ook-0.043
ooks-0.041
bles-0.040
ents-0.039
Token seat
Token ID5852
Feature activation-1.15e-03

Pos logit contributions
Mine+0.033
 care+0.031
Stop+0.029
hered+0.028
 breaths+0.027
Neg logit contributions
 exactly-0.031
ook-0.029
ooks-0.028
bles-0.026
ents-0.026
Token on
Token ID319
Feature activation+3.25e-02

Pos logit contributions
Mine+0.002
 care+0.002
hered+0.002
Stop+0.002
storm+0.001
Neg logit contributions
 exactly-0.002
ooks-0.002
ook-0.002
ossom-0.002
anut-0.002
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
 exactly+0.047
ook+0.039
ents+0.038
ich+0.037
ooks+0.036
Neg logit contributions
Mine-0.062
 care-0.061
 imag-0.061
hered-0.058
 breaths-0.058
Token see
Token ID766
Feature activation+4.08e-02

Pos logit contributions
 exactly+0.036
ook+0.034
ooks+0.033
ossom+0.031
 impressed+0.030
Neg logit contributions
Mine-0.028
 care-0.026
Stop-0.025
hered-0.023
|-0.023
Token-
Token ID12
Feature activation+3.32e-02

Pos logit contributions
 exactly+0.079
ook+0.074
ooks+0.073
ossom+0.071
anut+0.070
Neg logit contributions
Mine-0.064
 care-0.057
Stop-0.057
hered-0.053
Hey-0.050
Tokensaw
Token ID43439
Feature activation+1.86e-01

Pos logit contributions
 exactly+0.051
ooks+0.045
ook+0.043
 impressed+0.042
aze+0.042
Neg logit contributions
Mine-0.068
 care-0.063
Stop-0.062
|-0.060
Hey-0.057
Token.
Token ID13
Feature activation-4.06e-04

Pos logit contributions
 exactly+0.321
ook+0.289
ooks+0.272
ents+0.264
bles+0.263
Neg logit contributions
Mine-0.334
 care-0.314
Stop-0.295
 breaths-0.285
hered-0.284
Token She
Token ID1375
Feature activation+2.96e-02

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.000
hered+0.000
 breaths+0.000
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
bles-0.001
anut-0.001
Token ran
Token ID4966
Feature activation-9.18e-03

Pos logit contributions
 exactly+0.066
ook+0.063
ooks+0.062
ossom+0.059
iring+0.059
Neg logit contributions
Mine-0.038
 care-0.035
Stop-0.032
 breaths-0.029
 imag-0.029
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
Mine+0.015
 care+0.014
Stop+0.013
hered+0.013
 breaths+0.013
Neg logit contributions
 exactly-0.017
ook-0.016
ooks-0.015
bles-0.015
ossom-0.015
Token it
Token ID340
Feature activation-1.05e-02

Pos logit contributions
Mine+0.029
 care+0.026
Stop+0.025
hered+0.025
 breaths+0.025
Neg logit contributions
 exactly-0.035
ook-0.033
ooks-0.031
 impressed-0.030
ents-0.030
Token down
Token ID866
Feature activation+2.97e-02

Pos logit contributions
 exactly+0.008
ooks+0.007
ook+0.007
iring+0.007
bles+0.007
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.005
|-0.005
hered-0.005
Token on
Token ID319
Feature activation+3.29e-02

Pos logit contributions
 exactly+0.058
ook+0.054
ooks+0.053
bles+0.051
anut+0.050
Neg logit contributions
Mine-0.046
 care-0.044
Stop-0.040
hered-0.039
 breaths-0.038
Token the
Token ID262
Feature activation+1.79e-02

Pos logit contributions
 exactly+0.063
ook+0.059
ooks+0.057
bles+0.054
ents+0.054
Neg logit contributions
Mine-0.053
 care-0.049
Stop-0.046
hered-0.044
 breaths-0.043
Token see
Token ID766
Feature activation+3.93e-02

Pos logit contributions
 exactly+0.031
ook+0.029
ooks+0.028
ossom+0.027
bles+0.026
Neg logit contributions
Mine-0.032
 care-0.030
Stop-0.028
hered-0.027
 breaths-0.027
Token-
Token ID12
Feature activation+3.33e-02

Pos logit contributions
 exactly+0.060
ook+0.058
ooks+0.057
ossom+0.055
anut+0.054
Neg logit contributions
Mine-0.076
 care-0.070
Stop-0.068
hered-0.064
 him-0.063
Tokensaw
Token ID43439
Feature activation+1.85e-01

Pos logit contributions
 exactly+0.043
ooks+0.037
bles+0.033
aze+0.033
 impressed+0.033
Neg logit contributions
Mine-0.076
 care-0.073
Stop-0.070
|-0.069
 him-0.066
Token.
Token ID13
Feature activation-3.54e-03

Pos logit contributions
 exactly+0.336
ook+0.308
ooks+0.295
bles+0.284
ents+0.283
Neg logit contributions
Mine-0.314
 care-0.294
Stop-0.275
hered-0.265
 breaths-0.263
Token
Token ID220
Feature activation+2.59e-02

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
 breaths+0.005
hered+0.005
Neg logit contributions
 exactly-0.006
ook-0.006
ooks-0.006
ents-0.005
ossom-0.005
Token
Token ID198
Feature activation+4.23e-03

Pos logit contributions
 exactly+0.056
ook+0.051
anut+0.051
ooks+0.050
ossom+0.050
Neg logit contributions
Mine-0.032
 care-0.030
Stop-0.027
|-0.027
hered-0.026
Token
Token ID198
Feature activation+3.70e-03

Pos logit contributions
 exactly+0.008
ook+0.008
ooks+0.007
bles+0.007
ents+0.007
Neg logit contributions
Mine-0.007
 care-0.006
Stop-0.006
hered-0.005
|-0.005
TokenAfter
Token ID3260
Feature activation+1.10e-02

Pos logit contributions
 exactly+0.005
ook+0.005
ooks+0.005
anut+0.004
bles+0.004
Neg logit contributions
Mine-0.008
 care-0.007
Stop-0.007
|-0.007
hered-0.007
Token,
Token ID11
Feature activation-1.24e-05

Pos logit contributions
 exactly+0.062
ook+0.058
ooks+0.057
bles+0.054
ossom+0.054
Neg logit contributions
Mine-0.054
 care-0.051
Stop-0.047
hered-0.046
 breaths-0.045
Token slide
Token ID10649
Feature activation+2.84e-02

Pos logit contributions
Mine+0.000
 care+0.000
Stop+0.000
|+0.000
 breaths+0.000
Neg logit contributions
 exactly-0.000
ook-0.000
ooks-0.000
ossom-0.000
bles-0.000
Token and
Token ID290
Feature activation+6.65e-03

Pos logit contributions
 exactly+0.055
ook+0.052
ooks+0.052
ossom+0.049
 impressed+0.048
Neg logit contributions
Mine-0.044
 care-0.041
Stop-0.038
hered-0.037
 breaths-0.036
Token see
Token ID766
Feature activation+3.83e-02

Pos logit contributions
 exactly+0.013
ook+0.011
ooks+0.011
ents+0.011
 impressed+0.011
Neg logit contributions
Mine-0.011
 care-0.010
Stop-0.009
hered-0.009
 breaths-0.009
Token-
Token ID12
Feature activation+3.37e-02

Pos logit contributions
 exactly+0.064
ook+0.063
ooks+0.063
ossom+0.062
anut+0.059
Neg logit contributions
Mine-0.065
 care-0.062
Stop-0.057
 him-0.057
 breaths-0.053
Tokensaw
Token ID43439
Feature activation+1.84e-01

Pos logit contributions
 exactly+0.042
ooks+0.035
ook+0.035
 impressed+0.033
aze+0.031
Neg logit contributions
Mine-0.079
 care-0.077
Stop-0.075
|-0.070
hered-0.070
Token.
Token ID13
Feature activation-5.69e-04

Pos logit contributions
 exactly+0.352
ook+0.332
ooks+0.327
ossom+0.310
bles+0.307
Neg logit contributions
Mine-0.290
 care-0.274
Stop-0.253
hered-0.243
 breaths-0.239
Token
Token ID198
Feature activation+7.28e-03

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
 breaths+0.001
hered+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
bles-0.001
ents-0.001
Token
Token ID198
Feature activation+6.89e-03

Pos logit contributions
 exactly+0.013
ook+0.012
ooks+0.011
 impressed+0.011
ents+0.011
Neg logit contributions
Mine-0.013
 care-0.012
 breaths-0.012
Stop-0.011
hered-0.011
TokenBut
Token ID1537
Feature activation-5.41e-03

Pos logit contributions
 exactly+0.010
ook+0.009
ooks+0.008
bles+0.008
ents+0.008
Neg logit contributions
Mine-0.015
 care-0.014
Stop-0.013
hered-0.013
|-0.012
Token when
Token ID618
Feature activation+3.16e-02

Pos logit contributions
Mine+0.008
 care+0.008
Stop+0.007
hered+0.007
 breaths+0.006
Neg logit contributions
 exactly-0.010
ook-0.010
ooks-0.010
ossom-0.010
bles-0.010
Tokent
Token ID83
Feature activation-6.25e-02

Pos logit contributions
 exactly+0.019
ook+0.018
ooks+0.018
bles+0.017
ossom+0.017
Neg logit contributions
Mine-0.009
 care-0.009
Stop-0.008
hered-0.008
|-0.007
Token about
Token ID546
Feature activation+1.16e-01

Pos logit contributions
Mine+0.128
 care+0.117
Stop+0.114
hered+0.112
 breaths+0.110
Neg logit contributions
 exactly-0.095
ook-0.083
ooks-0.079
ents-0.077
 impressed-0.075
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
 exactly+0.215
ook+0.213
ooks+0.207
ossom+0.199
bles+0.195
Neg logit contributions
Mine-0.182
 care-0.174
Stop-0.157
 breaths-0.153
 imag-0.150
Token see
Token ID766
Feature activation+3.77e-02

Pos logit contributions
 exactly+0.030
ook+0.029
ooks+0.029
ossom+0.027
bles+0.026
Neg logit contributions
Mine-0.027
 care-0.026
Stop-0.024
 imag-0.022
 applause-0.022
Token-
Token ID12
Feature activation+3.26e-02

Pos logit contributions
 exactly+0.071
ook+0.068
ooks+0.066
ossom+0.063
bles+0.063
Neg logit contributions
Mine-0.059
 care-0.056
Stop-0.053
hered-0.050
 breaths-0.049
Tokensaw
Token ID43439
Feature activation+1.84e-01

Pos logit contributions
 exactly+0.064
ooks+0.062
ook+0.058
bles+0.058
aze+0.056
Neg logit contributions
Mine-0.051
 care-0.050
Stop-0.047
|-0.045
 him-0.043
Token and
Token ID290
Feature activation+3.00e-03

Pos logit contributions
 exactly+0.340
ook+0.322
ooks+0.316
bles+0.298
ossom+0.297
Neg logit contributions
Mine-0.298
 care-0.283
Stop-0.262
hered-0.253
 breaths-0.247
Token how
Token ID703
Feature activation+2.89e-02

Pos logit contributions
 exactly+0.005
ook+0.005
ooks+0.004
 impressed+0.004
ents+0.004
Neg logit contributions
Mine-0.005
 care-0.005
hered-0.005
Stop-0.005
|-0.005
Token much
Token ID881
Feature activation+3.32e-02

Pos logit contributions
 exactly+0.048
ook+0.045
ooks+0.045
bles+0.043
anut+0.042
Neg logit contributions
 care-0.052
Mine-0.049
Stop-0.044
 him-0.043
|-0.043
Token fun
Token ID1257
Feature activation-3.93e-02

Pos logit contributions
 exactly+0.053
ook+0.043
aches+0.043
 impressed+0.042
ents+0.042
Neg logit contributions
Mine-0.063
hered-0.057
|-0.057
 care-0.055
Stop-0.055
Token it
Token ID340
Feature activation-1.02e-02

Pos logit contributions
Mine+0.072
 care+0.066
Stop+0.063
hered+0.062
|+0.062
Neg logit contributions
 exactly-0.067
ook-0.059
ooks-0.057
ents-0.055
bles-0.055
Token It
Token ID632
Feature activation-2.60e-02

Pos logit contributions
 exactly+0.009
ook+0.008
ooks+0.008
bles+0.008
ossom+0.008
Neg logit contributions
Mine-0.006
 care-0.005
Stop-0.005
hered-0.005
 breaths-0.005
Token sounded
Token ID14846
Feature activation+5.85e-02

Pos logit contributions
Mine+0.043
 care+0.040
hered+0.038
Stop+0.037
 breaths+0.036
Neg logit contributions
 exactly-0.048
ook-0.042
 impressed-0.042
ents-0.040
bles-0.039
Token like
Token ID588
Feature activation+9.45e-03

Pos logit contributions
 exactly+0.121
ook+0.112
ooks+0.109
 impressed+0.104
bles+0.104
Neg logit contributions
Mine-0.086
 care-0.079
Stop-0.073
hered-0.071
 breaths-0.069
Token "
Token ID366
Feature activation+4.32e-02

Pos logit contributions
 exactly+0.017
ook+0.014
 impressed+0.014
ooks+0.014
ents+0.014
Neg logit contributions
Mine-0.017
 care-0.015
|-0.015
hered-0.015
Stop-0.014
Tokenc
Token ID66
Feature activation+9.07e-02

Pos logit contributions
 exactly+0.078
ook+0.074
ents+0.070
ossom+0.070
ooks+0.069
Neg logit contributions
Mine-0.069
 care-0.064
 breaths-0.060
 imag-0.058
Stop-0.057
Tokenaw
Token ID707
Feature activation+1.83e-01

Pos logit contributions
 exactly+0.153
ook+0.141
ooks+0.137
bles+0.130
ents+0.129
Neg logit contributions
Mine-0.167
 care-0.154
Stop-0.145
hered-0.141
 breaths-0.139
Token,
Token ID11
Feature activation+7.74e-03

Pos logit contributions
 exactly+0.388
ooks+0.374
ook+0.373
 impressed+0.346
anut+0.335
Neg logit contributions
Mine-0.241
 care-0.230
hered-0.223
Stop-0.222
 breaths-0.196
Token c
Token ID269
Feature activation-1.28e-02

Pos logit contributions
 exactly+0.015
ook+0.015
ooks+0.014
ossom+0.013
bles+0.013
Neg logit contributions
Mine-0.012
 care-0.011
Stop-0.010
hered-0.010
 breaths-0.009
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
Mine+0.025
 care+0.025
Stop+0.025
hered+0.024
 breaths+0.024
Neg logit contributions
 exactly-0.017
ook-0.015
ooks-0.015
 impressed-0.014
ae-0.014
Token,
Token ID11
Feature activation+6.96e-03

Pos logit contributions
 exactly+0.391
ooks+0.377
ook+0.374
 impressed+0.353
iring+0.340
Neg logit contributions
Mine-0.231
 care-0.221
hered-0.220
Stop-0.212
storm-0.190
Token c
Token ID269
Feature activation-1.33e-02

Pos logit contributions
 exactly+0.014
ook+0.013
ooks+0.013
bles+0.012
ossom+0.012
Neg logit contributions
Mine-0.011
 care-0.010
Stop-0.009
hered-0.009
 breaths-0.009
TokenThey
Token ID2990
Feature activation-3.07e-02

Pos logit contributions
 exactly+0.016
ook+0.014
ossom+0.013
ooks+0.013
ae+0.013
Neg logit contributions
Mine-0.015
Stop-0.014
 care-0.014
Hey-0.012
hered-0.012
Token get
Token ID651
Feature activation+4.38e-02

Pos logit contributions
Mine+0.044
 care+0.040
Stop+0.038
hered+0.037
 breaths+0.036
Neg logit contributions
 exactly-0.063
ook-0.058
ooks-0.056
ents-0.055
bles-0.055
Token off
Token ID572
Feature activation-1.32e-02

Pos logit contributions
 exactly+0.077
ook+0.071
ooks+0.069
bles+0.066
ossom+0.065
Neg logit contributions
Mine-0.077
 care-0.073
Stop-0.067
hered-0.065
 breaths-0.065
Token the
Token ID262
Feature activation+1.69e-02

Pos logit contributions
Mine+0.017
 care+0.017
Stop+0.014
|+0.014
 him+0.014
Neg logit contributions
 exactly-0.029
ook-0.027
ooks-0.027
bles-0.027
ossom-0.027
Token sees
Token ID7224
Feature activation+5.91e-04

Pos logit contributions
 exactly+0.029
ook+0.027
ooks+0.026
bles+0.025
ents+0.025
Neg logit contributions
Mine-0.030
 care-0.028
Stop-0.026
hered-0.026
 breaths-0.025
Tokenaw
Token ID707
Feature activation+1.83e-01

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
bles+0.001
ents+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
 breaths-0.001
Token.
Token ID13
Feature activation-2.29e-04

Pos logit contributions
ooks+0.449
 exactly+0.447
ook+0.427
 impressed+0.405
ossom+0.403
Neg logit contributions
 him-0.160
 care-0.155
Mine-0.155
hered-0.152
Stop-0.135
Token They
Token ID1119
Feature activation-3.21e-02

Pos logit contributions
Mine+0.000
 care+0.000
hered+0.000
Stop+0.000
 breaths+0.000
Neg logit contributions
 exactly-0.000
ook-0.000
ooks-0.000
ents-0.000
aze-0.000
Token run
Token ID1057
Feature activation+2.64e-02

Pos logit contributions
Mine+0.045
 care+0.040
Stop+0.039
hered+0.038
 breaths+0.037
Neg logit contributions
 exactly-0.067
ook-0.061
ooks-0.059
ents-0.058
bles-0.058
Token to
Token ID284
Feature activation-1.37e-02

Pos logit contributions
 exactly+0.048
ook+0.045
ooks+0.044
bles+0.041
ossom+0.041
Neg logit contributions
Mine-0.045
 care-0.043
Stop-0.039
hered-0.038
 breaths-0.037
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
Mine+0.023
 care+0.022
Stop+0.021
hered+0.021
 breaths+0.020
Neg logit contributions
 exactly-0.024
ook-0.022
ooks-0.021
ents-0.020
 impressed-0.020
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
 exactly+0.012
ook+0.011
ooks+0.011
ossom+0.010
bles+0.010
Neg logit contributions
Mine-0.010
 care-0.009
Stop-0.009
hered-0.008
|-0.008
Token slide
Token ID10649
Feature activation+2.82e-02

Pos logit contributions
 exactly+0.032
ook+0.030
ooks+0.029
bles+0.027
ents+0.027
Neg logit contributions
Mine-0.034
 care-0.032
Stop-0.030
hered-0.029
 breaths-0.028
Token and
Token ID290
Feature activation+9.94e-03

Pos logit contributions
 exactly+0.059
ooks+0.054
ossom+0.054
ook+0.054
ae+0.052
Neg logit contributions
Mine-0.039
 care-0.037
Stop-0.033
hered-0.032
|-0.031
Token the
Token ID262
Feature activation+1.89e-02

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.017
bles+0.016
ents+0.016
Neg logit contributions
Mine-0.016
 care-0.015
Stop-0.014
hered-0.014
 breaths-0.013
Token sees
Token ID7224
Feature activation-1.38e-03

Pos logit contributions
 exactly+0.032
ook+0.029
ooks+0.028
bles+0.027
ossom+0.026
Neg logit contributions
Mine-0.035
 care-0.033
Stop-0.031
hered-0.030
 breaths-0.029
Tokenaw
Token ID707
Feature activation+1.83e-01

Pos logit contributions
Mine+0.003
 care+0.003
Stop+0.003
hered+0.003
Hey+0.003
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
ossom-0.001
anut-0.001
Token.
Token ID13
Feature activation+5.55e-03

Pos logit contributions
ooks+0.450
 exactly+0.449
ook+0.432
ae+0.404
 impressed+0.403
Neg logit contributions
hered-0.159
Mine-0.157
 him-0.153
 care-0.147
Stop-0.143
Token But
Token ID887
Feature activation-1.99e-02

Pos logit contributions
 exactly+0.013
ook+0.012
ooks+0.012
bles+0.012
ossom+0.012
Neg logit contributions
Mine-0.007
 care-0.006
Stop-0.005
|-0.005
hered-0.005
Token there
Token ID612
Feature activation-7.23e-03

Pos logit contributions
Mine+0.030
 care+0.028
Stop+0.026
hered+0.025
 breaths+0.025
Neg logit contributions
 exactly-0.041
ook-0.037
ooks-0.036
 impressed-0.034
bles-0.034
Token was
Token ID373
Feature activation-2.59e-02

Pos logit contributions
Mine+0.010
 care+0.009
Stop+0.009
hered+0.008
 breaths+0.008
Neg logit contributions
 exactly-0.015
ook-0.014
ooks-0.014
bles-0.013
 impressed-0.013
Token one
Token ID530
Feature activation-2.57e-02

Pos logit contributions
Mine+0.038
 care+0.034
Stop+0.033
 him+0.029
Hey+0.029
Neg logit contributions
 exactly-0.051
ook-0.051
ossom-0.050
ooks-0.049
bles-0.048
Token They
Token ID1119
Feature activation-3.33e-02

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
ents+0.002
bles+0.002
Neg logit contributions
Mine-0.001
 care-0.001
 breaths-0.001
hered-0.001
Stop-0.001
Token balance
Token ID5236
Feature activation-5.45e-02

Pos logit contributions
Mine+0.047
Stop+0.044
hered+0.043
|+0.042
 breaths+0.041
Neg logit contributions
 exactly-0.064
aches-0.058
ents-0.058
bles-0.057
ook-0.057
Token on
Token ID319
Feature activation+3.06e-02

Pos logit contributions
Mine+0.106
 care+0.100
Stop+0.094
|+0.094
hered+0.092
Neg logit contributions
 exactly-0.087
ook-0.076
bles-0.074
ooks-0.073
 impressed-0.070
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
 exactly+0.059
ook+0.055
ooks+0.053
bles+0.051
ents+0.051
Neg logit contributions
Mine-0.048
 care-0.045
Stop-0.042
hered-0.040
 breaths-0.040
Token sees
Token ID7224
Feature activation+1.56e-03

Pos logit contributions
 exactly+0.024
ook+0.022
ooks+0.021
bles+0.020
ents+0.020
Neg logit contributions
Mine-0.026
 care-0.025
Stop-0.023
hered-0.023
|-0.022
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
 exactly+0.002
ook+0.001
ooks+0.001
ents+0.001
bles+0.001
Neg logit contributions
Mine-0.004
 care-0.004
|-0.004
Stop-0.004
hered-0.004
Token.
Token ID13
Feature activation+1.54e-03

Pos logit contributions
 exactly+0.424
ooks+0.417
ook+0.406
 impressed+0.390
ossom+0.377
Neg logit contributions
Mine-0.192
 care-0.184
hered-0.178
 him-0.173
Stop-0.170
Token They
Token ID1119
Feature activation-3.24e-02

Pos logit contributions
 exactly+0.003
ooks+0.003
ook+0.003
ents+0.003
aze+0.003
Neg logit contributions
Mine-0.002
 care-0.002
 breaths-0.002
hered-0.002
Stop-0.002
Token cl
Token ID537
Feature activation-5.21e-03

Pos logit contributions
Mine+0.047
Stop+0.044
hered+0.043
|+0.042
 breaths+0.041
Neg logit contributions
 exactly-0.059
aches-0.055
ents-0.055
bles-0.055
 impressed-0.053
Tokenap
Token ID499
Feature activation+1.40e-02

Pos logit contributions
Mine+0.011
 care+0.010
Stop+0.009
hered+0.009
 breaths+0.009
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
bles-0.006
ossom-0.006
Token.
Token ID13
Feature activation+9.10e-04

Pos logit contributions
 exactly+0.027
ook+0.025
ooks+0.024
bles+0.023
ents+0.023
Neg logit contributions
Mine-0.023
 care-0.021
Stop-0.020
hered-0.019
|-0.019
Token blew
Token ID17948
Feature activation-4.52e-02

Pos logit contributions
Mine+0.204
 care+0.176
Stop+0.163
|+0.163
 imag+0.160
Neg logit contributions
 exactly-0.153
bles-0.138
ich-0.136
 impressed-0.132
ook-0.131
Token and
Token ID290
Feature activation+1.76e-02

Pos logit contributions
Mine+0.087
 care+0.077
|+0.077
 imag+0.074
Stop+0.072
Neg logit contributions
 exactly-0.071
ook-0.058
ents-0.056
iring-0.056
ooks-0.055
Token made
Token ID925
Feature activation-2.44e-02

Pos logit contributions
 exactly+0.033
 impressed+0.028
ook+0.028
ents+0.027
ooks+0.026
Neg logit contributions
Mine-0.030
 care-0.026
|-0.025
Stop-0.025
hered-0.025
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
Mine+0.043
 care+0.038
hered+0.038
|+0.038
Stop+0.038
Neg logit contributions
 exactly-0.043
ook-0.038
ooks-0.036
 impressed-0.035
ents-0.035
Token sees
Token ID7224
Feature activation+8.05e-03

Pos logit contributions
 exactly+0.032
ook+0.029
ich+0.028
ooks+0.028
bles+0.028
Neg logit contributions
Mine-0.034
 care-0.032
Stop-0.030
|-0.029
hered-0.029
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
 exactly+0.015
ook+0.013
ooks+0.013
bles+0.012
ents+0.012
Neg logit contributions
Mine-0.014
 care-0.013
Stop-0.012
hered-0.012
|-0.011
Token go
Token ID467
Feature activation+6.34e-02

Pos logit contributions
 exactly+0.364
ooks+0.361
ook+0.357
ae+0.326
 impressed+0.320
Neg logit contributions
Mine-0.241
hered-0.238
 care-0.235
Stop-0.221
 him-0.211
Token up
Token ID510
Feature activation+5.19e-02

Pos logit contributions
 exactly+0.120
ook+0.111
ooks+0.106
bles+0.103
ents+0.102
Neg logit contributions
Mine-0.104
 care-0.096
Stop-0.089
hered-0.085
 breaths-0.085
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
 exactly+0.085
ook+0.075
ooks+0.072
 impressed+0.070
ents+0.070
Neg logit contributions
Mine-0.099
 care-0.092
Stop-0.088
hered-0.085
|-0.084
Token down
Token ID866
Feature activation+2.81e-02

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.017
 impressed+0.016
ents+0.016
Neg logit contributions
Mine-0.019
 care-0.018
Stop-0.017
hered-0.016
|-0.016
Token very
Token ID845
Feature activation+4.59e-02

Pos logit contributions
 exactly+0.050
ook+0.046
ooks+0.045
bles+0.042
 impressed+0.042
Neg logit contributions
Mine-0.049
 care-0.045
Stop-0.042
hered-0.041
|-0.041
Token "
Token ID366
Feature activation+4.32e-02

Pos logit contributions
 exactly+0.017
ook+0.014
 impressed+0.014
ooks+0.014
ents+0.014
Neg logit contributions
Mine-0.017
 care-0.015
|-0.015
hered-0.015
Stop-0.014
Tokenc
Token ID66
Feature activation+9.07e-02

Pos logit contributions
 exactly+0.078
ook+0.074
ents+0.070
ossom+0.070
ooks+0.069
Neg logit contributions
Mine-0.069
 care-0.064
 breaths-0.060
 imag-0.058
Stop-0.057
Tokenaw
Token ID707
Feature activation+1.83e-01

Pos logit contributions
 exactly+0.153
ook+0.141
ooks+0.137
bles+0.130
ents+0.129
Neg logit contributions
Mine-0.167
 care-0.154
Stop-0.145
hered-0.141
 breaths-0.139
Token,
Token ID11
Feature activation+7.74e-03

Pos logit contributions
 exactly+0.388
ooks+0.374
ook+0.373
 impressed+0.346
anut+0.335
Neg logit contributions
Mine-0.241
 care-0.230
hered-0.223
Stop-0.222
 breaths-0.196
Token c
Token ID269
Feature activation-1.28e-02

Pos logit contributions
 exactly+0.015
ook+0.015
ooks+0.014
ossom+0.013
bles+0.013
Neg logit contributions
Mine-0.012
 care-0.011
Stop-0.010
hered-0.010
 breaths-0.009
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
Mine+0.025
 care+0.025
Stop+0.025
hered+0.024
 breaths+0.024
Neg logit contributions
 exactly-0.017
ook-0.015
ooks-0.015
 impressed-0.014
ae-0.014
Token,
Token ID11
Feature activation+6.96e-03

Pos logit contributions
 exactly+0.391
ooks+0.377
ook+0.374
 impressed+0.353
iring+0.340
Neg logit contributions
Mine-0.231
 care-0.221
hered-0.220
Stop-0.212
storm-0.190
Token c
Token ID269
Feature activation-1.33e-02

Pos logit contributions
 exactly+0.014
ook+0.013
ooks+0.013
bles+0.012
ossom+0.012
Neg logit contributions
Mine-0.011
 care-0.010
Stop-0.009
hered-0.009
 breaths-0.009
Tokenaw
Token ID707
Feature activation+1.81e-01

Pos logit contributions
 care+0.026
Mine+0.026
Stop+0.026
 breaths+0.025
hered+0.025
Neg logit contributions
 exactly-0.018
ook-0.016
ooks-0.016
 impressed-0.015
ae-0.014
Token".
Token ID1911
Feature activation-2.12e-02

Pos logit contributions
 exactly+0.392
ooks+0.376
ook+0.373
 impressed+0.355
iring+0.342
Neg logit contributions
Mine-0.226
hered-0.218
 care-0.217
Stop-0.206
storm-0.186
Token
Token ID198
Feature activation+8.84e-03

Pos logit contributions
Mine+0.032
 care+0.030
Stop+0.028
hered+0.027
 breaths+0.027
Neg logit contributions
 exactly-0.042
ook-0.039
ooks-0.038
bles-0.036
ents-0.036
Token pulled
Token ID5954
Feature activation-3.96e-02

Pos logit contributions
 exactly+0.018
ook+0.016
ooks+0.015
ents+0.015
bles+0.015
Neg logit contributions
Mine-0.014
 care-0.013
Stop-0.012
hered-0.011
|-0.011
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
Mine+0.059
 care+0.054
Stop+0.051
|+0.049
hered+0.049
Neg logit contributions
 exactly-0.080
ook-0.074
ooks-0.071
ents-0.069
bles-0.069
Token The
Token ID383
Feature activation-2.62e-03

Pos logit contributions
Mine+0.002
 care+0.002
Stop+0.002
hered+0.002
|+0.002
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.002
ents-0.002
ossom-0.002
Token pigeon
Token ID46387
Feature activation+8.46e-02

Pos logit contributions
Mine+0.005
 care+0.005
Stop+0.005
hered+0.005
 breaths+0.004
Neg logit contributions
 exactly-0.004
ook-0.004
ooks-0.004
ossom-0.003
ae-0.003
Token squ
Token ID2809
Feature activation-5.01e-02

Pos logit contributions
 exactly+0.177
ooks+0.169
ook+0.169
iring+0.157
bles+0.156
Neg logit contributions
Mine-0.117
 care-0.111
Stop-0.099
hered-0.097
 breaths-0.096
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
Mine+0.083
 care+0.077
Stop+0.074
storm+0.074
|+0.073
Neg logit contributions
 exactly-0.090
ook-0.083
ents-0.079
ooks-0.078
ossom-0.077
Tokenked
Token ID9091
Feature activation-1.96e-02

Pos logit contributions
 exactly+0.421
ooks+0.418
ook+0.413
ae+0.381
iring+0.376
Neg logit contributions
Mine-0.203
hered-0.187
 care-0.187
Stop-0.169
 breaths-0.156
Token and
Token ID290
Feature activation+8.19e-03

Pos logit contributions
Mine+0.034
 care+0.032
Stop+0.029
hered+0.028
 breaths+0.028
Neg logit contributions
 exactly-0.035
ook-0.033
ooks-0.032
bles-0.030
ae-0.030
Token bit
Token ID1643
Feature activation-4.83e-02

Pos logit contributions
 exactly+0.015
 impressed+0.013
ook+0.013
ents+0.013
bles+0.013
Neg logit contributions
Mine-0.013
 care-0.012
|-0.011
Stop-0.011
hered-0.011
Token her
Token ID607
Feature activation+1.69e-02

Pos logit contributions
Mine+0.096
 imag+0.095
storm+0.094
hered+0.093
use+0.090
Neg logit contributions
 exactly-0.067
ents-0.058
iring-0.052
 twice-0.051
ook-0.051
Token finger
Token ID7660
Feature activation-1.77e-02

Pos logit contributions
 exactly+0.029
ook+0.027
ooks+0.026
ents+0.024
bles+0.024
Neg logit contributions
Mine-0.030
 care-0.029
Stop-0.027
hered-0.026
 breaths-0.026
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
 exactly+0.069
ook+0.064
ooks+0.062
bles+0.060
ents+0.059
Neg logit contributions
Mine-0.057
 care-0.054
Stop-0.050
hered-0.048
 breaths-0.047
Token rock
Token ID3881
Feature activation-5.72e-02

Pos logit contributions
 exactly+0.035
ook+0.032
ooks+0.032
ossom+0.030
iring+0.029
Neg logit contributions
Mine-0.030
 care-0.029
Stop-0.026
hered-0.025
 breaths-0.025
Token like
Token ID588
Feature activation+2.09e-03

Pos logit contributions
Mine+0.088
 care+0.084
hered+0.075
 breaths+0.074
Stop+0.073
Neg logit contributions
 exactly-0.107
ooks-0.103
ook-0.101
 impressed-0.095
iring-0.094
Token a
Token ID257
Feature activation+2.08e-03

Pos logit contributions
 exactly+0.004
ook+0.003
ents+0.003
ooks+0.003
bles+0.003
Neg logit contributions
Mine-0.004
 care-0.003
|-0.003
Stop-0.003
hered-0.003
Token sees
Token ID7224
Feature activation+7.90e-04

Pos logit contributions
 exactly+0.004
ook+0.004
ooks+0.004
iring+0.004
anut+0.003
Neg logit contributions
Mine-0.004
 care-0.003
hered-0.003
Stop-0.003
 bath-0.003
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
ents+0.001
bles+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
|-0.001
hered-0.001
Token.
Token ID13
Feature activation+5.40e-04

Pos logit contributions
 exactly+0.366
ooks+0.353
ook+0.351
 impressed+0.322
ossom+0.319
Neg logit contributions
Mine-0.262
 care-0.245
hered-0.238
Stop-0.229
 him-0.220
Token She
Token ID1375
Feature activation+3.11e-02

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
anut+0.001
bles+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
 him-0.001
Token puts
Token ID7584
Feature activation-7.87e-02

Pos logit contributions
 exactly+0.067
ook+0.063
ooks+0.061
bles+0.059
ents+0.059
Neg logit contributions
Mine-0.042
 care-0.039
Stop-0.035
hered-0.034
 breaths-0.033
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
Mine+0.120
 care+0.113
Stop+0.106
|+0.103
hered+0.102
Neg logit contributions
 exactly-0.154
ook-0.140
ooks-0.134
bles-0.133
ents-0.132
Token jar
Token ID17379
Feature activation+5.17e-02

Pos logit contributions
 exactly+0.034
ook+0.031
ooks+0.030
bles+0.030
ents+0.029
Neg logit contributions
Mine-0.032
 care-0.030
Stop-0.028
hered-0.027
 breaths-0.027
TokenThey
Token ID2990
Feature activation-3.48e-02

Pos logit contributions
 exactly+0.005
ook+0.005
ooks+0.005
ents+0.004
ossom+0.004
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.006
hered-0.005
Hey-0.005
Token run
Token ID1057
Feature activation+2.43e-02

Pos logit contributions
Mine+0.049
 care+0.045
Stop+0.042
hered+0.041
 breaths+0.041
Neg logit contributions
 exactly-0.072
ook-0.066
ooks-0.064
ents-0.062
bles-0.062
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
 exactly+0.042
ook+0.039
ooks+0.038
bles+0.036
ents+0.036
Neg logit contributions
Mine-0.043
 care-0.040
Stop-0.038
hered-0.037
 breaths-0.036
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
Mine+0.030
 care+0.027
Stop+0.027
hered+0.027
|+0.026
Neg logit contributions
 exactly-0.029
ook-0.026
ooks-0.024
 impressed-0.023
anut-0.023
Token sees
Token ID7224
Feature activation-3.40e-03

Pos logit contributions
 exactly+0.028
ook+0.026
ooks+0.026
ossom+0.024
aze+0.023
Neg logit contributions
Mine-0.024
 care-0.022
Stop-0.021
 breaths-0.018
hered-0.018
Tokenaw
Token ID707
Feature activation+1.81e-01

Pos logit contributions
Mine+0.008
 care+0.008
Stop+0.008
|+0.008
hered+0.008
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.003
bles-0.003
ents-0.003
Token.
Token ID13
Feature activation-2.61e-03

Pos logit contributions
ooks+0.445
 exactly+0.431
ook+0.421
ossom+0.395
ae+0.395
Neg logit contributions
 him-0.170
Mine-0.166
 care-0.156
hered-0.152
Stop-0.143
Token They
Token ID1119
Feature activation-3.61e-02

Pos logit contributions
Mine+0.003
 care+0.003
Stop+0.003
hered+0.003
 breaths+0.003
Neg logit contributions
 exactly-0.006
ook-0.005
ooks-0.005
ents-0.005
ossom-0.005
Token sit
Token ID1650
Feature activation+2.27e-02

Pos logit contributions
Mine+0.051
 care+0.047
Stop+0.045
hered+0.044
 breaths+0.043
Neg logit contributions
 exactly-0.073
ook-0.067
ents-0.064
ooks-0.064
bles-0.064
Token on
Token ID319
Feature activation+3.17e-02

Pos logit contributions
ooks+0.054
ae+0.053
 exactly+0.052
via+0.051
aze+0.051
Neg logit contributions
 him-0.019
 breaths-0.018
Mine-0.016
hered-0.016
 care-0.016
Token each
Token ID1123
Feature activation-8.12e-02

Pos logit contributions
 exactly+0.060
ook+0.056
ooks+0.054
bles+0.051
ents+0.051
Neg logit contributions
Mine-0.051
 care-0.048
Stop-0.044
hered-0.043
 breaths-0.042
Token,
Token ID11
Feature activation+7.74e-03

Pos logit contributions
 exactly+0.388
ooks+0.374
ook+0.373
 impressed+0.346
anut+0.335
Neg logit contributions
Mine-0.241
 care-0.230
hered-0.223
Stop-0.222
 breaths-0.196
Token c
Token ID269
Feature activation-1.28e-02

Pos logit contributions
 exactly+0.015
ook+0.015
ooks+0.014
ossom+0.013
bles+0.013
Neg logit contributions
Mine-0.012
 care-0.011
Stop-0.010
hered-0.010
 breaths-0.009
Tokenaw
Token ID707
Feature activation+1.82e-01

Pos logit contributions
Mine+0.025
 care+0.025
Stop+0.025
hered+0.024
 breaths+0.024
Neg logit contributions
 exactly-0.017
ook-0.015
ooks-0.015
 impressed-0.014
ae-0.014
Token,
Token ID11
Feature activation+6.96e-03

Pos logit contributions
 exactly+0.391
ooks+0.377
ook+0.374
 impressed+0.353
iring+0.340
Neg logit contributions
Mine-0.231
 care-0.221
hered-0.220
Stop-0.212
storm-0.190
Token c
Token ID269
Feature activation-1.33e-02

Pos logit contributions
 exactly+0.014
ook+0.013
ooks+0.013
bles+0.012
ossom+0.012
Neg logit contributions
Mine-0.011
 care-0.010
Stop-0.009
hered-0.009
 breaths-0.009
Tokenaw
Token ID707
Feature activation+1.81e-01

Pos logit contributions
 care+0.026
Mine+0.026
Stop+0.026
 breaths+0.025
hered+0.025
Neg logit contributions
 exactly-0.018
ook-0.016
ooks-0.016
 impressed-0.015
ae-0.014
Token".
Token ID1911
Feature activation-2.12e-02

Pos logit contributions
 exactly+0.392
ooks+0.376
ook+0.373
 impressed+0.355
iring+0.342
Neg logit contributions
Mine-0.226
hered-0.218
 care-0.217
Stop-0.206
storm-0.186
Token
Token ID198
Feature activation+8.84e-03

Pos logit contributions
Mine+0.032
 care+0.030
Stop+0.028
hered+0.027
 breaths+0.027
Neg logit contributions
 exactly-0.042
ook-0.039
ooks-0.038
bles-0.036
ents-0.036
Token
Token ID198
Feature activation+8.07e-03

Pos logit contributions
 exactly+0.018
ook+0.017
ooks+0.016
bles+0.016
ossom+0.016
Neg logit contributions
Mine-0.013
 care-0.012
Stop-0.011
|-0.011
hered-0.010
TokenL
Token ID43
Feature activation-2.39e-02

Pos logit contributions
 exactly+0.014
ook+0.012
ooks+0.012
bles+0.012
iring+0.011
Neg logit contributions
Mine-0.016
Stop-0.014
 care-0.014
Hey-0.013
|-0.013
Tokenily
Token ID813
Feature activation-1.13e-02

Pos logit contributions
Mine+0.042
 care+0.040
Stop+0.038
hered+0.037
|+0.036
Neg logit contributions
 exactly-0.041
ook-0.037
ooks-0.036
 impressed-0.035
ossom-0.035
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
 exactly+0.014
ook+0.013
ooks+0.013
ossom+0.012
bles+0.012
Neg logit contributions
Mine-0.013
 care-0.012
Stop-0.011
hered-0.011
|-0.010
Token slide
Token ID10649
Feature activation+3.07e-02

Pos logit contributions
 exactly+0.039
ook+0.036
ooks+0.035
bles+0.033
ents+0.033
Neg logit contributions
Mine-0.041
 care-0.039
Stop-0.036
hered-0.035
 breaths-0.035
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
 exactly+0.062
ook+0.057
ooks+0.057
ossom+0.055
anut+0.054
Neg logit contributions
Mine-0.045
 care-0.043
Stop-0.039
hered-0.038
|-0.037
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
 exactly+0.021
ook+0.020
ooks+0.019
ents+0.018
bles+0.018
Neg logit contributions
Mine-0.019
 care-0.017
Stop-0.016
hered-0.016
 breaths-0.016
Token sees
Token ID7224
Feature activation+5.73e-04

Pos logit contributions
 exactly+0.037
ook+0.034
ooks+0.033
bles+0.031
ents+0.031
Neg logit contributions
Mine-0.042
 care-0.040
Stop-0.037
hered-0.036
 breaths-0.036
Tokenaw
Token ID707
Feature activation+1.81e-01

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
ossom+0.001
anut+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
Hey-0.001
Token.
Token ID13
Feature activation+8.02e-03

Pos logit contributions
 exactly+0.438
ooks+0.437
ook+0.424
 impressed+0.395
bek+0.394
Neg logit contributions
hered-0.155
Mine-0.152
 care-0.150
 him-0.149
storm-0.140
Token They
Token ID1119
Feature activation-2.93e-02

Pos logit contributions
 exactly+0.016
ook+0.015
ooks+0.015
bles+0.014
ossom+0.014
Neg logit contributions
Mine-0.012
 care-0.011
Stop-0.010
hered-0.010
 breaths-0.009
Token also
Token ID635
Feature activation+5.30e-02

Pos logit contributions
Mine+0.047
 care+0.044
Stop+0.041
hered+0.040
 breaths+0.040
Neg logit contributions
 exactly-0.055
ook-0.051
ooks-0.050
bles-0.048
ents-0.047
Token liked
Token ID8288
Feature activation+3.01e-03

Pos logit contributions
 exactly+0.102
ook+0.095
ooks+0.092
bles+0.088
ents+0.088
Neg logit contributions
Mine-0.084
 care-0.079
Stop-0.073
hered-0.070
 breaths-0.069
Token to
Token ID284
Feature activation-1.50e-02

Pos logit contributions
 exactly+0.006
ook+0.006
ooks+0.005
ossom+0.005
bles+0.005
Neg logit contributions
Mine-0.005
 care-0.005
Stop-0.004
 breaths-0.004
 him-0.004
Token want
Token ID765
Feature activation-2.83e-02

Pos logit contributions
Mine+0.053
hered+0.049
Stop+0.049
|+0.048
 breaths+0.046
Neg logit contributions
 exactly-0.070
aches-0.065
ents-0.064
bles-0.064
ook-0.062
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
Mine+0.051
 care+0.048
Stop+0.045
hered+0.044
 breaths+0.043
Neg logit contributions
 exactly-0.049
ook-0.045
ooks-0.043
ents-0.041
bles-0.041
Token make
Token ID787
Feature activation-1.13e-02

Pos logit contributions
Mine+0.024
Stop+0.021
hered+0.021
 breaths+0.021
|+0.020
Neg logit contributions
 exactly-0.027
aches-0.025
ook-0.025
bles-0.024
aze-0.024
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
Mine+0.017
 care+0.016
Stop+0.014
 breaths+0.014
 him+0.013
Neg logit contributions
 exactly-0.022
ook-0.021
ooks-0.021
ossom-0.020
bles-0.020
Token sees
Token ID7224
Feature activation+1.43e-03

Pos logit contributions
 exactly+0.021
ook+0.020
ooks+0.019
ossom+0.018
ents+0.018
Neg logit contributions
Mine-0.020
 care-0.019
Stop-0.018
 breaths-0.017
hered-0.016
Tokenaw
Token ID707
Feature activation+1.81e-01

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
ossom+0.002
bles+0.001
Neg logit contributions
Mine-0.003
 care-0.003
Stop-0.003
hered-0.003
 breaths-0.003
Token stay
Token ID2652
Feature activation-1.31e-02

Pos logit contributions
ooks+0.424
 exactly+0.406
ook+0.399
 impressed+0.363
ae+0.358
Neg logit contributions
 care-0.180
hered-0.179
 him-0.178
Mine-0.174
storm-0.159
Token still
Token ID991
Feature activation+2.03e-02

Pos logit contributions
 care+0.020
Mine+0.019
 breaths+0.018
hered+0.018
 bath+0.017
Neg logit contributions
 exactly-0.024
ook-0.023
ooks-0.023
ents-0.021
ae-0.021
Token.
Token ID13
Feature activation-2.63e-03

Pos logit contributions
 exactly+0.038
ook+0.036
ooks+0.035
bles+0.033
ents+0.033
Neg logit contributions
Mine-0.033
 care-0.031
Stop-0.028
hered-0.028
 breaths-0.027
Token They
Token ID1119
Feature activation-3.54e-02

Pos logit contributions
Mine+0.004
 care+0.004
 breaths+0.003
hered+0.003
Stop+0.003
Neg logit contributions
 exactly-0.005
ooks-0.005
ook-0.005
ents-0.005
ossom-0.005
Token move
Token ID1445
Feature activation+1.85e-02

Pos logit contributions
Mine+0.049
hered+0.045
Stop+0.044
 care+0.043
|+0.043
Neg logit contributions
 exactly-0.071
ook-0.064
aches-0.063
ents-0.063
bles-0.062
Token want
Token ID765
Feature activation-3.66e-02

Pos logit contributions
Mine+0.048
 care+0.043
hered+0.041
Stop+0.041
 breaths+0.040
Neg logit contributions
 exactly-0.080
ook-0.073
ooks-0.071
ents-0.070
bles-0.070
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
Mine+0.058
 care+0.057
Stop+0.051
 him+0.048
 breaths+0.047
Neg logit contributions
 exactly-0.070
ook-0.066
ooks-0.065
ossom-0.063
bles-0.063
Token try
Token ID1949
Feature activation+5.22e-02

Pos logit contributions
Mine+0.024
 care+0.023
Stop+0.021
hered+0.019
|+0.019
Neg logit contributions
 exactly-0.037
ook-0.035
ooks-0.034
bles-0.033
ossom-0.032
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
ooks+0.128
 exactly+0.126
ossom+0.126
ook+0.125
bek+0.122
Neg logit contributions
 care-0.048
 him-0.047
Mine-0.045
 bath-0.038
Stop-0.036
Token sees
Token ID7224
Feature activation-3.45e-03

Pos logit contributions
 exactly+0.026
ook+0.024
ooks+0.024
ossom+0.022
bles+0.022
Neg logit contributions
Mine-0.024
 care-0.023
Stop-0.021
hered-0.020
 breaths-0.020
Tokenaw
Token ID707
Feature activation+1.80e-01

Pos logit contributions
Mine+0.008
 care+0.008
Stop+0.008
hered+0.007
 breaths+0.007
Neg logit contributions
 exactly-0.004
ook-0.003
ooks-0.003
bles-0.003
ents-0.003
Token.
Token ID13
Feature activation-2.09e-03

Pos logit contributions
ooks+0.449
 exactly+0.423
ook+0.419
 impressed+0.390
ae+0.390
Neg logit contributions
 him-0.171
Mine-0.165
 care-0.156
hered-0.154
Stop-0.144
Token
Token ID198
Feature activation+3.60e-03

Pos logit contributions
Mine+0.003
 care+0.002
Stop+0.002
hered+0.002
 breaths+0.002
Neg logit contributions
 exactly-0.005
ook-0.005
ooks-0.004
bles-0.004
ossom-0.004
Token
Token ID198
Feature activation+3.31e-03

Pos logit contributions
 exactly+0.006
ook+0.006
ooks+0.006
 impressed+0.005
ents+0.005
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.006
 breaths-0.006
hered-0.006
TokenThey
Token ID2990
Feature activation-3.48e-02

Pos logit contributions
 exactly+0.005
ook+0.005
ooks+0.005
ents+0.004
ossom+0.004
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.006
hered-0.005
Hey-0.005
Token run
Token ID1057
Feature activation+2.43e-02

Pos logit contributions
Mine+0.049
 care+0.045
Stop+0.042
hered+0.041
 breaths+0.041
Neg logit contributions
 exactly-0.072
ook-0.066
ooks-0.064
ents-0.062
bles-0.062
Token.
Token ID13
Feature activation-3.63e-03

Pos logit contributions
 exactly+0.026
ook+0.023
ooks+0.022
iring+0.021
 impressed+0.021
Neg logit contributions
Mine-0.030
 care-0.027
Stop-0.026
hered-0.026
|-0.025
Token He
Token ID679
Feature activation-9.11e-03

Pos logit contributions
Mine+0.005
 care+0.005
Stop+0.004
hered+0.004
|+0.004
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
bles-0.007
ossom-0.007
Token would
Token ID561
Feature activation+2.19e-02

Pos logit contributions
Mine+0.016
 care+0.014
Stop+0.014
hered+0.014
 breaths+0.013
Neg logit contributions
 exactly-0.016
ook-0.015
ooks-0.014
 impressed-0.014
ents-0.014
Token take
Token ID1011
Feature activation+2.58e-02

Pos logit contributions
 exactly+0.038
ook+0.032
aches+0.032
 impressed+0.031
ossom+0.030
Neg logit contributions
Mine-0.040
Stop-0.037
hered-0.035
 care-0.034
Hey-0.034
Token big
Token ID1263
Feature activation-1.00e-02

Pos logit contributions
 exactly+0.046
ook+0.041
ooks+0.039
aze+0.038
aches+0.038
Neg logit contributions
Mine-0.045
Stop-0.038
hered-0.036
|-0.035
 care-0.035
Token breaths
Token ID45576
Feature activation-2.12e-01

Pos logit contributions
Mine+0.018
|+0.015
Stop+0.015
 raged+0.014
storm+0.014
Neg logit contributions
 exactly-0.016
 taller-0.015
ents-0.014
adow-0.013
aches-0.013
Token of
Token ID286
Feature activation-1.28e-02

Pos logit contributions
Mine+0.400
 care+0.369
Stop+0.355
hered+0.337
|+0.335
Neg logit contributions
 exactly-0.355
ook-0.322
ooks-0.309
bles-0.296
ents-0.295
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
Mine+0.023
hered+0.021
Stop+0.021
 care+0.020
|+0.020
Neg logit contributions
 exactly-0.022
ook-0.019
ooks-0.019
 impressed-0.018
aches-0.018
Token fresh
Token ID4713
Feature activation-4.40e-03

Pos logit contributions
 exactly+0.020
ook+0.019
ooks+0.018
bles+0.017
ossom+0.017
Neg logit contributions
Mine-0.022
 care-0.021
Stop-0.019
hered-0.019
 breaths-0.018
Token air
Token ID1633
Feature activation-9.35e-02

Pos logit contributions
Mine+0.008
Stop+0.007
 care+0.007
|+0.007
 imag+0.006
Neg logit contributions
 exactly-0.007
ook-0.007
ooks-0.007
iring-0.006
anut-0.006
Token and
Token ID290
Feature activation+6.66e-03

Pos logit contributions
Mine+0.183
 care+0.166
Stop+0.165
|+0.154
hered+0.153
Neg logit contributions
 exactly-0.152
ook-0.133
ooks-0.126
bles-0.123
ents-0.123
Token could
Token ID714
Feature activation+3.45e-02

Pos logit contributions
Mine+0.007
 care+0.006
Stop+0.006
hered+0.006
 breaths+0.006
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
 impressed-0.007
bles-0.007
Token take
Token ID1011
Feature activation+3.31e-02

Pos logit contributions
 exactly+0.063
ook+0.058
 impressed+0.054
bles+0.054
ooks+0.054
Neg logit contributions
Mine-0.060
Stop-0.054
hered-0.052
 care-0.052
 breaths-0.050
Token lots
Token ID6041
Feature activation-2.03e-02

Pos logit contributions
 exactly+0.063
ook+0.058
ooks+0.056
bles+0.053
ents+0.053
Neg logit contributions
Mine-0.054
 care-0.047
Stop-0.046
hered-0.045
|-0.044
Token of
Token ID286
Feature activation-6.20e-03

Pos logit contributions
Mine+0.040
 care+0.036
Stop+0.035
 imag+0.034
hered+0.034
Neg logit contributions
 exactly-0.032
ook-0.030
ooks-0.028
ents-0.027
iring-0.027
Token deep
Token ID2769
Feature activation-4.22e-02

Pos logit contributions
Mine+0.011
hered+0.009
storm+0.009
Stop+0.009
|+0.008
Neg logit contributions
 exactly-0.012
aches-0.011
ook-0.011
ooks-0.010
 impressed-0.010
Token breaths
Token ID45576
Feature activation-2.10e-01

Pos logit contributions
Mine+0.082
Stop+0.068
hered+0.066
|+0.066
storm+0.063
Neg logit contributions
 exactly-0.067
ooks-0.064
ook-0.064
bles-0.060
ents-0.060
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
Mine+0.429
 care+0.384
Stop+0.383
|+0.360
hered+0.356
Neg logit contributions
 exactly-0.336
ook-0.303
ooks-0.289
bles-0.280
iring-0.279
Token get
Token ID651
Feature activation+4.60e-02

Pos logit contributions
 exactly+0.024
ook+0.022
ooks+0.021
 impressed+0.021
ossom+0.020
Neg logit contributions
Mine-0.017
Stop-0.015
hered-0.015
 care-0.014
|-0.013
Token lots
Token ID6041
Feature activation-1.95e-02

Pos logit contributions
 exactly+0.082
ook+0.073
ooks+0.071
 impressed+0.068
ents+0.067
Neg logit contributions
Mine-0.083
Stop-0.072
 care-0.072
hered-0.072
|-0.070
Token of
Token ID286
Feature activation-5.99e-03

Pos logit contributions
Mine+0.045
Stop+0.039
|+0.038
hered+0.037
 imag+0.037
Neg logit contributions
 exactly-0.026
ents-0.023
ook-0.023
ooks-0.022
iring-0.022
Token oxygen
Token ID11863
Feature activation-7.45e-02

Pos logit contributions
Mine+0.011
hered+0.010
storm+0.009
|+0.009
Stop+0.009
Neg logit contributions
 exactly-0.011
aches-0.010
 taller-0.010
ooks-0.010
 impressed-0.010
Token.
Token ID13
Feature activation-5.72e-03

Pos logit contributions
Mine+0.164
 care+0.146
Stop+0.144
|+0.142
 imag+0.138
Neg logit contributions
 exactly-0.128
ook-0.113
ber-0.104
ae-0.103
ooks-0.102
Token She
Token ID1375
Feature activation+2.66e-02

Pos logit contributions
Mine+0.008
 care+0.008
Stop+0.007
 breaths+0.007
hered+0.007
Neg logit contributions
 exactly-0.012
ook-0.011
ooks-0.011
bles-0.010
anut-0.010
Token kept
Token ID4030
Feature activation-2.89e-02

Pos logit contributions
 exactly+0.051
ook+0.048
ooks+0.047
bles+0.044
iring+0.044
Neg logit contributions
Mine-0.042
 care-0.040
Stop-0.036
 breaths-0.035
hered-0.035
Token taking
Token ID2263
Feature activation+3.20e-02

Pos logit contributions
Mine+0.051
 care+0.049
Stop+0.046
hered+0.044
 breaths+0.044
Neg logit contributions
 exactly-0.050
ook-0.046
ooks-0.045
bles-0.043
ossom-0.043
Token deep
Token ID2769
Feature activation-4.58e-02

Pos logit contributions
 exactly+0.069
ook+0.062
ooks+0.059
ossom+0.058
aze+0.057
Neg logit contributions
Mine-0.045
Stop-0.036
|-0.036
hered-0.034
 care-0.034
Token breaths
Token ID45576
Feature activation-2.09e-01

Pos logit contributions
Mine+0.051
|+0.039
Stop+0.036
hered+0.035
storm+0.033
Neg logit contributions
 exactly-0.112
ook-0.101
ooks-0.100
ups-0.100
bles-0.100
Token until
Token ID1566
Feature activation-1.68e-02

Pos logit contributions
Mine+0.395
 care+0.363
Stop+0.351
|+0.334
hered+0.328
Neg logit contributions
 exactly-0.350
ook-0.317
ooks-0.304
ents-0.293
bles-0.291
Token all
Token ID477
Feature activation-2.74e-02

Pos logit contributions
Mine+0.033
Stop+0.029
 care+0.029
|+0.028
hered+0.028
Neg logit contributions
 exactly-0.028
ook-0.023
ae-0.022
ooks-0.022
ossom-0.021
Token her
Token ID607
Feature activation+1.24e-02

Pos logit contributions
Mine+0.051
 care+0.045
Stop+0.044
hered+0.043
|+0.042
Neg logit contributions
 exactly-0.047
ook-0.042
ooks-0.040
ents-0.039
iring-0.039
Token energy
Token ID2568
Feature activation-8.58e-02

Pos logit contributions
 exactly+0.020
ae+0.017
ook+0.017
ooks+0.016
ber+0.016
Neg logit contributions
Mine-0.023
Stop-0.021
hered-0.021
|-0.021
 care-0.020
Token was
Token ID373
Feature activation-3.22e-02

Pos logit contributions
Mine+0.149
Stop+0.127
|+0.126
 care+0.125
 imag+0.119
Neg logit contributions
 exactly-0.154
ook-0.137
bles-0.132
ups-0.132
 impressed-0.131
Token eyes
Token ID2951
Feature activation-2.16e-02

Pos logit contributions
 exactly+0.016
ook+0.016
ooks+0.015
 impressed+0.014
bles+0.014
Neg logit contributions
Mine-0.016
 care-0.015
 breaths-0.014
 imag-0.014
Stop-0.014
Token and
Token ID290
Feature activation+3.89e-03

Pos logit contributions
Mine+0.032
 care+0.032
hered+0.028
 breaths+0.028
Stop+0.027
Neg logit contributions
 exactly-0.040
ook-0.040
ooks-0.039
ich-0.037
anut-0.037
Token took
Token ID1718
Feature activation+1.95e-02

Pos logit contributions
 exactly+0.007
ook+0.007
ooks+0.006
 impressed+0.006
ossom+0.006
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.006
hered-0.005
|-0.005
Token some
Token ID617
Feature activation-5.24e-02

Pos logit contributions
 exactly+0.035
ook+0.031
ooks+0.029
ossom+0.028
aze+0.028
Neg logit contributions
Mine-0.035
Stop-0.031
|-0.030
storm-0.029
hered-0.029
Token deep
Token ID2769
Feature activation-4.25e-02

Pos logit contributions
Mine+0.104
Stop+0.093
|+0.091
hered+0.089
storm+0.087
Neg logit contributions
 exactly-0.085
ook-0.071
ents-0.068
ooks-0.068
iring-0.067
Token breaths
Token ID45576
Feature activation-2.09e-01

Pos logit contributions
Mine+0.053
Stop+0.045
|+0.043
storm+0.040
hered+0.040
Neg logit contributions
 exactly-0.096
 taller-0.088
ook-0.087
ooks-0.087
ossom-0.086
Token.
Token ID13
Feature activation-9.06e-03

Pos logit contributions
Mine+0.387
 care+0.354
Stop+0.347
|+0.325
 imag+0.321
Neg logit contributions
 exactly-0.361
ook-0.328
ooks-0.313
ents-0.304
bles-0.300
Token She
Token ID1375
Feature activation+2.25e-02

Pos logit contributions
Mine+0.013
 care+0.012
Stop+0.011
hered+0.010
 breaths+0.010
Neg logit contributions
 exactly-0.019
ook-0.018
bles-0.017
ooks-0.017
anut-0.017
Token felt
Token ID2936
Feature activation-1.38e-02

Pos logit contributions
 exactly+0.043
ook+0.040
ooks+0.039
ents+0.037
bles+0.037
Neg logit contributions
Mine-0.036
 care-0.033
Stop-0.031
hered-0.030
 breaths-0.030
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
Mine+0.029
Stop+0.026
hered+0.026
 care+0.026
|+0.025
Neg logit contributions
 exactly-0.021
ook-0.017
 impressed-0.017
ooks-0.017
iring-0.016
Token warm
Token ID5814
Feature activation-2.33e-02

Pos logit contributions
 exactly+0.026
iring+0.023
ooks+0.022
ook+0.022
ossom+0.022
Neg logit contributions
Mine-0.027
hered-0.024
|-0.024
Stop-0.024
 care-0.023
Token feet
Token ID3625
Feature activation-8.66e-02

Pos logit contributions
 exactly+0.022
ook+0.020
ooks+0.019
ossom+0.018
ents+0.018
Neg logit contributions
Mine-0.022
 care-0.020
Stop-0.020
|-0.019
hered-0.019
Token.
Token ID13
Feature activation+6.51e-03

Pos logit contributions
Mine+0.142
 care+0.135
Stop+0.123
hered+0.121
 breaths+0.118
Neg logit contributions
 exactly-0.158
ook-0.148
ooks-0.144
bles-0.136
ossom-0.136
Token She
Token ID1375
Feature activation+3.11e-02

Pos logit contributions
 exactly+0.016
anut+0.015
keys+0.015
bles+0.015
ich+0.015
Neg logit contributions
Mine-0.007
 care-0.007
Stop-0.006
 him-0.005
 imag-0.005
Token took
Token ID1718
Feature activation+1.80e-02

Pos logit contributions
 exactly+0.061
ook+0.057
ooks+0.056
bles+0.053
iring+0.052
Neg logit contributions
Mine-0.048
 care-0.045
Stop-0.041
hered-0.039
 breaths-0.039
Token big
Token ID1263
Feature activation-3.89e-03

Pos logit contributions
 exactly+0.032
ook+0.029
ooks+0.027
ents+0.027
ae+0.026
Neg logit contributions
Mine-0.032
Stop-0.027
 care-0.027
|-0.027
hered-0.026
Token breaths
Token ID45576
Feature activation-2.09e-01

Pos logit contributions
Mine+0.007
|+0.006
storm+0.006
Stop+0.006
hered+0.005
Neg logit contributions
 exactly-0.006
ents-0.006
 taller-0.005
adow-0.005
 careful-0.005
Token of
Token ID286
Feature activation-7.98e-03

Pos logit contributions
Mine+0.411
 care+0.378
Stop+0.368
|+0.349
hered+0.345
Neg logit contributions
 exactly-0.337
ook-0.303
ooks-0.288
ents-0.278
bles-0.276
Token fresh
Token ID4713
Feature activation-6.33e-05

Pos logit contributions
Mine+0.015
hered+0.014
|+0.013
Stop+0.013
 care+0.013
Neg logit contributions
 exactly-0.014
ook-0.011
aches-0.011
ooks-0.011
 impressed-0.011
Token air
Token ID1633
Feature activation-8.90e-02

Pos logit contributions
Mine+0.000
|+0.000
Stop+0.000
storm+0.000
hered+0.000
Neg logit contributions
 exactly-0.000
 taller-0.000
iring-0.000
ook-0.000
ents-0.000
Token and
Token ID290
Feature activation+9.54e-03

Pos logit contributions
Mine+0.167
 care+0.155
Stop+0.149
|+0.142
hered+0.140
Neg logit contributions
 exactly-0.149
ook-0.133
ooks-0.127
bles-0.123
ents-0.122
Token her
Token ID607
Feature activation+1.33e-02

Pos logit contributions
 exactly+0.017
ook+0.015
ooks+0.015
bles+0.014
ents+0.014
Neg logit contributions
Mine-0.017
 care-0.016
Stop-0.015
hered-0.015
 breaths-0.014
Token he
Token ID339
Feature activation-9.00e-03

Pos logit contributions
 exactly+0.016
 impressed+0.013
ook+0.013
ooks+0.013
ents+0.013
Neg logit contributions
Mine-0.016
hered-0.014
Stop-0.014
 care-0.014
|-0.013
Token tried
Token ID3088
Feature activation-3.16e-02

Pos logit contributions
Mine+0.016
 care+0.014
Stop+0.014
hered+0.014
|+0.013
Neg logit contributions
 exactly-0.016
ook-0.014
 impressed-0.014
ents-0.014
bles-0.013
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
Mine+0.051
 care+0.050
Stop+0.045
hered+0.044
 breaths+0.043
Neg logit contributions
 exactly-0.058
ook-0.055
ooks-0.053
bles-0.051
ossom-0.051
Token take
Token ID1011
Feature activation+3.41e-02

Pos logit contributions
Mine+0.028
Stop+0.024
 imag+0.024
|+0.024
 breaths+0.023
Neg logit contributions
 exactly-0.031
aches-0.029
ook-0.027
aze-0.026
anut-0.026
Token deep
Token ID2769
Feature activation-4.09e-02

Pos logit contributions
 exactly+0.067
ook+0.055
aze+0.054
aches+0.053
ooks+0.051
Neg logit contributions
Mine-0.052
storm-0.045
Stop-0.043
|-0.042
 weep-0.040
Token breaths
Token ID45576
Feature activation-2.09e-01

Pos logit contributions
Mine+0.051
Stop+0.036
|+0.036
hered+0.032
storm+0.032
Neg logit contributions
 exactly-0.098
 taller-0.089
ents-0.087
ooks-0.085
ups-0.084
Token to
Token ID284
Feature activation-1.58e-02

Pos logit contributions
Mine+0.401
 care+0.364
Stop+0.359
 imag+0.335
|+0.335
Neg logit contributions
 exactly-0.352
ook-0.319
ooks-0.304
ents-0.297
bles-0.294
Token calm
Token ID9480
Feature activation-2.74e-02

Pos logit contributions
Mine+0.024
Stop+0.020
 care+0.019
hered+0.019
 imag+0.019
Neg logit contributions
 exactly-0.033
ook-0.030
aches-0.029
ooks-0.028
aze-0.028
Token down
Token ID866
Feature activation+2.87e-02

Pos logit contributions
Mine+0.052
|+0.047
hered+0.044
 imag+0.043
Stop+0.043
Neg logit contributions
 exactly-0.044
ook-0.037
bles-0.035
iring-0.035
ups-0.035
Token.
Token ID13
Feature activation-7.86e-03

Pos logit contributions
 exactly+0.051
ook+0.047
ooks+0.045
bles+0.043
ents+0.043
Neg logit contributions
Mine-0.050
 care-0.046
Stop-0.044
hered-0.042
|-0.042
Token Tim
Token ID5045
Feature activation-2.91e-02

Pos logit contributions
Mine+0.013
 care+0.012
Stop+0.011
hered+0.011
 breaths+0.011
Neg logit contributions
 exactly-0.015
ook-0.014
ooks-0.013
ents-0.012
 impressed-0.012
Token with
Token ID351
Feature activation-2.53e-02

Pos logit contributions
 exactly+0.138
ooks+0.131
ook+0.130
iring+0.124
ich+0.123
Neg logit contributions
 care-0.122
Mine-0.121
hered-0.105
 breaths-0.103
|-0.102
Token some
Token ID617
Feature activation-4.79e-02

Pos logit contributions
Mine+0.046
|+0.046
Stop+0.045
hered+0.043
storm+0.042
Neg logit contributions
 exactly-0.043
ook-0.034
 impressed-0.033
ents-0.032
ooks-0.032
Token big
Token ID1263
Feature activation-4.16e-03

Pos logit contributions
Mine+0.088
|+0.085
Stop+0.079
hered+0.077
 care+0.077
Neg logit contributions
 exactly-0.080
ook-0.068
 impressed-0.066
ooks-0.066
ents-0.065
Token,
Token ID11
Feature activation+4.97e-04

Pos logit contributions
|+0.009
Mine+0.007
Stop+0.007
storm+0.007
 raged+0.007
Neg logit contributions
 exactly-0.006
ents-0.005
 impressed-0.005
 taller-0.005
 careful-0.005
Token big
Token ID1263
Feature activation-3.71e-03

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
bles+0.001
ossom+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
 breaths-0.001
Token breaths
Token ID45576
Feature activation-2.08e-01

Pos logit contributions
|+0.007
Mine+0.007
Stop+0.007
 care+0.006
storm+0.006
Neg logit contributions
 exactly-0.005
ents-0.005
 impressed-0.005
 taller-0.005
ook-0.004
Token that
Token ID326
Feature activation-2.65e-02

Pos logit contributions
Mine+0.410
Stop+0.372
 care+0.368
|+0.349
 imag+0.344
Neg logit contributions
 exactly-0.343
ook-0.308
ooks-0.288
ents-0.288
bles-0.282
Token filled
Token ID5901
Feature activation+3.41e-02

Pos logit contributions
Mine+0.042
 care+0.039
Stop+0.037
 breaths+0.035
hered+0.035
Neg logit contributions
 exactly-0.051
ook-0.047
ooks-0.045
 impressed-0.043
bles-0.043
Token his
Token ID465
Feature activation-1.91e-02

Pos logit contributions
 exactly+0.065
ook+0.061
ooks+0.060
bles+0.057
ents+0.057
Neg logit contributions
Mine-0.054
 care-0.050
Stop-0.047
hered-0.045
|-0.044
Token balloon
Token ID21190
Feature activation+1.37e-02

Pos logit contributions
Mine+0.033
 care+0.031
Stop+0.029
hered+0.028
 breaths+0.028
Neg logit contributions
 exactly-0.034
ook-0.031
ooks-0.030
bles-0.029
ents-0.028
Token with
Token ID351
Feature activation-2.34e-02

Pos logit contributions
 exactly+0.025
ooks+0.024
ook+0.024
 impressed+0.023
iring+0.022
Neg logit contributions
Mine-0.021
 care-0.020
 breaths-0.018
hered-0.018
|-0.017
Token They
Token ID1119
Feature activation-3.75e-02

Pos logit contributions
 exactly+0.005
ook+0.004
aches+0.004
ich+0.004
ooks+0.004
Neg logit contributions
Mine-0.003
 care-0.002
Stop-0.002
hered-0.002
|-0.002
Token could
Token ID714
Feature activation+3.07e-02

Pos logit contributions
Mine+0.057
 care+0.053
Stop+0.049
hered+0.046
 breaths+0.046
Neg logit contributions
 exactly-0.075
ook-0.070
ooks-0.069
bles-0.065
ossom-0.065
Token hear
Token ID3285
Feature activation+7.09e-02

Pos logit contributions
 exactly+0.045
ook+0.041
ooks+0.039
bles+0.037
ents+0.037
Neg logit contributions
Mine-0.063
 care-0.059
Stop-0.057
hered-0.055
 breaths-0.055
Token their
Token ID511
Feature activation-1.78e-02

Pos logit contributions
 exactly+0.129
ook+0.121
ooks+0.117
bles+0.112
ossom+0.111
Neg logit contributions
Mine-0.118
 care-0.112
Stop-0.104
hered-0.100
 breaths-0.099
Token own
Token ID898
Feature activation+8.84e-03

Pos logit contributions
Mine+0.038
 care+0.036
Stop+0.034
hered+0.033
 breaths+0.033
Neg logit contributions
 exactly-0.025
ook-0.022
ooks-0.021
bles-0.020
ents-0.020
Token breaths
Token ID45576
Feature activation-2.07e-01

Pos logit contributions
 exactly+0.017
ook+0.015
ooks+0.015
bles+0.014
ents+0.014
Neg logit contributions
Mine-0.014
 care-0.014
Stop-0.012
hered-0.012
 breaths-0.012
Token.
Token ID13
Feature activation+1.06e-03

Pos logit contributions
Mine+0.395
 care+0.360
Stop+0.349
 breaths+0.331
|+0.331
Neg logit contributions
 exactly-0.346
ook-0.312
ooks-0.297
ents-0.294
bles-0.293
Token
Token ID198
Feature activation+5.66e-03

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
ich+0.002
aches+0.002
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
|-0.001
Token
Token ID198
Feature activation+4.89e-03

Pos logit contributions
 exactly+0.011
ook+0.010
ooks+0.010
bles+0.009
ossom+0.009
Neg logit contributions
Mine-0.009
 care-0.008
Stop-0.008
hered-0.007
|-0.007
Token"
Token ID1
Feature activation-2.35e-03

Pos logit contributions
 exactly+0.008
ook+0.006
ich+0.006
ooks+0.006
ents+0.006
Neg logit contributions
Mine-0.010
Stop-0.009
 care-0.009
Hey-0.008
hered-0.008
TokenWow
Token ID22017
Feature activation-4.21e-02

Pos logit contributions
Mine+0.003
 care+0.003
 breaths+0.003
hered+0.003
Stop+0.002
Neg logit contributions
 exactly-0.005
ook-0.005
ooks-0.005
ossom-0.004
bles-0.004
Token moments
Token ID7188
Feature activation-8.24e-02

Pos logit contributions
Mine+0.178
Stop+0.153
 care+0.151
|+0.151
hered+0.150
Neg logit contributions
 exactly-0.181
ook-0.161
ooks-0.154
 impressed-0.149
ents-0.148
Token and
Token ID290
Feature activation+6.22e-03

Pos logit contributions
Mine+0.157
 care+0.142
Stop+0.139
hered+0.129
 breaths+0.129
Neg logit contributions
 exactly-0.142
ook-0.127
ooks-0.122
ents-0.117
 impressed-0.113
Token took
Token ID1718
Feature activation+2.11e-02

Pos logit contributions
 exactly+0.012
ook+0.010
ooks+0.010
 impressed+0.010
ents+0.010
Neg logit contributions
Mine-0.010
 care-0.009
Stop-0.009
hered-0.009
|-0.009
Token some
Token ID617
Feature activation-4.68e-02

Pos logit contributions
 exactly+0.039
ook+0.035
ooks+0.032
aze+0.032
ups+0.031
Neg logit contributions
Mine-0.035
Stop-0.031
|-0.030
storm-0.030
 imag-0.029
Token deep
Token ID2769
Feature activation-4.42e-02

Pos logit contributions
Mine+0.096
Stop+0.085
|+0.082
hered+0.079
storm+0.077
Neg logit contributions
 exactly-0.076
ook-0.062
ents-0.061
ooks-0.058
aches-0.057
Token breaths
Token ID45576
Feature activation-2.06e-01

Pos logit contributions
Mine+0.055
Stop+0.046
storm+0.040
|+0.039
hered+0.037
Neg logit contributions
 exactly-0.100
ook-0.093
ooks-0.092
 taller-0.091
ents-0.089
Token,
Token ID11
Feature activation-8.37e-04

Pos logit contributions
Mine+0.397
 care+0.358
Stop+0.357
 imag+0.333
|+0.331
Neg logit contributions
 exactly-0.348
ook-0.314
ooks-0.297
ents-0.294
iring-0.288
Token before
Token ID878
Feature activation+3.14e-02

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
 breaths+0.001
hered+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
bles-0.001
iring-0.001
Token carefully
Token ID7773
Feature activation+1.86e-03

Pos logit contributions
 exactly+0.053
ook+0.045
ooks+0.044
 impressed+0.043
ents+0.042
Neg logit contributions
Mine-0.059
 care-0.055
Stop-0.052
|-0.051
hered-0.051
Token putting
Token ID5137
Feature activation-4.17e-02

Pos logit contributions
 exactly+0.003
ook+0.003
ooks+0.003
ents+0.003
aze+0.003
Neg logit contributions
Mine-0.003
Stop-0.003
 care-0.003
|-0.003
hered-0.003
Token her
Token ID607
Feature activation+1.57e-02

Pos logit contributions
Mine+0.061
 care+0.055
Stop+0.053
hered+0.051
|+0.050
Neg logit contributions
 exactly-0.086
ook-0.079
ooks-0.076
bles-0.074
ents-0.073
Token girl
Token ID2576
Feature activation-5.58e-02

Pos logit contributions
Mine+0.052
 care+0.048
|+0.048
Stop+0.046
 imag+0.045
Neg logit contributions
 exactly-0.046
ook-0.040
 impressed-0.038
ae-0.038
ooks-0.037
Token took
Token ID1718
Feature activation+1.96e-02

Pos logit contributions
Mine+0.087
 care+0.082
Stop+0.075
hered+0.073
 breaths+0.072
Neg logit contributions
 exactly-0.107
ook-0.100
ooks-0.098
bles-0.094
iring-0.092
Token a
Token ID257
Feature activation-6.08e-04

Pos logit contributions
 exactly+0.035
ook+0.031
ooks+0.029
ae+0.029
aze+0.028
Neg logit contributions
Mine-0.034
Stop-0.030
|-0.029
 care-0.029
storm-0.029
Token few
Token ID1178
Feature activation-1.00e-01

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
 breaths+0.001
|+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
 impressed-0.001
ooks-0.001
ossom-0.001
Token deep
Token ID2769
Feature activation-4.38e-02

Pos logit contributions
Mine+0.165
|+0.150
Stop+0.142
 care+0.141
storm+0.138
Neg logit contributions
 exactly-0.187
ook-0.164
 impressed-0.160
ents-0.157
ooks-0.155
Token breaths
Token ID45576
Feature activation-2.06e-01

Pos logit contributions
Mine+0.045
|+0.037
storm+0.036
Stop+0.035
Ro+0.029
Neg logit contributions
 exactly-0.102
 taller-0.097
ook-0.095
 impressed-0.091
ents-0.090
Token and
Token ID290
Feature activation+9.36e-03

Pos logit contributions
Mine+0.430
Stop+0.387
 care+0.386
|+0.371
 imag+0.364
Neg logit contributions
 exactly-0.320
ook-0.281
ents-0.263
ooks-0.262
iring-0.260
Token started
Token ID2067
Feature activation+2.91e-02

Pos logit contributions
 exactly+0.018
ook+0.016
ooks+0.016
bles+0.015
ents+0.015
Neg logit contributions
Mine-0.015
 care-0.014
Stop-0.013
hered-0.013
 breaths-0.013
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
 exactly+0.052
ook+0.048
ooks+0.046
bles+0.044
ents+0.044
Neg logit contributions
Mine-0.051
 care-0.048
Stop-0.044
hered-0.043
 breaths-0.043
Token feel
Token ID1254
Feature activation-1.55e-02

Pos logit contributions
Mine+0.023
Stop+0.020
 care+0.020
hered+0.020
|+0.019
Neg logit contributions
 exactly-0.033
ook-0.029
 impressed-0.028
ossom-0.028
ooks-0.028
Token better
Token ID1365
Feature activation+2.29e-02

Pos logit contributions
Mine+0.035
|+0.032
Stop+0.031
storm+0.031
hered+0.031
Neg logit contributions
 exactly-0.021
 impressed-0.019
ook-0.017
 pleased-0.016
 taller-0.016
Token his
Token ID465
Feature activation-2.09e-02

Pos logit contributions
Mine+0.160
 care+0.146
Stop+0.141
|+0.140
hered+0.138
Neg logit contributions
 exactly-0.165
ook-0.148
ooks-0.140
ents-0.136
 impressed-0.136
Token muscles
Token ID12749
Feature activation-5.19e-02

Pos logit contributions
Mine+0.039
 care+0.036
Stop+0.035
|+0.034
hered+0.034
Neg logit contributions
 exactly-0.035
ook-0.031
ooks-0.030
bles-0.030
ossom-0.029
Token,
Token ID11
Feature activation-2.54e-03

Pos logit contributions
Mine+0.102
Stop+0.092
|+0.092
 care+0.091
 breaths+0.087
Neg logit contributions
 exactly-0.082
ook-0.069
 impressed-0.069
bles-0.068
ents-0.068
Token taking
Token ID2263
Feature activation+3.50e-02

Pos logit contributions
Mine+0.004
 care+0.004
Stop+0.004
hered+0.004
 breaths+0.004
Neg logit contributions
 exactly-0.005
ook-0.004
ooks-0.004
ossom-0.004
bles-0.004
Token deep
Token ID2769
Feature activation-4.60e-02

Pos logit contributions
 exactly+0.071
ook+0.063
ooks+0.060
aze+0.059
 impressed+0.059
Neg logit contributions
Mine-0.050
|-0.044
Stop-0.042
storm-0.041
hered-0.040
Token breaths
Token ID45576
Feature activation-2.06e-01

Pos logit contributions
Mine+0.059
|+0.050
Stop+0.045
storm+0.042
hered+0.042
Neg logit contributions
 exactly-0.102
ook-0.091
bles-0.091
ooks-0.090
ups-0.090
Token as
Token ID355
Feature activation-2.33e-02

Pos logit contributions
Mine+0.406
 care+0.371
Stop+0.363
|+0.346
 breaths+0.340
Neg logit contributions
 exactly-0.332
ook-0.298
ooks-0.283
ents-0.276
bles-0.274
Token he
Token ID339
Feature activation-9.55e-03

Pos logit contributions
Mine+0.037
 care+0.036
Stop+0.032
 breaths+0.030
hered+0.030
Neg logit contributions
 exactly-0.044
ook-0.041
ooks-0.040
ossom-0.039
ents-0.039
Token did
Token ID750
Feature activation+5.65e-02

Pos logit contributions
Mine+0.016
 care+0.014
Stop+0.014
hered+0.013
|+0.013
Neg logit contributions
 exactly-0.018
ook-0.016
ents-0.015
bles-0.015
ber-0.015
Token so
Token ID523
Feature activation-4.56e-04

Pos logit contributions
 exactly+0.108
ook+0.103
ooks+0.101
anut+0.098
ossom+0.097
Neg logit contributions
Mine-0.084
 care-0.084
Stop-0.073
hered-0.071
 breaths-0.070
Token.
Token ID13
Feature activation-5.65e-03

Pos logit contributions
 care+0.001
Mine+0.001
Stop+0.001
 breaths+0.001
hered+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
ich-0.001
anut-0.001
Token to
Token ID284
Feature activation-1.73e-02

Pos logit contributions
 exactly+0.051
ook+0.048
ooks+0.046
bles+0.044
ossom+0.044
Neg logit contributions
Mine-0.050
 care-0.048
Stop-0.044
hered-0.043
 breaths-0.042
Token take
Token ID1011
Feature activation+3.19e-02

Pos logit contributions
Mine+0.025
Stop+0.022
 care+0.022
hered+0.021
|+0.020
Neg logit contributions
 exactly-0.036
ook-0.032
ooks-0.031
ossom-0.031
aches-0.030
Token nice
Token ID3621
Feature activation-9.45e-03

Pos logit contributions
 exactly+0.052
ook+0.045
ooks+0.042
aze+0.041
aches+0.039
Neg logit contributions
Mine-0.063
Stop-0.055
storm-0.053
|-0.052
hered-0.051
Token,
Token ID11
Feature activation+2.09e-04

Pos logit contributions
Mine+0.017
|+0.015
Stop+0.015
storm+0.014
Hey+0.014
Neg logit contributions
 exactly-0.017
ook-0.014
ooks-0.014
ents-0.014
ups-0.014
Token deep
Token ID2769
Feature activation-4.41e-02

Pos logit contributions
 exactly+0.000
ook+0.000
ooks+0.000
bles+0.000
ossom+0.000
Neg logit contributions
Mine-0.000
 care-0.000
Stop-0.000
 breaths-0.000
hered-0.000
Token breaths
Token ID45576
Feature activation-2.05e-01

Pos logit contributions
Mine+0.070
Stop+0.059
|+0.058
 care+0.056
hered+0.054
Neg logit contributions
 exactly-0.090
ook-0.081
ooks-0.080
iring-0.079
ents-0.078
Token.
Token ID13
Feature activation-1.50e-03

Pos logit contributions
Mine+0.383
 care+0.350
Stop+0.340
|+0.323
 imag+0.318
Neg logit contributions
 exactly-0.351
ook-0.321
ooks-0.307
ents-0.297
bles-0.296
Token Slowly
Token ID46484
Feature activation-1.42e-02

Pos logit contributions
Mine+0.002
 care+0.002
Stop+0.002
 breaths+0.002
|+0.002
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.003
bles-0.003
anut-0.003
Token,
Token ID11
Feature activation-1.36e-03

Pos logit contributions
Mine+0.027
 care+0.023
Stop+0.023
|+0.023
hered+0.023
Neg logit contributions
 exactly-0.025
ook-0.022
ooks-0.021
ents-0.020
bles-0.020
Token he
Token ID339
Feature activation-6.51e-03

Pos logit contributions
 care+0.002
Mine+0.002
Stop+0.002
 him+0.002
 breaths+0.002
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.003
ossom-0.003
anut-0.003
Token felt
Token ID2936
Feature activation-1.21e-02

Pos logit contributions
Mine+0.010
 care+0.009
Stop+0.008
 breaths+0.008
hered+0.008
Neg logit contributions
 exactly-0.013
ook-0.012
ooks-0.012
iring-0.011
bles-0.011
Token face
Token ID1986
Feature activation+3.13e-02

Pos logit contributions
Mine+0.032
 care+0.030
Stop+0.029
|+0.028
hered+0.028
Neg logit contributions
 exactly-0.025
ook-0.023
ooks-0.022
bles-0.021
ents-0.021
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
 exactly+0.056
ook+0.053
ooks+0.052
bles+0.049
anut+0.049
Neg logit contributions
Mine-0.052
 care-0.049
Stop-0.045
hered-0.045
 breaths-0.043
Token he
Token ID339
Feature activation-4.21e-03

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.016
 impressed+0.016
ents+0.016
Neg logit contributions
Mine-0.018
 care-0.016
Stop-0.015
hered-0.015
|-0.015
Token took
Token ID1718
Feature activation+2.09e-02

Pos logit contributions
Mine+0.007
 care+0.006
Stop+0.006
hered+0.006
 breaths+0.006
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
ents-0.007
bles-0.007
Token deep
Token ID2769
Feature activation-4.40e-02

Pos logit contributions
 exactly+0.035
ook+0.029
aze+0.027
ooks+0.027
 careful+0.025
Neg logit contributions
Mine-0.041
storm-0.036
|-0.036
Stop-0.035
hered-0.033
Token breaths
Token ID45576
Feature activation-2.05e-01

Pos logit contributions
Mine+0.049
Stop+0.036
|+0.034
storm+0.032
hered+0.032
Neg logit contributions
 exactly-0.112
ook-0.099
ooks-0.099
 taller-0.097
ents-0.097
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
Mine+0.363
 care+0.336
Stop+0.322
hered+0.304
|+0.302
Neg logit contributions
 exactly-0.365
ook-0.335
ooks-0.322
bles-0.308
ents-0.308
Token
Token ID198
Feature activation+1.02e-02

Pos logit contributions
 exactly+0.007
ich+0.007
aches+0.007
keys+0.007
ber+0.007
Neg logit contributions
Mine-0.003
 care-0.003
Stop-0.003
 him-0.003
 Bobby-0.003
Token
Token ID198
Feature activation+9.26e-03

Pos logit contributions
 exactly+0.024
anut+0.022
ich+0.021
sters+0.021
ossom+0.021
Neg logit contributions
Mine-0.012
 care-0.011
Stop-0.010
|-0.010
 him-0.009
TokenAt
Token ID2953
Feature activation+3.56e-02

Pos logit contributions
 exactly+0.018
ich+0.017
sters+0.016
ber+0.016
anut+0.016
Neg logit contributions
Stop-0.015
Mine-0.015
 care-0.014
Hey-0.013
Bang-0.013
Token the
Token ID262
Feature activation+2.00e-02

Pos logit contributions
 exactly+0.060
ook+0.051
ae+0.048
ooks+0.046
 impressed+0.046
Neg logit contributions
Mine-0.071
Stop-0.062
 care-0.060
hered-0.060
ez-0.058
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
Mine+0.034
 care+0.033
Stop+0.029
 him+0.028
 Bobby+0.027
Neg logit contributions
 exactly-0.052
ook-0.050
ossom-0.050
ooks-0.049
ae-0.048
Token big
Token ID1263
Feature activation-6.12e-03

Pos logit contributions
 exactly+0.032
ook+0.029
ooks+0.028
ossom+0.027
ae+0.027
Neg logit contributions
Mine-0.038
 care-0.037
Stop-0.035
hered-0.032
|-0.032
Token boy
Token ID2933
Feature activation-5.99e-02

Pos logit contributions
Mine+0.011
 care+0.010
Stop+0.009
hered+0.009
 breaths+0.009
Neg logit contributions
 exactly-0.011
ook-0.010
ooks-0.009
bles-0.009
 impressed-0.009
Token take
Token ID1011
Feature activation+3.27e-02

Pos logit contributions
Mine+0.099
 care+0.088
Stop+0.086
hered+0.084
|+0.082
Neg logit contributions
 exactly-0.114
ook-0.104
ooks-0.099
bles-0.096
ents-0.095
Token deep
Token ID2769
Feature activation-4.41e-02

Pos logit contributions
 exactly+0.055
ook+0.047
ups+0.045
ooks+0.042
aze+0.042
Neg logit contributions
Mine-0.058
storm-0.052
|-0.050
Stop-0.048
use-0.047
Token breaths
Token ID45576
Feature activation-2.05e-01

Pos logit contributions
Mine+0.056
|+0.043
storm+0.040
Stop+0.040
hered+0.036
Neg logit contributions
 exactly-0.096
 taller-0.090
bles-0.087
ups-0.087
ents-0.086
Token and
Token ID290
Feature activation+9.78e-03

Pos logit contributions
Mine+0.454
Stop+0.409
 care+0.405
 imag+0.385
 breaths+0.384
Neg logit contributions
 exactly-0.298
ook-0.258
ents-0.243
ooks-0.238
iring-0.234
Token count
Token ID954
Feature activation+7.49e-02

Pos logit contributions
 exactly+0.021
ook+0.018
ooks+0.017
 impressed+0.017
aches+0.017
Neg logit contributions
Mine-0.014
Stop-0.012
hered-0.012
 care-0.012
|-0.012
Token to
Token ID284
Feature activation-1.41e-02

Pos logit contributions
 exactly+0.133
ook+0.123
ooks+0.119
bles+0.113
ents+0.113
Neg logit contributions
Mine-0.129
 care-0.122
Stop-0.114
hered-0.110
 breaths-0.108
Token ten
Token ID3478
Feature activation+5.48e-02

Pos logit contributions
Mine+0.023
 care+0.020
hered+0.020
Stop+0.019
|+0.019
Neg logit contributions
 exactly-0.027
ook-0.024
ooks-0.023
ents-0.022
 impressed-0.022
Token instead
Token ID2427
Feature activation-2.63e-03

Pos logit contributions
 exactly+0.103
ook+0.096
ooks+0.094
ossom+0.090
anut+0.088
Neg logit contributions
Mine-0.089
 care-0.085
Stop-0.079
 breaths-0.076
hered-0.075
Token her
Token ID607
Feature activation+8.26e-03

Pos logit contributions
 exactly+0.053
ook+0.048
ooks+0.047
bles+0.045
ents+0.045
Neg logit contributions
Mine-0.041
 care-0.038
Stop-0.035
hered-0.034
|-0.034
Token eyes
Token ID2951
Feature activation-1.96e-02

Pos logit contributions
 exactly+0.014
ook+0.013
ooks+0.013
bles+0.012
 impressed+0.012
Neg logit contributions
Mine-0.015
 care-0.014
 breaths-0.013
Stop-0.013
 imag-0.013
Token and
Token ID290
Feature activation+6.24e-03

Pos logit contributions
Mine+0.031
 care+0.030
hered+0.027
Stop+0.027
 breaths+0.027
Neg logit contributions
 exactly-0.036
ook-0.034
ooks-0.034
bles-0.032
ich-0.032
Token took
Token ID1718
Feature activation+2.46e-02

Pos logit contributions
 exactly+0.012
ook+0.011
ooks+0.011
bles+0.010
ents+0.010
Neg logit contributions
Mine-0.010
 care-0.009
Stop-0.009
hered-0.008
 breaths-0.008
Token deep
Token ID2769
Feature activation-3.59e-02

Pos logit contributions
 exactly+0.042
ook+0.038
ooks+0.035
ae+0.034
aze+0.034
Neg logit contributions
Mine-0.045
Stop-0.040
|-0.039
 care-0.038
hered-0.038
Token breaths
Token ID45576
Feature activation-2.04e-01

Pos logit contributions
Mine+0.040
Stop+0.029
|+0.029
storm+0.028
hered+0.026
Neg logit contributions
 exactly-0.090
aches-0.081
 taller-0.080
ents-0.079
ook-0.079
Token.
Token ID13
Feature activation-6.20e-03

Pos logit contributions
Mine+0.409
Stop+0.370
 care+0.366
|+0.342
 imag+0.342
Neg logit contributions
 exactly-0.336
ook-0.301
ooks-0.287
ents-0.282
bles-0.275
Token Soon
Token ID15894
Feature activation-3.38e-02

Pos logit contributions
Mine+0.009
 care+0.008
Stop+0.007
hered+0.007
 breaths+0.007
Neg logit contributions
 exactly-0.013
ook-0.012
ooks-0.012
anut-0.012
bles-0.012
Token,
Token ID11
Feature activation-3.55e-03

Pos logit contributions
 care+0.048
Mine+0.047
Stop+0.041
hered+0.041
 breaths+0.041
Neg logit contributions
 exactly-0.068
ook-0.065
ooks-0.064
anut-0.063
ich-0.062
Token she
Token ID673
Feature activation+1.75e-02

Pos logit contributions
Mine+0.005
 care+0.005
Stop+0.005
hered+0.004
 him+0.004
Neg logit contributions
 exactly-0.007
ook-0.007
ooks-0.007
ossom-0.006
anut-0.006
Token felt
Token ID2936
Feature activation-1.27e-02

Pos logit contributions
 exactly+0.034
ook+0.032
ooks+0.031
bles+0.030
ents+0.030
Neg logit contributions
Mine-0.027
 care-0.025
Stop-0.023
hered-0.023
 breaths-0.022
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
 exactly+0.047
ook+0.039
ooks+0.039
ents+0.038
iring+0.037
Neg logit contributions
Mine-0.065
Stop-0.058
 care-0.055
Hey-0.054
hered-0.053
Token trem
Token ID10023
Feature activation+2.36e-02

Pos logit contributions
Mine+0.030
Stop+0.027
hered+0.025
 imag+0.024
Hey+0.024
Neg logit contributions
 exactly-0.029
 taller-0.027
aches-0.027
ents-0.025
 impressed-0.024
Tokenble
Token ID903
Feature activation+2.90e-02

Pos logit contributions
 exactly+0.024
ook+0.019
 impressed+0.018
ooks+0.018
ae+0.018
Neg logit contributions
Mine-0.055
 care-0.052
Stop-0.051
 breaths-0.051
 him-0.051
Token and
Token ID290
Feature activation+6.15e-03

Pos logit contributions
 exactly+0.055
ook+0.051
ooks+0.050
bles+0.048
ossom+0.047
Neg logit contributions
Mine-0.046
 care-0.044
Stop-0.039
hered-0.039
 breaths-0.039
Token their
Token ID511
Feature activation-1.67e-02

Pos logit contributions
 exactly+0.011
 impressed+0.009
aches+0.009
ents+0.009
ook+0.009
Neg logit contributions
Mine-0.011
Stop-0.010
hered-0.010
Hey-0.009
|-0.009
Token breaths
Token ID45576
Feature activation-2.04e-01

Pos logit contributions
Mine+0.031
hered+0.029
Stop+0.029
Hey+0.027
|+0.026
Neg logit contributions
 exactly-0.028
ook-0.024
ber-0.023
ooks-0.023
ich-0.023
Token became
Token ID2627
Feature activation+5.07e-02

Pos logit contributions
Mine+0.415
Stop+0.366
 care+0.351
 imag+0.347
 breaths+0.344
Neg logit contributions
 exactly-0.316
bles-0.279
ups-0.279
ook-0.276
ents-0.271
Token slow
Token ID3105
Feature activation-5.04e-02

Pos logit contributions
 exactly+0.086
ook+0.082
ooks+0.078
ossom+0.074
ents+0.074
Neg logit contributions
Mine-0.089
 care-0.086
Stop-0.079
 breaths-0.077
hered-0.076
Token.
Token ID13
Feature activation-6.50e-03

Pos logit contributions
Mine+0.116
Stop+0.105
|+0.101
 care+0.100
Hey+0.098
Neg logit contributions
 exactly-0.067
ook-0.055
bles-0.055
ents-0.054
iring-0.053
Token
Token ID198
Feature activation-8.14e-04

Pos logit contributions
Mine+0.010
 care+0.009
Stop+0.009
hered+0.008
 breaths+0.008
Neg logit contributions
 exactly-0.013
ook-0.012
ooks-0.012
bles-0.011
ents-0.011
Token
Token ID198
Feature activation-9.65e-04

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
|+0.001
hered+0.001
Neg logit contributions
 exactly-0.002
ook-0.001
ooks-0.001
bles-0.001
ossom-0.001
Token they
Token ID484
Feature activation-2.73e-02

Pos logit contributions
 exactly+0.047
bles+0.044
ook+0.044
ooks+0.044
anut+0.044
Neg logit contributions
 care-0.034
Mine-0.033
 him-0.029
Stop-0.029
 Bobby-0.026
Token do
Token ID466
Feature activation+3.69e-02

Pos logit contributions
Mine+0.038
 care+0.034
Stop+0.033
hered+0.032
|+0.031
Neg logit contributions
 exactly-0.057
ook-0.053
ooks-0.050
ents-0.050
bles-0.049
Token it
Token ID340
Feature activation-1.27e-02

Pos logit contributions
 exactly+0.069
ook+0.066
ooks+0.065
ossom+0.062
anut+0.062
Neg logit contributions
 care-0.057
Mine-0.056
Stop-0.049
 him-0.047
 breaths-0.047
Token.
Token ID13
Feature activation-6.76e-03

Pos logit contributions
Mine+0.022
 care+0.020
Stop+0.019
hered+0.018
 breaths+0.018
Neg logit contributions
 exactly-0.023
ook-0.021
ooks-0.020
bles-0.020
ents-0.020
Token He
Token ID679
Feature activation-1.04e-02

Pos logit contributions
Mine+0.009
 care+0.008
Stop+0.007
|+0.007
hered+0.007
Neg logit contributions
 exactly-0.015
ook-0.014
ooks-0.013
bles-0.013
anut-0.013
Token imag
Token ID3590
Feature activation-1.67e-01

Pos logit contributions
Mine+0.014
 care+0.013
Stop+0.012
 breaths+0.011
 imag+0.011
Neg logit contributions
 exactly-0.022
ook-0.021
ooks-0.021
bles-0.020
ossom-0.020
Tokenines
Token ID1127
Feature activation-1.59e-02

Pos logit contributions
Mine+0.334
 care+0.315
Stop+0.299
hered+0.291
 breaths+0.285
Neg logit contributions
 exactly-0.255
ook-0.232
ooks-0.223
bles-0.209
ents-0.208
Token he
Token ID339
Feature activation-1.23e-02

Pos logit contributions
Mine+0.027
 care+0.025
Stop+0.024
hered+0.023
|+0.023
Neg logit contributions
 exactly-0.029
ook-0.026
ooks-0.025
ents-0.024
bles-0.024
Token is
Token ID318
Feature activation-2.65e-02

Pos logit contributions
Mine+0.019
 care+0.017
hered+0.016
Stop+0.016
|+0.016
Neg logit contributions
 exactly-0.024
ook-0.022
ents-0.021
bles-0.021
ooks-0.020
Token one
Token ID530
Feature activation-3.46e-02

Pos logit contributions
 care+0.038
Mine+0.035
 him+0.035
 breaths+0.032
 imag+0.032
Neg logit contributions
ook-0.049
ooks-0.048
 exactly-0.048
ossom-0.048
ents-0.044
Token of
Token ID286
Feature activation-1.10e-02

Pos logit contributions
 care+0.053
Mine+0.052
 applause+0.045
 need+0.044
hered+0.044
Neg logit contributions
ooks-0.061
 exactly-0.060
ook-0.057
ossom-0.057
bles-0.057
Token listen
Token ID6004
Feature activation+6.30e-02

Pos logit contributions
 exactly+0.140
ook+0.129
aches+0.125
aze+0.125
ooks+0.123
Neg logit contributions
Mine-0.118
 breaths-0.104
 imag-0.099
|-0.096
hered-0.096
Token.
Token ID13
Feature activation-4.30e-03

Pos logit contributions
 exactly+0.117
ook+0.110
ooks+0.108
bles+0.102
anut+0.101
Neg logit contributions
Mine-0.101
 care-0.099
Stop-0.090
hered-0.086
 breaths-0.085
Token She
Token ID1375
Feature activation+2.19e-02

Pos logit contributions
Mine+0.005
 care+0.005
Stop+0.004
 breaths+0.004
hered+0.004
Neg logit contributions
 exactly-0.010
ook-0.009
anut-0.009
aches-0.009
ooks-0.009
Token threw
Token ID9617
Feature activation-1.69e-02

Pos logit contributions
 exactly+0.041
ook+0.038
ooks+0.037
bles+0.035
ents+0.035
Neg logit contributions
Mine-0.036
 care-0.034
Stop-0.031
hered-0.030
 breaths-0.030
Token a
Token ID257
Feature activation-6.91e-03

Pos logit contributions
Mine+0.029
storm+0.028
|+0.028
hered+0.027
 care+0.027
Neg logit contributions
 exactly-0.027
ents-0.023
ook-0.022
bles-0.022
 taller-0.022
Token tant
Token ID24246
Feature activation-1.66e-01

Pos logit contributions
Mine+0.009
 care+0.009
 breaths+0.008
Stop+0.008
 imag+0.008
Neg logit contributions
 exactly-0.014
ook-0.013
 impressed-0.013
bles-0.013
ossom-0.012
Tokenrum
Token ID6582
Feature activation-5.15e-02

Pos logit contributions
Mine+0.289
 care+0.258
Stop+0.251
|+0.244
 imag+0.240
Neg logit contributions
 exactly-0.294
ook-0.288
ooks-0.270
bles-0.259
anut-0.251
Token in
Token ID287
Feature activation-1.63e-02

Pos logit contributions
Mine+0.097
 care+0.091
Stop+0.085
|+0.083
hered+0.083
Neg logit contributions
 exactly-0.086
ook-0.078
ooks-0.075
bles-0.073
iring-0.072
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
storm+0.031
Mine+0.030
|+0.029
 imag+0.029
Stop+0.028
Neg logit contributions
 exactly-0.024
ook-0.020
ents-0.019
ooks-0.018
 impressed-0.018
Token store
Token ID3650
Feature activation+3.28e-02

Pos logit contributions
 exactly+0.022
bles+0.019
ooks+0.019
ook+0.019
 impressed+0.019
Neg logit contributions
Mine-0.016
hered-0.014
 care-0.014
storm-0.014
|-0.013
Token.
Token ID13
Feature activation-5.55e-03

Pos logit contributions
 exactly+0.063
ook+0.060
ooks+0.059
ossom+0.057
ae+0.055
Neg logit contributions
Mine-0.049
 care-0.048
hered-0.043
Stop-0.043
 breaths-0.041
Token kept
Token ID4030
Feature activation-2.85e-02

Pos logit contributions
 exactly+0.052
ook+0.048
ents+0.045
ooks+0.045
anut+0.044
Neg logit contributions
Mine-0.041
 care-0.038
Stop-0.035
hered-0.035
|-0.034
Token crying
Token ID13774
Feature activation-7.72e-02

Pos logit contributions
Mine+0.050
 care+0.046
Stop+0.044
|+0.043
hered+0.042
Neg logit contributions
 exactly-0.051
ook-0.044
 impressed-0.043
ooks-0.042
anut-0.042
Token and
Token ID290
Feature activation+5.33e-03

Pos logit contributions
Mine+0.137
 care+0.130
Stop+0.121
hered+0.117
 breaths+0.116
Neg logit contributions
 exactly-0.133
ook-0.124
ooks-0.120
bles-0.113
ossom-0.113
Token throwing
Token ID9644
Feature activation-1.55e-02

Pos logit contributions
 exactly+0.010
ook+0.008
 impressed+0.008
ooks+0.008
ents+0.008
Neg logit contributions
Mine-0.010
 care-0.009
|-0.008
hered-0.008
Stop-0.008
Token a
Token ID257
Feature activation-4.33e-03

Pos logit contributions
Mine+0.025
|+0.024
hered+0.022
 imag+0.022
Stop+0.022
Neg logit contributions
 exactly-0.028
ook-0.024
aches-0.023
ooks-0.023
bles-0.023
Token tant
Token ID24246
Feature activation-1.66e-01

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
 imag+0.005
|+0.005
Neg logit contributions
 exactly-0.009
ook-0.008
ooks-0.008
ossom-0.008
 impressed-0.008
Tokenrum
Token ID6582
Feature activation-5.09e-02

Pos logit contributions
Mine+0.285
 care+0.254
Stop+0.250
|+0.236
 imag+0.236
Neg logit contributions
 exactly-0.298
ook-0.290
ooks-0.273
bles-0.259
ossom-0.254
Token.
Token ID13
Feature activation-3.95e-03

Pos logit contributions
Mine+0.093
 care+0.087
Stop+0.082
|+0.080
hered+0.079
Neg logit contributions
 exactly-0.089
ook-0.080
ooks-0.076
bles-0.074
iring-0.073
Token
Token ID198
Feature activation+3.25e-03

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
hered+0.005
 breaths+0.005
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
bles-0.007
ents-0.007
Token
Token ID198
Feature activation+3.13e-03

Pos logit contributions
 exactly+0.006
ook+0.006
ooks+0.006
bles+0.005
ents+0.005
Neg logit contributions
Mine-0.005
 care-0.005
Stop-0.004
hered-0.004
|-0.004
TokenFinally
Token ID11158
Feature activation+2.22e-02

Pos logit contributions
 exactly+0.005
ook+0.004
ooks+0.004
bles+0.004
castle+0.004
Neg logit contributions
Mine-0.007
Stop-0.006
 care-0.006
|-0.006
hered-0.006
Token them
Token ID606
Feature activation-5.80e-02

Pos logit contributions
Mine+0.023
 care+0.021
Stop+0.020
hered+0.019
 breaths+0.019
Neg logit contributions
 exactly-0.027
ook-0.025
ooks-0.025
bles-0.024
ossom-0.024
Token and
Token ID290
Feature activation+9.85e-03

Pos logit contributions
Mine+0.092
 care+0.092
hered+0.079
Stop+0.079
 breaths+0.079
Neg logit contributions
 exactly-0.104
ook-0.100
ooks-0.098
bles-0.091
iring-0.091
Token never
Token ID1239
Feature activation+4.39e-02

Pos logit contributions
 exactly+0.016
ook+0.014
ooks+0.014
 impressed+0.013
ents+0.013
Neg logit contributions
Mine-0.019
 care-0.017
Stop-0.017
hered-0.016
|-0.016
Token threw
Token ID9617
Feature activation-7.43e-03

Pos logit contributions
 exactly+0.084
ook+0.075
ents+0.073
 impressed+0.073
ooks+0.072
Neg logit contributions
Mine-0.070
Stop-0.060
 care-0.060
hered-0.057
 breaths-0.057
Token a
Token ID257
Feature activation-1.60e-03

Pos logit contributions
Mine+0.010
 care+0.009
hered+0.008
Stop+0.008
|+0.008
Neg logit contributions
 exactly-0.016
ook-0.014
ooks-0.014
anut-0.014
ents-0.014
Token tant
Token ID24246
Feature activation-1.65e-01

Pos logit contributions
Mine+0.003
 care+0.002
Stop+0.002
hered+0.002
 breaths+0.002
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.003
ossom-0.003
bles-0.003
Tokenrum
Token ID6582
Feature activation-4.91e-02

Pos logit contributions
Mine+0.304
Stop+0.268
 care+0.266
|+0.258
 bath+0.257
Neg logit contributions
 exactly-0.285
ook-0.281
ooks-0.262
aches-0.253
bles-0.250
Token again
Token ID757
Feature activation+1.54e-02

Pos logit contributions
Mine+0.092
 care+0.087
Stop+0.080
|+0.078
 breaths+0.078
Neg logit contributions
 exactly-0.083
ook-0.076
ooks-0.071
bles-0.070
iring-0.069
Token.
Token ID13
Feature activation-2.90e-03

Pos logit contributions
 exactly+0.029
ook+0.027
ooks+0.026
bles+0.025
ents+0.025
Neg logit contributions
Mine-0.025
 care-0.024
Stop-0.022
hered-0.021
 breaths-0.021
Token<|endoftext|>
Token ID50256
Feature activation-7.72e-02

Pos logit contributions
Mine+0.004
 care+0.004
|+0.003
Stop+0.003
hered+0.003
Neg logit contributions
 exactly-0.006
ook-0.006
bles-0.006
ooks-0.005
aches-0.005
TokenOnce
Token ID7454
Feature activation+5.94e-02

Pos logit contributions
Mine+0.146
 care+0.137
 breaths+0.131
Stop+0.127
 imag+0.125
Neg logit contributions
 exactly-0.121
ook-0.113
ooks-0.109
bles-0.101
iring-0.100
Token thank
Token ID5875
Feature activation+1.58e-02

Pos logit contributions
Mine+0.049
 care+0.044
Stop+0.042
hered+0.041
 breaths+0.040
Neg logit contributions
 exactly-0.081
ook-0.075
ooks-0.072
ents-0.071
 impressed-0.071
Token God
Token ID1793
Feature activation+5.20e-02

Pos logit contributions
 exactly+0.035
ook+0.034
ooks+0.033
ossom+0.032
ents+0.032
Neg logit contributions
 care-0.020
Mine-0.020
Stop-0.017
 him-0.017
hered-0.016
Token for
Token ID329
Feature activation-1.86e-02

Pos logit contributions
 exactly+0.106
ooks+0.104
ook+0.103
bles+0.096
ossom+0.096
Neg logit contributions
 care-0.074
hered-0.061
Mine-0.060
 him-0.059
 breaths-0.057
Token making
Token ID1642
Feature activation-4.35e-03

Pos logit contributions
Mine+0.032
 care+0.030
Stop+0.028
hered+0.027
 breaths+0.026
Neg logit contributions
 exactly-0.034
ook-0.031
ooks-0.030
bles-0.029
 impressed-0.029
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
Mine+0.005
 care+0.004
Stop+0.004
hered+0.004
 breaths+0.004
Neg logit contributions
 exactly-0.010
ook-0.010
ooks-0.010
bles-0.009
ossom-0.009
Token temple
Token ID12505
Feature activation+1.01e-01

Pos logit contributions
 exactly+0.020
ook+0.020
ooks+0.019
ents+0.018
bles+0.017
Neg logit contributions
Mine-0.029
 care-0.028
Stop-0.026
 imag-0.025
 breaths-0.025
Token.
Token ID13
Feature activation-4.38e-03

Pos logit contributions
ooks+0.212
ook+0.203
ich+0.202
 exactly+0.202
anut+0.196
Neg logit contributions
 care-0.136
 need-0.102
 him-0.101
 applause-0.099
 cry-0.098
Token They
Token ID1119
Feature activation-3.79e-02

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
hered+0.005
 breaths+0.005
Neg logit contributions
 exactly-0.009
ook-0.009
ooks-0.009
ents-0.008
iring-0.008
Token say
Token ID910
Feature activation+2.34e-02

Pos logit contributions
Mine+0.048
 care+0.043
Stop+0.041
hered+0.039
 breaths+0.038
Neg logit contributions
 exactly-0.085
ook-0.078
ooks-0.076
ents-0.074
bles-0.074
Token goodbye
Token ID24829
Feature activation-1.31e-02

Pos logit contributions
 exactly+0.044
ook+0.041
ooks+0.040
anut+0.038
bles+0.038
Neg logit contributions
Mine-0.038
 care-0.036
Stop-0.033
hered-0.032
 breaths-0.032
Token to
Token ID284
Feature activation-1.92e-02

Pos logit contributions
 care+0.019
Mine+0.017
 breaths+0.016
hered+0.016
Stop+0.015
Neg logit contributions
 exactly-0.027
ook-0.024
ooks-0.024
bles-0.024
aches-0.023
Token delicious
Token ID12625
Feature activation+9.46e-02

Pos logit contributions
 exactly+0.033
ook+0.030
ooks+0.029
bles+0.028
anut+0.027
Neg logit contributions
Mine-0.045
 care-0.044
Stop-0.040
 breaths-0.039
hered-0.039
Token everything
Token ID2279
Feature activation+2.03e-02

Pos logit contributions
 exactly+0.177
ook+0.149
aches+0.145
iring+0.142
ooks+0.140
Neg logit contributions
Mine-0.158
|-0.145
 care-0.139
hered-0.137
Stop-0.136
Token inside
Token ID2641
Feature activation-1.59e-02

Pos logit contributions
 exactly+0.037
ook+0.033
 impressed+0.031
ents+0.030
iring+0.030
Neg logit contributions
Mine-0.036
 care-0.032
Stop-0.031
hered-0.030
|-0.029
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
Mine+0.024
 care+0.022
Stop+0.021
hered+0.020
 breaths+0.020
Neg logit contributions
 exactly-0.032
ook-0.029
ooks-0.028
iring-0.027
 impressed-0.027
Token grill
Token ID29901
Feature activation+6.93e-02

Pos logit contributions
 exactly+0.025
ook+0.022
ooks+0.022
ents+0.021
bles+0.020
Neg logit contributions
Mine-0.028
 care-0.026
Stop-0.025
hered-0.025
|-0.024
Token smelled
Token ID37609
Feature activation+1.37e-01

Pos logit contributions
 exactly+0.129
ook+0.116
ooks+0.110
ents+0.109
bles+0.109
Neg logit contributions
Mine-0.115
 care-0.108
Stop-0.101
 breaths-0.097
|-0.097
Token!
Token ID0
Feature activation+2.88e-02

Pos logit contributions
 exactly+0.231
ook+0.197
ooks+0.189
 impressed+0.188
iring+0.183
Neg logit contributions
Mine-0.258
 care-0.233
Stop-0.228
|-0.224
hered-0.221
Token
Token ID198
Feature activation+5.58e-03

Pos logit contributions
 exactly+0.055
ook+0.051
ooks+0.050
ents+0.046
iring+0.046
Neg logit contributions
Mine-0.046
 care-0.044
Stop-0.041
 breaths-0.040
hered-0.040
Token
Token ID198
Feature activation+4.83e-03

Pos logit contributions
 exactly+0.010
ook+0.009
ooks+0.009
 impressed+0.008
bles+0.008
Neg logit contributions
Mine-0.010
 care-0.009
Stop-0.009
hered-0.008
 breaths-0.008
Token"
Token ID1
Feature activation+8.56e-04

Pos logit contributions
 exactly+0.008
ook+0.007
ooks+0.007
bles+0.007
ents+0.007
Neg logit contributions
Mine-0.009
 care-0.009
Stop-0.008
hered-0.008
 breaths-0.008
TokenDo
Token ID5211
Feature activation+1.40e-02

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
bles+0.001
iring+0.001
Neg logit contributions
Mine-0.001
 care-0.001
 breaths-0.001
Stop-0.001
hered-0.001
Token c
Token ID269
Feature activation-1.73e-02

Pos logit contributions
 exactly+0.008
 impressed+0.007
anut+0.007
bles+0.007
ook+0.006
Neg logit contributions
Mine-0.008
Stop-0.007
Hey-0.006
hered-0.006
|-0.006
Tokenuddled
Token ID32745
Feature activation+3.33e-02

Pos logit contributions
Mine+0.035
 care+0.035
Stop+0.032
 breaths+0.032
hered+0.031
Neg logit contributions
 exactly-0.024
ook-0.021
ooks-0.021
 impressed-0.020
ents-0.020
Token.
Token ID13
Feature activation-7.01e-03

Pos logit contributions
 exactly+0.061
ook+0.058
ooks+0.057
ents+0.055
bles+0.053
Neg logit contributions
Mine-0.054
 care-0.052
Stop-0.047
hered-0.046
|-0.045
Token They
Token ID1119
Feature activation-4.50e-02

Pos logit contributions
Mine+0.008
 care+0.008
Stop+0.007
|+0.006
hered+0.006
Neg logit contributions
 exactly-0.016
ook-0.015
ooks-0.015
bles-0.015
ents-0.014
Token forgot
Token ID16453
Feature activation+7.05e-02

Pos logit contributions
Mine+0.064
 care+0.057
Stop+0.055
 breaths+0.055
hered+0.054
Neg logit contributions
 exactly-0.094
ook-0.084
 impressed-0.082
ooks-0.081
anut-0.081
Token about
Token ID546
Feature activation+1.16e-01

Pos logit contributions
ook+0.092
 exactly+0.090
ooks+0.087
ich+0.086
ossom+0.085
Neg logit contributions
 care-0.144
Mine-0.144
 him-0.134
Stop-0.129
 breaths-0.125
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 exactly+0.220
ook+0.203
ooks+0.197
bles+0.188
 impressed+0.187
Neg logit contributions
Mine-0.186
 care-0.174
Stop-0.162
hered-0.157
 breaths-0.154
Token clock
Token ID8801
Feature activation+5.42e-02

Pos logit contributions
 exactly+0.019
ich+0.016
ae+0.016
 impressed+0.016
iring+0.016
Neg logit contributions
Mine-0.018
 care-0.016
hered-0.016
Stop-0.015
|-0.014
Token and
Token ID290
Feature activation+3.91e-03

Pos logit contributions
 exactly+0.094
ook+0.086
ooks+0.083
ents+0.079
bles+0.079
Neg logit contributions
Mine-0.097
 care-0.090
Stop-0.085
hered-0.082
 breaths-0.081
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
 exactly+0.008
ook+0.007
ooks+0.007
bles+0.007
ents+0.007
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.005
hered-0.005
 breaths-0.005
Token cold
Token ID4692
Feature activation-4.72e-02

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.016
ich+0.016
bles+0.016
Neg logit contributions
Mine-0.019
 care-0.018
hered-0.017
Stop-0.017
|-0.017
Token are
Token ID389
Feature activation-4.31e-02

Pos logit contributions
 exactly+0.124
ook+0.118
ooks+0.115
ossom+0.112
bles+0.111
Neg logit contributions
Mine-0.071
 care-0.069
Stop-0.061
|-0.056
 breaths-0.056
Token you
Token ID345
Feature activation-1.40e-02

Pos logit contributions
Mine+0.038
 care+0.037
Stop+0.033
 him+0.030
|+0.028
Neg logit contributions
 exactly-0.108
ook-0.106
ossom-0.103
ooks-0.103
 impressed-0.100
Token?
Token ID30
Feature activation+1.65e-03

Pos logit contributions
 care+0.018
Mine+0.018
 him+0.016
hered+0.016
Stop+0.016
Neg logit contributions
 exactly-0.028
ooks-0.027
ook-0.027
ents-0.026
ossom-0.026
Token I
Token ID314
Feature activation-3.46e-02

Pos logit contributions
 exactly+0.004
ook+0.003
ooks+0.003
bles+0.003
ents+0.003
Neg logit contributions
Mine-0.002
 care-0.002
Stop-0.002
hered-0.002
 breaths-0.002
Token'm
Token ID1101
Feature activation-4.34e-02

Pos logit contributions
Mine+0.063
 breaths+0.054
Stop+0.053
hered+0.051
 care+0.050
Neg logit contributions
 exactly-0.061
ook-0.057
aze-0.052
ossom-0.051
ast-0.050
Token coming
Token ID2406
Feature activation+1.14e-01

Pos logit contributions
 care+0.066
Mine+0.063
 breaths+0.060
 him+0.059
 need+0.058
Neg logit contributions
 exactly-0.076
ook-0.075
ooks-0.073
ossom-0.073
ups-0.070
Token to
Token ID284
Feature activation-1.65e-02

Pos logit contributions
 exactly+0.217
ook+0.203
ooks+0.195
bles+0.188
ents+0.186
Neg logit contributions
Mine-0.187
 care-0.169
Stop-0.160
hered-0.155
 breaths-0.152
Token find
Token ID1064
Feature activation+3.84e-02

Pos logit contributions
Mine+0.032
hered+0.030
 breaths+0.030
 imag+0.029
hed+0.029
Neg logit contributions
 exactly-0.023
ook-0.021
aches-0.020
anut-0.020
aze-0.020
Token you
Token ID345
Feature activation-1.47e-02

Pos logit contributions
 exactly+0.066
ook+0.054
ents+0.053
ooks+0.051
iring+0.049
Neg logit contributions
Mine-0.074
|-0.065
hered-0.065
storm-0.064
Stop-0.063
Token!"
Token ID2474
Feature activation-5.51e-02

Pos logit contributions
Mine+0.018
 care+0.018
 breaths+0.015
|+0.015
Stop+0.015
Neg logit contributions
 exactly-0.031
ook-0.030
ooks-0.030
 impressed-0.028
iring-0.028
Token She
Token ID1375
Feature activation+2.27e-02

Pos logit contributions
Mine+0.100
 care+0.091
hered+0.086
Stop+0.086
|+0.085
Neg logit contributions
 exactly-0.094
ook-0.085
ooks-0.083
ents-0.080
 impressed-0.078
Token he
Token ID339
Feature activation-9.91e-03

Pos logit contributions
 exactly+0.011
ook+0.010
ooks+0.010
bles+0.010
ossom+0.010
Neg logit contributions
Mine-0.009
 care-0.009
Stop-0.008
hered-0.008
 breaths-0.007
Token was
Token ID373
Feature activation-3.19e-02

Pos logit contributions
Mine+0.016
 care+0.015
Stop+0.014
hered+0.014
 breaths+0.013
Neg logit contributions
 exactly-0.019
ook-0.017
ooks-0.017
bles-0.016
 impressed-0.016
Token glad
Token ID9675
Feature activation+3.79e-02

Pos logit contributions
 care+0.054
Mine+0.051
hered+0.046
|+0.045
 him+0.045
Neg logit contributions
ooks-0.052
ook-0.052
 exactly-0.052
iring-0.050
ossom-0.049
Token he
Token ID339
Feature activation-1.08e-02

Pos logit contributions
 exactly+0.068
ook+0.063
ooks+0.061
bles+0.058
ents+0.058
Neg logit contributions
Mine-0.065
 care-0.061
Stop-0.057
hered-0.055
 breaths-0.054
Token had
Token ID550
Feature activation-1.68e-03

Pos logit contributions
Mine+0.018
 care+0.016
Stop+0.015
 breaths+0.015
hered+0.015
Neg logit contributions
 exactly-0.020
ook-0.018
ooks-0.017
bles-0.017
 impressed-0.017
Token come
Token ID1282
Feature activation+1.13e-01

Pos logit contributions
Mine+0.003
 care+0.002
Stop+0.002
hered+0.002
 breaths+0.002
Neg logit contributions
 exactly-0.003
ook-0.003
ooks-0.003
bles-0.003
ents-0.003
Token out
Token ID503
Feature activation+3.53e-02

Pos logit contributions
 exactly+0.237
ooks+0.231
ae+0.230
ook+0.223
anut+0.223
Neg logit contributions
 care-0.146
 him-0.135
hered-0.127
Mine-0.117
 bath-0.115
Token to
Token ID284
Feature activation-1.87e-02

Pos logit contributions
 exactly+0.062
ook+0.059
ooks+0.057
bles+0.055
ossom+0.054
Neg logit contributions
Mine-0.059
 care-0.057
Stop-0.052
hered-0.051
 breaths-0.051
Token enjoy
Token ID2883
Feature activation+3.06e-02

Pos logit contributions
Mine+0.026
Stop+0.021
 care+0.021
 breaths+0.020
hered+0.020
Neg logit contributions
 exactly-0.040
ook-0.037
ooks-0.036
 impressed-0.035
aze-0.035
Token it
Token ID340
Feature activation-9.60e-03

Pos logit contributions
 exactly+0.060
ook+0.056
ooks+0.055
bles+0.052
ossom+0.052
Neg logit contributions
Mine-0.047
 care-0.044
Stop-0.040
hered-0.039
 breaths-0.038
Token.
Token ID13
Feature activation-6.87e-03

Pos logit contributions
Mine+0.014
 care+0.014
hered+0.013
Stop+0.012
 breaths+0.012
Neg logit contributions
 exactly-0.018
ook-0.018
ooks-0.017
bles-0.016
ossom-0.016
Token
Token ID198
Feature activation-7.18e-04

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
 breaths+0.001
hered+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
bles-0.001
 impressed-0.001
TokenThe
Token ID464
Feature activation+2.22e-02

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
hered+0.001
|+0.001
Neg logit contributions
 exactly-0.001
ook-0.001
ooks-0.001
bles-0.001
ossom-0.001
Token fish
Token ID5916
Feature activation-3.48e-02

Pos logit contributions
 exactly+0.037
bles+0.033
ook+0.033
anut+0.033
ooks+0.032
Neg logit contributions
Mine-0.041
 care-0.037
Stop-0.036
|-0.035
hered-0.035
Token said
Token ID531
Feature activation+2.42e-03

Pos logit contributions
Mine+0.049
 care+0.047
hered+0.042
Stop+0.041
 breaths+0.040
Neg logit contributions
 exactly-0.071
ook-0.067
ooks-0.066
bles-0.062
ossom-0.062
Token,
Token ID11
Feature activation-5.45e-03

Pos logit contributions
 exactly+0.005
ook+0.005
ooks+0.004
ents+0.004
anut+0.004
Neg logit contributions
Mine-0.004
 care-0.003
Stop-0.003
hered-0.003
 breaths-0.003
Token "
Token ID366
Feature activation+3.79e-02

Pos logit contributions
Mine+0.010
 care+0.010
Stop+0.009
 him+0.009
hered+0.009
Neg logit contributions
 exactly-0.008
ook-0.008
ooks-0.008
anut-0.008
ossom-0.008
TokenLet
Token ID5756
Feature activation-3.88e-02

Pos logit contributions
 exactly+0.074
ook+0.070
ooks+0.067
iring+0.066
ich+0.065
Neg logit contributions
Mine-0.052
 breaths-0.051
 care-0.051
 imag-0.047
hered-0.043
Token's
Token ID338
Feature activation+6.21e-02

Pos logit contributions
Mine+0.093
 breaths+0.085
 care+0.085
|+0.082
 imag+0.081
Neg logit contributions
 exactly-0.044
ook-0.035
ups-0.032
ooks-0.030
anut-0.030
Token find
Token ID1064
Feature activation+3.45e-02

Pos logit contributions
 exactly+0.129
ook+0.119
ooks+0.115
ossom+0.111
anut+0.110
Neg logit contributions
Mine-0.090
 care-0.079
Stop-0.078
hered-0.075
 breaths-0.074
Token some
Token ID617
Feature activation-5.41e-02

Pos logit contributions
 exactly+0.062
ook+0.055
ooks+0.053
ents+0.051
anut+0.050
Neg logit contributions
Mine-0.063
 care-0.055
Stop-0.054
hered-0.053
|-0.052
Token food
Token ID2057
Feature activation+2.74e-02

Pos logit contributions
Mine+0.106
 care+0.099
Stop+0.094
hered+0.092
|+0.090
Neg logit contributions
 exactly-0.086
ook-0.078
ooks-0.075
bles-0.070
ents-0.070
Token believe
Token ID1975
Feature activation+1.97e-02

Pos logit contributions
Mine+0.031
 breaths+0.028
Stop+0.028
hered+0.027
|+0.026
Neg logit contributions
 exactly-0.037
aches-0.033
ook-0.033
anut-0.032
aze-0.032
Token.
Token ID13
Feature activation-3.85e-03

Pos logit contributions
 exactly+0.037
ook+0.034
ooks+0.032
bles+0.032
 impressed+0.032
Neg logit contributions
Mine-0.033
 care-0.030
hered-0.028
Stop-0.028
|-0.028
Token The
Token ID383
Feature activation-1.14e-02

Pos logit contributions
Mine+0.005
 care+0.005
Stop+0.005
 breaths+0.004
hered+0.004
Neg logit contributions
 exactly-0.008
ook-0.008
ooks-0.007
ents-0.007
ossom-0.007
Token lions
Token ID31420
Feature activation+6.16e-02

Pos logit contributions
Mine+0.021
 care+0.020
Stop+0.019
hered+0.018
 breaths+0.018
Neg logit contributions
 exactly-0.019
ook-0.018
ooks-0.017
bles-0.016
ents-0.016
Token are
Token ID389
Feature activation-4.75e-02

Pos logit contributions
 exactly+0.120
ook+0.105
aches+0.105
bles+0.105
 impressed+0.104
Neg logit contributions
Mine-0.097
 care-0.082
Stop-0.081
 breaths-0.078
hered-0.078
Token awesome
Token ID7427
Feature activation+3.78e-02

Pos logit contributions
 care+0.083
Mine+0.082
Stop+0.073
 breaths+0.071
 him+0.071
Neg logit contributions
ook-0.076
 exactly-0.076
ooks-0.073
ossom-0.071
ae-0.067
Token.
Token ID13
Feature activation-7.44e-03

Pos logit contributions
 exactly+0.067
ook+0.060
ooks+0.058
bles+0.057
ents+0.056
Neg logit contributions
Mine-0.067
 care-0.061
Stop-0.058
|-0.057
hered-0.057
Token Thank
Token ID6952
Feature activation-5.13e-02

Pos logit contributions
Mine+0.011
 care+0.010
 breaths+0.010
Stop+0.009
hered+0.009
Neg logit contributions
 exactly-0.014
ook-0.014
ooks-0.014
ents-0.013
aze-0.013
Token you
Token ID345
Feature activation-1.78e-02

Pos logit contributions
Mine+0.065
 care+0.057
Stop+0.053
hered+0.051
 breaths+0.050
Neg logit contributions
 exactly-0.115
ook-0.108
ooks-0.105
bles-0.102
anut-0.101
Token for
Token ID329
Feature activation-2.37e-02

Pos logit contributions
 care+0.025
Mine+0.023
hered+0.020
 breaths+0.020
 him+0.020
Neg logit contributions
 exactly-0.035
ook-0.034
ooks-0.033
aches-0.032
ossom-0.032
Token taking
Token ID2263
Feature activation+2.86e-02

Pos logit contributions
Mine+0.045
 care+0.041
Stop+0.039
hered+0.039
|+0.038
Neg logit contributions
 exactly-0.039
ook-0.035
ooks-0.034
 impressed-0.033
anut-0.032
Token!"
Token ID2474
Feature activation-5.00e-02

Pos logit contributions
Mine+0.057
 care+0.052
Stop+0.049
hered+0.048
|+0.047
Neg logit contributions
 exactly-0.071
ook-0.066
ooks-0.063
bles-0.061
 impressed-0.061
Token So
Token ID1406
Feature activation-4.78e-03

Pos logit contributions
Mine+0.080
 care+0.075
Stop+0.070
hered+0.067
 breaths+0.066
Neg logit contributions
 exactly-0.096
ook-0.089
ooks-0.086
bles-0.082
ents-0.082
Token Bob
Token ID5811
Feature activation+2.22e-02

Pos logit contributions
 care+0.006
Mine+0.006
Stop+0.005
 breaths+0.005
hered+0.005
Neg logit contributions
 exactly-0.010
ook-0.010
ooks-0.010
bles-0.009
ents-0.009
Token and
Token ID290
Feature activation+8.95e-03

Pos logit contributions
 exactly+0.046
ooks+0.044
ook+0.042
iring+0.041
anut+0.040
Neg logit contributions
Mine-0.031
 care-0.031
hered-0.028
Stop-0.026
 breaths-0.026
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
 exactly+0.019
ook+0.018
ooks+0.018
ossom+0.017
bles+0.017
Neg logit contributions
Mine-0.013
 care-0.012
Stop-0.011
 breaths-0.010
hered-0.010
Token balloon
Token ID21190
Feature activation+1.31e-02

Pos logit contributions
 exactly+0.031
 impressed+0.027
ook+0.027
ich+0.027
bles+0.026
Neg logit contributions
Mine-0.032
 care-0.031
|-0.030
Stop-0.029
 breaths-0.029
Token flew
Token ID13112
Feature activation+1.20e-02

Pos logit contributions
 exactly+0.026
ook+0.025
ooks+0.024
iring+0.023
anut+0.023
Neg logit contributions
Mine-0.019
 care-0.018
Stop-0.017
hered-0.017
 breaths-0.016
Token up
Token ID510
Feature activation+5.34e-02

Pos logit contributions
 exactly+0.022
ook+0.021
ooks+0.020
bles+0.019
ossom+0.019
Neg logit contributions
Mine-0.020
 care-0.019
Stop-0.017
hered-0.017
 breaths-0.017
Token in
Token ID287
Feature activation-1.23e-02

Pos logit contributions
 exactly+0.098
ook+0.091
ooks+0.090
anut+0.086
bles+0.085
Neg logit contributions
Mine-0.087
 care-0.084
 breaths-0.077
hered-0.076
Stop-0.076
Token the
Token ID262
Feature activation+1.87e-02

Pos logit contributions
Mine+0.022
 care+0.021
Stop+0.021
|+0.021
hered+0.021
Neg logit contributions
 exactly-0.019
ook-0.017
 impressed-0.016
ooks-0.016
ents-0.015
Token sky
Token ID6766
Feature activation-2.59e-02

Pos logit contributions
 exactly+0.036
ook+0.033
ents+0.032
 impressed+0.032
ooks+0.032
Neg logit contributions
Mine-0.027
 care-0.025
hered-0.023
Stop-0.022
 breaths-0.022
Token lush
Token ID37408
Feature activation+4.39e-02

Pos logit contributions
Mine+0.002
 care+0.002
Stop+0.002
hered+0.002
 shock+0.002
Neg logit contributions
 exactly-0.002
ook-0.002
ooks-0.002
bles-0.002
ents-0.002
Token green
Token ID4077
Feature activation-2.93e-02

Pos logit contributions
 exactly+0.079
ook+0.074
ooks+0.071
bles+0.069
ents+0.068
Neg logit contributions
Mine-0.074
 care-0.070
Stop-0.064
 breaths-0.062
hered-0.062
Token patch
Token ID8529
Feature activation+7.82e-02

Pos logit contributions
Mine+0.052
 care+0.049
Stop+0.046
hered+0.045
|+0.044
Neg logit contributions
 exactly-0.051
ook-0.047
ooks-0.045
bles-0.043
ossom-0.043
Token of
Token ID286
Feature activation-6.16e-03

Pos logit contributions
 exactly+0.142
ook+0.132
ooks+0.130
ossom+0.123
anut+0.123
Neg logit contributions
Mine-0.126
 care-0.120
hered-0.112
Stop-0.111
|-0.105
Token grass
Token ID8701
Feature activation-3.03e-02

Pos logit contributions
Mine+0.012
 care+0.011
hered+0.010
Stop+0.010
|+0.010
Neg logit contributions
 exactly-0.010
ook-0.009
ooks-0.009
iring-0.008
 impressed-0.008
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
Mine+0.050
 care+0.046
Stop+0.043
hered+0.043
 breaths+0.042
Neg logit contributions
 exactly-0.054
ook-0.051
ooks-0.050
ossom-0.047
 impressed-0.047
Token plants
Token ID6134
Feature activation+5.38e-02

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.017
bles+0.016
ossom+0.016
Neg logit contributions
Mine-0.018
 care-0.017
Stop-0.016
 breaths-0.015
hered-0.015
Token in
Token ID287
Feature activation-1.29e-02

Pos logit contributions
 exactly+0.093
ook+0.084
ooks+0.080
ents+0.078
 impressed+0.078
Neg logit contributions
Mine-0.098
 care-0.091
Stop-0.087
|-0.083
hered-0.082
Token the
Token ID262
Feature activation+1.85e-02

Pos logit contributions
Mine+0.022
 care+0.021
Stop+0.020
hered+0.020
|+0.019
Neg logit contributions
 exactly-0.023
ook-0.021
ooks-0.020
 impressed-0.019
ents-0.019
Token middle
Token ID3504
Feature activation-1.67e-02

Pos logit contributions
 exactly+0.036
ook+0.034
ooks+0.033
bles+0.032
ents+0.032
Neg logit contributions
 care-0.028
Mine-0.028
Stop-0.025
 breaths-0.023
|-0.023
Token of
Token ID286
Feature activation-6.17e-03

Pos logit contributions
Mine+0.024
 care+0.024
Stop+0.020
 him+0.020
hered+0.020
Neg logit contributions
 exactly-0.033
ook-0.030
ooks-0.030
bles-0.030
anut-0.030
TokenThey
Token ID2990
Feature activation-3.17e-02

Pos logit contributions
 exactly+0.014
ook+0.012
ooks+0.012
ossom+0.012
bles+0.011
Neg logit contributions
Mine-0.013
 care-0.012
Stop-0.011
hered-0.010
Hey-0.010
Token look
Token ID804
Feature activation+8.90e-03

Pos logit contributions
Mine+0.042
 care+0.038
Stop+0.036
hered+0.035
 breaths+0.034
Neg logit contributions
 exactly-0.068
ook-0.062
ooks-0.060
ents-0.060
anut-0.059
Token at
Token ID379
Feature activation+5.89e-02

Pos logit contributions
 exactly+0.013
ook+0.013
ooks+0.013
ich+0.012
ae+0.012
Neg logit contributions
Mine-0.017
 care-0.017
 breaths-0.015
Stop-0.015
 imag-0.015
Token each
Token ID1123
Feature activation-8.22e-02

Pos logit contributions
 exactly+0.095
ook+0.075
ber+0.073
iring+0.073
ae+0.073
Neg logit contributions
hered-0.108
Mine-0.101
storm-0.098
|-0.098
Stop-0.096
Token other
Token ID584
Feature activation+1.10e-02

Pos logit contributions
Mine+0.136
 care+0.129
Stop+0.116
hered+0.110
|+0.106
Neg logit contributions
 exactly-0.157
ook-0.145
ooks-0.144
ossom-0.141
bles-0.136
Token.
Token ID13
Feature activation-4.54e-03

Pos logit contributions
 exactly+0.022
ook+0.020
ooks+0.020
ossom+0.019
ich+0.019
Neg logit contributions
Mine-0.017
 care-0.016
 breaths-0.014
hered-0.014
Stop-0.014
Token They
Token ID1119
Feature activation-3.99e-02

Pos logit contributions
Mine+0.006
 care+0.005
Stop+0.005
hered+0.005
 breaths+0.005
Neg logit contributions
 exactly-0.010
ook-0.009
ooks-0.009
ossom-0.009
 impressed-0.009
Token look
Token ID804
Feature activation+8.13e-03

Pos logit contributions
Mine+0.056
 care+0.050
hered+0.048
Stop+0.048
|+0.047
Neg logit contributions
 exactly-0.083
ook-0.074
ooks-0.072
 impressed-0.072
ents-0.072
Token at
Token ID379
Feature activation+5.81e-02

Pos logit contributions
 exactly+0.011
ook+0.011
ooks+0.011
ich+0.010
ae+0.010
Neg logit contributions
Mine-0.016
 care-0.016
Stop-0.015
 breaths-0.015
hered-0.014
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
 exactly+0.097
iring+0.074
ook+0.072
ber+0.070
 impressed+0.070
Neg logit contributions
hered-0.111
|-0.103
storm-0.099
Mine-0.098
use-0.096
Token empty
Token ID6565
Feature activation-1.84e-02

Pos logit contributions
 exactly+0.019
ook+0.018
ooks+0.017
bles+0.016
ents+0.016
Neg logit contributions
Mine-0.021
 care-0.020
Stop-0.018
hered-0.018
 breaths-0.018
Tokenily
Token ID813
Feature activation-1.07e-02

Pos logit contributions
Mine+0.052
 care+0.049
Stop+0.046
hered+0.044
 breaths+0.044
Neg logit contributions
 exactly-0.049
ook-0.045
ooks-0.044
bles-0.042
ents-0.041
Token felt
Token ID2936
Feature activation-1.06e-02

Pos logit contributions
Mine+0.019
 care+0.018
Stop+0.017
hered+0.016
|+0.016
Neg logit contributions
 exactly-0.019
ook-0.017
ooks-0.016
ents-0.016
bles-0.016
Token sad
Token ID6507
Feature activation-1.23e-02

Pos logit contributions
Mine+0.025
|+0.023
storm+0.022
Stop+0.022
hered+0.022
Neg logit contributions
 exactly-0.014
 impressed-0.011
ook-0.010
aze-0.010
 taller-0.009
Token and
Token ID290
Feature activation+6.11e-03

Pos logit contributions
Mine+0.029
|+0.026
Stop+0.025
 care+0.024
hered+0.024
Neg logit contributions
 exactly-0.018
ook-0.014
iring-0.014
ents-0.013
 impressed-0.013
Token worried
Token ID7960
Feature activation+1.86e-03

Pos logit contributions
 exactly+0.012
ook+0.010
 impressed+0.010
ents+0.010
ooks+0.010
Neg logit contributions
Mine-0.010
 care-0.009
hered-0.008
Stop-0.008
|-0.008
Token that
Token ID326
Feature activation-2.85e-02

Pos logit contributions
 exactly+0.003
ook+0.002
 impressed+0.002
ents+0.002
ooks+0.002
Neg logit contributions
Mine-0.004
|-0.003
 care-0.003
Stop-0.003
 imag-0.003
Token her
Token ID607
Feature activation+1.39e-02

Pos logit contributions
Mine+0.044
 care+0.043
Stop+0.039
hered+0.036
 him+0.035
Neg logit contributions
 exactly-0.055
ook-0.053
ooks-0.052
bles-0.051
ossom-0.050
Token mom
Token ID1995
Feature activation+3.68e-02

Pos logit contributions
 exactly+0.022
ook+0.021
ooks+0.020
ents+0.019
bles+0.019
Neg logit contributions
Mine-0.026
 care-0.026
 breaths-0.023
 bath-0.023
 shock-0.022
Token would
Token ID561
Feature activation+2.83e-02

Pos logit contributions
 exactly+0.073
ook+0.068
ooks+0.067
bles+0.064
ossom+0.063
Neg logit contributions
Mine-0.056
 care-0.052
Stop-0.048
hered-0.046
 breaths-0.045
Token be
Token ID307
Feature activation-3.69e-02

Pos logit contributions
 exactly+0.044
ook+0.037
aze+0.036
ooks+0.035
ast+0.034
Neg logit contributions
Mine-0.057
 breaths-0.053
Stop-0.051
 imag-0.050
ened-0.050
Token upset
Token ID9247
Feature activation-2.39e-02

Pos logit contributions
 care+0.059
Mine+0.056
Stop+0.052
 breaths+0.051
hered+0.051
Neg logit contributions
ook-0.064
ossom-0.063
ooks-0.062
ae-0.062
 exactly-0.062
Token upon
Token ID2402
Feature activation-3.97e-03

Pos logit contributions
 exactly+0.069
ook+0.062
ossom+0.061
ooks+0.061
anut+0.060
Neg logit contributions
Mine-0.106
 care-0.106
 him-0.098
|-0.098
Stop-0.096
Token a
Token ID257
Feature activation+3.76e-03

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
hered+0.005
|+0.005
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
bles-0.007
ents-0.007
Token time
Token ID640
Feature activation+1.43e-02

Pos logit contributions
 exactly+0.008
ook+0.007
ooks+0.007
ae+0.007
ossom+0.007
Neg logit contributions
Mine-0.005
 care-0.005
Stop-0.004
 breaths-0.004
hered-0.004
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 exactly+0.027
ook+0.025
ooks+0.024
bles+0.023
ents+0.023
Neg logit contributions
Mine-0.023
 care-0.022
Stop-0.020
hered-0.019
 breaths-0.019
Token there
Token ID612
Feature activation-5.02e-03

Pos logit contributions
 exactly+0.016
ook+0.014
ooks+0.014
bles+0.013
 impressed+0.013
Neg logit contributions
Mine-0.024
 care-0.023
Stop-0.022
hered-0.021
 breaths-0.021
Token was
Token ID373
Feature activation-2.20e-02

Pos logit contributions
 breaths+0.009
Mine+0.009
 care+0.009
Stop+0.008
hered+0.008
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
iring-0.006
 impressed-0.006
Token a
Token ID257
Feature activation+4.93e-03

Pos logit contributions
Mine+0.038
 care+0.036
Stop+0.034
 breaths+0.033
hered+0.033
Neg logit contributions
 exactly-0.039
ook-0.036
ooks-0.035
 impressed-0.032
bles-0.032
Token helpful
Token ID7613
Feature activation-1.16e-02

Pos logit contributions
 exactly+0.011
ups+0.011
ooks+0.010
ents+0.010
leys+0.010
Neg logit contributions
 bath-0.006
Mine-0.006
 care-0.006
 bang-0.005
 shock-0.005
Token lawyer
Token ID6853
Feature activation+9.92e-02

Pos logit contributions
Mine+0.015
 care+0.015
 bath+0.013
hered+0.013
Stop+0.013
Neg logit contributions
 exactly-0.025
ook-0.023
ooks-0.023
ups-0.022
ents-0.022
Token named
Token ID3706
Feature activation+5.80e-02

Pos logit contributions
 exactly+0.190
ooks+0.175
ook+0.171
ossom+0.170
iring+0.170
Neg logit contributions
 care-0.147
Mine-0.144
hered-0.131
|-0.128
Stop-0.126
Token Tim
Token ID5045
Feature activation-1.55e-02

Pos logit contributions
 exactly+0.137
ook+0.132
ooks+0.132
ossom+0.131
bles+0.131
Neg logit contributions
Mine-0.063
 care-0.062
Stop-0.053
 him-0.052
 Bobby-0.049
Token,
Token ID11
Feature activation+8.57e-03

Pos logit contributions
 exactly+0.023
ook+0.020
ooks+0.020
bles+0.019
 impressed+0.018
Neg logit contributions
Mine-0.025
 care-0.023
Stop-0.022
 breaths-0.022
hered-0.021
Token there
Token ID612
Feature activation-4.53e-03

Pos logit contributions
 exactly+0.010
ook+0.009
ooks+0.008
bles+0.008
 impressed+0.007
Neg logit contributions
Mine-0.020
 care-0.020
Stop-0.018
 breaths-0.018
hered-0.018
Token was
Token ID373
Feature activation-2.36e-02

Pos logit contributions
 breaths+0.009
Mine+0.008
 care+0.008
hered+0.008
Stop+0.008
Neg logit contributions
 exactly-0.007
ook-0.006
ooks-0.006
 impressed-0.005
iring-0.005
Token a
Token ID257
Feature activation+3.77e-03

Pos logit contributions
Mine+0.041
 care+0.039
Stop+0.037
 breaths+0.036
hered+0.036
Neg logit contributions
 exactly-0.041
ook-0.037
ooks-0.036
 impressed-0.034
bles-0.034
Token little
Token ID1310
Feature activation-2.29e-02

Pos logit contributions
 exactly+0.009
ups+0.008
ooks+0.008
ook+0.008
ents+0.008
Neg logit contributions
 bath-0.004
Mine-0.004
 care-0.004
 bang-0.004
 shock-0.004
Token boy
Token ID2933
Feature activation-5.72e-02

Pos logit contributions
Mine+0.038
 care+0.035
Stop+0.033
hered+0.032
|+0.032
Neg logit contributions
 exactly-0.041
ook-0.039
ooks-0.037
bles-0.036
ich-0.036
Token named
Token ID3706
Feature activation+5.38e-02

Pos logit contributions
Mine+0.102
 care+0.098
Stop+0.090
hered+0.088
|+0.087
Neg logit contributions
 exactly-0.098
ook-0.089
ooks-0.086
bles-0.083
ossom-0.082
Token Tim
Token ID5045
Feature activation-2.27e-02

Pos logit contributions
 exactly+0.132
ook+0.125
ooks+0.122
bles+0.118
ossom+0.117
Neg logit contributions
Mine-0.058
 care-0.052
Stop-0.046
hered-0.044
 breaths-0.043
Tokenmy
Token ID1820
Feature activation+1.40e-02

Pos logit contributions
Mine+0.041
 care+0.039
Stop+0.036
hered+0.035
 breaths+0.035
Neg logit contributions
 exactly-0.039
ook-0.036
ooks-0.035
bles-0.033
ents-0.033
Token.
Token ID13
Feature activation+4.90e-03

Pos logit contributions
 exactly+0.025
ook+0.023
ooks+0.022
bles+0.021
ents+0.021
Neg logit contributions
Mine-0.024
 care-0.023
Stop-0.022
hered-0.021
 breaths-0.021
Token Tim
Token ID5045
Feature activation-2.47e-02

Pos logit contributions
 exactly+0.011
ook+0.011
ooks+0.010
ents+0.010
bles+0.010
Neg logit contributions
Mine-0.006
 care-0.006
Stop-0.005
hered-0.005
 breaths-0.005
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
Mine+0.030
Stop+0.028
hered+0.027
|+0.027
Hey+0.027
Neg logit contributions
 exactly-0.029
ook-0.025
 impressed-0.024
ooks-0.023
aze-0.023
Token house
Token ID2156
Feature activation+2.24e-02

Pos logit contributions
 exactly+0.017
ook+0.015
ich+0.014
ooks+0.014
bles+0.014
Neg logit contributions
Mine-0.024
 care-0.023
hered-0.022
 breaths-0.022
Stop-0.022
Token.
Token ID13
Feature activation-5.69e-03

Pos logit contributions
 exactly+0.043
ook+0.040
ooks+0.039
bles+0.037
ossom+0.037
Neg logit contributions
Mine-0.035
 care-0.033
Stop-0.030
hered-0.029
 breaths-0.029
Token They
Token ID1119
Feature activation-3.90e-02

Pos logit contributions
Mine+0.009
 care+0.009
 breaths+0.009
hered+0.008
Stop+0.008
Neg logit contributions
 exactly-0.010
ook-0.010
ooks-0.010
aze-0.009
ossom-0.009
Token were
Token ID547
Feature activation-3.83e-02

Pos logit contributions
Mine+0.062
 care+0.058
Stop+0.053
hered+0.051
 breaths+0.051
Neg logit contributions
 exactly-0.076
ook-0.070
ooks-0.068
bles-0.065
ents-0.065
Token fast
Token ID3049
Feature activation-7.69e-02

Pos logit contributions
 care+0.067
Mine+0.063
 breaths+0.060
 him+0.059
 pain+0.058
Neg logit contributions
ook-0.059
ossom-0.058
 exactly-0.057
ooks-0.056
ae-0.055
Token.
Token ID13
Feature activation-4.98e-03

Pos logit contributions
Mine+0.151
 care+0.139
Stop+0.133
|+0.129
hered+0.127
Neg logit contributions
 exactly-0.121
ook-0.110
ooks-0.107
ents-0.104
bles-0.102
Token
Token ID198
Feature activation+7.26e-04

Pos logit contributions
Mine+0.008
 care+0.008
 breaths+0.008
hered+0.007
Stop+0.007
Neg logit contributions
 exactly-0.009
ook-0.008
ooks-0.008
ossom-0.007
aze-0.007
Token
Token ID198
Feature activation+4.94e-04

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
bles+0.001
ents+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
 breaths-0.001
TokenThey
Token ID2990
Feature activation-3.54e-02

Pos logit contributions
 exactly+0.001
ook+0.001
ossom+0.001
ooks+0.001
iring+0.001
Neg logit contributions
Mine-0.001
Stop-0.001
 care-0.001
hered-0.001
Hey-0.001
Token made
Token ID925
Feature activation-2.82e-02

Pos logit contributions
Mine+0.054
 care+0.050
Stop+0.046
hered+0.044
 breaths+0.044
Neg logit contributions
 exactly-0.071
ook-0.066
ooks-0.064
bles-0.061
ents-0.061
Token it
Token ID340
Feature activation-9.44e-03

Pos logit contributions
Mine+0.001
 care+0.001
Stop+0.001
 breaths+0.001
 him+0.001
Neg logit contributions
 exactly-0.002
ook-0.002
ooks-0.002
anut-0.002
ossom-0.002
Token heard
Token ID2982
Feature activation+6.70e-02

Pos logit contributions
Mine+0.012
 care+0.012
Stop+0.011
 breaths+0.010
hered+0.010
Neg logit contributions
 exactly-0.020
ook-0.019
ooks-0.019
iring-0.018
ae-0.018
Token the
Token ID262
Feature activation+1.75e-02

Pos logit contributions
 exactly+0.136
ook+0.131
ooks+0.128
bles+0.125
anut+0.124
Neg logit contributions
Mine-0.094
 care-0.091
Stop-0.083
 breaths-0.077
 him-0.076
Token sound
Token ID2128
Feature activation-2.27e-02

Pos logit contributions
 exactly+0.033
ook+0.031
ooks+0.030
ents+0.028
ossom+0.028
Neg logit contributions
Mine-0.028
 care-0.027
Stop-0.025
 breaths-0.024
hered-0.024
Token of
Token ID286
Feature activation-7.44e-03

Pos logit contributions
 care+0.031
Mine+0.030
Stop+0.027
hered+0.027
 him+0.027
Neg logit contributions
 exactly-0.046
ooks-0.045
ook-0.044
anut-0.043
bles-0.042
Token music
Token ID2647
Feature activation-3.96e-02

Pos logit contributions
Mine+0.013
 care+0.012
|+0.012
hered+0.012
Stop+0.012
Neg logit contributions
 exactly-0.013
ook-0.011
ooks-0.011
 impressed-0.011
ents-0.010
Token coming
Token ID2406
Feature activation+1.16e-01

Pos logit contributions
Mine+0.063
 care+0.059
Stop+0.055
hered+0.054
 breaths+0.052
Neg logit contributions
 exactly-0.075
ook-0.070
ooks-0.069
anut-0.065
bles-0.065
Token from
Token ID422
Feature activation-2.73e-03

Pos logit contributions
 exactly+0.217
ook+0.202
ooks+0.198
bles+0.188
ae+0.187
Neg logit contributions
Mine-0.186
 care-0.178
Stop-0.164
hered-0.160
 breaths-0.156
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
Mine+0.005
 care+0.005
hered+0.005
Stop+0.005
|+0.005
Neg logit contributions
 exactly-0.004
ook-0.004
ooks-0.003
ich-0.003
 taller-0.003
Token lake
Token ID13546
Feature activation-2.18e-02

Pos logit contributions
 exactly+0.032
ook+0.029
ooks+0.028
ich+0.028
bles+0.028
Neg logit contributions
Mine-0.031
 care-0.029
Stop-0.027
hered-0.027
|-0.026
Token.
Token ID13
Feature activation+2.17e-03

Pos logit contributions
 care+0.024
Mine+0.024
 him+0.022
hered+0.021
 breaths+0.020
Neg logit contributions
 exactly-0.047
ooks-0.046
ook-0.045
ae-0.044
bles-0.044
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 exactly+0.029
anut+0.026
ook+0.026
ooks+0.026
ossom+0.026
Neg logit contributions
Mine-0.018
 care-0.016
Stop-0.015
|-0.014
 him-0.014
Token"
Token ID1
Feature activation+1.47e-04

Pos logit contributions
 exactly+0.019
ooks+0.016
anut+0.016
bles+0.016
ook+0.016
Neg logit contributions
Mine-0.024
 care-0.022
Stop-0.021
hered-0.019
Hey-0.019
TokenIt
Token ID1026
Feature activation+2.07e-02

Pos logit contributions
 exactly+0.000
ook+0.000
ooks+0.000
ents+0.000
bles+0.000
Neg logit contributions
 care-0.000
Mine-0.000
 breaths-0.000
hered-0.000
 imag-0.000
Token's
Token ID338
Feature activation+7.00e-02

Pos logit contributions
 exactly+0.029
ook+0.026
ooks+0.024
bles+0.023
ents+0.023
Neg logit contributions
Mine-0.044
 care-0.041
Stop-0.040
hered-0.039
|-0.038
Token shampoo
Token ID47259
Feature activation-4.51e-02

Pos logit contributions
 exactly+0.125
ook+0.114
ooks+0.111
bles+0.105
ents+0.104
Neg logit contributions
Mine-0.123
 care-0.115
Stop-0.108
hered-0.105
 breaths-0.103
Token,"
Token ID553
Feature activation-1.39e-01

Pos logit contributions
Mine+0.080
Stop+0.071
|+0.070
 care+0.067
hered+0.065
Neg logit contributions
 exactly-0.083
bles-0.075
ook-0.072
aches-0.071
ents-0.070
Token replied
Token ID8712
Feature activation+1.90e-02

Pos logit contributions
Mine+0.238
 care+0.215
hered+0.211
Stop+0.210
|+0.210
Neg logit contributions
 exactly-0.257
ook-0.225
 impressed-0.218
ooks-0.213
ents-0.211
Token mom
Token ID1995
Feature activation+2.86e-02

Pos logit contributions
 exactly+0.035
ook+0.032
ooks+0.031
 impressed+0.029
bles+0.029
Neg logit contributions
Mine-0.032
 care-0.029
Stop-0.028
hered-0.027
|-0.027
Tokenmy
Token ID1820
Feature activation+5.28e-03

Pos logit contributions
 exactly+0.057
ooks+0.055
ook+0.054
ossom+0.054
ich+0.052
Neg logit contributions
 care-0.038
Mine-0.037
 him-0.033
hered-0.032
 breaths-0.031
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
 exactly+0.011
ooks+0.011
ook+0.010
ossom+0.010
iring+0.010
Neg logit contributions
 care-0.007
Mine-0.007
hered-0.006
 breaths-0.006
 him-0.006
Token
Token ID220
Feature activation+3.18e-02

Pos logit contributions
 exactly+0.011
ook+0.010
ooks+0.009
ents+0.009
bles+0.009
Neg logit contributions
Mine-0.015
 care-0.014
Stop-0.013
hered-0.013
 breaths-0.013
Token scared
Token ID12008
Feature activation-3.47e-02

Pos logit contributions
 exactly+0.019
ook+0.017
ooks+0.017
bles+0.016
 impressed+0.016
Neg logit contributions
Mine-0.018
 care-0.016
Stop-0.015
hered-0.015
 breaths-0.015
Token that
Token ID326
Feature activation-2.90e-02

Pos logit contributions
Mine+0.070
|+0.060
 care+0.059
 imag+0.059
Stop+0.058
Neg logit contributions
 exactly-0.058
ook-0.048
iring-0.046
ents-0.046
ooks-0.045
Token he
Token ID339
Feature activation-8.45e-03

Pos logit contributions
Mine+0.050
 care+0.047
Stop+0.044
hered+0.043
 breaths+0.042
Neg logit contributions
 exactly-0.052
ook-0.048
ooks-0.046
bles-0.044
ents-0.044
Token started
Token ID2067
Feature activation+3.00e-02

Pos logit contributions
Mine+0.014
 care+0.013
Stop+0.012
hered+0.012
|+0.012
Neg logit contributions
 exactly-0.016
ook-0.014
ooks-0.014
bles-0.013
ents-0.013
Token to
Token ID284
Feature activation-1.59e-02

Pos logit contributions
 exactly+0.050
ook+0.045
ooks+0.043
 impressed+0.041
ents+0.041
Neg logit contributions
Mine-0.057
 care-0.052
Stop-0.050
hered-0.048
|-0.048
Token cry
Token ID3960
Feature activation-1.18e-01

Pos logit contributions
Mine+0.018
Stop+0.014
hered+0.014
 care+0.013
 breaths+0.013
Neg logit contributions
 exactly-0.038
ook-0.033
aches-0.033
 impressed-0.032
ents-0.032
Token.
Token ID13
Feature activation-4.75e-03

Pos logit contributions
Mine+0.213
 care+0.196
Stop+0.188
|+0.179
hered+0.178
Neg logit contributions
 exactly-0.209
ook-0.189
ooks-0.182
bles-0.175
ents-0.173
Token
Token ID220
Feature activation+2.53e-02

Pos logit contributions
Mine+0.007
 care+0.007
Stop+0.006
hered+0.006
 breaths+0.006
Neg logit contributions
 exactly-0.009
ook-0.009
ooks-0.009
bles-0.008
ents-0.008
Token
Token ID198
Feature activation+1.23e-03

Pos logit contributions
 exactly+0.052
ook+0.048
ooks+0.047
anut+0.046
ossom+0.046
Neg logit contributions
Mine-0.035
 care-0.033
Stop-0.030
hered-0.029
|-0.029
Token
Token ID198
Feature activation+5.06e-04

Pos logit contributions
 exactly+0.003
ook+0.002
ooks+0.002
bles+0.002
anut+0.002
Neg logit contributions
Mine-0.002
 care-0.002
Stop-0.002
|-0.001
hered-0.001
TokenA
Token ID32
Feature activation-7.62e-03

Pos logit contributions
 exactly+0.001
ook+0.001
ooks+0.001
bles+0.001
anut+0.001
Neg logit contributions
Mine-0.001
 care-0.001
Stop-0.001
hered-0.001
|-0.001
Token soon
Token ID2582
Feature activation-3.69e-02

Pos logit contributions
 exactly+0.031
ook+0.028
ooks+0.027
 impressed+0.026
bles+0.026
Neg logit contributions
Mine-0.031
 care-0.028
Stop-0.027
hered-0.026
 breaths-0.026
Token.
Token ID13
Feature activation-7.86e-03

Pos logit contributions
Mine+0.061
 care+0.059
Stop+0.053
hered+0.052
 breaths+0.051
Neg logit contributions
 exactly-0.068
ook-0.063
ooks-0.062
bles-0.059
ossom-0.058
Token
Token ID220
Feature activation+1.90e-02

Pos logit contributions
Mine+0.015
 care+0.014
Stop+0.014
 breaths+0.014
hered+0.014
Neg logit contributions
 exactly-0.012
ook-0.011
ooks-0.011
ents-0.010
ossom-0.010
Token
Token ID198
Feature activation-5.78e-03

Pos logit contributions
 exactly+0.039
ook+0.036
ooks+0.035
anut+0.035
 impressed+0.034
Neg logit contributions
Mine-0.026
 care-0.024
Stop-0.023
|-0.022
hered-0.022
Token
Token ID198
Feature activation-5.94e-03

Pos logit contributions
Mine+0.011
 care+0.010
 breaths+0.010
Stop+0.009
hered+0.009
Neg logit contributions
 exactly-0.010
ook-0.009
ooks-0.009
 impressed-0.008
bles-0.008
TokenE
Token ID36
Feature activation-1.10e-01

Pos logit contributions
Mine+0.013
 care+0.012
 breaths+0.011
Stop+0.011
hered+0.011
Neg logit contributions
 exactly-0.008
ook-0.007
ooks-0.007
 impressed-0.006
bles-0.006
Tokenlla
Token ID8466
Feature activation-3.59e-02

Pos logit contributions
Mine+0.179
 care+0.167
Stop+0.155
hered+0.151
 breaths+0.148
Neg logit contributions
 exactly-0.207
ook-0.193
ooks-0.186
bles-0.178
ossom-0.177
Token smiled
Token ID13541
Feature activation+8.08e-03

Pos logit contributions
Mine+0.056
 care+0.053
Stop+0.048
hered+0.047
 breaths+0.046
Neg logit contributions
 exactly-0.069
ook-0.064
ooks-0.063
bles-0.060
iring-0.060
Token as
Token ID355
Feature activation-2.87e-02

Pos logit contributions
 exactly+0.015
ook+0.014
ooks+0.014
anut+0.013
aches+0.013
Neg logit contributions
Mine-0.013
 care-0.013
hered-0.011
Stop-0.011
 breaths-0.011
Token she
Token ID673
Feature activation+1.73e-02

Pos logit contributions
Mine+0.044
 care+0.044
Stop+0.038
 him+0.036
hered+0.036
Neg logit contributions
 exactly-0.054
ook-0.052
ooks-0.052
anut-0.050
bles-0.050
Token realized
Token ID6939
Feature activation+8.46e-02

Pos logit contributions
 exactly+0.033
ook+0.031
ooks+0.031
bles+0.029
ossom+0.029
Neg logit contributions
Mine-0.027
 care-0.026
Stop-0.024
hered-0.023
 breaths-0.022
Token so
Token ID523
Feature activation-3.39e-03

Pos logit contributions
 care+0.054
Mine+0.050
 weep+0.047
 need+0.047
 cry+0.047
Neg logit contributions
ook-0.056
ooks-0.055
 exactly-0.055
ossom-0.053
bles-0.053
Token happy
Token ID3772
Feature activation+1.55e-02

Pos logit contributions
Mine+0.007
 care+0.007
Stop+0.006
|+0.006
hered+0.006
Neg logit contributions
 exactly-0.004
ook-0.004
ooks-0.004
bles-0.004
ich-0.004
Token that
Token ID326
Feature activation-3.09e-02

Pos logit contributions
 exactly+0.026
ook+0.023
ooks+0.022
ents+0.022
 impressed+0.021
Neg logit contributions
Mine-0.030
 care-0.027
Stop-0.026
hered-0.025
|-0.025
Token she
Token ID673
Feature activation+1.71e-02

Pos logit contributions
Mine+0.050
 care+0.048
Stop+0.044
hered+0.042
|+0.041
Neg logit contributions
 exactly-0.058
ook-0.055
ooks-0.054
bles-0.052
ossom-0.051
Token fell
Token ID3214
Feature activation-1.26e-02

Pos logit contributions
 exactly+0.032
ook+0.029
ooks+0.028
bles+0.027
 impressed+0.027
Neg logit contributions
Mine-0.028
 care-0.026
Stop-0.025
hered-0.024
 breaths-0.024
Token asleep
Token ID16039
Feature activation-1.28e-01

Pos logit contributions
Mine+0.020
 care+0.017
Stop+0.017
Hey+0.015
hered+0.015
Neg logit contributions
 exactly-0.026
bles-0.023
ents-0.022
ook-0.022
 impressed-0.022
Token with
Token ID351
Feature activation-2.93e-02

Pos logit contributions
Mine+0.240
 care+0.223
Stop+0.213
hered+0.203
 breaths+0.201
Neg logit contributions
 exactly-0.211
ook-0.192
ooks-0.185
bles-0.176
 impressed-0.176
Token a
Token ID257
Feature activation-5.42e-03

Pos logit contributions
Mine+0.059
Stop+0.057
|+0.055
storm+0.055
 raged+0.054
Neg logit contributions
 exactly-0.044
 impressed-0.033
ents-0.032
 careful-0.030
 wise-0.030
Token big
Token ID1263
Feature activation-8.79e-03

Pos logit contributions
Mine+0.010
 care+0.010
hered+0.008
 bath+0.008
Stop+0.008
Neg logit contributions
 exactly-0.009
ook-0.009
ooks-0.009
iring-0.008
bles-0.008
Token smile
Token ID8212
Feature activation+1.44e-03

Pos logit contributions
Mine+0.017
Stop+0.016
|+0.015
 care+0.015
gent+0.015
Neg logit contributions
 exactly-0.013
 impressed-0.011
 taller-0.010
ook-0.010
ents-0.010
Token on
Token ID319
Feature activation+3.19e-02

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
anut+0.002
 impressed+0.002
Neg logit contributions
Mine-0.003
 care-0.002
hered-0.002
Stop-0.002
 breaths-0.002
TokenIt
Token ID1026
Feature activation+1.68e-02

Pos logit contributions
 breaths+0.003
 care+0.002
Mine+0.002
|+0.002
 imag+0.002
Neg logit contributions
 exactly-0.003
 impressed-0.002
ook-0.002
ups-0.002
ber-0.002
Token's
Token ID338
Feature activation+6.46e-02

Pos logit contributions
 exactly+0.024
ook+0.021
ooks+0.019
bles+0.019
 impressed+0.019
Neg logit contributions
Mine-0.036
 care-0.033
Stop-0.032
hered-0.031
 breaths-0.031
Token okay
Token ID8788
Feature activation+6.64e-02

Pos logit contributions
ook+0.083
ossom+0.078
 exactly+0.078
ooks+0.077
ber+0.075
Neg logit contributions
 care-0.134
Mine-0.134
Stop-0.122
 him-0.120
 need-0.117
Token,
Token ID11
Feature activation-4.18e-03

Pos logit contributions
 exactly+0.100
ook+0.090
ooks+0.085
ber+0.079
ents+0.077
Neg logit contributions
Mine-0.142
Stop-0.122
|-0.122
hered-0.120
 care-0.120
Token Tom
Token ID4186
Feature activation+4.01e-02

Pos logit contributions
Mine+0.006
 care+0.006
Stop+0.005
hered+0.005
|+0.005
Neg logit contributions
 exactly-0.008
ook-0.008
ooks-0.007
ents-0.007
bles-0.007
Token,"
Token ID553
Feature activation-1.44e-01

Pos logit contributions
 exactly+0.075
ook+0.070
ooks+0.067
bles+0.065
ents+0.065
Neg logit contributions
Mine-0.068
 care-0.060
Stop-0.058
|-0.055
hered-0.055
Token Ben
Token ID3932
Feature activation+8.22e-03

Pos logit contributions
Mine+0.218
 care+0.198
hered+0.188
Stop+0.186
|+0.185
Neg logit contributions
 exactly-0.285
ook-0.261
ooks-0.255
 impressed-0.252
ents-0.247
Token says
Token ID1139
Feature activation-1.82e-02

Pos logit contributions
 exactly+0.018
ook+0.017
ooks+0.016
ents+0.016
bles+0.016
Neg logit contributions
Mine-0.011
 care-0.010
Stop-0.009
hered-0.008
|-0.008
Token.
Token ID13
Feature activation+1.64e-03

Pos logit contributions
Mine+0.029
 care+0.028
Stop+0.025
hered+0.025
 breaths+0.025
Neg logit contributions
 exactly-0.034
ook-0.032
ooks-0.031
bles-0.029
ossom-0.029
Token "
Token ID366
Feature activation+4.23e-02

Pos logit contributions
 exactly+0.002
ook+0.002
ooks+0.002
ossom+0.002
anut+0.002
Neg logit contributions
Mine-0.004
 care-0.003
Stop-0.003
hered-0.003
 breaths-0.003
TokenWe
Token ID1135
Feature activation-3.65e-02

Pos logit contributions
 exactly+0.078
ook+0.072
ooks+0.070
bles+0.068
 impressed+0.067
Neg logit contributions
Mine-0.070
 care-0.066
Stop-0.061
 breaths-0.059
hered-0.059