TOP ACTIVATIONS
MAX = 0.171

there was a yellow pol isher . It was very special
're a very good pol isher ." Lily smiled and felt
started to cry and looked really sad . Henry felt kind
covered the ground and gl imm ered in the sunlight .
so excited and grown - up . Now she could go
He jumped around with gust o and spun in circles as
checked for a fire extingu isher , but there wasn 't
family walked and carefully looked both ways before they crossed the
and to light fun spark lers on a summer night .
of bed and t ipt o ed to her door .
minutes , then t ipt o ed back to the table
this cookie looks so oooo o y ummy !" She was
was clean and nice - sm elling . Spot was happy
and tw irled and flew really fast until it suddenly stopped
a brand new battery - powered car . Tommy was so
a beautiful lake that sh imm ered in the sunlight .
kept spl ashing and pol ishing until he was so tired
had worked so hard pol ishing the post that she was
the road and it gl imm ered in the bright sunshine
and felt very grown - up . While they

MIN ACTIVATIONS
MIN = -0.220

interesting things . To her amaz ement , she unearthed a
magical noise ! To his amaz ement he had revealed a
onto it . To his amaz ement , he could fit
. To her amaz ement , the clothes came
door , and to her amaz ement , she could see
my had told him long ago . The bird spread out
To the little boy 's amaz ement , underneath the tree
people did a long time ago . They lived in castles
The boy watched her in amaz ement . The
. Amy watched them with amaz ement . She smiled ,
Daisy had planted it weeks ago , but now she could
bird looked at her in amaz ement , and said :
he had seen a year ago . He went home ,
her that a long time ago , someone made the mark
the pigeon all those weeks ago . <|endoftext|> Mike and his
imagined the tart was long ago , when she was a
, her eyes wide with amaz ement . "
He was here a minute ago . He was chasing the
, her eyes wide with amaz ement . Her
Tim watched them with pure amaz ement , never t iring

INTERVAL 0.073 - 0.171
CONTAINS 0.435%

hair and brown eyes , just like Jack . He had
And always remember to look both ways before crossing the street
was ready to go and play again , but , this
SPL ASH was always under control ! <|endoftext|> Once upon a
tag and hide - and - seek . Suddenly

INTERVAL -0.024 - 0.073
CONTAINS 86.512%

a spark ly o asis in the distance .
and saw the crab . " It was the
Lily . Do you understand that I like my fur ?"
She felt like a bird . She moved to the sandbox
!" Her dad was very surprised . " Wow , Mia

INTERVAL -0.122 - -0.024
CONTAINS 13.050%

started to bloom . It grew bigger and bigger until it
accept things as they are and focus on what we can
you . Today is your birthday . You are four years
. At the park , Lily saw a huge
inside . She was so excited that she quickly grabbed the

INTERVAL -0.220 - -0.122
CONTAINS 0.004%

. " I want my old violin back ." And so
animals were watching him with amaz ement .
bear told her about its old home and friends . The
he had lost a week ago . He wanted to get
you okay , my d arl ings ?" Mom asked .
Token there
Token ID612
Feature activation-2.28e-02

Pos logit contributions
ents+0.033
 ago+0.033
 opens+0.030
 slam+0.028
 gasp+0.028
Neg logit contributions
P-0.040
scar-0.040
oo-0.037
idd-0.037
uc-0.036
Token was
Token ID373
Feature activation+3.51e-03

Pos logit contributions
ents+0.043
 ago+0.042
 opens+0.042
 amaz+0.040
 gasp+0.040
Neg logit contributions
P-0.033
idd-0.030
scar-0.029
oo-0.029
uc-0.028
Token a
Token ID257
Feature activation-7.73e-03

Pos logit contributions
scar+0.005
P+0.005
oo+0.005
idd+0.005
lly+0.005
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 widen-0.006
 amaz-0.006
Token yellow
Token ID7872
Feature activation-2.87e-02

Pos logit contributions
ents+0.008
 opens+0.007
 slam+0.007
 ago+0.007
 widen+0.007
Neg logit contributions
scar-0.017
oo-0.017
w-0.017
we-0.017
P-0.016
Token pol
Token ID755
Feature activation+1.58e-02

Pos logit contributions
ents+0.050
 ago+0.046
 opens+0.045
 slam+0.044
 gasp+0.043
Neg logit contributions
scar-0.048
P-0.045
dd-0.043
idd-0.042
we-0.042
Tokenisher
Token ID4828
Feature activation+1.47e-01

Pos logit contributions
scar+0.020
lly+0.017
oo+0.017
Matthew+0.017
Fin+0.016
Neg logit contributions
ents-0.031
 ago-0.028
 opens-0.026
 ey-0.025
 distances-0.025
Token.
Token ID13
Feature activation-9.42e-03

Pos logit contributions
P+0.258
scar+0.247
dd+0.222
oo+0.218
lly+0.218
Neg logit contributions
ents-0.251
 ago-0.249
 opens-0.218
 slam-0.212
 gasp-0.207
Token It
Token ID632
Feature activation-3.32e-03

Pos logit contributions
ents+0.014
 opens+0.012
 ago+0.012
 slam+0.011
 widen+0.011
Neg logit contributions
P-0.019
oo-0.018
scar-0.018
idd-0.017
lly-0.017
Token was
Token ID373
Feature activation-1.58e-02

Pos logit contributions
ents+0.006
 opens+0.006
 ago+0.006
 widen+0.005
 slam+0.005
Neg logit contributions
oo-0.005
P-0.005
idd-0.004
uc-0.004
scar-0.004
Token very
Token ID845
Feature activation+2.16e-02

Pos logit contributions
ents+0.028
 opens+0.025
 ago+0.024
 slam+0.024
 gasp+0.024
Neg logit contributions
P-0.027
oo-0.026
scar-0.025
lly-0.024
idd-0.024
Token special
Token ID2041
Feature activation+1.35e-03

Pos logit contributions
P+0.041
oo+0.040
scar+0.039
idd+0.038
lly+0.038
Neg logit contributions
ents-0.034
 ago-0.029
 opens-0.029
 slam-0.028
 amaz-0.028
Token're
Token ID821
Feature activation-7.13e-03

Pos logit contributions
ents+0.000
 opens+0.000
 gasp+0.000
 ago+0.000
 widen+0.000
Neg logit contributions
oo-0.000
P-0.000
scar-0.000
idd-0.000
we-0.000
Token a
Token ID257
Feature activation-4.46e-03

Pos logit contributions
ents+0.011
 opens+0.010
 ago+0.009
 widen+0.009
 amaz+0.009
Neg logit contributions
P-0.013
scar-0.013
oo-0.013
idd-0.013
w-0.012
Token very
Token ID845
Feature activation+2.07e-02

Pos logit contributions
 amaz+0.006
ents+0.006
 opens+0.006
 gasp+0.006
 widen+0.006
Neg logit contributions
oo-0.008
scar-0.008
P-0.008
w-0.008
idd-0.008
Token good
Token ID922
Feature activation+1.81e-02

Pos logit contributions
P+0.036
scar+0.035
oo+0.034
idd+0.033
ter+0.032
Neg logit contributions
ents-0.033
 opens-0.031
 ago-0.030
 gasp-0.029
 slam-0.028
Token pol
Token ID755
Feature activation+4.01e-02

Pos logit contributions
P+0.031
scar+0.030
lly+0.030
idd+0.029
ined+0.029
Neg logit contributions
ents-0.027
 ago-0.026
 opens-0.026
 gasp-0.026
 shelf-0.025
Tokenisher
Token ID4828
Feature activation+1.46e-01

Pos logit contributions
scar+0.054
oo+0.052
lly+0.052
Fin+0.051
P+0.050
Neg logit contributions
ents-0.074
 opens-0.066
 ago-0.062
 amaz-0.058
 gasp-0.057
Token."
Token ID526
Feature activation+2.97e-03

Pos logit contributions
P+0.255
scar+0.244
oo+0.230
lly+0.227
idd+0.222
Neg logit contributions
ents-0.253
 ago-0.237
 opens-0.224
 gasp-0.210
 slam-0.209
Token Lily
Token ID20037
Feature activation-5.90e-03

Pos logit contributions
P+0.005
scar+0.005
oo+0.005
idd+0.005
w+0.005
Neg logit contributions
ents-0.005
 opens-0.004
 ago-0.004
 slam-0.004
 amaz-0.004
Token smiled
Token ID13541
Feature activation-1.65e-02

Pos logit contributions
ents+0.010
 opens+0.009
 amaz+0.009
 gasp+0.009
 widen+0.008
Neg logit contributions
P-0.010
oo-0.010
idd-0.010
scar-0.010
w-0.009
Token and
Token ID290
Feature activation-4.73e-03

Pos logit contributions
ents+0.025
 ago+0.021
 opens+0.021
 slam+0.020
 widen+0.020
Neg logit contributions
P-0.033
oo-0.032
scar-0.031
idd-0.030
lly-0.030
Token felt
Token ID2936
Feature activation+3.78e-03

Pos logit contributions
ents+0.007
 opens+0.006
 gasp+0.006
 amaz+0.006
 widen+0.006
Neg logit contributions
P-0.009
oo-0.009
idd-0.009
mate-0.009
scar-0.008
Token started
Token ID2067
Feature activation+3.80e-03

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
idd+0.013
we+0.013
Neg logit contributions
ents-0.014
 opens-0.013
 ago-0.012
 gasp-0.012
 slam-0.011
Token to
Token ID284
Feature activation-7.95e-03

Pos logit contributions
P+0.006
oo+0.006
scar+0.005
idd+0.005
lly+0.005
Neg logit contributions
ents-0.007
 ago-0.007
 opens-0.006
 widen-0.006
 slam-0.006
Token cry
Token ID3960
Feature activation-4.02e-02

Pos logit contributions
ents+0.010
 ago+0.008
 opens+0.008
 widen+0.007
 amaz+0.007
Neg logit contributions
P-0.018
oo-0.017
scar-0.017
lly-0.016
idd-0.016
Token and
Token ID290
Feature activation+6.99e-03

Pos logit contributions
ents+0.072
 ago+0.063
 opens+0.062
 widen+0.061
 slam+0.060
Neg logit contributions
P-0.069
scar-0.066
idd-0.063
oo-0.063
lly-0.062
Token looked
Token ID3114
Feature activation+5.93e-02

Pos logit contributions
P+0.011
oo+0.011
scar+0.011
idd+0.010
lly+0.010
Neg logit contributions
ents-0.013
 ago-0.011
 opens-0.011
 widen-0.011
 slam-0.011
Token really
Token ID1107
Feature activation+1.36e-01

Pos logit contributions
P+0.102
scar+0.099
oo+0.099
idd+0.093
lly+0.091
Neg logit contributions
ents-0.105
 opens-0.093
 ago-0.091
 slam-0.088
 widen-0.088
Token sad
Token ID6507
Feature activation+2.43e-02

Pos logit contributions
P+0.301
oo+0.291
scar+0.290
idd+0.280
lly+0.275
Neg logit contributions
ents-0.172
 opens-0.145
 ago-0.145
 slam-0.136
 widen-0.135
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
P+0.033
scar+0.031
oo+0.031
idd+0.030
lly+0.029
Neg logit contributions
ents-0.051
 ago-0.046
 opens-0.046
 slam-0.045
 widen-0.044
Token Henry
Token ID8616
Feature activation-3.27e-03

Pos logit contributions
P+0.026
scar+0.025
oo+0.025
idd+0.024
lly+0.024
Neg logit contributions
ents-0.022
 opens-0.019
 ago-0.019
 widen-0.018
 slam-0.018
Token felt
Token ID2936
Feature activation-8.41e-03

Pos logit contributions
ents+0.006
 opens+0.005
 ago+0.005
 slam+0.005
 gasp+0.005
Neg logit contributions
P-0.006
oo-0.005
scar-0.005
idd-0.005
w-0.005
Token kind
Token ID1611
Feature activation-5.41e-03

Pos logit contributions
ents+0.014
 opens+0.012
 ago+0.012
 slam+0.011
 amaz+0.011
Neg logit contributions
P-0.016
oo-0.015
scar-0.015
idd-0.014
lly-0.014
Token covered
Token ID5017
Feature activation+2.69e-02

Pos logit contributions
ents+0.034
 ago+0.031
 opens+0.031
 slam+0.030
 widen+0.030
Neg logit contributions
P-0.024
oo-0.023
scar-0.023
idd-0.022
lly-0.021
Token the
Token ID262
Feature activation+4.48e-03

Pos logit contributions
P+0.045
scar+0.044
idd+0.042
oo+0.041
lly+0.039
Neg logit contributions
ents-0.046
 opens-0.042
 ago-0.041
 amaz-0.039
 widen-0.039
Token ground
Token ID2323
Feature activation-3.15e-02

Pos logit contributions
P+0.008
scar+0.008
oo+0.007
idd+0.007
lly+0.007
Neg logit contributions
ents-0.008
 ago-0.007
 opens-0.007
 slam-0.006
 widen-0.006
Token and
Token ID290
Feature activation+1.43e-02

Pos logit contributions
ents+0.067
 ago+0.061
 amaz+0.059
 slam+0.058
 opens+0.058
Neg logit contributions
oo-0.034
P-0.034
lly-0.034
ter-0.033
ined-0.032
Token gl
Token ID1278
Feature activation-6.27e-03

Pos logit contributions
P+0.020
scar+0.019
lly+0.018
oo+0.018
idd+0.018
Neg logit contributions
ents-0.029
 ago-0.026
 opens-0.026
 widen-0.026
 slam-0.025
Tokenimm
Token ID8608
Feature activation+1.36e-01

Pos logit contributions
ents+0.011
 opens+0.009
 widen+0.009
 ago+0.009
 gasp+0.008
Neg logit contributions
P-0.012
scar-0.012
idd-0.011
oo-0.010
w-0.010
Tokenered
Token ID1068
Feature activation+2.41e-02

Pos logit contributions
P+0.247
scar+0.232
oo+0.224
idd+0.222
B+0.206
Neg logit contributions
ents-0.235
 opens-0.217
 ago-0.202
 widen-0.194
 amaz-0.193
Token in
Token ID287
Feature activation+5.34e-02

Pos logit contributions
P+0.035
oo+0.033
scar+0.033
idd+0.031
lly+0.031
Neg logit contributions
ents-0.049
 ago-0.044
 opens-0.043
 slam-0.042
 widen-0.042
Token the
Token ID262
Feature activation+5.55e-02

Pos logit contributions
P+0.066
oo+0.060
idd+0.058
lly+0.058
ter+0.058
Neg logit contributions
ents-0.116
 opens-0.102
 slam-0.100
 ago-0.099
 widen-0.098
Token sunlight
Token ID19606
Feature activation+1.76e-02

Pos logit contributions
oo+0.107
P+0.107
scar+0.105
idd+0.100
lly+0.098
Neg logit contributions
ents-0.083
 opens-0.074
 ago-0.074
 widen-0.070
 amaz-0.070
Token.
Token ID13
Feature activation+1.95e-02

Pos logit contributions
oo+0.015
P+0.014
ter+0.014
scar+0.013
we+0.013
Neg logit contributions
ents-0.043
 opens-0.040
 slam-0.040
 amaz-0.039
 ago-0.039
Token so
Token ID523
Feature activation+1.59e-02

Pos logit contributions
P+0.008
oo+0.008
scar+0.008
lly+0.008
idd+0.008
Neg logit contributions
ents-0.009
 ago-0.008
 opens-0.008
 slam-0.008
 widen-0.008
Token excited
Token ID6568
Feature activation-5.96e-03

Pos logit contributions
P+0.026
oo+0.025
lly+0.025
scar+0.024
dd+0.023
Neg logit contributions
ents-0.030
 ago-0.025
 thirteen-0.024
 slam-0.024
 opens-0.024
Token and
Token ID290
Feature activation+4.47e-02

Pos logit contributions
ents+0.013
 ago+0.011
 opens+0.011
 slam+0.011
 widen+0.011
Neg logit contributions
P-0.008
oo-0.008
scar-0.008
idd-0.007
lly-0.007
Token grown
Token ID7334
Feature activation+1.31e-02

Pos logit contributions
P+0.063
lly+0.062
oo+0.062
scar+0.060
idd+0.056
Neg logit contributions
ents-0.089
 ago-0.078
 thirteen-0.077
 widen-0.075
 opens-0.075
Token-
Token ID12
Feature activation+1.13e-01

Pos logit contributions
oo+0.020
P+0.020
scar+0.020
w+0.019
idd+0.019
Neg logit contributions
ents-0.025
 opens-0.022
 widen-0.021
 distances-0.021
 ago-0.020
Tokenup
Token ID929
Feature activation+1.34e-01

Pos logit contributions
P+0.172
oo+0.166
scar+0.164
idd+0.158
ter+0.153
Neg logit contributions
ents-0.223
 opens-0.185
 ago-0.181
 slam-0.174
 widen-0.172
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
P+0.199
scar+0.188
oo+0.184
idd+0.174
lly+0.168
Neg logit contributions
ents-0.276
 ago-0.247
 opens-0.245
 slam-0.237
 gasp-0.234
Token Now
Token ID2735
Feature activation+1.04e-02

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
lly+0.019
Neg logit contributions
ents-0.022
 opens-0.019
 ago-0.019
 widen-0.018
 slam-0.018
Token she
Token ID673
Feature activation-2.73e-02

Pos logit contributions
P+0.016
oo+0.015
scar+0.015
lly+0.014
idd+0.014
Neg logit contributions
ents-0.020
 ago-0.018
 opens-0.018
 widen-0.017
 slam-0.017
Token could
Token ID714
Feature activation+3.08e-03

Pos logit contributions
ents+0.057
 opens+0.051
 ago+0.051
 slam+0.049
 amaz+0.048
Neg logit contributions
P-0.039
oo-0.037
scar-0.036
idd-0.034
lly-0.033
Token go
Token ID467
Feature activation-2.69e-02

Pos logit contributions
oo+0.005
P+0.005
lly+0.005
scar+0.005
idd+0.005
Neg logit contributions
ents-0.005
 ago-0.005
 thirteen-0.004
 opens-0.004
 amaz-0.004
Token He
Token ID679
Feature activation+1.54e-02

Pos logit contributions
P+0.043
scar+0.042
oo+0.042
idd+0.040
lly+0.039
Neg logit contributions
ents-0.039
 opens-0.034
 ago-0.034
 widen-0.033
 slam-0.033
Token jumped
Token ID11687
Feature activation+2.36e-03

Pos logit contributions
P+0.023
oo+0.022
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.031
 opens-0.027
 ago-0.027
 slam-0.026
 amaz-0.026
Token around
Token ID1088
Feature activation+2.86e-02

Pos logit contributions
P+0.004
oo+0.004
scar+0.004
idd+0.004
lly+0.004
Neg logit contributions
ents-0.004
 opens-0.004
 ago-0.004
 slam-0.003
 widen-0.003
Token with
Token ID351
Feature activation+3.21e-02

Pos logit contributions
P+0.049
oo+0.047
scar+0.046
idd+0.045
lly+0.044
Neg logit contributions
ents-0.051
 ago-0.044
 opens-0.044
 slam-0.042
 widen-0.042
Token gust
Token ID35253
Feature activation-3.55e-02

Pos logit contributions
P+0.054
oo+0.052
scar+0.051
idd+0.050
lly+0.049
Neg logit contributions
ents-0.056
 opens-0.049
 ago-0.049
 slam-0.047
 widen-0.047
Tokeno
Token ID78
Feature activation+1.34e-01

Pos logit contributions
ents+0.081
 ago+0.073
 opens+0.071
 slam+0.069
 gasp+0.068
Neg logit contributions
P-0.043
scar-0.041
oo-0.040
lly-0.038
idd-0.037
Token and
Token ID290
Feature activation+2.68e-02

Pos logit contributions
P+0.204
oo+0.194
scar+0.194
idd+0.184
lly+0.180
Neg logit contributions
ents-0.261
 ago-0.233
 opens-0.232
 slam-0.224
 widen-0.223
Token spun
Token ID26843
Feature activation+4.52e-02

Pos logit contributions
P+0.044
oo+0.042
scar+0.042
idd+0.040
lly+0.039
Neg logit contributions
ents-0.049
 ago-0.044
 opens-0.044
 slam-0.042
 widen-0.042
Token in
Token ID287
Feature activation+5.00e-02

Pos logit contributions
P+0.087
scar+0.086
oo+0.085
idd+0.081
w+0.079
Neg logit contributions
ents-0.070
 opens-0.060
 ago-0.060
 widen-0.058
 amaz-0.057
Token circles
Token ID13332
Feature activation+2.53e-02

Pos logit contributions
P+0.071
oo+0.068
idd+0.067
lly+0.064
scar+0.064
Neg logit contributions
ents-0.098
 ago-0.083
 opens-0.083
 widen-0.081
 slam-0.081
Token as
Token ID355
Feature activation+6.09e-02

Pos logit contributions
P+0.042
oo+0.040
scar+0.040
idd+0.038
lly+0.037
Neg logit contributions
ents-0.045
 ago-0.040
 opens-0.040
 slam-0.039
 widen-0.038
Token checked
Token ID10667
Feature activation+3.89e-02

Pos logit contributions
ents+0.039
 opens+0.034
 slam+0.033
 ago+0.032
 gasp+0.032
Neg logit contributions
P-0.040
scar-0.039
oo-0.039
idd-0.037
we-0.036
Token for
Token ID329
Feature activation-1.05e-02

Pos logit contributions
P+0.071
scar+0.066
idd+0.065
oo+0.065
lly+0.062
Neg logit contributions
ents-0.068
 opens-0.060
 slam-0.059
 ago-0.058
 gasp-0.055
Token a
Token ID257
Feature activation+2.09e-02

Pos logit contributions
ents+0.020
 opens+0.017
 ago+0.017
 slam+0.017
 amaz+0.017
Neg logit contributions
P-0.017
scar-0.017
oo-0.016
idd-0.016
lly-0.015
Token fire
Token ID2046
Feature activation+5.98e-02

Pos logit contributions
P+0.038
oo+0.036
scar+0.036
idd+0.035
lly+0.034
Neg logit contributions
ents-0.034
 ago-0.030
 opens-0.030
 slam-0.028
 widen-0.028
Token extingu
Token ID24995
Feature activation+6.39e-02

Pos logit contributions
P+0.094
scar+0.086
oo+0.084
lly+0.081
idd+0.079
Neg logit contributions
ents-0.118
 ago-0.112
 opens-0.107
 slam-0.107
 gasp-0.104
Tokenisher
Token ID4828
Feature activation+1.34e-01

Pos logit contributions
P+0.067
scar+0.061
lly+0.057
oo+0.056
B+0.050
Neg logit contributions
ents-0.157
 opens-0.146
 slam-0.142
 ago-0.140
 amaz-0.140
Token,
Token ID11
Feature activation+2.57e-03

Pos logit contributions
P+0.194
scar+0.182
oo+0.179
idd+0.170
lly+0.168
Neg logit contributions
ents-0.275
 ago-0.250
 opens-0.245
 slam-0.239
 gasp-0.234
Token but
Token ID475
Feature activation+9.19e-03

Pos logit contributions
P+0.005
oo+0.005
scar+0.005
ter+0.005
idd+0.005
Neg logit contributions
ents-0.004
 opens-0.003
 ago-0.003
 widen-0.003
 amaz-0.003
Token there
Token ID612
Feature activation-1.14e-02

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
idd+0.013
lly+0.013
Neg logit contributions
ents-0.018
 opens-0.016
 ago-0.016
 slam-0.015
 amaz-0.015
Token wasn
Token ID2492
Feature activation+1.04e-02

Pos logit contributions
ents+0.022
 opens+0.021
 slam+0.020
 ago+0.020
 gasp+0.020
Neg logit contributions
P-0.018
oo-0.016
idd-0.015
scar-0.015
uc-0.014
Token't
Token ID470
Feature activation+1.48e-02

Pos logit contributions
P+0.016
oo+0.016
scar+0.016
idd+0.015
lly+0.015
Neg logit contributions
ents-0.019
 ago-0.018
 opens-0.017
 amaz-0.017
 slam-0.017
Token family
Token ID1641
Feature activation-2.38e-03

Pos logit contributions
ents+0.013
 opens+0.013
 amaz+0.013
 gasp+0.012
 ago+0.012
Neg logit contributions
P-0.012
idd-0.011
scar-0.011
oo-0.011
w-0.011
Token walked
Token ID6807
Feature activation-5.34e-03

Pos logit contributions
ents+0.004
 opens+0.003
 ago+0.003
 gasp+0.003
 slam+0.003
Neg logit contributions
P-0.004
scar-0.004
oo-0.004
we-0.004
uc-0.004
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
ents+0.008
 opens+0.008
 ago+0.007
 slam+0.007
 gasp+0.007
Neg logit contributions
scar-0.010
P-0.010
oo-0.010
idd-0.009
w-0.009
Token carefully
Token ID7773
Feature activation+2.57e-02

Pos logit contributions
P+0.030
oo+0.030
scar+0.029
idd+0.028
ined+0.027
Neg logit contributions
ents-0.033
 opens-0.030
 ago-0.030
 slam-0.029
 gasp-0.029
Token looked
Token ID3114
Feature activation+6.54e-02

Pos logit contributions
P+0.038
oo+0.036
scar+0.036
idd+0.034
lly+0.034
Neg logit contributions
ents-0.051
 opens-0.045
 ago-0.045
 slam-0.044
 widen-0.044
Token both
Token ID1111
Feature activation+1.33e-01

Pos logit contributions
P+0.118
scar+0.117
oo+0.114
idd+0.108
w+0.105
Neg logit contributions
ents-0.114
 opens-0.103
 ago-0.100
 slam-0.096
 amaz-0.096
Token ways
Token ID2842
Feature activation+5.83e-02

Pos logit contributions
P+0.158
lly+0.157
oo+0.155
scar+0.151
idd+0.150
Neg logit contributions
ents-0.266
 ago-0.240
 widen-0.234
 thirteen-0.234
 slam-0.232
Token before
Token ID878
Feature activation-1.13e-02

Pos logit contributions
oo+0.070
P+0.067
scar+0.064
ter+0.062
idd+0.062
Neg logit contributions
ents-0.125
 opens-0.115
 amaz-0.114
 widen-0.111
 slam-0.111
Token they
Token ID484
Feature activation+8.93e-03

Pos logit contributions
ents+0.022
 opens+0.020
 ago+0.020
 slam+0.019
 widen+0.019
Neg logit contributions
P-0.017
oo-0.016
scar-0.016
idd-0.015
lly-0.015
Token crossed
Token ID12606
Feature activation+1.24e-02

Pos logit contributions
P+0.013
scar+0.012
oo+0.012
idd+0.011
lly+0.011
Neg logit contributions
ents-0.018
 ago-0.017
 opens-0.017
 slam-0.016
 widen-0.016
Token the
Token ID262
Feature activation+5.00e-03

Pos logit contributions
P+0.022
scar+0.022
oo+0.021
idd+0.020
w+0.020
Neg logit contributions
ents-0.021
 ago-0.020
 opens-0.019
 slam-0.019
 amaz-0.018
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
ents+0.104
 slam+0.088
 opens+0.088
 ago+0.086
 widen+0.086
Neg logit contributions
P-0.077
oo-0.074
scar-0.071
idd-0.071
lly-0.070
Token to
Token ID284
Feature activation+1.70e-02

Pos logit contributions
lly+0.020
ter+0.019
P+0.018
we+0.018
oo+0.017
Neg logit contributions
ents-0.030
 ago-0.027
umbs-0.026
 distances-0.025
 thirteen-0.024
Token light
Token ID1657
Feature activation+8.60e-03

Pos logit contributions
oo+0.026
lly+0.026
P+0.025
scar+0.024
ter+0.024
Neg logit contributions
ents-0.032
 ago-0.028
 opens-0.027
 widen-0.026
 distances-0.025
Token fun
Token ID1257
Feature activation+3.66e-02

Pos logit contributions
P+0.012
oo+0.012
scar+0.011
idd+0.011
lly+0.011
Neg logit contributions
ents-0.018
 ago-0.016
 opens-0.016
 slam-0.015
 widen-0.015
Token spark
Token ID9009
Feature activation-7.35e-03

Pos logit contributions
P+0.042
oo+0.041
lly+0.039
scar+0.038
we+0.038
Neg logit contributions
ents-0.082
 ago-0.073
 opens-0.072
 slam-0.072
 widen-0.072
Tokenlers
Token ID8116
Feature activation+1.33e-01

Pos logit contributions
ents+0.014
 ago+0.012
 opens+0.012
 slam+0.012
 widen+0.012
Neg logit contributions
P-0.012
oo-0.011
scar-0.011
idd-0.011
lly-0.010
Token on
Token ID319
Feature activation+3.25e-02

Pos logit contributions
P+0.127
oo+0.124
ter+0.120
lly+0.112
idd+0.111
Neg logit contributions
ents-0.315
 opens-0.286
 slam-0.285
 ago-0.283
 amaz-0.281
Token a
Token ID257
Feature activation+4.29e-02

Pos logit contributions
P+0.045
oo+0.043
scar+0.041
lly+0.039
idd+0.039
Neg logit contributions
ents-0.068
 ago-0.061
 opens-0.060
 slam-0.059
 widen-0.058
Token summer
Token ID3931
Feature activation+3.29e-02

Pos logit contributions
P+0.076
oo+0.072
scar+0.071
lly+0.069
idd+0.069
Neg logit contributions
ents-0.073
 ago-0.064
 opens-0.063
 slam-0.061
 amaz-0.060
Token night
Token ID1755
Feature activation+5.13e-02

Pos logit contributions
P+0.049
oo+0.046
scar+0.046
idd+0.044
lly+0.043
Neg logit contributions
ents-0.065
 ago-0.058
 opens-0.058
 slam-0.056
 widen-0.056
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
P+0.061
oo+0.059
scar+0.055
lly+0.054
idd+0.052
Neg logit contributions
ents-0.115
 opens-0.102
 slam-0.101
 amaz-0.100
 ago-0.099
Token of
Token ID286
Feature activation+7.09e-03

Pos logit contributions
ents+0.005
 ago+0.005
 seventeen+0.005
 thirteen+0.005
 distances+0.005
Neg logit contributions
oo-0.005
P-0.004
idd-0.004
us-0.004
scar-0.004
Token bed
Token ID3996
Feature activation-8.34e-03

Pos logit contributions
P+0.010
oo+0.010
scar+0.010
idd+0.009
lly+0.009
Neg logit contributions
ents-0.014
 opens-0.012
 widen-0.012
 ago-0.012
 thirteen-0.012
Token and
Token ID290
Feature activation-4.71e-03

Pos logit contributions
ents+0.015
 opens+0.013
 ago+0.013
 amaz+0.013
 slam+0.013
Neg logit contributions
oo-0.013
P-0.013
scar-0.012
lly-0.012
idd-0.012
Token t
Token ID256
Feature activation+4.01e-02

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 widen+0.008
 slam+0.008
Neg logit contributions
P-0.007
oo-0.006
scar-0.006
lly-0.006
idd-0.006
Tokenipt
Token ID10257
Feature activation+3.13e-02

Pos logit contributions
P+0.053
scar+0.048
lly+0.045
opard+0.044
ter+0.043
Neg logit contributions
ents-0.089
 ago-0.070
 gasp-0.069
 opens-0.068
bell-0.067
Tokeno
Token ID78
Feature activation+1.33e-01

Pos logit contributions
P+0.028
scar+0.024
oo+0.023
lly+0.022
dd+0.022
Neg logit contributions
ents-0.081
 ago-0.073
 opens-0.072
bell-0.069
 slam-0.069
Tokened
Token ID276
Feature activation-6.39e-03

Pos logit contributions
P+0.160
scar+0.154
oo+0.144
idd+0.137
lly+0.129
Neg logit contributions
ents-0.314
 opens-0.279
 ago-0.270
 slam-0.260
 gasp-0.259
Token to
Token ID284
Feature activation+5.64e-04

Pos logit contributions
ents+0.012
 opens+0.010
 ago+0.010
 slam+0.010
 widen+0.010
Neg logit contributions
P-0.011
scar-0.010
oo-0.010
idd-0.010
lly-0.009
Token her
Token ID607
Feature activation-1.92e-02

Pos logit contributions
P+0.001
oo+0.001
scar+0.001
idd+0.001
lly+0.001
Neg logit contributions
ents-0.001
 ago-0.001
 opens-0.001
 widen-0.001
 slam-0.001
Token door
Token ID3420
Feature activation-6.31e-02

Pos logit contributions
ents+0.034
 ago+0.030
 opens+0.029
 widen+0.029
 thirteen+0.029
Neg logit contributions
P-0.031
oo-0.031
scar-0.030
lly-0.029
idd-0.029
Token.
Token ID13
Feature activation-1.27e-03

Pos logit contributions
ents+0.152
 thirteen+0.140
 ago+0.136
 distances+0.135
 seventeen+0.135
Neg logit contributions
oo-0.056
lly-0.047
P-0.046
ter-0.045
scar-0.044
Token minutes
Token ID2431
Feature activation-1.95e-02

Pos logit contributions
P+0.062
oo+0.057
lly+0.056
idd+0.056
scar+0.055
Neg logit contributions
ents-0.069
 widen-0.059
 slam-0.056
 opens-0.056
 gasp-0.056
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
ents+0.036
 widen+0.033
 slam+0.032
 opens+0.032
 amaz+0.031
Neg logit contributions
oo-0.030
P-0.029
scar-0.029
idd-0.028
lly-0.027
Token then
Token ID788
Feature activation+2.21e-02

Pos logit contributions
P+0.041
oo+0.039
scar+0.039
lly+0.037
idd+0.037
Neg logit contributions
ents-0.053
 opens-0.047
 ago-0.047
 widen-0.046
 slam-0.045
Token t
Token ID256
Feature activation+5.59e-02

Pos logit contributions
P+0.037
scar+0.036
oo+0.036
idd+0.034
uc+0.032
Neg logit contributions
ents-0.040
 opens-0.036
 ago-0.035
 gasp-0.034
 amaz-0.034
Tokenipt
Token ID10257
Feature activation+4.33e-02

Pos logit contributions
P+0.067
scar+0.060
lly+0.056
oo+0.055
idd+0.052
Neg logit contributions
ents-0.133
 ago-0.112
 opens-0.109
 gasp-0.108
bell-0.107
Tokeno
Token ID78
Feature activation+1.33e-01

Pos logit contributions
P+0.059
scar+0.055
oo+0.053
lly+0.051
idd+0.050
Neg logit contributions
ents-0.092
 ago-0.082
 opens-0.081
 slam-0.076
 widen-0.075
Tokened
Token ID276
Feature activation-2.05e-02

Pos logit contributions
P+0.150
scar+0.141
oo+0.136
idd+0.125
lly+0.117
Neg logit contributions
ents-0.319
 opens-0.288
 ago-0.282
 slam-0.269
 gasp-0.268
Token back
Token ID736
Feature activation+1.28e-02

Pos logit contributions
ents+0.038
 opens+0.034
 ago+0.034
 slam+0.032
 widen+0.032
Neg logit contributions
P-0.033
scar-0.032
oo-0.032
idd-0.030
lly-0.030
Token to
Token ID284
Feature activation-3.85e-03

Pos logit contributions
P+0.020
oo+0.019
scar+0.019
idd+0.018
lly+0.018
Neg logit contributions
ents-0.025
 ago-0.022
 opens-0.022
 slam-0.021
 widen-0.021
Token the
Token ID262
Feature activation-4.27e-03

Pos logit contributions
ents+0.007
 ago+0.006
 opens+0.006
 slam+0.006
 amaz+0.006
Neg logit contributions
P-0.006
scar-0.006
oo-0.006
idd-0.006
lly-0.005
Token table
Token ID3084
Feature activation-5.54e-02

Pos logit contributions
ents+0.007
 opens+0.006
 ago+0.006
 widen+0.006
 slam+0.006
Neg logit contributions
P-0.008
scar-0.007
oo-0.007
idd-0.007
lly-0.007
Token this
Token ID428
Feature activation-1.97e-02

Pos logit contributions
ter+0.006
lly+0.006
oo+0.006
P+0.006
scar+0.005
Neg logit contributions
ents-0.010
 widen-0.009
 ago-0.009
 gasp-0.009
 distances-0.009
Token cookie
Token ID19751
Feature activation+1.79e-02

Pos logit contributions
ents+0.044
 ago+0.039
 slam+0.039
 widen+0.038
 gasp+0.038
Neg logit contributions
P-0.027
scar-0.025
lly-0.025
oo-0.024
w-0.023
Token looks
Token ID3073
Feature activation+4.69e-03

Pos logit contributions
P+0.025
scar+0.024
oo+0.024
lly+0.023
idd+0.022
Neg logit contributions
ents-0.037
 ago-0.033
 widen-0.032
 amaz-0.032
 slam-0.032
Token so
Token ID523
Feature activation+3.99e-02

Pos logit contributions
P+0.008
lly+0.007
oo+0.007
scar+0.007
ter+0.006
Neg logit contributions
ents-0.008
 ago-0.008
 slam-0.007
 amaz-0.007
 gasp-0.007
Tokenoooo
Token ID13321
Feature activation+2.66e-02

Pos logit contributions
P+0.062
lly+0.059
scar+0.056
oo+0.055
w+0.051
Neg logit contributions
ents-0.077
 ago-0.072
 amaz-0.068
 slam-0.068
 gasp-0.067
Tokeno
Token ID78
Feature activation+1.30e-01

Pos logit contributions
P+0.041
scar+0.038
oo+0.038
idd+0.037
lly+0.035
Neg logit contributions
ents-0.052
 opens-0.046
 ago-0.046
 slam-0.045
 widen-0.044
Token y
Token ID331
Feature activation+7.27e-02

Pos logit contributions
P+0.210
scar+0.202
oo+0.199
idd+0.195
uc+0.178
Neg logit contributions
ents-0.239
 opens-0.221
 ago-0.211
 slam-0.199
 widen-0.197
Tokenummy
Token ID13513
Feature activation+2.76e-02

Pos logit contributions
P+0.120
oo+0.115
scar+0.114
idd+0.109
lly+0.106
Neg logit contributions
ents-0.133
 ago-0.118
 opens-0.117
 slam-0.114
 amaz-0.113
Token!"
Token ID2474
Feature activation-8.61e-03

Pos logit contributions
P+0.040
oo+0.038
scar+0.038
idd+0.036
lly+0.035
Neg logit contributions
ents-0.055
 ago-0.050
 opens-0.050
 slam-0.048
 widen-0.047
Token She
Token ID1375
Feature activation+8.09e-03

Pos logit contributions
ents+0.010
 opens+0.009
 slam+0.009
 ago+0.008
 gasp+0.008
Neg logit contributions
P-0.020
scar-0.019
idd-0.019
oo-0.018
lly-0.018
Token was
Token ID373
Feature activation+4.60e-03

Pos logit contributions
P+0.014
oo+0.013
idd+0.013
scar+0.013
w+0.012
Neg logit contributions
ents-0.014
 opens-0.013
 slam-0.012
 amaz-0.012
 gasp-0.011
Token was
Token ID373
Feature activation+4.36e-03

Pos logit contributions
P+0.011
oo+0.011
scar+0.010
idd+0.010
w+0.009
Neg logit contributions
ents-0.012
 opens-0.011
 slam-0.010
 gasp-0.010
 amaz-0.010
Token clean
Token ID3424
Feature activation+4.28e-02

Pos logit contributions
P+0.008
oo+0.008
scar+0.008
idd+0.008
lly+0.008
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 widen-0.006
Token and
Token ID290
Feature activation+5.26e-03

Pos logit contributions
P+0.068
oo+0.065
scar+0.065
idd+0.062
lly+0.061
Neg logit contributions
ents-0.080
 ago-0.071
 opens-0.071
 slam-0.069
 widen-0.068
Token nice
Token ID3621
Feature activation+1.88e-02

Pos logit contributions
P+0.009
scar+0.009
oo+0.009
lly+0.009
idd+0.008
Neg logit contributions
ents-0.009
 ago-0.008
 opens-0.008
 slam-0.007
 widen-0.007
Token-
Token ID12
Feature activation+9.88e-02

Pos logit contributions
P+0.031
scar+0.029
oo+0.029
idd+0.028
lly+0.027
Neg logit contributions
ents-0.035
 ago-0.032
 opens-0.032
 amaz-0.030
 gasp-0.030
Tokensm
Token ID5796
Feature activation+1.30e-01

Pos logit contributions
P+0.127
scar+0.121
idd+0.120
ipper+0.119
oo+0.114
Neg logit contributions
ents-0.204
 opens-0.169
 slam-0.164
 sill-0.161
 seem-0.157
Tokenelling
Token ID9417
Feature activation+2.94e-02

Pos logit contributions
P+0.269
scar+0.258
lly+0.251
oo+0.251
idd+0.242
Neg logit contributions
ents-0.187
 ago-0.151
 opens-0.151
 slam-0.143
 widen-0.140
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
P+0.054
scar+0.050
oo+0.050
idd+0.049
w+0.047
Neg logit contributions
ents-0.046
 opens-0.043
 widen-0.041
 slam-0.041
 amaz-0.041
Token Spot
Token ID15899
Feature activation+3.09e-02

Pos logit contributions
P+0.020
oo+0.019
scar+0.019
idd+0.018
lly+0.017
Neg logit contributions
ents-0.031
 opens-0.028
 ago-0.028
 slam-0.027
 widen-0.026
Token was
Token ID373
Feature activation-2.99e-03

Pos logit contributions
P+0.053
oo+0.051
scar+0.050
idd+0.049
w+0.047
Neg logit contributions
ents-0.055
 opens-0.050
 gasp-0.045
 slam-0.045
 ago-0.043
Token happy
Token ID3772
Feature activation-3.51e-02

Pos logit contributions
ents+0.004
 ago+0.004
 opens+0.004
 slam+0.003
 widen+0.003
Neg logit contributions
P-0.006
oo-0.006
scar-0.006
idd-0.006
lly-0.006
Token and
Token ID290
Feature activation+9.75e-03

Pos logit contributions
P+0.061
oo+0.059
scar+0.058
idd+0.056
lly+0.055
Neg logit contributions
ents-0.053
 ago-0.046
 opens-0.046
 slam-0.044
 widen-0.043
Token tw
Token ID665
Feature activation-1.82e-02

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
idd+0.013
lly+0.013
Neg logit contributions
ents-0.019
 ago-0.017
 opens-0.017
 slam-0.016
 widen-0.016
Tokenirled
Token ID49376
Feature activation+6.55e-02

Pos logit contributions
ents+0.042
 ago+0.037
 opens+0.037
 distances+0.036
 amaz+0.036
Neg logit contributions
P-0.021
scar-0.020
idd-0.018
ter-0.018
oo-0.017
Token and
Token ID290
Feature activation+5.00e-02

Pos logit contributions
scar+0.132
P+0.132
oo+0.127
idd+0.125
lly+0.120
Neg logit contributions
ents-0.096
 opens-0.083
 ago-0.080
 widen-0.080
 gasp-0.077
Token flew
Token ID13112
Feature activation+1.24e-02

Pos logit contributions
P+0.082
oo+0.079
scar+0.079
idd+0.075
lly+0.073
Neg logit contributions
ents-0.091
 opens-0.081
 ago-0.081
 slam-0.078
 widen-0.077
Token really
Token ID1107
Feature activation+1.30e-01

Pos logit contributions
P+0.025
scar+0.024
oo+0.024
idd+0.023
lly+0.022
Neg logit contributions
ents-0.019
 opens-0.016
 ago-0.016
 widen-0.016
 slam-0.015
Token fast
Token ID3049
Feature activation+6.33e-02

Pos logit contributions
P+0.202
oo+0.199
lly+0.189
scar+0.185
idd+0.181
Neg logit contributions
ents-0.238
 ago-0.213
 amaz-0.207
 opens-0.206
 slam-0.198
Token until
Token ID1566
Feature activation+1.87e-02

Pos logit contributions
P+0.075
oo+0.072
scar+0.069
idd+0.067
lly+0.064
Neg logit contributions
ents-0.142
 opens-0.128
 ago-0.127
 slam-0.125
 amaz-0.124
Token it
Token ID340
Feature activation+1.70e-02

Pos logit contributions
scar+0.035
P+0.034
uc+0.032
oo+0.032
idd+0.031
Neg logit contributions
ents-0.030
 ago-0.029
 opens-0.028
 amaz-0.026
 widen-0.026
Token suddenly
Token ID6451
Feature activation-3.17e-03

Pos logit contributions
P+0.026
scar+0.025
oo+0.025
idd+0.024
uc+0.023
Neg logit contributions
ents-0.033
 opens-0.030
 ago-0.029
 widen-0.028
 slam-0.028
Token stopped
Token ID5025
Feature activation+2.12e-02

Pos logit contributions
ents+0.006
 opens+0.005
 ago+0.005
 gasp+0.005
 widen+0.005
Neg logit contributions
P-0.005
scar-0.005
oo-0.005
idd-0.005
uc-0.005
Token a
Token ID257
Feature activation+2.86e-02

Pos logit contributions
P+0.014
oo+0.014
scar+0.014
idd+0.013
lly+0.012
Neg logit contributions
ents-0.020
 ago-0.018
 opens-0.018
 slam-0.018
 amaz-0.017
Token brand
Token ID4508
Feature activation-1.54e-02

Pos logit contributions
P+0.051
scar+0.048
oo+0.048
idd+0.047
lly+0.047
Neg logit contributions
ents-0.049
 ago-0.042
 opens-0.042
 slam-0.040
 widen-0.040
Token new
Token ID649
Feature activation+7.81e-03

Pos logit contributions
ents+0.024
 opens+0.021
 amaz+0.020
 gasp+0.019
 slam+0.019
Neg logit contributions
P-0.030
scar-0.029
oo-0.028
dd-0.028
idd-0.027
Token battery
Token ID6555
Feature activation+8.61e-02

Pos logit contributions
P+0.013
scar+0.013
oo+0.013
idd+0.012
us+0.012
Neg logit contributions
ents-0.013
 ago-0.012
 opens-0.012
 amaz-0.012
 gasp-0.012
Token-
Token ID12
Feature activation+6.79e-02

Pos logit contributions
P+0.136
scar+0.125
oo+0.125
idd+0.115
lly+0.111
Neg logit contributions
ents-0.167
 ago-0.156
 opens-0.153
 gasp-0.145
 slam-0.143
Tokenpowered
Token ID12293
Feature activation+1.29e-01

Pos logit contributions
scar+0.101
P+0.100
oo+0.096
uc+0.090
idd+0.090
Neg logit contributions
ents-0.132
 ago-0.114
 opens-0.113
 thirteen-0.107
 widen-0.103
Token car
Token ID1097
Feature activation+6.68e-02

Pos logit contributions
P+0.215
scar+0.205
oo+0.203
idd+0.189
dd+0.189
Neg logit contributions
ents-0.223
 ago-0.210
 opens-0.209
 slam-0.199
 amaz-0.197
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
P+0.095
scar+0.089
oo+0.089
idd+0.084
lly+0.081
Neg logit contributions
ents-0.138
 ago-0.125
 opens-0.124
 slam-0.119
 widen-0.118
Token Tommy
Token ID19919
Feature activation+1.58e-02

Pos logit contributions
P+0.010
scar+0.009
oo+0.009
idd+0.009
lly+0.008
Neg logit contributions
ents-0.015
 ago-0.014
 opens-0.014
 slam-0.013
 widen-0.013
Token was
Token ID373
Feature activation-7.68e-03

Pos logit contributions
P+0.019
oo+0.018
scar+0.018
idd+0.017
lly+0.016
Neg logit contributions
ents-0.036
 opens-0.032
 ago-0.032
 slam-0.031
 widen-0.031
Token so
Token ID523
Feature activation-1.96e-02

Pos logit contributions
ents+0.012
 ago+0.012
 opens+0.011
 distances+0.011
 slam+0.011
Neg logit contributions
oo-0.013
idd-0.012
scar-0.012
lly-0.012
ter-0.011
Token a
Token ID257
Feature activation+2.55e-02

Pos logit contributions
P+0.032
scar+0.032
oo+0.031
idd+0.030
uc+0.029
Neg logit contributions
ents-0.029
 opens-0.026
 amaz-0.025
 ago-0.025
 slam-0.023
Token beautiful
Token ID4950
Feature activation+1.64e-03

Pos logit contributions
P+0.048
oo+0.047
scar+0.046
idd+0.044
lly+0.043
Neg logit contributions
ents-0.040
 ago-0.035
 opens-0.035
 slam-0.034
 widen-0.033
Token lake
Token ID13546
Feature activation+5.95e-02

Pos logit contributions
P+0.003
scar+0.003
oo+0.003
idd+0.003
w+0.003
Neg logit contributions
ents-0.003
 opens-0.002
 ago-0.002
 amaz-0.002
 slam-0.002
Token that
Token ID326
Feature activation+5.16e-02

Pos logit contributions
P+0.090
scar+0.085
oo+0.085
idd+0.081
lly+0.079
Neg logit contributions
ents-0.117
 ago-0.105
 opens-0.104
 slam-0.101
 widen-0.099
Token sh
Token ID427
Feature activation+1.77e-02

Pos logit contributions
oo+0.084
P+0.082
scar+0.079
idd+0.079
us+0.075
Neg logit contributions
ents-0.092
 opens-0.089
 amaz-0.082
 ago-0.082
 gasp-0.082
Tokenimm
Token ID8608
Feature activation+1.29e-01

Pos logit contributions
P+0.017
scar+0.013
mate+0.013
idd+0.013
uc+0.012
Neg logit contributions
ents-0.043
 widen-0.040
 ago-0.038
 opens-0.037
 distances-0.036
Tokenered
Token ID1068
Feature activation+1.37e-02

Pos logit contributions
P+0.259
scar+0.240
idd+0.230
B+0.229
oo+0.223
Neg logit contributions
ents-0.197
 opens-0.190
 ago-0.163
 amaz-0.163
 gasp-0.160
Token in
Token ID287
Feature activation+4.81e-02

Pos logit contributions
P+0.021
scar+0.021
oo+0.020
idd+0.018
lly+0.018
Neg logit contributions
ents-0.027
 ago-0.025
 opens-0.024
 widen-0.023
 gasp-0.023
Token the
Token ID262
Feature activation+5.20e-02

Pos logit contributions
P+0.061
oo+0.054
lly+0.053
idd+0.052
ter+0.052
Neg logit contributions
ents-0.105
 opens-0.093
 ago-0.091
 slam-0.091
 widen-0.090
Token sunlight
Token ID19606
Feature activation+1.22e-02

Pos logit contributions
oo+0.103
scar+0.102
P+0.101
idd+0.096
uc+0.094
Neg logit contributions
ents-0.075
 opens-0.067
 ago-0.067
 widen-0.065
 amaz-0.064
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
oo+0.011
P+0.011
ter+0.010
scar+0.010
idd+0.010
Neg logit contributions
ents-0.029
 opens-0.027
 slam-0.027
 amaz-0.026
 ago-0.026
Token kept
Token ID4030
Feature activation+3.22e-02

Pos logit contributions
P+0.064
scar+0.061
oo+0.061
idd+0.058
w+0.054
Neg logit contributions
ents-0.083
 opens-0.074
 ago-0.073
 slam-0.070
 gasp-0.070
Token spl
Token ID4328
Feature activation+4.65e-02

Pos logit contributions
P+0.052
scar+0.050
oo+0.048
idd+0.046
lly+0.046
Neg logit contributions
ents-0.060
 ago-0.053
 opens-0.053
 amaz-0.051
 widen-0.051
Tokenashing
Token ID2140
Feature activation+9.06e-02

Pos logit contributions
P+0.076
scar+0.072
lly+0.069
oo+0.069
idd+0.067
Neg logit contributions
ents-0.088
 ago-0.074
 opens-0.072
 widen-0.071
 slam-0.070
Token and
Token ID290
Feature activation+3.87e-02

Pos logit contributions
P+0.157
scar+0.152
oo+0.151
idd+0.145
lly+0.141
Neg logit contributions
ents-0.156
 ago-0.138
 opens-0.138
 slam-0.131
 widen-0.131
Token pol
Token ID755
Feature activation+6.23e-02

Pos logit contributions
P+0.064
oo+0.061
scar+0.060
idd+0.058
lly+0.056
Neg logit contributions
ents-0.071
 ago-0.063
 opens-0.062
 slam-0.060
 widen-0.060
Tokenishing
Token ID3929
Feature activation+1.28e-01

Pos logit contributions
scar+0.091
oo+0.090
P+0.086
opard+0.080
mate+0.078
Neg logit contributions
ents-0.120
 opens-0.105
 distances-0.100
 ago-0.100
 amaz-0.096
Token until
Token ID1566
Feature activation+2.01e-02

Pos logit contributions
P+0.212
scar+0.203
oo+0.203
idd+0.194
lly+0.187
Neg logit contributions
ents-0.233
 opens-0.208
 ago-0.207
 widen-0.198
 slam-0.198
Token he
Token ID339
Feature activation+6.02e-03

Pos logit contributions
P+0.035
scar+0.034
oo+0.034
idd+0.033
uc+0.032
Neg logit contributions
ents-0.033
 ago-0.031
 opens-0.029
 amaz-0.029
 slam-0.028
Token was
Token ID373
Feature activation+4.78e-04

Pos logit contributions
P+0.009
oo+0.009
scar+0.009
idd+0.008
uc+0.008
Neg logit contributions
ents-0.012
 ago-0.010
 opens-0.010
 gasp-0.010
 amaz-0.010
Token so
Token ID523
Feature activation+5.50e-03

Pos logit contributions
P+0.001
oo+0.001
scar+0.001
idd+0.001
lly+0.001
Neg logit contributions
ents-0.001
 ago-0.001
 opens-0.001
 slam-0.001
 widen-0.001
Token tired
Token ID10032
Feature activation+5.06e-02

Pos logit contributions
P+0.009
oo+0.009
scar+0.009
idd+0.009
lly+0.009
Neg logit contributions
ents-0.010
 opens-0.008
 ago-0.008
 slam-0.008
 amaz-0.008
Token had
Token ID550
Feature activation+6.22e-04

Pos logit contributions
P+0.006
scar+0.006
oo+0.006
idd+0.005
w+0.005
Neg logit contributions
ents-0.008
 opens-0.007
 ago-0.007
 gasp-0.007
 slam-0.007
Token worked
Token ID3111
Feature activation+1.84e-02

Pos logit contributions
P+0.001
scar+0.001
lly+0.001
oo+0.001
idd+0.001
Neg logit contributions
ents-0.001
 ago-0.001
 slam-0.001
 opens-0.001
 widen-0.001
Token so
Token ID523
Feature activation+2.80e-02

Pos logit contributions
P+0.032
oo+0.031
scar+0.031
idd+0.029
lly+0.029
Neg logit contributions
ents-0.032
 ago-0.028
 opens-0.028
 slam-0.027
 widen-0.027
Token hard
Token ID1327
Feature activation+3.13e-02

Pos logit contributions
P+0.051
oo+0.049
scar+0.049
idd+0.046
lly+0.046
Neg logit contributions
ents-0.047
 ago-0.041
 opens-0.041
 slam-0.039
 amaz-0.039
Token pol
Token ID755
Feature activation+6.37e-02

Pos logit contributions
oo+0.041
P+0.040
scar+0.038
dd+0.036
lly+0.035
Neg logit contributions
ents-0.067
 amaz-0.061
 opens-0.061
 slam-0.060
 widen-0.060
Tokenishing
Token ID3929
Feature activation+1.28e-01

Pos logit contributions
oo+0.080
scar+0.079
P+0.078
idd+0.074
lly+0.068
Neg logit contributions
ents-0.136
 opens-0.123
 ago-0.114
 distances-0.112
 amaz-0.111
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
P+0.193
oo+0.184
scar+0.183
idd+0.174
lly+0.171
Neg logit contributions
ents-0.249
 ago-0.222
 opens-0.222
 slam-0.214
 widen-0.213
Token post
Token ID1281
Feature activation+2.05e-02

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.025
 ago-0.022
 opens-0.022
 slam-0.021
 widen-0.021
Token that
Token ID326
Feature activation+3.41e-02

Pos logit contributions
P+0.027
oo+0.026
scar+0.025
idd+0.024
lly+0.024
Neg logit contributions
ents-0.043
 ago-0.039
 opens-0.039
 widen-0.038
 slam-0.038
Token she
Token ID673
Feature activation-1.47e-02

Pos logit contributions
P+0.052
oo+0.050
scar+0.050
idd+0.047
lly+0.046
Neg logit contributions
ents-0.066
 opens-0.059
 ago-0.059
 slam-0.057
 widen-0.056
Token was
Token ID373
Feature activation-1.35e-04

Pos logit contributions
ents+0.027
 opens+0.023
 ago+0.022
 gasp+0.021
 slam+0.021
Neg logit contributions
P-0.024
oo-0.024
scar-0.023
idd-0.022
uc-0.021
Token the
Token ID262
Feature activation+4.24e-03

Pos logit contributions
ents+0.036
 opens+0.032
 slam+0.032
 ago+0.032
 widen+0.031
Neg logit contributions
P-0.024
oo-0.023
scar-0.022
idd-0.021
lly-0.020
Token road
Token ID2975
Feature activation-1.66e-02

Pos logit contributions
P+0.008
scar+0.007
oo+0.007
idd+0.007
lly+0.007
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 widen-0.006
Token and
Token ID290
Feature activation-1.73e-03

Pos logit contributions
ents+0.036
 opens+0.032
 ago+0.031
 amaz+0.031
 widen+0.031
Neg logit contributions
P-0.021
oo-0.021
scar-0.020
idd-0.019
lly-0.019
Token it
Token ID340
Feature activation+6.93e-03

Pos logit contributions
ents+0.004
 ago+0.003
 opens+0.003
 slam+0.003
 widen+0.003
Neg logit contributions
P-0.002
scar-0.002
oo-0.002
lly-0.002
idd-0.002
Token gl
Token ID1278
Feature activation-2.30e-02

Pos logit contributions
P+0.011
oo+0.011
scar+0.010
idd+0.010
uc+0.009
Neg logit contributions
ents-0.013
 opens-0.012
 widen-0.011
 ago-0.011
 slam-0.011
Tokenimm
Token ID8608
Feature activation+1.27e-01

Pos logit contributions
ents+0.040
 opens+0.034
 ago+0.034
 widen+0.034
bell+0.033
Neg logit contributions
P-0.041
scar-0.040
idd-0.036
B-0.035
lly-0.035
Tokenered
Token ID1068
Feature activation+1.33e-02

Pos logit contributions
P+0.247
scar+0.232
idd+0.223
B+0.222
oo+0.212
Neg logit contributions
ents-0.201
 opens-0.194
 amaz-0.162
 ago-0.161
 widen-0.158
Token in
Token ID287
Feature activation+4.54e-02

Pos logit contributions
scar+0.022
P+0.022
oo+0.020
idd+0.019
lly+0.019
Neg logit contributions
ents-0.026
 ago-0.023
 opens-0.022
 widen-0.021
 gasp-0.021
Token the
Token ID262
Feature activation+4.64e-02

Pos logit contributions
P+0.057
oo+0.051
idd+0.048
lly+0.048
scar+0.047
Neg logit contributions
ents-0.098
 slam-0.088
 opens-0.088
 ago-0.086
 gasp-0.086
Token bright
Token ID6016
Feature activation-6.06e-03

Pos logit contributions
P+0.085
oo+0.079
scar+0.079
idd+0.077
lly+0.075
Neg logit contributions
ents-0.077
 ago-0.067
 opens-0.067
 slam-0.064
 gasp-0.063
Token sunshine
Token ID34488
Feature activation+2.31e-02

Pos logit contributions
ents+0.010
 slam+0.009
 ago+0.009
 opens+0.009
 gasp+0.009
Neg logit contributions
P-0.010
oo-0.010
idd-0.010
scar-0.010
lly-0.009
Token and
Token ID290
Feature activation-2.52e-02

Pos logit contributions
P+0.108
oo+0.104
scar+0.103
idd+0.098
lly+0.097
Neg logit contributions
ents-0.101
 opens-0.088
 ago-0.088
 widen-0.085
 slam-0.085
Token felt
Token ID2936
Feature activation+1.11e-03

Pos logit contributions
ents+0.045
 opens+0.042
 ago+0.039
 slam+0.038
 amaz+0.038
Neg logit contributions
P-0.041
scar-0.040
oo-0.040
idd-0.039
us-0.036
Token very
Token ID845
Feature activation+6.16e-02

Pos logit contributions
P+0.002
oo+0.002
scar+0.002
idd+0.002
lly+0.002
Neg logit contributions
ents-0.002
 ago-0.002
 opens-0.002
 slam-0.002
 widen-0.002
Token grown
Token ID7334
Feature activation-1.74e-03

Pos logit contributions
P+0.112
oo+0.108
scar+0.106
lly+0.103
idd+0.102
Neg logit contributions
ents-0.103
 ago-0.089
 opens-0.088
 slam-0.085
 amaz-0.084
Token-
Token ID12
Feature activation+9.58e-02

Pos logit contributions
ents+0.005
 opens+0.004
 ago+0.004
 widen+0.004
 slam+0.004
Neg logit contributions
P-0.001
scar-0.001
oo-0.001
idd-0.001
lly-0.001
Tokenup
Token ID929
Feature activation+1.27e-01

Pos logit contributions
P+0.122
scar+0.120
oo+0.117
idd+0.116
ter+0.109
Neg logit contributions
ents-0.209
 opens-0.177
 ago-0.170
 slam-0.167
 widen-0.166
Token.
Token ID13
Feature activation+4.03e-03

Pos logit contributions
P+0.169
scar+0.159
oo+0.159
idd+0.150
lly+0.146
Neg logit contributions
ents-0.271
 ago-0.244
 opens-0.243
 slam-0.236
 widen-0.234
Token
Token ID198
Feature activation-4.61e-03

Pos logit contributions
P+0.007
scar+0.006
oo+0.006
idd+0.006
lly+0.006
Neg logit contributions
ents-0.007
 ago-0.007
 opens-0.006
 slam-0.006
 widen-0.006
Token
Token ID198
Feature activation-4.05e-03

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.008
 widen+0.007
Neg logit contributions
P-0.007
oo-0.007
scar-0.007
idd-0.007
lly-0.006
TokenWhile
Token ID3633
Feature activation+1.95e-02

Pos logit contributions
ents+0.008
 ago+0.007
 opens+0.007
 amaz+0.007
 slam+0.007
Neg logit contributions
P-0.007
oo-0.006
scar-0.006
lly-0.005
B-0.005
Token they
Token ID484
Feature activation+6.21e-03

Pos logit contributions
scar+0.032
P+0.032
oo+0.030
idd+0.029
lly+0.029
Neg logit contributions
ents-0.034
 ago-0.033
 opens-0.031
 slam-0.029
 amaz-0.028
Token interesting
Token ID3499
Feature activation+6.43e-03

Pos logit contributions
P+0.009
oo+0.008
lly+0.008
idd+0.008
scar+0.008
Neg logit contributions
ents-0.011
 ago-0.010
 opens-0.010
 slam-0.010
 widen-0.010
Token things
Token ID1243
Feature activation+2.52e-02

Pos logit contributions
P+0.007
oo+0.006
idd+0.006
scar+0.006
lly+0.006
Neg logit contributions
ents-0.015
 slam-0.014
 widen-0.014
 ago-0.014
 opens-0.013
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
P+0.034
oo+0.032
scar+0.031
idd+0.030
lly+0.029
Neg logit contributions
ents-0.053
 opens-0.048
 ago-0.048
 slam-0.047
 amaz-0.046
Token To
Token ID1675
Feature activation-8.95e-03

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.022
 ago-0.020
 opens-0.020
 widen-0.019
 slam-0.019
Token her
Token ID607
Feature activation-2.85e-02

Pos logit contributions
ents+0.019
 ago+0.017
 thirteen+0.016
 opens+0.016
 slam+0.016
Neg logit contributions
P-0.012
oo-0.012
lly-0.011
scar-0.011
mate-0.011
Token amaz
Token ID40642
Feature activation-2.20e-01

Pos logit contributions
ents+0.060
 ago+0.052
 thirteen+0.051
 widen+0.049
 seventeen+0.048
Neg logit contributions
P-0.039
oo-0.039
lly-0.038
 friend-0.037
scar-0.036
Tokenement
Token ID972
Feature activation-1.02e-01

Pos logit contributions
ents+0.363
 ago+0.316
 opens+0.314
 slam+0.304
 widen+0.300
Neg logit contributions
P-0.400
oo-0.386
scar-0.384
idd-0.368
lly-0.364
Token,
Token ID11
Feature activation-6.94e-03

Pos logit contributions
ents+0.192
 ago+0.171
 opens+0.170
 slam+0.163
 widen+0.161
Neg logit contributions
P-0.165
scar-0.155
oo-0.154
idd-0.148
lly-0.143
Token she
Token ID673
Feature activation-9.74e-03

Pos logit contributions
ents+0.015
 ago+0.013
 opens+0.013
 widen+0.013
 slam+0.012
Neg logit contributions
P-0.009
oo-0.008
lly-0.008
scar-0.008
ter-0.008
Token unearthed
Token ID41562
Feature activation+7.04e-03

Pos logit contributions
ents+0.019
 opens+0.017
 ago+0.017
 amaz+0.016
 gasp+0.016
Neg logit contributions
P-0.015
oo-0.014
scar-0.014
idd-0.013
uc-0.012
Token a
Token ID257
Feature activation+2.75e-02

Pos logit contributions
P+0.011
scar+0.011
oo+0.011
idd+0.010
w+0.009
Neg logit contributions
ents-0.014
 opens-0.012
 ago-0.012
 amaz-0.012
 slam-0.012
Token magical
Token ID10883
Feature activation-1.48e-02

Pos logit contributions
P+0.004
oo+0.004
scar+0.004
lly+0.004
idd+0.004
Neg logit contributions
ents-0.003
 ago-0.003
 opens-0.003
 slam-0.003
 distances-0.003
Token noise
Token ID7838
Feature activation-1.13e-02

Pos logit contributions
ents+0.026
 ago+0.026
 opens+0.025
 amaz+0.025
 gasp+0.025
Neg logit contributions
P-0.024
scar-0.023
oo-0.022
idd-0.021
w-0.021
Token!
Token ID0
Feature activation+1.84e-02

Pos logit contributions
ents+0.021
 ago+0.019
 opens+0.019
 gasp+0.018
 widen+0.018
Neg logit contributions
P-0.020
scar-0.018
idd-0.017
oo-0.017
uc-0.017
Token To
Token ID1675
Feature activation-4.46e-03

Pos logit contributions
P+0.039
oo+0.037
scar+0.037
idd+0.036
lly+0.035
Neg logit contributions
ents-0.026
 ago-0.022
 opens-0.022
 slam-0.021
 gasp-0.021
Token his
Token ID465
Feature activation+6.24e-03

Pos logit contributions
ents+0.009
 opens+0.009
 ago+0.008
 slam+0.008
 amaz+0.008
Neg logit contributions
P-0.006
scar-0.006
idd-0.006
oo-0.006
uc-0.005
Token amaz
Token ID40642
Feature activation-2.20e-01

Pos logit contributions
P+0.010
oo+0.010
scar+0.009
lly+0.009
ee+0.008
Neg logit contributions
ents-0.012
 ago-0.011
 thirteen-0.010
 distances-0.010
 widen-0.010
Tokenement
Token ID972
Feature activation-1.02e-01

Pos logit contributions
ents+0.357
 ago+0.311
 opens+0.309
 slam+0.298
 widen+0.294
Neg logit contributions
P-0.405
oo-0.390
scar-0.388
idd-0.373
lly-0.367
Token he
Token ID339
Feature activation-1.64e-02

Pos logit contributions
ents+0.184
 ago+0.165
 opens+0.161
 gasp+0.156
 amaz+0.151
Neg logit contributions
P-0.183
scar-0.170
oo-0.161
idd-0.161
w-0.152
Token had
Token ID550
Feature activation-8.76e-03

Pos logit contributions
ents+0.034
 opens+0.030
 ago+0.030
 slam+0.029
 widen+0.029
Neg logit contributions
P-0.023
oo-0.022
scar-0.022
idd-0.021
lly-0.020
Token revealed
Token ID4602
Feature activation-1.49e-02

Pos logit contributions
ents+0.016
 opens+0.014
 ago+0.014
 amaz+0.013
 gasp+0.013
Neg logit contributions
P-0.015
scar-0.015
oo-0.014
idd-0.014
ter-0.013
Token a
Token ID257
Feature activation+4.18e-02

Pos logit contributions
ents+0.027
 amaz+0.024
 opens+0.024
 ago+0.024
 gasp+0.023
Neg logit contributions
P-0.026
scar-0.025
oo-0.024
idd-0.024
uc-0.023
Token onto
Token ID4291
Feature activation+1.49e-02

Pos logit contributions
P+0.021
oo+0.020
scar+0.019
idd+0.019
lly+0.019
Neg logit contributions
ents-0.028
 ago-0.024
 opens-0.024
 widen-0.023
 slam-0.023
Token it
Token ID340
Feature activation+3.39e-02

Pos logit contributions
P+0.024
oo+0.023
scar+0.023
idd+0.022
lly+0.021
Neg logit contributions
ents-0.028
 ago-0.025
 opens-0.025
 slam-0.024
 widen-0.024
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
oo+0.024
P+0.023
ter+0.021
lly+0.021
B+0.020
Neg logit contributions
ents-0.086
 ago-0.081
 opens-0.079
 thirteen-0.078
 amaz-0.078
Token To
Token ID1675
Feature activation-2.01e-03

Pos logit contributions
P+0.029
oo+0.029
scar+0.028
idd+0.027
lly+0.027
Neg logit contributions
ents-0.024
 ago-0.021
 opens-0.020
 slam-0.020
 widen-0.020
Token his
Token ID465
Feature activation+9.16e-03

Pos logit contributions
ents+0.005
 ago+0.004
 opens+0.004
 slam+0.004
 widen+0.004
Neg logit contributions
P-0.002
oo-0.002
scar-0.002
lly-0.002
idd-0.002
Token amaz
Token ID40642
Feature activation-2.11e-01

Pos logit contributions
oo+0.013
P+0.013
lly+0.012
scar+0.012
 friend+0.011
Neg logit contributions
ents-0.019
 ago-0.018
 thirteen-0.017
 seventeen-0.016
 distances-0.015
Tokenement
Token ID972
Feature activation-9.67e-02

Pos logit contributions
ents+0.344
 ago+0.299
 opens+0.298
 slam+0.286
 widen+0.283
Neg logit contributions
P-0.389
oo-0.374
scar-0.373
idd-0.358
lly-0.352
Token,
Token ID11
Feature activation-4.35e-03

Pos logit contributions
ents+0.168
 ago+0.149
 opens+0.145
 gasp+0.140
 amaz+0.137
Neg logit contributions
P-0.180
scar-0.166
idd-0.159
oo-0.158
w-0.150
Token he
Token ID339
Feature activation+8.42e-03

Pos logit contributions
ents+0.008
 opens+0.007
 ago+0.007
 slam+0.007
 amaz+0.007
Neg logit contributions
P-0.007
oo-0.007
scar-0.007
idd-0.007
uc-0.006
Token could
Token ID714
Feature activation+2.34e-02

Pos logit contributions
P+0.013
oo+0.013
scar+0.012
idd+0.012
uc+0.011
Neg logit contributions
ents-0.016
 opens-0.015
 ago-0.014
 amaz-0.014
 gasp-0.014
Token fit
Token ID4197
Feature activation-5.35e-03

Pos logit contributions
oo+0.043
lly+0.042
scar+0.040
P+0.039
ter+0.037
Neg logit contributions
ents-0.036
 ago-0.032
 thirteen-0.031
 opens-0.029
 seventeen-0.029
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
P+0.071
oo+0.068
scar+0.067
idd+0.064
lly+0.062
Neg logit contributions
ents-0.110
 ago-0.098
 opens-0.098
 slam-0.096
 amaz-0.096
Token
Token ID198
Feature activation+3.85e-03

Pos logit contributions
P+0.029
oo+0.028
scar+0.027
idd+0.027
lly+0.026
Neg logit contributions
ents-0.024
 opens-0.021
 ago-0.020
 widen-0.020
 slam-0.020
Token
Token ID198
Feature activation+3.82e-03

Pos logit contributions
P+0.006
scar+0.006
oo+0.006
idd+0.006
ter+0.005
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 amaz-0.006
TokenTo
Token ID2514
Feature activation+2.87e-03

Pos logit contributions
oo+0.006
scar+0.006
P+0.006
idd+0.006
uc+0.006
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 amaz-0.006
Token her
Token ID607
Feature activation-5.93e-03

Pos logit contributions
P+0.005
scar+0.004
oo+0.004
idd+0.004
lly+0.004
Neg logit contributions
ents-0.005
 opens-0.005
 ago-0.005
 slam-0.005
 amaz-0.005
Token amaz
Token ID40642
Feature activation-2.06e-01

Pos logit contributions
ents+0.012
 ago+0.011
 widen+0.011
 opens+0.010
 thirteen+0.010
Neg logit contributions
P-0.008
oo-0.008
scar-0.008
lly-0.008
idd-0.007
Tokenement
Token ID972
Feature activation-6.94e-02

Pos logit contributions
ents+0.362
 ago+0.318
 slam+0.316
 opens+0.314
 widen+0.307
Neg logit contributions
P-0.351
oo-0.342
scar-0.331
lly-0.322
idd-0.322
Token,
Token ID11
Feature activation+1.41e-02

Pos logit contributions
ents+0.141
 opens+0.126
 ago+0.124
 slam+0.123
 widen+0.123
Neg logit contributions
oo-0.095
P-0.094
scar-0.091
lly-0.087
idd-0.086
Token the
Token ID262
Feature activation+2.33e-02

Pos logit contributions
P+0.023
oo+0.022
scar+0.022
idd+0.021
lly+0.020
Neg logit contributions
ents-0.026
 opens-0.023
 ago-0.023
 slam-0.022
 amaz-0.022
Token clothes
Token ID8242
Feature activation+1.08e-02

Pos logit contributions
P+0.043
oo+0.042
scar+0.041
idd+0.039
w+0.038
Neg logit contributions
ents-0.036
 ago-0.034
 opens-0.033
 amaz-0.032
 slam-0.032
Token came
Token ID1625
Feature activation+5.97e-04

Pos logit contributions
P+0.016
oo+0.015
scar+0.015
lly+0.014
idd+0.014
Neg logit contributions
ents-0.021
 ago-0.019
 opens-0.019
 slam-0.018
 widen-0.018
Token door
Token ID3420
Feature activation-4.92e-02

Pos logit contributions
P+0.045
lly+0.044
idd+0.043
ter+0.040
in+0.040
Neg logit contributions
ents-0.044
 distances-0.037
 ago-0.036
 widen-0.036
 seventeen-0.034
Token,
Token ID11
Feature activation+8.38e-03

Pos logit contributions
ents+0.096
 distances+0.095
 thirteen+0.094
 seventeen+0.094
 ago+0.092
Neg logit contributions
oo-0.049
--0.048
ing-0.048
lly-0.047
ina-0.044
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
P+0.012
lly+0.012
oo+0.011
idd+0.011
scar+0.011
Neg logit contributions
ents-0.015
 ago-0.014
 widen-0.013
 slam-0.013
 distances-0.013
Token to
Token ID284
Feature activation+1.29e-02

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.027
 ago-0.024
 opens-0.024
 slam-0.023
 widen-0.023
Token her
Token ID607
Feature activation-5.38e-03

Pos logit contributions
lly+0.015
oo+0.014
P+0.014
ter+0.013
scar+0.013
Neg logit contributions
ents-0.029
umbs-0.025
 ago-0.025
 widen-0.024
 seventeen-0.024
Token amaz
Token ID40642
Feature activation-1.95e-01

Pos logit contributions
ents+0.008
 ago+0.007
 widen+0.006
 opens+0.006
 thirteen+0.006
Neg logit contributions
P-0.011
oo-0.010
lly-0.010
scar-0.010
idd-0.010
Tokenement
Token ID972
Feature activation-7.45e-02

Pos logit contributions
ents+0.315
 ago+0.274
 opens+0.273
 slam+0.262
 widen+0.260
Neg logit contributions
P-0.360
oo-0.347
scar-0.345
idd-0.332
lly-0.326
Token,
Token ID11
Feature activation+3.71e-03

Pos logit contributions
ents+0.153
 ago+0.138
 opens+0.137
 slam+0.133
 widen+0.132
Neg logit contributions
P-0.104
oo-0.099
scar-0.099
idd-0.094
lly-0.091
Token she
Token ID673
Feature activation-2.39e-04

Pos logit contributions
P+0.005
oo+0.005
lly+0.005
scar+0.005
idd+0.005
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 widen-0.006
 slam-0.006
Token could
Token ID714
Feature activation+1.55e-02

Pos logit contributions
ents+0.001
 ago+0.000
 slam+0.000
 widen+0.000
 opens+0.000
Neg logit contributions
P-0.000
lly-0.000
scar-0.000
oo-0.000
idd-0.000
Token see
Token ID766
Feature activation-5.33e-03

Pos logit contributions
lly+0.027
oo+0.026
P+0.024
scar+0.023
ter+0.023
Neg logit contributions
ents-0.025
 ago-0.022
 thirteen-0.022
 seventeen-0.021
 distances-0.020
Tokenmy
Token ID1820
Feature activation-3.69e-02

Pos logit contributions
ents+0.015
 opens+0.014
 widen+0.014
 slam+0.014
 amaz+0.014
Neg logit contributions
P-0.010
scar-0.010
oo-0.010
lly-0.009
idd-0.009
Token had
Token ID550
Feature activation-4.52e-02

Pos logit contributions
ents+0.073
 opens+0.065
 ago+0.065
 slam+0.063
 widen+0.063
Neg logit contributions
P-0.054
oo-0.052
scar-0.052
idd-0.049
lly-0.048
Token told
Token ID1297
Feature activation-3.13e-02

Pos logit contributions
ents+0.083
 ago+0.074
 opens+0.073
 widen+0.068
 slam+0.068
Neg logit contributions
P-0.075
scar-0.073
oo-0.072
idd-0.069
lly-0.066
Token him
Token ID683
Feature activation-1.84e-02

Pos logit contributions
ents+0.080
 ago+0.073
 opens+0.073
 slam+0.071
 widen+0.071
Neg logit contributions
P-0.028
oo-0.026
scar-0.026
idd-0.024
lly-0.023
Token long
Token ID890
Feature activation-1.09e-01

Pos logit contributions
ents+0.034
 ago+0.031
 opens+0.031
 slam+0.029
 widen+0.029
Neg logit contributions
P-0.030
scar-0.028
oo-0.028
idd-0.027
lly-0.026
Token ago
Token ID2084
Feature activation-1.93e-01

Pos logit contributions
ents+0.180
 opens+0.151
 widen+0.150
 slam+0.148
 ago+0.147
Neg logit contributions
P-0.192
oo-0.192
scar-0.189
idd-0.179
lly-0.178
Token.
Token ID13
Feature activation-3.40e-03

Pos logit contributions
ents+0.396
 ago+0.355
 opens+0.355
 slam+0.343
 widen+0.341
Neg logit contributions
P-0.272
oo-0.258
scar-0.257
idd-0.244
lly-0.238
Token The
Token ID383
Feature activation-5.03e-03

Pos logit contributions
ents+0.005
 ago+0.005
 opens+0.005
 slam+0.004
 widen+0.004
Neg logit contributions
P-0.006
oo-0.006
scar-0.006
idd-0.006
lly-0.006
Token bird
Token ID6512
Feature activation+4.45e-03

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.008
 widen+0.008
Neg logit contributions
P-0.008
oo-0.008
scar-0.008
idd-0.008
lly-0.007
Token spread
Token ID4104
Feature activation-8.77e-03

Pos logit contributions
P+0.005
oo+0.005
lly+0.005
scar+0.005
idd+0.005
Neg logit contributions
ents-0.010
 ago-0.009
 slam-0.009
 widen-0.009
 opens-0.009
Token out
Token ID503
Feature activation-1.59e-02

Pos logit contributions
ents+0.018
 ago+0.016
 opens+0.016
 slam+0.015
 widen+0.015
Neg logit contributions
P-0.013
oo-0.012
scar-0.012
idd-0.011
lly-0.011
TokenTo
Token ID2514
Feature activation-3.01e-04

Pos logit contributions
P+0.006
oo+0.006
scar+0.006
idd+0.006
lly+0.005
Neg logit contributions
ents-0.006
 ago-0.005
 opens-0.005
 slam-0.005
 widen-0.005
Token the
Token ID262
Feature activation+6.25e-03

Pos logit contributions
ents+0.001
 ago+0.001
 opens+0.001
 slam+0.001
 widen+0.001
Neg logit contributions
P-0.000
oo-0.000
scar-0.000
idd-0.000
lly-0.000
Token little
Token ID1310
Feature activation-6.26e-02

Pos logit contributions
P+0.008
lly+0.007
ter+0.007
 friend+0.007
in+0.007
Neg logit contributions
ents-0.014
 ago-0.012
 distances-0.011
 seventeen-0.011
 slam-0.011
Token boy
Token ID2933
Feature activation-8.32e-03

Pos logit contributions
ents+0.148
 ago+0.131
 widen+0.128
 slam+0.127
 opens+0.126
Neg logit contributions
P-0.068
idd-0.061
ter-0.060
oo-0.059
scar-0.059
Token's
Token ID338
Feature activation-3.90e-03

Pos logit contributions
ents+0.016
 ago+0.014
 widen+0.014
 thirteen+0.014
 seventeen+0.014
Neg logit contributions
oo-0.012
lly-0.011
P-0.011
scar-0.011
idd-0.011
Token amaz
Token ID40642
Feature activation-1.88e-01

Pos logit contributions
ents+0.008
 thirteen+0.007
 seventeen+0.007
 ago+0.006
 distances+0.006
Neg logit contributions
oo-0.005
 friend-0.005
 friends-0.005
lly-0.005
by-0.005
Tokenement
Token ID972
Feature activation-6.79e-02

Pos logit contributions
ents+0.317
 ago+0.277
 opens+0.274
 slam+0.268
 widen+0.263
Neg logit contributions
P-0.334
oo-0.324
scar-0.319
idd-0.307
lly-0.305
Token,
Token ID11
Feature activation-6.03e-03

Pos logit contributions
ents+0.131
 ago+0.117
 opens+0.116
 slam+0.112
 widen+0.111
Neg logit contributions
P-0.106
scar-0.100
oo-0.099
idd-0.095
lly-0.092
Token underneath
Token ID14638
Feature activation+1.36e-02

Pos logit contributions
ents+0.011
 opens+0.010
 ago+0.010
 amaz+0.010
 slam+0.010
Neg logit contributions
P-0.010
oo-0.009
scar-0.009
idd-0.009
lly-0.009
Token the
Token ID262
Feature activation+3.20e-02

Pos logit contributions
P+0.021
oo+0.020
scar+0.020
idd+0.019
lly+0.019
Neg logit contributions
ents-0.026
 ago-0.024
 opens-0.023
 slam-0.023
 widen-0.022
Token tree
Token ID5509
Feature activation-9.01e-03

Pos logit contributions
ter+0.044
lly+0.043
P+0.041
in+0.040
B+0.040
Neg logit contributions
ents-0.059
 ago-0.052
 seventeen-0.049
umbs-0.048
 thirteen-0.046
Token people
Token ID661
Feature activation-6.55e-02

Pos logit contributions
ents+0.017
 ago+0.015
 opens+0.015
 slam+0.015
 widen+0.015
Neg logit contributions
P-0.012
oo-0.011
scar-0.011
idd-0.011
lly-0.011
Token did
Token ID750
Feature activation-4.01e-02

Pos logit contributions
ents+0.109
 amaz+0.097
 slam+0.096
 widen+0.091
 ago+0.090
Neg logit contributions
oo-0.111
P-0.111
lly-0.106
w-0.106
scar-0.102
Token a
Token ID257
Feature activation-3.33e-02

Pos logit contributions
ents+0.085
 amaz+0.078
 slam+0.078
 opens+0.075
 widen+0.074
Neg logit contributions
oo-0.051
P-0.049
lly-0.046
scar-0.043
w-0.043
Token long
Token ID890
Feature activation-9.49e-02

Pos logit contributions
ents+0.054
 opens+0.051
 ago+0.050
 amaz+0.048
 widen+0.048
Neg logit contributions
scar-0.058
P-0.057
oo-0.056
idd-0.055
uc-0.052
Token time
Token ID640
Feature activation-3.41e-02

Pos logit contributions
ents+0.193
 widen+0.162
 slam+0.160
 opens+0.157
 amaz+0.154
Neg logit contributions
oo-0.145
P-0.144
w-0.132
scar-0.128
lly-0.124
Token ago
Token ID2084
Feature activation-1.85e-01

Pos logit contributions
ents+0.062
 slam+0.058
 amaz+0.057
 widen+0.057
 opens+0.055
Neg logit contributions
oo-0.050
P-0.047
idd-0.045
w-0.044
lly-0.043
Token.
Token ID13
Feature activation-9.24e-03

Pos logit contributions
ents+0.394
 widen+0.358
 slam+0.353
 amaz+0.351
 opens+0.350
Neg logit contributions
P-0.238
oo-0.235
scar-0.216
idd-0.214
lly-0.209
Token They
Token ID1119
Feature activation-1.27e-02

Pos logit contributions
ents+0.016
 opens+0.014
 gasp+0.014
 widen+0.013
 amaz+0.013
Neg logit contributions
P-0.016
oo-0.015
idd-0.015
scar-0.015
lly-0.014
Token lived
Token ID5615
Feature activation-2.71e-02

Pos logit contributions
ents+0.022
 ago+0.019
 slam+0.019
 amaz+0.018
 distances+0.018
Neg logit contributions
lly-0.019
P-0.019
oo-0.018
w-0.018
ter-0.017
Token in
Token ID287
Feature activation-1.65e-02

Pos logit contributions
ents+0.053
 ago+0.049
 opens+0.048
 slam+0.045
 gasp+0.045
Neg logit contributions
scar-0.041
P-0.040
oo-0.039
idd-0.036
lly-0.036
Token castles
Token ID49203
Feature activation-2.16e-02

Pos logit contributions
ents+0.041
 widen+0.036
 opens+0.035
 slam+0.035
 thirteen+0.034
Neg logit contributions
P-0.016
oo-0.015
idd-0.013
scar-0.013
lly-0.013
Token The
Token ID383
Feature activation+2.15e-03

Pos logit contributions
P+0.021
oo+0.021
scar+0.020
idd+0.020
lly+0.019
Neg logit contributions
ents-0.016
 ago-0.014
 opens-0.014
 widen-0.013
 slam-0.013
Token boy
Token ID2933
Feature activation-1.82e-02

Pos logit contributions
P+0.003
oo+0.003
scar+0.003
lly+0.003
idd+0.003
Neg logit contributions
ents-0.004
 ago-0.004
 opens-0.004
 slam-0.004
 widen-0.004
Token watched
Token ID7342
Feature activation+2.13e-02

Pos logit contributions
ents+0.038
 opens+0.034
 ago+0.033
 slam+0.032
 gasp+0.032
Neg logit contributions
P-0.026
scar-0.025
oo-0.024
idd-0.023
w-0.022
Token her
Token ID607
Feature activation-1.61e-02

Pos logit contributions
P+0.032
oo+0.032
scar+0.030
idd+0.029
ter+0.028
Neg logit contributions
ents-0.041
 ago-0.037
 opens-0.036
 slam-0.036
 amaz-0.035
Token in
Token ID287
Feature activation-1.18e-02

Pos logit contributions
ents+0.032
 ago+0.030
 thirteen+0.028
 opens+0.027
 seventeen+0.027
Neg logit contributions
oo-0.024
P-0.022
ter-0.022
lly-0.021
scar-0.020
Token amaz
Token ID40642
Feature activation-1.83e-01

Pos logit contributions
ents+0.022
bell+0.015
 thirteen+0.015
 seventeen+0.015
 opens+0.015
Neg logit contributions
P-0.019
oo-0.018
mate-0.018
 Snow-0.017
ee-0.017
Tokenement
Token ID972
Feature activation-5.92e-02

Pos logit contributions
ents+0.323
 ago+0.286
 slam+0.281
 opens+0.279
 widen+0.273
Neg logit contributions
P-0.315
oo-0.307
scar-0.296
lly-0.286
idd-0.284
Token.
Token ID13
Feature activation+5.62e-03

Pos logit contributions
ents+0.127
 opens+0.115
 ago+0.113
 slam+0.113
 widen+0.111
Neg logit contributions
oo-0.076
P-0.072
scar-0.069
lly-0.066
idd-0.065
Token
Token ID198
Feature activation-8.46e-04

Pos logit contributions
P+0.012
oo+0.012
scar+0.012
idd+0.011
lly+0.011
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 widen-0.006
Token
Token ID198
Feature activation+1.63e-04

Pos logit contributions
ents+0.002
 ago+0.001
 opens+0.001
 slam+0.001
 widen+0.001
Neg logit contributions
P-0.001
oo-0.001
scar-0.001
idd-0.001
lly-0.001
TokenThe
Token ID464
Feature activation+1.72e-02

Pos logit contributions
P+0.000
oo+0.000
scar+0.000
idd+0.000
lly+0.000
Neg logit contributions
ents-0.000
 opens-0.000
 ago-0.000
 slam-0.000
 widen-0.000
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
ents+0.015
 ago+0.014
 opens+0.014
 gasp+0.013
 amaz+0.013
Neg logit contributions
P-0.013
scar-0.012
oo-0.011
idd-0.011
lly-0.011
Token Amy
Token ID14235
Feature activation-1.45e-02

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
B+0.019
Neg logit contributions
ents-0.025
 widen-0.022
 opens-0.021
 ago-0.021
 slam-0.021
Token watched
Token ID7342
Feature activation+1.97e-02

Pos logit contributions
ents+0.027
 opens+0.024
 ago+0.024
 slam+0.023
 widen+0.023
Neg logit contributions
P-0.023
oo-0.022
scar-0.022
idd-0.021
lly-0.020
Token them
Token ID606
Feature activation+1.42e-02

Pos logit contributions
P+0.030
scar+0.028
oo+0.027
idd+0.027
lly+0.026
Neg logit contributions
ents-0.039
 ago-0.035
 opens-0.035
 widen-0.034
 slam-0.034
Token with
Token ID351
Feature activation-1.77e-02

Pos logit contributions
oo+0.021
ter+0.019
lly+0.018
P+0.018
uc+0.017
Neg logit contributions
ents-0.027
 ago-0.026
 opens-0.024
 thirteen-0.023
 slam-0.023
Token amaz
Token ID40642
Feature activation-1.83e-01

Pos logit contributions
ents+0.037
 ago+0.032
 opens+0.032
 slam+0.032
 widen+0.030
Neg logit contributions
P-0.024
oo-0.022
scar-0.022
lly-0.021
idd-0.020
Tokenement
Token ID972
Feature activation-5.91e-02

Pos logit contributions
ents+0.276
 opens+0.239
 ago+0.237
 amaz+0.225
 widen+0.225
Neg logit contributions
P-0.361
scar-0.348
oo-0.340
idd-0.333
lly-0.322
Token.
Token ID13
Feature activation-3.31e-03

Pos logit contributions
ents+0.114
 opens+0.102
 ago+0.102
 slam+0.099
 widen+0.098
Neg logit contributions
P-0.088
oo-0.086
scar-0.084
idd-0.080
lly-0.079
Token She
Token ID1375
Feature activation-8.19e-03

Pos logit contributions
ents+0.005
 opens+0.005
 ago+0.004
 widen+0.004
 slam+0.004
Neg logit contributions
P-0.006
oo-0.006
scar-0.006
idd-0.006
lly-0.005
Token smiled
Token ID13541
Feature activation-3.40e-03

Pos logit contributions
ents+0.016
 opens+0.014
 ago+0.014
 amaz+0.014
 slam+0.014
Neg logit contributions
P-0.012
oo-0.012
scar-0.012
idd-0.011
lly-0.010
Token,
Token ID11
Feature activation+8.79e-03

Pos logit contributions
ents+0.005
 opens+0.004
 ago+0.004
 amaz+0.004
 widen+0.004
Neg logit contributions
P-0.007
scar-0.007
oo-0.007
idd-0.006
lly-0.006
Token Daisy
Token ID40355
Feature activation-2.63e-02

Pos logit contributions
P+0.011
oo+0.010
scar+0.010
idd+0.010
lly+0.010
Neg logit contributions
ents-0.010
 opens-0.009
 ago-0.008
 widen-0.008
 slam-0.008
Token had
Token ID550
Feature activation-1.54e-02

Pos logit contributions
ents+0.054
 ago+0.049
 opens+0.048
 slam+0.047
 widen+0.047
Neg logit contributions
P-0.036
oo-0.035
scar-0.034
idd-0.033
lly-0.032
Token planted
Token ID18969
Feature activation-1.18e-02

Pos logit contributions
ents+0.031
 slam+0.029
 ago+0.029
 opens+0.028
 widen+0.028
Neg logit contributions
P-0.022
lly-0.019
scar-0.019
oo-0.019
idd-0.018
Token it
Token ID340
Feature activation+1.45e-03

Pos logit contributions
ents+0.021
 opens+0.019
 ago+0.018
 amaz+0.018
 widen+0.018
Neg logit contributions
scar-0.019
P-0.019
oo-0.019
idd-0.018
lly-0.017
Token weeks
Token ID2745
Feature activation-5.99e-02

Pos logit contributions
P+0.002
oo+0.002
lly+0.001
scar+0.001
ter+0.001
Neg logit contributions
ents-0.003
 ago-0.003
 slam-0.003
 widen-0.003
 amaz-0.003
Token ago
Token ID2084
Feature activation-1.78e-01

Pos logit contributions
ents+0.106
 widen+0.098
 slam+0.097
 amaz+0.097
 opens+0.097
Neg logit contributions
oo-0.095
P-0.092
scar-0.087
idd-0.087
lly-0.086
Token,
Token ID11
Feature activation-3.40e-03

Pos logit contributions
ents+0.310
 opens+0.271
 ago+0.269
 slam+0.263
 widen+0.262
Neg logit contributions
P-0.304
oo-0.293
scar-0.290
idd-0.279
lly-0.274
Token but
Token ID475
Feature activation+1.94e-03

Pos logit contributions
ents+0.006
 widen+0.005
 opens+0.005
 slam+0.005
 ago+0.005
Neg logit contributions
P-0.005
oo-0.005
lly-0.005
scar-0.005
ter-0.005
Token now
Token ID783
Feature activation+7.28e-04

Pos logit contributions
P+0.003
scar+0.003
oo+0.003
lly+0.002
ter+0.002
Neg logit contributions
ents-0.004
 opens-0.004
 ago-0.004
 widen-0.003
 slam-0.003
Token she
Token ID673
Feature activation-2.55e-02

Pos logit contributions
P+0.001
oo+0.001
scar+0.001
lly+0.001
idd+0.001
Neg logit contributions
ents-0.001
 ago-0.001
 opens-0.001
 slam-0.001
 widen-0.001
Token could
Token ID714
Feature activation+3.95e-03

Pos logit contributions
ents+0.052
 opens+0.046
 ago+0.046
 slam+0.043
 amaz+0.043
Neg logit contributions
P-0.037
oo-0.036
scar-0.035
idd-0.033
w-0.031
Token bird
Token ID6512
Feature activation+1.56e-03

Pos logit contributions
P+0.011
oo+0.010
ter+0.010
idd+0.010
lly+0.010
Neg logit contributions
ents-0.014
 ago-0.012
 opens-0.011
 slam-0.011
 widen-0.011
Token looked
Token ID3114
Feature activation+1.08e-02

Pos logit contributions
P+0.002
oo+0.002
lly+0.002
scar+0.002
idd+0.002
Neg logit contributions
ents-0.004
 ago-0.003
 slam-0.003
 opens-0.003
 widen-0.003
Token at
Token ID379
Feature activation+1.03e-02

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
lly+0.013
idd+0.013
Neg logit contributions
ents-0.022
 ago-0.020
 slam-0.019
 opens-0.019
 distances-0.018
Token her
Token ID607
Feature activation+8.82e-03

Pos logit contributions
P+0.015
oo+0.015
scar+0.014
idd+0.014
lly+0.014
Neg logit contributions
ents-0.021
 ago-0.018
 opens-0.018
 slam-0.018
 widen-0.018
Token in
Token ID287
Feature activation+7.92e-03

Pos logit contributions
oo+0.014
lly+0.013
P+0.013
scar+0.013
ter+0.013
Neg logit contributions
ents-0.016
 ago-0.015
 opens-0.014
 thirteen-0.014
 slam-0.014
Token amaz
Token ID40642
Feature activation-1.77e-01

Pos logit contributions
P+0.014
oo+0.013
scar+0.012
B+0.012
lly+0.012
Neg logit contributions
ents-0.014
 slam-0.010
 ago-0.010
 opens-0.009
 thirteen-0.009
Tokenement
Token ID972
Feature activation-5.11e-02

Pos logit contributions
ents+0.338
 slam+0.309
 ago+0.302
 opens+0.296
 widen+0.293
Neg logit contributions
P-0.267
oo-0.264
lly-0.246
scar-0.244
idd-0.241
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
ents+0.103
 slam+0.094
 opens+0.094
 thirteen+0.091
 widen+0.091
Neg logit contributions
oo-0.069
P-0.061
lly-0.061
ter-0.061
scar-0.060
Token and
Token ID290
Feature activation+1.46e-02

Pos logit contributions
oo+0.024
P+0.024
ter+0.023
scar+0.023
lly+0.023
Neg logit contributions
ents-0.027
 ago-0.025
 opens-0.024
 slam-0.024
 widen-0.023
Token said
Token ID531
Feature activation+2.70e-02

Pos logit contributions
P+0.023
oo+0.022
scar+0.022
lly+0.021
idd+0.021
Neg logit contributions
ents-0.027
 ago-0.025
 opens-0.024
 slam-0.024
 widen-0.024
Token:
Token ID25
Feature activation+3.21e-02

Pos logit contributions
scar+0.050
P+0.050
idd+0.046
oo+0.046
w+0.044
Neg logit contributions
ents-0.044
 opens-0.040
 ago-0.039
 amaz-0.037
 gasp-0.037
Token he
Token ID339
Feature activation-1.46e-02

Pos logit contributions
ents+0.023
 ago+0.021
 opens+0.021
 slam+0.020
 widen+0.020
Neg logit contributions
P-0.018
oo-0.017
scar-0.017
idd-0.016
lly-0.016
Token had
Token ID550
Feature activation-2.68e-02

Pos logit contributions
ents+0.025
 ago+0.023
 opens+0.023
 slam+0.021
 gasp+0.021
Neg logit contributions
P-0.025
oo-0.024
scar-0.024
idd-0.023
uc-0.022
Token seen
Token ID1775
Feature activation-1.30e-02

Pos logit contributions
ents+0.049
 ago+0.045
 opens+0.043
 slam+0.041
 gasp+0.041
Neg logit contributions
P-0.044
scar-0.043
oo-0.042
idd-0.040
lly-0.038
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
ents+0.024
 ago+0.022
 opens+0.021
 slam+0.020
 widen+0.020
Neg logit contributions
P-0.021
scar-0.020
oo-0.020
idd-0.019
lly-0.019
Token year
Token ID614
Feature activation-3.99e-02

Pos logit contributions
ents+0.024
 ago+0.022
 opens+0.022
 widen+0.021
 slam+0.021
Neg logit contributions
P-0.024
scar-0.023
oo-0.023
idd-0.022
lly-0.021
Token ago
Token ID2084
Feature activation-1.76e-01

Pos logit contributions
ents+0.079
 slam+0.078
 horror+0.076
 amaz+0.076
 opens+0.076
Neg logit contributions
--0.042
us-0.040
lly-0.040
in-0.038
oo-0.037
Token.
Token ID13
Feature activation+8.29e-03

Pos logit contributions
ents+0.369
 opens+0.331
 ago+0.331
 slam+0.322
 widen+0.320
Neg logit contributions
P-0.237
oo-0.225
scar-0.224
idd-0.212
lly-0.207
Token He
Token ID679
Feature activation-8.29e-04

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
idd+0.014
lly+0.013
Neg logit contributions
ents-0.014
 opens-0.012
 ago-0.012
 slam-0.012
 widen-0.012
Token went
Token ID1816
Feature activation-1.98e-02

Pos logit contributions
ents+0.002
 ago+0.002
 opens+0.001
 slam+0.001
 widen+0.001
Neg logit contributions
P-0.001
oo-0.001
scar-0.001
idd-0.001
lly-0.001
Token home
Token ID1363
Feature activation+3.89e-02

Pos logit contributions
ents+0.038
 opens+0.034
 ago+0.034
 slam+0.033
 widen+0.032
Neg logit contributions
P-0.031
scar-0.030
oo-0.030
idd-0.028
lly-0.027
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
oo+0.055
lly+0.051
P+0.050
scar+0.050
idd+0.050
Neg logit contributions
ents-0.075
 thirteen-0.070
 seventeen-0.068
 widen-0.067
 opens-0.066
Token her
Token ID607
Feature activation-3.43e-02

Pos logit contributions
ents+0.073
 ago+0.067
 slam+0.064
 widen+0.064
 amaz+0.063
Neg logit contributions
oo-0.036
P-0.036
scar-0.031
lly-0.030
B-0.030
Token that
Token ID326
Feature activation-7.17e-03

Pos logit contributions
ents+0.069
 ago+0.061
 opens+0.061
 slam+0.060
 widen+0.060
Neg logit contributions
P-0.049
oo-0.048
scar-0.047
idd-0.044
lly-0.044
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
ents+0.015
 ago+0.013
 opens+0.013
 widen+0.013
 slam+0.013
Neg logit contributions
P-0.010
oo-0.010
scar-0.009
idd-0.009
lly-0.009
Token long
Token ID890
Feature activation-7.99e-02

Pos logit contributions
ents+0.017
 opens+0.015
 ago+0.014
 slam+0.014
 amaz+0.014
Neg logit contributions
P-0.020
oo-0.019
scar-0.019
idd-0.018
lly-0.017
Token time
Token ID640
Feature activation-2.49e-02

Pos logit contributions
ents+0.144
 opens+0.122
 widen+0.121
 ago+0.121
 slam+0.119
Neg logit contributions
P-0.133
oo-0.132
scar-0.129
lly-0.122
idd-0.120
Token ago
Token ID2084
Feature activation-1.75e-01

Pos logit contributions
ents+0.049
 widen+0.044
 opens+0.043
 slam+0.043
 gasp+0.042
Neg logit contributions
oo-0.035
P-0.034
idd-0.033
scar-0.033
lly-0.032
Token,
Token ID11
Feature activation-1.82e-03

Pos logit contributions
ents+0.322
 ago+0.300
 opens+0.289
 slam+0.275
 gasp+0.272
Neg logit contributions
P-0.287
scar-0.269
oo-0.264
idd-0.255
lly-0.247
Token someone
Token ID2130
Feature activation-5.89e-02

Pos logit contributions
ents+0.004
 widen+0.003
 gasp+0.003
 slam+0.003
 ago+0.003
Neg logit contributions
P-0.003
oo-0.002
ter-0.002
we-0.002
lly-0.002
Token made
Token ID925
Feature activation-1.12e-03

Pos logit contributions
ents+0.115
 opens+0.106
 ago+0.101
 slam+0.098
 gasp+0.096
Neg logit contributions
P-0.092
oo-0.087
scar-0.085
idd-0.082
uc-0.076
Token the
Token ID262
Feature activation-2.91e-02

Pos logit contributions
ents+0.002
 opens+0.002
 ago+0.002
 gasp+0.002
 slam+0.002
Neg logit contributions
P-0.002
scar-0.002
oo-0.002
idd-0.002
lly-0.002
Token mark
Token ID1317
Feature activation-1.40e-02

Pos logit contributions
ents+0.055
 ago+0.048
 opens+0.046
 widen+0.046
 slam+0.045
Neg logit contributions
P-0.047
oo-0.044
scar-0.043
lly-0.043
w-0.042
Token the
Token ID262
Feature activation+8.01e-04

Pos logit contributions
idd+0.079
P+0.078
scar+0.076
w+0.069
uc+0.068
Neg logit contributions
ents-0.073
 opens-0.067
 widen-0.064
 slam-0.064
 gasp-0.063
Token pigeon
Token ID46387
Feature activation+7.04e-03

Pos logit contributions
P+0.001
scar+0.001
oo+0.001
idd+0.001
lly+0.001
Neg logit contributions
ents-0.001
 ago-0.001
 opens-0.001
 amaz-0.001
 slam-0.001
Token all
Token ID477
Feature activation+2.32e-03

Pos logit contributions
oo+0.009
P+0.009
lly+0.009
ter+0.009
scar+0.009
Neg logit contributions
ents-0.014
 ago-0.013
 opens-0.013
 thirteen-0.012
 widen-0.012
Token those
Token ID883
Feature activation+2.63e-02

Pos logit contributions
P+0.004
oo+0.003
scar+0.003
idd+0.003
lly+0.003
Neg logit contributions
ents-0.005
 ago-0.004
 opens-0.004
 slam-0.004
 widen-0.004
Token weeks
Token ID2745
Feature activation-6.83e-02

Pos logit contributions
P+0.038
oo+0.036
scar+0.035
lly+0.034
idd+0.033
Neg logit contributions
ents-0.053
 ago-0.048
 opens-0.047
 slam-0.046
 widen-0.046
Token ago
Token ID2084
Feature activation-1.75e-01

Pos logit contributions
ents+0.135
 opens+0.120
 amaz+0.119
 slam+0.118
 widen+0.118
Neg logit contributions
P-0.097
oo-0.097
scar-0.092
lly-0.090
idd-0.089
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
ents+0.334
 ago+0.313
 opens+0.300
 slam+0.287
 gasp+0.283
Neg logit contributions
P-0.275
scar-0.258
oo-0.253
idd-0.244
lly-0.235
Token<|endoftext|>
Token ID50256
Feature activation-1.47e-02

Pos logit contributions
P+0.033
scar+0.032
oo+0.031
idd+0.031
w+0.030
Neg logit contributions
ents-0.030
 opens-0.027
 ago-0.026
 amaz-0.025
 slam-0.025
TokenMike
Token ID16073
Feature activation+3.72e-02

Pos logit contributions
 ago+0.025
ents+0.025
 opens+0.023
 distances+0.021
 widen+0.021
Neg logit contributions
oo-0.023
scar-0.023
uc-0.023
P-0.022
idd-0.021
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
P+0.060
oo+0.055
scar+0.055
idd+0.053
we+0.050
Neg logit contributions
ents-0.069
 ago-0.064
 opens-0.063
 gasp-0.057
 slam-0.057
Token his
Token ID465
Feature activation+1.57e-02

Pos logit contributions
ents+0.056
 opens+0.051
 ago+0.050
 slam+0.049
 gasp+0.049
Neg logit contributions
P-0.042
scar-0.040
oo-0.040
idd-0.039
uc-0.037
Token imagined
Token ID15758
Feature activation-1.44e-02

Pos logit contributions
ents+0.024
 ago+0.023
 slam+0.022
 opens+0.022
 widen+0.022
Neg logit contributions
P-0.016
oo-0.016
scar-0.015
lly-0.015
idd-0.015
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
ents+0.029
 opens+0.026
 ago+0.026
 slam+0.026
 widen+0.026
Neg logit contributions
P-0.020
oo-0.019
scar-0.019
lly-0.018
idd-0.018
Token tart
Token ID35842
Feature activation-1.78e-02

Pos logit contributions
ents+0.033
 ago+0.029
 slam+0.028
 opens+0.027
 distances+0.026
Neg logit contributions
P-0.027
scar-0.025
oo-0.024
lly-0.024
ter-0.024
Token was
Token ID373
Feature activation-1.04e-02

Pos logit contributions
ents+0.035
 ago+0.031
 slam+0.031
 amaz+0.030
 opens+0.029
Neg logit contributions
P-0.027
oo-0.026
scar-0.025
ter-0.025
lly-0.025
Token long
Token ID890
Feature activation-7.39e-02

Pos logit contributions
ents+0.019
 ago+0.017
 opens+0.017
 slam+0.017
 gasp+0.016
Neg logit contributions
P-0.016
oo-0.016
scar-0.015
lly-0.015
idd-0.014
Token ago
Token ID2084
Feature activation-1.75e-01

Pos logit contributions
ents+0.163
 slam+0.143
 gasp+0.142
 amaz+0.141
 widen+0.139
Neg logit contributions
P-0.082
ter-0.077
oo-0.077
idd-0.073
scar-0.072
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
ents+0.365
 opens+0.326
 widen+0.322
 slam+0.319
 amaz+0.316
Neg logit contributions
P-0.230
oo-0.228
scar-0.217
idd-0.211
lly-0.203
Token when
Token ID618
Feature activation+6.87e-03

Pos logit contributions
ents+0.028
 slam+0.026
 ago+0.025
 gasp+0.024
 distances+0.024
Neg logit contributions
P-0.022
ter-0.021
we-0.020
w-0.019
oo-0.019
Token she
Token ID673
Feature activation-1.03e-02

Pos logit contributions
P+0.012
scar+0.011
oo+0.011
idd+0.011
lly+0.010
Neg logit contributions
ents-0.012
 ago-0.011
 opens-0.011
 slam-0.010
 amaz-0.010
Token was
Token ID373
Feature activation+2.17e-03

Pos logit contributions
ents+0.021
 ago+0.019
 opens+0.019
 slam+0.018
 widen+0.018
Neg logit contributions
P-0.015
oo-0.014
scar-0.014
idd-0.013
lly-0.013
Token a
Token ID257
Feature activation+9.77e-03

Pos logit contributions
oo+0.003
P+0.003
lly+0.003
scar+0.003
idd+0.003
Neg logit contributions
ents-0.004
 opens-0.004
 ago-0.004
 slam-0.004
 amaz-0.004
Token,
Token ID11
Feature activation-2.29e-03

Pos logit contributions
ents+0.002
 opens+0.002
 ago+0.001
 gasp+0.001
 slam+0.001
Neg logit contributions
P-0.001
scar-0.001
oo-0.001
idd-0.001
lly-0.001
Token her
Token ID607
Feature activation-7.73e-03

Pos logit contributions
ents+0.004
 opens+0.004
 ago+0.004
 slam+0.004
 widen+0.004
Neg logit contributions
oo-0.003
P-0.003
scar-0.003
idd-0.003
ter-0.003
Token eyes
Token ID2951
Feature activation-7.23e-02

Pos logit contributions
ents+0.017
 ago+0.015
 opens+0.015
 widen+0.015
 slam+0.015
Neg logit contributions
P-0.010
oo-0.009
scar-0.009
idd-0.009
lly-0.009
Token wide
Token ID3094
Feature activation-9.11e-02

Pos logit contributions
ents+0.120
 ago+0.107
 opens+0.104
 slam+0.104
 amaz+0.097
Neg logit contributions
P-0.121
oo-0.121
lly-0.112
ter-0.111
idd-0.111
Token with
Token ID351
Feature activation+1.60e-03

Pos logit contributions
ents+0.199
 ago+0.181
 slam+0.179
 opens+0.175
 distances+0.169
Neg logit contributions
oo-0.103
P-0.098
ter-0.096
scar-0.091
idd-0.088
Token amaz
Token ID40642
Feature activation-1.74e-01

Pos logit contributions
P+0.003
oo+0.003
idd+0.002
lly+0.002
scar+0.002
Neg logit contributions
ents-0.003
 slam-0.002
 ago-0.002
 opens-0.002
 distances-0.002
Tokenement
Token ID972
Feature activation-4.64e-02

Pos logit contributions
ents+0.291
 ago+0.255
 opens+0.253
 slam+0.248
 widen+0.242
Neg logit contributions
P-0.311
oo-0.302
scar-0.296
idd-0.287
lly-0.282
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
ents+0.103
 opens+0.093
 ago+0.093
 slam+0.091
 widen+0.090
Neg logit contributions
P-0.056
oo-0.055
scar-0.053
idd-0.050
lly-0.049
Token
Token ID198
Feature activation+3.52e-03

Pos logit contributions
P+0.024
scar+0.023
oo+0.023
idd+0.022
w+0.022
Neg logit contributions
ents-0.021
 ago-0.019
 opens-0.018
 slam-0.017
 amaz-0.017
Token
Token ID198
Feature activation+3.32e-03

Pos logit contributions
P+0.006
scar+0.006
oo+0.006
idd+0.005
ter+0.005
Neg logit contributions
ents-0.006
 ago-0.005
 opens-0.005
 slam-0.005
 widen-0.005
Token"
Token ID1
Feature activation+6.27e-03

Pos logit contributions
P+0.005
oo+0.004
scar+0.004
idd+0.004
lly+0.004
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 widen-0.006
Token He
Token ID679
Feature activation-9.55e-03

Pos logit contributions
ents+0.007
 ago+0.006
 opens+0.006
 widen+0.005
 slam+0.005
Neg logit contributions
P-0.007
oo-0.007
scar-0.007
idd-0.007
lly-0.007
Token was
Token ID373
Feature activation+1.18e-04

Pos logit contributions
ents+0.016
 opens+0.015
 ago+0.014
 slam+0.014
 gasp+0.014
Neg logit contributions
P-0.017
oo-0.016
scar-0.016
idd-0.015
uc-0.015
Token here
Token ID994
Feature activation-2.43e-02

Pos logit contributions
P+0.000
oo+0.000
scar+0.000
idd+0.000
lly+0.000
Neg logit contributions
ents-0.000
 opens-0.000
 ago-0.000
 slam-0.000
 widen-0.000
Token a
Token ID257
Feature activation+9.83e-03

Pos logit contributions
ents+0.048
 opens+0.043
 ago+0.042
 slam+0.042
 widen+0.041
Neg logit contributions
P-0.035
oo-0.034
scar-0.033
idd-0.032
lly-0.031
Token minute
Token ID5664
Feature activation-3.68e-03

Pos logit contributions
P+0.017
scar+0.017
oo+0.017
idd+0.016
lly+0.015
Neg logit contributions
ents-0.016
 ago-0.015
 opens-0.015
 slam-0.014
 amaz-0.014
Token ago
Token ID2084
Feature activation-1.73e-01

Pos logit contributions
ents+0.008
 widen+0.007
 amaz+0.007
 slam+0.007
 opens+0.007
Neg logit contributions
oo-0.004
P-0.004
scar-0.004
idd-0.004
lly-0.004
Token.
Token ID13
Feature activation+5.23e-03

Pos logit contributions
ents+0.333
 ago+0.306
 opens+0.298
 slam+0.285
 gasp+0.282
Neg logit contributions
P-0.267
scar-0.253
oo-0.250
idd-0.238
lly-0.233
Token He
Token ID679
Feature activation-3.98e-03

Pos logit contributions
P+0.009
oo+0.009
scar+0.008
idd+0.008
lly+0.008
Neg logit contributions
ents-0.009
 widen-0.007
 opens-0.007
 ago-0.007
 gasp-0.007
Token was
Token ID373
Feature activation-9.27e-04

Pos logit contributions
ents+0.008
 opens+0.007
 ago+0.007
 slam+0.007
 widen+0.007
Neg logit contributions
P-0.006
oo-0.006
scar-0.006
idd-0.005
lly-0.005
Token chasing
Token ID20023
Feature activation+1.72e-02

Pos logit contributions
ents+0.002
 opens+0.002
 ago+0.002
 slam+0.002
 widen+0.002
Neg logit contributions
P-0.001
oo-0.001
scar-0.001
idd-0.001
lly-0.001
Token the
Token ID262
Feature activation+4.93e-03

Pos logit contributions
P+0.025
scar+0.024
oo+0.023
idd+0.023
lly+0.022
Neg logit contributions
ents-0.035
 opens-0.031
 ago-0.031
 slam-0.030
 gasp-0.030
Token,
Token ID11
Feature activation-2.79e-02

Pos logit contributions
ents+0.012
 ago+0.011
 opens+0.011
 widen+0.011
 slam+0.011
Neg logit contributions
oo-0.007
P-0.007
scar-0.007
idd-0.006
lly-0.006
Token her
Token ID607
Feature activation-2.06e-02

Pos logit contributions
ents+0.048
 ago+0.043
 opens+0.042
 slam+0.041
 widen+0.040
Neg logit contributions
P-0.048
oo-0.046
scar-0.046
idd-0.044
lly-0.043
Token eyes
Token ID2951
Feature activation-7.83e-02

Pos logit contributions
ents+0.038
 ago+0.035
 opens+0.034
 amaz+0.034
 gasp+0.034
Neg logit contributions
P-0.034
scar-0.032
oo-0.031
idd-0.030
lly-0.029
Token wide
Token ID3094
Feature activation-9.25e-02

Pos logit contributions
ents+0.135
 ago+0.121
 opens+0.118
 slam+0.114
 amaz+0.110
Neg logit contributions
P-0.132
oo-0.130
scar-0.123
lly-0.121
idd-0.119
Token with
Token ID351
Feature activation-2.88e-03

Pos logit contributions
ents+0.210
 ago+0.189
 slam+0.186
 opens+0.185
 amaz+0.178
Neg logit contributions
oo-0.102
P-0.099
ter-0.093
scar-0.091
lly-0.087
Token amaz
Token ID40642
Feature activation-1.73e-01

Pos logit contributions
ents+0.005
 ago+0.004
 opens+0.004
 slam+0.004
 distances+0.004
Neg logit contributions
P-0.005
oo-0.004
scar-0.004
lly-0.004
idd-0.004
Tokenement
Token ID972
Feature activation-4.67e-02

Pos logit contributions
ents+0.294
 ago+0.258
 opens+0.256
 slam+0.252
 widen+0.244
Neg logit contributions
P-0.305
oo-0.297
scar-0.290
idd-0.281
lly-0.276
Token.
Token ID13
Feature activation-5.27e-04

Pos logit contributions
ents+0.103
 opens+0.093
 ago+0.093
 slam+0.091
 widen+0.090
Neg logit contributions
P-0.058
oo-0.056
scar-0.055
idd-0.052
lly-0.050
Token
Token ID198
Feature activation-2.38e-03

Pos logit contributions
ents+0.001
 opens+0.001
 ago+0.001
 widen+0.001
 slam+0.001
Neg logit contributions
P-0.001
oo-0.001
scar-0.001
idd-0.001
lly-0.001
Token
Token ID198
Feature activation-1.53e-03

Pos logit contributions
ents+0.004
 ago+0.004
 opens+0.004
 slam+0.003
 widen+0.003
Neg logit contributions
P-0.004
scar-0.004
oo-0.004
idd-0.004
ter-0.004
TokenHer
Token ID9360
Feature activation-8.93e-03

Pos logit contributions
ents+0.003
 ago+0.003
 opens+0.003
 slam+0.003
 widen+0.003
Neg logit contributions
P-0.002
oo-0.002
scar-0.002
idd-0.002
lly-0.002
Token Tim
Token ID5045
Feature activation-3.82e-03

Pos logit contributions
ents+0.051
 opens+0.049
 ago+0.047
 slam+0.045
 gasp+0.044
Neg logit contributions
P-0.043
oo-0.039
idd-0.038
scar-0.037
us-0.037
Token watched
Token ID7342
Feature activation+1.80e-02

Pos logit contributions
ents+0.007
 opens+0.006
 ago+0.006
 slam+0.005
 widen+0.005
Neg logit contributions
P-0.007
oo-0.006
scar-0.006
idd-0.006
lly-0.006
Token them
Token ID606
Feature activation+6.04e-03

Pos logit contributions
P+0.025
oo+0.024
scar+0.024
idd+0.023
lly+0.022
Neg logit contributions
ents-0.037
 ago-0.033
 opens-0.033
 slam-0.032
 widen-0.032
Token with
Token ID351
Feature activation-2.16e-02

Pos logit contributions
oo+0.008
ter+0.007
uc+0.007
lly+0.007
P+0.007
Neg logit contributions
ents-0.012
 ago-0.012
 thirteen-0.010
 seventeen-0.010
 opens-0.010
Token pure
Token ID5899
Feature activation-9.31e-03

Pos logit contributions
ents+0.047
 opens+0.041
 ago+0.040
 slam+0.040
 widen+0.038
Neg logit contributions
P-0.027
oo-0.026
scar-0.025
idd-0.024
lly-0.023
Token amaz
Token ID40642
Feature activation-1.70e-01

Pos logit contributions
ents+0.016
 slam+0.014
 opens+0.013
 ago+0.013
 widen+0.013
Neg logit contributions
oo-0.016
P-0.016
idd-0.014
scar-0.014
uc-0.014
Tokenement
Token ID972
Feature activation-4.53e-02

Pos logit contributions
ents+0.263
 opens+0.226
 ago+0.225
 widen+0.214
 amaz+0.212
Neg logit contributions
P-0.328
scar-0.316
oo-0.308
idd-0.301
lly-0.295
Token,
Token ID11
Feature activation+7.89e-04

Pos logit contributions
ents+0.092
 opens+0.083
 ago+0.081
 slam+0.081
 widen+0.080
Neg logit contributions
oo-0.063
P-0.061
scar-0.059
idd-0.056
lly-0.055
Token never
Token ID1239
Feature activation+6.13e-02

Pos logit contributions
oo+0.001
P+0.001
ter+0.001
 cho+0.001
idd+0.001
Neg logit contributions
ents-0.001
 opens-0.001
 slam-0.001
 ago-0.001
 widen-0.001
Token t
Token ID256
Feature activation+2.57e-02

Pos logit contributions
ter+0.072
o+0.058
we+0.056
 friend+0.055
P+0.055
Neg logit contributions
ents-0.128
 slam-0.117
 seventeen-0.116
 widen-0.115
 opens-0.114
Tokeniring
Token ID3428
Feature activation-1.93e-02

Pos logit contributions
P+0.036
scar+0.032
lly+0.031
oo+0.028
idd+0.028
Neg logit contributions
ents-0.057
 ago-0.046
 slam-0.044
 opens-0.043
 gasp-0.043
Token hair
Token ID4190
Feature activation+4.03e-02

Pos logit contributions
ents+0.032
 ago+0.029
 opens+0.028
 slam+0.027
 amaz+0.027
Neg logit contributions
P-0.044
scar-0.042
oo-0.041
idd-0.040
lly-0.039
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
P+0.058
oo+0.058
scar+0.057
idd+0.054
lly+0.054
Neg logit contributions
ents-0.077
 opens-0.070
 slam-0.068
 ago-0.068
 amaz-0.067
Token brown
Token ID7586
Feature activation+3.54e-03

Pos logit contributions
ents+0.036
 opens+0.032
 ago+0.032
 slam+0.031
 widen+0.031
Neg logit contributions
P-0.028
scar-0.027
oo-0.026
idd-0.025
lly-0.025
Token eyes
Token ID2951
Feature activation-1.48e-02

Pos logit contributions
P+0.007
scar+0.007
oo+0.007
idd+0.006
lly+0.006
Neg logit contributions
ents-0.005
 ago-0.005
 opens-0.005
 slam-0.004
 amaz-0.004
Token,
Token ID11
Feature activation+1.33e-03

Pos logit contributions
ents+0.032
 opens+0.029
 ago+0.029
 slam+0.028
 amaz+0.028
Neg logit contributions
P-0.019
oo-0.018
scar-0.018
idd-0.017
lly-0.016
Token just
Token ID655
Feature activation+8.01e-02

Pos logit contributions
P+0.002
oo+0.002
scar+0.002
idd+0.002
lly+0.002
Neg logit contributions
ents-0.002
 opens-0.002
 ago-0.002
 slam-0.002
 widen-0.002
Token like
Token ID588
Feature activation+5.80e-02

Pos logit contributions
P+0.129
oo+0.129
scar+0.126
lly+0.121
idd+0.121
Neg logit contributions
ents-0.143
 opens-0.121
 slam-0.120
 ago-0.119
 widen-0.117
Token Jack
Token ID3619
Feature activation+3.85e-03

Pos logit contributions
P+0.088
scar+0.083
oo+0.083
idd+0.079
lly+0.076
Neg logit contributions
ents-0.114
 ago-0.103
 opens-0.101
 amaz-0.097
 slam-0.097
Token.
Token ID13
Feature activation+1.41e-04

Pos logit contributions
P+0.006
oo+0.005
scar+0.005
idd+0.005
lly+0.005
Neg logit contributions
ents-0.008
 ago-0.007
 opens-0.007
 slam-0.007
 widen-0.006
Token He
Token ID679
Feature activation+7.84e-03

Pos logit contributions
P+0.000
scar+0.000
oo+0.000
idd+0.000
lly+0.000
Neg logit contributions
ents-0.000
 opens-0.000
 ago-0.000
 slam-0.000
 widen-0.000
Token had
Token ID550
Feature activation-8.14e-03

Pos logit contributions
P+0.011
oo+0.010
scar+0.010
idd+0.010
lly+0.009
Neg logit contributions
ents-0.016
 opens-0.015
 ago-0.015
 slam-0.014
 widen-0.014
Token And
Token ID843
Feature activation+3.12e-02

Pos logit contributions
P+0.009
oo+0.009
scar+0.009
idd+0.008
lly+0.008
Neg logit contributions
ents-0.012
 opens-0.011
 ago-0.011
 widen-0.011
 slam-0.010
Token always
Token ID1464
Feature activation+6.02e-02

Pos logit contributions
oo+0.033
ter+0.032
lly+0.030
P+0.030
scar+0.030
Neg logit contributions
ents-0.072
 widen-0.065
 opens-0.064
 gasp-0.063
 thirteen-0.063
Token remember
Token ID3505
Feature activation-5.45e-03

Pos logit contributions
P+0.096
oo+0.094
scar+0.093
lly+0.089
idd+0.089
Neg logit contributions
ents-0.113
 ago-0.094
 slam-0.094
 opens-0.094
 widen-0.093
Token to
Token ID284
Feature activation+4.36e-03

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.007
 amaz+0.007
Neg logit contributions
P-0.010
scar-0.010
idd-0.009
oo-0.009
w-0.009
Token look
Token ID804
Feature activation+2.49e-03

Pos logit contributions
P+0.008
oo+0.008
scar+0.008
idd+0.007
lly+0.007
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.006
 widen-0.006
Token both
Token ID1111
Feature activation+1.12e-01

Pos logit contributions
P+0.004
scar+0.004
oo+0.004
idd+0.003
lly+0.003
Neg logit contributions
ents-0.005
 opens-0.004
 ago-0.004
 slam-0.004
 widen-0.004
Token ways
Token ID2842
Feature activation+4.23e-02

Pos logit contributions
P+0.190
oo+0.182
scar+0.181
lly+0.176
idd+0.175
Neg logit contributions
ents-0.187
 ago-0.160
 slam-0.154
 widen-0.153
 opens-0.152
Token before
Token ID878
Feature activation-2.27e-02

Pos logit contributions
P+0.061
oo+0.060
scar+0.059
idd+0.055
lly+0.055
Neg logit contributions
ents-0.083
 opens-0.075
 ago-0.073
 slam-0.072
 amaz-0.072
Token crossing
Token ID12538
Feature activation-2.89e-02

Pos logit contributions
ents+0.047
 opens+0.041
 slam+0.040
 gasp+0.040
 widen+0.040
Neg logit contributions
oo-0.032
P-0.032
scar-0.029
lly-0.029
idd-0.028
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
ents+0.058
 opens+0.051
 ago+0.050
 widen+0.049
 amaz+0.049
Neg logit contributions
P-0.043
oo-0.041
scar-0.039
idd-0.038
lly-0.038
Token street
Token ID4675
Feature activation-2.56e-02

Pos logit contributions
P+0.034
scar+0.033
oo+0.032
lly+0.031
idd+0.031
Neg logit contributions
ents-0.022
 amaz-0.018
 opens-0.018
 ago-0.018
 gasp-0.017
Token was
Token ID373
Feature activation+1.05e-02

Pos logit contributions
P+0.026
oo+0.025
scar+0.024
idd+0.024
uc+0.022
Neg logit contributions
ents-0.032
 opens-0.028
 ago-0.027
 amaz-0.027
 slam-0.027
Token ready
Token ID3492
Feature activation+1.37e-03

Pos logit contributions
P+0.021
oo+0.020
scar+0.020
idd+0.019
lly+0.019
Neg logit contributions
ents-0.016
 ago-0.013
 opens-0.013
 slam-0.013
 widen-0.013
Token to
Token ID284
Feature activation+3.63e-02

Pos logit contributions
P+0.002
scar+0.002
oo+0.002
idd+0.002
lly+0.002
Neg logit contributions
ents-0.003
 ago-0.002
 opens-0.002
 slam-0.002
 widen-0.002
Token go
Token ID467
Feature activation+1.63e-02

Pos logit contributions
P+0.068
oo+0.066
scar+0.065
lly+0.062
idd+0.061
Neg logit contributions
ents-0.059
 ago-0.051
 opens-0.050
 slam-0.047
 widen-0.047
Token and
Token ID290
Feature activation+3.59e-02

Pos logit contributions
P+0.023
scar+0.022
oo+0.022
idd+0.021
lly+0.020
Neg logit contributions
ents-0.033
 ago-0.030
 opens-0.030
 slam-0.029
 widen-0.028
Token play
Token ID711
Feature activation+7.96e-02

Pos logit contributions
P+0.059
scar+0.057
oo+0.057
lly+0.056
idd+0.053
Neg logit contributions
ents-0.065
 ago-0.057
 opens-0.055
 widen-0.054
 slam-0.053
Token again
Token ID757
Feature activation-6.43e-03

Pos logit contributions
scar+0.150
P+0.145
idd+0.143
oo+0.140
w+0.140
Neg logit contributions
ents-0.117
 distances-0.106
 opens-0.106
 amaz-0.105
 ago-0.105
Token,
Token ID11
Feature activation+2.86e-02

Pos logit contributions
ents+0.014
 opens+0.012
 ago+0.012
 slam+0.012
 widen+0.012
Neg logit contributions
P-0.009
oo-0.008
scar-0.008
idd-0.007
lly-0.007
Token but
Token ID475
Feature activation+2.04e-02

Pos logit contributions
oo+0.046
scar+0.046
P+0.045
lly+0.043
idd+0.043
Neg logit contributions
ents-0.050
 ago-0.044
 opens-0.044
 widen-0.043
 slam-0.042
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
P+0.027
scar+0.027
oo+0.027
lly+0.025
idd+0.024
Neg logit contributions
ents-0.043
 opens-0.039
 ago-0.038
 widen-0.037
 slam-0.037
Token this
Token ID428
Feature activation+2.68e-03

Pos logit contributions
P+0.030
oo+0.028
idd+0.028
scar+0.028
lly+0.026
Neg logit contributions
ents-0.031
 ago-0.027
 opens-0.027
 amaz-0.026
 slam-0.026
Token SPL
Token ID46341
Feature activation+3.68e-03

Pos logit contributions
P+0.041
oo+0.040
scar+0.039
lly+0.037
idd+0.037
Neg logit contributions
ents-0.036
 ago-0.033
 opens-0.033
 amaz-0.032
 gasp-0.031
TokenASH
Token ID11211
Feature activation+3.09e-02

Pos logit contributions
P+0.007
lly+0.007
us+0.007
scar+0.007
dd+0.006
Neg logit contributions
ents-0.005
 opens-0.004
 ago-0.004
 amaz-0.004
 thirteen-0.004
Token was
Token ID373
Feature activation+4.68e-02

Pos logit contributions
P+0.044
scar+0.038
idd+0.036
oo+0.036
w+0.035
Neg logit contributions
ents-0.069
 opens-0.060
 ago-0.060
 widen-0.058
 slam-0.058
Token always
Token ID1464
Feature activation+7.65e-02

Pos logit contributions
P+0.083
scar+0.079
oo+0.077
idd+0.076
lly+0.073
Neg logit contributions
ents-0.079
 ago-0.070
 opens-0.068
 widen-0.067
 amaz-0.066
Token under
Token ID739
Feature activation-8.59e-03

Pos logit contributions
P+0.131
oo+0.126
scar+0.125
idd+0.120
lly+0.117
Neg logit contributions
ents-0.134
 ago-0.119
 opens-0.118
 slam-0.114
 widen-0.114
Token control
Token ID1630
Feature activation+9.95e-02

Pos logit contributions
ents+0.018
 opens+0.016
 ago+0.016
 widen+0.016
 slam+0.015
Neg logit contributions
P-0.012
oo-0.011
scar-0.011
idd-0.010
lly-0.010
Token!
Token ID0
Feature activation+2.35e-02

Pos logit contributions
P+0.150
oo+0.149
scar+0.146
idd+0.141
lly+0.137
Neg logit contributions
ents-0.186
 opens-0.165
 ago-0.161
 widen-0.160
 slam-0.160
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-02

Pos logit contributions
P+0.041
oo+0.040
scar+0.039
lly+0.038
idd+0.038
Neg logit contributions
ents-0.040
 ago-0.035
 opens-0.035
 amaz-0.034
 slam-0.033
TokenOnce
Token ID7454
Feature activation+1.40e-02

Pos logit contributions
 ago+0.018
ents+0.018
 opens+0.017
 gasp+0.016
 widen+0.016
Neg logit contributions
oo-0.016
scar-0.015
uc-0.015
P-0.015
w-0.014
Token upon
Token ID2402
Feature activation+2.91e-03

Pos logit contributions
P+0.021
oo+0.020
scar+0.020
idd+0.019
lly+0.019
Neg logit contributions
ents-0.027
 opens-0.024
 ago-0.024
 slam-0.024
 widen-0.024
Token a
Token ID257
Feature activation+2.00e-02

Pos logit contributions
P+0.004
oo+0.004
scar+0.004
idd+0.003
lly+0.003
Neg logit contributions
ents-0.006
 ago-0.006
 opens-0.006
 slam-0.005
 widen-0.005
Token tag
Token ID7621
Feature activation+1.90e-02

Pos logit contributions
scar+0.090
us+0.085
uc+0.083
ina+0.082
ugg+0.082
Neg logit contributions
 distances-0.050
 slam-0.048
 opens-0.047
 ago-0.045
ents-0.045
Token and
Token ID290
Feature activation-3.03e-04

Pos logit contributions
P+0.038
scar+0.037
oo+0.036
idd+0.036
lly+0.034
Neg logit contributions
ents-0.028
 ago-0.024
 opens-0.024
 slam-0.023
 distances-0.022
Token hide
Token ID7808
Feature activation+1.02e-02

Pos logit contributions
ents+0.001
 opens+0.000
 ago+0.000
 widen+0.000
 slam+0.000
Neg logit contributions
P-0.000
scar-0.000
oo-0.000
idd-0.000
lly-0.000
Token-
Token ID12
Feature activation+7.68e-02

Pos logit contributions
scar+0.021
P+0.019
oo+0.019
idd+0.018
lly+0.018
Neg logit contributions
ents-0.016
 ago-0.013
 slam-0.013
 distances-0.013
 opens-0.013
Tokenand
Token ID392
Feature activation-9.03e-03

Pos logit contributions
scar+0.171
P+0.170
oo+0.169
idd+0.163
lly+0.162
Neg logit contributions
ents-0.094
 ago-0.070
 opens-0.069
 slam-0.066
 distances-0.065
Token-
Token ID12
Feature activation+8.91e-02

Pos logit contributions
ents+0.022
 opens+0.019
 ago+0.019
 distances+0.019
 slam+0.019
Neg logit contributions
scar-0.009
P-0.009
idd-0.009
ter-0.008
oo-0.008
Tokenseek
Token ID36163
Feature activation+1.68e-02

Pos logit contributions
P+0.184
scar+0.180
oo+0.178
idd+0.175
ter+0.173
Neg logit contributions
ents-0.123
 ago-0.098
 distances-0.094
 opens-0.091
 slam-0.090
Token.
Token ID13
Feature activation+7.51e-03

Pos logit contributions
P+0.030
scar+0.029
oo+0.027
idd+0.027
w+0.026
Neg logit contributions
ents-0.029
 ago-0.027
 opens-0.026
 gasp-0.025
 slam-0.025
Token
Token ID198
Feature activation-6.31e-03

Pos logit contributions
P+0.014
oo+0.013
scar+0.013
idd+0.013
w+0.012
Neg logit contributions
ents-0.012
 ago-0.011
 opens-0.011
 slam-0.010
 widen-0.010
Token
Token ID198
Feature activation-6.23e-03

Pos logit contributions
ents+0.011
 ago+0.009
 opens+0.009
 slam+0.009
 widen+0.009
Neg logit contributions
P-0.011
scar-0.010
oo-0.010
idd-0.010
ter-0.010
TokenSuddenly
Token ID38582
Feature activation-7.98e-03

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.008
 widen+0.007
Neg logit contributions
P-0.011
oo-0.011
uc-0.011
scar-0.011
idd-0.011
Token a
Token ID257
Feature activation+2.56e-02

Pos logit contributions
P+0.051
scar+0.049
oo+0.049
idd+0.046
uc+0.044
Neg logit contributions
ents-0.062
 opens-0.053
 ago-0.052
 amaz-0.052
 slam-0.051
Token spark
Token ID9009
Feature activation-2.72e-02

Pos logit contributions
P+0.046
oo+0.043
scar+0.043
lly+0.042
idd+0.042
Neg logit contributions
ents-0.043
 ago-0.038
 opens-0.037
 distances-0.035
 widen-0.035
Tokenly
Token ID306
Feature activation-5.34e-03

Pos logit contributions
ents+0.051
 ago+0.045
 opens+0.044
 slam+0.043
 amaz+0.043
Neg logit contributions
P-0.044
oo-0.042
scar-0.041
idd-0.039
lly-0.039
Token o
Token ID267
Feature activation+7.09e-02

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.007
 widen+0.007
Neg logit contributions
P-0.010
oo-0.009
scar-0.009
idd-0.009
lly-0.009
Tokenasis
Token ID17765
Feature activation+6.35e-02

Pos logit contributions
P+0.080
scar+0.077
idd+0.075
oo+0.074
dd+0.073
Neg logit contributions
ents-0.159
 opens-0.140
 ago-0.140
 slam-0.138
 widen-0.134
Token in
Token ID287
Feature activation+3.56e-02

Pos logit contributions
P+0.075
oo+0.073
scar+0.072
idd+0.068
lly+0.066
Neg logit contributions
ents-0.142
 opens-0.127
 ago-0.127
 amaz-0.126
 widen-0.125
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
P+0.044
oo+0.039
scar+0.039
lly+0.038
idd+0.038
Neg logit contributions
ents-0.080
 ago-0.071
 opens-0.070
 widen-0.068
 slam-0.067
Token distance
Token ID5253
Feature activation+8.23e-03

Pos logit contributions
P+0.067
oo+0.065
scar+0.065
idd+0.062
lly+0.061
Neg logit contributions
ents-0.047
 ago-0.040
 opens-0.040
 slam-0.038
 widen-0.038
Token.
Token ID13
Feature activation+4.61e-03

Pos logit contributions
oo+0.008
P+0.007
scar+0.007
ter+0.007
lly+0.007
Neg logit contributions
ents-0.020
 opens-0.018
 ago-0.018
 widen-0.018
 slam-0.018
Token
Token ID220
Feature activation-1.06e-02

Pos logit contributions
P+0.006
oo+0.006
scar+0.006
idd+0.006
lly+0.006
Neg logit contributions
ents-0.009
 ago-0.008
 opens-0.008
 widen-0.008
 amaz-0.008
Token
Token ID198
Feature activation-5.81e-04

Pos logit contributions
ents+0.020
 ago+0.018
 opens+0.017
 slam+0.017
 widen+0.017
Neg logit contributions
P-0.017
scar-0.016
oo-0.016
idd-0.015
lly-0.015
Token and
Token ID290
Feature activation+1.93e-02

Pos logit contributions
P+0.028
oo+0.028
ter+0.025
scar+0.024
B+0.024
Neg logit contributions
ents-0.048
 opens-0.040
 widen-0.040
 slam-0.040
 thirteen-0.040
Token saw
Token ID2497
Feature activation+2.77e-02

Pos logit contributions
P+0.026
scar+0.024
oo+0.024
idd+0.023
lly+0.023
Neg logit contributions
ents-0.041
 ago-0.037
 opens-0.037
 slam-0.036
 widen-0.036
Token the
Token ID262
Feature activation-5.42e-03

Pos logit contributions
P+0.046
scar+0.044
oo+0.044
idd+0.041
lly+0.040
Neg logit contributions
ents-0.051
 opens-0.045
 ago-0.045
 amaz-0.044
 gasp-0.044
Token crab
Token ID32202
Feature activation-1.16e-02

Pos logit contributions
ents+0.011
 ago+0.010
 opens+0.010
 amaz+0.010
 gasp+0.010
Neg logit contributions
P-0.008
oo-0.008
scar-0.007
idd-0.007
lly-0.007
Token.
Token ID13
Feature activation-8.24e-03

Pos logit contributions
ents+0.027
 ago+0.025
 seventeen+0.024
 amaz+0.024
 widen+0.024
Neg logit contributions
oo-0.011
ter-0.010
P-0.010
in-0.010
lly-0.010
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
ents+0.014
 ago+0.012
 opens+0.012
 slam+0.011
 widen+0.011
Neg logit contributions
P-0.015
oo-0.014
scar-0.014
idd-0.014
lly-0.014
Token
Token ID198
Feature activation-9.96e-03

Pos logit contributions
ents+0.019
 ago+0.017
 opens+0.016
 slam+0.016
 widen+0.015
Neg logit contributions
P-0.020
scar-0.019
oo-0.019
idd-0.018
ter-0.018
Token"
Token ID1
Feature activation-1.29e-03

Pos logit contributions
ents+0.020
 ago+0.018
 opens+0.018
 slam+0.017
 widen+0.017
Neg logit contributions
P-0.015
oo-0.014
scar-0.014
idd-0.013
lly-0.013
TokenIt
Token ID1026
Feature activation+2.99e-02

Pos logit contributions
ents+0.003
 opens+0.002
 widen+0.002
 ago+0.002
 gasp+0.002
Neg logit contributions
oo-0.002
scar-0.002
P-0.002
idd-0.002
uc-0.002
Token was
Token ID373
Feature activation-1.07e-02

Pos logit contributions
P+0.059
oo+0.056
scar+0.056
idd+0.054
uc+0.052
Neg logit contributions
ents-0.045
 opens-0.041
 ago-0.038
 slam-0.037
 widen-0.037
Token the
Token ID262
Feature activation-3.72e-02

Pos logit contributions
ents+0.019
 ago+0.017
 opens+0.017
 slam+0.016
 widen+0.016
Neg logit contributions
P-0.018
oo-0.017
scar-0.017
idd-0.017
lly-0.016
Token Lily
Token ID20037
Feature activation-4.91e-02

Pos logit contributions
ents+0.003
 ago+0.002
 opens+0.002
 slam+0.002
 widen+0.002
Neg logit contributions
P-0.003
oo-0.003
scar-0.003
idd-0.002
w-0.002
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
ents+0.088
 opens+0.079
 ago+0.078
 gasp+0.075
 slam+0.074
Neg logit contributions
P-0.082
scar-0.078
oo-0.077
idd-0.075
lly-0.071
Token Do
Token ID2141
Feature activation-5.46e-02

Pos logit contributions
ents+0.029
 opens+0.025
 ago+0.024
 widen+0.024
 slam+0.024
Neg logit contributions
P-0.040
oo-0.038
scar-0.038
idd-0.037
lly-0.036
Token you
Token ID345
Feature activation-3.25e-02

Pos logit contributions
ents+0.115
 opens+0.103
 ago+0.103
 slam+0.099
 widen+0.098
Neg logit contributions
P-0.075
scar-0.071
oo-0.071
idd-0.068
lly-0.065
Token understand
Token ID1833
Feature activation-2.10e-02

Pos logit contributions
ents+0.048
 opens+0.041
 slam+0.040
 amaz+0.039
 ago+0.039
Neg logit contributions
P-0.063
oo-0.062
scar-0.059
lly-0.058
w-0.057
Token that
Token ID326
Feature activation+2.47e-02

Pos logit contributions
ents+0.047
 opens+0.042
 ago+0.042
 slam+0.040
 widen+0.040
Neg logit contributions
P-0.027
scar-0.026
oo-0.025
idd-0.025
lly-0.023
Token I
Token ID314
Feature activation-2.92e-03

Pos logit contributions
P+0.034
oo+0.032
scar+0.032
lly+0.030
idd+0.030
Neg logit contributions
ents-0.051
 ago-0.046
 slam-0.045
 opens-0.045
 widen-0.045
Token like
Token ID588
Feature activation+1.00e-02

Pos logit contributions
ents+0.005
 ago+0.004
 opens+0.004
 gasp+0.004
 widen+0.004
Neg logit contributions
P-0.005
oo-0.005
scar-0.005
idd-0.005
lly-0.005
Token my
Token ID616
Feature activation+1.33e-02

Pos logit contributions
P+0.015
oo+0.014
scar+0.014
idd+0.013
lly+0.013
Neg logit contributions
ents-0.020
 opens-0.018
 ago-0.018
 slam-0.018
 widen-0.018
Token fur
Token ID9230
Feature activation-1.95e-02

Pos logit contributions
P+0.020
oo+0.019
scar+0.019
idd+0.018
lly+0.018
Neg logit contributions
ents-0.026
 opens-0.022
 ago-0.022
 widen-0.022
 slam-0.022
Token?"
Token ID1701
Feature activation-1.29e-02

Pos logit contributions
ents+0.041
 opens+0.036
 ago+0.036
 slam+0.035
 widen+0.035
Neg logit contributions
P-0.027
scar-0.025
idd-0.024
oo-0.024
lly-0.023
Token She
Token ID1375
Feature activation-6.29e-03

Pos logit contributions
P+0.022
oo+0.021
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.018
 ago-0.016
 opens-0.015
 slam-0.015
 widen-0.015
Token felt
Token ID2936
Feature activation-1.32e-02

Pos logit contributions
ents+0.012
 opens+0.011
 ago+0.011
 slam+0.010
 widen+0.010
Neg logit contributions
P-0.010
oo-0.009
scar-0.009
idd-0.009
lly-0.009
Token like
Token ID588
Feature activation+4.19e-02

Pos logit contributions
ents+0.023
 ago+0.021
 opens+0.021
 slam+0.019
 widen+0.019
Neg logit contributions
P-0.022
oo-0.021
scar-0.021
lly-0.020
idd-0.020
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
P+0.064
oo+0.061
scar+0.061
idd+0.058
lly+0.057
Neg logit contributions
ents-0.080
 opens-0.071
 ago-0.071
 slam-0.069
 widen-0.069
Token bird
Token ID6512
Feature activation+3.76e-02

Pos logit contributions
P+0.018
oo+0.018
scar+0.018
idd+0.017
lly+0.016
Neg logit contributions
ents-0.017
 opens-0.015
 ago-0.015
 slam-0.014
 amaz-0.014
Token.
Token ID13
Feature activation+4.99e-03

Pos logit contributions
oo+0.052
P+0.050
ter+0.048
idd+0.045
scar+0.045
Neg logit contributions
ents-0.078
 opens-0.071
 ago-0.069
 amaz-0.069
 widen-0.068
Token She
Token ID1375
Feature activation-1.74e-02

Pos logit contributions
P+0.011
oo+0.010
scar+0.010
idd+0.010
lly+0.010
Neg logit contributions
ents-0.007
 ago-0.006
 opens-0.006
 slam-0.005
 widen-0.005
Token moved
Token ID3888
Feature activation-1.75e-03

Pos logit contributions
ents+0.033
 opens+0.030
 ago+0.029
 slam+0.028
 gasp+0.028
Neg logit contributions
P-0.028
oo-0.027
scar-0.026
idd-0.025
uc-0.024
Token to
Token ID284
Feature activation+2.59e-02

Pos logit contributions
ents+0.003
 ago+0.003
 opens+0.003
 slam+0.003
 widen+0.003
Neg logit contributions
P-0.003
oo-0.003
scar-0.003
idd-0.003
lly-0.003
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
oo+0.030
P+0.029
lly+0.027
ter+0.026
scar+0.026
Neg logit contributions
ents-0.058
 ago-0.052
 opens-0.051
 widen-0.050
 slam-0.049
Token sandbox
Token ID35204
Feature activation+3.14e-02

Pos logit contributions
P+0.023
oo+0.021
scar+0.021
idd+0.020
lly+0.020
Neg logit contributions
ents-0.023
 ago-0.020
 opens-0.020
 widen-0.019
 slam-0.019
Token!"
Token ID2474
Feature activation-1.82e-02

Pos logit contributions
P+0.063
oo+0.059
scar+0.058
idd+0.056
lly+0.055
Neg logit contributions
ents-0.094
 ago-0.085
 opens-0.084
 slam-0.083
 widen-0.083
Token Her
Token ID2332
Feature activation-4.39e-03

Pos logit contributions
ents+0.034
 ago+0.031
 widen+0.029
 opens+0.029
 slam+0.029
Neg logit contributions
P-0.029
oo-0.028
scar-0.027
lly-0.026
idd-0.026
Token dad
Token ID9955
Feature activation-2.14e-02

Pos logit contributions
ents+0.010
 ago+0.009
 opens+0.009
 slam+0.009
 widen+0.008
Neg logit contributions
P-0.006
oo-0.005
scar-0.005
idd-0.005
lly-0.005
Token was
Token ID373
Feature activation-2.40e-02

Pos logit contributions
ents+0.039
 opens+0.034
 gasp+0.033
 ago+0.033
 slam+0.032
Neg logit contributions
P-0.036
scar-0.035
oo-0.034
idd-0.033
w-0.031
Token very
Token ID845
Feature activation+1.63e-02

Pos logit contributions
ents+0.038
 opens+0.035
 ago+0.035
 slam+0.033
 distances+0.033
Neg logit contributions
P-0.044
oo-0.043
scar-0.042
idd-0.042
lly-0.041
Token surprised
Token ID6655
Feature activation-1.80e-02

Pos logit contributions
P+0.030
oo+0.029
scar+0.029
lly+0.029
idd+0.028
Neg logit contributions
ents-0.026
 ago-0.024
 distances-0.023
 opens-0.022
 slam-0.022
Token.
Token ID13
Feature activation+2.08e-03

Pos logit contributions
ents+0.035
 ago+0.032
 opens+0.032
 slam+0.031
 widen+0.031
Neg logit contributions
P-0.027
oo-0.026
scar-0.025
idd-0.024
lly-0.023
Token "
Token ID366
Feature activation+1.24e-02

Pos logit contributions
P+0.003
oo+0.003
scar+0.003
idd+0.003
lly+0.003
Neg logit contributions
ents-0.004
 ago-0.003
 opens-0.003
 widen-0.003
 slam-0.003
TokenWow
Token ID22017
Feature activation-1.08e-03

Pos logit contributions
P+0.016
scar+0.014
oo+0.014
idd+0.014
lly+0.013
Neg logit contributions
ents-0.028
 ago-0.025
 opens-0.025
 slam-0.025
 widen-0.024
Token,
Token ID11
Feature activation+2.51e-03

Pos logit contributions
ents+0.002
 opens+0.002
 gasp+0.002
 ago+0.001
 slam+0.001
Neg logit contributions
P-0.002
scar-0.002
oo-0.002
idd-0.002
ined-0.002
Token Mia
Token ID32189
Feature activation+1.22e-02

Pos logit contributions
P+0.004
oo+0.004
scar+0.004
idd+0.004
lly+0.004
Neg logit contributions
ents-0.005
 ago-0.004
 opens-0.004
 widen-0.004
 slam-0.004
Token started
Token ID2067
Feature activation+1.55e-02

Pos logit contributions
P+0.003
oo+0.003
scar+0.003
lly+0.003
idd+0.003
Neg logit contributions
ents-0.006
 ago-0.005
 opens-0.005
 widen-0.005
 slam-0.005
Token to
Token ID284
Feature activation+2.35e-03

Pos logit contributions
oo+0.020
P+0.019
scar+0.018
idd+0.018
lly+0.017
Neg logit contributions
ents-0.032
 ago-0.029
 opens-0.029
 slam-0.028
 widen-0.028
Token bloom
Token ID29955
Feature activation-1.73e-02

Pos logit contributions
oo+0.004
lly+0.004
P+0.004
ter+0.004
scar+0.003
Neg logit contributions
ents-0.004
 ago-0.004
 opens-0.003
 distances-0.003
 thirteen-0.003
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
ents+0.033
 ago+0.029
 opens+0.029
 slam+0.028
 amaz+0.028
Neg logit contributions
P-0.027
oo-0.026
scar-0.025
idd-0.024
lly-0.024
Token It
Token ID632
Feature activation+1.74e-02

Pos logit contributions
P+0.021
oo+0.021
scar+0.020
idd+0.020
lly+0.019
Neg logit contributions
ents-0.019
 opens-0.016
 ago-0.016
 widen-0.016
 slam-0.015
Token grew
Token ID6348
Feature activation-2.57e-02

Pos logit contributions
P+0.021
oo+0.019
lly+0.019
scar+0.019
idd+0.018
Neg logit contributions
ents-0.039
 ago-0.036
 slam-0.034
 opens-0.034
 widen-0.034
Token bigger
Token ID5749
Feature activation-3.39e-02

Pos logit contributions
ents+0.038
 ago+0.034
 opens+0.032
 slam+0.032
 amaz+0.032
Neg logit contributions
P-0.049
oo-0.047
ter-0.045
lly-0.045
scar-0.045
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
ents+0.064
 widen+0.059
 slam+0.059
 contain+0.056
 gasp+0.056
Neg logit contributions
ter-0.044
we-0.043
oo-0.042
P-0.042
idd-0.041
Token bigger
Token ID5749
Feature activation-1.58e-02

Pos logit contributions
P+0.022
lly+0.021
ter+0.021
we+0.020
 cho+0.020
Neg logit contributions
ents-0.022
 ago-0.022
 slam-0.020
 gasp-0.020
 distances-0.019
Token until
Token ID1566
Feature activation+1.69e-03

Pos logit contributions
ents+0.031
 widen+0.028
 slam+0.028
 opens+0.027
 gasp+0.027
Neg logit contributions
oo-0.021
P-0.020
ter-0.020
we-0.020
idd-0.019
Token it
Token ID340
Feature activation+7.71e-03

Pos logit contributions
P+0.003
scar+0.003
oo+0.003
idd+0.003
uc+0.002
Neg logit contributions
ents-0.003
 ago-0.003
 opens-0.003
 amaz-0.003
 gasp-0.003
Token accept
Token ID2453
Feature activation-7.58e-03

Pos logit contributions
ents+0.024
 opens+0.021
 ago+0.021
 slam+0.021
 widen+0.020
Neg logit contributions
P-0.021
oo-0.021
scar-0.020
lly-0.019
idd-0.019
Token things
Token ID1243
Feature activation-1.70e-02

Pos logit contributions
ents+0.013
 opens+0.012
 ago+0.012
 slam+0.011
 amaz+0.011
Neg logit contributions
P-0.013
scar-0.013
idd-0.012
oo-0.012
lly-0.012
Token as
Token ID355
Feature activation+2.08e-02

Pos logit contributions
ents+0.030
 opens+0.027
 ago+0.027
 slam+0.025
 widen+0.025
Neg logit contributions
P-0.029
scar-0.028
oo-0.028
idd-0.027
lly-0.026
Token they
Token ID484
Feature activation-1.49e-02

Pos logit contributions
P+0.037
scar+0.036
oo+0.034
idd+0.034
lly+0.032
Neg logit contributions
ents-0.035
 ago-0.033
 opens-0.032
 slam-0.029
 widen-0.029
Token are
Token ID389
Feature activation-2.52e-02

Pos logit contributions
ents+0.025
 ago+0.022
 amaz+0.022
 opens+0.021
 slam+0.021
Neg logit contributions
P-0.026
oo-0.025
lly-0.024
scar-0.024
w-0.024
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
ents+0.046
 opens+0.041
 ago+0.040
 amaz+0.040
 slam+0.040
Neg logit contributions
P-0.040
oo-0.039
scar-0.038
idd-0.037
lly-0.036
Token focus
Token ID2962
Feature activation+1.96e-02

Pos logit contributions
ents+0.054
 widen+0.048
 thirteen+0.047
 seventeen+0.047
 opens+0.046
Neg logit contributions
lly-0.028
P-0.027
ter-0.027
oo-0.027
uc-0.024
Token on
Token ID319
Feature activation-1.03e-03

Pos logit contributions
scar+0.029
P+0.029
oo+0.028
lly+0.028
idd+0.027
Neg logit contributions
ents-0.037
 ago-0.035
 opens-0.033
 widen-0.032
 distances-0.032
Token what
Token ID644
Feature activation+1.05e-02

Pos logit contributions
ents+0.002
 ago+0.002
 opens+0.002
 slam+0.002
 widen+0.002
Neg logit contributions
P-0.002
scar-0.001
oo-0.001
idd-0.001
lly-0.001
Token we
Token ID356
Feature activation-2.03e-02

Pos logit contributions
P+0.017
oo+0.016
scar+0.016
idd+0.015
lly+0.014
Neg logit contributions
ents-0.020
 opens-0.018
 ago-0.018
 slam-0.017
 gasp-0.016
Token can
Token ID460
Feature activation+1.46e-03

Pos logit contributions
ents+0.033
 ago+0.028
 opens+0.028
 slam+0.027
 amaz+0.027
Neg logit contributions
P-0.038
oo-0.036
scar-0.035
lly-0.035
idd-0.034
Token you
Token ID345
Feature activation-4.97e-03

Pos logit contributions
ents+0.055
 ago+0.049
 opens+0.048
 slam+0.045
 amaz+0.045
Neg logit contributions
scar-0.043
P-0.042
oo-0.039
idd-0.039
lly-0.037
Token.
Token ID13
Feature activation-4.78e-03

Pos logit contributions
ents+0.011
 widen+0.011
 opens+0.010
 amaz+0.010
 slam+0.010
Neg logit contributions
oo-0.005
P-0.005
idd-0.005
scar-0.004
lly-0.004
Token Today
Token ID6288
Feature activation-5.19e-02

Pos logit contributions
 ago+0.006
ents+0.006
 opens+0.005
 slam+0.005
 amaz+0.005
Neg logit contributions
P-0.011
scar-0.010
lly-0.010
uc-0.010
oo-0.010
Token is
Token ID318
Feature activation+1.01e-02

Pos logit contributions
ents+0.093
 opens+0.086
 ago+0.085
 slam+0.082
 gasp+0.079
Neg logit contributions
P-0.088
scar-0.083
oo-0.081
idd-0.080
dd-0.075
Token your
Token ID534
Feature activation+5.37e-03

Pos logit contributions
P+0.019
scar+0.018
oo+0.017
idd+0.017
uc+0.016
Neg logit contributions
ents-0.016
 opens-0.016
 ago-0.016
 slam-0.014
 amaz-0.014
Token birthday
Token ID10955
Feature activation-6.25e-02

Pos logit contributions
P+0.009
scar+0.009
oo+0.009
idd+0.008
lly+0.008
Neg logit contributions
ents-0.009
 ago-0.008
 opens-0.008
 slam-0.008
 amaz-0.008
Token.
Token ID13
Feature activation-6.10e-03

Pos logit contributions
ents+0.124
 ago+0.112
 opens+0.112
 slam+0.107
 widen+0.106
Neg logit contributions
P-0.093
scar-0.088
oo-0.088
idd-0.083
lly-0.081
Token You
Token ID921
Feature activation-6.41e-04

Pos logit contributions
ents+0.010
 opens+0.009
 ago+0.009
 slam+0.009
 widen+0.009
Neg logit contributions
P-0.011
oo-0.010
scar-0.010
idd-0.010
lly-0.009
Token are
Token ID389
Feature activation-2.61e-02

Pos logit contributions
ents+0.001
 ago+0.001
 opens+0.001
 slam+0.001
 widen+0.001
Neg logit contributions
P-0.001
oo-0.001
scar-0.001
idd-0.001
lly-0.001
Token four
Token ID1440
Feature activation-1.05e-02

Pos logit contributions
ents+0.047
 ago+0.042
 opens+0.042
 slam+0.040
 widen+0.039
Neg logit contributions
P-0.044
scar-0.043
oo-0.042
idd-0.040
lly-0.039
Token years
Token ID812
Feature activation-7.65e-03

Pos logit contributions
ents+0.021
 opens+0.018
 ago+0.018
 widen+0.018
 slam+0.018
Neg logit contributions
P-0.015
oo-0.015
scar-0.014
idd-0.014
lly-0.014
Token.
Token ID13
Feature activation+7.13e-03

Pos logit contributions
ents+0.053
 ago+0.048
 opens+0.048
 slam+0.047
 widen+0.046
Neg logit contributions
P-0.033
oo-0.031
scar-0.031
idd-0.029
lly-0.028
Token
Token ID198
Feature activation-5.59e-03

Pos logit contributions
P+0.013
oo+0.013
scar+0.012
idd+0.012
lly+0.012
Neg logit contributions
ents-0.012
 ago-0.010
 opens-0.010
 slam-0.010
 amaz-0.010
Token
Token ID198
Feature activation-6.53e-03

Pos logit contributions
ents+0.009
 ago+0.008
 opens+0.008
 slam+0.008
 gasp+0.007
Neg logit contributions
P-0.010
ter-0.010
idd-0.009
uc-0.009
scar-0.009
TokenAt
Token ID2953
Feature activation-7.71e-03

Pos logit contributions
ents+0.010
 ago+0.009
 slam+0.008
 opens+0.008
 widen+0.008
Neg logit contributions
oo-0.011
idd-0.011
ter-0.011
uc-0.011
scar-0.011
Token the
Token ID262
Feature activation-5.36e-03

Pos logit contributions
ents+0.016
 ago+0.014
 opens+0.014
 widen+0.014
 slam+0.014
Neg logit contributions
P-0.011
oo-0.011
scar-0.010
lly-0.010
idd-0.009
Token park
Token ID3952
Feature activation-3.04e-02

Pos logit contributions
ents+0.011
 ago+0.010
 opens+0.010
 gasp+0.010
 widen+0.010
Neg logit contributions
P-0.008
oo-0.007
scar-0.007
lly-0.007
idd-0.007
Token,
Token ID11
Feature activation-1.98e-03

Pos logit contributions
ents+0.057
 ago+0.051
 opens+0.051
 slam+0.049
 widen+0.048
Neg logit contributions
P-0.049
scar-0.046
oo-0.046
idd-0.044
lly-0.042
Token Lily
Token ID20037
Feature activation+2.98e-03

Pos logit contributions
ents+0.003
 ago+0.003
 opens+0.003
 slam+0.003
 amaz+0.003
Neg logit contributions
P-0.003
scar-0.003
oo-0.003
idd-0.003
ined-0.003
Token saw
Token ID2497
Feature activation+4.57e-03

Pos logit contributions
P+0.005
oo+0.004
scar+0.004
idd+0.004
lly+0.004
Neg logit contributions
ents-0.006
 ago-0.005
 opens-0.005
 slam-0.005
 widen-0.005
Token a
Token ID257
Feature activation+4.59e-03

Pos logit contributions
P+0.007
scar+0.007
oo+0.007
idd+0.007
uc+0.006
Neg logit contributions
ents-0.009
 opens-0.008
 ago-0.008
 amaz-0.008
 slam-0.007
Token huge
Token ID3236
Feature activation-3.51e-02

Pos logit contributions
P+0.008
oo+0.008
scar+0.008
idd+0.008
lly+0.007
Neg logit contributions
ents-0.008
 ago-0.007
 opens-0.007
 slam-0.006
 widen-0.006
Token inside
Token ID2641
Feature activation-5.01e-03

Pos logit contributions
P+0.067
oo+0.065
scar+0.063
idd+0.060
lly+0.059
Neg logit contributions
ents-0.104
 opens-0.094
 ago-0.094
 slam-0.091
 amaz-0.091
Token.
Token ID13
Feature activation-2.68e-03

Pos logit contributions
ents+0.010
 ago+0.009
 opens+0.009
 slam+0.009
 widen+0.009
Neg logit contributions
P-0.007
scar-0.007
oo-0.007
idd-0.006
lly-0.006
Token She
Token ID1375
Feature activation-1.24e-02

Pos logit contributions
ents+0.005
 widen+0.004
 thirteen+0.004
 ago+0.004
 distances+0.004
Neg logit contributions
P-0.004
scar-0.004
oo-0.004
lly-0.004
idd-0.004
Token was
Token ID373
Feature activation-3.09e-03

Pos logit contributions
ents+0.026
 ago+0.024
 opens+0.024
 slam+0.023
 widen+0.023
Neg logit contributions
P-0.017
oo-0.016
scar-0.016
idd-0.015
lly-0.015
Token so
Token ID523
Feature activation-1.21e-02

Pos logit contributions
ents+0.006
 distances+0.006
 ago+0.005
 opens+0.005
igsaw+0.005
Neg logit contributions
oo-0.005
ter-0.004
idd-0.004
lly-0.004
scar-0.004
Token excited
Token ID6568
Feature activation-2.79e-02

Pos logit contributions
ents+0.022
 distances+0.021
 ago+0.020
 thirteen+0.020
 slam+0.020
Neg logit contributions
oo-0.019
P-0.019
lly-0.019
idd-0.018
dd-0.017
Token that
Token ID326
Feature activation+6.08e-02

Pos logit contributions
ents+0.052
 ago+0.046
 opens+0.046
 slam+0.045
 widen+0.044
Neg logit contributions
P-0.045
oo-0.043
scar-0.042
idd-0.040
lly-0.040
Token she
Token ID673
Feature activation-4.13e-03

Pos logit contributions
P+0.101
oo+0.096
scar+0.096
lly+0.091
idd+0.090
Neg logit contributions
ents-0.112
 ago-0.099
 opens-0.097
 slam-0.094
 widen-0.094
Token quickly
Token ID2952
Feature activation-1.38e-02

Pos logit contributions
ents+0.007
 opens+0.006
 ago+0.006
 slam+0.006
 gasp+0.006
Neg logit contributions
P-0.007
scar-0.007
oo-0.007
idd-0.007
w-0.007
Token grabbed
Token ID13176
Feature activation+4.61e-02

Pos logit contributions
ents+0.026
 ago+0.023
 opens+0.023
 slam+0.022
 widen+0.022
Neg logit contributions
P-0.021
oo-0.020
scar-0.020
idd-0.019
lly-0.019
Token the
Token ID262
Feature activation-5.69e-03

Pos logit contributions
P+0.075
scar+0.073
oo+0.071
idd+0.070
lly+0.068
Neg logit contributions
ents-0.085
 opens-0.077
 ago-0.073
 slam-0.072
 gasp-0.072
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
ents+0.004
 ago+0.004
 opens+0.004
 amaz+0.003
 slam+0.003
Neg logit contributions
P-0.002
oo-0.002
scar-0.002
idd-0.002
lly-0.002
Token "
Token ID366
Feature activation+9.83e-03

Pos logit contributions
P+0.025
scar+0.024
oo+0.024
idd+0.023
lly+0.023
Neg logit contributions
ents-0.023
 ago-0.020
 opens-0.020
 slam-0.019
 widen-0.019
TokenI
Token ID40
Feature activation+3.95e-02

Pos logit contributions
P+0.011
oo+0.011
scar+0.011
idd+0.010
lly+0.010
Neg logit contributions
ents-0.023
 ago-0.021
 opens-0.020
 widen-0.020
 slam-0.020
Token want
Token ID765
Feature activation-2.02e-02

Pos logit contributions
P+0.074
scar+0.068
oo+0.068
idd+0.065
we+0.064
Neg logit contributions
ents-0.064
 ago-0.059
 opens-0.059
 gasp-0.057
 slam-0.056
Token my
Token ID616
Feature activation-2.80e-03

Pos logit contributions
ents+0.038
 opens+0.034
 ago+0.034
 slam+0.032
 widen+0.032
Neg logit contributions
P-0.032
scar-0.031
oo-0.031
idd-0.030
lly-0.029
Token old
Token ID1468
Feature activation-1.22e-01

Pos logit contributions
ents+0.005
 ago+0.005
 opens+0.005
 slam+0.005
 widen+0.005
Neg logit contributions
P-0.004
oo-0.004
scar-0.004
idd-0.004
lly-0.004
Token violin
Token ID38283
Feature activation-3.98e-02

Pos logit contributions
ents+0.215
 opens+0.199
 ago+0.199
 amaz+0.191
 gasp+0.188
Neg logit contributions
P-0.202
oo-0.196
scar-0.193
idd-0.187
lly-0.181
Token back
Token ID736
Feature activation-1.60e-02

Pos logit contributions
ents+0.079
 ago+0.071
 opens+0.071
 slam+0.070
 amaz+0.070
Neg logit contributions
P-0.055
oo-0.053
scar-0.049
lly-0.049
idd-0.049
Token."
Token ID526
Feature activation+1.27e-02

Pos logit contributions
ents+0.033
 opens+0.030
 widen+0.030
 amaz+0.030
 slam+0.030
Neg logit contributions
P-0.020
oo-0.020
scar-0.019
idd-0.019
lly-0.019
Token And
Token ID843
Feature activation+3.99e-02

Pos logit contributions
P+0.023
scar+0.022
oo+0.022
idd+0.021
lly+0.021
Neg logit contributions
ents-0.021
 opens-0.019
 ago-0.019
 slam-0.018
 gasp-0.017
Token so
Token ID523
Feature activation+8.87e-03

Pos logit contributions
oo+0.049
P+0.049
ter+0.048
lly+0.047
scar+0.047
Neg logit contributions
ents-0.082
 widen-0.077
 ago-0.076
 seventeen-0.075
 thirteen-0.075
Token animals
Token ID4695
Feature activation+8.53e-03

Pos logit contributions
P+0.009
scar+0.008
oo+0.008
idd+0.008
w+0.007
Neg logit contributions
ents-0.011
 opens-0.010
 ago-0.009
 slam-0.009
 amaz-0.009
Token were
Token ID547
Feature activation-6.48e-03

Pos logit contributions
P+0.012
oo+0.012
scar+0.011
lly+0.011
idd+0.011
Neg logit contributions
ents-0.017
 ago-0.015
 opens-0.015
 widen-0.015
 slam-0.015
Token watching
Token ID4964
Feature activation+1.84e-02

Pos logit contributions
ents+0.011
 opens+0.010
 ago+0.010
 slam+0.010
 amaz+0.009
Neg logit contributions
P-0.011
oo-0.011
scar-0.011
idd-0.010
lly-0.010
Token him
Token ID683
Feature activation+1.48e-02

Pos logit contributions
P+0.027
oo+0.026
scar+0.025
idd+0.024
lly+0.024
Neg logit contributions
ents-0.037
 ago-0.033
 opens-0.033
 slam-0.032
 widen-0.031
Token with
Token ID351
Feature activation-7.79e-03

Pos logit contributions
oo+0.021
lly+0.019
P+0.019
ter+0.019
scar+0.018
Neg logit contributions
ents-0.029
 ago-0.027
 opens-0.025
 amaz-0.025
 thirteen-0.025
Token amaz
Token ID40642
Feature activation-1.62e-01

Pos logit contributions
ents+0.016
 opens+0.014
 ago+0.014
 widen+0.014
 slam+0.014
Neg logit contributions
P-0.010
oo-0.010
scar-0.010
idd-0.009
lly-0.009
Tokenement
Token ID972
Feature activation-2.45e-02

Pos logit contributions
ents+0.269
 ago+0.236
 opens+0.235
 slam+0.230
 widen+0.224
Neg logit contributions
P-0.289
oo-0.280
scar-0.275
idd-0.267
lly-0.263
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
ents+0.054
 opens+0.049
 slam+0.049
 widen+0.048
 amaz+0.048
Neg logit contributions
oo-0.029
P-0.028
scar-0.026
idd-0.026
lly-0.025
Token
Token ID220
Feature activation-1.27e-02

Pos logit contributions
P+0.024
oo+0.023
scar+0.023
idd+0.022
lly+0.022
Neg logit contributions
ents-0.022
 opens-0.020
 ago-0.020
 slam-0.019
 widen-0.019
Token
Token ID198
Feature activation-8.23e-05

Pos logit contributions
ents+0.023
 ago+0.021
 opens+0.020
 slam+0.019
 widen+0.019
Neg logit contributions
P-0.021
scar-0.020
oo-0.020
idd-0.019
lly-0.019
Token
Token ID198
Feature activation+1.70e-03

Pos logit contributions
ents+0.000
 ago+0.000
 opens+0.000
 slam+0.000
 widen+0.000
Neg logit contributions
P-0.000
oo-0.000
scar-0.000
idd-0.000
lly-0.000
Token bear
Token ID6842
Feature activation+1.58e-02

Pos logit contributions
P+0.062
oo+0.061
idd+0.060
scar+0.059
dd+0.054
Neg logit contributions
 opens-0.068
ents-0.068
 cry-0.058
 slam-0.056
 shelf-0.056
Token told
Token ID1297
Feature activation-2.41e-02

Pos logit contributions
P+0.024
oo+0.022
scar+0.022
idd+0.021
lly+0.021
Neg logit contributions
ents-0.031
 opens-0.028
 ago-0.028
 slam-0.027
 widen-0.027
Token her
Token ID607
Feature activation-3.46e-02

Pos logit contributions
ents+0.054
 opens+0.049
 ago+0.048
 slam+0.047
 amaz+0.046
Neg logit contributions
P-0.032
scar-0.030
idd-0.029
oo-0.028
lly-0.027
Token about
Token ID546
Feature activation-4.82e-02

Pos logit contributions
ents+0.068
 ago+0.061
 opens+0.060
 widen+0.059
 slam+0.059
Neg logit contributions
P-0.051
oo-0.049
scar-0.049
lly-0.046
idd-0.046
Token its
Token ID663
Feature activation-3.57e-02

Pos logit contributions
ents+0.093
 ago+0.082
 opens+0.082
 slam+0.079
 widen+0.079
Neg logit contributions
P-0.075
oo-0.071
scar-0.071
idd-0.068
lly-0.066
Token old
Token ID1468
Feature activation-1.31e-01

Pos logit contributions
ents+0.070
 opens+0.058
 ago+0.057
 widen+0.057
 slam+0.056
Neg logit contributions
oo-0.053
P-0.052
scar-0.051
lly-0.049
ter-0.047
Token home
Token ID1363
Feature activation-1.81e-02

Pos logit contributions
ents+0.239
 ago+0.221
 opens+0.218
 slam+0.209
 amaz+0.207
Neg logit contributions
P-0.213
scar-0.202
oo-0.201
idd-0.193
lly-0.187
Token and
Token ID290
Feature activation-1.15e-02

Pos logit contributions
ents+0.036
 widen+0.033
 amaz+0.032
 thirteen+0.032
 opens+0.032
Neg logit contributions
oo-0.024
ter-0.022
scar-0.022
lly-0.022
P-0.022
Token friends
Token ID2460
Feature activation+3.13e-02

Pos logit contributions
ents+0.024
 ago+0.022
 widen+0.022
 slam+0.021
 opens+0.021
Neg logit contributions
P-0.015
scar-0.014
lly-0.014
oo-0.014
idd-0.013
Token.
Token ID13
Feature activation+1.88e-03

Pos logit contributions
P+0.046
oo+0.043
scar+0.043
idd+0.041
lly+0.039
Neg logit contributions
ents-0.063
 ago-0.057
 opens-0.057
 slam-0.055
 widen-0.054
Token The
Token ID383
Feature activation-1.13e-02

Pos logit contributions
P+0.004
oo+0.004
scar+0.004
idd+0.003
lly+0.003
Neg logit contributions
ents-0.003
 ago-0.002
 opens-0.002
 slam-0.002
 widen-0.002
Token he
Token ID339
Feature activation-1.45e-02

Pos logit contributions
P+0.011
oo+0.011
scar+0.010
idd+0.010
uc+0.010
Neg logit contributions
ents-0.013
 opens-0.012
 ago-0.012
 slam-0.011
 gasp-0.011
Token had
Token ID550
Feature activation-2.24e-02

Pos logit contributions
ents+0.024
 opens+0.021
 ago+0.021
 slam+0.019
 gasp+0.019
Neg logit contributions
P-0.026
oo-0.026
scar-0.024
idd-0.024
ter-0.023
Token lost
Token ID2626
Feature activation+1.92e-02

Pos logit contributions
ents+0.041
 ago+0.038
 opens+0.036
 slam+0.034
 widen+0.034
Neg logit contributions
P-0.037
scar-0.036
oo-0.035
idd-0.033
uc-0.032
Token a
Token ID257
Feature activation+7.13e-03

Pos logit contributions
P+0.030
scar+0.029
oo+0.028
lly+0.027
idd+0.026
Neg logit contributions
ents-0.038
 ago-0.035
 opens-0.034
 slam-0.032
 distances-0.032
Token week
Token ID1285
Feature activation-3.96e-02

Pos logit contributions
P+0.011
oo+0.011
scar+0.010
idd+0.010
lly+0.010
Neg logit contributions
ents-0.014
 ago-0.012
 opens-0.012
 slam-0.012
 widen-0.012
Token ago
Token ID2084
Feature activation-1.50e-01

Pos logit contributions
ents+0.100
 widen+0.092
 opens+0.090
 amaz+0.089
 thirteen+0.087
Neg logit contributions
P-0.028
idd-0.026
oo-0.025
ter-0.023
lly-0.022
Token.
Token ID13
Feature activation+5.56e-03

Pos logit contributions
ents+0.334
 opens+0.302
 widen+0.296
 slam+0.294
 ago+0.293
Neg logit contributions
P-0.181
oo-0.175
scar-0.169
idd-0.163
lly-0.156
Token He
Token ID679
Feature activation+2.96e-04

Pos logit contributions
P+0.009
oo+0.009
scar+0.009
idd+0.009
lly+0.008
Neg logit contributions
ents-0.010
 opens-0.008
 widen-0.008
 ago-0.008
 gasp-0.008
Token wanted
Token ID2227
Feature activation-2.26e-02

Pos logit contributions
P+0.000
oo+0.000
scar+0.000
idd+0.000
lly+0.000
Neg logit contributions
ents-0.001
 opens-0.000
 ago-0.000
 slam-0.000
 gasp-0.000
Token to
Token ID284
Feature activation-1.80e-02

Pos logit contributions
ents+0.040
 opens+0.035
 ago+0.034
 slam+0.033
 widen+0.031
Neg logit contributions
P-0.042
scar-0.039
oo-0.039
idd-0.038
lly-0.036
Token get
Token ID651
Feature activation-1.38e-02

Pos logit contributions
ents+0.032
 ago+0.029
 opens+0.028
 widen+0.027
 amaz+0.027
Neg logit contributions
P-0.030
oo-0.029
scar-0.029
idd-0.028
lly-0.028
Token you
Token ID345
Feature activation-3.71e-02

Pos logit contributions
ents+0.083
 ago+0.076
 opens+0.075
 slam+0.073
 widen+0.073
Neg logit contributions
P-0.047
oo-0.045
scar-0.044
idd-0.041
lly-0.041
Token okay
Token ID8788
Feature activation-4.02e-02

Pos logit contributions
ents+0.067
 ago+0.059
 opens+0.059
 widen+0.057
 gasp+0.057
Neg logit contributions
P-0.062
scar-0.059
oo-0.058
idd-0.056
lly-0.055
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
ents+0.080
 ago+0.072
 opens+0.072
 slam+0.069
 widen+0.069
Neg logit contributions
P-0.060
oo-0.057
scar-0.056
idd-0.054
lly-0.052
Token my
Token ID616
Feature activation+2.54e-02

Pos logit contributions
ents+0.030
 distances+0.028
 ago+0.028
 widen+0.028
 opens+0.028
Neg logit contributions
oo-0.020
ter-0.020
P-0.020
idd-0.018
scar-0.018
Token d
Token ID288
Feature activation-1.82e-02

Pos logit contributions
P+0.050
scar+0.046
mate+0.042
idd+0.041
B+0.041
Neg logit contributions
ents-0.041
 ago-0.037
 gasp-0.036
 opens-0.035
 amaz-0.034
Tokenarl
Token ID7063
Feature activation-1.34e-01

Pos logit contributions
ents+0.024
 ago+0.019
 opens+0.018
 slam+0.018
 widen+0.017
Neg logit contributions
P-0.040
scar-0.039
lly-0.038
oo-0.037
idd-0.037
Tokenings
Token ID654
Feature activation-1.32e-02

Pos logit contributions
ents+0.309
 ago+0.288
 opens+0.286
 slam+0.275
 widen+0.275
Neg logit contributions
P-0.150
scar-0.139
oo-0.136
idd-0.132
lly-0.127
Token?"
Token ID1701
Feature activation-2.64e-02

Pos logit contributions
ents+0.032
 ago+0.029
 opens+0.029
 amaz+0.028
 slam+0.028
Neg logit contributions
P-0.014
oo-0.013
scar-0.013
idd-0.012
lly-0.012
Token Mom
Token ID11254
Feature activation+2.08e-02

Pos logit contributions
ents+0.055
 ago+0.049
 opens+0.049
 widen+0.048
 distances+0.047
Neg logit contributions
P-0.037
oo-0.035
scar-0.034
lly-0.032
idd-0.032
Token asked
Token ID1965
Feature activation-1.60e-02

Pos logit contributions
P+0.027
oo+0.027
scar+0.025
idd+0.024
lly+0.024
Neg logit contributions
ents-0.042
 amaz-0.041
 ago-0.041
 distances-0.040
 widen-0.040
Token.
Token ID13
Feature activation-2.46e-03

Pos logit contributions
ents+0.032
 ago+0.028
 opens+0.028
 slam+0.027
 gasp+0.026
Neg logit contributions
scar-0.025
P-0.025
oo-0.023
lly-0.023
idd-0.022