TOP ACTIVATIONS
MAX = 4.879

Can I hold your pe b ble , Sara ?" Tom
" I found a pe b ble at the beach ,"
you can hold my pe b ble . But be careful
I am a magical pe b ble and I ’
want to share my pe b ble ? We can both
get sick . He had an idea . "
I understand . I have an idea . Let 's find
, " What a great idea . Let 's go !"
is a very nice pe b ble . You are a
said , " I have an idea ! I 'll climb
L ily , I have an idea . Why don 't
. Then , he had an idea . "
said , " I have an idea ! I will share
Dad . All ie smiled and they both
said . " I have an idea . Look , there
" A pe b ble , Mom my !
, the giant horse had an idea . He said "
said , " I have an idea !" She put the
?" Tom had an idea . He wanted to
said , " I have an idea . Let 's go

MIN ACTIVATIONS
MIN = -5.146

it to me for safe keeping . He told me to
the officer safely took the bad man away and everyone went
. She shouted at the bad man with the gun ,
and they went to the checkout . The little boy smiled
, they went to the checkout . The cash ier said
lay still in Mum ’s arms . Mum
you go , my d arl ings . Enjoy your lunch
grandma 's house for safe keeping . When
Then he went to the checkout and paid for it .
. They walked past the bad dog and continued their walk
, they went to the checkout counter . M
Then they went to the checkout . Daisy was feeling a
carefully counted out everyone ’s share .
off they went to the checkout . P hew , Mum
shelf and went to the checkout . They paid for it
should remember to stay close by ," she said .
they reached the doctor ’s office , his
two friends Jack and Jill .
As they walked to the checkout , Tim my noticed his
saved Tim my from the bad man . Tim my was

INTERVAL 2.373 - 4.879
CONTAINS 0.498%

One day , Lily was playing in the park when she
and ears . Lily cried out in pain and fear .
rabbit could finally sleep , all wrapped in the soft leaves
mom gave him a brilliant idea . She said , "
was so upset and cried all the way home . Her

INTERVAL -0.133 - 2.373
CONTAINS 55.477%

all day playing with his new chess set . He played
her mom had taught her and quickly turned off the oven
in the air ." " OK ," Sam says
1 , 2 , 3 , 4 and so on .
to smile and even laughed a little . Tim my 's

INTERVAL -2.640 - -0.133
CONTAINS 43.723%

Anna , forgive Ben and let him play with your car
She put the newspaper back on the floor and said goodbye
, the tail , the nose , the prope ller .
the ladder again . " Please , Lily ,
the house next door . They are glad they are safe

INTERVAL -5.146 - -2.640
CONTAINS 0.301%

. She went back to the classroom with the
. You need to wash off the sand and cool down
with the funny toys all day . And every time she
the courage and ran back home . She was no longer
happy and excited â he 'd just seen
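
The four INTERVAL boundaries listed above look consistent with splitting the range from MIN to MAX into four equal-width bins. That is an inference from the numbers, not something the dashboard states, and the helper name `interval_edges` is hypothetical. A minimal sketch that recomputes the edges and checks the CONTAINS percentages, under that assumption:

```python
# Values copied from the dashboard above.
MIN_ACT = -5.146
MAX_ACT = 4.879
LISTED_EDGES = [-5.146, -2.640, -0.133, 2.373, 4.879]
LISTED_PERCENTS = [0.301, 43.723, 55.477, 0.498]


def interval_edges(lo, hi, n_bins=4):
    """Hypothetical helper: n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Assumption: the dashboard uses equal-width bins over [MIN, MAX].
edges = interval_edges(MIN_ACT, MAX_ACT)

# Largest difference between a computed edge and the edge shown above.
max_gap = max(abs(e - listed) for e, listed in zip(edges, LISTED_EDGES))

# The four CONTAINS percentages should account for nearly all activations.
total_percent = sum(LISTED_PERCENTS)

print("computed edges:", edges)
print("largest gap to listed edges:", max_gap)
print("sum of CONTAINS percentages:", total_percent)
```

If the equal-width assumption holds, each computed edge should match the listed one up to the three-decimal rounding of the display, and the percentages should sum to roughly 100. Read this way, the two outer intervals together hold under 1% of activations.
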
TokenCan
Token ID6090
Feature activation-3.22e-01

Pos logit contributions
Po+0.015
G+0.014
Cr+0.014
cious+0.014
S+0.014
Neg logit contributions
run-0.025
 of-0.023
 older-0.023
 swift-0.022
iddled-0.022
Token I
Token ID314
Feature activation+1.85e+00

Pos logit contributions
Po+0.007
cious+0.006
Cr+0.006
 drowned+0.006
ork+0.006
Neg logit contributions
run-0.007
iddled-0.007
 swift-0.006
 of-0.006
 older-0.006
Token hold
Token ID1745
Feature activation+1.21e+00

Pos logit contributions
run+0.044
iddled+0.043
bars+0.042
box+0.041
 swift+0.040
Neg logit contributions
Po-0.033
act-0.031
cious-0.031
G-0.030
ork-0.030
Token your
Token ID534
Feature activation+6.68e-01

Pos logit contributions
run+0.031
iddled+0.031
 swift+0.030
bars+0.029
box+0.029
Neg logit contributions
Po-0.018
act-0.016
cious-0.016
Pear-0.016
S-0.015
Token pe
Token ID613
Feature activation+2.85e+00

Pos logit contributions
iddled+0.016
old+0.015
bars+0.014
 Faster+0.014
 Step+0.014
Neg logit contributions
 peanut-0.013
Po-0.013
 sting-0.013
 pulp-0.012
ork-0.012
Tokenb
Token ID65
Feature activation+4.86e+00

Pos logit contributions
run+0.064
 Jake+0.063
 of+0.061
iddled+0.060
 Faster+0.060
Neg logit contributions
act-0.052
Po-0.051
cious-0.051
Pear-0.047
Cr-0.047
Tokenble
Token ID903
Feature activation+1.67e+00

Pos logit contributions
run+0.144
 Jake+0.142
 fascination+0.142
 Step+0.142
 swift+0.141
Neg logit contributions
sters-0.049
Po-0.049
act-0.048
mar-0.047
cious-0.047
Token,
Token ID11
Feature activation+1.87e+00

Pos logit contributions
run+0.040
iddled+0.040
 fascination+0.037
 Step+0.037
 Faster+0.036
Neg logit contributions
Po-0.027
mar-0.027
act-0.026
ork-0.025
S-0.025
Token Sara
Token ID24799
Feature activation-5.30e-01

Pos logit contributions
iddled+0.044
run+0.042
bars+0.041
 Step+0.039
illed+0.038
Neg logit contributions
 peanut-0.036
 Gig-0.035
Cr-0.035
Po-0.035
S-0.033
Token?"
Token ID1701
Feature activation+6.82e-01

Pos logit contributions
Po+0.009
cious+0.009
act+0.009
G+0.009
Cr+0.009
Neg logit contributions
run-0.013
iddled-0.012
 swift-0.012
bars-0.012
 of-0.011
Token Tom
Token ID4186
Feature activation+1.67e+00

Pos logit contributions
run+0.016
iddled+0.015
box+0.014
ay+0.014
bars+0.014
Neg logit contributions
Po-0.013
G-0.013
cious-0.013
 Gig-0.013
S-0.012
Token"
Token ID1
Feature activation-1.18e+00

Pos logit contributions
cious+0.011
Po+0.011
act+0.010
 drowned+0.010
G+0.010
Neg logit contributions
run-0.012
 of-0.011
 swift-0.011
iddled-0.011
 older-0.011
TokenI
Token ID40
Feature activation-4.17e-01

Pos logit contributions
cious+0.018
Po+0.018
act+0.018
 drowned+0.017
ork+0.017
Neg logit contributions
run-0.031
 of-0.029
 swift-0.029
iddled-0.028
 older-0.028
Token found
Token ID1043
Feature activation+2.09e+00

Pos logit contributions
Po+0.007
cious+0.007
act+0.007
 drowned+0.007
G+0.007
Neg logit contributions
run-0.010
 of-0.009
 swift-0.009
iddled-0.009
 older-0.009
Token a
Token ID257
Feature activation+5.32e-01

Pos logit contributions
run+0.050
bars+0.048
iddled+0.048
 swift+0.047
 Step+0.045
Neg logit contributions
Po-0.036
cious-0.035
 Gig-0.035
act-0.035
G-0.033
Token pe
Token ID613
Feature activation+2.65e+00

Pos logit contributions
run+0.013
iddled+0.012
bars+0.012
 Step+0.012
 of+0.011
Neg logit contributions
Po-0.010
cious-0.010
act-0.009
ork-0.009
G-0.009
Tokenb
Token ID65
Feature activation+4.75e+00

Pos logit contributions
run+0.058
 Jake+0.054
 of+0.053
iddled+0.053
 swift+0.052
Neg logit contributions
act-0.051
Po-0.051
cious-0.051
ork-0.048
G-0.047
Tokenble
Token ID903
Feature activation+1.54e+00

Pos logit contributions
run+0.136
 swift+0.130
 Jake+0.129
 Step+0.129
 Faster+0.127
Neg logit contributions
Po-0.057
cious-0.056
sters-0.053
act-0.053
ork-0.052
Token at
Token ID379
Feature activation+2.66e+00

Pos logit contributions
run+0.036
iddled+0.036
 Faster+0.033
 Step+0.033
 swift+0.033
Neg logit contributions
cious-0.027
Po-0.026
act-0.026
ork-0.025
 pits-0.025
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
run+0.066
iddled+0.060
 swift+0.058
bars+0.058
 of+0.054
Neg logit contributions
 drowned-0.044
 pits-0.044
 Gig-0.043
Po-0.043
Cr-0.042
Token beach
Token ID10481
Feature activation-5.64e-01

Pos logit contributions
cious+0.004
act+0.004
Po+0.004
Cr+0.004
G+0.004
Neg logit contributions
run-0.007
iddled-0.006
 of-0.006
 swift-0.006
bars-0.006
Token,"
Token ID553
Feature activation+7.09e-01

Pos logit contributions
cious+0.009
act+0.009
Po+0.008
 bank+0.008
Cr+0.008
Neg logit contributions
run-0.015
iddled-0.014
 swift-0.013
 Faster-0.013
 Step-0.013
Token you
Token ID345
Feature activation+5.89e-01

Pos logit contributions
iddled+0.033
run+0.030
box+0.028
bars+0.028
 swift+0.028
Neg logit contributions
Po-0.017
Cr-0.017
S-0.016
cious-0.016
 Gig-0.016
Token can
Token ID460
Feature activation+2.74e-01

Pos logit contributions
run+0.012
iddled+0.010
 swift+0.010
bars+0.010
 of+0.010
Neg logit contributions
Po-0.013
cious-0.013
 drowned-0.013
act-0.013
Cr-0.012
Token hold
Token ID1745
Feature activation+2.99e-01

Pos logit contributions
run+0.006
 of+0.005
 swift+0.005
 older+0.005
iddled+0.005
Neg logit contributions
cious-0.006
Po-0.006
act-0.006
 drowned-0.005
G-0.005
Token my
Token ID616
Feature activation+1.12e+00

Pos logit contributions
run+0.007
iddled+0.007
 swift+0.007
 Step+0.007
bars+0.007
Neg logit contributions
act-0.005
Po-0.005
 pits-0.004
Pear-0.004
S-0.004
Token pe
Token ID613
Feature activation+3.09e+00

Pos logit contributions
run+0.031
iddled+0.031
 Jake+0.029
bars+0.029
 Step+0.028
Neg logit contributions
act-0.016
Po-0.016
cious-0.014
actus-0.014
 pits-0.014
Tokenb
Token ID65
Feature activation+4.71e+00

Pos logit contributions
run+0.068
 Jake+0.066
iddled+0.064
 of+0.064
 Faster+0.063
Neg logit contributions
act-0.058
Po-0.058
cious-0.057
ork-0.054
Pear-0.053
Tokenble
Token ID903
Feature activation+1.02e+00

Pos logit contributions
run+0.138
 Jake+0.136
 Little+0.134
 swift+0.134
 fascination+0.134
Neg logit contributions
Po-0.052
sters-0.051
act-0.050
cious-0.050
uts-0.048
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
iddled+0.026
run+0.026
 Faster+0.023
 Step+0.023
 fascination+0.023
Neg logit contributions
cious-0.016
Po-0.016
act-0.016
mar-0.015
ork-0.015
Token But
Token ID887
Feature activation+1.21e+00

Pos logit contributions
Po+0.002
 Gig+0.002
cious+0.002
 Pink+0.002
Cr+0.002
Neg logit contributions
iddled-0.004
run-0.004
box-0.004
 older-0.004
 of-0.004
Token be
Token ID307
Feature activation+2.02e-01

Pos logit contributions
iddled+0.035
run+0.032
old+0.030
uster+0.029
 swift+0.029
Neg logit contributions
Po-0.017
Cr-0.017
 die-0.015
mar-0.015
 Gig-0.015
Token careful
Token ID8161
Feature activation+5.36e-02

Pos logit contributions
bars+0.004
run+0.004
iddled+0.004
mith+0.004
 Jake+0.004
Neg logit contributions
S-0.004
Cr-0.004
Po-0.004
G-0.004
cious-0.004
TokenI
Token ID40
Feature activation-3.98e-01

Pos logit contributions
cious+0.016
G+0.015
Po+0.015
S+0.014
Cr+0.014
Neg logit contributions
run-0.018
 older-0.017
 swift-0.017
iddled-0.017
box-0.016
Token am
Token ID716
Feature activation+4.59e-01

Pos logit contributions
Po+0.005
cious+0.005
G+0.005
ork+0.005
act+0.005
Neg logit contributions
run-0.011
bars-0.011
 older-0.011
iddled-0.011
box-0.011
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
bars+0.012
iddled+0.011
isher+0.010
run+0.010
box+0.010
Neg logit contributions
cious-0.007
 Gig-0.007
G-0.007
Po-0.007
 Pink-0.007
Token magical
Token ID10883
Feature activation+1.68e+00

Pos logit contributions
run+0.026
 of+0.024
iddled+0.023
 swift+0.023
bars+0.023
Neg logit contributions
cious-0.025
Po-0.025
act-0.024
 drowned-0.023
G-0.023
Token pe
Token ID613
Feature activation+2.86e+00

Pos logit contributions
run+0.041
iddled+0.040
bars+0.038
 Step+0.037
 Faster+0.037
Neg logit contributions
Po-0.030
cious-0.030
act-0.029
ork-0.029
actus-0.028
Tokenb
Token ID65
Feature activation+4.67e+00

Pos logit contributions
run+0.066
 Jake+0.065
 Step+0.063
iddled+0.063
 Faster+0.062
Neg logit contributions
cious-0.053
act-0.051
Po-0.050
ork-0.048
G-0.046
Tokenble
Token ID903
Feature activation+1.78e+00

Pos logit contributions
run+0.138
 Step+0.133
 Faster+0.132
bars+0.131
 swift+0.131
Neg logit contributions
cious-0.058
Po-0.051
ork-0.048
sters-0.048
Pear-0.047
Token and
Token ID290
Feature activation+6.54e-01

Pos logit contributions
iddled+0.047
run+0.045
 Step+0.044
 Faster+0.043
 fascination+0.043
Neg logit contributions
cious-0.027
Po-0.024
ache-0.024
ork-0.023
act-0.023
Token I
Token ID314
Feature activation+1.87e-02

Pos logit contributions
iddled+0.016
run+0.016
uster+0.015
 swift+0.015
icking+0.015
Neg logit contributions
cious-0.010
Po-0.009
G-0.009
 Gig-0.009
 peanut-0.009
Tokenâ
Token ID22940
Feature activation+6.20e-01

Pos logit contributions
bars+0.001
run+0.001
 older+0.000
iddled+0.000
 swift+0.000
Neg logit contributions
Po-0.000
 drowned-0.000
cious-0.000
ork-0.000
act-0.000
Token
Token ID26391
Feature activation-2.46e+00

Pos logit contributions
run+0.016
 older+0.015
 swift+0.015
 of+0.015
bars+0.014
Neg logit contributions
cious-0.010
G-0.010
Po-0.010
mar-0.010
Cr-0.010
Token want
Token ID765
Feature activation+1.94e-01

Pos logit contributions
run+0.045
bars+0.041
iddled+0.041
 swift+0.041
 of+0.039
Neg logit contributions
Po-0.035
cious-0.035
act-0.034
 drowned-0.034
Cr-0.033
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
run+0.005
iddled+0.005
 swift+0.005
bars+0.005
 Step+0.004
Neg logit contributions
Po-0.003
act-0.003
Cr-0.003
 peanut-0.003
 drowned-0.003
Token share
Token ID2648
Feature activation-2.26e+00

Pos logit contributions
 of+0.016
run+0.016
 welcome+0.015
 older+0.015
 courage+0.014
Neg logit contributions
cious-0.026
act-0.025
Po-0.025
 drowned-0.024
actus-0.024
Token my
Token ID616
Feature activation+8.24e-01

Pos logit contributions
Po+0.049
cious+0.049
act+0.049
 drowned+0.047
G+0.047
Neg logit contributions
run-0.045
 of-0.040
 swift-0.040
iddled-0.039
 older-0.039
Token pe
Token ID613
Feature activation+3.03e+00

Pos logit contributions
run+0.022
iddled+0.021
bars+0.021
 Jake+0.021
 of+0.021
Neg logit contributions
Po-0.012
act-0.012
cious-0.012
Cr-0.011
G-0.011
Tokenb
Token ID65
Feature activation+4.66e+00

Pos logit contributions
run+0.063
 of+0.058
 swift+0.057
iddled+0.057
 older+0.055
Neg logit contributions
cious-0.062
Po-0.062
act-0.061
Cr-0.058
ork-0.058
Tokenble
Token ID903
Feature activation+1.27e+00

Pos logit contributions
run+0.124
 swift+0.118
bars+0.117
iddled+0.116
 Jake+0.116
Neg logit contributions
Po-0.066
cious-0.066
act-0.065
Cr-0.063
ork-0.062
Token?
Token ID30
Feature activation+3.78e-01

Pos logit contributions
run+0.026
iddled+0.024
 swift+0.024
bars+0.023
 courage+0.023
Neg logit contributions
Po-0.026
act-0.026
cious-0.026
Cr-0.025
ork-0.025
Token We
Token ID775
Feature activation-5.58e-01

Pos logit contributions
iddled+0.008
 courage+0.008
run+0.008
 swift+0.008
 older+0.008
Neg logit contributions
Po-0.007
cious-0.007
Cr-0.006
Pear-0.006
uts-0.006
Token can
Token ID460
Feature activation-2.11e-01

Pos logit contributions
Po+0.013
act+0.012
cious+0.012
 drowned+0.012
ork+0.012
Neg logit contributions
run-0.011
 swift-0.010
iddled-0.009
bars-0.009
 older-0.009
Token both
Token ID1111
Feature activation+1.88e+00

Pos logit contributions
Po+0.004
cious+0.004
act+0.004
 drowned+0.004
G+0.004
Neg logit contributions
run-0.005
 of-0.004
 swift-0.004
iddled-0.004
 older-0.004
Token get
Token ID651
Feature activation-6.77e-01

Pos logit contributions
act+0.011
ork+0.011
cious+0.011
actus+0.011
Pear+0.010
Neg logit contributions
 welcome-0.008
 of-0.008
 older-0.008
 cheer-0.007
run-0.007
Token sick
Token ID6639
Feature activation-1.05e+00

Pos logit contributions
cious+0.011
Po+0.011
act+0.011
 drowned+0.010
G+0.010
Neg logit contributions
run-0.017
 of-0.016
 swift-0.016
 older-0.015
iddled-0.015
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
Po+0.021
cious+0.021
act+0.021
 drowned+0.020
G+0.020
Neg logit contributions
run-0.022
 swift-0.020
 of-0.020
iddled-0.020
 older-0.019
Token He
Token ID679
Feature activation-7.71e-01

Pos logit contributions
run+0.006
 of+0.005
iddled+0.005
 swift+0.005
bars+0.005
Neg logit contributions
Po-0.007
cious-0.007
act-0.007
G-0.007
Pear-0.007
Token had
Token ID550
Feature activation-5.35e-01

Pos logit contributions
Po+0.014
cious+0.014
act+0.014
 drowned+0.014
Cr+0.014
Neg logit contributions
run-0.018
 of-0.016
 swift-0.016
iddled-0.016
 older-0.016
Token an
Token ID281
Feature activation+4.63e+00

Pos logit contributions
cious+0.011
Po+0.011
act+0.011
 drowned+0.010
G+0.010
Neg logit contributions
run-0.011
 of-0.010
 swift-0.010
 older-0.010
iddled-0.010
Token idea
Token ID2126
Feature activation+3.12e+00

Pos logit contributions
run+0.056
bars+0.045
 of+0.045
iddled+0.045
 swift+0.043
Neg logit contributions
cious-0.137
Po-0.136
act-0.135
 gag-0.133
 drowned-0.132
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
iddled+0.085
bars+0.085
run+0.083
 pleasure+0.083
 swift+0.083
Neg logit contributions
cious-0.039
Po-0.038
mar-0.037
S-0.036
ork-0.035
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
run+0.018
iddled+0.018
bars+0.017
box+0.017
 courage+0.017
Neg logit contributions
Po-0.017
S-0.016
Pear-0.016
G-0.016
cious-0.016
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
run+0.024
iddled+0.022
bars+0.022
 older+0.022
 swift+0.022
Neg logit contributions
Po-0.021
G-0.021
ork-0.020
cious-0.020
Pear-0.020
Token"
Token ID1
Feature activation-7.82e-01

Pos logit contributions
run+0.005
 of+0.004
 swift+0.004
iddled+0.004
 older+0.004
Neg logit contributions
cious-0.004
act-0.004
Po-0.004
 drowned-0.004
ork-0.004
TokenI
Token ID40
Feature activation+1.26e-01

Pos logit contributions
cious+0.004
Po+0.004
G+0.004
act+0.004
Cr+0.004
Neg logit contributions
run-0.005
 older-0.005
 swift-0.005
 of-0.005
iddled-0.005
Token understand
Token ID1833
Feature activation+2.27e-01

Pos logit contributions
bars+0.003
run+0.003
iddled+0.003
box+0.003
 older+0.003
Neg logit contributions
Po-0.002
ork-0.002
 drowned-0.002
G-0.002
act-0.002
Token.
Token ID13
Feature activation+8.83e-01

Pos logit contributions
iddled+0.005
run+0.005
 swift+0.005
bars+0.005
 Faster+0.005
Neg logit contributions
Po-0.004
cious-0.004
S-0.004
Cr-0.004
ork-0.003
Token I
Token ID314
Feature activation-5.38e-02

Pos logit contributions
iddled+0.021
 swift+0.020
box+0.020
icking+0.019
ma+0.019
Neg logit contributions
Po-0.014
G-0.014
 Gig-0.014
S-0.013
Cr-0.013
Token have
Token ID423
Feature activation+6.67e-03

Pos logit contributions
Po+0.001
 drowned+0.001
ork+0.001
cious+0.001
act+0.001
Neg logit contributions
run-0.001
bars-0.001
iddled-0.001
 older-0.001
 swift-0.001
Token an
Token ID281
Feature activation+4.63e+00

Pos logit contributions
run+0.000
iddled+0.000
bars+0.000
 swift+0.000
mith+0.000
Neg logit contributions
Po-0.000
 drowned-0.000
cious-0.000
Cr-0.000
G-0.000
Token idea
Token ID2126
Feature activation+3.11e+00

Pos logit contributions
run+0.088
iddled+0.080
 Jake+0.079
bars+0.078
 Faster+0.073
Neg logit contributions
 gag-0.102
act-0.099
 sting-0.099
 harmful-0.098
Po-0.098
Token.
Token ID13
Feature activation+8.53e-01

Pos logit contributions
iddled+0.097
bars+0.096
 fascination+0.095
 swift+0.095
 younger+0.093
Neg logit contributions
cious-0.025
uts-0.025
mar-0.025
S-0.024
Po-0.022
Token Let
Token ID3914
Feature activation+6.43e-01

Pos logit contributions
iddled+0.019
 swift+0.019
box+0.018
 fascination+0.018
ma+0.018
Neg logit contributions
Po-0.014
 Gig-0.014
G-0.014
cious-0.014
S-0.013
Token's
Token ID338
Feature activation+1.67e+00

Pos logit contributions
run+0.016
 swift+0.015
iddled+0.014
bars+0.014
 of+0.014
Neg logit contributions
Po-0.011
cious-0.011
act-0.011
G-0.011
 drowned-0.011
Token find
Token ID1064
Feature activation+1.58e+00

Pos logit contributions
run+0.043
iddled+0.039
bars+0.039
 swift+0.038
 Faster+0.036
Neg logit contributions
Po-0.027
 drowned-0.027
 sting-0.026
cious-0.026
 gag-0.026
Token,
Token ID11
Feature activation+8.51e-01

Pos logit contributions
run+0.010
 swift+0.009
bars+0.009
iddled+0.009
 older+0.009
Neg logit contributions
cious-0.009
Po-0.009
ork-0.008
act-0.008
Cr-0.008
Token "
Token ID366
Feature activation-7.02e-01

Pos logit contributions
run+0.014
 swift+0.012
iddled+0.012
 of+0.012
bars+0.012
Neg logit contributions
Po-0.022
cious-0.022
G-0.021
Cr-0.021
Pear-0.021
TokenWhat
Token ID2061
Feature activation-8.45e-01

Pos logit contributions
Po+0.011
G+0.011
cious+0.011
S+0.010
Cr+0.010
Neg logit contributions
run-0.019
 older-0.018
 of-0.017
 swift-0.017
iddled-0.017
Token a
Token ID257
Feature activation+6.09e-01

Pos logit contributions
cious+0.015
Po+0.013
mar+0.013
ork+0.013
Cr+0.013
Neg logit contributions
run-0.021
iddled-0.019
bars-0.019
uster-0.019
 swift-0.018
Token great
Token ID1049
Feature activation+1.17e+00

Pos logit contributions
run+0.015
bars+0.014
iddled+0.014
 of+0.014
 older+0.013
Neg logit contributions
Po-0.011
cious-0.011
Cr-0.010
G-0.010
act-0.010
Token idea
Token ID2126
Feature activation+4.62e+00

Pos logit contributions
run+0.022
bars+0.020
iddled+0.020
 Step+0.017
 Faster+0.017
Neg logit contributions
ork-0.027
Po-0.027
S-0.026
G-0.026
Cr-0.026
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
bars+0.136
run+0.133
iddled+0.132
 swift+0.132
 fascination+0.128
Neg logit contributions
cious-0.050
Po-0.049
uts-0.049
 Gig-0.048
 sting-0.046
Token Let
Token ID3914
Feature activation+1.37e+00

Pos logit contributions
 swift+0.026
iddled+0.025
run+0.024
 older+0.024
 of+0.024
Neg logit contributions
Po-0.021
cious-0.021
 Gig-0.020
Pear-0.020
 Pink-0.019
Token's
Token ID338
Feature activation+1.47e+00

Pos logit contributions
run+0.034
 swift+0.030
iddled+0.030
bars+0.030
 of+0.029
Neg logit contributions
Po-0.024
cious-0.024
act-0.024
 drowned-0.023
G-0.023
Token go
Token ID467
Feature activation+2.12e-01

Pos logit contributions
run+0.035
bars+0.031
 swift+0.031
iddled+0.031
 of+0.030
Neg logit contributions
Po-0.027
cious-0.027
 drowned-0.026
G-0.026
Pear-0.026
Token!"
Token ID2474
Feature activation+1.48e+00

Pos logit contributions
run+0.005
 swift+0.004
iddled+0.004
bars+0.004
 older+0.004
Neg logit contributions
cious-0.004
Po-0.004
act-0.004
 drowned-0.004
G-0.004
Token is
Token ID318
Feature activation+9.69e-01

Pos logit contributions
run+0.030
 of+0.027
 swift+0.027
iddled+0.026
 older+0.026
Neg logit contributions
Po-0.032
cious-0.032
act-0.032
 drowned-0.031
G-0.031
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
run+0.023
bars+0.022
iddled+0.022
 swift+0.020
 of+0.020
Neg logit contributions
Cr-0.018
G-0.017
Po-0.017
cious-0.017
S-0.017
Token very
Token ID845
Feature activation+1.06e+00

Pos logit contributions
run+0.028
iddled+0.025
bars+0.025
 of+0.025
 swift+0.024
Neg logit contributions
Po-0.022
cious-0.022
act-0.021
Cr-0.021
G-0.021
Token nice
Token ID3621
Feature activation+1.21e-01

Pos logit contributions
run+0.022
 of+0.020
 swift+0.019
iddled+0.019
 older+0.019
Neg logit contributions
Po-0.022
cious-0.022
act-0.022
 drowned-0.021
G-0.021
Token pe
Token ID613
Feature activation+2.85e+00

Pos logit contributions
run+0.003
iddled+0.003
 swift+0.003
bars+0.003
 older+0.003
Neg logit contributions
Po-0.002
act-0.002
cious-0.002
G-0.002
Cr-0.002
Tokenb
Token ID65
Feature activation+4.62e+00

Pos logit contributions
run+0.061
iddled+0.055
 of+0.055
 Jake+0.055
 swift+0.054
Neg logit contributions
cious-0.058
act-0.057
Po-0.057
ork-0.054
Cr-0.053
Tokenble
Token ID903
Feature activation+1.57e+00

Pos logit contributions
run+0.135
 Faster+0.128
 Step+0.128
 swift+0.128
iddled+0.128
Neg logit contributions
cious-0.056
Po-0.052
sters-0.051
act-0.050
uts-0.050
Token.
Token ID13
Feature activation+7.76e-02

Pos logit contributions
iddled+0.043
run+0.041
 Faster+0.039
 Step+0.039
 fascination+0.039
Neg logit contributions
cious-0.024
uts-0.022
act-0.022
Po-0.021
ache-0.021
Token You
Token ID921
Feature activation+3.01e-02

Pos logit contributions
iddled+0.002
 older+0.002
run+0.002
 swift+0.002
 courage+0.002
Neg logit contributions
Po-0.001
cious-0.001
 Gig-0.001
Cr-0.001
 Pink-0.001
Token are
Token ID389
Feature activation+7.43e-01

Pos logit contributions
run+0.001
 swift+0.001
 of+0.001
bars+0.001
iddled+0.001
Neg logit contributions
Po-0.001
cious-0.001
act-0.001
 drowned-0.001
Cr-0.001
Token a
Token ID257
Feature activation+1.79e+00

Pos logit contributions
run+0.017
bars+0.015
iddled+0.015
 of+0.015
 swift+0.015
Neg logit contributions
cious-0.014
Po-0.014
Cr-0.014
act-0.013
G-0.013
Token said
Token ID531
Feature activation-1.64e-02

Pos logit contributions
run+0.015
 of+0.013
 swift+0.013
iddled+0.013
 older+0.013
Neg logit contributions
Po-0.021
cious-0.021
act-0.021
 drowned-0.020
G-0.020
Token,
Token ID11
Feature activation+8.83e-01

Pos logit contributions
cious+0.000
Po+0.000
ork+0.000
Cr+0.000
S+0.000
Neg logit contributions
run-0.000
bars-0.000
 swift-0.000
iddled-0.000
box-0.000
Token "
Token ID366
Feature activation-6.66e-01

Pos logit contributions
run+0.016
 swift+0.015
iddled+0.015
bars+0.015
 of+0.015
Neg logit contributions
cious-0.020
Po-0.020
Cr-0.019
G-0.019
 Gig-0.019
TokenI
Token ID40
Feature activation+6.94e-01

Pos logit contributions
Po+0.015
cious+0.014
G+0.014
act+0.014
Cr+0.014
Neg logit contributions
run-0.014
 older-0.013
 of-0.013
iddled-0.012
 swift-0.012
Token have
Token ID423
Feature activation+5.17e-01

Pos logit contributions
run+0.015
bars+0.014
iddled+0.014
 of+0.014
 older+0.013
Neg logit contributions
Po-0.014
cious-0.014
act-0.013
 drowned-0.013
ork-0.013
Token an
Token ID281
Feature activation+4.62e+00

Pos logit contributions
run+0.010
 of+0.009
 swift+0.009
iddled+0.009
 older+0.009
Neg logit contributions
cious-0.012
Po-0.012
act-0.011
 drowned-0.011
G-0.011
Token idea
Token ID2126
Feature activation+3.80e+00

Pos logit contributions
run+0.070
iddled+0.060
bars+0.060
 of+0.058
 swift+0.057
Neg logit contributions
cious-0.121
Po-0.121
act-0.120
G-0.117
 gag-0.116
Token!
Token ID0
Feature activation+1.38e+00

Pos logit contributions
bars+0.110
iddled+0.110
 fascination+0.106
 swift+0.103
 older+0.102
Neg logit contributions
uts-0.047
cious-0.047
 Gig-0.043
S-0.042
Po-0.041
Token I
Token ID314
Feature activation+8.04e-01

Pos logit contributions
iddled+0.032
box+0.030
 fascination+0.030
ma+0.029
 swift+0.029
Neg logit contributions
Po-0.025
cious-0.025
 Gig-0.025
 Pink-0.023
Cr-0.023
Token'll
Token ID1183
Feature activation+1.39e+00

Pos logit contributions
run+0.017
bars+0.017
iddled+0.016
 older+0.016
 swift+0.016
Neg logit contributions
Po-0.016
 drowned-0.015
cious-0.015
act-0.015
ork-0.015
Token climb
Token ID12080
Feature activation+6.17e-01

Pos logit contributions
run+0.028
bars+0.025
iddled+0.025
 swift+0.024
 older+0.024
Neg logit contributions
Po-0.031
cious-0.030
 drowned-0.029
act-0.029
G-0.029
TokenL
Token ID43
Feature activation-1.74e-01

Pos logit contributions
cious+0.018
act+0.018
Po+0.018
 drowned+0.017
ork+0.017
Neg logit contributions
run-0.024
 of-0.023
 swift-0.022
iddled-0.022
bars-0.022
Tokenily
Token ID813
Feature activation-8.04e-01

Pos logit contributions
S+0.004
ork+0.004
Po+0.004
act+0.004
cious+0.004
Neg logit contributions
run-0.003
 swift-0.003
bars-0.003
 older-0.003
iddled-0.003
Token,
Token ID11
Feature activation+8.62e-01

Pos logit contributions
 drowned+0.011
Po+0.011
cious+0.011
act+0.010
S+0.010
Neg logit contributions
run-0.022
iddled-0.022
 swift-0.021
bars-0.020
 fascination-0.020
Token I
Token ID314
Feature activation-2.62e-01

Pos logit contributions
iddled+0.024
run+0.022
bars+0.021
 swift+0.021
illed+0.020
Neg logit contributions
Po-0.013
Pear-0.012
Cr-0.012
S-0.012
cious-0.012
Token have
Token ID423
Feature activation-2.57e-01

Pos logit contributions
cious+0.006
act+0.005
Po+0.005
 pulp+0.005
actus+0.005
Neg logit contributions
run-0.005
 of-0.005
 welcome-0.005
 swift-0.005
 Jake-0.005
Token an
Token ID281
Feature activation+4.59e+00

Pos logit contributions
Po+0.006
cious+0.006
act+0.006
 drowned+0.005
Cr+0.005
Neg logit contributions
run-0.005
 swift-0.005
 of-0.005
iddled-0.005
bars-0.004
Token idea
Token ID2126
Feature activation+3.44e+00

Pos logit contributions
run+0.084
iddled+0.069
bars+0.069
 Step+0.064
 swift+0.064
Neg logit contributions
 gag-0.110
act-0.107
 harmful-0.106
Po-0.106
 sting-0.105
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
iddled+0.105
bars+0.103
 swift+0.103
 fascination+0.102
 younger+0.099
Neg logit contributions
S-0.032
uts-0.031
cious-0.030
Po-0.028
mar-0.028
Token Why
Token ID4162
Feature activation-1.62e+00

Pos logit contributions
iddled+0.010
 swift+0.010
 older+0.010
run+0.009
 courage+0.009
Neg logit contributions
cious-0.008
Po-0.008
 disl-0.007
 Pink-0.007
 Gig-0.007
Token don
Token ID836
Feature activation+1.04e+00

Pos logit contributions
Po+0.024
act+0.024
cious+0.023
Cr+0.023
ork+0.023
Neg logit contributions
run-0.043
 swift-0.040
iddled-0.039
bars-0.038
 of-0.037
Token't
Token ID470
Feature activation-1.26e-01

Pos logit contributions
 of+0.022
run+0.021
 older+0.020
 Jake+0.019
bars+0.019
Neg logit contributions
cious-0.023
act-0.023
Po-0.022
 pulp-0.021
 gag-0.021
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
run+0.012
 swift+0.011
iddled+0.011
 of+0.011
bars+0.010
Neg logit contributions
Po-0.011
cious-0.011
act-0.011
 drowned-0.010
ork-0.010
Token Then
Token ID3244
Feature activation+7.46e-01

Pos logit contributions
run+0.025
bars+0.024
iddled+0.024
 of+0.024
 swift+0.023
Neg logit contributions
Po-0.025
G-0.023
cious-0.023
ork-0.022
act-0.022
Token,
Token ID11
Feature activation+5.31e-01

Pos logit contributions
run+0.014
 of+0.012
 swift+0.012
 older+0.012
iddled+0.012
Neg logit contributions
cious-0.018
Po-0.017
act-0.017
 drowned-0.017
G-0.017
Token he
Token ID339
Feature activation+1.43e-01

Pos logit contributions
run+0.012
bars+0.012
iddled+0.012
box+0.011
 older+0.011
Neg logit contributions
Po-0.010
G-0.010
 drowned-0.009
Cr-0.009
cious-0.009
Token had
Token ID550
Feature activation-8.28e-02

Pos logit contributions
run+0.004
iddled+0.003
bars+0.003
 pleasure+0.003
 of+0.003
Neg logit contributions
Po-0.002
 drowned-0.002
cious-0.002
Cr-0.002
G-0.002
Token an
Token ID281
Feature activation+4.57e+00

Pos logit contributions
Po+0.002
cious+0.002
act+0.002
 drowned+0.002
G+0.002
Neg logit contributions
run-0.002
 of-0.002
 swift-0.001
iddled-0.001
 older-0.001
Token idea
Token ID2126
Feature activation+3.33e+00

Pos logit contributions
run+0.066
bars+0.058
iddled+0.054
 Jake+0.053
 of+0.053
Neg logit contributions
Po-0.123
cious-0.122
 gag-0.121
 harmful-0.120
 drowned-0.119
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
bars+0.096
 pleasure+0.096
 younger+0.094
 older+0.094
 Faster+0.091
Neg logit contributions
Po-0.039
ork-0.037
mar-0.036
cious-0.036
uts-0.035
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
bars+0.037
iddled+0.036
run+0.036
 swift+0.035
ma+0.035
Neg logit contributions
Po-0.033
G-0.031
Pear-0.030
Cr-0.030
S-0.029
Token
Token ID198
Feature activation+5.43e-01

Pos logit contributions
bars+0.025
run+0.025
 older+0.024
iddled+0.023
 swift+0.023
Neg logit contributions
Po-0.019
G-0.019
ork-0.019
S-0.018
Pear-0.018
Token"
Token ID1
Feature activation-1.11e-01

Pos logit contributions
run+0.013
 of+0.012
 swift+0.012
iddled+0.011
 older+0.011
Neg logit contributions
cious-0.010
act-0.010
Po-0.010
 drowned-0.009
 gag-0.009
Token said
Token ID531
Feature activation-7.74e-02

Pos logit contributions
run+0.008
bars+0.007
 swift+0.007
iddled+0.007
 of+0.007
Neg logit contributions
Po-0.007
cious-0.007
 drowned-0.007
G-0.007
Cr-0.007
Token,
Token ID11
Feature activation+5.14e-01

Pos logit contributions
cious+0.002
Po+0.002
ork+0.002
Cr+0.002
act+0.002
Neg logit contributions
run-0.002
 swift-0.001
bars-0.001
iddled-0.001
 of-0.001
Token "
Token ID366
Feature activation-4.85e-01

Pos logit contributions
run+0.009
 swift+0.008
 of+0.008
iddled+0.008
bars+0.008
Neg logit contributions
Po-0.012
cious-0.012
G-0.012
Cr-0.012
Pear-0.012
TokenI
Token ID40
Feature activation+2.52e-01

Pos logit contributions
Po+0.007
G+0.007
S+0.006
Cr+0.006
Pear+0.006
Neg logit contributions
run-0.014
 older-0.014
 of-0.013
 swift-0.013
box-0.013
Token have
Token ID423
Feature activation+1.16e-01

Pos logit contributions
run+0.006
bars+0.006
iddled+0.006
 swift+0.005
 older+0.005
Neg logit contributions
Po-0.004
cious-0.004
 drowned-0.004
act-0.004
ork-0.004
Token an
Token ID281
Feature activation+4.57e+00

Pos logit contributions
run+0.002
 swift+0.002
iddled+0.002
bars+0.002
 of+0.002
Neg logit contributions
Po-0.002
cious-0.002
act-0.002
 drowned-0.002
Cr-0.002
Token idea
Token ID2126
Feature activation+3.66e+00

Pos logit contributions
run+0.094
iddled+0.084
bars+0.082
 Jake+0.081
 swift+0.080
Neg logit contributions
act-0.094
Po-0.093
cious-0.092
 gag-0.090
G-0.090
Token!
Token ID0
Feature activation+1.53e+00

Pos logit contributions
iddled+0.113
bars+0.111
 fascination+0.109
 swift+0.108
 younger+0.106
Neg logit contributions
uts-0.038
cious-0.037
S-0.031
 Gig-0.031
mar-0.030
Token I
Token ID314
Feature activation+3.96e-01

Pos logit contributions
iddled+0.035
 swift+0.034
box+0.033
 fascination+0.033
 older+0.032
Neg logit contributions
Po-0.025
cious-0.024
 Gig-0.024
 Pink-0.024
Cr-0.022
Token will
Token ID481
Feature activation+2.38e-01

Pos logit contributions
run+0.009
bars+0.009
 swift+0.008
 older+0.008
iddled+0.008
Neg logit contributions
Po-0.007
 drowned-0.007
act-0.007
cious-0.007
ork-0.007
Token share
Token ID2648
Feature activation-2.13e+00

Pos logit contributions
run+0.006
bars+0.005
 swift+0.005
iddled+0.005
 older+0.005
Neg logit contributions
Po-0.005
cious-0.004
 drowned-0.004
Cr-0.004
G-0.004
Token Dad
Token ID17415
Feature activation+2.25e+00

Pos logit contributions
cious+0.002
Po+0.002
G+0.002
 Gig+0.002
S+0.002
Neg logit contributions
run-0.002
 swift-0.002
bars-0.002
iddled-0.002
box-0.002
Token.
Token ID13
Feature activation+2.63e+00

Pos logit contributions
 pleasure+0.066
 Faster+0.064
iddled+0.062
 swift+0.061
initely+0.061
Neg logit contributions
cious-0.029
ache-0.022
mar-0.022
uts-0.022
mary-0.021
Token␣
Token ID220
Feature activation+1.97e+00

Pos logit contributions
isher+0.069
 swift+0.067
iddled+0.067
bars+0.066
run+0.066
Neg logit contributions
 Gig-0.037
G-0.036
Po-0.034
S-0.033
uts-0.031
Token\n
Token ID198
Feature activation+1.47e+00

Pos logit contributions
isher+0.054
 fascination+0.053
bars+0.052
 swift+0.052
iddled+0.052
Neg logit contributions
uts-0.027
ork-0.024
G-0.022
S-0.022
Po-0.022
Token\n
Token ID198
Feature activation+7.68e-01

Pos logit contributions
bars+0.044
 older+0.042
run+0.040
 courage+0.040
 pleasure+0.039
Neg logit contributions
S-0.022
G-0.022
Po-0.019
ork-0.019
Pear-0.018
TokenAll
Token ID3237
Feature activation+4.56e+00

Pos logit contributions
run+0.022
 older+0.022
bars+0.022
 courage+0.021
 pleasure+0.020
Neg logit contributions
Po-0.011
G-0.011
S-0.010
Cr-0.010
Pear-0.009
Tokenie
Token ID494
Feature activation+5.58e-01

Pos logit contributions
box+0.073
bars+0.065
uit+0.064
 Step+0.061
 Faster+0.059
Neg logit contributions
 drowned-0.104
 became-0.101
S-0.101
Pear-0.101
Po-0.096
Token smiled
Token ID13541
Feature activation+8.95e-02

Pos logit contributions
run+0.013
 pleasure+0.013
bars+0.012
iddled+0.012
 welcome+0.012
Neg logit contributions
cious-0.010
 drowned-0.009
Po-0.009
G-0.009
mary-0.009
Token and
Token ID290
Feature activation+8.68e-01

Pos logit contributions
run+0.002
bars+0.002
box+0.002
iddled+0.002
 swift+0.002
Neg logit contributions
Po-0.002
cious-0.002
G-0.002
S-0.001
Cr-0.001
Token they
Token ID484
Feature activation+9.97e-01

Pos logit contributions
run+0.020
box+0.018
bars+0.018
 older+0.018
iddled+0.017
Neg logit contributions
cious-0.017
G-0.016
 drowned-0.016
Po-0.016
 Gig-0.015
Token both
Token ID1111
Feature activation+1.69e+00

Pos logit contributions
run+0.029
 older+0.028
 younger+0.026
box+0.026
 welcome+0.026
Neg logit contributions
G-0.013
 drowned-0.012
cious-0.012
Pear-0.012
Po-0.012
Token said
Token ID531
Feature activation+6.10e-01

Pos logit contributions
run+0.011
 pleasure+0.010
bars+0.010
 swift+0.010
 Faster+0.010
Neg logit contributions
Po-0.008
cious-0.008
Cr-0.008
G-0.007
 drowned-0.007
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
run+0.018
 swift+0.018
 fascination+0.018
ay+0.018
 sorts+0.018
Neg logit contributions
Po-0.006
cious-0.006
 gag-0.006
uts-0.005
 sting-0.005
Token "
Token ID366
Feature activation-7.39e-01

Pos logit contributions
run+0.025
bars+0.024
 swift+0.024
iddled+0.023
 pleasure+0.023
Neg logit contributions
Po-0.029
cious-0.027
Pear-0.027
G-0.026
uts-0.026
TokenI
Token ID40
Feature activation+3.60e-01

Pos logit contributions
act+0.014
cious+0.013
 drowned+0.013
 gag+0.013
 Gig+0.013
Neg logit contributions
run-0.016
 of-0.015
 swift-0.015
bars-0.015
 Faster-0.014
Token have
Token ID423
Feature activation+2.91e-01

Pos logit contributions
run+0.008
bars+0.008
iddled+0.007
 of+0.007
 older+0.007
Neg logit contributions
Po-0.007
cious-0.007
act-0.007
 drowned-0.007
ork-0.006
Token an
Token ID281
Feature activation+4.55e+00

Pos logit contributions
run+0.006
 of+0.005
 swift+0.005
iddled+0.005
bars+0.005
Neg logit contributions
Po-0.006
cious-0.006
act-0.006
 drowned-0.006
Cr-0.006
Token idea
Token ID2126
Feature activation+3.40e+00

Pos logit contributions
run+0.064
bars+0.054
 of+0.051
iddled+0.051
 swift+0.051
Neg logit contributions
cious-0.124
Po-0.124
act-0.122
 drowned-0.120
 gag-0.120
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
iddled+0.103
 fascination+0.102
bars+0.100
 swift+0.100
run+0.098
Neg logit contributions
uts-0.036
cious-0.035
S-0.033
Po-0.032
 drowned-0.031
Token Look
Token ID6803
Feature activation+1.02e+00

Pos logit contributions
iddled+0.025
 older+0.024
 swift+0.024
bars+0.024
 courage+0.023
Neg logit contributions
cious-0.020
Po-0.020
Pear-0.018
Cr-0.018
 Gig-0.018
Token,
Token ID11
Feature activation+9.05e-01

Pos logit contributions
bars+0.024
run+0.024
 swift+0.023
iddled+0.023
 Step+0.022
Neg logit contributions
act-0.017
Po-0.016
cious-0.016
ork-0.016
S-0.015
Token there
Token ID612
Feature activation+1.56e+00

Pos logit contributions
run+0.022
iddled+0.022
bars+0.021
 swift+0.020
illed+0.020
Neg logit contributions
Po-0.015
cious-0.015
Cr-0.014
Pear-0.013
 pige-0.013
Token\n
Token ID198
Feature activation+6.33e-01

Pos logit contributions
run+0.022
 swift+0.020
iddled+0.020
bars+0.020
 of+0.020
Neg logit contributions
Po-0.021
cious-0.020
G-0.020
Pear-0.020
act-0.020
Token\n
Token ID198
Feature activation-2.09e-01

Pos logit contributions
run+0.014
bars+0.013
 swift+0.013
iddled+0.013
 older+0.013
Neg logit contributions
Po-0.012
G-0.012
ork-0.011
act-0.011
cious-0.011
Token"
Token ID1
Feature activation-1.07e+00

Pos logit contributions
cious+0.004
Po+0.004
act+0.004
 drowned+0.004
G+0.004
Neg logit contributions
run-0.005
 of-0.004
 swift-0.004
iddled-0.004
 older-0.004
TokenA
Token ID32
Feature activation+7.69e-01

Pos logit contributions
act+0.018
cious+0.018
 drowned+0.018
ork+0.017
 gag+0.017
Neg logit contributions
run-0.026
 of-0.025
 swift-0.024
iddled-0.024
bars-0.024
Token pe
Token ID613
Feature activation+2.54e+00

Pos logit contributions
run+0.018
iddled+0.017
 swift+0.017
 of+0.016
bars+0.016
Neg logit contributions
Po-0.014
cious-0.014
act-0.014
G-0.013
ork-0.013
Tokenb
Token ID65
Feature activation+4.55e+00

Pos logit contributions
run+0.055
 of+0.051
 Jake+0.051
iddled+0.050
 Faster+0.049
Neg logit contributions
Po-0.049
act-0.049
cious-0.049
ork-0.047
Pear-0.046
Tokenble
Token ID903
Feature activation+1.36e+00

Pos logit contributions
run+0.126
 swift+0.118
bars+0.117
 older+0.116
iddled+0.115
Neg logit contributions
Po-0.062
cious-0.061
act-0.058
Pear-0.058
ork-0.057
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
run+0.033
iddled+0.031
 Faster+0.031
 Step+0.030
 pleasure+0.030
Neg logit contributions
Po-0.023
act-0.023
ork-0.022
cious-0.022
 drowned-0.022
Token Mom
Token ID11254
Feature activation+7.64e-01

Pos logit contributions
run+0.028
iddled+0.026
bars+0.026
 swift+0.024
old+0.023
Neg logit contributions
Po-0.021
Cr-0.021
Pear-0.020
 pulp-0.020
cious-0.020
Tokenmy
Token ID1820
Feature activation+1.45e-01

Pos logit contributions
 Faster+0.014
run+0.013
iddled+0.013
 swift+0.013
 fascination+0.013
Neg logit contributions
mar-0.017
Po-0.017
ache-0.016
act-0.016
S-0.016
Token!
Token ID0
Feature activation+6.78e-01

Pos logit contributions
run+0.004
iddled+0.004
 swift+0.004
 Faster+0.004
bars+0.004
Neg logit contributions
cious-0.002
Po-0.002
act-0.002
ache-0.002
mar-0.002
Token,
Token ID11
Feature activation+5.56e-01

Pos logit contributions
run+0.009
 of+0.008
 swift+0.008
 older+0.008
iddled+0.008
Neg logit contributions
cious-0.012
Po-0.012
act-0.012
 drowned-0.012
G-0.012
Token the
Token ID262
Feature activation-2.49e-01

Pos logit contributions
run+0.013
iddled+0.013
bars+0.013
box+0.012
 swift+0.011
Neg logit contributions
 drowned-0.010
Po-0.010
cious-0.010
 Gig-0.009
Cr-0.009
Token giant
Token ID6175
Feature activation+4.13e-01

Pos logit contributions
 sting+0.004
 drowned+0.004
Po+0.004
 gag+0.003
actus+0.003
Neg logit contributions
run-0.008
iddled-0.007
bars-0.007
illed-0.006
 Step-0.006
Token horse
Token ID8223
Feature activation+5.79e-01

Pos logit contributions
run+0.011
iddled+0.010
 Faster+0.010
 Step+0.010
 pleasure+0.010
Neg logit contributions
 drowned-0.007
Po-0.007
cious-0.006
act-0.006
ache-0.006
Token had
Token ID550
Feature activation+1.34e-02

Pos logit contributions
run+0.015
bars+0.014
iddled+0.014
 pleasure+0.013
 swift+0.013
Neg logit contributions
 drowned-0.010
Po-0.010
cious-0.010
act-0.009
ork-0.009
Token an
Token ID281
Feature activation+4.54e+00

Pos logit contributions
run+0.000
 swift+0.000
bars+0.000
iddled+0.000
 of+0.000
Neg logit contributions
Po-0.000
 drowned-0.000
cious-0.000
act-0.000
Cr-0.000
Token idea
Token ID2126
Feature activation+3.36e+00

Pos logit contributions
run+0.068
bars+0.061
iddled+0.056
 Jake+0.055
 of+0.055
Neg logit contributions
 gag-0.117
cious-0.117
 drowned-0.116
Po-0.116
act-0.115
Token.
Token ID13
Feature activation+6.90e-01

Pos logit contributions
bars+0.099
 younger+0.096
 pleasure+0.095
 older+0.095
iddled+0.095
Neg logit contributions
Po-0.037
mar-0.037
cious-0.037
ork-0.035
S-0.034
Token He
Token ID679
Feature activation+2.11e-01

Pos logit contributions
ma+0.017
iddled+0.016
 fascination+0.015
 swift+0.015
bars+0.015
Neg logit contributions
Po-0.011
Cr-0.010
ork-0.010
 Gig-0.010
uts-0.010
Token said
Token ID531
Feature activation-2.95e-04

Pos logit contributions
run+0.005
bars+0.005
 Faster+0.005
 pleasure+0.005
ay+0.005
Neg logit contributions
Po-0.003
 drowned-0.003
Cr-0.003
cious-0.003
G-0.003
Token "
Token ID366
Feature activation-5.57e-01

Pos logit contributions
Po+0.000
ork+0.000
cious+0.000
Cr+0.000
G+0.000
Neg logit contributions
bars-0.000
run-0.000
iddled-0.000
 swift-0.000
hole-0.000
Token said
Token ID531
Feature activation-5.40e-01

Pos logit contributions
run+0.012
 of+0.011
 swift+0.011
bars+0.010
iddled+0.010
Neg logit contributions
Po-0.015
cious-0.015
act-0.014
 drowned-0.014
G-0.014
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
cious+0.012
Po+0.012
act+0.012
ork+0.012
 drowned+0.012
Neg logit contributions
run-0.010
 of-0.009
 swift-0.009
iddled-0.009
 older-0.009
Token "
Token ID366
Feature activation-1.06e+00

Pos logit contributions
run+0.002
 of+0.002
 swift+0.002
iddled+0.002
bars+0.002
Neg logit contributions
cious-0.003
Po-0.003
G-0.003
Cr-0.003
act-0.003
TokenI
Token ID40
Feature activation-1.52e-01

Pos logit contributions
Po+0.018
cious+0.017
act+0.017
G+0.017
Cr+0.016
Neg logit contributions
run-0.027
 of-0.025
 older-0.025
 swift-0.025
iddled-0.024
Token have
Token ID423
Feature activation+6.24e-03

Pos logit contributions
Po+0.003
cious+0.003
act+0.003
 drowned+0.003
G+0.003
Neg logit contributions
run-0.003
 of-0.003
 swift-0.003
iddled-0.003
 older-0.003
Token an
Token ID281
Feature activation+4.53e+00

Pos logit contributions
run+0.000
 of+0.000
 swift+0.000
iddled+0.000
 older+0.000
Neg logit contributions
Po-0.000
cious-0.000
act-0.000
 drowned-0.000
G-0.000
Token idea
Token ID2126
Feature activation+3.50e+00

Pos logit contributions
run+0.071
iddled+0.060
 Jake+0.059
 swift+0.059
bars+0.058
Neg logit contributions
act-0.115
Po-0.115
cious-0.113
G-0.111
 drowned-0.110
Token!"
Token ID2474
Feature activation+3.38e-01

Pos logit contributions
bars+0.098
iddled+0.098
 swift+0.097
 fascination+0.095
run+0.094
Neg logit contributions
cious-0.044
uts-0.042
Po-0.040
S-0.039
ork-0.038
Token She
Token ID1375
Feature activation+3.62e-01

Pos logit contributions
run+0.006
 swift+0.005
 of+0.005
iddled+0.005
 older+0.005
Neg logit contributions
Po-0.008
cious-0.008
act-0.008
 drowned-0.008
ork-0.008
Token put
Token ID1234
Feature activation+6.92e-01

Pos logit contributions
run+0.009
 pleasure+0.008
 welcome+0.008
 swift+0.008
bars+0.008
Neg logit contributions
Po-0.006
cious-0.006
Cr-0.006
G-0.006
 drowned-0.006
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
run+0.019
old+0.018
 younger+0.017
 older+0.017
 swift+0.017
Neg logit contributions
Po-0.010
 pits-0.009
 Gig-0.009
 Pink-0.009
 peanut-0.009
Token?"
Token ID1701
Feature activation+8.48e-01

Pos logit contributions
run+0.005
 swift+0.005
bars+0.005
iddled+0.005
 pleasure+0.005
Neg logit contributions
Po-0.003
Cr-0.003
cious-0.003
act-0.003
ork-0.003
Token\n
Token ID198
Feature activation+9.28e-01

Pos logit contributions
run+0.023
box+0.023
iddled+0.022
 swift+0.022
isher+0.021
Neg logit contributions
S-0.012
Po-0.012
G-0.012
 Gig-0.011
Pear-0.010
Token\n
Token ID198
Feature activation-2.59e-01

Pos logit contributions
run+0.025
iddled+0.024
bars+0.024
 courage+0.024
 swift+0.023
Neg logit contributions
S-0.015
G-0.014
ork-0.014
Po-0.013
Cr-0.012
TokenTom
Token ID13787
Feature activation+1.25e+00

Pos logit contributions
Po+0.004
cious+0.004
G+0.004
Cr+0.004
S+0.004
Neg logit contributions
run-0.007
 of-0.006
 swift-0.006
 older-0.006
iddled-0.006
Token had
Token ID550
Feature activation-5.57e-01

Pos logit contributions
 pleasure+0.029
run+0.027
 swift+0.027
 of+0.027
 welcome+0.027
Neg logit contributions
Po-0.023
cious-0.022
mar-0.022
Pear-0.022
Cr-0.021
Token an
Token ID281
Feature activation+4.52e+00

Pos logit contributions
Po+0.012
cious+0.012
act+0.012
G+0.011
 drowned+0.011
Neg logit contributions
run-0.011
 of-0.010
 swift-0.010
iddled-0.010
 older-0.010
Token idea
Token ID2126
Feature activation+3.62e+00

Pos logit contributions
run+0.069
bars+0.060
box+0.056
 Jake+0.056
 of+0.055
Neg logit contributions
cious-0.116
Po-0.115
 gag-0.114
act-0.112
 harmful-0.111
Token.
Token ID13
Feature activation+8.88e-01

Pos logit contributions
 pleasure+0.107
 swift+0.102
 fascination+0.101
 Faster+0.101
bars+0.101
Neg logit contributions
Po-0.038
mar-0.038
uts-0.037
S-0.036
cious-0.035
Token He
Token ID679
Feature activation+5.72e-01

Pos logit contributions
 fascination+0.020
 swift+0.020
ma+0.020
 courage+0.019
iddled+0.019
Neg logit contributions
Po-0.015
S-0.014
Cr-0.014
ork-0.014
uts-0.013
Token wanted
Token ID2227
Feature activation+1.06e+00

Pos logit contributions
 pleasure+0.016
 welcome+0.015
run+0.015
bars+0.015
ay+0.015
Neg logit contributions
Po-0.008
Cr-0.008
cious-0.007
G-0.007
 drowned-0.007
Token to
Token ID284
Feature activation+2.09e+00

Pos logit contributions
run+0.028
iddled+0.028
bars+0.027
 swift+0.027
 pleasure+0.026
Neg logit contributions
Po-0.016
 drowned-0.014
act-0.014
 pits-0.014
cious-0.014
Token said
Token ID531
Feature activation+1.36e-01

Pos logit contributions
run+0.029
ay+0.029
bars+0.028
iddled+0.027
 pleasure+0.027
Neg logit contributions
Po-0.021
cious-0.019
 drowned-0.019
G-0.018
Cr-0.018
Token,
Token ID11
Feature activation+6.99e-01

Pos logit contributions
bars+0.003
run+0.003
 swift+0.003
iddled+0.003
ay+0.003
Neg logit contributions
cious-0.002
Po-0.002
ork-0.002
mar-0.002
 gag-0.002
Token "
Token ID366
Feature activation-4.46e-01

Pos logit contributions
run+0.014
 swift+0.013
ay+0.013
bars+0.013
iddled+0.013
Neg logit contributions
cious-0.015
Po-0.015
Cr-0.014
G-0.014
mary-0.014
TokenI
Token ID40
Feature activation+1.90e-01

Pos logit contributions
Po+0.007
cious+0.007
G+0.007
Cr+0.006
S+0.006
Neg logit contributions
run-0.012
 older-0.011
 of-0.011
 swift-0.011
iddled-0.011
Token have
Token ID423
Feature activation+5.66e-01

Pos logit contributions
run+0.004
bars+0.004
iddled+0.004
 swift+0.004
 of+0.004
Neg logit contributions
Po-0.004
 drowned-0.004
cious-0.004
act-0.003
ork-0.003
Token an
Token ID281
Feature activation+4.51e+00

Pos logit contributions
run+0.013
iddled+0.012
 swift+0.012
bars+0.012
 of+0.011
Neg logit contributions
Po-0.011
cious-0.011
 drowned-0.011
act-0.010
ork-0.010
Token idea
Token ID2126
Feature activation+3.66e+00

Pos logit contributions
run+0.076
bars+0.066
iddled+0.064
 Jake+0.061
 of+0.059
Neg logit contributions
 gag-0.109
 harmful-0.109
Po-0.109
cious-0.107
 drowned-0.107
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
iddled+0.109
bars+0.109
 fascination+0.107
 swift+0.106
 Faster+0.104
Neg logit contributions
uts-0.040
cious-0.039
Po-0.034
S-0.034
ork-0.033
Token Let
Token ID3914
Feature activation+1.38e+00

Pos logit contributions
iddled+0.029
 swift+0.027
bars+0.027
box+0.026
 fascination+0.026
Neg logit contributions
Po-0.023
cious-0.023
 Gig-0.022
 Pink-0.021
Cr-0.021
Token's
Token ID338
Feature activation+1.90e+00

Pos logit contributions
run+0.036
 swift+0.033
iddled+0.032
bars+0.032
 of+0.031
Neg logit contributions
Po-0.023
 drowned-0.022
cious-0.022
act-0.022
G-0.021
Token go
Token ID467
Feature activation+5.96e-01

Pos logit contributions
run+0.045
iddled+0.041
bars+0.040
 swift+0.040
 of+0.039
Neg logit contributions
Po-0.035
 drowned-0.034
cious-0.034
act-0.033
Pear-0.033
Token it
Token ID340
Feature activation+2.15e-01

Pos logit contributions
Po+0.007
ork+0.007
act+0.006
 pits+0.006
 drowned+0.006
Neg logit contributions
run-0.015
iddled-0.015
 swift-0.014
 older-0.014
bars-0.014
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
run+0.006
iddled+0.005
bars+0.005
 swift+0.005
 Jake+0.005
Neg logit contributions
Po-0.003
ork-0.003
Cr-0.003
cious-0.003
act-0.003
Token me
Token ID502
Feature activation+7.82e-01

Pos logit contributions
Po+0.019
Cr+0.019
G+0.019
S+0.019
 drowned+0.018
Neg logit contributions
run-0.028
iddled-0.028
 swift-0.025
bars-0.025
box-0.024
Token for
Token ID329
Feature activation-9.21e-01

Pos logit contributions
iddled+0.020
run+0.019
 swift+0.018
 fascination+0.018
bars+0.018
Neg logit contributions
Po-0.012
cious-0.012
ork-0.012
mary-0.012
Pear-0.011
Token safe
Token ID3338
Feature activation-2.03e+00

Pos logit contributions
 drowned+0.017
Po+0.017
Cr+0.017
cious+0.016
G+0.016
Neg logit contributions
run-0.021
iddled-0.021
bars-0.021
 swift-0.019
 of-0.019
Tokenkeeping
Token ID19934
Feature activation-4.93e+00

Pos logit contributions
cious+0.023
Po+0.023
act+0.022
 drowned+0.022
ork+0.021
Neg logit contributions
run-0.062
 swift-0.057
iddled-0.057
bars-0.056
 of-0.056
Token.
Token ID13
Feature activation-4.18e-02

Pos logit contributions
cious+0.104
Po+0.104
act+0.102
 drowned+0.099
G+0.098
Neg logit contributions
run-0.100
 of-0.092
 swift-0.089
iddled-0.088
 older-0.087
Token He
Token ID679
Feature activation-9.67e-01

Pos logit contributions
Po+0.001
S+0.001
G+0.001
Cr+0.001
 Gig+0.001
Neg logit contributions
iddled-0.001
run-0.001
 swift-0.001
box-0.001
bars-0.001
Token told
Token ID1297
Feature activation-1.04e-01

Pos logit contributions
 drowned+0.019
act+0.018
Po+0.018
Cr+0.017
cious+0.017
Neg logit contributions
run-0.023
iddled-0.020
 swift-0.020
 Faster-0.020
bars-0.019
Token me
Token ID502
Feature activation+1.36e+00

Pos logit contributions
Po+0.002
 drowned+0.002
Cr+0.002
ork+0.002
S+0.002
Neg logit contributions
run-0.002
iddled-0.002
 swift-0.002
bars-0.002
 Faster-0.002
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
iddled+0.032
run+0.032
 swift+0.030
 Step+0.030
 fascination+0.029
Neg logit contributions
ork-0.022
cious-0.021
Po-0.021
act-0.020
 drowned-0.020
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
Po+0.000
cious+0.000
 drowned+0.000
act+0.000
Cr+0.000
Neg logit contributions
run-0.000
iddled-0.000
 swift-0.000
bars-0.000
 of-0.000
Token officer
Token ID3818
Feature activation-1.20e+00

Pos logit contributions
Po+0.018
cious+0.017
 drowned+0.016
act+0.016
Cr+0.016
Neg logit contributions
run-0.037
iddled-0.033
 of-0.033
 swift-0.033
bars-0.033
Token safely
Token ID11512
Feature activation-4.71e-01

Pos logit contributions
 drowned+0.022
cious+0.021
Po+0.020
Cr+0.020
ork+0.020
Neg logit contributions
run-0.029
 swift-0.027
iddled-0.027
 pleasure-0.026
bars-0.026
Token took
Token ID1718
Feature activation-4.32e-01

Pos logit contributions
 drowned+0.008
cious+0.008
Po+0.007
G+0.007
Cr+0.007
Neg logit contributions
run-0.012
bars-0.011
 swift-0.011
iddled-0.011
 pleasure-0.011
Token the
Token ID262
Feature activation-5.20e-01

Pos logit contributions
 drowned+0.008
Po+0.007
ork+0.007
 Gig+0.007
S+0.007
Neg logit contributions
run-0.010
bars-0.010
iddled-0.010
box-0.009
 swift-0.009
Token bad
Token ID2089
Feature activation-4.62e+00

Pos logit contributions
Po+0.007
cious+0.006
 harmful+0.006
 drowned+0.006
ork+0.006
Neg logit contributions
run-0.015
bars-0.014
iddled-0.014
 Kate-0.014
 of-0.013
Token man
Token ID582
Feature activation-5.96e-01

Pos logit contributions
Po+0.077
cious+0.075
ork+0.072
 drowned+0.072
act+0.071
Neg logit contributions
run-0.120
iddled-0.110
 swift-0.108
 of-0.107
bars-0.106
Token away
Token ID1497
Feature activation-1.46e+00

Pos logit contributions
cious+0.011
 drowned+0.010
ork+0.010
 bank+0.009
Po+0.009
Neg logit contributions
iddled-0.015
bars-0.014
run-0.014
 Faster-0.013
 older-0.013
Token and
Token ID290
Feature activation+7.26e-01

Pos logit contributions
ork+0.022
 drowned+0.022
cious+0.022
Po+0.022
S+0.021
Neg logit contributions
iddled-0.037
run-0.036
bars-0.035
 Step-0.034
 Faster-0.033
Token everyone
Token ID2506
Feature activation-2.34e-01

Pos logit contributions
run+0.018
iddled+0.017
 swift+0.016
bars+0.016
ay+0.016
Neg logit contributions
 drowned-0.013
Po-0.011
cious-0.011
 Gig-0.011
Cr-0.011
Token went
Token ID1816
Feature activation-4.83e-01

Pos logit contributions
 drowned+0.005
cious+0.004
Po+0.004
act+0.004
mar+0.004
Neg logit contributions
run-0.005
 swift-0.005
bars-0.005
 Step-0.005
 older-0.005
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
 swift+0.001
 Faster+0.001
 pleasure+0.001
iddled+0.001
 fascination+0.001
Neg logit contributions
cious-0.001
uts-0.001
ork-0.001
Po-0.000
 drowned-0.000
Token She
Token ID1375
Feature activation-2.95e-01

Pos logit contributions
 swift+0.013
icking+0.013
iddled+0.013
run+0.013
 courage+0.012
Neg logit contributions
Po-0.007
 Gig-0.007
S-0.007
ork-0.007
uts-0.007
Token shouted
Token ID17293
Feature activation-4.28e-01

Pos logit contributions
Po+0.005
 drowned+0.005
Cr+0.005
G+0.004
mary+0.004
Neg logit contributions
 welcome-0.007
run-0.007
 pleasure-0.007
 swift-0.007
 Faster-0.007
Token at
Token ID379
Feature activation+4.56e-01

Pos logit contributions
cious+0.008
Po+0.008
ork+0.008
uts+0.007
Cr+0.007
Neg logit contributions
bars-0.010
run-0.009
 swift-0.009
iddled-0.009
box-0.009
Token the
Token ID262
Feature activation-9.11e-01

Pos logit contributions
run+0.011
iddled+0.010
 swift+0.010
bars+0.010
box+0.010
Neg logit contributions
Po-0.008
Pear-0.008
 Gig-0.008
G-0.007
cious-0.007
Token bad
Token ID2089
Feature activation-4.61e+00

Pos logit contributions
 bank+0.014
Po+0.013
 drowned+0.013
cious+0.013
ork+0.013
Neg logit contributions
run-0.026
iddled-0.024
 swift-0.023
bars-0.022
 Step-0.022
Token man
Token ID582
Feature activation-1.18e-01

Pos logit contributions
Po+0.097
cious+0.097
act+0.096
 drowned+0.092
G+0.091
Neg logit contributions
run-0.095
 of-0.087
 swift-0.085
iddled-0.084
 older-0.083
Token with
Token ID351
Feature activation-6.51e-01

Pos logit contributions
cious+0.002
ork+0.002
mary+0.002
Po+0.002
ache+0.002
Neg logit contributions
iddled-0.003
bars-0.003
 Faster-0.003
 pleasure-0.003
 Step-0.003
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
Po+0.011
 drowned+0.011
 sting+0.011
G+0.010
Pear+0.010
Neg logit contributions
bars-0.016
run-0.014
icking-0.014
iddled-0.014
box-0.014
Token gun
Token ID2485
Feature activation-3.83e-01

Pos logit contributions
 bank+0.002
Po+0.002
 harmful+0.002
 drowned+0.002
 sting+0.002
Neg logit contributions
run-0.004
bars-0.004
ions-0.004
iddled-0.004
 Kate-0.004
Token,
Token ID11
Feature activation+4.60e-02

Pos logit contributions
 drowned+0.006
uts+0.006
cious+0.006
ork+0.005
Po+0.005
Neg logit contributions
run-0.009
iddled-0.009
 swift-0.009
 pleasure-0.009
 Faster-0.009
Token and
Token ID290
Feature activation+2.37e-01

Pos logit contributions
uts+0.004
ork+0.004
 bank+0.004
cious+0.003
keeping+0.003
Neg logit contributions
 swift-0.007
 fascination-0.007
 pleasure-0.007
run-0.007
iddled-0.006
Token they
Token ID484
Feature activation+3.04e-01

Pos logit contributions
run+0.006
iddled+0.005
 swift+0.005
ay+0.005
 older+0.005
Neg logit contributions
cious-0.004
 drowned-0.004
Po-0.004
 pulp-0.004
act-0.004
Token went
Token ID1816
Feature activation-1.34e-01

Pos logit contributions
run+0.008
 swift+0.007
 older+0.007
iddled+0.007
 courage+0.007
Neg logit contributions
 drowned-0.005
Po-0.005
cious-0.005
 gag-0.004
act-0.004
Token to
Token ID284
Feature activation+4.99e-02

Pos logit contributions
cious+0.002
ork+0.002
Po+0.002
act+0.002
 drowned+0.002
Neg logit contributions
run-0.003
 swift-0.003
iddled-0.003
bars-0.003
 pleasure-0.003
Token the
Token ID262
Feature activation-1.96e-01

Pos logit contributions
run+0.001
iddled+0.001
bars+0.001
 swift+0.001
box+0.001
Neg logit contributions
 drowned-0.001
cious-0.001
 bank-0.001
Po-0.001
 Gig-0.001
Token checkout
Token ID28006
Feature activation-4.59e+00

Pos logit contributions
Po+0.004
cious+0.004
act+0.004
 drowned+0.004
Cr+0.004
Neg logit contributions
run-0.004
iddled-0.004
 of-0.004
 swift-0.004
bars-0.003
Token.
Token ID13
Feature activation+3.20e-01

Pos logit contributions
ork+0.081
cious+0.081
Po+0.076
Cr+0.075
act+0.073
Neg logit contributions
run-0.111
 swift-0.106
iddled-0.101
 pleasure-0.098
 Faster-0.097
Token The
Token ID383
Feature activation-5.71e-01

Pos logit contributions
 swift+0.007
 older+0.007
 courage+0.007
iddled+0.007
run+0.007
Neg logit contributions
G-0.006
S-0.006
cious-0.006
Po-0.006
 Gig-0.006
Token little
Token ID1310
Feature activation-1.94e+00

Pos logit contributions
S+0.009
actus+0.008
 peanut+0.008
Po+0.008
Cr+0.008
Neg logit contributions
run-0.016
iddled-0.016
 Andy-0.014
ug-0.014
bars-0.014
Token boy
Token ID2933
Feature activation-3.03e-01

Pos logit contributions
 peanut+0.030
S+0.028
 cand+0.027
 pigeon+0.026
Po+0.025
Neg logit contributions
run-0.055
iddled-0.054
 swift-0.047
 Step-0.047
ver-0.046
Token smiled
Token ID13541
Feature activation-3.75e-02

Pos logit contributions
cious+0.005
 drowned+0.005
Po+0.004
mary+0.004
ork+0.004
Neg logit contributions
run-0.008
iddled-0.007
bars-0.007
 swift-0.007
uster-0.007
Token,
Token ID11
Feature activation-3.18e-02

Pos logit contributions
run+0.003
 swift+0.003
iddled+0.003
bars+0.003
 older+0.003
Neg logit contributions
Po-0.002
cious-0.002
ork-0.002
 drowned-0.002
G-0.002
Token they
Token ID484
Feature activation+5.49e-01

Pos logit contributions
Po+0.000
S+0.000
ork+0.000
G+0.000
 Gig+0.000
Neg logit contributions
run-0.001
 swift-0.001
iddled-0.001
bars-0.001
 pleasure-0.001
Token went
Token ID1816
Feature activation-1.31e-01

Pos logit contributions
run+0.015
 swift+0.015
 pleasure+0.014
 welcome+0.014
 courage+0.014
Neg logit contributions
Po-0.007
 drowned-0.007
Pear-0.006
G-0.006
 pulp-0.006
Token to
Token ID284
Feature activation+3.83e-02

Pos logit contributions
ork+0.002
Po+0.002
 bank+0.002
cious+0.002
mary+0.002
Neg logit contributions
run-0.003
 swift-0.003
bars-0.003
 pleasure-0.003
iddled-0.003
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
run+0.001
bars+0.001
 swift+0.001
iddled+0.001
box+0.001
Neg logit contributions
Po-0.001
 Gig-0.001
ork-0.001
Pear-0.001
cious-0.001
Token checkout
Token ID28006
Feature activation-4.55e+00

Pos logit contributions
Po+0.004
cious+0.004
act+0.004
 drowned+0.004
ork+0.004
Neg logit contributions
run-0.004
 of-0.003
 swift-0.003
iddled-0.003
bars-0.003
Token.
Token ID13
Feature activation+6.00e-02

Pos logit contributions
ork+0.084
cious+0.079
Po+0.074
 bank+0.072
Cr+0.072
Neg logit contributions
run-0.110
 swift-0.106
iddled-0.099
 pleasure-0.098
 Andy-0.094
Token The
Token ID383
Feature activation-2.86e-01

Pos logit contributions
 swift+0.001
run+0.001
bars+0.001
 older+0.001
iddled+0.001
Neg logit contributions
ork-0.001
Po-0.001
cious-0.001
G-0.001
Pear-0.001
Token cash
Token ID5003
Feature activation-9.86e-01

Pos logit contributions
Po+0.005
ork+0.005
S+0.005
cious+0.005
 cand+0.004
Neg logit contributions
run-0.008
iddled-0.007
 Andy-0.007
 Jake-0.006
ug-0.006
Tokenier
Token ID959
Feature activation-9.78e-01

Pos logit contributions
Po+0.015
cious+0.015
ork+0.014
sters+0.014
act+0.014
Neg logit contributions
run-0.026
 pleasure-0.024
 swift-0.024
 Faster-0.024
 fascination-0.024
Token said
Token ID531
Feature activation-6.09e-01

Pos logit contributions
 drowned+0.016
cious+0.016
ork+0.015
Po+0.015
mary+0.015
Neg logit contributions
 Faster-0.024
 swift-0.023
 Higher-0.023
 pleasure-0.023
 welcome-0.023
Token lay
Token ID3830
Feature activation-1.49e-01

Pos logit contributions
run+0.016
 swift+0.015
iddled+0.015
box+0.014
 of+0.014
Neg logit contributions
 drowned-0.011
Po-0.011
cious-0.010
Cr-0.010
G-0.010
Token still
Token ID991
Feature activation-3.94e-01

Pos logit contributions
Po+0.003
act+0.003
cious+0.003
 drowned+0.003
mar+0.003
Neg logit contributions
run-0.003
 swift-0.003
 of-0.003
iddled-0.003
bars-0.003
Token in
Token ID287
Feature activation+4.40e-01

Pos logit contributions
cious+0.006
 drowned+0.006
mar+0.006
 sting+0.006
Po+0.006
Neg logit contributions
run-0.010
iddled-0.009
old-0.009
box-0.008
 swift-0.008
Token Mum
Token ID38367
Feature activation-1.54e+00

Pos logit contributions
run+0.010
bars+0.010
iddled+0.010
 swift+0.009
 older+0.009
Neg logit contributions
Po-0.008
G-0.007
 drowned-0.007
S-0.007
ork-0.007
Tokenâ
Token ID22940
Feature activation-2.81e-01

Pos logit contributions
cious+0.017
mary+0.013
ache+0.012
 drowned+0.011
 bank+0.011
Neg logit contributions
iddled-0.049
 swift-0.047
run-0.046
 Faster-0.046
 pleasure-0.045
Token
Token ID26391
Feature activation-4.54e+00

Pos logit contributions
cious+0.005
Po+0.004
 drowned+0.004
G+0.004
act+0.004
Neg logit contributions
run-0.007
 of-0.007
 swift-0.007
 older-0.007
bars-0.006
Token
Token ID8151
Feature activation+4.51e-01

Pos logit contributions
Po+0.079
act+0.076
Pear+0.073
 gag+0.073
cious+0.072
Neg logit contributions
run-0.109
 of-0.102
box-0.098
 Jake-0.098
 swift-0.097
Tokens
Token ID82
Feature activation-3.55e-01

Pos logit contributions
iddled+0.011
run+0.010
 fascination+0.010
 swift+0.010
 courage+0.010
Neg logit contributions
S-0.008
ork-0.008
cious-0.008
uts-0.008
G-0.008
Token arms
Token ID5101
Feature activation+2.22e-01

Pos logit contributions
 bank+0.007
 throats+0.007
cious+0.007
 drowned+0.007
ork+0.006
Neg logit contributions
iddled-0.008
run-0.007
 swift-0.007
 Faster-0.007
ver-0.007
Token.
Token ID13
Feature activation+7.29e-01

Pos logit contributions
run+0.006
 swift+0.006
iddled+0.006
 pleasure+0.006
 Jake+0.006
Neg logit contributions
cious-0.003
uts-0.002
mary-0.002
ache-0.002
 sting-0.002
Token Mum
Token ID38367
Feature activation-1.32e-02

Pos logit contributions
box+0.016
 swift+0.016
run+0.016
iddled+0.016
bars+0.016
Neg logit contributions
G-0.013
S-0.012
Po-0.012
 Gig-0.012
cious-0.012
Token you
Token ID345
Feature activation-1.97e-01

Pos logit contributions
run+0.044
iddled+0.041
bars+0.038
 swift+0.038
box+0.037
Neg logit contributions
 drowned-0.027
Po-0.026
cious-0.026
 pits-0.024
Cr-0.024
Token go
Token ID467
Feature activation-6.43e-01

Pos logit contributions
 drowned+0.003
cious+0.003
Po+0.003
 sting+0.003
act+0.003
Neg logit contributions
run-0.005
iddled-0.005
bars-0.005
 swift-0.005
 Step-0.004
Token,
Token ID11
Feature activation-9.92e-01

Pos logit contributions
act+0.013
Po+0.013
cious+0.012
 drowned+0.012
ork+0.012
Neg logit contributions
run-0.014
iddled-0.013
 swift-0.013
bars-0.012
 of-0.012
Token my
Token ID616
Feature activation-7.94e-01

Pos logit contributions
cious+0.016
S+0.016
Cr+0.016
 Gig+0.016
Po+0.016
Neg logit contributions
iddled-0.026
run-0.025
bars-0.023
 swift-0.022
 Step-0.021
Token d
Token ID288
Feature activation-1.50e-02

Pos logit contributions
actus+0.014
act+0.014
 peanut+0.014
 marriage+0.014
ork+0.014
Neg logit contributions
iddled-0.020
run-0.019
 Step-0.018
bars-0.017
 Faster-0.017
Tokenarl
Token ID7063
Feature activation-4.50e+00

Pos logit contributions
act+0.000
ork+0.000
S+0.000
actus+0.000
Po+0.000
Neg logit contributions
 courage-0.000
 swift-0.000
 fascination-0.000
 Jake-0.000
iddled-0.000
Tokenings
Token ID654
Feature activation-7.13e-01

Pos logit contributions
cious+0.081
Po+0.080
act+0.076
 drowned+0.075
Pear+0.074
Neg logit contributions
run-0.105
 of-0.101
bars-0.095
 older-0.095
 swift-0.095
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
Po+0.011
S+0.011
mar+0.011
G+0.011
ork+0.011
Neg logit contributions
run-0.018
 swift-0.017
iddled-0.017
 fascination-0.017
 younger-0.016
Token Enjoy
Token ID18179
Feature activation-4.01e+00

Pos logit contributions
cious+0.005
Po+0.005
 Gig+0.005
G+0.005
Cr+0.005
Neg logit contributions
run-0.006
iddled-0.006
 swift-0.006
 older-0.006
 courage-0.006
Token your
Token ID534
Feature activation-1.25e+00

Pos logit contributions
Po+0.086
cious+0.086
 drowned+0.083
act+0.083
Pear+0.081
Neg logit contributions
run-0.079
 of-0.076
 older-0.071
 swift-0.071
 courage-0.070
Token lunch
Token ID9965
Feature activation-3.49e+00

Pos logit contributions
cious+0.018
act+0.018
Po+0.017
 drowned+0.017
 gag+0.017
Neg logit contributions
run-0.035
iddled-0.034
bars-0.032
 of-0.031
 swift-0.031
Token grandma
Token ID49890
Feature activation-7.54e-01

Pos logit contributions
iddled+0.000
run+0.000
bars+0.000
 Faster+0.000
 Step+0.000
Neg logit contributions
 drowned-0.000
ork-0.000
 bank-0.000
Po-0.000
act-0.000
Token's
Token ID338
Feature activation-1.38e-01

Pos logit contributions
cious+0.011
 bank+0.010
mary+0.010
 drowned+0.009
Po+0.009
Neg logit contributions
iddled-0.020
 swift-0.020
run-0.020
 Faster-0.019
bars-0.019
Token house
Token ID2156
Feature activation-2.99e-01

Pos logit contributions
 bank+0.002
ork+0.002
 drowned+0.002
Po+0.002
 harmful+0.002
Neg logit contributions
iddled-0.004
run-0.003
bars-0.003
 Faster-0.003
osh-0.003
Token for
Token ID329
Feature activation-2.25e-01

Pos logit contributions
 bank+0.004
ork+0.004
 drowned+0.003
cious+0.003
mary+0.003
Neg logit contributions
iddled-0.008
 swift-0.008
 Faster-0.008
run-0.008
 fascination-0.007
Token safe
Token ID3338
Feature activation-1.63e+00

Pos logit contributions
Po+0.005
 drowned+0.005
cious+0.005
G+0.004
Cr+0.004
Neg logit contributions
run-0.005
bars-0.004
iddled-0.004
 swift-0.004
 of-0.004
Tokenkeeping
Token ID19934
Feature activation-4.50e+00

Pos logit contributions
 drowned+0.020
cious+0.020
Po+0.020
keeping+0.019
act+0.019
Neg logit contributions
run-0.047
 swift-0.046
iddled-0.044
bars-0.044
 Faster-0.042
Token.
Token ID13
Feature activation-5.92e-02

Pos logit contributions
cious+0.087
 drowned+0.082
Po+0.082
ork+0.082
act+0.080
Neg logit contributions
run-0.100
 swift-0.096
iddled-0.092
 older-0.091
 of-0.088
Token
Token ID220
Feature activation+4.26e-01

Pos logit contributions
 Gig+0.001
Po+0.001
ork+0.001
G+0.001
S+0.001
Neg logit contributions
 swift-0.001
run-0.001
iddled-0.001
 older-0.001
 courage-0.001
Token
Token ID198
Feature activation+6.95e-02

Pos logit contributions
 swift+0.012
 fascination+0.012
isher+0.012
 courage+0.012
 older+0.012
Neg logit contributions
uts-0.005
ork-0.005
S-0.004
G-0.004
Po-0.004
Token
Token ID198
Feature activation-6.83e-01

Pos logit contributions
run+0.002
 older+0.002
bars+0.002
 swift+0.002
iddled+0.001
Neg logit contributions
G-0.001
Po-0.001
S-0.001
Pear-0.001
ork-0.001
TokenWhen
Token ID2215
Feature activation-7.16e-01

Pos logit contributions
Po+0.012
cious+0.012
G+0.011
act+0.011
Cr+0.011
Neg logit contributions
run-0.017
 older-0.015
 swift-0.015
 of-0.015
iddled-0.015
Token Then
Token ID3244
Feature activation+1.18e-01

Pos logit contributions
 swift+0.002
run+0.002
iddled+0.002
 older+0.002
 courage+0.002
Neg logit contributions
 Gig-0.001
Po-0.001
G-0.001
act-0.001
Cr-0.001
Token he
Token ID339
Feature activation+2.99e-01

Pos logit contributions
run+0.002
iddled+0.002
 of+0.002
 swift+0.002
bars+0.002
Neg logit contributions
Po-0.003
cious-0.003
act-0.003
 drowned-0.003
G-0.002
Token went
Token ID1816
Feature activation-1.02e-01

Pos logit contributions
run+0.008
 swift+0.007
bars+0.007
 of+0.007
iddled+0.007
Neg logit contributions
Po-0.005
cious-0.005
 drowned-0.005
Cr-0.005
act-0.005
Token to
Token ID284
Feature activation-2.06e-01

Pos logit contributions
Po+0.002
cious+0.002
act+0.002
ork+0.002
G+0.002
Neg logit contributions
run-0.002
 swift-0.002
bars-0.002
iddled-0.002
 of-0.002
Token the
Token ID262
Feature activation-2.25e-01

Pos logit contributions
 Gig+0.003
Po+0.003
 drowned+0.003
Cr+0.003
G+0.003
Neg logit contributions
run-0.005
bars-0.005
iddled-0.005
 swift-0.005
box-0.004
Token checkout
Token ID28006
Feature activation-4.48e+00

Pos logit contributions
cious+0.007
act+0.007
Po+0.007
uts+0.006
 drowned+0.006
Neg logit contributions
 older-0.002
box-0.002
 of-0.002
 welcome-0.002
run-0.002
Token and
Token ID290
Feature activation+7.69e-01

Pos logit contributions
cious+0.085
ork+0.083
Po+0.080
Cr+0.078
act+0.077
Neg logit contributions
run-0.103
 swift-0.097
iddled-0.094
 pleasure-0.090
 courage-0.088
Token paid
Token ID3432
Feature activation+2.09e-01

Pos logit contributions
run+0.016
 of+0.014
 swift+0.014
 older+0.014
iddled+0.014
Neg logit contributions
cious-0.016
Po-0.016
act-0.016
 drowned-0.015
G-0.015
Token for
Token ID329
Feature activation-4.32e-01

Pos logit contributions
iddled+0.004
run+0.004
bars+0.004
 swift+0.003
box+0.003
Neg logit contributions
 Gig-0.004
Po-0.004
 drowned-0.004
ork-0.004
 pits-0.004
Token it
Token ID340
Feature activation+9.69e-02

Pos logit contributions
Po+0.009
cious+0.009
act+0.009
 drowned+0.009
Cr+0.009
Neg logit contributions
run-0.009
 of-0.008
 swift-0.008
iddled-0.008
bars-0.007
Token.
Token ID13
Feature activation+8.33e-02

Pos logit contributions
iddled+0.003
run+0.003
 swift+0.002
bars+0.002
 fascination+0.002
Neg logit contributions
cious-0.001
ork-0.001
Po-0.001
 drowned-0.001
mar-0.001
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
cious+0.010
Po+0.009
 gag+0.009
 drowned+0.009
act+0.009
Neg logit contributions
run-0.012
iddled-0.011
 swift-0.011
 of-0.011
 Step-0.011
Token They
Token ID1119
Feature activation+4.65e-01

Pos logit contributions
run+0.002
 of+0.002
 older+0.002
 Jake+0.002
iddled+0.002
Neg logit contributions
cious-0.003
Po-0.003
act-0.003
 drowned-0.003
 gag-0.003
Token walked
Token ID6807
Feature activation+9.27e-01

Pos logit contributions
run+0.010
 of+0.010
 swift+0.009
iddled+0.009
 older+0.009
Neg logit contributions
Po-0.009
cious-0.009
act-0.009
 drowned-0.009
G-0.008
Token past
Token ID1613
Feature activation+8.06e-01

Pos logit contributions
run+0.019
bars+0.017
 of+0.017
iddled+0.017
 swift+0.017
Neg logit contributions
cious-0.020
Po-0.020
act-0.019
 drowned-0.019
G-0.019
Token the
Token ID262
Feature activation-1.40e+00

Pos logit contributions
run+0.017
iddled+0.016
box+0.016
 older+0.016
 swift+0.016
Neg logit contributions
act-0.015
Po-0.015
cious-0.015
 drowned-0.015
ork-0.014
Token bad
Token ID2089
Feature activation-4.48e+00

Pos logit contributions
cious+0.031
Po+0.031
act+0.030
Pear+0.029
G+0.029
Neg logit contributions
run-0.026
 of-0.026
 swift-0.025
 older-0.025
 courage-0.024
Token dog
Token ID3290
Feature activation-9.87e-01

Pos logit contributions
 pulp+0.118
cious+0.117
Pear+0.116
Po+0.116
act+0.113
Neg logit contributions
 of-0.068
 welcome-0.062
 older-0.061
 courage-0.059
 swift-0.058
Token and
Token ID290
Feature activation+7.65e-01

Pos logit contributions
 drowned+0.018
cious+0.017
 bank+0.017
ork+0.016
mary+0.016
Neg logit contributions
run-0.023
iddled-0.023
bars-0.021
 swift-0.021
 pleasure-0.021
Token continued
Token ID3767
Feature activation-5.72e-01

Pos logit contributions
run+0.017
 of+0.016
 swift+0.016
iddled+0.015
 older+0.015
Neg logit contributions
cious-0.015
Po-0.015
act-0.014
 drowned-0.014
G-0.014
Token their
Token ID511
Feature activation+6.93e-02

Pos logit contributions
act+0.015
Po+0.014
cious+0.014
 pulp+0.014
actus+0.014
Neg logit contributions
run-0.010
 of-0.009
 older-0.008
 Jake-0.008
 swift-0.008
Token walk
Token ID2513
Feature activation+1.03e+00

Pos logit contributions
run+0.001
 of+0.001
 courage+0.001
 older+0.001
 swift+0.001
Neg logit contributions
Po-0.002
act-0.002
 pulp-0.002
cious-0.002
 gag-0.002
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
run+0.025
iddled+0.024
 swift+0.024
bars+0.024
 Faster+0.022
Neg logit contributions
ork-0.013
cious-0.013
Po-0.011
uts-0.011
 pits-0.011
Token they
Token ID484
Feature activation+3.19e-01

Pos logit contributions
run+0.024
 of+0.022
 swift+0.022
iddled+0.022
bars+0.022
Neg logit contributions
cious-0.020
Po-0.020
act-0.019
 drowned-0.019
G-0.019
Token went
Token ID1816
Feature activation-2.18e-01

Pos logit contributions
run+0.009
 swift+0.008
bars+0.008
 older+0.008
 Faster+0.008
Neg logit contributions
cious-0.005
Po-0.005
 drowned-0.005
G-0.005
Cr-0.004
Token to
Token ID284
Feature activation-3.46e-01

Pos logit contributions
cious+0.005
Po+0.005
act+0.004
 drowned+0.004
ork+0.004
Neg logit contributions
run-0.004
 swift-0.004
 of-0.004
iddled-0.004
bars-0.004
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
cious+0.007
Po+0.007
 drowned+0.007
act+0.007
G+0.007
Neg logit contributions
run-0.007
 of-0.006
 swift-0.006
bars-0.006
iddled-0.006
Token checkout
Token ID28006
Feature activation-4.48e+00

Pos logit contributions
cious+0.004
Po+0.004
act+0.004
 drowned+0.004
G+0.004
Neg logit contributions
run-0.003
 of-0.002
 older-0.002
box-0.002
 swift-0.002
Token counter
Token ID3753
Feature activation-1.20e+00

Pos logit contributions
cious+0.085
ork+0.083
Po+0.080
act+0.079
Cr+0.078
Neg logit contributions
run-0.103
 swift-0.096
iddled-0.093
 pleasure-0.089
 Faster-0.088
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
cious+0.017
 bank+0.016
ork+0.016
Po+0.016
 drowned+0.016
Neg logit contributions
run-0.031
 swift-0.031
iddled-0.030
 Faster-0.030
 pleasure-0.028
Token
Token ID198
Feature activation+4.90e-01

Pos logit contributions
run+0.009
 of+0.009
 swift+0.008
iddled+0.008
 older+0.008
Neg logit contributions
cious-0.012
Po-0.012
act-0.011
 drowned-0.011
G-0.011
Token
Token ID198
Feature activation-4.40e-01

Pos logit contributions
run+0.010
bars+0.010
 swift+0.009
iddled+0.009
 older+0.009
Neg logit contributions
Po-0.010
cious-0.010
G-0.009
act-0.009
Pear-0.009
TokenM
Token ID44
Feature activation-1.64e+00

Pos logit contributions
act+0.008
cious+0.008
Po+0.007
 drowned+0.007
 gag+0.007
Neg logit contributions
run-0.010
 of-0.010
 swift-0.010
iddled-0.009
 older-0.009
Token Then
Token ID3244
Feature activation+5.28e-01

Pos logit contributions
run+0.013
iddled+0.012
bars+0.012
 swift+0.012
 of+0.012
Neg logit contributions
Po-0.010
cious-0.010
G-0.010
 Gig-0.010
Pear-0.010
Token they
Token ID484
Feature activation+6.85e-01

Pos logit contributions
run+0.012
 of+0.011
 swift+0.010
iddled+0.010
bars+0.010
Neg logit contributions
Po-0.010
cious-0.010
act-0.010
 drowned-0.010
G-0.010
Token went
Token ID1816
Feature activation-8.28e-02

Pos logit contributions
run+0.017
 swift+0.016
 of+0.015
 older+0.015
iddled+0.015
Neg logit contributions
Po-0.012
cious-0.011
 drowned-0.011
G-0.011
Pear-0.011
Token to
Token ID284
Feature activation-2.36e-01

Pos logit contributions
cious+0.001
Po+0.001
ork+0.001
act+0.001
G+0.001
Neg logit contributions
run-0.002
bars-0.002
iddled-0.002
 swift-0.002
 of-0.002
Token the
Token ID262
Feature activation-6.82e-02

Pos logit contributions
Po+0.005
cious+0.005
Pear+0.004
 drowned+0.004
G+0.004
Neg logit contributions
run-0.005
bars-0.005
iddled-0.005
 swift-0.005
 of-0.005
Token checkout
Token ID28006
Feature activation-4.47e+00

Pos logit contributions
Po+0.001
cious+0.001
act+0.001
 drowned+0.001
Cr+0.001
Neg logit contributions
run-0.001
iddled-0.001
 swift-0.001
 of-0.001
bars-0.001
Token.
Token ID13
Feature activation+9.02e-01

Pos logit contributions
ork+0.080
cious+0.079
uts+0.071
 bank+0.070
Cr+0.068
Neg logit contributions
run-0.110
 swift-0.105
iddled-0.101
 pleasure-0.098
 Faster-0.096
Token Daisy
Token ID40355
Feature activation-1.02e+00

Pos logit contributions
run+0.017
bars+0.015
iddled+0.015
 of+0.015
 swift+0.015
Neg logit contributions
cious-0.020
Po-0.020
act-0.019
G-0.019
Pear-0.019
Token was
Token ID373
Feature activation-3.50e-01

Pos logit contributions
cious+0.017
 drowned+0.017
Po+0.016
mary+0.015
act+0.015
Neg logit contributions
run-0.027
 swift-0.025
iddled-0.024
bars-0.024
ay-0.024
Token feeling
Token ID4203
Feature activation-1.24e+00

Pos logit contributions
cious+0.005
 drowned+0.005
 gag+0.005
Cr+0.005
mary+0.005
Neg logit contributions
run-0.009
bars-0.009
iddled-0.008
ay-0.008
mith-0.008
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
cious+0.022
Po+0.022
 drowned+0.021
act+0.021
 gag+0.020
Neg logit contributions
run-0.030
bars-0.027
iddled-0.027
box-0.026
 swift-0.026
Token carefully
Token ID7773
Feature activation-2.07e-01

Pos logit contributions
run+0.020
 swift+0.017
bars+0.017
 of+0.017
iddled+0.017
Neg logit contributions
Po-0.014
cious-0.014
 drowned-0.014
act-0.014
Cr-0.014
Token counted
Token ID14789
Feature activation+8.77e-01

Pos logit contributions
cious+0.004
Po+0.004
act+0.004
 drowned+0.004
Cr+0.004
Neg logit contributions
run-0.005
 swift-0.004
 of-0.004
bars-0.004
 older-0.004
Token out
Token ID503
Feature activation+1.40e+00

Pos logit contributions
run+0.019
 swift+0.017
 of+0.017
iddled+0.017
 older+0.017
Neg logit contributions
Po-0.017
cious-0.017
act-0.017
G-0.016
Cr-0.016
Token everyone
Token ID2506
Feature activation-4.43e-01

Pos logit contributions
run+0.029
 swift+0.027
bars+0.027
iddled+0.027
box+0.026
Neg logit contributions
Po-0.028
cious-0.027
ork-0.026
act-0.026
Cr-0.026
Tokenâ
Token ID22940
Feature activation-3.72e-02

Pos logit contributions
cious+0.008
act+0.008
mary+0.008
Po+0.007
 drowned+0.007
Neg logit contributions
run-0.010
iddled-0.010
 swift-0.009
 Step-0.009
 pleasure-0.009
Token
Token ID26391
Feature activation-4.46e+00

Pos logit contributions
cious+0.001
Po+0.001
act+0.001
 drowned+0.001
G+0.001
Neg logit contributions
run-0.001
 of-0.001
 swift-0.001
 older-0.001
iddled-0.001
Token
Token ID8151
Feature activation-9.48e-01

Pos logit contributions
 gag+0.066
Po+0.065
 pulp+0.060
act+0.060
Pear+0.058
Neg logit contributions
run-0.120
 of-0.113
box-0.110
 Jake-0.110
 Step-0.105
Tokens
Token ID82
Feature activation-3.16e-01

Pos logit contributions
uts+0.013
cious+0.013
ork+0.012
S+0.012
G+0.012
Neg logit contributions
iddled-0.025
run-0.025
 swift-0.024
 pleasure-0.024
 fascination-0.024
Token share
Token ID2648
Feature activation-2.72e+00

Pos logit contributions
cious+0.006
 pits+0.006
act+0.005
Po+0.005
 drowned+0.005
Neg logit contributions
bars-0.007
 Faster-0.007
iddled-0.007
run-0.007
 swift-0.007
Token.
Token ID13
Feature activation-3.73e-01

Pos logit contributions
cious+0.057
Po+0.057
act+0.056
 drowned+0.054
Cr+0.054
Neg logit contributions
run-0.056
 of-0.050
 swift-0.050
iddled-0.049
 older-0.049
Token
Token ID220
Feature activation-1.05e-01

Pos logit contributions
 Gig+0.006
cious+0.006
G+0.006
Cr+0.006
Po+0.006
Neg logit contributions
 swift-0.009
run-0.009
iddled-0.008
 courage-0.008
ma-0.008
Token off
Token ID572
Feature activation-1.36e+00

Pos logit contributions
run+0.021
 swift+0.019
 pleasure+0.019
ay+0.019
iddled+0.018
Neg logit contributions
 drowned-0.013
cious-0.013
 pulp-0.013
Po-0.013
 bank-0.012
Token they
Token ID484
Feature activation+3.69e-02

Pos logit contributions
ork+0.019
cious+0.019
Po+0.018
mary+0.018
 drowned+0.017
Neg logit contributions
run-0.037
 swift-0.035
iddled-0.034
 pleasure-0.034
 fascination-0.033
Token went
Token ID1816
Feature activation-4.25e-01

Pos logit contributions
run+0.001
 swift+0.001
iddled+0.001
 older+0.001
 courage+0.001
Neg logit contributions
 drowned-0.000
cious-0.000
 bank-0.000
Po-0.000
ork-0.000
Token to
Token ID284
Feature activation-9.67e-02

Pos logit contributions
ork+0.007
Po+0.007
cious+0.007
act+0.007
 drowned+0.006
Neg logit contributions
run-0.010
 swift-0.010
iddled-0.009
 pleasure-0.009
bars-0.009
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
 drowned+0.002
 Gig+0.002
cious+0.001
Po+0.001
ork+0.001
Neg logit contributions
run-0.002
bars-0.002
iddled-0.002
 swift-0.002
box-0.002
Token checkout
Token ID28006
Feature activation-4.46e+00

Pos logit contributions
Po+0.004
cious+0.004
act+0.004
 drowned+0.004
Cr+0.004
Neg logit contributions
run-0.004
 of-0.004
iddled-0.004
 swift-0.004
bars-0.004
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
cious+0.080
ork+0.080
Po+0.074
act+0.073
Cr+0.072
Neg logit contributions
run-0.107
 swift-0.101
iddled-0.098
 pleasure-0.094
 courage-0.093
Token P
Token ID350
Feature activation-8.80e-01

Pos logit contributions
run+0.008
 swift+0.007
iddled+0.007
bars+0.007
 older+0.007
Neg logit contributions
cious-0.007
Po-0.007
G-0.006
S-0.006
act-0.006
Tokenhew
Token ID6391
Feature activation-6.53e-02

Pos logit contributions
act+0.017
cious+0.017
Po+0.016
ork+0.016
actus+0.015
Neg logit contributions
run-0.020
 swift-0.019
 of-0.018
bars-0.018
iddled-0.018
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
cious+0.001
 drowned+0.001
ache+0.001
ork+0.001
uts+0.001
Neg logit contributions
run-0.001
 Faster-0.001
 fascination-0.001
 swift-0.001
 pleasure-0.001
Token Mum
Token ID38367
Feature activation-8.12e-02

Pos logit contributions
cious+0.003
Po+0.003
 Gig+0.003
G+0.003
 drowned+0.003
Neg logit contributions
run-0.005
iddled-0.005
bars-0.005
 older-0.005
 swift-0.004
Token shelf
Token ID18316
Feature activation-6.09e-01

Pos logit contributions
cious+0.014
 bank+0.013
ork+0.013
actus+0.013
Po+0.013
Neg logit contributions
run-0.021
iddled-0.019
 swift-0.018
 of-0.017
osh-0.017
Token and
Token ID290
Feature activation+9.04e-01

Pos logit contributions
ork+0.007
 bank+0.007
cious+0.007
mary+0.007
uts+0.006
Neg logit contributions
run-0.017
iddled-0.016
 swift-0.015
 fascination-0.015
 pleasure-0.015
Token went
Token ID1816
Feature activation-3.86e-01

Pos logit contributions
run+0.022
 swift+0.021
 older+0.020
 of+0.020
iddled+0.020
Neg logit contributions
cious-0.015
Po-0.015
 drowned-0.015
G-0.014
Cr-0.014
Token to
Token ID284
Feature activation-1.21e-01

Pos logit contributions
cious+0.007
ork+0.007
Po+0.006
act+0.006
 drowned+0.006
Neg logit contributions
run-0.009
 swift-0.008
iddled-0.008
 pleasure-0.008
bars-0.008
Token the
Token ID262
Feature activation+3.87e-02

Pos logit contributions
cious+0.002
 drowned+0.002
Po+0.002
 Gig+0.002
ork+0.002
Neg logit contributions
run-0.003
bars-0.003
 swift-0.003
iddled-0.003
box-0.003
Token checkout
Token ID28006
Feature activation-4.45e+00

Pos logit contributions
run+0.001
iddled+0.001
 swift+0.001
 of+0.001
bars+0.001
Neg logit contributions
cious-0.001
Po-0.001
act-0.001
ork-0.001
Cr-0.001
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
ork+0.078
cious+0.077
Po+0.071
Cr+0.069
uts+0.068
Neg logit contributions
run-0.108
 swift-0.105
iddled-0.098
 pleasure-0.098
 Faster-0.095
Token They
Token ID1119
Feature activation+2.71e-02

Pos logit contributions
cious+0.002
Po+0.002
G+0.002
ork+0.001
S+0.001
Neg logit contributions
run-0.002
 swift-0.002
box-0.002
 courage-0.002
iddled-0.002
Token paid
Token ID3432
Feature activation-1.36e-02

Pos logit contributions
run+0.001
 swift+0.001
 older+0.001
 courage+0.001
 pleasure+0.001
Neg logit contributions
cious-0.000
Po-0.000
 drowned-0.000
G-0.000
Cr-0.000
Token for
Token ID329
Feature activation-7.73e-01

Pos logit contributions
cious+0.000
ork+0.000
 drowned+0.000
act+0.000
Po+0.000
Neg logit contributions
iddled-0.000
run-0.000
 swift-0.000
bars-0.000
 Step-0.000
Token it
Token ID340
Feature activation-3.45e-03

Pos logit contributions
Po+0.014
cious+0.014
 drowned+0.014
act+0.013
Cr+0.013
Neg logit contributions
run-0.018
iddled-0.016
 of-0.016
 swift-0.016
bars-0.016
Token should
Token ID815
Feature activation-1.46e+00

Pos logit contributions
run+0.012
iddled+0.011
 swift+0.011
bars+0.011
 older+0.011
Neg logit contributions
 drowned-0.010
Po-0.010
cious-0.009
act-0.009
Pear-0.009
Token remember
Token ID3505
Feature activation+4.54e-01

Pos logit contributions
Po+0.024
cious+0.023
 drowned+0.023
act+0.022
G+0.022
Neg logit contributions
run-0.037
iddled-0.034
 swift-0.034
 of-0.034
bars-0.033
Token to
Token ID284
Feature activation-2.75e-01

Pos logit contributions
iddled+0.012
run+0.012
 swift+0.011
bars+0.011
 Faster+0.011
Neg logit contributions
act-0.007
Po-0.007
 drowned-0.007
G-0.006
mar-0.006
Token stay
Token ID2652
Feature activation-5.17e-01

Pos logit contributions
Po+0.005
 drowned+0.005
cious+0.005
G+0.005
Cr+0.005
Neg logit contributions
run-0.007
iddled-0.006
bars-0.006
 swift-0.006
 of-0.006
Token close
Token ID1969
Feature activation-7.34e-01

Pos logit contributions
 drowned+0.010
Po+0.010
 harmful+0.009
cious+0.009
act+0.009
Neg logit contributions
bars-0.012
run-0.012
iddled-0.011
 Faster-0.011
 Jake-0.010
Tokenby
Token ID1525
Feature activation-4.44e+00

Pos logit contributions
act+0.010
 drowned+0.009
cious+0.009
 Gig+0.009
G+0.009
Neg logit contributions
iddled-0.020
run-0.019
 Faster-0.017
 fascination-0.017
 swift-0.017
Token,"
Token ID553
Feature activation+8.40e-01

Pos logit contributions
cious+0.090
Po+0.088
act+0.087
 drowned+0.084
ork+0.084
Neg logit contributions
run-0.096
iddled-0.087
 swift-0.086
 of-0.085
 older-0.083
Token she
Token ID673
Feature activation+1.06e+00

Pos logit contributions
run+0.020
iddled+0.018
box+0.018
 swift+0.017
bars+0.017
Neg logit contributions
Po-0.017
G-0.016
Cr-0.015
 Gig-0.015
cious-0.015
Token said
Token ID531
Feature activation-1.83e-01

Pos logit contributions
run+0.024
bars+0.022
 pleasure+0.022
 Faster+0.021
iddled+0.021
Neg logit contributions
Po-0.021
G-0.020
Cr-0.020
cious-0.020
mar-0.019
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
cious+0.003
uts+0.002
Po+0.002
ache+0.002
ork+0.002
Neg logit contributions
iddled-0.005
run-0.005
 swift-0.005
ay-0.005
bars-0.004
Token
Token ID198
Feature activation+9.52e-01

Pos logit contributions
run+0.052
 swift+0.051
iddled+0.051
bars+0.051
 older+0.049
Neg logit contributions
G-0.029
Po-0.028
 Gig-0.027
S-0.026
Pear-0.026
Token they
Token ID484
Feature activation+8.05e-01

Pos logit contributions
cious+0.035
act+0.034
Po+0.034
 pulp+0.033
Pear+0.033
Neg logit contributions
 of-0.021
run-0.021
 Jake-0.019
 older-0.019
 swift-0.019
Token reached
Token ID4251
Feature activation+7.83e-01

Pos logit contributions
run+0.017
 of+0.017
 swift+0.016
 older+0.015
 Jake+0.015
Neg logit contributions
act-0.017
cious-0.017
Po-0.016
ork-0.015
 pulp-0.015
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
iddled+0.020
run+0.019
 swift+0.018
ok+0.018
bars+0.018
Neg logit contributions
ork-0.011
 Woo-0.011
 drowned-0.011
 pits-0.010
 Gig-0.010
Token doctor
Token ID6253
Feature activation-1.74e+00

Pos logit contributions
Po+0.002
ork+0.002
act+0.002
 drowned+0.002
Pear+0.002
Neg logit contributions
run-0.004
iddled-0.003
 swift-0.003
bars-0.003
 of-0.003
Tokenâ
Token ID22940
Feature activation+1.56e-01

Pos logit contributions
cious+0.027
mary+0.024
 bank+0.023
ork+0.023
Po+0.021
Neg logit contributions
iddled-0.045
 pleasure-0.043
 Faster-0.042
run-0.042
 swift-0.042
Token
Token ID26391
Feature activation-4.42e+00

Pos logit contributions
run+0.004
 of+0.003
 swift+0.003
iddled+0.003
 older+0.003
Neg logit contributions
cious-0.003
Po-0.003
act-0.003
 drowned-0.003
G-0.003
Token
Token ID8151
Feature activation+1.28e-01

Pos logit contributions
Po+0.086
 pulp+0.083
act+0.083
cious+0.081
G+0.080
Neg logit contributions
run-0.098
 of-0.091
 Jake-0.091
box-0.087
 swift-0.086
Tokens
Token ID82
Feature activation+3.90e-02

Pos logit contributions
iddled+0.003
 fascination+0.003
run+0.003
 swift+0.003
 pleasure+0.003
Neg logit contributions
S-0.002
ork-0.002
G-0.002
uts-0.002
 drowned-0.002
Token office
Token ID2607
Feature activation-7.23e-01

Pos logit contributions
iddled+0.001
 Faster+0.001
bars+0.001
 swift+0.001
run+0.001
Neg logit contributions
 bank-0.001
 gag-0.001
ork-0.001
cious-0.001
 drowned-0.001
Token,
Token ID11
Feature activation+5.18e-01

Pos logit contributions
ork+0.011
cious+0.010
Po+0.010
ache+0.009
 bank+0.009
Neg logit contributions
run-0.019
 pleasure-0.018
 swift-0.018
iddled-0.018
 fascination-0.018
Token his
Token ID465
Feature activation+3.90e-01

Pos logit contributions
run+0.013
iddled+0.013
 swift+0.013
box+0.012
 fascination+0.012
Neg logit contributions
Po-0.008
S-0.008
G-0.008
cious-0.007
 drowned-0.007
Token two
Token ID734
Feature activation+1.62e+00

Pos logit contributions
 of+0.002
run+0.002
 older+0.002
 swift+0.002
 Jake+0.002
Neg logit contributions
Po-0.005
cious-0.005
act-0.005
ork-0.004
 gag-0.004
Token friends
Token ID2460
Feature activation+1.37e+00

Pos logit contributions
run+0.033
iddled+0.033
 Step+0.029
bars+0.029
 Faster+0.028
Neg logit contributions
act-0.036
cious-0.035
 pige-0.034
 drowned-0.034
Po-0.033
Token
Token ID6184
Feature activation+1.63e-01

Pos logit contributions
bars+0.034
run+0.033
iddled+0.033
 swift+0.032
 pleasure+0.031
Neg logit contributions
Po-0.021
cious-0.021
 drowned-0.021
S-0.020
G-0.020
Token
Token ID95
Feature activation-4.84e-01

Pos logit contributions
run+0.003
 Jake+0.003
 of+0.003
 Step+0.003
iddled+0.003
Neg logit contributions
 pulp-0.003
 gag-0.003
Po-0.003
 drowned-0.003
act-0.003
Token
Token ID26391
Feature activation-2.79e+00

Pos logit contributions
act+0.009
Po+0.009
 gag+0.009
 pulp+0.009
 drowned+0.009
Neg logit contributions
run-0.011
 of-0.010
iddled-0.010
 Jake-0.010
 swift-0.010
Token
Token ID447
Feature activation-4.41e+00

Pos logit contributions
 gag+0.059
 pulp+0.058
Po+0.057
act+0.057
 pits+0.056
Neg logit contributions
run-0.057
 Jake-0.055
 of-0.054
ay-0.052
box-0.050
Token
Token ID250
Feature activation-2.04e+00

Pos logit contributions
 gag+0.083
act+0.078
ork+0.078
 harmful+0.078
Po+0.077
Neg logit contributions
 Jake-0.092
ma-0.090
 Faster-0.089
 swift-0.089
run-0.088
Token Jack
Token ID3619
Feature activation-1.66e-01

Pos logit contributions
Po+0.032
cious+0.031
act+0.030
G+0.030
Cr+0.029
Neg logit contributions
run-0.053
 of-0.049
 swift-0.049
iddled-0.049
 older-0.048
Token and
Token ID290
Feature activation-7.84e-01

Pos logit contributions
cious+0.003
Po+0.003
act+0.003
S+0.003
ork+0.003
Neg logit contributions
run-0.004
bars-0.004
iddled-0.004
 swift-0.004
 older-0.004
Token Jill
Token ID22772
Feature activation+2.49e-01

Pos logit contributions
Po+0.018
cious+0.018
act+0.017
 drowned+0.017
G+0.017
Neg logit contributions
run-0.015
 of-0.014
 swift-0.013
iddled-0.013
 older-0.013
Token.
Token ID13
Feature activation-6.11e-02

Pos logit contributions
run+0.006
 swift+0.006
bars+0.006
 older+0.006
iddled+0.006
Neg logit contributions
cious-0.004
act-0.004
Po-0.004
 drowned-0.004
ork-0.004
TokenAs
Token ID1722
Feature activation-7.56e-01

Pos logit contributions
Po+0.002
cious+0.002
G+0.002
Cr+0.002
act+0.002
Neg logit contributions
run-0.002
 older-0.002
 swift-0.002
 of-0.002
iddled-0.002
Token they
Token ID484
Feature activation+8.42e-01

Pos logit contributions
cious+0.013
act+0.012
Po+0.012
 gag+0.012
Pear+0.012
Neg logit contributions
run-0.019
 of-0.018
 swift-0.018
 older-0.017
 Jake-0.017
Token walked
Token ID6807
Feature activation+1.84e+00

Pos logit contributions
run+0.018
 of+0.017
 swift+0.016
iddled+0.016
 older+0.016
Neg logit contributions
cious-0.017
Po-0.017
act-0.017
 drowned-0.016
G-0.016
Token to
Token ID284
Feature activation-5.73e-02

Pos logit contributions
run+0.040
iddled+0.036
 swift+0.035
bars+0.035
 Jake+0.034
Neg logit contributions
Po-0.036
act-0.036
cious-0.035
ork-0.034
 drowned-0.034
Token the
Token ID262
Feature activation-2.85e-01

Pos logit contributions
 Gig+0.001
Po+0.001
 drowned+0.001
Pear+0.001
G+0.001
Neg logit contributions
run-0.001
iddled-0.001
 swift-0.001
bars-0.001
box-0.001
Token checkout
Token ID28006
Feature activation-4.38e+00

Pos logit contributions
cious+0.007
Po+0.007
act+0.007
G+0.007
 drowned+0.007
Neg logit contributions
run-0.004
 of-0.004
 older-0.004
 swift-0.004
bars-0.004
Token,
Token ID11
Feature activation+6.65e-01

Pos logit contributions
ork+0.087
cious+0.086
Po+0.083
act+0.081
Cr+0.080
Neg logit contributions
run-0.097
 swift-0.090
iddled-0.087
 pleasure-0.084
 courage-0.080
Token Tim
Token ID5045
Feature activation+9.11e-01

Pos logit contributions
run+0.015
 of+0.014
 swift+0.014
iddled+0.013
 older+0.013
Neg logit contributions
Po-0.013
cious-0.013
act-0.012
 drowned-0.012
G-0.012
Tokenmy
Token ID1820
Feature activation-9.25e-02

Pos logit contributions
run+0.015
 Faster+0.015
 swift+0.015
iddled+0.014
 pleasure+0.014
Neg logit contributions
 drowned-0.021
Po-0.021
Pear-0.021
act-0.021
mar-0.020
Token noticed
Token ID6810
Feature activation+7.65e-01

Pos logit contributions
cious+0.002
Po+0.002
 drowned+0.002
act+0.001
G+0.001
Neg logit contributions
run-0.002
 swift-0.002
iddled-0.002
 of-0.002
 welcome-0.002
Token his
Token ID465
Feature activation+5.92e-01

Pos logit contributions
run+0.020
 swift+0.018
 welcome+0.018
iddled+0.017
 Step+0.017
Neg logit contributions
Po-0.012
act-0.012
G-0.012
 drowned-0.011
Cr-0.011
Token saved
Token ID7448
Feature activation-3.71e-01

Pos logit contributions
run+0.027
 swift+0.025
bars+0.024
 pleasure+0.024
iddled+0.024
Neg logit contributions
Po-0.023
 drowned-0.021
cious-0.021
Cr-0.021
G-0.020
Token Tim
Token ID5045
Feature activation-2.97e-01

Pos logit contributions
Po+0.009
cious+0.009
act+0.009
 drowned+0.008
G+0.008
Neg logit contributions
run-0.007
 of-0.006
 swift-0.006
iddled-0.006
 older-0.006
Tokenmy
Token ID1820
Feature activation-1.47e+00

Pos logit contributions
cious+0.008
act+0.008
 drowned+0.008
Po+0.008
ork+0.008
Neg logit contributions
run-0.004
iddled-0.004
 swift-0.003
 pleasure-0.003
 fascination-0.003
Token from
Token ID422
Feature activation-4.24e-01

Pos logit contributions
cious+0.032
Po+0.032
act+0.032
G+0.030
 drowned+0.030
Neg logit contributions
run-0.029
 of-0.027
 swift-0.026
 older-0.026
iddled-0.025
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
cious+0.010
Po+0.010
act+0.010
 drowned+0.009
G+0.009
Neg logit contributions
run-0.008
 of-0.007
 swift-0.007
iddled-0.007
 older-0.007
Token bad
Token ID2089
Feature activation-4.37e+00

Pos logit contributions
cious+0.023
act+0.022
Pear+0.022
Po+0.022
 gag+0.021
Neg logit contributions
 older-0.028
 of-0.027
run-0.027
 courage-0.027
 welcome-0.026
Token man
Token ID582
Feature activation-1.19e+00

Pos logit contributions
Pear+0.132
cious+0.128
mary+0.125
 gag+0.124
ossom+0.123
Neg logit contributions
 older-0.055
 of-0.054
 welcome-0.049
 courage-0.047
 younger-0.040
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
cious+0.022
 drowned+0.022
Po+0.021
ork+0.021
act+0.021
Neg logit contributions
run-0.026
iddled-0.026
 swift-0.025
bars-0.024
 older-0.024
Token Tim
Token ID5045
Feature activation+2.99e-01

Pos logit contributions
cious+0.007
act+0.007
Po+0.007
Pear+0.007
 gag+0.007
Neg logit contributions
run-0.004
 of-0.004
 older-0.004
 swift-0.004
 Jake-0.004
Tokenmy
Token ID1820
Feature activation-3.58e-01

Pos logit contributions
run+0.002
 of+0.001
 swift+0.001
 older+0.001
iddled+0.001
Neg logit contributions
cious-0.011
Po-0.011
act-0.010
G-0.010
ork-0.010
Token was
Token ID373
Feature activation-2.35e-01

Pos logit contributions
act+0.010
cious+0.010
 gag+0.010
 conclusions+0.010
 pulp+0.009
Neg logit contributions
 of-0.005
run-0.004
 older-0.004
 courage-0.004
 Jake-0.004
Token One
Token ID1881
Feature activation+1.45e+00

Pos logit contributions
Po+0.015
cious+0.015
act+0.015
Pear+0.014
ork+0.014
Neg logit contributions
run-0.020
 swift-0.019
 of-0.018
iddled-0.018
 older-0.018
Token day
Token ID1110
Feature activation-1.18e+00

Pos logit contributions
run+0.025
iddled+0.022
 swift+0.021
bars+0.021
 older+0.021
Neg logit contributions
cious-0.036
Po-0.035
act-0.035
G-0.034
 drowned-0.034
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
act+0.033
cious+0.033
Po+0.033
Cr+0.032
 drowned+0.032
Neg logit contributions
 of-0.017
run-0.016
 swift-0.013
 older-0.013
 welcome-0.013
Token Lily
Token ID20037
Feature activation-1.52e+00

Pos logit contributions
cious+0.022
act+0.022
Po+0.021
 drowned+0.021
G+0.020
Neg logit contributions
run-0.021
 of-0.020
 swift-0.019
 older-0.019
 Jake-0.019
Token was
Token ID373
Feature activation-2.31e-02

Pos logit contributions
cious+0.034
Po+0.034
act+0.033
 gag+0.033
Cr+0.032
Neg logit contributions
 of-0.028
run-0.027
 older-0.025
 Jake-0.025
box-0.024
Token playing
Token ID2712
Feature activation+2.45e+00

Pos logit contributions
cious+0.001
Po+0.001
act+0.001
 drowned+0.001
G+0.001
Neg logit contributions
run-0.000
 swift-0.000
iddled-0.000
bars-0.000
 of-0.000
Token in
Token ID287
Feature activation+8.96e-01

Pos logit contributions
run+0.050
bars+0.046
 swift+0.044
 older+0.043
iddled+0.043
Neg logit contributions
Po-0.053
cious-0.051
act-0.051
Cr-0.051
ork-0.050
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
run+0.018
 of+0.016
 swift+0.016
iddled+0.016
bars+0.016
Neg logit contributions
Po-0.019
cious-0.019
act-0.019
 drowned-0.018
G-0.018
Token park
Token ID3952
Feature activation+5.96e-01

Pos logit contributions
 of+0.011
run+0.011
 older+0.010
 swift+0.010
box+0.010
Neg logit contributions
cious-0.017
Po-0.016
 gag-0.016
act-0.016
 drowned-0.016
Token when
Token ID618
Feature activation-4.04e-01

Pos logit contributions
run+0.013
iddled+0.012
 swift+0.012
bars+0.011
 older+0.011
Neg logit contributions
cious-0.012
Po-0.012
act-0.011
Cr-0.011
G-0.011
Token she
Token ID673
Feature activation+2.62e-01

Pos logit contributions
cious+0.008
act+0.008
Po+0.008
Cr+0.008
 drowned+0.008
Neg logit contributions
run-0.009
 of-0.008
 swift-0.008
 older-0.008
 Jake-0.008
Token and
Token ID290
Feature activation+6.63e-01

Pos logit contributions
Po+0.007
ork+0.007
uts+0.007
cious+0.007
 drowned+0.006
Neg logit contributions
 pleasure-0.011
iddled-0.011
run-0.011
 swift-0.011
 Faster-0.011
Token ears
Token ID11368
Feature activation-8.79e-01

Pos logit contributions
run+0.014
 of+0.013
 swift+0.013
iddled+0.013
 older+0.013
Neg logit contributions
cious-0.013
Po-0.013
act-0.013
 drowned-0.012
G-0.012
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
Po+0.015
 drowned+0.015
cious+0.015
ork+0.014
 sting+0.014
Neg logit contributions
run-0.020
 pleasure-0.019
 older-0.019
 swift-0.019
iddled-0.019
Token Lily
Token ID20037
Feature activation-1.54e+00

Pos logit contributions
cious+0.002
act+0.002
 drowned+0.002
Po+0.002
mar+0.002
Neg logit contributions
run-0.002
 of-0.002
 older-0.002
 Jake-0.002
 swift-0.002
Token cried
Token ID16896
Feature activation-5.88e-01

Pos logit contributions
cious+0.026
Po+0.026
 drowned+0.025
act+0.025
Pear+0.024
Neg logit contributions
run-0.038
 of-0.035
 swift-0.035
iddled-0.035
 older-0.035
Token out
Token ID503
Feature activation+2.87e+00

Pos logit contributions
Po+0.013
cious+0.012
Pear+0.012
 drowned+0.012
act+0.012
Neg logit contributions
run-0.012
iddled-0.011
bars-0.011
 swift-0.010
box-0.010
Token in
Token ID287
Feature activation+1.20e-01

Pos logit contributions
run+0.053
iddled+0.047
 swift+0.047
bars+0.047
 of+0.047
Neg logit contributions
Po-0.065
cious-0.065
act-0.063
 drowned-0.062
ork-0.062
Token pain
Token ID2356
Feature activation-2.43e+00

Pos logit contributions
run+0.003
 of+0.003
 swift+0.003
 courage+0.002
 older+0.002
Neg logit contributions
act-0.002
cious-0.002
actus-0.002
Po-0.002
ork-0.002
Token and
Token ID290
Feature activation-1.28e-01

Pos logit contributions
cious+0.044
 drowned+0.043
Po+0.043
 sting+0.041
act+0.041
Neg logit contributions
run-0.055
 swift-0.052
 pleasure-0.052
bars-0.052
iddled-0.052
Token fear
Token ID3252
Feature activation-2.39e-01

Pos logit contributions
Po+0.002
cious+0.002
 drowned+0.002
act+0.002
 gag+0.002
Neg logit contributions
run-0.003
 of-0.003
 swift-0.003
iddled-0.003
 older-0.003
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
Po+0.004
 drowned+0.004
 sting+0.004
ork+0.004
cious+0.004
Neg logit contributions
run-0.006
 pleasure-0.005
 swift-0.005
box-0.005
bars-0.005
Token rabbit
Token ID22746
Feature activation-1.47e-01

Pos logit contributions
run+0.011
iddled+0.009
 of+0.009
bars+0.009
 swift+0.009
Neg logit contributions
Po-0.010
cious-0.009
act-0.009
 drowned-0.009
Cr-0.009
Token could
Token ID714
Feature activation-8.32e-01

Pos logit contributions
 drowned+0.003
Po+0.002
cious+0.002
Cr+0.002
Pear+0.002
Neg logit contributions
run-0.004
bars-0.004
iddled-0.004
 swift-0.003
 older-0.003
Token finally
Token ID3443
Feature activation+9.62e-01

Pos logit contributions
 drowned+0.009
Po+0.009
cious+0.009
 gag+0.009
G+0.009
Neg logit contributions
run-0.025
iddled-0.024
bars-0.023
 swift-0.023
 of-0.023
Token sleep
Token ID3993
Feature activation-1.45e+00

Pos logit contributions
run+0.022
iddled+0.020
bars+0.020
 swift+0.019
 of+0.019
Neg logit contributions
Po-0.018
cious-0.018
 drowned-0.018
act-0.018
ork-0.017
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
cious+0.030
Po+0.029
 drowned+0.029
ork+0.029
act+0.028
Neg logit contributions
run-0.031
iddled-0.027
bars-0.027
 swift-0.027
 older-0.026
Token all
Token ID477
Feature activation+2.43e+00

Pos logit contributions
 drowned+0.004
 sting+0.003
 harmful+0.003
 die+0.003
cious+0.003
Neg logit contributions
iddled-0.005
ay-0.005
run-0.005
bars-0.005
isher-0.005
Token wrapped
Token ID12908
Feature activation+5.54e-01

Pos logit contributions
run+0.059
iddled+0.057
box+0.055
bars+0.052
 swift+0.050
Neg logit contributions
 drowned-0.044
act-0.038
 sting-0.037
 Gig-0.037
 harmful-0.037
Token in
Token ID287
Feature activation+1.53e-01

Pos logit contributions
run+0.012
bars+0.011
 older+0.011
iddled+0.010
 swift+0.010
Neg logit contributions
act-0.011
Po-0.011
cious-0.011
 drowned-0.011
 Gig-0.011
Token the
Token ID262
Feature activation+7.26e-01

Pos logit contributions
bars+0.004
run+0.004
iddled+0.004
ver+0.004
 Step+0.004
Neg logit contributions
 peanut-0.003
 pulp-0.002
Cr-0.002
Po-0.002
 Gig-0.002
Token soft
Token ID2705
Feature activation-2.79e-01

Pos logit contributions
run+0.021
iddled+0.019
 Jake+0.019
bars+0.019
old+0.017
Neg logit contributions
 sting-0.012
Po-0.011
 pulp-0.011
 pits-0.011
cious-0.011
Token leaves
Token ID5667
Feature activation+1.32e+00

Pos logit contributions
cious+0.005
Po+0.005
act+0.005
ork+0.005
actus+0.004
Neg logit contributions
run-0.007
bars-0.006
iddled-0.006
 Jake-0.006
 swift-0.006
Token mom
Token ID1995
Feature activation-7.28e-01

Pos logit contributions
Po+0.001
 gag+0.001
act+0.001
cious+0.001
ork+0.001
Neg logit contributions
run-0.001
iddled-0.001
 swift-0.001
bars-0.001
 Faster-0.001
Token gave
Token ID2921
Feature activation+1.51e-01

Pos logit contributions
Po+0.013
act+0.012
cious+0.012
 drowned+0.012
G+0.012
Neg logit contributions
run-0.018
 swift-0.017
iddled-0.016
bars-0.016
 Faster-0.016
Token him
Token ID683
Feature activation+3.55e-01

Pos logit contributions
run+0.004
 swift+0.004
bars+0.004
iddled+0.003
 pleasure+0.003
Neg logit contributions
Po-0.002
 Gig-0.002
 drowned-0.002
 peanut-0.002
 pits-0.002
Token a
Token ID257
Feature activation-2.92e-01

Pos logit contributions
run+0.007
 swift+0.006
iddled+0.006
 of+0.006
bars+0.006
Neg logit contributions
cious-0.008
Po-0.008
act-0.008
Cr-0.007
 drowned-0.007
Token brilliant
Token ID10457
Feature activation+1.09e+00

Pos logit contributions
Po+0.005
ork+0.005
 gag+0.005
cious+0.005
act+0.005
Neg logit contributions
run-0.007
iddled-0.007
 Step-0.006
bars-0.006
 swift-0.006
Token idea
Token ID2126
Feature activation+2.47e+00

Pos logit contributions
run+0.028
bars+0.025
 Step+0.024
iddled+0.024
 swift+0.024
Neg logit contributions
ork-0.018
 peanut-0.018
 gag-0.018
 marriage-0.018
Po-0.017
Token.
Token ID13
Feature activation-2.76e-01

Pos logit contributions
 swift+0.063
 older+0.062
run+0.062
iddled+0.060
bars+0.060
Neg logit contributions
cious-0.037
ork-0.037
uts-0.037
Po-0.037
mar-0.034
Token She
Token ID1375
Feature activation-4.92e-01

Pos logit contributions
Po+0.004
S+0.004
 Gig+0.003
G+0.003
uts+0.003
Neg logit contributions
 swift-0.007
run-0.007
 courage-0.007
 older-0.007
bars-0.007
Token said
Token ID531
Feature activation-3.70e-01

Pos logit contributions
Po+0.010
cious+0.009
Cr+0.009
G+0.009
 drowned+0.009
Neg logit contributions
run-0.011
 swift-0.010
bars-0.010
iddled-0.010
 Faster-0.010
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
cious+0.009
Po+0.009
act+0.009
 drowned+0.008
G+0.008
Neg logit contributions
run-0.007
 of-0.006
 swift-0.006
 older-0.006
iddled-0.006
Token "
Token ID366
Feature activation-1.30e+00

Pos logit contributions
 swift+0.002
run+0.002
 courage+0.002
iddled+0.002
bars+0.002
Neg logit contributions
Po-0.002
cious-0.002
G-0.002
Cr-0.002
 Gig-0.002
Token was
Token ID373
Feature activation-1.93e-01

Pos logit contributions
Po+0.018
mary+0.017
 drowned+0.017
act+0.017
cious+0.017
Neg logit contributions
run-0.027
 swift-0.026
 pleasure-0.024
ay-0.024
 welcome-0.024
Token so
Token ID523
Feature activation-8.06e-01

Pos logit contributions
cious+0.002
 gag+0.002
 drowned+0.002
mary+0.002
 eaten+0.002
Neg logit contributions
bars-0.005
run-0.005
mith-0.005
ay-0.005
 Step-0.005
Token upset
Token ID9247
Feature activation-2.41e-01

Pos logit contributions
cious+0.016
S+0.015
Po+0.015
act+0.015
 gag+0.015
Neg logit contributions
run-0.017
box-0.016
bars-0.016
iddled-0.016
uit-0.016
Token and
Token ID290
Feature activation-3.68e-01

Pos logit contributions
Po+0.004
act+0.004
ork+0.003
cious+0.003
uts+0.003
Neg logit contributions
run-0.006
 swift-0.006
iddled-0.006
 pleasure-0.006
 older-0.006
Token cried
Token ID16896
Feature activation-9.12e-02

Pos logit contributions
 drowned+0.006
Po+0.006
cious+0.006
 gag+0.006
act+0.005
Neg logit contributions
run-0.009
 swift-0.009
iddled-0.008
 pleasure-0.008
 of-0.008
Token all
Token ID477
Feature activation+2.37e+00

Pos logit contributions
Po+0.002
cious+0.001
 gag+0.001
ork+0.001
 drowned+0.001
Neg logit contributions
run-0.002
bars-0.002
 swift-0.002
iddled-0.002
box-0.002
Token the
Token ID262
Feature activation-3.72e-01

Pos logit contributions
box+0.061
 pleasure+0.059
iddled+0.058
run+0.058
 swift+0.058
Neg logit contributions
 drowned-0.029
 pits-0.028
 photos-0.028
 Gig-0.028
S-0.027
Token way
Token ID835
Feature activation+1.15e+00

Pos logit contributions
Po+0.006
 drowned+0.006
 sting+0.006
act+0.006
cious+0.006
Neg logit contributions
run-0.009
iddled-0.009
 Step-0.008
 swift-0.008
 Faster-0.008
Token home
Token ID1363
Feature activation-1.18e+00

Pos logit contributions
iddled+0.032
 swift+0.032
 fascination+0.031
 pleasure+0.030
 Faster+0.030
Neg logit contributions
ork-0.013
 drowned-0.013
 Woo-0.011
cious-0.011
 Gig-0.011
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
ork+0.018
 drowned+0.017
cious+0.017
uts+0.016
Po+0.016
Neg logit contributions
 swift-0.030
 Faster-0.029
iddled-0.029
 fascination-0.029
 pleasure-0.029
Token Her
Token ID2332
Feature activation-7.26e-01

Pos logit contributions
run+0.006
 swift+0.006
iddled+0.006
bars+0.006
 of+0.006
Neg logit contributions
Po-0.005
cious-0.005
G-0.005
ork-0.005
Pear-0.005
Token all
Token ID477
Feature activation+2.21e+00

Pos logit contributions
Po+0.016
cious+0.016
act+0.016
 gag+0.015
G+0.015
Neg logit contributions
run-0.009
 of-0.009
 swift-0.008
 courage-0.008
 older-0.008
Token day
Token ID1110
Feature activation-1.40e+00

Pos logit contributions
run+0.043
box+0.040
 older+0.039
iddled+0.039
 swift+0.039
Neg logit contributions
cious-0.046
 drowned-0.046
S-0.045
G-0.045
Cr-0.044
Token playing
Token ID2712
Feature activation+1.43e+00

Pos logit contributions
Po+0.037
act+0.037
cious+0.037
 drowned+0.035
Pear+0.035
Neg logit contributions
run-0.021
 of-0.019
 swift-0.018
 older-0.018
iddled-0.018
Token with
Token ID351
Feature activation+2.02e-01

Pos logit contributions
run+0.033
bars+0.032
 swift+0.029
 pleasure+0.027
iddled+0.027
Neg logit contributions
ork-0.027
Po-0.027
Cr-0.026
act-0.026
cious-0.026
Token his
Token ID465
Feature activation+5.22e-01

Pos logit contributions
run+0.005
bars+0.005
iddled+0.004
 swift+0.004
 Step+0.004
Neg logit contributions
Po-0.003
 drowned-0.003
 peanut-0.003
Cr-0.003
ork-0.003
Token new
Token ID649
Feature activation+8.96e-01

Pos logit contributions
run+0.012
iddled+0.011
bars+0.011
mith+0.010
 of+0.010
Neg logit contributions
Po-0.010
actus-0.010
cious-0.010
ork-0.010
act-0.010
Token chess
Token ID19780
Feature activation-7.56e-01

Pos logit contributions
run+0.024
bars+0.022
iddled+0.022
rol+0.020
mith+0.020
Neg logit contributions
ork-0.013
actus-0.012
 shrimp-0.012
 peanut-0.012
Po-0.012
Token set
Token ID900
Feature activation+7.24e-02

Pos logit contributions
ork+0.014
act+0.013
 bank+0.013
Po+0.013
cious+0.012
Neg logit contributions
run-0.019
 swift-0.017
iddled-0.016
 pleasure-0.016
bars-0.016
Token.
Token ID13
Feature activation-8.53e-01

Pos logit contributions
run+0.002
iddled+0.002
 swift+0.002
 Step+0.002
bars+0.002
Neg logit contributions
ork-0.001
uts-0.001
cious-0.001
aults-0.001
 bank-0.001
Token He
Token ID679
Feature activation-1.51e-01

Pos logit contributions
ork+0.016
cious+0.016
Po+0.016
G+0.015
Cr+0.015
Neg logit contributions
run-0.019
 swift-0.017
iddled-0.017
bars-0.017
 older-0.017
Token played
Token ID2826
Feature activation+6.42e-01

Pos logit contributions
 drowned+0.002
cious+0.002
act+0.002
Po+0.002
Cr+0.002
Neg logit contributions
run-0.004
 swift-0.004
iddled-0.004
bars-0.004
 of-0.004
Token her
Token ID607
Feature activation+1.02e+00

Pos logit contributions
Po+0.008
cious+0.007
ork+0.007
 drowned+0.006
 harmful+0.006
Neg logit contributions
bars-0.011
run-0.011
iddled-0.011
 swift-0.011
 pleasure-0.010
Token mom
Token ID1995
Feature activation-2.19e-01

Pos logit contributions
iddled+0.028
run+0.028
bars+0.026
 Faster+0.025
 Step+0.024
Neg logit contributions
ork-0.016
cious-0.016
Po-0.016
 sting-0.016
act-0.015
Token had
Token ID550
Feature activation+3.72e-01

Pos logit contributions
Po+0.003
ache+0.003
 drowned+0.003
cious+0.003
mar+0.003
Neg logit contributions
 Faster-0.006
 pleasure-0.006
run-0.006
 swift-0.006
iddled-0.006
Token taught
Token ID7817
Feature activation+1.03e+00

Pos logit contributions
run+0.009
 swift+0.008
bars+0.008
 Faster+0.008
 older+0.008
Neg logit contributions
Po-0.007
cious-0.006
 drowned-0.006
act-0.006
G-0.006
Token her
Token ID607
Feature activation+4.09e-01

Pos logit contributions
run+0.029
 swift+0.028
bars+0.028
iddled+0.027
 pleasure+0.027
Neg logit contributions
Po-0.013
ork-0.013
S-0.012
act-0.012
Pear-0.011
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
run+0.009
 swift+0.008
bars+0.008
iddled+0.008
 Step+0.008
Neg logit contributions
cious-0.008
ork-0.008
Po-0.008
 gag-0.007
act-0.007
Token quickly
Token ID2952
Feature activation+6.40e-01

Pos logit contributions
run+0.028
 swift+0.027
bars+0.026
ay+0.026
 Faster+0.026
Neg logit contributions
Po-0.018
 drowned-0.018
cious-0.016
 harmful-0.016
 pulp-0.016
Token turned
Token ID2900
Feature activation+1.17e-01

Pos logit contributions
run+0.014
 swift+0.013
bars+0.013
iddled+0.013
 Faster+0.013
Neg logit contributions
Po-0.012
cious-0.012
 drowned-0.012
act-0.011
ork-0.011
Token off
Token ID572
Feature activation-7.77e-01

Pos logit contributions
run+0.002
bars+0.002
 swift+0.002
iddled+0.002
box+0.002
Neg logit contributions
Po-0.002
act-0.002
 drowned-0.002
ork-0.002
 pits-0.002
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
Po+0.018
cious+0.017
act+0.017
ork+0.017
 drowned+0.017
Neg logit contributions
run-0.015
 swift-0.013
 of-0.013
iddled-0.013
bars-0.013
Token oven
Token ID14361
Feature activation-1.22e+00

Pos logit contributions
ork+0.005
 pits+0.005
 pulp+0.004
 sting+0.004
Po+0.004
Neg logit contributions
run-0.006
iddled-0.006
bars-0.005
 Jake-0.005
 swift-0.005
Token in
Token ID287
Feature activation-3.05e-01

Pos logit contributions
run+0.009
iddled+0.008
 fascination+0.008
 swift+0.008
 Jake+0.008
Neg logit contributions
Po-0.006
cious-0.006
ork-0.006
act-0.006
 drowned-0.006
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
Po+0.006
 drowned+0.006
Cr+0.005
act+0.005
S+0.005
Neg logit contributions
run-0.007
iddled-0.006
bars-0.006
 swift-0.006
box-0.006
Token air
Token ID1633
Feature activation-1.09e+00

Pos logit contributions
act+0.003
Po+0.003
 drowned+0.003
ork+0.003
cious+0.003
Neg logit contributions
run-0.003
iddled-0.002
 of-0.002
 Jake-0.002
 swift-0.002
Token."
Token ID526
Feature activation-6.11e-02

Pos logit contributions
cious+0.016
Po+0.016
act+0.016
 drowned+0.016
Cr+0.016
Neg logit contributions
run-0.029
iddled-0.027
 swift-0.026
bars-0.025
 of-0.025
Token
Token ID198
Feature activation+7.59e-01

Pos logit contributions
Po+0.001
G+0.001
cious+0.001
act+0.001
S+0.001
Neg logit contributions
run-0.001
iddled-0.001
 swift-0.001
bars-0.001
 of-0.001
Token
Token ID198
Feature activation-8.19e-02

Pos logit contributions
run+0.018
bars+0.017
iddled+0.017
 swift+0.017
box+0.016
Neg logit contributions
Po-0.013
G-0.013
ork-0.013
act-0.013
S-0.013
Token"
Token ID1
Feature activation-8.63e-01

Pos logit contributions
cious+0.002
act+0.002
Po+0.002
 drowned+0.002
 gag+0.002
Neg logit contributions
run-0.002
 of-0.002
 swift-0.002
iddled-0.002
 older-0.002
TokenOK
Token ID11380
Feature activation+1.44e+00

Pos logit contributions
cious+0.018
Po+0.018
act+0.018
 drowned+0.018
ork+0.017
Neg logit contributions
run-0.017
 of-0.016
 swift-0.016
iddled-0.015
 older-0.015
Token,"
Token ID553
Feature activation+2.88e-01

Pos logit contributions
run+0.038
iddled+0.036
bars+0.035
 swift+0.035
box+0.032
Neg logit contributions
Po-0.022
cious-0.022
S-0.021
 drowned-0.020
Cr-0.020
Token Sam
Token ID3409
Feature activation+1.46e+00

Pos logit contributions
run+0.008
iddled+0.007
box+0.007
 swift+0.006
 of+0.006
Neg logit contributions
Po-0.005
 Gig-0.005
Cr-0.005
Pear-0.005
G-0.005
Token says
Token ID1139
Feature activation+6.99e-01

Pos logit contributions
run+0.039
 swift+0.033
 pleasure+0.033
iddled+0.033
box+0.033
Neg logit contributions
act-0.022
S-0.022
Po-0.022
Cr-0.021
 pige-0.020
Token 1
Token ID352
Feature activation+1.57e+00

Pos logit contributions
Po+0.020
cious+0.019
act+0.019
G+0.019
Cr+0.018
Neg logit contributions
run-0.033
 of-0.031
 swift-0.031
 older-0.031
iddled-0.031
Token,
Token ID11
Feature activation+2.74e-01

Pos logit contributions
iddled+0.039
 fascination+0.037
run+0.037
 swift+0.036
 pleasure+0.035
Neg logit contributions
cious-0.024
S-0.023
Po-0.023
Cr-0.022
mar-0.022
Token 2
Token ID362
Feature activation+4.48e-01

Pos logit contributions
iddled+0.006
run+0.006
 swift+0.005
box+0.005
 fascination+0.005
Neg logit contributions
Po-0.005
cious-0.005
Cr-0.005
G-0.005
S-0.005
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
run+0.011
iddled+0.010
 swift+0.010
 pleasure+0.010
 fascination+0.010
Neg logit contributions
cious-0.008
Po-0.008
S-0.007
act-0.007
Cr-0.007
Token 3
Token ID513
Feature activation+2.63e-01

Pos logit contributions
iddled+0.000
run+0.000
 swift+0.000
box+0.000
 fascination+0.000
Neg logit contributions
Po-0.000
cious-0.000
Cr-0.000
G-0.000
S-0.000
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
run+0.006
iddled+0.006
bars+0.006
 swift+0.006
 Step+0.006
Neg logit contributions
cious-0.005
Po-0.004
act-0.004
S-0.004
Cr-0.004
Token 4
Token ID604
Feature activation-1.45e-01

Pos logit contributions
iddled+0.005
run+0.004
 swift+0.004
box+0.004
icking+0.004
Neg logit contributions
Po-0.004
cious-0.004
Cr-0.004
G-0.004
Pear-0.004
Token and
Token ID290
Feature activation+2.84e-01

Pos logit contributions
cious+0.003
Po+0.003
act+0.002
S+0.002
Cr+0.002
Neg logit contributions
run-0.003
iddled-0.003
 swift-0.003
 pleasure-0.003
box-0.003
Token so
Token ID523
Feature activation-1.67e-01

Pos logit contributions
iddled+0.006
run+0.006
icking+0.006
uster+0.006
box+0.005
Neg logit contributions
cious-0.005
Po-0.005
G-0.005
Cr-0.005
S-0.005
Token on
Token ID319
Feature activation-5.45e-01

Pos logit contributions
cious+0.003
Po+0.003
G+0.003
S+0.003
Cr+0.003
Neg logit contributions
run-0.004
iddled-0.004
box-0.003
 swift-0.003
uster-0.003
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
cious+0.010
Po+0.009
ork+0.009
S+0.009
act+0.009
Neg logit contributions
run-0.013
iddled-0.012
 swift-0.011
bars-0.011
 older-0.011
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
run+0.001
 swift+0.001
iddled+0.001
 of+0.001
bars+0.001
Neg logit contributions
cious-0.001
Po-0.001
 drowned-0.001
act-0.001
 gag-0.001
Token smile
Token ID8212
Feature activation-1.56e+00

Pos logit contributions
run+0.028
 of+0.025
iddled+0.025
 swift+0.024
box+0.024
Neg logit contributions
Po-0.020
 gag-0.019
 drowned-0.019
Pear-0.019
cious-0.018
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
cious+0.035
Po+0.035
act+0.034
 drowned+0.033
G+0.033
Neg logit contributions
run-0.029
 of-0.027
 swift-0.026
iddled-0.026
 older-0.025
Token even
Token ID772
Feature activation+9.83e-01

Pos logit contributions
run+0.034
iddled+0.030
 swift+0.030
bars+0.029
box+0.029
Neg logit contributions
cious-0.033
Po-0.032
 drowned-0.032
 gag-0.031
Cr-0.030
Token laughed
Token ID13818
Feature activation+2.75e-01

Pos logit contributions
run+0.022
iddled+0.021
box+0.021
bars+0.021
 swift+0.020
Neg logit contributions
Po-0.017
 drowned-0.017
Cr-0.017
cious-0.016
 harmful-0.016
Token a
Token ID257
Feature activation+9.86e-01

Pos logit contributions
run+0.006
bars+0.006
 swift+0.006
iddled+0.006
box+0.006
Neg logit contributions
Po-0.005
cious-0.005
Cr-0.005
act-0.005
G-0.005
Token little
Token ID1310
Feature activation+8.94e-01

Pos logit contributions
 of+0.023
run+0.022
 swift+0.022
 pleasure+0.021
 welcome+0.021
Neg logit contributions
act-0.018
cious-0.018
Po-0.018
 drowned-0.017
 pits-0.017
Token.
Token ID13
Feature activation-6.96e-02

Pos logit contributions
run+0.020
iddled+0.018
 of+0.018
 swift+0.018
bars+0.018
Neg logit contributions
cious-0.017
Po-0.017
act-0.017
 drowned-0.016
Cr-0.016
Token Tim
Token ID5045
Feature activation+2.92e-01

Pos logit contributions
Po+0.001
cious+0.001
 Gig+0.001
Cr+0.001
act+0.001
Neg logit contributions
run-0.001
 swift-0.001
iddled-0.001
box-0.001
 of-0.001
Tokenmy
Token ID1820
Feature activation-2.51e-01

Pos logit contributions
run+0.003
 of+0.003
 swift+0.003
iddled+0.003
bars+0.002
Neg logit contributions
act-0.009
Po-0.009
cious-0.009
 drowned-0.009
Pear-0.009
Token's
Token ID338
Feature activation-4.21e-01

Pos logit contributions
Po+0.005
cious+0.005
act+0.005
G+0.005
 drowned+0.005
Neg logit contributions
run-0.005
 of-0.005
 swift-0.005
 older-0.005
iddled-0.005
Token Anna
Token ID11735
Feature activation-1.74e+00

Pos logit contributions
Po+0.012
cious+0.012
Cr+0.012
Pear+0.012
 drowned+0.011
Neg logit contributions
run-0.016
iddled-0.015
 swift-0.015
bars-0.014
box-0.014
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
Po+0.035
cious+0.034
act+0.034
 drowned+0.034
Cr+0.032
Neg logit contributions
run-0.038
 swift-0.034
iddled-0.034
 of-0.033
bars-0.033
Token forgive
Token ID20927
Feature activation-1.39e+00

Pos logit contributions
Po+0.016
 drowned+0.015
cious+0.015
Pear+0.015
S+0.015
Neg logit contributions
iddled-0.028
run-0.027
 swift-0.026
bars-0.025
box-0.024
Token Ben
Token ID3932
Feature activation-1.19e+00

Pos logit contributions
cious+0.034
Po+0.034
act+0.033
 drowned+0.033
ork+0.032
Neg logit contributions
run-0.023
 of-0.022
 older-0.021
 swift-0.020
 Jake-0.020
Token and
Token ID290
Feature activation-5.66e-01

Pos logit contributions
Po+0.023
cious+0.023
 drowned+0.023
act+0.022
Cr+0.022
Neg logit contributions
run-0.026
iddled-0.024
 swift-0.024
bars-0.023
 of-0.022
Token let
Token ID1309
Feature activation-6.11e-01

Pos logit contributions
Po+0.011
 drowned+0.011
 gag+0.010
cious+0.010
 sting+0.010
Neg logit contributions
run-0.012
iddled-0.012
 swift-0.011
box-0.011
bars-0.011
Token him
Token ID683
Feature activation+8.72e-01

Pos logit contributions
Po+0.013
 drowned+0.013
act+0.013
cious+0.012
ork+0.012
Neg logit contributions
run-0.013
 swift-0.012
iddled-0.012
bars-0.011
 of-0.011
Token play
Token ID711
Feature activation+3.78e-01

Pos logit contributions
run+0.021
iddled+0.021
 swift+0.020
box+0.020
bars+0.020
Neg logit contributions
 drowned-0.014
Po-0.014
act-0.014
ork-0.013
 die-0.013
Token with
Token ID351
Feature activation-4.02e-03

Pos logit contributions
bars+0.009
run+0.009
 swift+0.008
 older+0.008
iddled+0.008
Neg logit contributions
ork-0.007
Po-0.007
S-0.007
Cr-0.007
 drowned-0.006
Token your
Token ID534
Feature activation-4.15e-01

Pos logit contributions
Po+0.000
act+0.000
cious+0.000
ork+0.000
 drowned+0.000
Neg logit contributions
run-0.000
iddled-0.000
 swift-0.000
bars-0.000
 of-0.000
Token car
Token ID1097
Feature activation+6.31e-01

Pos logit contributions
 drowned+0.007
 bank+0.007
ork+0.007
 pits+0.007
act+0.006
Neg logit contributions
iddled-0.010
run-0.010
 swift-0.010
 Step-0.009
 Maria-0.009
Token She
Token ID1375
Feature activation-1.25e-01

Pos logit contributions
 swift+0.006
run+0.006
box+0.006
iddled+0.006
 courage+0.006
Neg logit contributions
Po-0.004
cious-0.004
ork-0.003
 Gig-0.003
mary-0.003
Token put
Token ID1234
Feature activation+2.75e-01

Pos logit contributions
 forgot+0.002
 drowned+0.002
G+0.002
 realized+0.002
Po+0.002
Neg logit contributions
 Faster-0.003
 Step-0.003
 welcome-0.003
 pleasure-0.003
run-0.003
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
run+0.008
 younger+0.008
illed+0.008
bars+0.008
old+0.008
Neg logit contributions
 Woo-0.003
 Pink-0.003
S-0.003
Po-0.003
 peanut-0.002
Token newspaper
Token ID7533
Feature activation-1.61e-01

Pos logit contributions
run+0.007
iddled+0.006
ug+0.005
bars+0.005
 Liz+0.005
Neg logit contributions
 pigeon-0.003
cious-0.003
 photos-0.003
 pige-0.003
actus-0.002
Token back
Token ID736
Feature activation-1.38e+00

Pos logit contributions
ork+0.002
mar+0.002
Po+0.002
aults+0.002
cious+0.002
Neg logit contributions
run-0.004
old-0.004
 pleasure-0.004
 Jake-0.004
iddled-0.004
Token on
Token ID319
Feature activation-1.21e+00

Pos logit contributions
mar+0.017
act+0.016
ork+0.015
S+0.015
Po+0.014
Neg logit contributions
 pleasure-0.035
 fascination-0.035
iddled-0.035
 swift-0.034
 younger-0.034
Token the
Token ID262
Feature activation-9.53e-01

Pos logit contributions
ork+0.022
Po+0.022
G+0.020
S+0.020
Pear+0.020
Neg logit contributions
run-0.030
iddled-0.027
bars-0.027
 swift-0.026
 older-0.024
Token floor
Token ID4314
Feature activation-1.13e+00

Pos logit contributions
cious+0.014
 bank+0.014
actus+0.013
ork+0.013
 drowned+0.013
Neg logit contributions
run-0.029
iddled-0.027
 swift-0.024
ver-0.024
 of-0.023
Token and
Token ID290
Feature activation+5.99e-01

Pos logit contributions
cious+0.013
ork+0.013
mary+0.012
uts+0.012
 bank+0.012
Neg logit contributions
run-0.030
 swift-0.029
iddled-0.028
 Faster-0.028
 Up-0.027
Token said
Token ID531
Feature activation-7.57e-01

Pos logit contributions
run+0.016
 swift+0.015
ay+0.014
 pleasure+0.014
 welcome+0.014
Neg logit contributions
 drowned-0.009
cious-0.008
 forgot-0.008
Po-0.008
mar-0.007
Token goodbye
Token ID24829
Feature activation-1.84e+00

Pos logit contributions
cious+0.015
Po+0.015
ork+0.014
S+0.014
Cr+0.014
Neg logit contributions
run-0.016
 swift-0.016
bars-0.015
 older-0.015
iddled-0.015
Token,
Token ID11
Feature activation-2.21e-01

Pos logit contributions
run+0.001
iddled+0.001
 swift+0.001
bars+0.001
 of+0.001
Neg logit contributions
Po-0.001
cious-0.001
act-0.001
Pear-0.001
 drowned-0.001
Token the
Token ID262
Feature activation-9.71e-02

Pos logit contributions
Po+0.004
 pulp+0.004
Pear+0.004
 sting+0.004
Cr+0.004
Neg logit contributions
run-0.005
iddled-0.005
box-0.005
ay-0.004
 swift-0.004
Token tail
Token ID7894
Feature activation+9.86e-02

Pos logit contributions
 pulp+0.002
Po+0.002
 pige+0.002
 sting+0.002
 pits+0.002
Neg logit contributions
run-0.003
iddled-0.002
 Step-0.002
old-0.002
 swift-0.002
Token,
Token ID11
Feature activation-4.18e-01

Pos logit contributions
run+0.002
iddled+0.002
 swift+0.002
 pleasure+0.002
 Jake+0.002
Neg logit contributions
Po-0.002
cious-0.002
ork-0.002
Pear-0.002
 sting-0.002
Token the
Token ID262
Feature activation-8.74e-03

Pos logit contributions
Po+0.007
Pear+0.007
 pulp+0.007
mary+0.007
Cr+0.007
Neg logit contributions
run-0.010
iddled-0.009
 pleasure-0.009
 swift-0.009
bars-0.009
Token nose
Token ID9686
Feature activation-2.87e-01

Pos logit contributions
 pulp+0.000
Po+0.000
 sting+0.000
 pige+0.000
 pits+0.000
Neg logit contributions
run-0.000
iddled-0.000
old-0.000
 Step-0.000
 of-0.000
Token,
Token ID11
Feature activation-5.12e-01

Pos logit contributions
Po+0.005
cious+0.005
ork+0.005
act+0.005
Pear+0.005
Neg logit contributions
run-0.006
iddled-0.006
 swift-0.006
 courage-0.006
 pleasure-0.006
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
Po+0.009
Pear+0.009
 pulp+0.009
cious+0.009
mary+0.009
Neg logit contributions
run-0.012
iddled-0.011
 swift-0.011
 pleasure-0.011
bars-0.011
Token prope
Token ID44408
Feature activation-7.99e-01

Pos logit contributions
run+0.004
iddled+0.004
old+0.004
 Step+0.003
 of+0.003
Neg logit contributions
 pulp-0.003
 sting-0.003
Po-0.003
 pige-0.003
 pits-0.003
Tokenller
Token ID6051
Feature activation-1.10e-01

Pos logit contributions
Po+0.011
cious+0.011
act+0.011
 gag+0.011
G+0.011
Neg logit contributions
run-0.022
bars-0.021
 of-0.020
 older-0.020
 swift-0.020
Token.
Token ID13
Feature activation-7.18e-01

Pos logit contributions
cious+0.002
Po+0.002
 drowned+0.002
act+0.002
 sting+0.002
Neg logit contributions
run-0.002
iddled-0.002
bars-0.002
 swift-0.002
 older-0.002
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
 drowned+0.008
Po+0.008
 gag+0.008
Cr+0.008
S+0.008
Neg logit contributions
run-0.013
iddled-0.012
bars-0.012
box-0.012
ay-0.011
Token ladder
Token ID18002
Feature activation-8.04e-03

Pos logit contributions
Po+0.000
 pits+0.000
act+0.000
 drowned+0.000
S+0.000
Neg logit contributions
run-0.001
iddled-0.001
 Jake-0.001
 swift-0.001
 of-0.001
Token again
Token ID757
Feature activation-2.20e+00

Pos logit contributions
 bank+0.000
ork+0.000
S+0.000
 pits+0.000
mary+0.000
Neg logit contributions
 pleasure-0.000
iddled-0.000
run-0.000
uster-0.000
 swift-0.000
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
Po+0.033
S+0.033
G+0.032
cious+0.032
ork+0.030
Neg logit contributions
iddled-0.058
run-0.057
box-0.052
bars-0.052
 pleasure-0.051
Token
Token ID198
Feature activation+6.35e-01

Pos logit contributions
run+0.007
bars+0.007
iddled+0.007
 swift+0.006
 courage+0.006
Neg logit contributions
Po-0.007
cious-0.006
act-0.006
ork-0.006
G-0.006
Token
Token ID198
Feature activation-3.35e-01

Pos logit contributions
run+0.014
bars+0.014
iddled+0.014
 swift+0.013
 older+0.013
Neg logit contributions
Po-0.012
G-0.011
ork-0.011
S-0.011
act-0.011
Token"
Token ID1
Feature activation-1.03e+00

Pos logit contributions
cious+0.006
act+0.006
Po+0.006
 drowned+0.006
 gag+0.006
Neg logit contributions
run-0.007
 of-0.007
 swift-0.007
 older-0.007
iddled-0.007
TokenPlease
Token ID5492
Feature activation-7.33e-01

Pos logit contributions
Po+0.017
cious+0.016
G+0.016
Cr+0.016
act+0.016
Neg logit contributions
run-0.027
 of-0.025
 older-0.024
 swift-0.024
iddled-0.024
Token,
Token ID11
Feature activation+9.36e-01

Pos logit contributions
Po+0.012
 drowned+0.012
 sting+0.012
ork+0.012
act+0.012
Neg logit contributions
run-0.018
iddled-0.017
bars-0.017
 swift-0.016
box-0.016
Token Lily
Token ID20037
Feature activation-4.92e-01

Pos logit contributions
run+0.025
bars+0.025
iddled+0.024
 swift+0.022
hole+0.022
Neg logit contributions
Po-0.014
Cr-0.014
S-0.014
 drowned-0.014
 sting-0.014
Token,
Token ID11
Feature activation+9.16e-01

Pos logit contributions
 drowned+0.007
Po+0.007
cious+0.007
S+0.007
 Gig+0.007
Neg logit contributions
run-0.013
iddled-0.013
bars-0.012
 swift-0.012
 Faster-0.012
Token the
Token ID262
Feature activation-4.22e-01

Pos logit contributions
 drowned+0.003
Po+0.003
 pits+0.003
 harmful+0.003
 die+0.003
Neg logit contributions
iddled-0.004
run-0.004
bars-0.004
ay-0.004
 swift-0.004
Token house
Token ID2156
Feature activation-8.19e-01

Pos logit contributions
 pits+0.007
 peanut+0.007
 sting+0.007
 harmful+0.007
 pulp+0.007
Neg logit contributions
iddled-0.012
run-0.012
 Step-0.010
bars-0.010
 Faster-0.010
Token next
Token ID1306
Feature activation+6.30e-01

Pos logit contributions
 drowned+0.013
ork+0.012
Po+0.012
 harmful+0.012
ache+0.011
Neg logit contributions
iddled-0.021
run-0.021
 swift-0.020
 Faster-0.019
 fascination-0.019
Token door
Token ID3420
Feature activation-1.26e+00

Pos logit contributions
iddled+0.016
run+0.016
bars+0.015
 swift+0.015
 Faster+0.015
Neg logit contributions
ork-0.010
Po-0.009
 drowned-0.009
mar-0.009
cious-0.009
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
ork+0.017
 drowned+0.016
Po+0.016
uts+0.015
cious+0.015
Neg logit contributions
iddled-0.034
 swift-0.033
 fascination-0.032
 Faster-0.032
run-0.032
Token They
Token ID1119
Feature activation-1.16e+00

Pos logit contributions
Po+0.003
 Gig+0.003
cious+0.003
G+0.003
S+0.003
Neg logit contributions
iddled-0.005
run-0.005
 swift-0.005
 courage-0.005
box-0.005
Token are
Token ID389
Feature activation-2.51e-01

Pos logit contributions
Po+0.022
cious+0.022
act+0.022
 drowned+0.021
G+0.020
Neg logit contributions
run-0.026
 of-0.024
 swift-0.024
iddled-0.023
 older-0.023
Token glad
Token ID9675
Feature activation-9.96e-01

Pos logit contributions
 drowned+0.004
 harmful+0.004
Po+0.004
cious+0.004
 gag+0.004
Neg logit contributions
run-0.006
bars-0.006
iddled-0.006
 Faster-0.006
ay-0.006
Token they
Token ID484
Feature activation-9.87e-01

Pos logit contributions
Po+0.015
cious+0.015
act+0.015
 drowned+0.015
Cr+0.014
Neg logit contributions
run-0.026
iddled-0.024
 swift-0.024
bars-0.023
 of-0.023
Token are
Token ID389
Feature activation-7.83e-01

Pos logit contributions
cious+0.021
Po+0.021
act+0.021
ork+0.019
G+0.019
Neg logit contributions
run-0.020
 of-0.019
 swift-0.018
iddled-0.018
 older-0.018
Token safe
Token ID3338
Feature activation-1.82e+00

Pos logit contributions
 drowned+0.016
Po+0.015
cious+0.015
 harmful+0.015
Cr+0.014
Neg logit contributions
run-0.018
bars-0.017
iddled-0.016
 Faster-0.016
 swift-0.015
Token.
Token ID13
Feature activation+7.07e-02

Pos logit contributions
Po+0.009
cious+0.009
act+0.009
G+0.009
S+0.009
Neg logit contributions
run-0.015
iddled-0.014
 swift-0.014
bars-0.014
 older-0.014
Token\n
Token ID198
Feature activation+6.11e-01

Pos logit contributions
run+0.002
 swift+0.001
 of+0.001
iddled+0.001
 older+0.001
Neg logit contributions
Po-0.001
cious-0.001
act-0.001
G-0.001
Pear-0.001
Token\n
Token ID198
Feature activation-3.18e-01

Pos logit contributions
run+0.013
bars+0.012
 swift+0.012
 older+0.012
iddled+0.012
Neg logit contributions
Po-0.012
G-0.012
cious-0.011
act-0.011
ork-0.011
TokenShe
Token ID3347
Feature activation-6.40e-01

Pos logit contributions
cious+0.005
act+0.005
Po+0.005
 drowned+0.005
 gag+0.005
Neg logit contributions
run-0.008
 of-0.008
 swift-0.008
iddled-0.007
 older-0.007
Token went
Token ID1816
Feature activation-2.93e-01

Pos logit contributions
Po+0.012
cious+0.012
 drowned+0.012
act+0.012
G+0.012
Neg logit contributions
run-0.015
 swift-0.013
iddled-0.013
 of-0.013
bars-0.013
Token back
Token ID736
Feature activation-2.74e+00

Pos logit contributions
Po+0.006
cious+0.006
act+0.006
 drowned+0.006
ork+0.006
Neg logit contributions
run-0.006
 of-0.005
 swift-0.005
iddled-0.005
bars-0.005
Token to
Token ID284
Feature activation-8.72e-01

Pos logit contributions
act+0.050
Po+0.050
cious+0.050
 drowned+0.048
ork+0.048
Neg logit contributions
run-0.062
iddled-0.058
 swift-0.057
bars-0.057
 of-0.056
Token the
Token ID262
Feature activation-5.66e-01

Pos logit contributions
 drowned+0.015
Po+0.015
cious+0.015
 gag+0.015
 Gig+0.015
Neg logit contributions
run-0.020
bars-0.019
iddled-0.019
 swift-0.019
 of-0.018
Token classroom
Token ID15806
Feature activation-1.91e+00

Pos logit contributions
actus+0.010
Po+0.010
ork+0.010
act+0.010
cious+0.010
Neg logit contributions
run-0.015
iddled-0.014
bars-0.012
 swift-0.012
 of-0.012
Token with
Token ID351
Feature activation-5.54e-02

Pos logit contributions
ork+0.029
cious+0.029
uts+0.027
 bank+0.027
 drowned+0.025
Neg logit contributions
run-0.048
iddled-0.048
 swift-0.046
 pleasure-0.045
 welcome-0.043
Token the
Token ID262
Feature activation+8.71e-02

Pos logit contributions
 drowned+0.001
Po+0.001
 peanut+0.001
 gag+0.001
cious+0.001
Neg logit contributions
run-0.001
iddled-0.001
bars-0.001
 swift-0.001
box-0.001
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
 drowned+0.002
cious+0.002
Po+0.002
act+0.002
S+0.002
Neg logit contributions
iddled-0.004
 swift-0.004
run-0.004
bars-0.004
 fascination-0.004
Token You
Token ID921
Feature activation+3.43e-01

Pos logit contributions
cious+0.003
Po+0.003
 Gig+0.003
Cr+0.002
G+0.002
Neg logit contributions
iddled-0.003
run-0.003
 swift-0.003
box-0.003
 courage-0.003
Token need
Token ID761
Feature activation+1.74e-01

Pos logit contributions
run+0.007
iddled+0.007
 swift+0.007
bars+0.007
 older+0.006
Neg logit contributions
Po-0.007
 drowned-0.007
cious-0.007
act-0.007
ork-0.006
Token to
Token ID284
Feature activation-1.08e+00

Pos logit contributions
run+0.005
iddled+0.005
 swift+0.005
bars+0.005
ement+0.004
Neg logit contributions
 drowned-0.002
Po-0.002
ork-0.002
G-0.002
 pits-0.002
Token wash
Token ID13502
Feature activation-1.61e+00

Pos logit contributions
Po+0.023
cious+0.023
act+0.022
 drowned+0.022
G+0.022
Neg logit contributions
run-0.022
 swift-0.019
 of-0.019
iddled-0.019
bars-0.019
Token off
Token ID572
Feature activation-2.78e+00

Pos logit contributions
Po+0.026
act+0.026
cious+0.025
ork+0.024
 drowned+0.024
Neg logit contributions
run-0.041
 swift-0.038
iddled-0.037
bars-0.037
 older-0.036
Token the
Token ID262
Feature activation-7.91e-01

Pos logit contributions
Po+0.047
act+0.047
cious+0.047
ork+0.046
 drowned+0.045
Neg logit contributions
run-0.066
iddled-0.063
 swift-0.061
bars-0.059
box-0.058
Token sand
Token ID6450
Feature activation-1.19e+00

Pos logit contributions
act+0.012
 pits+0.012
 drowned+0.011
actus+0.011
 pulp+0.011
Neg logit contributions
run-0.022
iddled-0.021
 swift-0.019
ver-0.019
bars-0.019
Token and
Token ID290
Feature activation+1.74e-01

Pos logit contributions
cious+0.025
act+0.025
Po+0.024
ork+0.023
 drowned+0.023
Neg logit contributions
run-0.025
 swift-0.022
iddled-0.022
 of-0.022
 older-0.021
Token cool
Token ID3608
Feature activation-8.78e-01

Pos logit contributions
run+0.004
iddled+0.004
 swift+0.004
bars+0.004
 Faster+0.004
Neg logit contributions
 drowned-0.003
Po-0.003
 gag-0.003
 harmful-0.003
cious-0.003
Token down
Token ID866
Feature activation-3.98e-01

Pos logit contributions
act+0.016
Po+0.014
 drowned+0.014
cious+0.014
 pits+0.014
Neg logit contributions
run-0.021
 swift-0.020
iddled-0.019
 fascination-0.018
box-0.018
Token with
Token ID351
Feature activation+1.79e-01

Pos logit contributions
bars+0.039
run+0.037
 swift+0.032
 older+0.031
hole+0.031
Neg logit contributions
ork-0.019
 marriage-0.018
S-0.018
Cr-0.018
Pear-0.018
Token the
Token ID262
Feature activation+6.05e-01

Pos logit contributions
run+0.004
iddled+0.004
bars+0.004
 swift+0.004
box+0.004
Neg logit contributions
Po-0.003
act-0.003
cious-0.003
 drowned-0.003
Pear-0.003
Token funny
Token ID8258
Feature activation-1.33e+00

Pos logit contributions
run+0.017
iddled+0.016
 Step+0.014
 swift+0.014
bars+0.014
Neg logit contributions
 cand-0.010
 bank-0.009
 photos-0.009
 peanut-0.009
 pits-0.009
Token toys
Token ID14958
Feature activation+3.41e-01

Pos logit contributions
ork+0.023
cious+0.023
act+0.022
 cand+0.022
Po+0.021
Neg logit contributions
run-0.032
 swift-0.031
iddled-0.030
bars-0.029
 Step-0.029
Token all
Token ID477
Feature activation+2.38e+00

Pos logit contributions
run+0.009
iddled+0.009
 swift+0.008
 Step+0.008
 pleasure+0.008
Neg logit contributions
cious-0.005
 bank-0.005
mary-0.005
uts-0.005
ork-0.004
Token day
Token ID1110
Feature activation-2.78e+00

Pos logit contributions
run+0.055
iddled+0.052
box+0.050
 older+0.050
 swift+0.049
Neg logit contributions
mar-0.041
cious-0.040
act-0.040
S-0.038
mary-0.037
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
cious+0.052
Po+0.048
ork+0.047
act+0.047
G+0.046
Neg logit contributions
run-0.067
iddled-0.061
 swift-0.060
bars-0.059
 older-0.059
Token And
Token ID843
Feature activation-3.23e-01

Pos logit contributions
 Gig+0.004
Po+0.004
Pear+0.004
cious+0.004
G+0.004
Neg logit contributions
run-0.007
 swift-0.007
box-0.007
iddled-0.007
 older-0.007
Token every
Token ID790
Feature activation+4.90e-01

Pos logit contributions
cious+0.005
Po+0.005
 Gig+0.005
Pear+0.005
act+0.005
Neg logit contributions
run-0.008
iddled-0.008
 swift-0.008
bars-0.008
box-0.008
Token time
Token ID640
Feature activation-2.44e-01

Pos logit contributions
iddled+0.014
bars+0.013
 Step+0.012
run+0.012
box+0.012
Neg logit contributions
 drowned-0.006
 marriage-0.006
ork-0.006
S-0.006
cious-0.006
Token she
Token ID673
Feature activation-5.87e-01

Pos logit contributions
Po+0.003
cious+0.003
 drowned+0.003
ork+0.003
act+0.003
Neg logit contributions
run-0.007
iddled-0.007
 swift-0.007
box-0.006
bars-0.006
Token the
Token ID262
Feature activation-9.91e-02

Pos logit contributions
bars+0.008
 Step+0.008
 Faster+0.008
run+0.008
uster+0.008
Neg logit contributions
 peanut-0.004
 pits-0.003
 Gig-0.003
 harmful-0.003
Po-0.003
Token courage
Token ID11917
Feature activation-1.09e+00

Pos logit contributions
 peanut+0.002
 pulp+0.002
 pits+0.002
actus+0.002
 bank+0.002
Neg logit contributions
run-0.003
iddled-0.003
 Step-0.002
 Faster-0.002
bars-0.002
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
cious+0.016
 drowned+0.015
uts+0.014
ork+0.014
 bank+0.013
Neg logit contributions
iddled-0.028
run-0.027
 Faster-0.027
 swift-0.027
 older-0.027
Token ran
Token ID4966
Feature activation+4.50e-01

Pos logit contributions
run+0.032
bars+0.031
iddled+0.030
uster+0.029
ay+0.029
Neg logit contributions
 drowned-0.021
cious-0.021
Po-0.020
 bank-0.020
Cr-0.019
Token back
Token ID736
Feature activation-2.24e+00

Pos logit contributions
run+0.009
 of+0.008
 swift+0.008
bars+0.008
iddled+0.008
Neg logit contributions
cious-0.010
Po-0.010
act-0.009
 drowned-0.009
G-0.009
Token home
Token ID1363
Feature activation-3.05e+00

Pos logit contributions
cious+0.034
act+0.033
 drowned+0.032
ork+0.031
Po+0.031
Neg logit contributions
bars-0.055
iddled-0.054
run-0.054
 swift-0.053
box-0.052
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
cious+0.056
 drowned+0.054
ork+0.053
Po+0.052
uts+0.051
Neg logit contributions
run-0.068
iddled-0.068
bars-0.066
 swift-0.065
 Faster-0.063
Token She
Token ID1375
Feature activation-1.10e+00

Pos logit contributions
 Gig+0.002
G+0.002
cious+0.002
S+0.002
Po+0.002
Neg logit contributions
iddled-0.002
 swift-0.002
run-0.002
bars-0.002
 older-0.002
Token was
Token ID373
Feature activation+2.88e-01

Pos logit contributions
 drowned+0.019
Po+0.018
cious+0.018
Cr+0.017
G+0.017
Neg logit contributions
run-0.027
bars-0.026
 swift-0.026
iddled-0.025
ay-0.025
Token no
Token ID645
Feature activation+8.69e-01

Pos logit contributions
bars+0.008
run+0.007
iddled+0.007
ay+0.007
 Step+0.007
Neg logit contributions
 drowned-0.004
cious-0.004
Cr-0.004
G-0.004
S-0.004
Token longer
Token ID2392
Feature activation-5.85e-01

Pos logit contributions
run+0.023
iddled+0.022
box+0.022
bars+0.021
 Jake+0.021
Neg logit contributions
 drowned-0.013
act-0.013
Po-0.013
cious-0.012
 harmful-0.012
Token happy
Token ID3772
Feature activation-3.11e-01

Pos logit contributions
cious+0.009
act+0.009
Po+0.009
G+0.009
 drowned+0.008
Neg logit contributions
run-0.008
box-0.007
iddled-0.007
bars-0.007
 of-0.007
Token and
Token ID290
Feature activation-6.16e-01

Pos logit contributions
cious+0.006
act+0.006
Po+0.006
 drowned+0.006
G+0.006
Neg logit contributions
run-0.007
 swift-0.006
iddled-0.006
 older-0.006
bars-0.006
Token excited
Token ID6568
Feature activation+4.62e-01

Pos logit contributions
cious+0.013
act+0.012
 drowned+0.012
Po+0.012
G+0.012
Neg logit contributions
run-0.013
box-0.012
iddled-0.012
 swift-0.011
 of-0.011
Tokenâ
Token ID22940
Feature activation+8.58e-01

Pos logit contributions
run+0.010
 swift+0.009
iddled+0.009
bars+0.009
 of+0.009
Neg logit contributions
cious-0.009
act-0.009
Po-0.009
 drowned-0.009
G-0.009
Token
Token ID26391
Feature activation-3.02e+00

Pos logit contributions
run+0.019
 of+0.017
 swift+0.017
iddled+0.017
bars+0.016
Neg logit contributions
Po-0.017
act-0.016
cious-0.016
 pulp-0.016
 drowned-0.016
Token
Token ID447
Feature activation-2.77e+00

Pos logit contributions
Po+0.065
 pulp+0.064
act+0.063
Cr+0.060
 gag+0.060
Neg logit contributions
run-0.062
 Jake-0.058
 of-0.056
box-0.053
 swift-0.052
Token
Token ID251
Feature activation-1.23e+00

Pos logit contributions
 gag+0.052
Po+0.052
cious+0.050
act+0.050
 harmful+0.049
Neg logit contributions
 Faster-0.056
 Jake-0.055
 Step-0.054
run-0.054
 swift-0.054
Tokenhe
Token ID258
Feature activation+4.00e-02

Pos logit contributions
G+0.022
S+0.021
cious+0.021
Po+0.021
Cr+0.020
Neg logit contributions
run-0.030
 swift-0.028
iddled-0.028
 older-0.028
bars-0.027
Token'd
Token ID1549
Feature activation-2.00e+00

Pos logit contributions
run+0.001
 of+0.001
 swift+0.001
iddled+0.001
 older+0.001
Neg logit contributions
Po-0.001
cious-0.001
act-0.001
 drowned-0.001
ork-0.001
Token just
Token ID655
Feature activation+1.99e+00

Pos logit contributions
cious+0.032
Po+0.030
 drowned+0.030
Cr+0.030
G+0.030
Neg logit contributions
run-0.048
box-0.047
iddled-0.047
 swift-0.045
mith-0.045
Token seen
Token ID1775
Feature activation+1.12e+00

Pos logit contributions
box+0.052
iddled+0.049
run+0.048
bars+0.046
ay+0.046
Neg logit contributions
cious-0.031
 gag-0.029
 drowned-0.029
 sting-0.029
 disl-0.027