TOP ACTIVATIONS
MAX = 3.203

, there was a small , helpless bird . The bird
when she saw a huge , colourful puzzle on the ground
, Spot saw a big , dark cloud coming . The
, she saw a big , mean dog . The dog
the room was a big , open window .
happen because of the big , red ball . He told
, there was a small , cute cat . The cat
, it saw a large , dark grey cloud blocking out
and then she had a scr um pt ious snack .
's bicycle and wanted to ride it , but Tim my
when he saw a big , big yellow bird . The
park . It is big and brown and has long legs
. Lucy said " hello " to the squirrel , and
, and Peter wanted to ride it . He asked the
it made Bob 's face scr unch up . He didn
walked is saw a big , shiny mailbox . It looked
, " Sure , but we need to find a mat
house there was a big , huge v ase . Every
. There was a big , rough bad ger who moved
Then she noticed a big , round , yellow ball on

MIN ACTIVATIONS
MIN = -3.518

and Ben were going to the cabin with Mom and Dad
day , Lily was at the park with her mom .
her they were going to the park . Lily was eager
and his mom were at the doctor 's office . While
little girl who lived at the beach . Every day ,
his mom were going to the store . Jake was looking
and Lily were going to the zoo with their mom and
and her dad were at the beach . Jill showed her
her mom were going to the station . They had a
and Sam were going to the airport with Mom and Dad
and her dad were at the beach one summer day .
and Ben were going to the park with Mom . They
that they were going to the store to buy some new
One day Sarah was at the park and she saw a
day while she was at the stable it started to rain
and Ben were going to the woods with Mom and Dad
and Ben were going to the zoo with their mom .
eth and Jonathan were at the beach . Beth had buckets
his mom my were at the gym . John watched many
and his mom were at the park . They were having

INTERVAL 1.523 - 3.203
CONTAINS 0.923%

From that day on , the sw an
out his hand . " Come here , butterfly !" he
eventually , he saw a big muscle . It was moving
to disappear . She knew he wasn 't really gone and
. But Tom was upset because he could not find his

INTERVAL -0.157 - 1.523
CONTAINS 59.255%

their toys . They like to count them and say their
this is our land and we don 't
and joy fully returned home with his new toy . Now
have fun . The beach is great . <|endoftext|> Tim and
. She uses her brain . She is clever . She

INTERVAL -1.838 - -0.157
CONTAINS 39.546%

He said , " Well , this diamond is mine ."
Sometimes there are too many cars on the road and they
. They were having fun . But then , the sky
with his mom and dad . He was very excited to
they were going up so much . His human said ,

INTERVAL -3.518 - -1.838
CONTAINS 0.276%

went to bed . She dream t about sitting on the
't worry Tim my , it happens sometimes . Let 's
When she got to the beach , it was already
the dirt . It began to rot away in the soil
took her to the doctor and the doctor said , "
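
The four INTERVAL ranges above appear to be equal-width bins between MIN and MAX, and the CONTAINS percentages appear to sum to 100%. The sketch below checks that arithmetic using only the numbers printed in this dump. The helper name `equal_width_edges` and the four-bin assumption are illustrative and are not taken from the tool that produced this output.

```python
def equal_width_edges(lo, hi, bins=4):
    """Return bins + 1 evenly spaced edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

# MIN and MAX as printed in the dump (already rounded to 3 decimals).
edges = equal_width_edges(-3.518, 3.203)

# Interval boundaries as printed in the dump, lowest to highest.
reported = [-3.518, -1.838, -0.157, 1.523, 3.203]

# Allow 1e-3 slack because the printed values are themselves rounded.
edges_match = all(abs(e - r) <= 1e-3 for e, r in zip(edges, reported))

# CONTAINS percentages, lowest interval to highest.
shares = [0.276, 39.546, 59.255, 0.923]
total_share = sum(shares)

print(edges_match, round(total_share, 3))
```

If the check passes, each interval spans one quarter of the activation range, so the two outer intervals (0.276% and 0.923% of tokens) hold the rare strongly negative and strongly positive activations.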
Token,
Token ID11
Feature activation+7.27e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
ware+0.002
 exception+0.002
Neg logit contributions
 ever-0.002
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token there
Token ID612
Feature activation+8.85e-01

Pos logit contributions
 ever+0.021
unts+0.021
 onwards+0.019
keeping+0.019
 grin+0.018
Neg logit contributions
aman-0.028
 hey-0.027
 whose-0.027
land-0.026
 prett-0.026
Token was
Token ID373
Feature activation-3.59e-01

Pos logit contributions
 ever+0.035
keeping+0.032
unts+0.032
 onwards+0.032
 Amen+0.031
Neg logit contributions
aman-0.027
 whose-0.025
land-0.024
 hey-0.023
ware-0.023
Token a
Token ID257
Feature activation+8.70e-01

Pos logit contributions
aman+0.013
land+0.013
 hey+0.012
 exception+0.012
ware+0.012
Neg logit contributions
 ever-0.013
 onwards-0.011
unts-0.010
 fright-0.010
 onward-0.009
Token small
Token ID1402
Feature activation+1.07e+00

Pos logit contributions
 ever+0.038
 onwards+0.035
keeping+0.032
 onward+0.032
unts+0.031
Neg logit contributions
aman-0.024
 hey-0.023
 prett-0.022
lves-0.022
land-0.022
Token,
Token ID11
Feature activation+3.20e+00

Pos logit contributions
 ever+0.046
 onwards+0.043
 onward+0.040
unts+0.038
keeping+0.037
Neg logit contributions
aman-0.029
ware-0.028
 extingu-0.028
 exception-0.027
 hey-0.027
Token helpless
Token ID21144
Feature activation-3.15e-02

Pos logit contributions
keeping+0.159
 ever+0.159
 onwards+0.143
 onward+0.139
umbled+0.138
Neg logit contributions
 prett-0.088
 little-0.063
 living-0.060
 whose-0.059
 shin-0.059
Token bird
Token ID6512
Feature activation-8.11e-02

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 onward-0.001
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.003
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token The
Token ID383
Feature activation+3.61e-01

Pos logit contributions
 ever+0.072
unts+0.068
 onwards+0.067
keeping+0.067
 onward+0.064
Neg logit contributions
 whose-0.058
aman-0.054
 where-0.053
land-0.052
lves-0.052
Token bird
Token ID6512
Feature activation+2.19e-01

Pos logit contributions
 ever+0.014
 onwards+0.013
unts+0.012
 onward+0.012
 grin+0.012
Neg logit contributions
aman-0.011
 hey-0.011
 prett-0.010
land-0.010
lves-0.010
Token when
Token ID618
Feature activation+3.75e-01

Pos logit contributions
 ever+0.016
unts+0.014
 onwards+0.014
 fright+0.014
keeping+0.013
Neg logit contributions
aman-0.012
 exception-0.011
land-0.011
 whose-0.011
 hey-0.010
Token she
Token ID673
Feature activation-2.67e-02

Pos logit contributions
 ever+0.014
unts+0.013
 onwards+0.013
 fright+0.012
 grin+0.012
Neg logit contributions
aman-0.012
land-0.011
 exception-0.011
 hey-0.011
ware-0.011
Token saw
Token ID2497
Feature activation-1.38e+00

Pos logit contributions
aman+0.001
land+0.001
ware+0.001
 hey+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token a
Token ID257
Feature activation+3.90e-01

Pos logit contributions
aman+0.062
land+0.060
 hey+0.059
 extingu+0.059
ware+0.059
Neg logit contributions
 ever-0.036
 it-0.031
 fright-0.029
 onwards-0.028
 on-0.026
Token huge
Token ID3236
Feature activation+1.41e+00

Pos logit contributions
 ever+0.015
 onwards+0.013
unts+0.012
keeping+0.012
 fright+0.012
Neg logit contributions
aman-0.013
land-0.012
ware-0.012
 hey-0.012
lves-0.012
Token,
Token ID11
Feature activation+3.18e+00

Pos logit contributions
 ever+0.062
 onwards+0.058
unts+0.053
 onward+0.053
keeping+0.053
Neg logit contributions
aman-0.039
ware-0.037
 extingu-0.035
land-0.034
 exception-0.034
Token colourful
Token ID44799
Feature activation+7.70e-01

Pos logit contributions
 ever+0.150
keeping+0.148
unts+0.142
 onwards+0.140
umbled+0.138
Neg logit contributions
 prett-0.082
 living-0.062
 shin-0.055
 toilet-0.055
 little-0.054
Token puzzle
Token ID15027
Feature activation+2.99e-01

Pos logit contributions
 ever+0.032
 onwards+0.029
unts+0.027
 onward+0.026
 fright+0.026
Neg logit contributions
aman-0.023
ware-0.022
land-0.022
 whose-0.021
 extingu-0.021
Token on
Token ID319
Feature activation-1.15e+00

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.008
 fright+0.008
 grin+0.008
Neg logit contributions
aman-0.011
land-0.011
ware-0.011
 exception-0.010
 whose-0.010
Token the
Token ID262
Feature activation-7.65e-01

Pos logit contributions
aman+0.054
 hey+0.054
land+0.053
 exception+0.053
lves+0.053
Neg logit contributions
 ever-0.029
 it-0.022
 onwards-0.021
 fright-0.021
unts-0.020
Token ground
Token ID2323
Feature activation-6.88e-01

Pos logit contributions
aman+0.026
land+0.023
 exception+0.023
 hey+0.023
lves+0.023
Neg logit contributions
 ever-0.031
unts-0.026
 onwards-0.025
 fright-0.025
keeping-0.024
Token,
Token ID11
Feature activation+5.66e-02

Pos logit contributions
 ever+0.002
unts+0.001
 onwards+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
 exception-0.002
ware-0.002
Token Spot
Token ID15899
Feature activation-2.69e-01

Pos logit contributions
 ever+0.002
unts+0.002
 onwards+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
lves-0.002
 exception-0.002
Token saw
Token ID2497
Feature activation-1.05e+00

Pos logit contributions
aman+0.010
land+0.010
ware+0.010
 hey+0.010
 exception+0.009
Neg logit contributions
 ever-0.009
 onwards-0.008
unts-0.007
 fright-0.007
 onward-0.007
Token a
Token ID257
Feature activation+6.36e-01

Pos logit contributions
aman+0.047
land+0.044
ware+0.044
 exception+0.044
 extingu+0.043
Neg logit contributions
 ever-0.031
 onwards-0.024
 it-0.023
 fright-0.023
unts-0.022
Token big
Token ID1263
Feature activation+1.48e+00

Pos logit contributions
 ever+0.024
 onwards+0.022
keeping+0.020
unts+0.020
 onward+0.020
Neg logit contributions
aman-0.022
land-0.020
lves-0.020
 hey-0.019
 prett-0.019
Token,
Token ID11
Feature activation+3.17e+00

Pos logit contributions
 ever+0.064
 onwards+0.059
keeping+0.055
 onward+0.053
unts+0.053
Neg logit contributions
aman-0.045
 salon-0.039
 station-0.039
lves-0.038
land-0.038
Token dark
Token ID3223
Feature activation+6.70e-01

Pos logit contributions
 ever+0.158
keeping+0.151
 onwards+0.143
unts+0.139
ake+0.136
Neg logit contributions
 prett-0.085
 little-0.060
 shin-0.059
lves-0.059
 living-0.057
Token cloud
Token ID6279
Feature activation-3.58e-01

Pos logit contributions
 ever+0.026
 onwards+0.024
unts+0.022
 onward+0.022
 fright+0.021
Neg logit contributions
aman-0.022
land-0.020
 extingu-0.020
ware-0.020
lves-0.019
Token coming
Token ID2406
Feature activation+1.23e+00

Pos logit contributions
aman+0.015
land+0.014
 hey+0.014
ware+0.014
lves+0.014
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.008
 fright-0.008
 onward-0.008
Token.
Token ID13
Feature activation+7.35e-01

Pos logit contributions
 grin+0.044
 onwards+0.041
unts+0.041
 ever+0.040
 onward+0.037
Neg logit contributions
 extingu-0.038
aman-0.037
 whose-0.037
lves-0.036
 exception-0.036
Token The
Token ID383
Feature activation+1.25e+00

Pos logit contributions
 ever+0.026
unts+0.021
 onwards+0.020
keeping+0.019
 fright+0.019
Neg logit contributions
aman-0.027
land-0.025
lves-0.025
 whose-0.025
ware-0.025
Token,
Token ID11
Feature activation-6.38e-01

Pos logit contributions
aman+0.012
land+0.012
 hey+0.012
ware+0.012
 exception+0.012
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token she
Token ID673
Feature activation+1.06e-01

Pos logit contributions
aman+0.026
land+0.026
 exception+0.024
lves+0.024
 extingu+0.024
Neg logit contributions
 ever-0.021
 onwards-0.017
 fright-0.015
 onward-0.015
 it-0.015
Token saw
Token ID2497
Feature activation-1.28e+00

Pos logit contributions
 ever+0.004
 onwards+0.004
unts+0.003
 fright+0.003
 grin+0.003
Neg logit contributions
aman-0.004
land-0.003
 hey-0.003
lves-0.003
ware-0.003
Token a
Token ID257
Feature activation+5.28e-01

Pos logit contributions
aman+0.058
land+0.057
ware+0.056
 exception+0.056
 extingu+0.054
Neg logit contributions
 ever-0.035
 it-0.030
 fright-0.027
 on-0.027
 angry-0.026
Token big
Token ID1263
Feature activation+1.50e+00

Pos logit contributions
 ever+0.021
 onwards+0.019
unts+0.018
keeping+0.017
 onward+0.017
Neg logit contributions
aman-0.017
land-0.016
lves-0.015
 hey-0.015
ware-0.015
Token,
Token ID11
Feature activation+3.12e+00

Pos logit contributions
 ever+0.066
 onwards+0.062
keeping+0.057
 onward+0.057
unts+0.056
Neg logit contributions
aman-0.044
 salon-0.040
 extingu-0.038
 station-0.038
 prett-0.037
Token mean
Token ID1612
Feature activation+1.14e+00

Pos logit contributions
 ever+0.152
keeping+0.147
 onwards+0.139
unts+0.136
 onward+0.134
Neg logit contributions
 prett-0.087
 shin-0.062
lves-0.059
 little-0.058
 fanc-0.058
Token dog
Token ID3290
Feature activation+1.33e-01

Pos logit contributions
 ever+0.044
 onwards+0.043
 onward+0.039
kick+0.036
keeping+0.036
Neg logit contributions
aman-0.037
land-0.035
lves-0.034
 hey-0.034
 exception-0.034
Token.
Token ID13
Feature activation+8.55e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 onward+0.004
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
lves-0.005
 whose-0.004
Token The
Token ID383
Feature activation+6.51e-01

Pos logit contributions
 ever+0.035
unts+0.028
 onwards+0.027
 fright+0.026
keeping+0.026
Neg logit contributions
aman-0.028
 whose-0.026
lves-0.025
land-0.025
ware-0.024
Token dog
Token ID3290
Feature activation+6.36e-01

Pos logit contributions
 ever+0.024
 onwards+0.021
keeping+0.020
unts+0.019
 onward+0.019
Neg logit contributions
aman-0.022
lves-0.021
 hey-0.021
 prett-0.021
 whose-0.020
Token the
Token ID262
Feature activation-5.18e-01

Pos logit contributions
 hey+0.050
lves+0.049
aman+0.049
 exception+0.049
land+0.049
Neg logit contributions
 ever-0.026
 onwards-0.019
 it-0.017
unts-0.017
 on-0.015
Token room
Token ID2119
Feature activation+1.90e-02

Pos logit contributions
aman+0.019
land+0.018
lves+0.018
 hey+0.017
ware+0.017
Neg logit contributions
 ever-0.019
 onwards-0.016
unts-0.016
 fright-0.015
 onward-0.015
Token was
Token ID373
Feature activation+4.70e-02

Pos logit contributions
 ever+0.001
 onwards+0.000
unts+0.000
 fright+0.000
 grin+0.000
Neg logit contributions
aman-0.001
land-0.001
ware-0.001
 hey-0.001
 exception-0.001
Token a
Token ID257
Feature activation+2.70e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.001
ware-0.001
 exception-0.001
 hey-0.001
Token big
Token ID1263
Feature activation+1.05e+00

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.009
 fright+0.008
keeping+0.008
Neg logit contributions
aman-0.009
land-0.008
 hey-0.008
ware-0.008
 exception-0.008
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
 ever+0.046
 onwards+0.043
keeping+0.041
unts+0.040
 onward+0.040
Neg logit contributions
aman-0.028
 prett-0.026
 exception-0.025
ware-0.025
 salon-0.025
Token open
Token ID1280
Feature activation-5.88e-01

Pos logit contributions
keeping+0.156
 ever+0.151
unts+0.137
 onwards+0.136
umbled+0.132
Neg logit contributions
 prett-0.081
 living-0.061
 vacuum-0.058
 little-0.058
 whose-0.057
Token window
Token ID4324
Feature activation+3.44e-01

Pos logit contributions
aman+0.019
 hey+0.018
lves+0.018
land+0.018
 exception+0.017
Neg logit contributions
 ever-0.023
 onwards-0.020
unts-0.020
 fright-0.019
 grin-0.018
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
 ever+0.012
 onwards+0.010
unts+0.010
 fright+0.010
 grin+0.010
Neg logit contributions
aman-0.012
land-0.012
ware-0.012
 whose-0.012
 exception-0.011
Token\n
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 ever+0.010
 onwards+0.008
unts+0.008
 fright+0.008
 grin+0.008
Neg logit contributions
aman-0.010
land-0.010
 whose-0.009
 exception-0.009
ware-0.009
Token\n
Token ID198
Feature activation+3.68e-02

Pos logit contributions
 ever+0.053
 grin+0.048
 onwards+0.048
keeping+0.046
unts+0.046
Neg logit contributions
 whose-0.037
land-0.037
aman-0.036
 exception-0.033
ilst-0.032
Token happen
Token ID1645
Feature activation+6.06e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
 exception+0.002
ware+0.002
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.001
 onward-0.001
Token because
Token ID780
Feature activation+9.65e-01

Pos logit contributions
 ever+0.018
unts+0.018
 onwards+0.017
 grin+0.016
 fright+0.016
Neg logit contributions
land-0.023
aman-0.022
 exception-0.021
 whose-0.021
ware-0.021
Token of
Token ID286
Feature activation-5.52e-01

Pos logit contributions
unts+0.028
 ever+0.028
 grin+0.026
 onwards+0.026
 fright+0.024
Neg logit contributions
aman-0.036
 whose-0.035
land-0.035
ware-0.034
 exception-0.034
Token the
Token ID262
Feature activation+5.60e-01

Pos logit contributions
aman+0.024
land+0.023
 exception+0.023
lves+0.023
ware+0.023
Neg logit contributions
 ever-0.016
 onwards-0.013
 fright-0.012
unts-0.012
 onward-0.011
Token big
Token ID1263
Feature activation+1.89e+00

Pos logit contributions
 ever+0.022
 onwards+0.019
unts+0.019
 onward+0.018
keeping+0.017
Neg logit contributions
aman-0.018
land-0.017
 prett-0.017
 exception-0.017
 hey-0.016
Token,
Token ID11
Feature activation+3.07e+00

Pos logit contributions
 ever+0.082
 onwards+0.075
year+0.073
unts+0.072
keeping+0.070
Neg logit contributions
aman-0.050
 prett-0.048
 camps-0.047
 exception-0.047
land-0.045
Token red
Token ID2266
Feature activation+6.70e-01

Pos logit contributions
keeping+0.140
 ever+0.137
unts+0.137
 onwards+0.131
umbled+0.129
Neg logit contributions
 prett-0.083
 living-0.063
 happiest-0.063
lves-0.060
 little-0.060
Token ball
Token ID2613
Feature activation-6.64e-01

Pos logit contributions
 ever+0.026
 onwards+0.024
unts+0.023
 onward+0.021
keeping+0.021
Neg logit contributions
aman-0.022
land-0.021
ware-0.021
 exception-0.020
 extingu-0.019
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
aman+0.029
 hey+0.027
lves+0.027
land+0.027
 exception+0.026
Neg logit contributions
 ever-0.020
 onwards-0.016
unts-0.015
 fright-0.015
kick-0.014
Token He
Token ID679
Feature activation+2.53e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 onward-0.004
Token told
Token ID1297
Feature activation-2.69e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
 fright+0.006
 onward+0.006
Neg logit contributions
aman-0.010
land-0.010
ware-0.009
lves-0.009
 whose-0.009
Token,
Token ID11
Feature activation+4.98e-01

Pos logit contributions
aman+0.032
land+0.030
 hey+0.030
lves+0.030
 extingu+0.030
Neg logit contributions
 ever-0.020
 onwards-0.017
unts-0.015
 onward-0.014
 fright-0.014
Token there
Token ID612
Feature activation+1.38e-01

Pos logit contributions
 ever+0.019
unts+0.017
 onwards+0.016
 grin+0.016
 fright+0.016
Neg logit contributions
aman-0.016
 hey-0.016
 whose-0.015
ware-0.015
 prett-0.015
Token was
Token ID373
Feature activation-2.33e-02

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.004
Neg logit contributions
aman-0.005
land-0.005
ware-0.005
 hey-0.005
 whose-0.005
Token a
Token ID257
Feature activation+7.32e-01

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token small
Token ID1402
Feature activation+1.13e+00

Pos logit contributions
 ever+0.033
 onwards+0.030
keeping+0.028
unts+0.028
 onward+0.027
Neg logit contributions
aman-0.019
 hey-0.018
land-0.018
lves-0.018
 prett-0.018
Token,
Token ID11
Feature activation+3.05e+00

Pos logit contributions
 ever+0.044
 onwards+0.041
 onward+0.038
unts+0.037
keeping+0.037
Neg logit contributions
aman-0.036
land-0.034
 prett-0.033
 exception-0.033
lves-0.033
Token cute
Token ID13779
Feature activation+2.02e-01

Pos logit contributions
keeping+0.146
 ever+0.144
unts+0.133
 onwards+0.128
umbled+0.123
Neg logit contributions
 prett-0.090
 little-0.063
 living-0.062
 perfect-0.057
 whose-0.057
Token cat
Token ID3797
Feature activation-1.02e-01

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.008
 fright+0.008
 onward+0.008
Neg logit contributions
aman-0.005
land-0.005
 hey-0.004
lves-0.004
ware-0.004
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.003
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.002
Token The
Token ID383
Feature activation+6.20e-01

Pos logit contributions
 ever+0.015
 onwards+0.013
unts+0.012
 fright+0.012
keeping+0.012
Neg logit contributions
aman-0.015
land-0.015
 whose-0.015
ware-0.014
 exception-0.014
Token cat
Token ID3797
Feature activation+1.58e-01

Pos logit contributions
 ever+0.025
 onwards+0.023
keeping+0.022
 onward+0.021
unts+0.021
Neg logit contributions
aman-0.018
 prett-0.018
 hey-0.018
lves-0.017
 whose-0.017
Token,
Token ID11
Feature activation-2.94e-03

Pos logit contributions
 ever+0.026
unts+0.024
 Amen+0.022
keeping+0.022
 onwards+0.022
Neg logit contributions
aman-0.026
 prett-0.025
land-0.025
lves-0.025
 exception-0.025
Token it
Token ID340
Feature activation-3.64e-01

Pos logit contributions
aman+0.000
land+0.000
 hey+0.000
ware+0.000
lves+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token saw
Token ID2497
Feature activation-8.03e-01

Pos logit contributions
aman+0.016
land+0.015
 hey+0.015
 exception+0.015
ware+0.015
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.008
 fright-0.008
 onward-0.007
Token a
Token ID257
Feature activation+4.86e-01

Pos logit contributions
aman+0.034
 exception+0.032
land+0.032
ware+0.032
 extingu+0.031
Neg logit contributions
 ever-0.026
 fright-0.020
 onwards-0.020
unts-0.020
 it-0.019
Token large
Token ID1588
Feature activation+1.01e+00

Pos logit contributions
 ever+0.019
 onwards+0.017
unts+0.016
keeping+0.016
 onward+0.016
Neg logit contributions
aman-0.016
land-0.015
 hey-0.014
lves-0.014
ware-0.014
Token,
Token ID11
Feature activation+3.04e+00

Pos logit contributions
 ever+0.042
 onwards+0.038
unts+0.035
 onward+0.035
keeping+0.035
Neg logit contributions
aman-0.031
ware-0.028
land-0.028
 extingu-0.028
 exception-0.027
Token dark
Token ID3223
Feature activation+5.55e-01

Pos logit contributions
 ever+0.144
keeping+0.139
 onwards+0.130
unts+0.127
ake+0.124
Neg logit contributions
 prett-0.086
 shin-0.065
 little-0.060
 living-0.058
 bathroom-0.057
Token grey
Token ID13791
Feature activation+4.44e-01

Pos logit contributions
 ever+0.021
 onwards+0.019
unts+0.018
 onward+0.017
 fright+0.017
Neg logit contributions
aman-0.019
land-0.017
ware-0.017
 extingu-0.017
lves-0.017
Token cloud
Token ID6279
Feature activation-1.74e-01

Pos logit contributions
 ever+0.017
 onwards+0.015
unts+0.014
 onward+0.014
 fright+0.013
Neg logit contributions
aman-0.015
land-0.014
ware-0.014
 extingu-0.014
 hey-0.014
Token blocking
Token ID12013
Feature activation+4.48e-02

Pos logit contributions
aman+0.007
land+0.007
 hey+0.007
ware+0.007
 exception+0.007
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token out
Token ID503
Feature activation+8.78e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 grin+0.001
 fright+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
lves-0.002
 exception-0.002
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
aman+0.002
land+0.002
ware+0.002
 hey+0.002
 exception+0.002
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.000
Token then
Token ID788
Feature activation-1.43e-01

Pos logit contributions
aman+0.053
land+0.053
 exception+0.051
ware+0.051
 extingu+0.050
Neg logit contributions
 ever-0.034
 onwards-0.026
 it-0.023
 fright-0.023
 sideways-0.023
Token she
Token ID673
Feature activation+3.27e-01

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token had
Token ID550
Feature activation+1.24e-01

Pos logit contributions
 ever+0.011
 onwards+0.009
unts+0.009
 fright+0.009
keeping+0.008
Neg logit contributions
aman-0.012
land-0.012
ware-0.012
 hey-0.011
lves-0.011
Token a
Token ID257
Feature activation+2.33e-01

Pos logit contributions
 ever+0.004
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.003
Neg logit contributions
aman-0.004
land-0.004
ware-0.004
 hey-0.004
 exception-0.004
Token scr
Token ID6040
Feature activation+3.03e+00

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
keeping+0.007
 onward+0.006
Neg logit contributions
aman-0.008
land-0.008
 exception-0.008
 hey-0.008
ware-0.008
Tokenum
Token ID388
Feature activation+3.72e-02

Pos logit contributions
 ever+0.114
 onwards+0.106
keeping+0.102
 on+0.098
 onward+0.098
Neg logit contributions
aman-0.102
 exception-0.092
lves-0.084
land-0.083
aching-0.082
Tokenpt
Token ID457
Feature activation+8.00e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.001
land-0.001
 hey-0.001
ware-0.001
 exception-0.001
Tokenious
Token ID699
Feature activation+1.44e+00

Pos logit contributions
 ever+0.032
unts+0.030
 fright+0.029
 onwards+0.028
 Go+0.027
Neg logit contributions
aman-0.024
land-0.023
ware-0.022
 extingu-0.022
 hey-0.021
Token snack
Token ID26906
Feature activation+7.28e-01

Pos logit contributions
 ever+0.056
 onwards+0.053
 onward+0.048
keeping+0.047
unts+0.047
Neg logit contributions
aman-0.048
land-0.044
 exception-0.042
ware-0.041
 extingu-0.040
Token.
Token ID13
Feature activation+4.74e-02

Pos logit contributions
 ever+0.023
unts+0.021
 onwards+0.020
 fright+0.020
 grin+0.019
Neg logit contributions
aman-0.028
land-0.027
ware-0.026
 exception-0.025
 extingu-0.025
Token's
Token ID338
Feature activation+6.10e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.002
 onward+0.002
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
 exception-0.002
lves-0.002
Token bicycle
Token ID17026
Feature activation+1.91e-01

Pos logit contributions
 ever+0.024
unts+0.022
 onwards+0.022
 onward+0.020
keeping+0.020
Neg logit contributions
aman-0.019
 prett-0.017
 exception-0.017
land-0.017
ware-0.017
Token and
Token ID290
Feature activation-7.45e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.004
Neg logit contributions
aman-0.009
land-0.008
 exception-0.008
 hey-0.008
ware-0.008
Token wanted
Token ID2227
Feature activation+1.18e+00

Pos logit contributions
aman+0.035
land+0.035
ware+0.033
 hey+0.033
 exception+0.032
Neg logit contributions
 ever-0.020
 onwards-0.015
unts-0.015
 fright-0.014
 grin-0.013
Token to
Token ID284
Feature activation+8.50e-01

Pos logit contributions
unts+0.039
 grin+0.038
 onwards+0.038
 ever+0.035
 onward+0.035
Neg logit contributions
 whose-0.041
 prett-0.037
 exception-0.037
 cash-0.037
aman-0.037
Token ride
Token ID6594
Feature activation+3.03e+00

Pos logit contributions
 onwards+0.031
 ever+0.030
unts+0.030
 onward+0.028
keeping+0.027
Neg logit contributions
 exception-0.025
 hey-0.024
aman-0.024
 prett-0.023
ware-0.023
Token it
Token ID340
Feature activation-2.30e-01

Pos logit contributions
unts+0.122
umbs+0.108
 grin+0.102
irk+0.094
 Amen+0.091
Neg logit contributions
 whose-0.089
 where-0.085
 one-0.085
 who-0.083
 which-0.083
Token,
Token ID11
Feature activation+4.93e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.009
ware+0.009
lves+0.009
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 onward-0.005
Token but
Token ID475
Feature activation+4.74e-01

Pos logit contributions
 ever+0.024
unts+0.022
 onwards+0.022
keeping+0.021
 onward+0.020
Neg logit contributions
aman-0.011
land-0.010
 exception-0.010
 prett-0.010
 whose-0.010
Token Tim
Token ID5045
Feature activation+4.24e-01

Pos logit contributions
 ever+0.015
unts+0.014
 onwards+0.013
 fright+0.012
 grin+0.012
Neg logit contributions
aman-0.019
 whose-0.018
 hey-0.018
lves-0.017
 exception-0.017
Tokenmy
Token ID1820
Feature activation+4.67e-01

Pos logit contributions
 ever+0.017
 onwards+0.015
unts+0.015
 fright+0.014
 onward+0.014
Neg logit contributions
aman-0.013
 exception-0.012
land-0.012
lves-0.012
 hey-0.012
Token when
Token ID618
Feature activation+8.69e-01

Pos logit contributions
aman+0.011
land+0.011
 hey+0.010
ware+0.010
 exception+0.010
Neg logit contributions
 ever-0.010
 onwards-0.009
unts-0.008
 fright-0.008
 onward-0.008
Token he
Token ID339
Feature activation-3.56e-01

Pos logit contributions
 ever+0.032
unts+0.032
 onwards+0.029
 grin+0.029
ay+0.029
Neg logit contributions
 hey-0.025
aman-0.025
 whose-0.024
ware-0.024
 exception-0.023
Token saw
Token ID2497
Feature activation-1.24e+00

Pos logit contributions
aman+0.015
land+0.014
 exception+0.014
 hey+0.014
ware+0.014
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.009
 onward-0.008
Token a
Token ID257
Feature activation+6.05e-01

Pos logit contributions
aman+0.054
land+0.051
lves+0.049
 exception+0.049
 hey+0.048
Neg logit contributions
 ever-0.039
 it-0.031
 fright-0.031
 onwards-0.029
unts-0.028
Token big
Token ID1263
Feature activation+1.62e+00

Pos logit contributions
 ever+0.024
 onwards+0.021
unts+0.020
keeping+0.019
 onward+0.019
Neg logit contributions
aman-0.020
land-0.019
ware-0.018
lves-0.018
 hey-0.018
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
 ever+0.071
 onwards+0.065
keeping+0.061
 onward+0.060
unts+0.059
Neg logit contributions
aman-0.047
ware-0.042
 extingu-0.041
 prett-0.041
 station-0.040
Token big
Token ID1263
Feature activation+1.79e+00

Pos logit contributions
 ever+0.144
keeping+0.134
 onwards+0.128
unts+0.127
 onward+0.122
Neg logit contributions
 prett-0.086
 shin-0.063
 living-0.063
lves-0.060
 whose-0.059
Token yellow
Token ID7872
Feature activation+5.15e-01

Pos logit contributions
 ever+0.079
 onwards+0.073
keeping+0.068
 onward+0.067
unts+0.066
Neg logit contributions
aman-0.050
ware-0.045
 extingu-0.045
 station-0.044
 prett-0.044
Token bird
Token ID6512
Feature activation+3.56e-02

Pos logit contributions
 ever+0.020
 onwards+0.018
unts+0.017
 onward+0.016
 fright+0.016
Neg logit contributions
aman-0.017
ware-0.016
land-0.016
 hey-0.015
 exception-0.015
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.001
land-0.001
 hey-0.001
lves-0.001
ware-0.001
Token The
Token ID383
Feature activation+2.67e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 onward+0.004
Neg logit contributions
aman-0.006
land-0.006
 whose-0.006
ware-0.006
lves-0.006
Token park
Token ID3952
Feature activation+6.68e-01

Pos logit contributions
ware+0.026
aman+0.026
 exception+0.025
 hey+0.025
land+0.025
Neg logit contributions
 ever-0.039
 onwards-0.035
unts-0.034
 fright-0.034
 sideways-0.033
Token.
Token ID13
Feature activation+9.78e-01

Pos logit contributions
 ever+0.025
 onwards+0.023
unts+0.022
 fright+0.022
 grin+0.022
Neg logit contributions
land-0.021
aman-0.021
 whose-0.020
ware-0.019
 exception-0.019
Token It
Token ID632
Feature activation+5.76e-01

Pos logit contributions
 ever+0.033
unts+0.026
keeping+0.026
 onwards+0.025
 fright+0.024
Neg logit contributions
 whose-0.037
aman-0.036
ware-0.035
land-0.035
 extingu-0.033
Token is
Token ID318
Feature activation+1.14e+00

Pos logit contributions
 ever+0.020
 onwards+0.019
unts+0.018
keeping+0.017
 onward+0.017
Neg logit contributions
aman-0.019
 whose-0.019
lves-0.019
ware-0.019
land-0.019
Token big
Token ID1263
Feature activation+5.36e-01

Pos logit contributions
 ever+0.046
 onwards+0.044
 grin+0.042
keeping+0.041
 onward+0.041
Neg logit contributions
 whose-0.032
 prett-0.032
land-0.030
aman-0.030
 exception-0.028
Token and
Token ID290
Feature activation+3.01e+00

Pos logit contributions
 ever+0.014
 onwards+0.013
unts+0.012
 fright+0.011
 onward+0.011
Neg logit contributions
aman-0.023
land-0.023
ware-0.022
 whose-0.022
 exception-0.022
Token brown
Token ID7586
Feature activation+7.88e-01

Pos logit contributions
keeping+0.142
 ever+0.131
ake+0.130
unts+0.127
umbled+0.127
Neg logit contributions
 prett-0.084
 whose-0.071
 shin-0.061
 who-0.061
 living-0.060
Token and
Token ID290
Feature activation+2.94e+00

Pos logit contributions
 ever+0.026
 onwards+0.024
unts+0.023
 fright+0.022
 onward+0.021
Neg logit contributions
land-0.029
aman-0.028
 whose-0.028
ware-0.027
 extingu-0.027
Token has
Token ID468
Feature activation+2.64e-01

Pos logit contributions
keeping+0.118
 onwards+0.105
 ever+0.105
ake+0.105
unts+0.104
Neg logit contributions
 prett-0.093
 whose-0.088
 living-0.081
 who-0.079
 where-0.075
Token long
Token ID890
Feature activation+7.78e-01

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.008
 grin+0.008
 fright+0.008
Neg logit contributions
aman-0.009
land-0.009
 hey-0.008
lves-0.008
ware-0.008
Token legs
Token ID7405
Feature activation+5.52e-01

Pos logit contributions
 ever+0.024
 onwards+0.020
unts+0.019
 fright+0.018
 onward+0.017
Neg logit contributions
land-0.031
aman-0.030
ware-0.030
 whose-0.029
 exception-0.029
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 ever+0.033
 onwards+0.031
unts+0.031
 onward+0.029
 fright+0.029
Neg logit contributions
land-0.029
aman-0.028
 whose-0.028
 exception-0.028
ware-0.027
Token Lucy
Token ID22162
Feature activation+5.04e-01

Pos logit contributions
aman+0.013
land+0.013
 hey+0.012
ware+0.012
 exception+0.012
Neg logit contributions
 ever-0.009
 onwards-0.008
unts-0.007
 fright-0.007
 onward-0.007
Token said
Token ID531
Feature activation+8.63e-01

Pos logit contributions
 ever+0.016
 onwards+0.015
unts+0.014
 fright+0.013
 onward+0.013
Neg logit contributions
aman-0.019
land-0.018
 whose-0.018
lves-0.018
ware-0.017
Token "
Token ID366
Feature activation+8.99e-01

Pos logit contributions
 ever+0.028
unts+0.028
keeping+0.028
 onwards+0.025
kick+0.025
Neg logit contributions
 whose-0.028
 hey-0.028
aman-0.028
land-0.027
ware-0.026
Tokenhello
Token ID31373
Feature activation+7.92e-01

Pos logit contributions
 ever+0.034
 onwards+0.033
unts+0.032
 fright+0.030
 grin+0.030
Neg logit contributions
aman-0.028
land-0.026
lves-0.025
 prett-0.024
 hey-0.024
Token"
Token ID1
Feature activation+3.00e+00

Pos logit contributions
 ever+0.026
unts+0.023
 onwards+0.022
keeping+0.021
 grin+0.021
Neg logit contributions
aman-0.030
land-0.029
lves-0.028
 whose-0.028
 hey-0.027
Token to
Token ID284
Feature activation+5.61e-02

Pos logit contributions
unts+0.098
 Amen+0.096
 fright+0.095
 grin+0.092
 ever+0.091
Neg logit contributions
land-0.104
 prett-0.095
aman-0.094
 who-0.088
ray-0.087
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
lves-0.002
ware-0.002
Token squirrel
Token ID33039
Feature activation-5.21e-03

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 onward+0.004
 fright+0.004
Neg logit contributions
aman-0.008
lves-0.007
land-0.007
 hey-0.007
 prett-0.007
Token,
Token ID11
Feature activation-5.71e-01

Pos logit contributions
aman+0.000
land+0.000
 exception+0.000
 hey+0.000
ware+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
aman+0.022
land+0.021
ware+0.021
 exception+0.021
 hey+0.021
Neg logit contributions
 ever-0.019
 onwards-0.015
unts-0.015
 fright-0.015
 grin-0.014
Token,
Token ID11
Feature activation+1.79e+00

Pos logit contributions
aman+0.014
land+0.013
 hey+0.013
ware+0.013
 exception+0.013
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.008
 fright-0.008
 onward-0.007
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
unts+0.067
 ever+0.064
 onwards+0.060
imp+0.059
 grin+0.059
Neg logit contributions
 whose-0.052
 prett-0.050
aman-0.049
 who-0.047
 which-0.046
Token Peter
Token ID5613
Feature activation-2.88e-01

Pos logit contributions
 ever+0.043
unts+0.039
keeping+0.036
 grin+0.036
 onwards+0.035
Neg logit contributions
 whose-0.053
 prett-0.053
aman-0.051
lves-0.051
ware-0.049
Token wanted
Token ID2227
Feature activation+1.04e+00

Pos logit contributions
aman+0.013
land+0.012
 hey+0.012
ware+0.012
 exception+0.012
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.006
 fright-0.006
 grin-0.006
Token to
Token ID284
Feature activation+1.24e-01

Pos logit contributions
unts+0.035
 grin+0.035
 onwards+0.033
 Amen+0.031
 ever+0.030
Neg logit contributions
 whose-0.036
land-0.035
 who-0.032
aman-0.032
 where-0.032
Token ride
Token ID6594
Feature activation+2.99e+00

Pos logit contributions
 ever+0.004
 onwards+0.004
unts+0.003
 onward+0.003
 fright+0.003
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
 exception-0.004
ware-0.004
Token it
Token ID340
Feature activation-1.12e-01

Pos logit contributions
unts+0.133
irk+0.115
umbs+0.112
 grin+0.108
keeping+0.103
Neg logit contributions
 where-0.088
 whose-0.087
 one-0.084
 who-0.082
 which-0.078
Token.
Token ID13
Feature activation-6.64e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token He
Token ID679
Feature activation-1.67e-01

Pos logit contributions
aman+0.032
land+0.031
 hey+0.030
ware+0.030
 exception+0.030
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.012
 fright-0.012
 onward-0.011
Token asked
Token ID1965
Feature activation+1.41e-01

Pos logit contributions
aman+0.007
land+0.007
 hey+0.007
ware+0.007
 exception+0.007
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.004
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
ware-0.005
 whose-0.005
Token it
Token ID340
Feature activation+6.05e-02

Pos logit contributions
 ever+0.016
unts+0.015
 onwards+0.014
keeping+0.014
 grin+0.013
Neg logit contributions
aman-0.023
 whose-0.022
ware-0.022
 hey-0.022
land-0.022
Token made
Token ID925
Feature activation-5.68e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.002
ware-0.002
land-0.002
 hey-0.002
lves-0.002
Token Bob
Token ID5811
Feature activation+1.26e+00

Pos logit contributions
aman+0.028
land+0.027
ware+0.027
lves+0.027
 exception+0.027
Neg logit contributions
 ever-0.014
 onwards-0.010
 fright-0.009
unts-0.009
 onward-0.008
Token's
Token ID338
Feature activation+1.97e+00

Pos logit contributions
 ever+0.038
 onwards+0.038
 onward+0.034
keeping+0.033
unts+0.033
Neg logit contributions
 whose-0.044
 prett-0.043
 exception-0.043
land-0.043
 hey-0.042
Token face
Token ID1986
Feature activation+4.13e-01

Pos logit contributions
 onwards+0.084
 Amen+0.082
year+0.080
 onward+0.077
te+0.075
Neg logit contributions
 prett-0.055
 lookout-0.049
ware-0.048
 same-0.047
 little-0.046
Token scr
Token ID6040
Feature activation+2.98e+00

Pos logit contributions
 ever+0.014
 onwards+0.012
unts+0.012
 fright+0.011
keeping+0.011
Neg logit contributions
aman-0.015
land-0.014
 exception-0.014
ware-0.014
 prett-0.014
Tokenunch
Token ID3316
Feature activation+1.61e+00

Pos logit contributions
 ever+0.127
 onwards+0.116
 Amen+0.116
keeping+0.111
 fright+0.110
Neg logit contributions
aman-0.079
aching-0.073
 hey-0.069
 exception-0.069
her-0.069
Token up
Token ID510
Feature activation+1.59e+00

Pos logit contributions
 ever+0.057
 Amen+0.057
unts+0.055
 fright+0.054
 grin+0.053
Neg logit contributions
aman-0.047
ware-0.046
land-0.046
ters-0.045
 prett-0.044
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
unts+0.063
 ever+0.059
 Amen+0.059
ay+0.058
 onwards+0.057
Neg logit contributions
 prett-0.043
land-0.041
ware-0.041
aman-0.041
 exception-0.040
Token He
Token ID679
Feature activation+3.77e-01

Pos logit contributions
aman+0.014
land+0.014
 hey+0.014
ware+0.014
 exception+0.013
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.006
 fright-0.005
 onward-0.005
Token didn
Token ID1422
Feature activation+1.45e+00

Pos logit contributions
 ever+0.012
 onwards+0.011
unts+0.011
 onward+0.010
keeping+0.010
Neg logit contributions
aman-0.014
ware-0.013
 whose-0.013
lves-0.013
 hey-0.013
Token walked
Token ID6807
Feature activation+2.20e-01

Pos logit contributions
aman+0.035
 hey+0.033
 exception+0.033
land+0.033
lves+0.032
Neg logit contributions
 ever-0.019
 onwards-0.014
 fright-0.012
unts-0.012
 onward-0.012
Token is
Token ID318
Feature activation-2.61e-01

Pos logit contributions
 ever+0.007
unts+0.006
 onwards+0.006
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.009
land-0.008
 exception-0.008
 hey-0.008
ware-0.008
Token saw
Token ID2497
Feature activation-1.21e+00

Pos logit contributions
aman+0.011
land+0.010
ware+0.010
 hey+0.010
lves+0.010
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token a
Token ID257
Feature activation+1.26e-02

Pos logit contributions
aman+0.053
land+0.050
ware+0.049
 extingu+0.049
lves+0.049
Neg logit contributions
 ever-0.037
 onwards-0.028
 fright-0.028
 it-0.027
unts-0.025
Token big
Token ID1263
Feature activation+1.21e+00

Pos logit contributions
 ever+0.000
 onwards+0.000
unts+0.000
 fright+0.000
 onward+0.000
Neg logit contributions
aman-0.000
land-0.000
 hey-0.000
ware-0.000
lves-0.000
Token,
Token ID11
Feature activation+2.98e+00

Pos logit contributions
 ever+0.052
 onwards+0.047
unts+0.044
keeping+0.043
 onward+0.043
Neg logit contributions
aman-0.036
 station-0.032
land-0.032
 exception-0.032
 extingu-0.031
Token shiny
Token ID22441
Feature activation+1.36e-01

Pos logit contributions
 ever+0.143
keeping+0.140
unts+0.134
 onwards+0.129
ake+0.128
Neg logit contributions
 prett-0.081
 living-0.055
 little-0.054
 shin-0.054
 bathroom-0.053
Token mailbox
Token ID37282
Feature activation+1.61e-02

Pos logit contributions
 ever+0.005
 onwards+0.005
unts+0.005
 fright+0.004
 onward+0.004
Neg logit contributions
aman-0.004
land-0.004
ware-0.004
 exception-0.004
 hey-0.004
Token.
Token ID13
Feature activation-7.69e-02

Pos logit contributions
 ever+0.001
 onwards+0.000
unts+0.000
 fright+0.000
 grin+0.000
Neg logit contributions
aman-0.001
land-0.001
ware-0.001
 hey-0.001
 exception-0.001
Token It
Token ID632
Feature activation+2.85e-01

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.001
Token looked
Token ID3114
Feature activation+1.17e+00

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.008
keeping+0.008
 fright+0.008
Neg logit contributions
aman-0.010
ware-0.010
land-0.010
 prett-0.010
lves-0.010
Token,
Token ID11
Feature activation+5.96e-01

Pos logit contributions
 ever+0.028
unts+0.027
keeping+0.027
 onwards+0.025
 fright+0.025
Neg logit contributions
aman-0.025
 whose-0.025
 hey-0.025
ware-0.024
land-0.024
Token "
Token ID366
Feature activation-8.54e-02

Pos logit contributions
 ever+0.026
unts+0.023
 onwards+0.022
keeping+0.022
 fright+0.022
Neg logit contributions
aman-0.016
 whose-0.016
land-0.015
ware-0.015
 hey-0.015
TokenSure
Token ID19457
Feature activation+9.23e-01

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.003
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token,
Token ID11
Feature activation-5.04e-01

Pos logit contributions
unts+0.033
 ever+0.033
 onwards+0.033
keeping+0.030
 onward+0.030
Neg logit contributions
aman-0.031
 hey-0.029
land-0.028
 exception-0.026
 whose-0.026
Token but
Token ID475
Feature activation-1.89e-01

Pos logit contributions
aman+0.018
land+0.017
ware+0.017
 exception+0.017
lves+0.017
Neg logit contributions
 ever-0.019
 onwards-0.015
 fright-0.015
unts-0.014
 grin-0.014
Token we
Token ID356
Feature activation+2.97e+00

Pos logit contributions
aman+0.007
land+0.007
ware+0.007
 hey+0.007
 exception+0.007
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 grin-0.004
Token need
Token ID761
Feature activation+5.57e-02

Pos logit contributions
 onwards+0.092
keeping+0.088
 onward+0.086
unts+0.085
 grin+0.084
Neg logit contributions
 hey-0.103
lves-0.100
 least-0.099
 station-0.096
 exception-0.092
Token to
Token ID284
Feature activation-3.27e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.003
land-0.003
 hey-0.003
ware-0.002
 exception-0.002
Token find
Token ID1064
Feature activation-1.13e+00

Pos logit contributions
aman+0.013
land+0.012
ware+0.012
lves+0.012
 hey+0.012
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 grin-0.008
Token a
Token ID257
Feature activation-8.24e-01

Pos logit contributions
aman+0.047
ware+0.046
land+0.045
 exception+0.044
lves+0.043
Neg logit contributions
 ever-0.036
 it-0.028
 fright-0.028
 onwards-0.027
 on-0.026
Token mat
Token ID2603
Feature activation-1.74e-01

Pos logit contributions
aman+0.029
ware+0.028
land+0.028
 hey+0.027
lves+0.027
Neg logit contributions
 ever-0.030
 fright-0.025
 onwards-0.025
unts-0.024
 grin-0.023
Token house
Token ID2156
Feature activation-3.21e-01

Pos logit contributions
 ever+0.015
 onwards+0.013
unts+0.013
 fright+0.012
keeping+0.012
Neg logit contributions
aman-0.009
land-0.009
 hey-0.008
lves-0.008
 exception-0.008
Token there
Token ID612
Feature activation-1.14e+00

Pos logit contributions
aman+0.014
land+0.014
lves+0.014
 hey+0.013
 exception+0.013
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.007
 onward-0.006
Token was
Token ID373
Feature activation-1.26e-01

Pos logit contributions
aman+0.053
lves+0.053
land+0.051
 hey+0.051
 exception+0.050
Neg logit contributions
 ever-0.032
 onwards-0.023
unts-0.021
 it-0.020
 had-0.019
Token a
Token ID257
Feature activation+3.80e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token big
Token ID1263
Feature activation+1.37e+00

Pos logit contributions
 ever+0.018
 onwards+0.016
unts+0.015
keeping+0.015
 onward+0.015
Neg logit contributions
aman-0.010
 hey-0.009
land-0.009
lves-0.009
 exception-0.009
Token,
Token ID11
Feature activation+2.96e+00

Pos logit contributions
 ever+0.062
 onwards+0.059
keeping+0.056
 onward+0.054
unts+0.052
Neg logit contributions
aman-0.036
 station-0.033
 extingu-0.032
 exception-0.032
 prett-0.031
Token huge
Token ID3236
Feature activation+1.17e+00

Pos logit contributions
keeping+0.148
 ever+0.142
 onwards+0.127
unts+0.126
umbled+0.126
Neg logit contributions
 prett-0.077
 living-0.062
 little-0.059
 bathroom-0.055
 vacuum-0.055
Token v
Token ID410
Feature activation-1.69e-01

Pos logit contributions
 ever+0.055
 onwards+0.052
 onward+0.048
keeping+0.048
unts+0.047
Neg logit contributions
aman-0.028
land-0.025
ware-0.025
 extingu-0.025
 station-0.025
Tokenase
Token ID589
Feature activation-2.80e-02

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 grin-0.005
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token Every
Token ID3887
Feature activation+5.83e-01

Pos logit contributions
 ever+0.022
 onwards+0.018
unts+0.018
keeping+0.017
 fright+0.017
Neg logit contributions
aman-0.022
land-0.022
 whose-0.021
ware-0.021
 exception-0.021
Token.
Token ID13
Feature activation+4.47e-01

Pos logit contributions
 ever+0.016
 onwards+0.014
unts+0.014
 fright+0.013
 grin+0.013
Neg logit contributions
aman-0.018
land-0.018
ware-0.017
 exception-0.017
 whose-0.017
Token There
Token ID1318
Feature activation+1.15e+00

Pos logit contributions
 ever+0.015
 onwards+0.012
unts+0.011
 fright+0.011
keeping+0.011
Neg logit contributions
aman-0.017
ware-0.017
 hey-0.016
 exception-0.016
land-0.016
Token was
Token ID373
Feature activation-7.85e-01

Pos logit contributions
 ever+0.043
keeping+0.041
unts+0.039
 onwards+0.039
 fright+0.038
Neg logit contributions
aman-0.034
ware-0.033
 hey-0.032
 whose-0.032
land-0.030
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
aman+0.031
land+0.030
lves+0.029
 hey+0.029
 exception+0.029
Neg logit contributions
 ever-0.027
 onwards-0.021
 fright-0.021
unts-0.021
kick-0.019
Token big
Token ID1263
Feature activation+1.86e+00

Pos logit contributions
 ever+0.049
 onwards+0.044
keeping+0.042
 onward+0.040
unts+0.039
Neg logit contributions
aman-0.039
 prett-0.036
land-0.036
lves-0.036
ware-0.036
Token,
Token ID11
Feature activation+2.96e+00

Pos logit contributions
 ever+0.081
 onwards+0.075
keeping+0.073
 onward+0.069
year+0.068
Neg logit contributions
aman-0.052
ware-0.047
 extingu-0.047
 prett-0.046
 station-0.046
Token rough
Token ID5210
Feature activation+9.47e-01

Pos logit contributions
 ever+0.140
keeping+0.133
 onwards+0.125
unts+0.120
 onward+0.119
Neg logit contributions
 prett-0.085
 living-0.062
 extingu-0.061
 shin-0.060
 bathroom-0.059
Token bad
Token ID2089
Feature activation+7.88e-01

Pos logit contributions
 ever+0.041
 onwards+0.037
unts+0.035
 onward+0.034
keeping+0.033
Neg logit contributions
aman-0.028
ware-0.026
land-0.026
 exception-0.024
lves-0.024
Tokenger
Token ID1362
Feature activation-5.22e-01

Pos logit contributions
 ever+0.032
 onwards+0.031
 onward+0.028
unts+0.027
keeping+0.027
Neg logit contributions
aman-0.023
ware-0.022
 exception-0.022
land-0.022
 hey-0.021
Token who
Token ID508
Feature activation+9.08e-01

Pos logit contributions
aman+0.022
land+0.021
ware+0.020
 hey+0.020
 exception+0.020
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.013
 fright-0.012
 grin-0.011
Token moved
Token ID3888
Feature activation+1.97e+00

Pos logit contributions
 ever+0.026
 onwards+0.024
unts+0.022
 grin+0.021
 onward+0.021
Neg logit contributions
aman-0.036
lves-0.035
 whose-0.035
ware-0.034
 hey-0.034
Token Then
Token ID3244
Feature activation-4.89e-01

Pos logit contributions
aman+0.027
land+0.026
lves+0.026
 hey+0.025
ware+0.025
Neg logit contributions
 ever-0.019
 onwards-0.016
unts-0.015
 fright-0.014
 onward-0.014
Token she
Token ID673
Feature activation+7.64e-01

Pos logit contributions
aman+0.020
lves+0.019
land+0.019
ware+0.019
 exception+0.019
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.011
 fright-0.011
 onward-0.011
Token noticed
Token ID6810
Feature activation-1.76e+00

Pos logit contributions
 ever+0.026
 onwards+0.024
unts+0.023
 fright+0.022
keeping+0.022
Neg logit contributions
ware-0.026
aman-0.026
land-0.025
 prett-0.025
 exception-0.025
Token a
Token ID257
Feature activation+9.18e-01

Pos logit contributions
lves+0.074
aman+0.073
 exception+0.071
 extingu+0.070
 hey+0.069
Neg logit contributions
 ever-0.054
 it-0.044
 onwards-0.040
 fright-0.039
 on-0.039
Token big
Token ID1263
Feature activation+1.91e+00

Pos logit contributions
 ever+0.036
 onwards+0.032
unts+0.030
keeping+0.030
 onward+0.029
Neg logit contributions
aman-0.030
land-0.028
ware-0.027
 prett-0.027
 hey-0.027
Token,
Token ID11
Feature activation+2.96e+00

Pos logit contributions
 ever+0.083
 onwards+0.077
keeping+0.073
 onward+0.071
unts+0.071
Neg logit contributions
aman-0.053
 prett-0.048
ware-0.047
 extingu-0.047
 salon-0.047
Token round
Token ID2835
Feature activation+7.38e-01

Pos logit contributions
 ever+0.133
keeping+0.126
 onwards+0.118
unts+0.117
 onward+0.112
Neg logit contributions
 prett-0.090
land-0.068
 whose-0.068
 shin-0.067
aman-0.065
Token,
Token ID11
Feature activation+2.26e+00

Pos logit contributions
 ever+0.029
 onwards+0.027
unts+0.026
 onward+0.024
 fright+0.024
Neg logit contributions
aman-0.023
ware-0.023
land-0.022
 exception-0.021
 whose-0.021
Token yellow
Token ID7872
Feature activation+7.58e-01

Pos logit contributions
 ever+0.098
keeping+0.094
unts+0.087
 onwards+0.084
 Amen+0.082
Neg logit contributions
 prett-0.069
 whose-0.056
aman-0.055
 shin-0.053
land-0.052
Token ball
Token ID2613
Feature activation+2.36e-01

Pos logit contributions
 ever+0.029
 onwards+0.026
unts+0.025
 onward+0.024
 fright+0.024
Neg logit contributions
aman-0.025
ware-0.024
land-0.024
 whose-0.022
 exception-0.022
Token on
Token ID319
Feature activation-5.24e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.006
 fright+0.006
 grin+0.006
Neg logit contributions
aman-0.009
land-0.009
ware-0.009
 exception-0.008
 hey-0.008
Token and
Token ID290
Feature activation+6.51e-01

Pos logit contributions
 ever+0.014
 onwards+0.012
 fright+0.011
unts+0.011
 onward+0.011
Neg logit contributions
aman-0.024
land-0.023
 whose-0.022
ware-0.022
 hey-0.022
Token Ben
Token ID3932
Feature activation-8.01e-01

Pos logit contributions
 ever+0.022
 onwards+0.021
unts+0.020
kick+0.019
 fright+0.018
Neg logit contributions
aman-0.023
 hey-0.023
 whose-0.021
lves-0.021
land-0.020
Token were
Token ID547
Feature activation+6.67e-01

Pos logit contributions
aman+0.039
ware+0.038
land+0.038
 exception+0.037
lves+0.037
Neg logit contributions
 ever-0.020
unts-0.014
 onwards-0.014
 grin-0.012
 fright-0.012
Token going
Token ID1016
Feature activation+1.16e+00

Pos logit contributions
 ever+0.026
 fright+0.023
keeping+0.022
 onwards+0.022
 grin+0.022
Neg logit contributions
aman-0.021
 hey-0.020
 exception-0.020
 prett-0.019
 whose-0.019
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
 ever+0.028
unts+0.027
 fright+0.027
 grin+0.026
 onwards+0.023
Neg logit contributions
land-0.048
 whose-0.048
 exception-0.047
ware-0.047
aman-0.047
Token the
Token ID262
Feature activation-3.52e+00

Pos logit contributions
aman+0.046
lves+0.046
land+0.044
 hey+0.044
ware+0.043
Neg logit contributions
 ever-0.028
 fright-0.021
unts-0.020
 onwards-0.020
 have-0.019
Token cabin
Token ID9351
Feature activation-8.98e-01

Pos logit contributions
ist+0.135
ilst+0.122
der+0.120
lest+0.114
lves+0.114
Neg logit contributions
 ever-0.111
 sunset-0.098
 very-0.098
 on-0.093
 fast-0.092
Token with
Token ID351
Feature activation-2.97e-02

Pos logit contributions
aman+0.040
lves+0.038
 hey+0.037
land+0.037
ware+0.037
Neg logit contributions
 ever-0.027
 onwards-0.021
unts-0.019
 fright-0.018
kick-0.018
Token Mom
Token ID11254
Feature activation+8.54e-02

Pos logit contributions
aman+0.001
land+0.001
ware+0.001
 hey+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token and
Token ID290
Feature activation-3.47e-01

Pos logit contributions
 ever+0.003
 onwards+0.002
unts+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.003
land-0.003
 exception-0.003
 hey-0.003
ware-0.003
Token Dad
Token ID17415
Feature activation+4.25e-02

Pos logit contributions
aman+0.018
land+0.017
ware+0.017
lves+0.017
 exception+0.017
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.005
 fright-0.005
 grin-0.005
Token day
Token ID1110
Feature activation-8.58e-01

Pos logit contributions
aman+0.010
land+0.010
 hey+0.009
ware+0.009
lves+0.009
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
aman+0.045
land+0.044
lves+0.043
ware+0.042
 exception+0.042
Neg logit contributions
 ever-0.019
 onwards-0.015
unts-0.012
 onward-0.012
 on-0.012
Token Lily
Token ID20037
Feature activation+1.60e-01

Pos logit contributions
land+0.063
aman+0.062
lves+0.059
 extingu+0.058
 exception+0.057
Neg logit contributions
 ever-0.035
 it-0.029
 on-0.026
 onwards-0.026
 excited-0.024
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.004
Neg logit contributions
aman-0.006
land-0.006
ware-0.006
 hey-0.006
 exception-0.006
Token at
Token ID379
Feature activation-1.35e+00

Pos logit contributions
aman+0.088
land+0.082
 extingu+0.081
ware+0.078
lves+0.075
Neg logit contributions
 ever-0.036
 on-0.026
 onwards-0.025
 excited-0.024
 hungry-0.024
Token the
Token ID262
Feature activation-3.35e+00

Pos logit contributions
aman+0.066
land+0.065
lves+0.062
 hey+0.061
 exception+0.061
Neg logit contributions
 ever-0.036
 fright-0.023
 onwards-0.023
unts-0.022
 on-0.022
Token park
Token ID3952
Feature activation-5.97e-02

Pos logit contributions
 extingu+0.137
lves+0.133
ist+0.130
aman+0.128
lest+0.127
Neg logit contributions
 ever-0.113
 very-0.089
 on-0.087
 story-0.087
 to-0.083
Token with
Token ID351
Feature activation-1.47e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
 exception+0.002
ware+0.002
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token her
Token ID607
Feature activation-1.38e+00

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token mom
Token ID1995
Feature activation-2.67e-01

Pos logit contributions
aman+0.070
land+0.066
 hey+0.064
 exception+0.063
lves+0.062
Neg logit contributions
 ever-0.039
 on-0.026
 onwards-0.025
 fright-0.024
unts-0.023
Token.
Token ID13
Feature activation-8.85e-03

Pos logit contributions
aman+0.011
land+0.011
 hey+0.010
ware+0.010
lves+0.010
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.007
 fright-0.006
 onward-0.006
Token her
Token ID607
Feature activation+9.94e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
 onward+0.006
 fright+0.006
Neg logit contributions
aman-0.008
land-0.007
 hey-0.007
 exception-0.007
ware-0.007
Token they
Token ID484
Feature activation+2.48e+00

Pos logit contributions
 ever+0.036
unts+0.033
 onwards+0.033
kick+0.032
 fright+0.032
Neg logit contributions
land-0.032
 whose-0.030
aman-0.030
 hey-0.029
ware-0.029
Token were
Token ID547
Feature activation+7.18e-01

Pos logit contributions
keeping+0.112
imp+0.107
 onwards+0.104
 jaws+0.100
aks+0.099
Neg logit contributions
 hey-0.059
 who-0.055
 living-0.047
 whose-0.045
 prett-0.045
Token going
Token ID1016
Feature activation+8.95e-01

Pos logit contributions
 ever+0.026
 fright+0.024
 grin+0.024
unts+0.023
 onwards+0.023
Neg logit contributions
 hey-0.022
 exception-0.022
aman-0.022
 prett-0.022
lves-0.021
Token to
Token ID284
Feature activation-4.74e-01

Pos logit contributions
 ever+0.027
unts+0.024
 grin+0.024
 fright+0.023
 onwards+0.022
Neg logit contributions
land-0.034
 exception-0.034
 prett-0.034
ware-0.033
 whose-0.033
Token the
Token ID262
Feature activation-3.29e+00

Pos logit contributions
aman+0.020
land+0.019
lves+0.019
ware+0.019
 hey+0.018
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.011
 fright-0.011
 grin-0.010
Token park
Token ID3952
Feature activation+4.86e-01

Pos logit contributions
ist+0.130
land+0.126
lest+0.123
aman+0.121
der+0.120
Neg logit contributions
 ever-0.096
 very-0.089
 on-0.078
 fast-0.075
�-0.074
Token.
Token ID13
Feature activation-9.62e-01

Pos logit contributions
 ever+0.016
unts+0.014
 onwards+0.014
 fright+0.014
 grin+0.013
Neg logit contributions
land-0.018
aman-0.018
 exception-0.017
 whose-0.017
 hey-0.016
Token Lily
Token ID20037
Feature activation-4.17e-01

Pos logit contributions
aman+0.042
land+0.040
 hey+0.040
ware+0.040
 exception+0.040
Neg logit contributions
 ever-0.025
 onwards-0.024
unts-0.021
 onward-0.020
 It-0.020
Token was
Token ID373
Feature activation+6.34e-01

Pos logit contributions
aman+0.018
land+0.017
 hey+0.017
ware+0.017
 exception+0.017
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.009
 grin-0.009
Token eager
Token ID11069
Feature activation+6.11e-01

Pos logit contributions
 ever+0.017
unts+0.014
 onwards+0.014
keeping+0.013
 fright+0.013
Neg logit contributions
 prett-0.027
aman-0.026
 exception-0.026
 whose-0.026
 hey-0.026
Token and
Token ID290
Feature activation+5.85e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
ware-0.002
 exception-0.002
Token his
Token ID465
Feature activation-6.83e-01

Pos logit contributions
 ever+0.019
unts+0.017
 onwards+0.017
 fright+0.015
kick+0.015
Neg logit contributions
aman-0.022
 hey-0.021
 whose-0.021
land-0.020
 exception-0.020
Token mom
Token ID1995
Feature activation-3.50e-01

Pos logit contributions
aman+0.031
land+0.030
ware+0.030
 exception+0.029
lves+0.029
Neg logit contributions
 ever-0.019
unts-0.015
 onwards-0.014
 fright-0.014
 grin-0.014
Token were
Token ID547
Feature activation+9.56e-02

Pos logit contributions
aman+0.014
land+0.013
ware+0.013
lves+0.013
 hey+0.013
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 grin-0.008
Token at
Token ID379
Feature activation-1.27e+00

Pos logit contributions
 ever+0.003
 onwards+0.003
unts+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.004
 hey-0.004
land-0.004
 exception-0.004
lves-0.003
Token the
Token ID262
Feature activation-3.28e+00

Pos logit contributions
land+0.062
aman+0.061
lves+0.061
 exception+0.059
 hey+0.059
Neg logit contributions
 ever-0.030
 fright-0.019
 onwards-0.019
keeping-0.018
 on-0.018
Token doctor
Token ID6253
Feature activation-8.39e-01

Pos logit contributions
ist+0.111
lves+0.106
ilst+0.105
der+0.105
ware+0.099
Neg logit contributions
 ever-0.117
�-0.094
 on-0.094
 very-0.093
 sunset-0.093
Token's
Token ID338
Feature activation-1.69e+00

Pos logit contributions
aman+0.037
ware+0.035
lves+0.035
 hey+0.035
land+0.034
Neg logit contributions
 ever-0.024
 onwards-0.019
unts-0.018
 fright-0.017
 grin-0.016
Token office
Token ID2607
Feature activation-8.13e-01

Pos logit contributions
lves+0.066
aman+0.061
ware+0.061
land+0.061
 hey+0.059
Neg logit contributions
 ever-0.053
unts-0.044
 fright-0.044
 excited-0.043
 angry-0.042
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
aman+0.035
lves+0.034
 hey+0.034
ware+0.033
land+0.033
Neg logit contributions
 ever-0.024
 onwards-0.019
unts-0.018
 fright-0.017
kick-0.016
Token While
Token ID2893
Feature activation+4.16e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.003
 onward+0.003
Neg logit contributions
aman-0.008
land-0.007
 exception-0.007
 hey-0.007
 whose-0.007
Token little
Token ID1310
Feature activation+2.00e-01

Pos logit contributions
aman+0.007
land+0.006
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.013
 onwards-0.011
unts-0.011
 fright-0.011
 grin-0.010
Token girl
Token ID2576
Feature activation-5.53e-02

Pos logit contributions
 ever+0.009
 onwards+0.008
unts+0.008
 fright+0.008
 onward+0.008
Neg logit contributions
aman-0.005
 hey-0.005
land-0.005
lves-0.005
ware-0.005
Token who
Token ID508
Feature activation+6.55e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
 exception+0.002
ware+0.002
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.001
 fright-0.001
 grin-0.001
Token lived
Token ID5615
Feature activation-4.52e-01

Pos logit contributions
 ever+0.024
 onwards+0.023
 onward+0.022
unts+0.021
keeping+0.021
Neg logit contributions
 whose-0.019
 exception-0.019
lves-0.018
 hey-0.018
aman-0.018
Token at
Token ID379
Feature activation-1.78e+00

Pos logit contributions
aman+0.021
land+0.020
ware+0.020
 hey+0.020
 exception+0.020
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.009
 onward-0.008
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
land+0.098
aman+0.096
ist+0.094
 exception+0.092
lves+0.090
Neg logit contributions
 ever-0.038
 it-0.021
 on-0.020
keeping-0.019
 have-0.019
Token beach
Token ID10481
Feature activation-5.49e-01

Pos logit contributions
ist+0.121
 extingu+0.109
ware+0.104
der+0.103
isher+0.101
Neg logit contributions
 ever-0.117
 sunset-0.097
 hungry-0.095
 very-0.091
unts-0.091
Token.
Token ID13
Feature activation+7.54e-02

Pos logit contributions
aman+0.025
land+0.023
 hey+0.023
lves+0.023
ware+0.022
Neg logit contributions
 ever-0.016
 onwards-0.012
unts-0.012
 fright-0.011
kick-0.011
Token Every
Token ID3887
Feature activation+3.79e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.002
 onward+0.002
Neg logit contributions
aman-0.003
land-0.003
 exception-0.003
ware-0.003
 hey-0.003
Token day
Token ID1110
Feature activation+8.36e-01

Pos logit contributions
 ever+0.011
 onwards+0.010
unts+0.010
 fright+0.009
keeping+0.009
Neg logit contributions
aman-0.015
land-0.015
 exception-0.014
ware-0.014
 hey-0.014
Token,
Token ID11
Feature activation+3.05e-01

Pos logit contributions
 ever+0.030
keeping+0.029
unts+0.028
 fright+0.028
 Amen+0.026
Neg logit contributions
aman-0.026
land-0.024
 hey-0.024
 whose-0.023
 exception-0.023
Token his
Token ID465
Feature activation-8.99e-02

Pos logit contributions
 ever+0.016
 onwards+0.014
unts+0.014
 fright+0.013
 onward+0.013
Neg logit contributions
aman-0.014
 whose-0.014
 exception-0.013
 hey-0.013
land-0.013
Token mom
Token ID1995
Feature activation+1.65e-02

Pos logit contributions
aman+0.004
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.003
 onwards-0.002
unts-0.002
 fright-0.002
 onward-0.002
Token were
Token ID547
Feature activation-6.19e-02

Pos logit contributions
 ever+0.001
 onwards+0.000
unts+0.000
 fright+0.000
 grin+0.000
Neg logit contributions
aman-0.001
land-0.001
 hey-0.001
 exception-0.001
lves-0.001
Token going
Token ID1016
Feature activation+7.92e-01

Pos logit contributions
aman+0.003
land+0.002
 hey+0.002
 exception+0.002
ware+0.002
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.001
Token to
Token ID284
Feature activation-6.69e-01

Pos logit contributions
 ever+0.017
unts+0.015
 fright+0.014
 grin+0.013
 onwards+0.012
Neg logit contributions
land-0.037
aman-0.037
 exception-0.037
 prett-0.036
ware-0.036
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
aman+0.029
land+0.028
lves+0.027
ware+0.027
 hey+0.027
Neg logit contributions
 ever-0.020
 onwards-0.016
unts-0.015
 fright-0.015
 grin-0.014
Token store
Token ID3650
Feature activation-6.19e-01

Pos logit contributions
ist+0.123
der+0.115
ware+0.112
ilst+0.107
 extingu+0.105
Neg logit contributions
 ever-0.103
 sunset-0.092
�-0.091
 very-0.088
 it-0.082
Token.
Token ID13
Feature activation-9.88e-02

Pos logit contributions
aman+0.029
ware+0.028
 hey+0.027
land+0.027
lves+0.027
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.012
 fright-0.011
 onward-0.010
Token Jake
Token ID14757
Feature activation-2.94e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.003
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token was
Token ID373
Feature activation+3.96e-01

Pos logit contributions
aman+0.013
land+0.012
 hey+0.012
 exception+0.012
ware+0.012
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.006
 onward-0.006
Token looking
Token ID2045
Feature activation+1.70e-01

Pos logit contributions
 ever+0.010
 onwards+0.008
unts+0.008
 fright+0.008
 grin+0.007
Neg logit contributions
aman-0.018
 exception-0.017
 hey-0.017
 prett-0.017
 whose-0.017
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.000
unts-0.000
 fright-0.000
 onward-0.000
Token Lily
Token ID20037
Feature activation-4.49e-01

Pos logit contributions
 onwards+0.045
unts+0.042
 ever+0.041
 onward+0.040
 grin+0.038
Neg logit contributions
 whose-0.036
 hey-0.034
 who-0.034
 extingu-0.032
aman-0.030
Token were
Token ID547
Feature activation+8.96e-01

Pos logit contributions
aman+0.021
land+0.020
ware+0.020
 exception+0.020
lves+0.020
Neg logit contributions
 ever-0.012
unts-0.009
 onwards-0.009
 fright-0.008
 grin-0.008
Token going
Token ID1016
Feature activation+1.22e+00

Pos logit contributions
 ever+0.035
 fright+0.031
keeping+0.031
 onwards+0.030
 grin+0.030
Neg logit contributions
aman-0.027
 hey-0.027
 exception-0.026
 prett-0.026
 living-0.026
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
 ever+0.031
unts+0.031
 fright+0.031
 grin+0.030
 onwards+0.027
Neg logit contributions
 whose-0.049
land-0.049
 exception-0.047
 prett-0.047
ware-0.047
Token the
Token ID262
Feature activation-3.25e+00

Pos logit contributions
aman+0.046
lves+0.045
land+0.045
ware+0.044
 hey+0.044
Neg logit contributions
 ever-0.028
 fright-0.020
unts-0.020
 onwards-0.019
 have-0.019
Token zoo
Token ID26626
Feature activation-3.08e-01

Pos logit contributions
ist+0.122
ilst+0.111
der+0.105
aching+0.103
lest+0.102
Neg logit contributions
 ever-0.105
 sunset-0.092
 very-0.091
 hungry-0.090
 sideways-0.089
Token with
Token ID351
Feature activation+1.62e-01

Pos logit contributions
aman+0.013
land+0.013
 hey+0.013
lves+0.013
ware+0.012
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.007
 onward-0.006
Token their
Token ID511
Feature activation-3.42e-01

Pos logit contributions
 ever+0.006
 onwards+0.005
unts+0.005
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.006
land-0.005
 hey-0.005
ware-0.005
 exception-0.005
Token mom
Token ID1995
Feature activation-8.07e-02

Pos logit contributions
aman+0.016
land+0.015
 hey+0.015
 exception+0.015
ware+0.015
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.006
 grin-0.006
Token and
Token ID290
Feature activation-4.51e-01

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
 exception+0.003
ware+0.003
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token and
Token ID290
Feature activation+8.64e-02

Pos logit contributions
aman+0.019
land+0.018
 hey+0.018
 exception+0.018
ware+0.017
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 grin-0.008
Token her
Token ID607
Feature activation-3.58e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.002
 onward+0.002
Neg logit contributions
aman-0.004
 hey-0.004
land-0.004
 exception-0.003
lves-0.003
Token dad
Token ID9955
Feature activation-4.85e-01

Pos logit contributions
aman+0.016
land+0.015
 exception+0.015
ware+0.015
 hey+0.015
Neg logit contributions
 ever-0.011
 onwards-0.008
unts-0.008
 fright-0.008
 grin-0.007
Token were
Token ID547
Feature activation-3.59e-01

Pos logit contributions
aman+0.022
land+0.021
ware+0.021
 hey+0.021
lves+0.021
Neg logit contributions
 ever-0.013
 onwards-0.010
unts-0.010
 fright-0.009
 grin-0.009
Token at
Token ID379
Feature activation-1.88e+00

Pos logit contributions
aman+0.015
land+0.015
ware+0.014
lves+0.014
 hey+0.014
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.008
Token the
Token ID262
Feature activation-3.22e+00

Pos logit contributions
land+0.102
aman+0.100
ist+0.100
lves+0.099
der+0.099
Neg logit contributions
 ever-0.034
 it-0.023
 on-0.020
keeping-0.018
 have-0.017
Token beach
Token ID10481
Feature activation-4.79e-01

Pos logit contributions
der+0.117
ist+0.114
 extingu+0.113
ware+0.111
lves+0.105
Neg logit contributions
 ever-0.106
�-0.091
 sunset-0.091
 very-0.086
 it-0.083
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
aman+0.021
 hey+0.020
land+0.019
lves+0.019
ware+0.019
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.011
 fright-0.010
 grin-0.010
Token Jill
Token ID22772
Feature activation-5.51e-01

Pos logit contributions
 ever+0.003
 onwards+0.003
unts+0.002
 fright+0.002
 onward+0.002
Neg logit contributions
aman-0.006
land-0.005
 exception-0.005
 hey-0.005
ware-0.005
Token showed
Token ID3751
Feature activation-2.69e-01

Pos logit contributions
aman+0.025
land+0.024
 hey+0.024
 exception+0.023
ware+0.023
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.011
 fright-0.010
 grin-0.010
Token her
Token ID607
Feature activation-3.04e-01

Pos logit contributions
aman+0.011
land+0.010
 hey+0.010
lves+0.010
ware+0.010
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.007
 fright-0.006
 grin-0.006
Token her
Token ID607
Feature activation-2.31e-01

Pos logit contributions
 ever+0.033
unts+0.032
 onwards+0.032
kick+0.029
 fright+0.029
Neg logit contributions
aman-0.029
 hey-0.028
 whose-0.028
 who-0.026
lves-0.026
Token mom
Token ID1995
Feature activation-3.55e-01

Pos logit contributions
aman+0.010
land+0.009
ware+0.009
 exception+0.009
 hey+0.009
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 grin-0.005
Token were
Token ID547
Feature activation+5.31e-01

Pos logit contributions
aman+0.016
land+0.015
ware+0.015
 exception+0.015
lves+0.015
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.008
 fright-0.007
 grin-0.007
Token going
Token ID1016
Feature activation+8.11e-01

Pos logit contributions
 ever+0.018
 fright+0.016
 onwards+0.015
unts+0.015
keeping+0.015
Neg logit contributions
aman-0.020
 hey-0.019
 exception-0.018
land-0.018
 prett-0.018
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
 ever+0.022
unts+0.020
 fright+0.019
 grin+0.018
 onwards+0.018
Neg logit contributions
land-0.034
aman-0.033
 exception-0.033
 whose-0.033
 prett-0.032
Token the
Token ID262
Feature activation-3.20e+00

Pos logit contributions
aman+0.044
lves+0.043
land+0.043
ware+0.042
 hey+0.042
Neg logit contributions
 ever-0.030
 fright-0.022
unts-0.021
 onwards-0.021
 have-0.020
Token station
Token ID4429
Feature activation-5.46e-01

Pos logit contributions
ist+0.116
ilst+0.107
der+0.104
lves+0.100
lest+0.100
Neg logit contributions
 ever-0.106
 very-0.089
 sunset-0.088
 on-0.088
 hungry-0.088
Token.
Token ID13
Feature activation-3.84e-01

Pos logit contributions
aman+0.024
lves+0.023
 hey+0.023
ware+0.022
land+0.022
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.012
 fright-0.011
 onward-0.011
Token They
Token ID1119
Feature activation-3.42e-01

Pos logit contributions
aman+0.016
 hey+0.015
land+0.015
ware+0.015
lves+0.015
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.008
Token had
Token ID550
Feature activation+8.30e-01

Pos logit contributions
aman+0.015
land+0.014
ware+0.014
 hey+0.014
 exception+0.014
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.008
 fright-0.007
 grin-0.007
Token a
Token ID257
Feature activation-1.74e-01

Pos logit contributions
 ever+0.033
 grin+0.031
 onwards+0.031
unts+0.030
 fright+0.029
Neg logit contributions
land-0.024
aman-0.023
 whose-0.023
 prett-0.022
 exception-0.021
Token and
Token ID290
Feature activation+6.55e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 onward-0.002
Token Sam
Token ID3409
Feature activation-1.03e+00

Pos logit contributions
 ever+0.024
 onwards+0.023
unts+0.022
kick+0.021
 onward+0.021
Neg logit contributions
aman-0.021
 hey-0.021
 whose-0.020
lves-0.019
 exception-0.018
Token were
Token ID547
Feature activation+7.18e-01

Pos logit contributions
aman+0.050
land+0.048
ware+0.048
 exception+0.047
lves+0.047
Neg logit contributions
 ever-0.026
unts-0.018
 onwards-0.018
 fright-0.017
 grin-0.016
Token going
Token ID1016
Feature activation+9.86e-01

Pos logit contributions
 ever+0.027
 fright+0.024
keeping+0.024
 onwards+0.023
 grin+0.022
Neg logit contributions
aman-0.023
 hey-0.023
 exception-0.022
 prett-0.022
 whose-0.021
Token to
Token ID284
Feature activation-9.49e-01

Pos logit contributions
 ever+0.027
unts+0.025
 fright+0.024
 grin+0.024
 onwards+0.022
Neg logit contributions
land-0.040
 exception-0.039
 whose-0.039
 prett-0.038
aman-0.038
Token the
Token ID262
Feature activation-3.20e+00

Pos logit contributions
aman+0.043
lves+0.042
land+0.042
ware+0.041
 hey+0.041
Neg logit contributions
 ever-0.026
 fright-0.019
unts-0.019
 onwards-0.019
 have-0.017
Token airport
Token ID9003
Feature activation-1.11e+00

Pos logit contributions
ist+0.115
ilst+0.106
der+0.103
land+0.100
lest+0.098
Neg logit contributions
 ever-0.108
 very-0.092
 sunset-0.092
 hungry-0.090
 sideways-0.088
Token with
Token ID351
Feature activation+2.74e-01

Pos logit contributions
aman+0.053
lves+0.051
 hey+0.050
ware+0.049
 extingu+0.048
Neg logit contributions
 ever-0.031
 onwards-0.022
unts-0.020
 on-0.019
kick-0.019
Token Mom
Token ID11254
Feature activation-7.14e-02

Pos logit contributions
 ever+0.010
 onwards+0.008
unts+0.008
 fright+0.008
 grin+0.008
Neg logit contributions
aman-0.010
ware-0.009
land-0.009
 exception-0.009
 hey-0.009
Token and
Token ID290
Feature activation-2.63e-01

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
 exception+0.003
ware+0.003
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token Dad
Token ID17415
Feature activation-3.28e-01

Pos logit contributions
aman+0.013
land+0.013
ware+0.013
 exception+0.013
lves+0.013
Neg logit contributions
 ever-0.006
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
 fright+0.006
 onward+0.006
Neg logit contributions
aman-0.012
land-0.012
ware-0.011
 hey-0.011
 exception-0.011
Token her
Token ID607
Feature activation-6.00e-01

Pos logit contributions
 ever+0.007
 onwards+0.006
unts+0.006
 fright+0.006
 onward+0.005
Neg logit contributions
aman-0.009
 hey-0.009
land-0.008
 whose-0.008
lves-0.008
Token dad
Token ID9955
Feature activation-3.53e-01

Pos logit contributions
aman+0.025
land+0.024
 exception+0.024
ware+0.024
lves+0.023
Neg logit contributions
 ever-0.019
unts-0.015
 onwards-0.015
 fright-0.014
 grin-0.014
Token were
Token ID547
Feature activation-2.38e-03

Pos logit contributions
aman+0.016
land+0.015
ware+0.015
 hey+0.015
 exception+0.015
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.007
 fright-0.007
 grin-0.007
Token at
Token ID379
Feature activation-1.51e+00

Pos logit contributions
aman+0.000
land+0.000
 hey+0.000
 exception+0.000
lves+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token the
Token ID262
Feature activation-3.18e+00

Pos logit contributions
land+0.079
aman+0.078
lves+0.077
ist+0.076
 exception+0.075
Neg logit contributions
 ever-0.031
 it-0.020
keeping-0.018
 on-0.018
 fright-0.017
Token beach
Token ID10481
Feature activation-5.16e-01

Pos logit contributions
ist+0.114
der+0.109
ware+0.108
 extingu+0.108
lves+0.101
Neg logit contributions
 ever-0.107
�-0.093
 sunset-0.091
 very-0.088
 it-0.085
Token one
Token ID530
Feature activation-1.04e+00

Pos logit contributions
aman+0.022
 hey+0.021
lves+0.021
land+0.021
ware+0.020
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.012
 fright-0.011
 onward-0.011
Token summer
Token ID3931
Feature activation-1.10e-01

Pos logit contributions
lves+0.052
aman+0.052
 hey+0.051
ware+0.050
 extingu+0.049
Neg logit contributions
 ever-0.025
 onwards-0.018
unts-0.016
kick-0.015
 onward-0.014
Token day
Token ID1110
Feature activation-2.01e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.003
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.002
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
aman+0.008
land+0.008
 hey+0.007
ware+0.007
 exception+0.007
Neg logit contributions
 ever-0.007
 onwards-0.005
unts-0.005
 fright-0.005
 onward-0.005
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.002
land-0.001
 hey-0.001
ware-0.001
 exception-0.001
Token Ben
Token ID3932
Feature activation-6.22e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 onward+0.003
Neg logit contributions
aman-0.006
 hey-0.006
land-0.006
 whose-0.006
 exception-0.006
Token were
Token ID547
Feature activation-1.99e-03

Pos logit contributions
aman+0.029
land+0.028
ware+0.028
 exception+0.028
 hey+0.027
Neg logit contributions
 ever-0.016
 onwards-0.012
unts-0.012
 fright-0.011
 grin-0.010
Token going
Token ID1016
Feature activation+8.93e-01

Pos logit contributions
aman+0.000
land+0.000
 hey+0.000
 exception+0.000
lves+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token to
Token ID284
Feature activation-9.71e-01

Pos logit contributions
 ever+0.023
unts+0.021
 fright+0.020
 grin+0.019
 onwards+0.018
Neg logit contributions
land-0.038
aman-0.038
 exception-0.038
 prett-0.037
 whose-0.037
Token the
Token ID262
Feature activation-3.17e+00

Pos logit contributions
aman+0.043
lves+0.042
land+0.042
ware+0.041
 hey+0.041
Neg logit contributions
 ever-0.028
 onwards-0.021
 fright-0.020
unts-0.020
 have-0.019
Token park
Token ID3952
Feature activation+4.35e-02

Pos logit contributions
ist+0.107
der+0.099
ware+0.097
ters+0.095
 extingu+0.094
Neg logit contributions
 ever-0.112
 sunset-0.095
�-0.094
 sideways-0.094
 very-0.092
Token with
Token ID351
Feature activation-8.59e-02

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.002
 exception-0.002
ware-0.002
 hey-0.002
Token Mom
Token ID11254
Feature activation-1.40e-01

Pos logit contributions
aman+0.003
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.003
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.004
 onwards-0.004
unts-0.004
 fright-0.003
 grin-0.003
Token They
Token ID1119
Feature activation-1.99e-01

Pos logit contributions
aman+0.010
land+0.009
 hey+0.009
ware+0.009
lves+0.009
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 onward-0.005
Token that
Token ID326
Feature activation+4.20e-01

Pos logit contributions
unts+0.071
 ever+0.071
 onwards+0.069
 fright+0.067
keeping+0.067
Neg logit contributions
 whose-0.050
land-0.048
 where-0.047
 hey-0.046
ware-0.045
Token they
Token ID484
Feature activation+1.93e+00

Pos logit contributions
 ever+0.015
 onwards+0.013
unts+0.013
 grin+0.013
 fright+0.013
Neg logit contributions
aman-0.014
ware-0.014
 whose-0.013
land-0.013
 hey-0.013
Token were
Token ID547
Feature activation+1.81e-01

Pos logit contributions
 onwards+0.076
keeping+0.073
 grin+0.071
 onward+0.070
unts+0.066
Neg logit contributions
 hey-0.053
 exception-0.051
 who-0.050
 prett-0.049
 whose-0.046
Token going
Token ID1016
Feature activation+6.30e-01

Pos logit contributions
 ever+0.006
 onwards+0.005
unts+0.005
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.007
 exception-0.007
 hey-0.007
land-0.006
lves-0.006
Token to
Token ID284
Feature activation-2.29e-01

Pos logit contributions
 ever+0.017
unts+0.015
 grin+0.014
 fright+0.014
 onwards+0.014
Neg logit contributions
land-0.026
 exception-0.026
aman-0.026
 hey-0.025
 whose-0.025
Token the
Token ID262
Feature activation-3.17e+00

Pos logit contributions
aman+0.009
land+0.009
 hey+0.009
ware+0.009
lves+0.009
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.006
 grin-0.005
Token store
Token ID3650
Feature activation-3.48e-01

Pos logit contributions
ist+0.120
lest+0.112
lves+0.111
ware+0.109
aman+0.106
Neg logit contributions
 ever-0.110
 very-0.093
 loud-0.091
 on-0.087
 story-0.083
Token to
Token ID284
Feature activation-7.81e-01

Pos logit contributions
aman+0.017
land+0.016
ware+0.016
 hey+0.016
lves+0.016
Neg logit contributions
 ever-0.009
 onwards-0.006
unts-0.006
 fright-0.006
 onward-0.005
Token buy
Token ID2822
Feature activation-3.90e-01

Pos logit contributions
aman+0.029
land+0.028
ware+0.028
lves+0.027
 exception+0.027
Neg logit contributions
 ever-0.028
 fright-0.022
 onwards-0.022
unts-0.022
 have-0.021
Token some
Token ID617
Feature activation-9.14e-01

Pos logit contributions
aman+0.015
land+0.014
 hey+0.014
lves+0.014
ware+0.014
Neg logit contributions
 ever-0.013
 onwards-0.011
unts-0.011
 fright-0.010
 onward-0.010
Token new
Token ID649
Feature activation-1.19e+00

Pos logit contributions
aman+0.037
 hey+0.035
lves+0.034
land+0.033
ware+0.032
Neg logit contributions
 ever-0.031
 onwards-0.025
unts-0.024
 fright-0.023
kick-0.022
Token One
Token ID1881
Feature activation-1.91e-02

Pos logit contributions
aman+0.020
land+0.020
 hey+0.019
ware+0.019
lves+0.019
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.012
 fright-0.011
 onward-0.011
Token day
Token ID1110
Feature activation-1.28e+00

Pos logit contributions
aman+0.001
land+0.001
 exception+0.001
 hey+0.001
ware+0.001
Neg logit contributions
 ever-0.001
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token Sarah
Token ID10490
Feature activation-4.68e-01

Pos logit contributions
aman+0.067
lves+0.066
land+0.065
ware+0.064
 prett+0.064
Neg logit contributions
 ever-0.029
 onwards-0.022
 onward-0.018
 it-0.018
 on-0.017
Token was
Token ID373
Feature activation-8.97e-01

Pos logit contributions
aman+0.021
land+0.020
 hey+0.020
ware+0.019
 exception+0.019
Neg logit contributions
 ever-0.013
 onwards-0.010
unts-0.010
 fright-0.009
 grin-0.009
Token at
Token ID379
Feature activation-7.72e-01

Pos logit contributions
aman+0.042
 extingu+0.039
land+0.039
ware+0.039
lves+0.039
Neg logit contributions
 ever-0.026
 onwards-0.021
unts-0.019
 onward-0.018
 fright-0.018
Token the
Token ID262
Feature activation-3.17e+00

Pos logit contributions
aman+0.032
land+0.031
lves+0.030
 hey+0.030
 exception+0.029
Neg logit contributions
 ever-0.025
 onwards-0.020
 fright-0.019
unts-0.019
keeping-0.018
Token park
Token ID3952
Feature activation-4.35e-01

Pos logit contributions
 extingu+0.120
lves+0.111
ist+0.106
ware+0.101
 whose+0.100
Neg logit contributions
 ever-0.112
 on-0.087
 it-0.086
 very-0.086
 high-0.082
Token and
Token ID290
Feature activation-2.27e-01

Pos logit contributions
aman+0.018
land+0.017
 hey+0.017
lves+0.017
ware+0.017
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.011
 fright-0.010
 onward-0.010
Token she
Token ID673
Feature activation-3.18e-01

Pos logit contributions
aman+0.008
land+0.008
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token saw
Token ID2497
Feature activation-1.01e+00

Pos logit contributions
aman+0.013
land+0.012
 hey+0.012
ware+0.012
 exception+0.012
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.008
 fright-0.008
 onward-0.007
Token a
Token ID257
Feature activation+3.39e-01

Pos logit contributions
aman+0.041
lves+0.039
land+0.039
 exception+0.038
ware+0.038
Neg logit contributions
 ever-0.032
 onwards-0.026
 fright-0.025
unts-0.025
 it-0.024
Token day
Token ID1110
Feature activation-9.85e-01

Pos logit contributions
aman+0.020
 hey+0.020
ware+0.019
lves+0.019
land+0.019
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.007
 fright-0.007
 onward-0.006
Token while
Token ID981
Feature activation+9.05e-01

Pos logit contributions
aman+0.051
lves+0.049
land+0.048
 extingu+0.048
 exception+0.047
Neg logit contributions
 ever-0.024
 onwards-0.020
 onward-0.016
 on-0.015
 it-0.015
Token she
Token ID673
Feature activation-6.92e-01

Pos logit contributions
 ever+0.037
unts+0.035
 fright+0.034
keeping+0.033
 grin+0.032
Neg logit contributions
aman-0.024
 hey-0.023
ware-0.023
 whose-0.023
 exception-0.022
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
aman+0.031
 hey+0.030
land+0.029
 exception+0.029
lves+0.029
Neg logit contributions
 ever-0.020
 onwards-0.015
unts-0.014
 fright-0.014
 onward-0.013
Token at
Token ID379
Feature activation-1.55e+00

Pos logit contributions
aman+0.051
lves+0.048
ware+0.048
land+0.048
 hey+0.048
Neg logit contributions
 ever-0.027
 onwards-0.022
 onward-0.018
unts-0.018
 on-0.018
Token the
Token ID262
Feature activation-3.16e+00

Pos logit contributions
 hey+0.080
land+0.079
ist+0.079
aman+0.078
aching+0.077
Neg logit contributions
 ever-0.032
 onwards-0.021
 it-0.021
 on-0.020
 fright-0.018
Token stable
Token ID8245
Feature activation-9.70e-01

Pos logit contributions
 extingu+0.121
ist+0.119
 whose+0.103
 hey+0.103
 appearances+0.102
Neg logit contributions
 ever-0.103
 very-0.091
 loud-0.088
 on-0.086
 it-0.084
Token it
Token ID340
Feature activation-1.09e+00

Pos logit contributions
aman+0.045
lves+0.043
 hey+0.043
land+0.043
 exception+0.043
Neg logit contributions
 ever-0.026
 onwards-0.020
unts-0.017
 onward-0.017
 fright-0.016
Token started
Token ID2067
Feature activation-4.89e-01

Pos logit contributions
aman+0.051
 hey+0.049
lves+0.049
 extingu+0.049
 exception+0.048
Neg logit contributions
 ever-0.029
 onwards-0.023
 onward-0.019
 fright-0.019
unts-0.018
Token to
Token ID284
Feature activation-8.36e-01

Pos logit contributions
aman+0.024
 hey+0.023
lves+0.022
land+0.022
ware+0.022
Neg logit contributions
 ever-0.012
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.008
Token rain
Token ID6290
Feature activation-1.92e+00

Pos logit contributions
aman+0.032
lves+0.031
 hey+0.031
land+0.030
ware+0.029
Neg logit contributions
 ever-0.028
 fright-0.023
 onwards-0.023
unts-0.022
 grin-0.022
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
aman+0.021
land+0.020
 hey+0.020
 exception+0.020
lves+0.019
Neg logit contributions
 ever-0.013
 onwards-0.010
unts-0.010
 fright-0.009
 grin-0.008
Token Ben
Token ID3932
Feature activation-5.33e-01

Pos logit contributions
 onwards+0.045
 ever+0.044
unts+0.042
 onward+0.041
kick+0.040
Neg logit contributions
 whose-0.035
 hey-0.034
aman-0.033
 who-0.032
 extingu-0.029
Token were
Token ID547
Feature activation+3.58e-01

Pos logit contributions
aman+0.024
land+0.023
ware+0.023
 exception+0.022
 hey+0.022
Neg logit contributions
 ever-0.015
unts-0.011
 onwards-0.011
 fright-0.011
 grin-0.010
Token going
Token ID1016
Feature activation+1.09e+00

Pos logit contributions
 ever+0.015
 fright+0.013
 onwards+0.013
unts+0.013
keeping+0.013
Neg logit contributions
aman-0.011
 hey-0.010
 exception-0.010
 whose-0.010
 prett-0.010
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
 ever+0.027
unts+0.025
 fright+0.025
 grin+0.023
keeping+0.021
Neg logit contributions
land-0.047
 exception-0.046
 prett-0.046
aman-0.046
 whose-0.046
Token the
Token ID262
Feature activation-3.15e+00

Pos logit contributions
aman+0.051
lves+0.050
land+0.050
 hey+0.048
ware+0.048
Neg logit contributions
 ever-0.030
 fright-0.021
unts-0.021
 onwards-0.021
 have-0.020
Token woods
Token ID16479
Feature activation-2.13e-01

Pos logit contributions
ist+0.111
der+0.102
ilst+0.100
ters+0.097
land+0.097
Neg logit contributions
 ever-0.107
 sunset-0.093
 sideways-0.089
 very-0.088
 hungry-0.088
Token with
Token ID351
Feature activation-2.33e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.008
ware+0.008
lves+0.008
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 onward-0.005
Token Mom
Token ID11254
Feature activation-1.44e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.009
lves+0.009
 exception+0.009
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.006
 fright-0.006
 grin-0.006
Token and
Token ID290
Feature activation-3.79e-01

Pos logit contributions
aman+0.006
land+0.006
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token Dad
Token ID17415
Feature activation-5.57e-01

Pos logit contributions
aman+0.020
land+0.020
lves+0.019
ware+0.019
 exception+0.019
Neg logit contributions
 ever-0.008
 onwards-0.005
unts-0.005
 fright-0.004
 grin-0.004
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
 ever+0.009
 onwards+0.008
unts+0.008
 fright+0.007
 onward+0.007
Neg logit contributions
aman-0.015
land-0.015
ware-0.014
 hey-0.014
lves-0.014
Token Ben
Token ID3932
Feature activation-6.69e-01

Pos logit contributions
 ever+0.012
 onwards+0.011
unts+0.011
 fright+0.010
 onward+0.010
Neg logit contributions
aman-0.013
 hey-0.013
 whose-0.012
land-0.012
lves-0.012
Token were
Token ID547
Feature activation+5.20e-01

Pos logit contributions
aman+0.031
land+0.030
ware+0.030
 exception+0.029
lves+0.029
Neg logit contributions
 ever-0.018
unts-0.013
 onwards-0.013
 fright-0.012
 grin-0.012
Token going
Token ID1016
Feature activation+8.92e-01

Pos logit contributions
 ever+0.022
 fright+0.019
keeping+0.019
 onwards+0.019
unts+0.018
Neg logit contributions
aman-0.015
 hey-0.014
 exception-0.014
 prett-0.014
 whose-0.014
Token to
Token ID284
Feature activation-1.31e+00

Pos logit contributions
 ever+0.023
unts+0.021
 fright+0.021
 grin+0.020
 onwards+0.019
Neg logit contributions
land-0.037
 whose-0.037
 exception-0.036
aman-0.036
ware-0.036
Token the
Token ID262
Feature activation-3.15e+00

Pos logit contributions
aman+0.062
lves+0.062
land+0.061
 hey+0.059
ware+0.059
Neg logit contributions
 ever-0.033
 fright-0.024
 have-0.024
unts-0.023
 it-0.022
Token zoo
Token ID26626
Feature activation-2.19e-01

Pos logit contributions
ist+0.112
ilst+0.103
der+0.100
ters+0.099
lves+0.098
Neg logit contributions
 ever-0.105
 sunset-0.092
 very-0.090
 sideways-0.089
 hungry-0.087
Token with
Token ID351
Feature activation+2.10e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.009
ware+0.009
lves+0.009
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 onward-0.005
Token their
Token ID511
Feature activation-1.26e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.006
 fright+0.006
 grin+0.006
Neg logit contributions
aman-0.008
ware-0.007
land-0.007
 exception-0.007
 hey-0.007
Token mom
Token ID1995
Feature activation-1.48e-01

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
aman+0.006
land+0.006
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Tokeneth
Token ID2788
Feature activation+3.62e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 grin-0.005
Token and
Token ID290
Feature activation-4.67e-02

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.009
 fright+0.008
 onward+0.008
Neg logit contributions
aman-0.015
land-0.015
 hey-0.014
ware-0.014
 exception-0.014
Token Jonathan
Token ID11232
Feature activation-5.87e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
 exception+0.002
lves+0.002
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 onward-0.001
Token were
Token ID547
Feature activation+1.81e-01

Pos logit contributions
aman+0.026
land+0.025
ware+0.025
 exception+0.025
lves+0.025
Neg logit contributions
 ever-0.017
 onwards-0.013
unts-0.012
 fright-0.012
 grin-0.011
Token at
Token ID379
Feature activation-1.52e+00

Pos logit contributions
 ever+0.006
 onwards+0.005
unts+0.005
 fright+0.005
keeping+0.005
Neg logit contributions
aman-0.007
 hey-0.006
 exception-0.006
land-0.006
lves-0.006
Token the
Token ID262
Feature activation-3.14e+00

Pos logit contributions
land+0.081
aman+0.079
lves+0.078
ist+0.078
 exception+0.076
Neg logit contributions
 ever-0.030
 it-0.019
 on-0.017
 onwards-0.017
 fright-0.017
Token beach
Token ID10481
Feature activation-4.48e-01

Pos logit contributions
ist+0.112
ware+0.106
der+0.106
 extingu+0.103
lves+0.101
Neg logit contributions
 ever-0.108
 sunset-0.089
�-0.088
 very-0.087
 hungry-0.084
Token.
Token ID13
Feature activation-5.00e-01

Pos logit contributions
aman+0.019
 hey+0.018
land+0.018
lves+0.018
ware+0.018
Neg logit contributions
 ever-0.013
 onwards-0.011
unts-0.010
 fright-0.010
 grin-0.009
Token Beth
Token ID19223
Feature activation-1.64e-01

Pos logit contributions
aman+0.024
 hey+0.024
land+0.023
ware+0.023
lves+0.023
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.008
Token had
Token ID550
Feature activation+4.53e-02

Pos logit contributions
aman+0.007
land+0.007
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Token buckets
Token ID38674
Feature activation+8.26e-02

Pos logit contributions
 ever+0.002
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.001
 hey-0.001
ware-0.001
 exception-0.001
Token his
Token ID465
Feature activation-2.71e-01

Pos logit contributions
 ever+0.020
 onwards+0.019
unts+0.018
 fright+0.017
 onward+0.017
Neg logit contributions
aman-0.021
 whose-0.021
 hey-0.020
land-0.019
 exception-0.019
Token mom
Token ID1995
Feature activation-4.55e-01

Pos logit contributions
aman+0.012
land+0.011
ware+0.011
 exception+0.011
 hey+0.011
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.006
 fright-0.006
 grin-0.006
Tokenmy
Token ID1820
Feature activation-7.52e-01

Pos logit contributions
aman+0.019
land+0.019
ware+0.018
lves+0.018
 hey+0.018
Neg logit contributions
 ever-0.013
 onwards-0.011
unts-0.011
 fright-0.010
 grin-0.009
Token were
Token ID547
Feature activation-1.52e-01

Pos logit contributions
aman+0.035
land+0.034
ware+0.034
lves+0.034
 exception+0.033
Neg logit contributions
 ever-0.019
 onwards-0.014
unts-0.014
 fright-0.012
 grin-0.012
Token at
Token ID379
Feature activation-1.32e+00

Pos logit contributions
aman+0.006
land+0.006
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Token the
Token ID262
Feature activation-3.13e+00

Pos logit contributions
land+0.066
aman+0.065
lves+0.065
 exception+0.063
 hey+0.062
Neg logit contributions
 ever-0.029
 onwards-0.018
 it-0.018
 on-0.018
 fright-0.018
Token gym
Token ID11550
Feature activation-5.10e-01

Pos logit contributions
ist+0.113
der+0.113
lves+0.106
ware+0.105
 extingu+0.104
Neg logit contributions
 ever-0.104
�-0.092
 sunset-0.089
 very-0.087
 it-0.086
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
aman+0.022
land+0.020
 hey+0.020
lves+0.020
ware+0.020
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.012
 fright-0.011
 onward-0.011
Token John
Token ID1757
Feature activation-5.01e-01

Pos logit contributions
aman+0.002
land+0.001
 hey+0.001
 exception+0.001
ware+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token watched
Token ID7342
Feature activation+8.75e-02

Pos logit contributions
aman+0.022
land+0.021
 hey+0.021
ware+0.021
 exception+0.021
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.011
 fright-0.010
 grin-0.010
Token many
Token ID867
Feature activation-2.00e-01

Pos logit contributions
 ever+0.003
 onwards+0.003
unts+0.003
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.003
land-0.003
ware-0.003
 exception-0.003
 hey-0.003
Token and
Token ID290
Feature activation+4.96e-01

Pos logit contributions
 ever+0.012
 onwards+0.011
unts+0.010
 fright+0.010
 onward+0.010
Neg logit contributions
aman-0.017
land-0.016
 whose-0.016
 exception-0.016
 hey-0.016
Token his
Token ID465
Feature activation+3.79e-02

Pos logit contributions
 ever+0.017
 onwards+0.016
unts+0.015
 onward+0.014
 fright+0.014
Neg logit contributions
aman-0.017
 exception-0.016
 whose-0.016
 hey-0.016
land-0.016
Token mom
Token ID1995
Feature activation+2.11e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.001
land-0.001
 hey-0.001
ware-0.001
lves-0.001
Token were
Token ID547
Feature activation-1.27e-01

Pos logit contributions
 ever+0.007
 onwards+0.007
unts+0.006
 fright+0.006
 onward+0.006
Neg logit contributions
aman-0.008
 hey-0.007
land-0.007
 exception-0.007
 whose-0.007
Token at
Token ID379
Feature activation-1.26e+00

Pos logit contributions
aman+0.005
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token the
Token ID262
Feature activation-3.12e+00

Pos logit contributions
aman+0.062
land+0.062
lves+0.059
ware+0.059
 exception+0.059
Neg logit contributions
 ever-0.030
 fright-0.019
 onwards-0.019
 on-0.018
unts-0.018
Token park
Token ID3952
Feature activation-1.42e-01

Pos logit contributions
ist+0.123
der+0.115
lves+0.107
 extingu+0.106
ware+0.104
Neg logit contributions
 ever-0.103
 sunset-0.086
 very-0.082
�-0.080
 it-0.079
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.003
 grin-0.003
Token They
Token ID1119
Feature activation+3.77e-01

Pos logit contributions
 ever+0.022
 onwards+0.017
 fright+0.017
unts+0.017
kick+0.016
Neg logit contributions
aman-0.024
 whose-0.022
 exception-0.022
ware-0.022
land-0.022
Token were
Token ID547
Feature activation+1.44e+00

Pos logit contributions
 ever+0.013
 onwards+0.012
unts+0.011
 fright+0.011
keeping+0.011
Neg logit contributions
aman-0.013
land-0.012
lves-0.012
 hey-0.012
 exception-0.012
Token having
Token ID1719
Feature activation+1.76e+00

Pos logit contributions
 ever+0.050
 fright+0.047
keeping+0.046
 grin+0.044
 Amen+0.043
Neg logit contributions
 prett-0.049
 living-0.046
 whose-0.044
aman-0.043
 exception-0.042
Token
Token ID220
Feature activation-1.13e+00

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.003
land-0.003
 hey-0.003
lves-0.003
 exception-0.003
Token
Token ID198
Feature activation+5.30e-01

Pos logit contributions
aman+0.054
ware+0.052
 exception+0.052
land+0.051
 hey+0.051
Neg logit contributions
 ever-0.028
 onwards-0.023
unts-0.023
 onward-0.019
 fright-0.019
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 ever+0.019
 onwards+0.016
 fright+0.016
unts+0.016
 grin+0.015
Neg logit contributions
aman-0.018
land-0.017
 whose-0.017
 hey-0.017
lves-0.016
TokenFrom
Token ID4863
Feature activation-5.29e-01

Pos logit contributions
aman+0.004
land+0.003
 hey+0.003
ware+0.003
 exception+0.003
Neg logit contributions
 ever-0.004
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token that
Token ID326
Feature activation+4.20e-01

Pos logit contributions
aman+0.018
ware+0.017
land+0.017
 hey+0.016
 exception+0.016
Neg logit contributions
 ever-0.020
 onwards-0.017
unts-0.016
 fright-0.016
 onward-0.016
Token day
Token ID1110
Feature activation+1.93e+00

Pos logit contributions
 ever+0.013
unts+0.011
 onwards+0.011
keeping+0.011
 fright+0.010
Neg logit contributions
aman-0.016
land-0.015
 exception-0.015
 whose-0.015
lves-0.015
Token on
Token ID319
Feature activation+3.46e-01

Pos logit contributions
keeping+0.051
 fright+0.045
unts+0.042
 ever+0.041
 grin+0.040
Neg logit contributions
 hey-0.076
aman-0.073
 whose-0.071
 living-0.070
aching-0.067
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 ever+0.011
unts+0.010
 fright+0.010
 onwards+0.009
keeping+0.009
Neg logit contributions
aman-0.013
land-0.012
 hey-0.012
ware-0.012
 whose-0.012
Token the
Token ID262
Feature activation-6.71e-01

Pos logit contributions
aman+0.005
land+0.005
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token sw
Token ID1509
Feature activation-6.70e-01

Pos logit contributions
aman+0.027
ware+0.025
land+0.025
 exception+0.025
 hey+0.025
Neg logit contributions
 ever-0.022
unts-0.018
 onwards-0.017
 fright-0.017
 mad-0.016
Tokenan
Token ID272
Feature activation-4.62e-01

Pos logit contributions
aman+0.020
ware+0.019
land+0.019
 hey+0.019
 exception+0.018
Neg logit contributions
 ever-0.028
unts-0.024
 onwards-0.024
 fright-0.023
 grin-0.022
Token out
Token ID503
Feature activation+1.94e-01

Pos logit contributions
unts+0.040
 ever+0.039
keeping+0.036
 Amen+0.031
kick+0.031
Neg logit contributions
 whose-0.039
 extingu-0.037
ware-0.036
aman-0.036
 prett-0.036
Token his
Token ID465
Feature activation+4.67e-01

Pos logit contributions
 ever+0.007
unts+0.006
 onwards+0.006
keeping+0.005
 fright+0.005
Neg logit contributions
aman-0.007
land-0.007
ware-0.007
 exception-0.007
 whose-0.007
Token hand
Token ID1021
Feature activation+1.39e-01

Pos logit contributions
 ever+0.014
unts+0.012
 onwards+0.012
keeping+0.010
 onward+0.010
Neg logit contributions
aman-0.019
 prett-0.018
 whose-0.018
land-0.018
ware-0.018
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
 ever+0.004
unts+0.004
 onwards+0.004
 fright+0.003
 grin+0.003
Neg logit contributions
aman-0.006
land-0.005
 exception-0.005
ware-0.005
 hey-0.005
Token "
Token ID366
Feature activation-3.87e-01

Pos logit contributions
aman+0.024
land+0.023
 hey+0.023
lves+0.023
 exception+0.022
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.008
 onward-0.008
TokenCome
Token ID16773
Feature activation+1.77e+00

Pos logit contributions
aman+0.014
land+0.014
ware+0.013
 exception+0.013
 hey+0.013
Neg logit contributions
 ever-0.014
 onwards-0.012
unts-0.011
 fright-0.011
 onward-0.010
Token here
Token ID994
Feature activation+5.67e-01

Pos logit contributions
unts+0.058
 ever+0.052
keeping+0.050
 onwards+0.046
kick+0.043
Neg logit contributions
 hey-0.064
land-0.062
 whose-0.062
 extingu-0.061
aman-0.061
Token,
Token ID11
Feature activation-3.47e-01

Pos logit contributions
 ever+0.017
unts+0.015
 onwards+0.015
 fright+0.014
keeping+0.013
Neg logit contributions
aman-0.023
land-0.022
 hey-0.021
 exception-0.021
ware-0.020
Token butterfly
Token ID35113
Feature activation-1.28e+00

Pos logit contributions
aman+0.013
land+0.012
ware+0.012
 exception+0.012
 hey+0.012
Neg logit contributions
 ever-0.012
 onwards-0.011
unts-0.010
 fright-0.010
 grin-0.009
Token!"
Token ID2474
Feature activation-2.76e-01

Pos logit contributions
aman+0.064
ware+0.063
 hey+0.061
 extingu+0.061
lves+0.060
Neg logit contributions
 ever-0.030
 onwards-0.019
 fright-0.018
unts-0.018
kick-0.016
Token he
Token ID339
Feature activation-1.28e-01

Pos logit contributions
aman+0.011
land+0.010
 hey+0.010
ware+0.010
 exception+0.010
Neg logit contributions
 ever-0.009
 onwards-0.008
unts-0.007
 fright-0.007
 onward-0.007
Token eventually
Token ID4191
Feature activation-4.37e-01

Pos logit contributions
aman+0.007
land+0.007
 hey+0.007
ware+0.007
 exception+0.007
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 grin-0.005
Token,
Token ID11
Feature activation-3.67e-01

Pos logit contributions
aman+0.018
land+0.017
 hey+0.017
lves+0.017
 exception+0.017
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.010
 fright-0.010
 onward-0.010
Token he
Token ID339
Feature activation+4.84e-01

Pos logit contributions
aman+0.014
land+0.014
 hey+0.014
ware+0.014
 exception+0.013
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.009
 onward-0.009
Token saw
Token ID2497
Feature activation-1.27e+00

Pos logit contributions
 ever+0.015
unts+0.013
 onwards+0.013
 fright+0.013
keeping+0.012
Neg logit contributions
aman-0.019
land-0.018
ware-0.017
 hey-0.017
 exception-0.017
Token a
Token ID257
Feature activation+6.66e-01

Pos logit contributions
aman+0.052
lves+0.049
 exception+0.048
 hey+0.048
land+0.047
Neg logit contributions
 ever-0.042
 onwards-0.034
 fright-0.032
 it-0.032
unts-0.030
Token big
Token ID1263
Feature activation+1.67e+00

Pos logit contributions
 ever+0.026
 onwards+0.023
unts+0.022
keeping+0.021
 onward+0.021
Neg logit contributions
aman-0.022
land-0.021
ware-0.020
 hey-0.020
 prett-0.019
Token muscle
Token ID8280
Feature activation+2.39e-01

Pos logit contributions
 ever+0.071
 onwards+0.067
keeping+0.063
 onward+0.061
unts+0.060
Neg logit contributions
aman-0.049
 extingu-0.044
 station-0.042
land-0.042
 prett-0.042
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
 fright+0.007
 onward+0.006
Neg logit contributions
aman-0.009
land-0.008
ware-0.008
 hey-0.008
 exception-0.008
Token It
Token ID632
Feature activation-1.18e-02

Pos logit contributions
 ever+0.007
unts+0.006
 onwards+0.006
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.013
land-0.012
ware-0.012
 hey-0.012
 whose-0.012
Token was
Token ID373
Feature activation+2.49e-01

Pos logit contributions
aman+0.000
land+0.000
ware+0.000
 hey+0.000
 exception+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 onward-0.000
Token moving
Token ID3867
Feature activation+8.53e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.007
 fright+0.006
keeping+0.006
Neg logit contributions
aman-0.010
land-0.009
 exception-0.009
ware-0.009
 prett-0.009
Token to
Token ID284
Feature activation-3.39e-01

Pos logit contributions
 ever+0.013
 onwards+0.011
unts+0.011
 fright+0.010
 grin+0.010
Neg logit contributions
aman-0.017
land-0.017
ware-0.016
 exception-0.016
 hey-0.016
Token disappear
Token ID10921
Feature activation-2.20e-01

Pos logit contributions
aman+0.014
land+0.013
ware+0.013
 hey+0.013
lves+0.013
Neg logit contributions
 ever-0.011
 onwards-0.008
unts-0.008
 fright-0.008
 grin-0.007
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.005
 fright-0.005
 onward-0.005
Token She
Token ID1375
Feature activation+5.64e-01

Pos logit contributions
aman+0.010
land+0.009
 hey+0.009
ware+0.009
 exception+0.009
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 grin-0.005
Token knew
Token ID2993
Feature activation-7.23e-01

Pos logit contributions
 ever+0.018
 onwards+0.016
unts+0.016
 onward+0.015
keeping+0.015
Neg logit contributions
aman-0.021
land-0.020
 whose-0.020
ware-0.020
 hey-0.020
Token he
Token ID339
Feature activation+1.66e+00

Pos logit contributions
aman+0.026
land+0.025
ware+0.025
 extingu+0.025
 exception+0.025
Neg logit contributions
 ever-0.027
 onwards-0.022
 fright-0.020
 it-0.020
 onward-0.020
Token wasn
Token ID2492
Feature activation+9.45e-01

Pos logit contributions
unts+0.058
 onwards+0.055
 grin+0.055
keeping+0.053
 onward+0.052
Neg logit contributions
lves-0.050
 who-0.050
ray-0.050
 whose-0.049
land-0.048
Token't
Token ID470
Feature activation-3.02e-01

Pos logit contributions
 ever+0.027
keeping+0.025
unts+0.025
 onwards+0.024
 grin+0.024
Neg logit contributions
aman-0.036
 hey-0.035
lves-0.034
 exception-0.034
land-0.033
Token really
Token ID1107
Feature activation-5.75e-01

Pos logit contributions
aman+0.013
land+0.012
ware+0.012
 hey+0.012
 exception+0.012
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.007
 grin-0.006
Token gone
Token ID3750
Feature activation+3.70e-01

Pos logit contributions
aman+0.025
ware+0.024
land+0.023
 hey+0.023
lves+0.023
Neg logit contributions
 ever-0.018
 onwards-0.014
unts-0.013
 fright-0.013
 grin-0.012
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
 ever+0.011
unts+0.010
 onwards+0.010
 fright+0.009
 grin+0.009
Neg logit contributions
aman-0.014
land-0.014
 exception-0.014
 hey-0.013
ware-0.013
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
aman+0.008
land+0.008
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 onward-0.005
Token But
Token ID887
Feature activation+1.69e-01

Pos logit contributions
 ever+0.008
 onwards+0.006
unts+0.006
 fright+0.006
 grin+0.006
Neg logit contributions
aman-0.009
land-0.008
 exception-0.008
 hey-0.008
 whose-0.008
Token Tom
Token ID4186
Feature activation+4.96e-01

Pos logit contributions
 ever+0.006
unts+0.006
 onwards+0.006
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.006
 hey-0.005
land-0.005
 exception-0.005
 whose-0.005
Token was
Token ID373
Feature activation+7.53e-02

Pos logit contributions
 ever+0.016
 onwards+0.014
unts+0.014
 fright+0.014
keeping+0.013
Neg logit contributions
aman-0.019
lves-0.018
 hey-0.017
land-0.017
 whose-0.017
Token upset
Token ID9247
Feature activation-5.03e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.003
 hey-0.003
land-0.003
 exception-0.003
lves-0.003
Token because
Token ID780
Feature activation+1.89e+00

Pos logit contributions
aman+0.023
lves+0.022
land+0.022
ware+0.021
 hey+0.021
Neg logit contributions
 ever-0.015
 onwards-0.011
unts-0.011
 fright-0.010
 onward-0.009
Token he
Token ID339
Feature activation+8.94e-01

Pos logit contributions
unts+0.097
 grin+0.095
 fright+0.091
keeping+0.089
 ever+0.087
Neg logit contributions
 whose-0.030
 who-0.028
 where-0.027
 salon-0.026
 living-0.026
Token could
Token ID714
Feature activation+3.67e-01

Pos logit contributions
 onwards+0.028
 ever+0.028
 onward+0.026
 grin+0.026
unts+0.026
Neg logit contributions
lves-0.032
 hey-0.030
 whose-0.030
aman-0.030
ware-0.029
Token not
Token ID407
Feature activation+7.59e-01

Pos logit contributions
 ever+0.012
 onwards+0.011
unts+0.010
 onward+0.010
 fright+0.010
Neg logit contributions
aman-0.013
 hey-0.013
 exception-0.013
ware-0.013
land-0.012
Token find
Token ID1064
Feature activation-1.28e+00

Pos logit contributions
 onwards+0.029
 ever+0.029
unts+0.027
 onward+0.027
 fright+0.025
Neg logit contributions
aman-0.022
 hey-0.021
 exception-0.021
land-0.020
 extingu-0.019
Token his
Token ID465
Feature activation+1.39e-01

Pos logit contributions
aman+0.059
land+0.056
 hey+0.053
aching+0.053
 exception+0.053
Neg logit contributions
 ever-0.038
 fright-0.029
 onwards-0.027
 it-0.027
unts-0.025
Token their
Token ID511
Feature activation-3.75e-01

Pos logit contributions
aman+0.036
land+0.035
 hey+0.034
 exception+0.033
lves+0.033
Neg logit contributions
 ever-0.024
unts-0.020
 onwards-0.018
 fright-0.018
 sideways-0.017
Token toys
Token ID14958
Feature activation-1.61e+00

Pos logit contributions
aman+0.012
land+0.011
 hey+0.011
 exception+0.011
lves+0.011
Neg logit contributions
 ever-0.015
 onwards-0.013
unts-0.013
 fright-0.012
 grin-0.012
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
aman+0.089
lves+0.084
land+0.083
 exception+0.082
 hey+0.082
Neg logit contributions
 ever-0.037
 on-0.027
 onwards-0.024
 sideways-0.019
 onward-0.019
Token They
Token ID1119
Feature activation+3.24e-01

Pos logit contributions
aman+0.030
 hey+0.029
land+0.029
lves+0.028
 exception+0.028
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.008
 sideways-0.008
Token like
Token ID588
Feature activation-2.91e-01

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.008
 fright+0.008
 onward+0.008
Neg logit contributions
 hey-0.012
aman-0.012
 exception-0.012
ware-0.012
lves-0.012
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
aman+0.015
land+0.014
 hey+0.014
 exception+0.014
lves+0.014
Neg logit contributions
 ever-0.006
 onwards-0.004
unts-0.004
 fright-0.004
 onward-0.003
Token count
Token ID954
Feature activation-1.43e-01

Pos logit contributions
 ever+0.014
 onwards+0.012
unts+0.012
 onward+0.011
 fright+0.011
Neg logit contributions
aman-0.009
ware-0.009
 hey-0.009
 exception-0.009
land-0.009
Token them
Token ID606
Feature activation-4.77e-01

Pos logit contributions
aman+0.006
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token and
Token ID290
Feature activation+7.59e-02

Pos logit contributions
aman+0.021
 hey+0.020
lves+0.020
land+0.020
 exception+0.020
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.011
 fright-0.010
 onward-0.009
Token say
Token ID910
Feature activation+4.04e-01

Pos logit contributions
 ever+0.003
 onwards+0.002
unts+0.002
 fright+0.002
 onward+0.002
Neg logit contributions
aman-0.003
land-0.003
 hey-0.003
ware-0.003
 exception-0.003
Token their
Token ID511
Feature activation-4.57e-01

Pos logit contributions
 ever+0.014
 onwards+0.012
unts+0.012
 fright+0.011
keeping+0.011
Neg logit contributions
land-0.015
aman-0.014
ware-0.014
 hey-0.014
 whose-0.013
Token this
Token ID428
Feature activation-1.18e+00

Pos logit contributions
aman+0.012
land+0.011
 hey+0.011
ware+0.011
 exception+0.011
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token is
Token ID318
Feature activation-7.82e-01

Pos logit contributions
 hey+0.049
aman+0.048
lves+0.046
 exception+0.045
 extingu+0.045
Neg logit contributions
 ever-0.037
 onwards-0.030
unts-0.028
 fright-0.027
 it-0.027
Token our
Token ID674
Feature activation-9.44e-01

Pos logit contributions
aman+0.034
ware+0.032
land+0.032
 hey+0.032
lves+0.031
Neg logit contributions
 ever-0.025
 onwards-0.019
 fright-0.018
unts-0.017
 mad-0.016
Token land
Token ID1956
Feature activation+7.95e-01

Pos logit contributions
aman+0.035
lves+0.033
 hey+0.033
ware+0.032
 exception+0.032
Neg logit contributions
 ever-0.034
 onwards-0.026
 fright-0.026
unts-0.026
 grin-0.024
Token and
Token ID290
Feature activation-9.64e-03

Pos logit contributions
 ever+0.028
unts+0.026
 onwards+0.026
 fright+0.024
 grin+0.024
Neg logit contributions
aman-0.027
land-0.026
 exception-0.024
 hey-0.024
ware-0.024
Token we
Token ID356
Feature activation+6.95e-01

Pos logit contributions
aman+0.000
land+0.000
 hey+0.000
ware+0.000
 exception+0.000
Neg logit contributions
 ever-0.000
 onwards-0.000
unts-0.000
 fright-0.000
 grin-0.000
Token don
Token ID836
Feature activation+1.60e+00

Pos logit contributions
 ever+0.020
 onwards+0.018
unts+0.017
 onward+0.016
 fright+0.016
Neg logit contributions
aman-0.028
 hey-0.027
lves-0.027
 exception-0.027
land-0.026
Tokenâ
Token ID22940
Feature activation+1.35e-01

Pos logit contributions
 ever+0.053
 grin+0.049
keeping+0.049
 onwards+0.047
 sideways+0.047
Neg logit contributions
aman-0.054
 hey-0.050
aching-0.049
 exception-0.049
land-0.049
Token€
Token ID26391
Feature activation+8.33e-01

Pos logit contributions
 ever+0.005
 onwards+0.004
unts+0.004
 fright+0.004
 grin+0.004
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
 exception-0.005
lves-0.005
Tokenâ„¢
Token ID8151
Feature activation+8.84e-01

Pos logit contributions
 ever+0.028
unts+0.024
 fright+0.024
 onwards+0.024
 grin+0.023
Neg logit contributions
land-0.030
aman-0.029
 hey-0.028
lves-0.028
 whose-0.027
Tokent
Token ID83
Feature activation+1.09e+00

Pos logit contributions
 ever+0.023
unts+0.020
 fright+0.020
 grin+0.019
 onwards+0.019
Neg logit contributions
aman-0.038
land-0.036
 exception-0.035
lves-0.035
 hey-0.035
Token and
Token ID290
Feature activation+5.43e-01

Pos logit contributions
 ever+0.025
unts+0.022
 fright+0.021
 onwards+0.021
keeping+0.019
Neg logit contributions
aman-0.034
land-0.034
 whose-0.032
 exception-0.031
 hey-0.031
Token joy
Token ID8716
Feature activation-1.16e+00

Pos logit contributions
 ever+0.016
unts+0.015
 onwards+0.014
 fright+0.014
kick+0.013
Neg logit contributions
aman-0.022
 hey-0.020
 whose-0.020
land-0.020
ware-0.020
Tokenfully
Token ID2759
Feature activation+1.21e-01

Pos logit contributions
aman+0.048
lves+0.047
 hey+0.046
 exception+0.045
ware+0.045
Neg logit contributions
 ever-0.040
 onwards-0.030
 onward-0.027
 grin-0.026
unts-0.026
Token returned
Token ID4504
Feature activation+8.64e-01

Pos logit contributions
 ever+0.004
 onwards+0.003
unts+0.003
 fright+0.003
 onward+0.003
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
lves-0.005
ware-0.005
Token home
Token ID1363
Feature activation+2.57e-01

Pos logit contributions
 ever+0.026
unts+0.025
 onwards+0.023
 grin+0.022
 Amen+0.022
Neg logit contributions
aman-0.033
land-0.032
 whose-0.030
 hey-0.030
lves-0.030
Token with
Token ID351
Feature activation+1.05e-02

Pos logit contributions
 ever+0.008
unts+0.007
 onwards+0.007
 fright+0.007
 grin+0.006
Neg logit contributions
aman-0.010
land-0.009
ware-0.009
 hey-0.009
 exception-0.009
Token his
Token ID465
Feature activation-3.07e-01

Pos logit contributions
 ever+0.000
 onwards+0.000
unts+0.000
 fright+0.000
 grin+0.000
Neg logit contributions
aman-0.000
land-0.000
 hey-0.000
ware-0.000
 exception-0.000
Token new
Token ID649
Feature activation-7.04e-01

Pos logit contributions
aman+0.009
land+0.008
 hey+0.008
 exception+0.008
ware+0.008
Neg logit contributions
 ever-0.014
 onwards-0.012
unts-0.012
 fright-0.011
 grin-0.011
Token toy
Token ID13373
Feature activation-2.44e-01

Pos logit contributions
aman+0.023
land+0.022
 hey+0.021
lves+0.020
 exception+0.020
Neg logit contributions
 ever-0.029
 onwards-0.025
unts-0.024
 fright-0.024
 grin-0.023
Token.
Token ID13
Feature activation+3.86e-01

Pos logit contributions
aman+0.010
land+0.009
 hey+0.009
 exception+0.009
ware+0.009
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token Now
Token ID2735
Feature activation-5.35e-01

Pos logit contributions
 ever+0.012
unts+0.010
 onwards+0.010
 fright+0.010
 grin+0.009
Neg logit contributions
aman-0.016
land-0.014
 whose-0.014
 hey-0.014
lves-0.014
Token have
Token ID423
Feature activation-3.87e-01

Pos logit contributions
aman+0.015
land+0.015
ware+0.014
 hey+0.014
 exception+0.014
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.006
 grin-0.006
Token fun
Token ID1257
Feature activation+5.14e-02

Pos logit contributions
aman+0.015
land+0.014
ware+0.014
 hey+0.014
 exception+0.014
Neg logit contributions
 ever-0.013
unts-0.011
 onwards-0.011
 fright-0.010
 grin-0.010
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 ever+0.002
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
ware-0.002
 exception-0.002
Token The
Token ID383
Feature activation+1.75e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
 fright+0.007
unts+0.007
 grin+0.006
Neg logit contributions
aman-0.009
land-0.008
 hey-0.008
 whose-0.008
 exception-0.008
Token beach
Token ID10481
Feature activation+8.74e-03

Pos logit contributions
 ever+0.007
 onwards+0.007
unts+0.006
 fright+0.006
 onward+0.006
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
ware-0.005
 prett-0.005
Token is
Token ID318
Feature activation+2.12e-01

Pos logit contributions
 ever+0.000
 onwards+0.000
unts+0.000
 fright+0.000
 onward+0.000
Neg logit contributions
aman-0.000
land-0.000
ware-0.000
 hey-0.000
 exception-0.000
Token great
Token ID1049
Feature activation+8.68e-02

Pos logit contributions
 ever+0.007
 onwards+0.006
unts+0.006
 fright+0.006
keeping+0.006
Neg logit contributions
aman-0.008
land-0.007
 exception-0.007
 prett-0.007
 hey-0.007
Token.
Token ID13
Feature activation+3.13e-01

Pos logit contributions
 ever+0.003
 onwards+0.002
unts+0.002
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.003
land-0.003
ware-0.003
 exception-0.003
 hey-0.003
Token<|endoftext|>
Token ID50256
Feature activation+7.57e-01

Pos logit contributions
 ever+0.011
 onwards+0.009
 fright+0.009
unts+0.009
 grin+0.008
Neg logit contributions
aman-0.011
land-0.011
 hey-0.011
 whose-0.011
 exception-0.011
TokenTim
Token ID14967
Feature activation-2.19e-01

Pos logit contributions
 ever+0.042
 fright+0.038
 grin+0.038
 onwards+0.038
unts+0.038
Neg logit contributions
land-0.010
ware-0.010
aman-0.009
 whose-0.009
 hey-0.008
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
aman+0.010
land+0.010
 hey+0.010
 exception+0.010
ware+0.010
Neg logit contributions
 ever-0.006
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token.
Token ID13
Feature activation+7.28e-01

Pos logit contributions
 ever+0.030
 fright+0.027
 onwards+0.026
unts+0.026
 grin+0.025
Neg logit contributions
land-0.026
aman-0.025
ware-0.024
 exception-0.024
 whose-0.023
Token She
Token ID1375
Feature activation+4.83e-01

Pos logit contributions
 ever+0.025
 onwards+0.020
unts+0.020
 fright+0.020
keeping+0.018
Neg logit contributions
aman-0.027
land-0.026
ware-0.026
 whose-0.026
 hey-0.025
Token uses
Token ID3544
Feature activation+8.09e-01

Pos logit contributions
 ever+0.017
 onwards+0.015
keeping+0.014
unts+0.014
 fright+0.014
Neg logit contributions
aman-0.017
land-0.016
ware-0.016
 exception-0.016
 prett-0.016
Token her
Token ID607
Feature activation+8.32e-01

Pos logit contributions
 ever+0.029
 onwards+0.026
unts+0.024
keeping+0.024
 grin+0.023
Neg logit contributions
ware-0.026
aman-0.026
 prett-0.025
 whose-0.025
land-0.025
Token brain
Token ID3632
Feature activation-7.61e-02

Pos logit contributions
 ever+0.029
 onwards+0.025
unts+0.025
keeping+0.024
 onward+0.023
Neg logit contributions
ware-0.028
 prett-0.028
land-0.027
 whose-0.027
aman-0.027
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
aman+0.004
land+0.004
ware+0.003
 hey+0.003
 exception+0.003
Neg logit contributions
 ever-0.002
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token She
Token ID1375
Feature activation+1.00e-01

Pos logit contributions
 ever+0.009
 onwards+0.007
unts+0.007
 fright+0.007
keeping+0.006
Neg logit contributions
aman-0.012
land-0.012
ware-0.012
 hey-0.011
 exception-0.011
Token is
Token ID318
Feature activation-1.05e-01

Pos logit contributions
 ever+0.003
 onwards+0.003
unts+0.003
 fright+0.003
 onward+0.003
Neg logit contributions
aman-0.004
land-0.004
ware-0.003
 hey-0.003
 exception-0.003
Token clever
Token ID14169
Feature activation+8.50e-01

Pos logit contributions
aman+0.004
land+0.004
 hey+0.004
 exception+0.004
ware+0.004
Neg logit contributions
 ever-0.003
 onwards-0.003
unts-0.002
 fright-0.002
 grin-0.002
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 ever+0.029
 onwards+0.027
unts+0.026
 fright+0.026
 onward+0.025
Neg logit contributions
land-0.029
 exception-0.027
 whose-0.027
aman-0.027
ware-0.027
Token She
Token ID1375
Feature activation-1.03e-01

Pos logit contributions
 ever+0.012
 onwards+0.009
unts+0.009
 fright+0.009
keeping+0.009
Neg logit contributions
aman-0.015
land-0.014
 hey-0.014
ware-0.014
 exception-0.014
Token He
Token ID679
Feature activation+3.77e-01

Pos logit contributions
 ever+0.008
unts+0.006
 onwards+0.006
 fright+0.006
kick+0.006
Neg logit contributions
aman-0.011
land-0.011
ware-0.010
 hey-0.010
 exception-0.010
Token said
Token ID531
Feature activation+2.89e-01

Pos logit contributions
 ever+0.013
 onwards+0.011
unts+0.011
keeping+0.010
 fright+0.010
Neg logit contributions
aman-0.014
land-0.013
 whose-0.013
lves-0.013
 hey-0.013
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
 ever+0.009
unts+0.008
 onwards+0.008
 fright+0.007
keeping+0.007
Neg logit contributions
aman-0.011
land-0.011
 hey-0.011
ware-0.011
 whose-0.011
Token "
Token ID366
Feature activation-4.88e-01

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 onward-0.001
TokenWell
Token ID5779
Feature activation-1.33e-01

Pos logit contributions
aman+0.018
ware+0.017
land+0.017
 exception+0.017
 hey+0.017
Neg logit contributions
 ever-0.017
 onwards-0.014
unts-0.014
 fright-0.014
 onward-0.013
Token,
Token ID11
Feature activation-6.63e-01

Pos logit contributions
aman+0.006
land+0.006
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token this
Token ID428
Feature activation-1.38e+00

Pos logit contributions
aman+0.024
ware+0.023
land+0.023
lves+0.023
 exception+0.023
Neg logit contributions
 ever-0.024
 onwards-0.020
 fright-0.019
unts-0.018
 grin-0.018
Token diamond
Token ID15291
Feature activation-6.69e-01

Pos logit contributions
aman+0.056
 hey+0.055
lves+0.054
 exception+0.053
 extingu+0.052
Neg logit contributions
 ever-0.047
 onwards-0.036
 fright-0.034
 it-0.033
 on-0.032
Token is
Token ID318
Feature activation-2.46e-01

Pos logit contributions
aman+0.028
 hey+0.026
land+0.026
 exception+0.026
lves+0.026
Neg logit contributions
 ever-0.021
 onwards-0.016
unts-0.016
 fright-0.015
 grin-0.014
Token mine
Token ID6164
Feature activation+5.35e-01

Pos logit contributions
aman+0.010
land+0.009
 hey+0.009
ware+0.009
 exception+0.009
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 onward-0.006
Token."
Token ID526
Feature activation+4.14e-01

Pos logit contributions
 ever+0.015
unts+0.014
 onwards+0.014
 fright+0.013
 grin+0.012
Neg logit contributions
aman-0.021
land-0.021
ware-0.020
 exception-0.020
 hey-0.020
Token Sometimes
Token ID8975
Feature activation+5.90e-02

Pos logit contributions
aman+0.012
land+0.011
ware+0.011
 hey+0.011
 exception+0.011
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 onward-0.005
Token there
Token ID612
Feature activation+1.36e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
land-0.002
 hey-0.002
 exception-0.002
ware-0.002
Token are
Token ID389
Feature activation-7.68e-01

Pos logit contributions
 ever+0.004
 onwards+0.004
unts+0.004
 fright+0.003
 grin+0.003
Neg logit contributions
aman-0.005
land-0.005
 hey-0.005
ware-0.005
 exception-0.005
Token too
Token ID1165
Feature activation+5.23e-01

Pos logit contributions
aman+0.030
 hey+0.027
ware+0.027
land+0.027
lves+0.027
Neg logit contributions
 ever-0.027
 onwards-0.022
unts-0.021
 fright-0.021
kick-0.019
Token many
Token ID867
Feature activation-5.00e-02

Pos logit contributions
 ever+0.014
 onwards+0.012
 grin+0.012
unts+0.011
keeping+0.011
Neg logit contributions
land-0.022
aman-0.022
 whose-0.021
 exception-0.021
lves-0.021
Token cars
Token ID5006
Feature activation-5.22e-01

Pos logit contributions
aman+0.002
land+0.002
 hey+0.002
 exception+0.002
ware+0.002
Neg logit contributions
 ever-0.002
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.001
Token on
Token ID319
Feature activation-4.76e-01

Pos logit contributions
aman+0.024
land+0.022
ware+0.022
 hey+0.022
 exception+0.022
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.011
 fright-0.010
 onward-0.009
Token the
Token ID262
Feature activation-3.87e-01

Pos logit contributions
aman+0.019
 hey+0.018
land+0.018
lves+0.018
 exception+0.018
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.012
 fright-0.012
 onward-0.011
Token road
Token ID2975
Feature activation-2.96e-01

Pos logit contributions
aman+0.013
 hey+0.012
ware+0.012
land+0.012
 exception+0.012
Neg logit contributions
 ever-0.015
 onwards-0.013
unts-0.013
 fright-0.012
 onward-0.012
Token and
Token ID290
Feature activation-3.00e-02

Pos logit contributions
aman+0.012
land+0.012
 hey+0.012
ware+0.011
lves+0.011
Neg logit contributions
 ever-0.009
 onwards-0.007
unts-0.007
 fright-0.007
 onward-0.006
Token they
Token ID484
Feature activation-1.17e-01

Pos logit contributions
aman+0.001
land+0.001
 hey+0.001
ware+0.001
 exception+0.001
Neg logit contributions
 ever-0.001
 onwards-0.001
unts-0.001
 fright-0.001
 grin-0.001
Token.
Token ID13
Feature activation-9.02e-01

Pos logit contributions
 ever+0.029
unts+0.026
 onwards+0.025
 grin+0.025
 Amen+0.025
Neg logit contributions
land-0.030
 exception-0.027
lves-0.027
aman-0.026
ware-0.026
Token They
Token ID1119
Feature activation+6.87e-02

Pos logit contributions
aman+0.041
lves+0.039
land+0.039
 hey+0.038
ware+0.038
Neg logit contributions
 ever-0.024
 onwards-0.021
unts-0.019
 fright-0.018
 onward-0.018
Token were
Token ID547
Feature activation+7.90e-01

Pos logit contributions
 ever+0.002
 onwards+0.002
unts+0.002
 fright+0.001
keeping+0.001
Neg logit contributions
aman-0.003
land-0.003
 hey-0.003
ware-0.003
 exception-0.003
Token having
Token ID1719
Feature activation+1.32e+00

Pos logit contributions
 ever+0.024
 fright+0.021
keeping+0.020
 onwards+0.020
unts+0.020
Neg logit contributions
 prett-0.030
 whose-0.029
 exception-0.029
lves-0.029
 hey-0.028
Token fun
Token ID1257
Feature activation-3.82e-01

Pos logit contributions
 ever+0.056
 onwards+0.053
unts+0.052
 Amen+0.051
 grin+0.051
Neg logit contributions
 prett-0.033
land-0.032
 whose-0.031
aman-0.029
 hey-0.028
Token.
Token ID13
Feature activation-9.76e-01

Pos logit contributions
aman+0.017
land+0.016
lves+0.015
ware+0.015
 hey+0.015
Neg logit contributions
 ever-0.012
 onwards-0.009
unts-0.009
 fright-0.009
 onward-0.008
Token But
Token ID887
Feature activation-9.32e-01

Pos logit contributions
aman+0.047
land+0.045
lves+0.045
ware+0.045
 hey+0.044
Neg logit contributions
 ever-0.023
 onwards-0.020
unts-0.018
 onward-0.016
 fright-0.016
Token then
Token ID788
Feature activation-3.84e-01

Pos logit contributions
aman+0.036
land+0.036
lves+0.034
ware+0.034
 extingu+0.034
Neg logit contributions
 ever-0.033
 onwards-0.027
 it-0.023
 onward-0.023
 on-0.023
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
aman+0.017
land+0.016
lves+0.016
ware+0.016
 exception+0.016
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.008
Token the
Token ID262
Feature activation+3.32e-01

Pos logit contributions
aman+0.060
lves+0.057
land+0.057
 exception+0.055
ilst+0.054
Neg logit contributions
 ever-0.040
 onwards-0.031
 it-0.030
 on-0.028
 fright-0.026
Token sky
Token ID6766
Feature activation+5.18e-01

Pos logit contributions
 ever+0.013
 onwards+0.011
unts+0.011
 fright+0.010
 onward+0.010
Neg logit contributions
aman-0.011
land-0.011
ware-0.010
 hey-0.010
 prett-0.010
Token with
Token ID351
Feature activation+1.16e-01

Pos logit contributions
aman+0.029
lves+0.027
land+0.027
 hey+0.027
ware+0.026
Neg logit contributions
 ever-0.016
 onwards-0.012
unts-0.011
 fright-0.011
kick-0.010
Token his
Token ID465
Feature activation-2.45e-01

Pos logit contributions
 ever+0.004
 onwards+0.003
unts+0.003
 fright+0.003
 grin+0.003
Neg logit contributions
aman-0.004
 hey-0.004
ware-0.004
 exception-0.004
land-0.004
Token mom
Token ID1995
Feature activation-5.13e-01

Pos logit contributions
aman+0.011
land+0.010
 hey+0.010
 exception+0.010
ware+0.010
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.005
 grin-0.005
Token and
Token ID290
Feature activation-8.25e-01

Pos logit contributions
aman+0.024
 hey+0.023
lves+0.023
land+0.023
ware+0.022
Neg logit contributions
 ever-0.014
 onwards-0.011
unts-0.010
 fright-0.009
kick-0.009
Token dad
Token ID9955
Feature activation-6.20e-01

Pos logit contributions
land+0.038
aman+0.037
ware+0.036
 exception+0.035
lves+0.035
Neg logit contributions
 ever-0.023
 onwards-0.017
unts-0.017
 fright-0.016
 excited-0.015
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
aman+0.029
lves+0.027
land+0.027
 hey+0.027
ware+0.027
Neg logit contributions
 ever-0.018
 onwards-0.014
unts-0.013
 fright-0.012
 onward-0.011
Token He
Token ID679
Feature activation-3.18e-01

Pos logit contributions
aman+0.025
land+0.024
 hey+0.024
ware+0.024
lves+0.023
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.008
 onward-0.007
Token was
Token ID373
Feature activation+2.90e-01

Pos logit contributions
aman+0.013
land+0.013
 hey+0.013
ware+0.012
 exception+0.012
Neg logit contributions
 ever-0.010
 onwards-0.008
unts-0.008
 fright-0.007
 onward-0.007
Token very
Token ID845
Feature activation+3.56e-01

Pos logit contributions
 ever+0.008
 onwards+0.007
unts+0.006
 fright+0.006
 grin+0.006
Neg logit contributions
aman-0.012
 exception-0.012
 hey-0.012
 prett-0.012
lves-0.012
Token excited
Token ID6568
Feature activation-1.12e-01

Pos logit contributions
 ever+0.009
 onwards+0.008
 fright+0.007
keeping+0.007
unts+0.007
Neg logit contributions
aman-0.015
 exception-0.015
lves-0.015
 hey-0.015
 prett-0.015
Token to
Token ID284
Feature activation+6.84e-01

Pos logit contributions
aman+0.005
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.003
 onwards-0.002
unts-0.002
 fright-0.002
 grin-0.002
Token they
Token ID484
Feature activation+1.93e-01

Pos logit contributions
aman+0.005
land+0.004
 hey+0.004
ware+0.004
 exception+0.004
Neg logit contributions
 ever-0.004
 onwards-0.003
unts-0.003
 fright-0.003
 grin-0.003
Token were
Token ID547
Feature activation-8.29e-01

Pos logit contributions
 ever+0.005
 onwards+0.005
unts+0.004
 fright+0.004
 onward+0.004
Neg logit contributions
aman-0.008
land-0.008
 hey-0.008
lves-0.008
 exception-0.008
Token going
Token ID1016
Feature activation+4.67e-01

Pos logit contributions
aman+0.037
ware+0.036
land+0.035
lves+0.035
 hey+0.034
Neg logit contributions
 ever-0.024
 onwards-0.019
 fright-0.017
unts-0.017
 onward-0.016
Token up
Token ID510
Feature activation+4.27e-01

Pos logit contributions
 ever+0.013
unts+0.011
 onwards+0.010
 grin+0.010
 fright+0.010
Neg logit contributions
aman-0.020
land-0.020
 exception-0.019
 hey-0.019
lves-0.019
Token so
Token ID523
Feature activation+2.02e-01

Pos logit contributions
 ever+0.012
unts+0.011
 onwards+0.010
 fright+0.009
 grin+0.009
Neg logit contributions
aman-0.018
land-0.017
 exception-0.017
ware-0.017
 whose-0.016
Token much
Token ID881
Feature activation-7.53e-01

Pos logit contributions
 ever+0.007
unts+0.006
 onwards+0.006
 fright+0.005
 grin+0.005
Neg logit contributions
aman-0.008
 hey-0.007
 exception-0.007
land-0.007
lves-0.007
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
aman+0.031
lves+0.030
 hey+0.030
ware+0.030
land+0.029
Neg logit contributions
 ever-0.024
 onwards-0.019
unts-0.018
 fright-0.018
 onward-0.016
Token His
Token ID2399
Feature activation+1.81e-01

Pos logit contributions
aman+0.010
land+0.009
 hey+0.009
ware+0.009
 exception+0.009
Neg logit contributions
 ever-0.007
 onwards-0.006
unts-0.006
 fright-0.006
 onward-0.005
Token human
Token ID1692
Feature activation+2.82e-02

Pos logit contributions
 ever+0.007
 onwards+0.006
unts+0.006
 fright+0.006
keeping+0.006
Neg logit contributions
aman-0.006
land-0.005
 exception-0.005
 hey-0.005
ware-0.005
Token said
Token ID531
Feature activation+1.50e-01

Pos logit contributions
 ever+0.001
 onwards+0.001
unts+0.001
 fright+0.001
 onward+0.001
Neg logit contributions
aman-0.001
land-0.001
 hey-0.001
ware-0.001
 exception-0.001
Token,
Token ID11
Feature activation-7.91e-02

Pos logit contributions
 ever+0.005
unts+0.004
 onwards+0.004
 fright+0.004
keeping+0.004
Neg logit contributions
aman-0.006
land-0.005
 hey-0.005
ware-0.005
 exception-0.005
Token went
Token ID1816
Feature activation+5.02e-01

Pos logit contributions
 ever+0.027
unts+0.025
 onwards+0.023
 fright+0.023
keeping+0.022
Neg logit contributions
aman-0.031
land-0.030
 hey-0.029
ware-0.029
 prett-0.029
Token to
Token ID284
Feature activation+5.62e-03

Pos logit contributions
 ever+0.016
unts+0.013
 fright+0.013
 onwards+0.012
 grin+0.012
Neg logit contributions
land-0.020
aman-0.019
 exception-0.019
ware-0.019
 hey-0.018
Token bed
Token ID3996
Feature activation-1.01e+00

Pos logit contributions
 ever+0.000
 onwards+0.000
unts+0.000
 fright+0.000
 onward+0.000
Neg logit contributions
aman-0.000
land-0.000
 hey-0.000
 exception-0.000
ware-0.000
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
aman+0.046
lves+0.045
 hey+0.044
 exception+0.043
 extingu+0.042
Neg logit contributions
 ever-0.028
 onwards-0.023
unts-0.020
 onward-0.019
 grin-0.019
Token She
Token ID1375
Feature activation+3.28e-01

Pos logit contributions
aman+0.008
land+0.008
 hey+0.008
ware+0.008
 exception+0.008
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.003
Token dream
Token ID4320
Feature activation-1.85e+00

Pos logit contributions
 ever+0.010
 onwards+0.009
unts+0.009
 fright+0.008
keeping+0.008
Neg logit contributions
aman-0.013
land-0.012
ware-0.012
 hey-0.012
 exception-0.012
Tokent
Token ID83
Feature activation-2.35e-01

Pos logit contributions
 extingu+0.104
lves+0.103
 whose+0.103
aman+0.102
ware+0.101
Neg logit contributions
 ever-0.039
 onwards-0.023
 onward-0.017
te-0.016
ay-0.016
Token about
Token ID546
Feature activation-1.84e-01

Pos logit contributions
aman+0.011
land+0.010
 hey+0.010
ware+0.010
lves+0.010
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.004
 onward-0.004
Token sitting
Token ID5586
Feature activation+7.22e-01

Pos logit contributions
aman+0.007
land+0.007
 hey+0.007
ware+0.007
 exception+0.007
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 grin-0.004
Token on
Token ID319
Feature activation+2.76e-01

Pos logit contributions
 ever+0.021
unts+0.019
 fright+0.017
 onwards+0.016
 grin+0.016
Neg logit contributions
aman-0.029
land-0.029
ware-0.028
 prett-0.028
 hey-0.028
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
 ever+0.010
unts+0.008
 onwards+0.008
 fright+0.008
 grin+0.008
Neg logit contributions
aman-0.010
land-0.010
ware-0.009
 exception-0.009
 hey-0.009
Token't
Token ID470
Feature activation+2.43e-01

Pos logit contributions
 ever+0.025
unts+0.023
 grin+0.022
 onwards+0.021
keeping+0.021
Neg logit contributions
aman-0.033
 hey-0.033
land-0.031
lves-0.030
 exception-0.029
Token worry
Token ID5490
Feature activation-4.55e-01

Pos logit contributions
 ever+0.007
 onwards+0.006
unts+0.006
 onward+0.005
 grin+0.005
Neg logit contributions
aman-0.010
 hey-0.010
land-0.010
 exception-0.010
lves-0.009
Token Tim
Token ID5045
Feature activation+6.16e-01

Pos logit contributions
aman+0.020
ware+0.020
land+0.020
lves+0.019
 hey+0.019
Neg logit contributions
 ever-0.013
 onwards-0.010
unts-0.010
 fright-0.009
 grin-0.008
Tokenmy
Token ID1820
Feature activation+1.79e-01

Pos logit contributions
 ever+0.023
 onwards+0.022
unts+0.021
 fright+0.020
 onward+0.020
Neg logit contributions
aman-0.019
land-0.018
 hey-0.018
 exception-0.018
 whose-0.017
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
 ever+0.006
 onwards+0.005
unts+0.005
 fright+0.004
 onward+0.004
Neg logit contributions
aman-0.007
land-0.007
 hey-0.007
 exception-0.007
lves-0.007
Token it
Token ID340
Feature activation-1.94e+00

Pos logit contributions
aman+0.009
land+0.009
ware+0.009
 hey+0.009
 exception+0.009
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.005
 grin-0.004
Token happens
Token ID4325
Feature activation-1.51e-01

Pos logit contributions
aman+0.093
ware+0.090
land+0.087
 extingu+0.085
 exception+0.084
Neg logit contributions
 ever-0.054
 fright-0.037
�-0.037
 onwards-0.037
 should-0.036
Token sometimes
Token ID3360
Feature activation-3.71e-01

Pos logit contributions
aman+0.006
land+0.006
 hey+0.006
ware+0.006
 exception+0.006
Neg logit contributions
 ever-0.005
 onwards-0.004
unts-0.004
 fright-0.004
 grin-0.004
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
aman+0.015
land+0.014
ware+0.014
 hey+0.014
lves+0.014
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.009
 fright-0.009
 onward-0.008
Token Let
Token ID3914
Feature activation-3.42e-01

Pos logit contributions
aman+0.011
land+0.010
ware+0.010
 hey+0.010
 exception+0.010
Neg logit contributions
 ever-0.006
 onwards-0.005
unts-0.005
 fright-0.004
 onward-0.004
Token's
Token ID338
Feature activation-2.07e-01

Pos logit contributions
aman+0.012
land+0.012
ware+0.012
 exception+0.011
 hey+0.011
Neg logit contributions
 ever-0.012
 onwards-0.010
unts-0.010
 fright-0.010
 onward-0.009
Token\n
Token ID198
Feature activation-6.63e-01

Pos logit contributions
 ever+0.028
 onwards+0.024
 grin+0.023
unts+0.023
 fright+0.023
Neg logit contributions
aman-0.024
land-0.023
 whose-0.022
 exception-0.022
 hey-0.021
TokenWhen
Token ID2215
Feature activation+2.48e-01

Pos logit contributions
aman+0.019
ware+0.019
 extingu+0.018
land+0.018
 exception+0.018
Neg logit contributions
 ever-0.028
 onwards-0.026
unts-0.025
 fright-0.024
 onward-0.023
Token she
Token ID673
Feature activation-1.07e+00

Pos logit contributions
 ever+0.009
unts+0.008
 onwards+0.008
 fright+0.008
kick+0.008
Neg logit contributions
aman-0.008
 hey-0.008
land-0.007
ware-0.007
 exception-0.007
Token got
Token ID1392
Feature activation+1.43e+00

Pos logit contributions
aman+0.051
 hey+0.050
 exception+0.050
land+0.050
lves+0.049
Neg logit contributions
 ever-0.028
 onwards-0.020
 fright-0.017
 onward-0.016
 had-0.016
Token to
Token ID284
Feature activation-9.36e-01

Pos logit contributions
unts+0.059
 grin+0.053
 ever+0.052
 Amen+0.052
 fright+0.050
Neg logit contributions
 prett-0.039
aman-0.037
 hey-0.037
 where-0.036
 restroom-0.036
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
aman+0.044
land+0.044
ware+0.044
lves+0.043
 hey+0.043
Neg logit contributions
 ever-0.022
 fright-0.016
 onwards-0.016
 it-0.015
unts-0.015
Token beach
Token ID10481
Feature activation-1.15e+00

Pos logit contributions
ist+0.063
aman+0.061
lves+0.058
ilst+0.057
 extingu+0.055
Neg logit contributions
 ever-0.069
 it-0.061
 on-0.060
 onwards-0.058
 hungry-0.057
Token,
Token ID11
Feature activation-3.19e-01

Pos logit contributions
aman+0.057
lves+0.057
 hey+0.055
ware+0.054
 extingu+0.054
Neg logit contributions
 ever-0.027
 onwards-0.020
unts-0.016
 grin-0.016
 onward-0.016
Token it
Token ID340
Feature activation-1.21e+00

Pos logit contributions
aman+0.012
land+0.012
 hey+0.011
ware+0.011
 exception+0.011
Neg logit contributions
 ever-0.011
 onwards-0.009
unts-0.009
 fright-0.009
 onward-0.008
Token was
Token ID373
Feature activation+6.96e-01

Pos logit contributions
aman+0.061
 hey+0.059
land+0.058
 exception+0.057
 extingu+0.057
Neg logit contributions
 ever-0.029
 onwards-0.022
 had-0.019
unts-0.018
 excited-0.018
Token already
Token ID1541
Feature activation+2.42e-01

Pos logit contributions
 ever+0.022
unts+0.019
keeping+0.018
 onwards+0.018
 fright+0.018
Neg logit contributions
aman-0.027
 prett-0.026
 exception-0.025
land-0.025
 whose-0.024
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
aman+0.009
land+0.009
 hey+0.009
ware+0.008
lves+0.008
Neg logit contributions
 ever-0.008
 onwards-0.007
unts-0.006
 fright-0.006
 grin-0.006
Token dirt
Token ID13647
Feature activation-1.36e+00

Pos logit contributions
aman+0.008
land+0.008
 hey+0.007
 exception+0.007
ware+0.007
Neg logit contributions
 ever-0.012
 onwards-0.011
unts-0.010
 fright-0.010
 onward-0.010
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
aman+0.063
 hey+0.062
lves+0.061
 exception+0.059
land+0.059
Neg logit contributions
 ever-0.037
 onwards-0.028
unts-0.027
kick-0.024
 fright-0.024
Token It
Token ID632
Feature activation-7.05e-01

Pos logit contributions
 hey+0.064
land+0.064
aman+0.064
lves+0.063
ware+0.063
Neg logit contributions
 onwards-0.021
 ever-0.020
 It-0.017
 onward-0.016
unts-0.016
Token began
Token ID2540
Feature activation+1.40e-01

Pos logit contributions
aman+0.032
land+0.031
 hey+0.030
 exception+0.030
ware+0.029
Neg logit contributions
 ever-0.021
 onwards-0.015
 fright-0.015
unts-0.015
kick-0.013
Token to
Token ID284
Feature activation-1.92e+00

Pos logit contributions
 ever+0.003
 onwards+0.003
unts+0.003
 fright+0.002
 grin+0.002
Neg logit contributions
aman-0.007
land-0.006
 hey-0.006
 exception-0.006
ware-0.006
Token rot
Token ID5724
Feature activation-2.75e-01

Pos logit contributions
aman+0.084
land+0.084
lves+0.081
 whose+0.080
ware+0.078
Neg logit contributions
 ever-0.055
 fright-0.048
 have-0.043
 mad-0.041
 smile-0.040
Token away
Token ID1497
Feature activation-1.11e+00

Pos logit contributions
aman+0.012
land+0.012
 hey+0.011
 exception+0.011
ware+0.011
Neg logit contributions
 ever-0.008
 onwards-0.006
unts-0.006
 fright-0.006
 onward-0.005
Token in
Token ID287
Feature activation-7.96e-01

Pos logit contributions
aman+0.052
lves+0.050
 hey+0.050
ware+0.049
 exception+0.048
Neg logit contributions
 ever-0.030
 onwards-0.023
unts-0.020
 fright-0.020
 onward-0.019
Token the
Token ID262
Feature activation+6.80e-01

Pos logit contributions
aman+0.033
land+0.033
 hey+0.032
 exception+0.031
lves+0.031
Neg logit contributions
 ever-0.025
 fright-0.020
 onwards-0.020
unts-0.020
keeping-0.018
Token soil
Token ID9260
Feature activation-1.03e+00

Pos logit contributions
 ever+0.031
 onwards+0.028
unts+0.027
te+0.026
 onward+0.026
Neg logit contributions
 prett-0.017
aman-0.017
 hey-0.016
land-0.016
ware-0.015
Token took
Token ID1718
Feature activation+4.78e-02

Pos logit contributions
aman+0.020
land+0.020
ware+0.019
 exception+0.019
 hey+0.019
Neg logit contributions
 ever-0.015
 onwards-0.012
unts-0.011
 fright-0.011
 grin-0.010
Token her
Token ID607
Feature activation-1.23e-01

Pos logit contributions
 ever+0.002
 onwards+0.001
unts+0.001
 fright+0.001
 grin+0.001
Neg logit contributions
aman-0.002
 hey-0.002
land-0.002
 exception-0.002
ware-0.002
Token to
Token ID284
Feature activation-6.68e-01

Pos logit contributions
aman+0.005
land+0.005
 hey+0.005
ware+0.005
 exception+0.005
Neg logit contributions
 ever-0.003
 onwards-0.003
unts-0.003
 fright-0.002
 grin-0.002
Token the
Token ID262
Feature activation-9.57e-01

Pos logit contributions
aman+0.030
land+0.029
ware+0.029
lves+0.029
 exception+0.028
Neg logit contributions
 ever-0.018
 fright-0.014
 onwards-0.014
unts-0.013
 grin-0.012
Token doctor
Token ID6253
Feature activation-7.21e-01

Pos logit contributions
aman+0.023
ware+0.023
land+0.023
 exception+0.022
lves+0.022
Neg logit contributions
 ever-0.043
 fright-0.038
unts-0.038
 onwards-0.038
 mad-0.036
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
aman+0.029
 hey+0.027
lves+0.027
ware+0.027
 exception+0.027
Neg logit contributions
 ever-0.023
 onwards-0.019
unts-0.018
 fright-0.017
 onward-0.017
Token the
Token ID262
Feature activation-7.73e-01

Pos logit contributions
land+0.087
ware+0.082
aman+0.082
 exception+0.081
 Majesty+0.078
Neg logit contributions
 ever-0.052
 it-0.039
 onwards-0.037
 on-0.036
 fright-0.036
Token doctor
Token ID6253
Feature activation-1.77e+00

Pos logit contributions
aman+0.018
lves+0.017
land+0.016
 exception+0.016
ware+0.015
Neg logit contributions
 ever-0.038
 fright-0.033
 onwards-0.033
unts-0.033
 grin-0.032
Token said
Token ID531
Feature activation+2.28e-03

Pos logit contributions
aman+0.086
lves+0.082
 exception+0.081
land+0.080
ware+0.079
Neg logit contributions
 ever-0.046
 onwards-0.031
 had-0.030
 excited-0.029
unts-0.029
Token,
Token ID11
Feature activation-4.61e-01

Pos logit contributions
 ever+0.000
 onwards+0.000
unts+0.000
 fright+0.000
 grin+0.000
Neg logit contributions
aman-0.000
land-0.000
 hey-0.000
ware-0.000
 exception-0.000
Token "
Token ID366
Feature activation-1.06e+00

Pos logit contributions
aman+0.018
land+0.017
 exception+0.016
lves+0.016
 hey+0.016
Neg logit contributions
 ever-0.016
 onwards-0.013
unts-0.013
 fright-0.012
 onward-0.012