TOP ACTIVATIONS
MAX = 0.072

spun so much that she started to feel dizz y ,
do you want to learn how to surf ?" Max said
said , " Wow , that 's a big drum !"
said , " Wow , that 's so pretty ! Can
little bear loved how it felt . He hugged the log
closer and take a better look ." As they
hopped and spun with g lee . But one day she
. She loved how it felt in her hands .
One day , Ch oo - Ch oo was feeling very
it was going so fast that it didn 't see a
the boy loved how it felt when it bounced off his
He loved the way he felt getting all clean .
much that she started to feel dizz y , and when
spun so fast that he felt dizz y and happy .
boy grabbed the rocket and started to spin around and around
loved how it made her feel special and safe . <|endoftext|>
Cub ey loved to spin all day long . He would
said , " Wow , that dress is so fancy !
said , " Wow , that 's a big orange drum
the water , enjoying the feel of the cool water against

MIN ACTIVATIONS
MIN = -0.074

. She had a deep need to explore the unknown
The bar ber had a deep voice and Tim my was
bed . She dreamed of mer maid s and pirates ,
father taught his child that caves can be friendly and safe
a little girl named Amy . Amy always had a big
a kind girl named Amy . Amy had a neighbor named
" I 'm a secret mer maid . Would you like
telling a story about a mer maid who lived in the
a little girl named Amy . Amy had a big red
sleeping . He had a deep and peaceful sleep until his
a little girl called Amy . She was a very thoughtful
Sometimes , though , it could be difficult .
a little girl named Amy . She had a small ,
's there ?" came a deep voice . The
lived a girl named Amy . She had a big ,
and over again with his deep paws , to no avail
a little girl named Amy . Amy was sad because it
, he put on his brav est clothes and started his
a kind girl named Amy . Amy liked to help her
year old girl named Amy . She was feeling very ordinary

INTERVAL 0.035 - 0.072
CONTAINS 0.538%

fun ! As it went , it saw
a big aer opl ane was flying in the sky .
and ran , all the way to the top . When
in the park . They like to swing and slide and
moved his chess piece , too . In the

INTERVAL -0.001 - 0.035
CONTAINS 52.945%

a beautiful opera . She was so happy , she wanted
glad that she had been so alert and was running away
. But Lily did not want to give it up .
would nod its head . The end . <|endoftext|>
It is a rh ino . " Wow ,

INTERVAL -0.037 - -0.001
CONTAINS 46.233%

can both use the notebook . Tim can put his drawings
. It was brown and round and had a hole on
the pieces of coal for their eyes and buttons .
the baby said , " M ama , don 't cry
their old home . <|endoftext|> Once upon a time there was

INTERVAL -0.074 - -0.037
CONTAINS 0.283%

a little girl named Jane . She was only 3 years
the passport and the magical mer maid was grateful to get
berries . He wanted to act like he was picking them
were his two best friends hiding with the block of ice
him to a deep and lasting friendship . He thanked his
Token spun
Token ID26843
Feature activation+1.92e-02

Pos logit contributions
 café+0.038
actus+0.037
reating+0.036
THE+0.036
unn+0.035
Neg logit contributions
inkle-0.038
 their-0.034
 none-0.032
cca-0.032
ised-0.032
Token so
Token ID523
Feature activation+5.04e-02

Pos logit contributions
saw+0.029
 café+0.029
actus+0.027
THE+0.027
bye+0.027
Neg logit contributions
inkle-0.024
ags-0.021
cca-0.020
ist-0.020
 concern-0.019
Token much
Token ID881
Feature activation+4.51e-02

Pos logit contributions
 café+0.084
fl+0.080
actus+0.077
 desert+0.077
reating+0.076
Neg logit contributions
inkle-0.057
 deeply-0.047
ised-0.046
bly-0.045
ither-0.045
Token that
Token ID326
Feature activation+5.08e-02

Pos logit contributions
 café+0.069
actus+0.067
saw+0.067
fl+0.067
rav+0.067
Neg logit contributions
inkle-0.052
ist-0.045
 deeply-0.044
ags-0.042
bly-0.042
Token she
Token ID673
Feature activation+4.95e-02

Pos logit contributions
 café+0.071
actus+0.070
saw+0.070
unn+0.070
THE+0.069
Neg logit contributions
inkle-0.067
 concern-0.057
 sincere-0.056
cca-0.053
 deeply-0.052
Token started
Token ID2067
Feature activation+7.10e-02

Pos logit contributions
 café+0.061
reating+0.060
THE+0.057
saw+0.057
fl+0.056
Neg logit contributions
inkle-0.078
cca-0.069
ist-0.068
 sincere-0.066
bly-0.065
Token to
Token ID284
Feature activation+3.36e-02

Pos logit contributions
actus+0.090
 café+0.086
THE+0.086
bye+0.084
unn+0.081
Neg logit contributions
 their-0.099
inkle-0.094
 an-0.090
 concern-0.087
 sincere-0.087
Token feel
Token ID1254
Feature activation+6.75e-02

Pos logit contributions
saw+0.039
THE+0.038
 café+0.038
fl+0.038
 trail+0.037
Neg logit contributions
inkle-0.054
ags-0.047
 concern-0.047
cca-0.047
 sincere-0.046
Token dizz
Token ID29527
Feature activation+5.53e-03

Pos logit contributions
fl+0.114
saw+0.113
board+0.113
 café+0.112
bye+0.110
Neg logit contributions
inkle-0.074
 sincere-0.065
 concern-0.064
 deeply-0.064
 their-0.060
Tokeny
Token ID88
Feature activation+3.43e-02

Pos logit contributions
 café+0.007
saw+0.007
THE+0.007
 trail+0.007
 roller+0.006
Neg logit contributions
inkle-0.009
ags-0.007
ist-0.007
cca-0.007
bly-0.007
Token,
Token ID11
Feature activation+1.96e-02

Pos logit contributions
THE+0.033
saw+0.033
fl+0.032
 trail+0.032
 café+0.032
Neg logit contributions
inkle-0.062
ags-0.055
 concern-0.055
cca-0.055
 sincere-0.054
Token do
Token ID466
Feature activation+1.43e-02

Pos logit contributions
fl+0.029
THE+0.029
 café+0.029
saw+0.028
actus+0.027
Neg logit contributions
inkle-0.031
 sincere-0.029
 concern-0.029
ist-0.028
bly-0.027
Token you
Token ID345
Feature activation+3.13e-03

Pos logit contributions
reating+0.023
 café+0.022
saw+0.021
fl+0.021
THE+0.021
Neg logit contributions
inkle-0.017
 sincere-0.015
ap-0.014
ither-0.014
 common-0.014
Token want
Token ID765
Feature activation+3.78e-02

Pos logit contributions
saw+0.005
THE+0.005
fl+0.005
 café+0.005
 trail+0.005
Neg logit contributions
inkle-0.004
 concern-0.003
 sincere-0.003
ist-0.003
ags-0.003
Token to
Token ID284
Feature activation+3.54e-02

Pos logit contributions
saw+0.047
THE+0.045
actus+0.043
fl+0.042
 trail+0.041
Neg logit contributions
 their-0.053
 an-0.051
 concern-0.049
ags-0.049
 common-0.047
Token learn
Token ID2193
Feature activation+3.85e-02

Pos logit contributions
saw+0.044
fl+0.043
THE+0.042
 café+0.042
actus+0.042
Neg logit contributions
inkle-0.055
 sincere-0.048
ags-0.047
 common-0.046
 concern-0.045
Token how
Token ID703
Feature activation+7.03e-02

Pos logit contributions
saw+0.053
fl+0.053
 café+0.052
THE+0.052
actus+0.050
Neg logit contributions
inkle-0.050
ags-0.045
 sincere-0.043
ist-0.043
 concern-0.042
Token to
Token ID284
Feature activation+4.11e-02

Pos logit contributions
 café+0.099
board+0.097
THE+0.095
 trail+0.094
pillar+0.092
Neg logit contributions
 their-0.091
 common-0.082
 badly-0.076
 an-0.076
 sincere-0.075
Token surf
Token ID9053
Feature activation+6.62e-03

Pos logit contributions
saw+0.035
 café+0.033
THE+0.033
 trail+0.032
fl+0.032
Neg logit contributions
inkle-0.079
cca-0.072
 concern-0.072
ags-0.071
 sincere-0.070
Token?"
Token ID1701
Feature activation+4.94e-04

Pos logit contributions
saw+0.008
THE+0.008
 café+0.008
fl+0.008
 trail+0.008
Neg logit contributions
inkle-0.010
ags-0.009
cca-0.009
 sincere-0.008
 concern-0.008
Token Max
Token ID5436
Feature activation-1.27e-02

Pos logit contributions
 café+0.001
saw+0.001
THE+0.001
fl+0.001
 ze+0.001
Neg logit contributions
inkle-0.001
bly-0.001
cca-0.001
ags-0.001
ist-0.001
Token said
Token ID531
Feature activation-9.84e-03

Pos logit contributions
inkle+0.023
ags+0.022
bly+0.020
agged+0.020
 concern+0.019
Neg logit contributions
 café-0.012
fl-0.012
 trail-0.011
THE-0.011
bye-0.011
Token said
Token ID531
Feature activation+1.23e-02

Pos logit contributions
saw+0.031
THE+0.030
 café+0.029
fl+0.029
 trail+0.029
Neg logit contributions
inkle-0.047
ags-0.042
 concern-0.041
cca-0.041
 sincere-0.041
Token,
Token ID11
Feature activation+2.56e-02

Pos logit contributions
saw+0.013
THE+0.012
bye+0.012
fl+0.012
rav+0.012
Neg logit contributions
inkle-0.022
ags-0.020
cca-0.019
ist-0.018
 concern-0.018
Token "
Token ID366
Feature activation+2.23e-02

Pos logit contributions
saw+0.015
fl+0.015
THE+0.015
 café+0.014
 ze+0.014
Neg logit contributions
inkle-0.058
ags-0.053
agged-0.051
cca-0.050
 concern-0.050
TokenWow
Token ID22017
Feature activation+1.81e-02

Pos logit contributions
 café+0.022
saw+0.022
 trail+0.022
THE+0.021
 ze+0.021
Neg logit contributions
inkle-0.037
cca-0.035
 concern-0.034
 sincere-0.034
bly-0.033
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
saw+0.026
 café+0.025
THE+0.025
fl+0.024
 trail+0.024
Neg logit contributions
inkle-0.025
 sincere-0.021
ags-0.021
 concern-0.021
ist-0.021
Token that
Token ID326
Feature activation+7.02e-02

Pos logit contributions
saw+0.016
THE+0.016
bye+0.016
 roller+0.015
 trail+0.015
Neg logit contributions
inkle-0.024
ags-0.021
cca-0.020
agged-0.020
ist-0.019
Token's
Token ID338
Feature activation+5.05e-03

Pos logit contributions
 café+0.093
saw+0.090
unn+0.090
lan+0.090
actus+0.087
Neg logit contributions
inkle-0.104
 sincere-0.088
ist-0.087
 concern-0.086
bly-0.085
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
THE+0.006
 roller+0.006
saw+0.006
fl+0.006
 ze+0.005
Neg logit contributions
inkle-0.008
cca-0.007
 concern-0.007
agged-0.007
ags-0.007
Token big
Token ID1263
Feature activation+2.13e-02

Pos logit contributions
saw+0.048
 café+0.047
fl+0.047
unn+0.047
THE+0.046
Neg logit contributions
inkle-0.041
 sincere-0.035
 concern-0.033
bly-0.033
cca-0.033
Token drum
Token ID13026
Feature activation+2.19e-02

Pos logit contributions
saw+0.024
THE+0.023
 café+0.023
fl+0.023
 trail+0.022
Neg logit contributions
inkle-0.035
ags-0.031
 concern-0.031
cca-0.031
 sincere-0.031
Token!"
Token ID2474
Feature activation+9.45e-03

Pos logit contributions
saw+0.016
THE+0.015
fl+0.015
 café+0.015
 trail+0.014
Neg logit contributions
inkle-0.045
ags-0.041
 concern-0.041
cca-0.041
 sincere-0.040
Token said
Token ID531
Feature activation+6.31e-03

Pos logit contributions
saw+0.019
 café+0.019
THE+0.018
fl+0.018
 trail+0.018
Neg logit contributions
inkle-0.044
ags-0.042
 concern-0.039
bly-0.039
agged-0.039
Token,
Token ID11
Feature activation+1.70e-02

Pos logit contributions
saw+0.009
THE+0.009
 café+0.009
fl+0.009
 trail+0.008
Neg logit contributions
inkle-0.009
 sincere-0.007
 concern-0.007
ags-0.007
cca-0.007
Token "
Token ID366
Feature activation+3.25e-02

Pos logit contributions
saw+0.016
THE+0.015
 café+0.015
fl+0.015
 trail+0.015
Neg logit contributions
inkle-0.031
ags-0.028
 concern-0.028
cca-0.028
 sincere-0.027
TokenWow
Token ID22017
Feature activation+1.87e-02

Pos logit contributions
unn+0.042
 trail+0.041
 café+0.040
saw+0.040
 roller+0.040
Neg logit contributions
inkle-0.042
cca-0.040
 concern-0.038
bly-0.038
 sincere-0.037
Token,
Token ID11
Feature activation+1.80e-02

Pos logit contributions
saw+0.032
 café+0.032
THE+0.030
reating+0.030
fl+0.030
Neg logit contributions
inkle-0.022
ist-0.017
 sincere-0.017
ags-0.016
ither-0.016
Token that
Token ID326
Feature activation+7.01e-02

Pos logit contributions
saw+0.020
THE+0.019
 café+0.019
 trail+0.019
fl+0.018
Neg logit contributions
inkle-0.031
ags-0.027
cca-0.027
ist-0.026
agged-0.026
Token's
Token ID338
Feature activation-6.05e-03

Pos logit contributions
unn+0.100
reating+0.098
saw+0.098
Sweet+0.097
 café+0.096
Neg logit contributions
inkle-0.089
 their-0.072
ap-0.069
 sincere-0.067
 common-0.065
Token so
Token ID523
Feature activation+2.74e-02

Pos logit contributions
inkle+0.008
 concern+0.007
ags+0.007
cca+0.007
 sincere+0.007
Neg logit contributions
THE-0.008
saw-0.008
fl-0.008
 café-0.008
 trail-0.008
Token pretty
Token ID2495
Feature activation+3.89e-02

Pos logit contributions
fl+0.043
THE+0.042
 café+0.042
saw+0.041
reating+0.040
Neg logit contributions
inkle-0.034
 sincere-0.028
 common-0.027
cca-0.026
ags-0.026
Token!
Token ID0
Feature activation+3.19e-02

Pos logit contributions
saw+0.041
 café+0.040
THE+0.040
EEP+0.038
rav+0.037
Neg logit contributions
inkle-0.064
ist-0.055
bly-0.054
 common-0.053
 their-0.052
Token Can
Token ID1680
Feature activation+1.99e-03

Pos logit contributions
THE+0.032
saw+0.032
 trail+0.031
 café+0.031
fl+0.030
Neg logit contributions
inkle-0.053
 sincere-0.048
 concern-0.048
ags-0.047
cca-0.047
Token little
Token ID1310
Feature activation+2.06e-02

Pos logit contributions
saw+0.019
THE+0.018
bye+0.018
 café+0.017
reating+0.017
Neg logit contributions
inkle-0.030
cca-0.027
 concern-0.027
 sincere-0.026
ags-0.026
Token bear
Token ID6842
Feature activation+2.88e-02

Pos logit contributions
saw+0.021
THE+0.021
 café+0.020
reating+0.019
rav+0.019
Neg logit contributions
inkle-0.036
cca-0.032
 concern-0.032
 sincere-0.032
ags-0.031
Token loved
Token ID6151
Feature activation+3.03e-02

Pos logit contributions
saw+0.026
THE+0.025
 café+0.025
fl+0.025
 trail+0.024
Neg logit contributions
inkle-0.054
ags-0.049
 concern-0.048
 sincere-0.047
cca-0.047
Token how
Token ID703
Feature activation+4.40e-02

Pos logit contributions
saw+0.050
bye+0.048
rav+0.047
 café+0.047
actus+0.047
Neg logit contributions
 their-0.035
 his-0.029
 sincere-0.029
 its-0.028
 none-0.028
Token it
Token ID340
Feature activation+1.15e-02

Pos logit contributions
reating+0.080
 café+0.076
 ze+0.075
board+0.073
arant+0.073
Neg logit contributions
 their-0.045
inkle-0.042
 sincere-0.041
apy-0.040
 sorry-0.039
Token felt
Token ID2936
Feature activation+6.96e-02

Pos logit contributions
saw+0.011
THE+0.011
 café+0.011
fl+0.011
 trail+0.010
Neg logit contributions
inkle-0.021
ags-0.019
 concern-0.019
 sincere-0.018
cca-0.018
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
saw+0.092
fl+0.087
 café+0.087
THE+0.086
 ze+0.085
Neg logit contributions
inkle-0.101
 sincere-0.087
bly-0.086
 concern-0.086
cca-0.086
Token He
Token ID679
Feature activation+7.26e-03

Pos logit contributions
reating+0.032
saw+0.031
THE+0.031
fl+0.030
 trail+0.030
Neg logit contributions
inkle-0.025
ags-0.021
 Their-0.021
 concern-0.021
cca-0.021
Token hugged
Token ID41130
Feature activation+1.06e-02

Pos logit contributions
saw+0.008
THE+0.008
 café+0.008
fl+0.008
 trail+0.008
Neg logit contributions
inkle-0.012
cca-0.011
 concern-0.010
 sincere-0.010
ist-0.010
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
saw+0.014
THE+0.014
 café+0.014
fl+0.014
 trail+0.014
Neg logit contributions
inkle-0.015
ags-0.013
 concern-0.013
cca-0.013
 sincere-0.013
Token log
Token ID2604
Feature activation+8.40e-03

Pos logit contributions
reating+0.015
saw+0.015
bye+0.015
rav+0.014
THE+0.014
Neg logit contributions
inkle-0.021
cca-0.019
 concern-0.018
ags-0.018
 deeply-0.017
Token closer
Token ID5699
Feature activation+4.38e-02

Pos logit contributions
pillar+0.030
THE+0.029
reating+0.029
fl+0.029
 anth+0.029
Neg logit contributions
inkle-0.023
ags-0.021
 their-0.018
ist-0.018
 together-0.017
Token and
Token ID290
Feature activation+1.46e-02

Pos logit contributions
saw+0.058
 café+0.055
actus+0.054
THE+0.054
reating+0.052
Neg logit contributions
inkle-0.059
ist-0.054
ags-0.052
cca-0.049
 their-0.049
Token take
Token ID1011
Feature activation+1.83e-02

Pos logit contributions
THE+0.022
fl+0.021
actus+0.021
 café+0.021
 trail+0.021
Neg logit contributions
inkle-0.017
 concern-0.015
 sincere-0.015
ags-0.014
cca-0.013
Token a
Token ID257
Feature activation+2.63e-02

Pos logit contributions
saw+0.028
THE+0.028
 café+0.027
fl+0.027
 trail+0.027
Neg logit contributions
inkle-0.023
ags-0.019
 concern-0.019
cca-0.019
 sincere-0.019
Token better
Token ID1365
Feature activation+1.83e-02

Pos logit contributions
fl+0.047
THE+0.045
ups+0.044
reating+0.043
bye+0.041
Neg logit contributions
inkle-0.029
 common-0.023
 concern-0.022
ist-0.020
ither-0.019
Token look
Token ID804
Feature activation+6.93e-02

Pos logit contributions
THE+0.035
 concession+0.034
Kids+0.033
ups+0.033
saw+0.033
Neg logit contributions
inkle-0.015
 common-0.012
 concern-0.012
 charge-0.011
ags-0.011
Token."
Token ID526
Feature activation+1.29e-02

Pos logit contributions
 café+0.073
saw+0.072
 trail+0.070
THE+0.069
fl+0.068
Neg logit contributions
inkle-0.118
ags-0.106
ist-0.105
bly-0.103
cca-0.100
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
saw+0.022
reating+0.022
 ze+0.022
 mir+0.021
 ostr+0.021
Neg logit contributions
inkle-0.013
 Their-0.012
ags-0.012
 their-0.011
agged-0.011
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
fl+0.023
saw+0.023
 ze+0.023
 ostr+0.023
bye+0.022
Neg logit contributions
inkle-0.017
 Their-0.013
 their-0.013
ags-0.013
cca-0.012
TokenAs
Token ID1722
Feature activation+2.75e-02

Pos logit contributions
inkle+0.016
cca+0.015
ist+0.014
bly+0.014
ags+0.014
Neg logit contributions
fl-0.013
saw-0.013
 ze-0.012
 café-0.012
bye-0.012
Token they
Token ID484
Feature activation-1.50e-02

Pos logit contributions
saw+0.049
fl+0.048
actus+0.047
 café+0.047
THE+0.046
Neg logit contributions
inkle-0.028
 their-0.022
cca-0.021
 common-0.021
 sincere-0.020
Token hopped
Token ID45230
Feature activation+1.36e-02

Pos logit contributions
THE+0.047
 café+0.046
saw+0.046
reating+0.045
bye+0.045
Neg logit contributions
inkle-0.069
ist-0.060
cca-0.060
ags-0.060
 sincere-0.059
Token and
Token ID290
Feature activation+2.65e-02

Pos logit contributions
saw+0.022
 café+0.021
fl+0.021
THE+0.021
actus+0.021
Neg logit contributions
inkle-0.016
ags-0.013
cca-0.013
 deeply-0.012
ist-0.012
Token spun
Token ID26843
Feature activation+1.23e-02

Pos logit contributions
saw+0.032
reating+0.031
 café+0.031
actus+0.030
THE+0.029
Neg logit contributions
inkle-0.039
 their-0.035
cca-0.034
ags-0.034
ist-0.034
Token with
Token ID351
Feature activation+1.45e-02

Pos logit contributions
saw+0.018
 café+0.018
THE+0.017
actus+0.017
bye+0.017
Neg logit contributions
inkle-0.016
ags-0.014
cca-0.013
ist-0.013
 concern-0.013
Token g
Token ID308
Feature activation-2.98e-03

Pos logit contributions
saw+0.023
THE+0.023
fl+0.023
 café+0.023
 trail+0.022
Neg logit contributions
inkle-0.017
cca-0.014
 sincere-0.014
ags-0.014
 concern-0.014
Tokenlee
Token ID7197
Feature activation+6.93e-02

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
 sincere+0.004
cca+0.004
Neg logit contributions
saw-0.003
THE-0.003
 café-0.003
fl-0.003
 trail-0.003
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
saw+0.104
 café+0.097
THE+0.095
reating+0.092
actus+0.092
Neg logit contributions
inkle-0.089
ist-0.083
cca-0.076
 deeply-0.076
ags-0.074
Token But
Token ID887
Feature activation-6.92e-03

Pos logit contributions
saw+0.019
THE+0.019
 café+0.019
 trail+0.019
fl+0.018
Neg logit contributions
inkle-0.019
ags-0.017
 concern-0.016
 sincere-0.016
cca-0.016
Token one
Token ID530
Feature activation+1.65e-02

Pos logit contributions
ags+0.013
inkle+0.013
agged+0.012
cca+0.012
ist+0.012
Neg logit contributions
fl-0.007
 night-0.007
THE-0.007
 day-0.007
 ze-0.006
Token day
Token ID1110
Feature activation+3.45e-03

Pos logit contributions
saw+0.012
THE+0.012
fl+0.011
 café+0.011
reating+0.011
Neg logit contributions
inkle-0.032
cca-0.030
 concern-0.030
 sincere-0.029
ist-0.029
Token she
Token ID673
Feature activation+5.86e-03

Pos logit contributions
saw+0.005
THE+0.005
 café+0.005
fl+0.005
 trail+0.005
Neg logit contributions
inkle-0.005
 concern-0.004
cca-0.004
 sincere-0.004
ags-0.004
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
saw+0.033
THE+0.032
 café+0.032
fl+0.031
 trail+0.030
Neg logit contributions
inkle-0.048
 concern-0.042
ags-0.042
cca-0.042
 sincere-0.042
Token She
Token ID1375
Feature activation+3.00e-02

Pos logit contributions
 ze+0.035
reating+0.034
lan+0.034
saw+0.034
fl+0.034
Neg logit contributions
inkle-0.023
 Their-0.020
ags-0.020
 sincere-0.020
cca-0.019
Token loved
Token ID6151
Feature activation+3.25e-02

Pos logit contributions
saw+0.034
THE+0.034
fl+0.033
 café+0.033
 trail+0.032
Neg logit contributions
inkle-0.049
cca-0.044
 sincere-0.043
 concern-0.043
ags-0.042
Token how
Token ID703
Feature activation+5.26e-02

Pos logit contributions
saw+0.051
bye+0.051
board+0.049
actus+0.049
 café+0.049
Neg logit contributions
 their-0.036
 sincere-0.032
 concern-0.031
inkle-0.031
ags-0.030
Token it
Token ID340
Feature activation+2.42e-02

Pos logit contributions
 café+0.087
 ze+0.085
board+0.083
fl+0.083
reating+0.082
Neg logit contributions
 their-0.056
 sincere-0.051
inkle-0.051
 concern-0.048
 badly-0.048
Token felt
Token ID2936
Feature activation+6.90e-02

Pos logit contributions
saw+0.024
THE+0.023
 café+0.023
fl+0.023
 trail+0.022
Neg logit contributions
inkle-0.043
ags-0.038
 concern-0.038
cca-0.038
 sincere-0.038
Token in
Token ID287
Feature activation+8.63e-03

Pos logit contributions
saw+0.088
THE+0.087
 café+0.085
 trail+0.084
fl+0.084
Neg logit contributions
inkle-0.105
ags-0.090
 concern-0.090
cca-0.090
 sincere-0.088
Token her
Token ID607
Feature activation+3.48e-03

Pos logit contributions
THE+0.012
saw+0.012
 trail+0.012
 café+0.012
fl+0.012
Neg logit contributions
inkle-0.012
ist-0.010
cca-0.010
ags-0.010
agged-0.010
Token hands
Token ID2832
Feature activation+2.79e-02

Pos logit contributions
 trail+0.003
actus+0.003
THE+0.003
 tree+0.003
 roller+0.003
Neg logit contributions
inkle-0.006
ist-0.006
cca-0.006
bly-0.005
immer-0.005
Token.
Token ID13
Feature activation+2.30e-02

Pos logit contributions
fl+0.018
 trail+0.017
m+0.017
 café+0.017
THE+0.016
Neg logit contributions
inkle-0.060
ags-0.056
agged-0.054
cca-0.054
 concern-0.053
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
saw+0.035
 ze+0.035
 trail+0.035
reating+0.035
 roller+0.035
Neg logit contributions
inkle-0.026
ags-0.024
 Their-0.023
cca-0.022
 sincere-0.022
Token One
Token ID1881
Feature activation+2.35e-02

Pos logit contributions
saw+0.022
THE+0.022
 café+0.022
actus+0.021
reating+0.021
Neg logit contributions
inkle-0.015
ags-0.012
 Their-0.011
 concern-0.011
cca-0.011
Token day
Token ID1110
Feature activation+2.33e-02

Pos logit contributions
saw+0.020
THE+0.018
reating+0.018
actus+0.017
fl+0.017
Neg logit contributions
inkle-0.045
cca-0.040
 their-0.039
 common-0.039
ist-0.039
Token,
Token ID11
Feature activation+2.44e-02

Pos logit contributions
saw+0.044
 café+0.042
reating+0.041
actus+0.041
THE+0.041
Neg logit contributions
inkle-0.023
cca-0.018
bly-0.017
ither-0.017
 their-0.017
Token Ch
Token ID609
Feature activation+8.74e-03

Pos logit contributions
saw+0.035
 café+0.035
actus+0.034
reating+0.033
THE+0.033
Neg logit contributions
inkle-0.031
bly-0.027
cca-0.027
 their-0.027
 concern-0.027
Tokenoo
Token ID2238
Feature activation+5.01e-02

Pos logit contributions
saw+0.015
 café+0.015
THE+0.015
fl+0.014
 trail+0.014
Neg logit contributions
inkle-0.010
ags-0.008
cca-0.008
 concern-0.007
ist-0.007
Token-
Token ID12
Feature activation+6.88e-02

Pos logit contributions
saw+0.069
reating+0.066
THE+0.064
 café+0.063
 coc+0.063
Neg logit contributions
inkle-0.070
cca-0.067
ist-0.059
bly-0.058
ised-0.057
TokenCh
Token ID1925
Feature activation+1.06e-02

Pos logit contributions
 anth+0.103
 café+0.102
 coc+0.098
 trail+0.097
 desert+0.093
Neg logit contributions
cca-0.085
bly-0.078
inkle-0.074
ist-0.072
Their-0.071
Tokenoo
Token ID2238
Feature activation+4.95e-02

Pos logit contributions
saw+0.017
THE+0.017
 café+0.017
fl+0.017
 trail+0.016
Neg logit contributions
inkle-0.012
ags-0.010
 concern-0.010
cca-0.010
 sincere-0.010
Token was
Token ID373
Feature activation+1.39e-02

Pos logit contributions
saw+0.065
reating+0.063
THE+0.061
 café+0.061
 coc+0.059
Neg logit contributions
inkle-0.071
cca-0.068
ist-0.061
bly-0.060
ised-0.059
Token feeling
Token ID4203
Feature activation+3.68e-02

Pos logit contributions
saw+0.015
 café+0.014
THE+0.014
fl+0.014
 ze+0.014
Neg logit contributions
inkle-0.024
 concern-0.021
ags-0.021
ist-0.021
cca-0.021
Token very
Token ID845
Feature activation+2.67e-02

Pos logit contributions
THE+0.037
saw+0.036
 café+0.036
 desert+0.036
 ze+0.035
Neg logit contributions
inkle-0.063
ist-0.059
ags-0.056
 concern-0.055
cca-0.055
Token it
Token ID340
Feature activation+3.05e-04

Pos logit contributions
THE+0.028
 trail+0.027
saw+0.027
fl+0.027
 ze+0.027
Neg logit contributions
inkle-0.038
ags-0.034
 concern-0.034
cca-0.033
ist-0.033
Token was
Token ID373
Feature activation+6.85e-04

Pos logit contributions
saw+0.000
THE+0.000
 café+0.000
fl+0.000
 trail+0.000
Neg logit contributions
inkle-0.001
 concern-0.000
cca-0.000
 sincere-0.000
ags-0.000
Token going
Token ID1016
Feature activation-1.30e-03

Pos logit contributions
THE+0.001
saw+0.001
 café+0.001
 trail+0.001
 roller+0.001
Neg logit contributions
inkle-0.001
ags-0.001
 concern-0.001
ist-0.001
cca-0.001
Token so
Token ID523
Feature activation+1.84e-02

Pos logit contributions
inkle+0.002
 concern+0.002
ags+0.002
cca+0.002
 sincere+0.002
Neg logit contributions
saw-0.001
THE-0.001
 café-0.001
fl-0.001
 trail-0.001
Token fast
Token ID3049
Feature activation+4.69e-03

Pos logit contributions
fl+0.027
 café+0.027
THE+0.026
board+0.026
saw+0.026
Neg logit contributions
inkle-0.023
 sincere-0.020
 common-0.020
cca-0.019
 concern-0.019
Token that
Token ID326
Feature activation+6.83e-02

Pos logit contributions
 trail+0.005
THE+0.005
saw+0.005
fl+0.005
 roller+0.005
Neg logit contributions
inkle-0.008
cca-0.007
 concern-0.007
ags-0.007
 sincere-0.007
Token it
Token ID340
Feature activation+1.49e-02

Pos logit contributions
 café+0.095
saw+0.094
THE+0.090
fl+0.089
actus+0.088
Neg logit contributions
inkle-0.095
 sincere-0.082
 concern-0.079
cca-0.078
 common-0.076
Token didn
Token ID1422
Feature activation+8.12e-03

Pos logit contributions
saw+0.013
THE+0.012
 café+0.012
fl+0.012
 trail+0.011
Neg logit contributions
inkle-0.029
 concern-0.025
cca-0.025
 sincere-0.025
ags-0.025
Token't
Token ID470
Feature activation+2.87e-02

Pos logit contributions
saw+0.011
THE+0.010
 café+0.010
fl+0.010
 trail+0.010
Neg logit contributions
inkle-0.012
cca-0.010
ags-0.010
 concern-0.010
 sincere-0.010
Token see
Token ID766
Feature activation+4.29e-02

Pos logit contributions
saw+0.036
fl+0.035
 café+0.035
THE+0.035
actus+0.034
Neg logit contributions
inkle-0.043
 concern-0.038
ags-0.038
 sincere-0.037
cca-0.037
Token a
Token ID257
Feature activation+1.85e-02

Pos logit contributions
saw+0.056
 café+0.055
THE+0.055
fl+0.054
 trail+0.053
Neg logit contributions
inkle-0.063
ags-0.054
 concern-0.054
 sincere-0.054
cca-0.053
Token the
Token ID262
Feature activation+2.24e-02

Pos logit contributions
saw+0.014
fl+0.014
 roller+0.014
THE+0.014
 ze+0.014
Neg logit contributions
inkle-0.025
ags-0.023
agged-0.022
 concern-0.021
 sincere-0.021
Token boy
Token ID2933
Feature activation+2.85e-02

Pos logit contributions
saw+0.019
THE+0.019
fl+0.018
 café+0.018
 ze+0.017
Neg logit contributions
inkle-0.042
 concern-0.039
 sincere-0.038
ags-0.038
cca-0.038
Token loved
Token ID6151
Feature activation+2.13e-02

Pos logit contributions
saw+0.025
THE+0.025
 café+0.024
fl+0.024
 trail+0.023
Neg logit contributions
inkle-0.053
ags-0.049
 concern-0.048
 sincere-0.047
cca-0.047
Token how
Token ID703
Feature activation+4.34e-02

Pos logit contributions
saw+0.025
THE+0.025
 café+0.025
fl+0.024
 trail+0.024
Neg logit contributions
inkle-0.034
 concern-0.029
ags-0.029
 sincere-0.029
cca-0.029
Token it
Token ID340
Feature activation+1.15e-02

Pos logit contributions
 café+0.067
reating+0.064
board+0.063
actus+0.063
 ze+0.062
Neg logit contributions
inkle-0.050
 sincere-0.049
 their-0.048
 concern-0.045
bly-0.045
Token felt
Token ID2936
Feature activation+6.82e-02

Pos logit contributions
saw+0.009
fl+0.009
 trail+0.009
THE+0.008
 café+0.008
Neg logit contributions
inkle-0.023
ags-0.021
 concern-0.021
cca-0.020
 sincere-0.020
Token when
Token ID618
Feature activation+2.04e-02

Pos logit contributions
saw+0.070
THE+0.070
 café+0.068
fl+0.067
 trail+0.066
Neg logit contributions
inkle-0.119
ags-0.105
 concern-0.105
cca-0.104
 sincere-0.103
Token it
Token ID340
Feature activation+7.48e-03

Pos logit contributions
saw+0.017
THE+0.017
 trail+0.016
fl+0.016
 roller+0.016
Neg logit contributions
inkle-0.041
ist-0.039
ags-0.038
agged-0.037
cca-0.036
Token bounced
Token ID31318
Feature activation+1.21e-02

Pos logit contributions
saw+0.006
fl+0.006
THE+0.006
 café+0.006
 trail+0.006
Neg logit contributions
inkle-0.015
ags-0.014
 concern-0.013
 sincere-0.013
cca-0.013
Token off
Token ID572
Feature activation+3.61e-02

Pos logit contributions
THE+0.012
 trail+0.012
fl+0.011
saw+0.011
 roller+0.010
Neg logit contributions
inkle-0.023
 concern-0.021
cca-0.020
 sincere-0.020
ist-0.019
Token his
Token ID465
Feature activation+5.00e-03

Pos logit contributions
THE+0.029
saw+0.028
 tree+0.027
 trail+0.027
 roller+0.027
Neg logit contributions
inkle-0.073
 concern-0.070
cca-0.069
 sincere-0.066
agged-0.065
Token He
Token ID679
Feature activation+1.70e-02

Pos logit contributions
saw+0.020
THE+0.019
 café+0.019
fl+0.019
 trail+0.018
Neg logit contributions
inkle-0.022
ags-0.018
 concern-0.018
cca-0.018
 sincere-0.018
Token loved
Token ID6151
Feature activation+3.91e-02

Pos logit contributions
THE+0.014
 café+0.014
 trail+0.013
fl+0.013
saw+0.013
Neg logit contributions
inkle-0.032
ags-0.031
 concern-0.030
 sincere-0.029
agged-0.029
Token the
Token ID262
Feature activation+3.18e-02

Pos logit contributions
saw+0.060
bye+0.057
EEP+0.054
actus+0.054
board+0.054
Neg logit contributions
 their-0.047
inkle-0.044
 sincere-0.042
 concern-0.041
ags-0.041
Token way
Token ID835
Feature activation+5.59e-02

Pos logit contributions
saw+0.032
THE+0.031
 café+0.031
fl+0.030
 trail+0.029
Neg logit contributions
inkle-0.056
 concern-0.050
ags-0.050
cca-0.049
 sincere-0.049
Token he
Token ID339
Feature activation+1.49e-02

Pos logit contributions
saw+0.071
 café+0.071
fl+0.068
THE+0.068
 ze+0.066
Neg logit contributions
inkle-0.081
ags-0.070
 sincere-0.069
 concern-0.069
bly-0.068
Token felt
Token ID2936
Feature activation+6.80e-02

Pos logit contributions
 café+0.010
saw+0.010
THE+0.010
 trail+0.010
fl+0.010
Neg logit contributions
inkle-0.031
ags-0.029
 concern-0.028
 sincere-0.028
cca-0.028
Token getting
Token ID1972
Feature activation+3.89e-02

Pos logit contributions
saw+0.085
THE+0.084
 café+0.083
fl+0.082
 trail+0.080
Neg logit contributions
inkle-0.103
ags-0.089
 concern-0.089
cca-0.088
 sincere-0.088
Token all
Token ID477
Feature activation+5.65e-02

Pos logit contributions
saw+0.047
THE+0.046
 café+0.045
fl+0.045
 trail+0.044
Neg logit contributions
inkle-0.061
 sincere-0.053
ags-0.053
cca-0.053
 concern-0.053
Token clean
Token ID3424
Feature activation+3.40e-02

Pos logit contributions
saw+0.078
 café+0.075
THE+0.074
 trail+0.073
actus+0.073
Neg logit contributions
inkle-0.078
ags-0.069
bly-0.066
cca-0.066
 concern-0.066
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
fl+0.029
THE+0.027
 trail+0.027
 café+0.026
 roller+0.024
Neg logit contributions
inkle-0.067
 concern-0.064
 sincere-0.062
ags-0.062
agged-0.061
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
saw+0.023
 trail+0.022
 café+0.022
THE+0.022
fl+0.022
Neg logit contributions
inkle-0.018
ags-0.016
 concern-0.015
 sincere-0.015
cca-0.015
Token much
Token ID881
Feature activation+4.51e-02

Pos logit contributions
 café+0.084
fl+0.080
actus+0.077
 desert+0.077
reating+0.076
Neg logit contributions
inkle-0.057
 deeply-0.047
ised-0.046
bly-0.045
ither-0.045
Token that
Token ID326
Feature activation+5.08e-02

Pos logit contributions
 café+0.069
actus+0.067
saw+0.067
fl+0.067
rav+0.067
Neg logit contributions
inkle-0.052
ist-0.045
 deeply-0.044
ags-0.042
bly-0.042
Token she
Token ID673
Feature activation+4.95e-02

Pos logit contributions
 café+0.071
actus+0.070
saw+0.070
unn+0.070
THE+0.069
Neg logit contributions
inkle-0.067
 concern-0.057
 sincere-0.056
cca-0.053
 deeply-0.052
Token started
Token ID2067
Feature activation+7.10e-02

Pos logit contributions
 café+0.061
reating+0.060
THE+0.057
saw+0.057
fl+0.056
Neg logit contributions
inkle-0.078
cca-0.069
ist-0.068
 sincere-0.066
bly-0.065
Token to
Token ID284
Feature activation+3.36e-02

Pos logit contributions
actus+0.090
 café+0.086
THE+0.086
bye+0.084
unn+0.081
Neg logit contributions
 their-0.099
inkle-0.094
 an-0.090
 concern-0.087
 sincere-0.087
Token feel
Token ID1254
Feature activation+6.75e-02

Pos logit contributions
saw+0.039
THE+0.038
 café+0.038
fl+0.038
 trail+0.037
Neg logit contributions
inkle-0.054
ags-0.047
 concern-0.047
cca-0.047
 sincere-0.046
Token dizz
Token ID29527
Feature activation+5.53e-03

Pos logit contributions
fl+0.114
saw+0.113
board+0.113
 café+0.112
bye+0.110
Neg logit contributions
inkle-0.074
 sincere-0.065
 concern-0.064
 deeply-0.064
 their-0.060
Tokeny
Token ID88
Feature activation+3.43e-02

Pos logit contributions
 café+0.007
saw+0.007
THE+0.007
 trail+0.007
 roller+0.006
Neg logit contributions
inkle-0.009
ags-0.007
ist-0.007
cca-0.007
bly-0.007
Token,
Token ID11
Feature activation+1.96e-02

Pos logit contributions
THE+0.033
saw+0.033
fl+0.032
 trail+0.032
 café+0.032
Neg logit contributions
inkle-0.062
ags-0.055
 concern-0.055
cca-0.055
 sincere-0.054
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
fl+0.021
 trail+0.021
THE+0.020
 ze+0.020
saw+0.020
Neg logit contributions
inkle-0.035
agged-0.032
ags-0.031
cca-0.031
bly-0.030
Token when
Token ID618
Feature activation+1.65e-02

Pos logit contributions
saw+0.020
THE+0.020
 café+0.020
fl+0.020
 trail+0.019
Neg logit contributions
inkle-0.027
ags-0.024
 concern-0.024
cca-0.024
 sincere-0.024
Token spun
Token ID26843
Feature activation+1.04e-02

Pos logit contributions
THE+0.037
 café+0.037
saw+0.037
fl+0.035
reating+0.035
Neg logit contributions
inkle-0.065
cca-0.058
 sincere-0.057
 concern-0.057
ist-0.057
Token so
Token ID523
Feature activation+5.27e-02

Pos logit contributions
saw+0.016
 café+0.016
actus+0.015
THE+0.015
bye+0.015
Neg logit contributions
inkle-0.012
cca-0.011
ags-0.011
ist-0.010
 concern-0.010
Token fast
Token ID3049
Feature activation+2.20e-02

Pos logit contributions
 café+0.086
fl+0.084
reating+0.081
THE+0.080
actus+0.078
Neg logit contributions
inkle-0.060
 deeply-0.050
ither-0.048
cca-0.048
ised-0.047
Token that
Token ID326
Feature activation+4.50e-02

Pos logit contributions
saw+0.028
THE+0.027
 café+0.027
fl+0.026
 trail+0.026
Neg logit contributions
inkle-0.033
ags-0.029
 concern-0.029
cca-0.028
 sincere-0.028
Token he
Token ID339
Feature activation+3.71e-02

Pos logit contributions
saw+0.056
THE+0.056
 café+0.055
fl+0.054
actus+0.053
Neg logit contributions
inkle-0.066
 concern-0.057
 sincere-0.057
cca-0.056
ags-0.055
Token felt
Token ID2936
Feature activation+6.70e-02

Pos logit contributions
 café+0.038
THE+0.037
saw+0.037
reating+0.036
fl+0.035
Neg logit contributions
inkle-0.065
cca-0.058
ist-0.057
 sincere-0.056
 concern-0.055
Token dizz
Token ID29527
Feature activation-6.65e-03

Pos logit contributions
saw+0.110
 café+0.109
fl+0.109
actus+0.106
bye+0.105
Neg logit contributions
inkle-0.074
 sincere-0.063
 concern-0.061
cca-0.060
bly-0.060
Tokeny
Token ID88
Feature activation+2.44e-02

Pos logit contributions
inkle+0.012
 concern+0.011
 sincere+0.011
ags+0.011
cca+0.011
Neg logit contributions
saw-0.006
fl-0.006
THE-0.006
 café-0.006
bye-0.006
Token and
Token ID290
Feature activation+2.64e-02

Pos logit contributions
THE+0.023
 trail+0.022
fl+0.022
saw+0.022
 ze+0.021
Neg logit contributions
inkle-0.046
 concern-0.041
ags-0.041
agged-0.041
 sincere-0.040
Token happy
Token ID3772
Feature activation+5.85e-03

Pos logit contributions
saw+0.033
THE+0.032
 café+0.032
fl+0.031
 trail+0.031
Neg logit contributions
inkle-0.040
 concern-0.035
ags-0.035
 sincere-0.035
cca-0.035
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
saw+0.007
THE+0.006
 café+0.006
fl+0.006
 trail+0.006
Neg logit contributions
inkle-0.010
ags-0.008
 concern-0.008
 sincere-0.008
cca-0.008
Token boy
Token ID2933
Feature activation+3.78e-02

Pos logit contributions
THE+0.007
 café+0.007
fl+0.007
reating+0.007
actus+0.007
Neg logit contributions
inkle-0.015
cca-0.014
 concern-0.014
 sincere-0.014
ags-0.014
Token grabbed
Token ID13176
Feature activation+7.82e-03

Pos logit contributions
THE+0.047
saw+0.047
 café+0.046
fl+0.045
reating+0.045
Neg logit contributions
inkle-0.055
ist-0.049
cca-0.049
 concern-0.047
ags-0.046
Token the
Token ID262
Feature activation+3.22e-02

Pos logit contributions
 roller+0.009
saw+0.008
 ze+0.008
 slides+0.007
 night+0.007
Neg logit contributions
inkle-0.014
 sincere-0.014
ist-0.013
cca-0.012
agged-0.012
Token rocket
Token ID10701
Feature activation+1.20e-02

Pos logit contributions
saw+0.044
THE+0.043
fl+0.042
 café+0.042
 trail+0.042
Neg logit contributions
inkle-0.045
cca-0.039
 concern-0.039
 sincere-0.038
ags-0.038
Token and
Token ID290
Feature activation+4.33e-02

Pos logit contributions
 trail+0.014
fl+0.014
saw+0.014
THE+0.013
 café+0.013
Neg logit contributions
inkle-0.020
 sincere-0.018
agged-0.018
ags-0.018
 concern-0.017
Token started
Token ID2067
Feature activation+6.68e-02

Pos logit contributions
saw+0.045
THE+0.045
 café+0.045
fl+0.043
actus+0.043
Neg logit contributions
inkle-0.072
cca-0.065
 concern-0.064
ist-0.063
ags-0.063
Token to
Token ID284
Feature activation+4.82e-02

Pos logit contributions
actus+0.104
bye+0.093
asis+0.091
unn+0.091
 camel+0.090
Neg logit contributions
 their-0.092
 together-0.079
itched-0.078
 an-0.077
inkle-0.075
Token spin
Token ID7906
Feature activation+2.45e-02

Pos logit contributions
actus+0.068
 café+0.065
lan+0.064
THE+0.064
unn+0.063
Neg logit contributions
inkle-0.066
ist-0.058
 concern-0.058
ags-0.056
 sincere-0.054
Token around
Token ID1088
Feature activation+2.89e-02

Pos logit contributions
 café+0.034
saw+0.033
actus+0.031
unn+0.031
bye+0.031
Neg logit contributions
inkle-0.035
ags-0.030
ist-0.029
cca-0.029
bly-0.029
Token and
Token ID290
Feature activation+4.59e-02

Pos logit contributions
saw+0.031
THE+0.031
fl+0.030
 café+0.030
 trail+0.030
Neg logit contributions
inkle-0.049
 concern-0.043
ags-0.043
cca-0.043
 sincere-0.042
Token around
Token ID1088
Feature activation+2.59e-02

Pos logit contributions
 café+0.062
THE+0.061
actus+0.061
saw+0.060
unn+0.058
Neg logit contributions
inkle-0.063
 sincere-0.053
 concern-0.053
ist-0.053
ags-0.053
Token loved
Token ID6151
Feature activation+3.29e-02

Pos logit contributions
saw+0.026
 café+0.025
reating+0.024
THE+0.023
lan+0.023
Neg logit contributions
inkle-0.035
 sincere-0.030
cca-0.030
 concern-0.030
ist-0.030
Token how
Token ID703
Feature activation+4.54e-02

Pos logit contributions
saw+0.035
THE+0.034
 café+0.034
 trail+0.032
bye+0.032
Neg logit contributions
inkle-0.055
 concern-0.050
 sincere-0.050
ags-0.049
cca-0.049
Token it
Token ID340
Feature activation+1.29e-02

Pos logit contributions
 café+0.071
 trail+0.067
saw+0.067
THE+0.067
board+0.066
Neg logit contributions
inkle-0.055
 sincere-0.049
 concern-0.048
bly-0.048
cca-0.046
Token made
Token ID925
Feature activation+2.71e-02

Pos logit contributions
fl+0.012
saw+0.011
 trail+0.011
THE+0.011
 café+0.010
Neg logit contributions
inkle-0.024
ags-0.023
 sincere-0.022
cca-0.022
 concern-0.022
Token her
Token ID607
Feature activation+1.57e-02

Pos logit contributions
fl+0.031
reating+0.030
 night+0.028
ups+0.027
saw+0.027
Neg logit contributions
inkle-0.044
agged-0.043
 sincere-0.041
ags-0.040
cca-0.040
Token feel
Token ID1254
Feature activation+6.64e-02

Pos logit contributions
saw+0.021
fl+0.021
THE+0.021
 café+0.021
 trail+0.020
Neg logit contributions
inkle-0.022
ags-0.019
cca-0.019
 concern-0.019
 sincere-0.019
Token special
Token ID2041
Feature activation+3.17e-03

Pos logit contributions
saw+0.086
 café+0.083
fl+0.081
THE+0.081
 ze+0.079
Neg logit contributions
inkle-0.100
 sincere-0.086
 concern-0.086
ags-0.085
cca-0.084
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
fl+0.003
 trail+0.003
THE+0.003
 café+0.003
bye+0.003
Neg logit contributions
inkle-0.006
ags-0.005
agged-0.005
cca-0.005
bly-0.005
Token safe
Token ID3338
Feature activation+1.76e-02

Pos logit contributions
THE+0.021
saw+0.021
fl+0.020
 café+0.020
 trail+0.020
Neg logit contributions
inkle-0.026
ags-0.023
cca-0.022
 concern-0.022
 sincere-0.022
Token.
Token ID13
Feature activation+5.46e-03

Pos logit contributions
fl+0.015
 trail+0.013
THE+0.012
reating+0.011
 café+0.011
Neg logit contributions
inkle-0.037
agged-0.034
cca-0.034
ags-0.033
 concern-0.033
Token<|endoftext|>
Token ID50256
Feature activation+1.51e-02

Pos logit contributions
saw+0.010
 trail+0.010
 sur+0.010
 roller+0.010
 café+0.010
Neg logit contributions
 Their-0.004
inkle-0.004
 their-0.004
 deeply-0.004
 sincere-0.004
Token Cub
Token ID7070
Feature activation+7.44e-04

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
cca+0.004
 sincere+0.004
Neg logit contributions
saw-0.005
THE-0.005
 café-0.005
fl-0.005
 trail-0.005
Tokeney
Token ID2959
Feature activation+1.40e-02

Pos logit contributions
fl+0.001
THE+0.001
 café+0.001
saw+0.001
actus+0.001
Neg logit contributions
inkle-0.001
ags-0.001
 concern-0.001
 sincere-0.001
bly-0.001
Token loved
Token ID6151
Feature activation+3.30e-02

Pos logit contributions
saw+0.018
THE+0.018
fl+0.017
 café+0.017
 trail+0.017
Neg logit contributions
inkle-0.021
ags-0.018
 concern-0.018
cca-0.018
 sincere-0.018
Token to
Token ID284
Feature activation+6.62e-03

Pos logit contributions
saw+0.045
pillar+0.044
THE+0.043
rav+0.043
ha+0.043
Neg logit contributions
 their-0.047
ags-0.042
 his-0.040
 Their-0.039
 an-0.038
Token spin
Token ID7906
Feature activation+1.03e-02

Pos logit contributions
saw+0.005
 trail+0.005
 café+0.005
 roller+0.005
fl+0.005
Neg logit contributions
inkle-0.013
cca-0.012
 sincere-0.011
 concern-0.011
ags-0.011
Token all
Token ID477
Feature activation+6.63e-02

Pos logit contributions
saw+0.015
 café+0.015
THE+0.015
fl+0.015
bye+0.014
Neg logit contributions
inkle-0.013
ags-0.011
cca-0.011
 sincere-0.011
ist-0.011
Token day
Token ID1110
Feature activation+5.21e-02

Pos logit contributions
 café+0.080
actus+0.074
 trail+0.070
lan+0.069
 camel+0.067
Neg logit contributions
inkle-0.107
ags-0.103
 their-0.101
 common-0.092
ist-0.089
Token long
Token ID890
Feature activation+2.63e-02

Pos logit contributions
 trail+0.058
THE+0.057
saw+0.057
 café+0.056
fl+0.055
Neg logit contributions
inkle-0.084
 concern-0.077
 sincere-0.076
cca-0.075
ags-0.075
Token.
Token ID13
Feature activation-2.61e-04

Pos logit contributions
THE+0.021
fl+0.019
saw+0.019
 ze+0.019
 trail+0.019
Neg logit contributions
inkle-0.052
 concern-0.047
cca-0.047
 sincere-0.046
ags-0.046
Token He
Token ID679
Feature activation+3.10e-02

Pos logit contributions
inkle+0.000
ags+0.000
 concern+0.000
 sincere+0.000
cca+0.000
Neg logit contributions
saw-0.000
THE-0.000
 café-0.000
fl-0.000
 trail-0.000
Token would
Token ID561
Feature activation+3.47e-02

Pos logit contributions
 café+0.043
reating+0.042
saw+0.040
lan+0.039
actus+0.038
Neg logit contributions
inkle-0.043
cca-0.037
ist-0.037
irl-0.036
 their-0.036
Token said
Token ID531
Feature activation+8.76e-03

Pos logit contributions
THE+0.023
reating+0.023
saw+0.023
fl+0.023
 café+0.022
Neg logit contributions
inkle-0.024
cca-0.023
 sincere-0.023
 concern-0.022
 Their-0.021
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
 café+0.013
saw+0.013
THE+0.013
fl+0.013
 ze+0.013
Neg logit contributions
inkle-0.011
 sincere-0.010
 concern-0.009
 Their-0.009
 their-0.009
Token "
Token ID366
Feature activation+3.26e-02

Pos logit contributions
saw+0.016
THE+0.016
 café+0.015
fl+0.015
 trail+0.015
Neg logit contributions
inkle-0.030
ags-0.026
 concern-0.026
cca-0.026
 sincere-0.026
TokenWow
Token ID22017
Feature activation+1.64e-02

Pos logit contributions
unn+0.047
 trail+0.044
 ze+0.044
lan+0.043
 camel+0.043
Neg logit contributions
cca-0.039
inkle-0.038
 concern-0.035
ags-0.034
bly-0.034
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
 café+0.029
saw+0.029
THE+0.027
reating+0.027
lan+0.027
Neg logit contributions
inkle-0.019
ist-0.014
ags-0.014
ither-0.014
ap-0.014
Token that
Token ID326
Feature activation+6.63e-02

Pos logit contributions
saw+0.020
THE+0.020
fl+0.020
 café+0.020
 trail+0.019
Neg logit contributions
inkle-0.021
 concern-0.018
 sincere-0.018
ags-0.017
cca-0.017
Token dress
Token ID6576
Feature activation+2.96e-02

Pos logit contributions
lan+0.109
saw+0.109
unn+0.107
 café+0.101
THE+0.101
Neg logit contributions
inkle-0.078
ist-0.057
cca-0.054
 their-0.054
 sincere-0.054
Token is
Token ID318
Feature activation+1.14e-02

Pos logit contributions
THE+0.033
saw+0.033
 café+0.032
fl+0.031
lan+0.031
Neg logit contributions
inkle-0.050
ist-0.043
ags-0.042
 sincere-0.041
bly-0.040
Token so
Token ID523
Feature activation+3.22e-02

Pos logit contributions
saw+0.016
 café+0.015
fl+0.015
THE+0.015
actus+0.015
Neg logit contributions
inkle-0.016
 sincere-0.013
ags-0.013
cca-0.013
 concern-0.013
Token fancy
Token ID14996
Feature activation+3.37e-02

Pos logit contributions
fl+0.052
saw+0.051
 café+0.050
THE+0.050
 ze+0.049
Neg logit contributions
inkle-0.038
 sincere-0.031
bly-0.030
 common-0.029
ags-0.029
Token!
Token ID0
Feature activation+4.05e-02

Pos logit contributions
saw+0.034
THE+0.032
 café+0.032
actus+0.030
 trail+0.029
Neg logit contributions
inkle-0.057
ist-0.053
bly-0.050
ags-0.050
 deeply-0.049
Token said
Token ID531
Feature activation+3.01e-04

Pos logit contributions
THE+0.034
actus+0.033
 café+0.033
saw+0.032
reating+0.032
Neg logit contributions
 their-0.026
 sincere-0.026
inkle-0.025
 Their-0.025
cca-0.025
Token,
Token ID11
Feature activation+1.89e-02

Pos logit contributions
saw+0.000
THE+0.000
 café+0.000
fl+0.000
 trail+0.000
Neg logit contributions
inkle-0.000
 sincere-0.000
 concern-0.000
cca-0.000
ags-0.000
Token "
Token ID366
Feature activation+3.04e-02

Pos logit contributions
saw+0.020
THE+0.020
 café+0.019
fl+0.019
 trail+0.019
Neg logit contributions
inkle-0.032
 sincere-0.028
 concern-0.028
cca-0.028
ags-0.028
TokenWow
Token ID22017
Feature activation+1.75e-02

Pos logit contributions
unn+0.051
 trail+0.044
lan+0.043
 ze+0.042
reating+0.041
Neg logit contributions
cca-0.034
Their-0.028
 afar-0.028
ised-0.028
 concern-0.027
Token,
Token ID11
Feature activation+1.08e-02

Pos logit contributions
 café+0.030
saw+0.030
THE+0.028
reating+0.028
fl+0.028
Neg logit contributions
inkle-0.021
 sincere-0.016
ist-0.016
ags-0.015
ither-0.015
Token that
Token ID326
Feature activation+6.53e-02

Pos logit contributions
saw+0.014
THE+0.014
fl+0.014
 café+0.014
 trail+0.013
Neg logit contributions
inkle-0.015
 concern-0.013
 sincere-0.013
ags-0.013
cca-0.013
Token's
Token ID338
Feature activation-4.84e-03

Pos logit contributions
unn+0.097
 café+0.097
lan+0.096
saw+0.092
reating+0.091
Neg logit contributions
inkle-0.088
 sincere-0.071
ist-0.069
bly-0.068
 their-0.068
Token a
Token ID257
Feature activation+2.73e-02

Pos logit contributions
inkle+0.007
 concern+0.006
cca+0.006
ags+0.006
 sincere+0.006
Neg logit contributions
saw-0.007
THE-0.007
fl-0.006
 café-0.006
 trail-0.006
Token big
Token ID1263
Feature activation+1.80e-02

Pos logit contributions
unn+0.045
 café+0.043
fl+0.042
saw+0.042
reating+0.042
Neg logit contributions
inkle-0.033
 sincere-0.029
 common-0.026
bly-0.025
 concern-0.025
Token orange
Token ID10912
Feature activation+3.52e-02

Pos logit contributions
saw+0.022
THE+0.021
 café+0.021
fl+0.021
 trail+0.020
Neg logit contributions
inkle-0.028
 sincere-0.024
ags-0.024
 concern-0.024
cca-0.024
Token drum
Token ID13026
Feature activation+2.29e-02

Pos logit contributions
saw+0.044
 café+0.043
THE+0.042
fl+0.041
actus+0.039
Neg logit contributions
inkle-0.054
ist-0.046
ags-0.046
 sincere-0.045
 concern-0.045
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
saw+0.006
THE+0.006
 café+0.005
 trail+0.005
fl+0.005
Neg logit contributions
inkle-0.008
cca-0.007
ist-0.007
 concern-0.007
 sincere-0.007
Token water
Token ID1660
Feature activation+1.26e-02

Pos logit contributions
saw+0.015
reating+0.015
THE+0.014
rav+0.014
fl+0.014
Neg logit contributions
inkle-0.025
ags-0.022
 sincere-0.022
 concern-0.021
cca-0.021
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
 café+0.012
 trail+0.011
fl+0.011
THE+0.011
saw+0.011
Neg logit contributions
inkle-0.024
 concern-0.022
cca-0.021
ags-0.021
 sincere-0.021
Token enjoying
Token ID13226
Feature activation-3.68e-03

Pos logit contributions
 ze+0.015
 trail+0.014
 slides+0.014
fl+0.014
saw+0.013
Neg logit contributions
inkle-0.033
cca-0.030
agged-0.030
bly-0.029
ags-0.029
Token the
Token ID262
Feature activation+1.98e-02

Pos logit contributions
inkle+0.006
ist+0.005
cca+0.005
 concern+0.005
ags+0.005
Neg logit contributions
fl-0.005
 ze-0.005
THE-0.005
 trail-0.005
 roller-0.005
Token feel
Token ID1254
Feature activation+6.50e-02

Pos logit contributions
saw+0.021
THE+0.020
 café+0.019
reating+0.019
fl+0.018
Neg logit contributions
inkle-0.035
ags-0.030
 concern-0.030
 sincere-0.030
cca-0.029
Token of
Token ID286
Feature activation+1.28e-02

Pos logit contributions
saw+0.086
 café+0.084
fl+0.083
THE+0.082
 trail+0.081
Neg logit contributions
inkle-0.095
ags-0.082
 concern-0.081
 sincere-0.081
cca-0.080
Token the
Token ID262
Feature activation+2.52e-02

Pos logit contributions
THE+0.015
 ze+0.015
saw+0.015
fl+0.015
 trail+0.014
Neg logit contributions
inkle-0.021
ist-0.018
cca-0.018
 sincere-0.017
 concern-0.017
Token cool
Token ID3608
Feature activation+2.44e-02

Pos logit contributions
saw+0.026
THE+0.025
 café+0.024
reating+0.023
fl+0.023
Neg logit contributions
inkle-0.045
 concern-0.039
ags-0.039
 sincere-0.039
cca-0.038
Token water
Token ID1660
Feature activation+1.58e-02

Pos logit contributions
saw+0.025
THE+0.024
 café+0.022
reating+0.021
 roller+0.020
Neg logit contributions
inkle-0.044
ags-0.039
 concern-0.038
 sincere-0.037
cca-0.037
Token against
Token ID1028
Feature activation+1.53e-03

Pos logit contributions
fl+0.014
 trail+0.013
THE+0.013
saw+0.012
 café+0.012
Neg logit contributions
inkle-0.030
cca-0.029
 concern-0.028
ags-0.028
agged-0.027
Token.
Token ID13
Feature activation-4.62e-02

Pos logit contributions
saw+0.011
THE+0.011
 café+0.011
fl+0.011
 trail+0.010
Neg logit contributions
inkle-0.017
ags-0.015
 concern-0.015
cca-0.015
 sincere-0.015
Token
Token ID220
Feature activation-2.83e-03

Pos logit contributions
inkle+0.072
cca+0.062
bly+0.061
 concern+0.060
ags+0.060
Neg logit contributions
saw-0.060
THE-0.058
 café-0.057
fl-0.057
bye-0.055
Token She
Token ID1375
Feature activation-3.54e-04

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
cca+0.004
 sincere+0.004
Neg logit contributions
saw-0.003
THE-0.003
 café-0.003
fl-0.003
 trail-0.003
Token had
Token ID550
Feature activation-3.75e-03

Pos logit contributions
inkle+0.001
ags+0.001
bly+0.001
 concern+0.001
agged+0.001
Neg logit contributions
THE-0.000
 café-0.000
fl-0.000
saw-0.000
 trail-0.000
Token a
Token ID257
Feature activation-2.63e-02

Pos logit contributions
inkle+0.005
ags+0.004
ist+0.004
 sincere+0.004
 concern+0.004
Neg logit contributions
saw-0.006
THE-0.006
fl-0.006
 café-0.006
 trail-0.005
Token deep
Token ID2769
Feature activation-7.35e-02

Pos logit contributions
inkle+0.047
agged+0.046
ags+0.046
bly+0.044
ist+0.044
Neg logit contributions
 trail-0.028
 café-0.026
 roller-0.025
 desert-0.025
 tree-0.024
Token need
Token ID761
Feature activation+1.16e-02

Pos logit contributions
bly+0.144
 Their+0.136
Their+0.135
agged+0.135
cca+0.133
Neg logit contributions
 trail-0.070
 night-0.068
 desert-0.067
 day-0.066
 tree-0.065
Token to
Token ID284
Feature activation-8.70e-03

Pos logit contributions
THE+0.008
 café+0.008
 trail+0.008
bye+0.008
fl+0.008
Neg logit contributions
inkle-0.024
 sincere-0.022
ags-0.022
cca-0.021
 concern-0.021
Token explore
Token ID7301
Feature activation-1.59e-03

Pos logit contributions
agged+0.018
inkle+0.018
cca+0.017
ags+0.017
ishment+0.016
Neg logit contributions
 trail-0.007
 tree-0.006
 café-0.006
 roller-0.006
 stream-0.005
Token the
Token ID262
Feature activation-6.02e-03

Pos logit contributions
inkle+0.003
agged+0.003
 sincere+0.003
 concern+0.003
cca+0.003
Neg logit contributions
THE-0.001
 trail-0.001
 café-0.001
fl-0.001
 desert-0.001
Token unknown
Token ID6439
Feature activation-1.54e-02

Pos logit contributions
inkle+0.012
 sincere+0.011
ags+0.011
 concern+0.011
cca+0.011
Neg logit contributions
saw-0.004
THE-0.004
 café-0.004
fl-0.004
 ze-0.004
Token The
Token ID383
Feature activation+1.50e-02

Pos logit contributions
inkle+0.021
cca+0.018
bly+0.018
ist+0.018
ags+0.018
Neg logit contributions
saw-0.014
 café-0.013
 roller-0.012
board-0.012
 Merry-0.011
Token bar
Token ID2318
Feature activation+1.29e-02

Pos logit contributions
saw+0.014
THE+0.014
fl+0.014
 trail+0.013
 café+0.013
Neg logit contributions
inkle-0.027
 concern-0.024
 sincere-0.024
cca-0.024
ags-0.024
Tokenber
Token ID527
Feature activation+8.41e-03

Pos logit contributions
saw+0.006
board+0.005
 café+0.005
 sur+0.005
lan+0.005
Neg logit contributions
inkle-0.031
ags-0.030
 concern-0.028
ither-0.027
bly-0.027
Token had
Token ID550
Feature activation+4.10e-04

Pos logit contributions
 café+0.006
board+0.005
saw+0.005
 roller+0.005
 would+0.004
Neg logit contributions
ags-0.019
inkle-0.019
bly-0.017
agged-0.017
ither-0.017
Token a
Token ID257
Feature activation-2.76e-02

Pos logit contributions
 café+0.001
THE+0.001
saw+0.001
 roller+0.001
 ze+0.001
Neg logit contributions
inkle-0.001
ags-0.001
ist-0.001
bly-0.000
 concern-0.000
Token deep
Token ID2769
Feature activation-6.91e-02

Pos logit contributions
inkle+0.049
ags+0.047
agged+0.047
bly+0.046
ist+0.046
Neg logit contributions
 café-0.029
 roller-0.029
 trail-0.027
saw-0.025
 pi-0.025
Token voice
Token ID3809
Feature activation-1.07e-02

Pos logit contributions
bly+0.130
hips+0.121
 Their+0.119
Their+0.118
ither+0.118
Neg logit contributions
 roller-0.064
 café-0.061
 desert-0.058
 trail-0.058
 pine-0.057
Token and
Token ID290
Feature activation-7.93e-05

Pos logit contributions
inkle+0.021
ags+0.020
agged+0.019
cca+0.019
ither+0.019
Neg logit contributions
 café-0.010
saw-0.009
 roller-0.009
fl-0.009
 ze-0.009
Token Tim
Token ID5045
Feature activation-9.57e-04

Pos logit contributions
ags+0.000
inkle+0.000
agged+0.000
ither+0.000
aying+0.000
Neg logit contributions
 ze-0.000
 would-0.000
saw-0.000
 l-0.000
 could-0.000
Tokenmy
Token ID1820
Feature activation+1.27e-03

Pos logit contributions
inkle+0.002
 concern+0.002
ags+0.002
cca+0.002
 sincere+0.002
Neg logit contributions
saw-0.001
THE-0.001
 café-0.001
fl-0.001
 trail-0.001
Token was
Token ID373
Feature activation+1.59e-03

Pos logit contributions
 café+0.001
fl+0.001
 trail+0.001
 ze+0.001
THE+0.001
Neg logit contributions
ags-0.003
inkle-0.003
agged-0.003
bly-0.003
ither-0.002
Token bed
Token ID3996
Feature activation-4.60e-03

Pos logit contributions
saw+0.009
 café+0.009
THE+0.009
 trail+0.009
fl+0.009
Neg logit contributions
inkle-0.018
cca-0.016
ist-0.016
bly-0.016
agged-0.016
Token.
Token ID13
Feature activation-5.33e-03

Pos logit contributions
inkle+0.008
 concern+0.008
cca+0.008
ags+0.008
 common+0.007
Neg logit contributions
 café-0.004
 trail-0.004
THE-0.003
actus-0.003
board-0.003
Token She
Token ID1375
Feature activation-9.66e-03

Pos logit contributions
inkle+0.007
 concern+0.006
ags+0.006
 sincere+0.006
cca+0.006
Neg logit contributions
saw-0.008
THE-0.008
 café-0.008
fl-0.008
 trail-0.008
Token dreamed
Token ID27947
Feature activation-1.25e-02

Pos logit contributions
inkle+0.018
ags+0.018
bly+0.017
agged+0.016
ither+0.016
Neg logit contributions
 café-0.009
 trail-0.008
THE-0.008
 ze-0.008
 would-0.008
Token of
Token ID286
Feature activation-2.09e-02

Pos logit contributions
inkle+0.024
cca+0.022
 concern+0.022
 sincere+0.022
ags+0.021
Neg logit contributions
THE-0.010
saw-0.010
 café-0.010
 trail-0.009
 roller-0.009
Token mer
Token ID4017
Feature activation-6.89e-02

Pos logit contributions
inkle+0.045
cca+0.043
ist+0.042
agged+0.041
immer+0.040
Neg logit contributions
 ze-0.017
 tree-0.016
 trail-0.016
 slides-0.015
 desert-0.014
Tokenmaid
Token ID23151
Feature activation-6.91e-03

Pos logit contributions
inkle+0.098
 concern+0.097
 sympathy+0.091
cca+0.088
ags+0.086
Neg logit contributions
actus-0.085
saw-0.081
fl-0.081
rav-0.077
 sur-0.074
Tokens
Token ID82
Feature activation-1.17e-02

Pos logit contributions
inkle+0.013
ags+0.011
 concern+0.011
 sincere+0.011
cca+0.011
Neg logit contributions
fl-0.007
THE-0.007
 café-0.007
saw-0.007
 ze-0.007
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
inkle+0.021
cca+0.019
 concern+0.019
bly+0.019
ags+0.019
Neg logit contributions
THE-0.012
 trail-0.012
fl-0.011
 café-0.011
 ze-0.011
Token pirates
Token ID27516
Feature activation-3.53e-02

Pos logit contributions
inkle+0.050
immer+0.044
ags+0.044
agged+0.044
cca+0.043
Neg logit contributions
 ze-0.026
fl-0.025
 trail-0.025
 night-0.024
 would-0.024
Token,
Token ID11
Feature activation-2.64e-02

Pos logit contributions
inkle+0.072
 concern+0.067
bly+0.065
ags+0.065
ised+0.064
Neg logit contributions
THE-0.028
fl-0.027
 trail-0.027
 would-0.026
bye-0.025
Token father
Token ID2988
Feature activation-4.19e-03

Pos logit contributions
saw+0.001
THE+0.001
 café+0.001
fl+0.001
 trail+0.001
Neg logit contributions
inkle-0.001
 concern-0.001
 sincere-0.001
cca-0.001
ags-0.001
Token taught
Token ID7817
Feature activation-1.28e-02

Pos logit contributions
ags+0.009
inkle+0.009
agged+0.008
 common+0.008
ither+0.008
Neg logit contributions
saw-0.003
 café-0.003
 ze-0.003
board-0.003
 ostr-0.003
Token his
Token ID465
Feature activation-1.05e-02

Pos logit contributions
inkle+0.023
bly+0.019
 sincere+0.019
agged+0.019
ags+0.018
Neg logit contributions
 ze-0.015
reating-0.015
 café-0.014
fl-0.014
ups-0.014
Token child
Token ID1200
Feature activation-1.68e-02

Pos logit contributions
inkle+0.017
ags+0.015
 sincere+0.014
 concern+0.014
cca+0.014
Neg logit contributions
saw-0.013
THE-0.012
 café-0.012
fl-0.012
 trail-0.012
Token that
Token ID326
Feature activation+6.72e-03

Pos logit contributions
inkle+0.033
ags+0.029
bly+0.028
agged+0.028
 sincere+0.026
Neg logit contributions
 ze-0.017
 café-0.017
saw-0.017
ups-0.016
reating-0.016
Token caves
Token ID30923
Feature activation-6.88e-02

Pos logit contributions
fl+0.006
 trail+0.006
 desert+0.006
 ze+0.006
 night+0.005
Neg logit contributions
inkle-0.014
ags-0.014
agged-0.014
cca-0.012
bly-0.012
Token can
Token ID460
Feature activation-1.94e-02

Pos logit contributions
cca+0.151
inkle+0.150
agged+0.148
ags+0.148
ishment+0.142
Neg logit contributions
 would-0.049
 trail-0.043
 steps-0.035
 could-0.034
 tree-0.033
Token be
Token ID307
Feature activation-3.51e-02

Pos logit contributions
agged+0.043
ags+0.041
cca+0.040
inkle+0.038
 blonde+0.038
Neg logit contributions
 trail-0.018
 stream-0.015
 would-0.014
 tree-0.014
 hill-0.013
Token friendly
Token ID8030
Feature activation-2.68e-02

Pos logit contributions
inkle+0.068
ags+0.063
agged+0.063
 concern+0.060
bly+0.058
Neg logit contributions
 trail-0.031
 ze-0.030
 would-0.029
 desert-0.029
 stream-0.029
Token and
Token ID290
Feature activation-2.56e-02

Pos logit contributions
inkle+0.054
ags+0.047
agged+0.046
 sincere+0.045
cca+0.045
Neg logit contributions
fl-0.025
 trail-0.024
THE-0.023
 café-0.023
 ze-0.022
Token safe
Token ID3338
Feature activation-2.22e-02

Pos logit contributions
inkle+0.048
ags+0.044
agged+0.044
cca+0.043
ist+0.041
Neg logit contributions
 trail-0.027
 would-0.025
THE-0.025
fl-0.024
saw-0.024
Token a
Token ID257
Feature activation+6.09e-03

Pos logit contributions
inkle+0.015
ags+0.012
cca+0.012
 concern+0.012
 sincere+0.012
Neg logit contributions
saw-0.029
fl-0.028
THE-0.028
 café-0.028
 trail-0.028
Token little
Token ID1310
Feature activation-2.15e-03

Pos logit contributions
fl+0.008
THE+0.008
reating+0.008
saw+0.008
 café+0.007
Neg logit contributions
inkle-0.009
 sincere-0.008
cca-0.007
 concern-0.007
 common-0.007
Token girl
Token ID2576
Feature activation-1.14e-02

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
cca+0.004
 sincere+0.004
Neg logit contributions
saw-0.001
 café-0.001
THE-0.001
fl-0.001
 trail-0.001
Token named
Token ID3706
Feature activation-5.67e-03

Pos logit contributions
inkle+0.012
 sincere+0.010
cca+0.010
ags+0.010
ist+0.010
Neg logit contributions
saw-0.019
THE-0.019
 café-0.019
 trail-0.019
fl-0.019
Token Amy
Token ID14235
Feature activation+6.11e-03

Pos logit contributions
inkle+0.005
cca+0.005
 concern+0.004
ags+0.004
ist+0.004
Neg logit contributions
THE-0.010
actus-0.010
saw-0.010
fl-0.009
 trail-0.009
Token.
Token ID13
Feature activation-6.79e-02

Pos logit contributions
saw+0.009
 trail+0.009
 café+0.009
reating+0.009
THE+0.009
Neg logit contributions
inkle-0.008
 their-0.007
 sincere-0.007
cca-0.007
ist-0.006
Token Amy
Token ID14235
Feature activation-1.51e-02

Pos logit contributions
inkle+0.085
cca+0.070
bly+0.069
 sincere+0.068
ist+0.068
Neg logit contributions
saw-0.106
 café-0.104
THE-0.103
fl-0.102
 trail-0.099
Token always
Token ID1464
Feature activation+4.90e-03

Pos logit contributions
inkle+0.026
ags+0.023
 concern+0.023
cca+0.023
 sincere+0.023
Neg logit contributions
saw-0.015
THE-0.015
fl-0.015
 café-0.015
 trail-0.015
Token had
Token ID550
Feature activation-4.34e-03

Pos logit contributions
fl+0.005
 café+0.005
THE+0.005
bye+0.005
 trail+0.005
Neg logit contributions
inkle-0.009
ags-0.008
 concern-0.008
 sincere-0.008
cca-0.008
Token a
Token ID257
Feature activation-1.79e-02

Pos logit contributions
inkle+0.006
ist+0.005
ags+0.005
 sincere+0.005
 concern+0.004
Neg logit contributions
 café-0.007
saw-0.006
fl-0.006
THE-0.006
 ze-0.006
Token big
Token ID1263
Feature activation-3.37e-02

Pos logit contributions
inkle+0.035
agged+0.033
ist+0.033
ags+0.033
bly+0.033
Neg logit contributions
 trail-0.017
 café-0.016
 desert-0.015
 roller-0.015
 tree-0.015
Token a
Token ID257
Feature activation+4.51e-03

Pos logit contributions
inkle+0.013
ags+0.011
 concern+0.011
cca+0.011
 sincere+0.011
Neg logit contributions
saw-0.025
THE-0.024
 café-0.024
fl-0.024
 trail-0.024
Token kind
Token ID1611
Feature activation+1.32e-02

Pos logit contributions
THE+0.006
fl+0.006
saw+0.005
reating+0.005
 café+0.005
Neg logit contributions
inkle-0.007
 sincere-0.006
cca-0.006
 concern-0.006
bly-0.006
Token girl
Token ID2576
Feature activation-8.31e-03

Pos logit contributions
saw+0.011
 café+0.011
THE+0.011
fl+0.010
 trail+0.010
Neg logit contributions
inkle-0.025
ags-0.023
 concern-0.023
cca-0.023
 sincere-0.022
Token named
Token ID3706
Feature activation-1.74e-03

Pos logit contributions
inkle+0.009
 sincere+0.007
cca+0.007
ags+0.007
 concern+0.007
Neg logit contributions
saw-0.014
THE-0.014
 trail-0.013
 café-0.013
fl-0.013
Token Amy
Token ID14235
Feature activation+6.68e-03

Pos logit contributions
inkle+0.002
cca+0.001
ags+0.001
ist+0.001
 concern+0.001
Neg logit contributions
THE-0.003
actus-0.003
saw-0.003
fl-0.003
 trail-0.003
Token.
Token ID13
Feature activation-6.64e-02

Pos logit contributions
saw+0.010
 trail+0.010
reating+0.009
 café+0.009
 roller+0.009
Neg logit contributions
inkle-0.009
 sincere-0.007
cca-0.007
 their-0.007
ist-0.007
Token Amy
Token ID14235
Feature activation-1.18e-02

Pos logit contributions
inkle+0.079
cca+0.064
 concern+0.063
 sincere+0.063
ags+0.063
Neg logit contributions
saw-0.107
THE-0.105
 café-0.105
fl-0.103
 trail-0.101
Token had
Token ID550
Feature activation+4.62e-03

Pos logit contributions
inkle+0.019
 concern+0.017
 sincere+0.017
cca+0.017
ags+0.016
Neg logit contributions
saw-0.014
THE-0.013
 café-0.013
fl-0.013
 trail-0.013
Token a
Token ID257
Feature activation-2.32e-02

Pos logit contributions
saw+0.008
THE+0.008
 café+0.008
fl+0.008
 trail+0.008
Neg logit contributions
inkle-0.005
ags-0.004
 concern-0.004
 sincere-0.004
cca-0.004
Token neighbor
Token ID4780
Feature activation-3.39e-02

Pos logit contributions
inkle+0.044
ist+0.042
agged+0.042
ags+0.042
bly+0.042
Neg logit contributions
 trail-0.021
 café-0.020
 ostr-0.019
 tree-0.019
actus-0.019
Token named
Token ID3706
Feature activation-1.24e-03

Pos logit contributions
inkle+0.073
ags+0.067
bly+0.067
agged+0.065
cca+0.063
Neg logit contributions
 café-0.029
 tree-0.027
 day-0.026
 l-0.026
 ostr-0.026
Token "
Token ID366
Feature activation-2.55e-03

Pos logit contributions
saw+0.001
THE+0.000
 café+0.000
fl+0.000
 trail+0.000
Neg logit contributions
inkle-0.001
ags-0.001
bly-0.001
ist-0.001
 concern-0.001
TokenI
Token ID40
Feature activation-4.88e-03

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
 sincere+0.004
ist+0.004
Neg logit contributions
saw-0.002
THE-0.002
 café-0.002
fl-0.002
 trail-0.002
Token'm
Token ID1101
Feature activation-1.86e-02

Pos logit contributions
ags+0.012
agged+0.011
inkle+0.010
adoes+0.010
ishment+0.010
Neg logit contributions
 sur-0.003
 would-0.002
 café-0.002
 woke-0.002
fl-0.002
Token a
Token ID257
Feature activation-5.36e-03

Pos logit contributions
ised+0.035
agged+0.035
inkle+0.034
ist+0.034
ags+0.033
Neg logit contributions
 desert-0.017
 ostr-0.015
 sur-0.015
 gn-0.015
 ze-0.015
Token secret
Token ID3200
Feature activation-2.14e-02

Pos logit contributions
inkle+0.010
ags+0.009
agged+0.009
ist+0.008
 concern+0.008
Neg logit contributions
 café-0.006
 trail-0.005
saw-0.005
 roller-0.005
THE-0.005
Token mer
Token ID4017
Feature activation-6.61e-02

Pos logit contributions
inkle+0.041
ags+0.038
agged+0.038
 concern+0.037
cca+0.036
Neg logit contributions
 trail-0.019
 café-0.019
saw-0.019
THE-0.019
fl-0.017
Tokenmaid
Token ID23151
Feature activation+1.54e-02

Pos logit contributions
inkle+0.098
ags+0.094
 concern+0.094
 Their+0.090
agged+0.089
Neg logit contributions
 sur-0.077
actus-0.074
saw-0.073
fl-0.072
 trail-0.071
Token.
Token ID13
Feature activation-1.58e-03

Pos logit contributions
saw+0.013
THE+0.013
fl+0.013
 café+0.013
 trail+0.012
Neg logit contributions
inkle-0.029
ags-0.026
 concern-0.026
 sincere-0.026
cca-0.026
Token Would
Token ID10928
Feature activation-2.88e-02

Pos logit contributions
inkle+0.004
ags+0.003
agged+0.003
 concern+0.003
 sincere+0.003
Neg logit contributions
saw-0.001
 café-0.001
THE-0.001
 Would-0.001
bye-0.001
Token you
Token ID345
Feature activation-1.79e-02

Pos logit contributions
inkle+0.048
 concern+0.042
ags+0.042
 sincere+0.041
cca+0.041
Neg logit contributions
saw-0.032
THE-0.031
 café-0.031
fl-0.031
 trail-0.030
Token like
Token ID588
Feature activation-4.36e-05

Pos logit contributions
inkle+0.024
 sincere+0.020
ags+0.020
cca+0.020
 concern+0.020
Neg logit contributions
THE-0.026
saw-0.025
 café-0.025
bye-0.025
fl-0.024
Token telling
Token ID5149
Feature activation-5.15e-04

Pos logit contributions
saw+0.006
 café+0.006
 roller+0.005
 desert+0.005
 ze+0.005
Neg logit contributions
inkle-0.012
ags-0.011
ist-0.011
 common-0.011
agged-0.010
Token a
Token ID257
Feature activation+7.18e-03

Pos logit contributions
inkle+0.001
ags+0.001
agged+0.001
ist+0.001
bly+0.001
Neg logit contributions
 ze-0.001
 roller-0.001
 café-0.001
reating-0.001
saw-0.001
Token story
Token ID1621
Feature activation-3.77e-03

Pos logit contributions
saw+0.008
THE+0.008
fl+0.008
 café+0.007
actus+0.007
Neg logit contributions
inkle-0.012
 concern-0.010
 sincere-0.010
cca-0.010
 common-0.010
Token about
Token ID546
Feature activation-1.69e-02

Pos logit contributions
inkle+0.007
ags+0.006
 sincere+0.006
agged+0.006
 concern+0.006
Neg logit contributions
saw-0.004
 café-0.004
reating-0.004
THE-0.004
board-0.004
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
inkle+0.031
agged+0.030
ags+0.029
bly+0.027
ist+0.027
Neg logit contributions
 ze-0.021
 café-0.018
reating-0.018
 trail-0.018
 desert-0.018
Token mer
Token ID4017
Feature activation-6.56e-02

Pos logit contributions
inkle+0.021
 sincere+0.019
 concern+0.019
cca+0.018
ags+0.018
Neg logit contributions
saw-0.016
THE-0.016
fl-0.016
 café-0.015
 trail-0.015
Tokenmaid
Token ID23151
Feature activation+1.11e-03

Pos logit contributions
 concern+0.097
ags+0.096
inkle+0.094
 Their+0.092
bly+0.090
Neg logit contributions
actus-0.077
 sur-0.073
saw-0.073
 trail-0.071
fl-0.071
Token who
Token ID508
Feature activation+6.06e-03

Pos logit contributions
saw+0.001
 café+0.001
THE+0.001
fl+0.001
 ze+0.001
Neg logit contributions
inkle-0.002
ags-0.002
 concern-0.002
cca-0.002
 sincere-0.002
Token lived
Token ID5615
Feature activation-3.01e-02

Pos logit contributions
 would+0.004
 knew+0.004
 could+0.003
 desert+0.003
 night+0.003
Neg logit contributions
ags-0.015
agged-0.014
inkle-0.013
ither-0.013
bly-0.013
Token in
Token ID287
Feature activation-3.84e-02

Pos logit contributions
inkle+0.046
 concern+0.041
ags+0.040
ist+0.038
agged+0.037
Neg logit contributions
THE-0.038
fl-0.036
saw-0.036
 trail-0.035
 café-0.035
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
ist+0.083
agged+0.082
ags+0.076
inkle+0.076
oving+0.076
Neg logit contributions
 night-0.038
 desert-0.030
 day-0.030
 stream-0.028
 café-0.028
Token a
Token ID257
Feature activation+7.36e-03

Pos logit contributions
inkle+0.008
ags+0.007
cca+0.007
 concern+0.006
 sincere+0.006
Neg logit contributions
saw-0.020
fl-0.020
 café-0.020
 trail-0.020
THE-0.020
Token little
Token ID1310
Feature activation+4.77e-04

Pos logit contributions
fl+0.010
THE+0.010
reating+0.010
saw+0.009
 café+0.009
Neg logit contributions
inkle-0.011
 sincere-0.009
cca-0.009
 concern-0.009
 common-0.009
Token girl
Token ID2576
Feature activation-1.27e-02

Pos logit contributions
saw+0.000
THE+0.000
 café+0.000
fl+0.000
 trail+0.000
Neg logit contributions
inkle-0.001
ags-0.001
 concern-0.001
cca-0.001
 sincere-0.001
Token named
Token ID3706
Feature activation-4.02e-03

Pos logit contributions
inkle+0.014
cca+0.011
 sincere+0.011
ags+0.011
ist+0.011
Neg logit contributions
saw-0.021
THE-0.021
 café-0.021
fl-0.021
 trail-0.021
Token Amy
Token ID14235
Feature activation+5.47e-03

Pos logit contributions
inkle+0.004
cca+0.003
ags+0.003
 Their+0.003
 concern+0.003
Neg logit contributions
THE-0.007
actus-0.007
saw-0.007
fl-0.007
 trail-0.007
Token.
Token ID13
Feature activation-6.56e-02

Pos logit contributions
saw+0.009
 trail+0.008
 café+0.008
reating+0.008
 roller+0.008
Neg logit contributions
inkle-0.006
 their-0.006
ist-0.006
cca-0.006
 sincere-0.005
Token Amy
Token ID14235
Feature activation-1.05e-02

Pos logit contributions
inkle+0.079
cca+0.065
bly+0.063
 sincere+0.063
 concern+0.063
Neg logit contributions
saw-0.105
 café-0.103
THE-0.103
fl-0.101
 trail-0.099
Token had
Token ID550
Feature activation+6.10e-03

Pos logit contributions
inkle+0.017
ags+0.014
 concern+0.014
cca+0.014
 sincere+0.014
Neg logit contributions
saw-0.012
THE-0.012
 café-0.012
fl-0.012
 trail-0.012
Token a
Token ID257
Feature activation-2.32e-02

Pos logit contributions
saw+0.011
THE+0.010
 café+0.010
fl+0.010
 trail+0.010
Neg logit contributions
inkle-0.006
ags-0.005
 concern-0.005
 sincere-0.005
cca-0.005
Token big
Token ID1263
Feature activation-3.87e-02

Pos logit contributions
inkle+0.044
ist+0.042
agged+0.041
ags+0.041
bly+0.041
Neg logit contributions
 trail-0.021
 café-0.020
 roller-0.019
actus-0.019
 ostr-0.019
Token red
Token ID2266
Feature activation-1.62e-02

Pos logit contributions
bly+0.076
cca+0.074
inkle+0.072
ist+0.072
agged+0.070
Neg logit contributions
 trail-0.035
 tree-0.034
 desert-0.033
 café-0.033
 ostr-0.032
Token sleeping
Token ID11029
Feature activation-1.97e-02

Pos logit contributions
THE+0.017
 café+0.016
saw+0.016
 desert+0.016
 trail+0.015
Neg logit contributions
inkle-0.032
ist-0.029
ags-0.029
 concern-0.029
 common-0.028
Token.
Token ID13
Feature activation+8.76e-03

Pos logit contributions
inkle+0.035
 concern+0.032
ags+0.032
 common+0.031
 sincere+0.030
Neg logit contributions
 trail-0.021
THE-0.020
fl-0.019
 café-0.018
board-0.018
Token He
Token ID679
Feature activation-6.26e-03

Pos logit contributions
saw+0.013
THE+0.013
 café+0.013
fl+0.012
 trail+0.012
Neg logit contributions
inkle-0.011
 concern-0.010
ags-0.010
cca-0.009
 sincere-0.009
Token had
Token ID550
Feature activation+3.69e-03

Pos logit contributions
inkle+0.012
ags+0.012
bly+0.011
agged+0.011
 concern+0.011
Neg logit contributions
 café-0.005
THE-0.005
 trail-0.005
 ze-0.005
fl-0.005
Token a
Token ID257
Feature activation-2.55e-02

Pos logit contributions
saw+0.006
THE+0.006
 café+0.006
fl+0.006
 trail+0.006
Neg logit contributions
inkle-0.004
 concern-0.003
ags-0.003
cca-0.003
 sincere-0.003
Token deep
Token ID2769
Feature activation-6.53e-02

Pos logit contributions
inkle+0.042
ags+0.040
agged+0.038
ist+0.038
cca+0.038
Neg logit contributions
 trail-0.028
 café-0.028
THE-0.028
 roller-0.027
saw-0.026
Token and
Token ID290
Feature activation-5.23e-03

Pos logit contributions
bly+0.116
hips+0.109
 Their+0.108
cca+0.106
ist+0.105
Neg logit contributions
 night-0.065
 desert-0.064
 trail-0.063
 day-0.057
 pine-0.054
Token peaceful
Token ID12309
Feature activation-2.89e-02

Pos logit contributions
inkle+0.008
ags+0.007
cca+0.007
 concern+0.007
ist+0.007
Neg logit contributions
saw-0.006
THE-0.006
 café-0.006
fl-0.006
 trail-0.006
Token sleep
Token ID3993
Feature activation-1.37e-02

Pos logit contributions
inkle+0.056
bly+0.055
ags+0.055
 sincere+0.053
cca+0.053
Neg logit contributions
 trail-0.026
 desert-0.024
 café-0.023
 night-0.023
fl-0.023
Token until
Token ID1566
Feature activation+1.77e-02

Pos logit contributions
inkle+0.024
 concern+0.023
ags+0.023
 sincere+0.022
 common+0.022
Neg logit contributions
 trail-0.013
THE-0.013
fl-0.013
bye-0.012
 café-0.012
Token his
Token ID465
Feature activation-2.89e-04

Pos logit contributions
THE+0.019
saw+0.019
 café+0.019
 trail+0.018
 roller+0.018
Neg logit contributions
inkle-0.030
ags-0.027
ist-0.027
 concern-0.026
cca-0.026
Token a
Token ID257
Feature activation+3.04e-03

Pos logit contributions
inkle+0.023
ist+0.020
 concern+0.019
ishment+0.018
 sincere+0.018
Neg logit contributions
THE-0.022
saw-0.021
 café-0.020
fl-0.020
 ze-0.019
Token little
Token ID1310
Feature activation-3.19e-03

Pos logit contributions
reating+0.004
THE+0.004
fl+0.004
saw+0.004
ups+0.004
Neg logit contributions
inkle-0.004
 sincere-0.004
cca-0.003
 concern-0.003
 common-0.003
Token girl
Token ID2576
Feature activation-3.17e-03

Pos logit contributions
inkle+0.007
ags+0.006
 concern+0.006
cca+0.006
 sincere+0.006
Neg logit contributions
saw-0.002
THE-0.002
 café-0.002
fl-0.002
 trail-0.002
Token called
Token ID1444
Feature activation-3.72e-03

Pos logit contributions
inkle+0.003
 sincere+0.002
cca+0.002
ist+0.002
 concern+0.002
Neg logit contributions
saw-0.006
THE-0.006
 trail-0.006
 café-0.006
fl-0.005
Token Amy
Token ID14235
Feature activation+5.66e-03

Pos logit contributions
inkle+0.005
ags+0.004
 concern+0.004
 sincere+0.004
cca+0.004
Neg logit contributions
saw-0.005
THE-0.005
 café-0.005
fl-0.005
 trail-0.005
Token.
Token ID13
Feature activation-6.52e-02

Pos logit contributions
saw+0.009
 trail+0.009
unn+0.008
rav+0.008
 café+0.008
Neg logit contributions
inkle-0.007
 their-0.006
cca-0.006
 sincere-0.006
ist-0.006
Token She
Token ID1375
Feature activation+7.78e-03

Pos logit contributions
inkle+0.074
 concern+0.060
ags+0.060
cca+0.060
 sincere+0.059
Neg logit contributions
saw-0.107
THE-0.105
 café-0.105
fl-0.104
 trail-0.102
Token was
Token ID373
Feature activation-4.42e-03

Pos logit contributions
reating+0.015
saw+0.015
 café+0.014
 ze+0.013
unn+0.013
Neg logit contributions
inkle-0.008
 sincere-0.007
cca-0.006
 common-0.006
 their-0.006
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
inkle+0.007
ist+0.006
ags+0.006
 concern+0.006
cca+0.006
Neg logit contributions
saw-0.006
 ze-0.005
 café-0.005
fl-0.005
 roller-0.005
Token very
Token ID845
Feature activation-2.48e-03

Pos logit contributions
saw+0.014
THE+0.014
fl+0.014
 café+0.013
reating+0.013
Neg logit contributions
inkle-0.014
 sincere-0.012
 concern-0.012
cca-0.012
ags-0.012
Token thoughtful
Token ID22677
Feature activation-4.01e-04

Pos logit contributions
inkle+0.004
ags+0.003
 concern+0.003
cca+0.003
ist+0.003
Neg logit contributions
saw-0.003
 café-0.003
THE-0.003
 trail-0.003
fl-0.003
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
inkle+0.054
bly+0.049
cca+0.048
ist+0.047
 sincere+0.046
Neg logit contributions
saw-0.034
THE-0.033
 café-0.032
fl-0.031
bye-0.029
Token
Token ID198
Feature activation-5.20e-02

Pos logit contributions
inkle+0.024
ags+0.021
 concern+0.021
 sincere+0.021
cca+0.021
Neg logit contributions
saw-0.016
THE-0.015
 café-0.015
fl-0.015
 trail-0.014
TokenSometimes
Token ID15468
Feature activation-2.98e-02

Pos logit contributions
inkle+0.089
 concern+0.076
ags+0.076
 sincere+0.075
cca+0.075
Neg logit contributions
saw-0.060
THE-0.060
 café-0.056
fl-0.053
bye-0.053
Token,
Token ID11
Feature activation-5.38e-02

Pos logit contributions
inkle+0.060
ags+0.058
agged+0.055
bly+0.054
cca+0.052
Neg logit contributions
fl-0.026
 night-0.025
 day-0.025
THE-0.024
bye-0.024
Token though
Token ID996
Feature activation-4.03e-02

Pos logit contributions
inkle+0.111
ags+0.098
agged+0.096
bly+0.093
ither+0.092
Neg logit contributions
saw-0.053
 day-0.052
 tree-0.050
THE-0.048
fl-0.048
Token,
Token ID11
Feature activation-6.49e-02

Pos logit contributions
inkle+0.081
ags+0.079
agged+0.078
bly+0.077
cca+0.077
Neg logit contributions
 day-0.036
 night-0.035
 tree-0.035
THE-0.032
saw-0.031
Token it
Token ID340
Feature activation-1.41e-02

Pos logit contributions
inkle+0.129
ags+0.117
agged+0.113
ady+0.108
ither+0.107
Neg logit contributions
 day-0.065
saw-0.063
 tree-0.062
THE-0.062
 night-0.061
Token could
Token ID714
Feature activation-2.42e-02

Pos logit contributions
inkle+0.029
ags+0.028
 concern+0.027
agged+0.027
bly+0.027
Neg logit contributions
saw-0.010
fl-0.010
THE-0.009
 trail-0.009
 café-0.009
Token be
Token ID307
Feature activation-1.22e-02

Pos logit contributions
agged+0.050
cca+0.047
inkle+0.046
ags+0.046
ady+0.045
Neg logit contributions
 trail-0.019
 tree-0.019
 slide-0.019
saw-0.018
 roller-0.018
Token difficult
Token ID2408
Feature activation-8.02e-03

Pos logit contributions
inkle+0.020
ags+0.018
 concern+0.018
bly+0.017
 sincere+0.017
Neg logit contributions
 café-0.014
THE-0.013
saw-0.013
 roller-0.013
 trail-0.013
Token.
Token ID13
Feature activation-6.15e-02

Pos logit contributions
inkle+0.017
cca+0.015
 sincere+0.015
bly+0.015
ags+0.015
Neg logit contributions
fl-0.006
 trail-0.006
reating-0.006
 café-0.006
 roller-0.006
Token a
Token ID257
Feature activation+7.98e-03

Pos logit contributions
inkle+0.018
ist+0.015
 sincere+0.014
 concern+0.014
cca+0.014
Neg logit contributions
THE-0.025
saw-0.025
 café-0.024
fl-0.024
 ze-0.024
Token little
Token ID1310
Feature activation-2.81e-03

Pos logit contributions
fl+0.012
reating+0.012
THE+0.012
ups+0.011
saw+0.011
Neg logit contributions
inkle-0.011
 sincere-0.010
cca-0.008
 concern-0.008
 common-0.008
Token girl
Token ID2576
Feature activation-1.20e-02

Pos logit contributions
inkle+0.006
ags+0.005
 concern+0.005
cca+0.005
 sincere+0.005
Neg logit contributions
saw-0.002
THE-0.002
 café-0.002
fl-0.002
 trail-0.002
Token named
Token ID3706
Feature activation-6.92e-03

Pos logit contributions
inkle+0.012
 sincere+0.010
ist+0.009
cca+0.009
 concern+0.009
Neg logit contributions
saw-0.021
THE-0.021
 café-0.021
 trail-0.020
fl-0.020
Token Amy
Token ID14235
Feature activation+1.50e-03

Pos logit contributions
inkle+0.008
 concern+0.007
ags+0.007
cca+0.007
 sincere+0.007
Neg logit contributions
THE-0.011
saw-0.011
fl-0.011
actus-0.010
 trail-0.010
Token.
Token ID13
Feature activation-6.48e-02

Pos logit contributions
saw+0.002
 trail+0.002
 café+0.002
THE+0.002
reating+0.002
Neg logit contributions
inkle-0.002
 sincere-0.002
 their-0.002
cca-0.002
ist-0.002
Token She
Token ID1375
Feature activation+1.28e-03

Pos logit contributions
inkle+0.077
cca+0.063
 concern+0.063
ags+0.062
 sincere+0.062
Neg logit contributions
saw-0.103
THE-0.101
 café-0.101
fl-0.100
 trail-0.098
Token had
Token ID550
Feature activation+4.86e-03

Pos logit contributions
reating+0.002
saw+0.002
 café+0.002
 ze+0.002
THE+0.002
Neg logit contributions
inkle-0.001
 sincere-0.001
cca-0.001
 concern-0.001
 common-0.001
Token a
Token ID257
Feature activation-2.22e-02

Pos logit contributions
saw+0.009
THE+0.009
 café+0.008
fl+0.008
 trail+0.008
Neg logit contributions
inkle-0.005
ags-0.004
 concern-0.004
cca-0.004
 sincere-0.004
Token small
Token ID1402
Feature activation-3.87e-02

Pos logit contributions
inkle+0.040
ags+0.037
ist+0.037
agged+0.037
bly+0.037
Neg logit contributions
 trail-0.022
 café-0.021
actus-0.020
 roller-0.020
 ostr-0.020
Token,
Token ID11
Feature activation-2.34e-02

Pos logit contributions
inkle+0.076
bly+0.075
agged+0.073
cks+0.072
ist+0.071
Neg logit contributions
 trail-0.035
 café-0.033
 ostr-0.032
 desert-0.031
 tree-0.031
Token's
Token ID338
Feature activation-1.79e-02

Pos logit contributions
inkle+0.006
ags+0.006
bly+0.006
 sincere+0.005
cca+0.005
Neg logit contributions
saw-0.003
bye-0.003
board-0.003
 roller-0.003
 ze-0.003
Token there
Token ID612
Feature activation-1.47e-03

Pos logit contributions
inkle+0.028
cca+0.025
ags+0.025
agged+0.024
ist+0.024
Neg logit contributions
saw-0.023
THE-0.022
 trail-0.021
 café-0.021
fl-0.021
Token?"
Token ID1701
Feature activation-2.58e-02

Pos logit contributions
inkle+0.003
cca+0.002
ags+0.002
 concern+0.002
agged+0.002
Neg logit contributions
THE-0.002
saw-0.002
 café-0.001
fl-0.001
 trail-0.001
Token came
Token ID1625
Feature activation+1.59e-02

Pos logit contributions
inkle+0.059
ags+0.057
ist+0.054
agged+0.053
ittered+0.053
Neg logit contributions
 tree-0.018
 café-0.016
bye-0.016
saw-0.016
 trail-0.014
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
actus+0.025
 café+0.025
THE+0.025
saw+0.024
board+0.024
Neg logit contributions
inkle-0.018
cca-0.016
 concern-0.016
ags-0.015
 sincere-0.015
Token deep
Token ID2769
Feature activation-6.48e-02

Pos logit contributions
inkle+0.021
 concern+0.018
 sincere+0.018
cca+0.018
ags+0.018
Neg logit contributions
saw-0.017
THE-0.017
fl-0.017
 café-0.016
 trail-0.016
Token voice
Token ID3809
Feature activation-1.45e-02

Pos logit contributions
inkle+0.108
agged+0.104
ags+0.103
bly+0.100
 concern+0.098
Neg logit contributions
 trail-0.073
 café-0.070
 desert-0.067
saw-0.067
 tree-0.066
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
inkle+0.021
 concern+0.018
ags+0.018
 sincere+0.018
cca+0.018
Neg logit contributions
saw-0.019
THE-0.019
 café-0.019
fl-0.019
 trail-0.018
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
inkle+0.026
ags+0.023
ist+0.022
bly+0.022
cca+0.022
Neg logit contributions
saw-0.017
 café-0.017
fl-0.016
 trail-0.016
 ze-0.016
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
inkle+0.014
 common+0.012
ither+0.011
 sincere+0.011
ags+0.011
Neg logit contributions
fl-0.014
 ze-0.014
reating-0.014
 café-0.014
actus-0.014
TokenThe
Token ID464
Feature activation+8.56e-03

Pos logit contributions
inkle+0.037
cca+0.034
 sincere+0.033
bly+0.033
 common+0.032
Neg logit contributions
fl-0.032
 ze-0.031
reating-0.030
actus-0.030
unn-0.030
Token lived
Token ID5615
Feature activation-1.84e-02

Pos logit contributions
THE+0.000
fl+0.000
saw+0.000
 café+0.000
 trail+0.000
Neg logit contributions
inkle-0.001
ags-0.001
 concern-0.001
cca-0.001
ist-0.001
Token a
Token ID257
Feature activation+1.14e-03

Pos logit contributions
inkle+0.015
ags+0.013
cca+0.013
 concern+0.012
 sincere+0.012
Neg logit contributions
saw-0.035
fl-0.034
 café-0.034
THE-0.034
 trail-0.034
Token girl
Token ID2576
Feature activation-1.76e-02

Pos logit contributions
THE+0.002
fl+0.002
saw+0.002
reating+0.002
 café+0.001
Neg logit contributions
inkle-0.002
 sincere-0.001
cca-0.001
 concern-0.001
 common-0.001
Token named
Token ID3706
Feature activation-7.57e-03

Pos logit contributions
inkle+0.023
 concern+0.019
ags+0.019
cca+0.019
 sincere+0.019
Neg logit contributions
saw-0.026
THE-0.026
 café-0.026
fl-0.025
 trail-0.025
Token Amy
Token ID14235
Feature activation+1.72e-04

Pos logit contributions
inkle+0.009
cca+0.008
ags+0.007
 concern+0.007
ist+0.007
Neg logit contributions
THE-0.012
saw-0.012
fl-0.011
actus-0.011
 trail-0.011
Token.
Token ID13
Feature activation-6.47e-02

Pos logit contributions
saw+0.000
 trail+0.000
THE+0.000
 café+0.000
 roller+0.000
Neg logit contributions
inkle-0.000
 sincere-0.000
cca-0.000
ist-0.000
 concern-0.000
Token She
Token ID1375
Feature activation-2.51e-03

Pos logit contributions
inkle+0.072
cca+0.058
 concern+0.057
ags+0.057
 sincere+0.057
Neg logit contributions
saw-0.108
THE-0.106
 café-0.106
fl-0.105
 trail-0.103
Token had
Token ID550
Feature activation+4.38e-03

Pos logit contributions
inkle+0.003
 sincere+0.003
cca+0.003
 concern+0.003
 common+0.003
Neg logit contributions
saw-0.004
reating-0.004
 café-0.004
THE-0.004
 ze-0.004
Token a
Token ID257
Feature activation-2.24e-02

Pos logit contributions
saw+0.008
THE+0.008
 café+0.008
fl+0.008
 trail+0.008
Neg logit contributions
inkle-0.004
ags-0.003
 concern-0.003
cca-0.003
 sincere-0.003
Token big
Token ID1263
Feature activation-3.99e-02

Pos logit contributions
inkle+0.042
ist+0.040
agged+0.039
ags+0.039
bly+0.039
Neg logit contributions
 trail-0.021
 café-0.019
actus-0.019
 ostr-0.018
 roller-0.018
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
bly+0.078
cca+0.076
inkle+0.075
ist+0.073
agged+0.073
Neg logit contributions
 trail-0.035
 café-0.034
 desert-0.033
 tree-0.032
 ostr-0.031
Token and
Token ID290
Feature activation-1.83e-02

Pos logit contributions
inkle+0.031
ags+0.028
agged+0.028
bly+0.027
 sincere+0.026
Neg logit contributions
saw-0.015
 roller-0.014
 café-0.014
 trail-0.014
 tree-0.014
Token over
Token ID625
Feature activation-5.46e-03

Pos logit contributions
inkle+0.033
ags+0.030
agged+0.029
bly+0.028
cca+0.028
Neg logit contributions
saw-0.021
 trail-0.019
 café-0.019
 roller-0.018
fl-0.018
Token again
Token ID757
Feature activation+4.59e-03

Pos logit contributions
inkle+0.011
ags+0.010
agged+0.010
bly+0.010
 sincere+0.010
Neg logit contributions
 roller-0.005
saw-0.005
 trail-0.004
 café-0.004
 tree-0.004
Token with
Token ID351
Feature activation-2.69e-02

Pos logit contributions
 trail+0.003
 to+0.003
 tree+0.003
 roller+0.003
fl+0.003
Neg logit contributions
ags-0.010
agged-0.010
inkle-0.010
 sincere-0.009
 concern-0.009
Token his
Token ID465
Feature activation-2.45e-02

Pos logit contributions
inkle+0.053
ist+0.048
agged+0.046
bly+0.045
ags+0.044
Neg logit contributions
 roller-0.027
 tree-0.027
 night-0.026
 steps-0.024
 distance-0.024
Token deep
Token ID2769
Feature activation-6.42e-02

Pos logit contributions
inkle+0.042
bly+0.040
cca+0.039
ist+0.039
agged+0.038
Neg logit contributions
 trail-0.027
 tree-0.025
 roller-0.024
 coc-0.023
fl-0.023
Token paws
Token ID49193
Feature activation+4.99e-03

Pos logit contributions
bly+0.107
inkle+0.094
 sincere+0.093
cca+0.090
agged+0.090
Neg logit contributions
 trail-0.078
 steps-0.070
 tree-0.069
fl-0.069
 pine-0.067
Token,
Token ID11
Feature activation-5.89e-03

Pos logit contributions
 trail+0.004
fl+0.004
THE+0.004
saw+0.004
board+0.004
Neg logit contributions
inkle-0.010
ags-0.009
 sincere-0.009
 concern-0.009
 common-0.009
Token to
Token ID284
Feature activation-5.96e-03

Pos logit contributions
inkle+0.013
agged+0.012
ags+0.011
bly+0.011
ither+0.011
Neg logit contributions
 roller-0.006
 trail-0.005
 ze-0.005
 tree-0.005
 slides-0.005
Token no
Token ID645
Feature activation+1.37e-03

Pos logit contributions
inkle+0.012
agged+0.012
cca+0.011
bly+0.011
 sincere+0.010
Neg logit contributions
 roller-0.005
 tree-0.005
 trail-0.005
saw-0.005
 ze-0.005
Token avail
Token ID29107
Feature activation-2.09e-02

Pos logit contributions
 trail+0.002
 roller+0.001
 ze+0.001
saw+0.001
 café+0.001
Neg logit contributions
inkle-0.002
agged-0.002
cca-0.002
bly-0.002
ags-0.002
Token a
Token ID257
Feature activation-2.28e-03

Pos logit contributions
inkle+0.016
ags+0.013
cca+0.013
 sincere+0.012
 concern+0.012
Neg logit contributions
saw-0.037
fl-0.036
 café-0.036
THE-0.036
 trail-0.035
Token little
Token ID1310
Feature activation-9.86e-03

Pos logit contributions
inkle+0.003
 sincere+0.003
cca+0.003
 concern+0.003
 common+0.003
Neg logit contributions
THE-0.003
fl-0.003
saw-0.003
reating-0.003
bye-0.003
Token girl
Token ID2576
Feature activation-1.57e-02

Pos logit contributions
inkle+0.021
ags+0.019
 concern+0.019
cca+0.019
 sincere+0.019
Neg logit contributions
saw-0.006
THE-0.006
 café-0.006
fl-0.006
 trail-0.005
Token named
Token ID3706
Feature activation-6.77e-03

Pos logit contributions
inkle+0.017
 sincere+0.014
cca+0.014
ags+0.014
 concern+0.014
Neg logit contributions
saw-0.026
THE-0.026
 café-0.025
fl-0.025
 trail-0.025
Token Amy
Token ID14235
Feature activation+2.12e-03

Pos logit contributions
inkle+0.007
cca+0.006
 Their+0.006
 concern+0.006
ags+0.006
Neg logit contributions
THE-0.011
actus-0.011
saw-0.011
fl-0.011
 trail-0.011
Token.
Token ID13
Feature activation-6.42e-02

Pos logit contributions
saw+0.003
 trail+0.003
 café+0.003
 roller+0.003
rav+0.003
Neg logit contributions
inkle-0.003
 sincere-0.002
 their-0.002
ist-0.002
cca-0.002
Token Amy
Token ID14235
Feature activation-1.73e-02

Pos logit contributions
inkle+0.071
 concern+0.057
ags+0.057
cca+0.057
 sincere+0.057
Neg logit contributions
saw-0.107
THE-0.106
 café-0.105
fl-0.104
 trail-0.102
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
inkle+0.028
 sincere+0.024
 concern+0.024
cca+0.024
ags+0.024
Neg logit contributions
saw-0.020
THE-0.020
 café-0.020
fl-0.019
 trail-0.019
Token sad
Token ID6507
Feature activation+2.69e-05

Pos logit contributions
inkle+0.017
ist+0.015
ags+0.014
cca+0.014
 concern+0.014
Neg logit contributions
saw-0.012
 ze-0.012
 café-0.012
fl-0.012
 desert-0.011
Token because
Token ID780
Feature activation+9.54e-03

Pos logit contributions
 trail+0.000
THE+0.000
fl+0.000
 café+0.000
actus+0.000
Neg logit contributions
inkle-0.000
cca-0.000
 concern-0.000
 sincere-0.000
ags-0.000
Token it
Token ID340
Feature activation+4.64e-04

Pos logit contributions
saw+0.008
THE+0.008
 café+0.007
reating+0.007
 desert+0.007
Neg logit contributions
inkle-0.019
ist-0.018
cca-0.018
ags-0.018
agged-0.017
Token,
Token ID11
Feature activation-3.80e-03

Pos logit contributions
ags+0.033
inkle+0.032
agged+0.031
ist+0.030
bly+0.030
Neg logit contributions
bye-0.011
 trail-0.011
 night-0.010
 day-0.010
THE-0.009
Token he
Token ID339
Feature activation-1.35e-02

Pos logit contributions
inkle+0.009
ags+0.008
ist+0.008
agged+0.008
cca+0.008
Neg logit contributions
 ze-0.002
saw-0.002
THE-0.002
 trail-0.002
bye-0.002
Token put
Token ID1234
Feature activation-3.79e-02

Pos logit contributions
inkle+0.029
ags+0.028
agged+0.026
 concern+0.026
 sincere+0.026
Neg logit contributions
 trail-0.010
 café-0.009
THE-0.009
fl-0.009
 ze-0.009
Token on
Token ID319
Feature activation-2.06e-03

Pos logit contributions
inkle+0.078
 sincere+0.077
ist+0.076
 unison+0.074
ither+0.074
Neg logit contributions
 night-0.030
 tree-0.030
 l-0.027
fl-0.027
 day-0.026
Token his
Token ID465
Feature activation-3.08e-02

Pos logit contributions
inkle+0.004
ist+0.003
bly+0.003
 sincere+0.003
immer+0.003
Neg logit contributions
 café-0.002
 l-0.002
 pine-0.002
 roller-0.002
 ze-0.002
Token brav
Token ID49025
Feature activation-6.41e-02

Pos logit contributions
inkle+0.057
ist+0.057
bly+0.054
hips+0.053
ised+0.053
Neg logit contributions
 trail-0.028
 café-0.027
 camel-0.027
 roller-0.026
 tree-0.026
Tokenest
Token ID395
Feature activation-5.22e-02

Pos logit contributions
inkle+0.121
bly+0.111
 sincere+0.110
ist+0.104
ither+0.103
Neg logit contributions
bye-0.057
fl-0.055
 ze-0.052
 café-0.051
 roller-0.050
Token clothes
Token ID8242
Feature activation+1.25e-03

Pos logit contributions
inkle+0.106
uart+0.100
bly+0.099
ither+0.098
cks+0.098
Neg logit contributions
 trail-0.042
 café-0.041
 camel-0.040
 desert-0.039
 l-0.036
Token and
Token ID290
Feature activation-2.38e-04

Pos logit contributions
 trail+0.001
board+0.001
THE+0.001
 roller+0.001
line+0.001
Neg logit contributions
inkle-0.002
ags-0.002
 concern-0.002
 common-0.002
 sincere-0.002
Token started
Token ID2067
Feature activation+1.41e-02

Pos logit contributions
inkle+0.000
ags+0.000
 sincere+0.000
agged+0.000
ither+0.000
Neg logit contributions
 café-0.000
 trail-0.000
fl-0.000
 ze-0.000
 roller-0.000
Token his
Token ID465
Feature activation-9.41e-03

Pos logit contributions
fl+0.008
 café+0.007
 ze+0.007
saw+0.007
 trail+0.007
Neg logit contributions
inkle-0.033
ags-0.030
ist-0.029
agged-0.029
cca-0.029
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
inkle+0.019
ist+0.015
 sincere+0.015
 concern+0.015
ags+0.015
Neg logit contributions
THE-0.026
saw-0.026
 café-0.026
fl-0.025
 ze-0.025
Token kind
Token ID1611
Feature activation+1.89e-02

Pos logit contributions
reating+0.022
fl+0.022
THE+0.022
ups+0.021
unn+0.021
Neg logit contributions
inkle-0.016
 sincere-0.015
cca-0.012
 sorry-0.012
 concern-0.012
Token girl
Token ID2576
Feature activation+2.45e-03

Pos logit contributions
THE+0.020
fl+0.020
saw+0.019
 café+0.017
 roller+0.017
Neg logit contributions
inkle-0.032
 sincere-0.029
 concern-0.028
ags-0.028
cca-0.028
Token named
Token ID3706
Feature activation-2.99e-03

Pos logit contributions
THE+0.005
saw+0.005
 trail+0.005
rav+0.005
actus+0.005
Neg logit contributions
inkle-0.002
ist-0.001
 sincere-0.001
 their-0.001
cca-0.001
Token Amy
Token ID14235
Feature activation+1.33e-02

Pos logit contributions
inkle+0.003
ags+0.002
 Their+0.002
 concern+0.002
cca+0.002
Neg logit contributions
THE-0.005
actus-0.005
saw-0.005
fl-0.005
 trail-0.005
Token.
Token ID13
Feature activation-6.40e-02

Pos logit contributions
saw+0.024
 trail+0.023
reating+0.023
 café+0.022
 roller+0.022
Neg logit contributions
inkle-0.014
 their-0.012
 sincere-0.012
ist-0.011
cca-0.011
Token Amy
Token ID14235
Feature activation-8.79e-03

Pos logit contributions
inkle+0.075
ags+0.062
 concern+0.062
cca+0.061
 sincere+0.061
Neg logit contributions
saw-0.102
THE-0.101
 café-0.100
fl-0.099
 trail-0.097
Token liked
Token ID8288
Feature activation+2.29e-02

Pos logit contributions
inkle+0.013
 sincere+0.011
cca+0.011
 concern+0.011
ags+0.011
Neg logit contributions
saw-0.012
THE-0.011
 café-0.011
reating-0.011
 trail-0.011
Token to
Token ID284
Feature activation-7.71e-03

Pos logit contributions
THE+0.026
saw+0.025
actus+0.023
bye+0.023
ha+0.023
Neg logit contributions
 their-0.035
 Their-0.033
 concern-0.032
ags-0.032
 sincere-0.032
Token help
Token ID1037
Feature activation+6.72e-03

Pos logit contributions
inkle+0.015
cca+0.014
ist+0.013
bly+0.013
ised+0.013
Neg logit contributions
 trail-0.007
 café-0.007
saw-0.006
bye-0.006
 roller-0.006
Token her
Token ID607
Feature activation-1.54e-02

Pos logit contributions
saw+0.010
 café+0.010
fl+0.010
THE+0.010
 trail+0.010
Neg logit contributions
inkle-0.008
 sincere-0.007
cca-0.007
 concern-0.007
ist-0.007
Token year
Token ID614
Feature activation+1.11e-02

Pos logit contributions
inkle+0.019
ags+0.017
 concern+0.017
 sincere+0.017
agged+0.016
Neg logit contributions
saw-0.011
 café-0.010
 ze-0.010
fl-0.010
THE-0.010
Token old
Token ID1468
Feature activation+8.30e-04

Pos logit contributions
saw+0.011
 café+0.011
THE+0.010
fl+0.010
 trail+0.010
Neg logit contributions
inkle-0.020
 concern-0.018
ags-0.018
cca-0.018
 sincere-0.018
Token girl
Token ID2576
Feature activation-4.37e-03

Pos logit contributions
saw+0.001
 café+0.001
fl+0.001
 ze+0.001
THE+0.001
Neg logit contributions
inkle-0.002
ags-0.002
 concern-0.002
agged-0.002
cca-0.002
Token named
Token ID3706
Feature activation+4.47e-03

Pos logit contributions
inkle+0.007
ags+0.007
 concern+0.007
agged+0.007
cca+0.006
Neg logit contributions
 café-0.005
saw-0.005
fl-0.005
 ze-0.005
THE-0.005
Token Amy
Token ID14235
Feature activation+5.57e-04

Pos logit contributions
saw+0.005
 café+0.005
 Merry+0.005
fl+0.005
 roller+0.005
Neg logit contributions
inkle-0.007
ags-0.007
 sincere-0.006
agged-0.006
 concern-0.006
Token.
Token ID13
Feature activation-6.40e-02

Pos logit contributions
THE+0.000
fl+0.000
 café+0.000
saw+0.000
 ze+0.000
Neg logit contributions
inkle-0.001
ags-0.001
 concern-0.001
 sincere-0.001
agged-0.001
Token She
Token ID1375
Feature activation-2.92e-03

Pos logit contributions
inkle+0.102
bly+0.089
ist+0.084
cca+0.084
 sincere+0.081
Neg logit contributions
saw-0.085
 café-0.084
THE-0.081
bye-0.078
fl-0.076
Token was
Token ID373
Feature activation-1.21e-02

Pos logit contributions
ags+0.007
inkle+0.006
agged+0.006
bly+0.006
ishment+0.006
Neg logit contributions
 would-0.002
 café-0.002
 roller-0.001
board-0.001
THE-0.001
Token feeling
Token ID4203
Feature activation+1.46e-02

Pos logit contributions
inkle+0.026
ist+0.024
agged+0.024
ags+0.024
ised+0.023
Neg logit contributions
 night-0.008
 ze-0.008
 day-0.008
 desert-0.008
 ostr-0.008
Token very
Token ID845
Feature activation+1.22e-02

Pos logit contributions
 night+0.010
 desert+0.008
 day+0.008
 café+0.008
 l+0.007
Neg logit contributions
ist-0.031
inkle-0.031
adoes-0.029
cks-0.029
ishment-0.029
Token ordinary
Token ID8850
Feature activation-6.11e-03

Pos logit contributions
 café+0.012
saw+0.011
 desert+0.011
bye+0.011
 sur+0.011
Neg logit contributions
inkle-0.020
ist-0.020
cca-0.019
ags-0.019
adoes-0.019
Token fun
Token ID1257
Feature activation-5.34e-04

Pos logit contributions
inkle+0.015
ist+0.014
bly+0.013
cca+0.013
 sincere+0.013
Neg logit contributions
 ze-0.006
 trail-0.006
 roller-0.006
 café-0.006
fl-0.006
Token!
Token ID0
Feature activation+2.17e-02

Pos logit contributions
inkle+0.001
 concern+0.001
ags+0.001
 sincere+0.001
cca+0.001
Neg logit contributions
saw-0.001
THE-0.001
fl-0.001
 café-0.001
 trail-0.001
Token
Token ID220
Feature activation+2.89e-02

Pos logit contributions
saw+0.024
THE+0.024
fl+0.024
 café+0.023
bye+0.022
Neg logit contributions
inkle-0.036
 concern-0.032
 sincere-0.032
ags-0.032
cca-0.032
Token
Token ID198
Feature activation+2.34e-02

Pos logit contributions
THE+0.025
saw+0.025
fl+0.022
 café+0.022
 trail+0.021
Neg logit contributions
inkle-0.055
 concern-0.050
 sincere-0.049
cca-0.049
ist-0.048
Token
Token ID198
Feature activation+4.28e-03

Pos logit contributions
fl+0.032
saw+0.032
 café+0.031
 trail+0.030
reating+0.030
Neg logit contributions
inkle-0.033
cca-0.027
ags-0.027
ither-0.026
 concern-0.026
TokenAs
Token ID1722
Feature activation+3.81e-02

Pos logit contributions
saw+0.006
 café+0.006
fl+0.006
board+0.005
THE+0.005
Neg logit contributions
inkle-0.006
cca-0.005
 sincere-0.005
 concern-0.005
ags-0.005
Token it
Token ID340
Feature activation+1.08e-02

Pos logit contributions
saw+0.052
 café+0.050
THE+0.050
fl+0.049
 trail+0.048
Neg logit contributions
inkle-0.054
cca-0.045
 concern-0.045
 sincere-0.045
ags-0.045
Token went
Token ID1816
Feature activation+9.79e-03

Pos logit contributions
saw+0.011
reating+0.011
 café+0.011
THE+0.011
lan+0.010
Neg logit contributions
inkle-0.019
cca-0.017
ist-0.016
bly-0.016
 concern-0.016
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
saw+0.014
 café+0.013
THE+0.013
fl+0.013
actus+0.013
Neg logit contributions
inkle-0.013
ags-0.011
 sincere-0.011
agged-0.011
 concern-0.011
Token it
Token ID340
Feature activation+1.75e-02

Pos logit contributions
fl+0.017
saw+0.016
THE+0.016
 ze+0.016
 trail+0.016
Neg logit contributions
inkle-0.027
ags-0.025
ist-0.024
 concern-0.024
 sincere-0.024
Token saw
Token ID2497
Feature activation+1.03e-02

Pos logit contributions
reating+0.022
saw+0.021
unn+0.020
rav+0.020
THE+0.020
Neg logit contributions
inkle-0.027
cca-0.025
 sincere-0.023
 concern-0.022
ist-0.022
Token a
Token ID257
Feature activation+1.80e-02

Pos logit contributions
saw+0.000
THE+0.000
bye+0.000
fl+0.000
 day+0.000
Neg logit contributions
inkle-0.000
ist-0.000
ags-0.000
 sincere-0.000
 concern-0.000
Token big
Token ID1263
Feature activation-9.02e-03

Pos logit contributions
THE+0.026
reating+0.026
fl+0.025
saw+0.025
 café+0.024
Neg logit contributions
inkle-0.025
 sincere-0.021
cca-0.021
 concern-0.020
 common-0.020
Token aer
Token ID9551
Feature activation+8.64e-03

Pos logit contributions
inkle+0.019
cca+0.017
 concern+0.017
ags+0.017
 sincere+0.017
Neg logit contributions
saw-0.006
 café-0.006
THE-0.006
 trail-0.006
fl-0.006
Tokenopl
Token ID20106
Feature activation+1.75e-02

Pos logit contributions
fl+0.014
 café+0.013
rav+0.013
 Merry+0.013
 trail+0.013
Neg logit contributions
inkle-0.012
ags-0.009
agged-0.009
ist-0.009
 sincere-0.009
Tokenane
Token ID1531
Feature activation+1.40e-02

Pos logit contributions
 café+0.025
 trail+0.023
 coc+0.022
fl+0.022
THE+0.022
Neg logit contributions
inkle-0.026
 sincere-0.022
ags-0.021
agged-0.021
ist-0.021
Token was
Token ID373
Feature activation+3.65e-02

Pos logit contributions
 café+0.020
reating+0.020
rav+0.019
unn+0.019
THE+0.019
Neg logit contributions
inkle-0.020
 sincere-0.016
ist-0.015
ised-0.015
agged-0.015
Token flying
Token ID7348
Feature activation+1.13e-02

Pos logit contributions
 café+0.045
saw+0.044
THE+0.043
fl+0.043
 trail+0.042
Neg logit contributions
inkle-0.058
ags-0.050
 sincere-0.049
 concern-0.048
cca-0.048
Token in
Token ID287
Feature activation+1.56e-02

Pos logit contributions
saw+0.016
THE+0.016
 café+0.016
fl+0.016
 trail+0.016
Neg logit contributions
inkle-0.015
ags-0.013
 concern-0.013
cca-0.012
 sincere-0.012
Token the
Token ID262
Feature activation+3.22e-02

Pos logit contributions
saw+0.022
 café+0.022
THE+0.022
fl+0.022
actus+0.021
Neg logit contributions
inkle-0.021
ags-0.018
 concern-0.017
 sincere-0.017
cca-0.017
Token sky
Token ID6766
Feature activation+1.62e-02

Pos logit contributions
unn+0.042
rav+0.040
reating+0.037
 café+0.036
ll+0.036
Neg logit contributions
inkle-0.049
ags-0.043
 their-0.041
 sincere-0.041
ised-0.040
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
saw+0.020
THE+0.020
 café+0.019
 trail+0.019
fl+0.019
Neg logit contributions
inkle-0.025
ags-0.021
ist-0.021
 sincere-0.021
cca-0.021
Token and
Token ID290
Feature activation+2.38e-02

Pos logit contributions
actus+0.027
fl+0.026
 café+0.026
pillar+0.025
unn+0.025
Neg logit contributions
ags-0.016
 their-0.016
inkle-0.015
 deeply-0.014
cca-0.014
Token ran
Token ID4966
Feature activation+2.52e-02

Pos logit contributions
reating+0.038
actus+0.037
saw+0.035
THE+0.034
pillar+0.032
Neg logit contributions
 their-0.028
cca-0.026
 both-0.025
 together-0.024
 none-0.024
Token,
Token ID11
Feature activation+4.22e-02

Pos logit contributions
actus+0.043
 café+0.042
fl+0.041
saw+0.041
arant+0.040
Neg logit contributions
inkle-0.024
ags-0.023
 their-0.021
 deeply-0.021
 common-0.020
Token all
Token ID477
Feature activation+4.99e-02

Pos logit contributions
reating+0.078
actus+0.077
lan+0.074
bye+0.073
THE+0.073
Neg logit contributions
inkle-0.031
cca-0.031
 their-0.030
 firmly-0.030
 none-0.029
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
actus+0.096
lan+0.090
 café+0.090
rav+0.087
Sweet+0.086
Neg logit contributions
 their-0.046
inkle-0.040
ags-0.035
ised-0.033
 his-0.032
Token way
Token ID835
Feature activation+4.90e-02

Pos logit contributions
THE+0.024
bye+0.024
lan+0.024
reating+0.022
arant+0.022
Neg logit contributions
inkle-0.032
 concern-0.030
ags-0.030
 common-0.030
ised-0.029
Token to
Token ID284
Feature activation+2.19e-02

Pos logit contributions
 café+0.078
actus+0.074
 ze+0.071
fl+0.071
unn+0.070
Neg logit contributions
inkle-0.057
 their-0.052
ags-0.049
 deeply-0.048
 none-0.047
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
saw+0.028
THE+0.027
fl+0.027
 café+0.027
 trail+0.026
Neg logit contributions
inkle-0.033
ags-0.028
 concern-0.028
 sincere-0.028
cca-0.028
Token top
Token ID1353
Feature activation+1.68e-02

Pos logit contributions
unn+0.016
lan+0.016
fl+0.015
reating+0.015
arant+0.015
Neg logit contributions
inkle-0.019
 sincere-0.017
 deeply-0.017
 concern-0.017
ags-0.017
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
fl+0.021
THE+0.020
 café+0.020
actus+0.020
rav+0.020
Neg logit contributions
inkle-0.025
ist-0.021
ags-0.020
ised-0.020
 deeply-0.020
Token When
Token ID1649
Feature activation+1.12e-02

Pos logit contributions
saw+0.018
THE+0.018
 café+0.017
fl+0.017
 trail+0.017
Neg logit contributions
inkle-0.015
 concern-0.013
ags-0.013
cca-0.013
 sincere-0.013
Token in
Token ID287
Feature activation-3.15e-03

Pos logit contributions
inkle+0.016
 concern+0.014
 sympathy+0.013
cca+0.013
ist+0.013
Neg logit contributions
THE-0.010
board-0.009
 trail-0.009
 tree-0.009
fl-0.009
Token the
Token ID262
Feature activation-1.58e-03

Pos logit contributions
inkle+0.006
ist+0.006
agged+0.005
cca+0.005
 concern+0.005
Neg logit contributions
saw-0.003
 café-0.003
THE-0.003
fl-0.003
 roller-0.003
Token park
Token ID3952
Feature activation-4.23e-03

Pos logit contributions
inkle+0.003
 concern+0.003
ags+0.003
cca+0.003
 sincere+0.003
Neg logit contributions
saw-0.001
THE-0.001
 café-0.001
fl-0.001
 trail-0.001
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
inkle+0.010
 concern+0.010
ags+0.009
agged+0.009
bly+0.009
Neg logit contributions
 café-0.002
bye-0.002
 day-0.002
saw-0.002
THE-0.002
Token They
Token ID1119
Feature activation+1.30e-02

Pos logit contributions
inkle+0.006
ist+0.005
 concern+0.005
ags+0.005
bly+0.005
Neg logit contributions
saw-0.009
 café-0.009
THE-0.008
fl-0.008
 trail-0.008
Token like
Token ID588
Feature activation+4.35e-02

Pos logit contributions
saw+0.012
 ze+0.011
 roller+0.011
 café+0.011
 slides+0.011
Neg logit contributions
inkle-0.025
ags-0.023
cca-0.023
agged-0.022
bly-0.022
Token to
Token ID284
Feature activation+2.34e-03

Pos logit contributions
actus+0.047
THE+0.047
arant+0.044
 café+0.042
bye+0.042
Neg logit contributions
 their-0.071
 Their-0.063
ags-0.060
 common-0.060
 concern-0.060
Token swing
Token ID9628
Feature activation+5.27e-03

Pos logit contributions
saw+0.001
 roller+0.001
 café+0.001
 trail+0.001
 slides+0.001
Neg logit contributions
inkle-0.005
cca-0.005
bly-0.005
agged-0.005
ist-0.005
Token and
Token ID290
Feature activation+1.62e-02

Pos logit contributions
fl+0.006
THE+0.006
saw+0.006
 ze+0.006
 trail+0.006
Neg logit contributions
inkle-0.009
 concern-0.008
 sincere-0.008
cca-0.008
bly-0.008
Token slide
Token ID10649
Feature activation+1.98e-02

Pos logit contributions
saw+0.010
THE+0.009
 café+0.009
fl+0.009
 trail+0.009
Neg logit contributions
inkle-0.036
ags-0.032
 concern-0.032
cca-0.032
 sincere-0.032
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
saw+0.022
THE+0.021
fl+0.021
 café+0.021
 trail+0.020
Neg logit contributions
inkle-0.034
 concern-0.030
 sincere-0.030
cca-0.030
ags-0.029
Token moved
Token ID3888
Feature activation-8.83e-03

Pos logit contributions
THE+0.049
saw+0.049
 café+0.047
reating+0.047
actus+0.046
Neg logit contributions
inkle-0.045
 sincere-0.041
cca-0.039
 concern-0.039
 their-0.039
Token his
Token ID465
Feature activation+1.84e-02

Pos logit contributions
inkle+0.012
 concern+0.010
 sincere+0.010
ags+0.010
ist+0.010
Neg logit contributions
saw-0.013
THE-0.012
fl-0.012
 café-0.012
 trail-0.012
Token chess
Token ID19780
Feature activation+3.56e-03

Pos logit contributions
saw+0.027
fl+0.027
THE+0.026
 café+0.026
 ze+0.025
Neg logit contributions
inkle-0.025
ags-0.021
 sincere-0.021
 concern-0.021
cca-0.020
Token piece
Token ID3704
Feature activation+1.22e-02

Pos logit contributions
saw+0.004
THE+0.004
 trail+0.004
 café+0.004
fl+0.004
Neg logit contributions
inkle-0.006
 concern-0.005
ags-0.005
 sincere-0.005
cca-0.005
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
saw+0.012
THE+0.012
 trail+0.011
 café+0.011
 roller+0.011
Neg logit contributions
inkle-0.022
 sincere-0.020
 concern-0.019
ags-0.019
cca-0.019
Token too
Token ID1165
Feature activation+3.74e-02

Pos logit contributions
saw+0.029
 ze+0.028
 roller+0.028
THE+0.028
 trail+0.028
Neg logit contributions
inkle-0.050
cca-0.043
ags-0.042
bly-0.042
agged-0.042
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
saw+0.035
THE+0.034
 café+0.034
 trail+0.033
fl+0.033
Neg logit contributions
inkle-0.068
ags-0.062
 concern-0.061
cca-0.061
 sincere-0.060
Token
Token ID198
Feature activation+2.27e-02

Pos logit contributions
THE+0.020
saw+0.020
 café+0.020
 trail+0.020
fl+0.020
Neg logit contributions
inkle-0.013
ags-0.012
 sincere-0.011
 concern-0.011
 Their-0.011
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
fl+0.035
 café+0.033
saw+0.033
 ostr+0.033
actus+0.033
Neg logit contributions
inkle-0.029
ither-0.023
ags-0.023
 Their-0.023
cca-0.022
TokenIn
Token ID818
Feature activation-1.74e-02

Pos logit contributions
inkle+0.003
cca+0.003
bly+0.003
ither+0.003
ised+0.003
Neg logit contributions
fl-0.004
actus-0.004
unn-0.004
reating-0.004
 café-0.004
Token the
Token ID262
Feature activation+5.30e-03

Pos logit contributions
ist+0.040
cca+0.038
inkle+0.038
agged+0.037
immer+0.037
Neg logit contributions
 day-0.010
 night-0.010
 roller-0.009
 tree-0.008
THE-0.007
Token a
Token ID257
Feature activation-6.55e-03

Pos logit contributions
saw+0.005
THE+0.005
 café+0.005
fl+0.005
 trail+0.005
Neg logit contributions
inkle-0.003
 concern-0.002
 sincere-0.002
ags-0.002
cca-0.002
Token beautiful
Token ID4950
Feature activation-1.73e-02

Pos logit contributions
inkle+0.006
 sincere+0.006
 concern+0.005
 deeply+0.005
 sorry+0.005
Neg logit contributions
unn-0.012
fl-0.012
THE-0.012
saw-0.011
reating-0.011
Token opera
Token ID27296
Feature activation-4.59e-03

Pos logit contributions
inkle+0.027
 concern+0.021
 sincere+0.021
 deeply+0.021
ist+0.020
Neg logit contributions
saw-0.023
unn-0.022
THE-0.022
ups-0.022
lan-0.021
Token.
Token ID13
Feature activation-7.98e-03

Pos logit contributions
inkle+0.007
ist+0.006
 sincere+0.006
ags+0.006
 concern+0.006
Neg logit contributions
saw-0.006
THE-0.005
 café-0.005
unn-0.005
fl-0.005
Token She
Token ID1375
Feature activation+1.79e-02

Pos logit contributions
inkle+0.009
 concern+0.007
ags+0.007
 sincere+0.007
cca+0.007
Neg logit contributions
THE-0.013
saw-0.013
fl-0.013
 café-0.013
 trail-0.013
Token was
Token ID373
Feature activation+1.11e-02

Pos logit contributions
saw+0.020
THE+0.020
reating+0.019
 café+0.019
fl+0.019
Neg logit contributions
inkle-0.030
cca-0.026
 sincere-0.026
 concern-0.025
ist-0.025
Token so
Token ID523
Feature activation+2.04e-02

Pos logit contributions
saw+0.013
THE+0.013
 café+0.012
fl+0.012
 ze+0.012
Neg logit contributions
inkle-0.018
ags-0.016
 concern-0.016
cca-0.016
ist-0.015
Token happy
Token ID3772
Feature activation+7.68e-04

Pos logit contributions
saw+0.028
THE+0.028
 café+0.028
fl+0.027
 trail+0.027
Neg logit contributions
inkle-0.028
ags-0.024
 concern-0.024
cca-0.024
 sincere-0.024
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
saw+0.001
THE+0.001
 café+0.001
actus+0.001
 roller+0.001
Neg logit contributions
inkle-0.001
 their-0.001
ist-0.001
 deeply-0.001
 concern-0.001
Token she
Token ID673
Feature activation+3.98e-02

Pos logit contributions
saw+0.015
 trail+0.015
fl+0.015
 ze+0.015
THE+0.015
Neg logit contributions
inkle-0.038
ags-0.035
cca-0.034
agged-0.033
ist-0.033
Token wanted
Token ID2227
Feature activation+2.33e-02

Pos logit contributions
reating+0.055
saw+0.054
THE+0.049
 café+0.049
fl+0.048
Neg logit contributions
inkle-0.059
 sincere-0.050
cca-0.050
ist-0.048
 concern-0.048
Token glad
Token ID9675
Feature activation-3.70e-03

Pos logit contributions
THE+0.006
saw+0.006
 café+0.005
 roller+0.005
 ze+0.005
Neg logit contributions
inkle-0.013
ist-0.011
ags-0.011
agged-0.011
ither-0.011
Token that
Token ID326
Feature activation+2.14e-02

Pos logit contributions
inkle+0.008
ags+0.007
cca+0.007
agged+0.007
bly+0.007
Neg logit contributions
THE-0.003
fl-0.003
 café-0.003
 trail-0.003
 ze-0.003
Token she
Token ID673
Feature activation+1.48e-02

Pos logit contributions
THE+0.017
fl+0.017
 trail+0.016
 night+0.016
 café+0.015
Neg logit contributions
inkle-0.045
ags-0.043
agged-0.042
ist-0.041
bly-0.039
Token had
Token ID550
Feature activation+1.34e-02

Pos logit contributions
saw+0.013
THE+0.012
 café+0.012
fl+0.012
 trail+0.012
Neg logit contributions
inkle-0.028
 concern-0.025
ags-0.025
cca-0.025
 sincere-0.025
Token been
Token ID587
Feature activation+1.13e-02

Pos logit contributions
THE+0.012
 café+0.012
 roller+0.011
 trail+0.011
bye+0.011
Neg logit contributions
inkle-0.027
ist-0.023
ags-0.022
bly-0.022
ither-0.021
Token so
Token ID523
Feature activation+7.92e-03

Pos logit contributions
THE+0.012
 café+0.012
saw+0.012
 ze+0.011
 roller+0.011
Neg logit contributions
inkle-0.021
 concern-0.017
ist-0.017
ags-0.017
 sincere-0.016
Token alert
Token ID7995
Feature activation-1.21e-02

Pos logit contributions
saw+0.009
bye+0.008
 café+0.008
THE+0.007
ups+0.007
Neg logit contributions
inkle-0.015
ags-0.013
ist-0.013
ishment-0.012
 unison-0.011
Token and
Token ID290
Feature activation+4.59e-03

Pos logit contributions
inkle+0.025
ags+0.023
ither+0.022
cca+0.022
 concern+0.021
Neg logit contributions
fl-0.010
THE-0.009
 to-0.009
 café-0.009
 trail-0.009
Token was
Token ID373
Feature activation+7.80e-03

Pos logit contributions
THE+0.005
saw+0.005
fl+0.005
 café+0.005
bye+0.005
Neg logit contributions
inkle-0.008
ags-0.007
cca-0.007
ist-0.007
agged-0.006
Token running
Token ID2491
Feature activation-2.36e-03

Pos logit contributions
THE+0.009
saw+0.008
 café+0.008
fl+0.008
 desert+0.008
Neg logit contributions
inkle-0.013
ags-0.012
ist-0.011
ither-0.011
 concern-0.011
Token away
Token ID1497
Feature activation+9.11e-03

Pos logit contributions
inkle+0.005
 concern+0.004
 sincere+0.004
ags+0.004
agged+0.004
Neg logit contributions
 trail-0.002
THE-0.002
fl-0.002
bye-0.002
saw-0.002
Token.
Token ID13
Feature activation-3.48e-04

Pos logit contributions
THE+0.009
 café+0.009
fl+0.009
saw+0.009
 trail+0.009
Neg logit contributions
inkle-0.021
ags-0.019
 concern-0.019
 sincere-0.018
bly-0.018
Token But
Token ID887
Feature activation-8.55e-03

Pos logit contributions
inkle+0.001
ist+0.001
bly+0.001
 concern+0.001
cca+0.001
Neg logit contributions
saw-0.000
board-0.000
 café-0.000
THE-0.000
 pi-0.000
Token Lily
Token ID20037
Feature activation-1.36e-03

Pos logit contributions
inkle+0.019
ags+0.018
ist+0.017
ady+0.017
agged+0.017
Neg logit contributions
THE-0.006
 before-0.005
 tree-0.005
 would-0.005
 roller-0.005
Token did
Token ID750
Feature activation-1.42e-02

Pos logit contributions
inkle+0.003
ags+0.002
bly+0.002
 concern+0.002
 sincere+0.002
Neg logit contributions
saw-0.001
 café-0.001
THE-0.001
fl-0.001
 trail-0.001
Token not
Token ID407
Feature activation+7.47e-03

Pos logit contributions
inkle+0.017
ags+0.015
cca+0.015
agged+0.015
ist+0.015
Neg logit contributions
THE-0.022
saw-0.022
 roller-0.020
 trail-0.020
 café-0.020
Token want
Token ID765
Feature activation+1.64e-02

Pos logit contributions
saw+0.010
fl+0.010
THE+0.010
 café+0.010
actus+0.010
Neg logit contributions
inkle-0.010
ags-0.009
 concern-0.009
cca-0.009
 sincere-0.009
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
saw+0.013
THE+0.013
fl+0.013
 café+0.012
 trail+0.012
Neg logit contributions
inkle-0.032
 concern-0.030
ags-0.029
 sincere-0.029
cca-0.028
Token give
Token ID1577
Feature activation+1.57e-03

Pos logit contributions
saw+0.020
 café+0.020
THE+0.020
 trail+0.019
bye+0.019
Neg logit contributions
inkle-0.026
cca-0.023
ags-0.023
 concern-0.023
 sincere-0.022
Token it
Token ID340
Feature activation+3.87e-03

Pos logit contributions
saw+0.002
THE+0.002
 café+0.002
fl+0.002
 trail+0.002
Neg logit contributions
inkle-0.002
ags-0.002
 concern-0.002
 sincere-0.002
cca-0.002
Token up
Token ID510
Feature activation-6.73e-03

Pos logit contributions
saw+0.003
THE+0.003
 café+0.003
fl+0.003
 trail+0.003
Neg logit contributions
inkle-0.007
 sincere-0.007
 concern-0.007
cca-0.007
ags-0.007
Token.
Token ID13
Feature activation+4.31e-03

Pos logit contributions
inkle+0.016
ags+0.014
agged+0.014
 concern+0.014
bly+0.014
Neg logit contributions
THE-0.003
bye-0.003
 café-0.003
 to-0.003
 trail-0.003
Token would
Token ID561
Feature activation-5.74e-03

Pos logit contributions
 trail+0.003
fl+0.003
 café+0.003
 would+0.002
THE+0.002
Neg logit contributions
inkle-0.014
ags-0.014
bly-0.013
agged-0.013
 concern-0.012
Token nod
Token ID18666
Feature activation-1.47e-02

Pos logit contributions
agged+0.013
ised+0.012
inkle+0.012
cca+0.012
ags+0.012
Neg logit contributions
 trail-0.005
 tree-0.004
 would-0.004
 stream-0.004
 c-0.004
Token its
Token ID663
Feature activation-2.09e-02

Pos logit contributions
inkle+0.031
agged+0.028
ags+0.027
ised+0.027
renches+0.026
Neg logit contributions
 to-0.014
 would-0.013
fl-0.013
THE-0.012
 again-0.011
Token head
Token ID1182
Feature activation+1.76e-02

Pos logit contributions
inkle+0.035
agged+0.034
bly+0.033
ags+0.032
 concern+0.031
Neg logit contributions
 trail-0.022
fl-0.022
bye-0.020
 coc-0.019
THE-0.018
Token.
Token ID13
Feature activation-2.22e-03

Pos logit contributions
THE+0.014
bye+0.014
fl+0.014
 to+0.013
 would+0.013
Neg logit contributions
inkle-0.035
ags-0.033
agged-0.032
 concern-0.032
 common-0.031
Token
Token ID198
Feature activation+6.00e-03

Pos logit contributions
inkle+0.004
bly+0.004
ist+0.004
cca+0.004
 concern+0.004
Neg logit contributions
THE-0.002
 café-0.002
saw-0.002
fl-0.002
 ze-0.002
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
saw+0.007
fl+0.007
 café+0.007
THE+0.006
 trail+0.006
Neg logit contributions
inkle-0.010
ags-0.009
 concern-0.009
cca-0.009
 sincere-0.009
TokenThe
Token ID464
Feature activation+6.09e-03

Pos logit contributions
inkle+0.026
ags+0.023
 concern+0.022
 sincere+0.022
cca+0.022
Neg logit contributions
THE-0.019
saw-0.019
 café-0.018
fl-0.018
 trail-0.017
Token end
Token ID886
Feature activation+4.10e-03

Pos logit contributions
 trail+0.005
 café+0.005
THE+0.005
fl+0.005
 ostr+0.005
Neg logit contributions
inkle-0.012
ist-0.011
ags-0.011
agged-0.011
ither-0.011
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
saw+0.005
THE+0.005
 café+0.005
actus+0.005
bye+0.005
Neg logit contributions
inkle-0.006
ist-0.006
 concern-0.005
cca-0.005
 sincere-0.005
Token<|endoftext|>
Token ID50256
Feature activation+2.36e-03

Pos logit contributions
inkle+0.026
 sincere+0.023
ist+0.023
bly+0.023
cca+0.023
Neg logit contributions
THE-0.010
saw-0.009
 café-0.009
bye-0.008
fl-0.008
Token It
Token ID632
Feature activation-9.55e-03

Pos logit contributions
saw+0.005
 café+0.005
THE+0.005
fl+0.005
 trail+0.005
Neg logit contributions
inkle-0.005
ist-0.004
ags-0.004
 concern-0.004
cca-0.004
Token is
Token ID318
Feature activation-1.38e-02

Pos logit contributions
ags+0.019
cca+0.019
inkle+0.019
agged+0.019
bly+0.019
Neg logit contributions
fl-0.007
 ze-0.007
saw-0.007
 trail-0.006
 ostr-0.006
Token a
Token ID257
Feature activation+3.56e-03

Pos logit contributions
inkle+0.026
ist+0.024
agged+0.024
ags+0.024
ised+0.023
Neg logit contributions
 ze-0.015
 ostr-0.013
 roller-0.013
 trail-0.013
 desert-0.012
Token rh
Token ID9529
Feature activation-2.50e-02

Pos logit contributions
saw+0.004
 café+0.004
 trail+0.004
THE+0.004
fl+0.004
Neg logit contributions
inkle-0.006
ags-0.005
cca-0.005
 concern-0.005
agged-0.005
Tokenino
Token ID2879
Feature activation-7.61e-03

Pos logit contributions
inkle+0.047
 concern+0.043
ags+0.042
 sincere+0.042
cca+0.042
Neg logit contributions
saw-0.023
THE-0.021
fl-0.020
 café-0.020
 trail-0.020
Token.
Token ID13
Feature activation+6.69e-04

Pos logit contributions
ags+0.013
inkle+0.013
agged+0.013
 concern+0.013
cca+0.012
Neg logit contributions
saw-0.008
 trail-0.007
fl-0.007
 ze-0.007
 roller-0.007
Token
Token ID198
Feature activation+7.44e-04

Pos logit contributions
saw+0.001
 café+0.001
THE+0.001
fl+0.001
 ze+0.001
Neg logit contributions
inkle-0.001
ist-0.001
 concern-0.001
ags-0.001
 sincere-0.001
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
fl+0.001
 trail+0.001
reating+0.001
 ze+0.001
actus+0.001
Neg logit contributions
inkle-0.001
 common-0.001
cca-0.001
 Their-0.001
ither-0.001
Token"
Token ID1
Feature activation+2.22e-02

Pos logit contributions
inkle+0.025
cca+0.022
bly+0.021
 sincere+0.021
 concern+0.021
Neg logit contributions
fl-0.017
saw-0.016
THE-0.016
reating-0.016
 trail-0.016
TokenWow
Token ID22017
Feature activation+6.40e-03

Pos logit contributions
THE+0.024
 trail+0.024
saw+0.024
unn+0.024
 café+0.023
Neg logit contributions
inkle-0.033
cca-0.032
 concern-0.031
bly-0.031
 sincere-0.031
Token,
Token ID11
Feature activation-3.45e-03

Pos logit contributions
saw+0.010
 café+0.010
THE+0.010
reating+0.010
fl+0.009
Neg logit contributions
inkle-0.008
ist-0.006
ags-0.006
 sincere-0.006
 concern-0.006
Token can
Token ID460
Feature activation-7.30e-03

Pos logit contributions
ags+0.020
agged+0.020
inkle+0.019
bly+0.018
cca+0.018
Neg logit contributions
 would-0.003
ha-0.003
 could-0.003
saw-0.002
'll-0.002
Token both
Token ID1111
Feature activation+5.03e-03

Pos logit contributions
agged+0.015
cca+0.015
ags+0.014
inkle+0.013
ishment+0.013
Neg logit contributions
 slide-0.006
 tree-0.006
saw-0.006
oh-0.005
 roller-0.005
Token use
Token ID779
Feature activation-5.64e-03

Pos logit contributions
saw+0.005
bye+0.005
 roller+0.004
 trail+0.004
 tree+0.004
Neg logit contributions
inkle-0.010
cca-0.009
agged-0.009
ags-0.009
 concern-0.009
Token the
Token ID262
Feature activation+2.32e-03

Pos logit contributions
agged+0.011
cca+0.011
inkle+0.011
 sincere+0.011
 concern+0.011
Neg logit contributions
 to-0.005
 roller-0.004
 tree-0.004
 slides-0.004
!"-0.004
Token notebook
Token ID20922
Feature activation-6.10e-04

Pos logit contributions
 trail+0.003
 roller+0.003
fl+0.003
 café+0.002
 ze+0.002
Neg logit contributions
inkle-0.004
agged-0.004
bly-0.004
cca-0.004
ags-0.003
Token.
Token ID13
Feature activation-5.42e-03

Pos logit contributions
inkle+0.001
 concern+0.001
 sincere+0.001
cca+0.001
ags+0.001
Neg logit contributions
THE-0.000
board-0.000
 trail-0.000
saw-0.000
bye-0.000
Token Tim
Token ID5045
Feature activation-1.35e-02

Pos logit contributions
inkle+0.011
 concern+0.009
 sincere+0.009
agged+0.009
cca+0.009
Neg logit contributions
saw-0.005
THE-0.005
 Would-0.005
bye-0.005
 café-0.005
Token can
Token ID460
Feature activation-1.01e-02

Pos logit contributions
inkle+0.027
 concern+0.025
ags+0.025
bly+0.025
agged+0.025
Neg logit contributions
fl-0.010
THE-0.009
saw-0.009
 café-0.009
bye-0.009
Token put
Token ID1234
Feature activation-2.11e-02

Pos logit contributions
agged+0.022
cca+0.021
ags+0.020
ishment+0.020
ised+0.019
Neg logit contributions
 tree-0.008
 slide-0.007
 would-0.007
oh-0.007
bye-0.006
Token his
Token ID465
Feature activation-2.82e-02

Pos logit contributions
 concern+0.045
 sincere+0.044
inkle+0.044
cca+0.044
agged+0.044
Neg logit contributions
 roller-0.011
 tree-0.011
 ze-0.011
 pi-0.010
 slides-0.010
Token drawings
Token ID23388
Feature activation-8.86e-03

Pos logit contributions
bly+0.050
inkle+0.050
cca+0.046
ist+0.046
ither+0.046
Neg logit contributions
 statue-0.026
 trail-0.025
board-0.024
 coc-0.024
 tree-0.024
Token.
Token ID13
Feature activation-1.23e-03

Pos logit contributions
inkle+0.003
 sincere+0.003
 concern+0.003
ags+0.003
ist+0.003
Neg logit contributions
THE-0.004
saw-0.004
 café-0.003
fl-0.003
 trail-0.003
Token It
Token ID632
Feature activation-8.34e-03

Pos logit contributions
inkle+0.002
ist+0.002
ags+0.002
bly+0.002
 concern+0.002
Neg logit contributions
saw-0.002
 café-0.002
THE-0.001
 roller-0.001
 ze-0.001
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
ags+0.020
inkle+0.020
agged+0.018
ither+0.018
 concern+0.017
Neg logit contributions
 trail-0.004
fl-0.004
saw-0.004
 café-0.004
 ze-0.003
Token brown
Token ID7586
Feature activation-1.28e-02

Pos logit contributions
inkle+0.043
ist+0.041
agged+0.040
ised+0.039
ags+0.038
Neg logit contributions
 ze-0.022
 roller-0.022
 desert-0.021
 shady-0.021
 ostr-0.020
Token and
Token ID290
Feature activation-2.32e-02

Pos logit contributions
inkle+0.021
 concern+0.018
 sincere+0.018
cca+0.018
ags+0.018
Neg logit contributions
fl-0.015
 café-0.015
 trail-0.014
saw-0.014
 ze-0.014
Token round
Token ID2835
Feature activation-4.12e-03

Pos logit contributions
inkle+0.048
ags+0.048
agged+0.046
ist+0.045
ither+0.044
Neg logit contributions
 trail-0.018
fl-0.017
 tree-0.017
 shady-0.016
 ze-0.016
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
inkle+0.008
 concern+0.007
cca+0.007
ags+0.007
 sincere+0.007
Neg logit contributions
fl-0.004
 ze-0.004
saw-0.004
 café-0.004
 trail-0.004
Token had
Token ID550
Feature activation+9.55e-03

Pos logit contributions
inkle+0.024
ags+0.023
agged+0.022
ist+0.021
ither+0.021
Neg logit contributions
 trail-0.007
fl-0.007
 tree-0.006
 roller-0.006
 ze-0.006
Token a
Token ID257
Feature activation-1.98e-02

Pos logit contributions
saw+0.013
 ze+0.012
fl+0.012
 trail+0.012
 roller+0.012
Neg logit contributions
inkle-0.015
ags-0.012
ist-0.012
cca-0.012
 concern-0.012
Token hole
Token ID7604
Feature activation-8.79e-03

Pos logit contributions
ags+0.034
agged+0.033
inkle+0.033
ist+0.032
cca+0.032
Neg logit contributions
 trail-0.022
 roller-0.020
 café-0.019
saw-0.019
 tree-0.018
Token on
Token ID319
Feature activation-1.10e-02

Pos logit contributions
inkle+0.015
 concern+0.015
cca+0.014
ags+0.014
 sincere+0.013
Neg logit contributions
saw-0.009
fl-0.008
 trail-0.008
 roller-0.008
board-0.008
Token the
Token ID262
Feature activation-5.42e-03

Pos logit contributions
THE+0.002
 tree+0.002
fl+0.002
 ze+0.001
 pine+0.001
Neg logit contributions
inkle-0.002
agged-0.002
 sincere-0.002
 concern-0.002
immer-0.002
Token pieces
Token ID5207
Feature activation+1.30e-02

Pos logit contributions
inkle+0.010
cca+0.009
ist+0.009
bly+0.009
ags+0.009
Neg logit contributions
 trail-0.005
fl-0.005
 café-0.005
THE-0.005
saw-0.005
Token of
Token ID286
Feature activation-1.59e-02

Pos logit contributions
THE+0.011
 trail+0.010
saw+0.010
fl+0.009
 café+0.009
Neg logit contributions
inkle-0.025
 sincere-0.024
cca-0.024
 concern-0.024
ags-0.023
Token coal
Token ID5655
Feature activation+1.29e-03

Pos logit contributions
inkle+0.034
agged+0.032
cca+0.032
 sincere+0.031
ist+0.031
Neg logit contributions
 tree-0.013
 pine-0.012
 trail-0.012
 stream-0.012
 ze-0.012
Token for
Token ID329
Feature activation-4.71e-03

Pos logit contributions
fl+0.001
 trail+0.001
THE+0.001
 café+0.001
saw+0.001
Neg logit contributions
inkle-0.003
 sincere-0.002
 concern-0.002
cca-0.002
ags-0.002
Token their
Token ID511
Feature activation-1.09e-02

Pos logit contributions
inkle+0.008
agged+0.007
ist+0.007
 concern+0.007
cca+0.007
Neg logit contributions
THE-0.005
 ze-0.005
 roller-0.005
 trail-0.005
lan-0.005
Token eyes
Token ID2951
Feature activation+2.97e-02

Pos logit contributions
inkle+0.018
 concern+0.017
 sincere+0.016
ags+0.016
cca+0.016
Neg logit contributions
saw-0.012
THE-0.011
 café-0.011
fl-0.011
 ze-0.010
Token and
Token ID290
Feature activation-1.06e-03

Pos logit contributions
 trail+0.021
THE+0.021
 café+0.021
fl+0.020
saw+0.020
Neg logit contributions
inkle-0.061
ags-0.058
 concern-0.056
cca-0.056
agged-0.056
Token buttons
Token ID12163
Feature activation+2.09e-04

Pos logit contributions
inkle+0.002
ags+0.002
 sincere+0.002
ist+0.002
cca+0.002
Neg logit contributions
fl-0.001
THE-0.001
 trail-0.001
 café-0.001
saw-0.001
Token.
Token ID13
Feature activation+5.63e-03

Pos logit contributions
fl+0.000
THE+0.000
saw+0.000
 trail+0.000
 roller+0.000
Neg logit contributions
inkle-0.000
cca-0.000
ags-0.000
 sincere-0.000
 concern-0.000
Token
Token ID198
Feature activation+9.47e-03

Pos logit contributions
saw+0.009
THE+0.009
 café+0.009
fl+0.009
 trail+0.008
Neg logit contributions
inkle-0.007
 concern-0.006
 sincere-0.006
ags-0.006
cca-0.006
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
 café+0.000
 night+0.000
fl+0.000
 ostr+0.000
 desert+0.000
Neg logit contributions
ags-0.001
inkle-0.001
agged-0.001
ither-0.001
hips-0.001
Token baby
Token ID5156
Feature activation+1.35e-02

Pos logit contributions
saw+0.017
THE+0.017
 café+0.016
fl+0.016
 trail+0.016
Neg logit contributions
inkle-0.022
 concern-0.020
cca-0.019
 sincere-0.019
ags-0.019
Token said
Token ID531
Feature activation-2.98e-03

Pos logit contributions
fl+0.011
 café+0.011
THE+0.010
saw+0.010
 trail+0.010
Neg logit contributions
inkle-0.027
ags-0.025
 sincere-0.024
 common-0.024
agged-0.024
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
inkle+0.006
ags+0.005
ishment+0.005
ist+0.005
agged+0.005
Neg logit contributions
fl-0.003
THE-0.003
bye-0.002
 Merry-0.002
rav-0.002
Token "
Token ID366
Feature activation+6.88e-03

Pos logit contributions
THE+0.008
fl+0.008
 café+0.007
 ze+0.006
 trail+0.006
Neg logit contributions
inkle-0.039
ags-0.035
 common-0.035
agged-0.034
ist-0.034
TokenM
Token ID44
Feature activation-1.12e-02

Pos logit contributions
fl+0.005
THE+0.005
saw+0.005
 café+0.005
bye+0.005
Neg logit contributions
inkle-0.014
ags-0.013
 sincere-0.013
 concern-0.013
ist-0.012
Tokenama
Token ID1689
Feature activation+1.37e-02

Pos logit contributions
inkle+0.017
ags+0.014
 concern+0.014
cca+0.014
 sincere+0.014
Neg logit contributions
saw-0.015
THE-0.014
 café-0.014
fl-0.014
 trail-0.014
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
fl+0.015
 café+0.015
saw+0.014
bye+0.014
THE+0.014
Neg logit contributions
inkle-0.023
ags-0.021
agged-0.020
 concern-0.020
cca-0.020
Token don
Token ID836
Feature activation-1.52e-03

Pos logit contributions
inkle+0.021
agged+0.020
ags+0.020
cca+0.019
ittered+0.018
Neg logit contributions
 café-0.008
THE-0.008
fl-0.008
 trail-0.008
saw-0.008
Token't
Token ID470
Feature activation+1.42e-02

Pos logit contributions
inkle+0.002
bly+0.001
 concern+0.001
 sincere+0.001
ags+0.001
Neg logit contributions
 café-0.003
fl-0.003
saw-0.003
lan-0.003
 ze-0.002
Token cry
Token ID3960
Feature activation+1.88e-02

Pos logit contributions
 trail+0.016
bye+0.016
 café+0.015
 coc+0.015
 night+0.014
Neg logit contributions
inkle-0.025
ags-0.024
agged-0.024
cca-0.022
ittered-0.021
Token their
Token ID511
Feature activation-1.69e-02

Pos logit contributions
inkle+0.010
agged+0.009
ags+0.009
 concern+0.008
 sincere+0.008
Neg logit contributions
fl-0.005
 to-0.005
ups-0.004
 trail-0.004
 ze-0.004
Token old
Token ID1468
Feature activation-2.47e-02

Pos logit contributions
inkle+0.031
bly+0.029
cca+0.029
ist+0.028
agged+0.027
Neg logit contributions
fl-0.017
 trail-0.017
 café-0.016
bye-0.016
 roller-0.016
Token home
Token ID1363
Feature activation+2.78e-03

Pos logit contributions
inkle+0.050
bly+0.049
agged+0.047
cca+0.046
ags+0.045
Neg logit contributions
fl-0.023
 trail-0.023
 day-0.021
 café-0.020
 tree-0.020
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
 trail+0.001
ups+0.001
THE+0.001
board+0.001
 would+0.001
Neg logit contributions
ags-0.006
inkle-0.006
agged-0.006
 concern-0.006
cca-0.006
Token<|endoftext|>
Token ID50256
Feature activation+2.93e-03

Pos logit contributions
inkle+0.041
cca+0.037
ags+0.036
bly+0.036
 sincere+0.036
Neg logit contributions
THE-0.024
fl-0.024
saw-0.023
 ze-0.022
bye-0.022
TokenOnce
Token ID7454
Feature activation-1.48e-02

Pos logit contributions
saw+0.003
 café+0.003
THE+0.003
 trail+0.003
fl+0.003
Neg logit contributions
inkle-0.005
ags-0.005
cca-0.005
 sincere-0.005
 concern-0.005
Token upon
Token ID2402
Feature activation-8.61e-03

Pos logit contributions
inkle+0.026
ags+0.024
 concern+0.024
cca+0.024
 sincere+0.023
Neg logit contributions
saw-0.014
THE-0.014
 café-0.013
fl-0.013
 trail-0.013
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
inkle+0.010
ags+0.008
ist+0.008
 sincere+0.008
 concern+0.008
Neg logit contributions
saw-0.014
THE-0.014
 café-0.014
fl-0.013
 trail-0.013
Token time
Token ID640
Feature activation-3.63e-02

Pos logit contributions
inkle+0.037
bly+0.035
 sincere+0.034
ags+0.034
 concern+0.034
Neg logit contributions
THE-0.028
 café-0.027
 roller-0.026
saw-0.026
bye-0.025
Token there
Token ID612
Feature activation-1.01e-02

Pos logit contributions
inkle+0.069
ags+0.067
cca+0.064
bly+0.063
 concern+0.063
Neg logit contributions
fl-0.030
THE-0.030
saw-0.029
rav-0.029
 café-0.027
Token was
Token ID373
Feature activation-1.70e-02

Pos logit contributions
inkle+0.022
ags+0.021
ist+0.020
 concern+0.020
agged+0.020
Neg logit contributions
fl-0.006
THE-0.006
saw-0.006
 café-0.005
 trail-0.005
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
inkle+0.025
ist+0.022
 sincere+0.021
ags+0.020
cca+0.020
Neg logit contributions
THE-0.026
saw-0.025
fl-0.025
 café-0.025
 ze-0.024
Token little
Token ID1310
Feature activation+3.94e-03

Pos logit contributions
reating+0.024
THE+0.024
fl+0.024
ups+0.023
saw+0.022
Neg logit contributions
inkle-0.018
 sincere-0.016
cca-0.015
 common-0.014
 concern-0.014
Token girl
Token ID2576
Feature activation+3.77e-03

Pos logit contributions
THE+0.003
saw+0.003
fl+0.003
 café+0.003
 trail+0.002
Neg logit contributions
inkle-0.008
 sincere-0.007
 concern-0.007
cca-0.007
ags-0.007
Token named
Token ID3706
Feature activation+7.51e-03

Pos logit contributions
THE+0.007
saw+0.007
 trail+0.007
rav+0.007
actus+0.007
Neg logit contributions
inkle-0.003
 sincere-0.002
ist-0.002
cca-0.002
 concern-0.002
Token Jane
Token ID12091
Feature activation+1.74e-02

Pos logit contributions
actus+0.014
THE+0.014
saw+0.013
 ze+0.013
 trail+0.013
Neg logit contributions
inkle-0.007
ags-0.006
 concern-0.006
 Their-0.006
 their-0.005
Token.
Token ID13
Feature activation-4.39e-02

Pos logit contributions
 trail+0.030
 café+0.029
saw+0.029
 concession+0.029
 roller+0.028
Neg logit contributions
inkle-0.019
 sincere-0.016
ist-0.016
 their-0.016
cca-0.015
Token She
Token ID1375
Feature activation+1.86e-02

Pos logit contributions
inkle+0.052
ags+0.045
 sincere+0.044
 concern+0.044
cca+0.043
Neg logit contributions
saw-0.067
THE-0.067
fl-0.066
 café-0.066
 trail-0.066
Token was
Token ID373
Feature activation-1.05e-04

Pos logit contributions
reating+0.034
saw+0.033
THE+0.031
 café+0.031
 ze+0.031
Neg logit contributions
inkle-0.020
 sincere-0.017
cca-0.016
 concern-0.016
 Their-0.014
Token only
Token ID691
Feature activation+1.62e-02

Pos logit contributions
inkle+0.000
ist+0.000
ags+0.000
agged+0.000
cca+0.000
Neg logit contributions
saw-0.000
 café-0.000
 ze-0.000
fl-0.000
 desert-0.000
Token 3
Token ID513
Feature activation+1.70e-03

Pos logit contributions
saw+0.020
fl+0.020
 café+0.019
 ze+0.019
 roller+0.019
Neg logit contributions
inkle-0.026
 sincere-0.022
ags-0.022
agged-0.021
bly-0.021
Token years
Token ID812
Feature activation+2.27e-02

Pos logit contributions
saw+0.003
THE+0.003
fl+0.003
 trail+0.003
 café+0.002
Neg logit contributions
inkle-0.002
cca-0.002
 sincere-0.002
 concern-0.002
ags-0.002
Token the
Token ID262
Feature activation+8.45e-03

Pos logit contributions
inkle+0.021
agged+0.020
 sincere+0.019
ither+0.019
ags+0.019
Neg logit contributions
 tree-0.008
 ze-0.008
 da-0.008
reating-0.008
fl-0.007
Token passport
Token ID17981
Feature activation-2.09e-02

Pos logit contributions
saw+0.011
THE+0.011
fl+0.011
 café+0.011
 trail+0.010
Neg logit contributions
inkle-0.012
 concern-0.011
cca-0.011
ags-0.011
 sincere-0.011
Token and
Token ID290
Feature activation-3.18e-04

Pos logit contributions
inkle+0.042
ags+0.040
agged+0.040
 concern+0.039
cca+0.038
Neg logit contributions
THE-0.018
 trail-0.017
 café-0.015
 tree-0.015
saw-0.015
Token the
Token ID262
Feature activation+7.77e-03

Pos logit contributions
inkle+0.001
ags+0.001
agged+0.001
cca+0.001
ist+0.001
Neg logit contributions
THE-0.000
 café-0.000
saw-0.000
 trail-0.000
fl-0.000
Token magical
Token ID10883
Feature activation-3.85e-03

Pos logit contributions
 café+0.008
THE+0.008
 trail+0.008
saw+0.008
fl+0.008
Neg logit contributions
inkle-0.014
ags-0.012
ist-0.012
agged-0.012
cca-0.012
Token mer
Token ID4017
Feature activation-4.97e-02

Pos logit contributions
inkle+0.007
ags+0.007
agged+0.006
 sincere+0.006
ither+0.006
Neg logit contributions
 trail-0.004
 café-0.004
THE-0.004
 desert-0.003
actus-0.003
Tokenmaid
Token ID23151
Feature activation+5.27e-03

Pos logit contributions
 concern+0.073
inkle+0.072
ags+0.072
 Their+0.069
 sympathy+0.067
Neg logit contributions
actus-0.060
fl-0.055
 sur-0.053
saw-0.052
 trail-0.050
Token was
Token ID373
Feature activation+9.63e-03

Pos logit contributions
fl+0.004
THE+0.004
 café+0.004
 trail+0.004
 ze+0.004
Neg logit contributions
inkle-0.011
ags-0.011
agged-0.010
ither-0.010
 concern-0.010
Token grateful
Token ID14066
Feature activation-1.24e-02

Pos logit contributions
THE+0.010
 desert+0.008
saw+0.008
 café+0.008
 roller+0.008
Neg logit contributions
inkle-0.018
ags-0.016
ist-0.016
agged-0.016
ither-0.015
Token to
Token ID284
Feature activation+1.92e-02

Pos logit contributions
inkle+0.027
ags+0.024
 sincere+0.024
cca+0.024
 concern+0.024
Neg logit contributions
THE-0.009
fl-0.008
bye-0.008
 trail-0.007
lan-0.007
Token get
Token ID651
Feature activation+5.74e-03

Pos logit contributions
 Merry+0.019
THE+0.019
bye+0.018
 night+0.017
reating+0.017
Neg logit contributions
inkle-0.033
cca-0.032
ist-0.031
agged-0.031
ised-0.031
Token berries
Token ID36322
Feature activation-8.59e-03

Pos logit contributions
inkle+0.045
ist+0.040
ags+0.039
cca+0.039
 sincere+0.038
Neg logit contributions
 café-0.016
saw-0.016
 trail-0.016
fl-0.016
 ze-0.015
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
inkle+0.014
 concern+0.013
ags+0.013
cca+0.013
 sincere+0.013
Neg logit contributions
saw-0.009
THE-0.009
fl-0.009
 café-0.009
 trail-0.009
Token He
Token ID679
Feature activation-1.41e-02

Pos logit contributions
inkle+0.004
ags+0.003
 concern+0.003
cca+0.003
 sincere+0.003
Neg logit contributions
saw-0.005
THE-0.004
 café-0.004
fl-0.004
 trail-0.004
Token wanted
Token ID2227
Feature activation+5.87e-03

Pos logit contributions
inkle+0.027
ags+0.025
 concern+0.024
 sincere+0.024
ist+0.023
Neg logit contributions
saw-0.012
 café-0.012
THE-0.012
fl-0.011
 trail-0.011
Token to
Token ID284
Feature activation-4.80e-03

Pos logit contributions
 café+0.002
fl+0.002
 ze+0.001
saw+0.001
 roller+0.001
Neg logit contributions
inkle-0.016
ags-0.014
ist-0.014
bly-0.014
agged-0.014
Token act
Token ID719
Feature activation-4.21e-02

Pos logit contributions
inkle+0.009
cca+0.008
ags+0.008
ist+0.008
 sincere+0.008
Neg logit contributions
saw-0.004
 café-0.004
 trail-0.004
THE-0.004
fl-0.004
Token like
Token ID588
Feature activation+2.59e-04

Pos logit contributions
inkle+0.076
ags+0.070
ishment+0.068
agged+0.067
ised+0.066
Neg logit contributions
 to-0.039
 café-0.038
 trail-0.037
THE-0.036
saw-0.035
Token he
Token ID339
Feature activation-1.53e-02

Pos logit contributions
 café+0.000
saw+0.000
THE+0.000
fl+0.000
 ze+0.000
Neg logit contributions
inkle-0.000
ags-0.000
cca-0.000
ist-0.000
 concern-0.000
Token was
Token ID373
Feature activation+3.25e-03

Pos logit contributions
inkle+0.031
ags+0.029
 concern+0.028
 sincere+0.028
cca+0.027
Neg logit contributions
 café-0.011
saw-0.011
THE-0.011
fl-0.011
 trail-0.011
Token picking
Token ID10868
Feature activation-4.57e-03

Pos logit contributions
 ze+0.004
 café+0.003
saw+0.003
fl+0.003
 desert+0.003
Neg logit contributions
inkle-0.006
ist-0.005
ags-0.005
ised-0.005
agged-0.005
Token them
Token ID606
Feature activation-8.38e-03

Pos logit contributions
inkle+0.009
 concern+0.008
 sincere+0.008
ist+0.008
ags+0.007
Neg logit contributions
fl-0.005
saw-0.004
 ze-0.004
 café-0.004
THE-0.004
Token were
Token ID547
Feature activation-1.33e-02

Pos logit contributions
 roller+0.002
 ze+0.002
 slides+0.002
 café+0.002
 to+0.002
Neg logit contributions
inkle-0.006
ags-0.005
agged-0.005
ither-0.005
bly-0.005
Token his
Token ID465
Feature activation-1.89e-02

Pos logit contributions
inkle+0.023
ags+0.020
agged+0.019
ist+0.019
cca+0.018
Neg logit contributions
 ze-0.016
 roller-0.015
THE-0.015
saw-0.015
 café-0.015
Token two
Token ID734
Feature activation-1.76e-02

Pos logit contributions
inkle+0.032
bly+0.031
agged+0.031
ags+0.030
cca+0.029
Neg logit contributions
 ze-0.021
fl-0.020
 trail-0.020
 roller-0.020
 café-0.020
Token best
Token ID1266
Feature activation-2.39e-02

Pos logit contributions
inkle+0.030
ags+0.027
bly+0.026
agged+0.026
cca+0.026
Neg logit contributions
saw-0.019
 ze-0.019
 café-0.019
 trail-0.019
 roller-0.018
Token friends
Token ID2460
Feature activation-6.14e-03

Pos logit contributions
inkle+0.035
ags+0.030
 concern+0.030
cca+0.030
 sincere+0.030
Neg logit contributions
saw-0.031
THE-0.031
 café-0.030
fl-0.030
 trail-0.030
Token hiding
Token ID11816
Feature activation-4.66e-02

Pos logit contributions
inkle+0.011
ags+0.010
 concern+0.010
cca+0.010
 sincere+0.010
Neg logit contributions
THE-0.006
saw-0.006
 café-0.006
fl-0.006
 ze-0.005
Token with
Token ID351
Feature activation-2.49e-02

Pos logit contributions
inkle+0.081
 concern+0.073
 sincere+0.071
agged+0.070
ags+0.068
Neg logit contributions
 trail-0.054
fl-0.053
THE-0.052
lan-0.045
 roller-0.045
Token the
Token ID262
Feature activation+2.53e-03

Pos logit contributions
inkle+0.046
ist+0.040
cca+0.038
agged+0.037
ags+0.037
Neg logit contributions
 ze-0.029
 roller-0.027
 coc-0.027
 tree-0.026
 slides-0.026
Token block
Token ID2512
Feature activation-1.19e-02

Pos logit contributions
saw+0.003
fl+0.003
 trail+0.003
 café+0.003
THE+0.002
Neg logit contributions
inkle-0.004
cca-0.004
ags-0.004
bly-0.004
ist-0.004
Token of
Token ID286
Feature activation-1.63e-02

Pos logit contributions
inkle+0.024
 concern+0.022
ags+0.022
 sincere+0.021
cca+0.021
Neg logit contributions
saw-0.010
 trail-0.010
THE-0.010
board-0.009
 café-0.009
Token ice
Token ID4771
Feature activation-6.15e-03

Pos logit contributions
inkle+0.033
ist+0.029
cca+0.029
agged+0.029
bly+0.027
Neg logit contributions
 ze-0.017
 roller-0.016
 tree-0.016
 coc-0.015
saw-0.015
Token him
Token ID683
Feature activation-1.34e-02

Pos logit contributions
inkle+0.044
bly+0.040
 sincere+0.039
agged+0.039
ags+0.039
Neg logit contributions
fl-0.015
 to-0.014
ups-0.014
 night-0.013
bye-0.013
Token to
Token ID284
Feature activation-1.28e-02

Pos logit contributions
inkle+0.034
ags+0.031
agged+0.031
cca+0.031
bly+0.031
Neg logit contributions
fl-0.005
THE-0.005
 trail-0.005
 to-0.004
bye-0.003
Token a
Token ID257
Feature activation+1.66e-03

Pos logit contributions
inkle+0.023
agged+0.022
ist+0.021
bly+0.021
ags+0.020
Neg logit contributions
 trail-0.015
 café-0.014
 night-0.014
 ze-0.014
 desert-0.014
Token deep
Token ID2769
Feature activation-4.57e-02

Pos logit contributions
saw+0.002
fl+0.002
THE+0.002
 café+0.002
bye+0.002
Neg logit contributions
inkle-0.003
 sincere-0.002
 concern-0.002
cca-0.002
ist-0.002
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
agged+0.094
ags+0.092
 concern+0.092
inkle+0.092
bly+0.089
Neg logit contributions
 trail-0.036
 desert-0.035
 café-0.032
 stream-0.030
 tree-0.030
Token lasting
Token ID15727
Feature activation-4.20e-02

Pos logit contributions
inkle+0.039
ags+0.037
agged+0.035
ither+0.034
ist+0.034
Neg logit contributions
 trail-0.023
saw-0.023
THE-0.021
 café-0.021
 roller-0.021
Token friendship
Token ID14738
Feature activation-3.01e-03

Pos logit contributions
inkle+0.077
ags+0.071
bly+0.070
cca+0.070
agged+0.070
Neg logit contributions
 trail-0.042
bye-0.042
 desert-0.041
fl-0.039
 night-0.038
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
inkle+0.006
ags+0.006
cca+0.006
agged+0.006
 concern+0.006
Neg logit contributions
fl-0.002
 trail-0.002
THE-0.002
 café-0.002
saw-0.002
Token He
Token ID679
Feature activation-1.52e-02

Pos logit contributions
inkle+0.034
ist+0.029
bly+0.029
cca+0.028
ags+0.028
Neg logit contributions
THE-0.025
saw-0.024
 café-0.024
bye-0.022
board-0.022
Token thanked
Token ID26280
Feature activation-2.60e-02

Pos logit contributions
ags+0.033
inkle+0.033
agged+0.031
ither+0.031
bly+0.030
Neg logit contributions
THE-0.010
 trail-0.009
 ze-0.009
 café-0.008
fl-0.008
Token his
Token ID465
Feature activation-1.50e-02

Pos logit contributions
inkle+0.056
ady+0.051
agged+0.051
bly+0.050
cca+0.050
Neg logit contributions
reating-0.019
 ze-0.019
 tree-0.019
THE-0.018
 Merry-0.018