TOP ACTIVATIONS
MAX = 0.220

delicious smell coming from the concession stand . She followed the
- a box of mult icol ored bubbles !
were so clean they looked brand new ! She was amazed
man stepped closer and said gru ff ly , " You
's toys and they looked brand new . Lily was so
They put Lily on a stret cher and carried her to
job ! That statue looks brand new !" Lily smiled and
happy , it was a brand new ice cream machine .
she heard a loud , gru ff voice calling her name
was so proud of his brand - new suit .
ks . It was a gru ff voice and it said
the clothes and they looked brand new . Lily was very
holding the handle of a brand new bike . His mum
. It was huge and brand new . When
the freezer until it looked brand new .
bled up all the ch ocol ates , and she was
washed until his room felt brand new . He chose bright
all fixed up and looked brand new ! "
my ! Your basketball looks brand new !" Tim my felt
her mom bought her a brand new coloring book . She

MIN ACTIVATIONS
MIN = -0.197

They took turns making z ig z ags with their feet
They check for more z ig z ags . They find
wanted to make his tw ig special , so he spent
. He had to z ig zag around and make big
around and around in z ig z ags .
from straight lines to z ig z ags .
of fun with their z ig z ags .
. They make many z ig z ags and circles and
small bird needed a tw ig for her nest . She
were now perfect for Ab ig ail and her family .
friends played with their z ig z ags every day .
went to find a tw ig to help her .
suggested they find another tw ig , and so they carried
Tom wiped away his z ig zag and went on with
purple and makes a z ig zag shape . It is
difficult , but it could also be fun and tasty .
decided to make a z ig zag with the water .
he could finish his z ig zag . <|endoftext|> Once upon
They made a big z ig zag in the air .
on and makes a z ig zag on the paper .

INTERVAL 0.116 - 0.220
CONTAINS 0.526%

asked anx iously , " What is the answer ?" The
she and her mom went out into the woods . As
smiled and said , " Of course , Tim my .
The girl felt something in the water . She
His owner smiled . " Of course you can ," he

INTERVAL 0.011 - 0.116
CONTAINS 35.042%

Lily said . They kept building the tower higher
stories from all over the world ." Jack 's
next day . <|endoftext|> Once , there was a boy named
smells bad . But they think it is cool .
There ," Lily says . " Now he is warm and

INTERVAL -0.093 - 0.011
CONTAINS 62.379%

then , she made a mistake . She put her coat
of fun . When it was time to go home ,
pear tree grew very tall and strong . The pe aches
in the yard with his friends . They would run ,
bags and take their things . That is stealing . That

INTERVAL -0.197 - -0.093
CONTAINS 2.053%

surprised and happy . She had found a friend .
two friends who wanted to use it . "
was a girl . She had a little dog that she
out . The rabbit had used his wisdom to figure out
wears it today . Where ver she goes , it reminds
Token delicious
Token ID12625
Feature activation+6.59e-02

Pos logit contributions
 fascination+0.038
 grin+0.037
 stret+0.037
 glances+0.037
ote+0.036
Neg logit contributions
B-0.022
L-0.021
st-0.020
Tim-0.020
P-0.020
Token smell
Token ID8508
Feature activation+8.81e-02

Pos logit contributions
 fascination+0.150
 stret+0.146
 grin+0.143
 glances+0.142
oured+0.141
Neg logit contributions
st-0.082
B-0.082
P-0.078
Tim-0.078
 day-0.077
Token coming
Token ID2406
Feature activation+1.10e-01

Pos logit contributions
 fascination+0.266
 grin+0.253
 stret+0.249
ote+0.248
icol+0.247
Neg logit contributions
st-0.061
B-0.059
 sometimes-0.055
 day-0.054
 next-0.052
Token from
Token ID422
Feature activation+5.28e-02

Pos logit contributions
 grin+0.253
 fascination+0.250
 stret+0.242
ote+0.239
 glances+0.235
Neg logit contributions
B-0.147
L-0.135
st-0.134
Tim-0.133
P-0.130
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
 grin+0.140
ote+0.134
 fascination+0.133
 stret+0.133
 vows+0.129
Neg logit contributions
B-0.050
st-0.047
 sometimes-0.046
One-0.045
Tim-0.044
Token concession
Token ID33130
Feature activation+2.20e-01

Pos logit contributions
B+0.022
P+0.020
st+0.020
L+0.019
One+0.019
Neg logit contributions
 grin-0.036
 fascination-0.036
 stret-0.035
ote-0.034
 glances-0.034
Token stand
Token ID1302
Feature activation+1.65e-02

Pos logit contributions
 fascination+0.445
 amaz+0.412
 anticipation+0.398
 grin+0.395
icol+0.387
Neg logit contributions
 next-0.365
L-0.364
B-0.360
st-0.354
Tim-0.348
Token.
Token ID13
Feature activation-4.82e-03

Pos logit contributions
 grin+0.039
 fascination+0.039
 stret+0.038
ote+0.038
 glances+0.037
Neg logit contributions
B-0.019
P-0.017
st-0.017
L-0.017
One-0.017
Token She
Token ID1375
Feature activation+3.54e-02

Pos logit contributions
B+0.007
P+0.007
st+0.006
One+0.006
Tim+0.006
Neg logit contributions
 grin-0.010
 fascination-0.010
 stret-0.010
ote-0.010
 glances-0.009
Token followed
Token ID3940
Feature activation+1.79e-02

Pos logit contributions
 grin+0.078
 fascination+0.077
 stret+0.076
ote+0.074
 glances+0.073
Neg logit contributions
B-0.048
P-0.044
st-0.044
L-0.044
One-0.043
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
 grin+0.045
 fascination+0.045
 stret+0.044
ote+0.044
 glances+0.043
Neg logit contributions
B-0.019
st-0.017
L-0.016
Tim-0.016
P-0.016
Token -
Token ID532
Feature activation+2.09e-02

Pos logit contributions
st+0.002
 sometimes+0.002
B+0.002
 next+0.002
 day+0.002
Neg logit contributions
 fascination-0.008
 grin-0.008
 stret-0.008
ote-0.008
oured-0.008
Token a
Token ID257
Feature activation+5.24e-03

Pos logit contributions
 grin+0.053
 stret+0.052
ote+0.051
 fascination+0.051
 glances+0.049
Neg logit contributions
B-0.022
 sometimes-0.021
st-0.020
L-0.020
Tim-0.019
Token box
Token ID3091
Feature activation+2.54e-03

Pos logit contributions
 glances+0.011
 fascination+0.011
ote+0.011
 stret+0.011
 grin+0.011
Neg logit contributions
B-0.007
L-0.007
st-0.007
One-0.006
Tim-0.006
Token of
Token ID286
Feature activation-2.37e-02

Pos logit contributions
 fascination+0.006
 stret+0.006
 grin+0.006
 glances+0.006
ote+0.006
Neg logit contributions
st-0.003
B-0.003
L-0.003
 sometimes-0.003
Tim-0.003
Token mult
Token ID1963
Feature activation+3.27e-02

Pos logit contributions
B+0.032
Tim+0.029
 sometimes+0.029
L+0.028
 day+0.028
Neg logit contributions
ote-0.054
 grin-0.052
 stret-0.051
oured-0.050
atively-0.050
Tokenicol
Token ID27045
Feature activation+1.92e-01

Pos logit contributions
 stret+0.025
 expect+0.025
 grin+0.024
 fascination+0.024
 amused+0.024
Neg logit contributions
st-0.085
B-0.085
ety-0.082
L-0.080
P-0.080
Tokenored
Token ID1850
Feature activation-4.42e-02

Pos logit contributions
 fascination+0.305
 grin+0.299
 gru+0.283
 stret+0.279
 gasp+0.270
Neg logit contributions
st-0.387
B-0.375
ety-0.364
P-0.358
L-0.358
Token bubbles
Token ID25037
Feature activation-2.55e-02

Pos logit contributions
st+0.060
B+0.060
L+0.057
 sometimes+0.055
 next+0.054
Neg logit contributions
 grin-0.099
 stret-0.095
 fascination-0.095
ote-0.093
 hi-0.091
Token!
Token ID0
Feature activation+3.48e-02

Pos logit contributions
st+0.027
B+0.026
 sometimes+0.025
P+0.022
 next+0.022
Neg logit contributions
 fascination-0.065
 grin-0.063
 stret-0.062
ote-0.061
 hi-0.060
Token
Token ID198
Feature activation-2.12e-02

Pos logit contributions
 grin+0.067
 fascination+0.066
 stret+0.063
ote+0.062
 hi+0.062
Neg logit contributions
B-0.055
P-0.054
st-0.052
One-0.052
ety-0.051
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
ety+0.022
st+0.019
P+0.019
B+0.017
 sometimes+0.017
Neg logit contributions
 grin-0.052
 fascination-0.052
 stret-0.051
 hi-0.049
 hello-0.049
Token were
Token ID547
Feature activation+3.80e-03

Pos logit contributions
 fascination+0.191
 grin+0.189
 stret+0.182
ote+0.181
 glances+0.179
Neg logit contributions
B-0.076
st-0.071
P-0.068
L-0.068
 sometimes-0.068
Token so
Token ID523
Feature activation+1.99e-02

Pos logit contributions
 grin+0.008
 fascination+0.008
 stret+0.008
ote+0.008
 glances+0.008
Neg logit contributions
B-0.005
P-0.005
st-0.005
L-0.005
One-0.005
Token clean
Token ID3424
Feature activation-9.72e-03

Pos logit contributions
 grin+0.039
 fascination+0.039
 stret+0.038
ote+0.037
 glances+0.036
Neg logit contributions
B-0.032
P-0.030
st-0.030
L-0.029
One-0.029
Token they
Token ID484
Feature activation-2.68e-02

Pos logit contributions
B+0.010
st+0.009
 sometimes+0.009
L+0.009
P+0.009
Neg logit contributions
 fascination-0.025
 grin-0.025
 stret-0.024
ote-0.024
 glances-0.023
Token looked
Token ID3114
Feature activation+4.53e-02

Pos logit contributions
B+0.028
 sometimes+0.026
st+0.026
L+0.025
ety+0.025
Neg logit contributions
 fascination-0.068
 grin-0.067
ote-0.066
 stret-0.065
 glances-0.064
Token brand
Token ID4508
Feature activation+1.92e-01

Pos logit contributions
ote+0.116
 grin+0.115
 fascination+0.115
leaf+0.108
 stret+0.107
Neg logit contributions
 sometimes-0.050
 next-0.046
B-0.043
 day-0.043
st-0.042
Token new
Token ID649
Feature activation+5.04e-02

Pos logit contributions
 grin+0.388
 fascination+0.385
 stret+0.372
ote+0.364
 glances+0.351
Neg logit contributions
B-0.307
st-0.286
P-0.284
L-0.284
Tim-0.278
Token!
Token ID0
Feature activation+4.37e-02

Pos logit contributions
 grin+0.125
 fascination+0.124
 stret+0.122
ote+0.120
 glances+0.117
Neg logit contributions
B-0.054
st-0.050
P-0.049
L-0.048
One-0.047
Token She
Token ID1375
Feature activation+4.81e-02

Pos logit contributions
 grin+0.088
 fascination+0.087
 stret+0.086
 hi+0.085
 hello+0.084
Neg logit contributions
st-0.064
B-0.061
P-0.061
ety-0.054
One-0.053
Token was
Token ID373
Feature activation-3.87e-03

Pos logit contributions
 grin+0.112
 fascination+0.111
 stret+0.110
ote+0.108
 glances+0.106
Neg logit contributions
B-0.059
L-0.054
P-0.054
st-0.054
Tim-0.052
Token amazed
Token ID24562
Feature activation+1.28e-02

Pos logit contributions
B+0.004
 sometimes+0.004
L+0.004
One+0.004
Tim+0.004
Neg logit contributions
 grin-0.009
 stret-0.009
ote-0.009
 fascination-0.009
 glances-0.009
Token man
Token ID582
Feature activation+7.13e-02

Pos logit contributions
B+0.018
L+0.016
P+0.016
Tim+0.016
 next+0.016
Neg logit contributions
 fascination-0.028
 stret-0.027
 grin-0.027
 glances-0.026
ote-0.026
Token stepped
Token ID10764
Feature activation+5.16e-02

Pos logit contributions
 fascination+0.168
 stret+0.168
 grin+0.161
icol+0.159
ote+0.159
Neg logit contributions
B-0.088
L-0.083
st-0.079
Tim-0.079
 sometimes-0.078
Token closer
Token ID5699
Feature activation+8.35e-02

Pos logit contributions
 fascination+0.111
ote+0.110
 grin+0.110
 expect+0.105
 stret+0.105
Neg logit contributions
B-0.079
P-0.074
 next-0.070
st-0.070
L-0.070
Token and
Token ID290
Feature activation-1.30e-02

Pos logit contributions
 stret+0.249
 fascination+0.246
 grin+0.241
ote+0.240
leaf+0.234
Neg logit contributions
st-0.047
B-0.045
 sometimes-0.044
 next-0.042
But-0.035
Token said
Token ID531
Feature activation+3.00e-02

Pos logit contributions
B+0.018
L+0.018
st+0.017
P+0.016
Tim+0.016
Neg logit contributions
 grin-0.030
 fascination-0.030
 stret-0.029
ote-0.029
 expression-0.028
Token gru
Token ID22848
Feature activation+1.92e-01

Pos logit contributions
 fascination+0.086
 stret+0.082
 anticipation+0.079
 amaz+0.079
latable+0.078
Neg logit contributions
B-0.026
L-0.024
"-0.023
 sometimes-0.023
 next-0.021
Tokenff
Token ID487
Feature activation+5.71e-02

Pos logit contributions
 stret+0.269
 grin+0.249
 fascination+0.243
ote+0.235
 hi+0.219
Neg logit contributions
B-0.422
ety-0.415
st-0.413
One-0.405
L-0.402
Tokenly
Token ID306
Feature activation+2.85e-02

Pos logit contributions
 fascination+0.106
 stret+0.100
 grin+0.095
 amaz+0.093
leaf+0.091
Neg logit contributions
B-0.099
st-0.096
ety-0.093
L-0.093
 sometimes-0.092
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
 fascination+0.075
 stret+0.074
leaf+0.072
 grin+0.070
latable+0.069
Neg logit contributions
B-0.028
 sometimes-0.025
"-0.025
st-0.024
L-0.023
Token "
Token ID366
Feature activation+2.61e-02

Pos logit contributions
B+0.014
One+0.013
L+0.013
But+0.013
 sometimes+0.013
Neg logit contributions
 fascination-0.025
 grin-0.024
 stret-0.024
 glances-0.023
ote-0.023
TokenYou
Token ID1639
Feature activation+6.44e-02

Pos logit contributions
 fascination+0.053
leaf+0.052
ote+0.051
 stret+0.050
 hi+0.050
Neg logit contributions
 day-0.027
st-0.027
ety-0.027
B-0.026
ib-0.024
Token's
Token ID338
Feature activation-6.47e-02

Pos logit contributions
B+0.045
st+0.043
P+0.038
ety+0.037
L+0.037
Neg logit contributions
 grin-0.151
 fascination-0.151
 stret-0.148
ote-0.146
 hi-0.144
Token toys
Token ID14958
Feature activation-5.04e-03

Pos logit contributions
 day+0.092
B+0.088
st+0.087
 next+0.087
 sometimes+0.082
Neg logit contributions
ote-0.130
 grin-0.125
 stret-0.125
 fascination-0.122
 hi-0.121
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
B+0.005
st+0.004
 sometimes+0.004
L+0.004
Tim+0.004
Neg logit contributions
 fascination-0.013
 grin-0.013
 stret-0.013
ote-0.013
 hi-0.013
Token they
Token ID484
Feature activation-1.63e-02

Pos logit contributions
 fascination+0.026
 grin+0.026
 stret+0.026
ote+0.026
 hi+0.025
Neg logit contributions
B-0.012
 sometimes-0.011
L-0.011
Tim-0.011
st-0.011
Token looked
Token ID3114
Feature activation+5.65e-02

Pos logit contributions
B+0.018
st+0.016
L+0.016
 sometimes+0.016
P+0.016
Neg logit contributions
 fascination-0.039
 grin-0.039
ote-0.039
 stret-0.039
 glances-0.038
Token brand
Token ID4508
Feature activation+1.91e-01

Pos logit contributions
ote+0.156
 stret+0.149
person+0.142
 grin+0.141
 fascination+0.140
Neg logit contributions
 sometimes-0.050
 day-0.050
 soon-0.048
 together-0.047
 next-0.046
Token new
Token ID649
Feature activation+4.91e-02

Pos logit contributions
 grin+0.443
 fascination+0.442
 stret+0.440
ote+0.426
 gasp+0.409
Neg logit contributions
B-0.254
st-0.238
L-0.236
Tim-0.223
P-0.222
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 grin+0.124
 fascination+0.123
 stret+0.122
ote+0.119
 glances+0.116
Neg logit contributions
B-0.051
st-0.048
P-0.044
L-0.044
One-0.044
Token Lily
Token ID20037
Feature activation-4.75e-02

Pos logit contributions
 grin+0.022
 fascination+0.021
 stret+0.021
ote+0.020
 hi+0.020
Neg logit contributions
st-0.015
P-0.015
B-0.014
One-0.013
ety-0.013
Token was
Token ID373
Feature activation+3.01e-03

Pos logit contributions
B+0.054
P+0.049
st+0.049
One+0.048
L+0.048
Neg logit contributions
 grin-0.115
 fascination-0.114
 stret-0.111
ote-0.109
 glances-0.108
Token so
Token ID523
Feature activation+2.73e-02

Pos logit contributions
 stret+0.007
ote+0.007
 grin+0.007
 fascination+0.007
 glances+0.007
Neg logit contributions
B-0.003
 sometimes-0.003
L-0.003
One-0.003
st-0.003
Token They
Token ID1119
Feature activation+3.44e-02

Pos logit contributions
B+0.003
st+0.003
P+0.003
ety+0.003
One+0.003
Neg logit contributions
 grin-0.010
 stret-0.010
 fascination-0.010
 glances-0.010
 hello-0.010
Token put
Token ID1234
Feature activation-1.03e-02

Pos logit contributions
ote+0.079
 fascination+0.079
 grin+0.078
icol+0.074
 stret+0.073
Neg logit contributions
B-0.044
L-0.043
 sometimes-0.040
Tim-0.040
P-0.040
Token Lily
Token ID20037
Feature activation-4.22e-02

Pos logit contributions
B+0.012
L+0.011
P+0.011
 sometimes+0.011
Tim+0.010
Neg logit contributions
ote-0.026
 grin-0.026
 fascination-0.025
 stret-0.024
 expression-0.023
Token on
Token ID319
Feature activation-3.06e-02

Pos logit contributions
B+0.056
P+0.053
One+0.051
Tim+0.051
st+0.051
Neg logit contributions
 stret-0.093
 grin-0.092
 fascination-0.091
 glances-0.087
 hi-0.086
Token a
Token ID257
Feature activation+1.86e-02

Pos logit contributions
B+0.031
L+0.029
Tim+0.028
One+0.027
st+0.026
Neg logit contributions
 grin-0.080
ote-0.077
 fascination-0.077
 hi-0.074
 stret-0.074
Token stret
Token ID5759
Feature activation+1.90e-01

Pos logit contributions
 fascination+0.020
 grin+0.020
ote+0.019
oured+0.019
 glances+0.018
Neg logit contributions
B-0.046
L-0.045
One-0.044
Tim-0.044
P-0.044
Tokencher
Token ID2044
Feature activation+1.46e-02

Pos logit contributions
 fascination+0.304
 grin+0.296
ote+0.287
 expression+0.281
 stret+0.280
Neg logit contributions
B-0.383
L-0.362
st-0.356
Tim-0.350
P-0.345
Token and
Token ID290
Feature activation+3.71e-03

Pos logit contributions
 grin+0.039
 fascination+0.038
ote+0.037
 stret+0.037
 glances+0.037
Neg logit contributions
B-0.013
st-0.012
L-0.011
One-0.011
P-0.011
Token carried
Token ID5281
Feature activation-1.34e-02

Pos logit contributions
 grin+0.009
 fascination+0.009
ote+0.008
ement+0.008
 expression+0.008
Neg logit contributions
L-0.005
B-0.005
 sometimes-0.004
P-0.004
Tim-0.004
Token her
Token ID607
Feature activation+1.07e-02

Pos logit contributions
B+0.015
L+0.014
Tim+0.013
P+0.013
st+0.012
Neg logit contributions
 grin-0.034
 fascination-0.034
ote-0.033
 stret-0.032
 expression-0.032
Token to
Token ID284
Feature activation+2.29e-02

Pos logit contributions
 grin+0.024
 fascination+0.024
 stret+0.023
ote+0.023
 glances+0.022
Neg logit contributions
B-0.015
st-0.013
P-0.013
L-0.013
One-0.013
Token job
Token ID1693
Feature activation+2.36e-03

Pos logit contributions
B+0.003
st+0.002
P+0.002
L+0.002
Tim+0.002
Neg logit contributions
 fascination-0.003
 grin-0.003
 stret-0.003
ote-0.003
 glances-0.003
Token!
Token ID0
Feature activation+3.89e-02

Pos logit contributions
 fascination+0.006
 grin+0.006
 stret+0.006
ote+0.006
 glances+0.006
Neg logit contributions
st-0.002
B-0.002
 sometimes-0.002
 next-0.002
Tim-0.002
Token That
Token ID1320
Feature activation+3.64e-02

Pos logit contributions
 grin+0.069
 hi+0.068
 fascination+0.068
 hello+0.067
ote+0.066
Neg logit contributions
B-0.064
P-0.063
st-0.063
ety-0.060
One-0.060
Token statue
Token ID15207
Feature activation+4.10e-03

Pos logit contributions
 grin+0.100
 stret+0.099
 fascination+0.099
ote+0.095
 glances+0.093
Neg logit contributions
st-0.031
B-0.030
 sometimes-0.028
P-0.028
One-0.027
Token looks
Token ID3073
Feature activation+9.59e-02

Pos logit contributions
 fascination+0.010
 grin+0.010
 stret+0.010
ote+0.009
 glances+0.009
Neg logit contributions
B-0.005
st-0.005
P-0.004
L-0.004
Tim-0.004
Token brand
Token ID4508
Feature activation+1.87e-01

Pos logit contributions
 grin+0.250
ote+0.243
 fascination+0.243
 stret+0.241
atively+0.237
Neg logit contributions
 sometimes-0.107
B-0.095
L-0.093
One-0.092
But-0.091
Token new
Token ID649
Feature activation+4.45e-02

Pos logit contributions
 grin+0.368
 fascination+0.368
 stret+0.357
ote+0.341
 glances+0.334
Neg logit contributions
B-0.308
st-0.287
L-0.285
P-0.284
Tim-0.281
Token!"
Token ID2474
Feature activation+2.46e-02

Pos logit contributions
 grin+0.107
 fascination+0.106
 stret+0.104
ote+0.102
 glances+0.101
Neg logit contributions
B-0.051
P-0.046
st-0.046
L-0.045
One-0.045
Token Lily
Token ID20037
Feature activation-3.69e-02

Pos logit contributions
 grin+0.064
 fascination+0.063
 stret+0.062
ote+0.061
 glances+0.060
Neg logit contributions
B-0.024
P-0.021
st-0.021
One-0.020
L-0.020
Token smiled
Token ID13541
Feature activation+2.01e-02

Pos logit contributions
B+0.053
P+0.049
st+0.049
L+0.048
One+0.048
Neg logit contributions
 grin-0.078
 fascination-0.078
 stret-0.076
ote-0.074
 glances-0.073
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 stret+0.056
 fascination+0.055
 grin+0.054
ote+0.054
leaf+0.053
Neg logit contributions
B-0.016
L-0.014
 sometimes-0.013
st-0.013
One-0.013
Token happy
Token ID3772
Feature activation-1.66e-02

Pos logit contributions
 stret+0.078
ote+0.075
icol+0.073
 fascination+0.073
 grin+0.072
Neg logit contributions
P-0.047
 next-0.046
B-0.046
L-0.046
 day-0.045
Token,
Token ID11
Feature activation+3.01e-03

Pos logit contributions
B+0.016
st+0.015
 sometimes+0.014
L+0.014
Tim+0.014
Neg logit contributions
 fascination-0.043
 grin-0.042
 stret-0.042
ote-0.040
 glances-0.040
Token it
Token ID340
Feature activation+1.05e-02

Pos logit contributions
 fascination+0.007
 grin+0.007
 stret+0.007
 glances+0.007
ote+0.007
Neg logit contributions
B-0.003
Tim-0.003
L-0.003
st-0.003
 sometimes-0.003
Token was
Token ID373
Feature activation-8.90e-03

Pos logit contributions
 stret+0.029
 fascination+0.028
 grin+0.028
ote+0.027
 expression+0.027
Neg logit contributions
B-0.009
st-0.008
 sometimes-0.008
L-0.008
P-0.008
Token a
Token ID257
Feature activation+2.23e-02

Pos logit contributions
B+0.011
P+0.010
One+0.010
st+0.010
Tim+0.010
Neg logit contributions
 grin-0.020
 stret-0.020
 fascination-0.020
ote-0.020
 glances-0.019
Token brand
Token ID4508
Feature activation+1.86e-01

Pos logit contributions
 fascination+0.045
 grin+0.045
ote+0.045
 glances+0.045
 stret+0.044
Neg logit contributions
B-0.033
L-0.031
P-0.030
One-0.030
st-0.030
Token new
Token ID649
Feature activation+4.16e-02

Pos logit contributions
 fascination+0.432
 grin+0.429
 stret+0.425
ote+0.404
 gasp+0.404
Neg logit contributions
B-0.233
st-0.228
L-0.219
P-0.214
 day-0.213
Token ice
Token ID4771
Feature activation+2.99e-02

Pos logit contributions
 grin+0.095
 fascination+0.095
 stret+0.093
ote+0.091
 glances+0.090
Neg logit contributions
B-0.052
st-0.049
P-0.047
One-0.046
L-0.046
Token cream
Token ID8566
Feature activation+9.49e-02

Pos logit contributions
 grin+0.056
 fascination+0.056
 stret+0.054
ote+0.053
 glances+0.052
Neg logit contributions
B-0.050
st-0.047
P-0.047
L-0.046
Tim-0.046
Token machine
Token ID4572
Feature activation+1.72e-02

Pos logit contributions
 fascination+0.227
 grin+0.223
 stret+0.218
 glances+0.212
ote+0.211
Neg logit contributions
B-0.111
st-0.110
P-0.105
Tim-0.102
L-0.102
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
 fascination+0.047
 grin+0.045
 stret+0.044
ote+0.043
 amaz+0.043
Neg logit contributions
st-0.016
B-0.016
 sometimes-0.016
 next-0.014
Tim-0.014
Token she
Token ID673
Feature activation-8.99e-03

Pos logit contributions
 grin+0.002
 fascination+0.002
ote+0.002
 stret+0.002
 glances+0.002
Neg logit contributions
B-0.001
L-0.001
Tim-0.001
 sometimes-0.001
One-0.001
Token heard
Token ID2982
Feature activation+3.08e-02

Pos logit contributions
B+0.013
L+0.012
P+0.012
st+0.012
Tim+0.012
Neg logit contributions
 fascination-0.019
 grin-0.019
 stret-0.019
ote-0.018
 glances-0.018
Token a
Token ID257
Feature activation+1.64e-02

Pos logit contributions
 grin+0.086
 stret+0.085
 fascination+0.085
ote+0.082
oured+0.082
Neg logit contributions
B-0.026
L-0.025
Tim-0.022
One-0.021
 sometimes-0.021
Token loud
Token ID7812
Feature activation+7.05e-02

Pos logit contributions
oured+0.031
 fascination+0.029
ote+0.029
 grin+0.028
 vows+0.028
Neg logit contributions
B-0.030
L-0.028
Tim-0.027
One-0.026
Sam-0.026
Token,
Token ID11
Feature activation-2.20e-02

Pos logit contributions
oured+0.144
 fascination+0.136
 vows+0.133
 fries+0.128
ote+0.125
Neg logit contributions
B-0.116
Tim-0.107
L-0.104
 sometimes-0.103
 day-0.097
Token gru
Token ID22848
Feature activation+1.86e-01

Pos logit contributions
B+0.034
 sometimes+0.033
Tim+0.033
L+0.032
 next+0.032
Neg logit contributions
 vows-0.044
oured-0.044
 grin-0.042
ote-0.042
 fascination-0.042
Tokenff
Token ID487
Feature activation+5.28e-02

Pos logit contributions
 grin+0.217
 stret+0.213
 fascination+0.211
ote+0.199
 glances+0.191
Neg logit contributions
B-0.440
st-0.423
P-0.420
ety-0.418
L-0.417
Token voice
Token ID3809
Feature activation+7.21e-02

Pos logit contributions
 fascination+0.101
oured+0.097
leaf+0.092
 grin+0.091
 menu+0.090
Neg logit contributions
B-0.085
 next-0.079
L-0.078
st-0.078
Tim-0.074
Token calling
Token ID4585
Feature activation+1.14e-01

Pos logit contributions
 fascination+0.193
 vows+0.184
 grin+0.184
 pleasure+0.183
 menu+0.180
Neg logit contributions
B-0.080
 next-0.068
 sometimes-0.065
L-0.065
Tim-0.062
Token her
Token ID607
Feature activation+3.41e-02

Pos logit contributions
 fries+0.279
oured+0.278
ote+0.278
 fascination+0.277
latable+0.277
Neg logit contributions
L-0.125
B-0.123
-0.118
 sometimes-0.114
 next-0.110
Token name
Token ID1438
Feature activation+8.45e-02

Pos logit contributions
 stret+0.065
 fascination+0.065
 grin+0.064
leaf+0.062
latable+0.062
Neg logit contributions
 next-0.052
st-0.050
B-0.049
-0.049
L-0.049
Token was
Token ID373
Feature activation-5.96e-03

Pos logit contributions
B+0.016
st+0.015
 sometimes+0.014
L+0.013
One+0.013
Neg logit contributions
 stret-0.045
 fascination-0.043
 grin-0.043
ote-0.042
 glances-0.041
Token so
Token ID523
Feature activation+2.37e-02

Pos logit contributions
 sometimes+0.006
B+0.006
st+0.006
One+0.006
 next+0.005
Neg logit contributions
 stret-0.015
ote-0.014
 grin-0.014
 fascination-0.014
 glances-0.014
Token proud
Token ID6613
Feature activation+2.94e-02

Pos logit contributions
 stret+0.052
ote+0.048
icol+0.047
 fascination+0.047
 glances+0.046
Neg logit contributions
B-0.034
One-0.032
P-0.032
 sometimes-0.031
L-0.031
Token of
Token ID286
Feature activation+4.73e-03

Pos logit contributions
 grin+0.070
 fascination+0.070
 stret+0.069
ote+0.067
 glances+0.066
Neg logit contributions
B-0.035
st-0.031
P-0.031
L-0.031
One-0.030
Token his
Token ID465
Feature activation+4.42e-02

Pos logit contributions
 grin+0.010
 fascination+0.010
 stret+0.010
ote+0.010
 glances+0.010
Neg logit contributions
B-0.007
P-0.006
st-0.006
L-0.006
One-0.006
Token brand
Token ID4508
Feature activation+1.85e-01

Pos logit contributions
 stret+0.086
 fascination+0.079
 grin+0.078
 widened+0.077
oured+0.077
Neg logit contributions
 day-0.069
 next-0.069
B-0.067
st-0.066
 sometimes-0.060
Token-
Token ID12
Feature activation+5.11e-02

Pos logit contributions
 stret+0.423
 fascination+0.419
 grin+0.416
 gasp+0.391
ote+0.389
Neg logit contributions
B-0.244
st-0.239
L-0.224
P-0.220
Tim-0.215
Tokennew
Token ID3605
Feature activation-6.56e-03

Pos logit contributions
 grin+0.088
 fascination+0.087
 stret+0.087
 glances+0.081
ote+0.080
Neg logit contributions
B-0.098
st-0.091
P-0.091
Tim-0.090
L-0.090
Token suit
Token ID6050
Feature activation+6.67e-02

Pos logit contributions
st+0.008
 day+0.008
B+0.008
 next+0.008
 sometimes+0.007
Neg logit contributions
 stret-0.014
 fascination-0.014
 grin-0.014
ealing-0.013
 glances-0.013
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
 fascination+0.178
 stret+0.174
 grin+0.170
 glances+0.164
ote+0.162
Neg logit contributions
st-0.060
B-0.058
 sometimes-0.057
 next-0.054
L-0.050
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 grin+0.021
 fascination+0.021
ote+0.020
 hi+0.020
 stret+0.020
Neg logit contributions
B-0.017
st-0.017
P-0.017
ety-0.016
L-0.015
Tokenks
Token ID591
Feature activation+7.34e-02

Pos logit contributions
 fascination+0.036
 grin+0.035
 stret+0.035
ote+0.034
icol+0.033
Neg logit contributions
B-0.011
st-0.010
 sometimes-0.009
L-0.009
One-0.009
Token.
Token ID13
Feature activation-1.88e-03

Pos logit contributions
 fascination+0.209
 grin+0.206
 stret+0.204
ote+0.196
icol+0.195
Neg logit contributions
B-0.063
L-0.053
 next-0.052
st-0.051
 sometimes-0.049
Token It
Token ID632
Feature activation+3.35e-02

Pos logit contributions
B+0.002
L+0.002
Sam+0.002
+0.002
Tim+0.002
Neg logit contributions
 stret-0.005
 fascination-0.004
 grin-0.004
 anticipation-0.004
ote-0.004
Token was
Token ID373
Feature activation+1.39e-03

Pos logit contributions
 grin+0.094
 fascination+0.094
 stret+0.094
ote+0.091
 glances+0.090
Neg logit contributions
B-0.027
L-0.023
st-0.022
Tim-0.021
One-0.021
Token a
Token ID257
Feature activation+2.77e-02

Pos logit contributions
 grin+0.004
oured+0.004
eal+0.004
 glances+0.004
 vows+0.004
Neg logit contributions
B-0.001
Tim-0.001
L-0.001
Sam-0.001
One-0.001
Token gru
Token ID22848
Feature activation+1.84e-01

Pos logit contributions
 fascination+0.061
 glances+0.060
oured+0.060
ured+0.059
 anticipation+0.058
Neg logit contributions
B-0.038
L-0.036
Tim-0.035
 day-0.033
Sam-0.033
Tokenff
Token ID487
Feature activation+5.53e-02

Pos logit contributions
 grin+0.207
 fascination+0.204
 stret+0.196
ote+0.187
 glances+0.182
Neg logit contributions
B-0.445
P-0.426
st-0.425
L-0.421
One-0.420
Token voice
Token ID3809
Feature activation+7.96e-02

Pos logit contributions
 fascination+0.117
oured+0.110
leaf+0.105
latable+0.105
 anticipation+0.105
Neg logit contributions
st-0.073
 next-0.073
B-0.073
L-0.072
 day-0.070
Token and
Token ID290
Feature activation-9.87e-03

Pos logit contributions
 fascination+0.214
 vows+0.201
 grin+0.198
 anticipation+0.193
oured+0.192
Neg logit contributions
 next-0.081
 sometimes-0.079
B-0.075
st-0.074
Tim-0.067
Token it
Token ID340
Feature activation+1.75e-02

Pos logit contributions
B+0.010
L+0.009
 sometimes+0.008
Tim+0.008
Sam+0.008
Neg logit contributions
 stret-0.026
 grin-0.026
 fascination-0.026
ote-0.025
latable-0.025
Token said
Token ID531
Feature activation+2.72e-02

Pos logit contributions
 grin+0.045
 fascination+0.045
 stret+0.044
ote+0.043
 glances+0.042
Neg logit contributions
B-0.019
L-0.017
st-0.016
 sometimes-0.016
Tim-0.016
Token the
Token ID262
Feature activation-8.31e-04

Pos logit contributions
B+0.022
L+0.020
st+0.019
Tim+0.019
P+0.019
Neg logit contributions
 grin-0.044
 fascination-0.043
ote-0.043
 stret-0.042
 hi-0.041
Token clothes
Token ID8242
Feature activation+6.51e-02

Pos logit contributions
B+0.001
st+0.001
L+0.001
Tim+0.001
P+0.001
Neg logit contributions
 fascination-0.002
 grin-0.002
ote-0.002
oured-0.002
 hi-0.002
Token and
Token ID290
Feature activation+5.33e-03

Pos logit contributions
 fascination+0.165
 grin+0.164
ote+0.161
 stret+0.158
 hi+0.155
Neg logit contributions
B-0.067
st-0.062
L-0.059
 sometimes-0.059
Tim-0.058
Token they
Token ID484
Feature activation-2.01e-02

Pos logit contributions
 grin+0.012
 fascination+0.012
ote+0.012
 stret+0.011
 hi+0.011
Neg logit contributions
B-0.007
L-0.007
Tim-0.007
st-0.007
 sometimes-0.006
Token looked
Token ID3114
Feature activation+4.90e-02

Pos logit contributions
B+0.026
L+0.024
st+0.023
P+0.023
 sometimes+0.023
Neg logit contributions
ote-0.046
 grin-0.046
 fascination-0.045
 stret-0.044
 hi-0.044
Token brand
Token ID4508
Feature activation+1.84e-01

Pos logit contributions
ote+0.143
 fascination+0.130
leaf+0.130
 grin+0.130
person+0.129
Neg logit contributions
 sometimes-0.041
 next-0.038
 together-0.038
 day-0.037
 soon-0.036
Token new
Token ID649
Feature activation+4.60e-02

Pos logit contributions
 grin+0.399
 fascination+0.396
 stret+0.383
ote+0.381
 gasp+0.362
Neg logit contributions
B-0.276
L-0.255
st-0.253
Tim-0.246
P-0.244
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 grin+0.115
 fascination+0.114
 stret+0.112
ote+0.110
 glances+0.108
Neg logit contributions
B-0.049
st-0.045
P-0.044
L-0.043
One-0.042
Token Lily
Token ID20037
Feature activation-5.01e-02

Pos logit contributions
 grin+0.028
 fascination+0.028
 stret+0.028
ote+0.027
 expression+0.027
Neg logit contributions
B-0.018
P-0.017
st-0.017
One-0.016
Tim-0.015
Token was
Token ID373
Feature activation-4.67e-04

Pos logit contributions
B+0.069
P+0.064
st+0.063
One+0.063
Tim+0.062
Neg logit contributions
 grin-0.109
 fascination-0.108
 stret-0.105
ote-0.102
 glances-0.101
Token very
Token ID845
Feature activation+3.50e-02

Pos logit contributions
B+0.001
P+0.001
st+0.001
L+0.001
One+0.001
Neg logit contributions
 grin-0.001
 fascination-0.001
 stret-0.001
ote-0.001
 glances-0.001
Token holding
Token ID4769
Feature activation+1.89e-02

Pos logit contributions
st+0.023
 next+0.022
 sometimes+0.021
B+0.021
ety+0.020
Neg logit contributions
 fascination-0.063
 grin-0.062
 stret-0.060
icol-0.059
 glances-0.059
Token the
Token ID262
Feature activation+5.42e-03

Pos logit contributions
 stret+0.048
 grin+0.047
ote+0.046
 announcement+0.045
 fascination+0.044
Neg logit contributions
B-0.018
st-0.017
P-0.017
L-0.016
 next-0.016
Token handle
Token ID5412
Feature activation+1.00e-02

Pos logit contributions
 grin+0.011
 fascination+0.011
ote+0.011
 glances+0.010
ured+0.010
Neg logit contributions
st-0.007
 day-0.007
 next-0.007
P-0.007
B-0.007
Token of
Token ID286
Feature activation-8.89e-03

Pos logit contributions
 grin+0.023
 fascination+0.023
 stret+0.023
ote+0.022
 glances+0.022
Neg logit contributions
B-0.012
st-0.011
One-0.011
P-0.011
Tim-0.011
Token a
Token ID257
Feature activation+2.52e-02

Pos logit contributions
B+0.008
Tim+0.007
L+0.007
 sometimes+0.007
One+0.007
Neg logit contributions
 grin-0.023
ote-0.023
 stret-0.022
oured-0.022
atively-0.022
Token brand
Token ID4508
Feature activation+1.83e-01

Pos logit contributions
ote+0.057
 glances+0.054
ishly+0.054
 fascination+0.053
ured+0.053
Neg logit contributions
B-0.030
st-0.029
L-0.028
One-0.027
Tim-0.027
Token new
Token ID649
Feature activation+3.64e-02

Pos logit contributions
 grin+0.411
 fascination+0.406
 stret+0.400
ote+0.386
 gasp+0.372
Neg logit contributions
B-0.247
st-0.246
L-0.241
P-0.229
Tim-0.226
Token bike
Token ID7161
Feature activation-1.69e-02

Pos logit contributions
 grin+0.079
ote+0.079
 stret+0.079
 fascination+0.077
 glances+0.076
Neg logit contributions
st-0.045
B-0.042
 day-0.041
One-0.040
Tim-0.040
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
B+0.017
st+0.016
 sometimes+0.015
Tim+0.015
One+0.015
Neg logit contributions
 fascination-0.044
 grin-0.043
 stret-0.042
ote-0.041
 glances-0.041
Token His
Token ID2399
Feature activation+1.51e-02

Pos logit contributions
 grin+0.030
 fascination+0.029
 stret+0.028
ote+0.027
 hi+0.027
Neg logit contributions
B-0.023
st-0.022
P-0.022
ety-0.021
One-0.020
Token mum
Token ID25682
Feature activation+1.74e-02

Pos logit contributions
 stret+0.034
leaf+0.033
 fascination+0.032
ote+0.032
 grin+0.032
Neg logit contributions
 next-0.017
 day-0.016
B-0.016
Tim-0.015
Mom-0.015
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 fascination+0.043
 grin+0.042
ote+0.041
 stret+0.040
 amaz+0.040
Neg logit contributions
st-0.019
 sometimes-0.018
B-0.018
 next-0.018
P-0.017
Token It
Token ID632
Feature activation+3.32e-02

Pos logit contributions
 grin+0.030
 fascination+0.030
 stret+0.029
ote+0.029
 hi+0.028
Neg logit contributions
B-0.020
st-0.019
P-0.019
One-0.018
ety-0.018
Token was
Token ID373
Feature activation+2.38e-03

Pos logit contributions
 grin+0.095
 fascination+0.094
 stret+0.093
ote+0.093
 glances+0.090
Neg logit contributions
B-0.026
st-0.022
L-0.022
Tim-0.021
P-0.021
Token huge
Token ID3236
Feature activation+3.05e-02

Pos logit contributions
 grin+0.006
ote+0.006
 stret+0.006
 hi+0.005
 fascination+0.005
Neg logit contributions
 sometimes-0.003
B-0.003
One-0.003
 next-0.002
Tim-0.002
Token and
Token ID290
Feature activation-4.21e-04

Pos logit contributions
 fascination+0.084
 grin+0.083
 stret+0.082
ote+0.081
 hi+0.080
Neg logit contributions
 sometimes-0.027
st-0.024
B-0.023
 next-0.023
 day-0.019
Token brand
Token ID4508
Feature activation+1.83e-01

Pos logit contributions
B+0.001
Tim+0.001
L+0.001
P+0.001
st+0.001
Neg logit contributions
 grin-0.001
 fascination-0.001
 stret-0.001
ote-0.001
 glances-0.001
Token new
Token ID649
Feature activation+4.32e-02

Pos logit contributions
 grin+0.421
 fascination+0.417
 stret+0.411
ote+0.400
 gasp+0.382
Neg logit contributions
B-0.244
L-0.233
st-0.231
Tim-0.225
P-0.218
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 grin+0.108
 fascination+0.108
 stret+0.106
ote+0.104
 glances+0.102
Neg logit contributions
B-0.046
st-0.043
P-0.040
L-0.040
Tim-0.039
Token
Token ID198
Feature activation-2.22e-02

Pos logit contributions
 grin+0.024
 fascination+0.024
 stret+0.023
ote+0.023
 hi+0.022
Neg logit contributions
B-0.016
st-0.016
P-0.015
ety-0.015
One-0.014
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
ety+0.021
st+0.020
 sometimes+0.018
P+0.017
B+0.017
Neg logit contributions
 fascination-0.054
 stret-0.054
 grin-0.053
 glances-0.050
 hi-0.050
TokenWhen
Token ID2215
Feature activation+2.30e-02

Pos logit contributions
ety+0.020
st+0.019
 sometimes+0.015
 day+0.014
B+0.014
Neg logit contributions
 stret-0.051
 fascination-0.051
 grin-0.049
leaf-0.047
 hi-0.047
Token the
Token ID262
Feature activation-8.41e-03

Pos logit contributions
B+0.058
st+0.056
L+0.052
 sometimes+0.052
Tim+0.052
Neg logit contributions
 grin-0.089
ote-0.087
 fascination-0.082
 stret-0.082
 hi-0.082
Token freezer
Token ID30967
Feature activation+7.32e-02

Pos logit contributions
B+0.011
st+0.011
P+0.010
 next+0.010
L+0.010
Neg logit contributions
 grin-0.018
 fascination-0.018
ote-0.017
 stret-0.017
 hi-0.017
Token until
Token ID1566
Feature activation-1.65e-02

Pos logit contributions
 grin+0.181
 fascination+0.181
 stret+0.176
ote+0.176
 hi+0.172
Neg logit contributions
B-0.075
st-0.075
L-0.070
Tim-0.069
 sometimes-0.069
Token it
Token ID340
Feature activation+8.03e-03

Pos logit contributions
B+0.015
st+0.013
L+0.013
P+0.013
Tim+0.013
Neg logit contributions
 fascination-0.044
 grin-0.044
 stret-0.043
ote-0.043
 glances-0.042
Token looked
Token ID3114
Feature activation+3.94e-02

Pos logit contributions
 grin+0.022
 fascination+0.022
 stret+0.022
ote+0.021
 glances+0.021
Neg logit contributions
B-0.007
st-0.006
P-0.006
L-0.006
Tim-0.006
Token brand
Token ID4508
Feature activation+1.82e-01

Pos logit contributions
ote+0.105
 stret+0.104
leaf+0.101
ishly+0.100
 grin+0.099
Neg logit contributions
 sometimes-0.038
L-0.036
 next-0.034
 soon-0.034
One-0.034
Token new
Token ID649
Feature activation+4.48e-02

Pos logit contributions
 grin+0.414
 fascination+0.411
 stret+0.406
ote+0.394
 gasp+0.378
Neg logit contributions
B-0.255
L-0.240
st-0.240
Tim-0.226
One-0.223
Token.
Token ID13
Feature activation+8.13e-03

Pos logit contributions
 grin+0.116
 fascination+0.115
 stret+0.115
ote+0.112
 hi+0.110
Neg logit contributions
B-0.042
st-0.042
 sometimes-0.037
ety-0.037
P-0.037
Token
Token ID220
Feature activation-5.59e-03

Pos logit contributions
 grin+0.017
 fascination+0.017
 stret+0.017
ote+0.016
 glances+0.016
Neg logit contributions
B-0.012
P-0.011
st-0.011
L-0.010
One-0.010
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
B+0.003
P+0.002
st+0.002
L+0.002
One+0.002
Neg logit contributions
 grin-0.017
 fascination-0.017
 stret-0.017
ote-0.016
 glances-0.016
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
ety+0.011
st+0.011
P+0.009
B+0.009
 sometimes+0.009
Neg logit contributions
 stret-0.033
 grin-0.032
 fascination-0.032
 hi-0.031
 hello-0.030
Tokenbled
Token ID9342
Feature activation-1.94e-02

Pos logit contributions
 grin+0.068
 fascination+0.067
 stret+0.066
ote+0.063
 glances+0.061
Neg logit contributions
B-0.077
P-0.073
st-0.072
L-0.072
Tim-0.071
Token up
Token ID510
Feature activation+8.28e-02

Pos logit contributions
B+0.027
st+0.027
 sometimes+0.026
L+0.025
P+0.025
Neg logit contributions
 grin-0.043
ote-0.043
 stret-0.042
 fascination-0.041
 expression-0.040
Token all
Token ID477
Feature activation-3.12e-02

Pos logit contributions
 fascination+0.233
 grin+0.233
ote+0.231
 stret+0.230
 expression+0.218
Neg logit contributions
B-0.059
 sometimes-0.057
L-0.054
st-0.054
 next-0.051
Token the
Token ID262
Feature activation+1.98e-02

Pos logit contributions
B+0.043
P+0.039
One+0.039
st+0.038
Tim+0.037
Neg logit contributions
 fascination-0.068
 grin-0.067
 glances-0.064
 stret-0.064
 anticipation-0.063
Token ch
Token ID442
Feature activation+5.87e-02

Pos logit contributions
 grin+0.040
ote+0.039
 fascination+0.038
 expression+0.037
 stret+0.037
Neg logit contributions
B-0.033
st-0.032
One-0.031
P-0.030
L-0.030
Tokenocol
Token ID4668
Feature activation+1.81e-01

Pos logit contributions
 fascination+0.046
 grin+0.046
 stret+0.044
ote+0.041
 hi+0.040
Neg logit contributions
B-0.160
P-0.158
st-0.157
ety-0.154
L-0.154
Tokenates
Token ID689
Feature activation+1.71e-02

Pos logit contributions
 glances+0.225
 fascination+0.223
 grin+0.221
 stret+0.215
 vows+0.210
Neg logit contributions
B-0.386
Tim-0.379
L-0.376
 sometimes-0.369
One-0.367
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
 fascination+0.044
 stret+0.044
 grin+0.044
ote+0.043
 hi+0.042
Neg logit contributions
B-0.016
st-0.016
P-0.014
 sometimes-0.014
L-0.014
Token and
Token ID290
Feature activation-9.52e-03

Pos logit contributions
 sometimes+0.015
B+0.014
But+0.014
L+0.014
st+0.014
Neg logit contributions
 stret-0.032
ote-0.032
 fascination-0.031
 grin-0.031
 glances-0.031
Token she
Token ID673
Feature activation-2.00e-02

Pos logit contributions
B+0.010
 sometimes+0.010
L+0.010
st+0.010
Tim+0.009
Neg logit contributions
 stret-0.023
 grin-0.023
 fascination-0.023
ote-0.023
 glances-0.022
Token was
Token ID373
Feature activation-6.81e-03

Pos logit contributions
B+0.023
st+0.021
P+0.021
L+0.021
Tim+0.021
Neg logit contributions
 grin-0.048
 fascination-0.047
 stret-0.047
ote-0.046
 glances-0.045
Token washed
Token ID18989
Feature activation-1.86e-02

Pos logit contributions
ety+0.006
B+0.006
P+0.006
One+0.006
st+0.006
Neg logit contributions
 fascination-0.007
 grin-0.007
 stret-0.006
 anticipation-0.006
 gasp-0.006
Token until
Token ID1566
Feature activation-1.66e-02

Pos logit contributions
B+0.020
L+0.019
Tim+0.018
st+0.018
 sometimes+0.018
Neg logit contributions
 grin-0.047
 fascination-0.046
ote-0.046
 stret-0.045
 hi-0.044
Token his
Token ID465
Feature activation+2.87e-02

Pos logit contributions
B+0.021
P+0.020
st+0.020
ety+0.019
L+0.019
Neg logit contributions
 grin-0.038
 fascination-0.037
 stret-0.036
ote-0.035
 glances-0.035
Token room
Token ID2119
Feature activation-1.47e-02

Pos logit contributions
 stret+0.062
 fascination+0.061
 grin+0.061
ote+0.059
 glances+0.058
Neg logit contributions
B-0.038
st-0.037
P-0.035
One-0.035
L-0.035
Token felt
Token ID2936
Feature activation-1.81e-02

Pos logit contributions
B+0.009
st+0.009
L+0.008
Tim+0.008
P+0.008
Neg logit contributions
 fascination-0.043
 grin-0.042
 stret-0.042
ote-0.042
 hi-0.041
Token brand
Token ID4508
Feature activation+1.81e-01

Pos logit contributions
B+0.023
 sometimes+0.021
L+0.021
One+0.020
Tim+0.020
Neg logit contributions
ote-0.043
 fascination-0.041
 grin-0.041
 stret-0.041
 glances-0.040
Token new
Token ID649
Feature activation+3.98e-02

Pos logit contributions
 fascination+0.367
 grin+0.362
 stret+0.355
ote+0.348
 hi+0.334
Neg logit contributions
B-0.290
L-0.276
st-0.269
P-0.264
Tim-0.262
Token.
Token ID13
Feature activation+3.35e-03

Pos logit contributions
 grin+0.100
 fascination+0.100
 stret+0.097
ote+0.096
 glances+0.094
Neg logit contributions
B-0.042
st-0.039
P-0.037
L-0.036
One-0.036
Token He
Token ID679
Feature activation+4.15e-02

Pos logit contributions
 grin+0.007
 fascination+0.007
 stret+0.007
ote+0.007
 glances+0.007
Neg logit contributions
B-0.005
One-0.004
st-0.004
Tim-0.004
P-0.004
Token chose
Token ID7690
Feature activation-4.54e-02

Pos logit contributions
 grin+0.086
 fascination+0.085
 stret+0.082
 glances+0.079
ote+0.079
Neg logit contributions
B-0.060
P-0.057
st-0.056
One-0.056
ety-0.055
Token bright
Token ID6016
Feature activation+4.90e-02

Pos logit contributions
B+0.055
st+0.051
P+0.050
Tim+0.049
L+0.049
Neg logit contributions
 grin-0.108
 fascination-0.106
 stret-0.104
ote-0.102
 glances-0.100
Token all
Token ID477
Feature activation-5.74e-02

Pos logit contributions
 sometimes+0.017
One+0.016
B+0.016
Tim+0.015
L+0.015
Neg logit contributions
 grin-0.041
ote-0.040
 stret-0.040
leaf-0.039
 hi-0.039
Token fixed
Token ID5969
Feature activation-8.29e-02

Pos logit contributions
B+0.080
st+0.077
 sometimes+0.073
Tim+0.073
L+0.072
Neg logit contributions
 grin-0.124
 fascination-0.122
 stret-0.122
ote-0.121
 glances-0.117
Token up
Token ID510
Feature activation+6.84e-02

Pos logit contributions
B+0.091
st+0.087
 sometimes+0.080
Tim+0.079
L+0.078
Neg logit contributions
 grin-0.212
 fascination-0.207
 stret-0.203
ote-0.197
 hi-0.196
Token and
Token ID290
Feature activation-1.96e-02

Pos logit contributions
 grin+0.178
 fascination+0.175
 stret+0.172
ote+0.172
 hi+0.168
Neg logit contributions
B-0.068
st-0.065
 sometimes-0.061
Tim-0.059
L-0.059
Token looked
Token ID3114
Feature activation+4.74e-02

Pos logit contributions
 sometimes+0.022
B+0.021
Tim+0.020
L+0.020
 next+0.020
Neg logit contributions
 grin-0.047
 fascination-0.047
 stret-0.047
ote-0.046
 glances-0.044
Token brand
Token ID4508
Feature activation+1.81e-01

Pos logit contributions
ote+0.126
 stret+0.125
 grin+0.121
ishly+0.120
leaf+0.118
Neg logit contributions
 sometimes-0.046
 soon-0.044
 when-0.042
 next-0.042
 there-0.040
Token new
Token ID649
Feature activation+4.19e-02

Pos logit contributions
 grin+0.413
 fascination+0.411
 stret+0.409
ote+0.391
 gasp+0.381
Neg logit contributions
B-0.255
L-0.238
st-0.228
Tim-0.224
 sometimes-0.217
Token!
Token ID0
Feature activation+3.63e-02

Pos logit contributions
 grin+0.113
 fascination+0.112
 stret+0.111
ote+0.108
 hi+0.106
Neg logit contributions
st-0.035
B-0.034
 sometimes-0.033
 next-0.029
One-0.029
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
 grin+0.089
 fascination+0.089
 stret+0.087
ote+0.085
 glances+0.084
Neg logit contributions
B-0.040
P-0.036
st-0.036
L-0.035
One-0.035
Token
Token ID198
Feature activation-1.82e-02

Pos logit contributions
B+0.013
ety+0.012
st+0.012
P+0.011
L+0.010
Neg logit contributions
 grin-0.057
 fascination-0.057
 stret-0.057
 glances-0.054
ote-0.054
Token"
Token ID1
Feature activation-3.32e-02

Pos logit contributions
ety+0.019
B+0.018
st+0.018
P+0.016
Tim+0.016
Neg logit contributions
 fascination-0.044
 stret-0.044
 grin-0.044
 glances-0.042
ote-0.042
Tokenmy
Token ID1820
Feature activation-1.88e-02

Pos logit contributions
B+0.046
P+0.042
st+0.042
One+0.041
L+0.041
Neg logit contributions
 grin-0.081
 fascination-0.079
 stret-0.078
ote-0.076
 glances-0.075
Token!
Token ID0
Feature activation+5.36e-02

Pos logit contributions
st+0.024
B+0.024
L+0.023
P+0.022
Tim+0.022
Neg logit contributions
 fascination-0.043
 stret-0.043
 grin-0.041
ote-0.041
 glances-0.040
Token Your
Token ID3406
Feature activation+1.11e-02

Pos logit contributions
 grin+0.101
 fascination+0.098
 hi+0.097
 hello+0.094
ote+0.094
Neg logit contributions
B-0.087
st-0.083
P-0.083
One-0.081
ety-0.079
Token basketball
Token ID9669
Feature activation-2.24e-02

Pos logit contributions
 fascination+0.025
 grin+0.024
 stret+0.024
ote+0.023
 glances+0.023
Neg logit contributions
B-0.015
st-0.013
P-0.013
L-0.013
Tim-0.013
Token looks
Token ID3073
Feature activation+1.03e-01

Pos logit contributions
B+0.028
st+0.027
P+0.026
L+0.026
Tim+0.025
Neg logit contributions
 fascination-0.051
 grin-0.051
 stret-0.051
ote-0.049
 glances-0.048
Token brand
Token ID4508
Feature activation+1.81e-01

Pos logit contributions
 stret+0.262
 fascination+0.256
ote+0.250
 grin+0.248
 glances+0.243
Neg logit contributions
 sometimes-0.110
B-0.105
One-0.105
L-0.104
Tim-0.097
Token new
Token ID649
Feature activation+4.36e-02

Pos logit contributions
 fascination+0.361
 grin+0.359
 stret+0.354
ote+0.333
 glances+0.328
Neg logit contributions
B-0.294
st-0.275
L-0.274
P-0.272
Tim-0.269
Token!"
Token ID2474
Feature activation+2.71e-02

Pos logit contributions
 grin+0.097
 fascination+0.096
 stret+0.093
ote+0.092
 glances+0.091
Neg logit contributions
B-0.058
P-0.054
L-0.052
One-0.052
st-0.052
Token Tim
Token ID5045
Feature activation-2.58e-02

Pos logit contributions
 grin+0.060
 stret+0.058
 fascination+0.057
 hi+0.056
 expression+0.055
Neg logit contributions
st-0.035
B-0.035
P-0.034
One-0.034
ety-0.032
Tokenmy
Token ID1820
Feature activation-1.14e-02

Pos logit contributions
B+0.023
P+0.021
st+0.019
One+0.019
Tim+0.018
Neg logit contributions
 grin-0.071
 fascination-0.069
 stret-0.065
 expression-0.065
ote-0.065
Token felt
Token ID2936
Feature activation-8.05e-04

Pos logit contributions
B+0.017
st+0.015
P+0.015
L+0.015
Tim+0.015
Neg logit contributions
 fascination-0.024
 grin-0.024
 stret-0.023
ote-0.023
 glances-0.022
Token her
Token ID607
Feature activation+4.17e-02

Pos logit contributions
 fascination+0.057
 grin+0.057
 stret+0.054
ote+0.054
 glances+0.054
Neg logit contributions
B-0.029
P-0.027
st-0.026
ety-0.025
L-0.025
Token mom
Token ID1995
Feature activation+3.82e-03

Pos logit contributions
 fascination+0.108
 stret+0.107
 grin+0.107
ote+0.104
 glances+0.103
Neg logit contributions
B-0.037
 next-0.035
 day-0.034
L-0.033
One-0.032
Token bought
Token ID5839
Feature activation+2.70e-02

Pos logit contributions
 fascination+0.011
 grin+0.011
 stret+0.010
ote+0.010
 glances+0.010
Neg logit contributions
B-0.003
ety-0.003
st-0.003
L-0.003
Tim-0.003
Token her
Token ID607
Feature activation+1.60e-02

Pos logit contributions
ote+0.081
 grin+0.081
 stret+0.080
icol+0.078
 fascination+0.076
Neg logit contributions
 day-0.016
st-0.015
 sometimes-0.015
 next-0.015
B-0.014
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
 grin+0.047
ote+0.046
 fascination+0.045
 stret+0.044
atively+0.044
Neg logit contributions
st-0.012
 day-0.010
 next-0.010
 sometimes-0.009
B-0.009
Token brand
Token ID4508
Feature activation+1.80e-01

Pos logit contributions
ured+0.038
ote+0.037
 glances+0.036
ows+0.036
atively+0.036
Neg logit contributions
 day-0.023
st-0.022
One-0.021
B-0.021
P-0.020
Token new
Token ID649
Feature activation+3.19e-02

Pos logit contributions
 fascination+0.423
 grin+0.420
 stret+0.412
ote+0.396
 gasp+0.396
Neg logit contributions
st-0.228
B-0.223
L-0.215
P-0.213
 day-0.202
Token coloring
Token ID33988
Feature activation+2.87e-03

Pos logit contributions
ote+0.074
 grin+0.074
 stret+0.074
ealing+0.074
ured+0.073
Neg logit contributions
st-0.035
 day-0.035
B-0.033
One-0.031
P-0.031
Token book
Token ID1492
Feature activation+2.59e-03

Pos logit contributions
 fascination+0.006
 grin+0.006
 stret+0.006
ote+0.006
 glances+0.006
Neg logit contributions
B-0.004
st-0.004
P-0.004
L-0.004
One-0.004
Token.
Token ID13
Feature activation+6.39e-03

Pos logit contributions
 fascination+0.007
 grin+0.007
 stret+0.007
ote+0.006
 glances+0.006
Neg logit contributions
B-0.003
st-0.002
One-0.002
L-0.002
P-0.002
Token She
Token ID1375
Feature activation+4.33e-02

Pos logit contributions
 grin+0.014
 fascination+0.014
 stret+0.014
ote+0.013
 glances+0.013
Neg logit contributions
B-0.009
P-0.008
st-0.008
ety-0.008
L-0.008
Token They
Token ID1119
Feature activation+2.22e-02

Pos logit contributions
 grin+0.003
 fascination+0.003
 stret+0.003
 hello+0.003
 glances+0.003
Neg logit contributions
st-0.002
B-0.002
P-0.002
ety-0.002
Tim-0.002
Token took
Token ID1718
Feature activation-1.74e-03

Pos logit contributions
 grin+0.054
 fascination+0.054
 stret+0.053
ote+0.052
 glances+0.051
Neg logit contributions
B-0.025
P-0.023
L-0.023
st-0.023
One-0.022
Token turns
Token ID4962
Feature activation-1.94e-02

Pos logit contributions
B+0.003
L+0.002
 sometimes+0.002
st+0.002
P+0.002
Neg logit contributions
ote-0.004
 grin-0.004
 stret-0.004
 fascination-0.004
 expression-0.003
Token making
Token ID1642
Feature activation+5.71e-03

Pos logit contributions
B+0.022
st+0.021
L+0.020
 sometimes+0.020
Tim+0.019
Neg logit contributions
 fascination-0.047
 grin-0.046
 stret-0.045
ote-0.045
 glances-0.044
Token z
Token ID1976
Feature activation-4.82e-02

Pos logit contributions
 grin+0.014
ote+0.013
 stret+0.013
 fascination+0.013
oured+0.013
Neg logit contributions
B-0.006
 sometimes-0.006
L-0.006
st-0.006
Tim-0.005
Tokenig
Token ID328
Feature activation-1.97e-01

Pos logit contributions
B+0.076
P+0.074
st+0.074
L+0.070
One+0.069
Neg logit contributions
 fascination-0.097
 grin-0.096
 stret-0.093
ote-0.090
 expression-0.089
Tokenz
Token ID89
Feature activation-2.27e-02

Pos logit contributions
B+0.250
P+0.235
Tim+0.231
One+0.231
L+0.225
Neg logit contributions
 grin-0.440
 fascination-0.429
 stret-0.429
ote-0.420
 glances-0.415
Tokenags
Token ID3775
Feature activation+6.07e-02

Pos logit contributions
B+0.049
P+0.047
st+0.047
L+0.046
Tim+0.045
Neg logit contributions
 fascination-0.032
 grin-0.032
 stret-0.031
ote-0.030
 glances-0.028
Token with
Token ID351
Feature activation+3.45e-02

Pos logit contributions
 grin+0.135
 fascination+0.134
 stret+0.130
 glances+0.129
ote+0.128
Neg logit contributions
B-0.078
P-0.076
One-0.071
L-0.070
Tim-0.070
Token their
Token ID511
Feature activation+3.50e-02

Pos logit contributions
 grin+0.087
ote+0.086
 stret+0.086
oured+0.083
 fascination+0.082
Neg logit contributions
B-0.035
Tim-0.032
L-0.031
 sometimes-0.030
Sam-0.028
Token feet
Token ID3625
Feature activation-9.95e-03

Pos logit contributions
 grin+0.071
 stret+0.071
 fascination+0.071
ote+0.069
 glances+0.067
Neg logit contributions
B-0.051
st-0.047
P-0.047
L-0.047
Tim-0.046
Token They
Token ID1119
Feature activation+1.78e-02

Pos logit contributions
B+0.009
st+0.008
P+0.008
ety+0.008
Tim+0.007
Neg logit contributions
 fascination-0.014
 grin-0.014
 glances-0.013
 stret-0.013
 hello-0.013
Token check
Token ID2198
Feature activation+6.81e-02

Pos logit contributions
ote+0.042
ement+0.040
 gru+0.039
 expression+0.039
 fascination+0.038
Neg logit contributions
 sometimes-0.024
L-0.022
 also-0.022
One-0.022
 never-0.021
Token for
Token ID329
Feature activation+4.79e-02

Pos logit contributions
 stret+0.144
 fascination+0.144
 glances+0.142
 grin+0.141
 hi+0.139
Neg logit contributions
B-0.090
One-0.088
P-0.086
ety-0.086
st-0.083
Token more
Token ID517
Feature activation-7.54e-02

Pos logit contributions
 grin+0.119
 fascination+0.116
 stret+0.115
ote+0.115
 glances+0.111
Neg logit contributions
B-0.054
L-0.048
Tim-0.048
st-0.048
P-0.047
Token z
Token ID1976
Feature activation-4.09e-02

Pos logit contributions
B+0.114
P+0.111
ety+0.109
One+0.108
L+0.105
Neg logit contributions
 fascination-0.148
 stret-0.148
 grin-0.147
 glances-0.141
ote-0.138
Tokenig
Token ID328
Feature activation-1.95e-01

Pos logit contributions
B+0.046
One+0.042
Tim+0.042
ety+0.041
L+0.040
Neg logit contributions
 grin-0.097
 stret-0.096
 fascination-0.096
 glances-0.093
ote-0.093
Tokenz
Token ID89
Feature activation-1.62e-02

Pos logit contributions
B+0.293
One+0.287
P+0.285
Tim+0.278
 sometimes+0.270
Neg logit contributions
 stret-0.372
ows-0.364
ote-0.348
oured-0.344
 grin-0.343
Tokenags
Token ID3775
Feature activation+6.46e-02

Pos logit contributions
B+0.038
st+0.036
One+0.036
L+0.036
Tim+0.035
Neg logit contributions
 grin-0.020
 fascination-0.019
 stret-0.019
 glances-0.018
ote-0.018
Token.
Token ID13
Feature activation-3.96e-03

Pos logit contributions
 fascination+0.130
 grin+0.129
 glances+0.127
 stret+0.127
ote+0.123
Neg logit contributions
B-0.093
P-0.091
One-0.085
L-0.084
Tim-0.083
Token They
Token ID1119
Feature activation+2.03e-02

Pos logit contributions
B+0.005
st+0.005
P+0.005
ety+0.005
Tim+0.005
Neg logit contributions
 grin-0.009
 fascination-0.009
 glances-0.008
 stret-0.008
 hello-0.008
Token find
Token ID1064
Feature activation-1.46e-02

Pos logit contributions
ote+0.049
ement+0.047
 gru+0.046
ured+0.045
 expression+0.044
Neg logit contributions
 sometimes-0.027
 also-0.025
 never-0.023
One-0.023
I-0.022
Token wanted
Token ID2227
Feature activation-8.67e-03

Pos logit contributions
 grin+0.011
 fascination+0.011
 stret+0.011
ote+0.010
 glances+0.010
Neg logit contributions
B-0.006
P-0.005
st-0.005
One-0.005
L-0.005
Token to
Token ID284
Feature activation+1.73e-02

Pos logit contributions
B+0.009
P+0.008
st+0.008
L+0.008
One+0.008
Neg logit contributions
 grin-0.022
 fascination-0.022
 stret-0.021
ote-0.021
 glances-0.021
Token make
Token ID787
Feature activation-4.50e-02

Pos logit contributions
 grin+0.032
 fascination+0.032
 stret+0.031
ote+0.030
 glances+0.030
Neg logit contributions
B-0.030
st-0.028
P-0.028
One-0.027
Tim-0.027
Token his
Token ID465
Feature activation-1.12e-02

Pos logit contributions
B+0.043
st+0.040
 sometimes+0.037
L+0.036
Tim+0.034
Neg logit contributions
 grin-0.117
 stret-0.113
 fascination-0.113
ote-0.112
atively-0.110
Token tw
Token ID665
Feature activation-9.75e-02

Pos logit contributions
B+0.019
P+0.017
L+0.017
Tim+0.017
One+0.016
Neg logit contributions
 grin-0.023
 fascination-0.022
 amaz-0.021
 expression-0.021
 anticipation-0.020
Tokenig
Token ID328
Feature activation-1.92e-01

Pos logit contributions
B+0.160
Tim+0.151
L+0.146
Sam+0.143
ety+0.143
Neg logit contributions
 grin-0.194
 fascination-0.187
 expression-0.174
ote-0.172
 stret-0.171
Token special
Token ID2041
Feature activation+3.85e-03

Pos logit contributions
B+0.345
st+0.326
P+0.324
L+0.319
One+0.319
Neg logit contributions
 grin-0.333
 fascination-0.332
 stret-0.323
ote-0.314
 glances-0.308
Token,
Token ID11
Feature activation-3.13e-02

Pos logit contributions
 stret+0.010
ote+0.010
 fascination+0.010
 grin+0.010
leaf+0.010
Neg logit contributions
st-0.004
 sometimes-0.003
B-0.003
 next-0.003
 day-0.003
Token so
Token ID523
Feature activation+1.94e-02

Pos logit contributions
B+0.042
st+0.038
P+0.038
L+0.037
One+0.037
Neg logit contributions
 grin-0.070
 fascination-0.069
 stret-0.068
ote-0.066
 glances-0.065
Token he
Token ID339
Feature activation-7.91e-03

Pos logit contributions
 grin+0.035
 fascination+0.034
 hi+0.034
 hello+0.034
 anticipation+0.033
Neg logit contributions
B-0.032
st-0.031
ety-0.031
P-0.030
One-0.029
Token spent
Token ID3377
Feature activation-7.41e-02

Pos logit contributions
B+0.011
One+0.010
st+0.010
Tim+0.010
P+0.010
Neg logit contributions
 grin-0.017
 fascination-0.017
 stret-0.016
ote-0.016
 glances-0.016
Token.
Token ID13
Feature activation-4.51e-03

Pos logit contributions
 grin+0.078
 fascination+0.077
 stret+0.076
ote+0.075
 glances+0.074
Neg logit contributions
B-0.050
P-0.048
ety-0.046
One-0.046
L-0.046
Token He
Token ID679
Feature activation+3.83e-02

Pos logit contributions
B+0.006
Tim+0.005
L+0.005
One+0.005
st+0.005
Neg logit contributions
 grin-0.010
 fascination-0.010
 stret-0.010
 glances-0.010
ote-0.010
Token had
Token ID550
Feature activation-1.25e-01

Pos logit contributions
 grin+0.085
 fascination+0.085
 stret+0.083
ote+0.081
 glances+0.080
Neg logit contributions
B-0.050
P-0.046
st-0.046
One-0.045
L-0.045
Token to
Token ID284
Feature activation+2.68e-02

Pos logit contributions
B+0.150
P+0.137
st+0.133
L+0.132
ety+0.132
Neg logit contributions
 fascination-0.292
 grin-0.291
 stret-0.283
ote-0.279
 glances-0.276
Token z
Token ID1976
Feature activation-5.73e-02

Pos logit contributions
 grin+0.051
 fascination+0.051
ote+0.050
oured+0.049
 glances+0.048
Neg logit contributions
B-0.042
Tim-0.042
L-0.041
P-0.041
One-0.041
Tokenig
Token ID328
Feature activation-1.91e-01

Pos logit contributions
B+0.080
P+0.080
st+0.077
L+0.075
One+0.072
Neg logit contributions
 fascination-0.122
 stret-0.121
 grin-0.119
ote-0.116
 anticipation-0.112
Tokenzag
Token ID50183
Feature activation-1.75e-01

Pos logit contributions
B+0.322
st+0.303
P+0.301
L+0.297
Tim+0.295
Neg logit contributions
 grin-0.358
 fascination-0.356
 stret-0.345
ote-0.335
 glances-0.331
Token around
Token ID1088
Feature activation+4.83e-02

Pos logit contributions
B+0.234
st+0.219
One+0.214
P+0.213
L+0.213
Neg logit contributions
 grin-0.385
 fascination-0.382
 stret-0.377
ote-0.369
 glances-0.359
Token and
Token ID290
Feature activation-2.09e-03

Pos logit contributions
 grin+0.126
 stret+0.125
 fascination+0.123
ote+0.123
 glances+0.120
Neg logit contributions
B-0.044
 sometimes-0.042
st-0.042
Tim-0.039
L-0.038
Token make
Token ID787
Feature activation-3.21e-02

Pos logit contributions
 sometimes+0.003
B+0.003
st+0.003
Tim+0.002
L+0.002
Neg logit contributions
 stret-0.005
 fascination-0.005
 grin-0.005
ote-0.004
 glances-0.004
Token big
Token ID1263
Feature activation+2.66e-02

Pos logit contributions
B+0.039
L+0.036
Tim+0.035
st+0.035
 sometimes+0.035
Neg logit contributions
 grin-0.075
 fascination-0.072
 stret-0.072
ote-0.071
oured-0.069
Token around
Token ID1088
Feature activation+4.00e-02

Pos logit contributions
 grin+0.091
 fascination+0.090
 stret+0.089
ote+0.087
 fries+0.085
Neg logit contributions
B-0.048
 sometimes-0.043
st-0.042
Tim-0.041
L-0.041
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
 grin+0.109
 stret+0.107
ote+0.106
 fascination+0.105
 vows+0.104
Neg logit contributions
B-0.036
st-0.033
 sometimes-0.032
Tim-0.029
 next-0.029
Token around
Token ID1088
Feature activation+5.13e-02

Pos logit contributions
B+0.022
st+0.020
P+0.020
L+0.020
One+0.020
Neg logit contributions
 grin-0.028
 fascination-0.028
 stret-0.027
ote-0.026
 glances-0.026
Token in
Token ID287
Feature activation-1.47e-02

Pos logit contributions
 stret+0.133
 grin+0.132
ote+0.131
 fascination+0.129
 vows+0.127
Neg logit contributions
B-0.052
 sometimes-0.049
st-0.047
 next-0.043
But-0.043
Token z
Token ID1976
Feature activation-4.81e-02

Pos logit contributions
B+0.016
st+0.015
One+0.014
Tim+0.014
L+0.014
Neg logit contributions
 grin-0.036
 stret-0.036
 fascination-0.036
ote-0.036
 glances-0.034
Tokenig
Token ID328
Feature activation-1.90e-01

Pos logit contributions
B+0.050
P+0.048
st+0.047
L+0.044
One+0.044
Neg logit contributions
 fascination-0.121
 grin-0.120
 stret-0.118
ote-0.116
 expression-0.113
Tokenz
Token ID89
Feature activation-1.56e-02

Pos logit contributions
B+0.291
P+0.272
One+0.268
Tim+0.267
L+0.264
Neg logit contributions
 grin-0.376
 stret-0.370
ote-0.362
 fascination-0.358
 glances-0.349
Tokenags
Token ID3775
Feature activation+7.75e-02

Pos logit contributions
B+0.034
P+0.033
st+0.033
L+0.032
One+0.032
Neg logit contributions
 grin-0.021
 fascination-0.021
 stret-0.020
ote-0.020
 glances-0.019
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
 grin+0.178
 fascination+0.177
 stret+0.172
ote+0.169
 glances+0.167
Neg logit contributions
B-0.098
P-0.091
st-0.089
L-0.088
One-0.087
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
 grin+0.015
 fascination+0.015
 stret+0.014
 hi+0.014
ote+0.014
Neg logit contributions
P-0.008
B-0.008
st-0.007
ety-0.007
L-0.007
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
ety+0.023
P+0.021
st+0.020
 sometimes+0.017
B+0.017
Neg logit contributions
 stret-0.039
 fascination-0.039
 hi-0.038
 hello-0.038
 grin-0.038
Token from
Token ID422
Feature activation+4.06e-02

Pos logit contributions
B+0.046
st+0.043
P+0.042
L+0.042
One+0.042
Neg logit contributions
 grin-0.073
 fascination-0.073
 stret-0.071
ote-0.070
 glances-0.069
Token straight
Token ID3892
Feature activation-5.99e-03

Pos logit contributions
 grin+0.095
 fascination+0.094
 stret+0.093
ote+0.091
 glances+0.089
Neg logit contributions
B-0.049
st-0.045
P-0.044
One-0.044
L-0.044
Token lines
Token ID3951
Feature activation-1.76e-02

Pos logit contributions
B+0.008
st+0.008
P+0.008
L+0.008
One+0.008
Neg logit contributions
 grin-0.013
 fascination-0.013
 stret-0.013
ote-0.012
 hi-0.012
Token to
Token ID284
Feature activation+4.64e-02

Pos logit contributions
B+0.018
st+0.017
 sometimes+0.017
One+0.016
L+0.016
Neg logit contributions
 fascination-0.045
 stret-0.044
 grin-0.044
ote-0.043
 hi-0.042
Token z
Token ID1976
Feature activation-5.76e-02

Pos logit contributions
 grin+0.099
 fascination+0.097
ote+0.097
 stret+0.097
 glances+0.092
Neg logit contributions
B-0.065
One-0.063
L-0.062
Tim-0.062
st-0.061
Tokenig
Token ID328
Feature activation-1.88e-01

Pos logit contributions
B+0.080
P+0.079
st+0.078
L+0.073
One+0.073
Neg logit contributions
 fascination-0.125
 grin-0.124
 stret-0.122
ote-0.118
 expression-0.116
Tokenz
Token ID89
Feature activation-1.69e-02

Pos logit contributions
B+0.288
P+0.268
st+0.264
L+0.263
One+0.262
Neg logit contributions
 grin-0.380
 fascination-0.376
 stret-0.370
ote-0.361
 glances-0.355
Tokenags
Token ID3775
Feature activation+7.23e-02

Pos logit contributions
B+0.034
P+0.034
st+0.033
L+0.032
One+0.032
Neg logit contributions
 fascination-0.026
 grin-0.026
 stret-0.025
ote-0.024
 glances-0.023
Token.
Token ID13
Feature activation-2.88e-03

Pos logit contributions
 grin+0.164
 fascination+0.163
 stret+0.157
ote+0.155
 glances+0.154
Neg logit contributions
B-0.094
P-0.088
L-0.084
One-0.082
st-0.082
Token
Token ID198
Feature activation-2.20e-02

Pos logit contributions
P+0.003
B+0.003
st+0.003
ety+0.003
L+0.003
Neg logit contributions
 fascination-0.007
 grin-0.007
 hi-0.006
 stret-0.006
 menu-0.006
Token
Token ID198
Feature activation-2.05e-02

Pos logit contributions
ety+0.027
P+0.024
st+0.023
B+0.023
 sometimes+0.021
Neg logit contributions
 fascination-0.047
 grin-0.046
 stret-0.045
 hello-0.043
 hi-0.043
Token of
Token ID286
Feature activation-3.49e-02

Pos logit contributions
 fascination+0.016
 grin+0.016
 stret+0.016
ote+0.015
 expression+0.015
Neg logit contributions
B-0.004
st-0.004
L-0.004
Tim-0.003
One-0.003
Token fun
Token ID1257
Feature activation+3.16e-02

Pos logit contributions
B+0.037
One+0.033
 day+0.033
Tim+0.032
L+0.031
Neg logit contributions
ote-0.090
 grin-0.088
oured-0.086
atively-0.086
ishly-0.086
Token with
Token ID351
Feature activation+2.75e-02

Pos logit contributions
 grin+0.071
 fascination+0.070
 stret+0.069
ote+0.067
 glances+0.067
Neg logit contributions
B-0.041
P-0.038
ety-0.038
L-0.037
One-0.037
Token their
Token ID511
Feature activation+3.14e-02

Pos logit contributions
 grin+0.073
ote+0.071
 stret+0.070
 fascination+0.070
icol+0.068
Neg logit contributions
B-0.027
 sometimes-0.025
Tim-0.023
st-0.022
One-0.022
Token z
Token ID1976
Feature activation-4.57e-02

Pos logit contributions
 fascination+0.069
 grin+0.068
 stret+0.065
 amaz+0.063
ote+0.063
Neg logit contributions
B-0.042
P-0.042
st-0.039
L-0.039
One-0.038
Tokenig
Token ID328
Feature activation-1.88e-01

Pos logit contributions
B+0.050
Tim+0.046
L+0.045
ety+0.045
One+0.045
Neg logit contributions
 grin-0.111
 fascination-0.109
 stret-0.107
ote-0.106
 glances-0.105
Tokenz
Token ID89
Feature activation-1.52e-02

Pos logit contributions
B+0.234
P+0.224
L+0.212
Tim+0.208
One+0.206
Neg logit contributions
 grin-0.427
 stret-0.417
ote-0.411
 fascination-0.408
 hi-0.394
Tokenags
Token ID3775
Feature activation+7.17e-02

Pos logit contributions
B+0.026
P+0.025
st+0.025
L+0.024
Tim+0.024
Neg logit contributions
 fascination-0.028
 grin-0.028
 stret-0.027
ote-0.026
 glances-0.026
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
 grin+0.169
 fascination+0.167
 stret+0.164
ote+0.161
 glances+0.159
Neg logit contributions
B-0.087
P-0.079
st-0.078
L-0.077
One-0.076
Token
Token ID198
Feature activation-2.35e-02

Pos logit contributions
 fascination+0.008
 grin+0.008
 hi+0.008
 stret+0.008
ote+0.008
Neg logit contributions
B-0.003
P-0.003
st-0.003
ety-0.003
L-0.003
Token
Token ID198
Feature activation-2.11e-02

Pos logit contributions
st+0.029
ety+0.028
P+0.028
B+0.026
 sometimes+0.023
Neg logit contributions
 fascination-0.052
 stret-0.052
 grin-0.051
 glances-0.048
 hi-0.048
Token.
Token ID13
Feature activation-6.99e-03

Pos logit contributions
B+0.030
 sometimes+0.028
st+0.027
L+0.027
One+0.027
Neg logit contributions
 fascination-0.100
 grin-0.100
 stret-0.098
ote-0.096
icol-0.093
Token They
Token ID1119
Feature activation+2.18e-02

Pos logit contributions
B+0.009
P+0.008
st+0.008
One+0.008
Tim+0.008
Neg logit contributions
 grin-0.016
 fascination-0.016
 stret-0.016
ote-0.015
 glances-0.015
Token make
Token ID787
Feature activation-2.56e-02

Pos logit contributions
ote+0.045
 stret+0.044
icol+0.043
 grin+0.042
 gru+0.041
Neg logit contributions
 sometimes-0.034
One-0.032
L-0.028
 also-0.028
st-0.028
Token many
Token ID867
Feature activation-2.88e-02

Pos logit contributions
B+0.029
L+0.028
st+0.028
 sometimes+0.027
One+0.027
Neg logit contributions
 grin-0.062
 fascination-0.059
ote-0.058
 stret-0.058
 hi-0.056
Token z
Token ID1976
Feature activation-4.44e-02

Pos logit contributions
B+0.039
st+0.036
L+0.036
P+0.035
One+0.035
Neg logit contributions
 grin-0.063
 stret-0.063
 fascination-0.063
ote-0.062
 hi-0.059
Tokenig
Token ID328
Feature activation-1.86e-01

Pos logit contributions
B+0.060
st+0.054
P+0.054
Tim+0.053
One+0.053
Neg logit contributions
 grin-0.099
 fascination-0.098
 stret-0.096
ote-0.094
 glances-0.093
Tokenz
Token ID89
Feature activation-1.26e-02

Pos logit contributions
B+0.225
P+0.206
One+0.198
st+0.198
Tim+0.198
Neg logit contributions
 grin-0.439
 fascination-0.433
 stret-0.426
ote-0.418
 glances-0.413
Tokenags
Token ID3775
Feature activation+6.85e-02

Pos logit contributions
B+0.029
st+0.028
P+0.027
One+0.027
L+0.027
Neg logit contributions
 grin-0.016
 fascination-0.016
 stret-0.015
ote-0.014
 glances-0.014
Token and
Token ID290
Feature activation-3.51e-03

Pos logit contributions
 grin+0.153
 fascination+0.151
 stret+0.146
ote+0.144
 glances+0.144
Neg logit contributions
B-0.094
P-0.088
L-0.082
Tim-0.082
One-0.081
Token circles
Token ID13332
Feature activation+1.05e-03

Pos logit contributions
 sometimes+0.005
B+0.005
One+0.005
L+0.005
Tim+0.004
Neg logit contributions
ote-0.007
 stret-0.007
 fascination-0.007
 grin-0.007
oured-0.007
Token and
Token ID290
Feature activation+4.21e-04

Pos logit contributions
 fascination+0.003
 grin+0.003
ote+0.003
 stret+0.003
leaf+0.003
Neg logit contributions
 sometimes-0.001
B-0.001
L-0.001
st-0.001
 next-0.001
Token small
Token ID1402
Feature activation+1.87e-02

Pos logit contributions
 grin+0.000
 hello+0.000
 fascination+0.000
 hi+0.000
 amaz+0.000
Neg logit contributions
B-0.000
st-0.000
P-0.000
L-0.000
ety-0.000
Token bird
Token ID6512
Feature activation-4.47e-02

Pos logit contributions
 hello+0.034
 grin+0.034
 fascination+0.034
 glances+0.033
 expression+0.033
Neg logit contributions
B-0.032
P-0.030
L-0.028
Tim-0.027
st-0.027
Token needed
Token ID2622
Feature activation-5.82e-02

Pos logit contributions
B+0.082
Tim+0.078
P+0.077
One+0.077
L+0.077
Neg logit contributions
 grin-0.065
 hello-0.065
 hi-0.062
 glances-0.062
 gasp-0.061
Token a
Token ID257
Feature activation-5.08e-03

Pos logit contributions
B+0.087
P+0.082
ety+0.082
L+0.081
st+0.080
Neg logit contributions
 fascination-0.115
 grin-0.114
 stret-0.111
 hi-0.110
 hello-0.110
Token tw
Token ID665
Feature activation-8.78e-02

Pos logit contributions
B+0.008
st+0.008
One+0.008
P+0.007
ety+0.007
Neg logit contributions
 grin-0.010
 fascination-0.010
ote-0.010
 stret-0.009
 glances-0.009
Tokenig
Token ID328
Feature activation-1.86e-01

Pos logit contributions
B+0.108
Tim+0.101
L+0.096
Sam+0.092
ety+0.091
Neg logit contributions
 grin-0.208
 fascination-0.203
 stret-0.194
 glances-0.194
 expression-0.194
Token for
Token ID329
Feature activation+3.09e-02

Pos logit contributions
B+0.247
P+0.232
L+0.229
Tim+0.228
One+0.216
Neg logit contributions
 grin-0.407
 stret-0.402
 fascination-0.394
ote-0.388
 hi-0.386
Token her
Token ID607
Feature activation-1.77e-02

Pos logit contributions
 fascination+0.058
 hello+0.057
 hi+0.057
 grin+0.057
 stret+0.055
Neg logit contributions
ety-0.049
B-0.048
L-0.048
P-0.047
st-0.046
Token nest
Token ID16343
Feature activation-1.01e-01

Pos logit contributions
B+0.026
P+0.025
L+0.024
Tim+0.024
Sam+0.023
Neg logit contributions
 grin-0.037
 fascination-0.036
 stret-0.035
ote-0.035
 hello-0.035
Token.
Token ID13
Feature activation-1.34e-03

Pos logit contributions
B+0.134
P+0.123
L+0.120
Tim+0.118
One+0.118
Neg logit contributions
 grin-0.224
 stret-0.219
 fascination-0.218
 glances-0.211
ote-0.211
Token She
Token ID1375
Feature activation+4.05e-02

Pos logit contributions
B+0.001
L+0.001
Tim+0.001
One+0.001
st+0.001
Neg logit contributions
 stret-0.003
 grin-0.003
 fascination-0.003
ote-0.003
 glances-0.003
Token were
Token ID547
Feature activation+1.13e-02

Pos logit contributions
 stret+0.207
 fascination+0.207
ote+0.202
 grin+0.197
icol+0.194
Neg logit contributions
 sometimes-0.084
B-0.075
st-0.074
 day-0.067
 next-0.062
Token now
Token ID783
Feature activation-4.81e-02

Pos logit contributions
 stret+0.027
 grin+0.027
 fascination+0.027
ote+0.026
icol+0.026
Neg logit contributions
 sometimes-0.013
B-0.012
st-0.011
 next-0.011
One-0.011
Token perfect
Token ID2818
Feature activation-1.56e-02

Pos logit contributions
 sometimes+0.055
B+0.052
st+0.051
 next+0.049
 day+0.049
Neg logit contributions
 fascination-0.115
 stret-0.113
 grin-0.112
ote-0.111
 hi-0.108
Token for
Token ID329
Feature activation+4.92e-02

Pos logit contributions
st+0.017
B+0.016
 sometimes+0.016
 next+0.014
ety+0.014
Neg logit contributions
 fascination-0.039
 stret-0.039
 grin-0.038
ote-0.037
 hi-0.036
Token Ab
Token ID2275
Feature activation-4.53e-02

Pos logit contributions
 grin+0.106
 fascination+0.104
 stret+0.103
ote+0.101
 glances+0.098
Neg logit contributions
B-0.069
st-0.065
Tim-0.064
L-0.063
P-0.063
Tokenig
Token ID328
Feature activation-1.85e-01

Pos logit contributions
B+0.062
One+0.061
Tim+0.059
 sometimes+0.059
But+0.058
Neg logit contributions
oured-0.093
 grin-0.091
 fascination-0.088
ote-0.087
 stret-0.085
Tokenail
Token ID603
Feature activation-1.21e-01

Pos logit contributions
B+0.235
st+0.228
P+0.211
ety+0.209
L+0.208
Neg logit contributions
 fascination-0.426
 grin-0.414
 stret-0.404
 glances-0.393
ote-0.392
Token and
Token ID290
Feature activation-9.93e-03

Pos logit contributions
B+0.120
st+0.118
 sometimes+0.109
ety+0.104
L+0.104
Neg logit contributions
 fascination-0.308
 grin-0.299
 stret-0.299
 glances-0.289
ote-0.288
Token her
Token ID607
Feature activation+1.50e-02

Pos logit contributions
B+0.010
 sometimes+0.009
st+0.009
L+0.009
P+0.009
Neg logit contributions
 stret-0.025
 grin-0.025
 fascination-0.024
ote-0.024
 glances-0.023
Token family
Token ID1641
Feature activation-1.29e-02

Pos logit contributions
 stret+0.038
ote+0.038
oured+0.037
icol+0.037
 fascination+0.036
Neg logit contributions
 day-0.014
 next-0.013
 sometimes-0.012
B-0.012
 friends-0.011
Token.
Token ID13
Feature activation+1.53e-03

Pos logit contributions
B+0.012
st+0.012
 sometimes+0.012
 day+0.011
L+0.010
Neg logit contributions
 stret-0.033
 fascination-0.032
ote-0.032
 grin-0.031
 glances-0.031
Token friends
Token ID2460
Feature activation-2.34e-02

Pos logit contributions
B+0.069
P+0.059
st+0.058
L+0.056
Tim+0.056
Neg logit contributions
 grin-0.269
 fascination-0.269
 stret-0.266
ote-0.260
 glances-0.255
Token played
Token ID2826
Feature activation-3.00e-02

Pos logit contributions
B+0.025
 sometimes+0.024
st+0.024
L+0.023
P+0.023
Neg logit contributions
 stret-0.058
 fascination-0.058
 grin-0.057
ote-0.056
icol-0.054
Token with
Token ID351
Feature activation+2.18e-02

Pos logit contributions
B+0.028
st+0.025
L+0.024
Tim+0.024
 sometimes+0.024
Neg logit contributions
 grin-0.081
 fascination-0.079
ote-0.078
 expression-0.077
 stret-0.076
Token their
Token ID511
Feature activation+2.98e-02

Pos logit contributions
 grin+0.065
ote+0.065
icol+0.062
 expression+0.062
 stret+0.062
Neg logit contributions
B-0.015
 sometimes-0.014
Tim-0.013
 day-0.011
st-0.011
Token z
Token ID1976
Feature activation-4.10e-02

Pos logit contributions
 fascination+0.062
 grin+0.060
 stret+0.060
ote+0.057
 amaz+0.057
Neg logit contributions
B-0.042
P-0.042
st-0.040
ety-0.039
L-0.039
Tokenig
Token ID328
Feature activation-1.85e-01

Pos logit contributions
B+0.048
ety+0.044
Tim+0.044
L+0.044
 sometimes+0.043
Neg logit contributions
 grin-0.095
 fascination-0.094
 stret-0.093
ote-0.092
 glances-0.090
Tokenz
Token ID89
Feature activation-1.47e-02

Pos logit contributions
B+0.253
P+0.247
L+0.233
One+0.228
Tim+0.228
Neg logit contributions
 grin-0.387
 stret-0.381
ote-0.370
 fascination-0.367
atively-0.362
Tokenags
Token ID3775
Feature activation+7.23e-02

Pos logit contributions
B+0.028
st+0.026
P+0.026
One+0.026
L+0.026
Neg logit contributions
 grin-0.025
 fascination-0.024
 stret-0.024
ote-0.023
 glances-0.023
Token every
Token ID790
Feature activation+8.25e-03

Pos logit contributions
 grin+0.168
 fascination+0.166
 stret+0.162
ote+0.159
 glances+0.159
Neg logit contributions
B-0.090
P-0.083
L-0.080
Tim-0.079
One-0.079
Token day
Token ID1110
Feature activation-3.42e-02

Pos logit contributions
 fascination+0.022
 grin+0.021
 stret+0.021
 anticipation+0.020
 glances+0.020
Neg logit contributions
B-0.008
P-0.007
ety-0.007
L-0.007
Tim-0.006
Token.
Token ID13
Feature activation+1.73e-03

Pos logit contributions
B+0.036
st+0.033
L+0.030
Tim+0.030
 sometimes+0.030
Neg logit contributions
 fascination-0.087
 grin-0.086
 stret-0.084
ote-0.083
 glances-0.082
Token went
Token ID1816
Feature activation-6.54e-03

Pos logit contributions
B+0.021
One+0.020
st+0.020
P+0.020
Tim+0.020
Neg logit contributions
 grin-0.030
 fascination-0.030
 stret-0.029
ote-0.029
 glances-0.029
Token to
Token ID284
Feature activation+2.21e-02

Pos logit contributions
B+0.012
P+0.011
st+0.011
L+0.011
One+0.011
Neg logit contributions
 grin-0.012
 fascination-0.011
 stret-0.011
ote-0.011
 glances-0.011
Token find
Token ID1064
Feature activation-4.18e-02

Pos logit contributions
 grin+0.045
 stret+0.044
 fascination+0.044
ote+0.043
 glances+0.043
Neg logit contributions
B-0.033
st-0.032
One-0.031
Tim-0.030
P-0.030
Token a
Token ID257
Feature activation-1.07e-02

Pos logit contributions
B+0.037
st+0.033
P+0.032
L+0.031
 sometimes+0.029
Neg logit contributions
 grin-0.116
 stret-0.115
ote-0.113
 fascination-0.111
 expression-0.107
Token tw
Token ID665
Feature activation-8.51e-02

Pos logit contributions
B+0.017
P+0.016
L+0.015
Tim+0.015
st+0.015
Neg logit contributions
 grin-0.021
 fascination-0.021
 stret-0.021
ote-0.020
 hi-0.020
Tokenig
Token ID328
Feature activation-1.84e-01

Pos logit contributions
B+0.117
Tim+0.112
ety+0.105
L+0.105
 sometimes+0.102
Neg logit contributions
 grin-0.188
 fascination-0.180
ote-0.174
 glances-0.173
 stret-0.171
Token to
Token ID284
Feature activation+5.92e-03

Pos logit contributions
B+0.277
P+0.258
Tim+0.254
L+0.252
One+0.243
Neg logit contributions
 grin-0.380
 stret-0.367
ote-0.364
 hi-0.359
 fascination-0.356
Token help
Token ID1037
Feature activation-1.09e-02

Pos logit contributions
 fascination+0.012
 grin+0.012
 stret+0.012
ote+0.011
 glances+0.011
Neg logit contributions
B-0.009
st-0.009
P-0.009
One-0.009
L-0.008
Token her
Token ID607
Feature activation-2.32e-02

Pos logit contributions
B+0.012
st+0.010
L+0.010
Tim+0.010
P+0.010
Neg logit contributions
 fascination-0.028
 grin-0.028
 stret-0.027
ote-0.027
 glances-0.026
Token.
Token ID13
Feature activation-4.80e-03

Pos logit contributions
B+0.033
P+0.030
Tim+0.030
L+0.030
One+0.029
Neg logit contributions
 grin-0.052
 fascination-0.050
 stret-0.048
 hi-0.048
ote-0.048
Token
Token ID198
Feature activation-2.94e-02

Pos logit contributions
B+0.006
P+0.005
st+0.005
One+0.005
 sometimes+0.005
Neg logit contributions
 grin-0.011
 fascination-0.011
 stret-0.011
 hi-0.011
ote-0.010
Token suggested
Token ID5220
Feature activation-2.66e-02

Pos logit contributions
 stret+0.150
 fascination+0.149
icol+0.148
ote+0.146
 menu+0.146
Neg logit contributions
B-0.056
 sometimes-0.055
L-0.054
st-0.052
Tim-0.051
Token they
Token ID484
Feature activation-4.54e-02

Pos logit contributions
 sometimes+0.022
B+0.019
Tim+0.018
 next+0.018
But+0.017
Neg logit contributions
 fascination-0.076
 grin-0.072
 stret-0.072
icol-0.071
oured-0.071
Token find
Token ID1064
Feature activation-2.64e-02

Pos logit contributions
B+0.065
 sometimes+0.063
Tim+0.062
st+0.061
One+0.061
Neg logit contributions
 fascination-0.097
 grin-0.094
 stret-0.093
ote-0.092
 glances-0.090
Token another
Token ID1194
Feature activation-6.43e-02

Pos logit contributions
B+0.021
Tim+0.019
st+0.018
 other+0.018
 sometimes+0.018
Neg logit contributions
 grin-0.078
ote-0.072
 fascination-0.071
 widened-0.069
 stret-0.069
Token tw
Token ID665
Feature activation-8.18e-02

Pos logit contributions
B+0.096
P+0.090
Tim+0.087
L+0.087
ety+0.086
Neg logit contributions
 grin-0.134
 fascination-0.133
 stret-0.128
ote-0.124
 glances-0.122
Tokenig
Token ID328
Feature activation-1.84e-01

Pos logit contributions
B+0.095
Tim+0.088
ety+0.084
L+0.082
Sam+0.081
Neg logit contributions
 grin-0.199
 fascination-0.196
 stret-0.185
 glances-0.184
ote-0.184
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
B+0.232
P+0.213
L+0.207
One+0.206
st+0.206
Neg logit contributions
 grin-0.423
 fascination-0.417
 stret-0.412
ote-0.403
 glances-0.396
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
 sometimes+0.029
B+0.029
st+0.028
Tim+0.027
L+0.026
Neg logit contributions
 grin-0.073
 fascination-0.072
 stret-0.071
ote-0.071
 glances-0.070
Token so
Token ID523
Feature activation+3.37e-02

Pos logit contributions
B+0.019
Tim+0.018
 sometimes+0.017
st+0.017
L+0.017
Neg logit contributions
 grin-0.042
 fascination-0.041
 stret-0.040
ote-0.040
 glances-0.039
Token they
Token ID484
Feature activation-2.45e-02

Pos logit contributions
 fascination+0.087
 grin+0.087
 stret+0.086
ote+0.084
 glances+0.082
Neg logit contributions
B-0.033
L-0.029
st-0.029
Tim-0.029
P-0.028
Token carried
Token ID5281
Feature activation-8.08e-03

Pos logit contributions
B+0.031
st+0.028
P+0.028
Tim+0.028
L+0.028
Neg logit contributions
 grin-0.057
 fascination-0.056
 stret-0.055
ote-0.055
 glances-0.053
Token Tom
Token ID4186
Feature activation-9.56e-03

Pos logit contributions
Tim+0.015
B+0.014
 sometimes+0.014
L+0.014
Sam+0.013
Neg logit contributions
 stret-0.042
 fascination-0.041
 grin-0.041
ote-0.040
oured-0.039
Token wiped
Token ID21122
Feature activation-4.22e-02

Pos logit contributions
B+0.012
st+0.011
P+0.011
L+0.011
One+0.011
Neg logit contributions
 fascination-0.022
 grin-0.021
 stret-0.021
ote-0.021
 glances-0.020
Token away
Token ID1497
Feature activation-1.03e-03

Pos logit contributions
B+0.041
st+0.040
L+0.039
 sometimes+0.036
P+0.035
Neg logit contributions
ote-0.110
 stret-0.108
 grin-0.104
 menu-0.102
 fascination-0.102
Token his
Token ID465
Feature activation+2.01e-02

Pos logit contributions
B+0.001
L+0.001
 sometimes+0.001
st+0.001
 next+0.001
Neg logit contributions
 stret-0.003
ote-0.003
 fascination-0.003
 grin-0.003
oured-0.003
Token z
Token ID1976
Feature activation-3.83e-02

Pos logit contributions
 stret+0.033
 fascination+0.033
 grin+0.033
ote+0.033
 glances+0.032
Neg logit contributions
B-0.036
st-0.035
L-0.034
P-0.034
One-0.033
Tokenig
Token ID328
Feature activation-1.84e-01

Pos logit contributions
B+0.039
One+0.035
Tim+0.034
ety+0.034
P+0.034
Neg logit contributions
 grin-0.098
 fascination-0.096
 stret-0.094
ote-0.093
 glances-0.092
Tokenzag
Token ID50183
Feature activation-1.57e-01

Pos logit contributions
B+0.242
P+0.230
One+0.215
Tim+0.214
L+0.212
Neg logit contributions
 grin-0.419
 stret-0.401
 fascination-0.400
ote-0.391
 expression-0.379
Token and
Token ID290
Feature activation-9.53e-03

Pos logit contributions
B+0.244
st+0.229
P+0.227
L+0.224
One+0.223
Neg logit contributions
 grin-0.310
 fascination-0.309
 stret-0.301
ote-0.295
 glances-0.290
Token went
Token ID1816
Feature activation-1.27e-02

Pos logit contributions
B+0.012
st+0.012
L+0.011
P+0.011
 sometimes+0.011
Neg logit contributions
 stret-0.021
 fascination-0.021
 grin-0.021
ote-0.021
 glances-0.020
Token on
Token ID319
Feature activation-6.37e-02

Pos logit contributions
B+0.018
st+0.017
P+0.017
L+0.017
One+0.016
Neg logit contributions
 fascination-0.027
 grin-0.027
 stret-0.026
ote-0.026
 glances-0.025
Token with
Token ID351
Feature activation+1.50e-02

Pos logit contributions
B+0.080
st+0.075
One+0.073
P+0.072
L+0.070
Neg logit contributions
 stret-0.145
 grin-0.144
 fascination-0.143
ote-0.141
 glances-0.137
Token purple
Token ID14032
Feature activation-2.68e-02

Pos logit contributions
B+0.003
P+0.003
st+0.003
ety+0.003
L+0.003
Neg logit contributions
 fascination-0.003
 grin-0.003
 stret-0.003
 glances-0.003
 gasp-0.003
Token and
Token ID290
Feature activation+3.30e-03

Pos logit contributions
B+0.028
st+0.025
P+0.024
One+0.024
L+0.024
Neg logit contributions
 grin-0.068
 fascination-0.068
 stret-0.066
ote-0.065
 glances-0.064
Token makes
Token ID1838
Feature activation-4.79e-02

Pos logit contributions
 fascination+0.007
 grin+0.007
 stret+0.006
 glances+0.006
 hello+0.006
Neg logit contributions
B-0.005
P-0.005
st-0.005
ety-0.005
L-0.005
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
B+0.079
st+0.073
L+0.073
P+0.072
Tim+0.072
Neg logit contributions
 grin-0.092
 fascination-0.091
 stret-0.089
ote-0.087
 glances-0.085
Token z
Token ID1976
Feature activation-4.66e-02

Pos logit contributions
B+0.029
P+0.028
st+0.028
L+0.028
One+0.028
Neg logit contributions
 grin-0.019
 fascination-0.019
 stret-0.018
ote-0.018
 glances-0.017
Tokenig
Token ID328
Feature activation-1.84e-01

Pos logit contributions
B+0.053
P+0.049
st+0.049
L+0.048
Tim+0.047
Neg logit contributions
 fascination-0.113
 grin-0.112
 stret-0.109
ote-0.107
 glances-0.106
Tokenzag
Token ID50183
Feature activation-1.64e-01

Pos logit contributions
B+0.315
P+0.297
st+0.288
L+0.288
Tim+0.287
Neg logit contributions
 grin-0.341
 fascination-0.333
 stret-0.331
ote-0.316
 glances-0.313
Token shape
Token ID5485
Feature activation-3.40e-02

Pos logit contributions
B+0.281
P+0.263
st+0.263
L+0.259
One+0.258
Neg logit contributions
 grin-0.300
 fascination-0.298
 stret-0.290
ote-0.282
 glances-0.278
Token.
Token ID13
Feature activation-2.40e-03

Pos logit contributions
B+0.033
st+0.032
 sometimes+0.030
One+0.029
L+0.029
Neg logit contributions
 fascination-0.087
 grin-0.086
 stret-0.084
ote-0.083
 glances-0.081
Token It
Token ID632
Feature activation+2.31e-02

Pos logit contributions
B+0.004
st+0.003
P+0.003
ety+0.003
One+0.003
Neg logit contributions
 grin-0.005
 fascination-0.005
 stret-0.005
ote-0.005
 glances-0.005
Token is
Token ID318
Feature activation+2.14e-02

Pos logit contributions
 fascination+0.044
 grin+0.044
 stret+0.042
 glances+0.041
ote+0.041
Neg logit contributions
B-0.038
P-0.036
Tim-0.034
st-0.034
L-0.034
Token difficult
Token ID2408
Feature activation-4.65e-02

Pos logit contributions
B+0.090
st+0.086
ety+0.085
P+0.085
One+0.085
Neg logit contributions
 fascination-0.125
 grin-0.123
 stret-0.121
ote-0.118
 glances-0.113
Token,
Token ID11
Feature activation-4.18e-02

Pos logit contributions
B+0.064
P+0.060
L+0.059
Tim+0.058
One+0.058
Neg logit contributions
 grin-0.099
 fascination-0.098
 stret-0.096
ote-0.094
 glances-0.094
Token but
Token ID475
Feature activation+4.24e-03

Pos logit contributions
B+0.064
P+0.061
st+0.059
One+0.058
L+0.058
Neg logit contributions
 grin-0.084
 fascination-0.083
 stret-0.080
ote-0.079
 glances-0.078
Token it
Token ID340
Feature activation-1.64e-02

Pos logit contributions
 fascination+0.009
 grin+0.009
 amaz+0.009
 expression+0.009
 hello+0.009
Neg logit contributions
B-0.006
P-0.006
Tim-0.005
L-0.005
st-0.005
Token could
Token ID714
Feature activation-1.11e-01

Pos logit contributions
B+0.021
P+0.019
Tim+0.019
L+0.019
One+0.019
Neg logit contributions
 fascination-0.037
 grin-0.037
 glances-0.035
ote-0.034
 stret-0.034
Token also
Token ID635
Feature activation-1.84e-01

Pos logit contributions
B+0.146
P+0.135
st+0.134
L+0.131
Tim+0.129
Neg logit contributions
 grin-0.248
 fascination-0.246
 stret-0.240
ote-0.234
 glances-0.231
Token be
Token ID307
Feature activation-4.95e-02

Pos logit contributions
One+0.200
B+0.200
st+0.198
 sometimes+0.196
Tim+0.189
Neg logit contributions
 stret-0.432
 fascination-0.432
 grin-0.430
ote-0.424
 glances-0.412
Token fun
Token ID1257
Feature activation+2.21e-02

Pos logit contributions
B+0.076
P+0.071
st+0.070
L+0.069
Tim+0.069
Neg logit contributions
 grin-0.099
 fascination-0.098
 stret-0.095
ote-0.093
 glances-0.092
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
 grin+0.056
 fascination+0.055
 stret+0.054
ote+0.053
 glances+0.052
Neg logit contributions
B-0.023
st-0.021
P-0.021
L-0.020
One-0.020
Token tasty
Token ID25103
Feature activation+5.63e-02

Pos logit contributions
B+0.017
st+0.016
 sometimes+0.016
P+0.016
One+0.016
Neg logit contributions
 stret-0.026
 grin-0.025
 fascination-0.025
ote-0.025
 glances-0.024
Token.
Token ID13
Feature activation+2.62e-03

Pos logit contributions
 fascination+0.151
 grin+0.149
 stret+0.148
ote+0.143
 hi+0.139
Neg logit contributions
B-0.053
st-0.051
 sometimes-0.047
P-0.046
Tim-0.043
Token decided
Token ID3066
Feature activation-4.66e-02

Pos logit contributions
B+0.050
st+0.046
P+0.045
One+0.045
Tim+0.045
Neg logit contributions
 grin-0.055
 fascination-0.055
 glances-0.052
 stret-0.052
 gasp-0.052
Token to
Token ID284
Feature activation+3.11e-02

Pos logit contributions
B+0.047
P+0.043
st+0.042
L+0.041
One+0.040
Neg logit contributions
 grin-0.119
 fascination-0.118
 stret-0.116
ote-0.114
 glances-0.113
Token make
Token ID787
Feature activation-3.98e-02

Pos logit contributions
 grin+0.057
 fascination+0.057
 stret+0.055
ote+0.053
 glances+0.053
Neg logit contributions
B-0.053
P-0.050
st-0.050
L-0.049
One-0.049
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
B+0.054
P+0.050
st+0.050
L+0.048
Tim+0.048
Neg logit contributions
 grin-0.088
 fascination-0.087
 stret-0.085
ote-0.084
 glances-0.082
Token z
Token ID1976
Feature activation-4.89e-02

Pos logit contributions
B+0.017
L+0.016
st+0.016
P+0.016
Tim+0.016
Neg logit contributions
 grin-0.025
 fascination-0.025
ote-0.024
oured-0.024
 glances-0.023
Tokenig
Token ID328
Feature activation-1.84e-01

Pos logit contributions
B+0.059
P+0.058
st+0.057
L+0.054
One+0.052
Neg logit contributions
 fascination-0.115
 grin-0.114
 stret-0.109
ote-0.108
 anticipation-0.107
Tokenzag
Token ID50183
Feature activation-1.66e-01

Pos logit contributions
B+0.267
P+0.247
st+0.247
L+0.242
One+0.241
Neg logit contributions
 grin-0.385
 fascination-0.382
 stret-0.373
ote-0.364
 glances-0.359
Token with
Token ID351
Feature activation+1.47e-02

Pos logit contributions
B+0.245
st+0.232
P+0.228
One+0.225
L+0.224
Neg logit contributions
 grin-0.342
 fascination-0.341
 stret-0.330
ote-0.326
 glances-0.319
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 grin+0.036
ote+0.035
 stret+0.035
 fascination+0.034
 glances+0.033
Neg logit contributions
B-0.017
Tim-0.015
st-0.015
P-0.015
L-0.015
Token water
Token ID1660
Feature activation+2.84e-03

Pos logit contributions
B+0.030
P+0.027
st+0.027
L+0.027
One+0.026
Neg logit contributions
 grin-0.052
 fascination-0.052
 stret-0.051
ote-0.050
 glances-0.049
Token.
Token ID13
Feature activation-8.98e-03

Pos logit contributions
 fascination+0.007
 grin+0.007
 stret+0.007
ote+0.007
 glances+0.006
Neg logit contributions
B-0.003
st-0.003
P-0.003
L-0.003
One-0.003
Token he
Token ID339
Feature activation-9.51e-04

Pos logit contributions
 grin+0.081
 stret+0.080
 fascination+0.080
ote+0.077
 glances+0.076
Neg logit contributions
B-0.038
L-0.034
Tim-0.034
st-0.034
One-0.033
Token could
Token ID714
Feature activation-9.39e-02

Pos logit contributions
B+0.001
st+0.001
P+0.001
L+0.001
ety+0.001
Neg logit contributions
 fascination-0.002
 grin-0.002
 stret-0.002
ote-0.002
 expression-0.002
Token finish
Token ID5461
Feature activation-8.44e-02

Pos logit contributions
B+0.144
One+0.140
st+0.140
 sometimes+0.136
P+0.136
Neg logit contributions
 fascination-0.181
 stret-0.180
 grin-0.179
ote-0.176
 glances-0.173
Token his
Token ID465
Feature activation+1.42e-02

Pos logit contributions
B+0.090
st+0.086
P+0.082
One+0.081
Tim+0.078
Neg logit contributions
 grin-0.206
 fascination-0.204
 stret-0.202
ote-0.201
 glances-0.195
Token z
Token ID1976
Feature activation-4.65e-02

Pos logit contributions
 fascination+0.027
 grin+0.027
 stret+0.027
ote+0.026
 glances+0.026
Neg logit contributions
B-0.022
st-0.021
P-0.021
One-0.021
Tim-0.020
Tokenig
Token ID328
Feature activation-1.83e-01

Pos logit contributions
B+0.048
Tim+0.043
ety+0.042
L+0.042
One+0.042
Neg logit contributions
 grin-0.117
 fascination-0.116
 stret-0.114
ote-0.112
 glances-0.110
Tokenzag
Token ID50183
Feature activation-1.58e-01

Pos logit contributions
B+0.223
P+0.205
L+0.197
Tim+0.196
One+0.192
Neg logit contributions
 grin-0.433
 stret-0.421
 fascination-0.417
ote-0.408
 glances-0.397
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
B+0.235
P+0.218
st+0.217
L+0.214
One+0.213
Neg logit contributions
 grin-0.324
 fascination-0.322
 stret-0.315
ote-0.307
 glances-0.302
Token<|endoftext|>
Token ID50256
Feature activation+6.24e-02

Pos logit contributions
 fascination+0.012
 grin+0.011
 hi+0.011
 glances+0.011
 hello+0.011
Neg logit contributions
B-0.005
P-0.005
ety-0.005
st-0.005
Tim-0.005
TokenOnce
Token ID7454
Feature activation-5.29e-02

Pos logit contributions
 grin+0.139
 fascination+0.137
 glances+0.131
 hi+0.131
 hello+0.131
Neg logit contributions
ety-0.070
B-0.068
P-0.066
One-0.062
Tim-0.062
Token upon
Token ID2402
Feature activation+5.38e-03

Pos logit contributions
B+0.074
P+0.070
st+0.068
Tim+0.062
L+0.062
Neg logit contributions
 grin-0.115
 fascination-0.113
 hello-0.107
 hi-0.107
 expression-0.106
Token They
Token ID1119
Feature activation+2.29e-02

Pos logit contributions
B+0.015
P+0.014
st+0.014
One+0.013
L+0.013
Neg logit contributions
 grin-0.027
 fascination-0.027
 stret-0.027
ote-0.026
 glances-0.026
Token made
Token ID925
Feature activation-4.53e-02

Pos logit contributions
 grin+0.052
 fascination+0.052
 stret+0.051
ote+0.051
 glances+0.049
Neg logit contributions
B-0.030
st-0.027
L-0.027
P-0.027
Tim-0.026
Token a
Token ID257
Feature activation-7.65e-03

Pos logit contributions
B+0.045
L+0.041
Tim+0.040
 sometimes+0.040
st+0.040
Neg logit contributions
 grin-0.117
ote-0.114
 stret-0.112
 fascination-0.110
ishly-0.108
Token big
Token ID1263
Feature activation+2.99e-02

Pos logit contributions
B+0.011
L+0.010
st+0.010
P+0.010
Tim+0.010
Neg logit contributions
 grin-0.016
 fascination-0.016
ote-0.016
 stret-0.016
 glances-0.015
Token z
Token ID1976
Feature activation-3.70e-02

Pos logit contributions
ote+0.065
 fascination+0.064
 stret+0.064
 grin+0.064
leaf+0.063
Neg logit contributions
B-0.037
st-0.036
Tim-0.035
L-0.035
One-0.033
Tokenig
Token ID328
Feature activation-1.83e-01

Pos logit contributions
B+0.035
P+0.033
st+0.032
L+0.031
Tim+0.029
Neg logit contributions
 fascination-0.097
 grin-0.096
 stret-0.094
ote-0.092
 glances-0.091
Tokenzag
Token ID50183
Feature activation-1.59e-01

Pos logit contributions
B+0.191
P+0.173
st+0.167
L+0.165
One+0.164
Neg logit contributions
 grin-0.461
 fascination-0.455
 stret-0.451
ote-0.441
 glances-0.433
Token in
Token ID287
Feature activation-2.93e-02

Pos logit contributions
B+0.199
st+0.193
One+0.184
L+0.180
Tim+0.180
Neg logit contributions
 fascination-0.356
 grin-0.355
 stret-0.345
ote-0.342
 hi-0.336
Token the
Token ID262
Feature activation-3.90e-02

Pos logit contributions
ety+0.046
P+0.044
B+0.044
st+0.041
L+0.039
Neg logit contributions
 fascination-0.062
 amaz-0.059
 anticipation-0.057
 grin-0.056
 gasp-0.055
Token air
Token ID1633
Feature activation-5.63e-04

Pos logit contributions
B+0.049
P+0.048
st+0.046
L+0.046
Tim+0.045
Neg logit contributions
 grin-0.090
 fascination-0.088
ote-0.087
 stret-0.086
 glances-0.084
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
st+0.000
B+0.000
 sometimes+0.000
One+0.000
L+0.000
Neg logit contributions
 fascination-0.001
 grin-0.001
 stret-0.001
ote-0.001
 glances-0.001
Tokenon
Token ID261
Feature activation-1.11e-01

Pos logit contributions
 grin+0.071
 fascination+0.070
 stret+0.067
ote+0.066
 glances+0.065
Neg logit contributions
B-0.052
ety-0.049
P-0.048
st-0.047
Tim-0.047
Token and
Token ID290
Feature activation-8.24e-03

Pos logit contributions
B+0.124
P+0.112
st+0.111
L+0.109
One+0.109
Neg logit contributions
 grin-0.269
 fascination-0.267
 stret-0.262
ote-0.257
 glances-0.254
Token makes
Token ID1838
Feature activation-4.58e-02

Pos logit contributions
B+0.013
st+0.013
P+0.013
L+0.012
One+0.012
Neg logit contributions
 grin-0.016
 stret-0.016
 fascination-0.016
ote-0.015
 expression-0.015
Token a
Token ID257
Feature activation-1.38e-02

Pos logit contributions
B+0.050
L+0.047
st+0.046
One+0.045
Tim+0.044
Neg logit contributions
 grin-0.111
 stret-0.110
 fascination-0.109
ote-0.106
 glances-0.104
Token z
Token ID1976
Feature activation-4.67e-02

Pos logit contributions
B+0.021
L+0.020
One+0.020
P+0.020
st+0.020
Neg logit contributions
 fascination-0.026
 stret-0.026
 grin-0.026
ote-0.026
 glances-0.026
Tokenig
Token ID328
Feature activation-1.83e-01

Pos logit contributions
P+0.053
B+0.053
st+0.052
L+0.050
One+0.049
Neg logit contributions
 fascination-0.111
 stret-0.108
 grin-0.108
ote-0.104
 anticipation-0.104
Tokenzag
Token ID50183
Feature activation-1.66e-01

Pos logit contributions
B+0.267
P+0.248
st+0.247
L+0.243
One+0.242
Neg logit contributions
 grin-0.381
 fascination-0.379
 stret-0.370
ote-0.361
 glances-0.356
Token on
Token ID319
Feature activation-7.01e-02

Pos logit contributions
B+0.251
st+0.245
One+0.240
L+0.236
P+0.235
Neg logit contributions
 fascination-0.321
 grin-0.319
 stret-0.315
ote-0.309
 glances-0.300
Token the
Token ID262
Feature activation-4.03e-02

Pos logit contributions
B+0.086
P+0.079
st+0.078
L+0.076
One+0.076
Neg logit contributions
 grin-0.164
 fascination-0.163
 stret-0.159
ote-0.156
 glances-0.154
Token paper
Token ID3348
Feature activation-4.23e-02

Pos logit contributions
B+0.062
st+0.059
P+0.059
L+0.058
One+0.058
Neg logit contributions
 grin-0.080
 fascination-0.080
 stret-0.077
ote-0.075
 glances-0.075
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
B+0.050
P+0.045
st+0.045
L+0.044
One+0.044
Neg logit contributions
 grin-0.101
 fascination-0.100
 stret-0.098
ote-0.096
 glances-0.095
Token asked
Token ID1965
Feature activation-5.01e-02

Pos logit contributions
 stret+0.099
 grin+0.099
 fascination+0.098
ote+0.095
 glances+0.092
Neg logit contributions
B-0.045
L-0.042
One-0.041
 sometimes-0.040
st-0.040
Token anx
Token ID7296
Feature activation+6.35e-03

Pos logit contributions
B+0.045
 sometimes+0.044
L+0.042
Tim+0.037
 next+0.037
Neg logit contributions
 grin-0.142
 fascination-0.135
 gasp-0.133
ote-0.133
 stret-0.132
Tokeniously
Token ID6819
Feature activation+2.63e-02

Pos logit contributions
 fascination+0.007
 grin+0.007
 stret+0.007
 glances+0.006
 hi+0.006
Neg logit contributions
B-0.015
st-0.014
Tim-0.014
ety-0.014
P-0.014
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
 stret+0.065
 fascination+0.065
 grin+0.064
ote+0.063
icol+0.062
Neg logit contributions
 sometimes-0.027
 next-0.026
One-0.025
B-0.025
st-0.024
Token "
Token ID366
Feature activation+1.69e-02

Pos logit contributions
B+0.026
L+0.024
One+0.024
Tim+0.023
But+0.023
Neg logit contributions
 grin-0.041
 fascination-0.040
 stret-0.039
ote-0.039
 glances-0.038
TokenWhat
Token ID2061
Feature activation+1.29e-01

Pos logit contributions
 stret+0.037
 fascination+0.037
ote+0.035
 grin+0.035
 hi+0.035
Neg logit contributions
B-0.020
st-0.019
ety-0.019
 day-0.018
P-0.018
Token is
Token ID318
Feature activation+1.78e-02

Pos logit contributions
 stret+0.300
 grin+0.298
 fascination+0.297
ote+0.287
icol+0.279
Neg logit contributions
B-0.160
st-0.157
P-0.151
One-0.151
L-0.150
Token the
Token ID262
Feature activation-8.78e-03

Pos logit contributions
 grin+0.044
 fascination+0.043
 stret+0.041
ote+0.040
 glances+0.040
Neg logit contributions
B-0.021
P-0.020
st-0.019
 sometimes-0.019
Tim-0.019
Token answer
Token ID3280
Feature activation+7.52e-03

Pos logit contributions
B+0.014
st+0.014
One+0.014
P+0.013
Tim+0.013
Neg logit contributions
 fascination-0.017
 grin-0.017
 stret-0.017
 glances-0.016
ote-0.016
Token?"
Token ID1701
Feature activation+1.71e-02

Pos logit contributions
 fascination+0.021
 stret+0.020
 grin+0.020
ote+0.019
 glances+0.019
Neg logit contributions
 sometimes-0.006
st-0.006
One-0.006
B-0.005
 next-0.005
Token The
Token ID383
Feature activation+2.46e-04

Pos logit contributions
 grin+0.048
 fascination+0.048
 stret+0.047
ote+0.046
 glances+0.046
Neg logit contributions
B-0.013
P-0.011
st-0.011
L-0.010
One-0.010
Token she
Token ID673
Feature activation-1.92e-02

Pos logit contributions
B+0.013
st+0.012
Tim+0.012
ety+0.012
P+0.011
Neg logit contributions
 fascination-0.018
 hello-0.017
 grin-0.017
 hi-0.017
ote-0.016
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
B+0.028
st+0.025
P+0.025
One+0.025
Tim+0.025
Neg logit contributions
 grin-0.039
 fascination-0.039
 stret-0.038
 glances-0.037
ote-0.036
Token her
Token ID607
Feature activation+3.76e-02

Pos logit contributions
 fascination+0.026
 grin+0.025
 stret+0.024
 hi+0.024
ote+0.024
Neg logit contributions
B-0.020
st-0.019
P-0.018
Tim-0.018
One-0.018
Token mom
Token ID1995
Feature activation+4.71e-04

Pos logit contributions
 grin+0.084
 fascination+0.083
 stret+0.081
ote+0.080
 glances+0.078
Neg logit contributions
B-0.050
st-0.047
P-0.046
One-0.045
Tim-0.045
Token went
Token ID1816
Feature activation+2.61e-04

Pos logit contributions
 fascination+0.001
 grin+0.001
 stret+0.001
 glances+0.001
 gasp+0.001
Neg logit contributions
B-0.000
P-0.000
st-0.000
L-0.000
Tim-0.000
Token out
Token ID503
Feature activation+1.20e-01

Pos logit contributions
 grin+0.001
 fascination+0.001
 stret+0.001
ote+0.001
 glances+0.001
Neg logit contributions
B-0.000
st-0.000
P-0.000
One-0.000
L-0.000
Token into
Token ID656
Feature activation-9.75e-05

Pos logit contributions
 fascination+0.274
 grin+0.271
 stret+0.267
 glances+0.258
 anticipation+0.256
Neg logit contributions
B-0.148
P-0.141
Tim-0.138
L-0.134
ety-0.133
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
B+0.000
P+0.000
ety+0.000
st+0.000
Tim+0.000
Neg logit contributions
 fascination-0.000
 grin-0.000
 stret-0.000
 anticipation-0.000
 glances-0.000
Token woods
Token ID16479
Feature activation+2.59e-02

Pos logit contributions
B+0.033
ety+0.030
Tim+0.030
st+0.029
P+0.029
Neg logit contributions
 fascination-0.042
 stret-0.041
 grin-0.041
 menu-0.040
 amaz-0.039
Token.
Token ID13
Feature activation-1.72e-03

Pos logit contributions
 fascination+0.057
 grin+0.057
 stret+0.056
 glances+0.054
ote+0.054
Neg logit contributions
B-0.034
P-0.032
L-0.030
Tim-0.030
ety-0.029
Token As
Token ID1081
Feature activation+5.17e-02

Pos logit contributions
B+0.003
P+0.002
Tim+0.002
st+0.002
L+0.002
Neg logit contributions
 fascination-0.004
 grin-0.004
 glances-0.003
 stret-0.003
ote-0.003
Token smiled
Token ID13541
Feature activation+1.16e-02

Pos logit contributions
 sometimes+0.006
L+0.006
st+0.006
my+0.006
 next+0.006
Neg logit contributions
 stret-0.011
ote-0.011
 fascination-0.011
icol-0.010
latable-0.010
Token and
Token ID290
Feature activation+1.68e-02

Pos logit contributions
 stret+0.036
leaf+0.035
ote+0.035
 fascination+0.034
latable+0.034
Neg logit contributions
 next-0.006
 sometimes-0.005
B-0.005
L-0.004
One-0.004
Token said
Token ID531
Feature activation+3.72e-02

Pos logit contributions
 fascination+0.045
 stret+0.043
ote+0.043
 grin+0.043
 menu+0.042
Neg logit contributions
B-0.017
L-0.016
P-0.016
st-0.014
Tim-0.014
Token,
Token ID11
Feature activation-2.18e-03

Pos logit contributions
 fascination+0.108
 stret+0.102
ote+0.100
 anticipation+0.099
 amaz+0.099
Neg logit contributions
B-0.032
"-0.029
L-0.028
 sometimes-0.028
st-0.026
Token "
Token ID366
Feature activation+3.00e-02

Pos logit contributions
B+0.003
L+0.003
Tim+0.003
One+0.003
st+0.003
Neg logit contributions
 fascination-0.005
 grin-0.005
 stret-0.004
 glances-0.004
ote-0.004
TokenOf
Token ID5189
Feature activation+1.17e-01

Pos logit contributions
leaf+0.055
 fascination+0.052
 grin+0.048
 hi+0.047
 hello+0.047
Neg logit contributions
st-0.045
ety-0.045
 sometimes-0.041
 day-0.041
P-0.041
Token course
Token ID1781
Feature activation+6.00e-02

Pos logit contributions
 grin+0.187
 fascination+0.186
 glances+0.177
 stret+0.173
 relief+0.171
Neg logit contributions
B-0.217
ety-0.210
st-0.207
P-0.203
One-0.201
Token,
Token ID11
Feature activation+7.45e-03

Pos logit contributions
 fascination+0.160
 stret+0.157
 grin+0.154
ote+0.148
 glances+0.146
Neg logit contributions
B-0.059
Tim-0.056
L-0.056
 sometimes-0.053
st-0.053
Token Tim
Token ID5045
Feature activation-3.92e-02

Pos logit contributions
 grin+0.019
 fascination+0.019
 stret+0.018
ote+0.018
 glances+0.018
Neg logit contributions
B-0.008
Tim-0.007
L-0.007
st-0.007
P-0.007
Tokenmy
Token ID1820
Feature activation-2.65e-02

Pos logit contributions
B+0.046
P+0.043
One+0.041
st+0.040
L+0.039
Neg logit contributions
 grin-0.093
 fascination-0.089
 stret-0.087
ote-0.086
 glances-0.086
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
B+0.028
st+0.027
L+0.025
P+0.025
ety+0.025
Neg logit contributions
 fascination-0.067
 grin-0.066
 stret-0.066
ote-0.063
 glances-0.062
Token
Token ID198
Feature activation-2.40e-02

Pos logit contributions
B+0.011
P+0.011
st+0.011
ety+0.009
 sometimes+0.009
Neg logit contributions
 grin-0.027
 fascination-0.027
 stret-0.027
 hi-0.026
ows-0.026
Token
Token ID198
Feature activation-2.11e-02

Pos logit contributions
ety+0.030
st+0.029
P+0.028
 sometimes+0.025
B+0.025
Neg logit contributions
 grin-0.048
 fascination-0.048
 hello-0.048
 stret-0.047
 hi-0.047
TokenThe
Token ID464
Feature activation-2.84e-02

Pos logit contributions
st+0.023
ety+0.022
ib+0.022
ll+0.020
o+0.018
Neg logit contributions
 stret-0.047
 fascination-0.043
 hello-0.043
 hi-0.043
 widened-0.043
Token girl
Token ID2576
Feature activation-6.62e-03

Pos logit contributions
B+0.035
st+0.032
P+0.032
L+0.031
One+0.031
Neg logit contributions
 grin-0.066
 fascination-0.066
 stret-0.065
ote-0.063
 glances-0.062
Token felt
Token ID2936
Feature activation-4.49e-05

Pos logit contributions
B+0.008
P+0.008
st+0.008
L+0.008
One+0.008
Neg logit contributions
 grin-0.015
 fascination-0.015
 stret-0.015
ote-0.014
 glances-0.014
Token something
Token ID1223
Feature activation+1.50e-01

Pos logit contributions
B+0.000
 sometimes+0.000
L+0.000
But+0.000
P+0.000
Neg logit contributions
ote-0.000
 stret-0.000
 fascination-0.000
icol-0.000
 grin-0.000
Token in
Token ID287
Feature activation-1.98e-02

Pos logit contributions
 fascination+0.357
ote+0.351
 grin+0.345
 amaz+0.341
 vows+0.341
Neg logit contributions
B-0.205
 day-0.181
st-0.174
Tim-0.169
 sometimes-0.169
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
B+0.026
P+0.024
st+0.024
L+0.023
One+0.023
Neg logit contributions
 grin-0.045
 fascination-0.045
 stret-0.044
ote-0.043
 glances-0.042
Token water
Token ID1660
Feature activation+6.71e-03

Pos logit contributions
B+0.029
P+0.028
st+0.027
One+0.026
Tim+0.025
Neg logit contributions
 grin-0.054
 fascination-0.054
ote-0.052
 stret-0.052
oured-0.050
Token.
Token ID13
Feature activation-4.22e-04

Pos logit contributions
 fascination+0.017
 grin+0.017
 stret+0.017
ote+0.016
 glances+0.016
Neg logit contributions
B-0.007
st-0.006
P-0.006
ety-0.006
L-0.006
Token She
Token ID1375
Feature activation+3.67e-02

Pos logit contributions
B+0.000
L+0.000
 sometimes+0.000
Tim+0.000
ety+0.000
Neg logit contributions
 stret-0.001
 grin-0.001
 fascination-0.001
ote-0.001
 glances-0.001
TokenHis
Token ID6653
Feature activation+2.01e-03

Pos logit contributions
ety+0.026
st+0.024
B+0.023
 sometimes+0.022
P+0.020
Neg logit contributions
 stret-0.047
 grin-0.046
 fascination-0.045
 hi-0.044
 hello-0.043
Token owner
Token ID4870
Feature activation+7.38e-03

Pos logit contributions
 fascination+0.005
 grin+0.005
 stret+0.005
ote+0.005
 glances+0.005
Neg logit contributions
B-0.002
One-0.002
L-0.001
P-0.001
Tim-0.001
Token smiled
Token ID13541
Feature activation+8.43e-03

Pos logit contributions
 stret+0.016
 fascination+0.015
icol+0.015
ote+0.014
 grin+0.014
Neg logit contributions
B-0.012
L-0.012
 sometimes-0.011
Tim-0.011
One-0.011
Token.
Token ID13
Feature activation+8.21e-03

Pos logit contributions
 stret+0.026
leaf+0.025
ote+0.025
 fascination+0.025
 grin+0.025
Neg logit contributions
B-0.004
 sometimes-0.004
 next-0.004
One-0.004
L-0.003
Token "
Token ID366
Feature activation+2.88e-02

Pos logit contributions
 fascination+0.019
 grin+0.018
 stret+0.018
ote+0.018
 glances+0.017
Neg logit contributions
B-0.011
L-0.010
Tim-0.010
One-0.010
st-0.009
TokenOf
Token ID5189
Feature activation+1.17e-01

Pos logit contributions
 fascination+0.041
leaf+0.041
 hi+0.038
 grin+0.037
 hello+0.037
Neg logit contributions
ety-0.056
st-0.054
B-0.054
P-0.051
Tim-0.050
Token course
Token ID1781
Feature activation+5.93e-02

Pos logit contributions
 grin+0.203
 fascination+0.201
 stret+0.192
 glances+0.190
 hi+0.189
Neg logit contributions
B-0.208
st-0.195
ety-0.195
L-0.194
P-0.193
Token you
Token ID345
Feature activation+5.76e-02

Pos logit contributions
 fascination+0.170
 stret+0.166
 grin+0.166
ote+0.157
 glances+0.157
Neg logit contributions
B-0.048
Tim-0.042
L-0.041
st-0.041
P-0.038
Token can
Token ID460
Feature activation-3.42e-02

Pos logit contributions
 stret+0.160
 fascination+0.152
 grin+0.149
ote+0.149
 glances+0.143
Neg logit contributions
B-0.050
 sometimes-0.050
One-0.050
st-0.048
L-0.045
Token,"
Token ID553
Feature activation-1.08e-02

Pos logit contributions
B+0.047
One+0.043
st+0.043
P+0.043
Tim+0.042
Neg logit contributions
 fascination-0.076
 grin-0.075
 stret-0.074
ote-0.072
 glances-0.071
Token he
Token ID339
Feature activation+2.44e-03

Pos logit contributions
B+0.013
st+0.012
P+0.012
One+0.012
ety+0.012
Neg logit contributions
 grin-0.025
 fascination-0.025
 stret-0.024
ote-0.023
 glances-0.023
Token Lily
Token ID20037
Feature activation-3.18e-02

Pos logit contributions
B+0.019
st+0.018
ety+0.017
P+0.017
One+0.017
Neg logit contributions
 grin-0.028
 fascination-0.028
 hello-0.026
 stret-0.026
 hi-0.026
Token said
Token ID531
Feature activation+4.57e-02

Pos logit contributions
B+0.043
L+0.040
P+0.039
st+0.039
Tim+0.039
Neg logit contributions
 fascination-0.071
 grin-0.070
 stret-0.069
ote-0.068
 glances-0.066
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 fascination+0.128
 stret+0.124
ote+0.123
 grin+0.122
 amaz+0.120
Neg logit contributions
B-0.036
st-0.033
L-0.030
 sometimes-0.029
P-0.028
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 grin+0.050
 hello+0.049
 fascination+0.049
 hi+0.048
 glances+0.047
Neg logit contributions
st-0.018
B-0.018
ety-0.016
P-0.016
Tim-0.015
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
st+0.012
ety+0.011
P+0.011
B+0.011
 sometimes+0.010
Neg logit contributions
 fascination-0.029
 grin-0.029
 stret-0.029
 hello-0.028
 glances-0.028
TokenThey
Token ID2990
Feature activation+4.81e-02

Pos logit contributions
ety+0.012
st+0.011
B+0.010
 day+0.010
 sometimes+0.010
Neg logit contributions
 fascination-0.026
 grin-0.025
 stret-0.025
 glances-0.025
 hello-0.024
Token kept
Token ID4030
Feature activation-2.67e-02

Pos logit contributions
ote+0.109
 fascination+0.109
 stret+0.108
 grin+0.107
 hi+0.103
Neg logit contributions
B-0.063
L-0.061
P-0.059
Tim-0.057
One-0.056
Token building
Token ID2615
Feature activation-1.95e-02

Pos logit contributions
B+0.040
st+0.038
P+0.037
L+0.037
Tim+0.036
Neg logit contributions
 grin-0.055
 fascination-0.054
 stret-0.053
ote-0.052
 glances-0.051
Token the
Token ID262
Feature activation-4.66e-03

Pos logit contributions
B+0.020
 sometimes+0.020
st+0.020
L+0.019
 next+0.019
Neg logit contributions
ote-0.049
 grin-0.049
 fascination-0.047
 stret-0.047
 hi-0.046
Token tower
Token ID10580
Feature activation-7.07e-02

Pos logit contributions
B+0.007
P+0.006
st+0.006
ety+0.006
L+0.006
Neg logit contributions
 grin-0.010
 fascination-0.010
 stret-0.010
ote-0.009
 menu-0.009
Token higher
Token ID2440
Feature activation+3.22e-02

Pos logit contributions
B+0.095
P+0.088
L+0.083
st+0.083
ety+0.083
Neg logit contributions
 grin-0.156
 fascination-0.155
 stret-0.152
 glances-0.147
ote-0.146
Token stories
Token ID3923
Feature activation+9.80e-02

Pos logit contributions
B+0.017
 sometimes+0.017
st+0.016
One+0.015
Tim+0.015
Neg logit contributions
 stret-0.024
 grin-0.024
 fascination-0.023
ote-0.023
oured-0.022
Token from
Token ID422
Feature activation+5.93e-02

Pos logit contributions
 fascination+0.257
 grin+0.250
 stret+0.243
ote+0.243
 expression+0.233
Neg logit contributions
 sometimes-0.103
B-0.099
st-0.096
One-0.096
 next-0.090
Token all
Token ID477
Feature activation-5.59e-02

Pos logit contributions
 grin+0.149
 fascination+0.143
 stret+0.140
ote+0.139
 glances+0.136
Neg logit contributions
B-0.066
st-0.060
 sometimes-0.060
One-0.059
L-0.057
Token over
Token ID625
Feature activation+6.62e-02

Pos logit contributions
B+0.069
P+0.063
L+0.062
Tim+0.061
st+0.061
Neg logit contributions
 fascination-0.129
 grin-0.129
 stret-0.126
 glances-0.123
ote-0.122
Token the
Token ID262
Feature activation-3.37e-03

Pos logit contributions
 fascination+0.155
 grin+0.154
 stret+0.151
 glances+0.147
ote+0.146
Neg logit contributions
B-0.078
P-0.072
L-0.070
st-0.070
ety-0.069
Token world
Token ID995
Feature activation+5.48e-02

Pos logit contributions
B+0.004
st+0.004
One+0.004
P+0.004
L+0.004
Neg logit contributions
 grin-0.008
 fascination-0.008
ote-0.008
 stret-0.008
oured-0.007
Token."
Token ID526
Feature activation-1.24e-02

Pos logit contributions
 grin+0.143
 fascination+0.142
 stret+0.136
ote+0.135
 expression+0.133
Neg logit contributions
B-0.056
st-0.052
One-0.049
P-0.049
 sometimes-0.049
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
B+0.016
st+0.015
P+0.015
ety+0.015
L+0.015
Neg logit contributions
 grin-0.028
 fascination-0.027
 stret-0.026
 hi-0.026
 glances-0.025
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
ety+0.013
st+0.012
P+0.012
B+0.011
 sometimes+0.010
Neg logit contributions
 grin-0.038
 fascination-0.038
 stret-0.037
 glances-0.036
 hi-0.036
TokenJack
Token ID14295
Feature activation-9.00e-03

Pos logit contributions
ety+0.022
st+0.021
 sometimes+0.018
B+0.018
ib+0.017
Neg logit contributions
 fascination-0.027
 grin-0.026
 glances-0.025
 widened-0.025
 menu-0.025
Token's
Token ID338
Feature activation-5.81e-02

Pos logit contributions
B+0.011
st+0.010
P+0.010
L+0.010
One+0.010
Neg logit contributions
 grin-0.021
 fascination-0.021
 stret-0.020
ote-0.020
 glances-0.019
Token next
Token ID1306
Feature activation+1.74e-03

Pos logit contributions
B+0.005
P+0.005
st+0.004
L+0.004
One+0.004
Neg logit contributions
 grin-0.022
 fascination-0.022
 stret-0.021
ote-0.021
 glances-0.021
Token day
Token ID1110
Feature activation-5.85e-02

Pos logit contributions
 grin+0.005
 stret+0.004
 fascination+0.004
ote+0.004
 glances+0.004
Neg logit contributions
B-0.002
P-0.001
L-0.001
ety-0.001
Tim-0.001
Token.
Token ID13
Feature activation-2.58e-03

Pos logit contributions
B+0.067
P+0.061
st+0.061
L+0.060
One+0.059
Neg logit contributions
 grin-0.140
 fascination-0.139
 stret-0.136
ote-0.134
 glances-0.132
Token<|endoftext|>
Token ID50256
Feature activation+5.02e-02

Pos logit contributions
st+0.003
P+0.003
B+0.003
ety+0.003
One+0.002
Neg logit contributions
 grin-0.006
 hi-0.006
 fascination-0.006
 hello-0.006
 stret-0.006
TokenOnce
Token ID7454
Feature activation-5.62e-02

Pos logit contributions
 grin+0.120
 fascination+0.116
 hello+0.113
 menu+0.113
 stret+0.113
Neg logit contributions
ety-0.050
B-0.046
P-0.045
st-0.041
Tim-0.041
Token,
Token ID11
Feature activation+1.42e-02

Pos logit contributions
B+0.097
P+0.092
st+0.088
Tim+0.082
Sam+0.078
Neg logit contributions
 grin-0.103
 fascination-0.099
 hello-0.097
 hi-0.094
 expression-0.094
Token there
Token ID612
Feature activation-2.42e-02

Pos logit contributions
 grin+0.030
 hello+0.030
 hi+0.030
 fascination+0.030
 gasp+0.028
Neg logit contributions
B-0.019
st-0.018
P-0.017
Tim-0.015
One-0.015
Token was
Token ID373
Feature activation-2.32e-03

Pos logit contributions
B+0.025
P+0.023
st+0.022
ety+0.022
One+0.021
Neg logit contributions
 fascination-0.061
 grin-0.061
 stret-0.058
 hi-0.057
 glances-0.057
Token a
Token ID257
Feature activation+1.36e-02

Pos logit contributions
st+0.004
B+0.004
ety+0.004
One+0.004
L+0.003
Neg logit contributions
 fascination-0.004
 grin-0.004
 stret-0.004
 hi-0.004
 hello-0.004
Token boy
Token ID2933
Feature activation-1.32e-02

Pos logit contributions
 grin+0.029
 fascination+0.029
 stret+0.028
ote+0.027
 glances+0.027
Neg logit contributions
B-0.019
st-0.018
P-0.018
One-0.017
ety-0.017
Token named
Token ID3706
Feature activation-2.63e-02

Pos logit contributions
Next+0.026
B+0.025
There+0.025
P+0.024
ety+0.024
Neg logit contributions
 stret-0.017
 hello-0.015
 grin-0.015
 hi-0.015
 fascination-0.014
Token smells
Token ID25760
Feature activation+1.28e-01

Pos logit contributions
 sometimes+0.004
 next+0.003
 day+0.003
L+0.003
Tim+0.003
Neg logit contributions
ote-0.005
 stret-0.004
oured-0.004
 fascination-0.004
person-0.004
Token bad
Token ID2089
Feature activation+2.23e-02

Pos logit contributions
ote+0.365
 fascination+0.347
 stret+0.343
oured+0.334
ishly+0.330
Neg logit contributions
 sometimes-0.121
 day-0.108
B-0.106
L-0.104
 next-0.098
Token.
Token ID13
Feature activation-9.61e-03

Pos logit contributions
 fascination+0.063
 grin+0.062
 stret+0.061
ote+0.060
 hi+0.058
Neg logit contributions
B-0.017
st-0.017
 sometimes-0.016
L-0.015
P-0.014
Token But
Token ID887
Feature activation+3.08e-03

Pos logit contributions
B+0.012
st+0.012
P+0.011
ety+0.011
Tim+0.010
Neg logit contributions
 fascination-0.022
 grin-0.021
 hello-0.021
 glances-0.021
 hi-0.021
Token they
Token ID484
Feature activation-4.61e-02

Pos logit contributions
 fascination+0.007
 grin+0.007
 glances+0.006
 hi+0.006
 hello+0.006
Neg logit contributions
B-0.004
P-0.004
st-0.004
One-0.004
Tim-0.004
Token think
Token ID892
Feature activation+1.73e-02

Pos logit contributions
 sometimes+0.058
 also+0.052
One+0.045
But+0.045
I+0.045
Neg logit contributions
 stret-0.113
icol-0.112
ote-0.110
oured-0.106
 gru-0.106
Token it
Token ID340
Feature activation+9.30e-03

Pos logit contributions
 grin+0.042
 fascination+0.041
 stret+0.040
ote+0.040
 glances+0.039
Neg logit contributions
B-0.020
Tim-0.018
st-0.018
L-0.018
 sometimes-0.018
Token is
Token ID318
Feature activation+1.74e-02

Pos logit contributions
 grin+0.021
 fascination+0.020
 stret+0.020
ote+0.020
 expression+0.019
Neg logit contributions
B-0.013
st-0.013
One-0.012
 sometimes-0.012
P-0.012
Token cool
Token ID3608
Feature activation-5.93e-03

Pos logit contributions
 grin+0.035
 fascination+0.035
 stret+0.034
ote+0.034
 glances+0.032
Neg logit contributions
B-0.027
st-0.025
P-0.025
One-0.025
 sometimes-0.025
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
B+0.005
st+0.005
 sometimes+0.005
ety+0.005
L+0.004
Neg logit contributions
 fascination-0.016
 grin-0.016
 stret-0.016
ote-0.015
 hi-0.015
Token
Token ID198
Feature activation-2.81e-02

Pos logit contributions
B+0.013
st+0.013
P+0.013
ety+0.012
Tim+0.011
Neg logit contributions
 hello-0.027
 fascination-0.027
 hi-0.027
 grin-0.027
 glances-0.026
TokenThere
Token ID1858
Feature activation-8.48e-02

Pos logit contributions
ety+0.040
st+0.039
B+0.036
 sometimes+0.035
One+0.035
Neg logit contributions
 stret-0.060
 fascination-0.059
 grin-0.058
leaf-0.055
 glances-0.055
Token,"
Token ID553
Feature activation-1.62e-02

Pos logit contributions
B+0.104
st+0.100
P+0.096
Tim+0.096
L+0.096
Neg logit contributions
 fascination-0.197
 grin-0.195
 stret-0.193
ote-0.189
 glances-0.183
Token Lily
Token ID20037
Feature activation-3.78e-02

Pos logit contributions
B+0.014
L+0.013
Tim+0.013
P+0.013
 sometimes+0.012
Neg logit contributions
 fascination-0.044
 grin-0.043
 stret-0.043
ote-0.043
oured-0.041
Token says
Token ID1139
Feature activation+6.13e-02

Pos logit contributions
 sometimes+0.058
st+0.056
L+0.055
B+0.054
One+0.053
Neg logit contributions
 stret-0.073
 fascination-0.072
ote-0.071
 grin-0.070
ured-0.069
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
 fascination+0.177
ote+0.170
 stret+0.170
 grin+0.168
 amaz+0.167
Neg logit contributions
B-0.039
 next-0.035
 sometimes-0.034
st-0.034
L-0.033
Token "
Token ID366
Feature activation+3.30e-02

Pos logit contributions
 grin+0.033
 fascination+0.033
 stret+0.032
ote+0.031
 glances+0.031
Neg logit contributions
B-0.025
P-0.023
st-0.023
L-0.023
One-0.023
TokenNow
Token ID3844
Feature activation+7.38e-03

Pos logit contributions
 stret+0.072
five+0.071
 fascination+0.070
leaf+0.070
 gasp+0.067
Neg logit contributions
ety-0.036
st-0.031
B-0.030
ll-0.029
 day-0.029
Token he
Token ID339
Feature activation+2.08e-03

Pos logit contributions
 fascination+0.022
 grin+0.021
ote+0.021
 stret+0.020
leaf+0.020
Neg logit contributions
 sometimes-0.005
B-0.005
st-0.005
L-0.004
Tim-0.004
Token is
Token ID318
Feature activation+2.26e-02

Pos logit contributions
 grin+0.005
 fascination+0.005
 stret+0.005
 expression+0.005
ote+0.005
Neg logit contributions
st-0.002
B-0.002
 sometimes-0.002
P-0.002
One-0.002
Token warm
Token ID5814
Feature activation+3.06e-02

Pos logit contributions
 fascination+0.050
 grin+0.050
ote+0.048
 glances+0.047
 stret+0.047
Neg logit contributions
B-0.030
P-0.029
st-0.028
One-0.028
 sometimes-0.028
Token and
Token ID290
Feature activation+3.22e-03

Pos logit contributions
 grin+0.072
 fascination+0.071
 stret+0.070
ote+0.068
 glances+0.068
Neg logit contributions
B-0.036
P-0.033
One-0.032
st-0.032
L-0.032
Token then
Token ID788
Feature activation-2.18e-03

Pos logit contributions
 fascination+0.036
 grin+0.035
 amaz+0.034
 expression+0.034
 anticipation+0.034
Neg logit contributions
B-0.023
P-0.022
st-0.020
Tim-0.019
One-0.019
Token,
Token ID11
Feature activation-6.14e-03

Pos logit contributions
B+0.003
P+0.003
st+0.003
Tim+0.003
One+0.003
Neg logit contributions
 grin-0.005
 fascination-0.004
 stret-0.004
 glances-0.004
ote-0.004
Token she
Token ID673
Feature activation-2.03e-02

Pos logit contributions
B+0.008
P+0.007
st+0.007
Tim+0.007
One+0.007
Neg logit contributions
 grin-0.014
 fascination-0.014
 stret-0.013
ote-0.013
 glances-0.013
Token made
Token ID925
Feature activation-4.79e-02

Pos logit contributions
B+0.028
st+0.026
P+0.026
Tim+0.025
One+0.025
Neg logit contributions
 grin-0.044
 fascination-0.044
 stret-0.042
ote-0.042
 glances-0.041
Token a
Token ID257
Feature activation-6.39e-03

Pos logit contributions
B+0.058
L+0.053
st+0.052
Tim+0.051
P+0.050
Neg logit contributions
 grin-0.112
 stret-0.110
 fascination-0.110
ote-0.108
 glances-0.105
Token mistake
Token ID7457
Feature activation+8.16e-03

Pos logit contributions
L+0.008
B+0.008
st+0.008
P+0.008
One+0.008
Neg logit contributions
oured-0.013
 glances-0.013
ote-0.013
 fascination-0.013
 grin-0.013
Token.
Token ID13
Feature activation+4.42e-03

Pos logit contributions
 fascination+0.021
 grin+0.021
 stret+0.021
ote+0.020
 glances+0.020
Neg logit contributions
B-0.008
st-0.007
L-0.007
 sometimes-0.007
Tim-0.007
Token She
Token ID1375
Feature activation+3.79e-02

Pos logit contributions
 grin+0.008
 fascination+0.008
 expression+0.007
 amaz+0.007
 anticipation+0.007
Neg logit contributions
B-0.008
st-0.008
P-0.008
ety-0.007
Tim-0.007
Token put
Token ID1234
Feature activation-9.37e-03

Pos logit contributions
 grin+0.091
 fascination+0.091
 stret+0.090
ote+0.088
 glances+0.086
Neg logit contributions
B-0.044
st-0.039
P-0.039
L-0.039
Tim-0.038
Token her
Token ID607
Feature activation+3.71e-02

Pos logit contributions
B+0.010
P+0.009
st+0.009
L+0.009
Tim+0.008
Neg logit contributions
 grin-0.024
 fascination-0.024
ote-0.023
 stret-0.023
 glances-0.022
Token coat
Token ID13209
Feature activation+5.89e-02

Pos logit contributions
 grin+0.057
 fascination+0.057
 stret+0.054
ote+0.053
 glances+0.052
Neg logit contributions
B-0.075
P-0.071
st-0.071
L-0.070
One-0.070
Token of
Token ID286
Feature activation-3.62e-02

Pos logit contributions
 grin+0.006
 stret+0.006
 fascination+0.006
 glances+0.006
ote+0.006
Neg logit contributions
B-0.004
P-0.003
ety-0.003
st-0.003
One-0.003
Token fun
Token ID1257
Feature activation+2.96e-02

Pos logit contributions
B+0.052
P+0.048
st+0.047
Tim+0.047
One+0.047
Neg logit contributions
 grin-0.078
 fascination-0.078
 stret-0.076
ote-0.075
 glances-0.073
Token.
Token ID13
Feature activation-1.77e-03

Pos logit contributions
 grin+0.056
 stret+0.055
 glances+0.053
 fascination+0.053
ote+0.051
Neg logit contributions
ety-0.048
B-0.045
P-0.044
L-0.044
One-0.043
Token When
Token ID1649
Feature activation+4.53e-02

Pos logit contributions
B+0.003
P+0.002
st+0.002
One+0.002
ety+0.002
Neg logit contributions
 grin-0.004
 fascination-0.004
 stret-0.004
ote-0.003
 glances-0.003
Token it
Token ID340
Feature activation+1.74e-02

Pos logit contributions
 grin+0.088
 fascination+0.085
 relief+0.083
 wow+0.083
 stret+0.083
Neg logit contributions
st-0.069
ety-0.068
P-0.068
B-0.067
One-0.062
Token was
Token ID373
Feature activation-1.52e-03

Pos logit contributions
 grin+0.038
 fascination+0.035
 widened+0.035
 gasp+0.035
 hi+0.034
Neg logit contributions
ety-0.021
B-0.021
P-0.021
One-0.020
st-0.019
Token time
Token ID640
Feature activation-5.40e-02

Pos logit contributions
B+0.002
P+0.001
st+0.001
L+0.001
One+0.001
Neg logit contributions
 grin-0.004
 fascination-0.004
 stret-0.004
ote-0.004
 glances-0.004
Token to
Token ID284
Feature activation+3.47e-02

Pos logit contributions
B+0.066
P+0.061
st+0.060
One+0.059
L+0.059
Neg logit contributions
 grin-0.127
 fascination-0.125
 stret-0.122
ote-0.120
 glances-0.119
Token go
Token ID467
Feature activation+2.12e-02

Pos logit contributions
 fascination+0.063
 grin+0.062
ote+0.061
 stret+0.061
 glances+0.059
Neg logit contributions
B-0.059
P-0.056
L-0.056
Tim-0.055
One-0.055
Token home
Token ID1363
Feature activation-2.15e-02

Pos logit contributions
 grin+0.043
 stret+0.041
 fascination+0.040
 hi+0.040
 glances+0.039
Neg logit contributions
B-0.032
P-0.031
One-0.030
st-0.029
L-0.029
Token,
Token ID11
Feature activation-2.81e-02

Pos logit contributions
 next+0.018
L+0.018
 sometimes+0.018
B+0.018
Tim+0.018
Neg logit contributions
 fascination-0.058
 stret-0.055
leaf-0.055
ote-0.055
 glances-0.054
Token pear
Token ID25286
Feature activation-4.14e-02

Pos logit contributions
 grin+0.015
 fascination+0.015
 stret+0.014
ote+0.014
 glances+0.014
Neg logit contributions
B-0.008
 day-0.007
 next-0.007
L-0.007
st-0.007
Token tree
Token ID5509
Feature activation-2.23e-02

Pos logit contributions
B+0.050
st+0.048
P+0.047
L+0.046
ety+0.045
Neg logit contributions
 fascination-0.097
 grin-0.095
 stret-0.094
ote-0.092
 hi-0.090
Token grew
Token ID6348
Feature activation-4.37e-02

Pos logit contributions
st+0.021
 sometimes+0.020
B+0.019
ety+0.019
L+0.018
Neg logit contributions
 fascination-0.058
 stret-0.058
 grin-0.056
ote-0.055
 hi-0.054
Token very
Token ID845
Feature activation+2.16e-02

Pos logit contributions
B+0.072
st+0.068
P+0.066
One+0.066
ety+0.066
Neg logit contributions
 grin-0.084
 fascination-0.084
 stret-0.082
ote-0.080
 glances-0.078
Token tall
Token ID7331
Feature activation+2.34e-02

Pos logit contributions
 stret+0.045
ote+0.044
 grin+0.044
 fascination+0.043
 glances+0.041
Neg logit contributions
B-0.032
One-0.030
L-0.030
st-0.030
ety-0.030
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
 fascination+0.065
 grin+0.065
 stret+0.064
ote+0.062
 hi+0.062
Neg logit contributions
st-0.019
B-0.018
 sometimes-0.018
 next-0.015
One-0.015
Token strong
Token ID1913
Feature activation-2.05e-02

Pos logit contributions
B+0.020
Tim+0.018
L+0.018
st+0.018
One+0.018
Neg logit contributions
 stret-0.026
 grin-0.025
 fascination-0.025
ote-0.024
 glances-0.024
Token.
Token ID13
Feature activation+2.56e-03

Pos logit contributions
 sometimes+0.016
st+0.016
B+0.015
 next+0.013
One+0.012
Neg logit contributions
 fascination-0.057
 grin-0.057
 stret-0.057
ote-0.055
 hi-0.054
Token The
Token ID383
Feature activation-1.08e-02

Pos logit contributions
 grin+0.005
 fascination+0.005
 stret+0.005
ote+0.005
 glances+0.005
Neg logit contributions
B-0.004
P-0.004
st-0.004
One-0.004
L-0.004
Token pe
Token ID613
Feature activation+6.67e-02

Pos logit contributions
B+0.016
P+0.015
st+0.015
L+0.014
One+0.014
Neg logit contributions
 grin-0.023
 fascination-0.023
 stret-0.022
ote-0.022
 glances-0.021
Tokenaches
Token ID3694
Feature activation+2.19e-02

Pos logit contributions
 grin+0.113
 fascination+0.112
 stret+0.107
ote+0.106
 glances+0.105
Neg logit contributions
B-0.125
L-0.115
Tim-0.115
P-0.115
st-0.113
Token in
Token ID287
Feature activation-2.40e-02

Pos logit contributions
B+0.012
ety+0.011
st+0.011
P+0.011
One+0.011
Neg logit contributions
 grin-0.020
 fascination-0.020
 stret-0.020
ote-0.019
 glances-0.019
Token the
Token ID262
Feature activation-2.62e-02

Pos logit contributions
ety+0.046
P+0.043
ll+0.042
st+0.042
B+0.041
Neg logit contributions
 fascination-0.039
 stret-0.037
 amaz-0.035
 anticipation-0.034
 menu-0.034
Token yard
Token ID12699
Feature activation-2.62e-02

Pos logit contributions
ety+0.043
B+0.043
 sometimes+0.042
st+0.042
ll+0.038
Neg logit contributions
 stret-0.045
 fascination-0.042
 grin-0.041
 hi-0.040
 menu-0.040
Token with
Token ID351
Feature activation+1.62e-02

Pos logit contributions
B+0.032
P+0.030
L+0.029
ety+0.029
st+0.028
Neg logit contributions
 fascination-0.060
 grin-0.059
 stret-0.057
 glances-0.056
ote-0.056
Token his
Token ID465
Feature activation+9.75e-03

Pos logit contributions
 fascination+0.030
 grin+0.029
 stret+0.029
 hi+0.028
 hello+0.028
Neg logit contributions
st-0.025
B-0.025
ety-0.024
P-0.023
L-0.023
Token friends
Token ID2460
Feature activation-5.81e-02

Pos logit contributions
 fascination+0.020
 grin+0.020
 stret+0.020
 hi+0.019
 hello+0.019
Neg logit contributions
B-0.013
st-0.013
One-0.012
P-0.012
L-0.012
Token.
Token ID13
Feature activation-2.45e-03

Pos logit contributions
B+0.084
ety+0.082
L+0.082
Tim+0.082
P+0.080
Neg logit contributions
 fascination-0.110
 grin-0.108
 stret-0.107
 glances-0.103
 hello-0.103
Token They
Token ID1119
Feature activation+2.91e-02

Pos logit contributions
st+0.004
P+0.004
B+0.004
ety+0.004
L+0.004
Neg logit contributions
 hello-0.004
 fascination-0.004
 hi-0.004
 grin-0.004
 stret-0.004
Token would
Token ID561
Feature activation-8.09e-04

Pos logit contributions
 fascination+0.059
 grin+0.059
 stret+0.057
 hi+0.055
 glances+0.054
Neg logit contributions
st-0.039
B-0.039
P-0.038
L-0.036
Tim-0.035
Token run
Token ID1057
Feature activation+2.00e-02

Pos logit contributions
B+0.001
One+0.001
Tim+0.001
 sometimes+0.001
P+0.001
Neg logit contributions
 grin-0.002
 fascination-0.002
ote-0.002
 stret-0.001
 glances-0.001
Token,
Token ID11
Feature activation-3.06e-02

Pos logit contributions
 grin+0.043
 fascination+0.042
 stret+0.042
ote+0.040
 glances+0.040
Neg logit contributions
B-0.028
P-0.026
st-0.026
L-0.025
One-0.025
Token bags
Token ID11668
Feature activation+5.39e-02

Pos logit contributions
B+0.097
st+0.095
ety+0.093
Tim+0.090
 sometimes+0.089
Neg logit contributions
 fascination-0.196
 grin-0.196
ote-0.187
 stret-0.185
oured-0.183
Token and
Token ID290
Feature activation+5.16e-03

Pos logit contributions
 grin+0.129
 fascination+0.129
 stret+0.126
ote+0.123
 glances+0.122
Neg logit contributions
B-0.062
st-0.056
P-0.056
L-0.055
One-0.055
Token take
Token ID1011
Feature activation-8.21e-03

Pos logit contributions
 grin+0.011
 fascination+0.011
 stret+0.011
ote+0.011
 glances+0.010
Neg logit contributions
B-0.007
st-0.007
P-0.007
Tim-0.006
L-0.006
Token their
Token ID511
Feature activation+3.11e-02

Pos logit contributions
B+0.013
P+0.012
st+0.012
L+0.012
Tim+0.012
Neg logit contributions
 grin-0.017
 fascination-0.017
ote-0.016
 stret-0.016
 expression-0.015
Token things
Token ID1243
Feature activation+3.54e-02

Pos logit contributions
 grin+0.063
 fascination+0.062
ote+0.059
 stret+0.057
 expression+0.056
Neg logit contributions
B-0.049
st-0.047
P-0.046
ety-0.045
L-0.045
Token.
Token ID13
Feature activation+4.25e-03

Pos logit contributions
 grin+0.080
 fascination+0.079
 stret+0.078
ote+0.076
 glances+0.075
Neg logit contributions
B-0.046
P-0.042
st-0.042
L-0.041
One-0.041
Token That
Token ID1320
Feature activation+2.99e-02

Pos logit contributions
 grin+0.008
 hi+0.008
 fascination+0.008
 hello+0.008
ote+0.007
Neg logit contributions
st-0.007
B-0.007
P-0.007
ety-0.006
One-0.006
Token is
Token ID318
Feature activation+2.37e-02

Pos logit contributions
 grin+0.067
 fascination+0.066
 stret+0.065
ote+0.063
 glances+0.063
Neg logit contributions
B-0.039
P-0.036
L-0.035
One-0.035
st-0.034
Token stealing
Token ID16568
Feature activation+3.84e-02

Pos logit contributions
 grin+0.052
 fascination+0.052
 stret+0.052
ote+0.050
 glances+0.049
Neg logit contributions
B-0.031
st-0.029
P-0.028
L-0.027
One-0.027
Token.
Token ID13
Feature activation-2.59e-03

Pos logit contributions
 fascination+0.104
 grin+0.103
ote+0.098
 stret+0.098
oured+0.095
Neg logit contributions
B-0.036
 sometimes-0.035
st-0.035
L-0.033
Tim-0.033
Token That
Token ID1320
Feature activation+3.08e-02

Pos logit contributions
st+0.005
B+0.004
ety+0.004
P+0.004
 day+0.004
Neg logit contributions
 hi-0.004
 hello-0.004
 grin-0.004
 fascination-0.004
 fries-0.004
Token surprised
Token ID6655
Feature activation+1.39e-02

Pos logit contributions
B+0.003
One+0.003
P+0.003
L+0.003
 sometimes+0.003
Neg logit contributions
 fascination-0.003
ote-0.003
 grin-0.003
 stret-0.003
icol-0.003
Token and
Token ID290
Feature activation+2.33e-03

Pos logit contributions
 grin+0.033
 fascination+0.032
 stret+0.032
ote+0.031
 glances+0.031
Neg logit contributions
B-0.016
P-0.015
st-0.015
One-0.014
Tim-0.014
Token happy
Token ID3772
Feature activation-5.42e-03

Pos logit contributions
 stret+0.005
 fascination+0.005
ote+0.005
latable+0.005
 grin+0.005
Neg logit contributions
L-0.003
B-0.003
 sometimes-0.003
Tim-0.003
 next-0.003
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
B+0.005
st+0.005
L+0.005
 sometimes+0.004
Tim+0.004
Neg logit contributions
 fascination-0.014
 stret-0.014
 grin-0.014
ote-0.014
 glances-0.013
Token She
Token ID1375
Feature activation+4.76e-02

Pos logit contributions
 grin+0.037
 fascination+0.036
 stret+0.035
ote+0.035
 hello+0.034
Neg logit contributions
B-0.024
st-0.023
P-0.023
One-0.022
Tim-0.022
Token had
Token ID550
Feature activation-1.31e-01

Pos logit contributions
 fascination+0.107
 stret+0.105
 grin+0.104
ote+0.101
 glances+0.099
Neg logit contributions
B-0.065
L-0.062
P-0.059
Tim-0.058
st-0.057
Token found
Token ID1043
Feature activation-2.22e-02

Pos logit contributions
B+0.210
st+0.195
ety+0.190
L+0.188
P+0.188
Neg logit contributions
 grin-0.254
 fascination-0.250
 stret-0.241
ote-0.234
 glances-0.234
Token a
Token ID257
Feature activation+6.18e-03

Pos logit contributions
B+0.022
L+0.020
st+0.020
 sometimes+0.020
P+0.019
Neg logit contributions
 grin-0.059
 stret-0.057
 fascination-0.056
ote-0.056
 vows-0.053
Token friend
Token ID1545
Feature activation-8.45e-02

Pos logit contributions
 fascination+0.013
 grin+0.013
ote+0.012
 stret+0.012
 glances+0.012
Neg logit contributions
B-0.009
st-0.009
L-0.008
P-0.008
Tim-0.008
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
B+0.089
st+0.084
L+0.080
P+0.080
Tim+0.079
Neg logit contributions
 fascination-0.215
 grin-0.212
 stret-0.205
ote-0.202
 glances-0.200
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 grin+0.024
 fascination+0.023
 stret+0.023
 hi+0.022
 hello+0.022
Neg logit contributions
B-0.015
st-0.014
P-0.014
ety-0.013
One-0.013
Token two
Token ID734
Feature activation-1.16e-01

Pos logit contributions
B+0.038
 day+0.037
Tim+0.036
 next+0.036
st+0.036
Neg logit contributions
 grin-0.056
 fascination-0.055
 glances-0.053
oured-0.053
ote-0.053
Token friends
Token ID2460
Feature activation-4.81e-02

Pos logit contributions
B+0.124
st+0.114
 sometimes+0.113
One+0.109
 day+0.108
Neg logit contributions
icol-0.281
 stret-0.278
ote-0.277
 grin-0.276
 fascination-0.275
Token who
Token ID508
Feature activation+2.19e-02

Pos logit contributions
st+0.048
B+0.047
 sometimes+0.047
ety+0.043
One+0.042
Neg logit contributions
 fascination-0.122
 grin-0.118
 stret-0.117
ote-0.116
 expression-0.113
Token wanted
Token ID2227
Feature activation-1.32e-02

Pos logit contributions
 stret+0.060
ote+0.058
 grin+0.058
 fascination+0.058
icol+0.057
Neg logit contributions
 sometimes-0.020
B-0.018
st-0.018
I-0.017
L-0.017
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
B+0.010
 sometimes+0.009
One+0.009
st+0.009
Tim+0.009
Neg logit contributions
 grin-0.038
 fascination-0.037
 stret-0.037
ote-0.036
icol-0.035
Token use
Token ID779
Feature activation-1.19e-01

Pos logit contributions
 fascination+0.022
icol+0.020
ote+0.020
 grin+0.020
 stret+0.020
Neg logit contributions
Tim-0.015
One-0.015
B-0.014
 sometimes-0.014
L-0.013
Token it
Token ID340
Feature activation-2.28e-02

Pos logit contributions
B+0.116
 sometimes+0.109
st+0.108
One+0.105
Tim+0.102
Neg logit contributions
 grin-0.305
ote-0.300
 fascination-0.294
 stret-0.290
icol-0.279
Token.
Token ID13
Feature activation+3.67e-03

Pos logit contributions
B+0.025
st+0.023
One+0.022
P+0.022
Tim+0.022
Neg logit contributions
 grin-0.057
 fascination-0.057
 stret-0.055
ote-0.054
 glances-0.053
Token
Token ID198
Feature activation-2.23e-02

Pos logit contributions
 fascination+0.008
 grin+0.008
 stret+0.007
 hi+0.007
ote+0.007
Neg logit contributions
B-0.005
st-0.005
P-0.005
ety-0.004
L-0.004
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
ety+0.023
st+0.022
 sometimes+0.019
P+0.019
B+0.017
Neg logit contributions
 fascination-0.052
 stret-0.051
 grin-0.049
 hi-0.048
 glances-0.048
Token"
Token ID1
Feature activation-3.02e-02

Pos logit contributions
ety+0.017
st+0.016
 sometimes+0.012
 day+0.011
B+0.011
Neg logit contributions
 fascination-0.049
 stret-0.048
 grin-0.046
leaf-0.045
 hi-0.045
Token was
Token ID373
Feature activation-7.26e-03

Pos logit contributions
B+0.037
P+0.037
ety+0.035
st+0.034
One+0.033
Neg logit contributions
 fascination-0.065
 grin-0.063
 amaz-0.060
 gasp-0.059
 hi-0.059
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
st+0.013
ety+0.013
B+0.012
ll+0.012
One+0.011
Neg logit contributions
 fascination-0.011
 stret-0.011
 grin-0.010
 menu-0.010
 hi-0.010
Token girl
Token ID2576
Feature activation-3.09e-02

Pos logit contributions
 grin+0.028
 fascination+0.027
 stret+0.027
ote+0.026
 glances+0.026
Neg logit contributions
B-0.017
P-0.015
st-0.015
L-0.015
Tim-0.015
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
B+0.058
Next+0.058
st+0.055
ety+0.054
Sam+0.054
Neg logit contributions
 stret-0.043
 grin-0.041
 hello-0.040
 fascination-0.040
 hi-0.038
Token She
Token ID1375
Feature activation+4.70e-02

Pos logit contributions
 grin+0.044
 fascination+0.042
 hello+0.040
 hi+0.039
 expression+0.039
Neg logit contributions
B-0.035
P-0.035
st-0.034
ety-0.031
Tim-0.031
Token had
Token ID550
Feature activation-1.29e-01

Pos logit contributions
 grin+0.076
 fascination+0.071
 stret+0.071
 hello+0.071
 vows+0.070
Neg logit contributions
ety-0.078
st-0.077
B-0.076
One-0.074
Tim-0.071
Token a
Token ID257
Feature activation+8.80e-03

Pos logit contributions
st+0.237
B+0.234
ety+0.226
ll+0.223
Tim+0.215
Neg logit contributions
 hello-0.191
 hi-0.189
 stret-0.188
 fascination-0.187
 grin-0.181
Token little
Token ID1310
Feature activation-6.70e-02

Pos logit contributions
 fascination+0.019
 grin+0.019
 stret+0.019
ote+0.019
 glances+0.019
Neg logit contributions
B-0.012
P-0.011
L-0.011
st-0.011
One-0.011
Token dog
Token ID3290
Feature activation+3.16e-02

Pos logit contributions
B+0.063
st+0.060
 day+0.059
P+0.058
ety+0.057
Neg logit contributions
ote-0.171
 stret-0.167
 fascination-0.167
 grin-0.167
atively-0.165
Token that
Token ID326
Feature activation+1.31e-02

Pos logit contributions
 grin+0.065
 stret+0.064
 fascination+0.064
ote+0.063
 glances+0.061
Neg logit contributions
B-0.045
P-0.042
st-0.042
One-0.041
Tim-0.041
Token she
Token ID673
Feature activation-7.74e-03

Pos logit contributions
 grin+0.031
 fascination+0.030
 stret+0.030
ote+0.029
 glances+0.029
Neg logit contributions
B-0.016
P-0.015
st-0.015
One-0.014
L-0.014
Token out
Token ID503
Feature activation+9.90e-02

Pos logit contributions
st+0.055
 sometimes+0.051
 next+0.049
B+0.047
ety+0.045
Neg logit contributions
 fascination-0.138
 grin-0.138
 stret-0.136
ote-0.132
leaf-0.132
Token.
Token ID13
Feature activation+1.77e-03

Pos logit contributions
 grin+0.259
 stret+0.258
 fascination+0.257
ote+0.251
 glances+0.243
Neg logit contributions
B-0.093
st-0.089
 sometimes-0.081
One-0.081
L-0.078
Token The
Token ID383
Feature activation-6.60e-03

Pos logit contributions
 grin+0.004
 fascination+0.004
 stret+0.004
 glances+0.004
ote+0.004
Neg logit contributions
B-0.002
L-0.002
Tim-0.002
One-0.002
st-0.002
Token rabbit
Token ID22746
Feature activation-2.16e-02

Pos logit contributions
 next+0.009
B+0.009
 sometimes+0.008
 day+0.008
One+0.008
Neg logit contributions
 grin-0.014
oured-0.014
ote-0.014
icol-0.014
 fascination-0.014
Token had
Token ID550
Feature activation-1.35e-01

Pos logit contributions
B+0.023
st+0.020
L+0.019
 sometimes+0.019
One+0.019
Neg logit contributions
 stret-0.055
 fascination-0.054
 grin-0.054
ote-0.052
 glances-0.051
Token used
Token ID973
Feature activation-1.36e-01

Pos logit contributions
B+0.178
st+0.166
P+0.163
One+0.162
L+0.159
Neg logit contributions
 grin-0.300
 fascination-0.298
 stret-0.296
ote-0.287
 glances-0.282
Token his
Token ID465
Feature activation+9.12e-03

Pos logit contributions
B+0.119
st+0.116
 sometimes+0.111
One+0.099
Tim+0.098
Neg logit contributions
 grin-0.355
 stret-0.354
ote-0.351
 fascination-0.336
icol-0.332
Token wisdom
Token ID11501
Feature activation+3.01e-02

Pos logit contributions
 grin+0.019
ote+0.018
oured+0.018
 fascination+0.017
 stret+0.017
Neg logit contributions
 day-0.013
st-0.013
 next-0.012
One-0.012
B-0.012
Token to
Token ID284
Feature activation+2.10e-02

Pos logit contributions
 stret+0.087
 grin+0.086
 fascination+0.085
ote+0.082
 hi+0.082
Neg logit contributions
st-0.022
B-0.019
 sometimes-0.017
ety-0.015
One-0.015
Token figure
Token ID3785
Feature activation-8.82e-02

Pos logit contributions
icol+0.041
 stret+0.040
ote+0.040
oured+0.040
 grin+0.040
Neg logit contributions
 sometimes-0.032
st-0.031
One-0.030
 next-0.030
 day-0.030
Token out
Token ID503
Feature activation+9.40e-02

Pos logit contributions
st+0.123
 sometimes+0.120
 next+0.114
B+0.112
"+0.111
Neg logit contributions
 grin-0.188
icol-0.184
latable-0.178
 stret-0.175
 fascination-0.174
Token wears
Token ID17326
Feature activation-1.84e-02

Pos logit contributions
B+0.107
st+0.099
L+0.096
 sometimes+0.095
ety+0.094
Neg logit contributions
 stret-0.220
 fascination-0.218
 grin-0.216
ote-0.214
 glances-0.209
Token it
Token ID340
Feature activation-4.23e-04

Pos logit contributions
B+0.019
L+0.017
P+0.017
One+0.017
st+0.017
Neg logit contributions
 stret-0.046
 fascination-0.046
 grin-0.046
ote-0.046
 glances-0.043
Token today
Token ID1909
Feature activation-1.47e-02

Pos logit contributions
B+0.000
st+0.000
One+0.000
P+0.000
L+0.000
Neg logit contributions
 grin-0.001
 fascination-0.001
 stret-0.001
ote-0.001
 glances-0.001
Token.
Token ID13
Feature activation+5.12e-03

Pos logit contributions
B+0.014
st+0.014
L+0.013
 sometimes+0.013
One+0.013
Neg logit contributions
 stret-0.038
 fascination-0.038
 grin-0.038
ote-0.036
 glances-0.036
Token Where
Token ID6350
Feature activation+7.04e-02

Pos logit contributions
 grin+0.010
 fascination+0.010
 hi+0.009
 amaz+0.009
 expression+0.009
Neg logit contributions
B-0.008
P-0.008
st-0.008
ety-0.008
Tim-0.008
Tokenver
Token ID332
Feature activation-1.09e-01

Pos logit contributions
 grin+0.153
 fascination+0.152
 stret+0.152
ote+0.147
 glances+0.143
Neg logit contributions
B-0.097
st-0.090
P-0.090
L-0.089
One-0.088
Token she
Token ID673
Feature activation-4.21e-02

Pos logit contributions
B+0.121
P+0.110
st+0.107
L+0.105
One+0.104
Neg logit contributions
 grin-0.265
 fascination-0.265
 stret-0.258
ote-0.254
 glances-0.252
Token goes
Token ID2925
Feature activation+2.72e-02

Pos logit contributions
B+0.058
st+0.053
P+0.053
L+0.052
One+0.052
Neg logit contributions
 grin-0.092
 fascination-0.091
 stret-0.090
ote-0.087
 glances-0.086
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
 fascination+0.064
 grin+0.064
 stret+0.062
ote+0.061
 glances+0.060
Neg logit contributions
B-0.033
st-0.031
P-0.030
L-0.030
One-0.029
Token it
Token ID340
Feature activation-5.47e-05

Pos logit contributions
B+0.019
 sometimes+0.018
st+0.017
L+0.017
P+0.016
Neg logit contributions
 stret-0.057
 grin-0.056
 fascination-0.056
ote-0.055
 glances-0.054
Token reminds
Token ID17603
Feature activation-5.87e-02

Pos logit contributions
B+0.000
P+0.000
Tim+0.000
st+0.000
L+0.000
Neg logit contributions
 grin-0.000
 fascination-0.000
 stret-0.000
ote-0.000
 glances-0.000