TOP ACTIVATIONS
MAX = 10.723

because of his big bus hy mustache . One
. He had a bus hy tail and a small nose
had a big , bus hy tail and sharp teeth .
had a long , bus hy tail and liked to play
spotted a big , bus hy tree . He hopped closer
and had a big bus hy tail . Jack
small , has a bus hy tail , and likes to
with a long , bus hy tail . It was a
had a big , bus hy tail and fur that was
big nose , a bus hy beard , and a mean
had bright eyes and bus hy hair . He always smiled
and a big , bus hy beard . He wore a
. He sees they are twins . He sees they are
, there was a beautiful tale about a brave prince .
kingdom . This is the tale of the daring prince !
, he is playing a sax ophone . It is a
Maybe it 's a fairy tale ." M ia
decided to perform a fairy tale . She put on a
spaghetti . She sees happy twins . <|endoftext|> L ily and
she would write a new tale about ships and sailors .

MIN ACTIVATIONS
MIN = -13.434

to work . Hours later , the prince and
do . Hours passed and Lily 's parents
alone . Hours passed , but nobody came
baseball returned . Hours later , the sun had
wrong . Hours passed and the animal looked
down and fell asleep . Hours later it woke up and
it was smooth again . Hours and hours went by ,
the door would open . Hours passed , but no -
appear from the clay . Hours passed , and eventually the
help you . â € First , Mom taught Mary how
He said that no - one should fight and that Mitt
their families . They call 911 . They hope it is
said don 't worry . First , we need to soak
ies â € � � one shiny , one dull â
€ ™ s plan . First , they looked for a
to pretend to cook . First , Mary picked up a
He thanked the an em one and then hurried back up
reverse around the house . First he slowly backed away from
decided to find out . First , she decided to make
to feed the girl . First , she gave her the

INTERVAL 4.684 - 10.723
CONTAINS 1.167%

what it was going to do next .
until it looked like a dog . He gave it to
witch . They hugged and cheered . They continued to play
go , Tim . Your mask is good as new .
see the joy on their faces . They laughed and joked

INTERVAL -1.356 - 4.684
CONTAINS 75.638%

in a hotel !" Once they were in the
He looks at Lily 's paper . She has
; it is never a good idea to take something that
march so much . He laughed and smiled and felt the
his mouth . The man is wearing a blue shirt and

INTERVAL -7.395 - -1.356
CONTAINS 22.556%

, " Now you can measure all kinds of things !"
, laughing and tw irling all day with his new cloud
so lovely . John was very happy that he
opened her eyes , she started to display serious worry as
shouted at Ben . Ben ran away . He did not

INTERVAL -13.434 - -7.395
CONTAINS 0.640%

to look for a bigger one , so she did her
outside in her garden . One day , she found a
important . As they were looking at the
. L ily reaches the top of the hill
sl umber . While the little boy was sleeping
Token because
Token ID780
Feature activation+8.80e-01

Pos logit contributions
's+0.013
 playing+0.013
 that+0.013
 about+0.012
 doing+0.012
Neg logit contributions
 reins-0.025
 redd-0.023
Hours-0.023
 lowest-0.023
rated-0.023
Token of
Token ID286
Feature activation+3.38e+00

Pos logit contributions
's+0.022
 that+0.022
 playing+0.022
,+0.019
 doing+0.019
Neg logit contributions
 reins-0.052
 redd-0.049
Hours-0.049
 lowest-0.048
rated-0.048
Token his
Token ID465
Feature activation-1.45e+00

Pos logit contributions
's+0.097
 playing+0.090
 that+0.090
 about+0.082
e+0.081
Neg logit contributions
 reins-0.186
 redd-0.175
Hours-0.172
 lowest-0.170
ifi-0.167
Token big
Token ID1263
Feature activation+7.27e-01

Pos logit contributions
 reins+0.068
 redd+0.063
Hours+0.063
rated+0.061
 lowest+0.060
Neg logit contributions
's-0.055
 playing-0.054
 that-0.054
,-0.050
 doing-0.049
Token bus
Token ID1323
Feature activation-2.71e+00

Pos logit contributions
's+0.025
 playing+0.025
 that+0.024
,+0.023
 doing+0.022
Neg logit contributions
 reins-0.037
Hours-0.035
 redd-0.034
rated-0.033
 lowest-0.033
Tokenhy
Token ID12114
Feature activation+1.05e+01

Pos logit contributions
 reins+0.172
 lowest+0.161
rated+0.159
Hours+0.158
 redd+0.157
Neg logit contributions
 playing-0.045
's-0.040
 doing-0.038
 wearing-0.037
 that-0.037
Token mustache
Token ID49303
Feature activation+1.42e+00

Pos logit contributions
's+0.336
 playing+0.329
 that+0.324
 doing+0.303
 wearing+0.300
Neg logit contributions
 reins-0.544
 redd-0.485
Hours-0.483
 lowest-0.483
rated-0.471
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
 playing+0.023
's+0.021
 that+0.020
 doing+0.019
 wearing+0.019
Neg logit contributions
 reins-0.094
 lowest-0.087
 redd-0.086
rated-0.086
Hours-0.085
Token
Token ID198
Feature activation+6.97e-01

Pos logit contributions
's+0.017
 playing+0.016
 that+0.016
,+0.015
 about+0.014
Neg logit contributions
 reins-0.044
rated-0.042
 redd-0.042
Hours-0.041
 lowest-0.040
Token
Token ID198
Feature activation+6.10e-01

Pos logit contributions
's+0.007
 that+0.007
 playing+0.006
 about+0.005
,+0.004
Neg logit contributions
 reins-0.053
 redd-0.052
rated-0.051
 lowest-0.050
Hours-0.049
TokenOne
Token ID3198
Feature activation-1.10e+01

Pos logit contributions
's+0.025
 that+0.024
 playing+0.024
e+0.023
 lived+0.022
Neg logit contributions
 reins-0.028
rated-0.027
 redd-0.027
 lowest-0.026
ried-0.024
Token.
Token ID13
Feature activation+7.88e-01

Pos logit contributions
e+0.059
 lived+0.056
 playing+0.054
's+0.051
 living+0.051
Neg logit contributions
 reins-0.184
 lowest-0.171
 tighter-0.171
rated-0.170
Hours-0.168
Token He
Token ID679
Feature activation-4.26e-01

Pos logit contributions
's+0.018
 that+0.018
 playing+0.017
,+0.016
 about+0.015
Neg logit contributions
 reins-0.050
 redd-0.046
Hours-0.046
rated-0.046
 lowest-0.045
Token had
Token ID550
Feature activation+4.12e+00

Pos logit contributions
 reins+0.025
 redd+0.023
Hours+0.023
 lowest+0.023
rated+0.022
Neg logit contributions
's-0.011
 playing-0.011
 that-0.011
 doing-0.010
 wearing-0.009
Token a
Token ID257
Feature activation-8.51e-01

Pos logit contributions
's+0.119
 that+0.118
 playing+0.116
 doing+0.102
 about+0.102
Neg logit contributions
 reins-0.240
 redd-0.221
rated-0.221
Hours-0.220
ifi-0.218
Token bus
Token ID1323
Feature activation-2.93e+00

Pos logit contributions
 reins+0.043
Hours+0.040
 redd+0.039
rated+0.039
ifi+0.038
Neg logit contributions
's-0.030
 playing-0.029
 that-0.029
,-0.027
 doing-0.027
Tokenhy
Token ID12114
Feature activation+1.02e+01

Pos logit contributions
 reins+0.152
 lowest+0.139
 tighter+0.134
Hours+0.133
 redd+0.132
Neg logit contributions
 playing-0.082
 lived-0.081
 Playing-0.078
's-0.075
 doing-0.075
Token tail
Token ID7894
Feature activation+2.68e+00

Pos logit contributions
's+0.397
 playing+0.381
 that+0.380
 living+0.354
 doing+0.352
Neg logit contributions
 reins-0.467
 redd-0.425
Hours-0.417
 lowest-0.417
ifi-0.403
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 lived+0.042
's+0.042
 playing+0.040
 doing+0.039
 Playing+0.038
Neg logit contributions
 reins-0.169
 lowest-0.153
 redd-0.153
 tighter-0.150
Hours-0.149
Token a
Token ID257
Feature activation-8.97e-01

Pos logit contributions
's+0.001
 playing+0.001
 that+0.001
,+0.000
 doing+0.000
Neg logit contributions
 reins-0.001
Hours-0.001
 redd-0.001
rated-0.001
 lowest-0.001
Token small
Token ID1402
Feature activation-1.49e+00

Pos logit contributions
 reins+0.042
Hours+0.039
 redd+0.039
rated+0.038
ifi+0.037
Neg logit contributions
's-0.034
 playing-0.033
 that-0.033
,-0.031
 doing-0.031
Token nose
Token ID9686
Feature activation+1.98e+00

Pos logit contributions
 reins+0.070
Hours+0.067
 redd+0.065
rated+0.063
 lowest+0.063
Neg logit contributions
's-0.056
 playing-0.055
 that-0.054
,-0.052
 doing-0.050
Token had
Token ID550
Feature activation+4.15e+00

Pos logit contributions
 reins+0.027
Hours+0.027
ifi+0.026
E+0.026
Lex+0.026
Neg logit contributions
apping-0.010
 about-0.010
 sharing-0.010
 playing-0.009
's-0.009
Token a
Token ID257
Feature activation-9.03e-01

Pos logit contributions
's+0.114
 playing+0.112
 that+0.111
,+0.109
 sharing+0.107
Neg logit contributions
 reins-0.225
 redd-0.209
Hours-0.209
 lowest-0.202
Lex-0.201
Token big
Token ID1263
Feature activation+8.47e-01

Pos logit contributions
 reins+0.047
Hours+0.044
 redd+0.044
rated+0.044
 lowest+0.043
Neg logit contributions
's-0.029
 playing-0.029
 that-0.029
,-0.026
 doing-0.026
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
's+0.027
 that+0.026
 playing+0.026
 about+0.023
 doing+0.023
Neg logit contributions
 reins-0.045
 redd-0.042
Hours-0.041
 lowest-0.041
rated-0.040
Token bus
Token ID1323
Feature activation-3.00e+00

Pos logit contributions
's+0.064
 that+0.063
 playing+0.063
 about+0.058
 doing+0.058
Neg logit contributions
 reins-0.073
 redd-0.068
Hours-0.066
 lowest-0.065
rated-0.063
Tokenhy
Token ID12114
Feature activation+1.01e+01

Pos logit contributions
 reins+0.140
 fare+0.134
 hinges+0.132
 ladder+0.129
 stool+0.129
Neg logit contributions
 doing-0.071
 acting-0.063
's-0.063
 playing-0.061
e-0.060
Token tail
Token ID7894
Feature activation+2.73e+00

Pos logit contributions
's+0.376
 that+0.357
 playing+0.357
 doing+0.339
 wearing+0.337
Neg logit contributions
 reins-0.462
 lowest-0.412
 redd-0.409
Hours-0.408
Lex-0.401
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
 doing+0.054
's+0.048
e+0.047
 sharing+0.046
 Playing+0.046
Neg logit contributions
 reins-0.154
 lowest-0.137
 redd-0.137
 stool-0.133
Hours-0.132
Token sharp
Token ID7786
Feature activation-2.42e+00

Pos logit contributions
 reins+0.001
Hours+0.001
 redd+0.001
rated+0.001
ifi+0.001
Neg logit contributions
's-0.001
 playing-0.001
 that-0.001
,-0.001
 doing-0.001
Token teeth
Token ID9941
Feature activation-1.12e+00

Pos logit contributions
 reins+0.099
 redd+0.087
 lowest+0.085
Hours+0.085
rated+0.085
Neg logit contributions
's-0.105
 playing-0.103
 that-0.103
 doing-0.098
 wearing-0.097
Token.
Token ID13
Feature activation+8.07e-01

Pos logit contributions
 reins+0.058
 lowest+0.053
Hours+0.052
 redd+0.051
ge+0.051
Neg logit contributions
's-0.026
 Friends-0.025
e-0.024
 lived-0.023
 playing-0.023
Token had
Token ID550
Feature activation+4.05e+00

Pos logit contributions
Lex+0.050
 reins+0.049
ried+0.049
bi+0.048
ifi+0.048
Neg logit contributions
e-0.022
 doing-0.020
 Friends-0.019
 Playing-0.019
 sharing-0.019
Token a
Token ID257
Feature activation-9.66e-01

Pos logit contributions
's+0.108
 that+0.104
 playing+0.103
,+0.098
 doing+0.097
Neg logit contributions
 reins-0.225
 redd-0.212
Hours-0.207
 lowest-0.204
Lex-0.200
Token long
Token ID890
Feature activation+6.30e-01

Pos logit contributions
 reins+0.048
 redd+0.045
Hours+0.044
 lowest+0.043
rated+0.043
Neg logit contributions
's-0.033
 that-0.033
 playing-0.033
 doing-0.030
 about-0.030
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
's+0.023
 that+0.023
 playing+0.022
 about+0.021
 doing+0.021
Neg logit contributions
 reins-0.030
 redd-0.027
Hours-0.026
ifi-0.026
rated-0.026
Token bus
Token ID1323
Feature activation-3.01e+00

Pos logit contributions
's+0.065
 that+0.064
 playing+0.063
e+0.060
 doing+0.060
Neg logit contributions
 reins-0.067
 redd-0.065
 lowest-0.059
Hours-0.058
rated-0.056
Tokenhy
Token ID12114
Feature activation+1.00e+01

Pos logit contributions
 reins+0.140
 fare+0.129
 hinges+0.128
ried+0.127
 lowest+0.125
Neg logit contributions
e-0.090
 doing-0.086
's-0.085
 playing-0.082
 sharing-0.079
Token tail
Token ID7894
Feature activation+2.67e+00

Pos logit contributions
's+0.388
 that+0.369
 playing+0.361
 doing+0.352
 of+0.351
Neg logit contributions
 reins-0.454
 redd-0.402
 lowest-0.401
Hours-0.385
Lex-0.381
Token and
Token ID290
Feature activation-5.88e-02

Pos logit contributions
e+0.047
's+0.047
 doing+0.047
 lived+0.042
 Friends+0.042
Neg logit contributions
 reins-0.159
 lowest-0.141
 redd-0.140
 tighter-0.138
 stool-0.137
Token liked
Token ID8288
Feature activation-1.51e-01

Pos logit contributions
 reins+0.003
 redd+0.003
Hours+0.003
 lowest+0.003
rated+0.003
Neg logit contributions
's-0.001
 playing-0.001
 that-0.001
 doing-0.001
 about-0.001
Token to
Token ID284
Feature activation-9.13e-01

Pos logit contributions
 reins+0.008
 redd+0.008
Hours+0.008
 lowest+0.008
ifi+0.007
Neg logit contributions
's-0.004
e-0.004
 playing-0.004
 that-0.004
 lived-0.004
Token play
Token ID711
Feature activation+4.23e+00

Pos logit contributions
 reins+0.055
 redd+0.053
Hours+0.053
rated+0.051
 lowest+0.051
Neg logit contributions
's-0.022
 playing-0.022
 that-0.022
,-0.020
 about-0.019
Token spotted
Token ID13489
Feature activation-1.68e+00

Pos logit contributions
 playing+0.007
 that+0.007
 doing+0.007
 sharing+0.007
 about+0.007
Neg logit contributions
 reins-0.011
Hours-0.010
 redd-0.010
rated-0.010
Lex-0.010
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
 reins+0.089
 redd+0.084
Hours+0.082
 lowest+0.082
Lex+0.080
Neg logit contributions
's-0.049
 playing-0.046
 doing-0.043
 that-0.043
 wearing-0.043
Token big
Token ID1263
Feature activation+5.20e-01

Pos logit contributions
 reins+0.064
 redd+0.059
Hours+0.058
 lowest+0.057
rated+0.056
Neg logit contributions
's-0.050
 playing-0.049
 that-0.049
 doing-0.045
 about-0.045
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
's+0.019
 playing+0.018
 that+0.018
 about+0.017
e+0.017
Neg logit contributions
 reins-0.024
 redd-0.022
 lowest-0.022
ifi-0.021
 stool-0.021
Token bus
Token ID1323
Feature activation-3.09e+00

Pos logit contributions
's+0.050
 that+0.049
 playing+0.049
 about+0.045
 doing+0.045
Neg logit contributions
 reins-0.053
 redd-0.050
Hours-0.048
 lowest-0.048
rated-0.047
Tokenhy
Token ID12114
Feature activation+1.00e+01

Pos logit contributions
 reins+0.143
 lowest+0.129
 stool+0.126
 fare+0.126
ried+0.119
Neg logit contributions
e-0.093
 lived-0.089
 playing-0.089
 sharing-0.088
 Playing-0.088
Token tree
Token ID5509
Feature activation-1.97e+00

Pos logit contributions
's+0.411
 playing+0.390
 that+0.386
 of+0.380
 about+0.378
Neg logit contributions
 reins-0.432
 lowest-0.384
 redd-0.375
Lex-0.366
Hours-0.354
Token.
Token ID13
Feature activation+6.15e-01

Pos logit contributions
 reins+0.114
 lowest+0.106
 tighter+0.105
 redd+0.105
 stool+0.104
Neg logit contributions
 lived-0.040
 Playing-0.037
 Friends-0.036
 playing-0.035
e-0.035
Token He
Token ID679
Feature activation-7.40e-01

Pos logit contributions
's+0.015
 playing+0.015
 that+0.014
 doing+0.013
 about+0.013
Neg logit contributions
 reins-0.037
 redd-0.035
Hours-0.035
 lowest-0.034
rated-0.033
Token hopped
Token ID45230
Feature activation-3.29e+00

Pos logit contributions
 reins+0.043
 redd+0.041
Hours+0.040
 lowest+0.039
rated+0.039
Neg logit contributions
's-0.020
 that-0.019
 playing-0.019
,-0.017
 wearing-0.017
Token closer
Token ID5699
Feature activation-7.06e+00

Pos logit contributions
 reins+0.173
 redd+0.164
Hours+0.162
rated+0.159
 lowest+0.158
Neg logit contributions
's-0.105
 that-0.105
 playing-0.103
,-0.096
 about-0.094
Token and
Token ID290
Feature activation-4.33e-02

Pos logit contributions
 reins+0.030
 redd+0.028
 lowest+0.028
 hinges+0.027
ifi+0.027
Neg logit contributions
's-0.008
e-0.008
 lived-0.007
 doing-0.007
 playing-0.007
Token had
Token ID550
Feature activation+4.04e+00

Pos logit contributions
 redd+0.002
 reins+0.002
 lowest+0.002
Hours+0.002
rated+0.002
Neg logit contributions
's-0.001
 playing-0.001
e-0.001
 about-0.001
 that-0.001
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
's+0.115
 that+0.112
 playing+0.109
 about+0.097
 living+0.096
Neg logit contributions
 reins-0.229
rated-0.215
Hours-0.215
 redd-0.214
ifi-0.211
Token big
Token ID1263
Feature activation+7.30e-01

Pos logit contributions
 reins+0.049
Hours+0.045
 redd+0.045
rated+0.044
 lowest+0.043
Neg logit contributions
's-0.042
 playing-0.041
 that-0.041
,-0.038
 doing-0.038
Token bus
Token ID1323
Feature activation-3.00e+00

Pos logit contributions
's+0.023
 that+0.023
 playing+0.022
 about+0.021
 living+0.020
Neg logit contributions
 reins-0.038
 redd-0.036
 lowest-0.035
Hours-0.034
rated-0.034
Tokenhy
Token ID12114
Feature activation+1.00e+01

Pos logit contributions
 reins+0.144
 lowest+0.139
 undone+0.130
 fare+0.128
 redd+0.128
Neg logit contributions
e-0.081
 playing-0.080
 Playing-0.078
 lived-0.078
 Friends-0.078
Token tail
Token ID7894
Feature activation+2.63e+00

Pos logit contributions
's+0.386
 that+0.367
 playing+0.366
 living+0.356
 about+0.345
Neg logit contributions
 reins-0.458
 lowest-0.414
 redd-0.410
Lex-0.390
Hours-0.390
Token.
Token ID13
Feature activation+7.78e-01

Pos logit contributions
's+0.044
 playing+0.041
e+0.040
 lived+0.040
 doing+0.039
Neg logit contributions
 reins-0.163
 lowest-0.151
 redd-0.150
Hours-0.144
rated-0.143
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
's+0.017
 that+0.016
 playing+0.016
 doing+0.015
e+0.014
Neg logit contributions
 reins-0.047
 redd-0.046
Hours-0.045
 lowest-0.044
rated-0.044
Token
Token ID198
Feature activation+3.60e-01

Pos logit contributions
's+0.006
 that+0.005
 playing+0.005
,+0.004
 about+0.004
Neg logit contributions
 reins-0.030
 redd-0.028
Hours-0.028
rated-0.028
 lowest-0.028
TokenJack
Token ID14295
Feature activation+6.12e-01

Pos logit contributions
's+0.012
 that+0.011
 playing+0.011
,+0.011
 of+0.010
Neg logit contributions
 reins-0.020
 redd-0.018
 lowest-0.018
rated-0.017
ifi-0.017
Token small
Token ID1402
Feature activation-1.43e+00

Pos logit contributions
's+0.065
e+0.058
 were+0.057
 Friends+0.054
 of+0.054
Neg logit contributions
 reins-0.127
 redd-0.127
Lex-0.124
ifi-0.121
 lowest-0.120
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 reins+0.089
 redd+0.083
rated+0.083
 lowest+0.081
ifi+0.080
Neg logit contributions
's-0.026
 playing-0.025
 lived-0.023
 that-0.023
e-0.022
Token has
Token ID468
Feature activation+3.33e+00

Pos logit contributions
's+0.058
 playing+0.058
 that+0.056
 doing+0.053
 wearing+0.052
Neg logit contributions
 reins-0.076
 redd-0.072
Hours-0.069
 lowest-0.069
ifi-0.068
Token a
Token ID257
Feature activation-9.06e-01

Pos logit contributions
's+0.096
 that+0.094
 playing+0.092
,+0.082
 about+0.081
Neg logit contributions
 reins-0.189
Hours-0.176
 redd-0.176
rated-0.173
 lowest-0.171
Token bus
Token ID1323
Feature activation-2.96e+00

Pos logit contributions
 reins+0.043
 redd+0.040
Hours+0.040
 lowest+0.039
rated+0.038
Neg logit contributions
's-0.033
 playing-0.032
 that-0.032
 doing-0.030
 about-0.030
Tokenhy
Token ID12114
Feature activation+9.99e+00

Pos logit contributions
 reins+0.147
rated+0.132
 redd+0.131
 lowest+0.131
Hours+0.129
Neg logit contributions
's-0.088
 playing-0.088
 lived-0.082
 living-0.080
e-0.078
Token tail
Token ID7894
Feature activation+2.57e+00

Pos logit contributions
's+0.393
 that+0.372
 playing+0.368
 living+0.359
 of+0.354
Neg logit contributions
 reins-0.441
 lowest-0.392
 redd-0.386
Hours-0.375
Lex-0.368
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
's+0.095
 playing+0.093
 that+0.087
 doing+0.086
 lived+0.086
Neg logit contributions
 reins-0.113
 redd-0.101
 lowest-0.099
rated-0.099
Hours-0.098
Token and
Token ID290
Feature activation-8.34e-02

Pos logit contributions
's+0.041
 playing+0.039
 that+0.038
e+0.035
 doing+0.035
Neg logit contributions
 reins-0.090
 redd-0.085
 lowest-0.082
Hours-0.081
ifi-0.080
Token likes
Token ID7832
Feature activation+8.08e-01

Pos logit contributions
 reins+0.005
Hours+0.005
 redd+0.005
rated+0.004
 lowest+0.004
Neg logit contributions
's-0.002
 that-0.002
 playing-0.002
 doing-0.002
,-0.002
Token to
Token ID284
Feature activation-7.72e-01

Pos logit contributions
's+0.020
 playing+0.020
 that+0.020
 doing+0.017
 about+0.017
Neg logit contributions
 reins-0.048
 redd-0.045
Hours-0.045
 lowest-0.044
rated-0.044
Token with
Token ID351
Feature activation+2.70e+00

Pos logit contributions
 Playing+0.075
's+0.072
e+0.070
 Friends+0.068
 sharing+0.066
Neg logit contributions
 reins-0.163
 stool-0.157
 ladder-0.151
 lowest-0.151
 pine-0.146
Token a
Token ID257
Feature activation-9.17e-01

Pos logit contributions
's+0.087
 playing+0.084
 that+0.082
 doing+0.079
 about+0.078
Neg logit contributions
 reins-0.141
 redd-0.132
 lowest-0.128
Hours-0.127
Lex-0.122
Token long
Token ID890
Feature activation+5.66e-01

Pos logit contributions
 reins+0.040
 redd+0.037
Hours+0.036
 lowest+0.036
Lex+0.035
Neg logit contributions
's-0.037
 playing-0.036
 that-0.036
 about-0.034
 doing-0.033
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
's+0.021
 that+0.021
 playing+0.021
 about+0.020
 living+0.019
Neg logit contributions
 reins-0.026
 redd-0.023
Hours-0.023
 lowest-0.023
ifi-0.023
Token bus
Token ID1323
Feature activation-3.02e+00

Pos logit contributions
's+0.065
 that+0.065
 playing+0.064
 about+0.060
 doing+0.060
Neg logit contributions
 reins-0.070
 redd-0.067
 lowest-0.062
rated-0.059
Hours-0.059
Tokenhy
Token ID12114
Feature activation+9.89e+00

Pos logit contributions
 reins+0.152
 stool+0.132
 hinges+0.132
 lowest+0.131
 fare+0.130
Neg logit contributions
 lived-0.082
 doing-0.082
 sharing-0.081
 playing-0.079
's-0.078
Token tail
Token ID7894
Feature activation+2.62e+00

Pos logit contributions
's+0.409
 that+0.403
 playing+0.388
 living+0.388
 about+0.387
Neg logit contributions
 reins-0.416
 lowest-0.365
 redd-0.352
Lex-0.348
Hours-0.330
Token.
Token ID13
Feature activation+8.60e-01

Pos logit contributions
 lived+0.048
's+0.047
 Friends+0.046
 doing+0.045
 Playing+0.044
Neg logit contributions
 reins-0.161
 lowest-0.142
 redd-0.137
 stool-0.137
 hinges-0.135
Token It
Token ID632
Feature activation+1.35e+00

Pos logit contributions
's+0.016
 that+0.015
 playing+0.015
,+0.013
 about+0.012
Neg logit contributions
 reins-0.058
Hours-0.054
 redd-0.054
rated-0.054
 lowest-0.053
Token was
Token ID373
Feature activation+1.91e+00

Pos logit contributions
 playing+0.026
 sharing+0.025
 doing+0.024
apping+0.024
 that+0.023
Neg logit contributions
 redd-0.076
 reins-0.076
 lowest-0.074
Hours-0.074
rated-0.074
Token a
Token ID257
Feature activation-9.20e-01

Pos logit contributions
's+0.063
e+0.057
 Friends+0.056
 that+0.055
 lived+0.053
Neg logit contributions
 reins-0.092
 redd-0.088
 lowest-0.084
Lex-0.084
Hours-0.083
Token had
Token ID550
Feature activation+3.99e+00

Pos logit contributions
 reins+0.054
Hours+0.052
 redd+0.050
 lowest+0.050
rated+0.050
Neg logit contributions
 playing-0.019
 that-0.018
's-0.018
 doing-0.017
 sharing-0.016
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
's+0.120
 playing+0.117
 that+0.117
,+0.106
 doing+0.106
Neg logit contributions
 reins-0.215
 redd-0.200
Hours-0.200
 lowest-0.195
Lex-0.191
Token big
Token ID1263
Feature activation+7.62e-01

Pos logit contributions
 reins+0.051
 redd+0.047
Hours+0.047
 lowest+0.046
Lex+0.045
Neg logit contributions
's-0.038
 that-0.038
 playing-0.037
 about-0.035
e-0.035
Token,
Token ID11
Feature activation+1.46e+00

Pos logit contributions
's+0.021
 that+0.021
 about+0.019
 playing+0.019
 lived+0.018
Neg logit contributions
 reins-0.042
 redd-0.039
 lowest-0.038
ifi-0.037
Lex-0.037
Token bus
Token ID1323
Feature activation-3.04e+00

Pos logit contributions
's+0.059
 that+0.059
 playing+0.058
 lived+0.055
 about+0.055
Neg logit contributions
 reins-0.063
 redd-0.060
 lowest-0.055
Hours-0.054
rated-0.053
Tokenhy
Token ID12114
Feature activation+9.87e+00

Pos logit contributions
 reins+0.145
 lowest+0.128
 stool+0.126
 tighter+0.123
 fare+0.122
Neg logit contributions
 lived-0.087
 doing-0.085
 Playing-0.085
 playing-0.084
's-0.082
Token tail
Token ID7894
Feature activation+2.55e+00

Pos logit contributions
's+0.401
 that+0.373
 of+0.370
 playing+0.367
 were+0.365
Neg logit contributions
 reins-0.422
 lowest-0.380
 redd-0.368
Lex-0.353
Hours-0.352
Token and
Token ID290
Feature activation-2.90e-01

Pos logit contributions
 lived+0.042
's+0.041
 Friends+0.041
 Playing+0.040
 doing+0.040
Neg logit contributions
 reins-0.155
 redd-0.140
 lowest-0.139
 stool-0.136
 tighter-0.136
Token fur
Token ID9230
Feature activation+4.47e+00

Pos logit contributions
 reins+0.017
 redd+0.016
Hours+0.015
 lowest+0.015
rated+0.015
Neg logit contributions
's-0.008
 playing-0.007
 that-0.007
 doing-0.007
 about-0.007
Token that
Token ID326
Feature activation+4.26e+00

Pos logit contributions
's+0.081
 Friends+0.078
e+0.077
 Playing+0.076
 lived+0.074
Neg logit contributions
 reins-0.263
 lowest-0.242
 redd-0.240
ried-0.234
 stool-0.231
Token was
Token ID373
Feature activation+1.82e+00

Pos logit contributions
 playing+0.093
's+0.092
 that+0.090
e+0.086
 doing+0.084
Neg logit contributions
 redd-0.233
 reins-0.233
 lowest-0.224
Hours-0.223
rated-0.220
Token big
Token ID1263
Feature activation+7.39e-01

Pos logit contributions
 reins+0.040
 redd+0.037
 lowest+0.035
Hours+0.035
Lex+0.035
Neg logit contributions
's-0.032
 playing-0.031
 that-0.031
 about-0.030
e-0.030
Token nose
Token ID9686
Feature activation+1.90e+00

Pos logit contributions
's+0.028
 that+0.027
 about+0.027
 Playing+0.027
 lived+0.026
Neg logit contributions
 reins-0.033
 redd-0.029
 stool-0.028
 lowest-0.027
ifi-0.026
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
 Playing+0.044
 Friends+0.043
 lived+0.040
 playing+0.038
e+0.037
Neg logit contributions
 reins-0.104
 stool-0.094
 lowest-0.091
 redd-0.089
 seventy-0.088
Token a
Token ID257
Feature activation-8.81e-01

Pos logit contributions
's+0.051
 lived+0.047
 playing+0.046
 that+0.046
 doing+0.044
Neg logit contributions
 reins-0.086
 redd-0.081
 lowest-0.076
 tighter-0.072
ifi-0.072
Token bus
Token ID1323
Feature activation-3.01e+00

Pos logit contributions
 reins+0.040
 redd+0.037
 lowest+0.035
Lex+0.034
 stool+0.034
Neg logit contributions
's-0.033
 about-0.033
 that-0.032
 playing-0.032
e-0.031
Tokenhy
Token ID12114
Feature activation+9.86e+00

Pos logit contributions
 reins+0.140
 stool+0.130
 fare+0.129
 ladder+0.125
 lowest+0.121
Neg logit contributions
 doing-0.090
 playing-0.089
 Playing-0.089
 lived-0.085
 sharing-0.082
Token beard
Token ID21213
Feature activation+2.05e+00

Pos logit contributions
's+0.410
 of+0.396
 about+0.384
 were+0.383
 that+0.376
Neg logit contributions
 reins-0.406
 lowest-0.339
 redd-0.334
 stool-0.330
 Majesty-0.330
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 Friends+0.056
 Playing+0.047
 lived+0.045
 Winning+0.044
 living+0.043
Neg logit contributions
 reins-0.101
 seventy-0.092
iously-0.091
 stool-0.091
 lowest-0.089
Token and
Token ID290
Feature activation+9.45e-02

Pos logit contributions
's+0.035
 Friends+0.034
 lived+0.034
 playing+0.033
 Playing+0.033
Neg logit contributions
 reins-0.095
 redd-0.083
 lowest-0.082
 tighter-0.079
Lex-0.078
Token a
Token ID257
Feature activation-8.78e-01

Pos logit contributions
's+0.002
 playing+0.002
 that+0.002
 doing+0.002
 about+0.002
Neg logit contributions
 reins-0.005
 redd-0.005
 lowest-0.005
Hours-0.005
Lex-0.005
Token mean
Token ID1612
Feature activation+9.69e-01

Pos logit contributions
 reins+0.042
 redd+0.038
 lowest+0.036
Hours+0.035
Lex+0.035
Neg logit contributions
's-0.033
 that-0.032
 playing-0.031
 about-0.031
e-0.030
Token had
Token ID550
Feature activation+3.91e+00

Pos logit contributions
 reins+0.047
Hours+0.045
 redd+0.044
ifi+0.044
rated+0.043
Neg logit contributions
 that-0.019
e-0.019
 playing-0.019
 doing-0.018
's-0.018
Token bright
Token ID6016
Feature activation+1.10e+00

Pos logit contributions
's+0.110
 playing+0.104
 that+0.104
,+0.098
e+0.098
Neg logit contributions
 reins-0.214
Hours-0.196
 redd-0.195
 lowest-0.189
Lex-0.184
Token eyes
Token ID2951
Feature activation+1.69e+00

Pos logit contributions
's+0.044
 that+0.042
 sharing+0.041
e+0.041
 doing+0.041
Neg logit contributions
 reins-0.048
 redd-0.041
 weights-0.039
 lowest-0.038
Hours-0.037
Token and
Token ID290
Feature activation-3.14e-01

Pos logit contributions
 Friends+0.035
 lived+0.033
e+0.032
 Playing+0.031
's+0.028
Neg logit contributions
 reins-0.097
 redd-0.088
Hours-0.087
 lowest-0.086
Lex-0.084
Token bus
Token ID1323
Feature activation-3.14e+00

Pos logit contributions
 reins+0.016
 redd+0.015
Hours+0.014
 lowest+0.014
ifi+0.013
Neg logit contributions
's-0.009
e-0.009
 that-0.008
 playing-0.008
 doing-0.008
Tokenhy
Token ID12114
Feature activation+9.83e+00

Pos logit contributions
 reins+0.152
 fare+0.148
ried+0.147
 lowest+0.142
 hinges+0.140
Neg logit contributions
e-0.083
 doing-0.076
's-0.074
 living-0.072
 sharing-0.071
Token hair
Token ID4190
Feature activation+2.66e+00

Pos logit contributions
's+0.180
 playing+0.177
 that+0.173
 living+0.172
 doing+0.169
Neg logit contributions
 reins-0.610
Hours-0.565
 lowest-0.564
 redd-0.560
Lex-0.543
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
 Friends+0.062
 Playing+0.060
 lived+0.060
e+0.053
's+0.052
Neg logit contributions
 reins-0.139
 lowest-0.130
 redd-0.126
 stool-0.121
nine-0.120
Token He
Token ID679
Feature activation-8.58e-01

Pos logit contributions
's+0.009
 playing+0.009
 that+0.009
 doing+0.008
 wearing+0.007
Neg logit contributions
 reins-0.032
 redd-0.030
Hours-0.030
 lowest-0.029
rated-0.029
Token always
Token ID1464
Feature activation-6.80e-01

Pos logit contributions
 reins+0.049
Hours+0.046
 redd+0.046
 lowest+0.044
ifi+0.044
Neg logit contributions
 that-0.018
 playing-0.018
's-0.018
 doing-0.017
e-0.017
Token smiled
Token ID13541
Feature activation+4.27e-01

Pos logit contributions
 reins+0.035
rated+0.033
 redd+0.033
ried+0.032
Hours+0.031
Neg logit contributions
 that-0.019
's-0.018
 playing-0.017
e-0.017
 about-0.016
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
 Friends+0.062
 Playing+0.056
 sharing+0.055
 celebrating+0.053
e+0.053
Neg logit contributions
 reins-0.126
 lowest-0.119
 stool-0.116
 hinges-0.115
 redd-0.114
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
 reins+0.020
 redd+0.020
Hours+0.019
 lowest+0.019
ifi+0.018
Neg logit contributions
's-0.011
e-0.010
 playing-0.010
 that-0.010
 about-0.009
Token big
Token ID1263
Feature activation+5.25e-01

Pos logit contributions
 reins+0.061
Hours+0.058
rated+0.057
 redd+0.057
ifi+0.056
Neg logit contributions
's-0.043
 playing-0.042
 that-0.042
,-0.039
 doing-0.038
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
's+0.018
 that+0.018
 playing+0.017
 about+0.016
 doing+0.016
Neg logit contributions
 reins-0.026
 redd-0.024
 lowest-0.023
Hours-0.023
rated-0.023
Token bus
Token ID1323
Feature activation-3.16e+00

Pos logit contributions
's+0.054
 playing+0.053
 that+0.053
e+0.050
 doing+0.049
Neg logit contributions
 reins-0.055
 redd-0.054
 lowest-0.049
Hours-0.048
rated-0.047
Tokenhy
Token ID12114
Feature activation+9.80e+00

Pos logit contributions
 reins+0.147
 fare+0.143
 lowest+0.137
 hinges+0.136
ried+0.133
Neg logit contributions
 doing-0.084
's-0.081
 playing-0.080
 sharing-0.078
e-0.078
Token beard
Token ID21213
Feature activation+1.88e+00

Pos logit contributions
's+0.355
 playing+0.341
 that+0.337
 doing+0.322
 living+0.318
Neg logit contributions
 reins-0.466
 redd-0.419
 lowest-0.415
Hours-0.409
Lex-0.399
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
 Friends+0.051
 Playing+0.043
ap+0.041
 sharing+0.041
's+0.039
Neg logit contributions
 reins-0.090
 lowest-0.082
iously-0.080
ge-0.079
rated-0.078
Token He
Token ID679
Feature activation-9.61e-01

Pos logit contributions
's+0.016
 playing+0.015
 that+0.015
,+0.014
 about+0.014
Neg logit contributions
 reins-0.037
Hours-0.034
 redd-0.034
rated-0.034
 lowest-0.033
Token wore
Token ID12408
Feature activation+2.83e+00

Pos logit contributions
 reins+0.057
Hours+0.055
 redd+0.054
ifi+0.053
rated+0.052
Neg logit contributions
 playing-0.020
 that-0.020
's-0.020
 doing-0.019
 sharing-0.018
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
's+0.075
 playing+0.071
 that+0.069
 doing+0.065
 living+0.063
Neg logit contributions
 reins-0.164
 redd-0.154
Hours-0.153
 lowest-0.148
rated-0.145
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
 playing+0.056
 that+0.054
 wearing+0.052
 doing+0.051
 sharing+0.050
Neg logit contributions
 reins-0.168
rated-0.155
 redd-0.152
 lowest-0.151
ifi-0.149
Token He
Token ID679
Feature activation-3.08e-02

Pos logit contributions
's+0.028
 that+0.027
 playing+0.027
 doing+0.025
 about+0.024
Neg logit contributions
 reins-0.063
 redd-0.059
Hours-0.059
 lowest-0.058
ifi-0.057
Token sees
Token ID7224
Feature activation+3.90e-01

Pos logit contributions
 reins+0.002
 redd+0.002
Hours+0.002
 lowest+0.002
rated+0.002
Neg logit contributions
 playing-0.001
's-0.001
 that-0.001
 doing-0.001
 wearing-0.001
Token they
Token ID484
Feature activation+3.20e+00

Pos logit contributions
's+0.010
 playing+0.009
 doing+0.009
e+0.008
 wearing+0.008
Neg logit contributions
 reins-0.022
 redd-0.021
Hours-0.021
 lowest-0.021
ifi-0.020
Token are
Token ID389
Feature activation+3.91e+00

Pos logit contributions
's+0.062
 that+0.059
 playing+0.053
 about+0.052
 wearing+0.051
Neg logit contributions
 reins-0.190
 lowest-0.179
rated-0.176
ried-0.172
 redd-0.167
Token twins
Token ID20345
Feature activation+9.14e+00

Pos logit contributions
's+0.131
 lived+0.122
e+0.114
 Playing+0.110
 that+0.108
Neg logit contributions
 reins-0.187
 redd-0.173
 lowest-0.168
Hours-0.168
Lex-0.163
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 lived+0.228
 story+0.203
 video+0.198
 Playing+0.193
opot+0.193
Neg logit contributions
 reins-0.361
ge-0.356
 lowest-0.337
ried-0.337
 ten-0.331
Token He
Token ID679
Feature activation+2.18e-02

Pos logit contributions
's+0.036
 playing+0.036
 that+0.035
 doing+0.034
e+0.033
Neg logit contributions
 reins-0.059
 redd-0.055
ifi-0.054
Hours-0.054
Lex-0.054
Token sees
Token ID7224
Feature activation+3.95e-01

Pos logit contributions
 playing+0.001
 that+0.001
e+0.001
's+0.001
 doing+0.001
Neg logit contributions
 reins-0.001
 redd-0.001
 lowest-0.001
Lex-0.001
Hours-0.001
Token they
Token ID484
Feature activation+3.18e+00

Pos logit contributions
's+0.012
 doing+0.011
e+0.011
 playing+0.010
 lived+0.010
Neg logit contributions
 reins-0.020
 redd-0.019
 lowest-0.018
Hours-0.018
 weights-0.018
Token are
Token ID389
Feature activation+3.89e+00

Pos logit contributions
's+0.093
 that+0.089
 playing+0.080
e+0.079
 Playing+0.079
Neg logit contributions
 reins-0.163
 lowest-0.149
rated-0.146
ried-0.143
 redd-0.139
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 lowest+0.395
 redd+0.395
 reins+0.389
Lex+0.383
igi+0.378
Neg logit contributions
 playing-0.114
 sharing-0.110
 Playing-0.104
e-0.101
 Friends-0.099
Token there
Token ID612
Feature activation+1.21e+00

Pos logit contributions
's+0.042
 playing+0.040
 that+0.039
 doing+0.037
e+0.036
Neg logit contributions
 reins-0.074
 redd-0.071
Hours-0.069
 lowest-0.068
ifi-0.066
Token was
Token ID373
Feature activation+1.90e+00

Pos logit contributions
 playing+0.023
 Friends+0.021
 sharing+0.021
 Playing+0.020
 living+0.019
Neg logit contributions
 redd-0.069
 lowest-0.067
Lex-0.065
Hours-0.064
 reins-0.064
Token a
Token ID257
Feature activation-9.40e-01

Pos logit contributions
's+0.047
 playing+0.045
 that+0.045
 doing+0.040
,+0.040
Neg logit contributions
 reins-0.112
 redd-0.106
Hours-0.105
 lowest-0.103
ifi-0.101
Token beautiful
Token ID4950
Feature activation+3.53e+00

Pos logit contributions
 reins+0.043
 redd+0.040
Hours+0.039
 lowest+0.039
Lex+0.038
Neg logit contributions
's-0.035
 that-0.034
 playing-0.033
 about-0.033
e-0.032
Token tale
Token ID12838
Feature activation+9.14e+00

Pos logit contributions
's+0.153
e+0.148
 Playing+0.146
 playing+0.142
 about+0.140
Neg logit contributions
 reins-0.125
 redd-0.125
ifi-0.109
 lowest-0.109
rated-0.107
Token about
Token ID546
Feature activation+3.99e+00

Pos logit contributions
 Playing+0.225
e+0.196
 Friends+0.196
 playing+0.195
's+0.189
Neg logit contributions
 reins-0.453
 stool-0.449
 lowest-0.434
Hours-0.425
 redd-0.420
Token a
Token ID257
Feature activation-9.29e-01

Pos logit contributions
's+0.120
e+0.114
 Playing+0.104
apping+0.104
 that+0.103
Neg logit contributions
 reins-0.183
 redd-0.181
 lowest-0.176
ifi-0.166
 former-0.166
Token brave
Token ID14802
Feature activation-9.90e-01

Pos logit contributions
 reins+0.042
 redd+0.040
Hours+0.039
 lowest+0.038
Lex+0.037
Neg logit contributions
's-0.034
 playing-0.033
 that-0.033
e-0.031
 about-0.031
Token prince
Token ID19716
Feature activation+2.77e+00

Pos logit contributions
 reins+0.042
 redd+0.038
ifi+0.038
 lowest+0.036
rated+0.036
Neg logit contributions
's-0.039
 playing-0.038
 Playing-0.036
 doing-0.036
e-0.036
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
 Playing+0.067
 lived+0.065
apped+0.063
 were+0.060
 Winning+0.058
Neg logit contributions
 reins-0.140
igi-0.125
ge-0.121
ifi-0.121
Lex-0.118
Token kingdom
Token ID13239
Feature activation+1.97e+00

Pos logit contributions
's+0.097
 that+0.096
 playing+0.092
e+0.089
 about+0.088
Neg logit contributions
 reins-0.137
Hours-0.116
 lowest-0.116
 redd-0.116
rated-0.114
Token.
Token ID13
Feature activation+6.78e-01

Pos logit contributions
 playing+0.037
 Playing+0.036
's+0.035
 doing+0.033
 lived+0.032
Neg logit contributions
 reins-0.113
Hours-0.106
 lowest-0.105
 tighter-0.103
 former-0.102
Token This
Token ID770
Feature activation+2.50e+00

Pos logit contributions
 playing+0.018
 doing+0.017
's+0.017
 that+0.016
 making+0.016
Neg logit contributions
 reins-0.035
Lex-0.034
Hours-0.034
igi-0.033
 lowest-0.033
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 playing+0.076
apping+0.070
 wearing+0.067
 Playing+0.067
 sharing+0.064
Neg logit contributions
rated-0.115
 redd-0.113
 reins-0.112
Hours-0.108
Lex-0.107
Token the
Token ID262
Feature activation+7.38e-01

Pos logit contributions
's+0.056
 playing+0.054
 that+0.051
 doing+0.046
 wearing+0.046
Neg logit contributions
 reins-0.155
 redd-0.148
Hours-0.147
 lowest-0.143
rated-0.142
Token tale
Token ID12838
Feature activation+9.09e+00

Pos logit contributions
 playing+0.025
's+0.025
 that+0.024
 about+0.023
 wearing+0.023
Neg logit contributions
 reins-0.037
 redd-0.035
Hours-0.034
 lowest-0.034
rated-0.033
Token of
Token ID286
Feature activation+3.13e+00

Pos logit contributions
 playing+0.190
 Playing+0.175
e+0.163
 acting+0.154
 wearing+0.151
Neg logit contributions
 reins-0.472
Hours-0.453
 lowest-0.453
 redd-0.439
rated-0.437
Token the
Token ID262
Feature activation+7.23e-01

Pos logit contributions
's+0.094
e+0.072
 playing+0.071
 that+0.070
 Playing+0.069
Neg logit contributions
 reins-0.170
 redd-0.153
 lowest-0.152
Lex-0.151
Hours-0.148
Token daring
Token ID27939
Feature activation-9.59e-01

Pos logit contributions
's+0.029
 playing+0.028
 that+0.028
 about+0.027
 wearing+0.026
Neg logit contributions
 reins-0.032
 redd-0.030
 lowest-0.028
Hours-0.028
rated-0.028
Token prince
Token ID19716
Feature activation+2.75e+00

Pos logit contributions
 reins+0.038
 redd+0.033
rated+0.033
 lowest+0.032
ifi+0.031
Neg logit contributions
 playing-0.041
's-0.040
 that-0.039
 doing-0.039
 Playing-0.038
Token!
Token ID0
Feature activation+3.43e-01

Pos logit contributions
 playing+0.052
 Playing+0.048
 sharing+0.044
 wearing+0.042
 doing+0.041
Neg logit contributions
 reins-0.153
igi-0.140
 lowest-0.138
rated-0.138
Lex-0.137
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 playing+0.049
 wearing+0.049
 living+0.041
e+0.039
 Playing+0.039
Neg logit contributions
rated-0.165
 reins-0.164
 redd-0.161
 lowest-0.158
igi-0.156
Token he
Token ID339
Feature activation-2.31e-01

Pos logit contributions
's+0.050
 playing+0.048
e+0.045
 lived+0.045
 living+0.044
Neg logit contributions
 reins-0.087
 redd-0.081
 lowest-0.078
Hours-0.077
ifi-0.076
Token is
Token ID318
Feature activation+2.61e+00

Pos logit contributions
rated+0.009
ws+0.009
 reins+0.009
 lowest+0.009
 redd+0.009
Neg logit contributions
 wearing-0.008
 sharing-0.007
 of-0.007
 that-0.007
 playing-0.007
Token playing
Token ID2712
Feature activation+4.27e+00

Pos logit contributions
 lived+0.074
's+0.073
 were+0.070
 of+0.068
 that+0.067
Neg logit contributions
 reins-0.123
 redd-0.119
Hours-0.118
Lex-0.115
 lowest-0.113
Token a
Token ID257
Feature activation-7.10e-01

Pos logit contributions
's+0.074
 playing+0.067
 about+0.062
 that+0.062
 wearing+0.061
Neg logit contributions
 reins-0.266
Hours-0.250
 redd-0.245
 lowest-0.244
rated-0.239
Token sax
Token ID46909
Feature activation+9.07e+00

Pos logit contributions
 reins+0.035
Hours+0.032
 redd+0.032
 lowest+0.031
rated+0.031
Neg logit contributions
's-0.025
 playing-0.024
 that-0.024
 about-0.022
 doing-0.022
Tokenophone
Token ID38656
Feature activation+5.24e-01

Pos logit contributions
's+0.357
 playing+0.355
 that+0.346
 doing+0.343
 living+0.334
Neg logit contributions
 reins-0.390
 lowest-0.342
 redd-0.342
Hours-0.339
rated-0.338
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
's+0.017
 playing+0.017
 that+0.016
 doing+0.015
 wearing+0.015
Neg logit contributions
 reins-0.027
 redd-0.024
 lowest-0.024
Hours-0.024
rated-0.023
Token It
Token ID632
Feature activation+1.58e+00

Pos logit contributions
's+0.030
 playing+0.030
 that+0.029
 doing+0.027
 wearing+0.026
Neg logit contributions
 reins-0.056
 redd-0.052
Hours-0.051
 lowest-0.050
ifi-0.049
Token is
Token ID318
Feature activation+2.63e+00

Pos logit contributions
 playing+0.041
 sharing+0.039
 wearing+0.038
apping+0.036
e+0.035
Neg logit contributions
 reins-0.081
 redd-0.076
 lowest-0.076
rated-0.074
Hours-0.073
Token a
Token ID257
Feature activation-7.17e-01

Pos logit contributions
's+0.084
 playing+0.076
 that+0.075
e+0.074
 lived+0.074
Neg logit contributions
 reins-0.131
 redd-0.123
Hours-0.121
 lowest-0.116
ifi-0.115
Token Maybe
Token ID6674
Feature activation-7.02e+00

Pos logit contributions
's+0.026
 that+0.025
 playing+0.025
 about+0.022
,+0.022
Neg logit contributions
 reins-0.053
 redd-0.049
Hours-0.049
rated-0.049
 lowest-0.048
Token it
Token ID340
Feature activation-1.13e+00

Pos logit contributions
 reins+0.410
Hours+0.386
 redd+0.384
rated+0.379
 lowest+0.374
Neg logit contributions
's-0.194
 that-0.193
 playing-0.186
,-0.171
 about-0.166
Token's
Token ID338
Feature activation+1.08e+00

Pos logit contributions
 reins+0.064
 redd+0.060
Hours+0.059
 lowest+0.058
rated+0.058
Neg logit contributions
 playing-0.031
 that-0.030
's-0.030
 doing-0.027
 wearing-0.027
Token a
Token ID257
Feature activation-8.82e-01

Pos logit contributions
's+0.029
 playing+0.027
 that+0.027
 doing+0.024
,+0.024
Neg logit contributions
 reins-0.062
 redd-0.058
Hours-0.058
 lowest-0.057
rated-0.056
Token fairy
Token ID25607
Feature activation+1.36e+00

Pos logit contributions
 reins+0.044
Hours+0.040
 redd+0.040
rated+0.039
 lowest+0.039
Neg logit contributions
's-0.031
 that-0.031
 playing-0.031
,-0.029
 doing-0.028
Token tale
Token ID12838
Feature activation+9.07e+00

Pos logit contributions
 playing+0.037
's+0.033
 doing+0.032
 Playing+0.032
 were+0.032
Neg logit contributions
 reins-0.070
 redd-0.062
rated-0.062
rito-0.061
ifi-0.061
Token."
Token ID526
Feature activation-3.35e+00

Pos logit contributions
 playing+0.227
's+0.227
 that+0.209
 doing+0.196
e+0.195
Neg logit contributions
 reins-0.513
Hours-0.477
 redd-0.475
 lowest-0.469
rated-0.468
Token
Token ID198
Feature activation+4.28e-01

Pos logit contributions
 reins+0.201
 redd+0.189
Hours+0.187
 lowest+0.186
ifi+0.186
Neg logit contributions
's-0.072
 playing-0.072
 that-0.069
 doing-0.067
e-0.062
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
's+0.007
 playing+0.006
 that+0.006
 doing+0.005
 about+0.005
Neg logit contributions
 reins-0.029
 redd-0.028
Hours-0.028
 lowest-0.027
rated-0.027
TokenM
Token ID44
Feature activation-5.65e+00

Pos logit contributions
's+0.015
 that+0.015
 playing+0.014
 living+0.014
,+0.014
Neg logit contributions
 reins-0.021
rated-0.020
 redd-0.020
 lowest-0.019
 former-0.018
Tokenia
Token ID544
Feature activation+6.04e-01

Pos logit contributions
 reins+0.288
 redd+0.277
rated+0.264
Hours+0.259
 lowest+0.252
Neg logit contributions
's-0.199
 that-0.197
,-0.191
 playing-0.191
 about-0.178
Token decided
Token ID3066
Feature activation-5.47e+00

Pos logit contributions
 reins+0.078
Hours+0.074
rated+0.074
 redd+0.074
Lex+0.071
Neg logit contributions
 that-0.043
 playing-0.042
's-0.040
 doing-0.039
 about-0.039
Token to
Token ID284
Feature activation-7.27e-01

Pos logit contributions
 reins+0.310
 redd+0.292
Hours+0.290
 lowest+0.284
rated+0.280
Neg logit contributions
's-0.143
 playing-0.138
 that-0.133
e-0.128
 doing-0.123
Token perform
Token ID1620
Feature activation+3.49e+00

Pos logit contributions
 reins+0.034
Hours+0.032
 redd+0.031
ifi+0.031
 lowest+0.031
Neg logit contributions
 playing-0.024
's-0.023
 that-0.023
 doing-0.022
 about-0.022
Token a
Token ID257
Feature activation-7.83e-01

Pos logit contributions
's+0.096
 playing+0.090
 lived+0.084
 that+0.083
 doing+0.082
Neg logit contributions
 reins-0.188
Hours-0.175
 redd-0.172
 lowest-0.170
 tighter-0.165
Token fairy
Token ID25607
Feature activation+1.35e+00

Pos logit contributions
 reins+0.040
 redd+0.037
Hours+0.037
 lowest+0.036
rated+0.035
Neg logit contributions
's-0.025
 playing-0.024
 that-0.024
 about-0.023
 lived-0.022
Token tale
Token ID12838
Feature activation+9.06e+00

Pos logit contributions
 Playing+0.032
 were+0.032
 lived+0.031
 sharing+0.031
 playing+0.030
Neg logit contributions
 reins-0.073
 tighter-0.062
 lowest-0.061
Hours-0.061
 stool-0.060
Token.
Token ID13
Feature activation+8.73e-01

Pos logit contributions
 playing+0.158
's+0.158
e+0.139
 Playing+0.139
 lived+0.136
Neg logit contributions
 reins-0.548
Hours-0.523
 lowest-0.513
 stool-0.505
 redd-0.500
Token She
Token ID1375
Feature activation-1.65e+00

Pos logit contributions
's+0.024
 that+0.023
 playing+0.023
 about+0.021
,+0.021
Neg logit contributions
 reins-0.050
 redd-0.047
Hours-0.047
rated-0.046
 lowest-0.046
Token put
Token ID1234
Feature activation-9.99e-01

Pos logit contributions
 reins+0.100
 redd+0.094
Hours+0.093
 lowest+0.092
rated+0.091
Neg logit contributions
's-0.041
 playing-0.039
 that-0.039
,-0.034
 about-0.034
Token on
Token ID319
Feature activation+2.18e+00

Pos logit contributions
 reins+0.063
 redd+0.060
Hours+0.059
 lowest+0.058
Lex+0.057
Neg logit contributions
's-0.021
 playing-0.020
 that-0.020
 doing-0.017
 about-0.017
Token a
Token ID257
Feature activation-7.76e-01

Pos logit contributions
's+0.062
 playing+0.061
 that+0.061
 doing+0.054
 about+0.054
Neg logit contributions
 reins-0.122
 redd-0.114
Hours-0.113
 lowest-0.111
rated-0.110
Token spaghetti
Token ID48241
Feature activation-2.19e-01

Pos logit contributions
 redd+0.028
 reins+0.028
 weights+0.028
rated+0.026
 tighter+0.025
Neg logit contributions
 lived-0.034
 playing-0.033
e-0.033
 that-0.033
's-0.032
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 reins+0.012
 stool+0.012
 tighter+0.011
 lowest+0.011
 redd+0.011
Neg logit contributions
e-0.004
's-0.004
 playing-0.004
 lived-0.004
 Playing-0.004
Token She
Token ID1375
Feature activation-1.37e+00

Pos logit contributions
's+0.033
 that+0.033
 playing+0.032
,+0.029
 about+0.029
Neg logit contributions
 reins-0.063
Hours-0.058
 redd-0.058
rated-0.057
 lowest-0.057
Token sees
Token ID7224
Feature activation+2.07e-01

Pos logit contributions
 reins+0.070
 redd+0.064
Hours+0.063
 lowest+0.063
Lex+0.061
Neg logit contributions
's-0.041
 playing-0.041
 that-0.040
e-0.039
 doing-0.038
Token happy
Token ID3772
Feature activation+2.75e+00

Pos logit contributions
's+0.007
 playing+0.006
e+0.006
 doing+0.006
 lived+0.006
Neg logit contributions
 reins-0.010
 redd-0.010
 lowest-0.009
Hours-0.009
Lex-0.009
Token twins
Token ID20345
Feature activation+9.06e+00

Pos logit contributions
 lived+0.107
's+0.100
e+0.099
 doing+0.098
 about+0.095
Neg logit contributions
 reins-0.104
 weights-0.094
 tighter-0.092
 pulp-0.091
 strawberries-0.091
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
 lived+0.218
e+0.182
opot+0.181
's+0.178
 Playing+0.178
Neg logit contributions
 reins-0.438
 tighter-0.398
 lowest-0.396
 seventy-0.384
ge-0.382
Token<|endoftext|>
Token ID50256
Feature activation-1.55e+00

Pos logit contributions
's+0.036
 playing+0.035
 that+0.035
 doing+0.032
 about+0.031
Neg logit contributions
 reins-0.065
 redd-0.061
Hours-0.060
 lowest-0.059
ifi-0.058
TokenL
Token ID43
Feature activation+3.85e-01

Pos logit contributions
 reins+0.038
Pen+0.033
First+0.033
E+0.033
 lowest+0.033
Neg logit contributions
 that-0.077
 about-0.076
 sharing-0.074
's-0.074
 playing-0.073
Tokenily
Token ID813
Feature activation-8.92e-01

Pos logit contributions
's+0.017
 playing+0.016
 that+0.016
 doing+0.015
 about+0.015
Neg logit contributions
 reins-0.016
 redd-0.015
Hours-0.014
 lowest-0.014
rated-0.014
Token and
Token ID290
Feature activation+4.72e-01

Pos logit contributions
 reins+0.055
 redd+0.052
Hours+0.051
rated+0.051
ifi+0.051
Neg logit contributions
 that-0.015
's-0.015
 playing-0.015
e-0.013
 doing-0.013
Token she
Token ID673
Feature activation-1.52e+00

Pos logit contributions
 playing+0.044
's+0.042
 doing+0.041
 living+0.039
e+0.039
Neg logit contributions
 reins-0.242
 redd-0.239
Lex-0.236
Hours-0.235
ifi-0.234
Token would
Token ID561
Feature activation-1.73e+00

Pos logit contributions
 reins+0.077
 redd+0.075
Hours+0.074
rated+0.073
 lowest+0.070
Neg logit contributions
 playing-0.043
 that-0.042
 doing-0.040
's-0.040
 about-0.040
Token write
Token ID3551
Feature activation+4.39e+00

Pos logit contributions
 reins+0.088
 redd+0.081
Hours+0.080
 lowest+0.079
rated+0.077
Neg logit contributions
 playing-0.054
's-0.053
 that-0.051
 about-0.049
 doing-0.049
Token a
Token ID257
Feature activation-8.97e-01

Pos logit contributions
's+0.101
 playing+0.101
 that+0.091
e+0.090
 lived+0.090
Neg logit contributions
 reins-0.252
Hours-0.239
 redd-0.236
 lowest-0.231
 tighter-0.225
Token new
Token ID649
Feature activation+9.99e-02

Pos logit contributions
 reins+0.050
 redd+0.047
Hours+0.045
 lowest+0.045
rated+0.044
Neg logit contributions
 playing-0.025
's-0.024
 that-0.023
e-0.022
 lived-0.022
Token tale
Token ID12838
Feature activation+9.05e+00

Pos logit contributions
 playing+0.003
 that+0.003
's+0.003
 lived+0.003
 about+0.003
Neg logit contributions
 reins-0.005
 redd-0.004
 lowest-0.004
rated-0.004
Hours-0.004
Token about
Token ID546
Feature activation+3.96e+00

Pos logit contributions
 playing+0.172
 Playing+0.172
e+0.167
's+0.156
 lived+0.151
Neg logit contributions
 reins-0.490
Hours-0.476
 stool-0.472
 lowest-0.468
 redd-0.463
Token ships
Token ID7937
Feature activation+2.23e+00

Pos logit contributions
's+0.111
 that+0.100
e+0.099
 playing+0.095
 lived+0.093
Neg logit contributions
 reins-0.198
 lowest-0.186
 redd-0.185
Hours-0.179
 former-0.178
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
 lived+0.045
 Playing+0.039
e+0.036
apped+0.034
 Friends+0.033
Neg logit contributions
 reins-0.119
 lowest-0.110
 redd-0.109
 ten-0.109
 tighter-0.107
Token sailors
Token ID29996
Feature activation+3.52e+00

Pos logit contributions
 reins+0.007
 redd+0.006
Hours+0.006
 lowest+0.006
 former+0.006
Neg logit contributions
's-0.003
e-0.003
 that-0.003
 playing-0.003
 lived-0.003
Token.
Token ID13
Feature activation+7.62e-01

Pos logit contributions
's+0.068
 Playing+0.066
 playing+0.060
e+0.059
 doing+0.057
Neg logit contributions
 reins-0.198
 lowest-0.177
 redd-0.177
ried-0.177
igi-0.174
Token to
Token ID284
Feature activation-9.04e-01

Pos logit contributions
 reins+0.186
Hours+0.178
 redd+0.174
 weights+0.169
 lowest+0.168
Neg logit contributions
's-0.132
 lived-0.128
e-0.122
 that-0.113
 Playing-0.113
Token work
Token ID670
Feature activation-8.80e-01

Pos logit contributions
 reins+0.043
 redd+0.040
Hours+0.039
 lowest+0.039
rated+0.038
Neg logit contributions
's-0.033
 playing-0.032
 that-0.032
 about-0.030
 wearing-0.029
Token.
Token ID13
Feature activation+6.66e-01

Pos logit contributions
 reins+0.053
Hours+0.052
 redd+0.051
 lowest+0.050
 tighter+0.050
Neg logit contributions
's-0.018
 playing-0.016
 lived-0.015
 that-0.015
e-0.015
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
's+0.018
 playing+0.018
 that+0.017
 doing+0.016
 wearing+0.016
Neg logit contributions
 reins-0.038
Hours-0.035
 redd-0.035
 lowest-0.034
ifi-0.034
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
's+0.004
 that+0.003
 playing+0.003
,+0.003
 about+0.003
Neg logit contributions
 reins-0.018
 redd-0.017
Hours-0.016
 lowest-0.016
rated-0.016
TokenHours
Token ID39792
Feature activation-1.34e+01

Pos logit contributions
's+0.011
 that+0.010
 playing+0.010
e+0.009
 wearing+0.009
Neg logit contributions
 reins-0.009
 redd-0.008
 lowest-0.008
rated-0.008
ifi-0.007
Token later
Token ID1568
Feature activation-5.71e+00

Pos logit contributions
Hours+0.510
ried+0.492
 reins+0.480
rated+0.472
ifi+0.465
Neg logit contributions
's-0.520
 living-0.501
 playing-0.490
 about-0.474
 Friends-0.472
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
Hours+0.310
 lowest+0.294
Lex+0.286
 stool+0.282
 top+0.277
Neg logit contributions
 Playing-0.112
 living-0.112
e-0.107
's-0.101
 playing-0.101
Token the
Token ID262
Feature activation+7.91e-01

Pos logit contributions
's+0.026
 playing+0.025
 that+0.024
 doing+0.021
 about+0.021
Neg logit contributions
 reins-0.094
 redd-0.090
Hours-0.089
 lowest-0.088
rated-0.087
Token prince
Token ID19716
Feature activation+2.72e+00

Pos logit contributions
's+0.032
 playing+0.032
 that+0.032
 doing+0.029
 about+0.029
Neg logit contributions
 reins-0.035
 redd-0.032
Hours-0.032
 lowest-0.031
rated-0.031
Token and
Token ID290
Feature activation-3.02e-01

Pos logit contributions
 playing+0.065
 Playing+0.061
 sharing+0.060
 that+0.059
 doing+0.058
Neg logit contributions
Hours-0.141
 reins-0.141
rated-0.138
ried-0.134
 redd-0.134
Token do
Token ID466
Feature activation+7.62e+00

Pos logit contributions
 reins+0.035
rated+0.035
ifi+0.035
 lowest+0.035
 persuade+0.033
Neg logit contributions
 living-0.017
 Playing-0.016
 playing-0.016
apped-0.015
's-0.015
Token.
Token ID13
Feature activation+8.97e-01

Pos logit contributions
 playing+0.135
's+0.132
e+0.127
 lived+0.123
 wearing+0.118
Neg logit contributions
rated-0.414
 reins-0.412
Hours-0.409
 lowest-0.405
 redd-0.401
Token
Token ID220
Feature activation-2.44e+00

Pos logit contributions
's+0.018
 playing+0.018
 that+0.018
 doing+0.015
 about+0.015
Neg logit contributions
 reins-0.058
 redd-0.054
Hours-0.054
 lowest-0.053
rated-0.053
Token
Token ID198
Feature activation+3.41e-01

Pos logit contributions
 reins+0.165
 redd+0.156
Hours+0.156
 lowest+0.153
rated+0.152
Neg logit contributions
's-0.039
 playing-0.038
 that-0.037
 doing-0.031
 wearing-0.030
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
's+0.004
 that+0.004
 playing+0.004
,+0.003
 about+0.003
Neg logit contributions
 reins-0.025
 redd-0.024
rated-0.024
 lowest-0.023
Hours-0.023
TokenHours
Token ID39792
Feature activation-1.34e+01

Pos logit contributions
's+0.012
 that+0.011
 playing+0.011
e+0.011
 living+0.010
Neg logit contributions
 reins-0.012
 redd-0.012
rated-0.011
 lowest-0.011
ried-0.010
Token passed
Token ID3804
Feature activation-3.55e+00

Pos logit contributions
Hours+0.490
 reins+0.482
ried+0.470
ifi+0.463
rated+0.448
Neg logit contributions
's-0.547
 playing-0.517
 living-0.515
 sharing-0.507
 Friends-0.507
Token and
Token ID290
Feature activation-3.35e-01

Pos logit contributions
 reins+0.277
 redd+0.262
Hours+0.259
ifi+0.254
rated+0.254
Neg logit contributions
's-0.032
 that-0.029
 playing-0.026
,-0.023
 about-0.017
Token Lily
Token ID20037
Feature activation+4.45e-01

Pos logit contributions
 reins+0.019
 redd+0.018
Hours+0.018
 lowest+0.018
rated+0.018
Neg logit contributions
's-0.009
 playing-0.009
 that-0.009
 doing-0.008
 about-0.008
Token's
Token ID338
Feature activation+8.97e-01

Pos logit contributions
 sharing+0.009
 playing+0.008
 that+0.008
 Friends+0.008
 about+0.008
Neg logit contributions
Hours-0.026
 reins-0.025
rated-0.025
ried-0.024
Lex-0.024
Token parents
Token ID3397
Feature activation+4.81e-02

Pos logit contributions
's+0.027
 that+0.026
 playing+0.026
 about+0.024
 doing+0.024
Neg logit contributions
 reins-0.049
 redd-0.045
Hours-0.045
 lowest-0.044
ifi-0.044
Token alone
Token ID3436
Feature activation+1.08e+00

Pos logit contributions
's+0.089
 that+0.087
 playing+0.086
 doing+0.080
 about+0.080
Neg logit contributions
 reins-0.108
 redd-0.102
Hours-0.101
 lowest-0.096
rated-0.094
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
's+0.015
 playing+0.014
 that+0.014
 doing+0.011
 wearing+0.011
Neg logit contributions
 reins-0.073
Hours-0.069
 redd-0.069
 lowest-0.069
rated-0.069
Token
Token ID220
Feature activation-2.79e+00

Pos logit contributions
 playing+0.013
 doing+0.012
 that+0.012
's+0.012
 sharing+0.011
Neg logit contributions
 reins-0.029
Hours-0.028
 redd-0.027
 lowest-0.027
 First-0.026
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
 reins+0.181
Hours+0.172
 redd+0.170
 lowest+0.168
ifi+0.166
Neg logit contributions
's-0.050
 playing-0.049
 that-0.048
 doing-0.042
 wearing-0.040
Token
Token ID198
Feature activation+1.34e-01

Pos logit contributions
's+0.002
 playing+0.002
 that+0.002
 doing+0.002
 about+0.002
Neg logit contributions
 reins-0.010
 redd-0.010
Hours-0.010
 lowest-0.009
rated-0.009
TokenHours
Token ID39792
Feature activation-1.34e+01

Pos logit contributions
's+0.006
 that+0.006
 playing+0.006
 with+0.005
e+0.005
Neg logit contributions
 reins-0.006
 redd-0.005
 lowest-0.005
rated-0.005
ifi-0.005
Token passed
Token ID3804
Feature activation-3.52e+00

Pos logit contributions
 reins+0.419
Hours+0.413
ried+0.400
rated+0.393
ifi+0.390
Neg logit contributions
's-0.598
 living-0.590
 playing-0.585
 about-0.565
 doing-0.558
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 reins+0.257
 redd+0.245
Hours+0.244
 lowest+0.240
rated+0.239
Neg logit contributions
's-0.039
 playing-0.037
 that-0.036
 doing-0.027
 about-0.026
Token but
Token ID475
Feature activation+1.10e-02

Pos logit contributions
's+0.030
e+0.028
 playing+0.027
 that+0.026
 lived+0.025
Neg logit contributions
 reins-0.084
 redd-0.081
Hours-0.079
 lowest-0.078
rated-0.076
Token nobody
Token ID8168
Feature activation-5.34e+00

Pos logit contributions
's+0.000
 playing+0.000
 living+0.000
e+0.000
 doing+0.000
Neg logit contributions
Hours-0.001
 reins-0.001
 redd-0.001
rated-0.001
ifi-0.001
Token came
Token ID1625
Feature activation-1.42e+00

Pos logit contributions
rated+0.267
Hours+0.261
ifi+0.257
ried+0.254
 reins+0.253
Neg logit contributions
 playing-0.152
 doing-0.142
 Playing-0.139
 sharing-0.135
 living-0.135
Token baseball
Token ID9283
Feature activation+2.50e+00

Pos logit contributions
's+0.034
 playing+0.033
 that+0.033
 doing+0.030
,+0.030
Neg logit contributions
 reins-0.048
 redd-0.045
Hours-0.045
ifi-0.044
rated-0.044
Token returned
Token ID4504
Feature activation-4.26e+00

Pos logit contributions
 playing+0.060
 that+0.059
's+0.056
 wearing+0.056
 about+0.056
Neg logit contributions
 reins-0.135
rated-0.129
 redd-0.128
Hours-0.128
 lowest-0.127
Token.
Token ID13
Feature activation+7.50e-01

Pos logit contributions
 reins+0.298
 redd+0.283
Hours+0.280
 lowest+0.277
rated+0.276
Neg logit contributions
's-0.057
 playing-0.054
 that-0.052
 doing-0.043
 wearing-0.041
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
 playing+0.018
's+0.017
 that+0.017
 doing+0.017
e+0.016
Neg logit contributions
 reins-0.044
Hours-0.042
 redd-0.041
 lowest-0.040
Lex-0.040
Token
Token ID198
Feature activation+2.61e-01

Pos logit contributions
 playing+0.007
's+0.007
 doing+0.007
 that+0.007
 sharing+0.007
Neg logit contributions
 reins-0.016
Hours-0.016
Lex-0.015
igi-0.015
ifi-0.015
TokenHours
Token ID39792
Feature activation-1.34e+01

Pos logit contributions
's+0.010
 that+0.010
 playing+0.010
 doing+0.009
 about+0.009
Neg logit contributions
 reins-0.012
 redd-0.011
Hours-0.011
 lowest-0.011
rated-0.011
Token later
Token ID1568
Feature activation-5.67e+00

Pos logit contributions
Hours+0.429
 reins+0.428
ried+0.418
 tighter+0.408
nine+0.404
Neg logit contributions
's-0.565
 living-0.558
 playing-0.546
 about-0.528
 Playing-0.520
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
Hours+0.273
Lex+0.257
 lowest+0.256
 top+0.250
rier+0.248
Neg logit contributions
 living-0.126
 Playing-0.125
e-0.123
 playing-0.107
's-0.107
Token the
Token ID262
Feature activation+9.29e-01

Pos logit contributions
's+0.034
 playing+0.030
e+0.029
 living+0.028
 lived+0.028
Neg logit contributions
 reins-0.096
 redd-0.093
Hours-0.092
 lowest-0.092
Lex-0.088
Token sun
Token ID4252
Feature activation+1.41e+00

Pos logit contributions
's+0.035
e+0.035
 that+0.035
 playing+0.034
 lived+0.034
Neg logit contributions
 reins-0.041
 redd-0.038
Hours-0.037
 lowest-0.037
rated-0.035
Token had
Token ID550
Feature activation+4.01e+00

Pos logit contributions
 Playing+0.048
 playing+0.046
 about+0.046
e+0.045
 sharing+0.044
Neg logit contributions
 reins-0.060
rated-0.060
Hours-0.058
 tighter-0.058
ried-0.057
Token wrong
Token ID2642
Feature activation+8.71e-01

Pos logit contributions
e+0.091
 playing+0.087
's+0.086
 lived+0.084
 named+0.079
Neg logit contributions
rated-0.251
Hours-0.231
 reins-0.231
 weights-0.228
ge-0.227
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
's+0.014
 playing+0.013
 that+0.012
e+0.011
 lived+0.011
Neg logit contributions
 reins-0.055
rated-0.053
Hours-0.052
 redd-0.051
 lowest-0.051
Token
Token ID220
Feature activation-2.75e+00

Pos logit contributions
's+0.010
 that+0.010
 playing+0.010
 about+0.008
,+0.008
Neg logit contributions
 reins-0.027
 redd-0.025
Hours-0.025
rated-0.025
 lowest-0.025
Token
Token ID198
Feature activation+6.84e-02

Pos logit contributions
 reins+0.185
Hours+0.176
 redd+0.175
 lowest+0.172
rated+0.170
Neg logit contributions
's-0.045
 playing-0.043
 that-0.043
 doing-0.035
,-0.034
Token
Token ID198
Feature activation+1.87e-02

Pos logit contributions
's+0.001
 that+0.001
 playing+0.001
,+0.001
 wearing+0.000
Neg logit contributions
 reins-0.005
 redd-0.005
rated-0.005
 lowest-0.005
Hours-0.005
TokenHours
Token ID39792
Feature activation-1.33e+01

Pos logit contributions
's+0.001
 that+0.001
 playing+0.001
e+0.001
 living+0.001
Neg logit contributions
 reins-0.001
 redd-0.001
rated-0.001
 lowest-0.001
ifi-0.001
Token passed
Token ID3804
Feature activation-3.64e+00

Pos logit contributions
 reins+0.494
Hours+0.487
ried+0.460
 tighter+0.451
rated+0.449
Neg logit contributions
's-0.535
 playing-0.520
 living-0.508
 sharing-0.498
 about-0.496
Token and
Token ID290
Feature activation-5.10e-01

Pos logit contributions
 reins+0.257
 redd+0.244
Hours+0.244
 lowest+0.240
rated+0.238
Neg logit contributions
's-0.048
 playing-0.046
 that-0.045
 doing-0.036
 wearing-0.035
Token the
Token ID262
Feature activation+6.46e-01

Pos logit contributions
 reins+0.031
Hours+0.029
 redd+0.029
 lowest+0.028
rated+0.028
Neg logit contributions
's-0.012
 playing-0.011
 that-0.011
 doing-0.010
 about-0.010
Token animal
Token ID5044
Feature activation+3.32e+00

Pos logit contributions
's+0.020
 playing+0.019
 that+0.019
 about+0.018
 doing+0.017
Neg logit contributions
 reins-0.035
 redd-0.032
Hours-0.032
 lowest-0.031
ifi-0.031
Token looked
Token ID3114
Feature activation-4.67e-01

Pos logit contributions
 playing+0.058
 sharing+0.054
 wearing+0.053
 doing+0.052
 Playing+0.050
Neg logit contributions
Hours-0.201
 reins-0.198
rated-0.195
ried-0.189
 lowest-0.189
Token down
Token ID866
Feature activation-4.42e+00

Pos logit contributions
 reins+0.128
Hours+0.119
 redd+0.118
 lowest+0.114
rated+0.113
Neg logit contributions
's-0.097
 that-0.097
 playing-0.096
,-0.089
 about-0.088
Token and
Token ID290
Feature activation-3.48e-01

Pos logit contributions
 reins+0.339
ifi+0.323
 redd+0.322
Hours+0.321
rated+0.316
Neg logit contributions
 that-0.048
,-0.043
 playing-0.042
's-0.040
 in-0.037
Token fell
Token ID3214
Feature activation-2.75e+00

Pos logit contributions
 reins+0.020
Hours+0.019
 redd+0.019
ifi+0.018
 lowest+0.018
Neg logit contributions
's-0.010
 playing-0.010
 that-0.010
,-0.009
 doing-0.008
Token asleep
Token ID16039
Feature activation-2.72e+00

Pos logit contributions
 reins+0.139
 redd+0.133
ifi+0.132
Hours+0.131
igi+0.127
Neg logit contributions
 that-0.096
's-0.094
 playing-0.093
,-0.092
 in-0.089
Token.
Token ID13
Feature activation+5.25e-01

Pos logit contributions
 reins+0.163
 lowest+0.155
Hours+0.153
 redd+0.152
rated+0.152
Neg logit contributions
's-0.054
 Playing-0.045
 playing-0.042
e-0.041
 Friends-0.041
Token Hours
Token ID19347
Feature activation-1.26e+01

Pos logit contributions
's+0.012
 that+0.012
 playing+0.012
 about+0.011
,+0.010
Neg logit contributions
 reins-0.032
 redd-0.030
Hours-0.030
 lowest-0.029
rated-0.029
Token later
Token ID1568
Feature activation-5.64e+00

Pos logit contributions
 reins+0.533
Hours+0.504
rated+0.487
ried+0.487
 lowest+0.481
Neg logit contributions
's-0.452
 playing-0.433
 about-0.430
 living-0.428
 that-0.427
Token it
Token ID340
Feature activation-1.26e+00

Pos logit contributions
Hours+0.315
 lowest+0.314
 reins+0.303
Lex+0.299
rated+0.294
Neg logit contributions
e-0.093
 Playing-0.092
 living-0.087
 sharing-0.087
 Friends-0.086
Token woke
Token ID19092
Feature activation-1.97e+00

Pos logit contributions
 reins+0.078
Hours+0.076
rated+0.075
 redd+0.074
 lowest+0.074
Neg logit contributions
 playing-0.025
 that-0.024
's-0.023
 wearing-0.022
 doing-0.022
Token up
Token ID510
Feature activation-2.95e+00

Pos logit contributions
 reins+0.112
 redd+0.101
Hours+0.098
ifi+0.098
 lowest+0.096
Neg logit contributions
 that-0.063
's-0.063
 playing-0.059
,-0.058
 about-0.057
Token and
Token ID290
Feature activation-3.53e-01

Pos logit contributions
 reins+0.208
 redd+0.198
Hours+0.197
 lowest+0.194
rated+0.193
Neg logit contributions
's-0.041
 playing-0.038
 that-0.037
 doing-0.029
 about-0.029
Token it
Token ID340
Feature activation-1.26e+00

Pos logit contributions
 reins+0.166
Hours+0.158
 redd+0.156
 lowest+0.154
rated+0.151
Neg logit contributions
's-0.068
 playing-0.064
 that-0.061
 about-0.057
 doing-0.056
Token was
Token ID373
Feature activation+1.79e+00

Pos logit contributions
 reins+0.091
Hours+0.088
 redd+0.087
rated+0.086
 lowest+0.085
Neg logit contributions
 playing-0.014
's-0.014
 that-0.014
 doing-0.011
 wearing-0.010
Token smooth
Token ID7209
Feature activation+5.54e-01

Pos logit contributions
 playing+0.050
 that+0.050
 about+0.048
's+0.047
,+0.046
Neg logit contributions
 reins-0.108
 redd-0.099
ifi-0.098
 lowest-0.096
igi-0.095
Token again
Token ID757
Feature activation-6.85e+00

Pos logit contributions
's+0.009
 playing+0.008
 that+0.008
e+0.007
 doing+0.007
Neg logit contributions
 reins-0.037
 redd-0.036
Hours-0.035
 lowest-0.035
rated-0.035
Token.
Token ID13
Feature activation+5.56e-01

Pos logit contributions
 reins+0.396
rated+0.385
Hours+0.382
 lowest+0.382
 former+0.376
Neg logit contributions
 playing-0.123
's-0.122
 lived-0.119
 Playing-0.115
 that-0.109
Token Hours
Token ID19347
Feature activation-1.26e+01

Pos logit contributions
's+0.015
 that+0.015
 playing+0.015
 about+0.014
,+0.013
Neg logit contributions
 reins-0.032
 redd-0.030
Hours-0.030
rated-0.029
 lowest-0.029
Token and
Token ID290
Feature activation-4.66e-01

Pos logit contributions
 reins+0.494
Hours+0.487
ried+0.458
rated+0.451
Lex+0.446
Neg logit contributions
's-0.482
 playing-0.459
 living-0.457
 about-0.452
 that-0.439
Token hours
Token ID2250
Feature activation-3.90e+00

Pos logit contributions
 reins+0.026
 redd+0.025
Hours+0.025
 lowest+0.024
rated+0.024
Neg logit contributions
's-0.013
 playing-0.013
 that-0.012
 doing-0.011
 about-0.011
Token went
Token ID1816
Feature activation-3.56e+00

Pos logit contributions
Hours+0.200
 reins+0.188
ge+0.183
ifi+0.179
ried+0.179
Neg logit contributions
's-0.102
 Friends-0.098
 Playing-0.097
 sharing-0.092
 living-0.090
Token by
Token ID416
Feature activation+3.50e+00

Pos logit contributions
 reins+0.169
Hours+0.161
 redd+0.154
 lowest+0.152
Lex+0.148
Neg logit contributions
's-0.128
 playing-0.118
e-0.118
 lived-0.114
 that-0.113
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
's+0.063
 playing+0.051
 that+0.050
 lived+0.046
 about+0.044
Neg logit contributions
 reins-0.225
Hours-0.214
 lowest-0.212
 redd-0.210
rated-0.208
Token the
Token ID262
Feature activation+8.04e-01

Pos logit contributions
 reins+0.156
Hours+0.147
 redd+0.146
 lowest+0.145
 tighter+0.140
Neg logit contributions
's-0.076
e-0.072
 playing-0.071
 lived-0.070
 doing-0.065
Token door
Token ID3420
Feature activation-3.53e+00

Pos logit contributions
's+0.030
 that+0.029
 playing+0.029
 doing+0.027
 living+0.027
Neg logit contributions
 reins-0.038
Hours-0.035
 redd-0.035
ifi-0.035
igi-0.035
Token would
Token ID561
Feature activation-1.75e+00

Pos logit contributions
 reins+0.190
 redd+0.177
Hours+0.176
 lowest+0.172
rated+0.170
Neg logit contributions
's-0.109
 that-0.106
 playing-0.106
,-0.096
 doing-0.095
Token open
Token ID1280
Feature activation-4.36e+00

Pos logit contributions
 reins+0.104
 redd+0.098
Hours+0.096
rated+0.094
ifi+0.093
Neg logit contributions
 that-0.047
's-0.046
 playing-0.044
,-0.041
 doing-0.040
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
 reins+0.299
 redd+0.283
Hours+0.282
 lowest+0.276
rated+0.275
Neg logit contributions
's-0.071
 that-0.069
 playing-0.068
,-0.057
 about-0.055
Token Hours
Token ID19347
Feature activation-1.26e+01

Pos logit contributions
's+0.017
 playing+0.017
 that+0.016
 doing+0.015
 wearing+0.015
Neg logit contributions
 reins-0.023
Hours-0.021
 redd-0.021
 lowest-0.021
rated-0.020
Token passed
Token ID3804
Feature activation-3.43e+00

Pos logit contributions
 reins+0.469
Hours+0.452
ried+0.433
ge+0.422
rated+0.417
Neg logit contributions
's-0.514
 living-0.495
 playing-0.488
 about-0.480
 sharing-0.473
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 reins+0.225
Hours+0.215
 redd+0.214
 lowest+0.213
rated+0.210
Neg logit contributions
 playing-0.055
's-0.053
 that-0.051
 doing-0.047
 wearing-0.046
Token but
Token ID475
Feature activation+2.29e-02

Pos logit contributions
's+0.036
 playing+0.035
e+0.034
 that+0.033
 lived+0.033
Neg logit contributions
 reins-0.075
Hours-0.070
 redd-0.070
 lowest-0.069
 tighter-0.067
Token no
Token ID645
Feature activation-5.66e+00

Pos logit contributions
's+0.001
 playing+0.001
 living+0.001
 Playing+0.001
e+0.001
Neg logit contributions
Hours-0.001
 reins-0.001
 tighter-0.001
 redd-0.001
 lowest-0.001
Token-
Token ID12
Feature activation+4.61e-01

Pos logit contributions
 reins+0.178
 redd+0.157
Hours+0.156
 lowest+0.148
rated+0.147
Neg logit contributions
's-0.304
 playing-0.300
 that-0.300
,-0.283
 doing-0.283
Token appear
Token ID1656
Feature activation+3.57e+00

Pos logit contributions
 lived+0.030
 playing+0.028
 Playing+0.027
 that+0.027
's+0.026
Neg logit contributions
 reins-0.121
 lowest-0.112
 redd-0.112
rated-0.111
Hours-0.111
Token from
Token ID422
Feature activation+2.71e+00

Pos logit contributions
 playing+0.063
's+0.062
 wearing+0.054
e+0.053
 that+0.053
Neg logit contributions
 reins-0.219
rated-0.211
 redd-0.210
 lowest-0.207
Hours-0.205
Token the
Token ID262
Feature activation+6.29e-01

Pos logit contributions
's+0.066
 playing+0.064
 that+0.064
 doing+0.056
 about+0.055
Neg logit contributions
 reins-0.163
 redd-0.154
Hours-0.153
 lowest-0.149
rated-0.149
Token clay
Token ID21558
Feature activation-2.56e-01

Pos logit contributions
's+0.024
 playing+0.023
 that+0.023
 about+0.021
 doing+0.021
Neg logit contributions
 reins-0.029
 redd-0.027
Hours-0.026
 lowest-0.026
rated-0.026
Token.
Token ID13
Feature activation+8.11e-01

Pos logit contributions
 reins+0.015
 redd+0.014
 lowest+0.014
rated+0.014
Hours+0.014
Neg logit contributions
 playing-0.005
's-0.005
 that-0.005
 about-0.005
 wearing-0.005
Token Hours
Token ID19347
Feature activation-1.22e+01

Pos logit contributions
's+0.020
 playing+0.019
 that+0.019
,+0.017
 about+0.017
Neg logit contributions
 reins-0.049
 redd-0.046
Hours-0.046
rated-0.045
 lowest-0.045
Token passed
Token ID3804
Feature activation-3.48e+00

Pos logit contributions
 reins+0.639
Hours+0.608
 lowest+0.600
ried+0.593
rated+0.591
Neg logit contributions
's-0.315
 playing-0.305
 about-0.294
 living-0.290
 sharing-0.284
Token,
Token ID11
Feature activation+1.46e+00

Pos logit contributions
 reins+0.254
 redd+0.242
Hours+0.241
 lowest+0.237
rated+0.236
Neg logit contributions
's-0.038
 playing-0.037
 that-0.035
 doing-0.027
 about-0.026
Token and
Token ID290
Feature activation-6.08e-02

Pos logit contributions
's+0.042
 playing+0.040
 that+0.040
e+0.038
 lived+0.037
Neg logit contributions
 reins-0.078
 redd-0.074
Hours-0.073
 lowest-0.072
rated-0.070
Token eventually
Token ID4191
Feature activation-5.90e+00

Pos logit contributions
 reins+0.003
 redd+0.003
Hours+0.003
 lowest+0.003
rated+0.003
Neg logit contributions
's-0.002
 playing-0.002
 that-0.002
 about-0.001
 doing-0.001
Token the
Token ID262
Feature activation+5.05e-01

Pos logit contributions
Hours+0.351
 reins+0.350
 redd+0.342
Lex+0.338
 lowest+0.338
Neg logit contributions
 playing-0.106
's-0.100
 living-0.096
 wearing-0.095
e-0.092
Token help
Token ID1037
Feature activation-4.60e-01

Pos logit contributions
 reins+0.116
 redd+0.107
Hours+0.106
 lowest+0.103
rated+0.103
Neg logit contributions
's-0.087
 playing-0.085
 that-0.085
 doing-0.079
 about-0.078
Token you
Token ID345
Feature activation+8.72e-01

Pos logit contributions
 reins+0.027
 redd+0.025
Hours+0.025
 lowest+0.025
rated+0.025
Neg logit contributions
's-0.012
 playing-0.012
 that-0.012
 doing-0.010
 about-0.010
Token.
Token ID13
Feature activation+4.86e-01

Pos logit contributions
's+0.020
 playing+0.019
 that+0.019
,+0.017
 doing+0.016
Neg logit contributions
 reins-0.055
 redd-0.052
Hours-0.052
ifi-0.050
 lowest-0.050
Tokenâ
Token ID22940
Feature activation-2.89e+00

Pos logit contributions
's+0.013
 playing+0.013
 that+0.012
,+0.011
 wearing+0.011
Neg logit contributions
 reins-0.028
 redd-0.027
Hours-0.027
rated-0.026
 lowest-0.026
Token€
Token ID26391
Feature activation+2.14e+00

Pos logit contributions
rated+0.083
 reins+0.080
Hours+0.073
 redd+0.072
ifi+0.071
Neg logit contributions
 playing-0.159
 that-0.152
's-0.151
 living-0.148
 wearing-0.148
Token First
Token ID3274
Feature activation-1.19e+01

Pos logit contributions
 that+0.091
 playing+0.090
's+0.089
 about+0.088
 living+0.088
Neg logit contributions
 reins-0.079
Hours-0.074
 redd-0.070
 lowest-0.070
Lex-0.069
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 reins+0.844
 redd+0.794
Hours+0.793
 lowest+0.793
ifi+0.783
Neg logit contributions
's-0.100
 playing-0.098
 that-0.082
e-0.076
 living-0.076
Token Mom
Token ID11254
Feature activation-4.36e+00

Pos logit contributions
's+0.032
 playing+0.031
 that+0.031
 doing+0.028
 about+0.028
Neg logit contributions
 reins-0.057
 redd-0.053
Hours-0.053
 lowest-0.051
rated-0.051
Token taught
Token ID7817
Feature activation+7.85e-01

Pos logit contributions
 reins+0.252
 redd+0.237
Hours+0.234
 lowest+0.229
rated+0.229
Neg logit contributions
's-0.111
 playing-0.110
 that-0.108
 doing-0.097
 wearing-0.096
Token Mary
Token ID5335
Feature activation-2.30e+00

Pos logit contributions
's+0.023
 playing+0.023
 that+0.022
 lived+0.021
 doing+0.021
Neg logit contributions
 reins-0.041
 redd-0.038
 lowest-0.038
Hours-0.037
ifi-0.037
Token how
Token ID703
Feature activation-4.35e+00

Pos logit contributions
 reins+0.135
 redd+0.126
Hours+0.125
 lowest+0.123
ifi+0.122
Neg logit contributions
's-0.056
 playing-0.055
 that-0.053
 doing-0.049
 wearing-0.049
Token He
Token ID679
Feature activation-8.81e-01

Pos logit contributions
's+0.011
 playing+0.011
 that+0.010
 doing+0.010
e+0.009
Neg logit contributions
 reins-0.021
 redd-0.020
Hours-0.020
 lowest-0.020
Lex-0.019
Token said
Token ID531
Feature activation+4.18e+00

Pos logit contributions
 reins+0.045
Hours+0.043
Lex+0.041
 lowest+0.041
 redd+0.041
Neg logit contributions
 playing-0.026
 doing-0.024
's-0.024
 that-0.024
 sharing-0.024
Token that
Token ID326
Feature activation+4.18e+00

Pos logit contributions
's+0.047
 playing+0.047
 wearing+0.042
e+0.041
 doing+0.039
Neg logit contributions
 reins-0.286
Hours-0.274
 redd-0.273
Lex-0.270
 lowest-0.266
Token no
Token ID645
Feature activation-5.92e+00

Pos logit contributions
's+0.112
e+0.103
 playing+0.101
 that+0.100
 about+0.095
Neg logit contributions
 reins-0.223
 redd-0.219
Hours-0.209
ifi-0.208
Lex-0.206
Token-
Token ID12
Feature activation+2.71e-01

Pos logit contributions
 reins+0.277
 redd+0.254
Hours+0.251
 lowest+0.249
ifi+0.246
Neg logit contributions
's-0.219
 playing-0.213
 that-0.212
 doing-0.198
 about-0.196
Tokenone
Token ID505
Feature activation-1.19e+01

Pos logit contributions
 that+0.012
's+0.012
 playing+0.011
 of+0.011
e+0.011
Neg logit contributions
 reins-0.011
Hours-0.010
 redd-0.010
rated-0.010
 lowest-0.010
Token should
Token ID815
Feature activation+3.02e+00

Pos logit contributions
 reins+0.628
 lowest+0.595
ried+0.585
rated+0.581
ifi+0.578
Neg logit contributions
 playing-0.247
 Playing-0.245
 wearing-0.223
 sharing-0.223
 doing-0.221
Token fight
Token ID1907
Feature activation+3.69e+00

Pos logit contributions
's+0.092
 playing+0.091
 living+0.086
 wearing+0.085
 that+0.084
Neg logit contributions
 reins-0.150
Hours-0.140
ifi-0.139
 lowest-0.139
 redd-0.138
Token and
Token ID290
Feature activation-4.08e-01

Pos logit contributions
's+0.076
 playing+0.075
e+0.071
 lived+0.070
 living+0.070
Neg logit contributions
 reins-0.206
Hours-0.202
 lowest-0.197
rated-0.196
 tighter-0.193
Token that
Token ID326
Feature activation+4.19e+00

Pos logit contributions
 reins+0.021
ried+0.020
Hours+0.020
 lowest+0.020
 redd+0.020
Neg logit contributions
's-0.010
 playing-0.010
e-0.010
 doing-0.009
 Playing-0.009
Token Mitt
Token ID16627
Feature activation-2.16e+00

Pos logit contributions
e+0.104
's+0.100
 that+0.100
 about+0.099
 playing+0.098
Neg logit contributions
 redd-0.215
 reins-0.213
 lowest-0.201
Hours-0.201
rated-0.200
Token their
Token ID511
Feature activation+2.41e+00

Pos logit contributions
's+0.067
 playing+0.062
 that+0.060
 doing+0.060
 wearing+0.059
Neg logit contributions
 reins-0.154
 redd-0.146
Hours-0.146
 lowest-0.144
ifi-0.140
Token families
Token ID4172
Feature activation+9.30e-01

Pos logit contributions
's+0.089
 that+0.087
 playing+0.086
 about+0.081
 doing+0.080
Neg logit contributions
 reins-0.116
 redd-0.105
Hours-0.104
 lowest-0.102
rated-0.100
Token.
Token ID13
Feature activation+8.35e-01

Pos logit contributions
 playing+0.015
 lived+0.015
's+0.014
e+0.013
 doing+0.013
Neg logit contributions
 reins-0.059
 redd-0.054
 lowest-0.054
 tighter-0.053
rated-0.052
Token They
Token ID1119
Feature activation+3.92e+00

Pos logit contributions
's+0.021
 playing+0.021
 that+0.020
 doing+0.018
 about+0.018
Neg logit contributions
 reins-0.050
 redd-0.047
Hours-0.046
 lowest-0.045
rated-0.045
Token call
Token ID869
Feature activation+2.99e+00

Pos logit contributions
's+0.122
 playing+0.119
 that+0.118
 doing+0.108
 about+0.107
Neg logit contributions
 reins-0.210
 redd-0.195
Hours-0.193
 lowest-0.190
rated-0.188
Token 911
Token ID16679
Feature activation-1.18e+01

Pos logit contributions
's+0.082
 playing+0.078
 that+0.077
 doing+0.073
 wearing+0.072
Neg logit contributions
 reins-0.168
Hours-0.155
 redd-0.155
 lowest-0.152
rated-0.151
Token.
Token ID13
Feature activation+8.19e-01

Pos logit contributions
 reins+0.730
ried+0.653
 lowest+0.652
rated+0.649
 tighter+0.634
Neg logit contributions
 playing-0.211
's-0.196
e-0.187
 doing-0.187
 wearing-0.185
Token They
Token ID1119
Feature activation+3.86e+00

Pos logit contributions
's+0.017
 playing+0.016
 that+0.016
 doing+0.014
 about+0.014
Neg logit contributions
 reins-0.052
 redd-0.049
Hours-0.049
 lowest-0.048
rated-0.048
Token hope
Token ID2911
Feature activation-1.62e+00

Pos logit contributions
's+0.120
 playing+0.119
 that+0.113
 wearing+0.110
 doing+0.108
Neg logit contributions
 reins-0.203
 redd-0.183
 lowest-0.181
Hours-0.180
rated-0.177
Token it
Token ID340
Feature activation-1.22e+00

Pos logit contributions
 reins+0.085
Hours+0.084
ifi+0.080
 redd+0.080
 lowest+0.080
Neg logit contributions
's-0.042
e-0.040
 playing-0.039
 wearing-0.037
 doing-0.037
Token is
Token ID318
Feature activation+2.32e+00

Pos logit contributions
 reins+0.059
 redd+0.055
Hours+0.054
 lowest+0.053
ifi+0.053
Neg logit contributions
's-0.045
 that-0.043
 playing-0.043
,-0.040
 about-0.040
Token said
Token ID531
Feature activation+4.28e+00

Pos logit contributions
 reins+0.179
Hours+0.168
 tighter+0.166
rated+0.166
ried+0.164
Neg logit contributions
 playing-0.089
e-0.087
 that-0.086
 sharing-0.084
 doing-0.083
Token don
Token ID836
Feature activation+3.61e-01

Pos logit contributions
e+0.051
's+0.048
 playing+0.046
 wearing+0.042
 doing+0.038
Neg logit contributions
 reins-0.285
Hours-0.270
 redd-0.269
Lex-0.265
 lowest-0.263
Token't
Token ID470
Feature activation-2.51e+00

Pos logit contributions
 playing+0.014
's+0.014
 that+0.013
 living+0.013
 sharing+0.013
Neg logit contributions
 reins-0.016
Hours-0.015
 lowest-0.014
Lex-0.014
rated-0.013
Token worry
Token ID5490
Feature activation+2.16e-01

Pos logit contributions
 reins+0.116
Hours+0.107
 redd+0.106
 lowest+0.104
rated+0.104
Neg logit contributions
's-0.090
 that-0.086
 playing-0.086
 lived-0.084
e-0.083
Token.
Token ID13
Feature activation+6.45e-01

Pos logit contributions
 playing+0.002
's+0.002
e+0.002
 lived+0.002
 wearing+0.002
Neg logit contributions
 reins-0.015
Hours-0.014
 redd-0.014
rated-0.014
 lowest-0.014
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.019
 that+0.018
 playing+0.018
,+0.017
 about+0.017
Neg logit contributions
 reins-0.036
 redd-0.034
Hours-0.034
rated-0.033
ifi-0.033
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 reins+0.703
Hours+0.683
 lowest+0.677
ifi+0.672
 redd+0.672
Neg logit contributions
 living-0.177
 Playing-0.173
e-0.164
 wearing-0.159
 playing-0.153
Token we
Token ID356
Feature activation-6.99e-01

Pos logit contributions
 that+0.032
's+0.032
 playing+0.031
,+0.029
 about+0.027
Neg logit contributions
 reins-0.082
 redd-0.076
Hours-0.075
rated-0.074
 lowest-0.074
Token need
Token ID761
Feature activation+1.45e-01

Pos logit contributions
 reins+0.040
 redd+0.037
Hours+0.037
 lowest+0.036
rated+0.036
Neg logit contributions
's-0.020
 that-0.020
 playing-0.019
,-0.018
 doing-0.017
Token to
Token ID284
Feature activation-9.32e-01

Pos logit contributions
 that+0.003
 playing+0.003
's+0.003
,+0.003
 living+0.003
Neg logit contributions
 reins-0.009
 redd-0.009
rated-0.009
Hours-0.009
 lowest-0.008
Token soak
Token ID33182
Feature activation+7.05e-01

Pos logit contributions
 reins+0.046
 redd+0.043
Hours+0.042
 lowest+0.041
rated+0.041
Neg logit contributions
's-0.033
 that-0.032
 playing-0.032
,-0.030
 doing-0.030
Tokenies
Token ID444
Feature activation+2.74e+00

Pos logit contributions
 reins+0.070
 redd+0.062
igi+0.059
 lowest+0.058
ifi+0.058
Neg logit contributions
's-0.091
 playing-0.089
 that-0.088
 about-0.085
 doing-0.084
Tokenâ
Token ID22940
Feature activation-2.77e+00

Pos logit contributions
e+0.047
's+0.040
 Playing+0.039
 story+0.036
 Friends+0.034
Neg logit contributions
 reins-0.153
ried-0.141
 redd-0.140
 weights-0.135
ge-0.134
Token€
Token ID26391
Feature activation+2.20e+00

Pos logit contributions
 reins+0.097
rated+0.092
Hours+0.088
 redd+0.087
ifi+0.087
Neg logit contributions
 that-0.136
 playing-0.131
's-0.129
 living-0.124
 wearing-0.124
Token�
Token ID447
Feature activation-2.51e+00

Pos logit contributions
 that+0.093
 doing+0.089
 about+0.088
 being+0.087
 playing+0.086
Neg logit contributions
 reins-0.076
Hours-0.075
First-0.069
ifi-0.068
Lex-0.067
Token�
Token ID251
Feature activation-2.97e+00

Pos logit contributions
 reins+0.106
 redd+0.097
Hours+0.094
rated+0.093
 lowest+0.092
Neg logit contributions
's-0.109
 playing-0.107
 that-0.106
,-0.100
 doing-0.099
Tokenone
Token ID505
Feature activation-1.18e+01

Pos logit contributions
 reins+0.166
 redd+0.154
Hours+0.153
 lowest+0.150
ifi+0.148
Neg logit contributions
's-0.079
 playing-0.077
 that-0.075
 doing-0.071
 wearing-0.069
Token shiny
Token ID22441
Feature activation+6.66e-01

Pos logit contributions
 reins+0.614
ifi+0.565
 redd+0.563
Hours+0.556
rated+0.553
Neg logit contributions
 playing-0.326
 doing-0.308
 that-0.298
's-0.295
 Playing-0.291
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 playing+0.023
's+0.023
 doing+0.022
 that+0.022
 living+0.021
Neg logit contributions
 reins-0.032
 redd-0.029
 lowest-0.028
rated-0.027
 weights-0.027
Token one
Token ID530
Feature activation-7.57e+00

Pos logit contributions
's+0.046
 that+0.045
 playing+0.044
 lived+0.042
 doing+0.042
Neg logit contributions
 reins-0.063
 redd-0.061
Hours-0.058
 lowest-0.057
ifi-0.055
Token dull
Token ID19222
Feature activation-5.52e-02

Pos logit contributions
 reins+0.409
 redd+0.382
 tighter+0.375
ifi+0.372
 lowest+0.366
Neg logit contributions
 Playing-0.172
 playing-0.169
 doing-0.167
's-0.157
 sharing-0.153
Tokenâ
Token ID22940
Feature activation-2.86e+00

Pos logit contributions
 reins+0.003
 redd+0.003
 lowest+0.003
rated+0.003
ifi+0.002
Neg logit contributions
's-0.002
 playing-0.002
 that-0.002
 doing-0.002
 about-0.001
Token€
Token ID26391
Feature activation+2.29e+00

Pos logit contributions
 reins+0.103
Hours+0.094
 redd+0.093
rated+0.090
 lowest+0.090
Neg logit contributions
's-0.129
 that-0.129
 playing-0.128
 about-0.120
 doing-0.120
Tokenâ„¢
Token ID8151
Feature activation+7.15e-02

Pos logit contributions
 that+0.100
 doing+0.096
 playing+0.095
 about+0.095
 being+0.095
Neg logit contributions
Hours-0.078
 reins-0.073
ifi-0.068
First-0.068
Half-0.067
Tokens
Token ID82
Feature activation+2.93e+00

Pos logit contributions
's+0.001
 playing+0.001
 that+0.001
 doing+0.001
 lived+0.001
Neg logit contributions
 reins-0.005
 redd-0.004
Hours-0.004
 lowest-0.004
ifi-0.004
Token plan
Token ID1410
Feature activation-7.16e-02

Pos logit contributions
's+0.066
 that+0.062
 playing+0.061
e+0.055
 doing+0.054
Neg logit contributions
 reins-0.181
 redd-0.168
 lowest-0.167
Hours-0.166
rated-0.163
Token.
Token ID13
Feature activation+5.85e-01

Pos logit contributions
 reins+0.004
Hours+0.004
 lowest+0.004
 redd+0.004
ried+0.004
Neg logit contributions
's-0.001
 playing-0.001
 doing-0.001
e-0.001
 lived-0.001
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.015
 playing+0.014
 that+0.014
 about+0.013
,+0.013
Neg logit contributions
 reins-0.036
 redd-0.033
Hours-0.033
rated-0.033
igi-0.032
Token,
Token ID11
Feature activation+1.18e+00

Pos logit contributions
 reins+0.736
Hours+0.723
ifi+0.721
Lex+0.704
 redd+0.698
Neg logit contributions
e-0.141
 living-0.134
 about-0.127
 sharing-0.125
 Playing-0.124
Token they
Token ID484
Feature activation+2.44e+00

Pos logit contributions
's+0.025
 that+0.025
 playing+0.025
,+0.021
 doing+0.021
Neg logit contributions
 reins-0.075
 redd-0.071
Hours-0.071
rated-0.069
 lowest-0.069
Token looked
Token ID3114
Feature activation-3.52e-01

Pos logit contributions
's+0.060
 playing+0.057
 that+0.057
,+0.051
 lived+0.049
Neg logit contributions
 reins-0.151
 redd-0.141
Hours-0.141
rated-0.139
 lowest-0.138
Token for
Token ID329
Feature activation+7.19e-01

Pos logit contributions
 reins+0.026
Hours+0.025
ifi+0.025
 redd+0.024
igi+0.024
Neg logit contributions
 that-0.005
's-0.005
 playing-0.005
,-0.004
 in-0.004
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
's+0.020
 that+0.019
 playing+0.019
,+0.017
 wearing+0.017
Neg logit contributions
 reins-0.041
 redd-0.039
Hours-0.039
rated-0.038
 lowest-0.038
Token to
Token ID284
Feature activation-9.06e-01

Pos logit contributions
 reins+0.371
 redd+0.349
Hours+0.347
rated+0.345
 lowest+0.340
Neg logit contributions
 that-0.131
's-0.127
 playing-0.124
,-0.111
 about-0.107
Token pretend
Token ID16614
Feature activation+9.15e-01

Pos logit contributions
 reins+0.049
 redd+0.045
Hours+0.045
 lowest+0.044
rated+0.044
Neg logit contributions
's-0.029
 playing-0.028
 that-0.028
,-0.025
 doing-0.025
Token to
Token ID284
Feature activation-8.88e-01

Pos logit contributions
e+0.032
apped+0.030
 Friends+0.029
 lived+0.029
's+0.029
Neg logit contributions
 stool-0.034
 reins-0.033
 lowest-0.033
Hours-0.032
 redd-0.032
Token cook
Token ID4255
Feature activation+2.34e+00

Pos logit contributions
 reins+0.054
 redd+0.050
Hours+0.050
 lowest+0.049
rated+0.048
Neg logit contributions
's-0.023
 playing-0.022
 that-0.022
,-0.020
 doing-0.019
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
's+0.067
 playing+0.063
 that+0.062
 lived+0.060
 wearing+0.058
Neg logit contributions
 reins-0.119
 redd-0.115
 lowest-0.111
Hours-0.110
 weights-0.107
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.012
 that+0.012
 playing+0.011
,+0.010
 about+0.010
Neg logit contributions
 reins-0.042
 redd-0.040
Hours-0.040
rated-0.039
 lowest-0.039
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
Hours+0.650
 reins+0.646
 stool+0.641
ifi+0.640
 First+0.636
Neg logit contributions
 living-0.198
 Playing-0.198
 Friends-0.190
 sharing-0.188
e-0.176
Token Mary
Token ID5335
Feature activation-2.02e+00

Pos logit contributions
 that+0.037
's+0.036
 playing+0.035
,+0.034
 about+0.032
Neg logit contributions
 reins-0.080
 redd-0.075
Hours-0.074
rated-0.074
igi-0.073
Token picked
Token ID6497
Feature activation-1.92e+00

Pos logit contributions
 reins+0.116
 redd+0.111
Hours+0.111
rated+0.107
 lowest+0.107
Neg logit contributions
 playing-0.049
 that-0.049
's-0.048
 doing-0.044
 about-0.044
Token up
Token ID510
Feature activation-2.94e+00

Pos logit contributions
 reins+0.110
Hours+0.103
 redd+0.102
rated+0.101
ifi+0.101
Neg logit contributions
's-0.056
 that-0.054
 playing-0.053
,-0.049
 about-0.048
Token a
Token ID257
Feature activation-9.38e-01

Pos logit contributions
 reins+0.194
rated+0.183
Hours+0.183
 redd+0.180
igi+0.178
Neg logit contributions
 that-0.060
's-0.060
 playing-0.057
,-0.052
 about-0.048
Token He
Token ID679
Feature activation-8.56e-01

Pos logit contributions
's+0.011
 that+0.010
 playing+0.010
 about+0.009
,+0.009
Neg logit contributions
 reins-0.040
rated-0.037
 redd-0.037
Hours-0.037
 lowest-0.037
Token thanked
Token ID26280
Feature activation+1.22e+00

Pos logit contributions
 reins+0.050
Hours+0.047
 redd+0.047
rated+0.046
 lowest+0.046
Neg logit contributions
 playing-0.021
's-0.019
 that-0.019
 doing-0.018
 wearing-0.017
Token the
Token ID262
Feature activation+5.42e-01

Pos logit contributions
's+0.027
 playing+0.025
 that+0.023
 living+0.022
 doing+0.022
Neg logit contributions
 reins-0.076
 redd-0.070
 lowest-0.070
Hours-0.070
ifi-0.069
Token an
Token ID281
Feature activation-2.89e+00

Pos logit contributions
's+0.021
 playing+0.020
 that+0.020
 about+0.019
 doing+0.019
Neg logit contributions
 reins-0.025
 redd-0.023
Hours-0.023
 lowest-0.022
rated-0.022
Tokenem
Token ID368
Feature activation-3.41e-01

Pos logit contributions
 reins+0.141
Hours+0.135
 redd+0.133
Lex+0.130
igi+0.128
Neg logit contributions
 that-0.103
's-0.103
 about-0.101
 playing-0.100
,-0.098
Tokenone
Token ID505
Feature activation-1.18e+01

Pos logit contributions
 reins+0.015
 redd+0.014
 lowest+0.013
ifi+0.013
igi+0.013
Neg logit contributions
 playing-0.014
's-0.013
 that-0.013
 wearing-0.012
 about-0.012
Token and
Token ID290
Feature activation-4.19e-01

Pos logit contributions
 reins+0.738
 tighter+0.658
 lowest+0.658
rated+0.635
ifi+0.629
Neg logit contributions
 playing-0.214
 Playing-0.194
 sharing-0.183
 lived-0.172
 doing-0.171
Token then
Token ID788
Feature activation-3.66e+00

Pos logit contributions
 reins+0.024
 redd+0.022
Hours+0.022
 lowest+0.021
rated+0.021
Neg logit contributions
's-0.011
 playing-0.011
 that-0.011
 doing-0.010
 about-0.010
Token hurried
Token ID31662
Feature activation-6.84e+00

Pos logit contributions
 reins+0.184
Hours+0.171
 redd+0.169
 tighter+0.164
 lowest+0.163
Neg logit contributions
 playing-0.111
's-0.108
e-0.106
 doing-0.105
 living-0.102
Token back
Token ID736
Feature activation-6.19e+00

Pos logit contributions
 reins+0.326
Hours+0.291
 redd+0.282
rated+0.281
 lowest+0.281
Neg logit contributions
's-0.250
 playing-0.249
 that-0.236
 lived-0.235
 doing-0.233
Token up
Token ID510
Feature activation-3.14e+00

Pos logit contributions
 reins+0.322
Hours+0.283
ried+0.278
 lowest+0.277
ifi+0.274
Neg logit contributions
's-0.189
 playing-0.180
 that-0.177
 lived-0.176
 doing-0.176
Token reverse
Token ID9575
Feature activation-3.83e-01

Pos logit contributions
 reins+0.045
 redd+0.041
Hours+0.041
 lowest+0.040
rated+0.040
Neg logit contributions
's-0.034
 that-0.033
 playing-0.033
,-0.030
 doing-0.030
Token around
Token ID1088
Feature activation+9.34e-01

Pos logit contributions
 reins+0.026
 redd+0.025
Hours+0.024
 lowest+0.024
ifi+0.024
Neg logit contributions
's-0.007
 that-0.007
 playing-0.007
,-0.006
 about-0.006
Token the
Token ID262
Feature activation+7.60e-01

Pos logit contributions
's+0.019
 lived+0.017
 that+0.016
 playing+0.016
 about+0.016
Neg logit contributions
 reins-0.053
Hours-0.052
 lowest-0.051
 weights-0.051
 redd-0.050
Token house
Token ID2156
Feature activation+1.08e+00

Pos logit contributions
's+0.024
 playing+0.023
 that+0.023
 living+0.021
,+0.021
Neg logit contributions
 reins-0.042
 redd-0.039
Hours-0.039
ifi-0.038
rated-0.038
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
 lived+0.019
's+0.019
 playing+0.017
e+0.017
 Playing+0.016
Neg logit contributions
 reins-0.063
Hours-0.061
 lowest-0.060
rated-0.059
 tighter-0.059
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.009
 that+0.008
 playing+0.008
 about+0.007
,+0.007
Neg logit contributions
 reins-0.022
 redd-0.021
Hours-0.021
rated-0.020
 lowest-0.020
Token he
Token ID339
Feature activation-7.78e-01

Pos logit contributions
Hours+0.683
 reins+0.659
 First+0.643
 lowest+0.636
ifi+0.632
Neg logit contributions
 living-0.196
e-0.185
 Playing-0.185
 Friends-0.178
 sharing-0.175
Token slowly
Token ID6364
Feature activation-7.22e+00

Pos logit contributions
 reins+0.044
 redd+0.041
Hours+0.040
 lowest+0.040
rated+0.039
Neg logit contributions
's-0.024
 that-0.022
 playing-0.022
,-0.020
 about-0.020
Token backed
Token ID9763
Feature activation-2.66e+00

Pos logit contributions
 reins+0.328
Hours+0.311
ried+0.294
 redd+0.292
 tighter+0.291
Neg logit contributions
 playing-0.249
 sharing-0.247
 doing-0.245
's-0.243
 that-0.242
Token away
Token ID1497
Feature activation-7.93e-01

Pos logit contributions
 reins+0.152
 redd+0.145
ifi+0.139
 former+0.138
Hours+0.137
Neg logit contributions
 that-0.082
's-0.081
,-0.077
 in-0.076
 with-0.072
Token from
Token ID422
Feature activation+1.92e+00

Pos logit contributions
 reins+0.055
Hours+0.052
 redd+0.052
 lowest+0.051
rated+0.051
Neg logit contributions
's-0.012
 playing-0.011
 that-0.011
 doing-0.009
 wearing-0.009
Token decided
Token ID3066
Feature activation-5.78e+00

Pos logit contributions
 reins+0.123
 redd+0.116
Hours+0.115
 lowest+0.113
rated+0.112
Neg logit contributions
's-0.047
 that-0.045
 playing-0.045
,-0.039
 about-0.039
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
 reins+0.326
Hours+0.307
 redd+0.306
 lowest+0.298
rated+0.294
Neg logit contributions
's-0.147
 playing-0.144
e-0.136
 that-0.133
 lived-0.129
Token find
Token ID1064
Feature activation-6.63e-01

Pos logit contributions
 reins+0.049
 redd+0.045
Hours+0.044
 lowest+0.043
ifi+0.043
Neg logit contributions
's-0.038
 playing-0.037
 that-0.037
 doing-0.034
 about-0.034
Token out
Token ID503
Feature activation-3.70e+00

Pos logit contributions
 reins+0.038
 redd+0.036
Hours+0.036
 lowest+0.035
rated+0.034
Neg logit contributions
's-0.018
 playing-0.017
 that-0.017
 doing-0.015
 wearing-0.015
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
 reins+0.223
Hours+0.210
 lowest+0.206
 redd+0.205
rated+0.201
Neg logit contributions
's-0.081
 playing-0.076
e-0.068
 doing-0.067
 wearing-0.067
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.015
 playing+0.015
 that+0.014
 about+0.013
,+0.013
Neg logit contributions
 reins-0.029
 redd-0.027
Hours-0.027
rated-0.027
ifi-0.026
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
 reins+0.676
Hours+0.665
 First+0.657
 stool+0.652
 lowest+0.650
Neg logit contributions
e-0.176
 living-0.171
 Playing-0.169
 sharing-0.161
 Friends-0.155
Token she
Token ID673
Feature activation-1.71e+00

Pos logit contributions
's+0.037
 playing+0.036
 that+0.035
 doing+0.033
e+0.033
Neg logit contributions
 reins-0.066
 redd-0.062
Hours-0.062
 lowest-0.061
rated-0.059
Token decided
Token ID3066
Feature activation-5.77e+00

Pos logit contributions
 reins+0.098
 redd+0.092
Hours+0.091
 lowest+0.089
rated+0.089
Neg logit contributions
's-0.047
 playing-0.045
 that-0.045
,-0.040
 doing-0.040
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
 reins+0.337
 redd+0.317
Hours+0.316
 lowest+0.309
rated+0.305
Neg logit contributions
's-0.145
 playing-0.141
 that-0.136
 doing-0.124
e-0.124
Token make
Token ID787
Feature activation+2.60e+00

Pos logit contributions
 reins+0.048
 redd+0.044
Hours+0.044
 lowest+0.043
rated+0.043
Neg logit contributions
's-0.037
 playing-0.037
 that-0.037
 doing-0.034
 about-0.034
Token to
Token ID284
Feature activation-9.96e-01

Pos logit contributions
 reins+0.187
 redd+0.177
 tighter+0.165
 lowest+0.165
Hours+0.165
Neg logit contributions
 lived-0.117
's-0.113
e-0.112
 that-0.108
 Playing-0.106
Token feed
Token ID3745
Feature activation-1.02e-01

Pos logit contributions
 reins+0.048
 redd+0.044
Hours+0.044
 lowest+0.042
rated+0.042
Neg logit contributions
's-0.037
 playing-0.036
 that-0.036
 doing-0.033
,-0.033
Token the
Token ID262
Feature activation+7.11e-01

Pos logit contributions
 reins+0.006
 redd+0.006
 lowest+0.006
Hours+0.006
ifi+0.006
Neg logit contributions
's-0.002
 playing-0.002
 that-0.002
 doing-0.002
 wearing-0.002
Token girl
Token ID2576
Feature activation+2.69e+00

Pos logit contributions
's+0.036
 playing+0.035
 that+0.035
,+0.033
 doing+0.033
Neg logit contributions
 reins-0.026
Hours-0.026
ifi-0.024
igi-0.024
rated-0.024
Token.
Token ID13
Feature activation+4.34e-01

Pos logit contributions
 that+0.038
 playing+0.037
 doing+0.036
 lived+0.036
 wearing+0.033
Neg logit contributions
 reins-0.173
 redd-0.167
 lowest-0.162
ifi-0.157
 tighter-0.157
Token First
Token ID3274
Feature activation-1.18e+01

Pos logit contributions
's+0.012
 that+0.011
 playing+0.011
 about+0.010
,+0.010
Neg logit contributions
 reins-0.025
 redd-0.024
Hours-0.024
rated-0.023
 lowest-0.023
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
 reins+0.679
Hours+0.678
ifi+0.670
Lex+0.665
 lowest+0.660
Neg logit contributions
 Playing-0.161
e-0.159
 Friends-0.157
 living-0.154
 wearing-0.146
Token she
Token ID673
Feature activation-1.77e+00

Pos logit contributions
's+0.024
 playing+0.023
 that+0.022
 doing+0.019
 about+0.019
Neg logit contributions
 reins-0.073
 redd-0.069
Hours-0.069
 lowest-0.068
rated-0.067
Token gave
Token ID2921
Feature activation-6.10e-02

Pos logit contributions
 reins+0.098
 redd+0.092
Hours+0.091
 lowest+0.089
rated+0.088
Neg logit contributions
's-0.049
 playing-0.049
 that-0.048
 doing-0.044
 about-0.043
Token her
Token ID607
Feature activation-2.05e+00

Pos logit contributions
 reins+0.004
Hours+0.004
 redd+0.004
rated+0.004
 lowest+0.003
Neg logit contributions
's-0.001
 that-0.001
 playing-0.001
,-0.001
 about-0.001
Token the
Token ID262
Feature activation+6.89e-01

Pos logit contributions
 reins+0.133
Hours+0.132
rated+0.129
igi+0.129
ifi+0.126
Neg logit contributions
's-0.050
 that-0.048
 playing-0.043
,-0.042
 in-0.037
Token what
Token ID644
Feature activation+2.73e-01

Pos logit contributions
 reins+0.164
 redd+0.153
 lowest+0.150
Hours+0.149
rated+0.146
Neg logit contributions
's-0.097
 playing-0.093
 that-0.090
e-0.087
 doing-0.085
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
 playing+0.008
e+0.008
 doing+0.007
's+0.007
apping+0.007
Neg logit contributions
 redd-0.013
 reins-0.012
Hours-0.012
 lowest-0.012
Lex-0.011
Token was
Token ID373
Feature activation+1.91e+00

Pos logit contributions
 redd+0.074
 lowest+0.072
rated+0.072
 reins+0.071
ried+0.068
Neg logit contributions
 playing-0.026
 that-0.023
e-0.023
 sharing-0.023
's-0.022
Token going
Token ID1016
Feature activation-3.58e+00

Pos logit contributions
's+0.035
e+0.035
 playing+0.033
 that+0.031
 Playing+0.031
Neg logit contributions
 redd-0.109
 reins-0.109
rated-0.108
 lowest-0.105
igi-0.102
Token to
Token ID284
Feature activation-8.48e-01

Pos logit contributions
 reins+0.221
 redd+0.211
 lowest+0.207
Hours+0.205
rated+0.203
Neg logit contributions
's-0.069
 playing-0.066
 that-0.063
e-0.061
 living-0.057
Token do
Token ID466
Feature activation+7.62e+00

Pos logit contributions
 reins+0.043
 lowest+0.042
 redd+0.041
rated+0.041
ifi+0.040
Neg logit contributions
 playing-0.022
's-0.022
 that-0.020
 wearing-0.020
 living-0.019
Token next
Token ID1306
Feature activation-3.60e+00

Pos logit contributions
's+0.114
 playing+0.108
 that+0.098
 doing+0.089
 wearing+0.088
Neg logit contributions
 reins-0.486
Hours-0.474
 redd-0.468
 lowest-0.462
rated-0.459
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
 reins+0.241
 redd+0.228
Hours+0.226
 lowest+0.226
rated+0.226
Neg logit contributions
 playing-0.056
's-0.052
 that-0.048
 lived-0.044
 doing-0.043
Token
Token ID220
Feature activation-2.59e+00

Pos logit contributions
's+0.012
 that+0.012
 playing+0.011
,+0.010
 about+0.010
Neg logit contributions
 reins-0.038
 redd-0.036
Hours-0.035
rated-0.035
 lowest-0.035
Token
Token ID198
Feature activation+2.63e-01

Pos logit contributions
 reins+0.178
 redd+0.168
Hours+0.168
 lowest+0.165
rated+0.164
Neg logit contributions
's-0.041
 playing-0.039
 that-0.039
 doing-0.032
 about-0.031
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
's+0.003
 that+0.003
 playing+0.003
,+0.002
 about+0.002
Neg logit contributions
 reins-0.020
 redd-0.019
rated-0.019
 lowest-0.018
Hours-0.018
Token until
Token ID1566
Feature activation-2.70e+00

Pos logit contributions
 reins+0.073
Hours+0.069
 redd+0.069
 lowest+0.068
rated+0.067
Neg logit contributions
 playing-0.022
's-0.022
 that-0.021
 doing-0.019
 wearing-0.019
Token it
Token ID340
Feature activation-1.12e+00

Pos logit contributions
 reins+0.168
 redd+0.159
Hours+0.158
 lowest+0.155
rated+0.153
Neg logit contributions
's-0.060
 playing-0.057
 that-0.056
 doing-0.050
 about-0.050
Token looked
Token ID3114
Feature activation-7.47e-02

Pos logit contributions
 reins+0.064
Hours+0.063
 redd+0.062
rated+0.061
 lowest+0.060
Neg logit contributions
 playing-0.028
 that-0.026
's-0.026
 doing-0.025
 wearing-0.024
Token like
Token ID588
Feature activation+1.69e+00

Pos logit contributions
 reins+0.005
 redd+0.004
Hours+0.004
 lowest+0.004
ifi+0.004
Neg logit contributions
's-0.002
 that-0.002
 playing-0.002
,-0.002
 about-0.002
Token a
Token ID257
Feature activation-7.19e-01

Pos logit contributions
's+0.046
 playing+0.043
 that+0.042
 doing+0.039
 about+0.039
Neg logit contributions
 reins-0.096
 redd-0.091
Hours-0.090
 lowest-0.088
ifi-0.086
Token dog
Token ID3290
Feature activation+5.08e+00

Pos logit contributions
 reins+0.035
 redd+0.032
Hours+0.032
rated+0.031
 lowest+0.031
Neg logit contributions
's-0.026
 that-0.026
 playing-0.026
,-0.024
 doing-0.024
Token.
Token ID13
Feature activation+8.74e-01

Pos logit contributions
apped+0.120
 Playing+0.112
 lived+0.107
 Friends+0.103
apping+0.103
Neg logit contributions
 reins-0.238
 lowest-0.229
ge-0.221
 ten-0.215
 tighter-0.213
Token He
Token ID679
Feature activation-6.42e-01

Pos logit contributions
's+0.017
 that+0.017
 playing+0.016
 about+0.015
,+0.014
Neg logit contributions
 reins-0.057
 redd-0.054
Hours-0.054
rated-0.053
 lowest-0.053
Token gave
Token ID2921
Feature activation+2.10e-01

Pos logit contributions
 reins+0.036
 redd+0.034
Hours+0.034
 lowest+0.033
Lex+0.033
Neg logit contributions
 playing-0.016
 doing-0.015
's-0.015
 that-0.015
 about-0.014
Token it
Token ID340
Feature activation-1.08e+00

Pos logit contributions
's+0.004
 that+0.004
 playing+0.004
,+0.003
 in+0.003
Neg logit contributions
 reins-0.014
Hours-0.014
rated-0.013
 redd-0.013
 former-0.013
Token to
Token ID284
Feature activation-8.12e-01

Pos logit contributions
 reins+0.065
 redd+0.062
Hours+0.061
rated+0.061
 lowest+0.060
Neg logit contributions
's-0.027
 that-0.026
 playing-0.025
,-0.022
 about-0.022
Token witch
Token ID16365
Feature activation+5.54e-01

Pos logit contributions
 that+0.029
's+0.029
 playing+0.028
 doing+0.026
 living+0.026
Neg logit contributions
 reins-0.037
Hours-0.034
 redd-0.034
rated-0.033
ifi-0.033
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
 Playing+0.012
 playing+0.011
 wearing+0.011
 lived+0.011
apped+0.011
Neg logit contributions
 reins-0.031
 tighter-0.030
 lowest-0.029
rated-0.029
ried-0.028
Token They
Token ID1119
Feature activation+3.65e+00

Pos logit contributions
 playing+0.019
's+0.019
 doing+0.018
 that+0.017
 wearing+0.017
Neg logit contributions
 reins-0.034
Hours-0.033
 redd-0.032
Lex-0.032
ifi-0.032
Token hugged
Token ID41130
Feature activation+3.90e-01

Pos logit contributions
 playing+0.107
's+0.105
 doing+0.099
 that+0.098
 wearing+0.096
Neg logit contributions
 reins-0.191
 lowest-0.179
 redd-0.178
Hours-0.176
Lex-0.173
Token and
Token ID290
Feature activation+4.25e-02

Pos logit contributions
's+0.010
 playing+0.010
 that+0.009
 doing+0.009
 wearing+0.009
Neg logit contributions
 reins-0.023
 redd-0.021
Hours-0.021
 lowest-0.021
ifi-0.021
Token cheered
Token ID34767
Feature activation+4.71e+00

Pos logit contributions
's+0.001
 playing+0.001
 Playing+0.001
 doing+0.001
 wearing+0.001
Neg logit contributions
 reins-0.002
 tighter-0.002
 redd-0.002
 lowest-0.002
Hours-0.002
Token.
Token ID13
Feature activation+7.00e-01

Pos logit contributions
e+0.097
 lived+0.096
's+0.092
 playing+0.091
 wearing+0.088
Neg logit contributions
 reins-0.272
 tighter-0.258
rated-0.251
Hours-0.249
 lowest-0.248
Token They
Token ID1119
Feature activation+3.70e+00

Pos logit contributions
's+0.015
 playing+0.015
 that+0.014
 doing+0.013
e+0.013
Neg logit contributions
 reins-0.043
Hours-0.041
 redd-0.041
Lex-0.040
ifi-0.039
Token continued
Token ID3767
Feature activation-3.34e+00

Pos logit contributions
 playing+0.088
's+0.086
 that+0.082
 doing+0.080
 wearing+0.079
Neg logit contributions
 reins-0.216
 redd-0.207
Hours-0.204
 lowest-0.202
Lex-0.199
Token to
Token ID284
Feature activation-9.41e-01

Pos logit contributions
 reins+0.162
Hours+0.149
 redd+0.139
 tighter+0.139
Lex+0.139
Neg logit contributions
's-0.108
 lived-0.098
e-0.093
 that-0.093
 were-0.090
Token play
Token ID711
Feature activation+4.05e+00

Pos logit contributions
 reins+0.040
 lowest+0.036
Lex+0.036
Hours+0.036
 former+0.035
Neg logit contributions
 about-0.035
's-0.034
 of-0.033
 Playing-0.033
 that-0.033
Token go
Token ID467
Feature activation-3.44e+00

Pos logit contributions
's+0.028
 that+0.026
 playing+0.025
,+0.024
 about+0.022
Neg logit contributions
 reins-0.057
 redd-0.054
Hours-0.053
ifi-0.052
igi-0.052
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
 reins+0.227
 redd+0.215
Hours+0.214
 lowest+0.209
rated+0.208
Neg logit contributions
's-0.066
 that-0.064
 playing-0.063
,-0.054
 doing-0.053
Token Tim
Token ID5045
Feature activation+4.16e-01

Pos logit contributions
's+0.041
 playing+0.039
 that+0.039
 doing+0.035
 about+0.035
Neg logit contributions
 reins-0.095
 redd-0.089
Hours-0.088
 lowest-0.087
rated-0.086
Token.
Token ID13
Feature activation+8.52e-01

Pos logit contributions
 playing+0.014
's+0.014
 that+0.014
 wearing+0.013
 about+0.013
Neg logit contributions
 reins-0.020
 redd-0.018
rated-0.018
 lowest-0.018
Hours-0.018
Token Your
Token ID3406
Feature activation-2.45e+00

Pos logit contributions
's+0.024
 playing+0.023
 that+0.023
 doing+0.022
 about+0.021
Neg logit contributions
 reins-0.047
 redd-0.044
Hours-0.043
 lowest-0.042
ifi-0.041
Token mask
Token ID9335
Feature activation+4.94e+00

Pos logit contributions
 reins+0.118
 redd+0.102
 lowest+0.101
Hours+0.098
 Majesty+0.098
Neg logit contributions
's-0.086
 that-0.082
 about-0.081
 playing-0.079
e-0.078
Token is
Token ID318
Feature activation+2.41e+00

Pos logit contributions
 playing+0.097
 wearing+0.086
 that+0.082
's+0.080
 doing+0.079
Neg logit contributions
 reins-0.304
 lowest-0.288
 redd-0.286
Hours-0.283
rated-0.282
Token good
Token ID922
Feature activation-1.19e-01

Pos logit contributions
's+0.083
 that+0.081
 playing+0.081
 doing+0.074
 about+0.074
Neg logit contributions
 reins-0.122
 redd-0.113
Hours-0.112
 lowest-0.110
rated-0.109
Token as
Token ID355
Feature activation-3.84e+00

Pos logit contributions
 reins+0.007
 redd+0.007
 lowest+0.007
Hours+0.007
rated+0.007
Neg logit contributions
's-0.002
 playing-0.002
 that-0.002
e-0.002
 lived-0.002
Token new
Token ID649
Feature activation-2.49e-03

Pos logit contributions
 reins+0.209
 redd+0.194
Hours+0.194
rated+0.190
 lowest+0.189
Neg logit contributions
's-0.117
 playing-0.117
 that-0.117
,-0.105
 about-0.105
Token.
Token ID13
Feature activation+9.10e-01

Pos logit contributions
 reins+0.000
 redd+0.000
Hours+0.000
 lowest+0.000
rated+0.000
Neg logit contributions
 that-0.000
's-0.000
 playing-0.000
,-0.000
 about-0.000
Token see
Token ID766
Feature activation+1.90e-01

Pos logit contributions
's+0.011
 playing+0.011
 that+0.010
 living+0.010
 doing+0.010
Neg logit contributions
 reins-0.015
Hours-0.014
 lowest-0.014
 redd-0.014
rated-0.013
Token the
Token ID262
Feature activation+8.84e-01

Pos logit contributions
's+0.005
 playing+0.005
e+0.005
 doing+0.005
 wearing+0.005
Neg logit contributions
 reins-0.010
 redd-0.009
 lowest-0.009
 weights-0.009
Hours-0.008
Token joy
Token ID8716
Feature activation+2.24e+00

Pos logit contributions
's+0.034
 playing+0.033
 that+0.033
 doing+0.031
 about+0.031
Neg logit contributions
 reins-0.041
 redd-0.038
Hours-0.037
 lowest-0.036
rated-0.036
Token on
Token ID319
Feature activation+2.16e+00

Pos logit contributions
e+0.051
 Playing+0.044
 named+0.041
 playing+0.039
 lived+0.039
Neg logit contributions
 reins-0.120
rated-0.114
 lowest-0.112
 possible-0.111
ge-0.110
Token their
Token ID511
Feature activation+2.45e+00

Pos logit contributions
's+0.054
 playing+0.048
 lived+0.046
 doing+0.045
 living+0.045
Neg logit contributions
 reins-0.122
 lowest-0.114
Hours-0.113
 redd-0.113
 weights-0.110
Token faces
Token ID6698
Feature activation+5.45e+00

Pos logit contributions
 that+0.100
's+0.100
 playing+0.097
e+0.097
 wearing+0.095
Neg logit contributions
 reins-0.102
 lowest-0.091
rated-0.091
Hours-0.089
 redd-0.089
Token.
Token ID13
Feature activation+7.67e-01

Pos logit contributions
e+0.107
 Playing+0.102
 lived+0.101
's+0.096
 playing+0.090
Neg logit contributions
 reins-0.304
 lowest-0.293
Hours-0.281
 redd-0.280
 tighter-0.275
Token They
Token ID1119
Feature activation+3.69e+00

Pos logit contributions
 that+0.024
 doing+0.024
 playing+0.024
's+0.023
 wearing+0.022
Neg logit contributions
 reins-0.036
Hours-0.036
 First-0.035
 redd-0.035
 lowest-0.034
Token laughed
Token ID13818
Feature activation+2.95e+00

Pos logit contributions
's+0.076
 playing+0.074
 that+0.073
 Playing+0.072
 doing+0.069
Neg logit contributions
 reins-0.216
 redd-0.203
Hours-0.202
 lowest-0.200
rated-0.199
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
e+0.071
's+0.060
 doing+0.060
 wearing+0.056
 living+0.056
Neg logit contributions
 reins-0.161
Hours-0.153
 tighter-0.150
 redd-0.145
 lowest-0.143
Token joked
Token ID27582
Feature activation+3.90e+00

Pos logit contributions
 reins+0.009
rated+0.009
ried+0.009
 tighter+0.009
 redd+0.009
Neg logit contributions
's-0.005
 living-0.005
 story-0.005
 about-0.005
 Playing-0.005
Token in
Token ID287
Feature activation+1.83e-01

Pos logit contributions
 reins+0.011
 redd+0.010
Hours+0.010
rated+0.010
 lowest+0.010
Neg logit contributions
's-0.006
 that-0.006
 playing-0.006
 lived-0.005
 about-0.005
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
's+0.004
 that+0.004
 playing+0.004
,+0.003
 doing+0.003
Neg logit contributions
 reins-0.012
 redd-0.011
Hours-0.011
 lowest-0.011
rated-0.011
Token hotel
Token ID7541
Feature activation-7.92e-02

Pos logit contributions
 reins+0.051
 redd+0.047
Hours+0.046
 lowest+0.045
rated+0.045
Neg logit contributions
's-0.040
 playing-0.039
 that-0.039
 doing-0.036
,-0.036
Token!"
Token ID2474
Feature activation-3.24e+00

Pos logit contributions
 reins+0.004
 redd+0.003
Hours+0.003
 lowest+0.003
rated+0.003
Neg logit contributions
's-0.003
 playing-0.003
 that-0.003
 doing-0.003
 about-0.003
Token
Token ID198
Feature activation+4.60e-02

Pos logit contributions
 reins+0.182
 redd+0.168
Hours+0.167
 lowest+0.164
Lex+0.161
Neg logit contributions
 playing-0.084
's-0.081
 doing-0.080
 that-0.080
e-0.076
Token
Token ID198
Feature activation+2.52e-02

Pos logit contributions
's+0.001
 playing+0.001
 that+0.001
 doing+0.001
 about+0.001
Neg logit contributions
 reins-0.003
 redd-0.003
Hours-0.003
 lowest-0.003
rated-0.003
TokenOnce
Token ID7454
Feature activation-1.82e+00

Pos logit contributions
's+0.001
 that+0.001
 playing+0.001
 with+0.001
,+0.001
Neg logit contributions
 reins-0.001
 redd-0.001
 lowest-0.001
rated-0.001
ifi-0.001
Token they
Token ID484
Feature activation+2.49e+00

Pos logit contributions
 reins+0.116
 redd+0.110
Hours+0.110
 lowest+0.108
rated+0.107
Neg logit contributions
's-0.036
 playing-0.035
 that-0.034
 doing-0.030
 wearing-0.030
Token were
Token ID547
Feature activation+3.74e+00

Pos logit contributions
's+0.055
 playing+0.055
 that+0.053
 doing+0.048
 living+0.047
Neg logit contributions
 reins-0.151
Hours-0.143
 redd-0.140
rated-0.140
 lowest-0.139
Token in
Token ID287
Feature activation+1.55e-01

Pos logit contributions
's+0.085
 Friends+0.083
 that+0.079
e+0.075
 sharing+0.074
Neg logit contributions
 reins-0.211
Hours-0.208
rated-0.194
 redd-0.192
Lex-0.191
Token the
Token ID262
Feature activation+6.30e-01

Pos logit contributions
's+0.004
 playing+0.004
e+0.003
 that+0.003
 lived+0.003
Neg logit contributions
 reins-0.009
 redd-0.008
Hours-0.008
 lowest-0.008
ifi-0.008
Token
Token ID198
Feature activation+4.92e-01

Pos logit contributions
's+0.016
 that+0.016
 playing+0.015
,+0.014
 about+0.013
Neg logit contributions
 reins-0.063
rated-0.058
 redd-0.058
Hours-0.058
 lowest-0.057
Token
Token ID198
Feature activation+4.43e-01

Pos logit contributions
's+0.007
 that+0.007
 playing+0.006
 about+0.005
,+0.005
Neg logit contributions
 reins-0.035
 redd-0.034
Hours-0.033
 lowest-0.033
rated-0.033
TokenHe
Token ID1544
Feature activation-1.94e+00

Pos logit contributions
's+0.017
 that+0.016
,+0.015
 was+0.014
 of+0.014
Neg logit contributions
 reins-0.024
 redd-0.021
 lowest-0.021
rated-0.020
 hind-0.020
Token looks
Token ID3073
Feature activation-3.93e-01

Pos logit contributions
 reins+0.096
 redd+0.089
Hours+0.089
igi+0.089
ifi+0.088
Neg logit contributions
's-0.073
 that-0.069
 playing-0.068
,-0.063
 about-0.063
Token at
Token ID379
Feature activation-6.27e-01

Pos logit contributions
 reins+0.027
 redd+0.025
Hours+0.025
rated+0.025
 lowest+0.025
Neg logit contributions
's-0.006
 playing-0.006
 that-0.006
 doing-0.005
e-0.005
Token Lily
Token ID20037
Feature activation+7.53e-01

Pos logit contributions
 reins+0.042
Hours+0.039
 redd+0.039
rated+0.039
 lowest+0.038
Neg logit contributions
 that-0.014
 playing-0.012
's-0.012
,-0.011
 about-0.011
Token's
Token ID338
Feature activation+1.03e+00

Pos logit contributions
 lived+0.018
 Friends+0.014
apped+0.013
 were+0.012
 Playing+0.012
Neg logit contributions
 tighter-0.042
ge-0.040
 reins-0.039
 stool-0.038
 lowest-0.038
Token paper
Token ID3348
Feature activation+1.56e+00

Pos logit contributions
 lived+0.040
's+0.040
 that+0.039
 playing+0.039
 about+0.039
Neg logit contributions
 reins-0.046
 redd-0.040
 lowest-0.040
 stool-0.038
Hours-0.038
Token.
Token ID13
Feature activation+7.49e-01

Pos logit contributions
 playing+0.031
 lived+0.030
 Playing+0.027
's+0.026
 doing+0.026
Neg logit contributions
 reins-0.093
 lowest-0.088
 tighter-0.088
 redd-0.087
 stool-0.087
Token She
Token ID1375
Feature activation-1.62e+00

Pos logit contributions
's+0.021
 that+0.020
 playing+0.020
 about+0.018
,+0.018
Neg logit contributions
 reins-0.043
 redd-0.040
Hours-0.040
 lowest-0.039
rated-0.039
Token has
Token ID468
Feature activation+3.32e+00

Pos logit contributions
 reins+0.079
 redd+0.074
Hours+0.072
 lowest+0.072
rated+0.070
Neg logit contributions
 playing-0.054
 that-0.053
's-0.052
 doing-0.049
 wearing-0.049
Token;
Token ID26
Feature activation+4.56e-01

Pos logit contributions
 video+0.077
 playing+0.066
 singing+0.065
's+0.063
 sharing+0.061
Neg logit contributions
 reins-0.182
Hours-0.175
 tighter-0.168
 lowest-0.165
 weights-0.165
Token it
Token ID340
Feature activation-1.35e+00

Pos logit contributions
's+0.013
 playing+0.012
e+0.012
 that+0.012
 lived+0.011
Neg logit contributions
 reins-0.024
 redd-0.023
Hours-0.022
 lowest-0.022
ifi-0.022
Token is
Token ID318
Feature activation+2.43e+00

Pos logit contributions
rated+0.083
 redd+0.082
 reins+0.080
Hours+0.080
 lowest+0.080
Neg logit contributions
 playing-0.021
 wearing-0.019
e-0.017
 that-0.016
 sharing-0.015
Token never
Token ID1239
Feature activation-4.87e+00

Pos logit contributions
 that+0.080
's+0.080
 playing+0.077
 about+0.073
 doing+0.072
Neg logit contributions
 reins-0.132
 redd-0.120
Hours-0.119
 lowest-0.118
ifi-0.116
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
 reins+0.271
 redd+0.258
Hours+0.253
rated+0.251
 lowest+0.250
Neg logit contributions
's-0.132
 playing-0.129
 that-0.128
e-0.116
 wearing-0.114
Token good
Token ID922
Feature activation-1.89e-01

Pos logit contributions
 reins+0.060
rated+0.054
Hours+0.053
ried+0.053
 redd+0.053
Neg logit contributions
's-0.038
 that-0.038
 playing-0.037
,-0.036
 living-0.036
Token idea
Token ID2126
Feature activation+2.29e+00

Pos logit contributions
 reins+0.009
 redd+0.008
Hours+0.008
 lowest+0.008
rated+0.008
Neg logit contributions
's-0.007
 playing-0.007
 that-0.007
 doing-0.007
 about-0.007
Token to
Token ID284
Feature activation-9.67e-01

Pos logit contributions
's+0.052
 playing+0.051
 lived+0.050
e+0.047
 Playing+0.045
Neg logit contributions
 reins-0.125
 redd-0.115
 lowest-0.114
Hours-0.114
ifi-0.113
Token take
Token ID1011
Feature activation-2.10e+00

Pos logit contributions
 reins+0.048
 redd+0.045
ifi+0.044
 lowest+0.044
Hours+0.044
Neg logit contributions
's-0.031
 playing-0.031
 that-0.029
 living-0.028
 wearing-0.027
Token something
Token ID1223
Feature activation-9.59e-01

Pos logit contributions
 reins+0.094
Hours+0.086
 redd+0.085
 lowest+0.084
Lex+0.082
Neg logit contributions
's-0.077
 playing-0.077
 that-0.074
 wearing-0.073
 doing-0.072
Token that
Token ID326
Feature activation+4.28e+00

Pos logit contributions
 reins+0.055
 redd+0.052
Hours+0.052
 lowest+0.051
rated+0.050
Neg logit contributions
's-0.025
 playing-0.024
 that-0.023
 doing-0.021
 wearing-0.021
Token march
Token ID9960
Feature activation+2.26e+00

Pos logit contributions
's+0.028
 that+0.028
 about+0.027
 Playing+0.027
 lived+0.026
Neg logit contributions
 reins-0.031
 redd-0.029
 lowest-0.028
rated-0.026
 weights-0.025
Token so
Token ID523
Feature activation-2.72e+00

Pos logit contributions
 lived+0.042
's+0.040
 Playing+0.040
 playing+0.040
e+0.039
Neg logit contributions
 reins-0.130
 tighter-0.122
 lowest-0.121
rated-0.120
Hours-0.119
Token much
Token ID881
Feature activation-5.10e+00

Pos logit contributions
 reins+0.111
Hours+0.111
 redd+0.104
 lowest+0.103
ifi+0.100
Neg logit contributions
's-0.113
 playing-0.108
 doing-0.104
 living-0.103
 wearing-0.102
Token.
Token ID13
Feature activation+4.58e-01

Pos logit contributions
 reins+0.294
Hours+0.292
 tighter+0.290
 weights+0.280
 lowest+0.278
Neg logit contributions
's-0.098
 lived-0.097
e-0.095
 playing-0.084
 Playing-0.083
Token He
Token ID679
Feature activation-8.66e-01

Pos logit contributions
's+0.012
 playing+0.012
 that+0.012
 about+0.011
,+0.011
Neg logit contributions
 reins-0.026
 redd-0.025
Hours-0.025
rated-0.024
 lowest-0.024
Token laughed
Token ID13818
Feature activation+2.72e+00

Pos logit contributions
 reins+0.048
Hours+0.046
 redd+0.044
rated+0.044
ried+0.044
Neg logit contributions
 playing-0.021
 that-0.020
 doing-0.020
's-0.019
 sharing-0.019
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
e+0.056
's+0.052
 doing+0.050
 playing+0.049
 lived+0.048
Neg logit contributions
 reins-0.160
Hours-0.151
 tighter-0.148
 redd-0.147
 lowest-0.144
Token smiled
Token ID13541
Feature activation+4.84e-01

Pos logit contributions
 reins+0.017
Hours+0.016
 redd+0.016
ried+0.016
rated+0.016
Neg logit contributions
's-0.010
e-0.010
 story-0.010
 living-0.009
 of-0.009
Token and
Token ID290
Feature activation-3.13e-01

Pos logit contributions
's+0.011
e+0.011
 playing+0.010
 doing+0.010
 lived+0.010
Neg logit contributions
 reins-0.028
 redd-0.027
Hours-0.027
 lowest-0.026
 tighter-0.026
Token felt
Token ID2936
Feature activation+7.13e-01

Pos logit contributions
 reins+0.015
 redd+0.014
Hours+0.014
 tighter+0.014
rated+0.014
Neg logit contributions
's-0.008
e-0.008
 story-0.008
 living-0.008
 Playing-0.008
Token the
Token ID262
Feature activation+7.40e-01

Pos logit contributions
's+0.021
 playing+0.020
 doing+0.020
 living+0.019
 story+0.019
Neg logit contributions
 redd-0.034
Hours-0.033
 reins-0.033
 tighter-0.031
 lowest-0.031
Token his
Token ID465
Feature activation-1.24e+00

Pos logit contributions
's+0.007
 playing+0.007
 lived+0.007
 that+0.007
 about+0.007
Neg logit contributions
 reins-0.013
 redd-0.012
 lowest-0.012
Hours-0.011
Lex-0.011
Token mouth
Token ID5422
Feature activation+1.94e+00

Pos logit contributions
 reins+0.052
 hind+0.040
 lowest+0.040
 redd+0.038
 stool+0.038
Neg logit contributions
 about-0.056
 sharing-0.055
 doing-0.052
 that-0.052
 of-0.051
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
 playing+0.036
's+0.032
 doing+0.032
 lived+0.031
 that+0.030
Neg logit contributions
 reins-0.120
 redd-0.113
 lowest-0.112
rated-0.112
 tighter-0.110
Token The
Token ID383
Feature activation+6.09e-01

Pos logit contributions
 playing+0.018
's+0.017
 that+0.017
 doing+0.017
 living+0.016
Neg logit contributions
 reins-0.038
 redd-0.036
 lowest-0.035
Hours-0.035
rated-0.034
Token man
Token ID582
Feature activation+3.21e+00

Pos logit contributions
e+0.028
 playing+0.027
 lived+0.027
's+0.026
 about+0.026
Neg logit contributions
 reins-0.024
 redd-0.020
 lowest-0.018
rated-0.017
 stool-0.017
Token is
Token ID318
Feature activation+2.39e+00

Pos logit contributions
 playing+0.098
 that+0.093
 doing+0.090
 wearing+0.089
 living+0.088
Neg logit contributions
 reins-0.151
 redd-0.140
rated-0.136
 lowest-0.136
Lex-0.134
Token wearing
Token ID5762
Feature activation+6.32e+00

Pos logit contributions
 lived+0.080
's+0.075
 were+0.073
 Playing+0.071
 Friends+0.070
Neg logit contributions
 reins-0.109
 redd-0.106
Lex-0.101
 seventy-0.098
 ripe-0.096
Token a
Token ID257
Feature activation-9.72e-01

Pos logit contributions
's+0.187
 playing+0.177
 that+0.171
 doing+0.170
 lived+0.167
Neg logit contributions
 reins-0.335
 redd-0.319
 lowest-0.299
Hours-0.292
 tighter-0.287
Token blue
Token ID4171
Feature activation+5.11e-01

Pos logit contributions
 reins+0.043
 redd+0.039
Lex+0.035
 lowest+0.035
Hours+0.035
Neg logit contributions
's-0.038
 playing-0.038
 that-0.038
 about-0.037
 lived-0.037
Token shirt
Token ID10147
Feature activation+2.84e+00

Pos logit contributions
 about+0.019
 playing+0.019
 lived+0.019
 doing+0.019
 that+0.019
Neg logit contributions
 reins-0.023
 redd-0.019
 lowest-0.018
 saddle-0.018
 boost-0.018
Token and
Token ID290
Feature activation+4.26e-02

Pos logit contributions
's+0.045
 playing+0.045
 Playing+0.044
 lived+0.044
e+0.044
Neg logit contributions
 reins-0.169
 redd-0.155
 lowest-0.153
 stool-0.151
 tighter-0.150
Token,
Token ID11
Feature activation+1.53e+00

Pos logit contributions
 Playing+0.087
's+0.086
e+0.077
 Friends+0.075
 playing+0.074
Neg logit contributions
 reins-0.318
rated-0.302
 stool-0.294
Hours-0.287
 lowest-0.287
Token "
Token ID366
Feature activation-3.85e+00

Pos logit contributions
's+0.043
e+0.040
 playing+0.040
 that+0.038
 lived+0.038
Neg logit contributions
 reins-0.082
 redd-0.079
Hours-0.077
 lowest-0.075
Lex-0.074
TokenNow
Token ID3844
Feature activation-8.68e-01

Pos logit contributions
 reins+0.176
rated+0.160
ifi+0.160
 hind+0.159
 lowest+0.159
Neg logit contributions
's-0.151
 that-0.142
 playing-0.138
e-0.137
 of-0.137
Token you
Token ID345
Feature activation+1.19e+00

Pos logit contributions
 reins+0.049
 redd+0.046
Hours+0.045
 lowest+0.045
rated+0.044
Neg logit contributions
's-0.022
 playing-0.022
 that-0.021
 wearing-0.020
 doing-0.020
Token can
Token ID460
Feature activation-3.84e-01

Pos logit contributions
's+0.033
 playing+0.032
 that+0.032
 doing+0.028
 about+0.028
Neg logit contributions
 reins-0.067
 redd-0.063
Hours-0.063
 lowest-0.061
rated-0.061
Token measure
Token ID3953
Feature activation-3.04e+00

Pos logit contributions
 reins+0.020
 redd+0.019
Hours+0.019
rated+0.018
ifi+0.018
Neg logit contributions
's-0.013
 that-0.013
 playing-0.012
,-0.012
 doing-0.011
Token all
Token ID477
Feature activation-3.67e+00

Pos logit contributions
 reins+0.191
rated+0.180
Hours+0.179
 redd+0.179
ifi+0.174
Neg logit contributions
 that-0.078
's-0.075
 playing-0.072
,-0.070
 about-0.065
Token kinds
Token ID6982
Feature activation+5.50e-01

Pos logit contributions
 reins+0.229
 redd+0.219
Hours+0.215
 lowest+0.214
rated+0.214
Neg logit contributions
's-0.072
 playing-0.071
 that-0.067
 wearing-0.060
 lived-0.059
Token of
Token ID286
Feature activation+3.07e+00

Pos logit contributions
's+0.006
 playing+0.005
 that+0.004
 doing+0.003
 lived+0.003
Neg logit contributions
 reins-0.040
 redd-0.038
Hours-0.038
 lowest-0.038
ifi-0.037
Token things
Token ID1243
Feature activation+5.06e-01

Pos logit contributions
's+0.113
 playing+0.111
 that+0.111
,+0.103
 doing+0.102
Neg logit contributions
 reins-0.149
 redd-0.138
Hours-0.138
rated-0.134
ifi-0.133
Token!"
Token ID2474
Feature activation-3.02e+00

Pos logit contributions
's+0.013
 that+0.013
 playing+0.013
,+0.012
 doing+0.011
Neg logit contributions
 reins-0.029
 redd-0.028
Hours-0.028
 lowest-0.027
rated-0.027
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
's+0.021
 lived+0.018
 Playing+0.018
 Friends+0.018
opot+0.016
Neg logit contributions
 reins-0.043
 ten-0.042
 lowest-0.041
Hours-0.040
 weights-0.040
Token laughing
Token ID14376
Feature activation+3.80e+00

Pos logit contributions
 lived+0.035
's+0.034
e+0.033
 that+0.033
 Playing+0.033
Neg logit contributions
 reins-0.073
 redd-0.067
Hours-0.066
 lowest-0.065
 tighter-0.064
Token and
Token ID290
Feature activation-3.70e-01

Pos logit contributions
's+0.077
e+0.075
 lived+0.070
 doing+0.069
 Playing+0.069
Neg logit contributions
 reins-0.212
Hours-0.198
 lowest-0.195
 tighter-0.194
 redd-0.192
Token tw
Token ID665
Feature activation-1.85e-01

Pos logit contributions
 reins+0.018
ried+0.017
shadow+0.017
 redd+0.017
 tighter+0.017
Neg logit contributions
 story-0.008
's-0.008
 of-0.008
 about-0.008
 Friends-0.008
Tokenirling
Token ID24297
Feature activation+1.88e+00

Pos logit contributions
 reins+0.007
Hours+0.007
igi+0.007
ifi+0.007
Lex+0.006
Neg logit contributions
's-0.008
 playing-0.008
 that-0.008
 living-0.008
 about-0.008
Token all
Token ID477
Feature activation-3.77e+00

Pos logit contributions
's+0.045
 doing+0.040
 playing+0.039
 Playing+0.038
 living+0.037
Neg logit contributions
 reins-0.104
Hours-0.098
 redd-0.098
 lowest-0.097
 tighter-0.097
Token day
Token ID1110
Feature activation+3.61e+00

Pos logit contributions
rated+0.160
 redd+0.154
 tighter+0.152
 reins+0.150
 steep+0.149
Neg logit contributions
 writing-0.112
's-0.109
 Friends-0.107
 lived-0.105
 acting-0.104
Token with
Token ID351
Feature activation+2.53e+00

Pos logit contributions
 lived+0.161
 Playing+0.154
 Friends+0.147
's+0.145
e+0.143
Neg logit contributions
 reins-0.124
 lowest-0.119
Hours-0.117
 redd-0.112
 stool-0.111
Token his
Token ID465
Feature activation-1.31e+00

Pos logit contributions
's+0.064
 playing+0.059
 that+0.057
 doing+0.055
 wearing+0.052
Neg logit contributions
 reins-0.151
 redd-0.141
Hours-0.139
 lowest-0.138
ifi-0.135
Token new
Token ID649
Feature activation-1.54e-01

Pos logit contributions
 reins+0.065
 lowest+0.058
Hours+0.058
 redd+0.058
Lex+0.055
Neg logit contributions
's-0.046
 that-0.044
 playing-0.044
 about-0.043
 doing-0.041
Token cloud
Token ID6279
Feature activation+1.23e+00

Pos logit contributions
 reins+0.008
 redd+0.007
Hours+0.007
 lowest+0.007
rated+0.007
Neg logit contributions
's-0.006
 that-0.005
 playing-0.005
 doing-0.005
 about-0.005
Token so
Token ID523
Feature activation-2.64e+00

Pos logit contributions
's+0.126
 playing+0.118
e+0.115
 that+0.113
 lived+0.109
Neg logit contributions
 reins-0.177
 redd-0.176
Hours-0.170
 lowest-0.165
 tighter-0.158
Token lovely
Token ID14081
Feature activation+2.72e+00

Pos logit contributions
 redd+0.112
 reins+0.111
Hours+0.108
 lowest+0.104
rated+0.100
Neg logit contributions
's-0.104
 playing-0.101
e-0.096
 that-0.096
 doing-0.093
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
e+0.058
's+0.054
 lived+0.053
 playing+0.048
 Playing+0.043
Neg logit contributions
 reins-0.149
 redd-0.147
 lowest-0.142
 weights-0.141
 tighter-0.139
Token
Token ID198
Feature activation+2.84e-01

Pos logit contributions
's+0.019
 playing+0.018
 that+0.018
 doing+0.017
 about+0.017
Neg logit contributions
 reins-0.036
 redd-0.033
Hours-0.033
 lowest-0.032
rated-0.032
Token
Token ID198
Feature activation+2.79e-01

Pos logit contributions
's+0.006
 playing+0.005
 that+0.005
 doing+0.005
 wearing+0.004
Neg logit contributions
 reins-0.018
Hours-0.017
Lex-0.017
 redd-0.017
 lowest-0.016
TokenJohn
Token ID7554
Feature activation-1.98e+00

Pos logit contributions
's+0.012
 that+0.011
 playing+0.011
 about+0.011
 doing+0.010
Neg logit contributions
 reins-0.013
 redd-0.012
rated-0.011
 lowest-0.011
ifi-0.011
Token was
Token ID373
Feature activation+1.80e+00

Pos logit contributions
 reins+0.110
Hours+0.107
 tighter+0.103
 redd+0.103
Lex+0.102
Neg logit contributions
 playing-0.044
 doing-0.042
 that-0.040
 living-0.037
 wearing-0.037
Token very
Token ID845
Feature activation-2.26e+00

Pos logit contributions
's+0.046
 playing+0.042
 that+0.041
 lived+0.041
 Friends+0.040
Neg logit contributions
 reins-0.096
Hours-0.094
 redd-0.094
Lex-0.089
 lowest-0.088
Token happy
Token ID3772
Feature activation+2.34e+00

Pos logit contributions
 reins+0.119
 redd+0.116
Hours+0.113
 lowest+0.110
rated+0.109
Neg logit contributions
's-0.068
 playing-0.065
 that-0.064
e-0.059
 living-0.058
Token that
Token ID326
Feature activation+4.36e+00

Pos logit contributions
e+0.068
 lived+0.060
's+0.057
 Playing+0.050
opot+0.048
Neg logit contributions
 reins-0.104
 tighter-0.103
Hours-0.099
 stool-0.098
 weights-0.098
Token he
Token ID339
Feature activation-7.12e-01

Pos logit contributions
's+0.121
e+0.120
 playing+0.116
 that+0.112
 Playing+0.107
Neg logit contributions
 reins-0.216
 redd-0.213
Hours-0.207
 lowest-0.205
rated-0.202
Token opened
Token ID4721
Feature activation-4.39e+00

Pos logit contributions
 reins+0.138
Hours+0.131
Lex+0.127
 lowest+0.125
 tighter+0.122
Neg logit contributions
 sharing-0.074
 playing-0.073
 that-0.073
 doing-0.071
 about-0.071
Token her
Token ID607
Feature activation-2.09e+00

Pos logit contributions
 reins+0.297
Hours+0.283
 redd+0.281
rated+0.281
 lowest+0.274
Neg logit contributions
 that-0.083
's-0.080
 playing-0.078
,-0.072
 about-0.064
Token eyes
Token ID2951
Feature activation+1.66e+00

Pos logit contributions
 reins+0.110
 redd+0.101
 lowest+0.099
Hours+0.099
Lex+0.095
Neg logit contributions
's-0.064
 that-0.064
 playing-0.064
 doing-0.060
e-0.059
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
 playing+0.022
e+0.021
 lived+0.019
 sharing+0.019
 doing+0.018
Neg logit contributions
 reins-0.112
 redd-0.108
 lowest-0.107
Hours-0.106
 tighter-0.104
Token she
Token ID673
Feature activation-1.73e+00

Pos logit contributions
's+0.043
 playing+0.041
 that+0.040
 doing+0.038
e+0.037
Neg logit contributions
 reins-0.074
 redd-0.070
Hours-0.069
 lowest-0.068
ifi-0.067
Token started
Token ID2067
Feature activation-3.88e+00

Pos logit contributions
 reins+0.100
 redd+0.094
Hours+0.094
 lowest+0.092
rated+0.091
Neg logit contributions
's-0.044
 that-0.044
 playing-0.044
 doing-0.039
 about-0.039
Token to
Token ID284
Feature activation-9.30e-01

Pos logit contributions
 reins+0.198
Hours+0.177
 redd+0.175
 lowest+0.172
Lex+0.171
Neg logit contributions
 lived-0.116
's-0.115
e-0.111
 that-0.110
 Friends-0.108
Token display
Token ID3359
Feature activation+1.69e+00

Pos logit contributions
 reins+0.046
 redd+0.043
Hours+0.042
 lowest+0.041
rated+0.041
Neg logit contributions
's-0.033
 playing-0.032
 that-0.032
 doing-0.029
 about-0.029
Token serious
Token ID2726
Feature activation+1.47e+00

Pos logit contributions
's+0.049
 playing+0.048
 that+0.047
 doing+0.043
 wearing+0.043
Neg logit contributions
 reins-0.093
 redd-0.088
Hours-0.086
 lowest-0.085
rated-0.084
Token worry
Token ID5490
Feature activation+4.00e-01

Pos logit contributions
's+0.049
 lived+0.048
e+0.048
 that+0.048
 playing+0.047
Neg logit contributions
 reins-0.067
rated-0.061
 weights-0.061
 lowest-0.060
 redd-0.060
Token as
Token ID355
Feature activation-4.05e+00

Pos logit contributions
's+0.007
 playing+0.007
e+0.007
 doing+0.006
 lived+0.006
Neg logit contributions
 reins-0.025
rated-0.024
 lowest-0.024
Hours-0.023
 redd-0.023
Token shouted
Token ID17293
Feature activation+2.02e+00

Pos logit contributions
 playing+0.118
's+0.116
 that+0.113
 living+0.110
 doing+0.109
Neg logit contributions
 reins-0.198
 lowest-0.187
 redd-0.185
Hours-0.184
rated-0.181
Token at
Token ID379
Feature activation-8.49e-01

Pos logit contributions
e+0.043
 lived+0.042
 doing+0.040
 wearing+0.037
 living+0.034
Neg logit contributions
 reins-0.113
Hours-0.107
 911-0.106
 tighter-0.104
 weights-0.100
Token Ben
Token ID3932
Feature activation+2.34e+00

Pos logit contributions
 reins+0.053
 redd+0.050
Hours+0.049
 lowest+0.049
ifi+0.048
Neg logit contributions
's-0.019
 playing-0.017
 that-0.017
 doing-0.016
 about-0.015
Token.
Token ID13
Feature activation+7.53e-01

Pos logit contributions
 lived+0.063
e+0.055
 Playing+0.051
 playing+0.051
 doing+0.050
Neg logit contributions
 reins-0.118
 tighter-0.113
ge-0.107
ried-0.105
ifi-0.103
Token Ben
Token ID3932
Feature activation+2.43e+00

Pos logit contributions
's+0.020
 playing+0.019
 that+0.019
 doing+0.018
e+0.018
Neg logit contributions
 reins-0.042
Hours-0.039
 redd-0.039
 lowest-0.038
Lex-0.038
Token ran
Token ID4966
Feature activation-4.10e+00

Pos logit contributions
 playing+0.074
 Playing+0.073
 sharing+0.072
 doing+0.071
 about+0.070
Neg logit contributions
 reins-0.111
 lowest-0.103
Lex-0.102
Hours-0.101
 tighter-0.101
Token away
Token ID1497
Feature activation-7.11e-01

Pos logit contributions
 reins+0.228
 redd+0.218
Hours+0.211
 lowest+0.209
rated+0.208
Neg logit contributions
's-0.124
 that-0.123
 playing-0.121
,-0.114
 about-0.110
Token.
Token ID13
Feature activation+7.93e-01

Pos logit contributions
 reins+0.049
Hours+0.046
 redd+0.046
 lowest+0.045
rated+0.045
Neg logit contributions
's-0.011
 playing-0.011
 that-0.010
 doing-0.009
 about-0.008
Token He
Token ID679
Feature activation-6.63e-01

Pos logit contributions
's+0.023
 playing+0.022
 doing+0.021
 that+0.021
e+0.021
Neg logit contributions
 reins-0.042
Hours-0.041
 redd-0.040
 lowest-0.039
ifi-0.039
Token did
Token ID750
Feature activation+4.68e+00

Pos logit contributions
 reins+0.034
 redd+0.032
Hours+0.032
 lowest+0.032
ifi+0.031
Neg logit contributions
 playing-0.019
's-0.019
 doing-0.018
 that-0.018
 sharing-0.018
Token not
Token ID407
Feature activation-1.27e+00

Pos logit contributions
's+0.139
 playing+0.135
 that+0.132
 doing+0.122
 wearing+0.121
Neg logit contributions
 reins-0.253
Hours-0.238
 redd-0.236
 lowest-0.231
rated-0.228
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
 reins+0.117
 redd+0.109
Hours+0.108
 lowest+0.106
rated+0.105
Neg logit contributions
's-0.066
 playing-0.064
 that-0.064
 doing-0.058
 about-0.057
Token look
Token ID804
Feature activation+6.26e-01

Pos logit contributions
 reins+0.050
 redd+0.046
Hours+0.045
 lowest+0.044
ifi+0.044
Neg logit contributions
's-0.039
 playing-0.038
 that-0.038
 doing-0.035
 about-0.035
Token for
Token ID329
Feature activation+7.30e-01

Pos logit contributions
 that+0.014
's+0.013
,+0.013
 playing+0.012
 about+0.012
Neg logit contributions
 reins-0.041
 redd-0.039
Hours-0.039
ifi-0.038
 lowest-0.038
Token a
Token ID257
Feature activation-1.09e+00

Pos logit contributions
 that+0.020
's+0.019
 playing+0.019
,+0.018
 wearing+0.016
Neg logit contributions
 reins-0.044
rated-0.042
Hours-0.041
 redd-0.040
 lowest-0.039
Token bigger
Token ID5749
Feature activation-4.74e+00

Pos logit contributions
Hours+0.048
 reins+0.048
rated+0.046
ried+0.044
 lowest+0.044
Neg logit contributions
's-0.044
,-0.044
 that-0.044
 playing-0.042
 in-0.039
Token one
Token ID530
Feature activation-7.62e+00

Pos logit contributions
 reins+0.233
Hours+0.231
 lowest+0.211
rated+0.208
Half+0.206
Neg logit contributions
 that-0.180
's-0.177
,-0.177
 playing-0.166
 with-0.159
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
 reins+0.489
 redd+0.464
Hours+0.457
 lowest+0.453
rated+0.449
Neg logit contributions
's-0.149
 playing-0.146
 that-0.140
 doing-0.123
 about-0.121
Token so
Token ID523
Feature activation-2.69e+00

Pos logit contributions
's+0.046
 that+0.045
 playing+0.044
,+0.041
 in+0.040
Neg logit contributions
 reins-0.076
Hours-0.072
igi-0.072
rated-0.072
ifi-0.071
Token she
Token ID673
Feature activation-1.71e+00

Pos logit contributions
 reins+0.165
 redd+0.155
Hours+0.153
rated+0.151
 lowest+0.150
Neg logit contributions
 that-0.072
's-0.069
 playing-0.065
,-0.062
 about-0.061
Token did
Token ID750
Feature activation+4.79e+00

Pos logit contributions
 reins+0.101
 redd+0.094
 lowest+0.093
Hours+0.092
rated+0.092
Neg logit contributions
's-0.050
 that-0.047
 playing-0.043
,-0.042
 in-0.041
Token her
Token ID607
Feature activation-2.10e+00

Pos logit contributions
's+0.071
 playing+0.067
 that+0.066
 doing+0.053
 about+0.053
Neg logit contributions
 reins-0.333
 redd-0.316
Hours-0.315
 lowest-0.309
rated-0.307
Token outside
Token ID2354
Feature activation-1.10e+00

Pos logit contributions
's+0.099
 had+0.092
 lived+0.090
 doing+0.089
 were+0.088
Neg logit contributions
Hours-0.237
 reins-0.237
 redd-0.223
Half-0.220
E-0.219
Token in
Token ID287
Feature activation+4.49e-01

Pos logit contributions
 reins+0.074
 redd+0.072
 lowest+0.071
Hours+0.070
ifi+0.068
Neg logit contributions
's-0.012
 lived-0.010
 that-0.010
 about-0.009
 playing-0.009
Token her
Token ID607
Feature activation-1.80e+00

Pos logit contributions
's+0.007
 playing+0.007
 that+0.007
,+0.005
 doing+0.005
Neg logit contributions
 reins-0.031
 redd-0.029
Hours-0.029
rated-0.029
 lowest-0.029
Token garden
Token ID11376
Feature activation+2.14e+00

Pos logit contributions
 reins+0.088
Hours+0.082
 redd+0.082
 lowest+0.079
ifi+0.078
Neg logit contributions
's-0.062
 that-0.061
 playing-0.061
 doing-0.057
 about-0.057
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
's+0.031
e+0.028
 that+0.027
 playing+0.027
 lived+0.026
Neg logit contributions
 reins-0.137
 redd-0.133
 lowest-0.129
Hours-0.128
ifi-0.126
Token One
Token ID1881
Feature activation-1.10e+01

Pos logit contributions
's+0.038
 playing+0.037
 that+0.036
,+0.034
 about+0.034
Neg logit contributions
 reins-0.037
Hours-0.033
 redd-0.033
rated-0.032
 lowest-0.032
Token day
Token ID1110
Feature activation+3.91e+00

Pos logit contributions
 reins+0.553
 redd+0.539
ifi+0.538
rated+0.537
Hours+0.532
Neg logit contributions
 playing-0.312
's-0.310
 that-0.305
 doing-0.296
 wearing-0.291
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
's+0.043
e+0.037
 doing+0.035
 that+0.034
 playing+0.034
Neg logit contributions
 reins-0.259
Hours-0.251
 redd-0.248
ifi-0.247
 lowest-0.247
Token she
Token ID673
Feature activation-1.45e+00

Pos logit contributions
's+0.053
 playing+0.050
 that+0.050
 doing+0.047
 about+0.047
Neg logit contributions
 reins-0.082
 redd-0.078
Hours-0.077
 lowest-0.076
ifi-0.074
Token found
Token ID1043
Feature activation-7.59e-01

Pos logit contributions
 reins+0.080
 redd+0.077
Hours+0.077
rated+0.075
ifi+0.074
Neg logit contributions
 that-0.038
 playing-0.037
's-0.037
 doing-0.034
 about-0.034
Token a
Token ID257
Feature activation-7.19e-01

Pos logit contributions
 reins+0.049
 redd+0.046
rated+0.046
Hours+0.045
 lowest+0.045
Neg logit contributions
 that-0.016
's-0.016
 playing-0.016
,-0.013
 about-0.013
Token important
Token ID1593
Feature activation-9.64e-01

Pos logit contributions
 reins+0.132
igi+0.109
Lex+0.108
ifi+0.107
Hours+0.107
Neg logit contributions
 that-0.067
,-0.063
's-0.063
 playing-0.062
 about-0.060
Token.
Token ID13
Feature activation+6.90e-01

Pos logit contributions
 reins+0.054
 lowest+0.050
 redd+0.050
 weights+0.050
ifi+0.049
Neg logit contributions
e-0.022
's-0.022
 lived-0.021
 playing-0.019
 living-0.018
Token
Token ID220
Feature activation-2.70e+00

Pos logit contributions
's+0.013
 that+0.012
 playing+0.012
,+0.010
 about+0.010
Neg logit contributions
 reins-0.047
 redd-0.044
Hours-0.044
rated-0.043
 lowest-0.043
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
 reins+0.180
 redd+0.171
Hours+0.171
 lowest+0.167
rated+0.166
Neg logit contributions
's-0.045
 playing-0.043
 that-0.043
 doing-0.036
 wearing-0.035
Token
Token ID198
Feature activation+2.08e-01

Pos logit contributions
's+0.003
 that+0.003
 playing+0.003
,+0.002
 about+0.002
Neg logit contributions
 reins-0.017
 redd-0.016
Hours-0.016
 lowest-0.015
rated-0.015
TokenAs
Token ID1722
Feature activation-8.14e+00

Pos logit contributions
's+0.009
 that+0.009
 playing+0.009
,+0.009
 with+0.009
Neg logit contributions
 reins-0.009
 redd-0.008
 lowest-0.008
Lex-0.008
rated-0.007
Token they
Token ID484
Feature activation+2.58e+00

Pos logit contributions
 reins+0.547
 redd+0.513
Hours+0.506
rated+0.502
 lowest+0.501
Neg logit contributions
 that-0.156
 playing-0.155
's-0.154
,-0.133
 doing-0.127
Token were
Token ID547
Feature activation+3.79e+00

Pos logit contributions
's+0.062
 playing+0.062
 that+0.059
 doing+0.058
 wearing+0.056
Neg logit contributions
 reins-0.146
Hours-0.142
 redd-0.136
rated-0.136
 lowest-0.135
Token looking
Token ID2045
Feature activation+3.06e+00

Pos logit contributions
's+0.100
e+0.088
 lived+0.085
 Friends+0.083
 was+0.079
Neg logit contributions
 reins-0.210
Hours-0.207
 redd-0.189
Lex-0.189
Half-0.188
Token at
Token ID379
Feature activation-8.67e-01

Pos logit contributions
's+0.047
 playing+0.044
 that+0.042
e+0.041
 wearing+0.040
Neg logit contributions
 reins-0.206
 redd-0.197
Hours-0.195
rated-0.192
 lowest-0.191
Token the
Token ID262
Feature activation+7.44e-01

Pos logit contributions
 reins+0.064
Hours+0.061
 redd+0.060
rated+0.059
 lowest+0.059
Neg logit contributions
 that-0.012
 playing-0.011
's-0.011
,-0.008
 about-0.008
Token.
Token ID13
Feature activation+8.24e-01

Pos logit contributions
 playing+0.025
's+0.024
 that+0.023
 doing+0.022
 wearing+0.021
Neg logit contributions
 reins-0.106
Hours-0.099
 redd-0.099
 lowest-0.099
rated-0.097
Token
Token ID198
Feature activation+3.26e-01

Pos logit contributions
's+0.018
 that+0.018
 playing+0.018
,+0.015
 about+0.015
Neg logit contributions
 reins-0.052
 redd-0.049
Hours-0.049
rated-0.048
 lowest-0.048
Token
Token ID198
Feature activation+2.86e-01

Pos logit contributions
's+0.004
 that+0.004
 playing+0.004
,+0.003
 about+0.003
Neg logit contributions
 reins-0.024
 redd-0.023
rated-0.022
Hours-0.022
 lowest-0.022
TokenL
Token ID43
Feature activation+5.32e-01

Pos logit contributions
's+0.010
 that+0.010
 playing+0.009
,+0.009
e+0.009
Neg logit contributions
 reins-0.015
 redd-0.014
rated-0.014
 lowest-0.014
 former-0.013
Tokenily
Token ID813
Feature activation-1.13e+00

Pos logit contributions
's+0.016
 that+0.015
 playing+0.015
 about+0.015
,+0.014
Neg logit contributions
 reins-0.030
 redd-0.028
Hours-0.028
rated-0.027
 lowest-0.027
Token reaches
Token ID12229
Feature activation-8.05e+00

Pos logit contributions
 reins+0.054
 redd+0.049
Hours+0.049
 lowest+0.049
rated+0.048
Neg logit contributions
 playing-0.038
 that-0.038
 wearing-0.036
 doing-0.036
's-0.036
Token the
Token ID262
Feature activation+9.03e-01

Pos logit contributions
 reins+0.504
 redd+0.469
Hours+0.469
 lowest+0.467
ifi+0.458
Neg logit contributions
's-0.172
 playing-0.159
 that-0.159
 lived-0.143
 doing-0.141
Token top
Token ID1353
Feature activation-9.60e+00

Pos logit contributions
's+0.038
 playing+0.038
 that+0.037
 about+0.037
e+0.036
Neg logit contributions
 reins-0.037
 redd-0.034
 lowest-0.033
Hours-0.033
Lex-0.031
Token of
Token ID286
Feature activation+2.96e+00

Pos logit contributions
 reins+0.660
 lowest+0.643
Hours+0.634
 redd+0.633
 tighter+0.617
Neg logit contributions
 playing-0.117
 lived-0.103
's-0.097
 doing-0.092
 living-0.081
Token the
Token ID262
Feature activation+8.39e-01

Pos logit contributions
 that+0.037
 playing+0.036
's+0.031
,+0.026
 wearing+0.025
Neg logit contributions
 reins-0.219
rated-0.213
Hours-0.210
 redd-0.207
igi-0.204
Token hill
Token ID12788
Feature activation-1.74e+00

Pos logit contributions
's+0.036
 that+0.036
 playing+0.035
,+0.033
 living+0.032
Neg logit contributions
 reins-0.036
Hours-0.034
 redd-0.033
rated-0.033
ifi-0.032
Token sl
Token ID1017
Feature activation+1.25e+00

Pos logit contributions
's+0.039
 playing+0.038
 that+0.038
 doing+0.034
,+0.034
Neg logit contributions
 reins-0.061
 redd-0.057
Hours-0.057
 lowest-0.055
rated-0.055
Tokenumber
Token ID4494
Feature activation-3.06e+00

Pos logit contributions
's+0.053
 playing+0.052
 that+0.052
 doing+0.049
 about+0.048
Neg logit contributions
 reins-0.052
Hours-0.049
 redd-0.048
ifi-0.047
 lowest-0.047
Token.
Token ID13
Feature activation+6.74e-01

Pos logit contributions
 reins+0.199
 lowest+0.178
ried+0.178
rated+0.177
 redd+0.176
Neg logit contributions
 playing-0.051
 sharing-0.051
 lived-0.050
 Playing-0.048
 doing-0.046
Token
Token ID198
Feature activation+2.58e-01

Pos logit contributions
 playing+0.015
's+0.015
 that+0.014
 doing+0.014
e+0.013
Neg logit contributions
 reins-0.041
 redd-0.039
Hours-0.038
 lowest-0.038
ifi-0.037
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
 playing+0.005
's+0.005
 that+0.004
 doing+0.004
 wearing+0.004
Neg logit contributions
 reins-0.016
Hours-0.016
 redd-0.015
ifi-0.015
Lex-0.015
TokenWhile
Token ID3633
Feature activation-7.82e+00

Pos logit contributions
's+0.010
 playing+0.010
 that+0.010
 doing+0.009
 about+0.009
Neg logit contributions
 reins-0.010
 redd-0.009
Hours-0.009
 lowest-0.008
rated-0.008
Token the
Token ID262
Feature activation+6.51e-01

Pos logit contributions
Hours+0.396
 reins+0.392
Lex+0.368
Half+0.359
ifi+0.358
Neg logit contributions
's-0.196
 about-0.167
e-0.160
 Friends-0.156
 tale-0.143
Token little
Token ID1310
Feature activation-7.60e-01

Pos logit contributions
's+0.023
 playing+0.023
 that+0.023
 about+0.021
 doing+0.021
Neg logit contributions
 reins-0.032
 redd-0.029
Hours-0.029
 lowest-0.028
rated-0.028
Token boy
Token ID2933
Feature activation+4.32e+00

Pos logit contributions
 reins+0.034
 redd+0.033
 lowest+0.031
Hours+0.031
rated+0.030
Neg logit contributions
's-0.029
 playing-0.028
 that-0.028
 about-0.027
e-0.027
Token was
Token ID373
Feature activation+1.64e+00

Pos logit contributions
 playing+0.087
 that+0.086
 sharing+0.083
 doing+0.083
 about+0.078
Neg logit contributions
 reins-0.255
Hours-0.250
ifi-0.239
 lowest-0.239
rated-0.239
Token sleeping
Token ID11029
Feature activation+2.89e+00

Pos logit contributions
's+0.045
 that+0.043
 lived+0.041
e+0.040
 Friends+0.040
Neg logit contributions
 reins-0.088
Hours-0.083
Lex-0.080
 redd-0.079
 lowest-0.079